TOP ACTIVATIONS
MAX = 0.039

liked to play make - bel ieve . One
red cape and make - bel ieve he had super powers
the foundation of the sand castle with wet sand . It
. " I don 't see him ." Just then
and they played make - bel ieve . Finally , they
" Please keep your seat bel ts on . We will
- up and make - bel ieve . One day ,
" Look , look !" Tom says . "
she bowed , the town spe ople stopped to admire her
It is not a grass ho pper or a cricket .
a world of make - bel ieve . She
he was playing make - bel ieve at home , too
loved to play make - bel ieve . Today she was
was ready for make - bel ieve fun . <|endoftext|> Once
. It was a grass ho pper ! Tom was so
ongo didn 't like wet av oc ados , so he
some . " Look , Tom , fruit !"
it sang , the town spe ople would act nice .
Tom . " Look , Tom . I have
" Look , look , another plane !" Sara

MIN ACTIVATIONS
MIN = -0.037

. The c act i went exploring and
! The c act i were so surprised
!" The c act i were very happy
to do . T iny decided to be brave
were healed . Fel icity was so happy that
was . The w affle was warm and smelled
sleep . T ippy and his new friend
, a twin brother and sister . They were intelligent and
sorry again and thanked her sister . She went to her
away . The w affle breathed a sigh of
and Ben are brother and sister . They are very happy
start with ?" T oya thought for a second
their friends . Fel icity decided to keep the
and Lily were brother and sister . They liked to play
. The w affle was so scared he
help me ?" T oby was excited to serve
and Lily were brother and sister . They lived in a
and Ben are brother and sister . They like to play
in the alley . The t eddy bear turned to Lily
, dad , and little sister . They were a happy

INTERVAL 0.020 - 0.039
CONTAINS 0.202%

that day on , the pilot made sure to dress slowly
floor . " Look , I have a lot
" Wow , look at all the fish !"
mom my , I will be careful ." Tim
ran away as fast as she could . Back

INTERVAL 0.001 - 0.020
CONTAINS 43.021%

But he does not want to look like a baby in
it was great ! He had so much
piece of gum ! Tim my loved gum , so he
me reach the h anger ?" Her brother
. My mom my isn 't feeling well and I want

INTERVAL -0.018 - 0.001
CONTAINS 56.327%

The friends were curious , so they opened the
my told her to be careful because it was a cold
to the park . They had a great time playing on
water . <|endoftext|> Once upon a time there was a little
into a big house to find more toys to play with

INTERVAL -0.037 - -0.018
CONTAINS 0.450%

them some w affles to m unch on . The w
sun started to set , Mitt ens had to go home
eventually fell asleep . The monster came back and this time
was so huge that he could not fit in the seat
face . When Mitt ens saw the picture ,
Token liked
Token ID8288
Feature activation-2.22e-04

Pos logit contributions
ishly+0.008
 Hannah+0.007
 Pupp+0.007
rob+0.007
lor+0.007
Neg logit contributions
 eleven-0.006
 done-0.006
 unstable-0.006
 leaning-0.006
 taller-0.006
Token to
Token ID284
Feature activation-1.36e-02

Pos logit contributions
 Hannah+0.000
 Pupp+0.000
 Woo+0.000
 Cindy+0.000
 Gr+0.000
Neg logit contributions
 unstable-0.000
ror-0.000
cks-0.000
 eleven-0.000
Crash-0.000
Token play
Token ID711
Feature activation-2.65e-03

Pos logit contributions
ishly+0.027
 Hannah+0.026
lor+0.026
rob+0.026
 therapist+0.025
Neg logit contributions
 done-0.015
 eleven-0.015
 high-0.014
 taller-0.014
 leaning-0.014
Token make
Token ID787
Feature activation+8.44e-03

Pos logit contributions
ishly+0.006
 Hannah+0.006
 therapist+0.006
lor+0.006
 Hazel+0.006
Neg logit contributions
 eleven-0.002
 high-0.002
 taller-0.002
 higher-0.001
 near-0.001
Token-
Token ID12
Feature activation+9.47e-03

Pos logit contributions
 eleven+0.011
 taller+0.011
 done+0.010
 unstable+0.010
 higher+0.010
Neg logit contributions
ishly-0.015
 Hannah-0.014
 therapist-0.014
 Pupp-0.014
rob-0.014
Tokenbel
Token ID6667
Feature activation+3.84e-02

Pos logit contributions
 eleven+0.013
 unstable+0.012
 leaning+0.012
 done+0.012
ror+0.012
Neg logit contributions
ishly-0.015
 Hannah-0.015
 Pupp-0.015
 therapist-0.014
 Woo-0.014
Tokenieve
Token ID12311
Feature activation-5.07e-03

Pos logit contributions
 eleven+0.061
 unstable+0.060
ror+0.056
 leaning+0.055
 taller+0.054
Neg logit contributions
 Hannah-0.051
rob-0.050
 Pupp-0.050
ishly-0.050
 trends-0.048
Token.
Token ID13
Feature activation+9.75e-04

Pos logit contributions
 Hannah+0.008
ishly+0.008
 Pupp+0.008
 Holly+0.008
 therapist+0.008
Neg logit contributions
 eleven-0.007
 unstable-0.007
 done-0.007
 taller-0.006
 leaning-0.006
Token
Token ID198
Feature activation-5.59e-03

Pos logit contributions
 eleven+0.002
 unstable+0.001
 taller+0.001
 done+0.001
 twelve+0.001
Neg logit contributions
ishly-0.001
 Hannah-0.001
rob-0.001
 Pupp-0.001
 therapist-0.001
Token
Token ID198
Feature activation-6.99e-03

Pos logit contributions
 Hannah+0.008
ishly+0.008
 Pupp+0.008
 Holly+0.007
 Cindy+0.007
Neg logit contributions
 eleven-0.009
 unstable-0.009
 done-0.008
 taller-0.008
 leaning-0.008
TokenOne
Token ID3198
Feature activation+5.29e-03

Pos logit contributions
ishly+0.011
 Hannah+0.010
 Pupp+0.010
 therapist+0.010
 Cindy+0.010
Neg logit contributions
 eleven-0.011
 unstable-0.010
 done-0.010
 taller-0.010
 leaning-0.010
Token red
Token ID2266
Feature activation-7.62e-03

Pos logit contributions
 eleven+0.000
 done+0.000
 unstable+0.000
 high+0.000
 taller+0.000
Neg logit contributions
ishly-0.000
 Hannah-0.000
 Pupp-0.000
 Holly-0.000
 Cindy-0.000
Token cape
Token ID36023
Feature activation+2.95e-03

Pos logit contributions
 Hannah+0.012
ishly+0.012
 Hazel+0.012
 Dawn+0.012
 Cindy+0.012
Neg logit contributions
 high-0.011
 eleven-0.011
 done-0.010
 six-0.010
 eight-0.010
Token and
Token ID290
Feature activation-3.35e-03

Pos logit contributions
 eleven+0.004
 done+0.003
 unstable+0.003
 taller+0.003
 high+0.003
Neg logit contributions
 Hannah-0.005
ishly-0.005
 Pupp-0.005
 Holly-0.005
 Hazel-0.005
Token make
Token ID787
Feature activation+1.09e-02

Pos logit contributions
ishly+0.005
 Hannah+0.005
 Pupp+0.005
 Holly+0.005
 therapist+0.005
Neg logit contributions
 eleven-0.005
 unstable-0.005
 done-0.005
 taller-0.005
 leaning-0.004
Token-
Token ID12
Feature activation+1.45e-02

Pos logit contributions
 eleven+0.015
 unstable+0.014
ror+0.013
 leaning+0.013
 done+0.013
Neg logit contributions
 Hannah-0.018
 Pupp-0.017
ishly-0.017
 Woo-0.017
 Cindy-0.017
Tokenbel
Token ID6667
Feature activation+3.82e-02

Pos logit contributions
 eleven+0.018
 unstable+0.017
 leaning+0.016
 done+0.016
ror+0.016
Neg logit contributions
 Pupp-0.024
Gr-0.024
 Hannah-0.024
ishly-0.024
 therapist-0.024
Tokenieve
Token ID12311
Feature activation-5.01e-03

Pos logit contributions
 eleven+0.059
 unstable+0.059
 taller+0.054
 leaning+0.054
ror+0.054
Neg logit contributions
 Hannah-0.054
ishly-0.052
 Pupp-0.052
rob-0.050
 therapist-0.050
Token he
Token ID339
Feature activation+3.86e-03

Pos logit contributions
 Hannah+0.009
ishly+0.009
 Pupp+0.008
 Holly+0.008
 therapist+0.008
Neg logit contributions
 eleven-0.006
 unstable-0.006
 done-0.006
 taller-0.006
 leaning-0.006
Token had
Token ID550
Feature activation-8.39e-03

Pos logit contributions
 eleven+0.006
 unstable+0.006
 done+0.006
 taller+0.006
 leaning+0.006
Neg logit contributions
ishly-0.005
 Hannah-0.005
 Pupp-0.005
 Holly-0.005
 therapist-0.005
Token super
Token ID2208
Feature activation+7.00e-03

Pos logit contributions
 Hannah+0.015
ishly+0.015
 Pupp+0.014
 Holly+0.014
 Woo+0.014
Neg logit contributions
 eleven-0.010
 unstable-0.010
 done-0.009
 taller-0.009
 leaning-0.009
Tokenpowers
Token ID30132
Feature activation-1.94e-03

Pos logit contributions
 eleven+0.009
 unstable+0.008
 leaning+0.008
ror+0.008
 taller+0.008
Neg logit contributions
 Hannah-0.012
 Pupp-0.011
ishly-0.011
 trends-0.011
 therapist-0.011
Token the
Token ID262
Feature activation-9.19e-04

Pos logit contributions
 eleven+0.001
 unstable+0.001
 done+0.001
 taller+0.001
 leaning+0.001
Neg logit contributions
ishly-0.003
 Hannah-0.003
 Pupp-0.003
 Holly-0.003
 therapist-0.003
Token foundation
Token ID8489
Feature activation+1.34e-02

Pos logit contributions
 Hannah+0.001
 Ned+0.001
 trends+0.001
 c+0.001
ishly+0.001
Neg logit contributions
 eleven-0.001
ror-0.001
 unstable-0.001
 figuring-0.001
 leaning-0.001
Token of
Token ID286
Feature activation+5.35e-03

Pos logit contributions
 eleven+0.018
 unstable+0.018
 done+0.017
 taller+0.016
ror+0.016
Neg logit contributions
ishly-0.022
 Hannah-0.022
 Pupp-0.021
 Holly-0.021
 therapist-0.021
Token the
Token ID262
Feature activation-3.30e-03

Pos logit contributions
 eleven+0.006
 unstable+0.006
ror+0.006
 done+0.006
 taller+0.006
Neg logit contributions
 Hannah-0.010
ishly-0.010
 Pupp-0.009
 Cindy-0.009
 Holly-0.009
Token sand
Token ID6450
Feature activation+1.07e-02

Pos logit contributions
 Hannah+0.004
 trends+0.004
ishly+0.004
 Ned+0.004
 therapist+0.004
Neg logit contributions
 eleven-0.005
 unstable-0.005
ror-0.005
 leaning-0.004
cks-0.004
Tokencastle
Token ID18676
Feature activation+3.77e-02

Pos logit contributions
 eleven+0.017
 unstable+0.016
Crash+0.015
ror+0.015
 done+0.015
Neg logit contributions
ishly-0.015
 Hannah-0.015
 Holly-0.014
 therapist-0.014
 Pupp-0.014
Token with
Token ID351
Feature activation-2.60e-03

Pos logit contributions
 eleven+0.064
 unstable+0.063
ror+0.059
 done+0.058
Crash+0.058
Neg logit contributions
ishly-0.049
 Hannah-0.048
 Pupp-0.045
 Holly-0.045
 therapist-0.045
Token wet
Token ID9583
Feature activation-7.65e-03

Pos logit contributions
 Hannah+0.004
ishly+0.004
 Pupp+0.004
 Holly+0.003
 Cindy+0.003
Neg logit contributions
 eleven-0.004
 unstable-0.004
 done-0.004
 taller-0.004
ror-0.004
Token sand
Token ID6450
Feature activation+4.88e-03

Pos logit contributions
ishly+0.013
 Hannah+0.013
 Pupp+0.013
 Holly+0.013
 Hazel+0.013
Neg logit contributions
 eleven-0.010
 unstable-0.009
 done-0.009
 taller-0.009
 leaning-0.009
Token.
Token ID13
Feature activation+1.25e-02

Pos logit contributions
 eleven+0.008
 unstable+0.008
 done+0.008
ror+0.007
Crash+0.007
Neg logit contributions
ishly-0.007
 Hannah-0.006
 Holly-0.006
 Pupp-0.006
 therapist-0.006
Token It
Token ID632
Feature activation-9.16e-03

Pos logit contributions
 eleven+0.022
 unstable+0.022
 done+0.021
 taller+0.021
 leaning+0.020
Neg logit contributions
 Hannah-0.016
ishly-0.016
 Pupp-0.015
 Holly-0.014
 Cindy-0.014
Token.
Token ID13
Feature activation+1.11e-02

Pos logit contributions
ishly+0.009
 Hannah+0.009
 Pupp+0.009
rob+0.008
 Woo+0.008
Neg logit contributions
 eleven-0.008
 unstable-0.008
 done-0.008
 taller-0.007
 leaning-0.007
Token "
Token ID366
Feature activation-2.22e-03

Pos logit contributions
 eleven+0.020
 unstable+0.019
 done+0.019
cks+0.018
 taller+0.018
Neg logit contributions
 Hannah-0.014
ishly-0.013
 Pupp-0.013
 Cindy-0.012
 Holly-0.012
TokenI
Token ID40
Feature activation+1.26e-03

Pos logit contributions
ishly+0.003
 Hannah+0.003
 Pupp+0.003
 Holly+0.003
 therapist+0.003
Neg logit contributions
 eleven-0.003
 unstable-0.003
 done-0.003
 taller-0.003
 leaning-0.003
Token don
Token ID836
Feature activation+6.19e-03

Pos logit contributions
 eleven+0.001
 unstable+0.001
 done+0.001
 taller+0.001
ror+0.001
Neg logit contributions
ishly-0.002
 Hannah-0.002
 Pupp-0.002
 therapist-0.002
 Holly-0.002
Token't
Token ID470
Feature activation+1.43e-02

Pos logit contributions
 eleven+0.010
 unstable+0.010
 done+0.009
 taller+0.009
 leaning+0.009
Neg logit contributions
ishly-0.009
 Hannah-0.009
 Pupp-0.008
 Holly-0.008
 therapist-0.008
Token see
Token ID766
Feature activation+3.73e-02

Pos logit contributions
 eleven+0.019
 unstable+0.018
 done+0.017
 taller+0.017
 leaning+0.016
Neg logit contributions
ishly-0.025
 Hannah-0.024
 Pupp-0.024
 therapist-0.023
 Holly-0.023
Token him
Token ID683
Feature activation+4.86e-03

Pos logit contributions
 unstable+0.057
 eleven+0.054
 done+0.054
Crash+0.052
 taller+0.051
Neg logit contributions
 Hannah-0.054
 Pupp-0.051
 Cindy-0.050
 Woo-0.050
 Holly-0.049
Token."
Token ID526
Feature activation+1.42e-03

Pos logit contributions
 unstable+0.008
 eleven+0.008
 done+0.007
 taller+0.007
Crash+0.007
Neg logit contributions
ishly-0.007
 Hannah-0.007
 Pupp-0.007
 Holly-0.006
 Hazel-0.006
Token
Token ID198
Feature activation+3.89e-03

Pos logit contributions
 eleven+0.002
 unstable+0.002
 done+0.002
 taller+0.002
 leaning+0.002
Neg logit contributions
 Hannah-0.002
ishly-0.002
 Pupp-0.002
 Holly-0.002
 Cindy-0.002
TokenJust
Token ID5703
Feature activation+7.82e-03

Pos logit contributions
 eleven+0.007
 unstable+0.007
 done+0.006
 taller+0.006
 leaning+0.006
Neg logit contributions
 Hannah-0.005
ishly-0.005
 Pupp-0.005
 therapist-0.005
 Holly-0.005
Token then
Token ID788
Feature activation-3.58e-03

Pos logit contributions
 unstable+0.012
 eleven+0.011
 done+0.011
 taller+0.010
 leaning+0.010
Neg logit contributions
ishly-0.012
 Hannah-0.011
 Pupp-0.011
 therapist-0.011
 Cindy-0.010
Token and
Token ID290
Feature activation-5.15e-03

Pos logit contributions
 done+0.001
 eleven+0.000
 leaning+0.000
 unstable+0.000
 taller+0.000
Neg logit contributions
Gr-0.001
ishly-0.001
 Hannah-0.001
rob-0.001
 Pupp-0.001
Token they
Token ID484
Feature activation+2.54e-03

Pos logit contributions
ishly+0.009
lor+0.009
Gr+0.008
rob+0.008
 Hannah+0.008
Neg logit contributions
 done-0.007
 eleven-0.007
 unstable-0.006
 leaning-0.006
 taller-0.006
Token played
Token ID2826
Feature activation+2.10e-03

Pos logit contributions
 done+0.003
 eleven+0.003
 unstable+0.003
 high+0.003
 taller+0.003
Neg logit contributions
lor-0.005
 Hannah-0.004
rob-0.004
 Woo-0.004
ishly-0.004
Token make
Token ID787
Feature activation+9.47e-03

Pos logit contributions
 eleven+0.002
 done+0.002
 unstable+0.002
 taller+0.002
 high+0.002
Neg logit contributions
ishly-0.004
 Hannah-0.004
 Pupp-0.004
 therapist-0.004
 Hazel-0.004
Token-
Token ID12
Feature activation+3.80e-03

Pos logit contributions
 eleven+0.014
 done+0.013
 unstable+0.013
 taller+0.013
 higher+0.012
Neg logit contributions
ishly-0.015
 Hannah-0.014
 Pupp-0.014
 therapist-0.014
 Hazel-0.014
Tokenbel
Token ID6667
Feature activation+3.72e-02

Pos logit contributions
 eleven+0.003
 unstable+0.003
 done+0.003
 taller+0.003
 twelve+0.002
Neg logit contributions
ishly-0.009
 Hannah-0.008
 Holly-0.008
 Pupp-0.008
 Hazel-0.008
Tokenieve
Token ID12311
Feature activation-6.61e-03

Pos logit contributions
 eleven+0.057
 unstable+0.056
ror+0.052
 leaning+0.052
 taller+0.052
Neg logit contributions
 Hannah-0.053
ishly-0.052
 Pupp-0.051
rob-0.050
 therapist-0.049
Token.
Token ID13
Feature activation-6.62e-04

Pos logit contributions
ishly+0.012
 Hannah+0.012
 Pupp+0.011
 Cindy+0.011
 Woo+0.011
Neg logit contributions
 eleven-0.008
 unstable-0.008
 done-0.008
 taller-0.007
 leaning-0.007
Token Finally
Token ID9461
Feature activation-3.43e-03

Pos logit contributions
ishly+0.001
 Pupp+0.001
 Hannah+0.001
rob+0.001
 therapist+0.001
Neg logit contributions
 eleven-0.001
 done-0.001
 unstable-0.001
 taller-0.001
 twelve-0.001
Token,
Token ID11
Feature activation+4.02e-03

Pos logit contributions
ishly+0.006
 Pupp+0.005
rob+0.005
 Hannah+0.005
 therapist+0.005
Neg logit contributions
 eleven-0.005
 done-0.005
 unstable-0.004
 taller-0.004
 minutes-0.004
Token they
Token ID484
Feature activation+3.46e-03

Pos logit contributions
 eleven+0.005
 unstable+0.005
 done+0.005
 taller+0.005
ror+0.005
Neg logit contributions
 Hannah-0.007
ishly-0.007
 Holly-0.006
 Woo-0.006
 Pupp-0.006
Token "
Token ID366
Feature activation+1.92e-03

Pos logit contributions
 eleven+0.007
 unstable+0.006
 done+0.006
 taller+0.006
 leaning+0.006
Neg logit contributions
ishly-0.010
 Hannah-0.010
 Pupp-0.009
 Holly-0.009
 therapist-0.009
TokenPlease
Token ID5492
Feature activation-1.02e-03

Pos logit contributions
 eleven+0.003
 unstable+0.003
 done+0.003
 taller+0.003
 leaning+0.003
Neg logit contributions
ishly-0.003
 Hannah-0.003
 Pupp-0.003
 Holly-0.003
 Cindy-0.002
Token keep
Token ID1394
Feature activation+3.90e-03

Pos logit contributions
rob+0.002
ishly+0.002
 Hannah+0.002
 therapist+0.002
 trends+0.002
Neg logit contributions
 eleven-0.001
 done-0.001
ror-0.001
 high-0.001
 taller-0.001
Token your
Token ID534
Feature activation+7.46e-03

Pos logit contributions
 unstable+0.005
 eleven+0.005
 taller+0.004
ror+0.004
 done+0.004
Neg logit contributions
 Hannah-0.007
ishly-0.007
 Pupp-0.007
 Cindy-0.007
 Woo-0.007
Token seat
Token ID5852
Feature activation+1.26e-02

Pos logit contributions
 eleven+0.013
 unstable+0.013
 taller+0.012
 Eleven+0.012
Crash+0.012
Neg logit contributions
ishly-0.009
 Hannah-0.009
 trends-0.009
 puppy-0.009
 Pupp-0.009
Tokenbel
Token ID6667
Feature activation+3.70e-02

Pos logit contributions
 eleven+0.015
 done+0.015
 unstable+0.015
 taller+0.014
 high+0.014
Neg logit contributions
 Hannah-0.023
ishly-0.022
 Pupp-0.022
 Holly-0.022
 Hazel-0.021
Tokents
Token ID912
Feature activation+4.81e-03

Pos logit contributions
 eleven+0.055
 unstable+0.054
 taller+0.051
ror+0.050
 leaning+0.050
Neg logit contributions
ishly-0.055
 Hannah-0.055
 Pupp-0.052
rob-0.052
 therapist-0.052
Token on
Token ID319
Feature activation-2.00e-03

Pos logit contributions
 eleven+0.006
 unstable+0.006
 done+0.006
 taller+0.006
 leaning+0.005
Neg logit contributions
ishly-0.008
 Hannah-0.008
 Pupp-0.008
 Holly-0.008
 therapist-0.008
Token.
Token ID13
Feature activation-3.65e-03

Pos logit contributions
ishly+0.003
 Hannah+0.003
 Pupp+0.003
 Holly+0.003
 Cindy+0.003
Neg logit contributions
 eleven-0.003
 unstable-0.002
 done-0.002
 taller-0.002
ror-0.002
Token We
Token ID775
Feature activation-2.86e-03

Pos logit contributions
rob+0.006
ishly+0.006
 Hannah+0.006
 Holly+0.006
 therapist+0.006
Neg logit contributions
 eleven-0.005
 unstable-0.004
ror-0.004
 taller-0.004
 twelve-0.004
Token will
Token ID481
Feature activation-1.93e-03

Pos logit contributions
ishly+0.005
 Hannah+0.005
 Pupp+0.005
 Holly+0.005
 therapist+0.005
Neg logit contributions
 eleven-0.004
 done-0.004
 unstable-0.004
 taller-0.003
ror-0.003
Token-
Token ID12
Feature activation+2.04e-03

Pos logit contributions
ishly+0.028
 therapist+0.025
 Pupp+0.025
 Hannah+0.025
 Hazel+0.024
Neg logit contributions
 high-0.015
 taller-0.014
 higher-0.013
 done-0.013
 eleven-0.013
Tokenup
Token ID929
Feature activation-6.39e-03

Pos logit contributions
 eleven+0.004
 done+0.004
 unstable+0.004
 taller+0.004
 high+0.004
Neg logit contributions
ishly-0.002
 Hannah-0.002
 Hazel-0.002
 Holly-0.002
 Pupp-0.002
Token and
Token ID290
Feature activation-1.08e-02

Pos logit contributions
ishly+0.014
 Hannah+0.014
 Pupp+0.013
 Hazel+0.013
 Buster+0.013
Neg logit contributions
 done-0.006
 eleven-0.005
 high-0.005
 taller-0.005
 near-0.005
Token make
Token ID787
Feature activation+1.03e-02

Pos logit contributions
ishly+0.022
lor+0.021
 therapist+0.020
rob+0.020
 Pupp+0.019
Neg logit contributions
 done-0.012
 eleven-0.011
 high-0.011
 taller-0.011
 round-0.010
Token-
Token ID12
Feature activation+9.52e-03

Pos logit contributions
 eleven+0.015
 unstable+0.014
 done+0.014
 taller+0.014
 leaning+0.013
Neg logit contributions
ishly-0.017
 Hannah-0.016
 Pupp-0.016
 therapist-0.015
 Hazel-0.015
Tokenbel
Token ID6667
Feature activation+3.62e-02

Pos logit contributions
 eleven+0.016
 unstable+0.016
 done+0.016
 taller+0.015
ror+0.015
Neg logit contributions
ishly-0.013
 Hannah-0.012
 Pupp-0.012
 Holly-0.011
 Hazel-0.011
Tokenieve
Token ID12311
Feature activation-6.80e-03

Pos logit contributions
 eleven+0.054
 unstable+0.054
ror+0.050
 leaning+0.050
 taller+0.049
Neg logit contributions
 Hannah-0.052
rob-0.050
 Pupp-0.050
ishly-0.050
 trends-0.049
Token.
Token ID13
Feature activation+6.28e-03

Pos logit contributions
ishly+0.012
 Hannah+0.012
 Pupp+0.012
 Cindy+0.011
 Holly+0.011
Neg logit contributions
 eleven-0.008
 unstable-0.008
 done-0.008
 taller-0.008
 leaning-0.007
Token One
Token ID1881
Feature activation+2.14e-03

Pos logit contributions
 eleven+0.010
 unstable+0.009
 taller+0.009
 done+0.009
ror+0.009
Neg logit contributions
ishly-0.010
 Hannah-0.009
 therapist-0.009
 Pupp-0.009
rob-0.009
Token day
Token ID1110
Feature activation-4.36e-03

Pos logit contributions
 eleven+0.002
 unstable+0.001
 done+0.001
 taller+0.001
ror+0.001
Neg logit contributions
 Hannah-0.005
ishly-0.005
 Pupp-0.005
 Holly-0.005
 therapist-0.005
Token,
Token ID11
Feature activation+5.96e-03

Pos logit contributions
ishly+0.008
 Hannah+0.007
 therapist+0.007
rob+0.007
lor+0.007
Neg logit contributions
 eleven-0.006
 done-0.006
 unstable-0.005
 leaning-0.005
 high-0.005
Token
Token ID198
Feature activation-1.98e-03

Pos logit contributions
 unstable+0.013
 eleven+0.013
 done+0.013
 taller+0.013
 leaning+0.012
Neg logit contributions
 Hannah-0.009
 Pupp-0.009
 Holly-0.009
ishly-0.009
 Cindy-0.009
Token
Token ID198
Feature activation-1.84e-03

Pos logit contributions
 Hannah+0.003
 Pupp+0.003
ishly+0.003
 Holly+0.003
 Woo+0.003
Neg logit contributions
 eleven-0.003
 unstable-0.003
 done-0.003
 taller-0.003
ror-0.003
Token"
Token ID1
Feature activation+4.85e-04

Pos logit contributions
ishly+0.003
 Hannah+0.003
 Pupp+0.003
 Holly+0.003
 therapist+0.003
Neg logit contributions
 eleven-0.003
 unstable-0.003
 done-0.003
 taller-0.002
 leaning-0.002
TokenLook
Token ID8567
Feature activation+3.02e-02

Pos logit contributions
 eleven+0.001
 unstable+0.001
 done+0.001
 taller+0.001
 leaning+0.001
Neg logit contributions
 Hannah-0.001
ishly-0.001
 Pupp-0.001
 Holly-0.001
 therapist-0.001
Token,
Token ID11
Feature activation+9.69e-03

Pos logit contributions
 unstable+0.059
 leaning+0.053
 eleven+0.052
 taller+0.052
 impossible+0.052
Neg logit contributions
 Hannah-0.035
ishly-0.034
 Cindy-0.032
 Holly-0.032
 Pupp-0.032
Token look
Token ID804
Feature activation+3.60e-02

Pos logit contributions
 unstable+0.011
 eleven+0.011
 leaning+0.011
 done+0.010
 taller+0.010
Neg logit contributions
 Hannah-0.018
 Pupp-0.017
ishly-0.017
 Holly-0.017
 Woo-0.017
Token!"
Token ID2474
Feature activation-1.29e-02

Pos logit contributions
 unstable+0.072
 leaning+0.066
 eleven+0.064
 impossible+0.063
 Blood+0.063
Neg logit contributions
 Hannah-0.036
 Cindy-0.032
 Holly-0.032
 Woo-0.031
 c-0.031
Token Tom
Token ID4186
Feature activation+6.31e-07

Pos logit contributions
 Hannah+0.027
ishly+0.026
 Pupp+0.026
 Holly+0.025
 Cindy+0.025
Neg logit contributions
 eleven-0.012
 unstable-0.012
 taller-0.011
 done-0.011
ror-0.011
Token says
Token ID1139
Feature activation+8.65e-03

Feature not active on prev token;
no predictions were affected.
Token.
Token ID13
Feature activation+5.72e-03

Pos logit contributions
 eleven+0.017
 unstable+0.017
 done+0.016
Crash+0.016
ror+0.015
Neg logit contributions
ishly-0.010
 Hannah-0.009
 Pupp-0.008
 Holly-0.008
 Gr-0.008
Token "
Token ID366
Feature activation+1.18e-02

Pos logit contributions
 eleven+0.011
 unstable+0.010
 done+0.010
ror+0.010
 leaning+0.010
Neg logit contributions
 Hannah-0.007
 Pupp-0.006
ishly-0.006
 Holly-0.006
 therapist-0.006
Token she
Token ID673
Feature activation+1.14e-02

Pos logit contributions
ishly+0.010
 Hannah+0.009
rob+0.009
 Pupp+0.009
 Holly+0.009
Neg logit contributions
 eleven-0.009
 unstable-0.009
 done-0.009
 leaning-0.009
 taller-0.008
Token bowed
Token ID35737
Feature activation+6.30e-03

Pos logit contributions
 eleven+0.016
 unstable+0.015
 done+0.015
 taller+0.014
 leaning+0.014
Neg logit contributions
ishly-0.019
 Hannah-0.018
 Pupp-0.018
 Woo-0.017
 Holly-0.017
Token,
Token ID11
Feature activation-2.06e-03

Pos logit contributions
 eleven+0.007
 high+0.007
 taller+0.007
 higher+0.006
 done+0.006
Neg logit contributions
rob-0.012
ishly-0.012
 Hannah-0.012
lor-0.011
 Pupp-0.011
Token the
Token ID262
Feature activation-8.86e-04

Pos logit contributions
ishly+0.004
rob+0.003
lor+0.003
 Hannah+0.003
 therapist+0.003
Neg logit contributions
 eleven-0.003
 unstable-0.003
 done-0.003
 taller-0.003
 leaning-0.003
Token town
Token ID3240
Feature activation+1.77e-02

Pos logit contributions
ishly+0.001
 Hannah+0.001
 Pupp+0.001
 Holly+0.001
 therapist+0.001
Neg logit contributions
 eleven-0.001
 unstable-0.001
 done-0.001
 taller-0.001
 leaning-0.001
Tokenspe
Token ID4125
Feature activation+3.59e-02

Pos logit contributions
 eleven+0.028
 unstable+0.027
 done+0.027
 taller+0.026
 leaning+0.026
Neg logit contributions
 Hannah-0.025
ishly-0.025
 Pupp-0.024
 Woo-0.024
 Holly-0.024
Tokenople
Token ID643
Feature activation+6.43e-03

Pos logit contributions
 unstable+0.058
 eleven+0.056
 taller+0.056
 done+0.054
ror+0.053
Neg logit contributions
ishly-0.051
rob-0.045
 Pupp-0.044
 Hannah-0.044
 trends-0.043
Token stopped
Token ID5025
Feature activation+1.12e-02

Pos logit contributions
 eleven+0.009
 done+0.009
 unstable+0.008
 leaning+0.008
 taller+0.008
Neg logit contributions
 Hannah-0.011
ishly-0.011
 Pupp-0.010
 Woo-0.010
 Holly-0.010
Token to
Token ID284
Feature activation-1.89e-03

Pos logit contributions
 eleven+0.013
 unstable+0.013
 done+0.012
 taller+0.012
ror+0.011
Neg logit contributions
ishly-0.020
 Hannah-0.020
 Pupp-0.020
 Holly-0.019
 Cindy-0.019
Token admire
Token ID21099
Feature activation+8.56e-04

Pos logit contributions
ishly+0.003
 Hannah+0.003
rob+0.003
 therapist+0.003
 Holly+0.003
Neg logit contributions
 eleven-0.002
 done-0.002
 leaning-0.002
 high-0.002
 taller-0.002
Token her
Token ID607
Feature activation-3.70e-03

Pos logit contributions
 eleven+0.001
 done+0.001
 unstable+0.001
 leaning+0.001
 high+0.001
Neg logit contributions
ishly-0.002
rob-0.002
 therapist-0.002
lor-0.002
 Hannah-0.002
TokenIt
Token ID1026
Feature activation-1.05e-02

Pos logit contributions
ishly+0.010
 Hannah+0.009
 Buster+0.008
 Cindy+0.008
 trends+0.008
Neg logit contributions
 eleven-0.009
Crash-0.009
 unstable-0.008
 taller-0.008
 twelve-0.008
Token is
Token ID318
Feature activation+4.34e-03

Pos logit contributions
 Hannah+0.016
ishly+0.016
 Cindy+0.015
 Pupp+0.015
 Woo+0.015
Neg logit contributions
 eleven-0.016
 done-0.015
 unstable-0.015
 taller-0.015
ror-0.014
Token not
Token ID407
Feature activation+1.23e-02

Pos logit contributions
 unstable+0.006
 eleven+0.006
 done+0.005
Crash+0.005
 taller+0.005
Neg logit contributions
 Hannah-0.007
 Pupp-0.007
 Cindy-0.007
 Woo-0.007
 Holly-0.007
Token a
Token ID257
Feature activation-1.36e-02

Pos logit contributions
 eleven+0.015
 unstable+0.015
 done+0.014
 taller+0.014
ror+0.014
Neg logit contributions
 Hannah-0.021
 Pupp-0.021
ishly-0.021
 Cindy-0.020
 Woo-0.020
Token grass
Token ID8701
Feature activation-8.26e-03

Pos logit contributions
ishly+0.023
 Hannah+0.021
 Pupp+0.020
rob+0.020
 Hazel+0.019
Neg logit contributions
 eleven-0.020
 unstable-0.019
 taller-0.019
 high-0.018
 done-0.018
Tokenho
Token ID8873
Feature activation+3.57e-02

Pos logit contributions
 Hannah+0.016
Gr+0.015
 Cindy+0.015
 Natalie+0.015
 Buster+0.015
Neg logit contributions
 eleven-0.010
 taller-0.009
 high-0.009
 leaning-0.009
 unstable-0.009
Tokenpper
Token ID2848
Feature activation-2.16e-03

Pos logit contributions
 unstable+0.069
 eleven+0.067
 done+0.064
 taller+0.064
 minutes+0.063
Neg logit contributions
ishly-0.040
 Hannah-0.035
 Pupp-0.034
rob-0.034
 Woo-0.034
Token or
Token ID393
Feature activation-1.44e-02

Pos logit contributions
 Hannah+0.004
ishly+0.004
 Pupp+0.003
 Cindy+0.003
Gr+0.003
Neg logit contributions
 eleven-0.003
 taller-0.003
 done-0.003
 unstable-0.003
 leaning-0.003
Token a
Token ID257
Feature activation-1.57e-02

Pos logit contributions
ishly+0.029
 Hannah+0.029
lor+0.028
 trends+0.028
rob+0.028
Neg logit contributions
 eleven-0.015
 unstable-0.013
 taller-0.013
 twelve-0.013
 eight-0.013
Token cricket
Token ID18836
Feature activation-1.14e-02

Pos logit contributions
ishly+0.025
 Hannah+0.023
rob+0.023
lor+0.022
 trends+0.022
Neg logit contributions
 eleven-0.024
 taller-0.022
 unstable-0.022
 high-0.022
 leaning-0.022
Token.
Token ID13
Feature activation+1.12e-03

Pos logit contributions
 Hannah+0.018
ishly+0.018
 Pupp+0.017
 Cindy+0.017
 Holly+0.017
Neg logit contributions
 eleven-0.017
 unstable-0.016
 done-0.016
 taller-0.015
 leaning-0.015
Token a
Token ID257
Feature activation-4.00e-03

Pos logit contributions
ishly+0.008
rob+0.007
 Pupp+0.007
 Hannah+0.007
 therapist+0.007
Neg logit contributions
 eleven-0.004
 done-0.003
 unstable-0.003
 twelve-0.003
 leaning-0.003
Token world
Token ID995
Feature activation+8.78e-03

Pos logit contributions
 Hannah+0.007
ishly+0.007
 Pupp+0.006
 Holly+0.006
 therapist+0.006
Neg logit contributions
 eleven-0.005
 unstable-0.005
 done-0.005
 taller-0.005
ror-0.005
Token of
Token ID286
Feature activation-8.84e-03

Pos logit contributions
 eleven+0.012
 unstable+0.012
 done+0.012
 taller+0.012
 leaning+0.011
Neg logit contributions
ishly-0.014
 Hannah-0.014
 Pupp-0.013
 Woo-0.013
 Holly-0.013
Token make
Token ID787
Feature activation+9.27e-03

Pos logit contributions
ishly+0.015
 Hannah+0.015
 Pupp+0.014
 therapist+0.014
 Holly+0.014
Neg logit contributions
 eleven-0.012
 unstable-0.011
 done-0.011
 taller-0.011
 leaning-0.010
Token-
Token ID12
Feature activation+8.38e-03

Pos logit contributions
 eleven+0.013
 unstable+0.012
 done+0.012
 taller+0.012
 leaning+0.011
Neg logit contributions
ishly-0.016
 Hannah-0.015
 Pupp-0.015
 therapist-0.014
 Holly-0.014
Tokenbel
Token ID6667
Feature activation+3.49e-02

Pos logit contributions
 eleven+0.011
 unstable+0.011
 leaning+0.010
 done+0.010
ror+0.010
Neg logit contributions
ishly-0.013
 Pupp-0.013
Gr-0.013
 Hannah-0.013
 therapist-0.013
Tokenieve
Token ID12311
Feature activation-7.28e-03

Pos logit contributions
 eleven+0.057
 unstable+0.057
ror+0.052
 taller+0.052
 leaning+0.052
Neg logit contributions
 Hannah-0.045
ishly-0.044
rob-0.043
 Pupp-0.043
 trends-0.042
Token.
Token ID13
Feature activation+2.04e-03

Pos logit contributions
ishly+0.012
 Hannah+0.012
 Pupp+0.011
 Holly+0.011
 therapist+0.011
Neg logit contributions
 eleven-0.010
 unstable-0.010
 done-0.010
 taller-0.009
 leaning-0.009
Token
Token ID198
Feature activation+3.28e-03

Pos logit contributions
 eleven+0.003
 unstable+0.003
 done+0.003
 taller+0.003
ror+0.003
Neg logit contributions
ishly-0.003
 Hannah-0.003
rob-0.003
 Pupp-0.003
 therapist-0.003
Token
Token ID198
Feature activation+3.26e-03

Pos logit contributions
 eleven+0.005
 unstable+0.005
 done+0.005
 taller+0.005
 leaning+0.005
Neg logit contributions
 Hannah-0.005
ishly-0.005
 Pupp-0.005
 Holly-0.005
 Cindy-0.005
TokenShe
Token ID3347
Feature activation-1.41e-03

Pos logit contributions
 eleven+0.005
 unstable+0.005
 done+0.005
 taller+0.005
 leaning+0.005
Neg logit contributions
 Hannah-0.005
ishly-0.004
 Pupp-0.004
 Holly-0.004
 therapist-0.004
Token he
Token ID339
Feature activation+1.91e-03

Pos logit contributions
 eleven+0.003
 unstable+0.003
 done+0.003
 taller+0.003
 leaning+0.003
Neg logit contributions
ishly-0.003
 Hannah-0.003
rob-0.003
 therapist-0.003
lor-0.003
Token was
Token ID373
Feature activation-4.12e-03

Pos logit contributions
 done+0.002
 eleven+0.002
 unstable+0.002
 leaning+0.002
 taller+0.002
Neg logit contributions
ishly-0.004
 Hannah-0.003
 Pupp-0.003
 Cindy-0.003
 Hazel-0.003
Token playing
Token ID2712
Feature activation-6.79e-03

Pos logit contributions
 Hannah+0.008
 Holly+0.007
 Pupp+0.007
 Cindy+0.007
ishly+0.007
Neg logit contributions
 unstable-0.005
 eleven-0.004
Crash-0.004
 Eleven-0.004
ror-0.004
Token make
Token ID787
Feature activation+7.03e-03

Pos logit contributions
ishly+0.014
 Hannah+0.013
 Pupp+0.013
 Hazel+0.013
 Cindy+0.013
Neg logit contributions
 eleven-0.007
 taller-0.006
 done-0.006
 unstable-0.006
 higher-0.006
Token-
Token ID12
Feature activation+5.26e-03

Pos logit contributions
 done+0.008
 eleven+0.008
 taller+0.008
 higher+0.008
 high+0.008
Neg logit contributions
ishly-0.013
rob-0.012
 therapist-0.012
 Hannah-0.012
 Hazel-0.012
Tokenbel
Token ID6667
Feature activation+3.46e-02

Pos logit contributions
 eleven+0.006
 unstable+0.006
 done+0.005
 taller+0.005
 leaning+0.005
Neg logit contributions
ishly-0.010
 Hannah-0.010
 Pupp-0.010
 therapist-0.009
 Holly-0.009
Tokenieve
Token ID12311
Feature activation-6.31e-03

Pos logit contributions
 eleven+0.052
 unstable+0.051
 done+0.048
 taller+0.048
ror+0.047
Neg logit contributions
 Hannah-0.051
ishly-0.051
 Pupp-0.049
 therapist-0.047
rob-0.047
Token at
Token ID379
Feature activation+4.17e-03

Pos logit contributions
ishly+0.011
 Hannah+0.011
 Pupp+0.011
 Woo+0.010
 Cindy+0.010
Neg logit contributions
 eleven-0.008
 unstable-0.008
 done-0.008
 taller-0.007
 leaning-0.007
Token home
Token ID1363
Feature activation-1.39e-02

Pos logit contributions
 eleven+0.005
 unstable+0.005
 done+0.005
 taller+0.004
 leaning+0.004
Neg logit contributions
ishly-0.008
 Hannah-0.008
 Pupp-0.007
 Holly-0.007
 therapist-0.007
Token,
Token ID11
Feature activation-2.41e-03

Pos logit contributions
rob+0.028
ishly+0.028
 Woo+0.028
 Hannah+0.027
 Pupp+0.027
Neg logit contributions
 done-0.014
 taller-0.014
 eleven-0.013
 high-0.013
 leaning-0.013
Token too
Token ID1165
Feature activation-2.04e-03

Pos logit contributions
ishly+0.004
 Hannah+0.004
 Pupp+0.004
 Holly+0.004
 therapist+0.004
Neg logit contributions
 eleven-0.003
 unstable-0.003
 done-0.003
 taller-0.003
 leaning-0.003
Token loved
Token ID6151
Feature activation-2.02e-05

Pos logit contributions
ishly+0.015
 Hannah+0.015
rob+0.015
 Woo+0.014
 Pupp+0.014
Neg logit contributions
 done-0.016
 eleven-0.016
 unstable-0.016
 leaning-0.016
 taller-0.015
Token to
Token ID284
Feature activation-1.24e-02

Pos logit contributions
 Hannah+0.000
 Pupp+0.000
 Cindy+0.000
ishly+0.000
 Woo+0.000
Neg logit contributions
 unstable-0.000
 eleven-0.000
ror-0.000
 done-0.000
Crash-0.000
Token play
Token ID711
Feature activation-1.22e-03

Pos logit contributions
ishly+0.024
lor+0.024
rob+0.024
 Hannah+0.023
 therapist+0.023
Neg logit contributions
 high-0.013
 eleven-0.013
 done-0.013
 taller-0.013
 leaning-0.013
Token make
Token ID787
Feature activation+6.38e-03

Pos logit contributions
ishly+0.003
 Hannah+0.003
 Pupp+0.003
 therapist+0.003
 Hazel+0.003
Neg logit contributions
 eleven-0.001
 high-0.001
 taller-0.001
 higher-0.001
 done-0.001
Token-
Token ID12
Feature activation+7.78e-03

Pos logit contributions
 eleven+0.009
 taller+0.008
 unstable+0.008
 done+0.008
 leaning+0.008
Neg logit contributions
ishly-0.011
 Hannah-0.010
 Pupp-0.010
 therapist-0.010
 Hazel-0.010
Tokenbel
Token ID6667
Feature activation+3.46e-02

Pos logit contributions
 eleven+0.013
 unstable+0.013
 leaning+0.012
 done+0.012
ror+0.012
Neg logit contributions
ishly-0.010
 Hannah-0.010
 Pupp-0.010
Gr-0.010
rob-0.010
Tokenieve
Token ID12311
Feature activation-6.92e-03

Pos logit contributions
 unstable+0.055
 eleven+0.054
ror+0.050
 leaning+0.050
 taller+0.049
Neg logit contributions
 Hannah-0.046
rob-0.046
ishly-0.044
 Pupp-0.044
 trends-0.044
Token.
Token ID13
Feature activation+3.97e-03

Pos logit contributions
ishly+0.012
 Hannah+0.012
 Pupp+0.011
 Holly+0.011
 Woo+0.011
Neg logit contributions
 eleven-0.009
 unstable-0.009
 done-0.008
 taller-0.008
 leaning-0.008
Token Today
Token ID6288
Feature activation-1.45e-02

Pos logit contributions
 eleven+0.006
 unstable+0.005
 taller+0.005
 leaning+0.005
ror+0.005
Neg logit contributions
ishly-0.006
 therapist-0.006
rob-0.006
 Hannah-0.006
 Hazel-0.006
Token she
Token ID673
Feature activation-1.46e-03

Pos logit contributions
ishly+0.028
rob+0.027
 therapist+0.026
 shepherd+0.025
lor+0.024
Neg logit contributions
 done-0.019
 eleven-0.017
 high-0.016
 taller-0.016
 leaning-0.016
Token was
Token ID373
Feature activation-1.64e-03

Pos logit contributions
ishly+0.003
 Hannah+0.002
 Pupp+0.002
rob+0.002
 Woo+0.002
Neg logit contributions
 done-0.002
 eleven-0.002
 unstable-0.002
 taller-0.002
 leaning-0.002
Token was
Token ID373
Feature activation+1.40e-03

Pos logit contributions
 eleven+0.010
 done+0.010
 unstable+0.010
 taller+0.009
 leaning+0.009
Neg logit contributions
 Hannah-0.012
ishly-0.012
 Pupp-0.012
rob-0.012
 Woo-0.012
Token ready
Token ID3492
Feature activation+2.91e-03

Pos logit contributions
 eleven+0.002
 unstable+0.002
ror+0.001
 taller+0.001
Crash+0.001
Neg logit contributions
 Hannah-0.003
ishly-0.003
 Pupp-0.002
 Holly-0.002
 Cindy-0.002
Token for
Token ID329
Feature activation-1.81e-03

Pos logit contributions
 eleven+0.003
 done+0.003
 unstable+0.003
 taller+0.003
 leaning+0.003
Neg logit contributions
ishly-0.006
 Hannah-0.006
 Pupp-0.006
rob-0.006
 Holly-0.005
Token make
Token ID787
Feature activation+1.24e-02

Pos logit contributions
ishly+0.004
 Pupp+0.003
 Hannah+0.003
 therapist+0.003
rob+0.003
Neg logit contributions
 done-0.002
 eleven-0.002
 unstable-0.002
 twelve-0.002
 minutes-0.002
Token-
Token ID12
Feature activation+4.28e-03

Pos logit contributions
 eleven+0.015
 done+0.014
 unstable+0.014
 taller+0.014
 leaning+0.013
Neg logit contributions
ishly-0.023
 Hannah-0.022
 Pupp-0.022
 therapist-0.021
 Woo-0.021
Tokenbel
Token ID6667
Feature activation+3.40e-02

Pos logit contributions
 eleven+0.007
 unstable+0.007
 done+0.007
 taller+0.007
 leaning+0.006
Neg logit contributions
ishly-0.006
 Hannah-0.006
 Pupp-0.006
 Holly-0.005
 Cindy-0.005
Tokenieve
Token ID12311
Feature activation-1.36e-03

Pos logit contributions
 eleven+0.050
 unstable+0.050
 taller+0.046
ror+0.046
 leaning+0.046
Neg logit contributions
 Hannah-0.051
ishly-0.050
 Pupp-0.048
 therapist-0.047
rob-0.047
Token fun
Token ID1257
Feature activation-8.68e-03

Pos logit contributions
ishly+0.002
 Hannah+0.002
 Pupp+0.002
 Woo+0.002
 Cindy+0.002
Neg logit contributions
 eleven-0.002
 done-0.002
 unstable-0.002
 taller-0.002
 leaning-0.002
Token.
Token ID13
Feature activation-8.17e-04

Pos logit contributions
ishly+0.016
 Hannah+0.015
 Woo+0.015
 Pupp+0.015
 Hazel+0.015
Neg logit contributions
 done-0.011
 eleven-0.011
 taller-0.010
 minutes-0.010
 unstable-0.010
Token<|endoftext|>
Token ID50256
Feature activation+1.32e-03

Pos logit contributions
ishly+0.001
 Hannah+0.001
 Pupp+0.001
 Holly+0.001
 therapist+0.001
Neg logit contributions
 eleven-0.001
 unstable-0.001
 done-0.001
 taller-0.001
 leaning-0.001
TokenOnce
Token ID7454
Feature activation+5.18e-03

Pos logit contributions
 eleven+0.002
 unstable+0.002
 done+0.002
 taller+0.002
 leaning+0.002
Neg logit contributions
ishly-0.002
 Hannah-0.002
 Pupp-0.002
 therapist-0.002
 Cindy-0.002
Token.
Token ID13
Feature activation+4.47e-03

Pos logit contributions
ishly+0.004
 Hannah+0.004
 Pupp+0.004
 Holly+0.004
 Woo+0.004
Neg logit contributions
 eleven-0.005
 unstable-0.004
 done-0.004
 taller-0.004
 leaning-0.004
Token It
Token ID632
Feature activation-1.12e-02

Pos logit contributions
 eleven+0.008
 unstable+0.008
 done+0.008
ror+0.007
 taller+0.007
Neg logit contributions
 Hannah-0.006
 Pupp-0.005
 Cindy-0.005
ishly-0.005
 Holly-0.005
Token was
Token ID373
Feature activation-3.15e-03

Pos logit contributions
ishly+0.015
 Hannah+0.015
 Woo+0.014
 therapist+0.014
 Holly+0.014
Neg logit contributions
 unstable-0.019
 eleven-0.019
 done-0.018
 taller-0.018
 leaning-0.018
Token a
Token ID257
Feature activation+1.97e-03

Pos logit contributions
 Hannah+0.006
ishly+0.006
 Pupp+0.006
 Holly+0.005
 Cindy+0.005
Neg logit contributions
 eleven-0.004
 unstable-0.004
 done-0.003
 taller-0.003
ror-0.003
Token grass
Token ID8701
Feature activation-8.35e-03

Pos logit contributions
 unstable+0.002
 eleven+0.002
 taller+0.002
 leaning+0.002
 high+0.002
Neg logit contributions
ishly-0.004
 Hannah-0.004
rob-0.004
 Holly-0.003
lor-0.003
Tokenho
Token ID8873
Feature activation+3.39e-02

Pos logit contributions
 Hannah+0.014
ishly+0.014
 therapist+0.014
 trends+0.014
Gr+0.014
Neg logit contributions
 eleven-0.011
 leaning-0.011
 unstable-0.011
 taller-0.010
 high-0.010
Tokenpper
Token ID2848
Feature activation+5.85e-04

Pos logit contributions
 eleven+0.061
 unstable+0.061
 done+0.057
 taller+0.057
ror+0.055
Neg logit contributions
ishly-0.042
 Hannah-0.039
 Pupp-0.038
 Woo-0.037
 therapist-0.037
Token!
Token ID0
Feature activation+5.78e-04

Pos logit contributions
 eleven+0.001
 unstable+0.001
 done+0.001
 leaning+0.001
 taller+0.001
Neg logit contributions
ishly-0.001
 Hannah-0.001
 Pupp-0.001
 Holly-0.001
 therapist-0.001
Token Tom
Token ID4186
Feature activation+3.33e-03

Pos logit contributions
 eleven+0.001
 unstable+0.001
 done+0.001
ror+0.001
 taller+0.001
Neg logit contributions
 Hannah-0.001
 Pupp-0.001
ishly-0.001
 Cindy-0.001
 Woo-0.001
Token was
Token ID373
Feature activation-6.33e-03

Pos logit contributions
 eleven+0.006
 unstable+0.005
 done+0.005
 taller+0.005
ror+0.005
Neg logit contributions
ishly-0.005
 Hannah-0.004
 Pupp-0.004
 Holly-0.004
 Cindy-0.004
Token so
Token ID523
Feature activation-3.53e-03

Pos logit contributions
 Pupp+0.010
 Hannah+0.010
ishly+0.010
 Holly+0.009
 Cindy+0.009
Neg logit contributions
 unstable-0.009
 eleven-0.009
Crash-0.009
ror-0.009
 done-0.008
Tokenongo
Token ID25162
Feature activation+1.05e-02

Pos logit contributions
ishly+0.009
 Hannah+0.009
 Pupp+0.008
 Holly+0.008
 therapist+0.008
Neg logit contributions
 eleven-0.026
 unstable-0.025
 done-0.025
 taller-0.024
 leaning-0.024
Token didn
Token ID1422
Feature activation-1.06e-03

Pos logit contributions
 eleven+0.017
 unstable+0.017
 taller+0.016
 done+0.016
ror+0.016
Neg logit contributions
ishly-0.015
 Hannah-0.014
 Pupp-0.014
 Holly-0.013
 therapist-0.013
Token't
Token ID470
Feature activation+7.68e-03

Pos logit contributions
ishly+0.002
 Hannah+0.002
rob+0.002
 therapist+0.002
 trends+0.002
Neg logit contributions
 unstable-0.001
 eleven-0.001
 done-0.001
 taller-0.001
 high-0.001
Token like
Token ID588
Feature activation+1.54e-03

Pos logit contributions
 eleven+0.010
 unstable+0.009
 done+0.009
 taller+0.009
 leaning+0.008
Neg logit contributions
ishly-0.014
 Hannah-0.014
 Pupp-0.013
 therapist-0.013
 Holly-0.013
Token wet
Token ID9583
Feature activation-1.11e-02

Pos logit contributions
 eleven+0.002
ror+0.002
 unstable+0.002
 done+0.002
Crash+0.002
Neg logit contributions
 Hannah-0.002
 Pupp-0.002
 Woo-0.002
 Cindy-0.002
ishly-0.002
Token av
Token ID1196
Feature activation+3.37e-02

Pos logit contributions
ishly+0.018
 Hannah+0.017
 Pupp+0.017
 Holly+0.016
 therapist+0.016
Neg logit contributions
 eleven-0.016
 unstable-0.015
 done-0.015
 taller-0.014
 leaning-0.014
Tokenoc
Token ID420
Feature activation+1.38e-02

Pos logit contributions
 eleven+0.089
ror+0.088
 unstable+0.088
 done+0.086
 leaning+0.086
Neg logit contributions
rob-0.008
ishly-0.006
 Woo-0.005
 Pupp-0.005
 Hannah-0.003
Tokenados
Token ID22484
Feature activation+1.39e-02

Pos logit contributions
 unstable+0.021
 eleven+0.021
 done+0.020
 minutes+0.020
ror+0.020
Neg logit contributions
ishly-0.021
 Pupp-0.019
 Hannah-0.019
 Cindy-0.018
rob-0.018
Token,
Token ID11
Feature activation+6.17e-03

Pos logit contributions
 eleven+0.020
 unstable+0.020
 done+0.020
 taller+0.019
 leaning+0.019
Neg logit contributions
ishly-0.022
 Hannah-0.021
 Pupp-0.021
 therapist-0.020
 Holly-0.020
Token so
Token ID523
Feature activation-6.61e-03

Pos logit contributions
 eleven+0.007
 unstable+0.006
 done+0.006
 taller+0.006
 leaning+0.006
Neg logit contributions
ishly-0.012
lor-0.012
 Hannah-0.012
rob-0.012
 therapist-0.012
Token he
Token ID339
Feature activation+8.48e-03

Pos logit contributions
ishly+0.013
lor+0.012
rob+0.012
 trends+0.012
 therapist+0.012
Neg logit contributions
 high-0.008
 done-0.007
 unstable-0.007
 leaning-0.007
 wind-0.007
Token some
Token ID617
Feature activation-8.81e-04

Pos logit contributions
 Hannah+0.007
ishly+0.007
 Pupp+0.007
 Holly+0.007
 Cindy+0.007
Neg logit contributions
 eleven-0.005
 unstable-0.005
 done-0.005
ror-0.005
 leaning-0.005
Token.
Token ID13
Feature activation+9.80e-03

Pos logit contributions
 Hannah+0.001
ishly+0.001
 Holly+0.001
 Pupp+0.001
 Cindy+0.001
Neg logit contributions
 unstable-0.001
 eleven-0.001
ror-0.001
 done-0.001
 leaning-0.001
Token
Token ID198
Feature activation-1.51e-05

Pos logit contributions
 unstable+0.017
 eleven+0.017
 done+0.017
 taller+0.016
ror+0.016
Neg logit contributions
 Hannah-0.013
 Cindy-0.012
 Holly-0.012
 Pupp-0.012
ishly-0.012
Token
Token ID198
Feature activation-2.57e-03

Pos logit contributions
ishly+0.000
 Hannah+0.000
 Pupp+0.000
 Holly+0.000
 therapist+0.000
Neg logit contributions
 eleven-0.000
 unstable-0.000
 done-0.000
 taller-0.000
 leaning-0.000
Token"
Token ID1
Feature activation-1.28e-03

Pos logit contributions
ishly+0.005
 Hannah+0.004
 Pupp+0.004
rob+0.004
 therapist+0.004
Neg logit contributions
 eleven-0.003
 unstable-0.003
 done-0.003
 leaning-0.003
 taller-0.003
TokenLook
Token ID8567
Feature activation+3.37e-02

Pos logit contributions
 Hannah+0.002
ishly+0.002
 Pupp+0.002
 Holly+0.002
 therapist+0.002
Neg logit contributions
 eleven-0.002
 unstable-0.002
 done-0.002
 taller-0.002
 leaning-0.002
Token,
Token ID11
Feature activation+1.32e-02

Pos logit contributions
 unstable+0.071
 impossible+0.063
 eleven+0.062
 taller+0.062
 leaning+0.062
Neg logit contributions
 Hannah-0.035
ishly-0.033
 Cindy-0.032
 Holly-0.031
 Pupp-0.030
Token Tom
Token ID4186
Feature activation+1.78e-02

Pos logit contributions
 eleven+0.014
 unstable+0.013
 done+0.013
 taller+0.012
 leaning+0.012
Neg logit contributions
 Hannah-0.026
ishly-0.026
 Pupp-0.025
 Holly-0.025
 Cindy-0.025
Token,
Token ID11
Feature activation+1.41e-02

Pos logit contributions
 unstable+0.037
 eleven+0.036
 twelve+0.034
Crash+0.033
 taller+0.033
Neg logit contributions
 Hannah-0.016
ishly-0.016
 Pupp-0.015
 Cindy-0.014
 Holly-0.014
Token fruit
Token ID8234
Feature activation-4.69e-03

Pos logit contributions
 unstable+0.023
 eleven+0.022
 done+0.021
 leaning+0.021
ror+0.021
Neg logit contributions
 Hannah-0.019
 Pupp-0.019
 Holly-0.018
ishly-0.018
 Woo-0.018
Token!"
Token ID2474
Feature activation-5.25e-03

Pos logit contributions
 Hannah+0.006
ishly+0.006
 Holly+0.006
 Pupp+0.006
 Hazel+0.006
Neg logit contributions
 unstable-0.008
 eleven-0.008
ror-0.007
 leaning-0.007
 done-0.007
Token it
Token ID340
Feature activation+1.91e-03

Pos logit contributions
ishly+0.012
 Hannah+0.011
 Pupp+0.011
 Holly+0.011
 therapist+0.011
Neg logit contributions
 eleven-0.008
 unstable-0.007
 done-0.007
 taller-0.007
 leaning-0.007
Token sang
Token ID25889
Feature activation+5.21e-03

Pos logit contributions
 eleven+0.003
 unstable+0.003
 done+0.003
 taller+0.002
 leaning+0.002
Neg logit contributions
ishly-0.003
 Hannah-0.003
 Pupp-0.003
 Woo-0.003
 Cindy-0.003
Token,
Token ID11
Feature activation+1.76e-03

Pos logit contributions
 eleven+0.006
 taller+0.006
 high+0.006
 done+0.006
 higher+0.006
Neg logit contributions
ishly-0.010
rob-0.010
 Hannah-0.009
 therapist-0.009
 Woo-0.009
Token the
Token ID262
Feature activation+4.28e-03

Pos logit contributions
 eleven+0.002
 unstable+0.002
 done+0.002
 taller+0.002
 leaning+0.002
Neg logit contributions
ishly-0.003
 Hannah-0.003
 Pupp-0.003
 Holly-0.003
 therapist-0.003
Token town
Token ID3240
Feature activation+1.24e-02

Pos logit contributions
 eleven+0.007
 unstable+0.007
 done+0.007
ror+0.007
cks+0.007
Neg logit contributions
 Pupp-0.006
ishly-0.005
 puppy-0.005
 therapist-0.005
 Hannah-0.005
Tokenspe
Token ID4125
Feature activation+3.36e-02

Pos logit contributions
 eleven+0.019
 unstable+0.018
 done+0.018
 taller+0.018
 leaning+0.017
Neg logit contributions
 Hannah-0.019
ishly-0.018
 Woo-0.018
 Holly-0.018
 Hazel-0.017
Tokenople
Token ID643
Feature activation+5.46e-03

Pos logit contributions
 unstable+0.052
 eleven+0.051
 taller+0.050
 done+0.049
ror+0.048
Neg logit contributions
ishly-0.049
 Hannah-0.046
 Pupp-0.045
rob-0.044
 therapist-0.044
Token would
Token ID561
Feature activation-5.29e-03

Pos logit contributions
 eleven+0.008
 done+0.008
 unstable+0.007
 leaning+0.007
 taller+0.007
Neg logit contributions
 Hannah-0.009
ishly-0.009
 Pupp-0.008
 Woo-0.008
rob-0.008
Token act
Token ID719
Feature activation+9.16e-06

Pos logit contributions
rob+0.009
 Hannah+0.009
ishly+0.009
 therapist+0.009
 Pupp+0.009
Neg logit contributions
 eleven-0.007
 done-0.006
 high-0.006
 taller-0.006
 higher-0.006
Token nice
Token ID3621
Feature activation-7.07e-03

Feature not active on prev token;
no predictions were affected.
Token.
Token ID13
Feature activation+3.78e-04

Pos logit contributions
ishly+0.012
 Hannah+0.012
 Pupp+0.012
 Woo+0.012
rob+0.012
Neg logit contributions
 eleven-0.009
 done-0.009
 taller-0.008
 unstable-0.008
 higher-0.008
Token Tom
Token ID4186
Feature activation+6.05e-03

Pos logit contributions
 Hannah+0.009
ishly+0.008
 Pupp+0.008
 Holly+0.008
 Cindy+0.008
Neg logit contributions
 unstable-0.005
 eleven-0.005
 done-0.005
 taller-0.005
ror-0.005
Token.
Token ID13
Feature activation+2.50e-03

Pos logit contributions
 unstable+0.012
 eleven+0.012
Crash+0.011
 taller+0.011
 done+0.011
Neg logit contributions
 Hannah-0.007
ishly-0.007
 Holly-0.006
 Pupp-0.006
 Woo-0.006
Token
Token ID198
Feature activation-3.03e-03

Pos logit contributions
 unstable+0.005
 eleven+0.005
 done+0.005
 taller+0.005
ror+0.005
Neg logit contributions
 Hannah-0.003
ishly-0.003
 Pupp-0.003
 Woo-0.002
 Cindy-0.002
Token
Token ID198
Feature activation-3.07e-03

Pos logit contributions
 Hannah+0.004
ishly+0.004
 Pupp+0.004
 Holly+0.004
 Woo+0.004
Neg logit contributions
 eleven-0.005
 unstable-0.005
 done-0.004
 taller-0.004
ror-0.004
Token"
Token ID1
Feature activation-1.74e-03

Pos logit contributions
ishly+0.004
 Hannah+0.004
 Pupp+0.004
 Holly+0.004
 therapist+0.004
Neg logit contributions
 eleven-0.005
 unstable-0.005
 done-0.005
 taller-0.004
ror-0.004
TokenLook
Token ID8567
Feature activation+3.33e-02

Pos logit contributions
 Hannah+0.002
 therapist+0.002
 Pupp+0.002
 Woo+0.002
ishly+0.002
Neg logit contributions
 eleven-0.003
 unstable-0.003
 done-0.003
 taller-0.003
ror-0.003
Token,
Token ID11
Feature activation+1.59e-02

Pos logit contributions
 unstable+0.065
 eleven+0.059
 taller+0.058
 impossible+0.057
 leaning+0.057
Neg logit contributions
 Hannah-0.039
ishly-0.037
 Cindy-0.035
 Holly-0.035
 Pupp-0.034
Token Tom
Token ID4186
Feature activation+1.41e-02

Pos logit contributions
 eleven+0.017
 unstable+0.017
 done+0.016
 taller+0.016
 leaning+0.015
Neg logit contributions
 Hannah-0.030
ishly-0.030
 Pupp-0.029
 Holly-0.029
 Cindy-0.029
Token.
Token ID13
Feature activation+1.56e-02

Pos logit contributions
 unstable+0.030
 eleven+0.030
 twelve+0.028
 Eleven+0.027
Crash+0.027
Neg logit contributions
 Hannah-0.011
ishly-0.011
 Pupp-0.011
 c-0.010
 puppy-0.010
Token I
Token ID314
Feature activation-1.53e-03

Pos logit contributions
 unstable+0.026
 done+0.026
 eleven+0.026
 taller+0.024
 leaning+0.024
Neg logit contributions
 Hannah-0.021
ishly-0.021
 Pupp-0.020
 Woo-0.020
 Holly-0.019
Token have
Token ID423
Feature activation+1.30e-02

Pos logit contributions
 Hannah+0.002
ishly+0.002
 Pupp+0.002
 Holly+0.002
 Woo+0.002
Neg logit contributions
 eleven-0.002
 unstable-0.002
 leaning-0.002
 taller-0.002
 twelve-0.002
Token
Token ID198
Feature activation+4.13e-03

Pos logit contributions
 eleven+0.002
 unstable+0.002
 taller+0.001
 twelve+0.001
ror+0.001
Neg logit contributions
ishly-0.002
 Hannah-0.002
 trends-0.002
rob-0.002
 therapist-0.002
Token
Token ID198
Feature activation+3.54e-03

Pos logit contributions
 eleven+0.006
 unstable+0.006
 done+0.006
 taller+0.006
 leaning+0.005
Neg logit contributions
ishly-0.006
 Hannah-0.006
 Pupp-0.006
 Holly-0.006
 therapist-0.006
Token"
Token ID1
Feature activation+2.30e-03

Pos logit contributions
 eleven+0.005
 unstable+0.004
 taller+0.004
 high+0.004
Crash+0.004
Neg logit contributions
ishly-0.007
 Hannah-0.006
 trends-0.006
 Pupp-0.006
 Cindy-0.006
TokenLook
Token ID8567
Feature activation+2.83e-02

Pos logit contributions
 eleven+0.004
 unstable+0.004
 done+0.004
 taller+0.003
 leaning+0.003
Neg logit contributions
 Hannah-0.003
ishly-0.003
 Pupp-0.003
 Holly-0.003
 Woo-0.003
Token,
Token ID11
Feature activation+1.41e-02

Pos logit contributions
 unstable+0.055
 eleven+0.050
 leaning+0.049
 taller+0.049
 done+0.048
Neg logit contributions
 Hannah-0.034
ishly-0.032
 Holly-0.031
 Pupp-0.031
 Cindy-0.031
Token look
Token ID804
Feature activation+3.32e-02

Pos logit contributions
 eleven+0.017
 unstable+0.015
 taller+0.015
 done+0.015
 twelve+0.014
Neg logit contributions
ishly-0.027
rob-0.026
 Hannah-0.026
 trends-0.025
 therapist-0.025
Token,
Token ID11
Feature activation+1.09e-02

Pos logit contributions
 unstable+0.066
 leaning+0.062
 eleven+0.061
 done+0.059
Crash+0.058
Neg logit contributions
 Hannah-0.032
 Holly-0.029
 Cindy-0.029
ishly-0.028
 c-0.028
Token another
Token ID1194
Feature activation-8.39e-03

Pos logit contributions
 eleven+0.012
 unstable+0.011
 taller+0.011
 done+0.010
 twelve+0.010
Neg logit contributions
ishly-0.022
 Hannah-0.021
 therapist-0.020
rob-0.020
 trends-0.020
Token plane
Token ID6614
Feature activation+8.42e-03

Pos logit contributions
ishly+0.012
 Hannah+0.012
 Pupp+0.012
 Holly+0.011
 therapist+0.011
Neg logit contributions
 eleven-0.013
 unstable-0.013
 done-0.013
 taller-0.012
 leaning-0.012
Token!"
Token ID2474
Feature activation-1.20e-02

Pos logit contributions
 unstable+0.016
 eleven+0.015
ror+0.014
 Eleven+0.014
 taller+0.014
Neg logit contributions
ishly-0.009
 Pupp-0.008
rob-0.008
 Hannah-0.008
 c-0.008
Token Sara
Token ID24799
Feature activation+1.01e-02

Pos logit contributions
ishly+0.023
 trends+0.021
lor+0.021
rob+0.021
 Hannah+0.021
Neg logit contributions
 eleven-0.015
 unstable-0.014
 taller-0.013
 high-0.013
 twelve-0.013
Token.
Token ID13
Feature activation+1.61e-03

Pos logit contributions
 eleven+0.004
 done+0.004
 unstable+0.004
 taller+0.004
 leaning+0.003
Neg logit contributions
ishly-0.004
 Hannah-0.004
 Pupp-0.004
rob-0.004
 Woo-0.004
Token
Token ID220
Feature activation+1.48e-03

Pos logit contributions
 eleven+0.002
 unstable+0.002
 done+0.002
 taller+0.002
 leaning+0.002
Neg logit contributions
ishly-0.002
 Hannah-0.002
 Pupp-0.002
 therapist-0.002
 Holly-0.002
Token
Token ID198
Feature activation+6.68e-03

Pos logit contributions
 eleven+0.002
 unstable+0.002
 done+0.002
 taller+0.002
 leaning+0.002
Neg logit contributions
 Hannah-0.002
ishly-0.002
 Pupp-0.002
 Holly-0.002
 Cindy-0.002
Token
Token ID198
Feature activation+1.64e-03

Pos logit contributions
 eleven+0.010
 unstable+0.010
 done+0.010
 taller+0.009
ror+0.009
Neg logit contributions
 Hannah-0.010
ishly-0.010
 Pupp-0.009
 Holly-0.009
 Cindy-0.009
TokenThe
Token ID464
Feature activation-8.47e-03

Pos logit contributions
 eleven+0.003
 unstable+0.003
 done+0.003
 taller+0.003
ror+0.002
Neg logit contributions
ishly-0.002
 Hannah-0.002
 Pupp-0.002
 therapist-0.002
 Holly-0.002
Token c
Token ID269
Feature activation-3.69e-02

Pos logit contributions
ishly+0.015
 Hannah+0.014
rob+0.014
lor+0.013
 Woo+0.013
Neg logit contributions
 done-0.011
 eleven-0.011
 taller-0.010
 wind-0.010
 minutes-0.010
Tokenact
Token ID529
Feature activation-1.82e-02

Pos logit contributions
ishly+0.047
 Hannah+0.044
 Hazel+0.042
 Pupp+0.042
 Buster+0.040
Neg logit contributions
 done-0.065
 taller-0.063
 unstable-0.061
ror-0.061
 leaning-0.061
Tokeni
Token ID72
Feature activation-1.83e-03

Pos logit contributions
 Hannah+0.022
 Woo+0.022
 trends+0.022
 Pupp+0.022
 puppy+0.022
Neg logit contributions
 done-0.030
 leaning-0.030
cks-0.029
 eleven-0.029
ror-0.028
Token went
Token ID1816
Feature activation-1.57e-02

Pos logit contributions
 Hannah+0.003
ishly+0.003
 Woo+0.003
 Pupp+0.003
 Cindy+0.003
Neg logit contributions
 done-0.003
 eleven-0.003
 taller-0.002
 unstable-0.002
 leaning-0.002
Token exploring
Token ID13504
Feature activation-9.37e-03

Pos logit contributions
ishly+0.035
 Hannah+0.035
rob+0.034
 Pupp+0.034
 therapist+0.034
Neg logit contributions
 eleven-0.012
 done-0.012
 taller-0.011
 higher-0.011
 high-0.011
Token and
Token ID290
Feature activation-5.83e-03

Pos logit contributions
ishly+0.019
 Hannah+0.018
 Pupp+0.018
 therapist+0.018
 Woo+0.017
Neg logit contributions
 high-0.010
 eleven-0.010
 higher-0.009
 taller-0.009
 done-0.009
Token!
Token ID0
Feature activation-3.04e-03

Pos logit contributions
ishly+0.019
 Hannah+0.019
 Pupp+0.018
rob+0.018
 Woo+0.018
Neg logit contributions
 eleven-0.010
 leaning-0.010
 taller-0.010
 high-0.010
 done-0.010
Token
Token ID220
Feature activation-1.04e-03

Pos logit contributions
ishly+0.005
rob+0.005
 Hannah+0.005
 therapist+0.005
 trends+0.005
Neg logit contributions
 eleven-0.004
 unstable-0.004
 twelve-0.004
 done-0.004
 leaning-0.004
Token
Token ID198
Feature activation+2.93e-03

Pos logit contributions
ishly+0.002
 Hannah+0.002
 Pupp+0.002
 Holly+0.001
 therapist+0.001
Neg logit contributions
 eleven-0.002
 unstable-0.002
 done-0.001
 taller-0.001
 leaning-0.001
Token
Token ID198
Feature activation-1.25e-03

Pos logit contributions
 eleven+0.004
 unstable+0.004
 done+0.004
 taller+0.004
 leaning+0.004
Neg logit contributions
ishly-0.005
 Hannah-0.005
 Pupp-0.004
 Holly-0.004
 therapist-0.004
TokenThe
Token ID464
Feature activation-8.04e-03

Pos logit contributions
ishly+0.002
 Hannah+0.002
 Pupp+0.002
 Holly+0.002
 Woo+0.002
Neg logit contributions
 eleven-0.002
 unstable-0.002
 done-0.002
 taller-0.002
 leaning-0.002
Token c
Token ID269
Feature activation-3.65e-02

Pos logit contributions
ishly+0.010
rob+0.009
lor+0.009
 Hannah+0.009
 Buster+0.009
Neg logit contributions
 done-0.014
 eleven-0.014
 wind-0.014
 taller-0.013
 minutes-0.013
Tokenact
Token ID529
Feature activation-1.78e-02

Pos logit contributions
ishly+0.043
 Hannah+0.038
 Pupp+0.037
 Hazel+0.036
 Buster+0.036
Neg logit contributions
 done-0.067
 taller-0.065
ror-0.064
 leaning-0.063
 unstable-0.063
Tokeni
Token ID72
Feature activation-3.17e-03

Pos logit contributions
 trends+0.022
ishly+0.022
 Hannah+0.022
 Woo+0.022
 Pupp+0.022
Neg logit contributions
 done-0.028
 leaning-0.028
cks-0.027
ror-0.027
 eleven-0.027
Token were
Token ID547
Feature activation-7.21e-03

Pos logit contributions
 Hannah+0.005
rob+0.005
ishly+0.005
 Woo+0.005
 Pupp+0.005
Neg logit contributions
 done-0.004
 eleven-0.004
 leaning-0.004
ror-0.004
 taller-0.004
Token so
Token ID523
Feature activation-7.70e-03

Pos logit contributions
ishly+0.013
 Hannah+0.013
 Pupp+0.013
 Holly+0.012
 Cindy+0.012
Neg logit contributions
 eleven-0.009
 unstable-0.009
 done-0.008
 taller-0.008
ror-0.008
Token surprised
Token ID6655
Feature activation-1.73e-03

Pos logit contributions
 Hannah+0.017
ishly+0.017
 Pupp+0.016
 Holly+0.016
 Cindy+0.016
Neg logit contributions
 eleven-0.007
 unstable-0.006
 done-0.006
 taller-0.005
ror-0.005
Token!"
Token ID2474
Feature activation+6.39e-03

Pos logit contributions
ishly+0.005
 Hannah+0.005
 Pupp+0.005
 Holly+0.005
 therapist+0.005
Neg logit contributions
 eleven-0.005
 unstable-0.005
 done-0.005
 taller-0.005
 leaning-0.005
Token
Token ID220
Feature activation+7.45e-04

Pos logit contributions
 eleven+0.007
 done+0.007
 taller+0.007
 leaning+0.006
 high+0.006
Neg logit contributions
ishly-0.011
 Hannah-0.011
 trends-0.011
 Woo-0.011
 therapist-0.011
Token
Token ID198
Feature activation+9.98e-03

Pos logit contributions
 eleven+0.001
 unstable+0.001
 done+0.001
 taller+0.001
 leaning+0.001
Neg logit contributions
ishly-0.001
 Hannah-0.001
 Pupp-0.001
 Holly-0.001
 Cindy-0.001
Token
Token ID198
Feature activation+9.10e-03

Pos logit contributions
 eleven+0.011
 done+0.010
 taller+0.010
 unstable+0.010
 high+0.010
Neg logit contributions
ishly-0.019
 Hannah-0.018
 trends-0.018
 Pupp-0.017
 Woo-0.017
TokenThe
Token ID464
Feature activation-9.83e-03

Pos logit contributions
 eleven+0.014
 unstable+0.013
 taller+0.013
 done+0.013
 twelve+0.013
Neg logit contributions
ishly-0.014
 Hannah-0.014
 Cindy-0.013
 Woo-0.013
 Holly-0.013
Token c
Token ID269
Feature activation-3.64e-02

Pos logit contributions
ishly+0.017
rob+0.015
lor+0.015
 Hannah+0.015
 Woo+0.015
Neg logit contributions
 done-0.013
 taller-0.013
 wind-0.013
 eleven-0.012
 careful-0.012
Tokenact
Token ID529
Feature activation-1.65e-02

Pos logit contributions
ishly+0.044
 Hannah+0.041
 Pupp+0.039
 Hazel+0.039
 Buster+0.038
Neg logit contributions
 done-0.065
 taller-0.064
 unstable-0.063
ror-0.062
 leaning-0.062
Tokeni
Token ID72
Feature activation+2.18e-03

Pos logit contributions
 trends+0.019
 Woo+0.019
 Hannah+0.019
 Pupp+0.019
 puppy+0.019
Neg logit contributions
 done-0.028
 leaning-0.027
 eleven-0.026
ror-0.026
cks-0.026
Token were
Token ID547
Feature activation-5.43e-03

Pos logit contributions
 eleven+0.003
 done+0.003
 taller+0.003
 unstable+0.003
ror+0.003
Neg logit contributions
 Hannah-0.003
 Woo-0.003
ishly-0.003
rob-0.003
 Cindy-0.003
Token very
Token ID845
Feature activation-9.18e-03

Pos logit contributions
ishly+0.010
 Hannah+0.010
 Pupp+0.010
 Holly+0.009
 Cindy+0.009
Neg logit contributions
 eleven-0.006
 unstable-0.006
ror-0.006
 done-0.006
Crash-0.006
Token happy
Token ID3772
Feature activation+1.10e-03

Pos logit contributions
ishly+0.021
 Hannah+0.021
 Pupp+0.020
rob+0.020
 therapist+0.020
Neg logit contributions
 eleven-0.007
 unstable-0.007
 done-0.006
 taller-0.006
 leaning-0.006
Token to
Token ID284
Feature activation+1.50e-02

Pos logit contributions
 eleven+0.010
 unstable+0.009
 done+0.009
 taller+0.009
 leaning+0.008
Neg logit contributions
ishly-0.021
 Hannah-0.021
 Pupp-0.020
 Holly-0.020
 therapist-0.020
Token do
Token ID466
Feature activation+2.27e-03

Pos logit contributions
 eleven+0.020
 done+0.019
 unstable+0.019
 taller+0.018
 leaning+0.018
Neg logit contributions
ishly-0.026
 Hannah-0.026
 therapist-0.025
 Holly-0.024
 Pupp-0.024
Token.
Token ID13
Feature activation-7.54e-03

Pos logit contributions
 eleven+0.003
 done+0.003
 unstable+0.003
 taller+0.003
 leaning+0.003
Neg logit contributions
ishly-0.004
 Hannah-0.004
 Pupp-0.003
 therapist-0.003
 Cindy-0.003
Token
Token ID198
Feature activation+2.58e-03

Pos logit contributions
ishly+0.012
 Hannah+0.012
 therapist+0.011
 Pupp+0.011
 Holly+0.011
Neg logit contributions
 eleven-0.010
 unstable-0.010
 done-0.010
 taller-0.009
ror-0.009
Token
Token ID198
Feature activation-5.16e-03

Pos logit contributions
 eleven+0.004
 unstable+0.004
 done+0.004
 taller+0.004
 leaning+0.004
Neg logit contributions
 Hannah-0.004
ishly-0.004
 Pupp-0.004
 Holly-0.003
 Cindy-0.003
TokenT
Token ID51
Feature activation-3.62e-02

Pos logit contributions
ishly+0.009
 Hannah+0.008
 Holly+0.008
 trends+0.008
 Cindy+0.008
Neg logit contributions
 eleven-0.007
Crash-0.007
 unstable-0.007
 high-0.007
 done-0.007
Tokeniny
Token ID3541
Feature activation-6.20e-03

Pos logit contributions
ishly+0.073
 Hannah+0.069
 Holly+0.066
 Cindy+0.065
 Pupp+0.065
Neg logit contributions
ror-0.034
 unstable-0.034
 eleven-0.033
 done-0.033
 leaning-0.031
Token decided
Token ID3066
Feature activation-1.78e-03

Pos logit contributions
 Hannah+0.010
ishly+0.009
rob+0.009
 Woo+0.009
 Holly+0.009
Neg logit contributions
 eleven-0.009
 unstable-0.009
 done-0.009
 taller-0.008
ror-0.008
Token to
Token ID284
Feature activation-2.35e-03

Pos logit contributions
ishly+0.004
 Hannah+0.004
rob+0.004
 Holly+0.004
 Pupp+0.004
Neg logit contributions
 eleven-0.001
 done-0.001
 unstable-0.001
 taller-0.001
 leaning-0.001
Token be
Token ID307
Feature activation+6.59e-03

Pos logit contributions
rob+0.005
 Hannah+0.005
 therapist+0.005
 Holly+0.004
ishly+0.004
Neg logit contributions
 done-0.003
 eleven-0.002
 high-0.002
 unstable-0.002
 taller-0.002
Token brave
Token ID14802
Feature activation-3.57e-03

Pos logit contributions
 unstable+0.008
 eleven+0.008
 twelve+0.008
ror+0.008
 done+0.007
Neg logit contributions
 Pupp-0.011
 Hannah-0.011
ishly-0.011
 Woo-0.011
 Holly-0.010
Token were
Token ID547
Feature activation-4.60e-03

Pos logit contributions
rob+0.001
 trends+0.001
 Hannah+0.001
ishly+0.001
lor+0.001
Neg logit contributions
 done-0.001
 eleven-0.001
 unstable-0.001
 leaning-0.001
 high-0.001
Token healed
Token ID28557
Feature activation+9.50e-03

Pos logit contributions
ishly+0.008
 Hannah+0.008
 Pupp+0.008
 Cindy+0.008
 Holly+0.008
Neg logit contributions
 eleven-0.005
 unstable-0.005
ror-0.005
Crash-0.005
 Eleven-0.005
Token.
Token ID13
Feature activation-5.52e-03

Pos logit contributions
 eleven+0.013
 unstable+0.013
 done+0.013
 taller+0.013
 leaning+0.012
Neg logit contributions
 Hannah-0.015
ishly-0.015
 Pupp-0.015
rob-0.015
 therapist-0.014
Token
Token ID198
Feature activation+5.83e-03

Pos logit contributions
ishly+0.009
rob+0.008
 trends+0.008
 therapist+0.008
 Hannah+0.008
Neg logit contributions
 eleven-0.007
 unstable-0.007
ror-0.007
 eight-0.006
 done-0.006
Token
Token ID198
Feature activation+2.46e-03

Pos logit contributions
 eleven+0.007
 done+0.007
 unstable+0.006
 taller+0.006
 high+0.006
Neg logit contributions
ishly-0.011
 Hannah-0.010
 trends-0.010
 therapist-0.010
 Pupp-0.010
TokenFel
Token ID42493
Feature activation-3.60e-02

Pos logit contributions
 eleven+0.004
Crash+0.004
 eight+0.004
 unstable+0.004
 done+0.004
Neg logit contributions
ishly-0.003
 Hannah-0.003
 Cindy-0.003
 puppy-0.003
 Pupp-0.003
Tokenicity
Token ID8467
Feature activation-5.44e-03

Pos logit contributions
 Woo+0.052
ishly+0.052
 Hannah+0.049
rob+0.048
 Pupp+0.048
Neg logit contributions
 done-0.050
 minutes-0.050
 eleven-0.050
 unstable-0.050
ror-0.050
Token was
Token ID373
Feature activation-1.61e-03

Pos logit contributions
 Hannah+0.009
rob+0.009
ishly+0.008
 Woo+0.008
 Pupp+0.008
Neg logit contributions
 done-0.007
 eleven-0.007
 unstable-0.007
 leaning-0.007
 taller-0.007
Token so
Token ID523
Feature activation-4.78e-03

Pos logit contributions
ishly+0.003
 Hannah+0.003
 Pupp+0.003
 Holly+0.003
 Cindy+0.003
Neg logit contributions
 eleven-0.002
 unstable-0.002
ror-0.002
Crash-0.002
 Eleven-0.002
Token happy
Token ID3772
Feature activation+9.01e-04

Pos logit contributions
 Hannah+0.009
ishly+0.009
 Pupp+0.009
 Holly+0.008
 Cindy+0.008
Neg logit contributions
 eleven-0.006
 unstable-0.005
ror-0.005
 Eleven-0.005
 twelve-0.005
Token that
Token ID326
Feature activation+1.79e-03

Pos logit contributions
 eleven+0.001
 done+0.001
 leaning+0.001
 taller+0.001
 unstable+0.001
Neg logit contributions
ishly-0.002
 Hannah-0.002
rob-0.002
 Pupp-0.002
 therapist-0.001
Token was
Token ID373
Feature activation+5.18e-03

Pos logit contributions
 done+0.000
 unstable+0.000
 taller+0.000
 eleven+0.000
 leaning+0.000
Neg logit contributions
ishly-0.000
 Hannah-0.000
 Woo-0.000
lor-0.000
 Cindy-0.000
Token.
Token ID13
Feature activation+5.10e-03

Pos logit contributions
 eleven+0.007
 done+0.007
 unstable+0.007
 taller+0.007
 leaning+0.007
Neg logit contributions
ishly-0.009
 Hannah-0.008
 Pupp-0.008
 Holly-0.008
 therapist-0.008
Token
Token ID198
Feature activation+5.67e-03

Pos logit contributions
 eleven+0.009
 unstable+0.009
 done+0.009
 taller+0.008
 leaning+0.008
Neg logit contributions
 Hannah-0.007
ishly-0.006
 Pupp-0.006
 Holly-0.006
 Cindy-0.006
Token
Token ID198
Feature activation+3.08e-03

Pos logit contributions
 eleven+0.008
 unstable+0.008
 done+0.008
 taller+0.008
 leaning+0.008
Neg logit contributions
ishly-0.009
 Hannah-0.009
 Pupp-0.008
 Holly-0.008
 therapist-0.008
TokenThe
Token ID464
Feature activation-1.15e-02

Pos logit contributions
 eleven+0.005
 unstable+0.005
 done+0.004
 taller+0.004
 leaning+0.004
Neg logit contributions
ishly-0.005
 Hannah-0.005
 Pupp-0.004
rob-0.004
 Holly-0.004
Token w
Token ID266
Feature activation-3.59e-02

Pos logit contributions
ishly+0.016
 Hannah+0.016
 Pupp+0.015
 Cindy+0.015
 Woo+0.015
Neg logit contributions
 eleven-0.019
 done-0.019
 unstable-0.018
 taller-0.018
 leaning-0.017
Tokenaffle
Token ID30697
Feature activation-7.43e-03

Pos logit contributions
ishly+0.044
 Hannah+0.041
 Pupp+0.040
 Cindy+0.040
 Woo+0.040
Neg logit contributions
 done-0.066
 unstable-0.064
 taller-0.064
 eleven-0.064
ror-0.061
Token was
Token ID373
Feature activation-1.98e-05

Pos logit contributions
 Hannah+0.011
ishly+0.011
 Pupp+0.011
 Woo+0.011
 Cindy+0.011
Neg logit contributions
 eleven-0.011
 unstable-0.011
 done-0.011
 taller-0.010
 leaning-0.010
Token warm
Token ID5814
Feature activation-4.64e-03

Pos logit contributions
 Hannah+0.000
 Pupp+0.000
 Holly+0.000
ishly+0.000
 Cindy+0.000
Neg logit contributions
 eleven-0.000
 unstable-0.000
Crash-0.000
ror-0.000
cks-0.000
Token and
Token ID290
Feature activation+2.90e-03

Pos logit contributions
 Hannah+0.008
ishly+0.008
 Pupp+0.008
Gr+0.008
rob+0.008
Neg logit contributions
 eleven-0.006
 taller-0.006
 unstable-0.006
 done-0.006
 leaning-0.006
Token smelled
Token ID37609
Feature activation-1.25e-03

Pos logit contributions
 eleven+0.003
 unstable+0.003
 done+0.003
 taller+0.003
 leaning+0.003
Neg logit contributions
ishly-0.005
 Hannah-0.005
rob-0.005
 Pupp-0.005
lor-0.005
Token sleep
Token ID3993
Feature activation-5.22e-03

Pos logit contributions
 Hannah+0.003
ishly+0.003
rob+0.003
 therapist+0.003
 trends+0.002
Neg logit contributions
 done-0.002
 high-0.001
 taller-0.001
 eleven-0.001
 higher-0.001
Token.
Token ID13
Feature activation-4.06e-03

Pos logit contributions
 Pupp+0.010
ishly+0.009
 Hannah+0.009
rob+0.009
 Cindy+0.009
Neg logit contributions
 done-0.006
 eleven-0.006
 high-0.006
 unstable-0.005
 taller-0.005
Token
Token ID220
Feature activation+2.56e-03

Pos logit contributions
ishly+0.008
 trends+0.008
rob+0.008
 therapist+0.007
iv+0.007
Neg logit contributions
 eleven-0.004
 done-0.003
ror-0.003
 eight-0.003
 leaning-0.003
Token
Token ID198
Feature activation+6.40e-03

Pos logit contributions
 eleven+0.004
 unstable+0.004
 done+0.004
 taller+0.004
 leaning+0.004
Neg logit contributions
 Hannah-0.004
ishly-0.004
 Pupp-0.004
 Holly-0.003
 Cindy-0.003
Token
Token ID198
Feature activation+1.16e-03

Pos logit contributions
 eleven+0.008
 done+0.007
 unstable+0.007
 taller+0.007
 leaning+0.007
Neg logit contributions
ishly-0.012
 Hannah-0.011
 Pupp-0.011
 trends-0.011
 therapist-0.010
TokenT
Token ID51
Feature activation-3.55e-02

Pos logit contributions
Crash+0.002
 eleven+0.002
 done+0.002
 unstable+0.002
 eight+0.002
Neg logit contributions
ishly-0.002
 Hannah-0.002
 Pupp-0.002
 Cindy-0.001
 Holly-0.001
Tokenippy
Token ID41214
Feature activation-8.94e-03

Pos logit contributions
ishly+0.049
 Hannah+0.041
rob+0.041
 trends+0.040
 clothing+0.039
Neg logit contributions
 eleven-0.054
ror-0.053
 time-0.052
 eight-0.050
 unstable-0.049
Token and
Token ID290
Feature activation-4.50e-04

Pos logit contributions
 Hannah+0.015
rob+0.015
 Woo+0.015
 Cindy+0.014
ishly+0.014
Neg logit contributions
 done-0.012
 eleven-0.011
 eight-0.011
ror-0.010
 unstable-0.010
Token his
Token ID465
Feature activation-4.39e-03

Pos logit contributions
rob+0.001
lor+0.001
ishly+0.001
 trends+0.001
 therapist+0.001
Neg logit contributions
 eleven-0.001
 eight-0.001
 done-0.001
 high-0.001
 leaning-0.001
Token new
Token ID649
Feature activation-3.64e-03

Pos logit contributions
 Hannah+0.008
ishly+0.008
rob+0.007
 trends+0.007
 therapist+0.007
Neg logit contributions
 eleven-0.006
 done-0.005
 unstable-0.005
 taller-0.005
 high-0.005
Token friend
Token ID1545
Feature activation+1.73e-03

Pos logit contributions
 Hannah+0.006
rob+0.006
ishly+0.006
 Woo+0.006
 Cindy+0.006
Neg logit contributions
 eleven-0.005
 taller-0.004
 done-0.004
 unstable-0.004
 high-0.004
Token,
Token ID11
Feature activation-1.82e-02

Pos logit contributions
 Hannah+0.026
ishly+0.025
 therapist+0.025
 Holly+0.025
rob+0.025
Neg logit contributions
 eleven-0.031
 done-0.029
 unstable-0.029
 leaning-0.029
 taller-0.029
Token a
Token ID257
Feature activation+4.74e-03

Pos logit contributions
lor+0.037
ishly+0.037
 trends+0.036
 therapist+0.036
 Hannah+0.035
Neg logit contributions
 eleven-0.019
 taller-0.016
 unstable-0.016
 done-0.016
 eight-0.016
Token twin
Token ID15203
Feature activation-8.20e-03

Pos logit contributions
 eleven+0.007
 unstable+0.007
 done+0.007
 taller+0.007
 leaning+0.006
Neg logit contributions
ishly-0.007
 Hannah-0.007
 Pupp-0.007
 Holly-0.007
 therapist-0.007
Token brother
Token ID3956
Feature activation-2.75e-02

Pos logit contributions
 trends+0.015
 Hannah+0.015
lor+0.015
rob+0.014
ishly+0.014
Neg logit contributions
 eleven-0.009
 done-0.009
 leaning-0.009
 taller-0.008
 minutes-0.008
Token and
Token ID290
Feature activation-1.19e-03

Pos logit contributions
 trends+0.052
 therapist+0.051
 Hannah+0.051
lor+0.051
 Holly+0.050
Neg logit contributions
 eleven-0.030
 done-0.028
 leaning-0.028
 taller-0.027
 minutes-0.026
Token sister
Token ID6621
Feature activation-3.52e-02

Pos logit contributions
lor+0.002
ishly+0.002
 trends+0.002
 therapist+0.002
rob+0.002
Neg logit contributions
 eleven-0.001
 done-0.001
 unstable-0.001
 taller-0.001
 eight-0.001
Token.
Token ID13
Feature activation+3.03e-03

Pos logit contributions
lor+0.066
 therapist+0.066
rob+0.065
 trends+0.063
 Hannah+0.063
Neg logit contributions
 eleven-0.042
 done-0.040
 leaning-0.036
 minutes-0.036
 taller-0.035
Token They
Token ID1119
Feature activation+4.19e-03

Pos logit contributions
 eleven+0.005
 unstable+0.005
 done+0.005
 taller+0.005
 leaning+0.005
Neg logit contributions
ishly-0.004
 Hannah-0.004
 Pupp-0.004
 therapist-0.004
 Holly-0.004
Token were
Token ID547
Feature activation-5.61e-03

Pos logit contributions
 eleven+0.006
 done+0.006
 unstable+0.006
 taller+0.005
 high+0.005
Neg logit contributions
 Hannah-0.007
lor-0.007
 therapist-0.007
ishly-0.007
rob-0.007
Token intelligent
Token ID12661
Feature activation+5.72e-04

Pos logit contributions
 Pupp+0.009
 Hannah+0.009
ishly+0.009
 Cindy+0.008
 Woo+0.008
Neg logit contributions
 unstable-0.009
Crash-0.008
 eleven-0.008
 minutes-0.008
ror-0.007
Token and
Token ID290
Feature activation-6.38e-03

Pos logit contributions
 eleven+0.001
 unstable+0.001
 done+0.001
 taller+0.001
 leaning+0.001
Neg logit contributions
ishly-0.001
 Hannah-0.001
 Pupp-0.001
 therapist-0.001
 Holly-0.001
Token sorry
Token ID7926
Feature activation-4.18e-03

Pos logit contributions
 eleven+0.009
 unstable+0.009
 done+0.009
 taller+0.009
 leaning+0.008
Neg logit contributions
ishly-0.009
 Hannah-0.009
 Pupp-0.009
 therapist-0.009
 Holly-0.009
Token again
Token ID757
Feature activation-5.09e-03

Pos logit contributions
ishly+0.008
 Hannah+0.008
 Pupp+0.008
 Holly+0.008
 therapist+0.008
Neg logit contributions
 eleven-0.005
 unstable-0.005
 done-0.004
 leaning-0.004
 taller-0.004
Token and
Token ID290
Feature activation+5.24e-03

Pos logit contributions
ishly+0.009
 Hannah+0.009
rob+0.009
 Pupp+0.009
 Cindy+0.009
Neg logit contributions
 done-0.006
 eleven-0.006
 leaning-0.006
 taller-0.006
 unstable-0.006
Token thanked
Token ID26280
Feature activation+1.32e-02

Pos logit contributions
 eleven+0.009
 unstable+0.009
 done+0.008
ror+0.008
 taller+0.008
Neg logit contributions
ishly-0.007
 Hannah-0.007
 Pupp-0.007
 Holly-0.007
 therapist-0.007
Token her
Token ID607
Feature activation+4.70e-04

Pos logit contributions
 eleven+0.019
 unstable+0.017
 taller+0.017
 done+0.017
 leaning+0.017
Neg logit contributions
ishly-0.022
lor-0.021
rob-0.020
 therapist-0.020
 Pupp-0.020
Token sister
Token ID6621
Feature activation-3.52e-02

Pos logit contributions
 eleven+0.001
 unstable+0.001
 done+0.001
 taller+0.001
ror+0.001
Neg logit contributions
ishly-0.001
 Hannah-0.001
 Pupp-0.001
rob-0.001
 Cindy-0.001
Token.
Token ID13
Feature activation-6.66e-03

Pos logit contributions
 Hannah+0.057
ishly+0.057
 Pupp+0.056
rob+0.056
 Cindy+0.055
Neg logit contributions
 eleven-0.048
 done-0.045
 taller-0.044
 unstable-0.044
 twelve-0.042
Token She
Token ID1375
Feature activation-9.39e-05

Pos logit contributions
ishly+0.010
 Hannah+0.009
rob+0.009
 therapist+0.009
 Pupp+0.009
Neg logit contributions
 eleven-0.011
 unstable-0.010
 taller-0.010
 leaning-0.010
 done-0.010
Token went
Token ID1816
Feature activation-1.97e-02

Pos logit contributions
ishly+0.000
 Pupp+0.000
 Hannah+0.000
rob+0.000
 Cindy+0.000
Neg logit contributions
 done-0.000
 climbed-0.000
 eleven-0.000
 taller-0.000
 unstable-0.000
Token to
Token ID284
Feature activation-6.24e-03

Pos logit contributions
ishly+0.041
 Hannah+0.041
 Pupp+0.040
 Holly+0.039
 therapist+0.039
Neg logit contributions
 eleven-0.018
 unstable-0.017
 done-0.017
 taller-0.016
 leaning-0.015
Token her
Token ID607
Feature activation-7.71e-04

Pos logit contributions
ishly+0.010
 Hannah+0.010
 Pupp+0.010
 therapist+0.010
rob+0.010
Neg logit contributions
 eleven-0.008
 unstable-0.008
 done-0.008
 taller-0.008
 leaning-0.008
Token away
Token ID1497
Feature activation-6.64e-03

Pos logit contributions
ishly+0.026
 Hannah+0.026
 Pupp+0.025
 therapist+0.024
 Holly+0.024
Neg logit contributions
 eleven-0.013
 unstable-0.012
 done-0.012
 taller-0.011
 leaning-0.011
Token.
Token ID13
Feature activation-5.24e-03

Pos logit contributions
 Hannah+0.011
ishly+0.011
rob+0.011
 Pupp+0.010
 therapist+0.010
Neg logit contributions
 eleven-0.009
 unstable-0.009
 done-0.009
 taller-0.009
 leaning-0.008
Token
Token ID198
Feature activation+1.15e-03

Pos logit contributions
ishly+0.009
 Hannah+0.008
 therapist+0.008
rob+0.008
 Pupp+0.008
Neg logit contributions
 eleven-0.007
 unstable-0.007
 done-0.007
 taller-0.007
 leaning-0.006
Token
Token ID198
Feature activation-1.04e-03

Pos logit contributions
 eleven+0.002
 unstable+0.002
 done+0.002
 taller+0.002
 leaning+0.002
Neg logit contributions
 Hannah-0.002
ishly-0.002
 Pupp-0.002
 Holly-0.002
 Woo-0.002
TokenThe
Token ID464
Feature activation-6.49e-03

Pos logit contributions
ishly+0.002
 Hannah+0.002
 Pupp+0.002
 Holly+0.002
 Cindy+0.001
Neg logit contributions
 eleven-0.002
 unstable-0.002
 done-0.001
 taller-0.001
Crash-0.001
Token w
Token ID266
Feature activation-3.51e-02

Pos logit contributions
ishly+0.011
 Hannah+0.011
 Pupp+0.010
 Cindy+0.010
 Woo+0.010
Neg logit contributions
 eleven-0.009
 unstable-0.008
 done-0.008
 taller-0.008
 leaning-0.008
Tokenaffle
Token ID30697
Feature activation-1.22e-02

Pos logit contributions
ishly+0.047
 Cindy+0.045
 Hannah+0.044
 Woo+0.043
 Pupp+0.043
Neg logit contributions
 done-0.061
 unstable-0.060
 taller-0.057
 eleven-0.056
ror-0.055
Token breathed
Token ID39545
Feature activation+1.32e-02

Pos logit contributions
 Hannah+0.019
ishly+0.018
 Cindy+0.018
 Woo+0.018
 Pupp+0.018
Neg logit contributions
 done-0.018
 eleven-0.018
 unstable-0.017
 taller-0.016
 leaning-0.016
Token a
Token ID257
Feature activation+8.53e-03

Pos logit contributions
 eleven+0.015
 unstable+0.015
 done+0.014
ror+0.013
 leaning+0.013
Neg logit contributions
 Hannah-0.024
ishly-0.024
 Pupp-0.024
 Woo-0.023
 Holly-0.023
Token sigh
Token ID19680
Feature activation-3.98e-03

Pos logit contributions
 eleven+0.011
 unstable+0.011
 done+0.010
 taller+0.010
 leaning+0.010
Neg logit contributions
ishly-0.015
 Hannah-0.015
 Pupp-0.014
 Holly-0.014
 therapist-0.014
Token of
Token ID286
Feature activation+2.75e-03

Pos logit contributions
 Hannah+0.006
ishly+0.006
 Pupp+0.006
 Holly+0.006
 Cindy+0.006
Neg logit contributions
 eleven-0.006
 done-0.006
 unstable-0.005
 taller-0.005
 leaning-0.005
Token and
Token ID290
Feature activation-2.70e-03

Pos logit contributions
 unstable+0.027
 eleven+0.027
 taller+0.026
 done+0.025
ror+0.025
Neg logit contributions
 Hannah-0.021
ishly-0.020
 Pupp-0.020
 Woo-0.020
 Holly-0.019
Token Ben
Token ID3932
Feature activation+4.39e-03

Pos logit contributions
ishly+0.005
lor+0.005
rob+0.005
 trends+0.005
 therapist+0.005
Neg logit contributions
 eleven-0.003
 done-0.003
 high-0.003
 leaning-0.003
 unstable-0.002
Token are
Token ID389
Feature activation+1.10e-03

Pos logit contributions
 eleven+0.007
 unstable+0.007
 done+0.007
 taller+0.006
 leaning+0.006
Neg logit contributions
ishly-0.006
 Hannah-0.006
 Pupp-0.006
 therapist-0.006
 Holly-0.006
Token brother
Token ID3956
Feature activation-2.56e-02

Pos logit contributions
 unstable+0.002
Crash+0.002
 eleven+0.002
 twelve+0.002
 higher+0.002
Neg logit contributions
 Hannah-0.002
 Woo-0.002
 Holly-0.002
ishly-0.001
 Cindy-0.001
Token and
Token ID290
Feature activation-9.17e-04

Pos logit contributions
lor+0.053
 trends+0.050
ishly+0.050
 therapist+0.050
Gr+0.049
Neg logit contributions
 eleven-0.025
 taller-0.025
 done-0.023
 high-0.023
 leaning-0.023
Token sister
Token ID6621
Feature activation-3.51e-02

Pos logit contributions
lor+0.002
rob+0.002
ishly+0.002
 therapist+0.002
 trends+0.002
Neg logit contributions
 eleven-0.001
 done-0.001
 eight-0.001
 high-0.001
 leaning-0.001
Token.
Token ID13
Feature activation+1.85e-03

Pos logit contributions
lor+0.081
rob+0.077
ishly+0.075
 therapist+0.075
 Dawn+0.074
Neg logit contributions
 done-0.028
 eleven-0.026
 high-0.023
 taller-0.023
ror-0.022
Token They
Token ID1119
Feature activation+1.60e-03

Pos logit contributions
 eleven+0.002
 twelve+0.002
 Eleven+0.002
 high+0.002
 unstable+0.002
Neg logit contributions
ishly-0.003
rob-0.003
lor-0.003
 trends-0.003
 therapist-0.003
Token are
Token ID389
Feature activation+3.24e-03

Pos logit contributions
 done+0.002
 eleven+0.002
 high+0.002
 unstable+0.002
 taller+0.002
Neg logit contributions
lor-0.003
ishly-0.003
 therapist-0.003
 Hannah-0.003
rob-0.003
Token very
Token ID845
Feature activation-6.89e-03

Pos logit contributions
 unstable+0.005
 eleven+0.005
Crash+0.005
 done+0.004
 minutes+0.004
Neg logit contributions
 Hannah-0.005
 Pupp-0.005
ishly-0.005
 Holly-0.005
 Cindy-0.005
Token happy
Token ID3772
Feature activation-8.20e-04

Pos logit contributions
ishly+0.014
 Hannah+0.014
 Pupp+0.014
 therapist+0.013
 Holly+0.013
Neg logit contributions
 eleven-0.007
 unstable-0.006
 done-0.006
 taller-0.006
 leaning-0.006
Token start
Token ID923
Feature activation+1.23e-02

Pos logit contributions
 unstable+0.012
 eleven+0.011
 minutes+0.011
cks+0.011
 twelve+0.011
Neg logit contributions
 trends-0.007
 Hannah-0.007
 c-0.007
 adopt-0.007
ishly-0.007
Token with
Token ID351
Feature activation+4.60e-03

Pos logit contributions
 unstable+0.022
 eleven+0.021
Crash+0.020
cks+0.020
 minutes+0.020
Neg logit contributions
 Hannah-0.013
 Gr-0.012
 trends-0.012
 Woo-0.012
 Pupp-0.012
Token?"
Token ID1701
Feature activation+7.40e-03

Pos logit contributions
 unstable+0.007
 eleven+0.006
 taller+0.006
 done+0.006
 leaning+0.006
Neg logit contributions
 Hannah-0.007
ishly-0.007
 Pupp-0.007
 Holly-0.007
 Cindy-0.007
Token
Token ID198
Feature activation+2.56e-03

Pos logit contributions
 eleven+0.010
 taller+0.009
 leaning+0.009
 done+0.009
 twelve+0.009
Neg logit contributions
ishly-0.012
rob-0.011
lor-0.011
 trends-0.011
 therapist-0.011
Token
Token ID198
Feature activation+6.72e-03

Pos logit contributions
 eleven+0.002
 high+0.002
ror+0.002
 done+0.002
 taller+0.002
Neg logit contributions
ishly-0.005
 trends-0.005
rob-0.005
 therapist-0.005
 Hannah-0.005
TokenT
Token ID51
Feature activation-3.49e-02

Pos logit contributions
 eleven+0.011
 twelve+0.011
 taller+0.010
 unstable+0.010
 eight+0.010
Neg logit contributions
ishly-0.009
rob-0.009
 Hannah-0.008
 Woo-0.008
 trends-0.008
Tokenoya
Token ID23790
Feature activation-3.37e-03

Pos logit contributions
ishly+0.050
 Buster+0.047
 Woo+0.047
rob+0.046
 Hannah+0.045
Neg logit contributions
 eleven-0.052
ror-0.051
 done-0.050
 leaning-0.048
 taller-0.047
Token thought
Token ID1807
Feature activation+1.92e-04

Pos logit contributions
ishly+0.005
 Hannah+0.005
 Pupp+0.004
 Holly+0.004
 therapist+0.004
Neg logit contributions
 eleven-0.005
 unstable-0.005
 done-0.005
 taller-0.005
 leaning-0.005
Token for
Token ID329
Feature activation+3.02e-03

Pos logit contributions
 eleven+0.000
 unstable+0.000
 done+0.000
 taller+0.000
 leaning+0.000
Neg logit contributions
ishly-0.000
 Hannah-0.000
 Pupp-0.000
 therapist-0.000
 Woo-0.000
Token a
Token ID257
Feature activation+1.15e-02

Pos logit contributions
 done+0.002
 eleven+0.002
 taller+0.002
 unstable+0.002
 high+0.002
Neg logit contributions
ishly-0.008
rob-0.007
 therapist-0.007
lor-0.007
 Hannah-0.007
Token second
Token ID1218
Feature activation-1.69e-03

Pos logit contributions
 eleven+0.013
 unstable+0.013
 done+0.012
 taller+0.012
ror+0.012
Neg logit contributions
 Hannah-0.021
ishly-0.021
 Pupp-0.021
 Holly-0.020
 Cindy-0.020
Token their
Token ID511
Feature activation-6.79e-03

Pos logit contributions
rob+0.015
actus+0.015
ishly+0.014
 therapist+0.014
lor+0.014
Neg logit contributions
 eleven-0.009
 eight-0.008
 done-0.008
 figuring-0.008
 adding-0.008
Token friends
Token ID2460
Feature activation-1.32e-02

Pos logit contributions
ishly+0.010
 Hannah+0.010
 Pupp+0.009
 Holly+0.009
 therapist+0.009
Neg logit contributions
 eleven-0.011
 unstable-0.010
 done-0.010
 taller-0.010
 leaning-0.010
Token.
Token ID13
Feature activation-1.26e-02

Pos logit contributions
rob+0.025
ishly+0.024
lor+0.023
 Pupp+0.023
Gr+0.023
Neg logit contributions
 done-0.015
 eleven-0.015
 leaning-0.014
 taller-0.013
 higher-0.013
Token
Token ID198
Feature activation-2.91e-03

Pos logit contributions
ishly+0.021
rob+0.020
 therapist+0.020
 trends+0.019
 Hannah+0.019
Neg logit contributions
 eleven-0.015
 eight-0.014
ror-0.014
 done-0.014
 minutes-0.014
Token
Token ID198
Feature activation-7.79e-03

Pos logit contributions
ishly+0.006
 Hannah+0.005
 trends+0.005
 therapist+0.005
 Pupp+0.005
Neg logit contributions
 done-0.003
 eleven-0.003
 unstable-0.003
 high-0.003
 taller-0.003
TokenFel
Token ID42493
Feature activation-3.49e-02

Pos logit contributions
ishly+0.014
 Hannah+0.012
 Cindy+0.012
 Pupp+0.012
 Natalie+0.012
Neg logit contributions
 eleven-0.010
Crash-0.010
 done-0.010
 high-0.009
 minutes-0.009
Tokenicity
Token ID8467
Feature activation-9.49e-03

Pos logit contributions
ishly+0.048
 Woo+0.047
 Hannah+0.045
 Pupp+0.044
 Cindy+0.043
Neg logit contributions
 eleven-0.054
 done-0.054
 unstable-0.053
ror-0.052
 minutes-0.052
Token decided
Token ID3066
Feature activation-5.09e-03

Pos logit contributions
ishly+0.016
 Hannah+0.016
rob+0.016
 Woo+0.015
 Pupp+0.015
Neg logit contributions
 done-0.012
 eleven-0.012
 unstable-0.011
 leaning-0.011
 taller-0.011
Token to
Token ID284
Feature activation-7.89e-03

Pos logit contributions
rob+0.013
ishly+0.013
 Hannah+0.012
 Pupp+0.012
 therapist+0.012
Neg logit contributions
 done-0.003
 eleven-0.003
 high-0.002
 leaning-0.002
 unstable-0.002
Token keep
Token ID1394
Feature activation+4.12e-03

Pos logit contributions
rob+0.015
 Hannah+0.015
ishly+0.015
 therapist+0.015
 Holly+0.014
Neg logit contributions
 done-0.009
 eleven-0.009
 leaning-0.008
 unstable-0.008
 high-0.008
Token the
Token ID262
Feature activation+4.26e-03

Pos logit contributions
 unstable+0.005
 eleven+0.005
Crash+0.005
ror+0.005
 twelve+0.005
Neg logit contributions
 Hannah-0.007
ishly-0.007
 Pupp-0.007
 Cindy-0.007
 Buster-0.007
Token and
Token ID290
Feature activation-1.53e-03

Pos logit contributions
 eleven+0.018
 unstable+0.017
 done+0.017
 taller+0.016
 leaning+0.016
Neg logit contributions
ishly-0.019
 Hannah-0.019
 Pupp-0.018
 Holly-0.018
 therapist-0.018
Token Lily
Token ID20037
Feature activation+7.62e-03

Pos logit contributions
ishly+0.003
 trends+0.003
lor+0.003
rob+0.003
 therapist+0.003
Neg logit contributions
 eleven-0.002
 done-0.002
 high-0.001
 leaning-0.001
 eight-0.001
Token were
Token ID547
Feature activation-6.55e-03

Pos logit contributions
 eleven+0.008
 done+0.008
 unstable+0.008
 leaning+0.007
 high+0.007
Neg logit contributions
ishly-0.015
 Hannah-0.014
lor-0.014
 Holly-0.014
 therapist-0.014
Token brother
Token ID3956
Feature activation-2.71e-02

Pos logit contributions
 Hannah+0.010
ishly+0.009
 Holly+0.009
 Woo+0.009
 Cindy+0.009
Neg logit contributions
 unstable-0.011
 eleven-0.010
Crash-0.010
 twelve-0.010
 taller-0.009
Token and
Token ID290
Feature activation-1.38e-03

Pos logit contributions
lor+0.055
 trends+0.052
 therapist+0.052
rob+0.051
Gr+0.051
Neg logit contributions
 taller-0.027
 eleven-0.027
 done-0.026
 leaning-0.024
 minutes-0.024
Token sister
Token ID6621
Feature activation-3.44e-02

Pos logit contributions
lor+0.003
rob+0.002
 therapist+0.002
 trends+0.002
actus+0.002
Neg logit contributions
 eleven-0.002
 done-0.002
 leaning-0.002
 eight-0.002
 high-0.002
Token.
Token ID13
Feature activation+5.82e-03

Pos logit contributions
lor+0.075
rob+0.074
 therapist+0.070
 trends+0.068
 Dawn+0.068
Neg logit contributions
 done-0.032
 eleven-0.030
 minutes-0.027
 taller-0.026
ror-0.025
Token They
Token ID1119
Feature activation+3.47e-03

Pos logit contributions
 eleven+0.009
 unstable+0.009
 done+0.009
 taller+0.008
 twelve+0.008
Neg logit contributions
ishly-0.009
rob-0.008
 therapist-0.008
 Hannah-0.008
lor-0.008
Token liked
Token ID8288
Feature activation+1.04e-02

Pos logit contributions
 done+0.005
 eleven+0.004
 high+0.004
 unstable+0.004
 measured+0.004
Neg logit contributions
lor-0.007
rob-0.006
 therapist-0.006
 Hannah-0.006
ishly-0.006
Token to
Token ID284
Feature activation-9.28e-03

Pos logit contributions
 unstable+0.015
ror+0.015
 eleven+0.015
cks+0.014
Crash+0.014
Neg logit contributions
 Pupp-0.016
 Hannah-0.016
 Woo-0.016
 puppy-0.015
 c-0.015
Token play
Token ID711
Feature activation+4.40e-03

Pos logit contributions
lor+0.019
 therapist+0.019
ishly+0.019
 Hannah+0.018
rob+0.018
Neg logit contributions
 eleven-0.010
 done-0.009
 high-0.009
 taller-0.009
 unstable-0.008
Token.
Token ID13
Feature activation+6.93e-04

Feature not active on prev token;
no predictions were affected.
Token
Token ID220
Feature activation+3.62e-03

Pos logit contributions
 eleven+0.001
 unstable+0.001
 done+0.001
 leaning+0.001
ror+0.001
Neg logit contributions
 Hannah-0.001
 Pupp-0.001
ishly-0.001
 Cindy-0.001
 Holly-0.001
Token
Token ID198
Feature activation+1.99e-03

Pos logit contributions
 eleven+0.006
 unstable+0.006
 done+0.006
 taller+0.006
 leaning+0.006
Neg logit contributions
 Hannah-0.005
ishly-0.005
 Pupp-0.005
 Holly-0.004
 Woo-0.004
Token
Token ID198
Feature activation+2.25e-03

Pos logit contributions
 eleven+0.003
 unstable+0.003
 done+0.003
 taller+0.003
 leaning+0.003
Neg logit contributions
 Hannah-0.003
ishly-0.003
 Pupp-0.003
 Holly-0.003
 Cindy-0.003
TokenThe
Token ID464
Feature activation-8.85e-03

Pos logit contributions
 eleven+0.004
 unstable+0.004
 done+0.003
 taller+0.003
 leaning+0.003
Neg logit contributions
ishly-0.003
 Hannah-0.003
 Pupp-0.003
 Holly-0.003
 therapist-0.003
Token w
Token ID266
Feature activation-3.44e-02

Pos logit contributions
ishly+0.013
 Hannah+0.013
 Pupp+0.012
 Cindy+0.012
 Woo+0.012
Neg logit contributions
 eleven-0.014
 unstable-0.014
 done-0.013
 taller-0.013
 leaning-0.012
Tokenaffle
Token ID30697
Feature activation-5.62e-03

Pos logit contributions
ishly+0.045
 Cindy+0.042
 Hannah+0.042
 Woo+0.041
 Pupp+0.041
Neg logit contributions
 done-0.061
 unstable-0.060
 eleven-0.057
 taller-0.057
ror-0.055
Token was
Token ID373
Feature activation-7.73e-03

Pos logit contributions
ishly+0.008
 Hannah+0.008
 Pupp+0.008
 Holly+0.008
 Cindy+0.008
Neg logit contributions
 eleven-0.009
 unstable-0.009
 done-0.008
 taller-0.008
 leaning-0.008
Token so
Token ID523
Feature activation-1.24e-02

Pos logit contributions
 Pupp+0.012
ishly+0.012
 Hannah+0.011
 Woo+0.011
 Holly+0.011
Neg logit contributions
 eleven-0.011
ror-0.011
 unstable-0.010
Crash-0.010
 Eleven-0.010
Token scared
Token ID12008
Feature activation-1.19e-02

Pos logit contributions
 Pupp+0.023
 Hannah+0.022
ishly+0.022
 Holly+0.021
 Woo+0.021
Neg logit contributions
 eleven-0.015
ror-0.013
 unstable-0.013
 leaning-0.012
 twelve-0.012
Token he
Token ID339
Feature activation+3.78e-03

Pos logit contributions
ishly+0.021
 therapist+0.020
 Hannah+0.020
rob+0.020
lor+0.020
Neg logit contributions
 eleven-0.015
 unstable-0.014
 taller-0.014
 done-0.014
 leaning-0.014
Token help
Token ID1037
Feature activation+2.15e-05

Pos logit contributions
 done+0.000
 eleven+0.000
 taller+0.000
 high+0.000
 twelve+0.000
Neg logit contributions
rob-0.001
ishly-0.001
 Hannah-0.001
 therapist-0.001
lor-0.001
Token me
Token ID502
Feature activation+2.49e-03

Pos logit contributions
 eleven+0.000
 done+0.000
 twelve+0.000
 taller+0.000
ror+0.000
Neg logit contributions
ishly-0.000
rob-0.000
 therapist-0.000
 Pupp-0.000
 ranc-0.000
Token?"
Token ID1701
Feature activation-3.50e-03

Pos logit contributions
 done+0.003
 eleven+0.003
 taller+0.003
 higher+0.003
 leaning+0.003
Neg logit contributions
ishly-0.004
 Hannah-0.004
rob-0.004
 Pupp-0.004
 Cindy-0.004
Token
Token ID198
Feature activation+1.52e-03

Pos logit contributions
ishly+0.006
rob+0.006
 Pupp+0.006
 trends+0.006
 adoption+0.006
Neg logit contributions
 eleven-0.004
 taller-0.004
 done-0.004
 high-0.004
 higher-0.003
Token
Token ID198
Feature activation+2.19e-03

Pos logit contributions
 done+0.001
 eleven+0.001
 high+0.001
 eight+0.001
Crash+0.001
Neg logit contributions
ishly-0.003
 trends-0.003
 Pupp-0.003
 Cindy-0.003
 ranc-0.003
TokenT
Token ID51
Feature activation-3.43e-02

Pos logit contributions
Crash+0.004
 eight+0.004
 eleven+0.004
 high+0.004
 done+0.004
Neg logit contributions
ishly-0.003
 Pupp-0.002
 Cindy-0.002
 ranc-0.002
 adoption-0.002
Tokenoby
Token ID26730
Feature activation-9.80e-03

Pos logit contributions
ishly+0.055
 Pupp+0.052
 Cindy+0.050
 adopt+0.048
 Woo+0.047
Neg logit contributions
 done-0.047
ror-0.046
 eleven-0.046
 unstable-0.045
Crash-0.044
Token was
Token ID373
Feature activation-5.79e-04

Pos logit contributions
rob+0.016
 Woo+0.015
 Hannah+0.015
ishly+0.015
 Pupp+0.015
Neg logit contributions
 done-0.014
 eleven-0.013
 twelve-0.012
 unstable-0.012
 taller-0.012
Token excited
Token ID6568
Feature activation+3.73e-03

Pos logit contributions
 Hannah+0.001
ishly+0.001
 Pupp+0.001
 Holly+0.001
 Cindy+0.001
Neg logit contributions
 eleven-0.001
 unstable-0.001
Crash-0.001
ror-0.001
 done-0.001
Token to
Token ID284
Feature activation+9.15e-03

Pos logit contributions
 eleven+0.006
 unstable+0.005
 done+0.005
 taller+0.005
 leaning+0.005
Neg logit contributions
ishly-0.006
 Hannah-0.006
 Pupp-0.006
 Holly-0.005
 therapist-0.005
Token serve
Token ID4691
Feature activation-3.18e-03

Pos logit contributions
 done+0.011
 eleven+0.010
 taller+0.010
 leaning+0.010
 twelve+0.010
Neg logit contributions
rob-0.018
 Woo-0.017
 Hannah-0.017
ishly-0.017
lor-0.016
Token and
Token ID290
Feature activation-2.91e-03

Pos logit contributions
 eleven+0.019
 unstable+0.019
 taller+0.018
 done+0.018
 leaning+0.017
Neg logit contributions
 Hannah-0.019
ishly-0.019
 Pupp-0.018
 Holly-0.017
 Woo-0.017
Token Lily
Token ID20037
Feature activation+8.03e-03

Pos logit contributions
ishly+0.006
rob+0.006
lor+0.005
 trends+0.005
 therapist+0.005
Neg logit contributions
 done-0.003
 eleven-0.003
 high-0.003
 eight-0.003
 unstable-0.003
Token were
Token ID547
Feature activation-3.82e-03

Pos logit contributions
 eleven+0.011
 done+0.011
 unstable+0.011
 leaning+0.010
 taller+0.010
Neg logit contributions
ishly-0.013
 Hannah-0.013
 Pupp-0.012
 therapist-0.012
 Holly-0.012
Token brother
Token ID3956
Feature activation-2.59e-02

Pos logit contributions
 Hannah+0.005
ishly+0.005
 Woo+0.005
 Holly+0.005
 Pupp+0.005
Neg logit contributions
 unstable-0.007
 eleven-0.006
 twelve-0.006
Crash-0.006
 taller-0.006
Token and
Token ID290
Feature activation+1.16e-03

Pos logit contributions
lor+0.052
 trends+0.050
 therapist+0.050
ishly+0.050
Gr+0.050
Neg logit contributions
 eleven-0.026
 taller-0.026
 done-0.025
 high-0.023
 leaning-0.023
Token sister
Token ID6621
Feature activation-3.39e-02

Pos logit contributions
 eleven+0.001
 done+0.001
 taller+0.001
 eight+0.001
 leaning+0.001
Neg logit contributions
lor-0.002
rob-0.002
ishly-0.002
 therapist-0.002
 trends-0.002
Token.
Token ID13
Feature activation+7.55e-03

Pos logit contributions
lor+0.072
rob+0.070
 therapist+0.068
ishly+0.067
 Dawn+0.067
Neg logit contributions
 done-0.032
 eleven-0.031
 minutes-0.027
 taller-0.026
 high-0.026
Token They
Token ID1119
Feature activation+1.98e-03

Pos logit contributions
 eleven+0.012
 unstable+0.012
 done+0.011
 taller+0.011
 leaning+0.011
Neg logit contributions
ishly-0.011
 Hannah-0.010
 Pupp-0.010
 therapist-0.010
rob-0.010
Token lived
Token ID5615
Feature activation-3.49e-03

Pos logit contributions
 done+0.003
 eleven+0.002
 high+0.002
 unstable+0.002
 measured+0.002
Neg logit contributions
lor-0.004
rob-0.004
 therapist-0.004
 Hannah-0.003
ishly-0.003
Token in
Token ID287
Feature activation+1.79e-03

Pos logit contributions
 Holly+0.005
 Hannah+0.005
 Woo+0.005
 Gr+0.005
 Pupp+0.005
Neg logit contributions
 unstable-0.005
 eleven-0.005
 impossible-0.005
 twelve-0.005
 thirteen-0.005
Token a
Token ID257
Feature activation-1.59e-02

Pos logit contributions
 eleven+0.002
 unstable+0.002
 done+0.002
 taller+0.002
ror+0.002
Neg logit contributions
 Hannah-0.003
ishly-0.003
 Pupp-0.003
 Holly-0.003
 Cindy-0.003
Token and
Token ID290
Feature activation-1.17e-03

Pos logit contributions
 eleven+0.014
 unstable+0.014
 done+0.013
 taller+0.013
 leaning+0.013
Neg logit contributions
ishly-0.016
 Hannah-0.016
 Pupp-0.015
 Holly-0.015
 therapist-0.015
Token Ben
Token ID3932
Feature activation+7.20e-03

Pos logit contributions
ishly+0.002
lor+0.002
rob+0.002
 Hannah+0.002
 therapist+0.002
Neg logit contributions
 eleven-0.001
 done-0.001
 unstable-0.001
 leaning-0.001
 high-0.001
Token are
Token ID389
Feature activation+1.76e-03

Pos logit contributions
 eleven+0.011
 done+0.011
 unstable+0.010
 taller+0.010
 leaning+0.010
Neg logit contributions
ishly-0.011
 Hannah-0.011
 therapist-0.010
 Pupp-0.010
lor-0.010
Token brother
Token ID3956
Feature activation-2.72e-02

Pos logit contributions
 unstable+0.003
 eleven+0.003
Crash+0.003
 twelve+0.003
 taller+0.003
Neg logit contributions
 Hannah-0.003
 Holly-0.002
 Woo-0.002
ishly-0.002
 Cindy-0.002
Token and
Token ID290
Feature activation+7.92e-04

Pos logit contributions
lor+0.055
 therapist+0.052
 trends+0.052
ishly+0.052
Gr+0.052
Neg logit contributions
 eleven-0.028
 taller-0.028
 done-0.026
 high-0.025
 minutes-0.025
Token sister
Token ID6621
Feature activation-3.38e-02

Pos logit contributions
 eleven+0.001
 done+0.001
 eight+0.001
 high+0.001
 taller+0.001
Neg logit contributions
lor-0.002
rob-0.001
ishly-0.001
 therapist-0.001
 trends-0.001
Token.
Token ID13
Feature activation+4.15e-03

Pos logit contributions
lor+0.074
rob+0.071
 therapist+0.069
ishly+0.069
 Pupp+0.067
Neg logit contributions
 done-0.031
 eleven-0.031
 minutes-0.026
 taller-0.026
 high-0.026
Token They
Token ID1119
Feature activation+5.30e-03

Pos logit contributions
 eleven+0.006
 unstable+0.005
 Eleven+0.005
 done+0.005
 taller+0.005
Neg logit contributions
lor-0.007
ishly-0.007
rob-0.007
 therapist-0.007
 Pupp-0.006
Token like
Token ID588
Feature activation+1.14e-02

Pos logit contributions
 done+0.006
 eleven+0.006
 high+0.005
 unstable+0.005
 taller+0.005
Neg logit contributions
lor-0.010
ishly-0.010
 Hannah-0.010
 therapist-0.010
rob-0.010
Token to
Token ID284
Feature activation-1.13e-02

Pos logit contributions
 unstable+0.013
 eleven+0.013
 done+0.012
ror+0.012
 taller+0.012
Neg logit contributions
 Hannah-0.021
 Pupp-0.021
 Woo-0.020
 Holly-0.020
 Cindy-0.020
Token play
Token ID711
Feature activation+1.36e-03

Pos logit contributions
lor+0.024
ishly+0.023
 therapist+0.023
 Hannah+0.023
rob+0.022
Neg logit contributions
 done-0.012
 eleven-0.011
 high-0.011
 taller-0.010
 unstable-0.010
Token in
Token ID287
Feature activation+1.22e-03

Pos logit contributions
 eleven+0.003
 unstable+0.003
 done+0.003
 taller+0.003
 leaning+0.002
Neg logit contributions
 Hannah-0.004
ishly-0.004
 Pupp-0.003
rob-0.003
 therapist-0.003
Token the
Token ID262
Feature activation-3.30e-04

Pos logit contributions
 eleven+0.001
 unstable+0.001
 done+0.001
 taller+0.001
 leaning+0.001
Neg logit contributions
ishly-0.002
 Hannah-0.002
 Pupp-0.002
 Holly-0.002
 Cindy-0.002
Token alley
Token ID23978
Feature activation+2.47e-03

Pos logit contributions
ishly+0.001
 Hannah+0.001
 Pupp+0.001
 Holly+0.001
 therapist+0.001
Neg logit contributions
 eleven-0.000
 unstable-0.000
 done-0.000
 taller-0.000
 leaning-0.000
Token.
Token ID13
Feature activation-8.52e-03

Pos logit contributions
 eleven+0.004
 unstable+0.004
 done+0.004
 taller+0.003
 leaning+0.003
Neg logit contributions
ishly-0.004
 Hannah-0.004
 Pupp-0.004
 Holly-0.003
 therapist-0.003
Token The
Token ID383
Feature activation-4.07e-03

Pos logit contributions
ishly+0.013
 Hannah+0.013
 Pupp+0.013
 therapist+0.012
 Holly+0.012
Neg logit contributions
 eleven-0.012
 unstable-0.012
 done-0.012
 taller-0.011
 leaning-0.011
Token t
Token ID256
Feature activation-3.38e-02

Pos logit contributions
ishly+0.005
 Hannah+0.005
 Pupp+0.005
 Holly+0.005
 Cindy+0.005
Neg logit contributions
 eleven-0.007
 unstable-0.007
 done-0.007
 taller-0.006
 leaning-0.006
Tokeneddy
Token ID21874
Feature activation-3.49e-03

Pos logit contributions
ishly+0.046
 Hannah+0.046
 Cindy+0.044
 Holly+0.044
 Woo+0.043
Neg logit contributions
 unstable-0.052
 eleven-0.051
ror-0.051
 leaning-0.049
Crash-0.049
Token bear
Token ID6842
Feature activation-4.61e-03

Pos logit contributions
 Hannah+0.005
 trends+0.005
 Cindy+0.005
ishly+0.005
 Woo+0.005
Neg logit contributions
 done-0.005
 eleven-0.005
 leaning-0.005
 unstable-0.005
 taller-0.005
Token turned
Token ID2900
Feature activation+1.80e-03

Pos logit contributions
 Hannah+0.007
ishly+0.007
 Woo+0.007
 Pupp+0.007
 Cindy+0.007
Neg logit contributions
 done-0.007
 eleven-0.007
 unstable-0.007
 leaning-0.006
 taller-0.006
Token to
Token ID284
Feature activation+2.31e-03

Pos logit contributions
 eleven+0.002
 unstable+0.002
 done+0.002
 taller+0.001
ror+0.001
Neg logit contributions
ishly-0.004
 Hannah-0.004
 Pupp-0.004
 Holly-0.004
 Cindy-0.004
Token Lily
Token ID20037
Feature activation-1.98e-03

Pos logit contributions
 eleven+0.002
 done+0.002
 taller+0.002
 unstable+0.002
 high+0.002
Neg logit contributions
ishly-0.005
 therapist-0.005
 trends-0.004
lor-0.004
rob-0.004
Token,
Token ID11
Feature activation-1.07e-02

Pos logit contributions
 Hannah+0.016
ishly+0.016
 Pupp+0.015
 Holly+0.015
 Woo+0.015
Neg logit contributions
 eleven-0.019
 unstable-0.019
 taller-0.018
 done-0.018
ror-0.017
Token dad
Token ID9955
Feature activation-1.11e-02

Pos logit contributions
ishly+0.021
lor+0.021
 trends+0.021
rob+0.020
 therapist+0.020
Neg logit contributions
 leaning-0.012
 eleven-0.011
 done-0.011
 taller-0.011
 high-0.010
Token,
Token ID11
Feature activation-6.32e-03

Pos logit contributions
ishly+0.018
 Hannah+0.017
 therapist+0.017
 Holly+0.017
 Pupp+0.017
Neg logit contributions
 eleven-0.016
 done-0.015
 unstable-0.015
 leaning-0.015
 taller-0.014
Token and
Token ID290
Feature activation-2.04e-03

Pos logit contributions
ishly+0.011
 Hannah+0.010
 therapist+0.010
 trends+0.010
rob+0.010
Neg logit contributions
 eleven-0.008
 done-0.008
 leaning-0.008
 unstable-0.008
 taller-0.008
Token little
Token ID1310
Feature activation+1.78e-03

Pos logit contributions
ishly+0.003
 Hannah+0.003
lor+0.003
 Holly+0.003
 Pupp+0.003
Neg logit contributions
 eleven-0.003
 done-0.003
 unstable-0.003
 taller-0.003
 leaning-0.003
Token sister
Token ID6621
Feature activation-3.34e-02

Pos logit contributions
 eleven+0.002
 unstable+0.002
 done+0.002
 taller+0.002
 leaning+0.002
Neg logit contributions
ishly-0.003
 Hannah-0.003
 therapist-0.003
 Pupp-0.003
 Holly-0.003
Token.
Token ID13
Feature activation-5.42e-03

Pos logit contributions
lor+0.065
 therapist+0.065
rob+0.064
ishly+0.062
 Hazel+0.061
Neg logit contributions
 done-0.037
 eleven-0.037
 leaning-0.032
ror-0.031
 minutes-0.030
Token They
Token ID1119
Feature activation-1.49e-03

Pos logit contributions
ishly+0.009
 Hannah+0.008
 Pupp+0.008
 Holly+0.008
 therapist+0.008
Neg logit contributions
 eleven-0.008
 unstable-0.008
 done-0.007
 taller-0.007
 leaning-0.007
Token were
Token ID547
Feature activation-5.07e-03

Pos logit contributions
ishly+0.002
 Hannah+0.002
 therapist+0.002
 Pupp+0.002
 Holly+0.002
Neg logit contributions
 eleven-0.002
 done-0.002
 unstable-0.002
 taller-0.002
 leaning-0.002
Token a
Token ID257
Feature activation-3.47e-03

Pos logit contributions
 Pupp+0.008
 Cindy+0.007
 Woo+0.007
 Hannah+0.007
 Gr+0.007
Neg logit contributions
 unstable-0.009
Crash-0.008
 eleven-0.008
 minutes-0.008
isel-0.008
Token happy
Token ID3772
Feature activation+1.55e-03

Pos logit contributions
 Hannah+0.006
 Pupp+0.006
ishly+0.006
 Woo+0.005
 Cindy+0.005
Neg logit contributions
 eleven-0.005
 unstable-0.005
 done-0.004
ror-0.004
 taller-0.004
Token that
Token ID326
Feature activation+1.16e-02

Pos logit contributions
 eleven+0.003
 unstable+0.003
 done+0.003
 taller+0.003
 leaning+0.003
Neg logit contributions
ishly-0.004
 Hannah-0.004
 Pupp-0.004
 Holly-0.004
 therapist-0.003
Token day
Token ID1110
Feature activation-6.40e-03

Pos logit contributions
 unstable+0.013
 eleven+0.013
 taller+0.011
ror+0.011
cks+0.011
Neg logit contributions
 Hannah-0.023
 Pupp-0.022
ishly-0.022
 therapist-0.022
 Woo-0.022
Token on
Token ID319
Feature activation+5.55e-03

Pos logit contributions
ishly+0.012
 Hannah+0.012
 Pupp+0.012
 Woo+0.012
 Buster+0.011
Neg logit contributions
 unstable-0.007
 eleven-0.007
 taller-0.006
ror-0.006
 twelve-0.006
Token,
Token ID11
Feature activation+1.74e-03

Pos logit contributions
 eleven+0.007
 done+0.007
 unstable+0.007
 leaning+0.007
 high+0.007
Neg logit contributions
ishly-0.009
 Hannah-0.009
 trends-0.009
 Pupp-0.009
 therapist-0.009
Token the
Token ID262
Feature activation+2.66e-03

Pos logit contributions
 eleven+0.003
 unstable+0.003
 done+0.003
 taller+0.003
ror+0.003
Neg logit contributions
ishly-0.002
 Hannah-0.002
 Pupp-0.002
 Holly-0.002
 Cindy-0.002
Token pilot
Token ID8022
Feature activation+2.11e-02

Pos logit contributions
 eleven+0.006
 unstable+0.006
cks+0.006
 done+0.006
 Eleven+0.005
Neg logit contributions
 Pupp-0.002
 puppy-0.002
 Hannah-0.002
 therapist-0.002
 Cindy-0.002
Token made
Token ID925
Feature activation+9.50e-03

Pos logit contributions
 done+0.028
 eleven+0.028
 unstable+0.025
 leaning+0.025
 taller+0.025
Neg logit contributions
 Hannah-0.036
ishly-0.035
 Holly-0.034
 Pupp-0.034
 Cindy-0.033
Token sure
Token ID1654
Feature activation-1.02e-02

Pos logit contributions
 eleven+0.011
 unstable+0.011
ror+0.010
 leaning+0.010
Crash+0.009
Neg logit contributions
 Hannah-0.018
 Pupp-0.017
 Woo-0.017
 Holly-0.017
 Cindy-0.017
Token to
Token ID284
Feature activation+4.85e-03

Pos logit contributions
ishly+0.019
 Hannah+0.019
 Pupp+0.018
 Holly+0.018
 therapist+0.018
Neg logit contributions
 eleven-0.012
 unstable-0.011
 done-0.011
 taller-0.011
 leaning-0.010
Token dress
Token ID6576
Feature activation-8.03e-03

Pos logit contributions
 done+0.006
 eleven+0.005
 high+0.005
 leaning+0.005
ror+0.005
Neg logit contributions
ishly-0.009
rob-0.009
 therapist-0.009
 Hannah-0.009
mun-0.009
Token slowly
Token ID6364
Feature activation-8.52e-04

Pos logit contributions
ishly+0.019
 Pupp+0.018
rob+0.018
 therapist+0.018
 Hazel+0.018
Neg logit contributions
 higher-0.005
 done-0.005
 eleven-0.005
 taller-0.005
 high-0.005
Token floor
Token ID4314
Feature activation-1.51e-03

Pos logit contributions
 eleven+0.006
 unstable+0.005
 done+0.005
 taller+0.005
 twelve+0.005
Neg logit contributions
ishly-0.008
 Hannah-0.008
 Pupp-0.008
 therapist-0.008
 Woo-0.008
Token.
Token ID13
Feature activation+3.76e-03

Pos logit contributions
ishly+0.002
 Hannah+0.002
 Pupp+0.002
 Holly+0.002
 therapist+0.002
Neg logit contributions
 eleven-0.002
 unstable-0.002
 done-0.002
 taller-0.002
 leaning-0.002
Token
Token ID198
Feature activation-1.45e-04

Pos logit contributions
 eleven+0.006
 unstable+0.006
 done+0.006
 leaning+0.006
 taller+0.006
Neg logit contributions
 Hannah-0.005
 Holly-0.005
ishly-0.005
 Woo-0.005
 Pupp-0.005
Token
Token ID198
Feature activation+1.03e-04

Pos logit contributions
 Hannah+0.000
ishly+0.000
 Pupp+0.000
 Holly+0.000
 Woo+0.000
Neg logit contributions
 eleven-0.000
 unstable-0.000
 done-0.000
 taller-0.000
 leaning-0.000
Token"
Token ID1
Feature activation-3.64e-03

Pos logit contributions
 eleven+0.000
 unstable+0.000
 done+0.000
 taller+0.000
 twelve+0.000
Neg logit contributions
ishly-0.000
 Hannah-0.000
 Pupp-0.000
 Cindy-0.000
 therapist-0.000
TokenLook
Token ID8567
Feature activation+2.78e-02

Pos logit contributions
 Hannah+0.005
ishly+0.005
 Pupp+0.005
 Holly+0.005
 Woo+0.005
Neg logit contributions
 unstable-0.006
 eleven-0.005
 done-0.005
 taller-0.005
 leaning-0.005
Token,
Token ID11
Feature activation+1.13e-02

Pos logit contributions
 unstable+0.052
 leaning+0.046
 eleven+0.046
 taller+0.045
Crash+0.044
Neg logit contributions
 Hannah-0.036
 Holly-0.033
ishly-0.033
 Cindy-0.032
 Pupp-0.031
Token I
Token ID314
Feature activation+9.50e-03

Pos logit contributions
 eleven+0.015
 unstable+0.014
 done+0.014
 taller+0.014
 twelve+0.013
Neg logit contributions
ishly-0.020
 Hannah-0.019
rob-0.018
 Pupp-0.018
 therapist-0.018
Token have
Token ID423
Feature activation+6.91e-03

Pos logit contributions
 unstable+0.013
 eleven+0.013
 taller+0.013
 leaning+0.013
 higher+0.012
Neg logit contributions
 Hannah-0.015
 Holly-0.014
 Pupp-0.013
 Woo-0.013
 trends-0.013
Token a
Token ID257
Feature activation+9.70e-03

Pos logit contributions
 eleven+0.008
 unstable+0.008
 done+0.008
 taller+0.008
 leaning+0.007
Neg logit contributions
 Hannah-0.012
ishly-0.012
 Pupp-0.012
 Holly-0.012
 Cindy-0.012
Token lot
Token ID1256
Feature activation+3.15e-03

Pos logit contributions
 eleven+0.011
 unstable+0.011
 done+0.010
 taller+0.010
 leaning+0.010
Neg logit contributions
ishly-0.018
 Hannah-0.018
 Pupp-0.017
 Holly-0.017
 therapist-0.017
Token
Token ID198
Feature activation+1.26e-03

Pos logit contributions
 unstable+0.014
 done+0.014
 eleven+0.014
 leaning+0.014
 taller+0.014
Neg logit contributions
 Hannah-0.009
 Pupp-0.009
 Holly-0.009
 Hazel-0.009
 Woo-0.009
Token
Token ID198
Feature activation+1.01e-03

Pos logit contributions
 eleven+0.002
 unstable+0.002
 done+0.002
 taller+0.002
 leaning+0.002
Neg logit contributions
 Hannah-0.002
ishly-0.002
 Pupp-0.002
 Holly-0.002
 Woo-0.002
Token"
Token ID1
Feature activation-5.36e-03

Pos logit contributions
 eleven+0.001
 unstable+0.001
 done+0.001
 taller+0.001
 twelve+0.001
Neg logit contributions
ishly-0.002
 Hannah-0.002
 Pupp-0.002
 Cindy-0.001
 Holly-0.001
TokenWow
Token ID22017
Feature activation+1.08e-02

Pos logit contributions
ishly+0.010
 Hannah+0.009
 Pupp+0.009
 Holly+0.009
 Cindy+0.009
Neg logit contributions
 eleven-0.007
 unstable-0.007
 done-0.006
 taller-0.006
 twelve-0.006
Token,
Token ID11
Feature activation-3.94e-04

Pos logit contributions
 unstable+0.022
 eleven+0.020
 taller+0.020
 leaning+0.020
 done+0.019
Neg logit contributions
 Hannah-0.012
ishly-0.011
 Holly-0.011
 Pupp-0.011
 Hazel-0.011
Token look
Token ID804
Feature activation+2.67e-02

Pos logit contributions
ishly+0.001
 Hannah+0.001
 Pupp+0.001
 therapist+0.001
 Holly+0.001
Neg logit contributions
 eleven-0.001
 unstable-0.001
 done-0.001
 taller-0.000
 twelve-0.000
Token at
Token ID379
Feature activation+5.43e-04

Pos logit contributions
 unstable+0.035
 leaning+0.032
 eleven+0.031
 nailed+0.029
Crash+0.029
Neg logit contributions
 Hannah-0.044
 Cindy-0.042
 Holly-0.042
 c-0.042
 Pupp-0.041
Token all
Token ID477
Feature activation-3.24e-03

Pos logit contributions
 eleven+0.001
 done+0.001
 unstable+0.001
 taller+0.001
 twelve+0.000
Neg logit contributions
ishly-0.001
 therapist-0.001
 Hannah-0.001
 trends-0.001
Gr-0.001
Token the
Token ID262
Feature activation-2.98e-03

Pos logit contributions
 Hannah+0.006
ishly+0.006
 Pupp+0.006
 Holly+0.006
 Cindy+0.006
Neg logit contributions
 eleven-0.004
 unstable-0.004
 done-0.003
 taller-0.003
ror-0.003
Token fish
Token ID5916
Feature activation+4.25e-03

Pos logit contributions
 Hannah+0.004
 Pupp+0.004
 Holly+0.004
ishly+0.004
 trends+0.004
Neg logit contributions
 eleven-0.005
 unstable-0.004
ror-0.004
 done-0.004
 twelve-0.004
Token!"
Token ID2474
Feature activation-9.99e-03

Pos logit contributions
 unstable+0.007
 eleven+0.007
ror+0.006
 done+0.006
 taller+0.006
Neg logit contributions
ishly-0.006
 Hannah-0.006
 Pupp-0.005
 Holly-0.005
 therapist-0.005
Token mom
Token ID1995
Feature activation+1.14e-02

Pos logit contributions
 eleven+0.016
 unstable+0.016
 done+0.015
 taller+0.015
 leaning+0.014
Neg logit contributions
ishly-0.014
 Hannah-0.014
 Pupp-0.013
 Holly-0.013
 Cindy-0.013
Tokenmy
Token ID1820
Feature activation+9.71e-03

Pos logit contributions
 unstable+0.019
 eleven+0.018
 done+0.017
 taller+0.017
 leaning+0.017
Neg logit contributions
ishly-0.016
 Hannah-0.015
 Pupp-0.015
 Cindy-0.014
 Holly-0.014
Token,
Token ID11
Feature activation+9.74e-03

Pos logit contributions
 unstable+0.018
 eleven+0.017
 taller+0.016
 done+0.016
Crash+0.016
Neg logit contributions
ishly-0.012
 Hannah-0.011
 Pupp-0.011
 Cindy-0.011
 Holly-0.010
Token I
Token ID314
Feature activation-7.94e-03

Pos logit contributions
 eleven+0.015
 unstable+0.014
 done+0.014
 taller+0.013
 leaning+0.013
Neg logit contributions
 Hannah-0.015
ishly-0.015
 Pupp-0.014
 Holly-0.014
 Cindy-0.014
Token will
Token ID481
Feature activation+1.61e-03

Pos logit contributions
ishly+0.013
 Hannah+0.013
 Pupp+0.013
 Holly+0.013
 therapist+0.012
Neg logit contributions
 eleven-0.011
 unstable-0.010
 done-0.010
 taller-0.010
ror-0.010
Token be
Token ID307
Feature activation+2.08e-02

Pos logit contributions
 eleven+0.002
 unstable+0.002
 done+0.002
 taller+0.002
 leaning+0.002
Neg logit contributions
ishly-0.003
 Hannah-0.003
 Pupp-0.002
 Holly-0.002
 Cindy-0.002
Token careful
Token ID8161
Feature activation+3.09e-03

Pos logit contributions
 ago+0.029
sing+0.029
 unstable+0.029
Crash+0.028
ses+0.028
Neg logit contributions
 Hannah-0.031
 Woo-0.030
 c-0.029
 bl-0.029
 Holly-0.027
Token."
Token ID526
Feature activation-3.25e-03

Pos logit contributions
 unstable+0.005
 eleven+0.005
Crash+0.005
ror+0.005
 done+0.005
Neg logit contributions
ishly-0.004
 Hannah-0.004
 Pupp-0.004
 therapist-0.004
 Woo-0.003
Token
Token ID198
Feature activation+5.04e-03

Pos logit contributions
ishly+0.005
 Hannah+0.005
 Pupp+0.005
 therapist+0.005
 Holly+0.005
Neg logit contributions
 eleven-0.004
 unstable-0.004
 done-0.004
 taller-0.004
 leaning-0.004
Token
Token ID198
Feature activation+4.99e-03

Pos logit contributions
 eleven+0.008
 unstable+0.007
 done+0.007
 taller+0.007
 leaning+0.007
Neg logit contributions
ishly-0.008
 Hannah-0.008
 Pupp-0.007
 Holly-0.007
 therapist-0.007
TokenTim
Token ID14967
Feature activation-1.07e-03

Pos logit contributions
 eleven+0.008
 unstable+0.008
 done+0.008
 taller+0.007
 leaning+0.007
Neg logit contributions
ishly-0.007
 Hannah-0.007
 Pupp-0.007
 Holly-0.007
 Cindy-0.007
Token ran
Token ID4966
Feature activation-1.45e-02

Pos logit contributions
 done+0.008
 eleven+0.008
 unstable+0.007
 taller+0.007
 leaning+0.007
Neg logit contributions
rob-0.014
ishly-0.013
 Hannah-0.013
 trends-0.013
lor-0.013
Token away
Token ID1497
Feature activation-6.08e-03

Pos logit contributions
 Hannah+0.028
ishly+0.028
 Pupp+0.027
 Holly+0.027
 Woo+0.027
Neg logit contributions
 eleven-0.016
 unstable-0.016
 done-0.015
ror-0.014
 taller-0.014
Token as
Token ID355
Feature activation+7.63e-03

Pos logit contributions
ishly+0.010
 Hannah+0.010
 Pupp+0.009
rob+0.009
 therapist+0.009
Neg logit contributions
 eleven-0.008
 unstable-0.008
 done-0.008
 taller-0.008
 leaning-0.008
Token fast
Token ID3049
Feature activation+1.31e-02

Pos logit contributions
 eleven+0.010
 done+0.010
 unstable+0.009
ror+0.009
 taller+0.009
Neg logit contributions
 Hannah-0.013
 Pupp-0.013
 Holly-0.013
ishly-0.012
 Cindy-0.012
Token as
Token ID355
Feature activation+8.77e-03

Pos logit contributions
 unstable+0.016
 done+0.016
 eleven+0.016
ror+0.015
 taller+0.015
Neg logit contributions
 Hannah-0.023
ishly-0.023
 Woo-0.023
 Pupp-0.022
 Holly-0.022
Token she
Token ID673
Feature activation+2.52e-02

Pos logit contributions
 eleven+0.017
 unstable+0.016
 done+0.016
ror+0.015
 taller+0.015
Neg logit contributions
 Hannah-0.010
ishly-0.009
 Pupp-0.009
 Holly-0.009
 Cindy-0.009
Token could
Token ID714
Feature activation-5.75e-03

Pos logit contributions
 eleven+0.035
 unstable+0.034
 done+0.032
 taller+0.032
 leaning+0.031
Neg logit contributions
ishly-0.041
 Hannah-0.041
 Pupp-0.040
 Holly-0.039
 therapist-0.039
Token.
Token ID13
Feature activation+6.89e-03

Pos logit contributions
 Hannah+0.009
ishly+0.009
 Pupp+0.009
 therapist+0.008
rob+0.008
Neg logit contributions
 eleven-0.008
 unstable-0.008
 done-0.008
 taller-0.008
 leaning-0.008
Token
Token ID198
Feature activation+1.58e-03

Pos logit contributions
 eleven+0.011
 unstable+0.010
 done+0.010
 taller+0.010
 leaning+0.010
Neg logit contributions
ishly-0.010
 Hannah-0.010
 Pupp-0.010
 Holly-0.009
 therapist-0.009
Token
Token ID198
Feature activation+4.13e-03

Pos logit contributions
 eleven+0.002
 unstable+0.002
 done+0.002
 taller+0.002
 leaning+0.002
Neg logit contributions
ishly-0.003
 Hannah-0.003
 Pupp-0.002
 therapist-0.002
 trends-0.002
TokenBack
Token ID7282
Feature activation-4.05e-03

Pos logit contributions
 eleven+0.006
 unstable+0.005
Crash+0.005
 done+0.005
 taller+0.005
Neg logit contributions
ishly-0.007
rob-0.007
 Hannah-0.007
 Pupp-0.006
 trends-0.006
Token But
Token ID887
Feature activation-5.74e-03

Pos logit contributions
 Pupp+0.007
 Hannah+0.007
ishly+0.007
 Holly+0.007
 Cindy+0.007
Neg logit contributions
 done-0.015
 unstable-0.014
 eleven-0.014
 leaning-0.014
ror-0.013
Token he
Token ID339
Feature activation-4.61e-03

Pos logit contributions
 Hannah+0.007
ishly+0.007
 Pupp+0.007
 Holly+0.007
 Cindy+0.006
Neg logit contributions
 unstable-0.010
 eleven-0.010
ror-0.009
 done-0.009
 taller-0.009
Token does
Token ID857
Feature activation+1.42e-02

Pos logit contributions
 Hannah+0.009
ishly+0.009
 Pupp+0.009
 Cindy+0.009
 Woo+0.009
Neg logit contributions
 eleven-0.005
 unstable-0.005
 done-0.005
 taller-0.004
 leaning-0.004
Token not
Token ID407
Feature activation+9.70e-03

Pos logit contributions
 eleven+0.011
 unstable+0.010
 done+0.009
 taller+0.009
 leaning+0.008
Neg logit contributions
ishly-0.033
 Hannah-0.032
 Pupp-0.032
 Holly-0.031
 therapist-0.031
Token want
Token ID765
Feature activation-1.66e-04

Pos logit contributions
 eleven+0.014
 unstable+0.014
 done+0.014
 taller+0.013
ror+0.013
Neg logit contributions
ishly-0.015
 Hannah-0.015
 Pupp-0.014
 Holly-0.014
 therapist-0.014
Token to
Token ID284
Feature activation+1.35e-02

Pos logit contributions
 Hannah+0.000
ishly+0.000
 Pupp+0.000
 Holly+0.000
 Cindy+0.000
Neg logit contributions
 eleven-0.000
 unstable-0.000
 done-0.000
 taller-0.000
 leaning-0.000
Token look
Token ID804
Feature activation-1.48e-03

Pos logit contributions
 eleven+0.016
 done+0.015
 taller+0.014
 high+0.014
 unstable+0.014
Neg logit contributions
ishly-0.026
rob-0.026
lor-0.026
 therapist-0.026
 Hannah-0.025
Token like
Token ID588
Feature activation-4.42e-03

Pos logit contributions
ishly+0.003
 Hannah+0.003
 Pupp+0.003
 Holly+0.003
 Cindy+0.003
Neg logit contributions
 eleven-0.001
 unstable-0.001
 done-0.001
 taller-0.001
 leaning-0.001
Token a
Token ID257
Feature activation-5.00e-03

Pos logit contributions
ishly+0.008
 Hannah+0.008
rob+0.008
 Pupp+0.008
 therapist+0.008
Neg logit contributions
 eleven-0.005
 unstable-0.005
 done-0.005
 taller-0.005
 leaning-0.005
Token baby
Token ID5156
Feature activation+2.46e-03

Pos logit contributions
ishly+0.008
 Hannah+0.007
 Pupp+0.007
 Holly+0.007
 Hazel+0.007
Neg logit contributions
 eleven-0.008
 unstable-0.008
 taller-0.007
 done-0.007
 leaning-0.007
Token in
Token ID287
Feature activation-6.45e-03

Pos logit contributions
 eleven+0.004
 unstable+0.003
 taller+0.003
 done+0.003
 leaning+0.003
Neg logit contributions
 Hannah-0.004
ishly-0.004
 Pupp-0.004
 Holly-0.004
 Woo-0.004
Token it
Token ID340
Feature activation-1.99e-03

Pos logit contributions
 eleven+0.005
 unstable+0.005
 done+0.005
ror+0.005
 leaning+0.005
Neg logit contributions
 Hannah-0.006
ishly-0.005
 Pupp-0.005
 Holly-0.005
 Woo-0.005
Token was
Token ID373
Feature activation-5.48e-04

Pos logit contributions
 Hannah+0.003
ishly+0.003
 Woo+0.002
 Pupp+0.002
 therapist+0.002
Neg logit contributions
 eleven-0.003
 unstable-0.003
 done-0.003
 taller-0.003
 leaning-0.003
Token great
Token ID1049
Feature activation+5.00e-04

Pos logit contributions
ishly+0.001
 Hannah+0.001
 Pupp+0.001
 Holly+0.001
 Cindy+0.001
Neg logit contributions
 eleven-0.001
 unstable-0.001
 done-0.001
 taller-0.001
 leaning-0.001
Token!
Token ID0
Feature activation+8.43e-03

Pos logit contributions
 eleven+0.001
 unstable+0.001
 done+0.001
 taller+0.001
 leaning+0.001
Neg logit contributions
ishly-0.001
 Hannah-0.001
 Pupp-0.001
 Holly-0.001
 Cindy-0.001
Token
Token ID220
Feature activation+7.46e-03

Pos logit contributions
 eleven+0.016
 unstable+0.015
 done+0.015
 taller+0.014
 leaning+0.014
Neg logit contributions
 Hannah-0.010
ishly-0.010
 Pupp-0.010
 Holly-0.009
 Cindy-0.009
Token
Token ID198
Feature activation+8.51e-03

Pos logit contributions
 eleven+0.012
 unstable+0.012
 done+0.012
 taller+0.011
 leaning+0.011
Neg logit contributions
 Hannah-0.010
ishly-0.010
 Pupp-0.010
 Holly-0.009
 Cindy-0.009
Token
Token ID198
Feature activation+8.69e-03

Pos logit contributions
 eleven+0.014
 unstable+0.014
 done+0.013
 taller+0.013
 leaning+0.012
Neg logit contributions
 Hannah-0.012
ishly-0.012
 Pupp-0.011
 Holly-0.011
 Woo-0.011
TokenHe
Token ID1544
Feature activation+4.64e-03

Pos logit contributions
 eleven+0.014
 unstable+0.014
 done+0.013
 taller+0.013
 leaning+0.013
Neg logit contributions
ishly-0.012
 Hannah-0.012
 Pupp-0.012
 Holly-0.012
 therapist-0.012
Token had
Token ID550
Feature activation-1.80e-03

Pos logit contributions
 eleven+0.007
 unstable+0.007
 done+0.006
 taller+0.006
 leaning+0.006
Neg logit contributions
ishly-0.007
 Hannah-0.007
 Pupp-0.007
 Holly-0.007
 therapist-0.007
Token so
Token ID523
Feature activation+2.50e-03

Pos logit contributions
 Hannah+0.003
 Woo+0.003
 Pupp+0.003
 Holly+0.003
 Cindy+0.003
Neg logit contributions
 unstable-0.003
 eleven-0.002
ror-0.002
 leaning-0.002
cks-0.002
Token much
Token ID881
Feature activation+4.41e-03

Pos logit contributions
 eleven+0.003
 unstable+0.003
 done+0.003
ror+0.003
 leaning+0.003
Neg logit contributions
 Hannah-0.004
ishly-0.004
 Pupp-0.004
 Holly-0.004
 Woo-0.004
Token piece
Token ID3704
Feature activation-4.93e-03

Pos logit contributions
 eleven+0.004
 unstable+0.004
 taller+0.004
 done+0.004
 leaning+0.004
Neg logit contributions
ishly-0.006
 Hannah-0.006
rob-0.006
 Pupp-0.006
 Holly-0.006
Token of
Token ID286
Feature activation-5.82e-03

Pos logit contributions
ishly+0.007
 Hannah+0.007
 Pupp+0.006
 Holly+0.006
 Cindy+0.006
Neg logit contributions
 eleven-0.008
 unstable-0.008
 done-0.008
 taller-0.008
 leaning-0.007
Token gum
Token ID27142
Feature activation-1.91e-03

Pos logit contributions
ishly+0.011
 Hannah+0.011
 Pupp+0.010
lor+0.010
 Holly+0.010
Neg logit contributions
 eleven-0.007
 unstable-0.007
 taller-0.006
 done-0.006
 leaning-0.006
Token!
Token ID0
Feature activation-2.34e-04

Pos logit contributions
ishly+0.003
 Hannah+0.003
 Pupp+0.003
 Holly+0.003
 Cindy+0.003
Neg logit contributions
 eleven-0.003
 unstable-0.003
 done-0.003
 leaning-0.003
 taller-0.003
Token Tim
Token ID5045
Feature activation+2.73e-03

Pos logit contributions
ishly+0.000
 therapist+0.000
 Hannah+0.000
rob+0.000
 trends+0.000
Neg logit contributions
 eleven-0.000
 unstable-0.000
 taller-0.000
 done-0.000
 leaning-0.000
Tokenmy
Token ID1820
Feature activation+6.46e-03

Pos logit contributions
 eleven+0.005
 unstable+0.005
 taller+0.004
 twelve+0.004
ror+0.004
Neg logit contributions
ishly-0.004
 Hannah-0.004
 Pupp-0.003
 Holly-0.003
 Hazel-0.003
Token loved
Token ID6151
Feature activation+2.03e-03

Pos logit contributions
 eleven+0.012
 unstable+0.011
 taller+0.011
 done+0.011
ror+0.011
Neg logit contributions
ishly-0.008
 Hannah-0.008
 Pupp-0.008
 Holly-0.007
 therapist-0.007
Token gum
Token ID27142
Feature activation-5.76e-03

Pos logit contributions
 eleven+0.003
ror+0.003
 unstable+0.003
 done+0.003
Crash+0.003
Neg logit contributions
 Hannah-0.003
 Woo-0.003
 Pupp-0.003
 Cindy-0.003
 Holly-0.003
Token,
Token ID11
Feature activation+6.08e-03

Pos logit contributions
ishly+0.010
 Hannah+0.010
Gr+0.010
 Pupp+0.010
 Hanna+0.009
Neg logit contributions
 done-0.007
 taller-0.007
 eleven-0.007
 unstable-0.007
 high-0.007
Token so
Token ID523
Feature activation+3.66e-04

Pos logit contributions
 eleven+0.009
 unstable+0.009
 done+0.008
 taller+0.008
 leaning+0.008
Neg logit contributions
ishly-0.010
 Hannah-0.010
 Pupp-0.009
 Holly-0.009
 therapist-0.009
Token he
Token ID339
Feature activation+1.10e-02

Pos logit contributions
 eleven+0.001
 unstable+0.001
 done+0.001
 leaning+0.001
 taller+0.001
Neg logit contributions
ishly-0.001
 Hannah-0.001
 Holly-0.000
 therapist-0.000
 Pupp-0.000
Token me
Token ID502
Feature activation+3.79e-03

Pos logit contributions
 eleven+0.008
 done+0.007
 taller+0.007
 high+0.007
ror+0.007
Neg logit contributions
ishly-0.016
rob-0.015
 therapist-0.015
lor-0.015
 trends-0.014
Token reach
Token ID3151
Feature activation+1.22e-02

Pos logit contributions
 eleven+0.005
 unstable+0.005
 done+0.005
 taller+0.005
 leaning+0.004
Neg logit contributions
ishly-0.006
 Hannah-0.006
 Pupp-0.006
 therapist-0.006
 Holly-0.006
Token the
Token ID262
Feature activation+8.65e-03

Pos logit contributions
 eleven+0.014
 unstable+0.014
 done+0.013
 taller+0.013
 leaning+0.012
Neg logit contributions
ishly-0.023
 Hannah-0.023
 Pupp-0.022
 Holly-0.021
 therapist-0.021
Token h
Token ID289
Feature activation-8.97e-03

Pos logit contributions
 eleven+0.012
 unstable+0.012
 done+0.012
cks+0.011
ror+0.011
Neg logit contributions
 Hannah-0.013
ishly-0.013
 Pupp-0.013
 therapist-0.012
 puppy-0.012
Tokenanger
Token ID2564
Feature activation-2.89e-03

Pos logit contributions
ishly+0.011
 Pupp+0.011
 Hannah+0.011
 Cindy+0.010
 Woo+0.010
Neg logit contributions
 eleven-0.017
ror-0.016
 done-0.016
 unstable-0.016
 taller-0.016
Token?"
Token ID1701
Feature activation+3.58e-03

Pos logit contributions
 Hannah+0.004
ishly+0.004
 Pupp+0.004
 Holly+0.004
 Cindy+0.004
Neg logit contributions
 eleven-0.004
 done-0.004
 unstable-0.004
 taller-0.004
 leaning-0.004
Token
Token ID220
Feature activation+2.98e-03

Pos logit contributions
 eleven+0.004
 leaning+0.004
 taller+0.004
 done+0.004
 high+0.003
Neg logit contributions
rob-0.006
 trends-0.006
lor-0.006
ishly-0.006
 therapist-0.006
Token
Token ID198
Feature activation+5.51e-03

Pos logit contributions
 eleven+0.005
 unstable+0.005
 done+0.005
 taller+0.004
 leaning+0.004
Neg logit contributions
 Hannah-0.004
ishly-0.004
 Pupp-0.004
 Holly-0.004
 Cindy-0.004
Token
Token ID198
Feature activation+6.51e-03

Pos logit contributions
 eleven+0.006
 high+0.005
 done+0.005
ror+0.005
 unstable+0.005
Neg logit contributions
ishly-0.011
 trends-0.010
rob-0.010
 therapist-0.010
 Hannah-0.010
TokenHer
Token ID9360
Feature activation+1.15e-04

Pos logit contributions
 eleven+0.010
Crash+0.010
 eight+0.009
 high+0.009
 unstable+0.009
Neg logit contributions
ishly-0.009
rob-0.009
 Hannah-0.008
 Pupp-0.008
 trends-0.008
Token brother
Token ID3956
Feature activation-2.01e-02

Pos logit contributions
 eleven+0.000
 unstable+0.000
 done+0.000
 taller+0.000
 leaning+0.000
Neg logit contributions
ishly-0.000
 Hannah-0.000
 Pupp-0.000
 Holly-0.000
 therapist-0.000
Token.
Token ID13
Feature activation+1.14e-03

Pos logit contributions
 eleven+0.008
 done+0.008
 taller+0.008
 leaning+0.007
 unstable+0.007
Neg logit contributions
ishly-0.012
rob-0.011
 Hannah-0.011
 Pupp-0.011
 Holly-0.011
Token My
Token ID2011
Feature activation-6.30e-03

Pos logit contributions
 eleven+0.002
 unstable+0.001
 taller+0.001
ror+0.001
 twelve+0.001
Neg logit contributions
ishly-0.002
 Hannah-0.002
rob-0.002
 therapist-0.002
 Holly-0.002
Token mom
Token ID1995
Feature activation+2.31e-03

Pos logit contributions
ishly+0.010
 Hannah+0.010
 Pupp+0.009
 Holly+0.009
 therapist+0.009
Neg logit contributions
 eleven-0.009
 unstable-0.009
 done-0.009
 taller-0.008
 leaning-0.008
Tokenmy
Token ID1820
Feature activation+4.02e-03

Pos logit contributions
 eleven+0.003
 done+0.003
 unstable+0.003
 taller+0.003
 leaning+0.003
Neg logit contributions
ishly-0.004
 Hannah-0.004
 Pupp-0.004
 Holly-0.004
 therapist-0.003
Token isn
Token ID2125
Feature activation-2.78e-03

Pos logit contributions
 eleven+0.005
 done+0.005
 unstable+0.005
 taller+0.005
 leaning+0.005
Neg logit contributions
ishly-0.007
 Hannah-0.007
 Pupp-0.006
 Holly-0.006
 Woo-0.006
Token't
Token ID470
Feature activation+1.24e-02

Pos logit contributions
ishly+0.005
 trends+0.005
 Hannah+0.005
 therapist+0.005
rob+0.005
Neg logit contributions
 eleven-0.003
 leaning-0.003
 done-0.003
 unstable-0.003
 taller-0.003
Token feeling
Token ID4203
Feature activation+7.47e-03

Pos logit contributions
 eleven+0.015
 unstable+0.014
ror+0.013
Crash+0.013
 Eleven+0.012
Neg logit contributions
 Hannah-0.023
 Pupp-0.022
 Cindy-0.022
 Woo-0.022
 Hazel-0.021
Token well
Token ID880
Feature activation-8.53e-03

Pos logit contributions
 eleven+0.009
 unstable+0.008
 done+0.008
ror+0.007
 taller+0.007
Neg logit contributions
 Hannah-0.014
ishly-0.014
 Pupp-0.013
 Holly-0.013
 Cindy-0.013
Token and
Token ID290
Feature activation-7.55e-03

Pos logit contributions
ishly+0.015
 Hannah+0.015
 Pupp+0.015
rob+0.014
 therapist+0.014
Neg logit contributions
 eleven-0.011
 done-0.011
 unstable-0.010
 taller-0.010
 leaning-0.009
Token I
Token ID314
Feature activation-1.52e-03

Pos logit contributions
ishly+0.012
 Hannah+0.012
 Pupp+0.012
 Holly+0.012
 therapist+0.012
Neg logit contributions
 eleven-0.010
 unstable-0.010
 done-0.010
 taller-0.010
 leaning-0.009
Token want
Token ID765
Feature activation+6.11e-03

Pos logit contributions
ishly+0.002
 Hannah+0.002
rob+0.002
 Holly+0.002
 Cindy+0.002
Neg logit contributions
 done-0.002
 unstable-0.002
 eleven-0.002
 taller-0.002
 high-0.002
Token
Token ID198
Feature activation+1.79e-03

Pos logit contributions
 eleven+0.002
 unstable+0.001
 taller+0.001
ror+0.001
 done+0.001
Neg logit contributions
ishly-0.002
rob-0.002
 therapist-0.002
 Hannah-0.002
 trends-0.002
Token
Token ID198
Feature activation+6.93e-04

Pos logit contributions
 eleven+0.002
 unstable+0.002
 done+0.002
 taller+0.002
 high+0.002
Neg logit contributions
ishly-0.003
 Hannah-0.003
 Pupp-0.003
 therapist-0.003
rob-0.003
TokenThe
Token ID464
Feature activation-6.70e-03

Pos logit contributions
 eleven+0.001
Crash+0.001
 unstable+0.001
 done+0.001
 eight+0.001
Neg logit contributions
ishly-0.001
rob-0.001
 Hannah-0.001
 Pupp-0.001
 therapist-0.001
Token friends
Token ID2460
Feature activation+2.33e-03

Pos logit contributions
ishly+0.011
 Hannah+0.010
rob+0.010
 Pupp+0.010
 Woo+0.010
Neg logit contributions
 eleven-0.010
 unstable-0.009
 done-0.009
 taller-0.009
 leaning-0.009
Token were
Token ID547
Feature activation-7.43e-03

Pos logit contributions
 done+0.003
 eleven+0.003
 unstable+0.003
 leaning+0.003
 taller+0.003
Neg logit contributions
rob-0.004
 Hannah-0.004
 Woo-0.004
 therapist-0.004
ishly-0.004
Token curious
Token ID11040
Feature activation-9.21e-04

Pos logit contributions
ishly+0.013
 Hannah+0.012
 Pupp+0.012
 Cindy+0.012
 Holly+0.011
Neg logit contributions
 unstable-0.010
 eleven-0.010
Crash-0.010
ror-0.009
 Eleven-0.009
Token,
Token ID11
Feature activation+5.23e-03

Pos logit contributions
ishly+0.001
 Hannah+0.001
 Pupp+0.001
 Holly+0.001
 Cindy+0.001
Neg logit contributions
 eleven-0.001
 unstable-0.001
 done-0.001
 taller-0.001
ror-0.001
Token so
Token ID523
Feature activation-1.14e-02

Pos logit contributions
 eleven+0.006
 unstable+0.006
 done+0.006
 taller+0.006
 leaning+0.005
Neg logit contributions
ishly-0.010
 Hannah-0.009
rob-0.009
 therapist-0.009
 Pupp-0.009
Token they
Token ID484
Feature activation+2.30e-03

Pos logit contributions
rob+0.019
ishly+0.019
lor+0.018
 Hannah+0.018
 therapist+0.018
Neg logit contributions
 eleven-0.015
 done-0.015
 unstable-0.015
 leaning-0.014
 taller-0.014
Token opened
Token ID4721
Feature activation-2.22e-03

Pos logit contributions
 eleven+0.003
 unstable+0.003
 done+0.003
 taller+0.003
 leaning+0.003
Neg logit contributions
ishly-0.004
 Hannah-0.004
 Pupp-0.003
 Holly-0.003
 Cindy-0.003
Token the
Token ID262
Feature activation-4.14e-03

Pos logit contributions
 Hannah+0.004
ishly+0.004
 Pupp+0.004
 Cindy+0.004
 Holly+0.004
Neg logit contributions
 eleven-0.003
 unstable-0.003
 done-0.003
 taller-0.003
ror-0.003
Tokenmy
Token ID1820
Feature activation-2.15e-04

Pos logit contributions
ishly+0.008
 Hannah+0.008
 Pupp+0.008
 therapist+0.008
 Holly+0.008
Neg logit contributions
 eleven-0.006
 done-0.006
 unstable-0.006
 taller-0.005
 leaning-0.005
Token told
Token ID1297
Feature activation-9.37e-03

Pos logit contributions
rob+0.000
lor+0.000
 trends+0.000
 fluids+0.000
 Hannah+0.000
Neg logit contributions
 done-0.000
 leaning-0.000
 eleven-0.000
 taller-0.000
 climbed-0.000
Token her
Token ID607
Feature activation-6.96e-03

Pos logit contributions
rob+0.016
lor+0.015
ishly+0.015
 therapist+0.015
 Pupp+0.014
Neg logit contributions
 eleven-0.013
 done-0.012
 unstable-0.012
 taller-0.012
 high-0.011
Token to
Token ID284
Feature activation-9.90e-04

Pos logit contributions
rob+0.015
ishly+0.013
 Hannah+0.013
 therapist+0.013
Gr+0.013
Neg logit contributions
 done-0.007
 eleven-0.007
 taller-0.007
 high-0.006
 unstable-0.006
Token be
Token ID307
Feature activation+8.14e-03

Pos logit contributions
rob+0.002
 therapist+0.002
lor+0.002
Gr+0.002
 trends+0.002
Neg logit contributions
 done-0.001
 high-0.001
 eleven-0.001
 leaning-0.001
 wind-0.001
Token careful
Token ID8161
Feature activation-2.04e-03

Pos logit contributions
 unstable+0.008
 eleven+0.008
ror+0.007
 twelve+0.007
 done+0.006
Neg logit contributions
 Hannah-0.017
 Woo-0.016
 Pupp-0.016
 Holly-0.016
 Cindy-0.016
Token because
Token ID780
Feature activation-1.28e-03

Pos logit contributions
ishly+0.003
 Hannah+0.003
 Pupp+0.003
 Holly+0.003
 therapist+0.003
Neg logit contributions
 eleven-0.003
 unstable-0.003
 done-0.003
 taller-0.003
ror-0.003
Token it
Token ID340
Feature activation-1.61e-02

Pos logit contributions
ishly+0.003
lor+0.003
rob+0.003
u+0.003
Gr+0.002
Neg logit contributions
 leaning-0.001
 high-0.001
 eleven-0.001
 unstable-0.001
 taller-0.001
Token was
Token ID373
Feature activation+2.26e-03

Pos logit contributions
lor+0.028
 Hannah+0.027
 trends+0.027
ishly+0.027
 Cindy+0.026
Neg logit contributions
 done-0.021
 unstable-0.020
 leaning-0.019
 taller-0.018
 high-0.018
Token a
Token ID257
Feature activation-5.63e-03

Pos logit contributions
 eleven+0.002
 unstable+0.002
 done+0.002
 taller+0.002
 leaning+0.002
Neg logit contributions
ishly-0.005
 Hannah-0.004
rob-0.004
 therapist-0.004
 Pupp-0.004
Token cold
Token ID4692
Feature activation-5.51e-03

Pos logit contributions
ishly+0.011
rob+0.011
lor+0.010
iv+0.010
mun+0.010
Neg logit contributions
 high-0.007
 unstable-0.006
 time-0.006
 wind-0.006
 done-0.006
Token to
Token ID284
Feature activation-1.21e-02

Pos logit contributions
ishly+0.043
rob+0.043
 therapist+0.042
lor+0.041
 Pupp+0.041
Neg logit contributions
 eleven-0.013
 done-0.013
 high-0.011
 higher-0.011
 taller-0.010
Token the
Token ID262
Feature activation-1.05e-02

Pos logit contributions
ishly+0.028
lor+0.028
rob+0.027
 therapist+0.027
 trends+0.026
Neg logit contributions
 done-0.008
 eleven-0.008
 high-0.008
 taller-0.007
 twelve-0.007
Token park
Token ID3952
Feature activation-3.26e-03

Pos logit contributions
ishly+0.018
rob+0.017
 Hannah+0.017
 Pupp+0.016
lor+0.016
Neg logit contributions
 eleven-0.014
 unstable-0.013
 taller-0.013
 high-0.013
 done-0.013
Token.
Token ID13
Feature activation-2.24e-04

Pos logit contributions
ishly+0.006
 Hannah+0.006
rob+0.006
 Hazel+0.006
 Pupp+0.006
Neg logit contributions
 eleven-0.004
 done-0.004
 high-0.003
 taller-0.003
 higher-0.003
Token They
Token ID1119
Feature activation-3.03e-03

Pos logit contributions
ishly+0.000
rob+0.000
 therapist+0.000
 Hannah+0.000
 trends+0.000
Neg logit contributions
 eleven-0.000
 unstable-0.000
 taller-0.000
 done-0.000
ror-0.000
Token had
Token ID550
Feature activation-1.94e-03

Pos logit contributions
ishly+0.006
lor+0.006
rob+0.006
 therapist+0.006
 Hannah+0.006
Neg logit contributions
 done-0.003
 eleven-0.003
 high-0.003
 unstable-0.003
 leaning-0.003
Token a
Token ID257
Feature activation+1.93e-03

Pos logit contributions
 Hannah+0.003
ishly+0.003
 Pupp+0.003
 Woo+0.003
 Holly+0.003
Neg logit contributions
 unstable-0.003
 eleven-0.003
ror-0.003
 taller-0.002
 leaning-0.002
Token great
Token ID1049
Feature activation+1.08e-02

Pos logit contributions
 eleven+0.003
 unstable+0.002
 done+0.002
 taller+0.002
 leaning+0.002
Neg logit contributions
ishly-0.003
 Hannah-0.003
 Pupp-0.003
 Holly-0.003
 Cindy-0.003
Token time
Token ID640
Feature activation-9.26e-03

Pos logit contributions
 eleven+0.007
 unstable+0.007
 taller+0.006
ror+0.006
 done+0.005
Neg logit contributions
 Hannah-0.026
ishly-0.025
 Pupp-0.025
 Cindy-0.025
 Holly-0.025
Token playing
Token ID2712
Feature activation+8.47e-04

Pos logit contributions
ishly+0.016
 Hannah+0.016
 Pupp+0.016
 Holly+0.015
 Woo+0.015
Neg logit contributions
 unstable-0.012
 eleven-0.012
 done-0.011
 taller-0.011
ror-0.010
Token on
Token ID319
Feature activation+7.01e-04

Pos logit contributions
 eleven+0.001
 done+0.001
 high+0.001
 higher+0.001
 taller+0.001
Neg logit contributions
ishly-0.002
 Hannah-0.002
 therapist-0.002
 Pupp-0.002
 Hazel-0.002
Token water
Token ID1660
Feature activation+4.67e-03

Pos logit contributions
 eleven+0.002
 unstable+0.002
 done+0.002
 taller+0.002
 leaning+0.002
Neg logit contributions
ishly-0.002
rob-0.002
 Hannah-0.002
 Woo-0.002
lor-0.002
Token.
Token ID13
Feature activation-9.32e-04

Pos logit contributions
 eleven+0.005
 done+0.005
 unstable+0.005
 leaning+0.005
 taller+0.005
Neg logit contributions
 Hannah-0.008
ishly-0.008
 Pupp-0.008
rob-0.008
 Woo-0.008
Token<|endoftext|>
Token ID50256
Feature activation-1.69e-03

Pos logit contributions
ishly+0.002
 trends+0.002
 Hannah+0.002
 therapist+0.002
 Pupp+0.002
Neg logit contributions
 eleven-0.001
 unstable-0.001
 taller-0.001
 done-0.001
 leaning-0.001
TokenOnce
Token ID7454
Feature activation+3.62e-03

Pos logit contributions
ishly+0.003
 Hannah+0.003
 Pupp+0.003
 therapist+0.002
 Holly+0.002
Neg logit contributions
 eleven-0.002
 unstable-0.002
 done-0.002
 taller-0.002
 leaning-0.002
Token upon
Token ID2402
Feature activation+2.00e-03

Pos logit contributions
 eleven+0.006
 unstable+0.005
 twelve+0.005
 taller+0.005
Crash+0.005
Neg logit contributions
 Hannah-0.006
 Holly-0.005
 Cindy-0.005
 Pupp-0.005
ishly-0.005
Token a
Token ID257
Feature activation-7.16e-03

Pos logit contributions
 eleven+0.002
 unstable+0.002
 done+0.002
 taller+0.002
 leaning+0.002
Neg logit contributions
ishly-0.004
 Hannah-0.004
 Pupp-0.004
 therapist-0.004
 Holly-0.004
Token time
Token ID640
Feature activation+4.34e-04

Pos logit contributions
 Hannah+0.014
 Holly+0.013
Charl+0.013
 Woo+0.013
 Hazel+0.013
Neg logit contributions
 eleven-0.007
 unstable-0.007
 taller-0.007
 leaning-0.006
ror-0.006
Token there
Token ID612
Feature activation+1.11e-02

Pos logit contributions
 unstable+0.001
 eleven+0.001
 taller+0.001
 done+0.001
ror+0.001
Neg logit contributions
 Hannah-0.001
ishly-0.001
 Pupp-0.000
 Holly-0.000
 Cindy-0.000
Token was
Token ID373
Feature activation+5.81e-03

Pos logit contributions
 done+0.018
 eleven+0.018
 unstable+0.018
 leaning+0.017
 taller+0.017
Neg logit contributions
ishly-0.015
 Hannah-0.014
 trends-0.014
 therapist-0.014
 Pupp-0.013
Token a
Token ID257
Feature activation+9.59e-03

Pos logit contributions
 unstable+0.009
 eleven+0.009
ror+0.009
 done+0.008
 taller+0.008
Neg logit contributions
 Hannah-0.009
 Woo-0.008
 Pupp-0.008
 Holly-0.008
 Cindy-0.008
Token little
Token ID1310
Feature activation+1.09e-02

Pos logit contributions
 eleven+0.015
 unstable+0.014
 done+0.014
 taller+0.013
ror+0.013
Neg logit contributions
 Hannah-0.014
ishly-0.014
 Pupp-0.014
 Holly-0.013
 Cindy-0.013
Token into
Token ID656
Feature activation-2.98e-03

Pos logit contributions
ishly+0.033
 Hannah+0.033
 Pupp+0.032
 Holly+0.031
 Woo+0.031
Neg logit contributions
 eleven-0.015
 unstable-0.014
 done-0.013
 taller-0.013
ror-0.012
Token a
Token ID257
Feature activation-4.59e-03

Pos logit contributions
ishly+0.007
 therapist+0.006
lor+0.006
 Hannah+0.006
 Pupp+0.006
Neg logit contributions
 eleven-0.002
 unstable-0.002
 done-0.002
 taller-0.002
 leaning-0.002
Token big
Token ID1263
Feature activation+2.58e-03

Pos logit contributions
 Hannah+0.008
ishly+0.008
 Pupp+0.008
 Holly+0.008
 Woo+0.008
Neg logit contributions
 eleven-0.006
 unstable-0.006
 done-0.005
 taller-0.005
ror-0.005
Token house
Token ID2156
Feature activation+9.00e-03

Pos logit contributions
 eleven+0.004
 unstable+0.004
 done+0.003
 taller+0.003
ror+0.003
Neg logit contributions
 Hannah-0.004
ishly-0.004
 Pupp-0.004
 Holly-0.004
 Cindy-0.004
Token to
Token ID284
Feature activation-8.01e-03

Pos logit contributions
 unstable+0.016
 eleven+0.015
Crash+0.015
ror+0.014
 twelve+0.014
Neg logit contributions
ishly-0.012
 Hannah-0.011
 Pupp-0.011
 therapist-0.010
 Woo-0.010
Token find
Token ID1064
Feature activation-5.96e-04

Pos logit contributions
ishly+0.016
lor+0.016
 therapist+0.016
rob+0.015
 Hannah+0.015
Neg logit contributions
 done-0.009
 eleven-0.009
 taller-0.009
 high-0.008
 unstable-0.008
Token more
Token ID517
Feature activation-6.54e-03

Pos logit contributions
lor+0.001
ishly+0.001
 therapist+0.001
 trends+0.001
rob+0.001
Neg logit contributions
 eleven-0.000
 taller-0.000
 high-0.000
 higher-0.000
 done-0.000
Token toys
Token ID14958
Feature activation+2.49e-03

Pos logit contributions
ishly+0.013
 Hannah+0.012
 therapist+0.011
 Pupp+0.011
 Hazel+0.011
Neg logit contributions
 eleven-0.008
 unstable-0.007
 taller-0.007
 done-0.007
 leaning-0.007
Token to
Token ID284
Feature activation-9.38e-03

Pos logit contributions
 eleven+0.003
 unstable+0.003
 done+0.003
 taller+0.003
 leaning+0.003
Neg logit contributions
ishly-0.004
 Hannah-0.004
 Pupp-0.004
 therapist-0.004
 Holly-0.004
Token play
Token ID711
Feature activation-1.74e-03

Pos logit contributions
lor+0.024
 therapist+0.023
ishly+0.023
rob+0.022
pillar+0.021
Neg logit contributions
 done-0.007
 high-0.007
 taller-0.007
 eleven-0.006
 building-0.006
Token with
Token ID351
Feature activation+4.35e-03

Pos logit contributions
ishly+0.004
 Hannah+0.004
 therapist+0.004
 Hazel+0.003
lor+0.003
Neg logit contributions
 eleven-0.002
 high-0.002
 taller-0.002
 done-0.001
 unstable-0.001
Token them
Token ID606
Feature activation-8.07e-03

Pos logit contributions
ishly+0.012
 Hannah+0.012
 Pupp+0.012
 therapist+0.012
 Holly+0.011
Neg logit contributions
 eleven-0.012
 unstable-0.011
 done-0.011
 taller-0.011
 twelve-0.011
Token some
Token ID617
Feature activation-5.46e-03

Pos logit contributions
rob+0.019
lor+0.018
ishly+0.018
 therapist+0.017
 Beat+0.017
Neg logit contributions
 eleven-0.007
 high-0.006
 done-0.006
 six-0.006
 twelve-0.005
Token w
Token ID266
Feature activation-1.17e-02

Pos logit contributions
ishly+0.011
lor+0.011
rob+0.010
 Hannah+0.010
 Woo+0.010
Neg logit contributions
 done-0.006
 eleven-0.006
 taller-0.006
 high-0.006
 minutes-0.006
Tokenaffles
Token ID48501
Feature activation+6.62e-03

Pos logit contributions
ishly+0.016
 Pupp+0.016
 Hannah+0.016
 Woo+0.015
 Cindy+0.015
Neg logit contributions
 done-0.020
 eleven-0.020
 taller-0.018
 minutes-0.018
 unstable-0.018
Token to
Token ID284
Feature activation-1.23e-02

Pos logit contributions
 done+0.006
 eleven+0.006
 high+0.006
 taller+0.006
 twelve+0.005
Neg logit contributions
 Pupp-0.014
Gr-0.014
 Beat-0.013
ishly-0.013
 Hannah-0.013
Token m
Token ID285
Feature activation-1.97e-02

Pos logit contributions
lor+0.026
ishly+0.025
rob+0.024
 therapist+0.023
 trends+0.022
Neg logit contributions
 done-0.014
 eleven-0.013
 taller-0.012
 high-0.012
 twelve-0.012
Tokenunch
Token ID3316
Feature activation+3.47e-03

Pos logit contributions
 Hannah+0.032
 Pupp+0.032
 Beat+0.031
 Woo+0.031
 Cindy+0.031
Neg logit contributions
 done-0.028
 eleven-0.026
 leaning-0.025
 taller-0.024
oops-0.024
Token on
Token ID319
Feature activation-5.82e-03

Pos logit contributions
 eleven+0.004
 high+0.004
 done+0.004
 higher+0.004
 taller+0.004
Neg logit contributions
 Pupp-0.006
 trends-0.006
 Hannah-0.006
Gr-0.006
rob-0.006
Token.
Token ID13
Feature activation-5.65e-03

Pos logit contributions
ishly+0.011
 trends+0.011
rob+0.011
lor+0.011
 Pupp+0.011
Neg logit contributions
 eleven-0.006
 high-0.006
 twelve-0.006
 taller-0.006
 done-0.005
Token The
Token ID383
Feature activation-4.72e-03

Pos logit contributions
ishly+0.009
rob+0.009
 Hannah+0.008
 Pupp+0.008
 therapist+0.008
Neg logit contributions
 eleven-0.008
 unstable-0.007
 taller-0.007
 twelve-0.007
 done-0.007
Token w
Token ID266
Feature activation-3.12e-02

Pos logit contributions
ishly+0.007
 Hannah+0.007
 Woo+0.007
 Pupp+0.007
rob+0.006
Neg logit contributions
 eleven-0.007
 done-0.007
 taller-0.007
 unstable-0.007
 twelve-0.007
Token sun
Token ID4252
Feature activation+1.67e-02

Pos logit contributions
 eleven+0.001
 unstable+0.001
 taller+0.001
 done+0.001
ror+0.001
Neg logit contributions
ishly-0.001
 Hannah-0.001
 Pupp-0.001
 therapist-0.001
 Woo-0.001
Token started
Token ID2067
Feature activation+8.88e-03

Pos logit contributions
 eleven+0.025
 unstable+0.024
 done+0.023
ror+0.023
 taller+0.023
Neg logit contributions
ishly-0.025
 Hannah-0.025
 Pupp-0.025
 therapist-0.024
 Holly-0.024
Token to
Token ID284
Feature activation+8.33e-03

Pos logit contributions
 eleven+0.012
 unstable+0.011
ror+0.011
 twelve+0.011
cks+0.011
Neg logit contributions
 Pupp-0.015
ishly-0.015
 Hannah-0.015
 Woo-0.014
 Cindy-0.014
Token set
Token ID900
Feature activation+2.82e-03

Pos logit contributions
 eleven+0.010
 unstable+0.009
 done+0.009
 taller+0.009
 leaning+0.009
Neg logit contributions
ishly-0.015
 Hannah-0.015
 Pupp-0.015
 Holly-0.014
 therapist-0.014
Token,
Token ID11
Feature activation+1.83e-03

Pos logit contributions
 high+0.003
 done+0.003
 higher+0.003
 eleven+0.003
 taller+0.003
Neg logit contributions
rob-0.006
 therapist-0.005
 trends-0.005
lor-0.005
Gr-0.005
Token Mitt
Token ID16627
Feature activation-1.87e-02

Pos logit contributions
 eleven+0.002
 high+0.002
 unstable+0.002
 higher+0.002
 leaning+0.002
Neg logit contributions
 trends-0.003
lor-0.003
ishly-0.003
Gr-0.003
rob-0.003
Tokenens
Token ID641
Feature activation-5.75e-03

Pos logit contributions
ishly+0.028
 trends+0.027
 Hannah+0.027
lor+0.026
Gr+0.026
Neg logit contributions
 done-0.028
 eleven-0.028
 unstable-0.026
 taller-0.026
 leaning-0.025
Token had
Token ID550
Feature activation+3.94e-03

Pos logit contributions
ishly+0.010
 Hannah+0.010
rob+0.009
 Pupp+0.009
 Cindy+0.009
Neg logit contributions
 done-0.007
 eleven-0.007
 taller-0.007
 unstable-0.007
 leaning-0.007
Token to
Token ID284
Feature activation-3.02e-03

Pos logit contributions
 eleven+0.004
 done+0.004
 unstable+0.004
 taller+0.004
 twelve+0.004
Neg logit contributions
ishly-0.008
 Hannah-0.007
 Pupp-0.007
rob-0.007
 therapist-0.007
Token go
Token ID467
Feature activation-3.41e-03

Pos logit contributions
rob+0.006
 therapist+0.006
ishly+0.006
lor+0.006
 trends+0.006
Neg logit contributions
 done-0.003
 eleven-0.003
 taller-0.003
 high-0.003
 twelve-0.003
Token home
Token ID1363
Feature activation-4.16e-03

Pos logit contributions
ishly+0.007
 Hannah+0.007
 Pupp+0.007
 therapist+0.007
 Holly+0.007
Neg logit contributions
 eleven-0.003
 unstable-0.003
 done-0.003
 taller-0.003
 leaning-0.003
Token eventually
Token ID4191
Feature activation+6.14e-03

Pos logit contributions
 Hannah+0.007
ishly+0.007
 Pupp+0.007
 Holly+0.007
 Cindy+0.006
Neg logit contributions
 eleven-0.008
 unstable-0.007
 taller-0.007
 done-0.007
ror-0.007
Token fell
Token ID3214
Feature activation+7.77e-03

Pos logit contributions
 eleven+0.009
 done+0.009
 unstable+0.009
 taller+0.008
 leaning+0.008
Neg logit contributions
ishly-0.010
 Hannah-0.009
 Pupp-0.009
rob-0.009
 therapist-0.009
Token asleep
Token ID16039
Feature activation-1.58e-03

Pos logit contributions
 eleven+0.011
 unstable+0.010
 done+0.010
 taller+0.010
 leaning+0.010
Neg logit contributions
ishly-0.013
 Hannah-0.013
 Pupp-0.012
 therapist-0.012
 Holly-0.012
Token.
Token ID13
Feature activation-2.36e-03

Pos logit contributions
ishly+0.003
 Hannah+0.003
 Pupp+0.003
 Holly+0.003
 Cindy+0.003
Neg logit contributions
 eleven-0.002
 done-0.002
 unstable-0.002
 taller-0.002
 leaning-0.002
Token The
Token ID383
Feature activation-3.48e-03

Pos logit contributions
 Hannah+0.003
ishly+0.003
 Pupp+0.003
 Holly+0.003
 Cindy+0.003
Neg logit contributions
 eleven-0.004
 unstable-0.004
 done-0.004
 taller-0.004
 leaning-0.003
Token monster
Token ID9234
Feature activation-1.79e-02

Pos logit contributions
ishly+0.006
 Hannah+0.006
 Pupp+0.005
 Holly+0.005
 therapist+0.005
Neg logit contributions
 eleven-0.005
 unstable-0.005
 done-0.005
 taller-0.004
 leaning-0.004
Token came
Token ID1625
Feature activation+2.65e-03

Pos logit contributions
ishly+0.028
 Hannah+0.027
 Pupp+0.026
 Holly+0.026
 Cindy+0.026
Neg logit contributions
 eleven-0.026
 unstable-0.026
 done-0.025
 taller-0.024
 leaning-0.024
Token back
Token ID736
Feature activation-1.53e-02

Pos logit contributions
 eleven+0.003
 unstable+0.003
 done+0.003
 taller+0.003
ror+0.003
Neg logit contributions
ishly-0.005
 Hannah-0.005
 Pupp-0.005
 Holly-0.005
 Cindy-0.005
Token and
Token ID290
Feature activation+1.08e-03

Pos logit contributions
ishly+0.028
 Hannah+0.028
 Pupp+0.027
 therapist+0.027
 Holly+0.027
Neg logit contributions
 eleven-0.018
 unstable-0.017
 done-0.017
 taller-0.016
 leaning-0.016
Token this
Token ID428
Feature activation-6.89e-03

Pos logit contributions
 eleven+0.002
 unstable+0.002
 done+0.001
 taller+0.001
ror+0.001
Neg logit contributions
ishly-0.002
 Hannah-0.002
 Pupp-0.002
 Holly-0.002
 Woo-0.002
Token time
Token ID640
Feature activation+6.93e-03

Pos logit contributions
 Hannah+0.015
ishly+0.015
 Pupp+0.015
 therapist+0.014
 Holly+0.014
Neg logit contributions
 eleven-0.006
 unstable-0.005
 taller-0.005
 done-0.005
 leaning-0.005
Token was
Token ID373
Feature activation-9.61e-03

Pos logit contributions
ishly+0.017
 Hannah+0.017
 Pupp+0.016
 Holly+0.016
 therapist+0.015
Neg logit contributions
 eleven-0.026
 unstable-0.025
 done-0.025
 taller-0.024
 leaning-0.024
Token so
Token ID523
Feature activation-1.29e-02

Pos logit contributions
 Pupp+0.014
 Cindy+0.012
 Woo+0.012
 Hannah+0.012
 Holly+0.012
Neg logit contributions
 unstable-0.016
Crash-0.016
 minutes-0.015
 eleven-0.015
isel-0.015
Token huge
Token ID3236
Feature activation-6.47e-03

Pos logit contributions
 Pupp+0.024
 Hannah+0.023
 Holly+0.023
 Cindy+0.022
 Woo+0.022
Neg logit contributions
 eleven-0.015
 unstable-0.014
 done-0.013
 minutes-0.013
ror-0.012
Token that
Token ID326
Feature activation-6.41e-03

Pos logit contributions
ishly+0.009
 Hannah+0.009
 Pupp+0.008
 Holly+0.008
 Woo+0.008
Neg logit contributions
 eleven-0.011
 unstable-0.010
 done-0.010
 taller-0.010
ror-0.010
Token he
Token ID339
Feature activation-3.33e-03

Pos logit contributions
 Hannah+0.007
ishly+0.007
 Pupp+0.007
 Holly+0.007
 Cindy+0.007
Neg logit contributions
 eleven-0.012
 unstable-0.011
 done-0.011
 taller-0.011
ror-0.011
Token could
Token ID714
Feature activation-2.02e-02

Pos logit contributions
 Hannah+0.005
ishly+0.005
 Pupp+0.005
 Holly+0.005
 therapist+0.005
Neg logit contributions
 eleven-0.005
 unstable-0.005
 taller-0.005
 done-0.005
ror-0.005
Token not
Token ID407
Feature activation+1.08e-03

Pos logit contributions
ishly+0.035
 Hannah+0.035
 Pupp+0.034
 Holly+0.033
 Cindy+0.033
Neg logit contributions
 eleven-0.026
 unstable-0.025
 done-0.024
 taller-0.024
 leaning-0.023
Token fit
Token ID4197
Feature activation-4.46e-03

Pos logit contributions
 eleven+0.001
 unstable+0.001
 done+0.001
 taller+0.001
 leaning+0.001
Neg logit contributions
ishly-0.002
 Hannah-0.002
 Pupp-0.002
 Holly-0.002
 Cindy-0.002
Token in
Token ID287
Feature activation-4.51e-04

Pos logit contributions
 Hannah+0.009
ishly+0.009
 Pupp+0.009
 Holly+0.009
 Cindy+0.009
Neg logit contributions
 eleven-0.004
 unstable-0.004
 done-0.004
 taller-0.004
 leaning-0.004
Token the
Token ID262
Feature activation-7.52e-03

Pos logit contributions
 Hannah+0.001
 Pupp+0.001
 Holly+0.001
 Woo+0.001
 Cindy+0.001
Neg logit contributions
 unstable-0.001
 eleven-0.001
 done-0.001
ror-0.001
 taller-0.001
Token seat
Token ID5852
Feature activation+1.31e-02

Pos logit contributions
ishly+0.012
 Pupp+0.012
 Hannah+0.012
 Cindy+0.011
 Woo+0.011
Neg logit contributions
 unstable-0.010
 eleven-0.010
 done-0.009
ror-0.009
 leaning-0.009
Token face
Token ID1986
Feature activation-8.55e-04

Pos logit contributions
ishly+0.003
 Hannah+0.003
 Pupp+0.003
 Holly+0.003
 therapist+0.003
Neg logit contributions
 eleven-0.004
 unstable-0.004
 done-0.004
 taller-0.004
 leaning-0.004
Token.
Token ID13
Feature activation-2.26e-04

Pos logit contributions
rob+0.002
 Hannah+0.002
ishly+0.002
 Pupp+0.001
 Woo+0.001
Neg logit contributions
 eleven-0.001
 done-0.001
 taller-0.001
 leaning-0.001
 unstable-0.001
Token
Token ID198
Feature activation+2.19e-03

Pos logit contributions
ishly+0.000
rob+0.000
 trends+0.000
 therapist+0.000
 Hannah+0.000
Neg logit contributions
 eleven-0.000
 done-0.000
 unstable-0.000
 taller-0.000
 leaning-0.000
Token
Token ID198
Feature activation+1.34e-04

Pos logit contributions
 eleven+0.003
 unstable+0.003
 done+0.003
 taller+0.003
 leaning+0.003
Neg logit contributions
ishly-0.004
 Hannah-0.004
 Pupp-0.003
 therapist-0.003
 Cindy-0.003
TokenWhen
Token ID2215
Feature activation+5.77e-03

Pos logit contributions
 eleven+0.000
 done+0.000
 unstable+0.000
Crash+0.000
 high+0.000
Neg logit contributions
ishly-0.000
rob-0.000
 Pupp-0.000
 Hannah-0.000
 Cindy-0.000
Token Mitt
Token ID16627
Feature activation-2.26e-02

Pos logit contributions
 eleven+0.009
 done+0.009
 unstable+0.009
 taller+0.008
 leaning+0.008
Neg logit contributions
ishly-0.009
 Hannah-0.008
rob-0.008
 Pupp-0.008
 therapist-0.008
Tokenens
Token ID641
Feature activation-8.15e-03

Pos logit contributions
ishly+0.036
lor+0.034
 trends+0.034
 Pupp+0.034
Gr+0.033
Neg logit contributions
 done-0.033
 eleven-0.030
 leaning-0.027
 unstable-0.027
 minutes-0.027
Token saw
Token ID2497
Feature activation-2.75e-03

Pos logit contributions
ishly+0.014
 Hannah+0.014
 Pupp+0.014
 Holly+0.013
 therapist+0.013
Neg logit contributions
 eleven-0.010
 unstable-0.010
 done-0.010
 taller-0.009
 leaning-0.009
Token the
Token ID262
Feature activation+1.31e-04

Pos logit contributions
lor+0.004
ishly+0.004
 trends+0.004
rob+0.004
 therapist+0.004
Neg logit contributions
 eleven-0.004
 unstable-0.004
 taller-0.004
 done-0.004
 high-0.003
Token picture
Token ID4286
Feature activation+9.01e-04

Pos logit contributions
 eleven+0.000
 unstable+0.000
ror+0.000
 done+0.000
cks+0.000
Neg logit contributions
ishly-0.000
 Hannah-0.000
 Pupp-0.000
 puppy-0.000
 Woo-0.000
Token,
Token ID11
Feature activation+3.95e-05

Pos logit contributions
 eleven+0.002
 unstable+0.002
ror+0.001
 done+0.001
 taller+0.001
Neg logit contributions
ishly-0.001
 Hannah-0.001
 Pupp-0.001
 Holly-0.001
 therapist-0.001