TOP ACTIVATIONS
MAX = 6.940

parents smiled and said , " We are glad you understand
Dad smiled and said , " You worked hard to learn
smiled knowingly and said , " I knew you could do
father smiled and said , " We should always remember to
mother smiled and said , " G em ma , you
, smiled and said , " Yes , I will !"
parents smiled and said , " Let 's go outside and
him tightly , saying , " You wrapped that duck in
his mom reminded him , " Remember , it 's important
dad smiled and said , " That 's what it 's
dad smiled and said , " Yes , they are .
mom smiled and said , " That 's my good boy
dad laughed and said , " Ok , let 's get
mum smiled and said , " It 's okay , Sally
my smiled and said , " That 's right , Jill
mom smiled and said , " That 's right sweet heart
dad laughed and said , " Well , sort away then
Emma smiled and said , " See , I told you
engine smiled and said , " Yes ! I accept !"
hugged her and said , " See what happens when you

MIN ACTIVATIONS
MIN = -7.228

you help me unlock the door ?" the cater pillar said
What is happening to my mushroom ?" Just then
" What is in the box ?" Sam asked .
What do you want to eat , Mia ?" her mom
Why are you wearing a mask ?" she asked .
Can I play with your puppy ?" asked Tim my .
! Would you like a cookie ?" she asked her t
" Can I touch the flower ?" Lily asked Tom .
put hot sauce on our sandwiches ?" Anna cries .
" What is in the box ?" Lily asked .
Why did you steal my cereal ?" the girl asked in
" What is in the box ?" Tom asks .
is the lady wearing a veil ?" Her mom
" What 's in the cage , Daddy ?" Pete asked
" Do you like my mustache ?" he asks .
" What 's in the box ?" she said . The
, can you heal my doll ? She is hurt and
" Can I pet your dog ?" Tim my asked the
, what 's in the hole ?" asked Lily . Her
, can we pet your dog ?" L ila asked the

INTERVAL 3.398 - 6.940
CONTAINS 1.030%

smiled . She said , " Go and hide now ,
From that day on , Nem o knew that even though
the door and says , " Mail for mom and dad
that day on , Tim my played with his triangles every
to Tom . " Here , Tom , you

INTERVAL -0.144 - 3.398
CONTAINS 54.788%

pay attention to where he was going . Before he knew
the classroom . She was excited because her teacher was going
and went to the party . She played with her friends
r ained . The end . <|endoftext|> Once upon a time
are fast . They are happy . They reach

INTERVAL -3.686 - -0.144
CONTAINS 43.498%

in a small park , looking for food to eat .
pulled it away from her . " You have enough red
M um and Dad hugged each other and kissed on the
. Tom takes a deep breath . He walks
to cross the p uddle , can you help me ?"

INTERVAL -7.228 - -3.686
CONTAINS 0.683%

" What was that ?" asked Lily .
to give it back to me later . Do you mind
are you doing ?" The farmer says . He takes the
oh no , oh no !" Mom said .
be careful not to fall off the slide !"
Token parents
Token ID3397
Feature activation+1.10e+00

Pos logit contributions
Years+0.005
Days+0.005
umm+0.005
iked+0.005
 By+0.005
Neg logit contributions
?"-0.003
 Anna-0.003
 sob-0.003
 mouth-0.003
 ther-0.002
Token smiled
Token ID13541
Feature activation+2.38e+00

Pos logit contributions
Years+0.037
Days+0.037
umm+0.037
Hours+0.035
 By+0.034
Neg logit contributions
?"-0.014
 sob-0.012
 Anna-0.012
 ther-0.011
 mouth-0.010
Token and
Token ID290
Feature activation-1.89e-01

Pos logit contributions
Years+0.071
Days+0.071
umm+0.070
Hours+0.066
iked+0.065
Neg logit contributions
?"-0.039
 sob-0.037
 Anna-0.036
 mouth-0.033
 angry-0.033
Token said
Token ID531
Feature activation+1.48e+00

Pos logit contributions
?"+0.002
 Anna+0.002
 sob+0.002
 ther+0.002
 mouth+0.002
Neg logit contributions
Years-0.007
Days-0.007
umm-0.006
 By-0.006
iked-0.006
Token,
Token ID11
Feature activation-6.64e-01

Pos logit contributions
Years+0.046
Days+0.045
umm+0.045
Hours+0.042
 By+0.041
Neg logit contributions
?"-0.023
 Anna-0.022
 sob-0.021
 mouth-0.019
 ther-0.019
Token "
Token ID366
Feature activation+6.94e+00

Pos logit contributions
?"+0.010
 Anna+0.009
 sob+0.009
 Mr+0.008
 mouth+0.008
Neg logit contributions
umm-0.021
Years-0.021
Days-0.021
 hours-0.019
iked-0.019
TokenWe
Token ID1135
Feature activation+1.68e+00

Pos logit contributions
iked+0.202
umm+0.193
 achievement+0.180
oor+0.180
istence+0.177
Neg logit contributions
 Anna-0.116
Anna-0.099
 Mr-0.096
 sob-0.096
?"-0.095
Token are
Token ID389
Feature activation+1.02e+00

Pos logit contributions
umm+0.053
Years+0.052
Days+0.052
Hours+0.048
iked+0.048
Neg logit contributions
?"-0.027
 sob-0.023
 Anna-0.023
 ther-0.021
 mouth-0.021
Token glad
Token ID9675
Feature activation+1.12e+00

Pos logit contributions
Years+0.028
Days+0.028
umm+0.027
 By+0.026
Hours+0.026
Neg logit contributions
?"-0.019
 Anna-0.017
 sob-0.017
 ther-0.016
 mouth-0.016
Token you
Token ID345
Feature activation-8.89e-01

Pos logit contributions
Years+0.036
Days+0.036
umm+0.035
Hours+0.033
iked+0.033
Neg logit contributions
?"-0.017
 Anna-0.016
 sob-0.014
 Mr-0.014
 Sunny-0.013
Token understand
Token ID1833
Feature activation-1.61e+00

Pos logit contributions
 Anna+0.018
?"+0.017
 ther+0.017
 sob+0.017
 Milo+0.016
Neg logit contributions
Years-0.021
Days-0.021
 By-0.020
 hours-0.020
 lear-0.019
Token Dad
Token ID17415
Feature activation+1.58e+00

Pos logit contributions
Days+0.089
Years+0.089
umm+0.087
Hours+0.085
istence+0.084
Neg logit contributions
 mouth-0.019
?"-0.018
 angry-0.018
 sandwich-0.016
 frown-0.016
Token smiled
Token ID13541
Feature activation+2.53e+00

Pos logit contributions
Years+0.050
Days+0.050
umm+0.049
 By+0.046
iked+0.046
Neg logit contributions
?"-0.023
 Anna-0.021
 sob-0.020
 ther-0.019
 mouth-0.018
Token and
Token ID290
Feature activation-2.96e-01

Pos logit contributions
Days+0.077
Years+0.076
umm+0.074
Hours+0.070
iked+0.070
Neg logit contributions
?"-0.042
 Anna-0.039
 sob-0.037
 mouth-0.035
 Mr-0.035
Token said
Token ID531
Feature activation+1.63e+00

Pos logit contributions
?"+0.003
 Anna+0.003
 sob+0.003
 ther+0.003
 mouth+0.002
Neg logit contributions
Years-0.010
Days-0.010
umm-0.010
 By-0.010
iked-0.010
Token,
Token ID11
Feature activation-5.08e-01

Pos logit contributions
Years+0.051
Days+0.051
umm+0.050
Hours+0.047
 hours+0.046
Neg logit contributions
?"-0.026
 Anna-0.023
 sob-0.022
 mouth-0.020
 Mr-0.020
Token "
Token ID366
Feature activation+6.94e+00

Pos logit contributions
?"+0.007
 Anna+0.007
 sob+0.006
 Mr+0.006
 mouth+0.006
Neg logit contributions
umm-0.016
Days-0.016
Years-0.016
Hours-0.015
iked-0.015
TokenYou
Token ID1639
Feature activation-1.68e-01

Pos logit contributions
iked+0.179
Days+0.172
 achievement+0.156
seek+0.154
Hours+0.153
Neg logit contributions
 Anna-0.128
?"-0.127
 Mr-0.118
 my-0.117
 Ben-0.114
Token worked
Token ID3111
Feature activation+1.54e+00

Pos logit contributions
?"+0.003
 Anna+0.003
 sob+0.003
 mouth+0.002
 ther+0.002
Neg logit contributions
Years-0.005
Days-0.005
umm-0.005
iked-0.005
Hours-0.005
Token hard
Token ID1327
Feature activation+1.55e+00

Pos logit contributions
umm+0.041
Days+0.041
Years+0.040
iked+0.039
 lear+0.036
Neg logit contributions
?"-0.033
 Anna-0.029
 sob-0.026
 Mr-0.026
 angry-0.026
Token to
Token ID284
Feature activation+2.12e+00

Pos logit contributions
Days+0.052
Years+0.049
umm+0.049
Hours+0.046
Eventually+0.043
Neg logit contributions
?"-0.028
 Anna-0.021
 the-0.019
 my-0.019
 angry-0.019
Token learn
Token ID2193
Feature activation-4.18e-01

Pos logit contributions
iked+0.074
umm+0.072
Days+0.071
Years+0.070
itted+0.069
Neg logit contributions
?"-0.034
 hurt-0.023
 the-0.023
 Anna-0.022
 my-0.022
Token smiled
Token ID13541
Feature activation+2.64e+00

Pos logit contributions
Years+0.036
Days+0.036
umm+0.035
iked+0.033
Hours+0.033
Neg logit contributions
?"-0.019
 Anna-0.018
 sob-0.017
 ther-0.016
 mouth-0.016
Token knowingly
Token ID26526
Feature activation+1.68e+00

Pos logit contributions
Years+0.079
Days+0.079
umm+0.076
Hours+0.072
iked+0.072
Neg logit contributions
?"-0.045
 Anna-0.041
 sob-0.040
 mouth-0.038
 Mr-0.037
Token and
Token ID290
Feature activation+2.82e-02

Pos logit contributions
Years+0.053
Days+0.052
umm+0.051
Hours+0.049
iked+0.048
Neg logit contributions
?"-0.027
 Anna-0.024
 sob-0.023
 mouth-0.022
 angry-0.021
Token said
Token ID531
Feature activation+1.35e+00

Pos logit contributions
Years+0.001
Days+0.001
umm+0.001
 By+0.001
iked+0.001
Neg logit contributions
?"-0.000
 Anna-0.000
 sob-0.000
 ther-0.000
 mouth-0.000
Token,
Token ID11
Feature activation-6.13e-01

Pos logit contributions
Years+0.040
Days+0.040
umm+0.039
 By+0.037
iked+0.037
Neg logit contributions
?"-0.022
 Anna-0.020
 sob-0.020
 ther-0.019
 mouth-0.018
Token "
Token ID366
Feature activation+6.81e+00

Pos logit contributions
?"+0.010
 Anna+0.009
 sob+0.009
 Mr+0.008
 mouth+0.008
Neg logit contributions
Years-0.019
Days-0.019
umm-0.019
iked-0.017
Hours-0.017
TokenI
Token ID40
Feature activation+5.10e-01

Pos logit contributions
iked+0.161
Days+0.155
umm+0.145
Years+0.145
 achievement+0.143
Neg logit contributions
 Anna-0.136
?"-0.133
 Mr-0.123
 Ben-0.120
 Sunny-0.120
Token knew
Token ID2993
Feature activation+3.86e-01

Pos logit contributions
umm+0.015
Days+0.015
Years+0.015
iked+0.014
Hours+0.013
Neg logit contributions
?"-0.010
 Anna-0.008
 sob-0.008
 sees-0.008
 ther-0.008
Token you
Token ID345
Feature activation+6.63e-01

Pos logit contributions
Days+0.012
Years+0.012
umm+0.012
iked+0.011
Hours+0.011
Neg logit contributions
?"-0.006
 Anna-0.006
 sob-0.005
 mouth-0.005
 Mr-0.005
Token could
Token ID714
Feature activation+2.80e+00

Pos logit contributions
Years+0.019
Days+0.018
umm+0.018
 By+0.017
iked+0.017
Neg logit contributions
?"-0.012
 Anna-0.011
 sob-0.011
 ther-0.010
 mouth-0.010
Token do
Token ID466
Feature activation+1.40e+00

Pos logit contributions
Years+0.087
Days+0.087
umm+0.087
iked+0.085
Hours+0.081
Neg logit contributions
?"-0.051
 Anna-0.038
 sob-0.036
 hurt-0.036
 bear-0.035
Token father
Token ID2988
Feature activation+6.48e-01

Pos logit contributions
Years+0.044
Days+0.044
umm+0.044
iked+0.043
 By+0.042
Neg logit contributions
?"-0.021
 angry-0.018
 sob-0.018
 Anna-0.017
 Mr-0.017
Token smiled
Token ID13541
Feature activation+2.41e+00

Pos logit contributions
Years+0.022
Days+0.022
umm+0.021
 By+0.020
iked+0.020
Neg logit contributions
?"-0.008
 Anna-0.007
 sob-0.007
 ther-0.006
 mouth-0.006
Token and
Token ID290
Feature activation-7.61e-01

Pos logit contributions
Years+0.069
Days+0.069
umm+0.067
Hours+0.063
iked+0.063
Neg logit contributions
?"-0.043
 Anna-0.039
 sob-0.039
 mouth-0.036
 ther-0.036
Token said
Token ID531
Feature activation+1.46e+00

Pos logit contributions
?"+0.010
 Anna+0.008
 sob+0.008
 ther+0.008
 mouth+0.007
Neg logit contributions
Years-0.025
Days-0.025
umm-0.024
 By-0.024
iked-0.024
Token,
Token ID11
Feature activation-1.17e+00

Pos logit contributions
Years+0.046
Days+0.045
umm+0.044
Hours+0.042
 By+0.041
Neg logit contributions
?"-0.023
 Anna-0.021
 sob-0.020
 ther-0.018
 mouth-0.018
Token "
Token ID366
Feature activation+6.79e+00

Pos logit contributions
?"+0.018
 Anna+0.016
 sob+0.015
 Mr+0.014
 mouth+0.014
Neg logit contributions
Days-0.037
Years-0.037
umm-0.037
 hours-0.034
Hours-0.034
TokenWe
Token ID1135
Feature activation+1.16e+00

Pos logit contributions
iked+0.190
Days+0.186
Hours+0.175
 achievement+0.173
istence+0.173
Neg logit contributions
?"-0.109
 Anna-0.108
 Mr-0.101
 Sunny-0.097
 my-0.095
Token should
Token ID815
Feature activation+2.52e+00

Pos logit contributions
Years+0.031
Days+0.031
umm+0.030
iked+0.028
Hours+0.028
Neg logit contributions
?"-0.023
 Anna-0.021
 sob-0.020
 ther-0.019
 mouth-0.019
Token always
Token ID1464
Feature activation+2.45e+00

Pos logit contributions
Days+0.075
Years+0.074
umm+0.074
iked+0.072
Hours+0.071
Neg logit contributions
?"-0.052
 my-0.037
?-0.036
 mouth-0.036
 Mr-0.036
Token remember
Token ID3505
Feature activation+2.78e-01

Pos logit contributions
Days+0.086
umm+0.085
iked+0.084
Years+0.084
Hours+0.082
Neg logit contributions
?"-0.036
 mouth-0.024
 my-0.024
 frown-0.024
?-0.023
Token to
Token ID284
Feature activation+1.99e+00

Pos logit contributions
Years+0.008
Days+0.008
umm+0.008
iked+0.007
Hours+0.007
Neg logit contributions
?"-0.005
 Anna-0.005
 sob-0.004
 mouth-0.004
 ther-0.004
Token mother
Token ID2802
Feature activation-2.11e-01

Pos logit contributions
?"+0.005
 Anna+0.005
 sob+0.004
 ther+0.004
 mouth+0.004
Neg logit contributions
Years-0.011
Days-0.011
umm-0.011
 By-0.010
iked-0.010
Token smiled
Token ID13541
Feature activation+1.13e+00

Pos logit contributions
?"+0.003
 Anna+0.003
 ther+0.003
 mouth+0.002
chard+0.002
Neg logit contributions
Years-0.007
Days-0.007
 hours-0.006
umm-0.006
iked-0.006
Token and
Token ID290
Feature activation-1.34e+00

Pos logit contributions
Years+0.030
Days+0.030
umm+0.029
 By+0.027
iked+0.027
Neg logit contributions
?"-0.022
 Anna-0.020
 sob-0.020
 ther-0.019
 mouth-0.019
Token said
Token ID531
Feature activation+6.37e-02

Pos logit contributions
?"+0.017
 Anna+0.013
 ther+0.013
 sob+0.013
 mouth+0.012
Neg logit contributions
Years-0.046
Days-0.044
umm-0.043
 By-0.042
Hours-0.042
Token,
Token ID11
Feature activation-2.27e+00

Pos logit contributions
Years+0.002
Days+0.002
umm+0.002
iked+0.002
 By+0.002
Neg logit contributions
?"-0.001
 Anna-0.001
 sob-0.001
 ther-0.001
 mouth-0.001
Token "
Token ID366
Feature activation+6.78e+00

Pos logit contributions
?"+0.040
 Anna+0.037
 sob+0.036
 ther+0.034
 mouth+0.033
Neg logit contributions
Years-0.065
Days-0.064
umm-0.063
 By-0.059
iked-0.059
TokenG
Token ID38
Feature activation+2.68e+00

Pos logit contributions
iked+0.185
umm+0.175
 achievement+0.173
 hours+0.168
istence+0.168
Neg logit contributions
 Anna-0.111
?"-0.105
 Sara-0.100
 Mr-0.100
 sob-0.098
Tokenem
Token ID368
Feature activation+7.49e-01

Pos logit contributions
Years+0.063
Days+0.062
umm+0.062
 hours+0.060
 Hours+0.058
Neg logit contributions
?"-0.064
anut-0.055
 ther-0.052
 Anna-0.051
aj-0.050
Tokenma
Token ID2611
Feature activation+6.76e-01

Pos logit contributions
Years+0.022
Days+0.021
umm+0.021
iked+0.020
 By+0.020
Neg logit contributions
?"-0.012
 sob-0.012
 Anna-0.012
 ther-0.011
 mouth-0.011
Token,
Token ID11
Feature activation+5.82e-02

Pos logit contributions
Years+0.019
Days+0.019
umm+0.018
 By+0.018
iked+0.018
Neg logit contributions
?"-0.011
 sob-0.010
 Anna-0.010
 ther-0.010
chard-0.009
Token you
Token ID345
Feature activation-6.95e-01

Pos logit contributions
Years+0.002
Days+0.002
umm+0.002
iked+0.002
Hours+0.002
Neg logit contributions
?"-0.001
 Anna-0.001
 sob-0.001
 ther-0.001
 mouth-0.001
Token,
Token ID11
Feature activation+2.71e+00

Pos logit contributions
Days+0.103
Years+0.102
Hours+0.096
umm+0.095
Eventually+0.093
Neg logit contributions
?"-0.037
 Anna-0.035
 sob-0.032
 mouth-0.031
 Mr-0.028
Token smiled
Token ID13541
Feature activation+2.97e+00

Pos logit contributions
Days+0.070
Years+0.069
umm+0.069
iked+0.064
Hours+0.063
Neg logit contributions
 Anna-0.054
?"-0.054
 sob-0.051
 Mr-0.050
 mouth-0.049
Token and
Token ID290
Feature activation+1.56e+00

Pos logit contributions
Years+0.084
Days+0.084
umm+0.081
Hours+0.077
 By+0.076
Neg logit contributions
?"-0.054
 Anna-0.050
 sob-0.048
 mouth-0.045
 ther-0.045
Token said
Token ID531
Feature activation+2.09e+00

Pos logit contributions
Days+0.046
Years+0.045
umm+0.045
iked+0.042
Hours+0.041
Neg logit contributions
?"-0.027
 Anna-0.026
 sob-0.025
 mouth-0.024
 angry-0.023
Token,
Token ID11
Feature activation+3.26e-01

Pos logit contributions
Years+0.062
umm+0.062
Days+0.062
Hours+0.056
 hours+0.056
Neg logit contributions
?"-0.035
 Anna-0.033
 sob-0.031
 mouth-0.029
 Mr-0.029
Token "
Token ID366
Feature activation+6.78e+00

Pos logit contributions
umm+0.011
Days+0.010
Years+0.010
iked+0.010
 hours+0.010
Neg logit contributions
?"-0.005
 Anna-0.004
 sob-0.004
 Mr-0.004
 Sunny-0.004
TokenYes
Token ID5297
Feature activation-1.16e+00

Pos logit contributions
iked+0.161
Days+0.154
umm+0.148
 achievement+0.147
Years+0.145
Neg logit contributions
 Anna-0.132
?"-0.129
 Sunny-0.121
bo-0.116
 Fl-0.116
Token,
Token ID11
Feature activation+7.87e-01

Pos logit contributions
?"+0.021
 sob+0.020
 Anna+0.019
 ther+0.018
 mouth+0.018
Neg logit contributions
Years-0.032
Days-0.032
umm-0.031
 By-0.029
iked-0.029
Token I
Token ID314
Feature activation+9.91e-01

Pos logit contributions
Years+0.024
Days+0.024
umm+0.024
iked+0.023
Hours+0.023
Neg logit contributions
?"-0.012
 Anna-0.011
 sob-0.010
 Mr-0.010
 ther-0.010
Token will
Token ID481
Feature activation+1.52e+00

Pos logit contributions
umm+0.031
Days+0.030
Years+0.030
iked+0.028
Hours+0.027
Neg logit contributions
?"-0.017
 Anna-0.014
 sob-0.014
 ther-0.013
 sees-0.013
Token!"
Token ID2474
Feature activation+1.41e+00

Pos logit contributions
Years+0.045
Days+0.045
umm+0.045
iked+0.043
Hours+0.042
Neg logit contributions
?"-0.028
 Anna-0.023
 sob-0.022
 mouth-0.021
 Sunny-0.020
Token parents
Token ID3397
Feature activation+1.47e+00

Pos logit contributions
Years+0.008
Days+0.008
umm+0.007
 By+0.007
iked+0.007
Neg logit contributions
?"-0.003
 sob-0.003
 Anna-0.003
 mouth-0.003
 angry-0.003
Token smiled
Token ID13541
Feature activation+2.11e+00

Pos logit contributions
Years+0.046
Days+0.045
umm+0.045
Hours+0.042
 By+0.042
Neg logit contributions
?"-0.023
 Anna-0.020
 sob-0.020
 ther-0.018
 mouth-0.018
Token and
Token ID290
Feature activation-2.29e-01

Pos logit contributions
Years+0.062
Days+0.061
umm+0.060
Hours+0.056
 By+0.056
Neg logit contributions
?"-0.037
 Anna-0.034
 sob-0.034
 mouth-0.031
 ther-0.031
Token said
Token ID531
Feature activation+1.60e+00

Pos logit contributions
?"+0.003
 Anna+0.002
 sob+0.002
 ther+0.002
 mouth+0.002
Neg logit contributions
Years-0.008
Days-0.008
umm-0.007
 By-0.007
iked-0.007
Token,
Token ID11
Feature activation-7.64e-01

Pos logit contributions
Years+0.048
Days+0.047
umm+0.046
 By+0.044
iked+0.044
Neg logit contributions
?"-0.026
 Anna-0.024
 sob-0.023
 ther-0.022
 mouth-0.022
Token "
Token ID366
Feature activation+6.77e+00

Pos logit contributions
?"+0.011
 Anna+0.010
 sob+0.010
 Mr+0.009
 mouth+0.009
Neg logit contributions
umm-0.024
Days-0.024
Years-0.024
 hours-0.023
iked-0.022
TokenLet
Token ID5756
Feature activation+1.45e+00

Pos logit contributions
iked+0.177
 achievement+0.165
umm+0.164
Days+0.162
 example+0.158
Neg logit contributions
 Anna-0.121
?"-0.107
 Ben-0.105
 Sunny-0.103
 Mr-0.102
Token's
Token ID338
Feature activation+6.23e-01

Pos logit contributions
Years+0.047
Days+0.046
Hours+0.043
umm+0.043
 By+0.043
Neg logit contributions
chard-0.019
?"-0.018
 sob-0.017
 Frost-0.017
 Anna-0.017
Token go
Token ID467
Feature activation+6.00e-02

Pos logit contributions
Years+0.019
Days+0.019
umm+0.018
iked+0.017
 By+0.017
Neg logit contributions
?"-0.011
 Anna-0.009
 sob-0.009
 mouth-0.008
 ther-0.008
Token outside
Token ID2354
Feature activation-1.08e+00

Pos logit contributions
Years+0.002
Days+0.002
umm+0.002
Hours+0.002
iked+0.002
Neg logit contributions
?"-0.001
 Anna-0.001
 sob-0.001
 ther-0.001
 mouth-0.001
Token and
Token ID290
Feature activation-3.60e-01

Pos logit contributions
?"+0.020
 Anna+0.019
 sob+0.018
 ther+0.017
 mouth+0.017
Neg logit contributions
Years-0.031
Days-0.030
umm-0.029
Hours-0.028
iked-0.027
Token him
Token ID683
Feature activation+6.45e-02

Pos logit contributions
Years+0.051
Days+0.051
umm+0.050
Hours+0.047
iked+0.047
Neg logit contributions
?"-0.027
 Anna-0.025
 sob-0.024
 Mr-0.022
 ther-0.022
Token tightly
Token ID17707
Feature activation-3.61e-01

Pos logit contributions
Years+0.002
Days+0.001
umm+0.001
 By+0.001
 example+0.001
Neg logit contributions
?"-0.001
 sob-0.001
 Anna-0.001
 ther-0.001
 sees-0.001
Token,
Token ID11
Feature activation+5.42e-01

Pos logit contributions
?"+0.005
 Anna+0.005
 sob+0.005
 mouth+0.004
 ther+0.004
Neg logit contributions
Years-0.012
Days-0.012
umm-0.011
Hours-0.011
iked-0.011
Token saying
Token ID2282
Feature activation+8.97e-01

Pos logit contributions
Years+0.015
Days+0.014
umm+0.014
iked+0.013
Hours+0.013
Neg logit contributions
?"-0.011
 Anna-0.010
 sob-0.010
 mouth-0.009
 ther-0.009
Token,
Token ID11
Feature activation-7.99e-01

Pos logit contributions
Years+0.026
Days+0.026
umm+0.025
 By+0.024
iked+0.024
Neg logit contributions
?"-0.016
 Anna-0.014
 sob-0.014
 ther-0.013
 mouth-0.013
Token "
Token ID366
Feature activation+6.74e+00

Pos logit contributions
?"+0.012
 Anna+0.011
 sob+0.011
 mouth+0.010
 Mr+0.010
Neg logit contributions
Years-0.024
umm-0.024
Days-0.024
iked-0.023
 hours-0.023
TokenYou
Token ID1639
Feature activation-5.49e-01

Pos logit contributions
iked+0.164
Days+0.162
umm+0.157
Years+0.157
 example+0.151
Neg logit contributions
 Anna-0.133
?"-0.126
 Mr-0.120
 Sunny-0.119
 Sara-0.118
Token wrapped
Token ID12908
Feature activation-1.03e-01

Pos logit contributions
?"+0.010
 Anna+0.010
 sob+0.009
 ther+0.009
 mouth+0.009
Neg logit contributions
Years-0.015
Days-0.014
umm-0.014
 By-0.014
 hours-0.013
Token that
Token ID326
Feature activation-4.73e-01

Pos logit contributions
?"+0.001
 sob+0.001
 Anna+0.001
 ther+0.001
 mouth+0.001
Neg logit contributions
Years-0.003
Days-0.003
umm-0.003
 hours-0.003
 example-0.003
Token duck
Token ID22045
Feature activation-8.78e-01

Pos logit contributions
?"+0.007
 sob+0.007
 Anna+0.007
anut+0.007
cks+0.006
Neg logit contributions
Years-0.013
Days-0.013
umm-0.013
 example-0.012
 days-0.012
Token in
Token ID287
Feature activation-6.56e-01

Pos logit contributions
 sob+0.015
?"+0.015
 Anna+0.014
 ther+0.013
 Mr+0.013
Neg logit contributions
Years-0.024
Days-0.024
 hours-0.022
 days-0.022
 example-0.022
Token his
Token ID465
Feature activation+4.13e-01

Pos logit contributions
Days+0.048
Years+0.048
umm+0.048
istence+0.044
 By+0.044
Neg logit contributions
?"-0.021
 Anna-0.019
 sob-0.017
 mouth-0.017
 Mr-0.017
Token mom
Token ID1995
Feature activation+9.77e-01

Pos logit contributions
Years+0.015
Days+0.015
umm+0.014
 hours+0.014
 days+0.014
Neg logit contributions
?"-0.005
 Anna-0.003
 ther-0.003
 sob-0.003
 Sunny-0.003
Token reminded
Token ID14516
Feature activation+3.09e+00

Pos logit contributions
Years+0.030
Days+0.030
umm+0.028
 By+0.028
iked+0.027
Neg logit contributions
?"-0.015
 Anna-0.013
 ther-0.013
 sob-0.013
 Frost-0.012
Token him
Token ID683
Feature activation+2.86e+00

Pos logit contributions
umm+0.107
Days+0.106
Years+0.105
Hours+0.100
istence+0.099
Neg logit contributions
?"-0.037
 Anna-0.034
 Ben-0.030
 Mr-0.030
 Leo-0.029
Token,
Token ID11
Feature activation+1.01e+00

Pos logit contributions
umm+0.079
Years+0.077
Days+0.077
Hours+0.071
iked+0.069
Neg logit contributions
?"-0.058
 Anna-0.054
 sob-0.050
 mouth-0.049
 Mr-0.047
Token "
Token ID366
Feature activation+6.72e+00

Pos logit contributions
umm+0.030
Days+0.030
Years+0.030
iked+0.028
Hours+0.028
Neg logit contributions
?"-0.017
 Anna-0.016
 sob-0.015
 mouth-0.014
 Mr-0.014
TokenRemember
Token ID16676
Feature activation+1.32e+00

Pos logit contributions
umm+0.150
iked+0.149
Days+0.148
istence+0.144
 achievement+0.144
Neg logit contributions
?"-0.138
 Anna-0.125
bug-0.120
 Sara-0.120
 ther-0.120
Token,
Token ID11
Feature activation+1.59e+00

Pos logit contributions
Years+0.036
Days+0.036
umm+0.035
 By+0.034
iked+0.033
Neg logit contributions
?"-0.024
 sob-0.022
 Anna-0.022
 ther-0.021
 mouth-0.020
Token it
Token ID340
Feature activation+6.73e-01

Pos logit contributions
Days+0.048
Years+0.048
umm+0.047
Hours+0.045
iked+0.044
Neg logit contributions
?"-0.027
 Anna-0.024
 my-0.023
 Mr-0.022
 mouth-0.022
Token's
Token ID338
Feature activation+1.30e+00

Pos logit contributions
Years+0.024
Days+0.023
umm+0.023
 By+0.022
Hours+0.022
Neg logit contributions
?"-0.007
 Anna-0.006
 sob-0.006
 ther-0.006
 mouth-0.005
Token important
Token ID1593
Feature activation+1.59e+00

Pos logit contributions
Years+0.025
Days+0.024
umm+0.023
 By+0.021
Hours+0.021
Neg logit contributions
?"-0.036
 Anna-0.034
 sob-0.034
 ther-0.033
 mouth-0.032
Token dad
Token ID9955
Feature activation+9.75e-01

Pos logit contributions
Years+0.030
Days+0.030
umm+0.029
Hours+0.028
 By+0.028
Neg logit contributions
?"-0.008
 Anna-0.007
 sob-0.006
 mouth-0.006
 angry-0.006
Token smiled
Token ID13541
Feature activation+2.84e+00

Pos logit contributions
Years+0.034
Days+0.034
umm+0.033
iked+0.031
 By+0.031
Neg logit contributions
?"-0.011
 Anna-0.010
 sob-0.009
 ther-0.009
 mouth-0.008
Token and
Token ID290
Feature activation+8.69e-02

Pos logit contributions
Days+0.088
Years+0.088
umm+0.086
Hours+0.082
 hours+0.080
Neg logit contributions
?"-0.045
 Anna-0.041
 sob-0.041
 mouth-0.038
 Mr-0.038
Token said
Token ID531
Feature activation+1.77e+00

Pos logit contributions
Years+0.003
Days+0.003
umm+0.003
iked+0.003
 By+0.003
Neg logit contributions
?"-0.001
 Anna-0.001
 sob-0.001
 ther-0.001
 mouth-0.001
Token,
Token ID11
Feature activation-6.93e-01

Pos logit contributions
Years+0.057
Days+0.056
umm+0.055
Hours+0.052
 By+0.051
Neg logit contributions
?"-0.027
 Anna-0.024
 sob-0.023
 mouth-0.021
 ther-0.021
Token "
Token ID366
Feature activation+6.71e+00

Pos logit contributions
?"+0.011
 Anna+0.010
 sob+0.009
 Mr+0.009
 mouth+0.009
Neg logit contributions
Years-0.022
Days-0.022
umm-0.022
Hours-0.020
 hours-0.020
TokenThat
Token ID2504
Feature activation+1.65e+00

Pos logit contributions
iked+0.168
Days+0.162
istence+0.151
 achievement+0.151
Hours+0.149
Neg logit contributions
 Anna-0.126
?"-0.116
 Mr-0.114
 my-0.112
 Sunny-0.108
Token's
Token ID338
Feature activation+1.11e+00

Pos logit contributions
Days+0.061
Years+0.060
umm+0.059
iked+0.057
Hours+0.057
Neg logit contributions
?"-0.018
 Anna-0.014
 sob-0.013
 mouth-0.012
 ther-0.012
Token what
Token ID644
Feature activation+9.59e-01

Pos logit contributions
Years+0.030
Days+0.029
umm+0.028
 By+0.026
Hours+0.026
Neg logit contributions
?"-0.021
 sob-0.021
 Anna-0.020
 ther-0.020
 mouth-0.019
Token it
Token ID340
Feature activation-1.19e+00

Pos logit contributions
Years+0.027
Days+0.027
umm+0.026
 By+0.025
iked+0.025
Neg logit contributions
?"-0.016
 sob-0.015
 Anna-0.015
 ther-0.014
 mouth-0.014
Token's
Token ID338
Feature activation+3.04e-01

Pos logit contributions
 sob+0.017
chard+0.015
 Anna+0.015
 ther+0.015
anut+0.014
Neg logit contributions
Years-0.035
umm-0.034
 By-0.034
Days-0.033
astic-0.032
Token dad
Token ID9955
Feature activation+8.23e-01

Pos logit contributions
Years+0.037
Days+0.037
umm+0.036
 By+0.035
iked+0.035
Neg logit contributions
?"-0.010
 Anna-0.009
 sob-0.009
 mouth-0.009
 sandwich-0.008
Token smiled
Token ID13541
Feature activation+1.79e+00

Pos logit contributions
Years+0.027
Days+0.027
umm+0.026
iked+0.025
 hours+0.025
Neg logit contributions
?"-0.011
 Anna-0.009
 sob-0.009
 ther-0.009
 mouth-0.008
Token and
Token ID290
Feature activation-7.83e-01

Pos logit contributions
Years+0.053
Days+0.053
umm+0.051
Hours+0.048
 By+0.048
Neg logit contributions
?"-0.030
 Anna-0.028
 sob-0.027
 mouth-0.025
 ther-0.025
Token said
Token ID531
Feature activation+1.26e+00

Pos logit contributions
?"+0.010
 Anna+0.008
 sob+0.008
 ther+0.008
 mouth+0.007
Neg logit contributions
Years-0.027
Days-0.026
umm-0.025
iked-0.025
 By-0.025
Token,
Token ID11
Feature activation-1.79e+00

Pos logit contributions
Years+0.038
Days+0.037
umm+0.036
 By+0.034
iked+0.034
Neg logit contributions
?"-0.021
 Anna-0.019
 sob-0.019
 ther-0.017
 mouth-0.017
Token "
Token ID366
Feature activation+6.64e+00

Pos logit contributions
?"+0.031
 Anna+0.029
 sob+0.028
 ther+0.026
 mouth+0.026
Neg logit contributions
Years-0.052
Days-0.051
umm-0.050
iked-0.047
 By-0.047
TokenYes
Token ID5297
Feature activation-4.56e-01

Pos logit contributions
iked+0.188
 achievement+0.179
umm+0.174
istence+0.170
Days+0.170
Neg logit contributions
 Anna-0.100
bug-0.097
?"-0.094
 Mia-0.094
 smelled-0.092
Token,
Token ID11
Feature activation+1.41e+00

Pos logit contributions
?"+0.007
 Anna+0.007
 sob+0.006
 ther+0.006
 mouth+0.006
Neg logit contributions
Years-0.014
Days-0.014
umm-0.013
iked-0.013
 By-0.013
Token they
Token ID484
Feature activation+5.66e-01

Pos logit contributions
Years+0.044
Days+0.044
umm+0.043
seek+0.042
Hours+0.041
Neg logit contributions
?"-0.022
 Anna-0.019
 my-0.018
 Mia-0.017
 mouth-0.017
Token are
Token ID389
Feature activation+4.75e-01

Pos logit contributions
Years+0.019
Days+0.019
umm+0.018
iked+0.018
Hours+0.017
Neg logit contributions
?"-0.008
 Anna-0.007
 sob-0.007
 sees-0.006
 mouth-0.006
Token.
Token ID13
Feature activation+2.91e+00

Pos logit contributions
Years+0.014
Days+0.014
umm+0.014
iked+0.013
Hours+0.013
Neg logit contributions
?"-0.008
 Anna-0.008
 sob-0.007
 mouth-0.007
 ther-0.007
Token mom
Token ID1995
Feature activation+8.64e-01

Pos logit contributions
?"+0.000
 Anna+0.000
 sob+0.000
 ther+0.000
chard+0.000
Neg logit contributions
Years-0.001
Days-0.001
umm-0.001
 hours-0.001
 example-0.001
Token smiled
Token ID13541
Feature activation+2.15e+00

Pos logit contributions
Years+0.028
Days+0.028
umm+0.027
 By+0.026
iked+0.026
Neg logit contributions
?"-0.012
 Anna-0.011
 sob-0.011
 ther-0.010
 mouth-0.010
Token and
Token ID290
Feature activation+5.63e-02

Pos logit contributions
Years+0.065
Days+0.065
umm+0.064
iked+0.060
Hours+0.060
Neg logit contributions
?"-0.035
 Anna-0.032
 sob-0.032
 mouth-0.029
 Mr-0.029
Token said
Token ID531
Feature activation+1.39e+00

Pos logit contributions
Days+0.002
Years+0.002
umm+0.002
 By+0.002
iked+0.002
Neg logit contributions
?"-0.001
 Anna-0.001
 sob-0.001
 Mr-0.000
 ther-0.000
Token,
Token ID11
Feature activation-2.59e-01

Pos logit contributions
Years+0.044
Days+0.043
umm+0.043
Hours+0.040
 hours+0.039
Neg logit contributions
?"-0.022
 Anna-0.021
 sob-0.019
 Mr-0.018
 Sunny-0.018
Token "
Token ID366
Feature activation+6.63e+00

Pos logit contributions
?"+0.003
 Anna+0.003
 sob+0.003
 Mr+0.003
 Fl+0.003
Neg logit contributions
umm-0.009
Days-0.008
Years-0.008
 endings-0.008
 hours-0.008
TokenThat
Token ID2504
Feature activation+1.04e+00

Pos logit contributions
iked+0.171
umm+0.160
seek+0.155
 By+0.154
 achievement+0.154
Neg logit contributions
 Anna-0.108
?"-0.106
 my-0.105
 Mr-0.105
 ther-0.103
Token's
Token ID338
Feature activation+7.91e-01

Pos logit contributions
Years+0.037
Days+0.037
umm+0.036
 By+0.035
iked+0.035
Neg logit contributions
?"-0.011
 Anna-0.010
 sob-0.009
 ther-0.008
 mouth-0.008
Token my
Token ID616
Feature activation+1.60e+00

Pos logit contributions
Years+0.023
Days+0.023
umm+0.022
 By+0.021
Hours+0.021
Neg logit contributions
?"-0.014
 sob-0.013
 Anna-0.013
 ther-0.012
 mouth-0.012
Token good
Token ID922
Feature activation+2.27e+00

Pos logit contributions
Years+0.044
Days+0.044
umm+0.043
iked+0.042
Hours+0.041
Neg logit contributions
?"-0.030
 Anna-0.030
 mouth-0.028
 sob-0.028
 sandwich-0.027
Token boy
Token ID2933
Feature activation-5.73e-01

Pos logit contributions
Days+0.073
Years+0.072
Hours+0.068
umm+0.067
Eventually+0.066
Neg logit contributions
?"-0.039
 Anna-0.035
 mouth-0.033
 sob-0.032
 bear-0.032
Token dad
Token ID9955
Feature activation+2.15e+00

Pos logit contributions
?"+0.003
 Anna+0.003
 sob+0.003
 ther+0.003
 mouth+0.003
Neg logit contributions
Years-0.009
Days-0.008
umm-0.008
iked-0.008
 By-0.008
Token laughed
Token ID13818
Feature activation+1.49e+00

Pos logit contributions
umm+0.068
Years+0.068
Days+0.068
Hours+0.063
 By+0.062
Neg logit contributions
?"-0.033
 Anna-0.030
 sob-0.030
 sees-0.026
 Mr-0.026
Token and
Token ID290
Feature activation+5.88e-01

Pos logit contributions
Days+0.045
Years+0.045
umm+0.044
Hours+0.041
iked+0.041
Neg logit contributions
?"-0.025
 Anna-0.024
 sob-0.022
 Mr-0.021
 angry-0.020
Token said
Token ID531
Feature activation+1.67e+00

Pos logit contributions
Days+0.021
Years+0.021
umm+0.021
Hours+0.019
 By+0.019
Neg logit contributions
?"-0.006
 Anna-0.006
 sob-0.006
 Mr-0.005
 mouth-0.005
Token,
Token ID11
Feature activation-6.96e-01

Pos logit contributions
Years+0.052
Days+0.051
umm+0.050
Hours+0.047
 By+0.047
Neg logit contributions
?"-0.027
 Anna-0.024
 sob-0.023
 ther-0.022
 mouth-0.022
Token "
Token ID366
Feature activation+6.62e+00

Pos logit contributions
?"+0.010
 Anna+0.009
 sob+0.009
 Mr+0.008
 mouth+0.008
Neg logit contributions
umm-0.022
Years-0.022
Days-0.022
 hours-0.021
Hours-0.021
TokenOk
Token ID18690
Feature activation-1.10e-01

Pos logit contributions
iked+0.173
 achievement+0.163
umm+0.162
Days+0.158
 example+0.154
Neg logit contributions
 Anna-0.103
?"-0.097
 saw-0.097
my-0.096
 Mr-0.096
Token,
Token ID11
Feature activation+8.21e-01

Pos logit contributions
?"+0.002
 sob+0.002
 Anna+0.002
 ther+0.002
 mouth+0.002
Neg logit contributions
Years-0.003
Days-0.003
umm-0.003
 By-0.003
 hours-0.003
Token let
Token ID1309
Feature activation+1.42e-01

Pos logit contributions
Years+0.027
Days+0.027
umm+0.026
iked+0.026
Hours+0.025
Neg logit contributions
?"-0.011
 Anna-0.010
 Mr-0.009
 my-0.009
 sob-0.009
Token's
Token ID338
Feature activation+3.02e-01

Pos logit contributions
Days+0.005
Years+0.005
umm+0.005
Hours+0.005
 By+0.005
Neg logit contributions
?"-0.002
chard-0.002
 sob-0.002
 Anna-0.001
 ther-0.001
Token get
Token ID651
Feature activation+4.02e-01

Pos logit contributions
Years+0.009
Days+0.009
umm+0.009
iked+0.008
 By+0.008
Neg logit contributions
?"-0.005
 Anna-0.005
 sob-0.004
 mouth-0.004
 ther-0.004
Token mum
Token ID25682
Feature activation+5.19e-02

Pos logit contributions
?"+0.009
 Anna+0.008
cks+0.007
 sob+0.007
 ther+0.007
Neg logit contributions
Years-0.013
Days-0.013
umm-0.013
 example-0.012
 days-0.012
Token smiled
Token ID13541
Feature activation+1.41e+00

Pos logit contributions
Years+0.002
Days+0.002
 hours+0.001
umm+0.001
 lear+0.001
Neg logit contributions
?"-0.001
 Anna-0.001
 ther-0.001
chard-0.001
 Sunny-0.001
Token and
Token ID290
Feature activation-1.31e+00

Pos logit contributions
Years+0.039
Days+0.038
umm+0.037
 By+0.035
iked+0.035
Neg logit contributions
?"-0.027
 Anna-0.025
 sob-0.024
 ther-0.023
 mouth-0.022
Token said
Token ID531
Feature activation+3.64e-01

Pos logit contributions
?"+0.020
 Anna+0.016
 ther+0.015
 sob+0.015
 mouth+0.015
Neg logit contributions
Years-0.042
Days-0.041
Hours-0.038
umm-0.038
iked-0.038
Token,
Token ID11
Feature activation-1.99e+00

Pos logit contributions
Years+0.010
Days+0.010
umm+0.009
 By+0.009
iked+0.009
Neg logit contributions
?"-0.007
 Anna-0.006
 sob-0.006
 ther-0.006
 mouth-0.006
Token "
Token ID366
Feature activation+6.62e+00

Pos logit contributions
?"+0.039
 Anna+0.035
 sob+0.034
 ther+0.034
 sandwich+0.033
Neg logit contributions
Years-0.052
Days-0.052
umm-0.049
 By-0.049
iked-0.047
TokenIt
Token ID1026
Feature activation+1.41e-01

Pos logit contributions
iked+0.186
umm+0.182
Days+0.175
 hours+0.173
 achievement+0.172
Neg logit contributions
 Anna-0.109
?"-0.108
 sob-0.098
 Sara-0.095
 Ben-0.095
Token's
Token ID338
Feature activation+9.85e-01

Pos logit contributions
Years+0.005
Days+0.005
 By+0.004
umm+0.004
 example+0.004
Neg logit contributions
?"-0.002
chard-0.002
 Anna-0.002
 ther-0.002
 camel-0.001
Token okay
Token ID8788
Feature activation+5.20e-01

Pos logit contributions
Years+0.029
Days+0.029
 By+0.027
Hours+0.027
umm+0.027
Neg logit contributions
?"-0.016
 sob-0.015
 Anna-0.015
 camel-0.014
 ther-0.014
Token,
Token ID11
Feature activation+6.86e-01

Pos logit contributions
Years+0.015
Days+0.014
umm+0.014
 By+0.013
 example+0.013
Neg logit contributions
?"-0.009
 sob-0.008
 Anna-0.008
 ther-0.008
 mouth-0.008
Token Sally
Token ID25737
Feature activation+1.42e+00

Pos logit contributions
Years+0.021
Days+0.021
umm+0.020
iked+0.020
Hours+0.020
Neg logit contributions
?"-0.011
 Anna-0.010
 sob-0.009
 Mr-0.009
 mouth-0.009
Tokenmy
Token ID1820
Feature activation+8.55e-01

Pos logit contributions
Years+0.027
Days+0.027
umm+0.027
 By+0.025
iked+0.025
Neg logit contributions
?"-0.008
 Anna-0.007
 sob-0.006
 ther-0.006
 mouth-0.006
Token smiled
Token ID13541
Feature activation+2.03e+00

Pos logit contributions
Years+0.031
Days+0.030
umm+0.030
iked+0.028
 By+0.028
Neg logit contributions
?"-0.009
 Anna-0.008
 sob-0.008
 mouth-0.007
 ther-0.007
Token and
Token ID290
Feature activation-8.67e-01

Pos logit contributions
Years+0.059
Days+0.059
umm+0.057
Hours+0.054
iked+0.054
Neg logit contributions
?"-0.035
 Anna-0.033
 sob-0.032
 mouth-0.030
 ther-0.029
Token said
Token ID531
Feature activation+1.13e+00

Pos logit contributions
?"+0.010
 Anna+0.008
 sob+0.008
 ther+0.008
 mouth+0.007
Neg logit contributions
Years-0.030
Days-0.029
umm-0.029
 By-0.028
 hours-0.028
Token,
Token ID11
Feature activation-1.07e+00

Pos logit contributions
Years+0.034
Days+0.034
umm+0.033
 By+0.031
iked+0.031
Neg logit contributions
?"-0.018
 Anna-0.017
 sob-0.016
 ther-0.015
 mouth-0.015
Token "
Token ID366
Feature activation+6.62e+00

Pos logit contributions
?"+0.017
 Anna+0.016
 sob+0.015
 mouth+0.014
 ther+0.014
Neg logit contributions
Years-0.033
Days-0.033
umm-0.032
Hours-0.030
iked-0.030
TokenThat
Token ID2504
Feature activation+1.34e+00

Pos logit contributions
iked+0.178
Days+0.157
istence+0.156
 achievement+0.155
seek+0.150
Neg logit contributions
 Anna-0.130
?"-0.110
Anna-0.110
 Sara-0.107
 smelled-0.107
Token's
Token ID338
Feature activation+6.60e-01

Pos logit contributions
Days+0.050
Years+0.050
umm+0.049
iked+0.048
Hours+0.047
Neg logit contributions
?"-0.013
 Anna-0.011
 sob-0.010
 mouth-0.009
 ther-0.008
Token right
Token ID826
Feature activation+3.19e-02

Pos logit contributions
Years+0.019
Days+0.018
umm+0.018
 By+0.017
Hours+0.017
Neg logit contributions
?"-0.012
 sob-0.011
 Anna-0.011
 ther-0.011
 mouth-0.010
Token,
Token ID11
Feature activation-3.30e-01

Pos logit contributions
umm+0.001
Years+0.001
Days+0.001
 example+0.001
 hours+0.001
Neg logit contributions
 sob-0.001
?"-0.001
 Anna-0.001
 ther-0.001
chard-0.001
Token Jill
Token ID22772
Feature activation+1.02e+00

Pos logit contributions
?"+0.005
 Anna+0.005
 sob+0.004
 mouth+0.004
 ther+0.004
Neg logit contributions
Years-0.010
Days-0.010
umm-0.010
iked-0.010
Hours-0.010
Token mom
Token ID1995
Feature activation+7.09e-01

Pos logit contributions
?"+0.003
 Anna+0.002
 ther+0.002
 Sunny+0.002
chard+0.002
Neg logit contributions
Years-0.009
Days-0.009
umm-0.008
 days-0.008
 hours-0.008
Token smiled
Token ID13541
Feature activation+2.38e+00

Pos logit contributions
Years+0.024
Days+0.024
umm+0.024
iked+0.023
 By+0.023
Neg logit contributions
?"-0.008
 Anna-0.008
 sob-0.007
 mouth-0.007
 ther-0.006
Token and
Token ID290
Feature activation-4.21e-01

Pos logit contributions
Years+0.070
Days+0.069
umm+0.068
iked+0.064
Hours+0.064
Neg logit contributions
?"-0.041
 Anna-0.038
 sob-0.038
 mouth-0.035
 ther-0.034
Token said
Token ID531
Feature activation+1.54e+00

Pos logit contributions
?"+0.005
 Anna+0.004
 sob+0.004
 ther+0.003
 mouth+0.003
Neg logit contributions
Years-0.015
Days-0.015
umm-0.014
 By-0.014
iked-0.014
Token,
Token ID11
Feature activation-9.10e-01

Pos logit contributions
Years+0.048
Days+0.048
umm+0.047
Hours+0.044
 By+0.044
Neg logit contributions
?"-0.024
 Anna-0.022
 sob-0.021
 mouth-0.019
 ther-0.019
Token "
Token ID366
Feature activation+6.61e+00

Pos logit contributions
?"+0.013
 Anna+0.012
 sob+0.012
 mouth+0.011
 Mr+0.011
Neg logit contributions
umm-0.029
Years-0.029
Days-0.029
 hours-0.027
iked-0.027
TokenThat
Token ID2504
Feature activation+1.29e+00

Pos logit contributions
iked+0.185
umm+0.167
 achievement+0.159
 By+0.158
oor+0.158
Neg logit contributions
 Anna-0.118
 Sunny-0.101
?"-0.101
 Mr-0.100
Anna-0.098
Token's
Token ID338
Feature activation+6.20e-01

Pos logit contributions
Years+0.049
Days+0.049
umm+0.048
iked+0.047
Hours+0.046
Neg logit contributions
?"-0.012
 Anna-0.009
 sob-0.009
 mouth-0.008
 ther-0.007
Token right
Token ID826
Feature activation+2.40e-01

Pos logit contributions
Years+0.019
Days+0.018
umm+0.017
 By+0.017
Hours+0.017
Neg logit contributions
?"-0.010
 sob-0.010
 Anna-0.010
 ther-0.009
 mouth-0.009
Token sweet
Token ID6029
Feature activation+1.84e+00

Pos logit contributions
umm+0.006
Years+0.006
 example+0.006
Days+0.006
 days+0.005
Neg logit contributions
 sob-0.005
 Anna-0.004
chard-0.004
?"-0.004
 ther-0.004
Tokenheart
Token ID11499
Feature activation+4.88e-01

Pos logit contributions
Years+0.045
Days+0.044
umm+0.041
Hours+0.040
iked+0.040
Neg logit contributions
?"-0.042
 Anna-0.039
 sob-0.038
 mouth-0.036
 ther-0.036
Token dad
Token ID9955
Feature activation+4.85e-01

Pos logit contributions
Years+0.021
Days+0.021
umm+0.021
Hours+0.020
 By+0.020
Neg logit contributions
?"-0.005
 sob-0.004
 Anna-0.004
 angry-0.003
 mouth-0.003
Token laughed
Token ID13818
Feature activation+1.56e+00

Pos logit contributions
Years+0.017
Days+0.016
umm+0.016
 By+0.015
iked+0.015
Neg logit contributions
?"-0.006
 Anna-0.005
 sob-0.005
 ther-0.005
 mouth-0.004
Token and
Token ID290
Feature activation-5.69e-01

Pos logit contributions
Years+0.046
Days+0.045
umm+0.045
Hours+0.042
iked+0.041
Neg logit contributions
?"-0.027
 Anna-0.026
 sob-0.025
 mouth-0.023
 Mr-0.023
Token said
Token ID531
Feature activation+1.33e+00

Pos logit contributions
?"+0.006
 Anna+0.005
 sob+0.005
 ther+0.004
 mouth+0.004
Neg logit contributions
Years-0.021
Days-0.020
umm-0.020
 By-0.019
iked-0.019
Token,
Token ID11
Feature activation-1.21e+00

Pos logit contributions
Years+0.038
Days+0.037
umm+0.036
iked+0.035
 By+0.035
Neg logit contributions
?"-0.023
 sob-0.021
 Anna-0.021
 ther-0.020
 mouth-0.019
Token "
Token ID366
Feature activation+6.60e+00

Pos logit contributions
?"+0.020
 Anna+0.019
 sob+0.018
 ther+0.017
 mouth+0.017
Neg logit contributions
Years-0.036
Days-0.035
umm-0.035
iked-0.033
Hours-0.033
TokenWell
Token ID5779
Feature activation+1.73e-01

Pos logit contributions
iked+0.166
 achievement+0.166
umm+0.155
istence+0.152
 accomplishment+0.151
Neg logit contributions
 Anna-0.127
 Mia-0.113
 Sara-0.112
Anna-0.112
 Luna-0.109
Token,
Token ID11
Feature activation+7.12e-01

Pos logit contributions
umm+0.006
Years+0.005
Days+0.005
iked+0.005
itted+0.005
Neg logit contributions
?"-0.003
 Anna-0.003
 Ben-0.002
 sob-0.002
 my-0.002
Token sort
Token ID3297
Feature activation+1.78e-01

Pos logit contributions
Years+0.024
Days+0.024
umm+0.024
iked+0.023
Hours+0.023
Neg logit contributions
?"-0.009
 Anna-0.008
 sob-0.007
 mouth-0.007
 ther-0.007
Token away
Token ID1497
Feature activation-7.45e-01

Pos logit contributions
Years+0.006
Days+0.006
umm+0.005
iked+0.005
Hours+0.005
Neg logit contributions
?"-0.003
 Anna-0.003
 ther-0.002
 sob-0.002
 Mr-0.002
Token then
Token ID788
Feature activation+4.14e-01

Pos logit contributions
?"+0.011
 Anna+0.010
 sob+0.010
 ther+0.009
 mouth+0.009
Neg logit contributions
Years-0.023
Days-0.023
umm-0.023
iked-0.021
 By-0.021
Token Emma
Token ID18966
Feature activation+1.57e+00

Pos logit contributions
Days+0.048
Years+0.047
umm+0.045
 hours+0.042
iked+0.042
Neg logit contributions
 Anna-0.041
?"-0.036
 Mr-0.035
 Sunny-0.035
 Fl-0.035
Token smiled
Token ID13541
Feature activation+2.59e+00

Pos logit contributions
Years+0.050
Days+0.049
umm+0.049
iked+0.046
 By+0.045
Neg logit contributions
?"-0.023
 Anna-0.021
 sob-0.021
 mouth-0.019
 ther-0.019
Token and
Token ID290
Feature activation+4.43e-01

Pos logit contributions
Days+0.078
Years+0.078
umm+0.076
Hours+0.071
iked+0.071
Neg logit contributions
?"-0.043
 Anna-0.040
 sob-0.039
 mouth-0.037
 Mr-0.035
Token said
Token ID531
Feature activation+1.46e+00

Pos logit contributions
Days+0.015
Years+0.015
umm+0.015
 By+0.014
Hours+0.014
Neg logit contributions
?"-0.005
 Anna-0.005
 sob-0.005
 mouth-0.004
 ther-0.004
Token,
Token ID11
Feature activation-2.83e-01

Pos logit contributions
Years+0.048
umm+0.048
Days+0.047
Hours+0.044
 hours+0.043
Neg logit contributions
?"-0.021
 Anna-0.020
 sob-0.018
 Sunny-0.016
 mouth-0.016
Token "
Token ID366
Feature activation+6.60e+00

Pos logit contributions
?"+0.004
 Anna+0.003
 sob+0.003
 Mr+0.003
 Sunny+0.003
Neg logit contributions
umm-0.010
Days-0.009
Years-0.009
 endings-0.009
 hours-0.009
TokenSee
Token ID6214
Feature activation+4.51e-01

Pos logit contributions
iked+0.167
Days+0.162
umm+0.157
Years+0.157
 achievement+0.156
Neg logit contributions
 Anna-0.124
?"-0.113
 Sunny-0.111
 Ben-0.107
bug-0.104
Token,
Token ID11
Feature activation+1.45e-01

Pos logit contributions
Years+0.013
Days+0.013
umm+0.013
 By+0.012
iked+0.012
Neg logit contributions
?"-0.007
 sob-0.007
 Anna-0.007
 ther-0.007
 mouth-0.006
Token I
Token ID314
Feature activation-8.74e-02

Pos logit contributions
Years+0.004
Days+0.004
umm+0.004
iked+0.004
Hours+0.004
Neg logit contributions
?"-0.002
 Anna-0.002
 sob-0.002
 ther-0.002
 mouth-0.002
Token told
Token ID1297
Feature activation-4.78e-01

Pos logit contributions
?"+0.001
 Anna+0.001
 sob+0.001
 ther+0.001
 Sunny+0.001
Neg logit contributions
Years-0.003
Days-0.002
 By-0.002
umm-0.002
 hours-0.002
Token you
Token ID345
Feature activation+1.68e-01

Pos logit contributions
?"+0.008
 sob+0.007
 Anna+0.007
 ther+0.007
 mouth+0.007
Neg logit contributions
Years-0.014
Days-0.014
umm-0.013
 By-0.013
iked-0.013
Token engine
Token ID3113
Feature activation+1.57e+00

Pos logit contributions
umm+0.037
Years+0.035
iked+0.034
Days+0.034
 By+0.033
Neg logit contributions
?"-0.019
 sob-0.016
 bear-0.016
 Anna-0.016
 angry-0.016
Token smiled
Token ID13541
Feature activation+2.28e+00

Pos logit contributions
umm+0.051
Years+0.049
Days+0.049
 By+0.046
iked+0.046
Neg logit contributions
?"-0.022
 sob-0.021
 Anna-0.020
 mouth-0.019
 bear-0.018
Token and
Token ID290
Feature activation+6.89e-02

Pos logit contributions
Years+0.067
Days+0.067
umm+0.065
iked+0.061
 By+0.061
Neg logit contributions
?"-0.039
 Anna-0.036
 sob-0.035
 mouth-0.033
 Mr-0.032
Token said
Token ID531
Feature activation+3.83e-01

Pos logit contributions
Days+0.002
Years+0.002
umm+0.002
 By+0.002
iked+0.002
Neg logit contributions
?"-0.001
 Anna-0.001
 sob-0.001
 mouth-0.001
 Mr-0.001
Token,
Token ID11
Feature activation-1.43e+00

Pos logit contributions
Years+0.011
Days+0.011
umm+0.011
iked+0.010
 By+0.010
Neg logit contributions
?"-0.006
 Anna-0.006
 sob-0.006
 ther-0.005
 mouth-0.005
Token "
Token ID366
Feature activation+6.59e+00

Pos logit contributions
?"+0.021
 Anna+0.019
 sob+0.018
 Mr+0.017
 mouth+0.017
Neg logit contributions
umm-0.046
Days-0.046
Years-0.046
 hours-0.042
iked-0.042
TokenYes
Token ID5297
Feature activation-1.27e+00

Pos logit contributions
iked+0.178
 achievement+0.167
umm+0.161
 By+0.157
 err+0.156
Neg logit contributions
 Anna-0.111
Anna-0.100
 Sunny-0.099
?"-0.097
 Fl-0.094
Token!
Token ID0
Feature activation+2.43e+00

Pos logit contributions
 sob+0.024
?"+0.023
 ther+0.022
chard+0.022
 Anna+0.022
Neg logit contributions
Years-0.033
Days-0.032
umm-0.032
 By-0.030
 hours-0.030
Token I
Token ID314
Feature activation+8.11e-01

Pos logit contributions
Days+0.071
Years+0.070
umm+0.065
 hours+0.064
Hours+0.063
Neg logit contributions
?"-0.045
 Anna-0.043
 Mr-0.038
 mouth-0.038
 Fl-0.037
Token accept
Token ID2453
Feature activation-8.05e-01

Pos logit contributions
umm+0.026
Days+0.025
Years+0.025
iked+0.024
Hours+0.023
Neg logit contributions
?"-0.013
 sob-0.011
 Anna-0.011
 sees-0.010
 bear-0.010
Token!"
Token ID2474
Feature activation+5.33e-01

Pos logit contributions
?"+0.010
 Anna+0.009
 sob+0.008
 mouth+0.007
 Mr+0.007
Neg logit contributions
Years-0.028
Days-0.028
umm-0.028
iked-0.026
Hours-0.026
Token hugged
Token ID41130
Feature activation+1.18e+00

Pos logit contributions
Years+0.023
Days+0.023
umm+0.021
 By+0.021
 hours+0.021
Neg logit contributions
?"-0.011
 Anna-0.010
 ther-0.010
 sob-0.009
 Sunny-0.009
Token her
Token ID607
Feature activation+7.46e-01

Pos logit contributions
Years+0.036
Days+0.035
umm+0.035
 By+0.033
iked+0.033
Neg logit contributions
?"-0.019
 Anna-0.017
 sob-0.016
 ther-0.016
 mouth-0.015
Token and
Token ID290
Feature activation-5.02e-01

Pos logit contributions
Years+0.020
Days+0.020
umm+0.019
 By+0.018
iked+0.018
Neg logit contributions
?"-0.015
 Anna-0.014
 sob-0.013
 ther-0.013
 mouth-0.012
Token said
Token ID531
Feature activation+1.01e+00

Pos logit contributions
?"+0.006
 Anna+0.005
 ther+0.005
 sob+0.005
 mouth+0.004
Neg logit contributions
Years-0.017
Days-0.016
umm-0.016
 hours-0.016
 By-0.016
Token,
Token ID11
Feature activation-1.10e+00

Pos logit contributions
Years+0.029
Days+0.029
umm+0.028
 By+0.027
iked+0.027
Neg logit contributions
?"-0.017
 Anna-0.016
 sob-0.016
 ther-0.015
 mouth-0.014
Token "
Token ID366
Feature activation+6.58e+00

Pos logit contributions
?"+0.017
 Anna+0.016
 sob+0.015
 mouth+0.014
 Mr+0.014
Neg logit contributions
Years-0.034
Days-0.034
umm-0.034
Hours-0.031
iked-0.031
TokenSee
Token ID6214
Feature activation+3.30e-01

Pos logit contributions
iked+0.173
umm+0.163
Days+0.161
Years+0.160
istence+0.159
Neg logit contributions
?"-0.118
 Anna-0.117
 sob-0.109
 Sara-0.108
 Mr-0.105
Token what
Token ID644
Feature activation-1.94e-01

Pos logit contributions
Years+0.009
umm+0.009
Days+0.009
 By+0.008
 example+0.008
Neg logit contributions
 sob-0.005
?"-0.005
 Anna-0.005
 ther-0.005
chard-0.005
Token happens
Token ID4325
Feature activation-5.82e-02

Pos logit contributions
?"+0.003
 sob+0.003
 Anna+0.003
 ther+0.003
 Frost+0.003
Neg logit contributions
Years-0.005
Days-0.005
umm-0.005
 By-0.005
 hours-0.005
Token when
Token ID618
Feature activation-3.20e-01

Pos logit contributions
 sob+0.001
?"+0.001
 ther+0.001
 Anna+0.001
 sandwich+0.001
Neg logit contributions
Years-0.001
Days-0.001
umm-0.001
 By-0.001
 hours-0.001
Token you
Token ID345
Feature activation-8.12e-01

Pos logit contributions
?"+0.005
 Anna+0.005
 sob+0.005
 ther+0.004
 mouth+0.004
Neg logit contributions
Years-0.010
Days-0.010
umm-0.009
iked-0.009
 By-0.009
Token you
Token ID345
Feature activation-1.67e+00

Pos logit contributions
 sob+0.011
maid+0.010
 camel+0.010
 ther+0.010
aj+0.010
Neg logit contributions
Days-0.011
Years-0.011
 dedication-0.010
 days-0.010
 By-0.010
Token help
Token ID1037
Feature activation-3.82e+00

Pos logit contributions
?"+0.035
 Anna+0.035
 sob+0.034
 ther+0.032
 mouth+0.032
Neg logit contributions
Years-0.041
Days-0.040
umm-0.038
Hours-0.036
 By-0.036
Token me
Token ID502
Feature activation-4.16e+00

Pos logit contributions
 ther+0.065
aj+0.062
ccoli+0.061
maid+0.060
 sob+0.060
Neg logit contributions
Years-0.107
Days-0.103
 days-0.096
 hours-0.094
 example-0.094
Token unlock
Token ID12116
Feature activation-5.95e+00

Pos logit contributions
 Anna+0.065
 sob+0.059
 ther+0.058
 Leo+0.057
 Luna+0.057
Neg logit contributions
Years-0.120
Days-0.117
Hours-0.110
 example-0.109
umm-0.109
Token the
Token ID262
Feature activation-7.68e-01

Pos logit contributions
 sob+0.100
 camel+0.088
 ther+0.086
anut+0.086
maid+0.085
Neg logit contributions
 hours-0.164
Years-0.164
 days-0.163
 months-0.160
 weeks-0.159
Token door
Token ID3420
Feature activation-7.23e+00

Pos logit contributions
?"+0.013
 Anna+0.012
 sob+0.011
 mouth+0.011
 ther+0.011
Neg logit contributions
Years-0.023
Days-0.023
umm-0.022
iked-0.021
 By-0.021
Token?"
Token ID1701
Feature activation-4.75e+00

Pos logit contributions
 sob+0.035
 Milo+0.032
 Frost+0.031
 Anna+0.031
anut+0.028
Neg logit contributions
Years-0.279
 By-0.268
Days-0.265
umm-0.265
way-0.262
Token the
Token ID262
Feature activation-1.15e+00

Pos logit contributions
aj+0.083
maid+0.077
 sandwich+0.077
 ther+0.075
anut+0.075
Neg logit contributions
Years-0.128
 By-0.123
Days-0.123
 hours-0.119
 months-0.119
Token cater
Token ID20825
Feature activation-1.92e-01

Pos logit contributions
?"+0.016
 Anna+0.015
 sob+0.014
 ther+0.014
 Frost+0.013
Neg logit contributions
Days-0.036
Years-0.035
umm-0.035
 example-0.033
 days-0.033
Tokenpillar
Token ID41643
Feature activation-4.30e+00

Pos logit contributions
?"+0.006
 Anna+0.005
 sob+0.005
 ther+0.005
 mouth+0.005
Neg logit contributions
Years-0.003
Days-0.003
umm-0.003
 By-0.003
 lear-0.003
Token said
Token ID531
Feature activation-1.38e+00

Pos logit contributions
 ther+0.081
 trash+0.076
?"+0.074
 Anna+0.073
 mac+0.073
Neg logit contributions
Years-0.102
Days-0.100
udging-0.094
way-0.094
All-0.093
TokenWhat
Token ID2061
Feature activation-9.29e-01

Pos logit contributions
Years+0.085
Days+0.083
umm+0.081
iked+0.077
 By+0.076
Neg logit contributions
?"-0.084
 Anna-0.079
 sob-0.078
 ther-0.074
 mouth-0.072
Token is
Token ID318
Feature activation-6.85e-01

Pos logit contributions
anut+0.016
aj+0.016
 Nib+0.016
 sob+0.015
 ther+0.015
Neg logit contributions
ending-0.022
Days-0.022
Years-0.022
 endings-0.022
 By-0.022
Token happening
Token ID5836
Feature activation-4.32e+00

Pos logit contributions
 sob+0.014
?"+0.013
 Anna+0.013
 ther+0.013
aj+0.012
Neg logit contributions
Years-0.016
Days-0.016
umm-0.016
 By-0.015
 example-0.015
Token to
Token ID284
Feature activation-3.10e+00

Pos logit contributions
 sob+0.044
cks+0.037
anut+0.036
 Anna+0.035
 Luna+0.034
Neg logit contributions
umm-0.145
 By-0.144
Years-0.144
 example-0.143
way-0.138
Token my
Token ID616
Feature activation-1.44e+00

Pos logit contributions
aj+0.049
 sob+0.046
 scrap+0.042
ighed+0.041
chard+0.040
Neg logit contributions
umm-0.087
 By-0.085
Years-0.082
 example-0.081
Days-0.080
Token mushroom
Token ID28520
Feature activation-7.16e+00

Pos logit contributions
?"+0.019
 sob+0.019
 Anna+0.018
 ther+0.017
 Frost+0.016
Neg logit contributions
umm-0.047
Years-0.045
Days-0.044
 example-0.042
 days-0.042
Token?"
Token ID1701
Feature activation-2.12e+00

Pos logit contributions
 Frost+0.062
 Anna+0.062
cks+0.060
 sob+0.055
 Lia+0.048
Neg logit contributions
umm-0.247
 By-0.240
Years-0.240
way-0.231
Days-0.230
Token
Token ID198
Feature activation-1.70e+00

Pos logit contributions
?"+0.040
 ther+0.038
maid+0.038
anut+0.038
 sandwich+0.037
Neg logit contributions
Years-0.054
Days-0.052
 By-0.050
Hours-0.049
 Hours-0.048
Token
Token ID198
Feature activation-2.77e-02

Pos logit contributions
 sandwich+0.035
chard+0.035
?"+0.035
 mouth+0.035
maid+0.035
Neg logit contributions
Years-0.041
Days-0.040
 By-0.038
Hours-0.037
Eventually-0.037
TokenJust
Token ID5703
Feature activation-9.83e-01

Pos logit contributions
?"+0.001
 Anna+0.001
 sob+0.001
 ther+0.001
 mouth+0.001
Neg logit contributions
Years-0.001
Days-0.001
umm-0.001
 By-0.001
Hours-0.001
Token then
Token ID788
Feature activation-1.55e+00

Pos logit contributions
?"+0.026
 Anna+0.026
 sob+0.026
 ther+0.025
 mouth+0.024
Neg logit contributions
Years-0.018
Days-0.017
 hours-0.016
umm-0.016
 By-0.015
Token"
Token ID1
Feature activation+3.21e+00

Pos logit contributions
umm+0.031
Years+0.029
istence+0.029
 lear+0.028
Days+0.028
Neg logit contributions
?"-0.016
 Anna-0.014
 ther-0.014
 sob-0.013
 Mia-0.013
TokenWhat
Token ID2061
Feature activation-1.39e+00

Pos logit contributions
iked+0.084
umm+0.083
Days+0.083
Years+0.082
 By+0.081
Neg logit contributions
 Anna-0.060
?"-0.059
 sob-0.056
 ther-0.054
chard-0.052
Token is
Token ID318
Feature activation-9.91e-01

Pos logit contributions
 sob+0.020
?"+0.019
 ther+0.019
 Anna+0.019
 Nib+0.019
Neg logit contributions
Years-0.040
Days-0.039
 By-0.038
umm-0.038
 example-0.037
Token in
Token ID287
Feature activation-3.43e+00

Pos logit contributions
?"+0.016
 Anna+0.014
 ther+0.012
 sob+0.012
 Mr+0.012
Neg logit contributions
Days-0.032
Years-0.032
umm-0.032
iked-0.029
Hours-0.029
Token the
Token ID262
Feature activation-7.42e-02

Pos logit contributions
 sob+0.037
?"+0.036
 Anna+0.032
 ther+0.030
 mouth+0.030
Neg logit contributions
Years-0.117
Days-0.116
umm-0.112
 example-0.111
 By-0.110
Token box
Token ID3091
Feature activation-7.16e+00

Pos logit contributions
?"+0.001
 Anna+0.001
 ther+0.001
 sob+0.001
 mouth+0.001
Neg logit contributions
Years-0.003
Days-0.003
umm-0.003
iked-0.002
Hours-0.002
Token?"
Token ID1701
Feature activation-3.86e+00

Pos logit contributions
 Luna+0.056
 Frost+0.055
 Milo+0.054
 sob+0.054
 Anna+0.053
Neg logit contributions
Years-0.251
umm-0.248
 By-0.243
way-0.240
 months-0.238
Token Sam
Token ID3409
Feature activation-2.90e+00

Pos logit contributions
?"+0.045
 sob+0.038
 ther+0.037
anut+0.036
maid+0.035
Neg logit contributions
Years-0.128
Days-0.126
 By-0.122
umm-0.122
 hours-0.121
Token asked
Token ID1965
Feature activation-2.21e+00

Pos logit contributions
?"+0.049
 ther+0.049
 Anna+0.046
 Frost+0.045
 mouth+0.044
Neg logit contributions
Years-0.079
Days-0.077
umm-0.072
 hours-0.072
 By-0.071
Token.
Token ID13
Feature activation-2.65e+00

Pos logit contributions
?"+0.044
anut+0.041
 sees+0.040
aj+0.040
cks+0.040
Neg logit contributions
Years-0.051
iked-0.051
Days-0.049
 example-0.049
 days-0.048
Token
Token ID198
Feature activation-1.59e+00

Pos logit contributions
?"+0.041
 sob+0.036
 Anna+0.036
 ther+0.034
 mouth+0.032
Neg logit contributions
Years-0.082
Days-0.081
umm-0.079
 By-0.076
iked-0.075
TokenWhat
Token ID2061
Feature activation-9.91e-01

Pos logit contributions
Years+0.098
Days+0.096
umm+0.093
iked+0.093
istence+0.090
Neg logit contributions
?"-0.068
 Anna-0.064
 ther-0.060
 sob-0.060
 Frost-0.056
Token do
Token ID466
Feature activation-7.91e-01

Pos logit contributions
 sob+0.015
anut+0.014
 Nib+0.014
aj+0.014
 ther+0.014
Neg logit contributions
Years-0.026
umm-0.026
 By-0.026
 days-0.025
Days-0.025
Token you
Token ID345
Feature activation-1.46e+00

Pos logit contributions
 sob+0.017
?"+0.015
 frown+0.015
 limp+0.015
anut+0.015
Neg logit contributions
Years-0.019
Days-0.018
umm-0.018
 days-0.018
 By-0.017
Token want
Token ID765
Feature activation-3.03e+00

Pos logit contributions
 Anna+0.029
?"+0.028
 sob+0.028
 ther+0.027
 Sunny+0.026
Neg logit contributions
Years-0.037
Days-0.036
umm-0.035
 By-0.034
 hours-0.034
Token to
Token ID284
Feature activation-1.81e+00

Pos logit contributions
 sob+0.045
 Anna+0.041
 ther+0.040
anut+0.038
cks+0.038
Neg logit contributions
Years-0.090
 By-0.087
umm-0.086
Days-0.085
 example-0.084
Token eat
Token ID4483
Feature activation-7.15e+00

Pos logit contributions
 sob+0.034
 ther+0.032
 Anna+0.030
 Luna+0.030
 scrap+0.030
Neg logit contributions
Years-0.046
umm-0.045
Days-0.044
 By-0.043
 example-0.043
Token,
Token ID11
Feature activation-4.87e+00

Pos logit contributions
 sob+0.082
 Milo+0.066
 camel+0.065
 scrap+0.065
 Luna+0.063
Neg logit contributions
Years-0.232
 months-0.227
 days-0.226
umm-0.224
 example-0.222
Token Mia
Token ID32189
Feature activation-4.79e+00

Pos logit contributions
 Claus+0.076
 scrap+0.073
bo+0.072
cks+0.071
chard+0.071
Neg logit contributions
Years-0.134
 example-0.129
Days-0.129
 days-0.124
umm-0.123
Token?"
Token ID1701
Feature activation-4.16e+00

Pos logit contributions
 Frost+0.068
 Nib+0.067
 Lia+0.065
 ther+0.064
 Fl+0.063
Neg logit contributions
working-0.128
Years-0.122
 months-0.117
 planned-0.117
 days-0.117
Token her
Token ID607
Feature activation-2.33e+00

Pos logit contributions
anut+0.069
aj+0.066
ccoli+0.066
maid+0.065
 Claus+0.064
Neg logit contributions
Years-0.107
 days-0.106
 By-0.103
 hours-0.103
Days-0.102
Token mom
Token ID1995
Feature activation-3.75e+00

Pos logit contributions
cks+0.039
 Nib+0.038
?"+0.038
anut+0.037
chard+0.035
Neg logit contributions
Years-0.060
Days-0.059
 days-0.059
 example-0.057
 hours-0.057
TokenWhy
Token ID5195
Feature activation-9.21e-01

Pos logit contributions
Years+0.082
Days+0.080
umm+0.076
Hours+0.072
 lear+0.071
Neg logit contributions
?"-0.085
 mouth-0.078
 Anna-0.077
 sob-0.076
 ther-0.075
Token are
Token ID389
Feature activation-3.54e-02

Pos logit contributions
anut+0.019
chard+0.019
 sob+0.018
 ther+0.018
cks+0.017
Neg logit contributions
astic-0.018
 lear-0.018
All-0.018
ending-0.018
 endings-0.018
Token you
Token ID345
Feature activation-1.66e+00

Pos logit contributions
 sob+0.001
?"+0.001
maid+0.001
 wound+0.001
 ther+0.001
Neg logit contributions
Days-0.001
Years-0.001
iked-0.001
 days-0.001
 By-0.001
Token wearing
Token ID5762
Feature activation-4.83e+00

Pos logit contributions
 ther+0.037
 trash+0.036
?"+0.035
 sandwich+0.035
kins+0.035
Neg logit contributions
 By-0.031
Days-0.030
Years-0.030
istence-0.029
 example-0.029
Token a
Token ID257
Feature activation-2.10e+00

Pos logit contributions
bug+0.088
 ther+0.085
anut+0.083
 sob+0.080
bi+0.080
Neg logit contributions
 hours-0.123
 days-0.121
Years-0.117
 lessons-0.117
 example-0.112
Token mask
Token ID9335
Feature activation-7.03e+00

Pos logit contributions
 Frost+0.032
cks+0.032
bug+0.032
 Sunny+0.030
chard+0.030
Neg logit contributions
Years-0.057
Days-0.055
 hours-0.053
 days-0.052
itted-0.051
Token?"
Token ID1701
Feature activation-4.63e+00

Pos logit contributions
 Anna+0.044
 Frost+0.041
 Milo+0.040
 ther+0.040
cks+0.039
Neg logit contributions
Years-0.261
umm-0.254
Days-0.247
 lear-0.245
 By-0.244
Token she
Token ID673
Feature activation-4.74e+00

Pos logit contributions
maid+0.122
anut+0.119
aj+0.115
cks+0.114
ccoli+0.113
Neg logit contributions
 By-0.081
Well-0.081
 Soon-0.080
 Now-0.078
Years-0.077
Token asked
Token ID1965
Feature activation-2.87e+00

Pos logit contributions
 ther+0.118
 sandwich+0.111
 bowl+0.107
cks+0.106
ccoli+0.106
Neg logit contributions
Years-0.084
Days-0.081
 hours-0.077
 days-0.075
 months-0.073
Token.
Token ID13
Feature activation-3.15e+00

Pos logit contributions
aj+0.070
 camel+0.067
anut+0.066
cks+0.066
 ther+0.065
Neg logit contributions
 lear-0.051
 hours-0.050
 days-0.050
Years-0.048
 months-0.048
Token
Token ID198
Feature activation-2.98e+00

Pos logit contributions
cks+0.073
anut+0.071
maid+0.070
?"+0.068
 camel+0.068
Neg logit contributions
 By-0.067
Years-0.064
 Hours-0.062
 Along-0.060
All-0.060
TokenCan
Token ID6090
Feature activation-7.51e-01

Pos logit contributions
Years+0.051
Days+0.050
umm+0.047
Hours+0.045
Eventually+0.043
Neg logit contributions
?"-0.060
 Anna-0.055
 mouth-0.055
 sob-0.053
maid-0.053
Token I
Token ID314
Feature activation-1.82e+00

Pos logit contributions
cks+0.021
maid+0.020
 camel+0.020
aj+0.020
anut+0.019
Neg logit contributions
Days-0.011
Years-0.011
 By-0.010
 days-0.010
t-0.010
Token play
Token ID711
Feature activation-4.22e+00

Pos logit contributions
 sob+0.039
aj+0.039
cks+0.039
 Anna+0.036
maid+0.035
Neg logit contributions
Years-0.035
Days-0.035
 lear-0.034
�-0.033
 By-0.033
Token with
Token ID351
Feature activation-4.95e+00

Pos logit contributions
aj+0.104
bie+0.104
 sob+0.098
maid+0.098
 ther+0.098
Neg logit contributions
 example-0.076
Years-0.074
 lessons-0.069
Days-0.066
 hours-0.066
Token your
Token ID534
Feature activation-1.61e+00

Pos logit contributions
aj+0.114
 sob+0.113
 camel+0.107
 ther+0.105
 limp+0.103
Neg logit contributions
 days-0.097
 hours-0.091
 weeks-0.090
 months-0.089
 example-0.089
Token puppy
Token ID26188
Feature activation-7.02e+00

Pos logit contributions
?"+0.022
 sob+0.022
cks+0.022
bug+0.021
chard+0.021
Neg logit contributions
 days-0.044
 example-0.042
 achievement-0.041
umm-0.041
 hours-0.040
Token?"
Token ID1701
Feature activation-3.61e+00

Pos logit contributions
cks+0.138
 sob+0.122
 ther+0.120
 Frost+0.113
 Lia+0.111
Neg logit contributions
 example-0.151
 days-0.145
working-0.144
 lessons-0.143
Years-0.140
Token asked
Token ID1965
Feature activation-4.09e+00

Pos logit contributions
maid+0.087
anut+0.082
cks+0.081
 ther+0.078
?"+0.074
Neg logit contributions
 By-0.072
 hours-0.069
Years-0.069
 Hours-0.068
 days-0.067
Token Tim
Token ID5045
Feature activation-1.10e+00

Pos logit contributions
cks+0.091
anut+0.087
 camel+0.085
maid+0.081
aj+0.079
Neg logit contributions
 days-0.090
 hours-0.090
Years-0.089
 By-0.089
 Hours-0.087
Tokenmy
Token ID1820
Feature activation-2.28e+00

Pos logit contributions
 ther+0.025
 Frost+0.022
 Sunny+0.022
 camel+0.022
cks+0.022
Neg logit contributions
Years-0.024
Days-0.022
 example-0.021
udging-0.020
ending-0.020
Token.
Token ID13
Feature activation-2.71e+00

Pos logit contributions
 ther+0.052
?"+0.047
 sandwich+0.047
 camel+0.047
 Sunny+0.046
Neg logit contributions
Years-0.048
Days-0.046
 By-0.043
 example-0.043
iked-0.043
Token!
Token ID0
Feature activation+1.95e-01

Pos logit contributions
 sob+0.048
chard+0.044
 scrap+0.043
 ther+0.043
 Anna+0.043
Neg logit contributions
umm-0.074
Years-0.068
 example-0.068
 lear-0.068
 By-0.066
Token Would
Token ID10928
Feature activation-2.17e+00

Pos logit contributions
umm+0.004
 By+0.004
Years+0.004
Days+0.004
 lear+0.004
Neg logit contributions
?"-0.004
 sob-0.004
 angry-0.004
cks-0.004
anut-0.004
Token you
Token ID345
Feature activation-1.52e+00

Pos logit contributions
 sob+0.040
?"+0.040
 Anna+0.037
 mouth+0.037
maid+0.036
Neg logit contributions
Years-0.057
Days-0.055
umm-0.053
 By-0.053
iked-0.051
Token like
Token ID588
Feature activation-3.18e+00

Pos logit contributions
 sob+0.035
 Anna+0.032
?"+0.032
 mouth+0.032
 ther+0.031
Neg logit contributions
Years-0.034
Days-0.032
 hours-0.031
umm-0.031
 By-0.031
Token a
Token ID257
Feature activation-1.10e+00

Pos logit contributions
 sob+0.067
 ther+0.062
 Anna+0.061
 angry+0.061
anut+0.061
Neg logit contributions
Years-0.075
Days-0.070
 By-0.070
 hours-0.069
 example-0.068
Token cookie
Token ID19751
Feature activation-7.00e+00

Pos logit contributions
cks+0.022
 sob+0.021
chard+0.020
?"+0.019
anut+0.019
Neg logit contributions
Years-0.028
Days-0.027
 hours-0.026
 lear-0.026
umm-0.025
Token?"
Token ID1701
Feature activation-3.98e+00

Pos logit contributions
cks+0.056
 sob+0.052
 Frost+0.043
 mouth+0.041
 Milo+0.040
Neg logit contributions
Years-0.252
umm-0.244
 example-0.237
 occasion-0.235
 months-0.235
Token she
Token ID673
Feature activation-4.86e+00

Pos logit contributions
cks+0.109
anut+0.108
maid+0.107
aj+0.104
?"+0.103
Neg logit contributions
 By-0.062
 Hours-0.057
Years-0.056
 Soon-0.055
 lear-0.054
Token asked
Token ID1965
Feature activation-2.35e+00

Pos logit contributions
 ther+0.121
anut+0.119
cks+0.118
eds+0.111
 poop+0.111
Neg logit contributions
 lear-0.080
Years-0.079
 hours-0.077
 months-0.075
 weeks-0.073
Token her
Token ID607
Feature activation-2.69e+00

Pos logit contributions
?"+0.050
cks+0.049
 sees+0.048
 camel+0.048
 ther+0.047
Neg logit contributions
 lear-0.048
 hours-0.048
 days-0.048
iked-0.047
 months-0.047
Token t
Token ID256
Feature activation+3.29e-01

Pos logit contributions
cks+0.057
 ther+0.048
 hurts+0.046
 breat+0.045
?"+0.045
Neg logit contributions
 example-0.062
Years-0.062
 days-0.060
 hours-0.058
 months-0.058
Token"
Token ID1
Feature activation+2.24e+00

Pos logit contributions
?"+0.007
 Anna+0.006
 sob+0.005
 ther+0.005
 Luna+0.005
Neg logit contributions
umm-0.014
Years-0.013
iked-0.012
 By-0.012
Days-0.012
TokenCan
Token ID6090
Feature activation-6.40e-01

Pos logit contributions
Years+0.050
Days+0.050
umm+0.045
Hours+0.044
 lear+0.042
Neg logit contributions
?"-0.055
 Anna-0.050
 sob-0.050
 mouth-0.049
 ther-0.048
Token I
Token ID314
Feature activation-1.68e+00

Pos logit contributions
maid+0.018
aj+0.018
anut+0.018
cks+0.018
ackers+0.017
Neg logit contributions
Days-0.007
t-0.007
 By-0.007
 days-0.007
Years-0.007
Token touch
Token ID3638
Feature activation-4.67e+00

Pos logit contributions
 sob+0.033
 Anna+0.031
aj+0.031
 ther+0.030
 Milo+0.030
Neg logit contributions
Years-0.039
Days-0.038
Hours-0.035
 lear-0.035
 By-0.035
Token the
Token ID262
Feature activation-2.57e+00

Pos logit contributions
aj+0.135
 sob+0.126
 Claus+0.124
ccoli+0.122
 ther+0.116
Neg logit contributions
 days-0.072
 months-0.062
 hours-0.062
 example-0.062
 weeks-0.062
Token flower
Token ID15061
Feature activation-7.00e+00

Pos logit contributions
 Frost+0.047
 sob+0.046
 Milo+0.045
aj+0.044
ccoli+0.044
Neg logit contributions
 days-0.062
umm-0.061
 perspective-0.058
 example-0.058
 hours-0.058
Token?"
Token ID1701
Feature activation-4.20e+00

Pos logit contributions
 Frost+0.097
 scrap+0.091
 sob+0.089
 Rudy+0.085
 Milo+0.083
Neg logit contributions
umm-0.186
way-0.173
 example-0.172
 By-0.171
Years-0.169
Token Lily
Token ID20037
Feature activation-2.79e+00

Pos logit contributions
anut+0.077
maid+0.075
ccoli+0.072
 sandwich+0.068
aj+0.068
Neg logit contributions
 By-0.102
 hours-0.100
 Hours-0.100
 days-0.099
 Soon-0.099
Token asked
Token ID1965
Feature activation-2.97e+00

Pos logit contributions
 ther+0.053
?"+0.052
 Frost+0.050
maid+0.049
 Nib+0.049
Neg logit contributions
Years-0.066
Days-0.064
udging-0.061
 hours-0.061
 By-0.060
Token Tom
Token ID4186
Feature activation-1.45e+00

Pos logit contributions
?"+0.059
 camel+0.059
anut+0.056
cks+0.056
 sandwich+0.056
Neg logit contributions
iked-0.065
Years-0.064
 days-0.064
 hours-0.063
 lear-0.062
Token.
Token ID13
Feature activation-2.43e+00

Pos logit contributions
 ther+0.035
?"+0.034
 sandwich+0.032
 trash+0.032
 our+0.031
Neg logit contributions
Years-0.025
Days-0.025
iked-0.024
ending-0.024
 example-0.023
Token put
Token ID1234
Feature activation-4.17e+00

Pos logit contributions
 sob+0.034
 Milo+0.031
chard+0.031
maid+0.031
 ther+0.031
Neg logit contributions
Years-0.055
Days-0.053
 lear-0.052
 hours-0.052
 By-0.051
Token hot
Token ID3024
Feature activation-4.11e+00

Pos logit contributions
 sob+0.094
 camel+0.078
 whim+0.076
aj+0.076
 limp+0.072
Neg logit contributions
 days-0.095
 example-0.093
 weeks-0.090
 hours-0.090
 months-0.089
Token sauce
Token ID10746
Feature activation-5.93e+00

Pos logit contributions
 sob+0.067
 Anna+0.057
 whim+0.055
 Lia+0.054
 Luna+0.054
Neg logit contributions
umm-0.118
 days-0.117
 hours-0.111
 months-0.110
 weeks-0.110
Token on
Token ID319
Feature activation-5.65e+00

Pos logit contributions
 sob+0.081
 Anna+0.071
 mouth+0.071
 Luna+0.069
 Frost+0.067
Neg logit contributions
Years-0.182
umm-0.175
Days-0.171
 By-0.171
iked-0.167
Token our
Token ID674
Feature activation-9.84e-01

Pos logit contributions
 sob+0.111
 camel+0.095
aj+0.091
 ther+0.091
maid+0.090
Neg logit contributions
 days-0.134
 months-0.132
 example-0.131
 weeks-0.130
Years-0.130
Token sandwiches
Token ID29777
Feature activation-6.99e+00

Pos logit contributions
?"+0.019
 sob+0.017
 Anna+0.017
bug+0.016
chard+0.016
Neg logit contributions
 days-0.025
Days-0.025
Years-0.025
 hours-0.024
umm-0.024
Token?"
Token ID1701
Feature activation-3.51e+00

Pos logit contributions
 sob+0.061
 Frost+0.053
 Luna+0.053
 Milo+0.052
cks+0.052
Neg logit contributions
Years-0.240
umm-0.234
 By-0.226
 days-0.225
Days-0.224
Token Anna
Token ID11735
Feature activation-3.50e+00

Pos logit contributions
anut+0.041
ccoli+0.036
aj+0.034
cks+0.033
?"+0.032
Neg logit contributions
 By-0.114
 hours-0.113
 months-0.111
 days-0.111
 Hours-0.108
Token cries
Token ID24691
Feature activation-1.65e+00

Pos logit contributions
chard+0.045
 ther+0.045
?"+0.044
 sandwich+0.043
anut+0.042
Neg logit contributions
 By-0.101
Years-0.101
Days-0.098
 lear-0.098
 example-0.097
Token.
Token ID13
Feature activation-2.01e+00

Pos logit contributions
?"+0.030
 Anna+0.026
 ther+0.026
 sob+0.025
chard+0.025
Neg logit contributions
Years-0.045
Days-0.045
umm-0.043
 By-0.042
iked-0.042
Token
Token ID198
Feature activation-2.70e+00

Pos logit contributions
?"+0.034
 Anna+0.030
 sob+0.030
 ther+0.028
 mouth+0.028
Neg logit contributions
Years-0.059
Days-0.059
umm-0.057
 By-0.055
iked-0.054
Token"
Token ID1
Feature activation+3.65e+00

Pos logit contributions
umm+0.026
Years+0.025
Days+0.024
istence+0.024
iked+0.024
Neg logit contributions
?"-0.015
 Anna-0.014
 ther-0.012
 sob-0.012
 Mr-0.011
TokenWhat
Token ID2061
Feature activation-1.09e+00

Pos logit contributions
umm+0.097
istence+0.096
iked+0.096
 By+0.096
Days+0.094
Neg logit contributions
 Anna-0.068
?"-0.065
 sob-0.062
 ther-0.060
chard-0.060
Token is
Token ID318
Feature activation-8.25e-01

Pos logit contributions
 sob+0.016
anut+0.016
 Nib+0.015
 camel+0.015
 ther+0.015
Neg logit contributions
Years-0.030
Days-0.029
 By-0.029
 example-0.028
umm-0.028
Token in
Token ID287
Feature activation-3.51e+00

Pos logit contributions
?"+0.014
 Anna+0.012
 sob+0.011
 ther+0.011
 mouth+0.011
Neg logit contributions
Years-0.025
Days-0.025
umm-0.025
iked-0.023
Hours-0.023
Token the
Token ID262
Feature activation-5.77e-01

Pos logit contributions
 sob+0.044
?"+0.037
 Anna+0.034
 ther+0.034
 mouth+0.033
Neg logit contributions
Years-0.113
Days-0.112
 example-0.112
 months-0.107
 hours-0.107
Token box
Token ID3091
Feature activation-6.99e+00

Pos logit contributions
?"+0.007
 Anna+0.007
 sob+0.006
 ther+0.006
 mouth+0.006
Neg logit contributions
Years-0.020
Days-0.019
umm-0.019
iked-0.019
Hours-0.018
Token?"
Token ID1701
Feature activation-3.64e+00

Pos logit contributions
 Luna+0.057
 sob+0.057
 Milo+0.056
 Frost+0.055
 Anna+0.055
Neg logit contributions
Years-0.237
way-0.233
umm-0.233
 By-0.231
 example-0.225
Token Lily
Token ID20037
Feature activation-2.80e+00

Pos logit contributions
?"+0.045
 sob+0.040
 ther+0.038
anut+0.037
cks+0.037
Neg logit contributions
Years-0.117
Days-0.115
umm-0.113
 By-0.112
 hours-0.110
Token asked
Token ID1965
Feature activation-2.32e+00

Pos logit contributions
?"+0.045
 Anna+0.042
 ther+0.041
 sob+0.040
 mouth+0.039
Neg logit contributions
Years-0.082
Days-0.081
umm-0.077
 By-0.075
 hours-0.074
Token.
Token ID13
Feature activation-2.92e+00

Pos logit contributions
?"+0.047
 camel+0.045
 ther+0.044
anut+0.043
 sees+0.043
Neg logit contributions
iked-0.051
Years-0.050
 By-0.049
 lear-0.049
 days-0.048
Token
Token ID198
Feature activation-2.37e+00

Pos logit contributions
?"+0.056
 Anna+0.052
 sob+0.051
 ther+0.048
 mouth+0.048
Neg logit contributions
Years-0.080
Days-0.079
umm-0.077
 By-0.072
iked-0.072
TokenWhy
Token ID5195
Feature activation-1.81e+00

Pos logit contributions
Years+0.073
Days+0.070
umm+0.064
Hours+0.061
Well+0.060
Neg logit contributions
?"-0.087
 mouth-0.083
 Anna-0.079
 sandwich-0.078
 ther-0.077
Token did
Token ID750
Feature activation-7.50e-01

Pos logit contributions
anut+0.039
chard+0.038
cks+0.036
 ther+0.035
 camel+0.035
Neg logit contributions
 By-0.035
astic-0.034
 determination-0.034
ending-0.034
 lear-0.033
Token you
Token ID345
Feature activation-1.60e+00

Pos logit contributions
maid+0.017
anut+0.017
 ther+0.016
 sob+0.016
aj+0.016
Neg logit contributions
Years-0.015
 determination-0.015
 dedication-0.015
Days-0.015
 days-0.014
Token steal
Token ID8711
Feature activation-4.94e+00

Pos logit contributions
 Anna+0.031
aj+0.030
 ther+0.030
 sob+0.029
 Milo+0.029
Neg logit contributions
Years-0.039
Days-0.038
umm-0.035
 determination-0.035
istence-0.035
Token my
Token ID616
Feature activation-1.49e+00

Pos logit contributions
aj+0.092
 ther+0.091
 sob+0.090
anut+0.087
 camel+0.086
Neg logit contributions
 lessons-0.114
 example-0.112
Years-0.111
 months-0.107
 days-0.107
Token cereal
Token ID33158
Feature activation-6.93e+00

Pos logit contributions
cks+0.028
 Lia+0.027
bug+0.027
 sob+0.026
?"+0.026
Neg logit contributions
umm-0.038
 days-0.037
Years-0.035
 hours-0.035
 achievement-0.034
Token?"
Token ID1701
Feature activation-4.09e+00

Pos logit contributions
 sob+0.059
 Milo+0.056
 Anna+0.053
cks+0.053
 Luna+0.052
Neg logit contributions
umm-0.244
Years-0.239
 example-0.225
 months-0.222
way-0.221
Token the
Token ID262
Feature activation-1.37e+00

Pos logit contributions
aj+0.077
anut+0.076
maid+0.075
 sandwich+0.074
?"+0.073
Neg logit contributions
 By-0.101
Years-0.100
Days-0.095
 months-0.095
 Hours-0.095
Token girl
Token ID2576
Feature activation-4.23e+00

Pos logit contributions
?"+0.027
ccoli+0.025
aj+0.025
 Frost+0.025
 ther+0.025
Neg logit contributions
Years-0.034
Days-0.034
 days-0.031
 example-0.031
umm-0.031
Token asked
Token ID1965
Feature activation-1.67e+00

Pos logit contributions
 ther+0.089
?"+0.088
 sandwich+0.088
ccoli+0.084
 trash+0.084
Neg logit contributions
Years-0.085
Days-0.084
udging-0.083
 By-0.081
iked-0.080
Token in
Token ID287
Feature activation-1.01e+00

Pos logit contributions
?"+0.036
 ther+0.033
 sandwich+0.033
aj+0.032
 trash+0.032
Neg logit contributions
Years-0.036
 By-0.035
Days-0.035
iked-0.035
 hours-0.035
Token"
Token ID1
Feature activation+3.14e+00

Pos logit contributions
umm+0.027
Years+0.025
istence+0.025
Days+0.024
 lear+0.024
Neg logit contributions
?"-0.015
 Anna-0.014
 ther-0.012
 sob-0.012
 Sara-0.012
TokenWhat
Token ID2061
Feature activation-1.24e+00

Pos logit contributions
umm+0.084
Years+0.082
Days+0.081
istence+0.079
 By+0.079
Neg logit contributions
?"-0.060
 Anna-0.059
 sob-0.054
 ther-0.053
chard-0.051
Token is
Token ID318
Feature activation-1.06e+00

Pos logit contributions
 sob+0.017
?"+0.016
 Anna+0.015
 ther+0.015
 Nib+0.015
Neg logit contributions
Years-0.038
Days-0.037
 By-0.035
umm-0.035
 example-0.035
Token in
Token ID287
Feature activation-3.45e+00

Pos logit contributions
?"+0.017
 Anna+0.016
 sob+0.016
 ther+0.015
 mouth+0.014
Neg logit contributions
Years-0.032
Days-0.031
umm-0.031
 By-0.029
iked-0.029
Token the
Token ID262
Feature activation-5.37e-01

Pos logit contributions
 sob+0.042
?"+0.035
 Anna+0.033
 ther+0.032
 mouth+0.032
Neg logit contributions
Years-0.112
Days-0.111
 example-0.111
 months-0.107
 hours-0.107
Token box
Token ID3091
Feature activation-6.92e+00

Pos logit contributions
?"+0.006
 Anna+0.006
 ther+0.005
 mouth+0.005
 sob+0.005
Neg logit contributions
Years-0.019
Days-0.019
umm-0.019
iked-0.018
Hours-0.018
Token?"
Token ID1701
Feature activation-3.35e+00

Pos logit contributions
 sob+0.055
 Anna+0.050
 Luna+0.049
 Frost+0.049
 Milo+0.048
Neg logit contributions
Years-0.241
 By-0.236
umm-0.236
way-0.235
iked-0.231
Token Tom
Token ID4186
Feature activation-2.87e+00

Pos logit contributions
?"+0.043
 sob+0.038
 Anna+0.035
 ther+0.035
 mouth+0.034
Neg logit contributions
Years-0.110
Days-0.108
umm-0.105
 By-0.103
iked-0.103
Token asks
Token ID7893
Feature activation-2.10e+00

Pos logit contributions
 ther+0.037
 sandwich+0.034
?"+0.034
chard+0.034
 Anna+0.033
Neg logit contributions
Years-0.087
Days-0.083
 By-0.082
udging-0.081
 example-0.080
Token.
Token ID13
Feature activation-2.19e+00

Pos logit contributions
?"+0.041
anut+0.037
ccoli+0.036
 sees+0.036
aj+0.036
Neg logit contributions
Years-0.051
iked-0.051
Days-0.050
 days-0.048
 By-0.048
Token
Token ID198
Feature activation-1.42e+00

Pos logit contributions
?"+0.042
 sob+0.038
 Anna+0.038
 ther+0.036
 mouth+0.035
Neg logit contributions
Years-0.059
Days-0.058
umm-0.057
 By-0.054
iked-0.054
Token is
Token ID318
Feature activation-1.86e+00

Pos logit contributions
anut+0.035
chard+0.034
 camel+0.034
 Frost+0.033
cks+0.033
Neg logit contributions
All-0.033
 By-0.032
itting-0.032
ending-0.032
astic-0.031
Token the
Token ID262
Feature activation-8.98e-01

Pos logit contributions
aj+0.040
ackers+0.039
maid+0.038
anut+0.038
ighed+0.037
Neg logit contributions
 days-0.036
 By-0.035
 example-0.035
Days-0.035
Years-0.035
Token lady
Token ID10846
Feature activation-4.57e+00

Pos logit contributions
?"+0.012
 Frost+0.012
 sob+0.011
cks+0.011
 Milo+0.011
Neg logit contributions
Years-0.028
Days-0.027
umm-0.026
 days-0.026
 hours-0.026
Token wearing
Token ID5762
Feature activation-4.78e+00

Pos logit contributions
 Milo+0.072
 Frost+0.069
 Leo+0.064
 Rudy+0.062
 sob+0.062
Neg logit contributions
ending-0.127
working-0.118
 By-0.118
umm-0.117
 example-0.113
Token a
Token ID257
Feature activation-2.64e+00

Pos logit contributions
anut+0.097
bug+0.095
 ther+0.095
cks+0.093
 sob+0.092
Neg logit contributions
 hours-0.112
 days-0.110
 lessons-0.100
 weeks-0.100
Years-0.099
Token veil
Token ID30615
Feature activation-6.91e+00

Pos logit contributions
 Frost+0.046
 Nib+0.043
cks+0.043
 Mr+0.041
 Fl+0.040
Neg logit contributions
Years-0.071
Days-0.069
 hours-0.066
 days-0.064
 lear-0.062
Token?"
Token ID1701
Feature activation-2.86e+00

Pos logit contributions
cks+0.064
 Frost+0.061
 sob+0.058
 Anna+0.057
 Milo+0.056
Neg logit contributions
Years-0.231
umm-0.230
 By-0.223
 lear-0.219
 example-0.216
Token
Token ID198
Feature activation-2.75e+00

Pos logit contributions
cks+0.057
anut+0.056
 ther+0.056
 camel+0.056
?"+0.055
Neg logit contributions
Years-0.067
 By-0.064
Days-0.062
 Hours-0.060
Eventually-0.060
Token
Token ID198
Feature activation-1.43e+00

Pos logit contributions
 sandwich+0.066
 camel+0.065
anut+0.065
chard+0.065
 mac+0.065
Neg logit contributions
Years-0.056
 By-0.052
Days-0.052
Eventually-0.050
All-0.048
TokenHer
Token ID9360
Feature activation-1.91e+00

Pos logit contributions
?"+0.032
 Anna+0.030
 sob+0.030
 ther+0.030
 mouth+0.030
Neg logit contributions
Years-0.034
Days-0.034
Hours-0.030
Eventually-0.029
 By-0.029
Token mom
Token ID1995
Feature activation-1.38e+00

Pos logit contributions
cks+0.032
chard+0.028
?"+0.027
 ther+0.027
anut+0.026
Neg logit contributions
Years-0.054
Days-0.053
 days-0.051
umm-0.050
 example-0.049
Token"
Token ID1
Feature activation+2.79e+00

Pos logit contributions
Years+0.005
Days+0.005
umm+0.005
 By+0.005
Hours+0.005
Neg logit contributions
?"-0.005
 Anna-0.005
 sob-0.005
 ther-0.005
 mouth-0.005
TokenWhat
Token ID2061
Feature activation-3.09e-01

Pos logit contributions
Years+0.060
Days+0.059
umm+0.059
 hours+0.052
Hours+0.052
Neg logit contributions
?"-0.069
 Anna-0.066
 sob-0.064
 mouth-0.064
 Luna-0.060
Token's
Token ID338
Feature activation-6.20e-01

Pos logit contributions
 sob+0.005
anut+0.005
chard+0.004
 ther+0.004
 Nib+0.004
Neg logit contributions
 By-0.008
ending-0.008
 endings-0.008
umm-0.008
 days-0.008
Token in
Token ID287
Feature activation-3.64e+00

Pos logit contributions
 sob+0.015
?"+0.014
 Anna+0.014
 mouth+0.014
 Milo+0.013
Neg logit contributions
Years-0.013
Days-0.012
 days-0.011
 By-0.011
 hours-0.011
Token the
Token ID262
Feature activation-8.58e-01

Pos logit contributions
 sob+0.052
aj+0.043
ccoli+0.043
anut+0.041
maid+0.040
Neg logit contributions
 example-0.108
 days-0.106
 months-0.106
umm-0.104
 weeks-0.103
Token cage
Token ID17186
Feature activation-6.89e+00

Pos logit contributions
?"+0.010
 Anna+0.008
 sob+0.008
 Milo+0.007
maid+0.007
Neg logit contributions
Days-0.030
Years-0.029
umm-0.029
 By-0.028
 example-0.028
Token,
Token ID11
Feature activation-3.76e+00

Pos logit contributions
 Milo+0.071
 Frost+0.069
 Anna+0.066
anut+0.063
 sob+0.060
Neg logit contributions
way-0.226
Years-0.221
 By-0.221
umm-0.220
 months-0.210
Token Daddy
Token ID26926
Feature activation-3.77e+00

Pos logit contributions
 sob+0.089
cks+0.089
bug+0.086
bugs+0.083
c+0.082
Neg logit contributions
umm-0.073
Years-0.071
 example-0.069
 By-0.068
 determination-0.067
Token?"
Token ID1701
Feature activation-3.46e+00

Pos logit contributions
 Lia+0.038
 sob+0.037
 Luna+0.035
 mouth+0.035
cks+0.035
Neg logit contributions
working-0.116
umm-0.116
Years-0.115
 By-0.115
 planned-0.112
Token Pete
Token ID17542
Feature activation-3.83e+00

Pos logit contributions
 ther+0.065
?"+0.065
aj+0.065
ccoli+0.064
anut+0.063
Neg logit contributions
Years-0.076
 By-0.075
Days-0.073
udging-0.073
iked-0.072
Token asked
Token ID1965
Feature activation-1.93e+00

Pos logit contributions
 ther+0.076
?"+0.076
 Anna+0.071
 sandwich+0.070
 sees+0.069
Neg logit contributions
Years-0.088
Days-0.087
udging-0.081
 By-0.079
 example-0.079
Token"
Token ID1
Feature activation+3.72e+00

Pos logit contributions
?"+0.009
 sob+0.008
 Anna+0.008
 mouth+0.008
 ther+0.008
Neg logit contributions
Years-0.011
Days-0.011
umm-0.011
 By-0.010
Hours-0.010
TokenDo
Token ID5211
Feature activation+1.41e+00

Pos logit contributions
Years+0.092
Days+0.091
umm+0.090
iked+0.084
 By+0.083
Neg logit contributions
?"-0.081
 Anna-0.077
 sob-0.075
 ther-0.070
 Sunny-0.068
Token you
Token ID345
Feature activation-3.47e-01

Pos logit contributions
Years+0.041
Days+0.040
umm+0.039
 By+0.037
iked+0.037
Neg logit contributions
?"-0.024
 sob-0.022
 Anna-0.022
 ther-0.021
 mouth-0.020
Token like
Token ID588
Feature activation-1.81e+00

Pos logit contributions
 ther+0.008
 Anna+0.008
 Sunny+0.008
 sob+0.008
chard+0.008
Neg logit contributions
Years-0.007
 lear-0.006
Days-0.006
 hours-0.006
 example-0.006
Token my
Token ID616
Feature activation-4.22e-01

Pos logit contributions
 ther+0.025
 sob+0.025
maid+0.023
anut+0.022
chard+0.022
Neg logit contributions
Years-0.052
 days-0.051
 hours-0.050
Days-0.050
 example-0.049
Token mustache
Token ID49303
Feature activation-6.89e+00

Pos logit contributions
 ther+0.007
 Anna+0.007
 Frost+0.007
 Milo+0.007
 Fl+0.006
Neg logit contributions
 days-0.011
umm-0.011
Years-0.011
 lear-0.011
 hours-0.011
Token?"
Token ID1701
Feature activation-3.04e+00

Pos logit contributions
 ther+0.069
 Milo+0.068
 Frost+0.066
 Anna+0.061
anut+0.058
Neg logit contributions
Years-0.210
working-0.206
umm-0.201
 lessons-0.200
 days-0.197
Token he
Token ID339
Feature activation-5.75e+00

Pos logit contributions
maid+0.071
anut+0.068
ccoli+0.067
 ther+0.065
aj+0.064
Neg logit contributions
Years-0.065
 By-0.064
 months-0.062
 hours-0.061
 Hours-0.061
Token asks
Token ID7893
Feature activation-2.38e+00

Pos logit contributions
chard+0.105
anut+0.099
 ther+0.095
 sandwich+0.091
maid+0.090
Neg logit contributions
Years-0.145
 By-0.138
 lear-0.138
 example-0.137
Days-0.136
Token.
Token ID13
Feature activation-1.66e+00

Pos logit contributions
?"+0.053
ccoli+0.050
aj+0.049
 sandwich+0.048
maid+0.048
Neg logit contributions
iked-0.050
 days-0.047
Years-0.046
 months-0.045
udging-0.045
Token
Token ID198
Feature activation-1.81e+00

Pos logit contributions
?"+0.034
anut+0.032
maid+0.032
 sob+0.031
 ther+0.031
Neg logit contributions
Years-0.040
 By-0.039
Days-0.038
 Hours-0.036
iked-0.036
Token "
Token ID366
Feature activation+2.41e+00

Pos logit contributions
?"+0.033
 sob+0.030
 Anna+0.029
 ther+0.028
 sandwich+0.028
Neg logit contributions
Years-0.054
Days-0.053
 By-0.051
umm-0.051
iked-0.050
TokenWhat
Token ID2061
Feature activation-1.26e+00

Pos logit contributions
Years+0.064
Days+0.063
umm+0.061
 By+0.058
iked+0.057
Neg logit contributions
?"-0.048
 Anna-0.045
 sob-0.044
 ther-0.042
 mouth-0.041
Token's
Token ID338
Feature activation-8.17e-01

Pos logit contributions
 sob+0.019
anut+0.019
 ther+0.018
chard+0.018
aj+0.018
Neg logit contributions
 By-0.033
Years-0.032
ending-0.032
 days-0.032
 endings-0.032
Token in
Token ID287
Feature activation-3.83e+00

Pos logit contributions
 sob+0.017
maid+0.016
 mouth+0.015
 bowl+0.015
 Anna+0.015
Neg logit contributions
Years-0.020
Days-0.019
 example-0.018
 days-0.018
 By-0.017
Token the
Token ID262
Feature activation-2.42e-01

Pos logit contributions
 sob+0.054
maid+0.044
ccoli+0.042
anut+0.042
aj+0.041
Neg logit contributions
 example-0.116
 days-0.112
 months-0.112
 weeks-0.110
Years-0.109
Token box
Token ID3091
Feature activation-6.86e+00

Pos logit contributions
?"+0.004
 Anna+0.003
 sob+0.003
 Frost+0.003
 Sunny+0.003
Neg logit contributions
Years-0.007
Days-0.007
umm-0.007
 By-0.007
 example-0.007
Token?"
Token ID1701
Feature activation-3.51e+00

Pos logit contributions
 sob+0.078
 Anna+0.077
cks+0.075
 Milo+0.075
 Luna+0.074
Neg logit contributions
 By-0.209
umm-0.209
Years-0.201
way-0.199
iked-0.196
Token she
Token ID673
Feature activation-4.45e+00

Pos logit contributions
maid+0.068
anut+0.067
cks+0.065
?"+0.063
 sandwich+0.061
Neg logit contributions
Years-0.084
 By-0.084
 Hours-0.080
Days-0.079
 hours-0.079
Token said
Token ID531
Feature activation-1.01e+00

Pos logit contributions
 ther+0.095
 sandwich+0.092
cks+0.089
 trash+0.089
 bowl+0.089
Neg logit contributions
Years-0.098
Days-0.094
 hours-0.086
 By-0.086
Hours-0.085
Token.
Token ID13
Feature activation-2.88e+00

Pos logit contributions
?"+0.020
 ther+0.019
 sees+0.017
 trash+0.017
 sob+0.017
Neg logit contributions
Years-0.024
Days-0.024
iked-0.023
umm-0.023
 lear-0.023
Token The
Token ID383
Feature activation-1.99e-01

Pos logit contributions
?"+0.052
 sob+0.048
 sandwich+0.047
 angry+0.047
anut+0.046
Neg logit contributions
Years-0.076
Days-0.074
 By-0.074
umm-0.071
iked-0.070
Token,
Token ID11
Feature activation-2.40e+00

Pos logit contributions
chard+0.042
 ther+0.041
 camel+0.041
 Sunny+0.040
 Frost+0.039
Neg logit contributions
 By-0.050
umm-0.048
Years-0.048
 example-0.047
istence-0.046
Token can
Token ID460
Feature activation-1.06e-01

Pos logit contributions
cks+0.047
anut+0.043
 sob+0.042
maid+0.042
chard+0.042
Neg logit contributions
Years-0.057
Days-0.056
 days-0.056
 hours-0.054
 By-0.054
Token you
Token ID345
Feature activation-1.61e+00

Pos logit contributions
 camel+0.002
anut+0.002
 ther+0.002
maid+0.002
aj+0.002
Neg logit contributions
Years-0.002
Days-0.002
 dedication-0.002
ths-0.002
 days-0.002
Token heal
Token ID12035
Feature activation-5.34e+00

Pos logit contributions
 Anna+0.035
?"+0.034
 ther+0.034
 sob+0.033
 Frost+0.032
Neg logit contributions
Years-0.037
Days-0.037
umm-0.034
Hours-0.033
 By-0.032
Token my
Token ID616
Feature activation-1.01e+00

Pos logit contributions
 ther+0.090
 camel+0.084
bug+0.076
maid+0.076
bugs+0.073
Neg logit contributions
Years-0.144
 example-0.141
 days-0.133
 weeks-0.130
 months-0.129
Token doll
Token ID3654
Feature activation-6.85e+00

Pos logit contributions
?"+0.014
 Anna+0.013
 Leo+0.012
 Frost+0.012
cks+0.012
Neg logit contributions
Years-0.031
Days-0.031
umm-0.030
 example-0.029
 days-0.029
Token?
Token ID30
Feature activation-9.27e-01

Pos logit contributions
 Frost+0.075
cks+0.070
 Leo+0.066
 camel+0.066
 ther+0.065
Neg logit contributions
Years-0.217
 example-0.202
Days-0.202
working-0.199
 By-0.199
Token She
Token ID1375
Feature activation-3.24e+00

Pos logit contributions
?"+0.019
 sob+0.017
 ther+0.017
 Anna+0.017
anut+0.017
Neg logit contributions
Years-0.022
umm-0.022
Days-0.022
 By-0.021
 lear-0.020
Token is
Token ID318
Feature activation-1.68e+00

Pos logit contributions
 Anna+0.049
 ther+0.048
chard+0.048
?"+0.046
 sob+0.046
Neg logit contributions
Years-0.089
Days-0.086
umm-0.085
 By-0.084
 dedication-0.083
Token hurt
Token ID5938
Feature activation-2.77e+00

Pos logit contributions
 camel+0.034
maid+0.034
 ther+0.033
 Lia+0.033
 Leo+0.032
Neg logit contributions
 example-0.037
Years-0.036
 By-0.036
Days-0.036
 hours-0.035
Token and
Token ID290
Feature activation-2.51e+00

Pos logit contributions
 ther+0.034
 sob+0.034
 mouth+0.033
 sandwich+0.033
 Anna+0.033
Neg logit contributions
Years-0.086
Days-0.085
umm-0.082
 By-0.080
 example-0.079
Token"
Token ID1
Feature activation+2.84e+00

Pos logit contributions
Years+0.012
umm+0.012
Days+0.012
iked+0.011
 By+0.011
Neg logit contributions
?"-0.013
 Anna-0.012
 sob-0.012
 ther-0.011
 mouth-0.011
TokenCan
Token ID6090
Feature activation+3.12e-02

Pos logit contributions
Days+0.056
Years+0.055
Hours+0.047
All+0.046
Well+0.045
Neg logit contributions
?"-0.084
 mouth-0.079
aj-0.076
 sandwich-0.076
cks-0.075
Token I
Token ID314
Feature activation-9.66e-01

Pos logit contributions
Years+0.001
Days+0.001
 days+0.001
 By+0.000
 determination+0.000
Neg logit contributions
cks-0.001
maid-0.001
aj-0.001
 sob-0.001
anut-0.001
Token pet
Token ID4273
Feature activation-4.58e+00

Pos logit contributions
 sob+0.021
 Anna+0.020
cks+0.020
chard+0.020
 mouth+0.020
Neg logit contributions
Years-0.021
Days-0.021
iked-0.019
Hours-0.018
 By-0.018
Token your
Token ID534
Feature activation-1.40e+00

Pos logit contributions
aj+0.124
ccoli+0.113
 sob+0.110
kins+0.110
anut+0.109
Neg logit contributions
 lessons-0.084
 days-0.080
bye-0.078
astic-0.076
no-0.075
Token dog
Token ID3290
Feature activation-6.85e+00

Pos logit contributions
?"+0.027
 Anna+0.025
chard+0.025
cks+0.025
bug+0.024
Neg logit contributions
Years-0.032
 example-0.032
Days-0.031
 days-0.031
 lessons-0.030
Token?"
Token ID1701
Feature activation-3.90e+00

Pos logit contributions
cks+0.187
kins+0.166
 pans+0.164
anut+0.158
 wounds+0.157
Neg logit contributions
working-0.100
 By-0.099
bye-0.098
 example-0.097
 lessons-0.096
Token Tim
Token ID5045
Feature activation-1.55e+00

Pos logit contributions
maid+0.098
anut+0.093
aj+0.092
ccoli+0.091
 ther+0.089
Neg logit contributions
 hours-0.070
 days-0.069
 By-0.069
 Hours-0.065
 Soon-0.065
Tokenmy
Token ID1820
Feature activation-3.44e+00

Pos logit contributions
 ther+0.028
 Frost+0.027
 Sunny+0.026
 sandwich+0.026
 Nib+0.026
Neg logit contributions
Years-0.040
Days-0.039
udging-0.037
 perspective-0.035
ending-0.034
Token asked
Token ID1965
Feature activation-2.80e+00

Pos logit contributions
 ther+0.081
?"+0.072
 Frost+0.072
 sandwich+0.072
 trash+0.071
Neg logit contributions
Years-0.069
Days-0.068
 days-0.062
 hours-0.061
udging-0.061
Token the
Token ID262
Feature activation-8.48e-01

Pos logit contributions
cks+0.067
aj+0.066
anut+0.065
 camel+0.064
maid+0.063
Neg logit contributions
 days-0.052
 hours-0.051
 Hours-0.047
 months-0.047
Years-0.046
Token,
Token ID11
Feature activation-2.58e+00

Pos logit contributions
 sob+0.065
 ther+0.065
 wounds+0.065
 Frost+0.063
chard+0.063
Neg logit contributions
umm-0.049
 By-0.048
ths-0.045
astic-0.044
Years-0.044
Token what
Token ID644
Feature activation-7.29e-01

Pos logit contributions
cks+0.053
 sob+0.048
aj+0.048
anut+0.048
 swollen+0.044
Neg logit contributions
 example-0.058
Years-0.057
 hours-0.057
 days-0.056
 lear-0.055
Token's
Token ID338
Feature activation-1.11e+00

Pos logit contributions
anut+0.012
 sob+0.012
chard+0.012
aj+0.012
 Nib+0.011
Neg logit contributions
 By-0.018
 days-0.018
Years-0.017
 endings-0.017
ending-0.017
Token in
Token ID287
Feature activation-3.59e+00

Pos logit contributions
 sob+0.027
aj+0.025
maid+0.024
 mac+0.024
?"+0.024
Neg logit contributions
Years-0.022
Days-0.021
 example-0.020
 days-0.020
 hours-0.019
Token the
Token ID262
Feature activation-1.23e+00

Pos logit contributions
 sob+0.041
?"+0.036
aj+0.033
 mac+0.033
 Milo+0.032
Neg logit contributions
Days-0.114
 example-0.113
Years-0.113
 days-0.112
umm-0.112
Token hole
Token ID7604
Feature activation-6.80e+00

Pos logit contributions
?"+0.019
 Anna+0.018
 sob+0.018
 Milo+0.017
 Frost+0.017
Neg logit contributions
Days-0.036
Years-0.036
umm-0.035
 hours-0.034
 days-0.033
Token?"
Token ID1701
Feature activation-3.98e+00

Pos logit contributions
 sob+0.064
 Nib+0.060
 Luna+0.059
 Milo+0.058
 Anna+0.058
Neg logit contributions
way-0.225
umm-0.221
Years-0.219
 By-0.218
iked-0.217
Token asked
Token ID1965
Feature activation-3.71e+00

Pos logit contributions
anut+0.070
cks+0.069
maid+0.066
?"+0.064
ccoli+0.063
Neg logit contributions
Years-0.098
 By-0.096
 days-0.096
Days-0.095
 Hours-0.094
Token Lily
Token ID20037
Feature activation-1.79e+00

Pos logit contributions
cks+0.071
anut+0.070
 camel+0.064
ccoli+0.063
anuts+0.061
Neg logit contributions
 days-0.091
 hours-0.089
 By-0.087
Years-0.086
 months-0.086
Token.
Token ID13
Feature activation-3.42e+00

Pos logit contributions
 ther+0.044
 sandwich+0.039
 trash+0.039
 camel+0.038
?"+0.038
Neg logit contributions
Years-0.032
iked-0.031
ending-0.031
 By-0.031
udging-0.030
Token Her
Token ID2332
Feature activation-1.52e+00

Pos logit contributions
?"+0.070
 sob+0.062
 ther+0.062
 Anna+0.061
cks+0.061
Neg logit contributions
Years-0.085
Days-0.083
 By-0.079
iked-0.079
umm-0.079
Token,
Token ID11
Feature activation-3.13e+00

Pos logit contributions
 sob+0.087
 bleed+0.082
 wounds+0.082
 scrap+0.081
 limp+0.080
Neg logit contributions
umm-0.059
 days-0.057
 hours-0.055
astic-0.055
iked-0.054
Token can
Token ID460
Feature activation-1.42e+00

Pos logit contributions
 sob+0.057
cks+0.056
ighed+0.051
anut+0.051
aj+0.051
Neg logit contributions
 By-0.073
astic-0.073
umm-0.072
Years-0.070
ending-0.068
Token we
Token ID356
Feature activation-1.64e+00

Pos logit contributions
aj+0.035
maid+0.034
anut+0.034
 sob+0.034
 trash+0.033
Neg logit contributions
 By-0.023
 days-0.023
iked-0.022
ending-0.022
Years-0.022
Token pet
Token ID4273
Feature activation-4.39e+00

Pos logit contributions
 sob+0.033
 Anna+0.032
?"+0.029
 mouth+0.029
 ther+0.029
Neg logit contributions
Years-0.041
Days-0.040
umm-0.040
 By-0.038
 lear-0.037
Token your
Token ID534
Feature activation-1.03e+00

Pos logit contributions
 sob+0.099
aj+0.095
ccoli+0.090
anut+0.087
 frown+0.087
Neg logit contributions
 days-0.091
 lessons-0.091
astic-0.090
 months-0.088
 hours-0.086
Token dog
Token ID3290
Feature activation-6.80e+00

Pos logit contributions
?"+0.018
 sob+0.018
chard+0.017
 Anna+0.017
cks+0.016
Neg logit contributions
 example-0.024
 days-0.024
 lear-0.024
Years-0.023
iked-0.023
Token?"
Token ID1701
Feature activation-3.65e+00

Pos logit contributions
cks+0.097
anut+0.085
 wounds+0.080
 pans+0.080
 sob+0.079
Neg logit contributions
working-0.168
 By-0.165
 planned-0.164
 example-0.161
 months-0.161
Token L
Token ID406
Feature activation-7.63e-01

Pos logit contributions
?"+0.046
ccoli+0.046
maid+0.045
anut+0.045
 ther+0.043
Neg logit contributions
 By-0.108
Years-0.108
 hours-0.106
Days-0.105
 months-0.105
Tokenila
Token ID10102
Feature activation-3.50e+00

Pos logit contributions
 mouth+0.012
 sob+0.011
chard+0.011
?"+0.011
 Sunny+0.011
Neg logit contributions
Years-0.021
Days-0.021
iked-0.021
udging-0.021
umm-0.020
Token asked
Token ID1965
Feature activation-2.59e+00

Pos logit contributions
 ther+0.062
?"+0.060
 sandwich+0.057
 Anna+0.057
 Frost+0.055
Neg logit contributions
Years-0.090
Days-0.088
 By-0.085
udging-0.084
umm-0.082
Token the
Token ID262
Feature activation-7.34e-01

Pos logit contributions
?"+0.051
aj+0.048
 camel+0.048
 ther+0.047
 sandwich+0.046
Neg logit contributions
iked-0.058
 By-0.056
 hours-0.055
Years-0.055
 days-0.055
Token smiled
Token ID13541
Feature activation+1.42e+00

Pos logit contributions
cks+0.014
?"+0.014
 Luna+0.014
 ther+0.014
 hurts+0.013
Neg logit contributions
 hours-0.022
Years-0.022
 lear-0.021
 By-0.021
 determination-0.021
Token.
Token ID13
Feature activation+1.96e+00

Pos logit contributions
Years+0.040
Days+0.040
umm+0.039
 By+0.037
Hours+0.037
Neg logit contributions
?"-0.026
 Anna-0.024
 sob-0.023
 ther-0.022
 mouth-0.022
Token She
Token ID1375
Feature activation-9.51e-01

Pos logit contributions
Days+0.053
Years+0.053
umm+0.049
Hours+0.048
 hours+0.047
Neg logit contributions
?"-0.040
 Anna-0.038
 Mr-0.036
 Ben-0.035
 ther-0.034
Token said
Token ID531
Feature activation+1.57e+00

Pos logit contributions
?"+0.016
 Anna+0.015
 sob+0.015
 ther+0.014
 mouth+0.014
Neg logit contributions
Years-0.027
Days-0.026
umm-0.025
 hours-0.025
 By-0.025
Token,
Token ID11
Feature activation-1.28e+00

Pos logit contributions
Years+0.044
Days+0.043
umm+0.042
iked+0.041
 By+0.041
Neg logit contributions
?"-0.027
 sob-0.025
 Anna-0.025
 ther-0.024
 mouth-0.023
Token "
Token ID366
Feature activation+4.72e+00

Pos logit contributions
?"+0.026
 sob+0.024
 Anna+0.023
 ther+0.023
 mouth+0.022
Neg logit contributions
Years-0.032
Days-0.031
umm-0.030
 By-0.030
iked-0.029
TokenGo
Token ID5247
Feature activation+1.12e+00

Pos logit contributions
Days+0.116
Years+0.111
umm+0.109
iked+0.106
Hours+0.104
Neg logit contributions
?"-0.105
 Anna-0.098
 ther-0.091
 sob-0.089
 mac-0.088
Token and
Token ID290
Feature activation-1.07e+00

Pos logit contributions
Days+0.030
Years+0.030
umm+0.030
Hours+0.027
iked+0.027
Neg logit contributions
?"-0.023
 Anna-0.021
 sob-0.020
 ther-0.019
 mouth-0.019
Token hide
Token ID7808
Feature activation-2.68e+00

Pos logit contributions
 sob+0.019
?"+0.018
 ther+0.018
 Anna+0.018
cks+0.018
Neg logit contributions
Years-0.028
Days-0.026
umm-0.026
 By-0.026
 lear-0.025
Token now
Token ID783
Feature activation-2.43e+00

Pos logit contributions
 sob+0.044
?"+0.040
 ther+0.039
 mouth+0.038
 frown+0.037
Neg logit contributions
Years-0.076
Days-0.075
 hours-0.074
 example-0.073
 days-0.072
Token,
Token ID11
Feature activation-7.78e-01

Pos logit contributions
?"+0.035
 Anna+0.031
 sob+0.030
 ther+0.028
 mouth+0.028
Neg logit contributions
Years-0.078
Days-0.077
umm-0.076
iked-0.072
Hours-0.072
Token From
Token ID3574
Feature activation-9.98e-01

Pos logit contributions
Years+0.015
Days+0.015
umm+0.014
Hours+0.013
 hours+0.013
Neg logit contributions
?"-0.009
 Anna-0.008
 Mr-0.008
 sob-0.008
 ther-0.008
Token that
Token ID326
Feature activation+5.47e-01

Pos logit contributions
 sob+0.029
 ther+0.028
 Anna+0.027
 mouth+0.027
?"+0.027
Neg logit contributions
Years-0.017
Days-0.016
 days-0.014
 hours-0.014
 example-0.014
Token day
Token ID1110
Feature activation+1.16e-01

Pos logit contributions
Years+0.008
Days+0.008
umm+0.008
iked+0.007
Hours+0.007
Neg logit contributions
?"-0.018
 Anna-0.017
 sob-0.016
 ther-0.016
 mouth-0.016
Token on
Token ID319
Feature activation+6.52e-02

Pos logit contributions
Years+0.004
umm+0.004
Days+0.004
istence+0.004
Hours+0.004
Neg logit contributions
?"-0.001
 Anna-0.001
 ther-0.001
 sob-0.001
 mouth-0.001
Token,
Token ID11
Feature activation+6.93e-01

Pos logit contributions
Years+0.002
Days+0.002
umm+0.002
 By+0.002
iked+0.002
Neg logit contributions
?"-0.001
 Anna-0.001
 sob-0.001
 ther-0.001
 mouth-0.001
Token Nem
Token ID22547
Feature activation+3.44e+00

Pos logit contributions
Years+0.022
Days+0.022
umm+0.021
Hours+0.020
iked+0.020
Neg logit contributions
?"-0.011
 Anna-0.010
 sob-0.009
 ther-0.009
 mouth-0.009
Tokeno
Token ID78
Feature activation+3.11e+00

Pos logit contributions
Years+0.114
Days+0.113
umm+0.111
 By+0.108
iked+0.108
Neg logit contributions
?"-0.039
 sob-0.039
 Anna-0.038
 ther-0.034
 mouth-0.032
Token knew
Token ID2993
Feature activation+2.32e+00

Pos logit contributions
Years+0.085
Days+0.084
umm+0.083
iked+0.077
Hours+0.077
Neg logit contributions
?"-0.060
 Anna-0.055
 sob-0.053
 mouth-0.050
 ther-0.049
Token that
Token ID326
Feature activation+1.69e+00

Pos logit contributions
umm+0.059
Days+0.059
Years+0.059
Hours+0.053
istence+0.053
Neg logit contributions
?"-0.051
 Anna-0.046
 sob-0.045
 mouth-0.043
 Mr-0.043
Token even
Token ID772
Feature activation+1.24e+00

Pos logit contributions
Years+0.045
Days+0.044
umm+0.043
 By+0.041
 hours+0.041
Neg logit contributions
?"-0.033
 Anna-0.030
 sob-0.030
 ther-0.029
 mouth-0.028
Token though
Token ID996
Feature activation+2.19e+00

Pos logit contributions
Years+0.033
Days+0.032
umm+0.031
 hours+0.030
 By+0.030
Neg logit contributions
?"-0.024
 sob-0.023
 Anna-0.022
 ther-0.021
 mouth-0.021
Token the
Token ID262
Feature activation-1.10e+00

Pos logit contributions
?"+0.017
 sob+0.017
aj+0.016
anut+0.016
cks+0.015
Neg logit contributions
Years-0.034
 hours-0.034
 days-0.034
 example-0.033
 lessons-0.033
Token door
Token ID3420
Feature activation-1.88e+00

Pos logit contributions
?"+0.023
 Anna+0.022
 sob+0.022
 Milo+0.022
 Sunny+0.021
Neg logit contributions
Years-0.026
umm-0.025
Days-0.025
 By-0.024
 example-0.023
Token and
Token ID290
Feature activation-2.71e+00

Pos logit contributions
?"+0.030
 Anna+0.028
 sob+0.025
 mouth+0.024
 ther+0.023
Neg logit contributions
Days-0.061
Years-0.060
umm-0.059
Hours-0.055
 hours-0.055
Token says
Token ID1139
Feature activation-3.90e-01

Pos logit contributions
anut+0.049
maid+0.041
?"+0.040
ackers+0.040
chard+0.040
Neg logit contributions
 Hours-0.065
 By-0.065
Years-0.064
iked-0.063
 lear-0.062
Token,
Token ID11
Feature activation-2.31e+00

Pos logit contributions
anut+0.009
 sob+0.009
cks+0.009
 ther+0.009
chard+0.009
Neg logit contributions
iked-0.008
 By-0.007
 example-0.007
 lear-0.007
Years-0.007
Token "
Token ID366
Feature activation+4.18e+00

Pos logit contributions
?"+0.052
 sandwich+0.049
 sob+0.049
 ther+0.048
 Anna+0.047
Neg logit contributions
Years-0.050
 By-0.049
Days-0.048
iked-0.046
 lear-0.045
TokenMail
Token ID25804
Feature activation-2.30e+00

Pos logit contributions
Years+0.118
Days+0.117
umm+0.114
iked+0.109
 By+0.108
Neg logit contributions
?"-0.076
 Anna-0.071
 sob-0.069
 ther-0.066
 Mr-0.063
Token for
Token ID329
Feature activation-2.39e+00

Pos logit contributions
 sob+0.037
?"+0.034
 Anna+0.033
chard+0.032
 ther+0.032
Neg logit contributions
Years-0.066
umm-0.063
Days-0.063
 example-0.061
 By-0.061
Token mom
Token ID1995
Feature activation-2.19e+00

Pos logit contributions
?"+0.032
 sob+0.030
 Anna+0.029
 ther+0.027
 mouth+0.026
Neg logit contributions
Years-0.076
Days-0.076
umm-0.073
 hours-0.071
 By-0.071
Token and
Token ID290
Feature activation-2.58e+00

Pos logit contributions
 sob+0.043
 ther+0.041
 Anna+0.040
?"+0.040
chard+0.040
Neg logit contributions
Years-0.050
 By-0.050
Days-0.050
umm-0.049
 days-0.048
Token dad
Token ID9955
Feature activation-2.12e+00

Pos logit contributions
?"+0.038
 sob+0.037
 ther+0.035
 Anna+0.034
cks+0.032
Neg logit contributions
Years-0.076
umm-0.073
Days-0.073
 lear-0.072
 example-0.071
Token that
Token ID326
Feature activation+6.41e-01

Pos logit contributions
 sob+0.028
 Anna+0.028
 ther+0.028
 mouth+0.027
?"+0.027
Neg logit contributions
Years-0.019
Days-0.018
 hours-0.016
 days-0.016
umm-0.015
Token day
Token ID1110
Feature activation+5.53e-01

Pos logit contributions
Days+0.011
Years+0.011
umm+0.011
iked+0.010
Hours+0.010
Neg logit contributions
?"-0.019
 Anna-0.018
 sob-0.017
 Mr-0.017
 ther-0.017
Token on
Token ID319
Feature activation-5.91e-03

Pos logit contributions
Years+0.022
umm+0.022
Days+0.022
istence+0.020
 perspective+0.020
Neg logit contributions
?"-0.004
 Anna-0.003
 ther-0.003
 mouth-0.003
 sob-0.003
Token,
Token ID11
Feature activation+1.07e+00

Pos logit contributions
?"+0.000
 Anna+0.000
 sob+0.000
 ther+0.000
 mouth+0.000
Neg logit contributions
Years-0.000
Days-0.000
umm-0.000
 By-0.000
iked-0.000
Token Tim
Token ID5045
Feature activation+3.41e+00

Pos logit contributions
Days+0.037
Years+0.037
umm+0.037
iked+0.035
istence+0.035
Neg logit contributions
?"-0.013
 Anna-0.012
 Mr-0.011
 mouth-0.010
 Fl-0.010
Tokenmy
Token ID1820
Feature activation+3.76e+00

Pos logit contributions
Years+0.123
Days+0.122
umm+0.120
 By+0.114
iked+0.114
Neg logit contributions
?"-0.035
 Anna-0.030
 sob-0.029
 ther-0.026
 mouth-0.025
Token played
Token ID2826
Feature activation+1.63e+00

Pos logit contributions
umm+0.116
Years+0.114
Days+0.114
iked+0.107
Hours+0.103
Neg logit contributions
?"-0.068
 bear-0.056
 sees-0.055
 Anna-0.055
 mouth-0.054
Token with
Token ID351
Feature activation+7.28e-01

Pos logit contributions
Years+0.047
Days+0.046
umm+0.043
 hours+0.042
 By+0.042
Neg logit contributions
?"-0.029
 sob-0.027
 Anna-0.026
 ther-0.025
 mouth-0.025
Token his
Token ID465
Feature activation-1.40e-01

Pos logit contributions
Years+0.024
Days+0.023
 hours+0.023
 days+0.022
 example+0.022
Neg logit contributions
?"-0.009
 sob-0.009
 ther-0.009
 Sunny-0.007
 Anna-0.007
Token triangles
Token ID44360
Feature activation+2.34e+00

Pos logit contributions
?"+0.003
 sob+0.003
 Anna+0.003
 ther+0.002
 Sunny+0.002
Neg logit contributions
Years-0.004
Days-0.004
umm-0.004
 hours-0.003
 By-0.003
Token every
Token ID790
Feature activation+8.76e-01

Pos logit contributions
umm+0.055
Years+0.055
Days+0.055
istence+0.051
 By+0.050
Neg logit contributions
?"-0.052
 Anna-0.050
 sob-0.047
 sandwich-0.046
 mouth-0.046
Token to
Token ID284
Feature activation-8.13e-01

Pos logit contributions
?"+0.034
 sob+0.033
 ther+0.031
 Anna+0.030
 frown+0.030
Neg logit contributions
Years-0.031
Days-0.029
iked-0.028
 days-0.028
 hours-0.028
Token Tom
Token ID4186
Feature activation-1.90e+00

Pos logit contributions
?"+0.011
aj+0.011
 sob+0.010
 ther+0.010
maid+0.010
Neg logit contributions
Years-0.024
Days-0.024
 lear-0.024
 hours-0.023
 days-0.023
Token.
Token ID13
Feature activation+3.92e-02

Pos logit contributions
chard+0.044
 ther+0.043
maid+0.042
?"+0.042
 camel+0.041
Neg logit contributions
 By-0.033
 lear-0.032
Years-0.032
 example-0.032
 days-0.030
Token
Token ID198
Feature activation-2.15e+00

Pos logit contributions
Years+0.001
Days+0.001
umm+0.001
iked+0.001
Hours+0.001
Neg logit contributions
?"-0.001
 Anna-0.001
 sob-0.001
 mouth-0.001
 ther-0.001
Token
Token ID198
Feature activation-4.01e-01

Pos logit contributions
?"+0.040
 sob+0.039
 mouth+0.038
maid+0.038
 ther+0.038
Neg logit contributions
Years-0.056
Days-0.055
 By-0.053
Eventually-0.051
Hours-0.051
Token"
Token ID1
Feature activation+3.85e+00

Pos logit contributions
?"+0.007
 Anna+0.006
 sob+0.006
 ther+0.006
 Mr+0.006
Neg logit contributions
umm-0.012
Years-0.012
Days-0.011
iked-0.011
 hours-0.011
TokenHere
Token ID4342
Feature activation-1.79e+00

Pos logit contributions
Years+0.089
umm+0.088
Days+0.088
iked+0.084
 By+0.083
Neg logit contributions
?"-0.087
 Anna-0.086
 sob-0.083
 Sunny-0.078
 ther-0.078
Token,
Token ID11
Feature activation-1.44e+00

Pos logit contributions
chard+0.043
aj+0.041
anut+0.041
 sob+0.040
maid+0.039
Neg logit contributions
 days-0.033
 determination-0.030
 months-0.030
 endings-0.029
 weeks-0.029
Token Tom
Token ID4186
Feature activation-1.63e+00

Pos logit contributions
 sob+0.026
?"+0.025
maid+0.025
chard+0.025
aj+0.024
Neg logit contributions
Years-0.037
Days-0.036
 lear-0.034
 determination-0.033
 example-0.032
Token,
Token ID11
Feature activation-1.52e+00

Pos logit contributions
chard+0.037
 sob+0.035
 ther+0.034
 Anna+0.033
 Milo+0.033
Neg logit contributions
 By-0.030
Years-0.030
 days-0.030
 lear-0.029
 example-0.029
Token you
Token ID345
Feature activation-2.30e+00

Pos logit contributions
?"+0.027
 sob+0.027
aj+0.026
chard+0.026
 mouth+0.026
Neg logit contributions
Years-0.038
Days-0.037
 lear-0.037
 example-0.035
 hours-0.035
Token pay
Token ID1414
Feature activation-3.79e-01

Pos logit contributions
Years+0.048
Days+0.048
umm+0.047
iked+0.044
 By+0.044
Neg logit contributions
?"-0.030
 Anna-0.027
 sob-0.026
 ther-0.024
 mouth-0.024
Token attention
Token ID3241
Feature activation+3.77e-01

Pos logit contributions
?"+0.008
 sob+0.007
 ther+0.007
bug+0.007
chard+0.007
Neg logit contributions
Years-0.009
 hours-0.009
Days-0.009
 days-0.009
 example-0.009
Token to
Token ID284
Feature activation-1.47e-01

Pos logit contributions
Years+0.010
Days+0.010
umm+0.010
 By+0.009
iked+0.009
Neg logit contributions
?"-0.007
 sob-0.006
 Anna-0.006
 ther-0.006
 mouth-0.006
Token where
Token ID810
Feature activation+6.16e-01

Pos logit contributions
?"+0.002
 ther+0.002
 sob+0.002
aj+0.002
maid+0.002
Neg logit contributions
Years-0.004
Days-0.004
 hours-0.004
 days-0.004
 example-0.004
Token he
Token ID339
Feature activation+7.34e-01

Pos logit contributions
Years+0.015
Days+0.014
 hours+0.014
 days+0.014
iked+0.014
Neg logit contributions
?"-0.012
 sob-0.012
anut-0.011
 ther-0.011
maid-0.011
Token was
Token ID373
Feature activation+8.98e-02

Pos logit contributions
Years+0.021
Days+0.021
umm+0.020
 hours+0.019
iked+0.019
Neg logit contributions
?"-0.013
 Anna-0.012
 sob-0.012
 ther-0.011
 mouth-0.011
Token going
Token ID1016
Feature activation+6.62e-01

Pos logit contributions
Years+0.002
Days+0.002
 By+0.002
umm+0.002
 hours+0.002
Neg logit contributions
 Anna-0.002
?"-0.002
 sob-0.002
 ther-0.002
 Milo-0.002
Token.
Token ID13
Feature activation+1.07e+00

Pos logit contributions
Years+0.023
Days+0.022
umm+0.022
Hours+0.021
 hours+0.021
Neg logit contributions
?"-0.010
 Anna-0.008
 sob-0.007
 ther-0.007
 mouth-0.007
Token Before
Token ID7413
Feature activation+5.20e-01

Pos logit contributions
Years+0.027
Days+0.026
umm+0.025
iked+0.024
Hours+0.024
Neg logit contributions
?"-0.023
 Anna-0.022
 sob-0.021
 mouth-0.020
 ther-0.020
Token he
Token ID339
Feature activation+9.85e-01

Pos logit contributions
Years+0.015
Days+0.015
umm+0.014
iked+0.013
 example+0.013
Neg logit contributions
?"-0.010
 Anna-0.010
 sob-0.009
 Mr-0.009
 mouth-0.009
Token knew
Token ID2993
Feature activation+1.00e+00

Pos logit contributions
Years+0.026
Days+0.026
umm+0.024
 By+0.024
 hours+0.024
Neg logit contributions
?"-0.019
 Anna-0.018
 sob-0.017
 ther-0.017
 Sunny-0.016
Token the
Token ID262
Feature activation+7.41e-01

Pos logit contributions
 Anna+0.003
?"+0.003
 sob+0.002
 Mr+0.002
 mouth+0.002
Neg logit contributions
Years-0.015
Days-0.015
umm-0.014
iked-0.014
Hours-0.014
Token classroom
Token ID15806
Feature activation+2.54e+00

Pos logit contributions
Years+0.019
Days+0.019
umm+0.019
iked+0.018
 By+0.017
Neg logit contributions
?"-0.015
 Anna-0.014
 sob-0.014
 mouth-0.013
 ther-0.013
Token.
Token ID13
Feature activation+9.62e-01

Pos logit contributions
umm+0.088
Years+0.086
Days+0.084
istence+0.081
Hours+0.079
Neg logit contributions
?"-0.036
 Anna-0.034
 saw-0.033
 sees-0.031
 mouth-0.030
Token She
Token ID1375
Feature activation-3.89e-01

Pos logit contributions
Days+0.026
Years+0.026
umm+0.025
 hours+0.024
iked+0.023
Neg logit contributions
 Anna-0.020
?"-0.018
 Luna-0.018
 mouth-0.017
 Ben-0.017
Token was
Token ID373
Feature activation+2.50e-01

Pos logit contributions
?"+0.006
 Anna+0.006
 sob+0.006
 ther+0.006
 mouth+0.005
Neg logit contributions
Years-0.011
Days-0.011
umm-0.010
 hours-0.010
Hours-0.010
Token excited
Token ID6568
Feature activation+2.87e+00

Pos logit contributions
Years+0.006
Days+0.006
 By+0.006
Hours+0.006
 hours+0.006
Neg logit contributions
?"-0.005
 ther-0.004
 sob-0.004
 Anna-0.004
 sandwich-0.004
Token because
Token ID780
Feature activation+7.97e-01

Pos logit contributions
Days+0.101
umm+0.097
Years+0.097
Hours+0.096
Eventually+0.093
Neg logit contributions
?"-0.045
 Anna-0.038
 sob-0.032
 yawn-0.032
 Ben-0.032
Token her
Token ID607
Feature activation+2.54e-01

Pos logit contributions
umm+0.023
Days+0.023
Years+0.023
istence+0.021
iked+0.021
Neg logit contributions
?"-0.015
 Anna-0.015
 Ben-0.013
 Mr-0.013
 Sunny-0.012
Token teacher
Token ID4701
Feature activation+1.27e+00

Pos logit contributions
Years+0.007
Days+0.007
umm+0.007
iked+0.007
 hours+0.007
Neg logit contributions
?"-0.005
 sob-0.004
 Anna-0.004
 ther-0.004
 Sunny-0.004
Token was
Token ID373
Feature activation+7.50e-01

Pos logit contributions
Years+0.039
Days+0.039
umm+0.039
Hours+0.036
iked+0.036
Neg logit contributions
?"-0.020
 Anna-0.018
 sob-0.018
 mouth-0.016
 ther-0.016
Token going
Token ID1016
Feature activation+1.94e+00

Pos logit contributions
Years+0.018
Days+0.017
 By+0.016
umm+0.016
 hours+0.016
Neg logit contributions
?"-0.016
 sob-0.015
 ther-0.015
 Anna-0.015
 mouth-0.015
Token and
Token ID290
Feature activation-9.55e-01

Pos logit contributions
Years+0.000
Days+0.000
umm+0.000
iked+0.000
 By+0.000
Neg logit contributions
?"-0.000
 Anna-0.000
 sob-0.000
 ther-0.000
 mouth-0.000
Token went
Token ID1816
Feature activation-2.54e-01

Pos logit contributions
?"+0.017
 Anna+0.015
 sob+0.015
 ther+0.015
 Frost+0.014
Neg logit contributions
Years-0.026
Days-0.026
 hours-0.025
iked-0.024
Hours-0.024
Token to
Token ID284
Feature activation-1.06e+00

Pos logit contributions
?"+0.005
 sob+0.005
 Anna+0.005
 mouth+0.004
 ther+0.004
Neg logit contributions
Years-0.007
Days-0.006
umm-0.006
iked-0.006
 By-0.006
Token the
Token ID262
Feature activation-6.47e-01

Pos logit contributions
 sob+0.018
anut+0.018
?"+0.018
ccoli+0.018
 ther+0.017
Neg logit contributions
Years-0.027
 hours-0.026
 By-0.026
 example-0.026
Days-0.025
Token party
Token ID2151
Feature activation+9.83e-01

Pos logit contributions
?"+0.012
cks+0.011
 Frost+0.011
 sob+0.011
ccoli+0.010
Neg logit contributions
Days-0.018
Years-0.017
umm-0.016
 hours-0.016
 days-0.016
Token.
Token ID13
Feature activation+9.28e-01

Pos logit contributions
Years+0.033
Days+0.032
umm+0.032
Hours+0.030
istence+0.030
Neg logit contributions
?"-0.014
 Anna-0.013
 sob-0.012
 ther-0.011
 mouth-0.011
Token She
Token ID1375
Feature activation-6.09e-01

Pos logit contributions
Days+0.028
Years+0.027
umm+0.026
ths+0.025
 hours+0.024
Neg logit contributions
 Anna-0.019
 Lily-0.017
?"-0.017
 Mr-0.016
 Ben-0.016
Token played
Token ID2826
Feature activation+9.62e-01

Pos logit contributions
?"+0.010
 Anna+0.009
 sob+0.009
 ther+0.009
 Sunny+0.009
Neg logit contributions
Years-0.018
Days-0.017
 hours-0.016
iked-0.016
Hours-0.016
Token with
Token ID351
Feature activation-5.09e-01

Pos logit contributions
Years+0.027
Days+0.026
 hours+0.025
 days+0.024
 lear+0.024
Neg logit contributions
?"-0.017
 sob-0.016
 mouth-0.015
chard-0.015
anut-0.015
Token her
Token ID607
Feature activation-2.93e-01

Pos logit contributions
?"+0.011
ccoli+0.011
 ther+0.010
 sob+0.010
aj+0.010
Neg logit contributions
 hours-0.012
 days-0.012
 weeks-0.011
Years-0.011
 months-0.011
Token friends
Token ID2460
Feature activation+8.12e-01

Pos logit contributions
?"+0.005
 Frost+0.005
 Sunny+0.005
 sob+0.005
cks+0.005
Neg logit contributions
 days-0.007
 hours-0.007
Years-0.007
Days-0.007
 weeks-0.007
Token r
Token ID374
Feature activation+2.62e+00

Pos logit contributions
Years+0.003
Days+0.003
umm+0.003
 By+0.003
Hours+0.003
Neg logit contributions
?"-0.002
 Anna-0.002
 sob-0.002
 ther-0.002
 mouth-0.001
Tokenained
Token ID1328
Feature activation+1.56e+00

Pos logit contributions
Years+0.047
Days+0.046
umm+0.043
 lear+0.041
 hours+0.041
Neg logit contributions
?"-0.076
 Anna-0.070
 sob-0.070
 ther-0.068
 mouth-0.067
Token.
Token ID13
Feature activation+1.10e+00

Pos logit contributions
umm+0.048
Days+0.048
Years+0.047
iked+0.044
Hours+0.044
Neg logit contributions
?"-0.026
 Anna-0.024
 ther-0.021
 sob-0.021
 Fl-0.020
Token The
Token ID383
Feature activation+5.14e-01

Pos logit contributions
Years+0.033
Days+0.033
umm+0.032
 By+0.032
Hours+0.030
Neg logit contributions
?"-0.018
 sob-0.016
 Anna-0.015
 ther-0.015
 angry-0.014
Token end
Token ID886
Feature activation+1.17e+00

Pos logit contributions
Years+0.013
Days+0.012
umm+0.012
 By+0.011
iked+0.011
Neg logit contributions
?"-0.011
 Anna-0.011
 sob-0.010
 ther-0.010
 mouth-0.010
Token.
Token ID13
Feature activation+1.22e+00

Pos logit contributions
Years+0.033
Days+0.032
umm+0.030
 By+0.030
Hours+0.029
Neg logit contributions
?"-0.021
chard-0.020
 sob-0.020
anut-0.019
 Anna-0.019
Token<|endoftext|>
Token ID50256
Feature activation-1.63e+00

Pos logit contributions
Years+0.032
 By+0.031
Days+0.031
umm+0.030
Eventually+0.029
Neg logit contributions
?"-0.024
 sob-0.022
maid-0.021
chard-0.020
 Anna-0.020
TokenOnce
Token ID7454
Feature activation-3.80e+00

Pos logit contributions
 sob+0.033
anut+0.033
?"+0.033
chard+0.032
 angry+0.032
Neg logit contributions
 By-0.038
Years-0.035
 hours-0.035
Days-0.034
 days-0.034
Token upon
Token ID2402
Feature activation-5.95e-01

Pos logit contributions
 sob+0.057
 mac+0.056
 poop+0.052
cks+0.051
anut+0.049
Neg logit contributions
 By-0.110
Years-0.106
iked-0.106
 days-0.104
way-0.104
Token a
Token ID257
Feature activation+7.24e-01

Pos logit contributions
?"+0.010
 sob+0.010
 mouth+0.010
maid+0.009
 Anna+0.009
Neg logit contributions
Years-0.016
Days-0.016
 hours-0.015
 By-0.014
Hours-0.014
Token time
Token ID640
Feature activation-1.25e+00

Pos logit contributions
umm+0.017
Years+0.017
Days+0.017
iked+0.016
istence+0.015
Neg logit contributions
?"-0.017
 Anna-0.015
 sob-0.015
 ther-0.014
 mouth-0.014
Token are
Token ID389
Feature activation+1.93e-01

Pos logit contributions
?"+0.026
 ther+0.026
maid+0.025
aj+0.025
ccoli+0.024
Neg logit contributions
 lear-0.032
Years-0.031
Days-0.031
 determination-0.029
 hours-0.029
Token fast
Token ID3049
Feature activation+7.57e-01

Pos logit contributions
Days+0.005
 hours+0.005
Years+0.005
Hours+0.005
 days+0.005
Neg logit contributions
?"-0.004
maid-0.003
 ther-0.003
ccoli-0.003
 mac-0.003
Token.
Token ID13
Feature activation+1.62e+00

Pos logit contributions
Years+0.026
Days+0.026
umm+0.025
Hours+0.024
Eventually+0.024
Neg logit contributions
?"-0.010
 Anna-0.010
 sob-0.009
 angry-0.009
 sees-0.008
Token They
Token ID1119
Feature activation-1.34e+00

Pos logit contributions
Years+0.039
Days+0.039
umm+0.037
iked+0.035
Hours+0.034
Neg logit contributions
?"-0.038
 Anna-0.037
 sob-0.034
 Ben-0.033
 mouth-0.033
Token are
Token ID389
Feature activation+3.48e-01

Pos logit contributions
?"+0.024
 ther+0.023
 sob+0.022
maid+0.022
 Anna+0.022
Neg logit contributions
Years-0.035
Days-0.034
 lear-0.034
 hours-0.032
istence-0.031
Token happy
Token ID3772
Feature activation+7.36e-01

Pos logit contributions
Days+0.009
Years+0.008
 hours+0.008
Hours+0.008
 days+0.008
Neg logit contributions
?"-0.007
maid-0.006
 ther-0.006
 sob-0.006
ccoli-0.006
Token.
Token ID13
Feature activation+1.09e+00

Pos logit contributions
Years+0.023
Days+0.023
umm+0.022
 By+0.021
iked+0.021
Neg logit contributions
?"-0.012
 Anna-0.010
 sob-0.010
 ther-0.010
 mouth-0.009
Token
Token ID198
Feature activation-1.37e+00

Pos logit contributions
Years+0.026
Days+0.025
umm+0.024
 By+0.023
iked+0.023
Neg logit contributions
?"-0.025
 Anna-0.024
 sob-0.023
 ther-0.022
 mouth-0.022
Token
Token ID198
Feature activation+1.14e-01

Pos logit contributions
?"+0.025
 sob+0.024
maid+0.024
chard+0.024
 sandwich+0.024
Neg logit contributions
Years-0.036
Days-0.035
 By-0.034
Eventually-0.033
Hours-0.033
TokenThey
Token ID2990
Feature activation-8.18e-01

Pos logit contributions
umm+0.003
Years+0.003
Days+0.003
iked+0.003
 lear+0.003
Neg logit contributions
?"-0.002
 Anna-0.002
 sob-0.002
 ther-0.002
 Frost-0.002
Token reach
Token ID3151
Feature activation-1.86e-01

Pos logit contributions
?"+0.015
 sob+0.014
 ther+0.014
 mouth+0.014
 Anna+0.014
Neg logit contributions
Years-0.021
Days-0.021
 lear-0.020
 hours-0.019
 By-0.019
Token in
Token ID287
Feature activation-1.13e+00

Pos logit contributions
?"+0.010
 Anna+0.009
 sob+0.008
 ther+0.007
 mouth+0.007
Neg logit contributions
Years-0.031
Days-0.031
umm-0.031
iked-0.029
Hours-0.029
Token a
Token ID257
Feature activation-1.80e+00

Pos logit contributions
?"+0.010
 Anna+0.008
 sob+0.008
 ther+0.006
 mouth+0.006
Neg logit contributions
Years-0.043
Days-0.043
umm-0.042
 By-0.040
iked-0.040
Token small
Token ID1402
Feature activation-3.64e-01

Pos logit contributions
 sob+0.033
?"+0.031
 Anna+0.030
ccoli+0.029
 Frost+0.028
Neg logit contributions
Days-0.050
Years-0.049
umm-0.046
Hours-0.046
 hours-0.045
Token park
Token ID3952
Feature activation-1.75e-01

Pos logit contributions
?"+0.008
 sob+0.008
 Anna+0.007
 ther+0.007
 Frost+0.007
Neg logit contributions
umm-0.008
Years-0.008
Days-0.008
 hours-0.008
 example-0.008
Token,
Token ID11
Feature activation-9.51e-01

Pos logit contributions
?"+0.002
 Anna+0.002
 mouth+0.002
 sob+0.002
 ther+0.001
Neg logit contributions
umm-0.006
Years-0.006
Days-0.006
Hours-0.006
istence-0.006
Token looking
Token ID2045
Feature activation-3.79e-01

Pos logit contributions
?"+0.017
 Anna+0.015
 sob+0.015
 ther+0.014
 mouth+0.014
Neg logit contributions
Years-0.028
Days-0.028
umm-0.028
Hours-0.026
iked-0.026
Token for
Token ID329
Feature activation+3.59e-01

Pos logit contributions
?"+0.007
 Anna+0.007
 sob+0.007
 ther+0.006
 mouth+0.006
Neg logit contributions
Years-0.010
Days-0.010
umm-0.010
 By-0.009
iked-0.009
Token food
Token ID2057
Feature activation-3.54e-01

Pos logit contributions
Years+0.010
Days+0.010
 hours+0.010
umm+0.010
 days+0.010
Neg logit contributions
?"-0.006
 sob-0.006
 Anna-0.006
 ther-0.005
 Frost-0.005
Token to
Token ID284
Feature activation-2.00e-01

Pos logit contributions
?"+0.006
 Anna+0.005
 sob+0.005
 ther+0.005
 mouth+0.005
Neg logit contributions
Years-0.011
Days-0.010
umm-0.010
 By-0.010
iked-0.010
Token eat
Token ID4483
Feature activation-2.00e+00

Pos logit contributions
aj+0.004
maid+0.004
 sob+0.004
ccoli+0.004
?"+0.004
Neg logit contributions
Years-0.005
Days-0.005
 hours-0.005
 days-0.004
 months-0.004
Token.
Token ID13
Feature activation+2.37e-01

Pos logit contributions
 sob+0.035
?"+0.035
 Anna+0.033
 mouth+0.031
 sees+0.031
Neg logit contributions
Years-0.054
Days-0.053
umm-0.052
 hours-0.051
 days-0.050
Token pulled
Token ID5954
Feature activation-3.31e-01

Pos logit contributions
?"+0.037
 Anna+0.033
 sob+0.032
 ther+0.032
 Sunny+0.030
Neg logit contributions
Years-0.060
Days-0.060
umm-0.056
Hours-0.054
 hours-0.054
Token it
Token ID340
Feature activation-6.24e-01

Pos logit contributions
?"+0.004
 sob+0.004
 Anna+0.003
 ther+0.003
 mouth+0.003
Neg logit contributions
Years-0.011
Days-0.011
umm-0.011
 By-0.010
Hours-0.010
Token away
Token ID1497
Feature activation-1.02e+00

Pos logit contributions
?"+0.007
 Anna+0.007
 sob+0.006
 angry+0.006
 mouth+0.006
Neg logit contributions
umm-0.022
Years-0.022
Days-0.022
 hours-0.021
iked-0.021
Token from
Token ID422
Feature activation-1.55e+00

Pos logit contributions
 Anna+0.012
?"+0.012
 sob+0.012
 angry+0.011
 Mr+0.010
Neg logit contributions
umm-0.036
Days-0.035
Years-0.035
iked-0.033
Eventually-0.033
Token her
Token ID607
Feature activation+1.22e-01

Pos logit contributions
?"+0.014
 sob+0.011
 ther+0.011
 Anna+0.010
 mouth+0.010
Neg logit contributions
Years-0.058
Days-0.057
umm-0.055
 hours-0.054
Hours-0.054
Token.
Token ID13
Feature activation-1.04e+00

Pos logit contributions
Years+0.004
Days+0.004
umm+0.004
 By+0.004
 hours+0.004
Neg logit contributions
?"-0.002
 sob-0.002
 Anna-0.002
 ther-0.001
 Sunny-0.001
Token "
Token ID366
Feature activation+3.53e+00

Pos logit contributions
?"+0.013
 Anna+0.012
 sob+0.011
 ther+0.010
 mouth+0.010
Neg logit contributions
Years-0.035
Days-0.035
umm-0.034
 By-0.032
iked-0.032
TokenYou
Token ID1639
Feature activation-2.10e+00

Pos logit contributions
umm+0.088
iked+0.084
Years+0.083
Days+0.081
 By+0.079
Neg logit contributions
?"-0.075
 Anna-0.073
 sob-0.071
 angry-0.068
 ther-0.065
Token have
Token ID423
Feature activation+6.30e-01

Pos logit contributions
?"+0.032
 Anna+0.032
 sob+0.032
 ther+0.030
 Sunny+0.029
Neg logit contributions
Years-0.060
Days-0.060
umm-0.058
 By-0.056
Hours-0.055
Token enough
Token ID1576
Feature activation-1.36e+00

Pos logit contributions
Years+0.017
Days+0.017
umm+0.016
iked+0.015
 By+0.015
Neg logit contributions
?"-0.012
 Anna-0.011
 sob-0.011
 ther-0.011
 mouth-0.011
Token red
Token ID2266
Feature activation-3.10e-01

Pos logit contributions
?"+0.024
 Anna+0.021
 mouth+0.019
 sob+0.018
 ther+0.018
Neg logit contributions
Days-0.043
Years-0.043
umm-0.041
Hours-0.039
iked-0.038
TokenM
Token ID44
Feature activation+1.10e+00

Pos logit contributions
umm+0.019
Years+0.018
Days+0.018
iked+0.017
 hours+0.017
Neg logit contributions
?"-0.016
 Anna-0.015
 sob-0.014
 ther-0.014
 Mr-0.013
Tokenum
Token ID388
Feature activation-1.72e-01

Pos logit contributions
Days+0.025
Years+0.025
 hours+0.023
umm+0.023
istence+0.023
Neg logit contributions
?"-0.028
 Anna-0.024
 Sara-0.023
 sob-0.023
 ther-0.023
Token and
Token ID290
Feature activation-2.06e+00

Pos logit contributions
?"+0.003
 Anna+0.002
 ther+0.002
 sob+0.002
 mouth+0.002
Neg logit contributions
Years-0.005
Days-0.005
iked-0.005
 hours-0.005
umm-0.005
Token Dad
Token ID17415
Feature activation+9.71e-01

Pos logit contributions
?"+0.035
 sob+0.032
 Anna+0.030
 ther+0.030
 hurts+0.029
Neg logit contributions
Years-0.059
Days-0.056
umm-0.056
iked-0.055
 lear-0.055
Token hugged
Token ID41130
Feature activation+3.89e-01

Pos logit contributions
Years+0.030
Days+0.030
umm+0.029
iked+0.028
 hours+0.028
Neg logit contributions
?"-0.015
 Anna-0.013
 sob-0.013
 ther-0.012
 mouth-0.012
Token each
Token ID1123
Feature activation-3.06e-01

Pos logit contributions
Years+0.012
Days+0.012
umm+0.012
 hours+0.012
iked+0.011
Neg logit contributions
?"-0.006
 sob-0.005
 Anna-0.005
 ther-0.005
 mouth-0.004
Token other
Token ID584
Feature activation+7.41e-01

Pos logit contributions
?"+0.005
 sob+0.005
 Anna+0.005
 ther+0.005
 mouth+0.004
Neg logit contributions
Years-0.009
Days-0.009
umm-0.008
 hours-0.008
 By-0.008
Token and
Token ID290
Feature activation+3.39e-01

Pos logit contributions
Years+0.021
Days+0.021
umm+0.020
 By+0.019
iked+0.019
Neg logit contributions
?"-0.013
 Anna-0.012
 sob-0.012
 ther-0.011
 mouth-0.011
Token kissed
Token ID27946
Feature activation+6.41e-01

Pos logit contributions
Years+0.011
umm+0.010
 hours+0.010
Days+0.010
iked+0.010
Neg logit contributions
?"-0.005
 sob-0.004
 ther-0.004
 Anna-0.004
 mouth-0.004
Token on
Token ID319
Feature activation-3.29e-01

Pos logit contributions
Years+0.020
Days+0.019
umm+0.019
 hours+0.018
iked+0.018
Neg logit contributions
?"-0.010
 sob-0.009
 Anna-0.008
 ther-0.008
 mouth-0.008
Token the
Token ID262
Feature activation-4.65e-01

Pos logit contributions
?"+0.004
 sob+0.003
 Anna+0.003
 ther+0.003
 mouth+0.002
Neg logit contributions
Years-0.012
Days-0.012
umm-0.011
 By-0.011
 hours-0.011
Token.
Token ID13
Feature activation-1.53e+00

Pos logit contributions
?"+0.019
 ther+0.017
 Anna+0.017
 sob+0.016
 sees+0.016
Neg logit contributions
Years-0.028
Days-0.027
iked-0.026
 By-0.026
 lear-0.026
Token
Token ID198
Feature activation-1.71e+00

Pos logit contributions
?"+0.031
 sob+0.028
 ther+0.027
 Anna+0.027
 sandwich+0.026
Neg logit contributions
Years-0.039
Days-0.038
 By-0.037
umm-0.037
Hours-0.035
Token
Token ID198
Feature activation-4.90e-01

Pos logit contributions
?"+0.032
 sob+0.030
chard+0.029
 mouth+0.029
 ther+0.029
Neg logit contributions
Years-0.046
Days-0.045
 By-0.042
Hours-0.041
Eventually-0.041
TokenTom
Token ID13787
Feature activation-2.45e-01

Pos logit contributions
?"+0.010
 Anna+0.009
 sob+0.009
 ther+0.008
 Sunny+0.008
Neg logit contributions
umm-0.013
Years-0.013
Days-0.013
iked-0.012
 hours-0.012
Token takes
Token ID2753
Feature activation-7.07e-01

Pos logit contributions
chard+0.006
?"+0.005
 ther+0.005
 sob+0.005
 Anna+0.005
Neg logit contributions
Years-0.005
 By-0.005
Days-0.005
 lear-0.005
 example-0.005
Token a
Token ID257
Feature activation-1.04e+00

Pos logit contributions
 sob+0.011
?"+0.010
chard+0.010
ccoli+0.010
aj+0.010
Neg logit contributions
 example-0.021
Years-0.020
 days-0.020
 hours-0.020
Days-0.020
Token deep
Token ID2769
Feature activation-1.62e-01

Pos logit contributions
?"+0.020
anut+0.020
cks+0.019
aj+0.017
 sob+0.017
Neg logit contributions
 lear-0.024
igsaw-0.022
 achievement-0.022
Years-0.022
 occasion-0.021
Token breath
Token ID8033
Feature activation-8.99e-01

Pos logit contributions
chard+0.003
?"+0.003
 Mr+0.003
 Anna+0.003
 sob+0.003
Neg logit contributions
Years-0.004
 example-0.004
 lear-0.004
Days-0.003
umm-0.003
Token.
Token ID13
Feature activation+4.79e-01

Pos logit contributions
?"+0.014
 Anna+0.012
 sob+0.011
 angry+0.011
 Mr+0.010
Neg logit contributions
Years-0.030
Days-0.030
umm-0.029
Hours-0.027
 hours-0.027
Token He
Token ID679
Feature activation-3.05e+00

Pos logit contributions
Years+0.012
Days+0.012
umm+0.011
 By+0.011
iked+0.011
Neg logit contributions
?"-0.010
 Anna-0.010
 sob-0.010
 ther-0.009
 mouth-0.009
Token walks
Token ID11114
Feature activation+1.11e+00

Pos logit contributions
chard+0.046
?"+0.043
 ther+0.041
 Anna+0.040
 sob+0.040
Neg logit contributions
Years-0.090
Days-0.085
 lear-0.083
 By-0.083
Hours-0.081
Token to
Token ID284
Feature activation+1.23e-01

Pos logit contributions
?"+0.016
 sob+0.015
 Anna+0.015
 ther+0.015
 mouth+0.014
Neg logit contributions
Years-0.021
Days-0.021
umm-0.020
 By-0.019
Hours-0.019
Token cross
Token ID3272
Feature activation-2.22e+00

Pos logit contributions
Days+0.003
Years+0.003
Hours+0.003
 lear+0.003
 example+0.003
Neg logit contributions
 ther-0.002
 Anna-0.002
?"-0.002
chard-0.002
 sob-0.002
Token the
Token ID262
Feature activation+2.05e-01

Pos logit contributions
 sob+0.030
 sandwich+0.028
 ther+0.028
 mouth+0.026
maid+0.026
Neg logit contributions
Years-0.066
Days-0.064
 example-0.064
 hours-0.063
 days-0.062
Token p
Token ID279
Feature activation-3.25e-02

Pos logit contributions
Days+0.007
Years+0.007
umm+0.007
 hours+0.006
Hours+0.006
Neg logit contributions
?"-0.003
 Anna-0.003
 sob-0.002
 mouth-0.002
 Sunny-0.002
Tokenuddle
Token ID24500
Feature activation-1.85e+00

Pos logit contributions
?"+0.001
 Anna+0.001
 sob+0.001
 Sunny+0.001
 ther+0.001
Neg logit contributions
Years-0.001
Days-0.001
umm-0.001
iked-0.001
 By-0.001
Token,
Token ID11
Feature activation-2.10e+00

Pos logit contributions
?"+0.032
 Anna+0.029
 sob+0.029
 ther+0.027
 mouth+0.027
Neg logit contributions
Years-0.053
Days-0.053
umm-0.051
 By-0.048
iked-0.048
Token can
Token ID460
Feature activation+3.68e-01

Pos logit contributions
?"+0.042
cks+0.040
 sob+0.040
anut+0.039
 ther+0.039
Neg logit contributions
Years-0.051
Days-0.050
 By-0.047
umm-0.046
 hours-0.046
Token you
Token ID345
Feature activation-1.01e+00

Pos logit contributions
Days+0.008
Years+0.008
umm+0.007
Hours+0.007
 By+0.007
Neg logit contributions
aj-0.008
?"-0.008
maid-0.008
 mac-0.008
 ther-0.008
Token help
Token ID1037
Feature activation-2.69e+00

Pos logit contributions
 Anna+0.020
 sob+0.019
 ther+0.018
?"+0.018
 mouth+0.018
Neg logit contributions
Years-0.026
Days-0.026
Hours-0.023
umm-0.023
 By-0.023
Token me
Token ID502
Feature activation-3.02e+00

Pos logit contributions
 ther+0.048
aj+0.048
ccoli+0.046
maid+0.045
 sob+0.045
Neg logit contributions
Years-0.071
Days-0.069
 days-0.066
 example-0.064
 hours-0.064
Token?"
Token ID1701
Feature activation-1.27e+00

Pos logit contributions
 Anna+0.037
 sob+0.034
 ther+0.033
 Luna+0.030
 Mr+0.030
Neg logit contributions
Years-0.096
Days-0.093
 example-0.089
Hours-0.088
 By-0.087
Token
Token ID198
Feature activation-2.18e+00

Pos logit contributions
?"+0.009
anut+0.008
maid+0.008
 sob+0.008
chard+0.008
Neg logit contributions
 By-0.009
Years-0.009
Days-0.009
 Hours-0.008
Hours-0.008
Token
Token ID198
Feature activation-6.29e-01

Pos logit contributions
?"+0.040
 sob+0.039
 mouth+0.038
 ther+0.037
 sandwich+0.037
Neg logit contributions
Years-0.058
Days-0.057
 By-0.054
Hours-0.052
Eventually-0.052
Token"
Token ID1
Feature activation+2.59e+00

Pos logit contributions
?"+0.013
 Anna+0.012
 sob+0.012
 mouth+0.012
 ther+0.012
Neg logit contributions
Years-0.016
Days-0.016
umm-0.015
Hours-0.014
 By-0.014
TokenWhat
Token ID2061
Feature activation-1.31e+00

Pos logit contributions
Days+0.055
Years+0.054
umm+0.050
Hours+0.049
 hours+0.048
Neg logit contributions
?"-0.067
 mouth-0.062
 sob-0.060
 Anna-0.060
 sandwich-0.058
Token was
Token ID373
Feature activation-8.72e-01

Pos logit contributions
chard+0.022
anut+0.022
 sob+0.021
 Nib+0.020
aj+0.019
Neg logit contributions
 By-0.033
Days-0.033
Years-0.033
 days-0.032
 hours-0.032
Token that
Token ID326
Feature activation-4.11e+00

Pos logit contributions
?"+0.020
 Anna+0.019
 sob+0.019
 ther+0.018
 mouth+0.018
Neg logit contributions
Years-0.020
Days-0.020
umm-0.019
 By-0.018
 hours-0.018
Token?"
Token ID1701
Feature activation-3.52e+00

Pos logit contributions
anut+0.083
chard+0.080
aj+0.080
bugs+0.078
ighed+0.077
Neg logit contributions
umm-0.092
way-0.086
 days-0.083
 achievement-0.081
astic-0.080
Token asked
Token ID1965
Feature activation-3.54e+00

Pos logit contributions
?"+0.050
maid+0.050
anut+0.049
ccoli+0.047
 mac+0.043
Neg logit contributions
Years-0.102
Days-0.100
 By-0.099
 hours-0.097
iked-0.096
Token Lily
Token ID20037
Feature activation-2.29e+00

Pos logit contributions
?"+0.049
anut+0.046
ccoli+0.043
 sob+0.043
maid+0.043
Neg logit contributions
Years-0.105
iked-0.103
 hours-0.102
 days-0.102
Days-0.101
Token.
Token ID13
Feature activation-2.41e+00

Pos logit contributions
 ther+0.046
?"+0.045
 sandwich+0.044
 Anna+0.042
 trash+0.041
Neg logit contributions
Years-0.051
Days-0.051
udging-0.048
 By-0.048
iked-0.048
Token
Token ID198
Feature activation-2.76e+00

Pos logit contributions
?"+0.041
 Anna+0.036
 sob+0.036
 ther+0.035
 mouth+0.034
Neg logit contributions
Years-0.070
Days-0.069
umm-0.066
 By-0.065
Hours-0.064
Token to
Token ID284
Feature activation+1.22e-01

Pos logit contributions
Years+0.017
Days+0.017
umm+0.016
 By+0.016
 example+0.016
Neg logit contributions
?"-0.017
 sob-0.016
 Anna-0.016
 ther-0.016
 Frost-0.015
Token give
Token ID1577
Feature activation-2.02e+00

Pos logit contributions
Years+0.004
Days+0.004
umm+0.004
iked+0.003
 By+0.003
Neg logit contributions
?"-0.002
 Anna-0.002
 sob-0.002
 ther-0.002
 mouth-0.002
Token it
Token ID340
Feature activation-2.74e+00

Pos logit contributions
 polar+0.047
 ther+0.045
 sob+0.045
 camel+0.045
aj+0.043
Neg logit contributions
 example-0.043
Years-0.042
 days-0.041
 hours-0.040
Days-0.039
Token back
Token ID736
Feature activation-3.88e+00

Pos logit contributions
 sob+0.050
 ther+0.047
cks+0.046
?"+0.046
 Milo+0.044
Neg logit contributions
Years-0.075
Days-0.071
 example-0.068
 days-0.067
Hours-0.067
Token to
Token ID284
Feature activation-2.64e+00

Pos logit contributions
 sob+0.084
 ther+0.080
 Milo+0.077
 frown+0.076
 Anna+0.076
Neg logit contributions
Years-0.088
Days-0.083
 days-0.082
 hours-0.081
 example-0.081
Token me
Token ID502
Feature activation-4.03e+00

Pos logit contributions
 ther+0.035
 sob+0.035
 camel+0.032
maid+0.031
aj+0.031
Neg logit contributions
Years-0.078
 hours-0.078
 days-0.077
 months-0.076
 example-0.076
Token later
Token ID1568
Feature activation-3.24e+00

Pos logit contributions
 ther+0.072
 sob+0.070
 Frost+0.066
 camel+0.065
 trash+0.065
Neg logit contributions
Years-0.101
 example-0.098
Days-0.096
 days-0.094
 hours-0.094
Token.
Token ID13
Feature activation+9.20e-02

Pos logit contributions
?"+0.040
 sob+0.037
 Anna+0.036
 ther+0.034
 mouth+0.033
Neg logit contributions
Years-0.105
Days-0.104
umm-0.103
 By-0.099
iked-0.098
Token Do
Token ID2141
Feature activation+2.93e-01

Pos logit contributions
Years+0.002
 By+0.002
umm+0.002
Days+0.002
 example+0.002
Neg logit contributions
?"-0.002
 sob-0.002
 angry-0.002
 ther-0.002
 Anna-0.002
Token you
Token ID345
Feature activation+3.00e-01

Pos logit contributions
Years+0.008
Days+0.008
 By+0.007
umm+0.007
Hours+0.007
Neg logit contributions
?"-0.005
 sob-0.005
 ther-0.005
 pot-0.005
 mouth-0.005
Token mind
Token ID2000
Feature activation-2.34e+00

Pos logit contributions
Years+0.007
Days+0.006
 lear+0.006
 By+0.006
 hours+0.006
Neg logit contributions
 ther-0.007
 Anna-0.007
 sob-0.007
?"-0.006
 Frost-0.006
Token are
Token ID389
Feature activation+1.17e-01

Pos logit contributions
chard+0.023
anut+0.023
 sob+0.023
maid+0.022
 ther+0.022
Neg logit contributions
 By-0.034
Years-0.034
ending-0.033
Days-0.033
 lear-0.033
Token you
Token ID345
Feature activation-3.21e+00

Pos logit contributions
Years+0.003
Days+0.003
umm+0.003
 By+0.003
iked+0.003
Neg logit contributions
?"-0.002
 Anna-0.002
 sob-0.002
 ther-0.002
 mouth-0.002
Token doing
Token ID1804
Feature activation-2.56e+00

Pos logit contributions
 Anna+0.087
 Sunny+0.085
 Frost+0.083
 ther+0.082
 sob+0.081
Neg logit contributions
Years-0.053
Days-0.051
 By-0.050
 example-0.050
umm-0.047
Token?"
Token ID1701
Feature activation-3.35e+00

Pos logit contributions
 sob+0.028
 mouth+0.027
 Nib+0.027
 camel+0.027
anut+0.026
Neg logit contributions
Years-0.082
umm-0.080
 example-0.080
Days-0.078
 By-0.076
Token The
Token ID383
Feature activation-9.04e-01

Pos logit contributions
ccoli+0.072
maid+0.068
anut+0.067
aj+0.066
 sandwich+0.064
Neg logit contributions
 By-0.076
 months-0.071
Well-0.067
 Hours-0.066
 weeks-0.066
Token farmer
Token ID18739
Feature activation-4.11e+00

Pos logit contributions
?"+0.019
 Anna+0.017
bug+0.017
 sob+0.017
 ther+0.016
Neg logit contributions
Days-0.024
Years-0.024
 example-0.021
 lear-0.021
Hours-0.020
Token says
Token ID1139
Feature activation-1.34e+00

Pos logit contributions
chard+0.068
ccoli+0.063
aj+0.061
 sandwich+0.060
anut+0.058
Neg logit contributions
 By-0.100
udging-0.099
 example-0.095
Years-0.094
ths-0.093
Token.
Token ID13
Feature activation-9.89e-01

Pos logit contributions
?"+0.025
 Anna+0.022
 ther+0.022
 sob+0.021
 Sunny+0.021
Neg logit contributions
Years-0.034
Days-0.034
iked-0.033
 By-0.033
umm-0.033
Token He
Token ID679
Feature activation-3.83e+00

Pos logit contributions
?"+0.022
 sob+0.020
anut+0.020
maid+0.019
chard+0.019
Neg logit contributions
Years-0.022
Days-0.022
 By-0.022
umm-0.021
 Hours-0.020
Token takes
Token ID2753
Feature activation-7.59e-01

Pos logit contributions
?"+0.057
chard+0.057
 Sunny+0.051
anut+0.050
 sob+0.049
Neg logit contributions
Years-0.114
Days-0.109
 By-0.104
 lear-0.103
 example-0.102
Token the
Token ID262
Feature activation-8.51e-01

Pos logit contributions
?"+0.013
ccoli+0.012
aj+0.012
chard+0.012
 sob+0.012
Neg logit contributions
 example-0.021
Years-0.020
 hours-0.020
 Hours-0.020
Days-0.020
Token oh
Token ID11752
Feature activation-1.46e+00

Pos logit contributions
?"+0.035
 sob+0.033
aj+0.031
chard+0.031
anut+0.031
Neg logit contributions
Days-0.069
 example-0.068
Years-0.068
 hours-0.068
umm-0.067
Token no
Token ID645
Feature activation-2.08e+00

Pos logit contributions
 sob+0.026
chard+0.026
?"+0.025
 Anna+0.025
anut+0.025
Neg logit contributions
Days-0.035
Years-0.035
umm-0.035
 example-0.034
 lear-0.034
Token,
Token ID11
Feature activation-2.33e+00

Pos logit contributions
?"+0.032
 Anna+0.030
 sob+0.029
 ther+0.027
 mouth+0.026
Neg logit contributions
Years-0.064
Days-0.063
umm-0.062
 By-0.059
iked-0.059
Token oh
Token ID11752
Feature activation-1.84e+00

Pos logit contributions
?"+0.039
 sob+0.037
aj+0.035
chard+0.033
 ther+0.032
Neg logit contributions
Days-0.062
Years-0.061
umm-0.061
 example-0.060
 hours-0.060
Token no
Token ID645
Feature activation-2.19e+00

Pos logit contributions
 sob+0.031
chard+0.031
anut+0.030
 Sunny+0.030
 Anna+0.029
Neg logit contributions
Days-0.045
Years-0.045
 example-0.045
umm-0.044
 By-0.044
Token!"
Token ID2474
Feature activation-3.70e+00

Pos logit contributions
?"+0.035
 Anna+0.033
 sob+0.033
 ther+0.031
 mouth+0.030
Neg logit contributions
Years-0.065
Days-0.064
umm-0.063
 By-0.059
Hours-0.059
Token Mom
Token ID11254
Feature activation-2.84e+00

Pos logit contributions
anut+0.049
maid+0.045
ccoli+0.044
?"+0.043
chard+0.041
Neg logit contributions
 By-0.116
Years-0.112
 hours-0.109
Days-0.108
 months-0.108
Token said
Token ID531
Feature activation-7.97e-01

Pos logit contributions
?"+0.044
chard+0.042
 sandwich+0.040
 ther+0.040
ccoli+0.040
Neg logit contributions
Years-0.076
Days-0.076
udging-0.074
iked-0.073
 By-0.073
Token.
Token ID13
Feature activation-1.69e+00

Pos logit contributions
?"+0.015
 ther+0.013
 Anna+0.012
 sob+0.012
 sandwich+0.012
Neg logit contributions
Years-0.021
Days-0.021
iked-0.020
umm-0.020
 By-0.020
Token
Token ID198
Feature activation-1.89e+00

Pos logit contributions
?"+0.035
 sob+0.031
 Anna+0.030
 ther+0.030
 sandwich+0.029
Neg logit contributions
Years-0.043
Days-0.043
umm-0.041
 By-0.040
Hours-0.039
Token
Token ID198
Feature activation-3.14e-01

Pos logit contributions
?"+0.036
chard+0.034
maid+0.034
 mouth+0.034
 sob+0.034
Neg logit contributions
Years-0.049
Days-0.048
 By-0.046
Eventually-0.044
Hours-0.044
Token be
Token ID307
Feature activation+7.45e-03

Pos logit contributions
Years+0.013
Days+0.013
umm+0.013
iked+0.012
 By+0.012
Neg logit contributions
?"-0.009
 Anna-0.008
 sob-0.008
 mouth-0.007
 ther-0.007
Token careful
Token ID8161
Feature activation-1.81e-01

Pos logit contributions
Years+0.000
Days+0.000
Hours+0.000
 example+0.000
 hours+0.000
Neg logit contributions
?"-0.000
 sob-0.000
 ther-0.000
 Sunny-0.000
maid-0.000
Token not
Token ID407
Feature activation+1.17e-01

Pos logit contributions
?"+0.003
 Anna+0.002
 sob+0.002
 ther+0.002
 mouth+0.002
Neg logit contributions
Years-0.006
Days-0.006
umm-0.005
 By-0.005
iked-0.005
Token to
Token ID284
Feature activation-1.43e-01

Pos logit contributions
Years+0.003
Days+0.003
umm+0.003
 hours+0.003
 By+0.003
Neg logit contributions
?"-0.002
 Anna-0.002
 sob-0.002
 mouth-0.002
 ther-0.002
Token fall
Token ID2121
Feature activation-2.90e+00

Pos logit contributions
?"+0.002
 sob+0.002
 Anna+0.002
 ther+0.002
 mouth+0.002
Neg logit contributions
Years-0.004
Days-0.004
umm-0.004
Hours-0.004
 By-0.004
Token off
Token ID572
Feature activation-3.94e+00

Pos logit contributions
?"+0.039
 sob+0.038
 Anna+0.037
 ther+0.035
 mouth+0.034
Neg logit contributions
Years-0.090
Days-0.089
umm-0.087
 By-0.084
iked-0.083
Token the
Token ID262
Feature activation-9.09e-01

Pos logit contributions
 sob+0.049
?"+0.042
 ther+0.040
 Anna+0.038
 mouth+0.038
Neg logit contributions
Years-0.126
Days-0.125
umm-0.123
 hours-0.122
 days-0.121
Token slide
Token ID10649
Feature activation-2.58e+00

Pos logit contributions
?"+0.013
 Anna+0.011
 sob+0.011
 ther+0.010
 mouth+0.010
Neg logit contributions
Years-0.030
Days-0.029
umm-0.029
 By-0.027
iked-0.027
Token!"
Token ID2474
Feature activation-2.26e+00

Pos logit contributions
?"+0.035
 Anna+0.031
 sob+0.031
 ther+0.029
 mouth+0.028
Neg logit contributions
Years-0.084
Days-0.083
umm-0.081
 By-0.077
iked-0.077
Token
Token ID198
Feature activation-2.44e+00

Pos logit contributions
?"+0.035
 sob+0.031
 Anna+0.030
 ther+0.029
 mouth+0.028
Neg logit contributions
Years-0.068
Days-0.067
 By-0.064
umm-0.064
iked-0.063
Token
Token ID198
Feature activation-3.38e-01

Pos logit contributions
chard+0.053
 sob+0.049
?"+0.049
 mac+0.049
maid+0.048
Neg logit contributions
Years-0.058
Days-0.056
 By-0.055
Eventually-0.053
Hours-0.052