TOP ACTIVATIONS
MAX = 7.361

do can prevent us from feeling upset in the long run
truck 's wings began to feel light . Bob kept singing
and he was beginning to feel a bit stiff . The
the tree and settled in between the branches . Everyone cheered
and it always made him feel so lovely . <|endoftext|> There
no one ever has to feel so helpless and lonely again
friend and made their home feel safe and happy . <|endoftext|>
We can take breaks in between and eventually make it to
boots and went outside to feel the wet and heavy rain
you need to rest to feel better and keep doing what
longer and longer breaks in between . The
bit and was beginning to feel tired . He decided to
needs a new battery to feel refreshed too ."
sang until she began to feel sleepy . Just as she
my 's leg started to feel better . From that day
higher and faster than ever before . So it hopped on
how it had made them feel . The mill
went on he began to feel better . He and his
blanket around her body to feel extra cozy . The little
and the girl began to feel sleepy . Mum put the

MIN ACTIVATIONS
MIN = -11.041

swimming around her . The mer maid smiled and said ,
again . He said the mer maid and a dolphin were
made . The mer maid learnt that taking risks
castle so much . The mer maid said , " I
water . The mer maid smiled at Melody .
demands , crab ," the mer maid said . " I
yes . She followed the mer maid through the hidden door
was so thankful that the mer maid had forgiven him .
Jack asked the mer maid why the waves were
. He even saw a mer maid !
ever and he thanked the mer maid for being so kind
. The strong mer maid thought about the promise
They had never seen a mer maid before .
sun . A mer maid that lived in the
, she saw a beautiful mer maid swimming around her .
Joe waved goodbye to the mer maid , and hopped back
, and she wanted the mer maid to come too .
beach , he saw a mer maid . She had beautiful
passport belonged to a magical mer maid who had been visiting
looked back and saw the mer maid still waving and that

INTERVAL 2.761 - 7.361
CONTAINS 6.595%

! Jimmy was so strong , he could swing
badly . Tim my felt very sorry and wished he had
are very tired and hungry . They eat a y ummy
it was going to be so much fun !
and it started to burn the paper . She tried to

INTERVAL -1.840 - 2.761
CONTAINS 78.710%

a bunny named Benny . Benny loved to eat rad ishes
Mitt ens loved to sleep all day long and never wanted
<|endoftext|> Once upon a time , there was a little boy
back to her home . She asked her mom if she
little girl named Lily . She loved to play on the

INTERVAL -6.440 - -1.840
CONTAINS 14.691%

please repeat it !" From then on , Lily
how to make jam . " This is so much fun
One day , they decided to skip over to the kitchen
when she accidentally knocked over a chair . The chair was
It craw ls on Lily 's tower . It makes the

INTERVAL -11.041 - -6.440
CONTAINS 0.004%

Sam and Emma used their imag inations to explore the distant
Let 's pretend we are mer maid s and swim in
park . Jack had a brand new toy with him .
opened her eyes and her t eddy bear was still in
a few shells and some seaw eed and built a castle
Token do
Token ID466
Feature activation+4.54e-01

Pos logit contributions
rily+0.036
ware+0.032
bell+0.030
ge+0.030
yt+0.030
Neg logit contributions
 much-0.019
 very-0.018
 many-0.018
e-0.018
 a-0.017
Token can
Token ID460
Feature activation-3.25e+00

Pos logit contributions
 much+0.006
 very+0.006
 many+0.005
e+0.005
 a+0.005
Neg logit contributions
rily-0.015
ware-0.014
ge-0.014
uing-0.014
bell-0.013
Token prevent
Token ID2948
Feature activation+3.84e+00

Pos logit contributions
rily+0.112
ware+0.096
yt+0.092
ly+0.089
ory+0.088
Neg logit contributions
 much-0.053
 very-0.053
e-0.050
 many-0.048
 a-0.048
Token us
Token ID514
Feature activation+2.57e+00

Pos logit contributions
 much+0.059
e+0.057
 very+0.054
 many+0.054
 named+0.052
Neg logit contributions
rily-0.117
ge-0.105
ware-0.104
uing-0.103
fully-0.099
Token from
Token ID422
Feature activation+5.50e+00

Pos logit contributions
 much+0.041
e+0.040
 very+0.039
 many+0.038
 afraid+0.035
Neg logit contributions
rily-0.079
ware-0.072
uing-0.069
ge-0.068
bell-0.067
Token feeling
Token ID4203
Feature activation+6.89e+00

Pos logit contributions
 much+0.081
e+0.077
 very+0.076
 many+0.075
 named+0.068
Neg logit contributions
rily-0.175
ware-0.159
uing-0.155
ge-0.155
bell-0.151
Token upset
Token ID9247
Feature activation+1.28e+00

Pos logit contributions
e+0.078
 much+0.074
 many+0.069
 very+0.067
 named+0.066
Neg logit contributions
rily-0.232
ge-0.211
ware-0.210
uing-0.210
bell-0.200
Token in
Token ID287
Feature activation+2.79e+00

Pos logit contributions
 much+0.015
 very+0.014
 many+0.014
e+0.013
 a+0.012
Neg logit contributions
rily-0.046
ware-0.043
bell-0.041
ge-0.041
uing-0.041
Token the
Token ID262
Feature activation+3.95e+00

Pos logit contributions
 much+0.034
e+0.032
 very+0.031
 many+0.031
 named+0.028
Neg logit contributions
rily-0.095
ware-0.086
ge-0.085
uing-0.085
bell-0.082
Token long
Token ID890
Feature activation-8.95e-01

Pos logit contributions
e+0.068
 much+0.064
 named+0.060
ow+0.059
 many+0.059
Neg logit contributions
rily-0.106
ware-0.098
uing-0.096
ge-0.094
bell-0.094
Token run
Token ID1057
Feature activation-1.10e+00

Pos logit contributions
rily+0.031
ware+0.028
uing+0.028
ge+0.028
bell+0.027
Neg logit contributions
 much-0.011
e-0.011
 very-0.010
 many-0.010
 named-0.009
Token truck
Token ID7779
Feature activation-4.43e-01

Pos logit contributions
rily+0.063
ge+0.055
ware+0.054
uing+0.053
yt+0.049
Neg logit contributions
 much-0.067
 very-0.066
e-0.065
 many-0.064
 named-0.062
Token's
Token ID338
Feature activation-2.05e-01

Pos logit contributions
rily+0.013
ware+0.012
ge+0.012
uing+0.012
yt+0.011
Neg logit contributions
 much-0.008
 very-0.008
e-0.007
 many-0.007
 named-0.007
Token wings
Token ID12098
Feature activation-4.57e-01

Pos logit contributions
rily+0.006
ware+0.006
ge+0.005
uing+0.005
bell+0.005
Neg logit contributions
 much-0.003
 very-0.003
e-0.003
 many-0.003
 named-0.003
Token began
Token ID2540
Feature activation+2.18e+00

Pos logit contributions
rily+0.015
ware+0.013
uing+0.013
ge+0.013
pron+0.012
Neg logit contributions
 much-0.007
 very-0.007
e-0.006
 many-0.006
 a-0.006
Token to
Token ID284
Feature activation+2.86e-01

Pos logit contributions
 much+0.041
 very+0.039
e+0.038
 many+0.038
 named+0.035
Neg logit contributions
rily-0.061
ware-0.055
uing-0.054
ge-0.053
bell-0.051
Token feel
Token ID1254
Feature activation+6.88e+00

Pos logit contributions
 much+0.006
 very+0.006
 many+0.005
e+0.005
 a+0.005
Neg logit contributions
rily-0.008
ware-0.007
uing-0.007
ge-0.006
ly-0.006
Token light
Token ID1657
Feature activation-5.87e-01

Pos logit contributions
e+0.109
ow+0.095
 named+0.091
 many+0.088
 ost+0.086
Neg logit contributions
rily-0.205
ware-0.174
uing-0.173
ge-0.172
bell-0.169
Token.
Token ID13
Feature activation+1.67e-01

Pos logit contributions
rily+0.019
ware+0.018
uing+0.018
ge+0.017
pron+0.017
Neg logit contributions
 very-0.008
 much-0.008
 a-0.007
 many-0.007
e-0.007
Token Bob
Token ID5811
Feature activation+9.91e-01

Pos logit contributions
e+0.003
 much+0.003
 very+0.003
 many+0.003
 named+0.002
Neg logit contributions
rily-0.005
ware-0.004
bell-0.004
ro-0.004
uing-0.004
Token kept
Token ID4030
Feature activation+1.94e-02

Pos logit contributions
 much+0.018
e+0.017
 many+0.017
 very+0.016
 glad+0.015
Neg logit contributions
rily-0.028
ware-0.025
ge-0.024
uing-0.024
bell-0.023
Token singing
Token ID13777
Feature activation-4.29e-02

Pos logit contributions
 much+0.000
e+0.000
 many+0.000
 very+0.000
 named+0.000
Neg logit contributions
rily-0.001
ware-0.001
ge-0.001
uing-0.001
bell-0.001
Token and
Token ID290
Feature activation+1.85e+00

Pos logit contributions
 much+0.022
 very+0.021
e+0.021
 many+0.020
 a+0.018
Neg logit contributions
rily-0.069
ware-0.064
uing-0.063
ge-0.061
bell-0.061
Token he
Token ID339
Feature activation-5.22e-01

Pos logit contributions
 much+0.029
 very+0.027
e+0.027
 many+0.027
 glad+0.024
Neg logit contributions
rily-0.058
ware-0.053
uing-0.051
ge-0.051
bell-0.049
Token was
Token ID373
Feature activation+2.32e+00

Pos logit contributions
rily+0.014
ware+0.013
ge+0.013
uing+0.013
bell+0.012
Neg logit contributions
 much-0.010
e-0.009
 many-0.009
 very-0.009
 glad-0.008
Token beginning
Token ID3726
Feature activation+2.48e+00

Pos logit contributions
 much+0.031
e+0.031
 many+0.030
 And+0.028
 named+0.027
Neg logit contributions
rily-0.072
ge-0.064
uing-0.063
ware-0.062
bell-0.061
Token to
Token ID284
Feature activation-1.57e-01

Pos logit contributions
e+0.057
 much+0.054
 named+0.052
 very+0.051
 many+0.051
Neg logit contributions
rily-0.056
ge-0.050
ware-0.049
uing-0.047
bell-0.046
Token feel
Token ID1254
Feature activation+6.85e+00

Pos logit contributions
rily+0.004
ware+0.004
ge+0.004
uing+0.004
bell+0.004
Neg logit contributions
 much-0.003
e-0.003
 very-0.003
 many-0.003
 named-0.003
Token a
Token ID257
Feature activation-2.05e-01

Pos logit contributions
e+0.092
 named+0.081
ow+0.076
 ost+0.075
aks+0.073
Neg logit contributions
rily-0.210
ware-0.179
ge-0.178
ly-0.176
uing-0.174
Token bit
Token ID1643
Feature activation+1.89e+00

Pos logit contributions
rily+0.006
ware+0.005
ge+0.005
uing+0.005
bell+0.005
Neg logit contributions
 much-0.004
e-0.004
 very-0.004
 many-0.004
 named-0.003
Token stiff
Token ID15175
Feature activation-1.19e+00

Pos logit contributions
 much+0.021
e+0.021
 many+0.019
 very+0.019
 named+0.017
Neg logit contributions
rily-0.066
ware-0.060
ge-0.059
uing-0.059
bell-0.057
Token.
Token ID13
Feature activation+8.20e-01

Pos logit contributions
rily+0.041
uing+0.039
ware+0.038
bell+0.038
ge+0.037
Neg logit contributions
 much-0.014
 very-0.014
 many-0.012
 a-0.012
e-0.012
Token The
Token ID383
Feature activation-5.56e-03

Pos logit contributions
e+0.012
 much+0.011
 very+0.011
 many+0.010
 afraid+0.010
Neg logit contributions
rily-0.026
ware-0.024
uing-0.023
ge-0.023
bell-0.022
Token the
Token ID262
Feature activation+3.57e+00

Pos logit contributions
 much+0.039
e+0.037
 very+0.035
 many+0.034
 named+0.030
Neg logit contributions
rily-0.157
ware-0.144
ge-0.142
uing-0.142
bell-0.138
Token tree
Token ID5509
Feature activation+1.94e+00

Pos logit contributions
e+0.063
ow+0.059
 much+0.058
 glad+0.056
 many+0.054
Neg logit contributions
rily-0.092
bell-0.085
ware-0.084
ly-0.081
ro-0.080
Token and
Token ID290
Feature activation+6.72e-01

Pos logit contributions
 much+0.027
e+0.026
 very+0.025
 many+0.025
 named+0.023
Neg logit contributions
rily-0.062
ware-0.056
ge-0.055
uing-0.055
bell-0.053
Token settled
Token ID10282
Feature activation-1.78e+00

Pos logit contributions
 much+0.012
e+0.012
 very+0.011
 many+0.011
 glad+0.010
Neg logit contributions
rily-0.019
ware-0.017
ge-0.017
uing-0.016
bell-0.016
Token in
Token ID287
Feature activation+3.64e+00

Pos logit contributions
rily+0.055
ware+0.052
uing+0.051
ge+0.051
bell+0.050
Neg logit contributions
 much-0.026
 very-0.026
 many-0.025
e-0.024
 a-0.023
Token between
Token ID1022
Feature activation+6.75e+00

Pos logit contributions
 much+0.042
e+0.039
 very+0.038
 many+0.037
 named+0.033
Neg logit contributions
rily-0.128
ware-0.118
uing-0.116
ge-0.116
bell-0.112
Token the
Token ID262
Feature activation+2.76e+00

Pos logit contributions
e+0.065
 much+0.065
 very+0.061
 many+0.057
 named+0.054
Neg logit contributions
rily-0.243
ware-0.219
ge-0.219
uing-0.215
bell-0.211
Token branches
Token ID13737
Feature activation+3.45e+00

Pos logit contributions
 much+0.046
e+0.046
 very+0.043
 many+0.043
 named+0.041
Neg logit contributions
rily-0.080
ware-0.072
ge-0.070
uing-0.069
bell-0.068
Token.
Token ID13
Feature activation+9.34e-01

Pos logit contributions
 much+0.046
e+0.044
 very+0.043
 many+0.042
 named+0.038
Neg logit contributions
rily-0.113
ware-0.103
ge-0.101
uing-0.101
bell-0.098
Token Everyone
Token ID11075
Feature activation+5.29e-01

Pos logit contributions
e+0.013
 much+0.013
 very+0.013
 many+0.012
 afraid+0.011
Neg logit contributions
rily-0.029
ware-0.026
ge-0.025
bell-0.025
uing-0.025
Token cheered
Token ID34767
Feature activation-2.24e+00

Pos logit contributions
 much+0.010
 many+0.009
e+0.009
 And+0.009
icks+0.009
Neg logit contributions
rily-0.014
fully-0.012
ly-0.012
ware-0.012
ge-0.012
Token and
Token ID290
Feature activation-1.82e-01

Pos logit contributions
rily+0.084
ware+0.075
ge+0.074
uing+0.074
bell+0.071
Neg logit contributions
 much-0.049
e-0.048
 very-0.047
 many-0.046
 named-0.043
Token it
Token ID340
Feature activation-2.87e-01

Pos logit contributions
rily+0.005
ware+0.005
uing+0.005
ge+0.005
bell+0.005
Neg logit contributions
 much-0.003
e-0.003
 very-0.003
 many-0.003
 named-0.003
Token always
Token ID1464
Feature activation-1.18e+00

Pos logit contributions
rily+0.008
ware+0.008
ge+0.007
uing+0.007
bell+0.007
Neg logit contributions
 much-0.005
e-0.005
 many-0.005
 very-0.005
 named-0.004
Token made
Token ID925
Feature activation+1.64e+00

Pos logit contributions
rily+0.034
ware+0.031
ge+0.030
uing+0.030
bell+0.029
Neg logit contributions
 much-0.020
e-0.020
 very-0.019
 many-0.019
 named-0.018
Token him
Token ID683
Feature activation+1.91e-01

Pos logit contributions
 much+0.026
e+0.026
 very+0.025
 many+0.025
 named+0.024
Neg logit contributions
rily-0.048
ware-0.043
uing-0.043
ge-0.042
fully-0.041
Token feel
Token ID1254
Feature activation+6.74e+00

Pos logit contributions
e+0.003
 much+0.003
 named+0.003
 many+0.003
 very+0.003
Neg logit contributions
rily-0.005
ge-0.005
uing-0.005
ware-0.005
bell-0.005
Token so
Token ID523
Feature activation+4.74e+00

Pos logit contributions
e+0.087
ow+0.082
 many+0.076
icks+0.074
 ost+0.074
Neg logit contributions
rily-0.210
uing-0.176
ge-0.176
fully-0.173
ly-0.172
Token lovely
Token ID14081
Feature activation+1.82e+00

Pos logit contributions
e+0.064
 named+0.057
ow+0.054
 many+0.053
 Fire+0.051
Neg logit contributions
rily-0.147
ge-0.132
uing-0.130
bell-0.126
ly-0.126
Token.
Token ID13
Feature activation+8.91e-01

Pos logit contributions
 much+0.022
 very+0.021
 many+0.019
e+0.019
 a+0.018
Neg logit contributions
rily-0.063
ware-0.059
uing-0.059
ge-0.058
bell-0.057
Token<|endoftext|>
Token ID50256
Feature activation+1.69e+00

Pos logit contributions
e+0.016
 afraid+0.016
 very+0.016
 scared+0.015
 much+0.015
Neg logit contributions
rily-0.021
ware-0.019
ro-0.019
ge-0.018
uing-0.018
TokenThere
Token ID1858
Feature activation-2.11e+00

Pos logit contributions
 much+0.021
e+0.020
 very+0.019
 many+0.018
 glad+0.018
Neg logit contributions
rily-0.056
ware-0.051
ge-0.050
uing-0.049
ro-0.048
Token no
Token ID645
Feature activation+1.26e-01

Pos logit contributions
e+0.003
aw+0.003
ow+0.003
 named+0.003
 much+0.003
Neg logit contributions
rily-0.004
uing-0.004
ware-0.004
ge-0.004
fully-0.004
Token one
Token ID530
Feature activation-1.46e+00

Pos logit contributions
 much+0.002
e+0.002
 very+0.002
 many+0.002
 named+0.002
Neg logit contributions
rily-0.004
ware-0.003
ge-0.003
uing-0.003
bell-0.003
Token ever
Token ID1683
Feature activation+9.93e-01

Pos logit contributions
rily+0.034
ware+0.032
ge+0.030
uing+0.030
bell+0.030
Neg logit contributions
 much-0.029
e-0.028
 many-0.028
 very-0.027
 glad-0.026
Token has
Token ID468
Feature activation+2.96e+00

Pos logit contributions
 much+0.017
e+0.016
 very+0.016
 many+0.016
 named+0.014
Neg logit contributions
rily-0.030
ware-0.027
uing-0.026
ge-0.026
bell-0.025
Token to
Token ID284
Feature activation-1.00e+00

Pos logit contributions
 much+0.045
e+0.042
 very+0.042
 many+0.041
 a+0.037
Neg logit contributions
rily-0.095
ware-0.085
uing-0.084
ge-0.084
bell-0.081
Token feel
Token ID1254
Feature activation+6.73e+00

Pos logit contributions
rily+0.030
ware+0.027
uing+0.026
ge+0.026
bell+0.025
Neg logit contributions
e-0.018
 much-0.017
 very-0.017
 many-0.016
 named-0.015
Token so
Token ID523
Feature activation+4.90e+00

Pos logit contributions
e+0.079
 much+0.070
 many+0.067
ow+0.065
 named+0.064
Neg logit contributions
rily-0.227
ware-0.200
ge-0.199
uing-0.196
bell-0.191
Token helpless
Token ID21144
Feature activation+1.09e-01

Pos logit contributions
e+0.056
 named+0.045
 much+0.045
 very+0.044
 many+0.043
Neg logit contributions
rily-0.169
ge-0.151
uing-0.150
ware-0.149
bell-0.148
Token and
Token ID290
Feature activation+3.20e+00

Pos logit contributions
 much+0.001
 very+0.001
 many+0.001
 a+0.001
 living+0.001
Neg logit contributions
rily-0.004
uing-0.004
ware-0.004
pron-0.004
ge-0.003
Token lonely
Token ID21757
Feature activation-1.25e+00

Pos logit contributions
e+0.039
 much+0.038
 many+0.035
 very+0.034
 named+0.033
Neg logit contributions
rily-0.106
ware-0.096
ge-0.096
uing-0.095
bell-0.092
Token again
Token ID757
Feature activation+2.15e+00

Pos logit contributions
rily+0.043
uing+0.040
ware+0.039
ge+0.039
bell+0.038
Neg logit contributions
 much-0.015
 very-0.015
e-0.014
 many-0.014
 a-0.013
Token friend
Token ID1545
Feature activation+1.07e+00

Pos logit contributions
e+0.014
 much+0.014
 many+0.013
 very+0.013
 named+0.012
Neg logit contributions
rily-0.023
ware-0.021
ge-0.021
uing-0.020
bell-0.020
Token and
Token ID290
Feature activation-6.16e-01

Pos logit contributions
 much+0.016
e+0.015
 many+0.015
 very+0.015
 glad+0.014
Neg logit contributions
rily-0.033
ware-0.030
ge-0.029
uing-0.029
bell-0.028
Token made
Token ID925
Feature activation+9.30e-01

Pos logit contributions
rily+0.018
ware+0.017
bell+0.016
fully+0.016
uing+0.016
Neg logit contributions
e-0.009
 much-0.009
 very-0.008
 many-0.008
ow-0.008
Token their
Token ID511
Feature activation-1.45e+00

Pos logit contributions
e+0.014
 named+0.013
 afraid+0.013
 much+0.012
 glad+0.012
Neg logit contributions
rily-0.028
uing-0.024
ware-0.024
fully-0.023
ly-0.023
Token home
Token ID1363
Feature activation+8.81e-02

Pos logit contributions
rily+0.036
ware+0.031
bell+0.031
ge+0.030
uing+0.030
Neg logit contributions
e-0.029
 much-0.029
aks-0.026
 afraid-0.026
aw-0.026
Token feel
Token ID1254
Feature activation+6.72e+00

Pos logit contributions
 much+0.001
 very+0.001
e+0.001
 many+0.001
 named+0.001
Neg logit contributions
rily-0.003
ware-0.003
ge-0.003
uing-0.003
bell-0.003
Token safe
Token ID3338
Feature activation+7.45e-01

Pos logit contributions
e+0.100
ow+0.092
 ost+0.081
aw+0.079
 many+0.078
Neg logit contributions
rily-0.201
uing-0.169
ware-0.168
bell-0.167
pron-0.167
Token and
Token ID290
Feature activation+3.03e+00

Pos logit contributions
 very+0.008
 much+0.008
 many+0.007
 a+0.007
e+0.007
Neg logit contributions
rily-0.026
uing-0.025
ware-0.025
ge-0.025
bell-0.024
Token happy
Token ID3772
Feature activation-1.15e-01

Pos logit contributions
 much+0.036
e+0.035
 very+0.033
 many+0.033
 named+0.030
Neg logit contributions
rily-0.103
ware-0.094
ge-0.093
uing-0.093
bell-0.090
Token.
Token ID13
Feature activation+1.58e+00

Pos logit contributions
rily+0.004
ware+0.004
ge+0.004
uing+0.004
bell+0.003
Neg logit contributions
 much-0.001
 very-0.001
 many-0.001
e-0.001
 named-0.001
Token<|endoftext|>
Token ID50256
Feature activation+2.40e+00

Pos logit contributions
e+0.033
 afraid+0.029
 very+0.028
 trapped+0.027
ow+0.027
Neg logit contributions
rily-0.034
ro-0.033
bell-0.032
oph-0.031
ware-0.031
Token We
Token ID775
Feature activation-1.40e+00

Pos logit contributions
 much+0.003
e+0.003
 very+0.003
 many+0.003
 afraid+0.003
Neg logit contributions
rily-0.008
ware-0.007
ge-0.007
uing-0.007
bell-0.007
Token can
Token ID460
Feature activation-3.86e+00

Pos logit contributions
rily+0.040
ware+0.036
ge+0.036
uing+0.036
bell+0.035
Neg logit contributions
 much-0.023
 many-0.021
e-0.020
 very-0.020
 glad-0.020
Token take
Token ID1011
Feature activation-1.65e-01

Pos logit contributions
rily+0.123
ware+0.111
ge+0.107
uing+0.104
yt+0.104
Neg logit contributions
e-0.061
 much-0.059
 very-0.058
 many-0.057
 a-0.053
Token breaks
Token ID9457
Feature activation-9.99e-01

Pos logit contributions
rily+0.005
ly+0.004
uing+0.004
ware+0.004
fully+0.004
Neg logit contributions
 named-0.002
 afraid-0.002
 much-0.002
e-0.002
 glad-0.002
Token in
Token ID287
Feature activation+3.93e+00

Pos logit contributions
rily+0.031
ware+0.028
uing+0.028
bell+0.028
ge+0.027
Neg logit contributions
 much-0.015
 very-0.015
 many-0.015
e-0.014
 a-0.014
Token between
Token ID1022
Feature activation+6.70e+00

Pos logit contributions
 much+0.036
e+0.035
 very+0.032
 many+0.031
 named+0.030
Neg logit contributions
rily-0.142
ware-0.131
ge-0.130
uing-0.128
bell-0.124
Token and
Token ID290
Feature activation+1.66e+00

Pos logit contributions
 much+0.067
e+0.065
 very+0.062
 many+0.058
 named+0.055
Neg logit contributions
rily-0.239
ware-0.219
ge-0.217
uing-0.216
bell-0.209
Token eventually
Token ID4191
Feature activation+1.78e+00

Pos logit contributions
 much+0.029
e+0.028
 very+0.028
 many+0.028
 named+0.025
Neg logit contributions
rily-0.048
ware-0.043
uing-0.042
ge-0.042
bell-0.040
Token make
Token ID787
Feature activation+1.02e+00

Pos logit contributions
 much+0.029
e+0.028
 very+0.027
 many+0.027
 named+0.025
Neg logit contributions
rily-0.053
ware-0.048
ge-0.047
uing-0.047
bell-0.045
Token it
Token ID340
Feature activation+4.31e-01

Pos logit contributions
e+0.013
 named+0.013
 much+0.012
 glad+0.012
 afraid+0.012
Neg logit contributions
rily-0.033
uing-0.029
ware-0.029
ge-0.028
fully-0.027
Token to
Token ID284
Feature activation-8.06e-01

Pos logit contributions
 much+0.007
 very+0.007
 many+0.007
e+0.007
 a+0.006
Neg logit contributions
rily-0.013
ware-0.012
uing-0.011
ge-0.011
bell-0.011
Token boots
Token ID14412
Feature activation-1.58e+00

Pos logit contributions
rily+0.058
ware+0.052
ge+0.051
uing+0.051
bell+0.048
Neg logit contributions
 much-0.035
 very-0.034
 many-0.033
e-0.033
 named-0.032
Token and
Token ID290
Feature activation-1.94e+00

Pos logit contributions
rily+0.056
ware+0.051
uing+0.050
ge+0.049
yt+0.047
Neg logit contributions
 very-0.020
 much-0.020
 many-0.019
 a-0.017
 named-0.017
Token went
Token ID1816
Feature activation-1.42e+00

Pos logit contributions
rily+0.058
ware+0.051
ge+0.051
uing+0.050
bell+0.047
Neg logit contributions
 much-0.034
 very-0.032
e-0.032
 many-0.032
 named-0.030
Token outside
Token ID2354
Feature activation+1.46e+00

Pos logit contributions
rily+0.051
ware+0.045
uing+0.044
ge+0.044
arded+0.042
Neg logit contributions
 much-0.018
 very-0.018
 many-0.017
 named-0.016
e-0.015
Token to
Token ID284
Feature activation-3.92e-01

Pos logit contributions
 much+0.020
 very+0.019
 many+0.018
e+0.018
 named+0.017
Neg logit contributions
rily-0.050
ware-0.045
uing-0.045
ge-0.044
bell-0.042
Token feel
Token ID1254
Feature activation+6.68e+00

Pos logit contributions
rily+0.011
ware+0.010
ge+0.010
uing+0.010
bell+0.009
Neg logit contributions
 much-0.007
e-0.007
 very-0.007
 many-0.007
 named-0.006
Token the
Token ID262
Feature activation+3.80e+00

Pos logit contributions
e+0.105
ow+0.090
 many+0.082
icks+0.080
aks+0.080
Neg logit contributions
rily-0.194
uing-0.169
ware-0.169
ge-0.166
fully-0.165
Token wet
Token ID9583
Feature activation-1.04e+00

Pos logit contributions
 much+0.058
e+0.058
 many+0.054
 very+0.054
 named+0.050
Neg logit contributions
rily-0.115
ware-0.104
ge-0.102
uing-0.102
bell-0.099
Token and
Token ID290
Feature activation+1.64e+00

Pos logit contributions
rily+0.034
ware+0.031
uing+0.031
ge+0.030
bell+0.028
Neg logit contributions
 much-0.015
 very-0.015
 many-0.014
e-0.013
 named-0.013
Token heavy
Token ID4334
Feature activation-1.23e+00

Pos logit contributions
e+0.023
 much+0.023
 many+0.021
 very+0.021
 named+0.020
Neg logit contributions
rily-0.052
ware-0.047
ge-0.046
uing-0.046
bell-0.045
Token rain
Token ID6290
Feature activation-1.12e+00

Pos logit contributions
rily+0.039
ware+0.035
uing+0.034
ge+0.034
bell+0.032
Neg logit contributions
 much-0.019
 very-0.018
e-0.018
 many-0.018
 named-0.016
Token you
Token ID345
Feature activation+8.37e-01

Pos logit contributions
rily+0.017
ware+0.015
ge+0.015
uing+0.015
fully+0.014
Neg logit contributions
e-0.010
 much-0.009
 very-0.009
 many-0.009
 named-0.009
Token need
Token ID761
Feature activation+7.67e-01

Pos logit contributions
 much+0.014
e+0.014
 many+0.013
 very+0.013
 named+0.012
Neg logit contributions
rily-0.024
ware-0.022
ge-0.022
uing-0.022
bell-0.021
Token to
Token ID284
Feature activation-1.49e+00

Pos logit contributions
 much+0.016
e+0.016
 very+0.015
 many+0.015
 named+0.014
Neg logit contributions
rily-0.019
ware-0.017
ge-0.017
uing-0.017
bell-0.016
Token rest
Token ID1334
Feature activation+2.25e-01

Pos logit contributions
rily+0.045
ware+0.041
ge+0.038
uing+0.038
bell+0.038
Neg logit contributions
 much-0.026
e-0.025
 very-0.025
 many-0.024
 named-0.022
Token to
Token ID284
Feature activation-6.54e-01

Pos logit contributions
 much+0.004
 very+0.003
 many+0.003
e+0.003
 glad+0.003
Neg logit contributions
rily-0.007
ware-0.006
ge-0.006
uing-0.006
bell-0.006
Token feel
Token ID1254
Feature activation+6.66e+00

Pos logit contributions
rily+0.021
ware+0.019
bell+0.018
uing+0.018
ge+0.018
Neg logit contributions
 much-0.011
e-0.010
 very-0.010
 many-0.010
 a-0.009
Token better
Token ID1365
Feature activation+1.06e+00

Pos logit contributions
e+0.093
ow+0.074
 much+0.074
 named+0.072
 many+0.070
Neg logit contributions
rily-0.212
ge-0.189
uing-0.189
ware-0.188
ly-0.182
Token and
Token ID290
Feature activation+2.41e+00

Pos logit contributions
 much+0.014
 very+0.014
 many+0.013
 a+0.012
e+0.011
Neg logit contributions
rily-0.037
ware-0.036
bell-0.035
uing-0.033
pron-0.033
Token keep
Token ID1394
Feature activation+1.30e+00

Pos logit contributions
 much+0.040
 very+0.038
 many+0.037
e+0.034
 a+0.033
Neg logit contributions
rily-0.077
ware-0.070
ge-0.067
uing-0.067
bell-0.066
Token doing
Token ID1804
Feature activation+7.06e-01

Pos logit contributions
 much+0.017
e+0.016
 many+0.015
 very+0.015
 named+0.014
Neg logit contributions
rily-0.043
ware-0.039
ge-0.039
uing-0.039
fully-0.037
Token what
Token ID644
Feature activation+7.33e-01

Pos logit contributions
e+0.009
 much+0.009
 very+0.008
 many+0.008
 named+0.008
Neg logit contributions
rily-0.023
ware-0.021
uing-0.021
ge-0.021
fully-0.020
Token longer
Token ID2392
Feature activation+1.28e+00

Pos logit contributions
rily+0.001
ly+0.001
ware+0.001
ge+0.001
uing+0.001
Neg logit contributions
 glad-0.000
 named-0.000
 afraid-0.000
 much-0.000
e-0.000
Token and
Token ID290
Feature activation+2.34e+00

Pos logit contributions
 much+0.017
 very+0.016
e+0.016
 many+0.015
 named+0.013
Neg logit contributions
rily-0.043
ware-0.039
uing-0.039
ge-0.038
bell-0.037
Token longer
Token ID2392
Feature activation+8.95e-01

Pos logit contributions
 much+0.037
e+0.036
 very+0.035
 many+0.035
 named+0.032
Neg logit contributions
rily-0.070
ware-0.064
ge-0.063
uing-0.062
bell-0.060
Token breaks
Token ID9457
Feature activation-2.68e-01

Pos logit contributions
 much+0.013
e+0.012
 very+0.012
 many+0.012
 named+0.011
Neg logit contributions
rily-0.029
ware-0.026
ge-0.026
uing-0.026
bell-0.025
Token in
Token ID287
Feature activation+3.75e+00

Pos logit contributions
rily+0.009
ware+0.008
uing+0.008
ge+0.008
bell+0.008
Neg logit contributions
 much-0.004
 very-0.004
 many-0.003
e-0.003
 a-0.003
Token between
Token ID1022
Feature activation+6.64e+00

Pos logit contributions
 much+0.042
e+0.040
 very+0.038
 many+0.037
 named+0.034
Neg logit contributions
rily-0.132
ware-0.121
ge-0.119
uing-0.118
bell-0.115
Token.
Token ID13
Feature activation+1.12e+00

Pos logit contributions
 much+0.062
e+0.058
 very+0.055
 many+0.055
 named+0.047
Neg logit contributions
rily-0.246
ware-0.227
ge-0.224
uing-0.223
bell-0.217
Token
Token ID220
Feature activation+7.45e-01

Pos logit contributions
e+0.018
 much+0.017
 very+0.016
 afraid+0.015
ow+0.015
Neg logit contributions
rily-0.033
ware-0.029
ro-0.029
bell-0.028
ge-0.027
Token
Token ID198
Feature activation-7.64e-01

Pos logit contributions
e+0.010
 much+0.009
 very+0.009
 many+0.009
 named+0.008
Neg logit contributions
rily-0.024
ware-0.022
ge-0.022
uing-0.022
bell-0.021
Token
Token ID198
Feature activation-1.52e+00

Pos logit contributions
rily+0.024
ge+0.022
ware+0.022
uing+0.022
bell+0.021
Neg logit contributions
 much-0.010
e-0.010
 very-0.009
 many-0.009
 afraid-0.008
TokenThe
Token ID464
Feature activation+2.30e-01

Pos logit contributions
rily+0.033
ge+0.030
uing+0.029
ro+0.029
ware+0.028
Neg logit contributions
 much-0.032
ow-0.032
e-0.031
 very-0.031
 glad-0.031
Token bit
Token ID1643
Feature activation+8.87e-01

Pos logit contributions
rily+0.065
ware+0.057
uing+0.055
ge+0.054
bell+0.052
Neg logit contributions
 much-0.059
 very-0.057
 many-0.054
e-0.053
 named-0.051
Token and
Token ID290
Feature activation+1.11e+00

Pos logit contributions
 much+0.013
 very+0.012
 many+0.012
e+0.011
 afraid+0.011
Neg logit contributions
rily-0.029
ware-0.027
uing-0.026
ge-0.026
pron-0.025
Token was
Token ID373
Feature activation+2.54e+00

Pos logit contributions
 much+0.020
e+0.019
 very+0.019
 many+0.019
 named+0.017
Neg logit contributions
rily-0.031
ware-0.028
ge-0.027
uing-0.027
bell-0.026
Token beginning
Token ID3726
Feature activation+2.24e+00

Pos logit contributions
e+0.034
 much+0.030
 And+0.030
 many+0.030
 lives+0.029
Neg logit contributions
rily-0.078
ge-0.068
uing-0.066
fully-0.065
bell-0.065
Token to
Token ID284
Feature activation-1.64e-01

Pos logit contributions
e+0.050
 much+0.048
 named+0.046
 very+0.045
 many+0.045
Neg logit contributions
rily-0.052
ge-0.046
ware-0.046
uing-0.044
bell-0.042
Token feel
Token ID1254
Feature activation+6.62e+00

Pos logit contributions
rily+0.005
ware+0.004
uing+0.004
ge+0.004
bell+0.004
Neg logit contributions
 much-0.003
 very-0.003
e-0.003
 many-0.003
 named-0.003
Token tired
Token ID10032
Feature activation-2.14e-01

Pos logit contributions
e+0.099
ow+0.083
aks+0.079
 named+0.077
inea+0.076
Neg logit contributions
rily-0.195
ge-0.163
ware-0.161
ly-0.158
ro-0.157
Token.
Token ID13
Feature activation+3.87e-01

Pos logit contributions
rily+0.008
uing+0.008
ware+0.008
pron+0.008
ge+0.007
Neg logit contributions
 much-0.002
 very-0.002
 a-0.002
 many-0.002
e-0.001
Token He
Token ID679
Feature activation-1.82e+00

Pos logit contributions
 much+0.006
 many+0.006
 very+0.006
e+0.006
 named+0.005
Neg logit contributions
rily-0.012
ware-0.011
ge-0.011
uing-0.010
bell-0.010
Token decided
Token ID3066
Feature activation-2.72e+00

Pos logit contributions
rily+0.050
ware+0.045
ge+0.044
uing+0.044
bell+0.042
Neg logit contributions
 much-0.034
e-0.033
 very-0.032
 many-0.032
 named-0.030
Token to
Token ID284
Feature activation-2.70e+00

Pos logit contributions
rily+0.085
ware+0.078
uing+0.076
ge+0.074
bell+0.072
Neg logit contributions
 much-0.048
 very-0.047
 many-0.046
 a-0.044
 glad-0.039
Token needs
Token ID2476
Feature activation+2.41e+00

Pos logit contributions
rily+0.003
ware+0.003
ge+0.002
uing+0.002
bell+0.002
Neg logit contributions
 much-0.001
 very-0.001
e-0.001
 many-0.001
 named-0.001
Token a
Token ID257
Feature activation-1.26e+00

Pos logit contributions
 much+0.037
 very+0.034
 many+0.034
e+0.032
 a+0.031
Neg logit contributions
rily-0.080
ware-0.071
ge-0.071
uing-0.071
bell-0.068
Token new
Token ID649
Feature activation-2.19e+00

Pos logit contributions
rily+0.043
ware+0.038
ge+0.037
uing+0.037
fully+0.036
Neg logit contributions
 much-0.020
 very-0.019
 many-0.017
 a-0.016
e-0.015
Token battery
Token ID6555
Feature activation+6.29e-01

Pos logit contributions
rily+0.069
uing+0.059
ware+0.058
ge+0.058
bell+0.055
Neg logit contributions
 much-0.038
 very-0.037
 many-0.035
 a-0.034
 named-0.033
Token to
Token ID284
Feature activation-1.35e+00

Pos logit contributions
 very+0.010
 much+0.009
e+0.009
 many+0.009
 a+0.008
Neg logit contributions
rily-0.020
uing-0.018
ge-0.018
ware-0.018
bell-0.017
Token feel
Token ID1254
Feature activation+6.60e+00

Pos logit contributions
rily+0.042
ware+0.037
uing+0.036
ge+0.035
yt+0.035
Neg logit contributions
 much-0.023
e-0.023
 very-0.022
 many-0.021
 a-0.021
Token refreshed
Token ID47193
Feature activation+1.22e+00

Pos logit contributions
 much+0.085
e+0.083
 many+0.078
 very+0.078
 named+0.072
Neg logit contributions
rily-0.218
ware-0.197
ge-0.194
uing-0.194
bell-0.187
Token too
Token ID1165
Feature activation+4.36e+00

Pos logit contributions
 much+0.013
 very+0.013
 a+0.012
 many+0.012
e+0.011
Neg logit contributions
ge-0.043
rily-0.043
ware-0.042
uing-0.041
bell-0.040
Token."
Token ID526
Feature activation+8.94e-01

Pos logit contributions
 much+0.080
 very+0.074
 many+0.073
e+0.072
 a+0.067
Neg logit contributions
rily-0.125
ware-0.117
uing-0.114
ge-0.112
bell-0.107
Token
Token ID220
Feature activation+9.80e-01

Pos logit contributions
e+0.013
 much+0.013
 very+0.012
 many+0.012
 afraid+0.011
Neg logit contributions
rily-0.027
ge-0.024
ware-0.024
uing-0.024
bell-0.023
Token
Token ID198
Feature activation-3.66e-01

Pos logit contributions
 much+0.012
e+0.011
 very+0.011
 many+0.011
 named+0.010
Neg logit contributions
rily-0.033
ware-0.030
ge-0.030
uing-0.030
bell-0.029
Token sang
Token ID25889
Feature activation-7.74e-01

Pos logit contributions
 much+0.012
e+0.011
 very+0.011
 many+0.011
 named+0.010
Neg logit contributions
rily-0.015
ware-0.013
ge-0.013
uing-0.013
fully-0.013
Token until
Token ID1566
Feature activation+2.66e+00

Pos logit contributions
rily+0.026
ware+0.025
ge+0.025
uing+0.024
bell+0.023
Neg logit contributions
 much-0.011
 very-0.010
 many-0.010
e-0.009
 a-0.009
Token she
Token ID673
Feature activation+3.17e-01

Pos logit contributions
e+0.039
 much+0.036
 afraid+0.034
 very+0.034
 named+0.034
Neg logit contributions
rily-0.081
ware-0.074
bell-0.071
ge-0.070
fully-0.070
Token began
Token ID2540
Feature activation+2.15e+00

Pos logit contributions
 much+0.005
e+0.005
 very+0.005
 many+0.005
 named+0.005
Neg logit contributions
rily-0.010
ware-0.009
ge-0.008
uing-0.008
bell-0.008
Token to
Token ID284
Feature activation+8.48e-02

Pos logit contributions
e+0.049
 much+0.049
 very+0.046
 many+0.045
 named+0.045
Neg logit contributions
rily-0.048
ware-0.042
ge-0.041
uing-0.040
bell-0.039
Token feel
Token ID1254
Feature activation+6.60e+00

Pos logit contributions
 much+0.001
e+0.001
 very+0.001
 many+0.001
 named+0.001
Neg logit contributions
rily-0.002
ware-0.002
uing-0.002
ge-0.002
bell-0.002
Token sleepy
Token ID42023
Feature activation-4.30e-01

Pos logit contributions
e+0.108
ow+0.094
 named+0.090
 ost+0.088
aks+0.087
Neg logit contributions
rily-0.192
ware-0.162
uing-0.158
bell-0.156
ge-0.155
Token.
Token ID13
Feature activation-4.99e-02

Pos logit contributions
rily+0.017
uing+0.016
ware+0.016
ge+0.016
pron+0.015
Neg logit contributions
 very-0.004
 much-0.004
 a-0.003
 many-0.003
e-0.003
Token Just
Token ID2329
Feature activation-6.97e-01

Pos logit contributions
rily+0.002
ware+0.001
ge+0.001
uing+0.001
bell+0.001
Neg logit contributions
 much-0.001
e-0.001
 very-0.001
 many-0.001
 named-0.001
Token as
Token ID355
Feature activation+4.13e+00

Pos logit contributions
rily+0.019
ware+0.017
ge+0.017
uing+0.017
bell+0.016
Neg logit contributions
 much-0.013
e-0.013
 very-0.012
 many-0.012
 named-0.012
Token she
Token ID673
Feature activation-3.56e-01

Pos logit contributions
e+0.070
aw+0.067
ow+0.062
aks+0.061
icks+0.061
Neg logit contributions
rily-0.113
ge-0.104
ware-0.101
bell-0.099
fully-0.098
Tokenmy
Token ID1820
Feature activation+5.04e-01

Pos logit contributions
rily+0.003
ware+0.003
ge+0.003
uing+0.003
bell+0.003
Neg logit contributions
 much-0.002
e-0.002
 very-0.002
 many-0.002
 named-0.002
Token's
Token ID338
Feature activation-1.87e+00

Pos logit contributions
 much+0.009
e+0.008
 very+0.008
 many+0.008
 named+0.008
Neg logit contributions
rily-0.015
ware-0.013
ge-0.013
uing-0.013
bell-0.013
Token leg
Token ID1232
Feature activation+4.35e-01

Pos logit contributions
rily+0.051
ware+0.046
uing+0.046
ge+0.045
bell+0.044
Neg logit contributions
 much-0.034
e-0.033
 many-0.031
 very-0.031
 named-0.030
Token started
Token ID2067
Feature activation+1.79e+00

Pos logit contributions
 much+0.007
e+0.006
 very+0.006
 many+0.006
 named+0.006
Neg logit contributions
rily-0.014
ware-0.012
ge-0.012
uing-0.012
bell-0.011
Token to
Token ID284
Feature activation+3.88e-01

Pos logit contributions
 much+0.035
 very+0.034
e+0.033
 many+0.033
 named+0.031
Neg logit contributions
rily-0.049
ware-0.044
uing-0.042
ge-0.042
bell-0.040
Token feel
Token ID1254
Feature activation+6.58e+00

Pos logit contributions
 much+0.007
 very+0.007
e+0.007
 many+0.006
 named+0.006
Neg logit contributions
rily-0.012
ware-0.010
uing-0.010
ge-0.010
fully-0.009
Token better
Token ID1365
Feature activation+6.13e-01

Pos logit contributions
e+0.110
ow+0.097
 named+0.088
 ost+0.087
 many+0.086
Neg logit contributions
rily-0.188
ware-0.161
uing-0.159
bell-0.155
ge-0.155
Token.
Token ID13
Feature activation+4.84e-01

Pos logit contributions
 very+0.005
 much+0.005
 many+0.004
 a+0.004
e+0.004
Neg logit contributions
rily-0.024
ware-0.023
bell-0.022
uing-0.022
ge-0.021
Token From
Token ID3574
Feature activation+2.01e+00

Pos logit contributions
e+0.009
 very+0.007
 much+0.007
 many+0.007
 afraid+0.007
Neg logit contributions
rily-0.014
bell-0.012
ware-0.012
ge-0.012
uing-0.011
Token that
Token ID326
Feature activation-1.88e+00

Pos logit contributions
 much+0.027
e+0.027
 many+0.024
 very+0.024
 afraid+0.022
Neg logit contributions
rily-0.065
ware-0.059
ge-0.058
uing-0.058
bell-0.056
Token day
Token ID1110
Feature activation+8.60e-01

Pos logit contributions
pron+0.043
fully+0.042
rily+0.042
ge+0.041
uing+0.040
Neg logit contributions
aw-0.037
icks-0.032
e-0.032
ow-0.032
 afraid-0.030
Token higher
Token ID2440
Feature activation+5.46e-01

Pos logit contributions
rily+0.012
ware+0.011
uing+0.011
ge+0.010
bell+0.010
Neg logit contributions
 much-0.005
 very-0.005
e-0.005
 many-0.005
 a-0.004
Token and
Token ID290
Feature activation+2.52e+00

Pos logit contributions
 very+0.008
 many+0.008
 much+0.008
 a+0.007
e+0.007
Neg logit contributions
rily-0.017
ware-0.016
uing-0.016
bell-0.016
pron-0.016
Token faster
Token ID5443
Feature activation-2.58e-01

Pos logit contributions
 much+0.049
 very+0.047
e+0.046
 many+0.046
 glad+0.042
Neg logit contributions
rily-0.069
ware-0.062
uing-0.060
ge-0.060
bell-0.058
Token than
Token ID621
Feature activation+1.77e+00

Pos logit contributions
rily+0.009
ware+0.008
bell+0.008
uing+0.008
pron+0.008
Neg logit contributions
 much-0.003
 very-0.003
 many-0.003
e-0.003
 a-0.003
Token ever
Token ID1683
Feature activation+2.12e+00

Pos logit contributions
 much+0.028
e+0.026
 very+0.026
 many+0.026
 named+0.023
Neg logit contributions
rily-0.055
ware-0.050
uing-0.049
ge-0.049
bell-0.047
Token before
Token ID878
Feature activation+6.58e+00

Pos logit contributions
 much+0.037
e+0.036
 very+0.036
 many+0.035
 afraid+0.032
Neg logit contributions
rily-0.062
ware-0.056
uing-0.055
ge-0.053
bell-0.053
Token.
Token ID13
Feature activation+6.30e-01

Pos logit contributions
 much+0.087
e+0.081
 very+0.080
 many+0.079
 named+0.071
Neg logit contributions
rily-0.219
ware-0.201
uing-0.198
ge-0.197
bell-0.190
Token So
Token ID1406
Feature activation+3.58e-01

Pos logit contributions
e+0.011
 much+0.011
 very+0.011
 afraid+0.010
 glad+0.010
Neg logit contributions
rily-0.017
ware-0.015
ro-0.015
bell-0.014
fully-0.014
Token it
Token ID340
Feature activation-3.23e-01

Pos logit contributions
e+0.006
 much+0.006
aks+0.006
 afraid+0.005
ow+0.005
Neg logit contributions
rily-0.010
fully-0.009
uing-0.009
ware-0.008
ly-0.008
Token hopped
Token ID45230
Feature activation-2.93e+00

Pos logit contributions
rily+0.010
ware+0.009
uing+0.009
ge+0.009
bell+0.009
Neg logit contributions
 much-0.005
 very-0.005
e-0.004
 many-0.004
 named-0.004
Token on
Token ID319
Feature activation+2.64e+00

Pos logit contributions
rily+0.088
ware+0.080
ge+0.079
uing+0.078
bell+0.075
Neg logit contributions
 much-0.049
 very-0.047
e-0.046
 many-0.046
 named-0.042
Token how
Token ID703
Feature activation+8.74e-01

Pos logit contributions
 much+0.018
e+0.017
 very+0.017
 many+0.017
 named+0.015
Neg logit contributions
rily-0.033
ware-0.030
ge-0.029
uing-0.029
bell-0.028
Token it
Token ID340
Feature activation-1.50e+00

Pos logit contributions
e+0.013
 much+0.011
 named+0.011
 very+0.011
 many+0.010
Neg logit contributions
rily-0.027
ge-0.024
uing-0.024
ware-0.023
fully-0.023
Token had
Token ID550
Feature activation+2.11e+00

Pos logit contributions
rily+0.044
ware+0.039
ge+0.039
uing+0.039
bell+0.037
Neg logit contributions
 much-0.025
e-0.025
 very-0.024
 many-0.024
 named-0.022
Token made
Token ID925
Feature activation+1.93e+00

Pos logit contributions
 much+0.030
e+0.029
 very+0.028
 many+0.028
 named+0.025
Neg logit contributions
rily-0.068
ware-0.062
ge-0.060
uing-0.060
bell-0.058
Token them
Token ID606
Feature activation+1.30e+00

Pos logit contributions
e+0.026
 much+0.025
 very+0.024
 many+0.023
 named+0.023
Neg logit contributions
rily-0.061
ware-0.055
uing-0.054
ge-0.054
fully-0.052
Token feel
Token ID1254
Feature activation+6.58e+00

Pos logit contributions
e+0.020
 much+0.020
 many+0.019
 very+0.019
 named+0.018
Neg logit contributions
rily-0.039
ge-0.035
ware-0.035
uing-0.035
bell-0.033
Token.
Token ID13
Feature activation+6.84e-01

Pos logit contributions
e+0.082
 much+0.076
 many+0.072
 very+0.069
 named+0.066
Neg logit contributions
rily-0.218
ware-0.195
ge-0.194
uing-0.194
bell-0.188
Token
Token ID198
Feature activation-6.34e-01

Pos logit contributions
e+0.011
 much+0.010
 very+0.010
 afraid+0.010
 scared+0.009
Neg logit contributions
rily-0.019
ware-0.017
bell-0.017
ro-0.017
fully-0.016
Token
Token ID198
Feature activation-1.50e+00

Pos logit contributions
rily+0.020
ge+0.018
ware+0.018
uing+0.018
fully+0.017
Neg logit contributions
e-0.009
 much-0.008
 very-0.008
 afraid-0.008
 many-0.007
TokenThe
Token ID464
Feature activation-2.37e-01

Pos logit contributions
rily+0.035
uing+0.030
ge+0.030
ware+0.030
bell+0.029
Neg logit contributions
 much-0.033
e-0.032
 very-0.032
 many-0.031
 glad-0.030
Token mill
Token ID3939
Feature activation-1.25e+00

Pos logit contributions
rily+0.006
ware+0.006
uing+0.005
 timer+0.005
is+0.005
Neg logit contributions
e-0.005
 much-0.004
ow-0.004
aw-0.004
 And-0.004
Token went
Token ID1816
Feature activation-1.51e+00

Pos logit contributions
 much+0.006
 very+0.006
 many+0.006
e+0.006
 a+0.006
Neg logit contributions
rily-0.011
ware-0.010
uing-0.010
ge-0.010
yt-0.010
Token on
Token ID319
Feature activation+4.37e+00

Pos logit contributions
rily+0.041
ware+0.037
ge+0.036
uing+0.036
bell+0.035
Neg logit contributions
 much-0.028
e-0.028
 very-0.026
 many-0.026
 named-0.025
Token he
Token ID339
Feature activation+4.18e-01

Pos logit contributions
e+0.058
 much+0.057
 glad+0.052
 afraid+0.052
 many+0.052
Neg logit contributions
rily-0.133
ware-0.125
ge-0.121
uing-0.118
bell-0.117
Token began
Token ID2540
Feature activation+1.69e+00

Pos logit contributions
 much+0.007
e+0.007
 very+0.007
 many+0.006
 named+0.006
Neg logit contributions
rily-0.013
ware-0.011
uing-0.011
ge-0.011
bell-0.010
Token to
Token ID284
Feature activation-6.57e-02

Pos logit contributions
 much+0.035
e+0.035
 many+0.033
 very+0.033
 glad+0.032
Neg logit contributions
rily-0.042
ware-0.037
ge-0.037
uing-0.036
bell-0.034
Token feel
Token ID1254
Feature activation+6.57e+00

Pos logit contributions
rily+0.002
ge+0.002
ware+0.002
uing+0.001
bell+0.001
Neg logit contributions
 much-0.001
e-0.001
 many-0.001
 very-0.001
 glad-0.001
Token better
Token ID1365
Feature activation+7.95e-01

Pos logit contributions
e+0.089
ow+0.070
 ost+0.070
aks+0.068
 named+0.066
Neg logit contributions
rily-0.202
ly-0.178
ware-0.176
uing-0.172
ge-0.172
Token.
Token ID13
Feature activation+5.08e-01

Pos logit contributions
 much+0.008
 very+0.008
e+0.007
 many+0.007
 a+0.006
Neg logit contributions
rily-0.029
ware-0.028
uing-0.027
bell-0.026
ge-0.026
Token He
Token ID679
Feature activation-1.66e+00

Pos logit contributions
 much+0.009
e+0.009
 very+0.008
 many+0.008
 named+0.008
Neg logit contributions
rily-0.014
ware-0.013
ge-0.013
uing-0.013
bell-0.012
Token and
Token ID290
Feature activation-1.73e+00

Pos logit contributions
rily+0.046
ware+0.042
ge+0.041
uing+0.040
bell+0.040
Neg logit contributions
 much-0.030
e-0.029
 many-0.028
 very-0.027
 glad-0.026
Token his
Token ID465
Feature activation-3.54e+00

Pos logit contributions
rily+0.056
ware+0.051
ge+0.050
uing+0.049
bell+0.048
Neg logit contributions
 much-0.024
e-0.023
 very-0.022
 many-0.022
 named-0.020
Token blanket
Token ID18447
Feature activation+1.41e-01

Pos logit contributions
rily+0.003
ge+0.003
uing+0.003
ware+0.003
ro+0.002
Neg logit contributions
 very-0.002
 much-0.002
 many-0.002
 a-0.002
e-0.002
Token around
Token ID1088
Feature activation+6.10e-01

Pos logit contributions
 very+0.002
 much+0.002
 many+0.002
e+0.002
 a+0.002
Neg logit contributions
rily-0.005
ge-0.004
ware-0.004
uing-0.004
yt-0.004
Token her
Token ID607
Feature activation-1.47e+00

Pos logit contributions
 much+0.006
e+0.006
 very+0.006
 many+0.006
 named+0.005
Neg logit contributions
rily-0.022
ware-0.020
ge-0.020
uing-0.020
bell-0.019
Token body
Token ID1767
Feature activation+5.27e-01

Pos logit contributions
rily+0.044
ware+0.040
ge+0.039
uing+0.039
bell+0.037
Neg logit contributions
 much-0.024
 very-0.024
e-0.024
 many-0.023
 named-0.021
Token to
Token ID284
Feature activation-1.78e+00

Pos logit contributions
 much+0.006
 very+0.006
e+0.006
 many+0.006
 a+0.005
Neg logit contributions
rily-0.018
ware-0.017
uing-0.017
ge-0.017
bell-0.016
Token feel
Token ID1254
Feature activation+6.57e+00

Pos logit contributions
rily+0.048
ware+0.043
ge+0.042
uing+0.041
yt+0.039
Neg logit contributions
e-0.037
 much-0.036
 very-0.036
 many-0.035
 named-0.033
Token extra
Token ID3131
Feature activation+2.04e+00

Pos logit contributions
e+0.088
 much+0.078
 many+0.075
ow+0.074
 named+0.073
Neg logit contributions
rily-0.215
ware-0.188
uing-0.186
ge-0.184
fully-0.179
Token cozy
Token ID37438
Feature activation+9.71e-01

Pos logit contributions
e+0.029
 much+0.029
 very+0.027
 many+0.027
 named+0.026
Neg logit contributions
rily-0.064
ware-0.057
uing-0.057
ge-0.057
bell-0.054
Token.
Token ID13
Feature activation+5.11e-01

Pos logit contributions
 very+0.011
 much+0.010
 many+0.009
e+0.009
 a+0.009
Neg logit contributions
rily-0.034
uing-0.033
ware-0.033
ge-0.032
bell-0.031
Token The
Token ID383
Feature activation-7.41e-02

Pos logit contributions
 much+0.006
 many+0.006
 very+0.006
e+0.006
 named+0.005
Neg logit contributions
rily-0.017
ge-0.016
ware-0.016
uing-0.016
bell-0.015
Token little
Token ID1310
Feature activation-7.63e-01

Pos logit contributions
rily+0.002
ge+0.002
ware+0.002
uing+0.002
fully+0.002
Neg logit contributions
 much-0.002
 very-0.001
 many-0.001
e-0.001
 named-0.001
Token and
Token ID290
Feature activation+1.70e+00

Pos logit contributions
 much+0.014
e+0.013
 very+0.013
 many+0.013
 named+0.011
Neg logit contributions
rily-0.031
ware-0.028
uing-0.027
ge-0.027
bell-0.026
Token the
Token ID262
Feature activation-3.55e-02

Pos logit contributions
 much+0.029
e+0.028
 very+0.027
 many+0.027
 named+0.025
Neg logit contributions
rily-0.049
ware-0.044
ge-0.043
uing-0.043
bell-0.041
Token girl
Token ID2576
Feature activation+1.88e+00

Pos logit contributions
rily+0.001
ware+0.001
ge+0.001
uing+0.001
bell+0.001
Neg logit contributions
 much-0.001
e-0.001
 many-0.001
 very-0.001
 named-0.001
Token began
Token ID2540
Feature activation+1.49e+00

Pos logit contributions
 much+0.033
e+0.032
 many+0.031
 very+0.031
 glad+0.029
Neg logit contributions
rily-0.053
ware-0.047
ge-0.047
uing-0.047
bell-0.045
Token to
Token ID284
Feature activation-2.47e-01

Pos logit contributions
 much+0.031
e+0.031
 very+0.030
 many+0.029
 named+0.028
Neg logit contributions
rily-0.037
ware-0.032
ge-0.032
uing-0.032
bell-0.030
Token feel
Token ID1254
Feature activation+6.57e+00

Pos logit contributions
rily+0.007
ware+0.006
uing+0.006
ge+0.006
fully+0.006
Neg logit contributions
 much-0.005
e-0.005
 very-0.005
 many-0.004
 named-0.004
Token sleepy
Token ID42023
Feature activation-7.61e-01

Pos logit contributions
e+0.099
ow+0.092
 ost+0.086
 named+0.080
inea+0.080
Neg logit contributions
rily-0.193
uing-0.157
ware-0.156
ly-0.156
ge-0.153
Token.
Token ID13
Feature activation+6.26e-01

Pos logit contributions
rily+0.029
ware+0.028
uing+0.028
ge+0.027
pron+0.026
Neg logit contributions
 very-0.007
 much-0.007
 a-0.006
 many-0.005
e-0.005
Token Mum
Token ID38367
Feature activation+1.57e+00

Pos logit contributions
e+0.009
 much+0.009
 very+0.009
 many+0.009
 named+0.008
Neg logit contributions
rily-0.020
ware-0.018
uing-0.017
ge-0.017
bell-0.017
Token put
Token ID1234
Feature activation-1.79e+00

Pos logit contributions
 much+0.030
e+0.028
 many+0.028
 very+0.028
 glad+0.026
Neg logit contributions
rily-0.043
ware-0.038
ge-0.037
uing-0.037
bell-0.036
Token the
Token ID262
Feature activation-4.40e-01

Pos logit contributions
rily+0.063
ware+0.057
uing+0.056
ge+0.055
fully+0.055
Neg logit contributions
 much-0.019
e-0.018
 many-0.016
 very-0.016
 named-0.016
Token swimming
Token ID14899
Feature activation-2.54e+00

Pos logit contributions
rily+0.023
ware+0.021
uing+0.020
ge+0.020
bell+0.019
Neg logit contributions
 much-0.010
e-0.010
 very-0.010
 many-0.009
 named-0.009
Token around
Token ID1088
Feature activation+5.80e-01

Pos logit contributions
rily+0.086
ware+0.080
ge+0.079
uing+0.079
bell+0.076
Neg logit contributions
 much-0.031
 very-0.030
e-0.029
 many-0.028
 named-0.026
Token her
Token ID607
Feature activation-1.36e+00

Pos logit contributions
 much+0.006
 very+0.006
e+0.006
 many+0.006
 named+0.005
Neg logit contributions
rily-0.020
ware-0.019
uing-0.019
ge-0.019
bell-0.018
Token.
Token ID13
Feature activation+4.95e-01

Pos logit contributions
rily+0.043
ware+0.039
ge+0.038
uing+0.038
bell+0.036
Neg logit contributions
 much-0.020
e-0.019
 very-0.019
 many-0.019
 named-0.017
Token The
Token ID383
Feature activation-5.05e-01

Pos logit contributions
 much+0.007
e+0.007
 very+0.007
 many+0.007
 named+0.006
Neg logit contributions
rily-0.016
ware-0.014
ge-0.014
uing-0.014
bell-0.014
Token mer
Token ID4017
Feature activation-1.10e+01

Pos logit contributions
rily+0.010
ge+0.009
ware+0.009
uing+0.009
bell+0.008
Neg logit contributions
 much-0.013
 very-0.013
e-0.013
 many-0.013
 named-0.012
Tokenmaid
Token ID23151
Feature activation-9.77e-01

Pos logit contributions
ge+0.186
 afford+0.174
pron+0.164
 checkout+0.154
side+0.153
Neg logit contributions
 very-0.276
e-0.265
 named-0.260
 scared-0.257
 lived-0.256
Token smiled
Token ID13541
Feature activation-3.54e-01

Pos logit contributions
rily+0.029
ware+0.026
ge+0.025
uing+0.025
bell+0.024
Neg logit contributions
 much-0.016
e-0.016
 very-0.016
 many-0.015
 named-0.015
Token and
Token ID290
Feature activation+5.63e-01

Pos logit contributions
rily+0.011
ware+0.011
ge+0.011
uing+0.010
bell+0.010
Neg logit contributions
 very-0.005
 much-0.005
 many-0.005
 a-0.005
e-0.004
Token said
Token ID531
Feature activation-1.42e+00

Pos logit contributions
 much+0.010
e+0.010
 very+0.010
 many+0.010
 glad+0.009
Neg logit contributions
rily-0.016
ware-0.014
ge-0.014
uing-0.014
bell-0.013
Token,
Token ID11
Feature activation-2.67e+00

Pos logit contributions
rily+0.049
ware+0.045
ge+0.044
uing+0.044
bell+0.042
Neg logit contributions
 much-0.017
e-0.016
 very-0.016
 many-0.016
 named-0.014
Token again
Token ID757
Feature activation+1.57e+00

Pos logit contributions
rily+0.057
ware+0.052
ge+0.051
uing+0.050
yt+0.048
Neg logit contributions
 much-0.025
 very-0.023
 many-0.023
e-0.022
 a-0.020
Token.
Token ID13
Feature activation+1.93e-01

Pos logit contributions
 much+0.020
 very+0.018
e+0.018
 many+0.018
 named+0.016
Neg logit contributions
rily-0.054
ware-0.050
uing-0.049
ge-0.048
bell-0.047
Token He
Token ID679
Feature activation-1.66e+00

Pos logit contributions
 much+0.003
e+0.003
 very+0.003
 many+0.003
 glad+0.003
Neg logit contributions
rily-0.006
ware-0.005
uing-0.005
ge-0.005
bell-0.005
Token said
Token ID531
Feature activation-1.87e+00

Pos logit contributions
rily+0.050
ware+0.045
ge+0.044
uing+0.044
bell+0.042
Neg logit contributions
 much-0.027
e-0.026
 very-0.026
 many-0.026
 named-0.024
Token the
Token ID262
Feature activation-5.18e-01

Pos logit contributions
rily+0.056
ware+0.051
ge+0.050
uing+0.049
bell+0.048
Neg logit contributions
 much-0.031
e-0.029
 very-0.029
 many-0.029
 named-0.026
Token mer
Token ID4017
Feature activation-1.07e+01

Pos logit contributions
rily+0.015
ware+0.013
uing+0.013
ge+0.013
bell+0.013
Neg logit contributions
e-0.009
 much-0.009
 many-0.008
 very-0.008
 named-0.008
Tokenmaid
Token ID23151
Feature activation-8.45e-01

Pos logit contributions
ge+0.199
 afford+0.168
side+0.162
 checkout+0.162
pron+0.154
Neg logit contributions
 very-0.257
e-0.244
 lives-0.238
ow-0.234
 afraid-0.231
Token and
Token ID290
Feature activation-1.78e+00

Pos logit contributions
rily+0.023
ware+0.021
uing+0.020
ge+0.020
bell+0.020
Neg logit contributions
 much-0.015
e-0.014
 many-0.014
 glad-0.013
 very-0.013
Token a
Token ID257
Feature activation-3.49e+00

Pos logit contributions
rily+0.053
ware+0.049
ge+0.047
uing+0.047
fully+0.046
Neg logit contributions
 much-0.028
e-0.028
 very-0.026
 many-0.025
 glad-0.024
Token dolphin
Token ID44136
Feature activation-1.77e-01

Pos logit contributions
rily+0.097
ware+0.089
bell+0.086
uing+0.086
ge+0.085
Neg logit contributions
e-0.061
 much-0.056
 many-0.053
ow-0.052
 named-0.051
Token were
Token ID547
Feature activation+2.11e+00

Pos logit contributions
rily+0.006
ware+0.005
ge+0.005
uing+0.005
bell+0.005
Neg logit contributions
 much-0.003
e-0.003
 very-0.003
 many-0.002
 named-0.002
Token made
Token ID925
Feature activation+3.30e-01

Pos logit contributions
rily+0.028
ware+0.025
ge+0.025
uing+0.024
bell+0.023
Neg logit contributions
 much-0.019
e-0.019
 very-0.018
 many-0.018
 named-0.017
Token.
Token ID13
Feature activation+6.25e-01

Pos logit contributions
e+0.005
 much+0.004
 afraid+0.004
 named+0.004
 glad+0.004
Neg logit contributions
rily-0.010
uing-0.009
ware-0.009
fully-0.009
ge-0.009
Token
Token ID220
Feature activation+7.73e-01

Pos logit contributions
e+0.010
 much+0.010
 very+0.010
 afraid+0.010
 many+0.010
Neg logit contributions
rily-0.018
ware-0.015
ro-0.015
bell-0.015
uing-0.015
Token
Token ID198
Feature activation-3.57e-01

Pos logit contributions
 much+0.010
e+0.010
 very+0.009
 many+0.009
 afraid+0.008
Neg logit contributions
rily-0.025
uing-0.023
ware-0.023
ge-0.022
bell-0.022
TokenThe
Token ID464
Feature activation-1.94e-01

Pos logit contributions
rily+0.012
ware+0.010
uing+0.010
ge+0.010
fully+0.010
Neg logit contributions
 much-0.005
e-0.004
 very-0.004
 many-0.004
 afraid-0.004
Token mer
Token ID4017
Feature activation-1.07e+01

Pos logit contributions
rily+0.006
ware+0.005
uing+0.005
ge+0.005
bell+0.005
Neg logit contributions
e-0.003
 much-0.003
 many-0.003
 very-0.003
ow-0.002
Tokenmaid
Token ID23151
Feature activation-5.46e-01

Pos logit contributions
ge+0.173
 afford+0.154
pron+0.145
side+0.143
bell+0.139
Neg logit contributions
 very-0.257
e-0.257
 lives-0.253
aw-0.247
 best-0.245
Token learnt
Token ID26338
Feature activation+3.09e-01

Pos logit contributions
rily+0.015
ware+0.014
ge+0.014
uing+0.014
bell+0.013
Neg logit contributions
 much-0.010
e-0.010
 very-0.009
 many-0.009
 named-0.009
Token that
Token ID326
Feature activation-1.65e+00

Pos logit contributions
e+0.003
 much+0.003
 very+0.003
 many+0.003
 named+0.003
Neg logit contributions
rily-0.011
ware-0.010
uing-0.010
ge-0.010
fully-0.010
Token taking
Token ID2263
Feature activation-3.23e-01

Pos logit contributions
rily+0.044
ware+0.040
ge+0.040
uing+0.040
fully+0.039
Neg logit contributions
e-0.028
aw-0.026
ow-0.025
 much-0.025
 named-0.025
Token risks
Token ID7476
Feature activation-4.54e-01

Pos logit contributions
rily+0.009
uing+0.008
ly+0.008
ware+0.008
ge+0.008
Neg logit contributions
 much-0.005
e-0.005
 named-0.005
 glad-0.005
 very-0.005
Token castle
Token ID16669
Feature activation-1.16e+00

Pos logit contributions
rily+0.055
uing+0.049
ge+0.049
ware+0.049
bell+0.046
Neg logit contributions
 very-0.039
 much-0.039
e-0.038
 many-0.037
 a-0.034
Token so
Token ID523
Feature activation+3.32e+00

Pos logit contributions
rily+0.042
ware+0.038
uing+0.037
ge+0.036
ory+0.035
Neg logit contributions
 very-0.015
 much-0.015
 a-0.013
 many-0.012
e-0.012
Token much
Token ID881
Feature activation+2.75e+00

Pos logit contributions
 much+0.047
e+0.045
 very+0.044
 many+0.043
 named+0.040
Neg logit contributions
rily-0.107
ware-0.097
ge-0.095
uing-0.095
bell-0.092
Token.
Token ID13
Feature activation-5.10e-01

Pos logit contributions
 much+0.034
 very+0.033
 a+0.029
 many+0.029
 scared+0.027
Neg logit contributions
rily-0.101
uing-0.092
ge-0.090
ware-0.089
arded-0.085
Token The
Token ID383
Feature activation-4.32e-01

Pos logit contributions
rily+0.019
ge+0.019
uing+0.018
ware+0.018
yt+0.018
Neg logit contributions
 much-0.004
 very-0.004
 many-0.004
 And-0.004
 a-0.004
Token mer
Token ID4017
Feature activation-1.07e+01

Pos logit contributions
rily+0.009
ge+0.008
uing+0.008
ware+0.008
bell+0.007
Neg logit contributions
 very-0.011
 much-0.011
 many-0.011
e-0.011
 named-0.010
Tokenmaid
Token ID23151
Feature activation-9.72e-01

Pos logit contributions
ge+0.174
 afford+0.146
pron+0.142
uster+0.133
rily+0.126
Neg logit contributions
 very-0.283
e-0.279
 lives-0.271
 lived-0.264
 afraid-0.262
Token said
Token ID531
Feature activation-1.68e+00

Pos logit contributions
rily+0.029
ware+0.026
ge+0.026
uing+0.026
ro+0.025
Neg logit contributions
 very-0.016
 much-0.016
e-0.015
 many-0.015
 named-0.015
Token,
Token ID11
Feature activation-2.89e+00

Pos logit contributions
rily+0.056
ware+0.051
ge+0.050
uing+0.050
bell+0.048
Neg logit contributions
 much-0.022
e-0.021
 very-0.020
 many-0.020
 named-0.018
Token "
Token ID366
Feature activation-3.39e+00

Pos logit contributions
rily+0.092
ge+0.086
ware+0.085
uing+0.083
yt+0.081
Neg logit contributions
 very-0.044
 much-0.043
 many-0.040
 a-0.038
 glad-0.037
TokenI
Token ID40
Feature activation-7.67e-01

Pos logit contributions
rily+0.105
ware+0.095
uing+0.095
ge+0.091
bell+0.088
Neg logit contributions
e-0.054
 much-0.053
 many-0.049
 very-0.049
 a-0.045
Token water
Token ID1660
Feature activation+3.71e-01

Pos logit contributions
e+0.029
 much+0.028
 very+0.026
ow+0.026
 many+0.025
Neg logit contributions
rily-0.044
ware-0.040
ge-0.038
bell-0.037
ly-0.037
Token.
Token ID13
Feature activation+5.70e-01

Pos logit contributions
 much+0.004
 very+0.004
 many+0.004
e+0.004
 named+0.003
Neg logit contributions
rily-0.013
ge-0.012
uing-0.012
ware-0.012
bell-0.012
Token
Token ID198
Feature activation-7.29e-01

Pos logit contributions
 much+0.007
 many+0.006
 very+0.006
e+0.006
 glad+0.005
Neg logit contributions
rily-0.020
ware-0.018
ge-0.018
uing-0.018
bell-0.017
Token
Token ID198
Feature activation-1.66e+00

Pos logit contributions
rily+0.026
ware+0.024
ge+0.023
uing+0.023
bell+0.022
Neg logit contributions
 much-0.008
e-0.008
 very-0.007
 many-0.007
 named-0.006
TokenThe
Token ID464
Feature activation-4.98e-01

Pos logit contributions
rily+0.050
ware+0.046
ge+0.043
uing+0.042
yt+0.041
Neg logit contributions
 much-0.030
 many-0.029
e-0.028
 very-0.028
 a-0.025
Token mer
Token ID4017
Feature activation-1.06e+01

Pos logit contributions
rily+0.012
ware+0.010
uing+0.010
ge+0.010
bell+0.009
Neg logit contributions
 much-0.011
e-0.011
 many-0.011
 very-0.011
 named-0.010
Tokenmaid
Token ID23151
Feature activation-4.97e-01

Pos logit contributions
ge+0.176
pron+0.155
 afford+0.142
bell+0.141
rily+0.140
Neg logit contributions
 very-0.266
e-0.262
 lives-0.259
aw-0.250
 named-0.244
Token smiled
Token ID13541
Feature activation-3.25e-01

Pos logit contributions
rily+0.014
ware+0.013
ge+0.013
uing+0.012
bell+0.012
Neg logit contributions
 much-0.009
e-0.009
 very-0.009
 many-0.008
 named-0.008
Token at
Token ID379
Feature activation+3.95e-01

Pos logit contributions
rily+0.010
ware+0.009
ge+0.009
uing+0.009
bell+0.009
Neg logit contributions
 much-0.005
 very-0.005
 many-0.005
e-0.004
 a-0.004
Token Melody
Token ID38906
Feature activation+2.58e+00

Pos logit contributions
 much+0.006
e+0.006
 many+0.006
 very+0.006
 a+0.005
Neg logit contributions
rily-0.012
ware-0.011
ge-0.011
uing-0.011
bell-0.010
Token.
Token ID13
Feature activation-3.63e-01

Pos logit contributions
 much+0.044
e+0.043
 very+0.041
 many+0.041
 afraid+0.039
Neg logit contributions
rily-0.074
ware-0.066
ge-0.065
uing-0.065
fully-0.062
Token demands
Token ID8665
Feature activation-1.44e+00

Pos logit contributions
rily+0.071
ware+0.064
ge+0.063
uing+0.063
bell+0.060
Neg logit contributions
 much-0.041
e-0.039
 very-0.039
 many-0.038
 named-0.035
Token,
Token ID11
Feature activation-2.37e+00

Pos logit contributions
rily+0.041
ware+0.037
ge+0.036
uing+0.036
bell+0.034
Neg logit contributions
 much-0.026
e-0.025
 very-0.024
 many-0.024
 named-0.022
Token crab
Token ID32202
Feature activation+4.71e-01

Pos logit contributions
rily+0.069
ware+0.063
ge+0.062
uing+0.062
bell+0.059
Neg logit contributions
 much-0.041
 very-0.039
e-0.038
 many-0.038
 named-0.035
Token,"
Token ID553
Feature activation-4.06e-01

Pos logit contributions
 much+0.009
e+0.008
 many+0.008
 very+0.008
 glad+0.008
Neg logit contributions
rily-0.013
ge-0.011
ware-0.011
uing-0.011
bell-0.011
Token the
Token ID262
Feature activation-9.11e-01

Pos logit contributions
rily+0.012
ware+0.010
ly+0.010
ge+0.010
fully+0.010
Neg logit contributions
 much-0.007
e-0.007
 many-0.006
 glad-0.006
 very-0.006
Token mer
Token ID4017
Feature activation-1.06e+01

Pos logit contributions
rily+0.023
ge+0.021
uing+0.021
ware+0.020
bell+0.019
Neg logit contributions
 very-0.019
 much-0.019
 many-0.019
e-0.018
 glad-0.018
Tokenmaid
Token ID23151
Feature activation-2.29e-01

Pos logit contributions
pron+0.144
ge+0.136
side+0.126
 afford+0.125
bell+0.124
Neg logit contributions
 very-0.287
e-0.283
 afraid-0.274
 scared-0.273
ow-0.270
Token said
Token ID531
Feature activation-1.74e+00

Pos logit contributions
rily+0.006
ware+0.006
ge+0.006
uing+0.006
bell+0.005
Neg logit contributions
 much-0.004
e-0.004
 very-0.004
 many-0.004
 named-0.004
Token.
Token ID13
Feature activation+1.66e-01

Pos logit contributions
rily+0.056
ware+0.050
ge+0.050
uing+0.049
bell+0.048
Neg logit contributions
 much-0.024
e-0.024
 many-0.023
 very-0.022
 named-0.021
Token "
Token ID366
Feature activation-3.52e+00

Pos logit contributions
 much+0.002
e+0.002
 very+0.002
 many+0.002
 named+0.002
Neg logit contributions
rily-0.005
ware-0.005
ge-0.005
uing-0.005
bell-0.005
TokenI
Token ID40
Feature activation-1.15e+00

Pos logit contributions
rily+0.108
ware+0.100
uing+0.098
ge+0.096
yt+0.095
Neg logit contributions
e-0.058
 much-0.055
 very-0.051
 many-0.050
 a-0.047
Token yes
Token ID3763
Feature activation-3.69e+00

Pos logit contributions
rily+0.049
ware+0.044
ge+0.044
uing+0.043
bell+0.042
Neg logit contributions
 much-0.028
 very-0.027
 many-0.026
e-0.026
 glad-0.024
Token.
Token ID13
Feature activation-6.50e-01

Pos logit contributions
ware+0.143
rily+0.143
ge+0.130
pron+0.128
uing+0.128
Neg logit contributions
 very-0.039
 much-0.036
 a-0.027
 many-0.026
e-0.023
Token She
Token ID1375
Feature activation-2.24e+00

Pos logit contributions
rily+0.023
ge+0.022
ware+0.022
uing+0.021
yt+0.020
Neg logit contributions
 many-0.007
 much-0.007
 very-0.007
 And-0.007
 a-0.006
Token followed
Token ID3940
Feature activation-3.44e+00

Pos logit contributions
rily+0.067
ge+0.062
ware+0.061
uing+0.060
fully+0.055
Neg logit contributions
 very-0.038
 much-0.037
 named-0.035
 many-0.035
e-0.035
Token the
Token ID262
Feature activation-8.35e-01

Pos logit contributions
rily+0.129
ware+0.120
uing+0.117
ge+0.117
bell+0.113
Neg logit contributions
 much-0.031
e-0.029
 very-0.028
 many-0.028
 named-0.023
Token mer
Token ID4017
Feature activation-1.04e+01

Pos logit contributions
rily+0.023
ge+0.020
uing+0.020
ware+0.020
bell+0.019
Neg logit contributions
 much-0.016
 very-0.016
 many-0.016
e-0.015
 glad-0.015
Tokenmaid
Token ID23151
Feature activation-2.00e-01

Pos logit contributions
ge+0.171
pron+0.159
 afford+0.148
side+0.143
bell+0.137
Neg logit contributions
 very-0.253
e-0.251
 lives-0.239
 named-0.230
ks-0.226
Token through
Token ID832
Feature activation+2.39e+00

Pos logit contributions
rily+0.006
ware+0.005
uing+0.005
ge+0.005
bell+0.005
Neg logit contributions
 much-0.003
 very-0.003
e-0.003
 many-0.003
 named-0.003
Token the
Token ID262
Feature activation+1.95e+00

Pos logit contributions
 much+0.018
e+0.018
 very+0.016
 glad+0.015
 many+0.014
Neg logit contributions
rily-0.090
ware-0.083
ge-0.082
uing-0.080
bell-0.080
Token hidden
Token ID7104
Feature activation-3.56e+00

Pos logit contributions
e+0.033
 much+0.032
ow+0.029
 glad+0.029
 many+0.029
Neg logit contributions
rily-0.056
ware-0.051
bell-0.049
ge-0.048
uing-0.048
Token door
Token ID3420
Feature activation+1.41e+00

Pos logit contributions
rily+0.098
ware+0.087
ge+0.087
uing+0.086
bell+0.082
Neg logit contributions
 much-0.067
e-0.064
 very-0.064
 many-0.063
 named-0.059
Token was
Token ID373
Feature activation+1.17e+00

Pos logit contributions
rily+0.066
ware+0.060
ge+0.059
uing+0.058
bell+0.056
Neg logit contributions
 much-0.040
e-0.039
 very-0.038
 many-0.038
 named-0.036
Token so
Token ID523
Feature activation+4.36e+00

Pos logit contributions
e+0.013
 many+0.012
 much+0.012
aw+0.011
 lives+0.011
Neg logit contributions
rily-0.038
ge-0.033
uing-0.033
bell-0.033
fully-0.032
Token thankful
Token ID25535
Feature activation-1.78e+00

Pos logit contributions
e+0.053
 much+0.048
 many+0.046
 named+0.044
aw+0.044
Neg logit contributions
rily-0.143
uing-0.126
ge-0.126
bell-0.125
ware-0.123
Token that
Token ID326
Feature activation-8.44e-01

Pos logit contributions
rily+0.060
ware+0.055
ge+0.054
uing+0.054
bell+0.052
Neg logit contributions
 much-0.024
 very-0.022
 many-0.022
e-0.021
 named-0.019
Token the
Token ID262
Feature activation+1.42e-02

Pos logit contributions
rily+0.022
ware+0.020
ge+0.019
uing+0.019
fully+0.018
Neg logit contributions
e-0.016
 much-0.015
aw-0.014
 very-0.014
ow-0.014
Token mer
Token ID4017
Feature activation-1.03e+01

Pos logit contributions
 much+0.000
e+0.000
 very+0.000
 many+0.000
 named+0.000
Neg logit contributions
rily-0.000
ware-0.000
ge-0.000
uing-0.000
bell-0.000
Tokenmaid
Token ID23151
Feature activation-8.15e-01

Pos logit contributions
ge+0.156
pron+0.142
 afford+0.138
bell+0.136
arded+0.135
Neg logit contributions
 very-0.264
 lives-0.256
e-0.250
 best-0.240
 glad-0.238
Token had
Token ID550
Feature activation+1.08e+00

Pos logit contributions
rily+0.024
ware+0.021
ge+0.021
uing+0.021
bell+0.020
Neg logit contributions
 much-0.014
e-0.013
 very-0.013
 many-0.013
 named-0.012
Token forgiven
Token ID32350
Feature activation-1.47e+00

Pos logit contributions
 much+0.017
e+0.016
 very+0.016
 many+0.016
 named+0.014
Neg logit contributions
rily-0.034
ware-0.031
ge-0.030
uing-0.030
bell-0.029
Token him
Token ID683
Feature activation-2.91e-01

Pos logit contributions
rily+0.048
ware+0.044
ge+0.043
uing+0.043
bell+0.042
Neg logit contributions
 much-0.020
e-0.020
 very-0.019
 many-0.019
 named-0.017
Token.
Token ID13
Feature activation+4.43e-01

Pos logit contributions
rily+0.009
ware+0.009
ge+0.008
uing+0.008
bell+0.008
Neg logit contributions
 much-0.004
e-0.004
 very-0.004
 many-0.004
 named-0.003
Token
Token ID198
Feature activation-1.07e+00

Pos logit contributions
 much+0.011
 many+0.010
 very+0.010
 And+0.010
 a+0.010
Neg logit contributions
rily-0.018
ge-0.017
ware-0.016
uing-0.016
yt-0.015
Token
Token ID198
Feature activation-1.92e+00

Pos logit contributions
rily+0.042
ware+0.040
ge+0.039
uing+0.039
bell+0.037
Neg logit contributions
 much-0.009
 many-0.008
 very-0.008
e-0.007
 And-0.006
TokenJack
Token ID14295
Feature activation+2.23e-01

Pos logit contributions
rily+0.048
ware+0.044
ge+0.041
uing+0.040
bell+0.038
Neg logit contributions
 much-0.045
 many-0.042
 very-0.042
e-0.042
 a-0.039
Token asked
Token ID1965
Feature activation-2.73e+00

Pos logit contributions
 much+0.004
e+0.004
 very+0.004
 many+0.004
 named+0.003
Neg logit contributions
rily-0.006
ware-0.006
ge-0.006
uing-0.006
bell-0.005
Token the
Token ID262
Feature activation-8.03e-01

Pos logit contributions
rily+0.108
ware+0.100
ge+0.099
uing+0.098
bell+0.096
Neg logit contributions
 much-0.020
 very-0.017
 many-0.017
e-0.017
 named-0.014
Token mer
Token ID4017
Feature activation-1.03e+01

Pos logit contributions
rily+0.018
ge+0.017
uing+0.016
ware+0.015
arded+0.014
Neg logit contributions
 very-0.020
 much-0.019
 many-0.019
 a-0.018
 named-0.018
Tokenmaid
Token ID23151
Feature activation-5.60e-01

Pos logit contributions
ge+0.177
 afford+0.163
pron+0.157
 checkout+0.157
side+0.154
Neg logit contributions
 very-0.250
e-0.239
ow-0.229
ks-0.227
 afraid-0.227
Token why
Token ID1521
Feature activation+9.24e-02

Pos logit contributions
rily+0.017
ware+0.015
uing+0.015
ge+0.015
bell+0.014
Neg logit contributions
e-0.009
 much-0.009
 many-0.008
 very-0.008
 glad-0.007
Token the
Token ID262
Feature activation-3.77e-01

Pos logit contributions
 much+0.002
e+0.001
 very+0.001
 many+0.001
 named+0.001
Neg logit contributions
rily-0.003
ware-0.002
ge-0.002
uing-0.002
bell-0.002
Token waves
Token ID9813
Feature activation-1.62e+00

Pos logit contributions
rily+0.009
ge+0.008
uing+0.008
ware+0.007
arded+0.007
Neg logit contributions
 very-0.009
 much-0.009
 many-0.009
e-0.008
 afraid-0.008
Token were
Token ID547
Feature activation+2.40e+00

Pos logit contributions
rily+0.052
ware+0.047
uing+0.046
ge+0.046
yt+0.043
Neg logit contributions
 much-0.024
 very-0.024
e-0.023
 many-0.021
 named-0.021
Token.
Token ID13
Feature activation+6.68e-01

Pos logit contributions
 much+0.005
e+0.005
 very+0.005
 many+0.004
 named+0.004
Neg logit contributions
rily-0.012
ware-0.011
ge-0.011
uing-0.011
bell-0.011
Token He
Token ID679
Feature activation-1.51e+00

Pos logit contributions
e+0.010
 much+0.010
 very+0.010
 afraid+0.009
 many+0.009
Neg logit contributions
rily-0.020
ware-0.018
ro-0.017
bell-0.017
uing-0.017
Token even
Token ID772
Feature activation+2.36e-01

Pos logit contributions
rily+0.045
ware+0.041
ge+0.040
uing+0.040
bell+0.039
Neg logit contributions
 much-0.025
e-0.024
 very-0.023
 many-0.023
 named-0.021
Token saw
Token ID2497
Feature activation+4.25e-01

Pos logit contributions
 much+0.004
e+0.004
 very+0.004
 many+0.004
 named+0.003
Neg logit contributions
rily-0.007
ware-0.007
uing-0.006
ge-0.006
bell-0.006
Token a
Token ID257
Feature activation-1.93e+00

Pos logit contributions
 much+0.004
e+0.004
 very+0.003
 many+0.003
 glad+0.003
Neg logit contributions
rily-0.016
ware-0.014
ge-0.014
uing-0.014
bell-0.014
Token mer
Token ID4017
Feature activation-1.02e+01

Pos logit contributions
rily+0.060
ware+0.054
ge+0.053
uing+0.053
fully+0.051
Neg logit contributions
 much-0.031
 very-0.029
 many-0.028
e-0.028
 named-0.026
Tokenmaid
Token ID23151
Feature activation-5.39e-01

Pos logit contributions
ge+0.166
 afford+0.134
ware+0.134
arded+0.133
side+0.127
Neg logit contributions
 very-0.278
e-0.259
 scared-0.249
 afraid-0.244
 named-0.243
Token!
Token ID0
Feature activation+9.93e-01

Pos logit contributions
rily+0.017
ware+0.015
uing+0.015
ge+0.015
bell+0.014
Neg logit contributions
 much-0.008
e-0.008
 very-0.008
 many-0.007
 named-0.007
Token
Token ID220
Feature activation+5.51e-01

Pos logit contributions
e+0.015
 much+0.014
 very+0.014
 many+0.013
 afraid+0.012
Neg logit contributions
rily-0.030
ware-0.027
bell-0.027
ro-0.026
ge-0.026
Token
Token ID198
Feature activation-7.76e-01

Pos logit contributions
e+0.007
 much+0.007
 very+0.007
 many+0.006
 named+0.006
Neg logit contributions
rily-0.018
ware-0.016
uing-0.016
ge-0.016
bell-0.015
Token
Token ID198
Feature activation-1.73e+00

Pos logit contributions
rily+0.028
ware+0.026
uing+0.025
ge+0.025
bell+0.025
Neg logit contributions
 much-0.008
e-0.008
 many-0.008
 very-0.007
 named-0.006
Token ever
Token ID1683
Feature activation+1.58e+00

Pos logit contributions
rily+0.017
ware+0.015
ge+0.015
uing+0.015
fully+0.014
Neg logit contributions
 much-0.011
e-0.010
 many-0.010
 very-0.010
 afraid-0.009
Token and
Token ID290
Feature activation+1.18e+00

Pos logit contributions
 much+0.023
e+0.022
 very+0.022
 many+0.021
 named+0.019
Neg logit contributions
rily-0.051
ware-0.047
uing-0.046
ge-0.045
bell-0.044
Token he
Token ID339
Feature activation-8.28e-01

Pos logit contributions
 much+0.021
e+0.020
 very+0.020
 many+0.020
 named+0.018
Neg logit contributions
rily-0.033
ware-0.030
ge-0.029
uing-0.029
bell-0.028
Token thanked
Token ID26280
Feature activation-3.12e+00

Pos logit contributions
rily+0.024
ware+0.022
ge+0.021
uing+0.021
bell+0.020
Neg logit contributions
 much-0.014
e-0.014
 very-0.013
 many-0.013
 named-0.012
Token the
Token ID262
Feature activation-3.85e-01

Pos logit contributions
rily+0.110
ware+0.101
ge+0.101
uing+0.100
bell+0.097
Neg logit contributions
 much-0.035
e-0.033
 very-0.033
 many-0.032
 named-0.028
Token mer
Token ID4017
Feature activation-1.01e+01

Pos logit contributions
rily+0.008
ware+0.007
ge+0.007
uing+0.007
bell+0.006
Neg logit contributions
 much-0.010
e-0.010
 very-0.009
 many-0.009
 named-0.009
Tokenmaid
Token ID23151
Feature activation+6.83e-01

Pos logit contributions
ge+0.177
 afford+0.155
arded+0.151
driver+0.148
pron+0.147
Neg logit contributions
 very-0.249
e-0.239
 named-0.230
 lives-0.226
 glad-0.220
Token for
Token ID329
Feature activation+1.25e+00

Pos logit contributions
 much+0.011
e+0.011
 very+0.011
 many+0.011
 named+0.010
Neg logit contributions
rily-0.020
ware-0.018
ge-0.018
uing-0.018
bell-0.017
Token being
Token ID852
Feature activation+3.15e+00

Pos logit contributions
 much+0.017
e+0.016
 very+0.016
 many+0.016
 named+0.014
Neg logit contributions
rily-0.040
ware-0.036
ge-0.036
uing-0.036
fully-0.034
Token so
Token ID523
Feature activation+3.03e+00

Pos logit contributions
 much+0.040
e+0.038
 many+0.037
 very+0.035
 And+0.034
Neg logit contributions
rily-0.102
ware-0.089
uing-0.088
ge-0.088
fully-0.086
Token kind
Token ID1611
Feature activation+9.40e-01

Pos logit contributions
e+0.037
 much+0.036
 many+0.034
 very+0.033
 named+0.031
Neg logit contributions
rily-0.101
ge-0.090
ware-0.090
uing-0.090
bell-0.088
Token.
Token ID13
Feature activation+6.41e-01

Pos logit contributions
 much+0.046
 very+0.043
 many+0.041
 a+0.035
e+0.034
Neg logit contributions
rily-0.128
ware-0.123
uing-0.118
ge-0.111
pron-0.111
Token
Token ID220
Feature activation+5.34e-01

Pos logit contributions
 much+0.008
e+0.008
 very+0.008
 many+0.008
 afraid+0.007
Neg logit contributions
rily-0.021
ware-0.019
uing-0.019
ge-0.019
bell-0.018
Token
Token ID198
Feature activation-1.31e+00

Pos logit contributions
 much+0.006
e+0.006
 very+0.006
 many+0.006
 named+0.005
Neg logit contributions
rily-0.018
ware-0.016
ge-0.016
uing-0.016
bell-0.016
TokenThe
Token ID464
Feature activation-4.06e-01

Pos logit contributions
rily+0.049
ware+0.045
uing+0.044
ge+0.044
bell+0.043
Neg logit contributions
 much-0.013
 many-0.012
e-0.012
 very-0.011
 named-0.009
Token strong
Token ID1913
Feature activation-4.03e+00

Pos logit contributions
rily+0.009
ge+0.007
ware+0.007
uing+0.007
bell+0.007
Neg logit contributions
 much-0.010
 very-0.010
e-0.010
 many-0.010
 named-0.009
Token mer
Token ID4017
Feature activation-1.01e+01

Pos logit contributions
rily+0.102
ge+0.094
uing+0.093
ware+0.091
bell+0.083
Neg logit contributions
 very-0.082
 much-0.082
e-0.079
 many-0.079
 named-0.076
Tokenmaid
Token ID23151
Feature activation-1.71e-01

Pos logit contributions
ge+0.177
 afford+0.147
pron+0.146
uster+0.137
driver+0.136
Neg logit contributions
 very-0.255
 lives-0.244
e-0.241
 lived-0.234
 named-0.231
Token thought
Token ID1807
Feature activation-3.68e+00

Pos logit contributions
rily+0.005
ware+0.004
ge+0.004
uing+0.004
bell+0.004
Neg logit contributions
 much-0.003
 very-0.003
e-0.003
 many-0.003
 named-0.003
Token about
Token ID546
Feature activation+1.01e+00

Pos logit contributions
rily+0.120
ware+0.110
uing+0.107
ge+0.106
fully+0.104
Neg logit contributions
e-0.048
 much-0.048
 very-0.045
 many-0.044
 named-0.042
Token the
Token ID262
Feature activation+1.31e+00

Pos logit contributions
e+0.015
 much+0.014
 very+0.013
 afraid+0.013
 named+0.013
Neg logit contributions
rily-0.030
ware-0.028
ge-0.027
uing-0.027
fully-0.026
Token promise
Token ID6991
Feature activation-3.76e-01

Pos logit contributions
 much+0.025
e+0.024
 many+0.023
 very+0.023
 named+0.022
Neg logit contributions
rily-0.036
ware-0.032
ge-0.031
uing-0.031
bell-0.030
Token They
Token ID1119
Feature activation-7.11e-01

Pos logit contributions
 much+0.010
 many+0.010
 very+0.009
 And+0.009
 a+0.008
Neg logit contributions
rily-0.039
ge-0.038
ware-0.037
uing-0.036
pron-0.035
Token had
Token ID550
Feature activation+8.98e-01

Pos logit contributions
rily+0.022
ware+0.020
ge+0.019
uing+0.019
bell+0.018
Neg logit contributions
 much-0.012
 very-0.011
e-0.011
 many-0.011
 named-0.010
Token never
Token ID1239
Feature activation-1.08e+00

Pos logit contributions
e+0.010
 much+0.010
 very+0.009
 many+0.009
 glad+0.009
Neg logit contributions
rily-0.030
ware-0.027
ge-0.027
uing-0.027
bell-0.026
Token seen
Token ID1775
Feature activation-1.25e-01

Pos logit contributions
rily+0.033
ware+0.030
ge+0.029
uing+0.029
bell+0.028
Neg logit contributions
 much-0.017
e-0.016
 very-0.016
 many-0.016
 glad-0.014
Token a
Token ID257
Feature activation-3.51e+00

Pos logit contributions
rily+0.004
ware+0.004
ge+0.004
uing+0.004
fully+0.004
Neg logit contributions
 much-0.001
e-0.001
 very-0.001
 many-0.001
 glad-0.001
Token mer
Token ID4017
Feature activation-1.00e+01

Pos logit contributions
rily+0.100
uing+0.090
ge+0.089
ware+0.085
arded+0.083
Neg logit contributions
 very-0.071
 much-0.069
 many-0.064
 afraid-0.060
 scared-0.060
Tokenmaid
Token ID23151
Feature activation-8.24e-01

Pos logit contributions
ge+0.166
side+0.155
 afford+0.144
cakes+0.142
ware+0.141
Neg logit contributions
 very-0.259
e-0.245
 scared-0.238
ow-0.234
 afraid-0.233
Token before
Token ID878
Feature activation+5.01e+00

Pos logit contributions
rily+0.023
ware+0.022
uing+0.021
ge+0.021
yt+0.020
Neg logit contributions
e-0.015
 very-0.015
 much-0.014
 named-0.014
 a-0.013
Token.
Token ID13
Feature activation+1.31e+00

Pos logit contributions
 much+0.060
 very+0.057
 many+0.055
e+0.054
 named+0.049
Neg logit contributions
rily-0.174
ware-0.160
uing-0.159
ge-0.158
bell-0.152
Token
Token ID198
Feature activation-5.63e-02

Pos logit contributions
 much+0.015
 many+0.014
 very+0.014
 And+0.013
 named+0.012
Neg logit contributions
rily-0.047
ge-0.044
ware-0.044
uing-0.043
yt-0.041
Token
Token ID198
Feature activation-9.07e-01

Pos logit contributions
rily+0.002
ware+0.002
uing+0.002
ge+0.002
bell+0.002
Neg logit contributions
e-0.001
 much-0.001
 very-0.001
 many-0.001
 afraid-0.001
Token sun
Token ID4252
Feature activation+1.46e-01

Pos logit contributions
e+0.085
 much+0.081
ow+0.079
 named+0.076
 many+0.075
Neg logit contributions
rily-0.117
ware-0.105
ly-0.102
bell-0.102
yt-0.098
Token.
Token ID13
Feature activation+5.19e-01

Pos logit contributions
 very+0.001
 much+0.001
e+0.001
 many+0.001
 a+0.001
Neg logit contributions
rily-0.005
uing-0.005
ge-0.005
ware-0.005
bell-0.005
Token
Token ID198
Feature activation-1.08e+00

Pos logit contributions
e+0.009
 much+0.008
 very+0.008
 afraid+0.008
 glad+0.007
Neg logit contributions
rily-0.015
ware-0.013
bell-0.013
ro-0.013
fully-0.013
Token
Token ID198
Feature activation-1.69e+00

Pos logit contributions
rily+0.035
ware+0.032
ge+0.032
uing+0.031
bell+0.031
Neg logit contributions
 much-0.014
e-0.014
 very-0.013
 many-0.012
 glad-0.012
TokenA
Token ID32
Feature activation-3.15e+00

Pos logit contributions
rily+0.035
ge+0.032
bell+0.031
ware+0.031
uing+0.030
Neg logit contributions
e-0.038
ow-0.036
 much-0.036
icks-0.036
 very-0.035
Token mer
Token ID4017
Feature activation-1.00e+01

Pos logit contributions
rily+0.076
ware+0.067
ge+0.065
uing+0.065
bell+0.063
Neg logit contributions
 much-0.068
e-0.068
 many-0.065
 very-0.065
 named-0.061
Tokenmaid
Token ID23151
Feature activation-3.74e-01

Pos logit contributions
ge+0.154
 afford+0.127
rily+0.123
side+0.121
pron+0.121
Neg logit contributions
 very-0.279
e-0.269
 scared-0.260
 afraid-0.260
ow-0.259
Token that
Token ID326
Feature activation-1.63e+00

Pos logit contributions
rily+0.012
ware+0.011
ge+0.011
uing+0.011
bell+0.011
Neg logit contributions
 much-0.005
e-0.005
 many-0.005
 very-0.005
 glad-0.004
Token lived
Token ID5615
Feature activation+3.60e-02

Pos logit contributions
ware+0.033
rily+0.029
yt+0.029
bell+0.028
pron+0.027
Neg logit contributions
ow-0.036
e-0.036
icks-0.035
aw-0.034
 much-0.034
Token in
Token ID287
Feature activation+3.80e+00

Pos logit contributions
 much+0.001
e+0.000
 many+0.000
 very+0.000
 And+0.000
Neg logit contributions
rily-0.001
ware-0.001
ge-0.001
uing-0.001
fully-0.001
Token the
Token ID262
Feature activation+3.08e+00

Pos logit contributions
aw+0.047
 afraid+0.041
 glad+0.041
 much+0.040
 And+0.040
Neg logit contributions
rily-0.113
ware-0.104
ro-0.098
ge-0.097
bell-0.097
Token,
Token ID11
Feature activation-1.46e+00

Pos logit contributions
 much+0.002
 very+0.002
 many+0.002
e+0.002
 a+0.002
Neg logit contributions
rily-0.006
ware-0.006
ge-0.006
uing-0.006
bell-0.005
Token she
Token ID673
Feature activation-6.22e-01

Pos logit contributions
rily+0.039
ware+0.035
ge+0.035
uing+0.034
fully+0.033
Neg logit contributions
e-0.027
 much-0.026
 very-0.025
 many-0.025
 named-0.024
Token saw
Token ID2497
Feature activation+3.81e-01

Pos logit contributions
rily+0.020
ware+0.018
ge+0.017
uing+0.017
bell+0.017
Neg logit contributions
 much-0.010
 very-0.009
e-0.009
 many-0.009
 named-0.008
Token a
Token ID257
Feature activation-2.68e+00

Pos logit contributions
e+0.003
 much+0.003
 very+0.002
 glad+0.002
 named+0.002
Neg logit contributions
rily-0.014
ware-0.013
fully-0.013
ge-0.012
uing-0.012
Token beautiful
Token ID4950
Feature activation-2.39e+00

Pos logit contributions
rily+0.082
ware+0.074
ge+0.073
uing+0.073
bell+0.070
Neg logit contributions
 much-0.043
 very-0.041
e-0.039
 many-0.039
 named-0.036
Token mer
Token ID4017
Feature activation-1.00e+01

Pos logit contributions
rily+0.067
ge+0.060
uing+0.060
ware+0.060
bell+0.056
Neg logit contributions
 much-0.043
 very-0.043
e-0.041
 many-0.041
 named-0.039
Tokenmaid
Token ID23151
Feature activation-7.11e-01

Pos logit contributions
ge+0.158
 afford+0.140
cakes+0.134
arded+0.133
ware+0.133
Neg logit contributions
 very-0.275
e-0.259
 scared-0.252
 afraid-0.246
 named-0.240
Token swimming
Token ID14899
Feature activation-2.54e+00

Pos logit contributions
rily+0.023
ware+0.021
uing+0.020
ge+0.020
bell+0.019
Neg logit contributions
 much-0.010
e-0.010
 very-0.010
 many-0.009
 named-0.009
Token around
Token ID1088
Feature activation+5.80e-01

Pos logit contributions
rily+0.086
ware+0.080
ge+0.079
uing+0.079
bell+0.076
Neg logit contributions
 much-0.031
 very-0.030
e-0.029
 many-0.028
 named-0.026
Token her
Token ID607
Feature activation-1.36e+00

Pos logit contributions
 much+0.006
 very+0.006
e+0.006
 many+0.006
 named+0.005
Neg logit contributions
rily-0.020
ware-0.019
uing-0.019
ge-0.019
bell-0.018
Token.
Token ID13
Feature activation+4.95e-01

Pos logit contributions
rily+0.043
ware+0.039
ge+0.038
uing+0.038
bell+0.036
Neg logit contributions
 much-0.020
e-0.019
 very-0.019
 many-0.019
 named-0.017
Token Joe
Token ID5689
Feature activation+8.46e-01

Pos logit contributions
e+0.007
 much+0.007
 very+0.007
 many+0.006
 named+0.006
Neg logit contributions
rily-0.009
ware-0.008
uing-0.008
ge-0.008
bell-0.008
Token waved
Token ID26834
Feature activation-7.29e-01

Pos logit contributions
 much+0.015
e+0.014
 very+0.014
 many+0.014
 named+0.013
Neg logit contributions
rily-0.024
ware-0.022
ge-0.022
uing-0.022
bell-0.021
Token goodbye
Token ID24829
Feature activation-6.95e-01

Pos logit contributions
rily+0.022
ware+0.021
ge+0.021
uing+0.020
pron+0.019
Neg logit contributions
 much-0.012
 very-0.011
 many-0.011
 a-0.011
e-0.010
Token to
Token ID284
Feature activation-1.66e+00

Pos logit contributions
rily+0.021
ware+0.020
uing+0.019
ge+0.019
bell+0.018
Neg logit contributions
 very-0.012
 much-0.012
 many-0.011
 a-0.011
e-0.010
Token the
Token ID262
Feature activation+6.66e-01

Pos logit contributions
rily+0.061
ware+0.057
ge+0.056
uing+0.055
bell+0.054
Neg logit contributions
 much-0.015
e-0.014
 very-0.014
 many-0.013
 named-0.012
Token mer
Token ID4017
Feature activation-1.00e+01

Pos logit contributions
 much+0.018
e+0.017
 very+0.017
 many+0.017
 named+0.016
Neg logit contributions
rily-0.013
ware-0.011
ge-0.011
uing-0.011
bell-0.010
Tokenmaid
Token ID23151
Feature activation+3.48e-01

Pos logit contributions
ge+0.178
 afford+0.161
driver+0.156
pron+0.153
arded+0.150
Neg logit contributions
 very-0.244
e-0.232
 named-0.223
 lives-0.216
 scared-0.212
Token,
Token ID11
Feature activation-4.04e+00

Pos logit contributions
 much+0.006
e+0.005
 very+0.005
 many+0.005
 named+0.005
Neg logit contributions
rily-0.010
ware-0.009
uing-0.009
ge-0.009
bell-0.009
Token and
Token ID290
Feature activation-1.28e+00

Pos logit contributions
rily+0.118
ware+0.109
uing+0.106
ge+0.105
bell+0.100
Neg logit contributions
 much-0.070
 very-0.068
 many-0.066
e-0.066
 glad-0.063
Token hopped
Token ID45230
Feature activation-3.05e+00

Pos logit contributions
rily+0.036
ware+0.032
ge+0.031
uing+0.031
bell+0.030
Neg logit contributions
 much-0.023
e-0.023
 very-0.022
 many-0.022
 glad-0.020
Token back
Token ID736
Feature activation+2.53e-01

Pos logit contributions
rily+0.089
ware+0.084
ge+0.083
uing+0.081
bell+0.078
Neg logit contributions
 much-0.052
 very-0.052
 many-0.049
e-0.049
 a-0.046
Token,
Token ID11
Feature activation-3.08e+00

Pos logit contributions
rily+0.026
ware+0.023
ge+0.023
uing+0.023
bell+0.022
Neg logit contributions
 much-0.012
e-0.011
 very-0.011
 many-0.011
 named-0.010
Token and
Token ID290
Feature activation+4.84e-01

Pos logit contributions
rily+0.083
ware+0.074
ge+0.073
uing+0.072
bell+0.070
Neg logit contributions
 much-0.058
e-0.057
 very-0.055
 many-0.055
 named-0.051
Token she
Token ID673
Feature activation-2.03e-01

Pos logit contributions
 much+0.008
e+0.007
 very+0.007
 many+0.007
 named+0.007
Neg logit contributions
rily-0.015
ware-0.013
ge-0.013
uing-0.013
bell-0.012
Token wanted
Token ID2227
Feature activation-1.52e+00

Pos logit contributions
rily+0.006
ware+0.005
ge+0.005
uing+0.005
bell+0.005
Neg logit contributions
 much-0.004
e-0.004
 many-0.003
 very-0.003
 glad-0.003
Token the
Token ID262
Feature activation+6.22e-01

Pos logit contributions
rily+0.037
ware+0.033
ge+0.032
uing+0.032
bell+0.030
Neg logit contributions
 much-0.033
e-0.032
 very-0.031
 many-0.031
 named-0.030
Token mer
Token ID4017
Feature activation-9.98e+00

Pos logit contributions
 much+0.012
e+0.011
 very+0.011
 many+0.011
 named+0.010
Neg logit contributions
rily-0.017
ware-0.015
ge-0.015
uing-0.015
bell-0.015
Tokenmaid
Token ID23151
Feature activation-7.18e-01

Pos logit contributions
ge+0.185
 checkout+0.151
driver+0.150
 afford+0.147
uster+0.147
Neg logit contributions
 very-0.246
e-0.231
 lives-0.226
 best-0.220
 afraid-0.215
Token to
Token ID284
Feature activation-1.60e+00

Pos logit contributions
rily+0.019
ware+0.017
ge+0.016
uing+0.016
bell+0.015
Neg logit contributions
 much-0.015
e-0.014
 very-0.014
 many-0.014
 named-0.013
Token come
Token ID1282
Feature activation-2.10e+00

Pos logit contributions
rily+0.049
ware+0.044
ge+0.043
uing+0.042
bell+0.041
Neg logit contributions
 much-0.028
e-0.028
 very-0.027
 many-0.026
 named-0.025
Token too
Token ID1165
Feature activation+3.56e+00

Pos logit contributions
rily+0.069
ware+0.069
uing+0.063
pron+0.061
ge+0.061
Neg logit contributions
 very-0.033
 much-0.031
 a-0.030
e-0.028
 many-0.028
Token.
Token ID13
Feature activation+6.41e-01

Pos logit contributions
 much+0.046
 very+0.043
 many+0.041
 a+0.035
e+0.034
Neg logit contributions
rily-0.128
ware-0.123
uing-0.118
ge-0.111
pron-0.111
Token beach
Token ID10481
Feature activation-5.91e-01

Pos logit contributions
 much+0.014
 very+0.014
 many+0.013
e+0.013
 named+0.012
Neg logit contributions
rily-0.016
ware-0.014
uing-0.014
ge-0.014
bell-0.013
Token,
Token ID11
Feature activation-9.97e-01

Pos logit contributions
rily+0.021
ware+0.019
ge+0.018
uing+0.018
bell+0.018
Neg logit contributions
 much-0.007
 very-0.007
e-0.007
 many-0.006
 named-0.006
Token he
Token ID339
Feature activation-8.68e-01

Pos logit contributions
rily+0.027
ware+0.024
ge+0.024
uing+0.024
bell+0.023
Neg logit contributions
 much-0.019
e-0.018
 very-0.018
 many-0.018
 named-0.017
Token saw
Token ID2497
Feature activation+6.65e-01

Pos logit contributions
rily+0.027
ware+0.024
uing+0.023
ge+0.023
bell+0.022
Neg logit contributions
 much-0.014
 very-0.014
e-0.014
 many-0.013
 named-0.012
Token a
Token ID257
Feature activation-2.24e+00

Pos logit contributions
e+0.007
 much+0.005
ow+0.005
 And+0.005
 glad+0.004
Neg logit contributions
rily-0.023
ware-0.022
fully-0.021
ly-0.021
ge-0.020
Token mer
Token ID4017
Feature activation-9.94e+00

Pos logit contributions
rily+0.068
ware+0.061
ge+0.060
uing+0.060
bell+0.058
Neg logit contributions
 much-0.035
e-0.034
 very-0.033
 many-0.033
 named-0.030
Tokenmaid
Token ID23151
Feature activation-7.69e-01

Pos logit contributions
ge+0.148
ware+0.121
arded+0.119
 afford+0.118
side+0.114
Neg logit contributions
 very-0.289
e-0.268
 scared-0.267
 afraid-0.262
aw-0.251
Token.
Token ID13
Feature activation+1.42e-02

Pos logit contributions
rily+0.025
ware+0.023
uing+0.022
ge+0.022
bell+0.021
Neg logit contributions
 much-0.011
e-0.010
 very-0.010
 many-0.010
 named-0.009
Token She
Token ID1375
Feature activation-1.32e+00

Pos logit contributions
 much+0.000
 very+0.000
 many+0.000
 And+0.000
 a+0.000
Neg logit contributions
rily-0.000
ge-0.000
uing-0.000
ware-0.000
yt-0.000
Token had
Token ID550
Feature activation+1.00e+00

Pos logit contributions
rily+0.041
ware+0.037
uing+0.037
ge+0.037
fully+0.034
Neg logit contributions
 much-0.021
 very-0.021
e-0.020
 many-0.019
 named-0.019
Token beautiful
Token ID4950
Feature activation+4.18e-02

Pos logit contributions
 much+0.008
e+0.008
 very+0.007
 many+0.007
 named+0.006
Neg logit contributions
rily-0.037
ware-0.034
ge-0.034
uing-0.033
bell-0.033
Token passport
Token ID17981
Feature activation-1.44e+00

Pos logit contributions
rily+0.016
ware+0.015
ge+0.014
uing+0.014
bell+0.014
Neg logit contributions
 much-0.011
e-0.011
 many-0.010
 very-0.010
 glad-0.010
Token belonged
Token ID19611
Feature activation+2.54e+00

Pos logit contributions
rily+0.047
ware+0.043
ge+0.042
uing+0.042
bell+0.040
Neg logit contributions
 much-0.019
e-0.019
 very-0.018
 many-0.018
 named-0.016
Token to
Token ID284
Feature activation+7.76e-01

Pos logit contributions
 much+0.052
e+0.050
 very+0.049
 many+0.049
 named+0.046
Neg logit contributions
rily-0.067
ware-0.059
ge-0.058
uing-0.058
bell-0.055
Token a
Token ID257
Feature activation-1.56e+00

Pos logit contributions
 much+0.008
e+0.007
 very+0.007
 many+0.007
 named+0.006
Neg logit contributions
rily-0.028
ware-0.026
ge-0.026
uing-0.026
bell-0.025
Token magical
Token ID10883
Feature activation-3.54e+00

Pos logit contributions
rily+0.047
ware+0.042
ge+0.041
uing+0.041
bell+0.039
Neg logit contributions
 much-0.026
e-0.025
 very-0.024
 many-0.024
 named-0.022
Token mer
Token ID4017
Feature activation-9.83e+00

Pos logit contributions
rily+0.102
ware+0.090
uing+0.090
ge+0.090
bell+0.084
Neg logit contributions
 much-0.063
 very-0.061
e-0.060
 many-0.059
 named-0.056
Tokenmaid
Token ID23151
Feature activation+2.84e-01

Pos logit contributions
ge+0.146
arded+0.124
cakes+0.123
 checkout+0.120
side+0.119
Neg logit contributions
 very-0.270
e-0.247
 lives-0.240
aw-0.236
 afraid-0.234
Token who
Token ID508
Feature activation+1.58e+00

Pos logit contributions
 much+0.003
e+0.003
 very+0.003
 many+0.003
 named+0.003
Neg logit contributions
rily-0.010
ware-0.009
uing-0.009
ge-0.009
bell-0.008
Token had
Token ID550
Feature activation+2.06e+00

Pos logit contributions
 much+0.024
e+0.022
 many+0.022
 very+0.022
 glad+0.020
Neg logit contributions
rily-0.048
ware-0.044
ge-0.043
uing-0.042
bell-0.042
Token been
Token ID587
Feature activation+2.56e+00

Pos logit contributions
 much+0.030
e+0.029
 very+0.028
 many+0.027
 named+0.026
Neg logit contributions
rily-0.065
ware-0.059
ge-0.058
uing-0.057
bell-0.056
Token visiting
Token ID10013
Feature activation-6.60e-01

Pos logit contributions
 much+0.038
e+0.036
 many+0.035
 very+0.035
 glad+0.032
Neg logit contributions
rily-0.079
ware-0.071
ge-0.071
uing-0.070
bell-0.068
Token looked
Token ID3114
Feature activation-8.12e-01

Pos logit contributions
rily+0.051
ware+0.046
ge+0.045
uing+0.045
bell+0.043
Neg logit contributions
 much-0.027
e-0.026
 very-0.025
 many-0.025
 named-0.023
Token back
Token ID736
Feature activation+7.56e-01

Pos logit contributions
rily+0.026
ware+0.023
uing+0.023
ge+0.023
bell+0.022
Neg logit contributions
e-0.012
 much-0.012
 many-0.011
 very-0.011
 named-0.010
Token and
Token ID290
Feature activation+6.63e-01

Pos logit contributions
 much+0.009
 very+0.009
 many+0.009
e+0.008
 glad+0.008
Neg logit contributions
rily-0.026
ware-0.024
uing-0.023
ge-0.023
bell-0.022
Token saw
Token ID2497
Feature activation+2.27e-01

Pos logit contributions
 much+0.012
 very+0.012
e+0.011
 many+0.011
 glad+0.011
Neg logit contributions
rily-0.019
ware-0.017
uing-0.017
ge-0.017
bell-0.016
Token the
Token ID262
Feature activation+1.63e+00

Pos logit contributions
e+0.003
 much+0.003
 very+0.002
 many+0.002
 named+0.002
Neg logit contributions
rily-0.008
ware-0.007
ge-0.007
uing-0.007
fully-0.007
Token mer
Token ID4017
Feature activation-9.79e+00

Pos logit contributions
 much+0.038
 very+0.037
e+0.036
 many+0.036
 glad+0.034
Neg logit contributions
rily-0.038
ware-0.033
ge-0.033
uing-0.033
bell-0.031
Tokenmaid
Token ID23151
Feature activation+2.16e-02

Pos logit contributions
ge+0.180
driver+0.154
 afford+0.151
arded+0.146
pron+0.141
Neg logit contributions
 very-0.247
e-0.227
 lives-0.220
 glad-0.220
 scared-0.217
Token still
Token ID991
Feature activation-1.32e+00

Pos logit contributions
 much+0.000
 very+0.000
e+0.000
 many+0.000
 glad+0.000
Neg logit contributions
rily-0.001
ware-0.001
uing-0.001
ge-0.001
bell-0.001
Token waving
Token ID25849
Feature activation+2.82e-01

Pos logit contributions
rily+0.042
ware+0.038
uing+0.037
ge+0.036
bell+0.035
Neg logit contributions
 much-0.020
 very-0.020
e-0.019
 many-0.019
 glad-0.018
Token and
Token ID290
Feature activation+1.18e+00

Pos logit contributions
 much+0.004
 very+0.004
 a+0.003
 many+0.003
e+0.003
Neg logit contributions
rily-0.009
ware-0.009
ge-0.009
uing-0.009
bell-0.008
Token that
Token ID326
Feature activation-9.92e-02

Pos logit contributions
 much+0.019
 very+0.019
e+0.018
 many+0.018
 glad+0.017
Neg logit contributions
rily-0.036
ware-0.033
uing-0.032
ge-0.032
bell-0.031
Token!
Token ID0
Feature activation-1.54e-02

Pos logit contributions
rily+0.043
ware+0.039
ge+0.039
uing+0.038
bell+0.037
Neg logit contributions
 much-0.015
e-0.014
 very-0.014
 many-0.013
 named-0.012
Token
Token ID198
Feature activation-1.69e+00

Pos logit contributions
rily+0.001
ware+0.001
ge+0.000
uing+0.000
bell+0.000
Neg logit contributions
 much-0.000
 very-0.000
 many-0.000
e-0.000
 And-0.000
Token
Token ID198
Feature activation-2.52e+00

Pos logit contributions
rily+0.061
ware+0.057
uing+0.055
ge+0.055
bell+0.054
Neg logit contributions
 much-0.018
 very-0.016
 many-0.016
e-0.016
 named-0.014
TokenJimmy
Token ID40335
Feature activation+6.18e-01

Pos logit contributions
rily+0.060
ware+0.054
ge+0.051
uing+0.050
bell+0.048
Neg logit contributions
 much-0.060
 very-0.057
 many-0.057
e-0.055
 a-0.052
Token was
Token ID373
Feature activation+1.05e+00

Pos logit contributions
 much+0.011
e+0.011
 very+0.010
 many+0.010
 named+0.009
Neg logit contributions
rily-0.018
ware-0.016
ge-0.016
uing-0.015
bell-0.015
Token so
Token ID523
Feature activation+4.50e+00

Pos logit contributions
e+0.015
 much+0.014
 lives+0.013
aw+0.013
 And+0.013
Neg logit contributions
rily-0.030
ge-0.026
uing-0.025
fully-0.025
bell-0.025
Token strong
Token ID1913
Feature activation+1.50e+00

Pos logit contributions
 much+0.058
e+0.058
 very+0.053
 many+0.053
 named+0.049
Neg logit contributions
rily-0.148
ge-0.134
ware-0.133
uing-0.133
bell-0.128
Token,
Token ID11
Feature activation-1.29e+00

Pos logit contributions
 much+0.018
 very+0.017
 many+0.016
e+0.016
 named+0.014
Neg logit contributions
rily-0.053
ware-0.049
uing-0.048
ge-0.047
bell-0.046
Token he
Token ID339
Feature activation-1.01e+00

Pos logit contributions
rily+0.036
ware+0.032
uing+0.032
ge+0.031
bell+0.030
Neg logit contributions
 much-0.024
 very-0.023
e-0.023
 many-0.023
 named-0.021
Token could
Token ID714
Feature activation-3.17e+00

Pos logit contributions
rily+0.031
ware+0.027
ge+0.027
uing+0.027
yt+0.025
Neg logit contributions
 very-0.018
 much-0.017
e-0.016
 many-0.016
 named-0.016
Token swing
Token ID9628
Feature activation-1.30e+00

Pos logit contributions
rily+0.100
ware+0.091
ge+0.088
uing+0.087
yt+0.083
Neg logit contributions
 much-0.052
 very-0.051
e-0.049
 many-0.048
 a-0.045
Token badly
Token ID11234
Feature activation+1.98e+00

Pos logit contributions
 much+0.026
e+0.025
 very+0.024
 many+0.024
 named+0.022
Neg logit contributions
rily-0.063
ware-0.058
ge-0.057
uing-0.056
bell-0.054
Token.
Token ID13
Feature activation+1.84e+00

Pos logit contributions
 much+0.024
 very+0.022
 many+0.022
e+0.021
 afraid+0.018
Neg logit contributions
rily-0.070
ware-0.065
uing-0.063
bell-0.062
ge-0.061
Token Tim
Token ID5045
Feature activation-1.38e+00

Pos logit contributions
e+0.037
 much+0.032
 many+0.031
 very+0.030
aks+0.030
Neg logit contributions
rily-0.047
bell-0.042
ge-0.041
ware-0.040
ly-0.040
Tokenmy
Token ID1820
Feature activation+7.49e-01

Pos logit contributions
rily+0.036
ge+0.032
ware+0.031
uing+0.031
bell+0.030
Neg logit contributions
 much-0.028
e-0.027
 many-0.026
 very-0.025
 named-0.024
Token felt
Token ID2936
Feature activation+4.18e+00

Pos logit contributions
 much+0.014
e+0.013
 many+0.013
aks+0.013
 glad+0.012
Neg logit contributions
rily-0.019
ly-0.016
ge-0.016
bell-0.016
ware-0.016
Token very
Token ID845
Feature activation+4.32e+00

Pos logit contributions
e+0.058
aks+0.056
 ost+0.054
 ze+0.052
 Africa+0.052
Neg logit contributions
rily-0.112
ly-0.108
 afford-0.094
fully-0.094
 trigger-0.092
Token sorry
Token ID7926
Feature activation-3.11e+00

Pos logit contributions
e+0.065
 Africa+0.059
 ze+0.055
 ost+0.055
aks+0.052
Neg logit contributions
rily-0.123
ly-0.118
uing-0.105
fully-0.103
ge-0.103
Token and
Token ID290
Feature activation+1.08e+00

Pos logit contributions
rily+0.104
ware+0.096
ge+0.094
uing+0.093
bell+0.091
Neg logit contributions
 much-0.043
 very-0.041
 many-0.040
e-0.037
 a-0.034
Token wished
Token ID16555
Feature activation-4.58e+00

Pos logit contributions
e+0.024
 much+0.021
aks+0.020
 many+0.020
ow+0.019
Neg logit contributions
rily-0.026
ly-0.023
bell-0.022
fully-0.022
ware-0.022
Token he
Token ID339
Feature activation-2.06e+00

Pos logit contributions
rily+0.125
ware+0.112
ge+0.110
uing+0.109
bell+0.105
Neg logit contributions
 much-0.088
e-0.084
 very-0.083
 many-0.082
 named-0.077
Token had
Token ID550
Feature activation+6.65e-01

Pos logit contributions
rily+0.062
ware+0.056
ge+0.055
uing+0.054
bell+0.054
Neg logit contributions
 much-0.032
e-0.031
 many-0.030
 very-0.030
 glad-0.028
Token are
Token ID389
Feature activation+1.08e+00

Pos logit contributions
rily+0.006
ware+0.006
uing+0.006
ge+0.006
bell+0.006
Neg logit contributions
 much-0.004
e-0.004
 many-0.003
 very-0.003
 afraid-0.003
Token very
Token ID845
Feature activation+4.57e+00

Pos logit contributions
e+0.015
aw+0.013
ow+0.012
 much+0.012
 And+0.012
Neg logit contributions
rily-0.033
uing-0.028
pron-0.028
ware-0.028
bell-0.028
Token tired
Token ID10032
Feature activation-4.26e-01

Pos logit contributions
e+0.059
 much+0.050
 many+0.048
 named+0.046
 very+0.046
Neg logit contributions
rily-0.150
uing-0.137
ge-0.134
bell-0.132
ware-0.132
Token and
Token ID290
Feature activation+3.25e+00

Pos logit contributions
rily+0.015
ware+0.014
uing+0.014
pron+0.013
ge+0.013
Neg logit contributions
 much-0.006
 very-0.006
 many-0.005
 a-0.005
e-0.004
Token hungry
Token ID14720
Feature activation-8.33e-01

Pos logit contributions
e+0.040
 much+0.035
 many+0.033
 named+0.033
aw+0.032
Neg logit contributions
rily-0.105
ge-0.094
uing-0.094
bell-0.093
ware-0.093
Token.
Token ID13
Feature activation+3.40e+00

Pos logit contributions
rily+0.030
uing+0.028
ware+0.028
ge+0.028
bell+0.027
Neg logit contributions
 much-0.009
 very-0.009
 many-0.008
e-0.008
 a-0.007
Token They
Token ID1119
Feature activation+4.11e-01

Pos logit contributions
e+0.053
 much+0.050
 very+0.048
 many+0.045
ow+0.044
Neg logit contributions
rily-0.102
ware-0.093
uing-0.090
bell-0.088
ro-0.088
Token eat
Token ID4483
Feature activation-1.50e+00

Pos logit contributions
 much+0.007
e+0.007
 many+0.006
 very+0.006
 glad+0.006
Neg logit contributions
rily-0.011
ware-0.010
bell-0.010
uing-0.010
ge-0.010
Token a
Token ID257
Feature activation-1.66e+00

Pos logit contributions
rily+0.056
ge+0.052
ware+0.052
uing+0.051
bell+0.050
Neg logit contributions
 much-0.014
 very-0.013
 many-0.012
e-0.012
 a-0.010
Token y
Token ID331
Feature activation-2.06e+00

Pos logit contributions
rily+0.046
ge+0.041
uing+0.040
ware+0.040
fully+0.038
Neg logit contributions
 much-0.033
 very-0.031
 many-0.031
e-0.029
 named-0.028
Tokenummy
Token ID13513
Feature activation-1.87e+00

Pos logit contributions
rily+0.039
ware+0.033
ge+0.032
uing+0.031
bell+0.030
Neg logit contributions
 much-0.057
e-0.055
 very-0.055
 many-0.054
 named-0.051
Token it
Token ID340
Feature activation-1.84e+00

Pos logit contributions
rily+0.098
ware+0.088
ge+0.087
uing+0.087
bell+0.084
Neg logit contributions
 much-0.034
 many-0.033
 very-0.032
e-0.030
 a-0.028
Token was
Token ID373
Feature activation+2.14e+00

Pos logit contributions
rily+0.061
ware+0.054
ge+0.054
uing+0.053
yt+0.051
Neg logit contributions
 much-0.028
 very-0.027
 many-0.026
e-0.025
 a-0.024
Token going
Token ID1016
Feature activation-1.11e+00

Pos logit contributions
 much+0.017
e+0.015
 very+0.015
 many+0.014
 named+0.012
Neg logit contributions
rily-0.082
ware-0.076
ge-0.075
uing-0.075
bell-0.072
Token to
Token ID284
Feature activation-1.91e-01

Pos logit contributions
rily+0.036
ware+0.032
ge+0.031
uing+0.031
yt+0.029
Neg logit contributions
 very-0.020
 much-0.019
 many-0.018
 a-0.017
 an-0.015
Token be
Token ID307
Feature activation+3.17e+00

Pos logit contributions
rily+0.006
ware+0.006
uing+0.006
ge+0.006
ly+0.005
Neg logit contributions
 much-0.003
 many-0.003
 very-0.003
e-0.003
 a-0.002
Token so
Token ID523
Feature activation+4.04e+00

Pos logit contributions
 much+0.029
e+0.027
 very+0.026
 many+0.025
 named+0.022
Neg logit contributions
rily-0.118
ware-0.108
ge-0.107
uing-0.107
bell-0.103
Token much
Token ID881
Feature activation+3.69e+00

Pos logit contributions
 much+0.042
e+0.040
 very+0.038
 many+0.037
 named+0.033
Neg logit contributions
rily-0.146
ware-0.134
ge-0.132
uing-0.131
bell-0.127
Token fun
Token ID1257
Feature activation-1.33e+00

Pos logit contributions
 much+0.045
 very+0.040
 many+0.037
 a+0.033
 scared+0.030
Neg logit contributions
rily-0.146
ware-0.132
uing-0.127
ge-0.125
fully-0.122
Token!
Token ID0
Feature activation+1.19e+00

Pos logit contributions
rily+0.047
ware+0.043
uing+0.042
ge+0.041
bell+0.040
Neg logit contributions
 much-0.017
 very-0.016
 many-0.015
e-0.014
 a-0.014
Token
Token ID220
Feature activation+6.97e-01

Pos logit contributions
e+0.019
 much+0.018
 very+0.017
 many+0.017
 afraid+0.016
Neg logit contributions
rily-0.036
ware-0.031
uing-0.031
bell-0.030
ro-0.030
Token
Token ID198
Feature activation-7.26e-01

Pos logit contributions
 much+0.008
e+0.008
 very+0.008
 many+0.008
 named+0.007
Neg logit contributions
rily-0.024
ware-0.022
ge-0.021
uing-0.021
bell-0.021
Token and
Token ID290
Feature activation-1.69e-01

Pos logit contributions
 very+0.006
 much+0.006
e+0.006
 a+0.005
 many+0.005
Neg logit contributions
rily-0.019
uing-0.018
ware-0.017
ge-0.017
arded-0.016
Token it
Token ID340
Feature activation-2.85e-01

Pos logit contributions
rily+0.005
ware+0.004
ge+0.004
uing+0.004
bell+0.004
Neg logit contributions
 much-0.003
e-0.003
 very-0.003
 many-0.003
 glad-0.003
Token started
Token ID2067
Feature activation+1.62e+00

Pos logit contributions
rily+0.009
uing+0.008
ge+0.008
ware+0.008
yt+0.008
Neg logit contributions
 much-0.004
 very-0.004
e-0.004
 many-0.004
 a-0.004
Token to
Token ID284
Feature activation+2.78e-01

Pos logit contributions
 much+0.034
e+0.033
 very+0.032
 many+0.032
 named+0.030
Neg logit contributions
rily-0.041
ware-0.037
ge-0.036
uing-0.036
bell-0.034
Token burn
Token ID4245
Feature activation+8.32e-01

Pos logit contributions
 much+0.006
 very+0.005
e+0.005
 many+0.005
 a+0.005
Neg logit contributions
rily-0.007
ware-0.006
uing-0.006
ge-0.006
bell-0.006
Token the
Token ID262
Feature activation+2.90e+00

Pos logit contributions
 much+0.011
e+0.011
 very+0.010
 many+0.010
 named+0.009
Neg logit contributions
rily-0.027
ware-0.025
ge-0.024
uing-0.024
bell-0.024
Token paper
Token ID3348
Feature activation+1.49e+00

Pos logit contributions
 much+0.054
e+0.053
 very+0.051
 many+0.051
 named+0.048
Neg logit contributions
rily-0.080
ware-0.072
ge-0.069
uing-0.069
bell-0.067
Token.
Token ID13
Feature activation+2.88e-01

Pos logit contributions
 very+0.017
 much+0.017
e+0.016
 many+0.015
 a+0.014
Neg logit contributions
rily-0.050
uing-0.047
ware-0.047
ge-0.047
ory-0.045
Token She
Token ID1375
Feature activation-5.94e-01

Pos logit contributions
 much+0.005
e+0.005
 very+0.004
 many+0.004
 named+0.004
Neg logit contributions
rily-0.009
ware-0.008
uing-0.008
ge-0.008
bell-0.007
Token tried
Token ID3088
Feature activation-2.69e+00

Pos logit contributions
rily+0.016
ware+0.015
ge+0.014
uing+0.014
bell+0.014
Neg logit contributions
 much-0.011
e-0.011
 many-0.010
 glad-0.010
 very-0.010
Token to
Token ID284
Feature activation-2.13e+00

Pos logit contributions
rily+0.075
ware+0.066
uing+0.065
ge+0.064
bell+0.062
Neg logit contributions
 much-0.054
 very-0.052
 many-0.052
e-0.049
 a-0.047
Token a
Token ID257
Feature activation-3.64e-01

Pos logit contributions
ow+0.030
 afraid+0.029
e+0.028
 ost+0.027
icks+0.025
Neg logit contributions
rily-0.105
ware-0.093
fully-0.092
pron-0.089
oph-0.086
Token bunny
Token ID44915
Feature activation+2.07e+00

Pos logit contributions
rily+0.008
ware+0.008
ge+0.007
bell+0.007
pron+0.007
Neg logit contributions
e-0.008
 much-0.008
ow-0.007
 many-0.007
 very-0.007
Token named
Token ID3706
Feature activation+1.11e+00

Pos logit contributions
 glad+0.023
 much+0.023
 worst+0.022
aks+0.021
 best+0.021
Neg logit contributions
rily-0.060
fully-0.057
ly-0.056
ware-0.055
ro-0.055
Token Benny
Token ID44275
Feature activation+9.37e-01

Pos logit contributions
 much+0.024
e+0.024
 very+0.023
 many+0.023
 glad+0.022
Neg logit contributions
rily-0.027
ware-0.023
ge-0.023
uing-0.022
fully-0.022
Token.
Token ID13
Feature activation-5.56e-02

Pos logit contributions
 glad+0.017
 much+0.017
 sorry+0.016
e+0.016
 afraid+0.016
Neg logit contributions
rily-0.022
ge-0.022
fully-0.021
ware-0.020
ly-0.020
Token Benny
Token ID44275
Feature activation-1.93e-01

Pos logit contributions
rily+0.001
ware+0.001
ge+0.001
uing+0.001
bell+0.001
Neg logit contributions
 much-0.001
 very-0.001
 many-0.001
e-0.001
 named-0.001
Token loved
Token ID6151
Feature activation-2.17e+00

Pos logit contributions
rily+0.004
ro+0.004
ly+0.004
fully+0.004
ware+0.004
Neg logit contributions
 sorry-0.003
ow-0.003
 glad-0.003
icks-0.003
 ost-0.003
Token to
Token ID284
Feature activation-1.84e+00

Pos logit contributions
rily+0.043
ware+0.040
ge+0.037
fully+0.036
uing+0.035
Neg logit contributions
e-0.054
 much-0.052
 glad-0.048
 named-0.047
 very-0.047
Token eat
Token ID4483
Feature activation-5.94e-01

Pos logit contributions
rily+0.051
ware+0.046
ge+0.045
uing+0.045
bell+0.043
Neg logit contributions
 much-0.034
e-0.032
 very-0.032
 many-0.031
 named-0.029
Token rad
Token ID2511
Feature activation-2.66e+00

Pos logit contributions
rily+0.015
ware+0.013
ge+0.012
uing+0.012
fully+0.012
Neg logit contributions
e-0.013
 much-0.013
 very-0.012
 many-0.012
 glad-0.012
Tokenishes
Token ID5614
Feature activation+4.09e-01

Pos logit contributions
rily+0.060
ware+0.053
uing+0.052
ge+0.051
bell+0.050
Neg logit contributions
 much-0.062
 very-0.061
 many-0.061
e-0.060
 named-0.056
Token Mitt
Token ID16627
Feature activation-1.68e-01

Pos logit contributions
rily+0.007
ware+0.006
ge+0.006
uing+0.006
bell+0.006
Neg logit contributions
 much-0.003
e-0.003
 very-0.003
 many-0.003
 glad-0.002
Tokenens
Token ID641
Feature activation+7.97e-01

Pos logit contributions
rily+0.005
ware+0.004
ge+0.004
uing+0.004
bell+0.004
Neg logit contributions
 much-0.003
e-0.003
 very-0.003
 many-0.003
 named-0.003
Token loved
Token ID6151
Feature activation-2.21e+00

Pos logit contributions
 much+0.013
e+0.012
 glad+0.012
icks+0.012
ow+0.012
Neg logit contributions
rily-0.019
ware-0.019
ly-0.019
ro-0.018
bell-0.018
Token to
Token ID284
Feature activation-1.97e+00

Pos logit contributions
rily+0.049
ware+0.044
ge+0.042
uing+0.041
fully+0.040
Neg logit contributions
e-0.051
 much-0.050
 very-0.047
 many-0.046
 glad-0.046
Token sleep
Token ID3993
Feature activation-1.12e+00

Pos logit contributions
rily+0.052
ware+0.047
ge+0.046
uing+0.046
bell+0.044
Neg logit contributions
 much-0.038
e-0.036
 very-0.036
 many-0.035
 glad-0.034
Token all
Token ID477
Feature activation+1.88e+00

Pos logit contributions
rily+0.037
ware+0.034
ge+0.033
uing+0.033
bell+0.032
Neg logit contributions
 much-0.015
e-0.014
 very-0.014
 many-0.014
 named-0.013
Token day
Token ID1110
Feature activation+9.23e-01

Pos logit contributions
aw+0.035
icks+0.033
 sorry+0.033
 afraid+0.032
 glad+0.032
Neg logit contributions
fully-0.042
ro-0.040
ly-0.039
rily-0.039
ge-0.037
Token long
Token ID890
Feature activation+2.81e+00

Pos logit contributions
 much+0.014
 very+0.013
 many+0.013
e+0.012
 named+0.011
Neg logit contributions
rily-0.030
ware-0.027
uing-0.027
ge-0.027
bell-0.026
Token and
Token ID290
Feature activation+1.80e+00

Pos logit contributions
 much+0.041
e+0.039
 glad+0.038
 very+0.037
 many+0.037
Neg logit contributions
rily-0.085
ware-0.078
ge-0.076
uing-0.075
fully-0.074
Token never
Token ID1239
Feature activation-1.05e+00

Pos logit contributions
e+0.025
 much+0.025
 many+0.023
 very+0.023
 named+0.022
Neg logit contributions
rily-0.055
ware-0.050
ge-0.050
uing-0.049
bell-0.048
Token wanted
Token ID2227
Feature activation-6.21e-01

Pos logit contributions
rily+0.029
ware+0.026
ge+0.026
uing+0.026
bell+0.025
Neg logit contributions
 much-0.019
e-0.018
 many-0.017
 very-0.017
 glad-0.016
Token<|endoftext|>
Token ID50256
Feature activation+1.87e+00

Pos logit contributions
e+0.024
 afraid+0.023
aks+0.022
 scared+0.022
ow+0.021
Neg logit contributions
rily-0.025
oph-0.024
fully-0.024
ro-0.023
ware-0.023
TokenOnce
Token ID7454
Feature activation-9.39e-02

Pos logit contributions
 much+0.026
e+0.025
 very+0.024
 many+0.024
 glad+0.024
Neg logit contributions
rily-0.057
ware-0.052
ge-0.051
ro-0.049
bell-0.049
Token upon
Token ID2402
Feature activation+2.98e+00

Pos logit contributions
rily+0.002
ge+0.002
ware+0.002
uing+0.002
bell+0.001
Neg logit contributions
e-0.002
 much-0.002
 very-0.002
 many-0.002
 named-0.002
Token a
Token ID257
Feature activation+2.48e+00

Pos logit contributions
e+0.016
 much+0.014
 glad+0.012
 named+0.012
 afraid+0.012
Neg logit contributions
rily-0.113
ware-0.104
ge-0.104
fully-0.102
bell-0.100
Token time
Token ID640
Feature activation+1.95e+00

Pos logit contributions
e+0.040
 much+0.039
 many+0.039
 very+0.038
 afraid+0.037
Neg logit contributions
rily-0.067
ge-0.060
uing-0.060
yt-0.059
ware-0.058
Token,
Token ID11
Feature activation-8.71e-01

Pos logit contributions
e+0.032
 much+0.030
 many+0.028
aw+0.028
 very+0.028
Neg logit contributions
rily-0.053
ge-0.050
uing-0.049
ware-0.048
yt-0.048
Token there
Token ID612
Feature activation-1.52e+00

Pos logit contributions
rily+0.022
ge+0.019
fully+0.019
ware+0.019
uing+0.018
Neg logit contributions
e-0.016
aw-0.016
 much-0.015
ow-0.015
 many-0.014
Token was
Token ID373
Feature activation+3.10e+00

Pos logit contributions
rily+0.036
ware+0.033
ge+0.032
uing+0.031
fully+0.030
Neg logit contributions
 much-0.029
e-0.028
 afraid-0.028
 glad-0.028
 many-0.028
Token a
Token ID257
Feature activation-3.29e-01

Pos logit contributions
ow+0.031
 afraid+0.030
e+0.029
 ost+0.028
icks+0.028
Neg logit contributions
rily-0.099
fully-0.088
ware-0.087
pron-0.084
bell-0.081
Token little
Token ID1310
Feature activation+9.55e-01

Pos logit contributions
rily+0.007
ware+0.007
pron+0.006
bell+0.006
yt+0.006
Neg logit contributions
e-0.007
 much-0.007
ow-0.007
 many-0.007
icks-0.006
Token boy
Token ID2933
Feature activation+2.94e+00

Pos logit contributions
e+0.025
 much+0.024
ow+0.023
 many+0.023
icks+0.022
Neg logit contributions
rily-0.018
ware-0.016
ly-0.016
yt-0.015
ory-0.015
Token back
Token ID736
Feature activation-2.11e-01

Pos logit contributions
rily+0.081
ware+0.072
ge+0.071
uing+0.070
bell+0.067
Neg logit contributions
 much-0.062
e-0.061
 very-0.059
 many-0.059
 named-0.056
Token to
Token ID284
Feature activation-1.09e+00

Pos logit contributions
rily+0.005
ge+0.005
ware+0.004
uing+0.004
bell+0.004
Neg logit contributions
 much-0.005
e-0.005
 very-0.004
 many-0.004
 glad-0.004
Token her
Token ID607
Feature activation-1.50e+00

Pos logit contributions
rily+0.034
ge+0.031
ware+0.031
fully+0.030
uing+0.030
Neg logit contributions
 much-0.015
e-0.014
aw-0.013
 very-0.013
 afraid-0.012
Token home
Token ID1363
Feature activation-5.06e-01

Pos logit contributions
rily+0.042
ware+0.038
ge+0.037
uing+0.036
bell+0.036
Neg logit contributions
e-0.027
 much-0.027
 very-0.024
 many-0.024
 glad-0.023
Token.
Token ID13
Feature activation-4.56e-01

Pos logit contributions
rily+0.016
ware+0.015
uing+0.014
ge+0.014
bell+0.014
Neg logit contributions
 much-0.007
e-0.007
 very-0.007
 many-0.007
 named-0.006
Token She
Token ID1375
Feature activation-1.38e+00

Pos logit contributions
rily+0.016
ware+0.014
ge+0.014
uing+0.014
yt+0.013
Neg logit contributions
 much-0.006
 very-0.006
 many-0.006
e-0.005
 And-0.005
Token asked
Token ID1965
Feature activation-2.39e+00

Pos logit contributions
rily+0.038
ware+0.034
ge+0.033
uing+0.033
bell+0.032
Neg logit contributions
 much-0.026
e-0.025
 many-0.024
 very-0.024
 glad-0.023
Token her
Token ID607
Feature activation-1.70e+00

Pos logit contributions
rily+0.076
ware+0.069
uing+0.069
ge+0.068
fully+0.068
Neg logit contributions
e-0.033
 much-0.032
 very-0.029
 many-0.028
 glad-0.027
Token mom
Token ID1995
Feature activation-1.07e+00

Pos logit contributions
rily+0.045
ware+0.041
uing+0.040
bell+0.039
ge+0.039
Neg logit contributions
e-0.034
 much-0.031
 very-0.028
 many-0.028
aw-0.028
Token if
Token ID611
Feature activation+7.99e-01

Pos logit contributions
rily+0.038
ware+0.034
ge+0.033
uing+0.032
ro+0.032
Neg logit contributions
 much-0.014
 very-0.014
 many-0.014
 a-0.012
 named-0.012
Token she
Token ID673
Feature activation-1.41e+00

Pos logit contributions
e+0.016
 much+0.015
 very+0.014
 many+0.014
aks+0.014
Neg logit contributions
rily-0.020
ware-0.018
uing-0.018
bell-0.017
ge-0.017
Token little
Token ID1310
Feature activation+8.60e-01

Pos logit contributions
rily+0.011
ware+0.010
pron+0.009
bell+0.009
uing+0.009
Neg logit contributions
e-0.011
 much-0.011
ow-0.010
 many-0.010
icks-0.010
Token girl
Token ID2576
Feature activation+3.16e+00

Pos logit contributions
e+0.021
 much+0.021
 many+0.020
ow+0.019
icks+0.019
Neg logit contributions
rily-0.017
ware-0.015
ly-0.015
yt-0.015
ory-0.014
Token named
Token ID3706
Feature activation+1.31e+00

Pos logit contributions
 much+0.036
 many+0.034
 glad+0.032
 best+0.031
icks+0.031
Neg logit contributions
bell-0.091
yt-0.088
rily-0.088
ly-0.085
ge-0.085
Token Lily
Token ID20037
Feature activation+2.18e+00

Pos logit contributions
 much+0.022
e+0.022
 very+0.021
 many+0.021
 named+0.019
Neg logit contributions
rily-0.038
ware-0.034
ge-0.034
uing-0.034
bell-0.032
Token.
Token ID13
Feature activation+1.44e-01

Pos logit contributions
 much+0.041
 glad+0.040
 sorry+0.040
 many+0.039
ow+0.039
Neg logit contributions
ge-0.048
rily-0.047
fully-0.046
bell-0.043
yt-0.043
Token She
Token ID1375
Feature activation-1.45e+00

Pos logit contributions
 much+0.002
e+0.002
 very+0.002
 many+0.002
 named+0.002
Neg logit contributions
rily-0.005
ware-0.004
ge-0.004
uing-0.004
bell-0.004
Token loved
Token ID6151
Feature activation-2.06e+00

Pos logit contributions
pron+0.035
ly+0.035
bell+0.035
rily+0.035
ro+0.034
Neg logit contributions
ow-0.023
 much-0.022
icks-0.021
aw-0.021
 cages-0.021
Token to
Token ID284
Feature activation-1.85e+00

Pos logit contributions
rily+0.052
ware+0.047
ge+0.045
uing+0.045
bell+0.043
Neg logit contributions
 much-0.043
e-0.042
 very-0.040
 many-0.040
 named-0.038
Token play
Token ID711
Feature activation-2.06e+00

Pos logit contributions
rily+0.054
ge+0.047
uing+0.047
ware+0.047
fully+0.044
Neg logit contributions
 much-0.034
e-0.034
 very-0.033
 many-0.032
 named-0.031
Token on
Token ID319
Feature activation+3.97e+00

Pos logit contributions
rily+0.069
ge+0.063
ware+0.061
uing+0.061
bell+0.060
Neg logit contributions
e-0.028
 much-0.027
 very-0.027
 many-0.026
 named-0.025
Token the
Token ID262
Feature activation+1.27e+00

Pos logit contributions
e+0.041
 much+0.040
 glad+0.038
 afraid+0.037
 named+0.035
Neg logit contributions
rily-0.134
ware-0.129
ge-0.123
bell-0.119
uing-0.119
Token please
Token ID3387
Feature activation-2.38e+00

Pos logit contributions
rily+0.080
ge+0.073
ware+0.072
uing+0.070
ro+0.065
Neg logit contributions
 much-0.055
 very-0.052
 many-0.050
e-0.049
 named-0.048
Token repeat
Token ID9585
Feature activation-4.87e-01

Pos logit contributions
rily+0.072
ware+0.065
ge+0.064
uing+0.062
bell+0.060
Neg logit contributions
 much-0.042
 very-0.040
e-0.040
 many-0.038
 named-0.037
Token it
Token ID340
Feature activation-2.25e-01

Pos logit contributions
rily+0.015
ware+0.014
ge+0.014
uing+0.014
bell+0.013
Neg logit contributions
 much-0.007
e-0.007
 very-0.007
 many-0.007
 named-0.006
Token!"
Token ID2474
Feature activation-4.88e-01

Pos logit contributions
rily+0.008
ware+0.007
ge+0.007
uing+0.007
yt+0.006
Neg logit contributions
 much-0.003
 very-0.003
 many-0.003
e-0.003
 a-0.003
Token
Token ID198
Feature activation-9.83e-01

Pos logit contributions
rily+0.016
ware+0.014
uing+0.014
ge+0.014
bell+0.014
Neg logit contributions
e-0.006
 much-0.006
 very-0.006
 many-0.006
 afraid-0.006
Token
Token ID198
Feature activation-1.87e+00

Pos logit contributions
rily+0.034
ware+0.031
uing+0.031
ge+0.030
fully+0.030
Neg logit contributions
e-0.010
 afraid-0.010
 much-0.009
 very-0.009
 scared-0.008
TokenFrom
Token ID4863
Feature activation+2.03e+00

Pos logit contributions
rily+0.052
ware+0.047
ge+0.046
uing+0.045
bell+0.044
Neg logit contributions
 much-0.036
e-0.034
 many-0.034
 very-0.033
 named-0.031
Token then
Token ID788
Feature activation-7.53e-01

Pos logit contributions
 much+0.023
e+0.022
 very+0.021
 many+0.021
 named+0.019
Neg logit contributions
rily-0.071
ware-0.065
ge-0.064
uing-0.064
bell-0.062
Token on
Token ID319
Feature activation+3.33e+00

Pos logit contributions
rily+0.021
uing+0.020
fully+0.019
ware+0.019
ge+0.019
Neg logit contributions
 afraid-0.010
 much-0.010
 glad-0.010
 scared-0.009
e-0.009
Token,
Token ID11
Feature activation-2.54e+00

Pos logit contributions
 much+0.043
e+0.041
 very+0.039
 many+0.038
 glad+0.036
Neg logit contributions
rily-0.110
ware-0.101
uing-0.099
ge-0.099
bell-0.096
Token Lily
Token ID20037
Feature activation+1.38e+00

Pos logit contributions
rily+0.082
ware+0.074
ge+0.073
uing+0.072
bell+0.070
Neg logit contributions
 much-0.040
 many-0.037
 very-0.037
e-0.036
 named-0.034
Token how
Token ID703
Feature activation+1.34e+00

Pos logit contributions
rily+0.008
ware+0.007
uing+0.007
ge+0.007
bell+0.007
Neg logit contributions
 much-0.003
e-0.003
 very-0.003
 many-0.003
 glad-0.003
Token to
Token ID284
Feature activation-1.84e+00

Pos logit contributions
e+0.031
 much+0.028
 named+0.028
 very+0.027
 And+0.027
Neg logit contributions
rily-0.030
ware-0.026
ge-0.026
uing-0.025
ly-0.024
Token make
Token ID787
Feature activation+1.04e+00

Pos logit contributions
rily+0.052
ge+0.045
uing+0.045
ware+0.044
fully+0.043
Neg logit contributions
e-0.035
 much-0.035
 very-0.035
 many-0.034
 named-0.033
Token jam
Token ID16853
Feature activation-6.57e-02

Pos logit contributions
 much+0.014
e+0.014
 very+0.013
 many+0.013
 glad+0.013
Neg logit contributions
rily-0.033
ware-0.030
uing-0.029
ge-0.029
bell-0.028
Token.
Token ID13
Feature activation-2.78e-01

Pos logit contributions
rily+0.002
ge+0.002
bell+0.002
arded+0.002
ware+0.002
Neg logit contributions
 much-0.001
 very-0.001
 many-0.001
 a-0.001
e-0.001
Token "
Token ID366
Feature activation-3.61e+00

Pos logit contributions
rily+0.010
ge+0.010
ware+0.009
uing+0.009
yt+0.009
Neg logit contributions
 much-0.003
 many-0.003
 And-0.003
 very-0.003
 a-0.003
TokenThis
Token ID1212
Feature activation-2.13e+00

Pos logit contributions
rily+0.107
ge+0.096
ware+0.095
uing+0.095
yt+0.088
Neg logit contributions
 much-0.064
e-0.063
 many-0.060
 very-0.059
 named-0.055
Token is
Token ID318
Feature activation+2.06e+00

Pos logit contributions
rily+0.066
ware+0.061
uing+0.059
ge+0.059
bell+0.057
Neg logit contributions
 much-0.031
e-0.031
 very-0.029
 many-0.029
 glad-0.027
Token so
Token ID523
Feature activation+3.37e+00

Pos logit contributions
e+0.028
 much+0.025
 many+0.023
 afraid+0.023
 glad+0.023
Neg logit contributions
rily-0.066
ware-0.060
uing-0.059
ge-0.058
bell-0.057
Token much
Token ID881
Feature activation+3.15e+00

Pos logit contributions
e+0.047
ow+0.039
 named+0.038
 afraid+0.037
 much+0.037
Neg logit contributions
rily-0.107
ware-0.097
uing-0.096
bell-0.092
ge-0.091
Token fun
Token ID1257
Feature activation-7.19e-01

Pos logit contributions
 much+0.040
 very+0.035
 many+0.034
 a+0.029
 scared+0.026
Neg logit contributions
rily-0.118
ge-0.107
ware-0.106
uing-0.104
bell-0.102
TokenOne
Token ID3198
Feature activation-3.05e-01

Pos logit contributions
rily+0.056
ware+0.050
ge+0.048
uing+0.047
bell+0.047
Neg logit contributions
 much-0.037
 many-0.035
 very-0.035
e-0.034
 named-0.032
Token day
Token ID1110
Feature activation-6.94e-01

Pos logit contributions
bell+0.007
ly+0.007
ro+0.007
pron+0.007
fully+0.007
Neg logit contributions
icks-0.006
e-0.005
aw-0.005
aks-0.005
cks-0.005
Token,
Token ID11
Feature activation-1.44e+00

Pos logit contributions
rily+0.021
ware+0.019
uing+0.019
ge+0.019
fully+0.018
Neg logit contributions
e-0.010
 much-0.010
 very-0.009
 many-0.009
 glad-0.009
Token they
Token ID484
Feature activation+3.29e-02

Pos logit contributions
rily+0.041
ware+0.037
ge+0.036
uing+0.036
bell+0.035
Neg logit contributions
 much-0.025
e-0.025
 very-0.024
 many-0.024
 named-0.022
Token decided
Token ID3066
Feature activation-2.04e+00

Pos logit contributions
 much+0.001
e+0.001
 very+0.001
 many+0.001
 named+0.001
Neg logit contributions
rily-0.001
ware-0.001
ge-0.001
uing-0.001
bell-0.001
Token to
Token ID284
Feature activation-2.25e+00

Pos logit contributions
rily+0.063
ware+0.057
ge+0.054
uing+0.054
ro+0.052
Neg logit contributions
 much-0.037
 very-0.036
 many-0.036
 a-0.034
 named-0.031
Token skip
Token ID14267
Feature activation-2.00e+00

Pos logit contributions
rily+0.068
ware+0.061
ge+0.060
uing+0.058
fully+0.055
Neg logit contributions
 much-0.039
e-0.038
 very-0.037
 many-0.036
 named-0.035
Token over
Token ID625
Feature activation+3.80e+00

Pos logit contributions
rily+0.065
ware+0.059
ge+0.058
uing+0.058
bell+0.056
Neg logit contributions
 much-0.027
e-0.026
 very-0.025
 many-0.025
 named-0.023
Token to
Token ID284
Feature activation-3.08e-01

Pos logit contributions
 much+0.035
 very+0.032
 many+0.032
e+0.031
 a+0.026
Neg logit contributions
rily-0.145
ware-0.132
uing-0.131
ge-0.130
bell-0.126
Token the
Token ID262
Feature activation+1.59e+00

Pos logit contributions
rily+0.011
ware+0.010
uing+0.010
ge+0.010
bell+0.009
Neg logit contributions
 much-0.003
e-0.003
 very-0.003
 many-0.003
 afraid-0.003
Token kitchen
Token ID9592
Feature activation-2.02e-01

Pos logit contributions
e+0.032
 much+0.030
 glad+0.027
 named+0.027
ow+0.027
Neg logit contributions
rily-0.039
ware-0.037
bell-0.036
ge-0.035
uing-0.035
Token when
Token ID618
Feature activation+3.48e+00

Pos logit contributions
 very+0.025
 much+0.024
e+0.024
 many+0.024
 named+0.022
Neg logit contributions
rily-0.064
ware-0.057
uing-0.056
ge-0.055
bell-0.054
Token she
Token ID673
Feature activation+6.66e-01

Pos logit contributions
e+0.067
aks+0.060
 And+0.060
ow+0.060
icks+0.060
Neg logit contributions
rily-0.086
ware-0.082
bell-0.077
ge-0.077
fully-0.074
Token accidentally
Token ID14716
Feature activation-2.92e-01

Pos logit contributions
 much+0.015
e+0.014
 many+0.014
 very+0.014
 glad+0.014
Neg logit contributions
rily-0.015
ware-0.014
ge-0.013
uing-0.013
bell-0.013
Token knocked
Token ID13642
Feature activation-5.18e-01

Pos logit contributions
rily+0.007
ware+0.007
ge+0.006
uing+0.006
bell+0.006
Neg logit contributions
 much-0.006
e-0.006
 many-0.006
 very-0.006
 glad-0.006
Token over
Token ID625
Feature activation+5.33e+00

Pos logit contributions
rily+0.017
ware+0.015
uing+0.015
ge+0.015
yt+0.014
Neg logit contributions
 much-0.007
 very-0.007
e-0.007
 many-0.007
 a-0.006
Token a
Token ID257
Feature activation-2.63e+00

Pos logit contributions
 much+0.026
 many+0.022
 very+0.021
e+0.021
 a+0.015
Neg logit contributions
rily-0.227
uing-0.210
ware-0.209
ge-0.208
bell-0.200
Token chair
Token ID5118
Feature activation+1.92e+00

Pos logit contributions
rily+0.082
uing+0.074
ge+0.074
ware+0.072
fully+0.070
Neg logit contributions
 very-0.042
 much-0.042
 many-0.039
e-0.037
 named-0.036
Token.
Token ID13
Feature activation-2.59e-01

Pos logit contributions
 much+0.028
 very+0.027
e+0.027
 many+0.025
 named+0.024
Neg logit contributions
rily-0.062
ware-0.056
uing-0.056
ge-0.055
yt-0.051
Token The
Token ID383
Feature activation-5.03e-01

Pos logit contributions
rily+0.009
ge+0.008
ware+0.008
uing+0.008
yt+0.008
Neg logit contributions
 much-0.003
 very-0.003
 many-0.003
e-0.003
 named-0.003
Token chair
Token ID5118
Feature activation+6.07e-01

Pos logit contributions
rily+0.012
ware+0.010
ge+0.010
uing+0.010
bell+0.010
Neg logit contributions
 much-0.011
e-0.011
 very-0.011
 many-0.011
 named-0.011
Token was
Token ID373
Feature activation+2.12e+00

Pos logit contributions
 much+0.010
e+0.009
 very+0.009
 many+0.009
 named+0.009
Neg logit contributions
rily-0.018
ware-0.016
ge-0.016
uing-0.016
bell-0.015
Token It
Token ID632
Feature activation-1.30e+00

Pos logit contributions
 much+0.021
 many+0.020
 very+0.019
 And+0.018
 a+0.017
Neg logit contributions
rily-0.072
ge-0.071
uing-0.068
ware-0.068
yt-0.065
Token craw
Token ID27784
Feature activation-2.10e+00

Pos logit contributions
rily+0.036
ware+0.034
bell+0.031
ge+0.031
uing+0.031
Neg logit contributions
 much-0.023
e-0.022
 many-0.020
 glad-0.020
 very-0.020
Tokenls
Token ID7278
Feature activation+4.77e-01

Pos logit contributions
rily+0.053
ware+0.049
ge+0.049
uing+0.049
bell+0.048
Neg logit contributions
 much-0.042
 many-0.041
 very-0.040
aw-0.038
e-0.038
Token on
Token ID319
Feature activation+2.51e+00

Pos logit contributions
 much+0.007
e+0.007
 very+0.007
 many+0.007
 a+0.006
Neg logit contributions
rily-0.015
ware-0.014
ge-0.013
uing-0.013
bell-0.013
Token Lily
Token ID20037
Feature activation+2.00e+00

Pos logit contributions
 much+0.023
 very+0.022
 many+0.021
e+0.021
 a+0.019
Neg logit contributions
rily-0.094
ge-0.088
ware-0.087
uing-0.087
bell-0.084
Token's
Token ID338
Feature activation-1.93e+00

Pos logit contributions
 much+0.035
e+0.033
 many+0.033
 very+0.032
 named+0.030
Neg logit contributions
rily-0.057
ware-0.051
ge-0.051
uing-0.050
bell-0.048
Token tower
Token ID10580
Feature activation+9.41e-01

Pos logit contributions
rily+0.054
ge+0.049
ware+0.048
uing+0.048
bell+0.046
Neg logit contributions
 much-0.035
 very-0.035
 many-0.034
e-0.034
 a-0.031
Token.
Token ID13
Feature activation+2.00e+00

Pos logit contributions
 much+0.012
e+0.012
 very+0.012
 many+0.011
 named+0.010
Neg logit contributions
rily-0.032
ware-0.029
uing-0.028
ge-0.028
bell-0.027
Token It
Token ID632
Feature activation-1.19e+00

Pos logit contributions
 much+0.021
 many+0.021
 And+0.019
 very+0.018
 a+0.018
Neg logit contributions
ge-0.072
rily-0.072
uing-0.069
ware-0.067
arded-0.067
Token makes
Token ID1838
Feature activation+1.80e+00

Pos logit contributions
rily+0.033
ware+0.031
bell+0.029
ge+0.028
ro+0.028
Neg logit contributions
 much-0.020
e-0.020
 glad-0.019
 many-0.019
 named-0.018
Token the
Token ID262
Feature activation+1.21e+00

Pos logit contributions
 much+0.014
e+0.014
 very+0.013
 many+0.012
 named+0.012
Neg logit contributions
rily-0.067
ware-0.061
uing-0.060
ge-0.060
bell-0.059
Token Sam
Token ID3409
Feature activation+4.69e-01

Pos logit contributions
rily+0.082
ware+0.073
ge+0.072
uing+0.072
bell+0.069
Neg logit contributions
 much-0.048
e-0.046
 very-0.045
 many-0.044
 glad-0.042
Token and
Token ID290
Feature activation-3.18e+00

Pos logit contributions
 very+0.007
e+0.007
 much+0.007
 named+0.007
 many+0.007
Neg logit contributions
rily-0.015
ware-0.013
uing-0.013
ge-0.013
pron-0.012
Token Emma
Token ID18966
Feature activation-1.11e-01

Pos logit contributions
rily+0.088
ware+0.081
uing+0.080
ge+0.078
bell+0.074
Neg logit contributions
 much-0.060
 very-0.059
 many-0.058
e-0.057
 a-0.056
Token used
Token ID973
Feature activation-1.28e+00

Pos logit contributions
rily+0.003
ware+0.003
ge+0.003
uing+0.003
bell+0.003
Neg logit contributions
 much-0.002
e-0.002
 very-0.002
 many-0.002
 glad-0.002
Token their
Token ID511
Feature activation-2.45e+00

Pos logit contributions
rily+0.046
ware+0.042
ge+0.041
uing+0.041
bell+0.040
Neg logit contributions
 much-0.013
e-0.013
 very-0.012
 many-0.012
 named-0.011
Token imag
Token ID3590
Feature activation-6.47e+00

Pos logit contributions
rily+0.068
ware+0.061
ge+0.060
uing+0.059
bell+0.058
Neg logit contributions
 much-0.045
e-0.044
 very-0.042
 many-0.042
 named-0.040
Tokeninations
Token ID7352
Feature activation-1.54e+00

Pos logit contributions
rily+0.122
ware+0.100
ge+0.094
uing+0.091
ro+0.090
Neg logit contributions
 much-0.184
e-0.179
 very-0.177
 many-0.174
 named-0.169
Token to
Token ID284
Feature activation-2.45e+00

Pos logit contributions
rily+0.045
ware+0.040
uing+0.039
ge+0.039
bell+0.037
Neg logit contributions
 much-0.027
 very-0.026
 many-0.026
e-0.025
 a-0.023
Token explore
Token ID7301
Feature activation-1.67e-01

Pos logit contributions
rily+0.071
ware+0.063
uing+0.060
ge+0.060
bell+0.057
Neg logit contributions
 much-0.045
e-0.045
 very-0.045
 many-0.044
 named-0.041
Token the
Token ID262
Feature activation+1.86e+00

Pos logit contributions
rily+0.006
ware+0.006
ge+0.006
uing+0.005
bell+0.005
Neg logit contributions
 much-0.002
e-0.002
 very-0.001
 many-0.001
 glad-0.001
Token distant
Token ID12899
Feature activation-2.23e+00

Pos logit contributions
e+0.031
 much+0.030
 named+0.027
 many+0.027
 very+0.027
Neg logit contributions
rily-0.053
ware-0.049
ge-0.048
bell-0.048
uing-0.047
TokenLet
Token ID5756
Feature activation-1.88e+00

Pos logit contributions
rily+0.106
ware+0.096
uing+0.095
ge+0.095
yt+0.087
Neg logit contributions
e-0.073
 much-0.069
 very-0.067
 many-0.064
 named-0.063
Token's
Token ID338
Feature activation-2.12e+00

Pos logit contributions
rily+0.069
ware+0.063
uing+0.063
ge+0.063
bell+0.061
Neg logit contributions
 much-0.018
e-0.017
 many-0.016
 very-0.016
 named-0.013
Token pretend
Token ID16614
Feature activation-2.85e+00

Pos logit contributions
rily+0.061
ware+0.055
ge+0.054
uing+0.053
fully+0.051
Neg logit contributions
e-0.039
 much-0.039
 very-0.038
 many-0.036
 named-0.036
Token we
Token ID356
Feature activation-1.17e+00

Pos logit contributions
rily+0.071
ware+0.063
uing+0.061
ge+0.060
bell+0.060
Neg logit contributions
e-0.058
 much-0.058
 many-0.055
 very-0.054
 glad-0.053
Token are
Token ID389
Feature activation+1.67e+00

Pos logit contributions
rily+0.036
ware+0.032
ge+0.032
uing+0.032
bell+0.030
Neg logit contributions
 much-0.018
e-0.018
 very-0.017
 many-0.017
 named-0.016
Token mer
Token ID4017
Feature activation-7.28e+00

Pos logit contributions
 much+0.022
 very+0.021
e+0.021
 many+0.020
 named+0.018
Neg logit contributions
rily-0.056
ware-0.052
ge-0.051
uing-0.051
bell-0.049
Tokenmaid
Token ID23151
Feature activation-3.51e-01

Pos logit contributions
ware+0.103
pron+0.101
side+0.101
ge+0.100
 occasion+0.098
Neg logit contributions
ow-0.197
e-0.194
ap-0.183
 very-0.180
 at-0.167
Tokens
Token ID82
Feature activation+1.37e+00

Pos logit contributions
rily+0.009
ware+0.008
ge+0.008
uing+0.008
ro+0.007
Neg logit contributions
e-0.007
 much-0.007
 very-0.007
 many-0.007
 named-0.007
Token and
Token ID290
Feature activation+1.64e-02

Pos logit contributions
 very+0.024
 much+0.024
 many+0.023
e+0.023
 a+0.022
Neg logit contributions
rily-0.040
ware-0.037
ge-0.036
uing-0.036
bell-0.034
Token swim
Token ID9422
Feature activation-1.80e+00

Pos logit contributions
 much+0.000
e+0.000
 very+0.000
 many+0.000
 named+0.000
Neg logit contributions
rily-0.000
ware-0.000
ge-0.000
uing-0.000
bell-0.000
Token in
Token ID287
Feature activation+3.71e+00

Pos logit contributions
rily+0.060
ware+0.056
ge+0.056
uing+0.055
bell+0.053
Neg logit contributions
 very-0.026
 much-0.024
 many-0.024
e-0.023
 a-0.022
Token park
Token ID3952
Feature activation+6.85e-01

Pos logit contributions
e+0.067
 much+0.063
icks+0.063
ow+0.059
 glad+0.057
Neg logit contributions
rily-0.089
bell-0.082
ware-0.081
ro-0.078
ge-0.077
Token.
Token ID13
Feature activation-2.86e-02

Pos logit contributions
e+0.011
 much+0.011
 glad+0.010
 afraid+0.010
 many+0.010
Neg logit contributions
rily-0.019
ge-0.017
ware-0.017
uing-0.017
bell-0.017
Token Jack
Token ID3619
Feature activation-5.39e-02

Pos logit contributions
rily+0.001
ware+0.001
ge+0.001
uing+0.001
bell+0.001
Neg logit contributions
 much-0.000
e-0.000
 very-0.000
 many-0.000
 named-0.000
Token had
Token ID550
Feature activation+8.02e-01

Pos logit contributions
rily+0.002
ware+0.001
ge+0.001
uing+0.001
bell+0.001
Neg logit contributions
 much-0.001
e-0.001
 many-0.001
 very-0.001
 glad-0.001
Token a
Token ID257
Feature activation-1.90e+00

Pos logit contributions
e+0.005
 much+0.004
 glad+0.003
 And+0.003
 very+0.003
Neg logit contributions
rily-0.030
ware-0.028
ge-0.028
bell-0.028
ro-0.027
Token brand
Token ID4508
Feature activation-7.11e+00

Pos logit contributions
rily+0.054
ware+0.049
ge+0.048
uing+0.048
bell+0.046
Neg logit contributions
 much-0.032
e-0.032
 many-0.030
 very-0.030
 glad-0.028
Token new
Token ID649
Feature activation-3.49e+00

Pos logit contributions
rily+0.225
ge+0.207
uing+0.197
ware+0.197
ro+0.189
Neg logit contributions
 much-0.110
 very-0.108
 many-0.103
 named-0.090
 glad-0.089
Token toy
Token ID13373
Feature activation-1.04e+00

Pos logit contributions
rily+0.095
ge+0.083
uing+0.083
ware+0.082
bell+0.077
Neg logit contributions
 much-0.068
 very-0.068
 many-0.065
e-0.064
 named-0.063
Token with
Token ID351
Feature activation+3.10e+00

Pos logit contributions
rily+0.032
ware+0.028
ge+0.028
uing+0.028
bell+0.027
Neg logit contributions
 much-0.017
e-0.016
 very-0.016
 many-0.015
 named-0.014
Token him
Token ID683
Feature activation-5.74e-01

Pos logit contributions
 much+0.030
e+0.029
 very+0.027
 many+0.026
 named+0.023
Neg logit contributions
rily-0.113
ware-0.104
ge-0.102
uing-0.101
bell-0.099
Token.
Token ID13
Feature activation-6.73e-01

Pos logit contributions
rily+0.018
ware+0.017
ge+0.017
uing+0.016
bell+0.016
Neg logit contributions
 much-0.008
e-0.008
 very-0.007
 many-0.007
 named-0.007
Token opened
Token ID4721
Feature activation-6.91e-01

Pos logit contributions
 much+0.025
e+0.024
 many+0.024
 very+0.024
 glad+0.022
Neg logit contributions
rily-0.045
ware-0.041
ge-0.040
uing-0.040
bell-0.039
Token her
Token ID607
Feature activation-1.78e+00

Pos logit contributions
rily+0.025
ware+0.022
ge+0.022
uing+0.022
bell+0.021
Neg logit contributions
 much-0.007
e-0.007
 very-0.006
 many-0.006
 named-0.006
Token eyes
Token ID2951
Feature activation+1.46e-01

Pos logit contributions
rily+0.051
ware+0.045
ge+0.045
uing+0.045
fully+0.043
Neg logit contributions
 much-0.032
 very-0.031
e-0.031
 many-0.030
 glad-0.028
Token and
Token ID290
Feature activation+4.78e-02

Pos logit contributions
 very+0.002
 much+0.002
 a+0.002
 many+0.002
 scared+0.001
Neg logit contributions
rily-0.005
ware-0.005
uing-0.005
ge-0.005
 occasions-0.005
Token her
Token ID607
Feature activation-2.85e+00

Pos logit contributions
 much+0.001
e+0.001
 very+0.001
 many+0.001
 glad+0.001
Neg logit contributions
rily-0.001
ware-0.001
ge-0.001
uing-0.001
fully-0.001
Token t
Token ID256
Feature activation-6.56e+00

Pos logit contributions
rily+0.070
uing+0.062
ge+0.062
ware+0.061
fully+0.057
Neg logit contributions
 very-0.063
 much-0.062
e-0.060
 many-0.060
 glad-0.057
Tokeneddy
Token ID21874
Feature activation-2.68e+00

Pos logit contributions
rily+0.114
ware+0.103
uing+0.098
ge+0.091
ro+0.087
Neg logit contributions
e-0.183
 much-0.181
 very-0.177
 many-0.173
ow-0.170
Token bear
Token ID6842
Feature activation-8.30e-01

Pos logit contributions
rily+0.065
ware+0.058
ge+0.056
uing+0.055
fully+0.054
Neg logit contributions
 much-0.059
 many-0.054
e-0.054
 very-0.052
 glad-0.052
Token was
Token ID373
Feature activation+2.68e+00

Pos logit contributions
rily+0.024
ware+0.022
ge+0.022
uing+0.021
bell+0.021
Neg logit contributions
 much-0.014
e-0.013
 very-0.013
 many-0.013
 glad-0.012
Token still
Token ID991
Feature activation+1.40e+00

Pos logit contributions
e+0.036
 much+0.034
 many+0.033
 And+0.032
 named+0.031
Neg logit contributions
rily-0.084
ge-0.074
uing-0.073
bell-0.073
ware-0.072
Token in
Token ID287
Feature activation+3.77e+00

Pos logit contributions
e+0.020
 much+0.020
 many+0.019
 And+0.018
 named+0.018
Neg logit contributions
rily-0.043
ge-0.038
ware-0.037
uing-0.037
yt-0.037
Token a
Token ID257
Feature activation-2.22e+00

Pos logit contributions
rily+0.056
ware+0.052
ge+0.050
uing+0.049
fully+0.049
Neg logit contributions
 much-0.015
e-0.014
 glad-0.013
 very-0.013
 afraid-0.012
Token few
Token ID1178
Feature activation+3.08e+00

Pos logit contributions
rily+0.061
ware+0.054
uing+0.053
ge+0.053
bell+0.051
Neg logit contributions
 much-0.043
 very-0.041
 many-0.040
e-0.039
 named-0.037
Token shells
Token ID19679
Feature activation+2.15e-01

Pos logit contributions
 much+0.059
e+0.057
 very+0.056
 many+0.056
 named+0.052
Neg logit contributions
rily-0.084
ware-0.074
ge-0.073
uing-0.073
bell-0.070
Token and
Token ID290
Feature activation-8.39e-01

Pos logit contributions
 very+0.003
 much+0.003
 many+0.003
e+0.003
 a+0.003
Neg logit contributions
rily-0.007
ware-0.006
uing-0.006
ge-0.006
bell-0.006
Token some
Token ID617
Feature activation+4.46e-01

Pos logit contributions
rily+0.025
uing+0.022
ge+0.022
ware+0.021
bell+0.021
Neg logit contributions
 many-0.015
 much-0.015
 very-0.014
 a-0.014
e-0.014
Token seaw
Token ID32085
Feature activation-6.54e+00

Pos logit contributions
 much+0.009
 very+0.009
 many+0.009
e+0.008
 a+0.008
Neg logit contributions
rily-0.012
ge-0.011
uing-0.011
ware-0.010
bell-0.010
Tokeneed
Token ID2308
Feature activation-7.44e-01

Pos logit contributions
rily+0.121
ge+0.106
bell+0.106
fully+0.095
ware+0.092
Neg logit contributions
e-0.175
aw-0.174
 much-0.172
 many-0.168
aks-0.168
Token and
Token ID290
Feature activation-7.67e-01

Pos logit contributions
rily+0.024
uing+0.022
pron+0.022
bell+0.022
ge+0.022
Neg logit contributions
 very-0.009
 many-0.009
 much-0.009
 a-0.008
e-0.008
Token built
Token ID3170
Feature activation-1.24e+00

Pos logit contributions
rily+0.022
uing+0.020
ge+0.020
ware+0.019
bell+0.019
Neg logit contributions
 much-0.014
 many-0.014
 very-0.013
e-0.013
 a-0.013
Token a
Token ID257
Feature activation-2.04e+00

Pos logit contributions
rily+0.047
ware+0.044
ge+0.043
uing+0.043
bell+0.042
Neg logit contributions
 much-0.010
e-0.009
 very-0.009
 many-0.009
 named-0.007
Token castle
Token ID16669
Feature activation-1.11e+00

Pos logit contributions
rily+0.061
ge+0.054
ware+0.054
uing+0.053
fully+0.052
Neg logit contributions
 much-0.036
 very-0.035
 many-0.033
e-0.032
 named-0.030