TOP ACTIVATIONS
MAX = 0.150

you can ride your sc ooter again ." "
do it . My sc ooter is not good for hills
girl with a pink sc ooter . She has long hair
want to see my sc ooter ?" The old
letting me ride your sc ooter . It was the best
his fun night of sc ooter riding . <|endoftext|> Jim was
Ben had a red sc ooter and Lily had a white
kept riding on his sc ooter , but this time he
didn 't have a sc ooter to ride . Lily wanted
She gave him his sc ooter and hugged him . She
the neighbourhood with his sc ooter . He felt excited to
too fast on his sc ooter . Tim my learned his
to play with their sc ooters in the park . Ben
and got on their sc ooters . They looked at each
As Tim rode the sc ooter , he went very fast
loved to ride her sc ooter every day . One day
pretended to throw his sc ooter , and the dog ran
ostr ich pen . Ost ric hes are big birds with
zoom around on his sc ooter . He was so fast
. They were called ost ric hes . Lily and Ben

MIN ACTIVATIONS
MIN = -0.152

creature was a little rac oon . He was very hungry
struggled out of its coc oon . It cl ung to
created the most beautiful coc oon . Finally ,
wall . Far ley watched as his mom
and said , " S hel ter is when you have
and bumped against the books hel ves . He smiled as
one day , the coc oon broke open and a beautiful
put them near the rac oon . He ate them so
cater pillar made a coc oon and stayed inside for a
He stayed inside the coc oon for days , until finally
Then I made a coc oon around myself and slept for
was a tiny butterfly coc oon ! He felt so excited
starting to spin a coc oon . " What are you
I 'm making a coc oon so I can turn into
tree and found the coc oon . He went inside and
her not to investigate books hel ves alone , but she
spreading out of the coc oon . He was no longer
there was an available coc oon in the nearby tree .
he spun a special coc oon and went to sleep .
to go into a coc oon and then he would become

INTERVAL 0.075 - 0.150
CONTAINS 0.112%

good friends . One day Paul wanted to paint
to be kind and help others , even if they are
he knew he was the fastest . But when
mouth . " Look , our snow man is
made some small balls of clay and stuck them on her

INTERVAL -0.001 - 0.075
CONTAINS 49.896%

eh ive . Bob o said , " Look , L
said , " Thank you for helping me help others ."
smiled again . " No , it 's not mine .
was still a bit angry but he said " It 's
how it would taste . He was so excited to try

INTERVAL -0.076 - -0.001
CONTAINS 49.898%

She then wrapped them around and around until the yarn was
in the tree ?" " Because it is my
. The fancy elephant felt good and went home . <|endoftext|>
new doll and coloring book . She couldn 't believe it
remember the day she sang with the truck driver . <|endoftext|>

INTERVAL -0.152 - -0.076
CONTAINS 0.094%

a calm , soothing voice filled the air . The bird
by the bath tub and filled it with warm water .
he was done examining the chair , he thought it was
girl who saw the seal struggling with the trash . She
a bag with him , filled with lots of things .
Token you
Token ID345
Feature activation+2.07e-02

Pos logit contributions
ags+0.005
ained+0.005
akes+0.005
agons+0.004
ined+0.004
Neg logit contributions
 Pen-0.002
 grand-0.002
�-0.002
 unw-0.002
 quickly-0.002
Token can
Token ID460
Feature activation-1.04e-02

Pos logit contributions
ained+0.035
ags+0.035
agons+0.033
ined+0.033
ahs+0.033
Neg logit contributions
 Pen-0.024
 unw-0.023
�-0.022
 grand-0.022
 quickly-0.022
Token ride
Token ID6594
Feature activation+1.50e-02

Pos logit contributions
 Pen+0.013
 unw+0.012
 grand+0.012
 unsc+0.011
 quickly+0.011
Neg logit contributions
ained-0.017
ags-0.016
ahs-0.016
akes-0.016
affles-0.016
Token your
Token ID534
Feature activation+4.34e-02

Pos logit contributions
affles+0.027
ained+0.027
 highlight+0.027
akes+0.027
ined+0.026
Neg logit contributions
 Pen-0.016
 grand-0.015
 quickly-0.013
An-0.013
 unw-0.013
Token sc
Token ID629
Feature activation+5.46e-02

Pos logit contributions
ained+0.077
akes+0.072
ined+0.071
affles+0.068
ahs+0.067
Neg logit contributions
 grand-0.053
 Pen-0.052
 unw-0.050
 keeper-0.046
 open-0.044
Tokenooter
Token ID25141
Feature activation+1.50e-01

Pos logit contributions
ined+0.053
ained+0.053
akes+0.051
affles+0.050
 highlight+0.049
Neg logit contributions
 grand-0.094
 unw-0.092
ents-0.091
keeper-0.090
 unsc-0.090
Token again
Token ID757
Feature activation-3.03e-03

Pos logit contributions
affles+0.236
ags+0.231
ained+0.231
 highlight+0.230
ined+0.229
Neg logit contributions
 Pen-0.171
 grand-0.167
 unw-0.161
 keeper-0.158
keeper-0.156
Token."
Token ID526
Feature activation+9.67e-03

Pos logit contributions
 Pen+0.004
 grand+0.003
 unw+0.003
 unsc+0.003
An+0.003
Neg logit contributions
ained-0.005
affles-0.005
akes-0.005
ags-0.005
ahs-0.005
Token
Token ID198
Feature activation+1.35e-03

Pos logit contributions
ained+0.019
akes+0.018
ags+0.018
ined+0.018
ahs+0.018
Neg logit contributions
 Pen-0.009
 unw-0.008
 grand-0.008
 unsc-0.007
An-0.007
Token
Token ID198
Feature activation-3.01e-04

Pos logit contributions
ags+0.002
ained+0.002
affles+0.002
ahs+0.002
akes+0.002
Neg logit contributions
 Pen-0.002
 unw-0.002
keeper-0.001
 unsc-0.001
 keeper-0.001
Token"
Token ID1
Feature activation-2.77e-02

Pos logit contributions
 Pen+0.000
 unw+0.000
 grand+0.000
keeper+0.000
 keeper+0.000
Neg logit contributions
affles-0.000
agons-0.000
ags-0.000
akes-0.000
ahs-0.000
Token do
Token ID466
Feature activation-4.92e-02

Pos logit contributions
akes+0.014
ained+0.013
ined+0.013
affles+0.013
ags+0.013
Neg logit contributions
 Pen-0.008
 quickly-0.007
 grand-0.007
 unw-0.007
 open-0.007
Token it
Token ID340
Feature activation+1.35e-02

Pos logit contributions
 Pen+0.046
 grand+0.045
 quickly+0.041
 unw+0.040
 unsc+0.039
Neg logit contributions
ags-0.093
ained-0.092
akes-0.092
ined-0.090
agged-0.089
Token.
Token ID13
Feature activation+1.73e-02

Pos logit contributions
ags+0.028
agons+0.027
affles+0.027
agged+0.027
 highlight+0.027
Neg logit contributions
 quickly-0.010
 unw-0.009
 Pen-0.009
�-0.008
 open-0.008
Token My
Token ID2011
Feature activation+4.73e-02

Pos logit contributions
ained+0.031
akes+0.031
ags+0.031
ined+0.030
affles+0.030
Neg logit contributions
 Pen-0.017
 unw-0.015
 grand-0.015
An-0.014
 keeper-0.014
Token sc
Token ID629
Feature activation+5.86e-02

Pos logit contributions
ained+0.086
akes+0.084
ined+0.083
ags+0.082
affles+0.080
Neg logit contributions
 Pen-0.050
 grand-0.049
 unw-0.044
 keeper-0.043
keeper-0.043
Tokenooter
Token ID25141
Feature activation+1.49e-01

Pos logit contributions
ained+0.063
akes+0.062
ined+0.059
 highlight+0.059
ags+0.055
Neg logit contributions
keeper-0.096
 grand-0.095
 unw-0.095
 Pen-0.094
 keeper-0.094
Token is
Token ID318
Feature activation-3.31e-02

Pos logit contributions
ags+0.293
affles+0.287
 highlight+0.284
ained+0.283
akes+0.277
Neg logit contributions
 Pen-0.121
 unw-0.119
 grand-0.119
 quickly-0.115
 keeper-0.115
Token not
Token ID407
Feature activation-1.96e-02

Pos logit contributions
 grand+0.030
 unw+0.030
 Pen+0.030
 open+0.028
 unsc+0.028
Neg logit contributions
akes-0.066
affles-0.063
ags-0.062
agons-0.061
orting-0.061
Token good
Token ID922
Feature activation-1.90e-02

Pos logit contributions
 grand+0.020
 Pen+0.019
 unw+0.019
 unsc+0.018
 open+0.018
Neg logit contributions
akes-0.036
affles-0.035
ained-0.034
ahs-0.034
agons-0.034
Token for
Token ID329
Feature activation+3.02e-02

Pos logit contributions
 grand+0.012
 quickly+0.011
 unw+0.010
 open+0.010
 keeper+0.009
Neg logit contributions
ined-0.042
akes-0.041
ags-0.041
 highlight-0.041
affles-0.041
Token hills
Token ID18639
Feature activation+3.88e-03

Pos logit contributions
akes+0.056
ained+0.055
ined+0.052
 highlight+0.051
ags+0.050
Neg logit contributions
 Pen-0.034
 grand-0.033
 unw-0.032
 quickly-0.032
 un-0.030
Token girl
Token ID2576
Feature activation+2.67e-03

Pos logit contributions
 Pen+0.029
 grand+0.027
 unw+0.027
 keeper+0.027
 unsc+0.025
Neg logit contributions
ained-0.061
ags-0.058
akes-0.057
affles-0.057
 highlight-0.056
Token with
Token ID351
Feature activation+1.41e-03

Pos logit contributions
akes+0.005
ags+0.005
ained+0.005
affles+0.005
agons+0.005
Neg logit contributions
 unw-0.003
 Pen-0.003
�-0.003
 unsc-0.003
 open-0.003
Token a
Token ID257
Feature activation+8.20e-03

Pos logit contributions
ained+0.002
ags+0.002
akes+0.002
ahs+0.002
ined+0.002
Neg logit contributions
 Pen-0.001
 unw-0.001
 grand-0.001
 unsc-0.001
 keeper-0.001
Token pink
Token ID11398
Feature activation+3.09e-02

Pos logit contributions
ained+0.017
ags+0.016
akes+0.016
affles+0.016
orting+0.016
Neg logit contributions
 grand-0.007
 Pen-0.007
 unw-0.007
 keeper-0.006
keeper-0.006
Token sc
Token ID629
Feature activation+4.71e-02

Pos logit contributions
ained+0.063
affles+0.057
orting+0.056
 highlight+0.056
akes+0.055
Neg logit contributions
 grand-0.031
 unw-0.030
 open-0.029
 Pen-0.028
 quickly-0.026
Tokenooter
Token ID25141
Feature activation+1.44e-01

Pos logit contributions
ained+0.054
ined+0.051
akes+0.051
 highlight+0.049
affles+0.048
Neg logit contributions
 unw-0.075
 grand-0.075
 Pen-0.074
keeper-0.074
 keeper-0.072
Token.
Token ID13
Feature activation+1.12e-03

Pos logit contributions
ags+0.289
affles+0.288
ained+0.282
 highlight+0.281
akes+0.280
Neg logit contributions
 Pen-0.108
 unw-0.106
 grand-0.101
 keeper-0.100
keeper-0.098
Token She
Token ID1375
Feature activation-1.97e-03

Pos logit contributions
ained+0.002
akes+0.002
ags+0.002
ahs+0.002
affles+0.002
Neg logit contributions
 Pen-0.001
 unw-0.001
 grand-0.001
 unsc-0.001
An-0.001
Token has
Token ID468
Feature activation+1.89e-02

Pos logit contributions
 Pen+0.002
 unw+0.002
 grand+0.002
 unsc+0.002
An+0.002
Neg logit contributions
ained-0.003
akes-0.003
ags-0.003
 highlight-0.003
affles-0.003
Token long
Token ID890
Feature activation-2.22e-03

Pos logit contributions
ained+0.033
akes+0.032
ags+0.032
ahs+0.032
affles+0.031
Neg logit contributions
 Pen-0.019
 unw-0.018
 grand-0.017
 unsc-0.017
 keeper-0.016
Token hair
Token ID4190
Feature activation+5.47e-02

Pos logit contributions
 grand+0.003
 unw+0.003
 Pen+0.003
 open+0.002
 gasped+0.002
Neg logit contributions
 highlight-0.004
akes-0.004
orting-0.004
ained-0.004
ancer-0.003
Token want
Token ID765
Feature activation+1.35e-02

Pos logit contributions
ained+0.047
ined+0.046
akes+0.045
ahs+0.045
agons+0.045
Neg logit contributions
 open-0.024
 unw-0.023
�-0.023
 un-0.023
 quickly-0.021
Token to
Token ID284
Feature activation+1.16e-03

Pos logit contributions
akes+0.037
ined+0.036
ained+0.036
agons+0.035
ging+0.035
Neg logit contributions
 grand-0.004
 Pen-0.003
 unw-0.002
 open-0.002
 keeper-0.002
Token see
Token ID766
Feature activation+2.25e-02

Pos logit contributions
ained+0.002
akes+0.002
ined+0.002
ags+0.002
affles+0.002
Neg logit contributions
 grand-0.001
 unw-0.001
 Pen-0.001
 keeper-0.001
 open-0.001
Token my
Token ID616
Feature activation+2.80e-02

Pos logit contributions
ained+0.047
akes+0.046
agons+0.044
ined+0.044
 highlight+0.043
Neg logit contributions
 grand-0.021
 Pen-0.019
 unw-0.018
 open-0.016
 cautiously-0.015
Token sc
Token ID629
Feature activation+4.99e-02

Pos logit contributions
ained+0.048
akes+0.046
ined+0.046
ags+0.045
affles+0.045
Neg logit contributions
 grand-0.032
 Pen-0.032
 unw-0.030
 keeper-0.030
keeper-0.029
Tokenooter
Token ID25141
Feature activation+1.43e-01

Pos logit contributions
ained+0.045
ined+0.045
akes+0.045
ags+0.041
 highlight+0.041
Neg logit contributions
 grand-0.090
 Pen-0.090
 keeper-0.089
 unw-0.088
keeper-0.088
Token?"
Token ID1701
Feature activation+2.12e-02

Pos logit contributions
akes+0.181
affles+0.180
agons+0.180
ags+0.175
ained+0.174
Neg logit contributions
 grand-0.213
 keeper-0.204
 Pen-0.199
 unw-0.199
 open-0.198
Token
Token ID198
Feature activation-1.88e-03

Pos logit contributions
ained+0.038
ags+0.037
ahs+0.037
akes+0.037
ined+0.036
Neg logit contributions
 Pen-0.020
 unw-0.019
 grand-0.017
 unsc-0.017
An-0.017
Token
Token ID198
Feature activation-2.92e-03

Pos logit contributions
 Pen+0.002
 unw+0.002
 unsc+0.002
 grand+0.002
 cautiously+0.002
Neg logit contributions
ags-0.003
ained-0.003
ined-0.002
ahs-0.002
agons-0.002
TokenThe
Token ID464
Feature activation+1.95e-02

Pos logit contributions
 Pen+0.004
 unw+0.003
abel+0.003
 grand+0.003
 Eva+0.003
Neg logit contributions
w-0.004
agons-0.004
ags-0.004
orting-0.004
ahs-0.004
Token old
Token ID1468
Feature activation-1.37e-02

Pos logit contributions
akes+0.045
ained+0.045
ags+0.044
agged+0.042
ined+0.042
Neg logit contributions
 grand-0.014
 keeper-0.013
 Pen-0.012
keeper-0.011
 security-0.010
Token letting
Token ID9616
Feature activation-8.81e-03

Pos logit contributions
akes+0.028
ained+0.028
ahs+0.027
unes+0.026
 highlight+0.026
Neg logit contributions
 Pen-0.021
 grand-0.020
 unw-0.020
 quickly-0.019
 unsc-0.019
Token me
Token ID502
Feature activation+6.33e-02

Pos logit contributions
 Pen+0.008
 grand+0.007
 unw+0.007
�+0.007
 quickly+0.007
Neg logit contributions
ained-0.017
akes-0.017
ined-0.017
ahs-0.017
affles-0.016
Token ride
Token ID6594
Feature activation+6.60e-03

Pos logit contributions
ained+0.116
ined+0.112
ahs+0.110
akes+0.109
agged+0.106
Neg logit contributions
 open-0.067
 unw-0.067
�-0.066
 un-0.065
 quickly-0.063
Token your
Token ID534
Feature activation+3.73e-02

Pos logit contributions
 highlight+0.011
ained+0.011
affles+0.011
akes+0.011
ined+0.011
Neg logit contributions
 Pen-0.008
 grand-0.007
 unw-0.006
 quickly-0.006
 keeper-0.006
Token sc
Token ID629
Feature activation+5.09e-02

Pos logit contributions
ained+0.065
akes+0.062
ined+0.062
affles+0.060
ags+0.059
Neg logit contributions
 grand-0.044
 Pen-0.043
 unw-0.041
 keeper-0.038
 unsc-0.037
Tokenooter
Token ID25141
Feature activation+1.43e-01

Pos logit contributions
ained+0.045
ined+0.044
akes+0.044
affles+0.042
 highlight+0.041
Neg logit contributions
 grand-0.091
 Pen-0.091
 unw-0.090
keeper-0.088
 keeper-0.088
Token.
Token ID13
Feature activation+1.11e-02

Pos logit contributions
affles+0.235
ained+0.234
ags+0.232
ined+0.230
 highlight+0.229
Neg logit contributions
 Pen-0.157
 grand-0.151
 unw-0.146
 keeper-0.142
keeper-0.139
Token It
Token ID632
Feature activation+2.84e-03

Pos logit contributions
ained+0.017
ags+0.016
ined+0.016
akes+0.016
affles+0.016
Neg logit contributions
 Pen-0.015
 grand-0.013
 unw-0.013
An-0.013
 unsc-0.013
Token was
Token ID373
Feature activation+8.75e-03

Pos logit contributions
ained+0.005
akes+0.005
ahs+0.005
ined+0.005
 highlight+0.005
Neg logit contributions
 Pen-0.003
 unw-0.002
 grand-0.002
 unsc-0.002
An-0.002
Token the
Token ID262
Feature activation+1.16e-02

Pos logit contributions
akes+0.018
affles+0.017
ags+0.017
ahs+0.017
agons+0.017
Neg logit contributions
 Pen-0.007
 grand-0.007
 unw-0.007
 unsc-0.006
 open-0.006
Token best
Token ID1266
Feature activation+4.70e-02

Pos logit contributions
affles+0.016
agged+0.016
akes+0.015
ained+0.015
unes+0.015
Neg logit contributions
 grand-0.018
 keeper-0.017
 open-0.017
 Pen-0.017
 unw-0.016
Token his
Token ID465
Feature activation+3.69e-03

Pos logit contributions
 Pen+0.001
 grand+0.001
 unw+0.001
 keeper+0.001
 quickly+0.001
Neg logit contributions
akes-0.002
affles-0.002
ained-0.002
 highlight-0.002
ahs-0.002
Token fun
Token ID1257
Feature activation+9.01e-03

Pos logit contributions
ained+0.007
akes+0.007
affles+0.006
ags+0.006
ined+0.006
Neg logit contributions
 grand-0.004
 keeper-0.004
 unw-0.004
 Pen-0.004
keeper-0.004
Token night
Token ID1755
Feature activation+1.78e-02

Pos logit contributions
ags+0.015
affles+0.015
ained+0.015
akes+0.015
unes+0.014
Neg logit contributions
 grand-0.010
 Pen-0.010
 quickly-0.010
 unw-0.010
 keeper-0.010
Token of
Token ID286
Feature activation-3.35e-03

Pos logit contributions
ags+0.030
ained+0.030
affles+0.030
akes+0.029
ahs+0.029
Neg logit contributions
 Pen-0.019
 unw-0.018
 grand-0.018
 unsc-0.017
 keeper-0.016
Token sc
Token ID629
Feature activation+4.47e-02

Pos logit contributions
 Pen+0.005
 grand+0.005
 unw+0.005
 quickly+0.004
 keeper+0.004
Neg logit contributions
akes-0.005
ained-0.005
affles-0.005
unes-0.005
ahs-0.005
Tokenooter
Token ID25141
Feature activation+1.43e-01

Pos logit contributions
ained+0.068
akes+0.067
ags+0.067
ahs+0.066
affles+0.065
Neg logit contributions
 Pen-0.055
 unw-0.052
 grand-0.051
 unsc-0.049
 keeper-0.048
Token riding
Token ID10311
Feature activation+3.75e-02

Pos logit contributions
ained+0.234
ags+0.232
akes+0.232
affles+0.229
ined+0.224
Neg logit contributions
 Pen-0.159
 unw-0.149
 grand-0.146
 keeper-0.139
 unsc-0.139
Token.
Token ID13
Feature activation-7.31e-03

Pos logit contributions
affles+0.060
akes+0.058
ained+0.057
 highlight+0.057
ags+0.055
Neg logit contributions
 Pen-0.046
 grand-0.045
 unw-0.043
 keeper-0.043
 quickly-0.042
Token<|endoftext|>
Token ID50256
Feature activation+5.03e-02

Pos logit contributions
 Pen+0.008
 unw+0.007
 unsc+0.007
 keeper+0.007
 grand+0.007
Neg logit contributions
ained-0.012
akes-0.012
ags-0.012
ahs-0.012
affles-0.012
TokenJim
Token ID18050
Feature activation-5.06e-02

Pos logit contributions
ained+0.079
akes+0.077
ags+0.077
ined+0.075
ahs+0.075
Neg logit contributions
 Pen-0.061
 unw-0.057
 grand-0.055
 unsc-0.053
 keeper-0.052
Token was
Token ID373
Feature activation-1.09e-02

Pos logit contributions
 Pen+0.048
 unw+0.044
 grand+0.043
 unsc+0.040
An+0.039
Neg logit contributions
ained-0.094
ags-0.093
akes-0.092
ahs-0.090
affles-0.090
Token Ben
Token ID3932
Feature activation-1.71e-02

Pos logit contributions
ined+0.020
affles+0.019
akes+0.019
ags+0.019
ained+0.019
Neg logit contributions
 Pen-0.009
 grand-0.008
An-0.007
 unw-0.007
Un-0.007
Token had
Token ID550
Feature activation+4.19e-02

Pos logit contributions
 Pen+0.022
 unw+0.020
 grand+0.020
 unsc+0.019
keeper+0.019
Neg logit contributions
ained-0.025
ahs-0.024
akes-0.024
affles-0.023
ined-0.023
Token a
Token ID257
Feature activation+8.70e-03

Pos logit contributions
ained+0.073
ags+0.071
ahs+0.070
akes+0.070
 highlight+0.069
Neg logit contributions
 Pen-0.044
 unw-0.040
 grand-0.038
keeper-0.038
 unsc-0.037
Token red
Token ID2266
Feature activation+4.08e-02

Pos logit contributions
akes+0.016
ained+0.016
unes+0.016
ined+0.016
affles+0.015
Neg logit contributions
 grand-0.010
 Pen-0.008
 keeper-0.008
 unw-0.008
 tight-0.008
Token sc
Token ID629
Feature activation+4.74e-02

Pos logit contributions
affles+0.079
reetings+0.078
 highlight+0.076
leness+0.076
ained+0.075
Neg logit contributions
 grand-0.038
 open-0.035
 bar-0.034
 unw-0.033
 quickly-0.032
Tokenooter
Token ID25141
Feature activation+1.42e-01

Pos logit contributions
ained+0.048
ined+0.047
akes+0.046
 highlight+0.045
affles+0.044
Neg logit contributions
 grand-0.080
 keeper-0.077
 unw-0.077
 Pen-0.077
keeper-0.076
Token and
Token ID290
Feature activation+8.94e-03

Pos logit contributions
ags+0.250
ained+0.250
akes+0.248
affles+0.247
 highlight+0.242
Neg logit contributions
 Pen-0.141
 grand-0.132
 unw-0.131
 keeper-0.126
 unsc-0.123
Token Lily
Token ID20037
Feature activation-3.19e-03

Pos logit contributions
ained+0.015
akes+0.015
ags+0.015
ahs+0.015
affles+0.015
Neg logit contributions
 Pen-0.009
 unw-0.009
 grand-0.009
 unsc-0.008
An-0.008
Token had
Token ID550
Feature activation+3.80e-02

Pos logit contributions
 Pen+0.004
 unw+0.003
 grand+0.003
 unsc+0.003
An+0.003
Neg logit contributions
ained-0.005
ags-0.005
akes-0.005
affles-0.005
ahs-0.005
Token a
Token ID257
Feature activation+4.53e-03

Pos logit contributions
ained+0.069
akes+0.069
ags+0.068
affles+0.067
ined+0.067
Neg logit contributions
 Pen-0.037
 unw-0.034
 grand-0.033
 unsc-0.031
An-0.031
Token white
Token ID2330
Feature activation+1.12e-02

Pos logit contributions
akes+0.009
ained+0.009
affles+0.009
ags+0.009
unes+0.009
Neg logit contributions
 grand-0.005
 tight-0.004
 open-0.004
 unw-0.004
 keeper-0.004
Token kept
Token ID4030
Feature activation+1.11e-03

Pos logit contributions
ained+0.010
ahs+0.010
akes+0.009
ags+0.009
ined+0.009
Neg logit contributions
 Pen-0.006
 unw-0.005
 grand-0.005
An-0.005
 keeper-0.005
Token riding
Token ID10311
Feature activation+4.29e-02

Pos logit contributions
akes+0.002
ained+0.002
unes+0.002
affles+0.002
 highlight+0.002
Neg logit contributions
 Pen-0.001
 grand-0.001
 quickly-0.001
 unw-0.001
 un-0.001
Token on
Token ID319
Feature activation+2.72e-02

Pos logit contributions
akes+0.078
affles+0.078
ained+0.077
 highlight+0.075
ined+0.074
Neg logit contributions
 Pen-0.044
 grand-0.042
 keeper-0.039
 unw-0.038
 quickly-0.038
Token his
Token ID465
Feature activation+1.03e-02

Pos logit contributions
ained+0.052
akes+0.051
ags+0.050
affles+0.050
ined+0.050
Neg logit contributions
 Pen-0.024
 unw-0.022
 grand-0.022
 unsc-0.020
 keeper-0.020
Token sc
Token ID629
Feature activation+4.63e-02

Pos logit contributions
ained+0.016
akes+0.015
ags+0.015
ahs+0.015
ined+0.015
Neg logit contributions
 Pen-0.013
 unw-0.012
 grand-0.012
 unsc-0.012
 keeper-0.012
Tokenooter
Token ID25141
Feature activation+1.42e-01

Pos logit contributions
ained+0.031
akes+0.030
ined+0.029
ags+0.028
affles+0.028
Neg logit contributions
 Pen-0.095
 unw-0.093
 grand-0.092
 unsc-0.090
 keeper-0.090
Token,
Token ID11
Feature activation+1.15e-02

Pos logit contributions
ained+0.217
akes+0.215
ags+0.215
affles+0.214
ined+0.210
Neg logit contributions
 Pen-0.172
 grand-0.160
 unw-0.160
 keeper-0.153
 unsc-0.152
Token but
Token ID475
Feature activation+2.33e-02

Pos logit contributions
affles+0.018
akes+0.017
ags+0.017
agons+0.017
ained+0.017
Neg logit contributions
 Pen-0.016
 grand-0.015
 unw-0.015
 unsc-0.014
 cautiously-0.014
Token this
Token ID428
Feature activation-4.86e-03

Pos logit contributions
ained+0.035
ahs+0.034
akes+0.033
 highlight+0.033
ined+0.032
Neg logit contributions
 Pen-0.030
 unw-0.028
 grand-0.026
An-0.026
 unsc-0.026
Token time
Token ID640
Feature activation+1.98e-02

Pos logit contributions
 Pen+0.008
 grand+0.008
 unw+0.008
 unsc+0.008
 keeper+0.008
Neg logit contributions
ags-0.005
akes-0.005
ained-0.005
affles-0.005
agons-0.005
Token he
Token ID339
Feature activation+2.01e-03

Pos logit contributions
ained+0.032
akes+0.032
ags+0.031
ahs+0.031
affles+0.031
Neg logit contributions
 Pen-0.023
 unw-0.021
 grand-0.021
 unsc-0.020
An-0.020
Token didn
Token ID1422
Feature activation-1.73e-02

Pos logit contributions
ags+0.108
ained+0.106
akes+0.105
agons+0.102
affles+0.101
Neg logit contributions
 Pen-0.059
 grand-0.055
 unw-0.053
 keeper-0.050
 quickly-0.049
Token't
Token ID470
Feature activation-7.44e-05

Pos logit contributions
 Pen+0.026
 unw+0.024
 grand+0.024
 keeper+0.023
 unsc+0.023
Neg logit contributions
ained-0.022
akes-0.021
ags-0.021
ahs-0.021
ined-0.021
Token have
Token ID423
Feature activation+1.65e-02

Pos logit contributions
 open+0.000
 un+0.000
 unt+0.000
 quickly+0.000
 unw+0.000
Neg logit contributions
ined-0.000
akes-0.000
ained-0.000
ates-0.000
affles-0.000
Token a
Token ID257
Feature activation+6.80e-03

Pos logit contributions
akes+0.028
ined+0.027
ained+0.027
agons+0.027
affles+0.027
Neg logit contributions
 Pen-0.019
 unw-0.018
 grand-0.018
 unsc-0.016
An-0.016
Token sc
Token ID629
Feature activation+5.12e-02

Pos logit contributions
ained+0.013
ined+0.013
akes+0.013
ags+0.013
affles+0.012
Neg logit contributions
 grand-0.007
 unw-0.007
 Pen-0.007
 open-0.007
 keeper-0.006
Tokenooter
Token ID25141
Feature activation+1.41e-01

Pos logit contributions
ained+0.051
akes+0.050
ined+0.050
ags+0.047
affles+0.046
Neg logit contributions
 Pen-0.089
 grand-0.089
 unw-0.089
keeper-0.086
 keeper-0.086
Token to
Token ID284
Feature activation-6.74e-03

Pos logit contributions
ags+0.261
affles+0.258
akes+0.254
agons+0.254
ained+0.254
Neg logit contributions
 Pen-0.126
 grand-0.126
 keeper-0.124
 unw-0.122
keeper-0.120
Token ride
Token ID6594
Feature activation+9.70e-04

Pos logit contributions
 unw+0.008
 open+0.008
 un+0.007
 grand+0.007
 quickly+0.007
Neg logit contributions
ained-0.012
akes-0.012
unes-0.012
 highlight-0.012
ahs-0.011
Token.
Token ID13
Feature activation-5.32e-03

Pos logit contributions
affles+0.002
 highlight+0.002
akes+0.002
ained+0.002
agons+0.002
Neg logit contributions
 Pen-0.001
 grand-0.001
 quickly-0.001
 keeper-0.001
 unw-0.001
Token Lily
Token ID20037
Feature activation-5.65e-03

Pos logit contributions
 Pen+0.006
 unw+0.005
 grand+0.005
 unsc+0.005
An+0.005
Neg logit contributions
ained-0.009
akes-0.009
ags-0.009
ahs-0.009
affles-0.009
Token wanted
Token ID2227
Feature activation+9.84e-03

Pos logit contributions
 Pen+0.007
 unw+0.006
An+0.006
 grand+0.006
 keeper+0.006
Neg logit contributions
ained-0.008
ahs-0.008
ined-0.008
akes-0.008
affles-0.008
Token She
Token ID1375
Feature activation+1.39e-03

Pos logit contributions
ained+0.009
ags+0.008
akes+0.008
ahs+0.008
affles+0.008
Neg logit contributions
 Pen-0.005
 unw-0.005
 grand-0.005
 unsc-0.005
 keeper-0.005
Token gave
Token ID2921
Feature activation+3.85e-02

Pos logit contributions
ahs+0.002
ined+0.002
affles+0.002
ained+0.002
 highlight+0.002
Neg logit contributions
 Pen-0.002
 keeper-0.001
An-0.001
Un-0.001
 unw-0.001
Token him
Token ID683
Feature activation+1.09e-02

Pos logit contributions
ained+0.064
akes+0.063
ined+0.061
ags+0.061
ahs+0.060
Neg logit contributions
 Pen-0.045
 unw-0.042
 grand-0.042
 unsc-0.038
 Eva-0.037
Token his
Token ID465
Feature activation+1.40e-02

Pos logit contributions
ags+0.018
ained+0.018
ahs+0.018
affles+0.017
akes+0.017
Neg logit contributions
 Pen-0.012
 unw-0.011
 keeper-0.010
 unsc-0.010
An-0.010
Token sc
Token ID629
Feature activation+4.62e-02

Pos logit contributions
ained+0.028
akes+0.026
ined+0.025
orting+0.025
ags+0.024
Neg logit contributions
 grand-0.016
 unw-0.014
 Pen-0.014
 open-0.013
 unsc-0.013
Tokenooter
Token ID25141
Feature activation+1.41e-01

Pos logit contributions
ained+0.047
akes+0.044
ined+0.043
 highlight+0.040
ags+0.040
Neg logit contributions
 unw-0.082
 grand-0.081
 Pen-0.080
keeper-0.080
 keeper-0.079
Token and
Token ID290
Feature activation+6.10e-03

Pos logit contributions
ained+0.237
akes+0.233
ags+0.232
affles+0.228
ahs+0.228
Neg logit contributions
 Pen-0.154
 unw-0.143
 grand-0.140
 unsc-0.133
 keeper-0.132
Token hugged
Token ID41130
Feature activation+2.53e-02

Pos logit contributions
ained+0.010
akes+0.010
ahs+0.010
ags+0.010
affles+0.010
Neg logit contributions
 Pen-0.007
 unw-0.006
 grand-0.006
An-0.006
 keeper-0.006
Token him
Token ID683
Feature activation+5.60e-03

Pos logit contributions
ained+0.038
akes+0.038
ags+0.037
ahs+0.037
ined+0.036
Neg logit contributions
 Pen-0.032
 unw-0.030
 grand-0.030
 unsc-0.028
An-0.028
Token.
Token ID13
Feature activation+3.52e-03

Pos logit contributions
ained+0.010
ags+0.010
akes+0.010
ahs+0.010
ined+0.010
Neg logit contributions
 Pen-0.005
 unw-0.005
 grand-0.005
 unsc-0.005
An-0.005
Token She
Token ID1375
Feature activation-5.83e-03

Pos logit contributions
ained+0.006
ags+0.006
akes+0.006
ahs+0.006
affles+0.006
Neg logit contributions
 Pen-0.004
 unw-0.003
 unsc-0.003
 keeper-0.003
 grand-0.003
Token the
Token ID262
Feature activation+6.67e-03

Pos logit contributions
ained+0.049
akes+0.049
ags+0.048
affles+0.048
ahs+0.048
Neg logit contributions
 Pen-0.024
 unw-0.021
 grand-0.021
 unsc-0.019
 keeper-0.019
Token neighbourhood
Token ID19495
Feature activation+2.25e-02

Pos logit contributions
ained+0.010
akes+0.010
ags+0.010
affles+0.010
ahs+0.010
Neg logit contributions
 Pen-0.009
 grand-0.008
 unw-0.008
 keeper-0.008
 unsc-0.008
Token with
Token ID351
Feature activation-3.90e-03

Pos logit contributions
ags+0.038
ained+0.038
affles+0.037
akes+0.037
agons+0.036
Neg logit contributions
 Pen-0.025
 unw-0.023
 grand-0.023
 keeper-0.022
 unsc-0.022
Token his
Token ID465
Feature activation+6.58e-03

Pos logit contributions
 Pen+0.004
 unw+0.003
 grand+0.003
 unsc+0.003
An+0.003
Neg logit contributions
ained-0.007
akes-0.007
ags-0.007
ahs-0.007
affles-0.007
Token sc
Token ID629
Feature activation+4.50e-02

Pos logit contributions
ained+0.011
affles+0.011
akes+0.011
ags+0.011
orting+0.011
Neg logit contributions
 grand-0.008
 Pen-0.007
 keeper-0.007
 unw-0.007
keeper-0.007
Tokenooter
Token ID25141
Feature activation+1.41e-01

Pos logit contributions
ained+0.036
akes+0.036
affles+0.035
ined+0.034
ags+0.034
Neg logit contributions
 Pen-0.084
 grand-0.083
 unw-0.083
 keeper-0.081
keeper-0.081
Token.
Token ID13
Feature activation-1.20e-02

Pos logit contributions
affles+0.249
ags+0.243
akes+0.239
ained+0.234
 highlight+0.231
Neg logit contributions
 Pen-0.142
 grand-0.141
 unw-0.138
 quickly-0.135
 keeper-0.134
Token He
Token ID679
Feature activation+7.57e-03

Pos logit contributions
 Pen+0.012
 grand+0.010
 unw+0.010
An+0.010
 Eva+0.009
Neg logit contributions
ined-0.022
ags-0.022
akes-0.022
affles-0.022
ained-0.022
Token felt
Token ID2936
Feature activation+2.02e-03

Pos logit contributions
ags+0.015
akes+0.014
ained+0.014
affles+0.014
unes+0.014
Neg logit contributions
 Pen-0.007
 unw-0.007
 grand-0.007
 quickly-0.007
 unsc-0.007
Token excited
Token ID6568
Feature activation-5.98e-02

Pos logit contributions
akes+0.004
affles+0.004
ags+0.004
agons+0.004
ahs+0.004
Neg logit contributions
 Pen-0.002
 unw-0.002
 grand-0.002
 unsc-0.002
An-0.002
Token to
Token ID284
Feature activation-1.50e-02

Pos logit contributions
 Pen+0.058
 grand+0.056
 unw+0.055
 keeper+0.051
 unsc+0.050
Neg logit contributions
ained-0.108
akes-0.108
ags-0.108
affles-0.106
agons-0.106
Token too
Token ID1165
Feature activation+1.18e-02

Pos logit contributions
mm+0.012
akes+0.012
affles+0.011
vals+0.011
ined+0.011
Neg logit contributions
 quickly-0.005
 open-0.004
 un-0.004
 unt-0.004
 keeper-0.004
Token fast
Token ID3049
Feature activation-3.56e-03

Pos logit contributions
ained+0.024
akes+0.023
ags+0.023
ahs+0.023
 highlight+0.022
Neg logit contributions
 Pen-0.009
 unw-0.008
 grand-0.008
 unsc-0.007
An-0.007
Token on
Token ID319
Feature activation+2.77e-02

Pos logit contributions
 Pen+0.002
 grand+0.002
 quickly+0.002
 unw+0.002
 unsc+0.002
Neg logit contributions
affles-0.007
 highlight-0.007
akes-0.007
ags-0.007
ined-0.007
Token his
Token ID465
Feature activation+1.22e-02

Pos logit contributions
ained+0.050
akes+0.049
ags+0.049
ahs+0.049
affles+0.048
Neg logit contributions
 Pen-0.027
 unw-0.024
 grand-0.023
 unsc-0.022
An-0.022
Token sc
Token ID629
Feature activation+4.65e-02

Pos logit contributions
ained+0.018
akes+0.018
ags+0.018
ahs+0.018
ined+0.018
Neg logit contributions
 Pen-0.016
 unw-0.015
 grand-0.014
 unsc-0.014
 keeper-0.014
Tokenooter
Token ID25141
Feature activation+1.41e-01

Pos logit contributions
ained+0.031
akes+0.030
ags+0.030
ahs+0.029
affles+0.028
Neg logit contributions
 Pen-0.098
 unw-0.094
 grand-0.093
 unsc-0.091
An-0.090
Token.
Token ID13
Feature activation-6.58e-03

Pos logit contributions
ained+0.240
akes+0.236
ags+0.235
ahs+0.232
affles+0.230
Neg logit contributions
 Pen-0.150
 unw-0.137
 grand-0.134
 unsc-0.128
An-0.126
Token Tim
Token ID5045
Feature activation+2.10e-03

Pos logit contributions
 Pen+0.009
 unw+0.008
 unsc+0.008
 security+0.008
An+0.008
Neg logit contributions
ained-0.009
ahs-0.009
akes-0.009
orting-0.009
ags-0.008
Tokenmy
Token ID1820
Feature activation+3.64e-02

Pos logit contributions
ained+0.002
akes+0.002
ags+0.002
ahs+0.002
ined+0.002
Neg logit contributions
 Pen-0.004
 unw-0.004
 grand-0.004
An-0.004
 unsc-0.003
Token learned
Token ID4499
Feature activation-1.34e-02

Pos logit contributions
ined+0.044
ahs+0.041
affles+0.041
ained+0.041
w+0.039
Neg logit contributions
 Pen-0.056
Un-0.050
 Eva-0.049
An-0.049
 unw-0.049
Token his
Token ID465
Feature activation+1.07e-02

Pos logit contributions
 Pen+0.019
 unw+0.017
 Eva+0.016
 gasped+0.016
An+0.016
Neg logit contributions
ained-0.019
ags-0.018
ahs-0.018
orting-0.018
affles-0.018
Token to
Token ID284
Feature activation+6.14e-03

Pos logit contributions
 Pen+0.007
 grand+0.006
 unw+0.006
 unsc+0.005
 keeper+0.005
Neg logit contributions
ained-0.019
akes-0.019
ags-0.019
ined-0.019
agons-0.019
Token play
Token ID711
Feature activation+5.97e-02

Pos logit contributions
ained+0.009
akes+0.009
ined+0.009
 highlight+0.009
ahs+0.009
Neg logit contributions
 Pen-0.008
 unw-0.008
 grand-0.008
 unsc-0.007
 open-0.007
Token with
Token ID351
Feature activation+4.38e-03

Pos logit contributions
 highlight+0.105
ained+0.104
unes+0.101
ahs+0.101
akes+0.100
Neg logit contributions
 Pen-0.064
 unw-0.061
 grand-0.059
 quickly-0.058
 unsc-0.057
Token their
Token ID511
Feature activation+8.81e-04

Pos logit contributions
ained+0.008
ags+0.007
akes+0.007
 highlight+0.007
ined+0.007
Neg logit contributions
 Pen-0.005
 grand-0.005
 unw-0.005
 unsc-0.004
An-0.004
Token sc
Token ID629
Feature activation+5.51e-02

Pos logit contributions
ained+0.001
akes+0.001
ined+0.001
ags+0.001
 highlight+0.001
Neg logit contributions
 grand-0.001
 Pen-0.001
 unw-0.001
 keeper-0.001
 open-0.001
Tokenooters
Token ID48316
Feature activation+1.39e-01

Pos logit contributions
ained+0.051
ined+0.050
akes+0.049
 highlight+0.047
ags+0.046
Neg logit contributions
 grand-0.097
 unw-0.095
 Pen-0.095
 keeper-0.094
keeper-0.093
Token in
Token ID287
Feature activation+2.56e-02

Pos logit contributions
affles+0.229
 highlight+0.226
ags+0.223
ained+0.222
ined+0.220
Neg logit contributions
 Pen-0.148
 grand-0.143
 unw-0.137
 keeper-0.136
keeper-0.131
Token the
Token ID262
Feature activation+1.26e-02

Pos logit contributions
ags+0.042
ained+0.042
akes+0.040
 highlight+0.040
agons+0.040
Neg logit contributions
 Pen-0.026
keeper-0.025
 unw-0.025
 keeper-0.023
 unsc-0.022
Token park
Token ID3952
Feature activation-1.25e-03

Pos logit contributions
akes+0.023
ained+0.023
ahs+0.022
affles+0.022
unes+0.021
Neg logit contributions
 grand-0.013
 Pen-0.012
 unw-0.012
 un-0.012
 open-0.011
Token.
Token ID13
Feature activation+9.73e-03

Pos logit contributions
 Pen+0.001
 grand+0.001
 unw+0.001
 keeper+0.001
 unsc+0.001
Neg logit contributions
ained-0.002
ags-0.002
affles-0.002
akes-0.002
agons-0.002
Token Ben
Token ID3932
Feature activation-1.71e-02

Pos logit contributions
ined+0.020
affles+0.019
akes+0.019
ags+0.019
ained+0.019
Neg logit contributions
 Pen-0.009
 grand-0.008
An-0.007
 unw-0.007
Un-0.007
Token and
Token ID290
Feature activation+1.93e-02

Pos logit contributions
affles+0.046
ained+0.045
ags+0.045
ined+0.045
akes+0.044
Neg logit contributions
 Pen-0.022
 grand-0.020
 unw-0.020
 keeper-0.019
 unsc-0.018
Token got
Token ID1392
Feature activation+6.64e-03

Pos logit contributions
ained+0.031
akes+0.031
ined+0.030
affles+0.030
ags+0.030
Neg logit contributions
 Pen-0.024
 grand-0.023
 unw-0.022
 unsc-0.021
 keeper-0.020
Token on
Token ID319
Feature activation+3.21e-02

Pos logit contributions
akes+0.012
mm+0.012
affles+0.012
agons+0.011
ags+0.011
Neg logit contributions
 quickly-0.007
 unw-0.007
 grand-0.007
 Pen-0.007
 un-0.007
Token their
Token ID511
Feature activation-7.09e-03

Pos logit contributions
ained+0.052
ags+0.051
ahs+0.051
akes+0.050
agons+0.049
Neg logit contributions
 Pen-0.035
 unw-0.032
keeper-0.031
An-0.030
 keeper-0.030
Token sc
Token ID629
Feature activation+4.80e-02

Pos logit contributions
 grand+0.010
 Pen+0.010
 unw+0.009
 keeper+0.009
 unsc+0.009
Neg logit contributions
ained-0.011
akes-0.010
ined-0.010
affles-0.010
ahs-0.010
Tokenooters
Token ID48316
Feature activation+1.39e-01

Pos logit contributions
ained+0.039
akes+0.037
ined+0.037
ags+0.036
 highlight+0.036
Neg logit contributions
 Pen-0.091
 grand-0.090
 unw-0.089
 unsc-0.087
 keeper-0.086
Token.
Token ID13
Feature activation+1.07e-02

Pos logit contributions
affles+0.244
ained+0.243
ags+0.243
akes+0.241
ined+0.238
Neg logit contributions
 Pen-0.135
 grand-0.124
 unw-0.122
 keeper-0.115
 unsc-0.115
Token They
Token ID1119
Feature activation+6.28e-03

Pos logit contributions
ined+0.021
akes+0.020
affles+0.020
ags+0.020
 highlight+0.020
Neg logit contributions
 Pen-0.010
 grand-0.009
 unw-0.008
An-0.008
Un-0.008
Token looked
Token ID3114
Feature activation+6.73e-04

Pos logit contributions
ags+0.010
ained+0.010
akes+0.010
affles+0.010
agons+0.010
Neg logit contributions
 Pen-0.007
 unw-0.007
 grand-0.007
 unsc-0.007
 quickly-0.006
Token at
Token ID379
Feature activation+9.49e-03

Pos logit contributions
akes+0.001
mm+0.001
vals+0.001
affles+0.001
w+0.001
Neg logit contributions
 grand-0.001
 open-0.001
 quickly-0.001
 un-0.001
 tight-0.001
Token each
Token ID1123
Feature activation+1.07e-02

Pos logit contributions
akes+0.020
ained+0.020
ined+0.020
affles+0.020
agons+0.019
Neg logit contributions
 grand-0.007
 Pen-0.007
 unw-0.006
 quickly-0.005
 unsc-0.005
TokenAs
Token ID1722
Feature activation-2.28e-02

Pos logit contributions
 Pen+0.007
 unw+0.006
 grand+0.006
 keeper+0.006
keeper+0.006
Neg logit contributions
ained-0.009
akes-0.008
ahs-0.008
affles-0.008
ags-0.008
Token Tim
Token ID5045
Feature activation+8.97e-03

Pos logit contributions
 Pen+0.021
 grand+0.019
 unw+0.017
 quickly+0.017
 keeper+0.016
Neg logit contributions
ags-0.048
ained-0.045
akes-0.045
affles-0.045
ined-0.045
Token rode
Token ID22075
Feature activation+2.72e-02

Pos logit contributions
ained+0.011
akes+0.011
ags+0.010
ahs+0.010
ined+0.010
Neg logit contributions
 Pen-0.014
 unw-0.013
 grand-0.013
 keeper-0.013
An-0.013
Token the
Token ID262
Feature activation+6.44e-03

Pos logit contributions
akes+0.048
affles+0.048
ained+0.046
agons+0.046
ined+0.046
Neg logit contributions
 Pen-0.030
 grand-0.028
 keeper-0.027
 quickly-0.026
 unw-0.026
Token sc
Token ID629
Feature activation+4.50e-02

Pos logit contributions
ained+0.009
akes+0.009
ags+0.009
affles+0.009
ined+0.009
Neg logit contributions
 Pen-0.009
 unw-0.008
 grand-0.008
 unsc-0.008
 keeper-0.008
Tokenooter
Token ID25141
Feature activation+1.38e-01

Pos logit contributions
ained+0.031
akes+0.030
ags+0.029
ined+0.029
affles+0.028
Neg logit contributions
 Pen-0.092
 unw-0.090
 grand-0.089
 unsc-0.087
keeper-0.086
Token,
Token ID11
Feature activation+7.77e-03

Pos logit contributions
affles+0.219
ags+0.215
akes+0.213
ained+0.213
agons+0.209
Neg logit contributions
 Pen-0.165
 grand-0.156
 unw-0.155
 keeper-0.150
keeper-0.148
Token he
Token ID339
Feature activation+3.09e-04

Pos logit contributions
ags+0.017
affles+0.016
akes+0.016
agging+0.016
agons+0.016
Neg logit contributions
 Pen-0.006
 grand-0.006
 quickly-0.005
 unw-0.005
 unsc-0.005
Token went
Token ID1816
Feature activation+1.73e-02

Pos logit contributions
ags+0.001
akes+0.001
ained+0.001
affles+0.001
agons+0.001
Neg logit contributions
 Pen-0.000
 unw-0.000
 grand-0.000
 unsc-0.000
An-0.000
Token very
Token ID845
Feature activation+1.82e-02

Pos logit contributions
akes+0.036
affles+0.035
mm+0.033
 highlight+0.033
agons+0.033
Neg logit contributions
 unw-0.014
 Pen-0.014
 quickly-0.014
 grand-0.014
 keeper-0.014
Token fast
Token ID3049
Feature activation-1.17e-02

Pos logit contributions
ained+0.032
ahs+0.031
akes+0.031
ags+0.031
ined+0.030
Neg logit contributions
 Pen-0.018
 unw-0.017
 grand-0.016
 unsc-0.015
An-0.015
Token loved
Token ID6151
Feature activation+2.49e-03

Pos logit contributions
 Pen+0.012
 unw+0.012
 grand+0.011
 keeper+0.011
keeper+0.011
Neg logit contributions
ained-0.012
ined-0.012
akes-0.012
ags-0.011
ahs-0.011
Token to
Token ID284
Feature activation-1.21e-02

Pos logit contributions
ained+0.005
akes+0.005
ags+0.005
ahs+0.005
affles+0.005
Neg logit contributions
 Pen-0.002
 unw-0.002
 keeper-0.002
 grand-0.002
 unsc-0.002
Token ride
Token ID6594
Feature activation-2.20e-03

Pos logit contributions
 Pen+0.017
 unw+0.017
 grand+0.016
 unsc+0.016
An+0.015
Neg logit contributions
ained-0.017
akes-0.016
ags-0.016
ahs-0.016
ined-0.016
Token her
Token ID607
Feature activation-2.97e-02

Pos logit contributions
 Pen+0.002
 unw+0.002
 grand+0.002
An+0.002
 unsc+0.002
Neg logit contributions
ained-0.004
akes-0.004
ags-0.004
ahs-0.004
affles-0.004
Token sc
Token ID629
Feature activation+4.28e-02

Pos logit contributions
 Pen+0.045
 unw+0.042
 unsc+0.041
 Eva+0.041
An+0.040
Neg logit contributions
ags-0.037
ahs-0.037
agons-0.037
ained-0.036
akes-0.035
Tokenooter
Token ID25141
Feature activation+1.38e-01

Pos logit contributions
ained+0.027
akes+0.026
ags+0.025
ined+0.024
ahs+0.024
Neg logit contributions
 Pen-0.091
 unw-0.088
 grand-0.088
 unsc-0.085
 keeper-0.085
Token every
Token ID790
Feature activation+9.33e-03

Pos logit contributions
ained+0.203
akes+0.203
affles+0.201
ags+0.201
 highlight+0.196
Neg logit contributions
 Pen-0.175
 grand-0.165
 unw-0.165
 keeper-0.156
 unsc-0.155
Token day
Token ID1110
Feature activation+3.65e-02

Pos logit contributions
ags+0.007
akes+0.007
ained+0.007
affles+0.007
agons+0.007
Neg logit contributions
 Pen-0.019
 grand-0.018
 unw-0.018
An-0.017
 unsc-0.017
Token.
Token ID13
Feature activation-1.59e-02

Pos logit contributions
ained+0.067
akes+0.066
ags+0.065
ahs+0.065
affles+0.064
Neg logit contributions
 Pen-0.035
 unw-0.032
 grand-0.031
 unsc-0.029
An-0.029
Token One
Token ID1881
Feature activation+5.34e-02

Pos logit contributions
 Pen+0.017
 unw+0.016
 grand+0.015
 unsc+0.015
An+0.015
Neg logit contributions
ained-0.027
akes-0.027
ags-0.026
ahs-0.026
affles-0.026
Token day
Token ID1110
Feature activation+3.43e-02

Pos logit contributions
ained+0.033
akes+0.032
ags+0.031
ahs+0.030
ined+0.029
Neg logit contributions
 Pen-0.116
 unw-0.111
 grand-0.110
 unsc-0.107
An-0.107
Token pretended
Token ID36766
Feature activation-3.41e-02

Pos logit contributions
ahs+0.017
ined+0.016
affles+0.016
ained+0.016
agons+0.015
Neg logit contributions
 Pen-0.016
An-0.013
 unw-0.013
 grand-0.013
 Eva-0.013
Token to
Token ID284
Feature activation-6.53e-03

Pos logit contributions
 Pen+0.022
 grand+0.020
 unw+0.019
 keeper+0.018
 quickly+0.017
Neg logit contributions
akes-0.076
ags-0.075
ained-0.075
agons-0.073
affles-0.073
Token throw
Token ID3714
Feature activation+3.49e-02

Pos logit contributions
 Pen+0.008
 unw+0.008
 grand+0.007
An+0.007
keeper+0.007
Neg logit contributions
ained-0.010
akes-0.010
ags-0.009
ahs-0.009
affles-0.009
Token his
Token ID465
Feature activation+9.18e-03

Pos logit contributions
ained+0.071
akes+0.069
affles+0.068
 highlight+0.068
ined+0.068
Neg logit contributions
 Pen-0.028
 unw-0.027
 open-0.027
 grand-0.026
 quickly-0.025
Token sc
Token ID629
Feature activation+4.69e-02

Pos logit contributions
ained+0.018
akes+0.016
ined+0.016
orting+0.016
reetings+0.015
Neg logit contributions
 grand-0.010
 unw-0.009
 open-0.009
 Pen-0.009
 unh-0.009
Tokenooter
Token ID25141
Feature activation+1.38e-01

Pos logit contributions
ained+0.042
ined+0.041
akes+0.040
 highlight+0.038
affles+0.037
Neg logit contributions
 unw-0.086
 grand-0.085
 Pen-0.083
 unsc-0.083
keeper-0.082
Token,
Token ID11
Feature activation+9.09e-03

Pos logit contributions
ained+0.211
akes+0.208
affles+0.207
ags+0.207
ined+0.203
Neg logit contributions
 Pen-0.167
 unw-0.161
 grand-0.157
 keeper-0.152
 unsc-0.151
Token and
Token ID290
Feature activation-3.26e-03

Pos logit contributions
akes+0.016
ained+0.016
ags+0.016
affles+0.016
ined+0.016
Neg logit contributions
 Pen-0.010
 unw-0.009
 grand-0.009
 unsc-0.009
 cautiously-0.008
Token the
Token ID262
Feature activation+3.33e-03

Pos logit contributions
 Pen+0.004
 unw+0.004
 grand+0.004
 unsc+0.003
 keeper+0.003
Neg logit contributions
ained-0.005
akes-0.005
ags-0.005
affles-0.005
ined-0.005
Token dog
Token ID3290
Feature activation-1.61e-02

Pos logit contributions
ained+0.006
akes+0.006
ags+0.006
agged+0.006
affles+0.006
Neg logit contributions
 grand-0.004
 keeper-0.004
 Pen-0.003
 unw-0.003
keeper-0.003
Token ran
Token ID4966
Feature activation+7.53e-03

Pos logit contributions
 Pen+0.017
 unw+0.016
 grand+0.015
 keeper+0.015
 quickly+0.014
Neg logit contributions
ags-0.029
ained-0.029
akes-0.029
affles-0.028
agons-0.028
Token ostr
Token ID42387
Feature activation+5.58e-02

Pos logit contributions
ained+0.026
akes+0.026
ags+0.026
ined+0.025
agged+0.025
Neg logit contributions
 grand-0.027
 Pen-0.027
 keeper-0.026
 unw-0.025
keeper-0.025
Tokenich
Token ID488
Feature activation+4.38e-02

Pos logit contributions
ained+0.076
ags+0.074
affles+0.072
akes+0.072
 highlight+0.071
Neg logit contributions
 Pen-0.078
 keeper-0.075
 grand-0.073
 unw-0.073
keeper-0.072
Token pen
Token ID3112
Feature activation+3.39e-02

Pos logit contributions
ained+0.069
ahs+0.069
akes+0.069
ags+0.067
orting+0.066
Neg logit contributions
 Pen-0.050
An-0.045
 unw-0.045
 grand-0.044
 unsc-0.043
Token.
Token ID13
Feature activation+1.85e-02

Pos logit contributions
ained+0.052
ags+0.051
akes+0.050
ahs+0.050
ined+0.049
Neg logit contributions
 Pen-0.042
 unw-0.038
An-0.038
 grand-0.038
 unsc-0.037
Token Ost
Token ID38919
Feature activation+2.25e-02

Pos logit contributions
ained+0.029
akes+0.029
ags+0.029
ahs+0.028
affles+0.028
Neg logit contributions
 Pen-0.022
 unw-0.020
 grand-0.020
 unsc-0.019
An-0.019
Tokenric
Token ID1173
Feature activation+1.37e-01

Pos logit contributions
ained+0.012
ags+0.011
akes+0.010
ined+0.010
ahs+0.010
Neg logit contributions
 Pen-0.052
 unw-0.050
 grand-0.050
 unsc-0.048
keeper-0.048
Tokenhes
Token ID956
Feature activation-5.97e-02

Pos logit contributions
ained+0.151
akes+0.148
ags+0.146
ahs+0.143
affles+0.141
Neg logit contributions
 Pen-0.230
 unw-0.217
 grand-0.214
 unsc-0.208
An-0.207
Token are
Token ID389
Feature activation-1.58e-02

Pos logit contributions
 Pen+0.077
 unw+0.068
 grand+0.068
 keeper+0.067
 unsc+0.065
Neg logit contributions
ained-0.090
ags-0.087
akes-0.087
 highlight-0.087
affles-0.086
Token big
Token ID1263
Feature activation-4.25e-03

Pos logit contributions
 Pen+0.016
 unw+0.015
 grand+0.014
 unsc+0.014
An+0.014
Neg logit contributions
ained-0.028
akes-0.027
ags-0.027
ahs-0.027
affles-0.026
Token birds
Token ID10087
Feature activation+3.29e-02

Pos logit contributions
 Pen+0.005
 grand+0.005
 unw+0.004
 keeper+0.004
 unsc+0.004
Neg logit contributions
ained-0.007
ined-0.007
ags-0.007
akes-0.007
 highlight-0.007
Token with
Token ID351
Feature activation-3.79e-03

Pos logit contributions
ained+0.053
akes+0.053
ags+0.052
ahs+0.051
ined+0.051
Neg logit contributions
 Pen-0.037
 unw-0.035
 grand-0.034
An-0.032
 unsc-0.032
Token zoom
Token ID19792
Feature activation+4.97e-02

Pos logit contributions
 Pen+0.015
 unw+0.015
 grand+0.014
 unsc+0.013
 un+0.013
Neg logit contributions
ained-0.018
ags-0.017
akes-0.017
ined-0.016
ahs-0.016
Token around
Token ID1088
Feature activation+1.68e-02

Pos logit contributions
ained+0.069
akes+0.068
affles+0.067
 highlight+0.067
ags+0.066
Neg logit contributions
 Pen-0.068
 grand-0.064
 unw-0.064
 unsc-0.061
 keeper-0.061
Token on
Token ID319
Feature activation+2.27e-02

Pos logit contributions
ained+0.029
akes+0.029
ags+0.028
ahs+0.028
ined+0.028
Neg logit contributions
 Pen-0.017
 unw-0.016
 grand-0.016
 unsc-0.015
An-0.015
Token his
Token ID465
Feature activation-3.08e-03

Pos logit contributions
ahs+0.033
ags+0.033
ained+0.033
agons+0.033
akes+0.032
Neg logit contributions
 Pen-0.026
keeper-0.026
 keeper-0.025
 unw-0.023
 passer-0.023
Token sc
Token ID629
Feature activation+3.94e-02

Pos logit contributions
 Pen+0.005
 unw+0.004
An+0.004
 unsc+0.004
 Eva+0.004
Neg logit contributions
ags-0.004
agons-0.004
ahs-0.004
agging-0.004
 highlight-0.004
Tokenooter
Token ID25141
Feature activation+1.37e-01

Pos logit contributions
ained+0.021
akes+0.020
ags+0.020
ahs+0.019
affles+0.018
Neg logit contributions
 Pen-0.089
 unw-0.084
 grand-0.083
An-0.082
 unsc-0.082
Token.
Token ID13
Feature activation-2.59e-02

Pos logit contributions
ained+0.250
ags+0.248
akes+0.248
affles+0.246
 highlight+0.242
Neg logit contributions
 Pen-0.126
 unw-0.116
 grand-0.115
 unsc-0.108
 keeper-0.108
Token He
Token ID679
Feature activation+7.50e-04

Pos logit contributions
 Pen+0.036
 unw+0.033
 keeper+0.031
keeper+0.031
An+0.031
Neg logit contributions
ained-0.036
akes-0.036
ags-0.035
ahs-0.034
 highlight-0.033
Token was
Token ID373
Feature activation-8.16e-03

Pos logit contributions
ags+0.001
akes+0.001
affles+0.001
ained+0.001
unes+0.001
Neg logit contributions
 Pen-0.001
 grand-0.001
 unw-0.001
 unsc-0.001
 quickly-0.001
Token so
Token ID523
Feature activation+3.14e-02

Pos logit contributions
 Pen+0.009
 grand+0.008
 unw+0.008
 unsc+0.008
An+0.008
Neg logit contributions
akes-0.014
ags-0.013
affles-0.013
ained-0.013
agons-0.013
Token fast
Token ID3049
Feature activation-1.76e-02

Pos logit contributions
ained+0.052
ined+0.050
akes+0.050
ahs+0.049
ags+0.049
Neg logit contributions
 Pen-0.036
 unw-0.033
keeper-0.031
 keeper-0.031
 grand-0.031
Token.
Token ID13
Feature activation+3.67e-03

Pos logit contributions
 Pen+0.001
 unw+0.001
 grand+0.000
 keeper+0.000
 unsc+0.000
Neg logit contributions
affles-0.001
ags-0.001
ained-0.001
akes-0.001
agons-0.001
Token They
Token ID1119
Feature activation+1.18e-03

Pos logit contributions
ained+0.006
akes+0.006
ags+0.006
ahs+0.006
ined+0.006
Neg logit contributions
 Pen-0.004
 unw-0.004
 grand-0.004
 unsc-0.004
An-0.003
Token were
Token ID547
Feature activation+2.65e-03

Pos logit contributions
ained+0.002
akes+0.002
ags+0.002
affles+0.002
ahs+0.002
Neg logit contributions
 Pen-0.001
 unw-0.001
 grand-0.001
 unsc-0.001
An-0.001
Token called
Token ID1444
Feature activation+4.39e-02

Pos logit contributions
akes+0.004
affles+0.004
ained+0.004
ags+0.004
agons+0.004
Neg logit contributions
 Pen-0.003
 grand-0.003
 unw-0.003
 keeper-0.003
 Eva-0.002
Token ost
Token ID23619
Feature activation+7.18e-02

Pos logit contributions
ined+0.073
ained+0.072
akes+0.071
affles+0.071
agons+0.069
Neg logit contributions
 Pen-0.057
 grand-0.052
 keeper-0.048
 unw-0.048
 Eva-0.046
Tokenric
Token ID1173
Feature activation+1.37e-01

Pos logit contributions
affles+0.064
ained+0.062
ags+0.062
 highlight+0.060
akes+0.059
Neg logit contributions
 Pen-0.131
 grand-0.123
 keeper-0.123
 unw-0.120
An-0.120
Tokenhes
Token ID956
Feature activation-6.02e-02

Pos logit contributions
ained+0.160
akes+0.160
ags+0.156
affles+0.154
 highlight+0.153
Neg logit contributions
 Pen-0.217
 unw-0.198
An-0.198
 grand-0.196
 unsc-0.192
Token.
Token ID13
Feature activation+1.40e-03

Pos logit contributions
 Pen+0.061
 grand+0.053
 unw+0.052
 keeper+0.051
 quickly+0.051
Neg logit contributions
ags-0.109
ained-0.107
affles-0.106
akes-0.106
agons-0.105
Token Lily
Token ID20037
Feature activation-1.05e-02

Pos logit contributions
ined+0.003
ained+0.003
akes+0.003
ags+0.003
affles+0.003
Neg logit contributions
 Pen-0.001
 grand-0.001
 unw-0.001
An-0.001
 keeper-0.001
Token and
Token ID290
Feature activation+4.94e-04

Pos logit contributions
 Pen+0.013
Un+0.012
keeper+0.012
 unw+0.012
 abrupt+0.012
Neg logit contributions
ahs-0.014
ained-0.013
ined-0.013
akes-0.013
affles-0.012
Token Ben
Token ID3932
Feature activation-2.67e-02

Pos logit contributions
ahs+0.001
unes+0.001
ained+0.001
akes+0.001
affles+0.001
Neg logit contributions
 unw-0.001
 unsc-0.001
keeper-0.001
An-0.001
 cautiously-0.001
Token creature
Token ID7185
Feature activation-2.52e-02

Pos logit contributions
akes+0.003
ained+0.003
ags+0.003
ahs+0.003
affles+0.003
Neg logit contributions
 Pen-0.002
 grand-0.002
 unw-0.002
 keeper-0.002
An-0.002
Token was
Token ID373
Feature activation-6.89e-03

Pos logit contributions
 Pen+0.024
 unw+0.022
 grand+0.021
 unsc+0.020
An+0.020
Neg logit contributions
ained-0.046
akes-0.046
ags-0.045
ahs-0.045
affles-0.045
Token a
Token ID257
Feature activation-1.22e-02

Pos logit contributions
 Pen+0.008
 unw+0.008
 grand+0.007
keeper+0.007
 unsc+0.007
Neg logit contributions
ained-0.011
ined-0.010
ags-0.010
ahs-0.010
 highlight-0.010
Token little
Token ID1310
Feature activation-2.23e-02

Pos logit contributions
 Pen+0.012
 unw+0.011
 grand+0.011
 unsc+0.010
 keeper+0.010
Neg logit contributions
ained-0.022
akes-0.022
ags-0.022
affles-0.021
ahs-0.021
Token rac
Token ID3444
Feature activation-1.80e-02

Pos logit contributions
 Pen+0.026
 unw+0.024
 grand+0.023
 unsc+0.022
 keeper+0.022
Neg logit contributions
ained-0.036
akes-0.036
ags-0.036
ahs-0.035
affles-0.035
Tokenoon
Token ID2049
Feature activation-1.33e-01

Pos logit contributions
 Pen+0.029
 grand+0.027
 unw+0.027
 unsc+0.026
keeper+0.026
Neg logit contributions
ained-0.021
ags-0.021
akes-0.021
ined-0.020
affles-0.020
Token.
Token ID13
Feature activation-2.17e-02

Pos logit contributions
 Pen+0.156
 unw+0.145
 grand+0.141
 unsc+0.135
An+0.134
Neg logit contributions
ained-0.214
akes-0.210
ags-0.210
ahs-0.205
affles-0.205
Token He
Token ID679
Feature activation+3.94e-03

Pos logit contributions
 unsc+0.031
 unw+0.030
Un+0.030
Far+0.029
 keeper+0.029
Neg logit contributions
ags-0.025
agged-0.023
ained-0.023
 Which-0.022
unes-0.022
Token was
Token ID373
Feature activation-6.42e-03

Pos logit contributions
ahs+0.006
ained+0.006
ined+0.006
affles+0.006
akes+0.006
Neg logit contributions
 Pen-0.004
 unw-0.004
 grand-0.004
 unsc-0.003
keeper-0.003
Token very
Token ID845
Feature activation+1.13e-02

Pos logit contributions
 Pen+0.007
 unw+0.007
 grand+0.007
keeper+0.007
 unsc+0.007
Neg logit contributions
ained-0.010
ags-0.010
 highlight-0.010
ined-0.010
ahs-0.009
Token hungry
Token ID14720
Feature activation-1.27e-02

Pos logit contributions
ained+0.017
ined+0.016
ging+0.016
ahs+0.016
ags+0.016
Neg logit contributions
 Pen-0.013
 unw-0.012
 unsc-0.011
An-0.011
 grand-0.011
Token struggled
Token ID11615
Feature activation-8.57e-02

Pos logit contributions
 Pen+0.003
 unw+0.003
 unsc+0.003
 grand+0.003
 quickly+0.003
Neg logit contributions
ags-0.003
ained-0.003
akes-0.003
 highlight-0.003
agons-0.003
Token out
Token ID503
Feature activation-5.15e-03

Pos logit contributions
 Pen+0.058
 unw+0.053
 grand+0.050
 unsc+0.049
An+0.045
Neg logit contributions
akes-0.179
ags-0.177
ained-0.177
affles-0.176
agons-0.175
Token of
Token ID286
Feature activation-1.10e-02

Pos logit contributions
 Pen+0.005
 unw+0.004
 unsc+0.004
 grand+0.004
 quickly+0.004
Neg logit contributions
akes-0.009
affles-0.009
ags-0.009
ained-0.009
agons-0.009
Token its
Token ID663
Feature activation-6.33e-03

Pos logit contributions
 Pen+0.011
 unw+0.010
keeper+0.010
An+0.010
 keeper+0.010
Neg logit contributions
ained-0.019
ags-0.019
akes-0.018
ahs-0.018
ined-0.018
Token coc
Token ID8954
Feature activation-1.58e-02

Pos logit contributions
 Pen+0.008
 unw+0.007
An+0.007
 grand+0.007
 unsc+0.007
Neg logit contributions
ained-0.010
ags-0.010
akes-0.010
ahs-0.010
agons-0.010
Tokenoon
Token ID2049
Feature activation-1.32e-01

Pos logit contributions
 grand+0.025
 Pen+0.024
 keeper+0.024
keeper+0.023
 security+0.022
Neg logit contributions
ained-0.023
ined-0.022
akes-0.021
agging-0.020
ags-0.020
Token.
Token ID13
Feature activation-2.27e-02

Pos logit contributions
 Pen+0.137
 unw+0.126
 grand+0.123
 unsc+0.117
 keeper+0.115
Neg logit contributions
ained-0.228
akes-0.226
ags-0.225
ahs-0.222
affles-0.221
Token It
Token ID632
Feature activation-8.49e-03

Pos logit contributions
 Pen+0.031
Un+0.030
 unw+0.030
 keeper+0.029
 security+0.029
Neg logit contributions
ained-0.028
ags-0.027
akes-0.027
unes-0.027
 sun-0.025
Token cl
Token ID537
Feature activation-3.76e-02

Pos logit contributions
 Pen+0.013
 Eva+0.013
 grand+0.013
Laura+0.012
Kat+0.012
Neg logit contributions
ained-0.010
ined-0.010
w-0.009
ahs-0.009
akes-0.009
Tokenung
Token ID2150
Feature activation-3.90e-02

Pos logit contributions
 Pen+0.066
 unw+0.063
 grand+0.062
An+0.061
 unsc+0.061
Neg logit contributions
ahs-0.036
affles-0.035
ained-0.035
agons-0.035
 highlight-0.035
Token to
Token ID284
Feature activation-2.03e-02

Pos logit contributions
 Pen+0.038
 grand+0.034
 unw+0.033
 quickly+0.032
An+0.032
Neg logit contributions
affles-0.071
akes-0.071
ahs-0.071
ained-0.071
 highlight-0.070
Token created
Token ID2727
Feature activation+3.79e-03

Pos logit contributions
 Pen+0.013
 unw+0.012
 grand+0.012
An+0.012
 keeper+0.011
Neg logit contributions
ained-0.016
akes-0.015
ags-0.015
ahs-0.015
ined-0.015
Token the
Token ID262
Feature activation-5.85e-03

Pos logit contributions
ained+0.006
ags+0.006
akes+0.006
ahs+0.006
ined+0.006
Neg logit contributions
 Pen-0.004
 unw-0.004
 grand-0.004
 keeper-0.004
An-0.004
Token most
Token ID749
Feature activation+5.94e-02

Pos logit contributions
 unw+0.008
 Pen+0.008
 grand+0.007
 unsc+0.007
 open+0.007
Neg logit contributions
ahs-0.009
ained-0.009
affles-0.009
akes-0.009
agged-0.009
Token beautiful
Token ID4950
Feature activation-3.31e-03

Pos logit contributions
ags+0.107
ained+0.106
akes+0.105
ahs+0.103
affles+0.102
Neg logit contributions
 Pen-0.059
 unw-0.057
 grand-0.055
 unsc-0.053
 keeper-0.051
Token coc
Token ID8954
Feature activation-1.33e-02

Pos logit contributions
 unw+0.004
 Pen+0.004
 grand+0.004
 unsc+0.004
 open+0.004
Neg logit contributions
ained-0.005
ags-0.005
ahs-0.005
akes-0.005
affles-0.005
Tokenoon
Token ID2049
Feature activation-1.32e-01

Pos logit contributions
 grand+0.021
 Pen+0.020
 unw+0.019
 keeper+0.019
keeper+0.019
Neg logit contributions
ined-0.017
ained-0.017
akes-0.017
affles-0.016
agons-0.016
Token.
Token ID13
Feature activation-2.08e-02

Pos logit contributions
 Pen+0.138
 unw+0.126
 grand+0.123
 unsc+0.117
An+0.116
Neg logit contributions
ained-0.227
akes-0.224
ags-0.223
ahs-0.219
affles-0.218
Token
Token ID198
Feature activation-1.71e-02

Pos logit contributions
 Pen+0.025
 unw+0.023
 grand+0.023
 unsc+0.022
 keeper+0.022
Neg logit contributions
ained-0.033
akes-0.032
ags-0.032
ahs-0.031
ined-0.031
Token
Token ID198
Feature activation-1.70e-02

Pos logit contributions
 Pen+0.021
 unw+0.020
 grand+0.019
 keeper+0.019
keeper+0.019
Neg logit contributions
ained-0.025
ags-0.023
ined-0.023
affles-0.023
ahs-0.023
TokenFinally
Token ID11158
Feature activation+4.23e-02

Pos logit contributions
 Pen+0.023
 grand+0.021
 unw+0.021
keeper+0.019
 keeper+0.019
Neg logit contributions
ained-0.024
 highlight-0.023
ahs-0.023
agons-0.023
affles-0.022
Token,
Token ID11
Feature activation-1.24e-03

Pos logit contributions
ained+0.081
akes+0.080
ags+0.080
affles+0.077
agons+0.076
Neg logit contributions
 Pen-0.043
 grand-0.040
 unw-0.040
 quickly-0.037
 unsc-0.037
Token wall
Token ID3355
Feature activation-4.68e-02

Pos logit contributions
ained+0.002
akes+0.002
ags+0.002
ined+0.002
affles+0.002
Neg logit contributions
 grand-0.001
 Pen-0.001
An-0.001
 keeper-0.001
 open-0.001
Token.
Token ID13
Feature activation-1.24e-02

Pos logit contributions
 Pen+0.046
 unw+0.042
 grand+0.041
 unsc+0.039
 keeper+0.039
Neg logit contributions
ained-0.084
ags-0.083
akes-0.083
affles-0.082
ahs-0.081
Token
Token ID220
Feature activation+3.29e-02

Pos logit contributions
 unw+0.014
 Pen+0.014
 keeper+0.013
 unsc+0.013
 grand+0.013
Neg logit contributions
ained-0.019
akes-0.019
ags-0.018
ahs-0.018
agons-0.018
Token
Token ID198
Feature activation-1.91e-02

Pos logit contributions
ained+0.056
akes+0.054
ags+0.054
ahs+0.052
ined+0.051
Neg logit contributions
 Pen-0.035
 unw-0.034
 grand-0.032
 keeper-0.031
 unsc-0.031
Token
Token ID198
Feature activation-1.79e-02

Pos logit contributions
 Pen+0.019
 unw+0.019
 keeper+0.018
keeper+0.018
 grand+0.018
Neg logit contributions
ained-0.033
akes-0.032
ags-0.031
ahs-0.031
agons-0.031
TokenFar
Token ID21428
Feature activation-1.31e-01

Pos logit contributions
 Pen+0.020
 unw+0.020
 keeper+0.019
keeper+0.019
 grand+0.019
Neg logit contributions
ained-0.028
agons-0.028
ahs-0.028
akes-0.027
ags-0.027
Tokenley
Token ID1636
Feature activation-3.89e-02

Pos logit contributions
 Pen+0.156
 unw+0.144
 grand+0.141
 unsc+0.136
An+0.136
Neg logit contributions
ained-0.209
akes-0.205
ags-0.204
affles-0.201
ahs-0.201
Token watched
Token ID7342
Feature activation+9.44e-04

Pos logit contributions
 Pen+0.044
Un+0.040
 unw+0.038
 grand+0.038
An+0.037
Neg logit contributions
ahs-0.058
ined-0.057
akes-0.057
ained-0.055
affles-0.055
Token as
Token ID355
Feature activation-1.44e-02

Pos logit contributions
ained+0.002
akes+0.002
affles+0.002
ags+0.002
 highlight+0.002
Neg logit contributions
 Pen-0.001
 grand-0.001
 unw-0.001
 unsc-0.001
 cautiously-0.001
Token his
Token ID465
Feature activation+2.55e-03

Pos logit contributions
 unw+0.017
 keeper+0.015
keeper+0.015
 Pen+0.015
urious+0.014
Neg logit contributions
ahs-0.022
ained-0.021
akes-0.020
ging-0.020
agons-0.020
Token mom
Token ID1995
Feature activation+7.58e-03

Pos logit contributions
ained+0.005
akes+0.005
ined+0.005
ags+0.005
affles+0.005
Neg logit contributions
 Pen-0.002
 grand-0.002
 unw-0.002
 keeper-0.002
 unsc-0.002
Token and
Token ID290
Feature activation-1.27e-02

Pos logit contributions
 Pen+0.029
 unw+0.027
 grand+0.025
 unsc+0.025
An+0.025
Neg logit contributions
ained-0.037
ags-0.036
akes-0.036
ahs-0.036
ined-0.036
Token said
Token ID531
Feature activation+1.39e-02

Pos logit contributions
 Pen+0.013
 unw+0.012
An+0.011
 unsc+0.011
 grand+0.011
Neg logit contributions
ained-0.021
ahs-0.021
ined-0.020
affles-0.020
akes-0.020
Token,
Token ID11
Feature activation-1.55e-03

Pos logit contributions
ained+0.021
akes+0.021
ags+0.020
affles+0.020
 highlight+0.020
Neg logit contributions
 Pen-0.018
 unw-0.017
 grand-0.017
 keeper-0.016
 unsc-0.016
Token "
Token ID366
Feature activation-3.25e-02

Pos logit contributions
 Pen+0.001
 grand+0.001
 unw+0.001
An+0.001
 unsc+0.001
Neg logit contributions
ained-0.003
akes-0.003
ags-0.003
affles-0.003
ined-0.003
TokenS
Token ID50
Feature activation-2.61e-02

Pos logit contributions
 Pen+0.038
An+0.034
 unw+0.034
 grand+0.034
Un+0.032
Neg logit contributions
ags-0.054
ained-0.053
affles-0.053
akes-0.053
agons-0.052
Tokenhel
Token ID2978
Feature activation-1.31e-01

Pos logit contributions
 unw+0.033
 grand+0.031
 Pen+0.030
 unsc+0.030
keeper+0.029
Neg logit contributions
ained-0.038
 highlight-0.037
ags-0.037
ined-0.036
ahs-0.035
Tokenter
Token ID353
Feature activation-3.47e-03

Pos logit contributions
 Pen+0.175
 grand+0.160
 unw+0.160
 keeper+0.154
 unsc+0.154
Neg logit contributions
ained-0.189
ags-0.186
akes-0.184
 highlight-0.180
affles-0.177
Token is
Token ID318
Feature activation-4.87e-02

Pos logit contributions
 Pen+0.004
 grand+0.004
 unw+0.004
 keeper+0.004
 unsc+0.004
Neg logit contributions
ained-0.006
ags-0.006
akes-0.006
 highlight-0.006
affles-0.005
Token when
Token ID618
Feature activation-3.33e-02

Pos logit contributions
 Pen+0.044
 grand+0.041
 unw+0.038
 unsc+0.037
 open+0.036
Neg logit contributions
akes-0.099
ags-0.095
affles-0.095
ained-0.094
agons-0.094
Token you
Token ID345
Feature activation+1.18e-02

Pos logit contributions
 Pen+0.036
 grand+0.034
 quickly+0.030
 unw+0.029
 open+0.029
Neg logit contributions
ags-0.062
ained-0.060
akes-0.060
ined-0.060
ahs-0.059
Token have
Token ID423
Feature activation+5.45e-03

Pos logit contributions
ained+0.017
ahs+0.017
ags+0.017
ined+0.016
akes+0.016
Neg logit contributions
 unw-0.016
 Pen-0.015
 grand-0.014
 unsc-0.014
 open-0.014
Token and
Token ID290
Feature activation-1.16e-02

Pos logit contributions
ained+0.017
ags+0.017
ahs+0.016
akes+0.016
ined+0.016
Neg logit contributions
 Pen-0.026
 unw-0.024
 grand-0.024
Un-0.024
An-0.024
Token bumped
Token ID38034
Feature activation+1.27e-03

Pos logit contributions
 Pen+0.014
 unw+0.013
 quickly+0.013
 unsc+0.012
 grand+0.012
Neg logit contributions
ags-0.021
akes-0.020
agons-0.020
affles-0.020
ained-0.020
Token against
Token ID1028
Feature activation+1.51e-02

Pos logit contributions
ained+0.002
ags+0.002
akes+0.002
ahs+0.002
ined+0.002
Neg logit contributions
 Pen-0.002
 unw-0.002
 grand-0.002
 keeper-0.001
An-0.001
Token the
Token ID262
Feature activation-3.58e-03

Pos logit contributions
ained+0.027
ags+0.027
akes+0.027
ahs+0.027
affles+0.026
Neg logit contributions
 Pen-0.014
 unw-0.013
 grand-0.012
 unsc-0.012
 keeper-0.012
Token books
Token ID3835
Feature activation-1.36e-02

Pos logit contributions
 Pen+0.004
 unw+0.004
 grand+0.004
 unsc+0.003
 keeper+0.003
Neg logit contributions
ained-0.006
akes-0.006
ags-0.006
ined-0.006
affles-0.006
Tokenhel
Token ID2978
Feature activation-1.31e-01

Pos logit contributions
 Pen+0.022
 grand+0.020
 passer+0.019
Un+0.019
An+0.019
Neg logit contributions
ained-0.015
akes-0.015
ahs-0.015
 on-0.015
 until-0.013
Tokenves
Token ID1158
Feature activation-3.43e-02

Pos logit contributions
 Pen+0.204
 unw+0.197
 grand+0.196
 unsc+0.188
An+0.185
Neg logit contributions
ahs-0.157
ained-0.152
akes-0.149
ined-0.146
ags-0.142
Token.
Token ID13
Feature activation-2.22e-02

Pos logit contributions
 Pen+0.039
 unw+0.036
 grand+0.035
 unsc+0.034
An+0.033
Neg logit contributions
ained-0.056
akes-0.055
ags-0.055
ahs-0.054
affles-0.054
Token He
Token ID679
Feature activation+6.22e-03

Pos logit contributions
 Pen+0.030
 unw+0.029
 grand+0.028
 keeper+0.028
 unsc+0.027
Neg logit contributions
ained-0.031
akes-0.029
ags-0.029
ahs-0.029
affles-0.028
Token smiled
Token ID13541
Feature activation-2.14e-02

Pos logit contributions
ained+0.010
akes+0.010
ahs+0.010
ags+0.010
ined+0.010
Neg logit contributions
 Pen-0.007
 unw-0.006
 grand-0.006
 keeper-0.006
An-0.006
Token as
Token ID355
Feature activation-1.93e-02

Pos logit contributions
 Pen+0.030
 unw+0.027
 unsc+0.026
An+0.026
 Eva+0.025
Neg logit contributions
ained-0.029
ahs-0.028
ags-0.028
ined-0.028
akes-0.028
Token one
Token ID530
Feature activation+5.00e-02

Pos logit contributions
akes+0.006
ags+0.006
agons+0.006
affles+0.006
ained+0.005
Neg logit contributions
 grand-0.003
 Pen-0.003
 quickly-0.003
 unw-0.003
 cautiously-0.003
Token day
Token ID1110
Feature activation+3.14e-02

Pos logit contributions
ained+0.034
akes+0.033
ags+0.033
ahs+0.031
affles+0.030
Neg logit contributions
 Pen-0.106
 unw-0.101
 grand-0.100
 unsc-0.098
An-0.097
Token,
Token ID11
Feature activation+3.09e-04

Pos logit contributions
ags+0.061
ained+0.059
akes+0.058
agons+0.058
affles+0.058
Neg logit contributions
 Pen-0.031
 grand-0.031
 unw-0.029
 unsc-0.028
 quickly-0.028
Token the
Token ID262
Feature activation-5.30e-03

Pos logit contributions
ags+0.001
agons+0.001
agging+0.001
affles+0.001
akes+0.001
Neg logit contributions
 grand-0.000
 quickly-0.000
 unw-0.000
 Pen-0.000
 unsc-0.000
Token coc
Token ID8954
Feature activation-1.38e-02

Pos logit contributions
 Pen+0.006
 unw+0.006
 grand+0.005
 unsc+0.005
An+0.005
Neg logit contributions
ained-0.009
akes-0.009
ags-0.008
ahs-0.008
affles-0.008
Tokenoon
Token ID2049
Feature activation-1.30e-01

Pos logit contributions
 Pen+0.017
 grand+0.016
 unw+0.016
 keeper+0.015
 unsc+0.014
Neg logit contributions
ained-0.022
akes-0.021
ined-0.021
ags-0.021
ahs-0.021
Token broke
Token ID6265
Feature activation+1.28e-02

Pos logit contributions
 Pen+0.147
 unw+0.141
 grand+0.133
 keeper+0.131
 unsc+0.130
Neg logit contributions
ags-0.216
ained-0.211
akes-0.210
affles-0.209
agons-0.206
Token open
Token ID1280
Feature activation-4.27e-02

Pos logit contributions
ained+0.020
akes+0.020
ags+0.020
ahs+0.020
affles+0.019
Neg logit contributions
 Pen-0.015
 unw-0.014
 grand-0.014
 unsc-0.013
An-0.013
Token and
Token ID290
Feature activation-9.33e-03

Pos logit contributions
 Pen+0.050
 unw+0.045
 grand+0.045
An+0.043
 unsc+0.042
Neg logit contributions
ained-0.068
akes-0.067
ags-0.066
ined-0.065
ahs-0.065
Token a
Token ID257
Feature activation-6.26e-03

Pos logit contributions
 Pen+0.009
 grand+0.009
 unw+0.009
 unsc+0.008
 keeper+0.008
Neg logit contributions
ags-0.017
akes-0.017
affles-0.017
ained-0.017
agons-0.017
Token beautiful
Token ID4950
Feature activation-4.12e-03

Pos logit contributions
 Pen+0.006
 unw+0.006
 grand+0.006
 unsc+0.006
 keeper+0.006
Neg logit contributions
ained-0.011
ags-0.011
akes-0.011
affles-0.011
ahs-0.011
Token put
Token ID1234
Feature activation-8.22e-03

Pos logit contributions
 Pen+0.010
 unw+0.009
 grand+0.009
 unsc+0.009
An+0.008
Neg logit contributions
ained-0.011
akes-0.011
ags-0.011
ahs-0.011
affles-0.010
Token them
Token ID606
Feature activation+2.59e-02

Pos logit contributions
 Pen+0.010
 unw+0.010
 quickly+0.009
 grand+0.009
An+0.009
Neg logit contributions
ained-0.013
akes-0.013
agons-0.012
agging-0.012
 highlight-0.012
Token near
Token ID1474
Feature activation+3.94e-03

Pos logit contributions
ained+0.040
akes+0.039
affles+0.039
ahs+0.038
agons+0.038
Neg logit contributions
 Pen-0.032
 unw-0.032
 quickly-0.029
 open-0.029
 cautiously-0.028
Token the
Token ID262
Feature activation-2.87e-03

Pos logit contributions
ags+0.006
ahs+0.006
ained+0.006
akes+0.006
affles+0.006
Neg logit contributions
 Pen-0.004
 unw-0.004
An-0.004
keeper-0.004
 keeper-0.004
Token rac
Token ID3444
Feature activation-1.55e-02

Pos logit contributions
 Pen+0.003
 unw+0.003
 grand+0.003
 keeper+0.003
 unsc+0.003
Neg logit contributions
ained-0.005
akes-0.005
affles-0.005
ags-0.005
ined-0.005
Tokenoon
Token ID2049
Feature activation-1.30e-01

Pos logit contributions
 grand+0.028
 Pen+0.028
 unw+0.027
keeper+0.026
 Eva+0.025
Neg logit contributions
agons-0.015
ags-0.015
affles-0.015
akes-0.015
ined-0.015
Token.
Token ID13
Feature activation-1.71e-02

Pos logit contributions
 Pen+0.125
 unw+0.114
 grand+0.110
 unsc+0.105
 keeper+0.104
Neg logit contributions
ained-0.235
akes-0.231
ags-0.230
ahs-0.227
affles-0.226
Token He
Token ID679
Feature activation+9.36e-03

Pos logit contributions
 abrupt+0.022
 keeper+0.022
 unsc+0.022
 unw+0.021
Un+0.020
Neg logit contributions
ags-0.021
ained-0.020
akes-0.020
orting-0.020
unes-0.020
Token ate
Token ID15063
Feature activation+5.50e-03

Pos logit contributions
ahs+0.013
ined+0.012
affles+0.012
ained+0.012
akes+0.012
Neg logit contributions
 Pen-0.013
 grand-0.011
 unw-0.011
An-0.011
 keeper-0.011
Token them
Token ID606
Feature activation+2.66e-02

Pos logit contributions
ained+0.010
akes+0.009
ags+0.009
agons+0.009
 highlight+0.009
Neg logit contributions
 Pen-0.006
 unw-0.006
 grand-0.005
 quickly-0.005
 unsc-0.005
Token so
Token ID523
Feature activation+4.04e-02

Pos logit contributions
ained+0.040
akes+0.039
ags+0.039
ahs+0.038
affles+0.038
Neg logit contributions
 Pen-0.034
 unw-0.032
 grand-0.031
 unsc-0.030
An-0.030
Token cater
Token ID20825
Feature activation-3.53e-03

Pos logit contributions
 Pen+0.004
 grand+0.003
 unw+0.003
 keeper+0.003
An+0.003
Neg logit contributions
ags-0.009
ained-0.008
akes-0.008
ined-0.008
ahs-0.008
Tokenpillar
Token ID41643
Feature activation-2.62e-04

Pos logit contributions
 Pen+0.004
 unw+0.004
An+0.004
 unsc+0.004
 grand+0.004
Neg logit contributions
ained-0.006
ags-0.006
 highlight-0.006
ahs-0.006
akes-0.006
Token made
Token ID925
Feature activation+2.83e-03

Pos logit contributions
 Pen+0.000
 grand+0.000
An+0.000
 unw+0.000
Un+0.000
Neg logit contributions
ined-0.000
ained-0.000
ahs-0.000
akes-0.000
 highlight-0.000
Token a
Token ID257
Feature activation-7.67e-03

Pos logit contributions
ained+0.005
akes+0.005
ags+0.005
ahs+0.005
ined+0.005
Neg logit contributions
 Pen-0.003
 unw-0.003
 grand-0.003
An-0.003
 keeper-0.003
Token coc
Token ID8954
Feature activation-1.28e-02

Pos logit contributions
 Pen+0.010
An+0.009
 unw+0.009
 Eva+0.008
 grand+0.008
Neg logit contributions
 highlight-0.011
ahs-0.011
akes-0.010
ained-0.010
affles-0.010
Tokenoon
Token ID2049
Feature activation-1.30e-01

Pos logit contributions
 grand+0.020
 Pen+0.019
 keeper+0.018
 security+0.018
keeper+0.018
Neg logit contributions
ined-0.018
ained-0.018
akes-0.017
agging-0.016
affles-0.016
Token and
Token ID290
Feature activation-1.18e-02

Pos logit contributions
 Pen+0.141
 unw+0.129
 grand+0.126
 unsc+0.120
An+0.119
Neg logit contributions
ained-0.219
akes-0.216
ags-0.214
ahs-0.211
ined-0.210
Token stayed
Token ID9658
Feature activation+1.92e-02

Pos logit contributions
 Pen+0.013
 unw+0.012
 grand+0.012
 unsc+0.011
 quickly+0.011
Neg logit contributions
ags-0.021
ained-0.021
akes-0.021
affles-0.020
agons-0.020
Token inside
Token ID2641
Feature activation-2.42e-02

Pos logit contributions
akes+0.032
ained+0.032
ahs+0.032
affles+0.031
ags+0.031
Neg logit contributions
 Pen-0.021
 unw-0.020
 grand-0.020
 unsc-0.019
 keeper-0.018
Token for
Token ID329
Feature activation+7.48e-03

Pos logit contributions
 Pen+0.027
 unw+0.025
 grand+0.024
An+0.024
 keeper+0.023
Neg logit contributions
ained-0.039
ags-0.038
akes-0.038
ined-0.038
ahs-0.038
Token a
Token ID257
Feature activation-7.69e-03

Pos logit contributions
akes+0.014
ained+0.014
ahs+0.014
affles+0.013
agons+0.013
Neg logit contributions
 Pen-0.007
 grand-0.007
 unw-0.007
 unsc-0.006
 quickly-0.006
Token He
Token ID679
Feature activation+8.62e-03

Pos logit contributions
 Pen+0.020
 unw+0.018
 grand+0.018
 unsc+0.017
 keeper+0.017
Neg logit contributions
ained-0.031
akes-0.030
ags-0.030
ahs-0.030
affles-0.030
Token stayed
Token ID9658
Feature activation+2.23e-02

Pos logit contributions
ined+0.013
ained+0.013
ahs+0.013
affles+0.012
akes+0.012
Neg logit contributions
 Pen-0.011
 grand-0.009
 keeper-0.009
 unw-0.009
keeper-0.009
Token inside
Token ID2641
Feature activation-2.15e-02

Pos logit contributions
ained+0.034
akes+0.034
ahs+0.033
ags+0.033
affles+0.033
Neg logit contributions
 Pen-0.028
 unw-0.027
 grand-0.026
 unsc-0.025
 keeper-0.025
Token the
Token ID262
Feature activation-5.08e-03

Pos logit contributions
 Pen+0.026
 unw+0.024
 grand+0.023
 unsc+0.022
An+0.022
Neg logit contributions
ained-0.034
akes-0.033
ags-0.033
ahs-0.033
affles-0.032
Token coc
Token ID8954
Feature activation-1.22e-02

Pos logit contributions
 Pen+0.006
 unw+0.006
 keeper+0.005
keeper+0.005
 grand+0.005
Neg logit contributions
ained-0.008
 highlight-0.007
akes-0.007
ags-0.007
affles-0.007
Tokenoon
Token ID2049
Feature activation-1.30e-01

Pos logit contributions
 grand+0.020
 Pen+0.020
 keeper+0.018
 unw+0.018
keeper+0.018
Neg logit contributions
ined-0.015
ained-0.015
akes-0.014
agging-0.014
affles-0.014
Token for
Token ID329
Feature activation+1.08e-02

Pos logit contributions
 Pen+0.161
 unw+0.149
 grand+0.144
An+0.140
 unsc+0.140
Neg logit contributions
ained-0.198
akes-0.195
ags-0.192
ahs-0.190
ined-0.189
Token days
Token ID1528
Feature activation-5.27e-03

Pos logit contributions
ained+0.014
akes+0.014
ahs+0.013
ined+0.012
 highlight+0.012
Neg logit contributions
 Pen-0.017
 grand-0.017
 unw-0.017
 unsc-0.016
An-0.016
Token,
Token ID11
Feature activation+2.50e-03

Pos logit contributions
 Pen+0.008
An+0.008
 unw+0.007
Un+0.007
 passer+0.007
Neg logit contributions
ained-0.006
akes-0.006
ahs-0.006
agging-0.006
ging-0.006
Token until
Token ID1566
Feature activation+1.14e-02

Pos logit contributions
akes+0.004
ained+0.004
affles+0.004
ags+0.004
agons+0.004
Neg logit contributions
 unw-0.003
 Pen-0.003
 grand-0.003
 unsc-0.003
 quickly-0.003
Token finally
Token ID3443
Feature activation+5.10e-03

Pos logit contributions
akes+0.023
affles+0.023
agons+0.023
ags+0.023
ained+0.022
Neg logit contributions
 Pen-0.010
 grand-0.009
 unw-0.009
 unsc-0.009
 quickly-0.009
Token Then
Token ID3244
Feature activation+2.49e-02

Pos logit contributions
ained+0.008
akes+0.008
ined+0.008
ags+0.008
ahs+0.008
Neg logit contributions
 Pen-0.007
 grand-0.006
 unw-0.006
An-0.006
 unsc-0.006
Token I
Token ID314
Feature activation+9.95e-03

Pos logit contributions
ained+0.051
ags+0.049
akes+0.048
ined+0.048
affles+0.047
Neg logit contributions
 Pen-0.026
 grand-0.024
 unw-0.023
�-0.021
 open-0.021
Token made
Token ID925
Feature activation+8.01e-03

Pos logit contributions
ags+0.015
ained+0.015
ahs+0.014
akes+0.014
agons+0.014
Neg logit contributions
 Pen-0.013
 unw-0.013
 unsc-0.012
 grand-0.012
An-0.012
Token a
Token ID257
Feature activation-1.96e-03

Pos logit contributions
ained+0.014
akes+0.014
ags+0.014
ahs+0.014
ined+0.013
Neg logit contributions
 Pen-0.009
 unw-0.008
 grand-0.008
 unsc-0.007
An-0.007
Token coc
Token ID8954
Feature activation-1.46e-02

Pos logit contributions
 Pen+0.002
 unw+0.002
 grand+0.002
 keeper+0.002
 unsc+0.002
Neg logit contributions
ained-0.003
akes-0.003
ags-0.003
ined-0.003
ahs-0.003
Tokenoon
Token ID2049
Feature activation-1.30e-01

Pos logit contributions
 Pen+0.018
 grand+0.018
 unw+0.017
 keeper+0.016
keeper+0.016
Neg logit contributions
ained-0.023
akes-0.022
ined-0.022
ags-0.022
affles-0.022
Token around
Token ID1088
Feature activation+9.05e-03

Pos logit contributions
 Pen+0.141
 unw+0.127
 grand+0.124
 keeper+0.119
 unsc+0.118
Neg logit contributions
ained-0.220
ags-0.218
affles-0.215
ahs-0.214
akes-0.214
Token myself
Token ID3589
Feature activation+2.75e-03

Pos logit contributions
ained+0.018
ahs+0.017
akes+0.017
 highlight+0.017
ags+0.017
Neg logit contributions
 Pen-0.008
 grand-0.007
 unw-0.007
An-0.007
 Eva-0.006
Token and
Token ID290
Feature activation-3.78e-03

Pos logit contributions
ained+0.004
akes+0.004
ags+0.004
ahs+0.004
 highlight+0.003
Neg logit contributions
 Pen-0.004
 unw-0.004
 grand-0.004
 unsc-0.003
Un-0.003
Token slept
Token ID21256
Feature activation-4.26e-02

Pos logit contributions
 Pen+0.004
 grand+0.004
 unw+0.004
 quickly+0.004
 unsc+0.004
Neg logit contributions
ained-0.007
ags-0.007
akes-0.006
ahs-0.006
ined-0.006
Token for
Token ID329
Feature activation+1.30e-02

Pos logit contributions
 Pen+0.055
 unw+0.052
 grand+0.051
 unsc+0.049
An+0.048
Neg logit contributions
ained-0.063
ags-0.062
akes-0.061
ined-0.061
affles-0.059
Token was
Token ID373
Feature activation-4.91e-03

Pos logit contributions
 Pen+0.005
 grand+0.005
Un+0.004
 unw+0.004
 Eva+0.004
Neg logit contributions
ined-0.007
ained-0.006
ahs-0.006
 highlight-0.006
akes-0.006
Token a
Token ID257
Feature activation-7.83e-03

Pos logit contributions
 Pen+0.005
 unw+0.005
 grand+0.004
An+0.004
 unsc+0.004
Neg logit contributions
ained-0.009
ags-0.008
akes-0.008
ahs-0.008
ined-0.008
Token tiny
Token ID7009
Feature activation-6.55e-03

Pos logit contributions
 grand+0.007
 unw+0.007
 Pen+0.007
 keeper+0.006
 unsc+0.006
Neg logit contributions
ained-0.016
ags-0.016
akes-0.016
agged-0.015
affles-0.015
Token butterfly
Token ID35113
Feature activation-1.56e-02

Pos logit contributions
 Pen+0.007
 unw+0.007
 grand+0.007
 unsc+0.006
 keeper+0.006
Neg logit contributions
ained-0.012
ags-0.012
akes-0.011
affles-0.011
ahs-0.011
Token coc
Token ID8954
Feature activation-1.35e-02

Pos logit contributions
 Pen+0.016
 unw+0.015
 grand+0.014
 unsc+0.014
An+0.014
Neg logit contributions
ained-0.027
akes-0.027
ags-0.026
ahs-0.026
ined-0.026
Tokenoon
Token ID2049
Feature activation-1.30e-01

Pos logit contributions
 grand+0.018
 Pen+0.018
 keeper+0.016
keeper+0.016
 unw+0.016
Neg logit contributions
ained-0.020
ined-0.020
akes-0.020
ags-0.019
affles-0.019
Token!
Token ID0
Feature activation-1.13e-02

Pos logit contributions
 Pen+0.153
 unw+0.141
 grand+0.138
 unsc+0.132
An+0.131
Neg logit contributions
ained-0.207
akes-0.203
ags-0.201
ahs-0.199
ined-0.198
Token He
Token ID679
Feature activation+8.04e-03

Pos logit contributions
 Pen+0.012
 unw+0.011
 grand+0.010
 unsc+0.010
An+0.010
Neg logit contributions
ained-0.020
akes-0.019
ags-0.019
ahs-0.019
affles-0.019
Token felt
Token ID2936
Feature activation-2.23e-04

Pos logit contributions
ined+0.013
ahs+0.013
ained+0.013
akes+0.012
affles+0.012
Neg logit contributions
 Pen-0.009
 grand-0.008
 unw-0.008
Un-0.007
 keeper-0.007
Token so
Token ID523
Feature activation+3.63e-02

Pos logit contributions
 Pen+0.000
 grand+0.000
 unw+0.000
 keeper+0.000
An+0.000
Neg logit contributions
ained-0.000
ags-0.000
ined-0.000
 highlight-0.000
akes-0.000
Token excited
Token ID6568
Feature activation-6.16e-02

Pos logit contributions
ained+0.056
ags+0.055
ined+0.055
akes+0.053
 highlight+0.053
Neg logit contributions
 Pen-0.045
 grand-0.041
 unw-0.040
Un-0.039
 keeper-0.039
Token starting
Token ID3599
Feature activation-5.52e-02

Pos logit contributions
 Pen+0.002
 unw+0.002
 grand+0.002
 unsc+0.002
An+0.002
Neg logit contributions
ained-0.003
ags-0.003
akes-0.003
ined-0.003
ahs-0.003
Token to
Token ID284
Feature activation-1.60e-02

Pos logit contributions
 Pen+0.065
 Eva+0.055
 grand+0.055
 unw+0.054
keeper+0.054
Neg logit contributions
ained-0.099
ined-0.085
ags-0.084
ahs-0.083
 highlight-0.081
Token spin
Token ID7906
Feature activation+1.39e-02

Pos logit contributions
 Pen+0.021
 grand+0.019
 unw+0.019
keeper+0.019
An+0.018
Neg logit contributions
ained-0.023
akes-0.022
ags-0.022
 highlight-0.022
ahs-0.022
Token a
Token ID257
Feature activation-7.29e-03

Pos logit contributions
ained+0.019
akes+0.018
ags+0.017
ined+0.017
ahs+0.017
Neg logit contributions
 Pen-0.021
 unw-0.019
 grand-0.019
 keeper-0.018
keeper-0.018
Token coc
Token ID8954
Feature activation-1.39e-02

Pos logit contributions
 Pen+0.009
 unw+0.008
 grand+0.008
 keeper+0.007
 Eva+0.007
Neg logit contributions
akes-0.011
ained-0.011
ahs-0.011
 highlight-0.011
ags-0.011
Tokenoon
Token ID2049
Feature activation-1.29e-01

Pos logit contributions
 grand+0.022
 Pen+0.021
keeper+0.021
 keeper+0.020
 security+0.019
Neg logit contributions
ained-0.020
ined-0.019
akes-0.019
agging-0.017
ags-0.017
Token.
Token ID13
Feature activation-1.78e-02

Pos logit contributions
 Pen+0.151
 unw+0.139
 grand+0.137
 unsc+0.130
An+0.130
Neg logit contributions
ained-0.208
akes-0.205
ags-0.203
ahs-0.200
ined-0.199
Token "
Token ID366
Feature activation-2.73e-02

Pos logit contributions
 Pen+0.019
 unw+0.018
 unsc+0.018
 grand+0.017
Un+0.017
Neg logit contributions
ained-0.028
ags-0.027
unes-0.027
akes-0.027
ahs-0.026
TokenWhat
Token ID2061
Feature activation+4.54e-02

Pos logit contributions
 Pen+0.039
An+0.037
 unw+0.037
Un+0.036
 grand+0.035
Neg logit contributions
ags-0.038
ained-0.038
affles-0.038
akes-0.037
agons-0.037
Token are
Token ID389
Feature activation-1.39e-02

Pos logit contributions
akes+0.061
ained+0.061
w+0.059
ahs+0.058
affles+0.057
Neg logit contributions
 Pen-0.061
keeper-0.059
 keeper-0.059
 unw-0.059
 cautiously-0.056
Token you
Token ID345
Feature activation+1.33e-02

Pos logit contributions
 Pen+0.019
 grand+0.018
 unw+0.018
An+0.017
 unsc+0.017
Neg logit contributions
ained-0.020
akes-0.020
ags-0.020
ined-0.019
ahs-0.019
TokenI
Token ID40
Feature activation+1.40e-02

Pos logit contributions
 Pen+0.028
An+0.026
 unw+0.026
 grand+0.025
 unsc+0.024
Neg logit contributions
ags-0.045
ained-0.045
akes-0.044
affles-0.044
agons-0.043
Token'm
Token ID1101
Feature activation+8.19e-03

Pos logit contributions
ags+0.020
ained+0.019
affles+0.018
agons+0.018
ahs+0.018
Neg logit contributions
 Pen-0.020
 unw-0.019
 grand-0.019
 unsc-0.018
An-0.018
Token making
Token ID1642
Feature activation+3.02e-03

Pos logit contributions
akes+0.009
ags+0.009
affles+0.009
ained+0.009
ahs+0.009
Neg logit contributions
 Pen-0.014
 unw-0.013
 grand-0.013
 unsc-0.013
An-0.012
Token a
Token ID257
Feature activation-2.82e-03

Pos logit contributions
ained+0.005
ags+0.005
akes+0.005
affles+0.005
ahs+0.005
Neg logit contributions
 Pen-0.003
 keeper-0.003
keeper-0.003
 unw-0.003
 grand-0.003
Token coc
Token ID8954
Feature activation-1.25e-02

Pos logit contributions
 Pen+0.003
 unw+0.003
An+0.003
keeper+0.003
 grand+0.003
Neg logit contributions
ained-0.004
akes-0.004
ahs-0.004
 highlight-0.004
ags-0.004
Tokenoon
Token ID2049
Feature activation-1.28e-01

Pos logit contributions
 Pen+0.016
 grand+0.015
 unw+0.014
 keeper+0.014
keeper+0.014
Neg logit contributions
ained-0.020
akes-0.019
ined-0.019
ags-0.019
affles-0.019
Token so
Token ID523
Feature activation+4.08e-02

Pos logit contributions
 Pen+0.135
 unw+0.130
 grand+0.128
 quickly+0.121
 open+0.120
Neg logit contributions
ags-0.216
 highlight-0.215
affles-0.214
ained-0.212
ahs-0.209
Token I
Token ID314
Feature activation+1.36e-02

Pos logit contributions
ained+0.058
akes+0.058
ined+0.055
ahs+0.055
ags+0.055
Neg logit contributions
 Pen-0.053
 unw-0.052
keeper-0.048
 keeper-0.048
 unsc-0.048
Token can
Token ID460
Feature activation-1.43e-02

Pos logit contributions
ags+0.024
agging+0.022
agons+0.022
ained+0.022
agged+0.022
Neg logit contributions
 unw-0.016
 unsc-0.015
 open-0.015
 unt-0.014
 quickly-0.014
Token turn
Token ID1210
Feature activation-2.92e-02

Pos logit contributions
 Pen+0.019
 unw+0.018
 grand+0.017
 unsc+0.017
An+0.016
Neg logit contributions
ained-0.021
ags-0.021
akes-0.020
ined-0.020
affles-0.020
Token into
Token ID656
Feature activation+1.57e-02

Pos logit contributions
�+0.029
 quickly+0.027
 open+0.026
 unw+0.026
 grand+0.025
Neg logit contributions
affles-0.058
 highlight-0.056
ined-0.056
agged-0.055
ags-0.054
Token tree
Token ID5509
Feature activation-4.80e-02

Pos logit contributions
 Pen+0.009
 unw+0.008
 unsc+0.008
 Jacob+0.008
keeper+0.008
Neg logit contributions
 highlight-0.008
ained-0.008
 sun-0.007
w-0.007
 morning-0.007
Token and
Token ID290
Feature activation-1.01e-02

Pos logit contributions
 Pen+0.075
An+0.074
Un+0.071
Jamie+0.068
Laura+0.067
Neg logit contributions
ained-0.052
orting-0.050
ined-0.048
akes-0.047
ahs-0.047
Token found
Token ID1043
Feature activation-1.57e-03

Pos logit contributions
 Pen+0.012
 grand+0.011
 unw+0.011
An+0.011
Un+0.011
Neg logit contributions
ained-0.014
ahs-0.014
ined-0.014
 highlight-0.014
akes-0.014
Token the
Token ID262
Feature activation-5.33e-03

Pos logit contributions
 Pen+0.002
 unw+0.002
 grand+0.002
An+0.001
 unsc+0.001
Neg logit contributions
ained-0.003
ags-0.003
ahs-0.003
akes-0.003
ined-0.003
Token coc
Token ID8954
Feature activation-1.08e-02

Pos logit contributions
 Pen+0.006
 unw+0.006
 grand+0.006
 unsc+0.005
An+0.005
Neg logit contributions
ained-0.009
akes-0.008
ags-0.008
 highlight-0.008
ahs-0.008
Tokenoon
Token ID2049
Feature activation-1.28e-01

Pos logit contributions
 grand+0.015
 Pen+0.015
 keeper+0.014
 unw+0.014
keeper+0.013
Neg logit contributions
ained-0.016
ined-0.016
akes-0.015
ags-0.015
affles-0.014
Token.
Token ID13
Feature activation-1.30e-02

Pos logit contributions
 Pen+0.148
 unw+0.137
 grand+0.134
 unsc+0.128
An+0.127
Neg logit contributions
ained-0.207
akes-0.203
ags-0.202
ahs-0.199
ined-0.198
Token He
Token ID679
Feature activation+9.09e-03

Pos logit contributions
 Pen+0.014
 unw+0.013
An+0.012
 grand+0.012
 unsc+0.012
Neg logit contributions
ained-0.022
akes-0.022
ags-0.022
ahs-0.021
 highlight-0.021
Token went
Token ID1816
Feature activation+1.23e-02

Pos logit contributions
ined+0.013
ahs+0.012
ained+0.012
affles+0.012
w+0.012
Neg logit contributions
 Pen-0.011
Un-0.010
 grand-0.010
 keeper-0.010
keeper-0.010
Token inside
Token ID2641
Feature activation-2.00e-02

Pos logit contributions
akes+0.023
affles+0.022
ained+0.022
ahs+0.021
agons+0.021
Neg logit contributions
 Pen-0.013
 unw-0.012
 keeper-0.011
 cautiously-0.011
 grand-0.011
Token and
Token ID290
Feature activation-6.77e-03

Pos logit contributions
 Pen+0.026
Un+0.023
An+0.023
 unw+0.023
keeper+0.023
Neg logit contributions
ained-0.027
ined-0.027
ags-0.026
ahs-0.026
akes-0.025
Token her
Token ID607
Feature activation-3.33e-02

Pos logit contributions
 Pen+0.010
 unw+0.009
 grand+0.009
keeper+0.009
 keeper+0.009
Neg logit contributions
ained-0.021
ags-0.021
akes-0.021
affles-0.021
ahs-0.021
Token not
Token ID407
Feature activation-3.48e-02

Pos logit contributions
 Pen+0.033
 unw+0.030
 grand+0.028
 unsc+0.027
An+0.027
Neg logit contributions
ained-0.060
akes-0.058
ahs-0.058
ags-0.058
affles-0.057
Token to
Token ID284
Feature activation-1.66e-02

Pos logit contributions
 Pen+0.041
 unw+0.035
An+0.034
keeper+0.033
 Eva+0.032
Neg logit contributions
ained-0.058
ags-0.053
 highlight-0.053
ahs-0.052
unes-0.051
Token investigate
Token ID9161
Feature activation-4.12e-02

Pos logit contributions
 Pen+0.022
An+0.019
keeper+0.018
 Eva+0.018
Laura+0.018
Neg logit contributions
ained-0.024
ags-0.023
akes-0.023
 highlight-0.023
agons-0.022
Token books
Token ID3835
Feature activation-1.25e-02

Pos logit contributions
 Pen+0.041
 grand+0.038
 unw+0.036
 unsc+0.034
 quickly+0.033
Neg logit contributions
ained-0.075
akes-0.075
ags-0.074
ahs-0.073
affles-0.073
Tokenhel
Token ID2978
Feature activation-1.28e-01

Pos logit contributions
 Pen+0.019
 unw+0.017
An+0.017
 grand+0.017
Un+0.017
Neg logit contributions
ained-0.016
akes-0.015
ahs-0.015
ags-0.015
ined-0.014
Tokenves
Token ID1158
Feature activation-3.15e-02

Pos logit contributions
 Pen+0.223
 unw+0.220
 grand+0.217
keeper+0.210
 unsc+0.210
Neg logit contributions
ahs-0.131
affles-0.124
akes-0.122
ained-0.122
ined-0.119
Token alone
Token ID3436
Feature activation-1.95e-03

Pos logit contributions
 Pen+0.035
 unw+0.033
 grand+0.032
 unsc+0.030
 keeper+0.030
Neg logit contributions
ained-0.052
akes-0.051
ags-0.051
affles-0.050
ahs-0.050
Token,
Token ID11
Feature activation+9.62e-04

Pos logit contributions
 Pen+0.002
 unw+0.002
 grand+0.002
 unsc+0.002
An+0.002
Neg logit contributions
ained-0.003
akes-0.003
ags-0.003
ahs-0.003
ined-0.003
Token but
Token ID475
Feature activation+1.63e-02

Pos logit contributions
ags+0.002
akes+0.002
ained+0.002
ined+0.002
affles+0.002
Neg logit contributions
 Pen-0.001
 unw-0.001
 grand-0.001
 unsc-0.001
 keeper-0.001
Token she
Token ID673
Feature activation-1.61e-02

Pos logit contributions
ained+0.025
 highlight+0.023
ahs+0.023
ined+0.022
akes+0.022
Neg logit contributions
 unw-0.020
 unsc-0.019
 Pen-0.019
keeper-0.018
 grand-0.017
Token spreading
Token ID14342
Feature activation-2.60e-02

Pos logit contributions
 unw+0.044
 Pen+0.043
 unsc+0.041
 grand+0.040
 cautiously+0.039
Neg logit contributions
akes-0.059
affles-0.059
ained-0.058
ags-0.058
ahs-0.057
Token out
Token ID503
Feature activation-3.48e-03

Pos logit contributions
 Pen+0.024
 unw+0.024
 grand+0.023
 unsc+0.022
 quickly+0.022
Neg logit contributions
affles-0.049
ained-0.048
akes-0.048
ahs-0.047
agons-0.047
Token of
Token ID286
Feature activation-6.97e-03

Pos logit contributions
 Pen+0.003
 unw+0.003
 grand+0.003
 unsc+0.003
 cautiously+0.003
Neg logit contributions
ained-0.007
akes-0.006
affles-0.006
ags-0.006
agons-0.006
Token the
Token ID262
Feature activation-5.04e-03

Pos logit contributions
 Pen+0.005
 unw+0.005
 grand+0.005
 unsc+0.005
 keeper+0.004
Neg logit contributions
akes-0.014
ained-0.014
ahs-0.014
affles-0.014
ags-0.014
Token coc
Token ID8954
Feature activation-1.09e-02

Pos logit contributions
 unw+0.005
 open+0.005
 Pen+0.005
 grand+0.005
 unsc+0.005
Neg logit contributions
ahs-0.009
ained-0.009
akes-0.009
affles-0.009
ined-0.009
Tokenoon
Token ID2049
Feature activation-1.28e-01

Pos logit contributions
 grand+0.018
 Pen+0.016
 keeper+0.016
keeper+0.016
 security+0.015
Neg logit contributions
ined-0.015
ained-0.014
akes-0.014
agging-0.013
agons-0.013
Token.
Token ID13
Feature activation-9.94e-03

Pos logit contributions
 Pen+0.143
 unw+0.132
 grand+0.129
 unsc+0.123
An+0.122
Neg logit contributions
ained-0.211
akes-0.208
ags-0.206
ahs-0.204
ined-0.202
Token He
Token ID679
Feature activation+1.15e-02

Pos logit contributions
 Pen+0.011
 unw+0.010
 grand+0.010
 unsc+0.009
An+0.009
Neg logit contributions
ained-0.017
akes-0.017
ags-0.017
ahs-0.016
affles-0.016
Token was
Token ID373
Feature activation+2.50e-04

Pos logit contributions
ined+0.017
ained+0.017
ahs+0.017
akes+0.017
 highlight+0.017
Neg logit contributions
 Pen-0.013
 grand-0.012
Un-0.012
 keeper-0.011
 unw-0.011
Token no
Token ID645
Feature activation+3.92e-02

Pos logit contributions
ained+0.000
ags+0.000
ined+0.000
 highlight+0.000
akes+0.000
Neg logit contributions
 Pen-0.000
 unw-0.000
 grand-0.000
An-0.000
 unsc-0.000
Token longer
Token ID2392
Feature activation+3.41e-02

Pos logit contributions
ained+0.037
akes+0.036
ags+0.035
ahs+0.035
affles+0.034
Neg logit contributions
 Pen-0.073
 unw-0.072
 grand-0.069
 unsc-0.069
 cautiously-0.067
Token there
Token ID612
Feature activation-2.06e-02

Pos logit contributions
 Pen+0.013
 unw+0.012
 grand+0.011
 unsc+0.011
An+0.011
Neg logit contributions
ained-0.018
akes-0.018
ags-0.018
ahs-0.018
affles-0.017
Token was
Token ID373
Feature activation-2.45e-03

Pos logit contributions
 Pen+0.025
 unw+0.024
 grand+0.023
An+0.023
Un+0.022
Neg logit contributions
ained-0.031
akes-0.030
ahs-0.030
 highlight-0.029
ined-0.029
Token an
Token ID281
Feature activation-1.58e-02

Pos logit contributions
 Pen+0.002
 grand+0.002
 unw+0.002
 unsc+0.002
 keeper+0.002
Neg logit contributions
akes-0.005
ags-0.005
affles-0.005
agons-0.005
ined-0.005
Token available
Token ID1695
Feature activation+3.79e-03

Pos logit contributions
 Pen+0.020
 unw+0.018
 grand+0.018
An+0.018
keeper+0.017
Neg logit contributions
ained-0.024
ags-0.023
akes-0.023
ined-0.023
affles-0.023
Token coc
Token ID8954
Feature activation-9.35e-03

Pos logit contributions
ained+0.006
ahs+0.006
akes+0.006
ags+0.006
affles+0.006
Neg logit contributions
 Pen-0.004
 unw-0.004
 grand-0.004
 unsc-0.004
 keeper-0.004
Tokenoon
Token ID2049
Feature activation-1.28e-01

Pos logit contributions
 Pen+0.011
 grand+0.011
 unw+0.011
 keeper+0.010
keeper+0.010
Neg logit contributions
ained-0.015
akes-0.015
ined-0.015
ags-0.014
affles-0.014
Token in
Token ID287
Feature activation+1.13e-02

Pos logit contributions
 Pen+0.134
 unw+0.127
 grand+0.125
 keeper+0.119
 unsc+0.119
Neg logit contributions
affles-0.216
ags-0.216
akes-0.210
ained-0.210
agons-0.208
Token the
Token ID262
Feature activation-4.79e-03

Pos logit contributions
ained+0.020
ags+0.020
akes+0.019
ahs+0.019
ined+0.019
Neg logit contributions
 Pen-0.011
 unw-0.010
 grand-0.009
keeper-0.009
 keeper-0.009
Token nearby
Token ID6716
Feature activation-2.25e-02

Pos logit contributions
 Pen+0.006
 unw+0.006
 grand+0.006
An+0.006
 unsc+0.006
Neg logit contributions
ained-0.007
ahs-0.007
akes-0.007
affles-0.007
ined-0.007
Token tree
Token ID5509
Feature activation-4.67e-02

Pos logit contributions
 Pen+0.028
 unw+0.025
 grand+0.025
An+0.024
 unsc+0.024
Neg logit contributions
ained-0.034
akes-0.034
ags-0.034
ined-0.032
 highlight-0.032
Token.
Token ID13
Feature activation-1.99e-02

Pos logit contributions
 Pen+0.050
 unw+0.046
 grand+0.045
 unsc+0.043
 keeper+0.043
Neg logit contributions
ained-0.079
ags-0.078
akes-0.078
affles-0.077
ahs-0.076
Token he
Token ID339
Feature activation-1.50e-03

Pos logit contributions
ained+0.005
ags+0.005
akes+0.005
affles+0.005
ined+0.005
Neg logit contributions
 Pen-0.003
 unw-0.003
 grand-0.003
 unsc-0.003
An-0.003
Token spun
Token ID26843
Feature activation+2.33e-02

Pos logit contributions
 Pen+0.002
 grand+0.002
 unw+0.002
Un+0.002
 keeper+0.002
Neg logit contributions
ained-0.002
ined-0.002
ahs-0.002
akes-0.002
 highlight-0.002
Token a
Token ID257
Feature activation-5.00e-03

Pos logit contributions
ained+0.036
akes+0.035
ags+0.035
affles+0.034
ahs+0.034
Neg logit contributions
 Pen-0.030
 unw-0.027
 grand-0.027
 unsc-0.026
An-0.026
Token special
Token ID2041
Feature activation-1.42e-03

Pos logit contributions
 Pen+0.006
 unw+0.005
 grand+0.005
Un+0.005
 Eva+0.005
Neg logit contributions
 highlight-0.007
ahs-0.007
akes-0.007
ained-0.007
ags-0.007
Token coc
Token ID8954
Feature activation-9.01e-03

Pos logit contributions
 Pen+0.001
 unw+0.001
 grand+0.001
 keeper+0.001
 unsc+0.001
Neg logit contributions
ained-0.003
akes-0.003
ags-0.003
affles-0.003
ahs-0.002
Tokenoon
Token ID2049
Feature activation-1.27e-01

Pos logit contributions
 grand+0.014
 Pen+0.013
 keeper+0.013
keeper+0.013
 unw+0.012
Neg logit contributions
ined-0.012
ained-0.012
akes-0.012
agging-0.011
ags-0.011
Token and
Token ID290
Feature activation-1.01e-02

Pos logit contributions
 Pen+0.157
 unw+0.147
 grand+0.143
 unsc+0.138
 keeper+0.136
Neg logit contributions
ained-0.195
akes-0.192
ags-0.192
affles-0.189
ahs-0.188
Token went
Token ID1816
Feature activation+1.25e-02

Pos logit contributions
 Pen+0.012
 unw+0.011
 grand+0.011
 unsc+0.010
An+0.010
Neg logit contributions
ags-0.017
akes-0.017
ained-0.017
affles-0.017
agons-0.017
Token to
Token ID284
Feature activation-1.63e-02

Pos logit contributions
akes+0.021
affles+0.021
ahs+0.019
ained+0.019
 highlight+0.019
Neg logit contributions
 Pen-0.015
 quickly-0.014
 unw-0.014
 cautiously-0.014
 open-0.014
Token sleep
Token ID3993
Feature activation-4.46e-02

Pos logit contributions
keeper+0.022
 Pen+0.022
An+0.021
 keeper+0.021
Un+0.021
Neg logit contributions
ags-0.021
ained-0.020
 highlight-0.020
akes-0.019
ahs-0.019
Token.
Token ID13
Feature activation-9.74e-03

Pos logit contributions
 Pen+0.053
 unw+0.050
 grand+0.048
An+0.047
 unsc+0.046
Neg logit contributions
ained-0.069
ags-0.068
akes-0.067
ined-0.067
ahs-0.066
Token to
Token ID284
Feature activation-1.58e-02

Pos logit contributions
akes+0.023
ahs+0.022
ained+0.022
ined+0.022
affles+0.022
Neg logit contributions
 unw-0.006
 Pen-0.006
 grand-0.005
 open-0.005
 unsc-0.005
Token go
Token ID467
Feature activation-3.72e-03

Pos logit contributions
 unw+0.018
 Pen+0.018
 grand+0.017
 unsc+0.017
 open+0.016
Neg logit contributions
ained-0.027
ahs-0.026
akes-0.026
affles-0.026
ined-0.025
Token into
Token ID656
Feature activation+1.37e-02

Pos logit contributions
 open+0.003
 quickly+0.003
 un+0.002
 unw+0.002
 unt+0.002
Neg logit contributions
affles-0.009
akes-0.009
ahs-0.008
mm-0.008
ined-0.008
Token a
Token ID257
Feature activation-5.15e-03

Pos logit contributions
ained+0.024
ags+0.024
akes+0.023
ined+0.023
affles+0.023
Neg logit contributions
 Pen-0.013
 unw-0.012
keeper-0.012
 keeper-0.011
An-0.011
Token coc
Token ID8954
Feature activation-9.51e-03

Pos logit contributions
 Pen+0.007
 unw+0.006
An+0.006
 unsc+0.006
 grand+0.006
Neg logit contributions
ained-0.007
 highlight-0.007
akes-0.007
ined-0.007
ahs-0.007
Tokenoon
Token ID2049
Feature activation-1.27e-01

Pos logit contributions
 grand+0.014
 Pen+0.013
 keeper+0.013
keeper+0.012
 unw+0.012
Neg logit contributions
ained-0.014
ined-0.014
akes-0.013
ags-0.013
agging-0.013
Token and
Token ID290
Feature activation-8.29e-03

Pos logit contributions
 Pen+0.140
 unw+0.129
 grand+0.126
 unsc+0.120
 keeper+0.119
Neg logit contributions
ained-0.212
akes-0.209
ags-0.208
ahs-0.205
affles-0.204
Token then
Token ID788
Feature activation+1.72e-02

Pos logit contributions
 Pen+0.009
 unw+0.009
 grand+0.009
 unsc+0.008
 quickly+0.008
Neg logit contributions
akes-0.014
ags-0.014
ained-0.014
ined-0.014
affles-0.014
Token he
Token ID339
Feature activation-3.71e-03

Pos logit contributions
ained+0.029
akes+0.028
ags+0.028
ahs+0.027
affles+0.027
Neg logit contributions
 Pen-0.020
 unw-0.018
 grand-0.018
 unsc-0.017
 keeper-0.017
Token would
Token ID561
Feature activation+8.94e-03

Pos logit contributions
 Pen+0.004
 unw+0.004
 grand+0.004
An+0.004
 unsc+0.004
Neg logit contributions
ained-0.006
ahs-0.006
akes-0.006
ined-0.005
affles-0.005
Token become
Token ID1716
Feature activation+7.98e-03

Pos logit contributions
ained+0.012
akes+0.012
ags+0.012
ahs+0.012
 highlight+0.012
Neg logit contributions
 Pen-0.012
 grand-0.011
 unw-0.011
An-0.011
keeper-0.011
Token good
Token ID922
Feature activation-2.81e-02

Pos logit contributions
ained+0.049
akes+0.049
ags+0.048
ahs+0.047
agons+0.047
Neg logit contributions
 Pen-0.026
 grand-0.024
 unw-0.023
 keeper-0.022
An-0.022
Token friends
Token ID2460
Feature activation+4.05e-02

Pos logit contributions
 grand+0.033
 quickly+0.033
 Pen+0.032
 keeper+0.031
 unw+0.030
Neg logit contributions
ined-0.045
ained-0.045
ags-0.044
affles-0.044
akes-0.044
Token.
Token ID13
Feature activation+1.35e-02

Pos logit contributions
ained+0.077
akes+0.076
ags+0.075
affles+0.075
ahs+0.074
Neg logit contributions
 Pen-0.037
 grand-0.033
 unw-0.032
 keeper-0.031
keeper-0.030
Token
Token ID198
Feature activation+1.68e-03

Pos logit contributions
ined+0.027
affles+0.027
akes+0.026
ained+0.026
ags+0.026
Neg logit contributions
 Pen-0.012
 grand-0.010
An-0.010
�-0.009
 unw-0.009
Token
Token ID198
Feature activation-4.12e-05

Pos logit contributions
ained+0.003
ags+0.003
akes+0.003
ahs+0.003
ined+0.003
Neg logit contributions
 Pen-0.002
 unw-0.002
 grand-0.002
 unsc-0.002
keeper-0.002
TokenOne
Token ID3198
Feature activation+7.53e-02

Pos logit contributions
 Pen+0.000
 unw+0.000
 grand+0.000
 unsc+0.000
 keeper+0.000
Neg logit contributions
ained-0.000
akes-0.000
ags-0.000
ahs-0.000
affles-0.000
Token day
Token ID1110
Feature activation+4.29e-02

Pos logit contributions
ags+0.076
agons+0.071
akes+0.070
unes+0.070
ained+0.069
Neg logit contributions
 Pen-0.142
 grand-0.135
 keeper-0.130
An-0.129
 open-0.126
Token Paul
Token ID3362
Feature activation+3.81e-02

Pos logit contributions
ags+0.086
ained+0.085
ined+0.083
akes+0.082
agons+0.082
Neg logit contributions
 Pen-0.045
 grand-0.038
 Eva-0.036
 unw-0.036
An-0.035
Token wanted
Token ID2227
Feature activation+2.92e-03

Pos logit contributions
ained+0.060
ahs+0.059
akes+0.058
ined+0.057
affles+0.056
Neg logit contributions
 Pen-0.044
 unw-0.038
An-0.037
Un-0.037
 grand-0.037
Token to
Token ID284
Feature activation-8.25e-03

Pos logit contributions
ained+0.006
akes+0.006
ags+0.006
ahs+0.006
ined+0.006
Neg logit contributions
 Pen-0.002
 unw-0.002
 grand-0.002
 unsc-0.002
 keeper-0.002
Token paint
Token ID7521
Feature activation+2.98e-02

Pos logit contributions
 Pen+0.013
An+0.012
keeper+0.012
 unw+0.011
 keeper+0.011
Neg logit contributions
ags-0.009
akes-0.009
affles-0.009
ained-0.009
 highlight-0.009
Token to
Token ID284
Feature activation-1.49e-02

Pos logit contributions
akes+0.013
ags+0.013
ained+0.013
affles+0.013
ahs+0.013
Neg logit contributions
 Pen-0.004
 grand-0.003
 unw-0.003
 keeper-0.003
 unsc-0.003
Token be
Token ID307
Feature activation+3.78e-02

Pos logit contributions
 Pen+0.018
 unw+0.018
 grand+0.017
 unsc+0.017
 keeper+0.016
Neg logit contributions
ained-0.023
akes-0.023
ahs-0.022
ags-0.022
ined-0.022
Token kind
Token ID1611
Feature activation-4.87e-02

Pos logit contributions
akes+0.075
ained+0.074
ahs+0.074
affles+0.073
ags+0.073
Neg logit contributions
 Pen-0.031
 unw-0.028
 grand-0.027
 unsc-0.026
An-0.025
Token and
Token ID290
Feature activation-7.30e-03

Pos logit contributions
 Pen+0.055
 unw+0.051
 grand+0.048
 unsc+0.047
An+0.047
Neg logit contributions
ained-0.081
akes-0.079
ags-0.078
ahs-0.077
ined-0.076
Token help
Token ID1037
Feature activation-1.75e-02

Pos logit contributions
 Pen+0.008
 unw+0.008
 grand+0.007
An+0.007
 unsc+0.007
Neg logit contributions
ained-0.012
ags-0.012
akes-0.011
 highlight-0.011
ined-0.011
Token others
Token ID1854
Feature activation+7.52e-02

Pos logit contributions
 Pen+0.030
 grand+0.029
 unw+0.029
 unsc+0.028
An+0.027
Neg logit contributions
ained-0.019
akes-0.018
ahs-0.018
ags-0.018
ined-0.018
Token,
Token ID11
Feature activation+6.53e-03

Pos logit contributions
ained+0.125
ags+0.122
akes+0.122
ahs+0.121
affles+0.120
Neg logit contributions
 Pen-0.085
 unw-0.078
 grand-0.078
 unsc-0.074
An-0.073
Token even
Token ID772
Feature activation+4.47e-02

Pos logit contributions
ags+0.011
akes+0.011
ined+0.011
ained+0.011
affles+0.011
Neg logit contributions
 Pen-0.007
 grand-0.007
 unw-0.007
 unsc-0.006
 keeper-0.006
Token if
Token ID611
Feature activation-4.06e-02

Pos logit contributions
akes+0.078
ags+0.077
ained+0.077
ined+0.075
agons+0.075
Neg logit contributions
 Pen-0.049
 grand-0.048
 unw-0.047
 unsc-0.044
An-0.043
Token they
Token ID484
Feature activation-2.02e-02

Pos logit contributions
 Pen+0.046
 grand+0.043
 unw+0.040
An+0.038
 keeper+0.037
Neg logit contributions
ags-0.072
akes-0.071
ained-0.070
ined-0.069
agons-0.068
Token are
Token ID389
Feature activation-2.05e-02

Pos logit contributions
 Pen+0.026
 unw+0.025
 grand+0.024
 unsc+0.023
An+0.023
Neg logit contributions
ained-0.031
ags-0.030
akes-0.030
ined-0.029
ahs-0.029
Token he
Token ID339
Feature activation-2.42e-03

Pos logit contributions
 Pen+0.012
 unw+0.011
An+0.010
 grand+0.010
Un+0.010
Neg logit contributions
ained-0.015
ahs-0.015
ined-0.014
 highlight-0.014
affles-0.014
Token knew
Token ID2993
Feature activation-2.02e-02

Pos logit contributions
 Pen+0.003
 unw+0.002
Un+0.002
An+0.002
 grand+0.002
Neg logit contributions
ahs-0.004
ained-0.004
ined-0.004
akes-0.004
affles-0.004
Token he
Token ID339
Feature activation-1.75e-03

Pos logit contributions
 Pen+0.025
 unw+0.023
 grand+0.022
An+0.022
 unsc+0.021
Neg logit contributions
ained-0.031
ahs-0.030
ags-0.030
akes-0.030
ined-0.029
Token was
Token ID373
Feature activation-4.37e-04

Pos logit contributions
 Pen+0.002
 unw+0.002
 grand+0.002
An+0.002
 unsc+0.002
Neg logit contributions
ained-0.003
akes-0.003
ahs-0.003
ined-0.003
affles-0.003
Token the
Token ID262
Feature activation-1.36e-03

Pos logit contributions
 Pen+0.001
 unw+0.000
 grand+0.000
 unsc+0.000
 keeper+0.000
Neg logit contributions
akes-0.001
ained-0.001
ahs-0.001
ags-0.001
affles-0.001
Token fastest
Token ID14162
Feature activation+9.04e-02

Pos logit contributions
 Pen+0.003
 grand+0.002
 keeper+0.002
 unw+0.002
An+0.002
Neg logit contributions
akes-0.001
ained-0.001
affles-0.001
ined-0.001
ags-0.001
Token.
Token ID13
Feature activation-1.27e-02

Pos logit contributions
ained+0.137
akes+0.135
ags+0.134
ahs+0.131
affles+0.131
Neg logit contributions
 Pen-0.115
 unw-0.106
 grand-0.105
 unsc-0.100
 keeper-0.099
Token
Token ID198
Feature activation-1.65e-02

Pos logit contributions
 Pen+0.017
 unw+0.015
 unsc+0.014
 keeper+0.014
An+0.014
Neg logit contributions
ained-0.019
ags-0.019
akes-0.018
orting-0.018
unes-0.018
Token
Token ID198
Feature activation-1.57e-02

Pos logit contributions
 Pen+0.019
 unw+0.017
 grand+0.016
 keeper+0.016
keeper+0.015
Neg logit contributions
ained-0.027
ags-0.027
affles-0.026
ined-0.026
akes-0.026
TokenBut
Token ID1537
Feature activation+7.15e-03

Pos logit contributions
 Pen+0.023
 unw+0.019
 grand+0.018
 keeper+0.017
 Eva+0.017
Neg logit contributions
ained-0.023
ined-0.022
 highlight-0.022
affles-0.022
agons-0.022
Token when
Token ID618
Feature activation-2.99e-02

Pos logit contributions
akes+0.012
ags+0.011
affles+0.011
agons+0.011
ained+0.011
Neg logit contributions
 Pen-0.009
 grand-0.009
 unw-0.008
 unsc-0.008
 keeper-0.008
Token mouth
Token ID5422
Feature activation+2.37e-02

Pos logit contributions
ained+0.019
akes+0.019
ags+0.018
affles+0.018
ined+0.018
Neg logit contributions
 Pen-0.011
 unw-0.011
 grand-0.011
 unsc-0.010
 cautiously-0.010
Token.
Token ID13
Feature activation+1.57e-02

Pos logit contributions
ained+0.042
akes+0.042
ags+0.042
affles+0.041
agons+0.041
Neg logit contributions
 Pen-0.024
 unw-0.022
 grand-0.021
 unsc-0.020
 keeper-0.020
Token
Token ID198
Feature activation+6.42e-03

Pos logit contributions
ained+0.031
akes+0.030
ags+0.030
ined+0.030
affles+0.030
Neg logit contributions
 Pen-0.014
 grand-0.012
 unw-0.012
An-0.011
 unsc-0.010
Token
Token ID198
Feature activation+4.71e-03

Pos logit contributions
ained+0.010
ags+0.010
akes+0.010
ined+0.010
affles+0.010
Neg logit contributions
 Pen-0.007
 unw-0.007
 grand-0.006
keeper-0.006
 keeper-0.006
Token"
Token ID1
Feature activation-2.07e-02

Pos logit contributions
ained+0.008
akes+0.008
ags+0.008
agons+0.008
affles+0.008
Neg logit contributions
 Pen-0.005
 unw-0.005
 grand-0.004
 keeper-0.004
keeper-0.004
TokenLook
Token ID8567
Feature activation+8.00e-02

Pos logit contributions
 Pen+0.026
An+0.024
Un+0.023
 unw+0.022
 Eva+0.021
Neg logit contributions
agons-0.035
ained-0.035
affles-0.034
akes-0.034
ags-0.034
Token,
Token ID11
Feature activation+3.05e-02

Pos logit contributions
ained+0.151
akes+0.142
ined+0.141
ags+0.139
 highlight+0.137
Neg logit contributions
 Pen-0.087
 grand-0.084
 unw-0.082
 open-0.073
 quickly-0.073
Token our
Token ID674
Feature activation+2.86e-02

Pos logit contributions
ained+0.059
akes+0.057
ags+0.057
ined+0.057
agons+0.056
Neg logit contributions
 Pen-0.030
 grand-0.027
 unw-0.026
 keeper-0.024
 unsc-0.024
Token snow
Token ID6729
Feature activation+4.33e-02

Pos logit contributions
ained+0.051
ined+0.048
ags+0.048
agged+0.047
akes+0.046
Neg logit contributions
 grand-0.034
 Pen-0.034
 unw-0.031
 keeper-0.030
keeper-0.028
Tokenman
Token ID805
Feature activation+2.88e-02

Pos logit contributions
ained+0.077
 highlight+0.075
ags+0.075
affles+0.072
agged+0.070
Neg logit contributions
 Pen-0.046
 keeper-0.044
 grand-0.041
keeper-0.040
 unw-0.039
Token is
Token ID318
Feature activation-3.23e-02

Pos logit contributions
ained+0.050
ags+0.050
 highlight+0.048
akes+0.047
affles+0.047
Neg logit contributions
 Pen-0.032
 unw-0.031
 grand-0.030
 keeper-0.029
 unsc-0.028
Token made
Token ID925
Feature activation+1.82e-02

Pos logit contributions
ained+0.007
ined+0.007
akes+0.007
ahs+0.007
affles+0.007
Neg logit contributions
 Pen-0.007
 grand-0.006
 unw-0.006
 keeper-0.006
Un-0.006
Token some
Token ID617
Feature activation+9.31e-03

Pos logit contributions
ained+0.029
ags+0.029
akes+0.029
ahs+0.029
ined+0.028
Neg logit contributions
 Pen-0.020
 unw-0.019
 grand-0.018
An-0.018
 keeper-0.018
Token small
Token ID1402
Feature activation+3.48e-03

Pos logit contributions
ained+0.019
affles+0.017
w+0.017
akes+0.017
reetings+0.017
Neg logit contributions
 open-0.009
 unw-0.009
 Pen-0.008
 quickly-0.008
 grand-0.008
Token balls
Token ID11333
Feature activation+1.56e-02

Pos logit contributions
ained+0.006
affles+0.006
 highlight+0.006
ags+0.006
ined+0.006
Neg logit contributions
 Pen-0.004
 open-0.004
 grand-0.004
 unw-0.004
 gasped-0.003
Token of
Token ID286
Feature activation+7.38e-03

Pos logit contributions
ained+0.025
akes+0.024
ags+0.024
ahs+0.024
affles+0.024
Neg logit contributions
 Pen-0.018
 unw-0.017
 grand-0.016
 unsc-0.016
An-0.016
Token clay
Token ID21558
Feature activation+7.80e-02

Pos logit contributions
ained+0.012
akes+0.012
affles+0.012
 highlight+0.012
ined+0.012
Neg logit contributions
 Pen-0.009
 unw-0.008
 grand-0.008
 unsc-0.007
 open-0.007
Token and
Token ID290
Feature activation+7.98e-03

Pos logit contributions
ained+0.140
ags+0.135
affles+0.134
ined+0.133
akes+0.132
Neg logit contributions
 Pen-0.078
 unw-0.075
 grand-0.071
 keeper-0.071
 unsc-0.068
Token stuck
Token ID7819
Feature activation+1.27e-02

Pos logit contributions
ained+0.014
ags+0.014
akes+0.014
agons+0.014
affles+0.013
Neg logit contributions
 unw-0.009
 Pen-0.009
 grand-0.009
 unsc-0.009
 gasped-0.008
Token them
Token ID606
Feature activation+3.58e-02

Pos logit contributions
ained+0.018
akes+0.018
affles+0.018
ags+0.017
ahs+0.017
Neg logit contributions
 Pen-0.017
 unw-0.016
 grand-0.016
 unsc-0.015
An-0.015
Token on
Token ID319
Feature activation+2.85e-02

Pos logit contributions
ained+0.053
akes+0.052
ags+0.052
affles+0.051
ahs+0.051
Neg logit contributions
 Pen-0.047
 unw-0.044
 grand-0.042
 unsc-0.041
 keeper-0.041
Token her
Token ID607
Feature activation-1.61e-02

Pos logit contributions
ags+0.044
akes+0.044
ained+0.043
ahs+0.043
ined+0.041
Neg logit contributions
 Pen-0.031
 unw-0.029
keeper-0.028
 keeper-0.028
 grand-0.027
Tokeneh
Token ID17231
Feature activation+5.21e-02

Pos logit contributions
 highlight+0.062
ahs+0.060
akes+0.058
orting+0.058
affles+0.057
Neg logit contributions
 Pen-0.047
 unw-0.046
 unsc-0.044
An-0.044
 open-0.043
Tokenive
Token ID425
Feature activation-3.02e-02

Pos logit contributions
ained+0.080
ined+0.078
affles+0.073
 highlight+0.073
agons+0.073
Neg logit contributions
 Pen-0.063
 unw-0.058
keeper-0.058
 keeper-0.057
 unsc-0.054
Token.
Token ID13
Feature activation-1.83e-03

Pos logit contributions
 Pen+0.034
 unw+0.032
 grand+0.031
 unsc+0.029
 keeper+0.029
Neg logit contributions
ained-0.049
ags-0.049
akes-0.049
affles-0.048
ahs-0.048
Token Bob
Token ID5811
Feature activation-5.81e-03

Pos logit contributions
 Pen+0.002
 unw+0.002
 grand+0.002
 unsc+0.002
An+0.002
Neg logit contributions
ained-0.003
akes-0.003
ags-0.003
ahs-0.003
affles-0.003
Tokeno
Token ID78
Feature activation+1.72e-02

Pos logit contributions
 Pen+0.009
 security+0.009
 grand+0.009
 unw+0.009
 abrupt+0.008
Neg logit contributions
ahs-0.006
ined-0.006
akes-0.006
w-0.005
ained-0.005
Token said
Token ID531
Feature activation+1.99e-02

Pos logit contributions
ahs+0.027
ained+0.027
akes+0.027
ined+0.026
affles+0.026
Neg logit contributions
 Pen-0.019
 grand-0.017
 unw-0.017
Un-0.016
 unsc-0.016
Token,
Token ID11
Feature activation+7.82e-03

Pos logit contributions
ained+0.032
akes+0.031
ags+0.031
agons+0.030
affles+0.030
Neg logit contributions
 Pen-0.025
 unw-0.023
 grand-0.023
 keeper-0.022
 unsc-0.021
Token "
Token ID366
Feature activation-2.78e-02

Pos logit contributions
ained+0.017
akes+0.017
ags+0.017
affles+0.016
ined+0.016
Neg logit contributions
 Pen-0.006
 grand-0.005
 unw-0.005
 keeper-0.004
An-0.004
TokenLook
Token ID8567
Feature activation+6.40e-02

Pos logit contributions
 Pen+0.036
An+0.033
Un+0.032
 unw+0.032
 keeper+0.031
Neg logit contributions
ags-0.044
affles-0.043
ained-0.042
agons-0.042
akes-0.041
Token,
Token ID11
Feature activation+1.16e-02

Pos logit contributions
ained+0.097
akes+0.094
ags+0.093
ined+0.091
ahs+0.090
Neg logit contributions
 Pen-0.084
 unw-0.078
 grand-0.078
 unsc-0.073
An-0.072
Token L
Token ID406
Feature activation-5.46e-03

Pos logit contributions
ahs+0.017
 highlight+0.017
akes+0.017
ained+0.017
affles+0.017
Neg logit contributions
 unw-0.013
 Pen-0.012
 unsc-0.012
An-0.012
 cautiously-0.011
Token said
Token ID531
Feature activation+2.29e-02

Pos logit contributions
 Pen+0.002
 unw+0.002
An+0.002
 grand+0.002
 unsc+0.002
Neg logit contributions
ained-0.002
ahs-0.002
akes-0.002
ined-0.002
 highlight-0.002
Token,
Token ID11
Feature activation+1.56e-02

Pos logit contributions
ained+0.040
akes+0.039
ags+0.038
agons+0.038
 highlight+0.037
Neg logit contributions
 Pen-0.027
 grand-0.024
 unw-0.024
keeper-0.023
 keeper-0.023
Token "
Token ID366
Feature activation-1.95e-02

Pos logit contributions
ained+0.032
akes+0.032
ags+0.032
affles+0.031
ined+0.031
Neg logit contributions
 Pen-0.012
 unw-0.011
 grand-0.011
 unsc-0.010
An-0.010
TokenThank
Token ID10449
Feature activation+1.62e-02

Pos logit contributions
 Pen+0.024
An+0.023
Un+0.022
 unw+0.022
 grand+0.021
Neg logit contributions
ained-0.031
ags-0.031
affles-0.031
agons-0.030
unes-0.030
Token you
Token ID345
Feature activation+1.71e-02

Pos logit contributions
ained+0.025
ahs+0.023
akes+0.023
ined+0.023
ags+0.023
Neg logit contributions
 Pen-0.022
 unw-0.020
 grand-0.020
An-0.020
 keeper-0.019
Token for
Token ID329
Feature activation+1.94e-02

Pos logit contributions
ained+0.024
akes+0.023
ags+0.023
ahs+0.023
affles+0.022
Neg logit contributions
 Pen-0.024
 unw-0.022
 grand-0.022
 unsc-0.021
An-0.021
Token helping
Token ID5742
Feature activation-6.33e-03

Pos logit contributions
ained+0.025
akes+0.025
ags+0.024
ahs+0.024
affles+0.024
Neg logit contributions
 Pen-0.029
 unw-0.027
 grand-0.027
 unsc-0.026
An-0.026
Token me
Token ID502
Feature activation+5.31e-02

Pos logit contributions
 Pen+0.006
 unw+0.006
 grand+0.006
 unsc+0.005
An+0.005
Neg logit contributions
ained-0.011
akes-0.011
ags-0.011
ahs-0.011
ined-0.011
Token help
Token ID1037
Feature activation-1.09e-02

Pos logit contributions
ained+0.092
akes+0.088
ags+0.088
ahs+0.088
ined+0.087
Neg logit contributions
 Pen-0.058
 unw-0.054
 grand-0.053
 unsc-0.052
 keeper-0.050
Token others
Token ID1854
Feature activation+8.03e-02

Pos logit contributions
 Pen+0.013
 unw+0.012
 grand+0.012
 unsc+0.012
An+0.012
Neg logit contributions
ained-0.017
akes-0.017
ags-0.017
affles-0.016
ahs-0.016
Token."
Token ID526
Feature activation-1.04e-02

Pos logit contributions
ained+0.126
akes+0.124
ags+0.123
ahs+0.122
affles+0.121
Neg logit contributions
 Pen-0.096
 unw-0.089
 grand-0.088
 unsc-0.084
An-0.083
Token smiled
Token ID13541
Feature activation-1.56e-02

Pos logit contributions
 Pen+0.003
An+0.003
 unw+0.003
 grand+0.003
case+0.003
Neg logit contributions
ahs-0.004
ined-0.004
ained-0.004
orting-0.004
ags-0.004
Token again
Token ID757
Feature activation-3.54e-03

Pos logit contributions
 Pen+0.024
 unw+0.021
 unsc+0.020
An+0.020
 Eva+0.020
Neg logit contributions
ained-0.020
ined-0.020
ags-0.019
ging-0.019
agging-0.019
Token.
Token ID13
Feature activation+4.08e-03

Pos logit contributions
 Pen+0.003
 grand+0.003
 unw+0.003
 unsc+0.002
 keeper+0.002
Neg logit contributions
ained-0.007
akes-0.007
ags-0.007
affles-0.007
ahs-0.007
Token "
Token ID366
Feature activation-1.96e-02

Pos logit contributions
akes+0.008
ained+0.008
affles+0.008
ags+0.008
ahs+0.008
Neg logit contributions
 Pen-0.003
 grand-0.003
 unw-0.003
keeper-0.003
 keeper-0.003
TokenNo
Token ID2949
Feature activation+1.23e-02

Pos logit contributions
 Pen+0.022
 unw+0.020
 grand+0.020
An+0.019
 unsc+0.019
Neg logit contributions
ained-0.033
akes-0.033
ags-0.032
ahs-0.032
affles-0.032
Token,
Token ID11
Feature activation+1.15e-02

Pos logit contributions
ained+0.026
ags+0.024
akes+0.024
ined+0.024
 highlight+0.023
Neg logit contributions
 grand-0.013
 Pen-0.012
 quickly-0.011
 unw-0.010
 keeper-0.010
Token it
Token ID340
Feature activation+5.16e-03

Pos logit contributions
ained+0.021
akes+0.021
ags+0.020
ined+0.020
agons+0.020
Neg logit contributions
 Pen-0.012
 grand-0.012
 unw-0.011
 keeper-0.011
 unsc-0.011
Token's
Token ID338
Feature activation-2.02e-02

Pos logit contributions
akes+0.010
ained+0.010
ahs+0.010
ined+0.010
affles+0.009
Neg logit contributions
 Pen-0.004
 unw-0.004
Un-0.004
 grand-0.004
An-0.003
Token not
Token ID407
Feature activation-2.71e-02

Pos logit contributions
 Pen+0.022
 grand+0.022
 unw+0.022
 open+0.020
 unsc+0.020
Neg logit contributions
akes-0.036
affles-0.034
ahs-0.034
ained-0.033
ined-0.033
Token mine
Token ID6164
Feature activation+5.88e-02

Pos logit contributions
 Pen+0.030
 grand+0.029
 unw+0.029
 unsc+0.027
 keeper+0.026
Neg logit contributions
akes-0.046
ained-0.045
affles-0.045
ags-0.045
ahs-0.045
Token.
Token ID13
Feature activation+8.37e-03

Pos logit contributions
ained+0.106
ags+0.105
affles+0.105
akes+0.105
 highlight+0.104
Neg logit contributions
 Pen-0.057
 grand-0.052
 unw-0.052
 keeper-0.049
keeper-0.048
Token was
Token ID373
Feature activation+2.80e-03

Pos logit contributions
ained+0.056
akes+0.055
ags+0.054
ahs+0.054
ined+0.054
Neg logit contributions
 Pen-0.035
 unw-0.032
 grand-0.031
 unsc-0.029
An-0.029
Token still
Token ID991
Feature activation+4.49e-04

Pos logit contributions
ained+0.004
ags+0.004
ined+0.004
 highlight+0.004
ahs+0.004
Neg logit contributions
 Pen-0.003
 unw-0.003
 grand-0.003
 unsc-0.003
An-0.003
Token a
Token ID257
Feature activation+2.66e-03

Pos logit contributions
akes+0.001
affles+0.001
ained+0.001
agons+0.001
ahs+0.001
Neg logit contributions
 Pen-0.000
 unw-0.000
 grand-0.000
 keeper-0.000
 unsc-0.000
Token bit
Token ID1643
Feature activation+3.39e-03

Pos logit contributions
ained+0.005
ags+0.005
akes+0.005
affles+0.005
ined+0.005
Neg logit contributions
 Pen-0.002
 grand-0.002
 unw-0.002
 keeper-0.002
 unsc-0.002
Token angry
Token ID7954
Feature activation+1.15e-02

Pos logit contributions
ained+0.005
akes+0.005
ags+0.005
ahs+0.005
affles+0.005
Neg logit contributions
 Pen-0.004
 unw-0.004
 grand-0.004
 unsc-0.003
 keeper-0.003
Token but
Token ID475
Feature activation+1.97e-02

Pos logit contributions
ained+0.021
akes+0.021
affles+0.021
 highlight+0.021
ags+0.021
Neg logit contributions
 Pen-0.012
 grand-0.010
 keeper-0.010
 unw-0.010
keeper-0.010
Token he
Token ID339
Feature activation+2.33e-03

Pos logit contributions
ained+0.030
ahs+0.029
ined+0.029
 highlight+0.028
ags+0.028
Neg logit contributions
 Pen-0.024
 unw-0.023
 grand-0.022
 unsc-0.021
An-0.021
Token said
Token ID531
Feature activation+1.94e-02

Pos logit contributions
ined+0.003
ahs+0.003
ained+0.003
 highlight+0.003
affles+0.003
Neg logit contributions
 Pen-0.003
 grand-0.003
 unw-0.003
An-0.002
 unsc-0.002
Token "
Token ID366
Feature activation-2.11e-02

Pos logit contributions
ained+0.033
akes+0.033
ags+0.032
agons+0.032
affles+0.031
Neg logit contributions
 Pen-0.023
 grand-0.020
 unw-0.020
keeper-0.020
 keeper-0.020
TokenIt
Token ID1026
Feature activation-8.03e-03

Pos logit contributions
 Pen+0.023
An+0.021
 unw+0.020
Un+0.020
 keeper+0.020
Neg logit contributions
ags-0.037
agons-0.036
affles-0.036
ained-0.035
akes-0.035
Token's
Token ID338
Feature activation-2.20e-02

Pos logit contributions
 Pen+0.005
 unw+0.005
 grand+0.004
 unsc+0.004
An+0.004
Neg logit contributions
ained-0.017
akes-0.017
ags-0.017
ahs-0.017
ined-0.016
Token how
Token ID703
Feature activation-3.10e-02

Pos logit contributions
 Pen+0.007
 unw+0.007
 grand+0.007
 unsc+0.006
An+0.006
Neg logit contributions
ained-0.007
akes-0.007
ags-0.007
ahs-0.006
ined-0.006
Token it
Token ID340
Feature activation+1.07e-02

Pos logit contributions
 Pen+0.036
 unw+0.033
 grand+0.031
 unsc+0.030
Un+0.030
Neg logit contributions
ained-0.047
ahs-0.047
ined-0.046
akes-0.046
 highlight-0.046
Token would
Token ID561
Feature activation+2.14e-02

Pos logit contributions
ined+0.016
ained+0.016
akes+0.016
ahs+0.016
 highlight+0.015
Neg logit contributions
 Pen-0.013
 grand-0.012
 unw-0.011
Un-0.011
keeper-0.011
Token taste
Token ID6938
Feature activation-2.51e-02

Pos logit contributions
ained+0.034
akes+0.034
ags+0.034
affles+0.033
ahs+0.033
Neg logit contributions
 Pen-0.025
 unw-0.024
 grand-0.023
 unsc-0.023
 keeper-0.022
Token.
Token ID13
Feature activation-1.83e-03

Pos logit contributions
 Pen+0.027
 unw+0.027
 grand+0.026
 quickly+0.025
 unsc+0.025
Neg logit contributions
ags-0.042
ained-0.042
akes-0.042
affles-0.041
ahs-0.041
Token He
Token ID679
Feature activation+2.09e-02

Pos logit contributions
 Pen+0.002
 unw+0.002
 keeper+0.002
 grand+0.002
 unsc+0.002
Neg logit contributions
ained-0.003
akes-0.003
ahs-0.003
ags-0.003
 highlight-0.003
Token was
Token ID373
Feature activation+1.38e-02

Pos logit contributions
ined+0.031
ahs+0.031
ained+0.030
affles+0.030
 highlight+0.030
Neg logit contributions
 Pen-0.024
 grand-0.022
 unw-0.021
Un-0.021
 keeper-0.021
Token so
Token ID523
Feature activation+5.14e-02

Pos logit contributions
ained+0.022
 highlight+0.022
ined+0.022
ags+0.021
ahs+0.021
Neg logit contributions
 Pen-0.015
 unw-0.015
 grand-0.014
keeper-0.014
 keeper-0.013
Token excited
Token ID6568
Feature activation-4.77e-02

Pos logit contributions
ined+0.060
 highlight+0.060
ained+0.056
ags+0.052
affles+0.051
Neg logit contributions
 Pen-0.079
 unw-0.074
keeper-0.072
 keeper-0.071
 grand-0.071
Token to
Token ID284
Feature activation-5.15e-03

Pos logit contributions
 Pen+0.043
 unw+0.039
 grand+0.039
 keeper+0.036
 unsc+0.036
Neg logit contributions
ained-0.090
ags-0.089
akes-0.089
affles-0.087
ined-0.087
Token try
Token ID1949
Feature activation-1.29e-02

Pos logit contributions
 Pen+0.007
 unw+0.007
 grand+0.007
 unsc+0.007
An+0.007
Neg logit contributions
ained-0.007
akes-0.007
ags-0.007
ahs-0.007
ined-0.007
Token She
Token ID1375
Feature activation-8.96e-03

Pos logit contributions
 Pen+0.001
 unw+0.001
 grand+0.001
 unsc+0.001
An+0.001
Neg logit contributions
ained-0.001
akes-0.001
ags-0.001
ahs-0.001
affles-0.001
Token then
Token ID788
Feature activation+2.01e-02

Pos logit contributions
 Pen+0.011
 unw+0.009
An+0.009
 grand+0.009
Un+0.009
Neg logit contributions
ined-0.013
ahs-0.013
ained-0.013
akes-0.013
affles-0.013
Token wrapped
Token ID12908
Feature activation-5.54e-02

Pos logit contributions
ained+0.032
akes+0.032
ags+0.032
ahs+0.031
affles+0.031
Neg logit contributions
 Pen-0.024
 unw-0.022
 grand-0.021
 unsc-0.021
An-0.020
Token them
Token ID606
Feature activation+2.84e-02

Pos logit contributions
 Pen+0.058
 unw+0.052
 grand+0.052
An+0.049
 unsc+0.048
Neg logit contributions
ained-0.097
akes-0.095
ags-0.094
affles-0.093
ahs-0.093
Token around
Token ID1088
Feature activation+9.20e-03

Pos logit contributions
ained+0.042
akes+0.041
affles+0.041
ags+0.041
ahs+0.040
Neg logit contributions
 Pen-0.037
 unw-0.035
 grand-0.034
 unsc-0.033
An-0.033
Token and
Token ID290
Feature activation-5.18e-03

Pos logit contributions
ained+0.018
akes+0.017
ags+0.017
affles+0.017
ined+0.017
Neg logit contributions
 Pen-0.008
 unw-0.007
 grand-0.007
An-0.007
 unsc-0.007
Token around
Token ID1088
Feature activation+8.52e-03

Pos logit contributions
 Pen+0.006
 unw+0.006
 grand+0.006
 unsc+0.005
An+0.005
Neg logit contributions
ained-0.008
akes-0.008
ags-0.008
ahs-0.008
ined-0.008
Token until
Token ID1566
Feature activation+6.33e-03

Pos logit contributions
ained+0.017
affles+0.016
ags+0.016
akes+0.016
agons+0.016
Neg logit contributions
 Pen-0.008
 grand-0.007
 unw-0.007
 keeper-0.007
 unsc-0.007
Token the
Token ID262
Feature activation-9.94e-06

Pos logit contributions
ags+0.013
agons+0.013
affles+0.013
akes+0.013
ained+0.013
Neg logit contributions
 Pen-0.006
 grand-0.005
 unw-0.005
 Eva-0.005
 quickly-0.005
Token yarn
Token ID21181
Feature activation+2.19e-02

Feature not active on prev token;
no predictions were affected.
Token was
Token ID373
Feature activation-1.68e-03

Pos logit contributions
ained+0.035
akes+0.035
ahs+0.034
ined+0.034
ags+0.034
Neg logit contributions
 Pen-0.025
 grand-0.022
 unw-0.022
An-0.022
Un-0.021
Token in
Token ID287
Feature activation+2.14e-02

Pos logit contributions
ahs+0.003
akes+0.003
affles+0.003
 highlight+0.003
agons+0.003
Neg logit contributions
 Pen-0.002
 grand-0.002
 unw-0.001
 open-0.001
 keeper-0.001
Token the
Token ID262
Feature activation+1.13e-02

Pos logit contributions
ained+0.031
ags+0.030
akes+0.028
 highlight+0.028
 ones+0.027
Neg logit contributions
 Pen-0.025
keeper-0.025
 unw-0.025
Un-0.024
 gasped-0.024
Token tree
Token ID5509
Feature activation-3.80e-02

Pos logit contributions
affles+0.024
ahs+0.024
akes+0.023
ained+0.023
asers+0.022
Neg logit contributions
 Pen-0.008
 grand-0.008
 keeper-0.007
 open-0.007
 unw-0.007
Token?"
Token ID1701
Feature activation+1.52e-02

Pos logit contributions
 Pen+0.064
 unw+0.060
 grand+0.060
 unsc+0.058
 keeper+0.058
Neg logit contributions
ained-0.041
akes-0.041
ags-0.041
affles-0.040
ahs-0.040
Token
Token ID198
Feature activation-5.49e-03

Pos logit contributions
akes+0.033
ained+0.033
 highlight+0.033
affles+0.033
ahs+0.033
Neg logit contributions
 Pen-0.010
 grand-0.009
 unw-0.009
An-0.008
 unsc-0.007
Token
Token ID198
Feature activation-5.26e-03

Pos logit contributions
 Pen+0.007
 unw+0.006
 grand+0.006
 unsc+0.006
 Eva+0.005
Neg logit contributions
ags-0.009
ained-0.008
ined-0.008
agons-0.008
akes-0.008
Token"
Token ID1
Feature activation-3.33e-02

Pos logit contributions
 Pen+0.007
 unw+0.006
 grand+0.006
 Eva+0.006
abel+0.006
Neg logit contributions
ags-0.007
agons-0.007
orting-0.007
ined-0.007
ained-0.007
TokenBecause
Token ID8128
Feature activation+2.39e-02

Pos logit contributions
 Pen+0.040
 unw+0.037
 grand+0.036
 unsc+0.035
 keeper+0.034
Neg logit contributions
ained-0.053
akes-0.052
ags-0.051
ahs-0.051
ined-0.050
Token it
Token ID340
Feature activation+6.89e-03

Pos logit contributions
ags+0.043
ained+0.043
akes+0.042
ined+0.042
affles+0.042
Neg logit contributions
 Pen-0.026
 grand-0.023
 unw-0.021
An-0.021
 unsc-0.020
Token is
Token ID318
Feature activation-4.20e-02

Pos logit contributions
ained+0.012
akes+0.011
ags+0.011
ahs+0.011
affles+0.011
Neg logit contributions
 Pen-0.008
 unw-0.007
 grand-0.007
 unsc-0.007
An-0.006
Token my
Token ID616
Feature activation+2.06e-02

Pos logit contributions
 Pen+0.040
 grand+0.038
 unw+0.036
 open+0.035
 keeper+0.033
Neg logit contributions
akes-0.083
affles-0.080
ahs-0.078
ags-0.078
ined-0.078
Token.
Token ID13
Feature activation-4.08e-03

Pos logit contributions
 Pen+0.024
 unw+0.022
 grand+0.021
An+0.021
Un+0.021
Neg logit contributions
ained-0.029
akes-0.028
ahs-0.028
ined-0.028
ags-0.028
Token The
Token ID383
Feature activation+2.35e-02

Pos logit contributions
 Pen+0.004
 unw+0.004
 grand+0.004
 unsc+0.004
 keeper+0.004
Neg logit contributions
ained-0.007
ags-0.007
akes-0.007
ahs-0.006
ined-0.006
Token fancy
Token ID14996
Feature activation-4.03e-03

Pos logit contributions
ained+0.044
akes+0.044
affles+0.043
ags+0.041
agons+0.040
Neg logit contributions
 keeper-0.025
 grand-0.024
 open-0.023
 Pen-0.022
 unw-0.022
Token elephant
Token ID20950
Feature activation+3.79e-02

Pos logit contributions
 keeper+0.004
 Pen+0.004
 grand+0.004
 unw+0.004
keeper+0.004
Neg logit contributions
ained-0.007
affles-0.007
akes-0.007
ags-0.007
agons-0.007
Token felt
Token ID2936
Feature activation+2.70e-03

Pos logit contributions
ined+0.061
ained+0.061
akes+0.060
ahs+0.060
ags+0.058
Neg logit contributions
 Pen-0.042
 grand-0.037
 unw-0.037
Un-0.036
An-0.036
Token good
Token ID922
Feature activation-3.26e-02

Pos logit contributions
akes+0.004
ained+0.004
ahs+0.004
ags+0.004
affles+0.004
Neg logit contributions
 Pen-0.003
 unw-0.003
 grand-0.003
 unsc-0.003
 keeper-0.003
Token and
Token ID290
Feature activation-4.12e-03

Pos logit contributions
 Pen+0.048
 unw+0.046
 grand+0.045
 unsc+0.043
 keeper+0.043
Neg logit contributions
ained-0.042
akes-0.042
ags-0.041
ahs-0.041
affles-0.040
Token went
Token ID1816
Feature activation+1.31e-02

Pos logit contributions
 Pen+0.006
An+0.005
 unw+0.005
Laura+0.005
 Eva+0.005
Neg logit contributions
ined-0.006
ained-0.006
 highlight-0.005
ags-0.005
ahs-0.005
Token home
Token ID1363
Feature activation+1.77e-02

Pos logit contributions
akes+0.026
affles+0.025
ahs+0.024
 highlight+0.023
agons+0.023
Neg logit contributions
 quickly-0.013
 unw-0.013
 open-0.013
 keeper-0.012
 Pen-0.012
Token.
Token ID13
Feature activation-3.52e-03

Pos logit contributions
ined+0.020
ained+0.020
ags+0.020
agging+0.019
 shining+0.019
Neg logit contributions
 Pen-0.026
Un-0.025
An-0.025
Laura-0.024
urious-0.024
Token<|endoftext|>
Token ID50256
Feature activation+4.08e-02

Pos logit contributions
 unw+0.005
 Pen+0.004
 keeper+0.004
 unh+0.004
 unsc+0.004
Neg logit contributions
ained-0.005
akes-0.005
ags-0.005
ahs-0.004
orting-0.004
Token new
Token ID649
Feature activation-1.19e-03

Pos logit contributions
 grand+0.002
 Pen+0.002
 unw+0.002
 keeper+0.001
 unsc+0.001
Neg logit contributions
ained-0.003
akes-0.003
affles-0.003
ahs-0.003
ined-0.003
Token doll
Token ID3654
Feature activation+1.59e-02

Pos logit contributions
 Pen+0.001
 grand+0.001
 unw+0.001
 keeper+0.001
 unsc+0.001
Neg logit contributions
ained-0.002
akes-0.002
ags-0.002
affles-0.002
ahs-0.002
Token and
Token ID290
Feature activation-9.46e-03

Pos logit contributions
ained+0.028
akes+0.028
ags+0.028
ahs+0.027
affles+0.027
Neg logit contributions
 Pen-0.016
 unw-0.015
 grand-0.014
 unsc-0.014
 keeper-0.014
Token coloring
Token ID33988
Feature activation+2.41e-02

Pos logit contributions
 Pen+0.010
 unw+0.009
 grand+0.009
 unsc+0.009
 keeper+0.009
Neg logit contributions
ained-0.016
akes-0.016
ags-0.016
ahs-0.016
affles-0.016
Token book
Token ID1492
Feature activation-2.26e-02

Pos logit contributions
ained+0.036
akes+0.036
ags+0.035
affles+0.035
ahs+0.035
Neg logit contributions
 Pen-0.031
 unw-0.029
 grand-0.029
 unsc-0.027
 keeper-0.027
Token.
Token ID13
Feature activation-1.16e-02

Pos logit contributions
 Pen+0.026
 unw+0.023
 grand+0.023
An+0.023
 unsc+0.022
Neg logit contributions
ained-0.036
akes-0.035
ahs-0.035
ags-0.035
ined-0.034
Token She
Token ID1375
Feature activation-9.83e-03

Pos logit contributions
 Pen+0.010
 unw+0.009
 grand+0.009
An+0.008
 unsc+0.008
Neg logit contributions
ained-0.022
ags-0.022
akes-0.022
affles-0.022
ahs-0.022
Token couldn
Token ID3521
Feature activation+7.52e-04

Pos logit contributions
 Pen+0.011
 unw+0.010
 grand+0.010
 keeper+0.010
An+0.010
Neg logit contributions
ained-0.015
ined-0.015
ahs-0.015
akes-0.015
 highlight-0.015
Token't
Token ID470
Feature activation-1.64e-02

Pos logit contributions
ained+0.001
akes+0.001
ahs+0.001
ags+0.001
ined+0.001
Neg logit contributions
 Pen-0.001
 grand-0.001
 unw-0.001
keeper-0.001
 keeper-0.001
Token believe
Token ID1975
Feature activation-4.09e-02

Pos logit contributions
 Pen+0.021
 unw+0.021
 grand+0.020
 unsc+0.019
An+0.018
Neg logit contributions
ained-0.026
ahs-0.025
akes-0.025
ags-0.025
affles-0.024
Token it
Token ID340
Feature activation-1.59e-03

Pos logit contributions
 Pen+0.045
 unw+0.042
 keeper+0.039
 grand+0.039
keeper+0.039
Neg logit contributions
ained-0.066
ags-0.065
ined-0.064
ahs-0.064
akes-0.063
Token remember
Token ID3505
Feature activation-2.83e-02

Pos logit contributions
ained+0.017
akes+0.016
unes+0.016
affles+0.016
ahs+0.016
Neg logit contributions
 grand-0.011
 unw-0.010
 Pen-0.010
 un-0.010
 open-0.010
Token the
Token ID262
Feature activation+4.89e-04

Pos logit contributions
 Pen+0.023
 grand+0.021
 unw+0.020
 unsc+0.018
An+0.018
Neg logit contributions
akes-0.058
ained-0.057
ags-0.056
ahs-0.056
affles-0.056
Token day
Token ID1110
Feature activation+3.35e-02

Pos logit contributions
affles+0.001
ained+0.001
unes+0.001
akes+0.001
agged+0.001
Neg logit contributions
 grand-0.001
 keeper-0.001
 open-0.000
 Pen-0.000
 security-0.000
Token she
Token ID673
Feature activation-1.81e-02

Pos logit contributions
ained+0.053
akes+0.052
ags+0.051
ahs+0.051
 highlight+0.050
Neg logit contributions
 Pen-0.039
 unw-0.037
 grand-0.035
 unsc-0.035
An-0.034
Token sang
Token ID25889
Feature activation+1.58e-02

Pos logit contributions
 Pen+0.020
 unw+0.019
 grand+0.019
 unsc+0.018
 keeper+0.017
Neg logit contributions
ained-0.031
ags-0.030
akes-0.030
ahs-0.029
affles-0.029
Token with
Token ID351
Feature activation-1.25e-02

Pos logit contributions
ained+0.026
akes+0.026
affles+0.026
ags+0.025
ahs+0.025
Neg logit contributions
 Pen-0.018
 grand-0.017
 unw-0.017
An-0.016
 unsc-0.015
Token the
Token ID262
Feature activation+3.88e-04

Pos logit contributions
 Pen+0.011
 unw+0.010
 grand+0.009
 unsc+0.009
An+0.009
Neg logit contributions
ained-0.024
akes-0.023
ags-0.023
ahs-0.023
affles-0.023
Token truck
Token ID7779
Feature activation+8.70e-03

Pos logit contributions
affles+0.001
ained+0.001
unes+0.001
w+0.001
akes+0.001
Neg logit contributions
 grand-0.000
 keeper-0.000
 open-0.000
 Pen-0.000
keeper-0.000
Token driver
Token ID4639
Feature activation+3.85e-02

Pos logit contributions
ained+0.015
ags+0.015
affles+0.014
akes+0.014
agons+0.014
Neg logit contributions
 Pen-0.010
 grand-0.009
 unw-0.009
 keeper-0.009
keeper-0.009
Token.
Token ID13
Feature activation-7.09e-03

Pos logit contributions
ained+0.068
ags+0.068
affles+0.066
agons+0.066
akes+0.066
Neg logit contributions
 Pen-0.042
 grand-0.038
 unw-0.037
 keeper-0.037
keeper-0.036
Token<|endoftext|>
Token ID50256
Feature activation+3.91e-02

Pos logit contributions
 unw+0.008
 Pen+0.008
 unsc+0.007
 keeper+0.007
 grand+0.007
Neg logit contributions
ained-0.011
akes-0.011
ags-0.011
ahs-0.010
 highlight-0.010
Token a
Token ID257
Feature activation-8.15e-03

Pos logit contributions
 Pen+0.000
 grand+0.000
 unw+0.000
An+0.000
 unsc+0.000
Neg logit contributions
ags-0.001
ained-0.001
agons-0.001
akes-0.001
affles-0.001
Token calm
Token ID9480
Feature activation-3.25e-02

Pos logit contributions
 Pen+0.007
 grand+0.007
 unw+0.007
 keeper+0.007
keeper+0.006
Neg logit contributions
ained-0.016
ags-0.016
akes-0.016
affles-0.015
ahs-0.015
Token,
Token ID11
Feature activation-2.10e-03

Pos logit contributions
 Pen+0.044
 grand+0.041
 keeper+0.040
 unw+0.039
keeper+0.038
Neg logit contributions
affles-0.049
ained-0.049
akes-0.048
ags-0.048
agons-0.047
Token soothing
Token ID40836
Feature activation-3.15e-02

Pos logit contributions
 Pen+0.002
 grand+0.002
 unw+0.002
 open+0.002
 unsc+0.002
Neg logit contributions
akes-0.004
affles-0.004
ained-0.004
ags-0.004
ahs-0.004
Token voice
Token ID3809
Feature activation-1.94e-02

Pos logit contributions
 Pen+0.034
 grand+0.030
 unw+0.030
 keeper+0.029
 unsc+0.028
Neg logit contributions
ained-0.058
akes-0.056
affles-0.055
ags-0.055
 highlight-0.054
Token filled
Token ID5901
Feature activation-1.03e-01

Pos logit contributions
 Pen+0.025
 unw+0.023
 grand+0.022
 unsc+0.022
 keeper+0.022
Neg logit contributions
ained-0.030
akes-0.029
ags-0.029
affles-0.028
ahs-0.028
Token the
Token ID262
Feature activation-6.43e-03

Pos logit contributions
 grand+0.120
 Pen+0.120
Un+0.119
ents+0.117
case+0.116
Neg logit contributions
ined-0.137
ags-0.136
ained-0.131
ging-0.127
akes-0.124
Token air
Token ID1633
Feature activation-2.85e-02

Pos logit contributions
 Pen+0.008
 keeper+0.008
 open+0.008
keeper+0.007
 unw+0.007
Neg logit contributions
affles-0.010
ahs-0.010
ained-0.010
agons-0.010
ags-0.010
Token.
Token ID13
Feature activation-1.15e-02

Pos logit contributions
 Pen+0.031
 unw+0.029
 grand+0.028
 unsc+0.027
 keeper+0.027
Neg logit contributions
ained-0.048
akes-0.047
ags-0.047
ahs-0.046
affles-0.046
Token The
Token ID383
Feature activation+2.19e-02

Pos logit contributions
 Pen+0.012
 unw+0.012
 grand+0.011
 unsc+0.011
 keeper+0.011
Neg logit contributions
ained-0.019
ags-0.018
akes-0.018
ahs-0.018
ined-0.018
Token bird
Token ID6512
Feature activation+8.72e-03

Pos logit contributions
ained+0.042
affles+0.041
ags+0.041
akes+0.040
agons+0.040
Neg logit contributions
 Pen-0.022
 keeper-0.020
 grand-0.020
keeper-0.019
 unw-0.019
Token by
Token ID416
Feature activation-1.91e-02

Pos logit contributions
akes+0.037
affles+0.036
ained+0.036
ags+0.036
agons+0.036
Neg logit contributions
 Pen-0.021
 unw-0.020
 grand-0.020
 unsc-0.019
An-0.019
Token the
Token ID262
Feature activation+1.24e-03

Pos logit contributions
 Pen+0.018
 unw+0.017
 unsc+0.016
 grand+0.015
 quickly+0.015
Neg logit contributions
akes-0.039
agons-0.038
ahs-0.037
ained-0.037
affles-0.037
Token bath
Token ID7837
Feature activation+3.11e-02

Pos logit contributions
ained+0.002
akes+0.002
ahs+0.002
ags+0.002
ined+0.002
Neg logit contributions
 Pen-0.002
 grand-0.001
 unw-0.001
 keeper-0.001
 unsc-0.001
Tokentub
Token ID37995
Feature activation-4.09e-02

Pos logit contributions
ained+0.041
ags+0.040
akes+0.040
ahs+0.040
affles+0.039
Neg logit contributions
 Pen-0.045
 unw-0.042
 grand-0.042
 unsc-0.040
 keeper-0.040
Token and
Token ID290
Feature activation-6.49e-03

Pos logit contributions
 Pen+0.049
 unw+0.044
 grand+0.043
An+0.043
 unsc+0.042
Neg logit contributions
ained-0.064
akes-0.063
ags-0.062
ahs-0.061
ined-0.061
Token filled
Token ID5901
Feature activation-1.01e-01

Pos logit contributions
 Pen+0.008
 unw+0.008
 grand+0.007
 unsc+0.007
 keeper+0.007
Neg logit contributions
ained-0.010
akes-0.010
ags-0.010
ahs-0.009
affles-0.009
Token it
Token ID340
Feature activation-7.30e-04

Pos logit contributions
 Pen+0.120
Un+0.115
 keeper+0.115
 grand+0.115
keeper+0.110
Neg logit contributions
ained-0.149
ined-0.147
ags-0.147
affles-0.144
akes-0.136
Token with
Token ID351
Feature activation-1.41e-02

Pos logit contributions
 Pen+0.001
 grand+0.001
Un+0.001
 Eva+0.001
 keeper+0.001
Neg logit contributions
ained-0.001
ined-0.001
akes-0.001
ags-0.001
 highlight-0.001
Token warm
Token ID5814
Feature activation-2.52e-02

Pos logit contributions
 unw+0.019
 Pen+0.018
 grand+0.018
 unsc+0.017
 quickly+0.017
Neg logit contributions
ahs-0.021
akes-0.021
ained-0.021
ags-0.020
ined-0.020
Token water
Token ID1660
Feature activation+4.91e-03

Pos logit contributions
 unw+0.030
 Pen+0.030
 grand+0.029
 quickly+0.028
 open+0.028
Neg logit contributions
ahs-0.041
 highlight-0.040
ained-0.039
akes-0.038
reetings-0.037
Token.
Token ID13
Feature activation-9.84e-03

Pos logit contributions
ained+0.009
akes+0.009
ags+0.009
affles+0.009
ahs+0.009
Neg logit contributions
 Pen-0.004
 unw-0.004
 grand-0.004
 unsc-0.004
 keeper-0.003
Token he
Token ID339
Feature activation+1.66e-03

Pos logit contributions
 grand+0.002
 unw+0.002
 Pen+0.002
 unsc+0.002
 quickly+0.002
Neg logit contributions
akes-0.006
ags-0.006
ained-0.006
agons-0.006
ined-0.006
Token was
Token ID373
Feature activation+1.79e-03

Pos logit contributions
akes+0.003
ained+0.002
ined+0.002
ahs+0.002
affles+0.002
Neg logit contributions
 Pen-0.002
 grand-0.002
 keeper-0.002
 unw-0.002
keeper-0.002
Token done
Token ID1760
Feature activation+2.42e-03

Pos logit contributions
akes+0.003
agons+0.002
ahs+0.002
ained+0.002
affles+0.002
Neg logit contributions
 Pen-0.003
 unw-0.003
 unsc-0.002
 grand-0.002
 open-0.002
Token examining
Token ID17247
Feature activation-2.43e-02

Pos logit contributions
akes+0.004
ained+0.004
ags+0.004
agons+0.004
affles+0.004
Neg logit contributions
 Pen-0.003
 unw-0.003
 grand-0.003
 unsc-0.003
 un-0.003
Token the
Token ID262
Feature activation+1.62e-03

Pos logit contributions
 Pen+0.016
 grand+0.015
 unw+0.014
 cautiously+0.013
 unsc+0.013
Neg logit contributions
ained-0.054
akes-0.053
affles-0.052
ags-0.051
 highlight-0.051
Token chair
Token ID5118
Feature activation-7.88e-02

Pos logit contributions
akes+0.003
ained+0.003
ags+0.003
 highlight+0.003
ahs+0.003
Neg logit contributions
 Pen-0.002
 unw-0.002
 keeper-0.001
keeper-0.001
 unsc-0.001
Token,
Token ID11
Feature activation+2.59e-03

Pos logit contributions
 Pen+0.105
 unw+0.099
 grand+0.097
 unsc+0.093
An+0.092
Neg logit contributions
ained-0.113
akes-0.111
ags-0.110
ahs-0.108
affles-0.108
Token he
Token ID339
Feature activation+2.55e-03

Pos logit contributions
ags+0.005
ained+0.005
akes+0.005
affles+0.005
agons+0.005
Neg logit contributions
 Pen-0.002
 grand-0.002
 unw-0.002
 unsc-0.002
An-0.002
Token thought
Token ID1807
Feature activation-9.41e-03

Pos logit contributions
ined+0.004
ahs+0.004
ained+0.003
affles+0.003
 highlight+0.003
Neg logit contributions
 Pen-0.003
 grand-0.003
Un-0.003
 keeper-0.003
An-0.003
Token it
Token ID340
Feature activation+3.11e-03

Pos logit contributions
 Pen+0.010
 unw+0.009
 grand+0.009
 unsc+0.009
An+0.008
Neg logit contributions
ained-0.017
akes-0.016
ags-0.016
ahs-0.016
 highlight-0.016
Token was
Token ID373
Feature activation+3.66e-03

Pos logit contributions
ined+0.005
ained+0.005
akes+0.005
ahs+0.005
 highlight+0.005
Neg logit contributions
 Pen-0.003
 unw-0.003
 grand-0.003
 Eva-0.003
Un-0.003
Token girl
Token ID2576
Feature activation-1.77e-02

Pos logit contributions
 Pen+0.017
 grand+0.017
 keeper+0.016
keeper+0.016
 unw+0.015
Neg logit contributions
ags-0.032
ained-0.032
akes-0.031
affles-0.031
ined-0.030
Token who
Token ID508
Feature activation+3.11e-02

Pos logit contributions
 Pen+0.023
 unw+0.021
 grand+0.021
An+0.020
 unsc+0.020
Neg logit contributions
ained-0.026
akes-0.025
ahs-0.025
ags-0.025
ined-0.025
Token saw
Token ID2497
Feature activation+5.32e-03

Pos logit contributions
ined+0.046
ained+0.045
ahs+0.044
akes+0.043
w+0.042
Neg logit contributions
 Pen-0.038
 unw-0.035
 grand-0.035
Un-0.033
 keeper-0.033
Token the
Token ID262
Feature activation+8.93e-04

Pos logit contributions
ained+0.009
akes+0.009
ags+0.009
ahs+0.009
affles+0.009
Neg logit contributions
 Pen-0.006
 unw-0.005
 grand-0.005
 unsc-0.005
An-0.005
Token seal
Token ID13810
Feature activation+4.59e-03

Pos logit contributions
ained+0.002
ags+0.002
agged+0.002
akes+0.002
ined+0.002
Neg logit contributions
 grand-0.001
 keeper-0.001
 Pen-0.001
keeper-0.001
 open-0.001
Token struggling
Token ID9648
Feature activation-8.33e-02

Pos logit contributions
ained+0.008
ags+0.008
akes+0.008
affles+0.008
agons+0.008
Neg logit contributions
 Pen-0.005
 unw-0.005
 grand-0.004
 keeper-0.004
 unsc-0.004
Token with
Token ID351
Feature activation-1.40e-02

Pos logit contributions
 Pen+0.088
 unw+0.080
 grand+0.077
An+0.074
 unsc+0.073
Neg logit contributions
ained-0.143
ags-0.138
akes-0.138
ahs-0.137
ined-0.135
Token the
Token ID262
Feature activation+7.09e-04

Pos logit contributions
 Pen+0.015
 unw+0.014
An+0.014
keeper+0.013
 keeper+0.013
Neg logit contributions
ags-0.023
ained-0.022
ahs-0.022
akes-0.022
ined-0.022
Token trash
Token ID13913
Feature activation+4.57e-02

Pos logit contributions
ained+0.001
ined+0.001
akes+0.001
agged+0.001
ags+0.001
Neg logit contributions
 grand-0.001
 Pen-0.001
 keeper-0.001
 open-0.001
keeper-0.001
Token.
Token ID13
Feature activation-1.05e-02

Pos logit contributions
ained+0.071
akes+0.070
ags+0.069
ahs+0.069
ined+0.068
Neg logit contributions
 Pen-0.055
 unw-0.051
 grand-0.050
 unsc-0.048
An-0.048
Token She
Token ID1375
Feature activation-8.03e-03

Pos logit contributions
 unw+0.010
 Pen+0.010
 unsc+0.010
 grand+0.009
An+0.009
Neg logit contributions
ained-0.017
unes-0.016
ags-0.016
akes-0.016
ahs-0.016
Token a
Token ID257
Feature activation-1.32e-02

Pos logit contributions
 Pen+0.018
 unw+0.017
 grand+0.017
 unsc+0.015
 open+0.014
Neg logit contributions
akes-0.053
ained-0.053
ahs-0.052
affles-0.051
ined-0.051
Token bag
Token ID6131
Feature activation-4.58e-02

Pos logit contributions
 Pen+0.014
 unw+0.013
 grand+0.013
 unsc+0.012
An+0.012
Neg logit contributions
ained-0.022
akes-0.022
ags-0.022
ahs-0.021
ined-0.021
Token with
Token ID351
Feature activation-1.86e-02

Pos logit contributions
 Pen+0.072
An+0.062
 unw+0.062
 Eva+0.061
Un+0.061
Neg logit contributions
ained-0.062
ined-0.057
ags-0.056
akes-0.056
orting-0.054
Token him
Token ID683
Feature activation-4.84e-03

Pos logit contributions
 Pen+0.021
 unw+0.020
 grand+0.019
 unsc+0.018
An+0.018
Neg logit contributions
ained-0.031
akes-0.030
ags-0.030
ahs-0.030
affles-0.029
Token,
Token ID11
Feature activation-2.68e-03

Pos logit contributions
 Pen+0.005
 unw+0.005
 grand+0.005
 unsc+0.005
 keeper+0.004
Neg logit contributions
ained-0.008
akes-0.008
ags-0.008
ahs-0.008
affles-0.008
Token filled
Token ID5901
Feature activation-1.04e-01

Pos logit contributions
 Pen+0.003
 unw+0.003
 grand+0.003
 keeper+0.003
An+0.003
Neg logit contributions
ained-0.004
ags-0.004
akes-0.004
ahs-0.004
 highlight-0.004
Token with
Token ID351
Feature activation-1.81e-02

Pos logit contributions
 Pen+0.168
Un+0.149
abel+0.148
case+0.143
Laura+0.143
Neg logit contributions
ined-0.111
 shining-0.108
ags-0.105
ained-0.103
 peanuts-0.102
Token lots
Token ID6041
Feature activation-8.71e-03

Pos logit contributions
 Pen+0.025
 unw+0.024
 grand+0.023
 unsc+0.022
An+0.022
Neg logit contributions
ained-0.025
akes-0.025
ags-0.025
ahs-0.024
affles-0.024
Token of
Token ID286
Feature activation-8.53e-03

Pos logit contributions
 Pen+0.013
 unw+0.012
 grand+0.012
An+0.011
 unsc+0.011
Neg logit contributions
ained-0.011
akes-0.011
ags-0.011
affles-0.011
ahs-0.011
Token things
Token ID1243
Feature activation+6.86e-03

Pos logit contributions
 grand+0.010
 unw+0.010
 Pen+0.010
 open+0.009
 unsc+0.009
Neg logit contributions
ained-0.014
akes-0.013
ahs-0.013
 highlight-0.013
ags-0.013
Token.
Token ID13
Feature activation-2.65e-02

Pos logit contributions
ained+0.011
akes+0.010
ags+0.010
ahs+0.010
ined+0.010
Neg logit contributions
 Pen-0.008
An-0.007
 unw-0.007
 grand-0.007
Un-0.007