TOP ACTIVATIONS
MAX = 9.954

little boy named Tim found a big , heavy rock in
was a little girl named A ire l le . She
a boy named Tim found a toy . The toy was
a boy named Tim found a cr ay on box .
a girl named Sue found a broken scarf . She wanted
little girl named Lucy found a big scale in her house
small boy named Tim found a penny . This penny was
little rabbit named Tim found a shiny trophy in the forest
a boy named Tim found a toy motorcycle . It was
a boy named Tim found a big hammer in his toy
little girl named Lucy found a big box in her room
a time , there was a little girl called Anna who
little boy named Tim found a magic f auc et in
was a little girl named A yl a . She was
little girl named Lucy found a colorful bird c age in
a boy named Tim found a heavy box . He wanted
little girl named Sue found a shiny coin on the ground
were best friends . On a sunny afternoon , they decided
little boy named Tim found a sc ooter . It was
little boy named Tim found a long string in his empty

MIN ACTIVATIONS
MIN = -14.612

will take the rh in oc eros back to the zoo
care of the rh in oc eros and it needed to
Sammy would often share av oc ados and other y ummy
try to grab the av oc ados . But the rock
learn more about rh in oc er oses and respect them
and picked the right av oc ados . Sadly
and hugged the rh in oc eros . He finally understood
you like the rh in oc eros . But remember ,
to see the rh in oc eros anymore . He is
to pet the rh in oc eros . It looked so
to enjoy eating her av oc ados with a smile on
thanked the elderly rh in oc eros for his wonderful surprise
me see the rh in oc eros . It is still
store to get more av oc ados . Gary looked at
to help the rh in oc eros . The zo oke
had never seen the av oc ados . They wished they
you like the rh in oc eros , but it is
Sam hugged the rh in oc eros goodbye , then ran
food for the rh in oc eros and his friends .
could help the rh in oc eros ." His dad smiled

INTERVAL 3.813 - 9.954
CONTAINS 8.503%

a time , there was a farmer named Jack . Jack
Once upon a time , there was a brave knight called
He tells her it has a happy ending . <|endoftext|> Ben
upon a time , there was a farmer who had a
was a bad idea . <|endoftext|> L ily and Ben were

INTERVAL -2.329 - 3.813
CONTAINS 72.092%

ily felt better and went to the party with her mom
said . " OK , let 's sleep in
Tom and Lily nodded . They still liked the
She opened her mouth with her songs . But
, two lines for the legs and a fan for the

INTERVAL -8.471 - -2.329
CONTAINS 19.349%

and she gave them lots of y ummy snacks .
The fairy smiled and waved goodbye as she flew
Mom hugged them and said , " I love you too
. He is ready to go back to his nest .
very angry . He stamped his feet and yelled because no

INTERVAL -14.612 - -8.471
CONTAINS 0.057%

so happy and thanked the re ind eer for showing her
Lily . I 'll help Mitt ens ." She climbed the
. It is for grown - ups . You could have
a strange , raspberry - col oured object hiding under a
new ! She thought av oc ados were the most interesting
Token little
Token ID1310
Feature activation+4.92e+00

Pos logit contributions
 big+0.060
 two+0.050
 whole+0.049
A+0.045
 huge+0.041
Neg logit contributions
anged-0.344
owing-0.337
and-0.337
 Su-0.333
 Acc-0.331
Token boy
Token ID2933
Feature activation+3.16e+00

Pos logit contributions
 big+0.078
 whole+0.065
 huge+0.063
A+0.062
 little+0.061
Neg logit contributions
anged-0.159
owing-0.153
and-0.152
os-0.150
 Su-0.146
Token named
Token ID3706
Feature activation-3.08e+00

Pos logit contributions
 big+0.080
 little+0.070
 whole+0.067
 two+0.065
 huge+0.065
Neg logit contributions
and-0.077
owing-0.076
anged-0.076
ules-0.074
 supervision-0.073
Token Tim
Token ID5045
Feature activation-1.30e+00

Pos logit contributions
anged+0.050
and+0.046
owing+0.045
os+0.045
 Su+0.042
Neg logit contributions
 big-0.101
 whole-0.092
 little-0.091
 huge-0.090
A-0.089
Token found
Token ID1043
Feature activation+3.89e+00

Pos logit contributions
anged+0.039
owing+0.037
and+0.037
os+0.037
 Su+0.035
Neg logit contributions
 big-0.025
 whole-0.021
 little-0.020
 huge-0.020
A-0.020
Token a
Token ID257
Feature activation+9.95e+00

Pos logit contributions
 big+0.040
 whole+0.037
 little+0.031
 huge+0.031
+0.030
Neg logit contributions
anged-0.134
and-0.128
 Su-0.127
os-0.126
 Kim-0.125
Token big
Token ID1263
Feature activation+5.71e+00

Pos logit contributions
A+0.079
 two+0.073
 entire+0.061
 whole+0.059
+0.055
Neg logit contributions
 Acc-0.321
 Su-0.308
anged-0.304
owing-0.304
and-0.301
Token,
Token ID11
Feature activation-1.01e+00

Pos logit contributions
 big+0.087
A+0.083
 whole+0.080
 two+0.072
 little+0.071
Neg logit contributions
anged-0.174
owing-0.168
 Su-0.160
os-0.160
and-0.158
Token heavy
Token ID4334
Feature activation+5.82e+00

Pos logit contributions
and+0.035
anged+0.035
owing+0.035
 Su+0.034
 Acc+0.033
Neg logit contributions
 big-0.010
 whole-0.008
A-0.008
 two-0.008
 little-0.007
Token rock
Token ID3881
Feature activation+3.18e+00

Pos logit contributions
A+0.086
 big+0.086
 whole+0.080
 two+0.075
 huge+0.072
Neg logit contributions
anged-0.173
owing-0.167
 Su-0.158
ress-0.156
os-0.154
Token in
Token ID287
Feature activation+4.74e-01

Pos logit contributions
 big+0.058
 whole+0.050
 little+0.049
 huge+0.048
A+0.048
Neg logit contributions
anged-0.097
owing-0.093
and-0.092
os-0.091
 Su-0.086
Token was
Token ID373
Feature activation+5.62e+00

Pos logit contributions
 big+0.068
 whole+0.053
 huge+0.050
 little+0.050
A+0.049
Neg logit contributions
anged-0.194
and-0.189
os-0.188
owing-0.187
 Su-0.183
Token a
Token ID257
Feature activation+8.57e+00

Pos logit contributions
 big+0.048
 whole+0.032
 little+0.030
 huge+0.030
A+0.027
Neg logit contributions
anged-0.214
and-0.213
owing-0.207
 Su-0.205
os-0.204
Token little
Token ID1310
Feature activation+2.92e+00

Pos logit contributions
 big+0.050
A+0.042
 whole+0.041
 two+0.039
 huge+0.031
Neg logit contributions
anged-0.308
 Su-0.304
and-0.303
owing-0.303
 Acc-0.299
Token girl
Token ID2576
Feature activation+2.32e+00

Pos logit contributions
 big+0.048
 whole+0.042
A+0.042
 huge+0.040
 two+0.038
Neg logit contributions
anged-0.088
owing-0.085
os-0.083
and-0.083
 Su-0.082
Token named
Token ID3706
Feature activation-4.80e+00

Pos logit contributions
 big+0.062
 little+0.054
 whole+0.052
 huge+0.051
A+0.051
Neg logit contributions
anged-0.055
and-0.053
owing-0.052
os-0.050
azes-0.049
Token A
Token ID317
Feature activation+9.91e+00

Pos logit contributions
anged+0.132
owing+0.126
os+0.120
and+0.119
azes+0.114
Neg logit contributions
 big-0.118
 little-0.103
 whole-0.101
A-0.096
 huge-0.096
Tokenire
Token ID557
Feature activation+1.77e+00

Pos logit contributions
 big+0.062
 whole+0.046
 two+0.045
A+0.040
 huge+0.036
Neg logit contributions
anged-0.369
and-0.368
 Su-0.367
owing-0.363
os-0.361
Tokenl
Token ID75
Feature activation+4.09e-01

Pos logit contributions
 big+0.032
 whole+0.027
 huge+0.026
 little+0.026
A+0.025
Neg logit contributions
anged-0.054
owing-0.052
and-0.052
os-0.051
 Su-0.049
Tokenle
Token ID293
Feature activation+3.04e+00

Pos logit contributions
 big+0.007
 whole+0.006
 huge+0.006
 little+0.006
A+0.006
Neg logit contributions
anged-0.013
owing-0.012
os-0.012
and-0.012
 Su-0.011
Token.
Token ID13
Feature activation+2.10e+00

Pos logit contributions
 big+0.068
 whole+0.060
 huge+0.060
 heavy+0.057
A+0.057
Neg logit contributions
anged-0.077
owing-0.074
and-0.073
os-0.073
 Su-0.069
Token She
Token ID1375
Feature activation+4.36e+00

Pos logit contributions
 big+0.041
 little+0.033
 two+0.032
A+0.031
 whole+0.031
Neg logit contributions
 supervision-0.068
owing-0.068
anged-0.068
beh-0.064
and-0.063
Token a
Token ID257
Feature activation+9.23e+00

Pos logit contributions
anged+0.024
and+0.023
os+0.023
owing+0.022
 Su+0.022
Neg logit contributions
 big-0.008
 whole-0.007
 huge-0.007
 nearest-0.006
 little-0.006
Token boy
Token ID2933
Feature activation+3.13e+00

Pos logit contributions
 big+0.061
 two+0.056
 whole+0.051
A+0.047
 huge+0.045
Neg logit contributions
anged-0.323
and-0.319
owing-0.315
 Su-0.311
 Acc-0.311
Token named
Token ID3706
Feature activation-3.08e+00

Pos logit contributions
 big+0.081
 little+0.070
 whole+0.069
A+0.066
 two+0.066
Neg logit contributions
and-0.074
anged-0.074
owing-0.074
ules-0.071
 Him-0.070
Token Tim
Token ID5045
Feature activation-1.37e+00

Pos logit contributions
anged+0.048
and+0.044
os+0.043
owing+0.042
 Su+0.040
Neg logit contributions
 big-0.101
 whole-0.094
 huge-0.091
 little-0.091
A-0.090
Token found
Token ID1043
Feature activation+3.97e+00

Pos logit contributions
anged+0.041
os+0.040
owing+0.039
and+0.039
 Su+0.038
Neg logit contributions
 big-0.025
 whole-0.021
 little-0.021
 huge-0.021
A-0.020
Token a
Token ID257
Feature activation+9.88e+00

Pos logit contributions
 big+0.040
 whole+0.038
 little+0.032
 huge+0.031
+0.029
Neg logit contributions
anged-0.138
and-0.131
os-0.130
 Su-0.129
owing-0.127
Token toy
Token ID13373
Feature activation+4.58e+00

Pos logit contributions
A+0.071
 two+0.067
 entire+0.056
 whole+0.053
+0.047
Neg logit contributions
 Acc-0.322
 Su-0.309
anged-0.307
and-0.306
owing-0.304
Token.
Token ID13
Feature activation+1.90e+00

Pos logit contributions
 big+0.079
 whole+0.073
 huge+0.067
A+0.067
 little+0.066
Neg logit contributions
anged-0.138
owing-0.128
and-0.126
os-0.126
azes-0.120
Token The
Token ID383
Feature activation+2.25e+00

Pos logit contributions
 big+0.041
 little+0.034
 whole+0.034
 huge+0.033
A+0.032
Neg logit contributions
anged-0.054
owing-0.053
and-0.053
os-0.050
years-0.049
Token toy
Token ID13373
Feature activation+2.98e+00

Pos logit contributions
A+0.031
 whole+0.028
+0.027
 big+0.026
 huge+0.025
Neg logit contributions
anged-0.060
 Kim-0.059
 Acc-0.057
 Su-0.057
 J-0.056
Token was
Token ID373
Feature activation+4.63e+00

Pos logit contributions
 big+0.043
 whole+0.041
 huge+0.036
 little+0.035
A+0.035
Neg logit contributions
anged-0.096
os-0.091
and-0.085
azes-0.085
 Jos-0.084
Token a
Token ID257
Feature activation+9.26e+00

Pos logit contributions
anged+0.023
and+0.021
os+0.021
 Su+0.021
owing+0.021
Neg logit contributions
 big-0.008
 huge-0.006
 nearest-0.006
 whole-0.006
 magical-0.006
Token boy
Token ID2933
Feature activation+3.11e+00

Pos logit contributions
 big+0.066
 whole+0.053
 two+0.053
A+0.049
 huge+0.047
Neg logit contributions
anged-0.331
and-0.324
owing-0.322
 Su-0.317
 Acc-0.314
Token named
Token ID3706
Feature activation-2.89e+00

Pos logit contributions
 big+0.080
 little+0.070
 whole+0.068
 huge+0.065
 two+0.065
Neg logit contributions
and-0.074
owing-0.073
anged-0.073
ules-0.071
 Him-0.071
Token Tim
Token ID5045
Feature activation-9.20e-01

Pos logit contributions
anged+0.048
and+0.045
owing+0.044
os+0.043
 Su+0.040
Neg logit contributions
 big-0.094
 whole-0.086
 little-0.084
 huge-0.084
A-0.083
Token found
Token ID1043
Feature activation+3.90e+00

Pos logit contributions
anged+0.028
owing+0.027
and+0.027
os+0.026
 Su+0.025
Neg logit contributions
 big-0.017
 whole-0.014
 little-0.014
 huge-0.014
A-0.014
Token a
Token ID257
Feature activation+9.84e+00

Pos logit contributions
 big+0.040
 whole+0.037
+0.031
 little+0.031
 huge+0.031
Neg logit contributions
anged-0.135
and-0.130
 Su-0.127
os-0.126
 Kim-0.125
Token cr
Token ID1067
Feature activation+1.81e+00

Pos logit contributions
A+0.076
 two+0.072
 entire+0.059
 whole+0.056
+0.054
Neg logit contributions
 Acc-0.318
 Su-0.305
and-0.304
anged-0.303
owing-0.298
Tokenay
Token ID323
Feature activation+1.76e+00

Pos logit contributions
 big+0.046
 little+0.041
 whole+0.041
 huge+0.040
 two+0.040
Neg logit contributions
and-0.039
anged-0.039
os-0.038
owing-0.038
azes-0.035
Tokenon
Token ID261
Feature activation+3.79e+00

Pos logit contributions
 big+0.054
 whole+0.048
 little+0.048
 huge+0.047
A+0.046
Neg logit contributions
anged-0.033
and-0.031
owing-0.031
os-0.030
 Su-0.028
Token box
Token ID3091
Feature activation+5.44e+00

Pos logit contributions
 big+0.073
A+0.064
 whole+0.064
 huge+0.063
 little+0.062
Neg logit contributions
anged-0.107
owing-0.104
and-0.103
os-0.100
azes-0.092
Token.
Token ID13
Feature activation+1.60e+00

Pos logit contributions
 big+0.106
A+0.095
 whole+0.094
 little+0.092
 huge+0.091
Neg logit contributions
anged-0.150
owing-0.148
and-0.145
os-0.141
 Su-0.131
Token a
Token ID257
Feature activation+9.31e+00

Pos logit contributions
anged+0.024
and+0.023
os+0.023
 Su+0.023
owing+0.022
Neg logit contributions
 big-0.008
 whole-0.007
 huge-0.007
 sharp-0.006
 little-0.006
Token girl
Token ID2576
Feature activation+3.73e+00

Pos logit contributions
 big+0.067
 two+0.056
 whole+0.055
A+0.051
 huge+0.049
Neg logit contributions
anged-0.327
and-0.322
owing-0.320
 Su-0.314
 Acc-0.312
Token named
Token ID3706
Feature activation-2.60e+00

Pos logit contributions
 big+0.096
 little+0.085
 whole+0.081
 two+0.078
 huge+0.078
Neg logit contributions
and-0.090
anged-0.090
owing-0.089
ules-0.086
ram-0.085
Token Sue
Token ID26089
Feature activation+3.75e-01

Pos logit contributions
anged+0.058
owing+0.056
and+0.054
os+0.053
ules+0.050
Neg logit contributions
 big-0.073
 little-0.064
 whole-0.064
 huge-0.062
A-0.062
Token found
Token ID1043
Feature activation+3.86e+00

Pos logit contributions
 big+0.006
 whole+0.005
 little+0.005
 huge+0.005
A+0.005
Neg logit contributions
anged-0.012
and-0.012
owing-0.012
os-0.011
 Su-0.011
Token a
Token ID257
Feature activation+9.83e+00

Pos logit contributions
 big+0.036
 whole+0.030
 huge+0.026
 little+0.026
A+0.023
Neg logit contributions
anged-0.141
and-0.137
 Su-0.135
owing-0.134
os-0.134
Token broken
Token ID5445
Feature activation+5.43e+00

Pos logit contributions
A+0.076
 two+0.073
 entire+0.059
 whole+0.056
+0.056
Neg logit contributions
 Acc-0.316
 Su-0.304
and-0.302
owing-0.296
anged-0.294
Token scarf
Token ID36953
Feature activation+2.35e+00

Pos logit contributions
A+0.089
 big+0.079
 whole+0.076
 huge+0.073
 two+0.072
Neg logit contributions
anged-0.144
ress-0.139
owing-0.139
 Su-0.138
 Acc-0.131
Token.
Token ID13
Feature activation+1.58e+00

Pos logit contributions
 big+0.045
A+0.041
 whole+0.039
 huge+0.039
 little+0.039
Neg logit contributions
owing-0.065
anged-0.064
and-0.062
os-0.062
 Su-0.057
Token She
Token ID1375
Feature activation+4.71e+00

Pos logit contributions
 big+0.041
 little+0.035
 whole+0.034
 huge+0.034
A+0.033
Neg logit contributions
anged-0.040
owing-0.038
and-0.038
os-0.036
years-0.036
Token wanted
Token ID2227
Feature activation+4.99e+00

Pos logit contributions
 big+0.086
 whole+0.082
 huge+0.077
 entire+0.070
 top+0.070
Neg logit contributions
anged-0.131
os-0.119
and-0.119
years-0.117
atted-0.115
Token little
Token ID1310
Feature activation+4.71e+00

Pos logit contributions
 big+0.051
 two+0.043
 whole+0.041
A+0.037
 huge+0.034
Neg logit contributions
anged-0.338
and-0.331
owing-0.329
 Su-0.325
 Acc-0.322
Token girl
Token ID2576
Feature activation+3.66e+00

Pos logit contributions
 big+0.073
 whole+0.060
 huge+0.057
 little+0.057
A+0.056
Neg logit contributions
anged-0.156
owing-0.150
and-0.150
os-0.147
 Su-0.143
Token named
Token ID3706
Feature activation-2.89e+00

Pos logit contributions
 big+0.093
 little+0.081
 whole+0.077
 two+0.075
 huge+0.074
Neg logit contributions
and-0.091
owing-0.090
anged-0.090
ules-0.087
ram-0.086
Token Lucy
Token ID22162
Feature activation-2.37e-01

Pos logit contributions
anged+0.073
owing+0.070
and+0.067
os+0.066
ules+0.065
Neg logit contributions
 big-0.075
 little-0.065
 whole-0.065
A-0.062
 huge-0.062
Token found
Token ID1043
Feature activation+3.75e+00

Pos logit contributions
anged+0.007
and+0.007
owing+0.007
os+0.007
 Su+0.007
Neg logit contributions
 big-0.004
 whole-0.004
 little-0.004
 huge-0.003
A-0.003
Token a
Token ID257
Feature activation+9.82e+00

Pos logit contributions
 big+0.035
 whole+0.027
 little+0.024
 huge+0.024
A+0.022
Neg logit contributions
anged-0.139
and-0.136
owing-0.133
 Su-0.133
os-0.133
Token big
Token ID1263
Feature activation+5.60e+00

Pos logit contributions
A+0.074
 two+0.072
 whole+0.057
 entire+0.056
+0.054
Neg logit contributions
 Acc-0.322
 Su-0.310
and-0.305
anged-0.304
owing-0.304
Token scale
Token ID5046
Feature activation+3.57e+00

Pos logit contributions
 big+0.087
A+0.079
 whole+0.078
 two+0.071
 little+0.070
Neg logit contributions
anged-0.172
owing-0.166
os-0.159
 Su-0.159
and-0.158
Token in
Token ID287
Feature activation+3.90e-01

Pos logit contributions
 big+0.061
 whole+0.051
 little+0.049
 huge+0.049
A+0.047
Neg logit contributions
anged-0.114
owing-0.109
and-0.109
os-0.107
 Su-0.103
Token her
Token ID607
Feature activation+6.54e-01

Pos logit contributions
 big+0.011
 whole+0.010
 little+0.010
 huge+0.010
A+0.009
Neg logit contributions
anged-0.008
and-0.008
owing-0.008
os-0.008
 Su-0.007
Token house
Token ID2156
Feature activation+2.42e+00

Pos logit contributions
 big+0.012
 whole+0.011
A+0.010
 huge+0.010
 little+0.010
Neg logit contributions
anged-0.019
and-0.018
owing-0.017
os-0.017
azes-0.017
Token small
Token ID1402
Feature activation+6.15e+00

Pos logit contributions
 big+0.057
 whole+0.043
 two+0.042
A+0.039
 huge+0.036
Neg logit contributions
anged-0.344
and-0.337
owing-0.336
 Su-0.330
os-0.327
Token boy
Token ID2933
Feature activation+2.99e+00

Pos logit contributions
 big+0.099
 whole+0.085
A+0.083
 huge+0.080
 two+0.077
Neg logit contributions
anged-0.195
owing-0.188
os-0.183
and-0.183
 Su-0.178
Token named
Token ID3706
Feature activation-3.08e+00

Pos logit contributions
 big+0.073
 little+0.063
 whole+0.060
 two+0.058
A+0.058
Neg logit contributions
and-0.076
owing-0.075
anged-0.074
ules-0.074
ram-0.072
Token Tim
Token ID5045
Feature activation-1.85e+00

Pos logit contributions
anged+0.052
and+0.048
owing+0.048
os+0.046
 Su+0.043
Neg logit contributions
 big-0.100
 whole-0.091
 little-0.090
 huge-0.089
A-0.088
Token found
Token ID1043
Feature activation+3.81e+00

Pos logit contributions
anged+0.051
owing+0.048
and+0.048
os+0.048
 Su+0.045
Neg logit contributions
 big-0.040
 whole-0.034
 little-0.034
 huge-0.034
A-0.033
Token a
Token ID257
Feature activation+9.81e+00

Pos logit contributions
 big+0.038
 whole+0.035
 little+0.029
 huge+0.029
+0.028
Neg logit contributions
anged-0.134
and-0.129
 Su-0.127
os-0.126
owing-0.125
Token penny
Token ID26369
Feature activation+3.19e+00

Pos logit contributions
A+0.080
 two+0.074
 entire+0.062
 whole+0.060
+0.057
Neg logit contributions
 Acc-0.312
 Su-0.299
anged-0.295
and-0.295
owing-0.293
Token.
Token ID13
Feature activation+1.64e+00

Pos logit contributions
 big+0.066
A+0.060
 whole+0.059
 huge+0.059
 little+0.058
Neg logit contributions
owing-0.083
anged-0.083
and-0.079
os-0.076
 Su-0.071
Token This
Token ID770
Feature activation+6.41e+00

Pos logit contributions
 big+0.035
 little+0.029
 whole+0.029
 huge+0.029
A+0.028
Neg logit contributions
anged-0.047
owing-0.046
and-0.046
os-0.044
years-0.043
Token penny
Token ID26369
Feature activation+1.79e+00

Pos logit contributions
 big+0.120
 whole+0.117
A+0.116
 huge+0.113
 entire+0.108
Neg logit contributions
anged-0.146
 Su-0.139
ress-0.133
and-0.132
rings-0.132
Token was
Token ID373
Feature activation+3.80e+00

Pos logit contributions
 big+0.027
 whole+0.025
A+0.023
 huge+0.022
 little+0.022
Neg logit contributions
anged-0.057
os-0.056
owing-0.054
and-0.052
azes-0.051
Token little
Token ID1310
Feature activation+4.69e+00

Pos logit contributions
 big+0.054
 two+0.046
 whole+0.043
A+0.040
 huge+0.037
Neg logit contributions
anged-0.336
owing-0.331
and-0.331
 Su-0.325
 Acc-0.323
Token rabbit
Token ID22746
Feature activation+3.31e+00

Pos logit contributions
 big+0.074
 whole+0.062
 huge+0.059
A+0.059
 little+0.058
Neg logit contributions
anged-0.151
owing-0.146
and-0.144
os-0.142
 Su-0.139
Token named
Token ID3706
Feature activation-3.05e+00

Pos logit contributions
 big+0.093
 little+0.082
 whole+0.082
 huge+0.080
A+0.079
Neg logit contributions
anged-0.071
and-0.068
owing-0.067
os-0.064
 Su-0.061
Token Tim
Token ID5045
Feature activation-1.03e+00

Pos logit contributions
anged+0.052
and+0.048
owing+0.048
os+0.046
 Su+0.043
Neg logit contributions
 big-0.098
 whole-0.089
 little-0.088
 huge-0.087
A-0.086
Token found
Token ID1043
Feature activation+3.73e+00

Pos logit contributions
anged+0.032
owing+0.030
and+0.030
os+0.030
 Su+0.029
Neg logit contributions
 big-0.019
 whole-0.016
 little-0.015
 huge-0.015
A-0.015
Token a
Token ID257
Feature activation+9.81e+00

Pos logit contributions
 big+0.035
 whole+0.029
 little+0.024
 huge+0.024
A+0.022
Neg logit contributions
anged-0.138
and-0.134
owing-0.132
os-0.131
 Su-0.130
Token shiny
Token ID22441
Feature activation+6.50e+00

Pos logit contributions
 two+0.074
A+0.072
 entire+0.062
+0.055
 whole+0.049
Neg logit contributions
 Acc-0.299
 Su-0.292
 figuring-0.286
owing-0.286
 Jan-0.285
Token trophy
Token ID16383
Feature activation+7.04e+00

Pos logit contributions
A+0.110
 whole+0.099
 big+0.097
 two+0.095
There+0.089
Neg logit contributions
anged-0.165
owing-0.160
oot-0.153
 Su-0.153
ress-0.151
Token in
Token ID287
Feature activation+5.45e-01

Pos logit contributions
 big+0.140
A+0.129
 whole+0.126
 huge+0.121
 little+0.121
Neg logit contributions
anged-0.187
owing-0.182
and-0.177
os-0.171
 figuring-0.163
Token the
Token ID262
Feature activation+1.73e+00

Pos logit contributions
 big+0.013
 whole+0.012
 little+0.012
 huge+0.012
A+0.011
Neg logit contributions
anged-0.011
 Su-0.011
and-0.011
 Kim-0.011
os-0.011
Token forest
Token ID8222
Feature activation+2.22e+00

Pos logit contributions
 big+0.029
 little+0.025
 sharp+0.025
 huge+0.025
+0.025
Neg logit contributions
and-0.044
anged-0.043
ric-0.041
 J-0.041
weet-0.041
Token a
Token ID257
Feature activation+9.29e+00

Pos logit contributions
anged+0.026
and+0.025
os+0.025
owing+0.024
 Su+0.024
Neg logit contributions
 big-0.009
 whole-0.008
 huge-0.008
 entire-0.007
 nearest-0.007
Token boy
Token ID2933
Feature activation+3.06e+00

Pos logit contributions
 big+0.068
 two+0.063
 whole+0.059
A+0.055
 huge+0.053
Neg logit contributions
anged-0.314
and-0.311
owing-0.306
 Acc-0.305
 Su-0.304
Token named
Token ID3706
Feature activation-2.83e+00

Pos logit contributions
 big+0.080
 little+0.069
 whole+0.068
 two+0.065
 huge+0.065
Neg logit contributions
and-0.072
anged-0.071
owing-0.071
ules-0.069
 Him-0.068
Token Tim
Token ID5045
Feature activation-8.16e-01

Pos logit contributions
anged+0.044
and+0.040
os+0.040
owing+0.039
 Su+0.037
Neg logit contributions
 big-0.093
 whole-0.086
 huge-0.084
 little-0.084
A-0.083
Token found
Token ID1043
Feature activation+3.80e+00

Pos logit contributions
anged+0.024
os+0.023
owing+0.023
and+0.023
 Su+0.022
Neg logit contributions
 big-0.015
 whole-0.013
 huge-0.013
 little-0.013
A-0.012
Token a
Token ID257
Feature activation+9.80e+00

Pos logit contributions
 big+0.039
 whole+0.036
 little+0.030
 huge+0.030
+0.030
Neg logit contributions
anged-0.129
and-0.124
os-0.121
 Su-0.121
 Kim-0.120
Token toy
Token ID13373
Feature activation+4.52e+00

Pos logit contributions
A+0.075
 two+0.071
 entire+0.060
 whole+0.054
+0.051
Neg logit contributions
 Acc-0.314
 Su-0.298
and-0.296
anged-0.294
owing-0.293
Token motorcycle
Token ID18757
Feature activation+3.40e+00

Pos logit contributions
 big+0.079
 whole+0.074
A+0.068
 huge+0.067
 little+0.066
Neg logit contributions
anged-0.133
owing-0.123
os-0.120
and-0.119
azes-0.115
Token.
Token ID13
Feature activation+1.43e+00

Pos logit contributions
 big+0.061
 whole+0.052
 little+0.050
 huge+0.050
A+0.049
Neg logit contributions
anged-0.104
owing-0.100
and-0.099
os-0.098
 Su-0.093
Token It
Token ID632
Feature activation+6.27e+00

Pos logit contributions
 big+0.032
 little+0.027
 whole+0.027
 huge+0.026
A+0.026
Neg logit contributions
anged-0.040
owing-0.038
and-0.038
os-0.037
years-0.035
Token was
Token ID373
Feature activation+4.68e+00

Pos logit contributions
 whole+0.090
 big+0.090
 little+0.085
 two+0.081
A+0.080
Neg logit contributions
os-0.182
anged-0.176
 Kim-0.170
 Su-0.167
 Jos-0.164
Token a
Token ID257
Feature activation+9.17e+00

Pos logit contributions
anged+0.027
and+0.026
owing+0.025
 Su+0.025
os+0.025
Neg logit contributions
 big-0.009
 huge-0.008
 whole-0.008
 sharp-0.007
 little-0.007
Token boy
Token ID2933
Feature activation+3.01e+00

Pos logit contributions
 big+0.064
 two+0.053
 whole+0.052
A+0.049
 huge+0.046
Neg logit contributions
anged-0.325
and-0.321
owing-0.317
 Su-0.313
 Acc-0.312
Token named
Token ID3706
Feature activation-2.84e+00

Pos logit contributions
 big+0.077
 little+0.067
 whole+0.065
 two+0.062
 huge+0.062
Neg logit contributions
and-0.073
owing-0.072
anged-0.071
ules-0.071
 Him-0.069
Token Tim
Token ID5045
Feature activation-8.94e-01

Pos logit contributions
anged+0.048
and+0.044
owing+0.044
os+0.043
 Su+0.040
Neg logit contributions
 big-0.092
 whole-0.084
 little-0.083
 huge-0.082
A-0.081
Token found
Token ID1043
Feature activation+3.76e+00

Pos logit contributions
anged+0.027
os+0.025
owing+0.025
and+0.025
 Su+0.024
Neg logit contributions
 big-0.017
 whole-0.014
 little-0.014
 huge-0.014
A-0.014
Token a
Token ID257
Feature activation+9.79e+00

Pos logit contributions
 big+0.037
 whole+0.034
 little+0.029
 huge+0.029
+0.028
Neg logit contributions
anged-0.132
and-0.127
os-0.123
 Su-0.123
owing-0.122
Token big
Token ID1263
Feature activation+5.51e+00

Pos logit contributions
A+0.073
 two+0.071
 entire+0.058
 whole+0.053
+0.051
Neg logit contributions
 Acc-0.318
and-0.303
 Su-0.302
anged-0.301
owing-0.297
Token hammer
Token ID15554
Feature activation+2.32e+00

Pos logit contributions
 big+0.083
A+0.082
 whole+0.077
 two+0.070
 huge+0.068
Neg logit contributions
anged-0.166
owing-0.160
 Su-0.153
os-0.151
 Acc-0.151
Token in
Token ID287
Feature activation+1.10e+00

Pos logit contributions
 big+0.043
 whole+0.037
A+0.036
 huge+0.036
 little+0.036
Neg logit contributions
anged-0.069
owing-0.067
and-0.065
os-0.065
 Su-0.061
Token his
Token ID465
Feature activation+1.40e+00

Pos logit contributions
 big+0.026
 whole+0.024
 huge+0.023
 little+0.022
A+0.022
Neg logit contributions
anged-0.025
and-0.024
 Su-0.024
os-0.024
 Jos-0.023
Token toy
Token ID13373
Feature activation+3.37e+00

Pos logit contributions
 big+0.025
 whole+0.022
A+0.021
 huge+0.021
 little+0.021
Neg logit contributions
anged-0.041
and-0.039
owing-0.038
os-0.038
azes-0.037
Token little
Token ID1310
Feature activation+4.44e+00

Pos logit contributions
 big+0.046
 whole+0.026
A+0.024
 two+0.023
 huge+0.022
Neg logit contributions
anged-0.373
and-0.367
owing-0.366
os-0.358
 Su-0.356
Token girl
Token ID2576
Feature activation+3.18e+00

Pos logit contributions
 big+0.069
 whole+0.056
 little+0.054
 huge+0.053
A+0.051
Neg logit contributions
anged-0.150
and-0.144
owing-0.144
os-0.141
 Su-0.136
Token named
Token ID3706
Feature activation-3.21e+00

Pos logit contributions
 big+0.079
 little+0.069
 whole+0.064
 two+0.063
 huge+0.062
Neg logit contributions
and-0.084
owing-0.081
anged-0.081
ules-0.079
reating-0.077
Token Lucy
Token ID22162
Feature activation-5.29e-01

Pos logit contributions
anged+0.076
owing+0.072
and+0.070
os+0.068
ules+0.066
Neg logit contributions
 big-0.088
 little-0.077
 whole-0.076
 huge-0.074
 two-0.073
Token found
Token ID1043
Feature activation+3.49e+00

Pos logit contributions
anged+0.017
and+0.016
owing+0.016
os+0.016
azes+0.015
Neg logit contributions
 big-0.009
 whole-0.008
 little-0.007
 huge-0.007
A-0.007
Token a
Token ID257
Feature activation+9.79e+00

Pos logit contributions
 big+0.034
 whole+0.029
 little+0.025
 huge+0.024
A+0.024
Neg logit contributions
anged-0.125
and-0.121
 Su-0.121
os-0.119
owing-0.119
Token big
Token ID1263
Feature activation+5.47e+00

Pos logit contributions
A+0.080
 two+0.074
 entire+0.062
 whole+0.061
+0.059
Neg logit contributions
 Acc-0.311
 Su-0.302
owing-0.297
anged-0.294
and-0.293
Token box
Token ID3091
Feature activation+5.87e+00

Pos logit contributions
 big+0.085
A+0.079
 whole+0.077
 two+0.070
 little+0.069
Neg logit contributions
anged-0.167
owing-0.162
 Su-0.154
os-0.154
and-0.152
Token in
Token ID287
Feature activation+1.48e+00

Pos logit contributions
 big+0.114
 whole+0.100
A+0.099
 huge+0.098
 little+0.097
Neg logit contributions
anged-0.163
owing-0.161
and-0.159
os-0.154
 Su-0.144
Token her
Token ID607
Feature activation+1.33e+00

Pos logit contributions
 big+0.043
 whole+0.038
 little+0.038
 huge+0.037
A+0.037
Neg logit contributions
anged-0.032
owing-0.030
and-0.029
os-0.029
 Su-0.026
Token room
Token ID2119
Feature activation+5.38e+00

Pos logit contributions
 big+0.023
 whole+0.021
A+0.020
 huge+0.020
 sharp+0.020
Neg logit contributions
anged-0.037
and-0.036
owing-0.034
os-0.034
 Jos-0.034
Token a
Token ID257
Feature activation+7.47e+00

Pos logit contributions
 big+0.075
+0.068
 huge+0.064
ball+0.063
 heavy+0.063
Neg logit contributions
and-0.208
 Su-0.187
weet-0.185
 J-0.185
 Jos-0.183
Token time
Token ID640
Feature activation+3.73e+00

Pos logit contributions
 big+0.093
 whole+0.072
 huge+0.069
A+0.068
 two+0.067
Neg logit contributions
anged-0.262
and-0.258
owing-0.255
os-0.249
 Su-0.242
Token,
Token ID11
Feature activation-1.13e+00

Pos logit contributions
 big+0.065
 little+0.053
 whole+0.052
 two+0.050
 huge+0.049
Neg logit contributions
anged-0.125
and-0.118
owing-0.117
os-0.115
beh-0.112
Token there
Token ID612
Feature activation+6.77e+00

Pos logit contributions
anged+0.033
owing+0.033
and+0.032
os+0.032
 Su+0.031
Neg logit contributions
 big-0.018
 whole-0.015
 huge-0.015
A-0.015
 sharp-0.015
Token was
Token ID373
Feature activation+6.79e+00

Pos logit contributions
 big+0.094
 whole+0.076
 sharp+0.076
 huge+0.075
A+0.072
Neg logit contributions
anged-0.221
and-0.219
os-0.218
owing-0.212
 Su-0.211
Token a
Token ID257
Feature activation+9.78e+00

Pos logit contributions
 big+0.059
 whole+0.042
 huge+0.038
 little+0.038
 sharp+0.036
Neg logit contributions
and-0.251
anged-0.250
 Su-0.242
owing-0.240
os-0.240
Token little
Token ID1310
Feature activation+3.90e+00

Pos logit contributions
 big+0.056
A+0.052
 whole+0.048
 two+0.045
 huge+0.033
Neg logit contributions
anged-0.343
 Su-0.341
 Acc-0.339
owing-0.338
and-0.336
Token girl
Token ID2576
Feature activation+3.20e+00

Pos logit contributions
 big+0.066
A+0.059
 whole+0.058
 huge+0.055
 two+0.053
Neg logit contributions
anged-0.114
owing-0.111
and-0.108
os-0.108
 Su-0.107
Token called
Token ID1444
Feature activation-1.78e+00

Pos logit contributions
 big+0.087
 little+0.078
 whole+0.075
 huge+0.074
A+0.073
Neg logit contributions
anged-0.073
and-0.069
owing-0.068
os-0.066
azes-0.064
Token Anna
Token ID11735
Feature activation+5.32e-01

Pos logit contributions
anged+0.044
owing+0.042
and+0.039
os+0.038
atted+0.038
Neg logit contributions
 big-0.047
 little-0.044
 two-0.042
 whole-0.040
 a-0.039
Token who
Token ID508
Feature activation+1.03e+00

Pos logit contributions
 big+0.010
 whole+0.008
 huge+0.008
 little+0.008
A+0.008
Neg logit contributions
anged-0.016
and-0.016
owing-0.016
os-0.015
 Su-0.014
Token little
Token ID1310
Feature activation+4.42e+00

Pos logit contributions
 big+0.060
 two+0.048
 whole+0.047
A+0.044
 huge+0.042
Neg logit contributions
anged-0.334
owing-0.326
and-0.325
 Su-0.319
 Acc-0.317
Token boy
Token ID2933
Feature activation+2.89e+00

Pos logit contributions
 big+0.069
 whole+0.057
 huge+0.054
 little+0.054
A+0.054
Neg logit contributions
anged-0.145
owing-0.140
and-0.139
os-0.137
 Su-0.133
Token named
Token ID3706
Feature activation-2.80e+00

Pos logit contributions
 big+0.074
 little+0.065
 whole+0.062
 huge+0.060
 two+0.060
Neg logit contributions
and-0.070
owing-0.069
anged-0.068
ules-0.067
 Him-0.066
Token Tim
Token ID5045
Feature activation-6.74e-01

Pos logit contributions
anged+0.045
and+0.041
owing+0.040
os+0.040
 Su+0.037
Neg logit contributions
 big-0.092
 whole-0.084
 little-0.083
 huge-0.083
A-0.082
Token found
Token ID1043
Feature activation+3.70e+00

Pos logit contributions
anged+0.020
owing+0.019
and+0.019
os+0.019
 Su+0.018
Neg logit contributions
 big-0.012
 whole-0.010
 little-0.010
 huge-0.010
A-0.010
Token a
Token ID257
Feature activation+9.76e+00

Pos logit contributions
 big+0.037
 whole+0.034
 little+0.029
 huge+0.028
+0.027
Neg logit contributions
anged-0.130
and-0.124
 Su-0.122
os-0.121
owing-0.121
Token magic
Token ID5536
Feature activation+6.51e+00

Pos logit contributions
A+0.075
 two+0.071
 entire+0.059
+0.053
 whole+0.053
Neg logit contributions
 Acc-0.316
 Su-0.300
anged-0.298
and-0.295
owing-0.295
Token f
Token ID277
Feature activation-1.12e+00

Pos logit contributions
A+0.112
 big+0.101
 whole+0.099
 two+0.096
+0.093
Neg logit contributions
anged-0.162
owing-0.153
 figuring-0.150
oot-0.149
and-0.147
Tokenauc
Token ID14272
Feature activation-2.24e+00

Pos logit contributions
anged+0.020
owing+0.019
and+0.019
os+0.018
 Su+0.016
Neg logit contributions
 big-0.036
 whole-0.032
 little-0.032
 huge-0.032
A-0.031
Tokenet
Token ID316
Feature activation-8.69e-03

Pos logit contributions
anged+0.038
and+0.036
owing+0.035
os+0.034
 Su+0.032
Neg logit contributions
 big-0.072
 whole-0.065
 little-0.064
 huge-0.064
 cool-0.063
Token in
Token ID287
Feature activation+5.33e-01

Pos logit contributions
anged+0.000
owing+0.000
and+0.000
os+0.000
 Su+0.000
Neg logit contributions
 big-0.000
 whole-0.000
 huge-0.000
 little-0.000
A-0.000
Token was
Token ID373
Feature activation+5.14e+00

Pos logit contributions
 big+0.068
 whole+0.053
 huge+0.049
 little+0.049
A+0.048
Neg logit contributions
anged-0.191
and-0.185
os-0.185
owing-0.183
 Su-0.181
Token a
Token ID257
Feature activation+8.40e+00

Pos logit contributions
 big+0.046
 whole+0.033
 huge+0.030
 little+0.030
 sharp+0.028
Neg logit contributions
anged-0.191
and-0.190
 Su-0.185
owing-0.183
os-0.182
Token little
Token ID1310
Feature activation+2.98e+00

Pos logit contributions
 big+0.053
 whole+0.044
A+0.042
 two+0.042
 huge+0.032
Neg logit contributions
anged-0.303
 Su-0.299
and-0.298
 Acc-0.296
owing-0.295
Token girl
Token ID2576
Feature activation+2.22e+00

Pos logit contributions
 big+0.049
 whole+0.044
A+0.043
 huge+0.040
 two+0.039
Neg logit contributions
anged-0.091
owing-0.088
and-0.085
os-0.085
 Su-0.084
Token named
Token ID3706
Feature activation-4.50e+00

Pos logit contributions
 big+0.057
 little+0.050
 whole+0.047
A+0.047
 huge+0.047
Neg logit contributions
anged-0.055
and-0.052
owing-0.052
os-0.050
azes-0.049
Token A
Token ID317
Feature activation+9.75e+00

Pos logit contributions
anged+0.119
owing+0.112
os+0.108
and+0.106
azes+0.101
Neg logit contributions
 big-0.113
 little-0.100
 whole-0.097
A-0.093
 huge-0.093
Tokenyl
Token ID2645
Feature activation-2.27e+00

Pos logit contributions
 big+0.074
 whole+0.058
 two+0.056
A+0.051
 huge+0.048
Neg logit contributions
anged-0.355
and-0.351
 Su-0.350
owing-0.347
 Acc-0.345
Tokena
Token ID64
Feature activation+2.60e+00

Pos logit contributions
anged+0.074
owing+0.070
and+0.069
os+0.068
 Su+0.067
Neg logit contributions
 big-0.038
 whole-0.032
 little-0.031
 huge-0.029
A-0.029
Token.
Token ID13
Feature activation+2.12e+00

Pos logit contributions
 big+0.060
 whole+0.053
 huge+0.052
 little+0.050
 two+0.050
Neg logit contributions
anged-0.066
and-0.063
owing-0.062
os-0.062
 Su-0.059
Token She
Token ID1375
Feature activation+4.36e+00

Pos logit contributions
 big+0.045
 little+0.037
 whole+0.035
A+0.034
 two+0.034
Neg logit contributions
anged-0.066
 supervision-0.065
owing-0.065
beh-0.062
and-0.061
Token was
Token ID373
Feature activation+4.49e+00

Pos logit contributions
 big+0.089
 two+0.080
 huge+0.079
 whole+0.077
 little+0.076
Neg logit contributions
anged-0.106
and-0.104
os-0.103
owing-0.097
 Acc-0.097
Token little
Token ID1310
Feature activation+4.54e+00

Pos logit contributions
 big+0.060
 two+0.050
 whole+0.048
A+0.043
 huge+0.042
Neg logit contributions
anged-0.327
and-0.321
owing-0.319
 Su-0.316
 Acc-0.314
Token girl
Token ID2576
Feature activation+3.53e+00

Pos logit contributions
 big+0.072
 whole+0.061
 huge+0.058
A+0.058
 little+0.056
Neg logit contributions
anged-0.146
owing-0.140
and-0.138
os-0.137
 Su-0.134
Token named
Token ID3706
Feature activation-3.25e+00

Pos logit contributions
 big+0.092
 little+0.081
 whole+0.077
 huge+0.075
 two+0.074
Neg logit contributions
and-0.086
owing-0.085
anged-0.084
ules-0.081
ram-0.079
Token Lucy
Token ID22162
Feature activation-3.33e-01

Pos logit contributions
anged+0.081
owing+0.078
and+0.075
os+0.074
ules+0.071
Neg logit contributions
 big-0.084
 little-0.074
 whole-0.073
 huge-0.070
 two-0.070
Token found
Token ID1043
Feature activation+3.69e+00

Pos logit contributions
anged+0.010
and+0.010
owing+0.010
os+0.010
 Su+0.009
Neg logit contributions
 big-0.006
 whole-0.005
 little-0.005
 huge-0.005
A-0.005
Token a
Token ID257
Feature activation+9.74e+00

Pos logit contributions
 big+0.035
 whole+0.029
 little+0.025
 huge+0.025
A+0.023
Neg logit contributions
anged-0.136
and-0.131
 Su-0.130
owing-0.130
os-0.128
Token colorful
Token ID20239
Feature activation+6.92e+00

Pos logit contributions
A+0.075
 two+0.073
 entire+0.059
+0.054
 whole+0.053
Neg logit contributions
 Acc-0.315
 Su-0.301
and-0.292
owing-0.291
anged-0.289
Token bird
Token ID6512
Feature activation+1.38e+00

Pos logit contributions
 big+0.114
A+0.110
 whole+0.105
 huge+0.098
 two+0.096
Neg logit contributions
anged-0.192
owing-0.183
 Su-0.181
and-0.176
ress-0.176
Tokenc
Token ID66
Feature activation+1.32e+00

Pos logit contributions
 big+0.026
 whole+0.022
 little+0.021
 huge+0.021
A+0.021
Neg logit contributions
anged-0.042
owing-0.040
and-0.040
os-0.039
 Su-0.037
Tokenage
Token ID496
Feature activation+2.62e+00

Pos logit contributions
 big+0.032
 whole+0.028
 little+0.028
 huge+0.028
A+0.027
Neg logit contributions
anged-0.032
and-0.031
owing-0.031
os-0.030
 Su-0.029
Token in
Token ID287
Feature activation+6.92e-01

Pos logit contributions
 big+0.051
 whole+0.044
 huge+0.044
 little+0.044
A+0.043
Neg logit contributions
anged-0.073
owing-0.073
and-0.070
os-0.069
 Su-0.066
Token a
Token ID257
Feature activation+9.13e+00

Pos logit contributions
anged+0.029
and+0.027
 Su+0.027
uzz+0.027
os+0.027
Neg logit contributions
 big-0.011
 huge-0.009
 whole-0.009
 nearest-0.008
 sharp-0.008
Token boy
Token ID2933
Feature activation+2.83e+00

Pos logit contributions
 big+0.054
 two+0.045
 whole+0.042
A+0.039
 huge+0.036
Neg logit contributions
anged-0.324
and-0.320
owing-0.317
 Su-0.316
 Acc-0.313
Token named
Token ID3706
Feature activation-3.32e+00

Pos logit contributions
 big+0.073
 little+0.064
 whole+0.061
 huge+0.059
 two+0.059
Neg logit contributions
and-0.068
owing-0.067
anged-0.067
 Him-0.064
ules-0.064
Token Tim
Token ID5045
Feature activation-1.74e+00

Pos logit contributions
anged+0.055
and+0.051
owing+0.051
os+0.049
 Su+0.046
Neg logit contributions
 big-0.108
 whole-0.099
 little-0.097
 huge-0.097
A-0.095
Token found
Token ID1043
Feature activation+3.62e+00

Pos logit contributions
anged+0.052
owing+0.049
and+0.049
os+0.049
 Su+0.047
Neg logit contributions
 big-0.033
 whole-0.028
 huge-0.027
 little-0.027
A-0.027
Token a
Token ID257
Feature activation+9.72e+00

Pos logit contributions
 big+0.036
 whole+0.034
+0.029
 little+0.028
 huge+0.028
Neg logit contributions
anged-0.125
and-0.120
 Su-0.118
 Kim-0.116
os-0.115
Token heavy
Token ID4334
Feature activation+5.98e+00

Pos logit contributions
 two+0.070
A+0.070
 entire+0.055
 whole+0.054
+0.051
Neg logit contributions
 Acc-0.319
 Su-0.306
and-0.304
anged-0.303
owing-0.298
Token box
Token ID3091
Feature activation+5.48e+00

Pos logit contributions
A+0.091
 big+0.084
 whole+0.080
 two+0.077
 huge+0.072
Neg logit contributions
anged-0.176
owing-0.169
 Su-0.158
ress-0.158
 Acc-0.152
Token.
Token ID13
Feature activation+1.67e+00

Pos logit contributions
 big+0.108
A+0.097
 whole+0.096
 little+0.094
 huge+0.093
Neg logit contributions
anged-0.150
owing-0.148
and-0.144
os-0.139
 Su-0.131
Token He
Token ID679
Feature activation+4.79e+00

Pos logit contributions
 big+0.035
 whole+0.030
 little+0.030
 huge+0.029
A+0.028
Neg logit contributions
anged-0.048
owing-0.046
and-0.046
os-0.045
 Su-0.042
Token wanted
Token ID2227
Feature activation+5.02e+00

Pos logit contributions
 big+0.088
 whole+0.082
 huge+0.076
 two+0.072
 little+0.072
Neg logit contributions
anged-0.135
and-0.126
os-0.122
years-0.120
 Su-0.117
Token little
Token ID1310
Feature activation+4.48e+00

Pos logit contributions
 big+0.057
 two+0.049
 whole+0.046
A+0.043
 huge+0.039
Neg logit contributions
anged-0.328
owing-0.320
and-0.319
 Su-0.318
 Acc-0.315
Token girl
Token ID2576
Feature activation+3.61e+00

Pos logit contributions
 big+0.072
 whole+0.061
A+0.059
 huge+0.058
 little+0.056
Neg logit contributions
anged-0.142
owing-0.137
and-0.134
os-0.133
 Su-0.131
Token named
Token ID3706
Feature activation-3.38e+00

Pos logit contributions
 big+0.094
 little+0.083
 whole+0.079
 huge+0.077
 two+0.076
Neg logit contributions
and-0.087
anged-0.086
owing-0.086
ules-0.082
ooters-0.081
Token Sue
Token ID26089
Feature activation-2.20e-01

Pos logit contributions
anged+0.087
owing+0.083
and+0.081
os+0.080
ules+0.077
Neg logit contributions
 big-0.085
 little-0.074
 whole-0.073
 huge-0.070
A-0.070
Token found
Token ID1043
Feature activation+3.74e+00

Pos logit contributions
anged+0.007
and+0.007
owing+0.007
os+0.007
 Su+0.006
Neg logit contributions
 big-0.004
 whole-0.003
 little-0.003
 huge-0.003
A-0.003
Token a
Token ID257
Feature activation+9.72e+00

Pos logit contributions
 big+0.036
 whole+0.029
 little+0.025
 huge+0.025
A+0.023
Neg logit contributions
anged-0.139
and-0.135
owing-0.132
 Su-0.132
os-0.131
Token shiny
Token ID22441
Feature activation+6.48e+00

Pos logit contributions
 two+0.078
A+0.076
 entire+0.061
+0.057
 whole+0.056
Neg logit contributions
 Acc-0.311
 Su-0.296
and-0.293
 figuring-0.286
owing-0.286
Token coin
Token ID10752
Feature activation+4.21e+00

Pos logit contributions
A+0.112
 whole+0.100
 big+0.100
 two+0.096
There+0.092
Neg logit contributions
anged-0.165
owing-0.158
 Su-0.153
ress-0.151
oot-0.146
Token on
Token ID319
Feature activation+1.64e+00

Pos logit contributions
 big+0.088
A+0.083
 whole+0.079
 huge+0.078
 little+0.077
Neg logit contributions
owing-0.108
anged-0.105
and-0.098
os-0.096
rings-0.089
Token the
Token ID262
Feature activation+8.68e-01

Pos logit contributions
 big+0.043
 whole+0.040
 huge+0.039
 little+0.038
A+0.037
Neg logit contributions
anged-0.031
 Su-0.030
and-0.029
 Jos-0.028
 Kim-0.028
Token ground
Token ID2323
Feature activation+3.07e+00

Pos logit contributions
 big+0.014
A+0.013
 huge+0.012
 whole+0.012
 two+0.012
Neg logit contributions
and-0.024
anged-0.023
owing-0.023
reating-0.022
ram-0.022
Token were
Token ID547
Feature activation+4.09e+00

Pos logit contributions
 big+0.009
 whole+0.008
 little+0.008
A+0.007
 huge+0.007
Neg logit contributions
anged-0.016
os-0.015
owing-0.015
and-0.015
 Su-0.014
Token best
Token ID1266
Feature activation+1.28e+00

Pos logit contributions
 big+0.088
 whole+0.074
 little+0.073
 huge+0.072
 two+0.071
Neg logit contributions
anged-0.119
and-0.113
owing-0.112
os-0.108
 Acc-0.103
Token friends
Token ID2460
Feature activation+1.89e+00

Pos logit contributions
 big+0.026
 whole+0.023
 little+0.022
 huge+0.022
A+0.021
Neg logit contributions
anged-0.037
and-0.035
owing-0.035
os-0.034
 Su-0.033
Token.
Token ID13
Feature activation+1.71e+00

Pos logit contributions
 big+0.027
 little+0.021
 whole+0.021
 huge+0.020
 two+0.020
Neg logit contributions
anged-0.067
and-0.064
owing-0.063
os-0.062
azes-0.061
Token On
Token ID1550
Feature activation+5.22e+00

Pos logit contributions
 big+0.033
 little+0.027
 two+0.027
 whole+0.025
 huge+0.025
Neg logit contributions
owing-0.056
anged-0.054
and-0.053
beh-0.052
years-0.050
Token a
Token ID257
Feature activation+9.71e+00

Pos logit contributions
 big+0.109
 whole+0.096
 little+0.093
 huge+0.091
 sharp+0.090
Neg logit contributions
anged-0.140
os-0.132
owing-0.130
and-0.130
 Su-0.129
Token sunny
Token ID27737
Feature activation+6.66e+00

Pos logit contributions
 big+0.126
 two+0.122
A+0.116
 sharp+0.112
 whole+0.111
Neg logit contributions
and-0.283
anged-0.273
os-0.270
 Su-0.269
 Jos-0.259
Token afternoon
Token ID6672
Feature activation+5.49e+00

Pos logit contributions
 big+0.155
 whole+0.136
 huge+0.133
 little+0.133
A+0.131
Neg logit contributions
anged-0.166
and-0.158
owing-0.157
os-0.155
 Su-0.149
Token,
Token ID11
Feature activation-3.28e-01

Pos logit contributions
 big+0.098
 little+0.083
 whole+0.080
 two+0.078
 huge+0.076
Neg logit contributions
anged-0.174
owing-0.170
and-0.168
os-0.165
 Acc-0.159
Token they
Token ID484
Feature activation+4.08e+00

Pos logit contributions
anged+0.008
and+0.007
owing+0.007
os+0.007
 Su+0.007
Neg logit contributions
 big-0.009
 whole-0.008
 little-0.007
 huge-0.007
A-0.007
Token decided
Token ID3066
Feature activation+2.93e+00

Pos logit contributions
 big+0.080
 whole+0.068
 little+0.066
 huge+0.066
A+0.064
Neg logit contributions
anged-0.120
and-0.114
owing-0.114
os-0.113
 Su-0.108
Token little
Token ID1310
Feature activation+4.40e+00

Pos logit contributions
 big+0.068
 whole+0.056
 two+0.054
A+0.050
 huge+0.049
Neg logit contributions
anged-0.327
owing-0.318
and-0.317
 Su-0.313
 Acc-0.310
Token boy
Token ID2933
Feature activation+2.84e+00

Pos logit contributions
 big+0.069
 whole+0.057
 huge+0.055
A+0.054
 little+0.054
Neg logit contributions
anged-0.144
owing-0.139
and-0.138
os-0.136
 Su-0.132
Token named
Token ID3706
Feature activation-2.97e+00

Pos logit contributions
 big+0.073
 little+0.064
 whole+0.061
 huge+0.060
 two+0.059
Neg logit contributions
and-0.069
owing-0.068
anged-0.067
ules-0.066
 Him-0.065
Token Tim
Token ID5045
Feature activation-7.91e-01

Pos logit contributions
anged+0.048
and+0.044
owing+0.043
os+0.043
 Su+0.040
Neg logit contributions
 big-0.098
 whole-0.089
 little-0.088
 huge-0.087
A-0.086
Token found
Token ID1043
Feature activation+3.76e+00

Pos logit contributions
anged+0.024
owing+0.022
os+0.022
and+0.022
 Su+0.021
Neg logit contributions
 big-0.015
 whole-0.013
 little-0.012
 huge-0.012
A-0.012
Token a
Token ID257
Feature activation+9.69e+00

Pos logit contributions
 big+0.037
 whole+0.034
 little+0.028
 huge+0.028
+0.026
Neg logit contributions
anged-0.133
and-0.127
 Su-0.125
os-0.124
owing-0.124
Token sc
Token ID629
Feature activation-1.37e+00

Pos logit contributions
A+0.074
 two+0.072
 entire+0.061
+0.054
 whole+0.053
Neg logit contributions
 Acc-0.313
 Su-0.298
and-0.295
anged-0.292
owing-0.290
Tokenooter
Token ID25141
Feature activation+2.77e+00

Pos logit contributions
anged+0.020
and+0.018
os+0.018
owing+0.017
 Su+0.017
Neg logit contributions
 big-0.048
 little-0.043
 whole-0.043
 huge-0.042
 sharp-0.041
Token.
Token ID13
Feature activation+1.63e+00

Pos logit contributions
 big+0.052
 whole+0.044
 little+0.043
 huge+0.043
A+0.042
Neg logit contributions
anged-0.083
owing-0.080
and-0.078
os-0.077
 Su-0.074
Token It
Token ID632
Feature activation+6.37e+00

Pos logit contributions
 big+0.035
 whole+0.029
 little+0.029
 huge+0.029
A+0.028
Neg logit contributions
anged-0.046
owing-0.044
and-0.044
os-0.043
years-0.041
Token was
Token ID373
Feature activation+4.80e+00

Pos logit contributions
 big+0.088
 whole+0.085
 little+0.079
 two+0.077
A+0.075
Neg logit contributions
os-0.194
anged-0.194
 Su-0.186
 Kim-0.184
 Jos-0.180
Token little
Token ID1310
Feature activation+4.53e+00

Pos logit contributions
 big+0.077
 two+0.069
 whole+0.069
A+0.062
 huge+0.061
Neg logit contributions
anged-0.302
and-0.297
owing-0.292
 Su-0.290
 Acc-0.289
Token boy
Token ID2933
Feature activation+2.94e+00

Pos logit contributions
 big+0.071
 whole+0.059
 huge+0.057
A+0.056
 little+0.056
Neg logit contributions
anged-0.148
owing-0.142
and-0.141
os-0.139
 Su-0.135
Token named
Token ID3706
Feature activation-3.00e+00

Pos logit contributions
 big+0.076
 little+0.066
 whole+0.064
 huge+0.062
 two+0.062
Neg logit contributions
and-0.071
anged-0.069
owing-0.069
ules-0.068
 Him-0.066
Token Tim
Token ID5045
Feature activation-8.57e-01

Pos logit contributions
anged+0.048
and+0.044
owing+0.043
os+0.043
 Su+0.040
Neg logit contributions
 big-0.098
 whole-0.090
 little-0.088
 huge-0.088
A-0.087
Token found
Token ID1043
Feature activation+3.74e+00

Pos logit contributions
anged+0.025
os+0.024
owing+0.024
and+0.024
 Su+0.022
Neg logit contributions
 big-0.017
 whole-0.014
 little-0.014
 huge-0.014
A-0.014
Token a
Token ID257
Feature activation+9.69e+00

Pos logit contributions
 big+0.036
 whole+0.033
 little+0.028
 huge+0.027
A+0.025
Neg logit contributions
anged-0.134
and-0.128
os-0.127
owing-0.126
 Su-0.125
Token long
Token ID890
Feature activation+6.45e+00

Pos logit contributions
A+0.076
 two+0.071
 entire+0.061
 whole+0.054
+0.054
Neg logit contributions
 Acc-0.307
 Su-0.290
anged-0.289
owing-0.287
and-0.287
Token string
Token ID4731
Feature activation+4.71e+00

Pos logit contributions
 big+0.096
A+0.092
 whole+0.087
 huge+0.085
 two+0.080
Neg logit contributions
anged-0.189
owing-0.180
 Su-0.168
ress-0.166
 Kim-0.164
Token in
Token ID287
Feature activation+6.56e-01

Pos logit contributions
 big+0.092
A+0.085
 whole+0.083
 huge+0.081
 little+0.079
Neg logit contributions
anged-0.128
owing-0.126
and-0.120
os-0.119
ooting-0.109
Token his
Token ID465
Feature activation+1.11e+00

Pos logit contributions
 big+0.016
 whole+0.014
 little+0.014
 huge+0.014
A+0.013
Neg logit contributions
anged-0.016
and-0.015
os-0.015
 Su-0.015
owing-0.015
Token empty
Token ID6565
Feature activation+5.01e+00

Pos logit contributions
 big+0.019
 whole+0.017
A+0.016
 huge+0.016
 little+0.016
Neg logit contributions
anged-0.032
and-0.030
owing-0.030
os-0.029
azes-0.029
Token will
Token ID481
Feature activation+2.51e-01

Pos logit contributions
 big+0.028
A+0.026
 little+0.025
 huge+0.025
 whole+0.024
Neg logit contributions
anged-0.037
os-0.036
owing-0.036
and-0.034
 Acc-0.032
Token take
Token ID1011
Feature activation-2.47e+00

Pos logit contributions
 big+0.005
 whole+0.005
 huge+0.005
A+0.005
 little+0.005
Neg logit contributions
anged-0.006
owing-0.006
and-0.006
 Su-0.006
os-0.006
Token the
Token ID262
Feature activation-1.82e+00

Pos logit contributions
anged+0.073
owing+0.071
and+0.070
os+0.069
azes+0.066
Neg logit contributions
 big-0.052
 whole-0.043
 little-0.043
 two-0.042
 huge-0.041
Token rh
Token ID9529
Feature activation-6.41e+00

Pos logit contributions
anged+0.049
owing+0.047
and+0.047
 supervision+0.046
os+0.046
Neg logit contributions
 big-0.031
A-0.029
 whole-0.027
 huge-0.026
 little-0.025
Tokenin
Token ID259
Feature activation-3.34e+00

Pos logit contributions
anged+0.174
and+0.173
owing+0.171
os+0.170
years+0.155
Neg logit contributions
 big-0.137
 whole-0.117
 huge-0.116
 little-0.115
 cool-0.111
Tokenoc
Token ID420
Feature activation-1.46e+01

Pos logit contributions
anged+0.075
and+0.071
owing+0.071
os+0.070
 Su+0.065
Neg logit contributions
 big-0.088
 whole-0.078
 little-0.077
 huge-0.077
A-0.075
Tokeneros
Token ID27498
Feature activation-3.89e+00

Pos logit contributions
and+0.366
anged+0.351
 Su+0.350
 Jos+0.349
os+0.339
Neg logit contributions
 big-0.329
 whole-0.288
 little-0.287
 two-0.283
 sharp-0.278
Token back
Token ID736
Feature activation-3.72e+00

Pos logit contributions
anged+0.101
and+0.100
owing+0.099
os+0.097
 Su+0.089
Neg logit contributions
 big-0.087
 whole-0.075
 huge-0.075
 little-0.074
A-0.074
Token to
Token ID284
Feature activation-3.12e+00

Pos logit contributions
anged+0.107
and+0.102
owing+0.102
os+0.100
 Su+0.096
Neg logit contributions
 big-0.076
 whole-0.065
 little-0.064
 huge-0.063
A-0.061
Token the
Token ID262
Feature activation-8.01e-01

Pos logit contributions
anged+0.099
and+0.096
owing+0.095
os+0.093
years+0.088
Neg logit contributions
 big-0.062
 little-0.051
 whole-0.050
 huge-0.049
 two-0.047
Token zoo
Token ID26626
Feature activation-1.89e+00

Pos logit contributions
anged+0.021
and+0.021
ric+0.020
 Acc+0.020
owing+0.019
Neg logit contributions
 big-0.013
A-0.013
 whole-0.012
 huge-0.011
 little-0.011
Token care
Token ID1337
Feature activation-6.99e-01

Pos logit contributions
anged+0.068
owing+0.065
and+0.065
os+0.064
 Su+0.061
Neg logit contributions
 big-0.048
 whole-0.040
 little-0.040
 huge-0.039
 two-0.038
Token of
Token ID286
Feature activation-4.84e+00

Pos logit contributions
anged+0.020
and+0.019
owing+0.019
os+0.019
 Su+0.018
Neg logit contributions
 big-0.014
 whole-0.012
 little-0.012
 huge-0.012
A-0.011
Token the
Token ID262
Feature activation-2.67e+00

Pos logit contributions
anged+0.125
and+0.118
owing+0.118
os+0.116
 Su+0.109
Neg logit contributions
 big-0.115
 whole-0.100
 little-0.099
 huge-0.097
A-0.095
Token rh
Token ID9529
Feature activation-6.55e+00

Pos logit contributions
anged+0.075
and+0.073
owing+0.072
os+0.069
 Su+0.069
Neg logit contributions
 big-0.045
A-0.042
 whole-0.038
 huge-0.038
 two-0.036
Tokenin
Token ID259
Feature activation-3.74e+00

Pos logit contributions
anged+0.181
and+0.179
owing+0.178
os+0.176
years+0.162
Neg logit contributions
 big-0.138
 whole-0.117
 huge-0.117
 little-0.115
 cool-0.112
Tokenoc
Token ID420
Feature activation-1.45e+01

Pos logit contributions
anged+0.084
and+0.079
owing+0.079
os+0.077
 Su+0.072
Neg logit contributions
 big-0.100
 whole-0.088
 little-0.087
 huge-0.087
A-0.085
Tokeneros
Token ID27498
Feature activation-3.74e+00

Pos logit contributions
and+0.371
 Su+0.358
 Jos+0.355
 Jas+0.350
anged+0.348
Neg logit contributions
 big-0.305
 whole-0.271
 little-0.269
 two-0.266
 top-0.264
Token and
Token ID290
Feature activation-4.91e+00

Pos logit contributions
anged+0.112
owing+0.109
and+0.109
os+0.107
 Su+0.101
Neg logit contributions
 big-0.070
 whole-0.059
A-0.058
 huge-0.057
 little-0.057
Token it
Token ID340
Feature activation+2.54e+00

Pos logit contributions
anged+0.136
and+0.128
owing+0.128
os+0.127
 Su+0.123
Neg logit contributions
 big-0.101
 whole-0.091
A-0.086
 huge-0.086
 little-0.085
Token needed
Token ID2622
Feature activation+4.43e+00

Pos logit contributions
 big+0.044
A+0.042
 whole+0.039
 little+0.038
 huge+0.037
Neg logit contributions
os-0.071
anged-0.071
owing-0.068
and-0.067
 Acc-0.065
Token to
Token ID284
Feature activation-1.09e+00

Pos logit contributions
 big+0.096
 whole+0.085
 huge+0.082
 little+0.082
A+0.080
Neg logit contributions
anged-0.118
and-0.114
owing-0.112
os-0.109
 Su-0.107
Token Sammy
Token ID44067
Feature activation-2.28e+00

Pos logit contributions
owing+0.119
anged+0.116
and+0.114
os+0.110
ules+0.108
Neg logit contributions
 big-0.087
 little-0.080
 two-0.076
 whole-0.070
 cool-0.068
Token would
Token ID561
Feature activation+2.87e+00

Pos logit contributions
anged+0.067
and+0.065
owing+0.065
os+0.062
 Su+0.060
Neg logit contributions
 big-0.046
 whole-0.039
 little-0.038
 huge-0.037
 two-0.036
Token often
Token ID1690
Feature activation+6.71e-01

Pos logit contributions
 big+0.068
 whole+0.059
 little+0.058
 huge+0.058
A+0.057
Neg logit contributions
anged-0.073
and-0.069
owing-0.069
os-0.068
 Su-0.064
Token share
Token ID2648
Feature activation+9.44e-02

Pos logit contributions
 big+0.015
 whole+0.013
 little+0.013
 huge+0.012
A+0.012
Neg logit contributions
anged-0.018
and-0.017
owing-0.017
os-0.017
 Su-0.016
Token av
Token ID1196
Feature activation-7.64e+00

Pos logit contributions
 big+0.002
 little+0.002
 whole+0.002
 huge+0.002
 two+0.002
Neg logit contributions
anged-0.003
owing-0.003
and-0.003
os-0.003
azes-0.002
Tokenoc
Token ID420
Feature activation-1.44e+01

Pos logit contributions
anged+0.217
 Su+0.210
 Acc+0.206
owing+0.205
 Jos+0.204
Neg logit contributions
 big-0.161
 whole-0.138
 little-0.130
 huge-0.127
A-0.125
Tokenados
Token ID22484
Feature activation-2.85e+00

Pos logit contributions
 Jos+0.262
 Su+0.255
and+0.248
anged+0.243
 Jas+0.242
Neg logit contributions
 big-0.394
 two-0.374
 sharp-0.362
 little-0.360
 top-0.344
Token and
Token ID290
Feature activation-5.77e+00

Pos logit contributions
anged+0.086
owing+0.078
 Su+0.078
 Acc+0.077
years+0.076
Neg logit contributions
 big-0.060
 whole-0.050
 little-0.049
 huge-0.047
 two-0.047
Token other
Token ID584
Feature activation-4.34e+00

Pos logit contributions
anged+0.168
owing+0.164
and+0.162
os+0.159
ules+0.151
Neg logit contributions
 big-0.123
 little-0.103
 whole-0.102
 two-0.098
 huge-0.098
Token y
Token ID331
Feature activation-2.55e+00

Pos logit contributions
anged+0.133
owing+0.126
and+0.124
os+0.124
 Su+0.119
Neg logit contributions
 big-0.086
 little-0.073
 whole-0.071
 huge-0.069
 two-0.067
Tokenummy
Token ID13513
Feature activation-2.44e+00

Pos logit contributions
anged+0.061
os+0.059
and+0.059
owing+0.058
 Su+0.057
Neg logit contributions
 big-0.062
 whole-0.054
 little-0.054
 huge-0.052
 two-0.051
Token try
Token ID1949
Feature activation+9.28e-01

Pos logit contributions
anged+0.159
owing+0.151
and+0.150
os+0.148
 Su+0.141
Neg logit contributions
 big-0.121
 whole-0.105
 little-0.103
 huge-0.101
A-0.100
Token to
Token ID284
Feature activation-2.84e+00

Pos logit contributions
 big+0.020
 little+0.016
 whole+0.016
 huge+0.016
 two+0.015
Neg logit contributions
anged-0.028
owing-0.026
os-0.026
and-0.026
years-0.025
Token grab
Token ID5552
Feature activation-2.65e+00

Pos logit contributions
owing+0.062
anged+0.060
and+0.059
os+0.056
 Su+0.054
Neg logit contributions
 big-0.060
A-0.058
 whole-0.056
 little-0.055
 huge-0.054
Token the
Token ID262
Feature activation-2.35e+00

Pos logit contributions
anged+0.072
and+0.068
owing+0.067
os+0.067
 Su+0.065
Neg logit contributions
 big-0.057
 whole-0.050
 little-0.049
 huge-0.048
A-0.048
Token av
Token ID1196
Feature activation-6.69e+00

Pos logit contributions
anged+0.058
owing+0.057
and+0.056
 supervision+0.055
os+0.054
Neg logit contributions
A-0.045
 big-0.041
 whole-0.038
 huge-0.037
 little-0.035
Tokenoc
Token ID420
Feature activation-1.44e+01

Pos logit contributions
anged+0.157
owing+0.148
and+0.148
os+0.144
 Su+0.138
Neg logit contributions
 big-0.172
 whole-0.152
 little-0.149
 huge-0.148
A-0.145
Tokenados
Token ID22484
Feature activation-9.60e-01

Pos logit contributions
and+0.231
anged+0.224
 Su+0.219
 Jos+0.218
os+0.209
Neg logit contributions
 big-0.457
 little-0.412
 whole-0.411
 two-0.408
 top-0.403
Token.
Token ID13
Feature activation-1.73e+00

Pos logit contributions
and+0.026
anged+0.026
owing+0.025
os+0.025
ram+0.023
Neg logit contributions
 big-0.020
 whole-0.017
A-0.017
 huge-0.017
 little-0.017
Token But
Token ID887
Feature activation-3.96e+00

Pos logit contributions
owing+0.050
anged+0.050
os+0.047
and+0.047
years+0.046
Neg logit contributions
 big-0.038
 little-0.032
A-0.031
 two-0.031
 whole-0.030
Token the
Token ID262
Feature activation+1.34e+00

Pos logit contributions
anged+0.117
and+0.116
owing+0.114
os+0.111
azes+0.106
Neg logit contributions
 big-0.081
 little-0.068
 two-0.067
 whole-0.067
 huge-0.065
Token rock
Token ID3881
Feature activation+1.20e+00

Pos logit contributions
 big+0.025
A+0.025
 whole+0.023
 huge+0.022
 little+0.022
Neg logit contributions
anged-0.033
owing-0.032
and-0.031
 supervision-0.031
os-0.030
Token learn
Token ID2193
Feature activation+6.45e-01

Pos logit contributions
anged+0.055
owing+0.054
and+0.053
os+0.052
 Su+0.051
Neg logit contributions
 big-0.043
 whole-0.038
 huge-0.037
A-0.036
 little-0.036
Token more
Token ID517
Feature activation-2.28e+00

Pos logit contributions
 big+0.014
 whole+0.012
 huge+0.012
 little+0.012
A+0.011
Neg logit contributions
anged-0.017
and-0.017
owing-0.016
os-0.016
 Su-0.016
Token about
Token ID546
Feature activation-3.09e+00

Pos logit contributions
anged+0.051
owing+0.048
and+0.048
os+0.047
 Su+0.044
Neg logit contributions
 big-0.062
 whole-0.055
 little-0.054
 huge-0.054
A-0.053
Token rh
Token ID9529
Feature activation-6.24e+00

Pos logit contributions
anged+0.089
owing+0.084
and+0.083
os+0.081
years+0.077
Neg logit contributions
 big-0.070
 little-0.060
 two-0.057
 huge-0.057
 whole-0.057
Tokenin
Token ID259
Feature activation-4.06e+00

Pos logit contributions
anged+0.152
and+0.145
owing+0.145
os+0.142
 Su+0.133
Neg logit contributions
 big-0.155
 whole-0.136
 little-0.134
 huge-0.132
A-0.130
Tokenoc
Token ID420
Feature activation-1.41e+01

Pos logit contributions
anged+0.112
and+0.105
owing+0.104
 Su+0.103
os+0.103
Neg logit contributions
 big-0.088
 whole-0.076
 little-0.076
 huge-0.073
 two-0.072
Tokener
Token ID263
Feature activation-5.60e+00

Pos logit contributions
and+0.296
 Jos+0.296
 Su+0.290
anged+0.280
 Jenna+0.278
Neg logit contributions
 big-0.348
 two-0.323
 little-0.311
 sharp-0.310
 whole-0.302
Tokenoses
Token ID4629
Feature activation-5.53e+00

Pos logit contributions
anged+0.175
and+0.166
owing+0.161
os+0.159
years+0.159
Neg logit contributions
 big-0.101
 little-0.083
 whole-0.083
 two-0.079
 huge-0.077
Token and
Token ID290
Feature activation-6.11e+00

Pos logit contributions
anged+0.173
and+0.165
owing+0.164
os+0.161
 Su+0.156
Neg logit contributions
 big-0.100
 whole-0.083
 little-0.081
 huge-0.079
 two-0.077
Token respect
Token ID2461
Feature activation-3.24e+00

Pos logit contributions
anged+0.160
owing+0.153
and+0.152
os+0.149
 Su+0.142
Neg logit contributions
 big-0.142
 whole-0.122
 little-0.122
 huge-0.119
 two-0.116
Token them
Token ID606
Feature activation-2.82e+00

Pos logit contributions
anged+0.102
owing+0.096
and+0.093
azes+0.092
os+0.092
Neg logit contributions
 big-0.064
 little-0.052
 whole-0.050
 two-0.049
 huge-0.048
Token and
Token ID290
Feature activation-4.47e+00

Pos logit contributions
anged+0.011
and+0.011
owing+0.011
os+0.011
 Su+0.010
Neg logit contributions
 big-0.006
 whole-0.005
 little-0.005
 huge-0.005
A-0.005
Token picked
Token ID6497
Feature activation-2.89e+00

Pos logit contributions
anged+0.119
owing+0.113
and+0.113
os+0.110
 Su+0.109
Neg logit contributions
 big-0.094
 whole-0.085
 huge-0.081
A-0.081
 little-0.078
Token the
Token ID262
Feature activation-2.64e+00

Pos logit contributions
anged+0.081
and+0.078
owing+0.078
os+0.077
 Su+0.074
Neg logit contributions
 big-0.058
 whole-0.050
 little-0.049
A-0.048
 huge-0.048
Token right
Token ID826
Feature activation-4.62e+00

Pos logit contributions
and+0.047
owing+0.047
anged+0.044
 supervision+0.044
 Him+0.043
Neg logit contributions
 big-0.065
A-0.064
 huge-0.059
 whole-0.058
ball-0.057
Token av
Token ID1196
Feature activation-6.17e+00

Pos logit contributions
anged+0.133
owing+0.132
and+0.128
 Su+0.122
os+0.122
Neg logit contributions
 big-0.081
A-0.074
 whole-0.069
 huge-0.068
 little-0.066
Tokenoc
Token ID420
Feature activation-1.41e+01

Pos logit contributions
anged+0.155
owing+0.145
and+0.144
os+0.142
 Su+0.141
Neg logit contributions
 big-0.150
 whole-0.132
 little-0.128
 huge-0.127
A-0.124
Tokenados
Token ID22484
Feature activation-1.21e+00

Pos logit contributions
and+0.226
anged+0.223
 Jos+0.213
 Su+0.211
os+0.206
Neg logit contributions
 big-0.445
 little-0.405
 whole-0.404
 two-0.403
 top-0.390
Token.
Token ID13
Feature activation-8.41e-01

Pos logit contributions
anged+0.034
and+0.034
owing+0.034
os+0.033
 Su+0.031
Neg logit contributions
 big-0.025
 whole-0.021
 little-0.021
 huge-0.020
A-0.020
Token
Token ID198
Feature activation+1.87e+00

Pos logit contributions
anged+0.026
owing+0.025
os+0.023
and+0.023
years+0.022
Neg logit contributions
 big-0.017
 little-0.015
 two-0.015
A-0.014
 whole-0.014
Token
Token ID198
Feature activation+2.00e+00

Pos logit contributions
 big+0.031
A+0.024
 whole+0.024
 little+0.024
 huge+0.023
Neg logit contributions
anged-0.066
owing-0.063
os-0.059
 supervision-0.058
beh-0.057
TokenSadly
Token ID36725
Feature activation-1.63e+00

Pos logit contributions
 big+0.033
 whole+0.027
 little+0.026
 huge+0.025
A+0.024
Neg logit contributions
anged-0.066
and-0.064
owing-0.064
os-0.063
 Su-0.060
Token and
Token ID290
Feature activation-5.07e+00

Pos logit contributions
anged+0.015
owing+0.015
and+0.015
os+0.014
 Jos+0.014
Neg logit contributions
 big-0.012
 whole-0.011
A-0.011
 huge-0.011
 little-0.010
Token hugged
Token ID41130
Feature activation-3.91e+00

Pos logit contributions
anged+0.131
owing+0.121
 Su+0.121
os+0.121
and+0.119
Neg logit contributions
 big-0.112
 whole-0.103
A-0.096
 huge-0.095
 little-0.094
Token the
Token ID262
Feature activation-3.26e+00

Pos logit contributions
anged+0.131
owing+0.124
os+0.122
and+0.122
azes+0.120
Neg logit contributions
 big-0.072
 little-0.062
 two-0.057
 whole-0.056
 huge-0.053
Token rh
Token ID9529
Feature activation-6.22e+00

Pos logit contributions
anged+0.093
owing+0.090
and+0.090
os+0.087
 Su+0.086
Neg logit contributions
 big-0.055
A-0.048
 whole-0.046
 huge-0.045
 sharp-0.043
Tokenin
Token ID259
Feature activation-3.66e+00

Pos logit contributions
anged+0.165
and+0.163
owing+0.163
os+0.163
years+0.147
Neg logit contributions
 big-0.136
 whole-0.117
 huge-0.116
 little-0.114
 cool-0.111
Tokenoc
Token ID420
Feature activation-1.40e+01

Pos logit contributions
anged+0.088
and+0.083
owing+0.083
os+0.081
 Su+0.077
Neg logit contributions
 big-0.094
 whole-0.083
 little-0.082
 huge-0.080
A-0.078
Tokeneros
Token ID27498
Feature activation-4.00e+00

Pos logit contributions
and+0.371
 Jos+0.351
anged+0.351
 Su+0.350
 Jas+0.342
Neg logit contributions
 big-0.289
 little-0.254
 whole-0.253
 two-0.252
 top-0.248
Token.
Token ID13
Feature activation-1.66e+00

Pos logit contributions
anged+0.129
and+0.123
owing+0.123
os+0.121
 Su+0.117
Neg logit contributions
 big-0.069
 whole-0.057
 little-0.056
 huge-0.054
 two-0.052
Token He
Token ID679
Feature activation+2.79e+00

Pos logit contributions
anged+0.048
owing+0.048
and+0.045
os+0.044
beh+0.043
Neg logit contributions
 big-0.035
 little-0.030
 two-0.030
 whole-0.029
A-0.028
Token finally
Token ID3443
Feature activation+1.72e+00

Pos logit contributions
 big+0.059
 whole+0.055
A+0.051
 huge+0.050
 little+0.049
Neg logit contributions
anged-0.071
os-0.069
and-0.065
owing-0.065
 Su-0.064
Token understood
Token ID7247
Feature activation+4.77e-01

Pos logit contributions
 big+0.040
 whole+0.038
A+0.037
 little+0.035
 huge+0.035
Neg logit contributions
os-0.037
anged-0.037
 Su-0.036
 J-0.035
owing-0.034
Token you
Token ID345
Feature activation-8.42e-01

Pos logit contributions
anged+0.045
owing+0.043
and+0.042
os+0.041
azes+0.039
Neg logit contributions
 big-0.027
 little-0.023
A-0.022
 two-0.022
 whole-0.022
Token like
Token ID588
Feature activation-3.10e-01

Pos logit contributions
owing+0.020
os+0.020
anged+0.020
and+0.018
 Su+0.018
Neg logit contributions
 big-0.020
 huge-0.017
A-0.017
 little-0.017
 sharp-0.017
Token the
Token ID262
Feature activation-1.82e+00

Pos logit contributions
anged+0.010
owing+0.009
and+0.009
os+0.009
azes+0.009
Neg logit contributions
 big-0.006
 little-0.005
 whole-0.005
 huge-0.005
 two-0.005
Token rh
Token ID9529
Feature activation-6.02e+00

Pos logit contributions
anged+0.044
owing+0.043
os+0.042
and+0.042
 Su+0.041
Neg logit contributions
 big-0.036
A-0.034
 whole-0.032
 huge-0.031
 little-0.030
Tokenin
Token ID259
Feature activation-3.37e+00

Pos logit contributions
owing+0.157
os+0.156
anged+0.156
and+0.155
years+0.140
Neg logit contributions
 big-0.133
 huge-0.114
 whole-0.113
 little-0.111
 cool-0.110
Tokenoc
Token ID420
Feature activation-1.40e+01

Pos logit contributions
anged+0.075
owing+0.072
and+0.071
os+0.070
 Su+0.065
Neg logit contributions
 big-0.090
 whole-0.080
 little-0.078
 huge-0.078
A-0.077
Tokeneros
Token ID27498
Feature activation-3.98e+00

Pos logit contributions
and+0.374
anged+0.354
 Jos+0.351
 Su+0.349
 Jas+0.341
Neg logit contributions
 big-0.299
 whole-0.263
 little-0.260
 two-0.257
 sharp-0.252
Token.
Token ID13
Feature activation-1.81e+00

Pos logit contributions
anged+0.120
owing+0.119
and+0.119
os+0.117
 Su+0.108
Neg logit contributions
 big-0.070
 whole-0.059
 huge-0.058
A-0.058
 little-0.055
Token But
Token ID887
Feature activation-4.51e+00

Pos logit contributions
anged+0.049
owing+0.046
and+0.045
os+0.044
years+0.041
Neg logit contributions
 big-0.041
 little-0.036
A-0.036
 two-0.035
 whole-0.035
Token remember
Token ID3505
Feature activation+1.25e+00

Pos logit contributions
anged+0.137
and+0.131
owing+0.131
os+0.129
 Su+0.122
Neg logit contributions
 big-0.088
 little-0.073
 whole-0.073
 huge-0.070
 two-0.070
Token,
Token ID11
Feature activation-3.82e+00

Pos logit contributions
 big+0.025
 whole+0.021
 little+0.021
 huge+0.020
A+0.020
Neg logit contributions
anged-0.037
and-0.036
owing-0.035
os-0.035
 Su-0.033
Token to
Token ID284
Feature activation-1.98e+00

Pos logit contributions
 big+0.059
 whole+0.050
 little+0.049
 huge+0.048
A+0.047
Neg logit contributions
anged-0.071
owing-0.068
and-0.067
os-0.066
 Su-0.063
Token see
Token ID766
Feature activation-2.56e+00

Pos logit contributions
anged+0.047
owing+0.046
os+0.045
 Su+0.045
and+0.044
Neg logit contributions
 big-0.044
 whole-0.040
 huge-0.040
A-0.039
 little-0.039
Token the
Token ID262
Feature activation-1.23e+00

Pos logit contributions
anged+0.074
owing+0.070
and+0.070
os+0.068
azes+0.065
Neg logit contributions
 big-0.055
 little-0.046
 whole-0.045
 huge-0.045
 two-0.043
Token rh
Token ID9529
Feature activation-5.83e+00

Pos logit contributions
anged+0.029
owing+0.028
ric+0.027
and+0.027
os+0.027
Neg logit contributions
 big-0.025
A-0.024
 whole-0.022
 huge-0.021
 little-0.021
Tokenin
Token ID259
Feature activation-2.96e+00

Pos logit contributions
os+0.149
owing+0.149
and+0.148
anged+0.148
years+0.132
Neg logit contributions
 big-0.132
 huge-0.114
 whole-0.112
 little-0.111
 cool-0.110
Tokenoc
Token ID420
Feature activation-1.40e+01

Pos logit contributions
anged+0.063
owing+0.060
os+0.059
and+0.059
 Su+0.053
Neg logit contributions
 big-0.082
 whole-0.072
 huge-0.071
 little-0.071
A-0.070
Tokeneros
Token ID27498
Feature activation-3.55e+00

Pos logit contributions
and+0.348
anged+0.335
 Su+0.321
os+0.319
 Jos+0.319
Neg logit contributions
 big-0.324
 whole-0.291
 little-0.284
 two-0.280
 huge-0.274
Token anymore
Token ID7471
Feature activation-2.12e+00

Pos logit contributions
owing+0.099
and+0.098
anged+0.097
os+0.097
 Him+0.087
Neg logit contributions
 big-0.069
A-0.060
 whole-0.059
 huge-0.059
 little-0.057
Token.
Token ID13
Feature activation-1.22e+00

Pos logit contributions
and+0.055
os+0.054
owing+0.051
anged+0.051
 Him+0.048
Neg logit contributions
 big-0.044
 huge-0.041
 whole-0.040
A-0.037
 large-0.037
Token He
Token ID679
Feature activation+2.79e+00

Pos logit contributions
anged+0.029
and+0.027
owing+0.027
os+0.027
 Su+0.025
Neg logit contributions
 big-0.031
 whole-0.028
 little-0.027
 huge-0.027
A-0.026
Token is
Token ID318
Feature activation+1.82e+00

Pos logit contributions
 big+0.049
A+0.047
 whole+0.046
 huge+0.042
 little+0.041
Neg logit contributions
os-0.078
anged-0.076
owing-0.076
atted-0.068
beh-0.066
Token to
Token ID284
Feature activation-2.10e+00

Pos logit contributions
 big+0.057
 whole+0.049
 little+0.048
 huge+0.047
A+0.047
Neg logit contributions
anged-0.072
owing-0.069
and-0.068
os-0.067
 Su-0.064
Token pet
Token ID4273
Feature activation-5.56e+00

Pos logit contributions
anged+0.051
 Su+0.049
owing+0.049
os+0.048
and+0.047
Neg logit contributions
 big-0.045
 whole-0.042
 huge-0.040
A-0.040
 two-0.039
Token the
Token ID262
Feature activation-2.25e+00

Pos logit contributions
anged+0.152
and+0.145
owing+0.145
os+0.144
 Su+0.137
Neg logit contributions
 big-0.119
 whole-0.103
 little-0.100
 huge-0.100
A-0.097
Token rh
Token ID9529
Feature activation-6.05e+00

Pos logit contributions
anged+0.056
owing+0.054
and+0.054
os+0.052
ric+0.051
Neg logit contributions
 big-0.044
A-0.041
 whole-0.038
 huge-0.038
 little-0.036
Tokenin
Token ID259
Feature activation-3.56e+00

Pos logit contributions
os+0.156
owing+0.155
and+0.155
anged+0.155
years+0.138
Neg logit contributions
 big-0.136
 huge-0.118
 whole-0.117
 little-0.115
 cool-0.114
Tokenoc
Token ID420
Feature activation-1.40e+01

Pos logit contributions
anged+0.076
owing+0.073
and+0.073
os+0.072
 Su+0.066
Neg logit contributions
 big-0.097
 whole-0.086
 huge-0.084
 little-0.084
A-0.083
Tokeneros
Token ID27498
Feature activation-3.77e+00

Pos logit contributions
and+0.366
anged+0.348
 Jos+0.343
 Su+0.342
 Jas+0.334
Neg logit contributions
 big-0.299
 whole-0.264
 little-0.261
 two-0.259
 top-0.256
Token.
Token ID13
Feature activation-1.65e+00

Pos logit contributions
owing+0.111
anged+0.110
os+0.110
and+0.110
 Su+0.101
Neg logit contributions
 big-0.068
 whole-0.057
 huge-0.057
A-0.056
 little-0.054
Token It
Token ID632
Feature activation+3.40e+00

Pos logit contributions
anged+0.044
and+0.042
owing+0.042
os+0.041
 Su+0.039
Neg logit contributions
 big-0.037
 whole-0.032
 little-0.032
 huge-0.031
A-0.030
Token looked
Token ID3114
Feature activation+2.59e+00

Pos logit contributions
 big+0.059
 whole+0.056
A+0.053
 huge+0.050
 large+0.047
Neg logit contributions
os-0.096
anged-0.090
owing-0.089
beh-0.087
 Acc-0.086
Token so
Token ID523
Feature activation-1.56e+00

Pos logit contributions
 big+0.039
 whole+0.037
A+0.036
 two+0.033
 little+0.032
Neg logit contributions
anged-0.077
owing-0.075
and-0.075
 Su-0.073
os-0.072
Token to
Token ID284
Feature activation-6.20e-01

Pos logit contributions
anged+0.006
and+0.006
owing+0.006
os+0.005
 Jos+0.005
Neg logit contributions
 big-0.004
 whole-0.003
 little-0.003
 huge-0.003
 two-0.003
Token enjoy
Token ID2883
Feature activation+5.91e-01

Pos logit contributions
anged+0.016
owing+0.015
and+0.015
os+0.015
 Su+0.014
Neg logit contributions
 big-0.014
 whole-0.012
 little-0.012
 huge-0.012
A-0.012
Token eating
Token ID6600
Feature activation-1.40e+00

Pos logit contributions
 big+0.013
 whole+0.011
 little+0.011
 huge+0.011
 two+0.010
Neg logit contributions
anged-0.017
and-0.016
owing-0.016
os-0.016
 Su-0.015
Token her
Token ID607
Feature activation-4.30e+00

Pos logit contributions
anged+0.043
owing+0.041
and+0.041
os+0.040
 Su+0.038
Neg logit contributions
 big-0.027
 whole-0.022
 little-0.022
 huge-0.021
 two-0.021
Token av
Token ID1196
Feature activation-6.61e+00

Pos logit contributions
anged+0.128
owing+0.125
and+0.124
os+0.122
 Su+0.115
Neg logit contributions
 big-0.074
A-0.064
 whole-0.063
 huge-0.061
 little-0.060
Tokenoc
Token ID420
Feature activation-1.40e+01

Pos logit contributions
anged+0.166
and+0.155
owing+0.154
 Su+0.153
os+0.152
Neg logit contributions
 big-0.161
 whole-0.142
 little-0.136
 huge-0.136
A-0.132
Tokenados
Token ID22484
Feature activation-1.27e+00

Pos logit contributions
and+0.233
 Jos+0.230
 Su+0.222
anged+0.220
 Jas+0.212
Neg logit contributions
 big-0.433
 little-0.391
 two-0.389
 whole-0.386
 sharp-0.382
Token with
Token ID351
Feature activation-3.61e+00

Pos logit contributions
anged+0.037
and+0.036
owing+0.036
os+0.035
 Su+0.033
Neg logit contributions
 big-0.024
 whole-0.021
 little-0.020
 huge-0.020
A-0.020
Token a
Token ID257
Feature activation+3.93e+00

Pos logit contributions
anged+0.090
and+0.086
owing+0.086
os+0.084
 Su+0.080
Neg logit contributions
 big-0.086
 whole-0.075
 little-0.074
 huge-0.073
A-0.072
Token smile
Token ID8212
Feature activation-1.82e+00

Pos logit contributions
 big+0.079
A+0.075
 whole+0.070
 two+0.069
 huge+0.068
Neg logit contributions
anged-0.098
and-0.097
owing-0.096
os-0.092
 Su-0.089
Token on
Token ID319
Feature activation-5.56e+00

Pos logit contributions
anged+0.055
owing+0.053
and+0.053
os+0.052
 Su+0.050
Neg logit contributions
 big-0.034
 whole-0.029
 little-0.028
 huge-0.028
A-0.027
Token thanked
Token ID26280
Feature activation-4.16e+00

Pos logit contributions
anged+0.024
owing+0.023
and+0.023
os+0.022
 Su+0.021
Neg logit contributions
 big-0.020
 whole-0.017
 little-0.017
 huge-0.016
A-0.016
Token the
Token ID262
Feature activation-2.75e+00

Pos logit contributions
anged+0.113
owing+0.107
and+0.105
os+0.100
azes+0.099
Neg logit contributions
 big-0.105
 little-0.093
 two-0.089
 a-0.086
 huge-0.085
Token elderly
Token ID13830
Feature activation-5.05e+00

Pos logit contributions
anged+0.079
owing+0.075
and+0.075
os+0.074
 Su+0.071
Neg logit contributions
 big-0.055
 whole-0.046
 little-0.045
A-0.045
 huge-0.045
Token rh
Token ID9529
Feature activation-6.29e+00

Pos logit contributions
anged+0.156
and+0.151
owing+0.147
os+0.145
azes+0.139
Neg logit contributions
 big-0.100
 little-0.084
 whole-0.084
 two-0.080
 huge-0.080
Tokenin
Token ID259
Feature activation-3.99e+00

Pos logit contributions
anged+0.182
owing+0.177
and+0.176
os+0.174
years+0.162
Neg logit contributions
 big-0.126
 whole-0.106
 huge-0.104
 little-0.104
A-0.100
Tokenoc
Token ID420
Feature activation-1.39e+01

Pos logit contributions
anged+0.111
and+0.105
azes+0.101
owing+0.101
 Su+0.099
Neg logit contributions
 big-0.090
 little-0.079
 whole-0.078
 two-0.076
 huge-0.075
Tokeneros
Token ID27498
Feature activation-4.38e+00

Pos logit contributions
and+0.372
 Jos+0.358
anged+0.356
 Su+0.354
os+0.346
Neg logit contributions
 big-0.275
 little-0.246
 whole-0.244
 two-0.244
 top-0.231
Token for
Token ID329
Feature activation-2.01e+00

Pos logit contributions
anged+0.121
and+0.112
owing+0.110
azes+0.109
os+0.109
Neg logit contributions
 big-0.099
 little-0.087
 whole-0.084
 two-0.083
 huge-0.081
Token his
Token ID465
Feature activation-2.31e+00

Pos logit contributions
anged+0.054
owing+0.050
os+0.050
and+0.049
azes+0.046
Neg logit contributions
 big-0.050
 whole-0.043
 little-0.043
 two-0.043
 huge-0.042
Token wonderful
Token ID7932
Feature activation-7.46e-01

Pos logit contributions
anged+0.068
owing+0.066
and+0.066
os+0.063
 Acc+0.061
Neg logit contributions
 big-0.041
A-0.036
 whole-0.035
 huge-0.033
 little-0.033
Token surprise
Token ID5975
Feature activation-2.92e-01

Pos logit contributions
anged+0.020
owing+0.019
and+0.019
os+0.018
 Su+0.018
Neg logit contributions
 big-0.015
A-0.014
 whole-0.013
 huge-0.013
 little-0.013
Token me
Token ID502
Feature activation-4.48e+00

Pos logit contributions
anged+0.076
and+0.073
owing+0.073
os+0.072
 Su+0.069
Neg logit contributions
 big-0.044
 whole-0.036
 little-0.036
 huge-0.035
A-0.034
Token see
Token ID766
Feature activation-2.80e+00

Pos logit contributions
owing+0.116
anged+0.114
and+0.109
os+0.109
 Su+0.102
Neg logit contributions
 big-0.095
 whole-0.086
A-0.084
 huge-0.084
 little-0.081
Token the
Token ID262
Feature activation-1.51e+00

Pos logit contributions
anged+0.084
owing+0.079
and+0.079
os+0.077
azes+0.074
Neg logit contributions
 big-0.057
 little-0.048
 whole-0.047
 huge-0.046
 two-0.045
Token rh
Token ID9529
Feature activation-5.79e+00

Pos logit contributions
anged+0.038
owing+0.037
and+0.036
os+0.035
 Su+0.035
Neg logit contributions
 big-0.029
A-0.027
 whole-0.025
 huge-0.025
 little-0.024
Tokenin
Token ID259
Feature activation-2.96e+00

Pos logit contributions
owing+0.152
and+0.152
anged+0.152
os+0.151
years+0.136
Neg logit contributions
 big-0.126
 huge-0.108
 whole-0.107
 little-0.106
 cool-0.104
Tokenoc
Token ID420
Feature activation-1.39e+01

Pos logit contributions
anged+0.060
owing+0.058
os+0.058
and+0.057
 Su+0.051
Neg logit contributions
 big-0.083
 whole-0.074
 huge-0.073
 little-0.072
A-0.072
Tokeneros
Token ID27498
Feature activation-3.37e+00

Pos logit contributions
and+0.353
anged+0.340
 Su+0.335
 Jos+0.333
 Jas+0.325
Neg logit contributions
 big-0.312
 whole-0.276
 little-0.272
 two-0.267
 top-0.263
Token.
Token ID13
Feature activation-1.73e+00

Pos logit contributions
owing+0.099
and+0.098
os+0.097
anged+0.097
 Su+0.088
Neg logit contributions
 big-0.061
A-0.052
 huge-0.051
 whole-0.051
 little-0.049
Token It
Token ID632
Feature activation+3.48e+00

Pos logit contributions
anged+0.047
owing+0.044
and+0.044
os+0.043
 Su+0.041
Neg logit contributions
 big-0.039
 whole-0.033
 little-0.033
 huge-0.033
A-0.032
Token is
Token ID318
Feature activation+1.87e+00

Pos logit contributions
 big+0.063
A+0.061
 whole+0.057
 huge+0.053
 little+0.052
Neg logit contributions
os-0.093
owing-0.086
anged-0.082
 Acc-0.081
beh-0.081
Token still
Token ID991
Feature activation+5.26e-01

Pos logit contributions
 big+0.026
 whole+0.023
A+0.023
 little+0.022
 huge+0.020
Neg logit contributions
 Su-0.055
anged-0.055
owing-0.055
and-0.055
 Kim-0.054
Token store
Token ID3650
Feature activation-1.04e+00

Pos logit contributions
and+0.009
anged+0.009
 Acc+0.009
owing+0.008
os+0.008
Neg logit contributions
 big-0.006
A-0.006
 whole-0.005
 sharp-0.005
 little-0.005
Token to
Token ID284
Feature activation-1.78e+00

Pos logit contributions
anged+0.034
owing+0.032
and+0.032
os+0.031
 Acc+0.031
Neg logit contributions
 big-0.019
 whole-0.015
 little-0.015
 huge-0.014
 two-0.014
Token get
Token ID651
Feature activation-1.86e-01

Pos logit contributions
anged+0.037
 Su+0.036
and+0.036
owing+0.035
 Jos+0.035
Neg logit contributions
 big-0.044
 whole-0.041
A-0.040
 little-0.039
 huge-0.039
Token more
Token ID517
Feature activation-1.28e+00

Pos logit contributions
anged+0.006
owing+0.006
and+0.006
os+0.006
 Su+0.005
Neg logit contributions
 big-0.003
 whole-0.003
 little-0.003
 huge-0.002
A-0.002
Token av
Token ID1196
Feature activation-6.03e+00

Pos logit contributions
anged+0.037
and+0.036
owing+0.035
os+0.034
 Su+0.033
Neg logit contributions
 big-0.023
A-0.021
 whole-0.020
 huge-0.020
 little-0.019
Tokenoc
Token ID420
Feature activation-1.39e+01

Pos logit contributions
anged+0.146
and+0.137
owing+0.137
os+0.134
 Su+0.131
Neg logit contributions
 big-0.152
 whole-0.134
 little-0.130
 huge-0.129
A-0.126
Tokenados
Token ID22484
Feature activation-5.92e-01

Pos logit contributions
and+0.221
 Su+0.213
 Jos+0.212
anged+0.210
os+0.208
Neg logit contributions
 big-0.426
 two-0.397
 little-0.388
 whole-0.386
 sharp-0.384
Token.
Token ID13
Feature activation-1.38e+00

Pos logit contributions
anged+0.017
and+0.017
owing+0.017
os+0.016
 Su+0.016
Neg logit contributions
 big-0.011
 whole-0.010
 little-0.010
A-0.010
 huge-0.009
Token Gary
Token ID10936
Feature activation-7.86e-01

Pos logit contributions
anged+0.032
and+0.030
os+0.030
owing+0.030
azes+0.028
Neg logit contributions
 big-0.036
 whole-0.032
 little-0.031
 huge-0.031
A-0.030
Token looked
Token ID3114
Feature activation+3.12e+00

Pos logit contributions
anged+0.025
os+0.024
and+0.024
owing+0.023
 Su+0.023
Neg logit contributions
 big-0.013
 whole-0.011
A-0.011
 little-0.010
 huge-0.010
Token at
Token ID379
Feature activation-2.82e+00

Pos logit contributions
 big+0.075
 whole+0.067
A+0.066
 little+0.065
 huge+0.065
Neg logit contributions
anged-0.074
and-0.071
owing-0.071
os-0.069
 Su-0.066
Token to
Token ID284
Feature activation-2.44e+00

Pos logit contributions
 big+0.035
 whole+0.030
 little+0.029
 huge+0.029
A+0.028
Neg logit contributions
anged-0.048
and-0.047
owing-0.046
os-0.046
 Su-0.043
Token help
Token ID1037
Feature activation-3.56e+00

Pos logit contributions
owing+0.055
anged+0.055
and+0.052
os+0.051
 Su+0.051
Neg logit contributions
 big-0.053
 huge-0.048
 whole-0.048
A-0.048
 heavy-0.046
Token the
Token ID262
Feature activation-2.67e+00

Pos logit contributions
anged+0.124
years+0.117
owing+0.116
os+0.116
atted+0.116
Neg logit contributions
 big-0.061
 little-0.047
 cool-0.044
 huge-0.042
 whole-0.041
Token rh
Token ID9529
Feature activation-6.23e+00

Pos logit contributions
anged+0.074
and+0.072
owing+0.071
os+0.069
 Su+0.067
Neg logit contributions
 big-0.051
 whole-0.044
A-0.043
 huge-0.042
 little-0.042
Tokenin
Token ID259
Feature activation-4.15e+00

Pos logit contributions
anged+0.175
and+0.172
owing+0.170
os+0.169
years+0.155
Neg logit contributions
 big-0.129
 whole-0.110
 huge-0.108
 little-0.107
 cool-0.103
Tokenoc
Token ID420
Feature activation-1.39e+01

Pos logit contributions
anged+0.108
and+0.102
owing+0.100
 Su+0.097
 Acc+0.097
Neg logit contributions
 big-0.099
 whole-0.087
 little-0.086
 huge-0.083
 two-0.082
Tokeneros
Token ID27498
Feature activation-4.34e+00

Pos logit contributions
and+0.361
 Jos+0.348
 Su+0.348
enna+0.344
anged+0.343
Neg logit contributions
 big-0.291
 top-0.255
 whole-0.255
 two-0.255
 little-0.252
Token.
Token ID13
Feature activation-1.31e+00

Pos logit contributions
anged+0.141
owing+0.136
and+0.136
os+0.133
 Su+0.128
Neg logit contributions
 big-0.072
 whole-0.059
 little-0.057
 huge-0.057
A-0.055
Token The
Token ID383
Feature activation+1.34e-01

Pos logit contributions
owing+0.047
anged+0.046
years+0.044
reating+0.044
 supervision+0.043
Neg logit contributions
 big-0.020
 little-0.017
 two-0.015
A-0.015
 cool-0.014
Token zo
Token ID40565
Feature activation-4.44e+00

Pos logit contributions
 big+0.003
 whole+0.002
 little+0.002
 huge+0.002
A+0.002
Neg logit contributions
anged-0.004
owing-0.004
and-0.004
os-0.004
 Su-0.004
Tokenoke
Token ID2088
Feature activation-6.24e+00

Pos logit contributions
anged+0.053
and+0.050
owing+0.047
 Su+0.047
 Acc+0.044
Neg logit contributions
 big-0.161
 little-0.148
 two-0.144
 huge-0.143
 whole-0.143
Token had
Token ID550
Feature activation+2.26e+00

Pos logit contributions
 big+0.026
 huge+0.023
 little+0.022
 large+0.022
 whole+0.022
Neg logit contributions
anged-0.030
os-0.030
owing-0.029
and-0.029
 Su-0.027
Token never
Token ID1239
Feature activation-3.02e+00

Pos logit contributions
 big+0.045
 whole+0.039
 huge+0.038
 little+0.038
A+0.037
Neg logit contributions
anged-0.064
owing-0.060
and-0.060
os-0.059
 Su-0.058
Token seen
Token ID1775
Feature activation-1.22e+00

Pos logit contributions
anged+0.083
owing+0.078
and+0.078
os+0.076
 Su+0.073
Neg logit contributions
 big-0.066
 whole-0.056
 huge-0.055
 little-0.055
A-0.053
Token the
Token ID262
Feature activation-1.34e+00

Pos logit contributions
anged+0.034
and+0.032
owing+0.032
os+0.032
 Su+0.031
Neg logit contributions
 big-0.024
 whole-0.022
 huge-0.020
A-0.020
 little-0.020
Token av
Token ID1196
Feature activation-6.05e+00

Pos logit contributions
owing+0.032
 Him+0.032
 supervision+0.031
anged+0.031
reating+0.030
Neg logit contributions
A-0.023
 big-0.023
 whole-0.020
 little-0.020
 huge-0.019
Tokenoc
Token ID420
Feature activation-1.38e+01

Pos logit contributions
anged+0.132
owing+0.126
and+0.124
os+0.121
 Su+0.111
Neg logit contributions
 big-0.164
 whole-0.145
 little-0.144
 huge-0.142
A-0.141
Tokenados
Token ID22484
Feature activation-1.19e+00

Pos logit contributions
and+0.234
anged+0.225
os+0.210
 Jos+0.209
 Su+0.208
Neg logit contributions
 big-0.436
 whole-0.400
 little-0.396
 two-0.393
 huge-0.385
Token.
Token ID13
Feature activation-1.91e+00

Pos logit contributions
owing+0.028
and+0.027
anged+0.026
os+0.026
 supervision+0.024
Neg logit contributions
 big-0.028
A-0.026
 huge-0.025
 whole-0.024
 little-0.024
Token They
Token ID1119
Feature activation+2.81e+00

Pos logit contributions
anged+0.051
owing+0.049
and+0.048
os+0.048
years+0.045
Neg logit contributions
 big-0.044
 little-0.038
 whole-0.038
A-0.037
 huge-0.037
Token wished
Token ID16555
Feature activation-4.23e-01

Pos logit contributions
 big+0.068
 whole+0.066
 huge+0.060
 entire+0.059
 little+0.058
Neg logit contributions
anged-0.064
years-0.057
 Su-0.055
os-0.055
owing-0.054
Token they
Token ID484
Feature activation+1.11e+00

Pos logit contributions
anged+0.014
owing+0.014
and+0.014
ram+0.013
beh+0.013
Neg logit contributions
 big-0.008
 two-0.006
 little-0.006
 a-0.006
 huge-0.005
Token you
Token ID345
Feature activation+6.91e-01

Pos logit contributions
 big+0.020
 little+0.017
 whole+0.017
 huge+0.017
 two+0.017
Neg logit contributions
anged-0.028
and-0.027
owing-0.027
os-0.026
 Su-0.025
Token like
Token ID588
Feature activation-5.45e-02

Pos logit contributions
 big+0.016
 whole+0.014
 huge+0.014
A+0.013
 top+0.013
Neg logit contributions
os-0.017
owing-0.017
anged-0.017
and-0.016
 Su-0.015
Token the
Token ID262
Feature activation-1.44e+00

Pos logit contributions
anged+0.002
owing+0.002
and+0.002
os+0.002
 Su+0.001
Neg logit contributions
 big-0.001
 whole-0.001
 little-0.001
 huge-0.001
A-0.001
Token rh
Token ID9529
Feature activation-5.73e+00

Pos logit contributions
anged+0.036
owing+0.036
and+0.035
os+0.034
 Su+0.034
Neg logit contributions
 big-0.028
A-0.026
 whole-0.025
 huge-0.024
 little-0.024
Tokenin
Token ID259
Feature activation-3.10e+00

Pos logit contributions
owing+0.150
anged+0.149
os+0.149
and+0.148
years+0.133
Neg logit contributions
 big-0.127
 huge-0.109
 whole-0.109
 little-0.107
 cool-0.105
Tokenoc
Token ID420
Feature activation-1.38e+01

Pos logit contributions
anged+0.072
and+0.068
owing+0.068
os+0.066
 Su+0.063
Neg logit contributions
 big-0.080
 whole-0.071
 little-0.070
 huge-0.069
A-0.068
Tokeneros
Token ID27498
Feature activation-3.59e+00

Pos logit contributions
and+0.373
 Jos+0.351
anged+0.348
 Su+0.347
 Jas+0.340
Neg logit contributions
 big-0.285
 whole-0.252
 two-0.249
 little-0.249
 top-0.243
Token,
Token ID11
Feature activation-5.37e+00

Pos logit contributions
anged+0.109
owing+0.108
and+0.107
os+0.106
 Su+0.099
Neg logit contributions
 big-0.063
 whole-0.053
 huge-0.051
A-0.051
 little-0.050
Token but
Token ID475
Feature activation-4.91e+00

Pos logit contributions
anged+0.174
owing+0.159
os+0.157
and+0.154
azes+0.150
Neg logit contributions
 big-0.109
 little-0.094
 huge-0.086
 two-0.086
 whole-0.084
Token it
Token ID340
Feature activation+2.56e+00

Pos logit contributions
anged+0.143
and+0.137
owing+0.137
os+0.135
 Su+0.129
Neg logit contributions
 big-0.098
 whole-0.083
 little-0.081
 huge-0.080
A-0.078
Token is
Token ID318
Feature activation+1.63e+00

Pos logit contributions
 big+0.048
 whole+0.044
A+0.042
 huge+0.039
 little+0.039
Neg logit contributions
os-0.074
owing-0.072
anged-0.071
and-0.067
 Acc-0.066
Token Sam
Token ID3409
Feature activation-1.38e+00

Pos logit contributions
and+0.106
owing+0.103
anged+0.101
 Acc+0.095
years+0.093
Neg logit contributions
 big-0.097
 two-0.087
 little-0.086
 a-0.079
 huge-0.078
Token hugged
Token ID41130
Feature activation-4.16e+00

Pos logit contributions
anged+0.038
owing+0.036
and+0.036
os+0.036
 Su+0.034
Neg logit contributions
 big-0.030
 whole-0.025
 little-0.025
 huge-0.025
A-0.024
Token the
Token ID262
Feature activation-3.17e+00

Pos logit contributions
anged+0.134
and+0.130
owing+0.129
azes+0.127
os+0.126
Neg logit contributions
 big-0.077
 little-0.065
 two-0.062
 whole-0.061
 huge-0.058
Token rh
Token ID9529
Feature activation-6.00e+00

Pos logit contributions
anged+0.075
owing+0.070
and+0.068
ric+0.067
os+0.066
Neg logit contributions
 big-0.064
A-0.059
 whole-0.056
 huge-0.055
 sharp-0.053
Tokenin
Token ID259
Feature activation-3.69e+00

Pos logit contributions
anged+0.160
owing+0.157
and+0.156
os+0.155
years+0.142
Neg logit contributions
 big-0.131
 whole-0.112
 huge-0.111
 little-0.109
A-0.107
Tokenoc
Token ID420
Feature activation-1.38e+01

Pos logit contributions
anged+0.093
and+0.088
owing+0.087
os+0.085
 Su+0.083
Neg logit contributions
 big-0.091
 whole-0.080
 little-0.079
 huge-0.077
 two-0.076
Tokeneros
Token ID27498
Feature activation-3.87e+00

Pos logit contributions
and+0.362
 Jos+0.344
anged+0.343
 Su+0.336
os+0.336
Neg logit contributions
 big-0.295
 little-0.260
 whole-0.255
 two-0.251
 sharp-0.248
Token goodbye
Token ID24829
Feature activation-2.83e+00

Pos logit contributions
anged+0.114
and+0.108
owing+0.107
os+0.106
 Su+0.102
Neg logit contributions
 big-0.078
 whole-0.066
 little-0.065
 huge-0.063
 two-0.062
Token,
Token ID11
Feature activation-4.55e+00

Pos logit contributions
anged+0.083
owing+0.081
and+0.080
os+0.078
 Su+0.075
Neg logit contributions
 big-0.055
 whole-0.046
 huge-0.045
 little-0.045
A-0.044
Token then
Token ID788
Feature activation-1.14e+00

Pos logit contributions
anged+0.135
owing+0.127
os+0.127
and+0.126
azes+0.124
Neg logit contributions
 big-0.099
 little-0.084
 whole-0.081
 huge-0.081
 two-0.077
Token ran
Token ID4966
Feature activation-2.07e+00

Pos logit contributions
and+0.033
anged+0.032
owing+0.031
os+0.031
azes+0.030
Neg logit contributions
 big-0.024
 little-0.020
 whole-0.020
 two-0.019
 huge-0.019
Token food
Token ID2057
Feature activation-7.43e-01

Pos logit contributions
anged+0.021
owing+0.020
and+0.020
os+0.020
azes+0.019
Neg logit contributions
 big-0.015
 whole-0.012
 little-0.012
 huge-0.012
 two-0.012
Token for
Token ID329
Feature activation-2.55e+00

Pos logit contributions
anged+0.020
owing+0.019
and+0.019
os+0.019
 Su+0.018
Neg logit contributions
 big-0.016
 whole-0.014
 little-0.014
 huge-0.013
A-0.013
Token the
Token ID262
Feature activation-1.86e+00

Pos logit contributions
anged+0.077
owing+0.072
os+0.072
and+0.071
 Acc+0.068
Neg logit contributions
 big-0.059
 little-0.050
 two-0.047
 whole-0.047
 huge-0.047
Token rh
Token ID9529
Feature activation-5.83e+00

Pos logit contributions
anged+0.054
and+0.052
owing+0.052
os+0.050
 Su+0.049
Neg logit contributions
 big-0.034
 whole-0.029
A-0.028
 huge-0.028
 little-0.027
Tokenin
Token ID259
Feature activation-2.88e+00

Pos logit contributions
anged+0.164
and+0.162
owing+0.160
os+0.157
years+0.146
Neg logit contributions
 big-0.120
 whole-0.102
 huge-0.101
 little-0.099
A-0.096
Tokenoc
Token ID420
Feature activation-1.38e+01

Pos logit contributions
anged+0.076
owing+0.070
and+0.070
azes+0.069
 Acc+0.068
Neg logit contributions
 big-0.069
 whole-0.060
 little-0.059
 huge-0.057
 two-0.056
Tokeneros
Token ID27498
Feature activation-3.13e+00

Pos logit contributions
and+0.340
 Jos+0.336
 Su+0.330
anged+0.329
 Jas+0.323
Neg logit contributions
 big-0.306
 whole-0.276
 little-0.271
 two-0.266
 top-0.260
Token and
Token ID290
Feature activation-5.16e+00

Pos logit contributions
anged+0.103
and+0.099
owing+0.099
os+0.097
 Su+0.094
Neg logit contributions
 big-0.051
 whole-0.042
 little-0.041
 huge-0.040
A-0.038
Token his
Token ID465
Feature activation-1.01e+00

Pos logit contributions
anged+0.149
owing+0.143
and+0.142
os+0.139
 Su+0.133
Neg logit contributions
 big-0.108
 little-0.091
 whole-0.091
 huge-0.089
 two-0.086
Token friends
Token ID2460
Feature activation-1.08e+00

Pos logit contributions
anged+0.029
owing+0.028
and+0.028
os+0.027
 Su+0.026
Neg logit contributions
 big-0.020
 whole-0.017
 little-0.017
A-0.017
 huge-0.017
Token.
Token ID13
Feature activation-9.42e-01

Pos logit contributions
anged+0.034
and+0.033
owing+0.033
os+0.032
 Su+0.030
Neg logit contributions
 big-0.019
 whole-0.015
 huge-0.015
 little-0.015
A-0.015
Token could
Token ID714
Feature activation+1.76e+00

Pos logit contributions
os+0.008
anged+0.008
and+0.008
owing+0.008
rings+0.007
Neg logit contributions
 big-0.007
 huge-0.006
 whole-0.006
A-0.006
 top-0.006
Token help
Token ID1037
Feature activation-2.78e+00

Pos logit contributions
 big+0.041
A+0.040
 sharp+0.039
 heavy+0.039
 huge+0.038
Neg logit contributions
owing-0.035
os-0.033
ric-0.031
anged-0.031
and-0.030
Token the
Token ID262
Feature activation-2.63e+00

Pos logit contributions
anged+0.086
owing+0.082
and+0.081
os+0.080
 Su+0.077
Neg logit contributions
 big-0.053
 whole-0.044
 little-0.044
 huge-0.043
A-0.042
Token rh
Token ID9529
Feature activation-6.25e+00

Pos logit contributions
anged+0.068
and+0.066
owing+0.066
os+0.063
 Su+0.061
Neg logit contributions
 big-0.050
 whole-0.044
A-0.044
 huge-0.043
 little-0.042
Tokenin
Token ID259
Feature activation-4.02e+00

Pos logit contributions
anged+0.172
and+0.170
owing+0.169
os+0.167
years+0.152
Neg logit contributions
 big-0.131
 whole-0.111
 huge-0.110
 little-0.109
 cool-0.105
Tokenoc
Token ID420
Feature activation-1.37e+01

Pos logit contributions
anged+0.101
and+0.096
owing+0.094
os+0.091
 Su+0.090
Neg logit contributions
 big-0.099
 whole-0.087
 little-0.086
 huge-0.084
A-0.082
Tokeneros
Token ID27498
Feature activation-4.31e+00

Pos logit contributions
and+0.368
 Jos+0.361
 Su+0.358
 Jas+0.351
 Jenna+0.350
Neg logit contributions
 big-0.283
 top-0.246
 whole-0.245
 two-0.244
 little-0.242
Token."
Token ID526
Feature activation-2.87e+00

Pos logit contributions
owing+0.126
and+0.124
anged+0.123
os+0.122
 Su+0.112
Neg logit contributions
 big-0.080
A-0.069
 whole-0.067
 huge-0.067
 little-0.064
Token His
Token ID2399
Feature activation-1.36e+00

Pos logit contributions
owing+0.091
years+0.088
anged+0.088
os+0.085
beh+0.085
Neg logit contributions
 big-0.055
 little-0.051
 two-0.044
A-0.043
 whole-0.043
Token dad
Token ID9955
Feature activation-2.51e+00

Pos logit contributions
anged+0.038
and+0.036
owing+0.036
os+0.035
 Su+0.034
Neg logit contributions
 big-0.028
 whole-0.024
A-0.024
 little-0.024
 huge-0.023
Token smiled
Token ID13541
Feature activation-4.24e-01

Pos logit contributions
anged+0.059
owing+0.056
and+0.056
os+0.055
 Su+0.052
Neg logit contributions
 big-0.064
 whole-0.057
 huge-0.055
 little-0.055
A-0.054
Token a
Token ID257
Feature activation+6.85e+00

Pos logit contributions
 big+0.081
+0.073
 huge+0.072
ball+0.072
 lid+0.071
Neg logit contributions
and-0.194
weet-0.163
 Su-0.163
 Kim-0.159
 Him-0.159
Token time
Token ID640
Feature activation+3.28e+00

Pos logit contributions
 big+0.098
 two+0.080
 whole+0.079
 huge+0.078
A+0.078
Neg logit contributions
and-0.219
anged-0.215
owing-0.214
os-0.203
 Su-0.199
Token,
Token ID11
Feature activation-1.30e+00

Pos logit contributions
 big+0.054
 little+0.044
 whole+0.040
 two+0.039
 huge+0.039
Neg logit contributions
anged-0.115
owing-0.108
and-0.106
beh-0.105
os-0.105
Token there
Token ID612
Feature activation+5.42e+00

Pos logit contributions
and+0.036
anged+0.035
owing+0.034
os+0.034
 Su+0.034
Neg logit contributions
 big-0.023
 huge-0.019
 sharp-0.019
 whole-0.019
A-0.018
Token was
Token ID373
Feature activation+5.41e+00

Pos logit contributions
 big+0.073
 whole+0.060
A+0.057
 huge+0.056
 sharp+0.055
Neg logit contributions
anged-0.182
os-0.179
and-0.179
 Su-0.175
owing-0.174
Token a
Token ID257
Feature activation+8.58e+00

Pos logit contributions
 big+0.048
 whole+0.034
 huge+0.032
 little+0.030
 sharp+0.030
Neg logit contributions
and-0.199
anged-0.197
 Su-0.192
owing-0.188
 Jos-0.188
Token farmer
Token ID18739
Feature activation-3.53e-01

Pos logit contributions
 big+0.051
 whole+0.042
A+0.042
 two+0.038
 huge+0.030
Neg logit contributions
anged-0.311
and-0.307
 Su-0.307
owing-0.303
 Acc-0.302
Token named
Token ID3706
Feature activation-4.90e+00

Pos logit contributions
anged+0.010
and+0.009
owing+0.009
azes+0.009
os+0.009
Neg logit contributions
 big-0.008
 little-0.007
 whole-0.006
 huge-0.006
 two-0.006
Token Jack
Token ID3619
Feature activation-7.13e-01

Pos logit contributions
anged+0.105
owing+0.098
and+0.095
os+0.091
azes+0.088
Neg logit contributions
 big-0.148
 little-0.133
 whole-0.131
 huge-0.127
 two-0.127
Token.
Token ID13
Feature activation+1.82e+00

Pos logit contributions
anged+0.024
and+0.023
owing+0.023
os+0.023
 Su+0.022
Neg logit contributions
 big-0.011
 whole-0.009
 little-0.009
 huge-0.008
A-0.008
Token Jack
Token ID3619
Feature activation-1.80e-01

Pos logit contributions
 big+0.049
 whole+0.044
 little+0.043
 huge+0.043
A+0.042
Neg logit contributions
anged-0.042
owing-0.040
and-0.039
os-0.038
 Su-0.035
TokenOnce
Token ID7454
Feature activation+1.11e-01

Pos logit contributions
 big+0.089
 whole+0.075
 little+0.074
 huge+0.073
A+0.070
Neg logit contributions
anged-0.139
and-0.134
owing-0.133
os-0.132
 Su-0.127
Token upon
Token ID2402
Feature activation+6.20e+00

Pos logit contributions
 big+0.003
 little+0.002
 whole+0.002
 huge+0.002
 two+0.002
Neg logit contributions
anged-0.003
owing-0.003
and-0.003
beh-0.003
os-0.003
Token a
Token ID257
Feature activation+7.15e+00

Pos logit contributions
 big+0.074
 huge+0.063
+0.061
ball+0.060
 heavy+0.060
Neg logit contributions
and-0.204
 Su-0.176
owing-0.175
 Kim-0.172
os-0.171
Token time
Token ID640
Feature activation+3.37e+00

Pos logit contributions
 big+0.105
 two+0.089
 huge+0.085
 whole+0.085
A+0.083
Neg logit contributions
and-0.221
anged-0.214
owing-0.214
os-0.204
 Su-0.200
Token,
Token ID11
Feature activation-1.22e+00

Pos logit contributions
 big+0.051
 little+0.041
 two+0.035
 huge+0.035
 whole+0.035
Neg logit contributions
anged-0.124
beh-0.116
owing-0.115
and-0.115
 supervision-0.113
Token there
Token ID612
Feature activation+5.53e+00

Pos logit contributions
anged+0.035
and+0.035
os+0.035
owing+0.034
 Su+0.034
Neg logit contributions
 big-0.021
 whole-0.018
 huge-0.017
 sharp-0.017
A-0.016
Token was
Token ID373
Feature activation+5.43e+00

Pos logit contributions
 big+0.067
 whole+0.052
 little+0.049
 huge+0.049
A+0.048
Neg logit contributions
anged-0.200
and-0.194
os-0.193
owing-0.192
 Su-0.187
Token a
Token ID257
Feature activation+8.67e+00

Pos logit contributions
 big+0.052
 whole+0.039
 huge+0.037
 sharp+0.036
+0.035
Neg logit contributions
and-0.191
 Su-0.187
anged-0.186
 Jos-0.182
years-0.180
Token brave
Token ID14802
Feature activation+4.38e+00

Pos logit contributions
 big+0.056
 whole+0.047
A+0.044
 two+0.042
 huge+0.034
Neg logit contributions
anged-0.315
 Su-0.310
and-0.309
owing-0.308
 Acc-0.307
Token knight
Token ID22062
Feature activation+1.27e+00

Pos logit contributions
 big+0.057
 whole+0.048
A+0.047
 huge+0.043
 two+0.042
Neg logit contributions
anged-0.147
owing-0.144
and-0.141
 Su-0.140
os-0.140
Token called
Token ID1444
Feature activation-2.76e+00

Pos logit contributions
 big+0.033
 whole+0.029
 little+0.029
 huge+0.028
A+0.028
Neg logit contributions
anged-0.029
and-0.028
owing-0.028
os-0.027
 Su-0.026
Token He
Token ID679
Feature activation+2.62e+00

Pos logit contributions
anged+0.044
owing+0.042
and+0.042
os+0.041
 Su+0.039
Neg logit contributions
 big-0.031
 whole-0.026
 little-0.026
 huge-0.025
A-0.025
Token tells
Token ID4952
Feature activation-3.10e+00

Pos logit contributions
 whole+0.078
 big+0.074
A+0.072
 entire+0.071
 little+0.069
Neg logit contributions
os-0.043
anged-0.042
 Acc-0.038
atted-0.038
olds-0.037
Token her
Token ID607
Feature activation-1.39e+00

Pos logit contributions
anged+0.076
owing+0.074
and+0.072
os+0.071
ooting+0.065
Neg logit contributions
 big-0.084
 little-0.075
 two-0.072
 whole-0.068
 huge-0.068
Token it
Token ID340
Feature activation+3.38e+00

Pos logit contributions
anged+0.036
and+0.035
os+0.034
owing+0.034
 Su+0.032
Neg logit contributions
 big-0.031
 whole-0.028
 huge-0.027
 little-0.026
A-0.026
Token has
Token ID468
Feature activation+4.14e+00

Pos logit contributions
 big+0.051
 whole+0.045
 little+0.042
 huge+0.041
A+0.041
Neg logit contributions
os-0.109
anged-0.109
and-0.107
owing-0.106
 Acc-0.100
Token a
Token ID257
Feature activation+5.29e+00

Pos logit contributions
 big+0.057
 whole+0.050
 huge+0.047
 little+0.046
A+0.046
Neg logit contributions
anged-0.132
and-0.129
os-0.129
owing-0.126
 Su-0.123
Token happy
Token ID3772
Feature activation+2.78e+00

Pos logit contributions
A+0.080
 big+0.073
 two+0.072
 whole+0.070
 entire+0.069
Neg logit contributions
 Acc-0.135
anged-0.134
 Him-0.132
and-0.132
owing-0.129
Token ending
Token ID7464
Feature activation+2.03e+00

Pos logit contributions
 big+0.060
A+0.052
 whole+0.052
 huge+0.051
 two+0.050
Neg logit contributions
anged-0.072
owing-0.069
and-0.069
os-0.066
 Acc-0.064
Token.
Token ID13
Feature activation-8.77e-01

Pos logit contributions
 big+0.033
 whole+0.026
 little+0.026
 huge+0.025
 two+0.024
Neg logit contributions
anged-0.068
owing-0.064
and-0.063
os-0.062
 Su-0.062
Token<|endoftext|>
Token ID50256
Feature activation+5.40e+00

Pos logit contributions
anged+0.026
owing+0.025
and+0.024
os+0.024
 Su+0.023
Neg logit contributions
 big-0.018
 whole-0.015
 little-0.015
 huge-0.015
A-0.014
TokenBen
Token ID11696
Feature activation-1.52e+00

Pos logit contributions
 big+0.110
 little+0.095
 huge+0.094
 whole+0.094
 sharp+0.093
Neg logit contributions
and-0.146
anged-0.146
os-0.145
owing-0.141
azes-0.133
Token upon
Token ID2402
Feature activation+6.42e+00

Pos logit contributions
 big+0.009
 little+0.008
 whole+0.008
 huge+0.008
 two+0.007
Neg logit contributions
anged-0.010
owing-0.010
and-0.010
os-0.009
beh-0.009
Token a
Token ID257
Feature activation+6.94e+00

Pos logit contributions
 big+0.080
+0.074
ball+0.074
 lid+0.073
 huge+0.073
Neg logit contributions
and-0.197
 Kim-0.166
weet-0.166
 Su-0.165
azes-0.162
Token time
Token ID640
Feature activation+3.49e+00

Pos logit contributions
 big+0.099
 two+0.084
 whole+0.082
 huge+0.082
A+0.080
Neg logit contributions
and-0.216
anged-0.211
owing-0.209
os-0.200
 Su-0.193
Token,
Token ID11
Feature activation-1.27e+00

Pos logit contributions
 big+0.059
 little+0.048
 whole+0.045
 huge+0.043
 two+0.043
Neg logit contributions
anged-0.119
owing-0.112
and-0.111
os-0.109
years-0.108
Token there
Token ID612
Feature activation+5.77e+00

Pos logit contributions
and+0.035
anged+0.035
owing+0.034
os+0.034
 Su+0.033
Neg logit contributions
 big-0.022
 huge-0.019
 sharp-0.018
 whole-0.018
A-0.018
Token was
Token ID373
Feature activation+5.70e+00

Pos logit contributions
 big+0.078
 whole+0.064
 huge+0.060
A+0.060
 sharp+0.059
Neg logit contributions
anged-0.194
os-0.190
and-0.190
owing-0.185
 Su-0.185
Token a
Token ID257
Feature activation+8.82e+00

Pos logit contributions
 big+0.049
 whole+0.034
 huge+0.032
 little+0.031
 sharp+0.030
Neg logit contributions
and-0.213
anged-0.211
 Su-0.205
os-0.202
owing-0.201
Token farmer
Token ID18739
Feature activation-1.07e-01

Pos logit contributions
 big+0.056
A+0.049
 whole+0.049
 two+0.047
 huge+0.036
Neg logit contributions
anged-0.309
 Su-0.305
and-0.304
 Acc-0.302
owing-0.301
Token who
Token ID508
Feature activation+1.21e+00

Pos logit contributions
anged+0.003
and+0.003
owing+0.003
os+0.003
azes+0.003
Neg logit contributions
 big-0.002
 little-0.002
 whole-0.002
 huge-0.002
 two-0.002
Token had
Token ID550
Feature activation+6.82e+00

Pos logit contributions
 big+0.023
 two+0.020
 huge+0.020
 little+0.020
 top+0.020
Neg logit contributions
anged-0.033
os-0.032
owing-0.031
and-0.031
 Acc-0.029
Token a
Token ID257
Feature activation+8.27e+00

Pos logit contributions
 big+0.059
+0.054
 whole+0.048
 little+0.047
ball+0.047
Neg logit contributions
and-0.229
 Su-0.227
anged-0.223
os-0.219
 Jos-0.218
Token was
Token ID373
Feature activation+4.60e-01

Pos logit contributions
anged+0.034
owing+0.032
and+0.032
os+0.032
 Su+0.030
Neg logit contributions
 big-0.013
 whole-0.011
 huge-0.010
 little-0.010
A-0.010
Token a
Token ID257
Feature activation+4.81e+00

Pos logit contributions
 big+0.007
 whole+0.006
 little+0.006
A+0.006
+0.006
Neg logit contributions
anged-0.014
owing-0.013
and-0.013
 Su-0.013
os-0.013
Token bad
Token ID2089
Feature activation+6.19e-01

Pos logit contributions
A+0.046
 two+0.045
 big+0.042
 whole+0.037
+0.037
Neg logit contributions
and-0.150
anged-0.149
 Acc-0.147
owing-0.143
 Su-0.143
Token idea
Token ID2126
Feature activation+4.34e+00

Pos logit contributions
 big+0.014
A+0.012
 whole+0.012
 huge+0.012
 two+0.012
Neg logit contributions
anged-0.016
owing-0.015
and-0.015
 Su-0.014
os-0.014
Token.
Token ID13
Feature activation-4.29e-01

Pos logit contributions
 big+0.062
 whole+0.048
 little+0.048
 huge+0.045
 two+0.044
Neg logit contributions
anged-0.153
owing-0.146
and-0.145
os-0.144
years-0.139
Token<|endoftext|>
Token ID50256
Feature activation+5.97e+00

Pos logit contributions
anged+0.013
owing+0.013
os+0.012
and+0.012
ules+0.012
Neg logit contributions
 big-0.008
A-0.007
 little-0.007
 two-0.007
 whole-0.007
TokenL
Token ID43
Feature activation-4.31e+00

Pos logit contributions
 big+0.123
 little+0.108
 huge+0.108
 sharp+0.108
 whole+0.106
Neg logit contributions
and-0.157
os-0.154
anged-0.152
owing-0.147
 Su-0.140
Tokenily
Token ID813
Feature activation+1.91e-01

Pos logit contributions
anged+0.108
owing+0.104
os+0.099
and+0.098
 Acc+0.096
Neg logit contributions
 big-0.110
 little-0.096
 whole-0.094
 two-0.094
A-0.091
Token and
Token ID290
Feature activation-1.94e+00

Pos logit contributions
 big+0.004
 whole+0.003
 huge+0.003
A+0.003
 little+0.003
Neg logit contributions
anged-0.006
os-0.005
owing-0.005
and-0.005
 Su-0.005
Token Ben
Token ID3932
Feature activation-3.46e-01

Pos logit contributions
anged+0.045
and+0.043
owing+0.043
os+0.042
 Su+0.039
Neg logit contributions
 big-0.051
 whole-0.045
 little-0.044
 huge-0.044
A-0.043
Token were
Token ID547
Feature activation+5.22e+00

Pos logit contributions
anged+0.009
os+0.008
owing+0.008
 Su+0.008
and+0.008
Neg logit contributions
 big-0.008
 whole-0.007
 huge-0.007
A-0.007
 little-0.006
Tokenily
Token ID813
Feature activation-5.98e-01

Pos logit contributions
anged+0.116
and+0.116
owing+0.111
os+0.107
 Su+0.102
Neg logit contributions
 big-0.130
 whole-0.114
 huge-0.113
 little-0.110
A-0.108
Token felt
Token ID2936
Feature activation+3.56e+00

Pos logit contributions
anged+0.016
and+0.015
os+0.015
owing+0.014
atted+0.014
Neg logit contributions
 big-0.013
 whole-0.011
 huge-0.011
A-0.011
 sharp-0.011
Token better
Token ID1365
Feature activation-7.69e-01

Pos logit contributions
 big+0.070
 whole+0.068
ball+0.065
A+0.064
 two+0.063
Neg logit contributions
and-0.085
 Acc-0.079
anged-0.077
owing-0.076
 Su-0.076
Token and
Token ID290
Feature activation-3.57e+00

Pos logit contributions
anged+0.022
owing+0.022
and+0.022
os+0.021
 Su+0.020
Neg logit contributions
 big-0.015
 whole-0.013
 huge-0.013
 little-0.013
A-0.012
Token went
Token ID1816
Feature activation-9.51e-01

Pos logit contributions
anged+0.101
and+0.096
owing+0.096
os+0.094
 Su+0.091
Neg logit contributions
 big-0.075
 whole-0.064
 little-0.062
 huge-0.062
A-0.061
Token to
Token ID284
Feature activation-1.66e+00

Pos logit contributions
anged+0.026
owing+0.025
and+0.025
os+0.024
 Su+0.023
Neg logit contributions
 big-0.021
 whole-0.018
 little-0.018
 huge-0.018
A-0.017
Token the
Token ID262
Feature activation-6.98e-03

Pos logit contributions
anged+0.053
owing+0.051
and+0.050
os+0.049
ram+0.048
Neg logit contributions
 big-0.035
 little-0.029
 whole-0.028
 two-0.028
 huge-0.027
Token party
Token ID2151
Feature activation+7.63e-01

Pos logit contributions
and+0.000
reating+0.000
 Jos+0.000
 discovering+0.000
 Acc+0.000
Neg logit contributions
 sharp-0.000
 big-0.000
-0.000
A-0.000
 heavy-0.000
Token with
Token ID351
Feature activation-1.36e+00

Pos logit contributions
 big+0.014
 whole+0.011
 little+0.011
 huge+0.011
 two+0.010
Neg logit contributions
anged-0.024
owing-0.023
and-0.023
os-0.023
 Su-0.022
Token her
Token ID607
Feature activation-1.32e+00

Pos logit contributions
anged+0.023
and+0.021
os+0.021
owing+0.021
 Su+0.020
Neg logit contributions
 big-0.041
 whole-0.038
 little-0.037
A-0.037
 huge-0.037
Token mom
Token ID1995
Feature activation-4.31e+00

Pos logit contributions
anged+0.029
os+0.028
and+0.027
owing+0.027
 Su+0.027
Neg logit contributions
 big-0.028
A-0.028
 whole-0.027
 sharp-0.026
 huge-0.025
Token said
Token ID531
Feature activation-3.55e+00

Pos logit contributions
anged+0.067
owing+0.064
and+0.064
os+0.063
 Su+0.060
Neg logit contributions
 big-0.044
 whole-0.039
 huge-0.037
 little-0.036
A-0.035
Token.
Token ID13
Feature activation-2.92e+00

Pos logit contributions
anged+0.143
ric+0.138
os+0.138
 Su+0.137
years+0.136
Neg logit contributions
 big-0.035
 little-0.025
 two-0.022
 a-0.018
 whole-0.017
Token
Token ID198
Feature activation+9.89e-01

Pos logit contributions
anged+0.109
owing+0.107
os+0.101
beh+0.101
and+0.099
Neg logit contributions
 big-0.044
 little-0.034
A-0.033
 two-0.033
 whole-0.030
Token
Token ID198
Feature activation+1.47e+00

Pos logit contributions
 big+0.016
 little+0.012
A+0.012
 whole+0.011
 two+0.011
Neg logit contributions
anged-0.036
owing-0.035
os-0.033
years-0.033
beh-0.032
Token"
Token ID1
Feature activation-3.80e-01

Pos logit contributions
 big+0.024
 whole+0.020
 little+0.019
 huge+0.019
A+0.018
Neg logit contributions
anged-0.049
owing-0.047
and-0.047
os-0.046
 Su-0.044
TokenOK
Token ID11380
Feature activation-1.95e+00

Pos logit contributions
and+0.009
anged+0.008
 Acc+0.008
years+0.008
 Su+0.008
Neg logit contributions
 big-0.009
 huge-0.008
 whole-0.008
 little-0.007
 cool-0.007
Token,
Token ID11
Feature activation-4.68e+00

Pos logit contributions
anged+0.050
owing+0.048
and+0.047
 Su+0.047
os+0.046
Neg logit contributions
 big-0.042
 whole-0.038
A-0.037
 huge-0.037
 sharp-0.035
Token let
Token ID1309
Feature activation-3.15e+00

Pos logit contributions
anged+0.129
and+0.124
owing+0.123
os+0.120
 Su+0.117
Neg logit contributions
 big-0.100
 whole-0.087
 huge-0.084
 little-0.083
A-0.082
Token's
Token ID338
Feature activation-3.41e+00

Pos logit contributions
anged+0.093
and+0.089
owing+0.089
os+0.088
 Su+0.084
Neg logit contributions
 big-0.062
 whole-0.052
 little-0.051
 huge-0.051
A-0.049
Token sleep
Token ID3993
Feature activation-7.22e-01

Pos logit contributions
owing+0.076
anged+0.076
os+0.073
and+0.072
 Su+0.072
Neg logit contributions
 big-0.075
A-0.071
 huge-0.069
 whole-0.069
 sharp-0.066
Token in
Token ID287
Feature activation-1.99e+00

Pos logit contributions
and+0.018
owing+0.018
anged+0.017
 Him+0.017
os+0.017
Neg logit contributions
 big-0.015
 huge-0.014
A-0.013
 whole-0.013
 sharp-0.013
Token
Token ID198
Feature activation+5.63e-01

Pos logit contributions
anged+0.098
owing+0.097
and+0.093
os+0.092
years+0.087
Neg logit contributions
 big-0.076
 little-0.063
A-0.062
 two-0.061
 huge-0.060
Token
Token ID198
Feature activation+1.25e+00

Pos logit contributions
 big+0.009
 little+0.007
A+0.007
 whole+0.007
 huge+0.007
Neg logit contributions
anged-0.020
owing-0.019
os-0.018
and-0.018
years-0.018
TokenTom
Token ID13787
Feature activation-2.94e+00

Pos logit contributions
 big+0.023
A+0.019
 whole+0.019
 little+0.019
 huge+0.018
Neg logit contributions
anged-0.040
owing-0.038
and-0.037
os-0.037
 Jos-0.036
Token and
Token ID290
Feature activation-3.68e+00

Pos logit contributions
anged+0.086
and+0.083
owing+0.082
os+0.081
 Su+0.077
Neg logit contributions
 big-0.058
 whole-0.050
 huge-0.048
 little-0.048
A-0.047
Token Lily
Token ID20037
Feature activation-1.06e+00

Pos logit contributions
anged+0.089
owing+0.085
and+0.084
os+0.082
 Su+0.077
Neg logit contributions
 big-0.094
 whole-0.082
 little-0.082
 huge-0.080
A-0.079
Token nodded
Token ID14464
Feature activation-1.17e+00

Pos logit contributions
anged+0.028
owing+0.027
and+0.026
os+0.026
 Su+0.025
Neg logit contributions
 big-0.024
 whole-0.021
 huge-0.020
A-0.020
 little-0.019
Token.
Token ID13
Feature activation-1.88e+00

Pos logit contributions
anged+0.042
and+0.041
os+0.041
owing+0.041
azes+0.039
Neg logit contributions
 big-0.016
 little-0.013
 whole-0.012
 huge-0.011
 two-0.011
Token They
Token ID1119
Feature activation+3.56e+00

Pos logit contributions
anged+0.049
owing+0.047
and+0.046
os+0.045
 Su+0.043
Neg logit contributions
 big-0.045
 whole-0.039
 little-0.039
 huge-0.038
A-0.037
Token still
Token ID991
Feature activation-3.52e-02

Pos logit contributions
 whole+0.082
 big+0.081
 entire+0.076
 huge+0.073
A+0.073
Neg logit contributions
anged-0.079
 Su-0.074
 J-0.073
os-0.071
 Jos-0.069
Token liked
Token ID8288
Feature activation+2.05e+00

Pos logit contributions
anged+0.001
owing+0.001
os+0.001
and+0.001
 Su+0.001
Neg logit contributions
 big-0.001
 whole-0.001
 huge-0.001
 little-0.001
A-0.000
Token the
Token ID262
Feature activation-1.79e+00

Pos logit contributions
 big+0.042
 little+0.034
 sharp+0.031
 huge+0.031
 two+0.031
Neg logit contributions
anged-0.066
owing-0.066
os-0.062
and-0.062
ram-0.061
Token She
Token ID1375
Feature activation+3.75e+00

Pos logit contributions
 big+0.001
 little+0.000
A+0.000
 two+0.000
 whole+0.000
Neg logit contributions
anged-0.001
owing-0.001
beh-0.001
os-0.001
and-0.001
Token opened
Token ID4721
Feature activation+5.38e-01

Pos logit contributions
 big+0.076
 whole+0.074
A+0.069
 huge+0.065
 little+0.064
Neg logit contributions
anged-0.100
os-0.097
atted-0.090
owing-0.089
 Su-0.089
Token her
Token ID607
Feature activation-1.86e+00

Pos logit contributions
 big+0.014
 whole+0.013
 little+0.012
A+0.012
 huge+0.012
Neg logit contributions
anged-0.011
os-0.011
and-0.011
 Su-0.011
owing-0.010
Token mouth
Token ID5422
Feature activation+9.12e-01

Pos logit contributions
anged+0.047
owing+0.043
os+0.042
atted+0.042
and+0.041
Neg logit contributions
 big-0.038
 whole-0.037
A-0.036
 huge-0.033
 little-0.033
Token with
Token ID351
Feature activation-1.79e+00

Pos logit contributions
 big+0.018
 whole+0.016
 little+0.016
 huge+0.016
A+0.015
Neg logit contributions
anged-0.025
and-0.025
owing-0.025
os-0.024
 Su-0.023
Token her
Token ID607
Feature activation-2.06e+00

Pos logit contributions
anged+0.042
 Su+0.039
os+0.039
owing+0.038
and+0.038
Neg logit contributions
 big-0.041
 whole-0.039
A-0.038
 little-0.036
 huge-0.035
Token songs
Token ID7259
Feature activation+2.01e-01

Pos logit contributions
anged+0.058
owing+0.056
os+0.054
 Su+0.053
and+0.053
Neg logit contributions
 big-0.036
A-0.033
 whole-0.033
 huge-0.031
 little-0.030
Token.
Token ID13
Feature activation-5.15e-01

Pos logit contributions
 big+0.004
 whole+0.003
 little+0.003
 huge+0.003
A+0.003
Neg logit contributions
anged-0.006
owing-0.006
and-0.006
os-0.006
 Su-0.005
Token
Token ID198
Feature activation+2.75e+00

Pos logit contributions
owing+0.016
anged+0.016
beh+0.015
os+0.014
 supervision+0.014
Neg logit contributions
 big-0.010
 little-0.009
 two-0.009
A-0.009
 small-0.008
Token
Token ID198
Feature activation+2.76e+00

Pos logit contributions
 big+0.044
A+0.035
 little+0.033
 whole+0.033
 huge+0.031
Neg logit contributions
anged-0.099
owing-0.096
os-0.088
beh-0.088
years-0.087
TokenBut
Token ID1537
Feature activation-3.31e+00

Pos logit contributions
 big+0.043
 whole+0.035
A+0.034
 little+0.034
 huge+0.033
Neg logit contributions
anged-0.094
owing-0.091
and-0.088
os-0.088
 Su-0.085
Token,
Token ID11
Feature activation-5.55e+00

Pos logit contributions
anged+0.038
owing+0.037
and+0.037
os+0.036
 Su+0.034
Neg logit contributions
 big-0.023
 whole-0.020
 little-0.019
 huge-0.019
A-0.019
Token two
Token ID734
Feature activation+8.24e-01

Pos logit contributions
anged+0.180
and+0.174
owing+0.173
os+0.169
 Su+0.163
Neg logit contributions
 big-0.096
 whole-0.077
 little-0.077
 huge-0.075
 two-0.073
Token lines
Token ID3951
Feature activation+5.29e-01

Pos logit contributions
 big+0.017
 whole+0.015
A+0.015
 little+0.015
 huge+0.014
Neg logit contributions
anged-0.023
owing-0.021
and-0.021
os-0.021
 Su-0.020
Token for
Token ID329
Feature activation-3.46e+00

Pos logit contributions
 big+0.010
 whole+0.009
 little+0.009
 huge+0.008
A+0.008
Neg logit contributions
anged-0.016
owing-0.015
and-0.015
os-0.015
 Su-0.014
Token the
Token ID262
Feature activation-2.46e+00

Pos logit contributions
anged+0.109
and+0.106
owing+0.106
os+0.104
beh+0.099
Neg logit contributions
 big-0.070
 little-0.058
 huge-0.056
 two-0.055
 sharp-0.054
Token legs
Token ID7405
Feature activation-2.09e+00

Pos logit contributions
anged+0.074
owing+0.070
and+0.069
os+0.069
 Su+0.066
Neg logit contributions
 big-0.043
A-0.037
 whole-0.036
 huge-0.035
 little-0.035
Token and
Token ID290
Feature activation-5.90e+00

Pos logit contributions
anged+0.063
owing+0.061
and+0.060
os+0.060
 Su+0.057
Neg logit contributions
 big-0.038
 whole-0.033
A-0.032
 huge-0.031
 little-0.031
Token a
Token ID257
Feature activation+4.08e+00

Pos logit contributions
anged+0.173
and+0.168
owing+0.167
os+0.163
 Su+0.157
Neg logit contributions
 big-0.119
 whole-0.099
 little-0.098
 huge-0.097
 two-0.095
Token fan
Token ID4336
Feature activation-1.39e+00

Pos logit contributions
 big+0.067
 whole+0.057
A+0.057
 little+0.055
 huge+0.054
Neg logit contributions
anged-0.125
and-0.120
owing-0.120
os-0.118
 Su-0.114
Token for
Token ID329
Feature activation-1.79e+00

Pos logit contributions
anged+0.046
owing+0.044
and+0.044
os+0.043
 Su+0.041
Neg logit contributions
 big-0.022
 whole-0.018
 little-0.018
 huge-0.018
A-0.017
Token the
Token ID262
Feature activation-1.86e+00

Pos logit contributions
anged+0.053
owing+0.052
and+0.051
os+0.050
 Acc+0.047
Neg logit contributions
 big-0.037
 little-0.031
 whole-0.030
 huge-0.030
 two-0.030
Token and
Token ID290
Feature activation-4.85e+00

Pos logit contributions
anged+0.028
owing+0.026
and+0.026
os+0.025
beh+0.025
Neg logit contributions
 big-0.016
 little-0.013
 whole-0.013
 huge-0.012
 two-0.012
Token she
Token ID673
Feature activation+1.69e+00

Pos logit contributions
anged+0.145
and+0.141
owing+0.141
os+0.136
azes+0.132
Neg logit contributions
 big-0.096
 little-0.079
 whole-0.079
 huge-0.077
 two-0.076
Token gave
Token ID2921
Feature activation-3.17e+00

Pos logit contributions
 big+0.034
 whole+0.030
 little+0.029
A+0.029
 huge+0.029
Neg logit contributions
anged-0.048
os-0.046
and-0.045
owing-0.045
 Su-0.043
Token them
Token ID606
Feature activation-1.32e+00

Pos logit contributions
anged+0.076
owing+0.073
and+0.071
os+0.068
azes+0.067
Neg logit contributions
 big-0.086
 two-0.075
 little-0.074
 whole-0.073
 huge-0.072
Token lots
Token ID6041
Feature activation+1.00e+00

Pos logit contributions
anged+0.044
and+0.043
owing+0.042
os+0.042
 Su+0.040
Neg logit contributions
 big-0.020
 whole-0.016
 little-0.016
 huge-0.016
A-0.015
Token of
Token ID286
Feature activation-4.22e+00

Pos logit contributions
 big+0.022
 whole+0.020
A+0.019
 huge+0.019
 little+0.019
Neg logit contributions
and-0.026
anged-0.025
owing-0.025
os-0.024
 Su-0.023
Token y
Token ID331
Feature activation-2.59e+00

Pos logit contributions
anged+0.127
and+0.122
owing+0.121
os+0.120
 Su+0.117
Neg logit contributions
 big-0.074
 whole-0.065
A-0.062
 huge-0.061
 little-0.060
Tokenummy
Token ID13513
Feature activation-2.35e+00

Pos logit contributions
anged+0.058
and+0.055
owing+0.055
os+0.054
 Su+0.051
Neg logit contributions
 big-0.069
 whole-0.061
 little-0.060
 huge-0.059
 two-0.058
Token snacks
Token ID28487
Feature activation-7.75e-01

Pos logit contributions
anged+0.067
and+0.065
owing+0.064
os+0.064
 Su+0.060
Neg logit contributions
 big-0.043
A-0.038
 whole-0.037
 huge-0.036
 little-0.035
Token.
Token ID13
Feature activation-1.98e+00

Pos logit contributions
anged+0.024
and+0.024
owing+0.024
os+0.023
 Su+0.022
Neg logit contributions
 big-0.013
 whole-0.011
 little-0.011
 huge-0.011
A-0.011
Token
Token ID198
Feature activation+1.76e+00

Pos logit contributions
owing+0.056
anged+0.055
and+0.051
os+0.050
beh+0.050
Neg logit contributions
 big-0.044
 little-0.038
 two-0.037
A-0.036
 whole-0.036
Token
Token ID198
Feature activation+5.84e-01

Pos logit contributions
anged+0.100
owing+0.095
beh+0.092
reating+0.090
ram+0.089
Neg logit contributions
 big-0.036
 little-0.030
 two-0.030
A-0.029
 a-0.027
Token
Token ID198
Feature activation+1.05e+00

Pos logit contributions
 big+0.009
A+0.007
 little+0.007
 whole+0.007
 two+0.007
Neg logit contributions
anged-0.022
owing-0.021
os-0.019
beh-0.019
 supervision-0.019
TokenThe
Token ID464
Feature activation+3.89e-01

Pos logit contributions
 big+0.014
 whole+0.011
A+0.011
 little+0.011
 huge+0.010
Neg logit contributions
anged-0.038
owing-0.037
os-0.036
and-0.036
 Su-0.035
Token fairy
Token ID25607
Feature activation-6.43e-01

Pos logit contributions
 big+0.008
A+0.007
 whole+0.007
 huge+0.007
 sharp+0.006
Neg logit contributions
anged-0.010
 Jos-0.009
and-0.009
years-0.009
 Him-0.009
Token smiled
Token ID13541
Feature activation-1.06e+00

Pos logit contributions
anged+0.016
os+0.016
owing+0.016
and+0.016
 Su+0.015
Neg logit contributions
 big-0.015
 whole-0.013
 huge-0.012
A-0.012
 little-0.012
Token and
Token ID290
Feature activation-4.99e+00

Pos logit contributions
owing+0.026
anged+0.025
and+0.025
 Jos+0.023
atted+0.023
Neg logit contributions
 big-0.022
A-0.021
 whole-0.020
 huge-0.020
 sharp-0.019
Token waved
Token ID26834
Feature activation-2.42e+00

Pos logit contributions
anged+0.128
os+0.120
owing+0.120
and+0.119
 Su+0.119
Neg logit contributions
 big-0.110
 whole-0.101
A-0.094
 huge-0.094
 little-0.093
Token goodbye
Token ID24829
Feature activation-3.06e+00

Pos logit contributions
anged+0.050
and+0.047
owing+0.047
os+0.045
 Su+0.043
Neg logit contributions
 big-0.069
 whole-0.062
 huge-0.060
 little-0.060
A-0.060
Token as
Token ID355
Feature activation-1.68e+00

Pos logit contributions
anged+0.095
and+0.091
owing+0.091
os+0.089
 Su+0.086
Neg logit contributions
 big-0.056
 whole-0.047
 little-0.046
 huge-0.045
A-0.044
Token she
Token ID673
Feature activation-7.45e-02

Pos logit contributions
anged+0.051
owing+0.049
and+0.048
os+0.047
ram+0.045
Neg logit contributions
 big-0.036
 little-0.030
 whole-0.029
 huge-0.029
 two-0.029
Token flew
Token ID13112
Feature activation-1.69e+00

Pos logit contributions
anged+0.002
os+0.002
owing+0.002
and+0.002
atted+0.002
Neg logit contributions
 big-0.002
 whole-0.002
 huge-0.002
 little-0.002
A-0.002
TokenMom
Token ID29252
Feature activation-2.10e+00

Pos logit contributions
 big+0.024
 whole+0.020
 little+0.020
 huge+0.020
 two+0.019
Neg logit contributions
anged-0.038
and-0.037
owing-0.036
os-0.036
 Su-0.035
Token hugged
Token ID41130
Feature activation-3.94e+00

Pos logit contributions
anged+0.054
and+0.051
owing+0.051
os+0.050
 Su+0.047
Neg logit contributions
 big-0.049
 whole-0.044
 little-0.042
 huge-0.042
A-0.041
Token them
Token ID606
Feature activation-1.94e+00

Pos logit contributions
owing+0.106
anged+0.103
and+0.099
os+0.096
ooting+0.095
Neg logit contributions
 big-0.102
 little-0.092
 two-0.089
 whole-0.082
A-0.081
Token and
Token ID290
Feature activation-5.32e+00

Pos logit contributions
anged+0.067
owing+0.063
os+0.062
and+0.062
azes+0.061
Neg logit contributions
 big-0.033
 little-0.026
 whole-0.026
 two-0.025
 huge-0.024
Token said
Token ID531
Feature activation-1.89e+00

Pos logit contributions
anged+0.141
os+0.130
 Su+0.129
owing+0.129
and+0.128
Neg logit contributions
 big-0.115
 whole-0.109
 huge-0.099
A-0.098
 little-0.097
Token,
Token ID11
Feature activation-3.85e+00

Pos logit contributions
anged+0.068
and+0.065
os+0.065
owing+0.064
beh+0.063
Neg logit contributions
 big-0.029
 little-0.024
 two-0.022
 bad-0.019
 small-0.019
Token "
Token ID366
Feature activation-1.38e+00

Pos logit contributions
anged+0.125
owing+0.117
os+0.116
beh+0.116
and+0.115
Neg logit contributions
 big-0.076
 little-0.066
 two-0.062
A-0.060
 whole-0.058
TokenI
Token ID40
Feature activation+1.52e+00

Pos logit contributions
and+0.044
anged+0.043
os+0.042
 Acc+0.041
owing+0.041
Neg logit contributions
 big-0.022
 huge-0.018
 whole-0.017
 little-0.017
 sharp-0.016
Token love
Token ID1842
Feature activation-6.43e-01

Pos logit contributions
 big+0.031
 whole+0.027
 huge+0.027
 little+0.026
A+0.026
Neg logit contributions
anged-0.042
owing-0.041
and-0.041
os-0.040
 Su-0.039
Token you
Token ID345
Feature activation-4.33e-01

Pos logit contributions
anged+0.022
owing+0.022
os+0.021
and+0.020
ooting+0.020
Neg logit contributions
 big-0.011
 little-0.009
 two-0.009
A-0.009
 whole-0.008
Token too
Token ID1165
Feature activation+2.04e+00

Pos logit contributions
owing+0.011
anged+0.011
and+0.011
os+0.011
years+0.010
Neg logit contributions
 big-0.009
 whole-0.008
 huge-0.008
A-0.007
 large-0.007
Token.
Token ID13
Feature activation-1.27e+00

Pos logit contributions
 big+0.008
 whole+0.007
A+0.007
 huge+0.007
 little+0.007
Neg logit contributions
anged-0.015
owing-0.014
and-0.014
os-0.014
 Su-0.013
Token He
Token ID679
Feature activation+3.51e+00

Pos logit contributions
anged+0.037
owing+0.037
and+0.036
os+0.035
years+0.034
Neg logit contributions
 big-0.026
 little-0.022
 whole-0.021
 huge-0.021
A-0.021
Token is
Token ID318
Feature activation+2.62e+00

Pos logit contributions
 whole+0.087
 big+0.084
A+0.081
 entire+0.078
 little+0.075
Neg logit contributions
anged-0.077
os-0.074
atted-0.069
azes-0.066
owing-0.064
Token ready
Token ID3492
Feature activation+6.24e-01

Pos logit contributions
 big+0.049
 whole+0.042
 little+0.040
A+0.040
 huge+0.040
Neg logit contributions
anged-0.078
owing-0.075
and-0.075
os-0.073
 Su-0.071
Token to
Token ID284
Feature activation-3.02e-01

Pos logit contributions
 big+0.014
 whole+0.011
 little+0.011
 huge+0.011
 two+0.011
Neg logit contributions
anged-0.018
and-0.017
owing-0.017
os-0.017
azes-0.016
Token go
Token ID467
Feature activation-3.07e+00

Pos logit contributions
anged+0.007
owing+0.007
 Su+0.007
os+0.007
 Him+0.006
Neg logit contributions
 big-0.006
 whole-0.006
A-0.006
 little-0.005
 huge-0.005
Token back
Token ID736
Feature activation-2.80e+00

Pos logit contributions
anged+0.091
and+0.087
owing+0.087
os+0.085
 Su+0.082
Neg logit contributions
 big-0.061
 whole-0.052
 little-0.051
 huge-0.050
A-0.048
Token to
Token ID284
Feature activation-2.61e+00

Pos logit contributions
anged+0.073
and+0.070
owing+0.070
os+0.069
 Su+0.065
Neg logit contributions
 big-0.064
 whole-0.056
 little-0.055
 huge-0.054
A-0.053
Token his
Token ID465
Feature activation-2.22e+00

Pos logit contributions
anged+0.081
owing+0.078
and+0.077
os+0.076
years+0.072
Neg logit contributions
 big-0.053
 little-0.043
 whole-0.043
 huge-0.042
 two-0.041
Token nest
Token ID16343
Feature activation-3.24e-01

Pos logit contributions
anged+0.054
owing+0.051
and+0.049
 supervision+0.047
os+0.047
Neg logit contributions
A-0.045
 big-0.044
 whole-0.041
 huge-0.040
 two-0.039
Token.
Token ID13
Feature activation-6.55e-01

Pos logit contributions
anged+0.010
owing+0.010
and+0.010
os+0.010
 Su+0.009
Neg logit contributions
 big-0.006
 whole-0.005
A-0.005
 huge-0.005
 little-0.005
Token very
Token ID845
Feature activation+8.49e-01

Pos logit contributions
 big+0.086
 whole+0.079
A+0.078
+0.075
 little+0.074
Neg logit contributions
anged-0.118
years-0.114
owing-0.113
and-0.112
os-0.105
Token angry
Token ID7954
Feature activation+5.80e-01

Pos logit contributions
 big+0.018
A+0.017
 whole+0.017
 huge+0.016
 two+0.016
Neg logit contributions
anged-0.021
owing-0.020
and-0.020
azes-0.019
 Acc-0.019
Token.
Token ID13
Feature activation-4.34e-01

Pos logit contributions
 big+0.011
 whole+0.010
 little+0.009
 huge+0.009
A+0.009
Neg logit contributions
anged-0.017
and-0.016
owing-0.016
os-0.016
 Su-0.015
Token He
Token ID679
Feature activation+3.63e+00

Pos logit contributions
anged+0.014
owing+0.014
and+0.013
os+0.013
 Him+0.013
Neg logit contributions
 big-0.008
 little-0.007
 huge-0.006
A-0.006
 two-0.006
Token stamped
Token ID28305
Feature activation-2.31e+00

Pos logit contributions
 big+0.078
 whole+0.074
 huge+0.068
 entire+0.064
 two+0.064
Neg logit contributions
anged-0.093
os-0.083
 Su-0.081
years-0.080
owing-0.079
Token his
Token ID465
Feature activation-3.20e+00

Pos logit contributions
anged+0.066
owing+0.065
and+0.063
os+0.063
 Acc+0.062
Neg logit contributions
 big-0.049
 little-0.042
 whole-0.040
 two-0.040
 huge-0.039
Token feet
Token ID3625
Feature activation-1.18e+00

Pos logit contributions
anged+0.086
owing+0.080
and+0.080
os+0.076
azes+0.074
Neg logit contributions
 big-0.069
A-0.061
 whole-0.060
 huge-0.059
 little-0.057
Token and
Token ID290
Feature activation-5.09e+00

Pos logit contributions
anged+0.035
and+0.034
owing+0.034
os+0.033
 Su+0.032
Neg logit contributions
 big-0.022
 whole-0.019
 little-0.018
 huge-0.018
A-0.018
Token yelled
Token ID22760
Feature activation-2.05e+00

Pos logit contributions
anged+0.138
and+0.133
owing+0.132
os+0.129
 Su+0.123
Neg logit contributions
 big-0.113
 whole-0.097
 little-0.096
 huge-0.094
A-0.092
Token because
Token ID780
Feature activation-8.93e-01

Pos logit contributions
anged+0.064
owing+0.061
and+0.061
os+0.060
 Su+0.057
Neg logit contributions
 big-0.040
 little-0.033
 whole-0.031
 huge-0.031
 two-0.030
Token no
Token ID645
Feature activation-1.11e+00

Pos logit contributions
anged+0.027
owing+0.026
and+0.026
os+0.025
 Su+0.024
Neg logit contributions
 big-0.017
 whole-0.014
 little-0.014
 huge-0.014
A-0.014
Token so
Token ID523
Feature activation+5.68e-01

Pos logit contributions
 big+0.047
 whole+0.041
A+0.040
 little+0.039
 huge+0.039
Neg logit contributions
anged-0.067
owing-0.065
and-0.065
os-0.063
 Su-0.062
Token happy
Token ID3772
Feature activation+1.77e+00

Pos logit contributions
 big+0.014
 whole+0.013
A+0.012
 little+0.012
 two+0.012
Neg logit contributions
and-0.013
anged-0.013
owing-0.012
os-0.012
 Acc-0.012
Token and
Token ID290
Feature activation-4.19e+00

Pos logit contributions
 big+0.028
 little+0.022
 whole+0.022
 huge+0.021
 two+0.021
Neg logit contributions
anged-0.060
owing-0.057
and-0.057
os-0.057
 Su-0.054
Token thanked
Token ID26280
Feature activation-4.01e+00

Pos logit contributions
anged+0.108
and+0.103
owing+0.103
os+0.100
 Su+0.097
Neg logit contributions
 big-0.097
 whole-0.085
 little-0.082
 huge-0.082
A-0.081
Token the
Token ID262
Feature activation-2.28e+00

Pos logit contributions
anged+0.141
owing+0.134
and+0.132
os+0.131
ooters+0.127
Neg logit contributions
 big-0.075
 little-0.066
 two-0.060
 a-0.059
A-0.055
Token re
Token ID302
Feature activation-1.02e+01

Pos logit contributions
 Him+0.052
years+0.047
and+0.047
owing+0.046
anged+0.045
Neg logit contributions
 big-0.043
A-0.042
-0.039
ball-0.039
case-0.039
Tokenind
Token ID521
Feature activation-9.01e+00

Pos logit contributions
anged+0.261
 Su+0.247
and+0.242
 Jos+0.237
owing+0.235
Neg logit contributions
 big-0.234
 little-0.205
 whole-0.204
A-0.196
 huge-0.194
Tokeneer
Token ID28153
Feature activation-4.14e+00

Pos logit contributions
anged+0.192
and+0.178
owing+0.175
 Su+0.169
os+0.169
Neg logit contributions
 big-0.254
 whole-0.227
 little-0.227
 huge-0.218
 two-0.216
Token for
Token ID329
Feature activation-1.52e+00

Pos logit contributions
anged+0.112
os+0.100
azes+0.100
and+0.100
 Su+0.099
Neg logit contributions
 big-0.098
 little-0.086
 whole-0.085
 two-0.083
 huge-0.080
Token showing
Token ID4478
Feature activation-2.97e+00

Pos logit contributions
anged+0.039
os+0.036
owing+0.034
and+0.034
ram+0.032
Neg logit contributions
 big-0.042
 two-0.036
 little-0.036
 whole-0.036
 huge-0.035
Token her
Token ID607
Feature activation-2.60e+00

Pos logit contributions
anged+0.063
owing+0.058
and+0.058
os+0.057
azes+0.053
Neg logit contributions
 big-0.086
 little-0.076
 whole-0.076
 huge-0.074
 two-0.074
Token Lily
Token ID20037
Feature activation-2.57e+00

Pos logit contributions
anged+0.112
and+0.110
 Su+0.108
owing+0.106
 Jos+0.103
Neg logit contributions
 big-0.129
 whole-0.118
 huge-0.113
A-0.112
 sharp-0.107
Token.
Token ID13
Feature activation-1.15e+00

Pos logit contributions
anged+0.077
owing+0.077
os+0.074
and+0.073
 Su+0.072
Neg logit contributions
 big-0.047
 whole-0.039
A-0.039
 huge-0.039
 little-0.037
Token I
Token ID314
Feature activation+2.00e+00

Pos logit contributions
anged+0.031
and+0.031
owing+0.030
os+0.029
ules+0.028
Neg logit contributions
 big-0.024
 little-0.022
 two-0.021
A-0.021
 whole-0.021
Token'll
Token ID1183
Feature activation+1.37e+00

Pos logit contributions
 big+0.044
A+0.039
 whole+0.039
 huge+0.038
 sharp+0.038
Neg logit contributions
anged-0.052
owing-0.051
os-0.051
 Su-0.047
and-0.047
Token help
Token ID1037
Feature activation-2.53e+00

Pos logit contributions
 big+0.031
 whole+0.029
A+0.029
 huge+0.028
 heavy+0.027
Neg logit contributions
owing-0.031
anged-0.030
 Su-0.030
 Acc-0.029
 Jos-0.028
Token Mitt
Token ID16627
Feature activation-9.62e+00

Pos logit contributions
anged+0.073
owing+0.069
and+0.069
os+0.068
 Su+0.065
Neg logit contributions
 big-0.053
 whole-0.045
 little-0.044
 huge-0.043
A-0.042
Tokenens
Token ID641
Feature activation-4.89e+00

Pos logit contributions
anged+0.145
owing+0.134
and+0.134
os+0.127
 Su+0.118
Neg logit contributions
 big-0.329
 whole-0.300
 huge-0.296
 little-0.294
A-0.289
Token."
Token ID526
Feature activation-2.72e+00

Pos logit contributions
owing+0.131
and+0.129
anged+0.127
os+0.125
 Su+0.116
Neg logit contributions
 big-0.099
A-0.090
 huge-0.087
 whole-0.086
 sharp-0.083
Token She
Token ID1375
Feature activation+2.52e+00

Pos logit contributions
owing+0.083
anged+0.082
os+0.080
years+0.078
beh+0.077
Neg logit contributions
 big-0.055
 little-0.051
 two-0.045
 whole-0.043
A-0.043
Token climbed
Token ID19952
Feature activation-2.66e+00

Pos logit contributions
 big+0.053
 whole+0.050
 huge+0.045
A+0.044
 little+0.044
Neg logit contributions
anged-0.068
os-0.063
and-0.062
owing-0.062
 Su-0.061
Token the
Token ID262
Feature activation-3.10e-01

Pos logit contributions
anged+0.075
owing+0.072
and+0.072
os+0.069
azes+0.067
Neg logit contributions
 big-0.059
 whole-0.051
 little-0.049
 huge-0.049
 two-0.049
Token.
Token ID13
Feature activation-1.08e+00

Pos logit contributions
 big+0.015
A+0.015
 whole+0.015
 huge+0.015
 large+0.014
Neg logit contributions
anged-0.018
owing-0.016
 supervision-0.016
os-0.016
rings-0.016
Token It
Token ID632
Feature activation+3.87e+00

Pos logit contributions
anged+0.031
owing+0.030
and+0.029
os+0.028
 Su+0.027
Neg logit contributions
 big-0.023
A-0.019
 little-0.019
 two-0.019
 whole-0.019
Token is
Token ID318
Feature activation+1.74e+00

Pos logit contributions
 big+0.063
 whole+0.061
A+0.057
 huge+0.054
 little+0.053
Neg logit contributions
os-0.119
anged-0.112
owing-0.109
 Acc-0.105
and-0.104
Token for
Token ID329
Feature activation+2.38e-01

Pos logit contributions
 big+0.023
 whole+0.022
A+0.020
 little+0.019
+0.019
Neg logit contributions
anged-0.055
 Su-0.053
owing-0.053
and-0.052
 Kim-0.052
Token grown
Token ID7334
Feature activation-2.44e+00

Pos logit contributions
 big+0.005
 whole+0.005
 little+0.005
 huge+0.005
A+0.005
Neg logit contributions
anged-0.006
and-0.006
owing-0.006
os-0.006
 Su-0.005
Token-
Token ID12
Feature activation-9.13e+00

Pos logit contributions
and+0.057
anged+0.057
owing+0.056
os+0.053
beh+0.051
Neg logit contributions
 big-0.058
 whole-0.052
 huge-0.052
A-0.051
 little-0.050
Tokenups
Token ID4739
Feature activation-2.13e+00

Pos logit contributions
anged+0.187
owing+0.177
and+0.170
os+0.170
 Su+0.166
Neg logit contributions
 big-0.260
 whole-0.232
A-0.229
 little-0.228
 huge-0.223
Token.
Token ID13
Feature activation-3.91e-01

Pos logit contributions
anged+0.067
owing+0.065
and+0.065
os+0.063
 Su+0.059
Neg logit contributions
 big-0.038
 whole-0.032
 huge-0.031
 little-0.030
A-0.030
Token You
Token ID921
Feature activation+2.52e+00

Pos logit contributions
anged+0.010
owing+0.010
and+0.009
os+0.009
ooting+0.009
Neg logit contributions
 big-0.009
A-0.008
 two-0.008
 little-0.008
 whole-0.008
Token could
Token ID714
Feature activation+2.80e+00

Pos logit contributions
 big+0.050
 whole+0.048
 huge+0.045
A+0.044
 little+0.043
Neg logit contributions
os-0.068
anged-0.066
owing-0.063
 Kim-0.059
 Su-0.059
Token have
Token ID423
Feature activation+3.18e+00

Pos logit contributions
 big+0.057
 whole+0.054
 little+0.052
 huge+0.051
A+0.050
Neg logit contributions
anged-0.067
 Kim-0.063
 Su-0.063
owing-0.061
os-0.061
Token a
Token ID257
Feature activation+6.96e+00

Pos logit contributions
 big+0.016
 whole+0.014
 huge+0.012
 little+0.011
A+0.011
Neg logit contributions
anged-0.045
and-0.044
 Su-0.043
owing-0.043
os-0.043
Token strange
Token ID6283
Feature activation+5.64e+00

Pos logit contributions
A+0.055
 two+0.052
 whole+0.051
 big+0.051
 entire+0.041
Neg logit contributions
 Su-0.228
anged-0.225
and-0.224
owing-0.222
 J-0.221
Token,
Token ID11
Feature activation-2.34e+00

Pos logit contributions
 big+0.091
A+0.081
 whole+0.080
 two+0.076
 huge+0.074
Neg logit contributions
anged-0.170
owing-0.168
and-0.162
 Su-0.160
os-0.160
Token raspberry
Token ID38973
Feature activation+1.33e+00

Pos logit contributions
anged+0.080
and+0.079
owing+0.078
 Su+0.078
os+0.077
Neg logit contributions
 big-0.027
 whole-0.023
A-0.021
 little-0.020
 two-0.020
Token-
Token ID12
Feature activation-7.83e+00

Pos logit contributions
 big+0.025
 whole+0.024
A+0.023
 huge+0.022
 little+0.021
Neg logit contributions
anged-0.034
owing-0.034
and-0.034
os-0.034
azes-0.032
Tokencol
Token ID4033
Feature activation-9.68e+00

Pos logit contributions
anged+0.198
owing+0.188
and+0.183
os+0.179
azes+0.172
Neg logit contributions
 big-0.192
 whole-0.168
 little-0.165
A-0.163
 huge-0.163
Tokenoured
Token ID8167
Feature activation+1.60e+00

Pos logit contributions
and+0.221
anged+0.196
os+0.196
owing+0.194
 Su+0.187
Neg logit contributions
 big-0.238
 whole-0.213
 little-0.213
A-0.210
 huge-0.209
Token object
Token ID2134
Feature activation+1.53e+00

Pos logit contributions
 big+0.031
 whole+0.027
A+0.026
 huge+0.026
 little+0.025
Neg logit contributions
anged-0.045
and-0.043
owing-0.043
os-0.042
 Su-0.041
Token hiding
Token ID11816
Feature activation+1.38e+00

Pos logit contributions
 big+0.029
 whole+0.026
A+0.025
 little+0.024
 huge+0.024
Neg logit contributions
anged-0.045
owing-0.044
os-0.042
and-0.042
 Su-0.040
Token under
Token ID739
Feature activation-1.71e+00

Pos logit contributions
 big+0.037
+0.034
 whole+0.034
 little+0.034
A+0.033
Neg logit contributions
anged-0.026
and-0.026
owing-0.025
os-0.024
 Su-0.024
Token a
Token ID257
Feature activation+6.41e+00

Pos logit contributions
anged+0.049
and+0.048
 Su+0.048
 Jos+0.047
os+0.046
Neg logit contributions
 big-0.029
 whole-0.026
 huge-0.024
 little-0.024
A-0.023
Token new
Token ID649
Feature activation+5.11e-01

Pos logit contributions
 big+0.062
 whole+0.055
A+0.051
 huge+0.049
 two+0.048
Neg logit contributions
and-0.144
owing-0.140
anged-0.139
os-0.137
 Su-0.136
Token!
Token ID0
Feature activation-1.21e+00

Pos logit contributions
 big+0.010
 whole+0.008
 little+0.008
 huge+0.008
A+0.008
Neg logit contributions
anged-0.015
and-0.015
owing-0.015
os-0.015
 Su-0.014
Token She
Token ID1375
Feature activation+2.81e+00

Pos logit contributions
owing+0.038
anged+0.037
and+0.034
years+0.033
 supervision+0.033
Neg logit contributions
 big-0.024
 two-0.020
 little-0.019
A-0.019
 whole-0.018
Token thought
Token ID1807
Feature activation+3.49e+00

Pos logit contributions
 big+0.059
 whole+0.053
A+0.051
 little+0.049
 huge+0.049
Neg logit contributions
anged-0.074
os-0.071
and-0.069
 Su-0.068
owing-0.068
Token av
Token ID1196
Feature activation-4.66e+00

Pos logit contributions
 big+0.072
 little+0.060
 huge+0.056
 two+0.054
 sharp+0.054
Neg logit contributions
anged-0.111
owing-0.104
os-0.100
and-0.100
beh-0.097
Tokenoc
Token ID420
Feature activation-1.11e+01

Pos logit contributions
anged+0.107
and+0.101
owing+0.101
os+0.098
 Su+0.093
Neg logit contributions
 big-0.122
 whole-0.108
 little-0.107
 huge-0.106
A-0.104
Tokenados
Token ID22484
Feature activation+2.68e-01

Pos logit contributions
anged+0.180
and+0.175
os+0.160
owing+0.159
 Jos+0.157
Neg logit contributions
 big-0.345
 two-0.320
 little-0.312
 sharp-0.312
A-0.309
Token were
Token ID547
Feature activation+6.07e-01

Pos logit contributions
 big+0.006
 whole+0.005
A+0.005
 little+0.005
 huge+0.005
Neg logit contributions
os-0.007
anged-0.007
owing-0.007
and-0.007
 Su-0.006
Token the
Token ID262
Feature activation-2.23e-01

Pos logit contributions
 big+0.009
 whole+0.008
A+0.008
+0.007
 little+0.007
Neg logit contributions
and-0.017
 Su-0.017
owing-0.017
 Jos-0.017
 J-0.017
Token most
Token ID749
Feature activation+1.81e+00

Pos logit contributions
and+0.006
owing+0.006
 J+0.006
anged+0.006
 Jos+0.006
Neg logit contributions
 big-0.003
A-0.003
 little-0.003
 huge-0.003
 whole-0.003
Token interesting
Token ID3499
Feature activation+2.20e+00

Pos logit contributions
 big+0.023
A+0.023
 whole+0.021
 two+0.021
 top+0.020
Neg logit contributions
and-0.059
owing-0.054
 Acc-0.053
 Su-0.052
anged-0.051