TOP ACTIVATIONS
MAX = 11.278

quiet compared to the noisy se ag ull s outside .
dressed up in her fan ci est clothes and asked her
enough to scare away the se ag ull s !"
Tom said goodbye to the se ag ull s and put
very special toy for L ott ie . It was a
the d unes to the se ash ore , she could
caught the attention of a se ag ull nearby . The
One day a trou l be came to the bridge .
and Jacob arrived at the se ash ore , a big
and said goodbye to her se ag ull friends . She
apped together until all the se ag ull s flew away
sorry ," the wild er be ast said . " But
his cars away from the se ag ull s so they
charging at the wild e be est , trying to make
Tommy laughed at the silly se ag ull and the man
. She waved to the se ag ull s and looked
rope and the wild er be ast smiled .
the beach to feed the se ag ull s , but
said the other wild er be ast . " Let 's
ebra and the wild e be est lived happily ever after

MIN ACTIVATIONS
MIN = -8.875

their families . They call 911 . They hope it is
hurt badly . She called 911 and waited for the ambulance
cl apped and shouted " ready to go !" She was
of the planets is called Earth . That 's our planet
here , " said Earth Man . Lu
not toys . They said lasers were dangerous .
Jen hurt . He calls 911 . He says " A
lasers anymore . They said lasers were not toys . They
. " Is math hard ?" asked Jenny .
the four . She looked everywhere , but she could not
, nine , ten ! Ready or not , here I
find it . She looked everywhere , but the yarn was
eyes and said , " ready or not , here I
and scared . She looked everywhere , but she could not
was missing . He looked everywhere for it . He was
she was . They looked everywhere . They asked the neighbors
You have five minutes . Ready , set , go !"
he was . They looked everywhere . They asked the park
pink dress . I looked everywhere ." Her mom said ,
, nine , ten . Ready or not , here I

INTERVAL 6.240 - 11.278
CONTAINS 0.043%

would swim around in the tub . He had so much
could play with the f auc et knowing that it would
pair put on their rain co ats and went outside .
the oy sters in a pot , added some butter and
always happy to share with each other . <|endoftext|> Once upon

INTERVAL 1.201 - 6.240
CONTAINS 20.591%

" Okay , but you must wear a life vest and
time together , and from that day on , Lily and
, enjoying this warm sunny day ." " That
hill and dug in the dirt . They found a box
girl were friends and they painted many pictures together . The

INTERVAL -3.837 - 1.201
CONTAINS 78.065%

Tom my was worried because he couldn 't see where his
on , Lily loved to hang out on the swing and
They dance with Mom and Dad . They swing and sway
The child nodded . The old man took the child
a popular vendor selling ice cream . Danny had some money

INTERVAL -8.875 - -3.837
CONTAINS 1.302%

cave . The cave was big and dark . Tim my
how the swings were too high . The musician smiled and
and cleaned it up . When the cake was done ,
they pretended to be pirates or superheroes . They had a
be fair . No pushing or cheating ." They
Token quiet
Token ID5897
Feature activation+2.14e+00

Pos logit contributions
 dod+0.016
 unlocks+0.014
 stops+0.014
 opens+0.014
 pulls+0.013
Neg logit contributions
be-0.016
 se-0.015
Sus-0.015
 especially-0.015
mine-0.014
Token compared
Token ID3688
Feature activation+2.42e+00

Pos logit contributions
be+0.104
Sus+0.101
years+0.100
mine+0.100
 se+0.095
Neg logit contributions
 dod-0.074
 stops-0.064
 opens-0.064
 unlocks-0.060
 falls-0.058
Token to
Token ID284
Feature activation+1.15e+00

Pos logit contributions
be+0.085
Sus+0.083
mine+0.082
years+0.081
 especially+0.078
Neg logit contributions
 dod-0.117
 stops-0.103
 opens-0.102
 unlocks-0.100
 pulls-0.097
Token the
Token ID262
Feature activation+2.17e-01

Pos logit contributions
be+0.046
Sus+0.044
mine+0.043
 especially+0.043
years+0.043
Neg logit contributions
 dod-0.049
 stops-0.043
 opens-0.043
 unlocks-0.042
 falls-0.040
Token noisy
Token ID31210
Feature activation+2.80e+00

Pos logit contributions
be+0.009
 se+0.008
Sus+0.008
 especially+0.008
 occasion+0.008
Neg logit contributions
 dod-0.009
 opens-0.008
 unlocks-0.008
 stops-0.008
 pulls-0.008
Token se
Token ID384
Feature activation+1.13e+01

Pos logit contributions
be+0.111
Sus+0.104
mine+0.102
 se+0.102
years+0.098
Neg logit contributions
 dod-0.124
 opens-0.110
 unlocks-0.108
 stops-0.107
 pulls-0.104
Tokenag
Token ID363
Feature activation+4.12e+00

Pos logit contributions
Sus+0.393
years+0.390
be+0.362
mine+0.361
 harbor+0.351
Neg logit contributions
 dod-0.535
 stops-0.469
 opens-0.451
 falls-0.445
 pulls-0.442
Tokenull
Token ID724
Feature activation+1.91e+00

Pos logit contributions
be+0.109
Sus+0.103
years+0.101
mine+0.099
 se+0.091
Neg logit contributions
 dod-0.234
 stops-0.216
 opens-0.214
 unlocks-0.206
 falls-0.204
Tokens
Token ID82
Feature activation+2.58e+00

Pos logit contributions
be+0.083
mine+0.076
Sus+0.074
ina+0.073
 se+0.073
Neg logit contributions
 dod-0.079
 opens-0.070
 stops-0.069
 unlocks-0.068
 pulls-0.065
Token outside
Token ID2354
Feature activation-2.62e+00

Pos logit contributions
be+0.108
mine+0.096
 especially+0.095
 se+0.094
Sus+0.092
Neg logit contributions
 dod-0.108
 opens-0.096
 unlocks-0.095
 stops-0.093
 pulls-0.089
Token.
Token ID13
Feature activation+2.55e-01

Pos logit contributions
 dod+0.096
 stops+0.083
 opens+0.082
 unlocks+0.078
 pulls+0.076
Neg logit contributions
be-0.124
Sus-0.119
mine-0.118
years-0.117
 se-0.112
Token dressed
Token ID12049
Feature activation+3.27e+00

Pos logit contributions
be+0.055
mine+0.053
Sus+0.052
ina+0.050
years+0.050
Neg logit contributions
 dod-0.049
 opens-0.045
 stops-0.045
 unlocks-0.042
 True-0.041
Token up
Token ID510
Feature activation-4.64e+00

Pos logit contributions
be+0.168
years+0.161
Sus+0.161
mine+0.158
 se+0.151
Neg logit contributions
 dod-0.103
 stops-0.091
 opens-0.091
 unlocks-0.084
 falls-0.081
Token in
Token ID287
Feature activation-1.30e+00

Pos logit contributions
 dod+0.203
 stops+0.175
 opens+0.168
 pulls+0.166
 unlocks+0.163
Neg logit contributions
Sus-0.192
be-0.190
years-0.187
mine-0.186
 especially-0.183
Token her
Token ID607
Feature activation+1.94e+00

Pos logit contributions
 dod+0.056
 stops+0.050
 opens+0.049
 unlocks+0.047
 falls+0.046
Neg logit contributions
be-0.053
Sus-0.051
mine-0.050
years-0.050
 se-0.048
Token fan
Token ID4336
Feature activation+4.17e+00

Pos logit contributions
 wedding+0.062
 se+0.058
 harbor+0.057
be+0.057
 especially+0.054
Neg logit contributions
 dod-0.107
 pulls-0.092
 Ready-0.090
 stops-0.089
 aims-0.088
Tokenci
Token ID979
Feature activation+1.08e+01

Pos logit contributions
be+0.202
Sus+0.198
years+0.197
mine+0.191
 se+0.183
Neg logit contributions
 dod-0.142
 opens-0.125
 stops-0.125
 unlocks-0.116
 falls-0.113
Tokenest
Token ID395
Feature activation+3.64e+00

Pos logit contributions
be+0.318
Sus+0.307
years+0.301
mine+0.288
 se+0.273
Neg logit contributions
 dod-0.578
 stops-0.541
 opens-0.537
 unlocks-0.512
 falls-0.504
Token clothes
Token ID8242
Feature activation+2.77e+00

Pos logit contributions
be+0.125
Sus+0.121
years+0.121
mine+0.118
ina+0.109
Neg logit contributions
 dod-0.177
 stops-0.162
 opens-0.161
 unlocks-0.152
 falls-0.150
Token and
Token ID290
Feature activation+5.62e-01

Pos logit contributions
be+0.143
years+0.143
Sus+0.142
mine+0.135
 se+0.129
Neg logit contributions
 dod-0.080
 stops-0.073
 opens-0.072
 unlocks-0.066
 falls-0.066
Token asked
Token ID1965
Feature activation+1.39e+00

Pos logit contributions
be+0.020
Sus+0.019
mine+0.019
years+0.019
ina+0.017
Neg logit contributions
 dod-0.027
 stops-0.024
 opens-0.024
 unlocks-0.023
 falls-0.023
Token her
Token ID607
Feature activation-2.55e+00

Pos logit contributions
years+0.041
mine+0.039
 especially+0.039
Sus+0.038
be+0.038
Neg logit contributions
 dod-0.078
 pulls-0.066
 stops-0.066
 falls-0.066
 opens-0.066
Token enough
Token ID1576
Feature activation+1.34e+00

Pos logit contributions
 dod+0.107
 unlocks+0.095
 opens+0.092
 stops+0.090
 pulls+0.088
Neg logit contributions
be-0.131
mine-0.118
years-0.117
Sus-0.117
 se-0.116
Token to
Token ID284
Feature activation+8.63e-01

Pos logit contributions
be+0.053
 se+0.050
 especially+0.047
Sus+0.047
 weekend+0.045
Neg logit contributions
 dod-0.058
 unlocks-0.051
 opens-0.049
ccoli-0.049
 pulls-0.049
Token scare
Token ID19437
Feature activation+1.56e+00

Pos logit contributions
be+0.038
Sus+0.036
mine+0.036
years+0.035
 se+0.034
Neg logit contributions
 dod-0.035
 stops-0.031
 opens-0.031
 unlocks-0.030
 falls-0.028
Token away
Token ID1497
Feature activation-1.55e+00

Pos logit contributions
mine+0.067
be+0.067
Sus+0.063
years+0.062
 se+0.061
Neg logit contributions
 dod-0.063
 stops-0.056
 opens-0.056
 unlocks-0.055
ccoli-0.054
Token the
Token ID262
Feature activation+8.15e-01

Pos logit contributions
 dod+0.060
 stops+0.052
 pulls+0.050
 opens+0.050
 unlocks+0.049
Neg logit contributions
Sus-0.068
be-0.068
mine-0.065
 se-0.064
ina-0.063
Token se
Token ID384
Feature activation+1.07e+01

Pos logit contributions
be+0.025
Sus+0.024
years+0.024
mine+0.024
ina+0.021
Neg logit contributions
 dod-0.043
 stops-0.039
 opens-0.039
 unlocks-0.037
 falls-0.037
Tokenag
Token ID363
Feature activation+4.26e+00

Pos logit contributions
Sus+0.321
be+0.321
years+0.317
mine+0.294
 se+0.290
Neg logit contributions
 dod-0.566
 stops-0.517
 opens-0.506
 falls-0.486
 unlocks-0.485
Tokenull
Token ID724
Feature activation+1.78e+00

Pos logit contributions
years+0.108
be+0.101
Sus+0.096
mine+0.089
 especially+0.085
Neg logit contributions
 dod-0.244
 stops-0.230
 opens-0.225
 falls-0.217
 unlocks-0.213
Tokens
Token ID82
Feature activation+2.63e+00

Pos logit contributions
be+0.083
mine+0.073
 se+0.070
h+0.070
line+0.069
Neg logit contributions
 dod-0.069
 opens-0.060
 unlocks-0.060
 stops-0.059
 pulls-0.056
Token!"
Token ID2474
Feature activation+3.14e+00

Pos logit contributions
be+0.112
mine+0.107
 se+0.099
 especially+0.098
Sus+0.098
Neg logit contributions
 dod-0.107
 opens-0.092
 unlocks-0.092
ccoli-0.089
 stops-0.088
Token
Token ID198
Feature activation+5.19e-01

Pos logit contributions
mine+0.106
Sus+0.101
be+0.093
 se+0.090
 especially+0.088
Neg logit contributions
 dod-0.163
 opens-0.145
 disappears-0.143
 unlocks-0.142
 stops-0.140
Token Tom
Token ID4186
Feature activation-1.29e+00

Pos logit contributions
Sus+0.042
be+0.042
 especially+0.042
line+0.040
mine+0.040
Neg logit contributions
 dod-0.041
 opens-0.037
 unlocks-0.035
ccoli-0.034
 disappears-0.033
Token said
Token ID531
Feature activation+1.45e+00

Pos logit contributions
 dod+0.058
 stops+0.051
 opens+0.050
 unlocks+0.048
 falls+0.048
Neg logit contributions
be-0.051
Sus-0.048
mine-0.047
years-0.046
 se-0.046
Token goodbye
Token ID24829
Feature activation-3.84e+00

Pos logit contributions
be+0.073
Sus+0.070
years+0.070
mine+0.069
 se+0.066
Neg logit contributions
 dod-0.049
 stops-0.042
 opens-0.042
 unlocks-0.039
 falls-0.038
Token to
Token ID284
Feature activation+2.38e+00

Pos logit contributions
 dod+0.168
 stops+0.151
 opens+0.150
 unlocks+0.142
 falls+0.140
Neg logit contributions
be-0.153
Sus-0.147
years-0.145
mine-0.144
 se-0.136
Token the
Token ID262
Feature activation+1.35e+00

Pos logit contributions
be+0.112
Sus+0.108
years+0.106
mine+0.106
 se+0.101
Neg logit contributions
 dod-0.088
 stops-0.077
 opens-0.076
 unlocks-0.072
 falls-0.070
Token se
Token ID384
Feature activation+1.05e+01

Pos logit contributions
years+0.029
mine+0.026
be+0.024
year+0.024
Sus+0.023
Neg logit contributions
 dod-0.081
 stops-0.079
 opens-0.074
 falls-0.071
 unlocks-0.071
Tokenag
Token ID363
Feature activation+3.84e+00

Pos logit contributions
be+0.320
years+0.313
Sus+0.312
mine+0.291
 se+0.289
Neg logit contributions
 dod-0.552
 stops-0.506
 opens-0.493
 unlocks-0.474
 falls-0.472
Tokenull
Token ID724
Feature activation+1.22e+00

Pos logit contributions
years+0.106
be+0.096
Sus+0.090
mine+0.089
 especially+0.082
Neg logit contributions
 dod-0.206
 stops-0.196
 opens-0.186
 falls-0.180
 unlocks-0.178
Tokens
Token ID82
Feature activation+2.42e+00

Pos logit contributions
be+0.056
Sus+0.053
mine+0.053
years+0.052
ina+0.051
Neg logit contributions
 dod-0.048
 stops-0.041
 opens-0.041
 unlocks-0.039
 pulls-0.038
Token and
Token ID290
Feature activation-6.51e-01

Pos logit contributions
be+0.112
mine+0.108
Sus+0.108
years+0.104
 se+0.103
Neg logit contributions
 dod-0.093
 opens-0.079
 stops-0.079
 unlocks-0.075
 pulls-0.073
Token put
Token ID1234
Feature activation+2.23e+00

Pos logit contributions
 dod+0.033
 stops+0.029
 opens+0.029
 unlocks+0.028
 falls+0.027
Neg logit contributions
be-0.022
Sus-0.021
 se-0.020
mine-0.020
years-0.020
Token very
Token ID845
Feature activation+3.45e+00

Pos logit contributions
 dod+0.053
 opens+0.045
 stops+0.045
 unlocks+0.044
 pulls+0.043
Neg logit contributions
be-0.058
Sus-0.056
 se-0.055
mine-0.053
years-0.053
Token special
Token ID2041
Feature activation+2.16e+00

Pos logit contributions
be+0.156
Sus+0.151
years+0.148
mine+0.147
 se+0.141
Neg logit contributions
 dod-0.134
 stops-0.117
 opens-0.116
 unlocks-0.110
 falls-0.108
Token toy
Token ID13373
Feature activation+2.86e-01

Pos logit contributions
be+0.087
Sus+0.084
mine+0.081
 se+0.080
years+0.079
Neg logit contributions
 dod-0.095
 stops-0.083
 opens-0.082
 unlocks-0.079
 falls-0.078
Token for
Token ID329
Feature activation+1.99e+00

Pos logit contributions
be+0.013
Sus+0.012
mine+0.012
years+0.012
 se+0.012
Neg logit contributions
 dod-0.011
 stops-0.010
 opens-0.010
 unlocks-0.009
 pulls-0.009
Token L
Token ID406
Feature activation+4.79e+00

Pos logit contributions
be+0.090
bowl+0.085
mine+0.083
Sus+0.081
line+0.078
Neg logit contributions
 dod-0.068
 opens-0.061
 stops-0.058
 True-0.057
 unlocks-0.056
Tokenott
Token ID1252
Feature activation+1.04e+01

Pos logit contributions
be+0.086
mine+0.067
years+0.062
 especially+0.059
 se+0.057
Neg logit contributions
 dod-0.295
 opens-0.288
 unlocks-0.288
 stops-0.282
 falls-0.274
Tokenie
Token ID494
Feature activation+2.26e+00

Pos logit contributions
be+0.229
years+0.211
Sus+0.203
 se+0.196
mine+0.188
Neg logit contributions
 dod-0.608
 stops-0.585
 opens-0.578
 unlocks-0.567
ccoli-0.555
Token.
Token ID13
Feature activation+8.45e-01

Pos logit contributions
be+0.107
Sus+0.102
mine+0.101
years+0.101
 se+0.097
Neg logit contributions
 dod-0.083
 stops-0.072
 opens-0.072
 unlocks-0.068
 falls-0.066
Token It
Token ID632
Feature activation+1.12e+00

Pos logit contributions
be+0.037
Sus+0.036
mine+0.036
years+0.035
 se+0.034
Neg logit contributions
 dod-0.033
 stops-0.030
 opens-0.030
 unlocks-0.028
 falls-0.027
Token was
Token ID373
Feature activation+4.97e-01

Pos logit contributions
be+0.045
Sus+0.044
mine+0.043
years+0.042
 se+0.041
Neg logit contributions
 dod-0.049
 stops-0.043
 opens-0.043
 unlocks-0.041
 falls-0.040
Token a
Token ID257
Feature activation-2.48e+00

Pos logit contributions
be+0.019
Sus+0.018
years+0.018
mine+0.018
 se+0.017
Neg logit contributions
 dod-0.023
 stops-0.021
 opens-0.020
 unlocks-0.019
 falls-0.019
Token the
Token ID262
Feature activation+1.21e+00

Pos logit contributions
 dod+0.137
 stops+0.122
 opens+0.121
 unlocks+0.117
 falls+0.115
Neg logit contributions
be-0.106
Sus-0.103
years-0.100
mine-0.099
 se-0.096
Token d
Token ID288
Feature activation+1.45e+00

Pos logit contributions
be+0.041
Sus+0.039
years+0.038
mine+0.038
 se+0.036
Neg logit contributions
 dod-0.061
 stops-0.055
 opens-0.055
 unlocks-0.052
 pulls-0.051
Tokenunes
Token ID4015
Feature activation+9.51e-01

Pos logit contributions
be+0.042
Sus+0.040
years+0.040
mine+0.040
 especially+0.036
Neg logit contributions
 dod-0.078
 stops-0.072
 opens-0.071
 falls-0.068
 unlocks-0.067
Token to
Token ID284
Feature activation+1.11e+00

Pos logit contributions
be+0.046
mine+0.044
Sus+0.044
years+0.043
 se+0.042
Neg logit contributions
 dod-0.034
 stops-0.029
 opens-0.029
 unlocks-0.027
 pulls-0.026
Token the
Token ID262
Feature activation+7.79e-01

Pos logit contributions
be+0.047
Sus+0.046
mine+0.045
years+0.044
 se+0.043
Neg logit contributions
 dod-0.047
 stops-0.041
 opens-0.040
 unlocks-0.039
 falls-0.038
Token se
Token ID384
Feature activation+1.04e+01

Pos logit contributions
be+0.024
Sus+0.022
 se+0.022
mine+0.021
years+0.021
Neg logit contributions
 dod-0.042
 opens-0.038
 stops-0.037
 unlocks-0.037
 pulls-0.036
Tokenash
Token ID1077
Feature activation+2.79e+00

Pos logit contributions
years+0.343
Sus+0.337
be+0.331
 se+0.320
mine+0.312
Neg logit contributions
 dod-0.516
 stops-0.469
 opens-0.457
 falls-0.440
 unlocks-0.436
Tokenore
Token ID382
Feature activation+1.81e+00

Pos logit contributions
be+0.096
years+0.095
Sus+0.091
mine+0.088
 se+0.083
Neg logit contributions
 dod-0.132
 stops-0.122
 opens-0.120
 falls-0.116
 unlocks-0.115
Token,
Token ID11
Feature activation+1.62e+00

Pos logit contributions
be+0.089
years+0.086
Sus+0.086
mine+0.085
 se+0.081
Neg logit contributions
 dod-0.062
 stops-0.055
 opens-0.054
 unlocks-0.050
 falls-0.050
Token she
Token ID673
Feature activation+9.73e-01

Pos logit contributions
be+0.063
Sus+0.060
years+0.058
mine+0.058
 se+0.057
Neg logit contributions
 dod-0.073
 stops-0.064
 opens-0.064
 unlocks-0.062
 falls-0.060
Token could
Token ID714
Feature activation+1.24e+00

Pos logit contributions
be+0.033
Sus+0.032
mine+0.031
years+0.031
 se+0.029
Neg logit contributions
 dod-0.048
 stops-0.044
 opens-0.044
 unlocks-0.042
 falls-0.041
Token caught
Token ID4978
Feature activation+4.91e-01

Pos logit contributions
be+0.009
Sus+0.009
mine+0.009
years+0.009
 se+0.008
Neg logit contributions
 dod-0.015
 stops-0.013
 opens-0.013
 unlocks-0.013
 pulls-0.012
Token the
Token ID262
Feature activation-1.14e-01

Pos logit contributions
Sus+0.017
mine+0.017
be+0.016
years+0.016
ina+0.016
Neg logit contributions
 dod-0.024
 stops-0.022
 opens-0.021
 pulls-0.021
 unlocks-0.021
Token attention
Token ID3241
Feature activation-1.76e+00

Pos logit contributions
 dod+0.006
 opens+0.005
 unlocks+0.005
 stops+0.005
 pulls+0.005
Neg logit contributions
be-0.004
 se-0.003
Sus-0.003
 especially-0.003
mine-0.003
Token of
Token ID286
Feature activation+7.75e-01

Pos logit contributions
 dod+0.083
 stops+0.075
 opens+0.075
 unlocks+0.071
 falls+0.070
Neg logit contributions
be-0.064
Sus-0.061
years-0.060
mine-0.060
 se-0.056
Token a
Token ID257
Feature activation-3.45e-01

Pos logit contributions
be+0.035
Sus+0.033
years+0.032
mine+0.032
ina+0.030
Neg logit contributions
 dod-0.030
 stops-0.027
 opens-0.027
 unlocks-0.025
 pulls-0.025
Token se
Token ID384
Feature activation+1.03e+01

Pos logit contributions
 dod+0.015
 opens+0.014
 unlocks+0.014
 stops+0.014
 pulls+0.014
Neg logit contributions
be-0.014
Sus-0.013
 se-0.013
mine-0.012
years-0.012
Tokenag
Token ID363
Feature activation+4.25e+00

Pos logit contributions
years+0.340
Sus+0.337
be+0.336
 se+0.314
mine+0.309
Neg logit contributions
 dod-0.507
 stops-0.458
 opens-0.447
 falls-0.434
 unlocks-0.426
Tokenull
Token ID724
Feature activation+1.44e+00

Pos logit contributions
years+0.117
be+0.104
Sus+0.098
mine+0.092
 especially+0.089
Neg logit contributions
 dod-0.233
 stops-0.223
 opens-0.215
 falls-0.207
 unlocks-0.203
Token nearby
Token ID6716
Feature activation+2.33e-01

Pos logit contributions
be+0.063
Sus+0.060
mine+0.059
years+0.058
 se+0.056
Neg logit contributions
 dod-0.058
 stops-0.051
 opens-0.051
 unlocks-0.048
 pulls-0.047
Token.
Token ID13
Feature activation+1.58e-01

Pos logit contributions
be+0.011
mine+0.010
Sus+0.010
years+0.010
 se+0.010
Neg logit contributions
 dod-0.009
 stops-0.007
 opens-0.007
 unlocks-0.007
 pulls-0.007
Token The
Token ID383
Feature activation-2.51e-01

Pos logit contributions
Sus+0.006
be+0.006
years+0.006
mine+0.005
 especially+0.005
Neg logit contributions
 dod-0.007
 stops-0.006
 opens-0.006
 disappears-0.006
 falls-0.006
Token One
Token ID1881
Feature activation+1.46e-01

Pos logit contributions
 dod+0.007
 stops+0.006
 opens+0.006
 unlocks+0.006
 falls+0.006
Neg logit contributions
Sus-0.007
be-0.007
mine-0.007
years-0.007
 especially-0.006
Token day
Token ID1110
Feature activation+1.12e-01

Pos logit contributions
years+0.005
mine+0.005
Sus+0.005
be+0.005
rol+0.004
Neg logit contributions
 dod-0.007
 stops-0.007
 opens-0.007
 falls-0.006
 unlocks-0.006
Token a
Token ID257
Feature activation-1.95e+00

Pos logit contributions
Sus+0.004
be+0.004
mine+0.004
 se+0.004
ina+0.004
Neg logit contributions
 dod-0.005
 unlocks-0.004
 opens-0.004
ccoli-0.004
 stops-0.004
Token trou
Token ID4057
Feature activation+3.41e+00

Pos logit contributions
 dod+0.075
 opens+0.066
 stops+0.066
 unlocks+0.063
 pulls+0.061
Neg logit contributions
be-0.088
Sus-0.085
years-0.082
 se-0.081
mine-0.081
Tokenl
Token ID75
Feature activation+6.00e-01

Pos logit contributions
be+0.131
years+0.126
mine+0.123
Sus+0.117
 se+0.115
Neg logit contributions
 dod-0.145
 opens-0.132
 stops-0.131
 unlocks-0.127
 falls-0.120
Tokenbe
Token ID1350
Feature activation+1.02e+01

Pos logit contributions
be+0.021
mine+0.020
Sus+0.020
years+0.019
ina+0.019
Neg logit contributions
 dod-0.030
 opens-0.026
 stops-0.026
 unlocks-0.025
 pulls-0.025
Token came
Token ID1625
Feature activation-3.60e-01

Pos logit contributions
years+0.487
Sus+0.469
mine+0.465
 wedding+0.449
line+0.448
Neg logit contributions
 dod-0.313
 stops-0.300
 falls-0.285
 opens-0.277
 pulls-0.275
Token to
Token ID284
Feature activation-8.14e-01

Pos logit contributions
 dod+0.017
 stops+0.015
 opens+0.015
 falls+0.014
 pulls+0.014
Neg logit contributions
be-0.014
Sus-0.013
ina-0.013
years-0.012
mine-0.012
Token the
Token ID262
Feature activation-9.82e-01

Pos logit contributions
 dod+0.030
 stops+0.026
 opens+0.026
 unlocks+0.024
 falls+0.024
Neg logit contributions
be-0.038
Sus-0.037
mine-0.037
years-0.036
 se-0.035
Token bridge
Token ID7696
Feature activation+1.34e+00

Pos logit contributions
 dod+0.050
 stops+0.046
 opens+0.046
 unlocks+0.044
 falls+0.043
Neg logit contributions
be-0.032
Sus-0.030
years-0.029
mine-0.029
 se-0.028
Token.
Token ID13
Feature activation+5.84e-02

Pos logit contributions
be+0.063
Sus+0.061
mine+0.060
years+0.060
 se+0.057
Neg logit contributions
 dod-0.049
 stops-0.043
 opens-0.043
 unlocks-0.040
 falls-0.039
Token and
Token ID290
Feature activation+3.33e-01

Pos logit contributions
be+0.010
mine+0.009
Sus+0.009
 se+0.008
years+0.008
Neg logit contributions
 dod-0.016
 opens-0.014
 stops-0.014
 unlocks-0.013
 falls-0.013
Token Jacob
Token ID12806
Feature activation-3.66e-01

Pos logit contributions
be+0.014
mine+0.014
Sus+0.013
years+0.013
 3+0.013
Neg logit contributions
 dod-0.014
 unlocks-0.011
 stops-0.011
 falls-0.011
 opens-0.011
Token arrived
Token ID5284
Feature activation-5.12e-01

Pos logit contributions
 dod+0.019
 stops+0.017
 opens+0.017
 unlocks+0.016
 falls+0.016
Neg logit contributions
be-0.012
Sus-0.011
years-0.011
mine-0.011
 se-0.010
Token at
Token ID379
Feature activation-2.25e-01

Pos logit contributions
 dod+0.022
 opens+0.019
 stops+0.019
 unlocks+0.018
 pulls+0.017
Neg logit contributions
be-0.022
Sus-0.021
years-0.020
mine-0.020
 se-0.020
Token the
Token ID262
Feature activation+9.97e-01

Pos logit contributions
 dod+0.010
 stops+0.008
 opens+0.008
 unlocks+0.008
 falls+0.008
Neg logit contributions
be-0.009
mine-0.009
Sus-0.009
years-0.009
 se-0.009
Token se
Token ID384
Feature activation+1.02e+01

Pos logit contributions
be+0.030
Sus+0.029
years+0.028
mine+0.028
ina+0.026
Neg logit contributions
 dod-0.053
 stops-0.049
 opens-0.048
 unlocks-0.046
 falls-0.046
Tokenash
Token ID1077
Feature activation+2.96e+00

Pos logit contributions
be+0.307
Sus+0.304
years+0.299
mine+0.284
 se+0.279
Neg logit contributions
 dod-0.535
 stops-0.485
 opens-0.479
 unlocks-0.461
 falls-0.455
Tokenore
Token ID382
Feature activation+2.12e+00

Pos logit contributions
be+0.106
Sus+0.101
mine+0.099
years+0.099
 se+0.093
Neg logit contributions
 dod-0.143
 stops-0.129
 opens-0.128
 unlocks-0.122
 falls-0.120
Token,
Token ID11
Feature activation+2.33e+00

Pos logit contributions
be+0.099
Sus+0.094
mine+0.093
years+0.092
ina+0.090
Neg logit contributions
 dod-0.081
 stops-0.068
 opens-0.067
 pulls-0.064
 unlocks-0.064
Token a
Token ID257
Feature activation-1.27e+00

Pos logit contributions
be+0.086
 especially+0.084
Sus+0.081
 se+0.080
years+0.079
Neg logit contributions
 dod-0.106
 unlocks-0.091
 opens-0.090
 stops-0.088
 pulls-0.086
Token big
Token ID1263
Feature activation-3.64e-01

Pos logit contributions
 dod+0.049
 stops+0.043
 opens+0.042
 unlocks+0.039
 falls+0.039
Neg logit contributions
be-0.057
mine-0.056
Sus-0.056
years-0.055
ina-0.052
Token and
Token ID290
Feature activation-1.26e-01

Pos logit contributions
be+0.053
Sus+0.051
years+0.051
mine+0.051
ina+0.048
Neg logit contributions
 dod-0.032
 stops-0.028
 opens-0.027
 unlocks-0.025
 pulls-0.025
Token said
Token ID531
Feature activation+1.60e+00

Pos logit contributions
 dod+0.005
 stops+0.004
 opens+0.004
 unlocks+0.004
 falls+0.004
Neg logit contributions
be-0.006
years-0.005
Sus-0.005
mine-0.005
ina-0.005
Token goodbye
Token ID24829
Feature activation-3.88e+00

Pos logit contributions
be+0.080
Sus+0.077
years+0.076
mine+0.076
ina+0.073
Neg logit contributions
 dod-0.054
 stops-0.048
 opens-0.047
 unlocks-0.043
 falls-0.043
Token to
Token ID284
Feature activation+2.34e+00

Pos logit contributions
 dod+0.163
 stops+0.144
 opens+0.143
 unlocks+0.137
 pulls+0.134
Neg logit contributions
be-0.164
Sus-0.157
mine-0.153
years-0.153
 se-0.146
Token her
Token ID607
Feature activation+7.60e-01

Pos logit contributions
be+0.092
Sus+0.090
mine+0.089
years+0.089
 se+0.085
Neg logit contributions
 dod-0.103
 stops-0.092
 opens-0.091
 unlocks-0.087
 falls-0.085
Token se
Token ID384
Feature activation+1.00e+01

Pos logit contributions
 se+0.027
be+0.025
 harbor+0.025
 wedding+0.023
 especially+0.023
Neg logit contributions
 dod-0.039
 opens-0.034
 falls-0.033
 stops-0.033
 pulls-0.033
Tokenag
Token ID363
Feature activation+3.47e+00

Pos logit contributions
be+0.309
Sus+0.304
years+0.300
 se+0.291
line+0.283
Neg logit contributions
 dod-0.514
 stops-0.469
 opens-0.463
 unlocks-0.448
 falls-0.440
Tokenull
Token ID724
Feature activation+1.36e+00

Pos logit contributions
be+0.103
years+0.102
Sus+0.095
mine+0.093
 especially+0.086
Neg logit contributions
 dod-0.180
 stops-0.168
 opens-0.165
 unlocks-0.158
 falls-0.156
Token friends
Token ID2460
Feature activation+1.72e+00

Pos logit contributions
be+0.064
Sus+0.061
mine+0.061
years+0.060
ina+0.058
Neg logit contributions
 dod-0.051
 stops-0.044
 opens-0.044
 unlocks-0.041
 pulls-0.040
Token.
Token ID13
Feature activation+1.90e-01

Pos logit contributions
be+0.086
Sus+0.084
years+0.083
mine+0.082
 se+0.079
Neg logit contributions
 dod-0.056
 stops-0.049
 opens-0.048
 unlocks-0.045
 falls-0.044
Token She
Token ID1375
Feature activation+6.93e-02

Pos logit contributions
be+0.008
Sus+0.008
mine+0.007
years+0.007
 se+0.007
Neg logit contributions
 dod-0.008
 stops-0.007
 opens-0.007
 unlocks-0.007
 falls-0.007
Tokenapped
Token ID6320
Feature activation+1.60e+00

Pos logit contributions
be+0.070
Sus+0.069
years+0.066
mine+0.066
 se+0.066
Neg logit contributions
 dod-0.152
 opens-0.138
 stops-0.137
 pulls-0.132
 unlocks-0.131
Token together
Token ID1978
Feature activation+8.39e-01

Pos logit contributions
be+0.070
Sus+0.068
mine+0.066
years+0.066
 se+0.063
Neg logit contributions
 dod-0.064
 stops-0.056
 opens-0.056
 unlocks-0.053
 falls-0.052
Token until
Token ID1566
Feature activation-5.43e-01

Pos logit contributions
be+0.039
mine+0.036
Sus+0.036
 se+0.035
years+0.035
Neg logit contributions
 dod-0.032
 unlocks-0.028
 stops-0.028
 opens-0.028
 pulls-0.026
Token all
Token ID477
Feature activation+2.57e+00

Pos logit contributions
 dod+0.020
 stops+0.017
 opens+0.017
 unlocks+0.016
 falls+0.016
Neg logit contributions
be-0.025
Sus-0.025
mine-0.024
years-0.024
 se-0.023
Token the
Token ID262
Feature activation-2.55e-01

Pos logit contributions
be+0.121
Sus+0.116
years+0.115
mine+0.114
 se+0.107
Neg logit contributions
 dod-0.094
 stops-0.085
 opens-0.085
 unlocks-0.078
 falls-0.077
Token se
Token ID384
Feature activation+9.99e+00

Pos logit contributions
 dod+0.012
 unlocks+0.011
 opens+0.011
 stops+0.011
 pulls+0.011
Neg logit contributions
be-0.009
Sus-0.008
 se-0.008
mine-0.008
years-0.008
Tokenag
Token ID363
Feature activation+3.51e+00

Pos logit contributions
be+0.321
Sus+0.320
years+0.317
line+0.314
 se+0.312
Neg logit contributions
 dod-0.497
 stops-0.434
 opens-0.430
 pulls-0.419
 unlocks-0.418
Tokenull
Token ID724
Feature activation+1.37e+00

Pos logit contributions
be+0.093
years+0.089
Sus+0.087
mine+0.084
 se+0.080
Neg logit contributions
 dod-0.195
 stops-0.182
 opens-0.179
 unlocks-0.171
 falls-0.170
Tokens
Token ID82
Feature activation+3.05e+00

Pos logit contributions
be+0.050
mine+0.045
 se+0.045
Sus+0.044
ina+0.042
Neg logit contributions
 dod-0.065
 opens-0.059
 stops-0.058
 unlocks-0.057
 falls-0.055
Token flew
Token ID13112
Feature activation-1.31e+00

Pos logit contributions
be+0.119
Sus+0.112
mine+0.111
years+0.109
 se+0.106
Neg logit contributions
 dod-0.137
 opens-0.123
 stops-0.122
 unlocks-0.119
 pulls-0.114
Token away
Token ID1497
Feature activation-1.38e+00

Pos logit contributions
 dod+0.055
 opens+0.048
 stops+0.048
 unlocks+0.046
 pulls+0.046
Neg logit contributions
be-0.056
Sus-0.054
mine-0.053
years-0.051
 se-0.050
Token sorry
Token ID7926
Feature activation-3.34e+00

Pos logit contributions
be+0.131
Sus+0.126
years+0.124
mine+0.121
 se+0.120
Neg logit contributions
 dod-0.125
 stops-0.112
 opens-0.111
 unlocks-0.106
 falls-0.102
Token,"
Token ID553
Feature activation+2.73e+00

Pos logit contributions
 dod+0.132
 opens+0.113
 unlocks+0.110
 stops+0.110
 pulls+0.109
Neg logit contributions
be-0.148
Sus-0.145
ina-0.136
 se-0.135
 especially-0.134
Token the
Token ID262
Feature activation-6.71e-01

Pos logit contributions
be+0.105
Sus+0.102
mine+0.100
years+0.099
 se+0.097
Neg logit contributions
 dod-0.122
 stops-0.109
 opens-0.108
 unlocks-0.103
 falls-0.101
Token wild
Token ID4295
Feature activation-8.41e-01

Pos logit contributions
 dod+0.027
 stops+0.024
 opens+0.024
 unlocks+0.023
 falls+0.022
Neg logit contributions
be-0.029
Sus-0.028
years-0.028
mine-0.028
ina-0.026
Tokener
Token ID263
Feature activation+3.82e+00

Pos logit contributions
 dod+0.039
 opens+0.035
 stops+0.035
 unlocks+0.034
 falls+0.033
Neg logit contributions
be-0.032
Sus-0.030
mine-0.029
years-0.029
 se-0.028
Tokenbe
Token ID1350
Feature activation+9.95e+00

Pos logit contributions
years+0.154
be+0.146
Sus+0.144
mine+0.141
 especially+0.137
Neg logit contributions
 dod-0.158
 stops-0.145
 opens-0.139
 pulls-0.133
 unlocks-0.131
Tokenast
Token ID459
Feature activation+4.88e-01

Pos logit contributions
years+0.380
Sus+0.372
be+0.365
mine+0.357
line+0.343
Neg logit contributions
 dod-0.431
 stops-0.398
 opens-0.386
 falls-0.379
 pulls-0.377
Token said
Token ID531
Feature activation+3.70e+00

Pos logit contributions
be+0.019
mine+0.018
Sus+0.017
ina+0.016
 se+0.016
Neg logit contributions
 dod-0.023
 opens-0.020
 stops-0.020
 unlocks-0.019
 True-0.019
Token.
Token ID13
Feature activation-8.89e-01

Pos logit contributions
be+0.171
Sus+0.164
mine+0.162
years+0.159
 se+0.156
Neg logit contributions
 dod-0.141
 stops-0.122
 opens-0.122
 unlocks-0.117
 pulls-0.113
Token "
Token ID366
Feature activation-3.49e-01

Pos logit contributions
 dod+0.038
 opens+0.034
 stops+0.033
 unlocks+0.032
 falls+0.032
Neg logit contributions
Sus-0.036
be-0.035
 se-0.033
mine-0.033
line-0.032
TokenBut
Token ID1537
Feature activation-1.18e+00

Pos logit contributions
 dod+0.016
 stops+0.015
 opens+0.014
 falls+0.014
 pulls+0.014
Neg logit contributions
be-0.012
years-0.012
Sus-0.011
mine-0.011
ina-0.011
Token his
Token ID465
Feature activation-1.00e+00

Pos logit contributions
be+0.046
Sus+0.045
mine+0.044
years+0.044
ina+0.041
Neg logit contributions
 dod-0.041
 stops-0.036
 opens-0.036
 unlocks-0.034
 pulls-0.033
Token cars
Token ID5006
Feature activation-1.67e+00

Pos logit contributions
 dod+0.046
 opens+0.040
 True+0.040
ccoli+0.040
 Patch+0.039
Neg logit contributions
be-0.040
 se-0.037
Sus-0.033
 harbor-0.033
 especially-0.032
Token away
Token ID1497
Feature activation-1.77e+00

Pos logit contributions
 dod+0.061
 stops+0.054
 opens+0.053
 unlocks+0.050
 falls+0.049
Neg logit contributions
be-0.078
Sus-0.075
years-0.074
mine-0.074
 se-0.071
Token from
Token ID422
Feature activation-2.11e+00

Pos logit contributions
 dod+0.074
 stops+0.063
 opens+0.063
 pulls+0.061
 unlocks+0.059
Neg logit contributions
be-0.078
Sus-0.074
 especially-0.073
mine-0.073
 se-0.073
Token the
Token ID262
Feature activation+9.74e-01

Pos logit contributions
 dod+0.087
 stops+0.077
 opens+0.076
 unlocks+0.072
 pulls+0.070
Neg logit contributions
be-0.091
Sus-0.087
years-0.087
mine-0.086
 se-0.082
Token se
Token ID384
Feature activation+9.95e+00

Pos logit contributions
be+0.030
Sus+0.027
 se+0.027
mine+0.027
years+0.026
Neg logit contributions
 dod-0.052
 opens-0.048
 stops-0.048
 unlocks-0.046
 pulls-0.045
Tokenag
Token ID363
Feature activation+3.88e+00

Pos logit contributions
Sus+0.303
be+0.302
years+0.297
mine+0.285
 se+0.279
Neg logit contributions
 dod-0.519
 stops-0.469
 opens-0.458
 unlocks-0.445
 falls-0.443
Tokenull
Token ID724
Feature activation+1.77e+00

Pos logit contributions
years+0.095
be+0.086
Sus+0.084
mine+0.080
 especially+0.071
Neg logit contributions
 dod-0.222
 stops-0.210
 opens-0.203
 falls-0.197
 unlocks-0.196
Tokens
Token ID82
Feature activation+2.64e+00

Pos logit contributions
be+0.076
mine+0.071
Sus+0.070
years+0.069
 se+0.067
Neg logit contributions
 dod-0.074
 opens-0.066
 stops-0.066
 unlocks-0.063
 pulls-0.061
Token so
Token ID523
Feature activation+2.99e+00

Pos logit contributions
be+0.120
mine+0.115
Sus+0.113
years+0.113
 se+0.109
Neg logit contributions
 dod-0.102
 opens-0.089
 stops-0.088
 unlocks-0.084
 pulls-0.082
Token they
Token ID484
Feature activation+2.32e+00

Pos logit contributions
be+0.104
years+0.098
 se+0.097
Sus+0.094
mine+0.091
Neg logit contributions
 dod-0.148
 opens-0.132
 stops-0.130
 unlocks-0.125
 pulls-0.123
Token charging
Token ID11642
Feature activation-1.26e+00

Pos logit contributions
be+0.016
Sus+0.015
mine+0.015
years+0.015
ina+0.014
Neg logit contributions
 dod-0.017
 stops-0.015
 opens-0.015
 pulls-0.014
 unlocks-0.014
Token at
Token ID379
Feature activation-9.03e-01

Pos logit contributions
 dod+0.054
 stops+0.048
 unlocks+0.047
 opens+0.047
ccoli+0.046
Neg logit contributions
be-0.052
years-0.049
Sus-0.048
mine-0.048
 se-0.046
Token the
Token ID262
Feature activation+4.55e-01

Pos logit contributions
 dod+0.041
 stops+0.036
 opens+0.036
 unlocks+0.035
 falls+0.034
Neg logit contributions
be-0.034
years-0.034
mine-0.033
Sus-0.032
 se-0.032
Token wild
Token ID4295
Feature activation+4.99e-01

Pos logit contributions
be+0.018
Sus+0.017
mine+0.017
years+0.017
 se+0.016
Neg logit contributions
 dod-0.020
 stops-0.018
 opens-0.018
 unlocks-0.018
 pulls-0.017
Tokene
Token ID68
Feature activation+2.80e+00

Pos logit contributions
be+0.019
Sus+0.018
mine+0.018
 se+0.018
ina+0.018
Neg logit contributions
 dod-0.023
 opens-0.020
 stops-0.020
 unlocks-0.019
 pulls-0.019
Tokenbe
Token ID1350
Feature activation+9.92e+00

Pos logit contributions
be+0.115
mine+0.104
 se+0.100
Sus+0.100
ina+0.098
Neg logit contributions
 dod-0.121
 unlocks-0.111
 opens-0.109
 stops-0.106
 closes-0.103
Tokenest
Token ID395
Feature activation+8.37e-01

Pos logit contributions
Sus+0.376
years+0.375
be+0.357
mine+0.356
 se+0.344
Neg logit contributions
 dod-0.440
 stops-0.394
 opens-0.383
 falls-0.375
 pulls-0.374
Token,
Token ID11
Feature activation+1.44e+00

Pos logit contributions
be+0.037
 se+0.033
mine+0.033
Sus+0.033
ina+0.032
Neg logit contributions
 dod-0.034
 opens-0.030
 unlocks-0.030
 stops-0.030
 pulls-0.028
Token trying
Token ID2111
Feature activation+1.17e+00

Pos logit contributions
be+0.058
Sus+0.054
 se+0.054
years+0.053
 especially+0.053
Neg logit contributions
 dod-0.062
 stops-0.055
 opens-0.055
 unlocks-0.054
 pulls-0.051
Token to
Token ID284
Feature activation-4.33e-01

Pos logit contributions
be+0.036
 especially+0.033
 se+0.032
Sus+0.032
mine+0.031
Neg logit contributions
 dod-0.061
 opens-0.055
 unlocks-0.054
 stops-0.054
 pulls-0.052
Token make
Token ID787
Feature activation+4.04e+00

Pos logit contributions
 dod+0.017
 stops+0.015
 opens+0.015
 unlocks+0.014
 falls+0.014
Neg logit contributions
be-0.019
mine-0.019
Sus-0.018
years-0.018
 se-0.018
Token Tommy
Token ID19919
Feature activation-1.01e+00

Pos logit contributions
 dod+0.000
ccoli+0.000
 opens+0.000
 unlocks+0.000
 disappears+0.000
Neg logit contributions
Sus-0.000
be-0.000
line-0.000
mine-0.000
 especially-0.000
Token laughed
Token ID13818
Feature activation-1.05e+00

Pos logit contributions
 dod+0.050
 opens+0.044
 stops+0.044
 unlocks+0.043
 falls+0.042
Neg logit contributions
be-0.036
mine-0.033
Sus-0.032
years-0.032
 se-0.031
Token at
Token ID379
Feature activation-4.15e-01

Pos logit contributions
 dod+0.037
 opens+0.032
 stops+0.032
 unlocks+0.030
 pulls+0.029
Neg logit contributions
be-0.051
Sus-0.050
mine-0.049
years-0.049
 se-0.047
Token the
Token ID262
Feature activation+5.41e-01

Pos logit contributions
 dod+0.016
 opens+0.013
 stops+0.013
 unlocks+0.013
 falls+0.012
Neg logit contributions
be-0.019
Sus-0.019
mine-0.019
years-0.019
 se-0.018
Token silly
Token ID14397
Feature activation+3.78e+00

Pos logit contributions
years+0.029
rol+0.027
year+0.027
Sus+0.027
days+0.026
Neg logit contributions
 stops-0.018
 dod-0.017
 opens-0.014
 pulls-0.014
 falls-0.014
Token se
Token ID384
Feature activation+9.91e+00

Pos logit contributions
years+0.185
Sus+0.167
year+0.164
rol+0.160
mine+0.160
Neg logit contributions
 stops-0.140
 dod-0.140
 opens-0.124
 falls-0.115
 pulls-0.113
Tokenag
Token ID363
Feature activation+3.36e+00

Pos logit contributions
be+0.294
Sus+0.290
years+0.287
mine+0.267
 se+0.265
Neg logit contributions
 dod-0.529
 stops-0.483
 opens-0.472
 falls-0.457
 unlocks-0.456
Tokenull
Token ID724
Feature activation+1.27e+00

Pos logit contributions
years+0.086
be+0.081
Sus+0.079
mine+0.072
 especially+0.070
Neg logit contributions
 dod-0.188
 stops-0.180
 opens-0.171
 falls-0.168
 pulls-0.165
Token and
Token ID290
Feature activation+7.07e-02

Pos logit contributions
be+0.061
mine+0.057
Sus+0.057
years+0.056
 se+0.055
Neg logit contributions
 dod-0.046
 opens-0.040
 stops-0.040
 unlocks-0.039
 pulls-0.037
Token the
Token ID262
Feature activation+1.41e-01

Pos logit contributions
be+0.003
Sus+0.002
years+0.002
mine+0.002
ina+0.002
Neg logit contributions
 dod-0.003
 stops-0.003
 opens-0.003
 unlocks-0.003
 falls-0.003
Token man
Token ID582
Feature activation+3.91e-01

Pos logit contributions
years+0.005
Sus+0.004
mine+0.004
rol+0.004
year+0.004
Neg logit contributions
 dod-0.007
 stops-0.007
 opens-0.006
 falls-0.006
 pulls-0.006
Token.
Token ID13
Feature activation-9.91e-02

Pos logit contributions
be+0.042
years+0.041
Sus+0.041
mine+0.040
 se+0.038
Neg logit contributions
 dod-0.024
 stops-0.022
 opens-0.022
 unlocks-0.020
 falls-0.019
Token She
Token ID1375
Feature activation-1.25e-01

Pos logit contributions
 dod+0.005
 stops+0.004
 opens+0.004
 falls+0.004
 disappears+0.004
Neg logit contributions
be-0.003
Sus-0.003
years-0.003
mine-0.003
line-0.003
Token waved
Token ID26834
Feature activation-2.94e+00

Pos logit contributions
 dod+0.007
 stops+0.006
 opens+0.006
 unlocks+0.006
 falls+0.006
Neg logit contributions
be-0.004
Sus-0.003
 se-0.003
years-0.003
mine-0.003
Token to
Token ID284
Feature activation+3.88e-01

Pos logit contributions
 dod+0.136
 stops+0.121
 opens+0.120
 unlocks+0.116
 pulls+0.114
Neg logit contributions
be-0.111
Sus-0.106
years-0.105
mine-0.104
 se-0.099
Token the
Token ID262
Feature activation-8.28e-01

Pos logit contributions
mine+0.014
years+0.013
 se+0.013
Sus+0.013
 especially+0.013
Neg logit contributions
 dod-0.019
ccoli-0.016
 unlocks-0.016
 stops-0.016
 opens-0.016
Token se
Token ID384
Feature activation+9.89e+00

Pos logit contributions
 dod+0.044
 stops+0.040
 opens+0.040
 unlocks+0.038
 falls+0.037
Neg logit contributions
be-0.026
Sus-0.025
mine-0.024
years-0.024
 se-0.022
Tokenag
Token ID363
Feature activation+3.00e+00

Pos logit contributions
be+0.303
Sus+0.300
years+0.298
 se+0.277
mine+0.273
Neg logit contributions
 dod-0.513
 stops-0.471
 opens-0.465
 unlocks-0.448
 falls-0.444
Tokenull
Token ID724
Feature activation+1.21e+00

Pos logit contributions
years+0.085
be+0.076
Sus+0.073
please+0.068
 especially+0.068
Neg logit contributions
 dod-0.158
 stops-0.155
 opens-0.150
 falls-0.144
 unlocks-0.142
Tokens
Token ID82
Feature activation+2.44e+00

Pos logit contributions
be+0.055
mine+0.052
Sus+0.051
years+0.051
 se+0.050
Neg logit contributions
 dod-0.047
 stops-0.040
 opens-0.040
 unlocks-0.039
 pulls-0.038
Token and
Token ID290
Feature activation-2.71e-01

Pos logit contributions
be+0.122
Sus+0.118
mine+0.117
years+0.116
 se+0.111
Neg logit contributions
 dod-0.083
 stops-0.071
 opens-0.071
 unlocks-0.066
 falls-0.064
Token looked
Token ID3114
Feature activation+1.37e+00

Pos logit contributions
 dod+0.014
 opens+0.012
 stops+0.012
 unlocks+0.012
 falls+0.012
Neg logit contributions
be-0.008
 se-0.008
mine-0.008
Sus-0.008
years-0.008
Token rope
Token ID17182
Feature activation+1.28e+00

Pos logit contributions
be+0.043
Sus+0.042
years+0.042
mine+0.042
ina+0.039
Neg logit contributions
 dod-0.031
 stops-0.027
 opens-0.027
 unlocks-0.025
 falls-0.025
Token and
Token ID290
Feature activation+1.51e-03

Pos logit contributions
be+0.058
mine+0.056
Sus+0.055
years+0.053
 se+0.053
Neg logit contributions
 dod-0.049
 stops-0.041
 opens-0.041
 unlocks-0.039
 pulls-0.038
Token the
Token ID262
Feature activation+1.54e-01

Pos logit contributions
be+0.000
Sus+0.000
mine+0.000
years+0.000
 se+0.000
Neg logit contributions
 dod-0.000
 stops-0.000
 opens-0.000
 unlocks-0.000
 falls-0.000
Token wild
Token ID4295
Feature activation-3.63e-01

Pos logit contributions
be+0.007
Sus+0.007
years+0.007
mine+0.007
ina+0.006
Neg logit contributions
 dod-0.006
 stops-0.005
 opens-0.005
 unlocks-0.005
 falls-0.005
Tokener
Token ID263
Feature activation+4.08e+00

Pos logit contributions
 dod+0.017
 opens+0.016
 stops+0.015
 unlocks+0.015
 falls+0.015
Neg logit contributions
be-0.013
Sus-0.012
mine-0.012
years-0.012
 se-0.012
Tokenbe
Token ID1350
Feature activation+9.85e+00

Pos logit contributions
be+0.152
years+0.152
Sus+0.150
mine+0.144
 especially+0.135
Neg logit contributions
 dod-0.180
 stops-0.164
 opens-0.161
 unlocks-0.154
 pulls-0.153
Tokenast
Token ID459
Feature activation+3.69e-01

Pos logit contributions
years+0.362
Sus+0.353
mine+0.329
be+0.329
 harbor+0.327
Neg logit contributions
 dod-0.434
 stops-0.405
 pulls-0.397
 falls-0.391
 opens-0.390
Token smiled
Token ID13541
Feature activation-1.71e+00

Pos logit contributions
be+0.014
mine+0.014
Sus+0.013
ina+0.012
 se+0.012
Neg logit contributions
 dod-0.017
 opens-0.015
 stops-0.015
 True-0.015
 unlocks-0.014
Token.
Token ID13
Feature activation-4.03e-01

Pos logit contributions
 dod+0.058
 stops+0.051
 opens+0.050
 unlocks+0.047
 falls+0.046
Neg logit contributions
be-0.084
Sus-0.082
years-0.081
mine-0.081
 se-0.077
Token\n
Token ID198
Feature activation-6.30e-01

Pos logit contributions
 dod+0.018
 opens+0.016
 falls+0.016
 stops+0.015
 unlocks+0.015
Neg logit contributions
Sus-0.015
be-0.014
years-0.013
line-0.013
 se-0.013
Token\n
Token ID198
Feature activation-4.10e-01

Pos logit contributions
 dod+0.028
 opens+0.026
 stops+0.025
 unlocks+0.024
 disappears+0.024
Neg logit contributions
Sus-0.026
be-0.025
mine-0.023
years-0.023
 se-0.022
Token the
Token ID262
Feature activation+1.06e+00

Pos logit contributions
be+0.070
Sus+0.070
mine+0.069
 se+0.068
years+0.067
Neg logit contributions
 dod-0.066
 stops-0.058
 opens-0.056
 unlocks-0.054
 falls-0.053
Token beach
Token ID10481
Feature activation+1.72e+00

Pos logit contributions
be+0.032
Sus+0.031
mine+0.031
years+0.031
ina+0.028
Neg logit contributions
 dod-0.056
 stops-0.052
 opens-0.051
 unlocks-0.048
 falls-0.048
Token to
Token ID284
Feature activation+2.47e+00

Pos logit contributions
be+0.090
Sus+0.089
years+0.089
mine+0.087
ina+0.081
Neg logit contributions
 dod-0.051
 opens-0.046
 stops-0.045
 unlocks-0.042
 falls-0.041
Token feed
Token ID3745
Feature activation+8.16e-01

Pos logit contributions
be+0.117
years+0.112
Sus+0.111
mine+0.109
ina+0.104
Neg logit contributions
 dod-0.091
 opens-0.082
 stops-0.082
 unlocks-0.076
 falls-0.074
Token the
Token ID262
Feature activation-2.59e-02

Pos logit contributions
be+0.032
mine+0.031
Sus+0.031
years+0.030
 se+0.029
Neg logit contributions
 dod-0.037
 stops-0.032
 opens-0.032
 unlocks-0.031
 pulls-0.030
Token se
Token ID384
Feature activation+9.83e+00

Pos logit contributions
 dod+0.001
 stops+0.001
 opens+0.001
 True+0.001
 unlocks+0.001
Neg logit contributions
be-0.001
years-0.001
Sus-0.001
mine-0.001
ina-0.001
Tokenag
Token ID363
Feature activation+3.42e+00

Pos logit contributions
Sus+0.318
be+0.314
years+0.309
 se+0.293
mine+0.286
Neg logit contributions
 dod-0.497
 stops-0.448
 opens-0.445
 unlocks-0.427
 falls-0.422
Tokenull
Token ID724
Feature activation+1.55e+00

Pos logit contributions
years+0.090
be+0.086
Sus+0.080
mine+0.076
 paramed+0.073
Neg logit contributions
 dod-0.188
 stops-0.178
 opens-0.175
 falls-0.167
 unlocks-0.165
Tokens
Token ID82
Feature activation+2.56e+00

Pos logit contributions
be+0.072
Sus+0.068
mine+0.068
years+0.067
 se+0.065
Neg logit contributions
 dod-0.058
 stops-0.051
 opens-0.050
 unlocks-0.048
 pulls-0.047
Token,
Token ID11
Feature activation+8.69e-01

Pos logit contributions
be+0.113
mine+0.109
 se+0.105
Sus+0.104
 especially+0.104
Neg logit contributions
 dod-0.102
 stops-0.088
 opens-0.087
 unlocks-0.085
 pulls-0.082
Token but
Token ID475
Feature activation+5.90e-01

Pos logit contributions
be+0.033
Sus+0.032
mine+0.031
years+0.031
 especially+0.030
Neg logit contributions
 dod-0.039
 stops-0.035
 opens-0.035
 unlocks-0.033
 falls-0.033
Token said
Token ID531
Feature activation+4.60e+00

Pos logit contributions
be+0.125
Sus+0.120
mine+0.118
years+0.117
 se+0.113
Neg logit contributions
 dod-0.150
 stops-0.135
 opens-0.134
 unlocks-0.128
 falls-0.126
Token the
Token ID262
Feature activation-4.69e-01

Pos logit contributions
be+0.215
Sus+0.209
mine+0.207
years+0.202
 se+0.201
Neg logit contributions
 dod-0.169
 opens-0.149
 stops-0.149
 unlocks-0.142
 falls-0.137
Token other
Token ID584
Feature activation+6.15e-01

Pos logit contributions
 dod+0.019
 stops+0.017
 opens+0.017
 unlocks+0.016
 pulls+0.016
Neg logit contributions
be-0.020
Sus-0.020
years-0.019
mine-0.019
ina-0.018
Token wild
Token ID4295
Feature activation-8.37e-01

Pos logit contributions
be+0.024
Sus+0.023
mine+0.023
years+0.023
 se+0.022
Neg logit contributions
 dod-0.027
 stops-0.024
 opens-0.024
 unlocks-0.023
 falls-0.023
Tokener
Token ID263
Feature activation+3.86e+00

Pos logit contributions
 dod+0.039
 stops+0.035
 opens+0.035
 unlocks+0.033
 falls+0.032
Neg logit contributions
be-0.032
Sus-0.030
mine-0.030
years-0.029
 se-0.028
Tokenbe
Token ID1350
Feature activation+9.77e+00

Pos logit contributions
years+0.204
be+0.202
Sus+0.201
mine+0.195
 occasion+0.189
Neg logit contributions
 dod-0.108
 stops-0.091
 opens-0.086
 unlocks-0.077
 pulls-0.077
Tokenast
Token ID459
Feature activation+8.59e-01

Pos logit contributions
years+0.372
Sus+0.365
be+0.360
mine+0.346
 harbor+0.345
Neg logit contributions
 dod-0.414
 stops-0.381
 opens-0.366
 pulls-0.360
 falls-0.360
Token.
Token ID13
Feature activation-1.11e-01

Pos logit contributions
be+0.034
mine+0.032
Sus+0.031
ina+0.029
 se+0.029
Neg logit contributions
 dod-0.040
 opens-0.035
 stops-0.035
 unlocks-0.034
 falls-0.033
Token "
Token ID366
Feature activation+2.43e-01

Pos logit contributions
 dod+0.005
 opens+0.005
 unlocks+0.005
 falls+0.005
 stops+0.004
Neg logit contributions
Sus-0.004
be-0.004
 se-0.003
line-0.003
bar-0.003
TokenLet
Token ID5756
Feature activation-6.73e-01

Pos logit contributions
be+0.009
Sus+0.008
mine+0.008
years+0.008
 se+0.008
Neg logit contributions
 dod-0.011
 stops-0.010
 opens-0.010
 True-0.010
 unlocks-0.009
Token's
Token ID338
Feature activation+2.31e+00

Pos logit contributions
 dod+0.029
 opens+0.026
 stops+0.026
 unlocks+0.024
 falls+0.024
Neg logit contributions
be-0.028
Sus-0.026
mine-0.026
years-0.025
 se-0.025
Tokenebra
Token ID37052
Feature activation-1.06e+00

Pos logit contributions
be+0.057
mine+0.053
years+0.052
Sus+0.052
ina+0.050
Neg logit contributions
 dod-0.059
 stops-0.055
 opens-0.055
 unlocks-0.052
 falls-0.050
Token and
Token ID290
Feature activation-2.05e-01

Pos logit contributions
 dod+0.051
 stops+0.045
 opens+0.045
 unlocks+0.044
 falls+0.042
Neg logit contributions
be-0.038
mine-0.035
Sus-0.035
years-0.034
 se-0.034
Token the
Token ID262
Feature activation-1.75e-01

Pos logit contributions
 dod+0.008
 stops+0.007
 opens+0.006
 unlocks+0.006
 falls+0.006
Neg logit contributions
be-0.010
Sus-0.009
years-0.009
mine-0.009
 se-0.009
Token wild
Token ID4295
Feature activation-9.01e-01

Pos logit contributions
 dod+0.007
 stops+0.006
 opens+0.006
 unlocks+0.006
 pulls+0.006
Neg logit contributions
be-0.007
Sus-0.007
mine-0.007
years-0.007
ina-0.007
Tokene
Token ID68
Feature activation+1.76e+00

Pos logit contributions
 dod+0.039
 stops+0.035
 opens+0.035
 unlocks+0.033
 falls+0.033
Neg logit contributions
be-0.036
Sus-0.035
years-0.034
mine-0.034
 se-0.032
Tokenbe
Token ID1350
Feature activation+9.76e+00

Pos logit contributions
be+0.069
Sus+0.064
mine+0.064
years+0.063
 se+0.060
Neg logit contributions
 dod-0.079
 opens-0.070
 stops-0.070
 unlocks-0.068
 pulls-0.066
Tokenest
Token ID395
Feature activation+3.78e-01

Pos logit contributions
Sus+0.342
years+0.340
line+0.330
be+0.319
mine+0.318
Neg logit contributions
 dod-0.440
 stops-0.405
 opens-0.399
 pulls-0.390
 falls-0.386
Token lived
Token ID5615
Feature activation-1.73e+00

Pos logit contributions
be+0.015
Sus+0.014
years+0.014
mine+0.014
 se+0.013
Neg logit contributions
 dod-0.017
 stops-0.015
 opens-0.015
 unlocks-0.014
 falls-0.014
Token happily
Token ID18177
Feature activation-2.42e+00

Pos logit contributions
 dod+0.077
 stops+0.071
 opens+0.070
 unlocks+0.068
 pulls+0.066
Neg logit contributions
be-0.069
Sus-0.065
years-0.064
mine-0.063
 se-0.061
Token ever
Token ID1683
Feature activation-1.51e+00

Pos logit contributions
 dod+0.109
 stops+0.096
 opens+0.095
 falls+0.090
 unlocks+0.090
Neg logit contributions
Sus-0.092
be-0.091
mine-0.088
years-0.087
 se-0.083
Token after
Token ID706
Feature activation-2.29e-01

Pos logit contributions
 dod+0.063
 stops+0.055
 opens+0.055
 unlocks+0.051
 falls+0.051
Neg logit contributions
be-0.063
Sus-0.062
mine-0.061
years-0.060
 se-0.058
Token their
Token ID511
Feature activation+5.20e-01

Pos logit contributions
 dod+0.020
 opens+0.018
 stops+0.018
 unlocks+0.017
 disappears+0.017
Neg logit contributions
 se-0.012
mine-0.012
Sus-0.012
be-0.012
years-0.012
Token families
Token ID4172
Feature activation-4.10e-01

Pos logit contributions
 se+0.013
be+0.013
 harbor+0.012
 ocean+0.010
 wedding+0.010
Neg logit contributions
 dod-0.030
 unlocks-0.028
 opens-0.028
ccoli-0.027
 stops-0.027
Token.
Token ID13
Feature activation-8.95e-01

Pos logit contributions
 dod+0.019
 opens+0.016
 stops+0.016
 unlocks+0.016
 pulls+0.016
Neg logit contributions
be-0.016
mine-0.015
 especially-0.015
 se-0.015
line-0.014
Token They
Token ID1119
Feature activation+9.83e-01

Pos logit contributions
 dod+0.039
 opens+0.034
 stops+0.034
 unlocks+0.033
 disappears+0.032
Neg logit contributions
be-0.037
Sus-0.036
mine-0.033
 especially-0.032
 se-0.032
Token call
Token ID869
Feature activation-2.90e+00

Pos logit contributions
 se+0.035
be+0.035
mine+0.034
 especially+0.032
 dressed+0.029
Neg logit contributions
 dod-0.044
ccoli-0.039
 unlocks-0.038
 stops-0.038
 opens-0.038
Token 911
Token ID16679
Feature activation-8.87e+00

Pos logit contributions
 dod+0.168
 opens+0.146
 unlocks+0.145
ccoli+0.145
 stops+0.143
Neg logit contributions
mine-0.072
be-0.072
Sus-0.068
 especially-0.068
 se-0.066
Token.
Token ID13
Feature activation-1.52e+00

Pos logit contributions
 dod+0.390
 pulls+0.316
 opens+0.314
 unlocks+0.312
 disappears+0.306
Neg logit contributions
be-0.386
mine-0.351
ina-0.340
line-0.337
 especially-0.326
Token They
Token ID1119
Feature activation+4.29e-01

Pos logit contributions
 dod+0.067
 opens+0.060
 stops+0.058
 unlocks+0.058
 disappears+0.056
Neg logit contributions
be-0.060
Sus-0.060
mine-0.056
 especially-0.054
years-0.054
Token hope
Token ID2911
Feature activation-1.58e+00

Pos logit contributions
be+0.015
 se+0.015
mine+0.014
 especially+0.014
 dressed+0.013
Neg logit contributions
 dod-0.020
ccoli-0.017
 stops-0.017
 unlocks-0.017
 opens-0.017
Token it
Token ID340
Feature activation-2.36e+00

Pos logit contributions
 dod+0.068
 stops+0.059
 opens+0.059
 unlocks+0.056
 pulls+0.055
Neg logit contributions
be-0.066
Sus-0.063
years-0.062
mine-0.062
 se-0.059
Token is
Token ID318
Feature activation-1.11e+00

Pos logit contributions
 dod+0.093
 Patch+0.087
 True+0.083
ccoli+0.080
 magn+0.079
Neg logit contributions
be-0.101
 especially-0.083
m-0.078
 se-0.078
 decided-0.075
Token hurt
Token ID5938
Feature activation-6.06e-01

Pos logit contributions
 dod+0.049
 stops+0.045
 opens+0.044
 unlocks+0.042
 falls+0.042
Neg logit contributions
be-0.048
Sus-0.046
years-0.046
 especially-0.044
mine-0.043
Token badly
Token ID11234
Feature activation-2.25e+00

Pos logit contributions
 dod+0.024
 opens+0.021
 stops+0.021
 unlocks+0.020
 pulls+0.020
Neg logit contributions
be-0.027
Sus-0.026
years-0.026
mine-0.025
 especially-0.024
Token.
Token ID13
Feature activation-1.40e+00

Pos logit contributions
 dod+0.112
 unlocks+0.106
 opens+0.105
 Ready+0.104
ccoli+0.100
Neg logit contributions
be-0.076
 se-0.064
 especially-0.061
m-0.061
h-0.058
Token She
Token ID1375
Feature activation-1.03e+00

Pos logit contributions
 dod+0.071
 opens+0.060
 stops+0.060
 unlocks+0.058
 disappears+0.058
Neg logit contributions
Sus-0.044
be-0.043
years-0.042
line-0.041
 especially-0.040
Token called
Token ID1444
Feature activation-2.63e+00

Pos logit contributions
 dod+0.054
 stops+0.047
 opens+0.047
 disappears+0.046
 unlocks+0.046
Neg logit contributions
be-0.032
 se-0.031
Sus-0.030
years-0.030
 especially-0.028
Token 911
Token ID16679
Feature activation-8.75e+00

Pos logit contributions
 dod+0.123
 unlocks+0.103
 opens+0.101
 stops+0.101
 AW+0.098
Neg logit contributions
mine-0.090
 especially-0.086
be-0.083
years-0.083
ful-0.083
Token and
Token ID290
Feature activation-1.96e+00

Pos logit contributions
 dod+0.373
 opens+0.297
 unlocks+0.296
 pulls+0.295
 AW+0.291
Neg logit contributions
be-0.381
mine-0.356
ina-0.348
years-0.338
Sus-0.327
Token waited
Token ID13488
Feature activation-1.56e+00

Pos logit contributions
 dod+0.097
 stops+0.084
 opens+0.082
 disappears+0.079
 unlocks+0.079
Neg logit contributions
be-0.066
 se-0.065
years-0.063
Sus-0.062
 especially-0.059
Token for
Token ID329
Feature activation-1.82e+00

Pos logit contributions
 dod+0.070
 stops+0.060
 opens+0.060
 unlocks+0.058
 pulls+0.057
Neg logit contributions
be-0.063
Sus-0.059
years-0.058
mine-0.057
 especially-0.057
Token the
Token ID262
Feature activation-1.63e+00

Pos logit contributions
 dod+0.089
 stops+0.081
 opens+0.079
 unlocks+0.078
 falls+0.075
Neg logit contributions
be-0.061
years-0.061
Sus-0.060
mine-0.059
 se-0.057
Token ambulance
Token ID22536
Feature activation-1.40e+00

Pos logit contributions
 dod+0.072
 opens+0.065
 unlocks+0.065
 stops+0.062
 pulls+0.061
Neg logit contributions
be-0.063
 se-0.059
 especially-0.056
Sus-0.055
 occasion-0.055
Token cl
Token ID537
Feature activation+2.69e+00

Pos logit contributions
be+0.070
Sus+0.065
mine+0.065
years+0.064
 se+0.061
Neg logit contributions
 dod-0.089
 stops-0.079
 opens-0.078
 unlocks-0.076
 pulls-0.074
Tokenapped
Token ID6320
Feature activation+5.39e-01

Pos logit contributions
 especially+0.077
 occasion+0.075
be+0.074
 weekend+0.074
Sus+0.073
Neg logit contributions
 dod-0.138
 opens-0.125
 stops-0.125
 unlocks-0.122
 pulls-0.120
Token and
Token ID290
Feature activation-4.95e-01

Pos logit contributions
be+0.021
Sus+0.020
years+0.019
mine+0.019
 se+0.018
Neg logit contributions
 dod-0.024
 stops-0.022
 opens-0.022
 unlocks-0.021
 falls-0.020
Token shouted
Token ID17293
Feature activation-4.25e-01

Pos logit contributions
 dod+0.024
 stops+0.022
 opens+0.022
 unlocks+0.021
 falls+0.021
Neg logit contributions
be-0.017
Sus-0.016
years-0.016
mine-0.016
 se-0.015
Token "
Token ID366
Feature activation-1.22e+00

Pos logit contributions
 dod+0.017
 unlocks+0.015
 pulls+0.015
 opens+0.015
 stops+0.014
Neg logit contributions
be-0.018
Sus-0.018
mine-0.017
years-0.017
 especially-0.017
Tokenready
Token ID1493
Feature activation-8.44e+00

Pos logit contributions
 dod+0.059
 stops+0.053
 opens+0.053
 unlocks+0.050
 pulls+0.049
Neg logit contributions
be-0.041
Sus-0.039
years-0.039
mine-0.038
 se-0.037
Token to
Token ID284
Feature activation-2.08e-02

Pos logit contributions
 dod+0.353
 charges+0.289
 unlocks+0.287
 opens+0.286
 closes+0.277
Neg logit contributions
be-0.385
mine-0.368
h-0.323
Sus-0.321
ina-0.319
Token go
Token ID467
Feature activation-2.67e+00

Pos logit contributions
 dod+0.001
 stops+0.001
 opens+0.001
 unlocks+0.001
ccoli+0.001
Neg logit contributions
be-0.001
mine-0.001
Sus-0.001
 se-0.001
ina-0.001
Token!"
Token ID2474
Feature activation+2.30e+00

Pos logit contributions
 dod+0.093
 stops+0.076
 opens+0.075
 pulls+0.073
 unlocks+0.072
Neg logit contributions
be-0.135
Sus-0.129
mine-0.128
years-0.121
ina-0.121
Token She
Token ID1375
Feature activation+5.36e-02

Pos logit contributions
be+0.096
Sus+0.095
mine+0.094
years+0.090
ina+0.087
Neg logit contributions
 dod-0.098
 opens-0.085
 stops-0.084
 unlocks-0.082
 pulls-0.079
Token was
Token ID373
Feature activation-4.19e-01

Pos logit contributions
be+0.002
Sus+0.002
 se+0.002
mine+0.002
years+0.002
Neg logit contributions
 dod-0.003
 stops-0.002
 opens-0.002
 unlocks-0.002
 True-0.002
Token of
Token ID286
Feature activation-1.31e-02

Pos logit contributions
 dod+0.021
 stops+0.018
 pulls+0.017
 opens+0.017
 unlocks+0.017
Neg logit contributions
be-0.015
 se-0.013
ina-0.013
mine-0.013
Sus-0.013
Token the
Token ID262
Feature activation-1.01e-01

Pos logit contributions
 dod+0.000
 opens+0.000
 stops+0.000
 unlocks+0.000
 True+0.000
Neg logit contributions
be-0.001
Sus-0.001
years-0.001
mine-0.001
line-0.001
Token planets
Token ID14705
Feature activation-2.45e+00

Pos logit contributions
 dod+0.005
 stops+0.005
 opens+0.004
 unlocks+0.004
 pulls+0.004
Neg logit contributions
be-0.003
years-0.003
 se-0.003
mine-0.003
Sus-0.003
Token is
Token ID318
Feature activation+7.35e-01

Pos logit contributions
 dod+0.106
 stops+0.088
 pulls+0.085
ccoli+0.084
 opens+0.084
Neg logit contributions
be-0.102
mine-0.098
ina-0.094
 especially-0.093
 se-0.092
Token called
Token ID1444
Feature activation-2.23e+00

Pos logit contributions
be+0.023
 especially+0.023
ina+0.021
years+0.021
 se+0.021
Neg logit contributions
 dod-0.037
 stops-0.034
 pulls-0.033
 falls-0.033
 unlocks-0.032
Token Earth
Token ID3668
Feature activation-8.39e+00

Pos logit contributions
 dod+0.100
 pulls+0.085
 stops+0.085
 falls+0.083
 unlocks+0.082
Neg logit contributions
be-0.087
years-0.082
mine-0.081
 se-0.080
ina-0.080
Token.
Token ID13
Feature activation-8.70e-01

Pos logit contributions
 dod+0.348
 pulls+0.295
 stops+0.290
 falls+0.286
 unlocks+0.280
Neg logit contributions
be-0.370
ina-0.342
mine-0.328
 especially-0.322
years-0.313
Token That
Token ID1320
Feature activation+5.87e-01

Pos logit contributions
 dod+0.034
 stops+0.030
 opens+0.029
 falls+0.028
 pulls+0.028
Neg logit contributions
be-0.040
Sus-0.037
years-0.037
mine-0.036
 especially-0.035
Token's
Token ID338
Feature activation+1.74e+00

Pos logit contributions
be+0.025
mine+0.023
 se+0.023
ina+0.023
Sus+0.023
Neg logit contributions
 dod-0.024
 stops-0.020
 opens-0.020
 pulls-0.020
 unlocks-0.019
Token our
Token ID674
Feature activation-6.31e-01

Pos logit contributions
 especially+0.052
ina+0.044
 particularly+0.041
 3+0.040
Sus+0.040
Neg logit contributions
 dod-0.098
 falls-0.090
 pulls-0.088
 stops-0.086
ls-0.084
Token planet
Token ID5440
Feature activation-3.26e+00

Pos logit contributions
 dod+0.030
 pulls+0.029
 unlocks+0.027
 falls+0.027
 cleans+0.027
Neg logit contributions
 harbor-0.020
 wedding-0.020
 ocean-0.020
 especially-0.019
be-0.019
Token here
Token ID994
Feature activation-3.18e+00

Pos logit contributions
 dod+0.234
 stops+0.198
 opens+0.195
 pulls+0.194
 unlocks+0.188
Neg logit contributions
be-0.216
Sus-0.204
mine-0.203
years-0.202
 especially-0.200
Token,
Token ID11
Feature activation+7.72e-01

Pos logit contributions
 dod+0.133
 stops+0.117
 opens+0.115
 unlocks+0.111
 pulls+0.110
Neg logit contributions
be-0.134
Sus-0.128
mine-0.127
years-0.126
 especially-0.121
Tokenâ
Token ID22940
Feature activation+2.84e+00

Pos logit contributions
be+0.028
Sus+0.027
 especially+0.027
 se+0.026
mine+0.026
Neg logit contributions
 dod-0.036
 opens-0.032
 stops-0.032
 unlocks-0.031
 pulls-0.030
Token
Token ID26391
Feature activation+1.02e+00

Pos logit contributions
be+0.105
Sus+0.101
years+0.099
mine+0.098
 se+0.093
Neg logit contributions
 dod-0.133
 stops-0.120
 opens-0.119
 unlocks-0.114
 falls-0.112
Token said
Token ID531
Feature activation+8.59e-01

Pos logit contributions
be+0.044
Sus+0.041
mine+0.039
line+0.038
 especially+0.038
Neg logit contributions
 dod-0.042
 opens-0.038
 stops-0.036
 unlocks-0.036
 pulls-0.035
Token Earth
Token ID3668
Feature activation-8.32e+00

Pos logit contributions
be+0.036
Sus+0.035
mine+0.034
years+0.033
 se+0.033
Neg logit contributions
 dod-0.036
 opens-0.031
 stops-0.030
 unlocks-0.030
 pulls-0.029
Token Man
Token ID1869
Feature activation-3.21e+00

Pos logit contributions
 dod+0.351
 opens+0.297
 pulls+0.297
 falls+0.293
 unlocks+0.293
Neg logit contributions
be-0.345
ina-0.325
mine-0.313
Sus-0.310
years-0.300
Token.
Token ID13
Feature activation-6.47e-01

Pos logit contributions
 dod+0.135
 stops+0.117
 opens+0.116
 pulls+0.113
 unlocks+0.112
Neg logit contributions
be-0.136
mine-0.127
Sus-0.126
years-0.125
ina-0.125
Token\n
Token ID198
Feature activation-7.63e-01

Pos logit contributions
 dod+0.030
 opens+0.027
 stops+0.027
 falls+0.026
 unlocks+0.026
Neg logit contributions
Sus-0.025
be-0.023
mine-0.022
years-0.021
 especially-0.021
Token\n
Token ID198
Feature activation-4.68e-01

Pos logit contributions
 dod+0.030
 opens+0.026
 stops+0.026
 unlocks+0.025
 pulls+0.024
Neg logit contributions
be-0.034
Sus-0.034
mine-0.032
years-0.032
 especially-0.031
TokenLu
Token ID25596
Feature activation-2.64e+00

Pos logit contributions
 dod+0.020
 stops+0.017
 opens+0.017
 unlocks+0.016
 True+0.016
Neg logit contributions
be-0.019
years-0.018
mine-0.018
ina-0.018
 se-0.018
Token not
Token ID407
Feature activation-3.76e+00

Pos logit contributions
 dod+0.087
 opens+0.085
 stops+0.085
 unlocks+0.085
 falls+0.082
Neg logit contributions
 especially-0.065
 se-0.058
years-0.056
be-0.053
 dressed-0.052
Token toys
Token ID14958
Feature activation-6.07e+00

Pos logit contributions
 dod+0.173
 opens+0.160
 stops+0.157
 unlocks+0.155
 falls+0.153
Neg logit contributions
be-0.132
 especially-0.126
 se-0.124
Sus-0.120
years-0.116
Token.
Token ID13
Feature activation-1.62e+00

Pos logit contributions
 dod+0.256
 opens+0.222
 unlocks+0.217
 stops+0.213
 pulls+0.208
Neg logit contributions
be-0.257
 especially-0.237
mine-0.237
 se-0.235
line-0.231
Token They
Token ID1119
Feature activation+5.78e-01

Pos logit contributions
 dod+0.066
 opens+0.058
 stops+0.058
 unlocks+0.056
 pulls+0.054
Neg logit contributions
be-0.069
Sus-0.067
mine-0.065
years-0.065
 se-0.063
Token said
Token ID531
Feature activation+8.51e-02

Pos logit contributions
be+0.024
Sus+0.024
years+0.023
mine+0.022
line+0.022
Neg logit contributions
 dod-0.025
 stops-0.023
 opens-0.022
 pulls-0.021
 falls-0.021
Token lasers
Token ID34628
Feature activation-8.31e+00

Pos logit contributions
be+0.004
Sus+0.003
years+0.003
mine+0.003
 se+0.003
Neg logit contributions
 dod-0.004
 stops-0.003
 opens-0.003
 unlocks-0.003
 falls-0.003
Token were
Token ID547
Feature activation-2.01e+00

Pos logit contributions
 dod+0.409
 opens+0.347
 Ready+0.346
 unlocks+0.340
 True+0.338
Neg logit contributions
be-0.289
mine-0.275
ina-0.255
 se-0.255
 especially-0.252
Token dangerous
Token ID4923
Feature activation-4.27e+00

Pos logit contributions
 dod+0.093
 stops+0.090
 opens+0.089
 unlocks+0.088
 falls+0.087
Neg logit contributions
 especially-0.070
 se-0.063
years-0.059
be-0.057
 dressed-0.057
Token.
Token ID13
Feature activation-1.65e+00

Pos logit contributions
 dod+0.158
 opens+0.135
 stops+0.132
 unlocks+0.129
 pulls+0.125
Neg logit contributions
be-0.203
Sus-0.188
mine-0.187
years-0.186
 se-0.181
Token
Token ID198
Feature activation-1.54e+00

Pos logit contributions
 dod+0.067
 opens+0.059
 stops+0.058
 unlocks+0.057
 pulls+0.054
Neg logit contributions
be-0.071
Sus-0.069
mine-0.067
years-0.066
 se-0.064
Token
Token ID198
Feature activation-1.20e+00

Pos logit contributions
 dod+0.064
 opens+0.057
 stops+0.056
 unlocks+0.054
 pulls+0.052
Neg logit contributions
be-0.066
Sus-0.065
years-0.061
mine-0.061
 se-0.060
Token Jen
Token ID13364
Feature activation+9.18e-01

Pos logit contributions
 dod+0.022
 stops+0.018
 opens+0.018
ccoli+0.017
 disappears+0.017
Neg logit contributions
Sus-0.031
mine-0.030
be-0.030
years-0.030
 se-0.030
Token hurt
Token ID5938
Feature activation+1.38e+00

Pos logit contributions
be+0.037
mine+0.033
m+0.031
 se+0.030
h+0.030
Neg logit contributions
 dod-0.040
ccoli-0.036
 True-0.034
 disappears-0.034
 opens-0.033
Token.
Token ID13
Feature activation-9.20e-01

Pos logit contributions
be+0.062
years+0.060
Sus+0.058
 occasion+0.057
mine+0.056
Neg logit contributions
 dod-0.051
 stops-0.047
 opens-0.047
 falls-0.043
 unlocks-0.043
Token He
Token ID679
Feature activation-1.34e+00

Pos logit contributions
 dod+0.043
 opens+0.036
 stops+0.036
ccoli+0.035
 disappears+0.035
Neg logit contributions
be-0.034
Sus-0.034
 se-0.032
years-0.031
mine-0.031
Token calls
Token ID3848
Feature activation-2.72e+00

Pos logit contributions
ccoli+0.047
 dod+0.039
 True+0.038
 Ready+0.036
 common+0.034
Neg logit contributions
 se-0.059
be-0.053
years-0.050
 especially-0.049
m-0.048
Token 911
Token ID16679
Feature activation-8.24e+00

Pos logit contributions
 dod+0.120
 opens+0.102
 stops+0.101
ccoli+0.098
 unlocks+0.097
Neg logit contributions
be-0.108
Sus-0.105
mine-0.104
years-0.100
 se-0.097
Token.
Token ID13
Feature activation-1.62e+00

Pos logit contributions
 dod+0.351
 opens+0.291
 stops+0.284
 unlocks+0.284
 pulls+0.281
Neg logit contributions
be-0.355
mine-0.335
years-0.319
ina-0.317
Sus-0.314
Token He
Token ID679
Feature activation-1.97e+00

Pos logit contributions
 dod+0.073
 opens+0.061
 stops+0.061
 unlocks+0.059
 disappears+0.059
Neg logit contributions
be-0.063
Sus-0.062
years-0.058
 se-0.057
mine-0.057
Token says
Token ID1139
Feature activation-1.27e-02

Pos logit contributions
ccoli+0.064
 dod+0.061
 True+0.059
 Ready+0.052
 Patch+0.050
Neg logit contributions
 se-0.090
be-0.081
 decided-0.075
 dressed-0.075
 especially-0.074
Token "
Token ID366
Feature activation-2.26e+00

Pos logit contributions
 dod+0.001
 opens+0.000
 unlocks+0.000
 pulls+0.000
 aims+0.000
Neg logit contributions
be-0.001
Sus-0.001
 se-0.001
mine-0.000
years-0.000
TokenA
Token ID32
Feature activation-4.42e+00

Pos logit contributions
 dod+0.091
 stops+0.085
 opens+0.079
 pulls+0.078
 falls+0.076
Neg logit contributions
years-0.089
be-0.084
ina-0.084
 wedding-0.084
mine-0.083
Token lasers
Token ID34628
Feature activation-4.94e+00

Pos logit contributions
 dod+0.027
 stops+0.025
 unlocks+0.025
 opens+0.025
 closes+0.025
Neg logit contributions
 se-0.007
mine-0.007
 especially-0.007
years-0.007
be-0.007
Token anymore
Token ID7471
Feature activation-2.28e+00

Pos logit contributions
 dod+0.202
 opens+0.177
 stops+0.173
 unlocks+0.172
 pulls+0.167
Neg logit contributions
be-0.214
mine-0.205
 especially-0.195
Sus-0.193
years-0.193
Token.
Token ID13
Feature activation-9.48e-01

Pos logit contributions
 dod+0.112
 opens+0.105
 unlocks+0.104
 stops+0.098
 pulls+0.097
Neg logit contributions
 especially-0.077
be-0.075
mine-0.072
ina-0.070
Sus-0.068
Token They
Token ID1119
Feature activation+8.40e-01

Pos logit contributions
 dod+0.040
 opens+0.035
 stops+0.035
 unlocks+0.034
 pulls+0.032
Neg logit contributions
be-0.039
Sus-0.038
mine-0.037
years-0.037
 se-0.035
Token said
Token ID531
Feature activation+1.85e-01

Pos logit contributions
be+0.035
Sus+0.035
years+0.034
mine+0.033
ina+0.032
Neg logit contributions
 dod-0.035
 stops-0.032
 opens-0.031
 pulls-0.030
 falls-0.029
Token lasers
Token ID34628
Feature activation-8.15e+00

Pos logit contributions
be+0.008
Sus+0.008
years+0.008
mine+0.008
ina+0.007
Neg logit contributions
 dod-0.008
 stops-0.007
 opens-0.007
 falls-0.006
 unlocks-0.006
Token were
Token ID547
Feature activation-1.89e+00

Pos logit contributions
 dod+0.407
 Ready+0.346
 opens+0.346
 unlocks+0.340
 True+0.337
Neg logit contributions
be-0.277
mine-0.262
ina-0.243
 se-0.242
 especially-0.240
Token not
Token ID407
Feature activation-3.76e+00

Pos logit contributions
 dod+0.087
 opens+0.085
 stops+0.085
 unlocks+0.085
 falls+0.082
Neg logit contributions
 especially-0.065
 se-0.058
years-0.056
be-0.053
 dressed-0.052
Token toys
Token ID14958
Feature activation-6.07e+00

Pos logit contributions
 dod+0.173
 opens+0.160
 stops+0.157
 unlocks+0.155
 falls+0.153
Neg logit contributions
be-0.132
 especially-0.126
 se-0.124
Sus-0.120
years-0.116
Token.
Token ID13
Feature activation-1.62e+00

Pos logit contributions
 dod+0.256
 opens+0.222
 unlocks+0.217
 stops+0.213
 pulls+0.208
Neg logit contributions
be-0.257
 especially-0.237
mine-0.237
 se-0.235
line-0.231
Token They
Token ID1119
Feature activation+5.78e-01

Pos logit contributions
 dod+0.066
 opens+0.058
 stops+0.058
 unlocks+0.056
 pulls+0.054
Neg logit contributions
be-0.069
Sus-0.067
mine-0.065
years-0.065
 se-0.063
Token.
Token ID13
Feature activation-3.40e-01

Pos logit contributions
be+0.052
Sus+0.050
years+0.049
mine+0.049
 se+0.047
Neg logit contributions
 dod-0.040
 stops-0.034
 opens-0.034
 unlocks-0.032
 falls-0.031
Token
Token ID198
Feature activation-6.85e-01

Pos logit contributions
 dod+0.013
 stops+0.012
 opens+0.012
 unlocks+0.011
 falls+0.011
Neg logit contributions
be-0.015
Sus-0.014
mine-0.014
years-0.014
 se-0.014
Token
Token ID198
Feature activation-3.99e-01

Pos logit contributions
 dod+0.022
 stops+0.019
 opens+0.019
 unlocks+0.017
 falls+0.017
Neg logit contributions
be-0.035
years-0.034
mine-0.034
Sus-0.033
 se-0.032
Token"
Token ID1
Feature activation+1.50e+00

Pos logit contributions
 dod+0.012
 opens+0.012
 stops+0.011
 Ready+0.011
 unlocks+0.011
Neg logit contributions
mine-0.018
 se-0.018
be-0.018
years-0.017
 wedding-0.017
TokenIs
Token ID3792
Feature activation-1.95e+00

Pos logit contributions
be+0.057
years+0.056
mine+0.054
 se+0.053
ina+0.052
Neg logit contributions
 dod-0.063
 stops-0.057
 opens-0.055
 Ready-0.054
 unlocks-0.052
Token math
Token ID10688
Feature activation-8.12e+00

Pos logit contributions
 dod+0.080
 stops+0.069
 opens+0.069
 unlocks+0.066
 pulls+0.064
Neg logit contributions
be-0.084
Sus-0.080
years-0.080
mine-0.079
 se-0.076
Token hard
Token ID1327
Feature activation-4.32e+00

Pos logit contributions
 dod+0.348
 stops+0.291
 opens+0.289
 unlocks+0.277
 pulls+0.276
Neg logit contributions
be-0.323
 especially-0.307
Sus-0.290
 se-0.288
mine-0.285
Token?"
Token ID1701
Feature activation+2.02e+00

Pos logit contributions
 dod+0.165
 stops+0.143
 opens+0.141
 unlocks+0.133
 pulls+0.131
Neg logit contributions
be-0.198
Sus-0.190
mine-0.187
years-0.186
 se-0.179
Token asked
Token ID1965
Feature activation+3.17e+00

Pos logit contributions
be+0.078
years+0.072
mine+0.072
Sus+0.070
rol+0.068
Neg logit contributions
 dod-0.088
 opens-0.085
 stops-0.084
 unlocks-0.079
 falls-0.078
Token Jenny
Token ID28437
Feature activation+2.41e+00

Pos logit contributions
be+0.119
Sus+0.112
years+0.110
mine+0.110
 se+0.104
Neg logit contributions
 dod-0.145
 stops-0.132
 opens-0.132
 unlocks-0.125
 falls-0.122
Token.
Token ID13
Feature activation+3.73e-01

Pos logit contributions
Sus+0.129
be+0.129
years+0.128
mine+0.123
 se+0.121
Neg logit contributions
 dod-0.063
 opens-0.058
 stops-0.055
 unlocks-0.049
 pulls-0.049
Token the
Token ID262
Feature activation-2.00e+00

Pos logit contributions
mine+0.034
years+0.034
 se+0.030
be+0.030
 especially+0.030
Neg logit contributions
 dod-0.066
 stops-0.060
 opens-0.059
ccoli-0.059
 closes-0.058
Token four
Token ID1440
Feature activation+9.28e-01

Pos logit contributions
 dod+0.097
 opens+0.088
 falls+0.087
 stops+0.085
 disappears+0.085
Neg logit contributions
be-0.066
 especially-0.065
 se-0.065
mine-0.063
 3-0.062
Token.
Token ID13
Feature activation-6.24e-01

Pos logit contributions
be+0.034
mine+0.032
 especially+0.031
years+0.031
 se+0.031
Neg logit contributions
 dod-0.043
 stops-0.039
 opens-0.038
 disappears-0.038
 unlocks-0.037
Token She
Token ID1375
Feature activation-5.45e-01

Pos logit contributions
 dod+0.033
 stops+0.029
 opens+0.028
 disappears+0.028
 pulls+0.028
Neg logit contributions
be-0.020
Sus-0.019
mine-0.019
years-0.019
ina-0.017
Token looked
Token ID3114
Feature activation+8.03e-01

Pos logit contributions
 dod+0.028
 stops+0.025
 opens+0.025
 unlocks+0.024
 falls+0.023
Neg logit contributions
be-0.018
years-0.017
mine-0.017
Sus-0.017
 se-0.016
Token everywhere
Token ID8347
Feature activation-8.10e+00

Pos logit contributions
 especially+0.026
years+0.026
 particularly+0.024
be+0.024
 se+0.023
Neg logit contributions
 dod-0.040
 unlocks-0.035
ccoli-0.034
 stops-0.034
 opens-0.034
Token,
Token ID11
Feature activation+4.25e-01

Pos logit contributions
 dod+0.353
 stops+0.304
 opens+0.301
 unlocks+0.291
 pulls+0.287
Neg logit contributions
be-0.330
Sus-0.327
mine-0.319
 especially-0.311
 se-0.310
Token but
Token ID475
Feature activation+8.49e-02

Pos logit contributions
be+0.018
mine+0.017
years+0.017
Sus+0.017
 especially+0.017
Neg logit contributions
 dod-0.018
 stops-0.015
 opens-0.015
 unlocks-0.015
 pulls-0.014
Token she
Token ID673
Feature activation-4.70e-01

Pos logit contributions
years+0.002
mine+0.002
 especially+0.002
be+0.002
Sus+0.002
Neg logit contributions
 dod-0.005
 stops-0.004
 opens-0.004
ccoli-0.004
 charges-0.004
Token could
Token ID714
Feature activation+2.23e-01

Pos logit contributions
 dod+0.021
 stops+0.019
 opens+0.019
 unlocks+0.018
 falls+0.017
Neg logit contributions
be-0.018
Sus-0.018
years-0.017
mine-0.017
 se-0.016
Token not
Token ID407
Feature activation-3.32e-01

Pos logit contributions
 especially+0.009
mine+0.009
be+0.009
 se+0.009
Sus+0.008
Neg logit contributions
 dod-0.009
 stops-0.008
 opens-0.008
 unlocks-0.008
ccoli-0.008
Token,
Token ID11
Feature activation+1.35e+00

Pos logit contributions
 dod+0.110
ccoli+0.106
 unlocks+0.100
 stops+0.099
 Ready+0.097
Neg logit contributions
be-0.117
 se-0.106
h-0.103
mine-0.100
years-0.098
Token nine
Token ID5193
Feature activation-3.08e+00

Pos logit contributions
be+0.053
mine+0.047
years+0.046
Sus+0.045
 se+0.045
Neg logit contributions
 dod-0.062
 stops-0.055
 opens-0.053
 unlocks-0.053
ccoli-0.052
Token,
Token ID11
Feature activation-6.64e-02

Pos logit contributions
 dod+0.126
ccoli+0.115
 unlocks+0.106
 Ready+0.106
 stops+0.106
Neg logit contributions
be-0.122
h-0.111
 se-0.103
 especially-0.100
years-0.100
Token ten
Token ID3478
Feature activation-3.52e+00

Pos logit contributions
 dod+0.003
 stops+0.003
 opens+0.003
 unlocks+0.002
 falls+0.002
Neg logit contributions
be-0.003
mine-0.002
years-0.002
Sus-0.002
 se-0.002
Token!
Token ID0
Feature activation-1.15e+00

Pos logit contributions
 dod+0.135
ccoli+0.120
 stops+0.117
 unlocks+0.115
 Ready+0.111
Neg logit contributions
be-0.155
h-0.141
years-0.135
 se-0.134
mine-0.134
Token Ready
Token ID23432
Feature activation-8.09e+00

Pos logit contributions
 dod+0.043
 stops+0.038
 opens+0.037
 falls+0.036
 unlocks+0.035
Neg logit contributions
be-0.052
Sus-0.050
mine-0.048
years-0.047
bar-0.046
Token or
Token ID393
Feature activation-3.44e+00

Pos logit contributions
 dod+0.355
 charges+0.273
 stops+0.272
ccoli+0.269
 opens+0.265
Neg logit contributions
be-0.353
mine-0.350
ina-0.319
h-0.318
days-0.313
Token not
Token ID407
Feature activation-1.46e+00

Pos logit contributions
 dod+0.146
 stops+0.126
 opens+0.123
 unlocks+0.119
ccoli+0.118
Neg logit contributions
be-0.140
mine-0.132
Sus-0.130
 se-0.130
years-0.127
Token,
Token ID11
Feature activation-6.15e-01

Pos logit contributions
 dod+0.065
ccoli+0.057
 stops+0.049
 pulls+0.049
 cleans+0.049
Neg logit contributions
 se-0.053
be-0.052
Sus-0.051
h-0.051
ina-0.049
Token here
Token ID994
Feature activation-4.04e+00

Pos logit contributions
 dod+0.025
 stops+0.022
 opens+0.021
 unlocks+0.020
 pulls+0.020
Neg logit contributions
be-0.026
Sus-0.025
mine-0.025
 se-0.024
years-0.024
Token I
Token ID314
Feature activation+8.54e-02

Pos logit contributions
 dod+0.181
 charges+0.150
 stops+0.136
ses+0.136
 aims+0.135
Neg logit contributions
be-0.145
mine-0.138
 especially-0.135
 se-0.134
date-0.133
Token find
Token ID1064
Feature activation+1.23e+00

Pos logit contributions
 dod+0.008
 opens+0.008
 stops+0.007
 unlocks+0.007
 falls+0.007
Neg logit contributions
be-0.011
years-0.011
Sus-0.011
mine-0.010
ina-0.010
Token it
Token ID340
Feature activation-7.63e-01

Pos logit contributions
mine+0.035
be+0.034
years+0.033
Sus+0.032
 especially+0.031
Neg logit contributions
 dod-0.066
 stops-0.060
 opens-0.059
 unlocks-0.057
ccoli-0.055
Token.
Token ID13
Feature activation-8.71e-01

Pos logit contributions
 dod+0.040
 stops+0.032
 opens+0.032
 pulls+0.031
 charges+0.031
Neg logit contributions
mine-0.026
 especially-0.025
be-0.025
ina-0.024
Sus-0.024
Token She
Token ID1375
Feature activation-6.66e-01

Pos logit contributions
 dod+0.050
 opens+0.042
 disappears+0.042
 stops+0.042
 pulls+0.041
Neg logit contributions
Sus-0.025
be-0.023
mine-0.023
years-0.022
line-0.021
Token looked
Token ID3114
Feature activation+6.21e-01

Pos logit contributions
 dod+0.030
 stops+0.027
 opens+0.027
 unlocks+0.026
 pulls+0.026
Neg logit contributions
be-0.025
Sus-0.025
mine-0.024
years-0.024
line-0.023
Token everywhere
Token ID8347
Feature activation-8.08e+00

Pos logit contributions
 especially+0.022
years+0.022
be+0.020
 particularly+0.020
mine+0.019
Neg logit contributions
 dod-0.030
 unlocks-0.026
 stops-0.026
 opens-0.025
 pulls-0.025
Token,
Token ID11
Feature activation+4.56e-01

Pos logit contributions
 dod+0.338
 stops+0.289
 opens+0.287
 unlocks+0.276
 pulls+0.272
Neg logit contributions
be-0.345
Sus-0.340
mine-0.332
years-0.326
 especially-0.322
Token but
Token ID475
Feature activation+1.96e-01

Pos logit contributions
be+0.019
mine+0.019
Sus+0.019
years+0.018
 especially+0.018
Neg logit contributions
 dod-0.019
 stops-0.016
 opens-0.016
 unlocks-0.016
 pulls-0.015
Token the
Token ID262
Feature activation-2.03e+00

Pos logit contributions
years+0.006
mine+0.005
be+0.005
 especially+0.005
Sus+0.005
Neg logit contributions
 dod-0.011
 stops-0.009
 opens-0.009
 pulls-0.009
ccoli-0.009
Token yarn
Token ID21181
Feature activation-3.41e+00

Pos logit contributions
 dod+0.081
 opens+0.070
 unlocks+0.069
 stops+0.069
 pulls+0.066
Neg logit contributions
be-0.086
 especially-0.081
 se-0.080
Sus-0.079
mine-0.079
Token was
Token ID373
Feature activation-8.34e-01

Pos logit contributions
 dod+0.166
 opens+0.142
 stops+0.142
 unlocks+0.137
ccoli+0.135
Neg logit contributions
mine-0.117
be-0.116
ina-0.107
 se-0.106
 especially-0.105
Token eyes
Token ID2951
Feature activation+1.18e+00

Pos logit contributions
be+0.010
 3+0.009
 se+0.009
 suit+0.008
 wedding+0.008
Neg logit contributions
 dod-0.017
 Ready-0.015
 stops-0.015
ccoli-0.015
 lets-0.015
Token and
Token ID290
Feature activation+4.03e-02

Pos logit contributions
be+0.056
Sus+0.054
mine+0.054
years+0.053
 se+0.052
Neg logit contributions
 dod-0.043
 stops-0.037
 opens-0.036
 unlocks-0.035
 pulls-0.034
Token said
Token ID531
Feature activation+1.55e+00

Pos logit contributions
be+0.001
Sus+0.001
years+0.001
 se+0.001
mine+0.001
Neg logit contributions
 dod-0.002
 stops-0.002
 opens-0.002
 unlocks-0.002
 falls-0.002
Token,
Token ID11
Feature activation-2.86e-01

Pos logit contributions
be+0.081
Sus+0.079
years+0.077
mine+0.077
 se+0.074
Neg logit contributions
 dod-0.049
 stops-0.041
 opens-0.041
 unlocks-0.038
 falls-0.037
Token "
Token ID366
Feature activation-7.96e-01

Pos logit contributions
 dod+0.010
 opens+0.009
 stops+0.009
 unlocks+0.009
 falls+0.008
Neg logit contributions
be-0.014
Sus-0.013
mine-0.013
years-0.013
 se-0.013
Tokenready
Token ID1493
Feature activation-8.07e+00

Pos logit contributions
 dod+0.036
 opens+0.031
 stops+0.031
 Ready+0.030
 True+0.030
Neg logit contributions
be-0.029
years-0.028
mine-0.027
 se-0.026
Sus-0.026
Token or
Token ID393
Feature activation-4.04e+00

Pos logit contributions
 dod+0.355
 charges+0.304
 unlocks+0.293
 opens+0.290
 closes+0.287
Neg logit contributions
be-0.341
mine-0.321
days-0.291
line-0.284
h-0.276
Token not
Token ID407
Feature activation-2.22e+00

Pos logit contributions
 dod+0.163
 opens+0.134
 stops+0.134
ccoli+0.129
 unlocks+0.128
Neg logit contributions
be-0.172
 se-0.169
mine-0.165
Sus-0.163
 especially-0.161
Token,
Token ID11
Feature activation-9.39e-01

Pos logit contributions
 dod+0.098
ccoli+0.084
 stops+0.076
 pulls+0.076
 falls+0.074
Neg logit contributions
 se-0.086
be-0.086
Sus-0.084
mine-0.080
 especially-0.080
Token here
Token ID994
Feature activation-4.28e+00

Pos logit contributions
 dod+0.041
 stops+0.035
 opens+0.034
 unlocks+0.033
 falls+0.033
Neg logit contributions
be-0.038
Sus-0.037
mine-0.036
 se-0.036
years-0.035
Token I
Token ID314
Feature activation+1.72e-02

Pos logit contributions
 dod+0.186
 charges+0.154
ses+0.144
 pulls+0.141
 aims+0.140
Neg logit contributions
be-0.163
 especially-0.156
 se-0.154
Sus-0.151
mine-0.146
Token and
Token ID290
Feature activation-1.19e+00

Pos logit contributions
 dod+0.017
 stops+0.015
 opens+0.015
 unlocks+0.014
 falls+0.014
Neg logit contributions
be-0.025
Sus-0.024
years-0.024
mine-0.023
 se-0.022
Token scared
Token ID12008
Feature activation+2.22e-02

Pos logit contributions
 dod+0.051
 stops+0.048
 opens+0.047
 falls+0.044
 unlocks+0.044
Neg logit contributions
be-0.049
Sus-0.047
mine-0.046
years-0.045
ina-0.042
Token.
Token ID13
Feature activation-4.55e-01

Pos logit contributions
be+0.001
Sus+0.001
years+0.001
mine+0.001
 se+0.001
Neg logit contributions
 dod-0.001
 stops-0.001
 opens-0.001
 falls-0.001
 True-0.001
Token She
Token ID1375
Feature activation-8.19e-01

Pos logit contributions
 dod+0.022
 opens+0.020
 unlocks+0.019
 pulls+0.019
 stops+0.019
Neg logit contributions
Sus-0.015
be-0.014
years-0.014
mine-0.014
ott-0.014
Token looked
Token ID3114
Feature activation+5.80e-01

Pos logit contributions
 dod+0.044
 stops+0.039
 opens+0.039
 unlocks+0.038
 falls+0.037
Neg logit contributions
be-0.025
Sus-0.023
years-0.023
mine-0.023
 se-0.022
Token everywhere
Token ID8347
Feature activation-8.04e+00

Pos logit contributions
 especially+0.018
years+0.018
be+0.016
 particularly+0.016
 dressed+0.016
Neg logit contributions
 dod-0.030
 pulls-0.026
 opens-0.026
 unlocks-0.026
ccoli-0.025
Token,
Token ID11
Feature activation+6.01e-01

Pos logit contributions
 dod+0.320
 opens+0.279
 stops+0.279
 unlocks+0.265
 pulls+0.258
Neg logit contributions
be-0.355
Sus-0.345
mine-0.338
years-0.337
 se-0.324
Token but
Token ID475
Feature activation+1.63e-01

Pos logit contributions
be+0.026
Sus+0.025
mine+0.025
years+0.025
 especially+0.024
Neg logit contributions
 dod-0.024
 opens-0.021
 stops-0.021
 unlocks-0.020
 pulls-0.020
Token she
Token ID673
Feature activation-5.51e-01

Pos logit contributions
years+0.005
 especially+0.004
Sus+0.004
mine+0.004
be+0.004
Neg logit contributions
 dod-0.009
 opens-0.008
 stops-0.008
 unlocks-0.008
 pulls-0.008
Token could
Token ID714
Feature activation+8.31e-02

Pos logit contributions
 dod+0.025
 stops+0.022
 opens+0.022
 unlocks+0.021
 falls+0.021
Neg logit contributions
be-0.022
Sus-0.021
mine-0.020
years-0.020
 se-0.019
Token not
Token ID407
Feature activation-6.45e-01

Pos logit contributions
be+0.004
 especially+0.004
mine+0.004
Sus+0.004
 se+0.003
Neg logit contributions
 dod-0.003
 opens-0.003
 stops-0.003
 unlocks-0.003
 pulls-0.003
Token was
Token ID373
Feature activation-4.95e-01

Pos logit contributions
 dod+0.183
 Ready+0.161
 Deal+0.158
 opens+0.153
 stops+0.152
Neg logit contributions
be-0.100
 se-0.091
 especially-0.087
mine-0.087
ina-0.082
Token missing
Token ID4814
Feature activation-1.43e+00

Pos logit contributions
 dod+0.021
 stops+0.019
 opens+0.019
 unlocks+0.018
 falls+0.017
Neg logit contributions
be-0.020
Sus-0.020
years-0.019
mine-0.019
 se-0.019
Token.
Token ID13
Feature activation-5.46e-01

Pos logit contributions
 dod+0.081
 stops+0.069
 pulls+0.067
 AW+0.067
 opens+0.067
Neg logit contributions
ful-0.039
 especially-0.039
 se-0.037
be-0.035
Sus-0.035
Token He
Token ID679
Feature activation-1.39e+00

Pos logit contributions
 dod+0.025
 opens+0.022
 stops+0.022
 falls+0.021
 unlocks+0.021
Neg logit contributions
Sus-0.021
be-0.020
years-0.019
mine-0.018
ott-0.018
Token looked
Token ID3114
Feature activation+8.05e-01

Pos logit contributions
 dod+0.071
 stops+0.064
 opens+0.064
 unlocks+0.061
 falls+0.060
Neg logit contributions
be-0.046
Sus-0.044
years-0.043
mine-0.043
 se-0.040
Token everywhere
Token ID8347
Feature activation-8.03e+00

Pos logit contributions
 especially+0.026
years+0.026
 se+0.024
be+0.024
 particularly+0.024
Neg logit contributions
 dod-0.040
 unlocks-0.034
ccoli-0.034
 opens-0.034
 Ready-0.034
Token for
Token ID329
Feature activation-2.47e-01

Pos logit contributions
 dod+0.333
 opens+0.285
 stops+0.283
 unlocks+0.275
 pulls+0.267
Neg logit contributions
be-0.343
Sus-0.335
mine-0.328
 se-0.323
 especially-0.320
Token it
Token ID340
Feature activation+1.15e-02

Pos logit contributions
 dod+0.011
 stops+0.010
 opens+0.010
 unlocks+0.010
 falls+0.009
Neg logit contributions
be-0.009
Sus-0.009
years-0.009
mine-0.009
 se-0.009
Token.
Token ID13
Feature activation-4.00e-01

Pos logit contributions
be+0.000
 especially+0.000
mine+0.000
line+0.000
ina+0.000
Neg logit contributions
 dod-0.001
 stops-0.000
 opens-0.000
 pulls-0.000
 charges-0.000
Token He
Token ID679
Feature activation-1.25e+00

Pos logit contributions
 dod+0.018
 opens+0.016
 stops+0.016
 falls+0.015
 disappears+0.015
Neg logit contributions
Sus-0.015
be-0.015
years-0.014
 especially-0.014
mine-0.013
Token was
Token ID373
Feature activation-4.08e-01

Pos logit contributions
 dod+0.062
 stops+0.057
 opens+0.057
 unlocks+0.054
 pulls+0.053
Neg logit contributions
be-0.043
Sus-0.041
mine-0.041
years-0.039
ina-0.037
Token she
Token ID673
Feature activation-2.62e-01

Pos logit contributions
 dod+0.132
 stops+0.120
 opens+0.118
 unlocks+0.111
 falls+0.110
Neg logit contributions
be-0.139
Sus-0.133
years-0.132
mine-0.131
ina-0.123
Token was
Token ID373
Feature activation-6.18e-01

Pos logit contributions
 dod+0.014
ccoli+0.012
 Ready+0.012
 stops+0.012
 disappears+0.012
Neg logit contributions
be-0.008
 especially-0.008
mine-0.007
 se-0.007
years-0.007
Token.
Token ID13
Feature activation-1.28e+00

Pos logit contributions
 dod+0.033
 stops+0.029
 opens+0.029
 unlocks+0.028
 pulls+0.028
Neg logit contributions
be-0.020
Sus-0.018
mine-0.018
 se-0.018
 especially-0.017
Token They
Token ID1119
Feature activation+7.61e-01

Pos logit contributions
 dod+0.052
 stops+0.046
 opens+0.045
 unlocks+0.043
 falls+0.042
Neg logit contributions
be-0.056
Sus-0.054
years-0.053
mine-0.053
 se-0.050
Token looked
Token ID3114
Feature activation+5.43e-01

Pos logit contributions
be+0.031
Sus+0.030
years+0.029
mine+0.028
line+0.027
Neg logit contributions
 dod-0.032
 stops-0.031
 opens-0.030
 pulls-0.028
 unlocks-0.028
Token everywhere
Token ID8347
Feature activation-8.02e+00

Pos logit contributions
 especially+0.017
years+0.017
be+0.017
 dressed+0.016
 se+0.016
Neg logit contributions
 dod-0.027
 unlocks-0.023
 opens-0.023
ccoli-0.023
 pulls-0.023
Token.
Token ID13
Feature activation-9.54e-01

Pos logit contributions
 dod+0.328
 stops+0.281
 opens+0.280
 unlocks+0.269
 pulls+0.263
Neg logit contributions
be-0.346
Sus-0.338
mine-0.332
years-0.324
 se-0.321
Token They
Token ID1119
Feature activation+8.98e-01

Pos logit contributions
 dod+0.038
 stops+0.033
 opens+0.033
 unlocks+0.031
 falls+0.031
Neg logit contributions
be-0.042
Sus-0.041
mine-0.040
years-0.040
 se-0.038
Token asked
Token ID1965
Feature activation+3.69e-01

Pos logit contributions
be+0.039
Sus+0.038
years+0.035
 occasion+0.035
ina+0.035
Neg logit contributions
 dod-0.035
 stops-0.034
 opens-0.034
 pulls-0.032
 unlocks-0.031
Token the
Token ID262
Feature activation-2.36e+00

Pos logit contributions
years+0.012
Sus+0.012
be+0.012
mine+0.012
 especially+0.012
Neg logit contributions
 dod-0.019
 opens-0.017
 unlocks-0.016
 stops-0.016
 pulls-0.016
Token neighbors
Token ID12020
Feature activation-3.04e+00

Pos logit contributions
 dod+0.107
 opens+0.096
 stops+0.095
 unlocks+0.094
 pulls+0.091
Neg logit contributions
be-0.090
Sus-0.085
 se-0.083
years-0.083
mine-0.082
Token You
Token ID921
Feature activation-6.40e-01

Pos logit contributions
 dod+0.078
ls+0.074
 stops+0.066
 opens+0.064
 disappears+0.064
Neg logit contributions
be-0.086
 3-0.073
Sus-0.073
 paramed-0.070
years-0.070
Token have
Token ID423
Feature activation+7.00e-01

Pos logit contributions
 dod+0.025
 stops+0.021
 opens+0.020
 Patch+0.020
 charges+0.020
Neg logit contributions
be-0.030
mine-0.027
 se-0.027
 especially-0.025
 dressed-0.024
Token five
Token ID1936
Feature activation-2.20e+00

Pos logit contributions
be+0.026
Sus+0.024
years+0.024
mine+0.024
 se+0.023
Neg logit contributions
 dod-0.033
 stops-0.030
 opens-0.029
 unlocks-0.028
 falls-0.028
Token minutes
Token ID2431
Feature activation+9.60e-01

Pos logit contributions
 dod+0.101
ccoli+0.093
 stops+0.092
 Patch+0.091
 unlocks+0.090
Neg logit contributions
be-0.083
 se-0.072
years-0.071
 especially-0.070
Sus-0.069
Token.
Token ID13
Feature activation-1.87e+00

Pos logit contributions
be+0.045
mine+0.043
Sus+0.043
years+0.041
ina+0.040
Neg logit contributions
 dod-0.037
 stops-0.031
 opens-0.031
 unlocks-0.029
 pulls-0.029
Token Ready
Token ID23432
Feature activation-8.01e+00

Pos logit contributions
 dod+0.073
ls+0.069
 stops+0.065
 disappears+0.062
 falls+0.061
Neg logit contributions
be-0.080
Sus-0.071
years-0.070
 3-0.069
 paramed-0.068
Token,
Token ID11
Feature activation-7.93e-02

Pos logit contributions
 dod+0.370
 charges+0.294
 Patch+0.290
 unlocks+0.285
 aims+0.284
Neg logit contributions
be-0.340
mine-0.336
days-0.300
h-0.300
ina-0.295
Token set
Token ID900
Feature activation+1.04e-01

Pos logit contributions
 dod+0.004
 stops+0.003
 opens+0.003
 unlocks+0.003
 falls+0.003
Neg logit contributions
be-0.003
 se-0.003
Sus-0.003
mine-0.003
 especially-0.002
Token,
Token ID11
Feature activation-2.08e-01

Pos logit contributions
be+0.004
mine+0.004
h+0.004
Sus+0.004
 se+0.004
Neg logit contributions
 dod-0.004
 stops-0.004
ccoli-0.004
 opens-0.004
 unlocks-0.004
Token go
Token ID467
Feature activation-3.10e+00

Pos logit contributions
 dod+0.009
 stops+0.008
 opens+0.008
 unlocks+0.008
 disappears+0.008
Neg logit contributions
be-0.008
 se-0.008
Sus-0.008
mine-0.008
 especially-0.007
Token!"
Token ID2474
Feature activation+1.83e+00

Pos logit contributions
 dod+0.094
 stops+0.077
 opens+0.076
 unlocks+0.071
 pulls+0.069
Neg logit contributions
be-0.170
Sus-0.162
mine-0.161
years-0.156
 se-0.155
Token he
Token ID339
Feature activation-8.98e-01

Pos logit contributions
 dod+0.143
 stops+0.127
 opens+0.126
 unlocks+0.120
 falls+0.118
Neg logit contributions
be-0.134
Sus-0.129
mine-0.127
years-0.127
 se-0.120
Token was
Token ID373
Feature activation-6.79e-01

Pos logit contributions
 dod+0.050
ccoli+0.042
 Ready+0.041
 AW+0.041
 stops+0.040
Neg logit contributions
be-0.026
 especially-0.025
mine-0.024
 se-0.024
 decided-0.023
Token.
Token ID13
Feature activation-1.41e+00

Pos logit contributions
 dod+0.035
 stops+0.029
 opens+0.029
 pulls+0.028
ccoli+0.028
Neg logit contributions
be-0.024
mine-0.022
 especially-0.022
 se-0.022
Sus-0.021
Token They
Token ID1119
Feature activation+6.63e-01

Pos logit contributions
 dod+0.054
 stops+0.048
 opens+0.048
 unlocks+0.045
 falls+0.044
Neg logit contributions
be-0.065
Sus-0.062
years-0.061
mine-0.061
 se-0.058
Token looked
Token ID3114
Feature activation+5.32e-01

Pos logit contributions
be+0.027
Sus+0.026
years+0.025
mine+0.024
ina+0.024
Neg logit contributions
 dod-0.028
 stops-0.027
 opens-0.026
 pulls-0.025
 unlocks-0.024
Token everywhere
Token ID8347
Feature activation-8.01e+00

Pos logit contributions
 especially+0.017
years+0.016
be+0.016
 dressed+0.015
 particularly+0.015
Neg logit contributions
 dod-0.027
 unlocks-0.023
ccoli-0.023
 opens-0.023
 pulls-0.022
Token.
Token ID13
Feature activation-1.02e+00

Pos logit contributions
 dod+0.331
 opens+0.282
 stops+0.281
 unlocks+0.271
 pulls+0.264
Neg logit contributions
be-0.343
Sus-0.334
mine-0.328
 se-0.320
years-0.320
Token They
Token ID1119
Feature activation+8.99e-01

Pos logit contributions
 dod+0.041
 stops+0.036
 opens+0.035
 unlocks+0.033
 falls+0.033
Neg logit contributions
be-0.045
Sus-0.044
mine-0.043
years-0.043
 se-0.041
Token asked
Token ID1965
Feature activation+3.73e-01

Pos logit contributions
be+0.038
Sus+0.038
years+0.035
 occasion+0.035
ina+0.035
Neg logit contributions
 dod-0.036
 stops-0.034
 opens-0.034
 pulls-0.032
 falls-0.032
Token the
Token ID262
Feature activation-2.30e+00

Pos logit contributions
be+0.012
years+0.012
Sus+0.012
mine+0.012
 especially+0.012
Neg logit contributions
 dod-0.019
 opens-0.016
 stops-0.016
 unlocks-0.016
 pulls-0.016
Token park
Token ID3952
Feature activation-3.43e+00

Pos logit contributions
 dod+0.111
 opens+0.098
 unlocks+0.097
 falls+0.095
 stops+0.095
Neg logit contributions
be-0.082
Sus-0.076
 se-0.075
mine-0.074
years-0.074
Token pink
Token ID11398
Feature activation-6.69e-01

Pos logit contributions
 dod+0.100
 opens+0.085
 unlocks+0.085
 stops+0.084
 pulls+0.083
Neg logit contributions
 se-0.057
be-0.055
 occasion-0.055
 3-0.054
 especially-0.054
Token dress
Token ID6576
Feature activation+2.54e+00

Pos logit contributions
 dod+0.032
 opens+0.028
 stops+0.028
 unlocks+0.027
 pulls+0.027
Neg logit contributions
be-0.025
mine-0.024
Sus-0.022
ina-0.022
 se-0.022
Token.
Token ID13
Feature activation-1.31e+00

Pos logit contributions
be+0.112
mine+0.111
Sus+0.105
ina+0.104
line+0.103
Neg logit contributions
 dod-0.106
 stops-0.086
 opens-0.084
 pulls-0.083
 unlocks-0.079
Token I
Token ID314
Feature activation-2.11e-02

Pos logit contributions
 dod+0.061
 stops+0.053
 opens+0.050
 disappears+0.050
 falls+0.049
Neg logit contributions
be-0.049
Sus-0.047
years-0.046
mine-0.044
 especially-0.042
Token looked
Token ID3114
Feature activation+3.99e-01

Pos logit contributions
 dod+0.001
 stops+0.001
 opens+0.001
 unlocks+0.001
 falls+0.001
Neg logit contributions
be-0.001
Sus-0.001
mine-0.001
years-0.001
 se-0.001
Token everywhere
Token ID8347
Feature activation-7.98e+00

Pos logit contributions
years+0.014
be+0.014
 especially+0.013
Sus+0.012
 se+0.012
Neg logit contributions
 dod-0.020
 stops-0.017
 pulls-0.017
 unlocks-0.017
 opens-0.017
Token."
Token ID526
Feature activation+1.62e+00

Pos logit contributions
 dod+0.373
 stops+0.323
 opens+0.318
 unlocks+0.315
 pulls+0.313
Neg logit contributions
be-0.309
 especially-0.297
Sus-0.296
mine-0.287
years-0.286
Token Her
Token ID2332
Feature activation-1.71e-01

Pos logit contributions
Sus+0.055
be+0.055
mine+0.053
ina+0.050
years+0.049
Neg logit contributions
 dod-0.080
 opens-0.070
 unlocks-0.068
 stops-0.067
 pulls-0.067
Token mom
Token ID1995
Feature activation-3.56e-01

Pos logit contributions
 dod+0.008
 unlocks+0.007
 opens+0.007
 disappears+0.007
 stops+0.006
Neg logit contributions
be-0.006
 se-0.006
Sus-0.006
ina-0.006
 especially-0.006
Token said
Token ID531
Feature activation+1.29e+00

Pos logit contributions
 dod+0.016
 opens+0.014
 stops+0.014
 unlocks+0.013
 pulls+0.013
Neg logit contributions
be-0.014
Sus-0.013
years-0.013
mine-0.013
line-0.013
Token,
Token ID11
Feature activation-8.19e-01

Pos logit contributions
be+0.068
Sus+0.066
mine+0.065
years+0.065
 se+0.062
Neg logit contributions
 dod-0.041
 stops-0.034
 opens-0.034
 unlocks-0.032
 pulls-0.031
Token,
Token ID11
Feature activation+1.21e+00

Pos logit contributions
 dod+0.119
ccoli+0.112
 stops+0.107
 unlocks+0.106
 Ready+0.105
Neg logit contributions
be-0.127
 se-0.113
h-0.109
mine-0.109
 especially-0.104
Token nine
Token ID5193
Feature activation-3.16e+00

Pos logit contributions
be+0.047
mine+0.042
years+0.042
Sus+0.041
 se+0.040
Neg logit contributions
 dod-0.056
 stops-0.049
 opens-0.048
 unlocks-0.047
 disappears-0.046
Token,
Token ID11
Feature activation-4.81e-02

Pos logit contributions
 dod+0.127
ccoli+0.114
 stops+0.108
 Ready+0.106
 unlocks+0.106
Neg logit contributions
be-0.130
h-0.115
 se-0.108
mine-0.107
 especially-0.105
Token ten
Token ID3478
Feature activation-3.51e+00

Pos logit contributions
 dod+0.002
 stops+0.002
 opens+0.002
 unlocks+0.002
 falls+0.002
Neg logit contributions
be-0.002
mine-0.002
years-0.002
Sus-0.002
 se-0.002
Token.
Token ID13
Feature activation-1.39e+00

Pos logit contributions
 dod+0.129
 stops+0.112
ccoli+0.111
 unlocks+0.108
 Ready+0.105
Neg logit contributions
be-0.161
h-0.144
mine-0.141
Sus-0.140
years-0.140
Token Ready
Token ID23432
Feature activation-7.98e+00

Pos logit contributions
 dod+0.053
 stops+0.047
 opens+0.044
 falls+0.044
 disappears+0.043
Neg logit contributions
be-0.064
Sus-0.060
years-0.059
mine-0.058
 especially-0.056
Token or
Token ID393
Feature activation-3.39e+00

Pos logit contributions
 dod+0.358
 charges+0.279
 stops+0.272
ccoli+0.269
 Patch+0.268
Neg logit contributions
mine-0.341
be-0.341
ina-0.308
h-0.305
days-0.303
Token not
Token ID407
Feature activation-1.38e+00

Pos logit contributions
 dod+0.143
 stops+0.123
 opens+0.121
 unlocks+0.117
ccoli+0.116
Neg logit contributions
be-0.139
mine-0.132
Sus-0.130
 se-0.129
years-0.126
Token,
Token ID11
Feature activation-5.70e-01

Pos logit contributions
 dod+0.062
ccoli+0.054
 pulls+0.046
 stops+0.046
ses+0.045
Neg logit contributions
 se-0.050
be-0.050
Sus-0.050
h-0.049
mine-0.048
Token here
Token ID994
Feature activation-3.85e+00

Pos logit contributions
 dod+0.025
 stops+0.022
 opens+0.021
 unlocks+0.020
 falls+0.020
Neg logit contributions
be-0.023
Sus-0.022
mine-0.022
years-0.021
 se-0.021
Token I
Token ID314
Feature activation+3.16e-01

Pos logit contributions
 dod+0.178
 charges+0.149
ses+0.137
 aims+0.135
 fist+0.135
Neg logit contributions
be-0.135
mine-0.131
 especially-0.129
Sus-0.123
 weekend-0.123
Token would
Token ID561
Feature activation+1.55e+00

Pos logit contributions
be+0.005
mine+0.004
 especially+0.004
 se+0.004
Sus+0.004
Neg logit contributions
 dod-0.009
 opens-0.008
 stops-0.008
 unlocks-0.008
 pulls-0.007
Token swim
Token ID9422
Feature activation-6.08e-01

Pos logit contributions
be+0.070
Sus+0.069
years+0.069
mine+0.066
ina+0.062
Neg logit contributions
 dod-0.060
 stops-0.053
 opens-0.053
 falls-0.049
 pulls-0.048
Token around
Token ID1088
Feature activation+6.33e-01

Pos logit contributions
 dod+0.026
 stops+0.022
 opens+0.022
 unlocks+0.021
 pulls+0.021
Neg logit contributions
be-0.026
Sus-0.024
mine-0.024
years-0.024
 especially-0.023
Token in
Token ID287
Feature activation+1.53e-01

Pos logit contributions
be+0.028
mine+0.027
Sus+0.027
years+0.027
 especially+0.026
Neg logit contributions
 dod-0.025
 stops-0.022
 opens-0.021
 unlocks-0.021
 pulls-0.020
Token the
Token ID262
Feature activation+3.63e+00

Pos logit contributions
be+0.007
Sus+0.007
mine+0.007
years+0.007
 se+0.007
Neg logit contributions
 dod-0.006
 stops-0.005
 opens-0.005
 unlocks-0.004
 pulls-0.004
Token tub
Token ID12202
Feature activation+7.91e+00

Pos logit contributions
years+0.143
mine+0.139
be+0.138
Sus+0.136
ina+0.126
Neg logit contributions
 dod-0.160
 stops-0.152
 opens-0.147
 falls-0.140
 closes-0.139
Token.
Token ID13
Feature activation+8.06e-01

Pos logit contributions
years+0.436
Sus+0.427
be+0.419
 paramed+0.401
mine+0.400
Neg logit contributions
 dod-0.182
 stops-0.171
 opens-0.164
 falls-0.158
 unlocks-0.150
Token He
Token ID679
Feature activation-3.79e-01

Pos logit contributions
be+0.033
Sus+0.032
mine+0.032
years+0.031
 especially+0.030
Neg logit contributions
 dod-0.034
 stops-0.030
 opens-0.030
 unlocks-0.029
 falls-0.028
Token had
Token ID550
Feature activation+1.89e+00

Pos logit contributions
 dod+0.019
 stops+0.017
 opens+0.017
 falls+0.016
 unlocks+0.016
Neg logit contributions
be-0.013
Sus-0.013
years-0.012
mine-0.012
ina-0.011
Token so
Token ID523
Feature activation+3.18e+00

Pos logit contributions
mine+0.083
be+0.079
Sus+0.079
ina+0.077
years+0.077
Neg logit contributions
 dod-0.076
 opens-0.068
 stops-0.068
 pulls-0.064
 True-0.063
Token much
Token ID881
Feature activation-1.01e+00

Pos logit contributions
be+0.125
Sus+0.120
mine+0.118
years+0.118
 se+0.111
Neg logit contributions
 dod-0.143
 stops-0.128
 opens-0.127
 unlocks-0.120
 falls-0.119
Token could
Token ID714
Feature activation+1.19e+00

Pos logit contributions
Sus+0.038
years+0.036
ules+0.035
line+0.034
 occasion+0.034
Neg logit contributions
 dod-0.041
 stops-0.039
 opens-0.037
 pulls-0.035
 has-0.035
Token play
Token ID711
Feature activation+1.08e+00

Pos logit contributions
years+0.048
be+0.047
Sus+0.046
mine+0.044
year+0.043
Neg logit contributions
 dod-0.050
 opens-0.046
 stops-0.045
 unlocks-0.044
 pulls-0.043
Token with
Token ID351
Feature activation+5.52e-01

Pos logit contributions
be+0.046
years+0.044
Sus+0.044
mine+0.044
ina+0.041
Neg logit contributions
 dod-0.044
 stops-0.039
 opens-0.039
 unlocks-0.037
 falls-0.036
Token the
Token ID262
Feature activation+4.91e-01

Pos logit contributions
be+0.024
Sus+0.024
years+0.023
mine+0.023
 se+0.023
Neg logit contributions
 dod-0.022
 stops-0.019
 opens-0.019
 unlocks-0.018
 falls-0.018
Token f
Token ID277
Feature activation+5.16e+00

Pos logit contributions
be+0.018
Sus+0.017
years+0.017
mine+0.017
 se+0.016
Neg logit contributions
 dod-0.023
 stops-0.021
 opens-0.021
 unlocks-0.020
 falls-0.020
Tokenauc
Token ID14272
Feature activation+6.99e+00

Pos logit contributions
mine+0.108
be+0.104
Sus+0.101
years+0.099
 especially+0.092
Neg logit contributions
 dod-0.312
 stops-0.285
 opens-0.284
 unlocks-0.279
 falls-0.272
Tokenet
Token ID316
Feature activation+4.26e+00

Pos logit contributions
be+0.227
Sus+0.216
years+0.211
mine+0.211
 se+0.196
Neg logit contributions
 dod-0.358
 stops-0.326
 opens-0.324
 unlocks-0.311
 falls-0.306
Token knowing
Token ID6970
Feature activation+9.78e-01

Pos logit contributions
be+0.184
Sus+0.179
years+0.176
mine+0.174
 se+0.166
Neg logit contributions
 dod-0.171
 stops-0.152
 opens-0.150
 unlocks-0.142
 falls-0.140
Token that
Token ID326
Feature activation+1.16e+00

Pos logit contributions
Sus+0.034
be+0.033
 se+0.031
years+0.031
 especially+0.031
Neg logit contributions
 dod-0.049
 stops-0.043
 opens-0.042
 pulls-0.041
 unlocks-0.041
Token it
Token ID340
Feature activation-6.85e-01

Pos logit contributions
be+0.044
Sus+0.043
mine+0.042
years+0.041
 se+0.040
Neg logit contributions
 dod-0.053
 stops-0.047
 opens-0.047
 unlocks-0.045
 pulls-0.044
Token would
Token ID561
Feature activation+2.63e-01

Pos logit contributions
 dod+0.036
 stops+0.031
 opens+0.031
 falls+0.030
 disappears+0.029
Neg logit contributions
be-0.022
mine-0.020
 especially-0.020
Sus-0.020
 se-0.020
Token pair
Token ID5166
Feature activation+4.04e+00

Pos logit contributions
 dod+0.031
 stops+0.029
 opens+0.027
 unlocks+0.025
 falls+0.025
Neg logit contributions
be-0.036
mine-0.035
years-0.035
Sus-0.034
line-0.032
Token put
Token ID1234
Feature activation+3.00e+00

Pos logit contributions
be+0.146
Sus+0.143
years+0.143
ina+0.131
mine+0.127
Neg logit contributions
 dod-0.181
 opens-0.168
 stops-0.168
 falls-0.158
 pulls-0.158
Token on
Token ID319
Feature activation-1.21e+00

Pos logit contributions
be+0.121
Sus+0.117
mine+0.115
years+0.114
 se+0.108
Neg logit contributions
 dod-0.131
 stops-0.117
 opens-0.116
 unlocks-0.111
 falls-0.109
Token their
Token ID511
Feature activation+2.64e+00

Pos logit contributions
 dod+0.056
 stops+0.050
 opens+0.050
 unlocks+0.048
 falls+0.047
Neg logit contributions
be-0.046
Sus-0.044
mine-0.043
years-0.043
 se-0.040
Token rain
Token ID6290
Feature activation+1.52e+00

Pos logit contributions
be+0.090
Sus+0.084
years+0.083
 se+0.083
mine+0.081
Neg logit contributions
 dod-0.132
 stops-0.116
 opens-0.115
 unlocks-0.113
 falls-0.110
Tokenco
Token ID1073
Feature activation+6.84e+00

Pos logit contributions
be+0.054
Sus+0.051
mine+0.050
years+0.049
 se+0.047
Neg logit contributions
 dod-0.074
 stops-0.066
 opens-0.066
 unlocks-0.063
 falls-0.062
Tokenats
Token ID1381
Feature activation+9.80e-01

Pos logit contributions
be+0.319
Sus+0.309
years+0.304
mine+0.300
 se+0.295
Neg logit contributions
 dod-0.243
 opens-0.216
 stops-0.214
 unlocks-0.202
 falls-0.199
Token and
Token ID290
Feature activation-9.67e-02

Pos logit contributions
be+0.049
Sus+0.047
mine+0.047
years+0.046
 se+0.045
Neg logit contributions
 dod-0.033
 stops-0.028
 opens-0.028
 unlocks-0.026
 falls-0.026
Token went
Token ID1816
Feature activation-1.31e+00

Pos logit contributions
 dod+0.004
 stops+0.004
 opens+0.004
 unlocks+0.004
 falls+0.004
Neg logit contributions
be-0.004
Sus-0.003
mine-0.003
years-0.003
 se-0.003
Token outside
Token ID2354
Feature activation-3.71e+00

Pos logit contributions
 dod+0.051
 stops+0.046
 opens+0.046
 unlocks+0.043
 falls+0.042
Neg logit contributions
be-0.058
years-0.055
Sus-0.055
mine-0.055
 se-0.052
Token.
Token ID13
Feature activation+6.55e-02

Pos logit contributions
 dod+0.098
 opens+0.091
 stops+0.088
 falls+0.077
 unlocks+0.076
Neg logit contributions
years-0.195
be-0.194
mine-0.192
Sus-0.192
ules-0.180
Token the
Token ID262
Feature activation-8.53e-01

Pos logit contributions
be+0.121
Sus+0.117
years+0.114
mine+0.112
ina+0.109
Neg logit contributions
 dod-0.103
 opens-0.092
 stops-0.091
 unlocks-0.086
 pulls-0.084
Token oy
Token ID35104
Feature activation-2.78e-01

Pos logit contributions
 dod+0.040
 opens+0.038
 stops+0.038
 unlocks+0.036
 pulls+0.034
Neg logit contributions
Sus-0.029
be-0.029
years-0.029
mine-0.028
ina-0.026
Tokensters
Token ID5937
Feature activation+3.04e+00

Pos logit contributions
 dod+0.012
 stops+0.011
 opens+0.011
 unlocks+0.011
 falls+0.010
Neg logit contributions
be-0.011
Sus-0.011
years-0.010
mine-0.010
 se-0.010
Token in
Token ID287
Feature activation-9.18e-01

Pos logit contributions
be+0.142
Sus+0.140
years+0.138
mine+0.135
ina+0.130
Neg logit contributions
 dod-0.107
 opens-0.096
 stops-0.095
 unlocks-0.091
 falls-0.088
Token a
Token ID257
Feature activation+1.64e+00

Pos logit contributions
 dod+0.030
 opens+0.027
 stops+0.026
 unlocks+0.025
 pulls+0.024
Neg logit contributions
be-0.046
years-0.044
Sus-0.044
mine-0.043
ina-0.042
Token pot
Token ID1787
Feature activation+6.71e+00

Pos logit contributions
be+0.068
Sus+0.068
mine+0.067
years+0.067
ina+0.065
Neg logit contributions
 dod-0.067
 opens-0.060
 stops-0.060
 unlocks-0.056
 falls-0.055
Token,
Token ID11
Feature activation+1.66e+00

Pos logit contributions
years+0.392
Sus+0.384
be+0.382
mine+0.361
add+0.349
Neg logit contributions
 dod-0.141
 opens-0.140
 stops-0.131
 unlocks-0.127
 falls-0.124
Token added
Token ID2087
Feature activation+1.96e+00

Pos logit contributions
be+0.070
Sus+0.068
years+0.067
mine+0.067
 se+0.066
Neg logit contributions
 dod-0.069
 stops-0.060
 opens-0.058
 unlocks-0.056
 falls-0.055
Token some
Token ID617
Feature activation+1.50e+00

Pos logit contributions
be+0.067
Sus+0.064
years+0.063
mine+0.063
 se+0.058
Neg logit contributions
 dod-0.097
 stops-0.088
 opens-0.087
 unlocks-0.084
 falls-0.082
Token butter
Token ID9215
Feature activation+1.58e+00

Pos logit contributions
be+0.057
 se+0.053
Sus+0.052
years+0.051
 especially+0.050
Neg logit contributions
 dod-0.070
 stops-0.062
 opens-0.060
 falls-0.059
 disappears-0.059
Token and
Token ID290
Feature activation-2.00e-01

Pos logit contributions
be+0.079
Sus+0.076
mine+0.075
years+0.075
 se+0.072
Neg logit contributions
 dod-0.054
 stops-0.047
 opens-0.046
 unlocks-0.043
 falls-0.042
Token always
Token ID1464
Feature activation+9.31e-02

Pos logit contributions
Sus+0.034
be+0.034
mine+0.034
years+0.033
line+0.033
Neg logit contributions
 dod-0.033
 stops-0.028
 opens-0.028
 True-0.027
 unlocks-0.026
Token happy
Token ID3772
Feature activation-8.39e-01

Pos logit contributions
mine+0.004
be+0.004
Sus+0.004
years+0.003
ina+0.003
Neg logit contributions
 dod-0.004
 opens-0.003
 stops-0.003
 unlocks-0.003
 True-0.003
Token to
Token ID284
Feature activation-4.39e-01

Pos logit contributions
 dod+0.040
 opens+0.036
 stops+0.035
 unlocks+0.035
 pulls+0.035
Neg logit contributions
be-0.031
Sus-0.029
 se-0.028
years-0.028
 especially-0.027
Token share
Token ID2648
Feature activation+1.90e+00

Pos logit contributions
 dod+0.013
 opens+0.012
 stops+0.011
 unlocks+0.011
 True+0.011
Neg logit contributions
years-0.023
be-0.023
Sus-0.022
mine-0.021
ules-0.021
Token with
Token ID351
Feature activation+1.33e+00

Pos logit contributions
 especially+0.060
 se+0.059
Sus+0.056
mine+0.055
be+0.053
Neg logit contributions
 dod-0.097
 stops-0.086
 pulls-0.085
 opens-0.084
 falls-0.084
Token each
Token ID1123
Feature activation+6.32e+00

Pos logit contributions
years+0.037
be+0.036
 especially+0.035
 se+0.034
Sus+0.034
Neg logit contributions
 dod-0.072
 opens-0.068
 stops-0.068
 unlocks-0.067
 falls-0.066
Token other
Token ID584
Feature activation+9.98e-01

Pos logit contributions
be+0.198
mine+0.184
years+0.176
 se+0.174
 especially+0.173
Neg logit contributions
 dod-0.333
 stops-0.305
 opens-0.300
 unlocks-0.297
 disappears-0.294
Token.
Token ID13
Feature activation+8.25e-02

Pos logit contributions
be+0.040
years+0.037
mine+0.036
 especially+0.036
ina+0.036
Neg logit contributions
 dod-0.043
 stops-0.039
 opens-0.039
 unlocks-0.038
 pulls-0.038
Token<|endoftext|>
Token ID50256
Feature activation-2.65e+00

Pos logit contributions
be+0.003
Sus+0.003
mine+0.003
years+0.003
ina+0.003
Neg logit contributions
 dod-0.003
 opens-0.003
 stops-0.003
 unlocks-0.003
 pulls-0.003
TokenOnce
Token ID7454
Feature activation-1.03e-01

Pos logit contributions
 dod+0.109
 opens+0.092
 stops+0.090
 pulls+0.088
 unlocks+0.087
Neg logit contributions
be-0.116
Sus-0.112
mine-0.112
ina-0.105
years-0.104
Token upon
Token ID2402
Feature activation-2.87e+00

Pos logit contributions
 dod+0.005
 opens+0.004
 stops+0.004
 unlocks+0.004
 pulls+0.004
Neg logit contributions
be-0.004
Sus-0.004
mine-0.004
years-0.004
ina-0.004
Token "
Token ID366
Feature activation+1.94e-01

Pos logit contributions
 dod+0.011
 opens+0.010
 falls+0.010
 disappears+0.010
 unlocks+0.009
Neg logit contributions
be-0.008
Sus-0.008
line-0.007
 3-0.007
 especially-0.007
TokenOkay
Token ID16454
Feature activation-2.16e+00

Pos logit contributions
 harbor+0.008
ina+0.008
mine+0.008
ann+0.008
be+0.008
Neg logit contributions
 dod-0.007
 Ready-0.006
 lets-0.006
 opens-0.006
 True-0.006
Token,
Token ID11
Feature activation-1.10e+00

Pos logit contributions
 dod+0.092
 opens+0.077
 stops+0.076
 unlocks+0.076
 disappears+0.074
Neg logit contributions
be-0.090
Sus-0.089
mine-0.089
 especially-0.085
heart-0.083
Token but
Token ID475
Feature activation-7.12e-01

Pos logit contributions
 dod+0.047
 stops+0.040
 disappears+0.039
 opens+0.039
 falls+0.039
Neg logit contributions
be-0.044
 especially-0.044
Sus-0.043
 se-0.041
heart-0.040
Token you
Token ID345
Feature activation-1.21e+00

Pos logit contributions
 dod+0.030
 stops+0.027
 opens+0.026
 unlocks+0.025
 falls+0.025
Neg logit contributions
be-0.030
Sus-0.028
 se-0.027
mine-0.027
 especially-0.027
Token must
Token ID1276
Feature activation+1.37e+00

Pos logit contributions
 dod+0.042
 stops+0.036
 opens+0.035
 unlocks+0.033
 falls+0.032
Neg logit contributions
be-0.060
mine-0.057
Sus-0.057
years-0.056
 se-0.055
Token wear
Token ID5806
Feature activation+4.78e-01

Pos logit contributions
be+0.061
Sus+0.058
mine+0.058
years+0.057
 se+0.055
Neg logit contributions
 dod-0.054
 stops-0.048
 opens-0.047
 unlocks-0.045
 falls-0.044
Token a
Token ID257
Feature activation+2.36e-02

Pos logit contributions
be+0.018
 se+0.018
Sus+0.017
years+0.017
mine+0.017
Neg logit contributions
 dod-0.021
 stops-0.019
 opens-0.019
 unlocks-0.018
 falls-0.018
Token life
Token ID1204
Feature activation+4.94e-01

Pos logit contributions
be+0.001
Sus+0.001
years+0.001
mine+0.001
ina+0.001
Neg logit contributions
 dod-0.001
 stops-0.001
 opens-0.001
 unlocks-0.001
 falls-0.001
Token vest
Token ID19750
Feature activation+1.11e-01

Pos logit contributions
be+0.021
Sus+0.020
years+0.020
mine+0.020
ina+0.018
Neg logit contributions
 dod-0.020
 stops-0.018
 opens-0.018
 unlocks-0.017
 falls-0.017
Token and
Token ID290
Feature activation-2.51e-01

Pos logit contributions
be+0.005
mine+0.005
Sus+0.005
years+0.004
 se+0.004
Neg logit contributions
 dod-0.005
 stops-0.004
 opens-0.004
 unlocks-0.004
 pulls-0.004
Token time
Token ID640
Feature activation-1.32e+00

Pos logit contributions
be+0.061
Sus+0.060
years+0.060
mine+0.059
 se+0.055
Neg logit contributions
 dod-0.055
 stops-0.049
 opens-0.048
 falls-0.045
 unlocks-0.045
Token together
Token ID1978
Feature activation+5.69e-01

Pos logit contributions
 dod+0.065
 opens+0.060
 stops+0.060
 unlocks+0.059
 pulls+0.056
Neg logit contributions
be-0.046
years-0.042
Sus-0.041
 se-0.040
 especially-0.039
Token,
Token ID11
Feature activation+1.64e+00

Pos logit contributions
be+0.023
 especially+0.022
mine+0.021
 se+0.021
ina+0.021
Neg logit contributions
 dod-0.025
 unlocks-0.022
 opens-0.021
 stops-0.021
 pulls-0.020
Token and
Token ID290
Feature activation-3.40e-01

Pos logit contributions
 especially+0.062
be+0.061
Sus+0.058
 se+0.057
mine+0.057
Neg logit contributions
 dod-0.073
 opens-0.067
 unlocks-0.067
 stops-0.066
 disappears-0.063
Token from
Token ID422
Feature activation-2.97e+00

Pos logit contributions
 dod+0.014
 opens+0.012
 stops+0.012
 unlocks+0.012
 pulls+0.011
Neg logit contributions
be-0.015
Sus-0.014
years-0.014
mine-0.014
 se-0.013
Token that
Token ID326
Feature activation+2.68e+00

Pos logit contributions
 dod+0.157
 opens+0.139
 unlocks+0.139
 stops+0.137
 pulls+0.134
Neg logit contributions
years-0.090
be-0.088
Sus-0.084
 especially-0.083
 se-0.083
Token day
Token ID1110
Feature activation-3.95e-01

Pos logit contributions
be+0.079
Sus+0.079
years+0.075
mine+0.074
 se+0.068
Neg logit contributions
 dod-0.144
 stops-0.134
 opens-0.131
 falls-0.127
 unlocks-0.125
Token on
Token ID319
Feature activation-2.67e-01

Pos logit contributions
 dod+0.017
 opens+0.015
 stops+0.015
 unlocks+0.014
 pulls+0.014
Neg logit contributions
be-0.017
mine-0.016
Sus-0.016
years-0.015
ina-0.015
Token,
Token ID11
Feature activation+1.54e+00

Pos logit contributions
 dod+0.012
 unlocks+0.011
 opens+0.011
 Ready+0.010
 stops+0.010
Neg logit contributions
ina-0.010
be-0.010
 especially-0.010
mine-0.010
Sus-0.010
Token Lily
Token ID20037
Feature activation+6.92e-01

Pos logit contributions
 especially+0.060
be+0.057
Sus+0.056
mine+0.055
 se+0.053
Neg logit contributions
 dod-0.068
 opens-0.064
 unlocks-0.063
 stops-0.060
 pulls-0.059
Token and
Token ID290
Feature activation+1.00e-01

Pos logit contributions
be+0.027
Sus+0.026
mine+0.025
years+0.025
 se+0.024
Neg logit contributions
 dod-0.031
 stops-0.028
 opens-0.028
 unlocks-0.026
 pulls-0.026
Token,
Token ID11
Feature activation+1.05e+00

Pos logit contributions
be+0.005
 especially+0.005
mine+0.005
years+0.005
Sus+0.005
Neg logit contributions
 dod-0.006
 stops-0.005
 unlocks-0.005
 opens-0.005
 pulls-0.005
Token enjoying
Token ID13226
Feature activation+2.13e+00

Pos logit contributions
 especially+0.039
be+0.038
Sus+0.036
 se+0.036
line+0.035
Neg logit contributions
 dod-0.048
 opens-0.043
 unlocks-0.043
 stops-0.042
 pulls-0.041
Token this
Token ID428
Feature activation+4.50e+00

Pos logit contributions
be+0.098
years+0.096
Sus+0.095
 especially+0.094
 se+0.093
Neg logit contributions
 dod-0.076
 stops-0.069
 opens-0.069
 unlocks-0.067
 pulls-0.063
Token warm
Token ID5814
Feature activation+4.19e+00

Pos logit contributions
 especially+0.173
be+0.173
 se+0.166
Sus+0.165
ina+0.162
Neg logit contributions
 dod-0.200
 stops-0.173
 pulls-0.172
 opens-0.172
 unlocks-0.169
Token sunny
Token ID27737
Feature activation+4.24e+00

Pos logit contributions
be+0.154
Sus+0.146
mine+0.145
years+0.139
 se+0.139
Neg logit contributions
 dod-0.197
 opens-0.173
 stops-0.173
 unlocks-0.168
 pulls-0.166
Token day
Token ID1110
Feature activation+2.76e+00

Pos logit contributions
be+0.117
Sus+0.109
mine+0.107
years+0.102
 se+0.101
Neg logit contributions
 dod-0.239
 stops-0.217
 opens-0.216
 unlocks-0.211
 pulls-0.208
Token."
Token ID526
Feature activation+2.89e+00

Pos logit contributions
be+0.130
Sus+0.121
mine+0.120
 especially+0.118
years+0.118
Neg logit contributions
 dod-0.105
 stops-0.092
 opens-0.089
 unlocks-0.087
 pulls-0.085
Token
Token ID198
Feature activation+5.87e-01

Pos logit contributions
be+0.122
Sus+0.120
mine+0.117
years+0.114
 se+0.110
Neg logit contributions
 dod-0.120
 opens-0.106
 stops-0.105
 unlocks-0.101
 falls-0.098
Token
Token ID198
Feature activation+5.23e-01

Pos logit contributions
be+0.028
Sus+0.027
years+0.027
mine+0.027
 se+0.025
Neg logit contributions
 dod-0.021
 stops-0.018
 opens-0.018
 unlocks-0.017
 falls-0.017
Token"
Token ID1
Feature activation+2.23e+00

Pos logit contributions
be+0.021
years+0.020
mine+0.020
Sus+0.019
 se+0.019
Neg logit contributions
 dod-0.023
 stops-0.020
 opens-0.020
 unlocks-0.019
 True-0.018
TokenThat
Token ID2504
Feature activation+5.22e-01

Pos logit contributions
be+0.089
years+0.086
mine+0.084
 harbor+0.079
bar+0.079
Neg logit contributions
 dod-0.092
 True-0.079
 stops-0.076
 lets-0.075
 Ready-0.074
Token hill
Token ID12788
Feature activation+3.06e-01

Pos logit contributions
 se+0.008
be+0.008
line+0.008
 ocean+0.008
 weekend+0.007
Neg logit contributions
 dod-0.016
ccoli-0.015
 cleans-0.014
 pulls-0.013
 unlocks-0.013
Token and
Token ID290
Feature activation-3.58e-01

Pos logit contributions
be+0.015
Sus+0.014
mine+0.014
years+0.014
 se+0.014
Neg logit contributions
 dod-0.011
 stops-0.009
 opens-0.009
 unlocks-0.008
 pulls-0.008
Token dug
Token ID18735
Feature activation+3.28e+00

Pos logit contributions
 dod+0.020
 stops+0.017
 opens+0.017
 unlocks+0.017
ccoli+0.016
Neg logit contributions
be-0.010
mine-0.009
 se-0.009
Sus-0.009
years-0.009
Token in
Token ID287
Feature activation-2.12e+00

Pos logit contributions
be+0.116
 se+0.112
Sus+0.108
 especially+0.104
mine+0.103
Neg logit contributions
 dod-0.148
ccoli-0.134
 stops-0.122
 pulls-0.120
 unlocks-0.119
Token the
Token ID262
Feature activation+1.57e+00

Pos logit contributions
 dod+0.072
 opens+0.063
 stops+0.063
 unlocks+0.058
 falls+0.057
Neg logit contributions
be-0.106
years-0.102
Sus-0.102
mine-0.101
ina-0.097
Token dirt
Token ID13647
Feature activation+1.47e+00

Pos logit contributions
be+0.058
 se+0.055
 ocean+0.052
 harbor+0.050
years+0.049
Neg logit contributions
 dod-0.072
 opens-0.061
ccoli-0.061
 pulls-0.059
 stops-0.059
Token.
Token ID13
Feature activation-1.31e-01

Pos logit contributions
be+0.074
Sus+0.072
years+0.071
mine+0.070
ina+0.067
Neg logit contributions
 dod-0.049
 stops-0.043
 opens-0.042
 unlocks-0.039
 falls-0.039
Token They
Token ID1119
Feature activation+1.62e+00

Pos logit contributions
 dod+0.007
ccoli+0.006
 stops+0.006
 opens+0.006
 unlocks+0.006
Neg logit contributions
Sus-0.004
be-0.004
 se-0.004
years-0.004
mine-0.004
Token found
Token ID1043
Feature activation+1.00e+00

Pos logit contributions
be+0.058
Sus+0.056
years+0.054
mine+0.053
ina+0.050
Neg logit contributions
 dod-0.078
 opens-0.071
 stops-0.071
 unlocks-0.068
 falls-0.067
Token a
Token ID257
Feature activation-1.98e+00

Pos logit contributions
be+0.037
Sus+0.035
years+0.035
mine+0.035
 se+0.034
Neg logit contributions
 dod-0.046
 stops-0.041
 opens-0.041
 unlocks-0.039
 falls-0.038
Token box
Token ID3091
Feature activation+1.26e+00

Pos logit contributions
 dod+0.085
 opens+0.076
 pulls+0.075
 unlocks+0.075
 stops+0.073
Neg logit contributions
be-0.081
 se-0.077
Sus-0.073
 3-0.070
 harbor-0.069
Token girl
Token ID2576
Feature activation+2.03e+00

Pos logit contributions
 dod+0.016
 stops+0.016
 opens+0.016
 unlocks+0.014
 pulls+0.014
Neg logit contributions
be-0.016
Sus-0.015
years-0.015
rol-0.015
mine-0.015
Token were
Token ID547
Feature activation+9.10e-01

Pos logit contributions
Sus+0.062
be+0.061
years+0.059
mine+0.057
 se+0.053
Neg logit contributions
 dod-0.105
 stops-0.097
 opens-0.096
 unlocks-0.092
 falls-0.091
Token friends
Token ID2460
Feature activation-1.31e+00

Pos logit contributions
mine+0.040
Sus+0.040
be+0.039
years+0.039
 occasion+0.039
Neg logit contributions
 dod-0.034
 stops-0.029
 opens-0.028
 True-0.027
 unlocks-0.026
Token and
Token ID290
Feature activation-1.51e+00

Pos logit contributions
 dod+0.040
 stops+0.034
 opens+0.032
 True+0.030
 falls+0.030
Neg logit contributions
Sus-0.066
be-0.066
years-0.065
mine-0.064
 se-0.063
Token they
Token ID484
Feature activation+1.94e+00

Pos logit contributions
 dod+0.067
 stops+0.062
 opens+0.060
 True+0.056
 falls+0.056
Neg logit contributions
be-0.058
Sus-0.057
mine-0.055
years-0.054
line-0.051
Token painted
Token ID13055
Feature activation+1.92e+00

Pos logit contributions
Sus+0.064
be+0.063
years+0.060
 occasion+0.056
line+0.055
Neg logit contributions
 dod-0.096
 stops-0.087
 opens-0.085
 falls-0.081
 unlocks-0.080
Token many
Token ID867
Feature activation+9.27e-01

Pos logit contributions
be+0.080
Sus+0.077
years+0.076
mine+0.075
ina+0.071
Neg logit contributions
 dod-0.080
 stops-0.072
 opens-0.072
 unlocks-0.068
 falls-0.066
Token pictures
Token ID5986
Feature activation-4.22e-01

Pos logit contributions
be+0.036
Sus+0.034
 se+0.034
 especially+0.034
years+0.034
Neg logit contributions
 dod-0.041
 stops-0.036
ccoli-0.036
 pulls-0.036
 opens-0.035
Token together
Token ID1978
Feature activation+1.58e+00

Pos logit contributions
 dod+0.021
 stops+0.018
 opens+0.018
 pulls+0.018
 unlocks+0.018
Neg logit contributions
be-0.015
mine-0.014
Sus-0.014
years-0.013
 se-0.013
Token.
Token ID13
Feature activation+1.68e-01

Pos logit contributions
be+0.062
mine+0.060
 especially+0.058
Sus+0.057
years+0.056
Neg logit contributions
 dod-0.071
 stops-0.063
 opens-0.062
 unlocks-0.062
 pulls-0.062
Token The
Token ID383
Feature activation-6.46e-01

Pos logit contributions
be+0.008
Sus+0.007
mine+0.007
years+0.007
ina+0.007
Neg logit contributions
 dod-0.006
 opens-0.006
 stops-0.006
 unlocks-0.005
 pulls-0.005
TokenTom
Token ID13787
Feature activation+4.92e-01

Pos logit contributions
 dod+0.010
 stops+0.009
 opens+0.009
 unlocks+0.008
 falls+0.008
Neg logit contributions
be-0.010
Sus-0.010
years-0.010
mine-0.010
 se-0.009
Tokenmy
Token ID1820
Feature activation-5.27e-01

Pos logit contributions
be+0.014
Sus+0.013
years+0.013
mine+0.013
 se+0.012
Neg logit contributions
 dod-0.027
 stops-0.025
 opens-0.025
 unlocks-0.024
 falls-0.024
Token was
Token ID373
Feature activation+5.82e-01

Pos logit contributions
 dod+0.024
 opens+0.021
 stops+0.021
 unlocks+0.020
 falls+0.020
Neg logit contributions
be-0.020
Sus-0.019
mine-0.019
years-0.018
 se-0.018
Token worried
Token ID7960
Feature activation+1.08e+00

Pos logit contributions
mine+0.024
Sus+0.024
be+0.024
ina+0.023
 occasion+0.022
Neg logit contributions
 dod-0.025
 stops-0.022
 opens-0.021
 True-0.020
 pulls-0.020
Token because
Token ID780
Feature activation+7.45e-01

Pos logit contributions
be+0.049
Sus+0.048
years+0.047
mine+0.047
 se+0.044
Neg logit contributions
 dod-0.041
 stops-0.037
 opens-0.036
 falls-0.034
 unlocks-0.034
Token he
Token ID339
Feature activation-2.78e-01

Pos logit contributions
be+0.033
mine+0.031
Sus+0.030
years+0.030
ina+0.029
Neg logit contributions
 dod-0.029
 stops-0.027
 opens-0.026
 falls-0.025
 unlocks-0.025
Token couldn
Token ID3521
Feature activation-1.51e+00

Pos logit contributions
 dod+0.013
 stops+0.012
 opens+0.012
 unlocks+0.011
 falls+0.011
Neg logit contributions
be-0.010
Sus-0.010
years-0.009
mine-0.009
 se-0.009
Token't
Token ID470
Feature activation+5.22e-01

Pos logit contributions
 dod+0.066
 opens+0.059
 stops+0.058
 unlocks+0.056
 pulls+0.054
Neg logit contributions
be-0.061
Sus-0.057
mine-0.057
years-0.055
 se-0.054
Token see
Token ID766
Feature activation+8.89e-01

Pos logit contributions
years+0.025
Sus+0.025
be+0.025
ina+0.023
rol+0.023
Neg logit contributions
 dod-0.018
 stops-0.016
 opens-0.015
 falls-0.015
 pulls-0.014
Token where
Token ID810
Feature activation-2.30e+00

Pos logit contributions
be+0.032
mine+0.031
years+0.031
Sus+0.031
 especially+0.030
Neg logit contributions
 dod-0.041
 opens-0.037
 stops-0.036
 unlocks-0.036
 closes-0.034
Token his
Token ID465
Feature activation-2.10e+00

Pos logit contributions
 dod+0.088
 stops+0.083
 opens+0.079
 falls+0.076
 unlocks+0.074
Neg logit contributions
be-0.103
years-0.099
Sus-0.097
mine-0.097
ina-0.093
Token on
Token ID319
Feature activation+6.36e-01

Pos logit contributions
be+0.030
mine+0.028
years+0.027
Sus+0.027
ina+0.025
Neg logit contributions
 dod-0.042
 opens-0.037
 stops-0.037
ccoli-0.036
 unlocks-0.035
Token,
Token ID11
Feature activation+2.47e+00

Pos logit contributions
Sus+0.021
mine+0.021
ina+0.021
 especially+0.020
be+0.019
Neg logit contributions
 dod-0.031
ccoli-0.026
 Ready-0.025
 pulls-0.025
 stops-0.025
Token Lily
Token ID20037
Feature activation+9.07e-01

Pos logit contributions
Sus+0.090
mine+0.088
be+0.087
years+0.085
 especially+0.084
Neg logit contributions
 dod-0.114
 stops-0.104
 opens-0.104
 unlocks-0.102
 pulls-0.099
Token loved
Token ID6151
Feature activation+9.21e-01

Pos logit contributions
Sus+0.032
be+0.032
years+0.031
mine+0.031
 occasion+0.028
Neg logit contributions
 dod-0.042
 stops-0.040
 opens-0.038
 falls-0.037
 unlocks-0.037
Token to
Token ID284
Feature activation+1.65e+00

Pos logit contributions
be+0.037
Sus+0.035
years+0.035
mine+0.035
 se+0.033
Neg logit contributions
 dod-0.040
 stops-0.036
 opens-0.036
 unlocks-0.034
 falls-0.033
Token hang
Token ID8181
Feature activation+2.95e-01

Pos logit contributions
be+0.073
Sus+0.070
mine+0.069
years+0.069
 se+0.066
Neg logit contributions
 dod-0.065
 stops-0.058
 opens-0.057
 unlocks-0.054
 falls-0.053
Token out
Token ID503
Feature activation-3.79e+00

Pos logit contributions
be+0.012
Sus+0.012
mine+0.012
 se+0.012
ina+0.012
Neg logit contributions
 dod-0.012
 stops-0.010
ccoli-0.010
 opens-0.010
 pulls-0.010
Token on
Token ID319
Feature activation-3.74e-02

Pos logit contributions
 dod+0.182
 stops+0.159
ccoli+0.158
 opens+0.154
 pulls+0.154
Neg logit contributions
be-0.134
Sus-0.128
mine-0.127
 se-0.125
years-0.124
Token the
Token ID262
Feature activation+2.48e+00

Pos logit contributions
 dod+0.002
 stops+0.001
ccoli+0.001
 pulls+0.001
 unlocks+0.001
Neg logit contributions
mine-0.001
be-0.001
 se-0.001
Sus-0.001
 especially-0.001
Token swing
Token ID9628
Feature activation+2.84e+00

Pos logit contributions
be+0.090
 se+0.086
mine+0.077
 especially+0.077
 harbor+0.076
Neg logit contributions
 dod-0.119
 pulls-0.108
ccoli-0.105
 unlocks-0.105
 opens-0.104
Token and
Token ID290
Feature activation-7.17e-02

Pos logit contributions
be+0.135
Sus+0.130
mine+0.130
years+0.128
 se+0.123
Neg logit contributions
 dod-0.103
 stops-0.089
 opens-0.088
 unlocks-0.083
 pulls-0.081
Token They
Token ID1119
Feature activation+1.24e+00

Pos logit contributions
 dod+0.023
 opens+0.020
 unlocks+0.020
 stops+0.020
 pulls+0.019
Neg logit contributions
be-0.017
Sus-0.017
mine-0.015
line-0.015
ina-0.015
Token dance
Token ID9280
Feature activation-1.76e+00

Pos logit contributions
be+0.046
mine+0.046
 se+0.044
 especially+0.043
Sus+0.042
Neg logit contributions
 dod-0.056
 stops-0.048
 unlocks-0.047
 opens-0.047
 disappears-0.045
Token with
Token ID351
Feature activation-3.39e-01

Pos logit contributions
 dod+0.076
 opens+0.064
 stops+0.064
 unlocks+0.064
 pulls+0.062
Neg logit contributions
be-0.072
mine-0.067
Sus-0.067
years-0.066
 se-0.065
Token Mom
Token ID11254
Feature activation+3.39e-01

Pos logit contributions
 dod+0.013
 stops+0.012
 opens+0.012
 unlocks+0.011
 pulls+0.011
Neg logit contributions
be-0.015
Sus-0.014
mine-0.014
years-0.014
 se-0.014
Token and
Token ID290
Feature activation-1.23e+00

Pos logit contributions
be+0.016
mine+0.016
Sus+0.015
years+0.015
ina+0.015
Neg logit contributions
 dod-0.012
 stops-0.010
 opens-0.010
 unlocks-0.010
 pulls-0.010
Token Dad
Token ID17415
Feature activation+1.85e-01

Pos logit contributions
 dod+0.061
 unlocks+0.053
 stops+0.053
 opens+0.052
 disappears+0.051
Neg logit contributions
be-0.042
 se-0.040
mine-0.039
years-0.037
Sus-0.037
Token.
Token ID13
Feature activation-1.21e+00

Pos logit contributions
be+0.009
mine+0.008
years+0.008
Sus+0.008
 se+0.008
Neg logit contributions
 dod-0.007
 stops-0.006
 opens-0.006
 unlocks-0.006
 pulls-0.006
Token They
Token ID1119
Feature activation+9.41e-01

Pos logit contributions
 dod+0.060
 opens+0.052
 unlocks+0.052
 pulls+0.051
 stops+0.050
Neg logit contributions
Sus-0.042
be-0.041
 especially-0.038
line-0.037
mine-0.037
Token swing
Token ID9628
Feature activation-1.86e+00

Pos logit contributions
be+0.038
mine+0.037
Sus+0.035
 se+0.035
 especially+0.035
Neg logit contributions
 dod-0.040
 stops-0.035
 opens-0.034
 unlocks-0.034
 pulls-0.032
Token and
Token ID290
Feature activation-1.89e+00

Pos logit contributions
 dod+0.086
 unlocks+0.076
 opens+0.075
 pulls+0.075
ccoli+0.073
Neg logit contributions
be-0.066
mine-0.063
 especially-0.058
 se-0.057
Sus-0.057
Token sway
Token ID20009
Feature activation-1.20e+00

Pos logit contributions
 dod+0.082
 stops+0.073
 opens+0.072
 unlocks+0.070
 pulls+0.067
Neg logit contributions
be-0.076
mine-0.073
Sus-0.071
years-0.070
 se-0.069
Token
Token ID198
Feature activation+1.05e-01

Pos logit contributions
Sus+0.006
be+0.005
mine+0.005
years+0.005
 se+0.005
Neg logit contributions
 dod-0.006
 opens-0.005
 stops-0.005
 unlocks-0.005
 pulls-0.005
TokenThe
Token ID464
Feature activation-9.92e-01

Pos logit contributions
be+0.004
Sus+0.004
years+0.004
mine+0.004
 se+0.004
Neg logit contributions
 dod-0.005
 stops-0.004
 opens-0.004
 unlocks-0.004
 falls-0.004
Token child
Token ID1200
Feature activation+2.37e+00

Pos logit contributions
 dod+0.043
 opens+0.036
 unlocks+0.035
 stops+0.035
 pulls+0.035
Neg logit contributions
be-0.042
 se-0.040
Sus-0.039
years-0.039
mine-0.038
Token nodded
Token ID14464
Feature activation+3.16e+00

Pos logit contributions
be+0.085
Sus+0.080
mine+0.079
years+0.079
 se+0.075
Neg logit contributions
 dod-0.114
 stops-0.101
 opens-0.100
 unlocks-0.097
 falls-0.096
Token.
Token ID13
Feature activation-1.60e-01

Pos logit contributions
be+0.158
Sus+0.153
years+0.152
mine+0.150
ina+0.142
Neg logit contributions
 dod-0.102
 stops-0.091
 opens-0.091
 unlocks-0.082
 falls-0.080
Token The
Token ID383
Feature activation-7.88e-01

Pos logit contributions
 dod+0.008
 falls+0.007
 opens+0.006
 disappears+0.006
 stops+0.006
Neg logit contributions
line-0.005
Sus-0.005
 3-0.005
be-0.005
ules-0.005
Token old
Token ID1468
Feature activation-1.20e-01

Pos logit contributions
 dod+0.032
 opens+0.028
 stops+0.027
 unlocks+0.026
 falls+0.026
Neg logit contributions
be-0.035
Sus-0.033
years-0.033
mine-0.032
 se-0.032
Token man
Token ID582
Feature activation-4.30e-01

Pos logit contributions
 dod+0.004
 stops+0.004
 opens+0.003
 unlocks+0.003
 pulls+0.003
Neg logit contributions
be-0.006
years-0.006
Sus-0.006
mine-0.006
 especially-0.006
Token took
Token ID1718
Feature activation+5.27e-01

Pos logit contributions
 dod+0.020
 stops+0.018
 opens+0.018
 unlocks+0.017
 falls+0.017
Neg logit contributions
be-0.016
Sus-0.015
years-0.015
mine-0.015
 se-0.014
Token the
Token ID262
Feature activation-1.56e-01

Pos logit contributions
be+0.019
mine+0.018
years+0.018
 se+0.018
Sus+0.018
Neg logit contributions
 dod-0.025
 stops-0.022
ccoli-0.022
 opens-0.021
 falls-0.021
Token child
Token ID1200
Feature activation+3.10e+00

Pos logit contributions
 dod+0.006
ccoli+0.005
 stops+0.005
 opens+0.005
 disappears+0.005
Neg logit contributions
be-0.007
 3-0.006
 se-0.006
mine-0.006
Sus-0.006
Token a
Token ID257
Feature activation-2.35e+00

Pos logit contributions
be+0.034
mine+0.033
Sus+0.033
years+0.032
ina+0.031
Neg logit contributions
 dod-0.027
 stops-0.024
 opens-0.024
 unlocks-0.022
 falls-0.022
Token popular
Token ID2968
Feature activation+1.85e+00

Pos logit contributions
 dod+0.101
 opens+0.094
 unlocks+0.093
 pulls+0.090
 stops+0.088
Neg logit contributions
be-0.095
 se-0.090
Sus-0.089
 especially-0.082
 3-0.081
Token vendor
Token ID18371
Feature activation-1.73e+00

Pos logit contributions
be+0.070
Sus+0.068
 se+0.068
mine+0.063
 occasion+0.062
Neg logit contributions
 dod-0.083
 opens-0.075
 unlocks-0.074
 pulls-0.072
 stops-0.072
Token selling
Token ID6301
Feature activation+2.45e-01

Pos logit contributions
 dod+0.064
 stops+0.056
 opens+0.055
 unlocks+0.052
 falls+0.051
Neg logit contributions
be-0.081
Sus-0.078
mine-0.077
years-0.077
 se-0.074
Token ice
Token ID4771
Feature activation+1.34e+00

Pos logit contributions
be+0.009
Sus+0.009
mine+0.009
years+0.009
 se+0.008
Neg logit contributions
 dod-0.011
 stops-0.010
 opens-0.010
 unlocks-0.009
 falls-0.009
Token cream
Token ID8566
Feature activation-2.03e+00

Pos logit contributions
be+0.054
Sus+0.053
years+0.052
mine+0.052
 se+0.048
Neg logit contributions
 dod-0.057
 stops-0.051
 opens-0.051
 unlocks-0.048
 falls-0.047
Token.
Token ID13
Feature activation+1.70e-02

Pos logit contributions
 dod+0.084
 opens+0.072
 stops+0.072
 unlocks+0.069
 pulls+0.067
Neg logit contributions
be-0.088
mine-0.084
Sus-0.081
 se-0.080
line-0.080
Token Danny
Token ID15105
Feature activation+1.31e+00

Pos logit contributions
be+0.001
Sus+0.001
years+0.001
mine+0.001
 se+0.001
Neg logit contributions
 dod-0.001
 stops-0.001
 opens-0.001
 unlocks-0.001
 falls-0.001
Token had
Token ID550
Feature activation+1.66e+00

Pos logit contributions
be+0.045
mine+0.041
years+0.041
Sus+0.041
 especially+0.039
Neg logit contributions
 dod-0.066
 stops-0.059
 opens-0.058
 unlocks-0.057
 falls-0.056
Token some
Token ID617
Feature activation+2.22e+00

Pos logit contributions
mine+0.079
Sus+0.077
ina+0.077
be+0.077
years+0.074
Neg logit contributions
 dod-0.058
 opens-0.054
 stops-0.053
 unlocks-0.050
 pulls-0.050
Token money
Token ID1637
Feature activation-2.01e+00

Pos logit contributions
be+0.084
Sus+0.079
years+0.077
mine+0.077
 se+0.076
Neg logit contributions
 dod-0.103
 stops-0.092
 opens-0.091
 unlocks-0.088
 falls-0.087
Token cave
Token ID11527
Feature activation+1.03e+00

Pos logit contributions
 dod+0.074
 opens+0.065
 stops+0.064
 pulls+0.063
 unlocks+0.062
Neg logit contributions
be-0.079
 se-0.076
Sus-0.075
 especially-0.071
ina-0.069
Token.
Token ID13
Feature activation+4.05e-01

Pos logit contributions
be+0.047
Sus+0.045
years+0.044
mine+0.044
 se+0.043
Neg logit contributions
 dod-0.040
 stops-0.034
 opens-0.034
 unlocks-0.032
 pulls-0.032
Token The
Token ID383
Feature activation-3.78e-01

Pos logit contributions
Sus+0.014
be+0.014
mine+0.013
 se+0.013
 especially+0.013
Neg logit contributions
 dod-0.020
 disappears-0.016
 opens-0.016
 falls-0.016
 stops-0.016
Token cave
Token ID11527
Feature activation+1.92e-01

Pos logit contributions
 dod+0.019
 stops+0.016
 opens+0.016
 unlocks+0.016
 pulls+0.016
Neg logit contributions
be-0.013
Sus-0.013
years-0.012
mine-0.012
 se-0.012
Token was
Token ID373
Feature activation+3.43e-01

Pos logit contributions
be+0.006
 se+0.006
Sus+0.006
 especially+0.005
ina+0.005
Neg logit contributions
 dod-0.010
 Ready-0.008
 opens-0.008
ccoli-0.008
 stops-0.008
Token big
Token ID1263
Feature activation-4.32e+00

Pos logit contributions
be+0.014
Sus+0.013
mine+0.013
years+0.013
ina+0.012
Neg logit contributions
 dod-0.015
 stops-0.014
 opens-0.013
 unlocks-0.013
 falls-0.013
Token and
Token ID290
Feature activation+2.96e-01

Pos logit contributions
 dod+0.170
 stops+0.142
 opens+0.137
 pulls+0.135
 unlocks+0.133
Neg logit contributions
be-0.189
 se-0.183
Sus-0.177
years-0.173
 especially-0.173
Token dark
Token ID3223
Feature activation-3.12e+00

Pos logit contributions
be+0.011
Sus+0.011
years+0.011
 se+0.011
mine+0.010
Neg logit contributions
 dod-0.014
 stops-0.012
 opens-0.012
 unlocks-0.011
 pulls-0.011
Token.
Token ID13
Feature activation+8.86e-02

Pos logit contributions
 dod+0.113
 stops+0.098
 opens+0.097
 unlocks+0.092
 falls+0.089
Neg logit contributions
be-0.148
Sus-0.143
years-0.141
mine-0.141
 se-0.135
Token Tim
Token ID5045
Feature activation+7.37e-02

Pos logit contributions
Sus+0.003
be+0.003
 especially+0.003
mine+0.003
 se+0.003
Neg logit contributions
 dod-0.004
 disappears-0.004
 opens-0.004
 closes-0.004
 falls-0.004
Tokenmy
Token ID1820
Feature activation-5.09e-01

Pos logit contributions
be+0.002
Sus+0.002
years+0.002
mine+0.002
line+0.002
Neg logit contributions
 dod-0.003
 stops-0.003
 opens-0.003
 unlocks-0.003
 pulls-0.003
Token how
Token ID703
Feature activation+3.06e-01

Pos logit contributions
 dod+0.001
 opens+0.000
 stops+0.000
 unlocks+0.000
ccoli+0.000
Neg logit contributions
be-0.000
Sus-0.000
 se-0.000
years-0.000
mine-0.000
Token the
Token ID262
Feature activation-2.75e-01

Pos logit contributions
be+0.012
Sus+0.012
years+0.012
mine+0.011
 se+0.011
Neg logit contributions
 dod-0.013
 opens-0.012
 stops-0.012
 unlocks-0.011
 pulls-0.011
Token swings
Token ID26728
Feature activation-5.23e-01

Pos logit contributions
 dod+0.013
 opens+0.011
 stops+0.011
 unlocks+0.011
 pulls+0.011
Neg logit contributions
be-0.011
Sus-0.010
mine-0.010
years-0.010
 se-0.010
Token were
Token ID547
Feature activation+7.24e-01

Pos logit contributions
 dod+0.027
 opens+0.023
 stops+0.023
ccoli+0.023
 unlocks+0.023
Neg logit contributions
be-0.017
mine-0.017
 se-0.015
 especially-0.015
Sus-0.015
Token too
Token ID1165
Feature activation-3.22e-01

Pos logit contributions
be+0.027
Sus+0.025
 especially+0.025
mine+0.024
 se+0.024
Neg logit contributions
 dod-0.034
 opens-0.030
 stops-0.029
 unlocks-0.028
 pulls-0.027
Token high
Token ID1029
Feature activation-5.66e+00

Pos logit contributions
 dod+0.013
 opens+0.011
 unlocks+0.011
 stops+0.011
 Ready+0.011
Neg logit contributions
be-0.014
Sus-0.013
 se-0.013
 especially-0.013
h-0.012
Token.
Token ID13
Feature activation-6.58e-01

Pos logit contributions
 dod+0.252
 opens+0.210
 stops+0.203
 unlocks+0.202
 pulls+0.200
Neg logit contributions
be-0.227
mine-0.218
Sus-0.209
ina-0.209
 especially-0.206
Token The
Token ID383
Feature activation-8.85e-01

Pos logit contributions
 dod+0.027
 opens+0.024
 stops+0.024
 unlocks+0.023
 falls+0.022
Neg logit contributions
Sus-0.028
be-0.027
mine-0.026
years-0.026
 especially-0.025
Token musician
Token ID21623
Feature activation-2.53e+00

Pos logit contributions
 dod+0.038
 opens+0.034
 unlocks+0.034
 stops+0.033
 pulls+0.032
Neg logit contributions
be-0.036
Sus-0.035
 se-0.035
mine-0.034
years-0.034
Token smiled
Token ID13541
Feature activation-1.95e+00

Pos logit contributions
 dod+0.123
 stops+0.111
 opens+0.111
 unlocks+0.106
 falls+0.104
Neg logit contributions
be-0.088
Sus-0.084
mine-0.082
years-0.082
 se-0.077
Token and
Token ID290
Feature activation-8.95e-01

Pos logit contributions
 dod+0.069
 opens+0.059
 stops+0.059
 unlocks+0.056
 pulls+0.054
Neg logit contributions
be-0.095
Sus-0.091
years-0.090
mine-0.090
 se-0.087
Token and
Token ID290
Feature activation-3.45e-01

Pos logit contributions
 dod+0.045
 opens+0.039
 unlocks+0.038
 stops+0.038
 pulls+0.036
Neg logit contributions
be-0.052
Sus-0.050
mine-0.049
 se-0.048
 especially-0.048
Token cleaned
Token ID20750
Feature activation+1.36e+00

Pos logit contributions
 dod+0.016
 stops+0.014
 opens+0.013
 falls+0.013
 pulls+0.013
Neg logit contributions
be-0.013
Sus-0.013
years-0.013
mine-0.013
ina-0.012
Token it
Token ID340
Feature activation+1.21e+00

Pos logit contributions
be+0.059
Sus+0.058
mine+0.057
 especially+0.053
ina+0.053
Neg logit contributions
 dod-0.055
 opens-0.048
 stops-0.047
 unlocks-0.047
 disappears-0.045
Token up
Token ID510
Feature activation-4.45e+00

Pos logit contributions
be+0.050
mine+0.047
Sus+0.047
 especially+0.046
 se+0.045
Neg logit contributions
 dod-0.052
 stops-0.044
 opens-0.043
 unlocks-0.042
 disappears-0.041
Token.
Token ID13
Feature activation-2.62e-01

Pos logit contributions
 dod+0.207
 stops+0.180
 opens+0.180
 unlocks+0.173
 pulls+0.171
Neg logit contributions
be-0.170
Sus-0.169
mine-0.165
 especially-0.160
 se-0.157
Token When
Token ID1649
Feature activation-4.83e+00

Pos logit contributions
 dod+0.012
 unlocks+0.011
 opens+0.010
 falls+0.010
 stops+0.010
Neg logit contributions
Sus-0.010
be-0.009
years-0.009
mine-0.009
 especially-0.009
Token the
Token ID262
Feature activation-1.05e+00

Pos logit contributions
 dod+0.217
 stops+0.190
 opens+0.190
 unlocks+0.184
 pulls+0.179
Neg logit contributions
be-0.187
Sus-0.185
mine-0.177
years-0.173
 se-0.169
Token cake
Token ID12187
Feature activation-1.13e-01

Pos logit contributions
 dod+0.056
 stops+0.051
 opens+0.051
 unlocks+0.049
 falls+0.048
Neg logit contributions
be-0.032
Sus-0.030
mine-0.029
years-0.029
 se-0.028
Token was
Token ID373
Feature activation-8.17e-01

Pos logit contributions
 dod+0.005
 opens+0.005
 stops+0.005
 unlocks+0.004
 pulls+0.004
Neg logit contributions
be-0.004
mine-0.004
Sus-0.004
 se-0.004
ina-0.004
Token done
Token ID1760
Feature activation+1.08e+00

Pos logit contributions
 dod+0.038
 stops+0.035
 opens+0.035
 unlocks+0.033
 falls+0.033
Neg logit contributions
be-0.030
years-0.028
Sus-0.028
mine-0.028
ina-0.026
Token,
Token ID11
Feature activation+5.07e-01

Pos logit contributions
be+0.059
Sus+0.057
years+0.057
mine+0.057
 se+0.054
Neg logit contributions
 dod-0.031
 stops-0.027
 opens-0.026
 unlocks-0.024
 falls-0.023
Token they
Token ID484
Feature activation+1.04e+00

Pos logit contributions
 dod+0.099
 opens+0.084
ccoli+0.084
 unlocks+0.083
 pulls+0.082
Neg logit contributions
be-0.056
 especially-0.053
 se-0.053
Sus-0.052
ina-0.052
Token pretended
Token ID36766
Feature activation-5.51e-01

Pos logit contributions
be+0.034
Sus+0.033
mine+0.032
years+0.032
 se+0.030
Neg logit contributions
 dod-0.053
 stops-0.048
 opens-0.048
 unlocks-0.046
 falls-0.045
Token to
Token ID284
Feature activation-6.70e-01

Pos logit contributions
 dod+0.029
 opens+0.025
 unlocks+0.024
 stops+0.024
 pulls+0.024
Neg logit contributions
be-0.017
Sus-0.016
 se-0.016
 especially-0.016
mine-0.015
Token be
Token ID307
Feature activation+2.12e+00

Pos logit contributions
 dod+0.025
 stops+0.023
 opens+0.022
 falls+0.020
 unlocks+0.020
Neg logit contributions
be-0.031
years-0.030
Sus-0.030
mine-0.029
ina-0.028
Token pirates
Token ID27516
Feature activation-2.83e+00

Pos logit contributions
be+0.073
 especially+0.070
Sus+0.069
years+0.069
 se+0.068
Neg logit contributions
 dod-0.097
 stops-0.091
 opens-0.089
 falls-0.088
 pulls-0.088
Token or
Token ID393
Feature activation-4.86e+00

Pos logit contributions
 dod+0.117
 opens+0.098
 stops+0.096
 pulls+0.095
 unlocks+0.094
Neg logit contributions
be-0.119
mine-0.116
 especially-0.115
ina-0.112
 se-0.110
Token superheroes
Token ID40896
Feature activation-3.12e+00

Pos logit contributions
 dod+0.217
 opens+0.192
 stops+0.190
 unlocks+0.189
 falls+0.182
Neg logit contributions
 se-0.183
be-0.181
 especially-0.170
years-0.165
Sus-0.162
Token.
Token ID13
Feature activation-5.40e-01

Pos logit contributions
 dod+0.125
 opens+0.111
 stops+0.109
 unlocks+0.108
 pulls+0.103
Neg logit contributions
be-0.137
mine-0.130
 especially-0.123
 se-0.120
ina-0.120
Token They
Token ID1119
Feature activation+9.59e-01

Pos logit contributions
 dod+0.024
 opens+0.021
 unlocks+0.020
 stops+0.020
 disappears+0.020
Neg logit contributions
be-0.021
Sus-0.021
mine-0.020
 especially-0.020
years-0.019
Token had
Token ID550
Feature activation+6.89e-01

Pos logit contributions
Sus+0.040
be+0.038
years+0.037
mine+0.035
heart+0.034
Neg logit contributions
 dod-0.041
 stops-0.038
 opens-0.038
 falls-0.035
 pulls-0.035
Token a
Token ID257
Feature activation-1.59e+00

Pos logit contributions
be+0.032
mine+0.032
Sus+0.032
years+0.031
ina+0.030
Neg logit contributions
 dod-0.026
 stops-0.023
 opens-0.023
 unlocks-0.021
 pulls-0.020
Token be
Token ID307
Feature activation+2.57e+00

Pos logit contributions
 dod+0.003
 stops+0.003
 opens+0.003
 unlocks+0.002
 falls+0.002
Neg logit contributions
be-0.003
Sus-0.003
mine-0.003
years-0.003
 se-0.003
Token fair
Token ID3148
Feature activation-2.91e+00

Pos logit contributions
be+0.095
years+0.089
Sus+0.087
 especially+0.086
 se+0.079
Neg logit contributions
 dod-0.119
 stops-0.110
 opens-0.108
 unlocks-0.106
 falls-0.104
Token.
Token ID13
Feature activation-1.25e+00

Pos logit contributions
 dod+0.116
 stops+0.099
 opens+0.097
 unlocks+0.094
 pulls+0.093
Neg logit contributions
be-0.135
Sus-0.124
mine-0.122
ina-0.122
years-0.121
Token No
Token ID1400
Feature activation-1.28e+00

Pos logit contributions
 dod+0.049
 stops+0.043
 opens+0.040
 falls+0.040
 pulls+0.040
Neg logit contributions
be-0.056
Sus-0.055
years-0.053
mine-0.053
 especially-0.050
Token pushing
Token ID7796
Feature activation-6.62e-01

Pos logit contributions
 dod+0.071
 opens+0.064
 unlocks+0.064
 stops+0.064
 pulls+0.061
Neg logit contributions
be-0.038
 se-0.033
Sus-0.032
mine-0.032
 especially-0.032
Token or
Token ID393
Feature activation-4.10e+00

Pos logit contributions
 dod+0.030
 unlocks+0.028
 opens+0.028
 stops+0.027
 disappears+0.027
Neg logit contributions
be-0.024
mine-0.024
 especially-0.022
h-0.021
years-0.021
Token cheating
Token ID21608
Feature activation-1.34e+00

Pos logit contributions
 dod+0.153
 unlocks+0.150
 opens+0.145
 stops+0.138
ccoli+0.137
Neg logit contributions
be-0.177
 se-0.166
mine-0.151
line-0.150
years-0.148
Token."
Token ID526
Feature activation+1.24e+00

Pos logit contributions
 dod+0.060
 unlocks+0.056
 opens+0.055
 Patch+0.053
 Under+0.053
Neg logit contributions
be-0.049
h-0.043
mine-0.042
 especially-0.040
line-0.039
Token
Token ID198
Feature activation-5.11e-01

Pos logit contributions
Sus+0.041
mine+0.039
be+0.038
 especially+0.037
line+0.036
Neg logit contributions
 dod-0.063
 opens-0.057
 unlocks-0.056
 pulls-0.055
 stops-0.055
Token
Token ID198
Feature activation-4.15e-01

Pos logit contributions
 dod+0.024
 opens+0.022
 stops+0.021
 unlocks+0.021
 pulls+0.021
Neg logit contributions
Sus-0.019
be-0.018
 se-0.016
mine-0.016
ina-0.016
TokenThey
Token ID2990
Feature activation+1.74e+00

Pos logit contributions
 dod+0.019
 stops+0.018
 opens+0.017
 unlocks+0.017
 falls+0.016
Neg logit contributions
be-0.016
Sus-0.016
mine-0.015
years-0.014
 se-0.014