TOP ACTIVATIONS
MAX = 0.279

day dream ed about wild towns . One day , he
far away from any busy towns . One day , a
a terrible storm hit the towns and it started to rain
safe . He asked the towns folk if they wanted him
. She stopped in towns and cities and recorded the
around him , even other towns , started to take notice
! The towns folk searched high and low
long way and saw many towns , but the best thing
days and days and the towns folk looked on .
biggest thing ever . The towns folk cl apped and cheered
little cracks and cre v ices of the rocks and trees
". He tried many different methods , but nothing seemed to
finished its bouncing , the towns folk thanked the rope for
a girl who loved to solve puzzles . On this special
a problem he couldn 't solve . He wanted to have
had a tough problem to solve . So he headed off
a boy who loved to solve puzzles . Every day ,
, but she wanted to solve a big mystery .
with her toys and do puzzles . One day , Lily
he might be able to solve it . He

MIN ACTIVATIONS
MIN = -0.311

Max filled Max 's do gh ouse with cozy blankets and
and said , " U gh , what are you doing
He shouted , " Ar gh mate ys !"
it back to his do gh ouse when it suddenly revealed
ran back to his do gh ouse but it was too
the bird to my do gh ouse . We can make
and went to his do gh ouse to think about what
lie down in my do gh ouse ? You can have
friend said , " U gh , this part is so
took a bite . U gh ! It was so sour
and pointed at the grass ho pper . He said ,
" Is it a grass ho pper ?" she guesses .
run out of his do gh ouse , and head straight
" Thank you for letting us pet him ," Ben says
lady bug and a grass ho pper . One
worked hard . The grass ho pper was spoiled and didn
in circles with the grass ho pper . They jumped and
so happy that the grass ho pper had said yes .
. Thank you for teaching us ." Mom smiles and says
It is not a grass ho pper or a cricket .

INTERVAL 0.132 - 0.279
CONTAINS 3.909%

but asked the old man if he was okay .
. Inside the box , there were many toys for Tom
it . But when Jack arrived , he saw
to explore . But only if you promise to be very
butterfly . Joe asked , " Hello butterfly , can you

INTERVAL -0.016 - 0.132
CONTAINS 58.585%

was scared at first , but then he got excited and
and the reef . She was happy that they had fun
hot and thirsty . They had been walking in the sand
boy with a big dream . He wanted to have a
inside and drink hot cocoa . " Let 's

INTERVAL -0.164 - -0.016
CONTAINS 35.275%

off . After the bird dried off , he felt so
. We want it ," Lily and Ben say .
One day , they go to the park with their dogs
She tries to do the same , but the rock is
enjoying her adventure and she even saw a fairy . She

INTERVAL -0.311 - -0.164
CONTAINS 2.230%

man thanked them and gave them some cookies . They ate
" Hi !" back to him . Tim was amazed and
good result . She worked really hard on her project and
heat . They were both very happy and not so busy
. â We are looking
Token day
Token ID1110
Feature activation+2.73e-02

Pos logit contributions
 mad+0.057
oto+0.056
iri+0.055
gh+0.054
 awful+0.053
Neg logit contributions
 All-0.036
 Using-0.027
Use-0.026
 Taking-0.026
Find-0.026
Tokendream
Token ID25966
Feature activation-5.78e-02

Pos logit contributions
 All+0.055
 who+0.045
 Using+0.044
Use+0.043
Let+0.043
Neg logit contributions
oto-0.039
iri-0.039
 mad-0.038
 awful-0.037
gh-0.035
Tokened
Token ID276
Feature activation+1.27e-02

Pos logit contributions
oto+0.107
 mad+0.106
iri+0.103
 awful+0.103
gh+0.101
Neg logit contributions
 All-0.087
 Using-0.069
Use-0.068
Find-0.066
 who-0.066
Token about
Token ID546
Feature activation+1.26e-02

Pos logit contributions
 All+0.019
 Using+0.015
Use+0.015
Find+0.015
Let+0.014
Neg logit contributions
 mad-0.024
oto-0.023
iri-0.023
 awful-0.023
gh-0.022
Token wild
Token ID4295
Feature activation-5.38e-02

Pos logit contributions
 All+0.020
+0.019
Use+0.019
 Using+0.019
Find+0.018
Neg logit contributions
 mad-0.022
oto-0.021
 awful-0.020
iri-0.019
 ours-0.019
Token towns
Token ID11684
Feature activation+2.79e-01

Pos logit contributions
 mad+0.110
oto+0.110
iri+0.105
 awful+0.105
gh+0.102
Neg logit contributions
 All-0.069
 Using-0.057
Use-0.056
Find-0.052
 Taking-0.052
Token.
Token ID13
Feature activation+2.16e-02

Pos logit contributions
 All+0.486
Use+0.451
 Using+0.441
Find+0.430
Anything+0.423
Neg logit contributions
 mad-0.430
 awful-0.397
ong-0.394
oto-0.390
 angry-0.372
Token One
Token ID1881
Feature activation+7.66e-02

Pos logit contributions
 All+0.023
Use+0.020
 Using+0.018
Find+0.018
es+0.018
Neg logit contributions
oto-0.046
 mad-0.044
iri-0.044
gh-0.043
 awful-0.042
Token day
Token ID1110
Feature activation+2.71e-02

Pos logit contributions
 All+0.127
 who+0.099
 Using+0.096
Let+0.095
Use+0.094
Neg logit contributions
 mad-0.135
oto-0.133
iri-0.132
 awful-0.130
gh-0.128
Token,
Token ID11
Feature activation+1.05e-03

Pos logit contributions
 All+0.043
 Using+0.034
 who+0.033
Use+0.033
Let+0.033
Neg logit contributions
 mad-0.048
oto-0.048
iri-0.047
 awful-0.046
gh-0.046
Token he
Token ID339
Feature activation+2.16e-02

Pos logit contributions
 All+0.002
 Using+0.001
Use+0.001
 who+0.001
 Taking+0.001
Neg logit contributions
 mad-0.002
oto-0.002
iri-0.002
 awful-0.002
gh-0.002
Token far
Token ID1290
Feature activation-6.16e-02

Pos logit contributions
 All+0.036
Find+0.032
Use+0.032
 Using+0.031
+0.030
Neg logit contributions
 mad-0.043
oto-0.043
gh-0.041
 awful-0.040
iri-0.040
Token away
Token ID1497
Feature activation-1.41e-01

Pos logit contributions
oto+0.079
 mad+0.079
iri+0.077
 awful+0.075
gh+0.073
Neg logit contributions
 All-0.129
 Using-0.108
 who-0.107
Use-0.105
Let-0.104
Token from
Token ID422
Feature activation+3.80e-02

Pos logit contributions
 mad+0.222
 awful+0.215
oto+0.214
iri+0.206
sing+0.203
Neg logit contributions
 All-0.235
Find-0.213
Use-0.213
-0.203
 Using-0.201
Token any
Token ID597
Feature activation+1.47e-02

Pos logit contributions
 All+0.046
 Using+0.033
 who+0.033
Use+0.031
Let+0.031
Neg logit contributions
oto-0.082
 mad-0.082
iri-0.081
 awful-0.080
gh-0.079
Token busy
Token ID8179
Feature activation-7.60e-02

Pos logit contributions
 All+0.021
 Using+0.016
 who+0.016
Use+0.016
Let+0.016
Neg logit contributions
oto-0.028
 mad-0.028
iri-0.028
 awful-0.027
gh-0.027
Token towns
Token ID11684
Feature activation+2.78e-01

Pos logit contributions
 mad+0.122
oto+0.122
iri+0.120
 awful+0.117
gh+0.114
Neg logit contributions
 All-0.130
 Using-0.112
Use-0.108
 Taking-0.105
Find-0.103
Token.
Token ID13
Feature activation+2.07e-02

Pos logit contributions
 All+0.501
Use+0.478
 Using+0.473
Find+0.469
Anything+0.455
Neg logit contributions
 mad-0.386
ong-0.364
oto-0.360
 awful-0.352
iri-0.342
Token One
Token ID1881
Feature activation+7.79e-02

Pos logit contributions
es+0.029
vered+0.026
mun+0.025
pen+0.025
+0.024
Neg logit contributions
sing-0.031
oto-0.031
aney-0.030
 ours-0.029
ife-0.029
Token day
Token ID1110
Feature activation+2.88e-02

Pos logit contributions
 All+0.134
 who+0.106
 Using+0.105
Let+0.102
Use+0.102
Neg logit contributions
 mad-0.131
oto-0.131
iri-0.128
 awful-0.126
gh-0.124
Token,
Token ID11
Feature activation-2.95e-03

Pos logit contributions
 All+0.051
Use+0.044
 Using+0.044
Find+0.043
 Taking+0.042
Neg logit contributions
 mad-0.044
gh-0.043
oto-0.042
iri-0.042
 awful-0.040
Token a
Token ID257
Feature activation+5.22e-02

Pos logit contributions
 mad+0.005
sing+0.005
 awful+0.004
 angry+0.004
oto+0.004
Neg logit contributions
 All-0.004
 Using-0.004
 Working-0.004
 Taking-0.004
 Being-0.004
Token a
Token ID257
Feature activation+5.25e-02

Pos logit contributions
 All+0.001
 Using+0.001
 Taking+0.001
Use+0.001
pen+0.001
Neg logit contributions
 mad-0.001
sing-0.001
 awful-0.001
 angry-0.001
 sour-0.001
Token terrible
Token ID7818
Feature activation-1.59e-01

Pos logit contributions
 All+0.082
 who+0.064
 Using+0.062
Let+0.061
Use+0.060
Neg logit contributions
oto-0.096
 mad-0.094
iri-0.094
 awful-0.092
gh-0.091
Token storm
Token ID6388
Feature activation-1.20e-01

Pos logit contributions
oto+0.208
iri+0.205
 mad+0.202
 awful+0.197
gh+0.193
Neg logit contributions
 All-0.330
 who-0.274
 Using-0.262
Let-0.261
Use-0.258
Token hit
Token ID2277
Feature activation-1.36e-02

Pos logit contributions
 mad+0.187
 awful+0.169
gh+0.162
oto+0.158
 angry+0.155
Neg logit contributions
 All-0.207
 Using-0.197
Anything-0.187
 Taking-0.187
 Anything-0.183
Token the
Token ID262
Feature activation+6.17e-02

Pos logit contributions
 mad+0.031
oto+0.031
iri+0.030
 awful+0.030
gh+0.029
Neg logit contributions
 All-0.015
 Using-0.011
Use-0.010
Find-0.010
 who-0.010
Token towns
Token ID11684
Feature activation+2.76e-01

Pos logit contributions
 All+0.102
 Using+0.080
 who+0.079
Use+0.078
Let+0.077
Neg logit contributions
oto-0.107
 mad-0.107
iri-0.105
 awful-0.103
gh-0.101
Token and
Token ID290
Feature activation-2.99e-03

Pos logit contributions
 All+0.410
 Using+0.364
Use+0.360
Anything+0.346
Find+0.344
Neg logit contributions
 mad-0.510
 awful-0.475
ong-0.467
 angry-0.443
sing-0.440
Token it
Token ID340
Feature activation-8.19e-02

Pos logit contributions
 mad+0.006
 awful+0.006
oto+0.005
 angry+0.005
 ours+0.005
Neg logit contributions
 All-0.004
 Using-0.004
 With-0.003
Use-0.003
 Taking-0.003
Token started
Token ID2067
Feature activation+4.93e-02

Pos logit contributions
 mad+0.142
 awful+0.137
oto+0.130
gh+0.129
iri+0.127
Neg logit contributions
 All-0.135
 Using-0.112
Use-0.109
 Taking-0.106
 who-0.105
Token to
Token ID284
Feature activation-6.54e-02

Pos logit contributions
 All+0.082
 who+0.064
 Using+0.064
Let+0.062
Use+0.061
Neg logit contributions
oto-0.087
iri-0.085
 mad-0.084
 awful-0.082
gh-0.081
Token rain
Token ID6290
Feature activation-1.15e-01

Pos logit contributions
oto+0.085
 mad+0.084
iri+0.083
 awful+0.080
gh+0.078
Neg logit contributions
 All-0.137
 Using-0.113
 who-0.113
Use-0.111
Let-0.110
Token safe
Token ID3338
Feature activation-8.18e-02

Pos logit contributions
 All+0.264
 who+0.218
 Using+0.212
Let+0.205
Use+0.205
Neg logit contributions
oto-0.232
iri-0.228
 mad-0.224
 awful-0.218
gh-0.216
Token.
Token ID13
Feature activation+4.89e-03

Pos logit contributions
 mad+0.155
oto+0.151
 awful+0.149
iri+0.148
gh+0.145
Neg logit contributions
 All-0.122
 Using-0.095
Use-0.093
Find-0.092
 Taking-0.090
Token He
Token ID679
Feature activation+3.55e-02

Pos logit contributions
Spread+0.007
Have+0.006
Use+0.006
Make+0.006
Find+0.006
Neg logit contributions
ax-0.007
 mad-0.007
 awful-0.007
oto-0.007
ong-0.006
Token asked
Token ID1965
Feature activation+3.55e-02

Pos logit contributions
 All+0.054
 Flying+0.043
 Using+0.043
 Taking+0.043
 Being+0.042
Neg logit contributions
 mad-0.066
oto-0.063
 awful-0.061
gh-0.060
iri-0.060
Token the
Token ID262
Feature activation+5.59e-02

Pos logit contributions
 All+0.058
Use+0.051
 Using+0.049
Find+0.049
+0.048
Neg logit contributions
oto-0.060
 mad-0.059
 awful-0.056
gh-0.056
iri-0.055
Token towns
Token ID11684
Feature activation+2.74e-01

Pos logit contributions
 All+0.106
 Using+0.095
es+0.088
 With+0.088
Use+0.087
Neg logit contributions
 mad-0.086
 awful-0.078
oto-0.074
 angry-0.072
sing-0.071
Tokenfolk
Token ID19956
Feature activation+1.68e-01

Pos logit contributions
 All+0.409
Use+0.334
 Using+0.331
Find+0.323
Let+0.309
Neg logit contributions
 mad-0.512
oto-0.499
 awful-0.480
iri-0.475
gh-0.461
Token if
Token ID611
Feature activation+1.69e-01

Pos logit contributions
 All+0.282
Find+0.253
Use+0.250
 Using+0.241
+0.239
Neg logit contributions
 mad-0.265
oto-0.263
 awful-0.238
iri-0.238
 ours-0.226
Token they
Token ID484
Feature activation+2.14e-02

Pos logit contributions
 All+0.272
 Using+0.254
 With+0.233
Use+0.233
 Taking+0.231
Neg logit contributions
 mad-0.282
oto-0.265
 awful-0.261
gh-0.256
iri-0.244
Token wanted
Token ID2227
Feature activation+2.33e-02

Pos logit contributions
 All+0.027
 Using+0.024
 Taking+0.023
 With+0.023
 Flying+0.022
Neg logit contributions
 mad-0.042
 awful-0.041
oto-0.041
iri-0.039
ax-0.039
Token him
Token ID683
Feature activation-1.97e-01

Pos logit contributions
 All+0.042
 Using+0.034
Use+0.034
Find+0.033
Let+0.033
Neg logit contributions
 mad-0.037
oto-0.036
 awful-0.035
iri-0.035
gh-0.035
Token.
Token ID13
Feature activation+1.69e-02

Pos logit contributions
 mad+0.106
oto+0.103
 awful+0.101
iri+0.101
gh+0.099
Neg logit contributions
 All-0.102
 Using-0.086
Use-0.083
Find-0.081
 Taking-0.081
Token
Token ID198
Feature activation+2.81e-02

Pos logit contributions
Spread+0.024
ieving+0.024
 acts+0.023
es+0.023
pen+0.023
Neg logit contributions
aney-0.021
ong-0.021
sing-0.020
 awful-0.020
oto-0.019
TokenShe
Token ID3347
Feature activation-3.45e-02

Pos logit contributions
 All+0.038
 acts+0.037
 Using+0.032
 Being+0.031
+0.031
Neg logit contributions
oto-0.049
gh-0.047
 mad-0.046
heads-0.045
iri-0.045
Token stopped
Token ID5025
Feature activation-7.56e-02

Pos logit contributions
 mad+0.059
oto+0.055
iri+0.053
 awful+0.053
gh+0.052
Neg logit contributions
 All-0.054
 Using-0.047
 Taking-0.045
Use-0.045
 Flying-0.044
Token in
Token ID287
Feature activation+1.18e-01

Pos logit contributions
 mad+0.138
oto+0.131
iri+0.127
 awful+0.126
sing+0.123
Neg logit contributions
 All-0.120
 Using-0.096
Use-0.095
Find-0.092
Let-0.090
Token towns
Token ID11684
Feature activation+2.73e-01

Pos logit contributions
 All+0.191
 Using+0.158
Find+0.156
 Taking+0.155
Use+0.152
Neg logit contributions
 mad-0.207
oto-0.192
iri-0.188
 awful-0.185
gh-0.178
Token and
Token ID290
Feature activation-7.92e-03

Pos logit contributions
 All+0.415
Use+0.344
Find+0.337
 Using+0.333
Let+0.325
Neg logit contributions
 mad-0.496
oto-0.473
iri-0.465
 awful-0.463
gh-0.450
Token cities
Token ID4736
Feature activation+1.10e-01

Pos logit contributions
 mad+0.013
 angry+0.012
 awful+0.012
unct+0.011
oto+0.011
Neg logit contributions
 All-0.013
 Using-0.012
-0.011
 Flying-0.011
 With-0.011
Token and
Token ID290
Feature activation-8.00e-03

Pos logit contributions
 All+0.164
Use+0.130
 Using+0.130
Find+0.126
Let+0.125
Neg logit contributions
 mad-0.203
oto-0.200
iri-0.193
 awful-0.192
gh-0.191
Token recorded
Token ID6264
Feature activation+4.70e-02

Pos logit contributions
 mad+0.015
 awful+0.014
 angry+0.013
oto+0.013
ong+0.012
Neg logit contributions
 All-0.012
 Using-0.011
-0.010
 With-0.010
Use-0.010
Token the
Token ID262
Feature activation+5.57e-02

Pos logit contributions
 All+0.081
Use+0.074
+0.070
Find+0.070
 Using+0.070
Neg logit contributions
 mad-0.074
 awful-0.072
oto-0.069
iri-0.066
gh-0.066
Token around
Token ID1088
Feature activation+4.65e-02

Pos logit contributions
 All+0.203
 Using+0.177
Use+0.176
+0.171
If+0.167
Neg logit contributions
 mad-0.269
 awful-0.240
ong-0.237
 angry-0.232
gh-0.230
Token him
Token ID683
Feature activation-1.99e-01

Pos logit contributions
 All+0.071
Use+0.065
 Using+0.064
Find+0.062
 Taking+0.062
Neg logit contributions
 mad-0.081
gh-0.073
 awful-0.072
 angry-0.069
oto-0.068
Token,
Token ID11
Feature activation-1.14e-02

Pos logit contributions
 mad+0.372
gh+0.360
 awful+0.354
sing+0.350
oto+0.342
Neg logit contributions
 All-0.284
Use-0.240
If-0.230
 Using-0.229
Find-0.229
Token even
Token ID772
Feature activation-7.13e-02

Pos logit contributions
 mad+0.021
 awful+0.019
oto+0.019
sing+0.018
 angry+0.018
Neg logit contributions
 All-0.017
 Using-0.014
Use-0.014
Find-0.014
If-0.014
Token other
Token ID584
Feature activation+2.59e-02

Pos logit contributions
 mad+0.127
 angry+0.113
 awful+0.109
 blurry+0.102
 sour+0.100
Neg logit contributions
 All-0.122
-0.108
Use-0.104
 Using-0.099
 Flying-0.095
Token towns
Token ID11684
Feature activation+2.72e-01

Pos logit contributions
 All+0.036
Use+0.033
 Using+0.031
Find+0.030
+0.030
Neg logit contributions
 mad-0.049
 awful-0.047
oto-0.046
iri-0.044
gh-0.044
Token,
Token ID11
Feature activation-1.20e-02

Pos logit contributions
 All+0.434
Use+0.405
 Using+0.389
Find+0.387
 Taking+0.369
Neg logit contributions
 mad-0.451
 awful-0.427
oto-0.410
gh-0.405
ong-0.402
Token started
Token ID2067
Feature activation+4.36e-02

Pos logit contributions
 mad+0.021
 awful+0.019
 angry+0.018
sing+0.017
oto+0.017
Neg logit contributions
 All-0.019
 Using-0.016
Use-0.016
-0.016
If-0.016
Token to
Token ID284
Feature activation-7.10e-02

Pos logit contributions
 All+0.078
Use+0.067
 Using+0.065
+0.064
If+0.063
Neg logit contributions
 mad-0.069
 awful-0.063
gh-0.063
oto-0.062
iri-0.060
Token take
Token ID1011
Feature activation+1.08e-01

Pos logit contributions
 mad+0.129
 awful+0.118
gh+0.114
sing+0.111
oto+0.110
Neg logit contributions
 All-0.113
 Using-0.094
 Taking-0.092
Use-0.091
 With-0.091
Token notice
Token ID4003
Feature activation-8.95e-02

Pos logit contributions
 All+0.173
Use+0.145
 Using+0.142
+0.139
 Taking+0.139
Neg logit contributions
 mad-0.187
 awful-0.177
gh-0.174
oto-0.173
iri-0.168
Token!
Token ID0
Feature activation-6.93e-02

Pos logit contributions
 mad+0.142
 awful+0.138
oto+0.135
gh+0.134
iri+0.130
Neg logit contributions
 All-0.138
Use-0.114
 Using-0.113
Find-0.113
Let-0.112
Token
Token ID220
Feature activation-5.55e-02

Pos logit contributions
 mad+0.136
 awful+0.132
oto+0.130
iri+0.128
aney+0.127
Neg logit contributions
 All-0.068
Use-0.064
Spread-0.063
Find-0.062
 ties-0.062
Token
Token ID198
Feature activation+1.72e-02

Pos logit contributions
oto+0.118
 mad+0.115
gh+0.113
iri+0.112
 awful+0.111
Neg logit contributions
 All-0.056
Use-0.048
Find-0.046
-0.044
Spread-0.044
Token
Token ID198
Feature activation+2.01e-02

Pos logit contributions
 All+0.016
 Using+0.011
+0.011
Use+0.010
 who+0.010
Neg logit contributions
oto-0.041
 mad-0.040
iri-0.040
gh-0.039
 awful-0.039
TokenThe
Token ID464
Feature activation+5.73e-02

Pos logit contributions
 All+0.028
 Using+0.023
 acts+0.022
 led+0.022
 Flying+0.021
Neg logit contributions
oto-0.037
 mad-0.037
gh-0.036
 awful-0.036
iri-0.036
Token towns
Token ID11684
Feature activation+2.72e-01

Pos logit contributions
 All+0.111
 Using+0.098
 With+0.092
es+0.091
 Being+0.091
Neg logit contributions
 mad-0.089
 awful-0.081
oto-0.075
gh-0.074
 angry-0.073
Tokenfolk
Token ID19956
Feature activation+1.67e-01

Pos logit contributions
 All+0.381
Use+0.337
 Using+0.331
Find+0.330
Anything+0.321
Neg logit contributions
 mad-0.517
 awful-0.466
 angry-0.444
ong-0.444
oto-0.435
Token searched
Token ID16499
Feature activation+3.83e-02

Pos logit contributions
 All+0.241
Use+0.204
 Using+0.204
+0.202
 Flying+0.201
Neg logit contributions
 mad-0.325
 awful-0.292
iri-0.282
oto-0.282
 angry-0.274
Token high
Token ID1029
Feature activation-1.21e-01

Pos logit contributions
 All+0.061
 Using+0.048
 who+0.048
Use+0.046
Let+0.046
Neg logit contributions
oto-0.068
 mad-0.068
iri-0.067
 awful-0.066
gh-0.065
Token and
Token ID290
Feature activation-1.27e-02

Pos logit contributions
 mad+0.243
oto+0.242
iri+0.238
 awful+0.235
gh+0.230
Neg logit contributions
 All-0.165
 Using-0.122
Use-0.119
 who-0.118
Find-0.117
Token low
Token ID1877
Feature activation-1.17e-01

Pos logit contributions
 mad+0.021
oto+0.020
 awful+0.020
iri+0.019
gh+0.019
Neg logit contributions
 All-0.022
 Using-0.018
Use-0.018
 who-0.017
 Taking-0.017
Token long
Token ID890
Feature activation-1.06e-01

Pos logit contributions
 All+0.060
 who+0.050
Let+0.046
 Using+0.046
Find+0.046
Neg logit contributions
oto-0.052
iri-0.051
 mad-0.049
 awful-0.048
gh-0.046
Token way
Token ID835
Feature activation+1.08e-02

Pos logit contributions
 mad+0.135
oto+0.134
gh+0.133
iri+0.132
 awful+0.121
Neg logit contributions
 All-0.218
 Using-0.187
Use-0.185
 Taking-0.182
 Being-0.181
Token and
Token ID290
Feature activation-2.02e-02

Pos logit contributions
 All+0.018
 Using+0.014
Use+0.014
Let+0.014
Find+0.013
Neg logit contributions
 mad-0.019
oto-0.019
iri-0.018
 awful-0.018
gh-0.018
Token saw
Token ID2497
Feature activation-3.45e-03

Pos logit contributions
 mad+0.035
 awful+0.034
gh+0.033
oto+0.032
iri+0.030
Neg logit contributions
 All-0.032
-0.027
 Using-0.027
Use-0.027
Find-0.026
Token many
Token ID867
Feature activation+8.28e-02

Pos logit contributions
 mad+0.006
 awful+0.006
oto+0.006
gh+0.006
iri+0.005
Neg logit contributions
 All-0.005
-0.005
Use-0.005
 Using-0.005
Find-0.004
Token towns
Token ID11684
Feature activation+2.72e-01

Pos logit contributions
 All+0.164
 Using+0.142
+0.140
Use+0.139
Find+0.139
Neg logit contributions
 mad-0.121
 awful-0.115
oto-0.109
iri-0.103
 angry-0.099
Token,
Token ID11
Feature activation-2.33e-02

Pos logit contributions
 All+0.430
Use+0.371
 Using+0.364
Find+0.359
If+0.348
Neg logit contributions
 mad-0.473
 awful-0.450
oto-0.430
ong-0.424
gh-0.418
Token but
Token ID475
Feature activation+4.72e-02

Pos logit contributions
 mad+0.038
 awful+0.036
oto+0.035
iri+0.033
gh+0.033
Neg logit contributions
 All-0.040
 Using-0.034
-0.034
Use-0.033
If-0.032
Token the
Token ID262
Feature activation+5.11e-02

Pos logit contributions
 All+0.076
Use+0.066
 Using+0.065
+0.064
If+0.064
Neg logit contributions
 mad-0.082
 awful-0.077
oto-0.072
gh-0.070
 angry-0.068
Token best
Token ID1266
Feature activation+2.69e-02

Pos logit contributions
 All+0.090
 Using+0.083
es+0.082
Use+0.080
 Taking+0.078
Neg logit contributions
 mad-0.082
 awful-0.076
 angry-0.069
sing-0.066
oto-0.066
Token thing
Token ID1517
Feature activation+6.88e-02

Pos logit contributions
 All+0.049
 Using+0.044
Use+0.043
+0.042
 Taking+0.042
Neg logit contributions
 mad-0.040
oto-0.039
 awful-0.037
gh-0.036
iri-0.035
Token days
Token ID1528
Feature activation+9.00e-02

Pos logit contributions
 All+0.043
+0.038
Use+0.037
Join+0.037
 Using+0.036
Neg logit contributions
gh-0.052
 mad-0.051
 awful-0.050
oto-0.048
 awesome-0.045
Token and
Token ID290
Feature activation-1.84e-02

Pos logit contributions
 All+0.144
Use+0.115
Find+0.113
 Using+0.112
Let+0.112
Neg logit contributions
 mad-0.156
gh-0.151
oto-0.150
 awful-0.148
iri-0.144
Token days
Token ID1528
Feature activation+9.09e-02

Pos logit contributions
 mad+0.030
 awful+0.028
gh+0.026
 angry+0.025
oto+0.024
Neg logit contributions
 All-0.032
-0.028
Join-0.027
 Flying-0.026
Use-0.026
Token and
Token ID290
Feature activation-1.71e-02

Pos logit contributions
 All+0.159
Use+0.144
Spread+0.144
If+0.141
Anything+0.140
Neg logit contributions
gh-0.122
 mad-0.121
ong-0.118
 awful-0.113
ax-0.112
Token the
Token ID262
Feature activation+5.03e-02

Pos logit contributions
 mad+0.029
 awful+0.027
gh+0.025
oto+0.025
 angry+0.024
Neg logit contributions
 All-0.028
-0.024
Use-0.023
 Using-0.023
 Flying-0.023
Token towns
Token ID11684
Feature activation+2.70e-01

Pos logit contributions
 All+0.085
 Using+0.072
Use+0.069
If+0.067
 Taking+0.067
Neg logit contributions
 mad-0.088
 awful-0.083
sing-0.077
oto-0.076
 angry-0.075
Tokenfolk
Token ID19956
Feature activation+1.62e-01

Pos logit contributions
 All+0.372
Use+0.328
 Using+0.324
 Taking+0.307
Find+0.306
Neg logit contributions
 mad-0.513
 awful-0.481
ong-0.468
gh-0.454
 angry-0.445
Token looked
Token ID3114
Feature activation-8.63e-02

Pos logit contributions
 All+0.231
 Using+0.192
Use+0.190
 Taking+0.188
+0.186
Neg logit contributions
 mad-0.315
 awful-0.288
gh-0.287
ong-0.280
oto-0.280
Token on
Token ID319
Feature activation+8.16e-02

Pos logit contributions
oto+0.137
iri+0.135
 mad+0.135
 awful+0.130
gh+0.128
Neg logit contributions
 All-0.156
 Using-0.125
 who-0.125
Use-0.121
Let-0.121
Token.
Token ID13
Feature activation-2.85e-03

Pos logit contributions
 All+0.136
Use+0.111
 Using+0.108
Find+0.107
If+0.106
Neg logit contributions
 mad-0.138
 awful-0.133
gh-0.128
oto-0.128
iri-0.125
Token
Token ID198
Feature activation+1.22e-02

Pos logit contributions
 awful+0.004
 eighteen+0.004
ax+0.004
aney+0.004
 mad+0.004
Neg logit contributions
Use-0.003
 who-0.003
Will-0.003
Find-0.003
Spread-0.003
Token biggest
Token ID4094
Feature activation+1.22e-02

Pos logit contributions
 All+0.090
 Using+0.074
Use+0.071
 who+0.071
 Taking+0.070
Neg logit contributions
 mad-0.078
oto-0.076
 awful-0.075
iri-0.074
gh-0.071
Token thing
Token ID1517
Feature activation+7.05e-02

Pos logit contributions
 All+0.020
 Using+0.016
Use+0.016
If+0.016
 Taking+0.016
Neg logit contributions
 mad-0.022
 awful-0.021
oto-0.020
iri-0.020
sing-0.019
Token ever
Token ID1683
Feature activation-1.58e-01

Pos logit contributions
 All+0.108
 Using+0.086
Use+0.084
Find+0.083
+0.082
Neg logit contributions
 mad-0.128
oto-0.127
 awful-0.124
gh-0.123
iri-0.121
Token.
Token ID13
Feature activation-7.02e-04

Pos logit contributions
 mad+0.280
oto+0.277
 awful+0.272
iri+0.266
gh+0.264
Neg logit contributions
 All-0.258
 Using-0.206
Use-0.199
Find-0.197
Let-0.194
Token The
Token ID383
Feature activation+1.03e-01

Pos logit contributions
 awful+0.001
ax+0.001
oto+0.001
ong+0.001
 mad+0.001
Neg logit contributions
Use-0.001
Find-0.001
Will-0.001
Spread-0.001
 who-0.001
Token towns
Token ID11684
Feature activation+2.68e-01

Pos logit contributions
 All+0.200
 Using+0.176
Use+0.172
If+0.171
es+0.168
Neg logit contributions
 mad-0.154
 awful-0.139
 angry-0.129
sing-0.128
oto-0.127
Tokenfolk
Token ID19956
Feature activation+1.59e-01

Pos logit contributions
 All+0.376
Use+0.330
 Using+0.323
Find+0.315
 Taking+0.312
Neg logit contributions
 mad-0.503
 awful-0.465
ong-0.447
oto-0.437
ige-0.434
Token cl
Token ID537
Feature activation-5.72e-02

Pos logit contributions
 All+0.227
Use+0.196
 Using+0.192
 Flying+0.188
 Taking+0.188
Neg logit contributions
 mad-0.304
oto-0.275
 awful-0.272
iri-0.270
ong-0.258
Tokenapped
Token ID6320
Feature activation+1.29e-01

Pos logit contributions
 mad+0.100
urg+0.097
ige+0.095
iri+0.095
ong+0.094
Neg logit contributions
 All-0.083
Use-0.073
If-0.069
Let-0.069
Find-0.068
Token and
Token ID290
Feature activation-1.88e-02

Pos logit contributions
 All+0.178
 who+0.141
 Using+0.128
Let+0.126
 With+0.124
Neg logit contributions
oto-0.261
iri-0.260
 mad-0.251
 awful-0.249
gh-0.246
Token cheered
Token ID34767
Feature activation-4.22e-02

Pos logit contributions
 mad+0.031
 awful+0.028
oto+0.026
 angry+0.025
sing+0.025
Neg logit contributions
 All-0.031
-0.029
If-0.028
Use-0.028
 Using-0.028
Token little
Token ID1310
Feature activation+8.08e-02

Pos logit contributions
 All+0.091
 Using+0.073
 who+0.072
Use+0.071
Let+0.070
Neg logit contributions
 mad-0.091
oto-0.090
iri-0.089
 awful-0.088
gh-0.085
Token cracks
Token ID23217
Feature activation+6.17e-02

Pos logit contributions
 All+0.130
 Using+0.106
Use+0.100
 Taking+0.100
Find+0.099
Neg logit contributions
 mad-0.146
 awful-0.137
iri-0.134
oto-0.133
sing-0.130
Token and
Token ID290
Feature activation-1.77e-02

Pos logit contributions
 All+0.088
 Using+0.068
Use+0.067
Find+0.066
 Taking+0.065
Neg logit contributions
 mad-0.119
oto-0.117
 awful-0.116
iri-0.114
gh-0.113
Token cre
Token ID1126
Feature activation-8.51e-02

Pos logit contributions
 mad+0.031
 awful+0.029
sing+0.028
oto+0.027
 ours+0.027
Neg logit contributions
 All-0.030
 Using-0.024
If-0.023
-0.023
Use-0.023
Tokenv
Token ID85
Feature activation+7.86e-02

Pos logit contributions
oto+0.139
iri+0.134
 awful+0.132
gh+0.132
 mad+0.131
Neg logit contributions
 All-0.145
 who-0.121
 Using-0.121
If-0.117
Use-0.116
Tokenices
Token ID1063
Feature activation+2.67e-01

Pos logit contributions
 All+0.120
Use+0.096
 Using+0.095
Find+0.095
 who+0.093
Neg logit contributions
oto-0.142
 mad-0.142
iri-0.140
gh-0.137
 awful-0.136
Token of
Token ID286
Feature activation+1.10e-01

Pos logit contributions
 All+0.390
 Using+0.299
Use+0.294
Find+0.286
Let+0.285
Neg logit contributions
 mad-0.506
oto-0.503
iri-0.492
 awful-0.491
gh-0.482
Token the
Token ID262
Feature activation+5.23e-02

Pos logit contributions
 All+0.180
+0.152
 Using+0.148
Use+0.145
 Taking+0.142
Neg logit contributions
 mad-0.197
 awful-0.190
oto-0.181
iri-0.176
sing-0.170
Token rocks
Token ID12586
Feature activation+7.35e-02

Pos logit contributions
 All+0.089
 Using+0.073
 Taking+0.069
 Being+0.069
Use+0.069
Neg logit contributions
 mad-0.091
 awful-0.088
oto-0.086
iri-0.084
gh-0.081
Token and
Token ID290
Feature activation-1.70e-02

Pos logit contributions
 All+0.108
 Using+0.083
Use+0.081
 Taking+0.079
Find+0.079
Neg logit contributions
 mad-0.141
oto-0.137
 awful-0.136
iri-0.134
gh-0.132
Token trees
Token ID7150
Feature activation+8.40e-02

Pos logit contributions
 mad+0.030
 awful+0.028
sing+0.027
oto+0.026
 ours+0.026
Neg logit contributions
 All-0.028
 Using-0.023
Use-0.022
If-0.022
-0.022
Token".
Token ID1911
Feature activation-7.87e-03

Pos logit contributions
 All+0.037
Use+0.030
 Using+0.030
Find+0.030
Let+0.030
Neg logit contributions
 mad-0.030
 awful-0.029
oto-0.029
iri-0.027
gh-0.027
Token He
Token ID679
Feature activation+4.33e-02

Pos logit contributions
sing+0.014
oto+0.014
 mad+0.013
 awful+0.013
iri+0.013
Neg logit contributions
Use-0.009
Find-0.009
Spread-0.008
Join-0.008
 All-0.008
Token tried
Token ID3088
Feature activation+8.16e-02

Pos logit contributions
 All+0.060
 Using+0.050
 Taking+0.048
Use+0.047
 With+0.046
Neg logit contributions
 mad-0.084
oto-0.084
iri-0.083
gh-0.081
 awful-0.079
Token many
Token ID867
Feature activation+9.04e-02

Pos logit contributions
 All+0.136
 who+0.106
 Using+0.106
Use+0.102
Let+0.102
Neg logit contributions
oto-0.141
 mad-0.140
iri-0.139
 awful-0.136
gh-0.133
Token different
Token ID1180
Feature activation-5.93e-02

Pos logit contributions
 All+0.161
+0.135
Use+0.134
 Using+0.132
Find+0.130
Neg logit contributions
 mad-0.150
oto-0.143
 awful-0.140
iri-0.135
gh-0.133
Token methods
Token ID5050
Feature activation+2.67e-01

Pos logit contributions
 mad+0.105
oto+0.102
 awful+0.099
iri+0.095
gh+0.095
Neg logit contributions
 All-0.094
Use-0.079
-0.079
Find-0.078
Let-0.075
Token,
Token ID11
Feature activation-1.03e-02

Pos logit contributions
 All+0.386
 who+0.301
 Using+0.288
 Taking+0.274
Let+0.273
Neg logit contributions
oto-0.517
iri-0.515
 mad-0.509
 awful-0.500
gh-0.488
Token but
Token ID475
Feature activation+6.16e-02

Pos logit contributions
 mad+0.017
 awful+0.016
 sour+0.015
sing+0.014
 angry+0.014
Neg logit contributions
 All-0.018
-0.016
 Using-0.015
Use-0.015
Join-0.015
Token nothing
Token ID2147
Feature activation-1.42e-02

Pos logit contributions
 All+0.099
 Using+0.084
Use+0.081
 With+0.079
Join+0.079
Neg logit contributions
 mad-0.108
 awful-0.101
gh-0.098
sing-0.098
oto-0.097
Token seemed
Token ID3947
Feature activation+1.36e-02

Pos logit contributions
 awful+0.024
 mad+0.024
oto+0.023
gh+0.023
iri+0.021
Neg logit contributions
 All-0.024
 Using-0.020
-0.020
 Taking-0.019
Use-0.019
Token to
Token ID284
Feature activation-6.69e-02

Pos logit contributions
 All+0.025
 Using+0.020
Use+0.019
 who+0.019
Let+0.019
Neg logit contributions
 mad-0.022
oto-0.021
 awful-0.021
iri-0.021
gh-0.020
Token finished
Token ID5201
Feature activation-8.87e-02

Pos logit contributions
 mad+0.224
oto+0.219
 awful+0.216
iri+0.216
gh+0.215
Neg logit contributions
 All-0.078
 Using-0.053
 who-0.049
Use-0.048
 Taking-0.047
Token its
Token ID663
Feature activation-1.32e-01

Pos logit contributions
oto+0.174
iri+0.173
 mad+0.171
 awful+0.170
gh+0.162
Neg logit contributions
 All-0.129
 Using-0.096
 who-0.095
 With-0.091
Let-0.091
Token bouncing
Token ID30713
Feature activation+9.11e-03

Pos logit contributions
iri+0.236
oto+0.235
 awful+0.226
 mad+0.225
urger+0.223
Neg logit contributions
 All-0.213
 Using-0.158
 led-0.157
 who-0.156
 Flying-0.156
Token,
Token ID11
Feature activation-2.23e-02

Pos logit contributions
 All+0.015
 Using+0.012
Use+0.012
 who+0.012
 Taking+0.012
Neg logit contributions
 mad-0.015
oto-0.015
gh-0.015
iri-0.014
 awful-0.014
Token the
Token ID262
Feature activation+4.83e-02

Pos logit contributions
 mad+0.039
 awful+0.034
 angry+0.033
sing+0.032
 ours+0.032
Neg logit contributions
 Using-0.034
 All-0.033
Use-0.032
If-0.032
pill-0.032
Token towns
Token ID11684
Feature activation+2.67e-01

Pos logit contributions
 All+0.097
 Using+0.089
 Did+0.086
 With+0.086
 Taking+0.084
Neg logit contributions
 mad-0.066
 awful-0.059
oto-0.059
ige-0.055
 angry-0.055
Tokenfolk
Token ID19956
Feature activation+1.57e-01

Pos logit contributions
 All+0.374
 Using+0.342
Use+0.331
 Taking+0.323
Find+0.312
Neg logit contributions
 mad-0.500
 awful-0.460
ong-0.451
ige-0.437
oto-0.436
Token thanked
Token ID26280
Feature activation-1.56e-02

Pos logit contributions
 All+0.240
 Using+0.205
Use+0.200
+0.198
 Taking+0.196
Neg logit contributions
 mad-0.289
oto-0.272
iri-0.267
 awful-0.260
gh-0.252
Token the
Token ID262
Feature activation+4.83e-02

Pos logit contributions
 mad+0.026
oto+0.026
iri+0.025
 awful+0.024
gh+0.023
Neg logit contributions
 All-0.026
 Using-0.022
Use-0.022
Find-0.021
 Taking-0.021
Token rope
Token ID17182
Feature activation+3.43e-02

Pos logit contributions
 All+0.090
 Using+0.084
 With+0.079
 Did+0.078
es+0.077
Neg logit contributions
 mad-0.071
 awful-0.064
oto-0.063
ige-0.060
 angry-0.060
Token for
Token ID329
Feature activation+2.64e-02

Pos logit contributions
 All+0.059
Use+0.050
Find+0.049
If+0.048
 Using+0.048
Neg logit contributions
 mad-0.055
oto-0.055
iri-0.052
 awful-0.052
sing-0.051
Token a
Token ID257
Feature activation+5.44e-02

Pos logit contributions
 All+0.034
 who+0.020
Let+0.018
Use+0.015
There+0.014
Neg logit contributions
oto-0.110
 mad-0.110
iri-0.109
 awful-0.107
urger-0.107
Token girl
Token ID2576
Feature activation+1.69e-01

Pos logit contributions
 All+0.073
 who+0.061
Let+0.054
Find+0.051
 Using+0.051
Neg logit contributions
oto-0.109
 awful-0.108
 mad-0.107
gh-0.106
iri-0.106
Token who
Token ID508
Feature activation+2.08e-01

Pos logit contributions
 All+0.235
 Using+0.189
Use+0.185
Find+0.184
 Believe+0.170
Neg logit contributions
 mad-0.332
iri-0.328
oto-0.322
gh-0.319
 awful-0.308
Token loved
Token ID6151
Feature activation+4.23e-02

Pos logit contributions
 All+0.290
 Using+0.234
 Believe+0.214
 Taking+0.214
+0.214
Neg logit contributions
 mad-0.416
gh-0.390
 awful-0.381
iri-0.377
oto-0.372
Token to
Token ID284
Feature activation-6.01e-02

Pos logit contributions
 All+0.072
 who+0.057
 Using+0.057
Let+0.055
Use+0.055
Neg logit contributions
 mad-0.071
oto-0.071
iri-0.070
 awful-0.069
gh-0.067
Token solve
Token ID8494
Feature activation+2.66e-01

Pos logit contributions
oto+0.106
 mad+0.103
iri+0.103
gh+0.101
 awful+0.096
Neg logit contributions
 All-0.092
 Using-0.077
Use-0.075
 Taking-0.075
If-0.075
Token puzzles
Token ID24367
Feature activation+2.60e-01

Pos logit contributions
 All+0.395
Use+0.330
Let+0.327
Find+0.302
 Using+0.297
Neg logit contributions
 mad-0.508
 awful-0.497
 ours-0.484
gh-0.465
 mine-0.455
Token.
Token ID13
Feature activation+1.20e-02

Pos logit contributions
 All+0.470
Use+0.398
Find+0.397
Let+0.393
 Taking+0.389
Neg logit contributions
 mad-0.398
 awful-0.376
oto-0.374
gh-0.367
 ours-0.353
Token On
Token ID1550
Feature activation+1.69e-01

Pos logit contributions
es+0.020
mun+0.019
vered+0.018
 who+0.018
pen+0.018
Neg logit contributions
aney-0.014
oto-0.014
ife-0.013
iri-0.013
alo-0.013
Token this
Token ID428
Feature activation-3.12e-04

Pos logit contributions
 All+0.245
 Using+0.189
 who+0.188
Use+0.185
Let+0.185
Neg logit contributions
oto-0.322
 mad-0.317
iri-0.314
 awful-0.311
gh-0.311
Token special
Token ID2041
Feature activation-3.38e-02

Pos logit contributions
 mad+0.000
gh+0.000
 awful+0.000
oto+0.000
sing+0.000
Neg logit contributions
 All-0.001
Use-0.000
Let-0.000
 who-0.000
 Using-0.000
Token a
Token ID257
Feature activation+5.96e-02

Pos logit contributions
 All+0.109
 Using+0.074
 who+0.072
Use+0.070
Let+0.069
Neg logit contributions
 mad-0.226
oto-0.226
iri-0.223
 awful-0.220
gh-0.216
Token problem
Token ID1917
Feature activation+1.75e-02

Pos logit contributions
 All+0.091
 who+0.079
 Stone+0.069
Let+0.069
 store+0.068
Neg logit contributions
oto-0.107
iri-0.105
 awful-0.104
ax-0.103
urg-0.103
Token he
Token ID339
Feature activation+1.90e-02

Pos logit contributions
 All+0.030
Find+0.027
Use+0.026
+0.026
Let+0.025
Neg logit contributions
 mad-0.027
oto-0.027
gh-0.027
 awful-0.025
sing-0.025
Token couldn
Token ID3521
Feature activation+1.61e-02

Pos logit contributions
 All+0.025
 Using+0.019
Use+0.018
 Taking+0.018
Find+0.018
Neg logit contributions
 mad-0.040
oto-0.039
 awful-0.039
gh-0.038
iri-0.038
Token't
Token ID470
Feature activation-7.65e-02

Pos logit contributions
 All+0.027
 who+0.024
 Using+0.023
Let+0.023
Use+0.023
Neg logit contributions
 mad-0.025
gh-0.024
oto-0.024
 awful-0.024
iri-0.023
Token solve
Token ID8494
Feature activation+2.64e-01

Pos logit contributions
oto+0.157
 mad+0.156
iri+0.156
 awful+0.153
gh+0.147
Neg logit contributions
 All-0.105
 who-0.077
 Using-0.072
Use-0.071
Find-0.070
Token.
Token ID13
Feature activation+2.53e-02

Pos logit contributions
 All+0.457
Use+0.391
Let+0.386
 Using+0.367
Find+0.365
Neg logit contributions
 mad-0.440
gh-0.426
 awful-0.425
oto-0.399
 ours-0.397
Token He
Token ID679
Feature activation+4.71e-02

Pos logit contributions
Use+0.033
Find+0.033
 All+0.032
Spread+0.031
Make+0.031
Neg logit contributions
 awful-0.041
 mad-0.040
oto-0.039
gh-0.039
sing-0.039
Token wanted
Token ID2227
Feature activation+3.05e-02

Pos logit contributions
 All+0.062
 Using+0.047
 Taking+0.044
Find+0.044
Use+0.044
Neg logit contributions
 mad-0.099
 awful-0.096
oto-0.094
gh-0.093
iri-0.092
Token to
Token ID284
Feature activation-6.15e-02

Pos logit contributions
 All+0.056
 Using+0.047
Use+0.046
+0.046
Let+0.045
Neg logit contributions
 mad-0.047
oto-0.045
 awful-0.044
gh-0.044
iri-0.043
Token have
Token ID423
Feature activation+1.51e-01

Pos logit contributions
 mad+0.118
 awful+0.112
gh+0.112
oto+0.111
iri+0.107
Neg logit contributions
 All-0.087
Let-0.069
 Using-0.069
Use-0.068
 who-0.068
Token had
Token ID550
Feature activation+1.01e-01

Pos logit contributions
 All+0.059
 Using+0.044
 Taking+0.041
Use+0.040
Find+0.040
Neg logit contributions
oto-0.110
 mad-0.110
iri-0.108
gh-0.107
 awful-0.107
Token a
Token ID257
Feature activation+6.13e-02

Pos logit contributions
 All+0.117
 Using+0.084
Use+0.079
 who+0.079
 Taking+0.077
Neg logit contributions
 mad-0.224
oto-0.223
iri-0.220
 awful-0.218
gh-0.214
Token tough
Token ID5802
Feature activation-2.84e-02

Pos logit contributions
 All+0.096
 who+0.083
Let+0.076
 Stone+0.074
Find+0.073
Neg logit contributions
oto-0.107
 awful-0.105
iri-0.104
urg-0.102
ax-0.102
Token problem
Token ID1917
Feature activation+1.52e-02

Pos logit contributions
oto+0.046
 awful+0.046
 mad+0.046
iri+0.045
gh+0.042
Neg logit contributions
 All-0.049
 who-0.039
 Using-0.038
Let-0.037
 With-0.037
Token to
Token ID284
Feature activation-5.77e-02

Pos logit contributions
 All+0.027
Find+0.024
Use+0.024
Let+0.023
+0.023
Neg logit contributions
gh-0.023
 mad-0.022
oto-0.022
 awful-0.021
sing-0.020
Token solve
Token ID8494
Feature activation+2.63e-01

Pos logit contributions
 mad+0.108
oto+0.106
gh+0.104
iri+0.104
 awful+0.104
Neg logit contributions
 All-0.084
 Using-0.068
Let-0.067
 Taking-0.066
Use-0.066
Token.
Token ID13
Feature activation+2.07e-02

Pos logit contributions
 All+0.448
Use+0.383
Let+0.376
Find+0.364
 Using+0.355
Neg logit contributions
 mad-0.447
 awful-0.432
gh-0.424
 ours-0.404
oto-0.403
Token So
Token ID1406
Feature activation+1.11e-01

Pos logit contributions
es+0.030
Use+0.029
+0.029
Spread+0.028
Find+0.027
Neg logit contributions
oto-0.030
 mad-0.028
 awful-0.028
iri-0.028
ax-0.027
Token he
Token ID339
Feature activation+1.96e-02

Pos logit contributions
 All+0.199
Use+0.175
 Using+0.173
Find+0.172
pill+0.170
Neg logit contributions
 mad-0.167
gh-0.161
oto-0.158
iri-0.154
 awful-0.154
Token headed
Token ID9153
Feature activation+2.27e-02

Pos logit contributions
 All+0.030
 Taking+0.024
 Using+0.024
 Being+0.023
 Flying+0.023
Neg logit contributions
 mad-0.036
oto-0.034
 awful-0.034
gh-0.034
iri-0.033
Token off
Token ID572
Feature activation-1.01e-01

Pos logit contributions
 All+0.038
 Using+0.030
 who+0.029
 With+0.028
Let+0.028
Neg logit contributions
iri-0.041
 mad-0.040
 awful-0.040
oto-0.040
gh-0.038
Token a
Token ID257
Feature activation+5.87e-02

Pos logit contributions
 All+0.047
 Using+0.031
 who+0.031
Use+0.030
Let+0.030
Neg logit contributions
 mad-0.103
oto-0.103
iri-0.102
 awful-0.100
gh-0.099
Token boy
Token ID2933
Feature activation+1.32e-01

Pos logit contributions
 All+0.086
 who+0.069
Let+0.063
 Using+0.062
Use+0.062
Neg logit contributions
oto-0.112
iri-0.110
 mad-0.110
 awful-0.109
gh-0.108
Token who
Token ID508
Feature activation+2.01e-01

Pos logit contributions
 All+0.178
 Using+0.142
Use+0.141
Find+0.138
+0.131
Neg logit contributions
 mad-0.270
oto-0.255
gh-0.252
iri-0.252
 awful-0.243
Token loved
Token ID6151
Feature activation+3.32e-02

Pos logit contributions
 All+0.258
 Using+0.206
Use+0.191
Find+0.189
 Taking+0.185
Neg logit contributions
 mad-0.422
gh-0.403
 awful-0.400
oto-0.400
iri-0.388
Token to
Token ID284
Feature activation-5.88e-02

Pos logit contributions
 All+0.057
 Using+0.046
 who+0.045
Use+0.045
Let+0.044
Neg logit contributions
 mad-0.055
oto-0.055
iri-0.053
 awful-0.053
gh-0.051
Token solve
Token ID8494
Feature activation+2.63e-01

Pos logit contributions
oto+0.110
 mad+0.110
iri+0.107
gh+0.106
 awful+0.104
Neg logit contributions
 All-0.085
 Using-0.068
Use-0.066
 Taking-0.065
 who-0.065
Token puzzles
Token ID24367
Feature activation+2.56e-01

Pos logit contributions
 All+0.464
Use+0.412
Let+0.407
+0.390
Find+0.385
Neg logit contributions
 mad-0.433
 awful-0.418
gh-0.393
 ours-0.388
 angry-0.370
Token.
Token ID13
Feature activation+2.40e-02

Pos logit contributions
 All+0.476
Use+0.414
Find+0.412
Let+0.406
+0.400
Neg logit contributions
 mad-0.366
oto-0.350
 awful-0.348
gh-0.342
 ours-0.331
Token Every
Token ID3887
Feature activation+1.27e-01

Pos logit contributions
es+0.035
Use+0.032
 All+0.032
Find+0.031
 who+0.031
Neg logit contributions
oto-0.039
sing-0.034
iri-0.034
 mad-0.033
gh-0.033
Token day
Token ID1110
Feature activation+3.02e-02

Pos logit contributions
 All+0.231
 Using+0.195
 who+0.188
Use+0.188
Find+0.186
Neg logit contributions
oto-0.192
 mad-0.191
gh-0.186
 awful-0.184
iri-0.183
Token,
Token ID11
Feature activation-9.37e-04

Pos logit contributions
 All+0.051
 Using+0.043
Use+0.042
Find+0.041
 who+0.041
Neg logit contributions
 mad-0.049
oto-0.048
gh-0.048
iri-0.048
 awful-0.046
Token,
Token ID11
Feature activation+5.56e-04

Pos logit contributions
 All+0.102
 Using+0.082
Use+0.081
Find+0.081
 Taking+0.077
Neg logit contributions
 mad-0.126
oto-0.124
iri-0.120
 awful-0.118
gh-0.113
Token but
Token ID475
Feature activation+6.80e-02

Pos logit contributions
 All+0.001
+0.001
 Using+0.001
Find+0.001
Use+0.001
Neg logit contributions
 mad-0.001
oto-0.001
 awful-0.001
iri-0.001
gh-0.001
Token she
Token ID673
Feature activation+3.27e-02

Pos logit contributions
 All+0.105
 Using+0.088
Find+0.088
Use+0.085
+0.082
Neg logit contributions
 mad-0.123
oto-0.119
 awful-0.119
iri-0.114
gh-0.107
Token wanted
Token ID2227
Feature activation+3.30e-02

Pos logit contributions
 All+0.039
 Using+0.031
Find+0.028
 Taking+0.028
Use+0.027
Neg logit contributions
 mad-0.072
 awful-0.069
oto-0.068
gh-0.068
iri-0.067
Token to
Token ID284
Feature activation-5.95e-02

Pos logit contributions
 All+0.057
 Using+0.046
Use+0.045
 who+0.045
Find+0.044
Neg logit contributions
 mad-0.055
oto-0.054
 awful-0.053
iri-0.053
gh-0.051
Token solve
Token ID8494
Feature activation+2.63e-01

Pos logit contributions
 mad+0.112
oto+0.111
iri+0.109
 awful+0.107
gh+0.106
Neg logit contributions
 All-0.087
 Using-0.069
 who-0.067
Use-0.067
 Taking-0.066
Token a
Token ID257
Feature activation+5.50e-02

Pos logit contributions
 All+0.421
Use+0.351
Let+0.348
Find+0.338
 Using+0.332
Neg logit contributions
 mad-0.476
 awful-0.456
 ours-0.432
gh-0.424
oto-0.418
Token big
Token ID1263
Feature activation-8.48e-02

Pos logit contributions
 All+0.133
 Using+0.115
 who+0.113
Use+0.113
Let+0.113
Neg logit contributions
 mad-0.054
 awful-0.049
gh-0.048
oto-0.046
iri-0.044
Token mystery
Token ID10715
Feature activation+6.83e-02

Pos logit contributions
 mad+0.124
 awful+0.110
gh+0.108
 mine+0.104
oto+0.103
Neg logit contributions
 All-0.165
If-0.141
Use-0.139
Let-0.138
 Using-0.138
Token.
Token ID13
Feature activation+1.74e-02

Pos logit contributions
 All+0.124
 Using+0.105
Use+0.105
Let+0.105
Find+0.103
Neg logit contributions
 mad-0.103
 awful-0.103
 ours-0.091
gh-0.091
ong-0.091
Token
Token ID198
Feature activation+3.23e-02

Pos logit contributions
 All+0.019
Use+0.018
Find+0.018
es+0.018
pen+0.017
Neg logit contributions
oto-0.031
 mad-0.031
aney-0.031
iri-0.030
 awful-0.030
Token with
Token ID351
Feature activation+1.74e-01

Pos logit contributions
 All+0.138
 Taking+0.115
 Using+0.115
 Being+0.110
Use+0.109
Neg logit contributions
oto-0.189
 mad-0.189
gh-0.188
 awful-0.179
iri-0.179
Token her
Token ID607
Feature activation-9.90e-02

Pos logit contributions
 All+0.310
 who+0.247
 Using+0.245
Let+0.238
Use+0.237
Neg logit contributions
oto-0.281
 mad-0.281
iri-0.279
 awful-0.273
gh-0.267
Token toys
Token ID14958
Feature activation+1.70e-01

Pos logit contributions
oto+0.190
 mad+0.189
iri+0.184
 awful+0.180
gh+0.177
Neg logit contributions
 All-0.148
 Using-0.114
Use-0.111
 Taking-0.109
Find-0.107
Token and
Token ID290
Feature activation-1.22e-03

Pos logit contributions
 All+0.298
Find+0.260
 Taking+0.258
Use+0.256
 Using+0.254
Neg logit contributions
 mad-0.264
oto-0.252
 awful-0.247
gh-0.240
iri-0.232
Token do
Token ID466
Feature activation+1.18e-01

Pos logit contributions
 mad+0.002
oto+0.002
iri+0.002
 awful+0.002
gh+0.002
Neg logit contributions
 All-0.002
 Using-0.002
Use-0.001
 Taking-0.001
 With-0.001
Token puzzles
Token ID24367
Feature activation+2.63e-01

Pos logit contributions
 All+0.215
 Using+0.180
Use+0.179
Let+0.176
Find+0.172
Neg logit contributions
 mad-0.182
gh-0.180
 awful-0.173
oto-0.171
iri-0.166
Token.
Token ID13
Feature activation+1.50e-02

Pos logit contributions
 All+0.373
 Using+0.282
Let+0.278
Use+0.278
 Taking+0.276
Neg logit contributions
 mad-0.514
oto-0.508
 awful-0.495
gh-0.491
iri-0.486
Token One
Token ID1881
Feature activation+7.14e-02

Pos logit contributions
 All+0.018
Use+0.016
es+0.016
Find+0.016
 Using+0.015
Neg logit contributions
oto-0.028
iri-0.027
 mad-0.026
sing-0.026
 awful-0.025
Token day
Token ID1110
Feature activation+3.20e-02

Pos logit contributions
 All+0.111
 who+0.086
Let+0.082
 Using+0.081
Use+0.079
Neg logit contributions
iri-0.133
 mad-0.133
oto-0.132
 awful-0.130
gh-0.127
Token,
Token ID11
Feature activation-1.87e-03

Pos logit contributions
 All+0.054
 Using+0.045
Use+0.045
Find+0.043
Let+0.043
Neg logit contributions
 mad-0.052
oto-0.051
gh-0.050
iri-0.050
 awful-0.049
Token Lily
Token ID20037
Feature activation-2.27e-04

Pos logit contributions
 mad+0.003
oto+0.003
iri+0.003
 awful+0.003
gh+0.003
Neg logit contributions
 All-0.003
 Using-0.003
 Taking-0.003
Use-0.003
 Being-0.003
Token he
Token ID339
Feature activation+1.58e-02

Pos logit contributions
oto+0.067
 mad+0.067
 awful+0.066
gh+0.066
iri+0.065
Neg logit contributions
 All-0.058
 Using-0.046
Use-0.046
Let-0.045
Find-0.045
Token might
Token ID1244
Feature activation-2.90e-02

Pos logit contributions
 All+0.021
 Using+0.018
 Taking+0.016
Use+0.016
If+0.016
Neg logit contributions
 mad-0.030
 awful-0.030
gh-0.029
oto-0.029
iri-0.028
Token be
Token ID307
Feature activation-2.32e-02

Pos logit contributions
 mad+0.051
oto+0.051
iri+0.050
 awful+0.049
gh+0.048
Neg logit contributions
 All-0.047
 Using-0.037
 who-0.036
Use-0.036
Let-0.036
Token able
Token ID1498
Feature activation+2.41e-03

Pos logit contributions
 mad+0.035
oto+0.035
iri+0.034
 awful+0.034
gh+0.033
Neg logit contributions
 All-0.043
 Using-0.035
 who-0.034
Use-0.034
Let-0.034
Token to
Token ID284
Feature activation-6.28e-02

Pos logit contributions
 All+0.004
 who+0.003
 Using+0.003
Let+0.003
Find+0.003
Neg logit contributions
iri-0.005
oto-0.005
 mad-0.004
 awful-0.004
gh-0.004
Token solve
Token ID8494
Feature activation+2.63e-01

Pos logit contributions
 mad+0.113
oto+0.108
iri+0.107
gh+0.107
 awful+0.106
Neg logit contributions
 All-0.097
 Using-0.081
Let-0.080
 Taking-0.080
 Being-0.078
Token it
Token ID340
Feature activation-7.94e-02

Pos logit contributions
 All+0.432
Use+0.358
Let+0.351
 Using+0.345
Find+0.334
Neg logit contributions
 mad-0.467
 awful-0.446
gh-0.443
oto-0.430
 ours-0.426
Token.
Token ID13
Feature activation+1.04e-02

Pos logit contributions
 mad+0.131
 awful+0.128
gh+0.124
oto+0.123
ong+0.117
Neg logit contributions
 All-0.130
Use-0.113
Let-0.112
Find-0.110
-0.108
Token
Token ID198
Feature activation+2.45e-02

Pos logit contributions
 acts+0.013
Use+0.013
Spread+0.013
+0.013
Make+0.012
Neg logit contributions
oto-0.016
ax-0.015
 mad-0.015
iri-0.015
 awful-0.015
Token
Token ID198
Feature activation+2.59e-02

Pos logit contributions
 All+0.025
 Using+0.018
+0.018
Use+0.017
Find+0.017
Neg logit contributions
oto-0.056
 mad-0.054
iri-0.054
gh-0.054
 awful-0.052
TokenHe
Token ID1544
Feature activation-1.87e-02

Pos logit contributions
 All+0.041
 acts+0.037
 led+0.034
 Using+0.034
 With+0.032
Neg logit contributions
oto-0.041
 mad-0.041
gh-0.041
 awful-0.040
heads-0.040
Token Max
Token ID5436
Feature activation-1.99e-02

Pos logit contributions
 mad+0.025
oto+0.024
iri+0.023
gh+0.022
sing+0.022
Neg logit contributions
 All-0.024
Use-0.020
Find-0.020
 Using-0.020
Let-0.019
Token filled
Token ID5901
Feature activation+1.17e-02

Pos logit contributions
 mad+0.037
oto+0.037
iri+0.036
 awful+0.035
ige+0.034
Neg logit contributions
 All-0.029
 Taking-0.023
 Using-0.023
 Flying-0.022
Find-0.022
Token Max
Token ID5436
Feature activation-2.01e-02

Pos logit contributions
 All+0.017
Use+0.013
+0.013
Find+0.012
 Flying+0.012
Neg logit contributions
 mad-0.023
oto-0.022
 awful-0.022
iri-0.022
gh-0.021
Token's
Token ID338
Feature activation+9.46e-03

Pos logit contributions
 mad+0.038
gh+0.035
 awful+0.035
sing+0.034
heads+0.034
Neg logit contributions
 All-0.030
If-0.025
Use-0.024
Find-0.023
 Being-0.023
Token do
Token ID466
Feature activation+9.77e-02

Pos logit contributions
 All+0.015
 Using+0.012
 who+0.012
Let+0.011
 With+0.011
Neg logit contributions
oto-0.017
iri-0.017
 mad-0.017
 awful-0.016
gh-0.016
Tokengh
Token ID456
Feature activation-3.10e-01

Pos logit contributions
 All+0.168
 Using+0.134
Use+0.132
 who+0.131
Let+0.130
Neg logit contributions
 mad-0.162
oto-0.160
iri-0.157
 awful-0.156
gh-0.156
Tokenouse
Token ID1076
Feature activation+5.14e-02

Pos logit contributions
 mad+0.473
 awful+0.464
oto+0.453
iri+0.448
sing+0.401
Neg logit contributions
 All-0.593
 who-0.488
 Using-0.487
 With-0.472
-0.463
Token with
Token ID351
Feature activation+1.52e-01

Pos logit contributions
 All+0.070
 who+0.052
 Using+0.051
Use+0.049
Let+0.049
Neg logit contributions
oto-0.104
 mad-0.104
iri-0.103
 awful-0.100
gh-0.098
Token cozy
Token ID37438
Feature activation-5.83e-02

Pos logit contributions
 All+0.236
 Using+0.188
Use+0.183
Find+0.180
 Taking+0.180
Neg logit contributions
 mad-0.279
oto-0.274
 awful-0.269
iri-0.267
gh-0.258
Token blankets
Token ID34794
Feature activation+5.55e-02

Pos logit contributions
 mad+0.076
oto+0.075
iri+0.074
 numb+0.073
heads+0.071
Neg logit contributions
 All-0.115
 Taking-0.100
If-0.098
Use-0.098
 Being-0.097
Token and
Token ID290
Feature activation-1.62e-02

Pos logit contributions
 All+0.075
 Using+0.055
 who+0.055
Use+0.053
Let+0.052
Neg logit contributions
 mad-0.113
oto-0.113
iri-0.111
 awful-0.109
gh-0.107
Token and
Token ID290
Feature activation-1.70e-02

Pos logit contributions
 mad+0.549
oto+0.546
iri+0.536
 awful+0.532
gh+0.520
Neg logit contributions
 All-0.350
 Using-0.258
 who-0.252
Use-0.250
Let-0.246
Token said
Token ID531
Feature activation-8.33e-02

Pos logit contributions
 mad+0.030
 awful+0.028
oto+0.027
iri+0.026
gh+0.025
Neg logit contributions
 All-0.029
 Using-0.023
 Flying-0.023
Use-0.023
-0.023
Token,
Token ID11
Feature activation-1.86e-02

Pos logit contributions
oto+0.155
 mad+0.155
iri+0.153
 awful+0.149
gh+0.147
Neg logit contributions
 All-0.127
 Using-0.097
 who-0.097
Use-0.094
Let-0.093
Token "
Token ID366
Feature activation+1.20e-01

Pos logit contributions
 mad+0.041
 awful+0.040
oto+0.039
iri+0.038
gh+0.038
Neg logit contributions
 All-0.021
Use-0.015
 Using-0.015
Find-0.015
If-0.014
TokenU
Token ID52
Feature activation-1.26e-01

Pos logit contributions
 All+0.163
 Using+0.126
 With+0.119
 Flying+0.117
 who+0.116
Neg logit contributions
 mad-0.244
oto-0.237
iri-0.234
 awful-0.233
gh-0.222
Tokengh
Token ID456
Feature activation-3.08e-01

Pos logit contributions
 mad+0.135
sing+0.134
iri+0.125
 awful+0.125
oto+0.122
Neg logit contributions
 All-0.283
-0.241
 Using-0.236
Use-0.233
 who-0.232
Token,
Token ID11
Feature activation-1.74e-02

Pos logit contributions
 awful+0.607
iri+0.602
oto+0.599
 mad+0.592
 mir+0.551
Neg logit contributions
 All-0.430
 who-0.354
-0.335
 Using-0.325
 With-0.321
Token what
Token ID644
Feature activation+8.66e-02

Pos logit contributions
 mad+0.029
 awful+0.028
oto+0.027
iri+0.026
 ours+0.025
Neg logit contributions
 All-0.030
 Using-0.026
Use-0.025
If-0.025
Find-0.024
Token are
Token ID389
Feature activation+2.02e-02

Pos logit contributions
 Using+0.142
+0.139
 All+0.138
 Along+0.132
Would+0.129
Neg logit contributions
 mad-0.150
 awful-0.130
gh-0.129
 done-0.126
oto-0.122
Token you
Token ID345
Feature activation-5.55e-02

Pos logit contributions
 All+0.042
 Using+0.036
Let+0.035
Use+0.035
Find+0.034
Neg logit contributions
 mad-0.026
 awful-0.026
oto-0.025
gh-0.025
 ours-0.024
Token doing
Token ID1804
Feature activation+3.47e-02

Pos logit contributions
 mad+0.090
oto+0.087
 awful+0.085
iri+0.085
gh+0.083
Neg logit contributions
 All-0.100
 Using-0.081
Use-0.079
 who-0.079
Let-0.078
Token He
Token ID679
Feature activation+2.66e-02

Pos logit contributions
 All+0.001
Use+0.001
Find+0.001
Spread+0.001
 led+0.001
Neg logit contributions
 mad-0.001
oto-0.001
iri-0.001
 awful-0.001
gh-0.001
Token shouted
Token ID17293
Feature activation-1.22e-01

Pos logit contributions
 All+0.041
 Taking+0.033
 Using+0.033
 Flying+0.033
+0.032
Neg logit contributions
 mad-0.049
 awful-0.046
oto-0.045
gh-0.044
iri-0.044
Token,
Token ID11
Feature activation-1.31e-02

Pos logit contributions
iri+0.273
oto+0.268
 awful+0.253
 mad+0.252
anut+0.247
Neg logit contributions
 All-0.159
 who-0.119
 With-0.103
 Using-0.100
 with-0.097
Token "
Token ID366
Feature activation+1.28e-01

Pos logit contributions
 mad+0.031
oto+0.031
iri+0.031
 awful+0.030
gh+0.030
Neg logit contributions
 All-0.013
 Using-0.008
 who-0.008
Use-0.008
Let-0.008
TokenAr
Token ID3163
Feature activation-1.26e-01

Pos logit contributions
 All+0.187
 Using+0.150
 who+0.142
 Taking+0.139
 With+0.139
Neg logit contributions
oto-0.242
 mad-0.241
iri-0.238
gh-0.232
 awful-0.230
Tokengh
Token ID456
Feature activation-3.07e-01

Pos logit contributions
 awful+0.157
 mad+0.156
oto+0.151
iri+0.149
sing+0.137
Neg logit contributions
 All-0.264
 who-0.226
 Using-0.219
Use-0.214
Find-0.211
Token mate
Token ID16133
Feature activation-1.55e-03

Pos logit contributions
 awful+0.736
 mad+0.695
iri+0.672
oto+0.672
 unpleasant+0.653
Neg logit contributions
 All-0.336
 who-0.262
-0.248
es-0.246
 With-0.234
Tokenys
Token ID893
Feature activation-6.09e-02

Pos logit contributions
 mad+0.003
 awful+0.002
iri+0.002
oto+0.002
gh+0.002
Neg logit contributions
 All-0.003
 who-0.002
 Using-0.002
 With-0.002
Let-0.002
Token!"
Token ID2474
Feature activation-1.17e-01

Pos logit contributions
oto+0.081
iri+0.078
 mad+0.078
gh+0.076
 awful+0.074
Neg logit contributions
 All-0.123
Use-0.105
 Using-0.104
Let-0.102
 Taking-0.102
Token
Token ID198
Feature activation+1.52e-02

Pos logit contributions
oto+0.240
 mad+0.237
iri+0.233
 awful+0.224
sing+0.221
Neg logit contributions
 All-0.122
Use-0.111
Find-0.104
-0.102
Spread-0.100
Token
Token ID198
Feature activation+1.64e-02

Pos logit contributions
 All+0.014
 Using+0.009
Use+0.009
 who+0.009
Let+0.008
Neg logit contributions
oto-0.037
 mad-0.036
iri-0.036
gh-0.035
 awful-0.035
Token it
Token ID340
Feature activation-8.12e-02

Pos logit contributions
 All+0.306
Let+0.250
Use+0.250
+0.249
 Using+0.247
Neg logit contributions
 mad-0.273
 awful-0.256
gh-0.249
oto-0.246
 ours-0.239
Token back
Token ID736
Feature activation-2.23e-01

Pos logit contributions
 mad+0.133
oto+0.130
 awful+0.127
gh+0.126
iri+0.125
Neg logit contributions
 All-0.142
Use-0.112
 Using-0.112
Let-0.111
Find-0.110
Token to
Token ID284
Feature activation-6.43e-02

Pos logit contributions
 mad+0.292
oto+0.288
iri+0.274
gh+0.268
 awful+0.264
Neg logit contributions
 All-0.458
Use-0.384
Find-0.382
 Using-0.375
-0.372
Token his
Token ID465
Feature activation-9.07e-02

Pos logit contributions
 mad+0.090
 awful+0.083
oto+0.082
gh+0.078
iri+0.076
Neg logit contributions
 All-0.121
-0.114
Use-0.111
 Using-0.110
Find-0.107
Token do
Token ID466
Feature activation+1.12e-01

Pos logit contributions
 mad+0.131
oto+0.129
iri+0.126
 awful+0.124
gh+0.120
Neg logit contributions
 All-0.177
Use-0.146
 Using-0.146
Let-0.143
Find-0.142
Tokengh
Token ID456
Feature activation-3.07e-01

Pos logit contributions
 All+0.332
 Using+0.298
Use+0.295
 Being+0.292
Let+0.292
Neg logit contributions
gh-0.054
oto-0.046
 mad-0.043
iri-0.039
 awful-0.036
Tokenouse
Token ID1076
Feature activation+5.84e-02

Pos logit contributions
 awful+0.462
 mad+0.458
oto+0.418
iri+0.418
sing+0.384
Neg logit contributions
 All-0.606
 who-0.504
 Using-0.493
-0.490
 With-0.484
Token when
Token ID618
Feature activation+1.72e-01

Pos logit contributions
 All+0.084
 Using+0.064
 who+0.062
Use+0.062
Let+0.061
Neg logit contributions
 mad-0.113
oto-0.113
iri-0.110
 awful-0.109
gh-0.108
Token it
Token ID340
Feature activation-8.22e-02

Pos logit contributions
 All+0.272
 Using+0.246
Use+0.235
If+0.230
+0.228
Neg logit contributions
 mad-0.300
gh-0.285
 awful-0.272
oto-0.264
iri-0.256
Token suddenly
Token ID6451
Feature activation+5.26e-02

Pos logit contributions
gh+0.172
 mad+0.171
 awful+0.169
oto+0.162
iri+0.158
Neg logit contributions
 All-0.101
 who-0.080
 Using-0.079
 Being-0.079
 Taking-0.078
Token revealed
Token ID4602
Feature activation+8.26e-03

Pos logit contributions
 All+0.091
 Using+0.075
Use+0.074
 who+0.073
If+0.073
Neg logit contributions
 mad-0.086
 awful-0.084
oto-0.083
gh-0.082
iri-0.081
Token ran
Token ID4966
Feature activation-2.05e-02

Pos logit contributions
 All+0.046
 Using+0.036
If+0.036
Use+0.036
 Taking+0.035
Neg logit contributions
 mad-0.056
iri-0.054
oto-0.054
 awful-0.054
gh-0.052
Token back
Token ID736
Feature activation-2.23e-01

Pos logit contributions
iri+0.037
oto+0.036
 awful+0.035
 mad+0.035
gh+0.034
Neg logit contributions
 All-0.035
 who-0.027
 Using-0.027
 With-0.026
Let-0.026
Token to
Token ID284
Feature activation-7.13e-02

Pos logit contributions
 mad+0.280
oto+0.261
iri+0.240
heads+0.234
sing+0.231
Neg logit contributions
 All-0.468
Use-0.404
Find-0.394
If-0.393
Spread-0.388
Token his
Token ID465
Feature activation-1.02e-01

Pos logit contributions
 mad+0.119
oto+0.111
iri+0.107
 awful+0.106
gh+0.104
Neg logit contributions
 All-0.122
Use-0.104
-0.103
If-0.101
Find-0.101
Token do
Token ID466
Feature activation+1.03e-01

Pos logit contributions
oto+0.175
iri+0.170
 mad+0.168
gh+0.165
 awful+0.165
Neg logit contributions
 All-0.169
 who-0.145
 Using-0.135
 led-0.129
Let-0.129
Tokengh
Token ID456
Feature activation-3.07e-01

Pos logit contributions
 All+0.225
 Using+0.191
Use+0.188
 who+0.185
Let+0.185
Neg logit contributions
 mad-0.123
oto-0.122
gh-0.119
iri-0.118
 awful-0.116
Tokenouse
Token ID1076
Feature activation+5.55e-02

Pos logit contributions
 awful+0.517
 mad+0.508
oto+0.461
iri+0.458
 ours+0.431
Neg logit contributions
 All-0.560
 who-0.469
 Using-0.447
 With-0.442
-0.440
Token but
Token ID475
Feature activation+5.18e-02

Pos logit contributions
 All+0.081
 Using+0.062
Use+0.061
Let+0.060
Find+0.060
Neg logit contributions
 mad-0.106
oto-0.106
iri-0.103
 awful-0.102
gh-0.101
Token it
Token ID340
Feature activation-8.80e-02

Pos logit contributions
 All+0.078
 Using+0.062
Use+0.061
Find+0.060
Let+0.059
Neg logit contributions
 mad-0.098
 awful-0.093
oto-0.093
iri-0.092
gh-0.090
Token was
Token ID373
Feature activation+2.45e-02

Pos logit contributions
 mad+0.210
oto+0.209
iri+0.207
 awful+0.205
gh+0.202
Neg logit contributions
 All-0.087
 Using-0.057
 who-0.055
Use-0.053
Let-0.052
Token too
Token ID1165
Feature activation-1.44e-01

Pos logit contributions
 All+0.042
 who+0.034
 Using+0.034
Let+0.032
 led+0.032
Neg logit contributions
oto-0.041
iri-0.040
 mad-0.038
gh-0.037
ige-0.037
Token the
Token ID262
Feature activation+5.34e-02

Pos logit contributions
 All+0.137
 Using+0.102
 who+0.099
Use+0.098
Let+0.096
Neg logit contributions
 mad-0.218
oto-0.216
iri-0.212
 awful-0.211
gh-0.207
Token bird
Token ID6512
Feature activation+4.67e-02

Pos logit contributions
 All+0.097
 Using+0.080
 who+0.077
Use+0.077
 Taking+0.076
Neg logit contributions
 mad-0.084
oto-0.082
 awful-0.081
iri-0.079
gh-0.077
Token to
Token ID284
Feature activation-7.10e-02

Pos logit contributions
 All+0.086
 who+0.075
 Using+0.069
Let+0.068
 With+0.067
Neg logit contributions
oto-0.071
iri-0.071
 mad-0.070
 awful-0.069
gh-0.068
Token my
Token ID616
Feature activation-1.20e-01

Pos logit contributions
oto+0.154
 mad+0.154
iri+0.152
 awful+0.149
gh+0.147
Neg logit contributions
 All-0.087
 Using-0.061
 who-0.060
Use-0.059
Let-0.058
Token do
Token ID466
Feature activation+1.05e-01

Pos logit contributions
 mad+0.208
oto+0.207
iri+0.204
 awful+0.201
gh+0.195
Neg logit contributions
 All-0.198
 Using-0.157
Use-0.153
 who-0.153
Let-0.151
Tokengh
Token ID456
Feature activation-3.06e-01

Pos logit contributions
 All+0.205
 who+0.168
 Using+0.167
Use+0.163
Let+0.163
Neg logit contributions
oto-0.149
 mad-0.148
iri-0.146
 awful-0.142
gh-0.136
Tokenouse
Token ID1076
Feature activation+5.67e-02

Pos logit contributions
 awful+0.493
 mad+0.482
iri+0.441
oto+0.440
arter+0.397
Neg logit contributions
 All-0.579
 who-0.485
-0.474
 With-0.466
es-0.465
Token.
Token ID13
Feature activation+1.18e-02

Pos logit contributions
 All+0.093
 who+0.076
 Using+0.071
 With+0.068
Let+0.068
Neg logit contributions
iri-0.100
 mad-0.098
oto-0.098
 awful-0.095
gh-0.092
Token We
Token ID775
Feature activation+9.96e-02

Pos logit contributions
pen+0.014
 led+0.014
mun+0.014
 towns+0.014
 puzzles+0.014
Neg logit contributions
sing-0.018
 ours-0.017
iri-0.017
 mad-0.017
oto-0.017
Token can
Token ID460
Feature activation+7.10e-02

Pos logit contributions
 All+0.130
 Taking+0.108
 Using+0.107
 With+0.103
 Being+0.103
Neg logit contributions
 mad-0.201
 awful-0.190
iri-0.186
oto-0.185
ige-0.182
Token make
Token ID787
Feature activation+1.26e-01

Pos logit contributions
 All+0.099
 Using+0.077
 Taking+0.075
Let+0.074
Use+0.074
Neg logit contributions
oto-0.139
 mad-0.138
iri-0.137
 awful-0.132
gh-0.132
Token and
Token ID290
Feature activation-2.42e-02

Pos logit contributions
 mad+0.293
 awful+0.280
oto+0.269
gh+0.252
sing+0.250
Neg logit contributions
 All-0.287
Find-0.259
Use-0.259
-0.257
 Using-0.254
Token went
Token ID1816
Feature activation+6.75e-03

Pos logit contributions
 mad+0.045
oto+0.044
 awful+0.043
iri+0.043
gh+0.041
Neg logit contributions
 All-0.038
 Using-0.029
Use-0.029
 who-0.028
Find-0.028
Token to
Token ID284
Feature activation-7.55e-02

Pos logit contributions
 All+0.013
 who+0.011
 Using+0.011
Let+0.011
 led+0.010
Neg logit contributions
iri-0.010
oto-0.010
 awful-0.009
ige-0.009
gh-0.009
Token his
Token ID465
Feature activation-1.06e-01

Pos logit contributions
 mad+0.142
oto+0.141
iri+0.138
 awful+0.136
gh+0.134
Neg logit contributions
 All-0.115
 Using-0.088
 who-0.087
Use-0.086
Let-0.085
Token do
Token ID466
Feature activation+1.03e-01

Pos logit contributions
oto+0.188
iri+0.188
 awful+0.179
 mad+0.179
gh+0.178
Neg logit contributions
 All-0.174
 who-0.142
 Using-0.135
Let-0.131
 Taking-0.129
Tokengh
Token ID456
Feature activation-3.06e-01

Pos logit contributions
 All+0.210
 Using+0.178
Use+0.176
Find+0.173
Let+0.172
Neg logit contributions
 mad-0.140
gh-0.135
oto-0.133
 awful-0.130
iri-0.127
Tokenouse
Token ID1076
Feature activation+5.83e-02

Pos logit contributions
 awful+0.537
 mad+0.509
iri+0.507
oto+0.494
 mir+0.461
Neg logit contributions
 All-0.535
 who-0.449
es-0.420
 Using-0.411
 With-0.410
Token to
Token ID284
Feature activation-7.53e-02

Pos logit contributions
 All+0.087
 who+0.066
 Using+0.066
Use+0.063
Let+0.063
Neg logit contributions
oto-0.110
 mad-0.110
iri-0.110
 awful-0.107
gh-0.104
Token think
Token ID892
Feature activation-3.68e-02

Pos logit contributions
 mad+0.141
oto+0.139
iri+0.136
 awful+0.135
gh+0.132
Neg logit contributions
 All-0.114
 Using-0.091
Use-0.087
 Taking-0.087
Let-0.086
Token about
Token ID546
Feature activation-4.54e-03

Pos logit contributions
 awful+0.067
 mad+0.067
gh+0.066
oto+0.065
iri+0.061
Neg logit contributions
 All-0.055
 Using-0.046
Use-0.045
Find-0.044
-0.044
Token what
Token ID644
Feature activation+8.59e-02

Pos logit contributions
 mad+0.007
 awful+0.007
gh+0.007
oto+0.006
 mine+0.006
Neg logit contributions
 All-0.008
Use-0.007
 Using-0.007
-0.007
Find-0.007
Token lie
Token ID6486
Feature activation-4.17e-02

Pos logit contributions
 mad+0.089
oto+0.089
iri+0.087
 awful+0.085
gh+0.084
Neg logit contributions
 All-0.087
 Using-0.070
Use-0.068
Let-0.068
 Taking-0.068
Token down
Token ID866
Feature activation+8.40e-03

Pos logit contributions
oto+0.074
 mad+0.073
iri+0.073
 awful+0.072
gh+0.070
Neg logit contributions
 All-0.067
 who-0.053
 Using-0.052
Let-0.050
Use-0.050
Token in
Token ID287
Feature activation+1.18e-01

Pos logit contributions
 All+0.014
 Using+0.011
Use+0.010
 who+0.010
Find+0.010
Neg logit contributions
 mad-0.015
oto-0.015
iri-0.014
 awful-0.014
gh-0.014
Token my
Token ID616
Feature activation-1.19e-01

Pos logit contributions
 All+0.190
Use+0.152
 Using+0.151
Find+0.148
Let+0.148
Neg logit contributions
 mad-0.214
 awful-0.200
 ours-0.198
oto-0.194
iri-0.192
Token do
Token ID466
Feature activation+1.08e-01

Pos logit contributions
oto+0.195
 mad+0.191
iri+0.190
 awful+0.183
gh+0.181
Neg logit contributions
 All-0.210
 who-0.170
 Using-0.167
Let-0.162
Use-0.160
Tokengh
Token ID456
Feature activation-3.06e-01

Pos logit contributions
 All+0.196
 Using+0.161
Use+0.158
 who+0.155
Let+0.155
Neg logit contributions
 mad-0.166
oto-0.166
iri-0.162
gh-0.160
 awful-0.159
Tokenouse
Token ID1076
Feature activation+5.73e-02

Pos logit contributions
 mad+0.438
 awful+0.434
oto+0.405
iri+0.404
sing+0.368
Neg logit contributions
 All-0.617
 who-0.509
 Using-0.500
-0.493
 With-0.490
Token?
Token ID30
Feature activation+6.65e-02

Pos logit contributions
 All+0.104
 Using+0.084
 who+0.083
Use+0.082
Let+0.081
Neg logit contributions
oto-0.089
 mad-0.089
iri-0.087
 awful-0.086
gh-0.084
Token You
Token ID921
Feature activation+7.91e-02

Pos logit contributions
 puzzles+0.096
 led+0.095
mun+0.089
 towns+0.089
 valleys+0.088
Neg logit contributions
 please-0.081
oto-0.075
 okay-0.074
iri-0.074
 ours-0.073
Token can
Token ID460
Feature activation+7.29e-02

Pos logit contributions
 All+0.114
 Using+0.106
 Taking+0.105
If+0.100
 With+0.099
Neg logit contributions
 mad-0.139
 awful-0.131
iri-0.125
ong-0.124
 ours-0.123
Token have
Token ID423
Feature activation+1.41e-01

Pos logit contributions
 All+0.109
 Using+0.090
Let+0.087
 Taking+0.087
Use+0.086
Neg logit contributions
oto-0.133
iri-0.131
 mad-0.131
gh-0.127
 awful-0.125
Token friend
Token ID1545
Feature activation+1.13e-01

Pos logit contributions
oto+0.096
iri+0.094
 mad+0.088
gh+0.087
 awful+0.082
Neg logit contributions
 All-0.100
 Using-0.087
 Being-0.084
Use-0.083
 Taking-0.083
Token said
Token ID531
Feature activation-6.97e-02

Pos logit contributions
 All+0.164
Find+0.139
Use+0.138
+0.137
 Using+0.135
Neg logit contributions
 mad-0.205
oto-0.199
 awful-0.192
iri-0.192
ax-0.184
Token,
Token ID11
Feature activation-5.73e-03

Pos logit contributions
oto+0.132
 mad+0.131
iri+0.130
 awful+0.127
gh+0.123
Neg logit contributions
 All-0.106
 who-0.081
 Using-0.079
Let-0.077
Use-0.076
Token "
Token ID366
Feature activation+1.33e-01

Pos logit contributions
 mad+0.012
oto+0.012
iri+0.011
 awful+0.011
gh+0.011
Neg logit contributions
 All-0.007
 Using-0.005
Use-0.005
Find-0.005
 who-0.005
TokenU
Token ID52
Feature activation-1.17e-01

Pos logit contributions
 All+0.132
 Using+0.095
 With+0.083
 Flying+0.083
 Taking+0.082
Neg logit contributions
oto-0.310
iri-0.304
 mad-0.304
 awful-0.292
gh-0.285
Tokengh
Token ID456
Feature activation-3.05e-01

Pos logit contributions
 mad+0.114
sing+0.104
 awful+0.104
iri+0.100
oto+0.099
Neg logit contributions
 All-0.278
-0.235
 Using-0.232
 who-0.231
Use-0.228
Token,
Token ID11
Feature activation-5.26e-03

Pos logit contributions
 awful+0.620
 mad+0.604
iri+0.593
oto+0.590
 mir+0.542
Neg logit contributions
 All-0.429
 who-0.341
-0.324
 Using-0.320
 With-0.318
Token this
Token ID428
Feature activation-3.34e-03

Pos logit contributions
 mad+0.010
oto+0.009
 awful+0.009
iri+0.009
 ours+0.009
Neg logit contributions
 All-0.008
 Using-0.007
Use-0.006
If-0.006
-0.006
Token part
Token ID636
Feature activation+6.87e-02

Pos logit contributions
oto+0.005
iri+0.004
 awful+0.004
 mad+0.004
gh+0.004
Neg logit contributions
 All-0.006
 Using-0.006
Find-0.005
 Being-0.005
 Working-0.005
Token is
Token ID318
Feature activation-7.71e-04

Pos logit contributions
 All+0.093
Find+0.083
 Using+0.081
Use+0.080
Would+0.079
Neg logit contributions
 mad-0.124
 awful-0.122
oto-0.118
ong-0.116
 ours-0.115
Token so
Token ID523
Feature activation-1.41e-01

Pos logit contributions
oto+0.001
 mad+0.001
iri+0.001
 awful+0.001
gh+0.001
Neg logit contributions
 All-0.001
 who-0.001
 Using-0.001
Use-0.001
Let-0.001
Token took
Token ID1718
Feature activation+8.44e-02

Pos logit contributions
 mad+0.026
 awful+0.025
oto+0.023
iri+0.022
 sour+0.022
Neg logit contributions
 All-0.023
 Flying-0.020
-0.020
If-0.019
 Being-0.019
Token a
Token ID257
Feature activation+3.79e-02

Pos logit contributions
 All+0.118
 Using+0.089
 who+0.088
Use+0.087
Let+0.085
Neg logit contributions
 mad-0.166
oto-0.165
iri-0.162
 awful-0.161
gh-0.158
Token bite
Token ID13197
Feature activation-1.51e-01

Pos logit contributions
 All+0.061
 who+0.051
Find+0.050
 Using+0.049
Let+0.048
Neg logit contributions
oto-0.069
iri-0.068
 awful-0.064
urg-0.063
 mad-0.062
Token.
Token ID13
Feature activation+1.22e-03

Pos logit contributions
oto+0.321
 mad+0.320
iri+0.319
 awful+0.309
gh+0.304
Neg logit contributions
 All-0.193
 who-0.140
 Using-0.137
Let-0.133
Use-0.131
Token U
Token ID471
Feature activation+3.96e-02

Pos logit contributions
 acts+0.002
 towns+0.001
ieving+0.001
 puzzles+0.001
 who+0.001
Neg logit contributions
aney-0.002
ax-0.002
oto-0.002
iri-0.002
ige-0.002
Tokengh
Token ID456
Feature activation-3.04e-01

Pos logit contributions
 All+0.073
 Using+0.059
 who+0.059
Use+0.057
+0.057
Neg logit contributions
 mad-0.060
oto-0.060
iri-0.059
 awful-0.058
sing-0.056
Token!
Token ID0
Feature activation-6.52e-02

Pos logit contributions
 mad+0.510
 awful+0.506
iri+0.477
oto+0.471
 mir+0.438
Neg logit contributions
 All-0.537
 who-0.458
 Using-0.441
 With-0.439
es-0.429
Token It
Token ID632
Feature activation+1.49e-02

Pos logit contributions
oto+0.106
iri+0.106
ax+0.105
 mad+0.101
ibi+0.101
Neg logit contributions
 acts-0.079
 who-0.076
ieving-0.073
 puzzles-0.072
 towns-0.072
Token was
Token ID373
Feature activation+2.60e-02

Pos logit contributions
 All+0.016
 Using+0.011
 Taking+0.011
 who+0.011
Use+0.011
Neg logit contributions
 mad-0.034
 awful-0.034
oto-0.033
iri-0.033
gh-0.032
Token so
Token ID523
Feature activation-1.54e-01

Pos logit contributions
 All+0.041
 who+0.032
 Using+0.031
 With+0.030
Let+0.030
Neg logit contributions
oto-0.047
iri-0.046
 mad-0.045
gh-0.045
heads-0.045
Token sour
Token ID11348
Feature activation-1.97e-01

Pos logit contributions
oto+0.197
iri+0.196
urger+0.194
heads+0.192
sing+0.189
Neg logit contributions
 All-0.337
 Using-0.275
 With-0.274
 Taking-0.268
Let-0.264
Token and
Token ID290
Feature activation-3.60e-02

Pos logit contributions
 All+0.152
 Using+0.119
 who+0.118
Use+0.116
Let+0.115
Neg logit contributions
oto-0.165
 mad-0.164
iri-0.162
 awful-0.158
gh-0.155
Token pointed
Token ID6235
Feature activation-2.97e-02

Pos logit contributions
 mad+0.062
 awful+0.054
 angry+0.053
heads+0.050
sing+0.050
Neg logit contributions
 All-0.063
-0.052
If-0.051
 Taking-0.051
 Using-0.050
Token at
Token ID379
Feature activation+3.04e-03

Pos logit contributions
oto+0.062
iri+0.062
 awful+0.059
 mad+0.059
gh+0.058
Neg logit contributions
 All-0.041
 who-0.032
 Using-0.029
Let-0.029
 With-0.028
Token the
Token ID262
Feature activation+3.43e-02

Pos logit contributions
 All+0.005
 Using+0.004
Use+0.004
+0.004
Join+0.004
Neg logit contributions
 mad-0.005
 awful-0.005
oto-0.004
iri-0.004
 ours-0.004
Token grass
Token ID8701
Feature activation+1.20e-02

Pos logit contributions
 All+0.062
 Using+0.051
 Taking+0.049
 With+0.048
Use+0.048
Neg logit contributions
 mad-0.056
oto-0.053
 awful-0.053
iri-0.052
gh-0.050
Tokenho
Token ID8873
Feature activation-3.04e-01

Pos logit contributions
 All+0.018
 Using+0.014
Use+0.014
 who+0.013
Find+0.013
Neg logit contributions
 mad-0.023
oto-0.023
iri-0.022
 awful-0.022
gh-0.021
Tokenpper
Token ID2848
Feature activation-7.05e-02

Pos logit contributions
 mad+0.351
oto+0.345
 awful+0.340
iri+0.333
gh+0.309
Neg logit contributions
 All-0.672
 who-0.576
 Using-0.567
Use-0.559
Let-0.554
Token.
Token ID13
Feature activation-2.61e-02

Pos logit contributions
 mad+0.115
 awful+0.109
oto+0.106
gh+0.106
iri+0.106
Neg logit contributions
 All-0.119
Use-0.102
If-0.099
Find-0.098
Let-0.096
Token He
Token ID679
Feature activation+1.05e-02

Pos logit contributions
 mad+0.047
oto+0.046
 awful+0.045
iri+0.044
gh+0.044
Neg logit contributions
 All-0.033
Use-0.029
If-0.029
 who-0.029
-0.028
Token said
Token ID531
Feature activation-9.79e-02

Pos logit contributions
 All+0.019
+0.016
 Using+0.016
 Taking+0.016
If+0.016
Neg logit contributions
 mad-0.017
 awful-0.015
oto-0.015
iri-0.015
 angry-0.014
Token,
Token ID11
Feature activation-3.54e-02

Pos logit contributions
oto+0.209
iri+0.206
 mad+0.199
 awful+0.199
sing+0.194
Neg logit contributions
 All-0.131
 who-0.097
 With-0.087
 Using-0.087
 Taking-0.084
Token "
Token ID366
Feature activation+1.17e-01

Pos logit contributions
 All+0.010
Find+0.008
Use+0.008
 Using+0.007
 who+0.007
Neg logit contributions
oto-0.017
iri-0.017
 mad-0.017
 awful-0.016
sing-0.016
TokenIs
Token ID3792
Feature activation+6.74e-02

Pos logit contributions
 All+0.153
 Using+0.117
 With+0.108
 Taking+0.106
 led+0.104
Neg logit contributions
oto-0.237
iri-0.236
 mad-0.235
 awful-0.225
gh-0.219
Token it
Token ID340
Feature activation-9.37e-02

Pos logit contributions
 All+0.102
 Using+0.083
Use+0.080
Let+0.077
 With+0.077
Neg logit contributions
 mad-0.125
oto-0.124
 awful-0.121
iri-0.121
gh-0.118
Token a
Token ID257
Feature activation+3.62e-02

Pos logit contributions
oto+0.195
 mad+0.195
iri+0.193
 awful+0.188
gh+0.187
Neg logit contributions
 All-0.122
 who-0.090
 Using-0.086
Use-0.082
Let-0.082
Token grass
Token ID8701
Feature activation+1.99e-02

Pos logit contributions
 All+0.063
 who+0.053
Let+0.049
Find+0.049
Use+0.048
Neg logit contributions
oto-0.058
iri-0.057
 mad-0.057
 awful-0.055
gh-0.054
Tokenho
Token ID8873
Feature activation-3.03e-01

Pos logit contributions
 All+0.033
 who+0.026
 Using+0.025
Use+0.024
Let+0.024
Neg logit contributions
 mad-0.035
oto-0.035
iri-0.034
 awful-0.034
gh-0.033
Tokenpper
Token ID2848
Feature activation-6.50e-02

Pos logit contributions
oto+0.289
 mad+0.286
iri+0.280
gh+0.266
 awful+0.262
Neg logit contributions
 All-0.744
 Using-0.636
 who-0.623
Let-0.623
Use-0.622
Token?"
Token ID1701
Feature activation-2.48e-02

Pos logit contributions
oto+0.092
 mad+0.092
iri+0.090
 awful+0.088
gh+0.085
Neg logit contributions
 All-0.127
 Using-0.107
Use-0.103
Let-0.102
 Taking-0.102
Token she
Token ID673
Feature activation+9.23e-03

Pos logit contributions
 mad+0.058
oto+0.058
iri+0.057
 awful+0.056
gh+0.054
Neg logit contributions
 All-0.024
 Using-0.017
Use-0.016
 who-0.015
Find-0.015
Token guesses
Token ID44774
Feature activation-3.17e-02

Pos logit contributions
 All+0.014
 Using+0.011
 who+0.010
Use+0.010
Find+0.010
Neg logit contributions
 mad-0.018
oto-0.017
 awful-0.017
iri-0.016
gh-0.016
Token.
Token ID13
Feature activation+1.04e-02

Pos logit contributions
 mad+0.058
oto+0.056
 awful+0.055
iri+0.053
gh+0.053
Neg logit contributions
 All-0.050
 Using-0.040
Find-0.040
Use-0.040
-0.040
Token run
Token ID1057
Feature activation+7.94e-03

Pos logit contributions
 mad+0.008
oto+0.007
iri+0.007
 awful+0.007
gh+0.007
Neg logit contributions
 All-0.007
 Using-0.006
Use-0.005
 Taking-0.005
Let-0.005
Token out
Token ID503
Feature activation-4.01e-02

Pos logit contributions
 All+0.012
 who+0.009
 Using+0.009
Let+0.008
Use+0.008
Neg logit contributions
oto-0.015
iri-0.015
 mad-0.015
 awful-0.015
gh-0.014
Token of
Token ID286
Feature activation+1.09e-01

Pos logit contributions
 mad+0.083
oto+0.079
 awful+0.079
iri+0.077
gh+0.076
Neg logit contributions
 All-0.053
Use-0.040
 Using-0.039
Find-0.039
-0.038
Token his
Token ID465
Feature activation-9.72e-02

Pos logit contributions
 All+0.145
 Using+0.108
Use+0.106
+0.105
 who+0.105
Neg logit contributions
 mad-0.226
oto-0.218
 awful-0.214
iri-0.212
gh-0.208
Token do
Token ID466
Feature activation+1.11e-01

Pos logit contributions
 mad+0.162
oto+0.159
iri+0.156
 awful+0.154
gh+0.150
Neg logit contributions
 All-0.168
 Using-0.133
Use-0.132
 who-0.129
Let-0.129
Tokengh
Token ID456
Feature activation-3.03e-01

Pos logit contributions
 All+0.192
 Using+0.158
Use+0.154
Find+0.151
 Taking+0.151
Neg logit contributions
 mad-0.181
gh-0.179
oto-0.179
 awful-0.173
iri-0.173
Tokenouse
Token ID1076
Feature activation+6.73e-02

Pos logit contributions
 awful+0.475
 mad+0.472
iri+0.437
oto+0.429
arter+0.392
Neg logit contributions
 All-0.585
 who-0.480
 Using-0.466
es-0.464
 With-0.460
Token,
Token ID11
Feature activation-1.03e-02

Pos logit contributions
 All+0.100
 Using+0.078
Use+0.076
Find+0.075
Let+0.075
Neg logit contributions
oto-0.126
 mad-0.125
 awful-0.122
iri-0.121
gh-0.121
Token and
Token ID290
Feature activation-4.60e-03

Pos logit contributions
 mad+0.018
oto+0.017
 awful+0.017
iri+0.017
gh+0.016
Neg logit contributions
 All-0.017
 Using-0.013
Use-0.013
If-0.013
-0.013
Token head
Token ID1182
Feature activation+8.21e-02

Pos logit contributions
 mad+0.008
oto+0.008
 awful+0.008
iri+0.007
gh+0.007
Neg logit contributions
 All-0.007
 Using-0.006
Use-0.006
If-0.006
-0.006
Token straight
Token ID3892
Feature activation-1.15e-01

Pos logit contributions
 All+0.125
 Using+0.096
 who+0.094
Use+0.093
Let+0.092
Neg logit contributions
 mad-0.152
oto-0.152
iri-0.149
 awful-0.147
gh-0.144
Token"
Token ID1
Feature activation+9.14e-02

Pos logit contributions
 All+0.035
 Using+0.028
 who+0.026
Use+0.026
 led+0.026
Neg logit contributions
oto-0.041
 mad-0.041
iri-0.041
gh-0.039
 awful-0.038
TokenThank
Token ID10449
Feature activation+6.34e-03

Pos logit contributions
 All+0.137
 Using+0.115
 Taking+0.108
 With+0.107
es+0.106
Neg logit contributions
oto-0.168
iri-0.163
 mad-0.160
gh-0.149
 awful-0.147
Token you
Token ID345
Feature activation-8.18e-02

Pos logit contributions
 All+0.012
 Using+0.010
 who+0.009
Use+0.009
Let+0.009
Neg logit contributions
 mad-0.010
oto-0.010
iri-0.009
 awful-0.009
gh-0.009
Token for
Token ID329
Feature activation+1.10e-02

Pos logit contributions
iri+0.173
oto+0.172
gh+0.172
 mad+0.169
 awful+0.167
Neg logit contributions
 All-0.099
 who-0.080
 With-0.067
 Using-0.062
 with-0.061
Token letting
Token ID9616
Feature activation+7.54e-02

Pos logit contributions
 All+0.017
 Using+0.014
Use+0.014
Let+0.013
Find+0.013
Neg logit contributions
 mad-0.020
oto-0.020
 awful-0.019
iri-0.019
gh-0.019
Token us
Token ID514
Feature activation-3.02e-01

Pos logit contributions
 All+0.161
 who+0.136
 Using+0.130
Let+0.127
Use+0.127
Neg logit contributions
iri-0.094
 mad-0.094
oto-0.093
sing-0.090
 awful-0.089
Token pet
Token ID4273
Feature activation-4.78e-02

Pos logit contributions
oto+0.575
 mad+0.573
iri+0.567
 awful+0.558
gh+0.546
Neg logit contributions
 All-0.444
 who-0.335
 Using-0.335
Use-0.324
Let-0.321
Token him
Token ID683
Feature activation-2.24e-01

Pos logit contributions
oto+0.079
 mad+0.079
iri+0.077
 awful+0.075
gh+0.072
Neg logit contributions
 All-0.082
 Using-0.067
Use-0.065
 Taking-0.064
Find-0.064
Token,"
Token ID553
Feature activation-1.14e-02

Pos logit contributions
oto+0.363
 mad+0.361
iri+0.361
 awful+0.353
gh+0.347
Neg logit contributions
 All-0.388
 who-0.318
 Using-0.306
 With-0.295
Let-0.294
Token Ben
Token ID3932
Feature activation-6.55e-02

Pos logit contributions
 mad+0.019
oto+0.019
iri+0.018
 awful+0.017
gh+0.017
Neg logit contributions
 All-0.019
 Using-0.016
Use-0.016
Find-0.016
-0.016
Token says
Token ID1139
Feature activation-1.06e-01

Pos logit contributions
 mad+0.110
oto+0.109
iri+0.107
 awful+0.105
gh+0.102
Neg logit contributions
 All-0.112
 Using-0.089
 who-0.087
Use-0.087
Let-0.086
Token lady
Token ID10846
Feature activation+8.78e-03

Pos logit contributions
 All+0.068
 who+0.055
 Using+0.053
Let+0.053
Use+0.052
Neg logit contributions
oto-0.070
 mad-0.070
iri-0.068
 awful-0.068
gh-0.066
Tokenbug
Token ID25456
Feature activation+7.01e-02

Pos logit contributions
 All+0.013
Use+0.011
 Using+0.010
Find+0.010
If+0.010
Neg logit contributions
iri-0.016
 mad-0.016
oto-0.016
gh-0.016
anut-0.015
Token and
Token ID290
Feature activation-1.03e-02

Pos logit contributions
 All+0.126
Use+0.109
 Using+0.108
Find+0.108
If+0.108
Neg logit contributions
iri-0.105
oto-0.103
gh-0.099
 mad-0.096
urger-0.094
Token a
Token ID257
Feature activation+4.14e-02

Pos logit contributions
 mad+0.016
oto+0.016
 awful+0.015
iri+0.015
 sour+0.014
Neg logit contributions
 All-0.016
-0.015
 Using-0.014
 Believe-0.014
 Taking-0.014
Token grass
Token ID8701
Feature activation+2.19e-02

Pos logit contributions
 All+0.070
 who+0.056
 Using+0.055
Let+0.054
Use+0.054
Neg logit contributions
oto-0.070
 mad-0.069
iri-0.068
 awful-0.067
gh-0.066
Tokenho
Token ID8873
Feature activation-3.02e-01

Pos logit contributions
 All+0.045
 Using+0.038
Use+0.038
 Taking+0.037
Find+0.037
Neg logit contributions
iri-0.029
oto-0.029
 mad-0.028
gh-0.026
 awful-0.026
Tokenpper
Token ID2848
Feature activation-6.40e-02

Pos logit contributions
 mad+0.348
 awful+0.346
oto+0.325
iri+0.301
sing+0.290
Neg logit contributions
 All-0.658
 who-0.583
-0.559
 Using-0.557
es-0.552
Token.
Token ID13
Feature activation+1.35e-02

Pos logit contributions
iri+0.089
oto+0.088
 mad+0.083
anut+0.082
 awful+0.081
Neg logit contributions
 All-0.116
Find-0.103
 Using-0.101
Use-0.101
If-0.100
Token
Token ID198
Feature activation+2.53e-02

Pos logit contributions
 All+0.015
Find+0.015
Use+0.014
Spread+0.014
 who+0.014
Neg logit contributions
oto-0.025
iri-0.025
ax-0.024
 mad-0.024
 awful-0.023
Token
Token ID198
Feature activation+2.60e-02

Pos logit contributions
 All+0.023
 Using+0.016
Use+0.014
 who+0.014
+0.014
Neg logit contributions
oto-0.062
iri-0.060
 mad-0.060
gh-0.058
 awful-0.058
TokenOne
Token ID3198
Feature activation+2.79e-03

Pos logit contributions
 All+0.036
 Using+0.028
 who+0.027
Use+0.026
 Taking+0.026
Neg logit contributions
oto-0.051
iri-0.050
 mad-0.050
 awful-0.049
gh-0.048
Token worked
Token ID3111
Feature activation+5.86e-02

Pos logit contributions
 mad+0.014
oto+0.014
 awful+0.013
iri+0.013
gh+0.013
Neg logit contributions
 All-0.012
 Using-0.010
Use-0.009
Find-0.009
 Taking-0.009
Token hard
Token ID1327
Feature activation-9.64e-02

Pos logit contributions
 All+0.128
 Using+0.107
 who+0.106
Use+0.106
Let+0.105
Neg logit contributions
 mad-0.071
oto-0.069
 awful-0.067
iri-0.066
gh-0.066
Token.
Token ID13
Feature activation+3.28e-03

Pos logit contributions
 mad+0.152
 awful+0.148
gh+0.147
oto+0.140
ong+0.132
Neg logit contributions
 All-0.166
Use-0.139
Find-0.138
Spread-0.137
Let-0.137
Token The
Token ID383
Feature activation+1.02e-01

Pos logit contributions
 All+0.004
Use+0.003
Find+0.003
Spread+0.003
 acts+0.003
Neg logit contributions
oto-0.006
 mad-0.006
iri-0.006
 awful-0.006
aney-0.006
Token grass
Token ID8701
Feature activation+2.09e-02

Pos logit contributions
 All+0.181
 Using+0.154
Use+0.144
 With+0.143
 Taking+0.143
Neg logit contributions
 mad-0.169
oto-0.161
 awful-0.159
iri-0.155
gh-0.148
Tokenho
Token ID8873
Feature activation-3.01e-01

Pos logit contributions
 All+0.021
 Using+0.014
Use+0.013
 Taking+0.013
 who+0.013
Neg logit contributions
 mad-0.049
oto-0.048
 awful-0.048
iri-0.048
gh-0.047
Tokenpper
Token ID2848
Feature activation-7.11e-02

Pos logit contributions
oto+0.406
gh+0.400
 mad+0.399
iri+0.391
 awful+0.374
Neg logit contributions
 All-0.618
 Using-0.512
 who-0.499
 Taking-0.498
Let-0.495
Token was
Token ID373
Feature activation+2.22e-02

Pos logit contributions
 mad+0.133
 awful+0.128
gh+0.120
oto+0.119
sing+0.116
Neg logit contributions
 All-0.099
 Using-0.081
If-0.081
 Taking-0.079
Use-0.078
Token spoiled
Token ID34781
Feature activation-7.35e-02

Pos logit contributions
 All+0.041
 who+0.034
 Using+0.033
Let+0.032
 led+0.032
Neg logit contributions
oto-0.034
iri-0.034
 mad-0.031
gh-0.030
urger-0.030
Token and
Token ID290
Feature activation-2.02e-02

Pos logit contributions
 mad+0.136
oto+0.131
 awful+0.131
iri+0.125
gh+0.125
Neg logit contributions
 All-0.114
 Using-0.089
Find-0.089
Use-0.089
 Taking-0.085
Token didn
Token ID1422
Feature activation-1.27e-02

Pos logit contributions
 mad+0.037
 awful+0.035
oto+0.035
iri+0.034
gh+0.033
Neg logit contributions
 All-0.032
 Using-0.026
Use-0.025
-0.025
Find-0.025
Token in
Token ID287
Feature activation+1.13e-01

Pos logit contributions
 All+0.063
Use+0.057
 Using+0.056
Find+0.054
 Taking+0.053
Neg logit contributions
 mad-0.075
gh-0.073
 awful-0.071
oto-0.068
iri-0.065
Token circles
Token ID13332
Feature activation+7.96e-02

Pos logit contributions
 All+0.135
 Using+0.104
Use+0.102
+0.099
 Taking+0.096
Neg logit contributions
 mad-0.245
oto-0.235
 awful-0.233
iri-0.231
gh-0.228
Token with
Token ID351
Feature activation+1.53e-01

Pos logit contributions
 All+0.121
Use+0.101
 Using+0.100
Find+0.098
 Taking+0.098
Neg logit contributions
oto-0.140
 mad-0.138
gh-0.138
 awful-0.135
iri-0.133
Token the
Token ID262
Feature activation+5.27e-02

Pos logit contributions
 All+0.229
 Using+0.175
 who+0.172
Use+0.169
Let+0.167
Neg logit contributions
oto-0.290
 mad-0.290
iri-0.285
 awful-0.280
gh-0.275
Token grass
Token ID8701
Feature activation+2.05e-02

Pos logit contributions
 All+0.089
 Using+0.070
 who+0.070
Use+0.069
Let+0.068
Neg logit contributions
 mad-0.090
oto-0.090
iri-0.088
 awful-0.086
gh-0.085
Tokenho
Token ID8873
Feature activation-3.01e-01

Pos logit contributions
 All+0.030
 Using+0.024
Use+0.024
Find+0.024
 Taking+0.023
Neg logit contributions
oto-0.037
 mad-0.037
iri-0.037
 awful-0.036
gh-0.036
Tokenpper
Token ID2848
Feature activation-6.47e-02

Pos logit contributions
 mad+0.353
 awful+0.345
oto+0.329
iri+0.315
sing+0.295
Neg logit contributions
 All-0.662
 who-0.576
 Using-0.557
-0.553
Use-0.546
Token.
Token ID13
Feature activation+6.14e-03

Pos logit contributions
 mad+0.106
oto+0.102
 awful+0.102
gh+0.101
iri+0.100
Neg logit contributions
 All-0.103
Use-0.090
If-0.088
 Using-0.087
Find-0.087
Token They
Token ID1119
Feature activation+6.27e-02

Pos logit contributions
 All+0.007
 who+0.007
+0.007
Use+0.007
 acts+0.007
Neg logit contributions
oto-0.011
ax-0.011
 mad-0.011
 awful-0.010
gh-0.010
Token jumped
Token ID11687
Feature activation-7.89e-02

Pos logit contributions
 All+0.100
 Using+0.087
 Taking+0.085
 With+0.083
Use+0.082
Neg logit contributions
 mad-0.107
oto-0.102
iri-0.101
 awful-0.098
gh-0.098
Token and
Token ID290
Feature activation-1.58e-02

Pos logit contributions
iri+0.178
oto+0.177
 mad+0.172
 awful+0.170
urger+0.169
Neg logit contributions
 All-0.097
 who-0.066
 Using-0.064
Let-0.063
 With-0.062
Token so
Token ID523
Feature activation-1.48e-01

Pos logit contributions
 All+0.053
 who+0.043
 Using+0.042
Let+0.041
Use+0.041
Neg logit contributions
oto-0.041
iri-0.040
 mad-0.040
 awful-0.039
gh-0.038
Token happy
Token ID3772
Feature activation-9.39e-02

Pos logit contributions
oto+0.225
iri+0.222
gh+0.216
 awful+0.216
 mad+0.210
Neg logit contributions
 All-0.282
 who-0.231
 Using-0.223
Let-0.222
 Taking-0.220
Token that
Token ID326
Feature activation-1.45e-02

Pos logit contributions
oto+0.143
 mad+0.135
iri+0.123
gh+0.123
 awful+0.122
Neg logit contributions
 All-0.168
Find-0.154
Use-0.153
-0.150
 Using-0.148
Token the
Token ID262
Feature activation+5.55e-02

Pos logit contributions
 mad+0.025
oto+0.024
gh+0.023
 awful+0.023
iri+0.023
Neg logit contributions
 All-0.023
 Using-0.021
Use-0.020
If-0.020
Find-0.020
Token grass
Token ID8701
Feature activation+2.13e-02

Pos logit contributions
 All+0.100
 Using+0.082
 who+0.080
 Taking+0.079
Use+0.078
Neg logit contributions
oto-0.089
 mad-0.089
iri-0.087
 awful-0.085
gh-0.081
Tokenho
Token ID8873
Feature activation-3.00e-01

Pos logit contributions
 All+0.024
 Using+0.018
Use+0.017
If+0.017
Find+0.017
Neg logit contributions
oto-0.047
 mad-0.046
iri-0.045
 awful-0.045
gh-0.044
Tokenpper
Token ID2848
Feature activation-6.40e-02

Pos logit contributions
gh+0.262
oto+0.260
iri+0.255
 mad+0.246
anut+0.226
Neg logit contributions
 All-0.768
 Using-0.659
 Taking-0.648
 Being-0.638
Find-0.636
Token had
Token ID550
Feature activation+8.13e-02

Pos logit contributions
 mad+0.125
 awful+0.122
oto+0.122
iri+0.116
gh+0.115
Neg logit contributions
 All-0.079
If-0.068
 Using-0.066
Find-0.065
Use-0.064
Token said
Token ID531
Feature activation-7.45e-02

Pos logit contributions
 All+0.121
 Using+0.094
 who+0.092
Use+0.091
 Taking+0.089
Neg logit contributions
 mad-0.152
oto-0.152
iri-0.149
 awful-0.148
gh-0.145
Token yes
Token ID3763
Feature activation-6.11e-02

Pos logit contributions
 mad+0.133
oto+0.133
iri+0.131
 awful+0.128
gh+0.125
Neg logit contributions
 All-0.120
 Using-0.092
 who-0.092
Use-0.089
Let-0.089
Token.
Token ID13
Feature activation+1.05e-02

Pos logit contributions
iri+0.132
sing+0.130
 mad+0.129
oto+0.127
arter+0.126
Neg logit contributions
 All-0.081
 who-0.062
 With-0.050
 with-0.048
 Using-0.047
Token.
Token ID13
Feature activation+2.68e-02

Pos logit contributions
oto+0.047
 mad+0.047
iri+0.046
gh+0.045
 awful+0.045
Neg logit contributions
 All-0.034
 who-0.025
 Using-0.024
Let-0.023
 Taking-0.023
Token Thank
Token ID6952
Feature activation-2.43e-02

Pos logit contributions
 All+0.029
Use+0.025
Find+0.025
 Using+0.024
Spread+0.024
Neg logit contributions
 mad-0.053
oto-0.052
iri-0.051
 awful-0.050
gh-0.049
Token you
Token ID345
Feature activation-8.57e-02

Pos logit contributions
 mad+0.035
oto+0.035
iri+0.035
 awful+0.033
gh+0.033
Neg logit contributions
 All-0.047
 Using-0.039
Use-0.038
Find-0.038
 Taking-0.037
Token for
Token ID329
Feature activation+1.11e-04

Pos logit contributions
oto+0.154
iri+0.153
 mad+0.153
 awful+0.150
gh+0.150
Neg logit contributions
 All-0.137
 who-0.111
 With-0.099
 Using-0.098
 Taking-0.097
Token teaching
Token ID7743
Feature activation+9.79e-02

Pos logit contributions
 All+0.000
 Using+0.000
Use+0.000
Let+0.000
Find+0.000
Neg logit contributions
 mad-0.000
oto-0.000
iri-0.000
 awful-0.000
gh-0.000
Token us
Token ID514
Feature activation-3.00e-01

Pos logit contributions
 All+0.208
 who+0.173
 Using+0.170
Let+0.166
 With+0.165
Neg logit contributions
 mad-0.123
oto-0.122
iri-0.122
 awful-0.118
gh-0.115
Token."
Token ID526
Feature activation-8.23e-02

Pos logit contributions
oto+0.603
 mad+0.603
 awful+0.585
iri+0.568
gh+0.555
Neg logit contributions
 All-0.391
Use-0.318
Find-0.315
 Using-0.312
-0.299
Token Mom
Token ID11254
Feature activation-1.60e-01

Pos logit contributions
 mad+0.183
oto+0.182
iri+0.179
 awful+0.169
gh+0.165
Neg logit contributions
 All-0.086
Use-0.065
Find-0.065
 Using-0.062
Spread-0.060
Token smiles
Token ID21845
Feature activation-5.25e-02

Pos logit contributions
 mad+0.255
oto+0.254
iri+0.242
 awful+0.234
gh+0.220
Neg logit contributions
 All-0.289
Find-0.238
 Using-0.236
Use-0.235
-0.232
Token and
Token ID290
Feature activation-4.12e-02

Pos logit contributions
iri+0.110
oto+0.110
gh+0.108
 awful+0.108
 ours+0.105
Neg logit contributions
 All-0.070
 who-0.055
Let-0.048
 Being-0.046
 Using-0.046
Token says
Token ID1139
Feature activation-1.09e-01

Pos logit contributions
 mad+0.059
oto+0.054
 awful+0.052
iri+0.049
 angry+0.046
Neg logit contributions
 All-0.080
If-0.071
Find-0.071
 Using-0.069
-0.069
TokenIt
Token ID1026
Feature activation-6.75e-02

Pos logit contributions
 All+0.121
 Using+0.101
 Taking+0.097
 With+0.095
 who+0.093
Neg logit contributions
oto-0.126
 mad-0.121
iri-0.120
 awful-0.114
gh-0.108
Token is
Token ID318
Feature activation-2.41e-02

Pos logit contributions
 mad+0.143
oto+0.140
 awful+0.139
iri+0.137
gh+0.135
Neg logit contributions
 All-0.084
 Using-0.062
Use-0.062
Find-0.060
Let-0.060
Token not
Token ID407
Feature activation-1.30e-01

Pos logit contributions
unct+0.048
iri+0.047
aney+0.047
oto+0.046
gh+0.046
Neg logit contributions
 All-0.030
 who-0.027
 Be-0.023
 with-0.022
 With-0.022
Token a
Token ID257
Feature activation+2.68e-02

Pos logit contributions
iri+0.278
oto+0.272
gh+0.267
 mad+0.260
 awful+0.256
Neg logit contributions
 All-0.160
 who-0.135
 Using-0.107
 led-0.105
 Stone-0.104
Token grass
Token ID8701
Feature activation+2.09e-02

Pos logit contributions
 All+0.039
 who+0.034
Let+0.031
 Stone+0.029
Find+0.029
Neg logit contributions
iri-0.049
oto-0.049
 awful-0.047
aney-0.047
urg-0.047
Tokenho
Token ID8873
Feature activation-2.98e-01

Pos logit contributions
 All+0.031
 Using+0.024
 who+0.024
Use+0.023
Let+0.023
Neg logit contributions
 mad-0.039
oto-0.039
iri-0.039
 awful-0.038
gh-0.037
Tokenpper
Token ID2848
Feature activation-6.41e-02

Pos logit contributions
 mad+0.309
oto+0.306
 awful+0.299
iri+0.291
gh+0.259
Neg logit contributions
 All-0.686
 who-0.597
 Using-0.585
Use-0.578
-0.574
Token or
Token ID393
Feature activation-5.75e-02

Pos logit contributions
 mad+0.113
oto+0.113
iri+0.111
 awful+0.109
gh+0.107
Neg logit contributions
 All-0.103
 Using-0.080
 who-0.080
Use-0.078
Let-0.077
Token a
Token ID257
Feature activation+2.79e-02

Pos logit contributions
 mad+0.107
 awful+0.103
oto+0.100
iri+0.099
sing+0.098
Neg logit contributions
 All-0.088
 Using-0.072
Use-0.069
If-0.068
 Taking-0.068
Token cricket
Token ID18836
Feature activation+6.86e-02

Pos logit contributions
 All+0.043
 who+0.036
Let+0.033
Find+0.032
Use+0.032
Neg logit contributions
oto-0.050
 mad-0.049
iri-0.049
 awful-0.048
gh-0.047
Token.
Token ID13
Feature activation-1.93e-02

Pos logit contributions
 All+0.122
 Using+0.099
Use+0.096
 who+0.095
Let+0.095
Neg logit contributions
 mad-0.110
oto-0.109
iri-0.106
 awful-0.105
gh-0.103
Token but
Token ID475
Feature activation+4.92e-02

Pos logit contributions
oto+0.034
iri+0.034
 mad+0.033
gh+0.033
 awful+0.033
Neg logit contributions
 All-0.024
 who-0.018
 Using-0.017
Let-0.017
 Taking-0.017
Token asked
Token ID1965
Feature activation+2.77e-02

Pos logit contributions
 All+0.079
 Using+0.067
Find+0.066
Use+0.065
ures+0.064
Neg logit contributions
 mad-0.090
 awful-0.081
oto-0.079
iri-0.076
 angry-0.075
Token the
Token ID262
Feature activation+5.04e-02

Pos logit contributions
 All+0.046
Use+0.040
Find+0.040
 Using+0.039
Join+0.038
Neg logit contributions
 mad-0.047
oto-0.046
gh-0.044
 awful-0.044
iri-0.043
Token old
Token ID1468
Feature activation+1.21e-02

Pos logit contributions
 All+0.080
 Using+0.067
Use+0.062
 With+0.061
 Taking+0.061
Neg logit contributions
 mad-0.093
oto-0.089
 awful-0.088
iri-0.084
sing-0.081
Token man
Token ID582
Feature activation+4.44e-02

Pos logit contributions
 All+0.026
 Using+0.022
Use+0.022
+0.021
Find+0.021
Neg logit contributions
 mad-0.016
oto-0.016
 awful-0.014
iri-0.014
 angry-0.012
Token if
Token ID611
Feature activation+1.69e-01

Pos logit contributions
 All+0.071
Use+0.061
Find+0.061
 Using+0.059
Let+0.057
Neg logit contributions
 mad-0.080
oto-0.075
 awful-0.072
iri-0.071
gh-0.069
Token he
Token ID339
Feature activation+1.71e-03

Pos logit contributions
 All+0.284
 Using+0.259
 With+0.235
 Taking+0.233
Use+0.232
Neg logit contributions
 mad-0.281
oto-0.269
 awful-0.259
iri-0.252
gh-0.250
Token was
Token ID373
Feature activation+2.37e-02

Pos logit contributions
 All+0.002
 Using+0.002
 Taking+0.002
Find+0.002
+0.002
Neg logit contributions
 mad-0.003
oto-0.003
 awful-0.003
iri-0.003
gh-0.003
Token okay
Token ID8788
Feature activation-9.86e-02

Pos logit contributions
 All+0.045
 who+0.038
 Using+0.037
Use+0.036
Let+0.036
Neg logit contributions
oto-0.035
iri-0.034
 mad-0.033
 awful-0.033
gh-0.032
Token.
Token ID13
Feature activation-5.87e-03

Pos logit contributions
 mad+0.190
oto+0.188
iri+0.185
 awful+0.182
gh+0.179
Neg logit contributions
 All-0.144
 Using-0.111
Use-0.108
 who-0.106
Find-0.106
Token
Token ID198
Feature activation+9.41e-03

Pos logit contributions
 mad+0.012
oto+0.012
iri+0.012
 awful+0.012
ax+0.011
Neg logit contributions
 All-0.006
Use-0.005
Find-0.005
Spread-0.005
 who-0.004
Token.
Token ID13
Feature activation+3.35e-04

Pos logit contributions
 All+0.036
 who+0.026
 Using+0.026
Let+0.024
 With+0.024
Neg logit contributions
oto-0.061
iri-0.060
 mad-0.059
 awful-0.058
gh-0.057
Token Inside
Token ID14384
Feature activation+1.59e-01

Pos logit contributions
 All+0.000
 who+0.000
Use+0.000
+0.000
If+0.000
Neg logit contributions
 mad-0.001
 awful-0.001
oto-0.001
sing-0.000
ige-0.000
Token the
Token ID262
Feature activation+4.58e-02

Pos logit contributions
 All+0.208
 Using+0.167
Use+0.165
Find+0.164
 who+0.158
Neg logit contributions
 mad-0.312
oto-0.309
gh-0.303
iri-0.301
 awful-0.300
Token box
Token ID3091
Feature activation+1.25e-01

Pos logit contributions
 All+0.067
 Using+0.052
Use+0.051
If+0.050
 Taking+0.050
Neg logit contributions
 mad-0.087
 awful-0.085
oto-0.083
gh-0.081
iri-0.081
Token,
Token ID11
Feature activation-1.85e-02

Pos logit contributions
 All+0.161
Use+0.134
Find+0.134
If+0.132
 Using+0.129
Neg logit contributions
 mad-0.237
 awful-0.231
gh-0.231
oto-0.231
iri-0.222
Token there
Token ID612
Feature activation+1.60e-01

Pos logit contributions
 mad+0.030
 ours+0.027
 awful+0.026
oto+0.025
 angry+0.025
Neg logit contributions
 All-0.032
-0.030
pill-0.029
 Using-0.029
If-0.028
Token were
Token ID547
Feature activation+2.41e-02

Pos logit contributions
 All+0.148
 Using+0.118
Use+0.113
+0.112
Find+0.112
Neg logit contributions
 mad-0.371
oto-0.362
 awful-0.361
gh-0.361
iri-0.352
Token many
Token ID867
Feature activation+8.03e-02

Pos logit contributions
 All+0.032
 Using+0.025
Use+0.024
+0.024
 who+0.023
Neg logit contributions
 mad-0.048
oto-0.048
 awful-0.048
iri-0.047
gh-0.046
Token toys
Token ID14958
Feature activation+1.49e-01

Pos logit contributions
 All+0.145
+0.126
 Using+0.121
Use+0.119
If+0.119
Neg logit contributions
 mad-0.134
 awful-0.128
 angry-0.114
oto-0.114
 sour-0.111
Token for
Token ID329
Feature activation+2.54e-02

Pos logit contributions
 All+0.243
 Using+0.197
If+0.194
+0.194
Use+0.194
Neg logit contributions
 mad-0.258
oto-0.248
 awful-0.245
gh-0.240
 ours-0.232
Token Tom
Token ID4186
Feature activation-1.20e-01

Pos logit contributions
 All+0.040
 Using+0.032
Use+0.031
Let+0.031
 who+0.031
Neg logit contributions
 mad-0.046
oto-0.045
iri-0.044
 awful-0.044
gh-0.043
Token it
Token ID340
Feature activation-7.53e-02

Pos logit contributions
 mad+0.103
 awful+0.098
gh+0.096
oto+0.096
iri+0.088
Neg logit contributions
 All-0.109
-0.099
Use-0.095
 Using-0.094
 Being-0.093
Token.
Token ID13
Feature activation+1.47e-02

Pos logit contributions
 mad+0.123
 awful+0.118
gh+0.116
oto+0.115
sing+0.111
Neg logit contributions
 All-0.126
Use-0.106
Find-0.103
-0.103
If-0.102
Token
Token ID198
Feature activation+2.73e-02

Pos logit contributions
 All+0.016
Find+0.016
Spread+0.015
 who+0.015
es+0.015
Neg logit contributions
 mad-0.026
 awful-0.025
ax-0.025
oto-0.025
gh-0.024
Token
Token ID198
Feature activation+2.82e-02

Pos logit contributions
 All+0.025
 Using+0.016
 who+0.016
Use+0.016
+0.015
Neg logit contributions
oto-0.065
 mad-0.065
iri-0.063
gh-0.063
 awful-0.063
TokenBut
Token ID1537
Feature activation+4.71e-02

Pos logit contributions
 All+0.046
 Using+0.037
 who+0.036
 Stone+0.036
 acts+0.036
Neg logit contributions
 mad-0.047
oto-0.046
 awful-0.046
gh-0.045
iri-0.044
Token when
Token ID618
Feature activation+1.71e-01

Pos logit contributions
 All+0.076
Find+0.066
Use+0.066
pill+0.065
Join+0.064
Neg logit contributions
 mad-0.080
 awful-0.076
sing-0.073
gh-0.072
oto-0.071
Token Jack
Token ID3619
Feature activation-6.84e-02

Pos logit contributions
 All+0.268
 Using+0.216
 who+0.215
Use+0.211
Find+0.209
Neg logit contributions
 mad-0.306
oto-0.293
 awful-0.293
iri-0.287
gh-0.286
Token arrived
Token ID5284
Feature activation-5.98e-02

Pos logit contributions
 mad+0.132
iri+0.128
oto+0.128
gh+0.124
 awful+0.124
Neg logit contributions
 All-0.098
 Using-0.075
 Being-0.074
Use-0.073
 Taking-0.072
Token,
Token ID11
Feature activation-5.63e-03

Pos logit contributions
oto+0.106
 mad+0.105
iri+0.104
 awful+0.102
gh+0.100
Neg logit contributions
 All-0.097
 Using-0.076
 who-0.075
Use-0.073
Let-0.073
Token he
Token ID339
Feature activation+1.44e-02

Pos logit contributions
 mad+0.009
 awful+0.009
 angry+0.008
gh+0.007
oto+0.007
Neg logit contributions
 All-0.009
Find-0.009
-0.008
Use-0.008
 Using-0.008
Token saw
Token ID2497
Feature activation+6.34e-03

Pos logit contributions
 All+0.023
 Using+0.018
Use+0.017
 Taking+0.017
Find+0.017
Neg logit contributions
 mad-0.026
oto-0.025
iri-0.025
 awful-0.025
gh-0.025
Token to
Token ID284
Feature activation-7.45e-02

Pos logit contributions
 All+0.047
 Using+0.038
Use+0.037
 who+0.036
Let+0.036
Neg logit contributions
 mad-0.045
oto-0.045
iri-0.044
 awful-0.043
gh-0.043
Token explore
Token ID7301
Feature activation+9.62e-02

Pos logit contributions
 mad+0.127
oto+0.124
iri+0.121
gh+0.119
 awful+0.118
Neg logit contributions
 All-0.122
 Using-0.104
Use-0.100
 Taking-0.099
Let-0.098
Token.
Token ID13
Feature activation+6.90e-03

Pos logit contributions
 All+0.170
 Using+0.145
Use+0.144
Find+0.143
Let+0.137
Neg logit contributions
 mad-0.148
 awful-0.146
oto-0.141
 ours-0.141
gh-0.135
Token But
Token ID887
Feature activation+7.73e-02

Pos logit contributions
pen+0.009
 led+0.009
 puzzles+0.009
es+0.009
 ties+0.008
Neg logit contributions
sing-0.009
 ours-0.009
oto-0.009
ife-0.009
 okay-0.009
Token only
Token ID691
Feature activation+1.49e-02

Pos logit contributions
 All+0.145
pill+0.133
 Using+0.131
Use+0.127
If+0.126
Neg logit contributions
 mad-0.103
 mine-0.098
oto-0.097
 awful-0.096
 ours-0.094
Token if
Token ID611
Feature activation+1.61e-01

Pos logit contributions
 All+0.024
pill+0.021
 Using+0.021
Use+0.021
Find+0.020
Neg logit contributions
 mad-0.026
 mine-0.024
 ours-0.024
 awful-0.023
sing-0.023
Token you
Token ID345
Feature activation-5.71e-02

Pos logit contributions
 All+0.317
 Using+0.291
Find+0.274
Use+0.270
 Taking+0.268
Neg logit contributions
 mad-0.207
 awful-0.204
oto-0.198
gh-0.196
iri-0.191
Token promise
Token ID6991
Feature activation+8.74e-03

Pos logit contributions
 mad+0.096
oto+0.092
iri+0.090
 awful+0.089
gh+0.085
Neg logit contributions
 All-0.095
 Using-0.081
 Taking-0.079
Let-0.078
 Being-0.077
Token to
Token ID284
Feature activation-7.65e-02

Pos logit contributions
 All+0.016
 Using+0.013
Use+0.013
Find+0.013
 Taking+0.013
Neg logit contributions
 mad-0.013
oto-0.013
iri-0.013
 awful-0.012
gh-0.012
Token be
Token ID307
Feature activation-4.14e-02

Pos logit contributions
oto+0.114
iri+0.110
 mad+0.110
gh+0.106
 awful+0.103
Neg logit contributions
 All-0.144
 Using-0.123
 Taking-0.119
Use-0.119
Find-0.117
Token very
Token ID845
Feature activation-2.09e-01

Pos logit contributions
 mad+0.053
 awful+0.048
oto+0.047
iri+0.045
gh+0.044
Neg logit contributions
 All-0.091
 Using-0.076
Use-0.076
Find-0.075
Let-0.074
Token butterfly
Token ID35113
Feature activation+4.98e-02

Pos logit contributions
 All+0.104
 Using+0.089
 Taking+0.084
Use+0.084
 With+0.084
Neg logit contributions
 mad-0.084
oto-0.081
 awful-0.080
iri-0.079
ige-0.073
Token.
Token ID13
Feature activation+6.77e-03

Pos logit contributions
 All+0.080
Use+0.066
 Using+0.065
Find+0.064
Let+0.064
Neg logit contributions
 mad-0.087
oto-0.085
iri-0.083
 awful-0.082
gh-0.080
Token Joe
Token ID5689
Feature activation-1.34e-01

Pos logit contributions
 All+0.008
Use+0.007
Find+0.007
 who+0.007
Spread+0.007
Neg logit contributions
oto-0.012
 mad-0.012
iri-0.012
 awful-0.012
ax-0.011
Token asked
Token ID1965
Feature activation+3.41e-02

Pos logit contributions
 mad+0.238
oto+0.226
iri+0.226
 awful+0.209
gh+0.202
Neg logit contributions
 All-0.207
Use-0.171
 Using-0.170
Find-0.167
 Taking-0.163
Token,
Token ID11
Feature activation-5.44e-03

Pos logit contributions
 All+0.058
Use+0.049
Find+0.049
 Using+0.048
+0.047
Neg logit contributions
oto-0.057
 mad-0.055
iri-0.054
gh-0.053
 awful-0.052
Token "
Token ID366
Feature activation+1.32e-01

Pos logit contributions
 mad+0.011
oto+0.011
iri+0.011
 awful+0.010
gh+0.010
Neg logit contributions
 All-0.007
Use-0.005
 Using-0.005
Find-0.005
If-0.005
TokenHello
Token ID15496
Feature activation-1.17e-01

Pos logit contributions
 All+0.168
 Using+0.127
 With+0.118
 Taking+0.117
 Flying+0.112
Neg logit contributions
oto-0.285
iri-0.281
 mad-0.275
 awful-0.261
gh-0.260
Token butterfly
Token ID35113
Feature activation+5.14e-02

Pos logit contributions
 mad+0.187
oto+0.185
iri+0.182
 awful+0.175
gh+0.169
Neg logit contributions
 All-0.206
 Using-0.175
Use-0.172
Find-0.165
Let-0.164
Token,
Token ID11
Feature activation+7.71e-04

Pos logit contributions
 All+0.093
 Using+0.078
Use+0.077
Find+0.076
Let+0.075
Neg logit contributions
oto-0.080
 mad-0.079
iri-0.078
 awful-0.075
gh-0.073
Token can
Token ID460
Feature activation+8.38e-02

Pos logit contributions
 All+0.001
 Using+0.001
Use+0.001
If+0.001
Find+0.001
Neg logit contributions
oto-0.001
 mad-0.001
 awful-0.001
iri-0.001
gh-0.001
Token you
Token ID345
Feature activation-4.47e-02

Pos logit contributions
 All+0.131
 Using+0.112
Use+0.106
Let+0.103
 Taking+0.103
Neg logit contributions
 mad-0.146
oto-0.144
iri-0.140
 awful-0.140
gh-0.138
Token was
Token ID373
Feature activation+3.41e-02

Pos logit contributions
 mad+0.055
oto+0.054
iri+0.053
 awful+0.053
gh+0.052
Neg logit contributions
 All-0.024
 Using-0.016
Use-0.016
 who-0.015
Find-0.015
Token scared
Token ID12008
Feature activation-3.93e-02

Pos logit contributions
 All+0.062
 who+0.050
 Using+0.049
Let+0.048
 Taking+0.048
Neg logit contributions
oto-0.054
iri-0.053
 mad-0.052
 awful-0.051
gh-0.051
Token at
Token ID379
Feature activation+1.11e-02

Pos logit contributions
iri+0.089
oto+0.088
gh+0.088
 mad+0.086
ige+0.085
Neg logit contributions
 All-0.048
 who-0.036
 Using-0.029
 Taking-0.029
 With-0.029
Token first
Token ID717
Feature activation-5.55e-02

Pos logit contributions
 All+0.020
 who+0.016
 Using+0.015
Let+0.015
Use+0.015
Neg logit contributions
iri-0.018
oto-0.018
 awful-0.018
 mad-0.018
gh-0.018
Token,
Token ID11
Feature activation-1.03e-02

Pos logit contributions
 mad+0.082
oto+0.078
gh+0.076
iri+0.074
 awful+0.072
Neg logit contributions
 All-0.098
Use-0.095
Find-0.091
 Using-0.091
-0.089
Token but
Token ID475
Feature activation+5.88e-02

Pos logit contributions
oto+0.022
iri+0.022
 mad+0.021
gh+0.021
 awful+0.021
Neg logit contributions
 All-0.014
 who-0.010
 Using-0.010
Let-0.009
 Taking-0.009
Token then
Token ID788
Feature activation+5.11e-02

Pos logit contributions
 All+0.089
 Using+0.070
Use+0.068
Find+0.066
 who+0.066
Neg logit contributions
 mad-0.112
oto-0.110
iri-0.108
 awful-0.106
gh-0.103
Token he
Token ID339
Feature activation+8.85e-03

Pos logit contributions
 All+0.078
 Using+0.065
Use+0.062
 Taking+0.061
Find+0.061
Neg logit contributions
 mad-0.096
oto-0.092
iri-0.090
 awful-0.089
gh-0.089
Token got
Token ID1392
Feature activation+2.43e-03

Pos logit contributions
 All+0.010
 who+0.008
 led+0.008
Let+0.007
Will+0.007
Neg logit contributions
oto-0.020
 ours-0.019
iri-0.019
arter-0.019
 awful-0.019
Token excited
Token ID6568
Feature activation-5.44e-02

Pos logit contributions
 All+0.004
 who+0.004
Let+0.004
 Using+0.004
Use+0.003
Neg logit contributions
oto-0.004
iri-0.004
gh-0.004
 awful-0.004
 mad-0.004
Token and
Token ID290
Feature activation-1.12e-02

Pos logit contributions
iri+0.127
 mad+0.121
oto+0.120
gh+0.119
ige+0.119
Neg logit contributions
 All-0.064
 who-0.049
 Using-0.040
 With-0.039
Let-0.038
Token and
Token ID290
Feature activation-3.56e-02

Pos logit contributions
 All+0.082
 Using+0.065
Use+0.064
Find+0.064
If+0.062
Neg logit contributions
 mad-0.098
oto-0.096
 awful-0.094
iri-0.094
gh-0.090
Token the
Token ID262
Feature activation+3.80e-02

Pos logit contributions
 mad+0.065
oto+0.062
 awful+0.060
iri+0.060
 ours+0.058
Neg logit contributions
 All-0.058
 Using-0.047
Use-0.046
If-0.045
Find-0.044
Token reef
Token ID25088
Feature activation+5.72e-02

Pos logit contributions
 All+0.071
 Using+0.059
Use+0.056
 With+0.056
 Taking+0.056
Neg logit contributions
 mad-0.060
oto-0.057
 awful-0.057
iri-0.056
gh-0.053
Token.
Token ID13
Feature activation-2.16e-02

Pos logit contributions
 All+0.093
Use+0.075
 Using+0.075
Find+0.074
If+0.074
Neg logit contributions
 mad-0.101
 awful-0.098
oto-0.097
iri-0.095
gh-0.092
Token She
Token ID1375
Feature activation+1.19e-02

Pos logit contributions
oto+0.050
 mad+0.050
iri+0.050
 awful+0.048
gh+0.047
Neg logit contributions
 All-0.021
 Using-0.015
Use-0.014
 who-0.013
Find-0.013
Token was
Token ID373
Feature activation+8.09e-03

Pos logit contributions
 All+0.021
 Using+0.018
+0.017
 Taking+0.017
If+0.017
Neg logit contributions
oto-0.020
 mad-0.019
iri-0.019
 awful-0.018
ige-0.017
Token happy
Token ID3772
Feature activation-1.05e-01

Pos logit contributions
 All+0.016
 who+0.014
 Using+0.013
 Taking+0.013
Let+0.013
Neg logit contributions
oto-0.011
iri-0.011
gh-0.011
 awful-0.010
urger-0.010
Token that
Token ID326
Feature activation-3.05e-02

Pos logit contributions
oto+0.187
 mad+0.186
iri+0.181
 awful+0.176
gh+0.171
Neg logit contributions
 All-0.169
 Using-0.136
Use-0.134
Find-0.131
Let-0.129
Token they
Token ID484
Feature activation-4.48e-03

Pos logit contributions
 mad+0.053
oto+0.052
iri+0.052
 awful+0.050
gh+0.050
Neg logit contributions
 All-0.049
 Using-0.042
Use-0.040
Find-0.039
-0.038
Token had
Token ID550
Feature activation+6.35e-02

Pos logit contributions
 mad+0.009
oto+0.009
 awful+0.009
iri+0.008
ige+0.008
Neg logit contributions
 All-0.006
 Taking-0.005
 Using-0.005
 With-0.005
 Flying-0.005
Token fun
Token ID1257
Feature activation-1.82e-01

Pos logit contributions
 All+0.097
 Using+0.078
Use+0.076
 Taking+0.076
+0.074
Neg logit contributions
 mad-0.118
oto-0.115
iri-0.112
 awful-0.112
gh-0.111
Token hot
Token ID3024
Feature activation-1.74e-01

Pos logit contributions
oto+0.298
gh+0.286
iri+0.284
 awful+0.275
urger+0.273
Neg logit contributions
 All-0.362
 who-0.316
 Taking-0.292
Let-0.286
 Using-0.284
Token and
Token ID290
Feature activation-3.30e-03

Pos logit contributions
 mad+0.344
oto+0.344
iri+0.337
 awful+0.331
gh+0.323
Neg logit contributions
 All-0.243
 Using-0.184
Use-0.179
 who-0.176
Find-0.175
Token thirsty
Token ID47124
Feature activation-5.23e-02

Pos logit contributions
oto+0.006
 mad+0.006
iri+0.006
 awful+0.006
gh+0.006
Neg logit contributions
 All-0.005
 who-0.004
 Using-0.004
Let-0.004
Use-0.004
Token.
Token ID13
Feature activation+1.92e-02

Pos logit contributions
 mad+0.099
oto+0.098
iri+0.096
 awful+0.096
gh+0.093
Neg logit contributions
 All-0.077
 Using-0.060
Use-0.058
Find-0.058
 Taking-0.057
Token They
Token ID1119
Feature activation+7.74e-02

Pos logit contributions
 All+0.021
Use+0.015
Find+0.015
 Using+0.015
 who+0.014
Neg logit contributions
oto-0.042
 mad-0.042
iri-0.042
 awful-0.040
sing-0.040
Token had
Token ID550
Feature activation+9.50e-02

Pos logit contributions
 All+0.089
 Using+0.065
Use+0.060
Find+0.060
 Taking+0.060
Neg logit contributions
 mad-0.173
oto-0.169
iri-0.168
 awful-0.167
gh-0.160
Token been
Token ID587
Feature activation-7.47e-02

Pos logit contributions
 All+0.102
 who+0.073
Let+0.073
Use+0.064
 Using+0.063
Neg logit contributions
oto-0.222
iri-0.217
 mad-0.216
 awful-0.211
gh-0.210
Token walking
Token ID6155
Feature activation+4.21e-02

Pos logit contributions
oto+0.139
iri+0.135
 awful+0.130
 mad+0.130
gh+0.129
Neg logit contributions
 All-0.115
 who-0.094
Let-0.088
 Using-0.087
 Taking-0.085
Token in
Token ID287
Feature activation+1.28e-01

Pos logit contributions
 All+0.056
 Using+0.044
Use+0.043
Find+0.042
 Taking+0.041
Neg logit contributions
 mad-0.085
oto-0.084
iri-0.083
 awful-0.080
gh-0.078
Token the
Token ID262
Feature activation+6.26e-02

Pos logit contributions
 All+0.180
 Using+0.144
Use+0.142
Find+0.142
+0.139
Neg logit contributions
 mad-0.253
oto-0.236
iri-0.231
 awful-0.230
 ours-0.223
Token sand
Token ID6450
Feature activation-9.95e-02

Pos logit contributions
 All+0.079
 who+0.055
Let+0.054
+0.050
Use+0.049
Neg logit contributions
oto-0.138
gh-0.137
iri-0.136
 mad-0.134
ige-0.131
Token boy
Token ID2933
Feature activation+1.47e-01

Pos logit contributions
 All+0.055
Find+0.046
Use+0.046
Spread+0.044
 Using+0.044
Neg logit contributions
 mad-0.058
oto-0.056
iri-0.053
 awful-0.052
 angry-0.050
Token with
Token ID351
Feature activation+1.83e-01

Pos logit contributions
 All+0.203
Use+0.173
Find+0.171
If+0.163
Spread+0.160
Neg logit contributions
 mad-0.280
gh-0.265
iri-0.261
oto-0.257
 awful-0.254
Token a
Token ID257
Feature activation+6.85e-02

Pos logit contributions
 All+0.280
Use+0.235
 Using+0.232
+0.232
 Taking+0.226
Neg logit contributions
 mad-0.325
oto-0.313
 awful-0.311
iri-0.304
gh-0.301
Token big
Token ID1263
Feature activation-7.77e-02

Pos logit contributions
 All+0.131
 who+0.107
 Using+0.106
Let+0.104
Use+0.103
Neg logit contributions
oto-0.101
 mad-0.099
iri-0.099
 awful-0.096
gh-0.093
Token dream
Token ID4320
Feature activation-1.16e-02

Pos logit contributions
oto+0.131
iri+0.129
 mad+0.126
 awful+0.125
gh+0.122
Neg logit contributions
 All-0.133
 who-0.105
 Using-0.104
 With-0.101
Use-0.100
Token.
Token ID13
Feature activation+1.60e-02

Pos logit contributions
oto+0.020
gh+0.020
 mad+0.020
 awful+0.020
iri+0.019
Neg logit contributions
 All-0.018
Use-0.015
 Using-0.015
Find-0.015
Let-0.015
Token He
Token ID679
Feature activation+4.31e-02

Pos logit contributions
Use+0.019
es+0.018
 All+0.018
 led+0.018
Find+0.017
Neg logit contributions
oto-0.028
iri-0.027
 mad-0.025
aney-0.025
ax-0.025
Token wanted
Token ID2227
Feature activation+3.70e-02

Pos logit contributions
 All+0.057
 Using+0.046
Use+0.045
Find+0.044
If+0.044
Neg logit contributions
 mad-0.085
oto-0.082
 awful-0.081
iri-0.081
gh-0.080
Token to
Token ID284
Feature activation-6.06e-02

Pos logit contributions
 All+0.063
 Using+0.050
 who+0.049
Use+0.049
Let+0.048
Neg logit contributions
 mad-0.062
oto-0.062
iri-0.061
 awful-0.060
gh-0.059
Token have
Token ID423
Feature activation+1.51e-01

Pos logit contributions
 mad+0.102
oto+0.102
iri+0.100
gh+0.099
 awful+0.096
Neg logit contributions
 All-0.096
 Using-0.083
Use-0.081
Let-0.081
 Taking-0.080
Token a
Token ID257
Feature activation+5.11e-02

Pos logit contributions
 All+0.217
 Using+0.165
 who+0.162
Use+0.159
Let+0.157
Neg logit contributions
 mad-0.292
oto-0.291
iri-0.286
 awful-0.282
gh-0.277
Token inside
Token ID2641
Feature activation+4.65e-02

Pos logit contributions
 awful+0.111
iri+0.109
ige+0.109
oto+0.106
gh+0.103
Neg logit contributions
 All-0.079
 who-0.072
 with-0.062
Let-0.061
 led-0.059
Token and
Token ID290
Feature activation-1.69e-02

Pos logit contributions
 All+0.062
 Using+0.046
Use+0.044
 who+0.044
Let+0.043
Neg logit contributions
 mad-0.095
oto-0.095
iri-0.093
 awful-0.092
gh-0.090
Token drink
Token ID4144
Feature activation+3.69e-02

Pos logit contributions
 mad+0.031
oto+0.030
iri+0.030
 awful+0.030
gh+0.029
Neg logit contributions
 All-0.026
 Using-0.020
Use-0.020
 Taking-0.019
Find-0.019
Token hot
Token ID3024
Feature activation-1.86e-01

Pos logit contributions
 All+0.041
 who+0.029
Let+0.028
 with+0.025
 led+0.024
Neg logit contributions
iri-0.087
oto-0.085
ige-0.083
sing-0.083
gh-0.082
Token cocoa
Token ID35845
Feature activation-1.32e-01

Pos logit contributions
iri+0.338
 mad+0.336
oto+0.332
 awful+0.325
gh+0.323
Neg logit contributions
 All-0.294
 who-0.229
Let-0.222
 Using-0.214
 With-0.212
Token.
Token ID13
Feature activation+3.49e-03

Pos logit contributions
iri+0.312
 mad+0.308
oto+0.306
 awful+0.296
ige+0.290
Neg logit contributions
 All-0.141
 who-0.100
Let-0.086
 Using-0.085
 with-0.083
Token
Token ID198
Feature activation+1.75e-02

Pos logit contributions
 All+0.004
Use+0.003
Find+0.003
 Using+0.003
 who+0.003
Neg logit contributions
 mad-0.007
iri-0.007
oto-0.007
 awful-0.007
gh-0.007
Token
Token ID198
Feature activation+1.82e-02

Pos logit contributions
 All+0.015
 Using+0.009
Use+0.009
 who+0.009
Find+0.008
Neg logit contributions
oto-0.043
 mad-0.043
iri-0.043
 awful-0.042
gh-0.042
Token"
Token ID1
Feature activation+9.04e-02

Pos logit contributions
 All+0.029
 Using+0.023
Use+0.022
Find+0.021
 who+0.021
Neg logit contributions
 mad-0.032
iri-0.032
oto-0.032
 awful-0.031
gh-0.030
TokenLet
Token ID5756
Feature activation+9.84e-02

Pos logit contributions
 All+0.134
 Using+0.102
 who+0.100
Use+0.097
 Taking+0.097
Neg logit contributions
 mad-0.173
oto-0.173
iri-0.170
 awful-0.167
gh-0.164
Token's
Token ID338
Feature activation+1.20e-02

Pos logit contributions
 All+0.051
 was+0.021
 led+0.010
 who+0.007
's+0.006
Neg logit contributions
sing-0.320
alo-0.315
arter-0.314
iri-0.301
oto-0.301
Token off
Token ID572
Feature activation-1.10e-01

Pos logit contributions
oto+0.240
 mad+0.238
iri+0.236
 awful+0.231
gh+0.226
Neg logit contributions
 All-0.234
 who-0.184
 Using-0.182
Let-0.179
Use-0.175
Token.
Token ID13
Feature activation-1.55e-03

Pos logit contributions
 mad+0.214
oto+0.212
iri+0.208
 awful+0.207
gh+0.202
Neg logit contributions
 All-0.157
 Using-0.120
Use-0.118
 who-0.117
Let-0.115
Token After
Token ID2293
Feature activation+1.16e-01

Pos logit contributions
oto+0.002
ax+0.002
iri+0.002
 mad+0.002
 awful+0.002
Neg logit contributions
 acts-0.002
Use-0.002
Spread-0.002
Find-0.002
es-0.002
Token the
Token ID262
Feature activation+5.52e-02

Pos logit contributions
 All+0.153
 Using+0.130
Use+0.125
+0.123
Find+0.122
Neg logit contributions
 mad-0.226
oto-0.224
 awful-0.220
iri-0.218
gh-0.215
Token bird
Token ID6512
Feature activation+4.98e-02

Pos logit contributions
 All+0.106
 who+0.086
 Using+0.085
Let+0.084
Use+0.084
Neg logit contributions
 mad-0.081
oto-0.081
iri-0.080
 awful-0.077
gh-0.076
Token dried
Token ID16577
Feature activation-3.65e-02

Pos logit contributions
 All+0.060
 Using+0.042
 who+0.042
Use+0.041
Let+0.040
Neg logit contributions
oto-0.108
 mad-0.108
iri-0.107
 awful-0.105
gh-0.103
Token off
Token ID572
Feature activation-1.13e-01

Pos logit contributions
oto+0.058
 mad+0.058
iri+0.057
 awful+0.056
gh+0.055
Neg logit contributions
 All-0.066
 who-0.052
 Using-0.052
Let-0.050
 With-0.050
Token,
Token ID11
Feature activation-1.37e-02

Pos logit contributions
 mad+0.205
oto+0.202
iri+0.198
 awful+0.196
gh+0.194
Neg logit contributions
 All-0.175
 Using-0.138
Use-0.137
 who-0.134
Find-0.134
Token he
Token ID339
Feature activation+6.44e-03

Pos logit contributions
 mad+0.022
 awful+0.020
 angry+0.019
oto+0.019
iri+0.019
Neg logit contributions
 All-0.021
 Using-0.021
If-0.021
Use-0.020
-0.020
Token felt
Token ID2936
Feature activation+5.84e-02

Pos logit contributions
 All+0.010
 Using+0.008
Use+0.007
 Taking+0.007
If+0.007
Neg logit contributions
 mad-0.012
oto-0.012
iri-0.012
 awful-0.011
gh-0.011
Token so
Token ID523
Feature activation-1.54e-01

Pos logit contributions
 All+0.116
 who+0.095
 Using+0.092
Let+0.091
Use+0.091
Neg logit contributions
oto-0.086
iri-0.084
 mad-0.081
gh-0.079
sing-0.077
Token.
Token ID13
Feature activation-2.75e-02

Pos logit contributions
iri+0.229
arter+0.215
oto+0.209
ulp+0.209
urg+0.208
Neg logit contributions
 All-0.113
 who-0.105
 with-0.100
 every-0.087
 was-0.083
Token We
Token ID775
Feature activation+8.08e-02

Pos logit contributions
 mad+0.055
 awful+0.054
oto+0.054
iri+0.053
sing+0.052
Neg logit contributions
 All-0.035
 Using-0.027
Use-0.026
 led-0.025
Find-0.025
Token want
Token ID765
Feature activation+1.84e-02

Pos logit contributions
 All+0.124
 Using+0.102
 Taking+0.098
 With+0.096
 Flying+0.094
Neg logit contributions
 mad-0.154
 awful-0.143
oto-0.143
iri-0.141
ige-0.136
Token it
Token ID340
Feature activation-9.99e-02

Pos logit contributions
 All+0.024
 who+0.019
Let+0.017
 Using+0.016
 Being+0.016
Neg logit contributions
iri-0.039
oto-0.038
 mad-0.037
gh-0.037
 awful-0.036
Token,"
Token ID553
Feature activation-1.77e-02

Pos logit contributions
iri+0.232
arter+0.222
ulp+0.214
oto+0.213
rim+0.213
Neg logit contributions
 All-0.112
 who-0.102
 with-0.100
 every-0.088
 was-0.081
Token Lily
Token ID20037
Feature activation-3.67e-02

Pos logit contributions
 mad+0.031
oto+0.031
iri+0.030
 awful+0.029
gh+0.028
Neg logit contributions
 All-0.029
 Using-0.023
Use-0.023
Find-0.022
 who-0.022
Token and
Token ID290
Feature activation-3.97e-02

Pos logit contributions
iri+0.061
oto+0.061
 mad+0.061
gh+0.060
 awful+0.060
Neg logit contributions
 All-0.063
 who-0.049
 Using-0.049
Use-0.047
 Taking-0.047
Token Ben
Token ID3932
Feature activation-6.32e-02

Pos logit contributions
gh+0.083
iri+0.082
oto+0.081
 awful+0.081
 mad+0.080
Neg logit contributions
 All-0.053
 who-0.040
 Using-0.038
Let-0.037
Use-0.036
Token say
Token ID910
Feature activation-8.28e-02

Pos logit contributions
iri+0.090
oto+0.090
gh+0.089
 mad+0.087
 awful+0.086
Neg logit contributions
 All-0.124
 who-0.102
 Using-0.100
Use-0.097
Let-0.096
Token.
Token ID13
Feature activation-2.85e-02

Pos logit contributions
 mad+0.156
oto+0.152
 awful+0.148
iri+0.148
gh+0.148
Neg logit contributions
 All-0.124
 Using-0.096
Use-0.096
Find-0.095
Let-0.093
Token
Token ID198
Feature activation-1.12e-02

Pos logit contributions
 mad+0.073
oto+0.073
iri+0.072
 awful+0.071
gh+0.070
Neg logit contributions
 All-0.023
 Using-0.013
 who-0.012
Use-0.012
Let-0.011
Token One
Token ID1881
Feature activation+6.28e-02

Pos logit contributions
oto+0.017
iri+0.017
 mad+0.017
 awful+0.016
gh+0.016
Neg logit contributions
 All-0.009
Use-0.007
 Using-0.007
Find-0.006
 who-0.006
Token day
Token ID1110
Feature activation+2.36e-02

Pos logit contributions
 All+0.106
 Using+0.085
 who+0.083
Use+0.082
Let+0.081
Neg logit contributions
oto-0.107
 mad-0.106
iri-0.104
 awful-0.102
gh-0.100
Token,
Token ID11
Feature activation-1.42e-02

Pos logit contributions
 All+0.040
 Using+0.034
Use+0.033
Find+0.033
 who+0.032
Neg logit contributions
 mad-0.038
oto-0.038
gh-0.037
iri-0.037
 awful-0.035
Token they
Token ID484
Feature activation+1.89e-02

Pos logit contributions
oto+0.026
 mad+0.026
iri+0.026
gh+0.024
 awful+0.024
Neg logit contributions
 All-0.021
 Using-0.018
Use-0.017
 Taking-0.017
Find-0.016
Token go
Token ID467
Feature activation-4.74e-02

Pos logit contributions
 All+0.030
 Using+0.025
Use+0.023
 Taking+0.023
 With+0.023
Neg logit contributions
 mad-0.034
oto-0.033
iri-0.032
 awful-0.032
gh-0.030
Token to
Token ID284
Feature activation-7.37e-02

Pos logit contributions
 mad+0.071
oto+0.070
iri+0.067
gh+0.064
 awful+0.064
Neg logit contributions
 All-0.089
 Using-0.073
Use-0.071
 Taking-0.070
Find-0.070
Token the
Token ID262
Feature activation+5.83e-02

Pos logit contributions
 mad+0.151
oto+0.147
iri+0.142
 ours+0.137
sing+0.136
Neg logit contributions
 All-0.096
 Using-0.076
Use-0.073
-0.071
If-0.069
Token park
Token ID3952
Feature activation+2.80e-02

Pos logit contributions
 All+0.072
 who+0.051
 Using+0.051
Use+0.049
Let+0.049
Neg logit contributions
 mad-0.126
oto-0.125
iri-0.124
 awful-0.122
gh-0.120
Token with
Token ID351
Feature activation+1.61e-01

Pos logit contributions
 All+0.046
Use+0.039
Find+0.039
+0.039
 Using+0.038
Neg logit contributions
 mad-0.046
oto-0.045
sing-0.043
iri-0.041
gh-0.040
Token their
Token ID511
Feature activation-1.49e-01

Pos logit contributions
 All+0.273
 who+0.212
 Using+0.211
Let+0.205
Use+0.203
Neg logit contributions
oto-0.278
 mad-0.278
iri-0.273
 awful-0.269
gh-0.265
Token dogs
Token ID6844
Feature activation+4.07e-02

Pos logit contributions
 awful+0.252
iri+0.250
 mad+0.250
oto+0.249
gh+0.244
Neg logit contributions
 All-0.253
 who-0.202
Let-0.198
 led-0.192
 Using-0.191
Token She
Token ID1375
Feature activation+1.09e-02

Pos logit contributions
 mad+0.037
oto+0.037
iri+0.035
 awful+0.035
gh+0.034
Neg logit contributions
 All-0.026
Spread-0.023
 who-0.022
 Using-0.022
Find-0.022
Token tries
Token ID8404
Feature activation+4.47e-02

Pos logit contributions
 All+0.018
If+0.016
 Using+0.015
Find+0.015
Spread+0.015
Neg logit contributions
 mad-0.019
 awful-0.017
 angry-0.016
iri-0.016
oto-0.016
Token to
Token ID284
Feature activation-7.86e-02

Pos logit contributions
 All+0.074
 who+0.058
 Using+0.058
Let+0.056
Use+0.056
Neg logit contributions
oto-0.078
 mad-0.077
iri-0.077
 awful-0.076
gh-0.073
Token do
Token ID466
Feature activation+9.92e-02

Pos logit contributions
 mad+0.144
oto+0.143
iri+0.140
 awful+0.137
gh+0.134
Neg logit contributions
 All-0.123
 Using-0.097
 Taking-0.093
Use-0.093
Let-0.092
Token the
Token ID262
Feature activation+4.52e-02

Pos logit contributions
 All+0.159
Let+0.141
Use+0.140
 Using+0.138
+0.137
Neg logit contributions
 mad-0.170
 awful-0.165
gh-0.159
oto-0.153
iri-0.146
Token same
Token ID976
Feature activation-2.54e-02

Pos logit contributions
 All+0.083
 who+0.066
 Using+0.066
Use+0.065
Let+0.064
Neg logit contributions
iri-0.069
oto-0.069
 mad-0.069
 awful-0.066
gh-0.065
Token,
Token ID11
Feature activation-3.06e-02

Pos logit contributions
 mad+0.042
oto+0.041
 awful+0.040
iri+0.040
gh+0.040
Neg logit contributions
 All-0.045
 Using-0.036
Use-0.035
Let-0.035
 Taking-0.035
Token but
Token ID475
Feature activation+4.30e-02

Pos logit contributions
 mad+0.058
 awful+0.055
oto+0.054
iri+0.053
sing+0.052
Neg logit contributions
 All-0.047
 Using-0.037
Use-0.035
Find-0.035
 Taking-0.034
Token the
Token ID262
Feature activation+4.65e-02

Pos logit contributions
 All+0.064
 Using+0.052
Use+0.049
Find+0.048
 who+0.048
Neg logit contributions
 mad-0.082
oto-0.078
 awful-0.078
iri-0.076
gh-0.076
Token rock
Token ID3881
Feature activation+6.40e-02

Pos logit contributions
 All+0.086
 Using+0.072
 Taking+0.069
Use+0.068
 who+0.068
Neg logit contributions
 mad-0.072
oto-0.071
iri-0.069
 awful-0.069
gh-0.065
Token is
Token ID318
Feature activation-1.96e-02

Pos logit contributions
 All+0.082
 Using+0.060
Use+0.057
 who+0.057
Let+0.056
Neg logit contributions
 mad-0.136
oto-0.133
 awful-0.131
iri-0.131
gh-0.128
Token enjoying
Token ID13226
Feature activation+4.09e-02

Pos logit contributions
 All+0.063
 Using+0.051
 who+0.051
Use+0.049
Let+0.049
Neg logit contributions
oto-0.051
 mad-0.050
iri-0.050
 awful-0.048
gh-0.047
Token her
Token ID607
Feature activation-1.15e-01

Pos logit contributions
 All+0.061
Use+0.052
 Using+0.052
Find+0.051
 Taking+0.049
Neg logit contributions
 mad-0.075
oto-0.072
iri-0.071
 awful-0.070
gh-0.070
Token adventure
Token ID8855
Feature activation-7.38e-03

Pos logit contributions
 mad+0.165
oto+0.161
 awful+0.155
iri+0.155
gh+0.153
Neg logit contributions
 All-0.226
 Using-0.187
Use-0.185
 who-0.182
Find-0.182
Token and
Token ID290
Feature activation-1.18e-02

Pos logit contributions
 mad+0.013
 awful+0.012
oto+0.012
gh+0.012
iri+0.012
Neg logit contributions
 All-0.012
Find-0.010
Use-0.010
 Using-0.010
Let-0.009
Token she
Token ID673
Feature activation+1.66e-02

Pos logit contributions
 mad+0.021
 awful+0.019
oto+0.018
 angry+0.018
iri+0.017
Neg logit contributions
 All-0.018
-0.016
Find-0.016
Use-0.016
 Using-0.015
Token even
Token ID772
Feature activation-6.68e-02

Pos logit contributions
 All+0.025
 Using+0.020
Use+0.020
If+0.020
 Taking+0.019
Neg logit contributions
 mad-0.030
oto-0.029
iri-0.029
 awful-0.028
gh-0.028
Token saw
Token ID2497
Feature activation+1.27e-04

Pos logit contributions
 mad+0.122
 awful+0.107
oto+0.106
 angry+0.103
iri+0.102
Neg logit contributions
 All-0.111
-0.093
 Using-0.090
 Taking-0.085
Use-0.085
Token a
Token ID257
Feature activation+3.90e-02

Pos logit contributions
 All+0.000
+0.000
 Using+0.000
Use+0.000
Find+0.000
Neg logit contributions
 mad-0.000
 awful-0.000
oto-0.000
iri-0.000
ong-0.000
Token fairy
Token ID25607
Feature activation+1.24e-01

Pos logit contributions
 All+0.073
 Using+0.059
 who+0.059
Use+0.058
Let+0.058
Neg logit contributions
oto-0.058
 mad-0.058
iri-0.057
 awful-0.056
gh-0.054
Token.
Token ID13
Feature activation+4.19e-03

Pos logit contributions
 All+0.209
 Using+0.172
Use+0.169
If+0.166
Let+0.164
Neg logit contributions
 mad-0.212
iri-0.204
oto-0.202
 awful-0.199
gh-0.198
Token She
Token ID1375
Feature activation+2.59e-02

Pos logit contributions
Find+0.005
 puzzles+0.005
Spread+0.005
Use+0.005
 ties+0.005
Neg logit contributions
aney-0.006
iri-0.006
oto-0.006
ax-0.006
 awful-0.006
Token man
Token ID582
Feature activation+2.75e-02

Pos logit contributions
 All+0.085
 Using+0.073
 Taking+0.070
Use+0.070
 With+0.069
Neg logit contributions
 mad-0.054
oto-0.054
iri-0.051
 awful-0.051
gh-0.047
Token thanked
Token ID26280
Feature activation-2.56e-02

Pos logit contributions
 All+0.048
If+0.042
+0.041
 Using+0.040
 Taking+0.040
Neg logit contributions
 mad-0.045
oto-0.040
 awful-0.040
iri-0.039
urger-0.039
Token them
Token ID606
Feature activation-2.36e-01

Pos logit contributions
oto+0.030
iri+0.029
 mad+0.029
 awful+0.027
gh+0.026
Neg logit contributions
 All-0.057
 Using-0.049
Use-0.048
 Taking-0.047
Find-0.047
Token and
Token ID290
Feature activation-3.37e-02

Pos logit contributions
 mad+0.419
oto+0.418
iri+0.393
 awful+0.391
urger+0.374
Neg logit contributions
 All-0.369
Use-0.309
Find-0.306
 Using-0.295
Let-0.294
Token gave
Token ID2921
Feature activation+8.56e-02

Pos logit contributions
 mad+0.054
oto+0.049
 awful+0.046
iri+0.045
 angry+0.045
Neg logit contributions
 All-0.061
If-0.055
-0.054
 Flying-0.051
 Taking-0.051
Token them
Token ID606
Feature activation-2.36e-01

Pos logit contributions
 All+0.169
 who+0.140
Let+0.137
 Using+0.136
Use+0.133
Neg logit contributions
 mad-0.120
oto-0.119
iri-0.119
 awful-0.117
gh-0.114
Token some
Token ID617
Feature activation+6.29e-02

Pos logit contributions
 mad+0.501
oto+0.500
iri+0.493
 awful+0.486
gh+0.479
Neg logit contributions
 All-0.295
 Using-0.212
 who-0.209
Use-0.203
Let-0.201
Token cookies
Token ID14746
Feature activation+9.36e-03

Pos logit contributions
 All+0.118
 Using+0.096
Use+0.093
 who+0.093
 Taking+0.093
Neg logit contributions
 mad-0.095
oto-0.095
 awful-0.093
iri-0.092
gh-0.086
Token.
Token ID13
Feature activation-2.03e-02

Pos logit contributions
 All+0.011
 who+0.009
 Using+0.008
Let+0.008
Use+0.007
Neg logit contributions
 mad-0.020
iri-0.020
oto-0.020
 awful-0.020
gh-0.019
Token They
Token ID1119
Feature activation+4.61e-02

Pos logit contributions
oto+0.035
 mad+0.034
 awful+0.033
iri+0.032
ax+0.031
Neg logit contributions
 All-0.026
-0.025
Use-0.025
Spread-0.024
Find-0.024
Token ate
Token ID15063
Feature activation-2.87e-02

Pos logit contributions
 All+0.073
 Taking+0.061
 Flying+0.060
 Using+0.059
If+0.059
Neg logit contributions
oto-0.083
 mad-0.081
iri-0.080
ige-0.077
 awful-0.076
Token "
Token ID366
Feature activation+1.32e-01

Pos logit contributions
oto+0.173
 mad+0.172
iri+0.171
 awful+0.168
gh+0.163
Neg logit contributions
 All-0.064
 who-0.038
 Using-0.037
Let-0.034
 With-0.033
TokenHi
Token ID17250
Feature activation-8.09e-02

Pos logit contributions
 All+0.175
 Using+0.135
 who+0.131
 Taking+0.126
 led+0.125
Neg logit contributions
oto-0.264
 mad-0.259
iri-0.258
gh-0.251
 awful-0.246
Token!"
Token ID2474
Feature activation-9.72e-02

Pos logit contributions
 mad+0.118
oto+0.116
iri+0.111
gh+0.111
 awful+0.109
Neg logit contributions
 All-0.147
Use-0.132
 Using-0.130
If-0.126
 Taking-0.124
Token back
Token ID736
Feature activation-2.21e-01

Pos logit contributions
 mad+0.171
oto+0.163
 awful+0.162
ong+0.150
sing+0.148
Neg logit contributions
 All-0.135
-0.125
Use-0.125
Find-0.123
Spread-0.118
Token to
Token ID284
Feature activation-6.56e-02

Pos logit contributions
oto+0.358
 mad+0.353
iri+0.318
 awful+0.291
 ours+0.290
Neg logit contributions
 All-0.363
Use-0.333
If-0.313
Find-0.310
Have-0.309
Token him
Token ID683
Feature activation-1.93e-01

Pos logit contributions
 mad+0.099
oto+0.095
 awful+0.090
iri+0.088
gh+0.086
Neg logit contributions
 All-0.119
Use-0.104
 Using-0.103
-0.101
Find-0.100
Token.
Token ID13
Feature activation+1.41e-02

Pos logit contributions
 mad+0.330
oto+0.299
sing+0.291
 awful+0.285
gh+0.283
Neg logit contributions
 All-0.324
Use-0.295
If-0.282
Find-0.279
-0.275
Token Tim
Token ID5045
Feature activation-8.28e-02

Pos logit contributions
Use+0.017
 All+0.017
Find+0.016
+0.016
es+0.016
Neg logit contributions
 mad-0.024
oto-0.023
 awful-0.023
sing-0.022
 ours-0.022
Token was
Token ID373
Feature activation+3.15e-02

Pos logit contributions
 mad+0.156
oto+0.150
iri+0.138
urger+0.135
anut+0.134
Neg logit contributions
 All-0.121
Use-0.099
Find-0.098
If-0.098
-0.097
Token amazed
Token ID24562
Feature activation-2.13e-02

Pos logit contributions
 All+0.061
 who+0.052
 Using+0.048
 Taking+0.047
Let+0.047
Neg logit contributions
oto-0.047
iri-0.046
 awful-0.044
gh-0.044
 mad-0.043
Token and
Token ID290
Feature activation-1.39e-02

Pos logit contributions
oto+0.043
 mad+0.043
iri+0.043
 awful+0.042
gh+0.042
Neg logit contributions
 All-0.029
 who-0.022
 Using-0.021
Let-0.020
Use-0.020
Token good
Token ID922
Feature activation-8.98e-02

Pos logit contributions
 All+0.081
 who+0.066
 Using+0.064
Find+0.063
Let+0.063
Neg logit contributions
iri-0.074
oto-0.073
 mad-0.069
 awful-0.069
gh-0.067
Token result
Token ID1255
Feature activation+6.40e-02

Pos logit contributions
 mad+0.123
oto+0.116
gh+0.112
 awful+0.111
iri+0.108
Neg logit contributions
 All-0.184
 Using-0.154
 who-0.151
Use-0.150
Let-0.150
Token.
Token ID13
Feature activation+1.65e-02

Pos logit contributions
 All+0.113
Find+0.100
Use+0.097
Let+0.095
+0.095
Neg logit contributions
 mad-0.098
gh-0.097
 awful-0.097
oto-0.096
ong-0.086
Token She
Token ID1375
Feature activation+3.02e-02

Pos logit contributions
 All+0.019
pen+0.018
Spread+0.018
Use+0.018
 who+0.018
Neg logit contributions
 mad-0.029
oto-0.028
 awful-0.028
gh-0.027
sing-0.027
Token worked
Token ID3111
Feature activation+6.15e-02

Pos logit contributions
 All+0.046
 Using+0.036
 Flying+0.036
 Taking+0.036
 Being+0.035
Neg logit contributions
 mad-0.057
oto-0.053
 awful-0.053
gh-0.052
iri-0.050
Token really
Token ID1107
Feature activation-1.74e-01

Pos logit contributions
 All+0.137
 Using+0.115
 who+0.114
Use+0.112
Let+0.111
Neg logit contributions
oto-0.072
iri-0.071
 mad-0.071
 awful-0.067
gh-0.065
Token hard
Token ID1327
Feature activation-9.47e-02

Pos logit contributions
 mad+0.139
gh+0.129
 awful+0.126
oto+0.120
iri+0.104
Neg logit contributions
 All-0.451
 Using-0.395
Let-0.392
Use-0.389
Find-0.389
Token on
Token ID319
Feature activation+8.57e-02

Pos logit contributions
 mad+0.158
gh+0.151
oto+0.149
 awful+0.148
ong+0.139
Neg logit contributions
 All-0.157
Let-0.132
Find-0.131
Use-0.131
Spread-0.130
Token her
Token ID607
Feature activation-1.13e-01

Pos logit contributions
 All+0.146
Use+0.120
Find+0.119
 Using+0.118
Let+0.118
Neg logit contributions
 mad-0.142
gh-0.137
 awful-0.136
oto-0.130
iri-0.128
Token project
Token ID1628
Feature activation+1.49e-01

Pos logit contributions
iri+0.188
oto+0.188
 mad+0.181
 awful+0.175
gh+0.167
Neg logit contributions
 All-0.201
 who-0.160
 Using-0.159
 Taking-0.154
 With-0.153
Token and
Token ID290
Feature activation-1.07e-02

Pos logit contributions
 All+0.237
Find+0.197
Use+0.196
Let+0.193
 Using+0.191
Neg logit contributions
gh-0.253
 mad-0.252
oto-0.247
 awful-0.244
ong-0.228
Token heat
Token ID4894
Feature activation-4.81e-02

Pos logit contributions
 All+0.083
 who+0.066
Let+0.065
Use+0.063
 Stone+0.061
Neg logit contributions
oto-0.089
gh-0.088
iri-0.088
 mad-0.086
urger-0.085
Token.
Token ID13
Feature activation+3.70e-03

Pos logit contributions
 mad+0.098
oto+0.098
iri+0.096
 awful+0.095
gh+0.093
Neg logit contributions
 All-0.064
 Using-0.047
 who-0.047
Use-0.046
Let-0.045
Token They
Token ID1119
Feature activation+6.44e-02

Pos logit contributions
Use+0.005
pen+0.004
 acts+0.004
Find+0.004
vered+0.004
Neg logit contributions
oto-0.006
iri-0.006
 awful-0.006
sing-0.006
aney-0.006
Token were
Token ID547
Feature activation+2.33e-02

Pos logit contributions
 All+0.087
 Using+0.069
 Taking+0.066
 With+0.065
Use+0.064
Neg logit contributions
 mad-0.130
oto-0.129
iri-0.128
 awful-0.124
gh-0.121
Token both
Token ID1111
Feature activation-6.84e-02

Pos logit contributions
 All+0.041
 who+0.034
 Using+0.032
Let+0.031
 Taking+0.031
Neg logit contributions
oto-0.038
iri-0.037
gh-0.036
 mad-0.036
 awful-0.036
Token very
Token ID845
Feature activation-2.08e-01

Pos logit contributions
oto+0.083
 mad+0.082
iri+0.080
 awful+0.078
gh+0.076
Neg logit contributions
 All-0.150
 Using-0.126
 who-0.125
Use-0.123
Let-0.122
Token happy
Token ID3772
Feature activation-1.01e-01

Pos logit contributions
oto+0.316
iri+0.302
gh+0.302
urger+0.289
 awful+0.283
Neg logit contributions
 All-0.410
 who-0.341
 Taking-0.317
 With-0.316
 Using-0.313
Token and
Token ID290
Feature activation-2.03e-02

Pos logit contributions
 mad+0.170
oto+0.166
 awful+0.155
iri+0.155
sing+0.154
Neg logit contributions
 All-0.162
Use-0.146
Find-0.144
 Using-0.143
If-0.138
Token not
Token ID407
Feature activation-1.26e-01

Pos logit contributions
oto+0.035
iri+0.034
 mad+0.034
 awful+0.033
gh+0.033
Neg logit contributions
 All-0.034
 who-0.027
 Using-0.027
Let-0.026
Use-0.026
Token so
Token ID523
Feature activation-1.58e-01

Pos logit contributions
oto+0.182
iri+0.177
 mad+0.170
gh+0.169
 awful+0.168
Neg logit contributions
 All-0.250
 who-0.207
 Using-0.202
Let-0.198
Use-0.196
Token busy
Token ID8179
Feature activation-9.01e-02

Pos logit contributions
 mad+0.193
 awful+0.170
 angry+0.139
sing+0.138
oto+0.138
Neg logit contributions
 All-0.354
Use-0.325
 Using-0.320
Let-0.316
Find-0.316
Token.
Token ID13
Feature activation+1.77e-02

Pos logit contributions
 All+0.152
 Using+0.126
Use+0.126
Find+0.124
 Taking+0.119
Neg logit contributions
 mad-0.170
oto-0.163
iri-0.161
 awful-0.158
gh-0.154
Token
Token ID220
Feature activation-3.93e-02

Pos logit contributions
 All+0.019
Use+0.017
Find+0.016
Spread+0.016
 Using+0.015
Neg logit contributions
 mad-0.035
oto-0.034
iri-0.034
 awful-0.033
gh-0.032
Token
Token ID198
Feature activation+2.77e-02

Pos logit contributions
oto+0.089
 mad+0.087
iri+0.087
gh+0.085
 awful+0.084
Neg logit contributions
 All-0.038
Use-0.029
Find-0.028
 Using-0.027
-0.027
Token
Token ID198
Feature activation+2.90e-02

Pos logit contributions
 All+0.026
 Using+0.018
Use+0.017
 who+0.017
+0.017
Neg logit contributions
oto-0.065
 mad-0.064
iri-0.063
gh-0.062
 awful-0.061
Tokenâ
Token ID22940
Feature activation-1.25e-01

Pos logit contributions
 All+0.043
 Using+0.034
 led+0.033
Use+0.032
 who+0.032
Neg logit contributions
 mad-0.052
oto-0.052
iri-0.051
gh-0.049
 awful-0.049
Token
Token ID26391
Feature activation-1.84e-01

Pos logit contributions
oto+0.118
 mad+0.116
iri+0.111
 awful+0.110
gh+0.105
Neg logit contributions
 All-0.304
 Using-0.260
 who-0.259
Find-0.257
Use-0.256
Token
Token ID129
Feature activation-4.58e-02

Pos logit contributions
iri+0.357
 mad+0.353
gh+0.352
oto+0.350
sing+0.337
Neg logit contributions
 All-0.237
Use-0.191
 Using-0.186
If-0.184
Find-0.179
Token
Token ID241
Feature activation-6.55e-02

Pos logit contributions
iri+0.065
oto+0.065
 mad+0.064
urger+0.062
 awful+0.061
Neg logit contributions
 All-0.091
 Using-0.075
 Taking-0.072
 Flying-0.071
 With-0.071
TokenWe
Token ID1135
Feature activation+5.26e-02

Pos logit contributions
 mad+0.137
oto+0.135
iri+0.132
gh+0.130
 awful+0.124
Neg logit contributions
 All-0.074
 Using-0.061
 led-0.058
es-0.055
 Stone-0.054
Token are
Token ID389
Feature activation+3.26e-02

Pos logit contributions
 All+0.072
 Using+0.063
 Taking+0.062
 With+0.059
Use+0.058
Neg logit contributions
 mad-0.101
iri-0.094
 awful-0.094
oto-0.094
gh-0.091
Token looking
Token ID2045
Feature activation-3.44e-02

Pos logit contributions
 All+0.052
 Using+0.043
Use+0.042
Find+0.041
Let+0.041
Neg logit contributions
 mad-0.058
oto-0.056
 awful-0.055
iri-0.055
gh-0.053