TOP ACTIVATIONS
MAX = 13.600

<|endoftext|> <|endoftext|> <|endoftext|> <|endoftext|> <|endoftext|> L ila and Ben are friends
that he wanted . <|endoftext|> Ak io was a 3 year
squash ." <|endoftext|> <|endoftext|> <|endoftext|> L ily and Sam went on
fly again soon ! <|endoftext|> Jul ie and Joe were very
he shook it . <|endoftext|> Pa ola and her mother were
, a gentle girl named Ci ara stumbled upon a lovely
it seemed magical ! <|endoftext|> Sus ie was very proud .
her brown fur . <|endoftext|> Ark ie wanted his own equipment
lasting memories . <|endoftext|> Gr ac ie was a gifted girl
there were two friends , Ab ig ail and Mat ty
two little girls , named Ab bie and Daisy . Ab
never forgot it . <|endoftext|> Jan ie and John were best
time ! â <|endoftext|> Pa olo was very excited .
of magic . <|endoftext|> Isa ac was preparing breakfast in the
from now on . <|endoftext|> Z oe was playing in the
a time two friends , Jas mine and John , were
still bl ossom . <|endoftext|> Sus ie and her mom were
." <|endoftext|> <|endoftext|> <|endoftext|> <|endoftext|> Anna and Ben are playing with
, a little girl named Ab ig ail went to the
crown back again ! <|endoftext|> Em ma was walking in the

MIN ACTIVATIONS
MIN = -12.412

not played with the ash tr ay . They wished they
I should have stirred the soup more slowly so it wouldn
. We can share the mel on ." Tom said .
did . You wasted the soup . You made a mess
not play with the ash tr ay . It is dangerous
wished they had shared the pasta . They looked at the
wanted to keep the ash tr ay . So he tried
shouldn 't have taken the soup without asking . Now you
not played with the ash tr ay . They wish they
will not touch the ash tr ay again ," Tim says
field and still keep the corn safe . The farmer and
make the waiter drop the pizz as . They said sorry
not nice to keep the dish all to yourself ."
field and still keep the corn safe . The farmer and
helping me make my magic soup . It will make me
We should have shared the cream and been nice to each
I want to share my jelly with you . Can we
favorite . It makes the soup creamy and y ummy ,"
You can 't stir the soup with it ! You can
vegetable . It makes the soup creamy and tasty . Try

INTERVAL 7.097 - 13.600
CONTAINS 0.852%

day , two friends , Winston the mouse and Jo jo
are happy . <|endoftext|> L ily and Tom were twins who
play with . From then on , the dog became their
, a little girl named Lily wanted to ride her bike
the garden !" Mark replied , " Let 's

INTERVAL 0.594 - 7.097
CONTAINS 39.678%

He hugged Mia back and said , " Thank you ,
Jack cried for help and soon , a
little girl . She was only three years old , but
you . You should not poke their home . They live
he fell down and got hurt . His mom my and

INTERVAL -5.909 - 0.594
CONTAINS 57.240%

time the little boy visited , the creek was full and
and made her face funny . But she did not scream
away !" The monster grow led louder . It
. He says , " My dad got it for me
them new v ests . They are blue and warm and

INTERVAL -12.412 - -5.909
CONTAINS 2.230%

player . Not the new hockey stick ." Ben
stream began to get bigger and bigger , and soon there
It is big and black and has many buttons . Anna
wished they had shared the ball and played together . They
the scarf , making holes and r ips . He thought
Token<|endoftext|>
Token ID50256
Feature activation+5.79e+00

Pos logit contributions
ding+0.068
fire+0.061
tr+0.060
storm+0.060
ache+0.059
Neg logit contributions
els-0.048
ubby-0.046
ere-0.046
el-0.046
 opinions-0.045
Token<|endoftext|>
Token ID50256
Feature activation+7.81e+00

Pos logit contributions
els+0.106
en+0.096
ubby+0.096
el+0.095
ere+0.095
Neg logit contributions
ding-0.159
tr-0.150
fire-0.142
storm-0.138
whe-0.136
Token<|endoftext|>
Token ID50256
Feature activation+8.96e+00

Pos logit contributions
els+0.150
en+0.138
ubby+0.137
el+0.136
ere+0.136
Neg logit contributions
ding-0.206
tr-0.195
fire-0.183
storm-0.178
whe-0.175
Token<|endoftext|>
Token ID50256
Feature activation+9.57e+00

Pos logit contributions
els+0.176
en+0.162
ubby+0.160
ere+0.159
el+0.159
Neg logit contributions
ding-0.233
tr-0.220
fire-0.206
storm-0.200
whe-0.197
Token<|endoftext|>
Token ID50256
Feature activation+9.89e+00

Pos logit contributions
els+0.189
en+0.174
ubby+0.173
ere+0.171
el+0.171
Neg logit contributions
ding-0.247
tr-0.233
fire-0.219
storm-0.211
whe-0.209
TokenL
Token ID43
Feature activation+1.36e+01

Pos logit contributions
els+0.197
en+0.181
ubby+0.180
ere+0.177
el+0.177
Neg logit contributions
ding-0.255
tr-0.240
fire-0.226
storm-0.217
whe-0.216
Tokenila
Token ID10102
Feature activation+1.08e+01

Pos logit contributions
ened+0.272
els+0.235
 angles+0.234
ening+0.232
efully+0.230
Neg logit contributions
ding-0.322
t-0.303
tr-0.301
uts-0.272
 much-0.268
Token and
Token ID290
Feature activation+6.57e+00

Pos logit contributions
els+0.244
ubby+0.223
 angles+0.222
ened+0.221
eful+0.215
Neg logit contributions
ding-0.242
tr-0.228
fire-0.205
t-0.201
storm-0.191
Token Ben
Token ID3932
Feature activation+1.16e+01

Pos logit contributions
els+0.113
en+0.101
ubby+0.100
ened+0.098
 angles+0.096
Neg logit contributions
ding-0.193
tr-0.179
fire-0.167
storm-0.166
whe-0.165
Token are
Token ID389
Feature activation+8.89e+00

Pos logit contributions
els+0.263
ubby+0.240
ened+0.236
 angles+0.233
en+0.231
Neg logit contributions
ding-0.270
tr-0.251
fire-0.232
t-0.224
whe-0.217
Token friends
Token ID2460
Feature activation+6.27e+00

Pos logit contributions
els+0.195
en+0.189
ened+0.189
 angles+0.177
 opinions+0.170
Neg logit contributions
ding-0.214
tr-0.196
t-0.179
fire-0.175
whe-0.172
Token that
Token ID326
Feature activation-4.90e+00

Pos logit contributions
ding+0.199
tr+0.188
fire+0.182
 villains+0.181
Brown+0.172
Neg logit contributions
en-0.135
els-0.127
ened-0.126
el-0.125
 me-0.118
Token he
Token ID339
Feature activation-3.49e+00

Pos logit contributions
ding+0.125
tr+0.114
fire+0.110
uts+0.106
 numb+0.105
Neg logit contributions
ened-0.093
en-0.087
els-0.085
el-0.085
 opinions-0.084
Token wanted
Token ID2227
Feature activation+1.54e+00

Pos logit contributions
ding+0.086
tr+0.080
fire+0.076
whe+0.073
storm+0.072
Neg logit contributions
els-0.073
en-0.069
ened-0.068
ubby-0.067
ere-0.067
Token.
Token ID13
Feature activation-3.31e+00

Pos logit contributions
els+0.035
en+0.033
ened+0.032
el+0.032
ubby+0.031
Neg logit contributions
ding-0.036
tr-0.033
fire-0.031
storm-0.030
whe-0.029
Token<|endoftext|>
Token ID50256
Feature activation+6.25e+00

Pos logit contributions
ding+0.083
tr+0.076
fire+0.075
storm+0.073
 numb+0.071
Neg logit contributions
els-0.062
ere-0.062
en-0.060
ubby-0.059
ened-0.059
TokenAk
Token ID33901
Feature activation+1.35e+01

Pos logit contributions
els+0.137
en+0.128
ubby+0.127
ere+0.126
el+0.125
Neg logit contributions
ding-0.147
tr-0.137
fire-0.128
storm-0.124
whe-0.122
Tokenio
Token ID952
Feature activation+7.26e+00

Pos logit contributions
 delight+0.286
 opinions+0.284
 agreement+0.283
 amaz+0.273
 confusion+0.258
Neg logit contributions
ding-0.290
t-0.256
ongo-0.239
tr-0.233
 he-0.222
Token was
Token ID373
Feature activation+5.10e+00

Pos logit contributions
els+0.138
ubby+0.134
ened+0.127
 angles+0.126
en+0.124
Neg logit contributions
ding-0.187
tr-0.178
fire-0.158
ache-0.157
t-0.156
Token a
Token ID257
Feature activation-9.64e-01

Pos logit contributions
els+0.122
en+0.112
ened+0.110
 opinions+0.108
el+0.108
Neg logit contributions
ding-0.115
tr-0.107
fire-0.096
whe-0.095
storm-0.092
Token 3
Token ID513
Feature activation+4.75e+00

Pos logit contributions
ding+0.030
tr+0.029
whe+0.028
fire+0.028
uts+0.027
Neg logit contributions
ubby-0.013
els-0.012
en-0.011
ere-0.011
el-0.011
Token year
Token ID614
Feature activation+3.36e+00

Pos logit contributions
els+0.106
en+0.098
ubby+0.098
ened+0.097
ere+0.096
Neg logit contributions
ding-0.110
tr-0.101
fire-0.097
t-0.092
whe-0.091
Token squash
Token ID34613
Feature activation-6.99e+00

Pos logit contributions
ding+0.089
tr+0.080
fire+0.076
whe+0.074
t+0.074
Neg logit contributions
els-0.097
en-0.091
ened-0.091
ubby-0.090
el-0.090
Token."
Token ID526
Feature activation-1.60e+00

Pos logit contributions
ding+0.177
tr+0.166
fire+0.151
storm+0.148
whe+0.147
Neg logit contributions
els-0.145
en-0.135
ubby-0.130
 opinions-0.129
ere-0.128
Token<|endoftext|>
Token ID50256
Feature activation+7.01e+00

Pos logit contributions
ding+0.036
tr+0.034
fire+0.032
storm+0.031
whe+0.030
Neg logit contributions
els-0.037
ubby-0.035
en-0.034
ere-0.034
el-0.034
Token<|endoftext|>
Token ID50256
Feature activation+8.86e+00

Pos logit contributions
els+0.134
en+0.123
ubby+0.122
ened+0.120
ere+0.119
Neg logit contributions
ding-0.187
tr-0.175
fire-0.166
whe-0.160
storm-0.159
Token<|endoftext|>
Token ID50256
Feature activation+9.81e+00

Pos logit contributions
els+0.172
en+0.157
ubby+0.157
ened+0.155
ere+0.153
Neg logit contributions
ding-0.233
tr-0.218
fire-0.207
whe-0.199
storm-0.198
TokenL
Token ID43
Feature activation+1.32e+01

Pos logit contributions
els+0.193
en+0.177
ubby+0.176
ened+0.175
ere+0.171
Neg logit contributions
ding-0.256
tr-0.238
fire-0.226
whe-0.218
storm-0.216
Tokenily
Token ID813
Feature activation+1.15e+01

Pos logit contributions
ened+0.276
ening+0.242
 angles+0.237
 opinions+0.232
els+0.232
Neg logit contributions
ding-0.299
t-0.291
tr-0.268
uts-0.263
 much-0.252
Token and
Token ID290
Feature activation+5.79e+00

Pos logit contributions
els+0.300
ubby+0.280
 angles+0.275
ened+0.272
ered+0.271
Neg logit contributions
ding-0.209
tr-0.207
fire-0.168
age-0.163
t-0.160
Token Sam
Token ID3409
Feature activation+1.13e+01

Pos logit contributions
els+0.104
en+0.094
ubby+0.094
 angles+0.091
ened+0.091
Neg logit contributions
ding-0.161
tr-0.151
whe-0.141
fire-0.139
storm-0.139
Token went
Token ID1816
Feature activation+8.86e+00

Pos logit contributions
els+0.280
ubby+0.263
 angles+0.252
ened+0.246
eful+0.245
Neg logit contributions
ding-0.234
tr-0.210
fire-0.185
t-0.185
whe-0.182
Token on
Token ID319
Feature activation+9.62e+00

Pos logit contributions
els+0.209
ubby+0.197
en+0.195
el+0.192
ened+0.190
Neg logit contributions
ding-0.194
tr-0.179
fire-0.172
storm-0.163
whe-0.161
Token fly
Token ID6129
Feature activation-9.32e-01

Pos logit contributions
els+0.008
en+0.008
el+0.008
 opinions+0.008
ubby+0.008
Neg logit contributions
ding-0.011
tr-0.010
fire-0.010
storm-0.009
t-0.009
Token again
Token ID757
Feature activation+6.65e-01

Pos logit contributions
ding+0.025
tr+0.025
storm+0.022
fire+0.022
t+0.022
Neg logit contributions
en-0.017
els-0.017
ened-0.016
el-0.015
ere-0.015
Token soon
Token ID2582
Feature activation+8.04e-01

Pos logit contributions
els+0.013
en+0.012
ubby+0.011
ened+0.011
el+0.011
Neg logit contributions
ding-0.018
tr-0.017
fire-0.016
storm-0.015
whe-0.015
Token!
Token ID0
Feature activation-2.32e+00

Pos logit contributions
els+0.016
en+0.016
ened+0.015
el+0.015
ubby+0.015
Neg logit contributions
ding-0.020
tr-0.020
fire-0.018
storm-0.017
whe-0.017
Token<|endoftext|>
Token ID50256
Feature activation+6.58e+00

Pos logit contributions
ding+0.057
tr+0.053
fire+0.052
storm+0.051
ache+0.049
Neg logit contributions
els-0.045
ened-0.044
en-0.043
el-0.042
 opinions-0.042
TokenJul
Token ID16980
Feature activation+1.32e+01

Pos logit contributions
els+0.131
en+0.121
ubby+0.120
ere+0.118
el+0.118
Neg logit contributions
ding-0.168
tr-0.159
fire-0.149
storm-0.144
whe-0.143
Tokenie
Token ID494
Feature activation+7.06e+00

Pos logit contributions
rew+0.247
els+0.242
 asks+0.240
 opinions+0.234
efully+0.232
Neg logit contributions
ding-0.267
t-0.261
tr-0.239
ache-0.227
hell-0.221
Token and
Token ID290
Feature activation+1.30e+00

Pos logit contributions
els+0.162
ubby+0.149
 opinions+0.142
en+0.140
 angles+0.140
Neg logit contributions
ding-0.164
tr-0.147
fire-0.139
storm-0.135
t-0.133
Token Joe
Token ID5689
Feature activation+8.47e+00

Pos logit contributions
els+0.024
ubby+0.022
en+0.022
ened+0.021
ere+0.021
Neg logit contributions
ding-0.035
tr-0.033
fire-0.031
whe-0.030
storm-0.030
Token were
Token ID547
Feature activation+5.12e+00

Pos logit contributions
els+0.210
ubby+0.199
 opinions+0.191
 angles+0.189
eful+0.188
Neg logit contributions
ding-0.177
tr-0.152
storm-0.139
fire-0.137
whe-0.137
Token very
Token ID845
Feature activation+7.53e-02

Pos logit contributions
els+0.116
en+0.108
ened+0.107
 opinions+0.103
ere+0.103
Neg logit contributions
ding-0.120
tr-0.109
whe-0.098
fire-0.096
t-0.096
Token he
Token ID339
Feature activation-3.92e+00

Pos logit contributions
ding+0.086
tr+0.081
fire+0.071
whe+0.068
storm+0.068
Neg logit contributions
els-0.101
en-0.096
ened-0.095
 opinions-0.094
el-0.092
Token shook
Token ID14682
Feature activation+2.66e+00

Pos logit contributions
ding+0.098
tr+0.091
fire+0.086
whe+0.083
storm+0.083
Neg logit contributions
els-0.081
ubby-0.075
en-0.074
ened-0.072
ere-0.072
Token it
Token ID340
Feature activation-5.92e+00

Pos logit contributions
els+0.047
en+0.043
ubby+0.043
ened+0.042
el+0.042
Neg logit contributions
ding-0.074
tr-0.070
fire-0.066
storm-0.064
whe-0.064
Token.
Token ID13
Feature activation-2.96e+00

Pos logit contributions
ding+0.150
tr+0.136
fire+0.133
storm+0.127
whe+0.127
Neg logit contributions
els-0.119
en-0.114
ened-0.111
el-0.109
ubby-0.106
Token<|endoftext|>
Token ID50256
Feature activation+6.44e+00

Pos logit contributions
ding+0.073
tr+0.066
fire+0.065
storm+0.063
ache+0.061
Neg logit contributions
els-0.059
ere-0.057
ubby-0.057
en-0.056
el-0.056
TokenPa
Token ID28875
Feature activation+1.27e+01

Pos logit contributions
els+0.127
en+0.117
ubby+0.116
ere+0.114
ened+0.114
Neg logit contributions
ding-0.166
tr-0.156
fire-0.147
storm-0.142
whe-0.141
Tokenola
Token ID5708
Feature activation+8.59e+00

Pos logit contributions
els+0.247
 opinions+0.240
ened+0.225
 angles+0.224
en+0.218
Neg logit contributions
ding-0.311
tr-0.286
ongo-0.274
t-0.266
ache-0.261
Token and
Token ID290
Feature activation+3.11e+00

Pos logit contributions
els+0.189
ubby+0.176
 angles+0.168
en+0.167
ened+0.166
Neg logit contributions
ding-0.205
tr-0.189
fire-0.176
storm-0.172
t-0.168
Token her
Token ID607
Feature activation+5.88e+00

Pos logit contributions
els+0.061
ubby+0.055
en+0.054
ened+0.053
ere+0.052
Neg logit contributions
ding-0.082
tr-0.076
fire-0.072
whe-0.071
t-0.069
Token mother
Token ID2802
Feature activation+5.59e+00

Pos logit contributions
els+0.111
en+0.103
ened+0.099
ubby+0.098
ere+0.096
Neg logit contributions
ding-0.160
tr-0.152
fire-0.140
whe-0.137
t-0.136
Token were
Token ID547
Feature activation+5.75e+00

Pos logit contributions
els+0.140
ubby+0.138
 angles+0.133
ened+0.128
eful+0.127
Neg logit contributions
ding-0.109
tr-0.095
whe-0.084
storm-0.084
fire-0.081
Token,
Token ID11
Feature activation+5.73e+00

Pos logit contributions
els+0.134
en+0.123
ened+0.122
 opinions+0.118
ere+0.114
Neg logit contributions
ding-0.203
tr-0.188
t-0.186
fire-0.175
storm-0.172
Token a
Token ID257
Feature activation+1.83e+00

Pos logit contributions
els+0.138
 opinions+0.132
en+0.131
ened+0.129
ere+0.124
Neg logit contributions
ding-0.125
tr-0.121
fire-0.107
t-0.107
whe-0.099
Token gentle
Token ID10296
Feature activation+4.68e+00

Pos logit contributions
ubby+0.021
els+0.019
ere+0.018
el+0.016
en+0.016
Neg logit contributions
ding-0.062
tr-0.060
whe-0.057
fire-0.057
t-0.056
Token girl
Token ID2576
Feature activation+5.97e+00

Pos logit contributions
ubby+0.092
en+0.090
els+0.089
ere+0.087
el+0.086
Neg logit contributions
ding-0.118
tr-0.109
whe-0.102
fire-0.102
t-0.100
Token named
Token ID3706
Feature activation+7.35e+00

Pos logit contributions
els+0.164
ened+0.156
 opinions+0.156
ubby+0.156
 angles+0.154
Neg logit contributions
tr-0.101
ding-0.100
fire-0.087
storm-0.081
t-0.078
Token Ci
Token ID37685
Feature activation+1.27e+01

Pos logit contributions
els+0.191
ened+0.178
ubby+0.175
en+0.174
 opinions+0.172
Neg logit contributions
tr-0.153
ding-0.148
t-0.130
whe-0.122
storm-0.116
Tokenara
Token ID3301
Feature activation+7.41e+00

Pos logit contributions
els+0.190
 opinions+0.175
ened+0.168
 angles+0.168
ubby+0.161
Neg logit contributions
ding-0.387
tr-0.345
t-0.327
ache-0.323
fire-0.322
Token stumbled
Token ID24241
Feature activation+5.94e+00

Pos logit contributions
els+0.172
ubby+0.160
ened+0.158
en+0.157
ere+0.154
Neg logit contributions
ding-0.166
tr-0.157
fire-0.144
t-0.137
storm-0.136
Token upon
Token ID2402
Feature activation+3.42e+00

Pos logit contributions
els+0.132
en+0.125
ubby+0.124
el+0.121
ened+0.121
Neg logit contributions
ding-0.136
tr-0.127
fire-0.123
storm-0.117
whe-0.114
Token a
Token ID257
Feature activation-2.18e+00

Pos logit contributions
els+0.041
en+0.040
ened+0.038
el+0.038
 opinions+0.037
Neg logit contributions
ding-0.110
fire-0.102
tr-0.102
storm-0.096
whe-0.096
Token lovely
Token ID14081
Feature activation+3.01e+00

Pos logit contributions
ding+0.058
whe+0.057
uts+0.056
fire+0.055
t+0.054
Neg logit contributions
ubby-0.035
 me-0.035
el-0.034
ere-0.032
els-0.031
Token it
Token ID340
Feature activation-4.75e+00

Pos logit contributions
ding+0.094
tr+0.090
fire+0.081
whe+0.080
storm+0.079
Neg logit contributions
els-0.071
en-0.069
el-0.068
ened-0.067
 opinions-0.065
Token seemed
Token ID3947
Feature activation-1.92e+00

Pos logit contributions
ding+0.114
tr+0.105
fire+0.098
whe+0.096
storm+0.093
Neg logit contributions
els-0.102
en-0.097
ened-0.096
ubby-0.094
el-0.094
Token magical
Token ID10883
Feature activation+9.49e-01

Pos logit contributions
ding+0.045
tr+0.042
fire+0.038
whe+0.038
storm+0.037
Neg logit contributions
els-0.042
en-0.040
ened-0.040
ubby-0.040
ere-0.039
Token!
Token ID0
Feature activation-3.76e+00

Pos logit contributions
els+0.021
en+0.020
ubby+0.020
ened+0.019
ere+0.019
Neg logit contributions
ding-0.022
tr-0.021
fire-0.019
whe-0.018
storm-0.018
Token<|endoftext|>
Token ID50256
Feature activation+7.16e+00

Pos logit contributions
ding+0.097
tr+0.091
fire+0.088
storm+0.085
 numb+0.084
Neg logit contributions
els-0.069
el-0.067
en-0.066
ubby-0.066
ere-0.066
TokenSus
Token ID30746
Feature activation+1.26e+01

Pos logit contributions
els+0.134
en+0.123
ubby+0.123
ere+0.120
ened+0.120
Neg logit contributions
ding-0.192
tr-0.181
fire-0.171
storm-0.164
whe-0.164
Tokenie
Token ID494
Feature activation+7.59e+00

Pos logit contributions
 delight+0.255
 opinions+0.251
eful+0.244
 agreement+0.241
 amaz+0.240
Neg logit contributions
ding-0.284
tr-0.232
t-0.214
ache-0.212
o-0.200
Token was
Token ID373
Feature activation+5.12e+00

Pos logit contributions
els+0.168
ubby+0.152
 angles+0.147
 opinions+0.147
en+0.145
Neg logit contributions
ding-0.180
tr-0.164
fire-0.154
storm-0.150
t-0.146
Token very
Token ID845
Feature activation+7.22e-01

Pos logit contributions
els+0.123
en+0.113
ened+0.112
 opinions+0.110
 angles+0.108
Neg logit contributions
ding-0.113
tr-0.105
fire-0.095
whe-0.094
storm-0.091
Token proud
Token ID6613
Feature activation+4.85e+00

Pos logit contributions
els+0.019
en+0.018
ened+0.017
 opinions+0.017
ere+0.017
Neg logit contributions
ding-0.014
tr-0.012
fire-0.012
t-0.012
 numb-0.011
Token.
Token ID13
Feature activation-9.61e-01

Pos logit contributions
els+0.101
en+0.095
ened+0.093
el+0.092
ubby+0.091
Neg logit contributions
ding-0.122
tr-0.114
fire-0.108
storm-0.102
whe-0.101
Token her
Token ID607
Feature activation-1.52e+00

Pos logit contributions
els+0.004
en+0.004
ubby+0.004
ened+0.004
ere+0.004
Neg logit contributions
ding-0.008
tr-0.007
fire-0.007
storm-0.007
whe-0.007
Token brown
Token ID7586
Feature activation-5.81e-01

Pos logit contributions
ding+0.038
tr+0.035
fire+0.033
storm+0.032
whe+0.032
Neg logit contributions
els-0.033
ubby-0.030
ened-0.030
en-0.030
 opinions-0.029
Token fur
Token ID9230
Feature activation-5.68e+00

Pos logit contributions
ding+0.014
tr+0.013
fire+0.012
storm+0.012
t+0.012
Neg logit contributions
els-0.012
en-0.011
ubby-0.011
ere-0.011
ened-0.011
Token.
Token ID13
Feature activation-3.45e+00

Pos logit contributions
ding+0.144
tr+0.127
fire+0.122
whe+0.121
 villains+0.118
Neg logit contributions
els-0.109
en-0.108
ened-0.105
el-0.102
ubby-0.095
Token<|endoftext|>
Token ID50256
Feature activation+6.30e+00

Pos logit contributions
ding+0.081
tr+0.075
fire+0.073
storm+0.070
whe+0.067
Neg logit contributions
els-0.074
ere-0.071
ubby-0.070
en-0.070
el-0.069
TokenArk
Token ID42007
Feature activation+1.26e+01

Pos logit contributions
els+0.129
en+0.119
ubby+0.118
ened+0.117
ere+0.115
Neg logit contributions
ding-0.159
tr-0.148
fire-0.140
whe-0.135
storm-0.134
Tokenie
Token ID494
Feature activation+6.85e+00

Pos logit contributions
 delight+0.258
 amaz+0.252
 opinions+0.239
eful+0.236
ubby+0.236
Neg logit contributions
ding-0.257
tr-0.225
ache-0.217
fire-0.202
t-0.201
Token wanted
Token ID2227
Feature activation+5.04e+00

Pos logit contributions
els+0.146
ubby+0.134
 angles+0.127
en+0.127
 opinions+0.125
Neg logit contributions
ding-0.169
tr-0.155
fire-0.148
storm-0.141
t-0.141
Token his
Token ID465
Feature activation+3.10e+00

Pos logit contributions
els+0.112
en+0.105
ened+0.103
 opinions+0.103
ere+0.102
Neg logit contributions
ding-0.118
tr-0.111
fire-0.103
storm-0.100
whe-0.098
Token own
Token ID898
Feature activation+2.50e+00

Pos logit contributions
els+0.059
en+0.056
ubby+0.054
ened+0.054
ere+0.054
Neg logit contributions
ding-0.081
tr-0.076
fire-0.071
whe-0.069
storm-0.068
Token equipment
Token ID5112
Feature activation-3.10e+00

Pos logit contributions
els+0.053
en+0.049
ubby+0.048
ened+0.048
ere+0.047
Neg logit contributions
ding-0.061
tr-0.057
fire-0.054
whe-0.052
storm-0.052
Token lasting
Token ID15727
Feature activation-1.06e+00

Pos logit contributions
els+0.003
ubby+0.003
en+0.003
ened+0.003
ere+0.003
Neg logit contributions
ding-0.005
tr-0.005
fire-0.004
whe-0.004
storm-0.004
Token memories
Token ID9846
Feature activation-5.41e+00

Pos logit contributions
ding+0.030
tr+0.029
fire+0.027
whe+0.026
storm+0.026
Neg logit contributions
els-0.018
en-0.016
ubby-0.016
ened-0.016
ere-0.016
Token.
Token ID13
Feature activation-4.02e+00

Pos logit contributions
ding+0.130
tr+0.125
fire+0.113
storm+0.112
whe+0.108
Neg logit contributions
els-0.113
en-0.113
ened-0.105
el-0.104
ubby-0.103
Token<|endoftext|>
Token ID50256
Feature activation+6.54e+00

Pos logit contributions
ding+0.082
tr+0.075
fire+0.070
whe+0.066
t+0.066
Neg logit contributions
els-0.102
en-0.095
ubby-0.095
ened-0.094
ere-0.093
TokenGr
Token ID8642
Feature activation+1.04e+01

Pos logit contributions
els+0.123
en+0.113
ubby+0.112
ere+0.111
el+0.110
Neg logit contributions
ding-0.174
tr-0.164
fire-0.155
storm-0.149
whe-0.149
Tokenac
Token ID330
Feature activation+1.25e+01

Pos logit contributions
els+0.213
ened+0.201
 opinions+0.192
eful+0.181
 angles+0.181
Neg logit contributions
ding-0.220
t-0.211
tr-0.211
fire-0.203
Snow-0.194
Tokenie
Token ID494
Feature activation+7.66e+00

Pos logit contributions
ubby+0.203
els+0.193
 agreement+0.187
ased+0.183
ened+0.179
Neg logit contributions
ding-0.286
tr-0.279
ache-0.271
t-0.271
Snow-0.269
Token was
Token ID373
Feature activation+5.53e+00

Pos logit contributions
els+0.177
ubby+0.161
 angles+0.153
 opinions+0.153
eful+0.151
Neg logit contributions
ding-0.174
tr-0.158
fire-0.145
storm-0.144
t-0.142
Token a
Token ID257
Feature activation-6.51e-01

Pos logit contributions
els+0.130
en+0.119
ened+0.117
 opinions+0.115
ubby+0.115
Neg logit contributions
ding-0.127
tr-0.117
fire-0.107
whe-0.105
storm-0.103
Token gifted
Token ID22527
Feature activation+5.10e+00

Pos logit contributions
ding+0.020
tr+0.019
fire+0.019
whe+0.018
uts+0.018
Neg logit contributions
ubby-0.009
els-0.008
el-0.008
en-0.008
ere-0.008
Token girl
Token ID2576
Feature activation+3.48e+00

Pos logit contributions
els+0.086
en+0.078
ubby+0.078
ened+0.076
ere+0.076
Neg logit contributions
ding-0.146
tr-0.138
fire-0.131
whe-0.127
storm-0.126
Token there
Token ID612
Feature activation+5.08e+00

Pos logit contributions
els+0.127
 opinions+0.119
en+0.119
ened+0.116
ere+0.113
Neg logit contributions
ding-0.113
tr-0.109
t-0.101
fire-0.098
storm-0.091
Token were
Token ID547
Feature activation+5.29e+00

Pos logit contributions
els+0.141
ubby+0.135
en+0.126
elling+0.126
ere+0.125
Neg logit contributions
ding-0.091
tr-0.084
t-0.079
whe-0.075
fire-0.075
Token two
Token ID734
Feature activation+6.52e+00

Pos logit contributions
els+0.083
en+0.073
ened+0.073
 angles+0.070
ere+0.064
Neg logit contributions
ding-0.163
tr-0.149
t-0.143
whe-0.141
fire-0.140
Token friends
Token ID2460
Feature activation+4.14e+00

Pos logit contributions
els+0.110
en+0.101
ened+0.099
ubby+0.099
ere+0.097
Neg logit contributions
ding-0.186
tr-0.175
fire-0.166
whe-0.161
storm-0.160
Token,
Token ID11
Feature activation+1.83e+00

Pos logit contributions
els+0.096
ened+0.091
en+0.090
ubby+0.089
ere+0.088
Neg logit contributions
ding-0.090
tr-0.084
fire-0.079
whe-0.076
t-0.075
Token Ab
Token ID2275
Feature activation+1.25e+01

Pos logit contributions
els+0.030
ubby+0.028
en+0.027
ere+0.027
el+0.027
Neg logit contributions
ding-0.053
tr-0.049
fire-0.048
storm-0.046
t-0.046
Tokenig
Token ID328
Feature activation+3.56e+00

Pos logit contributions
ened+0.247
 opinions+0.242
els+0.223
 angles+0.220
en+0.220
Neg logit contributions
ding-0.296
tr-0.265
ache-0.241
t-0.239
Snow-0.228
Tokenail
Token ID603
Feature activation+7.23e+00

Pos logit contributions
els+0.065
en+0.059
ubby+0.058
ened+0.058
ere+0.057
Neg logit contributions
ding-0.098
tr-0.092
fire-0.087
t-0.084
whe-0.084
Token and
Token ID290
Feature activation+1.07e+00

Pos logit contributions
els+0.132
en+0.120
ubby+0.120
ened+0.119
ere+0.117
Neg logit contributions
ding-0.197
tr-0.185
fire-0.176
t-0.169
whe-0.168
Token Mat
Token ID6550
Feature activation+7.57e+00

Pos logit contributions
els+0.021
ubby+0.018
en+0.018
ened+0.018
ere+0.018
Neg logit contributions
ding-0.028
tr-0.027
fire-0.025
whe-0.025
storm-0.025
Tokenty
Token ID774
Feature activation+5.08e+00

Pos logit contributions
els+0.128
ubby+0.116
en+0.114
ened+0.114
ere+0.111
Neg logit contributions
ding-0.218
tr-0.204
fire-0.194
whe-0.189
t-0.188
Token two
Token ID734
Feature activation+6.95e+00

Pos logit contributions
els+0.113
en+0.100
ened+0.100
 angles+0.096
ubby+0.094
Neg logit contributions
ding-0.183
tr-0.163
whe-0.158
t-0.158
fire-0.154
Token little
Token ID1310
Feature activation+7.25e+00

Pos logit contributions
els+0.113
ubby+0.102
en+0.101
el+0.098
ere+0.098
Neg logit contributions
ding-0.207
tr-0.195
fire-0.187
storm-0.180
whe-0.179
Token girls
Token ID4813
Feature activation+3.45e+00

Pos logit contributions
els+0.123
en+0.115
ened+0.112
ubby+0.111
ere+0.111
Neg logit contributions
ding-0.200
tr-0.186
fire-0.176
whe-0.175
storm-0.171
Token,
Token ID11
Feature activation+2.17e+00

Pos logit contributions
els+0.066
en+0.060
ubby+0.060
ened+0.059
ere+0.058
Neg logit contributions
ding-0.092
tr-0.087
fire-0.082
storm-0.079
whe-0.079
Token named
Token ID3706
Feature activation+7.33e+00

Pos logit contributions
els+0.035
ubby+0.032
en+0.032
ere+0.031
el+0.031
Neg logit contributions
ding-0.064
tr-0.059
fire-0.057
storm-0.055
t-0.054
Token Ab
Token ID2275
Feature activation+1.25e+01

Pos logit contributions
els+0.187
ened+0.176
ubby+0.173
eled+0.171
ere+0.170
Neg logit contributions
ding-0.141
tr-0.140
t-0.119
whe-0.118
storm-0.113
Tokenbie
Token ID12590
Feature activation+7.24e+00

Pos logit contributions
 opinions+0.239
ened+0.238
 ideas+0.217
els+0.212
efully+0.211
Neg logit contributions
ding-0.302
tr-0.268
ache-0.246
t-0.243
Snow-0.228
Token and
Token ID290
Feature activation+1.03e+00

Pos logit contributions
els+0.111
en+0.101
el+0.099
ubby+0.099
ere+0.096
Neg logit contributions
ding-0.221
tr-0.207
fire-0.199
whe-0.196
storm-0.195
Token Daisy
Token ID40355
Feature activation+6.85e+00

Pos logit contributions
els+0.016
ubby+0.014
en+0.013
ened+0.013
ere+0.013
Neg logit contributions
ding-0.032
tr-0.030
fire-0.028
whe-0.028
storm-0.028
Token.
Token ID13
Feature activation-4.85e-01

Pos logit contributions
els+0.153
en+0.145
el+0.142
ubby+0.142
ened+0.141
Neg logit contributions
ding-0.161
tr-0.143
fire-0.139
storm-0.136
whe-0.131
Token Ab
Token ID2275
Feature activation+1.06e+01

Pos logit contributions
ding+0.005
tr+0.005
whe+0.004
fall+0.004
uts+0.004
Neg logit contributions
els-0.016
ubby-0.015
ened-0.015
ant-0.014
ere-0.014
Token never
Token ID1239
Feature activation+3.77e-01

Pos logit contributions
ding+0.026
tr+0.025
fire+0.023
whe+0.022
storm+0.022
Neg logit contributions
els-0.022
en-0.020
ened-0.020
ubby-0.020
ere-0.020
Token forgot
Token ID16453
Feature activation+1.83e+00

Pos logit contributions
en+0.008
els+0.008
ened+0.008
ubby+0.007
 opinions+0.007
Neg logit contributions
ding-0.009
tr-0.008
fire-0.008
whe-0.008
storm-0.007
Token it
Token ID340
Feature activation-4.82e+00

Pos logit contributions
els+0.029
en+0.027
ened+0.026
ere+0.025
ubby+0.025
Neg logit contributions
ding-0.055
tr-0.053
fire-0.050
storm-0.048
whe-0.048
Token.
Token ID13
Feature activation-2.82e+00

Pos logit contributions
ding+0.120
tr+0.112
fire+0.103
whe+0.100
storm+0.099
Neg logit contributions
els-0.097
en-0.096
ened-0.092
 opinions-0.090
el-0.088
Token<|endoftext|>
Token ID50256
Feature activation+6.71e+00

Pos logit contributions
ding+0.076
tr+0.074
fire+0.072
ache+0.068
storm+0.068
Neg logit contributions
els-0.047
en-0.047
ere-0.045
ubby-0.045
el-0.045
TokenJan
Token ID12128
Feature activation+1.24e+01

Pos logit contributions
els+0.149
en+0.138
ubby+0.137
ere+0.135
ened+0.135
Neg logit contributions
ding-0.157
tr-0.147
fire-0.137
storm-0.132
whe-0.131
Tokenie
Token ID494
Feature activation+7.42e+00

Pos logit contributions
ubby+0.212
els+0.205
 forces+0.199
 opinions+0.197
eful+0.197
Neg logit contributions
ding-0.307
tr-0.277
ache-0.276
ongo-0.262
t-0.250
Token and
Token ID290
Feature activation+1.60e+00

Pos logit contributions
els+0.171
ubby+0.156
 angles+0.149
en+0.148
 opinions+0.148
Neg logit contributions
ding-0.171
tr-0.155
fire-0.145
storm-0.143
t-0.139
Token John
Token ID1757
Feature activation+8.70e+00

Pos logit contributions
els+0.024
ubby+0.022
en+0.022
ened+0.021
ere+0.021
Neg logit contributions
ding-0.049
tr-0.046
fire-0.044
whe-0.043
storm-0.043
Token were
Token ID547
Feature activation+4.84e+00

Pos logit contributions
els+0.209
ubby+0.196
ere+0.190
en+0.189
ened+0.188
Neg logit contributions
ding-0.189
tr-0.171
fire-0.157
storm-0.153
whe-0.152
Token best
Token ID1266
Feature activation+5.15e+00

Pos logit contributions
els+0.109
en+0.101
ened+0.099
 opinions+0.096
ere+0.096
Neg logit contributions
ding-0.115
tr-0.103
whe-0.094
t-0.092
fire-0.092
Token time
Token ID640
Feature activation+4.28e-01

Pos logit contributions
els+0.029
ubby+0.025
en+0.024
ened+0.024
el+0.024
Neg logit contributions
ding-0.043
tr-0.040
fire-0.039
storm-0.038
t-0.038
Token!
Token ID0
Feature activation-3.22e+00

Pos logit contributions
els+0.009
en+0.009
ubby+0.009
ened+0.008
el+0.008
Neg logit contributions
ding-0.010
tr-0.010
fire-0.009
whe-0.009
storm-0.009
Tokenâ
Token ID22940
Feature activation+3.44e+00

Pos logit contributions
ding+0.075
tr+0.069
fire+0.065
storm+0.063
whe+0.061
Neg logit contributions
els-0.071
ubby-0.067
en-0.067
ere-0.066
el-0.066
Token
Token ID26391
Feature activation+7.22e-02

Pos logit contributions
els+0.080
en+0.074
ubby+0.074
ened+0.073
el+0.072
Neg logit contributions
ding-0.078
tr-0.072
fire-0.068
whe-0.065
storm-0.065
Token<|endoftext|>
Token ID50256
Feature activation+6.71e+00

Pos logit contributions
els+0.001
ere+0.001
ubby+0.001
en+0.001
el+0.001
Neg logit contributions
tr-0.002
ding-0.002
storm-0.001
fire-0.001
 numb-0.001
TokenPa
Token ID28875
Feature activation+1.24e+01

Pos logit contributions
els+0.129
en+0.118
ubby+0.117
ened+0.115
ere+0.114
Neg logit contributions
ding-0.178
tr-0.167
fire-0.158
whe-0.152
storm-0.152
Tokenolo
Token ID14057
Feature activation+5.04e+00

Pos logit contributions
els+0.271
 opinions+0.255
en+0.244
 angles+0.243
ened+0.240
Neg logit contributions
ding-0.304
tr-0.257
t-0.249
fire-0.233
whe-0.233
Token was
Token ID373
Feature activation+4.90e+00

Pos logit contributions
els+0.103
ubby+0.095
en+0.091
ened+0.091
 angles+0.090
Neg logit contributions
ding-0.127
tr-0.118
fire-0.111
storm-0.107
t-0.106
Token very
Token ID845
Feature activation+4.10e-01

Pos logit contributions
els+0.117
en+0.108
ened+0.107
ubby+0.106
ere+0.104
Neg logit contributions
ding-0.109
tr-0.101
fire-0.093
whe-0.090
storm-0.088
Token excited
Token ID6568
Feature activation+6.81e+00

Pos logit contributions
els+0.010
en+0.010
ened+0.009
ere+0.009
el+0.009
Neg logit contributions
ding-0.009
tr-0.008
fire-0.007
t-0.007
whe-0.007
Token.
Token ID13
Feature activation-8.87e-01

Pos logit contributions
els+0.153
en+0.146
ened+0.142
ubby+0.141
el+0.141
Neg logit contributions
ding-0.155
tr-0.147
fire-0.137
whe-0.130
storm-0.129
Token of
Token ID286
Feature activation-3.79e-01

Pos logit contributions
ding+0.079
tr+0.071
fire+0.068
t+0.066
whe+0.064
Neg logit contributions
els-0.087
ere-0.080
en-0.080
ubby-0.080
ened-0.079
Token magic
Token ID5536
Feature activation-3.12e+00

Pos logit contributions
ding+0.010
tr+0.009
fire+0.009
whe+0.008
t+0.008
Neg logit contributions
els-0.008
ubby-0.007
en-0.007
ere-0.007
ened-0.007
Token.
Token ID13
Feature activation-3.09e+00

Pos logit contributions
ding+0.075
tr+0.069
fire+0.065
whe+0.063
storm+0.062
Neg logit contributions
els-0.068
en-0.064
ubby-0.063
ened-0.062
el-0.062
Token<|endoftext|>
Token ID50256
Feature activation+6.70e+00

Pos logit contributions
ding+0.070
tr+0.064
fire+0.060
storm+0.059
whe+0.057
Neg logit contributions
els-0.071
en-0.067
el-0.066
ubby-0.066
ere-0.066
TokenIsa
Token ID39443
Feature activation+1.14e+01

Pos logit contributions
els+0.148
en+0.138
ubby+0.137
ere+0.135
el+0.135
Neg logit contributions
ding-0.157
tr-0.147
fire-0.137
storm-0.132
whe-0.131
Tokenac
Token ID330
Feature activation+1.23e+01

Pos logit contributions
els+0.260
ened+0.246
 opinions+0.239
 girls+0.234
ere+0.234
Neg logit contributions
tr-0.233
ding-0.216
fire-0.209
t-0.196
age-0.185
Token was
Token ID373
Feature activation+6.50e+00

Pos logit contributions
ubby+0.199
els+0.191
 agreement+0.190
 amaz+0.174
ened+0.174
Neg logit contributions
tr-0.305
ding-0.302
t-0.283
age-0.279
ache-0.278
Token preparing
Token ID10629
Feature activation+4.58e+00

Pos logit contributions
els+0.163
en+0.148
ened+0.146
 opinions+0.146
el+0.144
Neg logit contributions
ding-0.140
tr-0.126
fire-0.117
whe-0.113
t-0.110
Token breakfast
Token ID12607
Feature activation-1.58e+00

Pos logit contributions
els+0.098
ubby+0.092
en+0.090
ened+0.089
ere+0.088
Neg logit contributions
ding-0.111
tr-0.101
fire-0.098
whe-0.092
t-0.092
Token in
Token ID287
Feature activation+4.35e+00

Pos logit contributions
ding+0.039
tr+0.038
storm+0.035
 villains+0.035
fire+0.034
Neg logit contributions
els-0.031
en-0.029
el-0.029
ere-0.028
ened-0.027
Token the
Token ID262
Feature activation+8.16e-01

Pos logit contributions
els+0.090
ubby+0.086
en+0.081
ere+0.080
ened+0.079
Neg logit contributions
ding-0.105
tr-0.094
fire-0.093
whe-0.091
 numb-0.088
Token from
Token ID422
Feature activation+3.07e+00

Pos logit contributions
ding+0.035
tr+0.033
storm+0.031
fire+0.029
 numb+0.029
Neg logit contributions
en-0.020
els-0.019
ubby-0.019
ened-0.018
el-0.018
Token now
Token ID783
Feature activation+9.34e-01

Pos logit contributions
els+0.055
en+0.050
ubby+0.050
ened+0.049
ere+0.049
Neg logit contributions
ding-0.085
tr-0.080
fire-0.076
whe-0.073
storm-0.073
Token on
Token ID319
Feature activation+4.30e+00

Pos logit contributions
els+0.020
en+0.020
ened+0.020
el+0.019
ubby+0.019
Neg logit contributions
ding-0.022
tr-0.021
fire-0.019
whe-0.018
t-0.018
Token.
Token ID13
Feature activation-3.25e+00

Pos logit contributions
els+0.093
en+0.089
ubby+0.087
el+0.086
ened+0.086
Neg logit contributions
ding-0.105
tr-0.099
fire-0.093
whe-0.088
storm-0.087
Token<|endoftext|>
Token ID50256
Feature activation+6.41e+00

Pos logit contributions
ding+0.080
tr+0.075
fire+0.073
storm+0.070
whe+0.067
Neg logit contributions
els-0.064
ere-0.062
ubby-0.062
en-0.062
el-0.062
TokenZ
Token ID57
Feature activation+1.23e+01

Pos logit contributions
els+0.148
en+0.138
ubby+0.137
ere+0.137
el+0.136
Neg logit contributions
ding-0.143
tr-0.134
fire-0.124
storm-0.120
whe-0.118
Tokenoe
Token ID2577
Feature activation+9.56e+00

Pos logit contributions
els+0.276
 opinions+0.265
ened+0.253
en+0.250
 girls+0.249
Neg logit contributions
ding-0.287
t-0.264
tr-0.226
ongo-0.212
whe-0.211
Token was
Token ID373
Feature activation+6.21e+00

Pos logit contributions
els+0.204
ubby+0.189
en+0.183
ere+0.181
ened+0.181
Neg logit contributions
ding-0.236
tr-0.217
fire-0.202
t-0.196
whe-0.195
Token playing
Token ID2712
Feature activation+4.20e+00

Pos logit contributions
els+0.141
en+0.129
ened+0.127
ubby+0.125
el+0.124
Neg logit contributions
ding-0.147
tr-0.135
fire-0.125
whe-0.122
storm-0.120
Token in
Token ID287
Feature activation+4.13e+00

Pos logit contributions
els+0.098
ubby+0.091
en+0.091
ened+0.090
el+0.089
Neg logit contributions
ding-0.094
tr-0.087
fire-0.083
whe-0.079
storm-0.078
Token the
Token ID262
Feature activation+4.61e-01

Pos logit contributions
els+0.082
ubby+0.080
ened+0.075
elling+0.074
 opinions+0.072
Neg logit contributions
ding-0.101
whe-0.093
fire-0.089
 he-0.089
tr-0.089
Token a
Token ID257
Feature activation+3.55e+00

Pos logit contributions
els+0.052
 opinions+0.050
el+0.049
en+0.048
ere+0.048
Neg logit contributions
ding-0.180
tr-0.172
storm-0.167
fire-0.165
ache-0.158
Token time
Token ID640
Feature activation+6.74e+00

Pos logit contributions
els+0.054
ubby+0.049
en+0.049
ere+0.047
ened+0.047
Neg logit contributions
ding-0.108
tr-0.102
fire-0.097
whe-0.094
storm-0.094
Token two
Token ID734
Feature activation+7.78e+00

Pos logit contributions
els+0.121
en+0.111
ened+0.109
 opinions+0.106
ubby+0.103
Neg logit contributions
ding-0.192
tr-0.178
t-0.174
fire-0.167
storm-0.163
Token friends
Token ID2460
Feature activation+5.70e+00

Pos logit contributions
els+0.132
en+0.121
ened+0.118
ubby+0.117
ere+0.116
Neg logit contributions
ding-0.221
tr-0.208
fire-0.196
whe-0.192
storm-0.190
Token,
Token ID11
Feature activation+3.53e+00

Pos logit contributions
 angles+0.139
els+0.134
ened+0.133
ishment+0.132
elling+0.131
Neg logit contributions
 he-0.088
tr-0.087
ding-0.085
t-0.081
 much-0.079
Token Jas
Token ID21961
Feature activation+1.22e+01

Pos logit contributions
els+0.067
ubby+0.061
en+0.061
ere+0.060
ened+0.060
Neg logit contributions
ding-0.094
tr-0.088
fire-0.083
storm-0.081
whe-0.080
Tokenmine
Token ID3810
Feature activation+7.07e+00

Pos logit contributions
 opinions+0.281
els+0.271
ubby+0.270
eful+0.268
efully+0.266
Neg logit contributions
ding-0.268
t-0.223
whe-0.194
tr-0.188
storm-0.186
Token and
Token ID290
Feature activation+6.11e-01

Pos logit contributions
els+0.134
ened+0.123
ubby+0.122
en+0.121
ere+0.117
Neg logit contributions
ding-0.189
tr-0.174
fire-0.167
t-0.163
storm-0.158
Token John
Token ID1757
Feature activation+7.61e+00

Pos logit contributions
els+0.013
ubby+0.012
en+0.012
ere+0.011
el+0.011
Neg logit contributions
ding-0.015
tr-0.014
whe-0.013
fire-0.013
storm-0.013
Token,
Token ID11
Feature activation+1.41e+00

Pos logit contributions
els+0.178
ubby+0.165
en+0.164
ened+0.161
ere+0.161
Neg logit contributions
ding-0.172
tr-0.158
fire-0.147
whe-0.142
t-0.140
Token were
Token ID547
Feature activation+3.43e+00

Pos logit contributions
els+0.025
en+0.025
ened+0.025
ere+0.025
 opinions+0.024
Neg logit contributions
ding-0.035
fire-0.033
tr-0.033
 numb-0.032
ache-0.031
Token still
Token ID991
Feature activation-2.51e+00

Pos logit contributions
ding+0.111
tr+0.104
fire+0.094
storm+0.091
whe+0.091
Neg logit contributions
els-0.092
en-0.090
el-0.087
ubby-0.084
ere-0.083
Token bl
Token ID698
Feature activation-5.03e+00

Pos logit contributions
ding+0.062
tr+0.058
fire+0.051
storm+0.051
whe+0.051
Neg logit contributions
els-0.053
en-0.053
el-0.050
ened-0.049
 opinions-0.049
Tokenossom
Token ID25548
Feature activation-4.29e+00

Pos logit contributions
ding+0.091
tr+0.084
storm+0.078
fire+0.076
whe+0.075
Neg logit contributions
els-0.135
en-0.132
ubby-0.132
ere-0.127
el-0.126
Token.
Token ID13
Feature activation-3.09e+00

Pos logit contributions
ding+0.108
tr+0.100
fire+0.092
storm+0.091
whe+0.091
Neg logit contributions
els-0.089
en-0.086
ened-0.081
el-0.080
 opinions-0.078
Token<|endoftext|>
Token ID50256
Feature activation+6.76e+00

Pos logit contributions
ding+0.078
tr+0.071
fire+0.070
storm+0.067
 numb+0.066
Neg logit contributions
els-0.060
ere-0.059
el-0.057
en-0.057
ened-0.056
TokenSus
Token ID30746
Feature activation+1.22e+01

Pos logit contributions
els+0.142
en+0.132
ubby+0.131
ere+0.129
ened+0.129
Neg logit contributions
ding-0.165
tr-0.155
fire-0.145
storm-0.140
whe-0.139
Tokenie
Token ID494
Feature activation+7.74e+00

Pos logit contributions
 delight+0.255
 opinions+0.255
 amaz+0.249
eful+0.244
 example+0.241
Neg logit contributions
ding-0.265
tr-0.215
ache-0.200
t-0.200
ongo-0.184
Token and
Token ID290
Feature activation+2.55e+00

Pos logit contributions
els+0.189
ubby+0.173
 opinions+0.166
 angles+0.165
en+0.162
Neg logit contributions
ding-0.168
tr-0.150
fire-0.143
storm-0.137
t-0.135
Token her
Token ID607
Feature activation+5.50e+00

Pos logit contributions
els+0.044
ubby+0.041
en+0.038
ened+0.037
ere+0.036
Neg logit contributions
ding-0.073
tr-0.068
whe-0.064
fire-0.063
storm-0.062
Token mom
Token ID1995
Feature activation+5.80e+00

Pos logit contributions
els+0.096
ubby+0.088
en+0.086
el+0.085
ere+0.085
Neg logit contributions
ding-0.156
tr-0.145
fire-0.140
storm-0.135
whe-0.134
Token were
Token ID547
Feature activation+5.65e+00

Pos logit contributions
els+0.125
ubby+0.124
 angles+0.116
eful+0.113
 opinions+0.112
Neg logit contributions
ding-0.141
tr-0.129
whe-0.111
storm-0.110
t-0.104
Token."
Token ID526
Feature activation-1.26e+00

Pos logit contributions
els+0.024
en+0.022
el+0.022
ubby+0.022
ened+0.021
Neg logit contributions
ding-0.026
tr-0.024
fire-0.023
whe-0.022
storm-0.022
Token<|endoftext|>
Token ID50256
Feature activation+6.60e+00

Pos logit contributions
ding+0.033
tr+0.031
fire+0.029
storm+0.028
whe+0.028
Neg logit contributions
els-0.025
el-0.023
ubby-0.023
ere-0.023
 asks-0.023
Token<|endoftext|>
Token ID50256
Feature activation+8.18e+00

Pos logit contributions
els+0.142
en+0.132
ubby+0.131
ere+0.130
el+0.130
Neg logit contributions
ding-0.159
tr-0.149
fire-0.139
storm-0.135
whe-0.133
Token<|endoftext|>
Token ID50256
Feature activation+9.05e+00

Pos logit contributions
els+0.176
en+0.164
ubby+0.163
ere+0.161
el+0.160
Neg logit contributions
ding-0.196
tr-0.184
fire-0.172
storm-0.166
whe-0.164
Token<|endoftext|>
Token ID50256
Feature activation+9.52e+00

Pos logit contributions
els+0.189
en+0.175
ubby+0.174
ere+0.171
el+0.170
Neg logit contributions
ding-0.224
tr-0.210
fire-0.197
storm-0.190
whe-0.189
TokenAnna
Token ID31160
Feature activation+1.21e+01

Pos logit contributions
els+0.195
en+0.179
ubby+0.179
ere+0.176
el+0.175
Neg logit contributions
ding-0.240
tr-0.225
fire-0.211
storm-0.203
whe-0.203
Token and
Token ID290
Feature activation+6.43e+00

Pos logit contributions
els+0.292
ened+0.264
ubby+0.263
 angles+0.263
 opinions+0.257
Neg logit contributions
ding-0.260
tr-0.243
fire-0.223
t-0.214
ache-0.204
Token Ben
Token ID3932
Feature activation+1.15e+01

Pos logit contributions
els+0.109
en+0.096
ened+0.095
ubby+0.094
 angles+0.094
Neg logit contributions
ding-0.191
tr-0.176
storm-0.164
whe-0.164
t-0.164
Token are
Token ID389
Feature activation+8.57e+00

Pos logit contributions
els+0.257
ubby+0.232
ened+0.230
en+0.227
 angles+0.224
Neg logit contributions
ding-0.274
tr-0.252
fire-0.234
t-0.228
whe-0.221
Token playing
Token ID2712
Feature activation+5.21e+00

Pos logit contributions
els+0.187
en+0.180
ened+0.178
 angles+0.167
 opinions+0.163
Neg logit contributions
ding-0.210
tr-0.191
t-0.174
fire-0.173
whe-0.170
Token with
Token ID351
Feature activation+5.94e+00

Pos logit contributions
els+0.115
en+0.106
ubby+0.106
ened+0.105
ere+0.104
Neg logit contributions
ding-0.124
tr-0.115
fire-0.108
whe-0.104
storm-0.103
Token,
Token ID11
Feature activation+5.16e+00

Pos logit contributions
els+0.084
en+0.074
ubby+0.071
 opinions+0.071
ened+0.070
Neg logit contributions
ding-0.161
tr-0.148
fire-0.143
t-0.142
storm-0.139
Token a
Token ID257
Feature activation+1.10e+00

Pos logit contributions
els+0.103
en+0.094
ubby+0.093
ened+0.091
ere+0.091
Neg logit contributions
ding-0.136
tr-0.126
fire-0.119
t-0.115
whe-0.115
Token little
Token ID1310
Feature activation+5.45e+00

Pos logit contributions
ubby+0.010
ere+0.007
els+0.006
el+0.006
 me+0.005
Neg logit contributions
ding-0.040
tr-0.039
uts-0.038
whe-0.037
t-0.037
Token girl
Token ID2576
Feature activation+5.69e+00

Pos logit contributions
ubby+0.086
els+0.085
el+0.081
en+0.080
ere+0.080
Neg logit contributions
ding-0.159
tr-0.153
fire-0.144
whe-0.140
t-0.140
Token named
Token ID3706
Feature activation+7.25e+00

Pos logit contributions
els+0.166
 angles+0.159
 opinions+0.157
ubby+0.157
efully+0.152
Neg logit contributions
ding-0.085
tr-0.085
fire-0.073
storm-0.069
t-0.067
Token Ab
Token ID2275
Feature activation+1.21e+01

Pos logit contributions
els+0.201
ubby+0.182
ened+0.180
en+0.179
eful+0.179
Neg logit contributions
tr-0.137
ding-0.133
t-0.126
whe-0.108
 much-0.106
Tokenig
Token ID328
Feature activation+3.16e+00

Pos logit contributions
 opinions+0.273
 angles+0.247
ened+0.240
 ideas+0.238
els+0.237
Neg logit contributions
ding-0.267
tr-0.236
ache-0.209
t-0.197
ongo-0.194
Tokenail
Token ID603
Feature activation+6.80e+00

Pos logit contributions
els+0.063
 opinions+0.059
ened+0.059
en+0.059
ubby+0.058
Neg logit contributions
ding-0.079
tr-0.076
t-0.070
fire-0.069
storm-0.067
Token went
Token ID1816
Feature activation+6.35e+00

Pos logit contributions
els+0.171
ubby+0.164
 angles+0.161
 opinions+0.156
ened+0.155
Neg logit contributions
ding-0.132
tr-0.124
fire-0.112
t-0.109
storm-0.108
Token to
Token ID284
Feature activation+4.32e+00

Pos logit contributions
els+0.139
en+0.130
ubby+0.129
el+0.127
ere+0.126
Neg logit contributions
ding-0.151
tr-0.139
fire-0.134
storm-0.128
whe-0.126
Token the
Token ID262
Feature activation+1.77e+00

Pos logit contributions
els+0.097
ubby+0.093
ened+0.086
eful+0.085
en+0.083
Neg logit contributions
ding-0.099
tr-0.091
whe-0.087
fire-0.084
 he-0.081
Token crown
Token ID12389
Feature activation-8.42e+00

Pos logit contributions
ding+0.046
tr+0.043
fire+0.040
t+0.038
storm+0.037
Neg logit contributions
els-0.057
ubby-0.053
en-0.053
ere-0.052
ened-0.052
Token back
Token ID736
Feature activation-3.08e+00

Pos logit contributions
ding+0.229
tr+0.212
fire+0.210
storm+0.209
 villains+0.207
Neg logit contributions
els-0.138
el-0.133
en-0.132
ened-0.131
ubby-0.121
Token again
Token ID757
Feature activation-5.43e-01

Pos logit contributions
ding+0.078
tr+0.074
fire+0.070
whe+0.068
storm+0.068
Neg logit contributions
els-0.061
en-0.058
ubby-0.057
ened-0.056
el-0.056
Token!
Token ID0
Feature activation-2.90e+00

Pos logit contributions
ding+0.015
tr+0.014
whe+0.013
storm+0.012
t+0.012
Neg logit contributions
en-0.010
el-0.010
els-0.010
ened-0.009
ere-0.009
Token<|endoftext|>
Token ID50256
Feature activation+6.93e+00

Pos logit contributions
ding+0.077
tr+0.073
fire+0.070
storm+0.069
ache+0.068
Neg logit contributions
els-0.050
ere-0.050
el-0.050
ubby-0.049
en-0.049
TokenEm
Token ID10161
Feature activation+1.21e+01

Pos logit contributions
els+0.140
en+0.131
ubby+0.129
ere+0.129
el+0.128
Neg logit contributions
ding-0.173
tr-0.163
fire-0.153
storm-0.149
whe-0.146
Tokenma
Token ID2611
Feature activation+7.67e+00

Pos logit contributions
els+0.221
 opinions+0.216
ubby+0.209
elling+0.198
 ways+0.191
Neg logit contributions
ding-0.328
tr-0.275
t-0.270
fire-0.261
whe-0.251
Token was
Token ID373
Feature activation+5.43e+00

Pos logit contributions
els+0.183
ubby+0.173
 angles+0.169
 opinions+0.165
eful+0.164
Neg logit contributions
ding-0.168
tr-0.151
storm-0.134
fire-0.134
t-0.129
Token walking
Token ID6155
Feature activation+5.52e+00

Pos logit contributions
els+0.128
en+0.118
ened+0.116
 opinions+0.114
ubby+0.114
Neg logit contributions
ding-0.123
tr-0.113
fire-0.104
whe-0.101
storm-0.100
Token in
Token ID287
Feature activation+5.93e+00

Pos logit contributions
els+0.118
ubby+0.108
en+0.107
ened+0.105
ere+0.105
Neg logit contributions
ding-0.137
tr-0.126
fire-0.120
whe-0.115
t-0.114
Token the
Token ID262
Feature activation+1.45e+00

Pos logit contributions
els+0.113
ubby+0.108
elling+0.098
ened+0.097
en+0.097
Neg logit contributions
ding-0.153
fire-0.137
tr-0.137
whe-0.136
 he-0.131
Token not
Token ID407
Feature activation-2.86e+00

Pos logit contributions
ding+0.028
tr+0.025
fire+0.024
storm+0.024
t+0.023
Neg logit contributions
els-0.024
 opinions-0.023
ere-0.022
en-0.022
ened-0.022
Token played
Token ID2826
Feature activation-3.94e-01

Pos logit contributions
ding+0.073
tr+0.066
fire+0.065
storm+0.062
t+0.061
Neg logit contributions
els-0.059
en-0.054
ened-0.054
 opinions-0.053
el-0.053
Token with
Token ID351
Feature activation-3.62e-01

Pos logit contributions
ding+0.011
tr+0.010
t+0.009
storm+0.009
 lifetime+0.009
Neg logit contributions
els-0.008
en-0.007
el-0.007
 opinions-0.007
ere-0.007
Token the
Token ID262
Feature activation-4.40e+00

Pos logit contributions
ding+0.011
tr+0.010
fire+0.010
storm+0.010
whe+0.010
Neg logit contributions
els-0.006
en-0.005
ubby-0.005
el-0.005
ened-0.005
Token ash
Token ID12530
Feature activation-1.05e+01

Pos logit contributions
ding+0.115
tr+0.105
fire+0.102
whe+0.098
storm+0.097
Neg logit contributions
els-0.088
ubby-0.080
el-0.079
en-0.079
ere-0.078
Tokentr
Token ID2213
Feature activation-1.24e+01

Pos logit contributions
ding+0.119
 villains+0.100
 lifetime+0.097
fire+0.092
vere+0.091
Neg logit contributions
els-0.346
el-0.334
en-0.322
 opinions-0.308
ened-0.307
Tokenay
Token ID323
Feature activation-3.61e+00

Pos logit contributions
fire+0.393
ding+0.379
 villains+0.374
storm+0.353
tr+0.345
Neg logit contributions
els-0.179
el-0.170
en-0.142
ubby-0.140
ened-0.139
Token.
Token ID13
Feature activation-3.94e+00

Pos logit contributions
fire+0.085
ding+0.085
 villains+0.079
tr+0.078
 numb+0.076
Neg logit contributions
els-0.083
el-0.077
en-0.072
ened-0.071
 opinions-0.064
Token They
Token ID1119
Feature activation-2.25e+00

Pos logit contributions
ding+0.092
tr+0.084
fire+0.080
whe+0.077
storm+0.077
Neg logit contributions
els-0.088
ubby-0.082
en-0.082
ened-0.081
el-0.080
Token wished
Token ID16555
Feature activation+1.47e+00

Pos logit contributions
ding+0.067
fire+0.060
tr+0.059
storm+0.057
whe+0.057
Neg logit contributions
els-0.036
 opinions-0.034
ened-0.033
el-0.033
en-0.031
Token they
Token ID484
Feature activation-4.13e+00

Pos logit contributions
els+0.041
en+0.038
ened+0.038
el+0.038
 opinions+0.037
Neg logit contributions
ding-0.027
tr-0.025
fire-0.023
storm-0.022
whe-0.021
Token I
Token ID314
Feature activation-1.93e+00

Pos logit contributions
ding+0.058
tr+0.054
fire+0.051
whe+0.050
storm+0.050
Neg logit contributions
els-0.041
en-0.037
ubby-0.037
ened-0.037
ere-0.037
Token should
Token ID815
Feature activation-6.79e-01

Pos logit contributions
ding+0.050
tr+0.046
fire+0.045
whe+0.043
storm+0.042
Neg logit contributions
els-0.037
en-0.036
ubby-0.035
 opinions-0.035
ened-0.035
Token have
Token ID423
Feature activation+3.48e-02

Pos logit contributions
ding+0.013
tr+0.012
fire+0.011
storm+0.011
t+0.011
Neg logit contributions
els-0.018
ubby-0.017
en-0.017
ened-0.017
ere-0.016
Token stirred
Token ID33091
Feature activation+5.02e-01

Pos logit contributions
els+0.001
en+0.001
ubby+0.001
ened+0.001
ere+0.001
Neg logit contributions
ding-0.001
tr-0.001
fire-0.001
storm-0.001
whe-0.001
Token the
Token ID262
Feature activation-4.76e+00

Pos logit contributions
els+0.008
ubby+0.007
en+0.007
ened+0.007
ere+0.007
Neg logit contributions
ding-0.015
tr-0.014
fire-0.014
whe-0.013
storm-0.013
Token soup
Token ID17141
Feature activation-1.22e+01

Pos logit contributions
ding+0.122
tr+0.110
uts+0.105
fire+0.104
whe+0.103
Neg logit contributions
els-0.094
ubby-0.088
en-0.086
 opinions-0.086
ere-0.084
Token more
Token ID517
Feature activation-6.26e+00

Pos logit contributions
ding+0.308
tr+0.289
whe+0.274
 villains+0.269
fire+0.262
Neg logit contributions
en-0.225
els-0.224
ubby-0.212
el-0.206
ened-0.205
Token slowly
Token ID6364
Feature activation-3.77e+00

Pos logit contributions
ding+0.145
tr+0.138
fire+0.124
whe+0.124
storm+0.121
Neg logit contributions
els-0.138
en-0.132
ubby-0.132
ened-0.128
ere-0.128
Token so
Token ID523
Feature activation-4.66e+00

Pos logit contributions
ding+0.091
tr+0.086
fire+0.080
whe+0.079
storm+0.076
Neg logit contributions
els-0.077
en-0.076
ened-0.073
ubby-0.072
el-0.071
Token it
Token ID340
Feature activation-5.18e+00

Pos logit contributions
ding+0.113
tr+0.109
fire+0.098
storm+0.097
whe+0.097
Neg logit contributions
els-0.095
ubby-0.092
el-0.092
ened-0.090
en-0.090
Token wouldn
Token ID3636
Feature activation-2.18e+00

Pos logit contributions
ding+0.144
tr+0.130
fire+0.127
whe+0.124
storm+0.120
Neg logit contributions
els-0.093
en-0.088
ened-0.085
el-0.083
ubby-0.083
Token.
Token ID13
Feature activation-3.98e+00

Pos logit contributions
els+0.056
en+0.053
ubby+0.052
ened+0.052
el+0.051
Neg logit contributions
ding-0.059
tr-0.056
fire-0.052
storm-0.049
whe-0.048
Token We
Token ID775
Feature activation-1.89e+00

Pos logit contributions
ding+0.080
tr+0.076
fire+0.069
whe+0.066
t+0.065
Neg logit contributions
els-0.103
en-0.096
ubby-0.094
ere-0.094
ened-0.093
Token can
Token ID460
Feature activation-2.40e+00

Pos logit contributions
ding+0.048
tr+0.044
fire+0.043
whe+0.041
storm+0.040
Neg logit contributions
els-0.039
en-0.036
ubby-0.035
ened-0.035
el-0.035
Token share
Token ID2648
Feature activation+2.39e+00

Pos logit contributions
ding+0.057
tr+0.052
fire+0.050
storm+0.047
whe+0.047
Neg logit contributions
els-0.053
en-0.050
el-0.049
ubby-0.049
 opinions-0.048
Token the
Token ID262
Feature activation-5.32e+00

Pos logit contributions
els+0.036
ubby+0.033
en+0.033
ened+0.032
ere+0.032
Neg logit contributions
ding-0.072
tr-0.068
fire-0.065
whe-0.063
storm-0.063
Token mel
Token ID7758
Feature activation-1.21e+01

Pos logit contributions
ding+0.109
tr+0.101
fire+0.093
whe+0.088
storm+0.088
Neg logit contributions
els-0.134
en-0.126
ubby-0.125
ened-0.124
ere-0.123
Tokenon
Token ID261
Feature activation-9.73e+00

Pos logit contributions
 villains+0.288
tr+0.285
ding+0.282
fire+0.276
 numb+0.274
Neg logit contributions
els-0.268
el-0.247
ere-0.225
en-0.215
ubby-0.210
Token."
Token ID526
Feature activation-1.61e+00

Pos logit contributions
ding+0.271
storm+0.244
tr+0.242
fire+0.232
fall+0.231
Neg logit contributions
els-0.176
el-0.159
 opinions-0.153
en-0.153
ubby-0.147
Token Tom
Token ID4186
Feature activation+5.19e+00

Pos logit contributions
ding+0.052
fire+0.051
tr+0.051
storm+0.049
ache+0.047
Neg logit contributions
els-0.021
el-0.019
 asks-0.018
ened-0.018
ere-0.018
Token said
Token ID531
Feature activation+3.82e+00

Pos logit contributions
els+0.116
ubby+0.111
en+0.106
ere+0.105
ened+0.104
Neg logit contributions
ding-0.122
tr-0.114
fire-0.105
whe-0.103
t-0.102
Token.
Token ID13
Feature activation-2.21e+00

Pos logit contributions
els+0.078
ened+0.075
en+0.073
ubby+0.072
el+0.071
Neg logit contributions
ding-0.096
tr-0.092
storm-0.087
fire-0.084
whe-0.079
Token did
Token ID750
Feature activation+9.04e-01

Pos logit contributions
ding+0.039
tr+0.036
t+0.033
fire+0.032
whe+0.032
Neg logit contributions
els-0.040
ubby-0.037
en-0.037
ere-0.037
ened-0.037
Token.
Token ID13
Feature activation-3.31e+00

Pos logit contributions
els+0.022
ubby+0.020
en+0.020
ened+0.020
ere+0.020
Neg logit contributions
ding-0.020
tr-0.018
fire-0.017
t-0.016
whe-0.016
Token You
Token ID921
Feature activation-1.21e+00

Pos logit contributions
ding+0.076
tr+0.074
fire+0.067
whe+0.065
t+0.064
Neg logit contributions
els-0.073
en-0.069
ened-0.069
ere-0.067
ubby-0.066
Token wasted
Token ID18359
Feature activation+1.71e+00

Pos logit contributions
ding+0.029
tr+0.027
fire+0.026
whe+0.025
storm+0.025
Neg logit contributions
els-0.026
ubby-0.024
en-0.024
el-0.023
ere-0.023
Token the
Token ID262
Feature activation-5.27e+00

Pos logit contributions
els+0.027
ubby+0.025
en+0.024
ened+0.023
ere+0.023
Neg logit contributions
ding-0.050
tr-0.048
fire-0.045
whe-0.044
t-0.044
Token soup
Token ID17141
Feature activation-1.20e+01

Pos logit contributions
ding+0.123
tr+0.114
fire+0.107
whe+0.103
storm+0.102
Neg logit contributions
els-0.119
ubby-0.111
en-0.110
ened-0.109
ere-0.108
Token.
Token ID13
Feature activation-3.75e+00

Pos logit contributions
ding+0.345
tr+0.315
whe+0.308
fire+0.302
 villains+0.295
Neg logit contributions
els-0.200
en-0.184
ubby-0.180
el-0.179
ere-0.169
Token You
Token ID921
Feature activation-1.62e+00

Pos logit contributions
ding+0.086
tr+0.085
fire+0.077
t+0.074
whe+0.073
Neg logit contributions
els-0.082
en-0.078
ened-0.077
ere-0.075
ubby-0.075
Token made
Token ID925
Feature activation+1.33e+00

Pos logit contributions
ding+0.041
tr+0.038
fire+0.036
whe+0.035
storm+0.035
Neg logit contributions
els-0.033
ubby-0.030
en-0.030
el-0.030
ere-0.030
Token a
Token ID257
Feature activation-7.12e+00

Pos logit contributions
els+0.028
ubby+0.028
ened+0.026
ere+0.026
en+0.025
Neg logit contributions
ding-0.032
tr-0.030
whe-0.027
fire-0.027
t-0.026
Token mess
Token ID2085
Feature activation-1.61e+00

Pos logit contributions
ding+0.228
uts+0.213
whe+0.211
fire+0.205
t+0.201
Neg logit contributions
 me-0.093
ubby-0.090
 ways-0.081
 agreement-0.080
els-0.079
Token not
Token ID407
Feature activation-7.86e-01

Pos logit contributions
ding+0.007
tr+0.007
fire+0.006
whe+0.006
storm+0.006
Neg logit contributions
els-0.010
en-0.010
ubby-0.010
ened-0.009
ere-0.009
Token play
Token ID711
Feature activation-3.81e-01

Pos logit contributions
ding+0.021
fire+0.018
tr+0.018
 lifetime+0.017
t+0.017
Neg logit contributions
els-0.015
el-0.015
en-0.015
 opinions-0.013
ere-0.013
Token with
Token ID351
Feature activation+2.24e-02

Pos logit contributions
ding+0.010
tr+0.009
t+0.008
storm+0.008
fire+0.008
Neg logit contributions
els-0.008
en-0.007
el-0.007
ere-0.007
 opinions-0.007
Token the
Token ID262
Feature activation-4.31e+00

Pos logit contributions
ened+0.000
en+0.000
els+0.000
ubby+0.000
ere+0.000
Neg logit contributions
ding-0.001
tr-0.001
fire-0.001
whe-0.000
ache-0.000
Token ash
Token ID12530
Feature activation-1.02e+01

Pos logit contributions
ding+0.121
fire+0.106
tr+0.105
whe+0.103
uts+0.103
Neg logit contributions
els-0.081
 opinions-0.072
ubby-0.071
el-0.071
ere-0.068
Tokentr
Token ID2213
Feature activation-1.19e+01

Pos logit contributions
ding+0.124
 numb+0.102
fire+0.097
 villains+0.093
 lifetime+0.090
Neg logit contributions
els-0.333
el-0.319
en-0.315
 me-0.301
 opinions-0.289
Tokenay
Token ID323
Feature activation-3.37e+00

Pos logit contributions
fire+0.391
ding+0.375
 villains+0.373
storm+0.347
 numb+0.345
Neg logit contributions
els-0.160
el-0.154
en-0.124
ubby-0.123
ere-0.112
Token.
Token ID13
Feature activation-4.04e+00

Pos logit contributions
ding+0.085
fire+0.084
tr+0.078
 numb+0.077
 villains+0.076
Neg logit contributions
els-0.070
el-0.064
en-0.063
ened-0.059
ere-0.057
Token It
Token ID632
Feature activation-5.09e+00

Pos logit contributions
ding+0.098
tr+0.097
fire+0.088
t+0.084
whe+0.084
Neg logit contributions
els-0.084
en-0.078
ened-0.078
ere-0.077
ubby-0.076
Token is
Token ID318
Feature activation-3.90e+00

Pos logit contributions
ding+0.163
fire+0.144
whe+0.140
tr+0.138
 numb+0.137
Neg logit contributions
els-0.075
el-0.068
en-0.066
ened-0.063
 opinions-0.062
Token dangerous
Token ID4923
Feature activation+1.68e-01

Pos logit contributions
ding+0.094
whe+0.078
fire+0.077
t+0.077
tr+0.075
Neg logit contributions
els-0.090
ubby-0.082
el-0.081
en-0.081
 opinions-0.080
Token wished
Token ID16555
Feature activation+1.94e+00

Pos logit contributions
ding+0.044
tr+0.040
fire+0.039
whe+0.037
storm+0.037
Neg logit contributions
els-0.031
ened-0.028
en-0.028
el-0.028
ubby-0.027
Token they
Token ID484
Feature activation-3.52e+00

Pos logit contributions
els+0.048
 opinions+0.044
el+0.044
en+0.043
ened+0.043
Neg logit contributions
ding-0.041
tr-0.037
fire-0.036
storm-0.034
age-0.034
Token had
Token ID550
Feature activation-7.91e-01

Pos logit contributions
ding+0.091
tr+0.083
fire+0.079
fall+0.078
whe+0.078
Neg logit contributions
els-0.063
ened-0.061
 opinions-0.058
ered-0.058
el-0.056
Token shared
Token ID4888
Feature activation+2.81e+00

Pos logit contributions
ding+0.019
tr+0.017
fire+0.016
storm+0.016
t+0.016
Neg logit contributions
els-0.017
 opinions-0.016
ened-0.016
en-0.016
ere-0.016
Token the
Token ID262
Feature activation-5.76e+00

Pos logit contributions
els+0.049
en+0.044
ubby+0.042
ened+0.042
el+0.042
Neg logit contributions
ding-0.082
tr-0.076
fire-0.072
storm-0.071
whe-0.070
Token pasta
Token ID26296
Feature activation-1.19e+01

Pos logit contributions
ding+0.135
tr+0.126
fire+0.117
whe+0.113
storm+0.113
Neg logit contributions
els-0.128
ubby-0.119
en-0.119
ened-0.117
ere-0.116
Token.
Token ID13
Feature activation-3.51e+00

Pos logit contributions
ding+0.338
tr+0.312
Snow+0.296
whe+0.293
storm+0.292
Neg logit contributions
els-0.197
en-0.179
 ideas-0.171
 opinions-0.167
 ways-0.167
Token They
Token ID1119
Feature activation-1.44e+00

Pos logit contributions
ding+0.080
tr+0.074
fire+0.069
whe+0.066
storm+0.066
Neg logit contributions
els-0.081
en-0.075
ubby-0.075
ened-0.074
ere-0.074
Token looked
Token ID3114
Feature activation+1.81e-01

Pos logit contributions
ding+0.040
tr+0.036
fire+0.035
whe+0.034
storm+0.034
Neg logit contributions
els-0.027
ened-0.024
en-0.024
el-0.024
 opinions-0.023
Token at
Token ID379
Feature activation+3.07e+00

Pos logit contributions
els+0.004
en+0.003
ubby+0.003
el+0.003
ere+0.003
Neg logit contributions
ding-0.005
tr-0.004
fire-0.004
whe-0.004
storm-0.004
Token the
Token ID262
Feature activation-1.64e+00

Pos logit contributions
els+0.053
ubby+0.052
ere+0.050
ened+0.050
en+0.049
Neg logit contributions
ding-0.082
tr-0.079
fire-0.074
whe-0.072
 numb-0.069
Token wanted
Token ID2227
Feature activation+2.70e+00

Pos logit contributions
els+0.018
ubby+0.016
en+0.016
ere+0.016
ened+0.016
Neg logit contributions
ding-0.020
tr-0.019
fire-0.017
storm-0.017
t-0.017
Token to
Token ID284
Feature activation+5.19e-02

Pos logit contributions
els+0.063
en+0.059
ened+0.058
ubby+0.058
el+0.057
Neg logit contributions
ding-0.061
tr-0.057
fire-0.053
storm-0.051
whe-0.051
Token keep
Token ID1394
Feature activation+1.25e+00

Pos logit contributions
els+0.001
en+0.001
el+0.001
 opinions+0.001
ere+0.001
Neg logit contributions
ding-0.001
tr-0.001
fire-0.001
storm-0.001
whe-0.001
Token the
Token ID262
Feature activation-4.70e+00

Pos logit contributions
els+0.023
ubby+0.022
en+0.021
ere+0.021
ened+0.021
Neg logit contributions
ding-0.033
tr-0.031
fire-0.030
whe-0.029
t-0.028
Token ash
Token ID12530
Feature activation-1.01e+01

Pos logit contributions
ding+0.113
tr+0.107
fire+0.099
whe+0.094
storm+0.094
Neg logit contributions
els-0.102
en-0.095
ubby-0.095
ened-0.093
ere-0.093
Tokentr
Token ID2213
Feature activation-1.18e+01

Pos logit contributions
ding+0.178
 villains+0.167
 lifetime+0.141
fire+0.139
storm+0.138
Neg logit contributions
els-0.269
el-0.267
en-0.248
ened-0.240
ered-0.238
Tokenay
Token ID323
Feature activation-3.34e+00

Pos logit contributions
fire+0.375
ding+0.369
 villains+0.366
storm+0.350
fall+0.337
Neg logit contributions
els-0.163
el-0.159
ubby-0.124
en-0.123
ened-0.121
Token.
Token ID13
Feature activation-2.69e+00

Pos logit contributions
ding+0.084
fire+0.079
storm+0.078
tr+0.077
 villains+0.075
Neg logit contributions
els-0.066
el-0.062
en-0.060
ened-0.060
ered-0.055
Token So
Token ID1406
Feature activation+8.35e-01

Pos logit contributions
ding+0.067
tr+0.064
fire+0.058
whe+0.057
t+0.057
Neg logit contributions
els-0.057
ubby-0.052
en-0.052
ere-0.051
ened-0.050
Token he
Token ID339
Feature activation+1.73e-01

Pos logit contributions
els+0.015
ubby+0.013
en+0.013
ere+0.013
ened+0.013
Neg logit contributions
ding-0.024
tr-0.022
fire-0.021
t-0.021
whe-0.020
Token tried
Token ID3088
Feature activation+2.58e+00

Pos logit contributions
els+0.004
ubby+0.004
 angles+0.003
en+0.003
ere+0.003
Neg logit contributions
ding-0.004
tr-0.004
fire-0.004
storm-0.003
t-0.003
Token shouldn
Token ID6584
Feature activation+7.28e-02

Pos logit contributions
ding+0.013
tr+0.012
fire+0.011
t+0.011
storm+0.011
Neg logit contributions
els-0.013
en-0.012
ened-0.012
ubby-0.012
ere-0.012
Token't
Token ID470
Feature activation+1.78e-01

Pos logit contributions
els+0.002
ubby+0.002
en+0.002
ened+0.002
ere+0.002
Neg logit contributions
ding-0.001
tr-0.001
fire-0.001
storm-0.001
t-0.001
Token have
Token ID423
Feature activation+6.04e-01

Pos logit contributions
els+0.004
en+0.004
el+0.004
 opinions+0.003
ubby+0.003
Neg logit contributions
ding-0.004
tr-0.004
fire-0.004
storm-0.004
whe-0.004
Token taken
Token ID2077
Feature activation+2.20e+00

Pos logit contributions
els+0.013
ubby+0.013
en+0.012
ened+0.012
ere+0.012
Neg logit contributions
ding-0.014
tr-0.013
fire-0.012
t-0.012
whe-0.012
Token the
Token ID262
Feature activation-4.28e+00

Pos logit contributions
els+0.045
ubby+0.044
ere+0.041
ened+0.041
en+0.040
Neg logit contributions
ding-0.049
tr-0.045
fire-0.045
t-0.044
 much-0.041
Token soup
Token ID17141
Feature activation-1.17e+01

Pos logit contributions
ding+0.102
tr+0.094
fire+0.089
whe+0.085
storm+0.085
Neg logit contributions
els-0.095
en-0.088
ubby-0.088
ened-0.086
ere-0.086
Token without
Token ID1231
Feature activation+1.99e-01

Pos logit contributions
ding+0.290
tr+0.273
whe+0.254
fire+0.248
 villains+0.243
Neg logit contributions
els-0.236
en-0.226
ubby-0.219
el-0.215
ened-0.212
Token asking
Token ID4737
Feature activation+2.73e+00

Pos logit contributions
els+0.004
ubby+0.003
ere+0.003
en+0.003
ened+0.003
Neg logit contributions
ding-0.005
tr-0.005
fire-0.005
whe-0.005
storm-0.005
Token.
Token ID13
Feature activation-3.27e+00

Pos logit contributions
els+0.066
ubby+0.062
ened+0.062
en+0.062
ere+0.061
Neg logit contributions
ding-0.057
tr-0.050
fire-0.049
t-0.047
whe-0.045
Token Now
Token ID2735
Feature activation+1.37e+00

Pos logit contributions
ding+0.077
tr+0.073
fire+0.067
t+0.065
whe+0.064
Neg logit contributions
els-0.072
en-0.067
ened-0.066
ubby-0.065
ere-0.064
Token you
Token ID345
Feature activation-5.28e-01

Pos logit contributions
els+0.028
ubby+0.027
ere+0.025
en+0.025
ened+0.025
Neg logit contributions
ding-0.034
tr-0.031
fire-0.030
t-0.029
storm-0.029
Token not
Token ID407
Feature activation-2.32e+00

Pos logit contributions
ding+0.022
tr+0.020
fire+0.019
storm+0.019
whe+0.019
Neg logit contributions
els-0.018
 opinions-0.017
ened-0.016
en-0.016
ere-0.016
Token played
Token ID2826
Feature activation-1.48e-01

Pos logit contributions
ding+0.060
tr+0.054
fire+0.053
storm+0.051
whe+0.050
Neg logit contributions
els-0.044
 opinions-0.043
ened-0.042
en-0.042
ere-0.040
Token with
Token ID351
Feature activation+7.16e-02

Pos logit contributions
ding+0.004
tr+0.003
t+0.003
storm+0.003
whe+0.003
Neg logit contributions
els-0.003
en-0.003
ubby-0.003
 opinions-0.003
ened-0.003
Token the
Token ID262
Feature activation-3.74e+00

Pos logit contributions
els+0.001
en+0.001
ubby+0.001
ened+0.001
ere+0.001
Neg logit contributions
ding-0.002
tr-0.002
fire-0.002
whe-0.002
storm-0.002
Token ash
Token ID12530
Feature activation-1.00e+01

Pos logit contributions
ding+0.095
tr+0.087
whe+0.082
uts+0.081
t+0.080
Neg logit contributions
els-0.075
ubby-0.072
 opinions-0.070
ere-0.067
en-0.066
Tokentr
Token ID2213
Feature activation-1.17e+01

Pos logit contributions
ding+0.243
vere+0.212
 numb+0.209
 lifetime+0.208
whe+0.207
Neg logit contributions
els-0.199
en-0.186
el-0.174
 opinions-0.171
 ways-0.167
Tokenay
Token ID323
Feature activation-3.36e+00

Pos logit contributions
fire+0.376
ding+0.361
 villains+0.357
storm+0.337
Snow+0.336
Neg logit contributions
els-0.162
el-0.154
ubby-0.131
en-0.127
ened-0.124
Token.
Token ID13
Feature activation-3.33e+00

Pos logit contributions
ding+0.078
tr+0.075
fire+0.074
 numb+0.069
whe+0.068
Neg logit contributions
els-0.077
el-0.069
en-0.069
ened-0.067
ubby-0.063
Token They
Token ID1119
Feature activation-1.74e+00

Pos logit contributions
ding+0.085
tr+0.078
fire+0.075
whe+0.073
storm+0.072
Neg logit contributions
els-0.065
ubby-0.062
 opinions-0.062
en-0.061
ere-0.060
Token wish
Token ID4601
Feature activation+1.05e+00

Pos logit contributions
ding+0.052
tr+0.046
fire+0.045
storm+0.044
whe+0.044
Neg logit contributions
els-0.027
 opinions-0.027
en-0.025
ened-0.025
el-0.025
Token they
Token ID484
Feature activation-3.57e+00

Pos logit contributions
els+0.029
en+0.027
ened+0.027
el+0.027
 opinions+0.027
Neg logit contributions
ding-0.019
tr-0.017
fire-0.016
whe-0.015
storm-0.015
Token will
Token ID481
Feature activation-1.76e+00

Pos logit contributions
ding+0.082
fire+0.070
tr+0.070
whe+0.068
storm+0.068
Neg logit contributions
els-0.052
 opinions-0.051
en-0.050
ere-0.049
ened-0.049
Token not
Token ID407
Feature activation-1.73e+00

Pos logit contributions
ding+0.047
tr+0.039
fire+0.039
whe+0.038
storm+0.037
Neg logit contributions
els-0.035
en-0.034
 opinions-0.033
el-0.032
ubby-0.032
Token touch
Token ID3638
Feature activation+1.85e+00

Pos logit contributions
ding+0.049
fire+0.042
tr+0.041
 lifetime+0.039
t+0.039
Neg logit contributions
en-0.031
els-0.031
el-0.030
 opinions-0.029
 me-0.029
Token the
Token ID262
Feature activation-4.29e+00

Pos logit contributions
els+0.030
en+0.026
el+0.026
ubby+0.025
ened+0.025
Neg logit contributions
ding-0.056
tr-0.053
fire-0.051
storm-0.049
whe-0.049
Token ash
Token ID12530
Feature activation-9.99e+00

Pos logit contributions
ding+0.121
fire+0.108
uts+0.107
whe+0.104
t+0.103
Neg logit contributions
els-0.080
el-0.072
 opinions-0.069
ubby-0.066
en-0.065
Tokentr
Token ID2213
Feature activation-1.17e+01

Pos logit contributions
ding+0.094
 villains+0.092
fire+0.078
 lifetime+0.077
 numb+0.072
Neg logit contributions
els-0.347
el-0.340
en-0.329
 means-0.306
ened-0.306
Tokenay
Token ID323
Feature activation-3.18e+00

Pos logit contributions
fire+0.366
ding+0.354
 villains+0.353
storm+0.330
 numb+0.322
Neg logit contributions
els-0.174
el-0.166
en-0.138
ened-0.132
ubby-0.131
Token again
Token ID757
Feature activation-2.92e+00

Pos logit contributions
ding+0.075
fire+0.075
 villains+0.071
tr+0.070
 numb+0.067
Neg logit contributions
els-0.070
el-0.066
en-0.063
ened-0.061
ere-0.056
Token,"
Token ID553
Feature activation-1.09e+00

Pos logit contributions
ding+0.068
tr+0.064
fire+0.059
whe+0.058
storm+0.057
Neg logit contributions
els-0.066
en-0.062
ubby-0.061
el-0.061
ened-0.060
Token Tim
Token ID5045
Feature activation+4.87e+00

Pos logit contributions
ding+0.024
tr+0.022
fire+0.021
storm+0.020
t+0.019
Neg logit contributions
els-0.027
el-0.025
ere-0.024
en-0.024
ened-0.024
Token says
Token ID1139
Feature activation+2.82e+00

Pos logit contributions
els+0.074
ubby+0.073
ened+0.068
ere+0.067
en+0.065
Neg logit contributions
ding-0.142
tr-0.140
whe-0.128
ache-0.125
fire-0.124
Tokenfield
Token ID3245
Feature activation-7.18e+00

Pos logit contributions
ding+0.281
tr+0.280
 villains+0.252
t+0.249
whe+0.248
Neg logit contributions
en-0.183
els-0.181
el-0.176
 ways-0.160
ened-0.158
Token and
Token ID290
Feature activation-4.90e+00

Pos logit contributions
ding+0.182
tr+0.177
whe+0.157
storm+0.157
t+0.153
Neg logit contributions
els-0.140
en-0.139
el-0.133
ened-0.130
ubby-0.129
Token still
Token ID991
Feature activation-2.38e-01

Pos logit contributions
ding+0.133
tr+0.121
t+0.113
storm+0.110
whe+0.109
Neg logit contributions
els-0.089
en-0.089
 opinions-0.088
ened-0.087
 angles-0.081
Token keep
Token ID1394
Feature activation+3.58e-01

Pos logit contributions
ding+0.006
tr+0.006
whe+0.005
fire+0.005
t+0.005
Neg logit contributions
els-0.005
en-0.005
ened-0.005
 opinions-0.005
el-0.004
Token the
Token ID262
Feature activation-4.89e+00

Pos logit contributions
els+0.006
en+0.005
el+0.005
ened+0.005
ere+0.005
Neg logit contributions
ding-0.011
tr-0.011
fire-0.010
storm-0.010
t-0.009
Token corn
Token ID11676
Feature activation-1.17e+01

Pos logit contributions
ding+0.129
tr+0.121
whe+0.111
t+0.109
fire+0.109
Neg logit contributions
els-0.094
ubby-0.090
en-0.090
el-0.088
ened-0.087
Token safe
Token ID3338
Feature activation-2.76e+00

Pos logit contributions
ding+0.289
tr+0.284
 villains+0.262
whe+0.258
storm+0.248
Neg logit contributions
el-0.229
els-0.227
en-0.224
ubby-0.211
 ways-0.205
Token.
Token ID13
Feature activation-3.26e+00

Pos logit contributions
ding+0.073
tr+0.070
whe+0.062
storm+0.060
t+0.059
Neg logit contributions
en-0.054
els-0.051
ened-0.050
el-0.049
 ways-0.046
Token The
Token ID383
Feature activation+1.09e+00

Pos logit contributions
ding+0.091
tr+0.087
fire+0.084
storm+0.083
 numb+0.080
Neg logit contributions
els-0.053
ere-0.052
en-0.051
el-0.051
ubby-0.051
Token farmer
Token ID18739
Feature activation+6.65e-01

Pos logit contributions
els+0.022
en+0.020
ubby+0.019
ened+0.019
ere+0.019
Neg logit contributions
ding-0.029
tr-0.027
fire-0.026
storm-0.024
whe-0.024
Token and
Token ID290
Feature activation-2.54e+00

Pos logit contributions
els+0.014
ubby+0.013
en+0.013
ened+0.012
ere+0.012
Neg logit contributions
ding-0.016
tr-0.015
fire-0.015
whe-0.014
storm-0.014
Token make
Token ID787
Feature activation+9.30e-01

Pos logit contributions
ding+0.031
tr+0.029
fire+0.027
storm+0.025
t+0.025
Neg logit contributions
els-0.033
en-0.032
el-0.031
ubby-0.031
ened-0.030
Token the
Token ID262
Feature activation-3.92e+00

Pos logit contributions
els+0.015
en+0.013
ubby+0.013
ened+0.013
el+0.013
Neg logit contributions
ding-0.028
tr-0.027
fire-0.025
storm-0.025
whe-0.025
Token waiter
Token ID46612
Feature activation-5.09e+00

Pos logit contributions
ding+0.088
tr+0.082
fire+0.075
whe+0.074
t+0.073
Neg logit contributions
els-0.092
ubby-0.087
en-0.085
ere-0.084
 opinions-0.083
Token drop
Token ID4268
Feature activation+8.01e-01

Pos logit contributions
ding+0.139
tr+0.138
storm+0.125
Snow+0.124
fire+0.121
Neg logit contributions
els-0.091
en-0.088
el-0.085
ened-0.081
 girls-0.078
Token the
Token ID262
Feature activation-4.51e+00

Pos logit contributions
els+0.012
en+0.011
ubby+0.011
ened+0.011
el+0.011
Neg logit contributions
ding-0.025
tr-0.023
fire-0.022
storm-0.021
whe-0.021
Token pizz
Token ID38748
Feature activation-1.16e+01

Pos logit contributions
ding+0.098
tr+0.090
fire+0.084
whe+0.081
storm+0.081
Neg logit contributions
els-0.110
ubby-0.102
en-0.102
ened-0.100
el-0.100
Tokenas
Token ID292
Feature activation-6.18e+00

Pos logit contributions
storm+0.268
ding+0.265
tr+0.253
 villains+0.243
Snow+0.241
Neg logit contributions
els-0.254
ubby-0.228
el-0.222
ac-0.215
en-0.201
Token.
Token ID13
Feature activation-3.17e+00

Pos logit contributions
tr+0.164
ding+0.160
fire+0.143
 villains+0.140
Snow+0.140
Neg logit contributions
els-0.118
en-0.115
el-0.113
ened-0.106
ubby-0.102
Token They
Token ID1119
Feature activation-1.85e+00

Pos logit contributions
ding+0.086
tr+0.079
fire+0.076
whe+0.074
storm+0.074
Neg logit contributions
els-0.058
ubby-0.055
 opinions-0.054
en-0.053
el-0.053
Token said
Token ID531
Feature activation+2.56e+00

Pos logit contributions
ding+0.052
tr+0.048
fire+0.046
storm+0.044
whe+0.044
Neg logit contributions
els-0.033
ened-0.030
en-0.030
ubby-0.029
el-0.029
Token sorry
Token ID7926
Feature activation+3.45e+00

Pos logit contributions
els+0.068
en+0.064
ubby+0.064
ened+0.063
ere+0.063
Neg logit contributions
ding-0.049
tr-0.045
fire-0.041
whe-0.039
storm-0.039
Token not
Token ID407
Feature activation-3.40e+00

Pos logit contributions
ding+0.064
tr+0.057
fire+0.054
storm+0.051
whe+0.051
Neg logit contributions
els-0.087
en-0.082
ubby-0.082
ened-0.080
ere-0.080
Token nice
Token ID3621
Feature activation+8.74e-01

Pos logit contributions
ding+0.099
tr+0.090
fire+0.088
storm+0.087
whe+0.083
Neg logit contributions
els-0.058
en-0.053
ubby-0.053
 opinions-0.053
ened-0.052
Token to
Token ID284
Feature activation-2.20e+00

Pos logit contributions
els+0.023
ubby+0.021
ere+0.021
en+0.021
ened+0.021
Neg logit contributions
ding-0.017
tr-0.016
fire-0.014
t-0.014
whe-0.014
Token keep
Token ID1394
Feature activation-3.72e-01

Pos logit contributions
ding+0.056
tr+0.052
fire+0.049
whe+0.047
storm+0.047
Neg logit contributions
els-0.045
ubby-0.041
en-0.041
ened-0.041
ere-0.040
Token the
Token ID262
Feature activation-5.52e+00

Pos logit contributions
ding+0.010
tr+0.010
fire+0.009
whe+0.009
t+0.009
Neg logit contributions
els-0.006
ubby-0.006
ere-0.006
ened-0.006
en-0.006
Token dish
Token ID9433
Feature activation-1.16e+01

Pos logit contributions
ding+0.143
tr+0.132
fire+0.125
whe+0.122
storm+0.121
Neg logit contributions
els-0.112
ubby-0.102
en-0.102
el-0.101
ened-0.100
Token all
Token ID477
Feature activation-6.26e+00

Pos logit contributions
ding+0.224
tr+0.209
storm+0.197
 villains+0.190
whe+0.189
Neg logit contributions
els-0.298
ubby-0.285
el-0.284
en-0.274
ac-0.261
Token to
Token ID284
Feature activation-4.61e+00

Pos logit contributions
ding+0.152
tr+0.142
fire+0.133
storm+0.128
whe+0.127
Neg logit contributions
els-0.136
en-0.126
ubby-0.126
ened-0.123
el-0.122
Token yourself
Token ID3511
Feature activation-2.25e+00

Pos logit contributions
ding+0.117
tr+0.110
fire+0.103
storm+0.099
whe+0.099
Neg logit contributions
els-0.095
en-0.087
ubby-0.087
ened-0.086
ere-0.085
Token."
Token ID526
Feature activation-1.49e+00

Pos logit contributions
ding+0.052
tr+0.049
storm+0.045
fire+0.044
whe+0.044
Neg logit contributions
els-0.052
en-0.048
ubby-0.047
el-0.046
ened-0.046
Token
Token ID198
Feature activation-3.80e+00

Pos logit contributions
ding+0.038
tr+0.036
storm+0.034
fire+0.034
whe+0.032
Neg logit contributions
els-0.029
ubby-0.028
en-0.028
ere-0.027
el-0.027
Tokenfield
Token ID3245
Feature activation-7.18e+00

Pos logit contributions
ding+0.279
tr+0.277
 villains+0.249
t+0.247
whe+0.242
Neg logit contributions
en-0.187
els-0.187
el-0.182
 ways-0.163
eful-0.161
Token and
Token ID290
Feature activation-5.00e+00

Pos logit contributions
ding+0.179
tr+0.175
storm+0.155
whe+0.154
t+0.151
Neg logit contributions
els-0.144
en-0.141
el-0.135
ened-0.132
ubby-0.131
Token still
Token ID991
Feature activation-1.57e-01

Pos logit contributions
ding+0.132
tr+0.121
t+0.111
storm+0.109
whe+0.108
Neg logit contributions
els-0.094
en-0.094
ened-0.093
 opinions-0.091
 angles-0.086
Token keep
Token ID1394
Feature activation+4.85e-01

Pos logit contributions
ding+0.004
tr+0.004
fire+0.003
whe+0.003
t+0.003
Neg logit contributions
els-0.003
ened-0.003
en-0.003
 opinions-0.003
el-0.003
Token the
Token ID262
Feature activation-4.86e+00

Pos logit contributions
els+0.008
en+0.007
el+0.007
ened+0.007
ubby+0.007
Neg logit contributions
ding-0.014
tr-0.014
fire-0.013
storm-0.013
t-0.013
Token corn
Token ID11676
Feature activation-1.16e+01

Pos logit contributions
ding+0.129
tr+0.121
whe+0.111
t+0.110
uts+0.109
Neg logit contributions
els-0.093
ubby-0.089
en-0.089
el-0.087
 opinions-0.086
Token safe
Token ID3338
Feature activation-2.75e+00

Pos logit contributions
ding+0.286
tr+0.280
 villains+0.260
whe+0.253
storm+0.247
Neg logit contributions
el-0.229
els-0.227
en-0.222
ubby-0.209
 ways-0.203
Token.
Token ID13
Feature activation-3.37e+00

Pos logit contributions
ding+0.072
tr+0.068
whe+0.061
storm+0.059
t+0.059
Neg logit contributions
en-0.055
els-0.051
ened-0.051
el-0.050
 angles-0.047
Token The
Token ID383
Feature activation+1.06e+00

Pos logit contributions
ding+0.088
tr+0.083
fire+0.080
storm+0.078
 numb+0.075
Neg logit contributions
els-0.062
ere-0.060
en-0.060
ubby-0.059
el-0.059
Token farmer
Token ID18739
Feature activation+5.67e-01

Pos logit contributions
els+0.021
en+0.019
ubby+0.019
ened+0.019
ere+0.019
Neg logit contributions
ding-0.028
tr-0.026
fire-0.024
whe-0.024
storm-0.024
Token and
Token ID290
Feature activation-2.68e+00

Pos logit contributions
els+0.012
ubby+0.011
en+0.011
ened+0.011
ere+0.010
Neg logit contributions
ding-0.014
tr-0.013
fire-0.013
whe-0.012
storm-0.012
Token helping
Token ID5742
Feature activation+3.88e+00

Pos logit contributions
els+0.002
en+0.002
ubby+0.002
ened+0.002
ere+0.002
Neg logit contributions
ding-0.003
tr-0.002
fire-0.002
storm-0.002
whe-0.002
Token me
Token ID502
Feature activation-1.50e+00

Pos logit contributions
els+0.034
ubby+0.029
ened+0.028
ere+0.026
en+0.026
Neg logit contributions
ding-0.142
tr-0.134
fire-0.131
whe-0.127
t-0.126
Token make
Token ID787
Feature activation+3.22e-01

Pos logit contributions
ding+0.036
tr+0.034
fire+0.031
storm+0.031
whe+0.030
Neg logit contributions
els-0.032
en-0.031
ubby-0.030
el-0.029
ere-0.029
Token my
Token ID616
Feature activation-2.46e+00

Pos logit contributions
els+0.006
ubby+0.005
ened+0.005
en+0.005
ere+0.005
Neg logit contributions
ding-0.009
tr-0.008
fire-0.008
whe-0.008
storm-0.008
Token magic
Token ID5536
Feature activation-4.96e+00

Pos logit contributions
ding+0.064
tr+0.058
t+0.056
whe+0.055
fire+0.054
Neg logit contributions
els-0.050
ubby-0.046
el-0.045
en-0.045
 opinions-0.045
Token soup
Token ID17141
Feature activation-1.15e+01

Pos logit contributions
ding+0.132
whe+0.115
tr+0.113
t+0.108
fire+0.105
Neg logit contributions
els-0.098
en-0.097
el-0.091
ubby-0.090
 opinions-0.086
Token.
Token ID13
Feature activation-4.19e+00

Pos logit contributions
ding+0.304
tr+0.272
whe+0.264
 villains+0.262
t+0.256
Neg logit contributions
els-0.210
en-0.209
el-0.193
ubby-0.185
 opinions-0.185
Token It
Token ID632
Feature activation-5.65e+00

Pos logit contributions
ding+0.109
tr+0.100
fire+0.095
storm+0.092
whe+0.091
Neg logit contributions
els-0.083
ubby-0.077
en-0.077
el-0.075
ere-0.074
Token will
Token ID481
Feature activation-4.05e+00

Pos logit contributions
ding+0.169
tr+0.152
fire+0.147
whe+0.145
storm+0.142
Neg logit contributions
els-0.092
en-0.086
el-0.083
ened-0.081
ubby-0.081
Token make
Token ID787
Feature activation-1.10e+00

Pos logit contributions
ding+0.113
tr+0.103
whe+0.094
uts+0.094
fire+0.093
Neg logit contributions
els-0.070
en-0.069
el-0.069
ubby-0.066
 opinions-0.064
Token me
Token ID502
Feature activation-4.34e+00

Pos logit contributions
ding+0.030
tr+0.028
fire+0.026
storm+0.026
t+0.025
Neg logit contributions
els-0.021
en-0.020
el-0.019
ubby-0.019
ere-0.019
Token We
Token ID775
Feature activation-1.58e+00

Pos logit contributions
ding+0.056
tr+0.052
fire+0.047
t+0.043
storm+0.043
Neg logit contributions
els-0.103
en-0.098
ere-0.096
ubby-0.096
ened-0.096
Token should
Token ID815
Feature activation-8.94e-01

Pos logit contributions
ding+0.040
tr+0.037
fire+0.036
storm+0.034
whe+0.034
Neg logit contributions
els-0.033
en-0.030
ubby-0.030
ened-0.030
ere-0.029
Token have
Token ID423
Feature activation-1.30e-01

Pos logit contributions
ding+0.013
tr+0.013
t+0.011
fire+0.011
storm+0.011
Neg logit contributions
els-0.026
ened-0.025
ubby-0.025
en-0.025
ere-0.025
Token shared
Token ID4888
Feature activation+2.25e+00

Pos logit contributions
ding+0.003
tr+0.003
fire+0.003
whe+0.003
storm+0.003
Neg logit contributions
els-0.003
ubby-0.003
en-0.003
ened-0.003
ere-0.003
Token the
Token ID262
Feature activation-6.29e+00

Pos logit contributions
els+0.041
en+0.037
ubby+0.037
ened+0.036
ere+0.036
Neg logit contributions
ding-0.063
tr-0.059
fire-0.056
storm-0.054
whe-0.054
Token cream
Token ID8566
Feature activation-1.15e+01

Pos logit contributions
ding+0.144
tr+0.133
fire+0.127
storm+0.122
whe+0.119
Neg logit contributions
els-0.141
ubby-0.131
 opinions-0.131
ere-0.130
el-0.126
Token and
Token ID290
Feature activation-7.23e+00

Pos logit contributions
ding+0.339
tr+0.321
 villains+0.317
 numb+0.312
Snow+0.309
Neg logit contributions
els-0.161
en-0.142
el-0.139
ubby-0.134
 opinions-0.129
Token been
Token ID587
Feature activation-2.29e+00

Pos logit contributions
ding+0.206
tr+0.187
fire+0.178
storm+0.177
t+0.172
Neg logit contributions
els-0.126
 opinions-0.121
ened-0.115
en-0.114
ere-0.110
Token nice
Token ID3621
Feature activation+7.62e-01

Pos logit contributions
ding+0.051
tr+0.047
fire+0.044
whe+0.043
t+0.042
Neg logit contributions
els-0.054
en-0.050
ubby-0.050
ened-0.050
el-0.049
Token to
Token ID284
Feature activation-1.61e+00

Pos logit contributions
els+0.017
en+0.016
ubby+0.016
ened+0.016
 opinions+0.016
Neg logit contributions
ding-0.018
tr-0.016
fire-0.015
whe-0.015
storm-0.015
Token each
Token ID1123
Feature activation-2.03e+00

Pos logit contributions
ding+0.045
tr+0.042
fire+0.040
storm+0.038
whe+0.038
Neg logit contributions
els-0.030
en-0.028
ubby-0.027
el-0.027
ened-0.027
Token I
Token ID314
Feature activation-2.58e+00

Pos logit contributions
ding+0.081
tr+0.077
fire+0.069
whe+0.066
t+0.066
Neg logit contributions
els-0.095
en-0.089
ubby-0.088
ened-0.087
ere-0.087
Token want
Token ID765
Feature activation+6.79e-01

Pos logit contributions
ding+0.070
tr+0.062
fire+0.061
whe+0.059
storm+0.059
Neg logit contributions
els-0.046
en-0.045
 opinions-0.044
ere-0.044
ened-0.043
Token to
Token ID284
Feature activation-6.06e-01

Pos logit contributions
els+0.016
en+0.015
el+0.014
ubby+0.014
ened+0.014
Neg logit contributions
ding-0.016
tr-0.014
fire-0.014
storm-0.013
whe-0.013
Token share
Token ID2648
Feature activation+2.02e+00

Pos logit contributions
ding+0.016
tr+0.015
fire+0.014
storm+0.014
t+0.014
Neg logit contributions
els-0.012
en-0.011
el-0.011
 opinions-0.011
ere-0.011
Token my
Token ID616
Feature activation-3.45e+00

Pos logit contributions
els+0.034
ubby+0.031
en+0.030
ened+0.030
ere+0.030
Neg logit contributions
ding-0.059
tr-0.055
fire-0.053
whe-0.051
storm-0.051
Token jelly
Token ID27644
Feature activation-1.15e+01

Pos logit contributions
ding+0.058
fire+0.053
tr+0.052
t+0.046
 numb+0.045
Neg logit contributions
els-0.096
en-0.090
ere-0.089
ened-0.089
ubby-0.087
Token with
Token ID351
Feature activation-1.97e+00

Pos logit contributions
ding+0.295
tr+0.264
storm+0.249
whe+0.245
 numb+0.232
Neg logit contributions
els-0.228
en-0.218
ubby-0.212
el-0.204
ac-0.198
Token you
Token ID345
Feature activation-5.03e+00

Pos logit contributions
ding+0.055
tr+0.051
fire+0.049
storm+0.047
whe+0.047
Neg logit contributions
els-0.036
en-0.033
ubby-0.032
el-0.032
ere-0.032
Token.
Token ID13
Feature activation-4.01e+00

Pos logit contributions
ding+0.105
tr+0.097
fire+0.090
whe+0.085
storm+0.085
Neg logit contributions
els-0.125
en-0.117
ubby-0.117
ened-0.115
ere-0.115
Token Can
Token ID1680
Feature activation+7.30e-02

Pos logit contributions
ding+0.090
tr+0.085
fire+0.078
whe+0.075
storm+0.074
Neg logit contributions
els-0.094
en-0.087
ubby-0.087
ened-0.086
ere-0.085
Token we
Token ID356
Feature activation-1.81e+00

Pos logit contributions
els+0.002
en+0.002
ubby+0.002
el+0.002
ened+0.002
Neg logit contributions
ding-0.001
tr-0.001
fire-0.001
storm-0.001
whe-0.001
Token favorite
Token ID4004
Feature activation-1.03e-01

Pos logit contributions
ding+0.021
tr+0.019
fire+0.018
whe+0.018
storm+0.018
Neg logit contributions
els-0.017
ubby-0.016
en-0.016
el-0.016
ened-0.016
Token.
Token ID13
Feature activation-4.30e+00

Pos logit contributions
ding+0.002
tr+0.002
fire+0.002
storm+0.002
t+0.002
Neg logit contributions
els-0.002
ubby-0.002
en-0.002
ere-0.002
ened-0.002
Token It
Token ID632
Feature activation-5.09e+00

Pos logit contributions
ding+0.117
tr+0.109
fire+0.104
whe+0.100
storm+0.100
Neg logit contributions
els-0.080
en-0.073
ubby-0.073
ened-0.071
ere-0.071
Token makes
Token ID1838
Feature activation-5.31e-01

Pos logit contributions
ding+0.161
tr+0.149
fire+0.144
whe+0.141
storm+0.139
Neg logit contributions
els-0.074
en-0.066
el-0.065
ubby-0.064
ened-0.064
Token the
Token ID262
Feature activation-3.98e+00

Pos logit contributions
ding+0.015
tr+0.014
fire+0.013
whe+0.013
t+0.013
Neg logit contributions
els-0.009
ubby-0.009
en-0.008
ened-0.008
ere-0.008
Token soup
Token ID17141
Feature activation-1.15e+01

Pos logit contributions
ding+0.109
tr+0.099
fire+0.094
t+0.093
whe+0.093
Neg logit contributions
els-0.076
el-0.070
en-0.070
ubby-0.069
ened-0.068
Token creamy
Token ID27892
Feature activation-1.48e+00

Pos logit contributions
ding+0.296
tr+0.271
whe+0.260
storm+0.253
 villains+0.250
Neg logit contributions
els-0.231
en-0.218
el-0.215
ubby-0.200
 opinions-0.199
Token and
Token ID290
Feature activation-8.25e+00

Pos logit contributions
ding+0.040
tr+0.038
fire+0.036
whe+0.035
storm+0.034
Neg logit contributions
els-0.028
en-0.026
ubby-0.025
el-0.025
ened-0.025
Token y
Token ID331
Feature activation-6.46e-01

Pos logit contributions
ding+0.186
tr+0.172
fire+0.164
storm+0.153
whe+0.153
Neg logit contributions
els-0.190
ubby-0.177
en-0.176
ere-0.175
ened-0.172
Tokenummy
Token ID13513
Feature activation-1.35e+00

Pos logit contributions
ding+0.021
tr+0.019
fire+0.018
t+0.018
whe+0.018
Neg logit contributions
els-0.008
ened-0.008
ubby-0.007
ere-0.007
en-0.007
Token,"
Token ID553
Feature activation-3.69e-01

Pos logit contributions
ding+0.033
tr+0.031
fire+0.028
whe+0.028
storm+0.028
Neg logit contributions
els-0.030
en-0.028
ened-0.027
el-0.027
ubby-0.027
Token You
Token ID921
Feature activation-1.64e+00

Pos logit contributions
ding+0.107
tr+0.101
fire+0.094
t+0.091
whe+0.091
Neg logit contributions
els-0.092
en-0.084
ened-0.083
ubby-0.083
ere-0.082
Token can
Token ID460
Feature activation-2.29e+00

Pos logit contributions
ding+0.039
tr+0.036
fire+0.034
whe+0.033
storm+0.033
Neg logit contributions
els-0.036
ubby-0.033
en-0.033
el-0.033
ere-0.033
Token't
Token ID470
Feature activation-1.04e+00

Pos logit contributions
ding+0.050
tr+0.045
fire+0.044
 lifetime+0.040
storm+0.040
Neg logit contributions
els-0.054
el-0.053
en-0.051
ubby-0.049
ered-0.047
Token stir
Token ID11240
Feature activation-8.16e-01

Pos logit contributions
ding+0.030
tr+0.027
 lifetime+0.025
fire+0.025
 villains+0.025
Neg logit contributions
el-0.019
els-0.018
en-0.018
ubby-0.016
 me-0.016
Token the
Token ID262
Feature activation-4.13e+00

Pos logit contributions
ding+0.021
tr+0.019
fire+0.019
whe+0.018
t+0.018
Neg logit contributions
els-0.016
ubby-0.015
ere-0.014
ened-0.014
en-0.014
Token soup
Token ID17141
Feature activation-1.14e+01

Pos logit contributions
ding+0.114
tr+0.104
whe+0.101
t+0.099
uts+0.099
Neg logit contributions
els-0.080
el-0.073
en-0.071
ubby-0.070
ened-0.068
Token with
Token ID351
Feature activation-1.41e+00

Pos logit contributions
ding+0.289
tr+0.275
whe+0.255
 villains+0.250
fire+0.249
Neg logit contributions
els-0.223
en-0.222
el-0.207
ubby-0.200
ened-0.194
Token it
Token ID340
Feature activation-6.43e+00

Pos logit contributions
ding+0.041
tr+0.038
fire+0.037
storm+0.035
t+0.035
Neg logit contributions
els-0.023
ubby-0.021
en-0.021
ened-0.020
ere-0.020
Token!
Token ID0
Feature activation-4.44e+00

Pos logit contributions
ding+0.156
tr+0.144
fire+0.139
whe+0.135
storm+0.130
Neg logit contributions
els-0.136
en-0.129
el-0.126
ubby-0.125
ened-0.124
Token You
Token ID921
Feature activation-1.85e+00

Pos logit contributions
ding+0.109
tr+0.104
fire+0.096
whe+0.093
t+0.093
Neg logit contributions
els-0.093
en-0.086
ened-0.085
ubby-0.085
ere-0.084
Token can
Token ID460
Feature activation-2.41e+00

Pos logit contributions
ding+0.047
tr+0.044
fire+0.041
whe+0.040
storm+0.040
Neg logit contributions
els-0.038
ubby-0.035
en-0.035
el-0.034
ere-0.034
Token vegetable
Token ID20236
Feature activation-7.81e+00

Pos logit contributions
ding+0.185
whe+0.166
tr+0.164
uts+0.159
t+0.154
Neg logit contributions
ubby-0.124
els-0.119
el-0.118
 me-0.117
en-0.114
Token.
Token ID13
Feature activation-3.26e+00

Pos logit contributions
ding+0.215
tr+0.196
whe+0.187
storm+0.178
t+0.176
Neg logit contributions
els-0.147
en-0.136
ubby-0.135
el-0.135
ere-0.123
Token It
Token ID632
Feature activation-4.95e+00

Pos logit contributions
ding+0.078
tr+0.074
fire+0.069
t+0.065
whe+0.065
Neg logit contributions
els-0.071
en-0.065
ened-0.065
ubby-0.065
ere-0.064
Token makes
Token ID1838
Feature activation-5.16e-01

Pos logit contributions
ding+0.151
tr+0.143
fire+0.136
whe+0.132
storm+0.132
Neg logit contributions
els-0.075
en-0.067
ubby-0.067
ened-0.065
el-0.065
Token the
Token ID262
Feature activation-3.94e+00

Pos logit contributions
ding+0.013
tr+0.012
fire+0.012
t+0.011
whe+0.011
Neg logit contributions
els-0.010
ubby-0.009
en-0.009
ened-0.009
ere-0.009
Token soup
Token ID17141
Feature activation-1.14e+01

Pos logit contributions
ding+0.107
tr+0.099
fire+0.093
whe+0.091
t+0.091
Neg logit contributions
els-0.075
ubby-0.069
en-0.069
el-0.068
ened-0.067
Token creamy
Token ID27892
Feature activation-1.67e+00

Pos logit contributions
ding+0.299
tr+0.277
whe+0.262
storm+0.256
 villains+0.250
Neg logit contributions
els-0.225
en-0.218
el-0.208
ubby-0.197
 opinions-0.196
Token and
Token ID290
Feature activation-8.38e+00

Pos logit contributions
ding+0.046
tr+0.043
fire+0.040
whe+0.039
storm+0.039
Neg logit contributions
els-0.031
en-0.029
ubby-0.028
el-0.028
ened-0.028
Token tasty
Token ID25103
Feature activation-9.92e-01

Pos logit contributions
ding+0.184
tr+0.170
fire+0.162
whe+0.151
storm+0.151
Neg logit contributions
els-0.198
ubby-0.184
en-0.184
ere-0.181
ened-0.180
Token.
Token ID13
Feature activation-3.82e+00

Pos logit contributions
ding+0.023
tr+0.022
whe+0.020
fire+0.020
storm+0.020
Neg logit contributions
els-0.022
en-0.021
el-0.020
ened-0.020
ubby-0.020
Token Try
Token ID9993
Feature activation+9.85e-01

Pos logit contributions
ding+0.093
tr+0.087
fire+0.082
whe+0.078
storm+0.078
Neg logit contributions
els-0.081
en-0.075
ubby-0.075
ened-0.074
ere-0.073
Token day
Token ID1110
Feature activation+5.27e+00

Pos logit contributions
els+0.105
ubby+0.100
en+0.099
el+0.098
ere+0.098
Neg logit contributions
ding-0.103
tr-0.096
fire-0.090
whe-0.085
storm-0.085
Token,
Token ID11
Feature activation+5.32e+00

Pos logit contributions
els+0.088
en+0.077
 opinions+0.075
ened+0.074
ubby+0.073
Neg logit contributions
ding-0.159
tr-0.146
t-0.141
fire-0.139
storm-0.136
Token two
Token ID734
Feature activation+7.86e+00

Pos logit contributions
els+0.110
en+0.101
ubby+0.098
ened+0.097
ere+0.096
Neg logit contributions
ding-0.138
tr-0.127
fire-0.121
t-0.116
whe-0.115
Token friends
Token ID2460
Feature activation+5.47e+00

Pos logit contributions
els+0.142
en+0.137
ened+0.132
 angles+0.128
ere+0.128
Neg logit contributions
ding-0.203
tr-0.191
fire-0.177
whe-0.176
t-0.175
Token,
Token ID11
Feature activation+3.01e+00

Pos logit contributions
 angles+0.133
elling+0.128
els+0.128
ishment+0.127
ened+0.124
Neg logit contributions
ding-0.086
tr-0.083
t-0.083
 he-0.080
 fell-0.080
Token Winston
Token ID21400
Feature activation+8.66e+00

Pos logit contributions
els+0.053
ubby+0.049
en+0.048
ere+0.048
ened+0.047
Neg logit contributions
ding-0.084
tr-0.078
fire-0.075
storm-0.073
whe-0.072
Token the
Token ID262
Feature activation+1.04e+00

Pos logit contributions
els+0.199
 angles+0.190
ubby+0.181
 opinions+0.180
ened+0.178
Neg logit contributions
ding-0.190
fire-0.169
t-0.155
 he-0.152
tr-0.147
Token mouse
Token ID10211
Feature activation+1.76e+00

Pos logit contributions
els+0.020
ubby+0.019
ere+0.018
 opinions+0.018
el+0.018
Neg logit contributions
ding-0.028
tr-0.025
fire-0.024
t-0.024
whe-0.023
Token and
Token ID290
Feature activation-9.35e-01

Pos logit contributions
els+0.039
ubby+0.036
en+0.036
ened+0.035
ere+0.035
Neg logit contributions
ding-0.041
tr-0.039
fire-0.036
storm-0.035
t-0.034
Token Jo
Token ID5302
Feature activation+7.06e+00

Pos logit contributions
ding+0.025
tr+0.024
fire+0.022
whe+0.022
storm+0.021
Neg logit contributions
els-0.018
ubby-0.016
en-0.016
ened-0.016
el-0.016
Tokenjo
Token ID7639
Feature activation+4.96e+00

Pos logit contributions
els+0.159
ubby+0.147
en+0.145
ened+0.145
ere+0.144
Neg logit contributions
ding-0.163
tr-0.150
fire-0.139
t-0.135
whe-0.134
Token are
Token ID389
Feature activation-1.31e+00

Pos logit contributions
ding+0.034
tr+0.033
fire+0.031
storm+0.030
t+0.029
Neg logit contributions
els-0.021
en-0.020
ubby-0.019
 opinions-0.019
ere-0.019
Token happy
Token ID3772
Feature activation+2.03e+00

Pos logit contributions
ding+0.031
tr+0.030
fire+0.027
storm+0.027
t+0.027
Neg logit contributions
els-0.028
ubby-0.027
en-0.026
ere-0.026
 opinions-0.026
Token.
Token ID13
Feature activation-2.93e+00

Pos logit contributions
els+0.035
en+0.034
 ways+0.032
 opinions+0.032
el+0.032
Neg logit contributions
ding-0.059
tr-0.058
fire-0.052
storm-0.051
ache-0.050
Token<|endoftext|>
Token ID50256
Feature activation+6.79e+00

Pos logit contributions
ding+0.072
tr+0.065
fire+0.063
storm+0.061
whe+0.060
Neg logit contributions
els-0.061
ubby-0.059
en-0.057
 opinions-0.056
el-0.056
TokenL
Token ID43
Feature activation+1.02e+01

Pos logit contributions
els+0.128
en+0.117
ubby+0.117
ened+0.116
ere+0.114
Neg logit contributions
ding-0.182
tr-0.170
fire-0.161
whe-0.155
storm-0.155
Tokenily
Token ID813
Feature activation+8.73e+00

Pos logit contributions
ened+0.184
 opinions+0.179
 amaz+0.165
els+0.163
ening+0.163
Neg logit contributions
ding-0.263
t-0.251
 he-0.218
uts-0.215
tr-0.212
Token and
Token ID290
Feature activation+2.54e+00

Pos logit contributions
els+0.200
ubby+0.184
 angles+0.181
 agreement+0.178
 opinions+0.177
Neg logit contributions
ding-0.193
tr-0.170
 he-0.159
storm-0.153
 rider-0.150
Token Tom
Token ID4186
Feature activation+9.48e+00

Pos logit contributions
els+0.048
en+0.044
ubby+0.043
ened+0.042
ere+0.042
Neg logit contributions
ding-0.069
tr-0.065
fire-0.060
whe-0.059
t-0.059
Token were
Token ID547
Feature activation+5.20e+00

Pos logit contributions
els+0.232
ubby+0.218
 angles+0.214
eful+0.208
ened+0.207
Neg logit contributions
ding-0.197
tr-0.168
whe-0.155
fire-0.155
storm-0.150
Token twins
Token ID20345
Feature activation+2.97e+00

Pos logit contributions
els+0.127
en+0.125
ened+0.124
 angles+0.116
 opinions+0.115
Neg logit contributions
ding-0.112
tr-0.100
whe-0.089
t-0.087
fire-0.086
Token who
Token ID508
Feature activation+1.67e+00

Pos logit contributions
els+0.052
en+0.050
el+0.049
ubby+0.047
ere+0.046
Neg logit contributions
ding-0.079
tr-0.078
storm-0.073
fire-0.071
 villains-0.069
Token play
Token ID711
Feature activation+1.86e+00

Pos logit contributions
ding+0.020
tr+0.019
fire+0.017
t+0.017
whe+0.017
Neg logit contributions
els-0.019
en-0.018
ubby-0.018
ened-0.018
el-0.018
Token with
Token ID351
Feature activation+1.65e+00

Pos logit contributions
en+0.035
els+0.034
ere+0.032
ened+0.032
ubby+0.032
Neg logit contributions
ding-0.049
tr-0.046
ache-0.041
t-0.040
whe-0.040
Token.
Token ID13
Feature activation-2.49e+00

Pos logit contributions
els+0.028
en+0.025
ubby+0.024
ened+0.024
el+0.024
Neg logit contributions
ding-0.049
tr-0.046
fire-0.044
whe-0.042
storm-0.042
Token From
Token ID3574
Feature activation+4.07e+00

Pos logit contributions
ding+0.063
tr+0.058
fire+0.056
whe+0.053
storm+0.053
Neg logit contributions
els-0.050
en-0.046
ubby-0.046
ere-0.046
el-0.045
Token then
Token ID788
Feature activation+3.61e+00

Pos logit contributions
els+0.104
ubby+0.098
ened+0.097
en+0.096
 angles+0.095
Neg logit contributions
ding-0.080
tr-0.072
fire-0.069
storm-0.066
Snow-0.065
Token on
Token ID319
Feature activation+8.33e+00

Pos logit contributions
els+0.095
ubby+0.089
en+0.089
el+0.088
ened+0.088
Neg logit contributions
ding-0.071
tr-0.065
fire-0.061
storm-0.057
whe-0.056
Token,
Token ID11
Feature activation-3.86e-01

Pos logit contributions
els+0.136
en+0.121
ubby+0.121
ened+0.116
ere+0.116
Neg logit contributions
ding-0.243
tr-0.231
fire-0.219
t-0.216
storm-0.213
Token the
Token ID262
Feature activation+1.59e+00

Pos logit contributions
ding+0.011
tr+0.010
fire+0.010
ache+0.009
storm+0.009
Neg logit contributions
els-0.007
en-0.007
el-0.006
ere-0.006
ened-0.006
Token dog
Token ID3290
Feature activation+6.02e-01

Pos logit contributions
els+0.027
ubby+0.026
ere+0.024
en+0.024
el+0.024
Neg logit contributions
ding-0.046
tr-0.042
uts-0.040
t-0.039
fire-0.039
Token became
Token ID2627
Feature activation+2.09e+00

Pos logit contributions
els+0.012
en+0.012
ubby+0.011
ened+0.011
ere+0.011
Neg logit contributions
ding-0.015
tr-0.014
fire-0.013
whe-0.013
storm-0.013
Token their
Token ID511
Feature activation-1.02e-01

Pos logit contributions
els+0.040
en+0.036
ubby+0.036
ere+0.036
ened+0.036
Neg logit contributions
ding-0.057
tr-0.053
fire-0.050
storm-0.049
t-0.048
Token,
Token ID11
Feature activation+5.53e+00

Pos logit contributions
els+0.143
en+0.132
ened+0.129
 opinions+0.126
 angles+0.122
Neg logit contributions
ding-0.197
t-0.182
tr-0.179
fire-0.166
uts-0.165
Token a
Token ID257
Feature activation+1.79e+00

Pos logit contributions
els+0.140
en+0.133
 opinions+0.131
 angles+0.127
ened+0.127
Neg logit contributions
ding-0.116
tr-0.110
t-0.106
fire-0.100
 he-0.094
Token little
Token ID1310
Feature activation+6.09e+00

Pos logit contributions
ubby+0.019
ere+0.015
els+0.015
el+0.013
en+0.013
Neg logit contributions
ding-0.062
tr-0.061
whe-0.059
fire-0.058
uts-0.057
Token girl
Token ID2576
Feature activation+6.69e+00

Pos logit contributions
ubby+0.105
els+0.104
el+0.099
ere+0.097
en+0.096
Neg logit contributions
ding-0.171
tr-0.163
fire-0.154
whe-0.152
t-0.149
Token named
Token ID3706
Feature activation+7.56e+00

Pos logit contributions
els+0.194
 opinions+0.190
 angles+0.190
efully+0.186
ened+0.185
Neg logit contributions
tr-0.103
ding-0.096
fire-0.082
storm-0.079
t-0.075
Token Lily
Token ID20037
Feature activation+8.98e+00

Pos logit contributions
els+0.204
en+0.190
ened+0.187
 opinions+0.186
ubby+0.186
Neg logit contributions
tr-0.152
ding-0.143
t-0.135
whe-0.115
 much-0.113
Token wanted
Token ID2227
Feature activation+5.10e+00

Pos logit contributions
els+0.235
ubby+0.226
 angles+0.216
 opinions+0.211
ened+0.211
Neg logit contributions
ding-0.172
tr-0.167
fire-0.142
storm-0.139
t-0.136
Token to
Token ID284
Feature activation+1.94e+00

Pos logit contributions
els+0.125
en+0.117
ubby+0.116
ened+0.115
el+0.115
Neg logit contributions
ding-0.109
tr-0.100
fire-0.094
storm-0.089
whe-0.088
Token ride
Token ID6594
Feature activation+3.37e+00

Pos logit contributions
els+0.044
en+0.041
ubby+0.041
ened+0.040
ere+0.040
Neg logit contributions
ding-0.045
tr-0.041
fire-0.039
storm-0.037
whe-0.037
Token her
Token ID607
Feature activation+9.98e-01

Pos logit contributions
els+0.040
el+0.033
en+0.033
ened+0.031
ere+0.031
Neg logit contributions
ding-0.117
fire-0.110
tr-0.110
storm-0.106
t-0.105
Token bike
Token ID7161
Feature activation-3.05e+00

Pos logit contributions
els+0.029
el+0.028
ubby+0.027
ened+0.027
ere+0.027
Neg logit contributions
ding-0.016
fire-0.015
tr-0.014
storm-0.013
whe-0.013
Token the
Token ID262
Feature activation-2.04e+00

Pos logit contributions
els+0.039
ened+0.036
ubby+0.036
ere+0.034
en+0.034
Neg logit contributions
ding-0.061
fire-0.056
tr-0.054
whe-0.053
 explode-0.051
Token garden
Token ID11376
Feature activation-3.75e+00

Pos logit contributions
ding+0.054
tr+0.050
fire+0.048
t+0.047
uts+0.046
Neg logit contributions
els-0.039
en-0.036
ubby-0.036
ere-0.035
el-0.035
Token!"
Token ID2474
Feature activation+6.37e-02

Pos logit contributions
ding+0.095
tr+0.089
fire+0.083
storm+0.080
whe+0.080
Neg logit contributions
els-0.075
en-0.071
ubby-0.069
ere-0.068
el-0.067
Token
Token ID198
Feature activation-2.75e+00

Pos logit contributions
els+0.001
en+0.001
 opinions+0.001
ere+0.001
el+0.001
Neg logit contributions
ding-0.002
tr-0.001
fire-0.001
storm-0.001
ache-0.001
Token
Token ID198
Feature activation-2.14e+00

Pos logit contributions
ding+0.065
tr+0.062
fire+0.058
storm+0.058
ache+0.055
Neg logit contributions
els-0.058
ere-0.055
en-0.055
ubby-0.054
el-0.054
TokenMark
Token ID9704
Feature activation+7.39e+00

Pos logit contributions
ding+0.047
tr+0.042
fire+0.040
storm+0.039
 numb+0.039
Neg logit contributions
els-0.051
en-0.049
ere-0.047
 opinions-0.046
el-0.046
Token replied
Token ID8712
Feature activation+6.46e+00

Pos logit contributions
els+0.165
ubby+0.157
 angles+0.153
ened+0.143
el+0.143
Neg logit contributions
ding-0.166
tr-0.159
fire-0.145
t-0.140
storm-0.137
Token,
Token ID11
Feature activation+7.79e-01

Pos logit contributions
els+0.099
en+0.087
ubby+0.087
ere+0.085
el+0.084
Neg logit contributions
ding-0.199
tr-0.187
fire-0.180
t-0.175
whe-0.174
Token "
Token ID366
Feature activation+4.13e-01

Pos logit contributions
els+0.013
ened+0.012
en+0.012
ubby+0.012
ere+0.012
Neg logit contributions
ding-0.022
tr-0.021
fire-0.020
whe-0.019
storm-0.019
TokenLet
Token ID5756
Feature activation+4.66e+00

Pos logit contributions
en+0.008
 opinions+0.007
els+0.007
ere+0.007
eful+0.007
Neg logit contributions
ding-0.010
tr-0.010
ache-0.009
fire-0.009
 numb-0.009
Token's
Token ID338
Feature activation+5.37e-01

Pos logit contributions
els+0.132
ubby+0.123
 angles+0.116
ened+0.116
ere+0.115
Neg logit contributions
ding-0.083
tr-0.072
t-0.070
storm-0.066
whe-0.065
Token He
Token ID679
Feature activation-1.00e+00

Pos logit contributions
ding+0.043
tr+0.040
fire+0.036
whe+0.033
t+0.033
Neg logit contributions
els-0.077
en-0.073
ubby-0.072
ened-0.072
ere-0.071
Token hugged
Token ID41130
Feature activation+5.11e+00

Pos logit contributions
ding+0.027
tr+0.025
fire+0.024
storm+0.023
whe+0.023
Neg logit contributions
els-0.019
en-0.018
ened-0.018
ubby-0.017
ere-0.017
Token Mia
Token ID32189
Feature activation+1.30e+00

Pos logit contributions
els+0.106
ubby+0.104
ere+0.100
ened+0.096
eful+0.095
Neg logit contributions
ding-0.120
tr-0.110
fire-0.102
whe-0.101
t-0.099
Token back
Token ID736
Feature activation+9.59e-01

Pos logit contributions
els+0.021
en+0.020
ubby+0.019
ened+0.019
el+0.019
Neg logit contributions
ding-0.038
tr-0.036
fire-0.034
storm-0.033
whe-0.033
Token and
Token ID290
Feature activation-2.74e+00

Pos logit contributions
els+0.018
en+0.016
ubby+0.016
ened+0.016
ere+0.016
Neg logit contributions
ding-0.026
tr-0.025
fire-0.023
t-0.022
whe-0.022
Token said
Token ID531
Feature activation+2.78e+00

Pos logit contributions
ding+0.077
tr+0.072
fire+0.069
t+0.067
storm+0.067
Neg logit contributions
els-0.049
en-0.045
ened-0.045
el-0.044
ere-0.043
Token,
Token ID11
Feature activation-1.87e+00

Pos logit contributions
els+0.043
en+0.039
ubby+0.038
ere+0.038
ened+0.037
Neg logit contributions
ding-0.084
tr-0.079
fire-0.076
t-0.075
whe-0.073
Token "
Token ID366
Feature activation-6.43e-01

Pos logit contributions
ding+0.056
tr+0.055
fire+0.053
whe+0.052
storm+0.052
Neg logit contributions
els-0.023
ened-0.022
en-0.021
ubby-0.021
el-0.020
TokenThank
Token ID10449
Feature activation+5.72e+00

Pos logit contributions
ding+0.015
tr+0.014
fire+0.013
storm+0.013
uts+0.013
Neg logit contributions
en-0.013
ened-0.013
ere-0.013
els-0.013
el-0.013
Token you
Token ID345
Feature activation-5.59e-01

Pos logit contributions
els+0.139
en+0.129
ubby+0.129
ened+0.127
ere+0.127
Neg logit contributions
ding-0.124
tr-0.114
fire-0.106
storm-0.101
whe-0.101
Token,
Token ID11
Feature activation-6.82e-01

Pos logit contributions
ding+0.019
tr+0.018
fire+0.017
storm+0.017
whe+0.017
Neg logit contributions
els-0.007
ubby-0.006
en-0.006
el-0.006
ened-0.006
Token
Token ID220
Feature activation+1.00e+00

Pos logit contributions
ding+0.057
tr+0.053
fire+0.050
storm+0.048
whe+0.048
Neg logit contributions
els-0.046
ere-0.043
en-0.043
ubby-0.043
ened-0.043
Token
Token ID198
Feature activation-2.92e+00

Pos logit contributions
els+0.023
ere+0.021
ubby+0.021
en+0.021
el+0.021
Neg logit contributions
ding-0.022
tr-0.021
fire-0.020
storm-0.020
whe-0.019
Token
Token ID198
Feature activation-2.29e+00

Pos logit contributions
ding+0.064
tr+0.061
fire+0.055
storm+0.055
whe+0.052
Neg logit contributions
els-0.068
ere-0.064
ubby-0.064
en-0.063
el-0.062
TokenJack
Token ID14295
Feature activation+6.39e+00

Pos logit contributions
ding+0.053
tr+0.050
storm+0.045
fire+0.044
 numb+0.043
Neg logit contributions
els-0.049
ere-0.048
ubby-0.047
en-0.046
efully-0.045
Token cried
Token ID16896
Feature activation+6.26e+00

Pos logit contributions
els+0.155
ubby+0.144
 angles+0.134
 opinions+0.134
ere+0.134
Neg logit contributions
ding-0.137
tr-0.127
fire-0.124
storm-0.115
t-0.114
Token for
Token ID329
Feature activation+3.84e+00

Pos logit contributions
ened+0.095
en+0.093
els+0.092
ubby+0.086
el+0.085
Neg logit contributions
ding-0.187
tr-0.176
fire-0.166
storm-0.164
whe-0.163
Token help
Token ID1037
Feature activation+3.86e+00

Pos logit contributions
els+0.071
en+0.066
ened+0.065
ubby+0.065
ere+0.064
Neg logit contributions
ding-0.106
tr-0.100
fire-0.090
storm-0.090
t-0.089
Token and
Token ID290
Feature activation-4.13e+00

Pos logit contributions
els+0.059
en+0.055
ened+0.052
el+0.052
ubby+0.052
Neg logit contributions
ding-0.118
tr-0.113
fire-0.104
storm-0.103
whe-0.102
Token soon
Token ID2582
Feature activation+1.92e+00

Pos logit contributions
ding+0.110
tr+0.101
fire+0.092
whe+0.092
storm+0.092
Neg logit contributions
els-0.079
ened-0.076
en-0.075
ere-0.072
el-0.071
Token,
Token ID11
Feature activation-2.22e+00

Pos logit contributions
els+0.049
ened+0.048
en+0.048
ubby+0.046
el+0.046
Neg logit contributions
ding-0.039
tr-0.036
fire-0.031
storm-0.031
whe-0.030
Token a
Token ID257
Feature activation-3.04e+00

Pos logit contributions
ding+0.052
tr+0.051
uts+0.044
storm+0.044
ache+0.043
Neg logit contributions
ened-0.046
els-0.045
en-0.044
el-0.042
 opinions-0.042
Token little
Token ID1310
Feature activation+4.07e+00

Pos logit contributions
ding+0.023
tr+0.022
uts+0.022
t+0.021
whe+0.021
Neg logit contributions
ubby-0.007
ere-0.005
els-0.004
 me-0.004
el-0.004
Token girl
Token ID2576
Feature activation+4.51e+00

Pos logit contributions
ubby+0.062
els+0.058
el+0.057
en+0.054
ere+0.053
Neg logit contributions
ding-0.124
tr-0.119
t-0.113
whe-0.112
fire-0.112
Token.
Token ID13
Feature activation+8.14e-02

Pos logit contributions
els+0.099
en+0.092
ubby+0.092
el+0.090
ere+0.090
Neg logit contributions
ding-0.108
tr-0.100
fire-0.094
whe-0.091
storm-0.090
Token She
Token ID1375
Feature activation+1.62e+00

Pos logit contributions
els+0.002
ubby+0.002
en+0.002
ened+0.002
ere+0.002
Neg logit contributions
ding-0.002
tr-0.002
fire-0.002
whe-0.002
storm-0.002
Token was
Token ID373
Feature activation+2.40e+00

Pos logit contributions
els+0.029
ened+0.028
en+0.027
ere+0.027
el+0.027
Neg logit contributions
ding-0.043
tr-0.040
fire-0.038
whe-0.037
storm-0.036
Token only
Token ID691
Feature activation+2.07e+00

Pos logit contributions
els+0.063
ubby+0.059
en+0.059
ened+0.058
ere+0.058
Neg logit contributions
ding-0.047
tr-0.043
fire-0.040
storm-0.037
whe-0.037
Token three
Token ID1115
Feature activation+3.97e+00

Pos logit contributions
els+0.025
en+0.021
ened+0.020
ere+0.020
ubby+0.020
Neg logit contributions
ding-0.070
tr-0.067
fire-0.064
whe-0.062
t-0.062
Token years
Token ID812
Feature activation+4.97e-01

Pos logit contributions
els+0.077
en+0.072
ened+0.071
ubby+0.071
ere+0.071
Neg logit contributions
ding-0.100
tr-0.092
fire-0.088
whe-0.085
t-0.084
Token old
Token ID1468
Feature activation+3.59e+00

Pos logit contributions
els+0.008
en+0.007
ened+0.007
 opinions+0.007
ere+0.007
Neg logit contributions
ding-0.015
tr-0.013
fire-0.013
t-0.013
storm-0.012
Token,
Token ID11
Feature activation-2.49e+00

Pos logit contributions
els+0.075
en+0.070
ubby+0.069
ened+0.068
el+0.068
Neg logit contributions
ding-0.090
tr-0.084
fire-0.080
storm-0.076
whe-0.076
Token but
Token ID475
Feature activation-4.39e-01

Pos logit contributions
ding+0.066
tr+0.062
fire+0.061
storm+0.057
whe+0.056
Neg logit contributions
els-0.045
en-0.042
ened-0.042
el-0.041
 opinions-0.041
Token you
Token ID345
Feature activation-4.03e+00

Pos logit contributions
els+0.037
el+0.034
en+0.034
ubby+0.033
ened+0.032
Neg logit contributions
ding-0.062
tr-0.058
fire-0.055
storm-0.054
whe-0.053
Token.
Token ID13
Feature activation-3.67e+00

Pos logit contributions
ding+0.103
tr+0.101
fire+0.095
whe+0.093
storm+0.093
Neg logit contributions
els-0.074
el-0.073
en-0.072
ubby-0.070
ened-0.069
Token You
Token ID921
Feature activation-1.67e+00

Pos logit contributions
ding+0.088
tr+0.083
fire+0.078
whe+0.074
storm+0.074
Neg logit contributions
els-0.080
en-0.074
ubby-0.073
ened-0.072
ere-0.072
Token should
Token ID815
Feature activation-8.44e-01

Pos logit contributions
ding+0.045
tr+0.041
fire+0.040
storm+0.038
whe+0.038
Neg logit contributions
els-0.031
el-0.029
ubby-0.029
 opinions-0.028
en-0.028
Token not
Token ID407
Feature activation-1.23e+00

Pos logit contributions
ding+0.016
tr+0.015
fire+0.014
storm+0.013
t+0.013
Neg logit contributions
els-0.022
ened-0.021
ubby-0.021
en-0.021
ere-0.020
Token poke
Token ID22620
Feature activation+2.44e+00

Pos logit contributions
ding+0.032
fire+0.028
tr+0.028
 lifetime+0.026
storm+0.026
Neg logit contributions
els-0.024
en-0.024
el-0.023
 opinions-0.022
ubby-0.021
Token their
Token ID511
Feature activation-2.36e+00

Pos logit contributions
els+0.043
en+0.039
ubby+0.039
ened+0.038
el+0.038
Neg logit contributions
ding-0.070
tr-0.065
fire-0.062
storm-0.060
whe-0.060
Token home
Token ID1363
Feature activation-4.43e+00

Pos logit contributions
ding+0.074
tr+0.066
t+0.064
storm+0.064
whe+0.064
Neg logit contributions
els-0.040
 opinions-0.036
el-0.035
 ways-0.034
 angles-0.034
Token.
Token ID13
Feature activation-4.04e+00

Pos logit contributions
ding+0.114
tr+0.110
storm+0.103
whe+0.100
fire+0.098
Neg logit contributions
els-0.084
en-0.081
el-0.078
ubby-0.076
 ways-0.071
Token They
Token ID1119
Feature activation-2.78e+00

Pos logit contributions
ding+0.095
tr+0.090
fire+0.084
t+0.080
storm+0.080
Neg logit contributions
els-0.089
en-0.082
ened-0.082
ere-0.081
ubby-0.081
Token live
Token ID2107
Feature activation-2.35e-01

Pos logit contributions
ding+0.086
tr+0.077
fire+0.075
storm+0.073
whe+0.072
Neg logit contributions
els-0.043
el-0.041
 opinions-0.040
en-0.038
ened-0.037
Token he
Token ID339
Feature activation-3.59e-01

Pos logit contributions
ding+0.082
tr+0.076
fire+0.072
whe+0.070
t+0.070
Neg logit contributions
ened-0.053
els-0.051
en-0.049
el-0.047
ered-0.045
Token fell
Token ID3214
Feature activation+2.64e+00

Pos logit contributions
ding+0.010
tr+0.009
fire+0.009
storm+0.009
whe+0.009
Neg logit contributions
ened-0.006
els-0.006
en-0.006
el-0.006
ered-0.005
Token down
Token ID866
Feature activation+9.44e-01

Pos logit contributions
els+0.043
en+0.043
ened+0.043
el+0.041
ubby+0.039
Neg logit contributions
ding-0.073
fire-0.073
tr-0.069
storm-0.066
whe-0.064
Token and
Token ID290
Feature activation-2.90e+00

Pos logit contributions
en+0.014
els+0.014
ened+0.014
el+0.013
ubby+0.012
Neg logit contributions
ding-0.027
tr-0.027
fire-0.024
storm-0.024
ongo-0.024
Token got
Token ID1392
Feature activation+1.78e+00

Pos logit contributions
ding+0.065
fire+0.060
tr+0.058
uts+0.055
whe+0.053
Neg logit contributions
ened-0.068
 me-0.063
el-0.060
els-0.060
 asks-0.060
Token hurt
Token ID5938
Feature activation+3.41e+00

Pos logit contributions
els+0.047
el+0.044
ened+0.044
ubby+0.042
en+0.042
Neg logit contributions
ding-0.037
tr-0.034
fire-0.032
storm-0.031
t-0.028
Token.
Token ID13
Feature activation-1.84e+00

Pos logit contributions
el+0.062
en+0.061
els+0.061
ubby+0.057
ened+0.055
Neg logit contributions
ding-0.098
tr-0.095
storm-0.089
fire-0.087
 lifetime-0.080
Token His
Token ID2399
Feature activation+1.96e+00

Pos logit contributions
ding+0.049
tr+0.045
fire+0.045
whe+0.042
storm+0.041
Neg logit contributions
els-0.033
ened-0.032
el-0.031
en-0.031
ere-0.031
Token mom
Token ID1995
Feature activation+2.67e+00

Pos logit contributions
els+0.032
ubby+0.030
el+0.029
ened+0.029
en+0.028
Neg logit contributions
ding-0.059
tr-0.055
fire-0.052
whe-0.052
storm-0.050
Tokenmy
Token ID1820
Feature activation+2.65e+00

Pos logit contributions
els+0.051
ubby+0.048
 opinions+0.047
ened+0.047
ere+0.047
Neg logit contributions
ding-0.069
tr-0.065
t-0.061
ache-0.059
fire-0.059
Token and
Token ID290
Feature activation-2.10e+00

Pos logit contributions
els+0.048
ubby+0.044
en+0.044
ened+0.043
ere+0.043
Neg logit contributions
ding-0.073
tr-0.069
fire-0.065
t-0.063
storm-0.063
Token time
Token ID640
Feature activation+1.99e+00

Pos logit contributions
els+0.061
en+0.055
ubby+0.055
ened+0.055
ere+0.054
Neg logit contributions
ding-0.050
tr-0.045
fire-0.043
storm-0.041
t-0.041
Token the
Token ID262
Feature activation+1.35e+00

Pos logit contributions
els+0.032
ened+0.030
en+0.030
el+0.029
ere+0.029
Neg logit contributions
ding-0.058
tr-0.056
fire-0.052
storm-0.051
whe-0.050
Token little
Token ID1310
Feature activation+3.45e+00

Pos logit contributions
els+0.036
en+0.033
ened+0.032
el+0.032
ere+0.032
Neg logit contributions
ding-0.027
tr-0.024
fire-0.024
storm-0.023
t-0.022
Token boy
Token ID2933
Feature activation+1.79e+00

Pos logit contributions
els+0.089
en+0.080
ubby+0.079
ere+0.079
el+0.078
Neg logit contributions
ding-0.072
tr-0.064
fire-0.061
whe-0.058
storm-0.058
Token visited
Token ID8672
Feature activation+5.73e+00

Pos logit contributions
els+0.038
ubby+0.035
en+0.034
ere+0.034
ened+0.034
Neg logit contributions
ding-0.044
tr-0.041
fire-0.039
storm-0.037
t-0.037
Token,
Token ID11
Feature activation-1.39e+00

Pos logit contributions
els+0.084
ubby+0.075
en+0.072
ened+0.071
ere+0.071
Neg logit contributions
ding-0.180
tr-0.168
fire-0.164
whe-0.157
storm-0.156
Token the
Token ID262
Feature activation+6.75e-01

Pos logit contributions
ding+0.031
tr+0.031
storm+0.027
fire+0.026
t+0.026
Neg logit contributions
ened-0.031
en-0.031
els-0.031
el-0.029
ere-0.029
Token creek
Token ID39459
Feature activation-4.00e+00

Pos logit contributions
els+0.019
el+0.017
en+0.017
ened+0.017
ere+0.016
Neg logit contributions
ding-0.013
fire-0.012
tr-0.011
t-0.010
storm-0.010
Token was
Token ID373
Feature activation-7.05e-01

Pos logit contributions
ding+0.101
tr+0.095
fire+0.087
whe+0.086
storm+0.085
Neg logit contributions
els-0.080
en-0.075
ened-0.074
ubby-0.073
ere-0.073
Token full
Token ID1336
Feature activation+1.47e+00

Pos logit contributions
ding+0.015
tr+0.014
fire+0.013
whe+0.012
storm+0.012
Neg logit contributions
els-0.017
en-0.016
ubby-0.016
ened-0.016
ere-0.016
Token and
Token ID290
Feature activation-5.48e+00

Pos logit contributions
els+0.030
en+0.028
ened+0.028
ubby+0.028
el+0.027
Neg logit contributions
ding-0.037
tr-0.035
fire-0.032
storm-0.031
whe-0.031
Token and
Token ID290
Feature activation-5.84e+00

Pos logit contributions
els+0.021
en+0.021
ened+0.020
el+0.019
 opinions+0.019
Neg logit contributions
ding-0.024
tr-0.023
fire-0.022
whe-0.020
storm-0.020
Token made
Token ID925
Feature activation+4.88e-01

Pos logit contributions
ding+0.140
tr+0.127
fire+0.123
whe+0.117
storm+0.115
Neg logit contributions
els-0.127
ened-0.123
el-0.119
en-0.119
 opinions-0.116
Token her
Token ID607
Feature activation-2.14e+00

Pos logit contributions
els+0.006
en+0.005
el+0.005
ened+0.005
ubby+0.005
Neg logit contributions
ding-0.017
tr-0.016
fire-0.015
storm-0.015
whe-0.015
Token face
Token ID1986
Feature activation-4.12e+00

Pos logit contributions
ding+0.059
fire+0.056
tr+0.053
storm+0.053
whe+0.050
Neg logit contributions
el-0.039
els-0.038
 opinions-0.037
ened-0.036
en-0.036
Token funny
Token ID8258
Feature activation+7.35e-01

Pos logit contributions
ding+0.101
fire+0.093
tr+0.091
storm+0.088
whe+0.085
Neg logit contributions
en-0.087
els-0.086
ened-0.085
el-0.084
 opinions-0.079
Token.
Token ID13
Feature activation-3.21e+00

Pos logit contributions
els+0.016
en+0.016
 opinions+0.015
ened+0.015
el+0.015
Neg logit contributions
ding-0.018
tr-0.017
fire-0.016
storm-0.015
t-0.015
Token But
Token ID887
Feature activation-2.19e-01

Pos logit contributions
ding+0.072
tr+0.068
t+0.061
fire+0.060
whe+0.060
Neg logit contributions
els-0.076
en-0.071
ubby-0.070
ened-0.070
ere-0.068
Token she
Token ID673
Feature activation-5.50e-01

Pos logit contributions
ding+0.005
tr+0.005
fire+0.005
t+0.005
whe+0.005
Neg logit contributions
els-0.004
ubby-0.004
ere-0.004
en-0.004
ened-0.004
Token did
Token ID750
Feature activation+1.79e+00

Pos logit contributions
ding+0.014
tr+0.013
fire+0.012
whe+0.012
storm+0.012
Neg logit contributions
els-0.011
en-0.011
ubby-0.010
ened-0.010
el-0.010
Token not
Token ID407
Feature activation-3.26e-01

Pos logit contributions
els+0.048
en+0.045
el+0.043
 opinions+0.043
ened+0.043
Neg logit contributions
ding-0.035
tr-0.032
fire-0.030
storm-0.029
whe-0.028
Token scream
Token ID8196
Feature activation+3.51e+00

Pos logit contributions
ding+0.009
tr+0.008
fire+0.008
storm+0.007
whe+0.007
Neg logit contributions
els-0.006
en-0.006
ubby-0.006
el-0.006
ened-0.006
Token away
Token ID1497
Feature activation+1.37e+00

Pos logit contributions
els+0.008
ubby+0.007
ened+0.007
en+0.007
ere+0.007
Neg logit contributions
ding-0.009
tr-0.009
fire-0.008
t-0.008
whe-0.008
Token!"
Token ID2474
Feature activation-1.43e+00

Pos logit contributions
els+0.034
ened+0.031
ere+0.031
ubby+0.031
el+0.031
Neg logit contributions
ding-0.029
tr-0.026
fire-0.026
t-0.024
storm-0.023
Token
Token ID198
Feature activation-3.69e+00

Pos logit contributions
ding+0.035
tr+0.035
fire+0.033
storm+0.033
ache+0.031
Neg logit contributions
els-0.028
el-0.026
ere-0.026
en-0.025
ubby-0.025
Token
Token ID198
Feature activation-2.97e+00

Pos logit contributions
ding+0.089
tr+0.086
storm+0.080
fire+0.079
ache+0.074
Neg logit contributions
els-0.077
ere-0.072
el-0.071
ubby-0.071
en-0.070
TokenThe
Token ID464
Feature activation+2.03e+00

Pos logit contributions
ding+0.070
tr+0.066
storm+0.060
fire+0.059
t+0.057
Neg logit contributions
els-0.067
ere-0.062
en-0.060
ubby-0.060
el-0.060
Token monster
Token ID9234
Feature activation-2.95e-01

Pos logit contributions
els+0.054
ubby+0.051
en+0.051
ened+0.050
el+0.050
Neg logit contributions
ding-0.039
tr-0.035
fire-0.033
whe-0.031
storm-0.031
Token grow
Token ID1663
Feature activation+1.63e+00

Pos logit contributions
ding+0.008
tr+0.007
fire+0.007
whe+0.007
storm+0.007
Neg logit contributions
els-0.006
ened-0.005
en-0.005
el-0.005
ere-0.005
Tokenled
Token ID992
Feature activation+6.05e+00

Pos logit contributions
els+0.030
ere+0.028
ubby+0.028
en+0.028
 opinions+0.026
Neg logit contributions
ding-0.043
tr-0.040
fire-0.039
t-0.038
whe-0.037
Token louder
Token ID27089
Feature activation+3.80e+00

Pos logit contributions
els+0.119
ubby+0.109
en+0.108
ere+0.107
ened+0.106
Neg logit contributions
ding-0.158
tr-0.148
fire-0.140
whe-0.134
storm-0.134
Token.
Token ID13
Feature activation-2.24e+00

Pos logit contributions
els+0.075
en+0.069
ubby+0.069
ened+0.068
el+0.067
Neg logit contributions
ding-0.100
tr-0.093
fire-0.088
whe-0.085
storm-0.085
Token It
Token ID632
Feature activation-2.73e+00

Pos logit contributions
ding+0.054
tr+0.050
fire+0.047
storm+0.045
whe+0.045
Neg logit contributions
els-0.049
ubby-0.045
en-0.045
ened-0.045
el-0.044
Token.
Token ID13
Feature activation-1.60e+00

Pos logit contributions
els+0.091
ubby+0.084
en+0.084
ere+0.082
ened+0.082
Neg logit contributions
ding-0.091
tr-0.085
fire-0.079
whe-0.076
storm-0.076
Token He
Token ID679
Feature activation-2.17e-01

Pos logit contributions
ding+0.033
tr+0.030
fire+0.029
storm+0.027
t+0.027
Neg logit contributions
els-0.039
ubby-0.037
en-0.037
ere-0.036
el-0.036
Token says
Token ID1139
Feature activation+3.31e+00

Pos logit contributions
tr+0.006
ding+0.005
ache+0.005
whe+0.005
 hon+0.005
Neg logit contributions
ubby-0.004
els-0.004
ened-0.004
el-0.004
 angles-0.004
Token,
Token ID11
Feature activation-1.36e+00

Pos logit contributions
els+0.052
en+0.046
ubby+0.045
ere+0.045
ened+0.044
Neg logit contributions
ding-0.101
tr-0.094
fire-0.090
t-0.089
whe-0.088
Token "
Token ID366
Feature activation-3.86e-01

Pos logit contributions
ding+0.039
tr+0.038
fire+0.036
whe+0.035
storm+0.035
Neg logit contributions
els-0.021
ubby-0.019
ened-0.019
en-0.018
ere-0.018
TokenMy
Token ID3666
Feature activation+5.35e-01

Pos logit contributions
ding+0.010
tr+0.010
fire+0.009
ache+0.009
 numb+0.009
Neg logit contributions
 opinions-0.006
ere-0.006
efully-0.006
el-0.006
 asks-0.006
Token dad
Token ID9955
Feature activation+6.11e-01

Pos logit contributions
els+0.013
ubby+0.012
 opinions+0.011
en+0.011
ere+0.011
Neg logit contributions
ding-0.012
tr-0.011
fire-0.010
whe-0.010
storm-0.010
Token got
Token ID1392
Feature activation+9.34e-01

Pos logit contributions
els+0.011
ubby+0.010
en+0.009
ened+0.009
ere+0.009
Neg logit contributions
ding-0.017
tr-0.016
fire-0.015
t-0.015
whe-0.015
Token it
Token ID340
Feature activation-5.34e+00

Pos logit contributions
els+0.017
en+0.016
ubby+0.015
ened+0.015
el+0.015
Neg logit contributions
ding-0.026
tr-0.024
fire-0.023
storm-0.022
whe-0.022
Token for
Token ID329
Feature activation+2.12e-01

Pos logit contributions
ding+0.135
tr+0.126
fire+0.125
whe+0.118
storm+0.118
Neg logit contributions
els-0.099
en-0.095
el-0.094
ubby-0.093
ened-0.090
Token me
Token ID502
Feature activation-2.56e+00

Pos logit contributions
els+0.004
en+0.003
ened+0.003
ubby+0.003
el+0.003
Neg logit contributions
ding-0.006
tr-0.005
fire-0.005
whe-0.005
storm-0.005
Token them
Token ID606
Feature activation-4.32e-01

Pos logit contributions
ubby+0.113
els+0.106
ened+0.102
ere+0.100
 opinions+0.099
Neg logit contributions
ding-0.140
tr-0.124
 he-0.118
t-0.117
fire-0.117
Token new
Token ID649
Feature activation+9.64e-01

Pos logit contributions
ding+0.011
tr+0.010
fire+0.009
whe+0.009
t+0.009
Neg logit contributions
els-0.009
ubby-0.008
en-0.008
ened-0.008
ere-0.008
Token v
Token ID410
Feature activation-1.67e+00

Pos logit contributions
els+0.020
ubby+0.020
en+0.019
ere+0.019
ened+0.019
Neg logit contributions
tr-0.022
ding-0.022
fire-0.019
whe-0.019
t-0.018
Tokenests
Token ID3558
Feature activation-4.28e+00

Pos logit contributions
ding+0.039
fire+0.035
tr+0.035
storm+0.034
 numb+0.034
Neg logit contributions
els-0.038
ubby-0.034
ere-0.034
en-0.033
el-0.033
Token.
Token ID13
Feature activation-1.04e+00

Pos logit contributions
ding+0.108
tr+0.105
fire+0.101
storm+0.100
 villains+0.096
Neg logit contributions
els-0.079
en-0.072
el-0.072
ened-0.070
ubby-0.068
Token They
Token ID1119
Feature activation+1.64e-01

Pos logit contributions
ding+0.021
tr+0.019
 he+0.019
uts+0.018
fall+0.018
Neg logit contributions
els-0.026
en-0.024
ere-0.023
ered-0.023
ened-0.022
Token are
Token ID389
Feature activation+3.88e-02

Pos logit contributions
els+0.004
ubby+0.003
en+0.003
ened+0.003
ere+0.003
Neg logit contributions
ding-0.004
tr-0.004
fire-0.003
whe-0.003
storm-0.003
Token blue
Token ID4171
Feature activation+1.28e+00

Pos logit contributions
els+0.001
en+0.001
ubby+0.001
ened+0.001
ere+0.001
Neg logit contributions
ding-0.001
tr-0.001
fire-0.001
whe-0.001
storm-0.001
Token and
Token ID290
Feature activation-5.56e+00

Pos logit contributions
els+0.022
en+0.020
ubby+0.020
ened+0.019
ere+0.019
Neg logit contributions
ding-0.037
tr-0.035
fire-0.033
whe-0.032
storm-0.032
Token warm
Token ID5814
Feature activation+1.95e+00

Pos logit contributions
ding+0.146
tr+0.137
fire+0.130
storm+0.126
whe+0.125
Neg logit contributions
els-0.108
ubby-0.099
en-0.098
ened-0.097
el-0.096
Token and
Token ID290
Feature activation-5.67e+00

Pos logit contributions
els+0.036
en+0.036
ened+0.034
el+0.033
 opinions+0.032
Neg logit contributions
ding-0.051
tr-0.048
fire-0.046
t-0.045
whe-0.045
Token player
Token ID2137
Feature activation-4.38e+00

Pos logit contributions
ding+0.168
tr+0.154
whe+0.139
storm+0.135
 numb+0.135
Neg logit contributions
els-0.151
en-0.146
ubby-0.145
 opinions-0.140
ened-0.137
Token.
Token ID13
Feature activation-3.41e+00

Pos logit contributions
ding+0.111
tr+0.104
fire+0.095
whe+0.094
storm+0.093
Neg logit contributions
els-0.088
en-0.083
ubby-0.083
ened-0.079
el-0.079
Token Not
Token ID1892
Feature activation-3.09e+00

Pos logit contributions
ding+0.084
tr+0.078
fire+0.073
whe+0.070
storm+0.070
Neg logit contributions
els-0.072
en-0.067
ubby-0.067
ened-0.066
ere-0.065
Token the
Token ID262
Feature activation-2.02e+00

Pos logit contributions
ding+0.077
tr+0.072
fire+0.067
storm+0.065
whe+0.062
Neg logit contributions
els-0.064
en-0.061
ubby-0.061
ened-0.059
 opinions-0.058
Token new
Token ID649
Feature activation-1.23e+00

Pos logit contributions
ding+0.053
tr+0.049
fire+0.045
storm+0.044
t+0.044
Neg logit contributions
els-0.039
ubby-0.039
en-0.037
 opinions-0.037
ened-0.036
Token hockey
Token ID12217
Feature activation-6.43e+00

Pos logit contributions
ding+0.029
tr+0.027
fire+0.025
storm+0.024
whe+0.024
Neg logit contributions
els-0.028
en-0.026
ubby-0.026
ened-0.026
 opinions-0.025
Token stick
Token ID4859
Feature activation-6.02e+00

Pos logit contributions
ding+0.175
tr+0.164
whe+0.150
storm+0.149
fire+0.149
Neg logit contributions
els-0.117
en-0.110
ubby-0.110
ened-0.106
 opinions-0.105
Token."
Token ID526
Feature activation-9.78e-01

Pos logit contributions
ding+0.167
tr+0.159
fire+0.149
whe+0.146
storm+0.145
Neg logit contributions
els-0.107
en-0.099
ubby-0.097
el-0.096
ened-0.095
Token
Token ID198
Feature activation-3.26e+00

Pos logit contributions
ding+0.024
tr+0.023
fire+0.021
storm+0.021
whe+0.020
Neg logit contributions
els-0.021
el-0.019
ere-0.019
ubby-0.019
en-0.019
Token
Token ID198
Feature activation-2.67e+00

Pos logit contributions
ding+0.072
tr+0.068
fire+0.062
storm+0.061
whe+0.059
Neg logit contributions
els-0.077
ubby-0.072
ere-0.072
en-0.071
el-0.071
TokenBen
Token ID11696
Feature activation+6.26e+00

Pos logit contributions
ding+0.064
tr+0.060
storm+0.055
fire+0.054
t+0.052
Neg logit contributions
els-0.058
ere-0.056
ubby-0.055
en-0.053
el-0.053
Token stream
Token ID4269
Feature activation-2.58e+00

Pos logit contributions
els+0.038
ubby+0.035
en+0.035
el+0.034
ened+0.034
Neg logit contributions
ding-0.046
tr-0.043
fire-0.039
whe-0.039
storm-0.038
Token began
Token ID2540
Feature activation+1.05e-01

Pos logit contributions
ding+0.065
tr+0.060
fire+0.056
whe+0.054
storm+0.054
Neg logit contributions
els-0.053
en-0.049
ubby-0.049
ened-0.049
ere-0.048
Token to
Token ID284
Feature activation-8.52e-01

Pos logit contributions
els+0.002
en+0.002
ened+0.002
el+0.002
ubby+0.002
Neg logit contributions
ding-0.003
tr-0.002
fire-0.002
storm-0.002
whe-0.002
Token get
Token ID651
Feature activation+1.99e-01

Pos logit contributions
ding+0.019
tr+0.018
uts+0.016
t+0.016
fire+0.016
Neg logit contributions
en-0.020
els-0.019
ened-0.018
el-0.018
 opinions-0.017
Token bigger
Token ID5749
Feature activation+7.46e-01

Pos logit contributions
els+0.005
en+0.004
ened+0.004
el+0.004
ubby+0.004
Neg logit contributions
ding-0.005
tr-0.004
fire-0.004
storm-0.004
whe-0.004
Token and
Token ID290
Feature activation-6.29e+00

Pos logit contributions
els+0.012
en+0.011
ened+0.011
ubby+0.011
el+0.010
Neg logit contributions
ding-0.022
tr-0.021
fire-0.020
storm-0.019
t-0.019
Token bigger
Token ID5749
Feature activation+4.99e-01

Pos logit contributions
ding+0.162
tr+0.152
fire+0.143
storm+0.140
uts+0.138
Neg logit contributions
els-0.122
en-0.115
ened-0.114
el-0.109
ubby-0.107
Token,
Token ID11
Feature activation-4.12e+00

Pos logit contributions
els+0.010
en+0.009
ened+0.009
ubby+0.009
el+0.009
Neg logit contributions
ding-0.013
tr-0.012
fire-0.011
storm-0.011
t-0.011
Token and
Token ID290
Feature activation-5.00e+00

Pos logit contributions
ding+0.103
tr+0.097
fire+0.091
whe+0.088
t+0.088
Neg logit contributions
els-0.084
en-0.079
ened-0.078
ubby-0.075
el-0.075
Token soon
Token ID2582
Feature activation+1.04e+00

Pos logit contributions
ding+0.142
tr+0.134
fire+0.128
storm+0.125
uts+0.125
Neg logit contributions
els-0.084
ened-0.078
en-0.076
el-0.074
 opinions-0.072
Token there
Token ID612
Feature activation+3.31e-02

Pos logit contributions
els+0.017
ened+0.017
en+0.017
el+0.016
ere+0.016
Neg logit contributions
ding-0.030
tr-0.029
fire-0.027
storm-0.026
whe-0.026
Token It
Token ID632
Feature activation-3.84e+00

Pos logit contributions
ding+0.049
tr+0.046
fire+0.043
whe+0.042
storm+0.041
Neg logit contributions
els-0.043
en-0.040
ubby-0.040
ened-0.039
ere-0.039
Token is
Token ID318
Feature activation-2.96e+00

Pos logit contributions
ding+0.114
tr+0.103
fire+0.101
whe+0.097
storm+0.096
Neg logit contributions
els-0.063
en-0.058
ened-0.056
el-0.056
ubby-0.054
Token big
Token ID1263
Feature activation+1.15e+00

Pos logit contributions
ding+0.069
tr+0.064
fire+0.060
whe+0.058
storm+0.058
Neg logit contributions
els-0.067
ubby-0.062
en-0.061
ened-0.060
ere-0.060
Token and
Token ID290
Feature activation-6.96e+00

Pos logit contributions
els+0.022
ubby+0.020
ere+0.020
el+0.019
en+0.019
Neg logit contributions
ding-0.031
tr-0.029
fire-0.027
storm-0.026
whe-0.026
Token black
Token ID2042
Feature activation+1.19e+00

Pos logit contributions
ding+0.187
tr+0.168
fire+0.163
storm+0.161
whe+0.160
Neg logit contributions
els-0.124
ened-0.115
el-0.114
ubby-0.113
 opinions-0.110
Token and
Token ID290
Feature activation-6.95e+00

Pos logit contributions
els+0.024
ubby+0.022
en+0.022
ere+0.022
ened+0.021
Neg logit contributions
ding-0.031
tr-0.029
fire-0.027
storm-0.026
whe-0.026
Token has
Token ID468
Feature activation-1.48e+00

Pos logit contributions
ding+0.202
fire+0.178
tr+0.177
t+0.174
ache+0.173
Neg logit contributions
els-0.113
el-0.104
ened-0.102
ubby-0.098
 opinions-0.095
Token many
Token ID867
Feature activation-1.90e+00

Pos logit contributions
ding+0.041
tr+0.038
fire+0.037
storm+0.036
whe+0.035
Neg logit contributions
els-0.026
en-0.024
ened-0.023
ubby-0.023
el-0.023
Token buttons
Token ID12163
Feature activation-4.75e+00

Pos logit contributions
ding+0.053
tr+0.049
fire+0.047
whe+0.046
t+0.046
Neg logit contributions
els-0.031
ubby-0.030
en-0.029
ere-0.029
el-0.028
Token.
Token ID13
Feature activation-2.44e+00

Pos logit contributions
ding+0.106
tr+0.100
fire+0.096
storm+0.090
whe+0.090
Neg logit contributions
els-0.111
en-0.103
ened-0.101
el-0.100
ubby-0.099
Token Anna
Token ID11735
Feature activation+4.33e+00

Pos logit contributions
ding+0.058
tr+0.054
fire+0.050
whe+0.049
t+0.048
Neg logit contributions
els-0.053
en-0.050
ened-0.049
ubby-0.049
ere-0.048
Token wished
Token ID16555
Feature activation+1.89e+00

Pos logit contributions
ding+0.027
tr+0.025
fire+0.024
storm+0.023
whe+0.023
Neg logit contributions
els-0.018
ubby-0.016
en-0.016
ened-0.016
ere-0.016
Token they
Token ID484
Feature activation-3.50e+00

Pos logit contributions
els+0.052
en+0.049
ened+0.048
 opinions+0.048
el+0.048
Neg logit contributions
ding-0.035
tr-0.032
fire-0.029
storm-0.028
whe-0.027
Token had
Token ID550
Feature activation-7.08e-01

Pos logit contributions
ding+0.089
tr+0.083
fire+0.079
fall+0.076
whe+0.076
Neg logit contributions
els-0.063
ened-0.061
 opinions-0.058
en-0.058
ered-0.057
Token shared
Token ID4888
Feature activation+2.86e+00

Pos logit contributions
ding+0.017
tr+0.015
fire+0.015
storm+0.014
t+0.014
Neg logit contributions
els-0.015
 opinions-0.014
ened-0.014
en-0.014
ere-0.014
Token the
Token ID262
Feature activation-5.59e+00

Pos logit contributions
els+0.047
en+0.042
ened+0.041
ubby+0.040
 opinions+0.040
Neg logit contributions
ding-0.086
tr-0.080
fire-0.076
storm-0.075
whe-0.074
Token ball
Token ID2613
Feature activation-8.42e+00

Pos logit contributions
ding+0.130
tr+0.118
fire+0.114
whe+0.107
storm+0.107
Neg logit contributions
els-0.126
ubby-0.117
en-0.116
ened-0.115
el-0.114
Token and
Token ID290
Feature activation-6.30e+00

Pos logit contributions
ding+0.281
tr+0.255
fall+0.247
storm+0.247
Snow+0.246
Neg logit contributions
els-0.097
en-0.092
el-0.088
 ways-0.079
ubby-0.075
Token played
Token ID2826
Feature activation+5.97e-01

Pos logit contributions
ding+0.182
tr+0.163
fire+0.155
t+0.155
storm+0.153
Neg logit contributions
els-0.109
 opinions-0.106
ened-0.102
en-0.099
 ideas-0.094
Token together
Token ID1978
Feature activation-1.35e+00

Pos logit contributions
els+0.012
en+0.011
 opinions+0.011
ere+0.011
 ideas+0.011
Neg logit contributions
ding-0.015
tr-0.014
storm-0.013
t-0.013
ache-0.013
Token.
Token ID13
Feature activation-3.22e+00

Pos logit contributions
ding+0.035
tr+0.032
storm+0.029
fire+0.029
whe+0.029
Neg logit contributions
els-0.026
en-0.026
ened-0.024
el-0.023
 opinions-0.023
Token They
Token ID1119
Feature activation-1.43e+00

Pos logit contributions
ding+0.070
tr+0.065
fire+0.061
storm+0.058
whe+0.057
Neg logit contributions
els-0.078
en-0.073
ubby-0.072
ened-0.071
ere-0.071
Token the
Token ID262
Feature activation-9.66e-01

Pos logit contributions
ding+0.094
tr+0.088
fire+0.083
whe+0.080
storm+0.080
Neg logit contributions
els-0.073
en-0.068
ubby-0.067
ened-0.066
ere-0.065
Token scarf
Token ID36953
Feature activation-6.18e+00

Pos logit contributions
ding+0.025
tr+0.021
t+0.021
fire+0.021
whe+0.021
Neg logit contributions
els-0.020
ened-0.019
ubby-0.019
 opinions-0.019
el-0.018
Token,
Token ID11
Feature activation-2.50e+00

Pos logit contributions
ding+0.161
tr+0.148
fire+0.142
storm+0.139
whe+0.135
Neg logit contributions
els-0.117
ened-0.114
en-0.111
el-0.108
ered-0.102
Token making
Token ID1642
Feature activation+1.08e+00

Pos logit contributions
ding+0.063
tr+0.059
fire+0.056
whe+0.054
storm+0.054
Neg logit contributions
els-0.052
en-0.047
ubby-0.047
ened-0.047
el-0.046
Token holes
Token ID10421
Feature activation-5.06e+00

Pos logit contributions
els+0.021
en+0.019
ened+0.019
ubby+0.019
el+0.018
Neg logit contributions
ding-0.029
tr-0.027
fire-0.026
storm-0.025
t-0.025
Token and
Token ID290
Feature activation-7.05e+00

Pos logit contributions
ding+0.128
tr+0.120
fire+0.112
storm+0.109
whe+0.107
Neg logit contributions
els-0.104
en-0.099
ened-0.096
ubby-0.094
el-0.094
Token r
Token ID374
Feature activation-9.86e-01

Pos logit contributions
ding+0.189
tr+0.170
t+0.163
fire+0.161
whe+0.159
Neg logit contributions
els-0.141
en-0.131
ened-0.130
el-0.125
 opinions-0.124
Tokenips
Token ID2419
Feature activation-1.68e+00

Pos logit contributions
ding+0.026
tr+0.025
fire+0.023
whe+0.022
storm+0.022
Neg logit contributions
els-0.019
en-0.017
ubby-0.017
ened-0.017
ere-0.017
Token.
Token ID13
Feature activation-2.75e+00

Pos logit contributions
ding+0.042
tr+0.040
storm+0.038
fire+0.037
whe+0.035
Neg logit contributions
els-0.034
en-0.033
ened-0.032
el-0.032
ubby-0.030
Token He
Token ID679
Feature activation-1.14e+00

Pos logit contributions
ding+0.062
tr+0.057
fire+0.054
storm+0.052
whe+0.051
Neg logit contributions
els-0.063
ubby-0.059
en-0.058
ere-0.058
ened-0.058
Token thought
Token ID1807
Feature activation+2.05e+00

Pos logit contributions
ding+0.030
tr+0.028
fire+0.027
whe+0.026
storm+0.026
Neg logit contributions
els-0.022
en-0.020
ened-0.020
ubby-0.020
el-0.020