TOP ACTIVATIONS
MAX = 0.167

for you ." U ma carefully took the key
were very happy . <|endoftext|> E lla loved to visit her
the fire work . <|endoftext|> E dd ie was a three
he shook it . <|endoftext|> Pa ola and her mother were
it out . U nexpected ly , Jenny discovered
a loving home . <|endoftext|> E lla was a happy three
, a gentle girl named Ci ara stumbled upon a lovely
first friend said , " U gh , this part is
the ancient eye . <|endoftext|> E lla was having a special
. They say , " U gly !" " Pretty !"
bit of magic . <|endoftext|> Isa ac was preparing breakfast in
there were two friends , Ab ig ail and Mat ty
it . U ps et about the poor
stream ers . U ma jumped up and down
mad and said , " U gh , what are you
was their friend . <|endoftext|> E lla and Ben were twins
time ! â <|endoftext|> Pa olo was very excited .
that he wanted . <|endoftext|> Ak io was a 3 year
colorful door . U ma was a 3 year
, a p anda named P ao was very hungry .

MIN ACTIVATIONS
MIN = -0.138

be up high on the roof , but they both enjoyed
build a nest on this roof , but I can 't
for daddy to fix the roof . You could fall and
had someone to share the hay with . Then , she
through a crack in the roof . They sat down and
that they could fill the roof with water and make it
off to sit on the roof and sun bat he again
wanted to keep the ash tr ay . So he tried
take a march around the books he lf like he had
would need to mend the roof , quickly .
your fingers in the ash tr ay , so Jon had
not played with the ash tr ay . They wished they
spend the day exploring this roof and having fun !" said
all the books on the books he lf .
not to hang onto the branches too hard . We don
the books back on the books he lf , smiling proudly
shining brightly up on the roof of the building . Tom
money to build a new roof . The father was a
cozy nest on the green roof . From that day on
of being alone on the roof . She tried to follow

INTERVAL 0.091 - 0.167
CONTAINS 1.478%

thing on the table . Tim wasn 't sure what it
is ugly ." Tim thought about what the bird
the step . Bob o kept playing near the
can have the steak , L ula ". She turned around
help him . Tom ran to the tree and

INTERVAL 0.015 - 0.091
CONTAINS 33.477%

not like it . He didn 't know how to make
ran to the forest and Jake ran to the mountain .
pictures . They did not mean to make a mess .
The other animals helped look after the sick sheep .
He followed them to the merry - go - round and

INTERVAL -0.061 - 0.015
CONTAINS 60.450%

Doug ie went to the forest , he would always try
came out of the exam that she started to applaud .
laughed and cheered as the boat moved . They did not
lady was happy too , and thanked the little girl for
. It jumped in and out of the leaves . It

INTERVAL -0.138 - -0.061
CONTAINS 4.595%

and grew tall in the garden . Every day the flower
" It must belong to something very big ," Mama said
Do you see the tiny fish leaping out of the lake
, every color can make a huge difference ! <|endoftext|> Jimmy
and started to draw a picture of a big , bright
Token for
Token ID329
Feature activation-2.53e-02

Pos logit contributions
fire+0.203
 distance+0.187
storm+0.163
 hill+0.159
ding+0.156
Neg logit contributions
els-0.238
atted-0.231
ished-0.225
lee-0.224
 opinions-0.220
Token you
Token ID345
Feature activation-5.13e-03

Pos logit contributions
fire+0.052
 distance+0.050
 hill+0.044
storm+0.044
ding+0.044
Neg logit contributions
els-0.047
lee-0.045
atted-0.044
 opinions-0.044
ubby-0.042
Token."
Token ID526
Feature activation-1.78e-02

Pos logit contributions
fire+0.009
 distance+0.008
 hill+0.007
storm+0.007
whe+0.007
Neg logit contributions
els-0.011
atted-0.011
lee-0.011
ished-0.011
 opinions-0.011
Token
Token ID198
Feature activation-4.12e-02

Pos logit contributions
fire+0.026
 distance+0.022
 hill+0.018
storm+0.018
 road+0.017
Neg logit contributions
atted-0.046
els-0.045
lee-0.045
ished-0.043
 opinions-0.043
Token
Token ID198
Feature activation-3.53e-02

Pos logit contributions
fire+0.060
 distance+0.054
storm+0.046
 hill+0.045
 road+0.043
Neg logit contributions
atted-0.102
els-0.101
lee-0.101
 opinions-0.096
ished-0.095
TokenU
Token ID52
Feature activation+1.59e-01

Pos logit contributions
fire+0.058
 distance+0.053
 hill+0.045
storm+0.043
 road+0.043
Neg logit contributions
els-0.082
atted-0.081
lee-0.079
ished-0.077
 opinions-0.077
Tokenma
Token ID2611
Feature activation+5.68e-02

Pos logit contributions
lee+0.307
ishment+0.302
atted+0.295
ubby+0.291
els+0.288
Neg logit contributions
fire-0.282
ding-0.234
storm-0.230
tr-0.225
asis-0.217
Token carefully
Token ID7773
Feature activation+4.05e-02

Pos logit contributions
els+0.126
atted+0.122
lee+0.120
ubby+0.120
ished+0.118
Neg logit contributions
fire-0.098
 distance-0.085
 hill-0.075
 he-0.074
ding-0.074
Token took
Token ID1718
Feature activation+5.98e-02

Pos logit contributions
els+0.081
atted+0.081
lee+0.079
 opinions+0.078
ubby+0.078
Neg logit contributions
fire-0.079
 distance-0.071
 hill-0.062
ding-0.061
 he-0.061
Token the
Token ID262
Feature activation-3.16e-02

Pos logit contributions
atted+0.128
els+0.128
lee+0.125
ished+0.122
 opinions+0.121
Neg logit contributions
fire-0.106
 distance-0.097
 hill-0.083
ding-0.081
whe-0.081
Token key
Token ID1994
Feature activation-5.17e-02

Pos logit contributions
fire+0.057
 distance+0.052
storm+0.045
ding+0.045
field+0.044
Neg logit contributions
els-0.068
atted-0.066
ished-0.064
lee-0.063
 opinions-0.063
Token were
Token ID547
Feature activation+2.86e-02

Pos logit contributions
fire+0.031
 distance+0.026
fall+0.025
storm+0.025
ding+0.024
Neg logit contributions
lee-0.024
els-0.023
atted-0.023
ished-0.022
 opinions-0.022
Token very
Token ID845
Feature activation+8.92e-03

Pos logit contributions
els+0.060
atted+0.059
lee+0.057
ished+0.057
 opinions+0.056
Neg logit contributions
fire-0.054
 distance-0.049
 hill-0.043
ding-0.042
 road-0.041
Token happy
Token ID3772
Feature activation+7.25e-02

Pos logit contributions
atted+0.018
els+0.017
lee+0.017
ished+0.017
 opinions+0.016
Neg logit contributions
fire-0.018
 distance-0.016
 hill-0.015
storm-0.014
 road-0.014
Token.
Token ID13
Feature activation-2.07e-02

Pos logit contributions
atted+0.132
lee+0.126
 opinions+0.124
els+0.123
ished+0.114
Neg logit contributions
fire-0.157
 distance-0.151
boat-0.130
ding-0.130
|-0.127
Token<|endoftext|>
Token ID50256
Feature activation+3.64e-02

Pos logit contributions
fire+0.029
 distance+0.026
 hill+0.021
ding+0.020
 road+0.020
Neg logit contributions
els-0.053
atted-0.053
lee-0.052
ished-0.051
 opinions-0.050
TokenE
Token ID36
Feature activation+1.58e-01

Pos logit contributions
els+0.069
ubby+0.062
atted+0.062
 opinions+0.060
lee+0.060
Neg logit contributions
fire-0.072
 distance-0.064
storm-0.062
ding-0.057
 usual-0.056
Tokenlla
Token ID8466
Feature activation+9.43e-02

Pos logit contributions
els+0.298
atted+0.292
lee+0.280
ished+0.276
 opinions+0.273
Neg logit contributions
fire-0.325
 distance-0.308
 hill-0.267
 road-0.262
storm-0.258
Token loved
Token ID6151
Feature activation+5.19e-02

Pos logit contributions
els+0.204
atted+0.203
lee+0.197
ished+0.195
 opinions+0.192
Neg logit contributions
fire-0.167
 distance-0.151
 hill-0.131
ding-0.128
 road-0.127
Token to
Token ID284
Feature activation-3.48e-02

Pos logit contributions
els+0.114
atted+0.112
lee+0.111
 opinions+0.107
ished+0.105
Neg logit contributions
fire-0.095
 distance-0.085
 hill-0.071
storm-0.070
ding-0.069
Token visit
Token ID3187
Feature activation+2.16e-02

Pos logit contributions
fire+0.053
 distance+0.047
ding+0.038
storm+0.038
 hill+0.038
Neg logit contributions
els-0.085
lee-0.084
atted-0.084
 opinions-0.081
ished-0.079
Token her
Token ID607
Feature activation+1.92e-02

Pos logit contributions
atted+0.037
els+0.037
ished+0.036
lee+0.035
 opinions+0.034
Neg logit contributions
fire-0.048
 distance-0.044
 hill-0.040
ding-0.039
 road-0.039
Token the
Token ID262
Feature activation-3.40e-02

Pos logit contributions
els+0.064
lee+0.062
atted+0.062
 opinions+0.060
ished+0.058
Neg logit contributions
fire-0.075
 distance-0.069
 hill-0.061
ding-0.060
field-0.060
Token fire
Token ID2046
Feature activation-5.54e-02

Pos logit contributions
fire+0.062
 distance+0.057
whe+0.052
 Zig+0.052
boat+0.052
Neg logit contributions
els-0.066
 opinions-0.066
elling-0.064
lee-0.063
atted-0.062
Tokenwork
Token ID1818
Feature activation-7.73e-02

Pos logit contributions
 distance+0.111
fire+0.106
 saf+0.095
 trail+0.095
 hill+0.094
Neg logit contributions
els-0.102
atted-0.099
lee-0.094
elling-0.093
 opinions-0.093
Token.
Token ID13
Feature activation-3.10e-02

Pos logit contributions
fire+0.145
 distance+0.134
 Zig+0.125
rob+0.117
K+0.117
Neg logit contributions
els-0.149
atted-0.145
lee-0.143
 opinions-0.140
ished-0.138
Token<|endoftext|>
Token ID50256
Feature activation+2.57e-02

Pos logit contributions
fire+0.056
 distance+0.048
 hill+0.043
storm+0.041
ding+0.041
Neg logit contributions
lee-0.068
atted-0.067
els-0.065
 opinions-0.064
ished-0.063
TokenE
Token ID36
Feature activation+1.57e-01

Pos logit contributions
els+0.042
lee+0.042
atted+0.041
ished+0.040
ubby+0.040
Neg logit contributions
fire-0.051
storm-0.045
 distance-0.043
 roof-0.041
tr-0.040
Tokendd
Token ID1860
Feature activation+1.14e-01

Pos logit contributions
els+0.292
atted+0.290
lee+0.278
ished+0.273
 opinions+0.268
Neg logit contributions
fire-0.328
 distance-0.306
 hill-0.268
 road-0.264
whe-0.259
Tokenie
Token ID494
Feature activation+7.79e-02

Pos logit contributions
els+0.165
atted+0.163
lee+0.155
ished+0.153
 opinions+0.150
Neg logit contributions
fire-0.283
 distance-0.266
 hill-0.239
ding-0.234
 road-0.233
Token was
Token ID373
Feature activation+4.65e-02

Pos logit contributions
atted+0.158
els+0.158
lee+0.154
ished+0.151
 opinions+0.147
Neg logit contributions
fire-0.148
 distance-0.135
 hill-0.118
ding-0.115
 road-0.114
Token a
Token ID257
Feature activation-3.75e-02

Pos logit contributions
els+0.108
atted+0.107
lee+0.104
ished+0.102
 opinions+0.102
Neg logit contributions
fire-0.075
 distance-0.069
 hill-0.058
ding-0.057
 road-0.057
Token three
Token ID1115
Feature activation+2.06e-02

Pos logit contributions
fire+0.090
 distance+0.082
whe+0.079
 Zig+0.075
|+0.074
Neg logit contributions
ubby-0.055
lee-0.053
atted-0.050
ealing-0.048
els-0.048
Token he
Token ID339
Feature activation-2.22e-02

Pos logit contributions
fire+0.033
 distance+0.030
ding+0.025
field+0.024
storm+0.024
Neg logit contributions
els-0.050
atted-0.049
 opinions-0.048
lee-0.048
ished-0.047
Token shook
Token ID14682
Feature activation+3.24e-02

Pos logit contributions
fire+0.045
 distance+0.042
 hill+0.036
ding+0.036
 road+0.035
Neg logit contributions
els-0.043
atted-0.042
lee-0.041
ished-0.040
 opinions-0.040
Token it
Token ID340
Feature activation-4.80e-02

Pos logit contributions
els+0.068
atted+0.067
lee+0.065
ished+0.064
 opinions+0.063
Neg logit contributions
fire-0.061
 distance-0.055
 hill-0.048
ding-0.047
storm-0.047
Token.
Token ID13
Feature activation-2.70e-02

Pos logit contributions
fire+0.093
 distance+0.085
 hill+0.072
ding+0.072
storm+0.072
Neg logit contributions
els-0.095
atted-0.094
ished-0.092
 opinions-0.091
lee-0.090
Token<|endoftext|>
Token ID50256
Feature activation+2.62e-02

Pos logit contributions
fire+0.048
 distance+0.042
storm+0.036
 trail+0.036
 hill+0.035
Neg logit contributions
lee-0.059
atted-0.059
 opinions-0.057
els-0.056
ished-0.055
TokenPa
Token ID28875
Feature activation+1.56e-01

Pos logit contributions
els+0.049
atted+0.047
lee+0.047
ished+0.045
ubby+0.045
Neg logit contributions
fire-0.050
 distance-0.045
storm-0.042
 road-0.039
ding-0.039
Tokenola
Token ID5708
Feature activation+1.09e-01

Pos logit contributions
 opinions+0.231
els+0.225
atted+0.217
ishment+0.198
ished+0.197
Neg logit contributions
 distance-0.341
fire-0.332
 saf-0.323
 ride-0.317
 he-0.315
Token and
Token ID290
Feature activation+7.40e-03

Pos logit contributions
els+0.236
atted+0.234
lee+0.225
ished+0.224
 opinions+0.221
Neg logit contributions
fire-0.191
 distance-0.174
 hill-0.152
ding-0.147
 road-0.146
Token her
Token ID607
Feature activation+6.13e-02

Pos logit contributions
els+0.014
atted+0.014
ished+0.013
lee+0.013
 opinions+0.013
Neg logit contributions
fire-0.015
 distance-0.014
 hill-0.013
 he-0.012
 road-0.012
Token mother
Token ID2802
Feature activation+4.23e-02

Pos logit contributions
els+0.095
atted+0.094
ished+0.089
lee+0.088
 opinions+0.086
Neg logit contributions
fire-0.147
 distance-0.138
 hill-0.125
 road-0.122
ding-0.122
Token were
Token ID547
Feature activation+4.60e-02

Pos logit contributions
els+0.101
atted+0.100
ished+0.098
lee+0.098
 opinions+0.096
Neg logit contributions
fire-0.062
 distance-0.057
 hill-0.049
 he-0.047
 road-0.047
Token it
Token ID340
Feature activation-4.67e-02

Pos logit contributions
atted+0.010
els+0.010
ished+0.010
lee+0.010
 opinions+0.010
Neg logit contributions
fire-0.007
 distance-0.007
 hill-0.006
 he-0.006
 road-0.006
Token out
Token ID503
Feature activation-4.70e-02

Pos logit contributions
fire+0.102
 distance+0.094
storm+0.085
ding+0.082
 hill+0.082
Neg logit contributions
els-0.084
atted-0.080
ished-0.079
lee-0.077
 opinions-0.076
Token.
Token ID13
Feature activation-3.38e-02

Pos logit contributions
fire+0.072
 distance+0.063
 hill+0.053
storm+0.053
ding+0.052
Neg logit contributions
els-0.116
atted-0.115
lee-0.112
ished-0.110
 opinions-0.109
Token
Token ID198
Feature activation-4.93e-02

Pos logit contributions
 distance+0.027
fire+0.024
 he+0.023
ding+0.022
To+0.021
Neg logit contributions
els-0.100
ubby-0.098
ished-0.097
atted-0.097
elling-0.094
Token
Token ID198
Feature activation-4.05e-02

Pos logit contributions
fire+0.063
 distance+0.054
 hill+0.044
storm+0.042
 road+0.041
Neg logit contributions
els-0.131
atted-0.131
lee-0.129
ished-0.126
 opinions-0.125
TokenU
Token ID52
Feature activation+1.55e-01

Pos logit contributions
 distance+0.049
To+0.049
fire+0.048
ding+0.042
 road+0.039
Neg logit contributions
atted-0.109
els-0.106
ished-0.103
elling-0.100
ubby-0.099
Tokennexpected
Token ID42072
Feature activation+9.68e-02

Pos logit contributions
ishment+0.329
atted+0.322
ubby+0.318
arted+0.314
lee+0.314
Neg logit contributions
fire-0.201
ding-0.180
 saf-0.167
rob-0.165
asis-0.164
Tokenly
Token ID306
Feature activation+6.54e-02

Pos logit contributions
 Deal+0.168
els+0.154
 opinions+0.148
ealing+0.147
oking+0.147
Neg logit contributions
 he-0.182
fire-0.169
 distance-0.165
 road-0.163
ding-0.158
Token,
Token ID11
Feature activation+1.57e-02

Pos logit contributions
 opinions+0.168
elling+0.164
els+0.160
ished+0.159
ishment+0.158
Neg logit contributions
fire-0.093
 he-0.085
 distance-0.070
ding-0.069
fall-0.064
Token Jenny
Token ID28437
Feature activation+9.85e-02

Pos logit contributions
atted+0.034
els+0.034
ished+0.033
elling+0.033
ubby+0.031
Neg logit contributions
 he-0.026
fire-0.026
 distance-0.025
 usual-0.023
 hill-0.022
Token discovered
Token ID5071
Feature activation+4.54e-02

Pos logit contributions
els+0.215
atted+0.214
 opinions+0.209
elling+0.206
ubby+0.204
Neg logit contributions
fire-0.169
 distance-0.151
 he-0.138
 hill-0.136
field-0.131
Token a
Token ID257
Feature activation-7.49e-02

Pos logit contributions
fire+0.032
 distance+0.029
 hill+0.025
ding+0.025
 road+0.024
Neg logit contributions
els-0.041
atted-0.041
lee-0.039
ished-0.039
 opinions-0.038
Token loving
Token ID14442
Feature activation+1.77e-02

Pos logit contributions
fire+0.152
 Zig+0.143
whe+0.141
 distance+0.132
ding+0.129
Neg logit contributions
lee-0.117
ubby-0.116
atted-0.115
ealing-0.113
 opinions-0.110
Token home
Token ID1363
Feature activation-5.38e-02

Pos logit contributions
atted+0.035
lee+0.034
els+0.034
 opinions+0.033
ished+0.032
Neg logit contributions
fire-0.034
 distance-0.030
ding-0.028
whe-0.028
storm-0.027
Token.
Token ID13
Feature activation-3.46e-02

Pos logit contributions
fire+0.095
 distance+0.093
Ro+0.087
 hill+0.082
storm+0.082
Neg logit contributions
atted-0.106
els-0.106
lee-0.105
ished-0.100
 opinions-0.099
Token<|endoftext|>
Token ID50256
Feature activation+3.09e-02

Pos logit contributions
fire+0.061
 distance+0.055
 hill+0.047
storm+0.047
ding+0.046
Neg logit contributions
lee-0.077
atted-0.076
els-0.073
 opinions-0.071
ished-0.070
TokenE
Token ID36
Feature activation+1.55e-01

Pos logit contributions
els+0.058
lee+0.055
atted+0.055
ished+0.053
ubby+0.053
Neg logit contributions
fire-0.060
 distance-0.052
storm-0.050
 road-0.045
ding-0.045
Tokenlla
Token ID8466
Feature activation+1.03e-01

Pos logit contributions
els+0.293
atted+0.292
lee+0.278
ished+0.274
 opinions+0.269
Neg logit contributions
fire-0.317
 distance-0.302
 hill-0.261
 road-0.257
whe-0.251
Token was
Token ID373
Feature activation+5.87e-02

Pos logit contributions
els+0.199
atted+0.197
lee+0.191
ished+0.189
 opinions+0.185
Neg logit contributions
fire-0.206
 distance-0.189
 hill-0.167
ding-0.163
 road-0.162
Token a
Token ID257
Feature activation-2.26e-02

Pos logit contributions
els+0.135
atted+0.131
lee+0.128
 opinions+0.126
ished+0.126
Neg logit contributions
fire-0.097
 distance-0.090
 hill-0.075
ding-0.075
 road-0.074
Token happy
Token ID3772
Feature activation+7.35e-02

Pos logit contributions
fire+0.052
 distance+0.047
whe+0.043
ding+0.042
storm+0.042
Neg logit contributions
lee-0.036
atted-0.036
ubby-0.036
els-0.035
ished-0.033
Token three
Token ID1115
Feature activation+3.14e-02

Pos logit contributions
atted+0.112
lee+0.111
els+0.108
ubby+0.107
 opinions+0.101
Neg logit contributions
fire-0.176
 distance-0.157
whe-0.146
ding-0.145
field-0.140
Token,
Token ID11
Feature activation+4.66e-02

Pos logit contributions
els+0.117
atted+0.115
lee+0.112
ished+0.112
 opinions+0.110
Neg logit contributions
fire-0.119
 distance-0.109
 hill-0.098
ding-0.095
 he-0.095
Token a
Token ID257
Feature activation-9.08e-03

Pos logit contributions
els+0.109
atted+0.108
lee+0.107
 opinions+0.104
ished+0.104
Neg logit contributions
fire-0.075
 distance-0.067
 hill-0.058
ding-0.056
 he-0.055
Token gentle
Token ID10296
Feature activation+4.70e-02

Pos logit contributions
fire+0.022
 distance+0.021
whe+0.018
field+0.018
storm+0.018
Neg logit contributions
ubby-0.013
atted-0.013
els-0.013
lee-0.013
ished-0.012
Token girl
Token ID2576
Feature activation+4.50e-02

Pos logit contributions
ubby+0.091
lee+0.089
atted+0.089
els+0.082
elling+0.079
Neg logit contributions
fire-0.089
 distance-0.081
whe-0.076
ding-0.072
To-0.069
Token named
Token ID3706
Feature activation+5.34e-02

Pos logit contributions
els+0.112
atted+0.112
lee+0.109
ished+0.108
 opinions+0.106
Neg logit contributions
fire-0.065
 distance-0.058
 hill-0.048
 road-0.046
ding-0.046
Token Ci
Token ID37685
Feature activation+1.55e-01

Pos logit contributions
els+0.124
atted+0.123
 opinions+0.117
ishment+0.116
ished+0.116
Neg logit contributions
fire-0.083
 distance-0.080
 hill-0.069
|-0.068
 Zig-0.067
Tokenara
Token ID3301
Feature activation+9.01e-02

Pos logit contributions
atted+0.211
els+0.206
 opinions+0.197
lee+0.195
ished+0.191
Neg logit contributions
fire-0.385
 distance-0.376
 hill-0.340
field-0.339
ding-0.338
Token stumbled
Token ID24241
Feature activation+6.42e-02

Pos logit contributions
els+0.195
atted+0.194
lee+0.190
ished+0.186
 opinions+0.183
Neg logit contributions
fire-0.162
 distance-0.147
 hill-0.127
ding-0.125
storm-0.123
Token upon
Token ID2402
Feature activation+3.83e-02

Pos logit contributions
atted+0.132
ished+0.129
lee+0.128
els+0.128
ubby+0.122
Neg logit contributions
fire-0.125
 distance-0.105
storm-0.097
 hill-0.096
 saf-0.092
Token a
Token ID257
Feature activation-5.21e-02

Pos logit contributions
 opinions+0.058
lee+0.056
ished+0.055
els+0.055
atted+0.054
Neg logit contributions
fire-0.091
 distance-0.077
field-0.075
boat-0.074
ding-0.072
Token lovely
Token ID14081
Feature activation+3.28e-02

Pos logit contributions
fire+0.111
whe+0.096
 distance+0.091
|+0.091
 Zig+0.090
Neg logit contributions
ubby-0.088
lee-0.083
els-0.080
atted-0.078
ealing-0.077
Token first
Token ID717
Feature activation+5.07e-02

Pos logit contributions
els+0.008
lee+0.008
atted+0.008
 opinions+0.008
ished+0.008
Neg logit contributions
fire-0.007
 distance-0.006
ding-0.006
whe-0.006
storm-0.006
Token friend
Token ID1545
Feature activation+8.97e-03

Pos logit contributions
els+0.108
atted+0.107
lee+0.104
ished+0.103
 opinions+0.101
Neg logit contributions
fire-0.093
 distance-0.085
 hill-0.074
 road-0.072
ding-0.071
Token said
Token ID531
Feature activation+6.57e-02

Pos logit contributions
els+0.018
atted+0.018
lee+0.017
ished+0.017
 opinions+0.017
Neg logit contributions
fire-0.017
 distance-0.016
 hill-0.014
ding-0.014
 road-0.014
Token,
Token ID11
Feature activation+2.41e-02

Pos logit contributions
els+0.125
atted+0.124
lee+0.119
ished+0.118
 opinions+0.115
Neg logit contributions
fire-0.136
 distance-0.124
 hill-0.110
ding-0.107
 road-0.107
Token "
Token ID366
Feature activation+3.17e-02

Pos logit contributions
atted+0.041
lee+0.041
els+0.041
ished+0.038
 opinions+0.038
Neg logit contributions
fire-0.053
 distance-0.050
 hill-0.044
 road-0.043
storm-0.043
TokenU
Token ID52
Feature activation+1.55e-01

Pos logit contributions
els+0.059
atted+0.058
 opinions+0.057
ished+0.057
lee+0.056
Neg logit contributions
fire-0.063
 distance-0.056
 hill-0.052
 road-0.051
 saf-0.051
Tokengh
Token ID456
Feature activation+1.45e-01

Pos logit contributions
atted+0.343
lee+0.342
ishment+0.336
ished+0.332
els+0.331
Neg logit contributions
fire-0.233
 saf-0.190
 Zig-0.182
asis-0.180
 hill-0.179
Token,
Token ID11
Feature activation+2.28e-02

Pos logit contributions
els+0.291
atted+0.287
lee+0.279
 opinions+0.276
ished+0.274
Neg logit contributions
fire-0.277
 distance-0.246
ding-0.227
 hill-0.225
whe-0.216
Token this
Token ID428
Feature activation-2.89e-03

Pos logit contributions
atted+0.043
els+0.043
lee+0.042
ished+0.041
 opinions+0.040
Neg logit contributions
fire-0.046
 distance-0.043
 hill-0.037
ding-0.037
field-0.036
Token part
Token ID636
Feature activation-4.64e-02

Pos logit contributions
fire+0.004
 distance+0.004
 hill+0.003
 road+0.003
field+0.003
Neg logit contributions
els-0.007
atted-0.007
lee-0.007
ished-0.007
 opinions-0.007
Token is
Token ID318
Feature activation+3.24e-03

Pos logit contributions
fire+0.103
 distance+0.096
ding+0.086
whe+0.084
 saf+0.083
Neg logit contributions
atted-0.079
els-0.078
 opinions-0.074
ished-0.074
lee-0.073
Token the
Token ID262
Feature activation-4.22e-02

Pos logit contributions
els+0.006
atted+0.006
lee+0.006
 opinions+0.005
ished+0.005
Neg logit contributions
fire-0.007
 distance-0.006
storm-0.006
ding-0.006
 hill-0.005
Token ancient
Token ID6156
Feature activation-1.07e-02

Pos logit contributions
fire+0.084
 distance+0.077
ding+0.070
storm+0.068
whe+0.068
Neg logit contributions
els-0.081
atted-0.080
lee-0.079
ished-0.077
 opinions-0.077
Token eye
Token ID4151
Feature activation-6.77e-02

Pos logit contributions
fire+0.019
 distance+0.017
ding+0.016
whe+0.015
storm+0.015
Neg logit contributions
els-0.023
atted-0.022
lee-0.022
 opinions-0.021
ished-0.021
Token.
Token ID13
Feature activation-3.88e-02

Pos logit contributions
fire+0.122
 distance+0.110
|+0.102
ding+0.100
 hill+0.100
Neg logit contributions
els-0.142
atted-0.138
lee-0.136
ished-0.130
 opinions-0.125
Token<|endoftext|>
Token ID50256
Feature activation+2.48e-02

Pos logit contributions
fire+0.061
 distance+0.053
storm+0.048
ding+0.045
 hill+0.045
Neg logit contributions
atted-0.093
lee-0.092
els-0.089
 opinions-0.087
ished-0.087
TokenE
Token ID36
Feature activation+1.55e-01

Pos logit contributions
els+0.044
atted+0.042
lee+0.042
ubby+0.041
ished+0.041
Neg logit contributions
fire-0.049
 distance-0.044
storm-0.042
tr-0.039
 road-0.039
Tokenlla
Token ID8466
Feature activation+9.13e-02

Pos logit contributions
atted+0.282
els+0.281
lee+0.263
ished+0.262
ubby+0.257
Neg logit contributions
fire-0.324
 distance-0.312
 hill-0.269
 road-0.269
whe-0.262
Token was
Token ID373
Feature activation+4.81e-02

Pos logit contributions
els+0.188
atted+0.186
lee+0.180
ished+0.178
 opinions+0.176
Neg logit contributions
fire-0.171
 distance-0.156
 hill-0.137
ding-0.133
 road-0.132
Token having
Token ID1719
Feature activation+2.79e-02

Pos logit contributions
els+0.112
atted+0.108
lee+0.106
 opinions+0.105
ished+0.105
Neg logit contributions
fire-0.076
 distance-0.072
 hill-0.059
ding-0.059
 road-0.059
Token a
Token ID257
Feature activation-5.64e-02

Pos logit contributions
lee+0.052
 opinions+0.051
atted+0.050
els+0.049
ished+0.048
Neg logit contributions
fire-0.060
 distance-0.052
storm-0.047
 hill-0.046
field-0.046
Token special
Token ID2041
Feature activation+1.47e-02

Pos logit contributions
fire+0.102
 distance+0.090
whe+0.075
ding+0.075
 hill+0.074
Neg logit contributions
atted-0.120
els-0.119
lee-0.117
ished-0.114
 opinions-0.114
Token.
Token ID13
Feature activation-1.54e-02

Pos logit contributions
atted+0.092
lee+0.091
els+0.090
ished+0.087
ubby+0.086
Neg logit contributions
fire-0.087
 distance-0.080
ding-0.072
 hill-0.072
 road-0.071
Token They
Token ID1119
Feature activation-1.04e-02

Pos logit contributions
fire+0.025
 distance+0.022
 hill+0.019
 road+0.018
ding+0.018
Neg logit contributions
els-0.036
atted-0.036
lee-0.035
ished-0.035
 opinions-0.034
Token say
Token ID910
Feature activation+6.87e-02

Pos logit contributions
fire+0.021
 distance+0.019
 hill+0.017
ding+0.017
storm+0.017
Neg logit contributions
els-0.020
atted-0.020
lee-0.019
ished-0.019
 opinions-0.019
Token,
Token ID11
Feature activation+2.73e-02

Pos logit contributions
els+0.142
atted+0.140
ished+0.138
lee+0.134
ishment+0.131
Neg logit contributions
fire-0.128
 distance-0.115
 he-0.102
 hill-0.102
ding-0.100
Token "
Token ID366
Feature activation+3.80e-02

Pos logit contributions
els+0.049
atted+0.049
lee+0.048
 opinions+0.046
ished+0.045
Neg logit contributions
fire-0.058
 distance-0.054
 hill-0.047
field-0.046
 road-0.046
TokenU
Token ID52
Feature activation+1.54e-01

Pos logit contributions
els+0.076
atted+0.074
 opinions+0.072
lee+0.071
ished+0.071
Neg logit contributions
fire-0.075
 distance-0.068
 hill-0.060
field-0.058
 road-0.058
Tokengly
Token ID10853
Feature activation+9.67e-02

Pos logit contributions
ishment+0.332
lee+0.332
atted+0.327
ished+0.316
ubby+0.314
Neg logit contributions
fire-0.220
ding-0.190
 saf-0.188
whe-0.182
rob-0.181
Token!"
Token ID2474
Feature activation-7.90e-03

Pos logit contributions
atted+0.199
els+0.198
lee+0.193
ished+0.189
 opinions+0.185
Neg logit contributions
fire-0.179
 distance-0.161
 hill-0.144
ding-0.142
 road-0.139
Token "
Token ID366
Feature activation+3.47e-02

Pos logit contributions
fire+0.013
 distance+0.011
 hill+0.010
storm+0.009
ding+0.009
Neg logit contributions
els-0.018
atted-0.018
lee-0.018
ished-0.018
 opinions-0.017
TokenPretty
Token ID35700
Feature activation-4.11e-05

Pos logit contributions
els+0.068
 opinions+0.065
atted+0.065
ished+0.063
lee+0.063
Neg logit contributions
fire-0.068
 distance-0.062
 hill-0.055
field-0.055
 mountain-0.053
Token!"
Token ID2474
Feature activation-1.27e-02

Pos logit contributions
fire+0.000
 distance+0.000
field+0.000
ding+0.000
storm+0.000
Neg logit contributions
els-0.000
atted-0.000
 opinions-0.000
lee-0.000
ished-0.000
Token bit
Token ID1643
Feature activation-1.32e-02

Pos logit contributions
atted+0.040
els+0.039
 opinions+0.038
ished+0.038
lee+0.037
Neg logit contributions
fire-0.033
 distance-0.030
ding-0.027
whe-0.026
storm-0.026
Token of
Token ID286
Feature activation-3.38e-02

Pos logit contributions
fire+0.022
 distance+0.020
 hill+0.017
 he+0.016
 road+0.016
Neg logit contributions
els-0.030
atted-0.030
lee-0.029
ished-0.028
 opinions-0.028
Token magic
Token ID5536
Feature activation-1.89e-02

Pos logit contributions
fire+0.073
 distance+0.067
 hill+0.060
ding+0.058
 road+0.058
Neg logit contributions
els-0.061
atted-0.061
lee-0.058
ished-0.058
 opinions-0.056
Token.
Token ID13
Feature activation-4.02e-02

Pos logit contributions
fire+0.034
 distance+0.031
ding+0.027
 hill+0.026
whe+0.026
Neg logit contributions
els-0.041
atted-0.040
lee-0.039
ished-0.039
 opinions-0.038
Token<|endoftext|>
Token ID50256
Feature activation+2.61e-02

Pos logit contributions
fire+0.060
 distance+0.054
storm+0.045
 hill+0.044
ding+0.044
Neg logit contributions
atted-0.098
els-0.097
lee-0.096
 opinions-0.094
ished-0.093
TokenIsa
Token ID39443
Feature activation+1.54e-01

Pos logit contributions
els+0.048
lee+0.047
atted+0.046
ished+0.045
ubby+0.045
Neg logit contributions
fire-0.050
 distance-0.043
storm-0.042
 road-0.038
tr-0.038
Tokenac
Token ID330
Feature activation+1.26e-01

Pos logit contributions
atted+0.263
els+0.259
lee+0.256
ishment+0.251
elling+0.247
Neg logit contributions
fire-0.336
 distance-0.313
|-0.270
field-0.264
 saf-0.262
Token was
Token ID373
Feature activation+5.96e-02

Pos logit contributions
atted+0.180
els+0.179
ishment+0.176
ubby+0.172
ealing+0.168
Neg logit contributions
fire-0.275
 distance-0.269
 he-0.235
boat-0.235
 saf-0.234
Token preparing
Token ID10629
Feature activation+1.69e-02

Pos logit contributions
els+0.140
atted+0.137
lee+0.133
 opinions+0.132
ished+0.131
Neg logit contributions
fire-0.096
 distance-0.088
ding-0.073
 he-0.072
 hill-0.072
Token breakfast
Token ID12607
Feature activation-4.06e-02

Pos logit contributions
els+0.036
atted+0.036
lee+0.035
ished+0.034
 opinions+0.034
Neg logit contributions
fire-0.031
 distance-0.028
 hill-0.024
storm-0.024
ding-0.024
Token in
Token ID287
Feature activation-2.06e-02

Pos logit contributions
fire+0.072
 distance+0.072
 Zig+0.063
|+0.061
storm+0.060
Neg logit contributions
els-0.082
atted-0.077
ished-0.077
lee-0.076
inding-0.075
Token there
Token ID612
Feature activation+8.21e-03

Pos logit contributions
els+0.091
atted+0.091
lee+0.089
ished+0.087
 opinions+0.087
Neg logit contributions
fire-0.065
 distance-0.058
 hill-0.051
 he-0.049
ding-0.048
Token were
Token ID547
Feature activation+2.61e-02

Pos logit contributions
els+0.019
atted+0.019
lee+0.018
ished+0.018
 opinions+0.018
Neg logit contributions
fire-0.014
 distance-0.012
 hill-0.010
ding-0.010
storm-0.010
Token two
Token ID734
Feature activation+2.04e-02

Pos logit contributions
els+0.048
atted+0.048
lee+0.044
ished+0.044
ishment+0.044
Neg logit contributions
fire-0.053
 distance-0.050
 hill-0.045
ding-0.044
 saf-0.044
Token friends
Token ID2460
Feature activation-9.37e-03

Pos logit contributions
els+0.034
atted+0.033
ished+0.032
lee+0.032
 opinions+0.031
Neg logit contributions
fire-0.045
 distance-0.043
 hill-0.039
 road-0.038
 saf-0.037
Token,
Token ID11
Feature activation+2.25e-02

Pos logit contributions
fire+0.015
 distance+0.014
 hill+0.012
ding+0.011
 road+0.011
Neg logit contributions
els-0.022
atted-0.022
lee-0.021
ished-0.021
 opinions-0.020
Token Ab
Token ID2275
Feature activation+1.54e-01

Pos logit contributions
lee+0.034
atted+0.034
els+0.032
ubby+0.032
 opinions+0.030
Neg logit contributions
fire-0.055
 distance-0.049
asis-0.047
storm-0.046
ding-0.045
Tokenig
Token ID328
Feature activation+3.56e-02

Pos logit contributions
 opinions+0.250
ishment+0.234
elling+0.230
ished+0.229
atted+0.228
Neg logit contributions
 distance-0.314
fire-0.309
field-0.293
 usual-0.275
 hill-0.273
Tokenail
Token ID603
Feature activation+7.21e-02

Pos logit contributions
atted+0.052
els+0.052
lee+0.050
ished+0.048
 opinions+0.046
Neg logit contributions
fire-0.089
 distance-0.083
 hill-0.075
 road-0.074
ding-0.074
Token and
Token ID290
Feature activation-4.45e-03

Pos logit contributions
els+0.131
atted+0.129
lee+0.124
ished+0.124
 opinions+0.121
Neg logit contributions
fire-0.153
 distance-0.141
 hill-0.126
 road-0.123
ding-0.122
Token Mat
Token ID6550
Feature activation+1.04e-01

Pos logit contributions
fire+0.010
 distance+0.009
 hill+0.008
 he+0.008
 road+0.008
Neg logit contributions
els-0.008
atted-0.008
ished-0.008
lee-0.008
 opinions-0.007
Tokenty
Token ID774
Feature activation+6.21e-02

Pos logit contributions
atted+0.165
els+0.164
ished+0.154
lee+0.154
 opinions+0.152
Neg logit contributions
fire-0.244
 distance-0.229
 hill-0.206
ding-0.203
 road-0.201
Token it
Token ID340
Feature activation-4.98e-02

Pos logit contributions
fire+0.048
 distance+0.043
ding+0.040
whe+0.040
storm+0.039
Neg logit contributions
els-0.050
atted-0.047
lee-0.046
 opinions-0.046
ishment-0.045
Token.
Token ID13
Feature activation-2.68e-02

Pos logit contributions
fire+0.087
 distance+0.078
 Zig+0.068
ding+0.068
whe+0.068
Neg logit contributions
atted-0.106
ished-0.105
els-0.104
lee-0.103
 opinions-0.098
Token
Token ID220
Feature activation+2.44e-02

Pos logit contributions
fire+0.046
 distance+0.041
storm+0.035
 hill+0.035
ding+0.034
Neg logit contributions
atted-0.060
lee-0.060
els-0.058
ished-0.057
 opinions-0.056
Token
Token ID198
Feature activation-4.31e-02

Pos logit contributions
atted+0.061
els+0.061
lee+0.060
ished+0.059
 opinions+0.058
Neg logit contributions
fire-0.035
 distance-0.032
storm-0.026
 hill-0.026
 road-0.025
Token
Token ID198
Feature activation-3.39e-02

Pos logit contributions
fire+0.068
 distance+0.062
storm+0.058
 hill+0.054
 road+0.052
Neg logit contributions
atted-0.095
lee-0.095
els-0.092
ished-0.090
ealing-0.089
TokenU
Token ID52
Feature activation+1.54e-01

Pos logit contributions
fire+0.055
 distance+0.052
storm+0.046
 hill+0.045
field+0.045
Neg logit contributions
atted-0.071
lee-0.070
els-0.070
ished-0.068
inding-0.068
Tokenps
Token ID862
Feature activation+5.00e-02

Pos logit contributions
atted+0.235
lee+0.233
els+0.228
ished+0.220
ishment+0.219
Neg logit contributions
fire-0.351
storm-0.292
asis-0.288
field-0.281
ding-0.281
Tokenet
Token ID316
Feature activation+5.28e-02

Pos logit contributions
atted+0.101
els+0.099
lee+0.097
ished+0.096
 opinions+0.092
Neg logit contributions
fire-0.096
 distance-0.089
 hill-0.079
 road-0.077
field-0.076
Token about
Token ID546
Feature activation-4.71e-03

Pos logit contributions
els+0.101
atted+0.100
lee+0.097
ished+0.095
 opinions+0.093
Neg logit contributions
fire-0.107
 distance-0.099
 hill-0.087
ding-0.085
 road-0.085
Token the
Token ID262
Feature activation-3.45e-02

Pos logit contributions
fire+0.010
 distance+0.009
ding+0.008
storm+0.008
 hill+0.008
Neg logit contributions
els-0.009
atted-0.009
lee-0.008
 opinions-0.008
ished-0.008
Token poor
Token ID3595
Feature activation+3.18e-02

Pos logit contributions
fire+0.082
 distance+0.076
 hill+0.069
 road+0.067
ding+0.066
Neg logit contributions
atted-0.055
els-0.055
lee-0.053
ished-0.052
 opinions-0.050
Token stream
Token ID4269
Feature activation-3.68e-02

Pos logit contributions
fire+0.100
 distance+0.090
field+0.082
storm+0.081
 hill+0.081
Neg logit contributions
els-0.084
lee-0.083
atted-0.082
 opinions-0.080
ished-0.075
Tokeners
Token ID364
Feature activation-2.18e-02

Pos logit contributions
 distance+0.086
fire+0.086
 hill+0.077
 road+0.076
field+0.076
Neg logit contributions
atted-0.053
els-0.053
lee-0.052
ished-0.050
 opinions-0.047
Token.
Token ID13
Feature activation-2.17e-02

Pos logit contributions
fire+0.040
 distance+0.038
|+0.033
 hill+0.033
 road+0.032
Neg logit contributions
els-0.043
lee-0.043
atted-0.043
ished-0.040
 opinions-0.039
Token
Token ID198
Feature activation-4.98e-02

Pos logit contributions
fire+0.045
 distance+0.041
 hill+0.036
storm+0.036
field+0.035
Neg logit contributions
atted-0.041
lee-0.041
els-0.041
ished-0.038
 opinions-0.038
Token
Token ID198
Feature activation-4.23e-02

Pos logit contributions
fire+0.073
 distance+0.065
storm+0.055
 hill+0.054
 road+0.052
Neg logit contributions
atted-0.123
els-0.123
lee-0.122
ished-0.115
 opinions-0.115
TokenU
Token ID52
Feature activation+1.53e-01

Pos logit contributions
fire+0.073
 distance+0.067
 hill+0.057
storm+0.056
 road+0.055
Neg logit contributions
els-0.093
atted-0.092
lee-0.091
ished-0.088
 opinions-0.087
Tokenma
Token ID2611
Feature activation+5.30e-02

Pos logit contributions
lee+0.262
ishment+0.257
els+0.254
atted+0.251
ished+0.247
Neg logit contributions
fire-0.313
ding-0.270
storm-0.263
tr-0.255
whe-0.249
Token jumped
Token ID11687
Feature activation+7.92e-02

Pos logit contributions
els+0.108
atted+0.107
lee+0.103
ished+0.103
 opinions+0.101
Neg logit contributions
fire-0.100
 distance-0.091
 hill-0.080
ding-0.078
 road-0.077
Token up
Token ID510
Feature activation+2.73e-02

Pos logit contributions
els+0.152
atted+0.152
lee+0.147
ished+0.145
 opinions+0.140
Neg logit contributions
fire-0.160
 distance-0.147
 hill-0.129
 road-0.126
storm-0.126
Token and
Token ID290
Feature activation-2.95e-02

Pos logit contributions
els+0.042
atted+0.042
lee+0.041
ished+0.038
 opinions+0.036
Neg logit contributions
fire-0.065
 distance-0.059
storm-0.054
ding-0.053
whe-0.052
Token down
Token ID866
Feature activation+9.08e-03

Pos logit contributions
fire+0.066
 distance+0.060
storm+0.053
 road+0.053
 hill+0.052
Neg logit contributions
atted-0.053
lee-0.051
els-0.050
ished-0.050
ishment-0.045
Token mad
Token ID8805
Feature activation+9.52e-02

Pos logit contributions
els+0.100
atted+0.096
lee+0.096
ishment+0.095
ubby+0.094
Neg logit contributions
fire-0.095
 distance-0.091
 he-0.081
whe-0.078
 hill-0.076
Token and
Token ID290
Feature activation-2.85e-02

Pos logit contributions
els+0.185
atted+0.179
lee+0.172
 opinions+0.171
ished+0.171
Neg logit contributions
fire-0.189
 distance-0.176
 he-0.162
 hill-0.156
ding-0.155
Token said
Token ID531
Feature activation+5.88e-02

Pos logit contributions
fire+0.058
 distance+0.053
 hill+0.047
ding+0.046
 road+0.045
Neg logit contributions
els-0.055
atted-0.054
lee-0.053
ished-0.052
 opinions-0.051
Token,
Token ID11
Feature activation+1.51e-02

Pos logit contributions
els+0.125
atted+0.119
ished+0.118
ishment+0.118
 opinions+0.116
Neg logit contributions
fire-0.108
 distance-0.093
 he-0.088
ding-0.083
storm-0.083
Token "
Token ID366
Feature activation+3.11e-02

Pos logit contributions
atted+0.028
els+0.028
lee+0.028
ished+0.027
 opinions+0.026
Neg logit contributions
fire-0.031
 distance-0.029
 hill-0.025
 road-0.025
ding-0.025
TokenU
Token ID52
Feature activation+1.53e-01

Pos logit contributions
atted+0.072
els+0.072
lee+0.071
ished+0.069
 opinions+0.067
Neg logit contributions
fire-0.050
 distance-0.046
 hill-0.038
ding-0.038
whe-0.037
Tokengh
Token ID456
Feature activation+1.42e-01

Pos logit contributions
atted+0.306
ishment+0.306
lee+0.302
ished+0.296
arted+0.296
Neg logit contributions
fire-0.242
asis-0.197
ding-0.195
storm-0.191
rob-0.186
Token,
Token ID11
Feature activation+2.78e-02

Pos logit contributions
els+0.301
 opinions+0.297
ishment+0.292
atted+0.290
elling+0.289
Neg logit contributions
fire-0.241
ding-0.209
 distance-0.196
 hill-0.194
 he-0.192
Token what
Token ID644
Feature activation-2.24e-02

Pos logit contributions
els+0.059
atted+0.058
ished+0.056
lee+0.055
 opinions+0.053
Neg logit contributions
fire-0.052
 distance-0.046
 hill-0.043
 he-0.042
 road-0.040
Token are
Token ID389
Feature activation+1.33e-02

Pos logit contributions
fire+0.042
 distance+0.038
 hill+0.034
 he+0.034
 road+0.033
Neg logit contributions
atted-0.045
els-0.045
lee-0.044
ished-0.043
ishment-0.042
Token you
Token ID345
Feature activation+1.98e-02

Pos logit contributions
atted+0.029
els+0.029
lee+0.028
ished+0.028
 opinions+0.027
Neg logit contributions
fire-0.024
 distance-0.021
 hill-0.018
ding-0.018
 he-0.018
Token was
Token ID373
Feature activation+3.79e-02

Pos logit contributions
atted+0.003
els+0.002
ished+0.002
 opinions+0.002
lee+0.002
Neg logit contributions
fire-0.003
 distance-0.003
ding-0.002
 hill-0.002
whe-0.002
Token their
Token ID511
Feature activation+2.07e-02

Pos logit contributions
atted+0.076
els+0.075
lee+0.073
ished+0.072
ubby+0.071
Neg logit contributions
fire-0.073
 distance-0.067
 hill-0.061
ding-0.059
storm-0.058
Token friend
Token ID1545
Feature activation-1.18e-02

Pos logit contributions
els+0.032
 opinions+0.032
ubby+0.032
atted+0.032
lee+0.030
Neg logit contributions
fire-0.048
 distance-0.044
ding-0.041
storm-0.040
whe-0.039
Token.
Token ID13
Feature activation-2.40e-02

Pos logit contributions
fire+0.025
 distance+0.022
ding+0.020
storm+0.020
field+0.020
Neg logit contributions
atted-0.022
els-0.021
lee-0.021
 opinions-0.020
ished-0.020
Token<|endoftext|>
Token ID50256
Feature activation+2.86e-02

Pos logit contributions
fire+0.044
 distance+0.039
 hill+0.036
field+0.035
storm+0.035
Neg logit contributions
lee-0.048
atted-0.047
ubby-0.047
 opinions-0.047
els-0.046
TokenE
Token ID36
Feature activation+1.52e-01

Pos logit contributions
els+0.052
ubby+0.048
atted+0.046
 opinions+0.045
ished+0.045
Neg logit contributions
fire-0.057
 distance-0.053
storm-0.051
ding-0.047
 usual-0.047
Tokenlla
Token ID8466
Feature activation+9.18e-02

Pos logit contributions
els+0.280
atted+0.271
ubby+0.260
 opinions+0.254
ished+0.254
Neg logit contributions
fire-0.308
 distance-0.304
 hill-0.259
whe-0.258
storm-0.253
Token and
Token ID290
Feature activation+7.56e-03

Pos logit contributions
atted+0.152
els+0.151
lee+0.145
ished+0.142
 opinions+0.138
Neg logit contributions
fire-0.209
 distance-0.197
 hill-0.174
storm-0.171
ding-0.170
Token Ben
Token ID3932
Feature activation+1.09e-01

Pos logit contributions
els+0.012
atted+0.012
lee+0.012
 opinions+0.011
ished+0.011
Neg logit contributions
fire-0.018
 distance-0.016
ding-0.015
 hill-0.015
storm-0.014
Token were
Token ID547
Feature activation+4.77e-02

Pos logit contributions
els+0.239
atted+0.238
lee+0.230
ished+0.227
 opinions+0.223
Neg logit contributions
fire-0.190
 distance-0.173
 hill-0.149
ding-0.145
 road-0.144
Token twins
Token ID20345
Feature activation-4.99e-03

Pos logit contributions
els+0.099
atted+0.094
 opinions+0.093
ished+0.093
lee+0.092
Neg logit contributions
fire-0.083
 distance-0.079
 road-0.070
 saf-0.070
 hill-0.070
Token time
Token ID640
Feature activation+4.98e-03

Pos logit contributions
els+0.032
atted+0.032
lee+0.031
ished+0.030
 opinions+0.030
Neg logit contributions
fire-0.040
 distance-0.037
 hill-0.033
ding-0.033
whe-0.032
Token!
Token ID0
Feature activation-3.14e-02

Pos logit contributions
atted+0.011
els+0.011
lee+0.011
ished+0.010
 opinions+0.010
Neg logit contributions
fire-0.009
 distance-0.008
 hill-0.007
 road-0.007
storm-0.007
Tokenâ
Token ID22940
Feature activation+7.98e-02

Pos logit contributions
fire+0.051
 distance+0.046
 hill+0.039
storm+0.038
ding+0.037
Neg logit contributions
atted-0.072
els-0.072
lee-0.071
ished-0.069
 opinions-0.069
Token
Token ID26391
Feature activation+5.37e-02

Pos logit contributions
els+0.158
atted+0.156
lee+0.151
ished+0.147
 opinions+0.143
Neg logit contributions
fire-0.163
 distance-0.148
 hill-0.129
ding-0.128
storm-0.126
Token<|endoftext|>
Token ID50256
Feature activation+2.25e-02

Pos logit contributions
atted+0.119
lee+0.118
ealing+0.115
els+0.114
inding+0.111
Neg logit contributions
fire-0.089
 distance-0.082
storm-0.069
whe-0.066
 usual-0.066
TokenPa
Token ID28875
Feature activation+1.52e-01

Pos logit contributions
els+0.043
lee+0.041
atted+0.041
ubby+0.040
ished+0.040
Neg logit contributions
fire-0.043
 distance-0.038
storm-0.036
tr-0.034
 road-0.034
Tokenolo
Token ID14057
Feature activation+6.97e-02

Pos logit contributions
els+0.282
 opinions+0.277
atted+0.269
ished+0.257
lee+0.253
Neg logit contributions
fire-0.297
 distance-0.293
 ride-0.264
ding-0.264
 he-0.260
Token was
Token ID373
Feature activation+5.00e-02

Pos logit contributions
els+0.133
atted+0.133
lee+0.128
ished+0.126
 opinions+0.123
Neg logit contributions
fire-0.141
 distance-0.129
 hill-0.114
ding-0.112
 road-0.111
Token very
Token ID845
Feature activation+1.37e-02

Pos logit contributions
els+0.110
atted+0.109
lee+0.106
ished+0.104
 opinions+0.103
Neg logit contributions
fire-0.088
 distance-0.080
 hill-0.068
ding-0.067
 road-0.066
Token excited
Token ID6568
Feature activation+8.50e-02

Pos logit contributions
els+0.027
atted+0.026
lee+0.025
ished+0.025
 opinions+0.025
Neg logit contributions
fire-0.028
 distance-0.026
 hill-0.023
ding-0.022
 he-0.022
Token.
Token ID13
Feature activation-3.49e-02

Pos logit contributions
atted+0.189
els+0.188
lee+0.185
ished+0.181
 opinions+0.180
Neg logit contributions
fire-0.145
 distance-0.132
 hill-0.111
ding-0.106
 road-0.106
Token that
Token ID326
Feature activation-5.25e-02

Pos logit contributions
fire+0.142
 distance+0.122
 Zig+0.122
 saf+0.116
|+0.115
Neg logit contributions
ished-0.117
els-0.110
atted-0.105
lee-0.103
ubby-0.101
Token he
Token ID339
Feature activation-7.75e-03

Pos logit contributions
fire+0.097
 distance+0.075
ding+0.075
 Zig+0.074
field+0.072
Neg logit contributions
 opinions-0.099
ened-0.096
atted-0.094
els-0.093
lee-0.093
Token wanted
Token ID2227
Feature activation+4.79e-02

Pos logit contributions
fire+0.015
 distance+0.013
 hill+0.012
ding+0.012
 road+0.011
Neg logit contributions
atted-0.016
els-0.016
ished-0.015
lee-0.015
 opinions-0.015
Token.
Token ID13
Feature activation-3.07e-02

Pos logit contributions
els+0.106
atted+0.106
lee+0.103
ished+0.102
 opinions+0.102
Neg logit contributions
fire-0.083
 distance-0.073
 hill-0.065
field-0.063
storm-0.063
Token<|endoftext|>
Token ID50256
Feature activation+2.77e-02

Pos logit contributions
fire+0.057
 distance+0.050
storm+0.044
 hill+0.044
 trail+0.043
Neg logit contributions
atted-0.065
lee-0.064
ished-0.061
 opinions-0.061
els-0.061
TokenAk
Token ID33901
Feature activation+1.52e-01

Pos logit contributions
els+0.052
lee+0.050
atted+0.050
ished+0.049
ubby+0.048
Neg logit contributions
fire-0.053
 distance-0.046
storm-0.045
ding-0.040
 road-0.040
Tokenio
Token ID952
Feature activation+9.88e-02

Pos logit contributions
 opinions+0.270
ishment+0.256
atted+0.256
 requests+0.254
elling+0.250
Neg logit contributions
fire-0.278
 he-0.256
 distance-0.250
 saf-0.249
boat-0.249
Token was
Token ID373
Feature activation+5.42e-02

Pos logit contributions
els+0.182
atted+0.182
lee+0.174
ished+0.174
 opinions+0.171
Neg logit contributions
fire-0.200
 distance-0.184
 hill-0.165
ding-0.162
field-0.160
Token a
Token ID257
Feature activation-3.16e-02

Pos logit contributions
els+0.122
atted+0.119
lee+0.117
 opinions+0.114
ished+0.114
Neg logit contributions
fire-0.092
 distance-0.085
ding-0.072
 hill-0.071
 road-0.070
Token 3
Token ID513
Feature activation+4.53e-02

Pos logit contributions
fire+0.075
 distance+0.068
whe+0.063
ding+0.061
 Zig+0.061
Neg logit contributions
lee-0.047
ubby-0.046
atted-0.046
els-0.045
ished-0.043
Token year
Token ID614
Feature activation+2.65e-02

Pos logit contributions
els+0.107
atted+0.106
lee+0.103
ished+0.103
 opinions+0.101
Neg logit contributions
fire-0.072
 distance-0.065
 hill-0.055
 road-0.053
ding-0.052
Token colorful
Token ID20239
Feature activation-9.07e-03

Pos logit contributions
|+0.156
whe+0.152
fire+0.151
 Zig+0.142
storm+0.138
Neg logit contributions
ubby-0.159
els-0.157
lee-0.155
 requests-0.148
ealing-0.147
Token door
Token ID3420
Feature activation-1.12e-01

Pos logit contributions
fire+0.016
 distance+0.015
ding+0.014
|+0.013
whe+0.013
Neg logit contributions
els-0.019
lee-0.018
atted-0.018
 opinions-0.018
ubby-0.017
Token.
Token ID13
Feature activation-3.72e-02

Pos logit contributions
fire+0.204
 distance+0.184
|+0.166
 Zig+0.163
Ro+0.161
Neg logit contributions
els-0.229
atted-0.219
lee-0.212
ished-0.206
 opinions-0.206
Token
Token ID198
Feature activation-4.27e-02

Pos logit contributions
fire+0.088
 distance+0.080
storm+0.074
asis+0.072
field+0.072
Neg logit contributions
 opinions-0.056
lee-0.053
els-0.052
atted-0.051
 requests-0.050
Token
Token ID198
Feature activation-3.57e-02

Pos logit contributions
fire+0.076
 distance+0.067
storm+0.065
boat+0.061
field+0.061
Neg logit contributions
inding-0.077
els-0.077
 opinions-0.077
lee-0.077
atted-0.076
TokenU
Token ID52
Feature activation+1.52e-01

Pos logit contributions
fire+0.063
field+0.055
 distance+0.055
storm+0.051
ding+0.050
Neg logit contributions
ubby-0.063
 opinions-0.062
inding-0.062
els-0.061
lee-0.059
Tokenma
Token ID2611
Feature activation+5.42e-02

Pos logit contributions
atted+0.334
els+0.331
lee+0.329
ished+0.322
ishment+0.319
Neg logit contributions
fire-0.255
 distance-0.197
storm-0.197
ding-0.196
 saf-0.193
Token was
Token ID373
Feature activation+5.01e-02

Pos logit contributions
els+0.119
atted+0.119
lee+0.115
ished+0.115
 opinions+0.113
Neg logit contributions
fire-0.092
 distance-0.083
 hill-0.073
 road-0.071
ding-0.070
Token a
Token ID257
Feature activation-4.59e-02

Pos logit contributions
els+0.125
atted+0.125
lee+0.122
ished+0.120
 opinions+0.118
Neg logit contributions
fire-0.073
 distance-0.065
 hill-0.054
ding-0.052
 road-0.051
Token 3
Token ID513
Feature activation+3.97e-02

Pos logit contributions
fire+0.105
 distance+0.099
whe+0.098
|+0.096
ding+0.095
Neg logit contributions
ubby-0.063
lee-0.056
atted-0.054
ealing-0.053
inding-0.052
Token year
Token ID614
Feature activation+2.09e-02

Pos logit contributions
els+0.088
atted+0.088
lee+0.085
ished+0.085
 opinions+0.083
Neg logit contributions
fire-0.069
 distance-0.061
 hill-0.053
 road-0.051
 he-0.050
Token,
Token ID11
Feature activation+5.39e-02

Pos logit contributions
atted+0.069
els+0.068
lee+0.066
ished+0.064
ubby+0.063
Neg logit contributions
fire-0.092
 distance-0.086
 hill-0.076
 road-0.075
whe-0.074
Token a
Token ID257
Feature activation-8.99e-03

Pos logit contributions
els+0.118
atted+0.118
lee+0.114
ished+0.113
 opinions+0.110
Neg logit contributions
fire-0.095
 distance-0.087
 hill-0.074
ding-0.072
 road-0.072
Token p
Token ID279
Feature activation+1.92e-02

Pos logit contributions
fire+0.022
 distance+0.020
whe+0.019
field+0.018
|+0.018
Neg logit contributions
ubby-0.013
lee-0.012
atted-0.012
els-0.012
ealing-0.012
Tokenanda
Token ID5282
Feature activation+4.44e-02

Pos logit contributions
els+0.028
atted+0.027
 opinions+0.027
lee+0.027
ished+0.027
Neg logit contributions
fire-0.046
 distance-0.043
 hill-0.039
ding-0.039
 road-0.038
Token named
Token ID3706
Feature activation+4.88e-02

Pos logit contributions
atted+0.087
els+0.086
lee+0.080
ished+0.077
ubby+0.076
Neg logit contributions
fire-0.088
 distance-0.085
whe-0.073
ding-0.072
 hill-0.071
Token P
Token ID350
Feature activation+1.52e-01

Pos logit contributions
els+0.106
atted+0.105
lee+0.103
ished+0.100
 opinions+0.098
Neg logit contributions
fire-0.088
 distance-0.080
 hill-0.069
ding-0.067
storm-0.067
Tokenao
Token ID5488
Feature activation+8.56e-02

Pos logit contributions
atted+0.218
els+0.213
 opinions+0.207
lee+0.205
ishment+0.204
Neg logit contributions
fire-0.359
 distance-0.330
 hill-0.311
ding-0.309
 Zig-0.307
Token was
Token ID373
Feature activation+4.71e-02

Pos logit contributions
atted+0.181
els+0.179
lee+0.178
ished+0.171
ubby+0.164
Neg logit contributions
fire-0.159
 distance-0.152
 hill-0.125
storm-0.122
 road-0.122
Token very
Token ID845
Feature activation+1.53e-02

Pos logit contributions
atted+0.098
els+0.097
lee+0.097
ubby+0.093
ished+0.093
Neg logit contributions
fire-0.090
 distance-0.080
 hill-0.071
storm-0.069
ding-0.069
Token hungry
Token ID14720
Feature activation+5.73e-02

Pos logit contributions
atted+0.029
els+0.029
lee+0.029
ubby+0.028
ished+0.028
Neg logit contributions
fire-0.031
 distance-0.029
 hill-0.026
storm-0.025
ding-0.025
Token.
Token ID13
Feature activation-2.25e-02

Pos logit contributions
els+0.105
atted+0.099
ished+0.098
 opinions+0.097
lee+0.097
Neg logit contributions
fire-0.113
 distance-0.100
|-0.095
field-0.093
ding-0.092
Token be
Token ID307
Feature activation+1.42e-02

Pos logit contributions
fire+0.101
 distance+0.090
ding+0.082
field+0.078
storm+0.078
Neg logit contributions
els-0.094
 opinions-0.089
lee-0.088
atted-0.087
ished-0.084
Token up
Token ID510
Feature activation-5.42e-03

Pos logit contributions
els+0.028
atted+0.028
lee+0.027
ished+0.027
 opinions+0.026
Neg logit contributions
fire-0.028
 distance-0.026
 hill-0.023
ding-0.022
storm-0.022
Token high
Token ID1029
Feature activation+2.96e-02

Pos logit contributions
fire+0.010
 distance+0.009
ding+0.008
storm+0.008
 hill+0.008
Neg logit contributions
els-0.011
atted-0.011
lee-0.011
 opinions-0.010
ished-0.010
Token on
Token ID319
Feature activation-3.45e-02

Pos logit contributions
els+0.059
atted+0.058
lee+0.056
ished+0.056
 opinions+0.054
Neg logit contributions
fire-0.058
 distance-0.053
 hill-0.047
ding-0.046
 road-0.045
Token the
Token ID262
Feature activation-6.35e-02

Pos logit contributions
fire+0.070
 distance+0.065
ding+0.058
whe+0.057
storm+0.057
Neg logit contributions
els-0.068
lee-0.062
 opinions-0.061
atted-0.061
ubby-0.059
Token roof
Token ID9753
Feature activation-1.38e-01

Pos logit contributions
fire+0.094
rob+0.071
boat+0.071
Ro+0.069
storm+0.068
Neg logit contributions
els-0.145
lee-0.145
 opinions-0.142
 heads-0.137
atted-0.134
Token,
Token ID11
Feature activation-1.50e-02

Pos logit contributions
fire+0.271
Ro+0.237
 distance+0.235
Ar+0.227
|+0.227
Neg logit contributions
els-0.253
atted-0.240
lee-0.239
ished-0.235
 opinions-0.227
Token but
Token ID475
Feature activation-1.29e-02

Pos logit contributions
fire+0.028
 distance+0.025
 hill+0.022
ding+0.021
 road+0.021
Neg logit contributions
els-0.032
atted-0.032
lee-0.031
ished-0.030
 opinions-0.029
Token they
Token ID484
Feature activation-1.06e-02

Pos logit contributions
fire+0.020
 distance+0.018
 hill+0.015
ding+0.014
 road+0.014
Neg logit contributions
els-0.032
atted-0.031
lee-0.031
ished-0.030
 opinions-0.030
Token both
Token ID1111
Feature activation+4.06e-03

Pos logit contributions
fire+0.023
 distance+0.021
 road+0.018
ding+0.018
 hill+0.018
Neg logit contributions
atted-0.019
els-0.019
lee-0.019
ished-0.018
 opinions-0.018
Token enjoyed
Token ID8359
Feature activation+2.76e-02

Pos logit contributions
atted+0.007
lee+0.007
ished+0.006
els+0.006
 opinions+0.006
Neg logit contributions
fire-0.009
 distance-0.009
 hill-0.008
ding-0.008
 road-0.008
Token build
Token ID1382
Feature activation-1.88e-02

Pos logit contributions
fire+0.077
 distance+0.070
ding+0.066
field+0.060
storm+0.059
Neg logit contributions
els-0.086
 opinions-0.085
lee-0.082
atted-0.080
ubby-0.077
Token a
Token ID257
Feature activation-9.22e-02

Pos logit contributions
fire+0.034
 distance+0.031
 hill+0.027
ding+0.026
storm+0.026
Neg logit contributions
els-0.040
atted-0.040
lee-0.039
ished-0.038
 opinions-0.038
Token nest
Token ID16343
Feature activation-9.63e-02

Pos logit contributions
fire+0.185
|+0.161
whe+0.158
 distance+0.157
ding+0.151
Neg logit contributions
els-0.151
lee-0.149
ubby-0.148
ishment-0.146
inding-0.143
Token on
Token ID319
Feature activation-4.68e-02

Pos logit contributions
fire+0.208
 distance+0.197
 Zig+0.177
 saf+0.177
ding+0.177
Neg logit contributions
els-0.151
atted-0.144
ished-0.142
ubby-0.141
 opinions-0.138
Token this
Token ID428
Feature activation-6.56e-02

Pos logit contributions
fire+0.105
 distance+0.096
ding+0.089
whe+0.088
storm+0.085
Neg logit contributions
els-0.082
 opinions-0.072
lee-0.071
elling-0.070
inding-0.069
Token roof
Token ID9753
Feature activation-1.35e-01

Pos logit contributions
fire+0.104
whe+0.088
|+0.087
 distance+0.083
ding+0.082
Neg logit contributions
els-0.140
elling-0.132
ubby-0.129
lee-0.128
inding-0.126
Token,
Token ID11
Feature activation+2.22e-03

Pos logit contributions
fire+0.287
 distance+0.257
 saf+0.231
 Zig+0.230
Ar+0.228
Neg logit contributions
els-0.239
atted-0.232
 opinions-0.225
ished-0.224
elling-0.221
Token but
Token ID475
Feature activation-1.08e-02

Pos logit contributions
els+0.005
atted+0.005
lee+0.004
ished+0.004
 opinions+0.004
Neg logit contributions
fire-0.004
 distance-0.004
 hill-0.003
 road-0.003
ding-0.003
Token I
Token ID314
Feature activation-2.38e-02

Pos logit contributions
fire+0.019
 distance+0.017
 hill+0.015
 road+0.015
 he+0.015
Neg logit contributions
atted-0.024
els-0.023
lee-0.023
ished-0.023
 opinions-0.022
Token can
Token ID460
Feature activation-1.29e-02

Pos logit contributions
fire+0.048
 distance+0.044
 hill+0.038
ding+0.038
 road+0.037
Neg logit contributions
els-0.046
atted-0.046
lee-0.044
ished-0.044
 opinions-0.043
Token't
Token ID470
Feature activation+2.32e-03

Pos logit contributions
fire+0.021
 distance+0.020
ding+0.016
storm+0.016
 saf+0.016
Neg logit contributions
els-0.029
lee-0.028
atted-0.028
 opinions-0.028
inding-0.027
Token for
Token ID329
Feature activation-3.11e-02

Pos logit contributions
els+0.001
atted+0.001
lee+0.001
ished+0.001
 opinions+0.001
Neg logit contributions
fire-0.001
 distance-0.001
 hill-0.001
ding-0.001
storm-0.001
Token daddy
Token ID35822
Feature activation+1.58e-02

Pos logit contributions
fire+0.066
 distance+0.063
ding+0.054
 hill+0.054
storm+0.053
Neg logit contributions
els-0.058
atted-0.055
lee-0.054
 opinions-0.053
ished-0.051
Token to
Token ID284
Feature activation-4.60e-02

Pos logit contributions
els+0.032
atted+0.032
lee+0.031
ished+0.031
 opinions+0.030
Neg logit contributions
fire-0.029
 distance-0.027
 hill-0.024
storm-0.023
ding-0.023
Token fix
Token ID4259
Feature activation-2.71e-02

Pos logit contributions
fire+0.090
 distance+0.080
ding+0.078
rob+0.077
field+0.072
Neg logit contributions
 opinions-0.085
els-0.083
lee-0.077
atted-0.075
 requests-0.074
Token the
Token ID262
Feature activation-6.22e-02

Pos logit contributions
fire+0.052
 distance+0.047
 hill+0.041
 road+0.040
ding+0.040
Neg logit contributions
atted-0.055
els-0.055
lee-0.053
ished-0.052
 opinions-0.051
Token roof
Token ID9753
Feature activation-1.35e-01

Pos logit contributions
fire+0.098
 distance+0.081
ding+0.077
|+0.074
whe+0.073
Neg logit contributions
els-0.147
 opinions-0.142
atted-0.138
lee-0.136
ubby-0.134
Token.
Token ID13
Feature activation-1.99e-02

Pos logit contributions
fire+0.269
 distance+0.250
storm+0.220
 saf+0.220
 Zig+0.219
Neg logit contributions
els-0.253
atted-0.242
ished-0.237
 opinions-0.232
lee-0.227
Token You
Token ID921
Feature activation+1.45e-02

Pos logit contributions
fire+0.036
 distance+0.033
 hill+0.029
 road+0.028
ding+0.028
Neg logit contributions
els-0.042
atted-0.042
lee-0.041
ished-0.040
 opinions-0.039
Token could
Token ID714
Feature activation-1.76e-02

Pos logit contributions
els+0.028
atted+0.027
ished+0.026
 opinions+0.026
lee+0.026
Neg logit contributions
fire-0.029
 distance-0.027
ding-0.023
 hill-0.023
storm-0.023
Token fall
Token ID2121
Feature activation+2.11e-02

Pos logit contributions
fire+0.037
 distance+0.034
ding+0.030
boat+0.030
field+0.028
Neg logit contributions
els-0.033
 opinions-0.031
inding-0.029
lee-0.028
 requests-0.028
Token and
Token ID290
Feature activation-4.72e-02

Pos logit contributions
els+0.037
atted+0.035
lee+0.034
ished+0.033
ubby+0.032
Neg logit contributions
fire-0.048
 distance-0.043
field-0.038
 saf-0.037
whe-0.037
Token had
Token ID550
Feature activation+1.97e-02

Pos logit contributions
fire+0.013
 distance+0.012
 hill+0.010
 road+0.010
ding+0.010
Neg logit contributions
lee-0.015
atted-0.015
els-0.014
ished-0.014
 opinions-0.014
Token someone
Token ID2130
Feature activation-5.65e-02

Pos logit contributions
 opinions+0.038
atted+0.038
lee+0.037
els+0.037
ished+0.036
Neg logit contributions
fire-0.040
 distance-0.036
 hill-0.033
ding-0.032
 road-0.031
Token to
Token ID284
Feature activation-4.81e-02

Pos logit contributions
fire+0.095
 distance+0.083
 hill+0.074
storm+0.072
ding+0.072
Neg logit contributions
atted-0.129
els-0.127
ished-0.124
lee-0.124
 opinions-0.122
Token share
Token ID2648
Feature activation-3.38e-03

Pos logit contributions
fire+0.100
ding+0.094
 distance+0.092
storm+0.087
 hill+0.083
Neg logit contributions
 opinions-0.084
els-0.080
lee-0.078
 requests-0.077
inding-0.072
Token the
Token ID262
Feature activation-7.05e-02

Pos logit contributions
fire+0.009
 distance+0.008
ding+0.008
storm+0.008
 hill+0.008
Neg logit contributions
els-0.004
 opinions-0.004
atted-0.004
lee-0.004
ished-0.004
Token hay
Token ID27678
Feature activation-1.34e-01

Pos logit contributions
fire+0.121
 distance+0.107
ding+0.091
field+0.089
storm+0.089
Neg logit contributions
els-0.154
atted-0.151
 opinions-0.150
ished-0.149
lee-0.145
Token with
Token ID351
Feature activation-4.87e-02

Pos logit contributions
fire+0.248
 distance+0.241
ding+0.235
Ar+0.227
storm+0.221
Neg logit contributions
els-0.236
ished-0.226
atted-0.225
 opinions-0.224
lee-0.220
Token.
Token ID13
Feature activation-3.00e-02

Pos logit contributions
fire+0.132
 distance+0.122
ding+0.118
 hill+0.117
storm+0.114
Neg logit contributions
els-0.063
lee-0.060
 opinions-0.058
atted-0.058
ished-0.053
Token Then
Token ID3244
Feature activation+6.52e-03

Pos logit contributions
fire+0.054
 distance+0.049
 hill+0.043
 road+0.043
 he+0.042
Neg logit contributions
els-0.067
atted-0.065
ished-0.062
ubby-0.061
lee-0.060
Token,
Token ID11
Feature activation+6.94e-03

Pos logit contributions
els+0.015
atted+0.014
ished+0.014
lee+0.014
ubby+0.013
Neg logit contributions
fire-0.011
 distance-0.011
 he-0.010
 hill-0.009
 Zig-0.009
Token she
Token ID673
Feature activation+8.01e-03

Pos logit contributions
atted+0.014
els+0.013
lee+0.013
 opinions+0.013
ished+0.013
Neg logit contributions
fire-0.013
 distance-0.012
ding-0.011
 hill-0.011
 road-0.011
Token through
Token ID832
Feature activation-2.90e-02

Pos logit contributions
els+0.012
lee+0.012
atted+0.011
ished+0.011
ubby+0.011
Neg logit contributions
fire-0.013
 distance-0.012
storm-0.011
field-0.011
ding-0.011
Token a
Token ID257
Feature activation-8.91e-02

Pos logit contributions
fire+0.052
 distance+0.047
ding+0.044
storm+0.043
whe+0.043
Neg logit contributions
els-0.064
lee-0.059
atted-0.059
 opinions-0.057
ished-0.056
Token crack
Token ID8469
Feature activation-4.43e-02

Pos logit contributions
fire+0.159
whe+0.146
ding+0.138
|+0.135
 Zig+0.135
Neg logit contributions
els-0.172
lee-0.168
ubby-0.155
ished-0.154
atted-0.153
Token in
Token ID287
Feature activation-5.57e-02

Pos logit contributions
fire+0.070
 distance+0.065
ding+0.057
 Zig+0.056
whe+0.056
Neg logit contributions
els-0.104
atted-0.097
ished-0.097
lee-0.095
ubby-0.093
Token the
Token ID262
Feature activation-6.14e-02

Pos logit contributions
fire+0.104
 distance+0.092
ding+0.083
storm+0.082
whe+0.081
Neg logit contributions
els-0.118
atted-0.114
lee-0.111
 opinions-0.110
ished-0.109
Token roof
Token ID9753
Feature activation-1.33e-01

Pos logit contributions
fire+0.125
whe+0.107
ding+0.107
|+0.102
 Zig+0.101
Neg logit contributions
els-0.111
lee-0.111
elling-0.107
atted-0.106
 opinions-0.106
Token.
Token ID13
Feature activation-2.09e-02

Pos logit contributions
fire+0.251
Ro+0.213
Ar+0.208
 Zig+0.206
 distance+0.203
Neg logit contributions
els-0.277
atted-0.254
lee-0.251
ished-0.246
 opinions-0.239
Token They
Token ID1119
Feature activation-2.35e-02

Pos logit contributions
fire+0.037
 distance+0.034
 hill+0.029
ding+0.028
 road+0.028
Neg logit contributions
els-0.045
atted-0.045
lee-0.044
ished-0.044
 opinions-0.043
Token sat
Token ID3332
Feature activation+3.95e-02

Pos logit contributions
fire+0.049
 distance+0.044
storm+0.040
ding+0.040
 hill+0.039
Neg logit contributions
els-0.044
atted-0.044
lee-0.043
ished-0.042
 opinions-0.040
Token down
Token ID866
Feature activation+2.62e-03

Pos logit contributions
els+0.087
atted+0.086
lee+0.085
ished+0.083
 opinions+0.083
Neg logit contributions
fire-0.069
 distance-0.063
 hill-0.055
ding-0.052
 road-0.052
Token and
Token ID290
Feature activation-3.32e-02

Pos logit contributions
els+0.004
atted+0.004
lee+0.003
ished+0.003
 opinions+0.003
Neg logit contributions
fire-0.006
 distance-0.006
storm-0.005
|-0.005
whe-0.005
Token that
Token ID326
Feature activation-4.22e-02

Pos logit contributions
atted+0.164
lee+0.162
els+0.154
 opinions+0.154
ished+0.144
Neg logit contributions
fire-0.124
 distance-0.112
ding-0.097
 hill-0.095
tr-0.095
Token they
Token ID484
Feature activation-3.25e-02

Pos logit contributions
fire+0.078
 distance+0.071
ding+0.066
storm+0.063
tr+0.062
Neg logit contributions
 opinions-0.084
els-0.084
atted-0.082
lee-0.080
ishment-0.077
Token could
Token ID714
Feature activation-4.00e-02

Pos logit contributions
fire+0.059
 distance+0.051
 road+0.047
 hill+0.047
 saf+0.046
Neg logit contributions
lee-0.068
atted-0.068
ished-0.066
els-0.066
ubby-0.063
Token fill
Token ID6070
Feature activation-1.35e-03

Pos logit contributions
fire+0.082
 distance+0.078
ding+0.070
storm+0.068
tr+0.067
Neg logit contributions
 opinions-0.065
els-0.065
lee-0.065
inding-0.064
atted-0.063
Token the
Token ID262
Feature activation-6.61e-02

Pos logit contributions
fire+0.003
 distance+0.003
 hill+0.002
ding+0.002
storm+0.002
Neg logit contributions
els-0.003
atted-0.003
lee-0.002
ished-0.002
 opinions-0.002
Token roof
Token ID9753
Feature activation-1.32e-01

Pos logit contributions
fire+0.122
 distance+0.103
whe+0.099
ding+0.098
storm+0.098
Neg logit contributions
els-0.134
atted-0.132
 opinions-0.131
lee-0.130
ished-0.128
Token with
Token ID351
Feature activation-4.73e-02

Pos logit contributions
fire+0.269
 distance+0.241
storm+0.236
Ro+0.235
 villains+0.232
Neg logit contributions
els-0.232
atted-0.231
ished-0.225
lee-0.215
 opinions-0.209
Token water
Token ID1660
Feature activation-7.36e-02

Pos logit contributions
fire+0.095
 distance+0.084
storm+0.082
ding+0.079
whe+0.078
Neg logit contributions
els-0.090
atted-0.090
 opinions-0.089
lee-0.087
ished-0.084
Token and
Token ID290
Feature activation-5.15e-02

Pos logit contributions
fire+0.155
 distance+0.144
whe+0.129
storm+0.128
 road+0.127
Neg logit contributions
atted-0.129
els-0.128
lee-0.122
ished-0.122
 opinions-0.117
Token make
Token ID787
Feature activation-5.32e-04

Pos logit contributions
fire+0.095
 distance+0.085
storm+0.082
ding+0.081
whe+0.076
Neg logit contributions
 opinions-0.101
atted-0.098
lee-0.096
els-0.096
ished-0.091
Token it
Token ID340
Feature activation-6.09e-02

Pos logit contributions
fire+0.001
 distance+0.001
storm+0.001
 hill+0.001
ding+0.001
Neg logit contributions
atted-0.001
els-0.001
lee-0.001
ished-0.001
 opinions-0.001
Token off
Token ID572
Feature activation-3.00e-02

Pos logit contributions
atted+0.041
lee+0.040
els+0.040
ished+0.038
 opinions+0.037
Neg logit contributions
fire-0.043
 distance-0.039
 hill-0.036
 road-0.035
storm-0.035
Token to
Token ID284
Feature activation-4.90e-02

Pos logit contributions
fire+0.062
 distance+0.053
tr+0.051
Ro+0.050
ding+0.050
Neg logit contributions
atted-0.058
lee-0.055
ened-0.050
els-0.050
ished-0.050
Token sit
Token ID1650
Feature activation-3.36e-03

Pos logit contributions
fire+0.091
 distance+0.082
ding+0.079
whe+0.073
storm+0.072
Neg logit contributions
lee-0.101
atted-0.099
els-0.099
 opinions-0.097
ubby-0.092
Token on
Token ID319
Feature activation-1.55e-02

Pos logit contributions
fire+0.007
 distance+0.006
storm+0.005
whe+0.005
 road+0.005
Neg logit contributions
els-0.007
atted-0.007
lee-0.006
ished-0.006
ubby-0.006
Token the
Token ID262
Feature activation-5.79e-02

Pos logit contributions
fire+0.034
 distance+0.030
whe+0.029
storm+0.029
ding+0.029
Neg logit contributions
els-0.026
 opinions-0.025
lee-0.024
atted-0.023
ened-0.023
Token roof
Token ID9753
Feature activation-1.31e-01

Pos logit contributions
fire+0.105
ding+0.089
|+0.087
 distance+0.086
whe+0.085
Neg logit contributions
lee-0.110
els-0.106
elling-0.105
ubby-0.102
atted-0.102
Token and
Token ID290
Feature activation-5.42e-02

Pos logit contributions
fire+0.262
Ro+0.247
Ar+0.234
 villains+0.232
 Zig+0.225
Neg logit contributions
atted-0.224
lee-0.213
els-0.211
ished-0.210
ishment-0.199
Token sun
Token ID4252
Feature activation-2.67e-02

Pos logit contributions
fire+0.101
 distance+0.089
ding+0.087
storm+0.085
whe+0.080
Neg logit contributions
atted-0.106
 opinions-0.103
lee-0.103
els-0.100
ished-0.095
Tokenbat
Token ID8664
Feature activation+1.74e-02

Pos logit contributions
fire+0.050
 distance+0.050
Ro+0.044
|+0.044
 villains+0.043
Neg logit contributions
atted-0.050
els-0.048
lee-0.047
ished-0.047
ened-0.045
Tokenhe
Token ID258
Feature activation+4.59e-03

Pos logit contributions
atted+0.039
ished+0.039
els+0.038
elling+0.038
ubby+0.038
Neg logit contributions
fire-0.026
 distance-0.024
 saf-0.022
 hill-0.022
 hunt-0.022
Token again
Token ID757
Feature activation-1.61e-02

Pos logit contributions
els+0.009
atted+0.009
lee+0.009
ened+0.008
 opinions+0.008
Neg logit contributions
 distance-0.008
fire-0.008
ding-0.007
tr-0.007
|-0.007
Token wanted
Token ID2227
Feature activation+3.93e-02

Pos logit contributions
els+0.076
atted+0.073
lee+0.071
ished+0.070
ubby+0.069
Neg logit contributions
fire-0.066
 distance-0.061
 hill-0.053
 he-0.053
 road-0.052
Token to
Token ID284
Feature activation-4.73e-02

Pos logit contributions
els+0.091
atted+0.090
lee+0.088
ished+0.086
 opinions+0.086
Neg logit contributions
fire-0.067
 distance-0.059
 hill-0.051
storm-0.051
ding-0.050
Token keep
Token ID1394
Feature activation-1.02e-02

Pos logit contributions
fire+0.098
 distance+0.088
ding+0.083
storm+0.080
 hill+0.078
Neg logit contributions
els-0.087
 opinions-0.087
lee-0.084
atted-0.083
ished-0.079
Token the
Token ID262
Feature activation-6.65e-02

Pos logit contributions
fire+0.016
 distance+0.015
 he+0.013
 hill+0.012
 saf+0.012
Neg logit contributions
atted-0.024
els-0.023
ished-0.023
lee-0.023
ubby-0.023
Token ash
Token ID12530
Feature activation-8.77e-02

Pos logit contributions
fire+0.122
 distance+0.112
 hill+0.097
 road+0.095
ding+0.093
Neg logit contributions
atted-0.141
els-0.141
lee-0.137
ished-0.134
 opinions-0.131
Tokentr
Token ID2213
Feature activation-1.30e-01

Pos logit contributions
fire+0.123
 distance+0.116
field+0.103
Ro+0.102
rob+0.100
Neg logit contributions
els-0.209
ished-0.199
 opinions-0.193
atted-0.190
ener-0.187
Tokenay
Token ID323
Feature activation+6.71e-03

Pos logit contributions
fire+0.346
 distance+0.326
 hill+0.289
field+0.287
 road+0.284
Neg logit contributions
els-0.184
atted-0.178
ished-0.168
lee-0.163
elling-0.158
Token.
Token ID13
Feature activation-2.98e-02

Pos logit contributions
els+0.014
atted+0.013
ished+0.013
 opinions+0.013
lee+0.012
Neg logit contributions
fire-0.013
 distance-0.012
field-0.011
 hill-0.011
 road-0.010
Token So
Token ID1406
Feature activation+5.27e-03

Pos logit contributions
fire+0.050
 distance+0.049
 he+0.044
 hill+0.044
 road+0.044
Neg logit contributions
els-0.065
atted-0.062
ubby-0.060
ished-0.059
lee-0.058
Token he
Token ID339
Feature activation+1.23e-02

Pos logit contributions
els+0.011
atted+0.011
lee+0.011
ished+0.010
 opinions+0.010
Neg logit contributions
fire-0.010
 distance-0.009
 hill-0.008
 he-0.008
 road-0.008
Token tried
Token ID3088
Feature activation+3.16e-02

Pos logit contributions
els+0.027
atted+0.026
ubby+0.025
lee+0.025
 opinions+0.025
Neg logit contributions
fire-0.022
 distance-0.019
 he-0.017
 hill-0.017
 ride-0.016
Token take
Token ID1011
Feature activation+2.80e-02

Pos logit contributions
fire+0.046
 distance+0.041
ding+0.038
field+0.036
storm+0.036
Neg logit contributions
els-0.052
 opinions-0.050
lee-0.049
atted-0.049
ished-0.048
Token a
Token ID257
Feature activation-7.10e-02

Pos logit contributions
els+0.060
atted+0.059
lee+0.057
ished+0.057
 opinions+0.056
Neg logit contributions
fire-0.052
 distance-0.047
 hill-0.041
storm-0.040
ding-0.040
Token march
Token ID9960
Feature activation-1.92e-02

Pos logit contributions
fire+0.139
field+0.112
storm+0.106
asis+0.104
stack+0.101
Neg logit contributions
atted-0.135
ubby-0.132
els-0.130
ished-0.130
ealing-0.128
Token around
Token ID1088
Feature activation-1.65e-02

Pos logit contributions
fire+0.039
 distance+0.034
field+0.034
asis+0.032
storm+0.031
Neg logit contributions
atted-0.034
els-0.033
ished-0.033
lee-0.032
 opinions-0.032
Token the
Token ID262
Feature activation-6.01e-02

Pos logit contributions
fire+0.033
 distance+0.030
ding+0.028
field+0.028
storm+0.027
Neg logit contributions
els-0.032
lee-0.030
atted-0.030
 opinions-0.029
ished-0.028
Token books
Token ID3835
Feature activation-1.30e-01

Pos logit contributions
fire+0.129
 distance+0.110
storm+0.107
field+0.105
boat+0.104
Neg logit contributions
els-0.106
lee-0.103
ished-0.102
 opinions-0.101
ubby-0.097
Tokenhe
Token ID258
Feature activation-3.95e-02

Pos logit contributions
fire+0.206
 distance+0.172
Snow+0.164
Ro+0.164
Ar+0.163
Neg logit contributions
els-0.271
ished-0.265
inding-0.257
atted-0.253
ishment-0.247
Tokenlf
Token ID1652
Feature activation-6.66e-02

Pos logit contributions
fire+0.066
 distance+0.061
ding+0.053
whe+0.050
field+0.049
Neg logit contributions
els-0.090
lee-0.085
atted-0.085
 opinions-0.082
ished-0.081
Token like
Token ID588
Feature activation-8.48e-03

Pos logit contributions
fire+0.123
 distance+0.115
Ro+0.105
Snow+0.104
ding+0.102
Neg logit contributions
els-0.126
atted-0.124
ished-0.123
lee-0.121
 opinions-0.116
Token he
Token ID339
Feature activation-2.26e-02

Pos logit contributions
fire+0.016
 distance+0.014
field+0.013
|+0.012
asis+0.012
Neg logit contributions
els-0.018
atted-0.018
lee-0.018
 opinions-0.017
ished-0.016
Token had
Token ID550
Feature activation+1.00e-02

Pos logit contributions
fire+0.044
 distance+0.040
 hill+0.034
ding+0.034
storm+0.033
Neg logit contributions
els-0.046
atted-0.045
ished-0.044
lee-0.044
 opinions-0.043
Token would
Token ID561
Feature activation-5.66e-03

Pos logit contributions
atted+0.034
els+0.034
ished+0.033
lee+0.033
 opinions+0.032
Neg logit contributions
fire-0.025
 distance-0.023
 hill-0.020
ding-0.019
 road-0.019
Token need
Token ID761
Feature activation-7.87e-03

Pos logit contributions
fire+0.011
 distance+0.010
ding+0.010
rob+0.010
whe+0.009
Neg logit contributions
 opinions-0.010
els-0.009
lee-0.009
inding-0.009
 requests-0.009
Token to
Token ID284
Feature activation-4.31e-02

Pos logit contributions
fire+0.015
 distance+0.014
storm+0.013
ding+0.013
 hill+0.013
Neg logit contributions
atted-0.015
els-0.015
 opinions-0.015
lee-0.014
ished-0.014
Token mend
Token ID47618
Feature activation+3.05e-03

Pos logit contributions
fire+0.083
 distance+0.078
ding+0.074
storm+0.070
whe+0.068
Neg logit contributions
els-0.082
 opinions-0.080
lee-0.077
atted-0.076
inding-0.074
Token the
Token ID262
Feature activation-6.09e-02

Pos logit contributions
els+0.006
atted+0.006
lee+0.006
ished+0.006
 opinions+0.006
Neg logit contributions
fire-0.006
 distance-0.005
storm-0.005
ding-0.005
 hill-0.005
Token roof
Token ID9753
Feature activation-1.30e-01

Pos logit contributions
fire+0.102
 distance+0.086
ding+0.080
whe+0.079
storm+0.077
Neg logit contributions
els-0.138
atted-0.133
lee-0.133
 opinions-0.133
ished-0.127
Token,
Token ID11
Feature activation-9.06e-03

Pos logit contributions
fire+0.263
 distance+0.236
Ro+0.229
storm+0.224
 Zig+0.220
Neg logit contributions
els-0.236
atted-0.228
ished-0.222
lee-0.215
 opinions-0.214
Token quickly
Token ID2952
Feature activation+1.63e-02

Pos logit contributions
fire+0.018
 distance+0.017
 hill+0.015
ding+0.015
storm+0.015
Neg logit contributions
atted-0.017
els-0.017
lee-0.017
ished-0.016
 opinions-0.016
Token.
Token ID13
Feature activation-3.52e-02

Pos logit contributions
atted+0.032
els+0.032
lee+0.031
ished+0.031
 opinions+0.030
Neg logit contributions
fire-0.031
 distance-0.029
 hill-0.025
ding-0.025
whe-0.025
Token
Token ID198
Feature activation-5.30e-02

Pos logit contributions
fire+0.057
 distance+0.052
storm+0.044
 hill+0.043
ding+0.042
Neg logit contributions
atted-0.082
lee-0.080
els-0.078
ished-0.078
 opinions-0.076
Token
Token ID198
Feature activation-4.21e-02

Pos logit contributions
fire+0.089
 distance+0.083
storm+0.078
 hill+0.070
field+0.069
Neg logit contributions
lee-0.114
atted-0.114
els-0.108
ished-0.105
 opinions-0.104
Token your
Token ID534
Feature activation-2.32e-03

Pos logit contributions
fire+0.021
 distance+0.020
 hill+0.017
ding+0.017
storm+0.017
Neg logit contributions
els-0.022
atted-0.022
lee-0.021
ished-0.021
 opinions-0.021
Token fingers
Token ID9353
Feature activation-4.92e-02

Pos logit contributions
fire+0.005
 distance+0.004
 hill+0.004
ding+0.004
 road+0.004
Neg logit contributions
els-0.004
atted-0.004
lee-0.004
ished-0.004
 opinions-0.004
Token in
Token ID287
Feature activation-5.04e-02

Pos logit contributions
fire+0.097
 distance+0.087
storm+0.078
 saf+0.077
 Zig+0.077
Neg logit contributions
els-0.097
atted-0.094
ished-0.092
lee-0.090
 opinions-0.088
Token the
Token ID262
Feature activation-6.40e-02

Pos logit contributions
fire+0.099
 distance+0.089
ding+0.083
storm+0.082
whe+0.080
Neg logit contributions
els-0.101
 opinions-0.095
atted-0.094
lee-0.093
ubby-0.090
Token ash
Token ID12530
Feature activation-8.87e-02

Pos logit contributions
fire+0.138
 distance+0.123
ding+0.116
field+0.112
whe+0.112
Neg logit contributions
els-0.116
atted-0.112
 opinions-0.110
lee-0.109
ished-0.107
Tokentr
Token ID2213
Feature activation-1.30e-01

Pos logit contributions
fire+0.104
 distance+0.098
|+0.077
Ar+0.075
whe+0.069
Neg logit contributions
els-0.243
ished-0.222
 opinions-0.222
atted-0.221
elling-0.218
Tokenay
Token ID323
Feature activation-3.87e-03

Pos logit contributions
fire+0.343
 distance+0.322
 hill+0.286
 road+0.282
field+0.280
Neg logit contributions
els-0.186
atted-0.183
ished-0.171
lee-0.168
elling-0.160
Token,
Token ID11
Feature activation-1.09e-02

Pos logit contributions
fire+0.008
 distance+0.007
 hill+0.006
 road+0.006
field+0.006
Neg logit contributions
els-0.008
atted-0.008
lee-0.007
ished-0.007
 opinions-0.007
Token so
Token ID523
Feature activation-6.25e-03

Pos logit contributions
fire+0.021
 distance+0.019
 hill+0.017
ding+0.017
storm+0.017
Neg logit contributions
els-0.022
atted-0.022
lee-0.021
ished-0.021
 opinions-0.020
Token Jon
Token ID5966
Feature activation+8.12e-02

Pos logit contributions
fire+0.010
 distance+0.009
storm+0.007
 hill+0.007
ding+0.007
Neg logit contributions
atted-0.015
els-0.015
lee-0.014
 opinions-0.014
ished-0.014
Token had
Token ID550
Feature activation+2.05e-02

Pos logit contributions
atted+0.161
els+0.161
lee+0.156
ished+0.154
 opinions+0.150
Neg logit contributions
fire-0.158
 distance-0.146
 hill-0.128
ding-0.125
 road-0.124
Token not
Token ID407
Feature activation-1.36e-02

Pos logit contributions
atted+0.027
els+0.027
 opinions+0.026
lee+0.026
ished+0.026
Neg logit contributions
fire-0.029
 distance-0.027
 hill-0.024
storm-0.023
ding-0.023
Token played
Token ID2826
Feature activation+8.21e-03

Pos logit contributions
fire+0.028
 distance+0.026
 hill+0.023
field+0.023
ding+0.022
Neg logit contributions
atted-0.026
els-0.026
lee-0.025
 opinions-0.025
ished-0.025
Token with
Token ID351
Feature activation-3.68e-02

Pos logit contributions
els+0.017
 opinions+0.016
ished+0.016
atted+0.015
inding+0.015
Neg logit contributions
 distance-0.015
fire-0.015
field-0.014
ding-0.013
 hill-0.013
Token the
Token ID262
Feature activation-6.24e-02

Pos logit contributions
fire+0.078
 distance+0.073
storm+0.064
ding+0.063
 hill+0.063
Neg logit contributions
els-0.072
atted-0.068
lee-0.066
 opinions-0.065
ished-0.065
Token ash
Token ID12530
Feature activation-8.95e-02

Pos logit contributions
fire+0.131
 distance+0.118
field+0.105
ding+0.104
 hill+0.103
Neg logit contributions
els-0.119
atted-0.116
lee-0.112
 opinions-0.111
ished-0.110
Tokentr
Token ID2213
Feature activation-1.29e-01

Pos logit contributions
fire+0.119
 distance+0.117
field+0.097
|+0.096
Ar+0.094
Neg logit contributions
els-0.225
ished-0.206
 opinions-0.206
elling-0.200
atted-0.198
Tokenay
Token ID323
Feature activation+3.50e-03

Pos logit contributions
fire+0.343
 distance+0.320
 hill+0.283
field+0.282
 road+0.279
Neg logit contributions
els-0.187
atted-0.181
ished-0.170
lee-0.167
elling-0.160
Token.
Token ID13
Feature activation-2.81e-02

Pos logit contributions
els+0.008
atted+0.008
ished+0.007
 opinions+0.007
elling+0.007
Neg logit contributions
fire-0.006
 distance-0.006
field-0.005
 hill-0.005
 trail-0.005
Token They
Token ID1119
Feature activation-2.85e-02

Pos logit contributions
fire+0.047
 distance+0.043
 hill+0.036
ding+0.035
 road+0.035
Neg logit contributions
els-0.064
atted-0.064
lee-0.062
ished-0.061
 opinions-0.060
Token wished
Token ID16555
Feature activation+5.31e-02

Pos logit contributions
fire+0.072
 distance+0.061
ding+0.060
 hill+0.059
storm+0.058
Neg logit contributions
 opinions-0.041
atted-0.041
lee-0.041
els-0.041
ished-0.038
Token they
Token ID484
Feature activation-2.85e-02

Pos logit contributions
atted+0.131
els+0.130
lee+0.130
 opinions+0.125
ished+0.119
Neg logit contributions
fire-0.082
 distance-0.075
 hill-0.061
ding-0.059
asis-0.059
Token spend
Token ID4341
Feature activation+3.37e-03

Pos logit contributions
fire+0.075
 distance+0.067
ding+0.058
storm+0.057
 hill+0.056
Neg logit contributions
els-0.089
atted-0.085
lee-0.085
 opinions-0.084
ished-0.082
Token the
Token ID262
Feature activation-5.48e-02

Pos logit contributions
atted+0.007
els+0.007
lee+0.007
ished+0.007
 opinions+0.007
Neg logit contributions
fire-0.006
 distance-0.006
 hill-0.005
 road-0.005
ding-0.005
Token day
Token ID1110
Feature activation-4.44e-02

Pos logit contributions
fire+0.094
 distance+0.080
field+0.074
ding+0.072
storm+0.069
Neg logit contributions
 opinions-0.120
els-0.119
atted-0.116
lee-0.114
ished-0.113
Token exploring
Token ID13504
Feature activation-2.03e-02

Pos logit contributions
fire+0.084
 distance+0.078
storm+0.067
 hill+0.067
field+0.066
Neg logit contributions
els-0.089
atted-0.087
lee-0.085
ished-0.085
 opinions-0.083
Token this
Token ID428
Feature activation-6.13e-02

Pos logit contributions
fire+0.040
 distance+0.037
storm+0.032
ding+0.032
 hill+0.032
Neg logit contributions
els-0.040
atted-0.039
lee-0.038
ished-0.037
 opinions-0.037
Token roof
Token ID9753
Feature activation-1.29e-01

Pos logit contributions
fire+0.109
 distance+0.097
ding+0.085
whe+0.084
storm+0.083
Neg logit contributions
els-0.132
atted-0.127
lee-0.126
 opinions-0.125
ished-0.123
Token and
Token ID290
Feature activation-5.03e-02

Pos logit contributions
fire+0.261
 distance+0.240
storm+0.214
 Zig+0.210
 saf+0.209
Neg logit contributions
els-0.240
atted-0.231
ished-0.227
 opinions-0.224
lee-0.222
Token having
Token ID1719
Feature activation+1.90e-02

Pos logit contributions
fire+0.091
 distance+0.083
ding+0.075
storm+0.072
boat+0.070
Neg logit contributions
 opinions-0.101
els-0.100
lee-0.100
atted-0.099
ished-0.094
Token fun
Token ID1257
Feature activation+3.74e-03

Pos logit contributions
lee+0.040
els+0.040
atted+0.039
 opinions+0.039
ished+0.038
Neg logit contributions
fire-0.035
 distance-0.031
storm-0.027
ding-0.027
field-0.027
Token!"
Token ID2474
Feature activation-2.43e-02

Pos logit contributions
atted+0.008
els+0.008
lee+0.008
ished+0.008
 opinions+0.008
Neg logit contributions
fire-0.007
 distance-0.006
 hill-0.005
 road-0.005
ding-0.005
Token said
Token ID531
Feature activation+5.05e-02

Pos logit contributions
fire+0.055
 distance+0.051
 hill+0.046
 road+0.044
ding+0.044
Neg logit contributions
atted-0.042
els-0.040
lee-0.039
 opinions-0.039
ished-0.037
Token all
Token ID477
Feature activation-5.18e-02

Pos logit contributions
fire+0.001
 distance+0.001
storm+0.001
 hill+0.001
ding+0.001
Neg logit contributions
els-0.002
atted-0.002
lee-0.002
ished-0.002
 opinions-0.002
Token the
Token ID262
Feature activation-4.90e-02

Pos logit contributions
fire+0.080
 distance+0.076
 hill+0.065
 he+0.064
 road+0.061
Neg logit contributions
atted-0.121
els-0.120
lee-0.118
ished-0.116
ealing-0.115
Token books
Token ID3835
Feature activation-1.13e-01

Pos logit contributions
fire+0.087
 distance+0.084
 hill+0.077
 road+0.076
 saf+0.070
Neg logit contributions
atted-0.107
lee-0.103
els-0.099
ealing-0.099
ubby-0.097
Token on
Token ID319
Feature activation-3.85e-02

Pos logit contributions
fire+0.200
 distance+0.188
 saf+0.163
 Zig+0.162
storm+0.162
Neg logit contributions
els-0.234
ished-0.228
atted-0.225
lee-0.216
inding-0.215
Token the
Token ID262
Feature activation-6.13e-02

Pos logit contributions
fire+0.074
 distance+0.067
storm+0.060
ding+0.059
whe+0.058
Neg logit contributions
els-0.080
lee-0.074
atted-0.074
 opinions-0.073
ished-0.071
Token books
Token ID3835
Feature activation-1.29e-01

Pos logit contributions
fire+0.119
 distance+0.104
storm+0.095
field+0.094
ding+0.091
Neg logit contributions
els-0.123
 opinions-0.116
lee-0.115
ished-0.114
atted-0.113
Tokenhe
Token ID258
Feature activation-3.53e-02

Pos logit contributions
fire+0.174
 distance+0.156
 Zig+0.138
 saf+0.135
Ro+0.134
Neg logit contributions
els-0.312
ished-0.305
inding-0.298
atted-0.292
lee-0.289
Tokenlf
Token ID1652
Feature activation-7.04e-02

Pos logit contributions
fire+0.059
 distance+0.054
ding+0.045
whe+0.044
 hill+0.044
Neg logit contributions
els-0.081
atted-0.077
lee-0.077
 opinions-0.075
ished-0.074
Token.
Token ID13
Feature activation-2.40e-02

Pos logit contributions
fire+0.121
 distance+0.117
Ro+0.098
storm+0.098
 Zig+0.096
Neg logit contributions
els-0.151
ished-0.145
atted-0.144
lee-0.142
 opinions-0.139
Token
Token ID198
Feature activation-4.88e-02

Pos logit contributions
fire+0.049
 distance+0.045
 hill+0.039
ding+0.038
storm+0.038
Neg logit contributions
els-0.046
atted-0.046
lee-0.045
ished-0.044
 opinions-0.043
Token
Token ID198
Feature activation-4.18e-02

Pos logit contributions
fire+0.074
 distance+0.066
storm+0.056
 hill+0.054
field+0.052
Neg logit contributions
els-0.115
atted-0.114
lee-0.113
inding-0.108
ubby-0.108
Token not
Token ID407
Feature activation-6.24e-03

Pos logit contributions
atted+0.169
els+0.168
lee+0.164
ished+0.163
ishment+0.160
Neg logit contributions
fire-0.111
 distance-0.093
 hill-0.082
 he-0.080
ding-0.079
Token to
Token ID284
Feature activation-4.65e-02

Pos logit contributions
fire+0.010
 distance+0.009
 hill+0.007
ding+0.007
storm+0.007
Neg logit contributions
els-0.015
atted-0.015
lee-0.014
ished-0.014
 opinions-0.014
Token hang
Token ID8181
Feature activation-1.86e-02

Pos logit contributions
fire+0.075
 distance+0.069
 hill+0.059
 road+0.057
ding+0.056
Neg logit contributions
atted-0.108
els-0.107
lee-0.104
ished-0.103
 opinions-0.101
Token onto
Token ID4291
Feature activation-3.72e-02

Pos logit contributions
fire+0.036
 distance+0.032
 hill+0.028
ding+0.028
 road+0.028
Neg logit contributions
atted-0.037
els-0.037
lee-0.036
ished-0.036
 opinions-0.035
Token the
Token ID262
Feature activation-6.93e-02

Pos logit contributions
fire+0.077
 distance+0.071
ding+0.062
storm+0.061
 hill+0.061
Neg logit contributions
els-0.072
atted-0.069
lee-0.067
 opinions-0.066
ished-0.066
Token branches
Token ID13737
Feature activation-1.29e-01

Pos logit contributions
fire+0.114
 distance+0.096
ding+0.080
boat+0.079
field+0.079
Neg logit contributions
els-0.166
lee-0.159
atted-0.158
ished-0.155
 opinions-0.153
Token too
Token ID1165
Feature activation-6.58e-02

Pos logit contributions
fire+0.258
 distance+0.230
 saf+0.217
 Zig+0.210
storm+0.206
Neg logit contributions
els-0.246
ished-0.230
ubby-0.229
atted-0.224
elling-0.224
Token hard
Token ID1327
Feature activation+3.04e-02

Pos logit contributions
fire+0.112
 distance+0.101
whe+0.087
storm+0.087
 saf+0.083
Neg logit contributions
els-0.148
atted-0.142
lee-0.140
ubby-0.139
ished-0.138
Token.
Token ID13
Feature activation-3.29e-02

Pos logit contributions
els+0.069
atted+0.069
lee+0.067
ished+0.066
 opinions+0.065
Neg logit contributions
fire-0.051
 distance-0.046
 hill-0.039
ding-0.037
 road-0.037
Token We
Token ID775
Feature activation-1.30e-02

Pos logit contributions
fire+0.068
 distance+0.062
 hill+0.055
 road+0.054
ding+0.054
Neg logit contributions
els-0.062
atted-0.062
lee-0.059
ished-0.059
 opinions-0.057
Token don
Token ID836
Feature activation+5.24e-02

Pos logit contributions
fire+0.025
 distance+0.023
 hill+0.020
ding+0.019
 road+0.019
Neg logit contributions
els-0.026
atted-0.026
lee-0.025
ished-0.025
 opinions-0.025
Token the
Token ID262
Feature activation-4.62e-02

Pos logit contributions
els+0.009
atted+0.009
lee+0.009
ished+0.009
 opinions+0.009
Neg logit contributions
fire-0.007
 distance-0.007
 hill-0.006
ding-0.006
 road-0.006
Token books
Token ID3835
Feature activation-1.20e-01

Pos logit contributions
fire+0.084
 distance+0.071
storm+0.064
field+0.064
ding+0.062
Neg logit contributions
els-0.101
atted-0.096
 opinions-0.095
ished-0.095
lee-0.094
Token back
Token ID736
Feature activation-2.30e-02

Pos logit contributions
fire+0.227
Ro+0.206
 distance+0.198
 villains+0.196
 Zig+0.193
Neg logit contributions
els-0.219
ished-0.217
atted-0.210
lee-0.198
ened-0.195
Token on
Token ID319
Feature activation-2.12e-02

Pos logit contributions
fire+0.043
 distance+0.040
storm+0.034
 hill+0.033
field+0.033
Neg logit contributions
els-0.049
atted-0.048
ished-0.046
lee-0.046
 opinions-0.044
Token the
Token ID262
Feature activation-5.38e-02

Pos logit contributions
fire+0.044
 distance+0.038
storm+0.036
whe+0.035
ding+0.035
Neg logit contributions
els-0.040
lee-0.037
atted-0.036
 opinions-0.036
ishment-0.035
Token books
Token ID3835
Feature activation-1.28e-01

Pos logit contributions
fire+0.103
 distance+0.085
boat+0.080
field+0.080
storm+0.079
Neg logit contributions
els-0.106
lee-0.103
 opinions-0.102
ished-0.099
atted-0.098
Tokenhe
Token ID258
Feature activation-4.16e-02

Pos logit contributions
fire+0.210
Ro+0.184
 distance+0.179
 Zig+0.172
 villains+0.167
Neg logit contributions
els-0.259
ished-0.252
lee-0.246
inding-0.244
atted-0.243
Tokenlf
Token ID1652
Feature activation-7.16e-02

Pos logit contributions
fire+0.071
 distance+0.065
ding+0.056
whe+0.054
 Zig+0.053
Neg logit contributions
els-0.093
lee-0.089
atted-0.089
 opinions-0.086
ished-0.084
Token,
Token ID11
Feature activation-1.03e-03

Pos logit contributions
fire+0.137
 distance+0.131
Ro+0.123
|+0.115
Snow+0.114
Neg logit contributions
els-0.134
lee-0.131
atted-0.131
ished-0.128
ened-0.121
Token smiling
Token ID16755
Feature activation+7.21e-02

Pos logit contributions
fire+0.001
 distance+0.001
|+0.001
storm+0.001
ding+0.001
Neg logit contributions
atted-0.003
els-0.003
lee-0.003
 opinions-0.003
ished-0.003
Token proudly
Token ID23017
Feature activation+3.51e-02

Pos logit contributions
atted+0.132
lee+0.130
els+0.129
ished+0.126
 opinions+0.122
Neg logit contributions
fire-0.153
 distance-0.142
 hill-0.125
field-0.122
 road-0.121
Token shining
Token ID22751
Feature activation+1.71e-02

Pos logit contributions
els+0.025
atted+0.025
lee+0.024
ished+0.023
 opinions+0.023
Neg logit contributions
fire-0.018
 distance-0.016
 hill-0.014
 road-0.013
ding-0.013
Token brightly
Token ID35254
Feature activation+1.03e-02

Pos logit contributions
els+0.029
atted+0.029
lee+0.028
ished+0.027
 opinions+0.026
Neg logit contributions
fire-0.038
 distance-0.035
field-0.031
 hill-0.031
|-0.031
Token up
Token ID510
Feature activation-1.38e-02

Pos logit contributions
els+0.020
atted+0.020
lee+0.019
ished+0.019
 opinions+0.019
Neg logit contributions
fire-0.021
 distance-0.019
 hill-0.017
ding-0.017
 road-0.016
Token on
Token ID319
Feature activation-3.68e-02

Pos logit contributions
fire+0.029
 distance+0.026
storm+0.023
 hill+0.023
whe+0.023
Neg logit contributions
els-0.025
atted-0.025
lee-0.024
ished-0.024
ishment-0.023
Token the
Token ID262
Feature activation-5.34e-02

Pos logit contributions
fire+0.075
 distance+0.067
 hill+0.059
ding+0.059
storm+0.059
Neg logit contributions
els-0.072
atted-0.069
lee-0.068
 opinions-0.066
ished-0.065
Token roof
Token ID9753
Feature activation-1.28e-01

Pos logit contributions
fire+0.107
 distance+0.092
field+0.090
 Zig+0.089
storm+0.088
Neg logit contributions
els-0.097
lee-0.094
 opinions-0.090
elling-0.089
atted-0.089
Token of
Token ID286
Feature activation-5.11e-02

Pos logit contributions
fire+0.275
Ro+0.248
 distance+0.238
 villains+0.237
|+0.235
Neg logit contributions
els-0.219
atted-0.213
lee-0.207
ished-0.198
elling-0.194
Token the
Token ID262
Feature activation-5.28e-02

Pos logit contributions
fire+0.114
 distance+0.099
storm+0.097
ding+0.095
asis+0.093
Neg logit contributions
els-0.090
 opinions-0.082
lee-0.081
ishment-0.079
atted-0.077
Token building
Token ID2615
Feature activation-8.28e-02

Pos logit contributions
fire+0.103
 Zig+0.087
field+0.085
boat+0.085
 distance+0.084
Neg logit contributions
els-0.099
lee-0.095
 opinions-0.092
elling-0.090
atted-0.089
Token.
Token ID13
Feature activation-3.10e-02

Pos logit contributions
fire+0.169
 distance+0.152
Ro+0.142
|+0.140
 Zig+0.138
Neg logit contributions
els-0.151
atted-0.150
lee-0.146
 opinions-0.142
ished-0.140
Token Tom
Token ID4186
Feature activation+8.55e-02

Pos logit contributions
fire+0.066
 distance+0.060
 hill+0.053
storm+0.052
ding+0.051
Neg logit contributions
atted-0.057
els-0.057
lee-0.056
ished-0.054
 opinions-0.053
Token money
Token ID1637
Feature activation-6.47e-02

Pos logit contributions
fire+0.069
 distance+0.053
ding+0.048
|+0.045
storm+0.045
Neg logit contributions
els-0.153
 opinions-0.150
atted-0.149
lee-0.146
ished-0.145
Token to
Token ID284
Feature activation-5.28e-02

Pos logit contributions
fire+0.126
 distance+0.116
Ro+0.107
 Zig+0.106
|+0.104
Neg logit contributions
els-0.119
atted-0.117
ished-0.115
lee-0.112
 opinions-0.111
Token build
Token ID1382
Feature activation-2.07e-02

Pos logit contributions
fire+0.102
 distance+0.100
ding+0.096
rob+0.094
storm+0.093
Neg logit contributions
 opinions-0.091
els-0.087
lee-0.083
 requests-0.081
atted-0.079
Token a
Token ID257
Feature activation-9.33e-02

Pos logit contributions
fire+0.042
 distance+0.039
storm+0.036
field+0.035
ding+0.035
Neg logit contributions
els-0.040
 opinions-0.037
atted-0.037
ished-0.036
lee-0.036
Token new
Token ID649
Feature activation-2.98e-02

Pos logit contributions
rob+0.187
fire+0.182
|+0.178
whe+0.173
seek+0.165
Neg logit contributions
 mistakes-0.137
 requests-0.133
ishment-0.132
lee-0.132
ealing-0.127
Token roof
Token ID9753
Feature activation-1.28e-01

Pos logit contributions
fire+0.058
ding+0.049
 distance+0.048
|+0.048
whe+0.047
Neg logit contributions
 opinions-0.058
els-0.057
atted-0.055
lee-0.053
ishment-0.053
Token.
Token ID13
Feature activation-2.47e-02

Pos logit contributions
fire+0.238
 distance+0.219
Ro+0.218
 Zig+0.214
Ar+0.209
Neg logit contributions
els-0.237
atted-0.226
ished-0.224
 opinions-0.214
lee-0.213
Token The
Token ID383
Feature activation-5.96e-03

Pos logit contributions
fire+0.046
 distance+0.043
 hill+0.037
ding+0.036
storm+0.036
Neg logit contributions
atted-0.051
els-0.051
lee-0.050
ished-0.049
 opinions-0.048
Token father
Token ID2988
Feature activation+3.19e-02

Pos logit contributions
fire+0.012
 distance+0.011
 hill+0.009
 road+0.009
ding+0.009
Neg logit contributions
els-0.012
atted-0.012
lee-0.012
ished-0.012
 opinions-0.011
Token was
Token ID373
Feature activation+4.62e-02

Pos logit contributions
els+0.068
atted+0.067
lee+0.066
ished+0.064
 opinions+0.064
Neg logit contributions
fire-0.058
 distance-0.051
 hill-0.046
 he-0.044
ding-0.044
Token a
Token ID257
Feature activation-4.62e-02

Pos logit contributions
els+0.109
atted+0.107
lee+0.105
ished+0.103
 opinions+0.101
Neg logit contributions
fire-0.075
 distance-0.068
 hill-0.056
ding-0.055
 road-0.055
Token cozy
Token ID37438
Feature activation+2.42e-03

Pos logit contributions
fire+0.139
|+0.137
rob+0.137
whe+0.135
 Zig+0.125
Neg logit contributions
lee-0.138
ubby-0.129
els-0.129
 mistakes-0.128
ishment-0.126
Token nest
Token ID16343
Feature activation-7.79e-02

Pos logit contributions
lee+0.005
atted+0.005
ubby+0.005
els+0.005
ishment+0.005
Neg logit contributions
fire-0.004
whe-0.004
ding-0.003
 distance-0.003
storm-0.003
Token on
Token ID319
Feature activation-3.28e-02

Pos logit contributions
fire+0.132
 distance+0.130
Ro+0.125
 Zig+0.120
Ar+0.119
Neg logit contributions
els-0.156
atted-0.151
ished-0.148
lee-0.144
 opinions-0.139
Token the
Token ID262
Feature activation-5.83e-02

Pos logit contributions
fire+0.069
 distance+0.064
ding+0.060
whe+0.060
storm+0.057
Neg logit contributions
els-0.060
 opinions-0.054
lee-0.053
ishment-0.051
atted-0.050
Token green
Token ID4077
Feature activation-7.35e-03

Pos logit contributions
fire+0.094
whe+0.082
|+0.082
ding+0.080
 distance+0.077
Neg logit contributions
els-0.114
lee-0.113
 opinions-0.108
elling-0.107
ubby-0.107
Token roof
Token ID9753
Feature activation-1.28e-01

Pos logit contributions
fire+0.011
whe+0.010
 distance+0.010
ding+0.010
|+0.009
Neg logit contributions
els-0.016
lee-0.016
atted-0.016
 opinions-0.015
ished-0.015
Token.
Token ID13
Feature activation-2.55e-02

Pos logit contributions
fire+0.231
Ro+0.213
 distance+0.206
 villains+0.206
 Zig+0.203
Neg logit contributions
els-0.252
atted-0.244
lee-0.236
ished-0.235
 opinions-0.227
Token From
Token ID3574
Feature activation+4.69e-02

Pos logit contributions
fire+0.053
 distance+0.049
 hill+0.043
storm+0.042
ding+0.042
Neg logit contributions
atted-0.048
lee-0.047
els-0.047
ished-0.045
 opinions-0.044
Token that
Token ID326
Feature activation+1.84e-03

Pos logit contributions
els+0.110
atted+0.109
ished+0.108
lee+0.105
 opinions+0.105
Neg logit contributions
fire-0.075
 distance-0.068
 hill-0.059
 he-0.058
field-0.057
Token day
Token ID1110
Feature activation+2.74e-02

Pos logit contributions
els+0.004
atted+0.004
lee+0.004
ubby+0.004
 opinions+0.004
Neg logit contributions
fire-0.003
 distance-0.002
ding-0.002
whe-0.002
 Zig-0.002
Token on
Token ID319
Feature activation+3.55e-02

Pos logit contributions
els+0.070
atted+0.070
lee+0.068
ished+0.068
 opinions+0.067
Neg logit contributions
fire-0.038
 distance-0.033
 hill-0.027
ding-0.026
 road-0.026
Token of
Token ID286
Feature activation-1.24e-02

Pos logit contributions
atted+0.119
lee+0.115
els+0.115
 opinions+0.107
ished+0.106
Neg logit contributions
fire-0.109
 distance-0.092
field-0.087
ding-0.083
 hill-0.082
Token being
Token ID852
Feature activation+6.11e-03

Pos logit contributions
fire+0.023
 distance+0.021
 hill+0.018
 road+0.018
ding+0.018
Neg logit contributions
atted-0.026
els-0.026
lee-0.025
ished-0.025
 opinions-0.024
Token alone
Token ID3436
Feature activation+1.88e-02

Pos logit contributions
els+0.013
atted+0.012
lee+0.012
 opinions+0.012
ishment+0.012
Neg logit contributions
fire-0.011
 distance-0.010
 he-0.009
whe-0.009
 road-0.009
Token on
Token ID319
Feature activation-2.68e-02

Pos logit contributions
els+0.043
atted+0.042
lee+0.041
ished+0.041
 opinions+0.040
Neg logit contributions
fire-0.031
 distance-0.028
ding-0.024
field-0.024
storm-0.024
Token the
Token ID262
Feature activation-6.33e-02

Pos logit contributions
fire+0.052
 distance+0.047
ding+0.041
 hill+0.040
storm+0.040
Neg logit contributions
els-0.055
atted-0.053
lee-0.052
ished-0.051
 opinions-0.051
Token roof
Token ID9753
Feature activation-1.28e-01

Pos logit contributions
fire+0.100
 distance+0.077
boat+0.073
field+0.073
To+0.071
Neg logit contributions
 opinions-0.139
els-0.138
lee-0.133
 heads-0.130
elling-0.130
Token.
Token ID13
Feature activation-2.06e-02

Pos logit contributions
fire+0.248
 distance+0.216
Ro+0.213
Ar+0.212
field+0.202
Neg logit contributions
els-0.249
atted-0.229
 opinions-0.224
ished-0.218
lee-0.215
Token She
Token ID1375
Feature activation-1.57e-02

Pos logit contributions
fire+0.037
 distance+0.034
 hill+0.030
ding+0.029
 road+0.029
Neg logit contributions
els-0.044
atted-0.044
lee-0.043
ished-0.042
 opinions-0.041
Token tried
Token ID3088
Feature activation+3.36e-02

Pos logit contributions
fire+0.030
 distance+0.028
 hill+0.024
 road+0.024
 he+0.024
Neg logit contributions
els-0.032
atted-0.031
lee-0.031
ished-0.030
 opinions-0.030
Token to
Token ID284
Feature activation-2.59e-02

Pos logit contributions
atted+0.074
els+0.073
lee+0.071
 opinions+0.070
ished+0.069
Neg logit contributions
fire-0.062
 distance-0.051
storm-0.047
ding-0.045
 hill-0.045
Token follow
Token ID1061
Feature activation+1.57e-02

Pos logit contributions
fire+0.044
 distance+0.040
 hill+0.034
ding+0.034
storm+0.033
Neg logit contributions
els-0.058
atted-0.058
lee-0.056
ished-0.055
 opinions-0.055
Token thing
Token ID1517
Feature activation-6.38e-02

Pos logit contributions
els+0.016
atted+0.016
 opinions+0.016
ished+0.015
ubby+0.015
Neg logit contributions
fire-0.018
 distance-0.017
|-0.016
whe-0.016
ding-0.016
Token on
Token ID319
Feature activation-4.27e-02

Pos logit contributions
fire+0.118
 distance+0.114
|+0.101
field+0.094
ding+0.092
Neg logit contributions
els-0.131
atted-0.123
ished-0.120
lee-0.117
 opinions-0.113
Token the
Token ID262
Feature activation-6.20e-02

Pos logit contributions
fire+0.092
 distance+0.083
ding+0.080
whe+0.079
storm+0.076
Neg logit contributions
els-0.079
 opinions-0.068
ubby-0.066
ishment-0.066
lee-0.066
Token table
Token ID3084
Feature activation-9.26e-02

Pos logit contributions
fire+0.126
|+0.116
ding+0.112
whe+0.112
 Zig+0.111
Neg logit contributions
els-0.095
elling-0.091
ished-0.088
 opinions-0.088
ubby-0.086
Token.
Token ID13
Feature activation-2.44e-02

Pos logit contributions
fire+0.159
 distance+0.148
 Zig+0.139
|+0.135
ding+0.134
Neg logit contributions
els-0.192
ished-0.182
atted-0.180
lee-0.179
 opinions-0.176
Token Tim
Token ID5045
Feature activation+9.22e-02

Pos logit contributions
fire+0.044
 distance+0.040
field+0.033
storm+0.033
 hill+0.032
Neg logit contributions
atted-0.054
lee-0.053
els-0.052
ished-0.051
 opinions-0.051
Token wasn
Token ID2492
Feature activation+7.45e-02

Pos logit contributions
els+0.216
atted+0.209
ubby+0.207
elling+0.206
 opinions+0.203
Neg logit contributions
fire-0.143
 distance-0.137
 hill-0.121
 he-0.121
 road-0.114
Token't
Token ID470
Feature activation+1.23e-02

Pos logit contributions
els+0.182
atted+0.179
lee+0.176
ished+0.174
ubby+0.173
Neg logit contributions
fire-0.111
 distance-0.098
 hill-0.079
field-0.077
 saf-0.077
Token sure
Token ID1654
Feature activation+5.92e-02

Pos logit contributions
els+0.016
atted+0.016
lee+0.016
ished+0.015
 opinions+0.015
Neg logit contributions
fire-0.032
 distance-0.030
 hill-0.028
storm-0.027
field-0.027
Token what
Token ID644
Feature activation-4.17e-02

Pos logit contributions
atted+0.097
els+0.097
lee+0.095
ishment+0.094
ished+0.093
Neg logit contributions
fire-0.128
 he-0.120
 distance-0.119
 hill-0.111
 road-0.107
Token it
Token ID340
Feature activation-1.82e-02

Pos logit contributions
 distance+0.074
fire+0.073
ding+0.065
storm+0.062
whe+0.061
Neg logit contributions
els-0.084
lee-0.081
 opinions-0.081
atted-0.080
ished-0.075
Token is
Token ID318
Feature activation+7.74e-03

Pos logit contributions
fire+0.032
 distance+0.030
 hill+0.027
ding+0.026
 road+0.026
Neg logit contributions
els-0.025
atted-0.024
lee-0.024
ished-0.023
 opinions-0.023
Token ugly
Token ID13400
Feature activation+4.21e-02

Pos logit contributions
els+0.016
atted+0.016
lee+0.015
ished+0.015
 opinions+0.015
Neg logit contributions
fire-0.015
 distance-0.014
 hill-0.012
ding-0.012
storm-0.012
Token."
Token ID526
Feature activation-2.32e-02

Pos logit contributions
els+0.096
atted+0.096
lee+0.093
ished+0.092
 opinions+0.090
Neg logit contributions
fire-0.070
 distance-0.063
 hill-0.054
 road-0.052
ding-0.052
Token
Token ID198
Feature activation-4.38e-02

Pos logit contributions
fire+0.042
 distance+0.038
field+0.031
storm+0.031
boat+0.031
Neg logit contributions
atted-0.052
lee-0.050
 opinions-0.049
els-0.049
ished-0.047
Token
Token ID198
Feature activation-3.67e-02

Pos logit contributions
fire+0.071
 distance+0.063
storm+0.055
 hill+0.052
field+0.051
Neg logit contributions
atted-0.102
els-0.100
lee-0.100
 opinions-0.096
ished-0.096
TokenTim
Token ID14967
Feature activation+9.22e-02

Pos logit contributions
fire+0.049
 distance+0.043
 hill+0.034
field+0.034
ding+0.033
Neg logit contributions
els-0.096
atted-0.096
lee-0.093
ished-0.092
 opinions-0.092
Token thought
Token ID1807
Feature activation+7.22e-02

Pos logit contributions
els+0.192
atted+0.182
ubby+0.181
lee+0.179
elling+0.176
Neg logit contributions
fire-0.170
 distance-0.158
 hill-0.140
 he-0.139
 ride-0.134
Token about
Token ID546
Feature activation+1.38e-02

Pos logit contributions
els+0.130
atted+0.129
lee+0.124
ished+0.123
 opinions+0.120
Neg logit contributions
fire-0.155
 distance-0.144
 hill-0.128
ding-0.125
 road-0.125
Token what
Token ID644
Feature activation-3.71e-02

Pos logit contributions
els+0.023
 opinions+0.021
lee+0.021
atted+0.021
inding+0.018
Neg logit contributions
fire-0.034
 distance-0.032
ding-0.028
storm-0.027
whe-0.027
Token the
Token ID262
Feature activation-2.40e-02

Pos logit contributions
fire+0.077
 distance+0.071
ding+0.064
storm+0.061
field+0.061
Neg logit contributions
els-0.071
atted-0.070
 opinions-0.070
lee-0.068
ished-0.064
Token bird
Token ID6512
Feature activation-3.10e-02

Pos logit contributions
fire+0.039
 distance+0.037
 hill+0.031
 road+0.030
storm+0.029
Neg logit contributions
atted-0.056
els-0.056
lee-0.055
ished-0.054
 opinions-0.051
Token the
Token ID262
Feature activation-5.53e-02

Pos logit contributions
fire+0.039
 distance+0.034
ding+0.032
storm+0.032
whe+0.031
Neg logit contributions
els-0.036
atted-0.034
 opinions-0.033
lee-0.033
ished-0.032
Token step
Token ID2239
Feature activation-5.71e-02

Pos logit contributions
fire+0.109
|+0.091
storm+0.089
 lifetime+0.088
ding+0.088
Neg logit contributions
ubby-0.097
els-0.095
 eyes-0.093
 heads-0.092
 picks-0.089
Token.
Token ID13
Feature activation-3.25e-02

Pos logit contributions
fire+0.111
 distance+0.099
|+0.088
field+0.088
Ar+0.086
Neg logit contributions
els-0.114
atted-0.109
ished-0.105
lee-0.105
ubby-0.099
Token
Token ID198
Feature activation-5.25e-02

Pos logit contributions
fire+0.062
 distance+0.058
boat+0.050
storm+0.050
ding+0.049
Neg logit contributions
atted-0.065
lee-0.061
 opinions-0.061
ished-0.061
els-0.059
Token
Token ID198
Feature activation-4.24e-02

Pos logit contributions
fire+0.087
 distance+0.078
storm+0.074
 hill+0.067
field+0.065
Neg logit contributions
atted-0.114
lee-0.111
els-0.111
ished-0.106
ubby-0.106
TokenBob
Token ID18861
Feature activation+9.99e-02

Pos logit contributions
fire+0.075
 distance+0.071
storm+0.062
 hill+0.062
field+0.062
Neg logit contributions
atted-0.081
ished-0.079
els-0.078
ealing-0.078
inding-0.077
Tokeno
Token ID78
Feature activation+7.80e-02

Pos logit contributions
els+0.278
atted+0.275
lee+0.270
ished+0.266
 opinions+0.265
Neg logit contributions
fire-0.117
 distance-0.102
 hill-0.079
ding-0.074
 road-0.074
Token kept
Token ID4030
Feature activation+4.33e-02

Pos logit contributions
els+0.161
atted+0.160
lee+0.155
ished+0.153
 opinions+0.150
Neg logit contributions
fire-0.147
 distance-0.134
 hill-0.117
ding-0.114
 road-0.113
Token playing
Token ID2712
Feature activation-3.69e-03

Pos logit contributions
els+0.092
atted+0.092
lee+0.089
ished+0.088
 opinions+0.086
Neg logit contributions
fire-0.080
 distance-0.072
 hill-0.063
ding-0.061
 road-0.061
Token near
Token ID1474
Feature activation-1.44e-02

Pos logit contributions
fire+0.007
 distance+0.007
ding+0.007
storm+0.007
field+0.007
Neg logit contributions
els-0.006
atted-0.006
ished-0.006
 opinions-0.006
lee-0.006
Token the
Token ID262
Feature activation-5.38e-02

Pos logit contributions
fire+0.030
 distance+0.027
storm+0.026
ding+0.025
tr+0.024
Neg logit contributions
els-0.027
atted-0.026
 opinions-0.025
lee-0.025
ished-0.025
Token can
Token ID460
Feature activation-4.08e-04

Pos logit contributions
els+0.058
atted+0.057
lee+0.056
ished+0.055
 opinions+0.054
Neg logit contributions
fire-0.051
 distance-0.046
 hill-0.040
 road-0.039
ding-0.039
Token have
Token ID423
Feature activation+1.09e-02

Pos logit contributions
fire+0.001
 distance+0.001
ding+0.001
 hill+0.001
storm+0.001
Neg logit contributions
els-0.001
atted-0.001
lee-0.001
 opinions-0.001
ished-0.001
Token the
Token ID262
Feature activation-4.53e-02

Pos logit contributions
els+0.022
atted+0.022
lee+0.022
ished+0.021
 opinions+0.021
Neg logit contributions
fire-0.021
 distance-0.019
 hill-0.017
ding-0.016
 road-0.016
Token steak
Token ID26320
Feature activation-8.65e-02

Pos logit contributions
fire+0.080
 distance+0.070
ding+0.062
whe+0.062
 Zig+0.062
Neg logit contributions
els-0.095
atted-0.094
 opinions-0.093
ished-0.092
ubby-0.090
Token,
Token ID11
Feature activation+9.68e-03

Pos logit contributions
fire+0.171
 distance+0.162
whe+0.140
ding+0.140
 hill+0.139
Neg logit contributions
els-0.159
atted-0.158
ished-0.153
lee-0.152
 opinions-0.150
Token L
Token ID406
Feature activation+1.16e-01

Pos logit contributions
els+0.020
atted+0.020
ished+0.019
lee+0.018
 opinions+0.018
Neg logit contributions
fire-0.018
 distance-0.017
 hill-0.015
 he-0.015
 road-0.015
Tokenula
Token ID4712
Feature activation+1.52e-02

Pos logit contributions
ishment+0.265
 opinions+0.249
ened+0.242
atted+0.242
els+0.235
Neg logit contributions
t-0.155
fire-0.148
ding-0.147
whe-0.141
To-0.135
Token".
Token ID1911
Feature activation-2.07e-02

Pos logit contributions
atted+0.030
els+0.030
ished+0.029
lee+0.029
ubby+0.028
Neg logit contributions
fire-0.029
 distance-0.028
whe-0.023
 hill-0.023
ding-0.023
Token She
Token ID1375
Feature activation-1.96e-03

Pos logit contributions
fire+0.039
 distance+0.035
 hill+0.031
storm+0.030
ding+0.030
Neg logit contributions
atted-0.043
els-0.043
lee-0.042
ished-0.041
 opinions-0.040
Token turned
Token ID2900
Feature activation+5.01e-02

Pos logit contributions
fire+0.004
 distance+0.003
 hill+0.003
 he+0.003
field+0.003
Neg logit contributions
els-0.004
atted-0.004
ished-0.004
lee-0.004
 opinions-0.004
Token around
Token ID1088
Feature activation+1.64e-02

Pos logit contributions
els+0.116
 opinions+0.113
atted+0.112
ishment+0.112
lee+0.109
Neg logit contributions
fire-0.078
 distance-0.075
 he-0.065
 hill-0.060
ding-0.059
Token help
Token ID1037
Feature activation+1.93e-02

Pos logit contributions
fire+0.009
 distance+0.008
ding+0.008
storm+0.008
 hill+0.007
Neg logit contributions
els-0.009
 opinions-0.008
atted-0.008
lee-0.008
ished-0.008
Token him
Token ID683
Feature activation+2.80e-02

Pos logit contributions
els+0.036
atted+0.033
lee+0.032
ished+0.032
 opinions+0.032
Neg logit contributions
fire-0.041
 distance-0.039
storm-0.035
 hill-0.034
field-0.034
Token.
Token ID13
Feature activation-1.72e-02

Pos logit contributions
els+0.064
atted+0.064
ished+0.062
lee+0.061
 opinions+0.060
Neg logit contributions
fire-0.046
 distance-0.042
storm-0.036
ding-0.036
 hill-0.035
Token
Token ID198
Feature activation-3.49e-02

Pos logit contributions
fire+0.028
 distance+0.025
storm+0.021
 hill+0.021
field+0.021
Neg logit contributions
atted-0.040
els-0.039
lee-0.038
 opinions-0.038
ished-0.037
Token
Token ID198
Feature activation-2.57e-02

Pos logit contributions
fire+0.049
 distance+0.043
storm+0.037
 hill+0.036
field+0.035
Neg logit contributions
els-0.087
atted-0.086
lee-0.084
ished-0.082
 opinions-0.081
TokenTom
Token ID13787
Feature activation+9.39e-02

Pos logit contributions
fire+0.061
 distance+0.057
 hill+0.052
storm+0.051
field+0.051
Neg logit contributions
els-0.039
atted-0.039
lee-0.036
ished-0.036
 opinions-0.036
Token ran
Token ID4966
Feature activation+5.76e-02

Pos logit contributions
els+0.212
elling+0.204
ubby+0.202
ishment+0.200
lee+0.200
Neg logit contributions
fire-0.156
 distance-0.138
 he-0.129
 hill-0.123
 ride-0.117
Token to
Token ID284
Feature activation-1.42e-02

Pos logit contributions
els+0.134
atted+0.134
lee+0.130
ished+0.128
 opinions+0.126
Neg logit contributions
fire-0.094
 distance-0.084
 hill-0.072
ding-0.069
storm-0.069
Token the
Token ID262
Feature activation-2.92e-02

Pos logit contributions
fire+0.032
storm+0.027
 distance+0.027
ding+0.026
field+0.026
Neg logit contributions
els-0.024
atted-0.023
lee-0.023
 opinions-0.023
 requests-0.022
Token tree
Token ID5509
Feature activation-6.35e-02

Pos logit contributions
fire+0.054
ding+0.050
|+0.050
whe+0.049
t+0.047
Neg logit contributions
els-0.050
lee-0.047
 opinions-0.047
atted-0.046
ished-0.045
Token and
Token ID290
Feature activation-3.18e-02

Pos logit contributions
fire+0.157
 distance+0.143
storm+0.141
ding+0.141
whe+0.137
Neg logit contributions
atted-0.084
els-0.081
ished-0.076
 opinions-0.074
lee-0.073
Token not
Token ID407
Feature activation+3.31e-04

Pos logit contributions
atted+0.088
els+0.088
 opinions+0.086
lee+0.085
ished+0.083
Neg logit contributions
fire-0.107
 distance-0.099
storm-0.089
ding-0.088
field-0.088
Token like
Token ID588
Feature activation+2.10e-03

Pos logit contributions
 opinions+0.001
lee+0.001
els+0.001
atted+0.000
ished+0.000
Neg logit contributions
fire-0.001
field-0.001
 distance-0.001
ding-0.001
 hill-0.001
Token it
Token ID340
Feature activation-3.44e-02

Pos logit contributions
els+0.004
atted+0.004
lee+0.004
 opinions+0.004
ished+0.004
Neg logit contributions
fire-0.004
 distance-0.004
field-0.003
ding-0.003
 hill-0.003
Token.
Token ID13
Feature activation-3.56e-02

Pos logit contributions
fire+0.070
 distance+0.063
field+0.056
Ar+0.053
ding+0.053
Neg logit contributions
els-0.068
ished-0.066
atted-0.065
 opinions-0.065
lee-0.062
Token He
Token ID679
Feature activation+4.84e-03

Pos logit contributions
fire+0.066
 distance+0.059
storm+0.052
ding+0.050
 hill+0.049
Neg logit contributions
atted-0.075
lee-0.075
els-0.072
ished-0.072
 opinions-0.071
Token didn
Token ID1422
Feature activation+7.10e-02

Pos logit contributions
atted+0.009
els+0.009
lee+0.009
ished+0.008
 opinions+0.008
Neg logit contributions
fire-0.010
 distance-0.010
 hill-0.008
ding-0.008
storm-0.008
Token't
Token ID470
Feature activation+1.17e-02

Pos logit contributions
els+0.158
lee+0.148
atted+0.148
inding+0.147
ubby+0.146
Neg logit contributions
fire-0.122
 distance-0.116
field-0.095
 hill-0.092
boat-0.089
Token know
Token ID760
Feature activation+3.91e-02

Pos logit contributions
els+0.017
 opinions+0.017
lee+0.017
atted+0.016
ished+0.016
Neg logit contributions
fire-0.028
 distance-0.025
field-0.025
ding-0.024
 hill-0.024
Token how
Token ID703
Feature activation-3.03e-02

Pos logit contributions
els+0.049
atted+0.048
lee+0.045
 opinions+0.042
ished+0.042
Neg logit contributions
fire-0.109
 distance-0.102
storm-0.093
ding-0.093
field-0.092
Token to
Token ID284
Feature activation-3.23e-02

Pos logit contributions
fire+0.058
 distance+0.052
field+0.047
storm+0.045
 hill+0.045
Neg logit contributions
atted-0.065
lee-0.064
els-0.060
 opinions-0.059
ished-0.059
Token make
Token ID787
Feature activation-5.32e-04

Pos logit contributions
fire+0.060
 distance+0.053
ding+0.051
field+0.048
tr+0.048
Neg logit contributions
els-0.064
 opinions-0.062
inding-0.058
ant-0.058
lee-0.057
Token ran
Token ID4966
Feature activation+4.75e-02

Pos logit contributions
els+0.146
atted+0.142
 opinions+0.138
lee+0.138
ished+0.137
Neg logit contributions
fire-0.133
 distance-0.121
 hill-0.106
ding-0.103
 road-0.102
Token to
Token ID284
Feature activation-1.67e-02

Pos logit contributions
atted+0.086
els+0.084
lee+0.083
ished+0.082
 opinions+0.079
Neg logit contributions
fire-0.099
 distance-0.088
field-0.081
storm-0.079
 hill-0.079
Token the
Token ID262
Feature activation-3.26e-02

Pos logit contributions
fire+0.035
 distance+0.031
storm+0.029
ding+0.029
 hill+0.027
Neg logit contributions
els-0.031
atted-0.030
lee-0.030
 opinions-0.029
ished-0.028
Token forest
Token ID8222
Feature activation-5.22e-02

Pos logit contributions
fire+0.065
ding+0.055
boat+0.054
|+0.054
wreck+0.051
Neg logit contributions
els-0.058
lee-0.058
 opinions-0.058
elling-0.055
atted-0.054
Token and
Token ID290
Feature activation-3.60e-02

Pos logit contributions
fire+0.120
 distance+0.110
ding+0.106
Ro+0.106
storm+0.106
Neg logit contributions
atted-0.079
lee-0.074
els-0.072
 opinions-0.069
ishment-0.068
Token Jake
Token ID14757
Feature activation+7.23e-02

Pos logit contributions
fire+0.076
 distance+0.070
 hill+0.063
 road+0.063
storm+0.062
Neg logit contributions
atted-0.065
lee-0.063
els-0.063
ished-0.061
 opinions-0.058
Token ran
Token ID4966
Feature activation+3.81e-02

Pos logit contributions
atted+0.138
lee+0.133
els+0.132
ished+0.129
 opinions+0.122
Neg logit contributions
fire-0.148
 distance-0.136
 hill-0.119
storm-0.119
 road-0.119
Token to
Token ID284
Feature activation-2.47e-02

Pos logit contributions
atted+0.068
lee+0.067
ished+0.067
els+0.067
 opinions+0.064
Neg logit contributions
fire-0.077
 distance-0.068
field-0.065
storm-0.064
ding-0.062
Token the
Token ID262
Feature activation-4.02e-02

Pos logit contributions
fire+0.052
storm+0.046
ding+0.046
 distance+0.046
tr+0.044
Neg logit contributions
 opinions-0.042
els-0.042
atted-0.042
lee-0.042
 requests-0.040
Token mountain
Token ID8598
Feature activation-7.91e-02

Pos logit contributions
fire+0.080
ding+0.071
|+0.070
boat+0.068
t+0.066
Neg logit contributions
lee-0.067
 opinions-0.067
els-0.066
elling-0.065
 heads-0.064
Token.
Token ID13
Feature activation-2.18e-02

Pos logit contributions
Ro+0.151
fire+0.151
Snow+0.142
storm+0.142
ding+0.141
Neg logit contributions
atted-0.130
els-0.123
lee-0.120
ishment-0.120
 opinions-0.118
Token pictures
Token ID5986
Feature activation-7.99e-02

Pos logit contributions
fire+0.010
 distance+0.010
 hill+0.009
 road+0.008
 saf+0.008
Neg logit contributions
atted-0.009
els-0.009
ished-0.009
lee-0.008
ubby-0.008
Token.
Token ID13
Feature activation-1.87e-02

Pos logit contributions
fire+0.154
 distance+0.136
field+0.121
ding+0.120
 hill+0.119
Neg logit contributions
els-0.163
atted-0.157
 opinions-0.154
ished-0.150
lee-0.150
Token They
Token ID1119
Feature activation-2.23e-02

Pos logit contributions
fire+0.030
 distance+0.027
 hill+0.023
 road+0.022
ding+0.022
Neg logit contributions
els-0.045
atted-0.045
lee-0.043
ished-0.043
ubby-0.041
Token did
Token ID750
Feature activation+5.06e-02

Pos logit contributions
fire+0.044
 distance+0.042
 hill+0.037
 he+0.036
 road+0.036
Neg logit contributions
els-0.043
atted-0.042
lee-0.041
ished-0.040
elling-0.040
Token not
Token ID407
Feature activation+3.96e-03

Pos logit contributions
atted+0.114
els+0.113
lee+0.111
ished+0.110
ubby+0.106
Neg logit contributions
fire-0.087
 distance-0.078
 hill-0.067
 he-0.065
 road-0.065
Token mean
Token ID1612
Feature activation+7.29e-02

Pos logit contributions
atted+0.007
els+0.007
lee+0.007
ished+0.007
 opinions+0.006
Neg logit contributions
fire-0.009
 distance-0.008
 hill-0.007
 road-0.007
ding-0.007
Token to
Token ID284
Feature activation-3.34e-02

Pos logit contributions
atted+0.191
els+0.191
lee+0.187
ished+0.185
ubby+0.183
Neg logit contributions
fire-0.092
 distance-0.084
 he-0.069
 hill-0.067
 road-0.066
Token make
Token ID787
Feature activation+2.40e-03

Pos logit contributions
fire+0.058
 distance+0.052
 hill+0.044
ding+0.044
field+0.043
Neg logit contributions
els-0.075
atted-0.074
lee-0.072
 opinions-0.071
ished-0.070
Token a
Token ID257
Feature activation-7.13e-02

Pos logit contributions
atted+0.004
els+0.004
lee+0.004
ished+0.004
ubby+0.004
Neg logit contributions
fire-0.005
 distance-0.005
 hill-0.004
 road-0.004
ding-0.004
Token mess
Token ID2085
Feature activation-2.25e-02

Pos logit contributions
fire+0.203
 distance+0.179
whe+0.174
ding+0.173
field+0.172
Neg logit contributions
els-0.077
lee-0.072
 opinions-0.070
atted-0.070
 requests-0.068
Token.
Token ID13
Feature activation-1.60e-02

Pos logit contributions
fire+0.031
 distance+0.027
 hill+0.022
ding+0.022
 road+0.021
Neg logit contributions
atted-0.058
els-0.057
lee-0.056
ished-0.055
 opinions-0.055
Token
Token ID198
Feature activation-4.00e-02

Pos logit contributions
fire+0.081
 distance+0.072
storm+0.069
 hill+0.061
ding+0.061
Neg logit contributions
lee-0.113
atted-0.112
els-0.110
 opinions-0.105
ubby-0.105
TokenThe
Token ID464
Feature activation+3.98e-03

Pos logit contributions
fire+0.063
 distance+0.057
 hill+0.049
storm+0.049
ding+0.049
Neg logit contributions
els-0.091
atted-0.091
lee-0.089
ished-0.087
 opinions-0.087
Token other
Token ID584
Feature activation+2.67e-02

Pos logit contributions
atted+0.008
els+0.008
lee+0.008
ished+0.008
 opinions+0.008
Neg logit contributions
fire-0.007
 distance-0.007
 hill-0.006
 road-0.006
storm-0.006
Token animals
Token ID4695
Feature activation-4.25e-02

Pos logit contributions
atted+0.059
els+0.059
lee+0.058
ished+0.057
ishment+0.055
Neg logit contributions
fire-0.046
 distance-0.043
 hill-0.037
 road-0.036
 saf-0.035
Token helped
Token ID4193
Feature activation+3.53e-02

Pos logit contributions
fire+0.084
 distance+0.078
 road+0.071
ding+0.071
storm+0.070
Neg logit contributions
atted-0.083
lee-0.078
els-0.078
ished-0.076
 opinions-0.072
Token look
Token ID804
Feature activation+2.96e-02

Pos logit contributions
els+0.059
atted+0.057
lee+0.056
 opinions+0.054
ished+0.053
Neg logit contributions
fire-0.080
 distance-0.075
 hill-0.068
ding-0.067
storm-0.067
Token after
Token ID706
Feature activation-1.27e-02

Pos logit contributions
els+0.055
atted+0.054
lee+0.052
ished+0.051
 opinions+0.050
Neg logit contributions
fire-0.062
 distance-0.058
 hill-0.051
ding-0.050
 road-0.050
Token the
Token ID262
Feature activation-2.89e-02

Pos logit contributions
fire+0.028
 distance+0.026
 hill+0.023
ding+0.023
storm+0.023
Neg logit contributions
els-0.023
atted-0.022
lee-0.022
 opinions-0.021
ished-0.020
Token sick
Token ID6639
Feature activation+1.39e-02

Pos logit contributions
fire+0.056
 distance+0.051
 hill+0.045
 road+0.044
ding+0.043
Neg logit contributions
els-0.059
atted-0.059
lee-0.057
ished-0.056
 opinions-0.055
Token sheep
Token ID15900
Feature activation-4.55e-02

Pos logit contributions
els+0.027
atted+0.027
lee+0.026
 opinions+0.025
ished+0.025
Neg logit contributions
fire-0.027
 distance-0.024
ding-0.022
|-0.022
whe-0.022
Token.
Token ID13
Feature activation-2.30e-02

Pos logit contributions
fire+0.092
 distance+0.087
Ro+0.081
ding+0.080
whe+0.078
Neg logit contributions
atted-0.078
els-0.077
lee-0.076
ished-0.074
 opinions-0.074
Token He
Token ID679
Feature activation-2.39e-04

Pos logit contributions
fire+0.025
 distance+0.023
 he+0.019
 hill+0.018
 road+0.018
Neg logit contributions
els-0.057
atted-0.056
lee-0.054
ished-0.054
 opinions-0.053
Token followed
Token ID3940
Feature activation+3.90e-02

Pos logit contributions
fire+0.001
 distance+0.000
 hill+0.000
ding+0.000
storm+0.000
Neg logit contributions
atted-0.000
els-0.000
lee-0.000
ished-0.000
 opinions-0.000
Token them
Token ID606
Feature activation-1.68e-02

Pos logit contributions
atted+0.069
els+0.068
ished+0.065
lee+0.065
 opinions+0.063
Neg logit contributions
fire-0.084
 distance-0.079
 hill-0.069
 he-0.069
 road-0.069
Token to
Token ID284
Feature activation-2.75e-02

Pos logit contributions
fire+0.039
 distance+0.034
storm+0.033
field+0.032
ding+0.031
Neg logit contributions
els-0.028
atted-0.027
ished-0.026
 opinions-0.025
lee-0.025
Token the
Token ID262
Feature activation-3.83e-02

Pos logit contributions
fire+0.063
ding+0.056
storm+0.056
 distance+0.054
field+0.053
Neg logit contributions
 opinions-0.044
els-0.043
lee-0.042
atted-0.041
 requests-0.040
Token merry
Token ID44570
Feature activation+1.93e-02

Pos logit contributions
fire+0.071
whe+0.059
|+0.058
ding+0.057
tr+0.056
Neg logit contributions
 opinions-0.070
els-0.067
ealing-0.066
ubby-0.066
atted-0.066
Token-
Token ID12
Feature activation-1.99e-02

Pos logit contributions
atted+0.032
els+0.032
lee+0.030
ished+0.029
 opinions+0.029
Neg logit contributions
fire-0.045
 distance-0.041
 hill-0.037
ding-0.037
storm-0.036
Tokengo
Token ID2188
Feature activation+8.75e-03

Pos logit contributions
 distance+0.039
fire+0.039
field+0.038
 hill+0.038
 trail+0.038
Neg logit contributions
atted-0.034
ished-0.032
els-0.031
elling-0.031
eal-0.030
Token-
Token ID12
Feature activation-3.03e-02

Pos logit contributions
atted+0.010
els+0.010
ished+0.008
ubby+0.008
 opinions+0.008
Neg logit contributions
fire-0.025
 distance-0.023
storm-0.022
jar-0.022
field-0.022
Tokenround
Token ID744
Feature activation-8.37e-02

Pos logit contributions
fire+0.052
 distance+0.049
 cave+0.046
field+0.046
 trail+0.045
Neg logit contributions
atted-0.059
els-0.058
elling-0.055
ished-0.055
 opinions-0.053
Token and
Token ID290
Feature activation-4.67e-02

Pos logit contributions
fire+0.196
 distance+0.175
Ro+0.173
Ar+0.171
storm+0.165
Neg logit contributions
els-0.124
atted-0.121
ished-0.109
ishment-0.108
 opinions-0.108
Token Doug
Token ID15115
Feature activation+5.93e-02

Pos logit contributions
els+0.057
atted+0.056
lee+0.055
 opinions+0.054
ished+0.054
Neg logit contributions
fire-0.028
 distance-0.026
field-0.020
 hill-0.019
ding-0.019
Tokenie
Token ID494
Feature activation+4.02e-02

Pos logit contributions
atted+0.123
els+0.122
lee+0.119
ished+0.117
 opinions+0.115
Neg logit contributions
fire-0.111
 distance-0.101
 hill-0.088
ding-0.087
 road-0.086
Token went
Token ID1816
Feature activation+2.83e-02

Pos logit contributions
atted+0.082
els+0.082
lee+0.079
ished+0.078
 opinions+0.076
Neg logit contributions
fire-0.077
 distance-0.069
ding-0.060
 hill-0.059
whe-0.059
Token to
Token ID284
Feature activation-2.71e-02

Pos logit contributions
atted+0.054
els+0.054
lee+0.053
ished+0.051
 opinions+0.048
Neg logit contributions
fire-0.057
 distance-0.048
storm-0.048
ding-0.045
field-0.044
Token the
Token ID262
Feature activation-2.86e-02

Pos logit contributions
fire+0.060
ding+0.053
storm+0.051
 distance+0.050
tr+0.048
Neg logit contributions
 opinions-0.047
els-0.045
 requests-0.045
lee-0.045
ishment-0.041
Token forest
Token ID8222
Feature activation-5.52e-02

Pos logit contributions
fire+0.052
ding+0.046
|+0.045
t+0.043
whe+0.043
Neg logit contributions
 opinions-0.053
els-0.052
ishment-0.050
lee-0.049
 requests-0.049
Token,
Token ID11
Feature activation-3.58e-04

Pos logit contributions
fire+0.115
 distance+0.105
ding+0.104
Ro+0.103
storm+0.103
Neg logit contributions
atted-0.088
els-0.084
 opinions-0.083
lee-0.083
ished-0.081
Token he
Token ID339
Feature activation+1.53e-02

Pos logit contributions
fire+0.001
 distance+0.001
storm+0.000
ding+0.000
 hill+0.000
Neg logit contributions
atted-0.001
els-0.001
lee-0.001
ished-0.001
 opinions-0.001
Token would
Token ID561
Feature activation+7.04e-03

Pos logit contributions
atted+0.032
els+0.031
lee+0.030
ished+0.030
 opinions+0.029
Neg logit contributions
fire-0.029
 distance-0.026
storm-0.022
ding-0.022
whe-0.022
Token always
Token ID1464
Feature activation+1.87e-02

Pos logit contributions
els+0.014
atted+0.014
lee+0.014
 opinions+0.014
ished+0.013
Neg logit contributions
fire-0.013
 distance-0.012
ding-0.011
storm-0.010
whe-0.010
Token try
Token ID1949
Feature activation+2.09e-02

Pos logit contributions
els+0.036
atted+0.036
lee+0.035
 opinions+0.034
ished+0.034
Neg logit contributions
fire-0.037
 distance-0.034
ding-0.030
 hill-0.030
storm-0.029
Token came
Token ID1625
Feature activation+4.79e-02

Pos logit contributions
fire+0.000
 distance+0.000
 hill+0.000
storm+0.000
 road+0.000
Neg logit contributions
atted-0.000
els-0.000
lee-0.000
ished-0.000
 opinions-0.000
Token out
Token ID503
Feature activation-1.69e-02

Pos logit contributions
els+0.097
atted+0.097
lee+0.094
ished+0.092
 opinions+0.091
Neg logit contributions
fire-0.093
 distance-0.084
 hill-0.074
ding-0.072
storm-0.072
Token of
Token ID286
Feature activation-6.49e-03

Pos logit contributions
fire+0.032
 distance+0.028
 hill+0.025
storm+0.025
ding+0.025
Neg logit contributions
atted-0.036
els-0.035
lee-0.035
ished-0.033
 opinions-0.033
Token the
Token ID262
Feature activation-4.02e-02

Pos logit contributions
fire+0.013
 distance+0.012
 hill+0.011
ding+0.010
 road+0.010
Neg logit contributions
els-0.013
atted-0.013
lee-0.012
ished-0.012
 opinions-0.012
Token exam
Token ID2814
Feature activation-6.98e-02

Pos logit contributions
fire+0.090
 distance+0.075
 Zig+0.072
ding+0.071
storm+0.070
Neg logit contributions
els-0.070
lee-0.070
atted-0.068
ished-0.066
 opinions-0.066
Token that
Token ID326
Feature activation-4.14e-02

Pos logit contributions
fire+0.133
 distance+0.122
 Zig+0.106
|+0.105
Ro+0.103
Neg logit contributions
els-0.137
atted-0.133
lee-0.131
 opinions-0.126
ished-0.124
Token she
Token ID673
Feature activation-7.73e-03

Pos logit contributions
fire+0.077
 distance+0.063
ding+0.062
 road+0.059
 Zig+0.059
Neg logit contributions
els-0.081
atted-0.081
 opinions-0.081
lee-0.080
ened-0.077
Token started
Token ID2067
Feature activation+1.09e-02

Pos logit contributions
fire+0.015
 distance+0.014
 hill+0.012
 road+0.012
ding+0.011
Neg logit contributions
atted-0.016
els-0.016
lee-0.015
ished-0.015
 opinions-0.015
Token to
Token ID284
Feature activation-4.04e-02

Pos logit contributions
atted+0.024
els+0.024
lee+0.023
 opinions+0.023
ished+0.022
Neg logit contributions
fire-0.020
 distance-0.017
 hill-0.015
storm-0.015
ding-0.015
Token applaud
Token ID22517
Feature activation+3.77e-02

Pos logit contributions
fire+0.082
 distance+0.072
ding+0.065
storm+0.064
field+0.063
Neg logit contributions
 opinions-0.076
lee-0.075
els-0.075
atted-0.074
 requests-0.071
Token.
Token ID13
Feature activation-2.64e-02

Pos logit contributions
els+0.080
atted+0.078
lee+0.077
ished+0.075
 opinions+0.074
Neg logit contributions
fire-0.071
 distance-0.066
 hill-0.057
field-0.055
 road-0.055
Token laughed
Token ID13818
Feature activation+9.02e-02

Pos logit contributions
fire+0.043
 distance+0.038
storm+0.037
ding+0.036
whe+0.036
Neg logit contributions
atted-0.030
els-0.030
lee-0.029
ished-0.029
 opinions-0.028
Token and
Token ID290
Feature activation-2.26e-02

Pos logit contributions
atted+0.132
els+0.130
lee+0.126
ished+0.123
 opinions+0.120
Neg logit contributions
fire-0.225
 distance-0.210
 hill-0.190
field-0.188
storm-0.186
Token cheered
Token ID34767
Feature activation+7.94e-02

Pos logit contributions
fire+0.051
 distance+0.044
storm+0.043
 hill+0.042
ding+0.041
Neg logit contributions
atted-0.039
lee-0.038
els-0.035
ished-0.035
 opinions-0.035
Token as
Token ID355
Feature activation-4.29e-02

Pos logit contributions
els+0.174
atted+0.173
lee+0.168
ished+0.165
 opinions+0.163
Neg logit contributions
fire-0.140
 distance-0.129
 hill-0.110
ding-0.107
 road-0.106
Token the
Token ID262
Feature activation-1.51e-02

Pos logit contributions
fire+0.075
 distance+0.071
storm+0.062
ding+0.060
field+0.060
Neg logit contributions
atted-0.093
lee-0.092
els-0.091
 opinions-0.088
ubby-0.084
Token boat
Token ID8848
Feature activation-4.07e-02

Pos logit contributions
fire+0.025
 distance+0.022
whe+0.021
ding+0.021
|+0.020
Neg logit contributions
els-0.033
atted-0.033
lee-0.032
 opinions-0.032
ubby-0.032
Token moved
Token ID3888
Feature activation+2.38e-02

Pos logit contributions
fire+0.072
 distance+0.069
|+0.060
field+0.060
 hill+0.060
Neg logit contributions
atted-0.083
els-0.080
ished-0.079
lee-0.075
 opinions-0.074
Token.
Token ID13
Feature activation-1.33e-02

Pos logit contributions
els+0.052
atted+0.052
ished+0.050
lee+0.049
 opinions+0.048
Neg logit contributions
fire-0.041
 distance-0.037
field-0.034
storm-0.034
 hill-0.033
Token They
Token ID1119
Feature activation-1.85e-02

Pos logit contributions
fire+0.025
 distance+0.023
 hill+0.020
storm+0.019
ding+0.019
Neg logit contributions
atted-0.027
els-0.027
lee-0.027
 opinions-0.026
ished-0.026
Token did
Token ID750
Feature activation+4.99e-02

Pos logit contributions
fire+0.043
 distance+0.038
storm+0.036
ding+0.035
 hill+0.035
Neg logit contributions
atted-0.029
lee-0.028
els-0.028
ished-0.028
 opinions-0.027
Token not
Token ID407
Feature activation+5.33e-03

Pos logit contributions
atted+0.091
els+0.090
 opinions+0.088
lee+0.087
ished+0.084
Neg logit contributions
fire-0.106
 distance-0.098
storm-0.087
 hill-0.086
field-0.085
Token lady
Token ID10846
Feature activation+9.53e-03

Pos logit contributions
fire+0.008
 distance+0.007
 hill+0.006
 road+0.006
storm+0.006
Neg logit contributions
els-0.007
atted-0.006
lee-0.006
ished-0.006
 opinions-0.006
Token was
Token ID373
Feature activation+4.57e-02

Pos logit contributions
atted+0.018
els+0.018
lee+0.018
ished+0.017
 opinions+0.017
Neg logit contributions
fire-0.019
 distance-0.018
 hill-0.016
 road-0.015
ding-0.015
Token happy
Token ID3772
Feature activation+8.09e-02

Pos logit contributions
els+0.105
atted+0.103
lee+0.101
ished+0.100
 opinions+0.098
Neg logit contributions
fire-0.076
 distance-0.070
 hill-0.058
whe-0.057
ding-0.057
Token too
Token ID1165
Feature activation-4.14e-02

Pos logit contributions
atted+0.153
els+0.148
lee+0.148
ished+0.141
 opinions+0.140
Neg logit contributions
fire-0.170
 distance-0.155
ding-0.136
 road-0.136
 hill-0.135
Token,
Token ID11
Feature activation-8.59e-03

Pos logit contributions
fire+0.083
 distance+0.077
 hill+0.068
whe+0.067
boat+0.067
Neg logit contributions
lee-0.079
atted-0.077
els-0.076
ubby-0.073
ished-0.073
Token and
Token ID290
Feature activation-3.48e-02

Pos logit contributions
fire+0.021
 distance+0.018
ding+0.017
storm+0.017
 hill+0.017
Neg logit contributions
atted-0.014
lee-0.013
els-0.013
 opinions-0.012
ished-0.012
Token thanked
Token ID26280
Feature activation+2.62e-02

Pos logit contributions
fire+0.055
 distance+0.050
 hill+0.042
ding+0.041
 road+0.041
Neg logit contributions
atted-0.083
lee-0.081
els-0.080
ished-0.078
 opinions-0.076
Token the
Token ID262
Feature activation-2.23e-02

Pos logit contributions
els+0.048
lee+0.046
atted+0.046
 opinions+0.044
ished+0.042
Neg logit contributions
fire-0.057
 distance-0.053
ding-0.047
 hill-0.047
storm-0.046
Token little
Token ID1310
Feature activation+4.43e-02

Pos logit contributions
fire+0.042
 distance+0.039
 hill+0.035
 road+0.034
storm+0.032
Neg logit contributions
els-0.047
atted-0.046
ished-0.045
lee-0.044
ishment-0.043
Token girl
Token ID2576
Feature activation+5.95e-03

Pos logit contributions
els+0.102
atted+0.102
lee+0.098
inding+0.098
ished+0.097
Neg logit contributions
 distance-0.070
fire-0.068
 hill-0.060
 road-0.056
 he-0.053
Token for
Token ID329
Feature activation-3.08e-03

Pos logit contributions
atted+0.011
lee+0.011
els+0.011
ished+0.011
 opinions+0.010
Neg logit contributions
fire-0.012
 distance-0.011
storm-0.011
ding-0.010
 road-0.010
Token.
Token ID13
Feature activation-1.97e-02

Pos logit contributions
fire+0.108
 distance+0.093
storm+0.086
field+0.083
whe+0.081
Neg logit contributions
els-0.130
atted-0.126
ubby-0.124
lee-0.123
ished-0.122
Token It
Token ID632
Feature activation-3.56e-02

Pos logit contributions
fire+0.030
 distance+0.027
 he+0.023
 hill+0.023
 road+0.022
Neg logit contributions
els-0.048
atted-0.047
ished-0.046
lee-0.045
 opinions-0.045
Token jumped
Token ID11687
Feature activation+6.49e-02

Pos logit contributions
fire+0.072
 distance+0.064
ding+0.057
 hill+0.057
field+0.056
Neg logit contributions
atted-0.069
els-0.067
ished-0.066
lee-0.065
 opinions-0.062
Token in
Token ID287
Feature activation-2.44e-02

Pos logit contributions
els+0.130
atted+0.129
lee+0.125
ished+0.124
 opinions+0.120
Neg logit contributions
fire-0.128
 distance-0.117
 hill-0.102
ding-0.100
 road-0.099
Token and
Token ID290
Feature activation-4.27e-02

Pos logit contributions
fire+0.044
 distance+0.041
 hill+0.035
 road+0.034
ding+0.034
Neg logit contributions
els-0.052
atted-0.052
lee-0.051
ished-0.050
 opinions-0.049
Token out
Token ID503
Feature activation-3.39e-02

Pos logit contributions
fire+0.089
 distance+0.081
 hill+0.072
field+0.072
 road+0.072
Neg logit contributions
atted-0.080
ished-0.077
lee-0.075
els-0.074
 opinions-0.068
Token of
Token ID286
Feature activation-1.91e-02

Pos logit contributions
fire+0.066
 distance+0.060
field+0.055
 hill+0.052
ding+0.052
Neg logit contributions
atted-0.072
els-0.071
lee-0.068
ished-0.066
ubby-0.066
Token the
Token ID262
Feature activation-4.59e-02

Pos logit contributions
fire+0.034
 distance+0.031
 hill+0.027
 road+0.026
ding+0.026
Neg logit contributions
els-0.041
atted-0.041
lee-0.040
ished-0.040
 opinions-0.039
Token leaves
Token ID5667
Feature activation-9.32e-02

Pos logit contributions
fire+0.100
 distance+0.084
ding+0.082
storm+0.081
field+0.078
Neg logit contributions
els-0.084
atted-0.081
ished-0.080
lee-0.080
ubby-0.078
Token.
Token ID13
Feature activation-1.64e-02

Pos logit contributions
fire+0.177
 distance+0.159
|+0.146
 saf+0.143
field+0.142
Neg logit contributions
atted-0.184
els-0.178
ished-0.175
lee-0.168
ened-0.160
Token It
Token ID632
Feature activation-3.08e-02

Pos logit contributions
fire+0.031
 distance+0.028
 hill+0.025
ding+0.024
 road+0.024
Neg logit contributions
els-0.034
atted-0.034
lee-0.033
ished-0.032
 opinions-0.032
Token and
Token ID290
Feature activation-5.17e-02

Pos logit contributions
els+0.085
atted+0.084
lee+0.082
ished+0.081
 opinions+0.080
Neg logit contributions
fire-0.085
 distance-0.074
storm-0.071
ding-0.069
 hill-0.068
Token grew
Token ID6348
Feature activation+1.83e-02

Pos logit contributions
fire+0.102
 distance+0.091
ding+0.082
storm+0.081
whe+0.079
Neg logit contributions
lee-0.102
atted-0.101
els-0.099
 opinions-0.098
ished-0.091
Token tall
Token ID7331
Feature activation+1.16e-02

Pos logit contributions
atted+0.040
els+0.040
lee+0.039
ished+0.038
 opinions+0.037
Neg logit contributions
fire-0.032
 distance-0.028
storm-0.026
 hill-0.026
 road-0.025
Token in
Token ID287
Feature activation-6.11e-02

Pos logit contributions
els+0.024
lee+0.024
atted+0.024
ished+0.023
 opinions+0.022
Neg logit contributions
fire-0.021
 distance-0.019
ding-0.017
storm-0.017
boat-0.016
Token the
Token ID262
Feature activation-6.98e-02

Pos logit contributions
fire+0.128
 distance+0.109
ding+0.106
storm+0.106
whe+0.102
Neg logit contributions
els-0.118
lee-0.114
 opinions-0.110
atted-0.110
ished-0.105
Token garden
Token ID11376
Feature activation-8.98e-02

Pos logit contributions
fire+0.133
 distance+0.111
ding+0.107
whe+0.107
 Zig+0.104
Neg logit contributions
lee-0.139
els-0.138
atted-0.136
elling-0.132
 opinions-0.131
Token.
Token ID13
Feature activation-3.74e-02

Pos logit contributions
fire+0.183
 distance+0.173
storm+0.171
|+0.164
ding+0.163
Neg logit contributions
atted-0.154
els-0.150
lee-0.147
 opinions-0.144
ished-0.139
Token Every
Token ID3887
Feature activation+9.29e-03

Pos logit contributions
fire+0.089
 distance+0.077
ding+0.070
storm+0.070
boat+0.068
Neg logit contributions
lee-0.060
 opinions-0.058
els-0.058
atted-0.057
ishment-0.054
Token day
Token ID1110
Feature activation+2.55e-03

Pos logit contributions
atted+0.020
ubby+0.019
 opinions+0.019
lee+0.019
els+0.018
Neg logit contributions
fire-0.016
whe-0.013
 distance-0.013
ding-0.012
 Zig-0.012
Token the
Token ID262
Feature activation-1.16e-02

Pos logit contributions
atted+0.003
els+0.003
lee+0.003
ished+0.003
ubby+0.003
Neg logit contributions
fire-0.007
 distance-0.006
field-0.006
storm-0.005
 hill-0.005
Token flower
Token ID15061
Feature activation-4.71e-02

Pos logit contributions
fire+0.017
 distance+0.017
 hill+0.014
 road+0.013
 mountain+0.013
Neg logit contributions
atted-0.030
els-0.029
ished-0.029
lee-0.028
ishment-0.027
Token"
Token ID1
Feature activation+2.79e-02

Pos logit contributions
fire+0.079
 distance+0.072
 hill+0.064
To+0.062
 road+0.061
Neg logit contributions
atted-0.074
ished-0.072
els-0.072
lee-0.068
ishment-0.067
TokenIt
Token ID1026
Feature activation-2.40e-02

Pos logit contributions
els+0.060
atted+0.057
 opinions+0.057
lee+0.057
ished+0.055
Neg logit contributions
fire-0.049
 distance-0.045
 hill-0.038
 saf-0.038
 road-0.037
Token must
Token ID1276
Feature activation-1.23e-02

Pos logit contributions
fire+0.051
 distance+0.047
 hill+0.042
 road+0.041
storm+0.041
Neg logit contributions
els-0.043
atted-0.043
lee-0.041
ished-0.040
 opinions-0.040
Token belong
Token ID5594
Feature activation+7.08e-03

Pos logit contributions
fire+0.024
 distance+0.022
ding+0.019
 hill+0.019
storm+0.018
Neg logit contributions
els-0.025
atted-0.024
lee-0.024
ished-0.023
 opinions-0.023
Token to
Token ID284
Feature activation-4.96e-02

Pos logit contributions
atted+0.015
els+0.015
ished+0.015
 opinions+0.015
lee+0.015
Neg logit contributions
fire-0.012
 distance-0.011
 hill-0.010
ding-0.009
storm-0.009
Token something
Token ID1223
Feature activation-8.18e-02

Pos logit contributions
fire+0.111
 distance+0.105
ding+0.095
 hill+0.093
whe+0.092
Neg logit contributions
els-0.085
lee-0.080
atted-0.079
 opinions-0.079
elling-0.075
Token very
Token ID845
Feature activation-8.64e-03

Pos logit contributions
fire+0.153
 distance+0.143
ding+0.124
storm+0.122
field+0.121
Neg logit contributions
atted-0.161
els-0.161
lee-0.158
ished-0.154
 opinions-0.153
Token big
Token ID1263
Feature activation+1.11e-02

Pos logit contributions
fire+0.018
 distance+0.017
ding+0.015
storm+0.015
 hill+0.015
Neg logit contributions
els-0.016
atted-0.016
lee-0.015
ished-0.015
 opinions-0.015
Token,"
Token ID553
Feature activation-1.32e-02

Pos logit contributions
atted+0.021
els+0.021
lee+0.021
ished+0.020
 opinions+0.020
Neg logit contributions
fire-0.023
 distance-0.021
 hill-0.018
 road-0.018
ding-0.018
Token Mama
Token ID40084
Feature activation+6.02e-02

Pos logit contributions
fire+0.028
 distance+0.026
 hill+0.023
 road+0.022
ding+0.022
Neg logit contributions
atted-0.024
els-0.024
lee-0.023
ished-0.023
 opinions-0.022
Token said
Token ID531
Feature activation+5.36e-02

Pos logit contributions
els+0.112
atted+0.111
ished+0.107
lee+0.106
 opinions+0.105
Neg logit contributions
fire-0.123
 distance-0.111
 he-0.100
 hill-0.098
storm-0.097
TokenDo
Token ID5211
Feature activation+7.68e-02

Pos logit contributions
els+0.046
lee+0.045
atted+0.044
 opinions+0.043
ished+0.042
Neg logit contributions
fire-0.046
 distance-0.041
 hill-0.037
 road-0.036
storm-0.036
Token you
Token ID345
Feature activation+1.92e-02

Pos logit contributions
atted+0.178
els+0.178
ished+0.174
lee+0.167
ishment+0.163
Neg logit contributions
fire-0.124
 distance-0.113
 hill-0.100
 he-0.099
 saf-0.097
Token see
Token ID766
Feature activation+2.67e-02

Pos logit contributions
atted+0.040
els+0.040
ished+0.038
lee+0.037
ishment+0.036
Neg logit contributions
fire-0.037
 distance-0.035
 hill-0.030
 ride-0.030
 road-0.029
Token the
Token ID262
Feature activation-4.55e-02

Pos logit contributions
atted+0.055
els+0.055
lee+0.053
ished+0.053
 opinions+0.051
Neg logit contributions
fire-0.050
 distance-0.046
 hill-0.040
ding-0.039
 road-0.039
Token tiny
Token ID7009
Feature activation+7.48e-03

Pos logit contributions
fire+0.079
 distance+0.065
whe+0.062
ding+0.060
|+0.060
Neg logit contributions
els-0.098
 opinions-0.095
ubby-0.093
lee-0.092
elling-0.092
Token fish
Token ID5916
Feature activation-7.38e-02

Pos logit contributions
els+0.017
atted+0.017
lee+0.016
ished+0.016
 opinions+0.016
Neg logit contributions
fire-0.012
 distance-0.011
ding-0.009
whe-0.009
storm-0.009
Token leaping
Token ID45583
Feature activation+1.92e-02

Pos logit contributions
fire+0.157
 distance+0.145
whe+0.130
ding+0.127
 road+0.127
Neg logit contributions
els-0.130
atted-0.129
lee-0.122
ubby-0.122
ished-0.121
Token out
Token ID503
Feature activation-6.18e-02

Pos logit contributions
els+0.039
atted+0.038
lee+0.037
ished+0.037
ubby+0.036
Neg logit contributions
fire-0.037
 distance-0.034
 hill-0.029
whe-0.029
storm-0.029
Token of
Token ID286
Feature activation-3.81e-02

Pos logit contributions
fire+0.116
 distance+0.104
storm+0.091
whe+0.091
ding+0.091
Neg logit contributions
els-0.129
atted-0.127
lee-0.124
ished-0.121
ubby-0.119
Token the
Token ID262
Feature activation-5.83e-02

Pos logit contributions
fire+0.071
 distance+0.064
ding+0.055
 hill+0.055
storm+0.054
Neg logit contributions
els-0.080
atted-0.079
lee-0.077
ished-0.076
 opinions-0.075
Token lake
Token ID13546
Feature activation-8.83e-02

Pos logit contributions
fire+0.099
whe+0.082
 distance+0.079
ding+0.076
 Zig+0.075
Neg logit contributions
els-0.125
ubby-0.121
elling-0.121
lee-0.119
ished-0.118
Token,
Token ID11
Feature activation+1.92e-04

Pos logit contributions
els+0.033
atted+0.033
ished+0.032
lee+0.032
ubby+0.031
Neg logit contributions
fire-0.031
 distance-0.028
 hill-0.025
 he-0.025
 road-0.024
Token every
Token ID790
Feature activation+1.24e-02

Pos logit contributions
els+0.000
atted+0.000
 opinions+0.000
lee+0.000
ished+0.000
Neg logit contributions
fire-0.000
 distance-0.000
ding-0.000
storm-0.000
 hill-0.000
Token color
Token ID3124
Feature activation-2.44e-02

Pos logit contributions
atted+0.029
els+0.028
lee+0.027
 opinions+0.027
ished+0.027
Neg logit contributions
fire-0.020
 distance-0.017
ding-0.016
whe-0.015
 hill-0.015
Token can
Token ID460
Feature activation-3.00e-02

Pos logit contributions
fire+0.045
 distance+0.041
 hill+0.035
 road+0.034
ding+0.034
Neg logit contributions
els-0.051
atted-0.051
lee-0.050
ished-0.049
 opinions-0.048
Token make
Token ID787
Feature activation-4.74e-03

Pos logit contributions
fire+0.056
 distance+0.051
ding+0.047
 saf+0.044
storm+0.044
Neg logit contributions
els-0.060
lee-0.058
atted-0.057
 opinions-0.057
ubby-0.054
Token a
Token ID257
Feature activation-7.86e-02

Pos logit contributions
fire+0.009
 distance+0.009
 hill+0.008
ding+0.008
storm+0.008
Neg logit contributions
els-0.009
atted-0.009
lee-0.009
 opinions-0.009
ished-0.009
Token huge
Token ID3236
Feature activation-1.40e-03

Pos logit contributions
fire+0.171
ding+0.159
whe+0.155
rob+0.155
 Zig+0.149
Neg logit contributions
lee-0.123
 opinions-0.119
ishment-0.116
atted-0.115
els-0.114
Token difference
Token ID3580
Feature activation-3.00e-02

Pos logit contributions
fire+0.003
 distance+0.002
ding+0.002
whe+0.002
storm+0.002
Neg logit contributions
els-0.003
atted-0.003
lee-0.003
 opinions-0.003
ished-0.003
Token!
Token ID0
Feature activation-3.92e-02

Pos logit contributions
fire+0.052
 distance+0.048
ding+0.042
 hill+0.041
 road+0.041
Neg logit contributions
els-0.065
atted-0.063
lee-0.062
ished-0.060
 opinions-0.060
Token<|endoftext|>
Token ID50256
Feature activation+2.42e-02

Pos logit contributions
fire+0.071
 distance+0.064
storm+0.060
 trail+0.059
ding+0.059
Neg logit contributions
 opinions-0.077
atted-0.076
els-0.075
lee-0.075
inding-0.074
TokenJimmy
Token ID40335
Feature activation+1.13e-01

Pos logit contributions
els+0.043
lee+0.042
atted+0.042
ubby+0.041
ished+0.041
Neg logit contributions
fire-0.048
 distance-0.042
storm-0.040
 road-0.037
 trail-0.037
Token and
Token ID290
Feature activation-1.89e-02

Pos logit contributions
els+0.196
atted+0.189
ished+0.179
 opinions+0.178
lee+0.177
Neg logit contributions
fire-0.180
 distance-0.174
 he-0.157
 hill-0.149
ding-0.147
Token started
Token ID2067
Feature activation+2.21e-02

Pos logit contributions
fire+0.038
 distance+0.036
 hill+0.031
 he+0.031
ding+0.031
Neg logit contributions
els-0.037
atted-0.036
ished-0.034
lee-0.034
ubby-0.034
Token to
Token ID284
Feature activation-3.14e-02

Pos logit contributions
els+0.052
atted+0.052
lee+0.051
ished+0.050
 opinions+0.049
Neg logit contributions
fire-0.035
 distance-0.032
 hill-0.027
ding-0.026
 road-0.026
Token draw
Token ID3197
Feature activation+4.46e-03

Pos logit contributions
fire+0.053
 distance+0.049
 hill+0.043
 road+0.042
whe+0.040
Neg logit contributions
atted-0.070
els-0.069
ished-0.068
lee-0.067
ubby-0.065
Token a
Token ID257
Feature activation-7.91e-02

Pos logit contributions
els+0.009
atted+0.009
lee+0.009
ished+0.009
 opinions+0.009
Neg logit contributions
fire-0.009
 distance-0.008
field-0.007
 hill-0.007
storm-0.007
Token picture
Token ID4286
Feature activation-7.41e-02

Pos logit contributions
fire+0.177
 distance+0.148
|+0.144
ding+0.143
storm+0.142
Neg logit contributions
lee-0.134
els-0.126
 opinions-0.121
 requests-0.117
atted-0.116
Token of
Token ID286
Feature activation-4.86e-02

Pos logit contributions
fire+0.162
 distance+0.154
field+0.143
|+0.141
asis+0.134
Neg logit contributions
els-0.128
atted-0.120
ished-0.119
lee-0.113
 opinions-0.112
Token a
Token ID257
Feature activation-7.86e-02

Pos logit contributions
fire+0.108
 distance+0.098
ding+0.088
storm+0.086
 hill+0.085
Neg logit contributions
els-0.088
 opinions-0.084
atted-0.083
lee-0.083
ishment-0.078
Token big
Token ID1263
Feature activation-6.63e-03

Pos logit contributions
fire+0.165
|+0.145
 distance+0.140
whe+0.138
 Zig+0.136
Neg logit contributions
lee-0.126
ubby-0.113
els-0.112
 daughter-0.110
 opinions-0.107
Token,
Token ID11
Feature activation-2.75e-02

Pos logit contributions
fire+0.013
 distance+0.011
ding+0.010
whe+0.010
field+0.010
Neg logit contributions
els-0.014
atted-0.013
lee-0.013
 opinions-0.013
ished-0.013
Token bright
Token ID6016
Feature activation+2.03e-03

Pos logit contributions
fire+0.056
 distance+0.051
 hill+0.045
ding+0.044
field+0.044
Neg logit contributions
els-0.052
atted-0.051
lee-0.051
ished-0.049
 opinions-0.049