TOP ACTIVATIONS
MAX = 0.152

. The gr umpy bear was so mad
he wanted to keep the oct opus anyway . " I
home . The oct opus sw am and sw
ocean . The oct opus was curious and excited
She said to Mr . Sp ud , " I want
The smart oct opus was very happy with
. We will share the oct opus together . Let 's
't I play with the oct opus , too ?"
! The oct opus knew right away that
wanted to play with the oct opus , but we broke
dog . The gr umpy cat tried to find
tail ." The oct opus and the duck each
, we can ," the oct opus said . " We
- war ," said the oct opus . " I have
with me today ?" The gr umpy cat said , "
you and we love the oct opus ." Mom
shouted , pulling on the oct opus . "
away . The oct opus is happy . It
before . The oct opus had lots of arms
water . The oct opus is hungry and curious

MIN ACTIVATIONS
MIN = -0.079

band - aid on it and it will heal soon ."
. Daddy asked if they should save some of the whip
smiled and asked if she would like to say hello .
mole . " To get to the other side
. They hoped the sun would dry their stones soon .
, she asked if she would like a special kiss .
man asked Sarah if she would like to help him ,
clouds and asked if they would give it some water .
He asked her if she would like a tour of the
go home , so they carefully watched the changing tide until
go home , so they carefully watched the changing tide until
said to herself , " should I follow the path ?"
watch the butterfly and she wondered what it was doing .
They sit on a blanket and eat their food . They
replied the nurse . " To make sure your body is
He asked her if she would like to hear his song
the puppy , so she carefully put it on his collar
and the butterfly . They wondered together how such a small
knew that the wishing tree would remain a special place he
smile and asked if she could lend her the money for

INTERVAL 0.094 - 0.152
CONTAINS 3.585%

ie tried to quit eating the flower , but it was
a doll . They threw the ball and hugged the doll
had never played with the aer opl ane . They wish
smiled . She hopped around the tree singing the words of
laughed and smiled , and the pengu in showed him how

INTERVAL 0.036 - 0.094
CONTAINS 12.018%

slid down , feeling the warm sun on her face .
filled it up with the yellow cr ay on . He
a search to find out more about gray . Everywhere she
the big pond next to her house . She jumped in
, still smiling from all their fun . <|endoftext|> Once upon

INTERVAL -0.021 - 0.036
CONTAINS 53.174%

<|endoftext|> Once upon a time , in a tiny hut ,
believe her eyes . The sisters spent the day
was something fun . They came inside and put
her finger and gave her a hug . From that day
ed and stopped them . " Rex , what

INTERVAL -0.079 - -0.021
CONTAINS 31.223%

like cans and coins and papers . They liked to lift
started to wipe the counter and the floor . They worked
means . He also lets them touch his hat and his
be very lucky if you get to see it ."
my best friend . You can be friends too ."
Token.
Token ID13
Feature activation+1.26e-02

Pos logit contributions
 let+0.006
 it+0.006
 share+0.005
 hid+0.005
 fought+0.005
Neg logit contributions
ummy-0.008
umm-0.008
err-0.008
um-0.008
ini-0.008
Token
Token ID220
Feature activation+4.29e-02

Pos logit contributions
umm+0.025
ini+0.025
err+0.024
ummy+0.024
um+0.024
Neg logit contributions
 let-0.021
 it-0.021
 hid-0.019
 share-0.019
 fought-0.018
Token
Token ID198
Feature activation+2.59e-02

Pos logit contributions
umm+0.096
ini+0.088
ummy+0.087
 surroundings+0.086
err+0.086
Neg logit contributions
 it-0.069
 let-0.068
 hid-0.057
 share-0.057
 whoever-0.055
Token
Token ID198
Feature activation+1.76e-02

Pos logit contributions
umm+0.051
ummy+0.048
err+0.048
ini+0.047
 surroundings+0.047
Neg logit contributions
 let-0.046
 it-0.046
 share-0.040
 hid-0.040
 fought-0.039
TokenThe
Token ID464
Feature activation+1.28e-01

Pos logit contributions
umm+0.032
err+0.031
ummy+0.031
um+0.031
ini+0.031
Neg logit contributions
 let-0.031
 it-0.031
 hid-0.028
 share-0.028
 fought-0.027
Token gr
Token ID1036
Feature activation+1.52e-01

Pos logit contributions
umm+0.262
igsaw+0.255
ummy+0.254
um+0.250
ay+0.249
Neg logit contributions
 let-0.247
 share-0.217
 hid-0.215
 it-0.209
 put-0.207
Tokenumpy
Token ID32152
Feature activation+1.33e-01

Pos logit contributions
 biggest+0.129
umm+0.119
ama+0.110
 surroundings+0.110
err+0.100
Neg logit contributions
 let-0.391
 hid-0.390
 it-0.384
 fought-0.382
pping-0.373
Token bear
Token ID6842
Feature activation+8.17e-03

Pos logit contributions
umm+0.245
err+0.227
ay+0.226
igsaw+0.224
um+0.223
Neg logit contributions
 let-0.257
 it-0.246
 hid-0.242
 share-0.229
 put-0.222
Token was
Token ID373
Feature activation-1.45e-02

Pos logit contributions
err+0.017
umpy+0.017
um+0.017
ama+0.017
 surroundings+0.017
Neg logit contributions
 it-0.010
 let-0.010
 share-0.010
 risking-0.009
 letting-0.009
Token so
Token ID523
Feature activation+1.24e-03

Pos logit contributions
 let+0.032
 it+0.031
 share+0.029
 hid+0.028
 whoever+0.028
Neg logit contributions
umpy-0.019
err-0.019
ummy-0.019
um-0.019
ubby-0.019
Token mad
Token ID8805
Feature activation+6.09e-03

Pos logit contributions
umm+0.002
err+0.002
ummy+0.002
ini+0.002
 surroundings+0.002
Neg logit contributions
 let-0.002
 it-0.002
 share-0.002
 fought-0.002
 hid-0.002
Token he
Token ID339
Feature activation-3.60e-02

Pos logit contributions
pping+0.044
 share+0.043
ts+0.043
sels+0.042
pped+0.042
Neg logit contributions
um-0.073
 surroundings-0.072
 world-0.070
ummy-0.070
 biggest-0.068
Token wanted
Token ID2227
Feature activation-2.85e-02

Pos logit contributions
 risking+0.041
 letting+0.039
 it+0.038
 share+0.037
pping+0.036
Neg logit contributions
umpy-0.078
um-0.076
ini-0.075
err-0.074
 surroundings-0.072
Token to
Token ID284
Feature activation-4.40e-02

Pos logit contributions
 let+0.033
 hid+0.030
 fought+0.030
 share+0.029
pping+0.029
Neg logit contributions
 biggest-0.064
ummy-0.064
 surroundings-0.063
um-0.063
 world-0.062
Token keep
Token ID1394
Feature activation-3.13e-02

Pos logit contributions
 risking+0.049
pped+0.049
 hid+0.046
pping+0.046
 wouldn+0.045
Neg logit contributions
um-0.099
 biggest-0.094
ummy-0.093
ama-0.092
 world-0.091
Token the
Token ID262
Feature activation+1.16e-01

Pos logit contributions
 let+0.040
 hid+0.035
 fought+0.034
 share+0.031
 wouldn+0.031
Neg logit contributions
 surroundings-0.071
 biggest-0.071
ummy-0.070
um-0.068
ama-0.068
Token oct
Token ID19318
Feature activation+1.52e-01

Pos logit contributions
ama+0.202
umm+0.201
err+0.199
um+0.198
ay+0.193
Neg logit contributions
 let-0.227
 it-0.224
 share-0.200
 hid-0.190
 put-0.190
Tokenopus
Token ID25790
Feature activation+1.90e-02

Pos logit contributions
err+0.205
 surroundings+0.202
umm+0.201
 biggest+0.196
um+0.180
Neg logit contributions
 let-0.328
 it-0.318
 fought-0.297
 share-0.293
pping-0.286
Token anyway
Token ID6949
Feature activation-4.47e-04

Pos logit contributions
umm+0.037
ummy+0.037
 surroundings+0.037
um+0.037
ini+0.037
Neg logit contributions
 let-0.030
 it-0.029
 hid-0.027
 fought-0.026
 share-0.026
Token.
Token ID13
Feature activation+8.95e-03

Pos logit contributions
 let+0.001
 it+0.001
 share+0.001
 hid+0.001
 fought+0.001
Neg logit contributions
umm-0.001
ummy-0.001
um-0.001
err-0.001
ini-0.001
Token "
Token ID366
Feature activation-7.05e-03

Pos logit contributions
ini+0.016
ubby+0.015
umm+0.015
err+0.015
 surroundings+0.015
Neg logit contributions
 let-0.017
 it-0.017
 share-0.015
 hid-0.015
 fought-0.014
TokenI
Token ID40
Feature activation-1.82e-02

Pos logit contributions
 let+0.011
 it+0.010
 share+0.009
 fought+0.009
 hid+0.009
Neg logit contributions
ummy-0.015
umm-0.015
ama-0.015
err-0.015
ubby-0.015
Token home
Token ID1363
Feature activation-1.78e-02

Pos logit contributions
 let+0.038
 fought+0.036
 hid+0.034
 share+0.034
 snatched+0.034
Neg logit contributions
 surroundings-0.052
ummy-0.052
 biggest-0.051
um-0.051
 world-0.050
Token.
Token ID13
Feature activation+1.52e-02

Pos logit contributions
 let+0.030
 fought+0.027
 share+0.027
 hid+0.026
 it+0.026
Neg logit contributions
ummy-0.033
 surroundings-0.032
um-0.031
 biggest-0.031
err-0.031
Token
Token ID198
Feature activation+3.75e-02

Pos logit contributions
umm+0.034
ubby+0.033
ini+0.033
ummy+0.032
err+0.031
Neg logit contributions
 it-0.025
 let-0.024
 hid-0.021
 fought-0.019
 share-0.019
Token
Token ID198
Feature activation+2.37e-02

Pos logit contributions
umm+0.092
ini+0.081
ummy+0.080
err+0.080
ubby+0.079
Neg logit contributions
 it-0.060
 let-0.058
 never-0.048
 hid-0.047
 share-0.046
TokenThe
Token ID464
Feature activation+1.17e-01

Pos logit contributions
umm+0.048
ummy+0.046
ini+0.044
err+0.044
um+0.043
Neg logit contributions
 let-0.043
 it-0.041
 hid-0.037
 share-0.037
 fought-0.036
Token oct
Token ID19318
Feature activation+1.52e-01

Pos logit contributions
umm+0.235
err+0.223
ama+0.217
ay+0.215
oky+0.213
Neg logit contributions
 let-0.226
 it-0.215
 share-0.191
 put-0.189
 fought-0.176
Tokenopus
Token ID25790
Feature activation+4.64e-03

Pos logit contributions
umm+0.193
 biggest+0.181
err+0.180
 surroundings+0.179
um+0.176
Neg logit contributions
 let-0.346
 it-0.345
 fought-0.321
 hid-0.313
 share-0.312
Token sw
Token ID1509
Feature activation+6.86e-02

Pos logit contributions
err+0.012
umm+0.012
um+0.011
 surroundings+0.011
ini+0.011
Neg logit contributions
 let-0.005
 it-0.005
 share-0.004
 hid-0.004
 fought-0.004
Tokenam
Token ID321
Feature activation+1.20e-03

Pos logit contributions
umm+0.115
ummy+0.109
err+0.105
ini+0.105
ama+0.104
Neg logit contributions
 let-0.133
 it-0.130
 hid-0.125
 share-0.124
 fought-0.124
Token and
Token ID290
Feature activation-3.14e-02

Pos logit contributions
umm+0.003
ummy+0.003
err+0.003
um+0.003
ini+0.003
Neg logit contributions
 let-0.002
 it-0.002
 share-0.001
 hid-0.001
 fought-0.001
Token sw
Token ID1509
Feature activation+3.66e-02

Pos logit contributions
pping+0.043
 share+0.039
 let+0.039
pped+0.039
 risking+0.038
Neg logit contributions
 surroundings-0.060
 sun-0.056
err-0.056
 sky-0.055
 woods-0.054
Token ocean
Token ID9151
Feature activation+1.72e-03

Pos logit contributions
err+0.146
ama+0.144
umm+0.141
ummy+0.135
ini+0.133
Neg logit contributions
 let-0.207
 it-0.206
 share-0.181
 hid-0.180
 put-0.179
Token.
Token ID13
Feature activation+4.09e-03

Pos logit contributions
ummy+0.003
umm+0.003
 world+0.003
um+0.003
 surroundings+0.003
Neg logit contributions
 let-0.003
 share-0.003
 it-0.002
 fought-0.002
 hid-0.002
Token
Token ID198
Feature activation+3.09e-02

Pos logit contributions
umm+0.008
ini+0.008
err+0.008
ummy+0.008
 surroundings+0.008
Neg logit contributions
 let-0.007
 it-0.007
 hid-0.006
 share-0.006
 fought-0.006
Token
Token ID198
Feature activation+1.83e-02

Pos logit contributions
umm+0.069
ini+0.062
err+0.062
ummy+0.062
 surroundings+0.061
Neg logit contributions
 it-0.052
 let-0.051
 share-0.042
 hid-0.042
 never-0.041
TokenThe
Token ID464
Feature activation+1.20e-01

Pos logit contributions
umm+0.033
err+0.032
um+0.032
ummy+0.032
ini+0.032
Neg logit contributions
 let-0.033
 it-0.032
 hid-0.030
 share-0.030
 fought-0.029
Token oct
Token ID19318
Feature activation+1.50e-01

Pos logit contributions
umm+0.242
err+0.241
ama+0.234
anny+0.231
oky+0.230
Neg logit contributions
 let-0.219
 it-0.205
 put-0.180
 share-0.173
 tried-0.169
Tokenopus
Token ID25790
Feature activation+2.56e-03

Pos logit contributions
umm+0.181
 surroundings+0.173
err+0.171
 biggest+0.168
um+0.168
Neg logit contributions
 let-0.355
 it-0.354
 fought-0.329
 hid-0.324
 share-0.319
Token was
Token ID373
Feature activation-1.51e-02

Pos logit contributions
um+0.006
err+0.006
ini+0.006
 surroundings+0.006
ummy+0.006
Neg logit contributions
 let-0.003
 it-0.003
 share-0.003
 risking-0.002
 hid-0.002
Token curious
Token ID11040
Feature activation-1.35e-02

Pos logit contributions
 let+0.030
 it+0.029
 share+0.028
 fought+0.027
 hid+0.027
Neg logit contributions
 biggest-0.022
err-0.022
um-0.022
ummy-0.022
 surroundings-0.022
Token and
Token ID290
Feature activation-2.97e-02

Pos logit contributions
 let+0.021
 share+0.021
 fought+0.019
 hid+0.019
 it+0.019
Neg logit contributions
ummy-0.027
umm-0.026
um-0.026
 surroundings-0.026
err-0.025
Token excited
Token ID6568
Feature activation-1.26e-04

Pos logit contributions
pping+0.035
pped+0.035
 risking+0.032
 let+0.032
 share+0.032
Neg logit contributions
 surroundings-0.063
 biggest-0.061
err-0.061
umpy-0.059
um-0.059
Token She
Token ID1375
Feature activation-4.04e-02

Pos logit contributions
umm+0.002
ummy+0.002
um+0.001
err+0.001
ama+0.001
Neg logit contributions
 let-0.002
 it-0.002
 fought-0.002
 share-0.002
 hid-0.002
Token said
Token ID531
Feature activation-3.27e-02

Pos logit contributions
 risking+0.054
pping+0.050
 share+0.049
 letting+0.048
 it+0.048
Neg logit contributions
um-0.081
err-0.080
olly-0.078
ama-0.078
 surroundings-0.078
Token to
Token ID284
Feature activation-4.79e-02

Pos logit contributions
 let+0.050
 fought+0.049
pping+0.047
 argued+0.046
 hid+0.045
Neg logit contributions
ummy-0.067
um-0.066
ama-0.064
olly-0.060
 biggest-0.059
Token Mr
Token ID1770
Feature activation+7.66e-02

Pos logit contributions
 let+0.071
 it+0.067
 hid+0.066
 fought+0.066
 managed+0.064
Neg logit contributions
ummy-0.099
um-0.096
umm-0.096
ama-0.095
err-0.094
Token.
Token ID13
Feature activation+4.92e-02

Pos logit contributions
umm+0.167
ummy+0.166
err+0.162
um+0.162
ama+0.161
Neg logit contributions
 let-0.110
 it-0.105
 share-0.095
 hid-0.095
 fought-0.094
Token Sp
Token ID1338
Feature activation+1.50e-01

Pos logit contributions
umm+0.104
ummy+0.102
err+0.101
um+0.101
ini+0.100
Neg logit contributions
 let-0.075
 it-0.072
 share-0.065
 hid-0.065
 fought-0.063
Tokenud
Token ID463
Feature activation+1.82e-02

Pos logit contributions
 biggest+0.232
umm+0.223
 surroundings+0.214
um+0.207
 countryside+0.205
Neg logit contributions
 let-0.293
 it-0.277
pping-0.273
 hid-0.264
 wouldn-0.257
Token,
Token ID11
Feature activation-3.35e-02

Pos logit contributions
ama+0.031
um+0.030
ummy+0.030
ubby+0.029
ini+0.029
Neg logit contributions
 share-0.031
 risking-0.030
 let-0.029
 whoever-0.029
 distract-0.028
Token "
Token ID366
Feature activation-1.68e-02

Pos logit contributions
 share+0.070
ts+0.069
pping+0.068
 fought+0.067
pped+0.066
Neg logit contributions
 sky-0.041
 sun-0.041
ummy-0.038
err-0.037
 girl-0.037
TokenI
Token ID40
Feature activation-8.90e-03

Pos logit contributions
 let+0.027
 it+0.026
 hid+0.024
 share+0.024
 fought+0.023
Neg logit contributions
umm-0.034
err-0.033
ini-0.033
ummy-0.033
um-0.033
Token want
Token ID765
Feature activation-3.46e-02

Pos logit contributions
 let+0.012
 it+0.012
 letting+0.011
 risking+0.011
pping+0.011
Neg logit contributions
umpy-0.018
err-0.018
um-0.018
ama-0.018
ummy-0.017
Token
Token ID220
Feature activation+3.12e-02

Pos logit contributions
umm+0.003
ini+0.003
err+0.003
 surroundings+0.003
ubby+0.003
Neg logit contributions
 it-0.003
 let-0.003
 hid-0.003
 share-0.002
 whoever-0.002
Token
Token ID198
Feature activation+1.72e-02

Pos logit contributions
umm+0.060
ini+0.056
ummy+0.056
err+0.056
 surroundings+0.056
Neg logit contributions
 let-0.056
 it-0.055
 share-0.049
 hid-0.049
 fought-0.047
Token
Token ID198
Feature activation+1.06e-02

Pos logit contributions
umm+0.031
ummy+0.030
err+0.030
ini+0.030
um+0.030
Neg logit contributions
 let-0.032
 it-0.031
 share-0.029
 hid-0.028
 fought-0.028
TokenThe
Token ID464
Feature activation+1.14e-01

Pos logit contributions
umm+0.016
umpy+0.016
um+0.016
err+0.016
ini+0.016
Neg logit contributions
 it-0.021
 let-0.021
 hid-0.020
 fought-0.020
 share-0.019
Token smart
Token ID4451
Feature activation+1.26e-01

Pos logit contributions
umm+0.212
err+0.206
ama+0.204
umpy+0.201
ummy+0.199
Neg logit contributions
 let-0.229
 it-0.221
 share-0.198
 put-0.193
 hid-0.185
Token oct
Token ID19318
Feature activation+1.50e-01

Pos logit contributions
umm+0.221
err+0.212
ama+0.210
 surroundings+0.208
ummy+0.199
Neg logit contributions
 let-0.257
 it-0.252
 hid-0.220
 put-0.218
 share-0.217
Tokenopus
Token ID25790
Feature activation+4.16e-03

Pos logit contributions
umm+0.175
err+0.166
 surroundings+0.164
um+0.161
ummy+0.160
Neg logit contributions
 let-0.362
 it-0.357
 fought-0.333
 share-0.331
 hid-0.330
Token was
Token ID373
Feature activation-2.71e-02

Pos logit contributions
ini+0.009
um+0.009
ummy+0.009
err+0.009
umm+0.009
Neg logit contributions
 let-0.005
 it-0.005
 risking-0.005
 share-0.005
 losing-0.005
Token very
Token ID845
Feature activation-3.17e-05

Pos logit contributions
 let+0.047
 fought+0.045
 risking+0.045
 hid+0.044
 argued+0.044
Neg logit contributions
 biggest-0.047
um-0.044
 best-0.042
ummy-0.042
ubby-0.041
Token happy
Token ID3772
Feature activation-7.49e-03

Pos logit contributions
 let+0.000
 it+0.000
 share+0.000
 hid+0.000
 fought+0.000
Neg logit contributions
umm-0.000
err-0.000
ummy-0.000
ini-0.000
 surroundings-0.000
Token with
Token ID351
Feature activation-2.20e-02

Pos logit contributions
 let+0.012
pped+0.011
 share+0.011
 fought+0.011
 argued+0.011
Neg logit contributions
ummy-0.015
umm-0.014
 world-0.014
 surroundings-0.014
um-0.013
Token.
Token ID13
Feature activation-1.61e-02

Pos logit contributions
umm+0.065
ummy+0.064
err+0.063
um+0.063
ini+0.062
Neg logit contributions
 let-0.040
 it-0.039
 hid-0.035
 share-0.035
 fought-0.033
Token We
Token ID775
Feature activation-2.80e-02

Pos logit contributions
 let+0.026
 it+0.025
 share+0.023
 hid+0.023
 fought+0.022
Neg logit contributions
umm-0.032
ummy-0.032
err-0.031
ini-0.031
um-0.031
Token will
Token ID481
Feature activation-5.61e-02

Pos logit contributions
 letting+0.042
 risking+0.041
 losing+0.039
 it+0.037
 let+0.036
Neg logit contributions
ay-0.055
ubby-0.052
 world-0.051
um-0.051
umpy-0.050
Token share
Token ID2648
Feature activation-4.42e-02

Pos logit contributions
 risking+0.072
 hid+0.067
pped+0.064
 snatched+0.063
 it+0.063
Neg logit contributions
um-0.118
 world-0.117
ummy-0.114
ay-0.112
 sun-0.112
Token the
Token ID262
Feature activation+1.08e-01

Pos logit contributions
 let+0.043
 hid+0.041
 fought+0.038
 managed+0.037
 wouldn+0.037
Neg logit contributions
ummy-0.110
um-0.108
 surroundings-0.108
ay-0.107
err-0.107
Token oct
Token ID19318
Feature activation+1.50e-01

Pos logit contributions
umm+0.207
ama+0.207
um+0.204
lights+0.200
umpy+0.195
Neg logit contributions
 let-0.192
 it-0.191
 share-0.168
 put-0.158
 don-0.156
Tokenopus
Token ID25790
Feature activation+2.67e-02

Pos logit contributions
 surroundings+0.211
umm+0.204
 biggest+0.203
err+0.203
um+0.186
Neg logit contributions
 let-0.314
 it-0.304
 fought-0.292
 share-0.285
 hid-0.270
Token together
Token ID1978
Feature activation-3.94e-02

Pos logit contributions
umm+0.063
err+0.061
ummy+0.061
um+0.060
ama+0.060
Neg logit contributions
 let-0.035
 it-0.034
 share-0.031
 hid-0.029
 fought-0.028
Token.
Token ID13
Feature activation-1.47e-02

Pos logit contributions
 let+0.059
 hid+0.058
 risking+0.056
 snatched+0.056
 fought+0.056
Neg logit contributions
ummy-0.077
um-0.074
 world-0.073
 surroundings-0.073
ay-0.072
Token Let
Token ID3914
Feature activation-3.34e-02

Pos logit contributions
 let+0.024
 it+0.024
 hid+0.022
 share+0.022
 fought+0.021
Neg logit contributions
umm-0.029
ummy-0.028
err-0.028
um-0.028
ini-0.027
Token's
Token ID338
Feature activation-3.41e-02

Pos logit contributions
 let+0.043
 it+0.041
 hid+0.038
 share+0.038
 fought+0.038
Neg logit contributions
umm-0.076
ummy-0.075
um-0.075
err-0.074
ini-0.074
Token't
Token ID470
Feature activation-2.85e-02

Pos logit contributions
 let+0.070
 hid+0.067
 managed+0.063
 risking+0.062
 fought+0.062
Neg logit contributions
ummy-0.122
um-0.118
ama-0.117
umm-0.115
ini-0.114
Token I
Token ID314
Feature activation-3.72e-02

Pos logit contributions
 let+0.033
 hid+0.033
 managed+0.033
 wouldn+0.032
 risking+0.031
Neg logit contributions
ummy-0.068
um-0.064
umm-0.063
ini-0.062
ama-0.061
Token play
Token ID711
Feature activation-4.68e-03

Pos logit contributions
 it+0.042
 let+0.040
 risking+0.040
 hid+0.038
 managed+0.036
Neg logit contributions
ummy-0.085
um-0.085
ay-0.083
umpy-0.083
ini-0.082
Token with
Token ID351
Feature activation-2.28e-02

Pos logit contributions
 let+0.007
 it+0.006
 hid+0.006
 fought+0.006
 managed+0.006
Neg logit contributions
ummy-0.010
um-0.009
ini-0.009
umm-0.009
ay-0.009
Token the
Token ID262
Feature activation+1.19e-01

Pos logit contributions
 let+0.029
 hid+0.026
 managed+0.025
 it+0.025
 fought+0.024
Neg logit contributions
ummy-0.053
um-0.051
umm-0.050
 surroundings-0.050
 biggest-0.050
Token oct
Token ID19318
Feature activation+1.49e-01

Pos logit contributions
umm+0.243
ama+0.243
um+0.240
err+0.237
lights+0.237
Neg logit contributions
 let-0.198
 it-0.194
 hid-0.160
 share-0.159
 put-0.158
Tokenopus
Token ID25790
Feature activation+2.63e-02

Pos logit contributions
 surroundings+0.208
umm+0.198
err+0.198
 biggest+0.198
um+0.186
Neg logit contributions
 let-0.318
 it-0.312
 fought-0.299
 share-0.285
 hid-0.283
Token,
Token ID11
Feature activation-3.52e-02

Pos logit contributions
umm+0.053
ummy+0.052
ini+0.051
err+0.051
um+0.051
Neg logit contributions
 let-0.041
 it-0.040
 hid-0.037
 share-0.036
 fought-0.035
Token too
Token ID1165
Feature activation+1.60e-03

Pos logit contributions
 let+0.050
 it+0.047
 hid+0.047
 fought+0.047
 share+0.046
Neg logit contributions
ummy-0.073
umm-0.071
um-0.071
err-0.070
ama-0.070
Token?"
Token ID1701
Feature activation+9.51e-03

Pos logit contributions
umm+0.004
err+0.004
ini+0.004
 surroundings+0.004
ummy+0.004
Neg logit contributions
 let-0.002
 it-0.002
 share-0.002
 hid-0.002
 fought-0.002
Token
Token ID198
Feature activation+2.99e-02

Pos logit contributions
umm+0.018
ummy+0.017
ini+0.017
err+0.017
um+0.017
Neg logit contributions
 let-0.017
 it-0.017
 share-0.015
 hid-0.015
 fought-0.015
Token!
Token ID0
Feature activation+9.69e-03

Pos logit contributions
 share+0.098
 fought+0.097
 let+0.097
 argued+0.094
 hid+0.093
Neg logit contributions
 biggest-0.077
um-0.077
 world-0.077
 surroundings-0.076
ummy-0.075
Token
Token ID220
Feature activation+3.69e-02

Pos logit contributions
umm+0.020
ini+0.019
ummy+0.019
err+0.019
 surroundings+0.019
Neg logit contributions
 it-0.016
 let-0.016
 hid-0.014
 share-0.014
 fought-0.013
Token
Token ID198
Feature activation+2.84e-02

Pos logit contributions
umm+0.062
ummy+0.061
err+0.060
um+0.060
ini+0.059
Neg logit contributions
 let-0.072
 it-0.069
 share-0.065
 hid-0.065
 fought-0.064
Token
Token ID198
Feature activation+2.29e-02

Pos logit contributions
umm+0.052
ummy+0.050
err+0.049
ini+0.049
um+0.049
Neg logit contributions
 let-0.053
 it-0.052
 share-0.047
 hid-0.047
 fought-0.046
TokenThe
Token ID464
Feature activation+1.16e-01

Pos logit contributions
umm+0.040
err+0.039
um+0.039
ini+0.039
ummy+0.039
Neg logit contributions
 let-0.043
 it-0.042
 hid-0.039
 share-0.039
 fought-0.038
Token oct
Token ID19318
Feature activation+1.49e-01

Pos logit contributions
umm+0.234
err+0.224
ummy+0.223
ama+0.222
ay+0.221
Neg logit contributions
 let-0.216
 it-0.205
 put-0.179
 hid-0.175
 share-0.173
Tokenopus
Token ID25790
Feature activation-3.72e-04

Pos logit contributions
umm+0.190
 biggest+0.180
 surroundings+0.176
err+0.175
um+0.173
Neg logit contributions
 let-0.341
 it-0.337
 fought-0.320
 hid-0.313
 share-0.306
Token knew
Token ID2993
Feature activation-4.24e-02

Pos logit contributions
 let+0.000
 it+0.000
 share+0.000
 hid+0.000
 risking+0.000
Neg logit contributions
err-0.001
um-0.001
ini-0.001
 surroundings-0.001
umm-0.001
Token right
Token ID826
Feature activation+9.26e-03

Pos logit contributions
 argued+0.050
 argue+0.050
 fought+0.049
 share+0.049
 shared+0.043
Neg logit contributions
ama-0.091
 surroundings-0.090
 best-0.089
 biggest-0.088
 new-0.086
Token away
Token ID1497
Feature activation-2.12e-02

Pos logit contributions
um+0.021
 surroundings+0.021
ummy+0.020
umm+0.020
err+0.020
Neg logit contributions
 let-0.012
 share-0.011
 it-0.011
 hid-0.011
 fought-0.011
Token that
Token ID326
Feature activation-7.67e-03

Pos logit contributions
 fought+0.030
 argued+0.030
 share+0.030
 let+0.030
pped+0.028
Neg logit contributions
ummy-0.041
 surroundings-0.041
um-0.041
ama-0.040
 world-0.039
Token wanted
Token ID2227
Feature activation-3.18e-02

Pos logit contributions
 it+0.021
 letting+0.020
 risking+0.020
 whoever+0.019
 let+0.019
Neg logit contributions
umpy-0.040
ini-0.040
ubby-0.039
ay-0.039
um-0.039
Token to
Token ID284
Feature activation-4.98e-02

Pos logit contributions
 let+0.033
 hid+0.030
pping+0.029
 managed+0.029
 fought+0.029
Neg logit contributions
ummy-0.078
 biggest-0.075
um-0.074
 surroundings-0.074
umm-0.072
Token play
Token ID711
Feature activation-2.38e-04

Pos logit contributions
 risking+0.057
 it+0.055
 hid+0.055
 wouldn+0.055
 let+0.054
Neg logit contributions
um-0.110
ummy-0.108
ay-0.107
ini-0.106
ama-0.105
Token with
Token ID351
Feature activation-1.67e-02

Pos logit contributions
 let+0.000
 it+0.000
 hid+0.000
 fought+0.000
 managed+0.000
Neg logit contributions
ummy-0.000
um-0.000
umm-0.000
err-0.000
 surroundings-0.000
Token the
Token ID262
Feature activation+1.15e-01

Pos logit contributions
 let+0.019
 it+0.017
 hid+0.016
 share+0.016
 fought+0.016
Neg logit contributions
ummy-0.041
umm-0.040
um-0.040
 surroundings-0.040
ama-0.040
Token oct
Token ID19318
Feature activation+1.49e-01

Pos logit contributions
um+0.209
umm+0.209
ama+0.202
err+0.196
ummy+0.191
Neg logit contributions
 it-0.217
 let-0.214
 share-0.192
 hid-0.188
 make-0.188
Tokenopus
Token ID25790
Feature activation+2.37e-02

Pos logit contributions
 biggest+0.218
umm+0.216
 surroundings+0.216
err+0.201
andel+0.198
Neg logit contributions
 it-0.294
 let-0.294
 fought-0.280
 hid-0.269
 stole-0.261
Token,
Token ID11
Feature activation-2.38e-02

Pos logit contributions
umm+0.053
ummy+0.053
err+0.052
um+0.052
ini+0.052
Neg logit contributions
 let-0.032
 it-0.030
 share-0.027
 hid-0.027
 fought-0.026
Token but
Token ID475
Feature activation-2.66e-02

Pos logit contributions
 let+0.035
 it+0.034
 hid+0.033
 fought+0.032
 share+0.032
Neg logit contributions
ummy-0.048
umm-0.048
um-0.047
err-0.047
ama-0.047
Token we
Token ID356
Feature activation-3.15e-02

Pos logit contributions
pped+0.027
pping+0.026
 argued+0.025
 fought+0.024
 let+0.023
Neg logit contributions
ummy-0.065
 biggest-0.064
ama-0.062
um-0.061
 best-0.060
Token broke
Token ID6265
Feature activation-3.47e-02

Pos logit contributions
 letting+0.037
 risking+0.036
 it+0.034
 whoever+0.034
 losing+0.031
Neg logit contributions
umpy-0.067
ubby-0.065
ini-0.064
ay-0.061
err-0.060
Token dog
Token ID3290
Feature activation+2.69e-03

Pos logit contributions
umm+0.102
err+0.100
ini+0.098
um+0.097
ummy+0.097
Neg logit contributions
 let-0.132
 it-0.130
 share-0.119
 hid-0.119
 fought-0.116
Token.
Token ID13
Feature activation+8.81e-03

Pos logit contributions
 world+0.004
ummy+0.003
umpy+0.003
 jungle+0.003
elly+0.003
Neg logit contributions
 managed-0.005
 argued-0.005
ts-0.005
 it-0.005
 risking-0.004
Token
Token ID198
Feature activation+3.80e-02

Pos logit contributions
umm+0.017
ini+0.017
err+0.017
ummy+0.017
 surroundings+0.017
Neg logit contributions
 let-0.015
 it-0.015
 share-0.013
 hid-0.013
 fought-0.013
Token
Token ID198
Feature activation+2.62e-02

Pos logit contributions
umm+0.091
ini+0.082
 surroundings+0.081
err+0.080
ummy+0.080
Neg logit contributions
 it-0.062
 let-0.059
 share-0.047
 never-0.047
 hid-0.047
TokenThe
Token ID464
Feature activation+1.22e-01

Pos logit contributions
umm+0.050
ummy+0.048
err+0.047
ini+0.047
 surroundings+0.047
Neg logit contributions
 let-0.048
 it-0.046
 share-0.042
 hid-0.042
 fought-0.041
Token gr
Token ID1036
Feature activation+1.49e-01

Pos logit contributions
umm+0.268
lights+0.245
um+0.245
igsaw+0.242
err+0.241
Neg logit contributions
 let-0.221
 it-0.201
 put-0.190
 share-0.189
 make-0.181
Tokenumpy
Token ID32152
Feature activation+1.25e-01

Pos logit contributions
 biggest+0.140
umm+0.135
 surroundings+0.114
err+0.106
 best+0.096
Neg logit contributions
 it-0.379
 let-0.375
 fought-0.366
 hid-0.360
pping-0.351
Token cat
Token ID3797
Feature activation+1.97e-03

Pos logit contributions
umm+0.234
err+0.212
lights+0.205
 surroundings+0.202
um+0.201
Neg logit contributions
 let-0.242
 it-0.241
 hid-0.219
 share-0.217
 put-0.210
Token tried
Token ID3088
Feature activation-4.78e-02

Pos logit contributions
err+0.004
umpy+0.004
um+0.004
ama+0.004
 surroundings+0.004
Neg logit contributions
 it-0.002
 share-0.002
 risking-0.002
 let-0.002
 letting-0.002
Token to
Token ID284
Feature activation-4.89e-02

Pos logit contributions
 let+0.052
pped+0.046
 fought+0.045
 share+0.044
 hid+0.044
Neg logit contributions
err-0.112
 surroundings-0.112
ummy-0.109
 world-0.107
 biggest-0.106
Token find
Token ID1064
Feature activation-2.98e-02

Pos logit contributions
 it+0.058
 let+0.053
 risking+0.053
 wouldn+0.052
 fought+0.050
Neg logit contributions
um-0.106
err-0.105
 surroundings-0.102
ama-0.102
 biggest-0.102
Token tail
Token ID7894
Feature activation-1.75e-03

Pos logit contributions
umm+0.161
ini+0.150
ummy+0.150
err+0.149
ama+0.149
Neg logit contributions
 let-0.160
 it-0.156
 share-0.141
 hid-0.138
 put-0.134
Token."
Token ID526
Feature activation+1.13e-02

Pos logit contributions
 let+0.002
 it+0.002
 hid+0.002
 share+0.002
 fought+0.002
Neg logit contributions
ummy-0.004
umm-0.004
umpy-0.004
um-0.004
 biggest-0.004
Token
Token ID198
Feature activation+4.44e-02

Pos logit contributions
umm+0.021
ummy+0.020
ini+0.020
err+0.020
 surroundings+0.020
Neg logit contributions
 let-0.021
 it-0.020
 share-0.018
 hid-0.018
 fought-0.018
Token
Token ID198
Feature activation+3.57e-02

Pos logit contributions
umm+0.112
ini+0.099
 surroundings+0.097
err+0.097
ubby+0.097
Neg logit contributions
 it-0.068
 let-0.066
 share-0.052
 hid-0.050
 put-0.049
TokenThe
Token ID464
Feature activation+1.17e-01

Pos logit contributions
umm+0.066
ummy+0.064
err+0.063
um+0.063
ini+0.063
Neg logit contributions
 let-0.065
 it-0.063
 share-0.058
 hid-0.058
 fought-0.057
Token oct
Token ID19318
Feature activation+1.49e-01

Pos logit contributions
umm+0.247
lights+0.230
ama+0.229
err+0.228
 surroundings+0.226
Neg logit contributions
 let-0.216
 it-0.200
 put-0.175
 share-0.174
 tried-0.170
Tokenopus
Token ID25790
Feature activation-2.79e-03

Pos logit contributions
umm+0.189
 surroundings+0.182
 biggest+0.182
err+0.173
um+0.172
Neg logit contributions
 let-0.345
 it-0.339
 fought-0.316
 hid-0.313
 share-0.312
Token and
Token ID290
Feature activation+1.70e-02

Pos logit contributions
 it+0.003
 let+0.003
 risking+0.003
 share+0.003
 whoever+0.003
Neg logit contributions
err-0.006
um-0.006
ini-0.006
umm-0.006
 surroundings-0.006
Token the
Token ID262
Feature activation+1.11e-01

Pos logit contributions
umm+0.038
err+0.038
ummy+0.037
um+0.037
ini+0.037
Neg logit contributions
 let-0.022
 it-0.021
 hid-0.019
 share-0.019
 fought-0.019
Token duck
Token ID22045
Feature activation-4.15e-03

Pos logit contributions
umm+0.222
ama+0.211
um+0.210
ummy+0.210
err+0.208
Neg logit contributions
 let-0.217
 it-0.200
 tried-0.175
 put-0.174
 share-0.174
Token each
Token ID1123
Feature activation+3.99e-02

Pos logit contributions
 it+0.005
 risking+0.005
 let+0.005
 whoever+0.004
 letting+0.004
Neg logit contributions
um-0.009
 surroundings-0.009
ini-0.009
umpy-0.009
ama-0.008
Token,
Token ID11
Feature activation-1.72e-02

Pos logit contributions
 let+0.002
 hid+0.002
 fought+0.002
 risking+0.002
 share+0.002
Neg logit contributions
ummy-0.002
um-0.002
ama-0.002
ay-0.002
umm-0.002
Token we
Token ID356
Feature activation-6.78e-03

Pos logit contributions
 fought+0.018
 hid+0.018
 shared+0.016
 argued+0.016
 share+0.016
Neg logit contributions
ummy-0.041
 biggest-0.039
umm-0.039
um-0.039
ini-0.039
Token can
Token ID460
Feature activation-3.58e-02

Pos logit contributions
 risking+0.010
 letting+0.009
 hid+0.009
ts+0.009
 it+0.009
Neg logit contributions
ay-0.013
um-0.013
 biggest-0.013
 sun-0.012
 world-0.012
Token,"
Token ID553
Feature activation-6.36e-03

Pos logit contributions
 risking+0.066
 letting+0.059
 hid+0.057
pped+0.056
 managed+0.056
Neg logit contributions
um-0.063
ay-0.060
ummy-0.060
ama-0.056
 biggest-0.055
Token the
Token ID262
Feature activation+1.28e-01

Pos logit contributions
 let+0.007
 it+0.006
 hid+0.006
 share+0.005
 fought+0.005
Neg logit contributions
umm-0.016
ummy-0.016
err-0.016
um-0.016
ini-0.016
Token oct
Token ID19318
Feature activation+1.49e-01

Pos logit contributions
umm+0.287
 surroundings+0.265
err+0.262
lights+0.261
ay+0.260
Neg logit contributions
 it-0.221
 let-0.216
 hid-0.174
 put-0.173
 tried-0.173
Tokenopus
Token ID25790
Feature activation+4.00e-03

Pos logit contributions
umm+0.189
 surroundings+0.183
 biggest+0.183
err+0.165
ummy+0.165
Neg logit contributions
 let-0.347
 it-0.343
 fought-0.322
 hid-0.320
 share-0.312
Token said
Token ID531
Feature activation-9.26e-03

Pos logit contributions
err+0.009
um+0.009
umm+0.009
ummy+0.009
ini+0.009
Neg logit contributions
 it-0.005
 let-0.005
 share-0.004
 risking-0.004
 hid-0.004
Token.
Token ID13
Feature activation+8.96e-03

Pos logit contributions
 risking+0.015
 let+0.015
 hid+0.014
 never+0.014
 share+0.014
Neg logit contributions
um-0.016
ummy-0.016
ubby-0.015
olly-0.015
 biggest-0.015
Token "
Token ID366
Feature activation-1.27e-02

Pos logit contributions
umm+0.017
ummy+0.017
err+0.017
um+0.017
umpy+0.016
Neg logit contributions
 let-0.015
 it-0.014
 share-0.013
 hid-0.013
 fought-0.013
TokenWe
Token ID1135
Feature activation-1.75e-02

Pos logit contributions
 let+0.020
 it+0.019
 share+0.017
 hid+0.017
 fought+0.017
Neg logit contributions
umm-0.026
ummy-0.026
err-0.026
um-0.026
ini-0.025
Token-
Token ID12
Feature activation+3.87e-02

Pos logit contributions
umm+0.082
err+0.079
ummy+0.078
ini+0.078
um+0.077
Neg logit contributions
 let-0.107
 it-0.104
 share-0.096
 hid-0.096
 fought-0.094
Tokenwar
Token ID5767
Feature activation+6.47e-03

Pos logit contributions
umm+0.080
ummy+0.077
err+0.076
 surroundings+0.075
ama+0.075
Neg logit contributions
 let-0.064
 it-0.063
 share-0.056
 hid-0.055
 fought-0.054
Token,"
Token ID553
Feature activation-6.81e-03

Pos logit contributions
umm+0.015
ummy+0.015
um+0.014
err+0.014
ini+0.014
Neg logit contributions
 let-0.009
 it-0.008
 share-0.007
 hid-0.007
 fought-0.007
Token said
Token ID531
Feature activation-1.05e-02

Pos logit contributions
 let+0.008
pping+0.008
pped+0.008
 it+0.008
 hid+0.008
Neg logit contributions
um-0.015
umm-0.015
ummy-0.014
ama-0.014
err-0.014
Token the
Token ID262
Feature activation+1.17e-01

Pos logit contributions
 let+0.012
 hid+0.010
pping+0.010
 it+0.010
 share+0.010
Neg logit contributions
um-0.025
ummy-0.025
umm-0.024
ama-0.024
err-0.024
Token oct
Token ID19318
Feature activation+1.48e-01

Pos logit contributions
umm+0.236
err+0.224
 surroundings+0.219
ama+0.219
lights+0.216
Neg logit contributions
 let-0.220
 it-0.216
 share-0.183
 put-0.181
 tried-0.179
Tokenopus
Token ID25790
Feature activation+1.27e-02

Pos logit contributions
umm+0.180
 surroundings+0.180
 biggest+0.174
err+0.165
ummy+0.161
Neg logit contributions
 let-0.357
 it-0.349
 fought-0.325
 hid-0.322
 share-0.321
Token.
Token ID13
Feature activation+8.27e-03

Pos logit contributions
um+0.025
err+0.025
ummy+0.025
ini+0.025
umm+0.025
Neg logit contributions
 let-0.019
 it-0.018
 share-0.017
 risking-0.017
 hid-0.016
Token "
Token ID366
Feature activation-1.13e-02

Pos logit contributions
umm+0.015
ummy+0.015
err+0.015
um+0.015
ama+0.015
Neg logit contributions
 let-0.014
 it-0.014
 share-0.013
 hid-0.013
 fought-0.013
TokenI
Token ID40
Feature activation-2.25e-02

Pos logit contributions
 let+0.018
 it+0.017
 hid+0.016
 share+0.016
 fought+0.015
Neg logit contributions
umm-0.023
err-0.023
ummy-0.022
ini-0.022
um-0.022
Token have
Token ID423
Feature activation-1.41e-02

Pos logit contributions
 letting+0.029
 risking+0.028
pped+0.028
 it+0.027
pping+0.026
Neg logit contributions
umpy-0.048
ubby-0.046
err-0.045
ay-0.045
 world-0.044
Token with
Token ID351
Feature activation-1.72e-02

Pos logit contributions
ummy+0.006
um+0.006
ay+0.006
err+0.006
umm+0.006
Neg logit contributions
 let-0.004
 it-0.004
 hid-0.004
 fought-0.004
 risking-0.004
Token me
Token ID502
Feature activation+9.93e-03

Pos logit contributions
 let+0.032
 it+0.029
 hid+0.029
 managed+0.029
 fought+0.029
Neg logit contributions
ummy-0.029
um-0.028
ay-0.028
 surroundings-0.028
ini-0.027
Token today
Token ID1909
Feature activation+2.43e-03

Pos logit contributions
ummy+0.020
um+0.019
umm+0.019
err+0.019
 surroundings+0.019
Neg logit contributions
 let-0.016
 it-0.015
 hid-0.014
 share-0.014
 fought-0.014
Token?"
Token ID1701
Feature activation+7.62e-03

Pos logit contributions
ummy+0.005
umm+0.005
um+0.005
err+0.005
ama+0.005
Neg logit contributions
 let-0.004
 it-0.004
 hid-0.003
 share-0.003
 fought-0.003
Token The
Token ID383
Feature activation+1.26e-01

Pos logit contributions
umm+0.012
ummy+0.012
err+0.012
um+0.012
ama+0.012
Neg logit contributions
 let-0.015
 it-0.014
 hid-0.013
 share-0.013
 fought-0.013
Token gr
Token ID1036
Feature activation+1.48e-01

Pos logit contributions
umm+0.272
um+0.253
err+0.246
lights+0.245
 surroundings+0.245
Neg logit contributions
 let-0.236
 it-0.218
 make-0.197
 put-0.196
 tried-0.193
Tokenumpy
Token ID32152
Feature activation+1.27e-01

Pos logit contributions
 biggest+0.135
umm+0.121
 surroundings+0.120
 sights+0.095
 best+0.092
Neg logit contributions
 it-0.386
 let-0.383
 fought-0.381
 hid-0.370
pping-0.361
Token cat
Token ID3797
Feature activation-5.29e-03

Pos logit contributions
umm+0.246
 surroundings+0.221
err+0.214
um+0.211
lights+0.210
Neg logit contributions
 it-0.243
 let-0.241
 hid-0.216
 fought-0.214
 share-0.210
Token said
Token ID531
Feature activation-1.81e-02

Pos logit contributions
 risking+0.007
 it+0.007
 let+0.006
 share+0.006
pped+0.006
Neg logit contributions
err-0.011
um-0.011
ummy-0.010
ama-0.010
olly-0.010
Token,
Token ID11
Feature activation-3.04e-02

Pos logit contributions
 let+0.032
 fought+0.032
 hid+0.031
 risking+0.031
 argued+0.030
Neg logit contributions
um-0.030
ummy-0.030
ama-0.028
 biggest-0.028
olly-0.027
Token "
Token ID366
Feature activation-9.28e-03

Pos logit contributions
 share+0.057
 risking+0.056
 hid+0.056
 fought+0.056
pped+0.056
Neg logit contributions
ummy-0.041
um-0.041
 sky-0.041
err-0.041
 sun-0.040
Token you
Token ID345
Feature activation-3.96e-03

Pos logit contributions
 let+0.038
 managed+0.037
 hid+0.037
 wouldn+0.034
pped+0.034
Neg logit contributions
ummy-0.066
umm-0.061
 surroundings-0.061
um-0.060
ama-0.060
Token and
Token ID290
Feature activation-3.48e-02

Pos logit contributions
 it+0.006
 risking+0.006
 let+0.006
 hid+0.006
pped+0.006
Neg logit contributions
ummy-0.007
umpy-0.007
um-0.007
 biggest-0.007
ama-0.007
Token we
Token ID356
Feature activation-1.94e-02

Pos logit contributions
pped+0.058
pping+0.056
 risking+0.049
assed+0.049
 let+0.047
Neg logit contributions
ummy-0.065
 biggest-0.063
 sun-0.060
umm-0.060
 surroundings-0.060
Token love
Token ID1842
Feature activation-3.19e-02

Pos logit contributions
 it+0.030
 risking+0.028
pped+0.028
 whoever+0.028
 letting+0.027
Neg logit contributions
umpy-0.035
ay-0.034
um-0.033
 biggest-0.033
 best-0.033
Token the
Token ID262
Feature activation+1.15e-01

Pos logit contributions
 let+0.038
 managed+0.036
 hid+0.036
 fought+0.033
 snatched+0.033
Neg logit contributions
ummy-0.077
 surroundings-0.072
umm-0.072
um-0.072
ama-0.070
Token oct
Token ID19318
Feature activation+1.48e-01

Pos logit contributions
umm+0.202
ama+0.202
um+0.200
err+0.196
umpy+0.192
Neg logit contributions
 it-0.223
 let-0.220
 share-0.194
 hid-0.186
 don-0.184
Tokenopus
Token ID25790
Feature activation+2.34e-02

Pos logit contributions
 surroundings+0.191
err+0.189
 biggest+0.187
umm+0.185
um+0.179
Neg logit contributions
 it-0.316
 let-0.315
 fought-0.296
 share-0.289
pping-0.287
Token."
Token ID526
Feature activation+1.43e-02

Pos logit contributions
ummy+0.050
umm+0.050
ini+0.049
umpy+0.049
err+0.049
Neg logit contributions
 let-0.033
 it-0.031
 hid-0.029
 share-0.028
 fought-0.028
Token
Token ID198
Feature activation+3.60e-02

Pos logit contributions
umm+0.033
ini+0.031
err+0.031
ubby+0.030
 surroundings+0.030
Neg logit contributions
 it-0.023
 let-0.023
 share-0.019
 hid-0.018
 whoever-0.017
Token
Token ID198
Feature activation+2.87e-02

Pos logit contributions
umm+0.080
err+0.072
ini+0.071
 surroundings+0.070
ummy+0.070
Neg logit contributions
 it-0.061
 let-0.060
 share-0.052
 hid-0.049
 never-0.048
TokenMom
Token ID29252
Feature activation-3.12e-03

Pos logit contributions
umm+0.047
ummy+0.045
err+0.045
ini+0.044
um+0.044
Neg logit contributions
 let-0.059
 it-0.057
 share-0.053
 hid-0.052
 fought-0.051
Token shouted
Token ID17293
Feature activation-1.80e-02

Pos logit contributions
 risking+0.002
 it+0.002
 share+0.002
 let+0.002
pping+0.002
Neg logit contributions
um-0.003
err-0.003
ummy-0.003
umpy-0.003
ama-0.003
Token,
Token ID11
Feature activation-3.10e-02

Pos logit contributions
 let+0.026
 it+0.024
 share+0.023
 hid+0.022
 fought+0.022
Neg logit contributions
ummy-0.038
um-0.038
umm-0.038
err-0.038
ama-0.037
Token pulling
Token ID10427
Feature activation-2.77e-02

Pos logit contributions
 risking+0.042
 share+0.042
 it+0.042
 hid+0.041
 let+0.041
Neg logit contributions
err-0.055
um-0.054
 sun-0.053
 biggest-0.052
ummy-0.052
Token on
Token ID319
Feature activation-1.78e-02

Pos logit contributions
 let+0.025
 it+0.021
 hid+0.020
 share+0.020
 fought+0.020
Neg logit contributions
um-0.074
err-0.073
ummy-0.073
umm-0.073
ama-0.072
Token the
Token ID262
Feature activation+1.09e-01

Pos logit contributions
 let+0.017
 fought+0.015
 share+0.015
 hid+0.015
 risking+0.014
Neg logit contributions
ummy-0.044
err-0.044
um-0.044
 surroundings-0.043
umm-0.043
Token oct
Token ID19318
Feature activation+1.48e-01

Pos logit contributions
umm+0.189
ama+0.179
um+0.178
 surroundings+0.174
err+0.173
Neg logit contributions
 it-0.232
 let-0.229
 hid-0.202
 share-0.196
 put-0.193
Tokenopus
Token ID25790
Feature activation+1.64e-02

Pos logit contributions
umm+0.183
 biggest+0.178
 surroundings+0.177
err+0.172
ubby+0.163
Neg logit contributions
 it-0.342
 let-0.340
 fought-0.313
 hid-0.310
 share-0.304
Token.
Token ID13
Feature activation+6.09e-03

Pos logit contributions
ummy+0.030
err+0.029
um+0.029
ini+0.029
 surroundings+0.029
Neg logit contributions
 let-0.025
 risking-0.024
 it-0.024
 share-0.024
pped-0.023
Token
Token ID198
Feature activation+3.29e-02

Pos logit contributions
umm+0.013
ini+0.013
ummy+0.012
 surroundings+0.012
ubby+0.012
Neg logit contributions
 let-0.010
 it-0.010
 hid-0.009
 share-0.008
 fought-0.008
Token
Token ID198
Feature activation+2.36e-02

Pos logit contributions
umm+0.073
ini+0.066
ummy+0.066
 surroundings+0.066
err+0.065
Neg logit contributions
 let-0.056
 it-0.055
 share-0.047
 hid-0.046
 whoever-0.045
Token"
Token ID1
Feature activation-1.14e-02

Pos logit contributions
umm+0.047
ummy+0.046
err+0.045
ini+0.045
um+0.045
Neg logit contributions
 let-0.040
 it-0.039
 share-0.035
 hid-0.035
 fought-0.034
Token away
Token ID1497
Feature activation-2.36e-02

Pos logit contributions
 let+0.026
 risking+0.024
 whoever+0.024
 fought+0.024
 wouldn+0.024
Neg logit contributions
err-0.036
um-0.036
ay-0.035
umpy-0.035
 surroundings-0.035
Token.
Token ID13
Feature activation+5.22e-03

Pos logit contributions
 let+0.038
pped+0.038
 fought+0.037
 wouldn+0.036
 argued+0.036
Neg logit contributions
err-0.042
ummy-0.041
um-0.040
umpy-0.040
 surroundings-0.040
Token
Token ID198
Feature activation+2.44e-02

Pos logit contributions
umm+0.009
ummy+0.008
err+0.008
um+0.008
ama+0.008
Neg logit contributions
 let-0.010
 it-0.010
 share-0.009
 hid-0.009
 fought-0.009
Token
Token ID198
Feature activation+1.42e-02

Pos logit contributions
umm+0.049
ini+0.046
err+0.045
ummy+0.045
 surroundings+0.045
Neg logit contributions
 let-0.043
 it-0.043
 share-0.038
 hid-0.038
 fought-0.036
TokenThe
Token ID464
Feature activation+1.14e-01

Pos logit contributions
um+0.025
ama+0.024
umpy+0.024
 surroundings+0.023
ummy+0.023
Neg logit contributions
 let-0.026
 it-0.025
 hid-0.024
 fought-0.024
 share-0.024
Token oct
Token ID19318
Feature activation+1.48e-01

Pos logit contributions
umm+0.248
ay+0.234
ama+0.230
err+0.229
anny+0.225
Neg logit contributions
 let-0.196
 it-0.191
 put-0.161
 share-0.155
 hid-0.153
Tokenopus
Token ID25790
Feature activation-4.10e-03

Pos logit contributions
umm+0.183
err+0.171
 biggest+0.171
 surroundings+0.170
um+0.165
Neg logit contributions
 let-0.346
 it-0.346
 fought-0.318
 hid-0.318
 share-0.315
Token is
Token ID318
Feature activation-2.38e-02

Pos logit contributions
 let+0.007
 it+0.007
 share+0.006
 whoever+0.006
 hid+0.006
Neg logit contributions
err-0.007
 surroundings-0.007
um-0.007
umm-0.007
ummy-0.007
Token happy
Token ID3772
Feature activation-1.71e-02

Pos logit contributions
 let+0.045
 it+0.044
 share+0.042
 whoever+0.042
 fought+0.041
Neg logit contributions
 biggest-0.035
err-0.035
ubby-0.035
umpy-0.035
um-0.034
Token.
Token ID13
Feature activation-8.22e-03

Pos logit contributions
pped+0.035
pping+0.034
 argued+0.034
 share+0.034
assed+0.034
Neg logit contributions
ummy-0.024
 world-0.022
 sun-0.020
 sky-0.020
 surroundings-0.020
Token It
Token ID632
Feature activation-2.79e-02

Pos logit contributions
 let+0.012
 it+0.012
 share+0.011
 hid+0.011
 fought+0.011
Neg logit contributions
umm-0.017
ummy-0.017
err-0.017
um-0.017
 surroundings-0.016
Token before
Token ID878
Feature activation-1.54e-02

Pos logit contributions
umm+0.023
ummy+0.023
um+0.022
err+0.022
ini+0.022
Neg logit contributions
 let-0.011
 it-0.010
 hid-0.009
 share-0.009
 fought-0.009
Token.
Token ID13
Feature activation-4.79e-03

Pos logit contributions
 let+0.025
 share+0.024
 fought+0.024
 hid+0.023
 shared+0.022
Neg logit contributions
ummy-0.031
um-0.029
ama-0.029
 surroundings-0.028
 world-0.028
Token
Token ID198
Feature activation+2.77e-02

Pos logit contributions
 let+0.009
 it+0.009
 share+0.007
 hid+0.007
 whoever+0.007
Neg logit contributions
ini-0.009
umm-0.009
err-0.009
 surroundings-0.009
ubby-0.008
Token
Token ID198
Feature activation+1.78e-02

Pos logit contributions
umm+0.067
ini+0.062
err+0.061
 surroundings+0.061
ubby+0.059
Neg logit contributions
 it-0.042
 let-0.041
 whoever-0.032
 share-0.032
 never-0.030
TokenThe
Token ID464
Feature activation+1.16e-01

Pos logit contributions
umm+0.037
err+0.036
ini+0.035
ummy+0.035
 surroundings+0.035
Neg logit contributions
 let-0.029
 it-0.028
 share-0.025
 hid-0.024
 fought-0.023
Token oct
Token ID19318
Feature activation+1.48e-01

Pos logit contributions
umm+0.217
err+0.213
elly+0.202
umpy+0.201
oky+0.199
Neg logit contributions
 let-0.231
 it-0.221
 put-0.203
 picked-0.183
 tried-0.183
Tokenopus
Token ID25790
Feature activation-5.12e-03

Pos logit contributions
umm+0.177
 biggest+0.175
err+0.170
 surroundings+0.169
um+0.163
Neg logit contributions
 let-0.340
 it-0.339
 hid-0.314
 fought-0.313
 risking-0.304
Token had
Token ID550
Feature activation-2.49e-02

Pos logit contributions
 let+0.006
 it+0.006
 share+0.006
 risking+0.005
 hid+0.005
Neg logit contributions
um-0.011
 surroundings-0.011
ini-0.011
err-0.011
ama-0.011
Token lots
Token ID6041
Feature activation+1.99e-02

Pos logit contributions
 share+0.034
 let+0.034
pping+0.033
 fought+0.031
 risking+0.031
Neg logit contributions
 biggest-0.050
um-0.046
 sky-0.046
err-0.046
 world-0.046
Token of
Token ID286
Feature activation+4.50e-02

Pos logit contributions
umm+0.032
err+0.030
ummy+0.030
ini+0.030
um+0.030
Neg logit contributions
 let-0.041
 it-0.040
 hid-0.037
 share-0.037
 fought-0.036
Token arms
Token ID5101
Feature activation-1.04e-02

Pos logit contributions
umm+0.082
ama+0.074
err+0.074
ini+0.073
ummy+0.073
Neg logit contributions
 it-0.094
 let-0.090
 hid-0.085
 share-0.082
 tried-0.080
Token water
Token ID1660
Feature activation+7.00e-03

Pos logit contributions
umm+0.169
err+0.164
ama+0.161
ini+0.161
 surroundings+0.157
Neg logit contributions
 let-0.187
 it-0.185
 hid-0.166
 share-0.166
 put-0.155
Token.
Token ID13
Feature activation+8.58e-03

Pos logit contributions
ummy+0.012
 world+0.012
 sky+0.012
umm+0.012
um+0.012
Neg logit contributions
 let-0.011
 share-0.011
 hid-0.011
 fought-0.011
 shared-0.011
Token
Token ID198
Feature activation+3.00e-02

Pos logit contributions
umm+0.017
ini+0.017
ummy+0.016
err+0.016
 surroundings+0.016
Neg logit contributions
 it-0.015
 let-0.015
 share-0.013
 hid-0.013
 fought-0.012
Token
Token ID198
Feature activation+1.78e-02

Pos logit contributions
umm+0.071
ini+0.065
err+0.063
 surroundings+0.062
ummy+0.062
Neg logit contributions
 it-0.049
 let-0.047
 share-0.039
 never-0.039
 hid-0.039
TokenThe
Token ID464
Feature activation+1.06e-01

Pos logit contributions
umm+0.036
ummy+0.034
err+0.034
ini+0.034
um+0.034
Neg logit contributions
 let-0.030
 it-0.029
 share-0.026
 hid-0.026
 fought-0.026
Token oct
Token ID19318
Feature activation+1.47e-01

Pos logit contributions
umm+0.226
ay+0.216
ama+0.215
err+0.211
ummy+0.206
Neg logit contributions
 let-0.189
 it-0.184
 put-0.154
 hid-0.151
 share-0.150
Tokenopus
Token ID25790
Feature activation+1.93e-03

Pos logit contributions
umm+0.188
err+0.179
 biggest+0.177
 surroundings+0.176
um+0.169
Neg logit contributions
 let-0.339
 it-0.338
 hid-0.309
 fought-0.309
 share-0.306
Token is
Token ID318
Feature activation-2.05e-02

Pos logit contributions
err+0.004
umm+0.003
 surroundings+0.003
um+0.003
ummy+0.003
Neg logit contributions
 let-0.003
 it-0.003
 share-0.003
 hid-0.003
 fought-0.003
Token hungry
Token ID14720
Feature activation-3.89e-03

Pos logit contributions
 let+0.040
 it+0.038
 share+0.037
 hid+0.035
 whoever+0.035
Neg logit contributions
err-0.031
ubby-0.031
um-0.030
umpy-0.030
ummy-0.030
Token and
Token ID290
Feature activation-3.87e-02

Pos logit contributions
 fought+0.007
 let+0.007
 share+0.007
 risking+0.006
 hid+0.006
Neg logit contributions
ummy-0.007
um-0.006
umm-0.006
 world-0.006
umpy-0.006
Token curious
Token ID11040
Feature activation-1.87e-02

Pos logit contributions
 let+0.068
pping+0.068
pped+0.067
 it+0.063
 risking+0.063
Neg logit contributions
 biggest-0.058
err-0.058
 surroundings-0.058
ubby-0.057
umpy-0.057
Token band
Token ID4097
Feature activation+2.63e-02

Pos logit contributions
umm+0.149
ummy+0.142
err+0.139
um+0.138
ini+0.136
Neg logit contributions
 let-0.085
 it-0.080
 share-0.070
 hid-0.067
 put-0.065
Token-
Token ID12
Feature activation+3.62e-02

Pos logit contributions
ini+0.034
umm+0.033
err+0.033
um+0.033
ummy+0.032
Neg logit contributions
 let-0.060
 it-0.059
 hid-0.055
 share-0.055
 fought-0.054
Tokenaid
Token ID1698
Feature activation+1.53e-02

Pos logit contributions
umm+0.077
ummy+0.076
err+0.074
um+0.073
ama+0.073
Neg logit contributions
 let-0.057
 it-0.055
 share-0.050
 hid-0.049
 fought-0.048
Token on
Token ID319
Feature activation-5.07e-02

Pos logit contributions
umm+0.035
ummy+0.034
um+0.034
err+0.034
ini+0.034
Neg logit contributions
 let-0.020
 it-0.019
 hid-0.017
 share-0.017
 fought-0.017
Token it
Token ID340
Feature activation-1.95e-02

Pos logit contributions
 hid+0.036
 let+0.034
 fought+0.032
 managed+0.030
 argued+0.028
Neg logit contributions
ummy-0.142
ay-0.141
um-0.140
ama-0.138
 biggest-0.135
Token and
Token ID290
Feature activation-7.94e-02

Pos logit contributions
 let+0.027
 it+0.026
 share+0.026
 risking+0.026
 fought+0.026
Neg logit contributions
ummy-0.039
 biggest-0.038
um-0.038
 surroundings-0.038
umpy-0.038
Token it
Token ID340
Feature activation-4.45e-02

Pos logit contributions
pping+0.073
 fought+0.070
 risking+0.069
assed+0.068
 argued+0.067
Neg logit contributions
 biggest-0.183
ama-0.178
 sun-0.177
 surroundings-0.175
um-0.173
Token will
Token ID481
Feature activation-6.84e-02

Pos logit contributions
 it+0.050
 let+0.049
 share+0.046
 fought+0.046
 risking+0.046
Neg logit contributions
 biggest-0.101
err-0.099
umm-0.097
 surroundings-0.097
ini-0.097
Token heal
Token ID12035
Feature activation-3.67e-02

Pos logit contributions
 fought+0.070
 hid+0.070
 it+0.069
 let+0.066
 argued+0.066
Neg logit contributions
 biggest-0.156
um-0.155
umm-0.154
ummy-0.153
ini-0.153
Token soon
Token ID2582
Feature activation-3.23e-02

Pos logit contributions
 let+0.042
 hid+0.040
 fought+0.037
 it+0.037
 snatched+0.036
Neg logit contributions
 surroundings-0.084
ummy-0.082
umm-0.082
ini-0.082
um-0.082
Token."
Token ID526
Feature activation+1.36e-02

Pos logit contributions
 fought+0.052
 hid+0.051
 risking+0.051
 argued+0.050
pped+0.050
Neg logit contributions
ummy-0.055
umm-0.053
um-0.052
 world-0.052
umpy-0.052
Token.
Token ID13
Feature activation-5.46e-03

Pos logit contributions
 let+0.015
 risking+0.014
 hid+0.014
 fought+0.014
 it+0.014
Neg logit contributions
ummy-0.020
umm-0.020
um-0.019
ay-0.019
ama-0.018
Token Daddy
Token ID26926
Feature activation-1.32e-02

Pos logit contributions
 let+0.010
 it+0.010
 share+0.009
 hid+0.009
 fought+0.008
Neg logit contributions
umm-0.010
ini-0.010
err-0.010
 surroundings-0.010
um-0.010
Token asked
Token ID1965
Feature activation-3.45e-02

Pos logit contributions
 risking+0.018
 share+0.016
 it+0.016
 losing+0.015
 let+0.015
Neg logit contributions
um-0.028
ama-0.028
ummy-0.028
olly-0.027
ini-0.027
Token if
Token ID611
Feature activation-3.45e-02

Pos logit contributions
 fought+0.052
 let+0.051
 risking+0.051
 hid+0.048
 share+0.047
Neg logit contributions
ummy-0.071
um-0.068
ama-0.066
umm-0.065
err-0.064
Token they
Token ID484
Feature activation-5.08e-02

Pos logit contributions
pping+0.046
 fought+0.045
 let+0.045
pped+0.043
 share+0.043
Neg logit contributions
ummy-0.074
ama-0.072
umm-0.070
um-0.070
 biggest-0.069
Token should
Token ID815
Feature activation-7.89e-02

Pos logit contributions
 risking+0.066
assed+0.062
ocated+0.059
 losing+0.059
pping+0.058
Neg logit contributions
olly-0.103
um-0.100
umpy-0.097
ummy-0.097
ama-0.093
Token save
Token ID3613
Feature activation-2.99e-02

Pos logit contributions
 risking+0.097
assed+0.093
 hid+0.086
 fought+0.086
pped+0.083
Neg logit contributions
um-0.179
ummy-0.168
ama-0.168
 sun-0.154
ay-0.154
Token some
Token ID617
Feature activation+3.29e-02

Pos logit contributions
 let+0.025
 it+0.021
 hid+0.020
 fought+0.019
 share+0.019
Neg logit contributions
ummy-0.083
umm-0.083
um-0.082
ama-0.081
 surroundings-0.081
Token of
Token ID286
Feature activation-2.20e-02

Pos logit contributions
umm+0.056
ummy+0.055
um+0.054
err+0.053
 surroundings+0.053
Neg logit contributions
 let-0.064
 it-0.061
 hid-0.058
 share-0.057
 fought-0.057
Token the
Token ID262
Feature activation+1.14e-01

Pos logit contributions
 let+0.023
 hid+0.022
 fought+0.021
 snatched+0.019
 managed+0.019
Neg logit contributions
ummy-0.056
ama-0.054
um-0.053
umm-0.053
ubby-0.052
Token whip
Token ID22972
Feature activation-3.24e-03

Pos logit contributions
umm+0.217
ummy+0.211
err+0.211
um+0.209
ini+0.207
Neg logit contributions
 let-0.199
 it-0.194
 share-0.177
 hid-0.174
 fought-0.171
Token smiled
Token ID13541
Feature activation-7.32e-03

Pos logit contributions
 let+0.000
 it+0.000
 share+0.000
 risking+0.000
 hid+0.000
Neg logit contributions
um-0.001
ama-0.001
ummy-0.001
ini-0.001
err-0.001
Token and
Token ID290
Feature activation-4.88e-02

Pos logit contributions
 let+0.010
 it+0.010
 share+0.009
 hid+0.009
 fought+0.009
Neg logit contributions
umm-0.016
ummy-0.016
err-0.016
ini-0.016
um-0.016
Token asked
Token ID1965
Feature activation-2.83e-02

Pos logit contributions
pping+0.061
 risking+0.061
 share+0.059
pped+0.057
 fought+0.056
Neg logit contributions
um-0.099
ama-0.098
err-0.095
ummy-0.094
umm-0.094
Token if
Token ID611
Feature activation-2.30e-02

Pos logit contributions
 let+0.033
 fought+0.030
 risking+0.030
 share+0.030
 hid+0.029
Neg logit contributions
ummy-0.065
um-0.065
ama-0.064
umm-0.062
err-0.062
Token she
Token ID673
Feature activation-5.32e-02

Pos logit contributions
 let+0.030
 share+0.028
 fought+0.028
pping+0.028
 hid+0.027
Neg logit contributions
ummy-0.049
ama-0.048
um-0.048
umm-0.048
 surroundings-0.047
Token would
Token ID561
Feature activation-7.86e-02

Pos logit contributions
 risking+0.053
 it+0.051
pping+0.049
 share+0.047
 let+0.047
Neg logit contributions
um-0.124
ama-0.123
ini-0.122
err-0.121
ummy-0.120
Token like
Token ID588
Feature activation-3.72e-02

Pos logit contributions
 risking+0.087
 fought+0.086
 hid+0.083
ts+0.081
 it+0.078
Neg logit contributions
um-0.180
 world-0.174
ummy-0.173
ay-0.172
ama-0.171
Token to
Token ID284
Feature activation-5.56e-02

Pos logit contributions
 fought+0.032
 let+0.032
 hid+0.031
 managed+0.029
pping+0.029
Neg logit contributions
ummy-0.092
 world-0.091
 biggest-0.090
 sun-0.090
ay-0.088
Token say
Token ID910
Feature activation-1.51e-02

Pos logit contributions
 risking+0.066
 hid+0.065
 it+0.064
 fought+0.063
 wouldn+0.062
Neg logit contributions
um-0.120
ama-0.114
ay-0.114
ummy-0.114
 biggest-0.112
Token hello
Token ID23748
Feature activation-9.36e-03

Pos logit contributions
 let+0.025
 it+0.023
 hid+0.023
 fought+0.023
 share+0.022
Neg logit contributions
um-0.028
 biggest-0.028
ummy-0.028
umm-0.028
err-0.027
Token.
Token ID13
Feature activation+1.19e-02

Pos logit contributions
 let+0.012
 it+0.011
 fought+0.011
 share+0.011
 hid+0.011
Neg logit contributions
ummy-0.021
umm-0.021
um-0.021
err-0.020
 surroundings-0.020
Token mole
Token ID9411
Feature activation+2.35e-03

Pos logit contributions
anny+0.233
ay+0.229
umm+0.227
lights+0.227
andel+0.217
Neg logit contributions
 let-0.242
 it-0.233
 tried-0.212
 share-0.208
 hid-0.207
Token.
Token ID13
Feature activation+5.59e-03

Pos logit contributions
um+0.005
ummy+0.005
 surroundings+0.005
ama+0.005
err+0.005
Neg logit contributions
 let-0.003
 it-0.003
 fought-0.003
 hid-0.003
 share-0.003
Token
Token ID198
Feature activation+3.87e-02

Pos logit contributions
umm+0.011
ummy+0.011
err+0.011
um+0.011
ini+0.011
Neg logit contributions
 let-0.009
 it-0.009
 share-0.008
 hid-0.008
 fought-0.008
Token
Token ID198
Feature activation+3.13e-02

Pos logit contributions
umm+0.092
 surroundings+0.082
err+0.081
 biggest+0.081
ini+0.081
Neg logit contributions
 it-0.061
 let-0.060
 share-0.048
 whoever-0.048
 never-0.045
Token"
Token ID1
Feature activation-1.35e-02

Pos logit contributions
umm+0.067
ummy+0.065
err+0.064
ini+0.063
 surroundings+0.063
Neg logit contributions
 let-0.049
 it-0.047
 share-0.042
 hid-0.041
 fought-0.041
TokenTo
Token ID2514
Feature activation-7.83e-02

Pos logit contributions
 let+0.021
 it+0.020
 fought+0.017
 share+0.017
 never+0.017
Neg logit contributions
umm-0.030
ummy-0.030
 surroundings-0.029
um-0.029
ama-0.029
Token get
Token ID651
Feature activation-2.51e-02

Pos logit contributions
 fought+0.081
 hid+0.078
 argued+0.077
 let+0.077
 risking+0.077
Neg logit contributions
um-0.178
ummy-0.177
ama-0.176
ubby-0.173
ay-0.171
Token to
Token ID284
Feature activation-5.10e-02

Pos logit contributions
 let+0.031
 hid+0.027
 it+0.026
 share+0.026
 fought+0.026
Neg logit contributions
um-0.056
ummy-0.056
 surroundings-0.055
 biggest-0.055
umm-0.054
Token the
Token ID262
Feature activation+9.89e-02

Pos logit contributions
 wouldn+0.053
 let+0.051
 fought+0.051
 hid+0.050
 managed+0.049
Neg logit contributions
ummy-0.115
um-0.113
 biggest-0.109
 world-0.109
err-0.108
Token other
Token ID584
Feature activation+8.67e-02

Pos logit contributions
ummy+0.145
umm+0.144
ini+0.144
um+0.143
err+0.142
Neg logit contributions
 let-0.218
 it-0.211
 share-0.197
 hid-0.192
 fought-0.189
Token side
Token ID1735
Feature activation+4.95e-03

Pos logit contributions
err+0.124
ini+0.123
umm+0.123
ummy+0.122
um+0.121
Neg logit contributions
 let-0.197
 it-0.187
 share-0.175
 hid-0.174
 fought-0.168
Token.
Token ID13
Feature activation+1.43e-03

Pos logit contributions
 let+0.003
 it+0.003
 hid+0.003
 share+0.003
 risking+0.003
Neg logit contributions
umm-0.004
ummy-0.004
 surroundings-0.004
um-0.004
err-0.004
Token They
Token ID1119
Feature activation-5.14e-02

Pos logit contributions
umm+0.003
ini+0.003
err+0.003
ummy+0.003
um+0.003
Neg logit contributions
 let-0.002
 it-0.002
 share-0.002
 hid-0.002
 fought-0.002
Token hoped
Token ID10719
Feature activation-3.45e-02

Pos logit contributions
 risking+0.064
 it+0.064
 whoever+0.055
 losing+0.055
pping+0.055
Neg logit contributions
um-0.109
ama-0.105
ini-0.104
err-0.103
olly-0.103
Token the
Token ID262
Feature activation+1.09e-01

Pos logit contributions
 let+0.035
 fought+0.033
 hid+0.033
pping+0.032
 snatched+0.032
Neg logit contributions
ummy-0.083
um-0.081
ama-0.081
 surroundings-0.080
umm-0.079
Token sun
Token ID4252
Feature activation-7.90e-03

Pos logit contributions
umm+0.165
ummy+0.158
err+0.154
ini+0.152
ama+0.151
Neg logit contributions
 let-0.243
 it-0.238
 share-0.217
 hid-0.216
 fought-0.211
Token would
Token ID561
Feature activation-7.79e-02

Pos logit contributions
 let+0.007
 it+0.007
 hid+0.006
 share+0.006
 risking+0.006
Neg logit contributions
umm-0.020
ini-0.020
ama-0.020
um-0.020
err-0.020
Token dry
Token ID5894
Feature activation-2.36e-02

Pos logit contributions
 hid+0.091
 risking+0.090
 fought+0.089
 argued+0.087
pping+0.085
Neg logit contributions
um-0.168
 biggest-0.167
 sun-0.167
 world-0.164
ummy-0.163
Token their
Token ID511
Feature activation+6.51e-02

Pos logit contributions
 let+0.036
 hid+0.032
 it+0.032
 share+0.032
 fought+0.032
Neg logit contributions
umm-0.048
ummy-0.048
um-0.047
 surroundings-0.046
ini-0.046
Token stones
Token ID14966
Feature activation-2.58e-02

Pos logit contributions
umm+0.106
ummy+0.103
ama+0.102
umpy+0.102
um+0.102
Neg logit contributions
 let-0.134
 it-0.131
 share-0.123
 hid-0.121
 fought-0.119
Token soon
Token ID2582
Feature activation-4.14e-02

Pos logit contributions
 let+0.038
 risking+0.036
 hid+0.036
 it+0.036
 fought+0.035
Neg logit contributions
 biggest-0.048
 world-0.048
ummy-0.047
ini-0.047
um-0.047
Token.
Token ID13
Feature activation+6.13e-03

Pos logit contributions
 risking+0.072
 whoever+0.069
assed+0.069
 hid+0.068
pped+0.068
Neg logit contributions
 world-0.062
 sky-0.061
ummy-0.061
 sun-0.060
um-0.060
Token,
Token ID11
Feature activation-2.40e-02

Pos logit contributions
 let+0.045
 fought+0.040
 hid+0.040
 share+0.038
pped+0.037
Neg logit contributions
ummy-0.084
 world-0.084
 surroundings-0.082
umm-0.080
err-0.080
Token she
Token ID673
Feature activation-4.69e-02

Pos logit contributions
 let+0.029
 hid+0.026
 fought+0.026
 it+0.026
 share+0.026
Neg logit contributions
umm-0.055
ummy-0.054
um-0.053
err-0.053
 surroundings-0.052
Token asked
Token ID1965
Feature activation-3.91e-02

Pos logit contributions
 risking+0.061
 it+0.056
pping+0.053
 let+0.053
 share+0.053
Neg logit contributions
um-0.100
ama-0.094
err-0.094
ini-0.092
ummy-0.092
Token if
Token ID611
Feature activation-4.41e-02

Pos logit contributions
 let+0.045
 fought+0.041
 hid+0.039
 risking+0.039
 it+0.038
Neg logit contributions
ummy-0.093
um-0.093
umm-0.090
err-0.090
ama-0.088
Token she
Token ID673
Feature activation-4.82e-02

Pos logit contributions
pping+0.061
 let+0.060
 fought+0.058
 hid+0.057
pped+0.057
Neg logit contributions
ummy-0.087
um-0.085
 biggest-0.085
ama-0.085
umm-0.083
Token would
Token ID561
Feature activation-7.77e-02

Pos logit contributions
 risking+0.054
pping+0.050
assed+0.047
 letting+0.047
 it+0.046
Neg logit contributions
um-0.108
olly-0.105
ummy-0.105
umpy-0.105
ini-0.105
Token like
Token ID588
Feature activation-3.79e-02

Pos logit contributions
 risking+0.093
 fought+0.086
ts+0.085
 hid+0.084
assed+0.084
Neg logit contributions
um-0.169
 world-0.167
ummy-0.162
ay-0.159
 biggest-0.158
Token a
Token ID257
Feature activation+6.08e-02

Pos logit contributions
 fought+0.041
 let+0.040
 hid+0.039
ocated+0.038
pping+0.038
Neg logit contributions
ummy-0.087
 biggest-0.085
 world-0.085
 sun-0.084
 sky-0.082
Token special
Token ID2041
Feature activation+7.48e-02

Pos logit contributions
umm+0.098
err+0.094
ummy+0.093
um+0.093
ama+0.092
Neg logit contributions
 let-0.129
 it-0.127
 share-0.116
 hid-0.113
 fought-0.108
Token kiss
Token ID9245
Feature activation+2.37e-03

Pos logit contributions
umm+0.122
err+0.117
umpy+0.116
ama+0.115
ini+0.115
Neg logit contributions
 let-0.154
 it-0.153
 share-0.139
 hid-0.136
 fought-0.130
Token.
Token ID13
Feature activation+6.74e-03

Pos logit contributions
ummy+0.005
um+0.005
ay+0.005
umm+0.005
ama+0.005
Neg logit contributions
 let-0.003
 hid-0.003
 fought-0.003
 risking-0.003
 it-0.003
Token man
Token ID582
Feature activation-1.60e-02

Pos logit contributions
umm+0.179
ini+0.164
err+0.164
ummy+0.163
umpy+0.158
Neg logit contributions
 let-0.258
 it-0.252
 put-0.230
 share-0.224
 tried-0.219
Token asked
Token ID1965
Feature activation-2.90e-02

Pos logit contributions
 it+0.021
 let+0.021
 risking+0.020
 share+0.020
 hid+0.019
Neg logit contributions
um-0.033
ama-0.032
ummy-0.032
ini-0.031
err-0.031
Token Sarah
Token ID10490
Feature activation-2.35e-02

Pos logit contributions
 let+0.037
 hid+0.034
 fought+0.033
 it+0.033
 risking+0.033
Neg logit contributions
ummy-0.066
um-0.066
umm-0.064
ama-0.064
ay-0.063
Token if
Token ID611
Feature activation-3.49e-02

Pos logit contributions
 fought+0.031
 let+0.029
 argued+0.028
 hid+0.028
 risking+0.027
Neg logit contributions
ummy-0.051
ama-0.051
um-0.051
olly-0.047
 surroundings-0.047
Token she
Token ID673
Feature activation-4.82e-02

Pos logit contributions
pping+0.049
 let+0.047
 fought+0.046
 hid+0.045
pped+0.045
Neg logit contributions
ummy-0.071
um-0.069
ama-0.069
 biggest-0.067
olly-0.065
Token would
Token ID561
Feature activation-7.76e-02

Pos logit contributions
 risking+0.048
 it+0.045
pping+0.044
assed+0.043
 letting+0.043
Neg logit contributions
um-0.112
ini-0.111
ama-0.110
err-0.110
umpy-0.109
Token like
Token ID588
Feature activation-3.49e-02

Pos logit contributions
 risking+0.100
 fought+0.094
 hid+0.090
assed+0.090
ts+0.090
Neg logit contributions
um-0.162
 world-0.157
ummy-0.155
ay-0.155
ama-0.150
Token to
Token ID284
Feature activation-5.59e-02

Pos logit contributions
 let+0.033
 fought+0.032
 hid+0.032
 managed+0.030
pping+0.029
Neg logit contributions
ummy-0.087
 world-0.083
 biggest-0.083
 sun-0.082
umm-0.082
Token help
Token ID1037
Feature activation-3.14e-02

Pos logit contributions
 risking+0.069
 hid+0.068
 fought+0.064
assed+0.063
 wouldn+0.063
Neg logit contributions
um-0.119
ummy-0.115
ini-0.112
ay-0.112
ama-0.111
Token him
Token ID683
Feature activation-1.97e-02

Pos logit contributions
 let+0.042
 hid+0.040
 fought+0.039
 managed+0.038
 risking+0.037
Neg logit contributions
ummy-0.070
um-0.067
umm-0.066
ama-0.065
 surroundings-0.065
Token,
Token ID11
Feature activation-3.45e-02

Pos logit contributions
 let+0.028
 fought+0.027
 hid+0.027
 it+0.026
 risking+0.026
Neg logit contributions
ummy-0.040
 surroundings-0.039
um-0.039
 biggest-0.039
ini-0.038
Token clouds
Token ID15114
Feature activation-3.25e-02

Pos logit contributions
err+0.182
umm+0.174
olly+0.171
elly+0.170
umpy+0.169
Neg logit contributions
 let-0.226
 it-0.225
 fought-0.199
 share-0.199
 tried-0.196
Token and
Token ID290
Feature activation-4.11e-02

Pos logit contributions
 risking+0.052
 hid+0.051
ocated+0.051
 let+0.051
 fought+0.051
Neg logit contributions
ummy-0.056
um-0.054
 sky-0.053
 sun-0.052
 biggest-0.051
Token asked
Token ID1965
Feature activation-3.66e-02

Pos logit contributions
pping+0.043
 risking+0.041
pped+0.039
 share+0.039
 let+0.038
Neg logit contributions
 sun-0.091
 sky-0.090
um-0.090
 biggest-0.089
err-0.089
Token if
Token ID611
Feature activation-4.00e-02

Pos logit contributions
 let+0.040
 fought+0.039
 risking+0.039
 snatched+0.039
 hid+0.039
Neg logit contributions
ummy-0.083
um-0.083
 surroundings-0.081
 biggest-0.080
ama-0.080
Token they
Token ID484
Feature activation-4.59e-02

Pos logit contributions
pping+0.042
pped+0.038
 hid+0.038
assed+0.037
 let+0.036
Neg logit contributions
 biggest-0.093
ama-0.092
ummy-0.090
um-0.090
 surroundings-0.089
Token would
Token ID561
Feature activation-7.76e-02

Pos logit contributions
ocated+0.054
 risking+0.053
assed+0.052
pping+0.049
 losing+0.048
Neg logit contributions
olly-0.096
umpy-0.094
um-0.094
 biggest-0.092
 sun-0.091
Token give
Token ID1577
Feature activation-2.71e-02

Pos logit contributions
 risking+0.097
 hid+0.093
 fought+0.092
assed+0.088
pped+0.088
Neg logit contributions
um-0.167
 world-0.164
ummy-0.162
 sun-0.161
 biggest-0.160
Token it
Token ID340
Feature activation-1.31e-02

Pos logit contributions
 let+0.018
 hid+0.016
 fought+0.015
 share+0.013
 it+0.013
Neg logit contributions
um-0.077
 surroundings-0.077
ummy-0.077
umm-0.076
ama-0.076
Token some
Token ID617
Feature activation+2.49e-02

Pos logit contributions
 let+0.019
 fought+0.018
 hid+0.018
 risking+0.017
 it+0.017
Neg logit contributions
 biggest-0.027
um-0.026
 surroundings-0.026
ummy-0.026
err-0.026
Token water
Token ID1660
Feature activation-7.31e-03

Pos logit contributions
umm+0.044
ummy+0.043
um+0.042
err+0.042
 surroundings+0.042
Neg logit contributions
 let-0.046
 it-0.045
 hid-0.042
 share-0.042
 fought-0.042
Token.
Token ID13
Feature activation-2.21e-03

Pos logit contributions
 let+0.011
 risking+0.010
 hid+0.010
 fought+0.010
 it+0.010
Neg logit contributions
ummy-0.014
um-0.014
umm-0.013
 biggest-0.013
ama-0.013
Token He
Token ID679
Feature activation-3.57e-02

Pos logit contributions
 let+0.008
 it+0.008
 hid+0.008
 share+0.008
 fought+0.007
Neg logit contributions
umm-0.009
ummy-0.008
err-0.008
ini-0.008
um-0.008
Token asked
Token ID1965
Feature activation-2.98e-02

Pos logit contributions
 it+0.042
 let+0.041
 risking+0.039
 share+0.039
pping+0.036
Neg logit contributions
um-0.080
err-0.079
ama-0.079
ummy-0.078
ini-0.077
Token her
Token ID607
Feature activation+8.98e-03

Pos logit contributions
 let+0.030
 it+0.027
 fought+0.027
 hid+0.026
 share+0.025
Neg logit contributions
ummy-0.073
um-0.073
err-0.072
umm-0.071
ama-0.071
Token if
Token ID611
Feature activation-3.03e-02

Pos logit contributions
ummy+0.024
um+0.024
ama+0.024
err+0.023
 surroundings+0.023
Neg logit contributions
 let-0.008
 it-0.007
 fought-0.007
 hid-0.006
 share-0.006
Token she
Token ID673
Feature activation-4.69e-02

Pos logit contributions
pping+0.044
 let+0.042
 fought+0.041
pped+0.041
 share+0.040
Neg logit contributions
ama-0.058
ummy-0.057
 biggest-0.056
 surroundings-0.055
um-0.055
Token would
Token ID561
Feature activation-7.76e-02

Pos logit contributions
 risking+0.044
 it+0.044
pping+0.040
 let+0.040
 share+0.038
Neg logit contributions
um-0.111
ama-0.110
err-0.110
umpy-0.110
ummy-0.109
Token like
Token ID588
Feature activation-3.46e-02

Pos logit contributions
 risking+0.094
 fought+0.088
ts+0.086
 hid+0.084
pped+0.084
Neg logit contributions
um-0.167
 world-0.166
ay-0.163
ummy-0.162
ama-0.159
Token a
Token ID257
Feature activation+5.85e-02

Pos logit contributions
 let+0.038
 fought+0.036
 hid+0.036
 managed+0.035
pped+0.034
Neg logit contributions
ummy-0.079
 biggest-0.078
 world-0.077
um-0.075
umm-0.075
Token tour
Token ID4205
Feature activation+2.35e-02

Pos logit contributions
umm+0.094
ini+0.090
ummy+0.089
um+0.088
err+0.088
Neg logit contributions
 let-0.131
 it-0.128
 share-0.115
 hid-0.112
 put-0.109
Token of
Token ID286
Feature activation-2.62e-03

Pos logit contributions
umm+0.050
ummy+0.049
um+0.048
err+0.047
ay+0.047
Neg logit contributions
 let-0.035
 it-0.034
 hid-0.031
 share-0.031
 fought-0.030
Token the
Token ID262
Feature activation+1.06e-01

Pos logit contributions
 let+0.003
 hid+0.003
 fought+0.003
 it+0.003
 managed+0.003
Neg logit contributions
umm-0.006
ummy-0.006
 world-0.006
ay-0.006
 biggest-0.005
Token go
Token ID467
Feature activation-1.69e-02

Pos logit contributions
 fought+0.065
 hid+0.063
 it+0.063
 risking+0.062
 snatched+0.060
Neg logit contributions
um-0.115
ama-0.108
ummy-0.108
umm-0.108
ay-0.104
Token home
Token ID1363
Feature activation-2.01e-02

Pos logit contributions
 let+0.033
 it+0.031
 fought+0.030
 hid+0.029
 share+0.029
Neg logit contributions
ummy-0.027
 surroundings-0.027
um-0.027
umm-0.027
ini-0.027
Token,
Token ID11
Feature activation-3.83e-02

Pos logit contributions
 let+0.032
 it+0.029
 fought+0.029
 share+0.029
 hid+0.028
Neg logit contributions
ummy-0.039
 surroundings-0.039
um-0.038
ama-0.038
umm-0.037
Token so
Token ID523
Feature activation-2.51e-02

Pos logit contributions
pped+0.061
 fought+0.060
 share+0.058
 risking+0.057
 let+0.056
Neg logit contributions
 world-0.066
 biggest-0.065
 surroundings-0.064
ummy-0.064
 sun-0.062
Token they
Token ID484
Feature activation-4.99e-02

Pos logit contributions
 let+0.033
 share+0.031
pping+0.031
 fought+0.030
 hid+0.030
Neg logit contributions
ama-0.053
ummy-0.052
 surroundings-0.051
umm-0.051
um-0.050
Token carefully
Token ID7773
Feature activation-7.74e-02

Pos logit contributions
 risking+0.069
 losing+0.065
 letting+0.061
pping+0.059
assed+0.057
Neg logit contributions
um-0.097
ama-0.093
olly-0.091
ay-0.090
 sun-0.089
Token watched
Token ID7342
Feature activation-5.00e-02

Pos logit contributions
 risking+0.094
 it+0.087
 losing+0.084
 whoever+0.080
ocated+0.077
Neg logit contributions
um-0.149
 sun-0.142
 sky-0.141
ama-0.141
ini-0.140
Token the
Token ID262
Feature activation+1.02e-01

Pos logit contributions
 fought+0.052
 hid+0.051
 let+0.050
 managed+0.048
 stole+0.047
Neg logit contributions
ummy-0.122
um-0.119
umm-0.115
ama-0.115
ay-0.114
Token changing
Token ID5609
Feature activation+4.87e-02

Pos logit contributions
umm+0.176
ama+0.167
err+0.164
ummy+0.164
ay+0.163
Neg logit contributions
 it-0.217
 let-0.209
 put-0.177
 share-0.175
 hid-0.173
Token tide
Token ID20013
Feature activation-3.14e-02

Pos logit contributions
umm+0.077
ummy+0.074
err+0.074
um+0.073
ini+0.073
Neg logit contributions
 let-0.101
 it-0.099
 share-0.091
 hid-0.091
 fought-0.089
Token until
Token ID1566
Feature activation-4.68e-02

Pos logit contributions
 let+0.045
 fought+0.043
 it+0.043
 hid+0.043
 risking+0.043
Neg logit contributions
 biggest-0.060
 surroundings-0.060
umm-0.060
ummy-0.059
um-0.059
Token go
Token ID467
Feature activation-1.69e-02

Pos logit contributions
 fought+0.065
 hid+0.063
 it+0.063
 risking+0.062
 snatched+0.060
Neg logit contributions
um-0.115
ama-0.108
ummy-0.108
umm-0.108
ay-0.104
Token home
Token ID1363
Feature activation-2.01e-02

Pos logit contributions
 let+0.033
 it+0.031
 fought+0.030
 hid+0.029
 share+0.029
Neg logit contributions
ummy-0.027
 surroundings-0.027
um-0.027
umm-0.027
ini-0.027
Token,
Token ID11
Feature activation-3.83e-02

Pos logit contributions
 let+0.032
 it+0.029
 fought+0.029
 share+0.029
 hid+0.028
Neg logit contributions
ummy-0.039
 surroundings-0.039
um-0.038
ama-0.038
umm-0.037
Token so
Token ID523
Feature activation-2.51e-02

Pos logit contributions
pped+0.061
 fought+0.060
 share+0.058
 risking+0.057
 let+0.056
Neg logit contributions
 world-0.066
 biggest-0.065
 surroundings-0.064
ummy-0.064
 sun-0.062
Token they
Token ID484
Feature activation-4.99e-02

Pos logit contributions
 let+0.033
 share+0.031
pping+0.031
 fought+0.030
 hid+0.030
Neg logit contributions
ama-0.053
ummy-0.052
 surroundings-0.051
umm-0.051
um-0.050
Token carefully
Token ID7773
Feature activation-7.74e-02

Pos logit contributions
 risking+0.069
 losing+0.065
 letting+0.061
pping+0.059
assed+0.057
Neg logit contributions
um-0.097
ama-0.093
olly-0.091
ay-0.090
 sun-0.089
Token watched
Token ID7342
Feature activation-5.00e-02

Pos logit contributions
 risking+0.094
 it+0.087
 losing+0.084
 whoever+0.080
ocated+0.077
Neg logit contributions
um-0.149
 sun-0.142
 sky-0.141
ama-0.141
ini-0.140
Token the
Token ID262
Feature activation+1.02e-01

Pos logit contributions
 fought+0.052
 hid+0.051
 let+0.050
 managed+0.048
 stole+0.047
Neg logit contributions
ummy-0.122
um-0.119
umm-0.115
ama-0.115
ay-0.114
Token changing
Token ID5609
Feature activation+4.87e-02

Pos logit contributions
umm+0.176
ama+0.167
err+0.164
ummy+0.164
ay+0.163
Neg logit contributions
 it-0.217
 let-0.209
 put-0.177
 share-0.175
 hid-0.173
Token tide
Token ID20013
Feature activation-3.14e-02

Pos logit contributions
umm+0.077
ummy+0.074
err+0.074
um+0.073
ini+0.073
Neg logit contributions
 let-0.101
 it-0.099
 share-0.091
 hid-0.091
 fought-0.089
Token until
Token ID1566
Feature activation-4.68e-02

Pos logit contributions
 let+0.045
 fought+0.043
 it+0.043
 hid+0.043
 risking+0.043
Neg logit contributions
 biggest-0.060
 surroundings-0.060
umm-0.060
ummy-0.059
um-0.059
Token said
Token ID531
Feature activation-2.24e-02

Pos logit contributions
 let+0.014
 it+0.014
 share+0.013
 hid+0.011
 fought+0.011
Neg logit contributions
um-0.042
ama-0.041
ummy-0.041
err-0.041
umm-0.041
Token to
Token ID284
Feature activation-4.64e-02

Pos logit contributions
 let+0.033
 share+0.031
 fought+0.030
 hid+0.030
 risking+0.030
Neg logit contributions
ama-0.045
um-0.045
ummy-0.044
 surroundings-0.043
err-0.042
Token herself
Token ID5223
Feature activation-2.23e-03

Pos logit contributions
 let+0.049
 it+0.044
 hid+0.043
 fought+0.042
 share+0.041
Neg logit contributions
ummy-0.114
err-0.113
umm-0.113
um-0.113
ama-0.112
Token,
Token ID11
Feature activation-3.56e-02

Pos logit contributions
 let+0.003
 it+0.003
 share+0.003
 fought+0.003
 hid+0.003
Neg logit contributions
um-0.004
ummy-0.004
ama-0.004
err-0.004
 surroundings-0.004
Token "
Token ID366
Feature activation-1.62e-02

Pos logit contributions
 share+0.054
 fought+0.052
 hid+0.051
 let+0.050
 risking+0.050
Neg logit contributions
um-0.060
ama-0.060
err-0.059
ummy-0.059
 surroundings-0.058
Tokenshould
Token ID21754
Feature activation-7.73e-02

Pos logit contributions
 let+0.025
 it+0.024
 share+0.022
 hid+0.021
 fought+0.021
Neg logit contributions
umm-0.034
ummy-0.034
err-0.033
um-0.033
ini-0.033
Token I
Token ID314
Feature activation-5.75e-02

Pos logit contributions
pped+0.102
pping+0.099
 fought+0.097
assed+0.095
 argued+0.094
Neg logit contributions
ummy-0.156
 biggest-0.156
ama-0.153
 surroundings-0.148
 world-0.148
Token follow
Token ID1061
Feature activation-4.05e-02

Pos logit contributions
 it+0.063
 risking+0.061
 let+0.059
 fought+0.059
 letting+0.056
Neg logit contributions
um-0.132
ay-0.130
ama-0.129
err-0.128
ummy-0.128
Token the
Token ID262
Feature activation+1.04e-01

Pos logit contributions
 fought+0.041
 let+0.040
 hid+0.038
 managed+0.037
 argued+0.035
Neg logit contributions
ummy-0.102
um-0.101
 surroundings-0.099
ama-0.098
ay-0.097
Token path
Token ID3108
Feature activation-3.31e-02

Pos logit contributions
umm+0.140
ini+0.139
ummy+0.136
um+0.134
err+0.133
Neg logit contributions
 let-0.247
 it-0.245
 hid-0.222
 share-0.221
 never-0.214
Token?"
Token ID1701
Feature activation+6.79e-03

Pos logit contributions
 let+0.045
 fought+0.043
 hid+0.042
 it+0.041
 risking+0.040
Neg logit contributions
ummy-0.069
um-0.068
 biggest-0.067
 surroundings-0.067
 world-0.067
Token watch
Token ID2342
Feature activation-4.28e-02

Pos logit contributions
 fought+0.084
 hid+0.084
 risking+0.081
 wouldn+0.078
ocated+0.078
Neg logit contributions
err-0.123
 biggest-0.121
 sun-0.120
um-0.116
 world-0.115
Token the
Token ID262
Feature activation+1.01e-01

Pos logit contributions
 let+0.027
 hid+0.026
 fought+0.025
 managed+0.023
 share+0.021
Neg logit contributions
ummy-0.122
um-0.120
ay-0.119
err-0.118
umm-0.116
Token butterfly
Token ID35113
Feature activation-9.44e-03

Pos logit contributions
umm+0.197
ama+0.192
err+0.191
umpy+0.187
anny+0.186
Neg logit contributions
 it-0.179
 let-0.179
 share-0.150
 put-0.142
 hid-0.138
Token and
Token ID290
Feature activation-5.41e-02

Pos logit contributions
 let+0.014
 it+0.013
 fought+0.013
 hid+0.013
 share+0.012
Neg logit contributions
ummy-0.019
err-0.018
um-0.018
umm-0.018
 surroundings-0.018
Token she
Token ID673
Feature activation-3.40e-02

Pos logit contributions
pping+0.070
pped+0.063
 risking+0.062
ocated+0.061
assed+0.056
Neg logit contributions
 surroundings-0.108
 sun-0.105
 biggest-0.105
 girl-0.105
 sky-0.104
Token wondered
Token ID14028
Feature activation-7.70e-02

Pos logit contributions
 let+0.038
 share+0.038
 it+0.037
 risking+0.037
pping+0.034
Neg logit contributions
um-0.076
err-0.074
 surroundings-0.074
ama-0.073
ini-0.072
Token what
Token ID644
Feature activation-1.84e-02

Pos logit contributions
 fought+0.102
 argued+0.091
 snatched+0.089
 hid+0.087
 stole+0.083
Neg logit contributions
 biggest-0.163
ummy-0.161
ama-0.159
 surroundings-0.159
 best-0.156
Token it
Token ID340
Feature activation-3.45e-02

Pos logit contributions
pping+0.014
pped+0.013
 let+0.013
 fought+0.012
 share+0.012
Neg logit contributions
 world-0.049
 biggest-0.048
ummy-0.047
 surroundings-0.046
err-0.046
Token was
Token ID373
Feature activation-1.75e-02

Pos logit contributions
 let+0.046
 share+0.046
 it+0.045
 risking+0.045
 fought+0.043
Neg logit contributions
err-0.069
ini-0.068
 biggest-0.067
 world-0.066
umpy-0.065
Token doing
Token ID1804
Feature activation-2.92e-02

Pos logit contributions
 fought+0.028
 let+0.028
 it+0.027
pped+0.026
 managed+0.026
Neg logit contributions
 biggest-0.032
 world-0.031
ummy-0.031
 surroundings-0.030
umpy-0.030
Token.
Token ID13
Feature activation-2.90e-04

Pos logit contributions
 let+0.049
 fought+0.049
 snatched+0.048
 hid+0.047
 risking+0.047
Neg logit contributions
ummy-0.052
 world-0.052
um-0.049
 surroundings-0.048
 biggest-0.048
Token They
Token ID1119
Feature activation-5.50e-02

Pos logit contributions
 let+0.015
 it+0.015
 fought+0.014
 share+0.014
 hid+0.014
Neg logit contributions
umm-0.014
ummy-0.014
um-0.013
err-0.013
 surroundings-0.013
Token sit
Token ID1650
Feature activation-1.04e-02

Pos logit contributions
 risking+0.074
pped+0.072
 it+0.066
 letting+0.066
pping+0.065
Neg logit contributions
 biggest-0.105
 sun-0.103
um-0.101
olly-0.100
 sky-0.098
Token on
Token ID319
Feature activation-2.93e-02

Pos logit contributions
 let+0.012
 it+0.011
 share+0.010
 hid+0.009
 shared+0.008
Neg logit contributions
umm-0.028
ummy-0.026
ini-0.026
err-0.026
umpy-0.026
Token a
Token ID257
Feature activation+4.63e-02

Pos logit contributions
 let+0.030
 it+0.026
 hid+0.026
 share+0.025
 fought+0.025
Neg logit contributions
ummy-0.074
umm-0.074
um-0.073
err-0.073
 surroundings-0.072
Token blanket
Token ID18447
Feature activation-2.18e-02

Pos logit contributions
umm+0.094
err+0.088
ini+0.086
 surroundings+0.086
ama+0.085
Neg logit contributions
 let-0.084
 it-0.082
 share-0.073
 hid-0.071
 put-0.069
Token and
Token ID290
Feature activation-7.70e-02

Pos logit contributions
 let+0.035
 it+0.034
 share+0.033
 hid+0.033
 fought+0.032
Neg logit contributions
ummy-0.041
um-0.040
umm-0.040
err-0.039
ay-0.039
Token eat
Token ID4483
Feature activation-2.25e-02

Pos logit contributions
pping+0.094
 risking+0.092
pped+0.090
 fought+0.085
ocated+0.084
Neg logit contributions
 biggest-0.157
 sun-0.156
um-0.149
 sky-0.148
err-0.147
Token their
Token ID511
Feature activation+7.22e-02

Pos logit contributions
 let+0.042
 fought+0.040
 hid+0.039
 it+0.038
 wouldn+0.038
Neg logit contributions
ummy-0.038
um-0.037
umm-0.036
err-0.036
 biggest-0.035
Token food
Token ID2057
Feature activation+2.77e-03

Pos logit contributions
umpy+0.157
ini+0.155
umm+0.155
ubby+0.149
andel+0.146
Neg logit contributions
 let-0.116
 it-0.114
 share-0.108
 shared-0.099
 hid-0.098
Token.
Token ID13
Feature activation-8.54e-03

Pos logit contributions
ummy+0.006
umm+0.006
um+0.006
err+0.005
ama+0.005
Neg logit contributions
 let-0.004
 it-0.004
 hid-0.004
 fought-0.004
 share-0.004
Token They
Token ID1119
Feature activation-5.96e-02

Pos logit contributions
 let+0.015
 it+0.014
 share+0.013
 hid+0.013
 fought+0.013
Neg logit contributions
umm-0.016
ummy-0.016
err-0.015
um-0.015
 surroundings-0.015
Token replied
Token ID8712
Feature activation-2.59e-02

Pos logit contributions
pping+0.016
 let+0.016
pped+0.015
 hid+0.015
 risking+0.015
Neg logit contributions
ummy-0.029
um-0.028
ama-0.028
umm-0.027
 biggest-0.027
Token the
Token ID262
Feature activation+1.10e-01

Pos logit contributions
 let+0.028
 hid+0.027
 fought+0.026
 risking+0.026
pping+0.025
Neg logit contributions
ummy-0.060
um-0.060
ama-0.058
umm-0.056
 biggest-0.056
Token nurse
Token ID15849
Feature activation+1.66e-02

Pos logit contributions
umm+0.198
ummy+0.189
ini+0.188
igsaw+0.179
err+0.177
Neg logit contributions
 let-0.243
 it-0.227
 put-0.204
 hid-0.203
 make-0.202
Token.
Token ID13
Feature activation+6.39e-03

Pos logit contributions
ummy+0.033
um+0.033
ama+0.033
err+0.033
ini+0.033
Neg logit contributions
 let-0.024
 it-0.024
 share-0.022
 hid-0.022
 risking-0.022
Token "
Token ID366
Feature activation-2.05e-02

Pos logit contributions
ummy+0.011
umm+0.011
um+0.010
ama+0.010
err+0.010
Neg logit contributions
 let-0.012
 it-0.011
 fought-0.011
 share-0.011
 hid-0.011
TokenTo
Token ID2514
Feature activation-7.68e-02

Pos logit contributions
 let+0.030
 it+0.028
 share+0.025
 hid+0.025
 fought+0.024
Neg logit contributions
umm-0.045
ummy-0.045
err-0.044
um-0.044
ini-0.044
Token make
Token ID787
Feature activation-1.96e-02

Pos logit contributions
 fought+0.077
 risking+0.073
 it+0.073
 hid+0.072
 argued+0.071
Neg logit contributions
um-0.181
ummy-0.180
ama-0.178
ay-0.176
ubby-0.173
Token sure
Token ID1654
Feature activation-3.51e-02

Pos logit contributions
 let+0.022
 fought+0.020
 it+0.019
 hid+0.019
 share+0.019
Neg logit contributions
ummy-0.047
um-0.047
umm-0.046
ama-0.046
 surroundings-0.046
Token your
Token ID534
Feature activation+6.07e-02

Pos logit contributions
 fought+0.044
 argued+0.041
pped+0.041
pping+0.039
 hid+0.039
Neg logit contributions
ummy-0.078
ama-0.076
 biggest-0.075
um-0.074
 world-0.074
Token body
Token ID1767
Feature activation-3.35e-02

Pos logit contributions
umm+0.105
err+0.100
ini+0.099
umpy+0.098
um+0.098
Neg logit contributions
 let-0.122
 it-0.120
 share-0.113
 hid-0.110
 put-0.106
Token is
Token ID318
Feature activation-3.51e-02

Pos logit contributions
 let+0.046
 it+0.042
 hid+0.041
 whoever+0.041
 fought+0.040
Neg logit contributions
ummy-0.069
ini-0.069
umpy-0.068
 surroundings-0.068
 biggest-0.068
Token He
Token ID679
Feature activation-4.52e-02

Pos logit contributions
 let+0.004
 it+0.004
 hid+0.004
 share+0.004
 fought+0.004
Neg logit contributions
umm-0.004
ini-0.003
ummy-0.003
err-0.003
um-0.003
Token asked
Token ID1965
Feature activation-3.09e-02

Pos logit contributions
 it+0.051
pping+0.049
 let+0.049
 risking+0.048
 share+0.047
Neg logit contributions
err-0.100
um-0.100
ama-0.097
ini-0.096
ummy-0.095
Token her
Token ID607
Feature activation-4.36e-03

Pos logit contributions
 let+0.030
 it+0.028
 hid+0.024
 share+0.024
 fought+0.024
Neg logit contributions
umm-0.080
ummy-0.080
um-0.079
err-0.079
 surroundings-0.078
Token if
Token ID611
Feature activation-3.88e-02

Pos logit contributions
 let+0.004
 fought+0.004
 it+0.004
 argued+0.004
pped+0.004
Neg logit contributions
 surroundings-0.011
ama-0.011
um-0.011
err-0.011
 biggest-0.011
Token she
Token ID673
Feature activation-4.40e-02

Pos logit contributions
pping+0.055
 let+0.054
 fought+0.051
pped+0.050
 argued+0.050
Neg logit contributions
ama-0.075
ummy-0.075
um-0.073
 surroundings-0.073
 biggest-0.073
Token would
Token ID561
Feature activation-7.65e-02

Pos logit contributions
 risking+0.044
 it+0.043
pping+0.041
 letting+0.038
assed+0.038
Neg logit contributions
ini-0.101
umpy-0.101
um-0.100
ama-0.099
err-0.099
Token like
Token ID588
Feature activation-3.27e-02

Pos logit contributions
 risking+0.104
pped+0.096
assed+0.093
 fought+0.093
 hid+0.093
Neg logit contributions
 world-0.149
um-0.147
ay-0.144
 sun-0.141
ummy-0.139
Token to
Token ID284
Feature activation-5.07e-02

Pos logit contributions
 let+0.028
 fought+0.027
 managed+0.027
 hid+0.027
 argued+0.026
Neg logit contributions
ummy-0.082
 sun-0.079
 world-0.079
 biggest-0.079
ay-0.078
Token hear
Token ID3285
Feature activation-3.04e-02

Pos logit contributions
 risking+0.075
 wouldn+0.068
 hid+0.066
assed+0.065
 snatched+0.064
Neg logit contributions
um-0.098
ay-0.098
 sun-0.093
ummy-0.092
 world-0.092
Token his
Token ID465
Feature activation+6.73e-02

Pos logit contributions
 let+0.031
 it+0.027
 hid+0.026
 fought+0.026
 share+0.025
Neg logit contributions
ummy-0.076
umm-0.076
um-0.075
err-0.075
 surroundings-0.074
Token song
Token ID3496
Feature activation-3.98e-03

Pos logit contributions
umm+0.122
ini+0.111
um+0.110
umpy+0.110
err+0.107
Neg logit contributions
 it-0.139
 let-0.135
 share-0.123
 fought-0.117
 make-0.116
Token the
Token ID262
Feature activation+1.12e-01

Pos logit contributions
 fought+0.050
 let+0.050
 hid+0.047
 managed+0.046
 risking+0.043
Neg logit contributions
ummy-0.114
 biggest-0.112
um-0.111
 world-0.111
 surroundings-0.111
Token puppy
Token ID26188
Feature activation-6.69e-03

Pos logit contributions
umm+0.202
anny+0.198
err+0.197
ama+0.196
ini+0.196
Neg logit contributions
 let-0.217
 it-0.214
 share-0.183
 hid-0.183
 put-0.176
Token,
Token ID11
Feature activation-4.25e-02

Pos logit contributions
 let+0.010
pped+0.010
 fought+0.010
 risking+0.010
 it+0.010
Neg logit contributions
ummy-0.012
 biggest-0.012
 surroundings-0.012
um-0.012
 world-0.012
Token so
Token ID523
Feature activation-1.88e-02

Pos logit contributions
pped+0.072
 fought+0.070
pping+0.068
ts+0.068
 risking+0.067
Neg logit contributions
 biggest-0.070
 sun-0.067
ummy-0.066
 world-0.064
 sky-0.064
Token she
Token ID673
Feature activation-4.01e-02

Pos logit contributions
 let+0.024
 it+0.021
 share+0.021
 hid+0.021
 fought+0.020
Neg logit contributions
ummy-0.043
um-0.043
umm-0.043
ama-0.043
 surroundings-0.042
Token carefully
Token ID7773
Feature activation-7.65e-02

Pos logit contributions
 risking+0.042
 losing+0.037
 share+0.037
pping+0.037
 it+0.035
Neg logit contributions
um-0.092
ini-0.087
err-0.087
ama-0.086
olly-0.085
Token put
Token ID1234
Feature activation-3.47e-02

Pos logit contributions
 risking+0.090
 losing+0.081
 share+0.078
 such+0.077
 behave+0.077
Neg logit contributions
um-0.149
 sky-0.138
 sun-0.135
ay-0.133
andel-0.132
Token it
Token ID340
Feature activation-1.81e-02

Pos logit contributions
 let+0.023
 fought+0.020
 hid+0.019
 share+0.018
 risking+0.016
Neg logit contributions
um-0.098
ay-0.097
 biggest-0.096
ummy-0.096
umm-0.096
Token on
Token ID319
Feature activation-3.27e-02

Pos logit contributions
 let+0.027
 it+0.026
 fought+0.025
 whoever+0.025
 risking+0.025
Neg logit contributions
um-0.036
 biggest-0.035
err-0.035
 surroundings-0.035
ummy-0.034
Token his
Token ID465
Feature activation+6.68e-02

Pos logit contributions
 let+0.041
 hid+0.039
 fought+0.039
 share+0.036
 managed+0.036
Neg logit contributions
ummy-0.075
um-0.074
ay-0.071
err-0.071
umm-0.071
Token collar
Token ID19908
Feature activation-2.37e-02

Pos logit contributions
umm+0.114
ummy+0.110
err+0.110
ini+0.109
um+0.108
Neg logit contributions
 let-0.131
 it-0.128
 share-0.118
 hid-0.117
 fought-0.113
Token and
Token ID290
Feature activation-4.40e-02

Pos logit contributions
ummy+0.038
umm+0.037
 surroundings+0.037
um+0.036
ama+0.036
Neg logit contributions
 let-0.025
 it-0.024
 hid-0.023
 fought-0.023
 share-0.023
Token the
Token ID262
Feature activation+8.87e-02

Pos logit contributions
 let+0.051
 it+0.047
pping+0.046
 hid+0.045
 fought+0.045
Neg logit contributions
 surroundings-0.098
umm-0.098
um-0.097
ummy-0.097
err-0.096
Token butterfly
Token ID35113
Feature activation-1.56e-02

Pos logit contributions
umm+0.129
ummy+0.123
umpy+0.120
err+0.120
um+0.120
Neg logit contributions
 let-0.199
 it-0.198
 share-0.178
 hid-0.172
 put-0.171
Token.
Token ID13
Feature activation-3.44e-03

Pos logit contributions
 let+0.021
 it+0.020
 hid+0.020
 fought+0.020
pped+0.019
Neg logit contributions
 surroundings-0.031
 world-0.031
ummy-0.031
um-0.030
 biggest-0.030
Token They
Token ID1119
Feature activation-4.55e-02

Pos logit contributions
 let+0.006
 it+0.006
 share+0.005
 hid+0.005
 fought+0.005
Neg logit contributions
umm-0.007
ini-0.006
err-0.006
ummy-0.006
 surroundings-0.006
Token wondered
Token ID14028
Feature activation-7.65e-02

Pos logit contributions
 risking+0.052
ocated+0.049
 it+0.048
 let+0.047
 letting+0.047
Neg logit contributions
um-0.095
 sky-0.093
ay-0.092
 world-0.091
ama-0.091
Token together
Token ID1978
Feature activation-5.13e-02

Pos logit contributions
 fought+0.079
 snatched+0.073
 hid+0.073
pped+0.071
 let+0.071
Neg logit contributions
 surroundings-0.176
 biggest-0.175
ummy-0.175
ama-0.172
um-0.168
Token how
Token ID703
Feature activation-1.33e-02

Pos logit contributions
 fought+0.067
 hid+0.066
pped+0.065
 let+0.065
 snatched+0.064
Neg logit contributions
ummy-0.104
 world-0.104
 surroundings-0.103
um-0.100
 biggest-0.100
Token such
Token ID884
Feature activation+1.19e-02

Pos logit contributions
 let+0.024
 hid+0.022
 it+0.022
 share+0.022
 fought+0.022
Neg logit contributions
ummy-0.023
 surroundings-0.023
umm-0.023
ini-0.023
um-0.023
Token a
Token ID257
Feature activation+6.16e-02

Pos logit contributions
err+0.022
 surroundings+0.022
ummy+0.022
umm+0.021
um+0.021
Neg logit contributions
 let-0.020
 it-0.018
 hid-0.018
 share-0.018
 fought-0.018
Token small
Token ID1402
Feature activation+7.42e-02

Pos logit contributions
umm+0.079
ummy+0.075
err+0.075
um+0.074
ama+0.073
Neg logit contributions
 let-0.149
 it-0.147
 share-0.135
 hid-0.132
 never-0.130
Token knew
Token ID2993
Feature activation-3.96e-02

Pos logit contributions
 let+0.044
 it+0.040
 share+0.039
 hid+0.038
 fought+0.038
Neg logit contributions
umm-0.104
um-0.103
 surroundings-0.102
ummy-0.102
err-0.102
Token that
Token ID326
Feature activation-2.49e-02

Pos logit contributions
 fought+0.043
 let+0.042
 hid+0.042
 argued+0.041
 share+0.040
Neg logit contributions
ummy-0.091
 surroundings-0.090
ama-0.090
 biggest-0.089
um-0.089
Token the
Token ID262
Feature activation+1.15e-01

Pos logit contributions
 let+0.027
 hid+0.025
 fought+0.024
 share+0.024
pped+0.023
Neg logit contributions
umm-0.058
ummy-0.058
 surroundings-0.057
err-0.057
 world-0.057
Token wishing
Token ID24433
Feature activation+2.28e-02

Pos logit contributions
ummy+0.190
umm+0.186
olly+0.185
ini+0.185
um+0.185
Neg logit contributions
 let-0.248
 it-0.240
 never-0.220
 share-0.208
 put-0.207
Token tree
Token ID5509
Feature activation-1.28e-02

Pos logit contributions
umm+0.039
ummy+0.039
 surroundings+0.038
err+0.038
um+0.038
Neg logit contributions
 let-0.042
 it-0.040
 share-0.038
 hid-0.038
 fought-0.038
Token would
Token ID561
Feature activation-7.64e-02

Pos logit contributions
 let+0.016
 it+0.015
 share+0.015
 hid+0.014
 fought+0.014
Neg logit contributions
ini-0.027
 surroundings-0.027
err-0.027
 biggest-0.027
 world-0.027
Token remain
Token ID3520
Feature activation-1.06e-02

Pos logit contributions
 hid+0.085
 fought+0.083
 argued+0.081
pped+0.081
assed+0.076
Neg logit contributions
 world-0.170
 biggest-0.160
ummy-0.159
um-0.157
umm-0.157
Token a
Token ID257
Feature activation+4.43e-02

Pos logit contributions
 let+0.016
 hid+0.015
 fought+0.015
 it+0.015
 argued+0.015
Neg logit contributions
 surroundings-0.021
 world-0.020
um-0.020
ini-0.020
ubby-0.020
Token special
Token ID2041
Feature activation+7.14e-02

Pos logit contributions
umm+0.072
ummy+0.071
err+0.070
um+0.069
ini+0.069
Neg logit contributions
 let-0.094
 it-0.090
 share-0.082
 never-0.081
 hid-0.080
Token place
Token ID1295
Feature activation-1.92e-02

Pos logit contributions
umm+0.101
ummy+0.096
ini+0.096
err+0.095
um+0.095
Neg logit contributions
 let-0.165
 it-0.160
 share-0.147
 hid-0.145
 never-0.142
Token he
Token ID339
Feature activation-2.04e-02

Pos logit contributions
 let+0.026
 hid+0.025
 fought+0.025
 argued+0.024
 share+0.023
Neg logit contributions
ummy-0.041
 world-0.040
umm-0.039
um-0.039
ay-0.039
Token smile
Token ID8212
Feature activation-1.28e-02

Pos logit contributions
umm+0.105
ummy+0.101
err+0.101
ini+0.100
um+0.100
Neg logit contributions
 let-0.135
 it-0.132
 share-0.123
 hid-0.122
 fought-0.120
Token and
Token ID290
Feature activation-4.42e-02

Pos logit contributions
 let+0.019
 hid+0.019
 share+0.019
 fought+0.018
 it+0.018
Neg logit contributions
um-0.025
ummy-0.024
ay-0.024
ama-0.024
ini-0.023
Token asked
Token ID1965
Feature activation-3.47e-02

Pos logit contributions
 let+0.051
pping+0.049
 it+0.047
 risking+0.047
pped+0.045
Neg logit contributions
um-0.099
umm-0.097
ama-0.097
ummy-0.097
 biggest-0.096
Token if
Token ID611
Feature activation-4.09e-02

Pos logit contributions
 let+0.038
 fought+0.034
 hid+0.034
 risking+0.031
 it+0.031
Neg logit contributions
ummy-0.086
um-0.085
ama-0.083
umm-0.082
 surroundings-0.081
Token she
Token ID673
Feature activation-4.92e-02

Pos logit contributions
pping+0.060
 let+0.057
 fought+0.057
 hid+0.056
pped+0.056
Neg logit contributions
ama-0.080
ummy-0.080
um-0.079
 biggest-0.076
umm-0.074
Token could
Token ID714
Feature activation-7.62e-02

Pos logit contributions
 risking+0.047
pping+0.046
 letting+0.041
pped+0.041
assed+0.041
Neg logit contributions
um-0.119
ummy-0.117
ama-0.116
olly-0.115
 biggest-0.114
Token lend
Token ID22096
Feature activation-3.60e-02

Pos logit contributions
 risking+0.082
 hid+0.081
 fought+0.077
ts+0.075
assed+0.074
Neg logit contributions
um-0.186
ay-0.179
ummy-0.178
ama-0.174
ini-0.166
Token her
Token ID607
Feature activation+2.42e-02

Pos logit contributions
 let+0.035
 it+0.031
 hid+0.029
 fought+0.028
 share+0.028
Neg logit contributions
umm-0.093
ummy-0.093
um-0.093
err-0.091
 surroundings-0.091
Token the
Token ID262
Feature activation+1.09e-01

Pos logit contributions
ummy+0.045
um+0.045
 surroundings+0.045
 biggest+0.045
umm+0.044
Neg logit contributions
 let-0.040
 it-0.037
 hid-0.037
 fought-0.037
 share-0.035
Token money
Token ID1637
Feature activation-1.03e-02

Pos logit contributions
umm+0.161
ummy+0.155
um+0.154
 surroundings+0.154
 biggest+0.153
Neg logit contributions
 let-0.234
 it-0.226
 hid-0.214
 fought-0.212
 share-0.210
Token for
Token ID329
Feature activation-2.72e-02

Pos logit contributions
 let+0.015
 fought+0.014
 hid+0.014
 risking+0.014
 it+0.014
Neg logit contributions
ummy-0.021
um-0.021
 biggest-0.020
umm-0.020
ama-0.020
Tokenie
Token ID494
Feature activation-2.68e-03

Pos logit contributions
umm+0.108
ama+0.102
ummy+0.101
um+0.100
err+0.099
Neg logit contributions
 let-0.112
 it-0.107
 hid-0.099
 share-0.098
 fought-0.098
Token tried
Token ID3088
Feature activation-4.56e-02

Pos logit contributions
 let+0.003
 it+0.003
 share+0.003
 risking+0.003
 hid+0.003
Neg logit contributions
um-0.006
ama-0.006
err-0.006
ini-0.006
ummy-0.006
Token to
Token ID284
Feature activation-5.24e-02

Pos logit contributions
 let+0.047
 fought+0.043
 hid+0.042
 share+0.039
 it+0.039
Neg logit contributions
 surroundings-0.111
ummy-0.111
err-0.110
um-0.109
ama-0.108
Token quit
Token ID11238
Feature activation-3.84e-02

Pos logit contributions
 it+0.055
 fought+0.052
 let+0.050
 hid+0.049
 risking+0.049
Neg logit contributions
um-0.121
ummy-0.118
err-0.118
ama-0.118
ini-0.117
Token eating
Token ID6600
Feature activation-3.43e-02

Pos logit contributions
 let+0.057
 fought+0.051
 hid+0.050
 managed+0.049
 share+0.049
Neg logit contributions
 surroundings-0.078
ummy-0.074
 world-0.073
ama-0.071
err-0.069
Token the
Token ID262
Feature activation+1.07e-01

Pos logit contributions
 let+0.034
 fought+0.033
 hid+0.031
 argued+0.028
 managed+0.027
Neg logit contributions
ummy-0.086
um-0.084
umm-0.084
 surroundings-0.083
 biggest-0.083
Token flower
Token ID15061
Feature activation+9.36e-03

Pos logit contributions
umm+0.201
ama+0.191
ummy+0.190
ini+0.190
err+0.189
Neg logit contributions
 it-0.202
 let-0.202
 share-0.180
 hid-0.173
 put-0.168
Token,
Token ID11
Feature activation-4.31e-02

Pos logit contributions
ummy+0.019
umm+0.019
um+0.019
err+0.019
ama+0.019
Neg logit contributions
 let-0.014
 it-0.013
 hid-0.012
 fought-0.012
 share-0.012
Token but
Token ID475
Feature activation-2.70e-02

Pos logit contributions
 share+0.071
 fought+0.069
 hid+0.067
pped+0.067
 let+0.066
Neg logit contributions
 world-0.069
 biggest-0.068
 sky-0.068
ummy-0.067
 sun-0.066
Token it
Token ID340
Feature activation-1.74e-02

Pos logit contributions
 let+0.025
 share+0.023
 fought+0.023
pping+0.022
 hid+0.022
Neg logit contributions
ama-0.065
ummy-0.064
 biggest-0.064
 surroundings-0.064
um-0.064
Token was
Token ID373
Feature activation-2.69e-02

Pos logit contributions
 let+0.023
 it+0.023
 share+0.022
 risking+0.020
 hid+0.020
Neg logit contributions
err-0.038
umm-0.037
ini-0.037
um-0.037
ama-0.036
Token a
Token ID257
Feature activation+3.15e-02

Pos logit contributions
 let+0.050
pping+0.048
 it+0.047
 risking+0.047
 share+0.046
Neg logit contributions
 biggest-0.068
um-0.068
ummy-0.067
err-0.067
umm-0.066
Token doll
Token ID3654
Feature activation+3.20e-03

Pos logit contributions
umm+0.064
ini+0.061
ummy+0.060
err+0.059
umpy+0.059
Neg logit contributions
 let-0.059
 it-0.057
 share-0.052
 hid-0.049
 put-0.048
Token.
Token ID13
Feature activation-2.87e-03

Pos logit contributions
ummy+0.006
um+0.006
umm+0.006
ay+0.006
ama+0.006
Neg logit contributions
 let-0.005
 it-0.005
 risking-0.005
 managed-0.005
 share-0.005
Token They
Token ID1119
Feature activation-4.26e-02

Pos logit contributions
 let+0.004
 it+0.004
 hid+0.004
 share+0.004
 fought+0.004
Neg logit contributions
umm-0.006
ummy-0.006
err-0.006
um-0.006
ini-0.006
Token threw
Token ID9617
Feature activation-1.80e-02

Pos logit contributions
 risking+0.073
pping+0.062
assed+0.061
 letting+0.060
 losing+0.059
Neg logit contributions
 biggest-0.070
 sky-0.070
um-0.069
 sun-0.069
olly-0.068
Token the
Token ID262
Feature activation+9.78e-02

Pos logit contributions
 let+0.018
 it+0.016
 hid+0.015
 share+0.015
 fought+0.014
Neg logit contributions
umm-0.046
err-0.046
ummy-0.046
um-0.046
 surroundings-0.045
Token ball
Token ID2613
Feature activation-1.94e-02

Pos logit contributions
umm+0.189
ama+0.187
ini+0.181
ummy+0.180
um+0.179
Neg logit contributions
 it-0.177
 let-0.177
 share-0.159
 hid-0.154
 fought-0.149
Token and
Token ID290
Feature activation-4.98e-02

Pos logit contributions
 let+0.024
 it+0.022
 share+0.020
 hid+0.020
 fought+0.019
Neg logit contributions
umm-0.046
ummy-0.045
err-0.045
um-0.045
 surroundings-0.044
Token hugged
Token ID41130
Feature activation-2.81e-02

Pos logit contributions
pping+0.057
 risking+0.056
pped+0.049
 let+0.047
ocated+0.046
Neg logit contributions
 sun-0.107
 sky-0.103
 girl-0.101
 biggest-0.100
um-0.096
Token the
Token ID262
Feature activation+8.74e-02

Pos logit contributions
 hid+0.015
 risking+0.015
 fought+0.013
pping+0.013
 managed+0.013
Neg logit contributions
um-0.080
ay-0.079
 sun-0.078
 surroundings-0.078
ummy-0.077
Token doll
Token ID3654
Feature activation+7.15e-03

Pos logit contributions
umm+0.154
ama+0.152
ummy+0.150
ini+0.150
um+0.150
Neg logit contributions
 let-0.170
 it-0.167
 share-0.153
 hid-0.146
 fought-0.144
Token had
Token ID550
Feature activation-4.54e-02

Pos logit contributions
 whoever+0.046
 letting+0.045
assed+0.045
pping+0.045
gency+0.045
Neg logit contributions
umpy-0.054
 whole-0.053
 entire-0.052
 best-0.049
 world-0.048
Token never
Token ID1239
Feature activation-3.52e-02

Pos logit contributions
pping+0.057
pped+0.051
sels+0.050
 whoever+0.050
assed+0.048
Neg logit contributions
 best-0.095
 biggest-0.087
 new-0.085
 sun-0.081
 friend-0.079
Token played
Token ID2826
Feature activation+6.50e-03

Pos logit contributions
 it+0.041
pping+0.037
 whoever+0.036
 let+0.034
 risking+0.033
Neg logit contributions
err-0.076
umpy-0.074
 biggest-0.073
 world-0.072
 surroundings-0.072
Token with
Token ID351
Feature activation-1.61e-02

Pos logit contributions
 world+0.012
 surroundings+0.012
 biggest+0.012
ummy+0.012
umpy+0.012
Neg logit contributions
 let-0.010
 managed-0.010
 hid-0.009
 argued-0.009
 wouldn-0.009
Token the
Token ID262
Feature activation+1.21e-01

Pos logit contributions
 let+0.017
 hid+0.014
 share+0.013
 managed+0.013
 fought+0.013
Neg logit contributions
 surroundings-0.040
ummy-0.040
 biggest-0.039
um-0.039
ama-0.038
Token aer
Token ID9551
Feature activation+1.33e-01

Pos logit contributions
umm+0.235
ummy+0.220
um+0.220
err+0.218
ama+0.213
Neg logit contributions
 it-0.231
 let-0.229
 hid-0.194
 never-0.193
 share-0.192
Tokenopl
Token ID20106
Feature activation+1.01e-01

Pos logit contributions
umm+0.149
 surroundings+0.132
 biggest+0.132
err+0.131
ubby+0.123
Neg logit contributions
 let-0.325
 it-0.324
 never-0.315
pping-0.314
 fought-0.298
Tokenane
Token ID1531
Feature activation+1.73e-02

Pos logit contributions
umm+0.126
err+0.120
 surroundings+0.118
 biggest+0.112
ummy+0.111
Neg logit contributions
 it-0.242
 let-0.238
 share-0.217
 fought-0.215
pped-0.213
Token.
Token ID13
Feature activation+1.05e-02

Pos logit contributions
umm+0.037
ummy+0.037
err+0.036
um+0.036
ini+0.036
Neg logit contributions
 let-0.024
 it-0.023
 hid-0.022
 share-0.021
 fought-0.021
Token They
Token ID1119
Feature activation-4.03e-02

Pos logit contributions
umm+0.023
ini+0.023
ummy+0.022
um+0.022
 surroundings+0.022
Neg logit contributions
 it-0.017
 let-0.017
 share-0.014
 hid-0.014
 never-0.014
Token wish
Token ID4601
Feature activation-3.77e-02

Pos logit contributions
 letting+0.070
pped+0.067
 it+0.067
pping+0.067
 let+0.066
Neg logit contributions
 biggest-0.059
 sky-0.058
umpy-0.058
 world-0.057
 surroundings-0.057
Token smiled
Token ID13541
Feature activation-1.22e-02

Pos logit contributions
 risking+0.049
pping+0.042
 share+0.040
 it+0.040
 letting+0.040
Neg logit contributions
um-0.083
err-0.080
ama-0.078
 surroundings-0.077
 biggest-0.077
Token.
Token ID13
Feature activation-1.95e-04

Pos logit contributions
 let+0.017
 it+0.016
 fought+0.016
 hid+0.016
 share+0.016
Neg logit contributions
umm-0.026
ummy-0.026
um-0.026
err-0.025
 surroundings-0.025
Token She
Token ID1375
Feature activation-3.79e-02

Pos logit contributions
 let+0.000
 it+0.000
 share+0.000
 hid+0.000
 fought+0.000
Neg logit contributions
umm-0.000
ini-0.000
err-0.000
ummy-0.000
 surroundings-0.000
Token hopped
Token ID45230
Feature activation-2.79e-02

Pos logit contributions
 risking+0.044
 it+0.041
 share+0.039
pping+0.039
 let+0.038
Neg logit contributions
um-0.084
err-0.082
ay-0.080
ama-0.080
ummy-0.079
Token around
Token ID1088
Feature activation-3.11e-02

Pos logit contributions
 let+0.042
 fought+0.039
 it+0.038
 hid+0.038
 share+0.038
Neg logit contributions
um-0.058
err-0.056
ummy-0.056
ay-0.056
umm-0.055
Token the
Token ID262
Feature activation+9.56e-02

Pos logit contributions
 let+0.041
 fought+0.036
 hid+0.036
 share+0.035
 wouldn+0.034
Neg logit contributions
ummy-0.068
 world-0.067
err-0.067
 surroundings-0.066
umm-0.066
Token tree
Token ID5509
Feature activation-2.20e-02

Pos logit contributions
umm+0.151
err+0.142
ini+0.140
ubby+0.139
ama+0.139
Neg logit contributions
 let-0.208
 it-0.208
 share-0.183
 put-0.180
 hid-0.179
Token singing
Token ID13777
Feature activation-1.60e-02

Pos logit contributions
 let+0.032
 it+0.031
 share+0.028
 hid+0.028
 fought+0.027
Neg logit contributions
umm-0.047
ummy-0.046
um-0.046
err-0.046
 surroundings-0.045
Token the
Token ID262
Feature activation+9.64e-02

Pos logit contributions
 let+0.023
 it+0.021
 hid+0.020
 share+0.020
 fought+0.020
Neg logit contributions
umm-0.034
um-0.034
ummy-0.034
err-0.033
 surroundings-0.033
Token words
Token ID2456
Feature activation-2.69e-02

Pos logit contributions
umm+0.130
ummy+0.129
ubby+0.128
ama+0.128
um+0.127
Neg logit contributions
 let-0.233
 it-0.221
 share-0.199
 put-0.197
 never-0.196
Token of
Token ID286
Feature activation-1.34e-02

Pos logit contributions
 fought+0.044
 hid+0.044
 risking+0.043
 let+0.043
 share+0.042
Neg logit contributions
ay-0.049
ummy-0.048
um-0.048
umm-0.046
 sky-0.045
Token laughed
Token ID13818
Feature activation-1.49e-02

Pos logit contributions
 risking+0.048
 it+0.041
 letting+0.040
ocated+0.039
 losing+0.039
Neg logit contributions
um-0.077
olly-0.073
ay-0.072
 sun-0.072
ama-0.071
Token and
Token ID290
Feature activation-3.21e-02

Pos logit contributions
 let+0.024
 it+0.022
 hid+0.021
 fought+0.021
 share+0.021
Neg logit contributions
ummy-0.029
um-0.029
umm-0.029
err-0.029
 surroundings-0.028
Token smiled
Token ID13541
Feature activation-8.28e-03

Pos logit contributions
 risking+0.034
 let+0.032
pping+0.030
 steal+0.029
 it+0.029
Neg logit contributions
um-0.070
 sun-0.070
ay-0.069
ama-0.069
 surroundings-0.068
Token,
Token ID11
Feature activation-2.65e-02

Pos logit contributions
 let+0.011
 it+0.011
 hid+0.010
 share+0.010
 fought+0.009
Neg logit contributions
umm-0.019
ummy-0.019
um-0.018
err-0.018
 surroundings-0.018
Token and
Token ID290
Feature activation-3.74e-02

Pos logit contributions
 let+0.040
 hid+0.040
 share+0.040
 risking+0.039
 steal+0.039
Neg logit contributions
ummy-0.045
err-0.044
 sun-0.044
umm-0.044
ini-0.044
Token the
Token ID262
Feature activation+1.11e-01

Pos logit contributions
 let+0.040
 risking+0.036
pping+0.035
 it+0.035
 share+0.034
Neg logit contributions
um-0.086
umm-0.085
ama-0.085
ummy-0.084
err-0.083
Token pengu
Token ID38373
Feature activation+9.67e-02

Pos logit contributions
umm+0.191
ummy+0.184
ama+0.181
um+0.180
igsaw+0.176
Neg logit contributions
 let-0.252
 it-0.238
 put-0.218
 hid-0.209
 tried-0.209
Tokenin
Token ID259
Feature activation+1.28e-02

Pos logit contributions
umm+0.154
ummy+0.144
err+0.142
um+0.142
 surroundings+0.139
Neg logit contributions
 let-0.205
 it-0.201
 fought-0.189
 share-0.187
 hid-0.185
Token showed
Token ID3751
Feature activation-1.59e-02

Pos logit contributions
um+0.029
umm+0.029
err+0.029
ummy+0.029
ama+0.029
Neg logit contributions
 let-0.016
 it-0.015
 share-0.014
 hid-0.013
 risking-0.013
Token him
Token ID683
Feature activation-1.19e-02

Pos logit contributions
 let+0.013
 it+0.012
 share+0.010
 hid+0.010
 fought+0.009
Neg logit contributions
umm-0.045
ummy-0.044
err-0.044
um-0.044
ini-0.044
Token how
Token ID703
Feature activation-1.54e-02

Pos logit contributions
 let+0.013
 fought+0.011
 hid+0.011
pped+0.011
 it+0.011
Neg logit contributions
 surroundings-0.028
 biggest-0.028
err-0.028
um-0.028
ummy-0.028
Token slid
Token ID27803
Feature activation-9.14e-03

Pos logit contributions
 it+0.040
 let+0.039
 share+0.036
 risking+0.036
 fought+0.034
Neg logit contributions
um-0.096
ama-0.094
umm-0.094
err-0.094
ini-0.094
Token down
Token ID866
Feature activation-3.13e-02

Pos logit contributions
 let+0.015
 it+0.014
 share+0.013
 hid+0.013
 fought+0.013
Neg logit contributions
umm-0.018
um-0.018
ummy-0.018
err-0.018
 surroundings-0.018
Token,
Token ID11
Feature activation-2.89e-02

Pos logit contributions
 let+0.036
 share+0.031
 hid+0.031
 fought+0.031
 it+0.031
Neg logit contributions
ummy-0.074
um-0.074
umm-0.074
err-0.074
ay-0.073
Token feeling
Token ID4203
Feature activation-2.47e-02

Pos logit contributions
 let+0.040
 share+0.039
 it+0.039
 fought+0.039
 hid+0.038
Neg logit contributions
 biggest-0.056
um-0.055
err-0.054
 surroundings-0.054
ummy-0.054
Token the
Token ID262
Feature activation+7.78e-02

Pos logit contributions
 let+0.042
 fought+0.040
 hid+0.038
 share+0.038
 argued+0.037
Neg logit contributions
 biggest-0.044
um-0.042
umpy-0.042
umm-0.041
 surroundings-0.041
Token warm
Token ID5814
Feature activation+6.53e-02

Pos logit contributions
ummy+0.120
err+0.120
umm+0.119
ini+0.118
ama+0.117
Neg logit contributions
 let-0.172
 it-0.163
 share-0.150
 put-0.148
 stayed-0.147
Token sun
Token ID4252
Feature activation+1.83e-03

Pos logit contributions
err+0.084
ini+0.081
ama+0.080
umm+0.079
ubby+0.076
Neg logit contributions
 it-0.157
 let-0.156
 share-0.141
 hid-0.138
 put-0.136
Token on
Token ID319
Feature activation-3.61e-02

Pos logit contributions
umm+0.004
ummy+0.004
um+0.004
ama+0.004
ay+0.004
Neg logit contributions
 let-0.003
 it-0.003
 hid-0.002
 fought-0.002
 share-0.002
Token her
Token ID607
Feature activation+5.39e-02

Pos logit contributions
 let+0.033
 it+0.033
 share+0.026
 hid+0.025
 fought+0.023
Neg logit contributions
umm-0.099
ini-0.097
err-0.097
ummy-0.096
 surroundings-0.096
Token face
Token ID1986
Feature activation-2.22e-02

Pos logit contributions
umm+0.093
err+0.089
ini+0.089
ummy+0.086
ubby+0.085
Neg logit contributions
 let-0.106
 it-0.105
 share-0.095
 hid-0.093
 fought-0.090
Token.
Token ID13
Feature activation+3.01e-03

Pos logit contributions
 let+0.032
 it+0.031
 hid+0.030
 fought+0.030
 share+0.030
Neg logit contributions
ummy-0.046
um-0.046
umm-0.046
ama-0.044
ay-0.044
Token filled
Token ID5901
Feature activation-1.79e-02

Pos logit contributions
 risking+0.042
 let+0.038
 it+0.038
 share+0.037
 letting+0.036
Neg logit contributions
um-0.081
err-0.079
ini-0.078
 biggest-0.077
 surroundings-0.077
Token it
Token ID340
Feature activation-1.38e-02

Pos logit contributions
 let+0.009
 it+0.008
 hid+0.006
 share+0.006
 fought+0.006
Neg logit contributions
umm-0.055
ummy-0.054
err-0.054
um-0.054
ini-0.054
Token up
Token ID510
Feature activation-2.30e-02

Pos logit contributions
 let+0.020
 it+0.019
 hid+0.017
 share+0.017
 fought+0.017
Neg logit contributions
umm-0.030
ummy-0.030
err-0.030
um-0.030
ini-0.029
Token with
Token ID351
Feature activation-2.77e-02

Pos logit contributions
 let+0.032
 it+0.030
 hid+0.028
 share+0.027
 fought+0.027
Neg logit contributions
umm-0.052
ummy-0.051
err-0.050
um-0.050
 surroundings-0.050
Token the
Token ID262
Feature activation+8.86e-02

Pos logit contributions
 let+0.048
 it+0.045
 hid+0.044
 share+0.043
 fought+0.043
Neg logit contributions
umm-0.050
ummy-0.049
um-0.049
err-0.048
 biggest-0.048
Token yellow
Token ID7872
Feature activation+9.38e-02

Pos logit contributions
umm+0.155
ummy+0.148
err+0.147
ini+0.146
ama+0.145
Neg logit contributions
 let-0.178
 it-0.177
 share-0.155
 put-0.149
 hid-0.149
Token cr
Token ID1067
Feature activation+7.25e-02

Pos logit contributions
umm+0.157
err+0.152
ini+0.150
ummy+0.150
ama+0.148
Neg logit contributions
 let-0.187
 it-0.185
 share-0.168
 hid-0.165
 fought-0.161
Tokenay
Token ID323
Feature activation+4.48e-02

Pos logit contributions
 surroundings+0.049
err+0.040
umm+0.038
 biggest+0.035
ini+0.031
Neg logit contributions
 let-0.213
 it-0.213
pped-0.198
 share-0.196
pping-0.194
Tokenon
Token ID261
Feature activation+5.58e-03

Pos logit contributions
umm+0.098
err+0.095
ummy+0.094
ini+0.093
 surroundings+0.093
Neg logit contributions
 let-0.066
 it-0.065
 share-0.057
 hid-0.057
 fought-0.055
Token.
Token ID13
Feature activation-5.11e-03

Pos logit contributions
ummy+0.012
umm+0.011
um+0.011
ama+0.011
ay+0.011
Neg logit contributions
 let-0.008
 it-0.008
 share-0.008
 fought-0.007
 hid-0.007
Token He
Token ID679
Feature activation-3.85e-02

Pos logit contributions
 it+0.010
 let+0.009
 share+0.008
 hid+0.008
 whoever+0.008
Neg logit contributions
umm-0.009
ini-0.009
ubby-0.009
err-0.009
 surroundings-0.009
Token a
Token ID257
Feature activation+5.43e-02

Pos logit contributions
 let+0.025
 it+0.022
 share+0.022
 hid+0.021
 fought+0.021
Neg logit contributions
 surroundings-0.038
ummy-0.038
err-0.038
umm-0.038
um-0.038
Token search
Token ID2989
Feature activation-2.21e-02

Pos logit contributions
umm+0.106
ummy+0.103
err+0.102
um+0.102
ini+0.102
Neg logit contributions
 let-0.091
 it-0.088
 share-0.081
 hid-0.081
 fought-0.079
Token to
Token ID284
Feature activation-3.81e-02

Pos logit contributions
 let+0.040
 it+0.038
 share+0.037
 fought+0.036
 hid+0.036
Neg logit contributions
ummy-0.038
umm-0.038
um-0.037
 surroundings-0.037
err-0.037
Token find
Token ID1064
Feature activation-2.08e-02

Pos logit contributions
 let+0.041
 it+0.038
 fought+0.036
 hid+0.035
 risking+0.034
Neg logit contributions
 surroundings-0.090
 biggest-0.090
err-0.089
umm-0.088
 world-0.088
Token out
Token ID503
Feature activation-1.42e-02

Pos logit contributions
 let+0.026
 it+0.026
 hid+0.022
 share+0.022
 fought+0.021
Neg logit contributions
umm-0.050
ummy-0.049
err-0.049
ini-0.048
um-0.048
Token more
Token ID517
Feature activation+4.83e-02

Pos logit contributions
 let+0.015
 fought+0.014
 share+0.013
 hid+0.013
 it+0.013
Neg logit contributions
 surroundings-0.035
ummy-0.034
err-0.034
um-0.033
 biggest-0.033
Token about
Token ID546
Feature activation-2.94e-02

Pos logit contributions
umm+0.087
ummy+0.082
um+0.082
err+0.082
ama+0.082
Neg logit contributions
 it-0.093
 let-0.093
 hid-0.083
 share-0.082
 fought-0.079
Token gray
Token ID12768
Feature activation+6.80e-02

Pos logit contributions
 let+0.036
 share+0.032
 fought+0.032
 hid+0.031
 managed+0.030
Neg logit contributions
 surroundings-0.067
 world-0.067
ummy-0.066
 biggest-0.066
err-0.065
Token.
Token ID13
Feature activation+8.47e-03

Pos logit contributions
umm+0.104
err+0.100
ama+0.099
ini+0.099
um+0.099
Neg logit contributions
 it-0.146
 let-0.144
 hid-0.132
 share-0.128
 fought-0.128
Token Everywhere
Token ID48201
Feature activation+2.72e-03

Pos logit contributions
umm+0.018
ini+0.018
ubby+0.018
err+0.017
umpy+0.017
Neg logit contributions
 it-0.014
 let-0.014
 hid-0.012
 share-0.011
 whoever-0.011
Token she
Token ID673
Feature activation+4.37e-03

Pos logit contributions
ama+0.006
ummy+0.005
 surroundings+0.005
ay+0.005
um+0.005
Neg logit contributions
 share-0.004
 let-0.004
 fought-0.004
 hid-0.003
 argued-0.003
Token the
Token ID262
Feature activation+9.34e-02

Pos logit contributions
 let+0.053
 fought+0.051
 managed+0.049
 risking+0.048
 hid+0.048
Neg logit contributions
 world-0.089
ummy-0.088
um-0.086
 sky-0.085
err-0.085
Token big
Token ID1263
Feature activation+8.19e-02

Pos logit contributions
umm+0.171
err+0.164
ama+0.160
olly+0.155
um+0.155
Neg logit contributions
 let-0.185
 it-0.183
 put-0.162
 hid-0.162
 never-0.159
Token pond
Token ID16723
Feature activation-1.55e-02

Pos logit contributions
err+0.148
umm+0.144
ama+0.141
um+0.136
 surroundings+0.136
Neg logit contributions
 it-0.161
 let-0.158
 hid-0.142
 share-0.139
 fought-0.134
Token next
Token ID1306
Feature activation+5.25e-02

Pos logit contributions
 let+0.024
 it+0.023
 share+0.022
 fought+0.022
 risking+0.022
Neg logit contributions
ummy-0.030
um-0.029
 surroundings-0.029
err-0.029
 world-0.029
Token to
Token ID284
Feature activation-4.25e-02

Pos logit contributions
umm+0.136
ini+0.133
ama+0.133
err+0.133
umpy+0.127
Neg logit contributions
 let-0.062
 it-0.059
 share-0.047
 hid-0.044
 come-0.043
Token her
Token ID607
Feature activation+3.95e-02

Pos logit contributions
 let+0.043
 fought+0.042
 hid+0.039
 argued+0.037
 risking+0.037
Neg logit contributions
 biggest-0.102
 world-0.100
 surroundings-0.100
err-0.099
ummy-0.099
Token house
Token ID2156
Feature activation-1.38e-02

Pos logit contributions
umm+0.053
err+0.051
umpy+0.049
um+0.047
ini+0.047
Neg logit contributions
 let-0.094
 it-0.094
 hid-0.086
 share-0.085
 put-0.082
Token.
Token ID13
Feature activation-1.69e-03

Pos logit contributions
 let+0.024
 risking+0.023
 fought+0.023
 it+0.023
 share+0.023
Neg logit contributions
ummy-0.023
 world-0.022
 surroundings-0.022
um-0.022
ama-0.021
Token She
Token ID1375
Feature activation-3.14e-02

Pos logit contributions
 let+0.003
 it+0.003
 hid+0.003
 share+0.003
 fought+0.003
Neg logit contributions
umm-0.003
ini-0.003
ummy-0.003
err-0.003
um-0.003
Token jumped
Token ID11687
Feature activation-7.77e-03

Pos logit contributions
 risking+0.039
 share+0.039
 it+0.037
 let+0.037
 steal+0.036
Neg logit contributions
 surroundings-0.064
ini-0.064
ubby-0.063
umpy-0.063
um-0.063
Token in
Token ID287
Feature activation-8.00e-03

Pos logit contributions
 let+0.012
 it+0.011
 share+0.011
 fought+0.011
 hid+0.010
Neg logit contributions
um-0.016
err-0.016
umm-0.016
ummy-0.016
 surroundings-0.015
Token,
Token ID11
Feature activation-3.10e-02

Pos logit contributions
 let+0.052
 fought+0.047
 wouldn+0.046
 share+0.045
 hid+0.045
Neg logit contributions
ummy-0.082
um-0.081
 surroundings-0.079
ama-0.079
err-0.078
Token still
Token ID991
Feature activation-2.69e-02

Pos logit contributions
 let+0.056
 fought+0.055
 share+0.055
 it+0.054
 hid+0.054
Neg logit contributions
 surroundings-0.045
ama-0.045
 biggest-0.045
 world-0.044
um-0.044
Token smiling
Token ID16755
Feature activation-1.67e-03

Pos logit contributions
 let+0.041
 fought+0.039
 hid+0.038
 it+0.038
pped+0.038
Neg logit contributions
 surroundings-0.050
 world-0.050
 biggest-0.049
ummy-0.048
umm-0.047
Token from
Token ID422
Feature activation-1.12e-02

Pos logit contributions
 let+0.002
 it+0.002
 share+0.002
 hid+0.002
 fought+0.002
Neg logit contributions
umm-0.003
ummy-0.003
 surroundings-0.003
um-0.003
err-0.003
Token all
Token ID477
Feature activation+4.32e-03

Pos logit contributions
 let+0.021
 it+0.020
 hid+0.019
 share+0.019
 fought+0.018
Neg logit contributions
umm-0.020
ummy-0.019
err-0.019
um-0.019
 surroundings-0.019
Token their
Token ID511
Feature activation+6.42e-02

Pos logit contributions
ummy+0.011
umm+0.011
err+0.011
um+0.011
ini+0.011
Neg logit contributions
 let-0.004
 it-0.004
 hid-0.004
 share-0.004
 fought-0.004
Token fun
Token ID1257
Feature activation+2.26e-02

Pos logit contributions
umm+0.103
err+0.097
ini+0.097
um+0.097
umpy+0.097
Neg logit contributions
 let-0.134
 it-0.133
 share-0.122
 hid-0.121
 fought-0.121
Token.
Token ID13
Feature activation+1.42e-02

Pos logit contributions
ummy+0.042
 surroundings+0.042
umm+0.042
ini+0.041
 world+0.040
Neg logit contributions
 let-0.039
 hid-0.036
 it-0.036
 share-0.035
 fought-0.035
Token<|endoftext|>
Token ID50256
Feature activation+4.61e-02

Pos logit contributions
umm+0.026
ummy+0.025
ini+0.025
err+0.025
um+0.025
Neg logit contributions
 let-0.026
 it-0.025
 share-0.023
 hid-0.022
 fought-0.022
TokenOnce
Token ID7454
Feature activation+2.84e-02

Pos logit contributions
umm+0.067
ummy+0.063
ini+0.063
err+0.063
um+0.062
Neg logit contributions
 let-0.102
 it-0.101
 hid-0.092
 share-0.092
 fought-0.090
Token upon
Token ID2402
Feature activation+4.84e-03

Pos logit contributions
umm+0.053
err+0.048
ini+0.047
 surroundings+0.046
umpy+0.045
Neg logit contributions
 it-0.058
 let-0.056
 hid-0.049
 share-0.048
 don-0.047
Token<|endoftext|>
Token ID50256
Feature activation+5.64e-02

Pos logit contributions
umm+0.042
ini+0.041
 surroundings+0.041
ubby+0.040
um+0.039
Neg logit contributions
 it-0.030
 let-0.029
 share-0.025
 never-0.024
 whoever-0.023
TokenOnce
Token ID7454
Feature activation+3.21e-02

Pos logit contributions
umm+0.111
ini+0.105
ummy+0.105
 surroundings+0.104
err+0.103
Neg logit contributions
 let-0.099
 it-0.099
 hid-0.086
 share-0.086
 fought-0.082
Token upon
Token ID2402
Feature activation+7.10e-03

Pos logit contributions
umm+0.065
ini+0.057
err+0.056
 surroundings+0.056
umpy+0.053
Neg logit contributions
 it-0.064
 let-0.062
 hid-0.052
 share-0.052
 don-0.050
Token a
Token ID257
Feature activation+7.23e-02

Pos logit contributions
umm+0.016
err+0.016
ummy+0.016
um+0.015
ini+0.015
Neg logit contributions
 let-0.010
 it-0.009
 hid-0.008
 share-0.008
 fought-0.008
Token time
Token ID640
Feature activation+2.36e-02

Pos logit contributions
umm+0.148
ini+0.135
 surroundings+0.125
andel+0.123
ummy+0.122
Neg logit contributions
 let-0.144
 it-0.131
 never-0.127
 put-0.118
 hid-0.114
Token,
Token ID11
Feature activation+7.93e-03

Pos logit contributions
umm+0.066
ini+0.063
 surroundings+0.061
err+0.061
umpy+0.059
Neg logit contributions
 it-0.026
 let-0.025
 hid-0.020
 don-0.019
 share-0.019
Token in
Token ID287
Feature activation+3.04e-02

Pos logit contributions
umm+0.023
ini+0.022
ummy+0.022
err+0.022
 surroundings+0.022
Neg logit contributions
 let-0.007
 it-0.007
 hid-0.005
 share-0.005
 fought-0.005
Token a
Token ID257
Feature activation+5.26e-02

Pos logit contributions
umm+0.077
ummy+0.074
ini+0.074
um+0.073
err+0.073
Neg logit contributions
 let-0.037
 it-0.036
 share-0.030
 hid-0.029
 fought-0.028
Token tiny
Token ID7009
Feature activation+6.58e-02

Pos logit contributions
umm+0.091
ummy+0.080
err+0.077
ama+0.077
um+0.075
Neg logit contributions
 let-0.122
 it-0.120
 never-0.109
 put-0.108
 share-0.108
Token hut
Token ID40812
Feature activation-7.37e-03

Pos logit contributions
umm+0.113
err+0.103
ama+0.097
um+0.093
ini+0.091
Neg logit contributions
 it-0.147
 let-0.143
 put-0.130
 hid-0.127
 share-0.126
Token,
Token ID11
Feature activation-6.93e-03

Pos logit contributions
 let+0.011
 it+0.010
 hid+0.009
 share+0.009
 fought+0.009
Neg logit contributions
ummy-0.016
umm-0.016
um-0.016
ini-0.015
ama-0.015
Token believe
Token ID1975
Feature activation-2.46e-02

Pos logit contributions
 it+0.022
 risking+0.021
 let+0.020
 letting+0.018
 fought+0.018
Neg logit contributions
umm-0.050
um-0.049
ummy-0.048
umpy-0.048
 biggest-0.048
Token her
Token ID607
Feature activation+4.95e-02

Pos logit contributions
 fought+0.029
 hid+0.026
 let+0.026
 managed+0.025
 snatched+0.024
Neg logit contributions
ummy-0.061
um-0.057
umm-0.056
 surroundings-0.055
err-0.055
Token eyes
Token ID2951
Feature activation-1.66e-02

Pos logit contributions
ini+0.076
umpy+0.076
umm+0.075
err+0.072
um+0.066
Neg logit contributions
 it-0.107
 let-0.103
 hid-0.093
 share-0.092
 don-0.090
Token.
Token ID13
Feature activation+4.72e-03

Pos logit contributions
 let+0.024
 fought+0.023
 it+0.023
 share+0.023
 hid+0.023
Neg logit contributions
ummy-0.033
umm-0.033
um-0.032
 biggest-0.032
ay-0.032
Token
Token ID198
Feature activation+2.91e-02

Pos logit contributions
ini+0.010
ubby+0.010
umm+0.010
err+0.010
ooky+0.010
Neg logit contributions
 it-0.007
 let-0.007
 share-0.006
 whoever-0.006
 hid-0.006
Token
Token ID198
Feature activation+1.89e-02

Pos logit contributions
umm+0.068
ini+0.061
err+0.061
ubby+0.060
 surroundings+0.060
Neg logit contributions
 it-0.047
 let-0.045
 share-0.037
 never-0.036
 whoever-0.036
TokenThe
Token ID464
Feature activation+1.08e-01

Pos logit contributions
umm+0.031
ummy+0.029
err+0.029
 surroundings+0.029
ini+0.029
Neg logit contributions
 let-0.039
 it-0.038
 share-0.035
 hid-0.034
 never-0.033
Token sisters
Token ID15153
Feature activation-2.32e-02

Pos logit contributions
umm+0.200
ini+0.186
ubby+0.184
err+0.184
umpy+0.183
Neg logit contributions
 let-0.217
 it-0.198
 put-0.191
 hid-0.188
 share-0.184
Token spent
Token ID3377
Feature activation-1.73e-02

Pos logit contributions
 risking+0.027
 it+0.027
 let+0.025
 share+0.024
 losing+0.024
Neg logit contributions
um-0.053
ama-0.051
ummy-0.050
ay-0.049
olly-0.049
Token the
Token ID262
Feature activation+9.38e-02

Pos logit contributions
 let+0.020
 it+0.017
 share+0.016
 hid+0.016
 fought+0.016
Neg logit contributions
 surroundings-0.041
err-0.041
umm-0.041
um-0.041
ummy-0.041
Token day
Token ID1110
Feature activation-4.02e-03

Pos logit contributions
umm+0.148
ama+0.142
ubby+0.141
um+0.140
ummy+0.136
Neg logit contributions
 let-0.205
 it-0.191
 never-0.186
 hid-0.184
 share-0.184
Token was
Token ID373
Feature activation-3.19e-02

Pos logit contributions
 let+0.035
 it+0.035
 risking+0.033
 share+0.032
 fought+0.031
Neg logit contributions
err-0.066
 surroundings-0.064
umm-0.063
ini-0.063
 biggest-0.063
Token something
Token ID1223
Feature activation-3.15e-03

Pos logit contributions
 let+0.047
 fought+0.045
 hid+0.044
 argued+0.043
 snatched+0.043
Neg logit contributions
 biggest-0.062
ummy-0.061
err-0.061
um-0.061
 surroundings-0.060
Token fun
Token ID1257
Feature activation-8.51e-03

Pos logit contributions
pped+0.008
pping+0.008
 fought+0.008
 argued+0.007
 let+0.007
Neg logit contributions
 biggest-0.003
ummy-0.003
 best-0.003
 world-0.003
umm-0.003
Token.
Token ID13
Feature activation+5.27e-03

Pos logit contributions
pped+0.012
 let+0.012
 snatched+0.012
pping+0.012
 fought+0.011
Neg logit contributions
 world-0.017
ummy-0.016
 surroundings-0.016
 biggest-0.015
 best-0.015
Token
Token ID198
Feature activation+3.46e-02

Pos logit contributions
umm+0.011
ini+0.011
err+0.011
ummy+0.011
ubby+0.011
Neg logit contributions
 let-0.009
 it-0.009
 share-0.007
 hid-0.007
 whoever-0.007
Token
Token ID198
Feature activation+2.36e-02

Pos logit contributions
umm+0.074
ini+0.066
err+0.066
ummy+0.066
 surroundings+0.065
Neg logit contributions
 it-0.060
 let-0.059
 share-0.051
 hid-0.050
 never-0.049
TokenThey
Token ID2990
Feature activation-4.69e-02

Pos logit contributions
um+0.043
ama+0.043
 surroundings+0.042
ummy+0.042
err+0.042
Neg logit contributions
 let-0.040
 it-0.039
 hid-0.038
 fought-0.037
 share-0.036
Token came
Token ID1625
Feature activation-1.41e-02

Pos logit contributions
 risking+0.069
 it+0.060
 losing+0.058
pping+0.057
 letting+0.056
Neg logit contributions
um-0.090
 sky-0.084
ama-0.082
olly-0.082
 sun-0.080
Token inside
Token ID2641
Feature activation-3.60e-02

Pos logit contributions
 let+0.016
 it+0.016
 share+0.012
 hid+0.012
 never+0.011
Neg logit contributions
umm-0.040
ini-0.037
ummy-0.036
err-0.035
umpy-0.035
Token and
Token ID290
Feature activation-5.14e-02

Pos logit contributions
 let+0.050
 fought+0.046
 hid+0.046
 share+0.044
 it+0.044
Neg logit contributions
ummy-0.079
um-0.075
err-0.075
ama-0.074
 surroundings-0.074
Token put
Token ID1234
Feature activation-2.02e-02

Pos logit contributions
pping+0.066
 risking+0.060
pped+0.058
 it+0.053
assed+0.051
Neg logit contributions
err-0.101
 biggest-0.100
 girl-0.099
 sun-0.098
 surroundings-0.098
Token her
Token ID607
Feature activation+4.92e-02

Pos logit contributions
 let+0.033
 fought+0.030
 hid+0.030
 share+0.028
 managed+0.026
Neg logit contributions
ummy-0.089
um-0.088
umm-0.086
ama-0.085
ay-0.085
Token finger
Token ID7660
Feature activation-1.34e-02

Pos logit contributions
umm+0.079
ummy+0.078
um+0.076
err+0.076
ini+0.076
Neg logit contributions
 let-0.099
 it-0.096
 share-0.089
 hid-0.089
 fought-0.088
Token and
Token ID290
Feature activation-5.99e-02

Pos logit contributions
 let+0.021
 it+0.020
 share+0.019
 fought+0.019
 hid+0.019
Neg logit contributions
ummy-0.026
um-0.026
umm-0.025
ama-0.025
 surroundings-0.025
Token gave
Token ID2921
Feature activation-2.38e-02

Pos logit contributions
pping+0.066
assed+0.063
 risking+0.062
 share+0.058
 fought+0.057
Neg logit contributions
ama-0.125
um-0.122
 sun-0.122
ummy-0.122
 biggest-0.120
Token her
Token ID607
Feature activation+6.16e-03

Pos logit contributions
 let+0.019
 fought+0.016
 hid+0.015
 share+0.015
 risking+0.014
Neg logit contributions
ummy-0.064
um-0.064
umm-0.064
err-0.063
ay-0.063
Token a
Token ID257
Feature activation+2.78e-02

Pos logit contributions
ummy+0.013
err+0.013
um+0.013
 biggest+0.013
ama+0.013
Neg logit contributions
 let-0.009
 it-0.008
 fought-0.008
 hid-0.008
 share-0.008
Token hug
Token ID16225
Feature activation-1.86e-02

Pos logit contributions
umm+0.053
ummy+0.049
 surroundings+0.049
ini+0.048
err+0.048
Neg logit contributions
 let-0.054
 it-0.052
 share-0.047
 hid-0.046
 put-0.045
Token.
Token ID13
Feature activation-6.06e-03

Pos logit contributions
 risking+0.032
 fought+0.031
 let+0.031
 hid+0.031
 share+0.030
Neg logit contributions
ummy-0.032
um-0.031
ay-0.031
ama-0.030
umm-0.030
Token From
Token ID3574
Feature activation+8.99e-03

Pos logit contributions
 let+0.011
 it+0.010
 share+0.010
 hid+0.010
 fought+0.009
Neg logit contributions
umm-0.011
ummy-0.011
err-0.011
um-0.011
ini-0.011
Token that
Token ID326
Feature activation+3.25e-02

Pos logit contributions
ummy+0.025
um+0.024
umm+0.024
ama+0.024
 surroundings+0.024
Neg logit contributions
 let-0.008
 it-0.007
 share-0.006
 hid-0.006
 fought-0.006
Token day
Token ID1110
Feature activation-1.48e-02

Pos logit contributions
umm+0.039
 surroundings+0.036
ini+0.035
err+0.035
um+0.034
Neg logit contributions
 it-0.085
 let-0.085
 share-0.076
 never-0.075
 hid-0.074
Tokened
Token ID276
Feature activation-2.12e-02

Pos logit contributions
 let+0.024
 it+0.023
 share+0.023
 fought+0.021
 risking+0.021
Neg logit contributions
err-0.016
um-0.016
umpy-0.016
umm-0.016
ummy-0.016
Token and
Token ID290
Feature activation-4.83e-02

Pos logit contributions
 let+0.034
 share+0.032
 it+0.032
 fought+0.030
 risking+0.030
Neg logit contributions
err-0.040
 surroundings-0.039
umm-0.039
um-0.039
ummy-0.039
Token stopped
Token ID5025
Feature activation-2.85e-02

Pos logit contributions
pping+0.066
pped+0.062
 share+0.061
 let+0.059
 risking+0.059
Neg logit contributions
 sun-0.089
err-0.089
 biggest-0.088
ama-0.085
 sky-0.085
Token them
Token ID606
Feature activation+5.08e-03

Pos logit contributions
 let+0.040
 share+0.036
 it+0.036
 hid+0.035
 fought+0.035
Neg logit contributions
 surroundings-0.059
err-0.059
ummy-0.059
ama-0.058
ini-0.058
Token.
Token ID13
Feature activation+1.01e-02

Pos logit contributions
ummy+0.010
um+0.010
err+0.010
 surroundings+0.010
umm+0.010
Neg logit contributions
 let-0.008
 it-0.008
 share-0.007
 fought-0.007
 hid-0.007
Token
Token ID198
Feature activation+2.67e-02

Pos logit contributions
ini+0.023
umm+0.023
ummy+0.021
ubby+0.021
 surroundings+0.021
Neg logit contributions
 it-0.017
 let-0.016
 hid-0.014
 tried-0.013
 pulled-0.013
Token
Token ID198
Feature activation+1.70e-02

Pos logit contributions
umm+0.055
ini+0.051
ummy+0.051
err+0.051
 surroundings+0.050
Neg logit contributions
 let-0.047
 it-0.046
 hid-0.040
 share-0.040
 never-0.038
Token"
Token ID1
Feature activation-1.66e-02

Pos logit contributions
umm+0.031
ummy+0.030
err+0.030
ini+0.030
um+0.030
Neg logit contributions
 let-0.031
 it-0.030
 share-0.028
 hid-0.028
 fought-0.027
TokenRex
Token ID47389
Feature activation+2.05e-02

Pos logit contributions
 let+0.029
 it+0.028
 share+0.026
 hid+0.026
 fought+0.025
Neg logit contributions
umm-0.031
err-0.031
ini-0.030
um-0.030
ummy-0.030
Token,
Token ID11
Feature activation-4.73e-02

Pos logit contributions
ama+0.035
ummy+0.034
ubby+0.034
um+0.034
umpy+0.033
Neg logit contributions
 let-0.032
 risking-0.032
 it-0.032
 share-0.031
pped-0.031
Token what
Token ID644
Feature activation-9.99e-03

Pos logit contributions
 argued+0.053
 fought+0.052
pping+0.051
 shared+0.049
pped+0.049
Neg logit contributions
 biggest-0.098
ama-0.096
ay-0.096
err-0.094
ubby-0.094
Token like
Token ID588
Feature activation-2.14e-02

Pos logit contributions
 let+0.042
pped+0.039
 it+0.039
 managed+0.038
 hid+0.038
Neg logit contributions
 biggest-0.073
 surroundings-0.071
umpy-0.071
ummy-0.071
 world-0.071
Token cans
Token ID23916
Feature activation-7.98e-03

Pos logit contributions
 let+0.040
 share+0.038
pping+0.037
 managed+0.037
 hid+0.037
Neg logit contributions
 biggest-0.032
 world-0.031
ummy-0.031
 surroundings-0.030
 sun-0.030
Token and
Token ID290
Feature activation-8.45e-03

Pos logit contributions
 let+0.014
 managed+0.014
 risking+0.013
 fought+0.013
 whoever+0.013
Neg logit contributions
ummy-0.014
ay-0.013
um-0.013
 sky-0.012
 world-0.012
Token coins
Token ID10796
Feature activation-1.08e-02

Pos logit contributions
 let+0.018
 it+0.018
 share+0.017
pping+0.017
pped+0.017
Neg logit contributions
umm-0.011
 biggest-0.011
 surroundings-0.011
ummy-0.011
err-0.010
Token and
Token ID290
Feature activation-2.33e-02

Pos logit contributions
 let+0.020
 whoever+0.019
 managed+0.019
pped+0.019
 fought+0.019
Neg logit contributions
ummy-0.015
 biggest-0.014
 surroundings-0.014
ubby-0.014
um-0.014
Token papers
Token ID9473
Feature activation-3.52e-02

Pos logit contributions
 let+0.047
 it+0.046
pping+0.045
pped+0.044
 managed+0.043
Neg logit contributions
 biggest-0.032
umm-0.031
 surroundings-0.031
ummy-0.030
err-0.029
Token.
Token ID13
Feature activation+9.47e-03

Pos logit contributions
 let+0.064
 whoever+0.064
 risking+0.062
gency+0.061
 fought+0.061
Neg logit contributions
ummy-0.048
 biggest-0.047
ay-0.047
 sky-0.047
um-0.047
Token They
Token ID1119
Feature activation-4.45e-02

Pos logit contributions
umm+0.022
ini+0.022
err+0.021
ubby+0.021
 surroundings+0.021
Neg logit contributions
 it-0.014
 let-0.014
 share-0.012
 whoever-0.012
 hid-0.011
Token liked
Token ID8288
Feature activation-1.99e-02

Pos logit contributions
 risking+0.067
 it+0.060
pping+0.059
pped+0.059
 letting+0.057
Neg logit contributions
um-0.077
 biggest-0.076
 best-0.076
 sky-0.076
 girl-0.076
Token to
Token ID284
Feature activation-4.37e-02

Pos logit contributions
pped+0.026
pping+0.025
 let+0.024
 managed+0.024
ocated+0.022
Neg logit contributions
 world-0.041
 new-0.040
 biggest-0.040
 sun-0.040
 jungle-0.039
Token lift
Token ID10303
Feature activation-1.16e-02

Pos logit contributions
 risking+0.058
pped+0.055
 it+0.053
pping+0.052
 letting+0.051
Neg logit contributions
 biggest-0.086
um-0.086
ini-0.085
 surroundings-0.085
ay-0.084
Token started
Token ID2067
Feature activation-3.45e-02

Pos logit contributions
 risking+0.056
 it+0.050
 losing+0.048
 whoever+0.045
 letting+0.045
Neg logit contributions
um-0.082
ama-0.078
 sun-0.077
olly-0.077
 biggest-0.076
Token to
Token ID284
Feature activation-4.41e-02

Pos logit contributions
 let+0.047
 whoever+0.044
 hid+0.043
 share+0.043
 fought+0.042
Neg logit contributions
ummy-0.070
 surroundings-0.068
um-0.068
ama-0.068
ini-0.067
Token wipe
Token ID19916
Feature activation-2.63e-02

Pos logit contributions
 risking+0.076
assed+0.068
 it+0.068
 hid+0.068
 fought+0.067
Neg logit contributions
ini-0.075
um-0.072
 sun-0.071
ummy-0.071
ama-0.070
Token the
Token ID262
Feature activation+1.14e-01

Pos logit contributions
 let+0.024
 it+0.022
 hid+0.020
 share+0.019
 fought+0.019
Neg logit contributions
umm-0.071
ummy-0.070
um-0.069
err-0.069
 surroundings-0.069
Token counter
Token ID3753
Feature activation-1.75e-02

Pos logit contributions
umm+0.215
ama+0.209
err+0.206
ummy+0.205
um+0.204
Neg logit contributions
 let-0.206
 it-0.206
 hid-0.187
 share-0.181
 put-0.179
Token and
Token ID290
Feature activation-5.19e-02

Pos logit contributions
 let+0.027
 risking+0.026
 it+0.026
 fought+0.025
 hid+0.025
Neg logit contributions
um-0.033
ummy-0.033
umm-0.033
 biggest-0.032
 surroundings-0.032
Token the
Token ID262
Feature activation+9.55e-02

Pos logit contributions
pping+0.054
 risking+0.053
pped+0.049
assed+0.046
 let+0.042
Neg logit contributions
 sun-0.116
 biggest-0.113
 surroundings-0.110
lights-0.110
um-0.109
Token floor
Token ID4314
Feature activation-3.33e-02

Pos logit contributions
umm+0.163
ummy+0.159
err+0.158
um+0.157
ama+0.157
Neg logit contributions
 let-0.188
 it-0.185
 hid-0.168
 share-0.168
 fought-0.160
Token.
Token ID13
Feature activation-6.91e-04

Pos logit contributions
 risking+0.055
 let+0.054
 whoever+0.053
 it+0.051
 share+0.051
Neg logit contributions
um-0.057
ummy-0.057
lights-0.056
 biggest-0.056
umm-0.055
Token They
Token ID1119
Feature activation-4.66e-02

Pos logit contributions
 it+0.001
 let+0.001
 share+0.001
 hid+0.001
 whoever+0.001
Neg logit contributions
umm-0.002
ini-0.001
err-0.001
 surroundings-0.001
ubby-0.001
Token worked
Token ID3111
Feature activation-2.32e-02

Pos logit contributions
 risking+0.063
 letting+0.055
 it+0.054
assed+0.053
 whoever+0.052
Neg logit contributions
ini-0.089
um-0.088
 sky-0.087
 biggest-0.087
 sun-0.086
Token means
Token ID1724
Feature activation-1.10e-02

Pos logit contributions
 let+0.037
 it+0.036
 fought+0.033
 share+0.033
 hid+0.033
Neg logit contributions
err-0.033
ummy-0.032
umm-0.031
umpy-0.031
um-0.031
Token.
Token ID13
Feature activation+1.04e-03

Pos logit contributions
 let+0.018
 hid+0.018
 snatched+0.017
pped+0.017
 managed+0.017
Neg logit contributions
ummy-0.019
 world-0.018
um-0.018
ama-0.018
 surroundings-0.018
Token He
Token ID679
Feature activation-4.52e-02

Pos logit contributions
umm+0.002
ini+0.002
ummy+0.002
err+0.002
um+0.002
Neg logit contributions
 let-0.002
 it-0.002
 hid-0.002
 share-0.002
 fought-0.001
Token also
Token ID635
Feature activation-4.28e-02

Pos logit contributions
 let+0.073
 it+0.072
 share+0.066
 hid+0.065
 fought+0.065
Neg logit contributions
err-0.084
um-0.083
ummy-0.083
umm-0.083
 surroundings-0.082
Token lets
Token ID8781
Feature activation-4.31e-02

Pos logit contributions
 it+0.071
 let+0.071
 hid+0.063
 fought+0.063
 risking+0.063
Neg logit contributions
err-0.075
 biggest-0.073
ubby-0.073
um-0.073
 surroundings-0.072
Token them
Token ID606
Feature activation-2.74e-02

Pos logit contributions
 let+0.041
 it+0.038
 hid+0.035
 fought+0.035
 share+0.034
Neg logit contributions
um-0.110
umm-0.109
ummy-0.109
err-0.109
ama-0.108
Token touch
Token ID3638
Feature activation-1.95e-02

Pos logit contributions
 let+0.032
 it+0.031
 hid+0.027
 share+0.027
 fought+0.027
Neg logit contributions
umm-0.064
um-0.064
err-0.064
ummy-0.063
 surroundings-0.063
Token his
Token ID465
Feature activation+6.75e-02

Pos logit contributions
 let+0.016
 it+0.015
 hid+0.013
 share+0.013
 fought+0.012
Neg logit contributions
umm-0.054
ummy-0.053
err-0.053
um-0.053
ini-0.052
Token hat
Token ID6877
Feature activation-9.69e-03

Pos logit contributions
umm+0.127
ummy+0.120
ini+0.119
err+0.116
um+0.115
Neg logit contributions
 let-0.130
 it-0.127
 share-0.118
 hid-0.114
 fought-0.114
Token and
Token ID290
Feature activation-5.06e-02

Pos logit contributions
 let+0.015
 it+0.014
 hid+0.014
 fought+0.014
 risking+0.014
Neg logit contributions
ummy-0.019
um-0.019
ama-0.018
ay-0.018
umm-0.018
Token his
Token ID465
Feature activation+6.43e-02

Pos logit contributions
 let+0.070
pping+0.068
pped+0.066
 it+0.066
 hid+0.063
Neg logit contributions
 surroundings-0.098
 biggest-0.096
err-0.094
um-0.093
umm-0.093
Token be
Token ID307
Feature activation-8.98e-04

Pos logit contributions
 fought+0.056
 risking+0.055
 hid+0.055
 argued+0.055
pped+0.054
Neg logit contributions
 world-0.100
 sun-0.096
 biggest-0.096
um-0.096
ay-0.095
Token very
Token ID845
Feature activation+8.85e-03

Pos logit contributions
 let+0.002
 fought+0.002
 hid+0.002
 it+0.002
 wouldn+0.002
Neg logit contributions
 biggest-0.001
 surroundings-0.001
ubby-0.001
 world-0.001
um-0.001
Token lucky
Token ID9670
Feature activation-6.25e-03

Pos logit contributions
umm+0.014
ummy+0.013
err+0.013
um+0.013
ini+0.013
Neg logit contributions
 let-0.019
 it-0.018
 share-0.017
 hid-0.017
 fought-0.017
Token if
Token ID611
Feature activation-1.81e-02

Pos logit contributions
 fought+0.008
 let+0.008
pped+0.008
 snatched+0.008
 hid+0.008
Neg logit contributions
ummy-0.014
umm-0.013
 world-0.013
 biggest-0.013
um-0.013
Token you
Token ID345
Feature activation-2.86e-02

Pos logit contributions
 let+0.016
 hid+0.015
 fought+0.014
 share+0.014
 argued+0.013
Neg logit contributions
ummy-0.047
 surroundings-0.047
 biggest-0.047
ama-0.046
um-0.046
Token get
Token ID651
Feature activation-2.20e-02

Pos logit contributions
 it+0.034
 risking+0.034
 letting+0.032
 losing+0.032
 let+0.029
Neg logit contributions
umpy-0.061
ubby-0.060
um-0.060
ay-0.060
 world-0.059
Token to
Token ID284
Feature activation-4.77e-02

Pos logit contributions
 let+0.027
 hid+0.025
 fought+0.025
 share+0.023
 wouldn+0.023
Neg logit contributions
 biggest-0.050
 surroundings-0.049
um-0.048
ummy-0.047
ama-0.047
Token see
Token ID766
Feature activation-4.31e-02

Pos logit contributions
 hid+0.061
 wouldn+0.059
ocated+0.059
 fought+0.058
 snatched+0.058
Neg logit contributions
 sun-0.100
 biggest-0.098
um-0.097
 world-0.096
 surroundings-0.095
Token it
Token ID340
Feature activation-1.69e-02

Pos logit contributions
 let+0.021
 hid+0.019
 fought+0.019
 share+0.016
 argued+0.014
Neg logit contributions
ummy-0.129
 biggest-0.128
 surroundings-0.127
um-0.127
umm-0.126
Token."
Token ID526
Feature activation+2.59e-02

Pos logit contributions
 let+0.023
 it+0.022
 fought+0.021
 hid+0.021
 share+0.020
Neg logit contributions
ummy-0.036
umm-0.036
 biggest-0.035
um-0.035
 surroundings-0.035
Token
Token ID198
Feature activation+4.10e-02

Pos logit contributions
umm+0.053
err+0.050
ubby+0.050
 surroundings+0.050
ini+0.050
Neg logit contributions
 let-0.044
 it-0.044
 share-0.038
 hid-0.037
 fought-0.036
Token my
Token ID616
Feature activation+8.24e-02

Pos logit contributions
 let+0.020
 it+0.019
 fought+0.018
 hid+0.018
 share+0.018
Neg logit contributions
ummy-0.025
err-0.024
umm-0.024
 biggest-0.024
um-0.024
Token best
Token ID1266
Feature activation+5.76e-02

Pos logit contributions
umm+0.118
um+0.112
umpy+0.109
ama+0.107
ini+0.106
Neg logit contributions
 let-0.196
 it-0.193
 share-0.178
 hid-0.168
 put-0.168
Token friend
Token ID1545
Feature activation-2.87e-03

Pos logit contributions
umm+0.073
err+0.066
ini+0.064
um+0.063
ama+0.063
Neg logit contributions
 let-0.146
 it-0.144
 share-0.131
 hid-0.130
 put-0.125
Token.
Token ID13
Feature activation-1.52e-02

Pos logit contributions
 argued+0.004
 fought+0.004
 let+0.004
 risking+0.004
pped+0.004
Neg logit contributions
ummy-0.005
 world-0.005
ay-0.005
umpy-0.005
ubby-0.005
Token You
Token ID921
Feature activation-3.64e-02

Pos logit contributions
 let+0.021
 it+0.020
 hid+0.018
 share+0.018
 fought+0.018
Neg logit contributions
umm-0.034
ummy-0.033
err-0.033
um-0.033
ini-0.033
Token can
Token ID460
Feature activation-4.82e-02

Pos logit contributions
 letting+0.047
 risking+0.046
 it+0.045
pped+0.045
pping+0.044
Neg logit contributions
ay-0.072
 biggest-0.070
ubby-0.069
umpy-0.067
 world-0.067
Token be
Token ID307
Feature activation-6.39e-03

Pos logit contributions
 it+0.053
 let+0.053
 fought+0.050
 risking+0.049
 hid+0.049
Neg logit contributions
um-0.112
ummy-0.111
ama-0.109
ay-0.108
ini-0.106
Token friends
Token ID2460
Feature activation-6.25e-03

Pos logit contributions
 let+0.009
 it+0.008
 hid+0.008
 fought+0.008
 share+0.008
Neg logit contributions
umm-0.014
ummy-0.014
err-0.014
um-0.013
ini-0.013
Token too
Token ID1165
Feature activation+2.04e-03

Pos logit contributions
 argued+0.010
 hid+0.010
 fought+0.010
 managed+0.010
pped+0.010
Neg logit contributions
 world-0.012
umpy-0.011
ummy-0.011
 surroundings-0.011
ay-0.011
Token."
Token ID526
Feature activation+1.49e-02

Pos logit contributions
umm+0.006
ini+0.005
err+0.005
 surroundings+0.005
um+0.005
Neg logit contributions
 let-0.002
 it-0.002
 share-0.002
 hid-0.002
 never-0.002
Token
Token ID198
Feature activation+3.21e-02

Pos logit contributions
umm+0.033
ini+0.033
err+0.032
ubby+0.031
 surroundings+0.030
Neg logit contributions
 it-0.024
 let-0.024
 share-0.021
 hid-0.020
 him-0.018