TOP ACTIVATIONS
MAX = 0.128

. " A magic book ? Really ? How
Suddenly , a magic fairy appeared ! She was
of the hurt . A magic wind came and blew away
Lily . She said a magic word . But nothing happened
middle . She says a magic word and waits .
One day , a brave little boy heard the planet
side . He 's a smart dog ," the man said
a special surprise . A thoughtful gn ome was playing the
a miracle happened ! A magic fairy came and fixed the
needed a doctor . A thoughtful doctor came and looked at
One night , a magical fairy visited the film .
Just then , a magic fairy appeared with a wand
Just then , a magic fairy appeared with a wand
powers and she said a magic word that opened the gate
saw something extraordinary . A magic wand was hidden under the
together . He said a magic word and pulled them apart
" That 's a very thoughtful idea , Tim my ."
taste . Suddenly , a compassionate stranger appeared and welcomed her
hot . But then a brave firefighter came to save the
fairy princess , then a courageous knight , then a wise

MIN ACTIVATIONS
MIN = -0.165

slide and looked through the ey ep iece . He saw
, it 's very pretty isn 't it Tommy ," Mama
. This lovely muff in isn 't for us , it
reward . It was ch ocol ates and sweets !
the sky before they walked fore shadow ed the down pour
, M andy , it isn 't ours . We can
on the carpet ? That isn 't very nice ".
. They were called ost ric hes . Lily and Ben
is a sign that brains weren 't meant to be folded
was a box of ch ocol ates . Lily loved ch
giving him too many ch ocol ates ." The king learned
pl uck cel ery that isn 't grown . He showed
fix it if the steam isn 't available . But don
, the loud est voice isn 't the one that helps
" It means the tuna isn 't very fresh , Jimmy
, kicking is really fun isn 't it ? â
he knew the scary noises weren 't anything to worry about
drink he had been drinking wasn 't very good for him
, " No , mud isn 't for sharing ."
and run to the ost ric hes . "

INTERVAL 0.054 - 0.128
CONTAINS 5.960%

waved to the path . Suddenly , the path said hello
the day they experienced the magical spray of the butterfly .
it moved very fast . A little boy looked up at
around it . She was scared and was about to turn
. Tim my was so happy and thanked his mom my

INTERVAL -0.019 - 0.054
CONTAINS 64.991%

excited . She ran to the spell , but
. He finally made it back home . Mom
't fall down again . But he couldn 't use the
pass . When it arrived , the mouse was filled with
find out what is inside soon enough ." The

INTERVAL -0.092 - -0.019
CONTAINS 28.666%

The charming daughter was no longer scared and
Tim went outside and said , " L ily
dragon . All of the people were amazed because it was
free . She spun the wheel faster and faster , until
build a house for his dolls . He asked his mom

INTERVAL -0.165 - -0.092
CONTAINS 0.382%

cat with the big lip were very happy . They played
They were friends . <|endoftext|> L ily and Tom were playing
clothes and made sure everything was ready .
after it , but he wasn 't paying attention to where
, but still the web wasn 't finished .
Token.
Token ID13
Feature activation-2.82e-03

Pos logit contributions
+0.150
ha+0.143
www+0.140
ww+0.139
pool+0.139
Neg logit contributions
e-0.132
ow-0.126
years-0.109
 being-0.103
ad-0.103
Token
Token ID198
Feature activation+3.49e-02

Pos logit contributions
e+0.006
ow+0.005
years+0.005
ad+0.004
 eyel+0.004
Neg logit contributions
 trail-0.004
-0.004
 hill-0.004
www-0.004
ww-0.004
Token
Token ID198
Feature activation+5.23e-02

Pos logit contributions
+0.039
unt+0.038
ha+0.035
 trail+0.034
OK+0.034
Neg logit contributions
e-0.090
ow-0.084
years-0.078
ric-0.074
eking-0.072
Token"
Token ID1
Feature activation+2.54e-02

Pos logit contributions
storm+0.073
unt+0.072
 wagon+0.068
pool+0.066
 trail+0.065
Neg logit contributions
e-0.116
years-0.099
ad-0.086
ow-0.084
eking-0.080
TokenA
Token ID32
Feature activation+8.46e-02

Pos logit contributions
+0.032
unt+0.030
 trail+0.030
uddle+0.030
 wagon+0.030
Neg logit contributions
e-0.060
years-0.054
ow-0.050
 eyel-0.047
ad-0.046
Token magic
Token ID5536
Feature activation+1.28e-01

Pos logit contributions
pool+0.117
 wagon+0.116
 hill+0.108
+0.108
Okay+0.105
Neg logit contributions
e-0.190
ow-0.163
 eyel-0.148
years-0.142
 staring-0.131
Token book
Token ID1492
Feature activation+3.66e-02

Pos logit contributions
+0.183
pool+0.183
First+0.183
Together+0.182
s+0.175
Neg logit contributions
e-0.276
ow-0.253
 eyel-0.210
ric-0.200
ps-0.198
Token?
Token ID30
Feature activation+1.39e-02

Pos logit contributions
unt+0.060
+0.060
pool+0.059
OK+0.058
 wagon+0.058
Neg logit contributions
e-0.074
ow-0.069
years-0.053
eking-0.051
ps-0.050
Token Really
Token ID16123
Feature activation+6.82e-03

Pos logit contributions
+0.021
unt+0.020
 trail+0.019
ha+0.019
 hill+0.018
Neg logit contributions
e-0.032
ow-0.029
years-0.027
ric-0.025
 eyel-0.024
Token?
Token ID30
Feature activation+7.35e-03

Pos logit contributions
+0.011
unt+0.010
ha+0.010
 trail+0.009
OK+0.009
Neg logit contributions
e-0.015
ow-0.014
years-0.013
ric-0.012
eking-0.012
Token How
Token ID1374
Feature activation+2.77e-02

Pos logit contributions
+0.011
unt+0.010
 trail+0.010
 hill+0.010
ha+0.010
Neg logit contributions
e-0.017
ow-0.016
years-0.014
ric-0.013
 eyel-0.013
Token
Token ID198
Feature activation+3.65e-02

Pos logit contributions
unt+0.005
+0.004
ha+0.004
 trail+0.004
ww+0.004
Neg logit contributions
e-0.009
ow-0.008
years-0.007
ric-0.007
 eyel-0.007
Token
Token ID198
Feature activation+4.71e-02

Pos logit contributions
+0.043
unt+0.041
 trail+0.038
OK+0.038
 wagon+0.038
Neg logit contributions
e-0.093
ow-0.086
years-0.078
ric-0.073
eking-0.072
TokenSuddenly
Token ID38582
Feature activation+6.02e-02

Pos logit contributions
unt+0.054
+0.053
 wagon+0.047
 hill+0.046
 trail+0.046
Neg logit contributions
e-0.124
ow-0.109
years-0.101
eking-0.090
ad-0.089
Token,
Token ID11
Feature activation+4.62e-02

Pos logit contributions
+0.066
unt+0.060
ha+0.058
s+0.057
 man+0.054
Neg logit contributions
e-0.160
ow-0.148
years-0.145
ric-0.141
eking-0.136
Token a
Token ID257
Feature activation+7.72e-02

Pos logit contributions
+0.080
unt+0.074
ha+0.071
s+0.071
OK+0.071
Neg logit contributions
e-0.095
ow-0.086
years-0.081
ric-0.077
eking-0.074
Token magic
Token ID5536
Feature activation+1.26e-01

Pos logit contributions
+0.092
unt+0.083
s+0.083
pool+0.081
OK+0.080
Neg logit contributions
e-0.188
ow-0.175
years-0.162
 eyel-0.151
ric-0.150
Token fairy
Token ID25607
Feature activation+1.31e-02

Pos logit contributions
+0.184
unt+0.171
s+0.165
First+0.164
OK+0.161
Neg logit contributions
e-0.281
ow-0.263
 eyel-0.228
years-0.223
ric-0.219
Token appeared
Token ID4120
Feature activation-2.49e-02

Pos logit contributions
+0.024
unt+0.024
ha+0.023
s+0.023
 trail+0.022
Neg logit contributions
e-0.024
ow-0.022
years-0.021
ric-0.019
eking-0.018
Token!
Token ID0
Feature activation+1.17e-02

Pos logit contributions
e+0.061
years+0.056
ow+0.056
ric+0.055
 elbows+0.050
Neg logit contributions
s-0.029
-0.027
 trail-0.026
ha-0.025
unt-0.025
Token She
Token ID1375
Feature activation+5.71e-02

Pos logit contributions
+0.017
unt+0.017
ha+0.016
OK+0.016
 trail+0.016
Neg logit contributions
e-0.025
ow-0.024
years-0.021
 eyel-0.019
ric-0.018
Token was
Token ID373
Feature activation-6.30e-02

Pos logit contributions
+0.082
unt+0.080
ha+0.074
s+0.073
 trail+0.071
Neg logit contributions
e-0.132
ow-0.123
years-0.114
ric-0.106
eking-0.105
Token of
Token ID286
Feature activation-1.60e-02

Pos logit contributions
+0.018
unt+0.017
ha+0.016
s+0.015
pool+0.014
Neg logit contributions
e-0.070
ow-0.066
years-0.064
ric-0.064
eking-0.061
Token the
Token ID262
Feature activation+8.15e-03

Pos logit contributions
e+0.032
ow+0.030
years+0.028
ric+0.026
eking+0.025
Neg logit contributions
-0.026
unt-0.025
 hill-0.024
 man-0.024
 trail-0.024
Token hurt
Token ID5938
Feature activation+4.22e-02

Pos logit contributions
+0.012
unt+0.012
ha+0.011
s+0.010
+0.010
Neg logit contributions
e-0.019
ow-0.017
years-0.016
 eyel-0.015
eking-0.015
Token.
Token ID13
Feature activation+6.19e-03

Pos logit contributions
+0.049
unt+0.047
ha+0.042
OK+0.042
s+0.041
Neg logit contributions
e-0.110
ow-0.103
years-0.095
ric-0.089
eking-0.089
Token A
Token ID317
Feature activation+9.14e-02

Pos logit contributions
+0.007
unt+0.007
ha+0.007
 trail+0.006
s+0.006
Neg logit contributions
e-0.016
ow-0.015
years-0.014
ric-0.013
 eyel-0.013
Token magic
Token ID5536
Feature activation+1.26e-01

Pos logit contributions
+0.132
pool+0.127
s+0.123
oun+0.115
unt+0.114
Neg logit contributions
e-0.200
ow-0.172
 eyel-0.163
years-0.159
 staring-0.151
Token wind
Token ID2344
Feature activation+3.33e-02

Pos logit contributions
+0.183
unt+0.172
s+0.167
Together+0.163
+0.163
Neg logit contributions
e-0.284
ow-0.260
 eyel-0.227
years-0.218
eking-0.213
Token came
Token ID1625
Feature activation-6.48e-02

Pos logit contributions
+0.055
unt+0.054
ha+0.050
OK+0.049
 trail+0.049
Neg logit contributions
e-0.070
ow-0.064
years-0.057
ric-0.054
eking-0.053
Token and
Token ID290
Feature activation-1.63e-02

Pos logit contributions
e+0.153
years+0.136
ow+0.136
ric+0.123
eking+0.119
Neg logit contributions
s-0.084
unt-0.084
-0.084
 trail-0.079
ha-0.078
Token blew
Token ID17948
Feature activation+3.74e-02

Pos logit contributions
e+0.033
ow+0.031
years+0.028
eking+0.026
ric+0.025
Neg logit contributions
-0.028
unt-0.027
OK-0.025
ha-0.025
pool-0.024
Token away
Token ID1497
Feature activation+2.37e-02

Pos logit contributions
+0.065
unt+0.062
s+0.060
ha+0.060
 trail+0.058
Neg logit contributions
e-0.073
ow-0.069
years-0.064
ric-0.061
eking-0.058
Token Lily
Token ID20037
Feature activation-3.41e-02

Pos logit contributions
e+0.008
ow+0.008
years+0.007
eking+0.007
owing+0.007
Neg logit contributions
-0.006
unt-0.006
 man-0.006
s-0.006
 hill-0.006
Token.
Token ID13
Feature activation+7.82e-03

Pos logit contributions
e+0.083
ow+0.077
years+0.076
ric+0.070
eking+0.069
Neg logit contributions
-0.040
s-0.039
unt-0.038
 trail-0.035
 man-0.035
Token She
Token ID1375
Feature activation+6.61e-02

Pos logit contributions
+0.011
unt+0.011
ha+0.010
 trail+0.010
 hill+0.010
Neg logit contributions
e-0.018
ow-0.017
years-0.015
 eyel-0.014
ric-0.014
Token said
Token ID531
Feature activation+2.77e-02

Pos logit contributions
+0.086
unt+0.083
ha+0.075
 hill+0.074
 trail+0.074
Neg logit contributions
e-0.162
ow-0.150
years-0.140
ric-0.129
eking-0.129
Token a
Token ID257
Feature activation+7.95e-02

Pos logit contributions
+0.033
unt+0.031
s+0.029
ha+0.028
 man+0.027
Neg logit contributions
e-0.071
ow-0.066
years-0.063
ric-0.060
eking-0.059
Token magic
Token ID5536
Feature activation+1.26e-01

Pos logit contributions
+0.114
unt+0.105
OK+0.100
s+0.100
ha+0.100
Neg logit contributions
e-0.177
ow-0.163
ric-0.143
 eyel-0.142
years-0.141
Token word
Token ID1573
Feature activation+3.85e-02

Pos logit contributions
+0.189
unt+0.178
OK+0.163
s+0.162
ha+0.161
Neg logit contributions
e-0.282
ow-0.264
years-0.231
ric-0.224
 eyel-0.222
Token.
Token ID13
Feature activation+4.57e-03

Pos logit contributions
+0.064
unt+0.060
OK+0.056
ha+0.056
 hill+0.055
Neg logit contributions
e-0.082
ow-0.076
years-0.068
ric-0.062
eking-0.061
Token But
Token ID887
Feature activation+1.98e-02

Pos logit contributions
+0.007
unt+0.007
ha+0.007
 trail+0.007
 hill+0.006
Neg logit contributions
e-0.010
ow-0.009
years-0.008
 eyel-0.008
ric-0.007
Token nothing
Token ID2147
Feature activation+7.36e-02

Pos logit contributions
+0.026
unt+0.025
 man+0.024
s+0.023
OK+0.023
Neg logit contributions
e-0.047
ow-0.044
years-0.041
ric-0.040
eking-0.038
Token happened
Token ID3022
Feature activation-1.42e-02

Pos logit contributions
+0.124
unt+0.121
ha+0.113
s+0.112
OK+0.111
Neg logit contributions
e-0.151
ow-0.139
years-0.129
ric-0.118
eking-0.116
Token middle
Token ID3504
Feature activation+1.37e-02

Pos logit contributions
+0.008
unt+0.008
ha+0.007
OK+0.007
 hill+0.007
Neg logit contributions
e-0.019
ow-0.018
years-0.017
ric-0.016
eking-0.016
Token.
Token ID13
Feature activation-9.98e-03

Pos logit contributions
+0.019
unt+0.019
OK+0.017
ha+0.017
+0.016
Neg logit contributions
e-0.033
ow-0.030
years-0.027
ric-0.025
eking-0.025
Token She
Token ID1375
Feature activation+6.20e-02

Pos logit contributions
e+0.023
ow+0.022
years+0.020
 eyel+0.018
ric+0.018
Neg logit contributions
-0.014
unt-0.013
 trail-0.013
ha-0.013
 hill-0.013
Token says
Token ID1139
Feature activation+2.78e-02

Pos logit contributions
+0.091
unt+0.087
ha+0.080
s+0.080
OK+0.079
Neg logit contributions
e-0.142
ow-0.131
years-0.122
ric-0.113
eking-0.112
Token a
Token ID257
Feature activation+7.72e-02

Pos logit contributions
+0.034
unt+0.033
ha+0.030
s+0.030
OK+0.029
Neg logit contributions
e-0.070
ow-0.065
years-0.061
ric-0.059
eking-0.058
Token magic
Token ID5536
Feature activation+1.26e-01

Pos logit contributions
+0.105
unt+0.100
s+0.096
pool+0.094
First+0.090
Neg logit contributions
e-0.175
ow-0.155
 eyel-0.144
years-0.139
ric-0.138
Token word
Token ID1573
Feature activation+3.69e-02

Pos logit contributions
+0.205
unt+0.198
Together+0.186
s+0.184
First+0.182
Neg logit contributions
e-0.261
ow-0.243
 eyel-0.206
ps-0.198
ric-0.195
Token and
Token ID290
Feature activation-1.35e-02

Pos logit contributions
+0.064
unt+0.061
OK+0.058
 trail+0.056
 hill+0.056
Neg logit contributions
e-0.075
ow-0.071
years-0.060
 eyel-0.056
ric-0.055
Token waits
Token ID28364
Feature activation+2.63e-02

Pos logit contributions
e+0.029
ow+0.027
years+0.025
ric+0.023
eking+0.023
Neg logit contributions
-0.021
unt-0.021
ha-0.019
s-0.019
OK-0.019
Token.
Token ID13
Feature activation-3.53e-04

Pos logit contributions
+0.036
unt+0.034
ha+0.032
s+0.031
 trail+0.031
Neg logit contributions
e-0.062
ow-0.058
years-0.054
ric-0.051
eking-0.049
Token
Token ID198
Feature activation+4.26e-02

Pos logit contributions
e+0.001
ow+0.001
years+0.001
 eyel+0.001
ric+0.001
Neg logit contributions
-0.000
unt-0.000
 trail-0.000
ha-0.000
 hill-0.000
Token
Token ID198
Feature activation+4.97e-02

Pos logit contributions
+0.042
unt+0.040
ha+0.037
 trail+0.036
OK+0.036
Neg logit contributions
e-0.102
ow-0.095
years-0.088
ric-0.083
eking-0.082
TokenOne
Token ID3198
Feature activation+7.91e-02

Pos logit contributions
+0.063
 wagon+0.059
unt+0.059
 trail+0.057
 hill+0.056
Neg logit contributions
e-0.126
years-0.106
ow-0.105
eking-0.092
ric-0.091
Token day
Token ID1110
Feature activation+7.18e-02

Pos logit contributions
+0.080
unt+0.075
ha+0.067
OK+0.066
s+0.064
Neg logit contributions
e-0.219
ow-0.204
years-0.191
ric-0.176
eking-0.174
Token,
Token ID11
Feature activation+5.18e-02

Pos logit contributions
+0.097
unt+0.093
ha+0.085
s+0.084
OK+0.083
Neg logit contributions
e-0.173
ow-0.161
years-0.150
ric-0.139
eking-0.138
Token a
Token ID257
Feature activation+8.21e-02

Pos logit contributions
+0.113
unt+0.112
ha+0.106
s+0.104
 trail+0.104
Neg logit contributions
e-0.081
ow-0.072
years-0.064
ric-0.054
eking-0.054
Token brave
Token ID14802
Feature activation+1.25e-01

Pos logit contributions
+0.109
s+0.103
pool+0.100
First+0.099
unt+0.098
Neg logit contributions
e-0.184
ow-0.164
years-0.159
 eyel-0.146
ric-0.137
Token little
Token ID1310
Feature activation+7.53e-02

Pos logit contributions
+0.142
unt+0.131
s+0.122
OK+0.118
ha+0.115
Neg logit contributions
e-0.326
ow-0.302
years-0.279
 eyel-0.259
ric-0.258
Token boy
Token ID2933
Feature activation-3.62e-02

Pos logit contributions
+0.077
unt+0.075
Together+0.069
First+0.068
ha+0.066
Neg logit contributions
e-0.196
ow-0.185
years-0.168
 eyel-0.157
owing-0.155
Token heard
Token ID2982
Feature activation+1.46e-02

Pos logit contributions
e+0.072
ow+0.069
years+0.065
ric+0.062
eking+0.060
Neg logit contributions
-0.061
unt-0.057
ha-0.055
s-0.055
 man-0.054
Token the
Token ID262
Feature activation+1.12e-02

Pos logit contributions
+0.025
unt+0.025
s+0.024
ha+0.023
 man+0.023
Neg logit contributions
e-0.029
ow-0.026
years-0.025
ric-0.023
eking-0.023
Token planet
Token ID5440
Feature activation+2.31e-02

Pos logit contributions
+0.017
unt+0.017
ha+0.015
+0.015
s+0.015
Neg logit contributions
e-0.025
ow-0.022
years-0.020
eking-0.019
 eyel-0.018
Token side
Token ID1735
Feature activation-1.94e-02

Pos logit contributions
e+0.060
ow+0.054
ric+0.046
years+0.046
eking+0.043
Neg logit contributions
unt-0.051
-0.049
ha-0.047
OK-0.045
ww-0.045
Token.
Token ID13
Feature activation+4.25e-03

Pos logit contributions
e+0.045
ow+0.042
years+0.035
owing+0.033
ric+0.033
Neg logit contributions
unt-0.028
-0.028
OK-0.026
-0.026
ha-0.026
Token He
Token ID679
Feature activation+6.53e-02

Pos logit contributions
+0.006
unt+0.005
ha+0.005
 trail+0.005
s+0.005
Neg logit contributions
e-0.010
ow-0.009
years-0.009
ric-0.008
 eyel-0.008
Token's
Token ID338
Feature activation-2.08e-02

Pos logit contributions
+0.104
unt+0.103
ha+0.094
OK+0.092
 trail+0.092
Neg logit contributions
e-0.141
ow-0.129
years-0.117
ric-0.109
eking-0.108
Token a
Token ID257
Feature activation+6.47e-02

Pos logit contributions
e+0.042
ow+0.039
years+0.034
ric+0.032
eking+0.032
Neg logit contributions
-0.037
unt-0.036
ha-0.034
s-0.033
OK-0.033
Token smart
Token ID4451
Feature activation+1.24e-01

Pos logit contributions
+0.085
s+0.079
pool+0.078
unt+0.077
Together+0.075
Neg logit contributions
e-0.141
ow-0.121
ric-0.118
 eyel-0.116
years-0.115
Token dog
Token ID3290
Feature activation-1.36e-02

Pos logit contributions
+0.161
unt+0.154
s+0.150
Together+0.144
ha+0.137
Neg logit contributions
e-0.300
ow-0.272
ric-0.235
 eyel-0.234
years-0.232
Token,"
Token ID553
Feature activation+1.31e-02

Pos logit contributions
e+0.031
ow+0.028
years+0.024
ric+0.023
eking+0.023
Neg logit contributions
-0.021
unt-0.021
OK-0.019
ha-0.019
 trail-0.019
Token the
Token ID262
Feature activation+4.90e-02

Pos logit contributions
+0.020
unt+0.018
ha+0.018
pool+0.018
 trail+0.018
Neg logit contributions
e-0.029
ow-0.026
years-0.025
 eyel-0.022
ric-0.022
Token man
Token ID582
Feature activation-4.69e-02

Pos logit contributions
+0.068
ha+0.065
pool+0.065
unt+0.061
www+0.059
Neg logit contributions
e-0.109
ow-0.099
years-0.092
 elbows-0.091
 eyel-0.090
Token said
Token ID531
Feature activation+1.15e-02

Pos logit contributions
e+0.132
ow+0.123
years+0.122
ric+0.119
eking+0.113
Neg logit contributions
-0.039
unt-0.038
s-0.038
 man-0.035
ha-0.032
Token a
Token ID257
Feature activation+6.00e-02

Pos logit contributions
+0.013
unt+0.012
ha+0.012
s+0.012
OK+0.012
Neg logit contributions
e-0.010
ow-0.009
years-0.008
ric-0.007
eking-0.007
Token special
Token ID2041
Feature activation+1.04e-01

Pos logit contributions
+0.079
pool+0.074
unt+0.072
Together+0.070
OK+0.069
Neg logit contributions
e-0.134
ow-0.114
ric-0.110
 staring-0.107
years-0.105
Token surprise
Token ID5975
Feature activation+4.17e-02

Pos logit contributions
+0.129
unt+0.118
s+0.114
Together+0.111
pool+0.109
Neg logit contributions
e-0.257
ow-0.236
years-0.209
 eyel-0.205
ric-0.205
Token.
Token ID13
Feature activation+8.97e-04

Pos logit contributions
+0.067
unt+0.064
OK+0.059
ha+0.058
s+0.058
Neg logit contributions
e-0.090
ow-0.083
years-0.075
eking-0.070
 eyel-0.068
Token A
Token ID317
Feature activation+9.10e-02

Pos logit contributions
+0.001
unt+0.001
 trail+0.001
ha+0.001
s+0.001
Neg logit contributions
e-0.002
ow-0.002
years-0.002
ric-0.002
 eyel-0.002
Token thoughtful
Token ID22677
Feature activation+1.24e-01

Pos logit contributions
+0.124
pool+0.123
s+0.109
First+0.108
OK+0.106
Neg logit contributions
e-0.199
years-0.162
ow-0.161
 eyel-0.156
 staring-0.154
Token gn
Token ID19967
Feature activation+3.15e-02

Pos logit contributions
+0.137
unt+0.123
s+0.117
OK+0.110
ha+0.110
Neg logit contributions
e-0.329
ow-0.308
years-0.283
 eyel-0.265
ric-0.265
Tokenome
Token ID462
Feature activation+1.65e-02

Pos logit contributions
+0.045
unt+0.043
s+0.040
ha+0.040
 trail+0.039
Neg logit contributions
e-0.073
ow-0.068
years-0.063
ric-0.058
eking-0.058
Token was
Token ID373
Feature activation-6.20e-02

Pos logit contributions
+0.026
unt+0.025
ha+0.023
s+0.023
OK+0.023
Neg logit contributions
e-0.036
ow-0.033
years-0.031
ric-0.028
eking-0.028
Token playing
Token ID2712
Feature activation+7.78e-03

Pos logit contributions
e+0.104
ow+0.097
years+0.089
ric+0.084
 eyel+0.079
Neg logit contributions
unt-0.118
-0.115
 man-0.114
ha-0.112
s-0.108
Token the
Token ID262
Feature activation-1.38e-02

Pos logit contributions
+0.013
unt+0.013
ha+0.012
s+0.012
OK+0.012
Neg logit contributions
e-0.016
ow-0.015
years-0.014
ric-0.012
eking-0.012
Token a
Token ID257
Feature activation+7.95e-02

Pos logit contributions
+0.070
unt+0.068
ha+0.063
OK+0.062
s+0.062
Neg logit contributions
e-0.088
ow-0.081
years-0.073
ric-0.067
eking-0.066
Token miracle
Token ID20820
Feature activation+9.14e-02

Pos logit contributions
+0.109
s+0.101
pool+0.098
unt+0.097
OK+0.093
Neg logit contributions
e-0.182
ow-0.167
years-0.158
 eyel-0.146
ric-0.142
Token happened
Token ID3022
Feature activation-1.40e-02

Pos logit contributions
+0.181
unt+0.173
OK+0.164
 hill+0.163
+0.162
Neg logit contributions
e-0.160
ow-0.147
years-0.124
eking-0.118
ric-0.112
Token!
Token ID0
Feature activation+1.56e-03

Pos logit contributions
e+0.034
ow+0.032
years+0.030
ric+0.028
eking+0.027
Neg logit contributions
-0.018
unt-0.017
s-0.016
ha-0.016
OK-0.016
Token A
Token ID317
Feature activation+8.93e-02

Pos logit contributions
+0.002
unt+0.002
OK+0.002
 trail+0.002
ha+0.002
Neg logit contributions
e-0.004
ow-0.004
years-0.003
ric-0.003
 eyel-0.003
Token magic
Token ID5536
Feature activation+1.24e-01

Pos logit contributions
+0.124
pool+0.117
s+0.111
OK+0.110
First+0.106
Neg logit contributions
e-0.201
ow-0.180
years-0.172
 eyel-0.159
ric-0.156
Token fairy
Token ID25607
Feature activation+8.85e-03

Pos logit contributions
+0.197
unt+0.179
s+0.179
OK+0.173
+0.172
Neg logit contributions
e-0.264
ow-0.244
 eyel-0.210
years-0.208
ric-0.200
Token came
Token ID1625
Feature activation-6.26e-02

Pos logit contributions
+0.017
unt+0.016
ha+0.015
OK+0.015
s+0.015
Neg logit contributions
e-0.016
ow-0.015
years-0.013
ric-0.012
eking-0.012
Token and
Token ID290
Feature activation-1.20e-02

Pos logit contributions
e+0.141
ow+0.126
years+0.125
ric+0.114
eking+0.112
Neg logit contributions
s-0.087
unt-0.084
ha-0.081
-0.080
 trail-0.080
Token fixed
Token ID5969
Feature activation+3.92e-02

Pos logit contributions
e+0.024
ow+0.022
years+0.021
eking+0.019
ric+0.018
Neg logit contributions
-0.020
unt-0.020
OK-0.018
 hill-0.018
ha-0.018
Token the
Token ID262
Feature activation+1.42e-02

Pos logit contributions
+0.062
unt+0.061
s+0.059
 hill+0.058
ha+0.058
Neg logit contributions
e-0.081
ow-0.075
years-0.071
ric-0.064
eking-0.063
Token needed
Token ID2622
Feature activation+3.61e-02

Pos logit contributions
+0.043
unt+0.040
s+0.038
ha+0.037
 trail+0.037
Neg logit contributions
e-0.063
ow-0.059
years-0.055
ric-0.052
eking-0.051
Token a
Token ID257
Feature activation+5.25e-02

Pos logit contributions
+0.050
unt+0.050
ha+0.046
 few+0.046
 trail+0.045
Neg logit contributions
e-0.082
ow-0.076
years-0.074
ric-0.071
eking-0.068
Token doctor
Token ID6253
Feature activation-4.20e-03

Pos logit contributions
+0.067
unt+0.063
ha+0.059
s+0.058
OK+0.058
Neg logit contributions
e-0.126
ow-0.120
years-0.108
 eyel-0.103
ric-0.102
Token.
Token ID13
Feature activation+8.73e-03

Pos logit contributions
e+0.010
ow+0.009
years+0.009
ric+0.008
eking+0.008
Neg logit contributions
-0.006
unt-0.006
ha-0.005
OK-0.005
 trail-0.005
Token A
Token ID317
Feature activation+8.83e-02

Pos logit contributions
+0.013
unt+0.013
ha+0.012
 trail+0.012
 hill+0.012
Neg logit contributions
e-0.019
ow-0.018
years-0.016
 eyel-0.015
ric-0.014
Token thoughtful
Token ID22677
Feature activation+1.23e-01

Pos logit contributions
+0.118
unt+0.109
s+0.103
pool+0.103
ha+0.102
Neg logit contributions
e-0.210
ow-0.195
years-0.180
 eyel-0.168
eking-0.166
Token doctor
Token ID6253
Feature activation+8.61e-03

Pos logit contributions
+0.117
unt+0.111
ha+0.097
OK+0.094
s+0.094
Neg logit contributions
e-0.345
ow-0.324
years-0.307
ric-0.289
eking-0.287
Token came
Token ID1625
Feature activation-5.67e-02

Pos logit contributions
+0.015
unt+0.014
 trail+0.013
 hill+0.013
ha+0.013
Neg logit contributions
e-0.018
ow-0.016
years-0.015
ric-0.013
eking-0.013
Token and
Token ID290
Feature activation-1.11e-02

Pos logit contributions
e+0.138
ow+0.125
years+0.125
 eyel+0.121
oken+0.119
Neg logit contributions
s-0.064
unt-0.058
 few-0.057
-0.057
 man-0.056
Token looked
Token ID3114
Feature activation+2.08e-02

Pos logit contributions
e+0.024
ow+0.023
years+0.021
ric+0.019
eking+0.019
Neg logit contributions
-0.017
unt-0.016
ha-0.015
OK-0.015
s-0.015
Token at
Token ID379
Feature activation-1.90e-02

Pos logit contributions
+0.036
unt+0.035
ha+0.033
OK+0.033
 trail+0.033
Neg logit contributions
e-0.041
ow-0.038
years-0.035
eking-0.032
ric-0.031
Token
Token ID198
Feature activation+4.94e-02

Pos logit contributions
+0.044
unt+0.041
 trail+0.038
ha+0.038
OK+0.037
Neg logit contributions
e-0.103
ow-0.096
years-0.089
ric-0.083
eking-0.082
TokenOne
Token ID3198
Feature activation+8.05e-02

Pos logit contributions
+0.067
 wagon+0.060
unt+0.060
 hill+0.058
 trail+0.058
Neg logit contributions
e-0.123
ow-0.105
years-0.104
eking-0.090
ric-0.087
Token night
Token ID1755
Feature activation+7.10e-02

Pos logit contributions
+0.083
unt+0.076
ha+0.069
OK+0.068
+0.067
Neg logit contributions
e-0.223
ow-0.208
years-0.193
ric-0.177
 eyel-0.177
Token,
Token ID11
Feature activation+5.22e-02

Pos logit contributions
+0.100
unt+0.096
OK+0.089
ha+0.088
 trail+0.087
Neg logit contributions
e-0.166
ow-0.154
years-0.140
ric-0.130
eking-0.129
Token a
Token ID257
Feature activation+8.06e-02

Pos logit contributions
+0.112
unt+0.110
ha+0.104
s+0.103
OK+0.102
Neg logit contributions
e-0.084
ow-0.075
years-0.067
ric-0.059
eking-0.058
Token magical
Token ID10883
Feature activation+1.23e-01

Pos logit contributions
+0.108
s+0.104
pool+0.103
First+0.099
OK+0.094
Neg logit contributions
e-0.175
ow-0.156
years-0.150
 eyel-0.142
 staring-0.132
Token fairy
Token ID25607
Feature activation+1.30e-02

Pos logit contributions
First+0.168
Together+0.158
+0.156
s+0.155
pool+0.147
Neg logit contributions
e-0.283
ow-0.254
 eyel-0.222
ad-0.208
 staring-0.206
Token visited
Token ID8672
Feature activation+1.54e-02

Pos logit contributions
+0.024
unt+0.024
ha+0.022
OK+0.022
s+0.022
Neg logit contributions
e-0.024
ow-0.022
years-0.020
ric-0.018
eking-0.018
Token the
Token ID262
Feature activation+1.29e-02

Pos logit contributions
+0.026
unt+0.025
s+0.024
ha+0.024
 man+0.024
Neg logit contributions
e-0.031
ow-0.029
years-0.027
ric-0.026
eking-0.025
Token film
Token ID2646
Feature activation+3.70e-02

Pos logit contributions
pool+0.018
unt+0.018
+0.018
ha+0.017
OK+0.017
Neg logit contributions
e-0.030
ow-0.024
years-0.023
 eyel-0.021
 being-0.021
Token.
Token ID13
Feature activation+1.49e-02

Pos logit contributions
unt+0.057
+0.057
ha+0.055
ww+0.054
www+0.053
Neg logit contributions
e-0.082
ow-0.075
years-0.066
owing-0.060
ric-0.059
Token
Token ID198
Feature activation+4.75e-02

Pos logit contributions
+0.040
unt+0.038
 hill+0.035
OK+0.034
 trail+0.034
Neg logit contributions
e-0.094
ow-0.088
years-0.082
ric-0.076
eking-0.075
TokenJust
Token ID5703
Feature activation+3.64e-02

Pos logit contributions
+0.059
 hill+0.056
unt+0.054
 man+0.053
 wagon+0.053
Neg logit contributions
e-0.116
ow-0.103
years-0.102
eking-0.092
 eyel-0.092
Token then
Token ID788
Feature activation+4.89e-02

Pos logit contributions
+0.041
unt+0.036
OK+0.034
 man+0.034
ha+0.033
Neg logit contributions
e-0.095
ow-0.089
years-0.087
ric-0.082
eking-0.077
Token,
Token ID11
Feature activation+4.60e-02

Pos logit contributions
+0.055
unt+0.050
ha+0.047
s+0.045
+0.044
Neg logit contributions
e-0.131
ow-0.122
ric-0.120
years-0.119
eking-0.114
Token a
Token ID257
Feature activation+8.07e-02

Pos logit contributions
+0.098
unt+0.096
ha+0.090
OK+0.090
s+0.090
Neg logit contributions
e-0.074
ow-0.066
years-0.059
eking-0.052
ric-0.051
Token magic
Token ID5536
Feature activation+1.23e-01

Pos logit contributions
+0.102
s+0.095
pool+0.091
unt+0.089
OK+0.086
Neg logit contributions
e-0.188
ow-0.174
years-0.166
 eyel-0.153
ric-0.146
Token fairy
Token ID25607
Feature activation+1.21e-02

Pos logit contributions
+0.171
unt+0.156
s+0.155
+0.146
First+0.145
Neg logit contributions
e-0.285
ow-0.265
years-0.234
 eyel-0.232
eking-0.225
Token appeared
Token ID4120
Feature activation-2.44e-02

Pos logit contributions
+0.022
unt+0.022
ha+0.021
s+0.021
 trail+0.020
Neg logit contributions
e-0.022
ow-0.021
years-0.019
ric-0.018
eking-0.017
Token with
Token ID351
Feature activation-2.71e-02

Pos logit contributions
e+0.059
ow+0.054
years+0.053
ric+0.052
eking+0.047
Neg logit contributions
-0.029
s-0.029
unt-0.027
 trail-0.027
ha-0.026
Token a
Token ID257
Feature activation+5.69e-02

Pos logit contributions
e+0.049
ow+0.046
ric+0.042
owing+0.040
eking+0.040
Neg logit contributions
-0.047
ha-0.046
unt-0.046
 few-0.045
 wagon-0.043
Token wand
Token ID11569
Feature activation+5.27e-02

Pos logit contributions
+0.074
unt+0.071
OK+0.066
s+0.066
ha+0.064
Neg logit contributions
e-0.135
ow-0.125
years-0.115
 eyel-0.108
ric-0.108
Token
Token ID198
Feature activation+4.75e-02

Pos logit contributions
+0.040
unt+0.038
 hill+0.035
OK+0.034
 trail+0.034
Neg logit contributions
e-0.094
ow-0.088
years-0.082
ric-0.076
eking-0.075
TokenJust
Token ID5703
Feature activation+3.64e-02

Pos logit contributions
+0.059
 hill+0.056
unt+0.054
 man+0.053
 wagon+0.053
Neg logit contributions
e-0.116
ow-0.103
years-0.102
eking-0.092
 eyel-0.092
Token then
Token ID788
Feature activation+4.89e-02

Pos logit contributions
+0.041
unt+0.036
OK+0.034
 man+0.034
ha+0.033
Neg logit contributions
e-0.095
ow-0.089
years-0.087
ric-0.082
eking-0.077
Token,
Token ID11
Feature activation+4.60e-02

Pos logit contributions
+0.055
unt+0.050
ha+0.047
s+0.045
+0.044
Neg logit contributions
e-0.131
ow-0.122
ric-0.120
years-0.119
eking-0.114
Token a
Token ID257
Feature activation+8.07e-02

Pos logit contributions
+0.098
unt+0.096
ha+0.090
OK+0.090
s+0.090
Neg logit contributions
e-0.074
ow-0.066
years-0.059
eking-0.052
ric-0.051
Token magic
Token ID5536
Feature activation+1.23e-01

Pos logit contributions
+0.102
s+0.095
pool+0.091
unt+0.089
OK+0.086
Neg logit contributions
e-0.188
ow-0.174
years-0.166
 eyel-0.153
ric-0.146
Token fairy
Token ID25607
Feature activation+1.21e-02

Pos logit contributions
+0.171
unt+0.156
s+0.155
+0.146
First+0.145
Neg logit contributions
e-0.285
ow-0.265
years-0.234
 eyel-0.232
eking-0.225
Token appeared
Token ID4120
Feature activation-2.44e-02

Pos logit contributions
+0.022
unt+0.022
ha+0.021
s+0.021
 trail+0.020
Neg logit contributions
e-0.022
ow-0.021
years-0.019
ric-0.018
eking-0.017
Token with
Token ID351
Feature activation-2.71e-02

Pos logit contributions
e+0.059
ow+0.054
years+0.053
ric+0.052
eking+0.047
Neg logit contributions
-0.029
s-0.029
unt-0.027
 trail-0.027
ha-0.026
Token a
Token ID257
Feature activation+5.69e-02

Pos logit contributions
e+0.049
ow+0.046
ric+0.042
owing+0.040
eking+0.040
Neg logit contributions
-0.047
ha-0.046
unt-0.046
 few-0.045
 wagon-0.043
Token wand
Token ID11569
Feature activation+5.27e-02

Pos logit contributions
+0.074
unt+0.071
OK+0.066
s+0.066
ha+0.064
Neg logit contributions
e-0.135
ow-0.125
years-0.115
 eyel-0.108
ric-0.108
Token powers
Token ID5635
Feature activation+5.89e-02

Pos logit contributions
+0.024
unt+0.023
ha+0.021
OK+0.020
s+0.020
Neg logit contributions
e-0.047
ow-0.045
years-0.041
eking-0.038
ric-0.038
Token and
Token ID290
Feature activation-6.46e-03

Pos logit contributions
+0.079
unt+0.076
ha+0.069
s+0.069
 trail+0.067
Neg logit contributions
e-0.141
ow-0.131
years-0.123
ric-0.114
eking-0.113
Token she
Token ID673
Feature activation+5.45e-02

Pos logit contributions
e+0.014
ow+0.013
years+0.012
ric+0.011
eking+0.011
Neg logit contributions
-0.010
unt-0.010
ha-0.009
s-0.009
OK-0.009
Token said
Token ID531
Feature activation+2.47e-02

Pos logit contributions
+0.085
unt+0.081
ha+0.077
s+0.077
OK+0.074
Neg logit contributions
e-0.119
ow-0.109
years-0.102
ric-0.095
eking-0.094
Token a
Token ID257
Feature activation+7.67e-02

Pos logit contributions
+0.036
unt+0.035
ha+0.032
OK+0.031
s+0.031
Neg logit contributions
e-0.057
ow-0.053
years-0.049
eking-0.045
ric-0.045
Token magic
Token ID5536
Feature activation+1.23e-01

Pos logit contributions
+0.104
s+0.097
unt+0.094
OK+0.091
+0.089
Neg logit contributions
e-0.169
ow-0.162
years-0.143
 eyel-0.141
ric-0.140
Token word
Token ID1573
Feature activation+3.45e-02

Pos logit contributions
+0.198
unt+0.181
s+0.176
Together+0.171
First+0.170
Neg logit contributions
e-0.255
ow-0.248
 eyel-0.209
years-0.203
ric-0.202
Token that
Token ID326
Feature activation+4.84e-02

Pos logit contributions
+0.059
unt+0.054
OK+0.051
ww+0.050
 hill+0.050
Neg logit contributions
e-0.073
ow-0.069
years-0.059
 eyel-0.054
eking-0.054
Token opened
Token ID4721
Feature activation+8.92e-03

Pos logit contributions
+0.064
unt+0.064
ha+0.059
s+0.058
OK+0.057
Neg logit contributions
e-0.114
ow-0.105
years-0.100
ric-0.094
eking-0.092
Token the
Token ID262
Feature activation+1.05e-02

Pos logit contributions
+0.013
unt+0.012
ha+0.011
s+0.011
OK+0.011
Neg logit contributions
e-0.020
ow-0.019
years-0.018
ric-0.016
eking-0.016
Token gate
Token ID8946
Feature activation+2.03e-03

Pos logit contributions
+0.012
unt+0.012
ha+0.011
s+0.011
OK+0.010
Neg logit contributions
e-0.027
ow-0.025
years-0.024
ric-0.022
eking-0.022
Token saw
Token ID2497
Feature activation-3.87e-03

Pos logit contributions
+0.030
s+0.027
ha+0.025
+0.024
 man+0.024
Neg logit contributions
e-0.063
ow-0.061
ric-0.055
years-0.055
eking-0.054
Token something
Token ID1223
Feature activation+5.36e-02

Pos logit contributions
e+0.007
ow+0.007
years+0.007
eking+0.006
owing+0.006
Neg logit contributions
-0.007
unt-0.007
 wagon-0.006
 man-0.006
s-0.006
Token extraordinary
Token ID11359
Feature activation+5.46e-02

Pos logit contributions
+0.072
unt+0.069
ha+0.063
s+0.062
OK+0.062
Neg logit contributions
e-0.129
ow-0.120
years-0.112
ric-0.104
eking-0.103
Token.
Token ID13
Feature activation+3.11e-03

Pos logit contributions
+0.077
unt+0.073
s+0.069
ha+0.068
 trail+0.067
Neg logit contributions
e-0.127
ow-0.118
years-0.112
ric-0.103
eking-0.101
Token A
Token ID317
Feature activation+8.77e-02

Pos logit contributions
+0.004
unt+0.004
ha+0.003
 trail+0.003
OK+0.003
Neg logit contributions
e-0.008
ow-0.007
years-0.007
ric-0.006
 eyel-0.006
Token magic
Token ID5536
Feature activation+1.23e-01

Pos logit contributions
+0.100
First+0.096
pool+0.095
OK+0.093
Okay+0.090
Neg logit contributions
e-0.202
ow-0.179
years-0.171
 eyel-0.167
ric-0.162
Token wand
Token ID11569
Feature activation+5.72e-02

Pos logit contributions
+0.174
unt+0.169
First+0.158
s+0.157
OK+0.156
Neg logit contributions
e-0.278
ow-0.251
 eyel-0.223
years-0.220
eking-0.213
Token was
Token ID373
Feature activation-8.36e-02

Pos logit contributions
+0.107
unt+0.106
OK+0.099
ha+0.096
 hill+0.095
Neg logit contributions
e-0.108
ow-0.098
years-0.085
eking-0.081
ric-0.078
Token hidden
Token ID7104
Feature activation+4.71e-02

Pos logit contributions
e+0.163
ow+0.150
years+0.143
owing+0.126
ric+0.125
Neg logit contributions
 man-0.132
unt-0.132
s-0.130
-0.128
ha-0.128
Token under
Token ID739
Feature activation-3.63e-02

Pos logit contributions
+0.096
unt+0.093
ha+0.088
s+0.087
OK+0.087
Neg logit contributions
e-0.081
ow-0.073
years-0.066
ric-0.059
eking-0.058
Token the
Token ID262
Feature activation-3.94e-03

Pos logit contributions
e+0.063
ow+0.061
years+0.055
eking+0.050
ric+0.049
Neg logit contributions
-0.070
unt-0.067
ha-0.065
s-0.065
 hill-0.065
Token together
Token ID1978
Feature activation+6.22e-02

Pos logit contributions
e+0.063
ow+0.059
years+0.056
ric+0.051
eking+0.049
Neg logit contributions
-0.056
s-0.052
unt-0.052
ha-0.050
 trail-0.050
Token.
Token ID13
Feature activation+4.71e-03

Pos logit contributions
+0.086
unt+0.083
ha+0.076
OK+0.074
s+0.074
Neg logit contributions
e-0.147
ow-0.136
years-0.127
ric-0.117
eking-0.117
Token He
Token ID679
Feature activation+6.21e-02

Pos logit contributions
 trail+0.009
ha+0.009
ww+0.009
unt+0.008
www+0.008
Neg logit contributions
e-0.008
ow-0.008
years-0.006
azes-0.006
 eyel-0.006
Token said
Token ID531
Feature activation+2.40e-02

Pos logit contributions
+0.103
unt+0.101
ha+0.093
 trail+0.091
OK+0.091
Neg logit contributions
e-0.130
ow-0.118
years-0.108
ric-0.099
eking-0.098
Token a
Token ID257
Feature activation+7.37e-02

Pos logit contributions
+0.034
unt+0.032
ha+0.029
OK+0.029
 trail+0.029
Neg logit contributions
e-0.057
ow-0.053
years-0.049
ric-0.045
eking-0.045
Token magic
Token ID5536
Feature activation+1.23e-01

Pos logit contributions
+0.088
unt+0.083
ha+0.077
s+0.076
OK+0.074
Neg logit contributions
e-0.182
ow-0.163
ric-0.149
years-0.147
 eyel-0.146
Token word
Token ID1573
Feature activation+3.66e-02

Pos logit contributions
+0.155
unt+0.151
First+0.145
Together+0.143
s+0.133
Neg logit contributions
e-0.295
ow-0.274
 eyel-0.234
ric-0.233
ps-0.232
Token and
Token ID290
Feature activation-1.31e-02

Pos logit contributions
+0.062
unt+0.058
OK+0.054
ha+0.053
ww+0.053
Neg logit contributions
e-0.078
ow-0.073
years-0.062
ric-0.058
 eyel-0.058
Token pulled
Token ID5954
Feature activation-7.73e-04

Pos logit contributions
e+0.028
ow+0.026
years+0.024
ric+0.022
eking+0.022
Neg logit contributions
-0.021
unt-0.020
ha-0.019
s-0.019
OK-0.019
Token them
Token ID606
Feature activation-2.67e-02

Pos logit contributions
e+0.002
ow+0.002
years+0.002
ric+0.002
eking+0.002
Neg logit contributions
s-0.001
-0.001
ha-0.001
unt-0.001
 man-0.001
Token apart
Token ID5475
Feature activation+1.91e-02

Pos logit contributions
e+0.055
ow+0.050
years+0.045
eking+0.041
ric+0.041
Neg logit contributions
-0.046
unt-0.045
ha-0.042
OK-0.041
 trail-0.040
Token "
Token ID366
Feature activation+6.78e-03

Pos logit contributions
+0.047
ha+0.043
unt+0.043
 trail+0.042
ww+0.041
Neg logit contributions
e-0.090
ow-0.086
years-0.081
eking-0.075
ric-0.075
TokenThat
Token ID2504
Feature activation+3.65e-02

Pos logit contributions
+0.009
 trail+0.008
uddle+0.008
 wagon+0.008
 hill+0.008
Neg logit contributions
e-0.015
years-0.015
ow-0.013
Acc-0.013
ad-0.012
Token's
Token ID338
Feature activation-3.40e-02

Pos logit contributions
unt+0.075
ha+0.073
pool+0.072
+0.071
Sure+0.071
Neg logit contributions
e-0.054
ow-0.047
ric-0.038
 staring-0.033
 eyel-0.033
Token a
Token ID257
Feature activation+5.88e-02

Pos logit contributions
e+0.073
ow+0.067
years+0.064
ric+0.057
eking+0.057
Neg logit contributions
-0.056
unt-0.054
OK-0.050
ha-0.049
-0.048
Token very
Token ID845
Feature activation+3.18e-02

Pos logit contributions
unt+0.089
s+0.086
+0.085
ha+0.083
pool+0.080
Neg logit contributions
e-0.127
ow-0.107
years-0.097
ric-0.096
 eyel-0.093
Token thoughtful
Token ID22677
Feature activation+1.23e-01

Pos logit contributions
+0.047
pool+0.044
unt+0.044
ha+0.042
 trail+0.042
Neg logit contributions
e-0.066
ow-0.063
years-0.057
eking-0.054
 eyel-0.052
Token idea
Token ID2126
Feature activation+5.36e-02

Pos logit contributions
unt+0.157
+0.156
Together+0.148
+0.142
First+0.141
Neg logit contributions
e-0.289
ow-0.267
owing-0.223
ric-0.218
ad-0.217
Token,
Token ID11
Feature activation+3.23e-02

Pos logit contributions
+0.089
unt+0.087
 trail+0.086
pool+0.084
Together+0.083
Neg logit contributions
e-0.111
ow-0.103
owing-0.081
years-0.079
ps-0.078
Token Tim
Token ID5045
Feature activation-4.79e-02

Pos logit contributions
+0.074
pool+0.068
unt+0.067
s+0.066
+0.065
Neg logit contributions
e-0.039
ow-0.036
 being-0.033
 finding-0.030
 saving-0.029
Tokenmy
Token ID1820
Feature activation-6.52e-03

Pos logit contributions
e+0.068
ow+0.060
years+0.050
ric+0.045
eking+0.044
Neg logit contributions
-0.111
unt-0.109
 trail-0.104
ha-0.103
 hill-0.103
Token."
Token ID526
Feature activation+3.28e-03

Pos logit contributions
e+0.012
ow+0.011
owing+0.009
years+0.008
ps+0.008
Neg logit contributions
unt-0.012
 trail-0.012
ha-0.011
pool-0.011
Sure-0.011
Token taste
Token ID6938
Feature activation+1.55e-02

Pos logit contributions
+0.085
unt+0.082
ha+0.075
s+0.074
OK+0.073
Neg logit contributions
e-0.144
ow-0.134
years-0.124
ric-0.115
eking-0.114
Token.
Token ID13
Feature activation+1.18e-03

Pos logit contributions
+0.024
unt+0.023
ha+0.021
OK+0.021
 trail+0.021
Neg logit contributions
e-0.034
ow-0.032
years-0.029
eking-0.027
ric-0.027
Token Suddenly
Token ID24975
Feature activation+5.75e-02

Pos logit contributions
+0.001
unt+0.001
ha+0.001
 trail+0.001
OK+0.001
Neg logit contributions
e-0.003
ow-0.003
years-0.003
ric-0.002
 eyel-0.002
Token,
Token ID11
Feature activation+3.48e-02

Pos logit contributions
+0.066
unt+0.064
ha+0.058
s+0.057
+0.054
Neg logit contributions
e-0.151
ow-0.139
years-0.136
ric-0.129
eking-0.125
Token a
Token ID257
Feature activation+7.59e-02

Pos logit contributions
+0.058
unt+0.056
ha+0.052
s+0.051
OK+0.051
Neg logit contributions
e-0.074
ow-0.067
years-0.063
ric-0.059
eking-0.058
Token compassionate
Token ID32533
Feature activation+1.22e-01

Pos logit contributions
+0.084
s+0.075
unt+0.074
pool+0.074
OK+0.071
Neg logit contributions
e-0.188
ow-0.174
years-0.165
 eyel-0.156
ric-0.152
Token stranger
Token ID16195
Feature activation-5.64e-03

Pos logit contributions
+0.110
unt+0.099
s+0.090
ha+0.088
OK+0.088
Neg logit contributions
e-0.344
ow-0.326
years-0.302
ric-0.284
 eyel-0.283
Token appeared
Token ID4120
Feature activation-2.63e-02

Pos logit contributions
e+0.012
ow+0.011
years+0.011
ric+0.011
eking+0.010
Neg logit contributions
-0.009
unt-0.008
s-0.008
ha-0.008
 man-0.008
Token and
Token ID290
Feature activation-3.87e-03

Pos logit contributions
e+0.070
ow+0.064
ric+0.063
years+0.063
eking+0.058
Neg logit contributions
s-0.026
-0.025
unt-0.023
 trail-0.023
-0.022
Token welcomed
Token ID15846
Feature activation+3.17e-02

Pos logit contributions
e+0.008
ow+0.007
years+0.007
eking+0.006
 eyel+0.006
Neg logit contributions
-0.006
pool-0.006
 hill-0.006
 trail-0.006
unt-0.005
Token her
Token ID607
Feature activation-2.38e-02

Pos logit contributions
+0.059
unt+0.057
ha+0.054
s+0.053
OK+0.052
Neg logit contributions
e-0.060
ow-0.054
years-0.051
ric-0.047
eking-0.046
Token hot
Token ID3024
Feature activation+3.55e-02

Pos logit contributions
e+0.047
ow+0.043
years+0.040
ric+0.037
eking+0.037
Neg logit contributions
-0.035
unt-0.033
ha-0.031
OK-0.031
s-0.030
Token.
Token ID13
Feature activation+3.08e-04

Pos logit contributions
+0.061
unt+0.058
OK+0.057
ha+0.056
Sure+0.055
Neg logit contributions
e-0.072
ow-0.067
years-0.054
owing-0.054
eking-0.053
Token But
Token ID887
Feature activation+9.45e-03

Pos logit contributions
+0.000
unt+0.000
ha+0.000
 trail+0.000
 hill+0.000
Neg logit contributions
e-0.001
ow-0.001
years-0.001
ric-0.001
 eyel-0.001
Token then
Token ID788
Feature activation+4.22e-02

Pos logit contributions
+0.013
unt+0.012
 hill+0.011
s+0.011
+0.011
Neg logit contributions
e-0.022
ow-0.021
years-0.020
ric-0.018
eking-0.018
Token a
Token ID257
Feature activation+8.00e-02

Pos logit contributions
+0.059
unt+0.057
ha+0.052
s+0.051
OK+0.051
Neg logit contributions
e-0.100
ow-0.092
years-0.086
ric-0.080
eking-0.079
Token brave
Token ID14802
Feature activation+1.22e-01

Pos logit contributions
+0.105
s+0.102
pool+0.099
First+0.095
unt+0.091
Neg logit contributions
e-0.178
ow-0.161
years-0.153
 eyel-0.147
oken-0.134
Token firefighter
Token ID43105
Feature activation+2.03e-02

Pos logit contributions
+0.125
unt+0.112
s+0.108
OK+0.103
ha+0.100
Neg logit contributions
e-0.331
ow-0.311
years-0.284
 eyel-0.268
ric-0.268
Token came
Token ID1625
Feature activation-5.76e-02

Pos logit contributions
+0.035
unt+0.033
ha+0.032
s+0.031
 man+0.031
Neg logit contributions
e-0.040
ow-0.037
years-0.035
ric-0.032
eking-0.032
Token to
Token ID284
Feature activation-3.05e-02

Pos logit contributions
e+0.139
ow+0.127
years+0.127
 eyel+0.117
ric+0.116
Neg logit contributions
s-0.071
-0.070
unt-0.066
 hill-0.064
 man-0.063
Token save
Token ID3613
Feature activation-3.25e-03

Pos logit contributions
e+0.065
ow+0.062
years+0.060
eking+0.055
ric+0.054
Neg logit contributions
-0.049
unt-0.044
 man-0.044
 hill-0.043
OK-0.042
Token the
Token ID262
Feature activation+9.10e-03

Pos logit contributions
e+0.007
years+0.007
ow+0.007
ric+0.006
 eyel+0.006
Neg logit contributions
-0.005
 man-0.005
unt-0.005
s-0.004
 hill-0.004
Token fairy
Token ID25607
Feature activation+1.21e-02

Pos logit contributions
+0.088
pool+0.084
s+0.082
First+0.081
Together+0.077
Neg logit contributions
e-0.135
years-0.114
ow-0.113
 eyel-0.113
 staring-0.103
Token princess
Token ID21752
Feature activation+3.59e-03

Pos logit contributions
+0.021
unt+0.020
ha+0.019
OK+0.019
s+0.018
Neg logit contributions
e-0.025
ow-0.023
years-0.021
ric-0.019
eking-0.019
Token,
Token ID11
Feature activation+2.34e-02

Pos logit contributions
+0.006
unt+0.006
ha+0.005
s+0.005
OK+0.005
Neg logit contributions
e-0.007
ow-0.007
years-0.006
ric-0.006
eking-0.006
Token then
Token ID788
Feature activation+2.91e-02

Pos logit contributions
+0.039
unt+0.038
 man+0.036
 trail+0.036
s+0.036
Neg logit contributions
e-0.047
ow-0.043
years-0.040
eking-0.038
ric-0.037
Token a
Token ID257
Feature activation+7.05e-02

Pos logit contributions
+0.047
unt+0.045
ha+0.042
s+0.042
OK+0.042
Neg logit contributions
e-0.062
ow-0.057
years-0.053
ric-0.048
eking-0.048
Token courageous
Token ID34010
Feature activation+1.22e-01

Pos logit contributions
+0.086
unt+0.079
s+0.078
pool+0.076
OK+0.076
Neg logit contributions
e-0.173
ow-0.156
years-0.149
 eyel-0.140
ric-0.137
Token knight
Token ID22062
Feature activation+8.45e-03

Pos logit contributions
+0.153
unt+0.145
s+0.134
ha+0.132
OK+0.131
Neg logit contributions
e-0.303
ow-0.281
years-0.261
ric-0.243
eking-0.241
Token,
Token ID11
Feature activation+1.84e-02

Pos logit contributions
+0.013
unt+0.013
ha+0.012
OK+0.012
s+0.012
Neg logit contributions
e-0.018
ow-0.017
years-0.015
ric-0.014
eking-0.014
Token then
Token ID788
Feature activation+2.62e-02

Pos logit contributions
+0.031
unt+0.029
s+0.027
ha+0.027
 trail+0.027
Neg logit contributions
e-0.038
ow-0.035
years-0.032
ric-0.030
eking-0.030
Token a
Token ID257
Feature activation+6.87e-02

Pos logit contributions
+0.043
unt+0.041
OK+0.038
ha+0.038
s+0.038
Neg logit contributions
e-0.055
ow-0.051
years-0.046
eking-0.042
ric-0.042
Token wise
Token ID10787
Feature activation+9.29e-02

Pos logit contributions
+0.087
s+0.080
unt+0.079
pool+0.077
OK+0.077
Neg logit contributions
e-0.165
ow-0.148
years-0.141
 eyel-0.133
ric-0.130
Token slide
Token ID10649
Feature activation-1.74e-02

Pos logit contributions
e+0.008
ow+0.007
years+0.007
ric+0.006
eking+0.006
Neg logit contributions
-0.004
unt-0.004
ha-0.003
 trail-0.003
 hill-0.003
Token and
Token ID290
Feature activation-2.90e-02

Pos logit contributions
e+0.037
ow+0.035
years+0.030
ric+0.028
 eyel+0.028
Neg logit contributions
-0.028
unt-0.028
ha-0.026
ww-0.025
www-0.025
Token looked
Token ID3114
Feature activation+1.34e-04

Pos logit contributions
e+0.062
ow+0.058
years+0.054
ric+0.050
eking+0.049
Neg logit contributions
-0.046
unt-0.044
s-0.042
ha-0.041
OK-0.041
Token through
Token ID832
Feature activation-2.88e-02

Pos logit contributions
unt+0.000
 trail+0.000
ha+0.000
+0.000
ww+0.000
Neg logit contributions
e-0.000
ow-0.000
years-0.000
eking-0.000
owing-0.000
Token the
Token ID262
Feature activation-4.23e-03

Pos logit contributions
e+0.056
ow+0.050
years+0.046
ric+0.041
eking+0.041
Neg logit contributions
-0.053
unt-0.052
ha-0.048
OK-0.048
s-0.047
Token ey
Token ID1926
Feature activation-1.52e-01

Pos logit contributions
e+0.009
ow+0.008
years+0.007
ric+0.007
eking+0.007
Neg logit contributions
-0.007
unt-0.007
ha-0.007
s-0.006
OK-0.006
Tokenep
Token ID538
Feature activation-2.18e-02

Pos logit contributions
ow+0.289
e+0.283
years+0.281
 eyel+0.256
ric+0.248
Neg logit contributions
s-0.232
-0.230
ha-0.218
unt-0.211
"-0.209
Tokeniece
Token ID8535
Feature activation-5.55e-03

Pos logit contributions
e+0.047
ow+0.044
years+0.042
ric+0.038
eking+0.037
Neg logit contributions
-0.032
s-0.031
unt-0.030
 man-0.030
ha-0.029
Token.
Token ID13
Feature activation-1.04e-02

Pos logit contributions
e+0.013
ow+0.013
years+0.012
ric+0.011
eking+0.011
Neg logit contributions
-0.007
unt-0.007
s-0.006
ha-0.006
 trail-0.006
Token He
Token ID679
Feature activation+5.07e-02

Pos logit contributions
e+0.026
ow+0.024
years+0.023
 eyel+0.021
ric+0.021
Neg logit contributions
-0.013
unt-0.012
ha-0.011
ww-0.011
 trail-0.011
Token saw
Token ID2497
Feature activation-2.01e-03

Pos logit contributions
+0.069
s+0.063
unt+0.062
 man+0.062
OK+0.061
Neg logit contributions
e-0.119
ow-0.114
years-0.105
ric-0.101
eking-0.099
Token,
Token ID11
Feature activation+2.29e-02

Pos logit contributions
 trail+0.015
First+0.014
unt+0.014
 hill+0.014
+0.014
Neg logit contributions
e-0.016
ow-0.016
ad-0.012
 eyel-0.012
eking-0.012
Token it
Token ID340
Feature activation+2.51e-02

Pos logit contributions
+0.036
s+0.034
unt+0.033
OK+0.033
 trail+0.032
Neg logit contributions
e-0.047
ow-0.043
years-0.037
ad-0.034
ric-0.034
Token's
Token ID338
Feature activation-5.72e-02

Pos logit contributions
unt+0.040
+0.039
OK+0.036
 hill+0.036
 trail+0.036
Neg logit contributions
e-0.054
ow-0.050
eking-0.041
 eyel-0.040
ric-0.040
Token very
Token ID845
Feature activation-2.61e-02

Pos logit contributions
e+0.123
ow+0.113
years+0.105
ric+0.096
eking+0.095
Neg logit contributions
-0.093
unt-0.090
ha-0.083
s-0.082
OK-0.082
Token pretty
Token ID2495
Feature activation+2.56e-02

Pos logit contributions
e+0.059
ow+0.056
years+0.047
eking+0.046
 eyel+0.044
Neg logit contributions
unt-0.036
-0.035
 wagon-0.034
 hill-0.034
-0.034
Token isn
Token ID2125
Feature activation-1.46e-01

Pos logit contributions
unt+0.033
OK+0.032
Okay+0.032
Together+0.031
Sure+0.031
Neg logit contributions
e-0.061
ow-0.057
 eyel-0.047
years-0.046
owing-0.046
Token't
Token ID470
Feature activation-4.74e-02

Pos logit contributions
e+0.330
ow+0.305
years+0.287
ric+0.263
eking+0.262
Neg logit contributions
-0.217
unt-0.208
ha-0.192
s-0.192
 hill-0.186
Token it
Token ID340
Feature activation-1.71e-02

Pos logit contributions
e+0.117
ow+0.110
years+0.096
eking+0.094
ric+0.093
Neg logit contributions
-0.060
unt-0.057
OK-0.056
Okay-0.053
-0.052
Token Tommy
Token ID19919
Feature activation-7.86e-02

Pos logit contributions
e+0.039
ow+0.039
eking+0.032
years+0.031
owing+0.031
Neg logit contributions
OK-0.023
-0.022
unt-0.022
Okay-0.022
 hill-0.021
Token,"
Token ID553
Feature activation-7.06e-03

Pos logit contributions
e+0.195
ow+0.184
years+0.159
ric+0.150
owing+0.150
Neg logit contributions
unt-0.098
-0.096
 trail-0.092
OK-0.091
 hill-0.090
Token Mama
Token ID40084
Feature activation-5.11e-02

Pos logit contributions
e+0.017
ow+0.016
years+0.014
ric+0.013
eking+0.013
Neg logit contributions
-0.010
unt-0.009
ha-0.008
 hill-0.008
 trail-0.008
Token.
Token ID13
Feature activation-2.16e-03

Pos logit contributions
e+0.116
ow+0.108
years+0.092
eking+0.087
ric+0.087
Neg logit contributions
-0.072
unt-0.071
 trail-0.068
 hill-0.066
ha-0.065
Token This
Token ID770
Feature activation+5.97e-02

Pos logit contributions
e+0.005
ow+0.005
years+0.004
ric+0.004
 eyel+0.004
Neg logit contributions
-0.003
unt-0.003
 trail-0.003
ha-0.003
 hill-0.002
Token lovely
Token ID14081
Feature activation+5.41e-02

Pos logit contributions
+0.093
unt+0.092
+0.088
Together+0.087
OK+0.087
Neg logit contributions
e-0.127
ow-0.118
ric-0.098
 eyel-0.095
eking-0.090
Token muff
Token ID27563
Feature activation-3.43e-02

Pos logit contributions
+0.073
unt+0.069
ha+0.065
s+0.065
OK+0.064
Neg logit contributions
e-0.128
ow-0.118
years-0.103
ric-0.102
 eyel-0.101
Tokenin
Token ID259
Feature activation-1.75e-02

Pos logit contributions
e+0.072
ow+0.067
years+0.062
 eyel+0.059
ric+0.058
Neg logit contributions
-0.055
s-0.051
unt-0.051
ha-0.049
OK-0.047
Token isn
Token ID2125
Feature activation-1.44e-01

Pos logit contributions
e+0.043
ow+0.039
years+0.034
ric+0.032
eking+0.032
Neg logit contributions
unt-0.024
-0.023
 trail-0.022
ha-0.021
OK-0.021
Token't
Token ID470
Feature activation-4.61e-02

Pos logit contributions
e+0.335
ow+0.308
years+0.295
ric+0.269
eking+0.268
Neg logit contributions
-0.204
unt-0.197
s-0.183
ha-0.182
 hill-0.173
Token for
Token ID329
Feature activation-5.16e-02

Pos logit contributions
e+0.105
ow+0.097
years+0.090
ric+0.083
eking+0.083
Neg logit contributions
-0.068
unt-0.066
ha-0.061
s-0.060
OK-0.059
Token us
Token ID514
Feature activation-5.23e-02

Pos logit contributions
e+0.107
ow+0.098
years+0.092
ric+0.085
eking+0.082
Neg logit contributions
-0.085
unt-0.082
ha-0.076
 hill-0.075
s-0.075
Token,
Token ID11
Feature activation+1.27e-02

Pos logit contributions
e+0.126
ow+0.120
years+0.101
owing+0.098
eking+0.097
Neg logit contributions
-0.073
unt-0.072
ha-0.067
OK-0.067
 trail-0.066
Token it
Token ID340
Feature activation+1.60e-02

Pos logit contributions
+0.020
unt+0.019
ha+0.018
OK+0.017
s+0.017
Neg logit contributions
e-0.028
ow-0.026
years-0.023
ric-0.022
eking-0.022
Token reward
Token ID6721
Feature activation-2.48e-02

Pos logit contributions
+0.035
unt+0.035
ha+0.032
OK+0.031
s+0.030
Neg logit contributions
e-0.073
ow-0.068
years-0.063
ric-0.060
 eyel-0.058
Token.
Token ID13
Feature activation-9.14e-03

Pos logit contributions
e+0.056
ow+0.052
years+0.048
eking+0.044
ric+0.044
Neg logit contributions
-0.036
unt-0.035
ha-0.032
OK-0.032
 trail-0.032
Token It
Token ID632
Feature activation+1.13e-02

Pos logit contributions
e+0.022
ow+0.020
years+0.018
ric+0.017
 eyel+0.016
Neg logit contributions
-0.012
 trail-0.012
unt-0.012
ha-0.011
OK-0.011
Token was
Token ID373
Feature activation-9.30e-02

Pos logit contributions
unt+0.019
+0.019
ha+0.017
 trail+0.017
pool+0.017
Neg logit contributions
e-0.023
ow-0.021
years-0.019
 eyel-0.017
ric-0.017
Token ch
Token ID442
Feature activation-1.54e-03

Pos logit contributions
e+0.181
ow+0.166
years+0.152
ric+0.141
eking+0.139
Neg logit contributions
-0.162
unt-0.160
ha-0.149
 man-0.146
 trail-0.146
Tokenocol
Token ID4668
Feature activation-1.42e-01

Pos logit contributions
e+0.003
ow+0.003
years+0.003
ric+0.003
eking+0.002
Neg logit contributions
-0.002
unt-0.002
pool-0.002
ha-0.002
s-0.002
Tokenates
Token ID689
Feature activation-5.28e-02

Pos logit contributions
 weep+0.180
 eyel+0.178
ow+0.166
 crying+0.166
oken+0.162
Neg logit contributions
ha-0.254
-0.252
s-0.241
ww-0.239
"-0.239
Token and
Token ID290
Feature activation-4.71e-02

Pos logit contributions
e+0.126
ow+0.117
years+0.107
ric+0.099
eking+0.099
Neg logit contributions
-0.073
unt-0.071
ha-0.064
OK-0.064
 trail-0.063
Token sweets
Token ID42402
Feature activation-2.58e-02

Pos logit contributions
e+0.100
ow+0.091
years+0.084
ric+0.077
eking+0.076
Neg logit contributions
-0.077
unt-0.075
ha-0.069
OK-0.068
s-0.068
Token!
Token ID0
Feature activation-2.49e-02

Pos logit contributions
e+0.058
ow+0.052
years+0.046
eking+0.043
ric+0.043
Neg logit contributions
unt-0.040
-0.039
OK-0.036
ha-0.035
ww-0.034
Token
Token ID198
Feature activation+1.08e-02

Pos logit contributions
e+0.057
ow+0.053
years+0.048
ric+0.044
 eyel+0.043
Neg logit contributions
-0.035
 trail-0.034
unt-0.034
ha-0.032
 hill-0.032
Token the
Token ID262
Feature activation-3.62e-02

Pos logit contributions
e+0.160
ow+0.146
years+0.136
ric+0.121
eking+0.120
Neg logit contributions
-0.177
unt-0.170
 hill-0.160
ha-0.160
s-0.160
Token sky
Token ID6766
Feature activation-2.46e-02

Pos logit contributions
e+0.091
ow+0.084
years+0.078
ric+0.074
eking+0.073
Neg logit contributions
-0.044
unt-0.043
ha-0.039
s-0.039
OK-0.038
Token before
Token ID878
Feature activation-5.73e-02

Pos logit contributions
e+0.055
ow+0.050
years+0.044
ric+0.043
eking+0.043
Neg logit contributions
unt-0.038
-0.037
ha-0.035
OK-0.034
 trail-0.033
Token they
Token ID484
Feature activation+4.11e-03

Pos logit contributions
e+0.118
ow+0.109
years+0.097
ric+0.092
eking+0.090
Neg logit contributions
-0.094
unt-0.093
OK-0.087
ha-0.086
s-0.085
Token walked
Token ID6807
Feature activation-5.08e-02

Pos logit contributions
+0.006
unt+0.006
ha+0.006
s+0.006
OK+0.006
Neg logit contributions
e-0.009
ow-0.008
years-0.008
ric-0.007
eking-0.007
Token fore
Token ID1674
Feature activation-1.42e-01

Pos logit contributions
e+0.089
ow+0.084
years+0.072
eking+0.066
owing+0.064
Neg logit contributions
unt-0.096
-0.095
ww-0.090
ha-0.089
www-0.088
Tokenshadow
Token ID19106
Feature activation-3.00e-02

Pos logit contributions
e+0.286
ow+0.265
years+0.243
eking+0.222
ric+0.220
Neg logit contributions
-0.243
unt-0.236
ha-0.219
OK-0.217
 hill-0.216
Tokened
Token ID276
Feature activation-3.48e-02

Pos logit contributions
e+0.060
ow+0.054
years+0.051
ric+0.047
eking+0.046
Neg logit contributions
-0.052
unt-0.050
s-0.048
ha-0.047
OK-0.047
Token the
Token ID262
Feature activation-7.42e-03

Pos logit contributions
e+0.075
ow+0.069
years+0.061
eking+0.057
ric+0.057
Neg logit contributions
-0.056
unt-0.054
ha-0.050
OK-0.049
 trail-0.049
Token down
Token ID866
Feature activation-7.59e-03

Pos logit contributions
e+0.018
ow+0.017
years+0.016
ric+0.015
 eyel+0.015
Neg logit contributions
-0.009
unt-0.008
s-0.008
ha-0.008
pool-0.007
Tokenpour
Token ID48681
Feature activation-1.95e-02

Pos logit contributions
e+0.016
ow+0.015
years+0.013
eking+0.011
owing+0.011
Neg logit contributions
unt-0.013
-0.012
ha-0.012
ww-0.011
OK-0.011
Token,
Token ID11
Feature activation+9.13e-03

Pos logit contributions
e+0.081
ow+0.075
years+0.070
ric+0.065
eking+0.065
Neg logit contributions
-0.043
unt-0.042
ha-0.038
s-0.037
OK-0.037
Token M
Token ID337
Feature activation-5.71e-02

Pos logit contributions
+0.013
unt+0.013
ha+0.012
s+0.012
pool+0.012
Neg logit contributions
e-0.021
ow-0.019
years-0.017
ric-0.016
eking-0.016
Tokenandy
Token ID10757
Feature activation-3.85e-02

Pos logit contributions
e+0.132
ow+0.118
years+0.116
eking+0.105
ric+0.104
Neg logit contributions
-0.084
unt-0.082
ha-0.073
s-0.071
 hill-0.071
Token,
Token ID11
Feature activation+5.65e-03

Pos logit contributions
e+0.091
ow+0.085
years+0.071
eking+0.069
ric+0.069
Neg logit contributions
unt-0.054
-0.052
 trail-0.049
OK-0.049
ha-0.048
Token it
Token ID340
Feature activation+3.31e-03

Pos logit contributions
+0.008
unt+0.008
ha+0.007
 trail+0.007
OK+0.007
Neg logit contributions
e-0.013
ow-0.012
years-0.010
ric-0.010
eking-0.010
Token isn
Token ID2125
Feature activation-1.42e-01

Pos logit contributions
+0.005
unt+0.005
ha+0.005
OK+0.005
 trail+0.005
Neg logit contributions
e-0.007
ow-0.006
years-0.005
ric-0.005
 eyel-0.005
Token't
Token ID470
Feature activation-4.51e-02

Pos logit contributions
e+0.339
ow+0.311
years+0.302
eking+0.276
ric+0.274
Neg logit contributions
-0.193
unt-0.184
s-0.175
ha-0.169
 hill-0.160
Token ours
Token ID16903
Feature activation+9.10e-03

Pos logit contributions
e+0.109
ow+0.101
years+0.093
eking+0.088
ric+0.087
Neg logit contributions
-0.061
unt-0.058
OK-0.054
s-0.053
ha-0.053
Token.
Token ID13
Feature activation-1.18e-02

Pos logit contributions
unt+0.013
+0.013
ha+0.012
OK+0.012
 trail+0.011
Neg logit contributions
e-0.021
ow-0.020
years-0.017
ric-0.016
eking-0.016
Token We
Token ID775
Feature activation-7.42e-03

Pos logit contributions
e+0.028
ow+0.026
years+0.024
ric+0.022
 eyel+0.021
Neg logit contributions
-0.016
unt-0.015
 trail-0.015
ha-0.015
 hill-0.015
Token can
Token ID460
Feature activation-4.39e-02

Pos logit contributions
e+0.018
ow+0.017
years+0.014
ric+0.014
eks+0.013
Neg logit contributions
unt-0.011
-0.010
 trail-0.009
ww-0.009
 hill-0.009
Token on
Token ID319
Feature activation-4.84e-02

Pos logit contributions
e+0.050
ow+0.046
years+0.039
ric+0.037
eking+0.036
Neg logit contributions
unt-0.033
OK-0.032
ha-0.032
Okay-0.031
Together-0.031
Token the
Token ID262
Feature activation-3.54e-02

Pos logit contributions
e+0.089
ow+0.077
years+0.063
ric+0.059
 eyel+0.056
Neg logit contributions
OK-0.093
unt-0.093
-0.092
Okay-0.090
ha-0.089
Token carpet
Token ID20710
Feature activation-1.88e-02

Pos logit contributions
e+0.092
ow+0.085
years+0.078
ric+0.076
 eyel+0.075
Neg logit contributions
-0.039
unt-0.037
s-0.035
ha-0.035
OK-0.034
Token?
Token ID30
Feature activation-2.82e-02

Pos logit contributions
e+0.040
ow+0.037
years+0.030
ric+0.030
owing+0.029
Neg logit contributions
unt-0.031
ha-0.029
OK-0.029
-0.029
Okay-0.028
Token That
Token ID1320
Feature activation+2.76e-02

Pos logit contributions
e+0.067
ow+0.061
years+0.056
ric+0.051
 eyel+0.049
Neg logit contributions
-0.039
 trail-0.039
unt-0.038
 hill-0.037
ha-0.036
Token isn
Token ID2125
Feature activation-1.41e-01

Pos logit contributions
unt+0.049
+0.049
+0.048
OK+0.047
Together+0.047
Neg logit contributions
e-0.053
ow-0.045
ric-0.037
 eyel-0.036
 being-0.034
Token't
Token ID470
Feature activation-4.81e-02

Pos logit contributions
e+0.332
ow+0.306
years+0.293
eking+0.270
ric+0.268
Neg logit contributions
-0.198
unt-0.192
s-0.178
ha-0.176
 hill-0.166
Token very
Token ID845
Feature activation-4.59e-02

Pos logit contributions
e+0.096
ow+0.087
years+0.078
eking+0.074
ric+0.072
Neg logit contributions
-0.085
unt-0.081
OK-0.077
s-0.076
-0.075
Token nice
Token ID3621
Feature activation+3.96e-02

Pos logit contributions
e+0.104
ow+0.096
years+0.089
eking+0.084
ric+0.079
Neg logit contributions
-0.066
unt-0.065
-0.059
s-0.058
 hill-0.056
Token".
Token ID1911
Feature activation-2.74e-02

Pos logit contributions
unt+0.073
+0.070
OK+0.070
By+0.068
Sure+0.068
Neg logit contributions
e-0.070
ow-0.067
owing-0.053
 being-0.052
ric-0.050
Token
Token ID198
Feature activation+1.76e-02

Pos logit contributions
e+0.069
ow+0.065
years+0.060
ric+0.056
eking+0.055
Neg logit contributions
-0.033
unt-0.032
ha-0.029
 trail-0.028
OK-0.028
Token.
Token ID13
Feature activation-1.63e-02

Pos logit contributions
e+0.036
ow+0.034
years+0.031
ric+0.029
eking+0.029
Neg logit contributions
-0.022
unt-0.022
ha-0.020
OK-0.020
 trail-0.019
Token They
Token ID1119
Feature activation+4.64e-02

Pos logit contributions
e+0.039
ow+0.036
years+0.034
ric+0.031
 eyel+0.031
Neg logit contributions
-0.021
unt-0.021
ha-0.020
 trail-0.020
ww-0.019
Token were
Token ID547
Feature activation-6.95e-02

Pos logit contributions
+0.076
unt+0.074
ha+0.068
s+0.067
OK+0.067
Neg logit contributions
e-0.098
ow-0.090
years-0.083
ric-0.076
eking-0.075
Token called
Token ID1444
Feature activation-6.41e-02

Pos logit contributions
e+0.136
ow+0.122
years+0.116
ric+0.103
eking+0.102
Neg logit contributions
unt-0.119
-0.115
 man-0.109
s-0.109
 trail-0.108
Token ost
Token ID23619
Feature activation-9.41e-02

Pos logit contributions
e+0.143
ow+0.130
years+0.125
eking+0.113
ric+0.113
Neg logit contributions
-0.096
unt-0.094
ha-0.086
s-0.086
 trail-0.085
Tokenric
Token ID1173
Feature activation-1.41e-01

Pos logit contributions
e+0.088
years+0.063
ow+0.058
 eyel+0.048
 elbows+0.039
Neg logit contributions
s-0.254
unt-0.246
-0.244
-0.237
 hill-0.236
Tokenhes
Token ID956
Feature activation+1.52e-02

Pos logit contributions
e+0.255
ow+0.237
years+0.229
 eyel+0.215
 elbows+0.204
Neg logit contributions
-0.243
s-0.236
unt-0.231
ha-0.228
 man-0.220
Token.
Token ID13
Feature activation-1.53e-02

Pos logit contributions
+0.023
unt+0.022
ha+0.020
OK+0.020
 trail+0.020
Neg logit contributions
e-0.034
ow-0.032
years-0.029
ric-0.027
eking-0.026
Token Lily
Token ID20037
Feature activation-3.44e-02

Pos logit contributions
e+0.036
ow+0.034
years+0.031
ric+0.029
 eyel+0.027
Neg logit contributions
-0.021
 trail-0.020
unt-0.019
ha-0.019
 hill-0.019
Token and
Token ID290
Feature activation+3.98e-03

Pos logit contributions
e+0.077
ow+0.070
years+0.065
ric+0.059
eking+0.059
Neg logit contributions
-0.052
unt-0.051
ha-0.047
OK-0.046
 trail-0.046
Token Ben
Token ID3932
Feature activation-3.23e-02

Pos logit contributions
 man+0.005
unt+0.005
+0.005
 hill+0.005
 wagon+0.005
Neg logit contributions
e-0.009
eking-0.008
years-0.008
ow-0.008
oken-0.008
Token is
Token ID318
Feature activation-1.00e-01

Pos logit contributions
e+0.002
ow+0.002
years+0.002
ric+0.002
eking+0.002
Neg logit contributions
unt-0.002
-0.002
ha-0.001
OK-0.001
 trail-0.001
Token a
Token ID257
Feature activation+2.01e-02

Pos logit contributions
e+0.205
ow+0.187
years+0.176
ric+0.158
eking+0.156
Neg logit contributions
-0.169
unt-0.163
ha-0.152
s-0.150
OK-0.149
Token sign
Token ID1051
Feature activation+1.22e-03

Pos logit contributions
+0.026
unt+0.024
s+0.024
pool+0.023
Together+0.023
Neg logit contributions
e-0.047
ow-0.042
years-0.038
 eyel-0.038
ric-0.037
Token that
Token ID326
Feature activation+2.42e-02

Pos logit contributions
+0.002
OK+0.002
 trail+0.002
+0.002
unt+0.002
Neg logit contributions
e-0.003
ow-0.002
years-0.002
 sitting-0.002
 being-0.002
Token brains
Token ID14290
Feature activation+1.50e-02

Pos logit contributions
unt+0.034
+0.034
ha+0.031
pool+0.031
+0.031
Neg logit contributions
e-0.055
ow-0.051
years-0.042
ric-0.041
eking-0.041
Token weren
Token ID6304
Feature activation-1.40e-01

Pos logit contributions
unt+0.026
+0.024
ww+0.023
ha+0.023
OK+0.023
Neg logit contributions
e-0.031
ow-0.029
ric-0.023
years-0.023
eking-0.023
Token't
Token ID470
Feature activation-5.45e-02

Pos logit contributions
e+0.319
ow+0.292
years+0.278
ric+0.255
eking+0.252
Neg logit contributions
-0.202
unt-0.196
ha-0.182
s-0.179
 hill-0.174
Token meant
Token ID4001
Feature activation-1.85e-02

Pos logit contributions
e+0.114
ow+0.105
years+0.095
ric+0.089
eking+0.089
Neg logit contributions
-0.091
unt-0.088
OK-0.081
ha-0.080
s-0.080
Token to
Token ID284
Feature activation-5.74e-02

Pos logit contributions
e+0.042
ow+0.039
years+0.035
eking+0.033
ric+0.033
Neg logit contributions
-0.028
unt-0.027
OK-0.025
ha-0.025
 hill-0.024
Token be
Token ID307
Feature activation-8.12e-02

Pos logit contributions
e+0.117
ow+0.107
years+0.098
ric+0.090
eking+0.089
Neg logit contributions
-0.098
unt-0.096
ha-0.089
s-0.088
OK-0.088
Token folded
Token ID24650
Feature activation-2.98e-03

Pos logit contributions
e+0.166
ow+0.152
years+0.141
ric+0.129
eking+0.127
Neg logit contributions
-0.137
unt-0.133
ha-0.124
s-0.123
OK-0.122
Token was
Token ID373
Feature activation-9.80e-02

Pos logit contributions
+0.031
unt+0.030
ha+0.028
s+0.028
OK+0.028
Neg logit contributions
e-0.043
ow-0.039
years-0.036
ric-0.033
eking-0.033
Token a
Token ID257
Feature activation+3.00e-02

Pos logit contributions
e+0.222
ow+0.205
years+0.192
ric+0.179
owing+0.179
Neg logit contributions
unt-0.138
-0.136
 man-0.134
 trail-0.133
 wagon-0.130
Token box
Token ID3091
Feature activation-7.26e-03

Pos logit contributions
+0.039
unt+0.038
pool+0.037
s+0.036
First+0.035
Neg logit contributions
e-0.069
ow-0.059
 eyel-0.057
years-0.056
ric-0.054
Token of
Token ID286
Feature activation-5.17e-02

Pos logit contributions
e+0.015
ow+0.014
years+0.012
eking+0.011
owing+0.011
Neg logit contributions
-0.013
unt-0.012
OK-0.012
ha-0.011
Okay-0.011
Token ch
Token ID442
Feature activation-4.73e-03

Pos logit contributions
e+0.120
ow+0.111
years+0.103
ric+0.096
eking+0.095
Neg logit contributions
-0.074
unt-0.071
ha-0.066
s-0.065
OK-0.064
Tokenocol
Token ID4668
Feature activation-1.39e-01

Pos logit contributions
e+0.006
ow+0.006
years+0.005
ric+0.005
eking+0.005
Neg logit contributions
s-0.010
-0.010
ay-0.010
ha-0.010
pool-0.010
Tokenates
Token ID689
Feature activation-4.60e-02

Pos logit contributions
 eyel+0.181
 weep+0.180
ow+0.176
years+0.171
 crying+0.169
Neg logit contributions
ay-0.244
ha-0.240
-0.236
s-0.235
ww-0.234
Token.
Token ID13
Feature activation-2.09e-02

Pos logit contributions
e+0.106
ow+0.097
years+0.087
eking+0.082
ric+0.081
Neg logit contributions
unt-0.069
-0.068
OK-0.062
ha-0.061
 trail-0.059
Token Lily
Token ID20037
Feature activation-4.12e-02

Pos logit contributions
e+0.048
ow+0.044
years+0.041
 eyel+0.036
ric+0.036
Neg logit contributions
unt-0.030
-0.029
ha-0.029
 trail-0.028
ww-0.027
Token loved
Token ID6151
Feature activation-3.44e-03

Pos logit contributions
e+0.089
ow+0.083
years+0.077
ric+0.071
eking+0.070
Neg logit contributions
-0.065
unt-0.062
s-0.058
ha-0.058
OK-0.057
Token ch
Token ID442
Feature activation+4.93e-03

Pos logit contributions
e+0.008
ow+0.007
years+0.006
eking+0.006
owing+0.006
Neg logit contributions
unt-0.005
-0.005
 man-0.005
s-0.005
 hill-0.005
Token giving
Token ID3501
Feature activation-2.78e-02

Pos logit contributions
e+0.131
ow+0.125
years+0.121
owing+0.102
eking+0.098
Neg logit contributions
-0.149
unt-0.144
 hill-0.136
 man-0.136
 trail-0.136
Token him
Token ID683
Feature activation-7.88e-02

Pos logit contributions
e+0.059
ow+0.055
years+0.051
ric+0.047
eking+0.047
Neg logit contributions
-0.045
unt-0.043
s-0.040
ha-0.040
OK-0.039
Token too
Token ID1165
Feature activation-1.15e-02

Pos logit contributions
e+0.155
ow+0.141
years+0.131
ric+0.118
eking+0.117
Neg logit contributions
-0.140
unt-0.135
ha-0.127
s-0.126
OK-0.125
Token many
Token ID867
Feature activation+2.00e-02

Pos logit contributions
e+0.021
ow+0.020
years+0.017
 eyel+0.015
eking+0.015
Neg logit contributions
-0.020
unt-0.020
-0.020
 wagon-0.019
 trail-0.019
Token ch
Token ID442
Feature activation-1.18e-02

Pos logit contributions
+0.027
unt+0.026
OK+0.024
ha+0.023
+0.023
Neg logit contributions
e-0.048
ow-0.044
years-0.041
ric-0.038
eking-0.038
Tokenocol
Token ID4668
Feature activation-1.39e-01

Pos logit contributions
e+0.020
ow+0.017
years+0.016
 eyel+0.016
ric+0.015
Neg logit contributions
-0.022
s-0.020
pool-0.020
unt-0.020
ha-0.020
Tokenates
Token ID689
Feature activation-5.67e-02

Pos logit contributions
 eyel+0.179
e+0.166
years+0.161
ow+0.158
 weep+0.157
Neg logit contributions
-0.267
s-0.254
ha-0.244
ay-0.241
ww-0.240
Token."
Token ID526
Feature activation-2.54e-02

Pos logit contributions
e+0.136
ow+0.126
years+0.113
ric+0.105
eking+0.105
Neg logit contributions
unt-0.079
-0.076
OK-0.072
ha-0.069
 trail-0.068
Token The
Token ID383
Feature activation+2.40e-02

Pos logit contributions
e+0.064
ow+0.060
years+0.056
ric+0.052
eking+0.051
Neg logit contributions
-0.030
unt-0.029
ha-0.027
 trail-0.027
OK-0.026
Token king
Token ID5822
Feature activation-3.23e-02

Pos logit contributions
+0.024
unt+0.023
 man+0.021
 trail+0.020
 hill+0.020
Neg logit contributions
e-0.067
ow-0.063
years-0.058
ric-0.056
eking-0.055
Token learned
Token ID4499
Feature activation+1.65e-02

Pos logit contributions
e+0.073
ow+0.068
years+0.065
ric+0.059
eking+0.059
Neg logit contributions
-0.047
unt-0.045
s-0.042
ha-0.041
 man-0.041
Token pl
Token ID458
Feature activation-7.37e-02

Pos logit contributions
e+0.096
ow+0.088
years+0.077
ric+0.072
eking+0.072
Neg logit contributions
-0.079
unt-0.078
ha-0.072
s-0.072
OK-0.071
Tokenuck
Token ID1347
Feature activation+2.30e-03

Pos logit contributions
e+0.121
 eyel+0.112
years+0.107
ow+0.103
ric+0.100
Neg logit contributions
-0.129
s-0.121
pool-0.120
-0.112
unt-0.109
Token cel
Token ID18725
Feature activation-1.54e-02

Pos logit contributions
+0.004
unt+0.004
ha+0.004
OK+0.004
 trail+0.004
Neg logit contributions
e-0.004
ow-0.004
years-0.004
ric-0.003
eking-0.003
Tokenery
Token ID1924
Feature activation+1.49e-03

Pos logit contributions
e+0.025
ow+0.023
years+0.020
ric+0.018
eking+0.018
Neg logit contributions
-0.032
unt-0.032
ha-0.030
s-0.030
OK-0.030
Token that
Token ID326
Feature activation+2.63e-02

Pos logit contributions
unt+0.002
+0.002
ha+0.002
OK+0.002
ww+0.002
Neg logit contributions
e-0.003
ow-0.003
years-0.003
eking-0.003
ric-0.002
Token isn
Token ID2125
Feature activation-1.39e-01

Pos logit contributions
+0.038
unt+0.037
ha+0.033
OK+0.033
s+0.032
Neg logit contributions
e-0.061
ow-0.056
years-0.052
ric-0.048
eking-0.048
Token't
Token ID470
Feature activation-4.67e-02

Pos logit contributions
e+0.325
ow+0.299
years+0.287
eking+0.260
ric+0.260
Neg logit contributions
-0.196
unt-0.187
s-0.177
ha-0.173
 hill-0.166
Token grown
Token ID7334
Feature activation-5.48e-02

Pos logit contributions
e+0.096
ow+0.088
years+0.080
eking+0.075
ric+0.074
Neg logit contributions
-0.079
unt-0.077
OK-0.071
s-0.070
 trail-0.070
Token.
Token ID13
Feature activation-2.34e-03

Pos logit contributions
e+0.127
ow+0.119
years+0.110
ric+0.105
 eyel+0.100
Neg logit contributions
-0.077
s-0.072
unt-0.070
ha-0.069
pool-0.066
Token He
Token ID679
Feature activation+4.89e-02

Pos logit contributions
e+0.006
ow+0.005
years+0.005
ric+0.004
 eyel+0.004
Neg logit contributions
 trail-0.003
unt-0.003
-0.003
ha-0.003
ww-0.003
Token showed
Token ID3751
Feature activation+5.86e-03

Pos logit contributions
+0.078
unt+0.076
ha+0.070
s+0.070
OK+0.069
Neg logit contributions
e-0.105
ow-0.097
years-0.089
ric-0.082
eking-0.082
Token fix
Token ID4259
Feature activation-6.95e-04

Pos logit contributions
e+0.024
ow+0.022
years+0.017
ric+0.016
ad+0.016
Neg logit contributions
unt-0.022
-0.021
s-0.020
ha-0.020
-0.020
Token it
Token ID340
Feature activation-2.57e-03

Pos logit contributions
e+0.001
ow+0.001
years+0.001
ric+0.001
eking+0.001
Neg logit contributions
-0.001
unt-0.001
s-0.001
ha-0.001
 man-0.001
Token if
Token ID611
Feature activation-1.45e-02

Pos logit contributions
e+0.006
ow+0.005
eking+0.004
years+0.004
owing+0.004
Neg logit contributions
unt-0.004
-0.004
OK-0.004
Okay-0.004
ha-0.004
Token the
Token ID262
Feature activation+1.34e-02

Pos logit contributions
e+0.032
ow+0.030
years+0.027
eking+0.025
ric+0.025
Neg logit contributions
-0.022
unt-0.022
ha-0.020
OK-0.020
 trail-0.019
Token steam
Token ID13324
Feature activation+4.32e-02

Pos logit contributions
+0.019
unt+0.018
s+0.017
ha+0.017
Together+0.016
Neg logit contributions
e-0.031
ow-0.029
years-0.026
ric-0.025
 eyel-0.025
Token isn
Token ID2125
Feature activation-1.39e-01

Pos logit contributions
unt+0.055
+0.055
ha+0.051
 trail+0.050
OK+0.049
Neg logit contributions
e-0.106
ow-0.100
ric-0.085
years-0.082
ps-0.081
Token't
Token ID470
Feature activation-4.51e-02

Pos logit contributions
e+0.309
ow+0.285
years+0.265
ric+0.244
eking+0.243
Neg logit contributions
-0.211
unt-0.203
ha-0.188
s-0.186
OK-0.184
Token available
Token ID1695
Feature activation+2.00e-02

Pos logit contributions
e+0.090
ow+0.083
years+0.071
eking+0.069
ric+0.067
Neg logit contributions
-0.079
unt-0.077
OK-0.073
s-0.071
 hill-0.071
Token.
Token ID13
Feature activation-3.91e-03

Pos logit contributions
unt+0.031
+0.030
OK+0.029
ha+0.028
Okay+0.028
Neg logit contributions
e-0.043
ow-0.041
years-0.034
eking-0.033
owing-0.032
Token But
Token ID887
Feature activation-7.15e-04

Pos logit contributions
e+0.009
ow+0.009
years+0.008
ric+0.007
 eyel+0.007
Neg logit contributions
-0.005
 trail-0.005
unt-0.005
ha-0.005
 hill-0.005
Token don
Token ID836
Feature activation-4.26e-02

Pos logit contributions
e+0.002
ow+0.002
years+0.001
ric+0.001
eking+0.001
Neg logit contributions
-0.001
unt-0.001
ha-0.001
s-0.001
OK-0.001
Token,
Token ID11
Feature activation+1.35e-02

Pos logit contributions
unt+0.053
+0.052
pool+0.048
ha+0.048
OK+0.048
Neg logit contributions
e-0.076
ow-0.070
years-0.060
eking-0.058
 eyel-0.058
Token the
Token ID262
Feature activation+2.83e-02

Pos logit contributions
+0.018
unt+0.018
ha+0.016
s+0.016
OK+0.016
Neg logit contributions
e-0.032
ow-0.030
years-0.028
ric-0.026
eking-0.026
Token loud
Token ID7812
Feature activation+5.99e-02

Pos logit contributions
+0.037
unt+0.035
s+0.034
ha+0.032
pool+0.031
Neg logit contributions
e-0.067
ow-0.063
years-0.057
 eyel-0.055
ric-0.055
Tokenest
Token ID395
Feature activation+4.86e-02

Pos logit contributions
+0.083
unt+0.079
s+0.077
ha+0.074
OK+0.074
Neg logit contributions
e-0.138
ow-0.127
ric-0.108
years-0.107
 eyel-0.107
Token voice
Token ID3809
Feature activation-6.90e-03

Pos logit contributions
+0.069
unt+0.066
ha+0.061
s+0.060
+0.060
Neg logit contributions
e-0.113
ow-0.105
 eyel-0.089
years-0.089
ric-0.088
Token isn
Token ID2125
Feature activation-1.38e-01

Pos logit contributions
e+0.016
ow+0.015
years+0.013
eking+0.012
ric+0.012
Neg logit contributions
-0.010
unt-0.010
 trail-0.009
 hill-0.009
ha-0.009
Token't
Token ID470
Feature activation-3.95e-02

Pos logit contributions
e+0.296
ow+0.275
years+0.247
ric+0.231
eking+0.227
Neg logit contributions
-0.223
unt-0.215
OK-0.202
 trail-0.200
ha-0.200
Token the
Token ID262
Feature activation-1.27e-03

Pos logit contributions
e+0.077
ow+0.071
years+0.059
eking+0.058
ric+0.057
Neg logit contributions
-0.072
unt-0.069
OK-0.067
Okay-0.064
-0.064
Token one
Token ID530
Feature activation+3.93e-02

Pos logit contributions
e+0.003
ow+0.003
years+0.002
ric+0.002
 eyel+0.002
Neg logit contributions
-0.002
unt-0.002
s-0.002
ha-0.002
-0.002
Token that
Token ID326
Feature activation+4.16e-02

Pos logit contributions
+0.061
unt+0.060
ha+0.055
 trail+0.055
OK+0.055
Neg logit contributions
e-0.088
ow-0.081
years-0.067
owing-0.065
ric-0.064
Token helps
Token ID5419
Feature activation+1.12e-02

Pos logit contributions
+0.059
unt+0.057
ha+0.052
 trail+0.051
+0.051
Neg logit contributions
e-0.098
ow-0.090
years-0.080
ric-0.076
eking-0.074
Token "
Token ID366
Feature activation-6.82e-03

Pos logit contributions
+0.026
unt+0.024
ha+0.021
s+0.021
OK+0.020
Neg logit contributions
e-0.082
ow-0.077
years-0.073
ric-0.068
eking-0.068
TokenIt
Token ID1026
Feature activation+9.10e-03

Pos logit contributions
e+0.017
ow+0.015
years+0.015
ric+0.014
eking+0.013
Neg logit contributions
-0.009
unt-0.008
 trail-0.008
 wagon-0.008
 hill-0.007
Token means
Token ID1724
Feature activation+2.32e-02

Pos logit contributions
+0.015
unt+0.014
 trail+0.014
 hill+0.013
ha+0.013
Neg logit contributions
e-0.019
ow-0.018
years-0.015
ric-0.015
eking-0.014
Token the
Token ID262
Feature activation+1.52e-02

Pos logit contributions
+0.035
unt+0.033
ha+0.031
OK+0.030
s+0.030
Neg logit contributions
e-0.052
ow-0.048
years-0.045
ric-0.041
eking-0.041
Token tuna
Token ID38883
Feature activation+1.31e-02

Pos logit contributions
unt+0.022
+0.021
ha+0.020
s+0.019
+0.018
Neg logit contributions
e-0.036
ow-0.032
years-0.029
ric-0.028
 eyel-0.028
Token isn
Token ID2125
Feature activation-1.38e-01

Pos logit contributions
unt+0.019
+0.017
ha+0.016
OK+0.016
Together+0.016
Neg logit contributions
e-0.031
ow-0.028
years-0.024
ps-0.023
eking-0.023
Token't
Token ID470
Feature activation-4.64e-02

Pos logit contributions
e+0.311
ow+0.288
years+0.271
ric+0.248
eking+0.248
Neg logit contributions
-0.205
unt-0.198
ha-0.182
s-0.182
OK-0.177
Token very
Token ID845
Feature activation-4.07e-02

Pos logit contributions
e+0.100
ow+0.092
years+0.084
eking+0.078
ric+0.077
Neg logit contributions
-0.074
unt-0.072
OK-0.066
s-0.065
ha-0.065
Token fresh
Token ID4713
Feature activation+3.90e-02

Pos logit contributions
e+0.093
ow+0.086
years+0.079
eking+0.074
ric+0.071
Neg logit contributions
unt-0.058
-0.057
ha-0.050
-0.050
 hill-0.049
Token,
Token ID11
Feature activation+1.26e-02

Pos logit contributions
unt+0.069
ha+0.060
Sure+0.059
Together+0.059
OK+0.059
Neg logit contributions
e-0.082
ow-0.071
years-0.060
owing-0.060
ps-0.058
Token Jimmy
Token ID12963
Feature activation-4.26e-02

Pos logit contributions
+0.019
unt+0.018
ha+0.016
 trail+0.016
OK+0.016
Neg logit contributions
e-0.029
ow-0.026
years-0.024
eking-0.022
ric-0.022
Token,
Token ID11
Feature activation+1.88e-02

Pos logit contributions
+0.010
unt+0.010
ha+0.009
s+0.009
OK+0.009
Neg logit contributions
e-0.019
ow-0.017
years-0.016
ric-0.015
eking-0.015
Token kicking
Token ID17997
Feature activation+4.55e-03

Pos logit contributions
+0.029
unt+0.028
ha+0.026
s+0.026
 trail+0.026
Neg logit contributions
e-0.041
ow-0.037
years-0.033
ric-0.030
eking-0.030
Token is
Token ID318
Feature activation-7.58e-02

Pos logit contributions
+0.007
unt+0.007
ha+0.007
s+0.007
OK+0.007
Neg logit contributions
e-0.010
ow-0.009
years-0.008
ric-0.008
eking-0.007
Token really
Token ID1107
Feature activation-3.58e-02

Pos logit contributions
e+0.151
ow+0.138
years+0.124
ric+0.114
eking+0.113
Neg logit contributions
-0.135
unt-0.129
ha-0.120
 hill-0.120
s-0.120
Token fun
Token ID1257
Feature activation+2.70e-02

Pos logit contributions
e+0.085
ow+0.077
years+0.072
eking+0.067
 eyel+0.066
Neg logit contributions
-0.049
unt-0.048
 hill-0.043
 trail-0.043
-0.042
Token isn
Token ID2125
Feature activation-1.37e-01

Pos logit contributions
unt+0.042
+0.041
OK+0.039
ha+0.038
pool+0.038
Neg logit contributions
e-0.060
ow-0.051
years-0.043
eking-0.043
 being-0.042
Token't
Token ID470
Feature activation-3.67e-02

Pos logit contributions
e+0.287
ow+0.265
years+0.235
ric+0.220
eking+0.214
Neg logit contributions
-0.227
unt-0.220
OK-0.210
 trail-0.209
 hill-0.205
Token it
Token ID340
Feature activation-8.75e-03

Pos logit contributions
e+0.079
ow+0.072
years+0.066
eking+0.061
ric+0.061
Neg logit contributions
-0.058
unt-0.056
OK-0.053
s-0.052
 trail-0.052
Token?
Token ID30
Feature activation-2.55e-02

Pos logit contributions
e+0.018
ow+0.018
years+0.014
eking+0.014
ric+0.014
Neg logit contributions
-0.014
unt-0.013
OK-0.013
ha-0.012
 trail-0.012
Tokenâ
Token ID22940
Feature activation-3.30e-02

Pos logit contributions
e+0.057
ow+0.052
years+0.048
ric+0.045
 eyel+0.042
Neg logit contributions
 trail-0.037
unt-0.036
-0.035
 hill-0.035
ha-0.034
Token
Token ID26391
Feature activation-5.97e-02

Pos logit contributions
e+0.056
ow+0.051
years+0.045
ric+0.041
eking+0.040
Neg logit contributions
-0.068
unt-0.066
ha-0.062
s-0.062
 trail-0.061
Token he
Token ID339
Feature activation+1.99e-02

Pos logit contributions
unt+0.054
+0.054
ha+0.050
+0.049
OK+0.049
Neg logit contributions
e-0.084
ow-0.076
years-0.067
eking-0.064
ric-0.063
Token knew
Token ID2993
Feature activation-1.75e-02

Pos logit contributions
+0.028
unt+0.028
ha+0.025
 trail+0.025
OK+0.025
Neg logit contributions
e-0.046
ow-0.042
years-0.039
ric-0.036
eking-0.036
Token the
Token ID262
Feature activation+8.07e-03

Pos logit contributions
e+0.040
ow+0.037
years+0.034
ric+0.032
eking+0.032
Neg logit contributions
-0.025
unt-0.024
ha-0.022
OK-0.022
s-0.022
Token scary
Token ID14343
Feature activation+6.05e-02

Pos logit contributions
+0.011
s+0.010
unt+0.010
ha+0.009
+0.009
Neg logit contributions
e-0.019
ow-0.018
years-0.016
ric-0.016
 eyel-0.016
Token noises
Token ID26782
Feature activation+1.55e-02

Pos logit contributions
+0.078
unt+0.074
s+0.069
Together+0.068
ha+0.068
Neg logit contributions
e-0.146
ow-0.136
years-0.121
ric-0.117
 eyel-0.117
Token weren
Token ID6304
Feature activation-1.37e-01

Pos logit contributions
unt+0.025
+0.024
OK+0.022
 trail+0.022
+0.022
Neg logit contributions
e-0.033
ow-0.030
years-0.026
eking-0.024
owing-0.024
Token't
Token ID470
Feature activation-4.19e-02

Pos logit contributions
e+0.295
ow+0.274
years+0.248
ric+0.229
eking+0.227
Neg logit contributions
-0.221
unt-0.212
OK-0.198
 trail-0.196
ha-0.195
Token anything
Token ID1997
Feature activation+2.02e-02

Pos logit contributions
e+0.083
ow+0.076
years+0.070
ric+0.064
eking+0.063
Neg logit contributions
-0.074
unt-0.071
ha-0.067
s-0.066
OK-0.066
Token to
Token ID284
Feature activation-5.22e-02

Pos logit contributions
+0.031
unt+0.031
OK+0.029
 trail+0.029
 hill+0.029
Neg logit contributions
e-0.043
ow-0.040
years-0.035
eking-0.033
 eyel-0.033
Token worry
Token ID5490
Feature activation+2.41e-02

Pos logit contributions
e+0.105
ow+0.095
years+0.083
eking+0.078
ric+0.078
Neg logit contributions
-0.092
unt-0.091
ha-0.085
s-0.085
OK-0.083
Token about
Token ID546
Feature activation-3.44e-02

Pos logit contributions
+0.041
unt+0.039
OK+0.039
ha+0.038
+0.036
Neg logit contributions
e-0.048
ow-0.045
ric-0.038
eking-0.037
years-0.037
Token drink
Token ID4144
Feature activation+1.20e-02

Pos logit contributions
+0.041
unt+0.041
OK+0.038
ha+0.037
Together+0.037
Neg logit contributions
e-0.063
ow-0.058
years-0.051
ric-0.049
 eyel-0.048
Token he
Token ID339
Feature activation+1.13e-02

Pos logit contributions
unt+0.018
+0.018
ha+0.017
OK+0.017
 trail+0.016
Neg logit contributions
e-0.026
ow-0.024
years-0.020
ric-0.019
eking-0.019
Token had
Token ID550
Feature activation-2.72e-02

Pos logit contributions
+0.018
unt+0.017
ha+0.016
s+0.016
OK+0.016
Neg logit contributions
e-0.024
ow-0.023
years-0.021
ric-0.019
eking-0.019
Token been
Token ID587
Feature activation-6.84e-02

Pos logit contributions
e+0.056
ow+0.051
years+0.047
ric+0.043
eking+0.043
Neg logit contributions
-0.046
unt-0.045
ha-0.042
s-0.041
OK-0.041
Token drinking
Token ID7722
Feature activation-2.12e-03

Pos logit contributions
e+0.131
ow+0.121
years+0.113
ric+0.106
eking+0.099
Neg logit contributions
-0.118
unt-0.116
s-0.109
ha-0.108
 few-0.106
Token wasn
Token ID2492
Feature activation-1.37e-01

Pos logit contributions
e+0.004
ow+0.004
years+0.004
ric+0.003
eking+0.003
Neg logit contributions
-0.003
unt-0.003
ha-0.003
OK-0.003
 trail-0.003
Token't
Token ID470
Feature activation-4.67e-02

Pos logit contributions
e+0.312
ow+0.287
years+0.271
ric+0.250
eking+0.248
Neg logit contributions
-0.201
unt-0.193
ha-0.179
s-0.177
 trail-0.172
Token very
Token ID845
Feature activation-4.34e-02

Pos logit contributions
e+0.096
ow+0.088
years+0.082
ric+0.074
eking+0.072
Neg logit contributions
-0.078
unt-0.076
ha-0.072
 man-0.071
s-0.070
Token good
Token ID922
Feature activation+3.02e-02

Pos logit contributions
e+0.094
ow+0.087
years+0.081
eking+0.074
ric+0.072
Neg logit contributions
-0.068
unt-0.067
ha-0.061
 trail-0.061
 hill-0.061
Token for
Token ID329
Feature activation-5.77e-02

Pos logit contributions
unt+0.050
+0.049
ha+0.046
 hill+0.045
 trail+0.044
Neg logit contributions
e-0.062
ow-0.059
years-0.051
eking-0.050
 eyel-0.049
Token him
Token ID683
Feature activation-4.65e-02

Pos logit contributions
e+0.108
ow+0.100
years+0.091
ric+0.084
owing+0.081
Neg logit contributions
-0.100
unt-0.096
 man-0.095
 hill-0.095
 few-0.093
Token,
Token ID11
Feature activation+2.57e-02

Pos logit contributions
+0.023
unt+0.022
OK+0.020
 hill+0.020
ha+0.020
Neg logit contributions
e-0.037
ow-0.035
years-0.032
eking-0.029
 eyel-0.029
Token "
Token ID366
Feature activation-4.27e-03

Pos logit contributions
+0.036
unt+0.033
ha+0.032
ww+0.032
 trail+0.032
Neg logit contributions
e-0.056
ow-0.054
years-0.052
 eyel-0.047
ric-0.045
TokenNo
Token ID2949
Feature activation-8.29e-03

Pos logit contributions
e+0.011
years+0.009
ow+0.009
ad+0.008
 elbows+0.008
Neg logit contributions
unt-0.005
 trail-0.005
 hill-0.005
 wagon-0.005
-0.004
Token,
Token ID11
Feature activation+2.50e-02

Pos logit contributions
e+0.016
ow+0.014
 eyel+0.012
ric+0.012
eking+0.012
Neg logit contributions
-0.014
unt-0.014
OK-0.014
pool-0.013
-0.013
Token mud
Token ID17492
Feature activation+1.06e-02

Pos logit contributions
unt+0.040
+0.039
s+0.037
ha+0.036
 trail+0.035
Neg logit contributions
e-0.053
ow-0.047
years-0.042
 eyel-0.039
 being-0.039
Token isn
Token ID2125
Feature activation-1.37e-01

Pos logit contributions
unt+0.016
+0.016
ha+0.014
OK+0.014
pool+0.013
Neg logit contributions
e-0.024
ow-0.022
ric-0.019
years-0.019
eking-0.019
Token't
Token ID470
Feature activation-4.84e-02

Pos logit contributions
e+0.298
ow+0.275
years+0.253
ric+0.234
eking+0.231
Neg logit contributions
-0.214
unt-0.207
ha-0.191
OK-0.190
 trail-0.188
Token for
Token ID329
Feature activation-4.88e-02

Pos logit contributions
e+0.109
ow+0.100
years+0.089
eking+0.088
ric+0.084
Neg logit contributions
-0.071
unt-0.070
OK-0.064
s-0.063
-0.062
Token sharing
Token ID7373
Feature activation+8.71e-03

Pos logit contributions
e+0.104
ow+0.096
years+0.089
ric+0.082
eking+0.081
Neg logit contributions
-0.079
unt-0.076
ha-0.071
s-0.070
 trail-0.069
Token."
Token ID526
Feature activation-5.56e-03

Pos logit contributions
unt+0.013
+0.012
ha+0.012
 trail+0.011
OK+0.011
Neg logit contributions
e-0.020
ow-0.018
years-0.015
ps-0.015
ric-0.015
Token
Token ID198
Feature activation+2.91e-02

Pos logit contributions
e+0.012
ow+0.011
years+0.010
 eyel+0.010
ric+0.009
Neg logit contributions
unt-0.008
ha-0.008
ww-0.008
-0.008
 trail-0.008
Token and
Token ID290
Feature activation-2.04e-02

Pos logit contributions
e+0.027
ow+0.026
years+0.022
ric+0.021
eking+0.021
Neg logit contributions
-0.020
unt-0.020
ha-0.018
ww-0.018
www-0.017
Token run
Token ID1057
Feature activation-1.39e-02

Pos logit contributions
e+0.045
years+0.041
ow+0.041
eking+0.036
ric+0.036
Neg logit contributions
-0.032
 hill-0.029
s-0.028
 man-0.028
OK-0.028
Token to
Token ID284
Feature activation-4.45e-02

Pos logit contributions
e+0.031
ow+0.028
years+0.025
 eyel+0.023
eking+0.023
Neg logit contributions
unt-0.022
-0.022
ha-0.019
ww-0.019
www-0.019
Token the
Token ID262
Feature activation-6.81e-03

Pos logit contributions
e+0.094
years+0.086
ow+0.086
eking+0.077
ric+0.074
Neg logit contributions
-0.072
 hill-0.067
unt-0.067
 man-0.065
s-0.064
Token ost
Token ID23619
Feature activation-9.26e-02

Pos logit contributions
e+0.018
ow+0.016
years+0.015
ric+0.015
eking+0.014
Neg logit contributions
-0.008
unt-0.007
pool-0.007
ha-0.006
s-0.006
Tokenric
Token ID1173
Feature activation-1.36e-01

Pos logit contributions
e+0.072
years+0.059
ow+0.045
 eyel+0.042
oken+0.034
Neg logit contributions
s-0.258
-0.253
unt-0.249
 hill-0.241
-0.240
Tokenhes
Token ID956
Feature activation+2.34e-02

Pos logit contributions
e+0.244
years+0.223
ow+0.223
 eyel+0.199
eking+0.190
Neg logit contributions
-0.251
unt-0.238
s-0.237
ha-0.227
 man-0.222
Token.
Token ID13
Feature activation-1.28e-02

Pos logit contributions
+0.034
unt+0.033
ha+0.030
OK+0.030
 trail+0.029
Neg logit contributions
e-0.055
ow-0.051
years-0.046
ric-0.043
eking-0.042
Token
Token ID198
Feature activation+2.45e-02

Pos logit contributions
e+0.030
ow+0.029
ric+0.024
years+0.024
 eyel+0.023
Neg logit contributions
-0.017
ha-0.017
unt-0.016
 trail-0.016
ww-0.016
Token
Token ID198
Feature activation+4.01e-02

Pos logit contributions
+0.031
unt+0.030
 trail+0.028
 wagon+0.028
pool+0.028
Neg logit contributions
e-0.060
ow-0.055
years-0.049
ric-0.047
eking-0.045
Token"
Token ID1
Feature activation+1.17e-02

Pos logit contributions
umm+0.047
pool+0.044
+0.044
 trail+0.043
 wagon+0.043
Neg logit contributions
e-0.106
ow-0.089
years-0.085
ric-0.076
eking-0.075
Token waved
Token ID26834
Feature activation-6.65e-05

Pos logit contributions
e+0.027
ow+0.025
years+0.023
ric+0.021
eking+0.021
Neg logit contributions
-0.018
unt-0.018
ha-0.016
pool-0.016
 hill-0.016
Token to
Token ID284
Feature activation-2.91e-02

Pos logit contributions
e+0.000
ow+0.000
years+0.000
 eyel+0.000
owing+0.000
Neg logit contributions
-0.000
unt-0.000
 hill-0.000
ha-0.000
 trail-0.000
Token the
Token ID262
Feature activation+5.87e-03

Pos logit contributions
e+0.061
ow+0.057
years+0.053
eking+0.049
ric+0.048
Neg logit contributions
-0.048
unt-0.046
s-0.043
OK-0.043
ha-0.043
Token path
Token ID3108
Feature activation+7.22e-03

Pos logit contributions
+0.009
unt+0.008
s+0.008
pool+0.007
Together+0.007
Neg logit contributions
e-0.012
ow-0.011
years-0.011
 staring-0.010
 eyel-0.010
Token.
Token ID13
Feature activation+3.70e-03

Pos logit contributions
+0.012
unt+0.011
ha+0.010
ww+0.010
OK+0.010
Neg logit contributions
e-0.016
ow-0.015
years-0.013
eking-0.012
owing-0.012
Token Suddenly
Token ID24975
Feature activation+5.45e-02

Pos logit contributions
+0.004
unt+0.004
ha+0.004
ww+0.004
 trail+0.004
Neg logit contributions
e-0.009
ow-0.009
years-0.008
 eyel-0.008
ric-0.008
Token,
Token ID11
Feature activation+3.35e-02

Pos logit contributions
+0.068
unt+0.065
ha+0.060
s+0.060
 trail+0.058
Neg logit contributions
e-0.136
ow-0.126
years-0.119
ric-0.112
eking-0.111
Token the
Token ID262
Feature activation+3.42e-02

Pos logit contributions
+0.049
unt+0.047
ha+0.043
s+0.043
OK+0.043
Neg logit contributions
e-0.077
ow-0.071
years-0.066
ric-0.061
eking-0.061
Token path
Token ID3108
Feature activation+3.58e-02

Pos logit contributions
+0.053
s+0.047
unt+0.046
pool+0.043
+0.041
Neg logit contributions
ow-0.069
e-0.069
 eyel-0.062
years-0.062
ad-0.057
Token said
Token ID531
Feature activation+1.19e-02

Pos logit contributions
+0.063
unt+0.060
ww+0.055
OK+0.055
ha+0.054
Neg logit contributions
e-0.071
ow-0.066
years-0.059
eking-0.055
 eyel-0.052
Token hello
Token ID23748
Feature activation+1.20e-02

Pos logit contributions
+0.016
unt+0.015
ha+0.014
OK+0.014
 hill+0.014
Neg logit contributions
e-0.028
ow-0.027
years-0.025
ric-0.023
eking-0.023
Token the
Token ID262
Feature activation+1.36e-02

Pos logit contributions
+0.019
unt+0.018
ha+0.017
OK+0.017
 trail+0.017
Neg logit contributions
e-0.026
ow-0.023
years-0.021
ric-0.020
eking-0.020
Token day
Token ID1110
Feature activation+3.26e-02

Pos logit contributions
+0.018
unt+0.017
s+0.015
ha+0.015
+0.015
Neg logit contributions
e-0.033
ow-0.031
years-0.029
eking-0.027
ric-0.027
Token they
Token ID484
Feature activation+1.99e-02

Pos logit contributions
+0.052
unt+0.051
ha+0.047
OK+0.046
 trail+0.046
Neg logit contributions
e-0.071
ow-0.064
years-0.059
eking-0.054
ric-0.054
Token experienced
Token ID5924
Feature activation+2.06e-02

Pos logit contributions
+0.033
unt+0.033
ha+0.030
 trail+0.029
s+0.029
Neg logit contributions
e-0.042
ow-0.038
years-0.035
eking-0.032
ric-0.032
Token the
Token ID262
Feature activation+1.84e-03

Pos logit contributions
+0.030
unt+0.030
ha+0.027
OK+0.027
s+0.026
Neg logit contributions
e-0.048
ow-0.043
years-0.040
eking-0.037
ric-0.037
Token magical
Token ID10883
Feature activation+7.98e-02

Pos logit contributions
+0.002
unt+0.002
s+0.002
ha+0.002
pool+0.002
Neg logit contributions
e-0.005
ow-0.004
years-0.004
eking-0.004
ric-0.004
Token spray
Token ID11662
Feature activation+4.12e-02

Pos logit contributions
+0.101
unt+0.097
s+0.096
pool+0.092
Together+0.091
Neg logit contributions
e-0.191
ow-0.171
 eyel-0.152
years-0.151
eking-0.151
Token of
Token ID286
Feature activation-3.73e-02

Pos logit contributions
+0.061
unt+0.060
ha+0.056
OK+0.056
ww+0.054
Neg logit contributions
e-0.093
ow-0.083
years-0.071
ric-0.069
 eyel-0.067
Token the
Token ID262
Feature activation-8.52e-03

Pos logit contributions
e+0.078
ow+0.073
years+0.067
eking+0.061
ric+0.061
Neg logit contributions
-0.061
unt-0.059
 hill-0.055
 trail-0.054
 man-0.054
Token butterfly
Token ID35113
Feature activation-3.03e-02

Pos logit contributions
e+0.021
ow+0.019
years+0.018
eking+0.017
ric+0.017
Neg logit contributions
-0.011
unt-0.010
s-0.009
ha-0.009
pool-0.009
Token.
Token ID13
Feature activation-1.41e-04

Pos logit contributions
e+0.068
ow+0.060
years+0.051
ric+0.049
eking+0.049
Neg logit contributions
unt-0.048
-0.046
OK-0.045
ha-0.044
Okay-0.043
Token it
Token ID340
Feature activation+8.79e-03

Pos logit contributions
e+0.095
ow+0.086
years+0.081
ric+0.074
eking+0.073
Neg logit contributions
-0.058
unt-0.055
ha-0.052
pool-0.050
s-0.050
Token moved
Token ID3888
Feature activation-8.17e-03

Pos logit contributions
+0.014
unt+0.013
ha+0.012
OK+0.012
 trail+0.012
Neg logit contributions
e-0.019
ow-0.018
years-0.016
ric-0.015
eking-0.015
Token very
Token ID845
Feature activation-2.74e-02

Pos logit contributions
e+0.017
ow+0.016
years+0.015
ric+0.013
eking+0.013
Neg logit contributions
-0.014
unt-0.013
ha-0.012
s-0.012
OK-0.012
Token fast
Token ID3049
Feature activation+3.96e-02

Pos logit contributions
e+0.054
ow+0.051
years+0.048
eking+0.045
 eyel+0.042
Neg logit contributions
unt-0.047
-0.044
ha-0.042
www-0.040
ww-0.040
Token.
Token ID13
Feature activation-7.71e-03

Pos logit contributions
unt+0.081
+0.075
Sure+0.073
OK+0.073
ww+0.072
Neg logit contributions
e-0.071
ow-0.064
years-0.054
ps-0.052
eking-0.049
Token A
Token ID317
Feature activation+7.61e-02

Pos logit contributions
e+0.018
ow+0.016
years+0.014
ric+0.014
 eyel+0.014
Neg logit contributions
ha-0.010
-0.010
unt-0.010
ww-0.010
s-0.010
Token little
Token ID1310
Feature activation+5.55e-02

Pos logit contributions
First+0.099
pool+0.098
 First+0.095
s+0.094
Together+0.089
Neg logit contributions
e-0.159
ow-0.129
 eyel-0.127
years-0.126
 staring-0.123
Token boy
Token ID2933
Feature activation-4.02e-02

Pos logit contributions
Together+0.055
First+0.051
unt+0.050
+0.049
ha+0.048
Neg logit contributions
e-0.146
ow-0.136
years-0.119
 eyel-0.117
owing-0.115
Token looked
Token ID3114
Feature activation+1.06e-02

Pos logit contributions
e+0.067
ow+0.065
years+0.062
ric+0.059
eking+0.058
Neg logit contributions
-0.078
unt-0.075
s-0.073
ha-0.072
 man-0.070
Token up
Token ID510
Feature activation-9.67e-03

Pos logit contributions
+0.017
unt+0.016
s+0.015
ha+0.015
 hill+0.015
Neg logit contributions
e-0.023
ow-0.021
years-0.019
ric-0.018
eking-0.018
Token at
Token ID379
Feature activation-3.55e-02

Pos logit contributions
e+0.021
ow+0.019
years+0.018
ric+0.016
eking+0.016
Neg logit contributions
-0.015
unt-0.015
ha-0.014
s-0.014
OK-0.014
Token around
Token ID1088
Feature activation-3.43e-02

Pos logit contributions
e+0.141
ow+0.127
years+0.121
ric+0.109
eking+0.107
Neg logit contributions
-0.111
unt-0.106
s-0.103
 hill-0.103
 trail-0.102
Token it
Token ID340
Feature activation-1.23e-03

Pos logit contributions
e+0.083
ow+0.079
years+0.074
ric+0.070
eking+0.068
Neg logit contributions
-0.043
 hill-0.039
unt-0.039
 trail-0.039
s-0.038
Token.
Token ID13
Feature activation-1.93e-03

Pos logit contributions
e+0.002
ow+0.002
years+0.002
eking+0.002
owing+0.002
Neg logit contributions
unt-0.002
-0.002
OK-0.002
ww-0.002
ha-0.002
Token She
Token ID1375
Feature activation+5.18e-02

Pos logit contributions
e+0.005
ow+0.004
years+0.004
ric+0.004
 eyel+0.004
Neg logit contributions
-0.002
unt-0.002
ha-0.002
 trail-0.002
OK-0.002
Token was
Token ID373
Feature activation-7.21e-02

Pos logit contributions
+0.066
unt+0.063
s+0.063
ha+0.062
OK+0.061
Neg logit contributions
e-0.125
ow-0.117
years-0.109
ric-0.107
eking-0.104
Token scared
Token ID12008
Feature activation+5.57e-02

Pos logit contributions
e+0.174
years+0.146
ric+0.146
ow+0.144
 elbows+0.144
Neg logit contributions
 few-0.077
 man-0.074
unt-0.074
 while-0.073
 steep-0.072
Token and
Token ID290
Feature activation-3.54e-02

Pos logit contributions
+0.095
unt+0.088
OK+0.086
ha+0.083
 trail+0.081
Neg logit contributions
e-0.115
ow-0.113
years-0.100
eking-0.090
owing-0.090
Token was
Token ID373
Feature activation-7.49e-02

Pos logit contributions
e+0.082
ow+0.071
years+0.068
ric+0.068
eking+0.062
Neg logit contributions
-0.050
unt-0.047
 man-0.046
OK-0.046
s-0.046
Token about
Token ID546
Feature activation-2.78e-02

Pos logit contributions
e+0.169
ric+0.142
ow+0.141
years+0.139
 elbows+0.131
Neg logit contributions
unt-0.102
 few-0.094
 man-0.093
 trail-0.090
-0.090
Token to
Token ID284
Feature activation-4.43e-02

Pos logit contributions
e+0.078
ow+0.072
years+0.068
eking+0.064
ric+0.064
Neg logit contributions
 trail-0.024
 hill-0.024
-0.024
unt-0.023
 few-0.022
Token turn
Token ID1210
Feature activation+1.18e-03

Pos logit contributions
years+0.115
e+0.102
ric+0.095
ow+0.094
eking+0.093
Neg logit contributions
 trail-0.049
 hill-0.049
-0.049
 wagon-0.047
 man-0.046
Token.
Token ID13
Feature activation+1.92e-03

Pos logit contributions
+0.009
unt+0.009
ha+0.008
OK+0.008
+0.008
Neg logit contributions
e-0.013
ow-0.012
years-0.011
eking-0.010
ric-0.010
Token Tim
Token ID5045
Feature activation-4.87e-02

Pos logit contributions
+0.003
ha+0.003
 trail+0.003
unt+0.003
OK+0.003
Neg logit contributions
ow-0.004
e-0.003
years-0.003
 eyel-0.003
ric-0.003
Tokenmy
Token ID1820
Feature activation-8.75e-03

Pos logit contributions
e+0.093
ow+0.085
years+0.081
 eyel+0.074
eking+0.073
Neg logit contributions
-0.088
unt-0.084
s-0.083
 man-0.078
ha-0.077
Token was
Token ID373
Feature activation-6.29e-02

Pos logit contributions
e+0.020
ow+0.018
years+0.017
ric+0.016
eking+0.016
Neg logit contributions
-0.013
unt-0.013
s-0.012
ha-0.012
 hill-0.011
Token so
Token ID523
Feature activation-5.33e-03

Pos logit contributions
e+0.150
years+0.138
ow+0.137
ric+0.136
els+0.136
Neg logit contributions
unt-0.064
 man-0.063
 few-0.060
-0.059
 while-0.056
Token happy
Token ID3772
Feature activation+6.18e-02

Pos logit contributions
e+0.012
ow+0.011
years+0.011
ric+0.010
eking+0.010
Neg logit contributions
-0.008
unt-0.007
s-0.007
OK-0.007
 man-0.007
Token and
Token ID290
Feature activation-1.52e-02

Pos logit contributions
+0.087
unt+0.083
OK+0.078
ha+0.078
 hill+0.073
Neg logit contributions
e-0.145
ow-0.137
years-0.122
eking-0.115
ric-0.114
Token thanked
Token ID26280
Feature activation+2.50e-02

Pos logit contributions
e+0.033
ow+0.031
years+0.028
eking+0.026
 eyel+0.025
Neg logit contributions
-0.023
unt-0.023
ha-0.022
 hill-0.021
OK-0.021
Token his
Token ID465
Feature activation-2.77e-03

Pos logit contributions
unt+0.043
+0.043
 man+0.040
 wagon+0.039
 trail+0.037
Neg logit contributions
e-0.051
ow-0.048
years-0.045
ric-0.041
eking-0.041
Token mom
Token ID1995
Feature activation-5.22e-02

Pos logit contributions
e+0.006
ow+0.006
years+0.005
 eyel+0.005
eking+0.005
Neg logit contributions
-0.004
pool-0.004
unt-0.004
ha-0.004
ww-0.004
Tokenmy
Token ID1820
Feature activation-2.59e-02

Pos logit contributions
e+0.095
ow+0.085
years+0.079
eking+0.072
ric+0.071
Neg logit contributions
-0.099
unt-0.097
s-0.095
 man-0.090
 hill-0.089
Token excited
Token ID6568
Feature activation+5.25e-02

Pos logit contributions
e+0.059
ow+0.054
years+0.050
eking+0.045
ric+0.044
Neg logit contributions
-0.055
unt-0.053
ha-0.050
 trail-0.049
 hill-0.048
Token.
Token ID13
Feature activation-8.41e-03

Pos logit contributions
+0.081
unt+0.078
ha+0.073
OK+0.072
 hill+0.070
Neg logit contributions
e-0.116
ow-0.110
years-0.099
eking-0.093
 eyel-0.091
Token
Token ID198
Feature activation+2.49e-02

Pos logit contributions
e+0.019
ow+0.018
years+0.017
ric+0.015
 eyel+0.015
Neg logit contributions
-0.011
unt-0.011
 trail-0.011
ha-0.010
 hill-0.010
Token
Token ID198
Feature activation+3.51e-02

Pos logit contributions
+0.028
unt+0.027
 trail+0.024
OK+0.024
 hill+0.024
Neg logit contributions
e-0.065
ow-0.060
years-0.056
ric-0.052
eking-0.051
TokenShe
Token ID3347
Feature activation+4.89e-02

Pos logit contributions
+0.052
 wagon+0.045
unt+0.044
 hill+0.044
pool+0.044
Neg logit contributions
e-0.082
years-0.074
ow-0.069
eking-0.060
ad-0.059
Token ran
Token ID4966
Feature activation-1.49e-02

Pos logit contributions
+0.083
unt+0.080
 hill+0.074
OK+0.073
ha+0.073
Neg logit contributions
e-0.101
ow-0.092
years-0.085
eking-0.076
ric-0.076
Token to
Token ID284
Feature activation-3.83e-02

Pos logit contributions
e+0.030
ow+0.028
years+0.025
eking+0.023
ric+0.023
Neg logit contributions
-0.026
unt-0.025
OK-0.023
ha-0.023
s-0.023
Token the
Token ID262
Feature activation-1.21e-03

Pos logit contributions
e+0.079
years+0.074
ow+0.074
eking+0.069
ric+0.068
Neg logit contributions
-0.060
unt-0.059
 man-0.058
 hill-0.056
 trail-0.055
Token spell
Token ID4822
Feature activation+1.27e-02

Pos logit contributions
e+0.003
ow+0.003
years+0.003
ric+0.002
eking+0.002
Neg logit contributions
-0.002
unt-0.001
 hill-0.001
 trail-0.001
ha-0.001
Token,
Token ID11
Feature activation+5.26e-03

Pos logit contributions
+0.019
unt+0.018
OK+0.017
ha+0.017
ww+0.017
Neg logit contributions
e-0.029
ow-0.027
years-0.024
eking-0.022
ric-0.022
Token but
Token ID475
Feature activation+4.06e-03

Pos logit contributions
+0.007
unt+0.007
s+0.007
 man+0.007
+0.007
Neg logit contributions
e-0.012
ow-0.011
years-0.010
ric-0.010
eking-0.009
Token.
Token ID13
Feature activation-1.10e-02

Pos logit contributions
+0.093
unt+0.090
OK+0.084
+0.083
ha+0.083
Neg logit contributions
e-0.107
ow-0.101
years-0.085
 eyel-0.081
owing-0.081
Token
Token ID198
Feature activation+2.32e-02

Pos logit contributions
e+0.027
ow+0.025
years+0.024
 eyel+0.022
ric+0.022
Neg logit contributions
-0.013
unt-0.013
 trail-0.012
ha-0.012
 hill-0.012
Token
Token ID198
Feature activation+3.38e-02

Pos logit contributions
+0.028
unt+0.026
 trail+0.026
 hill+0.025
 wagon+0.024
Neg logit contributions
e-0.058
ow-0.053
years-0.050
ric-0.046
eking-0.045
TokenHe
Token ID1544
Feature activation+4.61e-02

Pos logit contributions
+0.043
 trail+0.039
unt+0.038
 hill+0.038
 wagon+0.036
Neg logit contributions
e-0.085
years-0.075
ow-0.074
eking-0.064
ric-0.063
Token finally
Token ID3443
Feature activation+2.60e-02

Pos logit contributions
+0.072
unt+0.070
ha+0.066
s+0.065
 trail+0.064
Neg logit contributions
e-0.100
ow-0.092
years-0.086
ric-0.081
eking-0.078
Token made
Token ID925
Feature activation+1.67e-02

Pos logit contributions
+0.045
unt+0.044
OK+0.041
 hill+0.041
 wagon+0.040
Neg logit contributions
e-0.053
ow-0.048
years-0.043
eking-0.039
ric-0.038
Token it
Token ID340
Feature activation+7.82e-04

Pos logit contributions
+0.025
unt+0.024
 man+0.024
ha+0.024
 trail+0.023
Neg logit contributions
e-0.035
years-0.033
ow-0.033
ric-0.030
eking-0.029
Token back
Token ID736
Feature activation-5.31e-02

Pos logit contributions
+0.001
unt+0.001
s+0.001
 trail+0.001
 hill+0.001
Neg logit contributions
e-0.002
ow-0.001
years-0.001
ric-0.001
eking-0.001
Token home
Token ID1363
Feature activation-3.79e-02

Pos logit contributions
e+0.123
years+0.111
ow+0.109
ric+0.099
eking+0.099
Neg logit contributions
-0.072
unt-0.070
s-0.068
 trail-0.066
 hill-0.066
Token.
Token ID13
Feature activation-7.25e-03

Pos logit contributions
e+0.093
ow+0.085
years+0.082
ric+0.077
eking+0.075
Neg logit contributions
-0.047
unt-0.045
s-0.044
 man-0.043
 trail-0.042
Token Mom
Token ID11254
Feature activation-5.06e-02

Pos logit contributions
e+0.017
ow+0.016
years+0.015
 eyel+0.014
ric+0.014
Neg logit contributions
-0.009
unt-0.009
 trail-0.009
ha-0.009
s-0.008
Token't
Token ID470
Feature activation-3.98e-02

Pos logit contributions
e+0.171
ow+0.157
years+0.141
ric+0.131
eking+0.128
Neg logit contributions
-0.128
unt-0.127
OK-0.119
ha-0.118
 trail-0.116
Token fall
Token ID2121
Feature activation-2.01e-02

Pos logit contributions
e+0.070
ow+0.065
years+0.053
eking+0.053
ric+0.052
Neg logit contributions
unt-0.078
-0.075
ha-0.070
OK-0.070
ww-0.070
Token down
Token ID866
Feature activation-2.83e-02

Pos logit contributions
e+0.039
ow+0.037
years+0.032
ric+0.030
eking+0.030
Neg logit contributions
unt-0.036
-0.035
OK-0.034
ha-0.033
Okay-0.032
Token again
Token ID757
Feature activation-2.64e-02

Pos logit contributions
e+0.061
ow+0.056
years+0.051
ric+0.047
eking+0.047
Neg logit contributions
unt-0.045
-0.044
OK-0.041
ha-0.041
-0.039
Token.
Token ID13
Feature activation-9.90e-03

Pos logit contributions
e+0.056
ow+0.049
years+0.043
eking+0.041
ric+0.040
Neg logit contributions
unt-0.046
-0.043
OK-0.042
ha-0.042
Okay-0.040
Token But
Token ID887
Feature activation+7.19e-03

Pos logit contributions
e+0.020
ow+0.020
years+0.017
ric+0.017
 eyel+0.016
Neg logit contributions
 trail-0.015
ha-0.015
unt-0.014
-0.014
ww-0.014
Token he
Token ID339
Feature activation+5.03e-02

Pos logit contributions
+0.010
unt+0.009
s+0.009
ha+0.009
OK+0.009
Neg logit contributions
e-0.017
ow-0.016
years-0.015
ric-0.014
eking-0.014
Token couldn
Token ID3521
Feature activation-4.04e-02

Pos logit contributions
+0.074
unt+0.070
s+0.069
ha+0.069
 man+0.067
Neg logit contributions
e-0.111
ow-0.104
years-0.098
ric-0.096
eking-0.093
Token't
Token ID470
Feature activation-5.43e-03

Pos logit contributions
e+0.090
ow+0.083
years+0.077
ric+0.071
eking+0.070
Neg logit contributions
-0.062
unt-0.059
ha-0.055
s-0.054
OK-0.054
Token use
Token ID779
Feature activation+1.27e-02

Pos logit contributions
e+0.011
ow+0.010
years+0.009
ric+0.008
eking+0.008
Neg logit contributions
unt-0.010
-0.010
ha-0.009
s-0.009
OK-0.009
Token the
Token ID262
Feature activation+1.43e-02

Pos logit contributions
+0.024
unt+0.023
ha+0.022
s+0.021
OK+0.021
Neg logit contributions
e-0.024
ow-0.022
years-0.020
ric-0.018
eking-0.018
Token pass
Token ID1208
Feature activation+1.06e-02

Pos logit contributions
e+0.011
ow+0.009
ric+0.009
years+0.008
oken+0.008
Neg logit contributions
unt-0.005
-0.005
s-0.004
Okay-0.004
ha-0.004
Token.
Token ID13
Feature activation+3.16e-03

Pos logit contributions
+0.016
unt+0.016
ha+0.014
OK+0.014
ww+0.014
Neg logit contributions
e-0.025
ow-0.023
years-0.020
ric-0.019
eking-0.019
Token When
Token ID1649
Feature activation+4.28e-02

Pos logit contributions
+0.004
unt+0.004
 trail+0.004
ha+0.004
OK+0.004
Neg logit contributions
e-0.008
ow-0.007
years-0.006
ric-0.006
 eyel-0.006
Token it
Token ID340
Feature activation+3.74e-02

Pos logit contributions
unt+0.071
+0.071
OK+0.064
ha+0.064
s+0.063
Neg logit contributions
e-0.088
ow-0.079
years-0.072
ric-0.065
 eyel-0.065
Token arrived
Token ID5284
Feature activation-1.29e-03

Pos logit contributions
+0.067
unt+0.066
OK+0.061
 wagon+0.059
ww+0.058
Neg logit contributions
e-0.074
ow-0.068
years-0.059
eking-0.054
ric-0.052
Token,
Token ID11
Feature activation+3.84e-02

Pos logit contributions
e+0.003
ow+0.002
years+0.002
eking+0.002
owing+0.002
Neg logit contributions
unt-0.002
-0.002
ha-0.002
OK-0.002
 trail-0.002
Token the
Token ID262
Feature activation+3.98e-02

Pos logit contributions
unt+0.074
+0.073
ha+0.069
OK+0.067
ww+0.066
Neg logit contributions
e-0.069
ow-0.062
years-0.055
ric-0.050
eking-0.050
Token mouse
Token ID10211
Feature activation-5.19e-03

Pos logit contributions
+0.056
unt+0.055
pool+0.054
ha+0.052
s+0.052
Neg logit contributions
e-0.093
ow-0.080
years-0.075
 eyel-0.069
ric-0.066
Token was
Token ID373
Feature activation-6.78e-02

Pos logit contributions
e+0.011
ow+0.009
years+0.008
eking+0.008
ric+0.008
Neg logit contributions
-0.009
unt-0.009
OK-0.008
ha-0.008
ww-0.008
Token filled
Token ID5901
Feature activation+1.52e-02

Pos logit contributions
e+0.138
ow+0.127
years+0.122
ric+0.115
 eyel+0.112
Neg logit contributions
-0.103
unt-0.100
 man-0.095
 few-0.095
s-0.092
Token with
Token ID351
Feature activation-3.90e-02

Pos logit contributions
+0.019
 man+0.017
s+0.017
unt+0.017
 hill+0.016
Neg logit contributions
ow-0.036
e-0.036
years-0.034
ric-0.033
eking-0.031
Token find
Token ID1064
Feature activation-2.50e-02

Pos logit contributions
e+0.027
ow+0.024
years+0.020
ric+0.019
eking+0.019
Neg logit contributions
-0.027
unt-0.027
ha-0.026
OK-0.025
s-0.025
Token out
Token ID503
Feature activation-1.72e-02

Pos logit contributions
e+0.046
ow+0.042
years+0.038
ric+0.035
eking+0.034
Neg logit contributions
-0.048
unt-0.046
ha-0.044
s-0.043
OK-0.043
Token what
Token ID644
Feature activation+1.77e-03

Pos logit contributions
e+0.032
ow+0.029
years+0.026
eking+0.024
ric+0.023
Neg logit contributions
-0.033
unt-0.031
OK-0.031
pool-0.030
 trail-0.030
Token is
Token ID318
Feature activation-5.73e-02

Pos logit contributions
+0.003
unt+0.003
pool+0.002
OK+0.002
 trail+0.002
Neg logit contributions
e-0.004
ow-0.004
years-0.003
ric-0.003
eking-0.003
Token inside
Token ID2641
Feature activation-7.48e-03

Pos logit contributions
e+0.108
ow+0.098
years+0.086
eking+0.082
ric+0.079
Neg logit contributions
-0.108
unt-0.105
OK-0.098
s-0.098
ha-0.097
Token soon
Token ID2582
Feature activation+1.92e-02

Pos logit contributions
e+0.018
ow+0.017
years+0.015
eking+0.014
ric+0.014
Neg logit contributions
-0.010
unt-0.010
OK-0.009
ha-0.009
 trail-0.009
Token enough
Token ID1576
Feature activation+6.63e-02

Pos logit contributions
unt+0.027
Sure+0.025
OK+0.025
Okay+0.024
ha+0.024
Neg logit contributions
e-0.046
ow-0.043
eking-0.037
owing-0.036
ps-0.035
Token."
Token ID526
Feature activation-3.64e-03

Pos logit contributions
+0.089
Okay+0.089
Sure+0.087
OK+0.087
+0.087
Neg logit contributions
e-0.147
ow-0.143
eking-0.120
ric-0.118
ps-0.117
Token
Token ID198
Feature activation+3.02e-02

Pos logit contributions
e+0.009
ow+0.009
years+0.008
ric+0.008
 eyel+0.008
Neg logit contributions
-0.004
unt-0.004
ha-0.004
 trail-0.004
 hill-0.004
Token
Token ID198
Feature activation+4.17e-02

Pos logit contributions
+0.036
unt+0.035
 wagon+0.035
 trail+0.035
pool+0.034
Neg logit contributions
e-0.074
ow-0.067
years-0.061
ric-0.057
eking-0.055
TokenThe
Token ID464
Feature activation+4.34e-02

Pos logit contributions
+0.062
pool+0.061
 wagon+0.058
 trigger+0.057
 hill+0.056
Neg logit contributions
e-0.093
years-0.076
ow-0.070
ad-0.062
 eyel-0.062
Token
Token ID220
Feature activation-1.63e-02

Pos logit contributions
e+0.004
ow+0.004
years+0.004
ric+0.004
 eyel+0.003
Neg logit contributions
-0.002
unt-0.002
ha-0.002
 trail-0.002
OK-0.002
Token
Token ID198
Feature activation+2.98e-02

Pos logit contributions
e+0.046
ow+0.043
years+0.041
eking+0.038
ric+0.038
Neg logit contributions
-0.015
unt-0.014
s-0.013
ha-0.012
OK-0.012
Token
Token ID198
Feature activation+4.06e-02

Pos logit contributions
+0.029
unt+0.027
s+0.024
ha+0.023
OK+0.023
Neg logit contributions
e-0.083
ow-0.079
years-0.074
eking-0.070
ric-0.070
TokenThe
Token ID464
Feature activation+4.55e-02

Pos logit contributions
+0.049
unt+0.045
 trail+0.044
 hill+0.044
pool+0.043
Neg logit contributions
e-0.105
ow-0.091
years-0.089
ric-0.081
eking-0.079
Token charming
Token ID23332
Feature activation+9.52e-02

Pos logit contributions
+0.063
unt+0.060
s+0.058
pool+0.057
ha+0.057
Neg logit contributions
e-0.103
ow-0.095
years-0.090
 eyel-0.085
eking-0.080
Token daughter
Token ID4957
Feature activation-3.12e-02

Pos logit contributions
+0.140
ha+0.128
pool+0.127
unt+0.123
Together+0.123
Neg logit contributions
e-0.198
ow-0.185
 eyel-0.176
years-0.166
 gazing-0.155
Token was
Token ID373
Feature activation-7.74e-02

Pos logit contributions
e+0.065
ow+0.060
years+0.055
ric+0.050
eking+0.049
Neg logit contributions
-0.052
unt-0.050
ha-0.047
s-0.046
OK-0.046
Token no
Token ID645
Feature activation+7.71e-03

Pos logit contributions
e+0.155
ow+0.143
years+0.140
eking+0.132
ric+0.130
Neg logit contributions
unt-0.120
-0.119
 few-0.108
 man-0.106
s-0.105
Token longer
Token ID2392
Feature activation+3.97e-02

Pos logit contributions
+0.011
unt+0.011
 trail+0.010
 hill+0.010
+0.010
Neg logit contributions
e-0.018
ow-0.016
years-0.015
 eyel-0.014
eking-0.014
Token scared
Token ID12008
Feature activation+5.02e-02

Pos logit contributions
+0.059
unt+0.056
 man+0.055
 few+0.053
s+0.052
Neg logit contributions
e-0.085
ow-0.079
ric-0.072
years-0.071
eking-0.067
Token and
Token ID290
Feature activation-1.67e-02

Pos logit contributions
+0.078
unt+0.074
ha+0.069
OK+0.069
 trail+0.068
Neg logit contributions
e-0.112
ow-0.105
years-0.095
eking-0.087
 eyel-0.086
Token
Token ID198
Feature activation+3.19e-02

Pos logit contributions
e+0.008
ow+0.008
years+0.007
ric+0.007
 eyel+0.007
Neg logit contributions
ha-0.006
-0.006
 trail-0.006
OK-0.006
ww-0.006
Token
Token ID198
Feature activation+4.49e-02

Pos logit contributions
+0.037
unt+0.035
OK+0.033
ha+0.033
 trail+0.033
Neg logit contributions
e-0.081
ow-0.075
years-0.068
ric-0.065
eking-0.063
TokenTim
Token ID14967
Feature activation-2.93e-02

Pos logit contributions
+0.056
pool+0.055
umm+0.054
 wagon+0.052
uddle+0.052
Neg logit contributions
e-0.110
ow-0.087
years-0.083
ad-0.073
 crawling-0.072
Token went
Token ID1816
Feature activation-4.89e-02

Pos logit contributions
e+0.064
ow+0.058
years+0.052
eking+0.047
ric+0.047
Neg logit contributions
-0.046
unt-0.045
OK-0.042
ha-0.042
pool-0.042
Token outside
Token ID2354
Feature activation-1.60e-02

Pos logit contributions
e+0.111
ow+0.102
years+0.095
ric+0.088
eking+0.087
Neg logit contributions
-0.073
unt-0.070
ha-0.065
s-0.064
OK-0.064
Token and
Token ID290
Feature activation-2.18e-02

Pos logit contributions
e+0.040
ow+0.038
years+0.036
ric+0.034
eking+0.034
Neg logit contributions
-0.019
unt-0.018
s-0.017
 hill-0.016
 man-0.016
Token said
Token ID531
Feature activation+2.42e-02

Pos logit contributions
e+0.047
ow+0.043
years+0.040
eking+0.036
ric+0.036
Neg logit contributions
unt-0.033
-0.033
ha-0.030
pool-0.030
 trail-0.030
Token,
Token ID11
Feature activation+3.01e-02

Pos logit contributions
+0.033
unt+0.032
ha+0.029
OK+0.029
 trail+0.029
Neg logit contributions
e-0.057
ow-0.054
years-0.050
ric-0.046
eking-0.045
Token "
Token ID366
Feature activation+1.15e-03

Pos logit contributions
+0.033
unt+0.031
ha+0.029
OK+0.028
 trail+0.028
Neg logit contributions
e-0.077
ow-0.074
years-0.069
ric-0.064
 eyel-0.064
TokenL
Token ID43
Feature activation-3.85e-02

Pos logit contributions
+0.001
 trail+0.001
unt+0.001
 hill+0.001
 wagon+0.001
Neg logit contributions
e-0.003
years-0.002
ow-0.002
 eyel-0.002
ric-0.002
Tokenily
Token ID813
Feature activation-2.50e-03

Pos logit contributions
e+0.062
ow+0.054
years+0.052
ric+0.050
eking+0.045
Neg logit contributions
unt-0.081
-0.080
ha-0.079
OK-0.074
s-0.074
Token dragon
Token ID10441
Feature activation-1.21e-02

Pos logit contributions
+0.020
unt+0.018
s+0.017
OK+0.017
+0.017
Neg logit contributions
e-0.030
ow-0.028
years-0.026
eking-0.024
 staring-0.024
Token.
Token ID13
Feature activation+5.70e-03

Pos logit contributions
e+0.029
ow+0.027
years+0.026
eking+0.023
ric+0.023
Neg logit contributions
-0.016
unt-0.015
s-0.015
 trail-0.014
ha-0.014
Token All
Token ID1439
Feature activation+2.64e-02

Pos logit contributions
+0.008
ha+0.008
ww+0.007
unt+0.007
 trail+0.007
Neg logit contributions
e-0.013
ow-0.013
years-0.012
 eyel-0.011
ric-0.010
Token of
Token ID286
Feature activation-2.53e-02

Pos logit contributions
+0.044
unt+0.042
OK+0.039
ha+0.039
 trail+0.039
Neg logit contributions
e-0.057
ow-0.052
years-0.048
ric-0.043
eking-0.042
Token the
Token ID262
Feature activation+9.09e-03

Pos logit contributions
e+0.052
ow+0.051
years+0.045
owing+0.044
eking+0.044
Neg logit contributions
unt-0.038
 few-0.037
 man-0.037
ha-0.037
-0.036
Token people
Token ID661
Feature activation-4.74e-02

Pos logit contributions
+0.014
unt+0.013
ha+0.012
OK+0.012
pool+0.012
Neg logit contributions
e-0.020
ow-0.018
years-0.018
eking-0.016
ric-0.016
Token were
Token ID547
Feature activation-8.00e-02

Pos logit contributions
e+0.098
ow+0.091
years+0.083
ric+0.078
eking+0.076
Neg logit contributions
-0.078
unt-0.075
s-0.073
ha-0.070
 hill-0.069
Token amazed
Token ID24562
Feature activation+4.63e-02

Pos logit contributions
e+0.167
ow+0.152
years+0.146
ric+0.135
 eyel+0.135
Neg logit contributions
unt-0.122
-0.117
 man-0.117
s-0.114
 few-0.109
Token because
Token ID780
Feature activation+2.69e-02

Pos logit contributions
+0.078
unt+0.078
ha+0.072
 trail+0.070
OK+0.069
Neg logit contributions
e-0.096
ow-0.089
years-0.081
eking-0.074
owing-0.073
Token it
Token ID340
Feature activation+3.38e-02

Pos logit contributions
+0.034
unt+0.031
ha+0.028
OK+0.028
 trail+0.028
Neg logit contributions
e-0.068
ow-0.063
years-0.060
ric-0.054
eking-0.054
Token was
Token ID373
Feature activation-8.16e-02

Pos logit contributions
+0.052
unt+0.050
ha+0.047
s+0.046
OK+0.046
Neg logit contributions
e-0.075
ow-0.069
years-0.064
ric-0.059
eking-0.059
Token free
Token ID1479
Feature activation+3.67e-02

Pos logit contributions
e+0.065
ow+0.059
years+0.055
eking+0.050
 eyel+0.049
Neg logit contributions
-0.050
unt-0.049
ha-0.046
 hill-0.045
 trail-0.044
Token.
Token ID13
Feature activation-7.38e-03

Pos logit contributions
+0.054
unt+0.052
ha+0.047
OK+0.047
s+0.046
Neg logit contributions
e-0.085
ow-0.079
years-0.073
ric-0.067
eking-0.067
Token She
Token ID1375
Feature activation+5.49e-02

Pos logit contributions
e+0.018
ow+0.017
years+0.016
ric+0.015
eking+0.014
Neg logit contributions
-0.009
unt-0.009
ha-0.008
 trail-0.008
OK-0.008
Token spun
Token ID26843
Feature activation+5.03e-03

Pos logit contributions
+0.098
unt+0.096
ha+0.089
 hill+0.087
 trail+0.087
Neg logit contributions
e-0.108
ow-0.098
years-0.090
ric-0.081
eking-0.081
Token the
Token ID262
Feature activation-4.82e-03

Pos logit contributions
+0.009
unt+0.009
OK+0.008
ha+0.008
 hill+0.008
Neg logit contributions
e-0.010
ow-0.009
years-0.008
eking-0.007
 eyel-0.007
Token wheel
Token ID7825
Feature activation-2.24e-02

Pos logit contributions
e+0.013
ow+0.012
years+0.011
ric+0.011
eking+0.010
Neg logit contributions
-0.005
unt-0.005
ha-0.005
s-0.004
OK-0.004
Token faster
Token ID5443
Feature activation+3.30e-02

Pos logit contributions
e+0.049
ow+0.046
years+0.042
eking+0.038
ric+0.038
Neg logit contributions
-0.035
unt-0.033
ha-0.031
OK-0.030
s-0.030
Token and
Token ID290
Feature activation-3.96e-02

Pos logit contributions
+0.048
unt+0.046
ha+0.042
OK+0.041
s+0.041
Neg logit contributions
e-0.077
ow-0.071
years-0.065
eking-0.060
ric-0.060
Token faster
Token ID5443
Feature activation+4.21e-02

Pos logit contributions
e+0.088
ow+0.081
years+0.076
eking+0.070
ric+0.069
Neg logit contributions
-0.060
unt-0.058
ha-0.053
 hill-0.052
 trail-0.052
Token,
Token ID11
Feature activation+2.02e-03

Pos logit contributions
+0.059
unt+0.057
ha+0.052
OK+0.050
s+0.050
Neg logit contributions
e-0.100
ow-0.093
years-0.086
eking-0.080
ric-0.079
Token until
Token ID1566
Feature activation+1.51e-02

Pos logit contributions
+0.002
unt+0.002
OK+0.002
 hill+0.002
 man+0.002
Neg logit contributions
e-0.005
ow-0.005
years-0.005
ric-0.004
eking-0.004
Token build
Token ID1382
Feature activation-1.01e-02

Pos logit contributions
e+0.114
ow+0.106
years+0.099
ric+0.091
eking+0.091
Neg logit contributions
-0.076
unt-0.073
ha-0.067
s-0.066
 trail-0.066
Token a
Token ID257
Feature activation+2.77e-02

Pos logit contributions
years+0.022
e+0.021
ow+0.021
ric+0.020
owing+0.019
Neg logit contributions
-0.015
 man-0.014
 wagon-0.014
 trail-0.014
 hill-0.014
Token house
Token ID2156
Feature activation-2.12e-02

Pos logit contributions
+0.035
unt+0.035
ha+0.034
OK+0.034
pool+0.033
Neg logit contributions
e-0.062
ow-0.056
years-0.051
ric-0.050
 eyel-0.050
Token for
Token ID329
Feature activation-4.53e-02

Pos logit contributions
e+0.044
ow+0.040
years+0.036
eking+0.033
ric+0.033
Neg logit contributions
-0.036
unt-0.036
ha-0.034
OK-0.033
s-0.032
Token his
Token ID465
Feature activation-2.32e-02

Pos logit contributions
e+0.090
ow+0.086
ric+0.077
years+0.076
owing+0.074
Neg logit contributions
-0.073
 few-0.073
 man-0.072
 wagon-0.071
 trail-0.071
Token dolls
Token ID36062
Feature activation-3.61e-02

Pos logit contributions
e+0.053
ow+0.051
years+0.046
 eyel+0.043
ric+0.043
Neg logit contributions
-0.033
unt-0.031
ha-0.030
pool-0.029
s-0.028
Token.
Token ID13
Feature activation-3.00e-03

Pos logit contributions
e+0.075
ow+0.071
years+0.059
owing+0.057
eking+0.056
Neg logit contributions
unt-0.064
-0.061
ww-0.059
ha-0.059
OK-0.058
Token He
Token ID679
Feature activation+4.68e-02

Pos logit contributions
e+0.006
ow+0.006
ric+0.005
years+0.005
 eyel+0.005
Neg logit contributions
ha-0.004
unt-0.004
ww-0.004
-0.004
 trail-0.004
Token asked
Token ID1965
Feature activation+4.98e-03

Pos logit contributions
+0.071
unt+0.069
ha+0.064
s+0.063
OK+0.063
Neg logit contributions
e-0.105
ow-0.097
years-0.090
ric-0.084
eking-0.082
Token his
Token ID465
Feature activation-1.00e-02

Pos logit contributions
unt+0.009
+0.009
 man+0.008
 few+0.008
ha+0.008
Neg logit contributions
e-0.010
ow-0.009
ric-0.009
years-0.008
eking-0.008
Token mom
Token ID1995
Feature activation-5.22e-02

Pos logit contributions
e+0.021
ow+0.020
years+0.018
 eyel+0.017
ric+0.016
Neg logit contributions
-0.017
pool-0.016
ha-0.015
unt-0.015
OK-0.014
Token cat
Token ID3797
Feature activation-3.66e-02

Pos logit contributions
+0.119
Together+0.104
pool+0.100
unt+0.100
s+0.100
Neg logit contributions
e-0.210
ow-0.201
 eyel-0.168
ric-0.161
years-0.160
Token with
Token ID351
Feature activation-2.45e-02

Pos logit contributions
e+0.073
ow+0.066
years+0.058
eking+0.053
ric+0.053
Neg logit contributions
-0.066
unt-0.065
ha-0.059
OK-0.059
 trail-0.058
Token the
Token ID262
Feature activation+1.74e-02

Pos logit contributions
e+0.046
ow+0.042
years+0.038
ric+0.035
eking+0.034
Neg logit contributions
-0.046
unt-0.045
ha-0.042
s-0.042
OK-0.041
Token big
Token ID1263
Feature activation+6.57e-02

Pos logit contributions
+0.021
unt+0.020
ha+0.018
ww+0.017
pool+0.017
Neg logit contributions
e-0.043
ow-0.041
 elbows-0.035
ric-0.035
 eyel-0.035
Token lip
Token ID10645
Feature activation-4.10e-03

Pos logit contributions
+0.096
unt+0.092
ha+0.086
OK+0.085
s+0.085
Neg logit contributions
e-0.149
ow-0.140
years-0.121
 eyel-0.116
ric-0.116
Token were
Token ID547
Feature activation-9.95e-02

Pos logit contributions
e+0.009
ow+0.008
years+0.007
owing+0.007
 eyel+0.007
Neg logit contributions
-0.007
unt-0.007
OK-0.006
ha-0.006
www-0.006
Token very
Token ID845
Feature activation-3.56e-02

Pos logit contributions
e+0.194
ow+0.177
years+0.162
ric+0.147
eking+0.145
Neg logit contributions
-0.181
unt-0.176
ha-0.165
s-0.163
OK-0.162
Token happy
Token ID3772
Feature activation+5.48e-02

Pos logit contributions
e+0.081
ow+0.076
years+0.069
 eyel+0.065
eking+0.065
Neg logit contributions
-0.052
unt-0.052
ha-0.047
 trail-0.044
OK-0.043
Token.
Token ID13
Feature activation-1.44e-03

Pos logit contributions
+0.083
unt+0.082
ha+0.075
OK+0.072
ww+0.070
Neg logit contributions
e-0.124
ow-0.115
years-0.104
ric-0.097
eking-0.097
Token They
Token ID1119
Feature activation+5.86e-02

Pos logit contributions
e+0.004
ow+0.004
years+0.003
ric+0.003
 eyel+0.003
Neg logit contributions
-0.002
unt-0.002
ha-0.002
 trail-0.001
 hill-0.001
Token played
Token ID2826
Feature activation+6.83e-03

Pos logit contributions
+0.105
unt+0.101
ha+0.095
s+0.094
OK+0.094
Neg logit contributions
e-0.115
ow-0.105
years-0.097
ric-0.088
eking-0.087
Token They
Token ID1119
Feature activation+4.23e-02

Pos logit contributions
e+0.027
ow+0.024
years+0.022
ric+0.022
 eyel+0.021
Neg logit contributions
 trail-0.012
-0.011
ha-0.011
unt-0.011
s-0.011
Token were
Token ID547
Feature activation-7.62e-02

Pos logit contributions
+0.067
unt+0.067
ha+0.061
 trail+0.061
 hill+0.060
Neg logit contributions
e-0.091
ow-0.082
years-0.076
ric-0.069
 eyel-0.068
Token friends
Token ID2460
Feature activation-2.98e-02

Pos logit contributions
e+0.171
ow+0.160
years+0.149
ric+0.139
eking+0.137
Neg logit contributions
unt-0.109
-0.106
 man-0.098
OK-0.096
s-0.095
Token.
Token ID13
Feature activation-1.13e-02

Pos logit contributions
e+0.065
ow+0.057
years+0.051
owing+0.045
ric+0.045
Neg logit contributions
-0.049
unt-0.049
ha-0.048
 trail-0.047
ww-0.046
Token<|endoftext|>
Token ID50256
Feature activation-2.70e-02

Pos logit contributions
e+0.028
ow+0.025
years+0.023
ric+0.022
 eyel+0.021
Neg logit contributions
 trail-0.015
ha-0.014
-0.014
unt-0.014
 hill-0.013
TokenL
Token ID43
Feature activation-9.76e-02

Pos logit contributions
ow+0.076
years+0.076
e+0.075
eking+0.073
ric+0.068
Neg logit contributions
-0.025
s-0.019
unt-0.018
 hill-0.017
ha-0.016
Tokenily
Token ID813
Feature activation-7.24e-02

Pos logit contributions
e+0.179
years+0.176
ow+0.176
 eyel+0.160
 elbows+0.159
Neg logit contributions
ha-0.165
s-0.155
unt-0.155
-0.152
 man-0.146
Token and
Token ID290
Feature activation-2.67e-02

Pos logit contributions
e+0.175
ow+0.166
years+0.159
eking+0.148
ric+0.147
Neg logit contributions
-0.095
unt-0.088
s-0.082
OK-0.080
ha-0.080
Token Tom
Token ID4186
Feature activation-8.92e-02

Pos logit contributions
e+0.061
years+0.061
eking+0.060
oken+0.056
ow+0.056
Neg logit contributions
-0.041
unt-0.033
s-0.032
 man-0.032
-0.031
Token were
Token ID547
Feature activation-7.38e-02

Pos logit contributions
years+0.210
e+0.202
ow+0.199
eking+0.199
ric+0.185
Neg logit contributions
-0.123
s-0.107
OK-0.102
 man-0.097
-0.095
Token playing
Token ID2712
Feature activation-1.90e-02

Pos logit contributions
e+0.163
ow+0.141
ric+0.141
eking+0.135
years+0.134
Neg logit contributions
-0.099
unt-0.096
s-0.090
 wagon-0.089
 sun-0.089
Token clothes
Token ID8242
Feature activation-9.19e-04

Pos logit contributions
e+0.045
ow+0.042
years+0.039
ric+0.036
eking+0.036
Neg logit contributions
-0.028
unt-0.027
ha-0.025
 trail-0.025
s-0.025
Token and
Token ID290
Feature activation-2.23e-02

Pos logit contributions
e+0.002
ow+0.002
years+0.002
eking+0.002
owing+0.002
Neg logit contributions
-0.001
unt-0.001
OK-0.001
ha-0.001
ww-0.001
Token made
Token ID925
Feature activation+1.71e-02

Pos logit contributions
e+0.051
ow+0.047
years+0.044
ric+0.041
eking+0.040
Neg logit contributions
-0.032
unt-0.031
ha-0.029
s-0.029
 trail-0.029
Token sure
Token ID1654
Feature activation+3.88e-02

Pos logit contributions
+0.030
unt+0.029
ha+0.029
s+0.029
 man+0.028
Neg logit contributions
e-0.032
ow-0.030
years-0.028
ric-0.027
owing-0.024
Token everything
Token ID2279
Feature activation-8.29e-03

Pos logit contributions
+0.054
unt+0.052
ha+0.048
s+0.047
OK+0.047
Neg logit contributions
e-0.091
ow-0.085
years-0.079
ric-0.073
eking-0.072
Token was
Token ID373
Feature activation-1.05e-01

Pos logit contributions
e+0.018
ow+0.017
years+0.016
eking+0.014
ric+0.014
Neg logit contributions
-0.013
unt-0.012
ha-0.011
OK-0.011
 trail-0.011
Token ready
Token ID3492
Feature activation+1.40e-02

Pos logit contributions
e+0.219
ow+0.201
years+0.184
eking+0.171
ric+0.168
Neg logit contributions
-0.176
unt-0.171
ha-0.158
s-0.157
OK-0.157
Token.
Token ID13
Feature activation+2.65e-03

Pos logit contributions
+0.021
unt+0.021
OK+0.019
ha+0.019
 hill+0.018
Neg logit contributions
e-0.031
ow-0.029
years-0.026
eking-0.025
ric-0.024
Token
Token ID220
Feature activation-4.31e-03

Pos logit contributions
+0.004
unt+0.004
ha+0.003
OK+0.003
s+0.003
Neg logit contributions
e-0.006
ow-0.006
years-0.005
ric-0.005
eking-0.005
Token
Token ID198
Feature activation+3.72e-02

Pos logit contributions
e+0.012
ow+0.011
years+0.011
eking+0.010
ric+0.010
Neg logit contributions
-0.004
unt-0.004
s-0.003
ha-0.003
OK-0.003
Token
Token ID198
Feature activation+4.81e-02

Pos logit contributions
+0.042
unt+0.040
ha+0.036
OK+0.036
 trail+0.035
Neg logit contributions
e-0.097
ow-0.090
years-0.084
ric-0.079
eking-0.078
Token after
Token ID706
Feature activation-4.23e-02

Pos logit contributions
e+0.005
ow+0.005
years+0.004
ric+0.004
eking+0.004
Neg logit contributions
-0.004
unt-0.004
ha-0.003
s-0.003
OK-0.003
Token it
Token ID340
Feature activation-5.93e-03

Pos logit contributions
e+0.099
ow+0.093
years+0.087
ric+0.083
eking+0.081
Neg logit contributions
-0.057
unt-0.054
ha-0.051
s-0.051
 trail-0.051
Token,
Token ID11
Feature activation-1.98e-03

Pos logit contributions
e+0.014
ow+0.013
years+0.011
eking+0.011
 eyel+0.010
Neg logit contributions
-0.009
unt-0.009
ha-0.008
OK-0.008
-0.008
Token but
Token ID475
Feature activation-6.37e-03

Pos logit contributions
e+0.005
ow+0.005
years+0.005
ric+0.004
eking+0.004
Neg logit contributions
-0.002
unt-0.002
 hill-0.002
s-0.002
 trail-0.002
Token he
Token ID339
Feature activation+4.15e-02

Pos logit contributions
e+0.017
ow+0.015
years+0.015
ric+0.014
eking+0.014
Neg logit contributions
-0.007
unt-0.007
ha-0.006
s-0.006
 hill-0.006
Token wasn
Token ID2492
Feature activation-1.08e-01

Pos logit contributions
+0.058
s+0.056
unt+0.055
ha+0.055
 man+0.054
Neg logit contributions
e-0.092
ow-0.086
years-0.083
ric-0.079
eking-0.076
Token't
Token ID470
Feature activation-2.95e-02

Pos logit contributions
e+0.246
ow+0.226
years+0.212
ric+0.196
eking+0.195
Neg logit contributions
-0.160
unt-0.154
ha-0.143
s-0.141
 hill-0.138
Token paying
Token ID5989
Feature activation-1.43e-04

Pos logit contributions
e+0.064
ow+0.059
years+0.057
ric+0.054
 eyel+0.051
Neg logit contributions
-0.044
unt-0.042
ha-0.042
 man-0.041
 trail-0.040
Token attention
Token ID3241
Feature activation+7.16e-03

Pos logit contributions
e+0.000
ow+0.000
years+0.000
ric+0.000
 eyel+0.000
Neg logit contributions
-0.000
s-0.000
unt-0.000
ha-0.000
 man-0.000
Token to
Token ID284
Feature activation-5.03e-02

Pos logit contributions
+0.009
unt+0.009
ha+0.008
s+0.008
OK+0.008
Neg logit contributions
e-0.017
ow-0.016
years-0.015
ric-0.014
eking-0.014
Token where
Token ID810
Feature activation-2.31e-02

Pos logit contributions
e+0.093
years+0.092
ow+0.088
oken+0.076
ric+0.075
Neg logit contributions
-0.083
 man-0.081
unt-0.080
 hill-0.080
 trail-0.078
Token,
Token ID11
Feature activation+2.39e-02

Pos logit contributions
+0.033
unt+0.032
OK+0.030
ha+0.029
+0.029
Neg logit contributions
e-0.048
ow-0.045
years-0.040
eking-0.037
ric-0.037
Token but
Token ID475
Feature activation+1.12e-03

Pos logit contributions
+0.025
unt+0.025
 trail+0.023
 hill+0.023
 man+0.023
Neg logit contributions
e-0.062
ow-0.059
years-0.055
ric-0.053
eking-0.051
Token still
Token ID991
Feature activation+4.07e-03

Pos logit contributions
unt+0.001
+0.001
 man+0.001
ha+0.001
s+0.001
Neg logit contributions
e-0.003
ow-0.003
ric-0.002
years-0.002
eking-0.002
Token the
Token ID262
Feature activation+2.61e-02

Pos logit contributions
+0.006
 man+0.005
unt+0.005
s+0.005
ha+0.005
Neg logit contributions
e-0.009
ow-0.008
ric-0.008
years-0.008
 elbows-0.007
Token web
Token ID3992
Feature activation+2.79e-02

Pos logit contributions
+0.039
unt+0.037
ha+0.034
s+0.034
OK+0.033
Neg logit contributions
e-0.059
ow-0.055
years-0.051
ric-0.047
eking-0.047
Token wasn
Token ID2492
Feature activation-1.09e-01

Pos logit contributions
+0.047
unt+0.045
ha+0.041
OK+0.041
 hill+0.041
Neg logit contributions
e-0.059
ow-0.054
years-0.049
eking-0.045
ric-0.044
Token't
Token ID470
Feature activation-3.37e-02

Pos logit contributions
e+0.244
ow+0.225
years+0.210
ric+0.194
eking+0.192
Neg logit contributions
-0.162
unt-0.156
ha-0.145
s-0.143
OK-0.141
Token finished
Token ID5201
Feature activation-2.25e-02

Pos logit contributions
e+0.072
ow+0.066
years+0.063
ric+0.058
eking+0.055
Neg logit contributions
-0.053
unt-0.052
ha-0.049
 trail-0.048
 man-0.048
Token.
Token ID13
Feature activation-1.38e-03

Pos logit contributions
e+0.059
ow+0.055
years+0.053
ric+0.049
eking+0.047
Neg logit contributions
-0.024
unt-0.022
s-0.022
 trail-0.021
 hill-0.020
Token
Token ID198
Feature activation+3.14e-02

Pos logit contributions
e+0.003
ow+0.003
years+0.003
ric+0.003
 eyel+0.003
Neg logit contributions
-0.002
unt-0.002
ha-0.001
 trail-0.001
s-0.001
Token
Token ID198
Feature activation+4.07e-02

Pos logit contributions
+0.033
unt+0.031
ha+0.028
OK+0.028
 trail+0.027
Neg logit contributions
e-0.084
ow-0.079
years-0.074
ric-0.069
eking-0.068