TOP ACTIVATIONS
MAX = 4.234

. " Who 's there ?" she said in a loud
me pick out a costume ?" she asked .
you doing here ?" a voice said . It was a
voice coming from the diamond . Annie was startled . "
. Daddy put an arm around Jake and said
What are you searching for ?" An nie smiled
had to go to the bar ber . She said his
he asks in a loud voice . S ara
Hello ? Is someone there ?" Lu cy was
that said ' No entry '. But Annie was eager to
" Hello !" said Pen elope in a
Who goes there ?" a voice said from inside .
" What are you doing ?" It was her mum .
you like it ?" the bar ber asked .
Can I get an egg for a cake ?
Who are you ?" a voice says . The voice is
suddenly she heard a voice . It was a
did you find that handle ?" Clara looked up and saw
have to destroy my map ?" Jamie cried .
's mom had filled the chamber with balloons , stream ers

MIN ACTIVATIONS
MIN = -4.315

golf with his dad . They would go to the golf
on , Lily and her neighbor became good friends and played
ooters . They always wore their gear : helmets , pads
on , he wore his new sunglasses everywhere he went ,
!" So , Kitty and Dog gy took turns sliding with
golf with his dad . They would go to the big
the park to play with his jets . He threw them
." R ib bit said , " Would
" Yes !" So they skipped together along the path
with his dad . They would go to the golf course
her friends . They ran , jumped , and slid on
day on , Kitty and Dog gy loved to play the
with his dad . They would go to the big green
a bell . They ride their bikes on the road near
, too . They skipped and sang and laughed .
a cloth . They want their bikes to look nice .
her friends . They ran , jumped , and slid on
skip just like Tom . They both laughed and had lots
and boy swung together . They laughed and smiled and it
They share the ball and the bike . They take turns

INTERVAL 2.097 - 4.234
CONTAINS 0.407%

, she stumbled upon a cave . "
. The End . <|endoftext|> Once upon a
a kind man . He helped her pick up the heavy
the top of the mountain . He was glaring down at
" I 'm making a delicious mac aron i and cheese

INTERVAL -0.040 - 2.097
CONTAINS 52.785%

day , she went to a big stadium with her mom
, Lily always remembered the huge mushroom and the little mouse
, but then she had an idea . She picked up
hands , so she decided to sing with it .
When Mill ie got inside , she ran to her

INTERVAL -2.178 - -0.040
CONTAINS 46.304%

a river and through some trees . He was having so
to run and jump and chase each other . They saw
mom for another one , but her mom said she had
at them . She is friendly . Tom and
not happy . " L ily and Tom ,

INTERVAL -4.315 - -2.178
CONTAINS 0.504%

other and catch it . They laughed and had fun .
race . They raced and Zoom went very fast . He
Lily for her help . They became good friends and played
. The wind was still blowing all around the
were two friends , Mia and George . Mia was a
Token.
Token ID13
Feature activation+2.76e+00

Pos logit contributions
ore+0.120
year+0.119
 healer+0.118
 gen+0.115
 x+0.113
Neg logit contributions
 pads-0.098
burn-0.078
coat-0.077
 sunglasses-0.077
ety-0.074
Token "
Token ID366
Feature activation-2.15e-02

Pos logit contributions
 x+0.154
 gen+0.152
year+0.151
ore+0.150
le+0.149
Neg logit contributions
 pads-0.093
 pad-0.072
ety-0.067
 tails-0.060
 tail-0.056
TokenWho
Token ID8241
Feature activation+9.04e-01

Pos logit contributions
 pads+0.001
ety+0.001
burn+0.001
 tails+0.001
 Indians+0.001
Neg logit contributions
ore-0.001
 x-0.001
 gen-0.001
year-0.001
 goblin-0.001
Token's
Token ID338
Feature activation+9.66e-01

Pos logit contributions
ore+0.049
year+0.046
 gen+0.045
 x+0.045
 goblin+0.044
Neg logit contributions
 pads-0.044
ety-0.035
 Indians-0.031
 tails-0.031
 pad-0.031
Token there
Token ID612
Feature activation+7.24e-01

Pos logit contributions
ore+0.043
year+0.041
 x+0.038
 gen+0.037
 goblin+0.037
Neg logit contributions
 pads-0.052
ety-0.046
 Indians-0.044
 sunglasses-0.044
burn-0.043
Token?"
Token ID1701
Feature activation+4.15e+00

Pos logit contributions
ore+0.028
year+0.027
 gen+0.025
 x+0.025
 goblin+0.024
Neg logit contributions
 pads-0.045
ety-0.039
burn-0.036
 Indians-0.036
coat-0.036
Token she
Token ID673
Feature activation+1.12e+00

Pos logit contributions
year+0.251
 x+0.238
auc+0.235
ore+0.234
place+0.230
Neg logit contributions
 pads-0.103
 pad-0.067
 Cle-0.066
 sunglasses-0.065
 swung-0.063
Token said
Token ID531
Feature activation+1.79e+00

Pos logit contributions
ore+0.059
year+0.053
 x+0.050
 gen+0.049
 healer+0.049
Neg logit contributions
 pads-0.058
ety-0.047
 sunglasses-0.043
 tails-0.043
 Indians-0.043
Token in
Token ID287
Feature activation+1.20e+00

Pos logit contributions
ore+0.086
 x+0.085
year+0.082
 healer+0.080
 gen+0.080
Neg logit contributions
 pads-0.080
ety-0.064
coat-0.063
ked-0.063
burn-0.061
Token a
Token ID257
Feature activation+9.39e-01

Pos logit contributions
ore+0.061
year+0.052
 gen+0.051
 healer+0.049
 x+0.048
Neg logit contributions
 pads-0.060
 tails-0.050
 sunglasses-0.050
ety-0.046
 tail-0.045
Token loud
Token ID7812
Feature activation+1.43e+00

Pos logit contributions
ore+0.038
year+0.033
 x+0.031
 gen+0.031
 healer+0.029
Neg logit contributions
 pads-0.058
 tails-0.049
 pad-0.047
ety-0.047
 sunglasses-0.047
Token me
Token ID502
Feature activation+2.01e+00

Pos logit contributions
year+0.066
ore+0.065
 gen+0.060
 goblin+0.058
 x+0.056
Neg logit contributions
 pads-0.052
ety-0.046
 sunglasses-0.041
burn-0.040
 Indians-0.040
Token pick
Token ID2298
Feature activation+1.04e+00

Pos logit contributions
year+0.094
ore+0.090
 gen+0.084
 goblin+0.082
abel+0.077
Neg logit contributions
 pads-0.104
ety-0.092
burn-0.090
 sunglasses-0.086
 Indians-0.085
Token out
Token ID503
Feature activation+1.48e+00

Pos logit contributions
ore+0.046
year+0.044
 gen+0.043
 x+0.041
 operating+0.040
Neg logit contributions
 pads-0.060
ety-0.050
 sunglasses-0.048
burn-0.046
 Indians-0.045
Token a
Token ID257
Feature activation+7.73e-01

Pos logit contributions
ore+0.070
year+0.063
 gen+0.063
 x+0.061
 goblin+0.061
Neg logit contributions
 pads-0.081
ety-0.066
 sunglasses-0.066
 tails-0.063
 sunscreen-0.062
Token costume
Token ID16569
Feature activation+2.40e+00

Pos logit contributions
ore+0.036
year+0.034
 x+0.031
 gen+0.031
 operating+0.030
Neg logit contributions
 pads-0.043
ety-0.036
 Indians-0.034
coat-0.034
 sunglasses-0.034
Token?"
Token ID1701
Feature activation+4.05e+00

Pos logit contributions
year+0.103
ore+0.101
 gen+0.092
 x+0.089
 goblin+0.088
Neg logit contributions
 pads-0.134
ety-0.111
coat-0.111
 sunglasses-0.109
 pad-0.107
Token she
Token ID673
Feature activation+1.31e+00

Pos logit contributions
year+0.239
 passage+0.212
 site+0.203
auc+0.201
ore+0.201
Neg logit contributions
 pads-0.104
 Cle-0.096
ety-0.094
burn-0.086
 Indians-0.086
Token asked
Token ID1965
Feature activation+1.69e+00

Pos logit contributions
ore+0.072
year+0.062
 x+0.059
 gen+0.059
 operating+0.058
Neg logit contributions
 pads-0.065
ety-0.055
 sunglasses-0.050
 tails-0.049
 pad-0.049
Token.
Token ID13
Feature activation+1.97e+00

Pos logit contributions
ore+0.081
 gen+0.076
year+0.074
 x+0.072
 goblin+0.071
Neg logit contributions
 pads-0.083
ety-0.071
 tails-0.064
coat-0.063
 pad-0.063
Token
Token ID198
Feature activation+1.27e+00

Pos logit contributions
year+0.102
ore+0.098
 x+0.097
 gen+0.097
 goblin+0.092
Neg logit contributions
 pads-0.083
ety-0.068
 Indians-0.064
 pad-0.062
burn-0.061
Token
Token ID198
Feature activation+1.49e+00

Pos logit contributions
year+0.061
ore+0.060
 gen+0.059
 goblin+0.058
 x+0.057
Neg logit contributions
 pads-0.059
ety-0.050
 Indians-0.046
coat-0.046
burn-0.046
Token you
Token ID345
Feature activation-3.63e-01

Pos logit contributions
ore+0.023
year+0.021
 gen+0.020
 x+0.020
 goblin+0.020
Neg logit contributions
 pads-0.026
ety-0.022
 Indians-0.021
 sunglasses-0.021
 tails-0.020
Token doing
Token ID1804
Feature activation+1.53e+00

Pos logit contributions
 pads+0.019
ety+0.016
 tails+0.014
coat+0.014
burn+0.014
Neg logit contributions
ore-0.019
 x-0.017
 gen-0.017
year-0.017
 goblin-0.016
Token here
Token ID994
Feature activation+1.29e+00

Pos logit contributions
year+0.068
ore+0.066
 gen+0.059
 goblin+0.056
 x+0.056
Neg logit contributions
 pads-0.083
ety-0.069
 sunglasses-0.069
coat-0.067
burn-0.067
Token?"
Token ID1701
Feature activation+2.65e+00

Pos logit contributions
year+0.053
ore+0.053
 gen+0.048
 x+0.046
 goblin+0.045
Neg logit contributions
 pads-0.076
ety-0.065
coat-0.061
 sunglasses-0.060
burn-0.060
Token a
Token ID257
Feature activation+3.40e+00

Pos logit contributions
year+0.154
ore+0.151
fire+0.140
 ingredient+0.138
 x+0.134
Neg logit contributions
 pads-0.089
ety-0.084
 Cle-0.073
 sunglasses-0.072
 dads-0.069
Token voice
Token ID3809
Feature activation+4.01e+00

Pos logit contributions
ore+0.147
year+0.141
 ingredient+0.125
 operating+0.124
abel+0.122
Neg logit contributions
 dads-0.143
ety-0.129
 pads-0.123
 Indians-0.121
 sunglasses-0.120
Token said
Token ID531
Feature activation+1.94e+00

Pos logit contributions
ore+0.218
 healer+0.211
 goblin+0.202
year+0.202
fire+0.201
Neg logit contributions
 pads-0.111
ked-0.101
 sunglasses-0.100
ety-0.100
 swung-0.096
Token.
Token ID13
Feature activation+1.86e+00

Pos logit contributions
ore+0.092
year+0.089
 healer+0.085
 x+0.085
 operating+0.084
Neg logit contributions
 pads-0.087
ety-0.075
ked-0.074
 sunglasses-0.069
burn-0.067
Token It
Token ID632
Feature activation+1.39e+00

Pos logit contributions
ore+0.097
year+0.095
 ingredient+0.091
 healer+0.090
 x+0.089
Neg logit contributions
 pads-0.079
ety-0.061
 pad-0.057
 sunglasses-0.057
 tails-0.057
Token was
Token ID373
Feature activation+7.60e-01

Pos logit contributions
ore+0.070
year+0.064
 x+0.059
 healer+0.059
old+0.056
Neg logit contributions
 pads-0.073
ety-0.059
 tails-0.057
 swung-0.056
 sunglasses-0.056
Token a
Token ID257
Feature activation+5.01e-01

Pos logit contributions
ore+0.038
year+0.033
 gen+0.031
 x+0.031
 healer+0.031
Neg logit contributions
 pads-0.040
ety-0.034
 tails-0.032
 sunglasses-0.031
burn-0.031
Token voice
Token ID3809
Feature activation+1.98e+00

Pos logit contributions
ore+0.128
year+0.127
handed+0.110
 ingredient+0.109
 x+0.109
Neg logit contributions
 pads-0.091
ety-0.086
 pad-0.080
 sunglasses-0.078
 tails-0.075
Token coming
Token ID2406
Feature activation+9.60e-01

Pos logit contributions
year+0.098
ore+0.094
 goblin+0.090
 healer+0.089
 x+0.087
Neg logit contributions
 pads-0.086
ety-0.074
burn-0.072
coat-0.069
 sunglasses-0.068
Token from
Token ID422
Feature activation+1.42e+00

Pos logit contributions
ore+0.033
 x+0.030
year+0.030
 gen+0.030
 goblin+0.029
Neg logit contributions
 pads-0.060
ety-0.055
burn-0.051
 tails-0.050
 pad-0.049
Token the
Token ID262
Feature activation+6.33e-01

Pos logit contributions
ore+0.067
 gen+0.063
year+0.063
abel+0.059
 x+0.059
Neg logit contributions
 pads-0.068
ety-0.061
 sunglasses-0.058
 tails-0.056
bur-0.056
Token diamond
Token ID15291
Feature activation+3.94e-01

Pos logit contributions
ore+0.030
year+0.028
 x+0.027
 goblin+0.025
 gen+0.025
Neg logit contributions
 pads-0.034
ety-0.029
 sunglasses-0.027
 pad-0.026
 Indians-0.026
Token.
Token ID13
Feature activation+3.96e+00

Pos logit contributions
ore+0.016
year+0.016
 x+0.015
 gen+0.015
 goblin+0.014
Neg logit contributions
 pads-0.024
ety-0.020
 pad-0.019
burn-0.019
 tails-0.019
Token Annie
Token ID25860
Feature activation+1.87e+00

Pos logit contributions
 x+0.203
ore+0.190
handed+0.189
 healer+0.182
 potion+0.181
Neg logit contributions
 pads-0.136
 pad-0.108
 tails-0.107
 wo-0.106
ety-0.105
Token was
Token ID373
Feature activation+7.95e-01

Pos logit contributions
ore+0.102
year+0.099
 x+0.098
 gen+0.092
 operating+0.091
Neg logit contributions
 pads-0.082
ety-0.070
 pad-0.063
 swung-0.063
 tails-0.061
Token startled
Token ID36249
Feature activation+9.36e-01

Pos logit contributions
ore+0.034
year+0.030
 gen+0.030
 x+0.029
 goblin+0.028
Neg logit contributions
 pads-0.046
ety-0.039
burn-0.037
 tails-0.037
coat-0.036
Token.
Token ID13
Feature activation+2.45e+00

Pos logit contributions
ore+0.048
 gen+0.046
year+0.045
 x+0.045
 healer+0.042
Neg logit contributions
 pads-0.045
ety-0.041
 tails-0.036
 sunglasses-0.035
 pad-0.034
Token "
Token ID366
Feature activation-1.75e-01

Pos logit contributions
 gen+0.138
year+0.138
ore+0.138
 x+0.138
 goblin+0.131
Neg logit contributions
 pads-0.086
ety-0.068
 pad-0.063
 tails-0.059
coat-0.057
Token.
Token ID13
Feature activation+1.01e+00

Pos logit contributions
ore+0.053
year+0.051
 gen+0.048
 x+0.047
 operating+0.043
Neg logit contributions
 pads-0.061
ety-0.048
 tails-0.047
 pad-0.047
coat-0.047
Token
Token ID198
Feature activation+1.42e+00

Pos logit contributions
ore+0.049
 gen+0.047
year+0.045
 x+0.045
 healer+0.045
Neg logit contributions
 pads-0.053
ety-0.042
 Indians-0.039
 pad-0.039
ibi-0.039
Token
Token ID198
Feature activation+9.57e-01

Pos logit contributions
 gen+0.070
ore+0.069
year+0.067
 x+0.067
 goblin+0.067
Neg logit contributions
 pads-0.068
ety-0.055
ibi-0.052
 Indians-0.049
 pad-0.049
TokenDaddy
Token ID48280
Feature activation+1.12e+00

Pos logit contributions
ore+0.051
 gen+0.047
 x+0.045
 healer+0.045
 goblin+0.044
Neg logit contributions
 pads-0.048
ety-0.036
 Indians-0.035
 tails-0.035
 pad-0.034
Token put
Token ID1234
Feature activation+1.06e+00

Pos logit contributions
ore+0.061
year+0.054
 x+0.054
 gen+0.052
 operating+0.050
Neg logit contributions
 pads-0.054
ety-0.047
 Indians-0.041
burn-0.041
 pad-0.041
Token an
Token ID281
Feature activation+3.95e+00

Pos logit contributions
ore+0.049
 gen+0.047
 healer+0.045
year+0.044
 operating+0.044
Neg logit contributions
 pads-0.057
ety-0.046
burn-0.043
 sunglasses-0.043
 tails-0.043
Token arm
Token ID3211
Feature activation+1.27e+00

Pos logit contributions
ore+0.130
fire+0.106
year+0.096
arium+0.091
ground+0.090
Neg logit contributions
 pads-0.240
 sunglasses-0.230
coat-0.216
 pad-0.212
 boots-0.210
Token around
Token ID1088
Feature activation+1.10e+00

Pos logit contributions
ore+0.060
 gen+0.058
year+0.057
 goblin+0.056
 x+0.055
Neg logit contributions
 pads-0.070
coat-0.054
ety-0.053
 pad-0.051
burn-0.051
Token Jake
Token ID14757
Feature activation+1.17e-01

Pos logit contributions
ore+0.058
 gen+0.054
 healer+0.053
year+0.053
 x+0.052
Neg logit contributions
 pads-0.050
 tails-0.040
ety-0.039
 Indians-0.037
 pad-0.036
Token and
Token ID290
Feature activation+1.43e+00

Pos logit contributions
ore+0.005
year+0.005
 gen+0.005
 x+0.005
 goblin+0.004
Neg logit contributions
 pads-0.007
ety-0.006
 tails-0.005
coat-0.005
 pad-0.005
Token said
Token ID531
Feature activation+7.02e-01

Pos logit contributions
ore+0.080
year+0.070
 x+0.067
 gen+0.065
 healer+0.064
Neg logit contributions
 pads-0.070
ety-0.055
 Indians-0.054
 sunglasses-0.053
 swung-0.053
Token What
Token ID1867
Feature activation+9.46e-01

Pos logit contributions
ore+0.050
 x+0.047
 gen+0.047
year+0.045
 goblin+0.044
Neg logit contributions
 pads-0.057
ety-0.048
 Indians-0.045
 pad-0.043
 tails-0.042
Token are
Token ID389
Feature activation+4.38e-01

Pos logit contributions
ore+0.059
year+0.056
 gen+0.054
 x+0.053
 goblin+0.052
Neg logit contributions
 pads-0.039
ety-0.029
 Indians-0.026
 tails-0.025
 sunglasses-0.025
Token you
Token ID345
Feature activation+5.02e-02

Pos logit contributions
ore+0.020
 gen+0.018
 x+0.018
year+0.018
 goblin+0.017
Neg logit contributions
 pads-0.025
ety-0.021
 Indians-0.020
 tails-0.019
 sunglasses-0.019
Token searching
Token ID10342
Feature activation+9.55e-01

Pos logit contributions
ore+0.003
 x+0.003
year+0.003
 gen+0.003
 goblin+0.002
Neg logit contributions
 pads-0.002
ety-0.002
 Indians-0.002
 tails-0.002
burn-0.002
Token for
Token ID329
Feature activation+2.01e+00

Pos logit contributions
ore+0.045
 gen+0.043
 x+0.042
year+0.042
 goblin+0.039
Neg logit contributions
 pads-0.051
ety-0.043
 sunglasses-0.040
 pad-0.039
 tails-0.039
Token?"
Token ID1701
Feature activation+3.93e+00

Pos logit contributions
year+0.091
ore+0.089
 gen+0.087
 goblin+0.083
 x+0.081
Neg logit contributions
 pads-0.104
ety-0.085
 sunglasses-0.084
 tails-0.080
 Indians-0.080
Token
Token ID198
Feature activation+3.02e+00

Pos logit contributions
 x+0.248
year+0.239
 potion+0.231
ore+0.231
place+0.229
Neg logit contributions
 pads-0.079
 pad-0.053
ety-0.050
 -0.044
 Indians-0.033
Token
Token ID198
Feature activation+2.86e+00

Pos logit contributions
 x+0.180
 goblin+0.175
ore+0.174
year+0.174
 ingredient+0.171
Neg logit contributions
 pads-0.068
ety-0.051
 pad-0.042
 Indians-0.042
ibi-0.038
TokenAn
Token ID2025
Feature activation+2.62e+00

Pos logit contributions
 x+0.132
ore+0.130
 potion+0.114
 goblin+0.113
 ingredient+0.110
Neg logit contributions
 pads-0.111
ety-0.094
 Indians-0.093
 pad-0.087
 tails-0.086
Tokennie
Token ID11952
Feature activation+1.88e+00

Pos logit contributions
ore+0.148
 gen+0.136
 operating+0.126
fire+0.125
 x+0.125
Neg logit contributions
 pads-0.106
 pad-0.088
ety-0.084
 tails-0.079
lets-0.073
Token smiled
Token ID13541
Feature activation+3.22e-01

Pos logit contributions
ore+0.101
 x+0.095
year+0.093
 operating+0.088
 gen+0.087
Neg logit contributions
 pads-0.088
ety-0.072
 pad-0.069
 tails-0.067
 swung-0.066
Token had
Token ID550
Feature activation-4.57e-01

Pos logit contributions
ore+0.017
 x+0.016
 gen+0.016
year+0.016
 goblin+0.015
Neg logit contributions
 pads-0.015
ety-0.012
coat-0.011
 tails-0.011
burn-0.011
Token to
Token ID284
Feature activation+9.33e-01

Pos logit contributions
 pads+0.027
ety+0.023
 tails+0.021
burn+0.021
coat+0.021
Neg logit contributions
ore-0.020
 x-0.019
 gen-0.018
year-0.018
 goblin-0.017
Token go
Token ID467
Feature activation+1.47e+00

Pos logit contributions
ore+0.052
year+0.049
 gen+0.045
 goblin+0.044
 x+0.043
Neg logit contributions
 pads-0.044
ety-0.036
burn-0.034
 sunglasses-0.034
coat-0.034
Token to
Token ID284
Feature activation+1.82e+00

Pos logit contributions
year+0.088
 gen+0.088
ore+0.085
 goblin+0.083
abel+0.079
Neg logit contributions
 pads-0.059
ety-0.047
coat-0.045
burn-0.044
bur-0.040
Token the
Token ID262
Feature activation+1.98e+00

Pos logit contributions
year+0.103
ore+0.098
 gen+0.091
 goblin+0.085
le+0.084
Neg logit contributions
 pads-0.076
ety-0.066
 Indians-0.064
 sunscreen-0.064
burn-0.063
Token bar
Token ID2318
Feature activation+3.88e+00

Pos logit contributions
ore+0.098
year+0.097
le+0.086
 gen+0.085
ment+0.081
Neg logit contributions
 pads-0.094
coat-0.077
ety-0.077
 sunglasses-0.076
 sunscreen-0.076
Tokenber
Token ID527
Feature activation+1.95e+00

Pos logit contributions
 potion+0.199
 x+0.193
 gen+0.193
 goblin+0.183
year+0.178
Neg logit contributions
 pads-0.148
ety-0.122
 pad-0.117
 dads-0.117
coat-0.105
Token.
Token ID13
Feature activation+6.86e-01

Pos logit contributions
ore+0.086
year+0.086
 gen+0.085
 goblin+0.078
 x+0.077
Neg logit contributions
 pads-0.113
 pad-0.090
ety-0.090
coat-0.085
burn-0.080
Token She
Token ID1375
Feature activation+1.71e+00

Pos logit contributions
ore+0.039
year+0.037
 gen+0.037
 x+0.035
 goblin+0.034
Neg logit contributions
 pads-0.031
ety-0.025
 tails-0.023
coat-0.023
ibi-0.022
Token said
Token ID531
Feature activation+6.62e-01

Pos logit contributions
ore+0.104
year+0.087
 x+0.085
 goblin+0.085
fire+0.085
Neg logit contributions
 pads-0.075
ety-0.064
 swung-0.060
coat-0.056
 sunglasses-0.055
Token his
Token ID465
Feature activation+5.19e-01

Pos logit contributions
ore+0.030
year+0.028
 x+0.027
 gen+0.026
 goblin+0.026
Neg logit contributions
 pads-0.037
ety-0.031
 Indians-0.029
 tails-0.029
 sunglasses-0.029
Token he
Token ID339
Feature activation+1.43e+00

Pos logit contributions
year+0.177
ore+0.177
fire+0.169
 ingredient+0.163
le+0.163
Neg logit contributions
 pads-0.075
 dads-0.068
ety-0.067
 sunscreen-0.066
 sunglasses-0.064
Token asks
Token ID7893
Feature activation+2.19e+00

Pos logit contributions
ore+0.073
year+0.071
fire+0.063
 gen+0.062
 ingredient+0.060
Neg logit contributions
 pads-0.076
ety-0.061
 tails-0.061
 sunglasses-0.059
 Indians-0.057
Token in
Token ID287
Feature activation+1.74e+00

Pos logit contributions
ore+0.117
year+0.110
 gen+0.110
fire+0.109
 x+0.109
Neg logit contributions
 pads-0.079
ety-0.074
 dads-0.065
 sunglasses-0.060
 tails-0.059
Token a
Token ID257
Feature activation+2.23e+00

Pos logit contributions
ore+0.092
year+0.088
 gen+0.084
le+0.081
 healer+0.081
Neg logit contributions
 pads-0.065
 sunglasses-0.057
 tails-0.056
 dads-0.056
 sunscreen-0.054
Token loud
Token ID7812
Feature activation+2.29e+00

Pos logit contributions
year+0.098
ore+0.098
abel+0.084
le+0.082
 gen+0.080
Neg logit contributions
 pads-0.110
 tails-0.096
 dads-0.093
 sunglasses-0.093
 pad-0.090
Token voice
Token ID3809
Feature activation+3.83e+00

Pos logit contributions
year+0.090
ore+0.082
 healer+0.074
 operating+0.068
abel+0.067
Neg logit contributions
 pads-0.125
 tails-0.115
 sunglasses-0.109
ety-0.109
coat-0.109
Token.
Token ID13
Feature activation+2.23e+00

Pos logit contributions
 healer+0.221
 goblin+0.219
year+0.218
body+0.211
ore+0.209
Neg logit contributions
 pads-0.104
ety-0.085
 sunglasses-0.069
 dads-0.068
burn-0.067
Token
Token ID198
Feature activation+1.78e+00

Pos logit contributions
le+0.121
year+0.120
 x+0.118
ore+0.118
 ingredient+0.117
Neg logit contributions
 pads-0.078
ety-0.060
 Indians-0.056
 pad-0.054
ibi-0.052
Token
Token ID198
Feature activation+1.70e+00

Pos logit contributions
year+0.094
 goblin+0.091
ore+0.090
 x+0.088
 healer+0.088
Neg logit contributions
 pads-0.071
ety-0.062
ibi-0.055
 dads-0.051
 Indians-0.051
TokenS
Token ID50
Feature activation+9.44e-01

Pos logit contributions
ore+0.097
 x+0.096
year+0.092
 ingredient+0.090
 gen+0.089
Neg logit contributions
 pads-0.060
ety-0.052
 dads-0.046
 Indians-0.046
 sunscreen-0.043
Tokenara
Token ID3301
Feature activation+7.72e-01

Pos logit contributions
year+0.056
 gen+0.054
ore+0.053
 goblin+0.052
 x+0.051
Neg logit contributions
 pads-0.039
ety-0.034
burn-0.029
coat-0.029
 tails-0.027
TokenHello
Token ID15496
Feature activation+4.34e-01

Pos logit contributions
 pads+0.009
ety+0.008
 tails+0.007
burn+0.007
 Indians+0.007
Neg logit contributions
ore-0.009
 x-0.008
 gen-0.008
year-0.008
 goblin-0.007
Token?
Token ID30
Feature activation+1.37e+00

Pos logit contributions
ore+0.019
year+0.017
 x+0.017
 gen+0.016
 goblin+0.016
Neg logit contributions
 pads-0.026
ety-0.022
burn-0.020
 tails-0.020
coat-0.020
Token Is
Token ID1148
Feature activation-1.99e-01

Pos logit contributions
ore+0.057
 x+0.053
year+0.052
 gen+0.052
 healer+0.050
Neg logit contributions
 pads-0.077
ety-0.067
 Indians-0.062
coat-0.062
 pad-0.061
Token someone
Token ID2130
Feature activation+6.86e-01

Pos logit contributions
 pads+0.013
ety+0.011
 tails+0.010
burn+0.010
 Indians+0.010
Neg logit contributions
ore-0.008
 x-0.007
 gen-0.007
year-0.007
 goblin-0.007
Token there
Token ID612
Feature activation+7.40e-02

Pos logit contributions
ore+0.031
year+0.028
 gen+0.028
 x+0.028
 goblin+0.027
Neg logit contributions
 pads-0.040
ety-0.034
coat-0.031
 Indians-0.031
burn-0.030
Token?"
Token ID1701
Feature activation+3.81e+00

Pos logit contributions
ore+0.003
year+0.002
 x+0.002
 gen+0.002
 goblin+0.002
Neg logit contributions
 pads-0.005
ety-0.004
burn-0.004
 tails-0.004
 Indians-0.004
Token
Token ID198
Feature activation+2.15e+00

Pos logit contributions
year+0.224
ore+0.213
 goblin+0.209
 x+0.203
 ingredient+0.203
Neg logit contributions
 pads-0.099
ety-0.085
 Cle-0.083
 swung-0.082
 sunglasses-0.075
Token
Token ID198
Feature activation+1.94e+00

Pos logit contributions
ore+0.117
 goblin+0.115
year+0.114
 x+0.111
 ingredient+0.110
Neg logit contributions
 pads-0.080
ety-0.066
coat-0.060
burn-0.056
 Indians-0.055
TokenLu
Token ID25596
Feature activation+8.54e-01

Pos logit contributions
ore+0.099
 goblin+0.091
 x+0.091
year+0.088
 ingredient+0.086
Neg logit contributions
 pads-0.084
ety-0.072
coat-0.069
 Indians-0.064
 swung-0.063
Tokency
Token ID948
Feature activation+1.36e+00

Pos logit contributions
ore+0.051
 x+0.050
 gen+0.050
year+0.048
 goblin+0.048
Neg logit contributions
 pads-0.035
ety-0.030
coat-0.025
burn-0.025
 pad-0.024
Token was
Token ID373
Feature activation+1.18e+00

Pos logit contributions
ore+0.070
year+0.063
 x+0.063
 gen+0.060
 healer+0.059
Neg logit contributions
 pads-0.069
ety-0.058
 tails-0.054
 pad-0.053
 swung-0.051
Token that
Token ID326
Feature activation+1.05e+00

Pos logit contributions
ore+0.047
year+0.044
 x+0.042
 gen+0.041
 goblin+0.041
Neg logit contributions
 pads-0.077
ety-0.063
coat-0.061
 sunglasses-0.061
 pad-0.060
Token said
Token ID531
Feature activation+3.05e-01

Pos logit contributions
ore+0.052
 x+0.048
year+0.047
 gen+0.046
 goblin+0.045
Neg logit contributions
 pads-0.056
ety-0.045
 sunglasses-0.041
coat-0.041
 tails-0.041
Token '
Token ID705
Feature activation+4.23e-01

Pos logit contributions
ore+0.016
 x+0.015
year+0.015
 gen+0.014
 goblin+0.014
Neg logit contributions
 pads-0.015
ety-0.012
 tails-0.011
coat-0.011
 Indians-0.011
TokenNo
Token ID2949
Feature activation+6.19e-01

Pos logit contributions
ore+0.020
 x+0.018
 gen+0.018
year+0.018
 goblin+0.017
Neg logit contributions
 pads-0.024
ety-0.020
burn-0.018
coat-0.018
 Indians-0.018
Token entry
Token ID5726
Feature activation+1.64e+00

Pos logit contributions
ore+0.027
year+0.025
 gen+0.024
 x+0.024
 goblin+0.022
Neg logit contributions
 pads-0.036
ety-0.030
 tails-0.029
coat-0.028
burn-0.028
Token'.
Token ID4458
Feature activation+3.77e+00

Pos logit contributions
ore+0.063
 gen+0.061
 x+0.060
 goblin+0.057
year+0.057
Neg logit contributions
 pads-0.102
ety-0.083
 pad-0.080
coat-0.078
 tails-0.077
Token But
Token ID887
Feature activation+2.50e+00

Pos logit contributions
ore+0.171
 x+0.163
year+0.162
 potion+0.157
 healer+0.154
Neg logit contributions
 pads-0.140
 tails-0.124
 wrists-0.120
 pad-0.117
're-0.113
Token Annie
Token ID25860
Feature activation+1.33e+00

Pos logit contributions
ore+0.133
year+0.124
 x+0.123
looking+0.119
old+0.117
Neg logit contributions
 pads-0.098
 tails-0.083
ety-0.078
 pad-0.077
 sunglasses-0.077
Token was
Token ID373
Feature activation+1.21e+00

Pos logit contributions
ore+0.070
 x+0.066
year+0.066
 gen+0.064
 goblin+0.061
Neg logit contributions
 pads-0.066
ety-0.053
 pad-0.050
 tails-0.048
coat-0.047
Token eager
Token ID11069
Feature activation-2.13e-01

Pos logit contributions
ore+0.064
 gen+0.055
 x+0.054
oc+0.054
year+0.053
Neg logit contributions
 pads-0.061
ety-0.048
 pad-0.048
 sunglasses-0.046
coat-0.045
Token to
Token ID284
Feature activation+6.50e-01

Pos logit contributions
 pads+0.012
ety+0.010
 tails+0.009
burn+0.009
 Indians+0.009
Neg logit contributions
ore-0.010
 x-0.009
 gen-0.009
year-0.009
 goblin-0.008
Token
Token ID220
Feature activation+9.54e-02

Pos logit contributions
 x+0.091
year+0.091
ore+0.088
 gen+0.088
 goblin+0.086
Neg logit contributions
 pads-0.087
ety-0.067
 tails-0.066
 pad-0.062
coat-0.060
Token
Token ID198
Feature activation+1.49e+00

Pos logit contributions
ore+0.004
 x+0.004
 gen+0.004
year+0.004
 goblin+0.003
Neg logit contributions
 pads-0.006
ety-0.005
 tails-0.005
coat-0.005
burn-0.005
Token
Token ID198
Feature activation+1.56e+00

Pos logit contributions
 x+0.075
ore+0.074
 gen+0.073
year+0.072
 goblin+0.071
Neg logit contributions
 pads-0.073
ety-0.058
coat-0.052
 pad-0.052
 tails-0.052
Token"
Token ID1
Feature activation-1.44e+00

Pos logit contributions
ore+0.085
 x+0.081
 gen+0.080
 goblin+0.078
year+0.078
Neg logit contributions
 pads-0.072
 tails-0.055
ety-0.054
coat-0.051
 Indians-0.051
TokenHello
Token ID15496
Feature activation+6.18e-01

Pos logit contributions
 pads+0.082
ety+0.063
 tails+0.062
coat+0.060
 Indians+0.060
Neg logit contributions
 gen-0.066
ore-0.065
year-0.065
 x-0.060
fire-0.057
Token!"
Token ID2474
Feature activation+3.77e+00

Pos logit contributions
ore+0.029
year+0.028
 x+0.027
 gen+0.026
 goblin+0.025
Neg logit contributions
 pads-0.033
ety-0.029
burn-0.026
 tails-0.026
 pad-0.025
Token said
Token ID531
Feature activation+2.35e+00

Pos logit contributions
year+0.215
ore+0.201
 x+0.197
auc+0.191
 ingredient+0.188
Neg logit contributions
 pads-0.136
 tails-0.119
 swung-0.108
ety-0.100
 ho-0.098
Token Pen
Token ID7507
Feature activation+5.77e-01

Pos logit contributions
ore+0.122
year+0.122
 x+0.113
 goblin+0.109
auc+0.109
Neg logit contributions
 pads-0.096
 tails-0.083
ety-0.082
burn-0.080
 tail-0.072
Tokenelope
Token ID47329
Feature activation+8.42e-01

Pos logit contributions
ore+0.022
 x+0.020
year+0.020
 gen+0.019
 operating+0.018
Neg logit contributions
 pads-0.038
ety-0.033
 tails-0.030
 pad-0.030
burn-0.030
Token in
Token ID287
Feature activation+8.38e-01

Pos logit contributions
ore+0.037
 x+0.034
year+0.034
 gen+0.033
 goblin+0.031
Neg logit contributions
 pads-0.050
ety-0.042
 tails-0.039
 pad-0.038
burn-0.038
Token a
Token ID257
Feature activation+3.39e-01

Pos logit contributions
ore+0.039
year+0.035
 gen+0.033
 x+0.033
 operating+0.031
Neg logit contributions
 pads-0.047
 tails-0.039
 sunglasses-0.038
ety-0.037
 Indians-0.036
TokenWho
Token ID8241
Feature activation+9.71e-01

Pos logit contributions
ore+0.015
year+0.015
 gen+0.014
 x+0.014
 goblin+0.013
Neg logit contributions
 pads-0.013
ety-0.010
 tails-0.009
 Indians-0.009
burn-0.009
Token goes
Token ID2925
Feature activation+9.73e-01

Pos logit contributions
ore+0.048
year+0.045
 x+0.044
 gen+0.044
 goblin+0.042
Neg logit contributions
 pads-0.052
ety-0.042
 tails-0.039
 Indians-0.038
 sunglasses-0.038
Token there
Token ID612
Feature activation+3.35e-01

Pos logit contributions
ore+0.042
year+0.042
 gen+0.040
 x+0.039
 goblin+0.037
Neg logit contributions
 pads-0.055
ety-0.046
 sunglasses-0.043
burn-0.043
 Indians-0.043
Token?"
Token ID1701
Feature activation+9.68e-01

Pos logit contributions
ore+0.013
 x+0.012
year+0.012
 gen+0.012
 goblin+0.011
Neg logit contributions
 pads-0.021
ety-0.018
 tails-0.017
coat-0.017
 Indians-0.017
Token a
Token ID257
Feature activation+2.58e+00

Pos logit contributions
ore+0.047
 x+0.045
year+0.044
 gen+0.044
 goblin+0.041
Neg logit contributions
 pads-0.051
ety-0.042
 tails-0.039
 sunglasses-0.038
 Indians-0.038
Token voice
Token ID3809
Feature activation+3.75e+00

Pos logit contributions
ore+0.111
year+0.101
 x+0.093
 operating+0.092
 ingredient+0.088
Neg logit contributions
 pads-0.116
 dads-0.114
ety-0.107
 tails-0.106
 Indians-0.104
Token said
Token ID531
Feature activation+1.44e+00

Pos logit contributions
ore+0.182
 healer+0.171
handed+0.170
 goblin+0.166
 x+0.166
Neg logit contributions
 pads-0.119
ked-0.114
 swung-0.111
 sunglasses-0.109
ety-0.108
Token from
Token ID422
Feature activation+2.01e+00

Pos logit contributions
ore+0.067
 x+0.065
year+0.063
 operating+0.061
 healer+0.060
Neg logit contributions
 pads-0.069
ety-0.058
coat-0.054
 pad-0.054
burn-0.054
Token inside
Token ID2641
Feature activation+1.67e+00

Pos logit contributions
ore+0.099
year+0.087
 gen+0.084
abel+0.083
 x+0.082
Neg logit contributions
 pads-0.088
 tails-0.079
 sunglasses-0.078
ety-0.078
 pad-0.071
Token.
Token ID13
Feature activation+1.13e+00

Pos logit contributions
ore+0.081
 x+0.079
year+0.078
 gen+0.075
 healer+0.074
Neg logit contributions
 pads-0.075
ety-0.064
 tails-0.061
coat-0.060
 sunglasses-0.059
Token
Token ID198
Feature activation+1.86e+00

Pos logit contributions
ore+0.055
year+0.051
 x+0.051
 gen+0.049
 healer+0.047
Neg logit contributions
 pads-0.059
ety-0.049
 pad-0.044
 Indians-0.044
 tails-0.044
Token "
Token ID366
Feature activation+7.28e-01

Pos logit contributions
ore+0.161
 healer+0.160
 x+0.154
handed+0.152
body+0.149
Neg logit contributions
 pads-0.084
ety-0.060
 pad-0.057
coat-0.056
 tails-0.056
TokenWhat
Token ID2061
Feature activation+1.07e-01

Pos logit contributions
ore+0.036
 x+0.033
 gen+0.032
 healer+0.031
 goblin+0.031
Neg logit contributions
 pads-0.038
ety-0.033
burn-0.029
coat-0.029
 Indians-0.029
Token are
Token ID389
Feature activation-1.34e-01

Pos logit contributions
ore+0.006
year+0.005
 gen+0.005
 x+0.005
 goblin+0.005
Neg logit contributions
 pads-0.006
ety-0.004
 tails-0.004
 Indians-0.004
burn-0.004
Token you
Token ID345
Feature activation-6.11e-01

Pos logit contributions
 pads+0.008
ety+0.007
 tails+0.007
burn+0.007
 Indians+0.007
Neg logit contributions
ore-0.005
 x-0.005
 gen-0.005
year-0.005
 goblin-0.005
Token doing
Token ID1804
Feature activation+1.45e+00

Pos logit contributions
 pads+0.033
ety+0.027
 tails+0.025
coat+0.025
burn+0.024
Neg logit contributions
ore-0.031
 x-0.029
 gen-0.028
year-0.028
 goblin-0.027
Token?"
Token ID1701
Feature activation+3.74e+00

Pos logit contributions
year+0.066
ore+0.062
 gen+0.058
abel+0.055
 goblin+0.054
Neg logit contributions
 pads-0.074
 sunglasses-0.064
ety-0.063
burn-0.062
coat-0.062
Token It
Token ID632
Feature activation+6.26e-01

Pos logit contributions
 healer+0.206
 x+0.203
ore+0.202
year+0.201
handed+0.197
Neg logit contributions
 pads-0.107
 swung-0.081
ety-0.080
 played-0.077
 Sam-0.073
Token was
Token ID373
Feature activation+9.55e-01

Pos logit contributions
ore+0.026
year+0.025
 x+0.023
 gen+0.022
 healer+0.021
Neg logit contributions
 pads-0.038
ety-0.032
 tails-0.030
burn-0.030
 Indians-0.030
Token her
Token ID607
Feature activation-7.11e-01

Pos logit contributions
ore+0.047
year+0.042
 gen+0.039
 healer+0.038
 x+0.037
Neg logit contributions
 pads-0.050
ety-0.043
 tails-0.041
 sunglasses-0.040
burn-0.040
Token mum
Token ID25682
Feature activation+4.31e-01

Pos logit contributions
 pads+0.038
ety+0.031
 Indians+0.029
burn+0.028
coat+0.028
Neg logit contributions
ore-0.036
 x-0.034
 gen-0.033
year-0.031
 goblin-0.031
Token.
Token ID13
Feature activation+1.53e+00

Pos logit contributions
ore+0.019
year+0.017
 x+0.017
 gen+0.017
 goblin+0.016
Neg logit contributions
 pads-0.026
ety-0.022
burn-0.020
coat-0.020
 tails-0.020
Token you
Token ID345
Feature activation+1.44e-01

Pos logit contributions
 pads+0.060
ety+0.052
coat+0.051
burn+0.051
 pad+0.049
Neg logit contributions
ore-0.028
 x-0.028
 gen-0.026
year-0.023
 operating-0.022
Token like
Token ID588
Feature activation-5.75e-01

Pos logit contributions
ore+0.008
 gen+0.007
 x+0.007
year+0.007
 goblin+0.007
Neg logit contributions
 pads-0.007
ety-0.006
 tails-0.005
coat-0.005
 pad-0.005
Token it
Token ID340
Feature activation+2.67e-01

Pos logit contributions
 pads+0.037
ety+0.032
 Indians+0.030
coat+0.030
 tails+0.030
Neg logit contributions
ore-0.022
 x-0.020
 gen-0.020
year-0.019
 operating-0.018
Token?"
Token ID1701
Feature activation+2.06e+00

Pos logit contributions
ore+0.010
year+0.009
 gen+0.009
 x+0.009
 goblin+0.008
Neg logit contributions
 pads-0.018
ety-0.015
burn-0.014
coat-0.014
 tails-0.014
Token the
Token ID262
Feature activation-2.37e-01

Pos logit contributions
year+0.133
ore+0.130
fire+0.124
 ingredient+0.122
 potion+0.121
Neg logit contributions
 pads-0.066
ety-0.054
 tails-0.047
 sunglasses-0.042
 dads-0.042
Token bar
Token ID2318
Feature activation+3.73e+00

Pos logit contributions
 pads+0.014
ety+0.012
 tails+0.011
burn+0.011
 Indians+0.011
Neg logit contributions
ore-0.010
 x-0.009
 gen-0.009
year-0.009
 goblin-0.008
Tokenber
Token ID527
Feature activation+2.46e+00

Pos logit contributions
 potion+0.186
 x+0.183
 cellar+0.175
 operating+0.172
ore+0.170
Neg logit contributions
 pads-0.144
ety-0.131
 dads-0.119
ked-0.107
 pad-0.104
Token asked
Token ID1965
Feature activation+1.40e+00

Pos logit contributions
ore+0.135
year+0.129
body+0.121
fire+0.119
 x+0.119
Neg logit contributions
 pads-0.112
ety-0.092
 swung-0.082
 pad-0.081
coat-0.077
Token.
Token ID13
Feature activation+1.93e+00

Pos logit contributions
ore+0.067
 gen+0.062
year+0.062
 x+0.061
 healer+0.059
Neg logit contributions
 pads-0.072
ety-0.061
 tails-0.056
 pad-0.053
coat-0.053
Token
Token ID198
Feature activation+1.52e+00

Pos logit contributions
ore+0.107
year+0.105
fire+0.099
 gen+0.098
 ingredient+0.098
Neg logit contributions
 pads-0.083
ety-0.060
 pad-0.058
 Indians-0.055
 tails-0.055
Token
Token ID198
Feature activation+8.87e-01

Pos logit contributions
year+0.080
ore+0.079
 gen+0.077
 x+0.075
 goblin+0.075
Neg logit contributions
 pads-0.069
ety-0.055
ibi-0.048
 tails-0.047
 Indians-0.047
Token
Token ID129
Feature activation-4.70e-01

Pos logit contributions
 pads+0.054
ety+0.045
 Indians+0.042
 tails+0.042
 sunglasses+0.041
Neg logit contributions
ore-0.048
 x-0.043
 gen-0.042
year-0.041
 operating-0.039
Token
Token ID241
Feature activation-6.97e-01

Pos logit contributions
 pads+0.026
ety+0.021
burn+0.020
 Indians+0.019
coat+0.019
Neg logit contributions
ore-0.023
 x-0.021
year-0.021
 gen-0.020
 operating-0.019
TokenCan
Token ID6090
Feature activation+2.81e-01

Pos logit contributions
 pads+0.039
ety+0.032
 Indians+0.030
 tails+0.030
burn+0.030
Neg logit contributions
ore-0.033
year-0.031
 x-0.030
 gen-0.029
 goblin-0.028
Token I
Token ID314
Feature activation+1.29e+00

Pos logit contributions
ore+0.013
year+0.012
 gen+0.012
 x+0.012
 goblin+0.011
Neg logit contributions
 pads-0.016
ety-0.013
coat-0.012
 Indians-0.012
burn-0.012
Token get
Token ID651
Feature activation+1.35e+00

Pos logit contributions
ore+0.063
year+0.061
 goblin+0.058
 gen+0.057
 x+0.056
Neg logit contributions
 pads-0.065
ety-0.056
coat-0.052
burn-0.051
 sunglasses-0.051
Token an
Token ID281
Feature activation+3.73e+00

Pos logit contributions
ore+0.062
 gen+0.059
 goblin+0.058
year+0.057
 x+0.056
Neg logit contributions
 pads-0.069
ety-0.061
 sunglasses-0.058
burn-0.056
 sunscreen-0.056
Token egg
Token ID5935
Feature activation+3.55e-01

Pos logit contributions
ore+0.184
year+0.181
 goblin+0.176
fire+0.176
 Voice+0.164
Neg logit contributions
 pads-0.133
 sunglasses-0.129
 pad-0.128
burn-0.123
ety-0.116
Token for
Token ID329
Feature activation+8.35e-01

Pos logit contributions
ore+0.013
year+0.012
 x+0.012
 gen+0.012
 goblin+0.011
Neg logit contributions
 pads-0.023
ety-0.020
burn-0.019
coat-0.018
 Indians-0.018
Token a
Token ID257
Feature activation+4.04e-01

Pos logit contributions
ore+0.040
year+0.038
 gen+0.036
 x+0.034
 goblin+0.034
Neg logit contributions
 pads-0.045
ety-0.037
 tails-0.037
 sunglasses-0.036
 Indians-0.036
Token cake
Token ID12187
Feature activation-2.88e-04

Pos logit contributions
ore+0.019
year+0.017
 x+0.017
 gen+0.017
 goblin+0.016
Neg logit contributions
 pads-0.023
ety-0.019
burn-0.018
coat-0.018
 pad-0.018
Token?
Token ID30
Feature activation+5.74e-01

Pos logit contributions
 pads+0.000
ety+0.000
burn+0.000
coat+0.000
 tails+0.000
Neg logit contributions
ore-0.000
 x-0.000
year-0.000
 gen-0.000
 goblin-0.000
Token Who
Token ID5338
Feature activation+7.48e-01

Pos logit contributions
ore+0.057
year+0.054
 x+0.054
 gen+0.052
 goblin+0.052
Neg logit contributions
 pads-0.058
ety-0.048
 Indians-0.045
 tails-0.044
 pad-0.041
Token are
Token ID389
Feature activation+1.17e+00

Pos logit contributions
ore+0.043
year+0.041
 gen+0.039
 x+0.039
 goblin+0.038
Neg logit contributions
 pads-0.034
ety-0.027
burn-0.024
 tails-0.024
 Indians-0.024
Token you
Token ID345
Feature activation-9.21e-02

Pos logit contributions
ore+0.062
year+0.059
 gen+0.057
 goblin+0.056
 x+0.055
Neg logit contributions
 pads-0.054
ety-0.045
 Indians-0.043
 tails-0.042
 sunglasses-0.042
Token?"
Token ID1701
Feature activation+1.87e+00

Pos logit contributions
 pads+0.006
ety+0.006
 tails+0.005
burn+0.005
 Indians+0.005
Neg logit contributions
ore-0.003
 x-0.003
 gen-0.003
year-0.003
 goblin-0.003
Token a
Token ID257
Feature activation+1.81e+00

Pos logit contributions
ore+0.138
year+0.133
 goblin+0.125
 potion+0.123
 ingredient+0.123
Neg logit contributions
 pads-0.041
 dads-0.034
 tails-0.034
ety-0.033
 sunglasses-0.030
Token voice
Token ID3809
Feature activation+3.73e+00

Pos logit contributions
ore+0.074
year+0.067
abel+0.061
 x+0.060
 operating+0.058
Neg logit contributions
 pads-0.096
 dads-0.087
ety-0.087
 Indians-0.086
 tails-0.085
Token says
Token ID1139
Feature activation+1.04e+00

Pos logit contributions
ore+0.187
year+0.180
 healer+0.178
 goblin+0.174
 ingredient+0.169
Neg logit contributions
 pads-0.142
ety-0.114
 dads-0.113
 swung-0.113
 sunglasses-0.113
Token.
Token ID13
Feature activation+1.09e+00

Pos logit contributions
ore+0.047
year+0.045
 gen+0.044
 x+0.044
 healer+0.043
Neg logit contributions
 pads-0.058
ety-0.047
 tails-0.045
coat-0.043
burn-0.043
Token The
Token ID383
Feature activation+1.81e+00

Pos logit contributions
ore+0.056
year+0.052
 x+0.050
 goblin+0.048
abel+0.048
Neg logit contributions
 pads-0.055
ety-0.043
 tails-0.042
 pad-0.039
 Indians-0.039
Token voice
Token ID3809
Feature activation+2.81e+00

Pos logit contributions
year+0.080
ore+0.078
le+0.069
old+0.068
abel+0.068
Neg logit contributions
 pads-0.083
 tails-0.078
 sunglasses-0.075
burn-0.073
 tail-0.073
Token is
Token ID318
Feature activation-1.87e-01

Pos logit contributions
ore+0.147
year+0.136
 healer+0.135
handed+0.133
fire+0.133
Neg logit contributions
 pads-0.122
 swung-0.099
 tails-0.099
 sunglasses-0.096
 hoops-0.095
Token suddenly
Token ID6451
Feature activation+3.83e-01

Pos logit contributions
ore+0.060
 x+0.056
year+0.055
 gen+0.054
 healer+0.052
Neg logit contributions
 pads-0.058
ety-0.047
 pad-0.044
 tails-0.044
 sunglasses-0.043
Token she
Token ID673
Feature activation-5.32e-01

Pos logit contributions
ore+0.018
year+0.017
 x+0.017
 gen+0.016
 goblin+0.015
Neg logit contributions
 pads-0.021
ety-0.017
 tails-0.016
coat-0.016
burn-0.016
Token heard
Token ID2982
Feature activation+1.85e+00

Pos logit contributions
 pads+0.029
ety+0.024
burn+0.022
 Indians+0.022
 tails+0.022
Neg logit contributions
ore-0.026
 x-0.024
 gen-0.024
year-0.023
 goblin-0.022
Token a
Token ID257
Feature activation+1.98e+00

Pos logit contributions
ore+0.096
 healer+0.087
 gen+0.083
year+0.082
 goblin+0.082
Neg logit contributions
 pads-0.089
 sunglasses-0.073
ety-0.068
 tails-0.067
burn-0.065
Token voice
Token ID3809
Feature activation+2.55e+00

Pos logit contributions
ore+0.091
year+0.087
old+0.079
 ingredient+0.076
handed+0.075
Neg logit contributions
 pads-0.094
 tails-0.079
 pad-0.079
ety-0.078
 Indians-0.078
Token.
Token ID13
Feature activation+3.73e+00

Pos logit contributions
ore+0.123
year+0.122
 healer+0.120
 goblin+0.118
 gen+0.112
Neg logit contributions
 pads-0.110
ety-0.089
 sunglasses-0.087
burn-0.087
 pad-0.083
Token
Token ID198
Feature activation+2.69e+00

Pos logit contributions
 x+0.206
handed+0.195
ore+0.193
year+0.191
 healer+0.190
Neg logit contributions
 pads-0.118
 pad-0.089
ety-0.085
 wo-0.084
 tails-0.081
Token
Token ID198
Feature activation+3.03e+00

Pos logit contributions
year+0.154
 x+0.152
 gen+0.150
 goblin+0.150
 healer+0.146
Neg logit contributions
 pads-0.090
ety-0.062
ibi-0.059
coat-0.057
 pad-0.056
TokenIt
Token ID1026
Feature activation+1.11e+00

Pos logit contributions
 x+0.181
ore+0.178
 gen+0.170
 goblin+0.169
year+0.169
Neg logit contributions
 pads-0.082
 tails-0.055
ety-0.054
 Indians-0.053
coat-0.051
Token was
Token ID373
Feature activation+8.16e-01

Pos logit contributions
ore+0.050
year+0.046
 x+0.045
 healer+0.042
 gen+0.042
Neg logit contributions
 pads-0.063
ety-0.053
burn-0.049
 tails-0.049
 Indians-0.048
Token a
Token ID257
Feature activation+1.44e+00

Pos logit contributions
ore+0.041
year+0.037
 gen+0.035
 goblin+0.034
 x+0.034
Neg logit contributions
 pads-0.040
ety-0.035
burn-0.033
 sunglasses-0.033
 tails-0.032
Token did
Token ID750
Feature activation+1.40e+00

Pos logit contributions
ore+0.029
year+0.027
 gen+0.027
 x+0.027
 goblin+0.025
Neg logit contributions
 pads-0.024
ety-0.020
 sunglasses-0.018
 Indians-0.018
 tails-0.017
Token you
Token ID345
Feature activation+1.02e+00

Pos logit contributions
ore+0.073
year+0.070
 gen+0.064
 goblin+0.063
 x+0.062
Neg logit contributions
 pads-0.070
ety-0.057
 sunglasses-0.056
 Indians-0.055
 tails-0.055
Token find
Token ID1064
Feature activation+1.73e+00

Pos logit contributions
ore+0.052
year+0.052
 gen+0.048
 goblin+0.046
 healer+0.045
Neg logit contributions
 pads-0.048
ety-0.041
 sunglasses-0.040
 Indians-0.038
burn-0.038
Token that
Token ID326
Feature activation+6.14e-01

Pos logit contributions
year+0.087
ore+0.082
 goblin+0.075
 gen+0.075
fire+0.074
Neg logit contributions
 pads-0.080
 sunglasses-0.073
ety-0.067
 sunscreen-0.065
bur-0.064
Token handle
Token ID5412
Feature activation-2.07e-02

Pos logit contributions
ore+0.025
year+0.024
 gen+0.021
 x+0.021
 goblin+0.021
Neg logit contributions
 pads-0.037
ety-0.032
burn-0.030
 sunglasses-0.030
coat-0.030
Token?"
Token ID1701
Feature activation+3.73e+00

Pos logit contributions
 pads+0.001
ety+0.001
 tails+0.001
burn+0.001
coat+0.001
Neg logit contributions
ore-0.001
 x-0.001
year-0.001
 gen-0.001
 goblin-0.001
Token Clara
Token ID27443
Feature activation+1.69e+00

Pos logit contributions
 potion+0.223
auc+0.221
year+0.219
ore+0.218
 x+0.214
Neg logit contributions
 pads-0.079
 wrists-0.076
 swung-0.067
 sunglasses-0.065
hi-0.062
Token looked
Token ID3114
Feature activation+1.29e+00

Pos logit contributions
ore+0.086
year+0.082
 x+0.080
 gen+0.077
 operating+0.076
Neg logit contributions
 pads-0.081
ety-0.071
 pad-0.063
lets-0.060
 tails-0.060
Token up
Token ID510
Feature activation+1.21e+00

Pos logit contributions
 gen+0.060
ore+0.060
 x+0.060
year+0.056
fire+0.055
Neg logit contributions
 pads-0.068
ety-0.058
 sunglasses-0.056
 pad-0.055
 tails-0.052
Token and
Token ID290
Feature activation-7.85e-01

Pos logit contributions
 x+0.066
 gen+0.065
ore+0.064
year+0.061
fire+0.060
Neg logit contributions
 pads-0.055
ety-0.046
 sunglasses-0.043
 tails-0.041
 pad-0.040
Token saw
Token ID2497
Feature activation+1.16e+00

Pos logit contributions
 pads+0.046
ety+0.038
burn+0.037
coat+0.037
 tails+0.037
Neg logit contributions
ore-0.034
 gen-0.031
year-0.031
 x-0.030
 goblin-0.029
Token have
Token ID423
Feature activation-1.76e-01

Pos logit contributions
 pads+0.018
ety+0.015
 tails+0.014
 Indians+0.013
coat+0.013
Neg logit contributions
ore-0.014
 x-0.013
 gen-0.013
year-0.013
 goblin-0.012
Token to
Token ID284
Feature activation+5.73e-01

Pos logit contributions
 pads+0.010
ety+0.009
 tails+0.008
burn+0.008
 Indians+0.008
Neg logit contributions
ore-0.008
 x-0.007
 gen-0.007
year-0.007
 goblin-0.007
Token destroy
Token ID4117
Feature activation+9.30e-01

Pos logit contributions
ore+0.029
year+0.028
 gen+0.025
 goblin+0.025
 x+0.025
Neg logit contributions
 pads-0.029
ety-0.024
burn-0.023
 Indians-0.023
 tails-0.023
Token my
Token ID616
Feature activation-5.51e-01

Pos logit contributions
ore+0.042
year+0.041
 gen+0.038
 goblin+0.037
 healer+0.036
Neg logit contributions
 pads-0.052
ety-0.043
 tails-0.042
 sunglasses-0.041
burn-0.040
Token map
Token ID3975
Feature activation+5.57e-01

Pos logit contributions
 pads+0.031
ety+0.026
burn+0.025
coat+0.025
 Indians+0.024
Neg logit contributions
ore-0.026
 x-0.024
 gen-0.023
year-0.022
 goblin-0.022
Token?"
Token ID1701
Feature activation+3.72e+00

Pos logit contributions
ore+0.021
year+0.019
 gen+0.018
 x+0.018
 goblin+0.017
Neg logit contributions
 pads-0.037
ety-0.031
coat-0.029
burn-0.029
 Indians-0.029
Token Jamie
Token ID17826
Feature activation+1.40e+00

Pos logit contributions
ore+0.212
 potion+0.201
fire+0.200
year+0.199
 healer+0.193
Neg logit contributions
 pads-0.095
 swung-0.089
 sunglasses-0.077
ety-0.073
 Tom-0.073
Token cried
Token ID16896
Feature activation+1.32e+00

Pos logit contributions
ore+0.072
year+0.068
 gen+0.063
 x+0.063
fire+0.060
Neg logit contributions
 pads-0.071
ety-0.061
 pad-0.054
 tails-0.054
burn-0.053
Token.
Token ID13
Feature activation+9.50e-01

Pos logit contributions
ore+0.063
 gen+0.058
year+0.057
 ingredient+0.055
 healer+0.054
Neg logit contributions
 pads-0.067
ety-0.059
burn-0.055
coat-0.053
 tails-0.052
Token
Token ID198
Feature activation+1.08e+00

Pos logit contributions
ore+0.046
year+0.043
 x+0.043
 gen+0.042
 goblin+0.040
Neg logit contributions
 pads-0.048
ety-0.040
coat-0.037
 pad-0.036
 Indians-0.036
Token
Token ID198
Feature activation+1.38e+00

Pos logit contributions
ore+0.050
 gen+0.049
year+0.049
 x+0.048
 goblin+0.048
Neg logit contributions
 pads-0.054
ety-0.046
ibi-0.042
 Indians-0.040
coat-0.040
Token's
Token ID338
Feature activation+5.71e-01

Pos logit contributions
ore+0.073
 x+0.068
year+0.064
 gen+0.064
 goblin+0.062
Neg logit contributions
 pads-0.073
ety-0.059
 pad-0.056
lets-0.054
 swung-0.054
Token mom
Token ID1995
Feature activation+6.17e-01

Pos logit contributions
ore+0.034
year+0.031
 gen+0.030
 x+0.030
 operating+0.029
Neg logit contributions
 pads-0.025
ety-0.019
 tails-0.019
 sunglasses-0.018
 pad-0.018
Token had
Token ID550
Feature activation+9.03e-03

Pos logit contributions
ore+0.031
 x+0.028
 gen+0.028
year+0.027
 goblin+0.026
Neg logit contributions
 pads-0.032
ety-0.027
coat-0.025
 pad-0.024
burn-0.024
Token filled
Token ID5901
Feature activation+2.18e+00

Pos logit contributions
ore+0.000
 x+0.000
 gen+0.000
year+0.000
 goblin+0.000
Neg logit contributions
 pads-0.001
ety-0.000
 tails-0.000
 Indians-0.000
coat-0.000
Token the
Token ID262
Feature activation+2.68e+00

Pos logit contributions
ore+0.117
 x+0.111
 operating+0.109
 gen+0.109
year+0.107
Neg logit contributions
 pads-0.085
 sunglasses-0.073
ety-0.073
 sunscreen-0.070
 tails-0.069
Token chamber
Token ID11847
Feature activation+3.71e+00

Pos logit contributions
ore+0.100
year+0.098
abel+0.085
 gen+0.085
ment+0.084
Neg logit contributions
 pads-0.150
 pad-0.132
 tails-0.129
 tail-0.122
ety-0.122
Token with
Token ID351
Feature activation+6.56e-01

Pos logit contributions
 x+0.175
ore+0.172
 gen+0.158
year+0.158
 goblin+0.153
Neg logit contributions
 pads-0.189
 pad-0.146
coat-0.140
ety-0.138
lets-0.137
Token balloons
Token ID35485
Feature activation-6.34e-01

Pos logit contributions
ore+0.032
 x+0.029
 gen+0.029
year+0.028
 goblin+0.027
Neg logit contributions
 pads-0.036
ety-0.028
 tails-0.028
 sunglasses-0.028
coat-0.027
Token,
Token ID11
Feature activation-6.50e-01

Pos logit contributions
 pads+0.041
ety+0.035
 Indians+0.033
 tails+0.033
burn+0.033
Neg logit contributions
ore-0.024
 gen-0.021
 x-0.021
year-0.021
 goblin-0.019
Token stream
Token ID4269
Feature activation-3.71e-01

Pos logit contributions
 pads+0.038
ety+0.033
burn+0.030
 tails+0.030
coat+0.030
Neg logit contributions
ore-0.028
 x-0.026
 gen-0.026
year-0.026
 goblin-0.024
Tokeners
Token ID364
Feature activation-1.32e+00

Pos logit contributions
 pads+0.026
ety+0.023
burn+0.022
 tails+0.022
 Indians+0.022
Neg logit contributions
ore-0.012
 x-0.010
 gen-0.010
year-0.010
 goblin-0.009
Token golf
Token ID13126
Feature activation-8.57e-01

Pos logit contributions
year+0.038
ore+0.037
 gen+0.035
abel+0.032
 goblin+0.032
Neg logit contributions
 pads-0.052
coat-0.043
 sunglasses-0.043
 tails-0.043
 Indians-0.043
Token with
Token ID351
Feature activation-2.25e+00

Pos logit contributions
 pads+0.063
ety+0.056
burn+0.055
 tails+0.053
 Indians+0.053
Neg logit contributions
ore-0.024
 x-0.020
year-0.019
 gen-0.019
 goblin-0.017
Token his
Token ID465
Feature activation-2.85e+00

Pos logit contributions
 pads+0.172
burn+0.165
coat+0.164
ety+0.164
drops+0.154
Neg logit contributions
ore-0.038
 x-0.038
 of-0.035
year-0.026
looking-0.023
Token dad
Token ID9955
Feature activation-7.84e-01

Pos logit contributions
 pads+0.182
burn+0.171
coat+0.163
ety+0.163
drops+0.160
Neg logit contributions
 x-0.074
ore-0.067
 of-0.058
 old-0.057
--0.057
Token.
Token ID13
Feature activation-1.43e+00

Pos logit contributions
 pads+0.054
ety+0.047
 Indians+0.045
coat+0.045
 tails+0.044
Neg logit contributions
ore-0.026
 x-0.023
year-0.022
 gen-0.021
 healer-0.020
Token They
Token ID1119
Feature activation-4.31e+00

Pos logit contributions
 pads+0.104
ety+0.089
burn+0.087
 tails+0.087
 Indians+0.085
Neg logit contributions
ore-0.043
 x-0.039
 gen-0.034
 operating-0.032
year-0.032
Token would
Token ID561
Feature activation-4.15e+00

Pos logit contributions
burn+0.249
 pads+0.248
coat+0.230
 tails+0.229
 fault+0.228
Neg logit contributions
 of-0.107
year-0.100
 year-0.084
 was-0.080
 vill-0.080
Token go
Token ID467
Feature activation-1.81e+00

Pos logit contributions
 pads+0.244
ety+0.220
burn+0.210
ibi+0.203
ked+0.203
Neg logit contributions
 x-0.121
 deliver-0.110
 of-0.108
 knock-0.098
 cook-0.097
Token to
Token ID284
Feature activation-1.49e+00

Pos logit contributions
 pads+0.118
 tails+0.101
ety+0.100
burn+0.099
 pad+0.096
Neg logit contributions
ore-0.067
 x-0.053
year-0.051
 of-0.048
 healer-0.046
Token the
Token ID262
Feature activation-1.25e+00

Pos logit contributions
 pads+0.112
ety+0.094
coat+0.093
burn+0.092
 tails+0.091
Neg logit contributions
 x-0.042
ore-0.039
 gen-0.033
 operating-0.032
 healer-0.031
Token golf
Token ID13126
Feature activation-1.70e+00

Pos logit contributions
 pads+0.069
ety+0.058
 tails+0.055
burn+0.054
coat+0.054
Neg logit contributions
ore-0.058
 x-0.057
 gen-0.053
 operating-0.051
 healer-0.050
Token on
Token ID319
Feature activation-1.13e-01

Pos logit contributions
 pads+0.070
ety+0.061
 tails+0.058
coat+0.056
 Indians+0.056
Neg logit contributions
ore-0.044
year-0.038
 x-0.038
 gen-0.036
 operating-0.034
Token,
Token ID11
Feature activation-8.36e-01

Pos logit contributions
 pads+0.007
ety+0.006
 tails+0.005
burn+0.005
coat+0.005
Neg logit contributions
ore-0.005
 x-0.005
year-0.005
 gen-0.005
 goblin-0.004
Token Lily
Token ID20037
Feature activation-2.92e+00

Pos logit contributions
 pads+0.051
ety+0.041
 pad+0.040
coat+0.040
 tails+0.040
Neg logit contributions
ore-0.035
 x-0.032
 gen-0.031
year-0.031
 operating-0.029
Token and
Token ID290
Feature activation-5.15e-01

Pos logit contributions
 pads+0.179
ety+0.165
 tails+0.165
 Indians+0.161
 sunscreen+0.154
Neg logit contributions
ore-0.089
abel-0.077
 gen-0.074
old-0.074
 of-0.073
Token her
Token ID607
Feature activation-2.63e+00

Pos logit contributions
 pads+0.031
ety+0.025
coat+0.024
 tails+0.024
burn+0.023
Neg logit contributions
ore-0.023
 x-0.021
 gen-0.020
year-0.020
 goblin-0.019
Token neighbor
Token ID4780
Feature activation-4.25e+00

Pos logit contributions
 pads+0.153
ety+0.136
burn+0.128
coat+0.128
inas+0.126
Neg logit contributions
 x-0.102
 goblin-0.084
 news-0.081
 gen-0.076
 Doctor-0.075
Token became
Token ID2627
Feature activation-2.07e+00

Pos logit contributions
 pads+0.197
 Indians+0.195
ety+0.182
bur+0.179
 sunscreen+0.177
Neg logit contributions
 of-0.171
old-0.158
abel-0.148
ore-0.144
year-0.139
Token good
Token ID922
Feature activation-7.29e-01

Pos logit contributions
 pads+0.124
ety+0.104
coat+0.100
 pad+0.100
ibi+0.097
Neg logit contributions
 x-0.079
year-0.077
ore-0.076
 healer-0.076
 gen-0.074
Token friends
Token ID2460
Feature activation+8.38e-02

Pos logit contributions
 pads+0.040
ety+0.033
 Indians+0.030
coat+0.030
 tails+0.030
Neg logit contributions
ore-0.035
 x-0.032
year-0.032
 gen-0.031
 goblin-0.030
Token and
Token ID290
Feature activation-2.90e+00

Pos logit contributions
ore+0.004
 x+0.003
 gen+0.003
year+0.003
 goblin+0.003
Neg logit contributions
 pads-0.005
ety-0.004
 tails-0.004
burn-0.004
coat-0.004
Token played
Token ID2826
Feature activation-6.54e-01

Pos logit contributions
 pads+0.143
ety+0.122
bur+0.117
 pad+0.117
coat+0.116
Neg logit contributions
year-0.122
 of-0.117
ore-0.116
old-0.110
 operating-0.108
Tokenooters
Token ID48316
Feature activation-2.27e+00

Pos logit contributions
 pads+0.065
 tails+0.052
ety+0.052
burn+0.052
 Indians+0.050
Neg logit contributions
ore-0.101
year-0.096
old-0.094
 x-0.090
 operating-0.085
Token.
Token ID13
Feature activation-9.05e-01

Pos logit contributions
 pads+0.160
 Indians+0.148
ety+0.144
 tails+0.142
 pad+0.138
Neg logit contributions
ore-0.064
year-0.050
 x-0.046
 of-0.045
le-0.044
Token They
Token ID1119
Feature activation-2.75e+00

Pos logit contributions
 pads+0.058
ety+0.049
burn+0.049
coat+0.048
 Indians+0.047
Neg logit contributions
ore-0.035
 x-0.030
year-0.029
 gen-0.028
 operating-0.028
Token always
Token ID1464
Feature activation-2.97e+00

Pos logit contributions
 pads+0.163
burn+0.148
 tails+0.146
bit+0.138
ety+0.138
Neg logit contributions
year-0.088
 gen-0.080
ore-0.078
abel-0.077
 of-0.077
Token wore
Token ID12408
Feature activation-2.80e+00

Pos logit contributions
 pads+0.168
 tails+0.148
burn+0.146
 hoops+0.142
ety+0.141
Neg logit contributions
 x-0.093
year-0.093
ore-0.092
 gen-0.092
old-0.086
Token their
Token ID511
Feature activation-4.23e+00

Pos logit contributions
ety+0.146
 pads+0.145
 pad+0.132
burn+0.123
flower+0.123
Neg logit contributions
 x-0.110
year-0.093
 operating-0.093
 gen-0.092
 news-0.091
Token gear
Token ID7733
Feature activation-1.42e+00

Pos logit contributions
ety+0.193
 pads+0.178
burn+0.175
inas+0.172
 Indians+0.166
Neg logit contributions
 x-0.188
 news-0.161
 year-0.160
 gen-0.155
 wishes-0.155
Token:
Token ID25
Feature activation-3.24e+00

Pos logit contributions
 pads+0.084
ety+0.075
 Indians+0.071
burn+0.069
 tails+0.067
Neg logit contributions
ore-0.057
 x-0.052
 gen-0.050
year-0.048
 operating-0.046
Token helmets
Token ID28359
Feature activation-2.87e+00

Pos logit contributions
 pads+0.180
ety+0.166
burn+0.164
ibi+0.155
bur+0.155
Neg logit contributions
 x-0.101
year-0.101
ore-0.101
 operating-0.090
 of-0.086
Token,
Token ID11
Feature activation-2.44e+00

Pos logit contributions
 pads+0.197
ety+0.180
 Indians+0.176
 tails+0.169
 dads+0.161
Neg logit contributions
ore-0.090
 x-0.071
 gen-0.068
 operating-0.064
 of-0.062
Token pads
Token ID21226
Feature activation-3.00e+00

Pos logit contributions
 pads+0.099
ety+0.094
burn+0.084
ibi+0.083
 pad+0.082
Neg logit contributions
 x-0.128
 operating-0.108
 gen-0.108
ore-0.105
 of-0.104
Token on
Token ID319
Feature activation-3.69e-01

Pos logit contributions
 pads+0.086
ety+0.075
 tails+0.071
coat+0.069
 Indians+0.068
Neg logit contributions
ore-0.053
year-0.047
 x-0.045
 gen-0.042
 operating-0.040
Token,
Token ID11
Feature activation-1.05e+00

Pos logit contributions
 pads+0.023
ety+0.020
 tails+0.018
burn+0.018
coat+0.018
Neg logit contributions
ore-0.015
 x-0.014
 gen-0.014
year-0.013
 goblin-0.013
Token he
Token ID339
Feature activation-8.88e-01

Pos logit contributions
 pads+0.066
coat+0.053
 pad+0.053
ety+0.053
 tails+0.051
Neg logit contributions
ore-0.043
year-0.039
 gen-0.038
 x-0.038
 operating-0.036
Token wore
Token ID12408
Feature activation-2.00e+00

Pos logit contributions
 pads+0.047
ety+0.038
 tails+0.036
coat+0.036
burn+0.035
Neg logit contributions
ore-0.044
 gen-0.041
year-0.040
 x-0.040
 goblin-0.038
Token his
Token ID465
Feature activation-3.98e+00

Pos logit contributions
 pads+0.156
ety+0.146
burn+0.136
 pad+0.134
coat+0.132
Neg logit contributions
 x-0.037
ore-0.034
 gen-0.032
year-0.031
 operating-0.028
Token new
Token ID649
Feature activation-4.23e+00

Pos logit contributions
 pads+0.168
burn+0.161
ety+0.150
lets+0.139
inas+0.138
Neg logit contributions
 x-0.181
 goblin-0.171
 news-0.171
 magical-0.171
 old-0.168
Token sunglasses
Token ID36251
Feature activation-2.04e+00

Pos logit contributions
 pads+0.154
ety+0.129
burn+0.127
 Indians+0.122
ked+0.113
Neg logit contributions
 x-0.216
 ingredient-0.214
 news-0.213
 year-0.207
--0.206
Token everywhere
Token ID8347
Feature activation-1.75e-01

Pos logit contributions
 pads+0.125
ety+0.113
 tails+0.110
 Indians+0.108
burn+0.105
Neg logit contributions
ore-0.078
year-0.067
 x-0.063
 gen-0.063
 of-0.059
Token he
Token ID339
Feature activation-2.53e-01

Pos logit contributions
 pads+0.012
ety+0.010
 tails+0.010
burn+0.010
 Indians+0.010
Neg logit contributions
ore-0.006
 x-0.005
 gen-0.005
year-0.005
 goblin-0.005
Token went
Token ID1816
Feature activation-3.11e-01

Pos logit contributions
 pads+0.012
ety+0.010
 tails+0.009
burn+0.009
 Indians+0.009
Neg logit contributions
ore-0.014
 x-0.013
year-0.013
 gen-0.013
 goblin-0.012
Token,
Token ID11
Feature activation-1.87e+00

Pos logit contributions
 pads+0.020
ety+0.017
 tails+0.016
burn+0.016
 Indians+0.016
Neg logit contributions
ore-0.012
 x-0.011
 gen-0.010
year-0.010
 goblin-0.010
Token!"
Token ID2474
Feature activation-4.12e-01

Pos logit contributions
 pads+0.142
ety+0.128
 Indians+0.124
 tails+0.122
 sunglasses+0.122
Neg logit contributions
ore-0.073
 of-0.053
 x-0.052
year-0.048
 ingredient-0.047
Token So
Token ID1406
Feature activation-1.49e+00

Pos logit contributions
 pads+0.023
ety+0.019
 tails+0.018
 Indians+0.018
coat+0.018
Neg logit contributions
ore-0.019
 x-0.018
 gen-0.017
year-0.017
 operating-0.016
Token,
Token ID11
Feature activation-8.66e-01

Pos logit contributions
 pads+0.105
ety+0.089
 pad+0.087
coat+0.086
 Indians+0.085
Neg logit contributions
ore-0.047
 gen-0.041
 x-0.040
year-0.037
 operating-0.037
Token Kitty
Token ID21168
Feature activation-2.16e+00

Pos logit contributions
 pads+0.048
ety+0.039
 pad+0.037
coat+0.036
 tails+0.036
Neg logit contributions
ore-0.041
 x-0.038
 gen-0.038
year-0.037
 operating-0.035
Token and
Token ID290
Feature activation-2.18e+00

Pos logit contributions
 pads+0.134
ety+0.118
 tails+0.116
burn+0.113
 Indians+0.112
Neg logit contributions
ore-0.073
 x-0.063
old-0.061
 gen-0.060
year-0.060
Token Dog
Token ID8532
Feature activation-4.21e+00

Pos logit contributions
 pads+0.113
ety+0.089
coat+0.088
 pad+0.088
 tails+0.080
Neg logit contributions
 x-0.101
ore-0.093
 operating-0.093
 gen-0.091
 God-0.090
Tokengy
Token ID1360
Feature activation-3.11e+00

Pos logit contributions
 pads+0.219
 sunscreen+0.209
 sunglasses+0.203
 Indians+0.198
 hoops+0.193
Neg logit contributions
ore-0.138
oc-0.132
le-0.128
ray-0.128
fire-0.123
Token took
Token ID1718
Feature activation-8.91e-01

Pos logit contributions
 pads+0.166
 Indians+0.145
ety+0.143
burn+0.139
 pad+0.131
Neg logit contributions
old-0.119
ore-0.118
 of-0.116
year-0.115
abel-0.112
Token turns
Token ID4962
Feature activation-1.58e+00

Pos logit contributions
 pads+0.058
ety+0.050
 Indians+0.047
 pad+0.047
coat+0.047
Neg logit contributions
ore-0.033
 x-0.031
year-0.029
 gen-0.028
 operating-0.026
Token sliding
Token ID22292
Feature activation-2.39e+00

Pos logit contributions
 pads+0.076
ety+0.065
 Indians+0.060
 tails+0.057
burn+0.055
Neg logit contributions
ore-0.087
year-0.077
 x-0.075
 of-0.074
 gen-0.072
Token with
Token ID351
Feature activation-2.34e+00

Pos logit contributions
 pads+0.152
 Indians+0.140
ety+0.138
burn+0.129
 pad+0.127
Neg logit contributions
ore-0.088
 of-0.063
 x-0.062
year-0.059
 site-0.057
Token golf
Token ID13126
Feature activation-7.13e-01

Pos logit contributions
year+0.049
ore+0.048
 gen+0.045
abel+0.043
 goblin+0.041
Neg logit contributions
 pads-0.063
 sunglasses-0.053
coat-0.053
 tails-0.053
 Indians-0.052
Token with
Token ID351
Feature activation-2.03e+00

Pos logit contributions
 pads+0.052
ety+0.046
burn+0.045
 Indians+0.044
 tails+0.044
Neg logit contributions
ore-0.021
 x-0.018
year-0.017
 gen-0.017
 goblin-0.015
Token his
Token ID465
Feature activation-3.07e+00

Pos logit contributions
 pads+0.163
burn+0.154
ety+0.154
coat+0.152
 pad+0.146
Neg logit contributions
 x-0.030
ore-0.030
 of-0.024
year-0.019
looking-0.016
Token dad
Token ID9955
Feature activation-8.32e-01

Pos logit contributions
 pads+0.190
burn+0.184
coat+0.177
ety+0.175
drops+0.171
Neg logit contributions
 x-0.074
ore-0.070
 of-0.065
 old-0.062
 voice-0.060
Token.
Token ID13
Feature activation-1.34e+00

Pos logit contributions
 pads+0.058
ety+0.050
 Indians+0.049
coat+0.048
burn+0.048
Neg logit contributions
ore-0.027
 x-0.024
year-0.022
 gen-0.022
 healer-0.020
Token They
Token ID1119
Feature activation-4.19e+00

Pos logit contributions
 pads+0.096
ety+0.081
burn+0.081
 tails+0.080
 Indians+0.079
Neg logit contributions
ore-0.043
 x-0.037
 gen-0.033
year-0.032
 operating-0.031
Token would
Token ID561
Feature activation-4.11e+00

Pos logit contributions
burn+0.250
 pads+0.245
 fault+0.227
coat+0.224
aters+0.223
Neg logit contributions
 of-0.103
year-0.098
 vill-0.081
 year-0.079
 was-0.077
Token go
Token ID467
Feature activation-1.81e+00

Pos logit contributions
 pads+0.244
ety+0.221
burn+0.214
ibi+0.203
ked+0.200
Neg logit contributions
 x-0.119
 deliver-0.109
 of-0.106
 show-0.097
 repair-0.097
Token to
Token ID284
Feature activation-1.54e+00

Pos logit contributions
 pads+0.118
burn+0.100
 tails+0.100
ety+0.100
 pad+0.096
Neg logit contributions
ore-0.067
 x-0.053
year-0.051
 of-0.048
 healer-0.046
Token the
Token ID262
Feature activation-1.31e+00

Pos logit contributions
 pads+0.116
ety+0.097
coat+0.096
burn+0.095
 tails+0.093
Neg logit contributions
 x-0.043
ore-0.040
 gen-0.034
 operating-0.033
 healer-0.031
Token big
Token ID1263
Feature activation-1.32e+00

Pos logit contributions
 pads+0.071
ety+0.059
burn+0.056
 tails+0.056
coat+0.055
Neg logit contributions
ore-0.062
 x-0.060
 gen-0.057
 goblin-0.054
 healer-0.054
Token the
Token ID262
Feature activation-1.19e+00

Pos logit contributions
ore+0.005
year+0.005
 gen+0.005
 x+0.004
 goblin+0.004
Neg logit contributions
 pads-0.006
ety-0.005
 Indians-0.005
 tails-0.005
burn-0.005
Token park
Token ID3952
Feature activation+6.21e-01

Pos logit contributions
 pads+0.073
ety+0.060
 tails+0.058
coat+0.056
lets+0.055
Neg logit contributions
 x-0.050
ore-0.049
 gen-0.046
 operating-0.044
 healer-0.044
Token to
Token ID284
Feature activation-3.47e-01

Pos logit contributions
ore+0.027
 x+0.027
 gen+0.027
year+0.026
 goblin+0.025
Neg logit contributions
 pads-0.036
ety-0.031
coat-0.028
burn-0.027
 tails-0.027
Token play
Token ID711
Feature activation+3.93e-01

Pos logit contributions
 pads+0.016
ety+0.013
 tails+0.012
burn+0.012
coat+0.012
Neg logit contributions
ore-0.020
 x-0.019
 gen-0.018
year-0.018
 goblin-0.017
Token with
Token ID351
Feature activation-6.45e-01

Pos logit contributions
ore+0.014
 gen+0.013
year+0.013
 x+0.013
 goblin+0.012
Neg logit contributions
 pads-0.026
ety-0.022
 tails-0.021
coat-0.021
 Indians-0.021
Token his
Token ID465
Feature activation-4.18e+00

Pos logit contributions
 pads+0.048
ety+0.041
 pad+0.040
coat+0.040
burn+0.039
Neg logit contributions
ore-0.019
 x-0.017
year-0.016
 gen-0.016
 goblin-0.014
Token jets
Token ID20792
Feature activation-1.92e+00

Pos logit contributions
 pads+0.239
coat+0.223
ety+0.219
 pad+0.206
burn+0.201
Neg logit contributions
 old-0.117
 of-0.114
 magical-0.107
 news-0.104
 special-0.104
Token.
Token ID13
Feature activation-9.95e-01

Pos logit contributions
 pads+0.145
ety+0.133
ibi+0.127
burn+0.126
 tails+0.125
Neg logit contributions
ore-0.044
 x-0.032
 of-0.032
 gen-0.028
 operating-0.028
Token He
Token ID679
Feature activation-1.97e+00

Pos logit contributions
 pads+0.072
ety+0.062
 tails+0.062
burn+0.062
 sunglasses+0.060
Neg logit contributions
ore-0.031
 x-0.025
year-0.023
 gen-0.023
 operating-0.022
Token threw
Token ID9617
Feature activation-2.37e+00

Pos logit contributions
 pads+0.120
burn+0.105
 tails+0.103
bit+0.101
ety+0.101
Neg logit contributions
ore-0.065
 x-0.059
 gen-0.058
year-0.057
abel-0.055
Token them
Token ID606
Feature activation-2.47e+00

Pos logit contributions
 pads+0.171
ety+0.157
coat+0.156
burn+0.154
ibi+0.152
Neg logit contributions
ore-0.053
 x-0.053
 of-0.050
year-0.044
 gen-0.044
Token."
Token ID526
Feature activation-9.29e-01

Pos logit contributions
 pads+0.029
ety+0.025
burn+0.023
 Indians+0.023
coat+0.023
Neg logit contributions
ore-0.021
 x-0.018
 gen-0.018
year-0.018
 goblin-0.017
Token
Token ID220
Feature activation-6.38e-01

Pos logit contributions
 pads+0.047
ety+0.038
coat+0.037
burn+0.037
 Indians+0.035
Neg logit contributions
ore-0.049
 gen-0.044
 x-0.043
year-0.041
 healer-0.041
Token
Token ID198
Feature activation-4.57e-01

Pos logit contributions
 pads+0.043
ety+0.036
burn+0.035
 tails+0.035
coat+0.035
Neg logit contributions
ore-0.024
 x-0.020
 gen-0.020
year-0.020
 goblin-0.018
Token
Token ID198
Feature activation-7.26e-01

Pos logit contributions
 pads+0.030
ety+0.026
 tails+0.024
burn+0.024
coat+0.024
Neg logit contributions
ore-0.017
 x-0.015
 gen-0.015
year-0.015
 goblin-0.014
TokenR
Token ID49
Feature activation-1.52e-01

Pos logit contributions
 pads+0.036
ety+0.029
coat+0.027
burn+0.027
 tails+0.026
Neg logit contributions
ore-0.039
 gen-0.036
 x-0.036
year-0.035
 healer-0.034
Tokenib
Token ID571
Feature activation-4.17e+00

Pos logit contributions
 pads+0.007
ety+0.006
 tails+0.005
burn+0.005
 Indians+0.005
Neg logit contributions
ore-0.008
 x-0.008
 gen-0.008
year-0.008
 goblin-0.007
Tokenbit
Token ID2545
Feature activation-1.79e+00

Pos logit contributions
coat+0.145
 Indians+0.142
 pads+0.140
burn+0.133
 sunscreen+0.127
Neg logit contributions
ore-0.214
le-0.206
oc-0.194
 of-0.194
 gen-0.192
Token said
Token ID531
Feature activation-7.63e-01

Pos logit contributions
 pads+0.123
coat+0.105
burn+0.105
ety+0.103
 Indians+0.102
Neg logit contributions
ore-0.055
 gen-0.051
 goblin-0.045
le-0.045
year-0.044
Token,
Token ID11
Feature activation-2.05e+00

Pos logit contributions
 pads+0.051
ety+0.043
coat+0.041
burn+0.040
 Indians+0.040
Neg logit contributions
ore-0.029
 gen-0.027
year-0.026
 x-0.025
 goblin-0.024
Token "
Token ID366
Feature activation-1.78e+00

Pos logit contributions
 pads+0.135
ety+0.117
burn+0.111
coat+0.111
ibi+0.108
Neg logit contributions
ore-0.076
year-0.072
 gen-0.068
 healer-0.062
 x-0.062
TokenWould
Token ID17353
Feature activation+2.37e-01

Pos logit contributions
 pads+0.107
 sunglasses+0.089
 Indians+0.089
coat+0.089
 tails+0.087
Neg logit contributions
ore-0.072
year-0.065
 gen-0.061
 healer-0.057
oc-0.056
Token "
Token ID366
Feature activation-2.12e+00

Pos logit contributions
 pads+0.114
ety+0.100
burn+0.092
 tails+0.089
bur+0.088
Neg logit contributions
ore-0.073
year-0.066
 gen-0.065
 x-0.063
 goblin-0.059
TokenYes
Token ID5297
Feature activation-1.57e+00

Pos logit contributions
 pads+0.131
 tails+0.112
 Indians+0.111
burn+0.110
ety+0.108
Neg logit contributions
ore-0.077
year-0.074
looking-0.065
 gen-0.062
fire-0.061
Token!"
Token ID2474
Feature activation-5.51e-01

Pos logit contributions
 pads+0.118
ety+0.098
 Indians+0.094
 pad+0.092
 tails+0.092
Neg logit contributions
ore-0.049
 x-0.043
 gen-0.042
 of-0.037
year-0.036
Token
Token ID198
Feature activation-4.12e-01

Pos logit contributions
 pads+0.034
ety+0.028
 Indians+0.027
burn+0.027
 tails+0.027
Neg logit contributions
ore-0.023
year-0.021
 x-0.021
 gen-0.020
 goblin-0.019
TokenSo
Token ID2396
Feature activation-1.12e+00

Pos logit contributions
 pads+0.024
ety+0.020
 tails+0.019
 Indians+0.018
burn+0.018
Neg logit contributions
ore-0.019
year-0.017
 x-0.017
 gen-0.017
 goblin-0.016
Token they
Token ID484
Feature activation-4.15e+00

Pos logit contributions
 pads+0.075
ety+0.063
coat+0.062
burn+0.061
 Indians+0.060
Neg logit contributions
ore-0.041
year-0.036
 gen-0.036
 x-0.034
 goblin-0.032
Token skipped
Token ID26684
Feature activation-1.85e+00

Pos logit contributions
 pads+0.216
bur+0.197
burn+0.190
 fault+0.186
 Indians+0.185
Neg logit contributions
year-0.148
 of-0.146
ore-0.144
le-0.139
abel-0.132
Token together
Token ID1978
Feature activation-1.78e+00

Pos logit contributions
 pads+0.126
ety+0.114
 Indians+0.109
burn+0.108
 tails+0.105
Neg logit contributions
ore-0.059
year-0.048
 of-0.043
 operating-0.040
 x-0.039
Token along
Token ID1863
Feature activation-3.34e-01

Pos logit contributions
 pads+0.122
ety+0.114
 Indians+0.109
burn+0.108
bur+0.107
Neg logit contributions
ore-0.057
 x-0.044
year-0.043
 of-0.041
 operating-0.041
Token the
Token ID262
Feature activation-2.18e+00

Pos logit contributions
 pads+0.022
ety+0.019
burn+0.017
 Indians+0.017
coat+0.017
Neg logit contributions
ore-0.013
 x-0.012
year-0.011
 gen-0.011
 goblin-0.010
Token path
Token ID3108
Feature activation-6.84e-01

Pos logit contributions
 pads+0.112
ety+0.101
bur+0.093
burn+0.093
 Indians+0.090
Neg logit contributions
ore-0.095
 x-0.094
 operating-0.087
looking-0.082
 site-0.082
Token with
Token ID351
Feature activation-2.25e+00

Pos logit contributions
 pads+0.063
ety+0.056
burn+0.055
 tails+0.053
 Indians+0.053
Neg logit contributions
ore-0.024
 x-0.020
year-0.019
 gen-0.019
 goblin-0.017
Token his
Token ID465
Feature activation-2.85e+00

Pos logit contributions
 pads+0.172
burn+0.165
coat+0.164
ety+0.164
drops+0.154
Neg logit contributions
ore-0.038
 x-0.038
 of-0.035
year-0.026
looking-0.023
Token dad
Token ID9955
Feature activation-7.84e-01

Pos logit contributions
 pads+0.182
burn+0.171
coat+0.163
ety+0.163
drops+0.160
Neg logit contributions
 x-0.074
ore-0.067
 of-0.058
 old-0.057
--0.057
Token.
Token ID13
Feature activation-1.43e+00

Pos logit contributions
 pads+0.054
ety+0.047
 Indians+0.045
coat+0.045
 tails+0.044
Neg logit contributions
ore-0.026
 x-0.023
year-0.022
 gen-0.021
 healer-0.020
Token They
Token ID1119
Feature activation-4.31e+00

Pos logit contributions
 pads+0.104
ety+0.089
burn+0.087
 tails+0.087
 Indians+0.085
Neg logit contributions
ore-0.043
 x-0.039
 gen-0.034
 operating-0.032
year-0.032
Token would
Token ID561
Feature activation-4.15e+00

Pos logit contributions
burn+0.249
 pads+0.248
coat+0.230
 tails+0.229
 fault+0.228
Neg logit contributions
 of-0.107
year-0.100
 year-0.084
 was-0.080
 vill-0.080
Token go
Token ID467
Feature activation-1.81e+00

Pos logit contributions
 pads+0.244
ety+0.220
burn+0.210
ibi+0.203
ked+0.203
Neg logit contributions
 x-0.121
 deliver-0.110
 of-0.108
 knock-0.098
 cook-0.097
Token to
Token ID284
Feature activation-1.49e+00

Pos logit contributions
 pads+0.118
 tails+0.101
ety+0.100
burn+0.099
 pad+0.096
Neg logit contributions
ore-0.067
 x-0.053
year-0.051
 of-0.048
 healer-0.046
Token the
Token ID262
Feature activation-1.25e+00

Pos logit contributions
 pads+0.112
ety+0.094
coat+0.093
burn+0.092
 tails+0.091
Neg logit contributions
 x-0.042
ore-0.039
 gen-0.033
 operating-0.032
 healer-0.031
Token golf
Token ID13126
Feature activation-1.70e+00

Pos logit contributions
 pads+0.069
ety+0.058
 tails+0.055
burn+0.054
coat+0.054
Neg logit contributions
ore-0.058
 x-0.057
 gen-0.053
 operating-0.051
 healer-0.050
Token course
Token ID1781
Feature activation-1.22e+00

Pos logit contributions
 pads+0.091
ety+0.081
burn+0.081
 tails+0.075
ibi+0.075
Neg logit contributions
ore-0.079
 x-0.068
 gen-0.067
 goblin-0.065
 site-0.065
Token her
Token ID607
Feature activation-2.04e+00

Pos logit contributions
 pads+0.074
 pad+0.059
ety+0.059
 Indians+0.058
coat+0.058
Neg logit contributions
ore-0.039
 x-0.038
 gen-0.036
year-0.035
 goblin-0.032
Token friends
Token ID2460
Feature activation-1.38e+00

Pos logit contributions
 pads+0.133
ety+0.115
burn+0.110
bit+0.106
 pad+0.106
Neg logit contributions
ore-0.065
 x-0.064
 gen-0.062
 goblin-0.059
 chamber-0.054
Token.
Token ID13
Feature activation-1.06e+00

Pos logit contributions
 pads+0.106
ety+0.095
 Indians+0.093
burn+0.092
ibi+0.092
Neg logit contributions
ore-0.034
 x-0.027
year-0.025
 operating-0.024
 gen-0.022
Token They
Token ID1119
Feature activation-3.06e+00

Pos logit contributions
 pads+0.066
burn+0.056
ety+0.056
bur+0.054
 Indians+0.054
Neg logit contributions
ore-0.040
 x-0.037
year-0.035
 operating-0.034
 gen-0.033
Token ran
Token ID4966
Feature activation-1.54e+00

Pos logit contributions
 pads+0.190
burn+0.179
 fault+0.167
bit+0.164
 tails+0.161
Neg logit contributions
year-0.097
 of-0.078
ore-0.074
looking-0.068
 gen-0.068
Token,
Token ID11
Feature activation-4.13e+00

Pos logit contributions
 pads+0.106
burn+0.094
ety+0.094
 Indians+0.089
bur+0.088
Neg logit contributions
ore-0.050
year-0.040
 x-0.037
looking-0.036
 operating-0.034
Token jumped
Token ID11687
Feature activation-1.19e+00

Pos logit contributions
 fault+0.226
burn+0.223
aters+0.220
bit+0.214
lets+0.203
Neg logit contributions
 of-0.136
looking-0.122
ore-0.106
year-0.100
 else-0.092
Token,
Token ID11
Feature activation-3.11e+00

Pos logit contributions
 pads+0.080
ety+0.072
burn+0.068
 Indians+0.066
bur+0.065
Neg logit contributions
ore-0.042
 x-0.034
year-0.033
 gen-0.032
 goblin-0.031
Token and
Token ID290
Feature activation-2.70e+00

Pos logit contributions
 pads+0.191
aters+0.190
 fault+0.189
burn+0.189
ibi+0.187
Neg logit contributions
 of-0.067
ore-0.064
looking-0.056
year-0.051
ce-0.044
Token slid
Token ID27803
Feature activation-3.05e+00

Pos logit contributions
 pads+0.133
aters+0.122
burn+0.120
ety+0.118
bur+0.117
Neg logit contributions
ore-0.108
year-0.107
looking-0.095
 of-0.095
fire-0.091
Token on
Token ID319
Feature activation-1.31e+00

Pos logit contributions
burn+0.216
ety+0.211
 Indians+0.211
 pads+0.211
ige+0.199
Neg logit contributions
ore-0.050
 of-0.043
 with-0.039
--0.036
 site-0.028
Token day
Token ID1110
Feature activation-1.64e+00

Pos logit contributions
 pads+0.035
ety+0.030
 tails+0.027
coat+0.027
 Indians+0.027
Neg logit contributions
ore-0.029
 x-0.026
 gen-0.026
year-0.025
 operating-0.024
Token on
Token ID319
Feature activation-5.76e-01

Pos logit contributions
 pads+0.112
ety+0.102
 tails+0.093
coat+0.093
 Indians+0.090
Neg logit contributions
ore-0.060
year-0.051
 x-0.049
 gen-0.044
 operating-0.044
Token,
Token ID11
Feature activation-1.85e+00

Pos logit contributions
 pads+0.037
ety+0.032
 Indians+0.029
coat+0.029
burn+0.029
Neg logit contributions
ore-0.023
 x-0.020
 gen-0.020
year-0.019
 operating-0.019
Token Kitty
Token ID21168
Feature activation-1.99e+00

Pos logit contributions
 pads+0.107
 pad+0.092
ety+0.089
coat+0.088
 sunglasses+0.086
Neg logit contributions
ore-0.077
 x-0.067
 gen-0.066
 of-0.065
 operating-0.065
Token and
Token ID290
Feature activation-2.97e+00

Pos logit contributions
 pads+0.130
ety+0.114
 tails+0.111
 Indians+0.110
 sunglasses+0.108
Neg logit contributions
ore-0.066
 gen-0.054
 x-0.053
abel-0.052
 healer-0.051
Token Dog
Token ID8532
Feature activation-4.13e+00

Pos logit contributions
 pads+0.161
coat+0.135
 pad+0.131
ety+0.129
 wrists+0.120
Neg logit contributions
 x-0.128
 operating-0.118
 God-0.114
 Doctor-0.111
 healer-0.111
Tokengy
Token ID1360
Feature activation-3.24e+00

Pos logit contributions
 pads+0.199
 sunscreen+0.185
 sunglasses+0.183
 Indians+0.178
 hoops+0.175
Neg logit contributions
ore-0.168
 of-0.146
abel-0.146
le-0.145
oc-0.144
Token loved
Token ID6151
Feature activation-1.11e+00

Pos logit contributions
 pads+0.164
ety+0.141
 Indians+0.137
burn+0.129
bur+0.129
Neg logit contributions
ore-0.135
abel-0.134
old-0.128
 operating-0.125
year-0.125
Token to
Token ID284
Feature activation-8.79e-01

Pos logit contributions
 pads+0.066
ety+0.058
coat+0.052
 pad+0.052
ibi+0.051
Neg logit contributions
ore-0.047
 x-0.044
year-0.041
 operating-0.040
 gen-0.040
Token play
Token ID711
Feature activation+4.70e-01

Pos logit contributions
 pads+0.046
ety+0.039
 pad+0.034
 Indians+0.034
 tails+0.034
Neg logit contributions
ore-0.045
 x-0.042
 gen-0.041
 operating-0.039
year-0.039
Token the
Token ID262
Feature activation-9.91e-01

Pos logit contributions
ore+0.022
year+0.021
 gen+0.021
 x+0.020
 goblin+0.019
Neg logit contributions
 pads-0.026
ety-0.021
 tails-0.021
coat-0.020
 Indians-0.020
Token with
Token ID351
Feature activation-2.03e+00

Pos logit contributions
 pads+0.052
ety+0.046
burn+0.045
 Indians+0.044
 tails+0.044
Neg logit contributions
ore-0.021
 x-0.018
year-0.017
 gen-0.017
 goblin-0.015
Token his
Token ID465
Feature activation-3.07e+00

Pos logit contributions
 pads+0.163
burn+0.154
ety+0.154
coat+0.152
 pad+0.146
Neg logit contributions
 x-0.030
ore-0.030
 of-0.024
year-0.019
looking-0.016
Token dad
Token ID9955
Feature activation-8.32e-01

Pos logit contributions
 pads+0.190
burn+0.184
coat+0.177
ety+0.175
drops+0.171
Neg logit contributions
 x-0.074
ore-0.070
 of-0.065
 old-0.062
 voice-0.060
Token.
Token ID13
Feature activation-1.34e+00

Pos logit contributions
 pads+0.058
ety+0.050
 Indians+0.049
coat+0.048
burn+0.048
Neg logit contributions
ore-0.027
 x-0.024
year-0.022
 gen-0.022
 healer-0.020
Token They
Token ID1119
Feature activation-4.19e+00

Pos logit contributions
 pads+0.096
ety+0.081
burn+0.081
 tails+0.080
 Indians+0.079
Neg logit contributions
ore-0.043
 x-0.037
 gen-0.033
year-0.032
 operating-0.031
Token would
Token ID561
Feature activation-4.11e+00

Pos logit contributions
burn+0.250
 pads+0.245
 fault+0.227
coat+0.224
aters+0.223
Neg logit contributions
 of-0.103
year-0.098
 vill-0.081
 year-0.079
 was-0.077
Token go
Token ID467
Feature activation-1.81e+00

Pos logit contributions
 pads+0.244
ety+0.221
burn+0.214
ibi+0.203
ked+0.200
Neg logit contributions
 x-0.119
 deliver-0.109
 of-0.106
 show-0.097
 repair-0.097
Token to
Token ID284
Feature activation-1.54e+00

Pos logit contributions
 pads+0.118
burn+0.100
 tails+0.100
ety+0.100
 pad+0.096
Neg logit contributions
ore-0.067
 x-0.053
year-0.051
 of-0.048
 healer-0.046
Token the
Token ID262
Feature activation-1.31e+00

Pos logit contributions
 pads+0.116
ety+0.097
coat+0.096
burn+0.095
 tails+0.093
Neg logit contributions
 x-0.043
ore-0.040
 gen-0.034
 operating-0.033
 healer-0.031
Token big
Token ID1263
Feature activation-1.32e+00

Pos logit contributions
 pads+0.071
ety+0.059
burn+0.056
 tails+0.056
coat+0.055
Neg logit contributions
ore-0.062
 x-0.060
 gen-0.057
 goblin-0.054
 healer-0.054
Token green
Token ID4077
Feature activation-1.16e+00

Pos logit contributions
 pads+0.074
ety+0.062
burn+0.060
coat+0.058
 tails+0.058
Neg logit contributions
ore-0.061
 x-0.057
 gen-0.053
 goblin-0.053
 healer-0.051
Token a
Token ID257
Feature activation-2.99e+00

Pos logit contributions
 pads+0.117
ety+0.115
burn+0.111
bur+0.106
coat+0.104
Neg logit contributions
 x-0.077
year-0.071
 gen-0.068
old-0.065
 operating-0.064
Token bell
Token ID8966
Feature activation-1.98e+00

Pos logit contributions
 pads+0.161
ety+0.159
bur+0.140
burn+0.139
 Indians+0.131
Neg logit contributions
 gen-0.121
 x-0.118
ore-0.106
 operating-0.104
 goblin-0.101
Token.
Token ID13
Feature activation-8.76e-01

Pos logit contributions
 pads+0.137
ety+0.122
 Indians+0.122
 tails+0.115
 pad+0.112
Neg logit contributions
ore-0.062
 x-0.054
 gen-0.052
year-0.047
 operating-0.046
Token They
Token ID1119
Feature activation-2.48e+00

Pos logit contributions
 pads+0.053
ety+0.044
burn+0.044
coat+0.042
 Indians+0.042
Neg logit contributions
ore-0.036
 x-0.033
year-0.031
 operating-0.031
 gen-0.030
Token ride
Token ID6594
Feature activation-1.40e+00

Pos logit contributions
 pads+0.141
burn+0.129
ety+0.121
 tails+0.116
 Indians+0.116
Neg logit contributions
 x-0.086
year-0.083
 operating-0.081
ore-0.081
 gen-0.079
Token their
Token ID511
Feature activation-4.11e+00

Pos logit contributions
 pads+0.088
ety+0.076
 Indians+0.072
burn+0.071
coat+0.071
Neg logit contributions
ore-0.055
year-0.050
 x-0.049
 gen-0.045
 operating-0.042
Token bikes
Token ID16715
Feature activation-2.21e+00

Pos logit contributions
ety+0.180
burn+0.170
 pads+0.169
inas+0.160
 Indians+0.160
Neg logit contributions
 x-0.170
 site-0.162
 year-0.158
 old-0.151
 news-0.149
Token on
Token ID319
Feature activation-1.02e+00

Pos logit contributions
 pads+0.150
 Indians+0.134
ety+0.133
 tails+0.130
burn+0.129
Neg logit contributions
ore-0.072
year-0.058
 x-0.056
 of-0.051
 operating-0.049
Token the
Token ID262
Feature activation-2.29e+00

Pos logit contributions
 pads+0.071
ety+0.063
burn+0.058
 Indians+0.058
coat+0.058
Neg logit contributions
ore-0.034
 x-0.032
 gen-0.029
year-0.028
 operating-0.027
Token road
Token ID2975
Feature activation-5.04e-01

Pos logit contributions
 pads+0.123
ety+0.122
burn+0.108
lets+0.108
ked+0.104
Neg logit contributions
 x-0.098
 site-0.085
ore-0.085
 operating-0.084
 gen-0.083
Token near
Token ID1474
Feature activation-8.90e-01

Pos logit contributions
 pads+0.032
ety+0.028
 Indians+0.026
burn+0.026
 tails+0.026
Neg logit contributions
ore-0.020
 x-0.018
year-0.017
 gen-0.017
 operating-0.016
Token,
Token ID11
Feature activation-2.33e+00

Pos logit contributions
 pads+0.178
ety+0.172
 Indians+0.162
burn+0.161
bit+0.159
Neg logit contributions
ore-0.068
 of-0.050
 x-0.048
year-0.047
 operating-0.047
Token too
Token ID1165
Feature activation-1.34e+00

Pos logit contributions
 pads+0.132
ety+0.115
burn+0.112
 tails+0.109
bit+0.107
Neg logit contributions
 x-0.083
ore-0.082
 of-0.077
 gen-0.077
 goblin-0.074
Token.
Token ID13
Feature activation-1.48e+00

Pos logit contributions
 pads+0.098
ety+0.087
 Indians+0.085
bur+0.084
burn+0.083
Neg logit contributions
ore-0.040
year-0.030
 x-0.028
 goblin-0.027
 operating-0.027
Token They
Token ID1119
Feature activation-3.17e+00

Pos logit contributions
 pads+0.097
burn+0.087
ety+0.083
 tails+0.082
 Indians+0.082
Neg logit contributions
ore-0.051
 x-0.043
year-0.042
 operating-0.040
 gen-0.038
Token skipped
Token ID26684
Feature activation-2.17e+00

Pos logit contributions
 pads+0.186
burn+0.159
bit+0.157
 tails+0.156
ety+0.153
Neg logit contributions
year-0.111
abel-0.108
ore-0.107
 of-0.099
 year-0.092
Token and
Token ID290
Feature activation-4.10e+00

Pos logit contributions
 pads+0.151
ety+0.140
burn+0.136
bur+0.133
 Indians+0.132
Neg logit contributions
ore-0.062
year-0.049
 of-0.048
 goblin-0.040
 x-0.039
Token sang
Token ID25889
Feature activation-1.57e+00

Pos logit contributions
bur+0.206
bit+0.204
 fault+0.202
 pads+0.197
burn+0.197
Neg logit contributions
 of-0.129
ore-0.126
year-0.120
 year-0.108
 Annie-0.098
Token and
Token ID290
Feature activation-3.57e+00

Pos logit contributions
 pads+0.097
ety+0.092
burn+0.082
bur+0.082
 Indians+0.080
Neg logit contributions
ore-0.060
year-0.056
 x-0.052
 gen-0.049
 operating-0.048
Token laughed
Token ID13818
Feature activation-1.61e+00

Pos logit contributions
bur+0.183
 pads+0.182
ety+0.179
burn+0.170
 pad+0.157
Neg logit contributions
ore-0.118
year-0.118
 of-0.109
 year-0.099
fire-0.098
Token.
Token ID13
Feature activation-1.23e+00

Pos logit contributions
 pads+0.110
ety+0.102
 Indians+0.094
bur+0.094
burn+0.093
Neg logit contributions
ore-0.052
year-0.046
 x-0.043
 gen-0.042
 deeply-0.040
Token
Token ID198
Feature activation+2.80e-01

Pos logit contributions
 pads+0.079
ety+0.068
burn+0.067
 Indians+0.065
bur+0.064
Neg logit contributions
ore-0.047
year-0.040
 x-0.040
 gen-0.037
 operating-0.037
Token a
Token ID257
Feature activation-2.62e+00

Pos logit contributions
 pads+0.129
ety+0.113
 Indians+0.109
 pad+0.106
burn+0.106
Neg logit contributions
 x-0.070
ore-0.066
year-0.062
 gen-0.059
 of-0.057
Token cloth
Token ID16270
Feature activation-2.27e+00

Pos logit contributions
 pads+0.138
ety+0.130
bur+0.113
burn+0.108
 Indians+0.108
Neg logit contributions
ore-0.115
 gen-0.114
 x-0.112
 goblin-0.100
 operating-0.099
Token.
Token ID13
Feature activation-5.77e-01

Pos logit contributions
 pads+0.161
 Indians+0.146
ety+0.144
 tails+0.136
ibi+0.133
Neg logit contributions
ore-0.066
 gen-0.051
 x-0.051
 of-0.046
 operating-0.043
Token They
Token ID1119
Feature activation-1.56e+00

Pos logit contributions
 pads+0.036
ety+0.030
 Indians+0.029
burn+0.028
coat+0.028
Neg logit contributions
ore-0.024
 x-0.021
year-0.021
 gen-0.021
 operating-0.019
Token want
Token ID765
Feature activation-8.68e-01

Pos logit contributions
 pads+0.091
ety+0.080
burn+0.075
 tails+0.074
 Indians+0.073
Neg logit contributions
ore-0.061
 x-0.061
 gen-0.058
year-0.056
 operating-0.056
Token their
Token ID511
Feature activation-4.10e+00

Pos logit contributions
 pads+0.054
ety+0.046
 Indians+0.043
 pad+0.043
coat+0.042
Neg logit contributions
ore-0.035
 x-0.033
 gen-0.031
year-0.030
 operating-0.030
Token bikes
Token ID16715
Feature activation-2.12e+00

Pos logit contributions
ety+0.191
 pads+0.184
 Indians+0.160
pp+0.158
pper+0.158
Neg logit contributions
 x-0.173
 news-0.166
 old-0.155
 wishes-0.151
 voice-0.151
Token to
Token ID284
Feature activation-1.84e+00

Pos logit contributions
 pads+0.145
 tails+0.127
 Indians+0.125
 pad+0.121
ety+0.120
Neg logit contributions
ore-0.064
 x-0.057
 of-0.054
 gen-0.052
year-0.051
Token look
Token ID804
Feature activation-7.30e-02

Pos logit contributions
 pads+0.111
ety+0.095
 pad+0.087
 tails+0.084
burn+0.084
Neg logit contributions
 x-0.081
ore-0.074
 gen-0.070
 operating-0.069
year-0.064
Token nice
Token ID3621
Feature activation-5.23e-01

Pos logit contributions
 pads+0.004
ety+0.003
 tails+0.003
burn+0.003
coat+0.003
Neg logit contributions
ore-0.003
 x-0.003
 gen-0.003
year-0.003
 goblin-0.003
Token.
Token ID13
Feature activation-7.54e-01

Pos logit contributions
 pads+0.034
ety+0.030
 Indians+0.028
 tails+0.028
burn+0.028
Neg logit contributions
ore-0.019
 x-0.017
year-0.017
 gen-0.017
 goblin-0.015
Token her
Token ID607
Feature activation-1.88e+00

Pos logit contributions
 pads+0.056
ety+0.045
 pad+0.044
coat+0.044
 Indians+0.044
Neg logit contributions
ore-0.034
 x-0.033
 gen-0.031
year-0.030
 goblin-0.028
Token friends
Token ID2460
Feature activation-1.44e+00

Pos logit contributions
 pads+0.121
ety+0.103
burn+0.098
 pad+0.097
coat+0.095
Neg logit contributions
ore-0.066
 gen-0.063
 x-0.062
 goblin-0.059
 healer-0.052
Token.
Token ID13
Feature activation-1.12e+00

Pos logit contributions
 pads+0.112
ety+0.099
 Indians+0.097
ibi+0.096
coat+0.096
Neg logit contributions
ore-0.035
 x-0.026
year-0.025
 operating-0.022
 healer-0.022
Token They
Token ID1119
Feature activation-2.86e+00

Pos logit contributions
 pads+0.071
ety+0.058
 Indians+0.058
burn+0.058
 tails+0.057
Neg logit contributions
ore-0.045
 x-0.039
year-0.038
 operating-0.036
 gen-0.036
Token ran
Token ID4966
Feature activation-1.44e+00

Pos logit contributions
 pads+0.178
burn+0.168
 fault+0.150
bit+0.150
 tails+0.149
Neg logit contributions
year-0.090
ore-0.077
 of-0.075
 gen-0.070
looking-0.067
Token,
Token ID11
Feature activation-4.08e+00

Pos logit contributions
 pads+0.097
ety+0.086
burn+0.086
 Indians+0.081
bur+0.080
Neg logit contributions
ore-0.049
year-0.040
 x-0.037
looking-0.036
 operating-0.035
Token jumped
Token ID11687
Feature activation-1.13e+00

Pos logit contributions
burn+0.225
aters+0.217
 fault+0.215
bit+0.210
 pads+0.201
Neg logit contributions
 of-0.137
looking-0.120
ore-0.111
year-0.100
 else-0.095
Token,
Token ID11
Feature activation-3.09e+00

Pos logit contributions
 pads+0.075
ety+0.067
burn+0.064
 Indians+0.062
bur+0.060
Neg logit contributions
ore-0.041
 x-0.033
year-0.033
 gen-0.032
 goblin-0.031
Token and
Token ID290
Feature activation-2.53e+00

Pos logit contributions
 pads+0.177
burn+0.176
aters+0.176
ibi+0.173
ety+0.171
Neg logit contributions
ore-0.084
 of-0.082
looking-0.069
year-0.067
 lad-0.061
Token slid
Token ID27803
Feature activation-3.01e+00

Pos logit contributions
 pads+0.124
burn+0.110
ety+0.110
aters+0.108
bur+0.107
Neg logit contributions
ore-0.110
year-0.104
looking-0.094
 of-0.093
 goblin-0.093
Token on
Token ID319
Feature activation-1.27e+00

Pos logit contributions
burn+0.220
 pads+0.213
ety+0.213
 Indians+0.212
bur+0.198
Neg logit contributions
ore-0.051
 of-0.040
 with-0.034
--0.030
le-0.026
Token skip
Token ID14267
Feature activation-1.46e+00

Pos logit contributions
 pads+0.125
ety+0.108
 pad+0.101
 Indians+0.098
bur+0.097
Neg logit contributions
ore-0.089
 x-0.086
 of-0.080
 operating-0.078
 gen-0.077
Token just
Token ID655
Feature activation-1.26e+00

Pos logit contributions
 pads+0.097
ety+0.087
burn+0.084
 Indians+0.083
 tails+0.082
Neg logit contributions
ore-0.050
year-0.040
 x-0.040
 gen-0.037
 of-0.037
Token like
Token ID588
Feature activation-1.97e+00

Pos logit contributions
 pads+0.078
ety+0.068
burn+0.065
 Indians+0.065
coat+0.063
Neg logit contributions
ore-0.049
 x-0.043
year-0.041
 operating-0.039
 gen-0.039
Token Tom
Token ID4186
Feature activation-1.58e+00

Pos logit contributions
 pads+0.121
ety+0.107
burn+0.102
coat+0.097
ibi+0.096
Neg logit contributions
 x-0.074
ore-0.072
 gen-0.065
year-0.063
 operating-0.062
Token.
Token ID13
Feature activation-1.75e+00

Pos logit contributions
 pads+0.106
ety+0.095
 Indians+0.093
burn+0.093
 tails+0.091
Neg logit contributions
ore-0.046
 x-0.042
 gen-0.037
year-0.036
 goblin-0.035
Token They
Token ID1119
Feature activation-4.06e+00

Pos logit contributions
 pads+0.113
burn+0.103
 Indians+0.099
ety+0.098
 tails+0.097
Neg logit contributions
ore-0.063
old-0.048
year-0.048
 x-0.046
looking-0.043
Token both
Token ID1111
Feature activation-3.60e+00

Pos logit contributions
 pads+0.207
burn+0.194
bur+0.187
 Indians+0.178
 fault+0.174
Neg logit contributions
year-0.133
abel-0.132
 of-0.131
le-0.130
ore-0.127
Token laughed
Token ID13818
Feature activation-1.72e+00

Pos logit contributions
 pads+0.191
burn+0.185
bur+0.185
 Indians+0.179
ety+0.162
Neg logit contributions
year-0.131
ore-0.127
abel-0.120
 of-0.116
le-0.116
Token and
Token ID290
Feature activation-3.65e+00

Pos logit contributions
 pads+0.125
ety+0.118
bur+0.112
 Indians+0.111
burn+0.110
Neg logit contributions
ore-0.041
year-0.037
 of-0.033
 deeply-0.033
 x-0.032
Token had
Token ID550
Feature activation-2.29e+00

Pos logit contributions
 pads+0.179
bur+0.171
ety+0.169
burn+0.162
 pad+0.159
Neg logit contributions
ore-0.111
year-0.109
 of-0.101
 deeply-0.095
le-0.095
Token lots
Token ID6041
Feature activation+1.72e-01

Pos logit contributions
 pads+0.147
ety+0.143
burn+0.141
bur+0.135
flower+0.129
Neg logit contributions
 x-0.053
year-0.052
abel-0.051
ore-0.049
 of-0.047
Token and
Token ID290
Feature activation-2.13e+00

Pos logit contributions
 pads+0.132
 Indians+0.121
 tails+0.116
bit+0.115
burn+0.112
Neg logit contributions
ore-0.125
abel-0.115
year-0.113
old-0.111
 operating-0.108
Token boy
Token ID2933
Feature activation-2.44e+00

Pos logit contributions
 pads+0.135
bit+0.111
 tails+0.110
aters+0.108
burn+0.108
Neg logit contributions
ore-0.066
year-0.066
 gen-0.066
abel-0.061
old-0.061
Token swung
Token ID28507
Feature activation-2.34e+00

Pos logit contributions
 pads+0.124
 Indians+0.110
 tails+0.102
ety+0.099
burn+0.097
Neg logit contributions
ore-0.115
old-0.103
abel-0.103
year-0.101
 operating-0.099
Token together
Token ID1978
Feature activation-1.52e+00

Pos logit contributions
 pads+0.158
 Indians+0.154
ety+0.149
burn+0.145
coat+0.136
Neg logit contributions
ore-0.056
year-0.055
 of-0.052
 gen-0.042
 x-0.041
Token.
Token ID13
Feature activation-1.54e+00

Pos logit contributions
 pads+0.110
ety+0.101
 Indians+0.100
burn+0.096
ibi+0.093
Neg logit contributions
ore-0.046
year-0.036
 x-0.033
 gen-0.032
 ingredient-0.030
Token They
Token ID1119
Feature activation-4.05e+00

Pos logit contributions
 pads+0.101
ety+0.086
burn+0.086
 sunglasses+0.085
 tails+0.085
Neg logit contributions
ore-0.056
 x-0.044
year-0.043
 operating-0.043
 gen-0.039
Token laughed
Token ID13818
Feature activation-1.64e+00

Pos logit contributions
 pads+0.229
burn+0.210
aters+0.209
 Indians+0.208
bit+0.206
Neg logit contributions
 of-0.124
abel-0.121
year-0.118
 year-0.105
le-0.105
Token and
Token ID290
Feature activation-3.94e+00

Pos logit contributions
 pads+0.118
ety+0.111
 Indians+0.102
burn+0.102
bur+0.101
Neg logit contributions
ore-0.042
year-0.038
 x-0.035
 gen-0.035
 deeply-0.034
Token smiled
Token ID13541
Feature activation-1.37e+00

Pos logit contributions
 pads+0.228
ety+0.207
burn+0.199
aters+0.198
 fault+0.198
Neg logit contributions
year-0.103
ore-0.090
 of-0.090
 Annie-0.083
 deeply-0.081
Token and
Token ID290
Feature activation-3.18e+00

Pos logit contributions
 pads+0.090
ety+0.080
 Indians+0.077
coat+0.074
burn+0.073
Neg logit contributions
ore-0.049
year-0.044
 gen-0.042
 x-0.040
 operating-0.039
Token it
Token ID340
Feature activation-9.46e-01

Pos logit contributions
 pads+0.185
ety+0.164
 pad+0.156
coat+0.153
bur+0.153
Neg logit contributions
year-0.102
ore-0.092
 gen-0.088
 of-0.084
abel-0.082
Token They
Token ID1119
Feature activation-2.90e+00

Pos logit contributions
 pads+0.110
burn+0.097
 Indians+0.095
ety+0.094
coat+0.093
Neg logit contributions
ore-0.066
 x-0.057
year-0.051
 operating-0.048
old-0.048
Token share
Token ID2648
Feature activation-1.66e+00

Pos logit contributions
 pads+0.170
burn+0.161
ety+0.149
ibi+0.144
 tails+0.142
Neg logit contributions
 x-0.088
ore-0.086
 of-0.080
 operating-0.078
old-0.078
Token the
Token ID262
Feature activation-3.22e+00

Pos logit contributions
 pads+0.124
coat+0.113
ety+0.112
 Indians+0.110
burn+0.109
Neg logit contributions
ore-0.038
 x-0.032
year-0.031
 gen-0.031
 of-0.030
Token ball
Token ID2613
Feature activation+2.85e-01

Pos logit contributions
 pads+0.138
ety+0.136
burn+0.126
ked+0.117
bur+0.117
Neg logit contributions
 x-0.151
 gen-0.135
 news-0.134
 noise-0.134
ore-0.134
Token and
Token ID290
Feature activation-2.58e+00

Pos logit contributions
ore+0.011
 x+0.010
 gen+0.010
year+0.010
 goblin+0.010
Neg logit contributions
 pads-0.018
ety-0.015
 tails-0.015
coat-0.014
burn-0.014
Token the
Token ID262
Feature activation-4.04e+00

Pos logit contributions
 pads+0.143
ety+0.141
burn+0.134
coat+0.130
bur+0.128
Neg logit contributions
 x-0.072
ore-0.069
 of-0.067
 gen-0.067
 sells-0.067
Token bike
Token ID7161
Feature activation-5.58e-01

Pos logit contributions
ety+0.185
 pads+0.183
burn+0.181
coat+0.164
ked+0.164
Neg logit contributions
 x-0.159
 noise-0.152
 wishes-0.145
 villagers-0.145
 gen-0.144
Token.
Token ID13
Feature activation-1.94e+00

Pos logit contributions
 pads+0.038
ety+0.033
 Indians+0.031
burn+0.031
 tails+0.031
Neg logit contributions
ore-0.019
 x-0.017
year-0.017
 gen-0.017
 operating-0.015
Token They
Token ID1119
Feature activation-3.02e+00

Pos logit contributions
 pads+0.132
burn+0.118
ety+0.116
coat+0.116
 Indians+0.116
Neg logit contributions
ore-0.063
 x-0.049
year-0.042
 operating-0.041
 of-0.041
Token take
Token ID1011
Feature activation-1.13e+00

Pos logit contributions
 pads+0.181
burn+0.168
ety+0.156
coat+0.150
ibi+0.149
Neg logit contributions
 x-0.089
ore-0.085
 gen-0.083
 operating-0.080
 healer-0.079
Token turns
Token ID4962
Feature activation-1.84e+00

Pos logit contributions
 pads+0.076
ety+0.069
 Indians+0.065
coat+0.065
burn+0.065
Neg logit contributions
ore-0.039
 x-0.034
year-0.032
 gen-0.030
old-0.028
Token,
Token ID11
Feature activation+4.02e-01

Pos logit contributions
ore+0.005
 x+0.004
 gen+0.004
year+0.004
 goblin+0.004
Neg logit contributions
 pads-0.007
ety-0.006
 tails-0.006
coat-0.006
burn-0.006
Token she
Token ID673
Feature activation+3.90e-01

Pos logit contributions
ore+0.019
 x+0.017
 gen+0.017
year+0.017
 goblin+0.016
Neg logit contributions
 pads-0.022
ety-0.019
 tails-0.017
coat-0.017
burn-0.017
Token stumbled
Token ID24241
Feature activation-1.50e+00

Pos logit contributions
ore+0.021
 x+0.019
year+0.019
 gen+0.019
 healer+0.018
Neg logit contributions
 pads-0.020
ety-0.016
 pad-0.015
 tails-0.014
coat-0.014
Token upon
Token ID2402
Feature activation+7.89e-01

Pos logit contributions
 pads+0.091
 tails+0.080
ety+0.080
 Indians+0.077
burn+0.076
Neg logit contributions
ore-0.062
year-0.054
 gen-0.050
 goblin-0.050
 x-0.049
Token a
Token ID257
Feature activation+9.71e-01

Pos logit contributions
ore+0.034
 gen+0.030
year+0.030
 x+0.030
 healer+0.028
Neg logit contributions
 pads-0.047
ety-0.039
 tails-0.037
coat-0.037
burn-0.036
Token cave
Token ID11527
Feature activation+2.33e+00

Pos logit contributions
ore+0.044
year+0.043
 x+0.038
 gen+0.037
old+0.037
Neg logit contributions
 pads-0.054
ety-0.045
 pad-0.044
 Indians-0.043
 sunglasses-0.042
Token.
Token ID13
Feature activation+4.74e-01

Pos logit contributions
 x+0.119
ore+0.118
year+0.115
 gen+0.111
old+0.110
Neg logit contributions
 pads-0.106
coat-0.082
ety-0.082
 pad-0.078
 swing-0.073
Token
Token ID220
Feature activation+2.98e-01

Pos logit contributions
ore+0.021
 gen+0.020
 x+0.019
year+0.019
 goblin+0.018
Neg logit contributions
 pads-0.028
ety-0.023
 pad-0.021
coat-0.021
 tails-0.021
Token
Token ID198
Feature activation+5.91e-01

Pos logit contributions
ore+0.012
 gen+0.012
 x+0.012
year+0.011
 goblin+0.011
Neg logit contributions
 pads-0.018
ety-0.015
coat-0.014
 tails-0.014
burn-0.014
Token
Token ID198
Feature activation+7.84e-02

Pos logit contributions
ore+0.026
 x+0.025
 gen+0.025
year+0.024
 goblin+0.023
Neg logit contributions
 pads-0.034
ety-0.028
coat-0.026
 pad-0.026
ibi-0.025
Token"
Token ID1
Feature activation-1.15e+00

Pos logit contributions
ore+0.004
 x+0.003
 gen+0.003
year+0.003
 goblin+0.003
Neg logit contributions
 pads-0.004
ety-0.004
 tails-0.003
coat-0.003
burn-0.003
Token.
Token ID13
Feature activation+4.27e-01

Pos logit contributions
ore+0.026
 x+0.025
 gen+0.025
year+0.024
 goblin+0.023
Neg logit contributions
 pads-0.037
ety-0.029
 tails-0.029
coat-0.029
 sunglasses-0.028
Token
Token ID220
Feature activation-2.67e-01

Pos logit contributions
ore+0.019
 x+0.019
 gen+0.018
year+0.018
 goblin+0.017
Neg logit contributions
 pads-0.024
ety-0.020
 tails-0.018
burn-0.018
 pad-0.018
Token
Token ID198
Feature activation+9.72e-01

Pos logit contributions
 pads+0.017
ety+0.015
 tails+0.014
 Indians+0.014
burn+0.014
Neg logit contributions
ore-0.010
 x-0.009
 gen-0.009
year-0.009
 goblin-0.008
Token
Token ID198
Feature activation+1.07e+00

Pos logit contributions
 x+0.046
ore+0.045
 goblin+0.043
 gen+0.042
 healer+0.041
Neg logit contributions
 pads-0.051
ety-0.043
ibi-0.039
 tails-0.038
burn-0.038
TokenThe
Token ID464
Feature activation+3.65e-01

Pos logit contributions
ore+0.055
 x+0.055
 goblin+0.052
 gen+0.049
 healer+0.047
Neg logit contributions
 pads-0.051
ety-0.043
burn-0.039
ibi-0.039
 tails-0.037
Token End
Token ID5268
Feature activation+2.35e+00

Pos logit contributions
ore+0.017
 gen+0.015
year+0.015
 x+0.015
 operating+0.014
Neg logit contributions
 pads-0.021
ety-0.017
 sunglasses-0.016
 Indians-0.016
 pad-0.016
Token.
Token ID13
Feature activation+5.58e-01

Pos logit contributions
 x+0.122
 gen+0.118
 goblin+0.115
 healer+0.113
ore+0.112
Neg logit contributions
 pads-0.104
ibi-0.077
ety-0.075
 tails-0.074
burn-0.072
Token<|endoftext|>
Token ID50256
Feature activation+7.40e-01

Pos logit contributions
ore+0.024
 x+0.024
 gen+0.022
year+0.022
 goblin+0.022
Neg logit contributions
 pads-0.032
ety-0.027
burn-0.025
 tails-0.024
 pad-0.024
TokenOnce
Token ID7454
Feature activation-2.43e-01

Pos logit contributions
ore+0.028
 x+0.027
 gen+0.027
year+0.027
 goblin+0.024
Neg logit contributions
 pads-0.047
ety-0.039
coat-0.036
 pad-0.036
burn-0.036
Token upon
Token ID2402
Feature activation+9.32e-01

Pos logit contributions
 pads+0.012
ety+0.010
 tails+0.009
burn+0.009
 Indians+0.009
Neg logit contributions
ore-0.013
 x-0.012
 gen-0.012
year-0.012
 goblin-0.011
Token a
Token ID257
Feature activation-2.07e-01

Pos logit contributions
ore+0.044
 gen+0.038
 x+0.038
year+0.037
 goblin+0.037
Neg logit contributions
 pads-0.054
ety-0.044
 tails-0.041
burn-0.041
coat-0.041
Token a
Token ID257
Feature activation+9.86e-01

Pos logit contributions
ore+0.053
year+0.048
 gen+0.048
 goblin+0.045
 x+0.045
Neg logit contributions
 pads-0.052
ety-0.044
 sunglasses-0.041
 tails-0.041
 Indians-0.039
Token kind
Token ID1611
Feature activation+1.36e+00

Pos logit contributions
ore+0.045
year+0.042
 x+0.040
 gen+0.038
abel+0.037
Neg logit contributions
 pads-0.054
ety-0.047
 Indians-0.044
 pad-0.044
coat-0.042
Token man
Token ID582
Feature activation+1.34e+00

Pos logit contributions
ore+0.061
year+0.059
 x+0.056
 gen+0.053
 operating+0.051
Neg logit contributions
 pads-0.073
ety-0.064
 pad-0.059
 sunglasses-0.058
coat-0.058
Token.
Token ID13
Feature activation+1.53e+00

Pos logit contributions
ore+0.059
 gen+0.058
year+0.057
 x+0.056
 goblin+0.054
Neg logit contributions
 pads-0.078
ety-0.064
 pad-0.060
burn-0.057
 tails-0.056
Token He
Token ID679
Feature activation+1.63e+00

Pos logit contributions
 gen+0.078
ore+0.077
 x+0.076
year+0.074
le+0.073
Neg logit contributions
 pads-0.072
ety-0.062
 pad-0.053
coat-0.052
 Indians-0.052
Token helped
Token ID4193
Feature activation+2.10e+00

Pos logit contributions
ore+0.095
 x+0.085
year+0.081
 goblin+0.080
 gen+0.080
Neg logit contributions
 pads-0.072
ety-0.064
 swung-0.054
 Indians-0.053
coat-0.053
Token her
Token ID607
Feature activation+8.64e-01

Pos logit contributions
ore+0.116
year+0.113
 gen+0.108
 goblin+0.106
 operating+0.102
Neg logit contributions
 pads-0.082
ety-0.077
 dads-0.065
 tails-0.064
burn-0.064
Token pick
Token ID2298
Feature activation+1.19e+00

Pos logit contributions
ore+0.043
year+0.043
 gen+0.039
 x+0.039
 goblin+0.039
Neg logit contributions
 pads-0.045
ety-0.037
burn-0.035
coat-0.033
 tails-0.033
Token up
Token ID510
Feature activation+9.70e-01

Pos logit contributions
ore+0.055
year+0.052
 gen+0.050
 x+0.049
 operating+0.048
Neg logit contributions
 pads-0.067
ety-0.055
 sunglasses-0.053
burn-0.051
 tails-0.050
Token the
Token ID262
Feature activation-7.57e-02

Pos logit contributions
ore+0.049
year+0.044
 gen+0.043
 x+0.043
 goblin+0.041
Neg logit contributions
 pads-0.052
ety-0.043
 tails-0.041
 sunglasses-0.041
burn-0.039
Token heavy
Token ID4334
Feature activation+9.01e-01

Pos logit contributions
 pads+0.004
ety+0.004
 tails+0.003
burn+0.003
coat+0.003
Neg logit contributions
ore-0.004
 x-0.003
year-0.003
 gen-0.003
 goblin-0.003
Token the
Token ID262
Feature activation+1.43e+00

Pos logit contributions
ore+0.063
year+0.059
 gen+0.058
 x+0.056
 goblin+0.054
Neg logit contributions
 pads-0.053
ety-0.046
burn-0.042
bur-0.042
 tails-0.042
Token top
Token ID1353
Feature activation+1.82e+00

Pos logit contributions
year+0.079
ore+0.078
old+0.070
 x+0.068
 gen+0.068
Neg logit contributions
 pads-0.063
 pad-0.049
ety-0.049
 Indians-0.047
 sunglasses-0.045
Token of
Token ID286
Feature activation+2.85e-01

Pos logit contributions
year+0.057
ore+0.055
 x+0.054
 gen+0.053
 healer+0.049
Neg logit contributions
 pads-0.129
ety-0.104
 pad-0.103
coat-0.099
 tails-0.098
Token the
Token ID262
Feature activation+1.31e+00

Pos logit contributions
ore+0.012
year+0.011
 gen+0.011
 x+0.011
 goblin+0.010
Neg logit contributions
 pads-0.017
ety-0.014
 tails-0.013
burn-0.013
 Indians-0.013
Token mountain
Token ID8598
Feature activation+1.30e+00

Pos logit contributions
year+0.076
ore+0.071
old+0.068
 x+0.067
 gen+0.065
Neg logit contributions
 pads-0.056
ety-0.043
 Indians-0.043
 pad-0.042
coat-0.039
Token.
Token ID13
Feature activation+2.37e+00

Pos logit contributions
ore+0.056
 x+0.054
year+0.053
 gen+0.052
 goblin+0.049
Neg logit contributions
 pads-0.077
ety-0.061
 pad-0.059
 tails-0.058
coat-0.058
Token He
Token ID679
Feature activation+1.28e+00

Pos logit contributions
year+0.118
ore+0.117
 x+0.117
le+0.113
 gen+0.113
Neg logit contributions
 pads-0.105
ety-0.084
 pad-0.078
 tails-0.076
 Indians-0.073
Token was
Token ID373
Feature activation+1.02e+00

Pos logit contributions
ore+0.066
 x+0.059
year+0.057
 gen+0.054
 operating+0.054
Neg logit contributions
 pads-0.066
ety-0.056
 swung-0.050
 tails-0.050
 Indians-0.050
Token glaring
Token ID29097
Feature activation+1.17e+00

Pos logit contributions
ore+0.051
year+0.044
 gen+0.044
 x+0.044
 goblin+0.043
Neg logit contributions
 pads-0.053
ety-0.043
 tails-0.041
coat-0.040
burn-0.040
Token down
Token ID866
Feature activation+1.04e+00

Pos logit contributions
 gen+0.060
ore+0.059
 x+0.057
year+0.057
fire+0.055
Neg logit contributions
 pads-0.061
ety-0.047
 tails-0.046
 sunglasses-0.046
 pad-0.046
Token at
Token ID379
Feature activation+4.52e-01

Pos logit contributions
ore+0.050
 x+0.049
 gen+0.048
year+0.045
 goblin+0.044
Neg logit contributions
 pads-0.054
ety-0.044
 tails-0.042
 sunglasses-0.041
 pad-0.040
Token"
Token ID1
Feature activation-2.01e-01

Pos logit contributions
ore+0.030
 x+0.029
 gen+0.028
year+0.027
 goblin+0.026
Neg logit contributions
 pads-0.032
ety-0.027
 Indians-0.024
burn-0.024
 tails-0.024
TokenI
Token ID40
Feature activation+1.72e+00

Pos logit contributions
 pads+0.011
ety+0.009
 tails+0.008
burn+0.008
 Indians+0.008
Neg logit contributions
ore-0.010
 x-0.009
year-0.009
 gen-0.009
 goblin-0.009
Token'm
Token ID1101
Feature activation+1.60e+00

Pos logit contributions
 x+0.097
 gen+0.096
year+0.094
ore+0.093
 goblin+0.091
Neg logit contributions
 pads-0.068
ety-0.055
 pad-0.049
coat-0.049
burn-0.048
Token making
Token ID1642
Feature activation+2.41e+00

Pos logit contributions
ore+0.080
 gen+0.076
year+0.074
 goblin+0.072
 x+0.072
Neg logit contributions
 pads-0.073
 tails-0.058
 sunglasses-0.058
ety-0.057
burn-0.057
Token a
Token ID257
Feature activation+2.33e+00

Pos logit contributions
year+0.112
ore+0.107
 gen+0.100
 goblin+0.093
 operating+0.092
Neg logit contributions
 pads-0.124
 sunglasses-0.109
 sunscreen-0.107
 tails-0.105
burn-0.098
Token delicious
Token ID12625
Feature activation+2.60e+00

Pos logit contributions
year+0.113
ore+0.104
old+0.097
 gen+0.088
ment+0.088
Neg logit contributions
 pads-0.115
 sunglasses-0.098
 tails-0.096
burn-0.096
 pad-0.095
Token mac
Token ID8352
Feature activation+1.15e+00

Pos logit contributions
year+0.136
old+0.113
ore+0.112
ibrarian+0.107
 operating+0.106
Neg logit contributions
 pads-0.121
burn-0.109
 sunglasses-0.106
 tails-0.104
 sunscreen-0.102
Tokenaron
Token ID8045
Feature activation+7.14e-02

Pos logit contributions
ore+0.048
year+0.048
 x+0.044
 gen+0.042
 operating+0.042
Neg logit contributions
 pads-0.070
ety-0.056
 pad-0.055
 tails-0.054
burn-0.054
Tokeni
Token ID72
Feature activation+1.57e+00

Pos logit contributions
ore+0.004
 x+0.004
year+0.004
 gen+0.004
 goblin+0.004
Neg logit contributions
 pads-0.003
ety-0.003
burn-0.002
 tails-0.002
coat-0.002
Token and
Token ID290
Feature activation+4.17e-01

Pos logit contributions
ore+0.064
year+0.063
 x+0.060
 gen+0.059
 operating+0.057
Neg logit contributions
 pads-0.098
ety-0.079
 pad-0.076
 tails-0.075
burn-0.074
Token cheese
Token ID9891
Feature activation+1.08e+00

Pos logit contributions
ore+0.019
year+0.017
 x+0.017
 gen+0.016
 operating+0.016
Neg logit contributions
 pads-0.025
ety-0.020
 tails-0.019
 Indians-0.019
burn-0.019
Token day
Token ID1110
Feature activation-7.79e-01

Pos logit contributions
ore+0.020
 gen+0.019
 x+0.019
year+0.019
 goblin+0.018
Neg logit contributions
 pads-0.017
ety-0.014
burn-0.012
coat-0.012
 tails-0.012
Token,
Token ID11
Feature activation-1.01e+00

Pos logit contributions
 pads+0.052
ety+0.044
 Indians+0.042
 tails+0.041
burn+0.041
Neg logit contributions
ore-0.030
 x-0.026
 gen-0.025
year-0.025
 operating-0.023
Token she
Token ID673
Feature activation-1.66e-01

Pos logit contributions
 pads+0.071
ety+0.058
 pad+0.056
 Indians+0.056
burn+0.055
Neg logit contributions
ore-0.035
 x-0.032
 gen-0.031
year-0.030
 goblin-0.028
Token went
Token ID1816
Feature activation+9.41e-01

Pos logit contributions
 pads+0.009
ety+0.007
 tails+0.007
burn+0.007
 Indians+0.007
Neg logit contributions
ore-0.008
 x-0.007
 gen-0.007
year-0.007
 goblin-0.007
Token to
Token ID284
Feature activation+3.76e-01

Pos logit contributions
 gen+0.051
year+0.050
ore+0.049
 x+0.047
 goblin+0.045
Neg logit contributions
 pads-0.045
ety-0.035
coat-0.034
burn-0.034
 Indians-0.033
Token a
Token ID257
Feature activation+1.87e+00

Pos logit contributions
ore+0.018
year+0.017
 gen+0.016
 x+0.015
 goblin+0.015
Neg logit contributions
 pads-0.021
ety-0.018
 Indians-0.017
burn-0.017
 tails-0.016
Token big
Token ID1263
Feature activation+1.23e+00

Pos logit contributions
year+0.097
ore+0.094
ment+0.087
 gen+0.085
le+0.081
Neg logit contributions
 pads-0.085
ety-0.076
 Indians-0.072
coat-0.069
 sunglasses-0.069
Token stadium
Token ID10308
Feature activation+1.39e+00

Pos logit contributions
year+0.064
ore+0.061
 gen+0.057
 x+0.053
 goblin+0.052
Neg logit contributions
 pads-0.060
ety-0.051
 sunglasses-0.048
 Indians-0.048
coat-0.048
Token with
Token ID351
Feature activation-7.81e-01

Pos logit contributions
 x+0.067
 gen+0.067
year+0.064
ore+0.064
 goblin+0.062
Neg logit contributions
 pads-0.072
ety-0.060
coat-0.057
 pad-0.054
 tails-0.050
Token her
Token ID607
Feature activation-1.38e+00

Pos logit contributions
 pads+0.049
ety+0.043
coat+0.040
burn+0.040
 pad+0.039
Neg logit contributions
ore-0.031
year-0.029
 x-0.028
 gen-0.028
 goblin-0.025
Token mom
Token ID1995
Feature activation-1.17e+00

Pos logit contributions
 pads+0.083
ety+0.074
burn+0.067
coat+0.067
 Indians+0.064
Neg logit contributions
ore-0.058
 x-0.053
 gen-0.052
 goblin-0.048
 healer-0.046
Token,
Token ID11
Feature activation-1.68e-01

Pos logit contributions
ore+0.001
year+0.001
 x+0.001
 gen+0.001
 goblin+0.001
Neg logit contributions
 pads-0.002
ety-0.001
 tails-0.001
burn-0.001
coat-0.001
Token Lily
Token ID20037
Feature activation-5.05e-01

Pos logit contributions
 pads+0.009
ety+0.008
 tails+0.007
burn+0.007
 Indians+0.007
Neg logit contributions
ore-0.008
 x-0.008
 gen-0.007
year-0.007
 goblin-0.007
Token always
Token ID1464
Feature activation-2.70e-01

Pos logit contributions
 pads+0.027
ety+0.022
 tails+0.020
coat+0.020
 Indians+0.020
Neg logit contributions
ore-0.025
 gen-0.023
 x-0.023
year-0.023
 goblin-0.022
Token remembered
Token ID12086
Feature activation-3.57e-01

Pos logit contributions
 pads+0.013
ety+0.011
 tails+0.010
coat+0.010
burn+0.010
Neg logit contributions
ore-0.015
 x-0.014
 gen-0.014
year-0.014
 goblin-0.013
Token the
Token ID262
Feature activation+6.02e-01

Pos logit contributions
 pads+0.021
ety+0.018
 tails+0.016
coat+0.016
 Indians+0.016
Neg logit contributions
ore-0.016
 x-0.015
 gen-0.014
year-0.014
 goblin-0.013
Token huge
Token ID3236
Feature activation+8.14e-01

Pos logit contributions
ore+0.026
year+0.024
 gen+0.022
 x+0.021
 operating+0.021
Neg logit contributions
 pads-0.036
ety-0.030
 tails-0.029
 Indians-0.028
 pad-0.028
Token mushroom
Token ID28520
Feature activation+1.42e+00

Pos logit contributions
ore+0.037
year+0.035
 gen+0.031
 x+0.030
abel+0.029
Neg logit contributions
 pads-0.047
 tails-0.039
ety-0.039
 pad-0.037
 sunglasses-0.036
Token and
Token ID290
Feature activation+3.14e-01

Pos logit contributions
ore+0.069
 x+0.066
year+0.065
 gen+0.064
 operating+0.058
Neg logit contributions
 pads-0.078
ety-0.061
 pad-0.059
 tails-0.056
coat-0.055
Token the
Token ID262
Feature activation-2.22e-01

Pos logit contributions
ore+0.017
 x+0.015
 gen+0.015
year+0.015
 goblin+0.014
Neg logit contributions
 pads-0.016
ety-0.013
 tails-0.012
 Indians-0.012
burn-0.012
Token little
Token ID1310
Feature activation+4.85e-01

Pos logit contributions
 pads+0.014
ety+0.012
 tails+0.011
burn+0.011
coat+0.011
Neg logit contributions
ore-0.009
 x-0.008
 gen-0.008
year-0.008
 goblin-0.008
Token mouse
Token ID10211
Feature activation+7.94e-01

Pos logit contributions
ore+0.020
year+0.018
 x+0.017
 gen+0.017
 operating+0.016
Neg logit contributions
 pads-0.030
ety-0.026
 tails-0.025
 pad-0.024
burn-0.024
Token,
Token ID11
Feature activation-1.36e+00

Pos logit contributions
ore+0.025
year+0.024
 gen+0.024
 x+0.024
 goblin+0.022
Neg logit contributions
 pads-0.026
ety-0.022
 pad-0.020
 tails-0.020
burn-0.020
Token but
Token ID475
Feature activation-9.91e-02

Pos logit contributions
 pads+0.077
ety+0.062
 Indians+0.061
burn+0.060
 tails+0.058
Neg logit contributions
ore-0.060
 x-0.056
 gen-0.056
year-0.054
 operating-0.052
Token then
Token ID788
Feature activation-1.26e-01

Pos logit contributions
 pads+0.006
ety+0.005
 tails+0.004
burn+0.004
 Indians+0.004
Neg logit contributions
ore-0.005
 x-0.004
 gen-0.004
year-0.004
 goblin-0.004
Token she
Token ID673
Feature activation-5.94e-01

Pos logit contributions
 pads+0.008
ety+0.006
 tails+0.006
burn+0.006
 Indians+0.006
Neg logit contributions
ore-0.006
 x-0.005
 gen-0.005
year-0.005
 goblin-0.005
Token had
Token ID550
Feature activation-3.51e-01

Pos logit contributions
 pads+0.029
ety+0.024
burn+0.023
 tails+0.022
coat+0.022
Neg logit contributions
ore-0.030
 x-0.029
 gen-0.029
year-0.029
 goblin-0.027
Token an
Token ID281
Feature activation+1.23e+00

Pos logit contributions
 pads+0.024
ety+0.020
 tails+0.019
burn+0.019
 Indians+0.019
Neg logit contributions
ore-0.012
 x-0.011
 gen-0.011
year-0.011
 goblin-0.010
Token idea
Token ID2126
Feature activation+6.41e-01

Pos logit contributions
ore+0.059
 gen+0.051
fire+0.050
year+0.049
 goblin+0.046
Neg logit contributions
 pads-0.069
ety-0.058
 sunglasses-0.057
 tails-0.054
 pad-0.054
Token.
Token ID13
Feature activation-1.01e+00

Pos logit contributions
ore+0.028
 gen+0.026
 x+0.025
year+0.025
 healer+0.024
Neg logit contributions
 pads-0.037
ety-0.031
coat-0.029
 pad-0.029
 tails-0.029
Token She
Token ID1375
Feature activation+1.82e-01

Pos logit contributions
 pads+0.066
 tails+0.057
burn+0.056
ety+0.056
 Indians+0.055
Neg logit contributions
ore-0.037
 x-0.031
year-0.030
 gen-0.029
 operating-0.028
Token picked
Token ID6497
Feature activation-2.73e-01

Pos logit contributions
ore+0.010
 x+0.009
 gen+0.009
year+0.009
 goblin+0.009
Neg logit contributions
 pads-0.009
ety-0.007
 tails-0.007
 Indians-0.007
coat-0.007
Token up
Token ID510
Feature activation-7.15e-01

Pos logit contributions
 pads+0.018
ety+0.016
 tails+0.015
coat+0.015
 Indians+0.015
Neg logit contributions
ore-0.010
 x-0.009
year-0.009
 gen-0.009
 goblin-0.008
Token hands
Token ID2832
Feature activation-1.22e+00

Pos logit contributions
ore+0.002
year+0.002
 x+0.002
 gen+0.002
 goblin+0.002
Neg logit contributions
 pads-0.002
ety-0.001
 tails-0.001
coat-0.001
burn-0.001
Token,
Token ID11
Feature activation-7.44e-01

Pos logit contributions
 pads+0.079
ety+0.069
 Indians+0.067
 tails+0.064
burn+0.064
Neg logit contributions
ore-0.045
 x-0.039
 gen-0.039
year-0.038
 operating-0.036
Token so
Token ID523
Feature activation+4.80e-01

Pos logit contributions
 pads+0.042
ety+0.036
 Indians+0.034
burn+0.033
coat+0.033
Neg logit contributions
ore-0.033
 x-0.030
 gen-0.030
year-0.030
 goblin-0.028
Token she
Token ID673
Feature activation+1.01e-02

Pos logit contributions
ore+0.023
 x+0.022
 gen+0.021
year+0.021
 operating+0.019
Neg logit contributions
 pads-0.027
ety-0.022
 tails-0.021
 sunglasses-0.020
burn-0.020
Token decided
Token ID3066
Feature activation-8.60e-01

Pos logit contributions
ore+0.001
 x+0.000
 gen+0.000
year+0.000
 goblin+0.000
Neg logit contributions
 pads-0.001
ety-0.000
 tails-0.000
coat-0.000
burn-0.000
Token to
Token ID284
Feature activation+6.55e-01

Pos logit contributions
 pads+0.053
ety+0.045
 tails+0.043
 Indians+0.043
burn+0.042
Neg logit contributions
ore-0.035
 x-0.031
year-0.031
 gen-0.031
 operating-0.029
Token sing
Token ID1702
Feature activation+1.43e-01

Pos logit contributions
ore+0.034
year+0.033
 gen+0.031
 x+0.031
 goblin+0.030
Neg logit contributions
 pads-0.032
ety-0.026
burn-0.025
coat-0.024
 tails-0.024
Token with
Token ID351
Feature activation+5.74e-01

Pos logit contributions
ore+0.006
 x+0.006
 gen+0.005
year+0.005
 goblin+0.005
Neg logit contributions
 pads-0.009
ety-0.007
 tails-0.007
coat-0.007
burn-0.007
Token it
Token ID340
Feature activation-2.36e-02

Pos logit contributions
ore+0.027
 x+0.025
 gen+0.025
year+0.024
 operating+0.023
Neg logit contributions
 pads-0.032
ety-0.025
 tails-0.025
 sunglasses-0.025
burn-0.024
Token.
Token ID13
Feature activation+7.50e-01

Pos logit contributions
 pads+0.001
ety+0.001
 tails+0.001
burn+0.001
coat+0.001
Neg logit contributions
ore-0.001
 x-0.001
 gen-0.001
year-0.001
 goblin-0.001
Token
Token ID198
Feature activation+7.79e-01

Pos logit contributions
 x+0.037
 gen+0.037
ore+0.036
year+0.035
 operating+0.033
Neg logit contributions
 pads-0.040
ety-0.032
 pad-0.029
 tails-0.028
coat-0.028
Token
Token ID198
Feature activation+4.88e-01

Pos logit contributions
ore+0.037
 x+0.037
 gen+0.036
year+0.035
 goblin+0.035
Neg logit contributions
 pads-0.045
ety-0.038
ibi-0.035
 Indians-0.034
burn-0.034
TokenWhen
Token ID2215
Feature activation+7.14e-01

Pos logit contributions
ore+0.025
 x+0.023
 gen+0.023
year+0.022
 goblin+0.022
Neg logit contributions
 pads-0.025
ety-0.021
burn-0.019
 Indians-0.019
 tails-0.019
Token Mill
Token ID9212
Feature activation+3.80e-01

Pos logit contributions
ore+0.036
 x+0.034
year+0.033
 gen+0.033
 goblin+0.032
Neg logit contributions
 pads-0.035
ety-0.030
burn-0.028
 sunglasses-0.027
 tails-0.027
Tokenie
Token ID494
Feature activation+1.51e+00

Pos logit contributions
ore+0.018
year+0.016
 x+0.016
 gen+0.016
 goblin+0.015
Neg logit contributions
 pads-0.022
ety-0.019
burn-0.017
 pad-0.017
bur-0.016
Token got
Token ID1392
Feature activation+6.26e-01

Pos logit contributions
year+0.083
ore+0.083
 gen+0.075
 x+0.075
 ingredient+0.072
Neg logit contributions
 pads-0.071
ety-0.060
 swung-0.054
 pad-0.053
burn-0.052
Token inside
Token ID2641
Feature activation+1.40e+00

Pos logit contributions
ore+0.038
 gen+0.036
year+0.035
 x+0.035
 goblin+0.034
Neg logit contributions
 pads-0.026
ety-0.022
burn-0.020
coat-0.019
 sunglasses-0.019
Token,
Token ID11
Feature activation+3.57e-01

Pos logit contributions
year+0.076
ore+0.075
 gen+0.074
 x+0.073
 healer+0.070
Neg logit contributions
 pads-0.057
ety-0.049
coat-0.045
burn-0.043
 sunglasses-0.043
Token she
Token ID673
Feature activation+7.43e-01

Pos logit contributions
ore+0.017
 x+0.016
year+0.016
 gen+0.015
 goblin+0.015
Neg logit contributions
 pads-0.019
ety-0.016
burn-0.015
 sunglasses-0.015
 Indians-0.015
Token ran
Token ID4966
Feature activation+1.73e-01

Pos logit contributions
ore+0.043
year+0.039
 x+0.039
 gen+0.038
 healer+0.036
Neg logit contributions
 pads-0.035
ety-0.029
 pad-0.025
coat-0.025
 sunglasses-0.025
Token to
Token ID284
Feature activation+6.40e-01

Pos logit contributions
ore+0.009
 gen+0.008
 x+0.008
year+0.008
 goblin+0.007
Neg logit contributions
 pads-0.009
ety-0.008
coat-0.007
burn-0.007
 tails-0.007
Token her
Token ID607
Feature activation-7.30e-02

Pos logit contributions
ore+0.034
year+0.032
 gen+0.031
 x+0.029
 goblin+0.029
Neg logit contributions
 pads-0.030
ety-0.025
 sunglasses-0.024
 Indians-0.024
burn-0.024
Token a
Token ID257
Feature activation+1.56e-01

Pos logit contributions
ore+0.019
 x+0.018
 gen+0.017
year+0.017
 goblin+0.016
Neg logit contributions
 pads-0.022
ety-0.018
 tails-0.017
 pad-0.017
 sunglasses-0.016
Token river
Token ID7850
Feature activation+2.93e-01

Pos logit contributions
ore+0.007
year+0.006
 x+0.006
 gen+0.006
 goblin+0.006
Neg logit contributions
 pads-0.009
ety-0.008
 tails-0.007
 pad-0.007
 Indians-0.007
Token and
Token ID290
Feature activation-1.21e-01

Pos logit contributions
ore+0.013
 x+0.012
year+0.012
 gen+0.012
 goblin+0.011
Neg logit contributions
 pads-0.017
ety-0.014
 tails-0.014
coat-0.013
 pad-0.013
Token through
Token ID832
Feature activation+7.80e-01

Pos logit contributions
 pads+0.006
ety+0.005
 tails+0.005
 Indians+0.005
burn+0.005
Neg logit contributions
ore-0.006
 x-0.006
 gen-0.006
year-0.006
 goblin-0.005
Token some
Token ID617
Feature activation+8.00e-01

Pos logit contributions
ore+0.037
 x+0.035
year+0.034
 gen+0.033
 healer+0.031
Neg logit contributions
 pads-0.044
 tails-0.035
ety-0.035
 pad-0.032
 sunglasses-0.032
Token trees
Token ID7150
Feature activation-6.15e-01

Pos logit contributions
ore+0.042
 x+0.039
 gen+0.039
year+0.039
 healer+0.036
Neg logit contributions
 pads-0.041
 tails-0.031
ety-0.031
 pad-0.030
 sunglasses-0.029
Token.
Token ID13
Feature activation+3.76e-01

Pos logit contributions
 pads+0.040
ety+0.035
burn+0.033
 Indians+0.033
 tails+0.033
Neg logit contributions
ore-0.023
 gen-0.020
year-0.020
 x-0.020
 goblin-0.018
Token He
Token ID679
Feature activation+1.93e-01

Pos logit contributions
ore+0.015
 gen+0.014
 x+0.014
year+0.013
 goblin+0.012
Neg logit contributions
 pads-0.024
ety-0.020
 Indians-0.019
 tails-0.019
 pad-0.019
Token was
Token ID373
Feature activation-4.64e-01

Pos logit contributions
ore+0.009
 x+0.009
 gen+0.008
year+0.008
 goblin+0.008
Neg logit contributions
 pads-0.011
ety-0.009
 tails-0.008
 Indians-0.008
coat-0.008
Token having
Token ID1719
Feature activation-4.09e-01

Pos logit contributions
 pads+0.028
ety+0.023
burn+0.022
 tails+0.022
 Indians+0.022
Neg logit contributions
ore-0.020
year-0.018
 x-0.018
 gen-0.018
 goblin-0.017
Token so
Token ID523
Feature activation+1.04e+00

Pos logit contributions
 pads+0.025
ety+0.021
burn+0.020
 tails+0.020
coat+0.020
Neg logit contributions
ore-0.017
 x-0.016
year-0.015
 gen-0.015
 goblin-0.014
Token to
Token ID284
Feature activation-1.19e+00

Pos logit contributions
 pads+0.087
ety+0.074
burn+0.071
ibi+0.071
coat+0.070
Neg logit contributions
ore-0.049
 x-0.044
year-0.043
 gen-0.041
 operating-0.039
Token run
Token ID1057
Feature activation-5.32e-01

Pos logit contributions
 pads+0.070
ety+0.058
burn+0.056
 Indians+0.054
 tails+0.052
Neg logit contributions
 x-0.050
ore-0.050
 gen-0.046
year-0.045
 ingredient-0.044
Token and
Token ID290
Feature activation-2.19e+00

Pos logit contributions
 pads+0.035
ety+0.030
burn+0.029
 Indians+0.028
 tails+0.028
Neg logit contributions
ore-0.020
 x-0.017
year-0.017
 gen-0.017
 goblin-0.016
Token jump
Token ID4391
Feature activation-9.17e-01

Pos logit contributions
 pads+0.122
burn+0.105
ety+0.104
ige+0.099
bit+0.099
Neg logit contributions
ore-0.089
 x-0.078
 goblin-0.075
fire-0.075
 operating-0.073
Token and
Token ID290
Feature activation-2.12e+00

Pos logit contributions
 pads+0.062
ety+0.056
 Indians+0.053
burn+0.053
ibi+0.051
Neg logit contributions
ore-0.032
 x-0.027
year-0.026
 gen-0.026
 goblin-0.024
Token chase
Token ID15505
Feature activation-5.11e-01

Pos logit contributions
 pads+0.114
ety+0.097
burn+0.097
ibi+0.091
 tails+0.090
Neg logit contributions
ore-0.092
fire-0.080
 x-0.079
 goblin-0.078
 gen-0.078
Token each
Token ID1123
Feature activation-1.43e+00

Pos logit contributions
 pads+0.022
ety+0.018
burn+0.016
coat+0.016
 Indians+0.016
Neg logit contributions
ore-0.030
 x-0.028
year-0.027
 gen-0.027
 goblin-0.026
Token other
Token ID584
Feature activation-7.38e-01

Pos logit contributions
 pads+0.087
ety+0.077
coat+0.076
burn+0.074
 Indians+0.072
Neg logit contributions
ore-0.055
 x-0.052
year-0.049
 goblin-0.045
le-0.045
Token.
Token ID13
Feature activation-1.61e+00

Pos logit contributions
 pads+0.050
ety+0.044
burn+0.042
 Indians+0.042
coat+0.041
Neg logit contributions
ore-0.025
 x-0.022
year-0.021
 gen-0.021
 goblin-0.020
Token They
Token ID1119
Feature activation-2.40e+00

Pos logit contributions
 pads+0.112
burn+0.100
 Indians+0.098
ety+0.096
bit+0.093
Neg logit contributions
ore-0.048
 x-0.040
year-0.038
 operating-0.038
 gen-0.032
Token saw
Token ID2497
Feature activation-6.14e-01

Pos logit contributions
 pads+0.146
burn+0.135
ety+0.121
bit+0.121
aters+0.121
Neg logit contributions
year-0.074
abel-0.074
ore-0.070
 gen-0.069
 goblin-0.069
Token mom
Token ID1995
Feature activation-5.97e-01

Pos logit contributions
ore+0.010
year+0.009
 x+0.009
 gen+0.009
 goblin+0.009
Neg logit contributions
 pads-0.008
ety-0.006
 tails-0.006
 Indians-0.005
burn-0.005
Token for
Token ID329
Feature activation+6.03e-01

Pos logit contributions
 pads+0.036
ety+0.030
 tails+0.029
coat+0.028
 Indians+0.028
Neg logit contributions
ore-0.026
 x-0.024
year-0.023
 gen-0.023
 operating-0.021
Token another
Token ID1194
Feature activation+2.29e-01

Pos logit contributions
ore+0.029
year+0.025
 gen+0.024
 x+0.024
 goblin+0.023
Neg logit contributions
 pads-0.033
ety-0.028
 sunglasses-0.027
 Indians-0.027
 tails-0.027
Token one
Token ID530
Feature activation+2.60e-01

Pos logit contributions
ore+0.012
year+0.011
 gen+0.011
 x+0.011
 operating+0.010
Neg logit contributions
 pads-0.012
ety-0.010
coat-0.009
burn-0.009
 tails-0.009
Token,
Token ID11
Feature activation-4.30e-01

Pos logit contributions
ore+0.011
year+0.011
 gen+0.010
 x+0.010
 operating+0.010
Neg logit contributions
 pads-0.016
ety-0.013
 pad-0.012
coat-0.012
 tails-0.012
Token but
Token ID475
Feature activation-8.84e-01

Pos logit contributions
 pads+0.023
ety+0.019
burn+0.018
 tails+0.018
 Indians+0.018
Neg logit contributions
ore-0.021
 x-0.019
year-0.019
 gen-0.019
 goblin-0.018
Token her
Token ID607
Feature activation-1.21e+00

Pos logit contributions
 pads+0.055
ety+0.045
 tails+0.044
coat+0.043
 Indians+0.043
Neg logit contributions
ore-0.035
 gen-0.034
 x-0.034
year-0.034
 operating-0.031
Token mom
Token ID1995
Feature activation-5.22e-01

Pos logit contributions
 pads+0.070
ety+0.059
 Indians+0.057
burn+0.055
 tails+0.055
Neg logit contributions
 x-0.053
 gen-0.050
ore-0.049
 goblin-0.045
abel-0.044
Token said
Token ID531
Feature activation-6.68e-01

Pos logit contributions
 pads+0.033
ety+0.027
 tails+0.026
 Indians+0.026
burn+0.025
Neg logit contributions
ore-0.021
 x-0.019
 gen-0.019
year-0.019
 goblin-0.017
Token she
Token ID673
Feature activation-9.24e-02

Pos logit contributions
 pads+0.038
ety+0.032
 Indians+0.029
 tails+0.029
burn+0.029
Neg logit contributions
ore-0.031
 gen-0.029
 x-0.029
year-0.028
 operating-0.026
Token had
Token ID550
Feature activation-7.03e-01

Pos logit contributions
 pads+0.005
ety+0.004
 tails+0.003
burn+0.003
coat+0.003
Neg logit contributions
ore-0.005
 x-0.005
 gen-0.005
year-0.005
 goblin-0.004
Token at
Token ID379
Feature activation-2.35e+00

Pos logit contributions
 pads+0.067
ety+0.059
 Indians+0.056
coat+0.056
ibi+0.054
Neg logit contributions
ore-0.031
year-0.027
 x-0.026
 gen-0.026
 operating-0.024
Token them
Token ID606
Feature activation-9.15e-02

Pos logit contributions
 pads+0.147
coat+0.121
ety+0.118
 Indians+0.117
burn+0.114
Neg logit contributions
 x-0.097
ore-0.087
year-0.076
 gen-0.075
 God-0.075
Token.
Token ID13
Feature activation-1.35e+00

Pos logit contributions
 pads+0.006
ety+0.005
 tails+0.005
burn+0.005
coat+0.005
Neg logit contributions
ore-0.003
 x-0.003
 gen-0.003
year-0.003
 goblin-0.003
Token She
Token ID1375
Feature activation+1.32e-01

Pos logit contributions
 pads+0.096
ety+0.083
burn+0.082
 Indians+0.081
coat+0.080
Neg logit contributions
ore-0.043
 x-0.035
 gen-0.034
year-0.033
 operating-0.031
Token is
Token ID318
Feature activation-6.32e-01

Pos logit contributions
ore+0.005
year+0.005
 x+0.005
 gen+0.005
 goblin+0.004
Neg logit contributions
 pads-0.008
ety-0.007
 tails-0.007
 Indians-0.007
coat-0.007
Token friendly
Token ID8030
Feature activation-4.81e-01

Pos logit contributions
 pads+0.039
ety+0.033
 Indians+0.030
coat+0.030
 tails+0.030
Neg logit contributions
ore-0.026
 x-0.024
 gen-0.024
year-0.024
 goblin-0.022
Token.
Token ID13
Feature activation-1.18e+00

Pos logit contributions
 pads+0.033
ety+0.028
 Indians+0.027
burn+0.027
 tails+0.026
Neg logit contributions
ore-0.017
 x-0.015
 gen-0.015
year-0.014
 goblin-0.013
Token
Token ID198
Feature activation+7.01e-01

Pos logit contributions
 pads+0.083
ety+0.071
burn+0.070
 tails+0.070
 Indians+0.069
Neg logit contributions
ore-0.039
 x-0.032
 gen-0.031
year-0.031
 operating-0.028
Token
Token ID198
Feature activation-5.63e-01

Pos logit contributions
ore+0.031
year+0.030
 gen+0.030
 x+0.030
 goblin+0.029
Neg logit contributions
 pads-0.039
ety-0.032
ibi-0.029
 tails-0.029
 Indians-0.029
TokenTom
Token ID13787
Feature activation-3.87e-01

Pos logit contributions
 pads+0.031
ety+0.026
burn+0.024
 tails+0.024
coat+0.023
Neg logit contributions
ore-0.027
year-0.025
 x-0.025
 gen-0.025
 operating-0.023
Token and
Token ID290
Feature activation-1.17e+00

Pos logit contributions
 pads+0.024
ety+0.020
burn+0.019
 tails+0.019
 Indians+0.019
Neg logit contributions
ore-0.016
 x-0.014
 gen-0.014
year-0.014
 goblin-0.013
Token not
Token ID407
Feature activation-1.67e-01

Pos logit contributions
ore+0.076
oc+0.068
 goblin+0.066
le+0.066
 gen+0.065
Neg logit contributions
 pads-0.054
ety-0.047
 dads-0.045
 sunglasses-0.045
coat-0.044
Token happy
Token ID3772
Feature activation-2.66e-01

Pos logit contributions
 pads+0.010
ety+0.008
 tails+0.008
burn+0.008
 Indians+0.008
Neg logit contributions
ore-0.007
 x-0.007
 gen-0.007
year-0.007
 goblin-0.006
Token.
Token ID13
Feature activation+6.58e-01

Pos logit contributions
 pads+0.018
ety+0.015
 Indians+0.014
 tails+0.014
burn+0.014
Neg logit contributions
ore-0.010
 x-0.009
 gen-0.009
year-0.009
 goblin-0.008
Token
Token ID198
Feature activation+1.28e+00

Pos logit contributions
ore+0.032
year+0.031
 gen+0.031
 x+0.031
 goblin+0.029
Neg logit contributions
 pads-0.034
ety-0.029
coat-0.025
 Indians-0.025
 tails-0.025
Token
Token ID198
Feature activation+5.03e-01

Pos logit contributions
ore+0.062
 gen+0.062
 x+0.061
year+0.060
 goblin+0.060
Neg logit contributions
 pads-0.061
ety-0.052
ibi-0.047
 Indians-0.044
coat-0.044
Token"
Token ID1
Feature activation-8.17e-02

Pos logit contributions
ore+0.026
 gen+0.025
 x+0.024
year+0.023
 goblin+0.023
Neg logit contributions
 pads-0.026
ety-0.021
 Indians-0.019
 tails-0.019
coat-0.019
TokenL
Token ID43
Feature activation+5.71e-01

Pos logit contributions
 pads+0.004
ety+0.004
 tails+0.003
burn+0.003
 Indians+0.003
Neg logit contributions
ore-0.004
 x-0.004
year-0.004
 gen-0.004
 goblin-0.003
Tokenily
Token ID813
Feature activation-7.30e-01

Pos logit contributions
ore+0.021
 gen+0.019
 x+0.018
year+0.018
 goblin+0.017
Neg logit contributions
 pads-0.038
ety-0.033
coat-0.031
 tails-0.030
 pad-0.030
Token and
Token ID290
Feature activation-1.93e+00

Pos logit contributions
 pads+0.047
ety+0.039
 Indians+0.037
 tails+0.037
burn+0.037
Neg logit contributions
ore-0.029
 x-0.026
 gen-0.026
year-0.024
 goblin-0.023
Token Tom
Token ID4186
Feature activation-7.28e-01

Pos logit contributions
 pads+0.109
coat+0.094
ety+0.092
burn+0.092
inas+0.089
Neg logit contributions
 x-0.075
old-0.064
ore-0.064
 operating-0.061
ibrarian-0.060
Token,
Token ID11
Feature activation-1.55e-01

Pos logit contributions
 pads+0.047
ety+0.039
 Indians+0.037
burn+0.037
 tails+0.037
Neg logit contributions
ore-0.029
 x-0.026
 gen-0.026
year-0.024
 goblin-0.023
Token other
Token ID584
Feature activation-8.53e-01

Pos logit contributions
 pads+0.066
ety+0.057
coat+0.055
burn+0.054
 Indians+0.053
Neg logit contributions
ore-0.050
 x-0.046
year-0.044
 gen-0.041
 goblin-0.041
Token and
Token ID290
Feature activation-1.45e+00

Pos logit contributions
 pads+0.056
ety+0.050
burn+0.047
 Indians+0.047
coat+0.046
Neg logit contributions
ore-0.030
 x-0.026
 gen-0.025
year-0.025
 goblin-0.024
Token catch
Token ID4929
Feature activation-4.11e-02

Pos logit contributions
 pads+0.081
ety+0.072
burn+0.067
coat+0.065
bur+0.063
Neg logit contributions
ore-0.060
 gen-0.057
 x-0.057
 goblin-0.054
year-0.053
Token it
Token ID340
Feature activation-8.13e-01

Pos logit contributions
 pads+0.003
ety+0.002
 tails+0.002
burn+0.002
 Indians+0.002
Neg logit contributions
ore-0.002
 x-0.002
 gen-0.002
year-0.001
 goblin-0.001
Token.
Token ID13
Feature activation-6.56e-01

Pos logit contributions
 pads+0.056
ety+0.050
 Indians+0.048
burn+0.047
coat+0.047
Neg logit contributions
ore-0.027
 x-0.023
 gen-0.022
year-0.022
 goblin-0.020
Token They
Token ID1119
Feature activation-2.85e+00

Pos logit contributions
 pads+0.043
ety+0.037
burn+0.036
 Indians+0.035
coat+0.035
Neg logit contributions
ore-0.025
year-0.021
 x-0.021
 gen-0.021
 goblin-0.019
Token laughed
Token ID13818
Feature activation-1.30e+00

Pos logit contributions
 pads+0.162
burn+0.150
ety+0.138
ibi+0.133
aters+0.132
Neg logit contributions
year-0.092
 gen-0.090
abel-0.090
ore-0.088
 of-0.086
Token and
Token ID290
Feature activation-3.37e+00

Pos logit contributions
 pads+0.092
ety+0.087
coat+0.080
burn+0.080
 Indians+0.079
Neg logit contributions
ore-0.036
year-0.031
 x-0.031
 gen-0.031
 operating-0.028
Token had
Token ID550
Feature activation-2.47e+00

Pos logit contributions
 pads+0.190
ety+0.183
burn+0.172
bur+0.170
 pad+0.163
Neg logit contributions
ore-0.093
year-0.090
fire-0.089
 gen-0.084
 goblin-0.083
Token fun
Token ID1257
Feature activation-8.73e-01

Pos logit contributions
ety+0.159
 pads+0.158
burn+0.156
bur+0.147
ibi+0.144
Neg logit contributions
 x-0.056
 wishes-0.052
 trouble-0.051
 news-0.051
 gen-0.050
Token.
Token ID13
Feature activation-7.61e-01

Pos logit contributions
 pads+0.063
ety+0.057
burn+0.053
 Indians+0.053
coat+0.052
Neg logit contributions
ore-0.027
 x-0.022
 gen-0.022
year-0.021
 operating-0.020
Token race
Token ID3234
Feature activation-9.71e-01

Pos logit contributions
ore+0.002
year+0.002
 x+0.002
 gen+0.002
 goblin+0.002
Neg logit contributions
 pads-0.002
ety-0.002
 tails-0.002
burn-0.002
 Indians-0.002
Token.
Token ID13
Feature activation-4.43e-01

Pos logit contributions
 pads+0.066
ety+0.059
burn+0.055
coat+0.055
 Indians+0.054
Neg logit contributions
ore-0.034
 x-0.029
 gen-0.028
year-0.027
 goblin-0.026
Token They
Token ID1119
Feature activation-2.47e+00

Pos logit contributions
 pads+0.025
ety+0.021
burn+0.020
 tails+0.019
coat+0.019
Neg logit contributions
ore-0.021
 x-0.019
year-0.018
 gen-0.018
 goblin-0.017
Token raced
Token ID32897
Feature activation-2.08e+00

Pos logit contributions
 pads+0.134
ety+0.120
burn+0.120
 tails+0.110
coat+0.108
Neg logit contributions
ore-0.097
year-0.096
 gen-0.092
abel-0.088
 of-0.087
Token and
Token ID290
Feature activation-2.88e+00

Pos logit contributions
 pads+0.142
ety+0.136
burn+0.128
 Indians+0.120
bur+0.119
Neg logit contributions
ore-0.062
 x-0.050
year-0.049
 of-0.049
 operating-0.047
Token Zoom
Token ID40305
Feature activation-2.54e+00

Pos logit contributions
 pads+0.143
ety+0.142
burn+0.139
aters+0.128
ige+0.127
Neg logit contributions
ore-0.109
year-0.104
fire-0.101
 of-0.100
 goblin-0.097
Token went
Token ID1816
Feature activation-1.75e+00

Pos logit contributions
 pads+0.153
ety+0.147
burn+0.140
 sunglasses+0.134
bur+0.133
Neg logit contributions
ore-0.091
year-0.072
le-0.071
 of-0.071
 x-0.070
Token very
Token ID845
Feature activation-8.62e-01

Pos logit contributions
 pads+0.116
ety+0.105
burn+0.100
 tails+0.095
 Indians+0.094
Neg logit contributions
ore-0.062
 x-0.049
 operating-0.049
year-0.049
 goblin-0.045
Token fast
Token ID3049
Feature activation-1.10e+00

Pos logit contributions
 pads+0.053
ety+0.045
burn+0.044
 Indians+0.042
coat+0.042
Neg logit contributions
ore-0.035
year-0.031
 x-0.031
 gen-0.031
 operating-0.029
Token.
Token ID13
Feature activation-7.40e-01

Pos logit contributions
 pads+0.075
ety+0.068
burn+0.063
 Indians+0.063
ibi+0.062
Neg logit contributions
ore-0.038
 x-0.031
year-0.030
 gen-0.030
 operating-0.029
Token He
Token ID679
Feature activation-1.44e+00

Pos logit contributions
 pads+0.041
ety+0.035
burn+0.032
coat+0.032
 tails+0.031
Neg logit contributions
ore-0.035
 x-0.033
year-0.031
 gen-0.031
 operating-0.030
Token Lily
Token ID20037
Feature activation-1.93e-01

Pos logit contributions
ore+0.001
year+0.001
 x+0.001
 gen+0.001
 goblin+0.001
Neg logit contributions
 pads-0.001
ety-0.001
 tails-0.001
burn-0.001
 Indians-0.001
Token for
Token ID329
Feature activation-4.08e-01

Pos logit contributions
 pads+0.012
ety+0.010
 tails+0.009
burn+0.009
 Indians+0.009
Neg logit contributions
ore-0.008
 x-0.007
 gen-0.007
year-0.007
 goblin-0.007
Token her
Token ID607
Feature activation+2.68e-01

Pos logit contributions
 pads+0.024
ety+0.020
coat+0.018
 tails+0.018
burn+0.018
Neg logit contributions
ore-0.019
 x-0.017
 gen-0.017
year-0.017
 goblin-0.016
Token help
Token ID1037
Feature activation-4.60e-01

Pos logit contributions
ore+0.012
year+0.011
 gen+0.010
 x+0.010
 goblin+0.010
Neg logit contributions
 pads-0.016
ety-0.013
 tails-0.013
burn-0.012
 Indians-0.012
Token.
Token ID13
Feature activation-5.98e-01

Pos logit contributions
 pads+0.031
ety+0.027
 Indians+0.026
 tails+0.026
burn+0.025
Neg logit contributions
ore-0.016
 x-0.014
 gen-0.014
year-0.014
 goblin-0.013
Token They
Token ID1119
Feature activation-2.19e+00

Pos logit contributions
 pads+0.037
ety+0.031
burn+0.030
 tails+0.029
 Indians+0.029
Neg logit contributions
ore-0.025
year-0.022
 x-0.022
 gen-0.022
 goblin-0.020
Token became
Token ID2627
Feature activation-1.56e+00

Pos logit contributions
 pads+0.117
coat+0.098
ety+0.096
burn+0.094
 tails+0.090
Neg logit contributions
ore-0.092
 gen-0.091
year-0.091
abel-0.084
 goblin-0.081
Token good
Token ID922
Feature activation-1.34e-01

Pos logit contributions
 pads+0.101
coat+0.086
ety+0.086
 pad+0.083
ibi+0.080
Neg logit contributions
ore-0.053
year-0.052
 gen-0.051
 x-0.050
 healer-0.048
Token friends
Token ID2460
Feature activation+7.15e-02

Pos logit contributions
 pads+0.007
ety+0.005
 tails+0.005
burn+0.005
 Indians+0.005
Neg logit contributions
ore-0.007
 x-0.007
 gen-0.007
year-0.007
 goblin-0.006
Token and
Token ID290
Feature activation-1.45e+00

Pos logit contributions
ore+0.003
 x+0.003
 gen+0.003
year+0.003
 goblin+0.003
Neg logit contributions
 pads-0.004
ety-0.004
 tails-0.003
burn-0.003
coat-0.003
Token played
Token ID2826
Feature activation+4.27e-01

Pos logit contributions
 pads+0.066
ety+0.054
coat+0.051
 pad+0.049
burn+0.048
Neg logit contributions
ore-0.076
year-0.075
 gen-0.074
 x-0.070
 goblin-0.068
Token.
Token ID13
Feature activation-5.82e-01

Pos logit contributions
 pads+0.001
ety+0.000
 tails+0.000
burn+0.000
coat+0.000
Neg logit contributions
ore-0.000
 x-0.000
 gen-0.000
year-0.000
 goblin-0.000
Token
Token ID198
Feature activation+2.08e-01

Pos logit contributions
 pads+0.036
ety+0.030
burn+0.029
 tails+0.029
 Indians+0.028
Neg logit contributions
ore-0.024
 x-0.021
year-0.021
 gen-0.021
 goblin-0.020
Token
Token ID198
Feature activation-2.32e-01

Pos logit contributions
ore+0.009
 x+0.008
 gen+0.008
year+0.008
 goblin+0.007
Neg logit contributions
 pads-0.013
ety-0.011
coat-0.010
 Indians-0.010
 tails-0.010
TokenThe
Token ID464
Feature activation-3.43e-01

Pos logit contributions
 pads+0.013
ety+0.011
 tails+0.010
burn+0.010
 Indians+0.010
Neg logit contributions
ore-0.011
 x-0.010
year-0.010
 gen-0.010
 goblin-0.009
Token wind
Token ID2344
Feature activation-3.81e-01

Pos logit contributions
 pads+0.019
ety+0.016
 tails+0.015
burn+0.014
coat+0.014
Neg logit contributions
ore-0.016
 x-0.015
 gen-0.015
year-0.015
 goblin-0.014
Token was
Token ID373
Feature activation-2.20e+00

Pos logit contributions
 pads+0.020
ety+0.017
 tails+0.015
 Indians+0.015
burn+0.015
Neg logit contributions
ore-0.019
 gen-0.017
 x-0.017
year-0.017
 goblin-0.016
Token still
Token ID991
Feature activation-1.04e+00

Pos logit contributions
 pads+0.134
ety+0.111
lets+0.105
 Indians+0.103
 pad+0.101
Neg logit contributions
 x-0.088
ore-0.084
year-0.082
 gen-0.081
 late-0.078
Token blowing
Token ID19280
Feature activation-4.46e-01

Pos logit contributions
 pads+0.059
ety+0.049
 Indians+0.046
 tails+0.045
 pad+0.044
Neg logit contributions
ore-0.048
 x-0.045
 gen-0.044
year-0.043
 operating-0.041
Token all
Token ID477
Feature activation-8.26e-01

Pos logit contributions
 pads+0.027
ety+0.023
 Indians+0.022
burn+0.022
 tails+0.021
Neg logit contributions
ore-0.019
 x-0.017
 gen-0.017
year-0.017
 goblin-0.015
Token around
Token ID1088
Feature activation-1.04e-01

Pos logit contributions
 pads+0.052
ety+0.044
 Indians+0.043
 tails+0.042
burn+0.041
Neg logit contributions
ore-0.034
year-0.029
 x-0.029
 gen-0.028
 operating-0.026
Token the
Token ID262
Feature activation-5.62e-02

Pos logit contributions
 pads+0.006
ety+0.005
 tails+0.005
coat+0.005
burn+0.005
Neg logit contributions
ore-0.005
 x-0.004
 gen-0.004
year-0.004
 goblin-0.004
Token were
Token ID547
Feature activation-4.85e-01

Pos logit contributions
ore+0.013
year+0.012
 x+0.012
 gen+0.012
 goblin+0.011
Neg logit contributions
 pads-0.022
ety-0.019
coat-0.017
 tails-0.017
burn-0.017
Token two
Token ID734
Feature activation-7.86e-01

Pos logit contributions
 pads+0.028
ety+0.023
coat+0.022
 tails+0.022
 Indians+0.022
Neg logit contributions
ore-0.022
 x-0.020
year-0.020
 gen-0.020
 goblin-0.018
Token friends
Token ID2460
Feature activation-1.24e-01

Pos logit contributions
 pads+0.042
ety+0.034
burn+0.032
 Indians+0.032
coat+0.032
Neg logit contributions
ore-0.037
 x-0.036
 gen-0.035
year-0.034
 goblin-0.033
Token,
Token ID11
Feature activation-1.28e+00

Pos logit contributions
 pads+0.007
ety+0.006
 tails+0.006
burn+0.006
 Indians+0.006
Neg logit contributions
ore-0.005
 x-0.005
 gen-0.005
year-0.005
 goblin-0.004
Token Mia
Token ID32189
Feature activation-5.05e-01

Pos logit contributions
 pads+0.077
ety+0.062
burn+0.061
coat+0.059
 pad+0.059
Neg logit contributions
ore-0.052
 x-0.050
year-0.050
 gen-0.049
 goblin-0.046
Token and
Token ID290
Feature activation-2.22e+00

Pos logit contributions
 pads+0.032
ety+0.027
 Indians+0.026
burn+0.026
 tails+0.025
Neg logit contributions
ore-0.020
 x-0.018
 gen-0.018
year-0.017
 goblin-0.016
Token George
Token ID4502
Feature activation-1.52e-01

Pos logit contributions
 pads+0.134
 pad+0.108
coat+0.105
ety+0.102
 tails+0.102
Neg logit contributions
 x-0.081
 gen-0.080
 goblin-0.079
ore-0.078
year-0.076
Token.
Token ID13
Feature activation-2.80e-01

Pos logit contributions
 pads+0.010
ety+0.009
 tails+0.008
burn+0.008
 Indians+0.008
Neg logit contributions
ore-0.006
 x-0.005
 gen-0.005
year-0.005
 goblin-0.005
Token Mia
Token ID32189
Feature activation+3.50e-01

Pos logit contributions
 pads+0.015
ety+0.013
burn+0.012
 Indians+0.011
coat+0.011
Neg logit contributions
ore-0.014
 x-0.013
year-0.012
 gen-0.012
 goblin-0.012
Token was
Token ID373
Feature activation+9.48e-02

Pos logit contributions
ore+0.019
 x+0.018
year+0.018
 gen+0.018
 goblin+0.017
Neg logit contributions
 pads-0.017
ety-0.014
 tails-0.012
coat-0.012
 pad-0.012
Token a
Token ID257
Feature activation-4.94e-02

Pos logit contributions
ore+0.004
 gen+0.004
year+0.004
 x+0.004
 goblin+0.003
Neg logit contributions
 pads-0.006
ety-0.005
 tails-0.004
burn-0.004
 Indians-0.004