TOP ACTIVATIONS
MAX = 0.142

his new friend the white bird and flew back home .
can get water from the o asis ?" The bird answered
, there was a big ostr ich . The ostr ich
as he could to the o asis . When he got
for rabbits near the b erry bushes . The
, there was a big ostr ich who was very proud
, Harry flew to the bee hive and became very excited
want to see the big ostr ich with the long neck
couldn 't see what the ostr ich was hiding , but
" Will the egg be warm now , Mom
" What is inside the egg , Mom my ?" she
time , there was an ostr ich who liked to celebrate
, there is the queen bee !" Ben said . He
in the branches of the tree . Jack
apple and slowly approached the ostr ich .
her day in the high o asis . <|endoftext|> Once upon
and I will free the bird ," said the voice .
a time there was an ostr ich . His name was
walked , they saw an ostr ich . " Look !"
melt the water in the o asis . All the water

MIN ACTIVATIONS
MIN = -0.113

on top . They use colorful gears for the eyes ,
was a boy named Ryan . He was three years old
They have many tubes of different colors and sizes . They
with tubes . They have many tubes of different colors and
They have many tubes of different colors and sizes . They
was a boy named Tyler . One day , Tyler wanted
a small ball called Bounce . Bounce loved making sounds and
with tubes . They have many tubes of different colors and
knew that chess was a hard game for a three year
a little boy named Ryan . He had a lot of
a little boy called Bart . One day , he went
He had a big , hard basketball that he would bounce
bles . She had many shiny and colorful mar bles .
that even though people have different jobs and wear different clothes
. He was a very talented drummer . He had a
a smart dog called Jazz . Jazz liked to steal things
new pants and he liked sorting . He was so happy
and Ryan was very excited . His family was sending him
with tubes . They have many tubes of different colors and
Jake who loved to kick . Jake kicked over anything that

INTERVAL 0.078 - 0.142
CONTAINS 0.014%

regretted getting close to the Gor illa and not listening to
! Hello there , Mr Owl !" The owl
. The ostr ich kept walking until he
so nice to see a rose like that ."
friend ." The ostr ich heard Ben 's voice

INTERVAL 0.014 - 0.078
CONTAINS 22.042%

She hugs them and says she loves them .
the little girl was done , she said , " Thank
something that helps us learn about the world around us and
on the back of a bird . The end
the Fire . <|endoftext|> Once upon a time there was a

INTERVAL -0.049 - 0.014
CONTAINS 77.362%

in a powerful yet quiet way . The
two beetles . They were best friends , but they often
with all the blue berries in the basket bump ing around
to the top , he looked around and saw the whole
. He loved his t eddy bear very much .

INTERVAL -0.113 - -0.049
CONTAINS 0.581%

and Jack was always making mess es and leaving things around
day , they decide to trade their toys .
sorry . They did not mean to make Mom angry .
Tom had so much fun mixing things together in the kitchen
Tim my learned that sharing is good and it helps
Token his
Token ID465
Feature activation+3.76e-02

Pos logit contributions
adow+0.088
lor+0.086
elope+0.083
comb+0.082
oney+0.082
Neg logit contributions
 gears-0.036
 trucks-0.035
 motors-0.032
 robots-0.030
 race-0.030
Token new
Token ID649
Feature activation+2.05e-02

Pos logit contributions
lor+0.082
adow+0.081
oney+0.075
elope+0.074
 Flo+0.069
Neg logit contributions
 gears-0.048
 trucks-0.047
 colors-0.046
 motors-0.043
 balls-0.042
Token friend
Token ID1545
Feature activation+5.91e-02

Pos logit contributions
adow+0.044
elope+0.040
lor+0.040
oney+0.038
 Flo+0.037
Neg logit contributions
 gears-0.029
 trucks-0.026
 motors-0.026
 colors-0.025
 magnets-0.025
Token the
Token ID262
Feature activation+2.28e-02

Pos logit contributions
adow+0.105
elope+0.099
opsy+0.092
 Maur+0.091
 Flo+0.091
Neg logit contributions
 gears-0.082
 magnets-0.081
 trucks-0.079
 numbers-0.077
 motors-0.077
Token white
Token ID2330
Feature activation+4.35e-02

Pos logit contributions
adow+0.055
elope+0.049
lor+0.048
oney+0.046
 Flo+0.045
Neg logit contributions
 gears-0.030
 motors-0.027
 trucks-0.026
 colors-0.025
 magnets-0.025
Token bird
Token ID6512
Feature activation+1.05e-01

Pos logit contributions
adow+0.097
elope+0.093
oney+0.091
lor+0.088
 Maur+0.085
Neg logit contributions
 gears-0.049
 colors-0.048
 motors-0.046
 trucks-0.044
 magnets-0.043
Token and
Token ID290
Feature activation+2.01e-02

Pos logit contributions
 Maur+0.185
 Dahl+0.178
adow+0.176
opsy+0.176
uf+0.174
Neg logit contributions
 colors-0.112
 magnets-0.112
 trucks-0.110
 race-0.109
 trophy-0.108
Token flew
Token ID13112
Feature activation+3.18e-02

Pos logit contributions
adow+0.044
lor+0.041
elope+0.041
oney+0.039
uf+0.037
Neg logit contributions
 gears-0.027
 trucks-0.026
 motors-0.022
 magnets-0.022
 balls-0.022
Token back
Token ID736
Feature activation+6.94e-02

Pos logit contributions
adow+0.071
elope+0.070
oney+0.064
comb+0.062
lor+0.062
Neg logit contributions
 gears-0.042
 motors-0.036
 balls-0.035
 hoops-0.034
 colors-0.034
Token home
Token ID1363
Feature activation+7.89e-02

Pos logit contributions
elope+0.145
adow+0.141
lor+0.139
uf+0.138
comb+0.133
Neg logit contributions
 gears-0.079
 trucks-0.072
 numbers-0.069
 blocks-0.069
 balls-0.068
Token.
Token ID13
Feature activation+1.35e-02

Pos logit contributions
adow+0.179
elope+0.168
opsy+0.158
lor+0.157
 Flo+0.154
Neg logit contributions
 gears-0.066
 trophies-0.064
 trophy-0.062
 colors-0.061
 numbers-0.061
Token can
Token ID460
Feature activation-9.67e-03

Pos logit contributions
 gears+0.003
 motors+0.003
 magnets+0.003
 trucks+0.003
 cubes+0.003
Neg logit contributions
adow-0.005
elope-0.004
lor-0.004
 Flo-0.004
 hive-0.004
Token get
Token ID651
Feature activation+3.35e-04

Pos logit contributions
 gears+0.014
 motors+0.013
 trucks+0.012
 magnets+0.012
 cubes+0.012
Neg logit contributions
adow-0.021
elope-0.019
 Flo-0.017
lor-0.017
 perched-0.017
Token water
Token ID1660
Feature activation+6.43e-03

Pos logit contributions
adow+0.001
elope+0.001
lor+0.001
 Flo+0.001
 perched+0.001
Neg logit contributions
 gears-0.001
 motors-0.001
 trucks-0.001
 magnets-0.001
 cubes-0.000
Token from
Token ID422
Feature activation+5.68e-02

Pos logit contributions
adow+0.012
elope+0.011
lor+0.010
 Flo+0.010
 perched+0.009
Neg logit contributions
 gears-0.011
 motors-0.010
 trucks-0.010
 magnets-0.010
 cubes-0.010
Token the
Token ID262
Feature activation+6.01e-02

Pos logit contributions
adow+0.126
elope+0.119
lor+0.113
comb+0.104
ossom+0.103
Neg logit contributions
 gears-0.077
 trucks-0.076
 motors-0.072
 magnets-0.070
 blocks-0.065
Token o
Token ID267
Feature activation+1.00e-01

Pos logit contributions
adow+0.134
elope+0.120
oney+0.119
owed+0.116
lor+0.115
Neg logit contributions
 colors-0.079
 trucks-0.078
 ones-0.077
 gears-0.072
 balls-0.070
Tokenasis
Token ID17765
Feature activation+6.99e-02

Pos logit contributions
elope+0.135
adow+0.119
 perched+0.117
 symp+0.116
 Clover+0.115
Neg logit contributions
 trucks-0.168
 cubes-0.164
 gears-0.150
 motors-0.148
 colors-0.147
Token?"
Token ID1701
Feature activation+2.45e-02

Pos logit contributions
elope+0.140
adow+0.132
 Flo+0.124
opsy+0.121
lor+0.119
Neg logit contributions
 gears-0.098
 trucks-0.096
 magnets-0.090
 cubes-0.090
 motors-0.085
Token The
Token ID383
Feature activation+3.70e-02

Pos logit contributions
adow+0.051
lor+0.043
elope+0.043
comb+0.041
 hive+0.041
Neg logit contributions
 gears-0.036
 magnets-0.034
 motors-0.032
 trucks-0.032
 cubes-0.031
Token bird
Token ID6512
Feature activation+3.80e-02

Pos logit contributions
adow+0.096
elope+0.086
lor+0.086
oney+0.084
comb+0.081
Neg logit contributions
 colors-0.035
 gears-0.035
 trucks-0.034
 cubes-0.031
 buttons-0.031
Token answered
Token ID9373
Feature activation+7.57e-03

Pos logit contributions
adow+0.079
lor+0.073
elope+0.071
oney+0.068
 ther+0.065
Neg logit contributions
 gears-0.048
 colors-0.044
 trucks-0.044
 magnets-0.043
 motors-0.042
Token,
Token ID11
Feature activation+1.06e-02

Pos logit contributions
 gears+0.001
 trucks+0.001
 motors+0.001
 magnets+0.001
 cubes+0.001
Neg logit contributions
adow-0.002
elope-0.002
lor-0.002
 perched-0.002
 Flo-0.002
Token there
Token ID612
Feature activation-1.49e-02

Pos logit contributions
adow+0.020
lor+0.019
elope+0.019
 Flo+0.017
comb+0.017
Neg logit contributions
 gears-0.016
 cubes-0.015
 magnets-0.015
 blocks-0.014
 trucks-0.014
Token was
Token ID373
Feature activation+3.62e-02

Pos logit contributions
 gears+0.022
 motors+0.019
 trucks+0.019
 magnets+0.018
 cubes+0.018
Neg logit contributions
adow-0.031
elope-0.028
lor-0.026
 hive-0.025
 Flo-0.025
Token a
Token ID257
Feature activation+9.55e-04

Pos logit contributions
adow+0.071
elope+0.066
lor+0.062
 Flo+0.057
comb+0.056
Neg logit contributions
 gears-0.060
 magnets-0.054
 motors-0.054
 trucks-0.052
 cubes-0.051
Token big
Token ID1263
Feature activation-2.34e-02

Pos logit contributions
adow+0.002
elope+0.002
 Flo+0.002
lor+0.002
 perched+0.002
Neg logit contributions
 gears-0.002
 motors-0.001
 trucks-0.001
 magnets-0.001
 cubes-0.001
Token ostr
Token ID42387
Feature activation+9.92e-02

Pos logit contributions
 gears+0.038
 motors+0.034
 magnets+0.032
 trucks+0.032
 trophies+0.031
Neg logit contributions
adow-0.047
elope-0.043
 hive-0.039
 Flo-0.038
 perched-0.037
Tokenich
Token ID488
Feature activation+4.16e-02

Pos logit contributions
adow+0.172
elope+0.163
 Honey+0.159
 Flo+0.157
 Merry+0.148
Neg logit contributions
 trucks-0.146
 cubes-0.145
 gears-0.144
 motors-0.138
 automobiles-0.136
Token.
Token ID13
Feature activation-4.00e-04

Pos logit contributions
adow+0.075
elope+0.070
lor+0.067
 Flo+0.066
comb+0.065
Neg logit contributions
 gears-0.065
 trucks-0.063
 motors-0.060
 cubes-0.059
 magnets-0.058
Token The
Token ID383
Feature activation+2.40e-02

Pos logit contributions
 gears+0.001
 trucks+0.001
 magnets+0.001
 motors+0.001
 automobiles+0.001
Neg logit contributions
adow-0.001
elope-0.001
lor-0.001
 hive-0.001
opsy-0.001
Token ostr
Token ID42387
Feature activation+9.22e-02

Pos logit contributions
adow+0.046
elope+0.041
lor+0.039
comb+0.038
oney+0.036
Neg logit contributions
 gears-0.040
 trucks-0.039
 motors-0.037
 colors-0.037
 cubes-0.035
Tokenich
Token ID488
Feature activation+2.45e-02

Pos logit contributions
adow+0.160
elope+0.147
 Honey+0.146
 Flo+0.140
 Bonnie+0.136
Neg logit contributions
 trucks-0.139
 gears-0.137
 cubes-0.135
 automobiles-0.134
 motors-0.133
Token as
Token ID355
Feature activation+1.38e-02

Pos logit contributions
 gears+0.062
 magnets+0.057
 motors+0.056
 trucks+0.054
 automobiles+0.054
Neg logit contributions
adow-0.049
 Flo-0.044
elope-0.044
 perched-0.042
ow-0.038
Token he
Token ID339
Feature activation+7.90e-03

Pos logit contributions
adow+0.030
elope+0.027
 Flo+0.025
lor+0.025
 perched+0.024
Neg logit contributions
 gears-0.021
 motors-0.018
 trucks-0.018
 magnets-0.018
 cubes-0.017
Token could
Token ID714
Feature activation+1.97e-02

Pos logit contributions
adow+0.018
elope+0.017
 hive+0.015
lor+0.015
oney+0.015
Neg logit contributions
 gears-0.011
 motors-0.009
 trucks-0.009
 buttons-0.009
 cubes-0.008
Token to
Token ID284
Feature activation+3.25e-02

Pos logit contributions
adow+0.042
elope+0.040
lor+0.036
 Flo+0.035
 hive+0.035
Neg logit contributions
 gears-0.027
 motors-0.024
 trucks-0.023
 magnets-0.022
 cubes-0.022
Token the
Token ID262
Feature activation+5.97e-02

Pos logit contributions
adow+0.072
elope+0.067
lor+0.065
oney+0.062
opsy+0.058
Neg logit contributions
 gears-0.043
 trucks-0.039
 motors-0.038
 balls-0.035
 blocks-0.034
Token o
Token ID267
Feature activation+9.88e-02

Pos logit contributions
adow+0.124
oney+0.113
elope+0.113
lor+0.103
oppy+0.098
Neg logit contributions
 trucks-0.086
 gears-0.084
 motors-0.079
 colors-0.078
 buttons-0.077
Tokenasis
Token ID17765
Feature activation+4.77e-02

Pos logit contributions
elope+0.121
 Clover+0.115
 Honey+0.112
 Bonnie+0.109
 Pen+0.109
Neg logit contributions
 cubes-0.178
 trucks-0.177
 gears-0.171
 motors-0.163
 colors-0.159
Token.
Token ID13
Feature activation+6.94e-04

Pos logit contributions
elope+0.094
adow+0.093
lor+0.084
opsy+0.084
 Flo+0.082
Neg logit contributions
 gears-0.071
 trucks-0.066
 cubes-0.062
 motors-0.061
 magnets-0.059
Token When
Token ID1649
Feature activation+5.01e-03

Pos logit contributions
adow+0.002
elope+0.001
lor+0.001
 hive+0.001
comb+0.001
Neg logit contributions
 gears-0.001
 trucks-0.001
 motors-0.001
 magnets-0.001
 cubes-0.001
Token he
Token ID339
Feature activation-4.71e-03

Pos logit contributions
adow+0.014
elope+0.012
lor+0.012
 hive+0.011
oney+0.011
Neg logit contributions
 gears-0.006
 trucks-0.005
 motors-0.004
 cubes-0.004
 magnets-0.004
Token got
Token ID1392
Feature activation+4.91e-03

Pos logit contributions
 gears+0.006
 motors+0.005
 trucks+0.005
 cubes+0.004
 balls+0.004
Neg logit contributions
adow-0.011
elope-0.010
lor-0.009
 hive-0.009
oney-0.009
Token for
Token ID329
Feature activation+3.60e-02

Pos logit contributions
adow+0.040
elope+0.039
lor+0.035
 Flo+0.034
comb+0.034
Neg logit contributions
 gears-0.030
 magnets-0.027
 motors-0.026
 trophies-0.026
 trucks-0.026
Token rabbits
Token ID33043
Feature activation+4.59e-02

Pos logit contributions
adow+0.079
elope+0.073
comb+0.066
lor+0.064
opsy+0.062
Neg logit contributions
 gears-0.050
 numbers-0.046
 magnets-0.045
 balls-0.045
 colors-0.045
Token near
Token ID1474
Feature activation+3.97e-02

Pos logit contributions
elope+0.085
adow+0.085
lor+0.075
opsy+0.074
 Flo+0.074
Neg logit contributions
 gears-0.067
 magnets-0.063
 motors-0.062
 trucks-0.062
 balls-0.059
Token the
Token ID262
Feature activation+5.30e-02

Pos logit contributions
adow+0.082
elope+0.074
lor+0.070
comb+0.070
ossom+0.065
Neg logit contributions
 magnets-0.056
 gears-0.055
 balls-0.054
 motors-0.053
 trucks-0.052
Token b
Token ID275
Feature activation+5.72e-02

Pos logit contributions
adow+0.122
elope+0.104
oney+0.103
lor+0.101
ow+0.100
Neg logit contributions
 balls-0.071
 colors-0.070
 gears-0.068
 trucks-0.066
 numbers-0.064
Tokenerry
Token ID6996
Feature activation+9.85e-02

Pos logit contributions
elope+0.070
adow+0.065
 perched+0.061
lor+0.059
 ther+0.055
Neg logit contributions
 gears-0.115
 motors-0.110
 cubes-0.108
 balls-0.108
 magnets-0.107
Token bushes
Token ID37413
Feature activation+6.51e-02

Pos logit contributions
adow+0.167
elope+0.166
lor+0.140
uth+0.140
 symp+0.140
Neg logit contributions
 trucks-0.145
 magnets-0.144
 balls-0.143
 gears-0.139
 blocks-0.136
Token.
Token ID13
Feature activation+4.48e-02

Pos logit contributions
elope+0.125
adow+0.123
opsy+0.111
 Maur+0.108
 Flo+0.107
Neg logit contributions
 gears-0.092
 magnets-0.088
 trucks-0.085
 motors-0.084
 balls-0.083
Token
Token ID198
Feature activation+2.14e-02

Pos logit contributions
adow+0.082
 hive+0.068
elope+0.067
comb+0.066
 perched+0.064
Neg logit contributions
 gears-0.074
 motors-0.070
 automobiles-0.067
 magnets-0.067
 trucks-0.067
Token
Token ID198
Feature activation+6.07e-03

Pos logit contributions
adow+0.038
comb+0.033
elope+0.032
 hatched+0.031
 hatch+0.031
Neg logit contributions
 gears-0.032
 motors-0.029
 balls-0.028
 hoops-0.028
Years-0.028
TokenThe
Token ID464
Feature activation+1.52e-02

Pos logit contributions
adow+0.014
elope+0.012
 perched+0.011
 hive+0.011
 Flo+0.011
Neg logit contributions
 gears-0.008
 motors-0.007
 balls-0.006
 hoops-0.006
 magnets-0.006
Token,
Token ID11
Feature activation+8.79e-03

Pos logit contributions
 gears+0.002
 trucks+0.002
 motors+0.002
 magnets+0.002
 cubes+0.002
Neg logit contributions
adow-0.004
elope-0.004
lor-0.003
 perched-0.003
 hive-0.003
Token there
Token ID612
Feature activation-1.71e-02

Pos logit contributions
adow+0.017
lor+0.017
elope+0.016
comb+0.014
uf+0.014
Neg logit contributions
 gears-0.013
 cubes-0.012
 magnets-0.012
 blocks-0.011
 trucks-0.011
Token was
Token ID373
Feature activation+3.53e-02

Pos logit contributions
 gears+0.025
 motors+0.022
 trucks+0.022
 magnets+0.022
 cubes+0.021
Neg logit contributions
adow-0.035
elope-0.032
lor-0.030
 Flo-0.029
 hive-0.029
Token a
Token ID257
Feature activation-1.05e-04

Pos logit contributions
adow+0.069
elope+0.064
lor+0.060
 Flo+0.056
comb+0.054
Neg logit contributions
 gears-0.059
 magnets-0.053
 motors-0.053
 trucks-0.051
 cubes-0.050
Token big
Token ID1263
Feature activation-2.49e-02

Pos logit contributions
 gears+0.000
 motors+0.000
 trucks+0.000
 magnets+0.000
 cubes+0.000
Neg logit contributions
adow-0.000
elope-0.000
 Flo-0.000
lor-0.000
 perched-0.000
Token ostr
Token ID42387
Feature activation+9.76e-02

Pos logit contributions
 gears+0.041
 motors+0.038
 magnets+0.035
 trucks+0.034
 trophies+0.034
Neg logit contributions
adow-0.048
elope-0.044
 hive-0.041
 Flo-0.039
 perched-0.038
Tokenich
Token ID488
Feature activation+4.03e-02

Pos logit contributions
adow+0.176
elope+0.163
 Honey+0.161
 Flo+0.160
 Merry+0.153
Neg logit contributions
 cubes-0.134
 trucks-0.132
 gears-0.131
 motors-0.125
 automobiles-0.123
Token who
Token ID508
Feature activation+1.45e-02

Pos logit contributions
adow+0.074
elope+0.068
lor+0.067
 Flo+0.065
comb+0.064
Neg logit contributions
 gears-0.062
 trucks-0.060
 motors-0.056
 cubes-0.056
 magnets-0.055
Token was
Token ID373
Feature activation+2.56e-02

Pos logit contributions
adow+0.027
elope+0.024
lor+0.023
 Flo+0.022
comb+0.022
Neg logit contributions
 gears-0.025
 trucks-0.022
 motors-0.022
 magnets-0.021
 cubes-0.021
Token very
Token ID845
Feature activation+1.98e-02

Pos logit contributions
lor+0.062
adow+0.060
comb+0.058
apa+0.056
oney+0.055
Neg logit contributions
 gears-0.026
 trucks-0.025
 motors-0.024
 robots-0.022
 colors-0.022
Token proud
Token ID6613
Feature activation-1.92e-02

Pos logit contributions
lor+0.047
adow+0.043
elope+0.042
comb+0.042
 perched+0.037
Neg logit contributions
 gears-0.023
 trucks-0.022
 cubes-0.022
 colors-0.021
 motors-0.020
Token,
Token ID11
Feature activation+2.39e-02

Pos logit contributions
adow+0.052
elope+0.050
lor+0.045
 Flo+0.044
 perched+0.043
Neg logit contributions
 gears-0.032
 trucks-0.028
 motors-0.028
 magnets-0.028
 numbers-0.026
Token Harry
Token ID5850
Feature activation+2.22e-02

Pos logit contributions
adow+0.060
elope+0.051
lor+0.051
comb+0.047
ossom+0.046
Neg logit contributions
 gears-0.031
 trucks-0.028
 magnets-0.026
 motors-0.026
 numbers-0.025
Token flew
Token ID13112
Feature activation+3.04e-02

Pos logit contributions
adow+0.047
elope+0.042
oney+0.041
lor+0.041
 ther+0.040
Neg logit contributions
 gears-0.027
 trucks-0.024
 colors-0.023
 motors-0.023
 magnets-0.023
Token to
Token ID284
Feature activation+5.65e-02

Pos logit contributions
adow+0.063
elope+0.059
lor+0.054
oney+0.051
comb+0.049
Neg logit contributions
 gears-0.047
 motors-0.040
 trucks-0.040
 magnets-0.039
 balls-0.038
Token the
Token ID262
Feature activation+7.06e-02

Pos logit contributions
adow+0.153
lor+0.146
elope+0.133
oney+0.132
ink+0.132
Neg logit contributions
 numbers-0.042
 trucks-0.038
 blocks-0.038
 gears-0.037
 swirl-0.036
Token bee
Token ID20697
Feature activation+9.72e-02

Pos logit contributions
adow+0.119
lor+0.104
ow+0.096
oney+0.095
utt+0.093
Neg logit contributions
 colors-0.121
 numbers-0.115
 shapes-0.111
 blocks-0.110
 buttons-0.109
Token hive
Token ID35881
Feature activation+7.75e-02

Pos logit contributions
uf+0.183
oney+0.178
 Maur+0.175
 Dahl+0.174
lor+0.174
Neg logit contributions
 balls-0.089
 trucks-0.087
 swirl-0.086
 blocks-0.084
 and-0.082
Token and
Token ID290
Feature activation+4.33e-02

Pos logit contributions
elope+0.180
adow+0.178
uf+0.175
lor+0.171
opsy+0.171
Neg logit contributions
 gears-0.057
 magnets-0.052
 trucks-0.051
 blocks-0.049
 numbers-0.049
Token became
Token ID2627
Feature activation+4.45e-02

Pos logit contributions
adow+0.099
lor+0.093
uf+0.085
oney+0.083
elope+0.082
Neg logit contributions
 trucks-0.045
 gears-0.040
 colors-0.039
 balls-0.038
 numbers-0.038
Token very
Token ID845
Feature activation+3.16e-02

Pos logit contributions
adow+0.099
lor+0.092
elope+0.089
oney+0.085
comb+0.083
Neg logit contributions
 gears-0.060
 magnets-0.054
 motors-0.052
 colors-0.050
 numbers-0.050
Token excited
Token ID6568
Feature activation+3.10e-03

Pos logit contributions
lor+0.070
adow+0.068
elope+0.065
comb+0.059
uf+0.057
Neg logit contributions
 gears-0.036
 colors-0.034
 numbers-0.032
 trophies-0.032
 trucks-0.032
Token want
Token ID765
Feature activation+7.88e-03

Pos logit contributions
 gears+0.006
 motors+0.006
 trucks+0.005
 magnets+0.005
 cubes+0.005
Neg logit contributions
adow-0.009
elope-0.009
lor-0.008
 Flo-0.008
 perched-0.008
Token to
Token ID284
Feature activation-1.05e-02

Pos logit contributions
adow+0.018
elope+0.017
lor+0.016
comb+0.015
oney+0.015
Neg logit contributions
 gears-0.011
 motors-0.010
 trucks-0.010
 magnets-0.010
 numbers-0.009
Token see
Token ID766
Feature activation+2.45e-02

Pos logit contributions
 gears+0.014
 motors+0.013
 trucks+0.012
 magnets+0.012
 numbers+0.011
Neg logit contributions
adow-0.023
elope-0.022
lor-0.021
oney-0.020
 perched-0.020
Token the
Token ID262
Feature activation+1.81e-02

Pos logit contributions
adow+0.058
elope+0.056
lor+0.052
comb+0.051
oney+0.049
Neg logit contributions
 gears-0.032
 trucks-0.031
 motors-0.030
 magnets-0.029
 automobiles-0.028
Token big
Token ID1263
Feature activation+3.32e-03

Pos logit contributions
adow+0.045
elope+0.041
lor+0.039
 Flo+0.038
oney+0.037
Neg logit contributions
 gears-0.022
 trucks-0.020
 motors-0.020
 magnets-0.019
 automobiles-0.017
Token ostr
Token ID42387
Feature activation+9.66e-02

Pos logit contributions
adow+0.007
elope+0.007
 Flo+0.006
lor+0.006
 perched+0.006
Neg logit contributions
 gears-0.005
 motors-0.004
 trucks-0.004
 magnets-0.004
 cubes-0.004
Tokenich
Token ID488
Feature activation+5.94e-02

Pos logit contributions
adow+0.177
 Honey+0.166
elope+0.163
 Flo+0.162
 Bonnie+0.154
Neg logit contributions
 trucks-0.149
 motors-0.139
 gears-0.139
 cubes-0.138
 automobiles-0.134
Token with
Token ID351
Feature activation+2.11e-02

Pos logit contributions
elope+0.128
adow+0.123
opsy+0.119
lor+0.118
oney+0.115
Neg logit contributions
 trucks-0.076
 balls-0.069
 magnets-0.065
 gears-0.065
 motors-0.065
Token the
Token ID262
Feature activation+2.97e-02

Pos logit contributions
adow+0.044
elope+0.041
lor+0.038
comb+0.035
opsy+0.035
Neg logit contributions
 gears-0.035
 trucks-0.032
 motors-0.032
 magnets-0.031
 balls-0.030
Token long
Token ID890
Feature activation+1.92e-02

Pos logit contributions
adow+0.067
elope+0.062
lor+0.060
oney+0.056
 Flo+0.056
Neg logit contributions
 gears-0.044
 trucks-0.040
 motors-0.038
 magnets-0.037
 numbers-0.036
Token neck
Token ID7393
Feature activation+2.63e-02

Pos logit contributions
adow+0.039
elope+0.036
lor+0.034
 Flo+0.032
 perched+0.032
Neg logit contributions
 gears-0.030
 trucks-0.028
 motors-0.027
 colors-0.025
 magnets-0.025
Token couldn
Token ID3521
Feature activation+1.59e-02

Pos logit contributions
 gears+0.012
 trucks+0.011
 motors+0.011
 magnets+0.011
 cubes+0.010
Neg logit contributions
adow-0.017
elope-0.016
lor-0.015
 Flo-0.014
comb-0.014
Token't
Token ID470
Feature activation-1.47e-02

Pos logit contributions
adow+0.038
elope+0.034
 hive+0.032
opsy+0.031
 perched+0.030
Neg logit contributions
 gears-0.021
 motors-0.018
 trucks-0.017
 magnets-0.016
driver-0.016
Token see
Token ID766
Feature activation-1.19e-02

Pos logit contributions
 gears+0.023
 motors+0.021
 trucks+0.020
 magnets+0.020
 trophies+0.019
Neg logit contributions
adow-0.029
elope-0.027
lor-0.025
 perched-0.024
 Flo-0.024
Token what
Token ID644
Feature activation-1.06e-02

Pos logit contributions
 gears+0.020
 motors+0.018
 magnets+0.018
 trucks+0.018
 cubes+0.017
Neg logit contributions
adow-0.022
elope-0.021
 Flo-0.019
 perched-0.018
lor-0.018
Token the
Token ID262
Feature activation+1.05e-02

Pos logit contributions
 gears+0.017
 motors+0.015
 trucks+0.015
 magnets+0.015
 cubes+0.014
Neg logit contributions
adow-0.022
elope-0.020
lor-0.018
 Flo-0.018
comb-0.017
Token ostr
Token ID42387
Feature activation+9.61e-02

Pos logit contributions
adow+0.024
elope+0.022
lor+0.020
 Flo+0.020
oney+0.020
Neg logit contributions
 gears-0.015
 motors-0.013
 trucks-0.013
 magnets-0.013
 cubes-0.012
Tokenich
Token ID488
Feature activation+3.46e-02

Pos logit contributions
adow+0.170
 Honey+0.165
elope+0.158
 Flo+0.156
 Bonnie+0.151
Neg logit contributions
 trucks-0.144
 cubes-0.136
 gears-0.136
 motors-0.134
 automobiles-0.131
Token was
Token ID373
Feature activation+1.03e-02

Pos logit contributions
adow+0.074
elope+0.068
lor+0.066
oney+0.064
 Flo+0.062
Neg logit contributions
 gears-0.046
 trucks-0.045
 motors-0.042
 magnets-0.040
 automobiles-0.039
Token hiding
Token ID11816
Feature activation-1.18e-02

Pos logit contributions
elope+0.021
adow+0.021
lor+0.019
 Flo+0.018
opsy+0.018
Neg logit contributions
 gears-0.015
 motors-0.013
 trucks-0.013
 magnets-0.012
 colors-0.012
Token,
Token ID11
Feature activation+2.59e-03

Pos logit contributions
 gears+0.020
 motors+0.018
 magnets+0.018
 trucks+0.018
 cubes+0.017
Neg logit contributions
adow-0.022
elope-0.021
lor-0.018
 Flo-0.018
 hive-0.017
Token but
Token ID475
Feature activation+3.73e-04

Pos logit contributions
adow+0.005
elope+0.005
lor+0.004
 Flo+0.004
 hive+0.004
Neg logit contributions
 gears-0.004
 motors-0.004
 trucks-0.004
 magnets-0.004
 cubes-0.003
Token
Token ID198
Feature activation-8.86e-04

Pos logit contributions
 gears+0.019
 motors+0.017
 magnets+0.017
 trucks+0.016
 automobiles+0.016
Neg logit contributions
adow-0.020
elope-0.018
 hive-0.015
lor-0.015
 Flo-0.015
Token
Token ID198
Feature activation-1.34e-02

Pos logit contributions
 gears+0.002
 motors+0.002
 magnets+0.002
 trucks+0.002
 automobiles+0.001
Neg logit contributions
adow-0.002
elope-0.001
 hive-0.001
comb-0.001
 Flo-0.001
Token"
Token ID1
Feature activation-7.57e-03

Pos logit contributions
 gears+0.023
 motors+0.021
 trucks+0.021
 magnets+0.020
 cubes+0.020
Neg logit contributions
adow-0.025
elope-0.023
 Flo-0.021
lor-0.020
 perched-0.020
TokenWill
Token ID8743
Feature activation+4.37e-02

Pos logit contributions
 gears+0.013
 trucks+0.011
 motors+0.011
 magnets+0.011
 cubes+0.011
Neg logit contributions
adow-0.015
elope-0.014
 Flo-0.012
lor-0.012
 perched-0.012
Token the
Token ID262
Feature activation+2.49e-02

Pos logit contributions
adow+0.088
elope+0.086
lor+0.079
 cottage+0.076
comb+0.074
Neg logit contributions
 trucks-0.061
 gears-0.061
 colors-0.055
 magnets-0.055
 motors-0.055
Token egg
Token ID5935
Feature activation+9.58e-02

Pos logit contributions
adow+0.047
elope+0.038
ow+0.036
lor+0.035
oney+0.033
Neg logit contributions
 gears-0.044
 colors-0.044
 buttons-0.043
 magnets-0.042
 trucks-0.041
Token be
Token ID307
Feature activation+3.62e-02

Pos logit contributions
elope+0.210
adow+0.190
lor+0.190
 Flo+0.188
 Sof+0.187
Neg logit contributions
 gears-0.105
 trucks-0.098
 magnets-0.098
 colors-0.097
 blocks-0.096
Token warm
Token ID5814
Feature activation+4.78e-02

Pos logit contributions
adow+0.080
elope+0.079
lor+0.075
 Sof+0.067
 Flo+0.067
Neg logit contributions
 gears-0.049
 magnets-0.045
 colors-0.045
 motors-0.044
 cubes-0.043
Token now
Token ID783
Feature activation+4.32e-02

Pos logit contributions
adow+0.091
elope+0.091
lor+0.079
 Flo+0.078
oney+0.072
Neg logit contributions
 gears-0.077
 magnets-0.071
 trucks-0.068
 motors-0.068
 cubes-0.067
Token,
Token ID11
Feature activation+1.59e-02

Pos logit contributions
adow+0.082
elope+0.078
lor+0.068
 Flo+0.066
 hive+0.062
Neg logit contributions
 gears-0.074
 magnets-0.067
 motors-0.067
 trucks-0.066
 cubes-0.064
Token Mom
Token ID11254
Feature activation+4.52e-02

Pos logit contributions
adow+0.033
elope+0.031
lor+0.029
 Flo+0.026
 hive+0.025
Neg logit contributions
 gears-0.026
 magnets-0.023
 trucks-0.023
 motors-0.023
 numbers-0.021
Token"
Token ID1
Feature activation-6.10e-03

Pos logit contributions
 gears+0.005
 motors+0.004
 magnets+0.004
 trucks+0.004
 cubes+0.004
Neg logit contributions
adow-0.007
elope-0.006
 Flo-0.006
 perched-0.006
 hive-0.006
TokenWhat
Token ID2061
Feature activation+1.66e-02

Pos logit contributions
 gears+0.010
 motors+0.008
 trucks+0.008
 magnets+0.008
 cubes+0.008
Neg logit contributions
adow-0.013
elope-0.012
 Flo-0.010
lor-0.010
 perched-0.010
Token is
Token ID318
Feature activation+1.76e-02

Pos logit contributions
adow+0.031
elope+0.031
 Flo+0.028
lor+0.028
 perched+0.026
Neg logit contributions
 gears-0.027
 magnets-0.025
 numbers-0.024
 motors-0.024
 trucks-0.023
Token inside
Token ID2641
Feature activation+4.65e-02

Pos logit contributions
adow+0.035
elope+0.032
lor+0.028
 Flo+0.027
comb+0.026
Neg logit contributions
 gears-0.030
 motors-0.027
 magnets-0.027
 trucks-0.027
 cubes-0.026
Token the
Token ID262
Feature activation+4.22e-02

Pos logit contributions
elope+0.105
adow+0.101
lor+0.092
 Flo+0.090
comb+0.086
Neg logit contributions
 gears-0.055
 trucks-0.052
 magnets-0.051
 cubes-0.050
 colors-0.048
Token egg
Token ID5935
Feature activation+9.57e-02

Pos logit contributions
adow+0.073
elope+0.062
ow+0.055
 Flo+0.053
lor+0.052
Neg logit contributions
 gears-0.085
 trucks-0.079
 motors-0.079
 magnets-0.078
 cubes-0.077
Token,
Token ID11
Feature activation+1.35e-02

Pos logit contributions
elope+0.205
 Flo+0.187
adow+0.185
 Dahl+0.182
lor+0.181
Neg logit contributions
 gears-0.108
 trucks-0.106
 cubes-0.104
 magnets-0.101
 colors-0.099
Token Mom
Token ID11254
Feature activation+5.64e-02

Pos logit contributions
adow+0.029
elope+0.028
lor+0.025
 Flo+0.023
 perched+0.023
Neg logit contributions
 gears-0.020
 magnets-0.018
 colors-0.017
 trucks-0.016
 pencil-0.016
Tokenmy
Token ID1820
Feature activation+5.56e-02

Pos logit contributions
elope+0.121
adow+0.118
 Flo+0.112
lor+0.112
 Dahl+0.103
Neg logit contributions
 gears-0.073
 trucks-0.069
 magnets-0.067
 cubes-0.063
 motors-0.061
Token?"
Token ID1701
Feature activation+8.60e-03

Pos logit contributions
elope+0.106
adow+0.106
 Flo+0.097
lor+0.096
 Dahl+0.090
Neg logit contributions
 gears-0.082
 trucks-0.076
 magnets-0.075
 motors-0.071
 cubes-0.069
Token she
Token ID673
Feature activation+1.40e-02

Pos logit contributions
adow+0.021
elope+0.018
lor+0.017
comb+0.017
 hive+0.016
Neg logit contributions
 gears-0.012
 magnets-0.010
 pencil-0.010
 motors-0.009
 buttons-0.009
Token time
Token ID640
Feature activation-5.09e-03

Pos logit contributions
adow+0.047
elope+0.043
lor+0.042
 Flo+0.041
 perched+0.041
Neg logit contributions
 gears-0.033
 motors-0.029
 trucks-0.028
 magnets-0.028
 cubes-0.027
Token,
Token ID11
Feature activation+5.79e-03

Pos logit contributions
 gears+0.006
 trucks+0.005
 cubes+0.005
 motors+0.005
 magnets+0.005
Neg logit contributions
adow-0.012
elope-0.011
lor-0.011
comb-0.010
 perched-0.010
Token there
Token ID612
Feature activation-1.69e-02

Pos logit contributions
lor+0.012
adow+0.011
elope+0.010
comb+0.010
uf+0.010
Neg logit contributions
 gears-0.008
 magnets-0.008
 cubes-0.008
 blocks-0.008
 trucks-0.007
Token was
Token ID373
Feature activation+3.41e-02

Pos logit contributions
 gears+0.024
 motors+0.021
 trucks+0.021
 magnets+0.020
 cubes+0.020
Neg logit contributions
adow-0.036
elope-0.033
lor-0.031
 hive-0.030
oney-0.030
Token an
Token ID281
Feature activation-2.41e-02

Pos logit contributions
adow+0.067
elope+0.062
lor+0.058
 Flo+0.054
 hive+0.053
Neg logit contributions
 gears-0.057
 motors-0.051
 magnets-0.051
 trucks-0.049
 cubes-0.048
Token ostr
Token ID42387
Feature activation+9.52e-02

Pos logit contributions
 gears+0.041
 motors+0.037
 magnets+0.036
 trucks+0.035
 trophies+0.034
Neg logit contributions
adow-0.046
elope-0.041
 hive-0.036
 Flo-0.036
 perched-0.035
Tokenich
Token ID488
Feature activation+3.54e-02

Pos logit contributions
adow+0.167
elope+0.153
 Honey+0.150
 Flo+0.145
 Merry+0.139
Neg logit contributions
 gears-0.131
 cubes-0.129
 automobiles-0.125
 trucks-0.124
 motors-0.124
Token who
Token ID508
Feature activation+7.52e-03

Pos logit contributions
adow+0.060
elope+0.055
lor+0.052
 Flo+0.050
comb+0.050
Neg logit contributions
 gears-0.062
 trucks-0.058
 motors-0.057
 cubes-0.055
 magnets-0.054
Token liked
Token ID8288
Feature activation+2.17e-03

Pos logit contributions
adow+0.012
elope+0.011
lor+0.010
 Flo+0.010
comb+0.010
Neg logit contributions
 gears-0.015
 motors-0.013
 trucks-0.013
 magnets-0.013
 cubes-0.012
Token to
Token ID284
Feature activation+1.53e-02

Pos logit contributions
adow+0.004
elope+0.004
 Flo+0.003
 perched+0.003
 hive+0.003
Neg logit contributions
 gears-0.004
 motors-0.003
 magnets-0.003
 trucks-0.003
 cubes-0.003
Token celebrate
Token ID10648
Feature activation-5.20e-03

Pos logit contributions
adow+0.034
elope+0.032
lor+0.030
oney+0.029
 perched+0.029
Neg logit contributions
 gears-0.020
 trucks-0.018
 motors-0.018
 magnets-0.017
 balls-0.016
Token,
Token ID11
Feature activation+2.58e-02

Pos logit contributions
adow+0.057
elope+0.053
lor+0.051
 Flo+0.049
opsy+0.047
Neg logit contributions
 gears-0.031
 trucks-0.029
 magnets-0.026
 balls-0.025
 motors-0.025
Token there
Token ID612
Feature activation+3.48e-02

Pos logit contributions
adow+0.054
elope+0.050
lor+0.046
oney+0.042
ossom+0.041
Neg logit contributions
 gears-0.040
 trucks-0.039
 magnets-0.037
 motors-0.036
 balls-0.036
Token is
Token ID318
Feature activation+7.46e-02

Pos logit contributions
adow+0.077
elope+0.072
oney+0.069
lor+0.068
opsy+0.065
Neg logit contributions
 trucks-0.039
 gears-0.037
 colors-0.037
 balls-0.036
 numbers-0.036
Token the
Token ID262
Feature activation+3.21e-02

Pos logit contributions
adow+0.192
lor+0.179
oney+0.166
elope+0.166
anna+0.165
Neg logit contributions
 trucks-0.076
 balls-0.071
 blocks-0.069
 numbers-0.069
 colors-0.069
Token queen
Token ID16599
Feature activation+8.35e-02

Pos logit contributions
adow+0.071
elope+0.065
lor+0.064
oney+0.060
 perched+0.057
Neg logit contributions
 gears-0.044
 colors-0.042
 numbers-0.041
 trucks-0.041
 magnets-0.040
Token bee
Token ID20697
Feature activation+9.46e-02

Pos logit contributions
elope+0.168
 symp+0.167
apa+0.165
 Maur+0.164
oney+0.162
Neg logit contributions
 balls-0.090
 blocks-0.087
 trucks-0.085
 colors-0.081
 magnets-0.080
Token!"
Token ID2474
Feature activation+1.41e-03

Pos logit contributions
apa+0.184
elope+0.179
oney+0.179
adow+0.179
 Maur+0.178
Neg logit contributions
 balls-0.108
 trucks-0.101
 blocks-0.096
 colors-0.094
 magnets-0.089
Token Ben
Token ID3932
Feature activation-2.90e-03

Pos logit contributions
adow+0.003
elope+0.003
lor+0.003
comb+0.003
 perched+0.002
Neg logit contributions
 gears-0.002
 trucks-0.002
 magnets-0.002
 motors-0.002
 cubes-0.002
Token said
Token ID531
Feature activation+1.95e-02

Pos logit contributions
 gears+0.005
 trucks+0.005
 motors+0.004
 magnets+0.004
 cubes+0.004
Neg logit contributions
adow-0.006
elope-0.005
lor-0.005
 Flo-0.004
 hive-0.004
Token.
Token ID13
Feature activation-1.66e-02

Pos logit contributions
adow+0.040
elope+0.039
lor+0.037
 hive+0.034
 perched+0.034
Neg logit contributions
 gears-0.028
 magnets-0.025
 trucks-0.025
 motors-0.025
 cubes-0.024
Token He
Token ID679
Feature activation+5.57e-03

Pos logit contributions
 gears+0.032
 motors+0.029
 trucks+0.029
 magnets+0.028
 trophies+0.028
Neg logit contributions
adow-0.027
elope-0.025
 Flo-0.023
lor-0.021
 perched-0.021
Token in
Token ID287
Feature activation+4.78e-02

Pos logit contributions
elope+0.122
adow+0.117
 Flo+0.109
lor+0.108
 Maur+0.108
Neg logit contributions
 gears-0.088
 trucks-0.083
 cubes-0.080
 magnets-0.078
 motors-0.078
Token the
Token ID262
Feature activation+6.34e-02

Pos logit contributions
adow+0.107
elope+0.095
opsy+0.085
oney+0.084
comb+0.084
Neg logit contributions
 gears-0.070
 magnets-0.063
 motors-0.062
 trucks-0.061
 cubes-0.057
Token branches
Token ID13737
Feature activation+6.66e-02

Pos logit contributions
adow+0.150
oney+0.143
owed+0.132
elope+0.132
ow+0.129
Neg logit contributions
 gears-0.074
 colors-0.066
 blocks-0.065
 trucks-0.062
 magnets-0.059
Token of
Token ID286
Feature activation+5.57e-02

Pos logit contributions
elope+0.135
adow+0.125
 Flo+0.120
lor+0.118
opsy+0.118
Neg logit contributions
 gears-0.088
 motors-0.079
 cubes-0.078
 trucks-0.077
 magnets-0.077
Token the
Token ID262
Feature activation+5.26e-02

Pos logit contributions
adow+0.140
elope+0.125
lor+0.121
ossom+0.118
oney+0.118
Neg logit contributions
 gears-0.074
 trucks-0.068
 magnets-0.068
 motors-0.065
 blocks-0.064
Token tree
Token ID5509
Feature activation+9.45e-02

Pos logit contributions
adow+0.123
oney+0.113
lor+0.113
elope+0.110
ow+0.109
Neg logit contributions
 gears-0.063
 colors-0.056
 blocks-0.055
 trucks-0.055
 numbers-0.051
Token.
Token ID13
Feature activation+4.64e-02

Pos logit contributions
elope+0.182
 Maur+0.180
opsy+0.179
adow+0.175
 Flo+0.174
Neg logit contributions
 gears-0.111
 trucks-0.107
 blocks-0.104
 magnets-0.102
 colors-0.102
Token
Token ID220
Feature activation+2.62e-02

Pos logit contributions
adow+0.084
 hive+0.072
elope+0.072
comb+0.069
opsy+0.068
Neg logit contributions
 gears-0.080
 magnets-0.075
 motors-0.074
 automobiles-0.070
 trucks-0.070
Token
Token ID198
Feature activation+5.24e-03

Pos logit contributions
adow+0.050
 hive+0.048
elope+0.047
 thirst+0.045
comb+0.044
Neg logit contributions
 gears-0.038
 magnets-0.035
 motors-0.033
driver-0.033
 automobiles-0.032
Token
Token ID198
Feature activation-8.09e-03

Pos logit contributions
adow+0.010
elope+0.009
 hive+0.008
ow+0.008
 Flo+0.008
Neg logit contributions
 gears-0.009
 motors-0.008
 magnets-0.008
 cubes-0.007
 trucks-0.007
TokenJack
Token ID14295
Feature activation-3.73e-03

Pos logit contributions
 gears+0.013
 motors+0.011
 trucks+0.011
 magnets+0.011
 cubes+0.011
Neg logit contributions
adow-0.017
elope-0.015
 Flo-0.014
lor-0.014
 perched-0.013
Token apple
Token ID17180
Feature activation+3.11e-02

Pos logit contributions
adow+0.063
elope+0.058
lor+0.055
 Flo+0.053
 perched+0.050
Neg logit contributions
 gears-0.038
 magnets-0.034
 trucks-0.034
 motors-0.032
 cubes-0.032
Token and
Token ID290
Feature activation+1.26e-03

Pos logit contributions
adow+0.070
elope+0.065
lor+0.059
 Flo+0.058
 perched+0.057
Neg logit contributions
 gears-0.043
 magnets-0.037
 trucks-0.037
 motors-0.037
 cubes-0.035
Token slowly
Token ID6364
Feature activation+5.90e-03

Pos logit contributions
adow+0.003
elope+0.002
lor+0.002
 Flo+0.002
 perched+0.002
Neg logit contributions
 gears-0.002
 motors-0.002
 trucks-0.002
 magnets-0.002
 cubes-0.002
Token approached
Token ID10448
Feature activation+2.26e-02

Pos logit contributions
adow+0.012
elope+0.012
lor+0.010
 Flo+0.010
 hive+0.010
Neg logit contributions
 gears-0.009
 motors-0.008
 trucks-0.008
 magnets-0.008
 cubes-0.007
Token the
Token ID262
Feature activation+3.96e-02

Pos logit contributions
adow+0.052
elope+0.047
lor+0.045
 hive+0.042
comb+0.041
Neg logit contributions
 gears-0.032
 magnets-0.029
 trucks-0.029
 motors-0.028
 numbers-0.027
Token ostr
Token ID42387
Feature activation+9.42e-02

Pos logit contributions
adow+0.091
lor+0.079
elope+0.077
oney+0.072
 Flo+0.072
Neg logit contributions
 trucks-0.056
 gears-0.056
 balls-0.051
 colors-0.051
 motors-0.051
Tokenich
Token ID488
Feature activation+6.70e-02

Pos logit contributions
adow+0.173
 Honey+0.161
 Flo+0.155
 Bonnie+0.154
elope+0.151
Neg logit contributions
 trucks-0.137
 motors-0.132
 gears-0.129
 cubes-0.126
 automobiles-0.126
Token.
Token ID13
Feature activation+6.24e-03

Pos logit contributions
adow+0.143
elope+0.139
lor+0.136
opsy+0.130
 Flo+0.128
Neg logit contributions
 gears-0.080
 magnets-0.079
 trucks-0.079
 balls-0.074
 cubes-0.071
Token
Token ID220
Feature activation+5.51e-03

Pos logit contributions
adow+0.012
elope+0.010
lor+0.010
 hive+0.009
 perched+0.009
Neg logit contributions
 gears-0.011
 magnets-0.010
 trucks-0.010
 motors-0.010
 numbers-0.009
Token
Token ID198
Feature activation+9.61e-03

Pos logit contributions
adow+0.010
elope+0.010
 hive+0.009
comb+0.009
lor+0.009
Neg logit contributions
 gears-0.009
 magnets-0.008
 motors-0.008
 trucks-0.007
 cubes-0.007
Token
Token ID198
Feature activation-1.16e-03

Pos logit contributions
adow+0.016
elope+0.014
 hive+0.012
 perched+0.012
lor+0.012
Neg logit contributions
 gears-0.019
 motors-0.017
 magnets-0.017
 trucks-0.017
 cubes-0.016
Token her
Token ID607
Feature activation+3.41e-02

Pos logit contributions
adow+0.075
lor+0.074
elope+0.066
ossom+0.062
comb+0.061
Neg logit contributions
 trucks-0.044
 gears-0.041
 motors-0.040
 automobiles-0.037
 magnets-0.037
Token day
Token ID1110
Feature activation+2.87e-02

Pos logit contributions
adow+0.069
lor+0.068
elope+0.061
 Flo+0.060
oney+0.057
Neg logit contributions
 gears-0.050
 trucks-0.048
 magnets-0.045
 motors-0.045
 cubes-0.043
Token in
Token ID287
Feature activation+3.34e-02

Pos logit contributions
adow+0.061
elope+0.061
lor+0.058
 Flo+0.056
Pen+0.053
Neg logit contributions
 gears-0.033
 magnets-0.031
 motors-0.030
 colors-0.028
 balls-0.028
Token the
Token ID262
Feature activation+5.74e-02

Pos logit contributions
adow+0.076
elope+0.068
lor+0.066
oney+0.062
ossom+0.061
Neg logit contributions
 gears-0.042
 magnets-0.038
 motors-0.037
 trucks-0.035
 numbers-0.035
Token high
Token ID1029
Feature activation+4.35e-02

Pos logit contributions
adow+0.127
lor+0.118
oney+0.116
comb+0.114
owed+0.113
Neg logit contributions
 gears-0.070
 trucks-0.067
 colors-0.067
 motors-0.063
 cubes-0.063
Token o
Token ID267
Feature activation+9.36e-02

Pos logit contributions
adow+0.088
elope+0.079
lor+0.077
oney+0.075
ow+0.071
Neg logit contributions
 gears-0.066
 trucks-0.064
 colors-0.061
 motors-0.061
 cubes-0.058
Tokenasis
Token ID17765
Feature activation+6.65e-02

Pos logit contributions
 Flo+0.119
elope+0.119
 Clover+0.117
 Honey+0.117
 Pen+0.117
Neg logit contributions
 cubes-0.162
 trucks-0.155
 gears-0.153
 motors-0.147
 sculptures-0.144
Token.
Token ID13
Feature activation+2.82e-02

Pos logit contributions
adow+0.135
elope+0.135
opsy+0.129
 Flo+0.129
 Maur+0.126
Neg logit contributions
 gears-0.074
 trucks-0.072
 cubes-0.068
 colors-0.067
 motors-0.066
Token<|endoftext|>
Token ID50256
Feature activation+3.06e-02

Pos logit contributions
adow+0.063
comb+0.057
 hive+0.054
ocket+0.054
 hatched+0.054
Neg logit contributions
 gears-0.026
 automobiles-0.023
 magnets-0.023
 motors-0.022
 robots-0.022
TokenOnce
Token ID7454
Feature activation+9.51e-03

Pos logit contributions
adow+0.063
comb+0.058
oney+0.057
elope+0.056
ossom+0.055
Neg logit contributions
 gears-0.039
 blocks-0.034
 magnets-0.034
 balls-0.033
 motors-0.033
Token upon
Token ID2402
Feature activation+5.40e-02

Pos logit contributions
elope+0.014
adow+0.014
lor+0.014
comb+0.013
tail+0.012
Neg logit contributions
 gears-0.018
 cubes-0.017
 magnets-0.016
 trucks-0.016
 blocks-0.016
Token and
Token ID290
Feature activation+3.65e-02

Pos logit contributions
elope+0.093
adow+0.092
opsy+0.085
lor+0.085
 Flo+0.081
Neg logit contributions
 gears-0.073
 trucks-0.067
 numbers-0.063
 motors-0.063
driver-0.062
Token I
Token ID314
Feature activation+3.75e-02

Pos logit contributions
adow+0.074
elope+0.069
lor+0.065
opsy+0.060
 Flo+0.059
Neg logit contributions
 gears-0.056
 trucks-0.052
 magnets-0.050
 numbers-0.049
 motors-0.047
Token will
Token ID481
Feature activation+3.22e-02

Pos logit contributions
elope+0.074
adow+0.074
lor+0.070
 Flo+0.068
oney+0.066
Neg logit contributions
 gears-0.046
 motors-0.038
 trucks-0.037
 bang-0.037
 numbers-0.037
Token free
Token ID1479
Feature activation+4.26e-02

Pos logit contributions
adow+0.076
elope+0.071
lor+0.065
 perched+0.062
 Flo+0.062
Neg logit contributions
 gears-0.037
 motors-0.031
 numbers-0.030
 trucks-0.029
 magnets-0.029
Token the
Token ID262
Feature activation+3.19e-02

Pos logit contributions
adow+0.083
elope+0.078
lor+0.071
opsy+0.070
oney+0.066
Neg logit contributions
 gears-0.068
 trucks-0.063
 motors-0.062
 magnets-0.061
 numbers-0.061
Token bird
Token ID6512
Feature activation+9.35e-02

Pos logit contributions
adow+0.075
elope+0.066
lor+0.066
oney+0.063
ow+0.061
Neg logit contributions
 gears-0.042
 trucks-0.037
 colors-0.037
 numbers-0.036
 blocks-0.036
Token,"
Token ID553
Feature activation+3.52e-02

Pos logit contributions
uf+0.177
opsy+0.177
lor+0.174
 Maur+0.171
elope+0.171
Neg logit contributions
 trucks-0.111
 balls-0.102
 colors-0.100
 race-0.099
 blocks-0.099
Token said
Token ID531
Feature activation+4.42e-02

Pos logit contributions
adow+0.099
lor+0.090
elope+0.084
comb+0.082
ossom+0.080
Neg logit contributions
 gears-0.027
 numbers-0.025
 buttons-0.025
 pencil-0.025
 magnets-0.024
Token the
Token ID262
Feature activation+3.36e-02

Pos logit contributions
adow+0.131
lor+0.121
asis+0.113
elope+0.111
hew+0.106
Neg logit contributions
 pencil-0.021
 numbers-0.021
 gears-0.020
 magnets-0.020
 swirl-0.019
Token voice
Token ID3809
Feature activation+2.68e-02

Pos logit contributions
adow+0.081
elope+0.072
oney+0.070
lor+0.070
comb+0.066
Neg logit contributions
 gears-0.039
 colors-0.036
 magnets-0.034
 numbers-0.034
 blocks-0.034
Token.
Token ID13
Feature activation+3.96e-02

Pos logit contributions
adow+0.058
elope+0.053
lor+0.052
opsy+0.051
 hive+0.049
Neg logit contributions
 gears-0.034
 buttons-0.031
 motors-0.031
 trucks-0.030
driver-0.029
Token a
Token ID257
Feature activation+1.78e-02

Pos logit contributions
adow+0.124
elope+0.120
lor+0.116
oney+0.113
 hive+0.111
Neg logit contributions
 gears-0.062
 magnets-0.053
 balls-0.052
 motors-0.051
 numbers-0.049
Token time
Token ID640
Feature activation-1.96e-03

Pos logit contributions
adow+0.036
elope+0.033
lor+0.031
 Flo+0.031
 perched+0.031
Neg logit contributions
 gears-0.026
 motors-0.023
 magnets-0.023
 trucks-0.023
 cubes-0.022
Token there
Token ID612
Feature activation-1.35e-02

Pos logit contributions
 gears+0.002
 trucks+0.002
 cubes+0.002
 motors+0.002
 magnets+0.002
Neg logit contributions
adow-0.004
elope-0.004
lor-0.004
 perched-0.004
 hive-0.004
Token was
Token ID373
Feature activation+3.79e-02

Pos logit contributions
 gears+0.018
 trucks+0.016
 motors+0.016
 cubes+0.015
 magnets+0.015
Neg logit contributions
adow-0.029
elope-0.027
lor-0.026
 hive-0.025
oney-0.025
Token an
Token ID281
Feature activation-2.16e-02

Pos logit contributions
adow+0.077
elope+0.071
lor+0.070
 Flo+0.062
 hive+0.062
Neg logit contributions
 gears-0.060
 magnets-0.054
 motors-0.053
 trucks-0.051
 cubes-0.050
Token ostr
Token ID42387
Feature activation+9.34e-02

Pos logit contributions
 gears+0.037
 motors+0.033
 magnets+0.032
 trucks+0.032
 trophies+0.030
Neg logit contributions
adow-0.042
elope-0.038
 Flo-0.033
 hive-0.033
 perched-0.032
Tokenich
Token ID488
Feature activation+3.94e-02

Pos logit contributions
adow+0.163
elope+0.148
 Honey+0.145
 Flo+0.142
 Merry+0.136
Neg logit contributions
 gears-0.136
 trucks-0.135
 cubes-0.135
 motors-0.132
 automobiles-0.129
Token.
Token ID13
Feature activation-1.24e-02

Pos logit contributions
adow+0.071
elope+0.066
lor+0.064
 Flo+0.061
comb+0.061
Neg logit contributions
 gears-0.062
 trucks-0.059
 motors-0.056
 cubes-0.055
 magnets-0.055
Token His
Token ID2399
Feature activation+2.90e-02

Pos logit contributions
 gears+0.023
 trucks+0.021
 magnets+0.021
 motors+0.021
 automobiles+0.020
Neg logit contributions
adow-0.022
elope-0.019
lor-0.019
 hive-0.017
 perched-0.017
Token name
Token ID1438
Feature activation-2.10e-02

Pos logit contributions
adow+0.058
lor+0.054
elope+0.054
oney+0.050
comb+0.049
Neg logit contributions
 gears-0.040
 trucks-0.037
 colors-0.036
 motors-0.035
 numbers-0.035
Token was
Token ID373
Feature activation+1.90e-02

Pos logit contributions
 gears+0.039
 motors+0.036
 trucks+0.035
 magnets+0.034
 cubes+0.033
Neg logit contributions
adow-0.037
elope-0.034
 Flo-0.030
lor-0.029
 perched-0.029
Token walked
Token ID6807
Feature activation+4.02e-02

Pos logit contributions
adow+0.062
elope+0.060
lor+0.060
oney+0.055
 Flo+0.054
Neg logit contributions
 gears-0.035
 motors-0.032
 trucks-0.032
 magnets-0.029
 balls-0.028
Token,
Token ID11
Feature activation+2.70e-02

Pos logit contributions
adow+0.089
elope+0.086
lor+0.081
oney+0.078
comb+0.078
Neg logit contributions
 gears-0.053
 trucks-0.046
 motors-0.046
 balls-0.042
 bikes-0.041
Token they
Token ID484
Feature activation+1.13e-02

Pos logit contributions
adow+0.061
elope+0.057
lor+0.056
comb+0.051
 hive+0.050
Neg logit contributions
 gears-0.036
 trucks-0.033
 motors-0.033
 magnets-0.029
 balls-0.029
Token saw
Token ID2497
Feature activation+1.61e-02

Pos logit contributions
adow+0.025
elope+0.024
lor+0.023
oney+0.022
 Flo+0.021
Neg logit contributions
 gears-0.014
 motors-0.013
 trucks-0.012
 balls-0.011
 trophies-0.011
Token an
Token ID281
Feature activation-1.01e-02

Pos logit contributions
adow+0.032
elope+0.029
lor+0.027
oney+0.026
comb+0.025
Neg logit contributions
 gears-0.028
 motors-0.026
 trucks-0.025
 magnets-0.024
 numbers-0.023
Token ostr
Token ID42387
Feature activation+9.31e-02

Pos logit contributions
 gears+0.016
 motors+0.015
 magnets+0.014
 trucks+0.014
 cubes+0.014
Neg logit contributions
adow-0.020
elope-0.019
 Flo-0.016
 perched-0.016
 hive-0.016
Tokenich
Token ID488
Feature activation+5.47e-02

Pos logit contributions
adow+0.160
 Honey+0.149
elope+0.145
 Flo+0.139
oney+0.133
Neg logit contributions
 trucks-0.152
 gears-0.146
 motors-0.143
 cubes-0.141
 automobiles-0.137
Token.
Token ID13
Feature activation+2.14e-02

Pos logit contributions
adow+0.116
oney+0.114
elope+0.113
opsy+0.109
 Maur+0.108
Neg logit contributions
 trucks-0.066
 gears-0.060
 motors-0.058
 automobiles-0.057
 balls-0.056
Token "
Token ID366
Feature activation+5.09e-03

Pos logit contributions
adow+0.053
comb+0.045
lor+0.044
 hive+0.044
elope+0.044
Neg logit contributions
 gears-0.025
 motors-0.023
 trucks-0.023
 magnets-0.023
 automobiles-0.022
TokenLook
Token ID8567
Feature activation+2.70e-02

Pos logit contributions
adow+0.011
elope+0.010
 Flo+0.009
lor+0.009
 perched+0.009
Neg logit contributions
 gears-0.007
 motors-0.007
 trucks-0.006
 magnets-0.006
 cubes-0.006
Token!"
Token ID2474
Feature activation+2.15e-02

Pos logit contributions
adow+0.062
elope+0.061
lor+0.056
 hive+0.052
 Flo+0.052
Neg logit contributions
 gears-0.032
 trucks-0.030
 cubes-0.029
 magnets-0.027
 motors-0.027
Token melt
Token ID16867
Feature activation+1.42e-02

Pos logit contributions
adow+0.069
elope+0.064
lor+0.058
 Flo+0.056
oney+0.056
Neg logit contributions
 gears-0.045
 motors-0.040
 trucks-0.040
 magnets-0.039
 cubes-0.037
Token the
Token ID262
Feature activation+4.82e-02

Pos logit contributions
adow+0.028
elope+0.026
lor+0.024
 Flo+0.024
oney+0.023
Neg logit contributions
 gears-0.024
 motors-0.021
 trucks-0.021
 magnets-0.020
 cubes-0.020
Token water
Token ID1660
Feature activation+3.27e-02

Pos logit contributions
adow+0.117
elope+0.107
lor+0.104
oney+0.099
ow+0.098
Neg logit contributions
 gears-0.057
 trucks-0.054
 colors-0.053
 motors-0.051
 cubes-0.049
Token in
Token ID287
Feature activation+2.24e-02

Pos logit contributions
adow+0.065
elope+0.062
lor+0.055
 Flo+0.054
oney+0.052
Neg logit contributions
 gears-0.052
 motors-0.048
 trucks-0.047
 magnets-0.046
 cubes-0.045
Token the
Token ID262
Feature activation+6.69e-02

Pos logit contributions
adow+0.046
elope+0.041
lor+0.037
 Flo+0.036
 perched+0.036
Neg logit contributions
 gears-0.037
 motors-0.033
 magnets-0.032
 trucks-0.032
 cubes-0.031
Token o
Token ID267
Feature activation+9.31e-02

Pos logit contributions
adow+0.167
elope+0.156
oney+0.151
lor+0.148
comb+0.147
Neg logit contributions
 gears-0.065
 colors-0.061
 trucks-0.059
 cubes-0.057
 motors-0.056
Tokenasis
Token ID17765
Feature activation+6.24e-02

Pos logit contributions
elope+0.112
 Flo+0.107
 perched+0.105
 Pen+0.103
adow+0.101
Neg logit contributions
 gears-0.176
 cubes-0.176
 trucks-0.172
 motors-0.164
 boards-0.160
Token.
Token ID13
Feature activation-8.14e-03

Pos logit contributions
adow+0.127
elope+0.126
opsy+0.117
 Flo+0.117
oney+0.112
Neg logit contributions
 gears-0.084
 trucks-0.078
 motors-0.074
 cubes-0.074
 colors-0.071
Token All
Token ID1439
Feature activation-4.19e-02

Pos logit contributions
 gears+0.012
 magnets+0.010
 motors+0.010
 trucks+0.010
 numbers+0.009
Neg logit contributions
adow-0.018
elope-0.016
lor-0.016
 hive-0.016
 perched-0.015
Token the
Token ID262
Feature activation+9.22e-03

Pos logit contributions
 gears+0.074
 motors+0.068
 magnets+0.065
 trucks+0.065
 automobiles+0.062
Neg logit contributions
adow-0.078
elope-0.071
 Flo-0.063
 perched-0.061
 hive-0.060
Token water
Token ID1660
Feature activation+4.20e-03

Pos logit contributions
adow+0.021
elope+0.020
lor+0.019
comb+0.018
 Flo+0.017
Neg logit contributions
 gears-0.013
 trucks-0.011
 motors-0.011
 cubes-0.010
 colors-0.010
Token on
Token ID319
Feature activation-2.95e-02

Pos logit contributions
 gears+0.107
 motors+0.099
 magnets+0.097
 trucks+0.095
 trophies+0.094
Neg logit contributions
adow-0.074
elope-0.057
ow-0.054
 perched-0.054
 Flo-0.050
Token top
Token ID1353
Feature activation-2.70e-02

Pos logit contributions
 gears+0.056
 motors+0.052
 trucks+0.052
 trophies+0.051
 cubes+0.051
Neg logit contributions
elope-0.036
adow-0.033
 Flo-0.030
 perched-0.028
c-0.026
Token.
Token ID13
Feature activation-2.25e-02

Pos logit contributions
 gears+0.055
 motors+0.051
 magnets+0.050
 trophies+0.049
 trucks+0.049
Neg logit contributions
adow-0.039
elope-0.032
 Flo-0.027
ow-0.027
 perched-0.027
Token They
Token ID1119
Feature activation-4.92e-02

Pos logit contributions
 gears+0.031
 motors+0.027
 trucks+0.026
 magnets+0.026
 automobiles+0.024
Neg logit contributions
adow-0.054
elope-0.048
lor-0.045
 hive-0.044
 perched-0.043
Token use
Token ID779
Feature activation-7.74e-02

Pos logit contributions
 automobiles+0.081
 gears+0.080
 motors+0.079
ruck+0.073
irens+0.073
Neg logit contributions
adow-0.073
elope-0.060
�-0.058
abel-0.057
ow-0.055
Token colorful
Token ID20239
Feature activation-1.13e-01

Pos logit contributions
about+0.126
 referee+0.123
 gears+0.123
 sculptures+0.122
 trophies+0.120
Neg logit contributions
adow-0.089
 perched-0.081
c-0.079
elope-0.076
ow-0.072
Token gears
Token ID28713
Feature activation-7.41e-02

Pos logit contributions
 photographer+0.108
 referee+0.102
 reporter+0.097
 trophy+0.097
 nerd+0.095
Neg logit contributions
c-0.193
adow-0.187
 tal-0.185
's-0.171
�-0.170
Token for
Token ID329
Feature activation-5.40e-02

Pos logit contributions
 gears+0.143
 motors+0.133
 magnets+0.130
 trophies+0.128
 trucks+0.124
Neg logit contributions
adow-0.108
ow-0.088
 perched-0.080
elope-0.075
comb-0.070
Token the
Token ID262
Feature activation-5.95e-02

Pos logit contributions
about+0.075
 gears+0.068
ubes+0.066
 trophy+0.065
 cubes+0.064
Neg logit contributions
elope-0.077
adow-0.070
 perched-0.067
 Flo-0.066
 symp-0.064
Token eyes
Token ID2951
Feature activation-1.35e-02

Pos logit contributions
ubes+0.054
about+0.054
 motors+0.053
 gears+0.052
hing+0.049
Neg logit contributions
adow-0.110
elope-0.106
 hive-0.096
 ther-0.095
 cottage-0.093
Token,
Token ID11
Feature activation-4.58e-02

Pos logit contributions
 gears+0.023
 motors+0.021
 magnets+0.020
 trucks+0.020
 trophies+0.020
Neg logit contributions
adow-0.025
elope-0.022
 Flo-0.020
 perched-0.019
comb-0.019
Token was
Token ID373
Feature activation+2.32e-02

Pos logit contributions
 gears+0.023
 motors+0.020
 trucks+0.020
 cubes+0.019
 magnets+0.019
Neg logit contributions
adow-0.034
elope-0.031
lor-0.030
 hive-0.029
comb-0.028
Token a
Token ID257
Feature activation+6.83e-03

Pos logit contributions
adow+0.041
elope+0.038
lor+0.033
 Flo+0.033
 perched+0.032
Neg logit contributions
 gears-0.043
 motors-0.039
 magnets-0.038
 trucks-0.038
 cubes-0.037
Token boy
Token ID2933
Feature activation-5.06e-03

Pos logit contributions
adow+0.014
elope+0.013
lor+0.012
 Flo+0.012
 perched+0.012
Neg logit contributions
 gears-0.010
 motors-0.009
 trucks-0.009
 magnets-0.009
 cubes-0.009
Token named
Token ID3706
Feature activation+1.00e-02

Pos logit contributions
 gears+0.011
 motors+0.011
 trucks+0.011
 magnets+0.010
 cubes+0.010
Neg logit contributions
adow-0.007
elope-0.006
lor-0.005
 Flo-0.005
 perched-0.005
Token Ryan
Token ID6047
Feature activation-3.98e-02

Pos logit contributions
adow+0.024
elope+0.023
lor+0.022
 hive+0.021
comb+0.020
Neg logit contributions
 gears-0.014
 trucks-0.012
 motors-0.012
 magnets-0.011
 cubes-0.011
Token.
Token ID13
Feature activation-1.07e-01

Pos logit contributions
 gears+0.082
 motors+0.075
 trucks+0.073
 magnets+0.073
 cubes+0.070
Neg logit contributions
adow-0.064
elope-0.057
 Flo-0.048
 perched-0.047
lor-0.047
Token He
Token ID679
Feature activation-2.34e-02

Pos logit contributions
 cubes+0.218
 trucks+0.213
 motors+0.213
 chips+0.212
 pies+0.212
Neg logit contributions
 -0.084
adow-0.076
 Just-0.066
 Today-0.063
 Sof-0.061
Token was
Token ID373
Feature activation-2.01e-02

Pos logit contributions
 gears+0.033
 motors+0.028
 trucks+0.028
 magnets+0.027
 cubes+0.026
Neg logit contributions
adow-0.053
elope-0.048
lor-0.045
 Flo-0.045
 hive-0.043
Token three
Token ID1115
Feature activation-2.73e-02

Pos logit contributions
 gears+0.036
 motors+0.033
 trucks+0.032
 magnets+0.032
 cubes+0.031
Neg logit contributions
adow-0.037
elope-0.034
lor-0.031
 Flo-0.030
comb-0.029
Token years
Token ID812
Feature activation-9.72e-03

Pos logit contributions
 gears+0.064
 motors+0.060
 magnets+0.058
 trucks+0.057
 trophies+0.055
Neg logit contributions
adow-0.036
elope-0.030
 hive-0.023
 Flo-0.022
 perched-0.022
Token old
Token ID1468
Feature activation-5.08e-02

Pos logit contributions
 gears+0.015
 motors+0.014
 trucks+0.013
 magnets+0.013
 cubes+0.013
Neg logit contributions
adow-0.020
elope-0.018
 Flo-0.016
 hive-0.016
 perched-0.016
Token They
Token ID1119
Feature activation-7.16e-02

Pos logit contributions
 gears+0.085
 motors+0.077
 magnets+0.074
 trucks+0.074
 trophies+0.072
Neg logit contributions
adow-0.082
elope-0.074
 Flo-0.066
 hive-0.062
 perched-0.061
Token have
Token ID423
Feature activation-4.74e-02

Pos logit contributions
 gears+0.155
 motors+0.150
irens+0.145
 automobiles+0.145
ruck+0.143
Neg logit contributions
adow-0.081
elope-0.068
�-0.060
 Flo-0.056
 hive-0.053
Token many
Token ID867
Feature activation-1.05e-01

Pos logit contributions
 gears+0.092
 trophy+0.085
 magnets+0.085
 motors+0.084
 trophies+0.084
Neg logit contributions
adow-0.065
elope-0.058
 Flo-0.051
 hive-0.050
 perched-0.048
Token tubes
Token ID21103
Feature activation-8.11e-02

Pos logit contributions
 trophy+0.141
 referee+0.140
 photographer+0.139
driver+0.138
 motorcycle+0.136
Neg logit contributions
adow-0.156
c-0.151
 upon-0.138
 ther-0.131
elope-0.130
Token of
Token ID286
Feature activation-7.98e-02

Pos logit contributions
 gears+0.175
 motors+0.170
 levers+0.160
 motorcycles+0.157
 trucks+0.156
Neg logit contributions
adow-0.108
ow-0.074
 hive-0.071
bug-0.067
 Flo-0.066
Token different
Token ID1180
Feature activation-1.06e-01

Pos logit contributions
 motorcycle+0.135
 trophy+0.133
 referee+0.128
 trophies+0.126
bike+0.125
Neg logit contributions
adow-0.100
elope-0.086
c-0.085
 ther-0.083
 upon-0.083
Token colors
Token ID7577
Feature activation-7.46e-02

Pos logit contributions
 cop+0.123
 motorcycle+0.120
 trophy+0.119
 referee+0.118
 rider+0.111
Neg logit contributions
adow-0.209
c-0.178
elope-0.168
 upon-0.163
ow-0.157
Token and
Token ID290
Feature activation-5.91e-02

Pos logit contributions
 gears+0.148
 motors+0.135
 trophies+0.133
 trucks+0.130
 magnets+0.127
Neg logit contributions
adow-0.111
 perched-0.082
elope-0.080
 Flo-0.079
ow-0.077
Token sizes
Token ID10620
Feature activation-7.13e-02

Pos logit contributions
 gears+0.053
about+0.048
 motorcycle+0.047
 motors+0.046
 trophy+0.046
Neg logit contributions
adow-0.125
elope-0.122
 Flo-0.110
 perched-0.104
 Honey-0.103
Token.
Token ID13
Feature activation-3.59e-02

Pos logit contributions
 gears+0.178
 motors+0.164
 trucks+0.163
 trophies+0.156
 magnets+0.156
Neg logit contributions
adow-0.073
elope-0.044
 Flo-0.040
 hive-0.038
 perched-0.038
Token They
Token ID1119
Feature activation-6.69e-02

Pos logit contributions
 gears+0.062
 motors+0.057
 trucks+0.057
 magnets+0.055
 cubes+0.054
Neg logit contributions
adow-0.068
elope-0.062
lor-0.058
 perched-0.055
comb-0.053
Token with
Token ID351
Feature activation-1.84e-02

Pos logit contributions
 gears+0.054
 motors+0.050
 trucks+0.049
 magnets+0.047
 automobiles+0.046
Neg logit contributions
adow-0.047
elope-0.042
 Flo-0.037
 perched-0.036
 hive-0.036
Token tubes
Token ID21103
Feature activation-6.74e-02

Pos logit contributions
 gears+0.028
 motors+0.025
driver+0.025
ruck+0.024
hing+0.024
Neg logit contributions
adow-0.032
elope-0.031
 hive-0.029
 Flo-0.029
 perched-0.027
Token.
Token ID13
Feature activation-4.54e-02

Pos logit contributions
 gears+0.162
 motors+0.153
 trucks+0.146
 levers+0.146
 gadgets+0.144
Neg logit contributions
adow-0.079
 hive-0.053
elope-0.047
 perched-0.046
ow-0.046
Token They
Token ID1119
Feature activation-7.16e-02

Pos logit contributions
 gears+0.085
 motors+0.077
 magnets+0.074
 trucks+0.074
 trophies+0.072
Neg logit contributions
adow-0.082
elope-0.074
 Flo-0.066
 hive-0.062
 perched-0.061
Token have
Token ID423
Feature activation-4.74e-02

Pos logit contributions
 gears+0.155
 motors+0.150
irens+0.145
 automobiles+0.145
ruck+0.143
Neg logit contributions
adow-0.081
elope-0.068
�-0.060
 Flo-0.056
 hive-0.053
Token many
Token ID867
Feature activation-1.05e-01

Pos logit contributions
 gears+0.092
 trophy+0.085
 magnets+0.085
 motors+0.084
 trophies+0.084
Neg logit contributions
adow-0.065
elope-0.058
 Flo-0.051
 hive-0.050
 perched-0.048
Token tubes
Token ID21103
Feature activation-8.11e-02

Pos logit contributions
 trophy+0.141
 referee+0.140
 photographer+0.139
driver+0.138
 motorcycle+0.136
Neg logit contributions
adow-0.156
c-0.151
 upon-0.138
 ther-0.131
elope-0.130
Token of
Token ID286
Feature activation-7.98e-02

Pos logit contributions
 gears+0.175
 motors+0.170
 levers+0.160
 motorcycles+0.157
 trucks+0.156
Neg logit contributions
adow-0.108
ow-0.074
 hive-0.071
bug-0.067
 Flo-0.066
Token different
Token ID1180
Feature activation-1.06e-01

Pos logit contributions
 motorcycle+0.135
 trophy+0.133
 referee+0.128
 trophies+0.126
bike+0.125
Neg logit contributions
adow-0.100
elope-0.086
c-0.085
 ther-0.083
 upon-0.083
Token colors
Token ID7577
Feature activation-7.46e-02

Pos logit contributions
 cop+0.123
 motorcycle+0.120
 trophy+0.119
 referee+0.118
 rider+0.111
Neg logit contributions
adow-0.209
c-0.178
elope-0.168
 upon-0.163
ow-0.157
Token and
Token ID290
Feature activation-5.91e-02

Pos logit contributions
 gears+0.148
 motors+0.135
 trophies+0.133
 trucks+0.130
 magnets+0.127
Neg logit contributions
adow-0.111
 perched-0.082
elope-0.080
 Flo-0.079
ow-0.077
Token They
Token ID1119
Feature activation-7.00e-02

Pos logit contributions
 gears+0.100
 motors+0.091
 trucks+0.089
 magnets+0.088
 cubes+0.085
Neg logit contributions
adow-0.084
elope-0.076
 Flo-0.066
 perched-0.063
 hive-0.063
Token have
Token ID423
Feature activation-5.03e-02

Pos logit contributions
 gears+0.143
 motors+0.137
 automobiles+0.136
irens+0.135
 magnets+0.132
Neg logit contributions
adow-0.087
elope-0.075
�-0.067
 Flo-0.060
 hive-0.058
Token many
Token ID867
Feature activation-9.99e-02

Pos logit contributions
 gears+0.095
 magnets+0.089
 trophy+0.088
 motors+0.087
 trucks+0.087
Neg logit contributions
adow-0.072
elope-0.063
 Flo-0.055
 hive-0.053
 perched-0.053
Token tubes
Token ID21103
Feature activation-7.70e-02

Pos logit contributions
 referee+0.138
 photographer+0.137
driver+0.135
 trophy+0.134
 motorcycle+0.130
Neg logit contributions
adow-0.150
c-0.141
 upon-0.128
 ther-0.126
elope-0.123
Token of
Token ID286
Feature activation-8.05e-02

Pos logit contributions
 gears+0.166
 motors+0.159
 levers+0.149
 trucks+0.148
 motorcycles+0.146
Neg logit contributions
adow-0.105
ow-0.072
 hive-0.069
bug-0.065
 Flo-0.062
Token different
Token ID1180
Feature activation-1.03e-01

Pos logit contributions
 motorcycle+0.140
 trophy+0.138
 referee+0.135
 photographer+0.132
 trophies+0.129
Neg logit contributions
adow-0.098
elope-0.082
 ther-0.082
c-0.081
 upon-0.080
Token colors
Token ID7577
Feature activation-7.19e-02

Pos logit contributions
 cop+0.120
 referee+0.119
 trophy+0.116
 motorcycle+0.115
 rider+0.107
Neg logit contributions
adow-0.207
c-0.175
elope-0.165
 upon-0.161
ow-0.153
Token and
Token ID290
Feature activation-5.82e-02

Pos logit contributions
 gears+0.146
 motors+0.133
 trophies+0.130
 trucks+0.129
 magnets+0.127
Neg logit contributions
adow-0.105
 perched-0.076
elope-0.074
ow-0.071
 Flo-0.071
Token sizes
Token ID10620
Feature activation-6.88e-02

Pos logit contributions
 gears+0.049
about+0.042
 motors+0.042
 magnets+0.041
 trophy+0.041
Neg logit contributions
adow-0.130
elope-0.125
 Flo-0.112
 perched-0.108
 Honey-0.107
Token.
Token ID13
Feature activation-3.92e-02

Pos logit contributions
 gears+0.170
 motors+0.156
 trucks+0.155
 magnets+0.149
 trophies+0.148
Neg logit contributions
adow-0.073
elope-0.046
 perched-0.040
 Flo-0.040
 hive-0.038
Token They
Token ID1119
Feature activation-6.55e-02

Pos logit contributions
 gears+0.065
 motors+0.060
 magnets+0.059
 trucks+0.058
 hoops+0.056
Neg logit contributions
adow-0.077
elope-0.071
lor-0.066
 perched-0.063
comb-0.062
Token was
Token ID373
Feature activation+3.57e-02

Pos logit contributions
 gears+0.027
 motors+0.023
 trucks+0.023
 magnets+0.023
 cubes+0.022
Neg logit contributions
adow-0.039
elope-0.035
lor-0.033
 hive-0.032
oney-0.032
Token a
Token ID257
Feature activation-3.79e-04

Pos logit contributions
adow+0.073
elope+0.067
lor+0.065
 Flo+0.059
comb+0.058
Neg logit contributions
 gears-0.057
 magnets-0.051
 motors-0.051
 trucks-0.048
 cubes-0.048
Token boy
Token ID2933
Feature activation-7.92e-03

Pos logit contributions
 gears+0.001
 motors+0.001
 trucks+0.001
 magnets+0.001
 cubes+0.000
Neg logit contributions
adow-0.001
elope-0.001
 Flo-0.001
lor-0.001
 perched-0.001
Token named
Token ID3706
Feature activation+1.13e-02

Pos logit contributions
 gears+0.019
 motors+0.018
 trucks+0.017
 magnets+0.017
 cubes+0.017
Neg logit contributions
adow-0.010
elope-0.009
 Flo-0.007
 perched-0.007
lor-0.007
Token Tyler
Token ID14886
Feature activation-2.22e-02

Pos logit contributions
adow+0.025
elope+0.024
lor+0.022
 hive+0.021
 perched+0.021
Neg logit contributions
 gears-0.017
 trucks-0.015
 motors-0.015
 magnets-0.015
 cubes-0.014
Token.
Token ID13
Feature activation-1.01e-01

Pos logit contributions
 gears+0.047
 motors+0.043
 magnets+0.042
 trucks+0.042
 trophies+0.041
Neg logit contributions
adow-0.034
elope-0.031
 Flo-0.025
 perched-0.024
lor-0.024
Token One
Token ID1881
Feature activation-1.49e-02

Pos logit contributions
 magnets+0.191
 trophies+0.191
 trophy+0.191
 sculptures+0.188
 cubes+0.183
Neg logit contributions
adow-0.126
elope-0.104
 Sof-0.087
ow-0.086
 symp-0.085
Token day
Token ID1110
Feature activation-6.60e-02

Pos logit contributions
 gears+0.022
 motors+0.019
 trucks+0.019
 magnets+0.019
 cubes+0.018
Neg logit contributions
adow-0.031
elope-0.029
lor-0.028
 Flo-0.027
 perched-0.026
Token,
Token ID11
Feature activation-7.81e-02

Pos logit contributions
 magnets+0.151
 motors+0.149
 gears+0.147
 sculptures+0.147
 trophies+0.145
Neg logit contributions
adow-0.070
elope-0.054
ow-0.052
 ther-0.050
 Flo-0.047
Token Tyler
Token ID14886
Feature activation-1.62e-02

Pos logit contributions
 trophies+0.152
 magnets+0.151
 sculptures+0.150
 motors+0.148
 designs+0.143
Neg logit contributions
adow-0.087
 ther-0.068
elope-0.068
 upon-0.064
 perched-0.061
Token wanted
Token ID2227
Feature activation-2.05e-02

Pos logit contributions
 gears+0.027
 motors+0.025
 trucks+0.024
 magnets+0.024
 cubes+0.023
Neg logit contributions
adow-0.032
elope-0.029
 Flo-0.026
lor-0.026
 perched-0.025
Token a
Token ID257
Feature activation-7.50e-03

Pos logit contributions
adow+0.056
elope+0.051
lor+0.047
 Flo+0.044
 hive+0.043
Neg logit contributions
 gears-0.052
 motors-0.047
 magnets-0.046
 trucks-0.045
 cubes-0.044
Token small
Token ID1402
Feature activation-2.28e-02

Pos logit contributions
 gears+0.012
 motors+0.011
 trucks+0.011
 magnets+0.011
 cubes+0.010
Neg logit contributions
adow-0.015
elope-0.014
 Flo-0.012
 hive-0.012
 perched-0.012
Token ball
Token ID2613
Feature activation-1.18e-02

Pos logit contributions
 gears+0.036
 motors+0.032
 trucks+0.031
 magnets+0.031
 cubes+0.029
Neg logit contributions
adow-0.047
elope-0.043
 Flo-0.038
 hive-0.038
 perched-0.038
Token called
Token ID1444
Feature activation+1.10e-02

Pos logit contributions
 gears+0.027
 motors+0.025
 magnets+0.024
 trucks+0.024
 trophies+0.024
Neg logit contributions
adow-0.016
elope-0.014
 Flo-0.012
 perched-0.011
 hive-0.011
Token Bounce
Token ID41474
Feature activation-1.50e-02

Pos logit contributions
adow+0.024
elope+0.021
lor+0.019
 hive+0.019
 cottage+0.018
Neg logit contributions
 gears-0.018
 motors-0.016
 magnets-0.016
 trucks-0.016
 balls-0.015
Token.
Token ID13
Feature activation-9.99e-02

Pos logit contributions
 gears+0.029
 motors+0.027
 trucks+0.026
 magnets+0.026
 cubes+0.025
Neg logit contributions
adow-0.026
elope-0.023
 Flo-0.020
lor-0.020
 perched-0.020
Token Bounce
Token ID41474
Feature activation-3.94e-02

Pos logit contributions
ruck+0.204
 cubes+0.202
 sculptures+0.200
 motors+0.192
 cream+0.191
Neg logit contributions
adow-0.086
 Flo-0.084
 Dahl-0.081
tail-0.080
 Just-0.080
Token loved
Token ID6151
Feature activation-3.69e-02

Pos logit contributions
 gears+0.072
 motors+0.066
 magnets+0.065
 trucks+0.064
 cubes+0.063
Neg logit contributions
adow-0.071
elope-0.066
 Flo-0.056
 perched-0.055
 hive-0.052
Token making
Token ID1642
Feature activation-4.00e-02

Pos logit contributions
 gears+0.075
about+0.071
driver+0.071
 motors+0.070
 levers+0.070
Neg logit contributions
adow-0.043
 perched-0.039
 hive-0.038
 Flo-0.035
elope-0.033
Token sounds
Token ID5238
Feature activation-5.15e-02

Pos logit contributions
 gears+0.059
 motors+0.052
 trucks+0.050
 magnets+0.049
 cubes+0.049
Neg logit contributions
adow-0.081
elope-0.073
 perched-0.069
 Flo-0.069
 hive-0.067
Token and
Token ID290
Feature activation-1.82e-02

Pos logit contributions
 gears+0.127
 motors+0.118
drivers+0.114
 levers+0.114
 trucks+0.114
Neg logit contributions
adow-0.041
 perched-0.039
 hive-0.034
 Flo-0.034
 Buzz-0.032
Token with
Token ID351
Feature activation-1.61e-02

Pos logit contributions
 gears+0.045
 motors+0.042
 trucks+0.041
 magnets+0.040
 cubes+0.038
Neg logit contributions
adow-0.039
elope-0.035
 Flo-0.031
 perched-0.030
lor-0.030
Token tubes
Token ID21103
Feature activation-6.38e-02

Pos logit contributions
 gears+0.022
 motors+0.020
 trophies+0.019
driver+0.019
 referee+0.018
Neg logit contributions
adow-0.032
elope-0.031
 hive-0.028
 Flo-0.028
opsy-0.027
Token.
Token ID13
Feature activation-5.04e-02

Pos logit contributions
 gears+0.149
 motors+0.140
 trucks+0.135
 levers+0.130
 trophies+0.129
Neg logit contributions
adow-0.079
 hive-0.053
elope-0.052
 perched-0.048
ow-0.047
Token They
Token ID1119
Feature activation-7.00e-02

Pos logit contributions
 gears+0.100
 motors+0.091
 trucks+0.089
 magnets+0.088
 cubes+0.085
Neg logit contributions
adow-0.084
elope-0.076
 Flo-0.066
 perched-0.063
 hive-0.063
Token have
Token ID423
Feature activation-5.03e-02

Pos logit contributions
 gears+0.143
 motors+0.137
 automobiles+0.136
irens+0.135
 magnets+0.132
Neg logit contributions
adow-0.087
elope-0.075
�-0.067
 Flo-0.060
 hive-0.058
Token many
Token ID867
Feature activation-9.99e-02

Pos logit contributions
 gears+0.095
 magnets+0.089
 trophy+0.088
 motors+0.087
 trucks+0.087
Neg logit contributions
adow-0.072
elope-0.063
 Flo-0.055
 hive-0.053
 perched-0.053
Token tubes
Token ID21103
Feature activation-7.70e-02

Pos logit contributions
 referee+0.138
 photographer+0.137
driver+0.135
 trophy+0.134
 motorcycle+0.130
Neg logit contributions
adow-0.150
c-0.141
 upon-0.128
 ther-0.126
elope-0.123
Token of
Token ID286
Feature activation-8.05e-02

Pos logit contributions
 gears+0.166
 motors+0.159
 levers+0.149
 trucks+0.148
 motorcycles+0.146
Neg logit contributions
adow-0.105
ow-0.072
 hive-0.069
bug-0.065
 Flo-0.062
Token different
Token ID1180
Feature activation-1.03e-01

Pos logit contributions
 motorcycle+0.140
 trophy+0.138
 referee+0.135
 photographer+0.132
 trophies+0.129
Neg logit contributions
adow-0.098
elope-0.082
 ther-0.082
c-0.081
 upon-0.080
Token colors
Token ID7577
Feature activation-7.19e-02

Pos logit contributions
 cop+0.120
 referee+0.119
 trophy+0.116
 motorcycle+0.115
 rider+0.107
Neg logit contributions
adow-0.207
c-0.175
elope-0.165
 upon-0.161
ow-0.153
Token and
Token ID290
Feature activation-5.82e-02

Pos logit contributions
 gears+0.146
 motors+0.133
 trophies+0.130
 trucks+0.129
 magnets+0.127
Neg logit contributions
adow-0.105
 perched-0.076
elope-0.074
ow-0.071
 Flo-0.071
Token knew
Token ID2993
Feature activation-4.72e-03

Pos logit contributions
 gears+0.015
 motors+0.014
 trucks+0.013
 magnets+0.013
 cubes+0.013
Neg logit contributions
adow-0.012
elope-0.011
 perched-0.010
 Flo-0.009
 hive-0.009
Token that
Token ID326
Feature activation-5.59e-03

Pos logit contributions
 gears+0.008
 motors+0.007
 trucks+0.007
 magnets+0.007
 cubes+0.007
Neg logit contributions
adow-0.009
elope-0.008
 Flo-0.007
lor-0.007
 perched-0.007
Token chess
Token ID19780
Feature activation-5.76e-02

Pos logit contributions
 gears+0.009
 motors+0.008
 trucks+0.008
 magnets+0.008
 cubes+0.008
Neg logit contributions
adow-0.012
elope-0.010
lor-0.010
 Flo-0.009
 perched-0.009
Token was
Token ID373
Feature activation-8.67e-04

Pos logit contributions
 gears+0.118
 motors+0.114
 trucks+0.109
 automobiles+0.107
 cubes+0.104
Neg logit contributions
adow-0.079
elope-0.069
lor-0.057
 hive-0.056
comb-0.054
Token a
Token ID257
Feature activation-3.68e-02

Pos logit contributions
 gears+0.002
 motors+0.001
 trucks+0.001
 magnets+0.001
 cubes+0.001
Neg logit contributions
adow-0.002
elope-0.001
 Flo-0.001
 perched-0.001
lor-0.001
Token hard
Token ID1327
Feature activation-9.85e-02

Pos logit contributions
 gears+0.059
 motors+0.056
 trucks+0.055
 automobiles+0.054
 magnets+0.051
Neg logit contributions
elope-0.059
 hive-0.056
adow-0.055
 ther-0.052
lor-0.049
Token game
Token ID983
Feature activation-4.15e-02

Pos logit contributions
 automobiles+0.175
 motors+0.173
 trucks+0.169
 gears+0.166
 buses+0.166
Neg logit contributions
adow-0.119
elope-0.115
over-0.103
 once-0.099
comb-0.095
Token for
Token ID329
Feature activation-1.66e-02

Pos logit contributions
 gears+0.091
 motors+0.085
 magnets+0.082
 trucks+0.082
 trophies+0.080
Neg logit contributions
adow-0.059
elope-0.049
 Flo-0.041
ow-0.041
 hive-0.040
Token a
Token ID257
Feature activation-3.38e-02

Pos logit contributions
 gears+0.027
 motors+0.024
 magnets+0.024
 trucks+0.024
 cubes+0.023
Neg logit contributions
adow-0.035
elope-0.032
lor-0.029
 Flo-0.028
 perched-0.028
Token three
Token ID1115
Feature activation-6.09e-02

Pos logit contributions
 gears+0.048
 motors+0.042
 trucks+0.041
 magnets+0.041
 automobiles+0.040
Neg logit contributions
adow-0.067
elope-0.067
 hive-0.059
 Flo-0.059
lor-0.057
Token year
Token ID614
Feature activation+1.29e-02

Pos logit contributions
 gears+0.099
 motors+0.092
 trucks+0.085
 automobiles+0.085
 magnets+0.084
Neg logit contributions
adow-0.109
elope-0.100
 hive-0.087
over-0.086
abel-0.083
Token a
Token ID257
Feature activation+9.87e-03

Pos logit contributions
adow+0.062
elope+0.058
lor+0.056
 hive+0.050
comb+0.050
Neg logit contributions
 gears-0.051
 motors-0.046
 magnets-0.046
 trucks-0.043
 cubes-0.043
Token little
Token ID1310
Feature activation-7.94e-03

Pos logit contributions
adow+0.021
elope+0.019
lor+0.017
 Flo+0.017
 perched+0.017
Neg logit contributions
 gears-0.015
 motors-0.013
 trucks-0.013
 magnets-0.013
 cubes-0.012
Token boy
Token ID2933
Feature activation+6.86e-04

Pos logit contributions
 gears+0.013
 motors+0.011
 trucks+0.011
 magnets+0.011
 cubes+0.010
Neg logit contributions
adow-0.016
elope-0.015
 Flo-0.013
 hive-0.013
 perched-0.013
Token named
Token ID3706
Feature activation+2.12e-02

Pos logit contributions
adow+0.001
elope+0.001
lor+0.001
 Flo+0.001
 perched+0.001
Neg logit contributions
 gears-0.001
 motors-0.001
 trucks-0.001
 magnets-0.001
 cubes-0.001
Token Ryan
Token ID6047
Feature activation-3.43e-02

Pos logit contributions
adow+0.057
lor+0.055
elope+0.054
comb+0.050
 hive+0.050
Neg logit contributions
 gears-0.024
 trucks-0.022
 motors-0.019
 cubes-0.019
 magnets-0.019
Token.
Token ID13
Feature activation-9.81e-02

Pos logit contributions
 gears+0.069
 motors+0.063
 trucks+0.062
 magnets+0.062
 cubes+0.059
Neg logit contributions
adow-0.056
elope-0.051
 Flo-0.043
lor-0.042
 perched-0.042
Token He
Token ID679
Feature activation-2.29e-02

Pos logit contributions
 motors+0.200
 cubes+0.199
 trucks+0.198
 chips+0.195
ruck+0.194
Neg logit contributions
adow-0.076
 -0.072
 Sof-0.060
 Just-0.058
 Today-0.057
Token had
Token ID550
Feature activation-1.61e-02

Pos logit contributions
 gears+0.031
 motors+0.027
 trucks+0.026
 magnets+0.025
 cubes+0.024
Neg logit contributions
adow-0.052
elope-0.048
lor-0.045
 Flo-0.045
 hive-0.043
Token a
Token ID257
Feature activation-2.43e-02

Pos logit contributions
 gears+0.028
 motors+0.025
 magnets+0.023
 trucks+0.023
 cubes+0.023
Neg logit contributions
adow-0.032
elope-0.030
lor-0.027
 Flo-0.026
oney-0.026
Token lot
Token ID1256
Feature activation-1.75e-02

Pos logit contributions
 gears+0.034
 motors+0.030
 trucks+0.029
 magnets+0.028
 cubes+0.027
Neg logit contributions
adow-0.054
elope-0.050
 Flo-0.045
 hive-0.044
 perched-0.044
Token of
Token ID286
Feature activation-2.60e-02

Pos logit contributions
 gears+0.034
 motors+0.033
 trucks+0.032
 magnets+0.031
 automobiles+0.030
Neg logit contributions
adow-0.029
elope-0.021
ow-0.021
 Flo-0.020
 hive-0.019
Token a
Token ID257
Feature activation+5.78e-03

Pos logit contributions
adow+0.045
elope+0.042
lor+0.038
 Flo+0.036
 perched+0.035
Neg logit contributions
 gears-0.046
 motors-0.041
 magnets-0.041
 trucks-0.040
 cubes-0.039
Token little
Token ID1310
Feature activation-1.34e-02

Pos logit contributions
adow+0.012
elope+0.011
 Flo+0.010
lor+0.010
 perched+0.010
Neg logit contributions
 gears-0.009
 motors-0.008
 trucks-0.008
 magnets-0.008
 cubes-0.007
Token boy
Token ID2933
Feature activation-2.45e-03

Pos logit contributions
 gears+0.022
 motors+0.020
 trucks+0.019
 magnets+0.019
 trophies+0.018
Neg logit contributions
adow-0.026
elope-0.024
 hive-0.022
 Flo-0.022
 perched-0.021
Token called
Token ID1444
Feature activation+4.62e-03

Pos logit contributions
 gears+0.006
 motors+0.005
 trucks+0.005
 magnets+0.005
 cubes+0.005
Neg logit contributions
adow-0.003
elope-0.003
lor-0.002
 Flo-0.002
 perched-0.002
Token Bart
Token ID13167
Feature activation-3.84e-02

Pos logit contributions
adow+0.010
elope+0.010
lor+0.009
 hive+0.009
 perched+0.009
Neg logit contributions
 gears-0.007
 trucks-0.006
 motors-0.006
 magnets-0.006
 cubes-0.006
Token.
Token ID13
Feature activation-9.79e-02

Pos logit contributions
 gears+0.074
 motors+0.068
 trucks+0.066
 magnets+0.066
 cubes+0.063
Neg logit contributions
adow-0.065
elope-0.059
 Flo-0.051
lor-0.051
 perched-0.050
Token One
Token ID1881
Feature activation-1.25e-02

Pos logit contributions
 cubes+0.200
 cream+0.192
 chips+0.192
 papers+0.190
ruck+0.188
Neg logit contributions
adow-0.096
elope-0.087
 Flo-0.085
 perched-0.080
 Dahl-0.077
Token day
Token ID1110
Feature activation-5.94e-02

Pos logit contributions
 gears+0.016
 trucks+0.013
 motors+0.013
 cubes+0.013
 magnets+0.012
Neg logit contributions
adow-0.028
lor-0.027
elope-0.027
 Flo-0.025
 perched-0.025
Token,
Token ID11
Feature activation-6.24e-02

Pos logit contributions
 gears+0.125
 motors+0.122
 trucks+0.121
 sculptures+0.121
 magnets+0.120
Neg logit contributions
adow-0.064
 Flo-0.057
 perched-0.057
elope-0.054
 ther-0.048
Token he
Token ID339
Feature activation-5.47e-03

Pos logit contributions
 cubes+0.118
 motors+0.116
 gears+0.115
 sculptures+0.115
 sizes+0.113
Neg logit contributions
adow-0.069
 perched-0.068
 Flo-0.067
elope-0.065
 ther-0.056
Token went
Token ID1816
Feature activation+5.90e-03

Pos logit contributions
 gears+0.008
 motors+0.007
 trucks+0.006
 magnets+0.006
 cubes+0.006
Neg logit contributions
adow-0.012
elope-0.011
lor-0.011
 Flo-0.010
comb-0.010
Token He
Token ID679
Feature activation-5.34e-02

Pos logit contributions
 gears+0.120
 motors+0.115
 trophies+0.112
 trucks+0.108
 medals+0.107
Neg logit contributions
adow-0.086
elope-0.073
 Flo-0.067
ow-0.060
 Sof-0.057
Token had
Token ID550
Feature activation-4.16e-02

Pos logit contributions
 gears+0.107
 motors+0.106
 levers+0.102
 automobiles+0.102
ruck+0.100
Neg logit contributions
adow-0.067
elope-0.064
 perched-0.054
 Flo-0.052
c-0.051
Token a
Token ID257
Feature activation-5.13e-02

Pos logit contributions
 gears+0.087
 motors+0.083
 trucks+0.082
 magnets+0.081
 automobiles+0.080
Neg logit contributions
adow-0.052
elope-0.045
 Flo-0.041
 perched-0.041
 hive-0.036
Token big
Token ID1263
Feature activation-8.00e-02

Pos logit contributions
 motors+0.084
ruck+0.083
 gears+0.083
hing+0.081
 trucks+0.081
Neg logit contributions
 hive-0.077
adow-0.076
elope-0.074
opsy-0.073
 ther-0.072
Token,
Token ID11
Feature activation-4.71e-02

Pos logit contributions
 levers+0.118
 motors+0.117
ruck+0.115
drivers+0.107
 gears+0.105
Neg logit contributions
adow-0.136
 hive-0.134
elope-0.117
 ther-0.116
 nest-0.116
Token hard
Token ID1327
Feature activation-9.77e-02

Pos logit contributions
 motors+0.064
 gears+0.064
 trucks+0.059
ruck+0.058
 automobiles+0.056
Neg logit contributions
adow-0.087
elope-0.078
 ther-0.075
 perched-0.073
 Flo-0.072
Token basketball
Token ID9669
Feature activation-6.23e-02

Pos logit contributions
 levers+0.152
 motors+0.148
hing+0.138
 sculptures+0.135
ruck+0.134
Neg logit contributions
adow-0.147
 hive-0.134
oppy-0.118
opsy-0.118
 ther-0.113
Token that
Token ID326
Feature activation-3.36e-02

Pos logit contributions
 gears+0.139
 motors+0.138
 levers+0.133
ruck+0.131
 magnets+0.128
Neg logit contributions
adow-0.068
 hive-0.047
 perched-0.045
elope-0.045
ow-0.044
Token he
Token ID339
Feature activation-4.03e-02

Pos logit contributions
 gears+0.060
 motors+0.060
 automobiles+0.056
ruck+0.055
 magnets+0.055
Neg logit contributions
adow-0.050
elope-0.043
 Flo-0.041
 perched-0.039
 Sof-0.036
Token would
Token ID561
Feature activation-4.77e-02

Pos logit contributions
 gears+0.072
 motors+0.069
 automobiles+0.067
 magnets+0.066
ruck+0.065
Neg logit contributions
adow-0.064
elope-0.058
 perched-0.052
 Flo-0.048
ow-0.048
Token bounce
Token ID20110
Feature activation-4.98e-02

Pos logit contributions
 gears+0.077
 motors+0.070
 magnets+0.069
 automobiles+0.069
 trucks+0.069
Neg logit contributions
adow-0.085
elope-0.075
 Flo-0.073
 perched-0.071
 hive-0.071
Tokenbles
Token ID7689
Feature activation-4.24e-02

Pos logit contributions
 motors+0.072
 gears+0.071
 trucks+0.067
 automobiles+0.064
 levers+0.063
Neg logit contributions
adow-0.091
lor-0.075
elope-0.070
 ther-0.069
oppy-0.068
Token.
Token ID13
Feature activation-4.01e-02

Pos logit contributions
 gears+0.086
 motors+0.080
 trucks+0.076
 magnets+0.073
 automobiles+0.072
Neg logit contributions
adow-0.069
elope-0.055
 hive-0.053
 perched-0.051
 Flo-0.049
Token She
Token ID1375
Feature activation-3.10e-02

Pos logit contributions
 gears+0.083
 motors+0.076
 trucks+0.074
 magnets+0.074
 cubes+0.071
Neg logit contributions
adow-0.064
elope-0.057
 Flo-0.049
lor-0.048
 perched-0.048
Token had
Token ID550
Feature activation-2.93e-02

Pos logit contributions
 gears+0.059
 motors+0.055
 magnets+0.054
 automobiles+0.053
 trophies+0.053
Neg logit contributions
adow-0.051
elope-0.048
 perched-0.040
 Flo-0.039
 hive-0.038
Token many
Token ID867
Feature activation-8.15e-02

Pos logit contributions
 gears+0.058
 motors+0.053
 trucks+0.052
 magnets+0.052
 cubes+0.050
Neg logit contributions
adow-0.047
elope-0.043
 Flo-0.038
 perched-0.036
lor-0.036
Token shiny
Token ID22441
Feature activation-9.73e-02

Pos logit contributions
driver+0.111
 motors+0.102
hing+0.102
 gears+0.101
ruck+0.097
Neg logit contributions
adow-0.141
c-0.123
 ther-0.120
 perched-0.118
elope-0.115
Token and
Token ID290
Feature activation-3.98e-02

Pos logit contributions
hing+0.122
 subtract+0.118
 motors+0.117
Years+0.114
 keyboard+0.113
Neg logit contributions
adow-0.151
 Nib-0.146
c-0.145
 ther-0.142
 perched-0.140
Token colorful
Token ID20239
Feature activation-7.92e-02

Pos logit contributions
 gears+0.060
 motors+0.059
 trucks+0.053
 magnets+0.053
 automobiles+0.052
Neg logit contributions
adow-0.071
elope-0.068
 Flo-0.064
 perched-0.061
c-0.059
Token mar
Token ID1667
Feature activation-5.36e-02

Pos logit contributions
hing+0.093
Years+0.092
 gears+0.090
driver+0.088
 motors+0.087
Neg logit contributions
adow-0.124
 Nib-0.121
 Flor-0.117
 ther-0.117
c-0.116
Tokenbles
Token ID7689
Feature activation-4.79e-02

Pos logit contributions
 gears+0.086
 motors+0.085
 trucks+0.082
 automobiles+0.078
 magnets+0.076
Neg logit contributions
adow-0.094
lor-0.081
 perched-0.073
elope-0.072
ocket-0.072
Token.
Token ID13
Feature activation-3.00e-02

Pos logit contributions
 gears+0.098
 motors+0.091
 trucks+0.088
 magnets+0.087
 automobiles+0.084
Neg logit contributions
adow-0.077
elope-0.064
 Flo-0.057
 perched-0.057
 hive-0.056
Token that
Token ID326
Feature activation-1.48e-02

Pos logit contributions
 gears+0.003
 motors+0.003
 trucks+0.003
 magnets+0.003
 cubes+0.003
Neg logit contributions
adow-0.003
elope-0.003
 Flo-0.003
 perched-0.003
lor-0.003
Token even
Token ID772
Feature activation-3.90e-02

Pos logit contributions
 gears+0.026
 motors+0.024
 trucks+0.023
 cubes+0.022
 magnets+0.022
Neg logit contributions
adow-0.026
elope-0.024
 perched-0.021
 Flo-0.021
 hive-0.021
Token though
Token ID996
Feature activation-9.49e-03

Pos logit contributions
 gears+0.063
 motors+0.058
 trucks+0.055
 cubes+0.054
 magnets+0.053
Neg logit contributions
adow-0.071
elope-0.064
 perched-0.062
 Flo-0.059
 hive-0.058
Token people
Token ID661
Feature activation-4.48e-02

Pos logit contributions
 gears+0.016
 motors+0.014
 trucks+0.014
 magnets+0.014
 automobiles+0.013
Neg logit contributions
adow-0.019
elope-0.018
lor-0.016
 Flo-0.015
comb-0.015
Token have
Token ID423
Feature activation-6.31e-02

Pos logit contributions
 gears+0.079
 motors+0.071
 trucks+0.069
 magnets+0.069
 cubes+0.066
Neg logit contributions
adow-0.084
elope-0.074
 perched-0.065
 Flo-0.065
ow-0.064
Token different
Token ID1180
Feature activation-9.71e-02

Pos logit contributions
 gears+0.097
 trucks+0.087
 motors+0.085
 magnets+0.085
 cubes+0.084
Neg logit contributions
adow-0.120
elope-0.106
 Flo-0.097
 perched-0.095
 hive-0.093
Token jobs
Token ID3946
Feature activation-6.24e-02

Pos logit contributions
 gears+0.114
 cubes+0.102
 motors+0.101
 referee+0.100
 magnets+0.100
Neg logit contributions
adow-0.203
elope-0.166
ow-0.159
c-0.153
 Flo-0.152
Token and
Token ID290
Feature activation-4.62e-02

Pos logit contributions
 gears+0.111
 motors+0.099
 magnets+0.098
 trucks+0.098
 cubes+0.094
Neg logit contributions
adow-0.105
elope-0.087
ow-0.081
 perched-0.078
 Flo-0.078
Token wear
Token ID5806
Feature activation-5.12e-02

Pos logit contributions
 gears+0.069
 motors+0.060
 magnets+0.059
 trucks+0.058
 cubes+0.057
Neg logit contributions
adow-0.094
elope-0.086
 Flo-0.077
 perched-0.077
 hive-0.075
Token different
Token ID1180
Feature activation-8.70e-02

Pos logit contributions
 gears+0.071
 magnets+0.062
 motors+0.061
 trucks+0.061
 cubes+0.060
Neg logit contributions
adow-0.106
elope-0.097
 Flo-0.088
 perched-0.085
lor-0.083
Token clothes
Token ID8242
Feature activation-4.36e-02

Pos logit contributions
 gears+0.109
driver+0.099
 cubes+0.099
 motors+0.099
 magnets+0.099
Neg logit contributions
adow-0.177
elope-0.144
c-0.130
anna-0.128
 Flo-0.128
Token.
Token ID13
Feature activation-4.45e-02

Pos logit contributions
 gears+0.019
 motors+0.018
 trucks+0.018
 magnets+0.017
 cubes+0.017
Neg logit contributions
adow-0.014
elope-0.013
 Flo-0.011
lor-0.011
 hive-0.010
Token He
Token ID679
Feature activation-3.73e-02

Pos logit contributions
 gears+0.099
 motors+0.091
 trucks+0.091
 magnets+0.091
 cubes+0.089
Neg logit contributions
adow-0.056
elope-0.050
 Flo-0.042
 perched-0.039
ow-0.038
Token was
Token ID373
Feature activation-2.22e-02

Pos logit contributions
 gears+0.069
 magnets+0.064
 motors+0.062
 trucks+0.061
 automobiles+0.061
Neg logit contributions
adow-0.061
elope-0.056
 perched-0.050
c-0.045
 Flo-0.045
Token a
Token ID257
Feature activation-4.49e-02

Pos logit contributions
 gears+0.043
 motors+0.039
 magnets+0.039
 trucks+0.038
 cubes+0.037
Neg logit contributions
adow-0.037
elope-0.033
 perched-0.029
 Flo-0.029
 hive-0.028
Token very
Token ID845
Feature activation-3.12e-02

Pos logit contributions
 gears+0.084
 motors+0.077
 magnets+0.076
 trucks+0.075
 automobiles+0.074
Neg logit contributions
adow-0.074
elope-0.068
 hive-0.060
 perched-0.057
 Flo-0.056
Token talented
Token ID12356
Feature activation-9.69e-02

Pos logit contributions
 gears+0.051
 motors+0.047
 magnets+0.046
 trucks+0.045
 automobiles+0.044
Neg logit contributions
adow-0.060
elope-0.054
 Flo-0.048
 hive-0.047
 perched-0.046
Token drummer
Token ID34269
Feature activation-2.22e-02

Pos logit contributions
 gears+0.139
 trucks+0.131
 automobiles+0.129
 magnets+0.126
 motors+0.124
Neg logit contributions
adow-0.169
elope-0.144
lor-0.139
over-0.139
abel-0.137
Token.
Token ID13
Feature activation-4.61e-02

Pos logit contributions
 gears+0.041
 motors+0.038
 trucks+0.037
 magnets+0.036
 cubes+0.035
Neg logit contributions
adow-0.039
elope-0.036
 Flo-0.031
lor-0.031
 perched-0.030
Token He
Token ID679
Feature activation-3.57e-02

Pos logit contributions
 gears+0.085
 motors+0.077
 trucks+0.076
 magnets+0.076
 cubes+0.073
Neg logit contributions
adow-0.080
elope-0.073
 Flo-0.064
 perched-0.061
lor-0.061
Token had
Token ID550
Feature activation-3.25e-02

Pos logit contributions
 gears+0.061
 magnets+0.058
 motors+0.055
 trucks+0.055
 automobiles+0.055
Neg logit contributions
adow-0.063
elope-0.056
 perched-0.051
c-0.047
ow-0.046
Token a
Token ID257
Feature activation-5.03e-02

Pos logit contributions
 gears+0.064
 magnets+0.061
 trucks+0.060
 motors+0.059
 cubes+0.058
Neg logit contributions
adow-0.048
elope-0.040
 perched-0.037
 Flo-0.034
lor-0.034
Token a
Token ID257
Feature activation+6.19e-03

Pos logit contributions
adow+0.047
elope+0.044
lor+0.041
 Flo+0.038
 hive+0.037
Neg logit contributions
 gears-0.043
 motors-0.039
 magnets-0.038
 trucks-0.037
 cubes-0.036
Token smart
Token ID4451
Feature activation-3.00e-02

Pos logit contributions
adow+0.013
elope+0.012
 Flo+0.010
lor+0.010
 perched+0.010
Neg logit contributions
 gears-0.010
 motors-0.009
 trucks-0.009
 magnets-0.009
 cubes-0.008
Token dog
Token ID3290
Feature activation+7.86e-03

Pos logit contributions
 gears+0.050
 motors+0.044
 trucks+0.043
 magnets+0.043
 cubes+0.041
Neg logit contributions
adow-0.059
elope-0.054
 Flo-0.048
 hive-0.047
 perched-0.047
Token called
Token ID1444
Feature activation+1.11e-02

Pos logit contributions
adow+0.013
elope+0.011
lor+0.011
comb+0.010
 Flo+0.010
Neg logit contributions
 gears-0.015
 trucks-0.014
 motors-0.014
 magnets-0.013
 cubes-0.013
Token Jazz
Token ID21406
Feature activation+5.13e-03

Pos logit contributions
adow+0.026
elope+0.023
lor+0.023
 hive+0.022
comb+0.022
Neg logit contributions
 gears-0.014
 motors-0.013
 trucks-0.013
 magnets-0.012
 balls-0.012
Token.
Token ID13
Feature activation-9.66e-02

Pos logit contributions
adow+0.010
elope+0.009
 Flo+0.009
 hive+0.009
lor+0.008
Neg logit contributions
 gears-0.008
 motors-0.007
 trucks-0.007
 magnets-0.007
 automobiles-0.006
Token Jazz
Token ID21406
Feature activation-2.23e-02

Pos logit contributions
 motors+0.190
 magnets+0.185
 cubes+0.184
 gears+0.182
 trucks+0.178
Neg logit contributions
adow-0.118
elope-0.097
 Sof-0.097
 Dahl-0.089
c-0.084
Token liked
Token ID8288
Feature activation-2.07e-02

Pos logit contributions
 gears+0.034
 motors+0.030
 trucks+0.029
 magnets+0.029
 cubes+0.028
Neg logit contributions
adow-0.047
elope-0.044
 Flo-0.039
lor-0.039
 perched-0.038
Token to
Token ID284
Feature activation-1.07e-02

Pos logit contributions
 gears+0.040
 motors+0.037
 magnets+0.036
 trophies+0.035
 cubes+0.035
Neg logit contributions
adow-0.031
elope-0.026
 perched-0.025
 hive-0.025
 Flo-0.025
Token steal
Token ID8711
Feature activation-3.25e-02

Pos logit contributions
 gears+0.015
 motors+0.013
 trucks+0.013
 magnets+0.013
 cubes+0.012
Neg logit contributions
adow-0.024
elope-0.022
 Flo-0.020
lor-0.020
 perched-0.019
Token things
Token ID1243
Feature activation-4.07e-02

Pos logit contributions
 gears+0.042
 motors+0.038
 trucks+0.036
 magnets+0.036
 cubes+0.034
Neg logit contributions
adow-0.072
elope-0.065
 Flo-0.061
 perched-0.059
 hive-0.059
Token new
Token ID649
Feature activation-7.86e-02

Pos logit contributions
 gears+0.097
 motors+0.096
driver+0.093
 magnets+0.091
ruck+0.091
Neg logit contributions
adow-0.102
elope-0.083
 hive-0.079
opsy-0.078
c-0.076
Token pants
Token ID12581
Feature activation-5.10e-02

Pos logit contributions
hing+0.114
 motors+0.112
 gears+0.110
 magnets+0.108
 levers+0.106
Neg logit contributions
adow-0.139
elope-0.115
c-0.114
 hive-0.107
opsy-0.106
Token and
Token ID290
Feature activation-3.78e-02

Pos logit contributions
 gears+0.100
 motors+0.097
 trucks+0.094
 magnets+0.090
 automobiles+0.089
Neg logit contributions
adow-0.083
elope-0.063
 perched-0.057
ow-0.056
comb-0.055
Token he
Token ID339
Feature activation-3.18e-02

Pos logit contributions
 gears+0.063
 motors+0.059
 magnets+0.055
driver+0.055
 automobiles+0.054
Neg logit contributions
adow-0.064
elope-0.059
 Flo-0.054
 perched-0.053
 hive-0.049
Token liked
Token ID8288
Feature activation-3.98e-02

Pos logit contributions
 gears+0.058
 motors+0.053
 trucks+0.052
 magnets+0.051
 automobiles+0.051
Neg logit contributions
adow-0.055
elope-0.050
 perched-0.045
 Flo-0.042
 hive-0.041
Token sorting
Token ID29407
Feature activation-9.64e-02

Pos logit contributions
 gears+0.074
 motors+0.067
 magnets+0.065
 trucks+0.065
 automobiles+0.063
Neg logit contributions
adow-0.057
elope-0.052
 perched-0.050
 Flo-0.048
 hive-0.044
Token.
Token ID13
Feature activation-2.66e-02

Pos logit contributions
 gears+0.165
 trucks+0.162
 motors+0.157
 builders+0.156
 referee+0.154
Neg logit contributions
adow-0.108
 perched-0.096
c-0.095
 ther-0.089
 Flo-0.084
Token He
Token ID679
Feature activation-4.48e-02

Pos logit contributions
 gears+0.048
 motors+0.044
 trucks+0.043
 magnets+0.042
 cubes+0.041
Neg logit contributions
adow-0.048
elope-0.044
 Flo-0.039
lor-0.038
 perched-0.038
Token was
Token ID373
Feature activation-2.19e-02

Pos logit contributions
 gears+0.085
 motors+0.082
 automobiles+0.081
 trucks+0.078
 magnets+0.076
Neg logit contributions
adow-0.066
elope-0.058
 perched-0.054
 Flo-0.047
abel-0.047
Token so
Token ID523
Feature activation-2.16e-02

Pos logit contributions
 gears+0.042
 motors+0.039
 trucks+0.038
 magnets+0.038
 cubes+0.036
Neg logit contributions
adow-0.037
elope-0.034
 Flo-0.029
lor-0.029
 perched-0.028
Token happy
Token ID3772
Feature activation-1.73e-02

Pos logit contributions
 gears+0.048
 motors+0.046
 trucks+0.045
 automobiles+0.044
 magnets+0.044
Neg logit contributions
adow-0.030
elope-0.024
 Flo-0.021
 perched-0.021
oney-0.020
Token and
Token ID290
Feature activation+6.61e-03

Pos logit contributions
adow+0.005
elope+0.005
lor+0.004
 Flo+0.004
 hive+0.004
Neg logit contributions
 gears-0.002
 motors-0.002
 trucks-0.002
 magnets-0.002
 cubes-0.002
Token Ryan
Token ID6047
Feature activation-1.93e-02

Pos logit contributions
adow+0.014
lor+0.013
comb+0.013
elope+0.013
ocket+0.012
Neg logit contributions
 gears-0.009
 trucks-0.008
 motors-0.008
 magnets-0.007
 colors-0.007
Token was
Token ID373
Feature activation+1.49e-02

Pos logit contributions
 gears+0.026
 motors+0.023
 trucks+0.022
 magnets+0.020
 cubes+0.019
Neg logit contributions
adow-0.041
elope-0.039
lor-0.038
 Flo-0.036
 hive-0.036
Token very
Token ID845
Feature activation+3.27e-03

Pos logit contributions
lor+0.034
adow+0.031
ocket+0.030
elope+0.030
comb+0.030
Neg logit contributions
 gears-0.017
 motors-0.017
 trucks-0.016
 blocks-0.014
 colors-0.014
Token excited
Token ID6568
Feature activation-1.86e-02

Pos logit contributions
lor+0.007
elope+0.007
adow+0.006
comb+0.006
ocket+0.006
Neg logit contributions
 gears-0.004
 motors-0.004
 trucks-0.004
 cubes-0.004
 colors-0.004
Token.
Token ID13
Feature activation-9.49e-02

Pos logit contributions
 gears+0.035
 motors+0.031
 trucks+0.031
 magnets+0.031
 cubes+0.029
Neg logit contributions
adow-0.033
elope-0.030
 Flo-0.026
lor-0.026
 perched-0.026
Token His
Token ID2399
Feature activation-8.26e-03

Pos logit contributions
 cubes+0.212
 trucks+0.199
 motors+0.198
ruck+0.196
 sculptures+0.195
Neg logit contributions
 Just-0.069
adow-0.068
 -0.060
elope-0.049
 Luckily-0.049
Token family
Token ID1641
Feature activation-2.90e-02

Pos logit contributions
 gears+0.013
 trucks+0.011
 motors+0.011
 magnets+0.011
 numbers+0.010
Neg logit contributions
adow-0.017
elope-0.016
lor-0.016
 perched-0.015
 Flo-0.015
Token was
Token ID373
Feature activation-5.21e-03

Pos logit contributions
 gears+0.052
 motors+0.048
 trucks+0.046
 magnets+0.046
 automobiles+0.044
Neg logit contributions
adow-0.054
elope-0.048
 perched-0.042
 Flo-0.042
 hive-0.041
Token sending
Token ID7216
Feature activation-3.56e-02

Pos logit contributions
 gears+0.008
 motors+0.007
 trucks+0.007
 magnets+0.007
 cubes+0.007
Neg logit contributions
adow-0.011
elope-0.010
lor-0.009
 Flo-0.009
comb-0.009
Token him
Token ID683
Feature activation-9.10e-03

Pos logit contributions
 gears+0.062
 motors+0.057
 trucks+0.056
 magnets+0.055
 cubes+0.054
Neg logit contributions
adow-0.062
elope-0.056
 Flo-0.051
 perched-0.049
lor-0.048
Token with
Token ID351
Feature activation-2.31e-02

Pos logit contributions
 gears+0.038
 motors+0.034
 trucks+0.033
 magnets+0.033
 cubes+0.031
Neg logit contributions
adow-0.047
elope-0.044
 Flo-0.039
lor-0.039
 perched-0.038
Token tubes
Token ID21103
Feature activation-5.91e-02

Pos logit contributions
 gears+0.033
 motors+0.029
driver+0.027
 magnets+0.027
 trucks+0.026
Neg logit contributions
adow-0.047
elope-0.044
 Flo-0.039
 hive-0.039
 perched-0.039
Token.
Token ID13
Feature activation-4.07e-02

Pos logit contributions
 gears+0.123
 motors+0.114
 trucks+0.109
 magnets+0.106
 levers+0.106
Neg logit contributions
adow-0.088
elope-0.063
 hive-0.061
ow-0.059
 Flo-0.057
Token They
Token ID1119
Feature activation-6.11e-02

Pos logit contributions
 gears+0.065
 motors+0.063
 trucks+0.058
 magnets+0.057
 hoops+0.056
Neg logit contributions
adow-0.083
elope-0.076
lor-0.072
comb-0.070
 perched-0.067
Token have
Token ID423
Feature activation-4.09e-02

Pos logit contributions
 gears+0.116
 motors+0.110
 automobiles+0.108
irens+0.105
 magnets+0.105
Neg logit contributions
adow-0.089
elope-0.077
�-0.067
 Flo-0.063
 perched-0.062
Token many
Token ID867
Feature activation-9.47e-02

Pos logit contributions
 gears+0.078
 magnets+0.071
 trucks+0.071
 motors+0.071
 automobiles+0.070
Neg logit contributions
adow-0.062
elope-0.056
 Flo-0.049
 perched-0.047
 hive-0.046
Token tubes
Token ID21103
Feature activation-7.14e-02

Pos logit contributions
 referee+0.120
driver+0.117
 trophy+0.116
 photographer+0.112
hing+0.108
Neg logit contributions
adow-0.163
c-0.155
elope-0.141
 ther-0.135
 upon-0.130
Token of
Token ID286
Feature activation-6.78e-02

Pos logit contributions
 gears+0.142
 motors+0.135
 trucks+0.127
 magnets+0.125
 trophies+0.124
Neg logit contributions
adow-0.107
ow-0.076
 hive-0.070
 Flo-0.069
elope-0.068
Token different
Token ID1180
Feature activation-9.24e-02

Pos logit contributions
 trophy+0.103
 gears+0.101
 motorcycle+0.101
 referee+0.101
 trophies+0.100
Neg logit contributions
adow-0.100
elope-0.089
 Flo-0.084
c-0.082
 ther-0.082
Token colors
Token ID7577
Feature activation-6.75e-02

Pos logit contributions
 referee+0.091
 cop+0.090
 trophy+0.089
 motorcycle+0.086
driver+0.083
Neg logit contributions
adow-0.204
c-0.174
elope-0.172
ow-0.154
 ther-0.151
Token and
Token ID290
Feature activation-5.00e-02

Pos logit contributions
 gears+0.134
 motors+0.121
 trucks+0.120
 trophies+0.118
 magnets+0.116
Neg logit contributions
adow-0.101
elope-0.076
 perched-0.076
 Flo-0.070
 hive-0.068
Token Jake
Token ID14757
Feature activation-2.11e-02

Pos logit contributions
adow+0.014
elope+0.013
 Flo+0.011
lor+0.011
 perched+0.011
Neg logit contributions
 gears-0.011
 motors-0.010
 trucks-0.010
 magnets-0.010
 cubes-0.009
Token who
Token ID508
Feature activation-2.46e-02

Pos logit contributions
 gears+0.049
 motors+0.045
 trucks+0.043
 magnets+0.043
 trophies+0.043
Neg logit contributions
adow-0.029
elope-0.025
 perched-0.019
ow-0.019
 Flo-0.018
Token loved
Token ID6151
Feature activation-2.74e-02

Pos logit contributions
 gears+0.044
 motors+0.042
 magnets+0.040
 trucks+0.039
 trophies+0.039
Neg logit contributions
adow-0.044
elope-0.042
 perched-0.033
 Flo-0.033
 hive-0.033
Token to
Token ID284
Feature activation-1.59e-02

Pos logit contributions
 gears+0.049
 motors+0.045
driver+0.044
about+0.043
 trophies+0.043
Neg logit contributions
adow-0.042
elope-0.036
 perched-0.035
 hive-0.035
 Flo-0.033
Token kick
Token ID4829
Feature activation-6.43e-02

Pos logit contributions
 gears+0.024
 motors+0.022
 trucks+0.021
 magnets+0.021
 automobiles+0.020
Neg logit contributions
adow-0.034
elope-0.030
 Flo-0.027
 hive-0.026
ow-0.026
Token.
Token ID13
Feature activation-9.45e-02

Pos logit contributions
 gears+0.105
driver+0.099
 sculptures+0.099
 trucks+0.097
ruck+0.097
Neg logit contributions
adow-0.100
elope-0.085
 perched-0.084
c-0.083
 Flo-0.079
Token Jake
Token ID14757
Feature activation-4.55e-02

Pos logit contributions
 bands+0.204
 motors+0.202
 trophies+0.197
 cubes+0.195
 gears+0.193
Neg logit contributions
adow-0.095
elope-0.074
 Just-0.060
ow-0.054
 -0.052
Token kicked
Token ID12165
Feature activation-6.30e-02

Pos logit contributions
 gears+0.087
 motors+0.078
 magnets+0.078
 trucks+0.077
 cubes+0.074
Neg logit contributions
adow-0.077
elope-0.071
 perched-0.061
 Flo-0.060
lor-0.057
Token over
Token ID625
Feature activation-3.88e-02

Pos logit contributions
 sculptures+0.109
 gears+0.106
 motors+0.106
ubes+0.104
 sizes+0.104
Neg logit contributions
adow-0.073
 perched-0.072
elope-0.067
 Flo-0.066
ow-0.063
Token anything
Token ID1997
Feature activation-2.77e-02

Pos logit contributions
 gears+0.059
 motors+0.059
 sizes+0.057
ubes+0.054
 automobiles+0.054
Neg logit contributions
adow-0.063
elope-0.052
 Flo-0.048
 Sof-0.047
 perched-0.047
Token that
Token ID326
Feature activation-2.92e-02

Pos logit contributions
 gears+0.052
 motors+0.048
 magnets+0.048
 trucks+0.047
 automobiles+0.046
Neg logit contributions
adow-0.043
elope-0.037
 perched-0.035
 Flo-0.035
 hive-0.033
Token regretted
Token ID44652
Feature activation+2.40e-02

Pos logit contributions
adow+0.020
lor+0.018
elope+0.017
oney+0.017
comb+0.017
Neg logit contributions
 gears-0.011
 trucks-0.010
 motors-0.009
 buttons-0.008
 numbers-0.008
Token getting
Token ID1972
Feature activation-8.47e-03

Pos logit contributions
adow+0.052
elope+0.047
lor+0.044
 Flo+0.040
c+0.040
Neg logit contributions
 gears-0.036
 trucks-0.033
 motors-0.033
 magnets-0.031
 numbers-0.030
Token close
Token ID1969
Feature activation-1.38e-02

Pos logit contributions
 gears+0.014
 motors+0.013
 magnets+0.012
 trucks+0.012
 cubes+0.012
Neg logit contributions
adow-0.017
elope-0.016
lor-0.015
comb-0.014
oney-0.014
Token to
Token ID284
Feature activation+1.42e-02

Pos logit contributions
 gears+0.022
 motors+0.020
 trucks+0.019
 magnets+0.019
 cubes+0.018
Neg logit contributions
adow-0.028
elope-0.026
 Flo-0.023
lor-0.023
 perched-0.023
Token the
Token ID262
Feature activation+4.50e-02

Pos logit contributions
adow+0.035
lor+0.032
elope+0.031
comb+0.030
opsy+0.029
Neg logit contributions
 gears-0.018
 trucks-0.017
 motors-0.016
 numbers-0.015
 magnets-0.015
Token Gor
Token ID19097
Feature activation+8.99e-02

Pos logit contributions
adow+0.099
elope+0.084
lor+0.082
oney+0.080
opsy+0.077
Neg logit contributions
 trucks-0.066
 gears-0.065
 numbers-0.062
 motors-0.060
 colors-0.059
Tokenilla
Token ID5049
Feature activation+5.84e-02

Pos logit contributions
adow+0.115
 Flo+0.102
 ther+0.091
 Dahl+0.090
 Honey+0.089
Neg logit contributions
 trucks-0.182
 gears-0.175
 motors-0.174
 cubes-0.168
 automobiles-0.168
Token and
Token ID290
Feature activation+2.48e-03

Pos logit contributions
adow+0.133
elope+0.128
lor+0.122
 Flo+0.119
opsy+0.119
Neg logit contributions
 gears-0.063
 trucks-0.062
 magnets-0.057
 motors-0.054
 numbers-0.053
Token not
Token ID407
Feature activation-4.13e-03

Pos logit contributions
adow+0.005
elope+0.005
lor+0.005
oney+0.005
opsy+0.004
Neg logit contributions
 gears-0.004
 trucks-0.003
 motors-0.003
 magnets-0.003
 numbers-0.003
Token listening
Token ID8680
Feature activation+4.90e-03

Pos logit contributions
 gears+0.007
 motors+0.006
 trucks+0.006
 magnets+0.005
 numbers+0.005
Neg logit contributions
adow-0.008
elope-0.008
lor-0.007
 Flo-0.007
opsy-0.007
Token to
Token ID284
Feature activation+1.52e-02

Pos logit contributions
adow+0.011
elope+0.011
lor+0.009
 Flo+0.009
opsy+0.009
Neg logit contributions
 gears-0.007
 motors-0.006
 trucks-0.006
 magnets-0.006
 numbers-0.006
Token!
Token ID0
Feature activation+6.70e-02

Pos logit contributions
adow+0.087
elope+0.083
 thirst+0.078
lor+0.075
oney+0.075
Neg logit contributions
 gears-0.048
 numbers-0.045
 trucks-0.045
 magnets-0.044
 automobiles-0.042
Token Hello
Token ID18435
Feature activation+3.87e-02

Pos logit contributions
adow+0.143
 hive+0.128
elope+0.126
comb+0.123
opsy+0.123
Neg logit contributions
 gears-0.069
 robots-0.068
 cop-0.065
 motors-0.065
driver-0.064
Token there
Token ID612
Feature activation+3.02e-02

Pos logit contributions
adow+0.079
elope+0.075
lor+0.070
oney+0.068
 thirst+0.067
Neg logit contributions
 gears-0.044
 numbers-0.042
 trucks-0.041
 colors-0.040
 magnets-0.039
Token,
Token ID11
Feature activation+5.40e-02

Pos logit contributions
adow+0.060
elope+0.059
oney+0.054
opsy+0.054
lor+0.053
Neg logit contributions
 gears-0.037
 numbers-0.036
 colors-0.035
 trucks-0.035
 motors-0.034
Token Mr
Token ID1770
Feature activation+5.46e-02

Pos logit contributions
adow+0.116
elope+0.109
lor+0.100
comb+0.098
ossom+0.097
Neg logit contributions
 gears-0.073
 trucks-0.066
 magnets-0.065
 robots-0.063
 numbers-0.062
Token Owl
Token ID37007
Feature activation+8.08e-02

Pos logit contributions
adow+0.116
 thirst+0.104
oney+0.102
lor+0.102
 ther+0.097
Neg logit contributions
 gears-0.059
 numbers-0.055
 trucks-0.055
 magnets-0.054
 automobiles-0.054
Token!"
Token ID2474
Feature activation+4.30e-02

Pos logit contributions
adow+0.160
elope+0.152
lor+0.151
opsy+0.151
 Maur+0.148
Neg logit contributions
 trucks-0.092
 gears-0.089
 sizes-0.082
 motors-0.082
 numbers-0.081
Token
Token ID198
Feature activation+4.52e-02

Pos logit contributions
adow+0.096
opsy+0.079
elope+0.078
lor+0.076
asis+0.075
Neg logit contributions
 numbers-0.049
 gears-0.049
 magnets-0.047
 robots-0.046
 trucks-0.046
Token
Token ID198
Feature activation+3.29e-02

Pos logit contributions
adow+0.097
elope+0.081
 hive+0.077
comb+0.075
 thirst+0.073
Neg logit contributions
 gears-0.060
 motors-0.055
 magnets-0.054
 numbers-0.054
 cop-0.052
TokenThe
Token ID464
Feature activation+2.35e-02

Pos logit contributions
adow+0.071
elope+0.063
 Flo+0.056
 hive+0.055
 perched+0.055
Neg logit contributions
 gears-0.047
 motors-0.043
 magnets-0.042
 trucks-0.041
 trophies-0.040
Token owl
Token ID39610
Feature activation+4.88e-02

Pos logit contributions
adow+0.055
elope+0.048
lor+0.046
 Flo+0.045
oney+0.044
Neg logit contributions
 gears-0.032
 magnets-0.028
 trucks-0.028
 numbers-0.027
 motors-0.027
Token.
Token ID13
Feature activation+1.30e-02

Pos logit contributions
adow+0.060
elope+0.056
lor+0.052
oney+0.051
opsy+0.049
Neg logit contributions
 gears-0.045
 motors-0.040
 trucks-0.040
 magnets-0.037
 numbers-0.037
Token
Token ID220
Feature activation+1.44e-02

Pos logit contributions
adow+0.030
 hive+0.027
comb+0.026
elope+0.025
opsy+0.024
Neg logit contributions
 gears-0.019
 magnets-0.015
 trucks-0.015
 motors-0.015
 robots-0.015
Token
Token ID198
Feature activation+1.96e-02

Pos logit contributions
adow+0.029
 hive+0.028
elope+0.027
comb+0.026
tail+0.025
Neg logit contributions
 gears-0.022
 motors-0.019
 magnets-0.018
 trucks-0.017
driver-0.017
Token
Token ID198
Feature activation+3.60e-03

Pos logit contributions
adow+0.036
elope+0.031
 hive+0.029
comb+0.027
ow+0.027
Neg logit contributions
 gears-0.036
 motors-0.032
 magnets-0.031
 trucks-0.030
 cubes-0.030
TokenThe
Token ID464
Feature activation+3.14e-02

Pos logit contributions
adow+0.008
elope+0.007
 Flo+0.006
lor+0.006
 perched+0.006
Neg logit contributions
 gears-0.005
 motors-0.005
 trucks-0.005
 magnets-0.005
 cubes-0.004
Token ostr
Token ID42387
Feature activation+8.54e-02

Pos logit contributions
adow+0.066
lor+0.057
elope+0.055
comb+0.052
oney+0.051
Neg logit contributions
 gears-0.049
 trucks-0.049
 motors-0.046
 colors-0.044
 numbers-0.043
Tokenich
Token ID488
Feature activation+3.14e-02

Pos logit contributions
adow+0.152
 Honey+0.142
elope+0.137
 Bonnie+0.132
 Flo+0.132
Neg logit contributions
 trucks-0.120
 gears-0.119
 automobiles-0.118
 motors-0.116
 cubes-0.113
Token kept
Token ID4030
Feature activation-8.40e-04

Pos logit contributions
adow+0.071
oney+0.065
lor+0.063
elope+0.063
comb+0.061
Neg logit contributions
 gears-0.036
 trucks-0.032
 motors-0.029
 colors-0.027
 buttons-0.027
Token walking
Token ID6155
Feature activation+8.01e-03

Pos logit contributions
 gears+0.002
 motors+0.001
 trucks+0.001
 magnets+0.001
 cubes+0.001
Neg logit contributions
adow-0.001
elope-0.001
lor-0.001
 Flo-0.001
comb-0.001
Token until
Token ID1566
Feature activation+1.27e-02

Pos logit contributions
adow+0.017
elope+0.016
lor+0.014
 Flo+0.014
 perched+0.014
Neg logit contributions
 gears-0.012
 motors-0.011
 trucks-0.011
 magnets-0.010
 cubes-0.010
Token he
Token ID339
Feature activation+1.49e-02

Pos logit contributions
adow+0.037
elope+0.032
lor+0.032
 hive+0.030
anna+0.030
Neg logit contributions
 gears-0.011
 motors-0.008
 trucks-0.008
 cubes-0.007
 magnets-0.006
Token so
Token ID523
Feature activation+3.60e-02

Pos logit contributions
lor+0.089
adow+0.089
elope+0.084
oney+0.081
utt+0.080
Neg logit contributions
 colors-0.047
 color-0.039
 gears-0.038
 colored-0.038
 numbers-0.038
Token nice
Token ID3621
Feature activation+3.00e-02

Pos logit contributions
adow+0.095
elope+0.092
lor+0.090
comb+0.087
 perched+0.082
Neg logit contributions
 gears-0.028
 colors-0.027
 numbers-0.024
 shapes-0.022
 motors-0.022
Token to
Token ID284
Feature activation+2.73e-02

Pos logit contributions
elope+0.060
adow+0.059
lor+0.055
opsy+0.053
 symp+0.050
Neg logit contributions
 gears-0.042
 trucks-0.037
 numbers-0.037
 colors-0.037
 magnets-0.036
Token see
Token ID766
Feature activation+5.66e-02

Pos logit contributions
adow+0.064
elope+0.058
lor+0.057
 ther+0.052
comb+0.052
Neg logit contributions
 gears-0.030
 motors-0.028
 colors-0.028
 numbers-0.027
 swirl-0.027
Token a
Token ID257
Feature activation+4.17e-02

Pos logit contributions
adow+0.126
elope+0.125
comb+0.114
lor+0.108
opsy+0.107
Neg logit contributions
 gears-0.069
 motors-0.065
 magnets-0.065
 numbers-0.064
 trucks-0.064
Token rose
Token ID8278
Feature activation+8.23e-02

Pos logit contributions
adow+0.101
elope+0.089
ow+0.086
lor+0.083
 Flo+0.082
Neg logit contributions
 colors-0.043
 gears-0.041
 swirl-0.041
 numbers-0.040
 trophy-0.039
Token like
Token ID588
Feature activation+6.10e-02

Pos logit contributions
elope+0.171
adow+0.155
uf+0.152
apa+0.150
lor+0.148
Neg logit contributions
 colors-0.090
 balls-0.087
 gears-0.086
 swirl-0.085
 trophy-0.084
Token that
Token ID326
Feature activation+6.81e-02

Pos logit contributions
adow+0.136
elope+0.135
comb+0.113
lor+0.111
ossom+0.111
Neg logit contributions
 gears-0.077
 numbers-0.073
 magnets-0.073
 motors-0.072
 colors-0.071
Token."
Token ID526
Feature activation+2.99e-02

Pos logit contributions
adow+0.132
elope+0.132
lor+0.129
uf+0.128
apa+0.120
Neg logit contributions
 colors-0.087
 color-0.078
 numbers-0.077
 shapes-0.074
 ones-0.074
Token
Token ID198
Feature activation+3.27e-02

Pos logit contributions
adow+0.073
lor+0.062
comb+0.061
elope+0.059
asis+0.058
Neg logit contributions
 gears-0.032
 turns-0.031
 brushes-0.030
 pencil-0.030
 numbers-0.030
Token
Token ID198
Feature activation+1.99e-02

Pos logit contributions
adow+0.075
comb+0.065
elope+0.061
 hive+0.061
ow+0.061
Neg logit contributions
 gears-0.035
 magnets-0.030
 pencil-0.029
 numbers-0.029
 motors-0.029
Token friend
Token ID1545
Feature activation+1.75e-02

Pos logit contributions
adow+0.044
elope+0.041
lor+0.040
 Flo+0.037
 perched+0.037
Neg logit contributions
 gears-0.029
 motors-0.026
 trucks-0.026
 cubes-0.024
 magnets-0.024
Token."
Token ID526
Feature activation+1.96e-02

Pos logit contributions
adow+0.036
elope+0.035
lor+0.032
opsy+0.031
 Flo+0.031
Neg logit contributions
 gears-0.023
 trucks-0.023
 motors-0.022
 balls-0.022
 magnets-0.021
Token
Token ID198
Feature activation+2.72e-02

Pos logit contributions
adow+0.045
elope+0.038
comb+0.038
lor+0.037
ocket+0.037
Neg logit contributions
 gears-0.023
 trucks-0.022
 cubes-0.021
 motors-0.021
 automobiles-0.020
Token
Token ID198
Feature activation+1.70e-02

Pos logit contributions
adow+0.063
comb+0.056
elope+0.052
 hive+0.051
ow+0.050
Neg logit contributions
 gears-0.032
 motors-0.029
 cubes-0.027
 trucks-0.027
 magnets-0.026
TokenThe
Token ID464
Feature activation+2.31e-02

Pos logit contributions
adow+0.041
elope+0.035
comb+0.034
ow+0.034
 hive+0.034
Neg logit contributions
 gears-0.019
 motors-0.017
 trucks-0.016
 cubes-0.016
 magnets-0.016
Token ostr
Token ID42387
Feature activation+8.47e-02

Pos logit contributions
adow+0.050
lor+0.042
elope+0.041
oney+0.039
comb+0.039
Neg logit contributions
 gears-0.037
 trucks-0.037
 motors-0.035
 colors-0.033
 automobiles-0.033
Tokenich
Token ID488
Feature activation+4.57e-02

Pos logit contributions
adow+0.145
 Honey+0.142
elope+0.136
 Flo+0.133
 Bonnie+0.132
Neg logit contributions
 trucks-0.131
 gears-0.128
 cubes-0.127
 automobiles-0.126
 motors-0.123
Token heard
Token ID2982
Feature activation+2.69e-02

Pos logit contributions
adow+0.105
elope+0.093
oney+0.093
lor+0.091
opsy+0.087
Neg logit contributions
 gears-0.051
 trucks-0.049
 magnets-0.044
 balls-0.043
 motors-0.043
Token Ben
Token ID3932
Feature activation+1.80e-02

Pos logit contributions
adow+0.065
elope+0.056
comb+0.052
lor+0.051
field+0.049
Neg logit contributions
 motors-0.036
 gears-0.035
 numbers-0.033
 trucks-0.033
 balls-0.033
Token's
Token ID338
Feature activation+2.95e-02

Pos logit contributions
adow+0.039
elope+0.037
lor+0.035
 Flo+0.034
opsy+0.033
Neg logit contributions
 gears-0.022
 trucks-0.021
 motors-0.020
 magnets-0.019
 buttons-0.019
Token voice
Token ID3809
Feature activation+7.45e-03

Pos logit contributions
adow+0.055
elope+0.050
lor+0.046
 Flo+0.044
 perched+0.044
Neg logit contributions
 gears-0.052
 motors-0.047
 trucks-0.047
 magnets-0.045
 automobiles-0.044
TokenShe
Token ID3347
Feature activation+4.92e-03

Pos logit contributions
adow+0.006
elope+0.006
lor+0.005
 Flo+0.005
 perched+0.005
Neg logit contributions
 gears-0.005
 motors-0.004
 trucks-0.004
 magnets-0.004
 cubes-0.004
Token hugs
Token ID40657
Feature activation-2.61e-02

Pos logit contributions
adow+0.011
elope+0.010
 Flo+0.009
lor+0.009
 perched+0.008
Neg logit contributions
 gears-0.007
 motors-0.006
 trucks-0.006
 magnets-0.006
 cubes-0.006
Token them
Token ID606
Feature activation-1.50e-02

Pos logit contributions
 gears+0.040
 motors+0.035
 trucks+0.034
 magnets+0.034
 cubes+0.032
Neg logit contributions
adow-0.054
elope-0.050
 Flo-0.046
lor-0.044
 perched-0.044
Token and
Token ID290
Feature activation-1.36e-02

Pos logit contributions
 gears+0.024
 motors+0.022
 trucks+0.021
 magnets+0.020
 cubes+0.020
Neg logit contributions
adow-0.030
elope-0.026
 Flo-0.024
 hive-0.024
 perched-0.024
Token says
Token ID1139
Feature activation+1.73e-02

Pos logit contributions
 gears+0.024
 motors+0.022
 trucks+0.021
 magnets+0.021
 cubes+0.021
Neg logit contributions
adow-0.025
elope-0.023
 Flo-0.020
 perched-0.020
lor-0.020
Token she
Token ID673
Feature activation+1.47e-02

Pos logit contributions
adow+0.041
elope+0.039
lor+0.034
 perched+0.033
comb+0.033
Neg logit contributions
 gears-0.023
 motors-0.021
 magnets-0.020
 trucks-0.020
 automobiles-0.019
Token loves
Token ID10408
Feature activation-1.52e-02

Pos logit contributions
adow+0.031
elope+0.029
 Flo+0.028
lor+0.027
comb+0.026
Neg logit contributions
 gears-0.021
 motors-0.019
 trucks-0.019
 magnets-0.018
 numbers-0.017
Token them
Token ID606
Feature activation-2.52e-02

Pos logit contributions
 gears+0.020
 motors+0.018
 trucks+0.017
 magnets+0.017
 cubes+0.016
Neg logit contributions
adow-0.036
elope-0.034
lor-0.031
comb-0.030
 perched-0.030
Token.
Token ID13
Feature activation-1.00e-02

Pos logit contributions
 gears+0.042
 motors+0.039
 trucks+0.038
 magnets+0.038
 cubes+0.036
Neg logit contributions
adow-0.048
elope-0.047
lor-0.042
 Flo-0.041
 perched-0.040
Token
Token ID198
Feature activation+1.14e-02

Pos logit contributions
 gears+0.018
 motors+0.016
 magnets+0.016
 trucks+0.016
 automobiles+0.015
Neg logit contributions
adow-0.019
elope-0.017
lor-0.016
 perched-0.015
comb-0.015
Token
Token ID198
Feature activation+6.91e-04

Pos logit contributions
adow+0.021
elope+0.019
comb+0.018
lor+0.017
 hive+0.017
Neg logit contributions
 gears-0.020
 motors-0.017
 magnets-0.017
 trucks-0.017
 cubes-0.017
Token the
Token ID262
Feature activation-1.77e-02

Pos logit contributions
adow+0.001
elope+0.001
lor+0.001
ow+0.001
 perched+0.001
Neg logit contributions
 gears-0.000
 numbers-0.000
 magnets-0.000
 trucks-0.000
 motors-0.000
Token little
Token ID1310
Feature activation-1.31e-02

Pos logit contributions
 gears+0.031
 motors+0.028
 magnets+0.027
 trucks+0.027
 cubes+0.026
Neg logit contributions
adow-0.032
elope-0.029
 hive-0.026
 Flo-0.025
lor-0.024
Token girl
Token ID2576
Feature activation-8.87e-03

Pos logit contributions
 gears+0.024
 motors+0.022
 trucks+0.021
 magnets+0.021
 cubes+0.021
Neg logit contributions
adow-0.023
elope-0.022
 hive-0.019
 Flo-0.019
 perched-0.018
Token was
Token ID373
Feature activation-9.14e-03

Pos logit contributions
 gears+0.013
 motors+0.011
 trucks+0.011
 numbers+0.010
 magnets+0.010
Neg logit contributions
adow-0.019
elope-0.017
lor-0.016
 Flo-0.016
oney-0.015
Token done
Token ID1760
Feature activation-2.28e-02

Pos logit contributions
 gears+0.016
 motors+0.014
 trucks+0.014
 magnets+0.013
 cubes+0.013
Neg logit contributions
adow-0.017
elope-0.017
lor-0.014
 Flo-0.014
 hive-0.014
Token,
Token ID11
Feature activation+2.16e-02

Pos logit contributions
 gears+0.037
 motors+0.033
 trucks+0.032
 magnets+0.032
 cubes+0.031
Neg logit contributions
adow-0.046
elope-0.043
 Flo-0.038
lor-0.037
 perched-0.037
Token she
Token ID673
Feature activation+1.10e-03

Pos logit contributions
adow+0.048
elope+0.043
lor+0.042
ow+0.037
oney+0.037
Neg logit contributions
 gears-0.029
 magnets-0.026
 trucks-0.025
 numbers-0.025
 motors-0.025
Token said
Token ID531
Feature activation+2.81e-02

Pos logit contributions
adow+0.002
elope+0.002
lor+0.002
 Flo+0.002
oney+0.002
Neg logit contributions
 gears-0.002
 motors-0.001
 trucks-0.001
 magnets-0.001
 cubes-0.001
Token,
Token ID11
Feature activation+2.41e-02

Pos logit contributions
adow+0.059
elope+0.057
lor+0.048
opsy+0.046
 perched+0.046
Neg logit contributions
 gears-0.041
 magnets-0.036
 motors-0.036
 trophies-0.036
 numbers-0.035
Token "
Token ID366
Feature activation-7.82e-03

Pos logit contributions
adow+0.063
elope+0.057
lor+0.051
 hive+0.050
 Flo+0.049
Neg logit contributions
 gears-0.024
 motors-0.021
 magnets-0.021
 trucks-0.020
 numbers-0.020
TokenThank
Token ID10449
Feature activation+1.92e-02

Pos logit contributions
 gears+0.012
 motors+0.011
 trucks+0.011
 magnets+0.010
 cubes+0.010
Neg logit contributions
adow-0.016
elope-0.015
 Flo-0.014
lor-0.013
 perched-0.013
Token something
Token ID1223
Feature activation+9.93e-03

Pos logit contributions
adow+0.068
elope+0.062
lor+0.060
 symp+0.058
opsy+0.058
Neg logit contributions
 gears-0.048
 trucks-0.046
 numbers-0.046
 motors-0.045
 magnets-0.041
Token that
Token ID326
Feature activation+2.72e-02

Pos logit contributions
adow+0.021
elope+0.020
lor+0.018
comb+0.017
 Flo+0.017
Neg logit contributions
 gears-0.015
 motors-0.014
 trucks-0.014
 magnets-0.013
 automobiles-0.013
Token helps
Token ID5419
Feature activation+1.84e-02

Pos logit contributions
adow+0.056
elope+0.053
lor+0.050
opsy+0.047
comb+0.047
Neg logit contributions
 gears-0.041
 trucks-0.039
 motors-0.037
 magnets-0.037
 numbers-0.036
Token us
Token ID514
Feature activation+2.45e-02

Pos logit contributions
adow+0.037
elope+0.035
lor+0.032
 perched+0.031
comb+0.031
Neg logit contributions
 gears-0.029
 motors-0.027
 trucks-0.027
 automobiles-0.026
 magnets-0.025
Token learn
Token ID2193
Feature activation+1.27e-02

Pos logit contributions
elope+0.048
adow+0.048
lor+0.045
 Flo+0.043
opsy+0.042
Neg logit contributions
 gears-0.036
 motors-0.033
 trucks-0.033
 magnets-0.031
 numbers-0.030
Token about
Token ID546
Feature activation+1.97e-02

Pos logit contributions
adow+0.024
elope+0.022
 Flo+0.019
lor+0.019
 perched+0.019
Neg logit contributions
 gears-0.022
 motors-0.020
 trucks-0.020
 magnets-0.020
 automobiles-0.019
Token the
Token ID262
Feature activation+9.69e-03

Pos logit contributions
adow+0.047
elope+0.043
lor+0.040
opsy+0.040
comb+0.039
Neg logit contributions
 gears-0.026
 trucks-0.024
 motors-0.024
 magnets-0.023
 numbers-0.022
Token world
Token ID995
Feature activation+2.35e-03

Pos logit contributions
adow+0.020
elope+0.019
lor+0.017
 Flo+0.017
 perched+0.016
Neg logit contributions
 gears-0.016
 motors-0.014
 trucks-0.014
 magnets-0.014
 cubes-0.013
Token around
Token ID1088
Feature activation+4.37e-03

Pos logit contributions
adow+0.004
elope+0.004
lor+0.003
 Flo+0.003
opsy+0.003
Neg logit contributions
 gears-0.004
 motors-0.004
 trucks-0.004
 magnets-0.004
 automobiles-0.004
Token us
Token ID514
Feature activation+2.69e-02

Pos logit contributions
adow+0.008
elope+0.007
lor+0.006
comb+0.006
 Flo+0.006
Neg logit contributions
 gears-0.008
 motors-0.008
 trucks-0.008
 magnets-0.008
 automobiles-0.007
Token and
Token ID290
Feature activation+1.15e-02

Pos logit contributions
elope+0.051
adow+0.049
lor+0.049
 Flo+0.046
opsy+0.045
Neg logit contributions
 gears-0.041
 trucks-0.039
 motors-0.039
 magnets-0.036
 colors-0.036
Token on
Token ID319
Feature activation+2.27e-02

Pos logit contributions
adow+0.032
elope+0.031
lor+0.029
oney+0.027
uf+0.027
Neg logit contributions
 gears-0.021
 motors-0.018
 colors-0.018
 balls-0.018
 magnets-0.017
Token the
Token ID262
Feature activation+9.86e-03

Pos logit contributions
adow+0.049
elope+0.044
lor+0.040
 Flo+0.038
oney+0.038
Neg logit contributions
 gears-0.036
 motors-0.032
 magnets-0.031
 trucks-0.031
 numbers-0.030
Token back
Token ID736
Feature activation+3.05e-02

Pos logit contributions
adow+0.023
elope+0.021
 Flo+0.019
lor+0.019
 perched+0.018
Neg logit contributions
 gears-0.014
 motors-0.012
 trucks-0.012
 magnets-0.012
 cubes-0.011
Token of
Token ID286
Feature activation+1.04e-02

Pos logit contributions
adow+0.062
elope+0.060
lor+0.054
 Flo+0.052
oney+0.052
Neg logit contributions
 gears-0.047
 motors-0.041
 trucks-0.041
 magnets-0.040
 cubes-0.038
Token a
Token ID257
Feature activation+2.28e-02

Pos logit contributions
adow+0.021
elope+0.019
lor+0.017
 Flo+0.017
 perched+0.016
Neg logit contributions
 gears-0.018
 motors-0.016
 magnets-0.016
 trucks-0.015
 cubes-0.015
Token bird
Token ID6512
Feature activation+4.77e-02

Pos logit contributions
adow+0.054
elope+0.048
ow+0.046
 Flo+0.045
lor+0.045
Neg logit contributions
 gears-0.029
 motors-0.026
 magnets-0.024
 balls-0.023
 cubes-0.023
Token.
Token ID13
Feature activation+1.55e-02

Pos logit contributions
lor+0.093
elope+0.091
uf+0.091
oney+0.091
adow+0.088
Neg logit contributions
 gears-0.055
 balls-0.052
 trucks-0.051
 colors-0.049
 blocks-0.049
Token
Token ID198
Feature activation+2.45e-02

Pos logit contributions
adow+0.033
elope+0.028
comb+0.028
 hive+0.027
 perched+0.027
Neg logit contributions
 gears-0.020
 magnets-0.019
 automobiles-0.018
 brushes-0.017
 motors-0.017
Token
Token ID198
Feature activation+9.37e-03

Pos logit contributions
adow+0.041
elope+0.036
comb+0.033
 hive+0.031
lor+0.031
Neg logit contributions
 gears-0.043
 motors-0.038
 magnets-0.038
 numbers-0.037
 balls-0.037
TokenThe
Token ID464
Feature activation-1.97e-02

Pos logit contributions
adow+0.017
elope+0.016
 Flo+0.014
lor+0.014
 perched+0.014
Neg logit contributions
 gears-0.017
 motors-0.015
 trucks-0.015
 magnets-0.015
 cubes-0.014
Token end
Token ID886
Feature activation+1.00e-02

Pos logit contributions
 gears+0.031
 motors+0.027
 trucks+0.027
 magnets+0.027
 cubes+0.025
Neg logit contributions
adow-0.041
elope-0.038
 Flo-0.034
lor-0.034
 perched-0.033
Token the
Token ID262
Feature activation-1.60e-02

Pos logit contributions
 gears+0.012
 motors+0.011
 trucks+0.010
 magnets+0.010
 cubes+0.010
Neg logit contributions
adow-0.014
elope-0.013
lor-0.012
 Flo-0.012
 perched-0.011
Token Fire
Token ID3764
Feature activation+1.98e-02

Pos logit contributions
 gears+0.026
 motors+0.023
 magnets+0.022
 trucks+0.022
 cubes+0.021
Neg logit contributions
adow-0.032
elope-0.030
 Flo-0.026
 hive-0.026
 perched-0.025
Token.
Token ID13
Feature activation+1.40e-02

Pos logit contributions
adow+0.039
elope+0.036
lor+0.032
 Flo+0.032
 perched+0.031
Neg logit contributions
 gears-0.033
 motors-0.030
 trucks-0.029
 magnets-0.029
 cubes-0.028
Token<|endoftext|>
Token ID50256
Feature activation+2.45e-02

Pos logit contributions
adow+0.027
 perched+0.023
elope+0.023
 hive+0.023
comb+0.023
Neg logit contributions
 gears-0.020
 magnets-0.019
 motors-0.019
 automobiles-0.018
 trucks-0.017
TokenOnce
Token ID7454
Feature activation+3.85e-03

Pos logit contributions
adow+0.050
comb+0.045
oney+0.045
elope+0.045
ossom+0.044
Neg logit contributions
 gears-0.033
 magnets-0.029
 blocks-0.029
 motors-0.028
 balls-0.028
Token upon
Token ID2402
Feature activation+5.64e-02

Pos logit contributions
elope+0.005
adow+0.005
lor+0.005
comb+0.005
tail+0.005
Neg logit contributions
 gears-0.008
 cubes-0.007
 magnets-0.007
 trucks-0.007
 blocks-0.007
Token a
Token ID257
Feature activation+1.67e-02

Pos logit contributions
adow+0.130
elope+0.128
lor+0.123
oney+0.120
 hive+0.118
Neg logit contributions
 gears-0.064
 magnets-0.054
 balls-0.052
 motors-0.051
 numbers-0.050
Token time
Token ID640
Feature activation-3.40e-03

Pos logit contributions
adow+0.034
elope+0.031
lor+0.030
 Flo+0.029
 perched+0.029
Neg logit contributions
 gears-0.024
 motors-0.021
 magnets-0.021
 trucks-0.021
 cubes-0.020
Token there
Token ID612
Feature activation-1.45e-02

Pos logit contributions
 gears+0.004
 trucks+0.004
 cubes+0.003
 motors+0.003
 magnets+0.003
Neg logit contributions
adow-0.008
elope-0.007
lor-0.007
 perched-0.007
 hive-0.007
Token was
Token ID373
Feature activation+3.61e-02

Pos logit contributions
 gears+0.019
 trucks+0.016
 motors+0.016
 cubes+0.016
 magnets+0.015
Neg logit contributions
adow-0.032
elope-0.029
lor-0.029
 hive-0.027
oney-0.027
Token a
Token ID257
Feature activation+5.55e-03

Pos logit contributions
adow+0.072
elope+0.066
lor+0.064
 Flo+0.058
 hive+0.057
Neg logit contributions
 gears-0.059
 magnets-0.053
 motors-0.053
 trucks-0.051
 cubes-0.050
Token in
Token ID287
Feature activation-1.55e-02

Pos logit contributions
 gears+0.043
 motors+0.039
 trucks+0.038
 magnets+0.037
 cubes+0.036
Neg logit contributions
adow-0.057
elope-0.052
 Flo-0.046
lor-0.046
 perched-0.045
Token a
Token ID257
Feature activation+1.09e-02

Pos logit contributions
 gears+0.026
 motors+0.024
 trucks+0.023
 magnets+0.023
 cubes+0.023
Neg logit contributions
adow-0.029
elope-0.027
 Flo-0.024
lor-0.023
 perched-0.023
Token powerful
Token ID3665
Feature activation-2.06e-02

Pos logit contributions
adow+0.023
elope+0.021
 Flo+0.020
lor+0.019
 perched+0.019
Neg logit contributions
 gears-0.017
 motors-0.015
 magnets-0.014
 trucks-0.014
 trophies-0.014
Token yet
Token ID1865
Feature activation+6.51e-03

Pos logit contributions
 gears+0.035
 motors+0.031
 trucks+0.031
 magnets+0.030
 cubes+0.029
Neg logit contributions
adow-0.040
elope-0.037
 Flo-0.033
lor-0.033
 perched-0.032
Token quiet
Token ID5897
Feature activation-1.76e-02

Pos logit contributions
adow+0.013
elope+0.012
 Flo+0.011
lor+0.011
opsy+0.011
Neg logit contributions
 gears-0.011
 motors-0.010
 trucks-0.009
 magnets-0.009
 numbers-0.009
Token way
Token ID835
Feature activation-1.61e-03

Pos logit contributions
 gears+0.028
 motors+0.025
 trucks+0.024
 magnets+0.024
 cubes+0.023
Neg logit contributions
adow-0.036
elope-0.033
 Flo-0.030
lor-0.029
 perched-0.029
Token.
Token ID13
Feature activation-5.04e-04

Pos logit contributions
 gears+0.002
 motors+0.002
 trucks+0.002
 magnets+0.002
 automobiles+0.002
Neg logit contributions
adow-0.003
elope-0.003
lor-0.003
opsy-0.003
 hive-0.003
Token
Token ID220
Feature activation+6.23e-03

Pos logit contributions
 gears+0.001
 motors+0.001
 magnets+0.001
 trucks+0.001
 cubes+0.001
Neg logit contributions
adow-0.001
elope-0.001
 hive-0.001
comb-0.001
lor-0.001
Token
Token ID198
Feature activation+1.27e-02

Pos logit contributions
adow+0.012
elope+0.011
 hive+0.011
comb+0.010
 Flo+0.009
Neg logit contributions
 gears-0.011
 motors-0.010
 magnets-0.009
 trucks-0.009
 automobiles-0.009
Token
Token ID198
Feature activation-1.61e-03

Pos logit contributions
adow+0.025
elope+0.022
 hive+0.021
comb+0.021
ow+0.019
Neg logit contributions
 gears-0.021
 motors-0.019
 cubes-0.017
 magnets-0.017
 trucks-0.017
TokenThe
Token ID464
Feature activation-1.11e-02

Pos logit contributions
 gears+0.002
 motors+0.002
 trucks+0.002
 magnets+0.002
 cubes+0.002
Neg logit contributions
adow-0.004
elope-0.003
 hive-0.003
 Flo-0.003
comb-0.003
Token two
Token ID734
Feature activation-1.15e-02

Pos logit contributions
adow+0.068
elope+0.065
lor+0.060
comb+0.058
 perched+0.057
Neg logit contributions
 gears-0.046
 magnets-0.043
 cubes-0.042
 motors-0.042
 trucks-0.040
Token beetles
Token ID46716
Feature activation+2.23e-02

Pos logit contributions
 gears+0.020
 motors+0.018
 trucks+0.018
 magnets+0.018
 cubes+0.017
Neg logit contributions
adow-0.022
elope-0.020
 Flo-0.017
 hive-0.017
 perched-0.017
Token.
Token ID13
Feature activation-1.71e-02

Pos logit contributions
adow+0.042
elope+0.041
lor+0.040
 Flo+0.038
comb+0.037
Neg logit contributions
 gears-0.032
 magnets-0.029
 trucks-0.029
 motors-0.029
 cubes-0.028
Token They
Token ID1119
Feature activation-1.51e-02

Pos logit contributions
 gears+0.027
 magnets+0.025
 trucks+0.025
 motors+0.024
 cubes+0.023
Neg logit contributions
adow-0.036
lor-0.031
elope-0.031
 perched-0.029
 hive-0.028
Token were
Token ID547
Feature activation+6.85e-03

Pos logit contributions
 gears+0.028
 motors+0.026
 trucks+0.025
 magnets+0.025
 cubes+0.024
Neg logit contributions
adow-0.027
elope-0.024
 Flo-0.022
 perched-0.021
 hive-0.021
Token best
Token ID1266
Feature activation-2.68e-02

Pos logit contributions
adow+0.015
lor+0.014
elope+0.014
comb+0.013
ossom+0.012
Neg logit contributions
 gears-0.009
 trucks-0.009
 magnets-0.009
 motors-0.008
 automobiles-0.008
Token friends
Token ID2460
Feature activation-2.79e-02

Pos logit contributions
 gears+0.047
 motors+0.043
 trucks+0.042
 magnets+0.042
 cubes+0.040
Neg logit contributions
adow-0.050
elope-0.046
 Flo-0.040
lor-0.040
 perched-0.039
Token,
Token ID11
Feature activation-1.89e-02

Pos logit contributions
 gears+0.052
 motors+0.047
 trucks+0.046
 magnets+0.045
 cubes+0.044
Neg logit contributions
adow-0.050
elope-0.046
 Flo-0.040
lor-0.039
 perched-0.039
Token but
Token ID475
Feature activation-3.29e-02

Pos logit contributions
 gears+0.030
 motors+0.027
 trucks+0.026
 magnets+0.026
 cubes+0.024
Neg logit contributions
adow-0.039
elope-0.036
 Flo-0.032
 perched-0.031
lor-0.031
Token they
Token ID484
Feature activation-2.37e-02

Pos logit contributions
 gears+0.036
 trucks+0.036
 magnets+0.032
 motors+0.032
 colors+0.031
Neg logit contributions
adow-0.086
lor-0.080
elope-0.073
comb-0.068
oney-0.068
Token often
Token ID1690
Feature activation-1.86e-02

Pos logit contributions
 gears+0.039
 motors+0.035
 trucks+0.034
 magnets+0.034
 cubes+0.032
Neg logit contributions
adow-0.047
elope-0.044
 Flo-0.039
 perched-0.038
 hive-0.037
Token with
Token ID351
Feature activation+1.87e-03

Pos logit contributions
 gears+0.052
 motors+0.047
 magnets+0.046
 trucks+0.046
 cubes+0.044
Neg logit contributions
adow-0.056
elope-0.051
 Flo-0.045
lor-0.044
 perched-0.044
Token all
Token ID477
Feature activation-1.54e-02

Pos logit contributions
adow+0.004
elope+0.003
lor+0.003
 Flo+0.003
 perched+0.003
Neg logit contributions
 gears-0.003
 motors-0.003
 trucks-0.003
 magnets-0.003
 cubes-0.003
Token the
Token ID262
Feature activation-1.26e-02

Pos logit contributions
 gears+0.026
 motors+0.023
 trucks+0.022
 magnets+0.022
 cubes+0.021
Neg logit contributions
adow-0.030
elope-0.028
 Flo-0.025
lor-0.025
 perched-0.024
Token blue
Token ID4171
Feature activation-1.52e-02

Pos logit contributions
 gears+0.017
 motors+0.015
 magnets+0.015
driver+0.015
 trucks+0.015
Neg logit contributions
adow-0.027
elope-0.025
 Flo-0.022
 hive-0.022
 perched-0.022
Tokenberries
Token ID20853
Feature activation+4.84e-03

Pos logit contributions
 gears+0.021
 motors+0.018
 trucks+0.017
 magnets+0.017
 cubes+0.016
Neg logit contributions
adow-0.034
elope-0.030
 Flo-0.028
 perched-0.027
 hive-0.027
Token in
Token ID287
Feature activation+1.37e-02

Pos logit contributions
adow+0.009
elope+0.009
lor+0.008
 Flo+0.008
 perched+0.007
Neg logit contributions
 gears-0.008
 motors-0.007
 trucks-0.007
 magnets-0.007
 cubes-0.007
Token the
Token ID262
Feature activation+2.23e-02

Pos logit contributions
adow+0.027
elope+0.025
lor+0.022
 Flo+0.022
 perched+0.021
Neg logit contributions
 gears-0.023
 motors-0.021
 magnets-0.020
 trucks-0.020
 cubes-0.019
Token basket
Token ID7988
Feature activation+3.19e-02

Pos logit contributions
adow+0.049
elope+0.046
 Flo+0.042
lor+0.042
oney+0.042
Neg logit contributions
 gears-0.035
 motors-0.030
 trucks-0.029
 magnets-0.029
 cubes-0.027
Token bump
Token ID13852
Feature activation-1.38e-02

Pos logit contributions
elope+0.067
adow+0.065
 Flo+0.060
lor+0.059
oney+0.058
Neg logit contributions
 gears-0.040
 magnets-0.035
 cubes-0.035
 trucks-0.034
 motors-0.034
Tokening
Token ID278
Feature activation+5.53e-03

Pos logit contributions
 gears+0.020
 motors+0.018
 trucks+0.017
 magnets+0.017
 cubes+0.016
Neg logit contributions
adow-0.030
elope-0.028
 Flo-0.025
lor-0.025
 perched-0.024
Token around
Token ID1088
Feature activation+1.18e-02

Pos logit contributions
adow+0.011
elope+0.010
lor+0.009
 Flo+0.009
 perched+0.008
Neg logit contributions
 gears-0.009
 motors-0.008
 trucks-0.008
 magnets-0.008
 cubes-0.008
Token to
Token ID284
Feature activation-8.14e-04

Pos logit contributions
 gears+0.009
 motors+0.008
 trucks+0.008
 magnets+0.008
 cubes+0.007
Neg logit contributions
adow-0.009
elope-0.008
lor-0.007
 Flo-0.007
 perched-0.007
Token the
Token ID262
Feature activation+2.65e-02

Pos logit contributions
 gears+0.001
 trucks+0.001
 motors+0.001
 magnets+0.001
 numbers+0.001
Neg logit contributions
adow-0.002
elope-0.002
lor-0.001
 Flo-0.001
oney-0.001
Token top
Token ID1353
Feature activation+7.68e-03

Pos logit contributions
adow+0.058
elope+0.054
lor+0.049
 Flo+0.048
oney+0.048
Neg logit contributions
 gears-0.039
 trucks-0.034
 numbers-0.032
 motors-0.032
 magnets-0.031
Token,
Token ID11
Feature activation+6.54e-03

Pos logit contributions
adow+0.017
elope+0.016
lor+0.014
 Flo+0.014
 perched+0.014
Neg logit contributions
 gears-0.011
 trucks-0.010
 motors-0.009
 magnets-0.009
 cubes-0.009
Token he
Token ID339
Feature activation-3.77e-04

Pos logit contributions
adow+0.014
elope+0.013
lor+0.012
 Flo+0.012
 hive+0.011
Neg logit contributions
 gears-0.010
 motors-0.009
 trucks-0.008
 magnets-0.008
 cubes-0.008
Token looked
Token ID3114
Feature activation-4.48e-03

Pos logit contributions
 gears+0.001
 trucks+0.001
 motors+0.001
 magnets+0.001
 cubes+0.000
Neg logit contributions
adow-0.001
elope-0.001
lor-0.001
 Flo-0.001
oney-0.001
Token around
Token ID1088
Feature activation+4.84e-03

Pos logit contributions
 gears+0.008
 motors+0.008
 trucks+0.008
 magnets+0.007
 cubes+0.007
Neg logit contributions
adow-0.008
elope-0.007
 Flo-0.006
lor-0.006
 perched-0.006
Token and
Token ID290
Feature activation+1.10e-02

Pos logit contributions
adow+0.011
elope+0.010
lor+0.010
 Flo+0.009
oney+0.009
Neg logit contributions
 gears-0.007
 trucks-0.006
 motors-0.006
 magnets-0.006
 cubes-0.005
Token saw
Token ID2497
Feature activation+1.15e-02

Pos logit contributions
adow+0.026
lor+0.025
elope+0.024
oney+0.022
 Flo+0.022
Neg logit contributions
 gears-0.014
 trucks-0.012
 colors-0.011
 motors-0.011
 cubes-0.010
Token the
Token ID262
Feature activation+1.82e-02

Pos logit contributions
adow+0.023
elope+0.021
lor+0.019
 Flo+0.018
oney+0.018
Neg logit contributions
 gears-0.020
 motors-0.018
 trucks-0.018
 magnets-0.017
 cubes-0.017
Token whole
Token ID2187
Feature activation+1.92e-02

Pos logit contributions
adow+0.039
elope+0.036
lor+0.034
 Flo+0.032
oney+0.032
Neg logit contributions
 gears-0.028
 trucks-0.024
 motors-0.023
 numbers-0.023
 magnets-0.023
Token.
Token ID13
Feature activation-2.73e-02

Pos logit contributions
adow+0.010
elope+0.009
lor+0.008
comb+0.008
 perched+0.008
Neg logit contributions
 gears-0.005
 trucks-0.005
 motors-0.005
 numbers-0.004
 magnets-0.004
Token He
Token ID679
Feature activation-1.49e-02

Pos logit contributions
 gears+0.055
 motors+0.050
 trucks+0.049
 magnets+0.048
 cubes+0.047
Neg logit contributions
adow-0.045
elope-0.041
lor-0.035
 Flo-0.035
 hive-0.035
Token loved
Token ID6151
Feature activation-2.51e-02

Pos logit contributions
 gears+0.024
 motors+0.021
 trucks+0.021
 magnets+0.020
 cubes+0.019
Neg logit contributions
adow-0.031
elope-0.028
 Flo-0.025
lor-0.025
 perched-0.025
Token his
Token ID465
Feature activation-2.21e-02

Pos logit contributions
 gears+0.044
 motors+0.040
 magnets+0.040
 trophies+0.038
about+0.037
Neg logit contributions
adow-0.041
elope-0.039
 Flo-0.035
 perched-0.035
 hive-0.034
Token t
Token ID256
Feature activation-1.95e-02

Pos logit contributions
 gears+0.030
 motors+0.027
 magnets+0.026
 trucks+0.025
 cubes+0.024
Neg logit contributions
adow-0.049
elope-0.046
 Flo-0.041
 hive-0.041
lor-0.041
Tokeneddy
Token ID21874
Feature activation-1.72e-02

Pos logit contributions
 gears+0.031
 motors+0.028
 trucks+0.027
 magnets+0.027
 trophies+0.026
Neg logit contributions
adow-0.039
elope-0.036
lor-0.032
 Flo-0.031
ow-0.031
Token bear
Token ID6842
Feature activation-8.83e-03

Pos logit contributions
 gears+0.029
 motors+0.026
 magnets+0.025
 trucks+0.025
 cubes+0.024
Neg logit contributions
adow-0.033
elope-0.031
 Flo-0.027
 perched-0.026
lor-0.026
Token very
Token ID845
Feature activation-1.04e-02

Pos logit contributions
 gears+0.014
 trucks+0.012
 motors+0.012
 magnets+0.012
 cubes+0.011
Neg logit contributions
adow-0.018
elope-0.017
lor-0.016
 Flo-0.016
 perched-0.015
Token much
Token ID881
Feature activation-1.90e-02

Pos logit contributions
 gears+0.022
 motors+0.020
 trucks+0.020
 magnets+0.019
 cubes+0.019
Neg logit contributions
adow-0.016
elope-0.014
 Flo-0.012
lor-0.012
 perched-0.012
Token.
Token ID13
Feature activation-3.01e-02

Pos logit contributions
 gears+0.039
 motors+0.036
 trucks+0.035
 magnets+0.034
 cubes+0.033
Neg logit contributions
adow-0.031
elope-0.027
 Flo-0.023
lor-0.023
 perched-0.023
Token
Token ID198
Feature activation-7.81e-03

Pos logit contributions
 gears+0.063
 motors+0.058
 trucks+0.057
 magnets+0.056
 cubes+0.054
Neg logit contributions
adow-0.048
elope-0.043
lor-0.037
 Flo-0.036
 hive-0.036
Token and
Token ID290
Feature activation-3.87e-02

Pos logit contributions
 gears+0.053
 motors+0.049
 magnets+0.046
 trophies+0.046
driver+0.046
Neg logit contributions
adow-0.051
elope-0.045
 perched-0.040
 Flo-0.040
 hive-0.038
Token Jack
Token ID3619
Feature activation-3.53e-02

Pos logit contributions
 gears+0.064
 motors+0.059
 magnets+0.055
 trucks+0.055
 automobiles+0.053
Neg logit contributions
adow-0.075
elope-0.070
 Flo-0.062
 perched-0.059
 hive-0.058
Token was
Token ID373
Feature activation-1.93e-02

Pos logit contributions
 gears+0.064
 motors+0.058
 trucks+0.056
 magnets+0.056
 automobiles+0.054
Neg logit contributions
adow-0.066
elope-0.060
 perched-0.051
 Flo-0.050
 hive-0.049
Token always
Token ID1464
Feature activation-2.06e-02

Pos logit contributions
 gears+0.033
 motors+0.029
 trucks+0.029
 magnets+0.029
 cubes+0.028
Neg logit contributions
adow-0.038
elope-0.035
lor-0.032
 Flo-0.031
comb-0.030
Token making
Token ID1642
Feature activation-2.66e-02

Pos logit contributions
 gears+0.037
 motors+0.033
 trucks+0.033
 magnets+0.033
 cubes+0.031
Neg logit contributions
adow-0.038
elope-0.035
 Flo-0.031
lor-0.031
 perched-0.030
Token mess
Token ID2085
Feature activation-5.18e-02

Pos logit contributions
 gears+0.043
 motors+0.038
 trucks+0.037
 magnets+0.037
 cubes+0.035
Neg logit contributions
adow-0.055
elope-0.050
 Flo-0.045
lor-0.045
 perched-0.044
Tokenes
Token ID274
Feature activation-2.25e-02

Pos logit contributions
 gears+0.101
 motors+0.095
 trucks+0.091
 automobiles+0.089
 magnets+0.089
Neg logit contributions
adow-0.079
elope-0.069
lor-0.059
 perched-0.057
 Flo-0.056
Token and
Token ID290
Feature activation-3.70e-02

Pos logit contributions
 gears+0.043
 motors+0.039
 trucks+0.038
 magnets+0.038
 automobiles+0.036
Neg logit contributions
adow-0.039
elope-0.034
 Flo-0.030
lor-0.029
 perched-0.029
Token leaving
Token ID4305
Feature activation-2.35e-02

Pos logit contributions
 gears+0.054
 motors+0.051
 automobiles+0.047
about+0.045
 magnets+0.045
Neg logit contributions
adow-0.069
elope-0.064
 perched-0.057
 hive-0.056
 Flo-0.055
Token things
Token ID1243
Feature activation-1.08e-02

Pos logit contributions
 gears+0.038
 motors+0.034
 trucks+0.033
 magnets+0.033
 cubes+0.031
Neg logit contributions
adow-0.047
elope-0.043
 Flo-0.039
lor-0.037
 perched-0.037
Token around
Token ID1088
Feature activation-1.82e-02

Pos logit contributions
 gears+0.022
 motors+0.020
 trucks+0.020
 magnets+0.019
 automobiles+0.019
Neg logit contributions
adow-0.017
elope-0.014
 perched-0.013
 Flo-0.012
lor-0.012
Token day
Token ID1110
Feature activation-2.08e-02

Pos logit contributions
 gears+0.063
 motors+0.058
 magnets+0.056
 trophies+0.055
 trucks+0.054
Neg logit contributions
adow-0.066
elope-0.058
 Flo-0.050
 hive-0.049
ow-0.048
Token,
Token ID11
Feature activation-2.11e-02

Pos logit contributions
 gears+0.040
 motors+0.037
 trucks+0.035
 magnets+0.035
 automobiles+0.035
Neg logit contributions
adow-0.035
elope-0.031
 Flo-0.029
 perched-0.027
ow-0.026
Token they
Token ID484
Feature activation-4.06e-02

Pos logit contributions
 gears+0.037
 motors+0.034
 magnets+0.033
 trucks+0.033
 cubes+0.031
Neg logit contributions
adow-0.039
elope-0.036
 Flo-0.032
 perched-0.031
lor-0.030
Token decide
Token ID5409
Feature activation+3.36e-03

Pos logit contributions
 gears+0.074
 motors+0.068
 trucks+0.064
 automobiles+0.064
 cubes+0.064
Neg logit contributions
adow-0.067
elope-0.057
 hive-0.051
 perched-0.050
ow-0.048
Token to
Token ID284
Feature activation-4.26e-02

Pos logit contributions
adow+0.006
elope+0.005
 perched+0.005
 Flo+0.005
lor+0.004
Neg logit contributions
 gears-0.006
 motors-0.006
 magnets-0.005
 trucks-0.005
 cubes-0.005
Token trade
Token ID3292
Feature activation-6.08e-02

Pos logit contributions
 gears+0.069
 motors+0.061
 magnets+0.061
 trophies+0.060
 cubes+0.059
Neg logit contributions
adow-0.074
elope-0.066
ow-0.062
 Flo-0.059
tail-0.057
Token their
Token ID511
Feature activation-4.70e-02

Pos logit contributions
 gears+0.102
 motors+0.093
 magnets+0.088
 trucks+0.088
 cubes+0.084
Neg logit contributions
adow-0.114
elope-0.103
 Flo-0.092
 perched-0.092
lor-0.089
Token toys
Token ID14958
Feature activation-5.21e-02

Pos logit contributions
 gears+0.048
 motors+0.042
 magnets+0.039
ruck+0.038
driver+0.038
Neg logit contributions
adow-0.107
elope-0.098
 hive-0.089
 perched-0.088
 Flo-0.088
Token.
Token ID13
Feature activation-2.97e-02

Pos logit contributions
 gears+0.102
 motors+0.094
 trucks+0.091
 magnets+0.090
 automobiles+0.086
Neg logit contributions
adow-0.086
elope-0.074
 perched-0.065
 Flo-0.064
lor-0.063
Token
Token ID198
Feature activation-4.96e-03

Pos logit contributions
 gears+0.050
 trucks+0.045
 motors+0.044
 magnets+0.044
 cubes+0.042
Neg logit contributions
adow-0.059
elope-0.053
lor-0.050
comb-0.048
 hive-0.047
Token
Token ID198
Feature activation-2.56e-02

Pos logit contributions
 gears+0.009
 motors+0.009
 trucks+0.008
 magnets+0.008
 cubes+0.008
Neg logit contributions
adow-0.008
elope-0.007
comb-0.007
 hive-0.006
lor-0.006
Token sorry
Token ID7926
Feature activation-1.45e-02

Pos logit contributions
 gears+0.013
 trucks+0.012
 magnets+0.011
 motors+0.011
 balls+0.010
Neg logit contributions
lor-0.016
adow-0.015
elope-0.015
 Flo-0.013
comb-0.013
Token.
Token ID13
Feature activation-2.51e-02

Pos logit contributions
 gears+0.025
 motors+0.022
 trucks+0.022
 magnets+0.022
 cubes+0.020
Neg logit contributions
adow-0.029
elope-0.027
lor-0.024
 Flo-0.024
 perched-0.023
Token They
Token ID1119
Feature activation-2.79e-02

Pos logit contributions
 gears+0.037
 motors+0.032
 trucks+0.031
 magnets+0.031
 cubes+0.030
Neg logit contributions
adow-0.055
elope-0.051
 Flo-0.046
lor-0.045
 perched-0.045
Token did
Token ID750
Feature activation-9.10e-04

Pos logit contributions
 gears+0.045
 motors+0.040
 magnets+0.039
 trucks+0.039
 cubes+0.036
Neg logit contributions
adow-0.056
elope-0.052
lor-0.047
 Flo-0.047
 perched-0.045
Token not
Token ID407
Feature activation-3.68e-02

Pos logit contributions
 gears+0.002
 motors+0.002
 trucks+0.001
 magnets+0.001
 cubes+0.001
Neg logit contributions
adow-0.002
elope-0.002
 Flo-0.001
lor-0.001
 perched-0.001
Token mean
Token ID1612
Feature activation-5.59e-02

Pos logit contributions
 gears+0.066
 motors+0.061
 trucks+0.060
 magnets+0.059
 automobiles+0.058
Neg logit contributions
adow-0.069
elope-0.059
ow-0.052
 Flo-0.051
 hive-0.050
Token to
Token ID284
Feature activation-3.49e-02

Pos logit contributions
 gears+0.127
 magnets+0.117
 motors+0.114
 pawn+0.112
 trophies+0.112
Neg logit contributions
adow-0.062
elope-0.048
ow-0.047
 Flo-0.042
 hive-0.040
Token make
Token ID787
Feature activation-1.08e-02

Pos logit contributions
 gears+0.058
 motors+0.051
 magnets+0.051
 trophies+0.050
 trucks+0.048
Neg logit contributions
adow-0.061
elope-0.061
 Flo-0.055
ow-0.053
opsy-0.051
Token Mom
Token ID11254
Feature activation-1.56e-02

Pos logit contributions
 gears+0.017
 motors+0.016
 trucks+0.016
 magnets+0.015
 automobiles+0.015
Neg logit contributions
adow-0.023
elope-0.021
lor-0.020
 Flo-0.019
comb-0.018
Token angry
Token ID7954
Feature activation-4.40e-02

Pos logit contributions
 gears+0.028
 motors+0.026
 trucks+0.025
 magnets+0.025
 cubes+0.024
Neg logit contributions
adow-0.028
elope-0.026
 Flo-0.023
lor-0.023
 perched-0.022
Token.
Token ID13
Feature activation-2.39e-02

Pos logit contributions
 gears+0.097
 motors+0.088
 magnets+0.086
 trucks+0.085
 cubes+0.082
Neg logit contributions
adow-0.061
elope-0.054
 Flo-0.046
ow-0.044
 hive-0.043
Token Tom
Token ID4186
Feature activation-3.92e-02

Pos logit contributions
 gears+0.058
 motors+0.053
 magnets+0.052
 trucks+0.051
 cubes+0.050
Neg logit contributions
adow-0.052
elope-0.047
 Flo-0.042
lor-0.040
 perched-0.040
Token had
Token ID550
Feature activation-2.68e-02

Pos logit contributions
 gears+0.075
 motors+0.073
 automobiles+0.069
 trucks+0.066
 magnets+0.066
Neg logit contributions
adow-0.064
elope-0.057
 perched-0.050
ow-0.046
abel-0.045
Token so
Token ID523
Feature activation-1.56e-02

Pos logit contributions
 gears+0.050
 motors+0.047
 magnets+0.046
 trucks+0.046
 automobiles+0.045
Neg logit contributions
adow-0.044
elope-0.039
 perched-0.035
 Flo-0.034
lor-0.033
Token much
Token ID881
Feature activation-4.02e-02

Pos logit contributions
 gears+0.033
 motors+0.031
 trucks+0.030
 magnets+0.030
 automobiles+0.029
Neg logit contributions
adow-0.025
elope-0.021
 Flo-0.018
 perched-0.017
ow-0.016
Token fun
Token ID1257
Feature activation-5.30e-02

Pos logit contributions
 trucks+0.073
 motors+0.072
 automobiles+0.070
 gears+0.069
 pawn+0.069
Neg logit contributions
adow-0.056
 perched-0.041
 resting-0.038
 ther-0.037
 hive-0.036
Token mixing
Token ID17090
Feature activation-5.73e-02

Pos logit contributions
 gears+0.099
 motors+0.093
 magnets+0.090
 trucks+0.085
 automobiles+0.085
Neg logit contributions
adow-0.087
elope-0.074
 perched-0.071
 hive-0.065
 Flo-0.063
Token things
Token ID1243
Feature activation-4.18e-02

Pos logit contributions
 gears+0.111
 motors+0.107
driver+0.102
 automobiles+0.101
 trucks+0.100
Neg logit contributions
adow-0.076
elope-0.065
 perched-0.059
 Flo-0.056
 ther-0.053
Token together
Token ID1978
Feature activation-2.83e-02

Pos logit contributions
 gears+0.085
 motors+0.080
 trucks+0.078
 magnets+0.077
 automobiles+0.076
Neg logit contributions
adow-0.064
elope-0.050
 perched-0.049
 Flo-0.046
ow-0.045
Token in
Token ID287
Feature activation-2.11e-02

Pos logit contributions
 gears+0.054
 motors+0.049
 trucks+0.048
 magnets+0.047
 automobiles+0.046
Neg logit contributions
adow-0.049
elope-0.044
 Flo-0.038
 perched-0.038
lor-0.037
Token the
Token ID262
Feature activation+5.88e-03

Pos logit contributions
 gears+0.041
 motors+0.039
 automobiles+0.038
 trucks+0.038
 magnets+0.036
Neg logit contributions
adow-0.028
elope-0.027
 Flo-0.022
 perched-0.021
 hive-0.021
Token kitchen
Token ID9592
Feature activation-3.79e-03

Pos logit contributions
adow+0.010
elope+0.009
 hive+0.008
lor+0.008
 ther+0.008
Neg logit contributions
 gears-0.011
 motors-0.010
 trucks-0.010
 magnets-0.009
 trophies-0.009
Token
Token ID198
Feature activation-2.72e-02

Pos logit contributions
adow+0.008
elope+0.007
 Flo+0.006
lor+0.006
 hive+0.006
Neg logit contributions
 gears-0.013
 motors-0.012
 trucks-0.011
 magnets-0.011
 cubes-0.011
TokenTim
Token ID14967
Feature activation-2.35e-02

Pos logit contributions
 gears+0.056
 magnets+0.053
 trophies+0.052
 pawn+0.051
 motors+0.051
Neg logit contributions
elope-0.033
adow-0.032
 perched-0.030
lor-0.027
<|endoftext|>-0.026
Tokenmy
Token ID1820
Feature activation-1.80e-02

Pos logit contributions
 gears+0.041
 motors+0.038
 magnets+0.036
 trucks+0.036
 trophies+0.035
Neg logit contributions
adow-0.045
elope-0.041
 perched-0.035
lor-0.034
 Flo-0.034
Token learned
Token ID4499
Feature activation-2.30e-02

Pos logit contributions
 gears+0.037
 motors+0.034
 trucks+0.032
 magnets+0.032
 cubes+0.032
Neg logit contributions
adow-0.028
elope-0.026
 perched-0.022
 hive-0.021
lor-0.021
Token that
Token ID326
Feature activation-2.33e-02

Pos logit contributions
 gears+0.045
 motors+0.043
 trucks+0.042
 sizes+0.042
 magnets+0.042
Neg logit contributions
adow-0.031
elope-0.027
 perched-0.026
 Flo-0.025
lor-0.025
Token sharing
Token ID7373
Feature activation-5.32e-02

Pos logit contributions
 gears+0.041
 motors+0.038
 trucks+0.036
 cubes+0.035
 levers+0.035
Neg logit contributions
adow-0.036
elope-0.034
 perched-0.031
 hive-0.031
 Flo-0.031
Token is
Token ID318
Feature activation-1.29e-02

Pos logit contributions
 gears+0.106
 motors+0.105
about+0.097
 levers+0.096
driver+0.093
Neg logit contributions
adow-0.057
elope-0.052
c-0.047
's-0.044
 hive-0.043
Token good
Token ID922
Feature activation-5.79e-02

Pos logit contributions
 gears+0.028
 motors+0.027
 trucks+0.027
 cubes+0.027
about+0.027
Neg logit contributions
adow-0.013
 okay-0.012
elope-0.012
 perched-0.012
 OK-0.011
Token and
Token ID290
Feature activation-3.19e-02

Pos logit contributions
 motors+0.123
 gears+0.121
 levers+0.118
 sculptures+0.117
hing+0.116
Neg logit contributions
adow-0.061
ow-0.048
elope-0.046
 Just-0.044
 OK-0.041
Token it
Token ID340
Feature activation-1.19e-02

Pos logit contributions
 gears+0.059
 motors+0.055
 trucks+0.052
 cubes+0.052
 magnets+0.051
Neg logit contributions
adow-0.050
elope-0.047
 perched-0.041
 Flo-0.041
 hive-0.038
Token helps
Token ID5419
Feature activation-1.53e-02

Pos logit contributions
 gears+0.030
 motors+0.029
 cubes+0.028
 trucks+0.028
 hoops+0.027
Neg logit contributions
adow-0.009
elope-0.009
 perched-0.007
 hive-0.006
lor-0.006