TOP ACTIVATIONS
MAX = 0.093

day , Lily accidentally touched the screen and it broke .
Sara sees a light on the screen . The light is
was surprised . He touched the screen and it made such
just couldn 't . So the object decided to keep repeating
see ? An image of the perfect day ! The sun
and pushed the button on the television . The screen lit
room . They can touch the screen and make things happen
. His eyes landed on the little freezer . He smiled
so big ! She touched the screen and it lit up
frustrated . The buttons on the television were different and she
Tim stood up and pushed the button on the television .
ummy w affle . The w affle was very special because
. He laid down on the cushion and curled up .
so cool ! She touched the shapes and they moved around
than anything he had ever seen before . Mark
seen before - a w affle ! She was so confused
ia put her hand on the cold mirror and started to
snacks . She saw the pupp ets and smiled .
Tom jumped on the table to get a closer
the big black board on the wall . Tim had a

MIN ACTIVATIONS
MIN = -0.097

branches . They make the tree colorful and bright .
it . He makes the tree look more beautiful and fun
happy that he gave the little bird a big hug .
Tim my 's blanket would become lively whenever they wanted to
iring , until it started to get dark . He started
your name ? '. The little pig let bl ushed and
and asked if she could have some po pp ies .
and pretend they were going on a trip . They took
so they can make the forest pretty ." L
. Soon , the path grew steep . Joe was tired
But sometimes , the sky gets dark and it rains .
mom if they could go on a saf ari , but
them up . But the last snowball was too heavy for
. She wanted to make it soar higher than the boy
. Every morning , the little pig would run and jump
me !" The little boy said , " What
." So the little boy started sharp ening the
, Grace 's attic was becoming more and more full with
that it could make the garden even better .
Soon , the night sky grew very dark . The little

INTERVAL 0.045 - 0.093
CONTAINS 0.347%

, " That 's what friends are for !" <|endoftext|> Once
she hurried ly ran over and met the little rabbit .
He was carrying a bou quet of wild fl owers he
She ran to the kitchen and opened the box . Inside
Can I go outside and show the wallet to my friends

INTERVAL -0.002 - 0.045
CONTAINS 54.403%

her hand , she slowly made her way up .
had a big red balloon that she liked to play with
why the bird had kept striking the tree . Looking up
vegetables , we can have some dessert . How does that
spoil the nature around here ?". Molly was deeply sorry and

INTERVAL -0.049 - -0.002
CONTAINS 44.967%

a muscle that was sore . It made her feel very
but also take care when exploring . <|endoftext|> Once upon a
nothing to give . <|endoftext|> Once upon a time , there
Jack watched , his heart began to beat faster and he
was able to make a stitch that looked normal . After

INTERVAL -0.097 - -0.049
CONTAINS 0.283%

, they decide to go on an adventure . They put
, but mom said they had to un pack first .
learned that sometimes , wandering can lead to wonderful things .
's wrong ?" The bird softly replied between its so bs
if her friends were coming to play . But when she
Token day
Token ID1110
Feature activation-2.49e-03

Pos logit contributions
leaf+0.028
vell+0.027
ibrarian+0.026
taker+0.025
iders+0.025
Neg logit contributions
 someday-0.023
 swe-0.023
 spe-0.021
 smaller-0.021
 cheaper-0.021
Token,
Token ID11
Feature activation-1.34e-02

Pos logit contributions
 swe+0.003
 someday+0.003
 cheaper+0.002
 spe+0.002
 smaller+0.002
Neg logit contributions
leaf-0.005
ibrarian-0.005
vell-0.005
taker-0.005
he-0.004
Token Lily
Token ID20037
Feature activation-5.27e-03

Pos logit contributions
 swe+0.019
 spe+0.018
 someday+0.018
 cheaper+0.018
c+0.017
Neg logit contributions
leaf-0.021
ibrarian-0.021
vell-0.020
 character-0.020
taker-0.020
Token accidentally
Token ID14716
Feature activation+2.29e-02

Pos logit contributions
 swe+0.006
 someday+0.006
 spe+0.006
 cheaper+0.006
 smaller+0.006
Neg logit contributions
leaf-0.010
ibrarian-0.010
vell-0.009
taker-0.009
he-0.009
Token touched
Token ID12615
Feature activation-1.02e-02

Pos logit contributions
leaf+0.044
ibrarian+0.042
vell+0.042
taker+0.041
he+0.041
Neg logit contributions
 swe-0.028
 someday-0.026
 smaller-0.026
 spe-0.026
 enough-0.025
Token the
Token ID262
Feature activation+9.28e-02

Pos logit contributions
 swe+0.013
 someday+0.013
 spe+0.013
 cheaper+0.013
 shin+0.012
Neg logit contributions
leaf-0.017
ibrarian-0.017
taker-0.016
vell-0.016
he-0.016
Token screen
Token ID3159
Feature activation+4.78e-03

Pos logit contributions
izards+0.166
ibrarian+0.156
ibr+0.153
ying+0.152
iders+0.149
Neg logit contributions
 smaller-0.134
 blue-0.128
 one-0.123
 bigger-0.116
 One-0.116
Token and
Token ID290
Feature activation+3.23e-02

Pos logit contributions
leaf+0.009
ibrarian+0.008
vell+0.008
taker+0.008
asc+0.008
Neg logit contributions
 swe-0.006
 someday-0.006
 cheaper-0.006
 spe-0.006
 smaller-0.006
Token it
Token ID340
Feature activation+6.30e-05

Pos logit contributions
leaf+0.081
vell+0.077
ibrarian+0.075
asc+0.074
ibr+0.073
Neg logit contributions
 swe-0.025
 enough-0.023
 smaller-0.022
 someday-0.022
 spe-0.021
Token broke
Token ID6265
Feature activation+1.68e-03

Pos logit contributions
leaf+0.000
ibrarian+0.000
vell+0.000
taker+0.000
he+0.000
Neg logit contributions
 swe-0.000
 someday-0.000
 cheaper-0.000
 spe-0.000
 smaller-0.000
Token.
Token ID13
Feature activation+1.49e-02

Pos logit contributions
leaf+0.003
ibrarian+0.002
vell+0.002
taker+0.002
asc+0.002
Neg logit contributions
 swe-0.003
 someday-0.003
 cheaper-0.003
 spe-0.003
 smaller-0.002
Token Sara
Token ID24799
Feature activation-2.15e-02

Pos logit contributions
 swe+0.005
 someday+0.005
 cheaper+0.005
 spe+0.005
c+0.004
Neg logit contributions
leaf-0.008
ibrarian-0.008
vell-0.007
taker-0.007
he-0.007
Token sees
Token ID7224
Feature activation+1.57e-02

Pos logit contributions
 swe+0.029
 spe+0.028
 cheaper+0.028
 someday+0.027
 shin+0.026
Neg logit contributions
ibrarian-0.037
leaf-0.037
vell-0.034
asc-0.033
ibr-0.033
Token a
Token ID257
Feature activation+5.93e-03

Pos logit contributions
leaf+0.033
vell+0.032
ibrarian+0.031
taker+0.031
asc+0.030
Neg logit contributions
 swe-0.016
 someday-0.015
 cheaper-0.015
 smaller-0.015
 enough-0.015
Token light
Token ID1657
Feature activation-4.20e-03

Pos logit contributions
leaf+0.011
ibrarian+0.011
vell+0.010
taker+0.010
asc+0.010
Neg logit contributions
 swe-0.008
 someday-0.007
 cheaper-0.007
 spe-0.007
 smaller-0.007
Token on
Token ID319
Feature activation-2.06e-03

Pos logit contributions
 swe+0.007
 someday+0.007
 cheaper+0.007
 spe+0.007
 smaller+0.006
Neg logit contributions
leaf-0.006
ibrarian-0.006
vell-0.005
taker-0.005
he-0.005
Token the
Token ID262
Feature activation+9.10e-02

Pos logit contributions
 swe+0.002
 someday+0.002
 cheaper+0.002
 spe+0.002
 smaller+0.002
Neg logit contributions
leaf-0.004
ibrarian-0.004
vell-0.004
taker-0.004
he-0.004
Token screen
Token ID3159
Feature activation+2.03e-02

Pos logit contributions
izards+0.169
imes+0.156
oll+0.154
iders+0.152
ibr+0.150
Neg logit contributions
 smaller-0.125
 swe-0.118
 blue-0.117
 one-0.115
 less-0.115
Token.
Token ID13
Feature activation-5.12e-03

Pos logit contributions
leaf+0.042
vell+0.039
ibrarian+0.039
taker+0.037
asc+0.037
Neg logit contributions
 someday-0.023
 swe-0.022
 One-0.020
 cheaper-0.020
c-0.020
Token The
Token ID383
Feature activation+7.18e-02

Pos logit contributions
 swe+0.007
 someday+0.007
 cheaper+0.007
 spe+0.006
 smaller+0.006
Neg logit contributions
leaf-0.009
ibrarian-0.009
vell-0.008
taker-0.008
he-0.008
Token light
Token ID1657
Feature activation-2.05e-04

Pos logit contributions
oll+0.147
izards+0.141
imes+0.139
vell+0.138
idget+0.137
Neg logit contributions
 smaller-0.089
 stream-0.077
 better-0.075
 less-0.074
 blue-0.074
Token is
Token ID318
Feature activation+1.85e-02

Pos logit contributions
 swe+0.000
 someday+0.000
 cheaper+0.000
 spe+0.000
 smaller+0.000
Neg logit contributions
leaf-0.000
ibrarian-0.000
vell-0.000
taker-0.000
asc-0.000
Token was
Token ID373
Feature activation+2.80e-02

Pos logit contributions
leaf+0.016
ibrarian+0.015
vell+0.015
taker+0.015
asc+0.014
Neg logit contributions
 swe-0.009
 someday-0.009
 spe-0.008
 cheaper-0.008
 smaller-0.008
Token surprised
Token ID6655
Feature activation+1.16e-02

Pos logit contributions
leaf+0.053
ibrarian+0.051
vell+0.050
iders+0.049
affles+0.047
Neg logit contributions
 smaller-0.036
 swe-0.034
 enough-0.034
 cheaper-0.033
 spe-0.033
Token.
Token ID13
Feature activation+2.09e-02

Pos logit contributions
leaf+0.023
ibrarian+0.023
vell+0.022
taker+0.021
asc+0.021
Neg logit contributions
 someday-0.014
 swe-0.013
 enough-0.012
 cheaper-0.012
 smaller-0.012
Token He
Token ID679
Feature activation+1.60e-02

Pos logit contributions
leaf+0.046
vell+0.046
ibrarian+0.045
asc+0.043
iders+0.042
Neg logit contributions
 One-0.020
 someday-0.018
 swe-0.018
 cheaper-0.017
 smaller-0.017
Token touched
Token ID12615
Feature activation+2.19e-02

Pos logit contributions
leaf+0.030
ibrarian+0.029
vell+0.029
taker+0.027
asc+0.027
Neg logit contributions
 swe-0.020
 someday-0.019
 spe-0.019
 cheaper-0.018
 smaller-0.018
Token the
Token ID262
Feature activation+9.09e-02

Pos logit contributions
leaf+0.051
vell+0.050
ibrarian+0.050
iders+0.048
ibr+0.047
Neg logit contributions
 swe-0.019
 someday-0.018
 smaller-0.018
 enough-0.018
c-0.017
Token screen
Token ID3159
Feature activation-1.28e-03

Pos logit contributions
izards+0.167
oll+0.157
live+0.155
asc+0.155
ibrarian+0.152
Neg logit contributions
 smaller-0.130
 blue-0.117
 bigger-0.115
 one-0.114
 better-0.110
Token and
Token ID290
Feature activation+3.71e-02

Pos logit contributions
 swe+0.002
 someday+0.002
 spe+0.002
 cheaper+0.002
 smaller+0.001
Neg logit contributions
leaf-0.002
ibrarian-0.002
vell-0.002
taker-0.002
he-0.002
Token it
Token ID340
Feature activation+5.03e-03

Pos logit contributions
leaf+0.083
ibrarian+0.080
vell+0.079
asc+0.079
opher+0.076
Neg logit contributions
 someday-0.035
 smaller-0.035
 swe-0.035
 enough-0.033
 shin-0.032
Token made
Token ID925
Feature activation+2.17e-02

Pos logit contributions
leaf+0.010
ibrarian+0.009
vell+0.009
taker+0.009
asc+0.009
Neg logit contributions
 swe-0.006
 someday-0.006
 spe-0.006
 cheaper-0.006
 smaller-0.005
Token such
Token ID884
Feature activation+1.06e-02

Pos logit contributions
leaf+0.043
ibrarian+0.043
vell+0.042
asc+0.041
iders+0.040
Neg logit contributions
 swe-0.024
 smaller-0.023
 someday-0.023
 enough-0.023
 cheaper-0.022
Token just
Token ID655
Feature activation+4.29e-03

Pos logit contributions
 swe+0.011
 someday+0.010
 cheaper+0.010
 spe+0.010
 smaller+0.009
Neg logit contributions
leaf-0.013
ibrarian-0.013
vell-0.013
taker-0.012
 character-0.012
Token couldn
Token ID3521
Feature activation-5.88e-02

Pos logit contributions
leaf+0.008
ibrarian+0.008
vell+0.007
taker+0.007
asc+0.007
Neg logit contributions
 swe-0.006
 someday-0.006
 spe-0.005
 cheaper-0.005
 smaller-0.005
Token't
Token ID470
Feature activation-4.98e-03

Pos logit contributions
 swe+0.149
 spe+0.146
 someday+0.142
 cheaper+0.140
 shin+0.140
Neg logit contributions
he-0.039
ibrarian-0.033
taker-0.027
iders-0.024
leaf-0.024
Token.
Token ID13
Feature activation-2.08e-02

Pos logit contributions
 swe+0.007
 someday+0.007
 spe+0.006
 cheaper+0.006
 smaller+0.006
Neg logit contributions
leaf-0.009
ibrarian-0.008
vell-0.008
taker-0.008
he-0.008
Token So
Token ID1406
Feature activation-2.35e-02

Pos logit contributions
 swe+0.034
 spe+0.032
 cheaper+0.031
 someday+0.031
 smaller+0.030
Neg logit contributions
leaf-0.030
ibrarian-0.030
he-0.028
taker-0.028
 character-0.027
Token the
Token ID262
Feature activation+8.92e-02

Pos logit contributions
 swe+0.036
 cheaper+0.034
c+0.033
 shin+0.032
 spe+0.032
Neg logit contributions
 ear-0.034
ibrarian-0.033
 goodness-0.033
 character-0.033
leaf-0.031
Token object
Token ID2134
Feature activation-3.09e-02

Pos logit contributions
ives+0.176
iders+0.175
izards+0.171
ibrarian+0.170
leaf+0.169
Neg logit contributions
 copper-0.108
 smaller-0.106
 silver-0.098
 blue-0.090
 shin-0.090
Token decided
Token ID3066
Feature activation-2.23e-02

Pos logit contributions
 swe+0.046
 spe+0.044
 cheaper+0.043
 someday+0.041
 smaller+0.040
Neg logit contributions
ibrarian-0.051
leaf-0.047
taker-0.045
vell-0.045
 character-0.045
Token to
Token ID284
Feature activation+6.09e-03

Pos logit contributions
 spe+0.031
 swe+0.031
c+0.030
 cheaper+0.028
 someday+0.028
Neg logit contributions
leaf-0.036
ibrarian-0.034
taker-0.033
vell-0.032
 outside-0.031
Token keep
Token ID1394
Feature activation+5.66e-03

Pos logit contributions
leaf+0.012
ibrarian+0.011
vell+0.011
taker+0.011
asc+0.010
Neg logit contributions
 swe-0.007
 someday-0.007
 cheaper-0.007
 spe-0.007
 smaller-0.006
Token repeating
Token ID20394
Feature activation+9.97e-03

Pos logit contributions
leaf+0.011
ibrarian+0.010
vell+0.010
taker+0.010
he+0.009
Neg logit contributions
 swe-0.007
 someday-0.007
 cheaper-0.007
 spe-0.007
 smaller-0.007
Token see
Token ID766
Feature activation+4.72e-02

Pos logit contributions
leaf+0.017
ibrarian+0.016
vell+0.016
taker+0.015
asc+0.015
Neg logit contributions
 someday-0.012
 swe-0.012
 cheaper-0.011
 spe-0.011
c-0.011
Token?
Token ID30
Feature activation+1.71e-02

Pos logit contributions
leaf+0.106
vell+0.096
taker+0.096
affle+0.096
opher+0.095
Neg logit contributions
 someday-0.046
c-0.046
 enough-0.041
 swe-0.041
 smaller-0.037
Token An
Token ID1052
Feature activation+3.92e-02

Pos logit contributions
leaf+0.034
vell+0.033
ibrarian+0.033
taker+0.031
asc+0.031
Neg logit contributions
 someday-0.019
 One-0.019
 swe-0.018
c-0.017
 spe-0.017
Token image
Token ID2939
Feature activation+4.38e-02

Pos logit contributions
leaf+0.064
vell+0.064
ibrarian+0.061
iders+0.061
ibr+0.059
Neg logit contributions
 swe-0.056
 spe-0.055
 cheaper-0.053
 someday-0.051
 enough-0.050
Token of
Token ID286
Feature activation+4.18e-02

Pos logit contributions
leaf+0.085
vell+0.080
issors+0.079
ibrarian+0.077
opher+0.077
Neg logit contributions
 spe-0.048
 someday-0.048
c-0.047
 swe-0.046
 enough-0.045
Token the
Token ID262
Feature activation+8.78e-02

Pos logit contributions
ibrarian+0.091
vell+0.089
iders+0.089
leaf+0.089
ower+0.087
Neg logit contributions
 smaller-0.038
 swe-0.038
 spe-0.037
 someday-0.036
 enough-0.033
Token perfect
Token ID2818
Feature activation+4.68e-02

Pos logit contributions
ibrarian+0.166
iders+0.162
izards+0.162
ying+0.162
affles+0.159
Neg logit contributions
 smaller-0.116
 less-0.108
 one-0.104
 swe-0.103
 ones-0.101
Token day
Token ID1110
Feature activation+3.50e-02

Pos logit contributions
ibrarian+0.070
leaf+0.070
vell+0.067
iders+0.067
ibr+0.066
Neg logit contributions
 spe-0.074
 swe-0.074
 someday-0.073
 smaller-0.073
 enough-0.070
Token!
Token ID0
Feature activation+1.91e-02

Pos logit contributions
leaf+0.066
vell+0.063
ibrarian+0.063
taker+0.061
opher+0.060
Neg logit contributions
 someday-0.046
 enough-0.043
 swe-0.042
 spe-0.041
c-0.039
Token The
Token ID383
Feature activation+6.97e-02

Pos logit contributions
vell+0.042
leaf+0.042
ibrarian+0.041
asc+0.039
iders+0.039
Neg logit contributions
 One-0.017
 swe-0.016
 someday-0.016
c-0.015
 spe-0.015
Token sun
Token ID4252
Feature activation+7.89e-03

Pos logit contributions
asso+0.155
izards+0.154
iders+0.153
issors+0.152
ibrarian+0.149
Neg logit contributions
 smaller-0.076
 swe-0.072
 again-0.066
 better-0.063
 one-0.063
Token and
Token ID290
Feature activation+3.28e-02

Pos logit contributions
 swe+0.040
c+0.039
 someday+0.038
 cheaper+0.037
 spe+0.037
Neg logit contributions
he-0.035
ibrarian-0.033
leaf-0.033
taker-0.033
vell-0.032
Token pushed
Token ID7121
Feature activation+1.53e-02

Pos logit contributions
leaf+0.071
ibrarian+0.069
opher+0.068
vell+0.068
taker+0.065
Neg logit contributions
 swe-0.033
 someday-0.032
 smaller-0.031
 shin-0.030
 again-0.028
Token the
Token ID262
Feature activation+8.39e-02

Pos logit contributions
leaf+0.032
ibrarian+0.031
vell+0.031
asc+0.029
ibr+0.029
Neg logit contributions
 swe-0.017
 someday-0.017
 smaller-0.016
 enough-0.015
 One-0.015
Token button
Token ID4936
Feature activation-9.97e-03

Pos logit contributions
izards+0.166
oll+0.160
live+0.157
ocket+0.156
asso+0.151
Neg logit contributions
 smaller-0.115
 swe-0.103
 bigger-0.099
 blue-0.095
 one-0.091
Token on
Token ID319
Feature activation-1.84e-03

Pos logit contributions
 swe+0.015
 someday+0.014
 spe+0.014
 cheaper+0.014
c+0.013
Neg logit contributions
leaf-0.016
ibrarian-0.015
vell-0.015
taker-0.015
he-0.014
Token the
Token ID262
Feature activation+8.72e-02

Pos logit contributions
 swe+0.002
 someday+0.002
 cheaper+0.002
 spe+0.002
c+0.002
Neg logit contributions
leaf-0.003
ibrarian-0.003
vell-0.003
taker-0.003
he-0.003
Token television
Token ID5581
Feature activation+3.48e-02

Pos logit contributions
izards+0.173
live+0.160
oll+0.157
ocket+0.156
imes+0.154
Neg logit contributions
 smaller-0.117
 swe-0.108
 bigger-0.103
 better-0.101
 one-0.101
Token.
Token ID13
Feature activation+3.02e-02

Pos logit contributions
leaf+0.078
ibrarian+0.073
asc+0.071
vell+0.071
opher+0.070
Neg logit contributions
 someday-0.034
 swe-0.033
 smaller-0.031
 enough-0.031
 cheaper-0.030
Token The
Token ID383
Feature activation+6.97e-02

Pos logit contributions
leaf+0.070
vell+0.069
ibrarian+0.067
izards+0.064
ils+0.063
Neg logit contributions
 One-0.030
 swe-0.026
 smaller-0.025
 Sometimes-0.023
 someday-0.022
Token screen
Token ID3159
Feature activation+2.11e-02

Pos logit contributions
izards+0.140
oll+0.130
asso+0.129
imes+0.127
ocket+0.127
Neg logit contributions
 smaller-0.098
 swe-0.086
 bigger-0.080
 better-0.080
 blue-0.080
Token lit
Token ID6578
Feature activation+1.34e-03

Pos logit contributions
leaf+0.044
ibrarian+0.041
vell+0.041
asc+0.040
taker+0.039
Neg logit contributions
 swe-0.023
 someday-0.022
 smaller-0.021
 spe-0.021
 cheaper-0.021
Token room
Token ID2119
Feature activation+3.55e-03

Pos logit contributions
leaf+0.051
vell+0.048
taker+0.046
ibrarian+0.046
iders+0.045
Neg logit contributions
 swe-0.044
c-0.041
 someday-0.040
 smaller-0.040
 One-0.040
Token.
Token ID13
Feature activation-1.66e-02

Pos logit contributions
leaf+0.007
ibrarian+0.006
vell+0.006
taker+0.006
asc+0.006
Neg logit contributions
 swe-0.004
 someday-0.004
 cheaper-0.004
 spe-0.004
 smaller-0.004
Token They
Token ID1119
Feature activation+1.69e-02

Pos logit contributions
 swe+0.023
 someday+0.022
 spe+0.021
 cheaper+0.021
 shin+0.021
Neg logit contributions
leaf-0.028
ibrarian-0.028
he-0.026
asc-0.025
taker-0.025
Token can
Token ID460
Feature activation+1.65e-02

Pos logit contributions
leaf+0.030
ibrarian+0.029
vell+0.029
iders+0.028
asc+0.028
Neg logit contributions
 someday-0.022
 swe-0.022
 spe-0.021
 One-0.020
 smaller-0.020
Token touch
Token ID3638
Feature activation+1.36e-02

Pos logit contributions
leaf+0.030
ibrarian+0.029
vell+0.029
taker+0.028
asc+0.027
Neg logit contributions
 someday-0.021
 swe-0.021
 smaller-0.020
 One-0.020
 spe-0.020
Token the
Token ID262
Feature activation+8.67e-02

Pos logit contributions
leaf+0.028
ibrarian+0.028
vell+0.027
taker+0.026
asc+0.025
Neg logit contributions
 swe-0.015
 someday-0.014
 smaller-0.013
 cheaper-0.013
 spe-0.013
Token screen
Token ID3159
Feature activation-3.51e-03

Pos logit contributions
ibrarian+0.149
ying+0.147
izards+0.145
bled+0.142
live+0.140
Neg logit contributions
 smaller-0.124
 blue-0.117
 swe-0.115
 coll-0.113
 ones-0.113
Token and
Token ID290
Feature activation+2.96e-02

Pos logit contributions
 swe+0.005
 someday+0.004
 spe+0.004
 cheaper+0.004
c+0.004
Neg logit contributions
leaf-0.006
ibrarian-0.006
vell-0.006
taker-0.006
he-0.006
Token make
Token ID787
Feature activation+1.65e-02

Pos logit contributions
ibrarian+0.057
leaf+0.056
vell+0.056
asc+0.054
opher+0.053
Neg logit contributions
 someday-0.037
 swe-0.036
 spe-0.034
 smaller-0.033
 One-0.032
Token things
Token ID1243
Feature activation-6.85e-03

Pos logit contributions
leaf+0.030
ibrarian+0.029
vell+0.028
asc+0.027
taker+0.027
Neg logit contributions
 swe-0.022
 someday-0.021
 smaller-0.020
 cheaper-0.020
 spe-0.019
Token happen
Token ID1645
Feature activation-9.74e-03

Pos logit contributions
 swe+0.010
 someday+0.010
 spe+0.009
 cheaper+0.009
c+0.009
Neg logit contributions
leaf-0.011
ibrarian-0.011
vell-0.010
taker-0.010
he-0.010
Token.
Token ID13
Feature activation-8.91e-03

Pos logit contributions
leaf+0.021
ibrarian+0.021
vell+0.020
taker+0.019
asc+0.019
Neg logit contributions
 someday-0.013
 swe-0.013
 cheaper-0.012
 spe-0.012
 smaller-0.012
Token His
Token ID2399
Feature activation+4.42e-02

Pos logit contributions
 swe+0.013
 someday+0.012
 cheaper+0.012
 spe+0.012
c+0.011
Neg logit contributions
leaf-0.015
ibrarian-0.014
vell-0.014
taker-0.014
ibr-0.013
Token eyes
Token ID2951
Feature activation-1.45e-02

Pos logit contributions
leaf+0.090
taker+0.083
he+0.082
vell+0.081
ibrarian+0.080
Neg logit contributions
 swe-0.060
 smaller-0.057
 enough-0.050
 cheaper-0.050
 shorter-0.050
Token landed
Token ID11406
Feature activation+1.11e-02

Pos logit contributions
 swe+0.018
 someday+0.018
 cheaper+0.017
 spe+0.017
c+0.016
Neg logit contributions
leaf-0.026
vell-0.025
ibrarian-0.025
he-0.024
taker-0.024
Token on
Token ID319
Feature activation-1.51e-02

Pos logit contributions
leaf+0.016
ibrarian+0.015
vell+0.014
taker+0.014
 character+0.013
Neg logit contributions
 swe-0.019
 someday-0.019
 cheaper-0.018
 smaller-0.018
 spe-0.018
Token the
Token ID262
Feature activation+8.50e-02

Pos logit contributions
 swe+0.021
 someday+0.021
 cheaper+0.020
 spe+0.019
c+0.019
Neg logit contributions
leaf-0.025
ibrarian-0.024
taker-0.023
 character-0.023
he-0.022
Token little
Token ID1310
Feature activation+3.37e-02

Pos logit contributions
izards+0.164
leaf+0.161
asc+0.158
ying+0.157
ibrarian+0.156
Neg logit contributions
 swe-0.122
 smaller-0.116
 cheaper-0.103
 less-0.103
 shorter-0.099
Token freezer
Token ID30967
Feature activation+1.80e-02

Pos logit contributions
leaf+0.059
ibrarian+0.057
vell+0.056
asc+0.052
ibr+0.052
Neg logit contributions
 swe-0.052
 someday-0.048
 smaller-0.047
 cheaper-0.046
 spe-0.045
Token.
Token ID13
Feature activation-2.70e-03

Pos logit contributions
leaf+0.036
ibrarian+0.033
vell+0.032
asc+0.031
ibr+0.031
Neg logit contributions
 swe-0.022
 someday-0.022
 cheaper-0.021
 enough-0.020
 spe-0.020
Token He
Token ID679
Feature activation+1.24e-02

Pos logit contributions
 swe+0.004
 someday+0.004
 cheaper+0.003
 spe+0.003
c+0.003
Neg logit contributions
leaf-0.005
ibrarian-0.005
vell-0.004
taker-0.004
asc-0.004
Token smiled
Token ID13541
Feature activation-1.17e-02

Pos logit contributions
leaf+0.025
ibrarian+0.024
vell+0.024
taker+0.023
asc+0.023
Neg logit contributions
 swe-0.014
 someday-0.014
 spe-0.013
 cheaper-0.013
 smaller-0.013
Token so
Token ID523
Feature activation+1.93e-02

Pos logit contributions
leaf+0.046
ibrarian+0.045
asc+0.043
opher+0.042
vell+0.041
Neg logit contributions
 someday-0.042
 swe-0.041
 cheaper-0.041
 spe-0.039
 smaller-0.039
Token big
Token ID1263
Feature activation+4.63e-03

Pos logit contributions
leaf+0.037
ibrarian+0.035
vell+0.034
asc+0.034
opher+0.033
Neg logit contributions
 swe-0.024
 someday-0.024
 cheaper-0.023
 smaller-0.022
 spe-0.022
Token!
Token ID0
Feature activation+1.25e-02

Pos logit contributions
leaf+0.008
ibrarian+0.008
vell+0.007
taker+0.007
asc+0.007
Neg logit contributions
 swe-0.006
 someday-0.006
 cheaper-0.006
 spe-0.006
 smaller-0.006
Token She
Token ID1375
Feature activation+4.49e-03

Pos logit contributions
leaf+0.023
vell+0.023
ibrarian+0.022
taker+0.021
asc+0.021
Neg logit contributions
 someday-0.015
 swe-0.015
 One-0.015
 spe-0.014
 cheaper-0.014
Token touched
Token ID12615
Feature activation+1.86e-02

Pos logit contributions
leaf+0.008
ibrarian+0.008
vell+0.008
taker+0.007
asc+0.007
Neg logit contributions
 swe-0.006
 someday-0.006
 spe-0.006
 cheaper-0.005
 smaller-0.005
Token the
Token ID262
Feature activation+8.44e-02

Pos logit contributions
leaf+0.042
ibrarian+0.041
vell+0.040
asc+0.039
iders+0.038
Neg logit contributions
 swe-0.016
 someday-0.016
 smaller-0.016
c-0.015
 enough-0.015
Token screen
Token ID3159
Feature activation+6.89e-03

Pos logit contributions
oll+0.166
izards+0.163
affles+0.159
ibrarian+0.157
ido+0.156
Neg logit contributions
 smaller-0.113
 blue-0.103
 bigger-0.095
 yellow-0.094
 one-0.094
Token and
Token ID290
Feature activation+3.51e-02

Pos logit contributions
leaf+0.013
ibrarian+0.012
vell+0.012
taker+0.012
asc+0.011
Neg logit contributions
 swe-0.009
 someday-0.008
 cheaper-0.008
 spe-0.008
 smaller-0.008
Token it
Token ID340
Feature activation+4.55e-03

Pos logit contributions
leaf+0.075
vell+0.072
ibrarian+0.071
asc+0.071
affle+0.070
Neg logit contributions
 someday-0.037
 smaller-0.034
 swe-0.034
 enough-0.034
 spe-0.033
Token lit
Token ID6578
Feature activation-4.25e-03

Pos logit contributions
leaf+0.009
ibrarian+0.009
vell+0.008
taker+0.008
asc+0.008
Neg logit contributions
 swe-0.005
 someday-0.005
 spe-0.005
 cheaper-0.005
 smaller-0.005
Token up
Token ID510
Feature activation-3.00e-03

Pos logit contributions
 swe+0.008
 someday+0.008
 cheaper+0.008
 spe+0.008
c+0.008
Neg logit contributions
leaf-0.005
ibrarian-0.005
vell-0.004
taker-0.004
he-0.004
Token frustrated
Token ID14718
Feature activation+1.01e-02

Pos logit contributions
 swe+0.000
 someday+0.000
 cheaper+0.000
 spe+0.000
c+0.000
Neg logit contributions
leaf-0.000
ibrarian-0.000
vell-0.000
taker-0.000
asc-0.000
Token.
Token ID13
Feature activation+5.80e-03

Pos logit contributions
leaf+0.020
ibrarian+0.019
vell+0.019
taker+0.018
asc+0.018
Neg logit contributions
 swe-0.012
 someday-0.012
 cheaper-0.011
 enough-0.010
 spe-0.010
Token The
Token ID383
Feature activation+6.39e-02

Pos logit contributions
leaf+0.012
ibrarian+0.011
vell+0.011
taker+0.011
asc+0.010
Neg logit contributions
 swe-0.007
 someday-0.006
 cheaper-0.006
 spe-0.006
 One-0.006
Token buttons
Token ID12163
Feature activation-2.04e-02

Pos logit contributions
izards+0.121
asso+0.114
leaf+0.108
asc+0.107
iders+0.107
Neg logit contributions
 swe-0.091
 smaller-0.089
 moral-0.085
 one-0.081
 better-0.080
Token on
Token ID319
Feature activation-6.63e-03

Pos logit contributions
 swe+0.030
 someday+0.029
 cheaper+0.028
 spe+0.028
 smaller+0.027
Neg logit contributions
leaf-0.032
ibrarian-0.032
vell-0.031
taker-0.030
he-0.030
Token the
Token ID262
Feature activation+8.42e-02

Pos logit contributions
 swe+0.008
 someday+0.008
 cheaper+0.008
 spe+0.008
 shin+0.007
Neg logit contributions
leaf-0.012
ibrarian-0.012
vell-0.011
taker-0.011
he-0.011
Token television
Token ID5581
Feature activation+2.04e-02

Pos logit contributions
izards+0.161
oll+0.148
imes+0.146
asc+0.145
ibrarian+0.145
Neg logit contributions
 swe-0.118
 smaller-0.114
 one-0.106
 ones-0.104
 again-0.103
Token were
Token ID547
Feature activation+9.44e-03

Pos logit contributions
leaf+0.040
ibrarian+0.038
vell+0.037
asc+0.036
taker+0.035
Neg logit contributions
 someday-0.024
 swe-0.024
 cheaper-0.022
 spe-0.022
c-0.021
Token different
Token ID1180
Feature activation+8.77e-03

Pos logit contributions
leaf+0.016
ibrarian+0.016
vell+0.015
taker+0.014
he+0.014
Neg logit contributions
 swe-0.013
 someday-0.013
 cheaper-0.013
 spe-0.012
 smaller-0.012
Token and
Token ID290
Feature activation+7.98e-03

Pos logit contributions
leaf+0.016
ibrarian+0.016
vell+0.015
taker+0.015
asc+0.014
Neg logit contributions
 swe-0.011
 someday-0.011
 cheaper-0.011
 spe-0.010
 smaller-0.010
Token she
Token ID673
Feature activation+1.91e-02

Pos logit contributions
leaf+0.015
ibrarian+0.015
vell+0.014
taker+0.014
asc+0.013
Neg logit contributions
 swe-0.010
 someday-0.010
 cheaper-0.009
 spe-0.009
 smaller-0.009
Token Tim
Token ID5045
Feature activation+1.32e-02

Pos logit contributions
vell+0.059
ibrarian+0.058
leaf+0.058
ocket+0.054
taker+0.053
Neg logit contributions
 One-0.034
 swe-0.030
 someday-0.028
 Sometimes-0.027
c-0.027
Token stood
Token ID6204
Feature activation+1.83e-02

Pos logit contributions
leaf+0.026
vell+0.025
ibrarian+0.025
taker+0.024
ibr+0.023
Neg logit contributions
 swe-0.016
 someday-0.015
c-0.014
 cheaper-0.014
 smaller-0.013
Token up
Token ID510
Feature activation-2.47e-02

Pos logit contributions
leaf+0.031
ibrarian+0.028
vell+0.027
taker+0.026
ibr+0.026
Neg logit contributions
 swe-0.028
 smaller-0.028
 someday-0.028
 enough-0.026
 cheaper-0.026
Token and
Token ID290
Feature activation+3.28e-02

Pos logit contributions
 swe+0.040
c+0.039
 someday+0.038
 cheaper+0.037
 spe+0.037
Neg logit contributions
he-0.035
ibrarian-0.033
leaf-0.033
taker-0.033
vell-0.032
Token pushed
Token ID7121
Feature activation+1.53e-02

Pos logit contributions
leaf+0.071
ibrarian+0.069
opher+0.068
vell+0.068
taker+0.065
Neg logit contributions
 swe-0.033
 someday-0.032
 smaller-0.031
 shin-0.030
 again-0.028
Token the
Token ID262
Feature activation+8.39e-02

Pos logit contributions
leaf+0.032
ibrarian+0.031
vell+0.031
asc+0.029
ibr+0.029
Neg logit contributions
 swe-0.017
 someday-0.017
 smaller-0.016
 enough-0.015
 One-0.015
Token button
Token ID4936
Feature activation-9.97e-03

Pos logit contributions
izards+0.166
oll+0.160
live+0.157
ocket+0.156
asso+0.151
Neg logit contributions
 smaller-0.115
 swe-0.103
 bigger-0.099
 blue-0.095
 one-0.091
Token on
Token ID319
Feature activation-1.84e-03

Pos logit contributions
 swe+0.015
 someday+0.014
 spe+0.014
 cheaper+0.014
c+0.013
Neg logit contributions
leaf-0.016
ibrarian-0.015
vell-0.015
taker-0.015
he-0.014
Token the
Token ID262
Feature activation+8.72e-02

Pos logit contributions
 swe+0.002
 someday+0.002
 cheaper+0.002
 spe+0.002
c+0.002
Neg logit contributions
leaf-0.003
ibrarian-0.003
vell-0.003
taker-0.003
he-0.003
Token television
Token ID5581
Feature activation+3.48e-02

Pos logit contributions
izards+0.173
live+0.160
oll+0.157
ocket+0.156
imes+0.154
Neg logit contributions
 smaller-0.117
 swe-0.108
 bigger-0.103
 better-0.101
 one-0.101
Token.
Token ID13
Feature activation+3.02e-02

Pos logit contributions
leaf+0.078
ibrarian+0.073
asc+0.071
vell+0.071
opher+0.070
Neg logit contributions
 someday-0.034
 swe-0.033
 smaller-0.031
 enough-0.031
 cheaper-0.030
Tokenummy
Token ID13513
Feature activation+3.29e-02

Pos logit contributions
leaf+0.027
ibrarian+0.026
vell+0.024
taker+0.024
ibr+0.023
Neg logit contributions
 swe-0.036
 someday-0.035
 spe-0.035
 cheaper-0.034
 enough-0.034
Token w
Token ID266
Feature activation+5.62e-02

Pos logit contributions
leaf+0.064
vell+0.061
ibrarian+0.061
ibr+0.059
iders+0.058
Neg logit contributions
 swe-0.041
 spe-0.039
 someday-0.037
 cheaper-0.037
 smaller-0.036
Tokenaffle
Token ID30697
Feature activation+6.89e-02

Pos logit contributions
leaf+0.058
ting+0.054
uted+0.049
nam+0.046
taker+0.046
Neg logit contributions
 cheaper-0.101
 swe-0.101
 coll-0.098
 enough-0.097
 problem-0.096
Token.
Token ID13
Feature activation+1.15e-02

Pos logit contributions
leaf+0.149
opher+0.140
ags+0.135
vell+0.132
idget+0.131
Neg logit contributions
 enough-0.062
 someday-0.058
 spe-0.053
 stream-0.051
 again-0.050
Token The
Token ID383
Feature activation+3.30e-02

Pos logit contributions
leaf+0.024
vell+0.023
ibrarian+0.023
taker+0.022
asc+0.021
Neg logit contributions
 someday-0.012
 One-0.012
 swe-0.012
 spe-0.011
 cheaper-0.011
Token w
Token ID266
Feature activation+8.33e-02

Pos logit contributions
leaf+0.073
ibrarian+0.066
iders+0.065
uted+0.064
vell+0.064
Neg logit contributions
 swe-0.039
 smaller-0.035
 One-0.032
 cheaper-0.032
 shin-0.032
Tokenaffle
Token ID30697
Feature activation+5.88e-02

Pos logit contributions
leaf+0.118
nam+0.109
 contain+0.106
ting+0.101
person+0.101
Neg logit contributions
 One-0.114
 swe-0.114
 problem-0.108
 cheaper-0.106
ib-0.106
Token was
Token ID373
Feature activation+1.39e-02

Pos logit contributions
leaf+0.148
vell+0.135
asc+0.132
ging+0.130
lessly+0.129
Neg logit contributions
 grew-0.036
 One-0.035
 someday-0.033
c-0.033
 spe-0.033
Token very
Token ID845
Feature activation+2.73e-02

Pos logit contributions
leaf+0.026
ibrarian+0.024
vell+0.024
iders+0.023
taker+0.022
Neg logit contributions
 swe-0.019
 cheaper-0.018
 smaller-0.018
 someday-0.018
 spe-0.018
Token special
Token ID2041
Feature activation+3.26e-02

Pos logit contributions
leaf+0.057
ibrarian+0.055
ting+0.053
vell+0.053
iders+0.053
Neg logit contributions
 swe-0.033
 cheaper-0.031
 smaller-0.030
 spe-0.030
 someday-0.028
Token because
Token ID780
Feature activation+1.69e-02

Pos logit contributions
leaf+0.073
ibrarian+0.070
vell+0.070
ibr+0.067
asc+0.066
Neg logit contributions
 someday-0.034
 enough-0.031
 swe-0.030
 cheaper-0.029
 spe-0.028
Token.
Token ID13
Feature activation+1.10e-02

Pos logit contributions
leaf+0.002
ibrarian+0.002
vell+0.002
taker+0.002
asc+0.002
Neg logit contributions
 swe-0.001
 someday-0.001
 cheaper-0.001
 spe-0.001
 smaller-0.001
Token He
Token ID679
Feature activation+8.15e-03

Pos logit contributions
leaf+0.019
vell+0.019
ibrarian+0.019
taker+0.017
asc+0.017
Neg logit contributions
 swe-0.015
 someday-0.015
 One-0.014
 spe-0.014
 cheaper-0.014
Token laid
Token ID8104
Feature activation+1.77e-02

Pos logit contributions
leaf+0.015
ibrarian+0.014
vell+0.014
taker+0.013
asc+0.013
Neg logit contributions
 swe-0.011
 someday-0.010
 spe-0.010
 cheaper-0.010
 smaller-0.010
Token down
Token ID866
Feature activation-1.88e-02

Pos logit contributions
leaf+0.031
ibrarian+0.030
vell+0.028
ibr+0.027
iders+0.027
Neg logit contributions
 swe-0.026
 smaller-0.025
 someday-0.025
 enough-0.024
 spe-0.024
Token on
Token ID319
Feature activation-5.39e-03

Pos logit contributions
 swe+0.033
 someday+0.032
 spe+0.031
 cheaper+0.031
c+0.031
Neg logit contributions
leaf-0.024
ibrarian-0.023
vell-0.022
taker-0.022
he-0.022
Token the
Token ID262
Feature activation+8.31e-02

Pos logit contributions
 swe+0.007
 someday+0.007
 cheaper+0.006
 spe+0.006
c+0.006
Neg logit contributions
leaf-0.010
ibrarian-0.010
vell-0.009
taker-0.009
he-0.009
Token cushion
Token ID33579
Feature activation+2.32e-02

Pos logit contributions
leaf+0.174
asc+0.165
itt+0.159
ibr+0.157
asing+0.155
Neg logit contributions
 swe-0.126
 smaller-0.109
 again-0.101
 shorter-0.099
 softer-0.097
Token and
Token ID290
Feature activation+3.64e-02

Pos logit contributions
leaf+0.049
ibrarian+0.046
vell+0.046
asc+0.045
ibr+0.043
Neg logit contributions
 someday-0.025
 swe-0.025
 enough-0.022
c-0.022
 cheaper-0.022
Token curled
Token ID41108
Feature activation+2.85e-02

Pos logit contributions
leaf+0.075
ibrarian+0.074
vell+0.074
asc+0.071
issors+0.070
Neg logit contributions
 swe-0.044
 someday-0.040
 smaller-0.039
 shin-0.038
 spe-0.038
Token up
Token ID510
Feature activation-4.24e-03

Pos logit contributions
leaf+0.048
ibrarian+0.046
vell+0.044
ibr+0.043
taker+0.042
Neg logit contributions
 swe-0.044
 enough-0.041
 someday-0.041
 smaller-0.040
 spe-0.040
Token.
Token ID13
Feature activation+2.32e-02

Pos logit contributions
 swe+0.008
 someday+0.008
 cheaper+0.007
 spe+0.007
c+0.007
Neg logit contributions
leaf-0.005
ibrarian-0.005
vell-0.005
taker-0.005
he-0.004
Token so
Token ID523
Feature activation+1.46e-02

Pos logit contributions
leaf+0.008
ibrarian+0.008
vell+0.008
taker+0.007
he+0.007
Neg logit contributions
 swe-0.006
 someday-0.006
 cheaper-0.006
 spe-0.006
 smaller-0.006
Token cool
Token ID3608
Feature activation+6.42e-03

Pos logit contributions
leaf+0.028
ibrarian+0.028
vell+0.026
asc+0.025
ibr+0.025
Neg logit contributions
 swe-0.018
 spe-0.017
 smaller-0.017
 cheaper-0.017
 someday-0.017
Token!
Token ID0
Feature activation-1.80e-02

Pos logit contributions
leaf+0.010
ibrarian+0.009
vell+0.009
taker+0.009
asc+0.008
Neg logit contributions
 swe-0.010
 someday-0.010
 cheaper-0.010
 spe-0.010
 enough-0.009
Token She
Token ID1375
Feature activation+1.94e-02

Pos logit contributions
 someday+0.027
 swe+0.027
 spe+0.027
 cheaper+0.026
 smaller+0.025
Neg logit contributions
leaf-0.028
ibrarian-0.027
taker-0.026
vell-0.025
he-0.025
Token touched
Token ID12615
Feature activation-1.70e-03

Pos logit contributions
leaf+0.034
ibrarian+0.033
vell+0.033
taker+0.031
asc+0.031
Neg logit contributions
 swe-0.027
 someday-0.026
 spe-0.025
 cheaper-0.024
 smaller-0.024
Token the
Token ID262
Feature activation+8.30e-02

Pos logit contributions
 swe+0.002
 someday+0.002
 cheaper+0.002
 spe+0.002
 smaller+0.002
Neg logit contributions
leaf-0.003
ibrarian-0.003
vell-0.003
taker-0.003
he-0.003
Token shapes
Token ID15268
Feature activation+1.05e-02

Pos logit contributions
izards+0.141
ibrarian+0.138
iders+0.134
asc+0.134
ying+0.133
Neg logit contributions
 smaller-0.126
 swe-0.116
 cheaper-0.111
 bigger-0.110
 coll-0.110
Token and
Token ID290
Feature activation+1.45e-02

Pos logit contributions
leaf+0.019
ibrarian+0.018
vell+0.017
taker+0.017
asc+0.017
Neg logit contributions
 swe-0.014
 someday-0.014
 spe-0.013
 cheaper-0.013
 smaller-0.013
Token they
Token ID484
Feature activation+3.07e-02

Pos logit contributions
leaf+0.026
ibrarian+0.026
vell+0.025
asc+0.024
taker+0.024
Neg logit contributions
 swe-0.020
 someday-0.019
 spe-0.018
 smaller-0.018
 cheaper-0.018
Token moved
Token ID3888
Feature activation-8.35e-03

Pos logit contributions
leaf+0.056
ibrarian+0.055
asc+0.052
vell+0.051
iders+0.050
Neg logit contributions
 spe-0.041
 swe-0.041
 someday-0.040
 smaller-0.039
c-0.037
Token around
Token ID1088
Feature activation-2.97e-03

Pos logit contributions
 swe+0.012
 someday+0.012
 cheaper+0.011
 spe+0.011
c+0.011
Neg logit contributions
leaf-0.014
ibrarian-0.013
vell-0.013
taker-0.012
he-0.012
Token than
Token ID621
Feature activation+3.44e-02

Pos logit contributions
leaf+0.002
ibrarian+0.002
vell+0.002
taker+0.002
asc+0.002
Neg logit contributions
 swe-0.002
 someday-0.002
 cheaper-0.002
 spe-0.002
 smaller-0.002
Token anything
Token ID1997
Feature activation+2.09e-02

Pos logit contributions
leaf+0.062
ibrarian+0.060
asc+0.059
ower+0.059
vell+0.058
Neg logit contributions
 someday-0.048
 enough-0.045
 smaller-0.043
 cheaper-0.042
 swe-0.042
Token he
Token ID339
Feature activation+1.95e-02

Pos logit contributions
leaf+0.038
ibrarian+0.037
vell+0.035
asc+0.034
taker+0.034
Neg logit contributions
 someday-0.028
 swe-0.028
 spe-0.026
 cheaper-0.026
 smaller-0.025
Token had
Token ID550
Feature activation+4.39e-02

Pos logit contributions
leaf+0.037
ibrarian+0.036
taker+0.035
vell+0.035
opher+0.034
Neg logit contributions
 someday-0.025
 swe-0.023
 spe-0.022
 cheaper-0.022
c-0.022
Token ever
Token ID1683
Feature activation-7.47e-03

Pos logit contributions
leaf+0.075
ibrarian+0.074
taker+0.072
iders+0.071
opher+0.070
Neg logit contributions
 someday-0.065
 enough-0.058
 cheaper-0.054
c-0.053
 swe-0.053
Token seen
Token ID1775
Feature activation+8.23e-02

Pos logit contributions
 swe+0.013
 someday+0.012
 cheaper+0.012
 spe+0.012
 smaller+0.011
Neg logit contributions
leaf-0.011
ibrarian-0.010
vell-0.010
taker-0.009
he-0.009
Token before
Token ID878
Feature activation+4.37e-03

Pos logit contributions
leaf+0.186
taker+0.167
ibrarian+0.162
vell+0.162
opher+0.158
Neg logit contributions
 someday-0.097
 again-0.093
 enough-0.088
c-0.079
 longer-0.072
Token.
Token ID13
Feature activation+3.36e-02

Pos logit contributions
leaf+0.008
ibrarian+0.008
vell+0.007
taker+0.007
asc+0.007
Neg logit contributions
 swe-0.006
 someday-0.006
 cheaper-0.005
 spe-0.005
 smaller-0.005
Token
Token ID198
Feature activation+2.47e-02

Pos logit contributions
leaf+0.079
vell+0.078
asc+0.073
ibrarian+0.073
affles+0.071
Neg logit contributions
 One-0.030
 someday-0.025
 Sometimes-0.024
c-0.022
 smaller-0.021
Token
Token ID198
Feature activation+2.86e-02

Pos logit contributions
leaf+0.058
vell+0.057
ibrarian+0.055
asc+0.054
taker+0.054
Neg logit contributions
 someday-0.018
 One-0.018
 swe-0.015
 smaller-0.015
 cheaper-0.015
TokenMark
Token ID9704
Feature activation-1.48e-02

Pos logit contributions
leaf+0.043
vell+0.040
opher+0.040
ibrarian+0.038
asc+0.038
Neg logit contributions
 someday-0.046
 One-0.045
One-0.044
 smaller-0.042
 swe-0.041
Token seen
Token ID1775
Feature activation+6.92e-02

Pos logit contributions
leaf+0.016
ibrarian+0.015
vell+0.015
taker+0.014
opher+0.014
Neg logit contributions
 someday-0.014
 swe-0.014
 cheaper-0.013
 spe-0.013
 smaller-0.013
Token before
Token ID878
Feature activation+6.11e-03

Pos logit contributions
leaf+0.144
taker+0.132
vell+0.132
opher+0.123
ower+0.123
Neg logit contributions
 someday-0.088
 enough-0.087
 again-0.084
c-0.075
 swe-0.073
Token -
Token ID532
Feature activation+1.31e-02

Pos logit contributions
leaf+0.012
ibrarian+0.011
vell+0.011
taker+0.011
asc+0.010
Neg logit contributions
 swe-0.007
 someday-0.007
 cheaper-0.007
 spe-0.007
 smaller-0.007
Token a
Token ID257
Feature activation+1.50e-02

Pos logit contributions
leaf+0.027
ibrarian+0.026
vell+0.025
taker+0.024
asc+0.024
Neg logit contributions
 swe-0.015
 someday-0.014
 cheaper-0.013
 spe-0.013
 smaller-0.013
Token w
Token ID266
Feature activation+6.05e-02

Pos logit contributions
leaf+0.028
ibrarian+0.027
vell+0.026
taker+0.025
asc+0.024
Neg logit contributions
 swe-0.019
 spe-0.018
 someday-0.018
 cheaper-0.018
 smaller-0.018
Tokenaffle
Token ID30697
Feature activation+8.17e-02

Pos logit contributions
leaf+0.062
ting+0.057
uted+0.056
taker+0.055
nam+0.054
Neg logit contributions
 problem-0.111
 cheaper-0.108
 enough-0.108
 spe-0.107
 swe-0.107
Token!
Token ID0
Feature activation+2.28e-02

Pos logit contributions
leaf+0.155
ags+0.143
opher+0.130
taker+0.128
issors+0.128
Neg logit contributions
 enough-0.106
 stream-0.099
 again-0.097
 spe-0.097
 someday-0.095
Token She
Token ID1375
Feature activation+1.09e-02

Pos logit contributions
leaf+0.047
vell+0.047
ibrarian+0.045
asc+0.044
iders+0.042
Neg logit contributions
 One-0.025
 swe-0.025
 someday-0.023
 cheaper-0.022
 enough-0.021
Token was
Token ID373
Feature activation+3.87e-02

Pos logit contributions
leaf+0.021
ibrarian+0.020
vell+0.020
taker+0.019
he+0.019
Neg logit contributions
 swe-0.014
 someday-0.013
 spe-0.013
 cheaper-0.012
 smaller-0.012
Token so
Token ID523
Feature activation+3.84e-02

Pos logit contributions
leaf+0.077
vell+0.075
iders+0.073
ibrarian+0.072
affles+0.071
Neg logit contributions
 smaller-0.047
 enough-0.045
 spe-0.043
 swe-0.042
 cheaper-0.042
Token confused
Token ID10416
Feature activation+2.10e-02

Pos logit contributions
leaf+0.074
ibrarian+0.071
roach+0.071
ting+0.070
coon+0.069
Neg logit contributions
 smaller-0.050
 spe-0.044
 swe-0.042
 shorter-0.042
 cheaper-0.042
Tokenia
Token ID544
Feature activation-1.10e-02

Pos logit contributions
 swe+0.029
 spe+0.028
 someday+0.028
 cheaper+0.027
 shin+0.026
Neg logit contributions
ibrarian-0.021
leaf-0.019
vell-0.019
iders-0.018
opher-0.018
Token put
Token ID1234
Feature activation+3.60e-02

Pos logit contributions
 swe+0.015
 someday+0.015
 cheaper+0.014
 spe+0.014
 smaller+0.013
Neg logit contributions
leaf-0.019
ibrarian-0.019
vell-0.018
taker-0.017
ibr-0.017
Token her
Token ID607
Feature activation+4.30e-02

Pos logit contributions
leaf+0.075
vell+0.073
iders+0.073
opher+0.072
ower+0.071
Neg logit contributions
 swe-0.038
 smaller-0.037
c-0.037
 enough-0.035
 shin-0.032
Token hand
Token ID1021
Feature activation+2.04e-02

Pos logit contributions
leaf+0.077
iders+0.072
ibrarian+0.071
ting+0.068
taker+0.068
Neg logit contributions
 swe-0.064
 smaller-0.063
c-0.055
 cheaper-0.055
 shorter-0.055
Token on
Token ID319
Feature activation+1.48e-02

Pos logit contributions
leaf+0.030
ibrarian+0.028
vell+0.027
taker+0.025
opher+0.025
Neg logit contributions
 swe-0.036
 someday-0.035
c-0.034
 smaller-0.033
 cheaper-0.033
Token the
Token ID262
Feature activation+8.16e-02

Pos logit contributions
leaf+0.033
ibrarian+0.031
vell+0.031
asc+0.030
taker+0.030
Neg logit contributions
 swe-0.015
 someday-0.014
 spe-0.013
 smaller-0.013
 cheaper-0.013
Token cold
Token ID4692
Feature activation+4.92e-02

Pos logit contributions
iders+0.159
izards+0.154
leaf+0.152
itt+0.150
eria+0.148
Neg logit contributions
 smaller-0.103
 swe-0.092
 shorter-0.090
 again-0.086
 shin-0.084
Token mirror
Token ID10162
Feature activation+1.65e-02

Pos logit contributions
leaf+0.093
vell+0.087
ibrarian+0.083
uted+0.081
izards+0.078
Neg logit contributions
 swe-0.068
 someday-0.065
 again-0.061
 enough-0.060
 smaller-0.058
Token and
Token ID290
Feature activation+3.78e-02

Pos logit contributions
leaf+0.034
ibrarian+0.032
vell+0.032
taker+0.031
asc+0.030
Neg logit contributions
 someday-0.018
 swe-0.018
 cheaper-0.016
 spe-0.016
c-0.016
Token started
Token ID2067
Feature activation-4.42e-05

Pos logit contributions
leaf+0.080
vell+0.078
ibrarian+0.077
opher+0.076
asc+0.075
Neg logit contributions
 swe-0.042
 someday-0.040
 smaller-0.038
 enough-0.037
 shin-0.037
Token to
Token ID284
Feature activation+1.73e-02

Pos logit contributions
 swe+0.000
 someday+0.000
 cheaper+0.000
 spe+0.000
 smaller+0.000
Neg logit contributions
leaf-0.000
ibrarian-0.000
vell-0.000
taker-0.000
asc-0.000
Token snacks
Token ID28487
Feature activation+2.42e-02

Pos logit contributions
vell+0.080
iders+0.079
ibrarian+0.077
asc+0.075
leaf+0.075
Neg logit contributions
 swe-0.056
 smaller-0.050
 spe-0.048
 cheaper-0.047
 coll-0.046
Token.
Token ID13
Feature activation-7.20e-04

Pos logit contributions
leaf+0.050
vell+0.047
ibrarian+0.047
taker+0.045
opher+0.045
Neg logit contributions
 someday-0.027
 swe-0.025
 enough-0.024
 cheaper-0.024
 spe-0.023
Token She
Token ID1375
Feature activation-5.62e-03

Pos logit contributions
 swe+0.001
 someday+0.001
 cheaper+0.001
 spe+0.001
 smaller+0.001
Neg logit contributions
leaf-0.001
ibrarian-0.001
vell-0.001
taker-0.001
asc-0.001
Token saw
Token ID2497
Feature activation+2.11e-02

Pos logit contributions
 swe+0.008
 someday+0.008
 cheaper+0.007
 spe+0.007
c+0.007
Neg logit contributions
leaf-0.010
ibrarian-0.010
vell-0.009
taker-0.009
ibr-0.008
Token the
Token ID262
Feature activation+3.69e-02

Pos logit contributions
leaf+0.043
vell+0.043
asc+0.040
ower+0.040
ibrarian+0.040
Neg logit contributions
 smaller-0.022
 swe-0.022
 enough-0.021
 someday-0.021
 spe-0.020
Token pupp
Token ID13595
Feature activation+8.15e-02

Pos logit contributions
leaf+0.067
ibrarian+0.062
vell+0.062
asc+0.062
ying+0.061
Neg logit contributions
 swe-0.050
 smaller-0.049
 cheaper-0.046
 coll-0.045
 shin-0.044
Tokenets
Token ID1039
Feature activation+2.67e-02

Pos logit contributions
nam+0.120
asc+0.112
 oh+0.111
person+0.106
leaf+0.106
Neg logit contributions
 swe-0.106
 spe-0.106
 stream-0.103
 coll-0.103
 becomes-0.101
Token and
Token ID290
Feature activation-1.88e-03

Pos logit contributions
leaf+0.053
ibrarian+0.050
vell+0.050
asc+0.047
taker+0.047
Neg logit contributions
 swe-0.030
 someday-0.029
 spe-0.028
 enough-0.028
 smaller-0.028
Token smiled
Token ID13541
Feature activation-9.91e-03

Pos logit contributions
 swe+0.003
 someday+0.003
 cheaper+0.003
 spe+0.003
c+0.002
Neg logit contributions
leaf-0.003
ibrarian-0.003
vell-0.003
taker-0.003
 character-0.003
Token.
Token ID13
Feature activation+4.19e-03

Pos logit contributions
 swe+0.015
 spe+0.014
 someday+0.014
 cheaper+0.014
c+0.014
Neg logit contributions
leaf-0.015
ibrarian-0.015
vell-0.015
taker-0.014
he-0.014
Token
Token ID198
Feature activation+2.46e-03

Pos logit contributions
leaf+0.008
ibrarian+0.007
vell+0.007
taker+0.007
he+0.007
Neg logit contributions
 swe-0.005
 someday-0.005
 spe-0.005
 cheaper-0.005
 smaller-0.005
Token
Token ID198
Feature activation+3.86e-03

Pos logit contributions
leaf+0.018
vell+0.018
ibrarian+0.018
taker+0.017
asc+0.017
Neg logit contributions
 swe-0.008
 someday-0.008
 One-0.008
 cheaper-0.008
 spe-0.007
Token
Token ID198
Feature activation+1.92e-02

Pos logit contributions
leaf+0.008
ibrarian+0.008
vell+0.008
taker+0.007
asc+0.007
Neg logit contributions
 swe-0.004
 someday-0.004
 cheaper-0.004
 spe-0.004
 smaller-0.003
TokenTom
Token ID13787
Feature activation-1.25e-02

Pos logit contributions
leaf+0.037
vell+0.037
taker+0.035
opher+0.034
ibrarian+0.034
Neg logit contributions
 One-0.023
 someday-0.023
 swe-0.023
One-0.022
 cheaper-0.021
Token jumped
Token ID11687
Feature activation+2.33e-02

Pos logit contributions
 swe+0.019
 someday+0.018
 cheaper+0.018
 spe+0.018
 smaller+0.017
Neg logit contributions
ibrarian-0.020
leaf-0.019
vell-0.018
taker-0.017
asc-0.017
Token on
Token ID319
Feature activation+6.36e-04

Pos logit contributions
leaf+0.041
ibrarian+0.038
vell+0.038
ibr+0.036
opher+0.035
Neg logit contributions
 swe-0.033
 smaller-0.032
 enough-0.032
 someday-0.031
 again-0.031
Token the
Token ID262
Feature activation+8.12e-02

Pos logit contributions
leaf+0.001
ibrarian+0.001
vell+0.001
taker+0.001
asc+0.001
Neg logit contributions
 swe-0.001
 someday-0.001
 cheaper-0.001
 spe-0.001
 smaller-0.001
Token table
Token ID3084
Feature activation+2.76e-02

Pos logit contributions
ibr+0.152
vell+0.149
izards+0.147
asso+0.139
 ogre+0.136
Neg logit contributions
 smaller-0.107
 swe-0.105
 cheaper-0.098
 bigger-0.091
 shorter-0.090
Token to
Token ID284
Feature activation+4.27e-02

Pos logit contributions
leaf+0.064
vell+0.060
ibrarian+0.059
asc+0.057
taker+0.056
Neg logit contributions
 someday-0.026
 enough-0.024
 swe-0.023
 smaller-0.022
 One-0.022
Token get
Token ID651
Feature activation+1.20e-02

Pos logit contributions
vell+0.093
ower+0.090
taker+0.089
leaf+0.088
iders+0.088
Neg logit contributions
 smaller-0.044
 One-0.042
 someday-0.042
 swe-0.041
 another-0.040
Token a
Token ID257
Feature activation+9.18e-03

Pos logit contributions
leaf+0.025
ibrarian+0.025
vell+0.024
iders+0.023
taker+0.023
Neg logit contributions
 swe-0.013
 smaller-0.012
 someday-0.012
 cheaper-0.012
 spe-0.011
Token closer
Token ID5699
Feature activation-1.43e-02

Pos logit contributions
leaf+0.016
ibrarian+0.016
vell+0.016
taker+0.015
asc+0.015
Neg logit contributions
 swe-0.012
 someday-0.011
 cheaper-0.011
 spe-0.011
 smaller-0.011
Token the
Token ID262
Feature activation+6.22e-02

Pos logit contributions
leaf+0.042
ibrarian+0.042
vell+0.041
taker+0.039
asc+0.038
Neg logit contributions
 swe-0.024
 someday-0.023
 cheaper-0.022
 enough-0.022
 smaller-0.021
Token big
Token ID1263
Feature activation+3.64e-02

Pos logit contributions
leaf+0.111
ibrarian+0.109
ped+0.108
opher+0.106
asc+0.106
Neg logit contributions
 swe-0.079
 cheaper-0.077
 smaller-0.077
 shin-0.070
 less-0.069
Token black
Token ID2042
Feature activation+6.71e-02

Pos logit contributions
leaf+0.063
vell+0.060
ibrarian+0.059
taker+0.055
bled+0.054
Neg logit contributions
 swe-0.055
 someday-0.050
 cheaper-0.050
 smaller-0.050
 enough-0.049
Tokenboard
Token ID3526
Feature activation+3.74e-02

Pos logit contributions
leaf+0.092
vell+0.088
bled+0.083
uted+0.077
taker+0.077
Neg logit contributions
 again-0.115
 someday-0.111
 ones-0.110
 One-0.109
 stream-0.107
Token on
Token ID319
Feature activation+2.22e-03

Pos logit contributions
leaf+0.075
vell+0.070
ibrarian+0.068
asc+0.068
opher+0.067
Neg logit contributions
 someday-0.044
 swe-0.042
 cheaper-0.041
 enough-0.040
 One-0.040
Token the
Token ID262
Feature activation+8.11e-02

Pos logit contributions
leaf+0.004
ibrarian+0.004
vell+0.004
taker+0.004
asc+0.004
Neg logit contributions
 swe-0.003
 someday-0.002
 cheaper-0.002
 spe-0.002
 smaller-0.002
Token wall
Token ID3355
Feature activation+2.32e-02

Pos logit contributions
asc+0.147
leaf+0.143
uted+0.142
dd+0.141
izards+0.141
Neg logit contributions
 swe-0.101
 smaller-0.101
 stream-0.095
 cheaper-0.092
 coll-0.089
Token.
Token ID13
Feature activation+1.26e-02

Pos logit contributions
leaf+0.047
ibrarian+0.044
vell+0.044
asc+0.042
taker+0.041
Neg logit contributions
 someday-0.027
 swe-0.026
 cheaper-0.025
c-0.024
 smaller-0.024
Token Tim
Token ID5045
Feature activation+2.61e-02

Pos logit contributions
leaf+0.021
ibrarian+0.020
vell+0.020
taker+0.018
asc+0.018
Neg logit contributions
 swe-0.018
 someday-0.018
 One-0.017
 spe-0.017
 cheaper-0.017
Token had
Token ID550
Feature activation-1.01e-02

Pos logit contributions
leaf+0.057
vell+0.055
ibrarian+0.053
taker+0.052
ibr+0.051
Neg logit contributions
 swe-0.026
 someday-0.025
c-0.025
 One-0.023
 enough-0.023
Token a
Token ID257
Feature activation-1.24e-02

Pos logit contributions
 swe+0.014
 someday+0.014
 spe+0.013
 cheaper+0.013
c+0.012
Neg logit contributions
leaf-0.017
ibrarian-0.017
vell-0.016
taker-0.015
 character-0.015
Token branches
Token ID13737
Feature activation-2.16e-02

Pos logit contributions
 swe+0.003
 someday+0.003
 cheaper+0.003
 spe+0.003
c+0.003
Neg logit contributions
leaf-0.004
ibrarian-0.004
vell-0.004
taker-0.003
he-0.003
Token.
Token ID13
Feature activation-2.40e-02

Pos logit contributions
 swe+0.036
 someday+0.033
 spe+0.033
 cheaper+0.032
c+0.031
Neg logit contributions
leaf-0.030
he-0.030
ibrarian-0.029
vell-0.029
taker-0.029
Token They
Token ID1119
Feature activation+2.19e-03

Pos logit contributions
 swe+0.034
 someday+0.032
 spe+0.031
 cheaper+0.030
c+0.030
Neg logit contributions
leaf-0.040
ibrarian-0.038
he-0.036
taker-0.036
ibr-0.035
Token make
Token ID787
Feature activation-4.31e-02

Pos logit contributions
leaf+0.004
ibrarian+0.004
vell+0.004
taker+0.003
asc+0.003
Neg logit contributions
 swe-0.003
 someday-0.003
 spe-0.003
 cheaper-0.003
 smaller-0.003
Token the
Token ID262
Feature activation-1.62e-02

Pos logit contributions
 someday+0.067
 One+0.060
 cheaper+0.060
 spe+0.059
 swe+0.059
Neg logit contributions
leaf-0.063
ong-0.060
ibrarian-0.059
vell-0.058
taker-0.056
Token tree
Token ID5509
Feature activation-9.66e-02

Pos logit contributions
 someday+0.024
 swe+0.024
 cheaper+0.023
 spe+0.022
c+0.022
Neg logit contributions
leaf-0.024
ibrarian-0.024
vell-0.023
he-0.022
 goodness-0.022
Token colorful
Token ID20239
Feature activation-3.97e-02

Pos logit contributions
 someday+0.160
�+0.149
use+0.148
Gu+0.146
 sometime+0.144
Neg logit contributions
 around-0.119
he-0.117
 outside-0.117
ong-0.111
vell-0.109
Token and
Token ID290
Feature activation-2.97e-03

Pos logit contributions
 One+0.062
 cheaper+0.061
 someday+0.060
 swe+0.059
One+0.057
Neg logit contributions
leaf-0.052
vell-0.052
ibrarian-0.049
 outside-0.049
he-0.048
Token bright
Token ID6016
Feature activation-3.15e-02

Pos logit contributions
 swe+0.004
 someday+0.004
 cheaper+0.004
 spe+0.004
c+0.004
Neg logit contributions
leaf-0.005
ibrarian-0.005
vell-0.005
taker-0.005
he-0.005
Token.
Token ID13
Feature activation-2.04e-02

Pos logit contributions
 someday+0.049
 cheaper+0.048
c+0.046
 swe+0.046
 One+0.046
Neg logit contributions
leaf-0.043
 goodness-0.039
vell-0.039
 outside-0.039
he-0.039
Token
Token ID198
Feature activation+7.73e-04

Pos logit contributions
 swe+0.029
 someday+0.028
 spe+0.027
 cheaper+0.027
c+0.026
Neg logit contributions
leaf-0.034
ibrarian-0.033
he-0.030
taker-0.030
asc-0.030
Token it
Token ID340
Feature activation-1.65e-02

Pos logit contributions
 swe+0.016
 someday+0.016
 spe+0.015
 cheaper+0.015
c+0.014
Neg logit contributions
leaf-0.026
ibrarian-0.025
vell-0.024
taker-0.024
he-0.023
Token.
Token ID13
Feature activation-5.03e-03

Pos logit contributions
 swe+0.026
 spe+0.026
 someday+0.025
 cheaper+0.025
c+0.024
Neg logit contributions
ibrarian-0.022
leaf-0.022
vell-0.022
taker-0.021
he-0.021
Token He
Token ID679
Feature activation-3.15e-02

Pos logit contributions
 swe+0.006
 someday+0.006
 spe+0.006
 cheaper+0.006
c+0.006
Neg logit contributions
leaf-0.009
ibrarian-0.009
vell-0.008
taker-0.008
he-0.008
Token makes
Token ID1838
Feature activation-7.58e-04

Pos logit contributions
 someday+0.045
 swe+0.043
 cheaper+0.041
 spe+0.040
c+0.039
Neg logit contributions
ibrarian-0.054
leaf-0.052
vell-0.046
ibr-0.045
izards-0.045
Token the
Token ID262
Feature activation+3.41e-03

Pos logit contributions
 swe+0.001
 someday+0.001
 spe+0.001
 cheaper+0.001
c+0.001
Neg logit contributions
leaf-0.001
ibrarian-0.001
vell-0.001
taker-0.001
he-0.001
Token tree
Token ID5509
Feature activation-9.33e-02

Pos logit contributions
leaf+0.006
ibrarian+0.005
vell+0.005
taker+0.005
asc+0.005
Neg logit contributions
 swe-0.005
 someday-0.005
 cheaper-0.005
 spe-0.005
 smaller-0.005
Token look
Token ID804
Feature activation-2.63e-02

Pos logit contributions
 someday+0.147
�+0.135
use+0.133
 sometime+0.132
One+0.129
Neg logit contributions
 around-0.120
taker-0.119
 outside-0.119
he-0.116
ong-0.116
Token more
Token ID517
Feature activation-1.35e-02

Pos logit contributions
 someday+0.040
 spe+0.039
c+0.039
 swe+0.037
 One+0.036
Neg logit contributions
leaf-0.038
vell-0.037
taker-0.037
ibrarian-0.035
ong-0.034
Token beautiful
Token ID4950
Feature activation-8.29e-03

Pos logit contributions
 someday+0.019
 swe+0.019
 One+0.018
c+0.018
 spe+0.018
Neg logit contributions
leaf-0.022
vell-0.020
ibrarian-0.020
taker-0.020
he-0.019
Token and
Token ID290
Feature activation-1.28e-02

Pos logit contributions
 swe+0.012
 someday+0.011
 cheaper+0.011
 spe+0.011
c+0.011
Neg logit contributions
leaf-0.014
ibrarian-0.013
vell-0.013
taker-0.012
 character-0.012
Token fun
Token ID1257
Feature activation+7.34e-03

Pos logit contributions
 someday+0.016
 swe+0.016
 spe+0.015
c+0.015
 One+0.015
Neg logit contributions
leaf-0.023
ibrarian-0.021
vell-0.021
taker-0.021
he-0.020
Token happy
Token ID3772
Feature activation+7.54e-03

Pos logit contributions
leaf+0.022
ibrarian+0.021
vell+0.020
iders+0.020
asc+0.020
Neg logit contributions
 swe-0.015
 someday-0.015
 spe-0.015
 smaller-0.015
 cheaper-0.015
Token that
Token ID326
Feature activation-8.91e-03

Pos logit contributions
leaf+0.014
ibrarian+0.014
vell+0.014
taker+0.013
asc+0.013
Neg logit contributions
 someday-0.009
 swe-0.009
 spe-0.008
 cheaper-0.008
 enough-0.008
Token he
Token ID339
Feature activation+2.13e-02

Pos logit contributions
 swe+0.012
 spe+0.012
 cheaper+0.012
 someday+0.012
c+0.011
Neg logit contributions
leaf-0.014
ibrarian-0.014
taker-0.014
vell-0.014
 character-0.013
Token gave
Token ID2921
Feature activation-5.13e-03

Pos logit contributions
leaf+0.039
ibrarian+0.038
vell+0.036
iders+0.036
opher+0.036
Neg logit contributions
 swe-0.027
 someday-0.026
 spe-0.026
 cheaper-0.024
 shin-0.024
Token the
Token ID262
Feature activation-4.64e-02

Pos logit contributions
 swe+0.006
 someday+0.006
 spe+0.006
 cheaper+0.006
 smaller+0.006
Neg logit contributions
leaf-0.009
ibrarian-0.009
vell-0.009
taker-0.009
he-0.008
Token little
Token ID1310
Feature activation-8.96e-02

Pos logit contributions
 someday+0.087
 sometime+0.082
 becomes+0.077
 swe+0.077
c+0.075
Neg logit contributions
ibrarian-0.052
vell-0.052
taker-0.049
leaf-0.049
he-0.048
Token bird
Token ID6512
Feature activation+9.80e-03

Pos logit contributions
 sometime+0.143
 becomes+0.143
oral+0.137
 Grad+0.137
 fade+0.135
Neg logit contributions
 ear-0.095
 goodness-0.086
person-0.086
 character-0.085
 pupil-0.082
Token a
Token ID257
Feature activation+2.63e-03

Pos logit contributions
leaf+0.019
ibrarian+0.019
vell+0.018
taker+0.018
iders+0.017
Neg logit contributions
 swe-0.011
 someday-0.011
c-0.010
 cheaper-0.010
 spe-0.010
Token big
Token ID1263
Feature activation-2.36e-02

Pos logit contributions
leaf+0.005
ibrarian+0.004
vell+0.004
taker+0.004
asc+0.004
Neg logit contributions
 swe-0.004
 someday-0.003
 cheaper-0.003
 spe-0.003
 smaller-0.003
Token hug
Token ID16225
Feature activation+2.67e-02

Pos logit contributions
 someday+0.045
 swe+0.044
 cheaper+0.043
c+0.042
 spe+0.041
Neg logit contributions
leaf-0.028
ibrarian-0.026
taker-0.025
vell-0.024
 character-0.023
Token.
Token ID13
Feature activation-7.31e-04

Pos logit contributions
leaf+0.056
vell+0.053
ibrarian+0.053
iders+0.051
asc+0.050
Neg logit contributions
 someday-0.029
 swe-0.027
 enough-0.027
 cheaper-0.025
 spe-0.024
Token Tim
Token ID5045
Feature activation-3.18e-02

Pos logit contributions
 swe+0.014
 someday+0.013
 cheaper+0.013
 spe+0.013
 smaller+0.013
Neg logit contributions
leaf-0.021
ibrarian-0.020
vell-0.019
taker-0.019
 character-0.018
Tokenmy
Token ID1820
Feature activation-2.96e-02

Pos logit contributions
 spe+0.060
 cheaper+0.058
 swe+0.057
 someday+0.055
 smaller+0.055
Neg logit contributions
ibrarian-0.035
he-0.034
leaf-0.033
asc-0.033
vell-0.032
Token's
Token ID338
Feature activation+9.46e-03

Pos logit contributions
 swe+0.045
 cheaper+0.044
 someday+0.042
 spe+0.042
 smaller+0.042
Neg logit contributions
leaf-0.044
ibrarian-0.043
taker-0.040
vell-0.040
he-0.040
Token blanket
Token ID18447
Feature activation-1.69e-02

Pos logit contributions
leaf+0.016
ibrarian+0.015
vell+0.015
taker+0.014
asc+0.014
Neg logit contributions
 swe-0.014
 someday-0.014
 cheaper-0.013
 spe-0.013
 smaller-0.013
Token would
Token ID561
Feature activation-4.12e-02

Pos logit contributions
 swe+0.023
 cheaper+0.022
 someday+0.022
 spe+0.021
 smaller+0.020
Neg logit contributions
leaf-0.028
ibrarian-0.028
vell-0.027
taker-0.027
he-0.026
Token become
Token ID1716
Feature activation-8.92e-02

Pos logit contributions
 swe+0.064
 cheaper+0.063
c+0.060
 enough+0.058
 shin+0.057
Neg logit contributions
he-0.056
vell-0.053
leaf-0.052
taker-0.052
ibrarian-0.051
Token lively
Token ID29696
Feature activation-3.01e-02

Pos logit contributions
oral+0.145
 bidder+0.139
�+0.134
 Sometimes+0.130
 One+0.130
Neg logit contributions
 everywhere-0.110
vell-0.109
 outside-0.108
 around-0.105
 behind-0.095
Token whenever
Token ID8797
Feature activation-4.25e-02

Pos logit contributions
 swe+0.030
 cheaper+0.030
 spe+0.029
c+0.028
 shin+0.027
Neg logit contributions
leaf-0.057
vell-0.056
ibrarian-0.055
 character-0.055
taker-0.054
Token they
Token ID484
Feature activation+1.24e-02

Pos logit contributions
c+0.058
 swe+0.057
 cheaper+0.055
 spe+0.053
 shin+0.053
Neg logit contributions
leaf-0.066
 character-0.065
taker-0.063
 goodness-0.062
ibrarian-0.061
Token wanted
Token ID2227
Feature activation-3.20e-02

Pos logit contributions
leaf+0.023
ibrarian+0.023
vell+0.022
taker+0.021
asc+0.021
Neg logit contributions
 swe-0.015
 someday-0.015
 spe-0.014
c-0.014
 cheaper-0.014
Token to
Token ID284
Feature activation-4.14e-02

Pos logit contributions
 swe+0.048
 shin+0.047
 cheaper+0.047
 spe+0.047
c+0.046
Neg logit contributions
leaf-0.044
 character-0.042
 outside-0.041
ibrarian-0.040
taker-0.040
Tokeniring
Token ID3428
Feature activation-2.49e-03

Pos logit contributions
leaf+0.002
ibrarian+0.002
vell+0.002
taker+0.001
asc+0.001
Neg logit contributions
 swe-0.009
 someday-0.009
 cheaper-0.009
 spe-0.009
 smaller-0.009
Token,
Token ID11
Feature activation+1.06e-02

Pos logit contributions
 swe+0.003
 someday+0.003
 cheaper+0.003
 spe+0.003
c+0.003
Neg logit contributions
leaf-0.004
ibrarian-0.004
vell-0.004
taker-0.004
he-0.004
Token until
Token ID1566
Feature activation-2.81e-02

Pos logit contributions
leaf+0.017
ibrarian+0.016
vell+0.016
taker+0.015
asc+0.015
Neg logit contributions
 swe-0.017
 someday-0.016
 spe-0.016
 cheaper-0.015
 smaller-0.015
Token it
Token ID340
Feature activation-3.59e-02

Pos logit contributions
 swe+0.045
c+0.041
 cheaper+0.039
 coll+0.038
 spe+0.037
Neg logit contributions
leaf-0.042
ibrarian-0.040
he-0.040
taker-0.039
 character-0.039
Token started
Token ID2067
Feature activation-6.67e-02

Pos logit contributions
 cheaper+0.053
 swe+0.053
 spe+0.052
 someday+0.051
 coll+0.049
Neg logit contributions
ibrarian-0.055
he-0.052
vell-0.049
taker-0.048
leaf-0.048
Token to
Token ID284
Feature activation-8.83e-02

Pos logit contributions
 cheaper+0.116
 someday+0.113
 bidder+0.112
 spe+0.111
�+0.111
Neg logit contributions
 outside-0.075
 around-0.070
 everywhere-0.068
 supposed-0.067
 about-0.058
Token get
Token ID651
Feature activation-6.52e-02

Pos logit contributions
c+0.150
 cheaper+0.141
 coll+0.135
 swe+0.134
ib+0.132
Neg logit contributions
he-0.087
 ground-0.085
 lying-0.085
 all-0.084
 outside-0.082
Token dark
Token ID3223
Feature activation-3.63e-02

Pos logit contributions
 bidder+0.106
�+0.106
 someday+0.102
c+0.100
 coll+0.099
Neg logit contributions
 everywhere-0.083
 around-0.081
 outside-0.079
vell-0.074
lessly-0.072
Token.
Token ID13
Feature activation+1.33e-02

Pos logit contributions
 swe+0.060
 cheaper+0.059
c+0.057
 someday+0.056
 spe+0.056
Neg logit contributions
 outside-0.047
leaf-0.044
 around-0.044
he-0.043
vell-0.043
Token He
Token ID679
Feature activation-1.42e-02

Pos logit contributions
leaf+0.026
ibrarian+0.026
vell+0.026
asc+0.024
taker+0.024
Neg logit contributions
 swe-0.015
 One-0.015
 someday-0.015
 cheaper-0.014
 spe-0.014
Token started
Token ID2067
Feature activation-2.81e-02

Pos logit contributions
 swe+0.020
 someday+0.019
 cheaper+0.019
 spe+0.018
 smaller+0.017
Neg logit contributions
leaf-0.024
ibrarian-0.024
vell-0.022
taker-0.022
 character-0.021
Token your
Token ID534
Feature activation-1.07e-02

Pos logit contributions
leaf+0.046
ibrarian+0.045
vell+0.045
taker+0.043
he+0.043
Neg logit contributions
 swe-0.029
 someday-0.028
 enough-0.027
 cheaper-0.027
 smaller-0.026
Token name
Token ID1438
Feature activation+2.43e-02

Pos logit contributions
 swe+0.019
 someday+0.019
 spe+0.019
 cheaper+0.019
c+0.018
Neg logit contributions
leaf-0.013
ibrarian-0.013
vell-0.012
taker-0.011
 character-0.011
Token?
Token ID30
Feature activation+2.17e-02

Pos logit contributions
leaf+0.044
vell+0.041
ibrarian+0.041
taker+0.040
ibr+0.038
Neg logit contributions
 someday-0.032
 swe-0.032
 cheaper-0.030
c-0.029
 enough-0.029
Token'.
Token ID4458
Feature activation+4.96e-03

Pos logit contributions
leaf+0.035
vell+0.034
ibrarian+0.034
taker+0.032
asc+0.032
Neg logit contributions
 someday-0.031
 swe-0.031
 One-0.030
 cheaper-0.029
 spe-0.029
Token The
Token ID383
Feature activation-6.39e-02

Pos logit contributions
leaf+0.010
ibrarian+0.010
vell+0.010
taker+0.009
asc+0.009
Neg logit contributions
 swe-0.005
 someday-0.005
 spe-0.005
 cheaper-0.005
 One-0.005
Token little
Token ID1310
Feature activation-8.77e-02

Pos logit contributions
onna+0.102
 someday+0.101
 sometime+0.095
 fade+0.089
 becomes+0.085
Neg logit contributions
 ear-0.081
 children-0.078
 goodness-0.075
 adults-0.073
 outside-0.071
Token pig
Token ID12967
Feature activation+3.02e-03

Pos logit contributions
onna+0.173
 sometime+0.155
 someday+0.154
 fade+0.152
 fades+0.144
Neg logit contributions
 goodness-0.093
 children-0.091
ting-0.090
 ear-0.083
 outside-0.079
Tokenlet
Token ID1616
Feature activation+1.29e-02

Pos logit contributions
leaf+0.006
ibrarian+0.005
vell+0.005
taker+0.005
asc+0.005
Neg logit contributions
 swe-0.004
 someday-0.004
 spe-0.004
 cheaper-0.004
 smaller-0.003
Token bl
Token ID698
Feature activation+1.13e-02

Pos logit contributions
leaf+0.027
vell+0.025
ibrarian+0.025
taker+0.024
asc+0.024
Neg logit contributions
 swe-0.014
 someday-0.013
c-0.012
 enough-0.012
 cheaper-0.012
Tokenushed
Token ID7474
Feature activation+9.70e-03

Pos logit contributions
leaf+0.012
ibrarian+0.011
vell+0.010
taker+0.010
he+0.009
Neg logit contributions
 someday-0.023
 swe-0.023
c-0.022
 cheaper-0.022
 spe-0.022
Token and
Token ID290
Feature activation+1.85e-02

Pos logit contributions
leaf+0.015
ibrarian+0.015
vell+0.014
taker+0.013
asc+0.013
Neg logit contributions
 swe-0.016
 someday-0.015
 spe-0.014
 cheaper-0.014
 smaller-0.014
Token and
Token ID290
Feature activation-9.94e-03

Pos logit contributions
 swe+0.006
 someday+0.006
 cheaper+0.006
 spe+0.006
 shin+0.005
Neg logit contributions
leaf-0.010
ibrarian-0.010
vell-0.010
he-0.009
taker-0.009
Token asked
Token ID1965
Feature activation-6.48e-03

Pos logit contributions
 someday+0.013
 swe+0.013
 spe+0.013
 cheaper+0.013
 enough+0.012
Neg logit contributions
leaf-0.017
ibrarian-0.016
taker-0.016
vell-0.016
asc-0.015
Token if
Token ID611
Feature activation-3.62e-02

Pos logit contributions
 swe+0.009
 spe+0.009
 someday+0.009
 cheaper+0.009
 shin+0.009
Neg logit contributions
leaf-0.010
ibrarian-0.010
taker-0.010
vell-0.009
 character-0.009
Token she
Token ID673
Feature activation-1.29e-02

Pos logit contributions
c+0.060
 spe+0.060
 swe+0.058
 cheaper+0.057
 coll+0.055
Neg logit contributions
ibrarian-0.048
leaf-0.045
taker-0.043
 character-0.043
vell-0.043
Token could
Token ID714
Feature activation-6.00e-03

Pos logit contributions
 swe+0.022
 someday+0.021
 cheaper+0.021
 spe+0.020
 smaller+0.020
Neg logit contributions
leaf-0.018
ibrarian-0.018
vell-0.016
taker-0.016
 character-0.016
Token have
Token ID423
Feature activation-8.76e-02

Pos logit contributions
 swe+0.008
 someday+0.008
 cheaper+0.007
 spe+0.007
 shin+0.007
Neg logit contributions
leaf-0.011
ibrarian-0.010
vell-0.010
taker-0.009
he-0.009
Token some
Token ID617
Feature activation-3.17e-02

Pos logit contributions
 someday+0.136
oral+0.130
 weaker+0.129
onna+0.128
Or+0.128
Neg logit contributions
leaf-0.119
 goodness-0.103
 outside-0.100
ils-0.097
 ear-0.096
Token po
Token ID745
Feature activation-2.01e-02

Pos logit contributions
 someday+0.040
 swe+0.036
c+0.036
 spe+0.036
 smaller+0.036
Neg logit contributions
leaf-0.057
ibrarian-0.050
 goodness-0.050
vell-0.050
he-0.047
Tokenpp
Token ID381
Feature activation-2.33e-02

Pos logit contributions
 swe+0.028
 someday+0.027
 smaller+0.027
 spe+0.026
 cheaper+0.026
Neg logit contributions
leaf-0.035
vell-0.032
ibrarian-0.032
he-0.030
taker-0.029
Tokenies
Token ID444
Feature activation-2.33e-02

Pos logit contributions
 someday+0.037
 swe+0.036
 cheaper+0.035
 spe+0.035
 One+0.034
Neg logit contributions
leaf-0.034
vell-0.034
ibrarian-0.033
taker-0.032
he-0.030
Token.
Token ID13
Feature activation-2.38e-02

Pos logit contributions
 swe+0.032
 spe+0.032
 someday+0.031
 cheaper+0.030
 shin+0.030
Neg logit contributions
leaf-0.038
ibrarian-0.037
vell-0.036
taker-0.035
he-0.034
Token and
Token ID290
Feature activation-1.85e-02

Pos logit contributions
 swe+0.050
 cheaper+0.048
 someday+0.048
 spe+0.047
 shin+0.047
Neg logit contributions
ibrarian-0.038
leaf-0.037
vell-0.037
he-0.037
 outside-0.035
Token pretend
Token ID16614
Feature activation-1.94e-02

Pos logit contributions
 someday+0.027
 swe+0.027
 cheaper+0.027
 spe+0.026
c+0.026
Neg logit contributions
leaf-0.029
ibrarian-0.026
vell-0.025
taker-0.025
 character-0.023
Token they
Token ID484
Feature activation-1.86e-02

Pos logit contributions
 swe+0.022
 someday+0.021
c+0.021
 spe+0.020
 cheaper+0.020
Neg logit contributions
leaf-0.036
ibrarian-0.035
vell-0.034
taker-0.034
 character-0.033
Token were
Token ID547
Feature activation-1.70e-02

Pos logit contributions
 swe+0.032
 someday+0.030
 cheaper+0.029
 spe+0.028
c+0.028
Neg logit contributions
leaf-0.025
ibrarian-0.025
vell-0.025
he-0.024
taker-0.022
Token going
Token ID1016
Feature activation-4.47e-02

Pos logit contributions
 swe+0.029
 someday+0.028
 spe+0.027
 cheaper+0.027
 One+0.027
Neg logit contributions
leaf-0.023
vell-0.023
ibrarian-0.022
asc-0.020
taker-0.020
Token on
Token ID319
Feature activation-8.76e-02

Pos logit contributions
 swe+0.081
 spe+0.080
 someday+0.077
 god+0.076
 cheaper+0.074
Neg logit contributions
 outside-0.057
 around-0.054
leaf-0.051
vell-0.049
 everywhere-0.047
Token a
Token ID257
Feature activation-3.68e-02

Pos logit contributions
 swe+0.153
 god+0.152
 fades+0.150
 becomes+0.150
 heals+0.149
Neg logit contributions
 outside-0.101
 everywhere-0.085
 ages-0.081
leaf-0.078
 behind-0.077
Token trip
Token ID5296
Feature activation+1.35e-03

Pos logit contributions
 someday+0.054
 swe+0.052
 enough+0.052
c+0.052
 spe+0.051
Neg logit contributions
leaf-0.051
vell-0.051
ibrarian-0.049
 outside-0.049
he-0.049
Token.
Token ID13
Feature activation-1.69e-02

Pos logit contributions
leaf+0.002
ibrarian+0.002
vell+0.002
taker+0.002
asc+0.002
Neg logit contributions
 swe-0.002
 someday-0.002
 cheaper-0.002
 spe-0.002
 smaller-0.002
Token They
Token ID1119
Feature activation+5.20e-03

Pos logit contributions
 swe+0.024
 spe+0.024
 someday+0.023
 shin+0.022
 cheaper+0.022
Neg logit contributions
leaf-0.027
ibrarian-0.027
he-0.026
asc-0.024
taker-0.024
Token took
Token ID1718
Feature activation-8.02e-03

Pos logit contributions
leaf+0.010
ibrarian+0.009
vell+0.009
taker+0.009
asc+0.008
Neg logit contributions
 swe-0.007
 someday-0.007
 spe-0.006
 cheaper-0.006
 smaller-0.006
Token so
Token ID523
Feature activation-7.24e-03

Pos logit contributions
 spe+0.077
 someday+0.076
 swe+0.074
 cheaper+0.073
 One+0.073
Neg logit contributions
vell-0.065
taker-0.062
ibrarian-0.062
leaf-0.062
 around-0.059
Token they
Token ID484
Feature activation-9.97e-03

Pos logit contributions
 swe+0.009
 someday+0.009
 spe+0.009
 cheaper+0.009
c+0.009
Neg logit contributions
leaf-0.013
ibrarian-0.012
vell-0.012
taker-0.012
he-0.011
Token can
Token ID460
Feature activation-2.19e-02

Pos logit contributions
 swe+0.012
 someday+0.012
 cheaper+0.011
 spe+0.011
 smaller+0.011
Neg logit contributions
leaf-0.019
ibrarian-0.018
vell-0.018
taker-0.017
he-0.017
Token make
Token ID787
Feature activation-2.25e-02

Pos logit contributions
 swe+0.033
 someday+0.031
 cheaper+0.030
 spe+0.030
 shin+0.030
Neg logit contributions
leaf-0.033
ibrarian-0.033
he-0.033
vell-0.032
taker-0.030
Token the
Token ID262
Feature activation-4.02e-03

Pos logit contributions
 someday+0.031
 swe+0.030
 spe+0.029
 cheaper+0.029
 One+0.028
Neg logit contributions
leaf-0.037
ibrarian-0.036
vell-0.035
taker-0.034
 goodness-0.032
Token forest
Token ID8222
Feature activation-8.72e-02

Pos logit contributions
 swe+0.006
 someday+0.006
 cheaper+0.006
 spe+0.006
c+0.006
Neg logit contributions
leaf-0.006
ibrarian-0.006
vell-0.006
taker-0.006
he-0.005
Token pretty
Token ID2495
Feature activation-5.02e-02

Pos logit contributions
 One+0.128
 someday+0.126
One+0.122
 spe+0.119
 Sometimes+0.116
Neg logit contributions
vell-0.118
taker-0.118
 outside-0.110
 around-0.110
leaf-0.107
Token."
Token ID526
Feature activation-4.77e-03

Pos logit contributions
 cheaper+0.077
 One+0.076
 spe+0.075
 someday+0.075
c+0.074
Neg logit contributions
vell-0.065
taker-0.062
 around-0.061
 outside-0.061
leaf-0.060
Token
Token ID198
Feature activation+1.18e-02

Pos logit contributions
 swe+0.006
 someday+0.006
 spe+0.006
 cheaper+0.005
 smaller+0.005
Neg logit contributions
leaf-0.009
ibrarian-0.009
vell-0.008
taker-0.008
he-0.008
Token
Token ID198
Feature activation-8.19e-03

Pos logit contributions
leaf+0.026
vell+0.025
ibrarian+0.025
taker+0.024
asc+0.023
Neg logit contributions
 swe-0.011
 someday-0.011
 cheaper-0.010
 One-0.010
c-0.010
TokenL
Token ID43
Feature activation+1.47e-02

Pos logit contributions
 swe+0.011
 spe+0.011
 someday+0.011
 cheaper+0.010
 enough+0.010
Neg logit contributions
leaf-0.014
ibrarian-0.014
vell-0.013
he-0.013
taker-0.013
Token.
Token ID13
Feature activation+7.11e-04

Pos logit contributions
 swe+0.018
 someday+0.017
 cheaper+0.017
 spe+0.017
 smaller+0.016
Neg logit contributions
leaf-0.022
ibrarian-0.021
vell-0.021
taker-0.020
he-0.020
Token Soon
Token ID15894
Feature activation-2.91e-02

Pos logit contributions
leaf+0.001
ibrarian+0.001
vell+0.001
taker+0.001
asc+0.001
Neg logit contributions
 swe-0.001
 someday-0.001
 cheaper-0.001
 spe-0.001
 smaller-0.001
Token,
Token ID11
Feature activation+4.55e-03

Pos logit contributions
 swe+0.039
c+0.037
 cheaper+0.037
 spe+0.036
 someday+0.036
Neg logit contributions
ibrarian-0.049
 character-0.046
leaf-0.046
taker-0.046
vell-0.045
Token the
Token ID262
Feature activation+2.49e-02

Pos logit contributions
leaf+0.009
ibrarian+0.008
vell+0.008
taker+0.008
asc+0.008
Neg logit contributions
 swe-0.005
 someday-0.005
 cheaper-0.005
 spe-0.005
 smaller-0.005
Token path
Token ID3108
Feature activation-2.77e-02

Pos logit contributions
leaf+0.045
ibrarian+0.044
vell+0.042
asc+0.041
taker+0.041
Neg logit contributions
 swe-0.032
 someday-0.031
 spe-0.031
 smaller-0.030
 cheaper-0.030
Token grew
Token ID6348
Feature activation-8.65e-02

Pos logit contributions
 swe+0.039
 someday+0.037
 spe+0.035
 cheaper+0.035
 smaller+0.033
Neg logit contributions
ibrarian-0.046
leaf-0.046
vell-0.044
taker-0.044
he-0.043
Token steep
Token ID14559
Feature activation-2.33e-02

Pos logit contributions
 bidder+0.133
c+0.122
 someday+0.118
 god+0.116
 coll+0.116
Neg logit contributions
 outside-0.125
vell-0.123
 around-0.122
 everywhere-0.121
iring-0.116
Token.
Token ID13
Feature activation+4.51e-03

Pos logit contributions
 swe+0.036
 someday+0.033
 cheaper+0.033
 spe+0.032
c+0.032
Neg logit contributions
leaf-0.035
he-0.034
ibrarian-0.033
vell-0.033
taker-0.032
Token Joe
Token ID5689
Feature activation-1.48e-02

Pos logit contributions
leaf+0.008
ibrarian+0.008
vell+0.008
taker+0.007
asc+0.007
Neg logit contributions
 swe-0.006
 someday-0.006
 cheaper-0.006
 spe-0.006
c-0.005
Token was
Token ID373
Feature activation+4.94e-03

Pos logit contributions
 swe+0.021
 someday+0.020
 cheaper+0.019
 spe+0.019
 smaller+0.018
Neg logit contributions
leaf-0.025
ibrarian-0.025
vell-0.023
taker-0.023
he-0.023
Token tired
Token ID10032
Feature activation-2.14e-02

Pos logit contributions
leaf+0.008
ibrarian+0.008
vell+0.007
taker+0.007
asc+0.007
Neg logit contributions
 swe-0.007
 someday-0.007
 cheaper-0.007
 spe-0.007
 smaller-0.007
Token But
Token ID887
Feature activation-2.17e-02

Pos logit contributions
 swe+0.007
 someday+0.007
 cheaper+0.007
 spe+0.006
 shin+0.006
Neg logit contributions
leaf-0.014
ibrarian-0.014
vell-0.013
taker-0.013
he-0.013
Token sometimes
Token ID3360
Feature activation-3.23e-02

Pos logit contributions
 swe+0.029
 cheaper+0.027
 spe+0.027
 shin+0.026
c+0.026
Neg logit contributions
ibrarian-0.035
leaf-0.035
 character-0.033
taker-0.032
 ear-0.032
Token,
Token ID11
Feature activation-2.81e-02

Pos logit contributions
 swe+0.036
 cheaper+0.036
c+0.036
 spe+0.036
 shin+0.035
Neg logit contributions
 character-0.054
ibrarian-0.054
leaf-0.052
izards-0.052
taker-0.051
Token the
Token ID262
Feature activation-6.02e-03

Pos logit contributions
 swe+0.032
c+0.031
 cheaper+0.030
 shin+0.029
 spe+0.029
Neg logit contributions
leaf-0.050
ibrarian-0.049
 character-0.048
izards-0.047
vell-0.047
Token sky
Token ID6766
Feature activation-5.88e-02

Pos logit contributions
 someday+0.008
 swe+0.008
 spe+0.008
 cheaper+0.008
c+0.008
Neg logit contributions
leaf-0.010
ibrarian-0.010
vell-0.009
taker-0.009
asc-0.009
Token gets
Token ID3011
Feature activation-8.65e-02

Pos logit contributions
 swe+0.089
 shin+0.087
 someday+0.085
 cheaper+0.084
 spe+0.081
Neg logit contributions
he-0.076
 around-0.076
ibrarian-0.071
ower-0.071
ong-0.071
Token dark
Token ID3223
Feature activation-3.42e-02

Pos logit contributions
 bidder+0.154
 god+0.153
 someday+0.152
 coll+0.148
c+0.147
Neg logit contributions
 everywhere-0.096
 outside-0.095
 around-0.089
ong-0.080
 behind-0.080
Token and
Token ID290
Feature activation-5.65e-02

Pos logit contributions
 swe+0.055
 cheaper+0.053
 spe+0.052
c+0.052
 shin+0.051
Neg logit contributions
 outside-0.047
he-0.046
leaf-0.043
vell-0.043
taker-0.042
Token it
Token ID340
Feature activation-5.97e-02

Pos logit contributions
 coll+0.081
 someday+0.081
c+0.081
 cheaper+0.078
 swe+0.077
Neg logit contributions
taker-0.073
 around-0.071
 outside-0.067
leaf-0.067
 everywhere-0.065
Token rains
Token ID29424
Feature activation-2.00e-02

Pos logit contributions
 someday+0.086
 swe+0.084
 cheaper+0.084
 coll+0.078
 spe+0.076
Neg logit contributions
 around-0.086
ibrarian-0.084
he-0.082
taker-0.080
 outside-0.077
Token.
Token ID13
Feature activation-1.73e-02

Pos logit contributions
 swe+0.031
 cheaper+0.029
 someday+0.029
 spe+0.028
 shin+0.028
Neg logit contributions
leaf-0.029
ibrarian-0.029
he-0.029
vell-0.028
taker-0.028
Token mom
Token ID1995
Feature activation-1.89e-02

Pos logit contributions
 someday+0.029
 swe+0.029
 spe+0.029
c+0.029
 cheaper+0.028
Neg logit contributions
ibrarian-0.022
leaf-0.021
vell-0.020
ibr-0.019
 character-0.019
Token if
Token ID611
Feature activation-3.85e-02

Pos logit contributions
 spe+0.030
 swe+0.028
 cheaper+0.026
 someday+0.026
 shin+0.026
Neg logit contributions
ibrarian-0.028
leaf-0.028
vell-0.026
taker-0.026
he-0.026
Token they
Token ID484
Feature activation-1.08e-02

Pos logit contributions
c+0.060
 swe+0.058
 spe+0.057
 cheaper+0.054
 coll+0.053
Neg logit contributions
ibrarian-0.055
leaf-0.052
taker-0.051
vell-0.050
 character-0.050
Token could
Token ID714
Feature activation-3.46e-02

Pos logit contributions
 swe+0.020
 someday+0.018
 cheaper+0.018
 spe+0.018
c+0.018
Neg logit contributions
leaf-0.014
ibrarian-0.014
vell-0.013
taker-0.013
he-0.012
Token go
Token ID467
Feature activation-4.93e-02

Pos logit contributions
 swe+0.055
 spe+0.051
 shin+0.050
 cheaper+0.050
c+0.048
Neg logit contributions
he-0.051
leaf-0.048
ibrarian-0.047
vell-0.047
 outside-0.045
Token on
Token ID319
Feature activation-8.57e-02

Pos logit contributions
 spe+0.099
 swe+0.099
 shin+0.089
 cheaper+0.087
 smaller+0.086
Neg logit contributions
 outside-0.058
he-0.051
 around-0.050
leaf-0.049
 behind-0.049
Token a
Token ID257
Feature activation-6.44e-02

Pos logit contributions
 spe+0.160
onna+0.158
Or+0.158
 becomes+0.154
 fade+0.153
Neg logit contributions
 outside-0.084
 everywhere-0.071
 all-0.071
 behind-0.070
 ages-0.066
Token saf
Token ID1932
Feature activation+1.40e-02

Pos logit contributions
 spe+0.089
 swe+0.089
 becomes+0.086
 fade+0.086
 someday+0.085
Neg logit contributions
 outside-0.089
vell-0.085
leaf-0.084
lf-0.082
ibrarian-0.080
Tokenari
Token ID2743
Feature activation-2.37e-02

Pos logit contributions
leaf+0.023
ibrarian+0.022
taker+0.021
vell+0.021
opher+0.020
Neg logit contributions
 someday-0.020
 swe-0.020
 cheaper-0.019
 spe-0.018
 One-0.018
Token,
Token ID11
Feature activation-2.45e-02

Pos logit contributions
 swe+0.038
 spe+0.036
 smaller+0.035
 cheaper+0.034
 shin+0.034
Neg logit contributions
leaf-0.035
ibrarian-0.034
vell-0.033
he-0.033
 outside-0.031
Token but
Token ID475
Feature activation-3.63e-02

Pos logit contributions
 swe+0.040
 spe+0.039
c+0.038
 someday+0.037
 cheaper+0.037
Neg logit contributions
leaf-0.034
vell-0.030
ibrarian-0.030
taker-0.029
he-0.028
Token them
Token ID606
Feature activation-5.90e-03

Pos logit contributions
 someday+0.053
 swe+0.051
c+0.048
 cheaper+0.048
 spe+0.048
Neg logit contributions
taker-0.060
leaf-0.059
ibrarian-0.058
he-0.058
 around-0.057
Token up
Token ID510
Feature activation-3.89e-02

Pos logit contributions
 swe+0.012
 someday+0.012
 cheaper+0.012
 spe+0.012
c+0.011
Neg logit contributions
leaf-0.006
ibrarian-0.006
vell-0.005
taker-0.005
he-0.005
Token.
Token ID13
Feature activation-2.70e-02

Pos logit contributions
 swe+0.071
c+0.068
 someday+0.067
 cheaper+0.066
 spe+0.064
Neg logit contributions
he-0.042
 outside-0.042
 around-0.042
vell-0.041
leaf-0.040
Token But
Token ID887
Feature activation-3.96e-02

Pos logit contributions
 spe+0.039
 someday+0.039
 swe+0.039
 cheaper+0.037
c+0.036
Neg logit contributions
leaf-0.042
taker-0.040
vell-0.039
ibrarian-0.039
issors-0.038
Token the
Token ID262
Feature activation-2.43e-02

Pos logit contributions
 swe+0.072
 spe+0.072
 cheaper+0.068
c+0.065
 someday+0.064
Neg logit contributions
bled-0.041
ting-0.041
taker-0.039
 ear-0.039
 around-0.039
Token last
Token ID938
Feature activation-8.56e-02

Pos logit contributions
 someday+0.030
 swe+0.027
 spe+0.027
 cheaper+0.026
c+0.026
Neg logit contributions
ibrarian-0.043
leaf-0.042
vell-0.040
taker-0.040
 character-0.039
Token snowball
Token ID46275
Feature activation-3.00e-02

Pos logit contributions
Or+0.152
 someday+0.141
Flor+0.138
 cheaper+0.137
 spe+0.136
Neg logit contributions
 character-0.087
lf-0.077
 around-0.070
ied-0.069
 members-0.069
Token was
Token ID373
Feature activation-1.21e-02

Pos logit contributions
 swe+0.044
 someday+0.043
 cheaper+0.042
 spe+0.041
c+0.039
Neg logit contributions
ibrarian-0.045
leaf-0.044
 character-0.043
taker-0.043
vell-0.042
Token too
Token ID1165
Feature activation-1.59e-02

Pos logit contributions
 someday+0.015
 swe+0.015
 spe+0.014
c+0.014
 cheaper+0.014
Neg logit contributions
leaf-0.021
ibrarian-0.021
vell-0.020
taker-0.020
asc-0.019
Token heavy
Token ID4334
Feature activation-5.81e-03

Pos logit contributions
 someday+0.024
 swe+0.023
c+0.021
 spe+0.021
 cheaper+0.021
Neg logit contributions
leaf-0.025
ibrarian-0.024
vell-0.024
taker-0.024
he-0.023
Token for
Token ID329
Feature activation-1.34e-02

Pos logit contributions
 swe+0.009
 someday+0.009
 spe+0.009
 cheaper+0.008
c+0.008
Neg logit contributions
leaf-0.009
ibrarian-0.009
vell-0.008
taker-0.008
he-0.008
Token.
Token ID13
Feature activation-4.99e-03

Pos logit contributions
 swe+0.021
 someday+0.020
 spe+0.020
 cheaper+0.019
c+0.019
Neg logit contributions
leaf-0.023
ibrarian-0.022
vell-0.021
taker-0.021
he-0.021
Token She
Token ID1375
Feature activation-2.19e-03

Pos logit contributions
 swe+0.007
 someday+0.006
 spe+0.006
 cheaper+0.006
c+0.006
Neg logit contributions
leaf-0.009
ibrarian-0.009
vell-0.008
taker-0.008
he-0.008
Token wanted
Token ID2227
Feature activation-2.33e-02

Pos logit contributions
 swe+0.003
 someday+0.003
 cheaper+0.003
 spe+0.003
 smaller+0.003
Neg logit contributions
leaf-0.004
ibrarian-0.003
vell-0.003
taker-0.003
he-0.003
Token to
Token ID284
Feature activation-1.68e-03

Pos logit contributions
c+0.031
 spe+0.031
 someday+0.030
 swe+0.030
 shin+0.029
Neg logit contributions
leaf-0.039
ibrarian-0.037
 character-0.034
taker-0.033
 ear-0.033
Token make
Token ID787
Feature activation-1.51e-02

Pos logit contributions
 swe+0.002
 someday+0.002
 cheaper+0.002
 spe+0.002
c+0.002
Neg logit contributions
leaf-0.003
ibrarian-0.003
vell-0.003
taker-0.003
he-0.003
Token it
Token ID340
Feature activation-8.55e-02

Pos logit contributions
 someday+0.023
 swe+0.022
 spe+0.022
 cheaper+0.021
c+0.021
Neg logit contributions
leaf-0.024
ibrarian-0.023
taker-0.022
vell-0.022
 character-0.021
Token soar
Token ID48701
Feature activation-4.51e-02

Pos logit contributions
 spe+0.135
�+0.129
c+0.121
oral+0.121
use+0.120
Neg logit contributions
 around-0.115
 outside-0.111
taker-0.106
ong-0.105
vell-0.103
Token higher
Token ID2440
Feature activation-3.91e-02

Pos logit contributions
 swe+0.078
 spe+0.077
 cheaper+0.074
 One+0.072
oral+0.071
Neg logit contributions
he-0.054
 around-0.054
 outside-0.050
vell-0.050
 behind-0.048
Token than
Token ID621
Feature activation+1.50e-03

Pos logit contributions
 swe+0.070
 cheaper+0.066
 spe+0.064
c+0.064
 shin+0.063
Neg logit contributions
he-0.045
 around-0.042
 behind-0.041
 outside-0.041
ibrarian-0.040
Token the
Token ID262
Feature activation-1.69e-02

Pos logit contributions
leaf+0.003
ibrarian+0.003
vell+0.003
taker+0.002
asc+0.002
Neg logit contributions
 swe-0.002
 someday-0.002
 cheaper-0.002
 spe-0.002
 smaller-0.002
Token boy
Token ID2933
Feature activation+7.97e-03

Pos logit contributions
 swe+0.023
 someday+0.023
c+0.021
 spe+0.021
 cheaper+0.021
Neg logit contributions
leaf-0.028
ibrarian-0.026
vell-0.026
he-0.026
taker-0.025
Token.
Token ID13
Feature activation-1.47e-02

Pos logit contributions
leaf+0.005
ibrarian+0.005
vell+0.005
taker+0.005
asc+0.004
Neg logit contributions
 swe-0.004
 someday-0.004
 cheaper-0.003
 spe-0.003
 smaller-0.003
Token Every
Token ID3887
Feature activation-2.07e-02

Pos logit contributions
 swe+0.015
 spe+0.014
 cheaper+0.014
 someday+0.014
 shin+0.013
Neg logit contributions
leaf-0.030
ibrarian-0.030
he-0.028
taker-0.028
 character-0.027
Token morning
Token ID3329
Feature activation-3.96e-04

Pos logit contributions
 swe+0.041
 cheaper+0.040
 shin+0.039
 spe+0.038
 someday+0.038
Neg logit contributions
leaf-0.022
ibrarian-0.022
 character-0.020
he-0.020
taker-0.019
Token,
Token ID11
Feature activation-1.40e-02

Pos logit contributions
 swe+0.000
 someday+0.000
 cheaper+0.000
 spe+0.000
 smaller+0.000
Neg logit contributions
leaf-0.001
ibrarian-0.001
vell-0.001
taker-0.001
he-0.001
Token the
Token ID262
Feature activation-3.12e-02

Pos logit contributions
 swe+0.020
 someday+0.020
 cheaper+0.020
 spe+0.019
c+0.019
Neg logit contributions
leaf-0.022
ibrarian-0.021
taker-0.020
vell-0.020
 character-0.020
Token little
Token ID1310
Feature activation-8.52e-02

Pos logit contributions
 someday+0.041
c+0.039
 swe+0.036
 cheaper+0.036
 One+0.035
Neg logit contributions
vell-0.050
ibrarian-0.050
leaf-0.049
taker-0.048
 goodness-0.047
Token pig
Token ID12967
Feature activation-3.59e-02

Pos logit contributions
onna+0.150
 someday+0.142
 sometime+0.141
 fade+0.133
 simpler+0.132
Neg logit contributions
ting-0.093
 goodness-0.092
 children-0.091
 ear-0.086
 outside-0.085
Token would
Token ID561
Feature activation+4.69e-03

Pos logit contributions
 cheaper+0.045
 swe+0.044
 spe+0.042
 shin+0.041
 someday+0.040
Neg logit contributions
ibrarian-0.065
leaf-0.060
vell-0.060
taker-0.059
he-0.059
Token run
Token ID1057
Feature activation-1.70e-03

Pos logit contributions
leaf+0.008
ibrarian+0.008
vell+0.008
taker+0.007
asc+0.007
Neg logit contributions
 swe-0.006
 someday-0.006
 spe-0.006
 cheaper-0.006
 smaller-0.006
Token and
Token ID290
Feature activation+1.82e-02

Pos logit contributions
 swe+0.003
 someday+0.003
 spe+0.003
 cheaper+0.003
 smaller+0.002
Neg logit contributions
leaf-0.003
ibrarian-0.002
vell-0.002
taker-0.002
he-0.002
Token jump
Token ID4391
Feature activation+6.17e-03

Pos logit contributions
leaf+0.033
ibrarian+0.033
vell+0.032
taker+0.030
ibr+0.030
Neg logit contributions
 swe-0.025
 someday-0.023
 spe-0.022
 smaller-0.022
c-0.021
Token me
Token ID502
Feature activation+2.88e-02

Pos logit contributions
leaf+0.021
ibrarian+0.020
vell+0.020
taker+0.018
ibr+0.018
Neg logit contributions
 swe-0.021
 someday-0.020
c-0.020
 cheaper-0.019
 enough-0.019
Token!"
Token ID2474
Feature activation-6.49e-03

Pos logit contributions
leaf+0.062
ibrarian+0.056
vell+0.056
asc+0.054
ibr+0.053
Neg logit contributions
 someday-0.032
 enough-0.028
 swe-0.028
c-0.027
 smaller-0.027
Token
Token ID198
Feature activation+3.39e-03

Pos logit contributions
 swe+0.008
 someday+0.008
 cheaper+0.007
 spe+0.007
 smaller+0.007
Neg logit contributions
leaf-0.013
ibrarian-0.012
vell-0.011
taker-0.011
he-0.011
Token
Token ID198
Feature activation-1.04e-02

Pos logit contributions
leaf+0.007
ibrarian+0.007
vell+0.007
taker+0.006
asc+0.006
Neg logit contributions
 swe-0.004
 someday-0.003
 spe-0.003
 cheaper-0.003
 smaller-0.003
TokenThe
Token ID464
Feature activation-5.80e-02

Pos logit contributions
 swe+0.013
 someday+0.013
 cheaper+0.012
 spe+0.012
 enough+0.012
Neg logit contributions
leaf-0.020
ibrarian-0.019
vell-0.018
taker-0.018
he-0.017
Token little
Token ID1310
Feature activation-8.52e-02

Pos logit contributions
 someday+0.096
onna+0.094
 sometime+0.091
c+0.090
eter+0.084
Neg logit contributions
he-0.066
ibrarian-0.065
 maker-0.064
 children-0.064
coon-0.063
Token boy
Token ID2933
Feature activation-7.63e-03

Pos logit contributions
onna+0.160
 former+0.154
 sometime+0.153
 someday+0.148
 reality+0.146
Neg logit contributions
 goodness-0.085
ting-0.084
he-0.078
 children-0.074
 ear-0.073
Token said
Token ID531
Feature activation+2.46e-03

Pos logit contributions
 swe+0.010
 someday+0.010
 cheaper+0.010
 spe+0.010
 smaller+0.009
Neg logit contributions
leaf-0.013
ibrarian-0.013
vell-0.012
taker-0.012
 character-0.012
Token,
Token ID11
Feature activation+3.51e-03

Pos logit contributions
leaf+0.005
ibrarian+0.005
vell+0.005
taker+0.005
asc+0.004
Neg logit contributions
 swe-0.003
 someday-0.003
 cheaper-0.002
 spe-0.002
 smaller-0.002
Token "
Token ID366
Feature activation+1.25e-02

Pos logit contributions
leaf+0.006
ibrarian+0.006
vell+0.006
taker+0.005
asc+0.005
Neg logit contributions
 swe-0.005
 someday-0.005
 spe-0.005
 cheaper-0.005
 smaller-0.005
TokenWhat
Token ID2061
Feature activation+1.61e-02

Pos logit contributions
ibrarian+0.022
leaf+0.022
vell+0.021
taker+0.020
opher+0.020
Neg logit contributions
 swe-0.017
 someday-0.017
 spe-0.016
 cheaper-0.015
 shin-0.015
Token."
Token ID526
Feature activation-1.11e-02

Pos logit contributions
 swe+0.001
 someday+0.001
 cheaper+0.001
 spe+0.001
 smaller+0.001
Neg logit contributions
leaf-0.002
ibrarian-0.002
vell-0.002
taker-0.002
asc-0.001
Token
Token ID198
Feature activation-4.81e-03

Pos logit contributions
 swe+0.015
 someday+0.014
 cheaper+0.014
 spe+0.014
 smaller+0.013
Neg logit contributions
leaf-0.020
ibrarian-0.019
taker-0.019
he-0.018
vell-0.018
Token
Token ID198
Feature activation-1.97e-02

Pos logit contributions
 swe+0.006
 someday+0.006
 spe+0.005
 cheaper+0.005
 smaller+0.005
Neg logit contributions
leaf-0.009
ibrarian-0.009
vell-0.009
taker-0.009
he-0.008
TokenSo
Token ID2396
Feature activation-2.60e-02

Pos logit contributions
 swe+0.029
 spe+0.028
 someday+0.027
 enough+0.027
 cheaper+0.026
Neg logit contributions
leaf-0.034
ibrarian-0.032
he-0.030
vell-0.030
taker-0.029
Token the
Token ID262
Feature activation-3.65e-02

Pos logit contributions
 swe+0.038
 cheaper+0.036
c+0.036
 someday+0.035
 spe+0.034
Neg logit contributions
leaf-0.038
ibrarian-0.038
taker-0.038
 character-0.036
he-0.036
Token little
Token ID1310
Feature activation-8.49e-02

Pos logit contributions
 someday+0.071
 sometime+0.064
c+0.063
 swe+0.063
 spe+0.063
Neg logit contributions
ibrarian-0.038
he-0.037
taker-0.035
 character-0.033
vell-0.033
Token boy
Token ID2933
Feature activation-1.89e-02

Pos logit contributions
onna+0.204
 former+0.193
 sometime+0.189
 reality+0.185
 fade+0.183
Neg logit contributions
 children-0.054
 goodness-0.049
ting-0.045
 kids-0.043
he-0.040
Token started
Token ID2067
Feature activation-2.53e-02

Pos logit contributions
 swe+0.027
 someday+0.027
 cheaper+0.026
 spe+0.025
 smaller+0.024
Neg logit contributions
leaf-0.031
ibrarian-0.030
vell-0.028
taker-0.028
 character-0.027
Token sharp
Token ID7786
Feature activation-2.02e-02

Pos logit contributions
 someday+0.044
 swe+0.042
 cheaper+0.041
 spe+0.039
 shin+0.039
Neg logit contributions
leaf-0.034
ibrarian-0.030
vell-0.029
 outside-0.029
taker-0.029
Tokenening
Token ID3101
Feature activation-1.82e-03

Pos logit contributions
 someday+0.031
 swe+0.031
 cheaper+0.029
 spe+0.029
c+0.028
Neg logit contributions
leaf-0.030
ibrarian-0.027
taker-0.027
vell-0.026
ting-0.026
Token the
Token ID262
Feature activation+1.57e-02

Pos logit contributions
 swe+0.003
 someday+0.003
 cheaper+0.003
 spe+0.003
c+0.002
Neg logit contributions
leaf-0.003
ibrarian-0.003
vell-0.003
taker-0.003
he-0.003
Token,
Token ID11
Feature activation-2.54e-02

Pos logit contributions
 swe+0.035
c+0.035
 cheaper+0.033
 someday+0.033
 spe+0.033
Neg logit contributions
ibrarian-0.043
taker-0.042
leaf-0.042
vell-0.042
he-0.040
Token Grace
Token ID16156
Feature activation-1.26e-02

Pos logit contributions
 swe+0.036
 cheaper+0.034
 someday+0.034
c+0.034
 spe+0.033
Neg logit contributions
leaf-0.040
taker-0.039
vell-0.038
ibrarian-0.038
he-0.037
Token's
Token ID338
Feature activation+1.39e-02

Pos logit contributions
 swe+0.017
 someday+0.016
 cheaper+0.016
 spe+0.015
 smaller+0.015
Neg logit contributions
leaf-0.022
ibrarian-0.022
vell-0.021
taker-0.020
he-0.020
Token attic
Token ID46113
Feature activation-4.43e-02

Pos logit contributions
leaf+0.024
ibrarian+0.022
vell+0.022
asc+0.021
taker+0.021
Neg logit contributions
 swe-0.021
 someday-0.020
 cheaper-0.020
 spe-0.019
 smaller-0.019
Token was
Token ID373
Feature activation-2.90e-02

Pos logit contributions
 swe+0.062
 someday+0.060
 spe+0.058
c+0.056
 cheaper+0.056
Neg logit contributions
ibrarian-0.071
vell-0.069
taker-0.067
leaf-0.065
he-0.065
Token becoming
Token ID5033
Feature activation-8.48e-02

Pos logit contributions
 swe+0.045
 someday+0.044
 spe+0.043
c+0.042
 coll+0.041
Neg logit contributions
ibrarian-0.040
vell-0.040
leaf-0.039
taker-0.039
he-0.037
Token more
Token ID517
Feature activation-7.42e-03

Pos logit contributions
 bidder+0.116
 coll+0.115
c+0.113
 spe+0.112
 One+0.110
Neg logit contributions
vell-0.118
 outside-0.114
taker-0.113
 around-0.110
 everywhere-0.105
Token and
Token ID290
Feature activation-4.10e-02

Pos logit contributions
 swe+0.011
 someday+0.011
 spe+0.011
c+0.011
 cheaper+0.011
Neg logit contributions
leaf-0.011
ibrarian-0.011
vell-0.011
taker-0.010
he-0.010
Token more
Token ID517
Feature activation-1.61e-03

Pos logit contributions
 spe+0.042
 One+0.042
c+0.041
 someday+0.040
 swe+0.039
Neg logit contributions
taker-0.072
leaf-0.071
vell-0.071
ibrarian-0.070
 character-0.068
Token full
Token ID1336
Feature activation+1.62e-02

Pos logit contributions
 swe+0.002
 someday+0.002
 spe+0.002
 cheaper+0.002
c+0.002
Neg logit contributions
leaf-0.003
ibrarian-0.003
vell-0.003
taker-0.002
he-0.002
Token with
Token ID351
Feature activation+2.25e-03

Pos logit contributions
leaf+0.026
ibrarian+0.025
vell+0.024
asc+0.023
taker+0.023
Neg logit contributions
 swe-0.025
 someday-0.025
 cheaper-0.023
 spe-0.023
 enough-0.023
Token that
Token ID326
Feature activation-3.38e-03

Pos logit contributions
 swe+0.001
 someday+0.001
 cheaper+0.001
 spe+0.001
 smaller+0.001
Neg logit contributions
leaf-0.002
ibrarian-0.002
vell-0.002
taker-0.002
asc-0.002
Token it
Token ID340
Feature activation-4.54e-04

Pos logit contributions
 swe+0.004
 someday+0.004
 cheaper+0.004
 spe+0.004
 smaller+0.004
Neg logit contributions
leaf-0.006
ibrarian-0.006
vell-0.006
taker-0.006
 character-0.005
Token could
Token ID714
Feature activation-1.83e-02

Pos logit contributions
 swe+0.001
 someday+0.001
 cheaper+0.001
 spe+0.001
 smaller+0.001
Neg logit contributions
leaf-0.001
ibrarian-0.001
vell-0.001
taker-0.001
asc-0.001
Token make
Token ID787
Feature activation-1.80e-02

Pos logit contributions
 swe+0.026
 someday+0.024
 cheaper+0.024
c+0.023
 enough+0.023
Neg logit contributions
leaf-0.031
ibrarian-0.030
vell-0.029
taker-0.028
he-0.028
Token the
Token ID262
Feature activation-7.20e-03

Pos logit contributions
 someday+0.025
 swe+0.024
 cheaper+0.024
 spe+0.023
 One+0.022
Neg logit contributions
leaf-0.029
ibrarian-0.029
vell-0.028
taker-0.027
 character-0.027
Token garden
Token ID11376
Feature activation-8.40e-02

Pos logit contributions
 swe+0.010
 someday+0.010
 cheaper+0.009
 spe+0.009
c+0.009
Neg logit contributions
leaf-0.012
ibrarian-0.012
vell-0.011
taker-0.011
 character-0.011
Token even
Token ID772
Feature activation-4.40e-02

Pos logit contributions
 someday+0.127
 One+0.121
One+0.121
 bidder+0.119
 spe+0.118
Neg logit contributions
vell-0.110
taker-0.110
 around-0.104
 outside-0.104
ong-0.102
Token better
Token ID1365
Feature activation-1.39e-03

Pos logit contributions
 One+0.038
 someday+0.038
 spe+0.035
c+0.034
One+0.033
Neg logit contributions
leaf-0.088
taker-0.084
vell-0.082
 around-0.081
 outside-0.080
Token.
Token ID13
Feature activation+2.62e-03

Pos logit contributions
 swe+0.002
 someday+0.002
 cheaper+0.002
 spe+0.002
 smaller+0.002
Neg logit contributions
leaf-0.002
ibrarian-0.002
vell-0.002
taker-0.002
asc-0.002
Token
Token ID198
Feature activation+1.62e-02

Pos logit contributions
leaf+0.005
ibrarian+0.005
vell+0.005
taker+0.005
asc+0.004
Neg logit contributions
 swe-0.003
 someday-0.003
 cheaper-0.003
 spe-0.003
c-0.003
Token
Token ID198
Feature activation+5.60e-03

Pos logit contributions
leaf+0.036
vell+0.036
ibrarian+0.035
taker+0.034
asc+0.034
Neg logit contributions
 someday-0.014
 swe-0.013
 One-0.012
c-0.012
 cheaper-0.012
TokenSoon
Token ID28093
Feature activation-4.02e-02

Pos logit contributions
 swe+0.026
 spe+0.024
 cheaper+0.024
 someday+0.024
 enough+0.023
Neg logit contributions
leaf-0.030
ibrarian-0.029
vell-0.027
he-0.027
taker-0.027
Token,
Token ID11
Feature activation-1.87e-03

Pos logit contributions
 swe+0.060
c+0.060
 cheaper+0.058
 shin+0.056
 coll+0.054
Neg logit contributions
 character-0.056
 goodness-0.056
ibrarian-0.055
leaf-0.054
ting-0.053
Token the
Token ID262
Feature activation+7.60e-03

Pos logit contributions
 swe+0.002
 someday+0.002
 cheaper+0.002
 spe+0.002
c+0.002
Neg logit contributions
leaf-0.004
ibrarian-0.003
vell-0.003
taker-0.003
he-0.003
Token night
Token ID1755
Feature activation-1.15e-02

Pos logit contributions
leaf+0.014
ibrarian+0.013
vell+0.013
taker+0.012
asc+0.012
Neg logit contributions
 swe-0.010
 someday-0.010
 spe-0.010
 cheaper-0.009
 smaller-0.009
Token sky
Token ID6766
Feature activation-4.99e-02

Pos logit contributions
 swe+0.016
 someday+0.015
 cheaper+0.015
 spe+0.014
 smaller+0.014
Neg logit contributions
leaf-0.019
ibrarian-0.019
vell-0.018
taker-0.018
he-0.017
Token grew
Token ID6348
Feature activation-8.38e-02

Pos logit contributions
 swe+0.074
 cheaper+0.073
 someday+0.068
 coll+0.066
 shin+0.065
Neg logit contributions
he-0.072
 around-0.068
leaf-0.068
ong-0.067
taker-0.067
Token very
Token ID845
Feature activation-2.80e-02

Pos logit contributions
 bidder+0.132
c+0.122
 coll+0.121
 god+0.118
 someday+0.117
Neg logit contributions
 outside-0.122
ong-0.110
 around-0.109
vell-0.109
 everywhere-0.105
Token dark
Token ID3223
Feature activation-2.12e-02

Pos logit contributions
c+0.042
 someday+0.041
 swe+0.040
 coll+0.039
 spe+0.038
Neg logit contributions
leaf-0.039
vell-0.039
taker-0.038
he-0.037
ibrarian-0.035
Token.
Token ID13
Feature activation+6.55e-04

Pos logit contributions
 swe+0.032
 cheaper+0.031
 someday+0.030
c+0.029
 spe+0.029
Neg logit contributions
leaf-0.032
vell-0.030
ibrarian-0.030
he-0.030
taker-0.030
Token The
Token ID383
Feature activation+1.85e-02

Pos logit contributions
leaf+0.001
ibrarian+0.001
vell+0.001
taker+0.001
asc+0.001
Neg logit contributions
 swe-0.001
 someday-0.001
 cheaper-0.001
 spe-0.001
 smaller-0.001
Token little
Token ID1310
Feature activation-2.43e-02

Pos logit contributions
leaf+0.034
ibrarian+0.033
vell+0.031
asc+0.031
iders+0.030
Neg logit contributions
 swe-0.025
 someday-0.024
 spe-0.024
 smaller-0.023
 cheaper-0.023
Token,
Token ID11
Feature activation+7.35e-03

Pos logit contributions
 swe+0.005
 someday+0.005
 spe+0.005
 cheaper+0.005
c+0.005
Neg logit contributions
leaf-0.009
ibrarian-0.009
vell-0.008
taker-0.008
ibr-0.008
Token "
Token ID366
Feature activation+2.09e-04

Pos logit contributions
leaf+0.013
ibrarian+0.012
vell+0.012
taker+0.011
asc+0.011
Neg logit contributions
 swe-0.010
 someday-0.010
 spe-0.009
 cheaper-0.009
 smaller-0.009
TokenThat
Token ID2504
Feature activation-1.55e-03

Pos logit contributions
leaf+0.000
ibrarian+0.000
vell+0.000
taker+0.000
asc+0.000
Neg logit contributions
 swe-0.000
 someday-0.000
 cheaper-0.000
 spe-0.000
 smaller-0.000
Token's
Token ID338
Feature activation-4.83e-03

Pos logit contributions
 swe+0.002
 someday+0.002
 spe+0.002
 cheaper+0.002
 smaller+0.001
Neg logit contributions
leaf-0.003
ibrarian-0.003
vell-0.003
taker-0.003
asc-0.003
Token what
Token ID644
Feature activation+1.64e-02

Pos logit contributions
 swe+0.007
 someday+0.006
 spe+0.006
 cheaper+0.006
c+0.006
Neg logit contributions
leaf-0.008
ibrarian-0.008
vell-0.008
taker-0.007
asc-0.007
Token friends
Token ID2460
Feature activation+6.81e-02

Pos logit contributions
leaf+0.027
ibrarian+0.026
vell+0.025
taker+0.024
he+0.024
Neg logit contributions
 swe-0.024
 someday-0.024
 cheaper-0.023
 smaller-0.022
 spe-0.022
Token are
Token ID389
Feature activation+2.09e-02

Pos logit contributions
ibrarian+0.137
opher+0.133
iders+0.131
leaf+0.131
taker+0.130
Neg logit contributions
 someday-0.075
 cheaper-0.062
 enough-0.061
 swe-0.060
 better-0.060
Token for
Token ID329
Feature activation+3.98e-02

Pos logit contributions
leaf+0.035
ibrarian+0.035
vell+0.033
he+0.033
iders+0.032
Neg logit contributions
 swe-0.030
 someday-0.030
 cheaper-0.029
 enough-0.028
 smaller-0.026
Token!"
Token ID2474
Feature activation+2.57e-02

Pos logit contributions
leaf+0.077
ibrarian+0.073
vell+0.073
taker+0.071
asc+0.067
Neg logit contributions
 someday-0.054
 swe-0.049
 enough-0.047
 cheaper-0.046
 One-0.045
Token<|endoftext|>
Token ID50256
Feature activation-2.72e-03

Pos logit contributions
leaf+0.060
vell+0.059
ibrarian+0.058
asc+0.056
iders+0.055
Neg logit contributions
 someday-0.021
 One-0.019
 swe-0.019
c-0.016
 again-0.015
TokenOnce
Token ID7454
Feature activation-7.37e-03

Pos logit contributions
 swe+0.003
 someday+0.003
 cheaper+0.003
 spe+0.003
 smaller+0.003
Neg logit contributions
leaf-0.005
ibrarian-0.005
vell-0.005
taker-0.005
he-0.005
Token she
Token ID673
Feature activation+2.98e-02

Pos logit contributions
leaf+0.016
ibrarian+0.016
vell+0.015
taker+0.015
asc+0.015
Neg logit contributions
 someday-0.009
 swe-0.009
 cheaper-0.008
 spe-0.008
 smaller-0.008
Token hurried
Token ID31662
Feature activation+3.38e-02

Pos logit contributions
leaf+0.057
ibrarian+0.054
he+0.053
vell+0.053
taker+0.052
Neg logit contributions
 swe-0.037
 someday-0.034
 spe-0.033
 smaller-0.032
 cheaper-0.032
Tokenly
Token ID306
Feature activation+2.83e-02

Pos logit contributions
leaf+0.064
ibrarian+0.064
vell+0.064
affles+0.063
iders+0.061
Neg logit contributions
 someday-0.041
 enough-0.040
 again-0.039
 swe-0.038
 One-0.037
Token ran
Token ID4966
Feature activation+2.91e-02

Pos logit contributions
leaf+0.057
vell+0.056
ibrarian+0.056
taker+0.054
iders+0.054
Neg logit contributions
 enough-0.030
 swe-0.029
 someday-0.029
 smaller-0.027
c-0.027
Token over
Token ID625
Feature activation+1.62e-03

Pos logit contributions
ibrarian+0.054
leaf+0.054
vell+0.052
ibr+0.051
affles+0.051
Neg logit contributions
 someday-0.038
 enough-0.036
 swe-0.035
 One-0.034
 longer-0.034
Token and
Token ID290
Feature activation+4.74e-02

Pos logit contributions
leaf+0.003
ibrarian+0.003
vell+0.003
taker+0.003
asc+0.003
Neg logit contributions
 swe-0.002
 someday-0.002
 cheaper-0.002
 spe-0.002
 smaller-0.001
Token met
Token ID1138
Feature activation+4.25e-02

Pos logit contributions
ibrarian+0.099
affles+0.097
ates+0.097
opher+0.096
iders+0.095
Neg logit contributions
 swe-0.050
 smaller-0.046
 someday-0.045
 again-0.044
c-0.044
Token the
Token ID262
Feature activation+1.21e-02

Pos logit contributions
leaf+0.096
iders+0.093
ower+0.093
vell+0.093
affles+0.091
Neg logit contributions
 someday-0.041
 swe-0.039
 enough-0.038
 One-0.037
 smaller-0.037
Token little
Token ID1310
Feature activation-2.60e-02

Pos logit contributions
leaf+0.024
ibrarian+0.023
vell+0.022
taker+0.022
asc+0.021
Neg logit contributions
 swe-0.015
 someday-0.014
 smaller-0.014
 cheaper-0.013
 One-0.013
Token rabbit
Token ID22746
Feature activation-1.23e-02

Pos logit contributions
 cheaper+0.042
 someday+0.041
 swe+0.041
 spe+0.040
c+0.038
Neg logit contributions
leaf-0.033
ibrarian-0.031
ting-0.031
 ogre-0.031
 goodness-0.031
Token.
Token ID13
Feature activation-1.19e-03

Pos logit contributions
 swe+0.017
 someday+0.017
 spe+0.016
 cheaper+0.016
 smaller+0.015
Neg logit contributions
leaf-0.020
ibrarian-0.020
vell-0.019
taker-0.018
 character-0.018
Token He
Token ID679
Feature activation-1.06e-02

Pos logit contributions
leaf+0.041
vell+0.041
ibrarian+0.039
asc+0.037
iders+0.037
Neg logit contributions
 swe-0.023
 One-0.023
 someday-0.022
 cheaper-0.022
 smaller-0.020
Token was
Token ID373
Feature activation+1.09e-02

Pos logit contributions
 swe+0.016
 someday+0.016
 spe+0.015
 cheaper+0.015
c+0.014
Neg logit contributions
leaf-0.017
ibrarian-0.016
vell-0.015
taker-0.015
 character-0.014
Token carrying
Token ID6872
Feature activation+2.71e-02

Pos logit contributions
leaf+0.018
ibrarian+0.017
vell+0.017
iders+0.016
taker+0.016
Neg logit contributions
 swe-0.016
 someday-0.015
 cheaper-0.015
 smaller-0.015
 spe-0.015
Token a
Token ID257
Feature activation+1.33e-02

Pos logit contributions
leaf+0.058
ibrarian+0.057
vell+0.056
asc+0.055
iders+0.054
Neg logit contributions
 swe-0.029
 smaller-0.026
 cheaper-0.026
 enough-0.026
 someday-0.025
Token bou
Token ID35833
Feature activation+3.04e-02

Pos logit contributions
leaf+0.024
ibrarian+0.024
vell+0.023
taker+0.022
asc+0.022
Neg logit contributions
 swe-0.018
 cheaper-0.017
 someday-0.016
 spe-0.016
 smaller-0.016
Tokenquet
Token ID21108
Feature activation+5.32e-02

Pos logit contributions
leaf+0.045
ibrarian+0.044
taker+0.042
asc+0.042
opher+0.041
Neg logit contributions
 swe-0.048
 someday-0.046
c-0.044
 spe-0.044
 cheaper-0.044
Token of
Token ID286
Feature activation+3.12e-02

Pos logit contributions
leaf+0.102
affles+0.098
asc+0.097
ibrarian+0.095
vell+0.094
Neg logit contributions
 someday-0.069
 enough-0.068
 cheaper-0.062
 again-0.061
 spe-0.061
Token wild
Token ID4295
Feature activation+1.89e-02

Pos logit contributions
ibrarian+0.061
vell+0.059
iders+0.059
leaf+0.057
taker+0.057
Neg logit contributions
 swe-0.042
 cheaper-0.036
 spe-0.036
 someday-0.035
 smaller-0.034
Tokenfl
Token ID2704
Feature activation+2.26e-02

Pos logit contributions
ibrarian+0.032
leaf+0.031
vell+0.031
taker+0.030
he+0.029
Neg logit contributions
 swe-0.028
 someday-0.027
 spe-0.026
c-0.025
 cheaper-0.025
Tokenowers
Token ID3618
Feature activation+2.65e-02

Pos logit contributions
leaf+0.029
ibrarian+0.029
taker+0.027
vell+0.026
 character+0.024
Neg logit contributions
 swe-0.041
c-0.040
 someday-0.039
 spe-0.039
 enough-0.037
Token he
Token ID339
Feature activation+1.87e-02

Pos logit contributions
leaf+0.051
ibrarian+0.051
vell+0.048
asc+0.047
taker+0.047
Neg logit contributions
 someday-0.032
 swe-0.031
 enough-0.030
 cheaper-0.029
 spe-0.029
Token She
Token ID1375
Feature activation+8.79e-03

Pos logit contributions
leaf+0.072
vell+0.072
ibrarian+0.069
coon+0.065
taker+0.065
Neg logit contributions
 One-0.033
 swe-0.030
 someday-0.028
 smaller-0.025
 Sometimes-0.025
Token ran
Token ID4966
Feature activation+2.10e-02

Pos logit contributions
leaf+0.016
ibrarian+0.016
vell+0.016
taker+0.015
asc+0.015
Neg logit contributions
 swe-0.011
 someday-0.011
 spe-0.010
 cheaper-0.010
 smaller-0.010
Token to
Token ID284
Feature activation+2.91e-02

Pos logit contributions
vell+0.049
leaf+0.048
ibrarian+0.048
iders+0.047
ibr+0.046
Neg logit contributions
 swe-0.016
 someday-0.016
 enough-0.016
 cheaper-0.016
 smaller-0.014
Token the
Token ID262
Feature activation+3.95e-03

Pos logit contributions
vell+0.075
ibrarian+0.071
iders+0.071
taker+0.071
ower+0.070
Neg logit contributions
 swe-0.020
 One-0.019
 smaller-0.019
 someday-0.018
 cheaper-0.016
Token kitchen
Token ID9592
Feature activation+2.01e-02

Pos logit contributions
leaf+0.007
ibrarian+0.007
vell+0.007
taker+0.007
asc+0.007
Neg logit contributions
 swe-0.005
 someday-0.005
 cheaper-0.005
 spe-0.005
 smaller-0.004
Token and
Token ID290
Feature activation+5.09e-02

Pos logit contributions
leaf+0.045
vell+0.042
ibrarian+0.042
taker+0.041
asc+0.040
Neg logit contributions
 swe-0.020
 someday-0.019
 cheaper-0.018
 enough-0.018
 smaller-0.017
Token opened
Token ID4721
Feature activation-1.43e-02

Pos logit contributions
agging+0.122
vell+0.120
opher+0.120
iders+0.116
ickets+0.115
Neg logit contributions
 smaller-0.038
 swe-0.036
 One-0.033
 cheaper-0.033
 spe-0.033
Token the
Token ID262
Feature activation-5.26e-03

Pos logit contributions
 someday+0.019
 swe+0.019
 spe+0.018
 cheaper+0.018
c+0.017
Neg logit contributions
leaf-0.024
ibrarian-0.023
vell-0.022
he-0.022
taker-0.022
Token box
Token ID3091
Feature activation+5.03e-03

Pos logit contributions
 someday+0.009
 swe+0.009
 spe+0.009
 cheaper+0.008
c+0.008
Neg logit contributions
leaf-0.007
ibrarian-0.007
vell-0.007
taker-0.007
he-0.006
Token.
Token ID13
Feature activation+4.42e-02

Pos logit contributions
leaf+0.010
ibrarian+0.009
vell+0.009
taker+0.009
asc+0.008
Neg logit contributions
 swe-0.006
 someday-0.006
 cheaper-0.006
 spe-0.006
 smaller-0.006
Token Inside
Token ID14384
Feature activation-2.27e-02

Pos logit contributions
vell+0.095
leaf+0.092
ibrarian+0.089
coon+0.088
taker+0.085
Neg logit contributions
 One-0.056
 swe-0.043
 Sometimes-0.042
 someday-0.041
 smaller-0.041
TokenCan
Token ID6090
Feature activation-1.87e-02

Pos logit contributions
 swe+0.005
 someday+0.005
 spe+0.005
 cheaper+0.005
 smaller+0.004
Neg logit contributions
leaf-0.006
ibrarian-0.006
vell-0.006
taker-0.006
he-0.006
Token I
Token ID314
Feature activation-7.96e-03

Pos logit contributions
 swe+0.035
 spe+0.034
 cheaper+0.033
c+0.033
 someday+0.033
Neg logit contributions
ibrarian-0.022
leaf-0.021
vell-0.019
taker-0.019
he-0.018
Token go
Token ID467
Feature activation-2.71e-02

Pos logit contributions
 swe+0.011
 someday+0.011
 spe+0.010
 cheaper+0.010
 shin+0.010
Neg logit contributions
leaf-0.014
ibrarian-0.013
vell-0.013
he-0.012
taker-0.012
Token outside
Token ID2354
Feature activation-2.98e-02

Pos logit contributions
 swe+0.047
 spe+0.046
 shin+0.044
 cheaper+0.043
 someday+0.042
Neg logit contributions
leaf-0.037
 outside-0.033
ibrarian-0.033
he-0.032
vell-0.031
Token and
Token ID290
Feature activation+2.21e-02

Pos logit contributions
 swe+0.045
 spe+0.042
 cheaper+0.041
 shin+0.041
 someday+0.039
Neg logit contributions
leaf-0.046
he-0.045
ibrarian-0.044
vell-0.042
taker-0.041
Token show
Token ID905
Feature activation+4.56e-02

Pos logit contributions
ibrarian+0.043
leaf+0.043
vell+0.041
ibr+0.039
taker+0.039
Neg logit contributions
 someday-0.027
 swe-0.027
 smaller-0.025
 spe-0.024
 cheaper-0.024
Token the
Token ID262
Feature activation+7.58e-03

Pos logit contributions
vell+0.089
iders+0.088
ower+0.086
leaf+0.086
asc+0.085
Neg logit contributions
 swe-0.058
 someday-0.054
 enough-0.053
 smaller-0.049
 cheaper-0.049
Token wallet
Token ID13008
Feature activation+8.18e-03

Pos logit contributions
leaf+0.013
ibrarian+0.013
vell+0.012
taker+0.012
iders+0.012
Neg logit contributions
 swe-0.011
 someday-0.010
 cheaper-0.010
 spe-0.010
 smaller-0.010
Token to
Token ID284
Feature activation+3.37e-02

Pos logit contributions
leaf+0.015
ibrarian+0.014
vell+0.014
taker+0.013
ibr+0.013
Neg logit contributions
 swe-0.011
 someday-0.011
 cheaper-0.010
 smaller-0.010
 spe-0.010
Token my
Token ID616
Feature activation+2.18e-02

Pos logit contributions
leaf+0.064
vell+0.062
ibrarian+0.060
taker+0.059
asc+0.058
Neg logit contributions
 someday-0.045
 swe-0.045
 smaller-0.041
 cheaper-0.040
 One-0.039
Token friends
Token ID2460
Feature activation+1.57e-02

Pos logit contributions
leaf+0.036
ibrarian+0.034
vell+0.033
taker+0.032
iders+0.032
Neg logit contributions
 swe-0.033
 someday-0.032
 cheaper-0.031
 smaller-0.030
 spe-0.030
Token her
Token ID607
Feature activation+5.16e-03

Pos logit contributions
 swe+0.024
 someday+0.023
 cheaper+0.023
 spe+0.022
 shin+0.022
Neg logit contributions
ibrarian-0.021
leaf-0.021
he-0.020
taker-0.019
vell-0.019
Token hand
Token ID1021
Feature activation+2.23e-03

Pos logit contributions
leaf+0.008
ibrarian+0.007
vell+0.007
taker+0.007
asc+0.007
Neg logit contributions
 swe-0.009
 someday-0.008
 spe-0.008
 cheaper-0.008
c-0.008
Token,
Token ID11
Feature activation-5.83e-03

Pos logit contributions
leaf+0.005
ibrarian+0.005
vell+0.004
taker+0.004
asc+0.004
Neg logit contributions
 swe-0.002
 someday-0.002
 cheaper-0.002
 spe-0.002
 smaller-0.002
Token she
Token ID673
Feature activation+2.90e-03

Pos logit contributions
 swe+0.008
 someday+0.008
 cheaper+0.007
 spe+0.007
c+0.007
Neg logit contributions
leaf-0.010
ibrarian-0.010
vell-0.010
taker-0.009
he-0.009
Token slowly
Token ID6364
Feature activation-2.65e-02

Pos logit contributions
leaf+0.005
ibrarian+0.005
vell+0.005
taker+0.005
asc+0.005
Neg logit contributions
 swe-0.004
 someday-0.004
 spe-0.004
 cheaper-0.004
 smaller-0.003
Token made
Token ID925
Feature activation+1.31e-02

Pos logit contributions
 swe+0.041
 someday+0.040
 cheaper+0.039
 spe+0.038
 One+0.036
Neg logit contributions
leaf-0.040
ibrarian-0.039
vell-0.036
he-0.035
taker-0.035
Token her
Token ID607
Feature activation+9.20e-04

Pos logit contributions
leaf+0.024
ibrarian+0.023
vell+0.023
asc+0.021
iders+0.021
Neg logit contributions
 swe-0.017
 someday-0.016
 smaller-0.016
 cheaper-0.016
 enough-0.015
Token way
Token ID835
Feature activation+1.60e-02

Pos logit contributions
leaf+0.002
ibrarian+0.001
vell+0.001
taker+0.001
asc+0.001
Neg logit contributions
 swe-0.001
 someday-0.001
 cheaper-0.001
 spe-0.001
 smaller-0.001
Token up
Token ID510
Feature activation-4.12e-02

Pos logit contributions
leaf+0.029
ibrarian+0.029
vell+0.027
ibr+0.027
 character+0.027
Neg logit contributions
 swe-0.021
 someday-0.021
 smaller-0.020
 cheaper-0.020
 enough-0.019
Token.
Token ID13
Feature activation+2.13e-02

Pos logit contributions
 swe+0.069
 cheaper+0.065
 spe+0.063
 someday+0.063
 shin+0.062
Neg logit contributions
 outside-0.053
he-0.053
 around-0.052
 behind-0.051
leaf-0.051
Token
Token ID198
Feature activation+2.36e-02

Pos logit contributions
leaf+0.044
vell+0.044
ibrarian+0.043
asc+0.041
iders+0.040
Neg logit contributions
 One-0.023
 swe-0.021
 someday-0.021
c-0.019
 spe-0.019
Token had
Token ID550
Feature activation-3.75e-02

Pos logit contributions
 swe+0.054
 someday+0.052
 cheaper+0.051
 smaller+0.051
 shin+0.050
Neg logit contributions
ibrarian-0.039
leaf-0.039
 character-0.035
he-0.035
ils-0.034
Token a
Token ID257
Feature activation-2.86e-02

Pos logit contributions
 someday+0.065
 swe+0.062
c+0.058
 shin+0.058
 cheaper+0.058
Neg logit contributions
leaf-0.051
ibrarian-0.045
 character-0.042
 outside-0.042
 ear-0.041
Token big
Token ID1263
Feature activation-2.63e-02

Pos logit contributions
 someday+0.043
 swe+0.040
c+0.038
 shin+0.038
 enough+0.037
Neg logit contributions
leaf-0.045
ibrarian-0.042
 character-0.042
vell-0.041
 ear-0.039
Token red
Token ID2266
Feature activation-2.30e-02

Pos logit contributions
 someday+0.034
 swe+0.033
 shin+0.031
 cheaper+0.031
c+0.030
Neg logit contributions
leaf-0.044
ibrarian-0.043
 character-0.042
he-0.042
 ear-0.041
Token balloon
Token ID21190
Feature activation+1.59e-02

Pos logit contributions
 swe+0.035
 someday+0.034
 cheaper+0.033
 spe+0.033
 shin+0.032
Neg logit contributions
leaf-0.035
ibrarian-0.033
he-0.032
 character-0.031
vell-0.031
Token that
Token ID326
Feature activation+1.65e-02

Pos logit contributions
leaf+0.026
ibrarian+0.024
vell+0.024
taker+0.022
ibr+0.022
Neg logit contributions
 someday-0.025
 swe-0.024
 spe-0.023
 cheaper-0.023
c-0.023
Token she
Token ID673
Feature activation+4.90e-03

Pos logit contributions
leaf+0.034
ibrarian+0.032
vell+0.032
asc+0.030
opher+0.030
Neg logit contributions
 swe-0.018
 someday-0.018
 spe-0.017
 cheaper-0.016
c-0.016
Token liked
Token ID8288
Feature activation+3.87e-03

Pos logit contributions
leaf+0.006
ibrarian+0.006
vell+0.006
taker+0.006
asc+0.005
Neg logit contributions
 swe-0.009
 someday-0.009
 spe-0.008
 cheaper-0.008
c-0.008
Token to
Token ID284
Feature activation-2.45e-04

Pos logit contributions
leaf+0.008
ibrarian+0.008
vell+0.008
taker+0.007
asc+0.007
Neg logit contributions
 swe-0.004
 someday-0.004
 spe-0.004
 cheaper-0.004
 smaller-0.003
Token play
Token ID711
Feature activation+2.32e-02

Pos logit contributions
 swe+0.000
 someday+0.000
 cheaper+0.000
 spe+0.000
 smaller+0.000
Neg logit contributions
leaf-0.000
ibrarian-0.000
vell-0.000
taker-0.000
he-0.000
Token with
Token ID351
Feature activation+3.65e-02

Pos logit contributions
leaf+0.040
ibrarian+0.038
vell+0.037
taker+0.036
opher+0.036
Neg logit contributions
 someday-0.035
 swe-0.033
 One-0.031
 longer-0.030
 cheaper-0.030
Token why
Token ID1521
Feature activation+5.63e-03

Pos logit contributions
leaf+0.006
ibrarian+0.006
vell+0.005
taker+0.005
asc+0.005
Neg logit contributions
 swe-0.006
 someday-0.006
 cheaper-0.005
 spe-0.005
 smaller-0.005
Token the
Token ID262
Feature activation+1.78e-02

Pos logit contributions
leaf+0.010
ibrarian+0.010
vell+0.010
taker+0.009
asc+0.009
Neg logit contributions
 swe-0.007
 someday-0.007
 cheaper-0.007
 spe-0.007
 smaller-0.006
Token bird
Token ID6512
Feature activation-1.49e-02

Pos logit contributions
leaf+0.031
ibrarian+0.030
vell+0.029
taker+0.028
asc+0.028
Neg logit contributions
 swe-0.025
 someday-0.024
 spe-0.023
 cheaper-0.023
 smaller-0.022
Token had
Token ID550
Feature activation+4.80e-03

Pos logit contributions
 swe+0.019
 someday+0.018
 cheaper+0.017
 spe+0.017
 smaller+0.017
Neg logit contributions
leaf-0.027
ibrarian-0.026
vell-0.026
taker-0.025
he-0.024
Token kept
Token ID4030
Feature activation+5.67e-03

Pos logit contributions
leaf+0.009
ibrarian+0.009
vell+0.009
taker+0.008
he+0.008
Neg logit contributions
 swe-0.006
 someday-0.006
 cheaper-0.005
 spe-0.005
 smaller-0.005
Token striking
Token ID8871
Feature activation+9.80e-03

Pos logit contributions
leaf+0.009
ibrarian+0.008
vell+0.008
taker+0.008
asc+0.008
Neg logit contributions
 swe-0.009
 someday-0.009
 cheaper-0.008
 spe-0.008
 smaller-0.008
Token the
Token ID262
Feature activation-4.28e-03

Pos logit contributions
leaf+0.018
ibrarian+0.018
vell+0.017
taker+0.016
asc+0.016
Neg logit contributions
 swe-0.013
 someday-0.012
 spe-0.011
 cheaper-0.011
 smaller-0.011
Token tree
Token ID5509
Feature activation-3.48e-02

Pos logit contributions
 swe+0.006
 someday+0.006
 cheaper+0.005
 spe+0.005
c+0.005
Neg logit contributions
leaf-0.008
ibrarian-0.007
vell-0.007
taker-0.007
he-0.007
Token.
Token ID13
Feature activation-5.96e-03

Pos logit contributions
 swe+0.051
 spe+0.049
 someday+0.048
 cheaper+0.048
 shin+0.046
Neg logit contributions
leaf-0.051
vell-0.051
ibrarian-0.050
taker-0.050
he-0.050
Token Looking
Token ID15616
Feature activation+6.94e-03

Pos logit contributions
 swe+0.008
 someday+0.007
 cheaper+0.007
 spe+0.007
 smaller+0.007
Neg logit contributions
leaf-0.011
ibrarian-0.011
vell-0.010
taker-0.010
he-0.010
Token up
Token ID510
Feature activation-1.57e-02

Pos logit contributions
leaf+0.008
ibrarian+0.008
vell+0.007
taker+0.007
asc+0.006
Neg logit contributions
 swe-0.014
 someday-0.014
 cheaper-0.013
 spe-0.013
 smaller-0.013
Token vegetables
Token ID13701
Feature activation-1.01e-02

Pos logit contributions
leaf+0.026
ibrarian+0.025
vell+0.024
taker+0.023
asc+0.022
Neg logit contributions
 swe-0.022
 someday-0.020
 cheaper-0.020
 smaller-0.020
 spe-0.019
Token,
Token ID11
Feature activation-1.65e-02

Pos logit contributions
 swe+0.013
 someday+0.013
 spe+0.013
 cheaper+0.012
c+0.012
Neg logit contributions
leaf-0.017
ibrarian-0.017
vell-0.017
taker-0.016
he-0.016
Token we
Token ID356
Feature activation+1.21e-02

Pos logit contributions
 swe+0.023
 someday+0.023
 spe+0.022
c+0.022
 cheaper+0.022
Neg logit contributions
leaf-0.026
ibrarian-0.025
vell-0.025
taker-0.024
he-0.023
Token can
Token ID460
Feature activation-1.25e-02

Pos logit contributions
leaf+0.027
ibrarian+0.026
vell+0.026
taker+0.025
iders+0.025
Neg logit contributions
 someday-0.010
 swe-0.010
 spe-0.010
 cheaper-0.009
 smaller-0.009
Token have
Token ID423
Feature activation-3.65e-02

Pos logit contributions
 swe+0.019
 spe+0.018
 someday+0.017
 cheaper+0.017
c+0.017
Neg logit contributions
leaf-0.020
ibrarian-0.020
he-0.019
vell-0.019
taker-0.018
Token some
Token ID617
Feature activation+8.96e-04

Pos logit contributions
c+0.052
 shin+0.051
 spe+0.050
 swe+0.049
 someday+0.049
Neg logit contributions
leaf-0.053
 outside-0.049
ibrarian-0.048
 goodness-0.047
 supposed-0.045
Token dessert
Token ID23084
Feature activation+6.73e-03

Pos logit contributions
leaf+0.002
ibrarian+0.002
vell+0.001
taker+0.001
asc+0.001
Neg logit contributions
 swe-0.001
 someday-0.001
 cheaper-0.001
 spe-0.001
 smaller-0.001
Token.
Token ID13
Feature activation-8.12e-03

Pos logit contributions
leaf+0.013
ibrarian+0.013
vell+0.012
taker+0.012
asc+0.012
Neg logit contributions
 swe-0.008
 someday-0.008
 cheaper-0.007
 spe-0.007
 smaller-0.007
Token How
Token ID1374
Feature activation-1.54e-02

Pos logit contributions
 swe+0.013
 someday+0.012
 cheaper+0.012
 spe+0.012
 smaller+0.012
Neg logit contributions
leaf-0.013
ibrarian-0.012
vell-0.011
taker-0.011
ibr-0.011
Token does
Token ID857
Feature activation-2.87e-02

Pos logit contributions
 swe+0.031
 spe+0.030
 someday+0.029
 cheaper+0.029
c+0.029
Neg logit contributions
leaf-0.017
ibrarian-0.017
vell-0.015
he-0.014
taker-0.014
Token that
Token ID326
Feature activation-2.29e-02

Pos logit contributions
 swe+0.051
c+0.050
 someday+0.049
 shin+0.048
 spe+0.048
Neg logit contributions
ibrarian-0.035
leaf-0.034
 character-0.032
taker-0.032
 maker-0.030
Token spoil
Token ID20851
Feature activation+2.01e-02

Pos logit contributions
leaf+0.062
ibrarian+0.061
vell+0.058
ibr+0.058
taker+0.058
Neg logit contributions
 someday-0.034
 swe-0.032
 cheaper-0.031
 spe-0.030
 smaller-0.030
Token the
Token ID262
Feature activation+1.82e-02

Pos logit contributions
leaf+0.041
ibrarian+0.040
vell+0.039
taker+0.038
asc+0.037
Neg logit contributions
 swe-0.024
 someday-0.022
 cheaper-0.021
 smaller-0.021
 enough-0.020
Token nature
Token ID3450
Feature activation-1.99e-05

Pos logit contributions
ibrarian+0.034
leaf+0.033
vell+0.031
taker+0.031
ibr+0.031
Neg logit contributions
 swe-0.025
 someday-0.023
 cheaper-0.022
 spe-0.022
 enough-0.021
Token around
Token ID1088
Feature activation+1.61e-02

Pos logit contributions
 swe+0.000
 someday+0.000
 cheaper+0.000
 spe+0.000
 smaller+0.000
Neg logit contributions
leaf-0.000
ibrarian-0.000
vell-0.000
taker-0.000
he-0.000
Token here
Token ID994
Feature activation-4.03e-03

Pos logit contributions
leaf+0.030
ibrarian+0.028
vell+0.028
taker+0.026
ibr+0.026
Neg logit contributions
 someday-0.021
 swe-0.021
 spe-0.019
 cheaper-0.019
c-0.019
Token?".
Token ID43634
Feature activation+2.99e-03

Pos logit contributions
 swe+0.006
 someday+0.006
 spe+0.005
 cheaper+0.005
 smaller+0.005
Neg logit contributions
leaf-0.007
ibrarian-0.007
vell-0.006
taker-0.006
he-0.006
Token Molly
Token ID30236
Feature activation+3.88e-03

Pos logit contributions
leaf+0.004
ibrarian+0.004
vell+0.004
taker+0.004
asc+0.004
Neg logit contributions
 swe-0.005
 someday-0.005
 cheaper-0.005
 spe-0.005
 smaller-0.004
Token was
Token ID373
Feature activation+2.30e-02

Pos logit contributions
leaf+0.007
ibrarian+0.007
vell+0.007
taker+0.007
asc+0.006
Neg logit contributions
 swe-0.005
 someday-0.005
 spe-0.004
 cheaper-0.004
 smaller-0.004
Token deeply
Token ID7744
Feature activation+3.93e-03

Pos logit contributions
leaf+0.042
vell+0.040
ibrarian+0.038
iders+0.037
affles+0.037
Neg logit contributions
 smaller-0.030
 swe-0.029
 someday-0.029
 spe-0.028
 cheaper-0.028
Token sorry
Token ID7926
Feature activation+3.69e-03

Pos logit contributions
leaf+0.006
ibrarian+0.006
vell+0.006
taker+0.006
asc+0.005
Neg logit contributions
 swe-0.006
 someday-0.006
 cheaper-0.006
 spe-0.006
 smaller-0.005
Token and
Token ID290
Feature activation-4.24e-03

Pos logit contributions
leaf+0.007
ibrarian+0.007
vell+0.007
taker+0.006
asc+0.006
Neg logit contributions
 swe-0.004
 someday-0.004
 cheaper-0.004
 spe-0.004
 smaller-0.004
Token a
Token ID257
Feature activation-8.37e-03

Pos logit contributions
 swe+0.003
 someday+0.003
 cheaper+0.003
 spe+0.003
 smaller+0.002
Neg logit contributions
leaf-0.004
ibrarian-0.004
vell-0.004
taker-0.004
he-0.004
Token muscle
Token ID8280
Feature activation+6.89e-03

Pos logit contributions
 someday+0.012
 swe+0.012
 spe+0.011
 cheaper+0.011
c+0.011
Neg logit contributions
leaf-0.014
ibrarian-0.014
vell-0.013
taker-0.013
 character-0.012
Token that
Token ID326
Feature activation+2.64e-02

Pos logit contributions
leaf+0.012
ibrarian+0.011
vell+0.011
taker+0.010
asc+0.010
Neg logit contributions
 swe-0.010
 someday-0.010
 spe-0.009
 cheaper-0.009
 smaller-0.009
Token was
Token ID373
Feature activation+4.97e-03

Pos logit contributions
leaf+0.054
vell+0.052
ibrarian+0.052
asc+0.049
opher+0.049
Neg logit contributions
 swe-0.030
 someday-0.027
 spe-0.027
c-0.027
 cheaper-0.026
Token sore
Token ID19597
Feature activation+3.51e-02

Pos logit contributions
leaf+0.009
ibrarian+0.009
vell+0.008
taker+0.008
he+0.008
Neg logit contributions
 swe-0.007
 someday-0.006
 cheaper-0.006
 spe-0.006
 smaller-0.006
Token.
Token ID13
Feature activation-1.51e-02

Pos logit contributions
leaf+0.081
ibrarian+0.074
vell+0.074
asc+0.071
opher+0.069
Neg logit contributions
 someday-0.036
 enough-0.035
 swe-0.032
 One-0.030
 less-0.029
Token It
Token ID632
Feature activation-8.74e-03

Pos logit contributions
 swe+0.019
 someday+0.019
 spe+0.019
 cheaper+0.018
 smaller+0.017
Neg logit contributions
leaf-0.026
ibrarian-0.026
taker-0.025
vell-0.024
he-0.024
Token made
Token ID925
Feature activation+1.40e-02

Pos logit contributions
 swe+0.011
 someday+0.011
 cheaper+0.010
 spe+0.010
 smaller+0.010
Neg logit contributions
leaf-0.016
ibrarian-0.015
vell-0.015
taker-0.015
 character-0.014
Token her
Token ID607
Feature activation+1.29e-03

Pos logit contributions
leaf+0.025
ibrarian+0.024
vell+0.024
asc+0.022
taker+0.022
Neg logit contributions
 swe-0.020
 someday-0.018
 cheaper-0.018
 smaller-0.018
 spe-0.017
Token feel
Token ID1254
Feature activation-5.37e-03

Pos logit contributions
leaf+0.002
ibrarian+0.002
vell+0.002
taker+0.002
asc+0.002
Neg logit contributions
 swe-0.002
 someday-0.002
 cheaper-0.002
 spe-0.002
 smaller-0.002
Token very
Token ID845
Feature activation-1.11e-03

Pos logit contributions
 swe+0.006
 someday+0.006
 spe+0.006
 cheaper+0.006
c+0.006
Neg logit contributions
leaf-0.010
ibrarian-0.010
vell-0.010
taker-0.010
he-0.009
Token but
Token ID475
Feature activation+3.95e-03

Pos logit contributions
 swe+0.017
 someday+0.017
 spe+0.016
 cheaper+0.016
c+0.016
Neg logit contributions
leaf-0.024
ibrarian-0.023
vell-0.022
taker-0.022
 character-0.021
Token also
Token ID635
Feature activation+4.38e-03

Pos logit contributions
leaf+0.007
ibrarian+0.007
vell+0.006
taker+0.006
asc+0.006
Neg logit contributions
 swe-0.005
 someday-0.005
 cheaper-0.005
 spe-0.005
 smaller-0.005
Token take
Token ID1011
Feature activation-5.19e-03

Pos logit contributions
leaf+0.008
ibrarian+0.007
vell+0.007
taker+0.007
he+0.007
Neg logit contributions
 swe-0.006
 someday-0.006
 cheaper-0.006
 spe-0.006
 smaller-0.005
Token care
Token ID1337
Feature activation+3.45e-02

Pos logit contributions
 swe+0.007
 someday+0.007
 spe+0.007
 cheaper+0.007
 smaller+0.006
Neg logit contributions
leaf-0.009
ibrarian-0.008
vell-0.008
taker-0.008
ibr-0.008
Token when
Token ID618
Feature activation-1.48e-02

Pos logit contributions
leaf+0.057
ibrarian+0.054
vell+0.053
asc+0.052
opher+0.050
Neg logit contributions
 someday-0.051
 swe-0.050
 cheaper-0.048
 enough-0.047
 smaller-0.047
Token exploring
Token ID13504
Feature activation-1.05e-02

Pos logit contributions
 swe+0.023
 someday+0.022
 spe+0.021
 cheaper+0.021
c+0.021
Neg logit contributions
leaf-0.022
ibrarian-0.021
vell-0.021
taker-0.020
 character-0.019
Token.
Token ID13
Feature activation-2.01e-02

Pos logit contributions
 swe+0.014
 someday+0.013
 spe+0.013
 cheaper+0.013
c+0.012
Neg logit contributions
leaf-0.018
ibrarian-0.018
vell-0.017
taker-0.017
he-0.016
Token<|endoftext|>
Token ID50256
Feature activation-1.79e-03

Pos logit contributions
 swe+0.030
 spe+0.028
 cheaper+0.027
 someday+0.027
 smaller+0.026
Neg logit contributions
leaf-0.033
ibrarian-0.033
taker-0.031
he-0.030
ibr-0.029
TokenOnce
Token ID7454
Feature activation-6.69e-03

Pos logit contributions
 swe+0.002
 someday+0.002
 cheaper+0.002
 spe+0.002
 smaller+0.002
Neg logit contributions
leaf-0.003
ibrarian-0.003
vell-0.003
taker-0.003
he-0.003
Token upon
Token ID2402
Feature activation-1.63e-03

Pos logit contributions
 swe+0.010
 someday+0.010
 spe+0.009
 cheaper+0.009
 shin+0.009
Neg logit contributions
leaf-0.011
ibrarian-0.010
vell-0.010
taker-0.010
he-0.010
Token a
Token ID257
Feature activation-7.19e-03

Pos logit contributions
 swe+0.002
 someday+0.002
 cheaper+0.002
 spe+0.002
 smaller+0.002
Neg logit contributions
leaf-0.003
ibrarian-0.003
vell-0.003
taker-0.003
he-0.003
Token nothing
Token ID2147
Feature activation-8.53e-03

Pos logit contributions
 swe+0.033
 spe+0.032
 someday+0.032
c+0.031
 cheaper+0.030
Neg logit contributions
leaf-0.038
ibrarian-0.036
vell-0.035
taker-0.034
 character-0.032
Token to
Token ID284
Feature activation-1.42e-02

Pos logit contributions
 swe+0.011
 someday+0.011
 spe+0.011
 cheaper+0.010
c+0.010
Neg logit contributions
leaf-0.015
ibrarian-0.014
vell-0.014
taker-0.014
he-0.013
Token give
Token ID1577
Feature activation+1.17e-03

Pos logit contributions
 swe+0.019
 spe+0.018
 someday+0.018
c+0.017
 cheaper+0.017
Neg logit contributions
leaf-0.024
ibrarian-0.023
vell-0.023
he-0.022
taker-0.022
Token.
Token ID13
Feature activation-3.27e-02

Pos logit contributions
leaf+0.002
ibrarian+0.002
vell+0.002
taker+0.002
asc+0.002
Neg logit contributions
 swe-0.002
 someday-0.002
 cheaper-0.002
 spe-0.002
 smaller-0.002
Token<|endoftext|>
Token ID50256
Feature activation-3.35e-03

Pos logit contributions
 swe+0.052
 spe+0.047
 cheaper+0.047
 smaller+0.045
 shin+0.045
Neg logit contributions
leaf-0.052
ibrarian-0.051
taker-0.047
 goodness-0.046
he-0.045
TokenOnce
Token ID7454
Feature activation-1.73e-02

Pos logit contributions
 swe+0.004
 someday+0.004
 cheaper+0.004
 spe+0.004
 smaller+0.003
Neg logit contributions
leaf-0.006
ibrarian-0.006
vell-0.006
taker-0.006
he-0.006
Token upon
Token ID2402
Feature activation-7.17e-03

Pos logit contributions
 swe+0.028
 spe+0.027
 someday+0.026
 cheaper+0.026
 shin+0.025
Neg logit contributions
ibrarian-0.025
leaf-0.025
taker-0.024
he-0.024
vell-0.024
Token a
Token ID257
Feature activation-1.22e-02

Pos logit contributions
 swe+0.010
 someday+0.009
 cheaper+0.009
 spe+0.009
c+0.008
Neg logit contributions
leaf-0.013
ibrarian-0.012
vell-0.012
taker-0.012
he-0.011
Token time
Token ID640
Feature activation-1.04e-02

Pos logit contributions
 swe+0.016
 someday+0.016
 cheaper+0.015
 spe+0.015
 smaller+0.015
Neg logit contributions
leaf-0.021
ibrarian-0.020
vell-0.019
he-0.019
taker-0.019
Token,
Token ID11
Feature activation-1.12e-02

Pos logit contributions
 swe+0.012
 someday+0.012
 cheaper+0.011
 spe+0.011
 shin+0.011
Neg logit contributions
leaf-0.019
ibrarian-0.019
vell-0.019
taker-0.018
he-0.018
Token there
Token ID612
Feature activation-3.38e-03

Pos logit contributions
 swe+0.016
 someday+0.015
 cheaper+0.015
 spe+0.015
 shin+0.014
Neg logit contributions
leaf-0.018
ibrarian-0.018
vell-0.017
taker-0.017
he-0.017
Token Jack
Token ID3619
Feature activation+6.31e-03

Pos logit contributions
leaf+0.004
ibrarian+0.004
vell+0.004
taker+0.004
asc+0.004
Neg logit contributions
 swe-0.003
 someday-0.003
 cheaper-0.003
 spe-0.003
 smaller-0.002
Token watched
Token ID7342
Feature activation+1.43e-02

Pos logit contributions
leaf+0.012
ibrarian+0.011
vell+0.011
taker+0.011
asc+0.010
Neg logit contributions
 swe-0.008
 someday-0.008
 spe-0.007
 cheaper-0.007
c-0.007
Token,
Token ID11
Feature activation+2.07e-02

Pos logit contributions
leaf+0.029
ibrarian+0.028
vell+0.028
asc+0.027
iders+0.026
Neg logit contributions
 someday-0.016
 swe-0.016
 smaller-0.014
 cheaper-0.014
 spe-0.014
Token his
Token ID465
Feature activation+3.06e-02

Pos logit contributions
leaf+0.044
ibrarian+0.044
vell+0.042
asc+0.042
iders+0.041
Neg logit contributions
 someday-0.021
 swe-0.020
 smaller-0.019
 enough-0.019
 cheaper-0.018
Token heart
Token ID2612
Feature activation-1.04e-02

Pos logit contributions
leaf+0.054
vell+0.051
he+0.050
ibrarian+0.050
asc+0.050
Neg logit contributions
 swe-0.043
 smaller-0.041
 someday-0.040
 spe-0.040
 cheaper-0.039
Token began
Token ID2540
Feature activation-2.72e-02

Pos logit contributions
 swe+0.014
 someday+0.014
 cheaper+0.013
 spe+0.013
 One+0.012
Neg logit contributions
leaf-0.018
ibrarian-0.017
vell-0.017
taker-0.017
he-0.016
Token to
Token ID284
Feature activation-2.70e-02

Pos logit contributions
 swe+0.035
 someday+0.035
 cheaper+0.033
 spe+0.033
c+0.032
Neg logit contributions
leaf-0.047
ibrarian-0.044
taker-0.043
vell-0.042
 character-0.041
Token beat
Token ID4405
Feature activation+1.67e-03

Pos logit contributions
 swe+0.040
 someday+0.039
 cheaper+0.038
 spe+0.037
c+0.037
Neg logit contributions
leaf-0.041
vell-0.038
ibrarian-0.038
taker-0.037
he-0.036
Token faster
Token ID5443
Feature activation-1.49e-03

Pos logit contributions
leaf+0.003
ibrarian+0.003
vell+0.003
taker+0.003
asc+0.002
Neg logit contributions
 swe-0.002
 someday-0.002
 cheaper-0.002
 spe-0.002
 smaller-0.002
Token and
Token ID290
Feature activation-2.25e-02

Pos logit contributions
 swe+0.002
 someday+0.002
 cheaper+0.002
 spe+0.002
 smaller+0.002
Neg logit contributions
leaf-0.003
ibrarian-0.002
vell-0.002
taker-0.002
he-0.002
Token he
Token ID339
Feature activation+5.69e-03

Pos logit contributions
 swe+0.022
 someday+0.022
c+0.022
 spe+0.021
 One+0.020
Neg logit contributions
leaf-0.045
ibrarian-0.042
vell-0.042
taker-0.041
 character-0.040
Token was
Token ID373
Feature activation+2.14e-02

Pos logit contributions
 swe+0.004
 someday+0.004
 cheaper+0.004
 spe+0.004
 smaller+0.004
Neg logit contributions
leaf-0.006
ibrarian-0.006
vell-0.005
taker-0.005
he-0.005
Token able
Token ID1498
Feature activation-1.16e-02

Pos logit contributions
leaf+0.037
ibrarian+0.035
vell+0.033
iders+0.033
taker+0.032
Neg logit contributions
 swe-0.031
 smaller-0.031
 cheaper-0.030
 enough-0.030
 someday-0.029
Token to
Token ID284
Feature activation-7.04e-03

Pos logit contributions
 someday+0.014
 swe+0.013
 spe+0.013
 cheaper+0.012
c+0.012
Neg logit contributions
leaf-0.022
ibrarian-0.021
vell-0.021
taker-0.020
opher-0.019
Token make
Token ID787
Feature activation-1.94e-02

Pos logit contributions
 swe+0.010
 someday+0.010
 cheaper+0.009
 spe+0.009
c+0.009
Neg logit contributions
leaf-0.012
ibrarian-0.011
vell-0.011
taker-0.011
he-0.011
Token a
Token ID257
Feature activation-2.91e-02

Pos logit contributions
 someday+0.029
 spe+0.027
 swe+0.027
 cheaper+0.026
 One+0.025
Neg logit contributions
leaf-0.031
ibrarian-0.029
vell-0.029
taker-0.028
 character-0.028
Token stitch
Token ID24695
Feature activation-2.11e-02

Pos logit contributions
 someday+0.038
 One+0.034
 swe+0.033
 spe+0.032
c+0.032
Neg logit contributions
leaf-0.050
vell-0.049
ibrarian-0.048
taker-0.047
 character-0.047
Token that
Token ID326
Feature activation+1.66e-02

Pos logit contributions
 swe+0.032
 someday+0.032
 spe+0.031
 cheaper+0.031
 One+0.030
Neg logit contributions
leaf-0.031
ibrarian-0.031
vell-0.030
taker-0.030
he-0.028
Token looked
Token ID3114
Feature activation-8.97e-03

Pos logit contributions
leaf+0.034
ibrarian+0.032
vell+0.031
asc+0.030
opher+0.030
Neg logit contributions
 swe-0.020
 someday-0.019
 spe-0.017
 cheaper-0.017
 smaller-0.017
Token normal
Token ID3487
Feature activation+3.59e-04

Pos logit contributions
 someday+0.015
 swe+0.015
 spe+0.014
 cheaper+0.014
c+0.014
Neg logit contributions
leaf-0.013
vell-0.012
ibrarian-0.012
taker-0.011
he-0.011
Token.
Token ID13
Feature activation+7.57e-03

Pos logit contributions
leaf+0.001
ibrarian+0.001
vell+0.001
taker+0.001
asc+0.001
Neg logit contributions
 swe-0.000
 someday-0.000
 cheaper-0.000
 spe-0.000
 smaller-0.000
Token After
Token ID2293
Feature activation-1.50e-02

Pos logit contributions
leaf+0.015
ibrarian+0.015
vell+0.015
taker+0.014
asc+0.014
Neg logit contributions
 swe-0.008
 someday-0.008
 cheaper-0.008
 One-0.007
 spe-0.007
Token,
Token ID11
Feature activation+3.13e-02

Pos logit contributions
leaf+0.043
vell+0.041
ibrarian+0.041
taker+0.040
asc+0.038
Neg logit contributions
 someday-0.013
 swe-0.013
 enough-0.012
 One-0.012
 cheaper-0.011
Token they
Token ID484
Feature activation+2.72e-03

Pos logit contributions
leaf+0.068
vell+0.068
ibrarian+0.066
taker+0.065
opher+0.062
Neg logit contributions
 someday-0.031
 enough-0.030
 swe-0.029
 One-0.028
 cheaper-0.028
Token decide
Token ID5409
Feature activation-2.76e-02

Pos logit contributions
leaf+0.005
ibrarian+0.005
vell+0.005
taker+0.004
asc+0.004
Neg logit contributions
 swe-0.004
 someday-0.004
 spe-0.003
 cheaper-0.003
 smaller-0.003
Token to
Token ID284
Feature activation-1.62e-02

Pos logit contributions
 spe+0.041
c+0.041
 swe+0.039
 shin+0.036
 cheaper+0.036
Neg logit contributions
leaf-0.045
ibrarian-0.041
vell-0.038
 outside-0.038
taker-0.037
Token go
Token ID467
Feature activation-3.40e-02

Pos logit contributions
 swe+0.023
 cheaper+0.021
 someday+0.021
 spe+0.021
 shin+0.020
Neg logit contributions
leaf-0.028
ibrarian-0.027
vell-0.026
he-0.025
taker-0.025
Token on
Token ID319
Feature activation-5.74e-02

Pos logit contributions
 spe+0.061
 swe+0.060
 cheaper+0.055
 shin+0.053
 someday+0.052
Neg logit contributions
 outside-0.047
leaf-0.047
 around-0.043
he-0.041
ibrarian-0.040
Token an
Token ID281
Feature activation-1.40e-02

Pos logit contributions
 swe+0.106
 spe+0.104
 shin+0.100
c+0.099
 cheaper+0.098
Neg logit contributions
 outside-0.066
leaf-0.062
 ages-0.056
he-0.055
 character-0.055
Token adventure
Token ID8855
Feature activation+3.38e-03

Pos logit contributions
 swe+0.021
 spe+0.020
 someday+0.020
 cheaper+0.019
c+0.019
Neg logit contributions
leaf-0.022
ibrarian-0.022
vell-0.020
taker-0.020
he-0.019
Token.
Token ID13
Feature activation+1.31e-02

Pos logit contributions
leaf+0.006
ibrarian+0.006
vell+0.006
taker+0.005
asc+0.005
Neg logit contributions
 swe-0.005
 someday-0.004
 cheaper-0.004
 spe-0.004
 smaller-0.004
Token They
Token ID1119
Feature activation-1.88e-03

Pos logit contributions
leaf+0.029
vell+0.028
ibrarian+0.028
taker+0.027
asc+0.026
Neg logit contributions
 swe-0.012
 someday-0.012
 One-0.011
 cheaper-0.011
 spe-0.010
Token put
Token ID1234
Feature activation-1.03e-03

Pos logit contributions
 swe+0.002
 someday+0.002
 cheaper+0.002
 spe+0.002
 smaller+0.002
Neg logit contributions
leaf-0.003
ibrarian-0.003
vell-0.003
taker-0.003
he-0.003
Token,
Token ID11
Feature activation-1.00e-02

Pos logit contributions
leaf+0.023
ibrarian+0.021
vell+0.020
taker+0.020
asc+0.019
Neg logit contributions
 swe-0.021
 someday-0.020
 cheaper-0.020
 smaller-0.019
 spe-0.018
Token but
Token ID475
Feature activation+1.57e-02

Pos logit contributions
 swe+0.014
 someday+0.014
 spe+0.014
 cheaper+0.013
c+0.013
Neg logit contributions
leaf-0.016
ibrarian-0.015
vell-0.015
taker-0.014
 character-0.014
Token mom
Token ID1995
Feature activation+1.79e-02

Pos logit contributions
leaf+0.032
vell+0.030
ibrarian+0.030
taker+0.028
ibr+0.028
Neg logit contributions
 swe-0.019
 someday-0.019
 cheaper-0.017
 enough-0.017
 smaller-0.017
Token said
Token ID531
Feature activation-3.22e-02

Pos logit contributions
leaf+0.035
vell+0.033
ibrarian+0.032
taker+0.032
opher+0.031
Neg logit contributions
 swe-0.022
 someday-0.021
 cheaper-0.020
 enough-0.019
c-0.019
Token they
Token ID484
Feature activation-4.44e-05

Pos logit contributions
c+0.044
 shin+0.042
 someday+0.042
 swe+0.042
 cheaper+0.041
Neg logit contributions
ibrarian-0.052
leaf-0.049
taker-0.048
 character-0.048
izards-0.046
Token had
Token ID550
Feature activation-5.05e-02

Pos logit contributions
 swe+0.000
 someday+0.000
 cheaper+0.000
 spe+0.000
 smaller+0.000
Neg logit contributions
leaf-0.000
ibrarian-0.000
vell-0.000
taker-0.000
he-0.000
Token to
Token ID284
Feature activation-1.41e-02

Pos logit contributions
c+0.080
 spe+0.078
 shin+0.077
 someday+0.077
 swe+0.077
Neg logit contributions
leaf-0.065
 supposed-0.061
ibrarian-0.057
 everywhere-0.056
 character-0.054
Token un
Token ID555
Feature activation-8.87e-03

Pos logit contributions
 swe+0.020
 someday+0.019
 spe+0.019
 cheaper+0.019
c+0.018
Neg logit contributions
leaf-0.023
ibrarian-0.022
vell-0.021
he-0.021
taker-0.021
Tokenpack
Token ID8002
Feature activation+2.66e-03

Pos logit contributions
 swe+0.020
 someday+0.020
 spe+0.020
 cheaper+0.019
 smaller+0.019
Neg logit contributions
leaf-0.007
ibrarian-0.007
vell-0.006
he-0.006
taker-0.006
Token first
Token ID717
Feature activation-1.88e-02

Pos logit contributions
leaf+0.005
ibrarian+0.005
vell+0.005
taker+0.005
asc+0.004
Neg logit contributions
 swe-0.003
 someday-0.003
 cheaper-0.003
 spe-0.003
 smaller-0.003
Token.
Token ID13
Feature activation-1.17e-02

Pos logit contributions
 swe+0.027
 spe+0.027
 someday+0.025
 cheaper+0.025
 shin+0.025
Neg logit contributions
ibrarian-0.029
leaf-0.029
vell-0.028
taker-0.027
he-0.027
Token learned
Token ID4499
Feature activation-1.95e-02

Pos logit contributions
 swe+0.003
 someday+0.003
 cheaper+0.003
 spe+0.003
 smaller+0.003
Neg logit contributions
leaf-0.004
ibrarian-0.004
vell-0.004
taker-0.003
he-0.003
Token that
Token ID326
Feature activation-8.05e-03

Pos logit contributions
 spe+0.034
c+0.034
 swe+0.033
 someday+0.033
 cheaper+0.031
Neg logit contributions
leaf-0.027
ibrarian-0.025
taker-0.023
vell-0.023
 character-0.021
Token sometimes
Token ID3360
Feature activation-2.49e-02

Pos logit contributions
 swe+0.013
 someday+0.012
 spe+0.012
 cheaper+0.012
c+0.011
Neg logit contributions
leaf-0.012
ibrarian-0.012
vell-0.011
taker-0.011
 character-0.011
Token,
Token ID11
Feature activation-3.50e-02

Pos logit contributions
 swe+0.035
c+0.034
 spe+0.033
 cheaper+0.032
 shin+0.032
Neg logit contributions
leaf-0.037
 character-0.037
ibrarian-0.036
taker-0.035
vell-0.035
Token wandering
Token ID24504
Feature activation-6.55e-03

Pos logit contributions
 swe+0.044
c+0.043
 spe+0.041
 cheaper+0.040
 One+0.040
Neg logit contributions
leaf-0.057
vell-0.055
ibrarian-0.053
izards-0.053
taker-0.053
Token can
Token ID460
Feature activation-6.23e-02

Pos logit contributions
 swe+0.008
 someday+0.008
 cheaper+0.008
 spe+0.008
 smaller+0.007
Neg logit contributions
leaf-0.012
ibrarian-0.012
vell-0.011
taker-0.011
he-0.010
Token lead
Token ID1085
Feature activation-5.62e-03

Pos logit contributions
 spe+0.100
c+0.094
 swe+0.094
 enough+0.090
 cheaper+0.087
Neg logit contributions
he-0.074
vell-0.072
ibrarian-0.071
taker-0.068
iders-0.066
Token to
Token ID284
Feature activation-5.63e-02

Pos logit contributions
 swe+0.007
 someday+0.006
 spe+0.006
 cheaper+0.006
 shin+0.006
Neg logit contributions
leaf-0.011
ibrarian-0.010
vell-0.010
taker-0.010
ibr-0.009
Token wonderful
Token ID7932
Feature activation-1.00e-03

Pos logit contributions
c+0.086
 spe+0.083
 swe+0.081
 cheaper+0.075
 shin+0.074
Neg logit contributions
 supposed-0.070
 ear-0.069
leaf-0.068
 outside-0.068
 adults-0.067
Token things
Token ID1243
Feature activation+6.80e-03

Pos logit contributions
 swe+0.001
 someday+0.001
 cheaper+0.001
 spe+0.001
 smaller+0.001
Neg logit contributions
leaf-0.002
ibrarian-0.002
vell-0.002
taker-0.002
asc-0.001
Token.
Token ID13
Feature activation-1.44e-02

Pos logit contributions
leaf+0.013
ibrarian+0.013
vell+0.012
taker+0.012
asc+0.012
Neg logit contributions
 swe-0.008
 someday-0.008
 cheaper-0.008
 spe-0.007
 smaller-0.007
Token's
Token ID338
Feature activation+1.77e-02

Pos logit contributions
leaf+0.027
ibrarian+0.026
vell+0.026
taker+0.025
opher+0.024
Neg logit contributions
 someday-0.017
 swe-0.017
 spe-0.016
c-0.015
 cheaper-0.015
Token wrong
Token ID2642
Feature activation-1.17e-02

Pos logit contributions
leaf+0.034
ibrarian+0.033
vell+0.032
taker+0.031
he+0.031
Neg logit contributions
 swe-0.022
 cheaper-0.020
 someday-0.020
 enough-0.020
 spe-0.020
Token?"
Token ID1701
Feature activation-8.82e-03

Pos logit contributions
 swe+0.016
 someday+0.015
 cheaper+0.015
 spe+0.015
 shin+0.014
Neg logit contributions
ibrarian-0.019
leaf-0.019
vell-0.019
taker-0.018
asc-0.018
Token The
Token ID383
Feature activation-2.39e-02

Pos logit contributions
 swe+0.011
 someday+0.011
 cheaper+0.010
 spe+0.010
 smaller+0.010
Neg logit contributions
leaf-0.017
ibrarian-0.016
taker-0.015
vell-0.015
he-0.015
Token bird
Token ID6512
Feature activation-3.33e-02

Pos logit contributions
 someday+0.027
 swe+0.027
 cheaper+0.025
 spe+0.023
 enough+0.023
Neg logit contributions
ibrarian-0.046
leaf-0.045
vell-0.043
taker-0.042
 ear-0.042
Token softly
Token ID26625
Feature activation-4.96e-02

Pos logit contributions
 cheaper+0.057
 swe+0.057
 someday+0.056
 spe+0.054
 smaller+0.052
Neg logit contributions
ibrarian-0.042
leaf-0.041
taker-0.039
vell-0.038
ped-0.036
Token replied
Token ID8712
Feature activation-7.37e-03

Pos logit contributions
 cheaper+0.084
 swe+0.081
 someday+0.080
 Sometimes+0.077
 smaller+0.076
Neg logit contributions
leaf-0.064
ibrarian-0.061
vell-0.058
ped-0.056
ting-0.056
Token between
Token ID1022
Feature activation-1.33e-02

Pos logit contributions
 swe+0.009
 someday+0.008
 cheaper+0.008
 spe+0.008
c+0.007
Neg logit contributions
leaf-0.014
ibrarian-0.014
vell-0.013
taker-0.013
 character-0.013
Token its
Token ID663
Feature activation-1.99e-02

Pos logit contributions
 someday+0.019
 swe+0.019
 cheaper+0.019
c+0.018
 spe+0.018
Neg logit contributions
leaf-0.022
ibrarian-0.021
taker-0.020
vell-0.020
opher-0.019
Token so
Token ID523
Feature activation-2.20e-02

Pos logit contributions
 someday+0.031
 swe+0.030
 cheaper+0.030
 spe+0.028
 shin+0.028
Neg logit contributions
leaf-0.028
ibrarian-0.027
vell-0.026
 ear-0.026
opher-0.025
Tokenbs
Token ID1443
Feature activation+1.78e-03

Pos logit contributions
 swe+0.033
 someday+0.032
 cheaper+0.032
c+0.031
 shin+0.031
Neg logit contributions
ibrarian-0.033
taker-0.033
leaf-0.032
vell-0.030
he-0.030
Token if
Token ID611
Feature activation-2.54e-02

Pos logit contributions
leaf+0.010
ibrarian+0.010
vell+0.009
taker+0.009
asc+0.009
Neg logit contributions
 swe-0.008
 someday-0.008
 cheaper-0.008
 smaller-0.007
 spe-0.007
Token her
Token ID607
Feature activation+1.63e-03

Pos logit contributions
 swe+0.038
 spe+0.036
c+0.036
 someday+0.036
 cheaper+0.036
Neg logit contributions
ibrarian-0.038
leaf-0.036
vell-0.035
taker-0.035
 character-0.034
Token friends
Token ID2460
Feature activation+3.03e-02

Pos logit contributions
leaf+0.003
ibrarian+0.003
vell+0.002
taker+0.002
asc+0.002
Neg logit contributions
 swe-0.002
 someday-0.002
 cheaper-0.002
 spe-0.002
 smaller-0.002
Token were
Token ID547
Feature activation+2.20e-03

Pos logit contributions
leaf+0.061
ibrarian+0.060
vell+0.058
taker+0.058
iders+0.057
Neg logit contributions
 someday-0.035
 swe-0.033
 cheaper-0.032
 One-0.031
 spe-0.029
Token coming
Token ID2406
Feature activation-2.44e-02

Pos logit contributions
leaf+0.004
ibrarian+0.004
vell+0.004
taker+0.004
asc+0.003
Neg logit contributions
 swe-0.003
 someday-0.003
 cheaper-0.003
 spe-0.003
 smaller-0.003
Token to
Token ID284
Feature activation-5.23e-02

Pos logit contributions
 swe+0.042
 spe+0.041
 cheaper+0.038
c+0.038
 shin+0.037
Neg logit contributions
leaf-0.033
vell-0.031
 outside-0.030
he-0.029
asc-0.029
Token play
Token ID711
Feature activation-1.31e-02

Pos logit contributions
 swe+0.094
 spe+0.092
 cheaper+0.091
 shin+0.087
 enough+0.086
Neg logit contributions
 outside-0.059
leaf-0.059
he-0.053
 ear-0.052
 behind-0.049
Token.
Token ID13
Feature activation-7.16e-03

Pos logit contributions
 swe+0.020
 spe+0.019
 someday+0.019
 cheaper+0.019
 smaller+0.018
Neg logit contributions
leaf-0.020
ibrarian-0.020
vell-0.019
he-0.019
taker-0.018
Token But
Token ID887
Feature activation-1.82e-02

Pos logit contributions
 swe+0.009
 someday+0.009
 spe+0.009
 cheaper+0.009
 smaller+0.008
Neg logit contributions
leaf-0.013
ibrarian-0.012
vell-0.012
taker-0.012
he-0.012
Token when
Token ID618
Feature activation-3.57e-02

Pos logit contributions
 swe+0.028
 spe+0.028
 cheaper+0.027
 shin+0.027
 someday+0.026
Neg logit contributions
ibrarian-0.026
leaf-0.024
taker-0.024
 character-0.023
vell-0.023
Token she
Token ID673
Feature activation-1.83e-02

Pos logit contributions
c+0.064
 shin+0.063
 cheaper+0.062
 swe+0.061
 spe+0.061
Neg logit contributions
 character-0.038
 outside-0.037
he-0.036
 everywhere-0.036
ibrarian-0.036