TOP ACTIVATIONS
MAX = 3.774

day , Lily accidentally touched the screen and it broke .
Sara sees a light on the screen . The light is
was surprised . He touched the screen and it made such
just couldn 't . So the object decided to keep repeating
see ? An image of the perfect day ! The sun
and pushed the button on the television . The screen lit
room . They can touch the screen and make things happen
. His eyes landed on the little freezer . He smiled
so big ! She touched the screen and it lit up
frustrated . The buttons on the television were different and she
Tim stood up and pushed the button on the television .
ummy w affle . The w affle was very special because
. He laid down on the cushion and curled up .
so cool ! She touched the shapes and they moved around
than anything he had ever seen before . Mark
ia put her hand on the cold mirror and started to
seen before - a w affle ! She was so confused
snacks . She saw the pupp ets and smiled .
the big black board on the wall . Tim had a
Tom jumped on the table to get a closer

MIN ACTIVATIONS
MIN = -3.930

branches . They make the tree colorful and bright .
it . He makes the tree look more beautiful and fun
happy that he gave the little bird a big hug .
Tim my 's blanket would become lively whenever they wanted to
iring , until it started to get dark . He started
your name ? '. The little pig let bl ushed and
and pretend they were going on a trip . They took
and asked if she could have some po pp ies .
so they can make the forest pretty ." L
. Soon , the path grew steep . Joe was tired
But sometimes , the sky gets dark and it rains .
mom if they could go on a saf ari , but
them up . But the last snowball was too heavy for
. She wanted to make it soar higher than the boy
. Every morning , the little pig would run and jump
me !" The little boy said , " What
." So the little boy started sharp ening the
, Grace 's attic was becoming more and more full with
that it could make the garden even better .
Soon , the night sky grew very dark . The little

INTERVAL 1.848 - 3.774
CONTAINS 0.347%

He thought she was unhappy with him and his clay .
looked at it , and their eyes spark led . They
" Ben , can I show you something ?" Mia says
from the woods . She was curious , so she decided
. The books he lf was happy to have

INTERVAL -0.078 - 1.848
CONTAINS 54.437%

captain of the ship . Give me your treasure or walk
she saw a big post box . It was red !
rabbit was so happy ! He followed the bird and they
head . Mia felt something wet and gross on her hair
. He said , " You 're welcome , little boy

INTERVAL -2.004 - -0.078
CONTAINS 44.935%

. " Can I cook you for dinner ?" asked Lily
until it stopped in front of a big tree . The
, but Ben keeps taking her blocks . Sara gets angry
- old but her heart was brave . One
face and pur r every time she saw him . <|endoftext|>

INTERVAL -3.930 - -2.004
CONTAINS 0.281%

happy , and they continued to sail on the sea ,
One day , it started to rain and Betsy got very
pin because it made the rainbow look even more beautiful .
sleepy and decided to have a rest . She went to
each other when they were in trouble . <|endoftext|> Once upon
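
The four INTERVAL ranges above are consistent with splitting the span from MIN = -3.930 to MAX = 3.774 into equal-width bins of width (3.774 - (-3.930)) / 4 = 1.926. The sketch below reproduces the listed bounds under that assumption. The function name `interval_edges` and the choice of four bins are illustrative and not taken from the tool that produced this dump.

```python
def interval_edges(lo: float, hi: float, n_bins: int = 4) -> list[tuple[float, float]]:
    """Split [lo, hi] into n_bins equal-width intervals, highest interval first.

    Assumption: the dashboard bins activations uniformly between MIN and MAX.
    """
    width = (hi - lo) / n_bins
    edges = [hi - i * width for i in range(n_bins + 1)]
    # Pair consecutive edges as (lower, upper), rounded to 3 decimals like the dump.
    return [(round(edges[i + 1], 3), round(edges[i], 3)) for i in range(n_bins)]


# Shares listed under CONTAINS, highest interval first; they should total 100%.
shares = [0.347, 54.437, 44.935, 0.281]

for (lower, upper), share in zip(interval_edges(-3.930, 3.774), shares):
    print(f"INTERVAL {lower:.3f} - {upper:.3f}  CONTAINS {share:.3f}%")
```

Almost all activations (54.437% + 44.935%) fall in the two middle intervals, so the top and bottom intervals each cover well under 1% of tokens.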
Token day
Token ID1110
Feature activation-1.03e-01

Pos logit contributions
leaf+0.028
vell+0.027
ibrarian+0.026
taker+0.025
iders+0.025
Neg logit contributions
 someday-0.023
 swe-0.023
 spe-0.021
 smaller-0.021
c-0.021
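
Each per-token record in this dump fuses a label with its value: `Token ID1110`, `Feature activation-1.03e-01`, `leaf+0.028`. A leading space on a contribution line, as in ` someday-0.023`, is part of the token text. The following is a minimal sketch of how such lines could be split for further analysis. The helper names `split_fused` and `parse_token_id` are hypothetical and only cover the line shapes visible here.

```python
import re

# Trailing signed number, either fixed-point (+0.028) or scientific (-1.03e-01).
_SIGNED_TAIL = re.compile(
    r"^(?P<label>.*?)(?P<value>[+-]\d+(?:\.\d+)?(?:e[+-]\d+)?)$"
)
_TOKEN_ID = re.compile(r"^Token ID(\d+)$")


def split_fused(line: str) -> tuple[str, float]:
    """Split 'label<signed number>' into (label, value), keeping label whitespace."""
    match = _SIGNED_TAIL.match(line)
    if match is None:
        raise ValueError(f"no trailing signed number in {line!r}")
    return match.group("label"), float(match.group("value"))


def parse_token_id(line: str) -> int:
    """Extract the integer from a 'Token ID<digits>' line."""
    match = _TOKEN_ID.match(line)
    if match is None:
        raise ValueError(f"not a token-ID line: {line!r}")
    return int(match.group(1))


print(parse_token_id("Token ID1110"))
print(split_fused("Feature activation-1.03e-01"))
print(split_fused(" someday-0.023"))
```

The label is matched lazily so that only the final signed number is treated as the value, which keeps any `+` or `-` inside a token's own text intact.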
Token,
Token ID11
Feature activation-5.47e-01

Pos logit contributions
 swe+0.003
 someday+0.003
 cheaper+0.003
 spe+0.003
 smaller+0.002
Neg logit contributions
leaf-0.005
ibrarian-0.005
vell-0.005
taker-0.005
he-0.005
Token Lily
Token ID20037
Feature activation-2.17e-01

Pos logit contributions
 swe+0.019
 spe+0.018
 someday+0.018
 cheaper+0.018
c+0.018
Neg logit contributions
leaf-0.021
ibrarian-0.021
 character-0.020
vell-0.020
taker-0.020
Token accidentally
Token ID14716
Feature activation+9.24e-01

Pos logit contributions
 swe+0.007
 someday+0.006
 spe+0.006
 cheaper+0.006
 smaller+0.006
Neg logit contributions
leaf-0.010
ibrarian-0.010
vell-0.009
taker-0.009
he-0.009
Token touched
Token ID12615
Feature activation-4.17e-01

Pos logit contributions
leaf+0.044
ibrarian+0.042
vell+0.042
taker+0.041
he+0.040
Neg logit contributions
 swe-0.028
 someday-0.026
 smaller-0.026
 spe-0.026
 enough-0.025
Token the
Token ID262
Feature activation+3.77e+00

Pos logit contributions
 someday+0.013
 swe+0.013
 spe+0.013
 cheaper+0.013
 shin+0.012
Neg logit contributions
leaf-0.018
ibrarian-0.017
taker-0.016
vell-0.016
he-0.016
Token screen
Token ID3159
Feature activation+1.95e-01

Pos logit contributions
izards+0.166
ibrarian+0.156
ibr+0.153
ying+0.152
iders+0.148
Neg logit contributions
 smaller-0.134
 blue-0.128
 one-0.123
 bigger-0.116
 One-0.116
Token and
Token ID290
Feature activation+1.31e+00

Pos logit contributions
leaf+0.009
ibrarian+0.008
vell+0.008
taker+0.008
asc+0.008
Neg logit contributions
 swe-0.006
 someday-0.006
 cheaper-0.006
 spe-0.006
 smaller-0.006
Token it
Token ID340
Feature activation+4.17e-03

Pos logit contributions
leaf+0.081
vell+0.076
ibrarian+0.075
asc+0.074
ibr+0.072
Neg logit contributions
 swe-0.025
 enough-0.023
 smaller-0.022
 someday-0.022
 spe-0.021
Token broke
Token ID6265
Feature activation+6.55e-02

Pos logit contributions
leaf+0.000
ibrarian+0.000
vell+0.000
taker+0.000
he+0.000
Neg logit contributions
 swe-0.000
 someday-0.000
 cheaper-0.000
 spe-0.000
 smaller-0.000
Token.
Token ID13
Feature activation+6.06e-01

Pos logit contributions
leaf+0.002
ibrarian+0.002
vell+0.002
taker+0.002
asc+0.002
Neg logit contributions
 swe-0.003
 someday-0.003
 cheaper-0.002
 spe-0.002
 smaller-0.002
Token Sara
Token ID24799
Feature activation-8.72e-01

Pos logit contributions
 swe+0.005
 someday+0.005
 cheaper+0.005
 spe+0.005
c+0.005
Neg logit contributions
leaf-0.008
ibrarian-0.008
vell-0.007
taker-0.007
he-0.007
Token sees
Token ID7224
Feature activation+6.33e-01

Pos logit contributions
 swe+0.029
 spe+0.028
 cheaper+0.028
 someday+0.027
 shin+0.026
Neg logit contributions
ibrarian-0.037
leaf-0.037
vell-0.034
asc-0.033
ibr-0.033
Token a
Token ID257
Feature activation+2.36e-01

Pos logit contributions
leaf+0.033
vell+0.032
ibrarian+0.031
taker+0.030
asc+0.030
Neg logit contributions
 swe-0.016
 someday-0.015
 cheaper-0.015
 smaller-0.015
 enough-0.014
Token light
Token ID1657
Feature activation-1.71e-01

Pos logit contributions
leaf+0.011
ibrarian+0.010
vell+0.010
taker+0.010
asc+0.009
Neg logit contributions
 swe-0.007
 someday-0.007
 cheaper-0.007
 spe-0.007
 smaller-0.007
Token on
Token ID319
Feature activation-8.49e-02

Pos logit contributions
 swe+0.007
 someday+0.007
 cheaper+0.007
 spe+0.007
 smaller+0.006
Neg logit contributions
leaf-0.006
ibrarian-0.006
vell-0.005
taker-0.005
he-0.005
Token the
Token ID262
Feature activation+3.70e+00

Pos logit contributions
 swe+0.002
 someday+0.002
 cheaper+0.002
 spe+0.002
 smaller+0.002
Neg logit contributions
leaf-0.004
ibrarian-0.004
vell-0.004
taker-0.004
he-0.004
Token screen
Token ID3159
Feature activation+8.28e-01

Pos logit contributions
izards+0.170
imes+0.156
oll+0.154
iders+0.152
ibr+0.151
Neg logit contributions
 smaller-0.125
 swe-0.118
 blue-0.117
 one-0.115
 less-0.115
Token.
Token ID13
Feature activation-2.11e-01

Pos logit contributions
leaf+0.042
vell+0.040
ibrarian+0.039
taker+0.038
asc+0.037
Neg logit contributions
 someday-0.023
 swe-0.022
 One-0.020
 cheaper-0.020
c-0.020
Token The
Token ID383
Feature activation+2.92e+00

Pos logit contributions
 swe+0.007
 someday+0.007
 cheaper+0.007
 spe+0.007
 smaller+0.006
Neg logit contributions
leaf-0.009
ibrarian-0.009
vell-0.008
taker-0.008
he-0.008
Token light
Token ID1657
Feature activation-6.47e-03

Pos logit contributions
oll+0.147
izards+0.141
imes+0.139
vell+0.138
idget+0.137
Neg logit contributions
 smaller-0.089
 stream-0.077
 better-0.075
 less-0.074
 blue-0.074
Token is
Token ID318
Feature activation+7.52e-01

Pos logit contributions
 swe+0.000
 someday+0.000
 cheaper+0.000
 spe+0.000
 smaller+0.000
Neg logit contributions
leaf-0.000
ibrarian-0.000
vell-0.000
taker-0.000
asc-0.000
Token was
Token ID373
Feature activation+1.14e+00

Pos logit contributions
leaf+0.016
ibrarian+0.015
vell+0.015
taker+0.015
asc+0.014
Neg logit contributions
 swe-0.009
 someday-0.009
 spe-0.008
 cheaper-0.008
 smaller-0.008
Token surprised
Token ID6655
Feature activation+4.77e-01

Pos logit contributions
leaf+0.053
ibrarian+0.051
vell+0.050
iders+0.048
affles+0.047
Neg logit contributions
 smaller-0.036
 swe-0.034
 enough-0.034
 cheaper-0.033
 spe-0.033
Token.
Token ID13
Feature activation+8.51e-01

Pos logit contributions
leaf+0.024
ibrarian+0.023
vell+0.022
taker+0.021
asc+0.021
Neg logit contributions
 someday-0.014
 swe-0.014
 enough-0.012
 cheaper-0.012
 smaller-0.012
Token He
Token ID679
Feature activation+6.53e-01

Pos logit contributions
leaf+0.046
vell+0.046
ibrarian+0.045
asc+0.043
iders+0.042
Neg logit contributions
 One-0.020
 someday-0.018
 swe-0.018
 cheaper-0.017
 smaller-0.017
Token touched
Token ID12615
Feature activation+8.90e-01

Pos logit contributions
leaf+0.030
ibrarian+0.029
vell+0.029
taker+0.027
asc+0.027
Neg logit contributions
 swe-0.020
 someday-0.019
 spe-0.019
 cheaper-0.018
 smaller-0.018
Token the
Token ID262
Feature activation+3.70e+00

Pos logit contributions
leaf+0.051
vell+0.050
ibrarian+0.050
iders+0.047
ibr+0.047
Neg logit contributions
 swe-0.019
 someday-0.018
 smaller-0.018
 enough-0.018
c-0.017
Token screen
Token ID3159
Feature activation-4.85e-02

Pos logit contributions
izards+0.167
oll+0.157
asc+0.155
live+0.155
ibrarian+0.152
Neg logit contributions
 smaller-0.130
 blue-0.117
 bigger-0.115
 one-0.114
 better-0.110
Token and
Token ID290
Feature activation+1.51e+00

Pos logit contributions
 swe+0.002
 someday+0.002
 spe+0.001
 cheaper+0.001
c+0.001
Neg logit contributions
leaf-0.002
ibrarian-0.002
vell-0.002
taker-0.002
he-0.002
Token it
Token ID340
Feature activation+2.06e-01

Pos logit contributions
leaf+0.082
ibrarian+0.080
vell+0.079
asc+0.078
opher+0.076
Neg logit contributions
 someday-0.035
 smaller-0.035
 swe-0.035
 enough-0.033
 shin-0.032
Token made
Token ID925
Feature activation+8.84e-01

Pos logit contributions
leaf+0.010
ibrarian+0.009
vell+0.009
taker+0.009
asc+0.009
Neg logit contributions
 swe-0.006
 someday-0.006
 spe-0.006
 cheaper-0.006
 smaller-0.005
Token such
Token ID884
Feature activation+4.25e-01

Pos logit contributions
leaf+0.044
ibrarian+0.043
vell+0.042
asc+0.041
iders+0.040
Neg logit contributions
 swe-0.025
 smaller-0.023
 someday-0.023
 enough-0.023
 cheaper-0.022
Token just
Token ID655
Feature activation+1.77e-01

Pos logit contributions
 swe+0.011
 someday+0.010
 cheaper+0.010
 spe+0.010
 smaller+0.009
Neg logit contributions
leaf-0.013
ibrarian-0.013
vell-0.013
taker-0.012
 character-0.012
Token couldn
Token ID3521
Feature activation-2.39e+00

Pos logit contributions
leaf+0.008
ibrarian+0.008
vell+0.007
taker+0.007
he+0.007
Neg logit contributions
 swe-0.006
 someday-0.006
 spe-0.005
 cheaper-0.005
 smaller-0.005
Token't
Token ID470
Feature activation-2.01e-01

Pos logit contributions
 swe+0.149
 spe+0.146
 someday+0.142
 cheaper+0.140
 shin+0.140
Neg logit contributions
he-0.039
ibrarian-0.033
taker-0.027
iders-0.024
leaf-0.024
Token.
Token ID13
Feature activation-8.43e-01

Pos logit contributions
 swe+0.007
 someday+0.006
 spe+0.006
 cheaper+0.006
 smaller+0.006
Neg logit contributions
leaf-0.008
ibrarian-0.008
vell-0.008
taker-0.008
he-0.008
Token So
Token ID1406
Feature activation-9.50e-01

Pos logit contributions
 swe+0.033
 spe+0.032
 cheaper+0.031
 someday+0.031
 smaller+0.030
Neg logit contributions
leaf-0.030
ibrarian-0.030
he-0.028
taker-0.027
 character-0.027
Token the
Token ID262
Feature activation+3.63e+00

Pos logit contributions
 swe+0.036
 cheaper+0.034
c+0.032
 shin+0.032
 spe+0.032
Neg logit contributions
 ear-0.034
ibrarian-0.033
 goodness-0.033
 character-0.033
leaf-0.031
Token object
Token ID2134
Feature activation-1.26e+00

Pos logit contributions
ives+0.176
iders+0.175
izards+0.171
ibrarian+0.171
leaf+0.170
Neg logit contributions
 copper-0.109
 smaller-0.106
 silver-0.098
 blue-0.091
 shin-0.090
Token decided
Token ID3066
Feature activation-9.03e-01

Pos logit contributions
 swe+0.045
 spe+0.044
 cheaper+0.043
 someday+0.041
 smaller+0.040
Neg logit contributions
ibrarian-0.051
leaf-0.047
taker-0.045
vell-0.045
 character-0.045
Token to
Token ID284
Feature activation+2.49e-01

Pos logit contributions
 spe+0.031
 swe+0.031
c+0.030
 cheaper+0.028
 someday+0.028
Neg logit contributions
leaf-0.036
ibrarian-0.034
taker-0.032
vell-0.032
 outside-0.031
Token keep
Token ID1394
Feature activation+2.30e-01

Pos logit contributions
leaf+0.012
ibrarian+0.011
vell+0.011
taker+0.011
asc+0.010
Neg logit contributions
 swe-0.007
 someday-0.007
 cheaper-0.007
 spe-0.007
 smaller-0.006
Token repeating
Token ID20394
Feature activation+4.02e-01

Pos logit contributions
leaf+0.011
ibrarian+0.010
vell+0.010
taker+0.010
he+0.009
Neg logit contributions
 swe-0.007
 someday-0.007
 cheaper-0.007
 spe-0.007
 smaller-0.007
Token see
Token ID766
Feature activation+1.92e+00

Pos logit contributions
leaf+0.017
ibrarian+0.016
vell+0.016
taker+0.016
asc+0.015
Neg logit contributions
 someday-0.012
 swe-0.012
 cheaper-0.011
 spe-0.011
c-0.011
Token?
Token ID30
Feature activation+6.94e-01

Pos logit contributions
leaf+0.106
vell+0.096
taker+0.096
affle+0.096
opher+0.095
Neg logit contributions
 someday-0.046
c-0.046
 enough-0.041
 swe-0.041
 smaller-0.038
Token An
Token ID1052
Feature activation+1.60e+00

Pos logit contributions
leaf+0.034
vell+0.033
ibrarian+0.033
taker+0.031
asc+0.031
Neg logit contributions
 someday-0.019
 One-0.019
 swe-0.018
c-0.017
 spe-0.017
Token image
Token ID2939
Feature activation+1.78e+00

Pos logit contributions
leaf+0.064
vell+0.064
ibrarian+0.062
iders+0.061
ibr+0.059
Neg logit contributions
 swe-0.056
 spe-0.055
 cheaper-0.053
 someday-0.052
 enough-0.051
Token of
Token ID286
Feature activation+1.70e+00

Pos logit contributions
leaf+0.085
vell+0.080
issors+0.079
ibrarian+0.077
opher+0.077
Neg logit contributions
 spe-0.048
 someday-0.048
c-0.047
 swe-0.046
 enough-0.046
Token the
Token ID262
Feature activation+3.58e+00

Pos logit contributions
ibrarian+0.091
vell+0.089
iders+0.089
leaf+0.089
ower+0.087
Neg logit contributions
 smaller-0.039
 swe-0.038
 spe-0.037
 someday-0.036
 enough-0.033
Token perfect
Token ID2818
Feature activation+1.90e+00

Pos logit contributions
ibrarian+0.166
iders+0.162
izards+0.162
ying+0.162
affles+0.159
Neg logit contributions
 smaller-0.116
 less-0.108
 one-0.105
 swe-0.103
 ones-0.101
Token day
Token ID1110
Feature activation+1.43e+00

Pos logit contributions
ibrarian+0.070
leaf+0.070
vell+0.067
iders+0.066
ibr+0.066
Neg logit contributions
 spe-0.074
 swe-0.074
 someday-0.073
 smaller-0.073
 enough-0.071
Token!
Token ID0
Feature activation+7.73e-01

Pos logit contributions
leaf+0.066
vell+0.063
ibrarian+0.063
taker+0.061
opher+0.060
Neg logit contributions
 someday-0.046
 enough-0.043
 swe-0.042
 spe-0.041
c-0.039
Token The
Token ID383
Feature activation+2.84e+00

Pos logit contributions
vell+0.042
leaf+0.042
ibrarian+0.041
asc+0.039
iders+0.039
Neg logit contributions
 One-0.017
 swe-0.016
 someday-0.016
c-0.015
 spe-0.015
Token sun
Token ID4252
Feature activation+3.26e-01

Pos logit contributions
asso+0.155
izards+0.155
iders+0.153
issors+0.152
ibrarian+0.150
Neg logit contributions
 smaller-0.076
 swe-0.072
 again-0.066
 one-0.063
 better-0.063
Token and
Token ID290
Feature activation+1.33e+00

Pos logit contributions
 swe+0.040
c+0.039
 someday+0.038
 cheaper+0.037
 spe+0.037
Neg logit contributions
he-0.035
ibrarian-0.033
leaf-0.033
taker-0.033
vell-0.032
Token pushed
Token ID7121
Feature activation+6.21e-01

Pos logit contributions
leaf+0.071
ibrarian+0.069
opher+0.068
vell+0.068
taker+0.065
Neg logit contributions
 swe-0.033
 someday-0.032
 smaller-0.031
 shin-0.030
 again-0.028
Token the
Token ID262
Feature activation+3.41e+00

Pos logit contributions
leaf+0.033
vell+0.031
ibrarian+0.031
asc+0.029
ibr+0.029
Neg logit contributions
 swe-0.017
 someday-0.017
 smaller-0.016
 enough-0.015
 One-0.015
Token button
Token ID4936
Feature activation-4.04e-01

Pos logit contributions
izards+0.166
oll+0.160
live+0.157
ocket+0.156
asso+0.151
Neg logit contributions
 smaller-0.115
 swe-0.103
 bigger-0.099
 blue-0.095
 one-0.091
Token on
Token ID319
Feature activation-7.21e-02

Pos logit contributions
 swe+0.015
 someday+0.014
 spe+0.014
 cheaper+0.014
c+0.013
Neg logit contributions
leaf-0.016
ibrarian-0.015
vell-0.015
taker-0.015
he-0.014
Token the
Token ID262
Feature activation+3.55e+00

Pos logit contributions
 swe+0.002
 someday+0.002
 cheaper+0.002
 spe+0.002
c+0.002
Neg logit contributions
leaf-0.003
ibrarian-0.003
vell-0.003
taker-0.003
he-0.003
Token television
Token ID5581
Feature activation+1.41e+00

Pos logit contributions
izards+0.173
live+0.160
oll+0.158
ocket+0.156
imes+0.154
Neg logit contributions
 smaller-0.117
 swe-0.108
 bigger-0.103
 one-0.101
 better-0.101
Token.
Token ID13
Feature activation+1.23e+00

Pos logit contributions
leaf+0.078
ibrarian+0.073
asc+0.071
vell+0.071
opher+0.070
Neg logit contributions
 someday-0.034
 swe-0.033
 smaller-0.031
 enough-0.031
 cheaper-0.030
Token The
Token ID383
Feature activation+2.84e+00

Pos logit contributions
leaf+0.070
vell+0.069
ibrarian+0.067
ils+0.063
izards+0.063
Neg logit contributions
 One-0.030
 swe-0.026
 smaller-0.025
 Sometimes-0.023
 someday-0.022
Token screen
Token ID3159
Feature activation+8.65e-01

Pos logit contributions
izards+0.140
oll+0.130
asso+0.129
imes+0.127
ocket+0.127
Neg logit contributions
 smaller-0.098
 swe-0.086
 bigger-0.080
 better-0.080
 blue-0.080
Token lit
Token ID6578
Feature activation+5.83e-02

Pos logit contributions
leaf+0.044
ibrarian+0.041
vell+0.041
asc+0.040
taker+0.039
Neg logit contributions
 swe-0.023
 someday-0.022
 smaller-0.021
 spe-0.021
 cheaper-0.021
Token room
Token ID2119
Feature activation+1.42e-01

Pos logit contributions
leaf+0.051
vell+0.048
taker+0.047
ibrarian+0.046
iders+0.045
Neg logit contributions
 swe-0.044
c-0.041
 someday-0.040
 smaller-0.040
 One-0.040
Token.
Token ID13
Feature activation-6.75e-01

Pos logit contributions
leaf+0.007
ibrarian+0.006
vell+0.006
taker+0.006
asc+0.006
Neg logit contributions
 swe-0.004
 someday-0.004
 cheaper-0.004
 spe-0.004
 smaller-0.004
Token They
Token ID1119
Feature activation+6.84e-01

Pos logit contributions
 swe+0.023
 someday+0.022
 spe+0.022
 cheaper+0.021
 shin+0.021
Neg logit contributions
leaf-0.028
ibrarian-0.028
he-0.026
asc-0.025
taker-0.025
Token can
Token ID460
Feature activation+6.70e-01

Pos logit contributions
leaf+0.030
ibrarian+0.029
vell+0.029
iders+0.027
asc+0.027
Neg logit contributions
 someday-0.022
 swe-0.022
 spe-0.021
 One-0.020
 smaller-0.020
Token touch
Token ID3638
Feature activation+5.50e-01

Pos logit contributions
leaf+0.030
vell+0.029
ibrarian+0.029
taker+0.028
asc+0.027
Neg logit contributions
 someday-0.021
 swe-0.021
 smaller-0.020
 One-0.020
 spe-0.020
Token the
Token ID262
Feature activation+3.53e+00

Pos logit contributions
leaf+0.028
ibrarian+0.027
vell+0.027
taker+0.026
asc+0.025
Neg logit contributions
 swe-0.015
 someday-0.014
 smaller-0.013
 spe-0.013
 cheaper-0.013
Token screen
Token ID3159
Feature activation-1.40e-01

Pos logit contributions
ibrarian+0.149
ying+0.147
izards+0.145
bled+0.142
live+0.140
Neg logit contributions
 smaller-0.124
 blue-0.117
 swe-0.115
 coll-0.113
 ones-0.113
Token and
Token ID290
Feature activation+1.20e+00

Pos logit contributions
 swe+0.004
 someday+0.004
 spe+0.004
 cheaper+0.004
c+0.004
Neg logit contributions
leaf-0.006
ibrarian-0.006
vell-0.006
taker-0.006
he-0.006
Token make
Token ID787
Feature activation+6.69e-01

Pos logit contributions
ibrarian+0.057
leaf+0.056
vell+0.056
asc+0.054
opher+0.052
Neg logit contributions
 someday-0.037
 swe-0.036
 spe-0.034
 smaller-0.033
 One-0.032
Token things
Token ID1243
Feature activation-2.79e-01

Pos logit contributions
leaf+0.030
ibrarian+0.029
vell+0.028
asc+0.027
taker+0.027
Neg logit contributions
 swe-0.022
 someday-0.021
 smaller-0.020
 cheaper-0.020
 spe-0.019
Token happen
Token ID1645
Feature activation-3.95e-01

Pos logit contributions
 swe+0.010
 someday+0.010
 spe+0.009
 cheaper+0.009
c+0.009
Neg logit contributions
leaf-0.011
ibrarian-0.011
vell-0.010
taker-0.010
he-0.010
Token.
Token ID13
Feature activation-3.62e-01

Pos logit contributions
leaf+0.021
ibrarian+0.021
vell+0.020
taker+0.019
asc+0.019
Neg logit contributions
 someday-0.013
 swe-0.013
 cheaper-0.012
 spe-0.012
 smaller-0.012
Token His
Token ID2399
Feature activation+1.79e+00

Pos logit contributions
 swe+0.013
 someday+0.012
 cheaper+0.012
 spe+0.012
c+0.011
Neg logit contributions
leaf-0.015
ibrarian-0.014
vell-0.014
taker-0.014
ibr-0.013
Token eyes
Token ID2951
Feature activation-5.87e-01

Pos logit contributions
leaf+0.090
taker+0.083
he+0.082
vell+0.081
ibrarian+0.080
Neg logit contributions
 swe-0.060
 smaller-0.057
 enough-0.050
 cheaper-0.050
 shorter-0.050
Token landed
Token ID11406
Feature activation+4.52e-01

Pos logit contributions
 swe+0.018
 someday+0.018
 cheaper+0.017
 spe+0.017
c+0.016
Neg logit contributions
leaf-0.026
vell-0.025
ibrarian-0.025
he-0.024
taker-0.024
Token on
Token ID319
Feature activation-6.12e-01

Pos logit contributions
leaf+0.016
ibrarian+0.015
vell+0.014
taker+0.014
 character+0.013
Neg logit contributions
 swe-0.019
 someday-0.019
 cheaper-0.018
 smaller-0.018
 spe-0.018
Token the
Token ID262
Feature activation+3.46e+00

Pos logit contributions
 swe+0.021
 someday+0.020
 cheaper+0.020
 spe+0.019
c+0.019
Neg logit contributions
leaf-0.025
ibrarian-0.024
taker-0.023
 character-0.022
he-0.022
Token little
Token ID1310
Feature activation+1.37e+00

Pos logit contributions
izards+0.164
leaf+0.161
asc+0.158
ying+0.157
ibrarian+0.156
Neg logit contributions
 swe-0.122
 smaller-0.116
 cheaper-0.103
 less-0.103
 shorter-0.099
Token freezer
Token ID30967
Feature activation+7.32e-01

Pos logit contributions
leaf+0.059
ibrarian+0.057
vell+0.056
asc+0.052
ibr+0.052
Neg logit contributions
 swe-0.052
 someday-0.048
 smaller-0.047
 cheaper-0.046
 spe-0.045
Token.
Token ID13
Feature activation-1.10e-01

Pos logit contributions
leaf+0.036
ibrarian+0.033
vell+0.032
asc+0.031
ibr+0.031
Neg logit contributions
 swe-0.022
 someday-0.021
 cheaper-0.021
 enough-0.020
 spe-0.020
Token He
Token ID679
Feature activation+5.07e-01

Pos logit contributions
 swe+0.004
 someday+0.004
 cheaper+0.003
 spe+0.003
c+0.003
Neg logit contributions
leaf-0.005
ibrarian-0.005
vell-0.004
taker-0.004
asc-0.004
Token smiled
Token ID13541
Feature activation-4.75e-01

Pos logit contributions
leaf+0.025
ibrarian+0.024
vell+0.024
taker+0.023
asc+0.023
Neg logit contributions
 swe-0.014
 someday-0.014
 spe-0.013
 cheaper-0.013
 smaller-0.013
Token so
Token ID523
Feature activation+7.83e-01

Pos logit contributions
leaf+0.046
ibrarian+0.045
asc+0.043
opher+0.042
vell+0.041
Neg logit contributions
 someday-0.042
 swe-0.041
 cheaper-0.041
 spe-0.039
 smaller-0.039
Token big
Token ID1263
Feature activation+1.92e-01

Pos logit contributions
leaf+0.037
ibrarian+0.035
vell+0.034
asc+0.034
opher+0.033
Neg logit contributions
 swe-0.024
 someday-0.024
 cheaper-0.023
 smaller-0.022
 spe-0.022
Token!
Token ID0
Feature activation+5.09e-01

Pos logit contributions
leaf+0.008
ibrarian+0.008
vell+0.008
taker+0.007
asc+0.007
Neg logit contributions
 swe-0.007
 someday-0.006
 spe-0.006
 cheaper-0.006
 smaller-0.006
Token She
Token ID1375
Feature activation+1.84e-01

Pos logit contributions
leaf+0.023
vell+0.023
ibrarian+0.022
taker+0.021
asc+0.021
Neg logit contributions
 someday-0.015
 swe-0.015
 One-0.015
 spe-0.014
 cheaper-0.014
Token touched
Token ID12615
Feature activation+7.52e-01

Pos logit contributions
leaf+0.008
ibrarian+0.008
vell+0.008
taker+0.007
asc+0.007
Neg logit contributions
 swe-0.006
 someday-0.006
 spe-0.006
 cheaper-0.006
 smaller-0.005
Token the
Token ID262
Feature activation+3.43e+00

Pos logit contributions
leaf+0.042
ibrarian+0.040
vell+0.040
asc+0.039
iders+0.038
Neg logit contributions
 swe-0.016
 someday-0.016
 smaller-0.016
c-0.015
 enough-0.015
Token screen
Token ID3159
Feature activation+2.86e-01

Pos logit contributions
oll+0.166
izards+0.163
affles+0.159
ibrarian+0.157
ido+0.156
Neg logit contributions
 smaller-0.113
 blue-0.103
 bigger-0.095
 one-0.094
 yellow-0.094
Token and
Token ID290
Feature activation+1.42e+00

Pos logit contributions
leaf+0.013
ibrarian+0.013
vell+0.012
taker+0.012
asc+0.012
Neg logit contributions
 swe-0.009
 someday-0.009
 cheaper-0.008
 spe-0.008
 smaller-0.008
Token it
Token ID340
Feature activation+1.88e-01

Pos logit contributions
leaf+0.074
vell+0.072
ibrarian+0.071
asc+0.071
affle+0.070
Neg logit contributions
 someday-0.037
 smaller-0.034
 swe-0.034
 enough-0.034
 spe-0.033
Token lit
Token ID6578
Feature activation-1.68e-01

Pos logit contributions
leaf+0.009
ibrarian+0.009
vell+0.009
taker+0.008
asc+0.008
Neg logit contributions
 swe-0.005
 someday-0.005
 spe-0.005
 cheaper-0.005
 smaller-0.005
Token up
Token ID510
Feature activation-1.18e-01

Pos logit contributions
 swe+0.008
 someday+0.008
 cheaper+0.008
 spe+0.008
c+0.008
Neg logit contributions
leaf-0.005
ibrarian-0.004
vell-0.004
taker-0.004
he-0.004
Token frustrated
Token ID14718
Feature activation+4.05e-01

Pos logit contributions
 swe+0.000
 someday+0.000
 spe+0.000
 cheaper+0.000
c+0.000
Neg logit contributions
leaf-0.000
ibrarian-0.000
vell-0.000
taker-0.000
asc-0.000
Token.
Token ID13
Feature activation+2.37e-01

Pos logit contributions
leaf+0.020
ibrarian+0.019
vell+0.019
taker+0.018
asc+0.018
Neg logit contributions
 swe-0.012
 someday-0.012
 cheaper-0.011
 spe-0.010
 enough-0.010
Token The
Token ID383
Feature activation+2.60e+00

Pos logit contributions
leaf+0.012
ibrarian+0.011
vell+0.011
taker+0.011
asc+0.010
Neg logit contributions
 swe-0.007
 someday-0.006
 cheaper-0.006
 spe-0.006
 One-0.006
Token buttons
Token ID12163
Feature activation-8.30e-01

Pos logit contributions
izards+0.121
asso+0.114
leaf+0.108
asc+0.107
iders+0.107
Neg logit contributions
 swe-0.091
 smaller-0.089
 moral-0.085
 one-0.082
 better-0.080
Token on
Token ID319
Feature activation-2.72e-01

Pos logit contributions
 swe+0.030
 someday+0.029
 cheaper+0.028
 spe+0.028
 smaller+0.027
Neg logit contributions
leaf-0.032
ibrarian-0.032
vell-0.031
taker-0.030
he-0.030
Token the
Token ID262
Feature activation+3.42e+00

Pos logit contributions
 swe+0.008
 someday+0.008
 cheaper+0.008
 spe+0.008
 shin+0.007
Neg logit contributions
leaf-0.012
ibrarian-0.012
vell-0.012
taker-0.011
he-0.011
Token television
Token ID5581
Feature activation+8.30e-01

Pos logit contributions
izards+0.162
oll+0.148
asc+0.146
imes+0.146
ibrarian+0.145
Neg logit contributions
 swe-0.118
 smaller-0.114
 one-0.106
 ones-0.104
 again-0.103
Token were
Token ID547
Feature activation+3.83e-01

Pos logit contributions
leaf+0.040
ibrarian+0.038
vell+0.037
asc+0.036
taker+0.035
Neg logit contributions
 someday-0.024
 swe-0.024
 cheaper-0.022
 spe-0.022
c-0.022
Token different
Token ID1180
Feature activation+3.55e-01

Pos logit contributions
leaf+0.016
ibrarian+0.016
vell+0.015
taker+0.014
he+0.014
Neg logit contributions
 swe-0.013
 someday-0.013
 cheaper-0.013
 spe-0.012
 smaller-0.012
Token and
Token ID290
Feature activation+3.23e-01

Pos logit contributions
leaf+0.016
ibrarian+0.016
vell+0.015
taker+0.014
asc+0.014
Neg logit contributions
 swe-0.011
 someday-0.011
 cheaper-0.011
 spe-0.010
 smaller-0.010
Token she
Token ID673
Feature activation+7.74e-01

Pos logit contributions
leaf+0.015
ibrarian+0.015
vell+0.014
taker+0.013
asc+0.013
Neg logit contributions
 swe-0.010
 someday-0.010
 cheaper-0.009
 spe-0.009
 smaller-0.009
Token Tim
Token ID5045
Feature activation+5.38e-01

Pos logit contributions
vell+0.059
ibrarian+0.058
leaf+0.058
ocket+0.054
taker+0.053
Neg logit contributions
 One-0.034
 swe-0.030
 someday-0.028
 Sometimes-0.027
c-0.027
Token stood
Token ID6204
Feature activation+7.44e-01

Pos logit contributions
leaf+0.026
vell+0.025
ibrarian+0.025
taker+0.024
asc+0.023
Neg logit contributions
 swe-0.016
 someday-0.015
c-0.014
 cheaper-0.014
 smaller-0.014
Token up
Token ID510
Feature activation-1.00e+00

Pos logit contributions
leaf+0.031
ibrarian+0.028
vell+0.027
taker+0.026
ibr+0.026
Neg logit contributions
 swe-0.028
 smaller-0.028
 someday-0.028
 enough-0.026
 cheaper-0.026
Token and
Token ID290
Feature activation+1.33e+00

Pos logit contributions
 swe+0.040
c+0.039
 someday+0.038
 cheaper+0.037
 spe+0.037
Neg logit contributions
he-0.035
ibrarian-0.033
leaf-0.033
taker-0.033
vell-0.032
Token pushed
Token ID7121
Feature activation+6.21e-01

Pos logit contributions
leaf+0.071
ibrarian+0.069
opher+0.068
vell+0.068
taker+0.065
Neg logit contributions
 swe-0.033
 someday-0.032
 smaller-0.031
 shin-0.030
 again-0.028
Token the
Token ID262
Feature activation+3.41e+00

Pos logit contributions
leaf+0.033
vell+0.031
ibrarian+0.031
asc+0.029
ibr+0.029
Neg logit contributions
 swe-0.017
 someday-0.017
 smaller-0.016
 enough-0.015
 One-0.015
Token button
Token ID4936
Feature activation-4.04e-01

Pos logit contributions
izards+0.166
oll+0.160
live+0.157
ocket+0.156
asso+0.151
Neg logit contributions
 smaller-0.115
 swe-0.103
 bigger-0.099
 blue-0.095
 one-0.091
Token on
Token ID319
Feature activation-7.21e-02

Pos logit contributions
 swe+0.015
 someday+0.014
 spe+0.014
 cheaper+0.014
c+0.013
Neg logit contributions
leaf-0.016
ibrarian-0.015
vell-0.015
taker-0.015
he-0.014
Token the
Token ID262
Feature activation+3.55e+00

Pos logit contributions
 swe+0.002
 someday+0.002
 cheaper+0.002
 spe+0.002
c+0.002
Neg logit contributions
leaf-0.003
ibrarian-0.003
vell-0.003
taker-0.003
he-0.003
Token television
Token ID5581
Feature activation+1.41e+00

Pos logit contributions
izards+0.173
live+0.160
oll+0.158
ocket+0.156
imes+0.154
Neg logit contributions
 smaller-0.117
 swe-0.108
 bigger-0.103
 one-0.101
 better-0.101
Token.
Token ID13
Feature activation+1.23e+00

Pos logit contributions
leaf+0.078
ibrarian+0.073
asc+0.071
vell+0.071
opher+0.070
Neg logit contributions
 someday-0.034
 swe-0.033
 smaller-0.031
 enough-0.031
 cheaper-0.030
Tokenummy
Token ID13513
Feature activation+1.34e+00

Pos logit contributions
leaf+0.028
ibrarian+0.026
vell+0.024
taker+0.024
ibr+0.023
Neg logit contributions
 swe-0.036
 someday-0.035
 spe-0.035
 cheaper-0.034
 enough-0.034
Token w
Token ID266
Feature activation+2.28e+00

Pos logit contributions
leaf+0.064
vell+0.061
ibrarian+0.061
ibr+0.059
iders+0.058
Neg logit contributions
 swe-0.041
 spe-0.039
 someday-0.037
 cheaper-0.037
 smaller-0.036
Tokenaffle
Token ID30697
Feature activation+2.79e+00

Pos logit contributions
leaf+0.058
ting+0.054
uted+0.049
nam+0.046
taker+0.046
Neg logit contributions
 cheaper-0.101
 swe-0.101
 coll-0.098
 enough-0.097
 problem-0.096
Token.
Token ID13
Feature activation+4.66e-01

Pos logit contributions
leaf+0.148
opher+0.140
ags+0.135
vell+0.131
idget+0.130
Neg logit contributions
 enough-0.062
 someday-0.058
 spe-0.053
 stream-0.052
 again-0.050
Token The
Token ID383
Feature activation+1.35e+00

Pos logit contributions
leaf+0.024
vell+0.023
ibrarian+0.023
taker+0.022
asc+0.021
Neg logit contributions
 someday-0.012
 One-0.012
 swe-0.012
 spe-0.011
 cheaper-0.011
Token w
Token ID266
Feature activation+3.39e+00

Pos logit contributions
leaf+0.073
ibrarian+0.066
iders+0.066
uted+0.065
vell+0.064
Neg logit contributions
 swe-0.039
 smaller-0.035
 One-0.032
 cheaper-0.032
 shin-0.032
Tokenaffle
Token ID30697
Feature activation+2.39e+00

Pos logit contributions
leaf+0.118
nam+0.109
 contain+0.106
ting+0.102
person+0.101
Neg logit contributions
 One-0.114
 swe-0.114
 problem-0.108
 cheaper-0.106
ib-0.106
Token was
Token ID373
Feature activation+5.69e-01

Pos logit contributions
leaf+0.148
vell+0.135
asc+0.132
ging+0.129
opher+0.129
Neg logit contributions
 grew-0.036
 One-0.035
c-0.033
 someday-0.033
 spe-0.033
Token very
Token ID845
Feature activation+1.12e+00

Pos logit contributions
leaf+0.026
ibrarian+0.024
vell+0.024
iders+0.023
taker+0.022
Neg logit contributions
 swe-0.019
 cheaper-0.018
 smaller-0.018
 spe-0.018
 someday-0.018
Token special
Token ID2041
Feature activation+1.32e+00

Pos logit contributions
leaf+0.057
ibrarian+0.055
ting+0.054
vell+0.053
iders+0.053
Neg logit contributions
 swe-0.033
 cheaper-0.031
 smaller-0.030
 spe-0.030
 someday-0.028
Token because
Token ID780
Feature activation+6.89e-01

Pos logit contributions
leaf+0.073
vell+0.070
ibrarian+0.070
ibr+0.067
asc+0.066
Neg logit contributions
 someday-0.034
 enough-0.031
 swe-0.030
 cheaper-0.029
 spe-0.028
Token.
Token ID13
Feature activation+4.46e-01

Pos logit contributions
leaf+0.002
ibrarian+0.002
vell+0.002
taker+0.002
asc+0.002
Neg logit contributions
 swe-0.001
 someday-0.001
 cheaper-0.001
 spe-0.001
 smaller-0.001
Token He
Token ID679
Feature activation+3.30e-01

Pos logit contributions
leaf+0.019
vell+0.019
ibrarian+0.019
taker+0.017
asc+0.017
Neg logit contributions
 swe-0.015
 someday-0.015
 One-0.014
 spe-0.014
 cheaper-0.014
Token laid
Token ID8104
Feature activation+7.21e-01

Pos logit contributions
leaf+0.015
ibrarian+0.014
vell+0.014
taker+0.013
asc+0.013
Neg logit contributions
 swe-0.011
 someday-0.010
 spe-0.010
 cheaper-0.010
 smaller-0.010
Token down
Token ID866
Feature activation-7.62e-01

Pos logit contributions
leaf+0.031
ibrarian+0.030
vell+0.028
ibr+0.027
iders+0.027
Neg logit contributions
 swe-0.026
 smaller-0.025
 someday-0.025
 enough-0.024
 spe-0.024
Token on
Token ID319
Feature activation-2.17e-01

Pos logit contributions
 swe+0.033
 someday+0.032
 spe+0.031
 cheaper+0.031
c+0.031
Neg logit contributions
leaf-0.024
ibrarian-0.023
vell-0.022
he-0.022
taker-0.022
Token the
Token ID262
Feature activation+3.38e+00

Pos logit contributions
 swe+0.007
 someday+0.006
 cheaper+0.006
 spe+0.006
c+0.006
Neg logit contributions
leaf-0.010
ibrarian-0.010
vell-0.009
taker-0.009
he-0.009
Token cushion
Token ID33579
Feature activation+9.37e-01

Pos logit contributions
leaf+0.174
asc+0.165
itt+0.159
ibr+0.157
asing+0.156
Neg logit contributions
 swe-0.126
 smaller-0.109
 again-0.101
 shorter-0.099
 softer-0.097
Token and
Token ID290
Feature activation+1.48e+00

Pos logit contributions
leaf+0.049
ibrarian+0.046
vell+0.046
asc+0.044
taker+0.043
Neg logit contributions
 someday-0.025
 swe-0.024
 enough-0.022
c-0.022
 cheaper-0.022
Token curled
Token ID41108
Feature activation+1.16e+00

Pos logit contributions
leaf+0.075
vell+0.073
ibrarian+0.073
asc+0.071
issors+0.070
Neg logit contributions
 swe-0.044
 someday-0.040
 smaller-0.039
 shin-0.038
 spe-0.038
Token up
Token ID510
Feature activation-1.68e-01

Pos logit contributions
leaf+0.048
ibrarian+0.046
vell+0.044
ibr+0.043
taker+0.042
Neg logit contributions
 swe-0.044
 enough-0.041
 someday-0.041
 smaller-0.040
 spe-0.040
Token.
Token ID13
Feature activation+9.41e-01

Pos logit contributions
 swe+0.008
 someday+0.007
 cheaper+0.007
 spe+0.007
c+0.007
Neg logit contributions
leaf-0.005
ibrarian-0.005
vell-0.005
taker-0.005
he-0.004
Token so
Token ID523
Feature activation+5.94e-01

Pos logit contributions
leaf+0.008
ibrarian+0.008
vell+0.008
taker+0.007
he+0.007
Neg logit contributions
 swe-0.006
 someday-0.006
 spe-0.006
 cheaper-0.006
 smaller-0.006
Token cool
Token ID3608
Feature activation+2.62e-01

Pos logit contributions
leaf+0.028
ibrarian+0.028
vell+0.026
asc+0.025
ibr+0.025
Neg logit contributions
 swe-0.018
 spe-0.017
 smaller-0.017
 cheaper-0.017
 someday-0.017
Token!
Token ID0
Feature activation-7.30e-01

Pos logit contributions
leaf+0.010
ibrarian+0.010
vell+0.009
taker+0.009
asc+0.008
Neg logit contributions
 swe-0.010
 someday-0.010
 cheaper-0.010
 spe-0.010
 enough-0.009
Token She
Token ID1375
Feature activation+7.87e-01

Pos logit contributions
 someday+0.027
 swe+0.027
 spe+0.027
 cheaper+0.026
 smaller+0.025
Neg logit contributions
leaf-0.028
ibrarian-0.027
taker-0.026
vell-0.025
he-0.025
Token touched
Token ID12615
Feature activation-7.03e-02

Pos logit contributions
leaf+0.034
ibrarian+0.033
vell+0.033
taker+0.031
asc+0.031
Neg logit contributions
 swe-0.027
 someday-0.026
 spe-0.025
 cheaper-0.024
 smaller-0.024
Token the
Token ID262
Feature activation+3.37e+00

Pos logit contributions
 swe+0.002
 someday+0.002
 cheaper+0.002
 spe+0.002
 smaller+0.002
Neg logit contributions
leaf-0.003
ibrarian-0.003
vell-0.003
taker-0.003
he-0.003
Token shapes
Token ID15268
Feature activation+4.26e-01

Pos logit contributions
izards+0.141
ibrarian+0.138
iders+0.134
asc+0.134
ying+0.133
Neg logit contributions
 smaller-0.126
 swe-0.116
 cheaper-0.111
 bigger-0.110
 coll-0.110
Token and
Token ID290
Feature activation+5.87e-01

Pos logit contributions
leaf+0.019
ibrarian+0.018
vell+0.017
taker+0.017
asc+0.017
Neg logit contributions
 swe-0.014
 someday-0.014
 spe-0.013
 cheaper-0.013
 smaller-0.013
Token they
Token ID484
Feature activation+1.25e+00

Pos logit contributions
leaf+0.026
ibrarian+0.026
vell+0.025
asc+0.024
taker+0.024
Neg logit contributions
 swe-0.020
 someday-0.019
 spe-0.018
 smaller-0.018
 cheaper-0.018
Token moved
Token ID3888
Feature activation-3.38e-01

Pos logit contributions
leaf+0.056
ibrarian+0.055
asc+0.052
vell+0.051
iders+0.050
Neg logit contributions
 spe-0.041
 swe-0.041
 someday-0.040
 smaller-0.039
c-0.037
Token around
Token ID1088
Feature activation-1.20e-01

Pos logit contributions
 swe+0.012
 someday+0.012
 cheaper+0.011
 spe+0.011
c+0.011
Neg logit contributions
leaf-0.014
ibrarian-0.013
vell-0.013
taker-0.012
he-0.012
Token than
Token ID621
Feature activation+1.40e+00

Pos logit contributions
leaf+0.002
ibrarian+0.002
vell+0.002
taker+0.002
asc+0.002
Neg logit contributions
 swe-0.002
 someday-0.002
 cheaper-0.002
 spe-0.002
 smaller-0.002
Token anything
Token ID1997
Feature activation+8.52e-01

Pos logit contributions
leaf+0.062
ibrarian+0.060
asc+0.059
ower+0.059
vell+0.058
Neg logit contributions
 someday-0.048
 enough-0.045
 smaller-0.043
 cheaper-0.043
 swe-0.042
Token he
Token ID339
Feature activation+7.95e-01

Pos logit contributions
leaf+0.038
ibrarian+0.037
vell+0.035
asc+0.034
taker+0.034
Neg logit contributions
 someday-0.028
 swe-0.028
 spe-0.027
 cheaper-0.026
 smaller-0.025
Token had
Token ID550
Feature activation+1.79e+00

Pos logit contributions
leaf+0.037
ibrarian+0.036
taker+0.035
vell+0.035
opher+0.034
Neg logit contributions
 someday-0.025
 swe-0.023
 spe-0.022
 cheaper-0.022
c-0.022
Token ever
Token ID1683
Feature activation-3.03e-01

Pos logit contributions
leaf+0.075
ibrarian+0.074
taker+0.072
iders+0.071
opher+0.070
Neg logit contributions
 someday-0.065
 enough-0.059
 cheaper-0.054
c-0.054
 swe-0.054
Token seen
Token ID1775
Feature activation+3.35e+00

Pos logit contributions
 swe+0.012
 someday+0.012
 spe+0.012
 cheaper+0.012
 smaller+0.011
Neg logit contributions
leaf-0.011
ibrarian-0.010
vell-0.010
taker-0.009
he-0.009
Token before
Token ID878
Feature activation+1.77e-01

Pos logit contributions
leaf+0.186
taker+0.167
ibrarian+0.162
vell+0.162
opher+0.158
Neg logit contributions
 someday-0.097
 again-0.093
 enough-0.089
c-0.079
 longer-0.072
Token.
Token ID13
Feature activation+1.36e+00

Pos logit contributions
leaf+0.008
ibrarian+0.008
vell+0.007
taker+0.007
asc+0.007
Neg logit contributions
 swe-0.006
 someday-0.006
 spe-0.005
 cheaper-0.005
 smaller-0.005
Token\n
Token ID198
Feature activation+1.01e+00

Pos logit contributions
leaf+0.079
vell+0.078
asc+0.073
ibrarian+0.073
opher+0.072
Neg logit contributions
 One-0.030
 someday-0.025
 Sometimes-0.024
c-0.022
 smaller-0.021
Token\n
Token ID198
Feature activation+1.16e+00

Pos logit contributions
leaf+0.058
vell+0.057
ibrarian+0.055
asc+0.054
taker+0.054
Neg logit contributions
 someday-0.018
 One-0.018
 swe-0.015
 smaller-0.015
 cheaper-0.015
TokenMark
Token ID9704
Feature activation-5.98e-01

Pos logit contributions
leaf+0.043
vell+0.041
opher+0.040
ibrarian+0.038
asc+0.038
Neg logit contributions
 someday-0.046
 One-0.045
One-0.044
 smaller-0.042
 swe-0.041
Tokenia
Token ID544
Feature activation-4.43e-01

Pos logit contributions
 swe+0.029
 spe+0.028
 someday+0.028
 cheaper+0.027
 shin+0.026
Neg logit contributions
ibrarian-0.021
leaf-0.019
vell-0.019
opher-0.018
iders-0.018
Token put
Token ID1234
Feature activation+1.46e+00

Pos logit contributions
 swe+0.015
 someday+0.014
 cheaper+0.014
 spe+0.014
 smaller+0.013
Neg logit contributions
leaf-0.019
ibrarian-0.019
vell-0.017
taker-0.017
ibr-0.017
Token her
Token ID607
Feature activation+1.75e+00

Pos logit contributions
leaf+0.075
vell+0.073
iders+0.073
opher+0.072
ower+0.071
Neg logit contributions
 swe-0.038
 smaller-0.037
c-0.037
 enough-0.035
 shin-0.032
Token hand
Token ID1021
Feature activation+8.26e-01

Pos logit contributions
leaf+0.077
iders+0.072
ibrarian+0.071
ting+0.068
coon+0.068
Neg logit contributions
 swe-0.064
 smaller-0.063
c-0.055
 cheaper-0.055
 shorter-0.055
Token on
Token ID319
Feature activation+6.02e-01

Pos logit contributions
leaf+0.029
ibrarian+0.028
vell+0.026
taker+0.025
opher+0.025
Neg logit contributions
 swe-0.036
 someday-0.035
c-0.033
 smaller-0.033
 cheaper-0.033
Token the
Token ID262
Feature activation+3.32e+00

Pos logit contributions
leaf+0.033
ibrarian+0.031
vell+0.031
asc+0.030
taker+0.030
Neg logit contributions
 swe-0.015
 someday-0.014
 spe-0.013
 smaller-0.013
 cheaper-0.013
Token cold
Token ID4692
Feature activation+2.00e+00

Pos logit contributions
iders+0.159
izards+0.154
leaf+0.152
itt+0.150
eria+0.148
Neg logit contributions
 smaller-0.103
 swe-0.092
 shorter-0.090
 again-0.086
 shin-0.084
Token mirror
Token ID10162
Feature activation+6.63e-01

Pos logit contributions
leaf+0.093
vell+0.087
ibrarian+0.083
uted+0.082
izards+0.078
Neg logit contributions
 swe-0.068
 someday-0.065
 again-0.061
 enough-0.060
 smaller-0.058
Token and
Token ID290
Feature activation+1.53e+00

Pos logit contributions
leaf+0.034
ibrarian+0.032
vell+0.031
taker+0.030
asc+0.030
Neg logit contributions
 someday-0.018
 swe-0.018
 cheaper-0.016
 spe-0.016
c-0.016
Token started
Token ID2067
Feature activation-4.04e-03

Pos logit contributions
leaf+0.079
vell+0.078
ibrarian+0.077
opher+0.076
asc+0.074
Neg logit contributions
 swe-0.042
 someday-0.040
 smaller-0.038
 enough-0.037
 shin-0.037
Token to
Token ID284
Feature activation+7.03e-01

Pos logit contributions
 swe+0.000
 someday+0.000
 cheaper+0.000
 spe+0.000
 smaller+0.000
Neg logit contributions
leaf-0.000
ibrarian-0.000
vell-0.000
taker-0.000
asc-0.000
Token seen
Token ID1775
Feature activation+2.81e+00

Pos logit contributions
leaf+0.016
ibrarian+0.015
vell+0.015
taker+0.014
opher+0.014
Neg logit contributions
 someday-0.014
 swe-0.014
 cheaper-0.013
 spe-0.013
 smaller-0.013
Token before
Token ID878
Feature activation+2.46e-01

Pos logit contributions
leaf+0.144
taker+0.133
vell+0.132
opher+0.124
ower+0.123
Neg logit contributions
 someday-0.088
 enough-0.087
 again-0.084
c-0.075
 swe-0.073
Token -
Token ID532
Feature activation+5.26e-01

Pos logit contributions
leaf+0.012
ibrarian+0.011
vell+0.011
taker+0.011
asc+0.010
Neg logit contributions
 swe-0.007
 someday-0.007
 cheaper-0.007
 spe-0.007
 smaller-0.007
Token a
Token ID257
Feature activation+6.03e-01

Pos logit contributions
leaf+0.026
ibrarian+0.026
vell+0.025
taker+0.024
asc+0.024
Neg logit contributions
 swe-0.015
 someday-0.014
 spe-0.013
 cheaper-0.013
 smaller-0.013
Token w
Token ID266
Feature activation+2.46e+00

Pos logit contributions
leaf+0.027
ibrarian+0.026
vell+0.025
taker+0.025
asc+0.024
Neg logit contributions
 swe-0.019
 spe-0.018
 someday-0.018
 cheaper-0.018
 smaller-0.018
Tokenaffle
Token ID30697
Feature activation+3.32e+00

Pos logit contributions
leaf+0.062
ting+0.057
uted+0.056
taker+0.055
nam+0.054
Neg logit contributions
 problem-0.111
 cheaper-0.108
 enough-0.108
 spe-0.107
 swe-0.107
Token!
Token ID0
Feature activation+9.28e-01

Pos logit contributions
leaf+0.154
ags+0.143
opher+0.129
taker+0.128
issors+0.128
Neg logit contributions
 enough-0.106
 stream-0.099
 spe-0.097
 again-0.097
 someday-0.095
Token She
Token ID1375
Feature activation+4.46e-01

Pos logit contributions
leaf+0.047
vell+0.047
ibrarian+0.045
asc+0.044
iders+0.042
Neg logit contributions
 One-0.025
 swe-0.025
 someday-0.023
 cheaper-0.022
 enough-0.021
Token was
Token ID373
Feature activation+1.57e+00

Pos logit contributions
leaf+0.021
ibrarian+0.020
vell+0.020
taker+0.019
he+0.019
Neg logit contributions
 swe-0.014
 someday-0.013
 spe-0.013
 smaller-0.012
 cheaper-0.012
Token so
Token ID523
Feature activation+1.56e+00

Pos logit contributions
leaf+0.076
vell+0.074
iders+0.073
ibrarian+0.072
affles+0.071
Neg logit contributions
 smaller-0.047
 enough-0.045
 spe-0.043
 swe-0.042
 cheaper-0.042
Token confused
Token ID10416
Feature activation+8.56e-01

Pos logit contributions
leaf+0.074
ibrarian+0.071
roach+0.071
ting+0.070
coon+0.069
Neg logit contributions
 smaller-0.050
 spe-0.044
 swe-0.042
 shorter-0.042
 cheaper-0.042
Token snacks
Token ID28487
Feature activation+9.80e-01

Pos logit contributions
vell+0.080
iders+0.078
ibrarian+0.077
asc+0.075
leaf+0.075
Neg logit contributions
 swe-0.055
 smaller-0.050
 spe-0.048
 cheaper-0.047
 coll-0.046
Token.
Token ID13
Feature activation-3.47e-02

Pos logit contributions
leaf+0.050
vell+0.047
ibrarian+0.047
taker+0.045
opher+0.045
Neg logit contributions
 someday-0.027
 swe-0.025
 enough-0.024
 cheaper-0.024
 spe-0.023
Token She
Token ID1375
Feature activation-2.29e-01

Pos logit contributions
 swe+0.001
 someday+0.001
 cheaper+0.001
 spe+0.001
c+0.001
Neg logit contributions
leaf-0.002
ibrarian-0.001
vell-0.001
taker-0.001
asc-0.001
Token saw
Token ID2497
Feature activation+8.56e-01

Pos logit contributions
 swe+0.008
 someday+0.008
 cheaper+0.007
 spe+0.007
c+0.007
Neg logit contributions
leaf-0.010
ibrarian-0.010
vell-0.009
taker-0.009
he-0.008
Token the
Token ID262
Feature activation+1.50e+00

Pos logit contributions
leaf+0.043
vell+0.043
asc+0.040
ower+0.040
ibrarian+0.040
Neg logit contributions
 smaller-0.022
 swe-0.022
 enough-0.021
 someday-0.021
 spe-0.020
Token pupp
Token ID13595
Feature activation+3.31e+00

Pos logit contributions
leaf+0.067
ibrarian+0.062
vell+0.062
asc+0.062
ying+0.061
Neg logit contributions
 swe-0.050
 smaller-0.048
 cheaper-0.046
 coll-0.045
 shin-0.044
Tokenets
Token ID1039
Feature activation+1.08e+00

Pos logit contributions
nam+0.120
asc+0.112
 oh+0.110
person+0.106
leaf+0.106
Neg logit contributions
 spe-0.107
 swe-0.106
 stream-0.104
 coll-0.103
 becomes-0.101
Token and
Token ID290
Feature activation-8.11e-02

Pos logit contributions
leaf+0.053
ibrarian+0.050
vell+0.050
asc+0.047
taker+0.047
Neg logit contributions
 swe-0.030
 someday-0.029
 spe-0.028
 enough-0.028
 smaller-0.028
Token smiled
Token ID13541
Feature activation-4.02e-01

Pos logit contributions
 swe+0.003
 someday+0.003
 cheaper+0.003
 spe+0.003
c+0.003
Neg logit contributions
leaf-0.003
ibrarian-0.003
vell-0.003
taker-0.003
 character-0.003
Token.
Token ID13
Feature activation+1.66e-01

Pos logit contributions
 swe+0.014
 spe+0.014
 someday+0.014
 cheaper+0.014
c+0.014
Neg logit contributions
leaf-0.015
ibrarian-0.015
vell-0.015
taker-0.014
he-0.014
Token\n
Token ID198
Feature activation+9.95e-02

Pos logit contributions
leaf+0.007
ibrarian+0.007
vell+0.007
taker+0.007
he+0.007
Neg logit contributions
 swe-0.005
 someday-0.005
 spe-0.005
 cheaper-0.005
 smaller-0.005
Token the
Token ID262
Feature activation+2.53e+00

Pos logit contributions
leaf+0.042
ibrarian+0.041
vell+0.041
taker+0.038
asc+0.038
Neg logit contributions
 swe-0.024
 someday-0.023
 cheaper-0.022
 enough-0.022
 smaller-0.021
Token big
Token ID1263
Feature activation+1.48e+00

Pos logit contributions
leaf+0.111
ibrarian+0.109
ped+0.108
opher+0.106
asc+0.106
Neg logit contributions
 swe-0.079
 cheaper-0.077
 smaller-0.077
 shin-0.070
 less-0.069
Token black
Token ID2042
Feature activation+2.72e+00

Pos logit contributions
leaf+0.063
vell+0.060
ibrarian+0.059
taker+0.055
bled+0.054
Neg logit contributions
 swe-0.055
 someday-0.050
 cheaper-0.050
 smaller-0.050
 enough-0.049
Tokenboard
Token ID3526
Feature activation+1.52e+00

Pos logit contributions
leaf+0.092
vell+0.087
bled+0.083
uted+0.077
taker+0.077
Neg logit contributions
 again-0.114
 someday-0.111
 ones-0.110
 One-0.109
 stream-0.108
Token on
Token ID319
Feature activation+8.89e-02

Pos logit contributions
leaf+0.075
vell+0.070
ibrarian+0.068
asc+0.068
opher+0.067
Neg logit contributions
 someday-0.044
 swe-0.042
 cheaper-0.041
 enough-0.040
 One-0.040
Token the
Token ID262
Feature activation+3.30e+00

Pos logit contributions
leaf+0.004
ibrarian+0.004
vell+0.004
taker+0.004
asc+0.004
Neg logit contributions
 swe-0.002
 someday-0.002
 cheaper-0.002
 spe-0.002
 smaller-0.002
Token wall
Token ID3355
Feature activation+9.43e-01

Pos logit contributions
asc+0.147
leaf+0.144
uted+0.142
dd+0.142
izards+0.141
Neg logit contributions
 swe-0.101
 smaller-0.101
 stream-0.096
 cheaper-0.092
 coll-0.089
Token.
Token ID13
Feature activation+5.10e-01

Pos logit contributions
leaf+0.047
ibrarian+0.044
vell+0.044
asc+0.042
taker+0.041
Neg logit contributions
 someday-0.027
 swe-0.026
 cheaper-0.025
c-0.024
 smaller-0.024
Token Tim
Token ID5045
Feature activation+1.06e+00

Pos logit contributions
leaf+0.021
ibrarian+0.020
vell+0.020
taker+0.018
asc+0.018
Neg logit contributions
 swe-0.018
 someday-0.018
 spe-0.017
 One-0.017
 cheaper-0.017
Token had
Token ID550
Feature activation-4.10e-01

Pos logit contributions
leaf+0.057
vell+0.055
ibrarian+0.053
taker+0.052
ibr+0.051
Neg logit contributions
 swe-0.026
 someday-0.025
c-0.025
 One-0.023
 enough-0.023
Token a
Token ID257
Feature activation-5.09e-01

Pos logit contributions
 swe+0.014
 someday+0.014
 spe+0.013
 cheaper+0.013
c+0.012
Neg logit contributions
leaf-0.017
ibrarian-0.017
vell-0.016
taker-0.015
 character-0.015
Token\n
Token ID198
Feature activation+1.56e-01

Pos logit contributions
leaf+0.019
vell+0.018
ibrarian+0.018
taker+0.017
asc+0.017
Neg logit contributions
 swe-0.008
 someday-0.008
 One-0.008
 cheaper-0.008
 spe-0.007
Token\n
Token ID198
Feature activation+7.79e-01

Pos logit contributions
leaf+0.008
ibrarian+0.008
vell+0.007
taker+0.007
asc+0.007
Neg logit contributions
 swe-0.004
 someday-0.004
 cheaper-0.004
 spe-0.004
 smaller-0.003
TokenTom
Token ID13787
Feature activation-5.07e-01

Pos logit contributions
leaf+0.037
vell+0.037
taker+0.035
opher+0.034
ibrarian+0.034
Neg logit contributions
 One-0.023
 someday-0.023
 swe-0.023
One-0.022
c-0.021
Token jumped
Token ID11687
Feature activation+9.47e-01

Pos logit contributions
 swe+0.019
 someday+0.018
 cheaper+0.018
 spe+0.018
 smaller+0.017
Neg logit contributions
ibrarian-0.020
leaf-0.019
vell-0.017
taker-0.017
asc-0.017
Token on
Token ID319
Feature activation+2.73e-02

Pos logit contributions
leaf+0.041
ibrarian+0.038
vell+0.038
ibr+0.036
opher+0.035
Neg logit contributions
 swe-0.033
 smaller-0.032
 enough-0.032
 someday-0.031
 again-0.031
Token the
Token ID262
Feature activation+3.30e+00

Pos logit contributions
leaf+0.001
ibrarian+0.001
vell+0.001
taker+0.001
he+0.001
Neg logit contributions
 swe-0.001
 someday-0.001
 cheaper-0.001
 spe-0.001
 smaller-0.001
Token table
Token ID3084
Feature activation+1.12e+00

Pos logit contributions
ibr+0.152
vell+0.149
izards+0.146
asso+0.139
 ogre+0.136
Neg logit contributions
 smaller-0.107
 swe-0.105
 cheaper-0.098
 bigger-0.091
 shorter-0.089
Token to
Token ID284
Feature activation+1.74e+00

Pos logit contributions
leaf+0.064
vell+0.060
ibrarian+0.059
asc+0.057
taker+0.056
Neg logit contributions
 someday-0.026
 enough-0.024
 swe-0.023
 smaller-0.022
 One-0.022
Token get
Token ID651
Feature activation+4.91e-01

Pos logit contributions
vell+0.093
ower+0.090
taker+0.089
leaf+0.088
ibr+0.088
Neg logit contributions
 smaller-0.045
 One-0.042
 someday-0.042
 swe-0.041
 another-0.040
Token a
Token ID257
Feature activation+3.68e-01

Pos logit contributions
leaf+0.025
ibrarian+0.025
vell+0.024
iders+0.023
taker+0.023
Neg logit contributions
 swe-0.013
 smaller-0.012
 someday-0.012
 cheaper-0.012
 spe-0.011
Token closer
Token ID5699
Feature activation-5.84e-01

Pos logit contributions
leaf+0.016
ibrarian+0.016
vell+0.015
taker+0.015
asc+0.015
Neg logit contributions
 swe-0.012
 someday-0.011
 cheaper-0.011
 spe-0.011
 smaller-0.011
Token branches
Token ID13737
Feature activation-8.80e-01

Pos logit contributions
 swe+0.003
 someday+0.003
 cheaper+0.003
 spe+0.003
c+0.003
Neg logit contributions
leaf-0.004
ibrarian-0.004
vell-0.003
taker-0.003
he-0.003
Token.
Token ID13
Feature activation-9.77e-01

Pos logit contributions
 swe+0.036
 someday+0.033
 spe+0.033
 cheaper+0.032
c+0.031
Neg logit contributions
leaf-0.030
he-0.030
ibrarian-0.029
vell-0.029
taker-0.029
Token They
Token ID1119
Feature activation+8.67e-02

Pos logit contributions
 swe+0.034
 someday+0.032
 spe+0.032
 cheaper+0.031
c+0.030
Neg logit contributions
leaf-0.040
ibrarian-0.038
he-0.036
taker-0.036
ibr-0.035
Token make
Token ID787
Feature activation-1.75e+00

Pos logit contributions
leaf+0.004
ibrarian+0.004
vell+0.004
taker+0.003
asc+0.003
Neg logit contributions
 swe-0.003
 someday-0.003
 spe-0.003
 cheaper-0.003
 smaller-0.003
Token the
Token ID262
Feature activation-6.57e-01

Pos logit contributions
 someday+0.067
 One+0.060
 cheaper+0.060
 spe+0.059
 swe+0.059
Neg logit contributions
leaf-0.063
ong-0.060
ibrarian-0.059
vell-0.058
taker-0.056
Token tree
Token ID5509
Feature activation-3.93e+00

Pos logit contributions
 someday+0.024
 swe+0.024
 cheaper+0.023
 spe+0.022
c+0.022
Neg logit contributions
leaf-0.024
ibrarian-0.024
vell-0.023
he-0.022
 goodness-0.022
Token colorful
Token ID20239
Feature activation-1.61e+00

Pos logit contributions
 someday+0.160
�+0.149
use+0.148
Gu+0.146
 sometime+0.145
Neg logit contributions
 around-0.119
he-0.117
 outside-0.117
ong-0.111
vell-0.108
Token and
Token ID290
Feature activation-1.24e-01

Pos logit contributions
 One+0.061
 cheaper+0.061
 someday+0.060
 swe+0.059
One+0.057
Neg logit contributions
leaf-0.052
vell-0.052
ibrarian-0.049
 outside-0.049
he-0.048
Token bright
Token ID6016
Feature activation-1.28e+00

Pos logit contributions
 swe+0.004
 someday+0.004
 cheaper+0.004
 spe+0.004
c+0.004
Neg logit contributions
leaf-0.005
ibrarian-0.005
vell-0.005
taker-0.005
he-0.005
Token.
Token ID13
Feature activation-8.33e-01

Pos logit contributions
 someday+0.048
 cheaper+0.048
c+0.046
 swe+0.046
 One+0.045
Neg logit contributions
leaf-0.043
 goodness-0.039
vell-0.039
 outside-0.039
he-0.039
Token\n
Token ID198
Feature activation+2.97e-02

Pos logit contributions
 swe+0.029
 someday+0.028
 spe+0.027
 cheaper+0.027
c+0.026
Neg logit contributions
leaf-0.034
ibrarian-0.033
he-0.030
taker-0.030
asc-0.030
Token it
Token ID340
Feature activation-6.66e-01

Pos logit contributions
 swe+0.016
 someday+0.016
 spe+0.015
 cheaper+0.015
c+0.014
Neg logit contributions
leaf-0.026
ibrarian-0.025
vell-0.024
taker-0.024
he-0.023
Token.
Token ID13
Feature activation-2.07e-01

Pos logit contributions
 swe+0.026
 spe+0.026
 someday+0.025
 cheaper+0.025
c+0.024
Neg logit contributions
ibrarian-0.022
vell-0.022
leaf-0.022
taker-0.021
he-0.021
Token He
Token ID679
Feature activation-1.28e+00

Pos logit contributions
 swe+0.007
 someday+0.006
 spe+0.006
 cheaper+0.006
c+0.006
Neg logit contributions
leaf-0.009
ibrarian-0.009
vell-0.008
taker-0.008
he-0.008
Token makes
Token ID1838
Feature activation-2.94e-02

Pos logit contributions
 someday+0.045
 swe+0.043
 cheaper+0.041
 spe+0.040
c+0.039
Neg logit contributions
ibrarian-0.054
leaf-0.052
vell-0.046
ibr-0.045
izards-0.045
Token the
Token ID262
Feature activation+1.43e-01

Pos logit contributions
 swe+0.001
 someday+0.001
 spe+0.001
 cheaper+0.001
c+0.001
Neg logit contributions
leaf-0.001
ibrarian-0.001
vell-0.001
taker-0.001
he-0.001
Token tree
Token ID5509
Feature activation-3.79e+00

Pos logit contributions
leaf+0.006
ibrarian+0.006
vell+0.005
taker+0.005
asc+0.005
Neg logit contributions
 swe-0.005
 someday-0.005
 spe-0.005
 cheaper-0.005
 smaller-0.005
Token look
Token ID804
Feature activation-1.06e+00

Pos logit contributions
 someday+0.147
�+0.135
 sometime+0.132
use+0.132
One+0.129
Neg logit contributions
 around-0.120
 outside-0.119
taker-0.119
he-0.116
ong-0.116
Token more
Token ID517
Feature activation-5.43e-01

Pos logit contributions
 someday+0.040
 spe+0.039
c+0.039
 swe+0.037
 One+0.036
Neg logit contributions
leaf-0.038
vell-0.037
taker-0.036
ibrarian-0.034
ong-0.034
Token beautiful
Token ID4950
Feature activation-3.35e-01

Pos logit contributions
 someday+0.019
 swe+0.019
c+0.018
 One+0.018
 spe+0.018
Neg logit contributions
leaf-0.021
vell-0.020
ibrarian-0.020
taker-0.019
he-0.019
Token and
Token ID290
Feature activation-5.15e-01

Pos logit contributions
 swe+0.012
 someday+0.011
 cheaper+0.011
 spe+0.011
c+0.011
Neg logit contributions
leaf-0.014
ibrarian-0.013
vell-0.013
taker-0.012
he-0.012
Token fun
Token ID1257
Feature activation+3.00e-01

Pos logit contributions
 someday+0.016
 swe+0.016
 spe+0.015
c+0.015
 One+0.015
Neg logit contributions
leaf-0.022
ibrarian-0.021
vell-0.021
taker-0.020
he-0.020
Token happy
Token ID3772
Feature activation+3.07e-01

Pos logit contributions
leaf+0.023
ibrarian+0.022
vell+0.021
iders+0.020
asc+0.020
Neg logit contributions
 swe-0.015
 someday-0.015
 spe-0.015
 smaller-0.015
 cheaper-0.015
Token that
Token ID326
Feature activation-3.58e-01

Pos logit contributions
leaf+0.015
ibrarian+0.014
vell+0.014
taker+0.013
asc+0.013
Neg logit contributions
 someday-0.009
 swe-0.009
 spe-0.008
 cheaper-0.008
 enough-0.008
Token he
Token ID339
Feature activation+8.69e-01

Pos logit contributions
 swe+0.012
 spe+0.012
 cheaper+0.012
 someday+0.012
c+0.011
Neg logit contributions
leaf-0.014
ibrarian-0.014
taker-0.014
vell-0.014
he-0.013
Token gave
Token ID2921
Feature activation-2.05e-01

Pos logit contributions
leaf+0.039
ibrarian+0.038
vell+0.036
iders+0.036
opher+0.036
Neg logit contributions
 swe-0.027
 someday-0.026
 spe-0.026
 cheaper-0.025
 shin-0.024
Token the
Token ID262
Feature activation-1.89e+00

Pos logit contributions
 swe+0.006
 someday+0.006
 spe+0.006
 cheaper+0.006
 smaller+0.006
Neg logit contributions
leaf-0.009
ibrarian-0.009
vell-0.009
taker-0.009
he-0.008
Token little
Token ID1310
Feature activation-3.64e+00

Pos logit contributions
 someday+0.087
 sometime+0.082
 becomes+0.077
 swe+0.077
c+0.075
Neg logit contributions
ibrarian-0.052
vell-0.052
taker-0.049
leaf-0.049
he-0.048
Token bird
Token ID6512
Feature activation+3.98e-01

Pos logit contributions
 sometime+0.143
 becomes+0.143
oral+0.137
 Grad+0.137
 fade+0.135
Neg logit contributions
 ear-0.095
 goodness-0.086
person-0.086
 character-0.085
 pupil-0.082
Token a
Token ID257
Feature activation+1.02e-01

Pos logit contributions
leaf+0.019
ibrarian+0.019
vell+0.018
taker+0.018
iders+0.017
Neg logit contributions
 swe-0.011
 someday-0.011
c-0.010
 cheaper-0.010
 spe-0.010
Token big
Token ID1263
Feature activation-9.59e-01

Pos logit contributions
leaf+0.004
ibrarian+0.004
vell+0.004
taker+0.004
asc+0.004
Neg logit contributions
 swe-0.003
 someday-0.003
 cheaper-0.003
 spe-0.003
 smaller-0.003
Token hug
Token ID16225
Feature activation+1.08e+00

Pos logit contributions
 someday+0.045
 swe+0.044
 cheaper+0.043
c+0.042
 spe+0.041
Neg logit contributions
leaf-0.028
ibrarian-0.026
taker-0.025
vell-0.024
 character-0.023
Token.
Token ID13
Feature activation-2.75e-02

Pos logit contributions
leaf+0.056
vell+0.053
ibrarian+0.053
iders+0.050
asc+0.050
Neg logit contributions
 someday-0.029
 swe-0.027
 enough-0.027
 cheaper-0.025
 spe-0.025
Token Tim
Token ID5045
Feature activation-1.29e+00

Pos logit contributions
 swe+0.014
 someday+0.013
 cheaper+0.013
 spe+0.013
 smaller+0.013
Neg logit contributions
leaf-0.021
ibrarian-0.020
vell-0.019
taker-0.019
 character-0.018
Tokenmy
Token ID1820
Feature activation-1.20e+00

Pos logit contributions
 spe+0.060
 cheaper+0.058
 swe+0.056
 someday+0.055
 smaller+0.055
Neg logit contributions
ibrarian-0.035
he-0.034
leaf-0.033
asc-0.032
vell-0.032
Token's
Token ID338
Feature activation+3.88e-01

Pos logit contributions
 swe+0.045
 cheaper+0.044
 someday+0.042
 spe+0.042
 smaller+0.042
Neg logit contributions
leaf-0.044
ibrarian-0.043
taker-0.040
vell-0.040
he-0.040
Token blanket
Token ID18447
Feature activation-6.84e-01

Pos logit contributions
leaf+0.016
ibrarian+0.016
vell+0.015
taker+0.014
asc+0.014
Neg logit contributions
 swe-0.014
 someday-0.014
 cheaper-0.013
 spe-0.013
 smaller-0.013
Token would
Token ID561
Feature activation-1.67e+00

Pos logit contributions
 swe+0.023
 cheaper+0.022
 someday+0.022
 spe+0.021
 smaller+0.020
Neg logit contributions
leaf-0.028
ibrarian-0.028
vell-0.027
taker-0.026
he-0.026
Token become
Token ID1716
Feature activation-3.62e+00

Pos logit contributions
 swe+0.063
 cheaper+0.063
c+0.060
 enough+0.058
 shin+0.057
Neg logit contributions
he-0.055
vell-0.053
leaf-0.052
taker-0.052
ibrarian-0.051
Token lively
Token ID29696
Feature activation-1.22e+00

Pos logit contributions
oral+0.145
 bidder+0.139
�+0.134
 Sometimes+0.130
 One+0.129
Neg logit contributions
 everywhere-0.110
vell-0.109
 outside-0.108
 around-0.105
 behind-0.094
Token whenever
Token ID8797
Feature activation-1.72e+00

Pos logit contributions
 swe+0.030
 cheaper+0.030
 spe+0.029
c+0.028
 shin+0.027
Neg logit contributions
leaf-0.057
vell-0.055
ibrarian-0.055
 character-0.054
taker-0.054
Token they
Token ID484
Feature activation+5.06e-01

Pos logit contributions
c+0.058
 swe+0.057
 cheaper+0.055
 spe+0.053
 shin+0.053
Neg logit contributions
leaf-0.066
 character-0.064
taker-0.063
 goodness-0.062
ibrarian-0.061
Token wanted
Token ID2227
Feature activation-1.30e+00

Pos logit contributions
leaf+0.024
ibrarian+0.023
vell+0.022
taker+0.021
asc+0.021
Neg logit contributions
 swe-0.015
 someday-0.015
 spe-0.014
c-0.014
 cheaper-0.014
Token to
Token ID284
Feature activation-1.68e+00

Pos logit contributions
 swe+0.048
 shin+0.047
 cheaper+0.047
 spe+0.047
c+0.046
Neg logit contributions
leaf-0.044
 character-0.041
 outside-0.041
ibrarian-0.040
taker-0.040
Tokeniring
Token ID3428
Feature activation-1.04e-01

Pos logit contributions
leaf+0.002
ibrarian+0.002
vell+0.002
taker+0.001
asc+0.001
Neg logit contributions
 swe-0.009
 someday-0.009
 cheaper-0.009
 spe-0.009
 smaller-0.009
Token,
Token ID11
Feature activation+4.35e-01

Pos logit contributions
 swe+0.004
 someday+0.003
 cheaper+0.003
 spe+0.003
c+0.003
Neg logit contributions
leaf-0.004
ibrarian-0.004
vell-0.004
taker-0.004
he-0.004
Token until
Token ID1566
Feature activation-1.14e+00

Pos logit contributions
leaf+0.017
ibrarian+0.017
vell+0.016
taker+0.015
asc+0.015
Neg logit contributions
 swe-0.017
 someday-0.016
 spe-0.016
 cheaper-0.016
 smaller-0.015
Token it
Token ID340
Feature activation-1.46e+00

Pos logit contributions
 swe+0.045
c+0.041
 cheaper+0.039
 coll+0.038
 spe+0.037
Neg logit contributions
leaf-0.041
ibrarian-0.040
he-0.040
taker-0.039
 character-0.039
Token started
Token ID2067
Feature activation-2.71e+00

Pos logit contributions
 cheaper+0.053
 swe+0.053
 spe+0.052
 someday+0.051
 coll+0.049
Neg logit contributions
ibrarian-0.054
he-0.052
vell-0.049
taker-0.048
leaf-0.048
Token to
Token ID284
Feature activation-3.59e+00

Pos logit contributions
 cheaper+0.116
 someday+0.113
 bidder+0.112
 spe+0.111
�+0.111
Neg logit contributions
 outside-0.075
 around-0.070
 everywhere-0.068
 supposed-0.067
 about-0.058
Token get
Token ID651
Feature activation-2.65e+00

Pos logit contributions
c+0.150
 cheaper+0.141
 coll+0.136
 swe+0.134
ib+0.132
Neg logit contributions
he-0.087
 ground-0.085
 lying-0.085
 all-0.084
 outside-0.082
Token dark
Token ID3223
Feature activation-1.48e+00

Pos logit contributions
 bidder+0.106
�+0.106
 someday+0.102
c+0.100
 coll+0.099
Neg logit contributions
 everywhere-0.083
 around-0.081
 outside-0.079
vell-0.074
lessly-0.071
Token.
Token ID13
Feature activation+5.41e-01

Pos logit contributions
 swe+0.060
 cheaper+0.059
c+0.057
 someday+0.056
 spe+0.056
Neg logit contributions
 outside-0.047
leaf-0.044
 around-0.044
he-0.043
vell-0.043
Token He
Token ID679
Feature activation-5.77e-01

Pos logit contributions
leaf+0.026
ibrarian+0.026
vell+0.026
asc+0.024
taker+0.024
Neg logit contributions
 swe-0.015
 One-0.015
 someday-0.015
 cheaper-0.014
 spe-0.014
Token started
Token ID2067
Feature activation-1.15e+00

Pos logit contributions
 swe+0.020
 cheaper+0.019
 someday+0.019
 spe+0.018
c+0.017
Neg logit contributions
leaf-0.024
ibrarian-0.024
vell-0.022
taker-0.022
 character-0.021
Token your
Token ID534
Feature activation-4.33e-01

Pos logit contributions
leaf+0.046
ibrarian+0.045
vell+0.045
taker+0.043
he+0.043
Neg logit contributions
 swe-0.030
 someday-0.028
 enough-0.027
 cheaper-0.027
 smaller-0.026
Token name
Token ID1438
Feature activation+9.90e-01

Pos logit contributions
 swe+0.019
 someday+0.019
 spe+0.019
 cheaper+0.018
c+0.018
Neg logit contributions
leaf-0.013
ibrarian-0.013
vell-0.012
taker-0.011
 character-0.011
Token?
Token ID30
Feature activation+8.86e-01

Pos logit contributions
leaf+0.044
vell+0.042
ibrarian+0.041
taker+0.040
opher+0.038
Neg logit contributions
 someday-0.032
 swe-0.032
 cheaper-0.030
c-0.030
 enough-0.029
Token'.
Token ID4458
Feature activation+2.01e-01

Pos logit contributions
leaf+0.035
vell+0.034
ibrarian+0.034
taker+0.032
asc+0.032
Neg logit contributions
 someday-0.031
 swe-0.031
 One-0.030
 spe-0.029
 cheaper-0.029
Token The
Token ID383
Feature activation-2.59e+00

Pos logit contributions
leaf+0.010
ibrarian+0.010
vell+0.010
taker+0.009
asc+0.009
Neg logit contributions
 swe-0.005
 someday-0.005
 spe-0.005
 cheaper-0.005
 One-0.005
Token little
Token ID1310
Feature activation-3.56e+00

Pos logit contributions
onna+0.102
 someday+0.101
 sometime+0.095
 fade+0.089
 becomes+0.085
Neg logit contributions
 ear-0.081
 children-0.078
 goodness-0.075
 adults-0.073
 outside-0.071
Token pig
Token ID12967
Feature activation+1.24e-01

Pos logit contributions
onna+0.173
 sometime+0.155
 someday+0.154
 fade+0.152
 fades+0.143
Neg logit contributions
 goodness-0.093
 children-0.091
ting-0.090
 ear-0.083
 outside-0.079
Tokenlet
Token ID1616
Feature activation+5.25e-01

Pos logit contributions
leaf+0.006
ibrarian+0.005
vell+0.005
taker+0.005
asc+0.005
Neg logit contributions
 swe-0.004
 someday-0.004
 spe-0.004
 cheaper-0.004
c-0.003
Token bl
Token ID698
Feature activation+4.52e-01

Pos logit contributions
leaf+0.027
vell+0.025
ibrarian+0.025
taker+0.024
asc+0.024
Neg logit contributions
 swe-0.014
 someday-0.013
c-0.012
 enough-0.012
 cheaper-0.012
Tokenushed
Token ID7474
Feature activation+3.98e-01

Pos logit contributions
leaf+0.012
ibrarian+0.011
vell+0.010
taker+0.010
he+0.009
Neg logit contributions
 someday-0.023
 swe-0.023
c-0.022
 cheaper-0.021
 spe-0.021
Token and
Token ID290
Feature activation+7.48e-01

Pos logit contributions
leaf+0.015
ibrarian+0.015
vell+0.014
taker+0.013
asc+0.013
Neg logit contributions
 swe-0.016
 someday-0.016
 spe-0.015
 cheaper-0.014
 smaller-0.014
Token and
Token ID290
Feature activation-7.57e-01

Pos logit contributions
 swe+0.050
 cheaper+0.049
 someday+0.048
 spe+0.047
 shin+0.047
Neg logit contributions
ibrarian-0.038
leaf-0.037
vell-0.037
he-0.037
 outside-0.035
Token pretend
Token ID16614
Feature activation-7.86e-01

Pos logit contributions
 someday+0.028
 swe+0.027
 cheaper+0.027
 spe+0.026
c+0.026
Neg logit contributions
leaf-0.029
ibrarian-0.026
vell-0.025
taker-0.025
 character-0.024
Token they
Token ID484
Feature activation-7.55e-01

Pos logit contributions
 swe+0.022
 someday+0.021
c+0.021
 spe+0.020
 cheaper+0.020
Neg logit contributions
leaf-0.036
ibrarian-0.035
vell-0.034
taker-0.034
 character-0.033
Token were
Token ID547
Feature activation-6.92e-01

Pos logit contributions
 swe+0.032
 someday+0.030
 cheaper+0.029
 spe+0.028
c+0.028
Neg logit contributions
leaf-0.025
ibrarian-0.025
vell-0.025
he-0.024
taker-0.022
Token going
Token ID1016
Feature activation-1.82e+00

Pos logit contributions
 swe+0.029
 someday+0.028
 spe+0.027
 cheaper+0.027
 One+0.027
Neg logit contributions
leaf-0.023
vell-0.023
ibrarian-0.022
asc-0.021
taker-0.020
Token on
Token ID319
Feature activation-3.56e+00

Pos logit contributions
 swe+0.081
 spe+0.080
 someday+0.077
 god+0.076
 cheaper+0.074
Neg logit contributions
 outside-0.057
 around-0.054
leaf-0.051
vell-0.049
 everywhere-0.047
Token a
Token ID257
Feature activation-1.50e+00

Pos logit contributions
 swe+0.153
 god+0.152
 fades+0.150
 becomes+0.150
 heals+0.149
Neg logit contributions
 outside-0.101
 everywhere-0.085
 ages-0.081
leaf-0.078
 behind-0.077
Token trip
Token ID5296
Feature activation+5.53e-02

Pos logit contributions
 someday+0.053
 enough+0.052
 swe+0.052
c+0.052
 spe+0.051
Neg logit contributions
leaf-0.051
vell-0.051
ibrarian-0.049
 outside-0.049
he-0.049
Token.
Token ID13
Feature activation-6.88e-01

Pos logit contributions
leaf+0.002
ibrarian+0.002
vell+0.002
taker+0.002
asc+0.002
Neg logit contributions
 swe-0.002
 someday-0.002
 cheaper-0.002
 spe-0.002
 smaller-0.002
Token They
Token ID1119
Feature activation+2.08e-01

Pos logit contributions
 swe+0.024
 spe+0.024
 someday+0.023
 shin+0.022
 cheaper+0.022
Neg logit contributions
leaf-0.027
ibrarian-0.027
he-0.026
asc-0.024
taker-0.024
Token took
Token ID1718
Feature activation-3.26e-01

Pos logit contributions
leaf+0.009
ibrarian+0.009
vell+0.009
taker+0.008
asc+0.008
Neg logit contributions
 swe-0.007
 someday-0.007
 spe-0.006
 cheaper-0.006
 smaller-0.006
Token and
Token ID290
Feature activation-4.01e-01

Pos logit contributions
 swe+0.006
 someday+0.006
 cheaper+0.006
 spe+0.006
 shin+0.005
Neg logit contributions
leaf-0.010
ibrarian-0.010
vell-0.010
he-0.009
taker-0.009
Token asked
Token ID1965
Feature activation-2.61e-01

Pos logit contributions
 someday+0.013
 swe+0.013
 spe+0.013
 cheaper+0.013
 enough+0.012
Neg logit contributions
leaf-0.017
ibrarian-0.016
vell-0.016
taker-0.016
asc-0.015
Token if
Token ID611
Feature activation-1.47e+00

Pos logit contributions
 swe+0.009
 spe+0.009
 someday+0.009
 cheaper+0.009
 shin+0.009
Neg logit contributions
leaf-0.010
ibrarian-0.010
taker-0.009
vell-0.009
 character-0.009
Token she
Token ID673
Feature activation-5.24e-01

Pos logit contributions
c+0.060
 spe+0.060
 swe+0.058
 cheaper+0.057
 coll+0.055
Neg logit contributions
ibrarian-0.048
leaf-0.044
taker-0.043
vell-0.043
 character-0.043
Token could
Token ID714
Feature activation-2.45e-01

Pos logit contributions
 swe+0.022
 someday+0.021
 cheaper+0.020
 spe+0.020
 smaller+0.020
Neg logit contributions
leaf-0.018
ibrarian-0.018
vell-0.016
taker-0.016
 character-0.016
Token have
Token ID423
Feature activation-3.56e+00

Pos logit contributions
 swe+0.008
 someday+0.008
 cheaper+0.008
 spe+0.007
 shin+0.007
Neg logit contributions
leaf-0.011
ibrarian-0.010
vell-0.010
taker-0.009
he-0.009
Token some
Token ID617
Feature activation-1.28e+00

Pos logit contributions
 someday+0.135
oral+0.130
 weaker+0.129
onna+0.128
Or+0.128
Neg logit contributions
leaf-0.119
 goodness-0.103
 outside-0.101
ils-0.097
 ear-0.097
Token po
Token ID745
Feature activation-8.16e-01

Pos logit contributions
 someday+0.040
c+0.036
 swe+0.036
 spe+0.036
 smaller+0.036
Neg logit contributions
leaf-0.057
 goodness-0.050
ibrarian-0.050
vell-0.050
he-0.047
Tokenpp
Token ID381
Feature activation-9.47e-01

Pos logit contributions
 swe+0.028
 someday+0.027
 smaller+0.027
 spe+0.026
 cheaper+0.026
Neg logit contributions
leaf-0.035
vell-0.033
ibrarian-0.032
he-0.030
taker-0.029
Tokenies
Token ID444
Feature activation-9.50e-01

Pos logit contributions
 someday+0.037
 swe+0.036
 cheaper+0.035
 spe+0.035
 One+0.034
Neg logit contributions
leaf-0.035
vell-0.034
ibrarian-0.033
taker-0.032
he-0.030
Token.
Token ID13
Feature activation-9.64e-01

Pos logit contributions
 swe+0.033
 spe+0.033
 someday+0.031
 cheaper+0.030
 smaller+0.030
Neg logit contributions
leaf-0.038
ibrarian-0.037
vell-0.036
taker-0.035
he-0.034
Token so
Token ID523
Feature activation-2.94e-01

Pos logit contributions
 spe+0.077
 someday+0.076
 swe+0.074
 cheaper+0.073
 One+0.073
Neg logit contributions
vell-0.065
taker-0.062
ibrarian-0.062
leaf-0.062
 around-0.059
Token they
Token ID484
Feature activation-4.06e-01

Pos logit contributions
 swe+0.009
 someday+0.009
 spe+0.009
 cheaper+0.009
c+0.009
Neg logit contributions
leaf-0.013
ibrarian-0.012
vell-0.012
taker-0.012
he-0.011
Token can
Token ID460
Feature activation-8.89e-01

Pos logit contributions
 swe+0.012
 someday+0.012
 cheaper+0.011
 spe+0.011
 smaller+0.011
Neg logit contributions
leaf-0.019
ibrarian-0.018
vell-0.018
taker-0.017
he-0.017
Token make
Token ID787
Feature activation-9.15e-01

Pos logit contributions
 swe+0.033
 someday+0.031
 cheaper+0.030
 spe+0.030
 shin+0.030
Neg logit contributions
leaf-0.033
ibrarian-0.033
he-0.033
vell-0.032
taker-0.030
Token the
Token ID262
Feature activation-1.57e-01

Pos logit contributions
 someday+0.031
 swe+0.030
 spe+0.029
 cheaper+0.029
 One+0.028
Neg logit contributions
leaf-0.037
ibrarian-0.035
vell-0.035
taker-0.034
 goodness-0.032
Token forest
Token ID8222
Feature activation-3.55e+00

Pos logit contributions
 swe+0.006
 someday+0.006
 cheaper+0.006
 spe+0.005
c+0.005
Neg logit contributions
leaf-0.006
ibrarian-0.006
vell-0.006
taker-0.005
he-0.005
Token pretty
Token ID2495
Feature activation-2.04e+00

Pos logit contributions
 One+0.128
 someday+0.126
One+0.122
 spe+0.119
 Sometimes+0.117
Neg logit contributions
vell-0.118
taker-0.118
 outside-0.110
 around-0.110
leaf-0.107
Token."
Token ID526
Feature activation-1.92e-01

Pos logit contributions
 cheaper+0.077
 One+0.076
 spe+0.076
 someday+0.075
c+0.074
Neg logit contributions
vell-0.064
taker-0.062
 around-0.061
 outside-0.061
leaf-0.060
Token\n
Token ID198
Feature activation+4.82e-01

Pos logit contributions
 swe+0.006
 someday+0.006
 spe+0.005
 cheaper+0.005
 smaller+0.005
Neg logit contributions
leaf-0.009
ibrarian-0.009
vell-0.008
taker-0.008
he-0.008
Token\n
Token ID198
Feature activation-3.31e-01

Pos logit contributions
leaf+0.026
vell+0.025
ibrarian+0.025
taker+0.024
asc+0.023
Neg logit contributions
 swe-0.011
 someday-0.011
 cheaper-0.010
 One-0.010
c-0.010
TokenL
Token ID43
Feature activation+5.95e-01

Pos logit contributions
 swe+0.011
 spe+0.011
 someday+0.010
 cheaper+0.010
 enough+0.010
Neg logit contributions
leaf-0.014
ibrarian-0.014
vell-0.013
he-0.013
taker-0.013
Token.
Token ID13
Feature activation+2.99e-02

Pos logit contributions
 swe+0.018
 someday+0.017
 cheaper+0.017
 spe+0.017
 smaller+0.016
Neg logit contributions
leaf-0.022
ibrarian-0.021
vell-0.021
taker-0.020
he-0.020
Token Soon
Token ID15894
Feature activation-1.18e+00

Pos logit contributions
leaf+0.001
ibrarian+0.001
vell+0.001
taker+0.001
asc+0.001
Neg logit contributions
 swe-0.001
 someday-0.001
 cheaper-0.001
 spe-0.001
 smaller-0.001
Token,
Token ID11
Feature activation+1.83e-01

Pos logit contributions
 swe+0.038
c+0.037
 cheaper+0.037
 spe+0.036
 someday+0.036
Neg logit contributions
ibrarian-0.049
 character-0.046
leaf-0.046
taker-0.046
vell-0.045
Token the
Token ID262
Feature activation+1.01e+00

Pos logit contributions
leaf+0.009
ibrarian+0.008
vell+0.008
taker+0.008
asc+0.008
Neg logit contributions
 swe-0.005
 someday-0.005
 cheaper-0.005
 spe-0.005
 smaller-0.005
Token path
Token ID3108
Feature activation-1.12e+00

Pos logit contributions
leaf+0.045
ibrarian+0.044
vell+0.042
asc+0.041
taker+0.041
Neg logit contributions
 swe-0.033
 someday-0.031
 spe-0.031
 smaller-0.030
 cheaper-0.030
Token grew
Token ID6348
Feature activation-3.52e+00

Pos logit contributions
 swe+0.039
 someday+0.037
 spe+0.035
 cheaper+0.035
 smaller+0.033
Neg logit contributions
ibrarian-0.046
leaf-0.046
vell-0.044
taker-0.043
he-0.043
Token steep
Token ID14559
Feature activation-9.47e-01

Pos logit contributions
 bidder+0.133
c+0.122
 someday+0.118
 god+0.116
 coll+0.116
Neg logit contributions
 outside-0.125
vell-0.123
 around-0.122
 everywhere-0.121
iring-0.116
Token.
Token ID13
Feature activation+1.82e-01

Pos logit contributions
 swe+0.036
 someday+0.033
 cheaper+0.033
 spe+0.032
c+0.032
Neg logit contributions
leaf-0.035
he-0.034
ibrarian-0.033
vell-0.033
taker-0.032
Token Joe
Token ID5689
Feature activation-6.02e-01

Pos logit contributions
leaf+0.008
ibrarian+0.008
vell+0.008
taker+0.007
asc+0.007
Neg logit contributions
 swe-0.006
 someday-0.006
 cheaper-0.006
 spe-0.006
c-0.005
Token was
Token ID373
Feature activation+2.00e-01

Pos logit contributions
 swe+0.021
 someday+0.020
 cheaper+0.019
 spe+0.019
 smaller+0.018
Neg logit contributions
leaf-0.025
ibrarian-0.025
vell-0.023
taker-0.023
he-0.023
Token tired
Token ID10032
Feature activation-8.67e-01

Pos logit contributions
leaf+0.008
ibrarian+0.008
vell+0.007
taker+0.007
asc+0.007
Neg logit contributions
 swe-0.007
 someday-0.007
 cheaper-0.007
 spe-0.007
 smaller-0.007
Token But
Token ID887
Feature activation-8.85e-01

Pos logit contributions
 swe+0.007
 someday+0.007
 cheaper+0.007
 spe+0.007
 shin+0.006
Neg logit contributions
leaf-0.015
ibrarian-0.014
vell-0.013
taker-0.013
he-0.013
Token sometimes
Token ID3360
Feature activation-1.32e+00

Pos logit contributions
 swe+0.029
 spe+0.027
 cheaper+0.027
 shin+0.026
c+0.026
Neg logit contributions
ibrarian-0.035
leaf-0.035
 character-0.033
taker-0.032
 ear-0.032
Token,
Token ID11
Feature activation-1.15e+00

Pos logit contributions
 swe+0.036
 cheaper+0.036
c+0.036
 spe+0.036
 shin+0.035
Neg logit contributions
 character-0.054
ibrarian-0.054
leaf-0.052
izards-0.052
taker-0.051
Token the
Token ID262
Feature activation-2.43e-01

Pos logit contributions
 swe+0.032
c+0.031
 cheaper+0.030
 shin+0.030
 spe+0.029
Neg logit contributions
leaf-0.050
ibrarian-0.049
 character-0.048
izards-0.048
vell-0.047
Token sky
Token ID6766
Feature activation-2.39e+00

Pos logit contributions
 someday+0.008
 swe+0.008
 spe+0.008
 cheaper+0.008
c+0.008
Neg logit contributions
leaf-0.010
ibrarian-0.010
vell-0.009
taker-0.009
asc-0.009
Token gets
Token ID3011
Feature activation-3.51e+00

Pos logit contributions
 swe+0.089
 shin+0.086
 someday+0.085
 cheaper+0.083
 spe+0.081
Neg logit contributions
he-0.076
 around-0.076
ibrarian-0.071
ower-0.071
ong-0.070
Token dark
Token ID3223
Feature activation-1.39e+00

Pos logit contributions
 bidder+0.154
 god+0.153
 someday+0.151
 coll+0.148
c+0.147
Neg logit contributions
 everywhere-0.096
 outside-0.095
 around-0.089
ong-0.080
 behind-0.080
Token and
Token ID290
Feature activation-2.30e+00

Pos logit contributions
 swe+0.055
 cheaper+0.053
 spe+0.052
c+0.052
 shin+0.051
Neg logit contributions
 outside-0.047
he-0.046
leaf-0.043
vell-0.043
taker-0.042
Token it
Token ID340
Feature activation-2.43e+00

Pos logit contributions
 coll+0.081
c+0.081
 someday+0.081
 cheaper+0.078
 swe+0.077
Neg logit contributions
taker-0.073
 around-0.071
 outside-0.067
leaf-0.067
 everywhere-0.065
Token rains
Token ID29424
Feature activation-8.16e-01

Pos logit contributions
 someday+0.086
 swe+0.084
 cheaper+0.084
 coll+0.078
 spe+0.076
Neg logit contributions
 around-0.086
ibrarian-0.083
he-0.082
taker-0.080
 outside-0.077
Token.
Token ID13
Feature activation-7.07e-01

Pos logit contributions
 swe+0.031
 cheaper+0.029
 someday+0.029
 spe+0.028
 shin+0.028
Neg logit contributions
leaf-0.030
ibrarian-0.029
he-0.029
vell-0.028
taker-0.028
Token mom
Token ID1995
Feature activation-7.69e-01

Pos logit contributions
 someday+0.029
 spe+0.029
 swe+0.029
c+0.028
 cheaper+0.028
Neg logit contributions
ibrarian-0.022
leaf-0.021
vell-0.020
ibr-0.019
 character-0.019
Token if
Token ID611
Feature activation-1.56e+00

Pos logit contributions
 spe+0.030
 swe+0.028
 cheaper+0.026
 someday+0.026
 shin+0.026
Neg logit contributions
ibrarian-0.028
leaf-0.028
vell-0.026
taker-0.026
he-0.026
Token they
Token ID484
Feature activation-4.40e-01

Pos logit contributions
c+0.060
 swe+0.058
 spe+0.057
 cheaper+0.054
 coll+0.053
Neg logit contributions
ibrarian-0.055
leaf-0.052
taker-0.051
vell-0.050
 character-0.049
Token could
Token ID714
Feature activation-1.40e+00

Pos logit contributions
 swe+0.020
 someday+0.018
 cheaper+0.018
 spe+0.018
c+0.018
Neg logit contributions
leaf-0.014
ibrarian-0.014
vell-0.013
taker-0.013
he-0.012
Token go
Token ID467
Feature activation-2.00e+00

Pos logit contributions
 swe+0.055
 spe+0.051
 shin+0.050
 cheaper+0.050
c+0.048
Neg logit contributions
he-0.050
leaf-0.048
ibrarian-0.047
vell-0.047
 outside-0.045
Token on
Token ID319
Feature activation-3.48e+00

Pos logit contributions
 spe+0.099
 swe+0.098
 shin+0.088
 cheaper+0.087
 smaller+0.086
Neg logit contributions
 outside-0.058
he-0.051
 around-0.050
leaf-0.049
 behind-0.049
Token a
Token ID257
Feature activation-2.61e+00

Pos logit contributions
 spe+0.160
onna+0.159
Or+0.158
 becomes+0.154
 fade+0.153
Neg logit contributions
 outside-0.084
 everywhere-0.072
 all-0.071
 behind-0.070
 ages-0.066
Token saf
Token ID1932
Feature activation+5.72e-01

Pos logit contributions
 spe+0.089
 swe+0.088
 becomes+0.086
 fade+0.086
 someday+0.085
Neg logit contributions
 outside-0.089
vell-0.085
leaf-0.083
lf-0.082
ibrarian-0.080
Tokenari
Token ID2743
Feature activation-9.64e-01

Pos logit contributions
leaf+0.023
ibrarian+0.022
taker+0.021
vell+0.021
opher+0.020
Neg logit contributions
 someday-0.021
 swe-0.020
 cheaper-0.019
 spe-0.019
 One-0.018
Token,
Token ID11
Feature activation-9.98e-01

Pos logit contributions
 swe+0.038
 spe+0.036
 smaller+0.035
 cheaper+0.034
 shin+0.034
Neg logit contributions
leaf-0.035
ibrarian-0.034
vell-0.033
he-0.033
 outside-0.031
Token but
Token ID475
Feature activation-1.48e+00

Pos logit contributions
 swe+0.040
 spe+0.039
c+0.038
 someday+0.037
 cheaper+0.037
Neg logit contributions
leaf-0.034
vell-0.030
ibrarian-0.030
taker-0.029
he-0.028
Token them
Token ID606
Feature activation-2.38e-01

Pos logit contributions
 someday+0.053
 swe+0.051
c+0.048
 cheaper+0.048
 spe+0.048
Neg logit contributions
taker-0.060
leaf-0.059
he-0.058
ibrarian-0.058
 around-0.057
Token up
Token ID510
Feature activation-1.58e+00

Pos logit contributions
 swe+0.012
 someday+0.012
 cheaper+0.012
 spe+0.012
c+0.011
Neg logit contributions
leaf-0.006
ibrarian-0.006
vell-0.005
taker-0.005
he-0.005
Token.
Token ID13
Feature activation-1.10e+00

Pos logit contributions
 swe+0.071
c+0.068
 someday+0.067
 cheaper+0.066
 spe+0.064
Neg logit contributions
he-0.042
 outside-0.042
 around-0.042
vell-0.041
leaf-0.040
Token But
Token ID887
Feature activation-1.61e+00

Pos logit contributions
 spe+0.039
 someday+0.039
 swe+0.039
 cheaper+0.037
c+0.036
Neg logit contributions
leaf-0.042
taker-0.040
vell-0.039
ibrarian-0.038
issors-0.038
Token the
Token ID262
Feature activation-9.86e-01

Pos logit contributions
 swe+0.072
 spe+0.072
 cheaper+0.068
c+0.065
 someday+0.064
Neg logit contributions
bled-0.041
ting-0.041
 ear-0.039
taker-0.039
 around-0.039
Token last
Token ID938
Feature activation-3.48e+00

Pos logit contributions
 someday+0.029
 swe+0.027
 spe+0.027
 cheaper+0.026
c+0.026
Neg logit contributions
ibrarian-0.043
leaf-0.042
vell-0.040
taker-0.040
 character-0.039
Token snowball
Token ID46275
Feature activation-1.22e+00

Pos logit contributions
Or+0.152
 someday+0.141
Flor+0.138
 cheaper+0.137
 spe+0.136
Neg logit contributions
 character-0.087
lf-0.078
 around-0.070
ied-0.070
 members-0.069
Token was
Token ID373
Feature activation-4.95e-01

Pos logit contributions
 swe+0.044
 someday+0.043
 cheaper+0.042
 spe+0.041
c+0.039
Neg logit contributions
ibrarian-0.045
leaf-0.044
 character-0.043
taker-0.043
vell-0.042
Token too
Token ID1165
Feature activation-6.52e-01

Pos logit contributions
 someday+0.015
 swe+0.015
 spe+0.014
c+0.014
 cheaper+0.014
Neg logit contributions
leaf-0.021
ibrarian-0.021
vell-0.020
taker-0.020
asc-0.019
Token heavy
Token ID4334
Feature activation-2.34e-01

Pos logit contributions
 someday+0.024
 swe+0.024
c+0.022
 spe+0.021
 cheaper+0.021
Neg logit contributions
leaf-0.025
ibrarian-0.024
vell-0.024
taker-0.024
he-0.024
Token for
Token ID329
Feature activation-5.40e-01

Pos logit contributions
 swe+0.009
 someday+0.009
 spe+0.008
 cheaper+0.008
c+0.008
Neg logit contributions
leaf-0.009
ibrarian-0.008
vell-0.008
taker-0.008
he-0.008
Token.
Token ID13
Feature activation-2.03e-01

Pos logit contributions
 swe+0.021
 someday+0.020
 spe+0.020
 cheaper+0.019
c+0.019
Neg logit contributions
leaf-0.023
ibrarian-0.022
vell-0.021
taker-0.021
he-0.021
Token She
Token ID1375
Feature activation-9.11e-02

Pos logit contributions
 swe+0.007
 someday+0.006
 spe+0.006
 cheaper+0.006
c+0.006
Neg logit contributions
leaf-0.009
ibrarian-0.009
vell-0.008
taker-0.008
he-0.008
Token wanted
Token ID2227
Feature activation-9.43e-01

Pos logit contributions
 swe+0.003
 someday+0.003
 cheaper+0.003
 spe+0.003
 smaller+0.003
Neg logit contributions
leaf-0.004
ibrarian-0.004
vell-0.003
taker-0.003
he-0.003
Token to
Token ID284
Feature activation-6.48e-02

Pos logit contributions
c+0.031
 spe+0.031
 someday+0.030
 swe+0.030
 shin+0.029
Neg logit contributions
leaf-0.038
ibrarian-0.037
 character-0.034
 ear-0.033
taker-0.033
Token make
Token ID787
Feature activation-6.13e-01

Pos logit contributions
 swe+0.002
 someday+0.002
 spe+0.002
 cheaper+0.002
c+0.002
Neg logit contributions
leaf-0.003
ibrarian-0.003
vell-0.003
taker-0.002
he-0.002
Token it
Token ID340
Feature activation-3.47e+00

Pos logit contributions
 someday+0.023
 swe+0.022
 spe+0.022
 cheaper+0.021
c+0.021
Neg logit contributions
leaf-0.024
ibrarian-0.023
taker-0.022
vell-0.021
 character-0.021
Token soar
Token ID48701
Feature activation-1.83e+00

Pos logit contributions
 spe+0.136
�+0.129
c+0.121
oral+0.121
use+0.119
Neg logit contributions
 around-0.115
 outside-0.111
taker-0.106
ong-0.105
vell-0.103
Token higher
Token ID2440
Feature activation-1.59e+00

Pos logit contributions
 swe+0.078
 spe+0.077
 cheaper+0.074
 One+0.071
oral+0.071
Neg logit contributions
he-0.054
 around-0.054
 outside-0.050
vell-0.050
 behind-0.048
Token than
Token ID621
Feature activation+6.43e-02

Pos logit contributions
 swe+0.070
 cheaper+0.066
 spe+0.065
c+0.064
 shin+0.063
Neg logit contributions
he-0.045
 around-0.042
 behind-0.041
 outside-0.041
ibrarian-0.040
Token the
Token ID262
Feature activation-6.85e-01

Pos logit contributions
leaf+0.003
ibrarian+0.003
vell+0.003
taker+0.003
asc+0.002
Neg logit contributions
 swe-0.002
 someday-0.002
 cheaper-0.002
 spe-0.002
 smaller-0.002
Token boy
Token ID2933
Feature activation+3.26e-01

Pos logit contributions
 swe+0.023
 someday+0.023
c+0.021
 spe+0.021
 cheaper+0.021
Neg logit contributions
leaf-0.028
ibrarian-0.026
vell-0.026
he-0.026
taker-0.025
Token.
Token ID13
Feature activation-5.93e-01

Pos logit contributions
leaf+0.005
ibrarian+0.005
vell+0.005
taker+0.005
asc+0.005
Neg logit contributions
 swe-0.004
 someday-0.004
 cheaper-0.004
 spe-0.004
 smaller-0.003
Token Every
Token ID3887
Feature activation-8.45e-01

Pos logit contributions
 swe+0.015
 spe+0.014
 cheaper+0.014
 someday+0.013
 shin+0.013
Neg logit contributions
leaf-0.030
ibrarian-0.029
he-0.028
taker-0.028
 character-0.027
Token morning
Token ID3329
Feature activation-1.37e-02

Pos logit contributions
 swe+0.041
 cheaper+0.040
 shin+0.039
 spe+0.039
 someday+0.038
Neg logit contributions
leaf-0.022
ibrarian-0.022
 character-0.020
he-0.020
taker-0.019
Token,
Token ID11
Feature activation-5.70e-01

Pos logit contributions
 swe+0.000
 someday+0.000
 cheaper+0.000
 spe+0.000
 smaller+0.000
Neg logit contributions
leaf-0.001
ibrarian-0.001
vell-0.001
taker-0.001
he-0.001
Token the
Token ID262
Feature activation-1.26e+00

Pos logit contributions
 swe+0.020
 someday+0.020
 cheaper+0.020
 spe+0.019
c+0.019
Neg logit contributions
leaf-0.022
ibrarian-0.021
taker-0.020
vell-0.020
 character-0.020
Token little
Token ID1310
Feature activation-3.46e+00

Pos logit contributions
 someday+0.041
c+0.039
 swe+0.036
 cheaper+0.035
 One+0.035
Neg logit contributions
vell-0.050
ibrarian-0.050
leaf-0.049
taker-0.048
 goodness-0.047
Token pig
Token ID12967
Feature activation-1.46e+00

Pos logit contributions
onna+0.150
 someday+0.142
 sometime+0.141
 fade+0.133
 simpler+0.132
Neg logit contributions
ting-0.094
 goodness-0.093
 children-0.092
 ear-0.086
 outside-0.085
Token would
Token ID561
Feature activation+1.91e-01

Pos logit contributions
 cheaper+0.045
 swe+0.045
 spe+0.043
 shin+0.041
 someday+0.040
Neg logit contributions
ibrarian-0.065
leaf-0.060
vell-0.060
taker-0.059
he-0.059
Token run
Token ID1057
Feature activation-6.78e-02

Pos logit contributions
leaf+0.008
ibrarian+0.008
vell+0.008
taker+0.007
asc+0.007
Neg logit contributions
 swe-0.006
 someday-0.006
 spe-0.006
 cheaper-0.006
 smaller-0.006
Token and
Token ID290
Feature activation+7.37e-01

Pos logit contributions
 swe+0.003
 someday+0.003
 spe+0.003
 cheaper+0.003
c+0.002
Neg logit contributions
leaf-0.003
ibrarian-0.002
vell-0.002
taker-0.002
he-0.002
Token jump
Token ID4391
Feature activation+2.52e-01

Pos logit contributions
leaf+0.033
ibrarian+0.033
vell+0.032
taker+0.030
ibr+0.030
Neg logit contributions
 swe-0.024
 someday-0.023
 spe-0.022
 smaller-0.022
c-0.021
Token me
Token ID502
Feature activation+1.16e+00

Pos logit contributions
leaf+0.021
ibrarian+0.020
vell+0.020
taker+0.018
ibr+0.018
Neg logit contributions
 swe-0.021
 someday-0.020
c-0.020
 cheaper-0.019
 enough-0.019
Token!"
Token ID2474
Feature activation-2.63e-01

Pos logit contributions
leaf+0.062
ibrarian+0.056
vell+0.056
asc+0.054
taker+0.053
Neg logit contributions
 someday-0.032
 enough-0.028
 swe-0.028
c-0.027
 smaller-0.027
Token\n
Token ID198
Feature activation+1.39e-01

Pos logit contributions
 swe+0.008
 someday+0.008
 cheaper+0.007
 spe+0.007
 smaller+0.007
Neg logit contributions
leaf-0.012
ibrarian-0.012
vell-0.011
taker-0.011
he-0.011
Token\n
Token ID198
Feature activation-4.19e-01

Pos logit contributions
leaf+0.007
ibrarian+0.007
vell+0.007
taker+0.007
asc+0.006
Neg logit contributions
 swe-0.004
 someday-0.003
 spe-0.003
 cheaper-0.003
 smaller-0.003
TokenThe
Token ID464
Feature activation-2.36e+00

Pos logit contributions
 swe+0.013
 someday+0.013
 cheaper+0.012
 spe+0.012
 enough+0.012
Neg logit contributions
leaf-0.020
ibrarian-0.019
vell-0.018
taker-0.017
he-0.017
Token little
Token ID1310
Feature activation-3.46e+00

Pos logit contributions
 someday+0.096
onna+0.094
 sometime+0.091
c+0.090
eter+0.084
Neg logit contributions
he-0.066
ibrarian-0.065
 maker-0.064
 children-0.064
coon-0.063
Token boy
Token ID2933
Feature activation-3.10e-01

Pos logit contributions
onna+0.160
 former+0.154
 sometime+0.153
 someday+0.148
 reality+0.146
Neg logit contributions
 goodness-0.086
ting-0.084
he-0.079
 children-0.074
 ear-0.073
Token said
Token ID531
Feature activation+1.00e-01

Pos logit contributions
 swe+0.010
 someday+0.010
 cheaper+0.010
 spe+0.010
 smaller+0.009
Neg logit contributions
leaf-0.013
ibrarian-0.013
vell-0.012
taker-0.012
 character-0.012
Token,
Token ID11
Feature activation+1.43e-01

Pos logit contributions
leaf+0.005
ibrarian+0.005
vell+0.005
taker+0.005
asc+0.004
Neg logit contributions
 swe-0.003
 someday-0.003
 cheaper-0.002
 spe-0.002
 smaller-0.002
Token "
Token ID366
Feature activation+5.12e-01

Pos logit contributions
leaf+0.006
ibrarian+0.006
vell+0.006
taker+0.005
asc+0.005
Neg logit contributions
 swe-0.005
 someday-0.005
 spe-0.005
 cheaper-0.005
 smaller-0.005
TokenWhat
Token ID2061
Feature activation+6.47e-01

Pos logit contributions
ibrarian+0.022
leaf+0.022
vell+0.021
taker+0.020
opher+0.020
Neg logit contributions
 swe-0.017
 someday-0.017
 spe-0.016
 cheaper-0.016
 shin-0.015
Token."
Token ID526
Feature activation-4.52e-01

Pos logit contributions
 swe+0.001
 someday+0.001
 cheaper+0.001
 spe+0.001
c+0.001
Neg logit contributions
leaf-0.002
ibrarian-0.002
vell-0.002
taker-0.002
asc-0.001
Token\n
Token ID198
Feature activation-1.95e-01

Pos logit contributions
 swe+0.015
 someday+0.014
 spe+0.014
 cheaper+0.014
 smaller+0.013
Neg logit contributions
leaf-0.020
ibrarian-0.019
taker-0.019
he-0.018
vell-0.018
Token\n
Token ID198
Feature activation-8.01e-01

Pos logit contributions
 swe+0.006
 someday+0.006
 spe+0.005
 cheaper+0.005
 smaller+0.005
Neg logit contributions
leaf-0.009
ibrarian-0.009
vell-0.009
taker-0.008
he-0.008
TokenSo
Token ID2396
Feature activation-1.05e+00

Pos logit contributions
 swe+0.029
 spe+0.028
 someday+0.027
 enough+0.027
 cheaper+0.026
Neg logit contributions
leaf-0.034
ibrarian-0.032
he-0.030
vell-0.030
taker-0.029
Token the
Token ID262
Feature activation-1.48e+00

Pos logit contributions
 swe+0.038
 cheaper+0.036
c+0.036
 someday+0.034
 spe+0.034
Neg logit contributions
leaf-0.038
ibrarian-0.037
taker-0.037
 character-0.036
he-0.036
Token little
Token ID1310
Feature activation-3.45e+00

Pos logit contributions
 someday+0.071
 sometime+0.064
c+0.063
 swe+0.063
 spe+0.063
Neg logit contributions
ibrarian-0.038
he-0.037
taker-0.035
 character-0.033
 goodness-0.033
Token boy
Token ID2933
Feature activation-7.70e-01

Pos logit contributions
onna+0.204
 former+0.193
 sometime+0.189
 reality+0.185
 fade+0.183
Neg logit contributions
 children-0.054
 goodness-0.050
ting-0.045
 kids-0.043
he-0.040
Token started
Token ID2067
Feature activation-1.03e+00

Pos logit contributions
 swe+0.027
 someday+0.027
 cheaper+0.026
 spe+0.025
 smaller+0.024
Neg logit contributions
leaf-0.031
ibrarian-0.031
vell-0.028
taker-0.028
he-0.027
Token sharp
Token ID7786
Feature activation-8.25e-01

Pos logit contributions
 someday+0.044
 swe+0.042
 cheaper+0.041
 spe+0.040
 shin+0.039
Neg logit contributions
leaf-0.035
ibrarian-0.030
vell-0.030
 outside-0.030
taker-0.029
Tokenening
Token ID3101
Feature activation-7.47e-02

Pos logit contributions
 someday+0.032
 swe+0.031
 cheaper+0.030
 spe+0.029
c+0.028
Neg logit contributions
leaf-0.030
ibrarian-0.028
taker-0.027
ting-0.027
vell-0.027
Token the
Token ID262
Feature activation+6.38e-01

Pos logit contributions
 swe+0.003
 someday+0.003
 cheaper+0.003
 spe+0.003
c+0.002
Neg logit contributions
leaf-0.003
ibrarian-0.003
vell-0.003
taker-0.003
he-0.003
Token,
Token ID11
Feature activation-1.03e+00

Pos logit contributions
 swe+0.035
c+0.035
 cheaper+0.033
 someday+0.033
 spe+0.033
Neg logit contributions
ibrarian-0.043
taker-0.042
leaf-0.042
vell-0.042
he-0.040
Token Grace
Token ID16156
Feature activation-5.15e-01

Pos logit contributions
 swe+0.036
 cheaper+0.034
 someday+0.034
c+0.034
 spe+0.033
Neg logit contributions
leaf-0.040
taker-0.039
vell-0.038
ibrarian-0.038
he-0.037
Token's
Token ID338
Feature activation+5.68e-01

Pos logit contributions
 swe+0.017
 someday+0.016
 cheaper+0.016
 spe+0.015
 smaller+0.015
Neg logit contributions
leaf-0.022
ibrarian-0.022
vell-0.021
taker-0.020
he-0.020
Token attic
Token ID46113
Feature activation-1.80e+00

Pos logit contributions
leaf+0.024
ibrarian+0.022
vell+0.022
asc+0.021
taker+0.021
Neg logit contributions
 swe-0.021
 someday-0.020
 cheaper-0.020
 spe-0.019
 smaller-0.019
Token was
Token ID373
Feature activation-1.18e+00

Pos logit contributions
 swe+0.062
 someday+0.060
 spe+0.058
c+0.056
 cheaper+0.056
Neg logit contributions
ibrarian-0.071
vell-0.069
taker-0.067
leaf-0.065
he-0.065
Token becoming
Token ID5033
Feature activation-3.45e+00

Pos logit contributions
 swe+0.045
 someday+0.044
 spe+0.043
c+0.042
 coll+0.041
Neg logit contributions
ibrarian-0.040
vell-0.039
leaf-0.039
taker-0.039
he-0.037
Token more
Token ID517
Feature activation-3.01e-01

Pos logit contributions
 bidder+0.116
 coll+0.115
c+0.113
 spe+0.112
 One+0.110
Neg logit contributions
vell-0.118
 outside-0.114
taker-0.113
 around-0.110
 everywhere-0.105
Token and
Token ID290
Feature activation-1.66e+00

Pos logit contributions
 swe+0.011
 someday+0.011
 spe+0.011
c+0.011
 cheaper+0.011
Neg logit contributions
leaf-0.011
ibrarian-0.011
vell-0.011
taker-0.010
he-0.010
Token more
Token ID517
Feature activation-6.55e-02

Pos logit contributions
 spe+0.042
 One+0.042
c+0.041
 someday+0.040
 swe+0.038
Neg logit contributions
taker-0.071
leaf-0.071
vell-0.071
ibrarian-0.070
 character-0.068
Token full
Token ID1336
Feature activation+6.64e-01

Pos logit contributions
 swe+0.002
 someday+0.002
 spe+0.002
 cheaper+0.002
c+0.002
Neg logit contributions
leaf-0.003
ibrarian-0.003
vell-0.002
taker-0.002
he-0.002
Token with
Token ID351
Feature activation+9.08e-02

Pos logit contributions
leaf+0.026
ibrarian+0.025
vell+0.024
asc+0.023
taker+0.023
Neg logit contributions
 swe-0.025
 someday-0.025
 cheaper-0.023
 spe-0.023
 enough-0.023
Token that
Token ID326
Feature activation-1.39e-01

Pos logit contributions
 swe+0.002
 someday+0.002
 cheaper+0.001
 spe+0.001
c+0.001
Neg logit contributions
leaf-0.002
ibrarian-0.002
vell-0.002
taker-0.002
asc-0.002
Token it
Token ID340
Feature activation-1.76e-02

Pos logit contributions
 swe+0.004
 someday+0.004
 cheaper+0.004
 spe+0.004
 smaller+0.004
Neg logit contributions
leaf-0.006
ibrarian-0.006
vell-0.006
taker-0.006
 character-0.006
Token could
Token ID714
Feature activation-7.44e-01

Pos logit contributions
 swe+0.001
 someday+0.001
 cheaper+0.001
 spe+0.001
 smaller+0.000
Neg logit contributions
leaf-0.001
ibrarian-0.001
vell-0.001
taker-0.001
asc-0.001
Token make
Token ID787
Feature activation-7.32e-01

Pos logit contributions
 swe+0.026
 someday+0.024
 cheaper+0.024
c+0.023
 enough+0.023
Neg logit contributions
leaf-0.031
ibrarian-0.030
vell-0.029
taker-0.028
he-0.028
Token the
Token ID262
Feature activation-2.95e-01

Pos logit contributions
 someday+0.025
 swe+0.024
 cheaper+0.024
 spe+0.023
 One+0.022
Neg logit contributions
leaf-0.029
ibrarian-0.029
vell-0.028
taker-0.027
 character-0.027
Token garden
Token ID11376
Feature activation-3.41e+00

Pos logit contributions
 swe+0.010
 someday+0.010
 cheaper+0.009
 spe+0.009
c+0.009
Neg logit contributions
leaf-0.012
ibrarian-0.012
vell-0.012
taker-0.011
 character-0.011
Token even
Token ID772
Feature activation-1.79e+00

Pos logit contributions
 someday+0.127
 One+0.121
One+0.121
 bidder+0.119
 spe+0.118
Neg logit contributions
vell-0.110
taker-0.110
 around-0.104
 outside-0.104
ong-0.102
Token better
Token ID1365
Feature activation-5.97e-02

Pos logit contributions
 One+0.038
 someday+0.038
 spe+0.035
c+0.035
One+0.033
Neg logit contributions
leaf-0.088
taker-0.084
vell-0.082
 around-0.081
 outside-0.080
Token.
Token ID13
Feature activation+1.07e-01

Pos logit contributions
 swe+0.002
 someday+0.002
 cheaper+0.002
 spe+0.002
c+0.002
Neg logit contributions
leaf-0.003
ibrarian-0.002
vell-0.002
taker-0.002
asc-0.002
Token\n
Token ID198
Feature activation+6.57e-01

Pos logit contributions
leaf+0.005
ibrarian+0.005
vell+0.005
taker+0.005
asc+0.004
Neg logit contributions
 swe-0.003
 someday-0.003
 spe-0.003
 cheaper-0.003
c-0.003
Token\n
Token ID198
Feature activation+2.28e-01

Pos logit contributions
leaf+0.036
vell+0.036
ibrarian+0.035
taker+0.034
asc+0.034
Neg logit contributions
 someday-0.013
 swe-0.013
 One-0.012
c-0.012
 cheaper-0.012
TokenSoon
Token ID28093
Feature activation-1.63e+00

Pos logit contributions
 swe+0.026
 spe+0.024
 cheaper+0.024
 someday+0.024
 enough+0.023
Neg logit contributions
leaf-0.030
ibrarian-0.029
vell-0.027
he-0.027
taker-0.027
Token,
Token ID11
Feature activation-7.88e-02

Pos logit contributions
 swe+0.060
c+0.060
 cheaper+0.058
 shin+0.056
 coll+0.054
Neg logit contributions
 character-0.056
 goodness-0.056
ibrarian-0.055
ting-0.053
leaf-0.053
Token the
Token ID262
Feature activation+3.11e-01

Pos logit contributions
 swe+0.002
 someday+0.002
 cheaper+0.002
 spe+0.002
c+0.002
Neg logit contributions
leaf-0.004
ibrarian-0.004
vell-0.004
taker-0.003
he-0.003
Token night
Token ID1755
Feature activation-4.65e-01

Pos logit contributions
leaf+0.014
ibrarian+0.013
vell+0.013
taker+0.012
asc+0.012
Neg logit contributions
 swe-0.010
 someday-0.010
 spe-0.010
 cheaper-0.010
 smaller-0.009
Token sky
Token ID6766
Feature activation-2.02e+00

Pos logit contributions
 swe+0.016
 someday+0.015
 cheaper+0.015
 spe+0.014
 smaller+0.014
Neg logit contributions
leaf-0.019
ibrarian-0.019
vell-0.018
taker-0.018
he-0.017
Token grew
Token ID6348
Feature activation-3.40e+00

Pos logit contributions
 swe+0.073
 cheaper+0.073
 someday+0.068
 coll+0.066
 shin+0.065
Neg logit contributions
he-0.071
 around-0.068
leaf-0.067
ong-0.067
taker-0.067
Token very
Token ID845
Feature activation-1.13e+00

Pos logit contributions
 bidder+0.132
c+0.122
 coll+0.120
 god+0.117
 someday+0.116
Neg logit contributions
 outside-0.122
ong-0.110
 around-0.109
vell-0.109
 everywhere-0.105
Token dark
Token ID3223
Feature activation-8.60e-01

Pos logit contributions
c+0.042
 someday+0.041
 swe+0.040
 coll+0.039
 spe+0.038
Neg logit contributions
leaf-0.039
vell-0.039
taker-0.038
he-0.036
ibrarian-0.035
Token.
Token ID13
Feature activation+2.45e-02

Pos logit contributions
 swe+0.032
 cheaper+0.031
 someday+0.030
c+0.029
 spe+0.029
Neg logit contributions
leaf-0.031
vell-0.030
ibrarian-0.030
he-0.030
taker-0.029
Token The
Token ID383
Feature activation+7.55e-01

Pos logit contributions
leaf+0.001
ibrarian+0.001
vell+0.001
taker+0.001
he+0.001
Neg logit contributions
 swe-0.001
 someday-0.001
 cheaper-0.001
 spe-0.001
 smaller-0.001
Token little
Token ID1310
Feature activation-9.87e-01

Pos logit contributions
leaf+0.034
ibrarian+0.033
vell+0.031
asc+0.031
iders+0.030
Neg logit contributions
 swe-0.025
 someday-0.024
 spe-0.024
 smaller-0.023
 cheaper-0.023
Token He
Token ID679
Feature activation+4.65e-01

Pos logit contributions
leaf+0.006
ibrarian+0.006
vell+0.005
taker+0.005
asc+0.005
Neg logit contributions
 swe-0.004
 someday-0.004
 cheaper-0.004
 spe-0.004
 smaller-0.003
Token thought
Token ID1807
Feature activation+6.86e-01

Pos logit contributions
leaf+0.021
ibrarian+0.020
vell+0.020
taker+0.019
asc+0.018
Neg logit contributions
 swe-0.015
 someday-0.014
 spe-0.014
 cheaper-0.014
 smaller-0.013
Token she
Token ID673
Feature activation+4.93e-01

Pos logit contributions
leaf+0.032
vell+0.031
ibrarian+0.031
asc+0.030
iders+0.028
Neg logit contributions
 swe-0.021
 someday-0.020
 cheaper-0.019
 enough-0.019
 smaller-0.019
Token was
Token ID373
Feature activation+1.93e+00

Pos logit contributions
leaf+0.025
ibrarian+0.024
vell+0.024
taker+0.023
asc+0.023
Neg logit contributions
 swe-0.013
 someday-0.012
 spe-0.012
 cheaper-0.012
c-0.011
Token unhappy
Token ID19283
Feature activation+6.03e-01

Pos logit contributions
leaf+0.090
he+0.088
ibrarian+0.088
person+0.084
iders+0.084
Neg logit contributions
 smaller-0.060
 swe-0.058
 enough-0.057
 cheaper-0.056
 shin-0.052
Token with
Token ID351
Feature activation+1.98e+00

Pos logit contributions
leaf+0.030
ibrarian+0.028
vell+0.028
asc+0.026
taker+0.026
Neg logit contributions
 someday-0.017
 swe-0.017
 cheaper-0.016
 enough-0.016
 smaller-0.015
Token him
Token ID683
Feature activation+7.01e-01

Pos logit contributions
leaf+0.098
vell+0.097
ibrarian+0.097
asc+0.095
iders+0.093
Neg logit contributions
 swe-0.056
 enough-0.053
 cheaper-0.051
 smaller-0.051
 someday-0.050
Token and
Token ID290
Feature activation+5.51e-01

Pos logit contributions
leaf+0.035
ibrarian+0.033
vell+0.032
taker+0.031
asc+0.031
Neg logit contributions
 someday-0.020
 swe-0.020
 cheaper-0.018
 enough-0.018
 smaller-0.018
Token his
Token ID465
Feature activation+1.35e+00

Pos logit contributions
leaf+0.025
ibrarian+0.024
vell+0.023
asc+0.022
taker+0.022
Neg logit contributions
 swe-0.018
 someday-0.017
 cheaper-0.017
 spe-0.016
 smaller-0.016
Token clay
Token ID21558
Feature activation+8.81e-01

Pos logit contributions
leaf+0.056
ibrarian+0.054
vell+0.053
he+0.050
asc+0.050
Neg logit contributions
 swe-0.049
 someday-0.046
 cheaper-0.045
 smaller-0.044
 spe-0.044
Token.
Token ID13
Feature activation-2.11e-01

Pos logit contributions
leaf+0.043
ibrarian+0.040
vell+0.039
asc+0.038
taker+0.037
Neg logit contributions
 swe-0.027
 someday-0.026
 cheaper-0.024
 spe-0.023
 smaller-0.023
Token looked
Token ID3114
Feature activation+7.17e-01

Pos logit contributions
ibrarian+0.066
leaf+0.064
taker+0.062
iders+0.062
opher+0.061
Neg logit contributions
 someday-0.040
 spe-0.039
 swe-0.039
c-0.036
 One-0.035
Token at
Token ID379
Feature activation+5.94e-01

Pos logit contributions
ibrarian+0.025
leaf+0.024
iders+0.022
vell+0.022
taker+0.021
Neg logit contributions
 swe-0.032
 someday-0.031
 enough-0.030
 cheaper-0.029
 smaller-0.029
Token it
Token ID340
Feature activation-4.09e-01

Pos logit contributions
leaf+0.031
ibrarian+0.031
vell+0.030
ibr+0.029
asc+0.028
Neg logit contributions
 someday-0.015
 swe-0.015
 smaller-0.014
 cheaper-0.013
 spe-0.013
Token,
Token ID11
Feature activation+2.08e-01

Pos logit contributions
 swe+0.014
 someday+0.014
 spe+0.014
 cheaper+0.014
c+0.013
Neg logit contributions
leaf-0.016
ibrarian-0.016
vell-0.015
taker-0.015
he-0.015
Token and
Token ID290
Feature activation+3.43e-01

Pos logit contributions
leaf+0.009
ibrarian+0.009
vell+0.008
taker+0.008
asc+0.008
Neg logit contributions
 swe-0.007
 someday-0.007
 spe-0.007
 cheaper-0.007
 smaller-0.006
Token their
Token ID511
Feature activation+1.94e+00

Pos logit contributions
leaf+0.018
ibrarian+0.018
vell+0.017
taker+0.017
he+0.016
Neg logit contributions
 swe-0.008
 someday-0.008
 spe-0.007
 cheaper-0.007
 smaller-0.007
Token eyes
Token ID2951
Feature activation-6.69e-01

Pos logit contributions
leaf+0.094
ibrarian+0.093
iders+0.091
taker+0.089
he+0.089
Neg logit contributions
 swe-0.062
 smaller-0.058
 cheaper-0.053
 enough-0.050
 coll-0.049
Token spark
Token ID9009
Feature activation-9.28e-01

Pos logit contributions
 swe+0.021
 someday+0.021
 cheaper+0.020
 spe+0.019
c+0.018
Neg logit contributions
leaf-0.029
vell-0.028
ibrarian-0.028
he-0.027
taker-0.027
Tokenled
Token ID992
Feature activation-8.10e-01

Pos logit contributions
 someday+0.062
 swe+0.062
 cheaper+0.061
 spe+0.060
 One+0.060
Neg logit contributions
leaf-0.007
vell-0.005
ibrarian-0.005
taker-0.004
he-0.003
Token.
Token ID13
Feature activation-4.48e-03

Pos logit contributions
 swe+0.037
 cheaper+0.036
c+0.036
 someday+0.036
 spe+0.036
Neg logit contributions
leaf-0.022
vell-0.021
ibrarian-0.019
taker-0.019
he-0.018
Token They
Token ID1119
Feature activation+1.27e+00

Pos logit contributions
 swe+0.000
 someday+0.000
 cheaper+0.000
 spe+0.000
 smaller+0.000
Neg logit contributions
leaf-0.000
ibrarian-0.000
vell-0.000
taker-0.000
asc-0.000
Token"
Token ID1
Feature activation-5.17e-01

Pos logit contributions
leaf+0.031
vell+0.030
ibrarian+0.029
taker+0.028
opher+0.028
Neg logit contributions
 someday-0.017
 swe-0.016
One-0.015
 One-0.015
 smaller-0.015
TokenBen
Token ID11696
Feature activation-5.24e-01

Pos logit contributions
 swe+0.021
 spe+0.021
 someday+0.020
 cheaper+0.020
 enough+0.020
Neg logit contributions
leaf-0.019
ibrarian-0.018
vell-0.017
iders-0.016
ibr-0.016
Token,
Token ID11
Feature activation+9.30e-01

Pos logit contributions
 swe+0.017
 someday+0.016
 cheaper+0.015
 spe+0.015
 smaller+0.014
Neg logit contributions
ibrarian-0.023
leaf-0.023
vell-0.022
taker-0.021
asc-0.021
Token can
Token ID460
Feature activation+1.63e-01

Pos logit contributions
leaf+0.043
vell+0.039
ower+0.038
ibrarian+0.038
asc+0.038
Neg logit contributions
 someday-0.030
 swe-0.030
 smaller-0.028
 enough-0.027
 cheaper-0.027
Token I
Token ID314
Feature activation-8.02e-01

Pos logit contributions
leaf+0.006
ibrarian+0.005
vell+0.005
taker+0.005
asc+0.005
Neg logit contributions
 swe-0.007
 someday-0.007
 cheaper-0.006
 spe-0.006
 smaller-0.006
Token show
Token ID905
Feature activation+1.85e+00

Pos logit contributions
 swe+0.030
 spe+0.029
 someday+0.028
 cheaper+0.028
 shin+0.027
Neg logit contributions
leaf-0.030
ibrarian-0.030
vell-0.029
he-0.028
taker-0.027
Token you
Token ID345
Feature activation-7.53e-02

Pos logit contributions
vell+0.083
leaf+0.083
ower+0.080
asc+0.080
iders+0.076
Neg logit contributions
 someday-0.061
 swe-0.060
 enough-0.060
 smaller-0.057
 again-0.055
Token something
Token ID1223
Feature activation+8.22e-01

Pos logit contributions
 swe+0.002
 someday+0.002
 spe+0.002
 cheaper+0.002
c+0.002
Neg logit contributions
leaf-0.003
ibrarian-0.003
vell-0.003
taker-0.003
he-0.003
Token?"
Token ID1701
Feature activation-4.94e-01

Pos logit contributions
leaf+0.036
ibrarian+0.034
vell+0.033
he+0.033
taker+0.032
Neg logit contributions
 swe-0.029
 someday-0.027
 cheaper-0.027
 smaller-0.026
 enough-0.024
Token Mia
Token ID32189
Feature activation-9.89e-01

Pos logit contributions
 swe+0.022
 someday+0.021
 cheaper+0.021
 spe+0.021
c+0.020
Neg logit contributions
ibrarian-0.016
leaf-0.015
taker-0.014
vell-0.013
 character-0.013
Token says
Token ID1139
Feature activation-3.74e-01

Pos logit contributions
 spe+0.034
 someday+0.034
 cheaper+0.034
 swe+0.033
 shin+0.030
Neg logit contributions
ibrarian-0.041
leaf-0.040
taker-0.036
izards-0.035
he-0.035
Token from
Token ID422
Feature activation+1.06e-01

Pos logit contributions
 swe+0.007
 someday+0.007
 cheaper+0.007
 spe+0.007
c+0.007
Neg logit contributions
leaf-0.005
ibrarian-0.005
vell-0.005
taker-0.005
he-0.005
Token the
Token ID262
Feature activation+1.37e+00

Pos logit contributions
leaf+0.004
ibrarian+0.004
vell+0.004
taker+0.004
asc+0.004
Neg logit contributions
 swe-0.004
 someday-0.004
 spe-0.004
 cheaper-0.004
 smaller-0.004
Token woods
Token ID16479
Feature activation-5.28e-01

Pos logit contributions
leaf+0.063
ibrarian+0.062
vell+0.059
ibr+0.057
taker+0.057
Neg logit contributions
 swe-0.044
 smaller-0.043
 cheaper-0.040
 spe-0.040
 someday-0.040
Token.
Token ID13
Feature activation+2.60e-01

Pos logit contributions
 swe+0.019
 someday+0.018
 spe+0.018
 cheaper+0.018
 shin+0.017
Neg logit contributions
leaf-0.021
ibrarian-0.020
vell-0.020
taker-0.019
he-0.019
Token She
Token ID1375
Feature activation+8.64e-02

Pos logit contributions
leaf+0.012
ibrarian+0.011
vell+0.011
taker+0.011
asc+0.010
Neg logit contributions
 swe-0.008
 someday-0.008
 cheaper-0.008
 spe-0.008
 One-0.007
Token was
Token ID373
Feature activation+2.03e+00

Pos logit contributions
leaf+0.004
ibrarian+0.004
vell+0.004
taker+0.004
he+0.004
Neg logit contributions
 swe-0.003
 someday-0.003
 spe-0.002
 cheaper-0.002
 smaller-0.002
Token curious
Token ID11040
Feature activation+6.49e-01

Pos logit contributions
vell+0.101
leaf+0.100
issors+0.099
affles+0.098
iders+0.096
Neg logit contributions
 smaller-0.060
 enough-0.056
 less-0.052
 longer-0.050
 swe-0.049
Token,
Token ID11
Feature activation+1.57e-01

Pos logit contributions
leaf+0.029
ibrarian+0.028
vell+0.028
asc+0.026
taker+0.025
Neg logit contributions
 swe-0.022
 someday-0.022
 enough-0.021
 cheaper-0.020
 One-0.020
Token so
Token ID523
Feature activation+6.27e-01

Pos logit contributions
leaf+0.006
ibrarian+0.006
vell+0.006
taker+0.006
asc+0.006
Neg logit contributions
 swe-0.006
 someday-0.005
 spe-0.005
 cheaper-0.005
 smaller-0.005
Token she
Token ID673
Feature activation-1.33e-01

Pos logit contributions
leaf+0.034
ibrarian+0.033
vell+0.032
asc+0.031
taker+0.031
Neg logit contributions
 someday-0.016
 swe-0.015
 smaller-0.014
 enough-0.014
 spe-0.013
Token decided
Token ID3066
Feature activation-1.67e-01

Pos logit contributions
 swe+0.004
 someday+0.004
 cheaper+0.004
 spe+0.004
c+0.004
Neg logit contributions
leaf-0.006
ibrarian-0.006
vell-0.006
taker-0.005
asc-0.005
Token.
Token ID13
Feature activation+5.29e-01

Pos logit contributions
leaf+0.045
ibrarian+0.044
vell+0.041
asc+0.041
opher+0.040
Neg logit contributions
 someday-0.026
 swe-0.025
 spe-0.024
 cheaper-0.024
 enough-0.023
Token\n
Token ID198
Feature activation+6.32e-01

Pos logit contributions
leaf+0.028
ibrarian+0.027
vell+0.027
asc+0.026
taker+0.026
Neg logit contributions
 someday-0.012
 swe-0.012
 One-0.012
 spe-0.011
 cheaper-0.011
Token\n
Token ID198
Feature activation+1.56e-01

Pos logit contributions
leaf+0.034
vell+0.033
ibrarian+0.033
asc+0.031
taker+0.031
Neg logit contributions
 someday-0.014
 swe-0.013
 One-0.013
 smaller-0.012
 cheaper-0.012
TokenThe
Token ID464
Feature activation+1.88e+00

Pos logit contributions
leaf+0.007
ibrarian+0.007
vell+0.007
taker+0.007
asc+0.007
Neg logit contributions
 swe-0.005
 someday-0.005
 cheaper-0.004
 spe-0.004
 smaller-0.004
Token books
Token ID3835
Feature activation+1.58e+00

Pos logit contributions
leaf+0.092
asc+0.088
izards+0.084
uted+0.083
ying+0.082
Neg logit contributions
 swe-0.059
 smaller-0.057
 cheaper-0.054
 One-0.054
 someday-0.053
Tokenhe
Token ID258
Feature activation+1.89e+00

Pos logit contributions
leaf+0.037
asc+0.031
ibrarian+0.030
vell+0.030
issors+0.028
Neg logit contributions
 someday-0.086
 swe-0.081
 spe-0.081
 again-0.080
 enough-0.080
Tokenlf
Token ID1652
Feature activation-4.07e-01

Pos logit contributions
leaf+0.057
vell+0.053
asc+0.047
issors+0.044
ower+0.042
Neg logit contributions
 someday-0.091
 again-0.088
 smaller-0.084
 enough-0.083
 spe-0.083
Token was
Token ID373
Feature activation-1.32e-01

Pos logit contributions
 swe+0.013
 someday+0.012
 cheaper+0.012
 spe+0.012
 smaller+0.011
Neg logit contributions
leaf-0.018
ibrarian-0.018
vell-0.017
taker-0.017
he-0.017
Token happy
Token ID3772
Feature activation-2.58e-02

Pos logit contributions
 swe+0.005
 someday+0.005
 cheaper+0.005
 spe+0.005
c+0.005
Neg logit contributions
leaf-0.005
ibrarian-0.005
vell-0.005
taker-0.005
he-0.004
Token to
Token ID284
Feature activation-1.32e+00

Pos logit contributions
 swe+0.001
 someday+0.001
 cheaper+0.001
 spe+0.001
c+0.001
Neg logit contributions
leaf-0.001
ibrarian-0.001
vell-0.001
taker-0.001
he-0.001
Token have
Token ID423
Feature activation-1.65e-01

Pos logit contributions
 swe+0.053
 cheaper+0.050
 shin+0.047
 spe+0.045
 coll+0.045
Neg logit contributions
he-0.050
 ear-0.048
ibrarian-0.048
leaf-0.046
taker-0.044
Token captain
Token ID10654
Feature activation+3.00e-01

Pos logit contributions
leaf+0.014
ibrarian+0.013
vell+0.013
taker+0.012
asc+0.012
Neg logit contributions
 swe-0.011
 someday-0.011
 cheaper-0.010
 spe-0.010
 smaller-0.010
Token of
Token ID286
Feature activation-3.41e-04

Pos logit contributions
leaf+0.011
ibrarian+0.011
vell+0.010
taker+0.010
asc+0.010
Neg logit contributions
 someday-0.012
 swe-0.012
 cheaper-0.011
 spe-0.011
c-0.011
Token the
Token ID262
Feature activation+3.47e-01

Pos logit contributions
 swe+0.000
 someday+0.000
 cheaper+0.000
 spe+0.000
c+0.000
Neg logit contributions
leaf-0.000
ibrarian-0.000
vell-0.000
taker-0.000
he-0.000
Token ship
Token ID4074
Feature activation+4.84e-01

Pos logit contributions
leaf+0.015
ibrarian+0.014
vell+0.014
taker+0.013
asc+0.013
Neg logit contributions
 swe-0.012
 someday-0.012
 cheaper-0.011
 spe-0.011
 smaller-0.011
Token.
Token ID13
Feature activation+1.03e+00

Pos logit contributions
leaf+0.020
ibrarian+0.019
vell+0.019
taker+0.018
asc+0.018
Neg logit contributions
 someday-0.017
 swe-0.017
 cheaper-0.016
 spe-0.016
 smaller-0.015
Token Give
Token ID13786
Feature activation+9.28e-01

Pos logit contributions
leaf+0.044
vell+0.043
ibrarian+0.043
he+0.040
opher+0.040
Neg logit contributions
 someday-0.035
 One-0.033
 swe-0.032
 spe-0.031
 cheaper-0.030
Token me
Token ID502
Feature activation+4.98e-01

Pos logit contributions
vell+0.043
leaf+0.043
ibrarian+0.042
iders+0.041
taker+0.041
Neg logit contributions
 swe-0.028
 enough-0.028
 someday-0.027
 smaller-0.026
 cheaper-0.025
Token your
Token ID534
Feature activation+8.19e-01

Pos logit contributions
leaf+0.024
ibrarian+0.023
vell+0.023
taker+0.022
iders+0.022
Neg logit contributions
 swe-0.014
 someday-0.014
 cheaper-0.013
 smaller-0.013
 enough-0.013
Token treasure
Token ID14068
Feature activation+2.85e-01

Pos logit contributions
leaf+0.037
ibrarian+0.036
asc+0.034
taker+0.034
iders+0.033
Neg logit contributions
 swe-0.028
 someday-0.026
 cheaper-0.026
 smaller-0.026
 spe-0.025
Token or
Token ID393
Feature activation+1.29e+00

Pos logit contributions
leaf+0.012
ibrarian+0.011
vell+0.011
taker+0.011
asc+0.010
Neg logit contributions
 swe-0.010
 someday-0.010
 cheaper-0.009
 spe-0.009
 smaller-0.009
Token walk
Token ID2513
Feature activation-1.38e-02

Pos logit contributions
leaf+0.061
ibrarian+0.060
he+0.060
vell+0.059
taker+0.058
Neg logit contributions
 someday-0.040
 swe-0.039
 smaller-0.037
 cheaper-0.036
 enough-0.035
Token she
Token ID673
Feature activation+1.02e+00

Pos logit contributions
leaf+0.034
ibrarian+0.032
vell+0.031
asc+0.030
opher+0.029
Neg logit contributions
 swe-0.023
 someday-0.022
 cheaper-0.021
 smaller-0.020
 enough-0.020
Token saw
Token ID2497
Feature activation+4.60e-01

Pos logit contributions
leaf+0.058
vell+0.053
taker+0.053
ibrarian+0.052
opher+0.051
Neg logit contributions
 someday-0.023
 swe-0.022
c-0.022
 smaller-0.020
 spe-0.020
Token a
Token ID257
Feature activation+3.92e-01

Pos logit contributions
leaf+0.023
vell+0.022
ibrarian+0.022
taker+0.021
asc+0.021
Neg logit contributions
 swe-0.012
 someday-0.012
 cheaper-0.011
 smaller-0.011
 enough-0.011
Token big
Token ID1263
Feature activation+3.40e-01

Pos logit contributions
leaf+0.018
ibrarian+0.017
vell+0.017
taker+0.016
ibr+0.016
Neg logit contributions
 swe-0.012
 someday-0.012
 cheaper-0.011
 spe-0.011
 smaller-0.011
Token post
Token ID1281
Feature activation+8.84e-01

Pos logit contributions
leaf+0.015
ibrarian+0.015
vell+0.014
taker+0.014
ibr+0.014
Neg logit contributions
 swe-0.011
 someday-0.011
 spe-0.010
 cheaper-0.010
 smaller-0.010
Tokenbox
Token ID3524
Feature activation+1.15e+00

Pos logit contributions
leaf+0.035
ibrarian+0.032
vell+0.032
taker+0.031
opher+0.031
Neg logit contributions
 someday-0.033
c-0.031
 swe-0.031
 spe-0.031
 cheaper-0.031
Token.
Token ID13
Feature activation+7.68e-01

Pos logit contributions
leaf+0.047
ibrarian+0.044
taker+0.042
vell+0.041
opher+0.041
Neg logit contributions
 someday-0.042
 swe-0.040
 spe-0.040
 enough-0.040
 cheaper-0.040
Token It
Token ID632
Feature activation-8.68e-01

Pos logit contributions
leaf+0.037
vell+0.036
ibrarian+0.035
asc+0.034
taker+0.033
Neg logit contributions
 swe-0.023
 One-0.022
 someday-0.021
 cheaper-0.021
 smaller-0.020
Token was
Token ID373
Feature activation-1.09e-01

Pos logit contributions
 someday+0.032
 swe+0.032
 cheaper+0.029
 enough+0.028
 spe+0.028
Neg logit contributions
ibrarian-0.034
leaf-0.032
vell-0.031
 character-0.031
he-0.031
Token red
Token ID2266
Feature activation+9.71e-01

Pos logit contributions
 swe+0.004
 someday+0.004
 cheaper+0.003
 spe+0.003
c+0.003
Neg logit contributions
leaf-0.005
ibrarian-0.004
vell-0.004
taker-0.004
he-0.004
Token!
Token ID0
Feature activation+7.91e-01

Pos logit contributions
leaf+0.054
ibrarian+0.052
vell+0.051
ibr+0.049
taker+0.049
Neg logit contributions
 swe-0.022
 enough-0.021
 someday-0.021
 cheaper-0.020
 spe-0.019
Token rabbit
Token ID22746
Feature activation-7.58e-01

Pos logit contributions
leaf+0.005
ibrarian+0.004
vell+0.004
taker+0.004
asc+0.004
Neg logit contributions
 swe-0.003
 someday-0.003
 cheaper-0.003
 spe-0.003
 smaller-0.003
Token was
Token ID373
Feature activation+6.74e-01

Pos logit contributions
 swe+0.026
 someday+0.025
 cheaper+0.025
 spe+0.024
 smaller+0.023
Neg logit contributions
leaf-0.031
ibrarian-0.031
vell-0.029
taker-0.028
 character-0.027
Token so
Token ID523
Feature activation+1.06e+00

Pos logit contributions
leaf+0.029
vell+0.027
ibrarian+0.027
iders+0.026
taker+0.025
Neg logit contributions
 swe-0.023
 smaller-0.022
 cheaper-0.022
 someday-0.022
 spe-0.022
Token happy
Token ID3772
Feature activation+5.99e-01

Pos logit contributions
leaf+0.047
ibrarian+0.045
vell+0.044
iders+0.043
asc+0.041
Neg logit contributions
 smaller-0.035
 swe-0.034
 spe-0.034
 cheaper-0.033
 someday-0.032
Token!
Token ID0
Feature activation-3.67e-01

Pos logit contributions
leaf+0.028
ibrarian+0.028
vell+0.027
taker+0.026
ibr+0.026
Neg logit contributions
 someday-0.018
 swe-0.018
 enough-0.017
 spe-0.016
 cheaper-0.016
Token He
Token ID679
Feature activation+1.91e-01

Pos logit contributions
 swe+0.012
 someday+0.012
 spe+0.011
 cheaper+0.011
 smaller+0.011
Neg logit contributions
leaf-0.016
ibrarian-0.015
taker-0.014
vell-0.014
he-0.014
Token followed
Token ID3940
Feature activation+1.14e+00

Pos logit contributions
leaf+0.008
ibrarian+0.008
vell+0.008
taker+0.007
asc+0.007
Neg logit contributions
 swe-0.007
 someday-0.006
 spe-0.006
 cheaper-0.006
 smaller-0.006
Token the
Token ID262
Feature activation+8.64e-01

Pos logit contributions
vell+0.064
leaf+0.063
ibrarian+0.062
asc+0.062
iders+0.061
Neg logit contributions
 someday-0.024
 swe-0.023
 enough-0.021
 One-0.021
 smaller-0.021
Token bird
Token ID6512
Feature activation-2.64e-01

Pos logit contributions
leaf+0.039
ibrarian+0.038
vell+0.037
asc+0.036
he+0.036
Neg logit contributions
 swe-0.029
 smaller-0.027
 someday-0.026
 spe-0.026
 cheaper-0.026
Token and
Token ID290
Feature activation+3.50e-01

Pos logit contributions
 swe+0.010
 someday+0.010
 cheaper+0.010
 spe+0.010
 smaller+0.009
Neg logit contributions
leaf-0.010
ibrarian-0.010
vell-0.009
taker-0.009
he-0.009
Token they
Token ID484
Feature activation+1.02e+00

Pos logit contributions
leaf+0.017
ibrarian+0.016
vell+0.016
taker+0.015
asc+0.015
Neg logit contributions
 swe-0.010
 someday-0.010
 spe-0.009
 cheaper-0.009
 smaller-0.009
Token head
Token ID1182
Feature activation+9.74e-02

Pos logit contributions
leaf+0.022
ibrarian+0.021
vell+0.019
taker+0.019
asc+0.019
Neg logit contributions
 swe-0.022
 someday-0.021
 smaller-0.021
 cheaper-0.021
 spe-0.020
Token.
Token ID13
Feature activation+5.27e-01

Pos logit contributions
leaf+0.004
ibrarian+0.004
vell+0.004
taker+0.004
asc+0.004
Neg logit contributions
 swe-0.003
 someday-0.003
 cheaper-0.003
 spe-0.003
 smaller-0.003
Token Mia
Token ID32189
Feature activation+5.34e-01

Pos logit contributions
leaf+0.025
vell+0.024
ibrarian+0.024
taker+0.022
asc+0.022
Neg logit contributions
 swe-0.016
 someday-0.015
 One-0.015
 spe-0.014
 cheaper-0.014
Token felt
Token ID2936
Feature activation+3.52e-01

Pos logit contributions
leaf+0.026
vell+0.025
ibrarian+0.024
taker+0.024
asc+0.023
Neg logit contributions
 swe-0.015
 someday-0.015
 spe-0.014
c-0.014
 cheaper-0.014
Token something
Token ID1223
Feature activation+7.84e-01

Pos logit contributions
leaf+0.016
ibrarian+0.016
vell+0.015
asc+0.014
taker+0.014
Neg logit contributions
 swe-0.011
 someday-0.011
 cheaper-0.010
 spe-0.010
 smaller-0.010
Token wet
Token ID9583
Feature activation+5.32e-01

Pos logit contributions
leaf+0.034
ibrarian+0.033
vell+0.032
asc+0.031
ibr+0.031
Neg logit contributions
 swe-0.028
 spe-0.026
 someday-0.025
 cheaper-0.025
 smaller-0.025
Token and
Token ID290
Feature activation+3.86e-01

Pos logit contributions
leaf+0.023
ibrarian+0.022
vell+0.021
taker+0.020
ibr+0.020
Neg logit contributions
 someday-0.019
 swe-0.019
 enough-0.017
 spe-0.017
 cheaper-0.017
Token gross
Token ID10319
Feature activation+1.02e+00

Pos logit contributions
leaf+0.018
ibrarian+0.017
vell+0.016
taker+0.016
ibr+0.015
Neg logit contributions
 swe-0.013
 someday-0.012
 spe-0.012
 cheaper-0.012
 smaller-0.012
Token on
Token ID319
Feature activation-5.65e-02

Pos logit contributions
leaf+0.052
ibrarian+0.049
vell+0.048
ibr+0.045
taker+0.045
Neg logit contributions
 swe-0.028
 enough-0.028
 someday-0.028
 spe-0.025
 smaller-0.025
Token her
Token ID607
Feature activation+4.25e-01

Pos logit contributions
 swe+0.002
 someday+0.002
 cheaper+0.002
 spe+0.002
c+0.002
Neg logit contributions
leaf-0.002
ibrarian-0.002
vell-0.002
taker-0.002
he-0.002
Token hair
Token ID4190
Feature activation+1.94e-01

Pos logit contributions
leaf+0.014
ibrarian+0.013
vell+0.012
taker+0.012
asc+0.011
Neg logit contributions
 swe-0.020
 someday-0.019
 spe-0.018
 smaller-0.018
 cheaper-0.018
Token.
Token ID13
Feature activation-2.98e-01

Pos logit contributions
 swe+0.000
 someday+0.000
 spe+0.000
 cheaper+0.000
 smaller+0.000
Neg logit contributions
leaf-0.000
ibrarian-0.000
vell-0.000
taker-0.000
asc-0.000
Token He
Token ID679
Feature activation-2.43e-01

Pos logit contributions
 swe+0.010
 someday+0.009
 spe+0.009
 cheaper+0.009
 smaller+0.008
Neg logit contributions
leaf-0.013
ibrarian-0.013
vell-0.012
taker-0.012
asc-0.012
Token said
Token ID531
Feature activation-5.05e-01

Pos logit contributions
 swe+0.008
 someday+0.008
 cheaper+0.008
 spe+0.008
 smaller+0.007
Neg logit contributions
leaf-0.010
ibrarian-0.010
vell-0.010
taker-0.009
asc-0.009
Token,
Token ID11
Feature activation+2.58e-01

Pos logit contributions
 swe+0.015
 someday+0.015
 cheaper+0.014
 spe+0.014
c+0.014
Neg logit contributions
leaf-0.023
ibrarian-0.023
taker-0.022
vell-0.022
 character-0.021
Token "
Token ID366
Feature activation+1.49e-01

Pos logit contributions
leaf+0.011
ibrarian+0.010
vell+0.010
taker+0.010
asc+0.010
Neg logit contributions
 swe-0.009
 someday-0.009
 spe-0.008
 cheaper-0.008
 smaller-0.008
TokenYou
Token ID1639
Feature activation+9.29e-01

Pos logit contributions
leaf+0.006
ibrarian+0.006
vell+0.006
taker+0.006
asc+0.006
Neg logit contributions
 swe-0.005
 someday-0.005
 cheaper-0.005
 spe-0.005
c-0.005
Token're
Token ID821
Feature activation+1.91e+00

Pos logit contributions
leaf+0.042
ibrarian+0.040
iders+0.039
ower+0.038
opher+0.038
Neg logit contributions
 someday-0.028
 swe-0.028
 enough-0.027
 smaller-0.026
 spe-0.026
Token welcome
Token ID7062
Feature activation+2.71e-01

Pos logit contributions
iders+0.081
ower+0.079
leaf+0.079
person+0.079
ied+0.077
Neg logit contributions
 smaller-0.067
 less-0.066
 better-0.066
 swe-0.064
 enough-0.064
Token,
Token ID11
Feature activation+1.87e-01

Pos logit contributions
leaf+0.013
ibrarian+0.012
vell+0.012
taker+0.012
he+0.011
Neg logit contributions
 swe-0.008
 someday-0.008
 cheaper-0.007
 spe-0.007
 enough-0.007
Token little
Token ID1310
Feature activation-4.20e-01

Pos logit contributions
leaf+0.006
ibrarian+0.006
vell+0.006
taker+0.006
he+0.006
Neg logit contributions
 swe-0.008
 someday-0.008
 cheaper-0.007
 spe-0.007
 smaller-0.007
Token boy
Token ID2933
Feature activation-4.57e-01

Pos logit contributions
 swe+0.016
 someday+0.016
 spe+0.015
 cheaper+0.015
c+0.015
Neg logit contributions
leaf-0.016
ibrarian-0.015
vell-0.014
taker-0.014
 character-0.013
Token.
Token ID13
Feature activation+4.23e-02

Pos logit contributions
leaf+0.040
ibrarian+0.037
vell+0.037
asc+0.035
ibr+0.035
Neg logit contributions
 swe-0.019
 someday-0.019
c-0.017
 cheaper-0.017
 smaller-0.017
Token "
Token ID366
Feature activation+2.96e-01

Pos logit contributions
leaf+0.002
ibrarian+0.002
vell+0.002
taker+0.002
asc+0.002
Neg logit contributions
 swe-0.001
 someday-0.001
 cheaper-0.001
 spe-0.001
 smaller-0.001
TokenCan
Token ID6090
Feature activation-8.25e-01

Pos logit contributions
leaf+0.013
ibrarian+0.013
vell+0.013
taker+0.012
opher+0.012
Neg logit contributions
 swe-0.009
 someday-0.009
 spe-0.009
 cheaper-0.009
c-0.008
Token I
Token ID314
Feature activation-1.59e-01

Pos logit contributions
 swe+0.039
 cheaper+0.038
 someday+0.037
 enough+0.037
 spe+0.036
Neg logit contributions
leaf-0.022
ibrarian-0.022
vell-0.020
 goodness-0.019
taker-0.019
Token cook
Token ID4255
Feature activation+9.08e-01

Pos logit contributions
 swe+0.006
 someday+0.006
 cheaper+0.005
 spe+0.005
 smaller+0.005
Neg logit contributions
leaf-0.007
ibrarian-0.006
vell-0.006
taker-0.006
he-0.006
Token you
Token ID345
Feature activation-2.01e-01

Pos logit contributions
leaf+0.045
ibrarian+0.043
vell+0.043
taker+0.042
ibr+0.040
Neg logit contributions
 swe-0.027
 someday-0.026
 cheaper-0.024
 smaller-0.024
 enough-0.023
Token for
Token ID329
Feature activation-3.43e-01

Pos logit contributions
 swe+0.007
 someday+0.007
 spe+0.007
 cheaper+0.007
c+0.007
Neg logit contributions
leaf-0.008
ibrarian-0.008
vell-0.007
taker-0.007
he-0.007
Token dinner
Token ID8073
Feature activation+2.35e-01

Pos logit contributions
 swe+0.012
 someday+0.011
 spe+0.011
 cheaper+0.011
c+0.011
Neg logit contributions
leaf-0.014
ibrarian-0.014
vell-0.013
taker-0.013
 character-0.013
Token?"
Token ID1701
Feature activation-3.82e-01

Pos logit contributions
leaf+0.010
ibrarian+0.010
vell+0.010
taker+0.009
asc+0.009
Neg logit contributions
 swe-0.008
 someday-0.008
 cheaper-0.007
 spe-0.007
 smaller-0.007
Token asked
Token ID1965
Feature activation-5.79e-01

Pos logit contributions
 swe+0.012
 someday+0.012
 cheaper+0.012
 spe+0.011
 smaller+0.011
Neg logit contributions
leaf-0.017
ibrarian-0.016
taker-0.015
vell-0.015
he-0.015
Token Lily
Token ID20037
Feature activation-1.44e-01

Pos logit contributions
 swe+0.020
 cheaper+0.019
 someday+0.019
 spe+0.019
 shin+0.018
Neg logit contributions
ibrarian-0.023
leaf-0.023
he-0.021
taker-0.021
 character-0.021
Token until
Token ID1566
Feature activation-9.37e-01

Pos logit contributions
leaf+0.002
ibrarian+0.002
vell+0.002
taker+0.002
asc+0.002
Neg logit contributions
 swe-0.002
 someday-0.002
 cheaper-0.002
 spe-0.002
 smaller-0.002
Token it
Token ID340
Feature activation-6.65e-01

Pos logit contributions
 swe+0.032
c+0.029
 someday+0.028
 cheaper+0.028
 shin+0.027
Neg logit contributions
leaf-0.040
ibrarian-0.038
taker-0.037
vell-0.036
he-0.036
Token stopped
Token ID5025
Feature activation-1.30e+00

Pos logit contributions
 someday+0.023
 swe+0.023
 cheaper+0.022
 spe+0.022
 shin+0.021
Neg logit contributions
ibrarian-0.026
leaf-0.026
vell-0.025
he-0.024
taker-0.024
Token in
Token ID287
Feature activation-1.33e+00

Pos logit contributions
 swe+0.053
 cheaper+0.053
 spe+0.051
 someday+0.049
c+0.049
Neg logit contributions
leaf-0.041
 outside-0.038
ting-0.037
vell-0.037
ower-0.037
Token front
Token ID2166
Feature activation-2.39e-01

Pos logit contributions
 swe+0.055
c+0.052
 someday+0.052
 cheaper+0.051
 shin+0.050
Neg logit contributions
leaf-0.044
he-0.040
 ear-0.038
 character-0.038
ibrarian-0.038
Token of
Token ID286
Feature activation-5.88e-01

Pos logit contributions
 swe+0.011
 someday+0.011
 spe+0.010
 cheaper+0.010
c+0.010
Neg logit contributions
leaf-0.007
ibrarian-0.007
vell-0.006
taker-0.006
he-0.006
Token a
Token ID257
Feature activation-3.00e-01

Pos logit contributions
 swe+0.018
 someday+0.017
 cheaper+0.016
 spe+0.016
c+0.016
Neg logit contributions
leaf-0.027
ibrarian-0.026
vell-0.025
taker-0.024
he-0.024
Token big
Token ID1263
Feature activation-6.43e-01

Pos logit contributions
 swe+0.010
 someday+0.010
 spe+0.010
 cheaper+0.010
c+0.009
Neg logit contributions
leaf-0.012
ibrarian-0.012
vell-0.011
taker-0.011
he-0.011
Token tree
Token ID5509
Feature activation-7.63e-01

Pos logit contributions
 swe+0.021
 someday+0.021
 cheaper+0.020
 spe+0.020
 shin+0.019
Neg logit contributions
leaf-0.026
ibrarian-0.025
taker-0.024
vell-0.024
 character-0.024
Token.
Token ID13
Feature activation-9.44e-02

Pos logit contributions
 swe+0.028
 someday+0.026
 cheaper+0.026
 spe+0.025
 smaller+0.024
Neg logit contributions
leaf-0.030
ibrarian-0.029
vell-0.028
he-0.027
taker-0.027
Token The
Token ID383
Feature activation+8.20e-01

Pos logit contributions
 swe+0.003
 someday+0.003
 spe+0.003
 cheaper+0.003
c+0.003
Neg logit contributions
leaf-0.004
ibrarian-0.004
vell-0.003
taker-0.003
asc-0.003
Token,
Token ID11
Feature activation-7.13e-01

Pos logit contributions
 swe+0.029
 spe+0.028
 cheaper+0.027
 someday+0.027
 shin+0.026
Neg logit contributions
ibrarian-0.022
leaf-0.021
he-0.021
vell-0.020
taker-0.020
Token but
Token ID475
Feature activation+3.33e-01

Pos logit contributions
 swe+0.028
 spe+0.027
 someday+0.027
 cheaper+0.026
c+0.026
Neg logit contributions
leaf-0.025
ibrarian-0.023
vell-0.023
taker-0.022
he-0.021
Token Ben
Token ID3932
Feature activation+3.25e-01

Pos logit contributions
leaf+0.016
vell+0.015
ibrarian+0.015
taker+0.014
asc+0.014
Neg logit contributions
 swe-0.010
 someday-0.010
 cheaper-0.009
 spe-0.009
 enough-0.009
Token keeps
Token ID7622
Feature activation+1.32e-01

Pos logit contributions
leaf+0.015
ibrarian+0.014
vell+0.014
taker+0.013
ibr+0.013
Neg logit contributions
 swe-0.010
 someday-0.010
 cheaper-0.010
 spe-0.009
c-0.009
Token taking
Token ID2263
Feature activation+3.08e-01

Pos logit contributions
leaf+0.004
ibrarian+0.004
vell+0.004
taker+0.004
he+0.004
Neg logit contributions
 swe-0.006
 someday-0.006
 cheaper-0.005
 spe-0.005
 smaller-0.005
Token her
Token ID607
Feature activation-3.37e-01

Pos logit contributions
leaf+0.015
ibrarian+0.015
vell+0.014
taker+0.014
asc+0.014
Neg logit contributions
 swe-0.009
 someday-0.008
 cheaper-0.008
 spe-0.008
 smaller-0.008
Token blocks
Token ID7021
Feature activation-1.16e-01

Pos logit contributions
 swe+0.013
 someday+0.013
 spe+0.013
 cheaper+0.013
 shin+0.012
Neg logit contributions
leaf-0.012
ibrarian-0.012
vell-0.011
taker-0.011
he-0.010
Token.
Token ID13
Feature activation-1.09e+00

Pos logit contributions
 swe+0.004
 someday+0.004
 spe+0.004
 cheaper+0.004
c+0.004
Neg logit contributions
leaf-0.005
ibrarian-0.005
vell-0.005
taker-0.004
he-0.004
Token Sara
Token ID24799
Feature activation+1.23e-01

Pos logit contributions
 swe+0.040
 someday+0.039
 spe+0.039
 cheaper+0.039
 shin+0.036
Neg logit contributions
ibrarian-0.041
leaf-0.038
taker-0.037
he-0.037
asc-0.036
Token gets
Token ID3011
Feature activation-5.46e-01

Pos logit contributions
leaf+0.006
ibrarian+0.006
vell+0.005
taker+0.005
asc+0.005
Neg logit contributions
 swe-0.004
 someday-0.004
 cheaper-0.003
 spe-0.003
 smaller-0.003
Token angry
Token ID7954
Feature activation+1.05e+00

Pos logit contributions
 someday+0.022
 swe+0.021
 spe+0.021
 cheaper+0.020
c+0.019
Neg logit contributions
leaf-0.020
ibrarian-0.019
taker-0.018
vell-0.018
asc-0.017
Token-
Token ID12
Feature activation+1.39e+00

Pos logit contributions
leaf+0.050
vell+0.048
ibrarian+0.047
iders+0.046
he+0.045
Neg logit contributions
 swe-0.030
 spe-0.029
 someday-0.028
 cheaper-0.027
 enough-0.027
Tokenold
Token ID727
Feature activation-1.15e-01

Pos logit contributions
opher+0.041
vell+0.040
leaf+0.039
iders+0.037
 character+0.037
Neg logit contributions
 someday-0.061
c-0.060
 cheaper-0.058
 swe-0.056
 One-0.054
Token but
Token ID475
Feature activation+3.01e-01

Pos logit contributions
 swe+0.004
 someday+0.003
 cheaper+0.003
 spe+0.003
 smaller+0.003
Neg logit contributions
leaf-0.005
ibrarian-0.005
vell-0.005
taker-0.005
asc-0.005
Token her
Token ID607
Feature activation+1.52e+00

Pos logit contributions
leaf+0.014
ibrarian+0.014
vell+0.014
taker+0.013
asc+0.013
Neg logit contributions
 swe-0.009
 someday-0.009
 cheaper-0.008
 spe-0.008
 smaller-0.008
Token heart
Token ID2612
Feature activation+8.53e-02

Pos logit contributions
leaf+0.084
iders+0.078
taker+0.077
ting+0.076
ying+0.076
Neg logit contributions
 swe-0.037
 smaller-0.032
 spe-0.032
 cheaper-0.032
 One-0.032
Token was
Token ID373
Feature activation-7.30e-01

Pos logit contributions
leaf+0.004
ibrarian+0.004
vell+0.004
taker+0.004
asc+0.004
Neg logit contributions
 swe-0.002
 someday-0.002
 cheaper-0.002
 spe-0.002
 smaller-0.002
Token brave
Token ID14802
Feature activation-6.81e-02

Pos logit contributions
 someday+0.028
 swe+0.027
 spe+0.027
 cheaper+0.026
c+0.025
Neg logit contributions
leaf-0.025
ibrarian-0.025
taker-0.024
vell-0.023
he-0.022
Token.
Token ID13
Feature activation-1.07e+00

Pos logit contributions
 swe+0.002
 someday+0.002
 spe+0.002
 cheaper+0.002
 smaller+0.002
Neg logit contributions
leaf-0.003
ibrarian-0.003
vell-0.003
taker-0.003
he-0.003
Token
Token ID198
Feature activation-7.50e-01

Pos logit contributions
 spe+0.032
 swe+0.031
 cheaper+0.031
 someday+0.030
 smaller+0.028
Neg logit contributions
ibrarian-0.046
leaf-0.045
taker-0.045
he-0.044
asc-0.044
Token
Token ID198
Feature activation-1.31e+00

Pos logit contributions
 swe+0.026
 spe+0.024
 cheaper+0.024
 someday+0.023
 shin+0.022
Neg logit contributions
leaf-0.031
ibrarian-0.031
he-0.030
taker-0.029
vell-0.029
TokenOne
Token ID3198
Feature activation+1.01e+00

Pos logit contributions
 spe+0.032
 swe+0.031
 cheaper+0.029
 enough+0.028
 someday+0.027
Neg logit contributions
he-0.066
ibrarian-0.065
leaf-0.063
asc-0.062
taker-0.060
Token face
Token ID1986
Feature activation+7.48e-02

Pos logit contributions
leaf+0.026
ibrarian+0.025
vell+0.024
taker+0.023
asc+0.023
Neg logit contributions
 swe-0.032
 someday-0.030
 spe-0.029
c-0.029
 smaller-0.029
Token and
Token ID290
Feature activation+5.28e-01

Pos logit contributions
leaf+0.003
ibrarian+0.003
vell+0.003
taker+0.003
asc+0.003
Neg logit contributions
 swe-0.002
 someday-0.002
 cheaper-0.002
 spe-0.002
 smaller-0.002
Token pur
Token ID1308
Feature activation-1.19e-01

Pos logit contributions
leaf+0.023
ibrarian+0.022
vell+0.022
taker+0.020
asc+0.020
Neg logit contributions
 swe-0.018
 someday-0.018
 spe-0.017
 cheaper-0.017
 smaller-0.016
Tokenr
Token ID81
Feature activation+9.16e-01

Pos logit contributions
 swe+0.006
 someday+0.006
 cheaper+0.006
 spe+0.006
 smaller+0.006
Neg logit contributions
leaf-0.003
ibrarian-0.003
vell-0.003
taker-0.002
he-0.002
Token every
Token ID790
Feature activation-4.26e-01

Pos logit contributions
leaf+0.043
ibrarian+0.040
vell+0.039
asc+0.037
taker+0.036
Neg logit contributions
 swe-0.029
 someday-0.028
 spe-0.026
 enough-0.026
 cheaper-0.026
Token time
Token ID640
Feature activation-2.22e-01

Pos logit contributions
 swe+0.017
 someday+0.016
 cheaper+0.016
 spe+0.015
 shin+0.015
Neg logit contributions
leaf-0.016
ibrarian-0.015
taker-0.014
he-0.014
vell-0.014
Token she
Token ID673
Feature activation-8.00e-02

Pos logit contributions
 swe+0.007
 someday+0.007
 spe+0.007
 cheaper+0.007
c+0.006
Neg logit contributions
leaf-0.010
ibrarian-0.009
vell-0.009
taker-0.009
he-0.009
Token saw
Token ID2497
Feature activation+7.58e-01

Pos logit contributions
 swe+0.003
 someday+0.003
 cheaper+0.002
 spe+0.002
 smaller+0.002
Neg logit contributions
leaf-0.003
ibrarian-0.003
vell-0.003
taker-0.003
he-0.003
Token him
Token ID683
Feature activation-2.11e-01

Pos logit contributions
leaf+0.034
ibrarian+0.033
vell+0.033
asc+0.031
taker+0.031
Neg logit contributions
 swe-0.025
 someday-0.024
 cheaper-0.022
 spe-0.022
c-0.022
Token.
Token ID13
Feature activation-3.83e-02

Pos logit contributions
 swe+0.007
 someday+0.007
 spe+0.007
 cheaper+0.007
 smaller+0.006
Neg logit contributions
leaf-0.009
ibrarian-0.009
vell-0.008
taker-0.008
he-0.008
Token<|endoftext|>
Token ID50256
Feature activation+2.04e-01

Pos logit contributions
 swe+0.001
 someday+0.001
 cheaper+0.001
 spe+0.001
 smaller+0.001
Neg logit contributions
leaf-0.002
ibrarian-0.002
vell-0.002
taker-0.002
he-0.002
Token happy
Token ID3772
Feature activation-2.54e-01

Pos logit contributions
 swe+0.033
 someday+0.033
 spe+0.033
c+0.032
 cheaper+0.031
Neg logit contributions
leaf-0.031
taker-0.029
ibrarian-0.029
vell-0.028
he-0.028
Token,
Token ID11
Feature activation-9.09e-01

Pos logit contributions
 swe+0.008
 someday+0.008
 spe+0.008
 cheaper+0.008
c+0.008
Neg logit contributions
leaf-0.011
ibrarian-0.011
vell-0.010
taker-0.010
he-0.010
Token and
Token ID290
Feature activation+3.61e-02

Pos logit contributions
 swe+0.035
 cheaper+0.033
 spe+0.033
c+0.033
 someday+0.032
Neg logit contributions
leaf-0.033
taker-0.030
vell-0.030
ibrarian-0.029
he-0.028
Token they
Token ID484
Feature activation+5.05e-01

Pos logit contributions
leaf+0.002
ibrarian+0.002
vell+0.002
taker+0.002
asc+0.002
Neg logit contributions
 swe-0.001
 someday-0.001
 cheaper-0.001
 spe-0.001
 smaller-0.001
Token continued
Token ID3767
Feature activation-2.03e+00

Pos logit contributions
leaf+0.018
ibrarian+0.018
vell+0.017
iders+0.016
taker+0.016
Neg logit contributions
 someday-0.020
 swe-0.020
 spe-0.019
c-0.019
 cheaper-0.019
Token to
Token ID284
Feature activation-2.13e+00

Pos logit contributions
 cheaper+0.094
 shin+0.093
 someday+0.091
 swe+0.087
 spe+0.086
Neg logit contributions
 outside-0.060
 around-0.057
leaf-0.053
 ear-0.052
 everywhere-0.051
Token sail
Token ID14936
Feature activation-5.67e-01

Pos logit contributions
 cheaper+0.085
 swe+0.082
 spe+0.072
 enough+0.072
 shin+0.072
Neg logit contributions
he-0.070
leaf-0.068
 ear-0.067
vell-0.064
 outside-0.063
Token on
Token ID319
Feature activation-1.59e+00

Pos logit contributions
 swe+0.022
 cheaper+0.021
 spe+0.021
 someday+0.021
 shin+0.020
Neg logit contributions
leaf-0.020
ibrarian-0.019
he-0.019
vell-0.019
taker-0.018
Token the
Token ID262
Feature activation-1.25e+00

Pos logit contributions
 shin+0.055
 cheaper+0.055
 swe+0.055
c+0.055
 someday+0.054
Neg logit contributions
 outside-0.054
leaf-0.053
 everywhere-0.052
 around-0.052
 ear-0.050
Token sea
Token ID5417
Feature activation-9.21e-01

Pos logit contributions
 someday+0.038
c+0.038
 swe+0.038
 spe+0.037
 One+0.035
Neg logit contributions
leaf-0.051
vell-0.049
he-0.047
 outside-0.046
ong-0.045
Token,
Token ID11
Feature activation-2.33e-02

Pos logit contributions
 swe+0.033
 cheaper+0.032
 spe+0.031
 someday+0.031
 shin+0.030
Neg logit contributions
vell-0.034
leaf-0.033
ibrarian-0.033
he-0.032
taker-0.032
Token One
Token ID1881
Feature activation+1.19e-01

Pos logit contributions
 cheaper+0.044
 shin+0.043
 spe+0.042
 smaller+0.041
 enough+0.039
Neg logit contributions
 outside-0.088
he-0.086
 goodness-0.083
 around-0.082
 Everyone-0.081
Token day
Token ID1110
Feature activation+3.34e-01

Pos logit contributions
leaf+0.004
ibrarian+0.004
vell+0.004
taker+0.004
ibr+0.004
Neg logit contributions
 swe-0.005
 someday-0.005
 spe-0.004
 cheaper-0.004
c-0.004
Token,
Token ID11
Feature activation-7.25e-01

Pos logit contributions
leaf+0.018
ibrarian+0.018
vell+0.017
taker+0.017
asc+0.016
Neg logit contributions
 swe-0.008
 someday-0.007
 spe-0.007
 cheaper-0.007
 One-0.006
Token it
Token ID340
Feature activation-1.02e+00

Pos logit contributions
 someday+0.029
 swe+0.029
 cheaper+0.029
 spe+0.028
c+0.028
Neg logit contributions
leaf-0.023
taker-0.022
vell-0.021
he-0.021
ibrarian-0.021
Token started
Token ID2067
Feature activation-8.22e-01

Pos logit contributions
 someday+0.037
 cheaper+0.036
 spe+0.035
 swe+0.035
 shin+0.033
Neg logit contributions
ibrarian-0.037
leaf-0.036
taker-0.035
he-0.034
vell-0.034
Token to
Token ID284
Feature activation-2.12e+00

Pos logit contributions
 swe+0.026
 someday+0.026
 cheaper+0.025
 spe+0.025
 shin+0.024
Neg logit contributions
leaf-0.035
ibrarian-0.032
 outside-0.031
asc-0.031
taker-0.031
Token rain
Token ID6290
Feature activation+6.08e-02

Pos logit contributions
 someday+0.078
 cheaper+0.078
 enough+0.074
c+0.072
 spe+0.072
Neg logit contributions
he-0.069
 outside-0.067
taker-0.064
 sidelines-0.063
 earthquake-0.062
Token and
Token ID290
Feature activation-9.29e-01

Pos logit contributions
leaf+0.002
ibrarian+0.002
vell+0.002
taker+0.002
asc+0.002
Neg logit contributions
 swe-0.002
 someday-0.002
 cheaper-0.002
 spe-0.002
 smaller-0.002
Token Betsy
Token ID45775
Feature activation+1.27e-01

Pos logit contributions
 someday+0.032
 cheaper+0.030
 swe+0.030
c+0.029
 shin+0.029
Neg logit contributions
leaf-0.035
taker-0.035
ibrarian-0.032
vell-0.032
 character-0.031
Token got
Token ID1392
Feature activation-4.26e-01

Pos logit contributions
leaf+0.006
ibrarian+0.006
vell+0.005
taker+0.005
asc+0.005
Neg logit contributions
 swe-0.004
 someday-0.004
 spe-0.004
 cheaper-0.004
c-0.004
Token very
Token ID845
Feature activation-2.76e-02

Pos logit contributions
 someday+0.017
 swe+0.016
 spe+0.016
 cheaper+0.016
c+0.015
Neg logit contributions
leaf-0.015
ibrarian-0.014
vell-0.014
taker-0.014
asc-0.013
Token pin
Token ID6757
Feature activation+4.69e-01

Pos logit contributions
 someday+0.040
 swe+0.037
 spe+0.036
 enough+0.035
 cheaper+0.035
Neg logit contributions
leaf-0.042
vell-0.039
taker-0.039
ibrarian-0.037
 goodness-0.036
Token because
Token ID780
Feature activation+2.30e-01

Pos logit contributions
leaf+0.022
ibrarian+0.021
vell+0.021
asc+0.020
taker+0.020
Neg logit contributions
 swe-0.014
 someday-0.014
 cheaper-0.013
 enough-0.012
c-0.012
Token it
Token ID340
Feature activation+8.03e-01

Pos logit contributions
leaf+0.011
ibrarian+0.011
vell+0.011
taker+0.010
asc+0.010
Neg logit contributions
 swe-0.006
 someday-0.006
 cheaper-0.006
 spe-0.006
 smaller-0.006
Token made
Token ID925
Feature activation+1.17e-01

Pos logit contributions
leaf+0.043
ibrarian+0.042
vell+0.040
asc+0.039
opher+0.039
Neg logit contributions
 swe-0.019
 someday-0.018
 spe-0.017
 smaller-0.017
 shin-0.017
Token the
Token ID262
Feature activation-8.78e-01

Pos logit contributions
leaf+0.005
ibrarian+0.005
vell+0.005
taker+0.004
asc+0.004
Neg logit contributions
 swe-0.004
 someday-0.004
 cheaper-0.004
 spe-0.004
 smaller-0.004
Token rainbow
Token ID27223
Feature activation-2.01e+00

Pos logit contributions
 someday+0.035
 swe+0.034
 spe+0.034
 cheaper+0.032
 One+0.031
Neg logit contributions
vell-0.030
leaf-0.030
ibrarian-0.029
 goodness-0.028
he-0.028
Token look
Token ID804
Feature activation-1.11e+00

Pos logit contributions
 One+0.073
 swe+0.072
 cheaper+0.072
 someday+0.072
One+0.070
Neg logit contributions
vell-0.071
taker-0.070
leaf-0.068
ong-0.066
 around-0.065
Token even
Token ID772
Feature activation-1.52e+00

Pos logit contributions
 someday+0.043
 swe+0.043
c+0.042
 spe+0.042
 One+0.041
Neg logit contributions
leaf-0.037
vell-0.036
taker-0.034
ong-0.032
 around-0.032
Token more
Token ID517
Feature activation+3.53e-01

Pos logit contributions
 someday+0.034
 spe+0.034
 One+0.034
 bidder+0.030
c+0.029
Neg logit contributions
leaf-0.075
vell-0.071
taker-0.071
 around-0.068
ibrarian-0.067
Token beautiful
Token ID4950
Feature activation+1.51e-01

Pos logit contributions
leaf+0.016
ibrarian+0.016
vell+0.015
taker+0.014
iders+0.014
Neg logit contributions
 swe-0.012
 someday-0.011
 cheaper-0.011
 spe-0.011
 smaller-0.010
Token.
Token ID13
Feature activation+1.37e-01

Pos logit contributions
leaf+0.007
ibrarian+0.006
vell+0.006
taker+0.006
asc+0.006
Neg logit contributions
 swe-0.005
 someday-0.005
 spe-0.005
 cheaper-0.005
 smaller-0.004
Token sleepy
Token ID42023
Feature activation+1.02e-01

Pos logit contributions
 someday+0.030
 swe+0.029
 spe+0.028
c+0.028
 coll+0.027
Neg logit contributions
leaf-0.029
taker-0.027
ibrarian-0.026
vell-0.025
he-0.025
Token and
Token ID290
Feature activation+3.99e-01

Pos logit contributions
leaf+0.005
ibrarian+0.004
vell+0.004
taker+0.004
asc+0.004
Neg logit contributions
 swe-0.003
 someday-0.003
 cheaper-0.003
 spe-0.003
 smaller-0.003
Token decided
Token ID3066
Feature activation-8.16e-01

Pos logit contributions
leaf+0.017
ibrarian+0.017
vell+0.016
taker+0.016
asc+0.015
Neg logit contributions
 swe-0.013
 someday-0.013
 cheaper-0.012
 spe-0.012
 smaller-0.012
Token to
Token ID284
Feature activation+3.46e-01

Pos logit contributions
 spe+0.026
 swe+0.026
c+0.026
 someday+0.025
 cheaper+0.024
Neg logit contributions
leaf-0.035
ibrarian-0.034
taker-0.032
vell-0.031
 character-0.030
Token have
Token ID423
Feature activation-1.51e+00

Pos logit contributions
leaf+0.016
ibrarian+0.016
vell+0.015
taker+0.015
asc+0.014
Neg logit contributions
 swe-0.010
 someday-0.010
 spe-0.009
 cheaper-0.009
 smaller-0.009
Token a
Token ID257
Feature activation-2.12e+00

Pos logit contributions
 spe+0.057
 cheaper+0.054
 shin+0.053
 someday+0.053
c+0.053
Neg logit contributions
leaf-0.054
ibrarian-0.049
 character-0.047
 outside-0.047
 goodness-0.045
Token rest
Token ID1334
Feature activation+5.71e-01

Pos logit contributions
 someday+0.077
 spe+0.077
c+0.077
 shin+0.075
 swe+0.072
Neg logit contributions
leaf-0.073
ibrarian-0.068
 outside-0.066
vell-0.064
 character-0.063
Token.
Token ID13
Feature activation+7.07e-01

Pos logit contributions
leaf+0.025
ibrarian+0.023
vell+0.023
taker+0.022
ibr+0.021
Neg logit contributions
 someday-0.020
 swe-0.019
 cheaper-0.018
 enough-0.018
 spe-0.018
Token She
Token ID1375
Feature activation-5.45e-01

Pos logit contributions
leaf+0.034
vell+0.033
ibrarian+0.033
asc+0.031
taker+0.031
Neg logit contributions
 someday-0.021
 One-0.020
 swe-0.020
 cheaper-0.019
c-0.018
Token went
Token ID1816
Feature activation+5.32e-01

Pos logit contributions
 swe+0.019
 someday+0.018
 cheaper+0.018
 spe+0.018
 smaller+0.017
Neg logit contributions
leaf-0.022
ibrarian-0.022
vell-0.020
taker-0.020
he-0.019
Token to
Token ID284
Feature activation+3.58e-01

Pos logit contributions
leaf+0.025
vell+0.024
ibrarian+0.024
taker+0.023
iders+0.022
Neg logit contributions
 swe-0.016
 someday-0.016
 cheaper-0.015
 smaller-0.015
 enough-0.014
Token each
Token ID1123
Feature activation-4.61e-01

Pos logit contributions
leaf+0.007
ibrarian+0.007
vell+0.007
taker+0.007
asc+0.006
Neg logit contributions
 swe-0.005
 someday-0.005
 cheaper-0.005
 spe-0.005
 smaller-0.005
Token other
Token ID584
Feature activation-3.27e-01

Pos logit contributions
 swe+0.020
 someday+0.019
 cheaper+0.019
 spe+0.019
 shin+0.018
Neg logit contributions
leaf-0.015
ibrarian-0.014
vell-0.013
taker-0.013
he-0.013
Token when
Token ID618
Feature activation-1.49e+00

Pos logit contributions
 swe+0.012
 someday+0.012
 spe+0.012
 cheaper+0.012
c+0.011
Neg logit contributions
leaf-0.012
ibrarian-0.012
vell-0.012
taker-0.011
he-0.011
Token they
Token ID484
Feature activation+4.98e-02

Pos logit contributions
 swe+0.046
c+0.045
 cheaper+0.042
 someday+0.041
 spe+0.041
Neg logit contributions
leaf-0.062
vell-0.058
taker-0.058
 goodness-0.057
 character-0.057
Token were
Token ID547
Feature activation-2.72e-01

Pos logit contributions
leaf+0.002
ibrarian+0.002
vell+0.002
taker+0.002
asc+0.002
Neg logit contributions
 swe-0.002
 someday-0.002
 cheaper-0.001
 spe-0.001
 smaller-0.001
Token in
Token ID287
Feature activation-2.33e+00

Pos logit contributions
 swe+0.011
 someday+0.011
 spe+0.010
 cheaper+0.010
c+0.010
Neg logit contributions
leaf-0.010
ibrarian-0.009
vell-0.009
taker-0.009
he-0.008
Token trouble
Token ID5876
Feature activation+8.83e-02

Pos logit contributions
 swe+0.087
 someday+0.081
 shin+0.080
c+0.077
 cheaper+0.076
Neg logit contributions
he-0.075
taker-0.075
 everywhere-0.074
 character-0.073
leaf-0.073
Token.
Token ID13
Feature activation-9.45e-01

Pos logit contributions
leaf+0.004
ibrarian+0.004
vell+0.004
taker+0.004
asc+0.003
Neg logit contributions
 swe-0.003
 someday-0.003
 cheaper-0.003
 spe-0.003
 smaller-0.003
Token<|endoftext|>
Token ID50256
Feature activation-2.87e-02

Pos logit contributions
 swe+0.035
 cheaper+0.033
 shin+0.032
 smaller+0.031
 spe+0.031
Neg logit contributions
leaf-0.038
ibrarian-0.037
he-0.035
taker-0.035
 character-0.034
TokenOnce
Token ID7454
Feature activation-3.04e-01

Pos logit contributions
 swe+0.001
 someday+0.001
 cheaper+0.001
 spe+0.001
 smaller+0.001
Neg logit contributions
leaf-0.001
ibrarian-0.001
vell-0.001
taker-0.001
he-0.001
Token upon
Token ID2402
Feature activation-3.81e-01

Pos logit contributions
 swe+0.011
 someday+0.011
 spe+0.010
 cheaper+0.010
 shin+0.010
Neg logit contributions
leaf-0.012
ibrarian-0.012
vell-0.011
taker-0.011
he-0.011