TOP ACTIVATIONS
MAX = 0.325

praised him for his master pieces . Jack stopped
tools to create his master pieces . <|endoftext|> Once upon a
they saw her ink master pieces . From then
had fun , sharing the snowball and some ol ives they
ha , I catch your snowball , Ben !" Lily says
snow man and had a snowball fight with her friends .
shouted . He threw a snowball at the dog . The
man together and had a snowball fight . Lily said ,
. It 's melting the snowball just like it would melt
said . He made a snowball and threw it at the
. He rolled a big snowball and put it on top
ran home , squeezing the snowball tight . When he came
and pretended to have a snowball fight with her imaginary friend
careful not to release the snowball !" They
, " Just like the snowball melted , the sun can
that you have a icy snowball in your stomach , you
. He made a big snowball and rolled it around the
made snow men and had snowball fights . They used their
. As he rolled the snowball , it got bigger and
a snow man and a snowball and a snow angel .

MIN ACTIVATIONS
MIN = -0.308

but fighting is not . Money is not the most important
someone else for money . Money is what people use to
not buy more chalk . Money was too difficult for them
laughed . They agreed . Sno opy was a good name
oral of the story : Money cannot buy true happiness .
oxygen from the tank . Hel io was thankful and thanked
" I will name him Sno opy . Because he is
was very kind and offered Hel io some of the oxygen
's weigh the options . St aying here is cheap ,
was a little girl named St acey . She was very
like to have money . Money can buy candy and ice
themselves as Hel io . Hel io explained that their oxygen
no . It 's too far away ". Maria was feeling
tank and introduced themselves as Hel io . Hel io explained
It 's a big country far away . It has many
. Sometimes he gets too far and loses his way .
careful and not swim too far . They said they were
Sm elly King and the St inky Queen . As he
train is going to a far place . We have to
It is very high and far ." They start

INTERVAL 0.167 - 0.325
CONTAINS 0.350%

little girl named Amy . Amy and Tim would go on
, they saw the burnt cookies and the smoke . They
sat down and ate the cookies while watching their favorite show
, " What is this ?" Ben says ,
warm coat and his m ittens , for the weather outside

INTERVAL 0.008 - 0.167
CONTAINS 40.895%

leave . As she walked away , she heard
them and told them what happened . " Help
yogurt was so y ummy and Sarah was thrilled with it
friendly taxi driver . After they were finished reading , they
. Amy watched them with amaz ement . She smiled ,

INTERVAL -0.150 - 0.008
CONTAINS 58.093%

miss you too , Smoke y . Bye for now ,
Her mom said yes , so the girl went to
we see your machine ?" Tim asks . "
The water soaked them from head to toe . They were
loved to visit her grandma 's house . Her grandma was

INTERVAL -0.308 - -0.150
CONTAINS 0.662%

his house . He had never seen it before , so
They reached for the or nam ents and touched them softly
went in . They saw many things that they liked :
he heard a voice coming from inside the radio .
her . With a heavy heart and a frown on her
Token praised
Token ID15342
Feature activation-3.28e-02

Pos logit contributions
pieces+0.017
 piping+0.015
"'+0.014
 snowball+0.014
THE+0.013
Neg logit contributions
 Gre-0.017
oust-0.016
 Money-0.016
hips-0.015
 St-0.015
Token him
Token ID683
Feature activation-5.42e-02

Pos logit contributions
 Gre+0.042
oust+0.042
 Money+0.039
 ante+0.039
ub+0.038
Neg logit contributions
pieces-0.040
 piping-0.035
 snowball-0.032
"'-0.032
THE-0.030
Token for
Token ID329
Feature activation+1.69e-02

Pos logit contributions
 Gre+0.081
oust+0.078
 Money+0.075
 ante+0.073
 lur+0.073
Neg logit contributions
pieces-0.057
 piping-0.046
 snowball-0.044
"'-0.043
THE-0.041
Token his
Token ID465
Feature activation+3.48e-02

Pos logit contributions
pieces+0.020
 piping+0.017
"'+0.015
 snowball+0.015
 freshly+0.015
Neg logit contributions
oust-0.023
 Gre-0.023
engu-0.022
ub-0.022
 Money-0.022
Token master
Token ID4958
Feature activation+2.78e-02

Pos logit contributions
pieces+0.042
 piping+0.039
 snowball+0.035
"'+0.035
THE+0.032
Neg logit contributions
 Gre-0.046
oust-0.044
 Money-0.043
 St-0.041
hips-0.041
Tokenpieces
Token ID34154
Feature activation+3.25e-01

Pos logit contributions
pieces+0.035
 piping+0.034
"'+0.031
 snowball+0.030
THE+0.028
Neg logit contributions
 Gre-0.034
oust-0.032
 Money-0.031
 St-0.030
hips-0.030
Token.
Token ID13
Feature activation-4.53e-03

Pos logit contributions
pieces+0.404
 piping+0.376
"'+0.341
 snowball+0.339
THE+0.314
Neg logit contributions
 Gre-0.415
oust-0.394
 Money-0.384
 St-0.367
hips-0.363
Token
Token ID198
Feature activation-8.25e-03

Pos logit contributions
 Gre+0.006
oust+0.005
 Money+0.005
hips+0.005
 St+0.005
Neg logit contributions
pieces-0.006
 piping-0.005
"'-0.005
 snowball-0.005
THE-0.005
Token
Token ID198
Feature activation-8.96e-03

Pos logit contributions
 Gre+0.009
oust+0.009
 Money+0.008
 St+0.008
hips+0.008
Neg logit contributions
pieces-0.012
 piping-0.011
 snowball-0.010
"'-0.010
THE-0.009
TokenJack
Token ID14295
Feature activation-2.71e-03

Pos logit contributions
 Gre+0.010
 Money+0.009
oust+0.009
 far+0.009
 List+0.009
Neg logit contributions
pieces-0.013
 piping-0.012
 snowball-0.011
"'-0.010
 freshly-0.010
Token stopped
Token ID5025
Feature activation-8.23e-02

Pos logit contributions
 Gre+0.004
oust+0.004
 Money+0.004
hips+0.003
 St+0.003
Neg logit contributions
pieces-0.003
 piping-0.003
"'-0.003
 snowball-0.003
THE-0.002
Token tools
Token ID4899
Feature activation-6.38e-03

Pos logit contributions
 Gre+0.117
oust+0.110
 Money+0.107
 St+0.101
hips+0.099
Neg logit contributions
pieces-0.167
 piping-0.158
"'-0.145
 snowball-0.145
THE-0.136
Token to
Token ID284
Feature activation+4.85e-03

Pos logit contributions
 Gre+0.010
oust+0.009
 Money+0.009
 appearances+0.009
 List+0.009
Neg logit contributions
pieces-0.006
 piping-0.005
"'-0.005
THE-0.005
 snowball-0.005
Token create
Token ID2251
Feature activation+8.30e-02

Pos logit contributions
pieces+0.006
 piping+0.005
"'+0.005
 snowball+0.004
THE+0.004
Neg logit contributions
 Gre-0.007
oust-0.006
 Money-0.006
 St-0.006
hips-0.006
Token his
Token ID465
Feature activation+3.26e-02

Pos logit contributions
pieces+0.096
 piping+0.081
 snowball+0.075
"'+0.074
THE+0.066
Neg logit contributions
 Gre-0.118
oust-0.116
 Money-0.110
ho-0.108
ugg-0.108
Token master
Token ID4958
Feature activation+2.59e-02

Pos logit contributions
pieces+0.033
 piping+0.031
 snowball+0.027
"'+0.027
THE+0.024
Neg logit contributions
 Gre-0.049
oust-0.047
 Money-0.046
 St-0.044
hips-0.044
Tokenpieces
Token ID34154
Feature activation+3.24e-01

Pos logit contributions
 piping+0.033
 snowball+0.032
"'+0.030
THE+0.028
pieces+0.028
Neg logit contributions
 well-0.029
 Gre-0.029
 far-0.028
oust-0.027
 Money-0.027
Token.
Token ID13
Feature activation-3.04e-03

Pos logit contributions
 piping+0.457
pieces+0.447
 snowball+0.424
"'+0.396
 freshly+0.369
Neg logit contributions
 Gre-0.343
oust-0.323
 Money-0.323
 well-0.315
 far-0.308
Token<|endoftext|>
Token ID50256
Feature activation+2.41e-04

Pos logit contributions
 Gre+0.004
oust+0.004
 Money+0.003
 St+0.003
 Spark+0.003
Neg logit contributions
pieces-0.004
 piping-0.004
"'-0.003
 snowball-0.003
THE-0.003
TokenOnce
Token ID7454
Feature activation+2.26e-02

Pos logit contributions
pieces+0.000
 piping+0.000
 snowball+0.000
"'+0.000
THE+0.000
Neg logit contributions
 Gre-0.000
oust-0.000
 Money-0.000
hips-0.000
 St-0.000
Token upon
Token ID2402
Feature activation-2.34e-03

Pos logit contributions
pieces+0.035
 piping+0.032
 snowball+0.030
"'+0.029
 freshly+0.027
Neg logit contributions
 Gre-0.024
oust-0.021
 far-0.021
 Money-0.021
ret-0.021
Token a
Token ID257
Feature activation-1.37e-02

Pos logit contributions
 far+0.002
 well+0.002
 Gre+0.002
 St+0.001
 Spark+0.001
Neg logit contributions
pieces-0.005
 piping-0.004
"'-0.004
ries-0.004
affles-0.003
Token they
Token ID484
Feature activation+3.94e-02

Pos logit contributions
 Gre+0.089
oust+0.087
 Money+0.084
hips+0.082
ub+0.081
Neg logit contributions
pieces-0.060
 piping-0.053
"'-0.048
 snowball-0.046
THE-0.043
Token saw
Token ID2497
Feature activation+6.31e-02

Pos logit contributions
pieces+0.038
 piping+0.032
"'+0.030
THE+0.028
?",+0.027
Neg logit contributions
 Gre-0.062
 Money-0.060
oust-0.060
 appearances-0.059
 List-0.057
Token her
Token ID607
Feature activation+4.71e-02

Pos logit contributions
pieces+0.063
 piping+0.052
 freshly+0.042
"'+0.042
 snowball+0.040
Neg logit contributions
oust-0.099
 Gre-0.098
 Money-0.096
ark-0.095
 ante-0.094
Token ink
Token ID16882
Feature activation-8.70e-02

Pos logit contributions
pieces+0.059
 piping+0.058
 snowball+0.053
"'+0.052
THE+0.048
Neg logit contributions
 Gre-0.059
oust-0.055
 Money-0.054
 St-0.052
 greed-0.051
Token master
Token ID4958
Feature activation+2.35e-02

Pos logit contributions
 Gre+0.117
oust+0.111
 Money+0.108
 St+0.103
 Hel+0.102
Neg logit contributions
pieces-0.104
 piping-0.095
"'-0.087
 snowball-0.086
THE-0.080
Tokenpieces
Token ID34154
Feature activation+3.22e-01

Pos logit contributions
pieces+0.028
 piping+0.028
 snowball+0.025
"'+0.025
THE+0.023
Neg logit contributions
 Gre-0.030
oust-0.028
 Money-0.027
 St-0.026
hips-0.026
Token.
Token ID13
Feature activation-6.79e-03

Pos logit contributions
pieces+0.386
 piping+0.355
"'+0.323
 snowball+0.318
THE+0.298
Neg logit contributions
 Gre-0.426
oust-0.405
 Money-0.395
 St-0.380
hips-0.373
Token
Token ID198
Feature activation-9.21e-03

Pos logit contributions
oust+0.009
 Gre+0.009
 Money+0.008
hips+0.008
 lur+0.008
Neg logit contributions
pieces-0.008
 piping-0.007
"'-0.007
 snowball-0.006
THE-0.006
Token
Token ID198
Feature activation-9.79e-03

Pos logit contributions
 Gre+0.008
 Money+0.006
 far+0.006
 Spark+0.006
 St+0.006
Neg logit contributions
pieces-0.016
 piping-0.015
 snowball-0.014
"'-0.013
 freshly-0.013
TokenFrom
Token ID4863
Feature activation-3.46e-02

Pos logit contributions
 Gre+0.011
 Money+0.010
 far+0.010
ark+0.009
 List+0.009
Neg logit contributions
 piping-0.014
pieces-0.013
 snowball-0.013
 freshly-0.011
"'-0.011
Token then
Token ID788
Feature activation+1.53e-01

Pos logit contributions
 Gre+0.047
oust+0.044
 Money+0.043
 St+0.042
 far+0.042
Neg logit contributions
pieces-0.041
 piping-0.038
"'-0.034
 snowball-0.034
THE-0.031
Token had
Token ID550
Feature activation+5.58e-02

Pos logit contributions
pieces+0.011
 piping+0.009
 snowball+0.008
"'+0.008
THE+0.008
Neg logit contributions
oust-0.022
 Gre-0.022
 Money-0.022
hips-0.021
 appearances-0.020
Token fun
Token ID1257
Feature activation+2.27e-02

Pos logit contributions
pieces+0.072
 piping+0.067
 snowball+0.061
"'+0.061
THE+0.056
Neg logit contributions
 Gre-0.069
oust-0.067
 Money-0.064
hips-0.062
 St-0.061
Token,
Token ID11
Feature activation-2.43e-02

Pos logit contributions
 piping+0.035
pieces+0.034
"'+0.031
 snowball+0.030
 freshly+0.028
Neg logit contributions
 Gre-0.021
 well-0.021
 far-0.020
oust-0.019
 greed-0.019
Token sharing
Token ID7373
Feature activation-1.70e-01

Pos logit contributions
 Gre+0.037
oust+0.036
 Money+0.036
engu+0.034
hips+0.034
Neg logit contributions
pieces-0.024
 piping-0.021
 snowball-0.021
"'-0.018
THE-0.018
Token the
Token ID262
Feature activation+1.88e-02

Pos logit contributions
 Gre+0.226
oust+0.215
 Money+0.210
 St+0.202
hips+0.199
Neg logit contributions
pieces-0.204
 piping-0.191
"'-0.172
 snowball-0.171
THE-0.156
Token snowball
Token ID46275
Feature activation+2.92e-01

Pos logit contributions
pieces+0.021
 piping+0.020
 snowball+0.019
"'+0.017
THE+0.016
Neg logit contributions
 Gre-0.026
oust-0.025
 Money-0.025
 St-0.023
ub-0.023
Token and
Token ID290
Feature activation+1.86e-02

Pos logit contributions
pieces+0.334
 piping+0.302
 snowball+0.288
"'+0.272
THE+0.254
Neg logit contributions
 Gre-0.408
 Money-0.385
oust-0.385
 St-0.359
 List-0.354
Token some
Token ID617
Feature activation+3.72e-02

Pos logit contributions
pieces+0.021
 piping+0.019
 snowball+0.018
"'+0.017
THE+0.016
Neg logit contributions
 Gre-0.026
oust-0.025
 Money-0.025
hips-0.023
ub-0.023
Token ol
Token ID25776
Feature activation-6.84e-04

Pos logit contributions
pieces+0.045
 piping+0.042
"'+0.039
 snowball+0.037
THE+0.035
Neg logit contributions
 Gre-0.048
oust-0.046
 Money-0.044
 St-0.043
 far-0.043
Tokenives
Token ID1083
Feature activation-8.57e-02

Pos logit contributions
 Gre+0.001
 Money+0.001
oust+0.001
 appearances+0.001
 List+0.001
Neg logit contributions
pieces-0.001
 piping-0.001
 snowball-0.001
"'-0.001
THE-0.001
Token they
Token ID484
Feature activation+4.90e-02

Pos logit contributions
 Gre+0.133
oust+0.128
 Money+0.127
 List+0.122
 appearances+0.122
Neg logit contributions
pieces-0.084
 piping-0.071
 snowball-0.067
"'-0.065
THE-0.064
Token ha
Token ID387
Feature activation-3.32e-02

Pos logit contributions
pieces+0.138
 piping+0.131
THE+0.116
 snowball+0.115
"'+0.114
Neg logit contributions
 Gre-0.075
oust-0.068
 well-0.065
 St-0.064
 lur-0.064
Token,
Token ID11
Feature activation-2.51e-02

Pos logit contributions
 Gre+0.039
oust+0.038
 Money+0.036
 St+0.035
ub+0.034
Neg logit contributions
pieces-0.044
 piping-0.042
 snowball-0.038
"'-0.038
THE-0.035
Token I
Token ID314
Feature activation-2.59e-02

Pos logit contributions
 Gre+0.031
oust+0.031
 Money+0.030
hips+0.028
 List+0.028
Neg logit contributions
pieces-0.031
 piping-0.029
 snowball-0.027
"'-0.027
THE-0.023
Token catch
Token ID4929
Feature activation+5.68e-02

Pos logit contributions
 Gre+0.037
oust+0.035
 Money+0.035
 St+0.033
hips+0.033
Neg logit contributions
pieces-0.028
 piping-0.026
 snowball-0.024
"'-0.024
THE-0.021
Token your
Token ID534
Feature activation+3.40e-03

Pos logit contributions
pieces+0.079
 piping+0.075
"'+0.069
 snowball+0.068
THE+0.064
Neg logit contributions
 Gre-0.064
oust-0.060
 Money-0.058
 St-0.056
hips-0.055
Token snowball
Token ID46275
Feature activation+2.90e-01

Pos logit contributions
pieces+0.004
 piping+0.004
"'+0.004
THE+0.004
 snowball+0.003
Neg logit contributions
 Gre-0.004
oust-0.004
 well-0.004
 St-0.004
 far-0.004
Token,
Token ID11
Feature activation-2.38e-02

Pos logit contributions
pieces+0.315
 piping+0.292
"'+0.248
THE+0.227
 snowball+0.226
Neg logit contributions
 Gre-0.411
oust-0.407
 well-0.395
 far-0.395
 greed-0.387
Token Ben
Token ID3932
Feature activation-1.35e-01

Pos logit contributions
 Gre+0.025
 far+0.024
oust+0.023
 St+0.023
hips+0.023
Neg logit contributions
pieces-0.034
 piping-0.033
"'-0.029
 snowball-0.027
THE-0.027
Token!"
Token ID2474
Feature activation+1.18e-01

Pos logit contributions
 Gre+0.205
oust+0.196
 Money+0.192
 St+0.187
hips+0.183
Neg logit contributions
pieces-0.136
 piping-0.125
"'-0.108
 snowball-0.107
THE-0.097
Token Lily
Token ID20037
Feature activation-8.38e-03

Pos logit contributions
pieces+0.159
 piping+0.151
 snowball+0.140
"'+0.140
THE+0.125
Neg logit contributions
 Gre-0.133
oust-0.130
 Money-0.125
hips-0.121
 appearances-0.117
Token says
Token ID1139
Feature activation+2.55e-02

Pos logit contributions
 Gre+0.011
oust+0.010
 Money+0.010
 St+0.010
hips+0.010
Neg logit contributions
pieces-0.010
 piping-0.010
"'-0.009
 snowball-0.009
THE-0.008
Token snow
Token ID6729
Feature activation+4.20e-03

Pos logit contributions
 Gre+0.011
oust+0.011
 Money+0.011
 St+0.010
hips+0.010
Neg logit contributions
pieces-0.011
 piping-0.010
 snowball-0.009
"'-0.009
THE-0.008
Tokenman
Token ID805
Feature activation+5.53e-03

Pos logit contributions
pieces+0.007
 piping+0.006
"'+0.006
 freshly+0.005
THE+0.005
Neg logit contributions
oust-0.004
 Gre-0.004
hips-0.004
 far-0.004
 well-0.004
Token and
Token ID290
Feature activation+1.68e-02

Pos logit contributions
pieces+0.008
 piping+0.007
"'+0.007
 snowball+0.007
THE+0.006
Neg logit contributions
 Gre-0.006
oust-0.006
 Money-0.005
 St-0.005
 far-0.005
Token had
Token ID550
Feature activation+5.56e-02

Pos logit contributions
pieces+0.023
 piping+0.021
"'+0.020
THE+0.019
 snowball+0.018
Neg logit contributions
 Gre-0.019
oust-0.018
 St-0.018
 Money-0.017
 Ho-0.017
Token a
Token ID257
Feature activation-1.03e-02

Pos logit contributions
pieces+0.087
 piping+0.078
"'+0.077
 snowball+0.069
THE+0.068
Neg logit contributions
 Gre-0.053
 far-0.049
oust-0.049
 Money-0.048
 St-0.048
Token snowball
Token ID46275
Feature activation+2.88e-01

Pos logit contributions
 Gre+0.013
oust+0.013
 far+0.013
 well+0.012
 Money+0.012
Neg logit contributions
pieces-0.013
"'-0.011
 piping-0.011
THE-0.010
cause-0.009
Token fight
Token ID1907
Feature activation+7.41e-02

Pos logit contributions
pieces+0.375
 piping+0.351
"'+0.319
 snowball+0.305
THE+0.293
Neg logit contributions
 Gre-0.353
oust-0.343
 Money-0.325
 St-0.316
hips-0.316
Token with
Token ID351
Feature activation-1.03e-01

Pos logit contributions
pieces+0.059
 snowball+0.056
"'+0.050
 piping+0.043
THE+0.041
Neg logit contributions
 Gre-0.121
oust-0.119
 Money-0.117
ugg-0.115
 List-0.115
Token her
Token ID607
Feature activation+5.09e-02

Pos logit contributions
 Gre+0.157
 Money+0.152
oust+0.151
 List+0.148
ub+0.147
Neg logit contributions
pieces-0.098
 piping-0.090
 snowball-0.089
"'-0.075
THE-0.069
Token friends
Token ID2460
Feature activation+6.38e-02

Pos logit contributions
pieces+0.080
 piping+0.077
"'+0.072
 freshly+0.065
THE+0.063
Neg logit contributions
 Gre-0.047
oust-0.046
 St-0.045
 Ho-0.045
 well-0.045
Token.
Token ID13
Feature activation+9.32e-03

Pos logit contributions
pieces+0.080
 piping+0.074
"'+0.068
 snowball+0.067
THE+0.062
Neg logit contributions
 Gre-0.081
oust-0.077
 Money-0.075
 St-0.072
hips-0.071
Token shouted
Token ID17293
Feature activation-3.42e-02

Pos logit contributions
pieces+0.004
 piping+0.004
"'+0.003
 snowball+0.003
THE+0.003
Neg logit contributions
 Gre-0.004
oust-0.004
 Money-0.004
 St-0.004
hips-0.004
Token.
Token ID13
Feature activation-1.27e-02

Pos logit contributions
 Gre+0.052
oust+0.050
 Money+0.050
 appearances+0.048
hips+0.047
Neg logit contributions
pieces-0.033
 piping-0.029
"'-0.028
 snowball-0.028
THE-0.023
Token He
Token ID679
Feature activation+2.92e-03

Pos logit contributions
 Gre+0.016
oust+0.015
 Money+0.015
 St+0.014
hips+0.014
Neg logit contributions
pieces-0.016
 piping-0.015
"'-0.014
 snowball-0.014
THE-0.013
Token threw
Token ID9617
Feature activation+3.76e-02

Pos logit contributions
pieces+0.003
 piping+0.003
 snowball+0.003
"'+0.003
THE+0.002
Neg logit contributions
 Gre-0.004
oust-0.004
 Money-0.004
hips-0.004
 appearances-0.004
Token a
Token ID257
Feature activation-1.27e-02

Pos logit contributions
pieces+0.040
 piping+0.036
 snowball+0.035
"'+0.032
THE+0.029
Neg logit contributions
 Gre-0.055
oust-0.054
 Money-0.053
hips-0.050
 appearances-0.050
Token snowball
Token ID46275
Feature activation+2.88e-01

Pos logit contributions
 Gre+0.017
oust+0.016
 Money+0.016
 St+0.015
hips+0.015
Neg logit contributions
pieces-0.015
 piping-0.014
 snowball-0.013
"'-0.012
THE-0.011
Token at
Token ID379
Feature activation+3.72e-02

Pos logit contributions
pieces+0.303
 piping+0.281
 snowball+0.268
"'+0.257
THE+0.231
Neg logit contributions
 Gre-0.415
oust-0.394
 Money-0.391
 St-0.368
 appearances-0.365
Token the
Token ID262
Feature activation+1.22e-02

Pos logit contributions
pieces+0.053
 piping+0.049
"'+0.045
 snowball+0.042
 freshly+0.040
Neg logit contributions
 Gre-0.043
oust-0.040
 far-0.039
 Money-0.038
 St-0.038
Token dog
Token ID3290
Feature activation+4.48e-02

Pos logit contributions
pieces+0.017
 piping+0.016
"'+0.015
 snowball+0.014
THE+0.013
Neg logit contributions
 Gre-0.014
oust-0.013
 Money-0.013
 St-0.012
 well-0.012
Token.
Token ID13
Feature activation-9.05e-04

Pos logit contributions
pieces+0.050
 piping+0.043
 snowball+0.042
"'+0.040
THE+0.037
Neg logit contributions
 Gre-0.063
oust-0.061
 Money-0.059
 List-0.056
 Hel-0.056
Token The
Token ID383
Feature activation+5.70e-02

Pos logit contributions
 Gre+0.001
oust+0.001
 Money+0.001
hips+0.001
 appearances+0.001
Neg logit contributions
pieces-0.001
 piping-0.001
 snowball-0.001
"'-0.001
THE-0.001
Tokenman
Token ID805
Feature activation+4.80e-03

Pos logit contributions
pieces+0.011
 piping+0.010
"'+0.010
 freshly+0.008
THE+0.008
Neg logit contributions
oust-0.008
 Gre-0.007
hips-0.007
 far-0.007
 well-0.007
Token together
Token ID1978
Feature activation+4.49e-02

Pos logit contributions
pieces+0.006
 piping+0.006
"'+0.005
 snowball+0.005
THE+0.005
Neg logit contributions
 Gre-0.006
oust-0.006
 Money-0.005
 St-0.005
 far-0.005
Token and
Token ID290
Feature activation+1.45e-02

Pos logit contributions
pieces+0.058
 piping+0.055
"'+0.050
 snowball+0.049
THE+0.045
Neg logit contributions
 Gre-0.055
oust-0.052
 Money-0.051
 St-0.049
hips-0.048
Token had
Token ID550
Feature activation+5.45e-02

Pos logit contributions
pieces+0.015
 piping+0.014
 snowball+0.013
"'+0.012
THE+0.011
Neg logit contributions
 Gre-0.021
oust-0.021
 Money-0.020
hips-0.020
ub-0.019
Token a
Token ID257
Feature activation-1.22e-02

Pos logit contributions
pieces+0.083
 piping+0.077
"'+0.073
 snowball+0.068
THE+0.066
Neg logit contributions
 Gre-0.055
 Money-0.050
oust-0.049
 far-0.048
 St-0.048
Token snowball
Token ID46275
Feature activation+2.88e-01

Pos logit contributions
 Gre+0.015
oust+0.014
 far+0.014
 Money+0.013
 well+0.013
Neg logit contributions
pieces-0.016
 piping-0.014
"'-0.014
THE-0.012
cause-0.012
Token fight
Token ID1907
Feature activation+7.01e-02

Pos logit contributions
pieces+0.382
 piping+0.359
"'+0.327
 snowball+0.306
THE+0.298
Neg logit contributions
 Gre-0.342
oust-0.336
 Money-0.314
hips-0.313
 far-0.311
Token.
Token ID13
Feature activation+1.97e-03

Pos logit contributions
pieces+0.054
 snowball+0.052
"'+0.045
 piping+0.036
THE+0.035
Neg logit contributions
oust-0.119
 Gre-0.115
ugg-0.114
 Money-0.112
 List-0.111
Token Lily
Token ID20037
Feature activation-6.92e-03

Pos logit contributions
pieces+0.003
 piping+0.003
"'+0.002
 snowball+0.002
THE+0.002
Neg logit contributions
 Gre-0.002
oust-0.002
 Money-0.002
 St-0.002
hips-0.002
Token said
Token ID531
Feature activation+1.42e-02

Pos logit contributions
 Gre+0.010
oust+0.009
 Money+0.009
hips+0.009
 appearances+0.009
Neg logit contributions
pieces-0.008
 piping-0.007
"'-0.006
 snowball-0.006
THE-0.006
Token,
Token ID11
Feature activation-2.15e-02

Pos logit contributions
pieces+0.018
 piping+0.017
"'+0.015
 snowball+0.015
THE+0.014
Neg logit contributions
 Gre-0.018
oust-0.017
 Money-0.017
 St-0.016
hips-0.016
Token.
Token ID13
Feature activation-3.25e-03

Pos logit contributions
 Gre+0.032
oust+0.030
 Money+0.029
 far+0.029
 well+0.029
Neg logit contributions
pieces-0.033
 piping-0.031
"'-0.028
 snowball-0.028
THE-0.026
Token It
Token ID632
Feature activation+4.43e-02

Pos logit contributions
 Gre+0.004
oust+0.004
 Money+0.004
hips+0.004
 lur+0.004
Neg logit contributions
pieces-0.004
 piping-0.004
"'-0.003
 snowball-0.003
THE-0.003
Token's
Token ID338
Feature activation+2.32e-03

Pos logit contributions
pieces+0.054
 piping+0.051
"'+0.046
 snowball+0.046
THE+0.042
Neg logit contributions
 Gre-0.057
oust-0.055
 Money-0.053
 St-0.051
hips-0.050
Token melting
Token ID24203
Feature activation+3.01e-02

Pos logit contributions
pieces+0.003
 piping+0.002
 snowball+0.002
"'+0.002
THE+0.002
Neg logit contributions
 Gre-0.003
oust-0.003
 Money-0.003
ub-0.003
 St-0.003
Token the
Token ID262
Feature activation+1.61e-02

Pos logit contributions
pieces+0.037
 piping+0.034
"'+0.031
 snowball+0.030
THE+0.028
Neg logit contributions
 Gre-0.039
oust-0.037
 Money-0.036
 St-0.035
 far-0.035
Token snowball
Token ID46275
Feature activation+2.84e-01

Pos logit contributions
pieces+0.019
 piping+0.018
 snowball+0.017
"'+0.015
 freshly+0.015
Neg logit contributions
 Gre-0.022
oust-0.021
 Money-0.020
ub-0.020
 St-0.020
Token just
Token ID655
Feature activation-7.72e-03

Pos logit contributions
pieces+0.338
 piping+0.315
 snowball+0.286
"'+0.285
THE+0.259
Neg logit contributions
 Gre-0.379
oust-0.360
 Money-0.352
 St-0.336
hips-0.331
Token like
Token ID588
Feature activation-1.76e-02

Pos logit contributions
 Gre+0.011
oust+0.011
 Money+0.011
ark+0.010
 appearances+0.010
Neg logit contributions
pieces-0.008
 piping-0.008
 snowball-0.007
"'-0.007
 freshly-0.007
Token it
Token ID340
Feature activation-4.85e-02

Pos logit contributions
 Gre+0.022
oust+0.022
 Money+0.021
ub+0.020
hips+0.020
Neg logit contributions
pieces-0.022
 piping-0.020
 snowball-0.019
"'-0.018
THE-0.017
Token would
Token ID561
Feature activation+2.83e-02

Pos logit contributions
 Gre+0.063
oust+0.060
 Money+0.058
 St+0.056
 far+0.055
Neg logit contributions
pieces-0.059
 piping-0.055
"'-0.049
 snowball-0.049
THE-0.045
Token melt
Token ID16867
Feature activation-3.37e-02

Pos logit contributions
pieces+0.032
 piping+0.030
"'+0.027
 snowball+0.027
THE+0.024
Neg logit contributions
 Gre-0.039
oust-0.037
 Money-0.037
 St-0.035
hips-0.034
Token said
Token ID531
Feature activation+9.11e-03

Pos logit contributions
 Gre+0.134
oust+0.129
 Money+0.127
hips+0.127
 appearances+0.122
Neg logit contributions
pieces-0.092
 piping-0.082
"'-0.078
 snowball-0.077
?",-0.065
Token.
Token ID13
Feature activation-3.83e-03

Pos logit contributions
pieces+0.009
 piping+0.009
"'+0.008
 snowball+0.008
THE+0.007
Neg logit contributions
 Gre-0.013
 Money-0.013
oust-0.013
hips-0.012
 appearances-0.012
Token He
Token ID679
Feature activation+6.04e-03

Pos logit contributions
 Gre+0.005
oust+0.005
 Money+0.005
 St+0.004
hips+0.004
Neg logit contributions
pieces-0.005
 piping-0.005
"'-0.004
 snowball-0.004
THE-0.004
Token made
Token ID925
Feature activation-5.11e-02

Pos logit contributions
pieces+0.007
 piping+0.006
"'+0.006
 snowball+0.006
THE+0.005
Neg logit contributions
 Gre-0.008
oust-0.008
 Money-0.008
hips-0.008
 appearances-0.007
Token a
Token ID257
Feature activation-1.56e-02

Pos logit contributions
 Gre+0.070
oust+0.068
 Money+0.067
hips+0.064
 appearances+0.062
Neg logit contributions
pieces-0.060
 piping-0.054
 snowball-0.053
"'-0.047
THE-0.043
Token snowball
Token ID46275
Feature activation+2.84e-01

Pos logit contributions
 Gre+0.020
oust+0.019
 Money+0.018
 St+0.018
hips+0.017
Neg logit contributions
pieces-0.020
 piping-0.018
"'-0.017
 snowball-0.016
THE-0.015
Token and
Token ID290
Feature activation+1.32e-02

Pos logit contributions
pieces+0.316
 snowball+0.293
 piping+0.287
"'+0.261
THE+0.233
Neg logit contributions
 Gre-0.400
 Money-0.375
oust-0.368
 List-0.351
 St-0.351
Token threw
Token ID9617
Feature activation+3.38e-02

Pos logit contributions
pieces+0.015
 piping+0.014
"'+0.013
 snowball+0.013
THE+0.012
Neg logit contributions
 Gre-0.018
oust-0.017
 Money-0.017
 St-0.016
hips-0.016
Token it
Token ID340
Feature activation-5.02e-02

Pos logit contributions
pieces+0.051
 piping+0.048
"'+0.044
 snowball+0.042
THE+0.041
Neg logit contributions
 Gre-0.035
oust-0.032
 Money-0.031
 St-0.030
 far-0.030
Token at
Token ID379
Feature activation+3.52e-02

Pos logit contributions
 Gre+0.085
oust+0.084
 Money+0.080
 appearances+0.078
engu+0.077
Neg logit contributions
pieces-0.039
 snowball-0.039
"'-0.033
 piping-0.031
THE-0.029
Token the
Token ID262
Feature activation+8.20e-03

Pos logit contributions
pieces+0.040
 piping+0.038
 snowball+0.036
"'+0.034
THE+0.031
Neg logit contributions
 Gre-0.047
oust-0.046
 Money-0.045
hips-0.043
ub-0.042
Token.
Token ID13
Feature activation+7.93e-03

Pos logit contributions
 well+0.002
 far+0.002
 Gre+0.002
 St+0.001
 Money+0.001
Neg logit contributions
 piping-0.003
pieces-0.003
"'-0.003
 freshly-0.003
THE-0.003
Token He
Token ID679
Feature activation+1.26e-02

Pos logit contributions
pieces+0.011
 piping+0.010
"'+0.009
 snowball+0.009
THE+0.008
Neg logit contributions
 Gre-0.010
oust-0.009
 Money-0.009
 St-0.008
 Spark-0.008
Token rolled
Token ID11686
Feature activation-3.31e-02

Pos logit contributions
pieces+0.015
 piping+0.014
 snowball+0.013
"'+0.013
THE+0.011
Neg logit contributions
 Gre-0.017
oust-0.016
 Money-0.016
hips-0.016
 appearances-0.015
Token a
Token ID257
Feature activation-7.05e-03

Pos logit contributions
 far+0.032
 Gre+0.030
 well+0.028
 St+0.027
 Ho+0.027
Neg logit contributions
pieces-0.058
 piping-0.052
"'-0.046
 freshly-0.045
affles-0.042
Token big
Token ID1263
Feature activation+7.25e-03

Pos logit contributions
 Gre+0.009
oust+0.009
 Money+0.008
 far+0.008
 St+0.008
Neg logit contributions
pieces-0.009
 piping-0.008
"'-0.007
 snowball-0.007
THE-0.007
Token snowball
Token ID46275
Feature activation+2.84e-01

Pos logit contributions
pieces+0.010
 piping+0.009
"'+0.008
THE+0.008
 freshly+0.007
Neg logit contributions
oust-0.008
 Gre-0.008
 far-0.008
 Money-0.008
 well-0.007
Token and
Token ID290
Feature activation+2.03e-02

Pos logit contributions
pieces+0.321
 piping+0.296
 snowball+0.282
"'+0.270
THE+0.245
Neg logit contributions
 Gre-0.391
oust-0.367
 Money-0.364
 St-0.344
 List-0.338
Token put
Token ID1234
Feature activation-6.28e-02

Pos logit contributions
pieces+0.026
 piping+0.024
"'+0.022
 snowball+0.021
THE+0.020
Neg logit contributions
 Gre-0.026
oust-0.024
 Money-0.023
 St-0.023
 far-0.023
Token it
Token ID340
Feature activation-4.84e-02

Pos logit contributions
 Gre+0.076
oust+0.071
 Money+0.070
 St+0.066
hips+0.065
Neg logit contributions
pieces-0.084
 piping-0.078
"'-0.072
 snowball-0.071
THE-0.066
Token on
Token ID319
Feature activation+8.31e-02

Pos logit contributions
 Gre+0.078
oust+0.075
 Money+0.073
 St+0.070
 appearances+0.069
Neg logit contributions
pieces-0.045
 piping-0.041
 snowball-0.038
"'-0.036
THE-0.033
Token top
Token ID1353
Feature activation-1.70e-02

Pos logit contributions
 snowball+0.073
 piping+0.065
pieces+0.062
"'+0.059
THE+0.052
Neg logit contributions
oust-0.132
 Gre-0.131
ark-0.127
hips-0.126
ub-0.126
Token ran
Token ID4966
Feature activation+1.61e-02

Pos logit contributions
pieces+0.015
 piping+0.015
 snowball+0.014
"'+0.013
 freshly+0.011
Neg logit contributions
 Gre-0.021
oust-0.020
 Money-0.020
hips-0.020
ark-0.019
Token home
Token ID1363
Feature activation-6.44e-02

Pos logit contributions
pieces+0.016
 piping+0.015
 snowball+0.014
"'+0.014
THE+0.013
Neg logit contributions
 Gre-0.024
oust-0.023
 Money-0.023
 appearances-0.022
 St-0.022
Token,
Token ID11
Feature activation-2.44e-02

Pos logit contributions
 Gre+0.103
oust+0.099
 Money+0.098
 appearances+0.095
Bl+0.093
Neg logit contributions
pieces-0.056
"'-0.048
 piping-0.046
 snowball-0.045
THE-0.039
Token squeezing
Token ID40237
Feature activation+1.14e-02

Pos logit contributions
oust+0.032
 Gre+0.032
hips+0.031
 Money+0.031
ret+0.031
Neg logit contributions
pieces-0.027
 piping-0.026
 snowball-0.026
"'-0.023
 freshly-0.023
Token the
Token ID262
Feature activation+1.38e-02

Pos logit contributions
 snowball+0.011
pieces+0.011
 piping+0.010
"'+0.008
 cocoa+0.008
Neg logit contributions
 Gre-0.017
oust-0.017
 Money-0.017
hips-0.017
 appearances-0.016
Token snowball
Token ID46275
Feature activation+2.82e-01

Pos logit contributions
pieces+0.016
 piping+0.015
"'+0.013
THE+0.012
 snowball+0.012
Neg logit contributions
 Gre-0.019
oust-0.018
 Money-0.017
 St-0.017
 far-0.017
Token tight
Token ID5381
Feature activation-1.54e-01

Pos logit contributions
pieces+0.297
 piping+0.268
 snowball+0.264
"'+0.241
THE+0.217
Neg logit contributions
 Gre-0.419
 Money-0.398
oust-0.394
 List-0.368
 St-0.367
Token.
Token ID13
Feature activation-2.18e-03

Pos logit contributions
 Gre+0.240
oust+0.229
 Money+0.225
 List+0.215
 appearances+0.215
Neg logit contributions
pieces-0.142
 piping-0.132
 snowball-0.126
"'-0.120
THE-0.110
Token When
Token ID1649
Feature activation+5.42e-02

Pos logit contributions
 appearances+0.003
hips+0.003
oust+0.003
 ante+0.003
 Gre+0.003
Neg logit contributions
pieces-0.002
 piping-0.002
 snowball-0.002
"'-0.002
 freshly-0.001
Token he
Token ID339
Feature activation+7.26e-04

Pos logit contributions
pieces+0.049
 piping+0.047
 snowball+0.043
"'+0.042
THE+0.040
Neg logit contributions
oust-0.083
 Gre-0.082
hips-0.080
ark-0.080
ub-0.078
Token came
Token ID1625
Feature activation-3.15e-02

Pos logit contributions
pieces+0.001
 piping+0.001
"'+0.001
 snowball+0.001
THE+0.001
Neg logit contributions
 Gre-0.001
oust-0.001
 Money-0.001
 St-0.001
 far-0.001
Token and
Token ID290
Feature activation+1.28e-02

Pos logit contributions
 Gre+0.005
oust+0.004
 Money+0.004
 St+0.004
hips+0.004
Neg logit contributions
pieces-0.005
 piping-0.005
"'-0.004
 snowball-0.004
THE-0.004
Token pretended
Token ID36766
Feature activation+1.40e-01

Pos logit contributions
pieces+0.017
 piping+0.015
"'+0.014
 snowball+0.014
THE+0.013
Neg logit contributions
 Gre-0.016
oust-0.015
 Money-0.015
 St-0.014
 Spark-0.014
Token to
Token ID284
Feature activation+1.11e-02

Pos logit contributions
pieces+0.176
 piping+0.165
 snowball+0.150
"'+0.150
THE+0.138
Neg logit contributions
 Gre-0.175
oust-0.166
 Money-0.162
 St-0.155
hips-0.153
Token have
Token ID423
Feature activation+1.03e-01

Pos logit contributions
pieces+0.012
 piping+0.011
 snowball+0.011
"'+0.010
THE+0.009
Neg logit contributions
 Gre-0.016
 Money-0.015
oust-0.015
 List-0.014
hips-0.014
Token a
Token ID257
Feature activation-1.52e-02

Pos logit contributions
pieces+0.142
 piping+0.132
"'+0.125
 snowball+0.118
THE+0.112
Neg logit contributions
 Gre-0.118
oust-0.111
 Money-0.108
 St-0.105
 Spark-0.103
Token snowball
Token ID46275
Feature activation+2.82e-01

Pos logit contributions
 Gre+0.022
oust+0.021
 Money+0.021
 St+0.020
hips+0.020
Neg logit contributions
pieces-0.016
 piping-0.015
 snowball-0.014
"'-0.013
THE-0.012
Token fight
Token ID1907
Feature activation+6.54e-02

Pos logit contributions
pieces+0.345
 piping+0.323
"'+0.292
 snowball+0.287
THE+0.266
Neg logit contributions
 Gre-0.366
oust-0.350
 Money-0.339
 St-0.326
hips-0.323
Token with
Token ID351
Feature activation-1.12e-01

Pos logit contributions
pieces+0.056
 snowball+0.052
"'+0.046
 piping+0.041
THE+0.038
Neg logit contributions
 Gre-0.106
oust-0.103
 Money-0.102
 List-0.101
ret-0.100
Token her
Token ID607
Feature activation+4.66e-02

Pos logit contributions
 Gre+0.171
 Money+0.165
oust+0.164
hips+0.160
ark+0.159
Neg logit contributions
pieces-0.111
 piping-0.102
 snowball-0.096
"'-0.083
 freshly-0.077
Token imaginary
Token ID26726
Feature activation+1.99e-01

Pos logit contributions
pieces+0.068
 piping+0.065
"'+0.061
THE+0.054
?",+0.053
Neg logit contributions
 Gre-0.048
 St-0.048
 Ho-0.048
oust-0.048
 Spark-0.047
Token friend
Token ID1545
Feature activation+6.24e-02

Pos logit contributions
pieces+0.225
 piping+0.208
 snowball+0.187
"'+0.187
THE+0.169
Neg logit contributions
 Gre-0.278
oust-0.265
 Money-0.259
 St-0.249
hips-0.246
Token careful
Token ID8161
Feature activation-6.30e-02

Pos logit contributions
pieces+0.051
 piping+0.048
 snowball+0.047
"'+0.046
THE+0.042
Neg logit contributions
 Gre-0.049
oust-0.048
 Money-0.046
hips-0.045
 appearances-0.043
Token not
Token ID407
Feature activation-5.71e-02

Pos logit contributions
 Gre+0.095
oust+0.090
 Money+0.089
 St+0.085
 Sno+0.084
Neg logit contributions
pieces-0.063
 piping-0.057
"'-0.054
 snowball-0.054
THE-0.047
Token to
Token ID284
Feature activation+1.14e-02

Pos logit contributions
oust+0.098
 Gre+0.096
 Money+0.091
 List+0.091
ret+0.090
Neg logit contributions
 snowball-0.040
pieces-0.037
"'-0.036
 gonna-0.033
?",-0.033
Token release
Token ID2650
Feature activation-1.12e-02

Pos logit contributions
 snowball+0.010
pieces+0.009
"'+0.009
?",+0.009
 piping+0.009
Neg logit contributions
 Gre-0.017
 Money-0.017
 List-0.017
oust-0.016
ret-0.016
Token the
Token ID262
Feature activation+1.12e-02

Pos logit contributions
oust+0.017
 Gre+0.017
 Money+0.017
hips+0.016
ark+0.016
Neg logit contributions
pieces-0.011
 snowball-0.010
 piping-0.009
"'-0.008
?",-0.008
Token snowball
Token ID46275
Feature activation+2.82e-01

Pos logit contributions
pieces+0.013
 piping+0.012
 snowball+0.011
"'+0.011
THE+0.010
Neg logit contributions
 Gre-0.015
oust-0.014
 Money-0.014
 St-0.014
hips-0.013
Token!"
Token ID2474
Feature activation+1.15e-01

Pos logit contributions
pieces+0.319
 piping+0.299
"'+0.265
 snowball+0.262
THE+0.240
Neg logit contributions
 Gre-0.391
oust-0.373
 Money-0.364
 St-0.351
hips-0.349
Token
Token ID220
Feature activation-4.36e-02

Pos logit contributions
pieces+0.153
 piping+0.143
 snowball+0.131
"'+0.130
THE+0.120
Neg logit contributions
 Gre-0.142
oust-0.133
 Money-0.130
 St-0.125
 Spark-0.121
Token
Token ID198
Feature activation-5.13e-03

Pos logit contributions
 Gre+0.057
oust+0.055
 Money+0.053
 appearances+0.051
 St+0.050
Neg logit contributions
pieces-0.052
 piping-0.048
"'-0.045
 snowball-0.044
THE-0.040
Token
Token ID198
Feature activation-5.60e-03

Pos logit contributions
 Gre+0.007
oust+0.006
 Money+0.006
hips+0.006
 appearances+0.006
Neg logit contributions
pieces-0.006
 piping-0.006
"'-0.005
 snowball-0.005
THE-0.005
TokenThey
Token ID2990
Feature activation+2.74e-02

Pos logit contributions
 Gre+0.008
oust+0.008
 Money+0.008
 lur+0.008
hips+0.007
Neg logit contributions
pieces-0.006
 piping-0.005
"'-0.005
THE-0.005
 snowball-0.005
Token,
Token ID11
Feature activation-2.42e-02

Pos logit contributions
pieces+0.052
 piping+0.050
"'+0.046
 snowball+0.042
 freshly+0.038
Neg logit contributions
 Gre-0.084
 Money-0.081
oust-0.081
hips-0.078
 appearances-0.077
Token "
Token ID366
Feature activation+3.50e-03

Pos logit contributions
 Gre+0.033
oust+0.031
 Money+0.030
hips+0.029
 St+0.029
Neg logit contributions
pieces-0.028
 piping-0.027
"'-0.024
 snowball-0.024
THE-0.022
TokenJust
Token ID5703
Feature activation+8.74e-02

Pos logit contributions
pieces+0.004
 piping+0.004
 snowball+0.003
"'+0.003
THE+0.003
Neg logit contributions
 Gre-0.005
oust-0.005
hips-0.005
 Money-0.004
 well-0.004
Token like
Token ID588
Feature activation-1.89e-02

Pos logit contributions
 piping+0.077
pieces+0.069
?",+0.068
 freshly+0.065
 snowball+0.065
Neg logit contributions
ark-0.139
oust-0.137
 Gre-0.135
 Money-0.131
hips-0.129
Token the
Token ID262
Feature activation+1.29e-02

Pos logit contributions
oust+0.028
 Gre+0.028
hips+0.027
ub+0.026
ark+0.026
Neg logit contributions
pieces-0.018
 piping-0.018
 snowball-0.017
 freshly-0.015
"'-0.015
Token snowball
Token ID46275
Feature activation+2.81e-01

Pos logit contributions
pieces+0.016
 piping+0.015
"'+0.014
 snowball+0.013
THE+0.013
Neg logit contributions
 Gre-0.016
oust-0.016
 Money-0.015
 St-0.014
 far-0.014
Token melted
Token ID24178
Feature activation+9.62e-03

Pos logit contributions
pieces+0.369
 piping+0.348
"'+0.316
 snowball+0.310
THE+0.295
Neg logit contributions
 Gre-0.337
oust-0.319
 Money-0.308
 St-0.300
 well-0.297
Token,
Token ID11
Feature activation-2.33e-02

Pos logit contributions
pieces+0.011
 piping+0.010
 snowball+0.009
"'+0.009
THE+0.008
Neg logit contributions
 Gre-0.014
oust-0.013
 Money-0.013
 St-0.012
hips-0.012
Token the
Token ID262
Feature activation+1.34e-02

Pos logit contributions
 Gre+0.031
oust+0.031
 Money+0.029
hips+0.028
 St+0.028
Neg logit contributions
pieces-0.027
 piping-0.025
 snowball-0.023
"'-0.023
THE-0.021
Token sun
Token ID4252
Feature activation+1.14e-02

Pos logit contributions
pieces+0.017
 piping+0.015
"'+0.015
THE+0.013
cause+0.012
Neg logit contributions
 Gre-0.016
oust-0.016
 far-0.015
 Money-0.015
 well-0.015
Token can
Token ID460
Feature activation+4.94e-03

Pos logit contributions
pieces+0.016
 piping+0.015
"'+0.014
 snowball+0.013
THE+0.013
Neg logit contributions
 Gre-0.012
oust-0.012
 Money-0.011
 St-0.011
 well-0.011
Token that
Token ID326
Feature activation-3.41e-04

Pos logit contributions
pieces+0.043
 piping+0.039
"'+0.037
+0.034
THE+0.034
Neg logit contributions
oust-0.073
 Gre-0.069
 appearances-0.068
uzz-0.068
 ante-0.067
Token you
Token ID345
Feature activation-6.45e-02

Pos logit contributions
oust+0.001
uzz+0.001
hips+0.001
 Gre+0.001
 Money+0.001
Neg logit contributions
pieces-0.000
 piping-0.000
 snowball-0.000
"'-0.000
?",-0.000
Token have
Token ID423
Feature activation+1.04e-01

Pos logit contributions
 Gre+0.099
oust+0.095
 Money+0.094
 appearances+0.092
 St+0.090
Neg logit contributions
pieces-0.063
 piping-0.057
 snowball-0.051
"'-0.051
THE-0.047
Token a
Token ID257
Feature activation-1.54e-02

Pos logit contributions
pieces+0.126
 piping+0.118
"'+0.107
 snowball+0.106
THE+0.098
Neg logit contributions
 Gre-0.137
oust-0.132
 Money-0.127
hips-0.123
 St-0.121
Token icy
Token ID30284
Feature activation+1.49e-02

Pos logit contributions
 Gre+0.021
oust+0.020
 Money+0.020
hips+0.019
 St+0.019
Neg logit contributions
pieces-0.018
 piping-0.017
 snowball-0.015
"'-0.015
THE-0.014
Token snowball
Token ID46275
Feature activation+2.81e-01

Pos logit contributions
pieces+0.015
 piping+0.015
 snowball+0.014
"'+0.013
THE+0.012
Neg logit contributions
 Gre-0.022
 Money-0.021
oust-0.021
hips-0.020
 St-0.020
Token in
Token ID287
Feature activation+1.13e-02

Pos logit contributions
pieces+0.329
 piping+0.306
 snowball+0.276
"'+0.275
THE+0.251
Neg logit contributions
 Gre-0.379
oust-0.360
 Money-0.352
 St-0.337
hips-0.333
Token your
Token ID534
Feature activation+3.58e-03

Pos logit contributions
pieces+0.012
 piping+0.012
 snowball+0.010
"'+0.010
THE+0.009
Neg logit contributions
 Gre-0.016
oust-0.016
 Money-0.015
hips-0.015
 St-0.014
Token stomach
Token ID11384
Feature activation-7.83e-02

Pos logit contributions
pieces+0.005
 piping+0.004
"'+0.004
THE+0.004
 snowball+0.004
Neg logit contributions
 Gre-0.004
oust-0.004
 Money-0.004
 St-0.004
 far-0.004
Token,
Token ID11
Feature activation-2.92e-02

Pos logit contributions
 Gre+0.110
oust+0.105
 Money+0.103
 St+0.098
hips+0.098
Neg logit contributions
pieces-0.087
 piping-0.081
"'-0.073
 snowball-0.073
THE-0.066
Token you
Token ID345
Feature activation-6.45e-02

Pos logit contributions
 Gre+0.040
oust+0.039
 Money+0.038
hips+0.037
 appearances+0.036
Neg logit contributions
pieces-0.033
 piping-0.030
 snowball-0.028
"'-0.027
THE-0.025
Token.
Token ID13
Feature activation+3.74e-04

Pos logit contributions
pieces+0.025
 piping+0.020
 snowball+0.019
"'+0.019
THE+0.018
Neg logit contributions
 Gre-0.067
oust-0.065
 appearances-0.065
 ante-0.065
 List-0.064
Token He
Token ID679
Feature activation+8.96e-03

Pos logit contributions
pieces+0.000
 snowball+0.000
"'+0.000
 piping+0.000
 Either+0.000
Neg logit contributions
oust-0.001
 Gre-0.001
hips-0.001
ret-0.001
 appearances-0.001
Token made
Token ID925
Feature activation-5.09e-02

Pos logit contributions
 snowball+0.008
pieces+0.008
 piping+0.008
"'+0.007
 apr+0.006
Neg logit contributions
oust-0.013
 Gre-0.013
hips-0.013
 appearances-0.013
 Money-0.012
Token a
Token ID257
Feature activation-9.62e-03

Pos logit contributions
 Gre+0.075
oust+0.074
 Money+0.072
ark+0.070
 appearances+0.070
Neg logit contributions
pieces-0.052
 snowball-0.052
 piping-0.047
"'-0.039
THE-0.039
Token big
Token ID1263
Feature activation+6.72e-03

Pos logit contributions
 Gre+0.012
oust+0.012
 Money+0.012
 St+0.011
hips+0.011
Neg logit contributions
pieces-0.012
 piping-0.011
 snowball-0.010
"'-0.010
THE-0.009
Token snowball
Token ID46275
Feature activation+2.80e-01

Pos logit contributions
 snowball+0.007
 piping+0.006
pieces+0.006
 freshly+0.005
"'+0.005
Neg logit contributions
 Gre-0.010
 Money-0.010
ark-0.010
uzz-0.009
hips-0.009
Token and
Token ID290
Feature activation+1.63e-02

Pos logit contributions
pieces+0.256
 snowball+0.253
 piping+0.218
"'+0.188
tail+0.179
Neg logit contributions
 Gre-0.446
 Money-0.420
 List-0.408
oust-0.407
 St-0.392
Token rolled
Token ID11686
Feature activation-3.75e-02

Pos logit contributions
pieces+0.025
 piping+0.023
"'+0.021
THE+0.020
rd+0.019
Neg logit contributions
 Gre-0.016
 far-0.016
 St-0.015
 Spark-0.015
 well-0.015
Token it
Token ID340
Feature activation-5.01e-02

Pos logit contributions
 Gre+0.052
oust+0.051
 appearances+0.049
 Money+0.048
hips+0.047
Neg logit contributions
pieces-0.040
 snowball-0.040
 piping-0.037
"'-0.036
THE-0.034
Token around
Token ID1088
Feature activation-9.35e-04

Pos logit contributions
oust+0.090
 Gre+0.089
 appearances+0.086
engu+0.083
 Money+0.083
Neg logit contributions
 snowball-0.037
pieces-0.028
THE-0.025
"'-0.025
 back-0.024
Token the
Token ID262
Feature activation+1.43e-02

Pos logit contributions
 Gre+0.001
oust+0.001
 Money+0.001
hips+0.001
engu+0.001
Neg logit contributions
pieces-0.001
 snowball-0.001
 piping-0.001
"'-0.001
THE-0.001
Token made
Token ID925
Feature activation-4.88e-02

Pos logit contributions
pieces+0.000
 piping+0.000
 snowball+0.000
"'+0.000
THE+0.000
Neg logit contributions
 Gre-0.000
oust-0.000
 Money-0.000
 appearances-0.000
hips-0.000
Token snow
Token ID6729
Feature activation-2.89e-04

Pos logit contributions
 Gre+0.062
oust+0.059
 Money+0.058
 St+0.055
hips+0.054
Neg logit contributions
pieces-0.061
 piping-0.057
 snowball-0.052
"'-0.052
THE-0.048
Tokenmen
Token ID3653
Feature activation-2.38e-02

Pos logit contributions
 Gre+0.000
oust+0.000
 Money+0.000
 St+0.000
hips+0.000
Neg logit contributions
pieces-0.000
 piping-0.000
"'-0.000
 snowball-0.000
THE-0.000
Token and
Token ID290
Feature activation+1.81e-02

Pos logit contributions
 Gre+0.035
oust+0.033
 Money+0.032
 St+0.031
 appearances+0.031
Neg logit contributions
pieces-0.026
 piping-0.024
 snowball-0.022
"'-0.021
THE-0.019
Token had
Token ID550
Feature activation+5.63e-02

Pos logit contributions
pieces+0.024
 piping+0.022
"'+0.020
 snowball+0.019
THE+0.018
Neg logit contributions
 Gre-0.022
oust-0.021
 Money-0.020
 St-0.020
 far-0.019
Token snowball
Token ID46275
Feature activation+2.80e-01

Pos logit contributions
pieces+0.089
 piping+0.081
"'+0.078
THE+0.068
 snowball+0.068
Neg logit contributions
 Gre-0.054
 far-0.052
 well-0.049
 St-0.049
oust-0.048
Token fights
Token ID11418
Feature activation-1.25e-02

Pos logit contributions
pieces+0.330
 piping+0.307
 snowball+0.292
"'+0.275
THE+0.255
Neg logit contributions
 Gre-0.379
 Money-0.353
oust-0.352
 St-0.331
 List-0.325
Token.
Token ID13
Feature activation+7.64e-04

Pos logit contributions
 Gre+0.022
oust+0.021
 Money+0.021
ret+0.021
ub+0.020
Neg logit contributions
 snowball-0.010
pieces-0.009
THE-0.007
 cocoa-0.006
"'-0.006
Token They
Token ID1119
Feature activation-1.34e-04

Pos logit contributions
pieces+0.001
 piping+0.001
"'+0.001
 snowball+0.001
THE+0.001
Neg logit contributions
 Gre-0.001
 Money-0.001
oust-0.001
 St-0.001
 Spark-0.001
Token used
Token ID973
Feature activation-7.55e-02

Pos logit contributions
 Gre+0.000
oust+0.000
 Money+0.000
 St+0.000
hips+0.000
Neg logit contributions
pieces-0.000
 piping-0.000
 snowball-0.000
"'-0.000
THE-0.000
Token their
Token ID511
Feature activation+6.93e-02

Pos logit contributions
 Gre+0.103
oust+0.099
 Money+0.097
 St+0.091
hips+0.091
Neg logit contributions
pieces-0.087
 piping-0.082
 snowball-0.076
"'-0.072
THE-0.066
Token.
Token ID13
Feature activation+5.14e-05

Pos logit contributions
pieces+0.021
 snowball+0.016
"'+0.013
 then+0.011
THE+0.011
Neg logit contributions
 Gre-0.091
oust-0.087
ret-0.087
 appearances-0.087
engu-0.087
Token As
Token ID1081
Feature activation-1.15e-02

Pos logit contributions
pieces+0.000
 piping+0.000
 snowball+0.000
"'+0.000
 freshly+0.000
Neg logit contributions
 Gre-0.000
oust-0.000
 appearances-0.000
hips-0.000
 Money-0.000
Token he
Token ID339
Feature activation-3.53e-04

Pos logit contributions
 Gre+0.018
oust+0.018
hips+0.018
 Money+0.017
 appearances+0.017
Neg logit contributions
 snowball-0.010
pieces-0.009
 piping-0.009
THE-0.009
"'-0.008
Token rolled
Token ID11686
Feature activation-3.79e-02

Pos logit contributions
 Gre+0.000
oust+0.000
 Money+0.000
 St+0.000
hips+0.000
Neg logit contributions
pieces-0.000
 piping-0.000
 snowball-0.000
"'-0.000
THE-0.000
Token the
Token ID262
Feature activation+1.42e-02

Pos logit contributions
 Gre+0.051
oust+0.049
 Money+0.047
 St+0.045
hips+0.045
Neg logit contributions
pieces-0.045
 piping-0.042
 snowball-0.038
"'-0.038
THE-0.035
Token snowball
Token ID46275
Feature activation+2.79e-01

Pos logit contributions
pieces+0.022
 piping+0.019
"'+0.019
tail+0.016
?",+0.016
Neg logit contributions
 Gre-0.015
oust-0.014
 far-0.014
 Money-0.014
 well-0.013
Token,
Token ID11
Feature activation-2.22e-02

Pos logit contributions
 snowball+0.187
pieces+0.165
"'+0.120
 piping+0.116
tail+0.114
Neg logit contributions
 Gre-0.495
 Money-0.483
oust-0.478
 List-0.464
 appearances-0.463
Token it
Token ID340
Feature activation-5.04e-02

Pos logit contributions
 Gre+0.030
oust+0.029
 Money+0.028
hips+0.028
ub+0.028
Neg logit contributions
pieces-0.025
 piping-0.023
 snowball-0.022
"'-0.021
THE-0.020
Token got
Token ID1392
Feature activation+1.07e-02

Pos logit contributions
 Gre+0.067
oust+0.064
 Money+0.063
 St+0.060
hips+0.059
Neg logit contributions
pieces-0.060
 piping-0.056
 snowball-0.050
"'-0.050
THE-0.046
Token bigger
Token ID5749
Feature activation-2.97e-02

Pos logit contributions
 snowball+0.011
pieces+0.010
 piping+0.010
"'+0.009
THE+0.009
Neg logit contributions
 Gre-0.015
oust-0.015
 appearances-0.015
ark-0.015
 Money-0.014
Token and
Token ID290
Feature activation+1.55e-02

Pos logit contributions
 Gre+0.045
oust+0.042
 Money+0.041
hips+0.040
 St+0.040
Neg logit contributions
pieces-0.030
 piping-0.028
 snowball-0.027
"'-0.025
THE-0.023
Token a
Token ID257
Feature activation-2.11e-02

Pos logit contributions
 Gre+0.076
oust+0.073
 Money+0.072
hips+0.067
 St+0.067
Neg logit contributions
pieces-0.067
 piping-0.062
 snowball-0.060
"'-0.054
THE-0.051
Token snow
Token ID6729
Feature activation-5.63e-03

Pos logit contributions
 Gre+0.027
oust+0.026
 Money+0.025
 St+0.024
hips+0.024
Neg logit contributions
pieces-0.026
 piping-0.025
 snowball-0.022
"'-0.022
THE-0.021
Tokenman
Token ID805
Feature activation-7.06e-03

Pos logit contributions
 Gre+0.006
oust+0.006
 Money+0.006
hips+0.006
 St+0.006
Neg logit contributions
pieces-0.008
 piping-0.007
"'-0.007
 snowball-0.006
THE-0.006
Token and
Token ID290
Feature activation+5.03e-03

Pos logit contributions
 Gre+0.008
oust+0.008
 Money+0.007
 St+0.007
hips+0.007
Neg logit contributions
pieces-0.010
 piping-0.009
 snowball-0.008
"'-0.008
THE-0.008
Token a
Token ID257
Feature activation-2.13e-02

Pos logit contributions
pieces+0.008
 piping+0.007
"'+0.007
THE+0.006
 snowball+0.006
Neg logit contributions
 Gre-0.005
 St-0.005
oust-0.005
 Ho-0.005
 Spark-0.005
Token snowball
Token ID46275
Feature activation+2.79e-01

Pos logit contributions
 Gre+0.028
oust+0.027
 Money+0.026
 St+0.025
hips+0.025
Neg logit contributions
pieces-0.025
 piping-0.023
"'-0.021
 snowball-0.021
THE-0.019
Token and
Token ID290
Feature activation+5.64e-03

Pos logit contributions
pieces+0.347
 piping+0.326
 snowball+0.306
"'+0.295
THE+0.272
Neg logit contributions
 Gre-0.357
oust-0.333
 Money-0.330
 St-0.311
hips-0.305
Token a
Token ID257
Feature activation-2.10e-02

Pos logit contributions
pieces+0.009
"'+0.008
 piping+0.008
THE+0.008
+0.007
Neg logit contributions
 Gre-0.005
 far-0.005
 St-0.005
 Spark-0.005
 well-0.004
Token snow
Token ID6729
Feature activation-6.18e-03

Pos logit contributions
 Gre+0.027
oust+0.026
 Money+0.025
 St+0.024
 far+0.024
Neg logit contributions
pieces-0.026
 piping-0.023
"'-0.022
 snowball-0.020
THE-0.019
Token angel
Token ID18304
Feature activation+5.66e-02

Pos logit contributions
 Gre+0.007
oust+0.007
 Money+0.006
hips+0.006
 St+0.006
Neg logit contributions
pieces-0.009
 piping-0.008
"'-0.007
 snowball-0.007
THE-0.007
Token.
Token ID13
Feature activation-1.01e-02

Pos logit contributions
pieces+0.095
"'+0.093
 piping+0.092
 Deal+0.084
 ""+0.082
Neg logit contributions
oust-0.042
 well-0.041
 far-0.039
 Spark-0.039
ub-0.039
Token but
Token ID475
Feature activation+4.53e-02

Pos logit contributions
 Gre+0.043
oust+0.040
 Money+0.039
 St+0.038
hips+0.036
Neg logit contributions
pieces-0.049
 piping-0.046
"'-0.042
 snowball-0.042
THE-0.039
Token fighting
Token ID4330
Feature activation-6.44e-02

Pos logit contributions
pieces+0.038
 then+0.033
"'+0.031
 piping+0.031
 gonna+0.030
Neg logit contributions
oust-0.074
engu-0.067
ugg-0.066
ark-0.065
 jung-0.065
Token is
Token ID318
Feature activation-2.50e-02

Pos logit contributions
oust+0.096
 Gre+0.092
 Money+0.088
 Hel+0.086
 Spr+0.086
Neg logit contributions
pieces-0.064
 piping-0.057
"'-0.057
 snowball-0.056
?",-0.049
Token not
Token ID407
Feature activation-6.65e-02

Pos logit contributions
oust+0.033
 Gre+0.031
 Money+0.030
ub+0.029
ark+0.029
Neg logit contributions
pieces-0.030
 piping-0.027
"'-0.026
 snowball-0.025
THE-0.023
Token.
Token ID13
Feature activation-1.56e-02

Pos logit contributions
oust+0.102
 Gre+0.094
ub+0.092
ugg+0.092
ho+0.090
Neg logit contributions
pieces-0.068
 snowball-0.059
"'-0.057
 piping-0.057
 awesome-0.052
Token Money
Token ID12911
Feature activation-3.07e-01

Pos logit contributions
oust+0.022
 far+0.021
 lur+0.021
 Gre+0.021
ho+0.021
Neg logit contributions
pieces-0.015
 piping-0.014
"'-0.013
 snowball-0.012
 Either-0.011
Token is
Token ID318
Feature activation-2.54e-02

Pos logit contributions
 Gre+0.404
oust+0.387
 Money+0.372
 St+0.356
 Hel+0.351
Neg logit contributions
pieces-0.367
 piping-0.342
"'-0.313
 snowball-0.311
THE-0.285
Token not
Token ID407
Feature activation-6.71e-02

Pos logit contributions
oust+0.042
ugg+0.039
ub+0.038
ark+0.037
ho+0.037
Neg logit contributions
pieces-0.023
"'-0.020
 awesome-0.020
 gonna-0.019
 piping-0.019
Token the
Token ID262
Feature activation+2.49e-03

Pos logit contributions
oust+0.102
 Gre+0.095
ub+0.093
ho+0.093
ugg+0.091
Neg logit contributions
pieces-0.069
 piping-0.059
"'-0.059
 snowball-0.057
 awesome-0.052
Token most
Token ID749
Feature activation+7.80e-02

Pos logit contributions
pieces+0.003
 piping+0.003
"'+0.002
 freshly+0.002
 snowball+0.002
Neg logit contributions
oust-0.003
ub-0.003
ark-0.003
 Gre-0.003
uzz-0.003
Token important
Token ID1593
Feature activation-3.65e-02

Pos logit contributions
pieces+0.090
 piping+0.083
"'+0.077
 snowball+0.073
 freshly+0.068
Neg logit contributions
 Gre-0.104
oust-0.104
 Sno-0.096
ub-0.096
 Spr-0.096
Token someone
Token ID2130
Feature activation-1.26e-01

Pos logit contributions
pieces+0.005
 piping+0.004
"'+0.004
 freshly+0.004
 snowball+0.003
Neg logit contributions
oust-0.007
 Gre-0.007
 Money-0.007
ub-0.007
ark-0.007
Token else
Token ID2073
Feature activation+1.54e-02

Pos logit contributions
 Gre+0.159
oust+0.151
 Money+0.147
 St+0.140
hips+0.139
Neg logit contributions
pieces-0.159
 piping-0.148
"'-0.135
 snowball-0.134
THE-0.123
Token for
Token ID329
Feature activation+1.42e-02

Pos logit contributions
pieces+0.015
 piping+0.013
"'+0.013
 snowball+0.010
THE+0.010
Neg logit contributions
oust-0.023
 Gre-0.023
 jung-0.021
 Hel-0.021
ub-0.021
Token money
Token ID1637
Feature activation-1.46e-01

Pos logit contributions
pieces+0.017
 piping+0.015
"'+0.014
 freshly+0.013
 snowball+0.013
Neg logit contributions
oust-0.018
 Gre-0.018
ub-0.018
 Money-0.017
ark-0.017
Token.
Token ID13
Feature activation-4.80e-03

Pos logit contributions
 Gre+0.227
oust+0.223
 List+0.208
 lur+0.208
 jung+0.208
Neg logit contributions
pieces-0.131
 piping-0.114
"'-0.114
THE-0.101
 snowball-0.093
Token Money
Token ID12911
Feature activation-3.04e-01

Pos logit contributions
oust+0.006
 appearances+0.006
hips+0.006
 Gre+0.006
 lur+0.006
Neg logit contributions
pieces-0.006
 piping-0.005
 snowball-0.005
"'-0.005
THE-0.004
Token is
Token ID318
Feature activation-1.74e-02

Pos logit contributions
 Gre+0.410
oust+0.396
 Money+0.374
 St+0.365
 Sno+0.358
Neg logit contributions
pieces-0.351
 piping-0.326
"'-0.303
 snowball-0.288
THE-0.272
Token what
Token ID644
Feature activation+6.73e-03

Pos logit contributions
oust+0.024
 Gre+0.023
ub+0.023
ark+0.022
 lur+0.021
Neg logit contributions
pieces-0.019
 piping-0.017
"'-0.017
 snowball-0.015
 freshly-0.015
Token people
Token ID661
Feature activation-9.26e-02

Pos logit contributions
pieces+0.006
"'+0.005
 piping+0.005
 snowball+0.005
?",+0.005
Neg logit contributions
oust-0.010
ub-0.010
 Gre-0.010
ark-0.009
hips-0.009
Token use
Token ID779
Feature activation+3.70e-03

Pos logit contributions
 Gre+0.142
oust+0.142
 Sno+0.135
ub+0.133
 St+0.132
Neg logit contributions
pieces-0.088
 piping-0.075
"'-0.073
-0.063
 freshly-0.062
Token to
Token ID284
Feature activation+4.08e-03

Pos logit contributions
pieces+0.004
 piping+0.003
"'+0.003
 freshly+0.003
?",+0.003
Neg logit contributions
oust-0.006
 Gre-0.005
ub-0.005
 ante-0.005
 Sno-0.005
Token not
Token ID407
Feature activation-5.87e-02

Pos logit contributions
pieces+0.036
 piping+0.034
 snowball+0.032
"'+0.031
 freshly+0.029
Neg logit contributions
 Gre-0.024
oust-0.022
 Money-0.021
hips-0.021
 well-0.021
Token buy
Token ID2822
Feature activation-2.93e-02

Pos logit contributions
 Gre+0.072
oust+0.071
 Money+0.067
hips+0.065
 well+0.064
Neg logit contributions
pieces-0.074
 piping-0.071
 snowball-0.066
"'-0.064
THE-0.058
Token more
Token ID517
Feature activation+5.01e-02

Pos logit contributions
oust+0.048
 Gre+0.046
ark+0.046
ub+0.045
ugg+0.044
Neg logit contributions
pieces-0.029
 piping-0.024
 snowball-0.020
 awesome-0.018
 freshly-0.018
Token chalk
Token ID30860
Feature activation+1.10e-02

Pos logit contributions
pieces+0.064
 piping+0.061
"'+0.056
 snowball+0.056
THE+0.052
Neg logit contributions
 Gre-0.062
oust-0.058
 Money-0.057
 St-0.054
hips-0.054
Token.
Token ID13
Feature activation-7.76e-03

Pos logit contributions
pieces+0.014
 piping+0.013
"'+0.012
 snowball+0.012
THE+0.011
Neg logit contributions
 Gre-0.014
oust-0.013
 Money-0.013
 St-0.012
hips-0.012
Token Money
Token ID12911
Feature activation-3.04e-01

Pos logit contributions
 Gre+0.011
oust+0.011
ret+0.010
 appearances+0.010
 greed+0.010
Neg logit contributions
pieces-0.008
 piping-0.007
"'-0.007
 snowball-0.006
THE-0.006
Token was
Token ID373
Feature activation+1.29e-02

Pos logit contributions
 Gre+0.383
oust+0.361
 Money+0.356
hips+0.347
 St+0.339
Neg logit contributions
pieces-0.382
 piping-0.360
 snowball-0.327
"'-0.322
THE-0.295
Token too
Token ID1165
Feature activation+9.10e-02

Pos logit contributions
pieces+0.014
 piping+0.013
"'+0.012
 snowball+0.012
 freshly+0.011
Neg logit contributions
oust-0.018
 Gre-0.017
ub-0.017
ark-0.017
 Money-0.017
Token difficult
Token ID2408
Feature activation-1.59e-02

Pos logit contributions
"'+0.068
 piping+0.061
pieces+0.059
 awesome+0.055
?"+0.054
Neg logit contributions
 Spr-0.148
Bl-0.143
 Sno-0.142
oust-0.141
 Par-0.141
Token for
Token ID329
Feature activation+1.61e-02

Pos logit contributions
 Gre+0.026
oust+0.026
 Sno+0.025
Bl+0.025
engu+0.024
Neg logit contributions
pieces-0.013
"'-0.010
 piping-0.009
 snowball-0.009
 freshly-0.008
Token them
Token ID606
Feature activation+3.61e-02

Pos logit contributions
pieces+0.017
 piping+0.015
"'+0.014
 snowball+0.013
 freshly+0.012
Neg logit contributions
 Gre-0.024
oust-0.023
 Money-0.022
ub-0.022
 ante-0.021
Token laughed
Token ID13818
Feature activation-2.63e-02

Pos logit contributions
 Gre+0.073
 appearances+0.072
 Money+0.071
 jun+0.068
oust+0.068
Neg logit contributions
pieces-0.031
 piping-0.030
"'-0.022
 then-0.022
-0.022
Token.
Token ID13
Feature activation-1.38e-02

Pos logit contributions
 Gre+0.028
 Money+0.026
 well+0.026
oust+0.026
 far+0.026
Neg logit contributions
pieces-0.038
 piping-0.036
"'-0.033
 snowball-0.032
 freshly-0.030
Token They
Token ID1119
Feature activation-1.10e-02

Pos logit contributions
 Gre+0.017
oust+0.016
 Money+0.016
 St+0.015
hips+0.015
Neg logit contributions
pieces-0.018
 piping-0.017
"'-0.015
 snowball-0.015
THE-0.014
Token agreed
Token ID4987
Feature activation+3.23e-02

Pos logit contributions
 Gre+0.016
oust+0.015
 Money+0.015
 appearances+0.015
hips+0.014
Neg logit contributions
pieces-0.012
 piping-0.011
 snowball-0.010
"'-0.010
THE-0.009
Token.
Token ID13
Feature activation-1.44e-02

Pos logit contributions
pieces+0.027
 piping+0.026
"'+0.023
 snowball+0.022
THE+0.021
Neg logit contributions
 Gre-0.051
oust-0.050
hips-0.049
 Money-0.049
 ante-0.048
Token Sno
Token ID42578
Feature activation-3.04e-01

Pos logit contributions
 Gre+0.018
oust+0.018
 Money+0.017
hips+0.017
 lur+0.016
Neg logit contributions
pieces-0.017
 piping-0.017
"'-0.015
 snowball-0.015
THE-0.014
Tokenopy
Token ID11081
Feature activation-2.96e-02

Pos logit contributions
 Gre+0.473
 Money+0.436
 unexpl+0.433
 jung+0.433
hips+0.428
Neg logit contributions
 piping-0.269
pieces-0.264
-0.252
 snowball-0.252
"'-0.250
Token was
Token ID373
Feature activation+3.60e-03

Pos logit contributions
 Gre+0.051
 Money+0.049
oust+0.047
 List+0.047
hips+0.047
Neg logit contributions
pieces-0.022
 piping-0.019
 snowball-0.017
"'-0.017
-0.015
Token a
Token ID257
Feature activation-2.55e-02

Pos logit contributions
pieces+0.005
 piping+0.005
"'+0.004
 snowball+0.004
THE+0.004
Neg logit contributions
 Gre-0.004
 far-0.004
 St-0.004
 well-0.004
oust-0.004
Token good
Token ID922
Feature activation-7.42e-02

Pos logit contributions
 Gre+0.030
oust+0.029
 Money+0.028
 St+0.027
hips+0.026
Neg logit contributions
pieces-0.035
 piping-0.032
"'-0.030
 snowball-0.029
THE-0.027
Token name
Token ID1438
Feature activation-4.71e-02

Pos logit contributions
hips+0.131
 Money+0.122
 Gre+0.122
bars+0.121
ocular+0.120
Neg logit contributions
 snowball-0.051
 piping-0.050
pieces-0.047
-0.040
"'-0.035
Tokenoral
Token ID6864
Feature activation+5.99e-02

Pos logit contributions
pieces+0.040
 piping+0.036
"'+0.033
 snowball+0.033
THE+0.030
Neg logit contributions
 Gre-0.057
oust-0.054
 Money-0.053
 St-0.052
 Spark-0.051
Token of
Token ID286
Feature activation-2.96e-04

Pos logit contributions
pieces+0.031
"'+0.031
?",+0.030
THE+0.030
?"+0.027
Neg logit contributions
 Ho-0.105
ub-0.103
inch-0.103
 Sno-0.099
opp-0.099
Token the
Token ID262
Feature activation+9.79e-03

Pos logit contributions
 Gre+0.000
oust+0.000
 Money+0.000
 St+0.000
hips+0.000
Neg logit contributions
pieces-0.000
 piping-0.000
"'-0.000
 snowball-0.000
THE-0.000
Token story
Token ID1621
Feature activation-3.16e-02

Pos logit contributions
pieces+0.009
 piping+0.008
"'+0.007
THE+0.007
 snowball+0.007
Neg logit contributions
oust-0.015
uzz-0.015
 Gre-0.015
ub-0.015
 Money-0.014
Token:
Token ID25
Feature activation+4.81e-02

Pos logit contributions
oust+0.060
 Gre+0.059
 lur+0.058
 Sno+0.058
 ante+0.058
Neg logit contributions
"'-0.012
pieces-0.010
THE-0.009
 piping-0.009
?",-0.008
Token Money
Token ID12911
Feature activation-3.04e-01

Pos logit contributions
pieces+0.048
"'+0.043
THE+0.038
 Either+0.036
 ""+0.036
Neg logit contributions
oust-0.068
 lur-0.064
 unexpl-0.064
 ante-0.061
ocular-0.060
Token cannot
Token ID2314
Feature activation+6.02e-02

Pos logit contributions
 Gre+0.438
oust+0.430
 Money+0.402
 Sno+0.399
 St+0.396
Neg logit contributions
pieces-0.315
 piping-0.280
"'-0.264
THE-0.241
 snowball-0.237
Token buy
Token ID2822
Feature activation-2.79e-02

Pos logit contributions
pieces+0.034
"'+0.032
?",+0.028
 have+0.028
 possibly+0.028
Neg logit contributions
oust-0.101
ud-0.101
 St-0.100
uff-0.099
 Gre-0.098
Token true
Token ID2081
Feature activation-6.19e-02

Pos logit contributions
oust+0.046
ho+0.045
ugg+0.045
ub+0.044
ark+0.044
Neg logit contributions
pieces-0.025
 piping-0.020
 awesome-0.019
"'-0.017
 freshly-0.017
Token happiness
Token ID12157
Feature activation-5.86e-02

Pos logit contributions
oust+0.088
 Gre+0.086
ark+0.083
 ante+0.082
 St+0.081
Neg logit contributions
pieces-0.066
 piping-0.057
"'-0.053
-0.047
 freshly-0.046
Token.
Token ID13
Feature activation-1.13e-02

Pos logit contributions
 Gre+0.085
oust+0.082
 Money+0.078
 ante+0.077
 Hel+0.075
Neg logit contributions
pieces-0.063
 piping-0.056
"'-0.051
 snowball-0.048
THE-0.047
Token oxygen
Token ID11863
Feature activation-1.34e-01

Pos logit contributions
pieces+0.003
 piping+0.003
 snowball+0.003
"'+0.002
 freshly+0.002
Neg logit contributions
oust-0.004
 Gre-0.004
uzz-0.004
ho-0.004
ark-0.004
Token from
Token ID422
Feature activation-1.56e-01

Pos logit contributions
 Gre+0.182
oust+0.174
 Money+0.169
 St+0.162
 Hel+0.159
Neg logit contributions
pieces-0.155
 piping-0.144
 snowball-0.130
"'-0.130
THE-0.120
Token the
Token ID262
Feature activation+4.27e-03

Pos logit contributions
oust+0.242
engu+0.239
ub+0.236
 Money+0.234
 Gre+0.231
Neg logit contributions
pieces-0.139
 piping-0.120
 snowball-0.106
 freshly-0.100
"'-0.097
Token tank
Token ID6873
Feature activation-4.25e-02

Pos logit contributions
pieces+0.005
 piping+0.004
 snowball+0.004
 freshly+0.004
"'+0.004
Neg logit contributions
uzz-0.006
oust-0.006
ark-0.006
 Gre-0.006
ub-0.006
Token.
Token ID13
Feature activation-1.46e-02

Pos logit contributions
 Gre+0.068
oust+0.064
 ante+0.064
 Money+0.063
 St+0.062
Neg logit contributions
pieces-0.038
"'-0.030
 piping-0.030
THE-0.027
 snowball-0.025
Token Hel
Token ID5053
Feature activation-3.02e-01

Pos logit contributions
 appearances+0.021
 greed+0.021
oust+0.021
 lur+0.021
 well+0.020
Neg logit contributions
pieces-0.014
"'-0.012
 piping-0.012
THE-0.010
 ""-0.010
Tokenio
Token ID952
Feature activation-2.89e-02

Pos logit contributions
 Gre+0.354
oust+0.332
 Money+0.324
 St+0.308
hips+0.306
Neg logit contributions
pieces-0.413
 piping-0.385
"'-0.356
 snowball-0.349
THE-0.325
Token was
Token ID373
Feature activation+4.07e-03

Pos logit contributions
 Gre+0.040
 Money+0.038
oust+0.037
 St+0.036
hips+0.036
Neg logit contributions
pieces-0.033
 piping-0.031
"'-0.028
 snowball-0.028
THE-0.025
Token thankful
Token ID25535
Feature activation-7.17e-02

Pos logit contributions
pieces+0.004
 piping+0.004
"'+0.004
 snowball+0.004
THE+0.003
Neg logit contributions
 Gre-0.006
oust-0.006
 Money-0.006
 St-0.005
hips-0.005
Token and
Token ID290
Feature activation+1.56e-03

Pos logit contributions
 Gre+0.107
oust+0.103
 Money+0.102
 ante+0.098
hips+0.097
Neg logit contributions
pieces-0.069
 piping-0.064
"'-0.059
 snowball-0.056
THE-0.055
Token thanked
Token ID26280
Feature activation-3.22e-02

Pos logit contributions
pieces+0.002
 piping+0.002
"'+0.001
THE+0.001
 snowball+0.001
Neg logit contributions
oust-0.002
 Gre-0.002
 Money-0.002
hips-0.002
 appearances-0.002
Token "
Token ID366
Feature activation-9.51e-03

Pos logit contributions
 Gre+0.048
oust+0.046
 Money+0.045
 St+0.043
hips+0.042
Neg logit contributions
pieces-0.048
 piping-0.045
 snowball-0.041
"'-0.040
THE-0.037
TokenI
Token ID40
Feature activation-1.45e-02

Pos logit contributions
 Gre+0.013
oust+0.012
 Money+0.012
hips+0.012
 appearances+0.011
Neg logit contributions
pieces-0.011
 piping-0.011
"'-0.009
 snowball-0.009
THE-0.009
Token will
Token ID481
Feature activation-3.09e-02

Pos logit contributions
 Gre+0.018
oust+0.017
 Money+0.017
hips+0.016
 appearances+0.016
Neg logit contributions
pieces-0.018
 piping-0.017
"'-0.016
 snowball-0.015
THE-0.014
Token name
Token ID1438
Feature activation-4.54e-02

Pos logit contributions
 Gre+0.043
oust+0.042
 Money+0.041
 lur+0.039
 St+0.039
Neg logit contributions
pieces-0.033
 piping-0.032
"'-0.029
 snowball-0.027
THE-0.026
Token him
Token ID683
Feature activation-5.81e-02

Pos logit contributions
 Gre+0.052
oust+0.049
 Money+0.047
 St+0.045
hips+0.045
Neg logit contributions
pieces-0.063
 piping-0.059
"'-0.054
 snowball-0.054
THE-0.050
Token Sno
Token ID42578
Feature activation-3.02e-01

Pos logit contributions
 Gre+0.068
 Money+0.063
 St+0.062
oust+0.062
 Spark+0.061
Neg logit contributions
pieces-0.080
 piping-0.076
 snowball-0.067
"'-0.067
 freshly-0.062
Tokenopy
Token ID11081
Feature activation-2.74e-02

Pos logit contributions
 Gre+0.439
oust+0.405
 Money+0.403
 jun+0.395
 List+0.394
Neg logit contributions
pieces-0.321
 piping-0.300
"'-0.273
 snowball-0.273
-0.272
Token.
Token ID13
Feature activation-1.22e-02

Pos logit contributions
 Gre+0.036
oust+0.034
 Money+0.034
 St+0.032
hips+0.032
Neg logit contributions
pieces-0.033
 piping-0.030
"'-0.028
 snowball-0.028
THE-0.025
Token Because
Token ID4362
Feature activation+4.27e-02

Pos logit contributions
 Gre+0.017
oust+0.016
hips+0.015
 Money+0.015
ret+0.015
Neg logit contributions
 piping-0.013
pieces-0.013
 snowball-0.011
"'-0.011
THE-0.010
Token he
Token ID339
Feature activation-1.13e-02

Pos logit contributions
 piping+0.036
pieces+0.032
?"+0.027
 snowball+0.027
 freshly+0.027
Neg logit contributions
hips-0.065
 Gre-0.064
oust-0.064
 jun-0.063
 appearances-0.062
Token is
Token ID318
Feature activation-2.14e-02

Pos logit contributions
 Gre+0.017
oust+0.017
hips+0.017
 Money+0.017
ub+0.016
Neg logit contributions
pieces-0.010
 piping-0.009
"'-0.009
 snowball-0.008
-0.008
Token was
Token ID373
Feature activation+4.95e-03

Pos logit contributions
 Gre+0.056
oust+0.054
 Money+0.054
hips+0.053
 appearances+0.052
Neg logit contributions
pieces-0.034
 piping-0.030
 snowball-0.030
"'-0.029
THE-0.026
Token very
Token ID845
Feature activation+5.35e-03

Pos logit contributions
pieces+0.005
 piping+0.005
"'+0.004
 snowball+0.004
THE+0.004
Neg logit contributions
 Gre-0.007
oust-0.007
 Money-0.007
 St-0.007
hips-0.007
Token kind
Token ID1611
Feature activation-1.54e-01

Pos logit contributions
pieces+0.004
 piping+0.004
"'+0.004
 snowball+0.003
 freshly+0.003
Neg logit contributions
 Gre-0.009
oust-0.009
 Money-0.009
 appearances-0.009
hips-0.008
Token and
Token ID290
Feature activation+2.22e-03

Pos logit contributions
 Gre+0.279
Bl+0.266
ret+0.265
 ante+0.264
 jun+0.262
Neg logit contributions
pieces-0.087
 piping-0.069
"'-0.066
 snowball-0.063
THE-0.057
Token offered
Token ID4438
Feature activation-4.50e-03

Pos logit contributions
pieces+0.002
 piping+0.002
"'+0.002
 freshly+0.002
 snowball+0.002
Neg logit contributions
oust-0.003
 Gre-0.003
hips-0.003
 Money-0.003
 appearances-0.003
Token Hel
Token ID5053
Feature activation-3.02e-01

Pos logit contributions
 appearances+0.008
oust+0.008
 ante+0.008
hips+0.008
 jun+0.007
Neg logit contributions
pieces-0.003
 cocoa-0.001
 freshly-0.001
 piping-0.001
 such-0.001
Tokenio
Token ID952
Feature activation-2.79e-02

Pos logit contributions
 Gre+0.364
oust+0.336
 Money+0.333
 List+0.333
 far+0.331
Neg logit contributions
pieces-0.404
 piping-0.349
"'-0.340
 snowball-0.304
THE-0.292
Token some
Token ID617
Feature activation+2.33e-02

Pos logit contributions
 Gre+0.037
oust+0.035
 Money+0.034
 ante+0.032
 St+0.032
Neg logit contributions
pieces-0.034
 piping-0.031
"'-0.029
 snowball-0.028
THE-0.026
Token of
Token ID286
Feature activation-6.94e-03

Pos logit contributions
pieces+0.027
 piping+0.025
 snowball+0.023
 freshly+0.021
"'+0.020
Neg logit contributions
 Gre-0.032
oust-0.031
ub-0.030
ark-0.030
 Money-0.030
Token the
Token ID262
Feature activation+2.85e-03

Pos logit contributions
oust+0.010
 Money+0.010
 Gre+0.010
engu+0.010
 List+0.010
Neg logit contributions
pieces-0.007
 piping-0.005
"'-0.005
 snowball-0.005
THE-0.005
Token oxygen
Token ID11863
Feature activation-1.34e-01

Pos logit contributions
pieces+0.003
 piping+0.003
 snowball+0.003
"'+0.002
 freshly+0.002
Neg logit contributions
oust-0.004
 Gre-0.004
uzz-0.004
ho-0.004
ark-0.004
Token's
Token ID338
Feature activation-8.97e-03

Pos logit contributions
pieces+0.070
 piping+0.068
 snowball+0.063
 freshly+0.058
THE+0.058
Neg logit contributions
 Gre-0.035
 truth-0.027
 far-0.027
 Money-0.027
 well-0.026
Token weigh
Token ID10164
Feature activation-6.67e-02

Pos logit contributions
 Gre+0.012
oust+0.012
 Money+0.011
 St+0.011
ub+0.011
Neg logit contributions
pieces-0.010
 piping-0.010
"'-0.009
 snowball-0.009
?",-0.008
Token the
Token ID262
Feature activation+6.54e-03

Pos logit contributions
 Gre+0.089
oust+0.086
 Money+0.083
hips+0.079
 St+0.077
Neg logit contributions
pieces-0.080
 piping-0.074
 snowball-0.067
"'-0.067
THE-0.060
Token options
Token ID3689
Feature activation+3.14e-02

Pos logit contributions
pieces+0.007
 piping+0.007
 snowball+0.007
 freshly+0.006
"'+0.006
Neg logit contributions
 Gre-0.009
ark-0.009
oust-0.009
ub-0.008
 Money-0.008
Token.
Token ID13
Feature activation-9.21e-03

Pos logit contributions
pieces+0.034
 piping+0.032
"'+0.029
 snowball+0.028
THE+0.026
Neg logit contributions
 Gre-0.045
oust-0.043
 Money-0.041
 appearances-0.040
 St-0.040
Token St
Token ID520
Feature activation-3.01e-01

Pos logit contributions
oust+0.013
 Gre+0.013
 lur+0.012
 far+0.012
hips+0.012
Neg logit contributions
pieces-0.010
 piping-0.009
"'-0.008
 snowball-0.008
THE-0.007
Tokenaying
Token ID8369
Feature activation-7.28e-02

Pos logit contributions
 Gre+0.386
 Hel+0.369
 jung+0.364
 Spark+0.362
 Ho+0.361
Neg logit contributions
pieces-0.368
"'-0.324
 piping-0.318
affles-0.285
-0.282
Token here
Token ID994
Feature activation+9.32e-02

Pos logit contributions
 Gre+0.103
oust+0.102
 Money+0.097
ub+0.092
uzz+0.092
Neg logit contributions
pieces-0.078
 piping-0.074
 snowball-0.069
"'-0.065
?",-0.060
Token is
Token ID318
Feature activation-1.86e-02

Pos logit contributions
pieces+0.106
 piping+0.097
"'+0.090
 snowball+0.086
THE+0.082
Neg logit contributions
 Gre-0.128
oust-0.125
 Money-0.119
 St-0.115
 ante-0.113
Token cheap
Token ID7026
Feature activation-6.61e-03

Pos logit contributions
 Gre+0.024
oust+0.024
 Money+0.023
ub+0.022
hips+0.022
Neg logit contributions
pieces-0.022
 piping-0.021
 snowball-0.019
"'-0.019
THE-0.017
Token,
Token ID11
Feature activation-3.41e-02

Pos logit contributions
 Gre+0.010
oust+0.009
 Money+0.009
 St+0.008
hips+0.008
Neg logit contributions
pieces-0.007
 piping-0.007
 snowball-0.006
"'-0.006
THE-0.005
Token was
Token ID373
Feature activation+1.17e-02

Pos logit contributions
pieces+0.018
 piping+0.017
 snowball+0.015
"'+0.015
THE+0.014
Neg logit contributions
 Gre-0.024
oust-0.023
 Money-0.023
hips-0.022
 appearances-0.021
Token a
Token ID257
Feature activation-1.69e-02

Pos logit contributions
pieces+0.019
 piping+0.017
"'+0.016
 snowball+0.015
THE+0.015
Neg logit contributions
 Gre-0.011
 far-0.010
 St-0.010
 Money-0.010
oust-0.010
Token little
Token ID1310
Feature activation-2.07e-02

Pos logit contributions
 Gre+0.023
oust+0.022
 Money+0.021
hips+0.021
 appearances+0.020
Neg logit contributions
pieces-0.019
 piping-0.018
 snowball-0.018
"'-0.016
 freshly-0.015
Token girl
Token ID2576
Feature activation+4.58e-02

Pos logit contributions
 Gre+0.036
hips+0.036
ret+0.035
 Money+0.035
oust+0.035
Neg logit contributions
 snowball-0.015
 piping-0.014
pieces-0.013
-0.010
"'-0.010
Token named
Token ID3706
Feature activation+1.64e-02

Pos logit contributions
pieces+0.043
 piping+0.040
 snowball+0.035
"'+0.035
THE+0.030
Neg logit contributions
 Gre-0.071
oust-0.068
 Money-0.068
hips-0.066
ub-0.064
Token St
Token ID520
Feature activation-3.00e-01

Pos logit contributions
pieces+0.021
 piping+0.020
"'+0.018
 snowball+0.017
THE+0.016
Neg logit contributions
 Gre-0.020
oust-0.020
 Money-0.019
hips-0.018
 St-0.018
Tokenacey
Token ID25415
Feature activation-1.05e-01

Pos logit contributions
 Gre+0.314
 lur+0.307
ocular+0.304
 jung+0.303
 appearances+0.300
Neg logit contributions
pieces-0.406
 piping-0.375
"'-0.368
-0.364
THE-0.337
Token.
Token ID13
Feature activation-7.45e-03

Pos logit contributions
 Gre+0.150
oust+0.142
 Money+0.139
hips+0.133
 St+0.131
Neg logit contributions
pieces-0.114
 piping-0.102
"'-0.095
 snowball-0.089
THE-0.087
Token She
Token ID1375
Feature activation-1.11e-02

Pos logit contributions
oust+0.010
 Gre+0.010
 appearances+0.009
hips+0.009
 lur+0.009
Neg logit contributions
pieces-0.008
 piping-0.008
"'-0.007
 snowball-0.006
THE-0.006
Token was
Token ID373
Feature activation+1.17e-02

Pos logit contributions
 Gre+0.012
oust+0.011
 Money+0.011
 St+0.011
 Spark+0.010
Neg logit contributions
pieces-0.016
 piping-0.015
"'-0.014
 snowball-0.014
THE-0.013
Token very
Token ID845
Feature activation+8.02e-03

Pos logit contributions
pieces+0.013
 piping+0.012
 snowball+0.011
"'+0.010
 freshly+0.010
Neg logit contributions
oust-0.016
 Gre-0.016
hips-0.016
 Money-0.015
ub-0.015
Token like
Token ID588
Feature activation-2.37e-02

Pos logit contributions
 Gre+0.071
oust+0.069
 Money+0.068
ub+0.064
hips+0.064
Neg logit contributions
pieces-0.055
 piping-0.051
"'-0.046
 snowball-0.045
 freshly-0.041
Token to
Token ID284
Feature activation+7.99e-03

Pos logit contributions
 Gre+0.033
oust+0.033
 Money+0.031
hips+0.030
ub+0.030
Neg logit contributions
pieces-0.026
 piping-0.024
 snowball-0.022
"'-0.021
THE-0.019
Token have
Token ID423
Feature activation+1.02e-01

Pos logit contributions
pieces+0.009
 piping+0.009
"'+0.008
 snowball+0.008
THE+0.007
Neg logit contributions
 Gre-0.011
oust-0.010
 Money-0.010
 St-0.010
hips-0.010
Token money
Token ID1637
Feature activation-1.42e-01

Pos logit contributions
pieces+0.138
 piping+0.131
"'+0.124
 snowball+0.116
THE+0.113
Neg logit contributions
 Gre-0.116
 Money-0.107
oust-0.106
 St-0.105
 well-0.104
Token.
Token ID13
Feature activation+2.97e-03

Pos logit contributions
 Gre+0.232
oust+0.228
 lur+0.216
 jung+0.214
ub+0.214
Neg logit contributions
pieces-0.111
 piping-0.103
"'-0.092
 snowball-0.090
-0.082
Token Money
Token ID12911
Feature activation-3.00e-01

Pos logit contributions
pieces+0.004
 piping+0.004
"'+0.004
 snowball+0.004
THE+0.004
Neg logit contributions
 Gre-0.003
 Money-0.003
 St-0.003
 List-0.003
oust-0.003
Token can
Token ID460
Feature activation-2.27e-03

Pos logit contributions
 Gre+0.365
oust+0.345
 Money+0.340
hips+0.324
 St+0.323
Neg logit contributions
pieces-0.391
 piping-0.367
 snowball-0.334
"'-0.332
THE-0.306
Token buy
Token ID2822
Feature activation-2.47e-02

Pos logit contributions
 Gre+0.003
oust+0.003
ub+0.003
 List+0.003
 Money+0.003
Neg logit contributions
pieces-0.002
 piping-0.002
"'-0.002
 snowball-0.002
?",-0.002
Token candy
Token ID18550
Feature activation-5.89e-02

Pos logit contributions
oust+0.042
ub+0.042
ho+0.041
ugg+0.040
ark+0.040
Neg logit contributions
pieces-0.020
 piping-0.018
 cookies-0.016
 awesome-0.015
 freshly-0.014
Token and
Token ID290
Feature activation+1.74e-02

Pos logit contributions
 Gre+0.077
oust+0.074
 Money+0.071
 St+0.067
 appearances+0.067
Neg logit contributions
pieces-0.071
 piping-0.066
"'-0.060
 snowball-0.059
THE-0.055
Token ice
Token ID4771
Feature activation-3.05e-03

Pos logit contributions
pieces+0.022
 piping+0.021
"'+0.019
 snowball+0.019
 freshly+0.018
Neg logit contributions
 Gre-0.021
oust-0.021
ub-0.020
ark-0.019
 Money-0.019
Token themselves
Token ID2405
Feature activation+1.39e-03

Pos logit contributions
 lur+0.031
hips+0.031
oust+0.030
engu+0.030
bars+0.029
Neg logit contributions
pieces-0.010
THE-0.006
 then-0.006
 such-0.006
 back-0.005
Token as
Token ID355
Feature activation+6.03e-02

Pos logit contributions
pieces+0.001
 piping+0.001
"'+0.001
THE+0.001
 snowball+0.001
Neg logit contributions
 Gre-0.002
oust-0.002
 Money-0.002
 ante-0.002
 lur-0.002
Token Hel
Token ID5053
Feature activation-2.99e-01

Pos logit contributions
pieces+0.045
 awesome+0.042
 gonna+0.039
THE+0.037
 snowball+0.035
Neg logit contributions
oust-0.093
engu-0.093
ark-0.089
 appearances-0.089
 ante-0.088
Tokenio
Token ID952
Feature activation-2.59e-02

Pos logit contributions
 Gre+0.336
 List+0.324
oust+0.316
hips+0.316
 far+0.312
Neg logit contributions
pieces-0.418
"'-0.369
 piping-0.361
cause-0.311
THE-0.305
Token.
Token ID13
Feature activation-1.23e-02

Pos logit contributions
 Gre+0.036
 Money+0.033
oust+0.032
 List+0.032
 St+0.031
Neg logit contributions
pieces-0.031
 piping-0.027
"'-0.026
 snowball-0.025
THE-0.023
Token Hel
Token ID5053
Feature activation-3.00e-01

Pos logit contributions
 appearances+0.020
oust+0.019
 greed+0.019
 lur+0.019
ho+0.019
Neg logit contributions
pieces-0.010
 piping-0.009
"'-0.008
THE-0.007
-0.007
Tokenio
Token ID952
Feature activation-2.61e-02

Pos logit contributions
 Gre+0.318
oust+0.301
 Money+0.289
 far+0.283
 List+0.283
Neg logit contributions
pieces-0.442
 piping-0.398
"'-0.386
 snowball-0.355
THE-0.342
Token explained
Token ID4893
Feature activation+1.64e-02

Pos logit contributions
 Gre+0.037
 Money+0.035
oust+0.034
 St+0.033
 appearances+0.032
Neg logit contributions
pieces-0.029
 piping-0.027
"'-0.025
 snowball-0.025
THE-0.023
Token that
Token ID326
Feature activation-6.48e-03

Pos logit contributions
pieces+0.009
"'+0.006
:"+0.006
 ""+0.005
 then+0.005
Neg logit contributions
oust-0.029
ud-0.029
ugg-0.028
rate-0.028
 Gre-0.028
Token their
Token ID511
Feature activation+6.06e-02

Pos logit contributions
oust+0.011
ocular+0.011
ark+0.011
inch+0.011
erv+0.011
Neg logit contributions
pieces-0.004
"'-0.002
 then-0.002
THE-0.002
 piping-0.002
Token oxygen
Token ID11863
Feature activation-1.32e-01

Pos logit contributions
pieces+0.065
 piping+0.061
 snowball+0.061
"'+0.056
THE+0.053
Neg logit contributions
oust-0.083
 Gre-0.080
 St-0.076
ark-0.074
 Sno-0.074
Token no
Token ID645
Feature activation-8.93e-02

Pos logit contributions
pieces+0.046
 piping+0.042
"'+0.038
?",+0.035
 freshly+0.035
Neg logit contributions
oust-0.068
 Gre-0.065
ark-0.061
ub-0.061
 Money-0.061
Token.
Token ID13
Feature activation-8.55e-03

Pos logit contributions
 Gre+0.108
oust+0.102
 Money+0.099
 St+0.095
hips+0.094
Neg logit contributions
pieces-0.119
 piping-0.112
"'-0.102
 snowball-0.102
THE-0.094
Token It
Token ID632
Feature activation+3.25e-02

Pos logit contributions
oust+0.014
 lur+0.013
 far+0.013
 well+0.013
 Gre+0.013
Neg logit contributions
pieces-0.007
 piping-0.006
"'-0.006
 Either-0.005
 gonna-0.005
Token's
Token ID338
Feature activation-7.44e-03

Pos logit contributions
pieces+0.026
"'+0.022
 piping+0.022
 snowball+0.021
?",+0.018
Neg logit contributions
oust-0.055
 Gre-0.052
uzz-0.050
 Sno-0.050
hips-0.049
Token too
Token ID1165
Feature activation+8.82e-02

Pos logit contributions
oust+0.012
 Gre+0.011
ub+0.011
engu+0.011
ark+0.011
Neg logit contributions
pieces-0.007
 snowball-0.006
 piping-0.006
"'-0.006
 gonna-0.005
Token far
Token ID1290
Feature activation-3.00e-01

Pos logit contributions
?"+0.098
?",+0.098
"'+0.095
+0.095
 snowball+0.093
Neg logit contributions
oust-0.110
 Gre-0.104
 Sno-0.103
inch-0.102
pen-0.102
Token away
Token ID1497
Feature activation+2.83e-02

Pos logit contributions
 Gre+0.428
oust+0.412
 Money+0.399
 St+0.383
hips+0.377
Neg logit contributions
pieces-0.323
 piping-0.301
"'-0.271
 snowball-0.268
THE-0.244
Token".
Token ID1911
Feature activation+2.93e-02

Pos logit contributions
pieces+0.033
 piping+0.031
"'+0.028
 snowball+0.027
THE+0.025
Neg logit contributions
 Gre-0.038
oust-0.036
 Money-0.035
 St-0.034
hips-0.033
Token Maria
Token ID14200
Feature activation+1.32e-01

Pos logit contributions
 piping+0.041
pieces+0.040
 snowball+0.040
 stret+0.037
affles+0.037
Neg logit contributions
 Gre-0.034
 List-0.032
 Money-0.031
 T-0.031
 Ign-0.031
Token was
Token ID373
Feature activation+7.52e-03

Pos logit contributions
pieces+0.147
 piping+0.135
"'+0.126
 snowball+0.124
THE+0.111
Neg logit contributions
 Gre-0.184
oust-0.176
 Money-0.175
hips-0.170
 appearances-0.166
Token feeling
Token ID4203
Feature activation-5.71e-02

Pos logit contributions
pieces+0.009
 piping+0.009
"'+0.008
 snowball+0.008
THE+0.007
Neg logit contributions
 Gre-0.010
oust-0.009
 Money-0.009
 St-0.009
hips-0.009
Token tank
Token ID6873
Feature activation-3.96e-02

Pos logit contributions
 Gre+0.111
oust+0.106
 St+0.102
 Money+0.101
 ante+0.101
Neg logit contributions
pieces-0.067
 piping-0.060
"'-0.056
 snowball-0.055
THE-0.052
Token and
Token ID290
Feature activation+3.73e-03

Pos logit contributions
 Gre+0.059
oust+0.057
 Money+0.055
 St+0.053
 Hel+0.053
Neg logit contributions
pieces-0.040
 piping-0.035
"'-0.033
 snowball-0.033
THE-0.031
Token introduced
Token ID5495
Feature activation-1.80e-02

Pos logit contributions
pieces+0.004
 piping+0.003
 then+0.003
"'+0.003
 snowball+0.003
Neg logit contributions
oust-0.006
 Gre-0.005
ark-0.005
 Money-0.005
hips-0.005
Token themselves
Token ID2405
Feature activation+1.39e-03

Pos logit contributions
 lur+0.031
hips+0.031
oust+0.030
engu+0.030
bars+0.029
Neg logit contributions
pieces-0.010
THE-0.006
 then-0.006
 such-0.006
 back-0.005
Token as
Token ID355
Feature activation+6.03e-02

Pos logit contributions
pieces+0.001
 piping+0.001
"'+0.001
THE+0.001
 snowball+0.001
Neg logit contributions
 Gre-0.002
oust-0.002
 Money-0.002
 ante-0.002
 lur-0.002
Token Hel
Token ID5053
Feature activation-2.99e-01

Pos logit contributions
pieces+0.045
 awesome+0.042
 gonna+0.039
THE+0.037
 snowball+0.035
Neg logit contributions
oust-0.093
engu-0.093
ark-0.089
 appearances-0.089
 ante-0.088
Tokenio
Token ID952
Feature activation-2.59e-02

Pos logit contributions
 Gre+0.336
 List+0.324
oust+0.316
hips+0.316
 far+0.312
Neg logit contributions
pieces-0.418
"'-0.369
 piping-0.361
cause-0.311
THE-0.305
Token.
Token ID13
Feature activation-1.23e-02

Pos logit contributions
 Gre+0.036
 Money+0.033
oust+0.032
 List+0.032
 St+0.031
Neg logit contributions
pieces-0.031
 piping-0.027
"'-0.026
 snowball-0.025
THE-0.023
Token Hel
Token ID5053
Feature activation-3.00e-01

Pos logit contributions
 appearances+0.020
oust+0.019
 greed+0.019
 lur+0.019
ho+0.019
Neg logit contributions
pieces-0.010
 piping-0.009
"'-0.008
THE-0.007
-0.007
Tokenio
Token ID952
Feature activation-2.61e-02

Pos logit contributions
 Gre+0.318
oust+0.301
 Money+0.289
 far+0.283
 List+0.283
Neg logit contributions
pieces-0.442
 piping-0.398
"'-0.386
 snowball-0.355
THE-0.342
Token explained
Token ID4893
Feature activation+1.64e-02

Pos logit contributions
 Gre+0.037
 Money+0.035
oust+0.034
 St+0.033
 appearances+0.032
Neg logit contributions
pieces-0.029
 piping-0.027
"'-0.025
 snowball-0.025
THE-0.023
TokenIt
Token ID1026
Feature activation-4.20e-02

Pos logit contributions
pieces+0.039
 piping+0.037
"'+0.033
 snowball+0.033
THE+0.030
Neg logit contributions
 Gre-0.044
oust-0.042
 Money-0.040
 St-0.039
hips-0.039
Token's
Token ID338
Feature activation-1.29e-02

Pos logit contributions
 Gre+0.055
oust+0.051
 Money+0.051
 St+0.048
hips+0.048
Neg logit contributions
pieces-0.052
 piping-0.049
"'-0.044
 snowball-0.043
THE-0.040
Token a
Token ID257
Feature activation-2.85e-02

Pos logit contributions
oust+0.018
 Gre+0.018
 Money+0.017
ub+0.017
ark+0.016
Neg logit contributions
pieces-0.013
 piping-0.012
 snowball-0.012
"'-0.011
THE-0.010
Token big
Token ID1263
Feature activation-1.14e-02

Pos logit contributions
 Gre+0.039
oust+0.036
 Money+0.036
 St+0.036
 List+0.035
Neg logit contributions
pieces-0.032
 piping-0.032
 snowball-0.028
"'-0.027
 freshly-0.025
Token country
Token ID1499
Feature activation-1.27e-01

Pos logit contributions
 Gre+0.016
oust+0.015
 Money+0.015
 St+0.015
ub+0.014
Neg logit contributions
pieces-0.012
 piping-0.012
 snowball-0.011
"'-0.010
-0.009
Token far
Token ID1290
Feature activation-2.99e-01

Pos logit contributions
 Gre+0.179
oust+0.170
 Money+0.164
 St+0.160
 Sno+0.156
Neg logit contributions
pieces-0.141
 piping-0.129
"'-0.118
 snowball-0.116
THE-0.108
Token away
Token ID1497
Feature activation+2.97e-02

Pos logit contributions
 Gre+0.436
oust+0.419
 Money+0.409
 St+0.393
hips+0.386
Neg logit contributions
pieces-0.315
 piping-0.292
 snowball-0.263
"'-0.261
THE-0.235
Token.
Token ID13
Feature activation-1.55e-02

Pos logit contributions
pieces+0.037
 piping+0.034
"'+0.031
 snowball+0.031
THE+0.028
Neg logit contributions
 Gre-0.038
oust-0.036
 Money-0.035
 St-0.034
hips-0.034
Token It
Token ID632
Feature activation+3.05e-02

Pos logit contributions
oust+0.025
 Gre+0.024
 well+0.024
 lur+0.023
ho+0.023
Neg logit contributions
pieces-0.012
 piping-0.010
"'-0.010
 Either-0.009
THE-0.008
Token has
Token ID468
Feature activation+3.90e-02

Pos logit contributions
pieces+0.035
 piping+0.032
"'+0.030
 snowball+0.029
THE+0.027
Neg logit contributions
 Gre-0.041
oust-0.041
 Money-0.038
 St-0.037
 Sno-0.037
Token many
Token ID867
Feature activation-2.04e-01

Pos logit contributions
pieces+0.043
 piping+0.039
 awesome+0.035
"'+0.034
 snowball+0.034
Neg logit contributions
oust-0.054
 Gre-0.053
ub-0.053
opp-0.050
ark-0.050
Token.
Token ID13
Feature activation-1.54e-02

Pos logit contributions
 Gre+0.009
oust+0.009
engu+0.009
 Sno+0.009
 List+0.008
Neg logit contributions
pieces-0.003
THE-0.003
-0.003
"'-0.003
 snowball-0.003
Token Sometimes
Token ID8975
Feature activation-7.47e-02

Pos logit contributions
oust+0.021
 Gre+0.021
ret+0.019
hips+0.019
 lur+0.019
Neg logit contributions
pieces-0.016
 piping-0.016
"'-0.014
 snowball-0.014
THE-0.013
Token he
Token ID339
Feature activation-1.36e-02

Pos logit contributions
 Gre+0.105
oust+0.105
 Money+0.098
uzz+0.097
ub+0.097
Neg logit contributions
pieces-0.078
 piping-0.071
 snowball-0.071
"'-0.067
?",-0.061
Token gets
Token ID3011
Feature activation-9.79e-03

Pos logit contributions
 Gre+0.019
oust+0.018
 Money+0.017
hips+0.017
 St+0.017
Neg logit contributions
pieces-0.015
 piping-0.014
"'-0.013
 snowball-0.013
THE-0.011
Token too
Token ID1165
Feature activation+8.77e-02

Pos logit contributions
 Gre+0.013
oust+0.012
 Money+0.012
hips+0.011
 St+0.011
Neg logit contributions
pieces-0.012
 piping-0.011
 snowball-0.010
"'-0.010
THE-0.009
Token far
Token ID1290
Feature activation-2.99e-01

Pos logit contributions
 snowball+0.093
"'+0.082
 piping+0.079
+0.078
pieces+0.078
Neg logit contributions
 Gre-0.116
oust-0.115
 Money-0.110
 ante-0.109
rate-0.108
Token and
Token ID290
Feature activation+5.68e-04

Pos logit contributions
 Gre+0.461
oust+0.448
 Money+0.430
engu+0.415
 List+0.413
Neg logit contributions
pieces-0.274
 piping-0.259
 snowball-0.236
"'-0.235
THE-0.210
Token loses
Token ID14754
Feature activation+2.50e-02

Pos logit contributions
pieces+0.001
 piping+0.001
 snowball+0.001
"'+0.001
THE+0.001
Neg logit contributions
 Gre-0.001
oust-0.001
 Money-0.001
 St-0.001
hips-0.001
Token his
Token ID465
Feature activation+2.47e-02

Pos logit contributions
pieces+0.024
 snowball+0.023
 piping+0.021
"'+0.020
 freshly+0.017
Neg logit contributions
 Gre-0.037
oust-0.036
 Money-0.035
ugg-0.034
engu-0.034
Token way
Token ID835
Feature activation-9.88e-02

Pos logit contributions
pieces+0.030
 piping+0.028
 snowball+0.026
"'+0.025
THE+0.023
Neg logit contributions
 Gre-0.032
oust-0.030
 Money-0.030
 St-0.028
hips-0.028
Token.
Token ID13
Feature activation-1.74e-02

Pos logit contributions
 Gre+0.143
oust+0.138
 Money+0.133
 St+0.129
 appearances+0.127
Neg logit contributions
pieces-0.105
 piping-0.094
 snowball-0.087
"'-0.086
THE-0.082
Token careful
Token ID8161
Feature activation-7.49e-02

Pos logit contributions
pieces+0.044
 piping+0.041
"'+0.038
 snowball+0.038
THE+0.035
Neg logit contributions
 Gre-0.033
oust-0.031
 Money-0.030
 St-0.028
hips-0.028
Token and
Token ID290
Feature activation+3.06e-04

Pos logit contributions
 Gre+0.125
oust+0.119
 Sno+0.118
 ante+0.118
 Hel+0.117
Neg logit contributions
pieces-0.063
 piping-0.053
"'-0.051
THE-0.048
 snowball-0.048
Token not
Token ID407
Feature activation-6.80e-02

Pos logit contributions
pieces+0.000
 piping+0.000
"'+0.000
 snowball+0.000
 freshly+0.000
Neg logit contributions
 Gre-0.000
oust-0.000
 Money-0.000
 St-0.000
ub-0.000
Token swim
Token ID9422
Feature activation+4.19e-03

Pos logit contributions
 Gre+0.108
oust+0.108
 Money+0.105
ret+0.105
hips+0.102
Neg logit contributions
pieces-0.050
 snowball-0.048
 piping-0.046
"'-0.045
?",-0.041
Token too
Token ID1165
Feature activation+8.73e-02

Pos logit contributions
pieces+0.005
 snowball+0.004
 piping+0.004
"'+0.004
THE+0.004
Neg logit contributions
oust-0.006
 Gre-0.006
 Money-0.005
 appearances-0.005
 List-0.005
Token far
Token ID1290
Feature activation-2.99e-01

Pos logit contributions
 snowball+0.114
pieces+0.103
 minutes+0.099
"'+0.098
THE+0.098
Neg logit contributions
oust-0.099
 Gre-0.097
 St-0.090
 Money-0.090
 Sno-0.089
Token.
Token ID13
Feature activation-1.47e-02

Pos logit contributions
 Gre+0.467
oust+0.462
 Money+0.437
engu+0.429
 List+0.428
Neg logit contributions
pieces-0.267
 piping-0.240
"'-0.225
 snowball-0.221
THE-0.205
Token They
Token ID1119
Feature activation-1.62e-02

Pos logit contributions
oust+0.020
 Gre+0.020
 Money+0.019
hips+0.019
 lur+0.019
Neg logit contributions
pieces-0.015
 piping-0.015
"'-0.013
 snowball-0.013
THE-0.012
Token said
Token ID531
Feature activation-3.21e-03

Pos logit contributions
 Gre+0.024
oust+0.024
 Money+0.024
 appearances+0.023
ub+0.022
Neg logit contributions
pieces-0.016
 piping-0.014
 snowball-0.013
THE-0.013
"'-0.013
Token they
Token ID484
Feature activation+2.93e-02

Pos logit contributions
 Gre+0.005
oust+0.005
 Money+0.004
hips+0.004
 St+0.004
Neg logit contributions
pieces-0.004
 piping-0.003
"'-0.003
 snowball-0.003
THE-0.003
Token were
Token ID547
Feature activation-3.38e-02

Pos logit contributions
pieces+0.039
 piping+0.038
"'+0.034
 snowball+0.034
THE+0.030
Neg logit contributions
 Gre-0.035
oust-0.033
 Money-0.032
 far-0.031
hips-0.030
Token Sm
Token ID2439
Feature activation-1.87e-01

Pos logit contributions
pieces+0.014
 piping+0.012
"'+0.011
 snowball+0.010
 freshly+0.010
Neg logit contributions
 Gre-0.015
oust-0.015
ark-0.015
ub-0.015
 Money-0.014
Tokenelly
Token ID6148
Feature activation-8.42e-02

Pos logit contributions
 Gre+0.250
oust+0.233
 Money+0.226
 St+0.225
 Spark+0.222
Neg logit contributions
pieces-0.230
 piping-0.205
"'-0.195
 snowball-0.178
THE-0.170
Token King
Token ID2677
Feature activation-9.77e-02

Pos logit contributions
 Gre+0.101
oust+0.096
 Money+0.093
 St+0.090
 Sno+0.087
Neg logit contributions
pieces-0.113
 piping-0.103
"'-0.097
 snowball-0.092
THE-0.086
Token and
Token ID290
Feature activation+7.78e-03

Pos logit contributions
 Gre+0.142
oust+0.135
 Money+0.134
 List+0.129
hips+0.128
Neg logit contributions
pieces-0.106
 piping-0.092
"'-0.086
 snowball-0.082
THE-0.076
Token the
Token ID262
Feature activation+1.17e-02

Pos logit contributions
pieces+0.009
 piping+0.008
"'+0.008
 snowball+0.007
THE+0.007
Neg logit contributions
 Gre-0.010
oust-0.010
 Money-0.010
hips-0.010
ub-0.009
Token St
Token ID520
Feature activation-2.98e-01

Pos logit contributions
pieces+0.014
 piping+0.012
"'+0.011
 snowball+0.011
 freshly+0.010
Neg logit contributions
 Gre-0.016
oust-0.015
ark-0.015
ub-0.015
 Money-0.014
Tokeninky
Token ID29246
Feature activation-1.18e-01

Pos logit contributions
 Gre+0.370
 St+0.335
 Money+0.332
oust+0.328
 Hel+0.325
Neg logit contributions
pieces-0.401
"'-0.351
 piping-0.339
-0.298
THE-0.295
Token Queen
Token ID7542
Feature activation-7.81e-02

Pos logit contributions
 Gre+0.157
oust+0.150
 Money+0.145
 St+0.139
hips+0.137
Neg logit contributions
pieces-0.141
 piping-0.129
"'-0.118
 snowball-0.116
THE-0.107
Token.
Token ID13
Feature activation-8.03e-03

Pos logit contributions
 Gre+0.104
oust+0.099
 Money+0.097
 St+0.092
hips+0.091
Neg logit contributions
pieces-0.094
 piping-0.086
"'-0.078
 snowball-0.076
THE-0.071
Token As
Token ID1081
Feature activation-1.97e-02

Pos logit contributions
oust+0.014
 lur+0.014
 appearances+0.013
ark+0.013
 far+0.013
Neg logit contributions
pieces-0.006
 piping-0.005
"'-0.004
 Either-0.004
 "--0.003
Token he
Token ID339
Feature activation-5.01e-03

Pos logit contributions
hips+0.037
oust+0.036
ocular+0.035
 appearances+0.034
bars+0.034
Neg logit contributions
pieces-0.012
 piping-0.010
THE-0.009
 snowball-0.009
 then-0.008
Token train
Token ID4512
Feature activation+6.16e-02

Pos logit contributions
pieces+0.029
 piping+0.025
 snowball+0.025
"'+0.023
THE+0.021
Neg logit contributions
 Gre-0.062
oust-0.060
 Money-0.059
 Sno-0.058
ub-0.057
Token is
Token ID318
Feature activation-2.67e-02

Pos logit contributions
pieces+0.073
 piping+0.067
 snowball+0.063
"'+0.062
THE+0.057
Neg logit contributions
 Gre-0.082
oust-0.078
 Money-0.076
 St-0.073
 Sno-0.071
Token going
Token ID1016
Feature activation-2.29e-02

Pos logit contributions
 Gre+0.035
oust+0.034
 Money+0.033
ub+0.031
 St+0.031
Neg logit contributions
pieces-0.032
 piping-0.030
 snowball-0.028
"'-0.027
THE-0.025
Token to
Token ID284
Feature activation-1.11e-03

Pos logit contributions
 Gre+0.031
oust+0.030
 Money+0.029
 St+0.028
hips+0.028
Neg logit contributions
pieces-0.027
 piping-0.025
"'-0.022
 snowball-0.022
THE-0.020
Token a
Token ID257
Feature activation-2.54e-02

Pos logit contributions
oust+0.002
 Money+0.002
ark+0.002
 Gre+0.002
ub+0.002
Neg logit contributions
pieces-0.001
 piping-0.001
 snowball-0.001
"'-0.001
THE-0.001
Token far
Token ID1290
Feature activation-2.98e-01

Pos logit contributions
oust+0.036
 Gre+0.036
 Money+0.035
uzz+0.035
hips+0.034
Neg logit contributions
pieces-0.026
 snowball-0.025
 piping-0.024
THE-0.021
"'-0.020
Token place
Token ID1295
Feature activation-4.56e-02

Pos logit contributions
 Gre+0.377
oust+0.360
 Money+0.353
 St+0.331
hips+0.330
Neg logit contributions
pieces-0.368
 piping-0.348
 snowball-0.326
"'-0.316
THE-0.294
Token.
Token ID13
Feature activation-1.68e-02

Pos logit contributions
 Gre+0.062
oust+0.060
 Money+0.057
 St+0.055
 Sno+0.055
Neg logit contributions
pieces-0.052
 piping-0.048
"'-0.044
 snowball-0.043
THE-0.042
Token We
Token ID775
Feature activation+3.27e-02

Pos logit contributions
 Gre+0.024
oust+0.024
 Money+0.022
hips+0.022
ret+0.022
Neg logit contributions
pieces-0.017
 piping-0.016
 snowball-0.014
"'-0.014
THE-0.013
Token have
Token ID423
Feature activation+9.14e-02

Pos logit contributions
pieces+0.033
 piping+0.030
 snowball+0.028
"'+0.028
THE+0.025
Neg logit contributions
 Gre-0.048
oust-0.046
 Money-0.045
 St-0.044
 appearances-0.044
Token to
Token ID284
Feature activation-6.22e-04

Pos logit contributions
pieces+0.099
 piping+0.093
 snowball+0.084
"'+0.083
THE+0.077
Neg logit contributions
 Gre-0.129
oust-0.126
 Money-0.120
hips-0.118
ub-0.117
Token It
Token ID632
Feature activation+3.29e-02

Pos logit contributions
 Gre+0.008
oust+0.008
 Money+0.007
hips+0.007
 lur+0.007
Neg logit contributions
pieces-0.007
 piping-0.006
"'-0.006
 snowball-0.006
THE-0.005
Token is
Token ID318
Feature activation-2.12e-02

Pos logit contributions
pieces+0.038
 piping+0.035
 snowball+0.033
"'+0.032
THE+0.029
Neg logit contributions
 Gre-0.044
oust-0.043
 Money-0.041
 St-0.039
hips-0.039
Token very
Token ID845
Feature activation+5.37e-03

Pos logit contributions
 Gre+0.027
oust+0.025
 Money+0.024
 St+0.023
hips+0.023
Neg logit contributions
pieces-0.027
 piping-0.026
"'-0.023
 snowball-0.023
THE-0.021
Token high
Token ID1029
Feature activation-1.32e-01

Pos logit contributions
pieces+0.006
 piping+0.006
 snowball+0.006
"'+0.006
THE+0.005
Neg logit contributions
 Gre-0.007
oust-0.007
 Money-0.006
 St-0.006
 appearances-0.006
Token and
Token ID290
Feature activation+9.60e-03

Pos logit contributions
 well+0.150
 far+0.150
 Gre+0.146
 Money+0.141
oust+0.137
Neg logit contributions
pieces-0.191
 piping-0.176
"'-0.153
 freshly-0.142
 snowball-0.140
Token far
Token ID1290
Feature activation-2.98e-01

Pos logit contributions
pieces+0.013
 piping+0.012
 snowball+0.011
"'+0.011
THE+0.010
Neg logit contributions
 Gre-0.011
oust-0.011
 Money-0.010
 St-0.010
hips-0.010
Token."
Token ID526
Feature activation+4.39e-02

Pos logit contributions
 Gre+0.452
oust+0.439
 Money+0.422
 St+0.403
 List+0.400
Neg logit contributions
pieces-0.292
 piping-0.272
"'-0.246
 snowball-0.246
THE-0.218
Token
Token ID198
Feature activation-1.11e-02

Pos logit contributions
pieces+0.067
 piping+0.062
 snowball+0.056
"'+0.056
THE+0.053
Neg logit contributions
 Gre-0.048
oust-0.043
 Money-0.043
 St-0.041
 far-0.040
Token
Token ID198
Feature activation-1.15e-02

Pos logit contributions
 Gre+0.014
oust+0.013
 Money+0.013
 St+0.012
hips+0.012
Neg logit contributions
pieces-0.014
 piping-0.013
"'-0.012
 snowball-0.012
THE-0.011
TokenThey
Token ID2990
Feature activation+1.68e-02

Pos logit contributions
 Gre+0.013
oust+0.012
 Money+0.012
 far+0.011
 St+0.011
Neg logit contributions
pieces-0.017
 piping-0.016
 snowball-0.014
"'-0.014
 freshly-0.013
Token start
Token ID923
Feature activation+9.41e-02

Pos logit contributions
pieces+0.015
"'+0.013
 piping+0.013
 snowball+0.012
+0.011
Neg logit contributions
 Gre-0.027
 Money-0.026
 appearances-0.026
oust-0.025
ark-0.025
Token little
Token ID1310
Feature activation-7.92e-03

Pos logit contributions
 Gre+0.007
oust+0.006
 Money+0.006
hips+0.006
 appearances+0.006
Neg logit contributions
pieces-0.006
 piping-0.006
 snowball-0.005
"'-0.005
THE-0.004
Token girl
Token ID2576
Feature activation+5.95e-02

Pos logit contributions
 Gre+0.013
oust+0.012
ret+0.012
hips+0.012
 Money+0.012
Neg logit contributions
 snowball-0.007
pieces-0.007
 piping-0.007
"'-0.005
-0.005
Token named
Token ID3706
Feature activation+3.64e-02

Pos logit contributions
pieces+0.086
 piping+0.082
 snowball+0.073
"'+0.073
THE+0.068
Neg logit contributions
 Gre-0.065
oust-0.061
 St-0.061
 Money-0.059
 far-0.058
Token Amy
Token ID14235
Feature activation+1.84e-01

Pos logit contributions
pieces+0.039
 piping+0.034
"'+0.034
 snowball+0.031
THE+0.028
Neg logit contributions
oust-0.052
 Gre-0.050
 appearances-0.047
 greed-0.047
ret-0.047
Token.
Token ID13
Feature activation+6.92e-03

Pos logit contributions
pieces+0.286
 piping+0.270
 snowball+0.247
"'+0.239
 freshly+0.226
Neg logit contributions
 Gre-0.183
 St-0.169
oust-0.167
 Money-0.165
 far-0.164
Token Amy
Token ID14235
Feature activation+1.84e-01

Pos logit contributions
pieces+0.013
 piping+0.012
"'+0.011
 stret+0.010
 snowball+0.010
Neg logit contributions
 Gre-0.006
 Money-0.005
 Spark-0.005
 St-0.005
 List-0.005
Token and
Token ID290
Feature activation+2.15e-02

Pos logit contributions
pieces+0.246
 piping+0.231
"'+0.210
 snowball+0.210
THE+0.194
Neg logit contributions
 Gre-0.219
oust-0.207
 Money-0.201
 St-0.192
hips-0.189
Token Tim
Token ID5045
Feature activation-6.30e-03

Pos logit contributions
pieces+0.029
 piping+0.028
"'+0.025
 snowball+0.024
 freshly+0.023
Neg logit contributions
 Gre-0.026
 Spark-0.025
 St-0.024
 Money-0.024
 Cas-0.023
Token would
Token ID561
Feature activation+2.74e-02

Pos logit contributions
 Gre+0.007
oust+0.007
 Money+0.007
 Hel+0.006
hips+0.006
Neg logit contributions
pieces-0.009
 piping-0.008
"'-0.008
 snowball-0.007
THE-0.007
Token go
Token ID467
Feature activation-1.67e-02

Pos logit contributions
pieces+0.038
 piping+0.036
"'+0.033
 snowball+0.032
THE+0.030
Neg logit contributions
 Gre-0.031
oust-0.029
 Money-0.029
 St-0.027
hips-0.027
Token on
Token ID319
Feature activation+8.32e-02

Pos logit contributions
 far+0.018
 Gre+0.018
 well+0.017
 Hel+0.017
 Spark+0.017
Neg logit contributions
pieces-0.025
 piping-0.024
"'-0.021
 snowball-0.020
 freshly-0.019
Token,
Token ID11
Feature activation-2.58e-02

Pos logit contributions
 Gre+0.024
oust+0.022
 Money+0.022
 St+0.021
hips+0.020
Neg logit contributions
pieces-0.024
 piping-0.022
"'-0.020
 snowball-0.020
THE-0.019
Token they
Token ID484
Feature activation+4.24e-02

Pos logit contributions
 Gre+0.034
oust+0.032
 Money+0.032
 St+0.031
hips+0.030
Neg logit contributions
pieces-0.031
 piping-0.029
"'-0.026
 snowball-0.026
THE-0.024
Token saw
Token ID2497
Feature activation+6.39e-02

Pos logit contributions
pieces+0.044
 piping+0.039
"'+0.036
 snowball+0.034
THE+0.032
Neg logit contributions
 Gre-0.064
oust-0.061
 Money-0.061
 appearances-0.059
 List-0.057
Token the
Token ID262
Feature activation+1.57e-02

Pos logit contributions
pieces+0.066
 piping+0.059
 snowball+0.052
 freshly+0.048
"'+0.047
Neg logit contributions
oust-0.096
 Gre-0.095
 Money-0.092
ub-0.091
ark-0.090
Token burnt
Token ID22999
Feature activation+5.83e-02

Pos logit contributions
pieces+0.020
 piping+0.018
"'+0.017
THE+0.016
 snowball+0.016
Neg logit contributions
 Gre-0.019
oust-0.018
 well-0.018
 far-0.018
 Money-0.018
Token cookies
Token ID14746
Feature activation+1.67e-01

Pos logit contributions
pieces+0.072
 piping+0.064
"'+0.060
THE+0.057
 snowball+0.054
Neg logit contributions
 Gre-0.073
oust-0.072
 well-0.070
 Money-0.068
 far-0.067
Token and
Token ID290
Feature activation+1.35e-02

Pos logit contributions
pieces+0.183
 piping+0.168
 snowball+0.151
"'+0.150
THE+0.138
Neg logit contributions
 Gre-0.241
oust-0.229
 Money-0.225
 St-0.218
 Hel-0.213
Token the
Token ID262
Feature activation+1.57e-02

Pos logit contributions
pieces+0.016
 piping+0.015
 snowball+0.013
"'+0.013
 freshly+0.012
Neg logit contributions
 Gre-0.018
oust-0.018
 Money-0.017
hips-0.017
ub-0.016
Token smoke
Token ID7523
Feature activation-2.02e-02

Pos logit contributions
pieces+0.017
 piping+0.016
"'+0.015
 snowball+0.014
THE+0.014
Neg logit contributions
 Gre-0.022
oust-0.021
 Money-0.020
 well-0.020
 far-0.020
Token.
Token ID13
Feature activation-3.06e-03

Pos logit contributions
 Gre+0.029
oust+0.027
 Money+0.027
 List+0.026
 St+0.026
Neg logit contributions
pieces-0.022
 piping-0.021
 snowball-0.019
"'-0.019
 freshly-0.017
Token They
Token ID1119
Feature activation-3.54e-03

Pos logit contributions
 Gre+0.004
oust+0.004
 Money+0.004
 St+0.003
hips+0.003
Neg logit contributions
pieces-0.004
 piping-0.004
"'-0.003
 snowball-0.003
THE-0.003
Token sat
Token ID3332
Feature activation-3.61e-02

Pos logit contributions
 Gre+0.102
oust+0.097
 Money+0.094
 St+0.090
hips+0.088
Neg logit contributions
pieces-0.106
 piping-0.099
"'-0.090
 snowball-0.089
THE-0.083
Token down
Token ID866
Feature activation+6.63e-02

Pos logit contributions
 Gre+0.046
 far+0.045
 well+0.044
 Money+0.044
oust+0.043
Neg logit contributions
pieces-0.047
 piping-0.043
"'-0.038
 snowball-0.036
THE-0.032
Token and
Token ID290
Feature activation+1.51e-02

Pos logit contributions
pieces+0.065
 piping+0.062
 snowball+0.056
"'+0.055
THE+0.051
Neg logit contributions
 Gre-0.099
oust-0.096
 Money-0.093
 appearances-0.091
hips-0.089
Token ate
Token ID15063
Feature activation+5.49e-02

Pos logit contributions
pieces+0.013
 piping+0.013
 snowball+0.011
 freshly+0.010
"'+0.010
Neg logit contributions
hips-0.025
 Gre-0.025
 appearances-0.025
oust-0.024
 Money-0.024
Token the
Token ID262
Feature activation+1.38e-02

Pos logit contributions
pieces+0.055
 piping+0.054
 snowball+0.050
"'+0.044
 freshly+0.042
Neg logit contributions
 Gre-0.083
oust-0.082
 Money-0.078
 appearances-0.076
hips-0.076
Token cookies
Token ID14746
Feature activation+1.68e-01

Pos logit contributions
pieces+0.015
"'+0.015
THE+0.014
 piping+0.013
?",+0.012
Neg logit contributions
 well-0.018
 far-0.017
 Gre-0.016
 truth-0.016
 St-0.016
Token while
Token ID981
Feature activation-5.67e-03

Pos logit contributions
pieces+0.179
 piping+0.163
 snowball+0.149
"'+0.144
THE+0.137
Neg logit contributions
 Gre-0.251
oust-0.235
 Money-0.233
 appearances-0.225
 Hel-0.221
Token watching
Token ID4964
Feature activation-3.58e-02

Pos logit contributions
 Gre+0.008
oust+0.008
 Money+0.008
hips+0.007
ub+0.007
Neg logit contributions
pieces-0.006
 piping-0.006
 snowball-0.005
"'-0.005
THE-0.005
Token their
Token ID511
Feature activation+6.71e-02

Pos logit contributions
 Gre+0.051
oust+0.049
 Money+0.048
hips+0.046
ub+0.045
Neg logit contributions
pieces-0.040
 piping-0.037
 snowball-0.033
"'-0.033
THE-0.029
Token favorite
Token ID4004
Feature activation+1.41e-01

Pos logit contributions
pieces+0.081
 piping+0.077
"'+0.070
 snowball+0.066
THE+0.063
Neg logit contributions
 Gre-0.088
oust-0.082
 Money-0.081
 well-0.079
 St-0.079
Token show
Token ID905
Feature activation+6.86e-02

Pos logit contributions
pieces+0.173
"'+0.166
 piping+0.165
THE+0.155
 gonna+0.150
Neg logit contributions
 well-0.167
 Gre-0.157
 St-0.152
 greed-0.149
 truth-0.147
Token,
Token ID11
Feature activation-1.80e-02

Pos logit contributions
pieces+0.041
 piping+0.038
 snowball+0.035
"'+0.034
THE+0.032
Neg logit contributions
 Gre-0.033
oust-0.030
 Money-0.030
 St-0.029
 well-0.028
Token "
Token ID366
Feature activation+7.21e-03

Pos logit contributions
 Gre+0.023
oust+0.022
 Money+0.021
 St+0.020
hips+0.020
Neg logit contributions
pieces-0.023
 piping-0.021
 snowball-0.019
"'-0.019
THE-0.018
TokenWhat
Token ID2061
Feature activation+1.48e-01

Pos logit contributions
pieces+0.009
 piping+0.009
"'+0.008
 snowball+0.008
THE+0.007
Neg logit contributions
 Gre-0.009
oust-0.008
 Money-0.008
 St-0.008
hips-0.008
Token is
Token ID318
Feature activation-8.90e-03

Pos logit contributions
pieces+0.164
 piping+0.152
 snowball+0.138
"'+0.135
?",+0.120
Neg logit contributions
 Gre-0.206
oust-0.203
 Money-0.193
ub-0.191
hips-0.189
Token this
Token ID428
Feature activation+1.09e-03

Pos logit contributions
 Gre+0.010
 Money+0.009
 far+0.009
oust+0.009
 well+0.009
Neg logit contributions
pieces-0.013
 piping-0.012
"'-0.011
 snowball-0.011
THE-0.010
Token?"
Token ID1701
Feature activation+1.92e-01

Pos logit contributions
pieces+0.001
 piping+0.001
 snowball+0.001
"'+0.001
THE+0.001
Neg logit contributions
 Gre-0.002
oust-0.002
 Money-0.002
 well-0.002
 far-0.002
Token
Token ID198
Feature activation+1.89e-03

Pos logit contributions
pieces+0.333
 piping+0.311
 snowball+0.291
"'+0.290
THE+0.277
Neg logit contributions
 Gre-0.165
 St-0.146
oust-0.142
 Money-0.142
 Spark-0.134
Token
Token ID198
Feature activation+1.24e-03

Pos logit contributions
pieces+0.002
 piping+0.002
"'+0.002
 snowball+0.002
THE+0.002
Neg logit contributions
 Gre-0.003
oust-0.003
 Money-0.003
 appearances-0.002
hips-0.002
TokenBen
Token ID11696
Feature activation-1.17e-01

Pos logit contributions
pieces+0.001
 piping+0.001
"'+0.001
 snowball+0.001
THE+0.001
Neg logit contributions
 Gre-0.002
oust-0.002
 Money-0.002
hips-0.002
 lur-0.002
Token says
Token ID1139
Feature activation+2.58e-02

Pos logit contributions
 Gre+0.166
oust+0.160
 Money+0.154
hips+0.149
 St+0.148
Neg logit contributions
pieces-0.127
 piping-0.119
"'-0.109
 snowball-0.108
THE-0.095
Token,
Token ID11
Feature activation-2.31e-02

Pos logit contributions
pieces+0.032
 piping+0.030
"'+0.027
 snowball+0.027
THE+0.024
Neg logit contributions
 Gre-0.033
oust-0.032
 Money-0.031
 St-0.029
hips-0.029
Token warm
Token ID5814
Feature activation-2.34e-02

Pos logit contributions
pieces+0.056
 piping+0.048
"'+0.044
THE+0.044
should+0.042
Neg logit contributions
 well-0.043
 far-0.042
 Gre-0.040
 St-0.040
 Spark-0.039
Token coat
Token ID13209
Feature activation+8.82e-02

Pos logit contributions
 Gre+0.028
oust+0.028
 St+0.026
 well+0.026
 Money+0.026
Neg logit contributions
pieces-0.032
 piping-0.027
"'-0.024
THE-0.023
 snowball-0.022
Token and
Token ID290
Feature activation+1.33e-02

Pos logit contributions
pieces+0.083
 piping+0.074
 snowball+0.068
"'+0.068
THE+0.062
Neg logit contributions
 Gre-0.138
oust-0.129
 Money-0.127
 appearances-0.125
 ante-0.122
Token his
Token ID465
Feature activation+3.76e-02

Pos logit contributions
pieces+0.018
 piping+0.016
"'+0.015
 snowball+0.014
THE+0.014
Neg logit contributions
 Gre-0.016
oust-0.015
 Money-0.015
 St-0.015
 far-0.015
Token m
Token ID285
Feature activation+5.60e-02

Pos logit contributions
pieces+0.046
 piping+0.041
"'+0.038
THE+0.035
 snowball+0.034
Neg logit contributions
 Gre-0.049
oust-0.047
 well-0.046
 St-0.046
 far-0.046
Tokenittens
Token ID34978
Feature activation+2.07e-01

Pos logit contributions
pieces+0.024
 piping+0.021
 snowball+0.016
"'+0.015
THE+0.010
Neg logit contributions
 Gre-0.117
 Money-0.111
oust-0.109
 St-0.108
hips-0.107
Token,
Token ID11
Feature activation-2.57e-02

Pos logit contributions
pieces+0.213
 piping+0.194
"'+0.180
 snowball+0.175
THE+0.159
Neg logit contributions
 Gre-0.305
oust-0.290
 Money-0.281
 appearances-0.274
 List-0.271
Token for
Token ID329
Feature activation+1.93e-02

Pos logit contributions
 Gre+0.038
oust+0.036
 Money+0.035
hips+0.034
ret+0.034
Neg logit contributions
pieces-0.027
 piping-0.026
 snowball-0.023
"'-0.023
 freshly-0.021
Token the
Token ID262
Feature activation+1.47e-02

Pos logit contributions
pieces+0.019
 piping+0.018
 snowball+0.017
"'+0.016
 freshly+0.015
Neg logit contributions
 Gre-0.029
oust-0.028
 Money-0.028
ark-0.028
ub-0.027
Token weather
Token ID6193
Feature activation-5.58e-02

Pos logit contributions
pieces+0.016
 piping+0.015
 snowball+0.015
"'+0.013
 freshly+0.013
Neg logit contributions
 Gre-0.020
oust-0.020
ret-0.019
 Money-0.019
hips-0.019
Token outside
Token ID2354
Feature activation+2.36e-02

Pos logit contributions
 Gre+0.099
ret+0.094
 appearances+0.093
 List+0.093
 Money+0.091
Neg logit contributions
pieces-0.038
 snowball-0.035
 piping-0.034
 cocoa-0.030
"'-0.030
Token leave
Token ID2666
Feature activation-1.03e-02

Pos logit contributions
 snowball+0.006
pieces+0.005
 piping+0.005
?",+0.005
"'+0.005
Neg logit contributions
 Money-0.013
hips-0.013
ark-0.012
ret-0.012
engu-0.012
Token.
Token ID13
Feature activation-8.51e-03

Pos logit contributions
engu+0.019
icated+0.018
 ante+0.018
ub+0.018
ark+0.018
Neg logit contributions
pieces-0.007
 freshly-0.005
"'-0.005
 piping-0.005
THE-0.004
Token
Token ID198
Feature activation-1.22e-02

Pos logit contributions
 appearances+0.016
 ante+0.015
hips+0.015
 greed+0.015
 mer+0.015
Neg logit contributions
pieces-0.005
 piping-0.005
"'-0.005
-0.004
THE-0.004
Token
Token ID198
Feature activation-1.27e-02

Pos logit contributions
 Gre+0.017
 appearances+0.017
oust+0.017
 Money+0.017
hips+0.017
Neg logit contributions
pieces-0.012
 piping-0.011
"'-0.011
THE-0.010
 snowball-0.010
TokenAs
Token ID1722
Feature activation-2.22e-02

Pos logit contributions
 Gre+0.019
oust+0.019
 appearances+0.018
 Money+0.018
 lur+0.018
Neg logit contributions
pieces-0.013
"'-0.011
 piping-0.011
THE-0.011
 snowball-0.009
Token she
Token ID673
Feature activation+2.70e-02

Pos logit contributions
 appearances+0.040
hips+0.040
inch+0.039
oust+0.038
engu+0.037
Neg logit contributions
pieces-0.013
THE-0.012
 piping-0.012
 then-0.010
"'-0.010
Token walked
Token ID6807
Feature activation-1.40e-02

Pos logit contributions
pieces+0.030
 piping+0.029
 snowball+0.027
"'+0.026
THE+0.024
Neg logit contributions
 Gre-0.037
oust-0.035
 Money-0.035
hips-0.034
 appearances-0.033
Token away
Token ID1497
Feature activation+3.11e-02

Pos logit contributions
 Gre+0.019
oust+0.018
 Money+0.018
 St+0.017
hips+0.017
Neg logit contributions
pieces-0.016
 piping-0.015
"'-0.014
 snowball-0.014
THE-0.012
Token,
Token ID11
Feature activation-3.17e-02

Pos logit contributions
pieces+0.024
"'+0.022
 piping+0.021
THE+0.017
 freshly+0.017
Neg logit contributions
 Gre-0.050
oust-0.049
 Money-0.049
 appearances-0.047
inch-0.047
Token she
Token ID673
Feature activation+2.67e-02

Pos logit contributions
engu+0.053
inch+0.053
ark+0.053
 appearances+0.052
hips+0.052
Neg logit contributions
pieces-0.023
 finally-0.018
 piping-0.017
"'-0.016
THE-0.016
Token heard
Token ID2982
Feature activation+1.35e-02

Pos logit contributions
pieces+0.020
"'+0.017
 piping+0.016
THE+0.015
 snowball+0.014
Neg logit contributions
hips-0.046
 Gre-0.045
 Money-0.045
 appearances-0.044
oust-0.044
Token them
Token ID606
Feature activation+2.86e-02

Pos logit contributions
oust+0.134
 Gre+0.130
 Money+0.126
ub+0.122
 List+0.120
Neg logit contributions
pieces-0.078
 snowball-0.067
 piping-0.064
"'-0.060
 cocoa-0.052
Token and
Token ID290
Feature activation+1.04e-03

Pos logit contributions
pieces+0.028
 piping+0.024
 snowball+0.023
"'+0.023
THE+0.020
Neg logit contributions
oust-0.043
 Gre-0.043
 Money-0.040
 ante-0.039
ret-0.039
Token told
Token ID1297
Feature activation+1.67e-02

Pos logit contributions
pieces+0.001
 piping+0.001
"'+0.001
 snowball+0.001
THE+0.001
Neg logit contributions
 Gre-0.001
oust-0.001
 Money-0.001
hips-0.001
ub-0.001
Token them
Token ID606
Feature activation+2.83e-02

Pos logit contributions
pieces+0.013
 piping+0.009
 snowball+0.009
"'+0.008
 cocoa+0.007
Neg logit contributions
oust-0.030
 Money-0.029
 Gre-0.029
ub-0.028
ark-0.028
Token what
Token ID644
Feature activation+2.24e-03

Pos logit contributions
pieces+0.030
 piping+0.025
 snowball+0.023
"'+0.023
THE+0.021
Neg logit contributions
 Gre-0.041
oust-0.041
 Money-0.038
ub-0.038
 ante-0.037
Token happened
Token ID3022
Feature activation+4.72e-02

Pos logit contributions
pieces+0.002
 piping+0.002
"'+0.002
 snowball+0.002
THE+0.002
Neg logit contributions
 Gre-0.003
oust-0.003
 Money-0.003
hips-0.003
 St-0.003
Token.
Token ID13
Feature activation-1.46e-02

Pos logit contributions
pieces+0.064
 piping+0.061
 snowball+0.056
"'+0.054
THE+0.050
Neg logit contributions
 Gre-0.056
oust-0.052
 Money-0.052
 St-0.049
hips-0.048
Token
Token ID198
Feature activation-2.09e-02

Pos logit contributions
 Gre+0.019
oust+0.019
 Money+0.018
hips+0.017
 appearances+0.017
Neg logit contributions
pieces-0.017
 piping-0.016
"'-0.015
 snowball-0.015
THE-0.013
Token
Token ID198
Feature activation-2.09e-02

Pos logit contributions
 Gre+0.026
oust+0.025
 Money+0.025
 St+0.023
hips+0.023
Neg logit contributions
pieces-0.026
 piping-0.025
"'-0.022
 snowball-0.022
THE-0.021
Token"
Token ID1
Feature activation+3.51e-02

Pos logit contributions
 Gre+0.031
oust+0.030
 Money+0.029
 St+0.028
hips+0.028
Neg logit contributions
pieces-0.022
 piping-0.020
"'-0.018
 snowball-0.018
THE-0.016
TokenHelp
Token ID22087
Feature activation-5.17e-03

Pos logit contributions
pieces+0.039
 piping+0.037
"'+0.034
 snowball+0.033
THE+0.031
Neg logit contributions
 Gre-0.049
oust-0.048
 Money-0.045
hips-0.044
 St-0.044
Token yogurt
Token ID32132
Feature activation+5.22e-02

Pos logit contributions
pieces+0.065
 piping+0.058
 snowball+0.055
"'+0.050
THE+0.049
Neg logit contributions
 Gre-0.085
oust-0.081
 Money-0.081
hips-0.077
 St-0.077
Token was
Token ID373
Feature activation+1.81e-02

Pos logit contributions
pieces+0.054
 piping+0.051
 snowball+0.046
"'+0.045
THE+0.040
Neg logit contributions
 Gre-0.079
 Money-0.074
oust-0.072
hips-0.070
 Hel-0.070
Token so
Token ID523
Feature activation+8.61e-02

Pos logit contributions
pieces+0.020
 piping+0.019
"'+0.017
 snowball+0.017
THE+0.015
Neg logit contributions
 Gre-0.025
oust-0.024
 Money-0.024
 St-0.023
hips-0.023
Token y
Token ID331
Feature activation+2.89e-02

Pos logit contributions
 piping+0.083
pieces+0.081
 snowball+0.070
"'+0.069
 freshly+0.065
Neg logit contributions
 Gre-0.133
oust-0.128
 Money-0.128
 appearances-0.126
hips-0.122
Tokenummy
Token ID13513
Feature activation-2.66e-02

Pos logit contributions
pieces+0.024
 piping+0.023
 snowball+0.021
 freshly+0.020
"'+0.019
Neg logit contributions
 Gre-0.047
 Money-0.044
 appearances-0.044
 St-0.043
 Hel-0.042
Token and
Token ID290
Feature activation+1.59e-02

Pos logit contributions
 Gre+0.038
oust+0.036
 Money+0.036
hips+0.034
 St+0.034
Neg logit contributions
pieces-0.029
 piping-0.027
 snowball-0.024
"'-0.024
THE-0.022
Token Sarah
Token ID10490
Feature activation+7.18e-02

Pos logit contributions
pieces+0.017
 piping+0.016
 snowball+0.015
"'+0.014
 freshly+0.013
Neg logit contributions
 Gre-0.023
oust-0.022
 Money-0.022
hips-0.021
 appearances-0.021
Token was
Token ID373
Feature activation+1.84e-02

Pos logit contributions
pieces+0.082
 piping+0.077
"'+0.070
 snowball+0.068
THE+0.063
Neg logit contributions
 Gre-0.098
oust-0.094
 Money-0.093
hips-0.090
 appearances-0.088
Token thrilled
Token ID20536
Feature activation+1.11e-01

Pos logit contributions
pieces+0.020
 piping+0.018
"'+0.017
 snowball+0.017
THE+0.015
Neg logit contributions
 Gre-0.026
oust-0.025
 Money-0.024
 St-0.024
 far-0.023
Token with
Token ID351
Feature activation-1.13e-01

Pos logit contributions
pieces+0.121
 piping+0.109
"'+0.101
 snowball+0.097
THE+0.094
Neg logit contributions
 Gre-0.156
oust-0.148
 Money-0.145
hips-0.141
 appearances-0.140
Token it
Token ID340
Feature activation-4.55e-02

Pos logit contributions
 Gre+0.153
oust+0.151
 Money+0.147
ub+0.145
ark+0.142
Neg logit contributions
pieces-0.134
 piping-0.125
 snowball-0.108
"'-0.104
 freshly-0.102
Token friendly
Token ID8030
Feature activation-2.56e-02

Pos logit contributions
pieces+0.011
 piping+0.010
 snowball+0.009
"'+0.009
THE+0.008
Neg logit contributions
 Gre-0.009
oust-0.008
 Money-0.008
hips-0.008
 St-0.008
Token taxi
Token ID17536
Feature activation+6.37e-03

Pos logit contributions
 Gre+0.041
ub+0.041
 St+0.040
ret+0.040
inch+0.040
Neg logit contributions
pieces-0.019
 piping-0.019
"'-0.018
 snowball-0.017
 stret-0.014
Token driver
Token ID4639
Feature activation-3.59e-02

Pos logit contributions
pieces+0.007
 piping+0.006
"'+0.006
 snowball+0.006
THE+0.005
Neg logit contributions
 Gre-0.009
oust-0.008
 Money-0.008
 appearances-0.008
 St-0.008
Token.
Token ID13
Feature activation-1.13e-02

Pos logit contributions
 Gre+0.048
oust+0.045
 Money+0.044
 St+0.042
hips+0.042
Neg logit contributions
pieces-0.043
 piping-0.040
"'-0.036
 snowball-0.036
THE-0.033
Token After
Token ID2293
Feature activation+2.56e-02

Pos logit contributions
oust+0.015
 Gre+0.015
 Money+0.014
 lur+0.014
 ante+0.014
Neg logit contributions
pieces-0.013
 piping-0.012
"'-0.011
 snowball-0.011
THE-0.010
Token they
Token ID484
Feature activation+3.74e-02

Pos logit contributions
pieces+0.032
 piping+0.030
"'+0.027
 snowball+0.027
THE+0.025
Neg logit contributions
 Gre-0.033
oust-0.031
 Money-0.030
 St-0.029
hips-0.029
Token were
Token ID547
Feature activation-2.84e-02

Pos logit contributions
pieces+0.035
 piping+0.033
"'+0.028
THE+0.028
 snowball+0.026
Neg logit contributions
 Gre-0.059
 Money-0.056
oust-0.056
 appearances-0.056
 St-0.054
Token finished
Token ID5201
Feature activation+6.79e-02

Pos logit contributions
 Gre+0.052
 Money+0.050
oust+0.050
 appearances+0.048
hips+0.047
Neg logit contributions
pieces-0.021
 piping-0.019
"'-0.016
 snowball-0.014
THE-0.013
Token reading
Token ID3555
Feature activation-2.48e-02

Pos logit contributions
pieces+0.082
 piping+0.077
"'+0.069
 snowball+0.068
THE+0.063
Neg logit contributions
 Gre-0.090
oust-0.086
 Money-0.083
hips-0.079
 St-0.079
Token,
Token ID11
Feature activation-3.33e-02

Pos logit contributions
 Gre+0.038
oust+0.037
 Money+0.036
ub+0.035
 ante+0.035
Neg logit contributions
pieces-0.025
 piping-0.023
 snowball-0.020
"'-0.019
THE-0.018
Token they
Token ID484
Feature activation+3.70e-02

Pos logit contributions
oust+0.053
 Gre+0.051
 Money+0.048
 ante+0.048
hips+0.047
Neg logit contributions
pieces-0.033
 piping-0.028
"'-0.025
 snowball-0.024
THE-0.023
Token.
Token ID13
Feature activation-4.56e-03

Pos logit contributions
pieces+0.051
 piping+0.047
"'+0.043
 snowball+0.043
THE+0.039
Neg logit contributions
 Gre-0.051
oust-0.049
 Money-0.047
 St-0.045
hips-0.045
Token Amy
Token ID14235
Feature activation+1.75e-01

Pos logit contributions
 Gre+0.006
 Money+0.005
 St+0.005
 List+0.005
 Spark+0.005
Neg logit contributions
pieces-0.006
 piping-0.005
 snowball-0.005
"'-0.005
THE-0.005
Token watched
Token ID7342
Feature activation-4.46e-02

Pos logit contributions
pieces+0.182
 piping+0.170
"'+0.157
 snowball+0.155
THE+0.137
Neg logit contributions
 Gre-0.256
 Money-0.247
oust-0.245
hips-0.240
 appearances-0.235
Token them
Token ID606
Feature activation+4.10e-02

Pos logit contributions
 Gre+0.074
 Money+0.073
oust+0.072
 jun+0.069
 List+0.068
Neg logit contributions
pieces-0.041
 piping-0.035
 snowball-0.031
"'-0.028
 freshly-0.025
Token with
Token ID351
Feature activation-1.14e-01

Pos logit contributions
pieces+0.047
 piping+0.040
 snowball+0.039
"'+0.036
THE+0.034
Neg logit contributions
 Gre-0.059
oust-0.057
 Money-0.056
 appearances-0.052
 St-0.052
Token amaz
Token ID40642
Feature activation+3.05e-02

Pos logit contributions
 Gre+0.170
ark+0.170
ub+0.170
 Money+0.168
oust+0.167
Neg logit contributions
 piping-0.114
pieces-0.112
 snowball-0.100
 freshly-0.093
"'-0.084
Tokenement
Token ID972
Feature activation+1.17e-01

Pos logit contributions
pieces+0.035
 piping+0.031
"'+0.030
 snowball+0.030
THE+0.027
Neg logit contributions
oust-0.040
 Gre-0.039
hips-0.037
 well-0.037
 far-0.036
Token.
Token ID13
Feature activation-5.88e-03

Pos logit contributions
pieces+0.128
 piping+0.117
"'+0.104
 snowball+0.104
THE+0.095
Neg logit contributions
 Gre-0.170
oust-0.159
 Money-0.158
 St-0.150
hips-0.148
Token She
Token ID1375
Feature activation-6.47e-03

Pos logit contributions
 Gre+0.008
oust+0.008
 Money+0.007
hips+0.007
 appearances+0.007
Neg logit contributions
pieces-0.007
 piping-0.006
"'-0.006
 snowball-0.006
THE-0.005
Token smiled
Token ID13541
Feature activation-5.16e-02

Pos logit contributions
 Gre+0.009
 Money+0.009
oust+0.009
hips+0.008
 appearances+0.008
Neg logit contributions
pieces-0.007
 piping-0.007
 snowball-0.006
"'-0.006
THE-0.006
Token,
Token ID11
Feature activation-2.68e-02

Pos logit contributions
 Gre+0.075
oust+0.073
 Money+0.071
hips+0.068
 appearances+0.067
Neg logit contributions
pieces-0.054
 piping-0.050
"'-0.045
 snowball-0.045
THE-0.041
Token miss
Token ID2051
Feature activation-3.16e-02

Pos logit contributions
 piping+0.021
pieces+0.020
"'+0.019
 snowball+0.017
?",+0.017
Neg logit contributions
 Gre-0.035
 Money-0.034
oust-0.034
 List-0.033
ret-0.032
Token you
Token ID345
Feature activation-6.51e-02

Pos logit contributions
 Gre+0.039
oust+0.037
 Money+0.036
hips+0.034
 ante+0.034
Neg logit contributions
pieces-0.041
 piping-0.038
 snowball-0.035
"'-0.034
THE-0.032
Token too
Token ID1165
Feature activation+9.39e-02

Pos logit contributions
 Gre+0.080
oust+0.076
 Money+0.075
 St+0.073
 well+0.072
Neg logit contributions
pieces-0.084
 piping-0.079
"'-0.071
 snowball-0.071
THE-0.064
Token,
Token ID11
Feature activation-3.08e-02

Pos logit contributions
 piping+0.070
pieces+0.067
"'+0.062
 snowball+0.062
+0.056
Neg logit contributions
 Gre-0.157
oust-0.151
 ante-0.145
 Money-0.145
 lur-0.142
Token Smoke
Token ID25416
Feature activation-1.04e-01

Pos logit contributions
 Gre+0.036
oust+0.034
 Money+0.033
hips+0.032
 St+0.030
Neg logit contributions
pieces-0.042
 piping-0.039
 snowball-0.036
"'-0.036
THE-0.033
Tokeny
Token ID88
Feature activation-7.53e-02

Pos logit contributions
 Gre+0.105
oust+0.099
 Money+0.096
 St+0.091
hips+0.090
Neg logit contributions
pieces-0.159
 piping-0.149
"'-0.137
 snowball-0.135
THE-0.127
Token.
Token ID13
Feature activation-7.45e-03

Pos logit contributions
 Gre+0.099
oust+0.094
 Money+0.091
 St+0.087
hips+0.086
Neg logit contributions
pieces-0.091
 piping-0.085
"'-0.077
 snowball-0.077
THE-0.070
Token Bye
Token ID47843
Feature activation-1.34e-01

Pos logit contributions
oust+0.010
 Gre+0.010
 far+0.010
ret+0.010
hips+0.010
Neg logit contributions
 piping-0.007
pieces-0.007
 snowball-0.007
"'-0.006
-0.006
Token for
Token ID329
Feature activation+1.93e-02

Pos logit contributions
 Gre+0.199
 Money+0.187
oust+0.184
hips+0.179
 lur+0.178
Neg logit contributions
pieces-0.133
 piping-0.127
 snowball-0.117
"'-0.116
-0.107
Token now
Token ID783
Feature activation+1.09e-01

Pos logit contributions
pieces+0.021
 piping+0.019
 snowball+0.018
"'+0.018
THE+0.017
Neg logit contributions
 Gre-0.026
oust-0.026
 Money-0.026
ret-0.025
ub-0.025
Token,
Token ID11
Feature activation-3.15e-02

Pos logit contributions
pieces+0.147
 piping+0.138
 snowball+0.121
"'+0.119
 freshly+0.113
Neg logit contributions
 Gre-0.130
oust-0.121
 St-0.121
 Money-0.121
 well-0.121
Token
Token ID198
Feature activation-1.83e-03

Pos logit contributions
 Gre+0.001
oust+0.001
 Money+0.001
hips+0.001
 appearances+0.001
Neg logit contributions
pieces-0.001
 piping-0.001
"'-0.001
 snowball-0.001
THE-0.000
TokenHer
Token ID9360
Feature activation+1.35e-01

Pos logit contributions
 Gre+0.003
oust+0.002
 Money+0.002
 St+0.002
hips+0.002
Neg logit contributions
pieces-0.002
 piping-0.002
 snowball-0.002
"'-0.002
THE-0.002
Token mom
Token ID1995
Feature activation+3.91e-02

Pos logit contributions
pieces+0.251
 piping+0.216
"'+0.216
THE+0.207
cause+0.207
Neg logit contributions
 Gre-0.103
 St-0.096
oust-0.092
 well-0.090
 truth-0.089
Token said
Token ID531
Feature activation+1.67e-02

Pos logit contributions
pieces+0.036
 piping+0.032
"'+0.029
 snowball+0.028
?",+0.025
Neg logit contributions
 Gre-0.062
 Money-0.060
oust-0.059
hips-0.059
 appearances-0.059
Token yes
Token ID3763
Feature activation+8.91e-02

Pos logit contributions
pieces+0.018
 piping+0.017
"'+0.015
 snowball+0.015
THE+0.013
Neg logit contributions
 Gre-0.024
oust-0.023
 Money-0.022
hips-0.022
 St-0.021
Token,
Token ID11
Feature activation-2.11e-02

Pos logit contributions
pieces+0.093
 piping+0.084
"'+0.078
 snowball+0.075
THE+0.069
Neg logit contributions
 Gre-0.129
oust-0.123
 Money-0.122
 appearances-0.117
hips-0.117
Token so
Token ID523
Feature activation+8.81e-02

Pos logit contributions
 Gre+0.035
oust+0.034
hips+0.033
 Money+0.033
 appearances+0.032
Neg logit contributions
pieces-0.018
 piping-0.016
 snowball-0.014
"'-0.014
 freshly-0.012
Token the
Token ID262
Feature activation+1.50e-02

Pos logit contributions
pieces+0.114
 piping+0.108
 snowball+0.099
"'+0.097
THE+0.090
Neg logit contributions
 Gre-0.111
oust-0.100
 St-0.100
 Money-0.099
 Spark-0.097
Token girl
Token ID2576
Feature activation+5.68e-02

Pos logit contributions
pieces+0.020
 piping+0.018
"'+0.017
 snowball+0.016
THE+0.016
Neg logit contributions
 Gre-0.018
 Money-0.017
oust-0.017
 St-0.017
 well-0.016
Token went
Token ID1816
Feature activation-5.05e-02

Pos logit contributions
pieces+0.069
 piping+0.063
"'+0.059
 snowball+0.058
THE+0.052
Neg logit contributions
 Gre-0.074
oust-0.070
 Money-0.069
hips-0.067
 appearances-0.067
Token to
Token ID284
Feature activation+1.38e-02

Pos logit contributions
 Gre+0.066
oust+0.062
 Money+0.061
 St+0.058
hips+0.057
Neg logit contributions
pieces-0.062
 piping-0.058
"'-0.053
 snowball-0.053
THE-0.048
Token we
Token ID356
Feature activation+5.31e-02

Pos logit contributions
pieces+0.179
 piping+0.171
 snowball+0.155
"'+0.148
 freshly+0.141
Neg logit contributions
 Gre-0.157
 Money-0.141
 St-0.139
 Spark-0.138
oust-0.134
Token see
Token ID766
Feature activation+9.88e-02

Pos logit contributions
pieces+0.078
 piping+0.074
 snowball+0.068
"'+0.066
THE+0.063
Neg logit contributions
 Gre-0.056
 far-0.053
oust-0.052
 well-0.051
hips-0.050
Token your
Token ID534
Feature activation-1.50e-03

Pos logit contributions
pieces+0.136
 piping+0.131
"'+0.125
THE+0.116
 snowball+0.114
Neg logit contributions
 Gre-0.108
 far-0.105
 St-0.100
 Spark-0.099
 well-0.098
Token machine
Token ID4572
Feature activation-3.35e-02

Pos logit contributions
 Gre+0.002
oust+0.002
 Money+0.002
 St+0.002
hips+0.002
Neg logit contributions
pieces-0.002
 piping-0.002
 snowball-0.002
"'-0.001
THE-0.001
Token?"
Token ID1701
Feature activation+1.82e-01

Pos logit contributions
 well+0.047
 Gre+0.047
 far+0.047
oust+0.044
hips+0.044
Neg logit contributions
pieces-0.035
 piping-0.034
 snowball-0.029
"'-0.029
THE-0.026
Token Tim
Token ID5045
Feature activation-1.77e-02

Pos logit contributions
pieces+0.290
 piping+0.273
THE+0.247
"'+0.244
 snowball+0.241
Neg logit contributions
 Gre-0.190
 St-0.166
 Money-0.165
 Spark-0.160
 Hel-0.159
Token asks
Token ID7893
Feature activation+7.94e-04

Pos logit contributions
 Gre+0.019
 Money+0.017
 St+0.017
oust+0.017
 Hel+0.016
Neg logit contributions
pieces-0.028
 piping-0.025
THE-0.023
"'-0.022
 freshly-0.022
Token.
Token ID13
Feature activation-6.72e-03

Pos logit contributions
pieces+0.001
 piping+0.001
 snowball+0.001
"'+0.001
THE+0.001
Neg logit contributions
 Gre-0.001
oust-0.001
 Money-0.001
hips-0.001
 appearances-0.001
Token
Token ID198
Feature activation-1.12e-02

Pos logit contributions
 Gre+0.007
 Money+0.007
oust+0.006
 St+0.006
 Spark+0.006
Neg logit contributions
pieces-0.011
 piping-0.010
"'-0.008
 snowball-0.008
THE-0.008
Token
Token ID198
Feature activation-1.19e-02

Pos logit contributions
 Gre+0.016
oust+0.016
 Money+0.015
 appearances+0.015
hips+0.015
Neg logit contributions
pieces-0.011
 piping-0.011
"'-0.010
 snowball-0.010
THE-0.009
Token"
Token ID1
Feature activation+4.23e-02

Pos logit contributions
 Gre+0.018
oust+0.017
 Money+0.017
 St+0.016
hips+0.016
Neg logit contributions
pieces-0.013
 piping-0.012
 snowball-0.010
"'-0.010
THE-0.009
Token The
Token ID383
Feature activation+4.87e-02

Pos logit contributions
oust+0.010
 Gre+0.010
ret+0.010
ark+0.010
 appearances+0.010
Neg logit contributions
pieces-0.006
 piping-0.006
 snowball-0.005
"'-0.005
 Either-0.004
Token water
Token ID1660
Feature activation-7.03e-02

Pos logit contributions
pieces+0.056
 piping+0.052
 snowball+0.047
"'+0.046
THE+0.042
Neg logit contributions
 Gre-0.068
oust-0.064
 Money-0.062
 St-0.060
 Hel-0.060
Token soaked
Token ID33657
Feature activation+2.04e-02

Pos logit contributions
 Gre+0.108
oust+0.104
 Money+0.103
 Hel+0.099
 List+0.098
Neg logit contributions
pieces-0.064
 snowball-0.062
 piping-0.061
"'-0.058
THE-0.051
Token them
Token ID606
Feature activation+4.10e-02

Pos logit contributions
 snowball+0.018
pieces+0.016
 piping+0.016
"'+0.013
THE+0.013
Neg logit contributions
 Gre-0.033
oust-0.033
 Money-0.031
 List-0.031
 appearances-0.030
Token from
Token ID422
Feature activation-1.46e-01

Pos logit contributions
pieces+0.033
 snowball+0.032
 piping+0.030
"'+0.025
THE+0.024
Neg logit contributions
 Gre-0.070
oust-0.069
 Money-0.067
 Hel-0.064
 Cas-0.063
Token head
Token ID1182
Feature activation-5.61e-02

Pos logit contributions
 Gre+0.220
oust+0.217
 Money+0.211
ub+0.204
ret+0.201
Neg logit contributions
pieces-0.145
 piping-0.135
 snowball-0.130
"'-0.112
 freshly-0.108
Token to
Token ID284
Feature activation+9.88e-03

Pos logit contributions
 Gre+0.106
 List+0.105
 appearances+0.103
 ante+0.103
oust+0.103
Neg logit contributions
pieces-0.024
 snowball-0.024
"'-0.019
 piping-0.019
THE-0.017
Token toe
Token ID21189
Feature activation+4.90e-02

Pos logit contributions
pieces+0.010
 piping+0.009
 snowball+0.009
"'+0.008
 freshly+0.008
Neg logit contributions
 Gre-0.015
 Money-0.015
oust-0.014
 List-0.014
ub-0.014
Token.
Token ID13
Feature activation-4.83e-03

Pos logit contributions
pieces+0.057
 piping+0.053
 snowball+0.049
"'+0.048
THE+0.044
Neg logit contributions
 Gre-0.067
oust-0.063
 Money-0.062
hips-0.059
 St-0.058
Token They
Token ID1119
Feature activation-4.68e-03

Pos logit contributions
oust+0.007
 Gre+0.007
ark+0.007
hips+0.007
ret+0.007
Neg logit contributions
pieces-0.005
 piping-0.005
 snowball-0.004
"'-0.004
 freshly-0.003
Token were
Token ID547
Feature activation-2.40e-02

Pos logit contributions
 Gre+0.007
 Money+0.007
oust+0.007
 appearances+0.007
 List+0.007
Neg logit contributions
pieces-0.004
 piping-0.004
 snowball-0.004
"'-0.004
THE-0.003
Token loved
Token ID6151
Feature activation-5.88e-02

Pos logit contributions
 Gre+0.004
 well+0.003
oust+0.003
 St+0.003
 far+0.003
Neg logit contributions
pieces-0.007
 piping-0.006
"'-0.006
 snowball-0.006
THE-0.005
Token to
Token ID284
Feature activation+1.01e-02

Pos logit contributions
 Gre+0.063
 St+0.058
oust+0.058
 Money+0.058
 far+0.057
Neg logit contributions
pieces-0.088
 piping-0.080
"'-0.077
 snowball-0.072
THE-0.070
Token visit
Token ID3187
Feature activation-3.77e-02

Pos logit contributions
pieces+0.015
THE+0.013
"'+0.013
 piping+0.013
 snowball+0.012
Neg logit contributions
oust-0.010
 Gre-0.010
 St-0.009
 Spark-0.009
 T-0.009
Token her
Token ID607
Feature activation+4.95e-02

Pos logit contributions
 Gre+0.056
oust+0.054
 Money+0.053
hips+0.051
ub+0.050
Neg logit contributions
pieces-0.039
 piping-0.036
 snowball-0.032
"'-0.032
THE-0.028
Token grandma
Token ID49890
Feature activation-3.88e-02

Pos logit contributions
pieces+0.072
 piping+0.068
"'+0.063
 snowball+0.062
THE+0.058
Neg logit contributions
 Gre-0.054
oust-0.050
 Money-0.049
 St-0.047
 Hel-0.046
Token's
Token ID338
Feature activation+2.48e-03

Pos logit contributions
 Gre+0.041
 St+0.039
 Money+0.038
 Spark+0.037
oust+0.036
Neg logit contributions
pieces-0.056
 piping-0.054
 snowball-0.051
"'-0.050
THE-0.046
Token house
Token ID2156
Feature activation+2.89e-03

Pos logit contributions
pieces+0.002
 piping+0.002
 snowball+0.002
"'+0.002
 freshly+0.002
Neg logit contributions
 Gre-0.004
oust-0.004
ub-0.003
hips-0.003
ret-0.003
Token.
Token ID13
Feature activation+3.66e-03

Pos logit contributions
pieces+0.003
 piping+0.002
 snowball+0.002
"'+0.002
THE+0.002
Neg logit contributions
 Gre-0.005
oust-0.004
 Money-0.004
 ante-0.004
ub-0.004
Token Her
Token ID2332
Feature activation+8.77e-02

Pos logit contributions
pieces+0.006
 piping+0.005
 snowball+0.005
"'+0.005
THE+0.005
Neg logit contributions
 Gre-0.004
 Money-0.004
 St-0.004
 Spark-0.004
oust-0.003
Token grandma
Token ID49890
Feature activation-3.94e-02

Pos logit contributions
pieces+0.157
 piping+0.140
 snowball+0.135
"'+0.135
THE+0.129
Neg logit contributions
 Gre-0.074
oust-0.067
 Money-0.065
 St-0.065
 Hel-0.063
Token was
Token ID373
Feature activation+1.72e-02

Pos logit contributions
 Gre+0.055
oust+0.054
 Money+0.052
 appearances+0.050
hips+0.050
Neg logit contributions
pieces-0.043
 piping-0.041
"'-0.037
 snowball-0.036
THE-0.033
Token his
Token ID465
Feature activation+4.14e-02

Pos logit contributions
ub+0.011
oust+0.011
 Gre+0.011
hips+0.011
 Money+0.010
Neg logit contributions
 snowball-0.006
pieces-0.005
 piping-0.005
"'-0.005
THE-0.004
Token house
Token ID2156
Feature activation+1.05e-02

Pos logit contributions
pieces+0.045
 piping+0.042
 snowball+0.038
"'+0.037
THE+0.035
Neg logit contributions
 Gre-0.058
oust-0.056
 Money-0.054
hips-0.052
 St-0.052
Token.
Token ID13
Feature activation+3.65e-03

Pos logit contributions
pieces+0.008
"'+0.007
 piping+0.006
 snowball+0.006
?",+0.005
Neg logit contributions
 Gre-0.017
oust-0.017
ret-0.017
ub-0.017
 Ho-0.017
Token He
Token ID679
Feature activation+1.27e-02

Pos logit contributions
pieces+0.004
 piping+0.004
 snowball+0.004
"'+0.004
THE+0.003
Neg logit contributions
 Gre-0.005
oust-0.005
 Money-0.004
hips-0.004
 St-0.004
Token had
Token ID550
Feature activation+5.76e-02

Pos logit contributions
pieces+0.016
 piping+0.015
"'+0.013
 snowball+0.013
THE+0.012
Neg logit contributions
 Gre-0.016
oust-0.015
 Money-0.015
 St-0.014
hips-0.014
Token never
Token ID1239
Feature activation-1.55e-01

Pos logit contributions
pieces+0.094
 piping+0.089
"'+0.082
 snowball+0.081
THE+0.077
Neg logit contributions
 Gre-0.053
oust-0.049
 Money-0.048
 St-0.045
 far-0.044
Token seen
Token ID1775
Feature activation+7.65e-02

Pos logit contributions
 Gre+0.206
oust+0.192
 Money+0.189
 far+0.185
 lur+0.182
Neg logit contributions
pieces-0.190
 piping-0.181
 snowball-0.160
"'-0.157
THE-0.147
Token it
Token ID340
Feature activation-4.67e-02

Pos logit contributions
pieces+0.073
 snowball+0.070
 piping+0.067
 cocoa+0.054
"'+0.053
Neg logit contributions
 Gre-0.117
oust-0.115
ub-0.113
hips-0.112
ark-0.111
Token before
Token ID878
Feature activation-6.78e-02

Pos logit contributions
 Gre+0.062
oust+0.059
 Money+0.058
 appearances+0.057
engu+0.056
Neg logit contributions
pieces-0.055
 piping-0.048
"'-0.046
 snowball-0.045
THE-0.044
Token,
Token ID11
Feature activation-1.91e-02

Pos logit contributions
 Gre+0.094
oust+0.089
 Money+0.087
hips+0.084
 St+0.083
Neg logit contributions
pieces-0.075
 piping-0.070
 snowball-0.066
"'-0.064
THE-0.059
Token so
Token ID523
Feature activation+8.86e-02

Pos logit contributions
 Gre+0.030
oust+0.029
 Money+0.028
hips+0.027
 St+0.027
Neg logit contributions
pieces-0.018
 piping-0.017
 snowball-0.015
"'-0.015
THE-0.013
Token They
Token ID1119
Feature activation-1.65e-02

Pos logit contributions
oust+0.025
 Gre+0.024
ret+0.023
 lur+0.023
hips+0.023
Neg logit contributions
 piping-0.015
pieces-0.014
"'-0.012
 snowball-0.012
THE-0.011
Token reached
Token ID4251
Feature activation+1.11e-02

Pos logit contributions
 Gre+0.028
 Money+0.027
 appearances+0.027
oust+0.027
ub+0.027
Neg logit contributions
pieces-0.012
 piping-0.011
"'-0.010
 snowball-0.009
-0.009
Token for
Token ID329
Feature activation+1.02e-02

Pos logit contributions
pieces+0.013
 piping+0.013
 snowball+0.011
"'+0.011
THE+0.010
Neg logit contributions
 Gre-0.015
oust-0.014
 Money-0.014
hips-0.013
ub-0.013
Token the
Token ID262
Feature activation+6.76e-04

Pos logit contributions
 piping+0.008
pieces+0.007
 freshly+0.006
 apr+0.006
 snowball+0.005
Neg logit contributions
ark-0.019
ub-0.019
oust-0.018
 Gre-0.017
rate-0.017
Token or
Token ID393
Feature activation-3.12e-03

Pos logit contributions
pieces+0.001
 piping+0.001
"'+0.001
 snowball+0.001
 freshly+0.001
Neg logit contributions
 Gre-0.001
oust-0.001
 Money-0.001
 St-0.001
ub-0.001
Tokennam
Token ID7402
Feature activation-1.92e-01

Pos logit contributions
 Gre+0.002
oust+0.002
 Money+0.002
 St+0.001
hips+0.001
Neg logit contributions
pieces-0.006
 piping-0.006
"'-0.005
 snowball-0.005
THE-0.005
Tokenents
Token ID658
Feature activation-1.34e-01

Pos logit contributions
 Gre+0.202
oust+0.182
 Money+0.177
 St+0.173
 appearances+0.171
Neg logit contributions
pieces-0.294
 piping-0.270
"'-0.250
 snowball-0.238
THE-0.231
Token and
Token ID290
Feature activation-1.60e-03

Pos logit contributions
 Gre+0.195
oust+0.185
 Money+0.184
 List+0.174
 St+0.174
Neg logit contributions
pieces-0.147
 piping-0.130
"'-0.117
 snowball-0.117
THE-0.108
Token touched
Token ID12615
Feature activation-1.69e-02

Pos logit contributions
 Gre+0.002
oust+0.002
ark+0.002
ub+0.002
hips+0.002
Neg logit contributions
pieces-0.002
 piping-0.002
"'-0.001
 snowball-0.001
 freshly-0.001
Token them
Token ID606
Feature activation+2.66e-02

Pos logit contributions
oust+0.028
 Gre+0.028
 Money+0.026
ark+0.026
ub+0.026
Neg logit contributions
pieces-0.014
 piping-0.014
 snowball-0.012
"'-0.010
 freshly-0.009
Token softly
Token ID26625
Feature activation-1.15e-01

Pos logit contributions
pieces+0.022
 piping+0.018
 snowball+0.017
"'+0.016
THE+0.015
Neg logit contributions
 Gre-0.044
oust-0.044
 ante-0.042
 Hel-0.041
 unexpl-0.041
Token went
Token ID1816
Feature activation-5.71e-02

Pos logit contributions
pieces+0.014
 piping+0.013
"'+0.012
 snowball+0.011
?",+0.011
Neg logit contributions
 Gre-0.022
oust-0.021
 appearances-0.021
 Money-0.020
ub-0.020
Token in
Token ID287
Feature activation+1.02e-02

Pos logit contributions
 Gre+0.080
oust+0.076
 Money+0.074
 St+0.071
hips+0.070
Neg logit contributions
pieces-0.065
 piping-0.060
"'-0.054
 snowball-0.054
THE-0.049
Token.
Token ID13
Feature activation-2.70e-03

Pos logit contributions
pieces+0.010
 piping+0.010
 snowball+0.009
"'+0.009
 freshly+0.008
Neg logit contributions
 Gre-0.015
oust-0.014
 Money-0.014
 appearances-0.014
 ante-0.014
Token They
Token ID1119
Feature activation-5.52e-03

Pos logit contributions
oust+0.004
 appearances+0.004
 greed+0.004
 Gre+0.004
hips+0.004
Neg logit contributions
pieces-0.002
 piping-0.002
"'-0.002
 snowball-0.002
-0.002
Token saw
Token ID2497
Feature activation+6.25e-02

Pos logit contributions
 appearances+0.009
 Gre+0.009
oust+0.009
 Money+0.009
ub+0.009
Neg logit contributions
pieces-0.004
 piping-0.004
"'-0.003
?",-0.003
 snowball-0.003
Token many
Token ID867
Feature activation-1.92e-01

Pos logit contributions
pieces+0.064
 piping+0.057
 snowball+0.052
 cocoa+0.048
 awesome+0.047
Neg logit contributions
ub-0.097
ark-0.093
oust-0.093
rate-0.090
 Gre-0.089
Token things
Token ID1243
Feature activation-2.18e-03

Pos logit contributions
 Gre+0.253
oust+0.241
 Money+0.231
 St+0.226
ub+0.224
Neg logit contributions
pieces-0.240
 piping-0.221
 snowball-0.206
"'-0.197
 freshly-0.180
Token that
Token ID326
Feature activation+3.36e-04

Pos logit contributions
 Gre+0.004
oust+0.003
 Money+0.003
 St+0.003
 Sno+0.003
Neg logit contributions
pieces-0.002
 piping-0.002
"'-0.002
 freshly-0.001
 snowball-0.001
Token they
Token ID484
Feature activation+3.84e-02

Pos logit contributions
pieces+0.000
 piping+0.000
 Amy+0.000
 freshly+0.000
"'+0.000
Neg logit contributions
ub-0.001
ark-0.001
oust-0.001
 appearances-0.001
ud-0.001
Token liked
Token ID8288
Feature activation+1.99e-02

Pos logit contributions
pieces+0.043
 piping+0.040
"'+0.036
 snowball+0.036
THE+0.032
Neg logit contributions
 Gre-0.054
oust-0.051
 Money-0.050
 St-0.048
hips-0.048
Token:
Token ID25
Feature activation+5.16e-02

Pos logit contributions
pieces+0.019
 piping+0.016
 snowball+0.016
"'+0.015
THE+0.014
Neg logit contributions
 Gre-0.030
oust-0.030
 Money-0.028
 List-0.028
ub-0.027
Token he
Token ID339
Feature activation-5.74e-03

Pos logit contributions
 Gre+0.039
oust+0.038
 Money+0.036
hips+0.036
ub+0.035
Neg logit contributions
pieces-0.034
 piping-0.032
 snowball-0.029
"'-0.028
THE-0.026
Token heard
Token ID2982
Feature activation+1.30e-02

Pos logit contributions
 Gre+0.009
 Money+0.009
oust+0.008
hips+0.008
 appearances+0.008
Neg logit contributions
pieces-0.005
 piping-0.005
 snowball-0.005
"'-0.005
THE-0.004
Token a
Token ID257
Feature activation-1.80e-02

Pos logit contributions
pieces+0.010
 snowball+0.009
 piping+0.007
 cocoa+0.006
"'+0.006
Neg logit contributions
ub-0.023
hips-0.023
ho-0.022
opp-0.022
 Gre-0.022
Token voice
Token ID3809
Feature activation-1.25e-03

Pos logit contributions
 Gre+0.023
oust+0.022
 Money+0.022
hips+0.021
 St+0.021
Neg logit contributions
pieces-0.022
 piping-0.021
 snowball-0.019
"'-0.019
THE-0.017
Token coming
Token ID2406
Feature activation-5.54e-02

Pos logit contributions
oust+0.001
 Gre+0.001
 lur+0.001
 far+0.001
 Hel+0.001
Neg logit contributions
pieces-0.002
 piping-0.002
"'-0.002
 snowball-0.002
THE-0.002
Token from
Token ID422
Feature activation-1.54e-01

Pos logit contributions
 Gre+0.054
oust+0.051
 Money+0.050
 appearances+0.047
ret+0.047
Neg logit contributions
pieces-0.079
 snowball-0.075
 piping-0.073
"'-0.072
THE-0.067
Token inside
Token ID2641
Feature activation+9.14e-03

Pos logit contributions
 far+0.169
 Gre+0.158
 well+0.144
oust+0.141
 Hel+0.139
Neg logit contributions
pieces-0.252
 piping-0.231
"'-0.210
affles-0.188
THE-0.186
Token the
Token ID262
Feature activation+7.84e-03

Pos logit contributions
pieces+0.010
 piping+0.010
 snowball+0.009
"'+0.009
THE+0.008
Neg logit contributions
 Gre-0.012
oust-0.012
 Money-0.012
hips-0.011
 St-0.011
Token radio
Token ID5243
Feature activation-1.25e-02

Pos logit contributions
pieces+0.011
 piping+0.010
 snowball+0.009
"'+0.009
THE+0.009
Neg logit contributions
 Gre-0.009
oust-0.009
 Money-0.008
 St-0.008
hips-0.008
Token.
Token ID13
Feature activation-7.13e-03

Pos logit contributions
 Gre+0.019
oust+0.019
 Money+0.018
ret+0.018
 List+0.018
Neg logit contributions
pieces-0.012
 piping-0.010
 snowball-0.009
"'-0.009
THE-0.008
Token
Token ID198
Feature activation-1.29e-02

Pos logit contributions
 Gre+0.010
oust+0.010
 Money+0.009
hips+0.009
 appearances+0.009
Neg logit contributions
pieces-0.008
 piping-0.007
"'-0.007
 snowball-0.007
THE-0.006
Token her
Token ID607
Feature activation+5.22e-02

Pos logit contributions
ub+0.203
oust+0.200
 ante+0.199
engu+0.198
ark+0.196
Neg logit contributions
pieces-0.072
 piping-0.063
 freshly-0.061
"'-0.049
 cocoa-0.045
Token.
Token ID13
Feature activation-4.55e-03

Pos logit contributions
pieces+0.058
 piping+0.052
 snowball+0.047
"'+0.047
THE+0.044
Neg logit contributions
 Gre-0.073
oust-0.070
 Money-0.069
 ante-0.066
hips-0.066
Token With
Token ID2080
Feature activation-8.63e-02

Pos logit contributions
oust+0.006
 Gre+0.006
hips+0.006
 appearances+0.006
 lur+0.006
Neg logit contributions
pieces-0.005
 piping-0.005
"'-0.004
 snowball-0.004
THE-0.004
Token a
Token ID257
Feature activation-9.48e-03

Pos logit contributions
ub+0.137
oust+0.136
 Money+0.132
 Gre+0.131
ark+0.130
Neg logit contributions
 piping-0.080
pieces-0.077
 freshly-0.064
"'-0.062
 snowball-0.061
Token heavy
Token ID4334
Feature activation-3.78e-02

Pos logit contributions
 Gre+0.013
 Money+0.013
oust+0.012
hips+0.012
 appearances+0.012
Neg logit contributions
pieces-0.011
 piping-0.010
 snowball-0.009
"'-0.009
 freshly-0.008
Token heart
Token ID2612
Feature activation-1.70e-01

Pos logit contributions
 Money+0.068
ark+0.065
 Gre+0.065
 List+0.065
hips+0.065
Neg logit contributions
 snowball-0.028
 piping-0.025
pieces-0.019
 freshly-0.019
"'-0.018
Token and
Token ID290
Feature activation+1.51e-02

Pos logit contributions
 Gre+0.248
 Money+0.234
oust+0.230
 List+0.219
 St+0.218
Neg logit contributions
pieces-0.190
 piping-0.174
"'-0.158
 snowball-0.149
THE-0.138
Token a
Token ID257
Feature activation-9.35e-03

Pos logit contributions
pieces+0.016
 piping+0.015
"'+0.013
 freshly+0.013
THE+0.012
Neg logit contributions
oust-0.022
ub-0.022
 Money-0.021
hips-0.021
 Gre-0.021
Token frown
Token ID22002
Feature activation-5.91e-02

Pos logit contributions
 Gre+0.013
 Money+0.013
oust+0.012
hips+0.012
 appearances+0.012
Neg logit contributions
 piping-0.010
pieces-0.010
 snowball-0.009
"'-0.009
 freshly-0.008
Token on
Token ID319
Feature activation+8.39e-02

Pos logit contributions
 Gre+0.084
 Money+0.080
oust+0.080
 List+0.076
 appearances+0.076
Neg logit contributions
pieces-0.065
 piping-0.060
"'-0.055
 snowball-0.053
THE-0.048
Token her
Token ID607
Feature activation+5.14e-02

Pos logit contributions
pieces+0.105
 piping+0.098
"'+0.089
 snowball+0.089
THE+0.082
Neg logit contributions
 Gre-0.107
oust-0.101
 Money-0.099
 St-0.095
hips-0.093