TOP ACTIVATIONS
MAX = 0.416

all cheered and began to asting marsh m allows . All
found the cookie on the counter . Lily was so happy
kitchen , standing at the counter . She had a big
a cookie jar on the counter . He reached up to
Sarah how to use the counter top to un load the
took the box to the counter . He did not know
The patient went to the counter and asked to buy the
to the lady at the counter ," mom said .
made a normal - t asting wine that was sweet and
put his penny on the counter . The shop keeper smiled
an icy pastry on the counter . It looked delicious !
he made some y ummy burgers . He put them on
ucked the onion from the counter . When Lily
big , sweet - t asting fruit tree !
a beautiful cheese o me lette . John was so pleased
The o me lette tasted even better than it
they had the o me lette put together . They were
to try one of the burgers ?" asked Aunt S ari
having too much fun ro asting her marsh mallow .
to the check - in counter to find their own suitcase

MIN ACTIVATIONS
MIN = -0.464

She felt like a real superstar ! L ily
. She became a fashion superstar , admired by all of
we can get your k ite back . Maybe we can
had a big red k ite that she took to the
. I like your k ite . It is a crow
a ball and a k ite . Anna is tall and
we can have a k ite ." They run
" Or maybe the k ite is too heavy ."
" We like your k ite . Can you introduce us
a ball and a k ite . They run and laugh
, this is my k ite . I made it with
the ball , the k ite , and the bucket .
and Ben took their k ite to the park . They
. She sees the k ite . " That
. It wob bles and til ts . Sara and Ben
want to fly the k ite again . We want to
They look at the k ite . They see a problem
a problem . The k ite has a hole in it
the ball , the k ite and the bike . The
can we have a k ite ?" Sara asks . "

INTERVAL 0.196 - 0.416
CONTAINS 1.436%

now ." L ily and Ben started to panic
. They played with other plants that were nice and soft
happy that the bird was safe and the bell was ringing
The bug spl ashed and sw am and had lots of
She said , " L ily , please don 't yell

INTERVAL -0.024 - 0.196
CONTAINS 70.446%

the pot started to rise . It was like magic !
M ummy and Daddy laughed and hugged her . They said
, stir the batter and fill a pan with the batter
his it ches disappearing . Suddenly , he heard a voice
you . Now , why don 't you play together nicely

INTERVAL -0.244 - -0.024
CONTAINS 27.569%

way to play on the tall rail near the pond .
Tom hiding behind it . " Boo !" Tom
, Lily and Max met another little girl and her dog
, his wings kept getting caught in the trees . Finally
Sure , Lily ! You can give this letter to your

INTERVAL -0.464 - -0.244
CONTAINS 0.548%

's raining because it 's cloudy outside , and the rain
his eyes when the star suddenly seemed to move . He
bear was never without gum again ! <|endoftext|> Once upon a
dad carried the suit cases again . They had a fun
. And I love the stars . But I don 't
Token all
Token ID477
Feature activation-5.77e-02

Pos logit contributions
 superstar+0.198
ite+0.185
 til+0.181
bul+0.177
 flashes+0.177
Neg logit contributions
cious-0.172
 burgers-0.158
lette-0.157
asting-0.156
 cooker-0.153
Token cheered
Token ID34767
Feature activation+2.39e-02

Pos logit contributions
 superstar+0.086
ite+0.081
 til+0.080
 flashes+0.080
bird+0.078
Neg logit contributions
cious-0.090
 counter-0.085
 burgers-0.084
lette-0.083
 counters-0.079
Token and
Token ID290
Feature activation+1.96e-03

Pos logit contributions
cious+0.044
 burgers+0.040
lette+0.040
asting+0.039
 counter+0.039
Neg logit contributions
 superstar-0.029
 til-0.026
bird-0.025
bul-0.025
ite-0.025
Token began
Token ID2540
Feature activation-1.01e-02

Pos logit contributions
cious+0.003
lette+0.003
 burgers+0.003
asting+0.003
 counter+0.003
Neg logit contributions
 superstar-0.003
ite-0.003
 til-0.003
 flashes-0.002
bird-0.002
Token to
Token ID284
Feature activation+2.81e-02

Pos logit contributions
 superstar+0.015
 til+0.014
ite+0.014
 flashes+0.014
bird+0.014
Neg logit contributions
cious-0.015
lette-0.014
 burgers-0.014
asting-0.014
 counter-0.014
Tokenasting
Token ID9222
Feature activation+4.16e-01

Pos logit contributions
cious+0.047
 burgers+0.045
lette+0.044
asting+0.044
 cooker+0.043
Neg logit contributions
 superstar-0.037
ite-0.035
 til-0.033
bul-0.033
bird-0.033
Token marsh
Token ID22397
Feature activation+5.89e-02

Pos logit contributions
cious+0.653
lette+0.601
 burgers+0.594
asting+0.588
 cooker+0.581
Neg logit contributions
 superstar-0.604
ite-0.562
 til-0.559
 flashes-0.545
bird-0.540
Tokenm
Token ID76
Feature activation-1.74e-02

Pos logit contributions
cious+0.106
 burgers+0.098
lette+0.098
asting+0.096
 counter+0.096
Neg logit contributions
 superstar-0.072
ite-0.068
bird-0.064
 til-0.064
 flashes-0.062
Tokenallows
Token ID47205
Feature activation+2.37e-01

Pos logit contributions
 flashes+0.029
inkle+0.029
 til+0.028
 superstar+0.027
ller+0.027
Neg logit contributions
cious-0.022
 counter-0.020
 cooker-0.018
 oh-0.018
keeping-0.018
Token.
Token ID13
Feature activation+1.44e-02

Pos logit contributions
cious+0.385
 burgers+0.352
lette+0.351
asting+0.343
 cooker+0.343
Neg logit contributions
 superstar-0.333
ite-0.311
 til-0.302
bird-0.300
bul-0.298
Token All
Token ID1439
Feature activation-9.33e-02

Pos logit contributions
cious+0.022
 burgers+0.020
lette+0.020
asting+0.020
 cooker+0.019
Neg logit contributions
 superstar-0.022
ite-0.020
 til-0.020
bird-0.020
 flashes-0.020
Token found
Token ID1043
Feature activation+1.44e-01

Pos logit contributions
 superstar+0.234
ite+0.220
 til+0.216
bird+0.209
 flashes+0.207
Neg logit contributions
cious-0.228
 burgers-0.218
lette-0.214
 counter-0.210
 cooker-0.210
Token the
Token ID262
Feature activation+2.35e-02

Pos logit contributions
cious+0.226
lette+0.206
 burgers+0.204
asting+0.202
 cooker+0.201
Neg logit contributions
 superstar-0.208
ite-0.194
 til-0.192
bird-0.191
 flashes-0.187
Token cookie
Token ID19751
Feature activation-2.51e-02

Pos logit contributions
cious+0.033
 burgers+0.032
 counter+0.032
lette+0.031
asting+0.031
Neg logit contributions
 superstar-0.037
ite-0.035
 til-0.034
 flashes-0.034
 Rhythm-0.033
Token on
Token ID319
Feature activation-5.69e-02

Pos logit contributions
 superstar+0.032
ite+0.031
bird+0.030
bul+0.028
 til+0.027
Neg logit contributions
cious-0.043
 burgers-0.040
lette-0.040
asting-0.039
ough-0.039
Token the
Token ID262
Feature activation+1.96e-02

Pos logit contributions
 superstar+0.087
ite+0.082
 til+0.080
bird+0.078
 flashes+0.078
Neg logit contributions
cious-0.085
 burgers-0.079
lette-0.078
asting-0.077
 counter-0.076
Token counter
Token ID3753
Feature activation+4.07e-01

Pos logit contributions
cious+0.022
asting+0.020
lette+0.019
 burgers+0.019
row+0.019
Neg logit contributions
 superstar-0.035
 unicorn-0.035
bird-0.034
 rider-0.033
ite-0.033
Token.
Token ID13
Feature activation+2.89e-02

Pos logit contributions
cious+0.750
 burgers+0.694
asting+0.693
ough+0.675
lette+0.671
Neg logit contributions
 superstar-0.459
ite-0.433
bird-0.426
-0.399
 til-0.396
Token Lily
Token ID20037
Feature activation+1.18e-02

Pos logit contributions
cious+0.049
asting+0.047
lette+0.046
 burgers+0.046
 cooker+0.045
Neg logit contributions
 superstar-0.038
ite-0.036
bird-0.035
 Rhythm-0.035
 Spark-0.034
Token was
Token ID373
Feature activation+7.50e-03

Pos logit contributions
cious+0.019
 burgers+0.018
lette+0.018
asting+0.018
 cooker+0.018
Neg logit contributions
 superstar-0.016
 til-0.015
bird-0.015
ite-0.015
bul-0.014
Token so
Token ID523
Feature activation+1.52e-01

Pos logit contributions
cious+0.010
 burgers+0.009
 cooker+0.009
lette+0.009
asting+0.008
Neg logit contributions
 superstar-0.013
 til-0.012
ite-0.012
bird-0.012
 flashes-0.012
Token happy
Token ID3772
Feature activation+4.86e-02

Pos logit contributions
cious+0.240
 burgers+0.230
 cooker+0.226
lette+0.224
 counter+0.223
Neg logit contributions
 superstar-0.214
 til-0.203
ite-0.199
bird-0.190
 flashes-0.188
Token kitchen
Token ID9592
Feature activation+2.38e-01

Pos logit contributions
cious+0.034
 burgers+0.031
lette+0.031
asting+0.031
 cooker+0.030
Neg logit contributions
 superstar-0.041
ite-0.038
 til-0.037
bird-0.037
 flashes-0.037
Token,
Token ID11
Feature activation+2.01e-02

Pos logit contributions
cious+0.485
asting+0.450
ough+0.439
ocado+0.432
keeping+0.425
Neg logit contributions
 superstar-0.203
 smiles-0.200
 flashes-0.194
inkle-0.188
 til-0.186
Token standing
Token ID5055
Feature activation+9.25e-02

Pos logit contributions
cious+0.035
ough+0.031
lette+0.031
asting+0.031
row+0.031
Neg logit contributions
 superstar-0.025
 til-0.024
 flashes-0.024
 flash-0.023
 smiles-0.023
Token at
Token ID379
Feature activation-4.83e-02

Pos logit contributions
cious+0.149
 burgers+0.136
lette+0.135
asting+0.134
ough+0.132
Neg logit contributions
 superstar-0.131
ite-0.122
 til-0.121
 flashes-0.117
bird-0.116
Token the
Token ID262
Feature activation-5.49e-03

Pos logit contributions
 superstar+0.070
ite+0.065
 til+0.064
bird+0.063
 flashes+0.063
Neg logit contributions
cious-0.077
lette-0.070
 burgers-0.069
asting-0.069
 cooker-0.068
Token counter
Token ID3753
Feature activation+4.07e-01

Pos logit contributions
 superstar+0.010
ite+0.009
bird+0.009
 unicorn+0.009
 til+0.009
Neg logit contributions
cious-0.007
lette-0.006
ough-0.006
asting-0.006
 burgers-0.006
Token.
Token ID13
Feature activation+1.99e-02

Pos logit contributions
cious+0.697
asting+0.638
 burgers+0.637
lette+0.628
 cooker+0.621
Neg logit contributions
 superstar-0.515
ite-0.485
bird-0.471
 til-0.464
 flashes-0.451
Token She
Token ID1375
Feature activation-3.01e-02

Pos logit contributions
cious+0.034
asting+0.033
lette+0.032
 burgers+0.032
 cooker+0.031
Neg logit contributions
 superstar-0.025
bird-0.024
ite-0.023
 Rhythm-0.023
 til-0.023
Token had
Token ID550
Feature activation+5.20e-02

Pos logit contributions
 superstar+0.049
ite+0.046
bul+0.044
 Rhythm+0.044
 til+0.043
Neg logit contributions
cious-0.042
 burgers-0.039
lette-0.039
 counter-0.038
 cooker-0.038
Token a
Token ID257
Feature activation-1.01e-03

Pos logit contributions
cious+0.083
asting+0.076
 burgers+0.076
 cooker+0.076
ough+0.075
Neg logit contributions
 superstar-0.069
 flashes-0.064
 til-0.064
 flash-0.062
bul-0.062
Token big
Token ID1263
Feature activation-1.97e-02

Pos logit contributions
 superstar+0.002
ite+0.002
 til+0.001
bird+0.001
 flashes+0.001
Neg logit contributions
cious-0.001
 burgers-0.001
lette-0.001
 cooker-0.001
 counter-0.001
Token a
Token ID257
Feature activation+7.53e-03

Pos logit contributions
cious+0.004
lette+0.004
asting+0.004
 burgers+0.004
 cooker+0.004
Neg logit contributions
 superstar-0.005
ite-0.004
 flashes-0.004
 til-0.004
bird-0.004
Token cookie
Token ID19751
Feature activation-2.87e-02

Pos logit contributions
cious+0.011
 burgers+0.010
lette+0.010
asting+0.010
 cooker+0.010
Neg logit contributions
 superstar-0.012
ite-0.011
 til-0.011
bird-0.010
 flashes-0.010
Token jar
Token ID17379
Feature activation+1.00e-01

Pos logit contributions
 superstar+0.039
ite+0.036
bird+0.035
 til+0.035
 rider+0.034
Neg logit contributions
cious-0.049
lette-0.044
asting-0.044
 burgers-0.043
ough-0.043
Token on
Token ID319
Feature activation-6.64e-02

Pos logit contributions
cious+0.189
 burgers+0.171
lette+0.170
 cooker+0.166
 counters+0.165
Neg logit contributions
ite-0.106
 superstar-0.105
bird-0.099
bul-0.097
 rider-0.095
Token the
Token ID262
Feature activation-7.20e-03

Pos logit contributions
 superstar+0.084
 til+0.076
bird+0.075
ite+0.074
 flashes+0.074
Neg logit contributions
cious-0.118
lette-0.108
 burgers-0.104
asting-0.103
 cooker-0.103
Token counter
Token ID3753
Feature activation+4.06e-01

Pos logit contributions
 unicorn+0.012
 cartoon+0.012
bird+0.012
 superstar+0.012
 ghost+0.011
Neg logit contributions
cious-0.008
asting-0.008
row-0.007
lette-0.007
em-0.007
Token.
Token ID13
Feature activation+1.17e-02

Pos logit contributions
cious+0.789
asting+0.737
 burgers+0.716
OU+0.707
ough+0.704
Neg logit contributions
 superstar-0.388
ite-0.373
bird-0.370
-0.345
 til-0.340
Token He
Token ID679
Feature activation-7.78e-03

Pos logit contributions
cious+0.023
asting+0.022
lette+0.022
row+0.021
 burgers+0.021
Neg logit contributions
 superstar-0.013
 Spark-0.012
 Rhythm-0.012
bird-0.012
 Zoom-0.012
Token reached
Token ID4251
Feature activation+6.40e-02

Pos logit contributions
 superstar+0.012
 til+0.011
ite+0.011
bird+0.011
bul+0.011
Neg logit contributions
cious-0.011
 burgers-0.010
lette-0.010
asting-0.010
 cooker-0.010
Token up
Token ID510
Feature activation-2.73e-02

Pos logit contributions
cious+0.103
lette+0.092
asting+0.088
 burgers+0.088
keeping+0.085
Neg logit contributions
 superstar-0.091
 til-0.090
ite-0.088
bird-0.086
bul-0.085
Token to
Token ID284
Feature activation+2.16e-02

Pos logit contributions
 superstar+0.042
ite+0.039
 til+0.038
 flashes+0.037
bul+0.037
Neg logit contributions
cious-0.041
 burgers-0.039
asting-0.038
 counter-0.038
lette-0.037
Token Sarah
Token ID10490
Feature activation-3.68e-02

Pos logit contributions
 superstar+0.082
ite+0.077
 til+0.075
+0.074
 rider+0.072
Neg logit contributions
 burgers-0.055
cious-0.054
lette-0.053
 counter-0.052
asting-0.051
Token how
Token ID703
Feature activation+4.44e-02

Pos logit contributions
 superstar+0.063
ite+0.060
 til+0.059
 flashes+0.058
 Rhythm+0.057
Neg logit contributions
cious-0.046
 burgers-0.044
asting-0.044
lette-0.044
 counter-0.042
Token to
Token ID284
Feature activation+4.52e-02

Pos logit contributions
cious+0.063
 burgers+0.058
lette+0.058
asting+0.057
 cooker+0.056
Neg logit contributions
 superstar-0.071
ite-0.067
 til-0.066
bird-0.064
 flashes-0.064
Token use
Token ID779
Feature activation-1.61e-01

Pos logit contributions
cious+0.066
asting+0.063
ough+0.062
 burgers+0.061
lette+0.061
Neg logit contributions
 superstar-0.071
ite-0.066
 til-0.064
bul-0.062
 rider-0.061
Token the
Token ID262
Feature activation+1.32e-02

Pos logit contributions
 superstar+0.302
ite+0.276
 til+0.271
 rider+0.270
bird+0.261
Neg logit contributions
 burgers-0.184
cious-0.179
 counter-0.179
 cooker-0.169
asting-0.169
Token counter
Token ID3753
Feature activation+4.06e-01

Pos logit contributions
cious+0.020
 burgers+0.019
lette+0.018
 counter+0.018
asting+0.018
Neg logit contributions
 superstar-0.020
ite-0.019
 til-0.018
bird-0.018
 flashes-0.018
Tokentop
Token ID4852
Feature activation-3.69e-02

Pos logit contributions
cious+0.674
lette+0.607
 burgers+0.606
asting+0.600
 cooker+0.587
Neg logit contributions
 superstar-0.545
ite-0.525
bul-0.495
 til-0.488
bird-0.484
Token to
Token ID284
Feature activation+3.34e-02

Pos logit contributions
 superstar+0.046
ite+0.046
bul+0.045
bird+0.044
 flashes+0.043
Neg logit contributions
cious-0.062
lette-0.056
asting-0.055
 burgers-0.053
ough-0.053
Token un
Token ID555
Feature activation+1.61e-01

Pos logit contributions
cious+0.051
lette+0.046
 burgers+0.046
asting+0.045
 cooker+0.044
Neg logit contributions
 superstar-0.050
ite-0.046
 til-0.046
bird-0.045
 flashes-0.045
Tokenload
Token ID2220
Feature activation+1.27e-01

Pos logit contributions
cious+0.178
lette+0.167
 burgers+0.164
asting+0.162
 cooker+0.161
Neg logit contributions
 superstar-0.314
 til-0.291
ite-0.288
 flashes-0.286
 Rhythm-0.282
Token the
Token ID262
Feature activation+3.71e-03

Pos logit contributions
cious+0.232
lette+0.206
asting+0.197
keeping+0.193
 cooker+0.193
Neg logit contributions
ite-0.149
 flashes-0.141
 superstar-0.141
 smiles-0.141
bul-0.138
Token took
Token ID1718
Feature activation+1.46e-01

Pos logit contributions
cious+0.000
 burgers+0.000
lette+0.000
asting+0.000
 cooker+0.000
Neg logit contributions
 superstar-0.000
ite-0.000
 til-0.000
bird-0.000
 flashes-0.000
Token the
Token ID262
Feature activation+6.20e-03

Pos logit contributions
cious+0.207
lette+0.188
 burgers+0.186
asting+0.185
 cooker+0.180
Neg logit contributions
 superstar-0.231
ite-0.219
 til-0.215
bird-0.213
 flashes-0.211
Token box
Token ID3091
Feature activation+8.13e-02

Pos logit contributions
cious+0.009
 burgers+0.009
lette+0.008
asting+0.008
 cooker+0.008
Neg logit contributions
 superstar-0.009
ite-0.009
 til-0.009
bird-0.008
bul-0.008
Token to
Token ID284
Feature activation+2.53e-02

Pos logit contributions
cious+0.130
 burgers+0.118
lette+0.117
asting+0.114
 cooker+0.113
Neg logit contributions
 superstar-0.115
ite-0.113
bul-0.107
bird-0.106
 til-0.103
Token the
Token ID262
Feature activation+3.00e-03

Pos logit contributions
cious+0.038
 burgers+0.035
lette+0.035
asting+0.035
 cooker+0.034
Neg logit contributions
 superstar-0.038
ite-0.036
 til-0.035
bird-0.034
 flashes-0.034
Token counter
Token ID3753
Feature activation+4.03e-01

Pos logit contributions
cious+0.004
lette+0.004
 burgers+0.004
asting+0.004
ough+0.004
Neg logit contributions
 superstar-0.005
ite-0.005
 til-0.004
bird-0.004
 unicorn-0.004
Token.
Token ID13
Feature activation+1.23e-02

Pos logit contributions
cious+0.764
asting+0.690
 burgers+0.680
ough+0.678
lette+0.676
Neg logit contributions
 superstar-0.440
ite-0.419
bird-0.397
 til-0.391
bul-0.384
Token He
Token ID679
Feature activation-9.94e-03

Pos logit contributions
cious+0.022
asting+0.021
lette+0.021
 burgers+0.020
 cooker+0.020
Neg logit contributions
 superstar-0.015
ite-0.014
bird-0.014
 Rhythm-0.014
 Spark-0.014
Token did
Token ID750
Feature activation+2.63e-02

Pos logit contributions
 superstar+0.016
ite+0.015
 til+0.015
bird+0.015
 flashes+0.015
Neg logit contributions
cious-0.014
 burgers-0.012
lette-0.012
asting-0.012
 cooker-0.012
Token not
Token ID407
Feature activation+2.91e-02

Pos logit contributions
cious+0.036
 burgers+0.033
lette+0.033
asting+0.032
 cooker+0.032
Neg logit contributions
 superstar-0.043
ite-0.041
 til-0.040
 flashes-0.039
bird-0.039
Token know
Token ID760
Feature activation+5.51e-02

Pos logit contributions
cious+0.046
 burgers+0.042
lette+0.041
 counter+0.041
 cooker+0.040
Neg logit contributions
 superstar-0.042
 til-0.039
ite-0.039
bird-0.038
 rider-0.038
TokenThe
Token ID464
Feature activation+2.71e-02

Pos logit contributions
 superstar+0.017
ite+0.017
bird+0.016
 Rhythm+0.016
Because+0.016
Neg logit contributions
cious-0.020
 burgers-0.019
lette-0.018
 cooker-0.018
 counter-0.018
Token patient
Token ID5827
Feature activation+2.69e-02

Pos logit contributions
cious+0.032
 burgers+0.031
 counter+0.031
 cooker+0.030
asting+0.030
Neg logit contributions
 superstar-0.048
ite-0.046
 til-0.045
 flashes-0.044
 Rhythm-0.044
Token went
Token ID1816
Feature activation+1.15e-01

Pos logit contributions
cious+0.039
 burgers+0.036
lette+0.036
asting+0.035
 counter+0.035
Neg logit contributions
 superstar-0.042
ite-0.039
 til-0.039
 flashes-0.038
bird-0.038
Token to
Token ID284
Feature activation+1.98e-02

Pos logit contributions
cious+0.148
 counter+0.140
 burgers+0.138
asting+0.136
 cooker+0.135
Neg logit contributions
 superstar-0.197
ite-0.185
 flashes-0.181
 til-0.179
bird-0.177
Token the
Token ID262
Feature activation-1.17e-02

Pos logit contributions
cious+0.028
 burgers+0.026
 counter+0.026
asting+0.026
lette+0.025
Neg logit contributions
 superstar-0.032
ite-0.031
 til-0.029
bul-0.029
 flashes-0.028
Token counter
Token ID3753
Feature activation+4.03e-01

Pos logit contributions
 superstar+0.019
ite+0.018
 til+0.018
 flashes+0.018
bul+0.017
Neg logit contributions
cious-0.016
 counter-0.015
 burgers-0.015
 cooker-0.015
lette-0.014
Token and
Token ID290
Feature activation-4.15e-03

Pos logit contributions
cious+0.639
 burgers+0.592
lette+0.584
asting+0.583
 cooker+0.570
Neg logit contributions
 superstar-0.578
ite-0.543
 til-0.524
bird-0.517
bul-0.510
Token asked
Token ID1965
Feature activation-6.42e-02

Pos logit contributions
 superstar+0.006
ite+0.006
 til+0.006
bird+0.005
bul+0.005
Neg logit contributions
cious-0.006
lette-0.006
asting-0.006
 burgers-0.006
ough-0.006
Token to
Token ID284
Feature activation+1.91e-02

Pos logit contributions
 superstar+0.081
 til+0.076
bird+0.075
 Spark+0.074
 Rhythm+0.073
Neg logit contributions
cious-0.112
lette-0.105
asting-0.103
 burgers-0.102
 cooker-0.100
Token buy
Token ID2822
Feature activation+1.14e-01

Pos logit contributions
cious+0.034
 burgers+0.029
lette+0.029
ocado+0.028
asting+0.028
Neg logit contributions
 superstar-0.025
bird-0.024
bul-0.023
ite-0.023
 flash-0.022
Token the
Token ID262
Feature activation-1.04e-02

Pos logit contributions
cious+0.179
lette+0.164
 burgers+0.162
asting+0.161
 cooker+0.160
Neg logit contributions
 superstar-0.166
ite-0.156
 til-0.153
bird-0.149
 flashes-0.148
Token to
Token ID284
Feature activation+4.24e-02

Pos logit contributions
 superstar+0.023
ite+0.021
 flashes+0.020
 til+0.020
 bolt+0.020
Neg logit contributions
cious-0.013
 counter-0.013
 burgers-0.012
lette-0.012
asting-0.012
Token the
Token ID262
Feature activation+2.53e-02

Pos logit contributions
cious+0.066
lette+0.059
 burgers+0.059
asting+0.059
 cooker+0.058
Neg logit contributions
 superstar-0.063
ite-0.058
 til-0.058
bird-0.057
 flashes-0.057
Token lady
Token ID10846
Feature activation+4.40e-02

Pos logit contributions
cious+0.036
lette+0.033
 burgers+0.033
asting+0.032
 cooker+0.032
Neg logit contributions
 superstar-0.040
ite-0.038
 til-0.037
bird-0.037
 flashes-0.036
Token at
Token ID379
Feature activation-4.21e-02

Pos logit contributions
cious+0.064
 burgers+0.059
lette+0.058
asting+0.058
 cooker+0.057
Neg logit contributions
 superstar-0.069
ite-0.065
 til-0.063
bird-0.062
bul-0.062
Token the
Token ID262
Feature activation+2.72e-03

Pos logit contributions
 superstar+0.066
ite+0.062
 til+0.060
bird+0.059
 flashes+0.059
Neg logit contributions
cious-0.061
 burgers-0.057
lette-0.056
asting-0.056
 counter-0.055
Token counter
Token ID3753
Feature activation+4.02e-01

Pos logit contributions
cious+0.004
asting+0.004
 burgers+0.004
lette+0.004
ough+0.003
Neg logit contributions
 superstar-0.004
ite-0.004
bird-0.004
 til-0.004
 unicorn-0.004
Token,"
Token ID553
Feature activation+5.39e-02

Pos logit contributions
cious+0.709
asting+0.648
 burgers+0.644
lette+0.632
ough+0.627
Neg logit contributions
 superstar-0.501
ite-0.477
bird-0.458
 til-0.446
bul-0.434
Token mom
Token ID1995
Feature activation-7.24e-02

Pos logit contributions
cious+0.081
 burgers+0.075
lette+0.074
asting+0.073
 counter+0.073
Neg logit contributions
 superstar-0.082
ite-0.077
 til-0.075
 flashes-0.073
bul-0.073
Token said
Token ID531
Feature activation-1.15e-02

Pos logit contributions
 superstar+0.095
 til+0.089
ite+0.088
bird+0.088
 flashes+0.085
Neg logit contributions
cious-0.124
 burgers-0.115
asting-0.113
lette-0.112
 cooker-0.111
Token.
Token ID13
Feature activation-1.32e-03

Pos logit contributions
 superstar+0.013
bird+0.012
 til+0.012
bul+0.011
+0.011
Neg logit contributions
cious-0.021
asting-0.020
 burgers-0.020
ough-0.020
lette-0.020
Token
Token ID198
Feature activation-2.31e-02

Pos logit contributions
 superstar+0.002
ite+0.002
bird+0.002
 til+0.002
 Rhythm+0.002
Neg logit contributions
cious-0.002
asting-0.002
 burgers-0.002
lette-0.002
 cooker-0.002
Token made
Token ID925
Feature activation-6.67e-02

Pos logit contributions
cious+0.076
 burgers+0.069
lette+0.069
 cooker+0.069
asting+0.068
Neg logit contributions
 superstar-0.067
 flashes-0.059
bird-0.059
 til-0.059
ite-0.059
Token a
Token ID257
Feature activation-6.91e-03

Pos logit contributions
 superstar+0.101
ite+0.093
 til+0.092
bird+0.091
 flashes+0.090
Neg logit contributions
cious-0.101
 burgers-0.092
lette-0.092
 cooker-0.090
asting-0.090
Token normal
Token ID3487
Feature activation-1.39e-03

Pos logit contributions
 superstar+0.011
ite+0.010
 til+0.010
bird+0.009
 flashes+0.009
Neg logit contributions
cious-0.010
lette-0.010
 burgers-0.010
asting-0.009
ough-0.009
Token-
Token ID12
Feature activation+2.34e-02

Pos logit contributions
 superstar+0.002
ite+0.002
 til+0.002
bird+0.002
bul+0.002
Neg logit contributions
cious-0.002
 burgers-0.002
lette-0.002
asting-0.002
ough-0.002
Tokent
Token ID83
Feature activation-6.61e-02

Pos logit contributions
cious+0.028
 burgers+0.027
 counter+0.026
 counters+0.026
keeping+0.025
Neg logit contributions
 superstar-0.040
ite-0.040
bird-0.039
bul-0.037
 rider-0.036
Tokenasting
Token ID9222
Feature activation+4.02e-01

Pos logit contributions
 superstar+0.097
bul+0.091
bird+0.088
ite+0.086
 til+0.085
Neg logit contributions
cious-0.103
 burgers-0.091
lette-0.091
 cooker-0.089
 Dinner-0.089
Token wine
Token ID8237
Feature activation+1.81e-01

Pos logit contributions
cious+0.526
lette+0.467
ough+0.446
 burgers+0.444
asting+0.443
Neg logit contributions
 superstar-0.698
bird-0.650
ite-0.649
 til-0.647
 flashes-0.641
Token that
Token ID326
Feature activation-4.00e-02

Pos logit contributions
cious+0.317
lette+0.291
 burgers+0.287
asting+0.282
ough+0.278
Neg logit contributions
 superstar-0.236
ite-0.210
bird-0.208
bul-0.203
 til-0.203
Token was
Token ID373
Feature activation-6.90e-03

Pos logit contributions
 superstar+0.058
ite+0.054
 til+0.053
bird+0.052
bul+0.051
Neg logit contributions
cious-0.063
 burgers-0.058
lette-0.058
asting-0.057
 cooker-0.056
Token sweet
Token ID6029
Feature activation+1.46e-01

Pos logit contributions
 superstar+0.010
 til+0.009
bul+0.009
ite+0.009
 flashes+0.009
Neg logit contributions
cious-0.011
lette-0.010
 burgers-0.010
 cooker-0.010
ough-0.010
Token and
Token ID290
Feature activation-4.02e-03

Pos logit contributions
cious+0.258
lette+0.233
 burgers+0.231
ough+0.229
asting+0.228
Neg logit contributions
 superstar-0.189
bird-0.170
ite-0.168
 til-0.164
bul-0.162
Token put
Token ID1234
Feature activation+9.18e-02

Pos logit contributions
 burgers+0.137
asting+0.133
cious+0.132
 cooker+0.131
ough+0.128
Neg logit contributions
 superstar-0.120
ite-0.114
bird-0.110
 til-0.109
 whist-0.107
Token his
Token ID465
Feature activation+4.72e-02

Pos logit contributions
cious+0.160
lette+0.147
asting+0.143
 cooker+0.142
 burgers+0.140
Neg logit contributions
 superstar-0.117
ite-0.113
 til-0.107
bul-0.106
 Spark-0.105
Token penny
Token ID26369
Feature activation-6.96e-02

Pos logit contributions
cious+0.067
 burgers+0.061
lette+0.061
asting+0.061
 cooker+0.059
Neg logit contributions
 superstar-0.076
ite-0.072
 til-0.070
bird-0.068
bul-0.068
Token on
Token ID319
Feature activation-6.41e-02

Pos logit contributions
 superstar+0.101
ite+0.098
bul+0.092
 til+0.092
bird+0.091
Neg logit contributions
cious-0.108
 burgers-0.100
lette-0.099
asting-0.097
 cooker-0.097
Token the
Token ID262
Feature activation+1.28e-02

Pos logit contributions
 superstar+0.095
ite+0.088
 til+0.087
bird+0.085
 flashes+0.084
Neg logit contributions
cious-0.099
 burgers-0.091
lette-0.091
asting-0.089
 cooker-0.088
Token counter
Token ID3753
Feature activation+4.02e-01

Pos logit contributions
cious+0.017
lette+0.016
asting+0.015
 burgers+0.015
ough+0.015
Neg logit contributions
 superstar-0.021
 unicorn-0.020
ite-0.020
 rider-0.020
bul-0.020
Token.
Token ID13
Feature activation+2.14e-02

Pos logit contributions
cious+0.772
asting+0.709
OU+0.703
 burgers+0.700
lette+0.694
Neg logit contributions
ite-0.405
 superstar-0.401
bul-0.370
-0.356
bird-0.354
Token The
Token ID383
Feature activation-5.26e-02

Pos logit contributions
cious+0.038
asting+0.037
lette+0.037
 burgers+0.036
row+0.035
Neg logit contributions
 superstar-0.027
 Spark-0.025
-0.025
ite-0.024
 Zoom-0.023
Token shop
Token ID6128
Feature activation+3.03e-02

Pos logit contributions
 superstar+0.083
ite+0.079
 unicorn+0.075
bird+0.075
 rider+0.074
Neg logit contributions
cious-0.078
lette-0.070
asting-0.069
ough-0.067
 burgers-0.067
Tokenkeeper
Token ID13884
Feature activation-2.03e-02

Pos logit contributions
cious+0.042
 burgers+0.040
 counter+0.039
lette+0.039
 cooker+0.038
Neg logit contributions
 superstar-0.049
 til-0.046
ite-0.045
 Rhythm-0.044
 flashes-0.044
Token smiled
Token ID13541
Feature activation-1.33e-01

Pos logit contributions
 superstar+0.035
ite+0.034
 flashes+0.033
 Rhythm+0.032
 til+0.031
Neg logit contributions
cious-0.026
 burgers-0.024
 counter-0.024
lette-0.024
asting-0.023
Token an
Token ID281
Feature activation-6.11e-02

Pos logit contributions
cious+0.042
 burgers+0.040
asting+0.039
lette+0.039
 counter+0.038
Neg logit contributions
 superstar-0.049
ite-0.046
 til-0.044
bird-0.043
 flashes-0.043
Token icy
Token ID30284
Feature activation+7.28e-02

Pos logit contributions
 til+0.078
 superstar+0.078
bird+0.076
 rider+0.073
 unicorn+0.073
Neg logit contributions
cious-0.102
 burgers-0.091
keeping-0.091
 cooker-0.090
lette-0.089
Token pastry
Token ID41738
Feature activation+1.08e-01

Pos logit contributions
cious+0.103
lette+0.093
 burgers+0.092
asting+0.092
ough+0.090
Neg logit contributions
 superstar-0.118
ite-0.112
 til-0.109
bird-0.108
bul-0.107
Token on
Token ID319
Feature activation-6.51e-02

Pos logit contributions
cious+0.172
 burgers+0.158
lette+0.157
asting+0.157
 cooker+0.153
Neg logit contributions
 superstar-0.155
ite-0.145
 til-0.141
bird-0.140
bul-0.138
Token the
Token ID262
Feature activation-5.28e-03

Pos logit contributions
 superstar+0.102
ite+0.096
 til+0.093
bird+0.091
bul+0.090
Neg logit contributions
cious-0.095
 burgers-0.089
asting-0.087
lette-0.087
 counter-0.086
Token counter
Token ID3753
Feature activation+4.02e-01

Pos logit contributions
 unicorn+0.009
bird+0.009
 rider+0.008
 cartoon+0.008
 ghost+0.008
Neg logit contributions
cious-0.006
row-0.006
apa-0.006
asting-0.005
ocado-0.005
Token.
Token ID13
Feature activation+1.44e-02

Pos logit contributions
cious+0.710
asting+0.662
 burgers+0.658
lette+0.639
ough+0.638
Neg logit contributions
 superstar-0.469
ite-0.448
bird-0.442
 til-0.423
 flashes-0.413
Token It
Token ID632
Feature activation+3.91e-02

Pos logit contributions
cious+0.026
asting+0.025
lette+0.024
 burgers+0.024
 cooker+0.023
Neg logit contributions
 superstar-0.017
bird-0.016
ite-0.016
 til-0.016
 Rhythm-0.016
Token looked
Token ID3114
Feature activation-5.45e-03

Pos logit contributions
cious+0.052
 burgers+0.048
lette+0.047
asting+0.046
 counter+0.046
Neg logit contributions
 superstar-0.066
ite-0.064
 Rhythm-0.059
 til-0.059
 rider-0.058
Token delicious
Token ID12625
Feature activation+1.33e-01

Pos logit contributions
 superstar+0.009
ite+0.008
 til+0.008
 flashes+0.008
bird+0.008
Neg logit contributions
cious-0.008
lette-0.007
 burgers-0.007
asting-0.007
 cooker-0.007
Token!
Token ID0
Feature activation-4.07e-03

Pos logit contributions
cious+0.256
asting+0.240
lette+0.238
ough+0.238
 counters+0.233
Neg logit contributions
 superstar-0.133
bird-0.117
ite-0.117
 til-0.116
 unicorn-0.111
Token he
Token ID339
Feature activation-7.56e-03

Pos logit contributions
 superstar+0.135
ite+0.126
 til+0.123
bird+0.120
 flashes+0.120
Neg logit contributions
cious-0.138
 burgers-0.128
lette-0.127
asting-0.126
 cooker-0.124
Token made
Token ID925
Feature activation-5.02e-02

Pos logit contributions
 superstar+0.013
ite+0.012
 Rhythm+0.012
 flashes+0.012
 til+0.012
Neg logit contributions
cious-0.010
 burgers-0.009
lette-0.009
asting-0.009
 cooker-0.009
Token some
Token ID617
Feature activation+1.17e-02

Pos logit contributions
 superstar+0.074
 til+0.069
 flashes+0.068
bird+0.067
ite+0.067
Neg logit contributions
cious-0.077
lette-0.069
 burgers-0.068
 cooker-0.068
asting-0.067
Token y
Token ID331
Feature activation+1.46e-01

Pos logit contributions
cious+0.015
 burgers+0.014
lette+0.014
asting+0.014
 counter+0.014
Neg logit contributions
 superstar-0.020
ite-0.019
 til-0.018
 Rhythm-0.018
bird-0.018
Tokenummy
Token ID13513
Feature activation-1.06e-02

Pos logit contributions
cious+0.270
 counter+0.243
 burgers+0.243
asting+0.242
lette+0.242
Neg logit contributions
 superstar-0.166
ite-0.163
bird-0.152
 flashes-0.150
 til-0.148
Token burgers
Token ID34623
Feature activation+4.01e-01

Pos logit contributions
 superstar+0.019
bird+0.018
 til+0.018
 flashes+0.018
ite+0.018
Neg logit contributions
cious-0.013
lette-0.011
ough-0.011
 cooker-0.011
keeping-0.011
Token.
Token ID13
Feature activation+1.37e-02

Pos logit contributions
cious+0.762
lette+0.664
 counters+0.655
asting+0.653
 cooker+0.644
Neg logit contributions
ite-0.432
 superstar-0.428
bird-0.412
bul-0.395
 rider-0.388
Token He
Token ID679
Feature activation-7.69e-03

Pos logit contributions
cious+0.022
asting+0.021
 burgers+0.020
lette+0.020
 cooker+0.020
Neg logit contributions
 superstar-0.019
ite-0.018
bird-0.017
 til-0.017
 Rhythm-0.017
Token put
Token ID1234
Feature activation+8.41e-02

Pos logit contributions
 superstar+0.012
ite+0.011
 til+0.011
bird+0.011
 flashes+0.011
Neg logit contributions
cious-0.011
 burgers-0.010
lette-0.010
asting-0.010
 cooker-0.010
Token them
Token ID606
Feature activation+4.45e-02

Pos logit contributions
cious+0.144
lette+0.128
keeping+0.125
 cooker+0.124
asting+0.124
Neg logit contributions
 superstar-0.110
ite-0.103
 til-0.102
bird-0.100
bul-0.100
Token on
Token ID319
Feature activation-7.18e-02

Pos logit contributions
cious+0.076
 counter+0.069
lette+0.069
 cooker+0.068
 burgers+0.067
Neg logit contributions
 superstar-0.057
ite-0.056
 til-0.052
bird-0.052
bul-0.051
Tokenucked
Token ID17758
Feature activation-6.22e-04

Pos logit contributions
cious+0.205
 burgers+0.194
 counter+0.185
 cooker+0.184
lette+0.183
Neg logit contributions
 superstar-0.252
ite-0.244
 til-0.235
bird-0.234
bul-0.230
Token the
Token ID262
Feature activation+7.58e-03

Pos logit contributions
 superstar+0.001
 til+0.001
ite+0.001
bird+0.001
 flashes+0.001
Neg logit contributions
cious-0.001
lette-0.001
 burgers-0.001
asting-0.001
 cooker-0.001
Token onion
Token ID21670
Feature activation+1.77e-01

Pos logit contributions
cious+0.011
 burgers+0.010
lette+0.010
asting+0.010
 cooker+0.010
Neg logit contributions
 superstar-0.012
ite-0.011
 til-0.011
bird-0.010
 flashes-0.010
Token from
Token ID422
Feature activation+7.80e-02

Pos logit contributions
cious+0.293
asting+0.268
 burgers+0.266
lette+0.266
 cooker+0.257
Neg logit contributions
 superstar-0.235
ite-0.228
bird-0.222
 til-0.220
bul-0.217
Token the
Token ID262
Feature activation+4.68e-03

Pos logit contributions
cious+0.110
 burgers+0.104
lette+0.101
 counter+0.100
asting+0.100
Neg logit contributions
 superstar-0.127
ite-0.120
 til-0.113
bul-0.112
 flashes-0.112
Token counter
Token ID3753
Feature activation+4.01e-01

Pos logit contributions
cious+0.007
asting+0.006
lette+0.006
 burgers+0.006
ough+0.006
Neg logit contributions
 superstar-0.007
ite-0.007
bird-0.007
bul-0.007
 til-0.007
Token.
Token ID13
Feature activation+1.59e-02

Pos logit contributions
cious+0.723
asting+0.680
 burgers+0.669
ough+0.657
lette+0.653
Neg logit contributions
 superstar-0.443
bird-0.433
ite-0.432
 til-0.412
bul-0.400
Token
Token ID198
Feature activation-1.33e-02

Pos logit contributions
cious+0.027
asting+0.026
lette+0.025
 burgers+0.025
 cooker+0.025
Neg logit contributions
 superstar-0.020
bird-0.019
ite-0.019
 Rhythm-0.019
 til-0.019
Token
Token ID198
Feature activation-1.57e-02

Pos logit contributions
bird+0.013
ite+0.013
 superstar+0.013
 Rhythm+0.013
 til+0.012
Neg logit contributions
cious-0.026
 burgers-0.025
lette-0.024
 cooker-0.024
 counter-0.023
TokenWhen
Token ID2215
Feature activation+5.69e-02

Pos logit contributions
ite+0.021
Because+0.021
bird+0.021
 superstar+0.021
Zero+0.021
Neg logit contributions
 burgers-0.024
cious-0.024
 cooker-0.023
lette-0.023
 counter-0.022
Token Lily
Token ID20037
Feature activation+9.10e-03

Pos logit contributions
cious+0.086
 burgers+0.079
lette+0.079
asting+0.078
 cooker+0.077
Neg logit contributions
 superstar-0.087
ite-0.081
 til-0.079
bird-0.078
 flashes-0.077
Token big
Token ID1263
Feature activation-2.33e-02

Pos logit contributions
 superstar+0.004
ite+0.003
 til+0.003
bird+0.003
 flashes+0.003
Neg logit contributions
cious-0.004
 burgers-0.003
lette-0.003
asting-0.003
 cooker-0.003
Token,
Token ID11
Feature activation+9.27e-03

Pos logit contributions
 superstar+0.038
ite+0.036
 til+0.035
 Rhythm+0.035
bird+0.034
Neg logit contributions
cious-0.032
 burgers-0.030
 counter-0.029
 cooker-0.029
lette-0.029
Token sweet
Token ID6029
Feature activation+1.47e-01

Pos logit contributions
cious+0.017
lette+0.017
ough+0.016
keeping+0.016
 cooker+0.016
Neg logit contributions
 superstar-0.010
 til-0.010
 flashes-0.009
ite-0.009
 flash-0.009
Token-
Token ID12
Feature activation+2.24e-02

Pos logit contributions
cious+0.233
ough+0.218
lette+0.212
keeping+0.205
asting+0.205
Neg logit contributions
 superstar-0.222
bird-0.207
 unicorn-0.203
 til-0.201
 flash-0.197
Tokent
Token ID83
Feature activation-6.74e-02

Pos logit contributions
cious+0.028
 burgers+0.027
 counters+0.027
 cooker+0.026
 counter+0.026
Neg logit contributions
ite-0.037
 superstar-0.037
bird-0.037
 unicorn-0.035
bul-0.034
Tokenasting
Token ID9222
Feature activation+4.01e-01

Pos logit contributions
 superstar+0.082
bul+0.078
bird+0.076
 flashes+0.076
 unicorn+0.074
Neg logit contributions
cious-0.116
 Dinner-0.111
 counters-0.108
lette-0.107
 burgers-0.107
Token fruit
Token ID8234
Feature activation+2.46e-01

Pos logit contributions
cious+0.568
lette+0.532
ough+0.506
 cooker+0.495
keeping+0.489
Neg logit contributions
 superstar-0.635
 flashes-0.616
 til-0.607
bird-0.600
 flash-0.598
Token tree
Token ID5509
Feature activation+2.67e-02

Pos logit contributions
cious+0.444
lette+0.422
asting+0.412
 burgers+0.409
ough+0.403
Neg logit contributions
 superstar-0.285
ite-0.279
bird-0.257
 til-0.255
bul-0.248
Token!
Token ID0
Feature activation-4.72e-03

Pos logit contributions
cious+0.043
 burgers+0.040
lette+0.040
asting+0.040
 cooker+0.040
Neg logit contributions
 superstar-0.036
ite-0.035
 til-0.033
bird-0.033
 flashes-0.032
Token
Token ID198
Feature activation-1.25e-02

Pos logit contributions
 superstar+0.007
ite+0.006
 til+0.006
bul+0.006
 rider+0.006
Neg logit contributions
cious-0.007
lette-0.007
 counter-0.007
 burgers-0.007
 cooker-0.006
Token
Token ID198
Feature activation-1.26e-02

Pos logit contributions
 superstar+0.021
ite+0.019
bul+0.018
 flashes+0.018
 rider+0.018
Neg logit contributions
cious-0.018
asting-0.016
lette-0.016
 burgers-0.015
 counter-0.015
Token a
Token ID257
Feature activation-1.16e-03

Pos logit contributions
cious+0.079
lette+0.073
asting+0.072
 burgers+0.072
ough+0.071
Neg logit contributions
 superstar-0.075
ite-0.069
 til-0.068
 flashes-0.067
bird-0.067
Token beautiful
Token ID4950
Feature activation-1.10e-01

Pos logit contributions
 superstar+0.002
ite+0.002
 til+0.002
bird+0.002
bul+0.002
Neg logit contributions
cious-0.002
lette-0.002
asting-0.001
 burgers-0.001
ough-0.001
Token cheese
Token ID9891
Feature activation+1.38e-01

Pos logit contributions
 superstar+0.176
ite+0.161
 til+0.160
bird+0.159
bul+0.156
Neg logit contributions
cious-0.160
asting-0.145
 burgers-0.145
lette-0.145
ough-0.143
Token o
Token ID267
Feature activation+2.16e-01

Pos logit contributions
cious+0.235
ough+0.221
asting+0.216
 burgers+0.215
lette+0.214
Neg logit contributions
 superstar-0.180
bird-0.169
ite-0.167
 unicorn-0.165
 rider-0.160
Tokenme
Token ID1326
Feature activation-1.35e-02

Pos logit contributions
cious+0.322
 burgers+0.306
 cooker+0.294
keeping+0.292
 oh+0.291
Neg logit contributions
ite-0.322
-0.315
 superstar-0.305
inkle-0.302
bird-0.301
Tokenlette
Token ID21348
Feature activation+4.01e-01

Pos logit contributions
bird+0.030
ite+0.029
 superstar+0.028
inkle+0.028
 unicorn+0.027
Neg logit contributions
 burgers-0.008
 cooker-0.008
cious-0.008
 counter-0.008
 counters-0.007
Token.
Token ID13
Feature activation+1.17e-02

Pos logit contributions
cious+0.701
ough+0.656
 burgers+0.652
asting+0.647
lette+0.640
Neg logit contributions
 superstar-0.495
ite-0.459
bird-0.452
 unicorn-0.432
bul-0.430
Token John
Token ID1757
Feature activation+3.18e-02

Pos logit contributions
cious+0.020
asting+0.019
lette+0.019
 burgers+0.018
row+0.018
Neg logit contributions
 superstar-0.015
bird-0.015
ite-0.014
 Rhythm-0.014
 Spark-0.014
Token was
Token ID373
Feature activation-2.77e-03

Pos logit contributions
cious+0.049
 burgers+0.046
lette+0.045
asting+0.045
 cooker+0.044
Neg logit contributions
 superstar-0.047
ite-0.044
 til-0.043
bird-0.042
bul-0.042
Token so
Token ID523
Feature activation+1.41e-01

Pos logit contributions
 superstar+0.005
 til+0.005
bird+0.004
 flashes+0.004
ite+0.004
Neg logit contributions
cious-0.004
 burgers-0.003
 cooker-0.003
asting-0.003
ough-0.003
Token pleased
Token ID10607
Feature activation+4.52e-02

Pos logit contributions
cious+0.227
 burgers+0.213
 cooker+0.209
lette+0.208
ough+0.208
Neg logit contributions
 superstar-0.193
 til-0.181
ite-0.177
bird-0.172
 flashes-0.171
Token
Token ID198
Feature activation-1.00e-02

Pos logit contributions
cious+0.209
 burgers+0.200
 cooker+0.195
asting+0.193
lette+0.191
Neg logit contributions
ite-0.114
 superstar-0.112
bird-0.108
 til-0.106
 Rhythm-0.105
Token
Token ID198
Feature activation-1.07e-02

Pos logit contributions
 superstar+0.010
ite+0.010
bird+0.010
 Rhythm+0.010
 til+0.010
Neg logit contributions
cious-0.019
 burgers-0.018
lette-0.018
 cooker-0.018
 counter-0.017
TokenThe
Token ID464
Feature activation+2.77e-02

Pos logit contributions
 superstar+0.013
Because+0.013
Brian+0.013
bird+0.013
ite+0.013
Neg logit contributions
cious-0.018
 burgers-0.018
 cooker-0.017
 counter-0.017
lette-0.017
Token o
Token ID267
Feature activation+2.17e-01

Pos logit contributions
cious+0.037
lette+0.033
asting+0.033
 burgers+0.033
ough+0.033
Neg logit contributions
 superstar-0.047
ite-0.044
 til-0.043
bird-0.043
 flashes-0.042
Tokenme
Token ID1326
Feature activation-1.32e-02

Pos logit contributions
cious+0.334
 burgers+0.312
 cooker+0.300
 counter+0.298
asting+0.297
Neg logit contributions
 superstar-0.321
ite-0.309
bird-0.294
 til-0.289
bul-0.285
Tokenlette
Token ID21348
Feature activation+4.01e-01

Pos logit contributions
 superstar+0.024
bird+0.023
ite+0.023
bul+0.022
 til+0.021
Neg logit contributions
cious-0.015
 burgers-0.015
 counter-0.014
 cooker-0.014
 counters-0.013
Token tasted
Token ID29187
Feature activation+1.11e-01

Pos logit contributions
cious+0.526
lette+0.499
asting+0.497
 burgers+0.494
 cooker+0.474
Neg logit contributions
 superstar-0.654
 Rhythm-0.634
ite-0.626
 til-0.605
 flashes-0.601
Token even
Token ID772
Feature activation-9.45e-02

Pos logit contributions
cious+0.163
lette+0.146
keeping+0.145
 cooker+0.144
ough+0.143
Neg logit contributions
 superstar-0.176
 til-0.162
 flashes-0.156
bird-0.156
ite-0.154
Token better
Token ID1365
Feature activation-1.32e-01

Pos logit contributions
 superstar+0.136
ite+0.131
+0.127
 Rhythm+0.126
ane+0.123
Neg logit contributions
 burgers-0.145
asting-0.143
cious-0.139
lette-0.135
 counter-0.135
Token than
Token ID621
Feature activation-5.26e-02

Pos logit contributions
 superstar+0.190
ite+0.178
 til+0.173
bird+0.170
 flashes+0.169
Neg logit contributions
cious-0.212
 burgers-0.197
lette-0.195
asting-0.193
 cooker-0.191
Token it
Token ID340
Feature activation+1.05e-02

Pos logit contributions
 superstar+0.084
ite+0.080
 til+0.078
bird+0.076
 flashes+0.076
Neg logit contributions
cious-0.075
 burgers-0.070
asting-0.069
lette-0.068
 counter-0.067
Token they
Token ID484
Feature activation+3.05e-02

Pos logit contributions
cious+0.134
lette+0.124
asting+0.123
 burgers+0.122
 cooker+0.119
Neg logit contributions
 superstar-0.108
ite-0.103
 til-0.100
bird-0.097
 flashes-0.095
Token had
Token ID550
Feature activation+5.92e-02

Pos logit contributions
cious+0.041
 burgers+0.038
lette+0.037
asting+0.037
 cooker+0.036
Neg logit contributions
 superstar-0.051
ite-0.048
 til-0.047
bul-0.046
 flashes-0.046
Token the
Token ID262
Feature activation+1.42e-02

Pos logit contributions
cious+0.093
lette+0.085
 cooker+0.085
asting+0.084
 counter+0.084
Neg logit contributions
 superstar-0.084
ite-0.079
 flashes-0.076
 til-0.076
bird-0.074
Token o
Token ID267
Feature activation+2.15e-01

Pos logit contributions
cious+0.020
 burgers+0.020
 counter+0.020
lette+0.019
 cooker+0.019
Neg logit contributions
 superstar-0.022
 til-0.020
ite-0.020
 flashes-0.020
 Rhythm-0.020
Tokenme
Token ID1326
Feature activation-1.36e-02

Pos logit contributions
cious+0.255
 burgers+0.230
 cooker+0.227
 counter+0.225
keeping+0.225
Neg logit contributions
ite-0.383
 superstar-0.381
bird-0.355
inkle-0.355
 til-0.353
Tokenlette
Token ID21348
Feature activation+4.00e-01

Pos logit contributions
ite+0.024
bird+0.024
 superstar+0.023
inkle+0.022
ane+0.022
Neg logit contributions
cious-0.015
 burgers-0.015
 cooker-0.014
 counter-0.014
 counters-0.014
Token put
Token ID1234
Feature activation+8.50e-02

Pos logit contributions
cious+0.667
 burgers+0.606
asting+0.606
lette+0.604
 cooker+0.601
Neg logit contributions
 superstar-0.527
ite-0.507
bird-0.474
 til-0.468
bul-0.464
Token together
Token ID1978
Feature activation-1.38e-01

Pos logit contributions
cious+0.144
 cooker+0.128
lette+0.126
ough+0.126
 counters+0.126
Neg logit contributions
ite-0.107
 superstar-0.106
bird-0.100
bul-0.098
 til-0.098
Token.
Token ID13
Feature activation+2.03e-02

Pos logit contributions
 superstar+0.179
ite+0.171
bul+0.161
bird+0.161
 til+0.161
Neg logit contributions
cious-0.232
 burgers-0.212
lette-0.211
 cooker-0.209
asting-0.207
Token They
Token ID1119
Feature activation+6.53e-02

Pos logit contributions
cious+0.033
asting+0.031
 burgers+0.031
lette+0.031
 cooker+0.030
Neg logit contributions
 superstar-0.028
ite-0.026
bird-0.026
 Rhythm-0.025
 til-0.025
Token were
Token ID547
Feature activation+2.22e-02

Pos logit contributions
cious+0.090
 burgers+0.083
asting+0.082
lette+0.082
 counter+0.079
Neg logit contributions
 superstar-0.107
ite-0.101
 til-0.098
 flashes-0.097
bul-0.097
Token to
Token ID284
Feature activation+1.30e-02

Pos logit contributions
cious+0.069
 burgers+0.068
lette+0.066
asting+0.064
 counter+0.063
Neg logit contributions
 superstar-0.099
ite-0.093
 til-0.091
 rider-0.088
 Rhythm-0.087
Token try
Token ID1949
Feature activation-8.67e-03

Pos logit contributions
cious+0.015
asting+0.014
lette+0.014
 burgers+0.014
ough+0.014
Neg logit contributions
 superstar-0.024
ite-0.023
 til-0.023
bul-0.022
 rider-0.022
Token one
Token ID530
Feature activation-1.54e-01

Pos logit contributions
 superstar+0.016
ite+0.015
 rider+0.015
bul+0.014
 til+0.014
Neg logit contributions
 burgers-0.010
cious-0.010
asting-0.009
lette-0.009
 counter-0.009
Token of
Token ID286
Feature activation-2.37e-02

Pos logit contributions
 superstar+0.202
 til+0.182
ite+0.181
 flashes+0.176
 Rhythm+0.174
Neg logit contributions
cious-0.261
 burgers-0.250
lette-0.247
asting-0.245
 counter-0.242
Token the
Token ID262
Feature activation-9.03e-03

Pos logit contributions
 superstar+0.038
ite+0.035
 til+0.035
bird+0.034
 flashes+0.034
Neg logit contributions
cious-0.034
lette-0.031
 burgers-0.030
asting-0.030
 cooker-0.030
Token burgers
Token ID34623
Feature activation+4.00e-01

Pos logit contributions
 superstar+0.016
 til+0.016
ite+0.016
 flashes+0.015
 unicorn+0.015
Neg logit contributions
cious-0.010
ough-0.009
lette-0.009
asting-0.009
keeping-0.009
Token?"
Token ID1701
Feature activation-8.49e-02

Pos logit contributions
cious+0.748
lette+0.678
 burgers+0.677
asting+0.675
 cooker+0.661
Neg logit contributions
 superstar-0.469
ite-0.447
 til-0.419
bird-0.416
bul-0.408
Token asked
Token ID1965
Feature activation-7.20e-02

Pos logit contributions
 superstar+0.137
ite+0.132
 rider+0.122
 til+0.122
 flashes+0.121
Neg logit contributions
cious-0.118
 burgers-0.110
 counter-0.107
 cooker-0.107
lette-0.105
Token Aunt
Token ID38074
Feature activation-1.77e-01

Pos logit contributions
 superstar+0.113
ite+0.106
 til+0.103
 flashes+0.101
bird+0.100
Neg logit contributions
cious-0.105
 burgers-0.097
lette-0.096
asting-0.095
 cooker-0.094
Token S
Token ID311
Feature activation-5.06e-02

Pos logit contributions
 superstar+0.269
ite+0.254
 til+0.246
 flashes+0.241
 Rhythm+0.237
Neg logit contributions
cious-0.263
 burgers-0.245
lette-0.243
asting-0.241
 counter-0.237
Tokenari
Token ID2743
Feature activation-6.80e-02

Pos logit contributions
 superstar+0.082
ite+0.077
bird+0.076
 til+0.075
bul+0.074
Neg logit contributions
cious-0.071
 burgers-0.066
lette-0.064
 cooker-0.063
asting-0.063
Token having
Token ID1719
Feature activation-3.77e-02

Pos logit contributions
 superstar+0.009
ite+0.008
 til+0.008
bird+0.008
bul+0.008
Neg logit contributions
cious-0.014
 burgers-0.013
lette-0.013
asting-0.013
 cooker-0.013
Token too
Token ID1165
Feature activation-1.96e-01

Pos logit contributions
 superstar+0.062
ite+0.060
bird+0.057
 til+0.056
bul+0.056
Neg logit contributions
cious-0.052
 burgers-0.051
asting-0.048
lette-0.047
 counter-0.047
Token much
Token ID881
Feature activation-6.13e-03

Pos logit contributions
 superstar+0.282
ite+0.272
 til+0.260
 flashes+0.257
bird+0.253
Neg logit contributions
cious-0.306
lette-0.284
 burgers-0.283
asting-0.277
 counter-0.277
Token fun
Token ID1257
Feature activation-9.39e-02

Pos logit contributions
 superstar+0.010
ite+0.009
bird+0.009
 til+0.008
 Rhythm+0.008
Neg logit contributions
cious-0.009
 burgers-0.009
asting-0.008
lette-0.008
 counter-0.008
Token ro
Token ID686
Feature activation-3.53e-02

Pos logit contributions
 superstar+0.149
ite+0.141
 til+0.136
bird+0.134
 flashes+0.133
Neg logit contributions
cious-0.136
 burgers-0.126
asting-0.124
lette-0.123
 cooker-0.121
Tokenasting
Token ID9222
Feature activation+4.00e-01

Pos logit contributions
ite+0.062
 superstar+0.062
inkle+0.058
 Rhythm+0.058
bul+0.057
Neg logit contributions
cious-0.042
 burgers-0.039
 counter-0.037
lette-0.036
 cooker-0.036
Token her
Token ID607
Feature activation+1.34e-02

Pos logit contributions
cious+0.624
lette+0.572
 burgers+0.545
asting+0.544
ough+0.540
Neg logit contributions
 superstar-0.592
 flashes-0.562
 til-0.560
ite-0.550
bul-0.546
Token marsh
Token ID22397
Feature activation+4.94e-02

Pos logit contributions
cious+0.018
 burgers+0.017
lette+0.017
asting+0.016
 counter+0.016
Neg logit contributions
 superstar-0.022
ite-0.021
 til-0.020
bird-0.020
bul-0.020
Tokenmallow
Token ID42725
Feature activation-1.64e-01

Pos logit contributions
cious+0.091
 burgers+0.085
lette+0.084
asting+0.084
 cooker+0.083
Neg logit contributions
 superstar-0.059
ite-0.054
 til-0.052
bird-0.051
bul-0.051
Token.
Token ID13
Feature activation+8.67e-03

Pos logit contributions
 superstar+0.231
ite+0.214
 til+0.208
bird+0.206
bul+0.204
Neg logit contributions
cious-0.263
 burgers-0.244
lette-0.243
asting-0.240
 cooker-0.237
Token
Token ID198
Feature activation-1.31e-02

Pos logit contributions
cious+0.016
asting+0.016
lette+0.016
 burgers+0.015
 cooker+0.015
Neg logit contributions
 superstar-0.010
 til-0.009
 Spark-0.009
 Rhythm-0.009
ite-0.009
Token to
Token ID284
Feature activation+2.07e-02

Pos logit contributions
cious+0.103
keeping+0.095
 burgers+0.095
 counter+0.093
asting+0.090
Neg logit contributions
 superstar-0.173
ite-0.155
 flashes-0.155
 til-0.154
bird-0.154
Token the
Token ID262
Feature activation+1.96e-03

Pos logit contributions
cious+0.031
lette+0.029
 burgers+0.028
asting+0.028
 cooker+0.028
Neg logit contributions
 superstar-0.032
 til-0.029
ite-0.029
 Rhythm-0.028
bird-0.028
Token check
Token ID2198
Feature activation-4.98e-03

Pos logit contributions
cious+0.003
 burgers+0.003
 counter+0.003
asting+0.002
lette+0.002
Neg logit contributions
 superstar-0.003
ite-0.003
 til-0.003
bird-0.003
 flashes-0.003
Token-
Token ID12
Feature activation+2.91e-02

Pos logit contributions
 superstar+0.006
ite+0.006
bird+0.006
bul+0.006
 rider+0.006
Neg logit contributions
cious-0.009
lette-0.008
 burgers-0.008
asting-0.008
 cooker-0.008
Tokenin
Token ID259
Feature activation-7.92e-02

Pos logit contributions
cious+0.038
 burgers+0.034
lette+0.034
asting+0.033
 cooker+0.033
Neg logit contributions
 superstar-0.050
ite-0.049
bird-0.046
 til-0.046
bul-0.046
Token counter
Token ID3753
Feature activation+3.99e-01

Pos logit contributions
 superstar+0.120
ite+0.114
 til+0.110
bird+0.109
bul+0.108
Neg logit contributions
cious-0.120
lette-0.110
 burgers-0.109
asting-0.108
 cooker-0.105
Token to
Token ID284
Feature activation+1.93e-02

Pos logit contributions
cious+0.675
lette+0.615
 burgers+0.615
asting+0.609
 cooker+0.599
Neg logit contributions
 superstar-0.531
ite-0.499
 til-0.475
bird-0.467
bul-0.464
Token find
Token ID1064
Feature activation+1.42e-01

Pos logit contributions
cious+0.029
lette+0.026
 burgers+0.025
 cooker+0.025
asting+0.025
Neg logit contributions
 superstar-0.030
 til-0.028
ite-0.028
bird-0.027
bul-0.027
Token their
Token ID511
Feature activation+9.83e-02

Pos logit contributions
cious+0.253
lette+0.225
asting+0.218
ough+0.216
 cooker+0.213
Neg logit contributions
 superstar-0.171
 flashes-0.165
ite-0.162
bul-0.162
 smiles-0.160
Token own
Token ID898
Feature activation-1.46e-01

Pos logit contributions
cious+0.130
 burgers+0.128
 counter+0.125
 cooker+0.121
lette+0.118
Neg logit contributions
 superstar-0.168
ite-0.156
bird-0.155
 til-0.153
 flashes-0.153
Token suitcase
Token ID45391
Feature activation-1.04e-02

Pos logit contributions
 superstar+0.213
ite+0.202
 unicorn+0.195
bul+0.195
 til+0.193
Neg logit contributions
cious-0.226
lette-0.208
 burgers-0.200
asting-0.200
ough-0.198
Token She
Token ID1375
Feature activation-3.86e-02

Pos logit contributions
cious+0.015
 burgers+0.014
lette+0.014
asting+0.014
 cooker+0.014
Neg logit contributions
 superstar-0.014
ite-0.013
 til-0.013
bird-0.012
 flashes-0.012
Token felt
Token ID2936
Feature activation-7.67e-02

Pos logit contributions
 superstar+0.065
ite+0.063
bul+0.060
bird+0.059
 Rhythm+0.059
Neg logit contributions
cious-0.050
 burgers-0.046
lette-0.045
 counter-0.045
asting-0.045
Token like
Token ID588
Feature activation-7.25e-02

Pos logit contributions
 superstar+0.118
ite+0.109
 til+0.109
 flashes+0.105
bird+0.105
Neg logit contributions
cious-0.115
 burgers-0.106
lette-0.105
asting-0.103
 cooker-0.103
Token a
Token ID257
Feature activation-3.70e-03

Pos logit contributions
 superstar+0.114
ite+0.111
 til+0.106
bird+0.103
 flashes+0.103
Neg logit contributions
cious-0.102
 burgers-0.097
lette-0.094
asting-0.093
 counter-0.093
Token real
Token ID1103
Feature activation-7.70e-02

Pos logit contributions
ite+0.006
ays+0.006
 til+0.006
ane+0.006
 flashes+0.006
Neg logit contributions
cious-0.005
 counter-0.005
 burgers-0.004
 cooker-0.004
 stove-0.004
Token superstar
Token ID28422
Feature activation-4.64e-01

Pos logit contributions
ite+0.115
ays+0.112
Zero+0.111
Brian+0.111
pers+0.110
Neg logit contributions
 counter-0.104
cious-0.104
 food-0.103
 burgers-0.103
 cooker-0.102
Token!
Token ID0
Feature activation-1.59e-02

Pos logit contributions
 superstar+0.697
ite+0.655
 til+0.639
 flashes+0.624
bird+0.624
Neg logit contributions
cious-0.708
 burgers-0.649
lette-0.646
asting-0.640
 cooker-0.629
Token
Token ID198
Feature activation-1.77e-02

Pos logit contributions
 superstar+0.025
 til+0.024
 flashes+0.023
ite+0.023
bul+0.023
Neg logit contributions
cious-0.022
 counter-0.020
lette-0.020
 burgers-0.020
ough-0.019
Token
Token ID198
Feature activation-1.93e-02

Pos logit contributions
 superstar+0.030
ite+0.028
 til+0.028
bul+0.027
 flashes+0.027
Neg logit contributions
cious-0.023
 burgers-0.021
lette-0.021
asting-0.021
 counter-0.021
TokenL
Token ID43
Feature activation+3.80e-02

Pos logit contributions
 superstar+0.029
ite+0.027
bird+0.026
 til+0.026
 Rhythm+0.026
Neg logit contributions
cious-0.030
 burgers-0.028
lette-0.027
 cooker-0.027
 counter-0.027
Tokenily
Token ID813
Feature activation+2.37e-01

Pos logit contributions
cious+0.029
 burgers+0.025
lette+0.023
 cooker+0.023
 counter+0.022
Neg logit contributions
 superstar-0.087
ite-0.084
bird-0.082
 til-0.082
bul-0.081
Token.
Token ID13
Feature activation+1.20e-02

Pos logit contributions
 superstar+0.129
ite+0.120
 til+0.119
 flashes+0.116
bird+0.115
Neg logit contributions
cious-0.130
 burgers-0.119
lette-0.119
asting-0.118
 counter-0.117
Token She
Token ID1375
Feature activation-4.11e-02

Pos logit contributions
cious+0.019
 burgers+0.018
lette+0.018
asting+0.018
 cooker+0.017
Neg logit contributions
 superstar-0.017
ite-0.016
 til-0.016
bird-0.016
 flashes-0.015
Token became
Token ID2627
Feature activation-7.24e-02

Pos logit contributions
 superstar+0.069
ite+0.067
+0.065
 Rhythm+0.064
bul+0.064
Neg logit contributions
cious-0.052
 counter-0.048
 burgers-0.046
lette-0.045
 cooker-0.045
Token a
Token ID257
Feature activation+4.04e-03

Pos logit contributions
 superstar+0.124
ite+0.120
ller+0.114
+0.113
bul+0.112
Neg logit contributions
cious-0.092
 burgers-0.089
asting-0.086
lette-0.085
 counter-0.083
Token fashion
Token ID6977
Feature activation-9.40e-02

Pos logit contributions
cious+0.005
 cooker+0.005
 counter+0.005
 burgers+0.004
lette+0.004
Neg logit contributions
ays-0.007
 flashes-0.007
ite-0.007
 til-0.007
pers-0.006
Token superstar
Token ID28422
Feature activation-4.47e-01

Pos logit contributions
 til+0.116
 flashes+0.114
 Rhythm+0.106
pers+0.106
 flash+0.104
Neg logit contributions
 counter-0.158
cious-0.154
 cooker-0.152
 burgers-0.144
asting-0.142
Token,
Token ID11
Feature activation+9.70e-03

Pos logit contributions
 superstar+0.665
ite+0.625
 til+0.611
 flashes+0.598
bird+0.596
Neg logit contributions
cious-0.685
 burgers-0.626
lette-0.624
asting-0.619
 counter-0.608
Token admired
Token ID29382
Feature activation-1.00e-01

Pos logit contributions
cious+0.013
asting+0.011
lette+0.011
 counter+0.011
 burgers+0.011
Neg logit contributions
ite-0.016
 superstar-0.016
 flashes-0.015
 bolt-0.015
bul-0.015
Token by
Token ID416
Feature activation+1.28e-02

Pos logit contributions
 superstar+0.175
ite+0.173
 til+0.166
 bolt+0.163
+0.163
Neg logit contributions
cious-0.120
asting-0.108
 burgers-0.107
 counter-0.106
lette-0.104
Token all
Token ID477
Feature activation-6.03e-02

Pos logit contributions
cious+0.017
asting+0.016
 counter+0.016
 burgers+0.015
lette+0.015
Neg logit contributions
ite-0.022
 superstar-0.021
 rider-0.020
 til-0.019
-0.019
Token of
Token ID286
Feature activation-1.54e-02

Pos logit contributions
 superstar+0.100
ite+0.093
 til+0.092
+0.088
bird+0.088
Neg logit contributions
cious-0.081
asting-0.077
 burgers-0.075
lette-0.074
 counter-0.074
Token we
Token ID356
Feature activation-8.21e-03

Pos logit contributions
cious+0.017
 burgers+0.016
lette+0.015
asting+0.015
 counter+0.015
Neg logit contributions
 superstar-0.022
ite-0.021
 til-0.020
bird-0.020
bul-0.020
Token can
Token ID460
Feature activation-9.10e-02

Pos logit contributions
 superstar+0.013
 til+0.012
 Rhythm+0.012
ller+0.012
 flashes+0.012
Neg logit contributions
cious-0.011
asting-0.011
lette-0.011
 burgers-0.010
ough-0.010
Token get
Token ID651
Feature activation+7.55e-02

Pos logit contributions
 superstar+0.189
bul+0.173
 flashes+0.173
 moons+0.171
 til+0.168
Neg logit contributions
ough-0.084
asting-0.082
 counter-0.075
line-0.074
 grill-0.073
Token your
Token ID534
Feature activation+8.37e-02

Pos logit contributions
 burgers+0.084
 food+0.080
 mud+0.075
 dirt+0.074
 bread+0.074
Neg logit contributions
 superstar-0.153
bird-0.135
 Rhythm-0.134
azz-0.134
-0.133
Token k
Token ID479
Feature activation-1.50e-01

Pos logit contributions
 food+0.112
 bread+0.111
 burgers+0.110
 counter+0.109
 wheat+0.107
Neg logit contributions
 superstar-0.134
azz-0.121
Because-0.120
 til-0.119
Zero-0.118
Tokenite
Token ID578
Feature activation-4.09e-01

Pos logit contributions
 til+0.147
 superstar+0.147
 flashes+0.139
 Zero+0.124
 moons+0.123
Neg logit contributions
lette-0.279
ough-0.275
row-0.273
'-0.264
 bread-0.264
Token back
Token ID736
Feature activation+7.86e-02

Pos logit contributions
 superstar+0.767
 til+0.716
 Rhythm+0.705
 flashes+0.703
Zero+0.677
Neg logit contributions
 burgers-0.427
lette-0.424
cious-0.419
asting-0.417
 counter-0.417
Token.
Token ID13
Feature activation-6.84e-03

Pos logit contributions
cious+0.072
keeping+0.070
asting+0.069
 counter+0.068
 burgers+0.068
Neg logit contributions
 superstar-0.161
 til-0.148
 Rhythm-0.145
 flashes-0.144
Zero-0.143
Token Maybe
Token ID6674
Feature activation-1.07e-01

Pos logit contributions
 superstar+0.011
ite+0.010
 til+0.010
 flashes+0.010
bul+0.010
Neg logit contributions
cious-0.009
lette-0.009
 burgers-0.009
 counter-0.008
 cooker-0.008
Token we
Token ID356
Feature activation-8.27e-03

Pos logit contributions
 superstar+0.187
ite+0.173
 til+0.171
bird+0.168
 flashes+0.167
Neg logit contributions
cious-0.136
 burgers-0.129
 counter-0.125
lette-0.124
asting-0.122
Token can
Token ID460
Feature activation-9.20e-02

Pos logit contributions
 superstar+0.013
 til+0.012
 flashes+0.012
ller+0.012
 Rhythm+0.012
Neg logit contributions
cious-0.011
lette-0.011
asting-0.011
 burgers-0.010
 cooker-0.010
Token had
Token ID550
Feature activation+3.31e-02

Pos logit contributions
 superstar+0.106
+0.101
ays+0.100
ement+0.099
ller+0.099
Neg logit contributions
cious-0.046
asting-0.033
ocado-0.032
lette-0.031
ough-0.031
Token a
Token ID257
Feature activation-1.45e-02

Pos logit contributions
cious+0.047
 burgers+0.043
lette+0.043
asting+0.042
 counter+0.041
Neg logit contributions
 superstar-0.054
ite-0.051
 til-0.049
bul-0.049
bird-0.048
Token big
Token ID1263
Feature activation-4.16e-02

Pos logit contributions
ays+0.026
 flashes+0.025
 superstar+0.025
yles+0.024
ller+0.024
Neg logit contributions
cious-0.017
 grill-0.016
 wheat-0.016
 stove-0.015
 counter-0.015
Token red
Token ID2266
Feature activation-1.34e-01

Pos logit contributions
 superstar+0.069
Because+0.064
yles+0.064
ller+0.063
azz+0.063
Neg logit contributions
cious-0.050
 grill-0.050
 counter-0.049
 stove-0.049
 burgers-0.049
Token k
Token ID479
Feature activation-1.59e-01

Pos logit contributions
 superstar+0.200
 til+0.193
 flashes+0.186
Brian+0.184
 Rhythm+0.182
Neg logit contributions
cious-0.181
 counter-0.179
 burgers-0.171
 grill-0.170
 cooker-0.168
Tokenite
Token ID578
Feature activation-4.07e-01

Pos logit contributions
 til+0.183
 former+0.175
 fail+0.173
 superstar+0.170
inkle+0.163
Neg logit contributions
cious-0.295
row-0.272
ful-0.267
and-0.263
lette-0.258
Token that
Token ID326
Feature activation-6.05e-02

Pos logit contributions
 superstar+0.744
 til+0.686
 flashes+0.672
yles+0.668
inkle+0.664
Neg logit contributions
cious-0.456
 counter-0.439
lette-0.417
ough-0.415
 burgers-0.402
Token she
Token ID673
Feature activation-5.70e-02

Pos logit contributions
 superstar+0.109
inkle+0.099
 til+0.098
ller+0.098
ooming+0.097
Neg logit contributions
cious-0.074
 counter-0.063
 burgers-0.062
 cooker-0.061
ough-0.060
Token took
Token ID1718
Feature activation+1.27e-01

Pos logit contributions
 superstar+0.114
 flashes+0.112
ement+0.111
ays+0.108
 bolt+0.108
Neg logit contributions
cious-0.059
ocado-0.047
ough-0.045
asting-0.045
ages-0.043
Token to
Token ID284
Feature activation-1.80e-03

Pos logit contributions
cious+0.187
 burgers+0.172
asting+0.170
lette+0.170
 counter+0.167
Neg logit contributions
 superstar-0.199
ite-0.188
 til-0.181
bul-0.178
 flashes-0.178
Token the
Token ID262
Feature activation-2.39e-02

Pos logit contributions
 superstar+0.003
ite+0.003
bul+0.003
 til+0.003
 flashes+0.003
Neg logit contributions
cious-0.002
 counter-0.002
 burgers-0.002
asting-0.002
ough-0.002
Token.
Token ID13
Feature activation-1.49e-02

Pos logit contributions
 superstar+0.052
ite+0.049
 til+0.048
 flashes+0.047
bul+0.046
Neg logit contributions
cious-0.051
 burgers-0.047
lette-0.047
asting-0.046
 counter-0.046
Token I
Token ID314
Feature activation+3.69e-02

Pos logit contributions
 superstar+0.026
ite+0.024
 til+0.024
bul+0.023
 flashes+0.023
Neg logit contributions
cious-0.019
lette-0.017
 burgers-0.017
 cooker-0.017
 counter-0.017
Token like
Token ID588
Feature activation-7.91e-02

Pos logit contributions
cious+0.040
asting+0.038
ocado+0.036
lette+0.034
 food+0.030
Neg logit contributions
ller-0.071
lights-0.070
ement-0.068
 flashes-0.068
llers-0.065
Token your
Token ID534
Feature activation+7.91e-02

Pos logit contributions
 superstar+0.141
ite+0.131
 til+0.129
bird+0.126
bul+0.125
Neg logit contributions
cious-0.099
 burgers-0.092
lette-0.089
asting-0.087
 counter-0.087
Token k
Token ID479
Feature activation-1.51e-01

Pos logit contributions
 food+0.100
 bread+0.100
 grill+0.099
 wheat+0.099
cious+0.097
Neg logit contributions
 superstar-0.128
Because-0.120
Brian-0.119
Taylor-0.117
azz-0.117
Tokenite
Token ID578
Feature activation-4.07e-01

Pos logit contributions
 superstar+0.184
 fail+0.172
 til+0.171
 Melvin+0.165
 perfected+0.165
Neg logit contributions
lette-0.245
cious-0.243
 bread-0.232
row-0.231
ese-0.223
Token.
Token ID13
Feature activation-1.65e-02

Pos logit contributions
 superstar+0.774
 til+0.731
 flashes+0.727
Brian+0.722
+0.720
Neg logit contributions
cious-0.382
 counter-0.381
 burgers-0.377
 food-0.370
lette-0.358
Token It
Token ID632
Feature activation+2.53e-02

Pos logit contributions
 superstar+0.030
ite+0.028
 til+0.028
bul+0.027
 flashes+0.027
Neg logit contributions
cious-0.020
lette-0.017
 burgers-0.017
 counter-0.017
ough-0.017
Token is
Token ID318
Feature activation-6.46e-02

Pos logit contributions
cious+0.024
lette+0.020
ocado+0.020
 burgers+0.020
 counter+0.020
Neg logit contributions
 superstar-0.048
-0.046
Brian-0.046
 recess-0.046
icycle-0.045
Token a
Token ID257
Feature activation-1.57e-02

Pos logit contributions
 superstar+0.105
ite+0.099
bird+0.095
 til+0.094
bul+0.093
Neg logit contributions
cious-0.091
 burgers-0.085
lette-0.083
asting-0.083
 counter-0.081
Token crow
Token ID37593
Feature activation+3.86e-02

Pos logit contributions
 superstar+0.024
 til+0.023
ite+0.023
 flashes+0.023
bird+0.022
Neg logit contributions
cious-0.022
 burgers-0.021
 counter-0.020
lette-0.020
asting-0.020
Token a
Token ID257
Feature activation-1.20e-02

Pos logit contributions
 flashes+0.061
 Rhythm+0.059
 whist+0.058
 smiles+0.057
 Spark+0.056
Neg logit contributions
 cooker-0.151
 counter-0.146
apa-0.145
row-0.144
lette-0.143
Token ball
Token ID2613
Feature activation-3.20e-02

Pos logit contributions
 superstar+0.018
ite+0.017
 til+0.017
bul+0.017
 flashes+0.017
Neg logit contributions
cious-0.018
 burgers-0.017
lette-0.016
 cooker-0.016
asting-0.016
Token and
Token ID290
Feature activation-2.07e-02

Pos logit contributions
 superstar+0.052
 til+0.048
 flashes+0.047
ite+0.046
bul+0.046
Neg logit contributions
cious-0.044
 burgers-0.041
 counter-0.040
lette-0.040
 cooker-0.040
Token a
Token ID257
Feature activation-1.24e-02

Pos logit contributions
 superstar+0.028
ite+0.027
 flashes+0.026
 til+0.026
 Rhythm+0.025
Neg logit contributions
cious-0.033
asting-0.032
lette-0.032
 counter-0.031
 cooker-0.031
Token k
Token ID479
Feature activation-1.57e-01

Pos logit contributions
 superstar+0.018
ite+0.017
 til+0.016
bird+0.016
 flashes+0.016
Neg logit contributions
cious-0.020
 burgers-0.018
lette-0.018
asting-0.018
 cooker-0.018
Tokenite
Token ID578
Feature activation-4.06e-01

Pos logit contributions
 former+0.156
 superstar+0.155
 til+0.153
 fail+0.152
 pract+0.148
Neg logit contributions
cious-0.299
 burgers-0.276
row-0.274
oys-0.273
 bread-0.272
Token.
Token ID13
Feature activation-1.28e-02

Pos logit contributions
 superstar+0.692
 til+0.639
 flashes+0.621
ite+0.616
bul+0.615
Neg logit contributions
cious-0.523
 burgers-0.484
 counter-0.478
lette-0.477
ough-0.467
Token Anna
Token ID11735
Feature activation-4.09e-02

Pos logit contributions
 superstar+0.020
ite+0.019
 til+0.019
bul+0.019
 flashes+0.018
Neg logit contributions
cious-0.018
 burgers-0.017
lette-0.017
 counter-0.017
 cooker-0.017
Token is
Token ID318
Feature activation-6.69e-02

Pos logit contributions
 superstar+0.063
bul+0.058
ite+0.055
 flashes+0.055
 bolt+0.055
Neg logit contributions
cious-0.061
 burgers-0.055
 cooker-0.054
 counter-0.054
ough-0.054
Token tall
Token ID7331
Feature activation-4.70e-02

Pos logit contributions
 superstar+0.112
ite+0.107
bul+0.104
bird+0.101
 til+0.101
Neg logit contributions
cious-0.091
 burgers-0.082
lette-0.081
asting-0.081
ough-0.079
Token and
Token ID290
Feature activation-2.06e-02

Pos logit contributions
 superstar+0.083
ite+0.078
bul+0.076
inkle+0.075
 Rhythm+0.075
Neg logit contributions
cious-0.059
 counter-0.054
 burgers-0.053
lette-0.050
 cooker-0.049
Token we
Token ID356
Feature activation-9.43e-03

Pos logit contributions
cious+0.022
asting+0.021
lette+0.020
ough+0.020
 cooker+0.020
Neg logit contributions
 superstar-0.018
ite-0.018
 til-0.016
 Spark-0.016
 flashes-0.016
Token can
Token ID460
Feature activation-9.57e-02

Pos logit contributions
 superstar+0.013
ite+0.012
 til+0.012
 flashes+0.011
bul+0.011
Neg logit contributions
cious-0.016
 burgers-0.015
lette-0.015
asting-0.015
 cooker-0.014
Token have
Token ID423
Feature activation-7.51e-02

Pos logit contributions
 superstar+0.200
 til+0.184
 flashes+0.184
bul+0.181
 conflicts+0.179
Neg logit contributions
ough-0.091
asting-0.083
 counter-0.080
cious-0.079
 grill-0.079
Token a
Token ID257
Feature activation-1.85e-02

Pos logit contributions
 superstar+0.135
 til+0.123
bird+0.122
ite+0.122
inkle+0.119
Neg logit contributions
 burgers-0.092
cious-0.089
asting-0.084
lette-0.083
 counter-0.083
Token k
Token ID479
Feature activation-1.49e-01

Pos logit contributions
 superstar+0.026
 til+0.026
ays+0.025
 flashes+0.025
ller+0.024
Neg logit contributions
 wheat-0.025
 grill-0.025
 burgers-0.025
 counter-0.025
cious-0.024
Tokenite
Token ID578
Feature activation-4.06e-01

Pos logit contributions
 til+0.172
 flashes+0.149
 moons+0.147
inkle+0.145
 superstar+0.143
Neg logit contributions
row-0.251
'-0.250
lette-0.249
 envelope-0.242
ful-0.241
Token."
Token ID526
Feature activation+4.65e-02

Pos logit contributions
 superstar+0.731
 til+0.692
 flashes+0.686
 Rhythm+0.666
Zero+0.655
Neg logit contributions
 burgers-0.446
lette-0.437
 counter-0.436
cious-0.435
ough-0.420
Token
Token ID198
Feature activation-3.53e-02

Pos logit contributions
cious+0.086
asting+0.081
lette+0.080
 burgers+0.079
 cooker+0.078
Neg logit contributions
 superstar-0.055
ite-0.052
 til-0.051
bird-0.050
 Rhythm-0.049
Token
Token ID198
Feature activation-3.49e-02

Pos logit contributions
 superstar+0.047
ite+0.044
 til+0.043
bird+0.042
 flashes+0.042
Neg logit contributions
cious-0.060
 burgers-0.055
lette-0.055
asting-0.054
 cooker-0.054
TokenThey
Token ID2990
Feature activation+8.92e-02

Pos logit contributions
ite+0.050
 superstar+0.048
bird+0.047
Because+0.046
 rider+0.044
Neg logit contributions
cious-0.057
 burgers-0.051
lette-0.051
 cooker-0.050
keeping-0.050
Token run
Token ID1057
Feature activation+1.30e-01

Pos logit contributions
cious+0.109
 burgers+0.099
lette+0.097
asting+0.097
 cooker+0.096
Neg logit contributions
 superstar-0.161
ite-0.151
 til-0.146
bul-0.146
 flashes-0.145
Token "
Token ID366
Feature activation+1.61e-01

Pos logit contributions
 superstar+0.043
ite+0.041
 til+0.040
bird+0.040
 flashes+0.040
Neg logit contributions
cious-0.024
 burgers-0.021
lette-0.021
asting-0.021
 cooker-0.020
TokenOr
Token ID5574
Feature activation+4.27e-02

Pos logit contributions
cious+0.222
lette+0.201
asting+0.193
 burgers+0.191
ough+0.190
Neg logit contributions
 superstar-0.270
 til-0.247
inkle-0.246
 flashes-0.245
ite-0.245
Token maybe
Token ID3863
Feature activation-7.08e-02

Pos logit contributions
 oh+0.047
cious+0.043
 burgers+0.042
ough+0.042
 food+0.040
Neg logit contributions
 superstar-0.082
bird-0.076
icycle-0.075
inkle-0.075
-0.074
Token the
Token ID262
Feature activation-3.17e-02

Pos logit contributions
 superstar+0.115
ite+0.108
 til+0.105
bird+0.103
bul+0.102
Neg logit contributions
cious-0.099
 burgers-0.092
lette-0.090
asting-0.089
 counter-0.088
Token k
Token ID479
Feature activation-1.53e-01

Pos logit contributions
 superstar+0.053
 til+0.047
ite+0.046
bird+0.046
 Rhythm+0.046
Neg logit contributions
cious-0.042
 counter-0.040
 burgers-0.040
lette-0.040
 cooker-0.039
Tokenite
Token ID578
Feature activation-4.06e-01

Pos logit contributions
 superstar+0.144
 til+0.130
 nightmares+0.109
 former+0.104
 Spark+0.104
Neg logit contributions
ough-0.297
lette-0.295
 wheat-0.291
cious-0.287
 bread-0.281
Token is
Token ID318
Feature activation-6.98e-02

Pos logit contributions
 superstar+0.684
Brian+0.615
 Rhythm+0.615
 til+0.605
+0.600
Neg logit contributions
cious-0.497
 burgers-0.478
lette-0.477
asting-0.467
 counter-0.456
Token too
Token ID1165
Feature activation-2.10e-01

Pos logit contributions
 superstar+0.101
ite+0.094
 til+0.089
bird+0.089
 flashes+0.088
Neg logit contributions
cious-0.110
 burgers-0.104
lette-0.103
asting-0.103
 counter-0.099
Token heavy
Token ID4334
Feature activation+3.18e-03

Pos logit contributions
 superstar+0.263
ite+0.253
 til+0.243
 flashes+0.239
bul+0.234
Neg logit contributions
cious-0.368
lette-0.342
 burgers-0.342
 counter-0.337
 cooker-0.334
Token."
Token ID526
Feature activation+4.33e-02

Pos logit contributions
cious+0.005
lette+0.005
 burgers+0.005
asting+0.005
 cooker+0.005
Neg logit contributions
 superstar-0.004
ite-0.004
 til-0.004
bul-0.004
 rider-0.004
Token
Token ID198
Feature activation-3.59e-02

Pos logit contributions
cious+0.068
 burgers+0.063
lette+0.062
 counter+0.061
asting+0.061
Neg logit contributions
 superstar-0.064
ite-0.059
 til-0.057
bul-0.057
bird-0.056
Token "
Token ID366
Feature activation+1.61e-01

Pos logit contributions
 superstar+0.045
ite+0.043
 til+0.042
bul+0.042
 flashes+0.042
Neg logit contributions
cious-0.020
 burgers-0.018
lette-0.017
asting-0.017
 counter-0.017
TokenWe
Token ID1135
Feature activation-2.61e-02

Pos logit contributions
cious+0.219
asting+0.194
lette+0.194
 burgers+0.190
 counter+0.187
Neg logit contributions
 superstar-0.262
inkle-0.253
 til-0.252
ite-0.249
 flashes-0.248
Token like
Token ID588
Feature activation-8.34e-02

Pos logit contributions
ller+0.052
 superstar+0.051
 til+0.049
bul+0.049
 flashes+0.048
Neg logit contributions
cious-0.026
asting-0.025
lette-0.025
ough-0.024
ocado-0.023
Token your
Token ID534
Feature activation+7.25e-02

Pos logit contributions
 superstar+0.150
ite+0.140
 til+0.138
bird+0.135
bul+0.134
Neg logit contributions
cious-0.103
 burgers-0.097
lette-0.092
asting-0.091
 counter-0.090
Token k
Token ID479
Feature activation-1.53e-01

Pos logit contributions
 food+0.085
 bread+0.085
 wheat+0.085
 grill+0.081
 sand+0.079
Neg logit contributions
 superstar-0.124
azz-0.118
Brian-0.117
Taylor-0.116
Because-0.116
Tokenite
Token ID578
Feature activation-4.06e-01

Pos logit contributions
 superstar+0.212
 til+0.200
 former+0.192
 fail+0.186
 nightmare+0.186
Neg logit contributions
row-0.218
cious-0.216
ough-0.215
lette-0.207
'-0.204
Token.
Token ID13
Feature activation-2.13e-02

Pos logit contributions
 superstar+0.788
 til+0.739
+0.739
Brian+0.728
bul+0.723
Neg logit contributions
 burgers-0.393
cious-0.392
 counter-0.375
 food-0.373
lette-0.362
Token Can
Token ID1680
Feature activation-4.00e-02

Pos logit contributions
 superstar+0.039
ite+0.037
 til+0.037
bul+0.036
inkle+0.036
Neg logit contributions
cious-0.024
 burgers-0.022
lette-0.022
 counter-0.021
ough-0.021
Token you
Token ID345
Feature activation+7.85e-02

Pos logit contributions
 superstar+0.091
 til+0.085
inkle+0.084
bul+0.084
 flashes+0.083
Neg logit contributions
cious-0.027
 oh-0.026
 burgers-0.025
 counter-0.024
 Dinner-0.023
Token introduce
Token ID10400
Feature activation-1.07e-01

Pos logit contributions
asting+0.072
ough+0.065
 grill+0.061
lette+0.059
row+0.059
Neg logit contributions
ays-0.164
lights-0.157
 conflicts-0.152
 superstar-0.152
chairs-0.152
Token us
Token ID514
Feature activation+1.13e-01

Pos logit contributions
 superstar+0.220
 til+0.204
ite+0.202
bul+0.201
 flashes+0.197
Neg logit contributions
cious-0.108
 burgers-0.096
lette-0.096
ough-0.092
asting-0.090
Token a
Token ID257
Feature activation-1.41e-02

Pos logit contributions
 Rhythm+0.062
 whist+0.062
 flashes+0.062
 Zoom+0.060
 smiles+0.059
Neg logit contributions
 cooker-0.153
apa-0.148
 counter-0.147
row-0.147
lette-0.146
Token ball
Token ID2613
Feature activation-3.17e-02

Pos logit contributions
 superstar+0.021
ite+0.020
 til+0.020
bul+0.019
bird+0.019
Neg logit contributions
cious-0.021
 burgers-0.020
lette-0.019
asting-0.019
 cooker-0.019
Token and
Token ID290
Feature activation-2.10e-02

Pos logit contributions
 superstar+0.054
 til+0.050
 flashes+0.048
inkle+0.048
bul+0.047
Neg logit contributions
cious-0.041
 burgers-0.039
 counter-0.039
ough-0.038
 cooker-0.038
Token a
Token ID257
Feature activation-1.41e-02

Pos logit contributions
 superstar+0.028
ite+0.027
 flashes+0.025
 til+0.025
 Rhythm+0.025
Neg logit contributions
cious-0.034
asting-0.033
lette-0.033
 cooker-0.032
 counter-0.032
Token k
Token ID479
Feature activation-1.54e-01

Pos logit contributions
 superstar+0.020
ite+0.019
 til+0.018
bird+0.018
 flashes+0.018
Neg logit contributions
cious-0.022
lette-0.021
 burgers-0.021
asting-0.021
 cooker-0.020
Tokenite
Token ID578
Feature activation-4.05e-01

Pos logit contributions
 superstar+0.151
 til+0.149
 fail+0.146
 former+0.145
 pract+0.140
Neg logit contributions
cious-0.296
 burgers-0.276
 bread-0.273
row-0.270
oys-0.263
Token.
Token ID13
Feature activation-1.03e-02

Pos logit contributions
 superstar+0.694
 til+0.640
 flashes+0.622
bul+0.616
ite+0.616
Neg logit contributions
cious-0.516
 burgers-0.480
 counter-0.475
lette-0.471
ough-0.463
Token They
Token ID1119
Feature activation+5.76e-02

Pos logit contributions
 superstar+0.016
ite+0.015
 til+0.014
bird+0.014
 flashes+0.014
Neg logit contributions
cious-0.016
 burgers-0.015
lette-0.015
asting-0.014
 cooker-0.014
Token run
Token ID1057
Feature activation+1.32e-01

Pos logit contributions
cious+0.074
 burgers+0.067
asting+0.066
ough+0.066
 cooker+0.066
Neg logit contributions
 superstar-0.101
bul-0.093
ller-0.091
ite-0.091
 til-0.090
Token and
Token ID290
Feature activation-2.08e-02

Pos logit contributions
cious+0.228
lette+0.224
 burgers+0.217
asting+0.215
 cooker+0.212
Neg logit contributions
 superstar-0.166
ite-0.158
 til-0.153
 Rhythm-0.152
 rider-0.147
Token laugh
Token ID6487
Feature activation-1.12e-01

Pos logit contributions
 superstar+0.029
ite+0.028
 til+0.027
bird+0.026
 Rhythm+0.026
Neg logit contributions
cious-0.033
lette-0.032
asting-0.031
 burgers-0.031
 cooker-0.030
Token,
Token ID11
Feature activation-9.56e-03

Pos logit contributions
 superstar+0.122
ite+0.115
 til+0.111
bird+0.109
 flashes+0.107
Neg logit contributions
cious-0.144
asting-0.136
lette-0.135
 burgers-0.134
 cooker-0.133
Token this
Token ID428
Feature activation+5.01e-02

Pos logit contributions
 superstar+0.015
 unicorn+0.013
 Spark+0.013
ite+0.013
 til+0.013
Neg logit contributions
cious-0.015
lette-0.014
keeping-0.013
asting-0.013
 counters-0.013
Token is
Token ID318
Feature activation-6.67e-02

Pos logit contributions
cious+0.076
lette+0.071
 burgers+0.070
asting+0.070
 counter+0.069
Neg logit contributions
 superstar-0.074
 til-0.067
ite-0.065
bird-0.065
bul-0.065
Token my
Token ID616
Feature activation+1.55e-02

Pos logit contributions
 superstar+0.104
 Rhythm+0.100
 til+0.099
 unicorn+0.097
ite+0.097
Neg logit contributions
cious-0.097
 cooker-0.089
lette-0.089
asting-0.088
 counters-0.088
Token k
Token ID479
Feature activation-1.52e-01

Pos logit contributions
 food+0.018
 wheat+0.018
cious+0.018
 counter+0.018
 bread+0.017
Neg logit contributions
azz-0.025
Taylor-0.025
Brian-0.025
 superstar-0.025
Zero-0.024
Tokenite
Token ID578
Feature activation-4.05e-01

Pos logit contributions
 til+0.182
 superstar+0.170
 fail+0.168
 former+0.164
 nightmares+0.159
Neg logit contributions
row-0.255
cious-0.255
lette-0.248
ough-0.246
 envelope-0.236
Token.
Token ID13
Feature activation-1.56e-02

Pos logit contributions
 superstar+0.707
 til+0.658
 flashes+0.646
ite+0.636
 Rhythm+0.631
Neg logit contributions
cious-0.516
 burgers-0.478
lette-0.473
 counter-0.466
asting-0.461
Token I
Token ID314
Feature activation+3.18e-02

Pos logit contributions
 superstar+0.029
 til+0.028
bul+0.027
inkle+0.027
ite+0.027
Neg logit contributions
cious-0.017
lette-0.016
ough-0.015
 burgers-0.015
 Dinner-0.015
Token made
Token ID925
Feature activation-7.31e-02

Pos logit contributions
asting+0.033
cious+0.031
ocado+0.030
lette+0.029
row+0.027
Neg logit contributions
ller-0.062
 flashes-0.060
lights-0.060
ement-0.058
llers-0.058
Token it
Token ID340
Feature activation-5.89e-03

Pos logit contributions
 superstar+0.119
ite+0.112
 til+0.110
bird+0.107
 flashes+0.107
Neg logit contributions
cious-0.102
 burgers-0.094
lette-0.093
asting-0.092
 cooker-0.091
Token with
Token ID351
Feature activation-8.24e-02

Pos logit contributions
 superstar+0.010
+0.009
ite+0.009
 til+0.009
 flashes+0.009
Neg logit contributions
cious-0.008
asting-0.008
 burgers-0.008
lette-0.007
 counter-0.007
Token the
Token ID262
Feature activation-2.95e-02

Pos logit contributions
 superstar+0.139
ite+0.135
bul+0.127
 rider+0.125
inkle+0.122
Neg logit contributions
 burgers-0.070
 counter-0.069
cious-0.069
asting-0.065
 stove-0.063
Token ball
Token ID2613
Feature activation-2.65e-02

Pos logit contributions
 superstar+0.048
ite+0.045
 til+0.044
 Rhythm+0.044
+0.043
Neg logit contributions
 counter-0.040
 burgers-0.040
 cooker-0.038
 stove-0.038
cious-0.037
Token,
Token ID11
Feature activation-1.29e-02

Pos logit contributions
 superstar+0.045
ite+0.041
 flashes+0.040
 til+0.040
 Rhythm+0.040
Neg logit contributions
cious-0.034
 counter-0.033
 burgers-0.032
lette-0.031
asting-0.031
Token the
Token ID262
Feature activation-3.09e-02

Pos logit contributions
 superstar+0.024
ite+0.024
bul+0.022
ane+0.022
inkle+0.022
Neg logit contributions
cious-0.014
 burgers-0.014
 counter-0.014
lette-0.013
 oh-0.012
Token k
Token ID479
Feature activation-1.52e-01

Pos logit contributions
 superstar+0.051
 til+0.047
ite+0.047
bul+0.046
inkle+0.046
Neg logit contributions
cious-0.040
 burgers-0.040
 counter-0.040
 cooker-0.038
lette-0.037
Tokenite
Token ID578
Feature activation-4.05e-01

Pos logit contributions
 superstar+0.168
 fail+0.160
 til+0.160
 pract+0.158
 former+0.153
Neg logit contributions
cious-0.268
ough-0.256
row-0.253
ocado-0.251
oys-0.249
Token,
Token ID11
Feature activation-1.16e-02

Pos logit contributions
 superstar+0.693
ite+0.621
 til+0.615
 flashes+0.613
 Rhythm+0.612
Neg logit contributions
cious-0.513
 counter-0.488
 burgers-0.465
lette-0.464
asting-0.459
Token and
Token ID290
Feature activation-2.82e-02

Pos logit contributions
 superstar+0.020
ite+0.019
inkle+0.018
bul+0.018
 Rhythm+0.018
Neg logit contributions
cious-0.015
 burgers-0.014
 counter-0.014
lette-0.013
 oh-0.013
Token the
Token ID262
Feature activation-2.59e-02

Pos logit contributions
 superstar+0.051
ite+0.048
+0.048
icycle+0.048
bul+0.048
Neg logit contributions
cious-0.034
 burgers-0.032
 oh-0.030
 counter-0.029
 cooker-0.028
Token bucket
Token ID19236
Feature activation+1.84e-01

Pos logit contributions
 superstar+0.041
ite+0.037
 til+0.037
inkle+0.036
bul+0.036
Neg logit contributions
cious-0.036
 counter-0.035
 burgers-0.035
lette-0.033
 cooker-0.033
Token.
Token ID13
Feature activation-1.98e-02

Pos logit contributions
cious+0.238
 counter+0.227
 burgers+0.226
asting+0.224
lette+0.221
Neg logit contributions
 superstar-0.310
 Rhythm-0.278
 til-0.270
Brian-0.269
ite-0.268
Token and
Token ID290
Feature activation-2.47e-02

Pos logit contributions
cious+0.343
lette+0.311
 burgers+0.310
asting+0.307
 counter+0.302
Neg logit contributions
 superstar-0.372
ite-0.349
bul-0.337
 flashes-0.336
 til-0.335
Token Ben
Token ID3932
Feature activation+1.49e-02

Pos logit contributions
 superstar+0.042
ite+0.040
bul+0.038
 til+0.038
 flashes+0.038
Neg logit contributions
cious-0.033
 burgers-0.031
lette-0.030
 counter-0.030
 cooker-0.029
Token took
Token ID1718
Feature activation+1.29e-01

Pos logit contributions
cious+0.022
 burgers+0.020
lette+0.020
asting+0.020
 cooker+0.020
Neg logit contributions
 superstar-0.023
ite-0.022
 til-0.021
bird-0.021
 flashes-0.021
Token their
Token ID511
Feature activation+7.33e-02

Pos logit contributions
cious+0.164
 burgers+0.157
lette+0.149
asting+0.149
 cooker+0.148
Neg logit contributions
 superstar-0.227
ite-0.212
 til-0.205
bul-0.203
bird-0.202
Token k
Token ID479
Feature activation-1.53e-01

Pos logit contributions
cious+0.115
 burgers+0.106
 counter+0.104
lette+0.102
 bread+0.102
Neg logit contributions
 superstar-0.110
 til-0.095
 flashes-0.092
Because-0.091
inkle-0.091
Tokenite
Token ID578
Feature activation-4.05e-01

Pos logit contributions
 superstar+0.155
 til+0.138
 flashes+0.114
inkle+0.111
 fail+0.111
Neg logit contributions
row-0.310
cious-0.306
lette-0.299
ough-0.295
 bread-0.288
Token to
Token ID284
Feature activation+4.93e-03

Pos logit contributions
 superstar+0.698
 til+0.629
 flashes+0.614
ite+0.612
 Rhythm+0.605
Neg logit contributions
cious-0.525
 burgers-0.500
lette-0.486
 counter-0.477
asting-0.477
Token the
Token ID262
Feature activation-2.11e-02

Pos logit contributions
cious+0.006
 burgers+0.006
 counter+0.006
asting+0.006
lette+0.006
Neg logit contributions
 superstar-0.009
ite-0.008
bul-0.008
 flashes-0.008
bird-0.008
Token park
Token ID3952
Feature activation-3.48e-02

Pos logit contributions
 superstar+0.035
ite+0.032
 til+0.032
bul+0.031
 flashes+0.031
Neg logit contributions
cious-0.029
 burgers-0.027
 counter-0.027
 cooker-0.026
lette-0.026
Token.
Token ID13
Feature activation-1.14e-02

Pos logit contributions
 superstar+0.050
ite+0.047
 til+0.046
bird+0.045
 flashes+0.045
Neg logit contributions
cious-0.055
 burgers-0.051
lette-0.051
asting-0.050
 cooker-0.050
Token They
Token ID1119
Feature activation+4.96e-02

Pos logit contributions
 superstar+0.019
ite+0.018
 til+0.017
 flashes+0.017
bird+0.017
Neg logit contributions
cious-0.016
lette-0.015
asting-0.015
 burgers-0.015
 cooker-0.014
Token.
Token ID13
Feature activation-2.37e-02

Pos logit contributions
 superstar+0.306
ite+0.285
 Rhythm+0.277
Brian+0.274
 til+0.270
Neg logit contributions
cious-0.230
 counter-0.216
 burgers-0.214
lette-0.212
asting-0.207
Token She
Token ID1375
Feature activation-5.54e-02

Pos logit contributions
 superstar+0.034
ite+0.031
 til+0.031
bird+0.030
 flashes+0.030
Neg logit contributions
cious-0.038
 burgers-0.036
lette-0.035
asting-0.035
 cooker-0.034
Token sees
Token ID7224
Feature activation-1.18e-01

Pos logit contributions
 superstar+0.105
bul+0.098
ite+0.095
+0.095
 rider+0.093
Neg logit contributions
 counter-0.056
 burgers-0.054
cious-0.053
 counters-0.050
 stove-0.050
Token the
Token ID262
Feature activation-2.80e-02

Pos logit contributions
 superstar+0.208
ite+0.196
 til+0.186
bul+0.185
 rider+0.185
Neg logit contributions
cious-0.149
 burgers-0.143
 counter-0.138
asting-0.135
lette-0.134
Token k
Token ID479
Feature activation-1.52e-01

Pos logit contributions
 superstar+0.045
azz+0.042
Because+0.040
Brian+0.040
 til+0.039
Neg logit contributions
 counter-0.039
 bread-0.037
 food-0.036
 dirt-0.036
 wheat-0.036
Tokenite
Token ID578
Feature activation-4.05e-01

Pos logit contributions
 former+0.186
 til+0.176
 superstar+0.174
 nightmare+0.166
 nightmares+0.165
Neg logit contributions
ough-0.239
cious-0.236
row-0.234
'-0.234
line-0.229
Token.
Token ID13
Feature activation-2.39e-02

Pos logit contributions
 superstar+0.666
ite+0.597
 Rhythm+0.585
 til+0.584
Brian+0.579
Neg logit contributions
cious-0.541
 counter-0.516
asting-0.507
 burgers-0.503
lette-0.502
Token
Token ID198
Feature activation-3.57e-02

Pos logit contributions
 superstar+0.036
ite+0.033
 til+0.032
bul+0.032
bird+0.031
Neg logit contributions
cious-0.037
 burgers-0.034
 counter-0.034
lette-0.033
 cooker-0.033
Token
Token ID198
Feature activation-3.60e-02

Pos logit contributions
 superstar+0.049
ite+0.046
 til+0.045
bird+0.044
 flashes+0.044
Neg logit contributions
cious-0.059
 burgers-0.055
lette-0.054
asting-0.054
 cooker-0.053
Token"
Token ID1
Feature activation+2.11e-01

Pos logit contributions
ite+0.064
Because+0.060
Zero+0.057
 Rhythm+0.057
 til+0.056
Neg logit contributions
lette-0.047
cious-0.046
 burgers-0.044
keeping-0.044
 cooker-0.043
TokenThat
Token ID2504
Feature activation+6.31e-02

Pos logit contributions
cious+0.329
 burgers+0.306
lette+0.302
asting+0.299
 counter+0.296
Neg logit contributions
 superstar-0.312
ite-0.291
 til-0.286
bird-0.277
 flashes-0.277
Token.
Token ID13
Feature activation-1.86e-02

Pos logit contributions
cious+0.024
lette+0.022
 burgers+0.022
 counters+0.021
asting+0.021
Neg logit contributions
 superstar-0.017
ite-0.016
 rider-0.015
 til-0.015
bird-0.015
Token It
Token ID632
Feature activation+1.96e-02

Pos logit contributions
 superstar+0.028
ite+0.026
 til+0.026
bird+0.025
 flashes+0.025
Neg logit contributions
cious-0.028
 burgers-0.026
lette-0.026
asting-0.026
 cooker-0.025
Token wob
Token ID46867
Feature activation-2.19e-01

Pos logit contributions
 burgers+0.020
 bread+0.019
cious+0.019
 mud+0.018
 food+0.018
Neg logit contributions
 superstar-0.038
Brian-0.036
bul-0.034
ite-0.034
 Rhythm-0.034
Tokenbles
Token ID7689
Feature activation-1.50e-02

Pos logit contributions
 superstar+0.418
bird+0.385
 unicorn+0.384
 flashes+0.381
ite+0.378
Neg logit contributions
cious-0.248
asting-0.248
ough-0.228
lette-0.227
 burgers-0.219
Token and
Token ID290
Feature activation-2.90e-02

Pos logit contributions
ite+0.017
 superstar+0.017
 til+0.016
bird+0.015
 hits+0.015
Neg logit contributions
cious-0.027
 burgers-0.026
lette-0.026
 counters-0.025
 cooker-0.025
Token til
Token ID21502
Feature activation-4.05e-01

Pos logit contributions
 superstar+0.047
icycle+0.043
ite+0.042
Brian+0.042
bird+0.042
Neg logit contributions
 burgers-0.040
 dirt-0.039
 counter-0.038
 bread-0.038
cious-0.037
Tokents
Token ID912
Feature activation+8.07e-02

Pos logit contributions
 superstar+0.772
ite+0.732
bird+0.706
 til+0.705
bul+0.705
Neg logit contributions
cious-0.461
 burgers-0.416
asting-0.408
lette-0.405
 counter-0.391
Token.
Token ID13
Feature activation-1.98e-02

Pos logit contributions
cious+0.133
 burgers+0.125
lette+0.123
 cooker+0.123
 counter+0.121
Neg logit contributions
 superstar-0.109
 til-0.105
ite-0.100
bird-0.099
 flashes-0.097
Token Sara
Token ID24799
Feature activation+7.25e-02

Pos logit contributions
 superstar+0.031
ite+0.029
 til+0.029
bird+0.028
 flashes+0.028
Neg logit contributions
cious-0.029
 burgers-0.027
lette-0.026
asting-0.026
 cooker-0.026
Token and
Token ID290
Feature activation-3.16e-02

Pos logit contributions
cious+0.090
 counter+0.085
 burgers+0.082
ough+0.080
 counters+0.080
Neg logit contributions
 superstar-0.132
bul-0.123
ite-0.113
 unicorn-0.113
 former-0.111
Token Ben
Token ID3932
Feature activation+1.24e-02

Pos logit contributions
 superstar+0.059
ite+0.055
bul+0.054
bird+0.052
 flashes+0.051
Neg logit contributions
cious-0.038
 burgers-0.036
 counter-0.034
lette-0.033
 cooker-0.033
Token want
Token ID765
Feature activation+5.29e-02

Pos logit contributions
 superstar+0.036
ite+0.033
 til+0.032
 flashes+0.031
bul+0.031
Neg logit contributions
cious-0.042
 burgers-0.039
lette-0.039
asting-0.038
 cooker-0.038
Token to
Token ID284
Feature activation+2.66e-04

Pos logit contributions
cious+0.074
 burgers+0.069
lette+0.067
asting+0.066
 counter+0.066
Neg logit contributions
 superstar-0.088
ite-0.082
 til-0.080
bird-0.078
bul-0.078
Token fly
Token ID6129
Feature activation-1.54e-01

Pos logit contributions
cious+0.000
 burgers+0.000
lette+0.000
asting+0.000
ough+0.000
Neg logit contributions
 superstar-0.000
ite-0.000
 til-0.000
bul-0.000
 flashes-0.000
Token the
Token ID262
Feature activation-2.78e-02

Pos logit contributions
 superstar+0.290
+0.261
bul+0.252
 bolt+0.250
 til+0.250
Neg logit contributions
cious-0.173
 burgers-0.169
 food-0.160
asting-0.159
keeping-0.158
Token k
Token ID479
Feature activation-1.53e-01

Pos logit contributions
 superstar+0.043
ite+0.039
 til+0.039
 flashes+0.038
bird+0.038
Neg logit contributions
cious-0.041
 burgers-0.039
lette-0.038
asting-0.038
 counter-0.038
Tokenite
Token ID578
Feature activation-4.04e-01

Pos logit contributions
 til+0.151
 superstar+0.145
inkle+0.126
 flashes+0.126
 fail+0.125
Neg logit contributions
row-0.295
ough-0.287
cious-0.282
lette-0.275
and-0.271
Token again
Token ID757
Feature activation-2.94e-01

Pos logit contributions
 superstar+0.692
 til+0.645
 flashes+0.639
bul+0.616
 Rhythm+0.610
Neg logit contributions
cious-0.497
 burgers-0.480
 counter-0.473
lette-0.470
asting-0.469
Token.
Token ID13
Feature activation-1.94e-02

Pos logit contributions
 superstar+0.504
ite+0.472
 til+0.464
bird+0.455
 flashes+0.455
Neg logit contributions
cious-0.387
 burgers-0.354
lette-0.350
asting-0.347
 cooker-0.339
Token We
Token ID775
Feature activation-3.59e-02

Pos logit contributions
 superstar+0.032
ite+0.030
 til+0.030
bul+0.029
bird+0.029
Neg logit contributions
cious-0.026
 burgers-0.024
lette-0.024
 counter-0.023
 cooker-0.023
Token want
Token ID765
Feature activation+5.45e-02

Pos logit contributions
 superstar+0.056
ller+0.052
 flashes+0.051
 til+0.051
bul+0.051
Neg logit contributions
cious-0.051
asting-0.049
lette-0.049
 cooker-0.047
ough-0.047
Token to
Token ID284
Feature activation+4.26e-04

Pos logit contributions
 burgers+0.061
cious+0.058
 counter+0.055
 food+0.054
lette+0.054
Neg logit contributions
 superstar-0.107
ite-0.097
inkle-0.097
 til-0.096
 rider-0.094
TokenThey
Token ID2990
Feature activation+8.83e-02

Pos logit contributions
ite+0.057
 superstar+0.055
 Rhythm+0.055
 til+0.053
Zero+0.053
Neg logit contributions
cious-0.050
 burgers-0.048
 cooker-0.048
lette-0.047
asting-0.046
Token look
Token ID804
Feature activation+5.87e-02

Pos logit contributions
cious+0.098
 cooker+0.086
lette+0.086
asting+0.085
 burgers+0.085
Neg logit contributions
 superstar-0.168
ite-0.157
bul-0.157
ller-0.156
 unicorn-0.151
Token at
Token ID379
Feature activation-7.05e-02

Pos logit contributions
 burgers+0.076
cious+0.074
asting+0.070
 cooker+0.069
 counters+0.069
Neg logit contributions
 superstar-0.100
ite-0.096
-0.094
bird-0.091
bul-0.090
Token the
Token ID262
Feature activation-3.05e-02

Pos logit contributions
 superstar+0.151
+0.134
ite+0.134
icycle+0.134
bul+0.132
Neg logit contributions
 burgers-0.066
 wheat-0.061
 dirt-0.059
 bread-0.058
 grill-0.057
Token k
Token ID479
Feature activation-1.52e-01

Pos logit contributions
 superstar+0.051
azz+0.047
Brian+0.045
yles+0.044
Because+0.044
Neg logit contributions
 counter-0.040
 dirt-0.040
 wheat-0.039
 soil-0.038
 bread-0.038
Tokenite
Token ID578
Feature activation-4.04e-01

Pos logit contributions
 superstar+0.165
 til+0.160
 nightmares+0.143
 former+0.140
 fail+0.134
Neg logit contributions
ough-0.275
row-0.266
cious-0.266
 wheat-0.264
lette-0.262
Token.
Token ID13
Feature activation-2.04e-02

Pos logit contributions
 superstar+0.693
ite+0.628
 til+0.615
 Rhythm+0.610
 flashes+0.607
Neg logit contributions
cious-0.517
 counter-0.484
 burgers-0.480
lette-0.479
asting-0.469
Token They
Token ID1119
Feature activation+4.76e-02

Pos logit contributions
 superstar+0.037
ite+0.035
bul+0.034
 rider+0.034
 til+0.034
Neg logit contributions
cious-0.024
 burgers-0.023
 counter-0.022
 cooker-0.021
lette-0.021
Token see
Token ID766
Feature activation-4.50e-02

Pos logit contributions
cious+0.050
lette+0.041
 dirt+0.041
 counter+0.040
 grill+0.040
Neg logit contributions
 superstar-0.095
bul-0.092
ite-0.087
ller-0.087
icycle-0.086
Token a
Token ID257
Feature activation-2.18e-02

Pos logit contributions
 superstar+0.090
+0.086
icycle+0.084
ite+0.083
inkle+0.080
Neg logit contributions
 burgers-0.047
cious-0.043
 bread-0.042
 wheat-0.042
 dirt-0.042
Token problem
Token ID1917
Feature activation-2.28e-01

Pos logit contributions
 superstar+0.033
ite+0.031
 til+0.030
bul+0.030
bird+0.029
Neg logit contributions
cious-0.032
 counter-0.030
 burgers-0.029
lette-0.028
asting-0.028
Token a
Token ID257
Feature activation-2.18e-02

Pos logit contributions
 superstar+0.090
+0.086
icycle+0.084
ite+0.083
inkle+0.080
Neg logit contributions
 burgers-0.047
cious-0.043
 bread-0.042
 wheat-0.042
 dirt-0.042
Token problem
Token ID1917
Feature activation-2.28e-01

Pos logit contributions
 superstar+0.033
ite+0.031
 til+0.030
bul+0.030
bird+0.029
Neg logit contributions
cious-0.032
 counter-0.030
 burgers-0.029
lette-0.028
asting-0.028
Token.
Token ID13
Feature activation-2.37e-02

Pos logit contributions
 superstar+0.309
ite+0.295
 til+0.289
 flashes+0.284
bird+0.283
Neg logit contributions
cious-0.377
lette-0.349
 burgers-0.347
asting-0.343
 cooker-0.338
Token The
Token ID383
Feature activation-7.93e-02

Pos logit contributions
 superstar+0.041
ite+0.038
 til+0.037
 rider+0.036
bul+0.036
Neg logit contributions
cious-0.031
 burgers-0.029
 counter-0.028
lette-0.028
 cooker-0.028
Token k
Token ID479
Feature activation-1.51e-01

Pos logit contributions
 superstar+0.142
azz+0.130
yles+0.125
Because+0.124
 til+0.123
Neg logit contributions
 dirt-0.093
 counter-0.092
 wheat-0.091
 bread-0.090
 soil-0.088
Tokenite
Token ID578
Feature activation-4.04e-01

Pos logit contributions
 superstar+0.161
 former+0.139
 nightmares+0.138
 til+0.136
 fail+0.131
Neg logit contributions
 wheat-0.265
ough-0.263
cious-0.255
 bread-0.253
lette-0.250
Token has
Token ID468
Feature activation-8.86e-02

Pos logit contributions
 superstar+0.771
Brian+0.714
 former+0.695
+0.692
icycle+0.679
Neg logit contributions
 counter-0.380
cious-0.376
 bread-0.374
 burgers-0.370
 dirt-0.368
Token a
Token ID257
Feature activation-2.11e-02

Pos logit contributions
 superstar+0.139
ite+0.129
 til+0.126
bird+0.123
bul+0.122
Neg logit contributions
cious-0.130
 burgers-0.120
lette-0.119
asting-0.118
 counter-0.116
Token hole
Token ID7604
Feature activation+4.31e-02

Pos logit contributions
 superstar+0.033
ite+0.031
bird+0.031
 til+0.030
bul+0.029
Neg logit contributions
cious-0.029
 counter-0.027
 burgers-0.026
lette-0.026
 cooker-0.025
Token in
Token ID287
Feature activation+2.04e-02

Pos logit contributions
cious+0.058
asting+0.054
 burgers+0.054
lette+0.054
 counter+0.052
Neg logit contributions
 superstar-0.072
ite-0.066
 til-0.064
 Rhythm-0.064
bird-0.063
Token it
Token ID340
Feature activation-8.83e-03

Pos logit contributions
cious+0.025
 burgers+0.024
 counter+0.024
asting+0.023
lette+0.023
Neg logit contributions
 superstar-0.036
ite-0.034
 til-0.033
 rider-0.032
bird-0.032
Token the
Token ID262
Feature activation-2.41e-02

Pos logit contributions
 superstar+0.154
ite+0.146
bul+0.143
 til+0.138
 rider+0.136
Neg logit contributions
 burgers-0.098
cious-0.097
 counter-0.090
asting-0.087
lette-0.087
Token ball
Token ID2613
Feature activation-2.77e-02

Pos logit contributions
 superstar+0.041
ite+0.040
 til+0.038
bul+0.037
irl+0.037
Neg logit contributions
 counter-0.031
 burgers-0.031
cious-0.030
 food-0.029
 cooker-0.028
Token,
Token ID11
Feature activation-1.55e-02

Pos logit contributions
 superstar+0.047
 til+0.043
 flashes+0.043
bul+0.043
ite+0.042
Neg logit contributions
cious-0.035
 burgers-0.033
 counter-0.032
asting-0.032
ough-0.032
Token the
Token ID262
Feature activation-2.59e-02

Pos logit contributions
 superstar+0.025
ite+0.024
bul+0.023
 flashes+0.023
inkle+0.023
Neg logit contributions
cious-0.020
 burgers-0.020
 counter-0.019
 cooker-0.019
lette-0.018
Token k
Token ID479
Feature activation-1.53e-01

Pos logit contributions
 superstar+0.044
ite+0.044
 til+0.041
Because+0.041
irl+0.041
Neg logit contributions
 counter-0.033
 burgers-0.032
cious-0.031
 food-0.031
 stove-0.030
Tokenite
Token ID578
Feature activation-4.04e-01

Pos logit contributions
 superstar+0.203
 til+0.193
 fail+0.188
 pract+0.186
irl+0.177
Neg logit contributions
cious-0.244
ocado-0.243
lette-0.242
row-0.228
 burgers-0.224
Token and
Token ID290
Feature activation-3.37e-02

Pos logit contributions
 superstar+0.671
 til+0.608
ite+0.606
 flashes+0.605
bul+0.604
Neg logit contributions
cious-0.533
 burgers-0.494
 counter-0.485
lette-0.484
asting-0.483
Token the
Token ID262
Feature activation-2.43e-02

Pos logit contributions
 superstar+0.063
icycle+0.060
ite+0.060
bul+0.058
ane+0.057
Neg logit contributions
 burgers-0.037
cious-0.036
 counter-0.034
 bread-0.033
 food-0.032
Token bike
Token ID7161
Feature activation-1.18e-01

Pos logit contributions
 superstar+0.038
ite+0.035
 til+0.034
 flashes+0.033
bul+0.033
Neg logit contributions
cious-0.035
 burgers-0.034
 counter-0.033
lette-0.033
 cooker-0.032
Token.
Token ID13
Feature activation-2.22e-02

Pos logit contributions
 superstar+0.222
 flashes+0.206
bul+0.205
 til+0.201
ller+0.201
Neg logit contributions
cious-0.123
 counter-0.115
ough-0.112
 burgers-0.111
asting-0.109
Token The
Token ID383
Feature activation-8.04e-02

Pos logit contributions
 superstar+0.043
ite+0.043
bul+0.042
bird+0.040
 til+0.040
Neg logit contributions
cious-0.023
 burgers-0.021
 counter-0.020
 Dinner-0.020
 cooker-0.019
Token can
Token ID460
Feature activation-9.41e-02

Pos logit contributions
 Zoom+0.013
ite+0.013
 smiles+0.013
 superstar+0.013
 flash+0.013
Neg logit contributions
cious-0.026
asting-0.026
keeping-0.025
ough-0.024
row-0.024
Token we
Token ID356
Feature activation-5.78e-03

Pos logit contributions
 superstar+0.178
 til+0.166
ite+0.164
bul+0.162
 flashes+0.161
Neg logit contributions
cious-0.107
 burgers-0.098
 counter-0.095
lette-0.095
 cooker-0.092
Token have
Token ID423
Feature activation-7.60e-02

Pos logit contributions
 superstar+0.011
 flashes+0.011
 til+0.011
ays+0.011
ement+0.010
Neg logit contributions
lette-0.006
 counter-0.005
cious-0.005
 grill-0.005
ough-0.005
Token a
Token ID257
Feature activation-2.09e-02

Pos logit contributions
 superstar+0.135
ite+0.127
bird+0.123
 til+0.123
inkle+0.120
Neg logit contributions
 burgers-0.094
cious-0.093
 counter-0.088
lette-0.087
asting-0.084
Token k
Token ID479
Feature activation-1.51e-01

Pos logit contributions
 superstar+0.030
 flashes+0.029
ays+0.029
 til+0.029
ite+0.028
Neg logit contributions
cious-0.030
 counter-0.029
 burgers-0.029
 grill-0.028
 cooker-0.027
Tokenite
Token ID578
Feature activation-4.04e-01

Pos logit contributions
 til+0.181
 former+0.171
 fail+0.165
 moons+0.163
inkle+0.161
Neg logit contributions
 envelope-0.235
row-0.233
ful-0.231
lette-0.231
'-0.231
Token?"
Token ID1701
Feature activation-1.08e-01

Pos logit contributions
 superstar+0.688
 flashes+0.644
 conflicts+0.640
 Rhythm+0.637
yles+0.633
Neg logit contributions
 burgers-0.465
 counter-0.450
lette-0.442
 oh-0.431
 grill-0.429
Token Sara
Token ID24799
Feature activation+7.42e-02

Pos logit contributions
 superstar+0.196
ite+0.187
bul+0.182
 rider+0.176
 til+0.175
Neg logit contributions
cious-0.132
 burgers-0.125
 counter-0.123
 cooker-0.118
lette-0.113
Token asks
Token ID7893
Feature activation-2.04e-01

Pos logit contributions
cious+0.126
 counter+0.121
lette+0.118
 burgers+0.117
 counters+0.115
Neg logit contributions
 superstar-0.101
ite-0.092
bul-0.086
-0.086
 bolt-0.082
Token.
Token ID13
Feature activation-2.30e-02

Pos logit contributions
 superstar+0.258
bird+0.246
ite+0.246
 til+0.243
 flashes+0.232
Neg logit contributions
cious-0.351
 burgers-0.327
asting-0.326
lette-0.324
 cooker-0.318
Token "
Token ID366
Feature activation+1.60e-01

Pos logit contributions
 superstar+0.034
ite+0.031
 til+0.031
bird+0.030
 flashes+0.030
Neg logit contributions
cious-0.036
 burgers-0.033
lette-0.033
asting-0.033
 cooker-0.032
Token now
Token ID783
Feature activation-1.49e-01

Pos logit contributions
cious+0.258
 burgers+0.240
lette+0.237
asting+0.236
 counter+0.233
Neg logit contributions
 superstar-0.255
ite-0.236
 til-0.233
bird-0.228
 flashes-0.227
Token."
Token ID526
Feature activation+4.22e-02

Pos logit contributions
 superstar+0.227
ite+0.213
 til+0.208
 flashes+0.203
bird+0.202
Neg logit contributions
cious-0.225
 burgers-0.207
lette-0.206
asting-0.203
 cooker-0.200
Token
Token ID198
Feature activation-4.04e-02

Pos logit contributions
cious+0.052
 counter+0.051
 burgers+0.051
 cooker+0.048
 counters+0.048
Neg logit contributions
 superstar-0.074
bul-0.069
ite-0.069
 til-0.067
 rider-0.067
Token
Token ID198
Feature activation-4.03e-02

Pos logit contributions
 superstar+0.069
ite+0.063
bul+0.062
 flashes+0.060
 til+0.060
Neg logit contributions
cious-0.053
 burgers-0.051
asting-0.050
 counter-0.049
lette-0.048
TokenL
Token ID43
Feature activation+1.96e-02

Pos logit contributions
 superstar+0.063
ite+0.060
 til+0.057
 Rhythm+0.057
bird+0.056
Neg logit contributions
cious-0.060
lette-0.055
 burgers-0.055
 cooker-0.053
asting-0.053
Tokenily
Token ID813
Feature activation+2.24e-01

Pos logit contributions
cious+0.023
 burgers+0.021
lette+0.018
 hamb+0.018
 cooker+0.018
Neg logit contributions
 superstar-0.038
bird-0.036
ite-0.036
ays-0.035
azz-0.034
Token and
Token ID290
Feature activation-3.76e-02

Pos logit contributions
cious+0.374
 burgers+0.351
lette+0.348
asting+0.347
 cooker+0.340
Neg logit contributions
 superstar-0.304
ite-0.282
 til-0.279
bird-0.270
 flashes-0.268
Token Ben
Token ID3932
Feature activation+3.89e-03

Pos logit contributions
 superstar+0.069
ite+0.066
bul+0.065
 til+0.063
bird+0.062
Neg logit contributions
cious-0.045
 burgers-0.042
lette-0.041
 counter-0.040
 cooker-0.040
Token started
Token ID2067
Feature activation+1.10e-02

Pos logit contributions
cious+0.007
 burgers+0.007
asting+0.006
lette+0.006
 cooker+0.006
Neg logit contributions
 superstar-0.005
 til-0.004
ite-0.004
 flashes-0.004
bird-0.004
Token to
Token ID284
Feature activation-3.60e-03

Pos logit contributions
cious+0.014
 burgers+0.013
 counter+0.013
asting+0.013
lette+0.013
Neg logit contributions
 superstar-0.019
ite-0.018
bird-0.017
 til-0.017
bul-0.017
Token panic
Token ID13619
Feature activation-9.33e-02

Pos logit contributions
 superstar+0.005
ite+0.005
 til+0.004
bird+0.004
 flashes+0.004
Neg logit contributions
cious-0.006
 burgers-0.006
lette-0.006
asting-0.005
 cooker-0.005
Token.
Token ID13
Feature activation-2.18e-02

Pos logit contributions
cious+0.132
 burgers+0.122
lette+0.121
asting+0.120
 counter+0.117
Neg logit contributions
 superstar-0.153
ite-0.144
 til-0.141
 flashes-0.137
bul-0.136
Token They
Token ID1119
Feature activation+5.07e-02

Pos logit contributions
 superstar+0.038
ite+0.036
bul+0.035
bird+0.035
inkle+0.035
Neg logit contributions
cious-0.026
 burgers-0.026
 counter-0.025
 Dinner-0.024
 cooker-0.024
Token played
Token ID2826
Feature activation-1.16e-01

Pos logit contributions
cious+0.061
 burgers+0.059
asting+0.057
lette+0.054
 bread+0.051
Neg logit contributions
 superstar-0.088
ite-0.084
ller-0.083
bul-0.083
inkle-0.082
Token with
Token ID351
Feature activation-8.18e-02

Pos logit contributions
 superstar+0.157
ite+0.148
bul+0.142
 flashes+0.139
 til+0.139
Neg logit contributions
cious-0.193
 burgers-0.183
asting-0.179
lette-0.178
 counter-0.175
Token other
Token ID584
Feature activation+7.05e-02

Pos logit contributions
 superstar+0.161
ite+0.153
icycle+0.149
bul+0.149
+0.148
Neg logit contributions
 burgers-0.089
cious-0.085
 food-0.083
 bread-0.081
 dirt-0.079
Token plants
Token ID6134
Feature activation+2.36e-01

Pos logit contributions
cious+0.094
lette+0.086
 burgers+0.085
asting+0.084
 cooker+0.083
Neg logit contributions
 superstar-0.119
ite-0.112
 til-0.111
bird-0.109
 flashes-0.109
Token that
Token ID326
Feature activation-5.35e-02

Pos logit contributions
cious+0.296
 burgers+0.279
asting+0.265
lette+0.262
 counter+0.261
Neg logit contributions
 superstar-0.403
 flashes-0.371
ller-0.371
bul-0.367
ite-0.365
Token were
Token ID547
Feature activation+6.13e-03

Pos logit contributions
 superstar+0.088
ite+0.083
 til+0.081
bird+0.079
bul+0.079
Neg logit contributions
cious-0.073
 burgers-0.069
asting-0.066
lette-0.066
 cooker-0.065
Token nice
Token ID3621
Feature activation+1.31e-01

Pos logit contributions
cious+0.009
 burgers+0.008
asting+0.008
lette+0.008
 counter+0.008
Neg logit contributions
 superstar-0.010
ite-0.009
bird-0.009
 til-0.009
bul-0.009
Token and
Token ID290
Feature activation-3.11e-02

Pos logit contributions
cious+0.207
 burgers+0.193
lette+0.191
asting+0.189
 cooker+0.187
Neg logit contributions
 superstar-0.190
ite-0.177
 til-0.172
bird-0.169
 flashes-0.168
Token soft
Token ID2705
Feature activation+8.28e-02

Pos logit contributions
 superstar+0.050
ite+0.047
 til+0.047
 flashes+0.046
bul+0.045
Neg logit contributions
cious-0.043
lette-0.040
 burgers-0.039
 counter-0.039
 cooker-0.039
Token happy
Token ID3772
Feature activation+3.96e-02

Pos logit contributions
cious+0.018
 burgers+0.016
lette+0.016
asting+0.016
 cooker+0.016
Neg logit contributions
 superstar-0.019
ite-0.017
 til-0.017
bird-0.017
bul-0.017
Token that
Token ID326
Feature activation-3.66e-02

Pos logit contributions
cious+0.057
 burgers+0.053
lette+0.052
 counter+0.051
 cooker+0.051
Neg logit contributions
 superstar-0.062
ite-0.060
 til-0.057
 Rhythm-0.056
 flashes-0.056
Token the
Token ID262
Feature activation-6.91e-03

Pos logit contributions
 superstar+0.056
ite+0.053
 til+0.051
 flashes+0.050
bird+0.050
Neg logit contributions
cious-0.054
 burgers-0.051
asting-0.050
lette-0.049
 counter-0.049
Token bird
Token ID6512
Feature activation-1.13e-01

Pos logit contributions
 superstar+0.010
ays+0.010
ite+0.010
 Rhythm+0.010
 til+0.010
Neg logit contributions
cious-0.010
 counter-0.009
 food-0.009
lette-0.009
 dirt-0.009
Token was
Token ID373
Feature activation-4.33e-04

Pos logit contributions
 superstar+0.199
 flashes+0.187
ite+0.186
 Rhythm+0.184
 til+0.184
Neg logit contributions
cious-0.134
asting-0.122
lette-0.122
row-0.111
 burgers-0.111
Token safe
Token ID3338
Feature activation+2.38e-01

Pos logit contributions
 superstar+0.001
ite+0.001
 til+0.001
bird+0.001
bul+0.001
Neg logit contributions
cious-0.001
 burgers-0.001
 cooker-0.001
 counter-0.001
lette-0.001
Token and
Token ID290
Feature activation-4.00e-03

Pos logit contributions
cious+0.368
 burgers+0.341
lette+0.340
asting+0.336
 counter+0.332
Neg logit contributions
 superstar-0.350
ite-0.328
 til-0.321
 flashes-0.314
 Rhythm-0.311
Token the
Token ID262
Feature activation-6.16e-03

Pos logit contributions
 superstar+0.007
ite+0.007
 til+0.007
bird+0.006
 flashes+0.006
Neg logit contributions
cious-0.005
 burgers-0.005
lette-0.004
asting-0.004
 cooker-0.004
Token bell
Token ID8966
Feature activation-1.10e-01

Pos logit contributions
 superstar+0.009
ite+0.009
 til+0.009
bul+0.008
 Rhythm+0.008
Neg logit contributions
cious-0.009
lette-0.008
 burgers-0.008
 counter-0.008
asting-0.008
Token was
Token ID373
Feature activation-1.13e-03

Pos logit contributions
 superstar+0.186
 til+0.178
 Rhythm+0.176
 flashes+0.170
+0.168
Neg logit contributions
cious-0.140
lette-0.133
asting-0.126
row-0.119
 counter-0.118
Token ringing
Token ID32333
Feature activation-2.06e-01

Pos logit contributions
 superstar+0.002
ite+0.002
 til+0.002
bird+0.002
bul+0.002
Neg logit contributions
cious-0.002
 burgers-0.002
lette-0.002
asting-0.001
 cooker-0.001
Token The
Token ID383
Feature activation-6.21e-02

Pos logit contributions
cious+0.018
 burgers+0.017
lette+0.016
asting+0.016
 cooker+0.016
Neg logit contributions
 superstar-0.016
ite-0.015
 til-0.014
bird-0.014
 flashes-0.014
Token bug
Token ID5434
Feature activation-4.51e-02

Pos logit contributions
 superstar+0.106
ite+0.100
 til+0.099
bul+0.097
 Rhythm+0.096
Neg logit contributions
cious-0.081
 burgers-0.077
asting-0.075
 cooker-0.074
 counter-0.074
Token spl
Token ID4328
Feature activation+5.82e-02

Pos logit contributions
 superstar+0.071
ite+0.067
 til+0.065
 flashes+0.064
bul+0.064
Neg logit contributions
cious-0.065
 burgers-0.059
lette-0.059
asting-0.058
 cooker-0.057
Tokenashed
Token ID5263
Feature activation+5.45e-02

Pos logit contributions
cious+0.089
 counter+0.087
 burgers+0.084
 cooker+0.081
lette+0.081
Neg logit contributions
 superstar-0.086
ite-0.085
bird-0.077
 til-0.077
 flashes-0.076
Token and
Token ID290
Feature activation-5.33e-03

Pos logit contributions
cious+0.104
lette+0.096
 counter+0.093
 burgers+0.092
 cooker+0.091
Neg logit contributions
 superstar-0.063
ite-0.059
 til-0.058
bird-0.057
 grace-0.057
Token sw
Token ID1509
Feature activation+2.02e-01

Pos logit contributions
 superstar+0.008
ite+0.007
 til+0.007
 flashes+0.007
bird+0.007
Neg logit contributions
cious-0.008
lette-0.008
 burgers-0.008
 counter-0.008
 cooker-0.008
Tokenam
Token ID321
Feature activation+4.00e-02

Pos logit contributions
 counter+0.356
cious+0.341
 cooker+0.337
 burgers+0.334
lette+0.322
Neg logit contributions
 superstar-0.245
ite-0.240
 til-0.232
inkle-0.229
 flashes-0.220
Token and
Token ID290
Feature activation-5.66e-03

Pos logit contributions
cious+0.068
lette+0.065
 burgers+0.064
 cooker+0.062
 counter+0.062
Neg logit contributions
 superstar-0.054
ite-0.048
 til-0.048
 grace-0.048
bird-0.046
Token had
Token ID550
Feature activation+5.31e-02

Pos logit contributions
 superstar+0.009
ite+0.008
 til+0.008
bird+0.008
 flashes+0.008
Neg logit contributions
cious-0.009
lette-0.008
 burgers-0.008
 counter-0.008
 cooker-0.008
Token lots
Token ID6041
Feature activation+6.12e-02

Pos logit contributions
cious+0.087
 cooker+0.085
lette+0.085
 counter+0.081
 burgers+0.081
Neg logit contributions
 superstar-0.071
 smiles-0.064
 Spark-0.063
 flashes-0.063
 grace-0.062
Token of
Token ID286
Feature activation-2.42e-02

Pos logit contributions
cious+0.098
 burgers+0.092
asting+0.091
lette+0.090
 cooker+0.088
Neg logit contributions
 superstar-0.087
ite-0.081
 til-0.079
bird-0.077
bul-0.077
Token She
Token ID1375
Feature activation-5.08e-02

Pos logit contributions
 superstar+0.022
ite+0.021
bul+0.019
bird+0.019
 til+0.019
Neg logit contributions
cious-0.021
 burgers-0.020
 counter-0.020
 cooker-0.020
lette-0.019
Token said
Token ID531
Feature activation-2.57e-02

Pos logit contributions
 superstar+0.099
ite+0.094
ane+0.092
inkle+0.090
ller+0.089
Neg logit contributions
cious-0.053
 burgers-0.050
 counter-0.046
 oh-0.045
 scrambled-0.045
Token,
Token ID11
Feature activation-1.38e-02

Pos logit contributions
 superstar+0.033
 til+0.031
ite+0.031
bird+0.030
 flashes+0.030
Neg logit contributions
cious-0.044
asting-0.041
lette-0.041
 burgers-0.041
 cooker-0.040
Token "
Token ID366
Feature activation+1.63e-01

Pos logit contributions
 superstar+0.024
 til+0.023
 Rhythm+0.022
bird+0.022
 Spark+0.022
Neg logit contributions
cious-0.017
asting-0.017
keeping-0.017
lette-0.016
 burgers-0.016
TokenL
Token ID43
Feature activation+2.91e-02

Pos logit contributions
cious+0.200
 oh+0.179
asting+0.179
ough+0.176
lette+0.175
Neg logit contributions
 superstar-0.290
inkle-0.277
 til-0.275
bul-0.267
 flashes-0.266
Tokenily
Token ID813
Feature activation+2.30e-01

Pos logit contributions
cious+0.027
 burgers+0.025
lette+0.022
 cooker+0.021
 counter+0.021
Neg logit contributions
 superstar-0.062
ite-0.062
bird-0.059
 til-0.058
bul-0.057
Token,
Token ID11
Feature activation-1.37e-02

Pos logit contributions
cious+0.405
 burgers+0.385
asting+0.382
lette+0.376
 cooker+0.371
Neg logit contributions
 superstar-0.283
ite-0.281
bird-0.264
 til-0.259
 Rhythm-0.253
Token please
Token ID3387
Feature activation+5.06e-02

Pos logit contributions
 superstar+0.019
 til+0.018
 Spark+0.018
 ghost+0.017
 flash+0.017
Neg logit contributions
cious-0.023
keeping-0.022
lette-0.021
 burgers-0.021
asting-0.021
Token don
Token ID836
Feature activation-2.60e-03

Pos logit contributions
cious+0.078
 burgers+0.073
lette+0.072
asting+0.071
 cooker+0.070
Neg logit contributions
 superstar-0.075
ite-0.071
 til-0.069
bird-0.067
 flashes-0.067
Token't
Token ID470
Feature activation-2.23e-01

Pos logit contributions
 superstar+0.002
ite+0.002
 til+0.002
bird+0.001
bul+0.001
Neg logit contributions
cious-0.006
 burgers-0.006
lette-0.006
asting-0.006
 counter-0.006
Token yell
Token ID28946
Feature activation-4.22e-02

Pos logit contributions
 superstar+0.386
ite+0.357
 til+0.356
bul+0.351
 flashes+0.348
Neg logit contributions
cious-0.295
lette-0.268
 burgers-0.267
asting-0.266
ough-0.259
Token the
Token ID262
Feature activation-1.50e-02

Pos logit contributions
 burgers+0.003
cious+0.002
 oh+0.002
 counter+0.002
 cooker+0.002
Neg logit contributions
 superstar-0.004
ite-0.004
 til-0.003
bird-0.003
bul-0.003
Token pot
Token ID1787
Feature activation+2.03e-01

Pos logit contributions
 superstar+0.023
 til+0.023
 flashes+0.023
 whist+0.022
 unicorn+0.022
Neg logit contributions
cious-0.021
keeping-0.020
ough-0.020
asting-0.019
lette-0.019
Token started
Token ID2067
Feature activation+2.23e-02

Pos logit contributions
cious+0.310
 burgers+0.286
lette+0.283
asting+0.279
 cooker+0.276
Neg logit contributions
 superstar-0.305
ite-0.286
 til-0.278
bird-0.277
 flashes-0.274
Token to
Token ID284
Feature activation+8.88e-03

Pos logit contributions
cious+0.028
 burgers+0.027
lette+0.025
asting+0.025
 counter+0.025
Neg logit contributions
 superstar-0.040
ite-0.039
 rider-0.037
 Rhythm-0.036
 til-0.036
Token rise
Token ID4485
Feature activation+6.48e-03

Pos logit contributions
cious+0.013
 burgers+0.012
asting+0.012
 counter+0.012
lette+0.012
Neg logit contributions
 superstar-0.014
ite-0.013
 til-0.012
bird-0.012
 Rhythm-0.012
Token.
Token ID13
Feature activation+4.73e-03

Pos logit contributions
cious+0.009
asting+0.009
 burgers+0.009
lette+0.008
 counter+0.008
Neg logit contributions
 superstar-0.010
ite-0.010
 til-0.009
 Rhythm-0.009
bird-0.009
Token It
Token ID632
Feature activation+3.05e-02

Pos logit contributions
cious+0.008
lette+0.008
asting+0.008
 burgers+0.008
 cooker+0.007
Neg logit contributions
 superstar-0.006
 til-0.006
ite-0.005
 flashes-0.005
bird-0.005
Token was
Token ID373
Feature activation-9.13e-03

Pos logit contributions
cious+0.037
 burgers+0.035
asting+0.035
lette+0.034
 counter+0.032
Neg logit contributions
 superstar-0.053
ite-0.052
 Rhythm-0.049
 rider-0.048
Brian-0.048
Token like
Token ID588
Feature activation-7.50e-02

Pos logit contributions
 superstar+0.015
ite+0.014
 til+0.013
bird+0.013
 Rhythm+0.013
Neg logit contributions
cious-0.013
 burgers-0.012
asting-0.012
lette-0.012
 counter-0.012
Token magic
Token ID5536
Feature activation-9.99e-02

Pos logit contributions
 superstar+0.109
 til+0.099
ite+0.098
 flashes+0.098
bul+0.096
Neg logit contributions
cious-0.120
lette-0.110
asting-0.108
 burgers-0.107
 cooker-0.106
Token!
Token ID0
Feature activation-1.30e-02

Pos logit contributions
 superstar+0.142
ite+0.130
 til+0.129
bul+0.127
bird+0.127
Neg logit contributions
cious-0.161
lette-0.148
 burgers-0.147
asting-0.145
ough-0.143
TokenM
Token ID44
Feature activation+1.57e-01

Pos logit contributions
Because+0.036
Zero+0.035
Brian+0.034
ite+0.034
bird+0.034
Neg logit contributions
 burgers-0.049
lette-0.048
cious-0.047
 cooker-0.047
asting-0.046
Tokenummy
Token ID13513
Feature activation-2.52e-02

Pos logit contributions
cious+0.180
lette+0.172
 counter+0.160
apa+0.157
 burgers+0.156
Neg logit contributions
 superstar-0.294
 flashes-0.276
 til-0.272
 Rhythm-0.269
bul-0.264
Token and
Token ID290
Feature activation-2.31e-02

Pos logit contributions
 superstar+0.036
ite+0.034
 til+0.033
 flashes+0.032
bul+0.032
Neg logit contributions
cious-0.040
 burgers-0.037
lette-0.037
asting-0.036
 cooker-0.036
Token Daddy
Token ID26926
Feature activation+2.16e-02

Pos logit contributions
 superstar+0.035
 til+0.033
bird+0.032
 flashes+0.031
ite+0.031
Neg logit contributions
cious-0.036
asting-0.034
lette-0.033
ough-0.032
 burgers-0.032
Token laughed
Token ID13818
Feature activation-7.08e-02

Pos logit contributions
cious+0.033
 burgers+0.031
lette+0.031
asting+0.030
 cooker+0.030
Neg logit contributions
 superstar-0.032
ite-0.030
 til-0.029
 flashes-0.028
bul-0.028
Token and
Token ID290
Feature activation-2.19e-02

Pos logit contributions
 superstar+0.105
ite+0.100
 til+0.096
bird+0.094
bul+0.094
Neg logit contributions
cious-0.108
 burgers-0.101
lette-0.100
asting-0.099
 cooker-0.097
Token hugged
Token ID41130
Feature activation-4.02e-02

Pos logit contributions
 superstar+0.032
ite+0.030
 til+0.029
bird+0.029
bul+0.028
Neg logit contributions
cious-0.034
 burgers-0.032
lette-0.031
asting-0.031
 cooker-0.031
Token her
Token ID607
Feature activation+8.39e-03

Pos logit contributions
 superstar+0.055
bird+0.050
 til+0.050
 flashes+0.048
bul+0.048
Neg logit contributions
cious-0.068
lette-0.063
asting-0.063
 burgers-0.062
ough-0.062
Token.
Token ID13
Feature activation-1.00e-02

Pos logit contributions
cious+0.012
 burgers+0.011
lette+0.011
asting+0.011
 counter+0.011
Neg logit contributions
 superstar-0.013
ite-0.012
 til-0.012
 flashes-0.012
bul-0.012
Token They
Token ID1119
Feature activation+5.50e-02

Pos logit contributions
 superstar+0.015
 Sometimes+0.013
 til+0.013
bird+0.013
 flashes+0.013
Neg logit contributions
cious-0.017
asting-0.016
lette-0.016
row-0.015
ough-0.015
Token said
Token ID531
Feature activation-2.14e-02

Pos logit contributions
cious+0.070
 burgers+0.066
asting+0.066
lette+0.063
 cooker+0.061
Neg logit contributions
 superstar-0.093
ite-0.090
 Rhythm-0.086
ller-0.086
 til-0.085
Token,
Token ID11
Feature activation+6.95e-03

Pos logit contributions
cious+0.428
lette+0.384
 burgers+0.382
OU+0.374
asting+0.373
Neg logit contributions
 superstar-0.213
ite-0.211
bul-0.205
bird-0.186
 til-0.186
Token stir
Token ID11240
Feature activation+1.10e-01

Pos logit contributions
cious+0.011
lette+0.010
 burgers+0.010
asting+0.010
 cooker+0.010
Neg logit contributions
 superstar-0.010
 til-0.009
ite-0.009
bul-0.009
 flashes-0.009
Token the
Token ID262
Feature activation-9.45e-03

Pos logit contributions
cious+0.195
keeping+0.176
lette+0.175
 cooker+0.169
 Dinner+0.169
Neg logit contributions
inkle-0.129
 superstar-0.127
 til-0.126
 flashes-0.126
bul-0.123
Token batter
Token ID4984
Feature activation+9.63e-02

Pos logit contributions
 superstar+0.015
bird+0.015
 unicorn+0.015
ite+0.015
 cartoon+0.014
Neg logit contributions
cious-0.012
keeping-0.011
ough-0.010
asting-0.010
row-0.010
Token and
Token ID290
Feature activation-7.46e-03

Pos logit contributions
cious+0.173
keeping+0.155
lette+0.155
 burgers+0.154
 oh+0.150
Neg logit contributions
 superstar-0.113
bul-0.110
inkle-0.106
ite-0.106
 til-0.105
Token fill
Token ID6070
Feature activation+5.55e-02

Pos logit contributions
 superstar+0.011
 til+0.010
ite+0.010
 flashes+0.010
bul+0.010
Neg logit contributions
cious-0.011
 burgers-0.010
lette-0.010
asting-0.010
 cooker-0.010
Token a
Token ID257
Feature activation-4.97e-03

Pos logit contributions
cious+0.099
lette+0.089
ough+0.087
row+0.087
 cooker+0.086
Neg logit contributions
 superstar-0.063
 til-0.063
 flashes-0.063
inkle-0.061
bird-0.059
Token pan
Token ID3425
Feature activation+2.46e-01

Pos logit contributions
 superstar+0.007
 flash+0.007
 unicorn+0.006
ite+0.006
inkle+0.006
Neg logit contributions
cious-0.008
keeping-0.008
ough-0.008
lette-0.007
asting-0.007
Token with
Token ID351
Feature activation-7.34e-02

Pos logit contributions
cious+0.435
 burgers+0.390
lette+0.386
asting+0.383
ough+0.378
Neg logit contributions
 superstar-0.304
ite-0.292
bul-0.282
bird-0.278
 til-0.276
Token the
Token ID262
Feature activation-7.94e-03

Pos logit contributions
 superstar+0.105
 flashes+0.103
bul+0.102
 flash+0.099
 til+0.099
Neg logit contributions
cious-0.112
lette-0.097
 cooker-0.096
ough-0.095
row-0.095
Token batter
Token ID4984
Feature activation+9.67e-02

Pos logit contributions
 superstar+0.013
ite+0.012
 unicorn+0.012
bird+0.012
 flashes+0.012
Neg logit contributions
cious-0.010
keeping-0.009
ough-0.009
asting-0.009
row-0.009
Token his
Token ID465
Feature activation+3.07e-02

Pos logit contributions
 superstar+0.106
ite+0.100
 til+0.097
 flashes+0.096
bul+0.096
Neg logit contributions
cious-0.117
lette-0.107
 burgers-0.107
 cooker-0.104
 counter-0.103
Token it
Token ID340
Feature activation+7.16e-03

Pos logit contributions
cious+0.041
 burgers+0.038
lette+0.037
asting+0.037
 cooker+0.036
Neg logit contributions
 superstar-0.053
ite-0.050
 til-0.048
bird-0.047
bul-0.047
Tokenches
Token ID2052
Feature activation-5.55e-02

Pos logit contributions
cious+0.011
asting+0.010
 burgers+0.010
lette+0.010
 cooker+0.010
Neg logit contributions
 superstar-0.011
ite-0.010
 til-0.010
bird-0.010
 Rhythm-0.010
Token disappearing
Token ID28867
Feature activation-9.38e-02

Pos logit contributions
 superstar+0.079
ite+0.074
 til+0.072
bird+0.070
 flashes+0.070
Neg logit contributions
cious-0.089
 burgers-0.083
lette-0.082
asting-0.081
 cooker-0.080
Token.
Token ID13
Feature activation+8.49e-03

Pos logit contributions
 superstar+0.144
ite+0.134
 til+0.132
bird+0.128
 flashes+0.128
Neg logit contributions
cious-0.140
 burgers-0.130
lette-0.129
asting-0.127
 cooker-0.125
Token Suddenly
Token ID24975
Feature activation-2.17e-02

Pos logit contributions
cious+0.015
asting+0.014
lette+0.014
 burgers+0.014
 cooker+0.013
Neg logit contributions
 superstar-0.011
 til-0.010
ite-0.010
bird-0.010
 flashes-0.010
Token,
Token ID11
Feature activation+7.26e-03

Pos logit contributions
 superstar+0.035
ite+0.033
 til+0.031
bird+0.031
bul+0.031
Neg logit contributions
cious-0.031
 burgers-0.029
lette-0.028
asting-0.027
 counter-0.027
Token he
Token ID339
Feature activation-1.39e-02

Pos logit contributions
 burgers+0.009
 oh+0.008
cious+0.008
 counter+0.008
 food+0.008
Neg logit contributions
 superstar-0.013
ite-0.013
inkle-0.012
 til-0.012
bird-0.012
Token heard
Token ID2982
Feature activation+6.59e-03

Pos logit contributions
 superstar+0.024
ite+0.024
bul+0.023
 Rhythm+0.023
 rider+0.022
Neg logit contributions
cious-0.018
 burgers-0.016
asting-0.016
lette-0.015
 cooker-0.015
Token a
Token ID257
Feature activation-4.16e-03

Pos logit contributions
cious+0.010
 burgers+0.009
lette+0.009
asting+0.009
 cooker+0.009
Neg logit contributions
 superstar-0.010
ite-0.010
 til-0.009
bird-0.009
 flashes-0.009
Token voice
Token ID3809
Feature activation-2.74e-02

Pos logit contributions
 superstar+0.005
 til+0.005
 flash+0.005
 flashes+0.005
bird+0.005
Neg logit contributions
cious-0.007
lette-0.007
ough-0.007
keeping-0.007
asting-0.007
Token you
Token ID345
Feature activation+7.85e-02

Pos logit contributions
 superstar+0.071
ite+0.067
 til+0.065
bird+0.064
 Rhythm+0.064
Neg logit contributions
cious-0.053
lette-0.048
 burgers-0.048
asting-0.047
 cooker-0.047
Token.
Token ID13
Feature activation-1.76e-02

Pos logit contributions
cious+0.147
lette+0.139
 cooker+0.136
 burgers+0.136
 counters+0.134
Neg logit contributions
 superstar-0.089
 Rhythm-0.077
 Spark-0.076
ite-0.074
 unicorn-0.074
Token Now
Token ID2735
Feature activation+2.51e-02

Pos logit contributions
 superstar+0.028
ite+0.026
 til+0.025
bird+0.025
 flashes+0.025
Neg logit contributions
cious-0.025
 burgers-0.024
lette-0.023
asting-0.023
 counter-0.023
Token,
Token ID11
Feature activation-1.44e-02

Pos logit contributions
cious+0.043
keeping+0.039
asting+0.039
lette+0.039
 burgers+0.039
Neg logit contributions
 superstar-0.034
 Spark-0.031
ite-0.031
 til-0.031
 Rhythm-0.030
Token why
Token ID1521
Feature activation+1.06e-01

Pos logit contributions
 superstar+0.026
ite+0.024
 til+0.024
 Rhythm+0.024
bird+0.023
Neg logit contributions
cious-0.018
 burgers-0.016
lette-0.016
keeping-0.015
 counter-0.015
Token don
Token ID836
Feature activation-8.36e-03

Pos logit contributions
cious+0.148
 burgers+0.138
asting+0.136
lette+0.135
 cooker+0.132
Neg logit contributions
 superstar-0.171
ite-0.162
 til-0.158
bird-0.156
bul-0.155
Token't
Token ID470
Feature activation-2.26e-01

Pos logit contributions
 superstar+0.008
 til+0.007
ite+0.007
bul+0.007
bird+0.007
Neg logit contributions
cious-0.016
 burgers-0.016
asting-0.016
 counter-0.016
 cooker-0.016
Token you
Token ID345
Feature activation+7.86e-02

Pos logit contributions
 superstar+0.434
ite+0.412
 til+0.406
bul+0.397
 flashes+0.396
Neg logit contributions
cious-0.248
 burgers-0.227
lette-0.222
asting-0.217
 cooker-0.215
Token play
Token ID711
Feature activation-1.48e-01

Pos logit contributions
asting+0.088
lette+0.087
ough+0.086
cious+0.083
 burgers+0.083
Neg logit contributions
 superstar-0.144
bul-0.139
 til-0.139
 flashes-0.135
 unicorn-0.134
Token together
Token ID1978
Feature activation-1.57e-01

Pos logit contributions
ite+0.239
bul+0.236
 superstar+0.236
 flashes+0.230
stretched+0.227
Neg logit contributions
 burgers-0.183
 oh-0.182
 food-0.178
cious-0.172
 counter-0.172
Token nicely
Token ID16576
Feature activation+5.09e-02

Pos logit contributions
 superstar+0.231
 flashes+0.219
 til+0.217
ite+0.211
 Rhythm+0.210
Neg logit contributions
asting-0.223
cious-0.221
 burgers-0.217
lette-0.214
 counter-0.213
Token way
Token ID835
Feature activation+1.74e-01

Pos logit contributions
 superstar+0.125
ite+0.117
 til+0.113
bul+0.113
bird+0.112
Neg logit contributions
cious-0.065
 burgers-0.063
lette-0.060
 counter-0.058
ough-0.057
Token to
Token ID284
Feature activation+1.04e-02

Pos logit contributions
cious+0.289
lette+0.267
 burgers+0.264
 counter+0.260
 cooker+0.256
Neg logit contributions
ite-0.239
 superstar-0.234
bul-0.219
 flashes-0.219
bird-0.217
Token play
Token ID711
Feature activation-1.45e-01

Pos logit contributions
cious+0.014
 burgers+0.013
asting+0.013
lette+0.013
 counter+0.013
Neg logit contributions
 superstar-0.018
ite-0.017
 til-0.016
bul-0.016
bird-0.016
Token on
Token ID319
Feature activation-7.44e-02

Pos logit contributions
 superstar+0.202
ite+0.187
 til+0.184
bird+0.179
 flashes+0.178
Neg logit contributions
cious-0.239
lette-0.223
 burgers-0.220
asting-0.218
 cooker-0.216
Token the
Token ID262
Feature activation-1.38e-02

Pos logit contributions
 superstar+0.124
ite+0.117
bul+0.112
 til+0.112
bird+0.111
Neg logit contributions
cious-0.102
 burgers-0.096
 counter-0.093
asting-0.092
lette-0.089
Token tall
Token ID7331
Feature activation-3.74e-02

Pos logit contributions
 superstar+0.014
 rider+0.013
ite+0.013
 til+0.012
 Rhythm+0.012
Neg logit contributions
lette-0.026
cious-0.026
apa-0.026
 cooker-0.025
row-0.025
Token rail
Token ID6787
Feature activation-2.34e-02

Pos logit contributions
ite+0.050
 superstar+0.050
 flashes+0.048
bird+0.048
 flash+0.047
Neg logit contributions
cious-0.062
lette-0.060
asting-0.057
apa-0.056
 cooker-0.055
Token near
Token ID1474
Feature activation+2.16e-02

Pos logit contributions
 superstar+0.036
ite+0.033
 til+0.033
 flashes+0.032
bird+0.032
Neg logit contributions
cious-0.035
 burgers-0.033
lette-0.032
asting-0.032
 counter-0.032
Token the
Token ID262
Feature activation-1.33e-02

Pos logit contributions
cious+0.036
lette+0.033
 cooker+0.033
 burgers+0.033
asting+0.032
Neg logit contributions
 superstar-0.028
bird-0.027
ite-0.026
 Rhythm-0.026
 flashes-0.026
Token pond
Token ID16723
Feature activation+8.54e-02

Pos logit contributions
 superstar+0.019
ite+0.019
 flashes+0.018
 til+0.018
bird+0.018
Neg logit contributions
cious-0.020
lette-0.019
 cooker-0.018
 burgers-0.018
asting-0.018
Token.
Token ID13
Feature activation+6.48e-03

Pos logit contributions
cious+0.161
lette+0.150
 cooker+0.147
 burgers+0.146
 counters+0.144
Neg logit contributions
 superstar-0.093
ite-0.092
bird-0.085
 til-0.083
 flashes-0.082
Token Tom
Token ID4186
Feature activation+8.05e-04

Pos logit contributions
 superstar+0.019
ite+0.018
 til+0.017
 rider+0.017
bul+0.017
Neg logit contributions
cious-0.016
 burgers-0.014
lette-0.014
 counter-0.014
asting-0.014
Token hiding
Token ID11816
Feature activation+4.18e-03

Pos logit contributions
cious+0.001
 burgers+0.001
lette+0.001
 cooker+0.001
asting+0.001
Neg logit contributions
 superstar-0.001
 til-0.001
ite-0.001
bird-0.001
 flashes-0.001
Token behind
Token ID2157
Feature activation+5.83e-03

Pos logit contributions
cious+0.006
 burgers+0.006
lette+0.006
asting+0.005
 cooker+0.005
Neg logit contributions
 superstar-0.007
ite-0.006
 til-0.006
bird-0.006
 flashes-0.006
Token it
Token ID340
Feature activation+2.01e-03

Pos logit contributions
cious+0.007
 counter+0.007
 burgers+0.007
lette+0.007
asting+0.006
Neg logit contributions
 superstar-0.010
ite-0.010
 til-0.009
bul-0.009
 rider-0.009
Token.
Token ID13
Feature activation-3.42e-03

Pos logit contributions
cious+0.003
 burgers+0.003
lette+0.003
asting+0.003
 cooker+0.003
Neg logit contributions
 superstar-0.003
ite-0.003
 til-0.003
bird-0.003
 flashes-0.003
Token
Token ID198
Feature activation-2.46e-02

Pos logit contributions
 superstar+0.004
ite+0.004
 til+0.004
bird+0.004
 flashes+0.004
Neg logit contributions
cious-0.006
asting-0.006
lette-0.006
 burgers-0.006
 cooker-0.005
Token
Token ID198
Feature activation-2.55e-02

Pos logit contributions
ite+0.024
 superstar+0.023
 til+0.023
 Rhythm+0.023
bird+0.023
Neg logit contributions
cious-0.048
lette-0.047
 burgers-0.047
 cooker-0.046
asting-0.045
Token"
Token ID1
Feature activation+2.18e-01

Pos logit contributions
ite+0.040
Because+0.039
Zero+0.038
Brian+0.036
 rider+0.035
Neg logit contributions
 burgers-0.039
lette-0.038
 cooker-0.037
 stove-0.036
 hamb-0.036
TokenBoo
Token ID46120
Feature activation-1.23e-01

Pos logit contributions
cious+0.376
 burgers+0.354
lette+0.351
asting+0.349
 cooker+0.343
Neg logit contributions
 superstar-0.287
ite-0.265
bird-0.258
 til-0.255
 flashes-0.253
Token!"
Token ID2474
Feature activation+7.70e-02

Pos logit contributions
 superstar+0.202
 til+0.192
ite+0.185
ller+0.180
 flashes+0.180
Neg logit contributions
cious-0.167
asting-0.161
lette-0.160
 burgers-0.160
 cooker-0.154
Token Tom
Token ID4186
Feature activation-1.50e-03

Pos logit contributions
cious+0.118
 burgers+0.109
lette+0.109
asting+0.108
 cooker+0.106
Neg logit contributions
 superstar-0.115
ite-0.108
 til-0.106
bird-0.103
 flashes-0.103
Token,
Token ID11
Feature activation+4.60e-03

Pos logit contributions
cious+0.078
lette+0.076
keeping+0.073
 burgers+0.073
asting+0.072
Neg logit contributions
 superstar-0.052
ite-0.046
 Rhythm-0.045
 Spark-0.044
bird-0.043
Token Lily
Token ID20037
Feature activation-9.56e-04

Pos logit contributions
cious+0.005
 burgers+0.005
 counter+0.005
 oh+0.005
 cooker+0.005
Neg logit contributions
ite-0.008
 superstar-0.008
 til-0.008
bul-0.008
bird-0.008
Token and
Token ID290
Feature activation-1.33e-02

Pos logit contributions
 superstar+0.001
ite+0.001
 til+0.001
bird+0.001
 whist+0.001
Neg logit contributions
cious-0.001
lette-0.001
 burgers-0.001
asting-0.001
ough-0.001
Token Max
Token ID5436
Feature activation-1.36e-01

Pos logit contributions
 superstar+0.019
ite+0.019
bul+0.017
 til+0.017
 flashes+0.017
Neg logit contributions
cious-0.021
 burgers-0.020
 counter-0.020
 oh-0.019
 cooker-0.019
Token met
Token ID1138
Feature activation+5.02e-02

Pos logit contributions
 superstar+0.192
 Spark+0.176
 whist+0.173
ite+0.171
bird+0.165
Neg logit contributions
cious-0.214
lette-0.209
 counter-0.204
 counters-0.200
 cooker-0.198
Token another
Token ID1194
Feature activation-2.37e-01

Pos logit contributions
cious+0.072
 burgers+0.067
lette+0.065
asting+0.065
 cooker+0.064
Neg logit contributions
 superstar-0.079
ite-0.075
 til-0.073
bul-0.073
 flashes-0.072
Token little
Token ID1310
Feature activation+1.67e-01

Pos logit contributions
 superstar+0.413
ite+0.392
 til+0.390
bul+0.383
 flashes+0.381
Neg logit contributions
cious-0.300
 burgers-0.281
asting-0.268
 cooker-0.268
 counter-0.265
Token girl
Token ID2576
Feature activation+4.02e-02

Pos logit contributions
 burgers+0.161
cious+0.159
 cooker+0.151
 counter+0.140
 oy+0.137
Neg logit contributions
inkle-0.327
 superstar-0.324
bul-0.317
-0.316
ller-0.314
Token and
Token ID290
Feature activation-1.33e-02

Pos logit contributions
cious+0.050
 burgers+0.045
lette+0.044
asting+0.043
 counter+0.043
Neg logit contributions
 superstar-0.071
ite-0.067
 flashes-0.067
 til-0.066
bul-0.066
Token her
Token ID607
Feature activation+9.79e-03

Pos logit contributions
 superstar+0.021
ite+0.019
 til+0.019
bird+0.019
 flashes+0.018
Neg logit contributions
cious-0.020
 burgers-0.018
lette-0.018
asting-0.018
 cooker-0.018
Token dog
Token ID3290
Feature activation-2.16e-02

Pos logit contributions
cious+0.013
 burgers+0.012
 food+0.011
 cooker+0.011
 counter+0.011
Neg logit contributions
 superstar-0.016
bul-0.016
ller-0.016
 til-0.016
inkle-0.016
Token,
Token ID11
Feature activation+7.86e-03

Pos logit contributions
 superstar+0.048
ite+0.047
inkle+0.043
bird+0.043
 rider+0.043
Neg logit contributions
cious-0.024
 burgers-0.023
 food-0.022
asting-0.022
 oh-0.021
Token his
Token ID465
Feature activation+3.22e-02

Pos logit contributions
cious+0.010
 burgers+0.010
 oh+0.009
asting+0.009
 counter+0.009
Neg logit contributions
 superstar-0.013
ite-0.013
bird-0.012
 til-0.012
inkle-0.012
Token wings
Token ID12098
Feature activation-1.28e-01

Pos logit contributions
cious+0.036
 food+0.035
 stove+0.034
lette+0.033
 mud+0.033
Neg logit contributions
 superstar-0.059
ays-0.056
ite-0.055
Because-0.054
azz-0.054
Token kept
Token ID4030
Feature activation-1.08e-02

Pos logit contributions
 superstar+0.204
ite+0.188
 til+0.185
 Rhythm+0.181
 flashes+0.180
Neg logit contributions
cious-0.182
asting-0.168
lette-0.166
 burgers-0.165
ough-0.159
Token getting
Token ID1972
Feature activation+6.67e-02

Pos logit contributions
 superstar+0.018
ite+0.017
bird+0.016
ane+0.015
 rider+0.015
Neg logit contributions
 burgers-0.014
asting-0.014
cious-0.014
lette-0.013
 counter-0.013
Token caught
Token ID4978
Feature activation-4.18e-02

Pos logit contributions
asting+0.094
 burgers+0.091
 dirt+0.091
 mud+0.090
 weeds+0.087
Neg logit contributions
 superstar-0.107
-0.097
ite-0.096
icycle-0.094
ane-0.094
Token in
Token ID287
Feature activation+3.33e-02

Pos logit contributions
 superstar+0.050
ite+0.048
bul+0.048
bird+0.048
 til+0.048
Neg logit contributions
cious-0.076
lette-0.068
 burgers-0.068
 cooker-0.068
 counter-0.066
Token the
Token ID262
Feature activation-8.41e-03

Pos logit contributions
cious+0.044
 burgers+0.043
asting+0.043
 counter+0.042
lette+0.041
Neg logit contributions
 superstar-0.056
ite-0.053
 til-0.050
 Rhythm-0.049
 rider-0.048
Token trees
Token ID7150
Feature activation-5.97e-02

Pos logit contributions
 superstar+0.014
 til+0.013
ite+0.013
 Rhythm+0.013
yles+0.012
Neg logit contributions
cious-0.011
 burgers-0.011
lette-0.011
 counter-0.010
asting-0.010
Token.
Token ID13
Feature activation+1.01e-02

Pos logit contributions
 superstar+0.082
ite+0.078
 til+0.075
bird+0.075
bul+0.074
Neg logit contributions
cious-0.098
 burgers-0.091
lette-0.089
 cooker-0.088
asting-0.088
Token Finally
Token ID9461
Feature activation-2.26e-02

Pos logit contributions
cious+0.015
 burgers+0.014
lette+0.014
 counter+0.014
asting+0.014
Neg logit contributions
 superstar-0.015
ite-0.014
 til-0.014
bul-0.013
 flashes-0.013
TokenSure
Token ID19457
Feature activation-7.98e-02

Pos logit contributions
cious+0.284
lette+0.261
 burgers+0.260
asting+0.257
 cooker+0.254
Neg logit contributions
 superstar-0.243
 til-0.232
inkle-0.227
 flashes-0.223
ite-0.220
Token,
Token ID11
Feature activation+6.48e-03

Pos logit contributions
 superstar+0.100
ite+0.093
bird+0.089
 til+0.088
 rider+0.088
Neg logit contributions
cious-0.141
asting-0.136
 burgers-0.134
lette-0.133
 cooker-0.132
Token Lily
Token ID20037
Feature activation+1.41e-03

Pos logit contributions
cious+0.009
lette+0.008
 burgers+0.008
 cooker+0.008
asting+0.008
Neg logit contributions
 superstar-0.011
ite-0.010
bird-0.010
 til-0.010
 unicorn-0.010
Token!
Token ID0
Feature activation-1.36e-02

Pos logit contributions
cious+0.003
 burgers+0.003
asting+0.003
lette+0.003
 counters+0.003
Neg logit contributions
 superstar-0.001
ite-0.001
 rider-0.001
 unicorn-0.001
bird-0.001
Token You
Token ID921
Feature activation+2.48e-02

Pos logit contributions
 superstar+0.023
ite+0.022
bird+0.021
 til+0.021
 Rhythm+0.021
Neg logit contributions
cious-0.018
 burgers-0.017
asting-0.016
lette-0.016
 counter-0.016
Token can
Token ID460
Feature activation-8.47e-02

Pos logit contributions
cious+0.034
lette+0.033
asting+0.032
 cooker+0.032
ough+0.032
Neg logit contributions
 superstar-0.039
 til-0.037
ller-0.036
 flashes-0.036
ite-0.035
Token give
Token ID1577
Feature activation+1.45e-02

Pos logit contributions
 superstar+0.159
 til+0.147
 flashes+0.146
ite+0.142
bul+0.139
Neg logit contributions
cious-0.098
lette-0.095
asting-0.094
ough-0.094
 counter-0.093
Token this
Token ID428
Feature activation+5.81e-02

Pos logit contributions
cious+0.022
asting+0.020
lette+0.019
 cooker+0.019
 burgers+0.019
Neg logit contributions
 superstar-0.021
ite-0.021
bird-0.021
 til-0.020
 unicorn-0.020
Token letter
Token ID3850
Feature activation+6.01e-02

Pos logit contributions
cious+0.100
asting+0.089
 burgers+0.088
lette+0.088
keeping+0.087
Neg logit contributions
 superstar-0.076
ite-0.076
bul-0.073
 unicorn-0.072
 rider-0.070
Token to
Token ID284
Feature activation+1.12e-02

Pos logit contributions
cious+0.090
 burgers+0.084
asting+0.083
lette+0.082
 cooker+0.081
Neg logit contributions
 superstar-0.091
ite-0.087
bird-0.082
 til-0.082
bul-0.081
Token your
Token ID534
Feature activation+8.35e-02

Pos logit contributions
cious+0.015
lette+0.013
 cooker+0.013
 burgers+0.013
asting+0.013
Neg logit contributions
 superstar-0.020
 til-0.018
 Rhythm-0.018
 unicorn-0.018
ite-0.018
Token's
Token ID338
Feature activation+1.90e-01

Pos logit contributions
cious+0.021
 burgers+0.018
lette+0.018
asting+0.018
 cooker+0.017
Neg logit contributions
 superstar-0.042
ite-0.040
 til-0.039
bird-0.038
 flashes-0.038
Token raining
Token ID43079
Feature activation-1.26e-01

Pos logit contributions
cious+0.324
 burgers+0.287
lette+0.287
 cooker+0.285
 counter+0.283
Neg logit contributions
 superstar-0.257
 til-0.239
 flashes-0.238
bul-0.237
ite-0.233
Token because
Token ID780
Feature activation-3.55e-02

Pos logit contributions
 superstar+0.188
ite+0.177
 til+0.172
bul+0.170
 flashes+0.169
Neg logit contributions
cious-0.193
lette-0.176
 burgers-0.176
asting-0.171
 cooker-0.170
Token it
Token ID340
Feature activation+7.57e-03

Pos logit contributions
 superstar+0.055
ite+0.052
 til+0.051
 flashes+0.050
bul+0.050
Neg logit contributions
cious-0.052
 burgers-0.048
lette-0.048
asting-0.047
 cooker-0.046
Token's
Token ID338
Feature activation+1.86e-01

Pos logit contributions
cious+0.010
 burgers+0.009
lette+0.009
asting+0.009
 counter+0.009
Neg logit contributions
 superstar-0.013
ite-0.012
 til-0.012
 flashes-0.012
bird-0.012
Token cloudy
Token ID40026
Feature activation-2.47e-01

Pos logit contributions
cious+0.321
lette+0.288
 burgers+0.285
 counter+0.282
 cooker+0.281
Neg logit contributions
 superstar-0.248
 flashes-0.228
 til-0.228
bul-0.227
ite-0.227
Token outside
Token ID2354
Feature activation+6.98e-02

Pos logit contributions
 superstar+0.321
ite+0.310
bul+0.310
 flashes+0.309
bird+0.297
Neg logit contributions
cious-0.428
 burgers-0.376
lette-0.375
asting-0.365
 cooker-0.364
Token,
Token ID11
Feature activation+7.86e-03

Pos logit contributions
cious+0.117
lette+0.104
 burgers+0.104
 cooker+0.101
asting+0.101
Neg logit contributions
 superstar-0.090
ite-0.088
 flashes-0.086
bird-0.084
bul-0.083
Token and
Token ID290
Feature activation-7.02e-03

Pos logit contributions
cious+0.013
ough+0.012
lette+0.012
keeping+0.012
row+0.012
Neg logit contributions
 superstar-0.010
 flashes-0.010
 til-0.010
 flash-0.009
bul-0.009
Token the
Token ID262
Feature activation-9.57e-03

Pos logit contributions
 superstar+0.011
ite+0.010
 til+0.010
 flashes+0.010
bul+0.010
Neg logit contributions
cious-0.010
lette-0.009
 burgers-0.009
asting-0.009
 cooker-0.009
Token rain
Token ID6290
Feature activation+2.50e-02

Pos logit contributions
 superstar+0.016
ite+0.015
 til+0.015
bird+0.015
 flashes+0.015
Neg logit contributions
cious-0.013
 burgers-0.012
lette-0.012
asting-0.012
 cooker-0.011
Token his
Token ID465
Feature activation+2.72e-02

Pos logit contributions
cious+0.200
lette+0.186
 burgers+0.184
asting+0.180
ough+0.178
Neg logit contributions
 superstar-0.178
 til-0.160
bul-0.158
ite-0.157
 flashes-0.156
Token eyes
Token ID2951
Feature activation-9.01e-02

Pos logit contributions
cious+0.039
lette+0.036
 cooker+0.035
 burgers+0.035
 counter+0.034
Neg logit contributions
 superstar-0.044
ite-0.041
 rider-0.039
 Rhythm-0.039
bird-0.039
Token when
Token ID618
Feature activation+5.31e-02

Pos logit contributions
 superstar+0.141
ite+0.135
 til+0.132
 Rhythm+0.129
 flashes+0.128
Neg logit contributions
cious-0.131
asting-0.121
lette-0.118
 counter-0.117
 burgers-0.117
Token the
Token ID262
Feature activation-1.33e-02

Pos logit contributions
cious+0.093
lette+0.087
asting+0.086
 burgers+0.085
ough+0.084
Neg logit contributions
 superstar-0.068
ite-0.061
 til-0.060
 flashes-0.060
bird-0.059
Token star
Token ID3491
Feature activation-3.27e-01

Pos logit contributions
 superstar+0.017
ite+0.016
 til+0.016
 Rhythm+0.016
+0.015
Neg logit contributions
cious-0.022
 cooker-0.021
 counter-0.021
lette-0.021
 burgers-0.021
Token suddenly
Token ID6451
Feature activation-2.67e-01

Pos logit contributions
 superstar+0.502
 Rhythm+0.474
 til+0.473
ite+0.459
Zero+0.451
Neg logit contributions
cious-0.467
lette-0.429
asting-0.419
 burgers-0.408
 cooker-0.408
Token seemed
Token ID3947
Feature activation-1.35e-01

Pos logit contributions
 superstar+0.376
ite+0.350
 til+0.342
 flashes+0.333
bird+0.333
Neg logit contributions
cious-0.429
 burgers-0.401
lette-0.397
asting-0.393
 cooker-0.387
Token to
Token ID284
Feature activation+1.08e-02

Pos logit contributions
 superstar+0.214
ite+0.204
 til+0.196
bird+0.193
 flashes+0.191
Neg logit contributions
cious-0.193
 burgers-0.178
asting-0.177
lette-0.177
 cooker-0.172
Token move
Token ID1445
Feature activation-1.62e-02

Pos logit contributions
cious+0.013
asting+0.013
lette+0.012
 burgers+0.011
 counter+0.011
Neg logit contributions
 superstar-0.019
ite-0.019
 Rhythm-0.017
 rider-0.017
 til-0.017
Token.
Token ID13
Feature activation+5.33e-03

Pos logit contributions
 superstar+0.026
ite+0.025
 til+0.024
 Rhythm+0.024
 flashes+0.023
Neg logit contributions
cious-0.022
asting-0.021
 burgers-0.021
lette-0.020
 counter-0.020
Token He
Token ID679
Feature activation-1.03e-02

Pos logit contributions
cious+0.010
asting+0.009
lette+0.009
 burgers+0.009
row+0.009
Neg logit contributions
 superstar-0.007
ite-0.006
bird-0.006
-0.006
 Zero-0.006
Token bear
Token ID6842
Feature activation-9.08e-02

Pos logit contributions
ays+0.015
azz+0.014
ite+0.014
 superstar+0.014
ane+0.014
Neg logit contributions
 food-0.010
 counter-0.009
 mud-0.009
 dirt-0.009
 burgers-0.009
Token was
Token ID373
Feature activation-3.89e-03

Pos logit contributions
 superstar+0.147
 flashes+0.138
ite+0.137
 til+0.134
 Rhythm+0.133
Neg logit contributions
cious-0.128
lette-0.117
asting-0.114
 burgers-0.114
ough-0.110
Token never
Token ID1239
Feature activation+7.18e-03

Pos logit contributions
 superstar+0.006
ite+0.005
 til+0.005
bird+0.005
bul+0.005
Neg logit contributions
cious-0.006
 burgers-0.006
lette-0.006
asting-0.005
 cooker-0.005
Token without
Token ID1231
Feature activation-8.33e-02

Pos logit contributions
cious+0.011
 burgers+0.010
lette+0.010
 cooker+0.010
 counter+0.010
Neg logit contributions
 superstar-0.011
 til-0.010
ite-0.010
bird-0.010
 flashes-0.009
Token gum
Token ID27142
Feature activation-6.90e-02

Pos logit contributions
 superstar+0.122
 til+0.112
ite+0.109
 flashes+0.108
bul+0.108
Neg logit contributions
cious-0.130
lette-0.119
 burgers-0.118
asting-0.117
 cooker-0.117
Token again
Token ID757
Feature activation-2.82e-01

Pos logit contributions
 superstar+0.093
ite+0.086
 til+0.082
bird+0.081
bul+0.080
Neg logit contributions
cious-0.116
 burgers-0.108
lette-0.107
asting-0.104
 cooker-0.104
Token!
Token ID0
Feature activation-6.16e-03

Pos logit contributions
 superstar+0.408
ite+0.380
 til+0.372
bird+0.363
bul+0.360
Neg logit contributions
cious-0.444
 burgers-0.415
lette-0.410
asting-0.406
 cooker-0.403
Token<|endoftext|>
Token ID50256
Feature activation+8.91e-02

Pos logit contributions
 superstar+0.009
 til+0.009
ite+0.009
 flashes+0.009
bul+0.008
Neg logit contributions
cious-0.009
lette-0.008
 burgers-0.008
asting-0.008
 counter-0.008
TokenOnce
Token ID7454
Feature activation-1.78e-03

Pos logit contributions
cious+0.147
 burgers+0.136
asting+0.135
lette+0.135
 counter+0.133
Neg logit contributions
 superstar-0.122
ite-0.112
 til-0.110
bul-0.108
 flashes-0.108
Token upon
Token ID2402
Feature activation-5.24e-02

Pos logit contributions
 superstar+0.003
 til+0.002
 flashes+0.002
bul+0.002
ite+0.002
Neg logit contributions
cious-0.003
asting-0.002
 burgers-0.002
 cooker-0.002
 counter-0.002
Token a
Token ID257
Feature activation-2.93e-03

Pos logit contributions
 superstar+0.069
 til+0.062
bird+0.061
ite+0.060
 flashes+0.060
Neg logit contributions
cious-0.088
 cooker-0.084
 burgers-0.083
lette-0.083
 counter-0.081
Token dad
Token ID9955
Feature activation+1.04e-01

Pos logit contributions
 superstar+0.002
 til+0.002
inkle+0.002
+0.002
ite+0.002
Neg logit contributions
 burgers-0.001
 counter-0.001
cious-0.001
 cooker-0.001
 counters-0.001
Token carried
Token ID5281
Feature activation+2.71e-02

Pos logit contributions
cious+0.121
 counter+0.114
 burgers+0.114
asting+0.113
lette+0.109
Neg logit contributions
 superstar-0.184
ite-0.171
-0.171
 Rhythm-0.171
 flashes-0.167
Token the
Token ID262
Feature activation-4.70e-03

Pos logit contributions
cious+0.045
lette+0.041
 cooker+0.040
asting+0.040
 burgers+0.040
Neg logit contributions
 superstar-0.036
ite-0.034
bird-0.034
 til-0.034
 Rhythm-0.033
Token suit
Token ID6050
Feature activation+4.52e-02

Pos logit contributions
 superstar+0.007
 til+0.006
ite+0.006
 Rhythm+0.006
 flashes+0.006
Neg logit contributions
cious-0.007
 burgers-0.007
 counter-0.007
 cooker-0.007
lette-0.007
Tokencases
Token ID33964
Feature activation+3.29e-03

Pos logit contributions
cious+0.059
 burgers+0.054
lette+0.054
asting+0.054
 counter+0.054
Neg logit contributions
 superstar-0.077
 til-0.072
 flashes-0.070
ite-0.070
 Rhythm-0.070
Token again
Token ID757
Feature activation-2.85e-01

Pos logit contributions
cious+0.004
 counter+0.004
asting+0.004
 burgers+0.004
lette+0.004
Neg logit contributions
 superstar-0.005
 til-0.005
ite-0.005
 flashes-0.005
 Rhythm-0.005
Token.
Token ID13
Feature activation+5.13e-03

Pos logit contributions
 superstar+0.361
ite+0.346
 til+0.333
bird+0.324
bul+0.320
Neg logit contributions
cious-0.488
 burgers-0.463
lette-0.458
 cooker-0.454
asting-0.449
Token They
Token ID1119
Feature activation+6.69e-02

Pos logit contributions
cious+0.010
lette+0.009
asting+0.009
row+0.009
 burgers+0.009
Neg logit contributions
 superstar-0.006
ite-0.006
 Rhythm-0.005
 Spark-0.005
 til-0.005
Token had
Token ID550
Feature activation+4.65e-02

Pos logit contributions
cious+0.087
 burgers+0.079
asting+0.079
lette+0.076
 counter+0.075
Neg logit contributions
 superstar-0.116
ite-0.106
 til-0.105
 flashes-0.105
bul-0.104
Token a
Token ID257
Feature activation-5.34e-03

Pos logit contributions
cious+0.074
 cooker+0.071
lette+0.069
asting+0.068
ough+0.067
Neg logit contributions
 superstar-0.064
 til-0.058
 flashes-0.058
ite-0.058
 smiles-0.057
Token fun
Token ID1257
Feature activation-9.36e-02

Pos logit contributions
 superstar+0.008
ite+0.008
 til+0.008
bird+0.008
bul+0.008
Neg logit contributions
cious-0.007
 burgers-0.007
 counter-0.007
asting-0.007
 cooker-0.007
Token.
Token ID13
Feature activation-1.67e-02

Pos logit contributions
cious+0.151
 burgers+0.138
lette+0.138
asting+0.137
 cooker+0.136
Neg logit contributions
 superstar-0.111
ite-0.100
 til-0.098
bird-0.096
 rider-0.094
Token And
Token ID843
Feature activation+4.32e-02

Pos logit contributions
 superstar+0.025
ite+0.024
bird+0.023
 til+0.023
 Rhythm+0.022
Neg logit contributions
cious-0.026
asting-0.024
 burgers-0.024
lette-0.023
 counter-0.023
Token I
Token ID314
Feature activation+3.42e-02

Pos logit contributions
cious+0.066
asting+0.062
lette+0.061
 burgers+0.061
 counters+0.059
Neg logit contributions
 superstar-0.064
ite-0.060
 til-0.059
 rider-0.057
 Rhythm-0.057
Token love
Token ID1842
Feature activation-1.19e-01

Pos logit contributions
cious+0.048
asting+0.045
lette+0.045
 burgers+0.043
 cooker+0.042
Neg logit contributions
ite-0.053
 superstar-0.052
bul-0.051
 flashes-0.051
ller-0.051
Token the
Token ID262
Feature activation-1.77e-02

Pos logit contributions
 superstar+0.232
ite+0.219
 til+0.213
bul+0.209
bird+0.209
Neg logit contributions
cious-0.133
 burgers-0.125
lette-0.118
 counter-0.117
asting-0.116
Token stars
Token ID5788
Feature activation-2.93e-01

Pos logit contributions
 superstar+0.028
ite+0.027
 til+0.027
ller+0.026
bul+0.026
Neg logit contributions
 burgers-0.024
cious-0.024
 counter-0.023
asting-0.022
lette-0.022
Token.
Token ID13
Feature activation-2.24e-02

Pos logit contributions
 superstar+0.454
ite+0.426
 til+0.415
bird+0.407
 flashes+0.405
Neg logit contributions
cious-0.432
 burgers-0.399
lette-0.395
asting-0.391
 cooker-0.385
Token But
Token ID887
Feature activation-4.09e-02

Pos logit contributions
 superstar+0.037
ite+0.035
 til+0.034
bird+0.033
 flashes+0.033
Neg logit contributions
cious-0.031
 burgers-0.028
lette-0.028
asting-0.027
 cooker-0.027
Token I
Token ID314
Feature activation+3.22e-02

Pos logit contributions
 superstar+0.069
ite+0.065
 til+0.063
bird+0.062
 flashes+0.062
Neg logit contributions
cious-0.054
 burgers-0.051
lette-0.050
asting-0.049
 counter-0.049
Token don
Token ID836
Feature activation-8.83e-03

Pos logit contributions
cious+0.040
asting+0.037
lette+0.037
ocado+0.036
 burgers+0.035
Neg logit contributions
 superstar-0.056
bul-0.055
ite-0.054
 unicorn-0.053
 flashes-0.053
Token't
Token ID470
Feature activation-2.26e-01

Pos logit contributions
 superstar+0.009
 til+0.008
ite+0.008
bul+0.007
 flashes+0.007
Neg logit contributions
cious-0.017
asting-0.017
 counter-0.017
 burgers-0.017
ough-0.016