TOP ACTIVATIONS
MAX = 0.074

elephant went to the river bank and he noticed something strange
were walking near the river bank near the arrow . The
village , there were two groups of friends . They liked
Standing on the river bank was a duck . The
who lived in a small factory . He had been stirring
bird saw two squirrel s quarrel over a nut . The
, there was a gloomy factory . The factory was old
. In the factory , Tom met a friendly
came crashing onto the river bank . It was so big
day they had a big argument and the husband said ,
apping against the creek 's banks . He listened to the
a raft near the river bank and she thought it was
appeared and destroyed the river bank ! Water was everywhere and
day , a big number war happened . In this war
them in their pig gy banks . They think of what
, Jack and Jill were arguing about who should patch the
you shouldn 't run after ambul ances . It 's very
gray sand of the river bank . The sign said "
a gloomy factory . The factory was old and tired .
monkey and rabbit began to quarrel about who was bra ver

MIN ACTIVATIONS
MIN = -0.070

, " Is that your coc oon , butterfly ?" The
you want to see my coc oon , L ila ?"
, just like the z ig zag . <|endoftext|> Once there
't want to be a coc oon anymore !"
, sun ! Pour your rays around the sky and give
to eat leaves and make coc oons . Maybe one day
a cookie or a l oll ipop . Tom
there is nothing there . Isn 't that interesting ?"
cater pillar will make a coc oon . Then it will
, it will make a coc oon and turn into a
will not touch the ash tr ay again ," Tim says
and c umin and cor i ander and card am om
down and enjoyed her popcorn coc oon . <|endoftext|> Once upon
too heavy for you ? Well , I think you 're
, he emerged from the coc oon as a beautiful butterfly
you where I make my coc oon ." L
, he spun a special coc oon and went to sleep
passes so grace fully ? Isn 't it amazing ?"
." L ila touched the coc oon and felt its texture
should never touch an ash tr ay ." S

INTERVAL 0.038 - 0.074
CONTAINS 0.352%

day , Lily and her family went to the park to
left it here and did not throw it away . We
, and Lily was a bit older . They were very
a time , there were two friends , Jack and Jill
The little girl did not know what to do with

INTERVAL 0.002 - 0.038
CONTAINS 43.190%

a time there was a cup . The cup was very
friendly knight appeared and showed them around the castle . Lily
The big , scary fish chased them , but Nem o
could stay safe until she wanted to play again .
She is too busy cutting . She does not see that

INTERVAL -0.034 - 0.002
CONTAINS 55.876%

it changed . Her mom my explained that the restaurant got
getting teased , so she flew over to try and help
the naughty boy . From then on , the naughty boy
. Ben followed Lily . He was still scared
smiled and said , " That sounds great , Dad .

INTERVAL -0.070 - -0.034
CONTAINS 0.583%

. She always remembered to enjoy the little things with gratitude
. Joy knew that she couldn 't do it on her
careful it won 't vanish again ." So Lucy
tree would sway back and forth , while the small bush
a wonderful thing . He stood there , ent ran ced
Token elephant
Token ID20950
Feature activation+5.96e-03

Pos logit contributions
ances+0.007
glers+0.006
 sizes+0.006
boxes+0.006
ales+0.005
Neg logit contributions
 again-0.005
 once-0.005
<|endoftext|>-0.005
 Will-0.005
 happily-0.005
Token went
Token ID1816
Feature activation+5.26e-03

Pos logit contributions
ances+0.013
glers+0.012
 sizes+0.011
cut+0.011
boxes+0.011
Neg logit contributions
 again-0.006
 blast-0.006
<|endoftext|>-0.006
 once-0.006
 Will-0.006
Token to
Token ID284
Feature activation-1.21e-02

Pos logit contributions
ances+0.014
glers+0.012
 sizes+0.012
boxes+0.012
cut+0.012
Neg logit contributions
 again-0.004
 once-0.004
 blast-0.004
<|endoftext|>-0.004
 Will-0.004
Token the
Token ID262
Feature activation+9.19e-03

Pos logit contributions
 again+0.013
ig+0.013
 once+0.011
 finished+0.011
 happily+0.011
Neg logit contributions
ances-0.025
bank-0.022
 barrels-0.022
glers-0.022
 crates-0.022
Token river
Token ID7850
Feature activation+2.68e-02

Pos logit contributions
ances+0.017
glers+0.015
 sizes+0.014
boxes+0.014
cut+0.014
Neg logit contributions
 again-0.015
 once-0.014
<|endoftext|>-0.013
 Will-0.013
 happily-0.013
Tokenbank
Token ID17796
Feature activation+6.57e-02

Pos logit contributions
ances+0.075
glers+0.065
 trigger+0.065
 sizes+0.064
boxes+0.064
Neg logit contributions
 again-0.025
 once-0.020
 happily-0.019
<|endoftext|>-0.018
 finished-0.015
Token and
Token ID290
Feature activation-1.17e-02

Pos logit contributions
ances+0.193
glers+0.177
 trigger+0.175
 sizes+0.172
limits+0.172
Neg logit contributions
 again-0.053
 once-0.040
 happily-0.039
 and-0.026
<|endoftext|>-0.026
Token he
Token ID339
Feature activation+5.93e-03

Pos logit contributions
 again+0.013
<|endoftext|>+0.011
 once+0.011
 Will+0.011
 happily+0.011
Neg logit contributions
ances-0.029
glers-0.025
 sizes-0.025
boxes-0.024
ales-0.023
Token noticed
Token ID6810
Feature activation-4.97e-03

Pos logit contributions
ances+0.013
glers+0.011
 sizes+0.011
cut+0.011
bank+0.011
Neg logit contributions
 again-0.007
 once-0.006
 blast-0.006
<|endoftext|>-0.006
 Will-0.006
Token something
Token ID1223
Feature activation+2.42e-02

Pos logit contributions
 again+0.005
 once+0.004
<|endoftext|>+0.004
 Will+0.003
 happily+0.003
Neg logit contributions
ances-0.014
glers-0.012
 sizes-0.012
boxes-0.012
ales-0.011
Token strange
Token ID6283
Feature activation+8.72e-03

Pos logit contributions
ances+0.049
glers+0.041
 sizes+0.040
boxes+0.040
ales+0.040
Neg logit contributions
 again-0.037
<|endoftext|>-0.034
 once-0.034
 happily-0.033
 Will-0.032
Token were
Token ID547
Feature activation-1.11e-03

Pos logit contributions
ances+0.093
 sizes+0.081
glers+0.081
boxes+0.079
limits+0.078
Neg logit contributions
 again-0.040
 once-0.035
 happily-0.034
<|endoftext|>-0.034
 finished-0.033
Token walking
Token ID6155
Feature activation+6.43e-03

Pos logit contributions
 again+0.001
 Will+0.001
 once+0.001
<|endoftext|>+0.001
 blast+0.001
Neg logit contributions
ances-0.003
glers-0.002
 sizes-0.002
boxes-0.002
cut-0.002
Token near
Token ID1474
Feature activation-4.41e-03

Pos logit contributions
ances+0.017
glers+0.015
 sizes+0.015
boxes+0.014
cut+0.014
Neg logit contributions
 again-0.006
 once-0.005
<|endoftext|>-0.005
 happily-0.005
 Will-0.005
Token the
Token ID262
Feature activation+7.21e-03

Pos logit contributions
ig+0.005
 again+0.005
 finished+0.005
 la+0.005
 imag+0.005
Neg logit contributions
ances-0.009
 cans-0.008
 barrels-0.008
 crates-0.008
 sizes-0.008
Token river
Token ID7850
Feature activation+2.63e-02

Pos logit contributions
ances+0.014
glers+0.012
 sizes+0.011
boxes+0.011
cut+0.011
Neg logit contributions
 again-0.011
 once-0.010
 Will-0.010
<|endoftext|>-0.010
 happily-0.010
Tokenbank
Token ID17796
Feature activation+6.56e-02

Pos logit contributions
ances+0.077
 trigger+0.069
boxes+0.067
rier+0.066
glers+0.065
Neg logit contributions
 again-0.023
 once-0.018
 happily-0.017
<|endoftext|>-0.015
-0.014
Token near
Token ID1474
Feature activation+6.56e-04

Pos logit contributions
ances+0.196
 trigger+0.191
glers+0.180
stones+0.178
boxes+0.178
Neg logit contributions
 again-0.048
 once-0.037
 happily-0.032
 and-0.028
 at-0.025
Token the
Token ID262
Feature activation+8.55e-03

Pos logit contributions
ances+0.002
glers+0.001
 sizes+0.001
 cans+0.001
boxes+0.001
Neg logit contributions
 again-0.001
 Will-0.001
 finished-0.001
 blast-0.001
ig-0.000
Token arrow
Token ID15452
Feature activation+3.47e-03

Pos logit contributions
ances+0.014
boxes+0.011
limits+0.011
cases+0.011
 sizes+0.011
Neg logit contributions
 again-0.017
 once-0.015
<|endoftext|>-0.015
 happily-0.015
 Will-0.015
Token.
Token ID13
Feature activation+3.77e-03

Pos logit contributions
ances+0.009
glers+0.008
 sizes+0.008
boxes+0.008
limits+0.008
Neg logit contributions
 again-0.004
 once-0.003
 happily-0.003
<|endoftext|>-0.003
 Will-0.003
Token The
Token ID383
Feature activation+3.52e-03

Pos logit contributions
ances+0.008
glers+0.007
 sizes+0.007
boxes+0.007
 crates+0.007
Neg logit contributions
 again-0.005
 once-0.004
<|endoftext|>-0.004
 blast-0.004
 finished-0.004
Token village
Token ID7404
Feature activation+4.46e-02

Pos logit contributions
ances+0.009
glers+0.008
 sizes+0.007
boxes+0.007
 cans+0.007
Neg logit contributions
 again-0.007
 once-0.007
 Will-0.007
<|endoftext|>-0.006
 finished-0.006
Token,
Token ID11
Feature activation-7.78e-03

Pos logit contributions
ances+0.128
 trigger+0.115
 sizes+0.114
ales+0.112
glers+0.112
Neg logit contributions
 again-0.031
<|endoftext|>-0.028
 once-0.026
 happily-0.024
 finished-0.022
Token there
Token ID612
Feature activation+1.30e-02

Pos logit contributions
 again+0.011
 once+0.010
<|endoftext|>+0.010
 Will+0.010
 happily+0.010
Neg logit contributions
ances-0.017
glers-0.014
 sizes-0.014
boxes-0.014
ales-0.013
Token were
Token ID547
Feature activation+2.83e-02

Pos logit contributions
ances+0.026
 sizes+0.024
glers+0.024
boxes+0.024
cut+0.023
Neg logit contributions
 again-0.014
 blast-0.014
 rec-0.013
 Will-0.013
 la-0.012
Token two
Token ID734
Feature activation+4.75e-02

Pos logit contributions
ances+0.065
glers+0.055
 sizes+0.055
boxes+0.054
cut+0.053
Neg logit contributions
 again-0.035
 once-0.033
<|endoftext|>-0.033
 Will-0.032
 happily-0.031
Token groups
Token ID2628
Feature activation+6.35e-02

Pos logit contributions
ances+0.113
limits+0.109
bank+0.103
boxes+0.102
cut+0.102
Neg logit contributions
 again-0.048
 happily-0.044
 once-0.041
<|endoftext|>-0.041
 finished-0.040
Token of
Token ID286
Feature activation+8.73e-03

Pos logit contributions
ances+0.168
limits+0.152
 trigger+0.148
rier+0.147
 sizes+0.143
Neg logit contributions
 again-0.056
<|endoftext|>-0.049
 once-0.048
 happily-0.046
-0.042
Token friends
Token ID2460
Feature activation+4.34e-02

Pos logit contributions
ances+0.013
glers+0.010
 sizes+0.010
boxes+0.010
 crates+0.009
Neg logit contributions
 again-0.018
 once-0.016
<|endoftext|>-0.016
 finished-0.016
 blast-0.016
Token.
Token ID13
Feature activation+6.75e-03

Pos logit contributions
ances+0.111
glers+0.099
 sizes+0.097
boxes+0.096
 trigger+0.095
Neg logit contributions
 again-0.041
 once-0.036
<|endoftext|>-0.035
 happily-0.034
 Will-0.034
Token They
Token ID1119
Feature activation+2.52e-02

Pos logit contributions
ances+0.016
cut+0.014
glers+0.014
limits+0.014
rier+0.014
Neg logit contributions
 Will-0.008
 again-0.007
 once-0.007
<|endoftext|>-0.007
 blast-0.007
Token liked
Token ID8288
Feature activation-1.36e-02

Pos logit contributions
ances+0.059
glers+0.051
 sizes+0.051
boxes+0.049
cut+0.048
Neg logit contributions
 again-0.029
 once-0.026
<|endoftext|>-0.026
 Will-0.025
 happily-0.025
Token
Token ID198
Feature activation+7.40e-03

Pos logit contributions
ances+0.002
glers+0.002
cut+0.002
handed+0.002
bank+0.002
Neg logit contributions
 imag-0.001
 again-0.001
 least-0.000
 finished-0.000
 blast-0.000
TokenStanding
Token ID44196
Feature activation+4.91e-03

Pos logit contributions
ances+0.018
glers+0.016
 sizes+0.016
boxes+0.015
ales+0.015
Neg logit contributions
<|endoftext|>-0.009
 again-0.008
 Will-0.007
 happily-0.007
 once-0.007
Token on
Token ID319
Feature activation-2.39e-02

Pos logit contributions
ances+0.013
glers+0.011
boxes+0.011
 sizes+0.011
cut+0.011
Neg logit contributions
 again-0.004
 blast-0.004
 Will-0.004
<|endoftext|>-0.004
 once-0.004
Token the
Token ID262
Feature activation+1.40e-02

Pos logit contributions
 again+0.013
 once+0.011
<|endoftext|>+0.009
 happily+0.009
 Will+0.008
Neg logit contributions
ances-0.073
glers-0.065
 sizes-0.065
ales-0.063
boxes-0.063
Token river
Token ID7850
Feature activation+2.59e-02

Pos logit contributions
ances+0.029
 sizes+0.025
limits+0.025
boxes+0.024
cases+0.024
Neg logit contributions
 again-0.021
 least-0.019
 blast-0.018
 finished-0.018
 once-0.018
Tokenbank
Token ID17796
Feature activation+6.33e-02

Pos logit contributions
ances+0.072
 sizes+0.062
glers+0.062
 trigger+0.061
ales+0.061
Neg logit contributions
 again-0.025
 once-0.020
 happily-0.019
 finished-0.018
<|endoftext|>-0.018
Token was
Token ID373
Feature activation+2.29e-03

Pos logit contributions
ances+0.200
 trigger+0.175
ales+0.175
boxes+0.174
glers+0.172
Neg logit contributions
 again-0.042
 once-0.029
 happily-0.027
 finished-0.026
:-0.021
Token a
Token ID257
Feature activation-1.05e-02

Pos logit contributions
ances+0.006
glers+0.005
 sizes+0.005
boxes+0.005
ales+0.005
Neg logit contributions
 again-0.002
 once-0.002
<|endoftext|>-0.002
 finished-0.002
 happily-0.002
Token duck
Token ID22045
Feature activation+1.74e-02

Pos logit contributions
 again+0.017
 once+0.016
 Will+0.016
<|endoftext|>+0.016
 happily+0.015
Neg logit contributions
ances-0.018
glers-0.016
 sizes-0.015
cut-0.015
boxes-0.015
Token.
Token ID13
Feature activation+8.47e-03

Pos logit contributions
ances+0.048
 sizes+0.042
boxes+0.042
limits+0.041
 crates+0.040
Neg logit contributions
 again-0.017
 once-0.014
<|endoftext|>-0.013
 happily-0.013
 finished-0.011
Token The
Token ID383
Feature activation+9.22e-03

Pos logit contributions
ances+0.022
glers+0.019
 sizes+0.019
boxes+0.019
ales+0.018
Neg logit contributions
 again-0.008
 once-0.006
<|endoftext|>-0.006
 happily-0.006
 blast-0.006
Token who
Token ID508
Feature activation-6.22e-03

Pos logit contributions
ances+0.018
glers+0.016
cut+0.016
 sizes+0.016
boxes+0.016
Neg logit contributions
 again-0.009
 blast-0.008
 finished-0.008
 once-0.008
 Will-0.008
Token lived
Token ID5615
Feature activation+1.22e-02

Pos logit contributions
 again+0.006
 once+0.005
<|endoftext|>+0.005
 Will+0.005
 blast+0.005
Neg logit contributions
ances-0.016
glers-0.014
 sizes-0.014
boxes-0.013
cut-0.013
Token in
Token ID287
Feature activation-3.72e-03

Pos logit contributions
ances+0.028
cut+0.026
glers+0.026
boxes+0.025
ters+0.024
Neg logit contributions
 blast-0.011
 again-0.011
 Will-0.011
 finished-0.010
 imag-0.010
Token a
Token ID257
Feature activation-2.97e-03

Pos logit contributions
 again+0.003
 once+0.003
<|endoftext|>+0.002
 happily+0.002
 Will+0.002
Neg logit contributions
ances-0.010
glers-0.009
 sizes-0.009
boxes-0.009
cut-0.008
Token small
Token ID1402
Feature activation+1.27e-02

Pos logit contributions
 again+0.005
 once+0.005
 Will+0.005
 finished+0.004
 blast+0.004
Neg logit contributions
ances-0.005
glers-0.004
 sizes-0.004
cut-0.004
 crates-0.004
Token factory
Token ID8860
Feature activation+6.27e-02

Pos logit contributions
ances+0.027
glers+0.023
 sizes+0.023
boxes+0.022
ales+0.022
Neg logit contributions
 again-0.018
 once-0.016
<|endoftext|>-0.016
 happily-0.015
 Will-0.015
Token.
Token ID13
Feature activation-1.21e-03

Pos logit contributions
ances+0.174
 trigger+0.154
ales+0.150
 sizes+0.149
glers+0.149
Neg logit contributions
 again-0.052
 once-0.042
 happily-0.040
<|endoftext|>-0.039
-0.033
Token He
Token ID679
Feature activation+7.43e-03

Pos logit contributions
 again+0.002
 once+0.001
<|endoftext|>+0.001
 Will+0.001
 happily+0.001
Neg logit contributions
ances-0.003
glers-0.002
 sizes-0.002
boxes-0.002
cut-0.002
Token had
Token ID550
Feature activation-1.68e-03

Pos logit contributions
ances+0.015
glers+0.013
 sizes+0.013
cut+0.013
bank+0.012
Neg logit contributions
 again-0.010
 blast-0.009
ig-0.009
 once-0.009
<|endoftext|>-0.008
Token been
Token ID587
Feature activation+4.65e-03

Pos logit contributions
 again+0.001
<|endoftext|>+0.001
 once+0.001
 blast+0.001
 Will+0.001
Neg logit contributions
ances-0.005
glers-0.005
ales-0.004
 sizes-0.004
 trigger-0.004
Token stirring
Token ID26547
Feature activation-1.92e-03

Pos logit contributions
ances+0.011
glers+0.009
 sizes+0.009
boxes+0.009
ales+0.009
Neg logit contributions
 again-0.005
 once-0.005
<|endoftext|>-0.005
 Will-0.005
 happily-0.005
Token bird
Token ID6512
Feature activation-7.41e-03

Pos logit contributions
 Will+0.013
 again+0.013
 blast+0.013
 finished+0.013
 imag+0.013
Neg logit contributions
ances-0.012
boxes-0.010
glers-0.010
cut-0.010
 sizes-0.009
Token saw
Token ID2497
Feature activation+3.77e-03

Pos logit contributions
 again+0.009
 blast+0.009
 Will+0.008
<|endoftext|>+0.008
 once+0.008
Neg logit contributions
ances-0.016
glers-0.014
 sizes-0.013
cut-0.013
limits-0.013
Token two
Token ID734
Feature activation+4.20e-02

Pos logit contributions
ances+0.009
glers+0.008
 sizes+0.008
 cans+0.008
cut+0.008
Neg logit contributions
 again-0.004
 blast-0.003
 Will-0.003
 happily-0.003
 finished-0.003
Token squirrel
Token ID33039
Feature activation+4.40e-03

Pos logit contributions
ances+0.082
limits+0.070
boxes+0.068
glers+0.068
cut+0.067
Neg logit contributions
 again-0.066
 once-0.060
<|endoftext|>-0.059
 happily-0.059
 finished-0.058
Tokens
Token ID82
Feature activation+3.21e-03

Pos logit contributions
ances+0.011
boxes+0.009
 sizes+0.009
rier+0.009
limits+0.009
Neg logit contributions
 again-0.005
<|endoftext|>-0.005
 once-0.005
 happily-0.005
 Will-0.004
Token quarrel
Token ID41998
Feature activation+6.25e-02

Pos logit contributions
ances+0.008
glers+0.007
 sizes+0.007
limits+0.007
 crates+0.007
Neg logit contributions
 again-0.003
 once-0.003
<|endoftext|>-0.003
 happily-0.003
 finished-0.003
Token over
Token ID625
Feature activation-1.32e-02

Pos logit contributions
ances+0.150
 sizes+0.133
glers+0.133
boxes+0.129
ales+0.126
Neg logit contributions
 again-0.063
 Will-0.058
 blast-0.057
 once-0.056
<|endoftext|>-0.056
Token a
Token ID257
Feature activation-1.45e-03

Pos logit contributions
 again+0.010
 once+0.009
<|endoftext|>+0.009
 Will+0.009
 blast+0.008
Neg logit contributions
ances-0.035
glers-0.031
 sizes-0.031
boxes-0.030
cut-0.030
Token nut
Token ID6701
Feature activation-1.81e-03

Pos logit contributions
 again+0.002
 once+0.002
<|endoftext|>+0.002
 Will+0.002
 happily+0.002
Neg logit contributions
ances-0.003
glers-0.002
 sizes-0.002
boxes-0.002
cut-0.002
Token.
Token ID13
Feature activation+3.54e-03

Pos logit contributions
 again+0.002
 once+0.002
<|endoftext|>+0.002
 happily+0.002
 Will+0.002
Neg logit contributions
ances-0.004
glers-0.004
 sizes-0.004
boxes-0.004
limits-0.004
Token The
Token ID383
Feature activation+9.77e-03

Pos logit contributions
ances+0.010
glers+0.009
 sizes+0.009
boxes+0.009
ales+0.009
Neg logit contributions
 again-0.002
 once-0.002
<|endoftext|>-0.002
 Will-0.002
 happily-0.002
Token,
Token ID11
Feature activation-6.10e-03

Pos logit contributions
ances+0.123
 trigger+0.111
glers+0.110
 sizes+0.109
ales+0.109
Neg logit contributions
 again-0.035
 once-0.030
<|endoftext|>-0.029
 happily-0.027
 finished-0.025
Token there
Token ID612
Feature activation+1.50e-02

Pos logit contributions
 again+0.008
 once+0.008
<|endoftext|>+0.008
 Will+0.007
 happily+0.007
Neg logit contributions
ances-0.013
glers-0.011
 sizes-0.011
boxes-0.011
ales-0.011
Token was
Token ID373
Feature activation+5.27e-03

Pos logit contributions
ances+0.035
glers+0.032
 sizes+0.032
boxes+0.031
cut+0.030
Neg logit contributions
 again-0.014
 blast-0.013
 Will-0.012
 finished-0.011
 happily-0.011
Token a
Token ID257
Feature activation-1.91e-02

Pos logit contributions
ances+0.011
 sizes+0.010
glers+0.010
 cans+0.010
boxes+0.009
Neg logit contributions
 again-0.006
ig-0.006
 happily-0.005
 Will-0.005
 imag-0.005
Token gloomy
Token ID46400
Feature activation+1.14e-02

Pos logit contributions
 again+0.034
 once+0.033
 imag+0.030
 Will+0.030
 finished+0.029
Neg logit contributions
ances-0.030
glers-0.026
cut-0.024
 sizes-0.024
 trigger-0.023
Token factory
Token ID8860
Feature activation+6.24e-02

Pos logit contributions
ances+0.022
glers+0.019
 sizes+0.019
cut+0.018
boxes+0.018
Neg logit contributions
 again-0.017
 Will-0.016
 once-0.015
<|endoftext|>-0.015
 blast-0.015
Token.
Token ID13
Feature activation+1.91e-04

Pos logit contributions
ances+0.180
 trigger+0.163
ales+0.161
limits+0.157
rier+0.156
Neg logit contributions
 again-0.044
 once-0.033
 happily-0.029
 finished-0.027
<|endoftext|>-0.027
Token The
Token ID383
Feature activation+7.39e-03

Pos logit contributions
ances+0.000
glers+0.000
 sizes+0.000
boxes+0.000
cut+0.000
Neg logit contributions
 again-0.000
 once-0.000
<|endoftext|>-0.000
 happily-0.000
 Will-0.000
Token factory
Token ID8860
Feature activation+6.03e-02

Pos logit contributions
ances+0.013
ales+0.011
glers+0.011
 sizes+0.011
stones+0.011
Neg logit contributions
<|endoftext|>-0.012
 again-0.012
 blast-0.012
 once-0.012
 heart-0.011
Token was
Token ID373
Feature activation-4.44e-03

Pos logit contributions
ances+0.177
ales+0.158
glers+0.156
 sizes+0.155
 trigger+0.155
Neg logit contributions
 again-0.039
 once-0.032
 finished-0.029
<|endoftext|>-0.028
 happily-0.027
Token old
Token ID1468
Feature activation+1.50e-02

Pos logit contributions
 again+0.005
 once+0.004
<|endoftext|>+0.004
 Will+0.004
 happily+0.004
Neg logit contributions
ances-0.011
glers-0.009
 sizes-0.009
boxes-0.009
cut-0.009
Token.
Token ID13
Feature activation+1.01e-02

Pos logit contributions
 again+0.009
 Will+0.008
 once+0.008
 blast+0.008
<|endoftext|>+0.008
Neg logit contributions
ances-0.019
glers-0.017
 sizes-0.016
boxes-0.016
cut-0.016
Token
Token ID198
Feature activation+5.02e-03

Pos logit contributions
ances+0.023
glers+0.020
 sizes+0.020
boxes+0.019
cut+0.019
Neg logit contributions
 again-0.012
 once-0.011
<|endoftext|>-0.011
 Will-0.011
 happily-0.011
Token
Token ID198
Feature activation+1.48e-02

Pos logit contributions
ances+0.012
glers+0.011
boxes+0.011
handed+0.010
cut+0.010
Neg logit contributions
 imag-0.004
 again-0.004
 least-0.004
 finished-0.004
 rec-0.003
TokenIn
Token ID818
Feature activation-1.14e-02

Pos logit contributions
ances+0.032
glers+0.027
 sizes+0.027
rier+0.027
cut+0.026
Neg logit contributions
 again-0.020
<|endoftext|>-0.020
 once-0.019
 blast-0.019
 finished-0.018
Token the
Token ID262
Feature activation+1.64e-02

Pos logit contributions
 again+0.008
 once+0.006
 Will+0.006
<|endoftext|>+0.006
 happily+0.006
Neg logit contributions
ances-0.031
glers-0.028
 sizes-0.028
boxes-0.028
cut-0.027
Token factory
Token ID8860
Feature activation+6.23e-02

Pos logit contributions
ances+0.032
 sizes+0.026
rier+0.025
glers+0.025
stones+0.025
Neg logit contributions
<|endoftext|>-0.026
 again-0.026
 blast-0.026
 once-0.025
 finished-0.025
Token,
Token ID11
Feature activation+1.37e-02

Pos logit contributions
ances+0.192
 sizes+0.170
 trigger+0.170
rier+0.165
ales+0.165
Neg logit contributions
 again-0.044
 once-0.033
 finished-0.033
<|endoftext|>-0.028
-0.026
Token Tom
Token ID4186
Feature activation+4.74e-03

Pos logit contributions
ances+0.034
glers+0.030
 sizes+0.030
ales+0.029
bank+0.028
Neg logit contributions
 again-0.015
 once-0.013
<|endoftext|>-0.013
 Will-0.013
 happily-0.012
Token met
Token ID1138
Feature activation+2.64e-02

Pos logit contributions
ances+0.012
glers+0.010
 sizes+0.010
boxes+0.010
cut+0.010
Neg logit contributions
 again-0.005
 once-0.004
<|endoftext|>-0.004
 Will-0.004
 happily-0.004
Token a
Token ID257
Feature activation-1.27e-02

Pos logit contributions
ances+0.070
glers+0.063
 sizes+0.062
boxes+0.061
cut+0.060
Neg logit contributions
 again-0.021
 once-0.018
<|endoftext|>-0.017
 Will-0.017
 happily-0.017
Token friendly
Token ID8030
Feature activation+8.40e-03

Pos logit contributions
 again+0.022
 once+0.021
 Will+0.020
<|endoftext|>+0.019
 happily+0.019
Neg logit contributions
ances-0.021
glers-0.018
 sizes-0.017
cut-0.017
boxes-0.017
Token came
Token ID1625
Feature activation+7.73e-03

Pos logit contributions
 again+0.001
 once+0.001
<|endoftext|>+0.001
 Will+0.001
 happily+0.001
Neg logit contributions
ances-0.002
glers-0.001
 sizes-0.001
boxes-0.001
cut-0.001
Token crashing
Token ID21899
Feature activation-2.66e-03

Pos logit contributions
ances+0.019
glers+0.017
 sizes+0.016
boxes+0.016
cut+0.016
Neg logit contributions
 again-0.008
<|endoftext|>-0.007
 once-0.007
 Will-0.007
 blast-0.007
Token onto
Token ID4291
Feature activation-7.93e-03

Pos logit contributions
 again+0.002
 once+0.002
<|endoftext|>+0.002
 Will+0.002
 happily+0.002
Neg logit contributions
ances-0.007
glers-0.006
 sizes-0.006
boxes-0.006
cut-0.006
Token the
Token ID262
Feature activation+1.96e-02

Pos logit contributions
 again+0.007
 finished+0.006
 once+0.006
 happily+0.006
<|endoftext|>+0.006
Neg logit contributions
ances-0.020
glers-0.018
boxes-0.017
 sizes-0.017
 trigger-0.017
Token river
Token ID7850
Feature activation+2.38e-02

Pos logit contributions
ances+0.039
glers+0.032
boxes+0.032
 sizes+0.032
limits+0.031
Neg logit contributions
 again-0.032
 once-0.029
<|endoftext|>-0.028
 finished-0.028
 Will-0.027
Tokenbank
Token ID17796
Feature activation+6.20e-02

Pos logit contributions
ances+0.067
glers+0.057
boxes+0.057
 sizes+0.057
 trigger+0.056
Neg logit contributions
 again-0.023
 once-0.017
 happily-0.016
<|endoftext|>-0.015
 finished-0.014
Token.
Token ID13
Feature activation-2.25e-03

Pos logit contributions
ances+0.185
glers+0.169
limits+0.167
 trigger+0.166
 redd+0.165
Neg logit contributions
 again-0.048
 once-0.036
 happily-0.028
 and-0.026
-0.019
Token It
Token ID632
Feature activation+8.48e-03

Pos logit contributions
 again+0.003
 once+0.002
<|endoftext|>+0.002
 Will+0.002
 happily+0.002
Neg logit contributions
ances-0.005
glers-0.005
 sizes-0.005
boxes-0.004
cut-0.004
Token was
Token ID373
Feature activation+1.08e-03

Pos logit contributions
ances+0.021
glers+0.019
 sizes+0.019
cut+0.018
boxes+0.018
Neg logit contributions
 again-0.008
 once-0.007
<|endoftext|>-0.007
 blast-0.006
 Will-0.006
Token so
Token ID523
Feature activation+2.13e-02

Pos logit contributions
ances+0.003
glers+0.002
 sizes+0.002
boxes+0.002
cut+0.002
Neg logit contributions
 again-0.001
 once-0.001
<|endoftext|>-0.001
 Will-0.001
 happily-0.001
Token big
Token ID1263
Feature activation-2.58e-02

Pos logit contributions
ances+0.048
boxes+0.041
ales+0.040
glers+0.039
 sizes+0.039
Neg logit contributions
 again-0.028
<|endoftext|>-0.028
 once-0.027
 happily-0.026
 finished-0.026
Token day
Token ID1110
Feature activation+2.10e-02

Pos logit contributions
ances+0.010
glers+0.008
 sizes+0.008
boxes+0.008
cut+0.008
Neg logit contributions
 again-0.006
 blast-0.005
 imag-0.005
 Will-0.005
 finished-0.005
Token they
Token ID484
Feature activation+2.53e-02

Pos logit contributions
ances+0.052
glers+0.046
 sizes+0.045
boxes+0.045
cut+0.044
Neg logit contributions
 again-0.021
 once-0.018
<|endoftext|>-0.018
 Will-0.017
 happily-0.017
Token had
Token ID550
Feature activation+2.75e-04

Pos logit contributions
ances+0.052
glers+0.046
 sizes+0.043
boxes+0.042
 cans+0.041
Neg logit contributions
 again-0.035
<|endoftext|>-0.032
 once-0.032
 blast-0.031
 Will-0.031
Token a
Token ID257
Feature activation-8.82e-04

Pos logit contributions
ances+0.001
 sizes+0.001
boxes+0.001
glers+0.001
 cans+0.001
Neg logit contributions
 again-0.000
ig-0.000
 Will-0.000
<|endoftext|>-0.000
 happily-0.000
Token big
Token ID1263
Feature activation-4.24e-03

Pos logit contributions
 again+0.002
 once+0.001
 Will+0.001
 Tw+0.001
<|endoftext|>+0.001
Neg logit contributions
ances-0.001
glers-0.001
 trigger-0.001
 sizes-0.001
 barrels-0.001
Token argument
Token ID4578
Feature activation+6.19e-02

Pos logit contributions
 again+0.007
 Will+0.006
<|endoftext|>+0.006
 Tw+0.006
 once+0.006
Neg logit contributions
ances-0.008
glers-0.007
 sizes-0.006
cut-0.006
boxes-0.006
Token and
Token ID290
Feature activation-5.16e-04

Pos logit contributions
ances+0.161
glers+0.143
 sizes+0.142
boxes+0.139
cut+0.137
Neg logit contributions
 again-0.054
 once-0.047
<|endoftext|>-0.046
 Will-0.044
 happily-0.044
Token the
Token ID262
Feature activation+1.64e-02

Pos logit contributions
 again+0.001
 once+0.000
 Will+0.000
<|endoftext|>+0.000
 happily+0.000
Neg logit contributions
ances-0.001
glers-0.001
 sizes-0.001
boxes-0.001
cut-0.001
Token husband
Token ID5229
Feature activation+1.71e-02

Pos logit contributions
ances+0.038
ales+0.034
 sizes+0.034
glers+0.033
limits+0.032
Neg logit contributions
 again-0.020
 once-0.018
<|endoftext|>-0.018
 happily-0.018
 finished-0.017
Token said
Token ID531
Feature activation+1.42e-02

Pos logit contributions
ances+0.041
glers+0.036
 sizes+0.036
boxes+0.035
ales+0.034
Neg logit contributions
 again-0.019
 once-0.017
 happily-0.016
<|endoftext|>-0.016
 Will-0.016
Token,
Token ID11
Feature activation+7.58e-03

Pos logit contributions
ances+0.037
glers+0.033
 sizes+0.033
ales+0.032
boxes+0.032
Neg logit contributions
 again-0.014
 once-0.011
<|endoftext|>-0.011
 Will-0.011
 happily-0.011
Tokenapping
Token ID5912
Feature activation+8.89e-03

Pos logit contributions
 again+0.025
 once+0.023
<|endoftext|>+0.023
 happily+0.022
 Will+0.022
Neg logit contributions
ances-0.029
 sizes-0.024
glers-0.024
boxes-0.023
limits-0.023
Token against
Token ID1028
Feature activation-4.26e-03

Pos logit contributions
ances+0.022
glers+0.020
limits+0.019
 sizes+0.019
ales+0.019
Neg logit contributions
 again-0.009
 once-0.008
 happily-0.008
<|endoftext|>-0.007
 Will-0.007
Token the
Token ID262
Feature activation+1.57e-02

Pos logit contributions
 again+0.004
 once+0.003
<|endoftext|>+0.003
 finished+0.003
 blast+0.003
Neg logit contributions
ances-0.011
 sizes-0.009
glers-0.009
boxes-0.009
ales-0.009
Token creek
Token ID39459
Feature activation+1.83e-02

Pos logit contributions
ances+0.028
glers+0.023
boxes+0.023
 sizes+0.022
limits+0.022
Neg logit contributions
 again-0.027
<|endoftext|>-0.025
 once-0.025
 Will-0.025
 happily-0.024
Token's
Token ID338
Feature activation+8.79e-03

Pos logit contributions
ances+0.050
glers+0.043
limits+0.043
 trigger+0.042
ales+0.042
Neg logit contributions
 again-0.017
 once-0.013
 happily-0.013
<|endoftext|>-0.012
 Will-0.010
Token banks
Token ID6341
Feature activation+6.18e-02

Pos logit contributions
ances+0.017
glers+0.015
 sizes+0.014
boxes+0.014
limits+0.014
Neg logit contributions
 again-0.013
 once-0.012
<|endoftext|>-0.012
 Will-0.012
 happily-0.012
Token.
Token ID13
Feature activation+5.87e-03

Pos logit contributions
ances+0.176
limits+0.173
 trigger+0.173
trained+0.165
cut+0.163
Neg logit contributions
 again-0.038
 once-0.027
-0.020
<|endoftext|>-0.020
 and-0.019
Token He
Token ID679
Feature activation+5.63e-03

Pos logit contributions
ances+0.011
 sizes+0.009
glers+0.009
boxes+0.009
 crates+0.009
Neg logit contributions
 again-0.009
 blast-0.009
 finished-0.008
 once-0.008
<|endoftext|>-0.008
Token listened
Token ID16399
Feature activation-7.81e-04

Pos logit contributions
ances+0.010
glers+0.009
 sizes+0.009
cases+0.009
 crates+0.009
Neg logit contributions
 again-0.007
 once-0.007
 blast-0.007
<|endoftext|>-0.007
 least-0.007
Token to
Token ID284
Feature activation-6.59e-03

Pos logit contributions
 again+0.001
 blast+0.001
<|endoftext|>+0.001
 once+0.001
 Will+0.001
Neg logit contributions
ances-0.002
 sizes-0.002
glers-0.002
boxes-0.002
cut-0.002
Token the
Token ID262
Feature activation+7.36e-03

Pos logit contributions
 again+0.006
 once+0.005
<|endoftext|>+0.005
 finished+0.005
 Will+0.005
Neg logit contributions
ances-0.017
 sizes-0.015
glers-0.015
boxes-0.014
ales-0.014
Token a
Token ID257
Feature activation-1.97e-02

Pos logit contributions
 again+0.002
<|endoftext|>+0.002
 once+0.002
 Will+0.002
 happily+0.002
Neg logit contributions
ances-0.008
glers-0.007
 sizes-0.007
boxes-0.007
ales-0.007
Token raft
Token ID31812
Feature activation+2.38e-02

Pos logit contributions
 again+0.032
 once+0.030
 Will+0.029
<|endoftext|>+0.029
 happily+0.029
Neg logit contributions
ances-0.035
glers-0.031
 sizes-0.029
cut-0.029
boxes-0.028
Token near
Token ID1474
Feature activation+5.56e-03

Pos logit contributions
ances+0.064
 sizes+0.056
limits+0.054
 crates+0.053
boxes+0.053
Neg logit contributions
 again-0.025
 once-0.022
<|endoftext|>-0.020
 happily-0.019
 finished-0.017
Token the
Token ID262
Feature activation+1.27e-02

Pos logit contributions
ances+0.014
glers+0.012
 sizes+0.012
boxes+0.012
ales+0.012
Neg logit contributions
 again-0.005
 Will-0.005
 blast-0.004
 finished-0.004
 once-0.004
Token river
Token ID7850
Feature activation+2.51e-02

Pos logit contributions
ances+0.022
 sizes+0.017
glers+0.017
cases+0.017
limits+0.017
Neg logit contributions
 again-0.023
<|endoftext|>-0.022
 once-0.022
 blast-0.021
 finished-0.021
Tokenbank
Token ID17796
Feature activation+6.17e-02

Pos logit contributions
ances+0.073
 trigger+0.063
cases+0.062
stones+0.062
boxes+0.062
Neg logit contributions
 again-0.021
 once-0.017
<|endoftext|>-0.015
 happily-0.014
-0.013
Token and
Token ID290
Feature activation-1.66e-02

Pos logit contributions
ances+0.180
 trigger+0.167
glers+0.165
stones+0.165
limits+0.164
Neg logit contributions
 again-0.042
 once-0.034
 happily-0.026
 and-0.026
 for-0.024
Token she
Token ID673
Feature activation-7.58e-03

Pos logit contributions
 again+0.021
<|endoftext|>+0.019
 Will+0.019
 happily+0.019
 once+0.019
Neg logit contributions
ances-0.038
glers-0.033
boxes-0.032
 sizes-0.032
ales-0.031
Token thought
Token ID1807
Feature activation-2.09e-03

Pos logit contributions
 again+0.011
 once+0.010
 blast+0.010
 Will+0.010
<|endoftext|>+0.009
Neg logit contributions
ances-0.015
glers-0.013
 sizes-0.013
cut-0.012
boxes-0.012
Token it
Token ID340
Feature activation-5.13e-03

Pos logit contributions
 again+0.002
 Will+0.001
 once+0.001
 blast+0.001
<|endoftext|>+0.001
Neg logit contributions
ances-0.005
 sizes-0.005
glers-0.005
boxes-0.005
cut-0.005
Token was
Token ID373
Feature activation+4.64e-03

Pos logit contributions
 again+0.004
 once+0.003
<|endoftext|>+0.003
 Will+0.003
 happily+0.003
Neg logit contributions
ances-0.014
glers-0.013
 sizes-0.013
boxes-0.012
ales-0.012
Token appeared
Token ID4120
Feature activation-9.80e-03

Pos logit contributions
 again+0.003
 once+0.003
<|endoftext|>+0.003
 Will+0.003
 happily+0.003
Neg logit contributions
ances-0.007
glers-0.006
 sizes-0.006
boxes-0.006
cut-0.006
Token and
Token ID290
Feature activation-1.86e-02

Pos logit contributions
 again+0.009
 Will+0.008
<|endoftext|>+0.008
 blast+0.008
 once+0.008
Neg logit contributions
ances-0.024
glers-0.021
 sizes-0.021
boxes-0.021
ales-0.020
Token destroyed
Token ID6572
Feature activation+2.24e-02

Pos logit contributions
 again+0.023
 Will+0.021
<|endoftext|>+0.021
 once+0.021
 happily+0.020
Neg logit contributions
ances-0.044
 sizes-0.037
glers-0.037
boxes-0.036
ales-0.035
Token the
Token ID262
Feature activation+6.17e-03

Pos logit contributions
ances+0.059
glers+0.053
 sizes+0.052
boxes+0.051
cut+0.050
Neg logit contributions
 again-0.019
 once-0.016
<|endoftext|>-0.016
 Will-0.015
 happily-0.015
Token river
Token ID7850
Feature activation+2.33e-02

Pos logit contributions
ances+0.013
cases+0.011
boxes+0.011
 sizes+0.011
glers+0.011
Neg logit contributions
 again-0.009
 once-0.009
<|endoftext|>-0.008
 finished-0.008
 whist-0.008
Tokenbank
Token ID17796
Feature activation+6.14e-02

Pos logit contributions
ances+0.067
glers+0.056
ales+0.055
boxes+0.055
 sizes+0.055
Neg logit contributions
 again-0.023
 once-0.018
 happily-0.016
<|endoftext|>-0.015
 finished-0.014
Token!
Token ID0
Feature activation-3.81e-03

Pos logit contributions
ances+0.188
glers+0.164
limits+0.164
 redd+0.163
cut+0.163
Neg logit contributions
 again-0.049
 once-0.035
 happily-0.029
 finished-0.022
 and-0.021
Token Water
Token ID5638
Feature activation+1.22e-02

Pos logit contributions
 again+0.003
 once+0.003
<|endoftext|>+0.003
 happily+0.003
 Will+0.003
Neg logit contributions
ances-0.010
glers-0.009
 sizes-0.009
boxes-0.009
cut-0.008
Token was
Token ID373
Feature activation-9.76e-03

Pos logit contributions
ances+0.032
 sizes+0.028
glers+0.028
boxes+0.027
ales+0.027
Neg logit contributions
 again-0.012
 once-0.010
 finished-0.010
 happily-0.009
<|endoftext|>-0.009
Token everywhere
Token ID8347
Feature activation+2.56e-03

Pos logit contributions
 again+0.014
 once+0.012
<|endoftext|>+0.012
 happily+0.012
 finished+0.012
Neg logit contributions
ances-0.021
glers-0.018
 sizes-0.018
boxes-0.017
ales-0.017
Token and
Token ID290
Feature activation-1.32e-02

Pos logit contributions
ances+0.006
glers+0.006
 sizes+0.006
boxes+0.006
cut+0.005
Neg logit contributions
 again-0.003
 once-0.002
<|endoftext|>-0.002
 happily-0.002
 Will-0.002
Token day
Token ID1110
Feature activation+1.33e-02

Pos logit contributions
ances+0.017
glers+0.015
 sizes+0.015
iest+0.014
ales+0.014
Neg logit contributions
 again-0.011
 Will-0.010
 blast-0.010
 happily-0.010
 imag-0.009
Token,
Token ID11
Feature activation+6.55e-03

Pos logit contributions
ances+0.034
glers+0.030
 sizes+0.030
boxes+0.029
cut+0.029
Neg logit contributions
 again-0.013
 once-0.011
<|endoftext|>-0.011
 Will-0.011
 happily-0.010
Token a
Token ID257
Feature activation-2.05e-02

Pos logit contributions
ances+0.015
glers+0.013
 sizes+0.013
boxes+0.013
cut+0.013
Neg logit contributions
 again-0.007
 blast-0.006
 finished-0.005
 rec-0.005
 Will-0.005
Token big
Token ID1263
Feature activation-1.46e-02

Pos logit contributions
 again+0.037
 once+0.034
 finished+0.032
 imag+0.031
 least+0.031
Neg logit contributions
ances-0.034
glers-0.028
 sizes-0.026
cut-0.026
 trigger-0.026
Token number
Token ID1271
Feature activation+1.63e-02

Pos logit contributions
 again+0.026
 Will+0.024
 once+0.024
 happily+0.023
<|endoftext|>+0.023
Neg logit contributions
ances-0.021
glers-0.019
cut-0.018
igator-0.018
 barrels-0.017
Token war
Token ID1175
Feature activation+6.14e-02

Pos logit contributions
ances+0.036
glers+0.032
boxes+0.031
 sizes+0.031
cut+0.031
Neg logit contributions
 again-0.019
 Will-0.018
<|endoftext|>-0.017
 once-0.017
 happily-0.017
Token happened
Token ID3022
Feature activation-4.92e-03

Pos logit contributions
ances+0.148
glers+0.132
 sizes+0.131
boxes+0.128
cut+0.126
Neg logit contributions
 again-0.062
 once-0.054
 Will-0.054
<|endoftext|>-0.054
 happily-0.052
Token.
Token ID13
Feature activation+8.80e-03

Pos logit contributions
 again+0.006
 Will+0.005
 blast+0.005
 once+0.005
<|endoftext|>+0.005
Neg logit contributions
ances-0.011
 sizes-0.009
glers-0.009
boxes-0.009
cut-0.009
Token In
Token ID554
Feature activation-1.13e-02

Pos logit contributions
ances+0.023
glers+0.021
 sizes+0.020
boxes+0.020
cut+0.020
Neg logit contributions
 again-0.008
 once-0.006
<|endoftext|>-0.006
 Will-0.006
 happily-0.006
Token this
Token ID428
Feature activation-1.05e-02

Pos logit contributions
 again+0.008
 once+0.006
<|endoftext|>+0.006
 Will+0.006
 blast+0.006
Neg logit contributions
ances-0.032
glers-0.028
 sizes-0.028
boxes-0.028
cut-0.027
Token war
Token ID1175
Feature activation+5.53e-02

Pos logit contributions
 again+0.015
 once+0.014
<|endoftext|>+0.014
 Will+0.014
 happily+0.014
Neg logit contributions
ances-0.021
glers-0.018
 sizes-0.018
boxes-0.017
cut-0.017
Token them
Token ID606
Feature activation+1.12e-02

Pos logit contributions
 again+0.008
 once+0.007
<|endoftext|>+0.007
 Will+0.007
 blast+0.007
Neg logit contributions
ances-0.021
glers-0.018
 sizes-0.018
boxes-0.018
cut-0.017
Token in
Token ID287
Feature activation-4.72e-03

Pos logit contributions
ances+0.031
glers+0.028
 sizes+0.028
boxes+0.027
cut+0.027
Neg logit contributions
 again-0.008
 once-0.007
<|endoftext|>-0.006
 happily-0.006
 Will-0.006
Token their
Token ID511
Feature activation+2.37e-02

Pos logit contributions
 again+0.004
 once+0.004
<|endoftext|>+0.003
 Will+0.003
 happily+0.003
Neg logit contributions
ances-0.012
glers-0.011
 sizes-0.011
boxes-0.011
cut-0.010
Token pig
Token ID12967
Feature activation+1.38e-02

Pos logit contributions
ances+0.033
 cans+0.030
 crates+0.030
 barrels+0.030
boxes+0.028
Neg logit contributions
 again-0.045
 Will-0.044
 once-0.043
 Tw-0.041
Tw-0.041
Tokengy
Token ID1360
Feature activation+2.49e-02

Pos logit contributions
ances+0.025
glers+0.021
 sizes+0.020
 trigger+0.020
rier+0.020
Neg logit contributions
 again-0.024
 once-0.022
 happily-0.022
<|endoftext|>-0.022
 Will-0.020
Token banks
Token ID6341
Feature activation+6.13e-02

Pos logit contributions
ances+0.031
glers+0.023
 sizes+0.022
boxes+0.021
ales+0.020
Neg logit contributions
 again-0.056
 once-0.053
<|endoftext|>-0.053
 happily-0.052
 finished-0.051
Token.
Token ID13
Feature activation+1.87e-02

Pos logit contributions
ances+0.172
 trigger+0.160
stones+0.158
glers+0.156
limits+0.155
Neg logit contributions
 again-0.048
 once-0.037
<|endoftext|>-0.033
-0.031
 happily-0.031
Token They
Token ID1119
Feature activation+2.10e-02

Pos logit contributions
ances+0.026
 cans+0.025
 sizes+0.024
boxes+0.023
cases+0.023
Neg logit contributions
ig-0.033
 again-0.032
 blast-0.031
 least-0.030
 finished-0.030
Token think
Token ID892
Feature activation+7.07e-03

Pos logit contributions
ances+0.049
glers+0.043
 sizes+0.042
boxes+0.041
cut+0.040
Neg logit contributions
 again-0.024
 once-0.022
<|endoftext|>-0.021
 Will-0.021
 happily-0.021
Token of
Token ID286
Feature activation-1.79e-02

Pos logit contributions
ances+0.016
glers+0.014
 sizes+0.014
 cans+0.013
boxes+0.013
Neg logit contributions
 again-0.008
<|endoftext|>-0.007
 blast-0.007
 once-0.007
 happily-0.007
Token what
Token ID644
Feature activation-1.10e-02

Pos logit contributions
 again+0.016
 once+0.014
<|endoftext|>+0.014
 Will+0.014
 happily+0.013
Neg logit contributions
ances-0.046
 sizes-0.040
glers-0.040
boxes-0.039
cut-0.038
Token,
Token ID11
Feature activation+8.41e-03

Pos logit contributions
ances+0.067
glers+0.060
 sizes+0.059
boxes+0.058
cut+0.057
Neg logit contributions
 again-0.023
 once-0.020
<|endoftext|>-0.019
 Will-0.019
 happily-0.019
Token Jack
Token ID3619
Feature activation+5.15e-04

Pos logit contributions
ances+0.021
glers+0.018
 sizes+0.018
cut+0.018
boxes+0.018
Neg logit contributions
 again-0.009
 once-0.008
 Will-0.008
 happily-0.008
<|endoftext|>-0.008
Token and
Token ID290
Feature activation+1.34e-02

Pos logit contributions
ances+0.001
glers+0.001
 sizes+0.001
boxes+0.001
cut+0.001
Neg logit contributions
 again-0.000
 once-0.000
<|endoftext|>-0.000
 Will-0.000
 happily-0.000
Token Jill
Token ID22772
Feature activation+2.23e-02

Pos logit contributions
ances+0.029
glers+0.025
 sizes+0.024
boxes+0.024
limits+0.024
Neg logit contributions
 again-0.018
 once-0.016
<|endoftext|>-0.016
 Will-0.016
 happily-0.016
Token were
Token ID547
Feature activation+7.64e-03

Pos logit contributions
ances+0.049
glers+0.042
 sizes+0.042
boxes+0.041
cut+0.040
Neg logit contributions
 again-0.029
 once-0.026
<|endoftext|>-0.026
 Will-0.026
 happily-0.025
Token arguing
Token ID11810
Feature activation+6.11e-02

Pos logit contributions
ances+0.019
glers+0.017
 sizes+0.016
boxes+0.016
limits+0.016
Neg logit contributions
 again-0.007
<|endoftext|>-0.006
 Will-0.006
 once-0.006
 blast-0.006
Token about
Token ID546
Feature activation-1.66e-02

Pos logit contributions
ances+0.144
glers+0.126
 sizes+0.125
boxes+0.122
cut+0.119
Neg logit contributions
 again-0.066
 Will-0.059
 once-0.059
 blast-0.059
<|endoftext|>-0.058
Token who
Token ID508
Feature activation+5.85e-04

Pos logit contributions
 again+0.015
 once+0.014
 finished+0.014
 blast+0.014
 Will+0.014
Neg logit contributions
ances-0.040
 sizes-0.036
glers-0.035
boxes-0.034
 cans-0.034
Token should
Token ID815
Feature activation+2.27e-02

Pos logit contributions
ances+0.001
glers+0.001
 sizes+0.001
boxes+0.001
ales+0.001
Neg logit contributions
 again-0.001
 once-0.001
 Will-0.000
<|endoftext|>-0.000
 happily-0.000
Token patch
Token ID8529
Feature activation-1.63e-02

Pos logit contributions
ances+0.064
 sizes+0.057
boxes+0.057
glers+0.056
ales+0.055
Neg logit contributions
 again-0.017
<|endoftext|>-0.014
 finished-0.014
 once-0.013
 happily-0.013
Token the
Token ID262
Feature activation+1.93e-02

Pos logit contributions
 again+0.017
 once+0.014
<|endoftext|>+0.013
 happily+0.012
 finished+0.011
Neg logit contributions
ances-0.043
 trigger-0.036
 sizes-0.036
boxes-0.036
limits-0.036
Token you
Token ID345
Feature activation-2.20e-02

Pos logit contributions
 again+0.020
<|endoftext|>+0.019
 once+0.018
 happily+0.018
 blast+0.018
Neg logit contributions
ances-0.037
glers-0.031
boxes-0.031
 sizes-0.031
cut-0.030
Token shouldn
Token ID6584
Feature activation-3.51e-02

Pos logit contributions
 again+0.020
<|endoftext|>+0.017
 happily+0.017
 Will+0.017
 once+0.017
Neg logit contributions
ances-0.053
glers-0.047
 sizes-0.045
 cans-0.045
ales-0.045
Token't
Token ID470
Feature activation+1.09e-02

Pos logit contributions
 again+0.049
 once+0.045
<|endoftext|>+0.045
 Will+0.044
 blast+0.043
Neg logit contributions
ances-0.074
glers-0.063
 sizes-0.063
boxes-0.061
ales-0.060
Token run
Token ID1057
Feature activation-3.57e-04

Pos logit contributions
ances+0.027
 sizes+0.025
glers+0.025
boxes+0.024
 crates+0.023
Neg logit contributions
 again-0.011
 once-0.009
<|endoftext|>-0.009
 Will-0.009
 happily-0.009
Token after
Token ID706
Feature activation-1.77e-02

Pos logit contributions
 again+0.000
 once+0.000
<|endoftext|>+0.000
 happily+0.000
 Will+0.000
Neg logit contributions
ances-0.001
glers-0.001
 sizes-0.001
boxes-0.001
cut-0.001
Token ambul
Token ID18907
Feature activation+6.11e-02

Pos logit contributions
 again+0.017
 once+0.014
<|endoftext|>+0.014
 Will+0.014
 happily+0.014
Neg logit contributions
ances-0.045
glers-0.039
 sizes-0.039
boxes-0.038
ales-0.037
Tokenances
Token ID1817
Feature activation+6.33e-03

Pos logit contributions
glers+0.031
 trigger+0.029
stones+0.027
rier+0.026
 redd+0.025
Neg logit contributions
 again-0.159
 once-0.152
 finished-0.151
<|endoftext|>-0.150
 str-0.148
Token.
Token ID13
Feature activation-1.73e-02

Pos logit contributions
ances+0.016
glers+0.014
 trigger+0.014
 sizes+0.014
boxes+0.014
Neg logit contributions
 again-0.006
 once-0.005
 finished-0.005
 happily-0.005
<|endoftext|>-0.005
Token It
Token ID632
Feature activation-1.06e-02

Pos logit contributions
 again+0.020
 Will+0.018
 once+0.018
<|endoftext|>+0.018
 blast+0.017
Neg logit contributions
ances-0.040
glers-0.036
 sizes-0.034
cut-0.033
boxes-0.033
Token's
Token ID338
Feature activation+2.23e-03

Pos logit contributions
 again+0.008
 once+0.007
 Will+0.006
 finished+0.006
 happily+0.006
Neg logit contributions
ances-0.030
glers-0.026
boxes-0.026
 sizes-0.026
ales-0.026
Token very
Token ID845
Feature activation+3.61e-02

Pos logit contributions
ances+0.005
glers+0.004
 sizes+0.004
boxes+0.004
ales+0.004
Neg logit contributions
 again-0.003
 once-0.002
<|endoftext|>-0.002
 Will-0.002
 happily-0.002
Token gray
Token ID12768
Feature activation+1.53e-02

Pos logit contributions
ances+0.027
glers+0.023
 sizes+0.022
cases+0.021
boxes+0.021
Neg logit contributions
 again-0.026
<|endoftext|>-0.025
 blast-0.025
 once-0.024
 finished-0.024
Token sand
Token ID6450
Feature activation-3.12e-03

Pos logit contributions
ances+0.030
glers+0.026
 sizes+0.025
cases+0.025
limits+0.024
Neg logit contributions
 again-0.024
 blast-0.022
 once-0.022
<|endoftext|>-0.022
-0.020
Token of
Token ID286
Feature activation-9.62e-03

Pos logit contributions
 again+0.003
 once+0.002
 happily+0.002
 finished+0.002
<|endoftext|>+0.002
Neg logit contributions
ances-0.008
limits-0.008
trained-0.007
glers-0.007
 sizes-0.007
Token the
Token ID262
Feature activation+1.12e-02

Pos logit contributions
 again+0.006
 once+0.005
<|endoftext|>+0.005
 Will+0.005
 happily+0.004
Neg logit contributions
ances-0.028
glers-0.025
 sizes-0.024
cases-0.024
limits-0.024
Token river
Token ID7850
Feature activation+2.54e-02

Pos logit contributions
ances+0.020
glers+0.016
 sizes+0.016
boxes+0.016
cases+0.015
Neg logit contributions
 again-0.020
<|endoftext|>-0.019
 blast-0.019
 once-0.018
 finished-0.018
Tokenbank
Token ID17796
Feature activation+6.06e-02

Pos logit contributions
ances+0.072
 trigger+0.063
limits+0.062
glers+0.062
ales+0.062
Neg logit contributions
 again-0.021
 once-0.015
 happily-0.014
<|endoftext|>-0.012
 finished-0.012
Token.
Token ID13
Feature activation-1.26e-03

Pos logit contributions
ances+0.180
 trigger+0.170
glers+0.169
limits+0.168
 redd+0.166
Neg logit contributions
 again-0.037
 once-0.026
 happily-0.020
 for-0.018
 and-0.018
Token The
Token ID383
Feature activation+4.17e-03

Pos logit contributions
 again+0.001
 blast+0.001
 finished+0.001
 imag+0.001
ig+0.001
Neg logit contributions
ances-0.003
bank-0.003
 sizes-0.003
boxes-0.003
glers-0.003
Token sign
Token ID1051
Feature activation+2.84e-02

Pos logit contributions
ances+0.007
glers+0.006
 sizes+0.006
limits+0.006
ales+0.006
Neg logit contributions
 again-0.007
<|endoftext|>-0.007
 finished-0.007
 blast-0.007
 once-0.007
Token said
Token ID531
Feature activation+2.67e-03

Pos logit contributions
ances+0.065
glers+0.058
 sizes+0.057
boxes+0.056
cut+0.056
Neg logit contributions
 again-0.031
 Will-0.028
<|endoftext|>-0.028
 once-0.028
 blast-0.027
Token "
Token ID366
Feature activation-3.02e-03

Pos logit contributions
ances+0.007
glers+0.006
 sizes+0.006
boxes+0.006
ales+0.006
Neg logit contributions
 again-0.003
 once-0.002
<|endoftext|>-0.002
 Will-0.002
 happily-0.002
Token a
Token ID257
Feature activation-1.91e-02

Pos logit contributions
ances+0.011
 sizes+0.010
glers+0.010
 cans+0.010
boxes+0.009
Neg logit contributions
 again-0.006
ig-0.006
 happily-0.005
 Will-0.005
 imag-0.005
Token gloomy
Token ID46400
Feature activation+1.14e-02

Pos logit contributions
 again+0.034
 once+0.033
 imag+0.030
 Will+0.030
 finished+0.029
Neg logit contributions
ances-0.030
glers-0.026
cut-0.024
 sizes-0.024
 trigger-0.023
Token factory
Token ID8860
Feature activation+6.24e-02

Pos logit contributions
ances+0.022
glers+0.019
 sizes+0.019
cut+0.018
boxes+0.018
Neg logit contributions
 again-0.017
 Will-0.016
 once-0.015
<|endoftext|>-0.015
 blast-0.015
Token.
Token ID13
Feature activation+1.91e-04

Pos logit contributions
ances+0.180
 trigger+0.163
ales+0.161
limits+0.157
rier+0.156
Neg logit contributions
 again-0.044
 once-0.033
 happily-0.029
 finished-0.027
<|endoftext|>-0.027
Token The
Token ID383
Feature activation+7.39e-03

Pos logit contributions
ances+0.000
glers+0.000
 sizes+0.000
boxes+0.000
cut+0.000
Neg logit contributions
 again-0.000
 once-0.000
<|endoftext|>-0.000
 happily-0.000
 Will-0.000
Token factory
Token ID8860
Feature activation+6.03e-02

Pos logit contributions
ances+0.013
ales+0.011
glers+0.011
 sizes+0.011
stones+0.011
Neg logit contributions
<|endoftext|>-0.012
 again-0.012
 blast-0.012
 once-0.012
 heart-0.011
Token was
Token ID373
Feature activation-4.44e-03

Pos logit contributions
ances+0.177
ales+0.158
glers+0.156
 sizes+0.155
 trigger+0.155
Neg logit contributions
 again-0.039
 once-0.032
 finished-0.029
<|endoftext|>-0.028
 happily-0.027
Token old
Token ID1468
Feature activation+1.50e-02

Pos logit contributions
 again+0.005
 once+0.004
<|endoftext|>+0.004
 Will+0.004
 happily+0.004
Neg logit contributions
ances-0.011
glers-0.009
 sizes-0.009
boxes-0.009
cut-0.009
Token and
Token ID290
Feature activation-9.50e-03

Pos logit contributions
ances+0.045
glers+0.041
 sizes+0.040
limits+0.039
ales+0.039
Neg logit contributions
 again-0.008
 once-0.006
 happily-0.005
<|endoftext|>-0.005
 Will-0.005
Token tired
Token ID10032
Feature activation+2.96e-02

Pos logit contributions
 again+0.009
<|endoftext|>+0.008
 once+0.008
 finished+0.008
 Will+0.008
Neg logit contributions
ances-0.024
glers-0.021
ales-0.021
 trigger-0.020
boxes-0.020
Token.
Token ID13
Feature activation+3.00e-03

Pos logit contributions
ances+0.079
glers+0.071
cut+0.070
ales+0.069
stones+0.069
Neg logit contributions
 again-0.030
 once-0.024
<|endoftext|>-0.020
 Will-0.020
 happily-0.019
Token monkey
Token ID21657
Feature activation-4.88e-03

Pos logit contributions
 again+0.002
 once+0.002
<|endoftext|>+0.002
 happily+0.002
 finished+0.002
Neg logit contributions
ances-0.003
 sizes-0.003
glers-0.003
limits-0.003
boxes-0.003
Token and
Token ID290
Feature activation+7.73e-03

Pos logit contributions
 again+0.005
<|endoftext|>+0.004
 Will+0.004
 once+0.004
 blast+0.004
Neg logit contributions
ances-0.012
glers-0.011
 sizes-0.010
boxes-0.010
cut-0.010
Token rabbit
Token ID22746
Feature activation+2.52e-02

Pos logit contributions
ances+0.021
glers+0.019
 sizes+0.018
boxes+0.018
cases+0.018
Neg logit contributions
 again-0.006
<|endoftext|>-0.005
 once-0.005
 Will-0.005
 happily-0.005
Token began
Token ID2540
Feature activation+4.71e-03

Pos logit contributions
ances+0.060
glers+0.053
 sizes+0.052
boxes+0.051
cut+0.050
Neg logit contributions
 again-0.027
 once-0.024
<|endoftext|>-0.024
 Will-0.023
 happily-0.023
Token to
Token ID284
Feature activation-3.34e-03

Pos logit contributions
ances+0.011
glers+0.010
bank+0.010
 sizes+0.010
 barrels+0.009
Neg logit contributions
 again-0.004
 blast-0.004
 Will-0.004
 once-0.004
ig-0.004
Token quarrel
Token ID41998
Feature activation+6.03e-02

Pos logit contributions
 again+0.004
<|endoftext|>+0.003
 once+0.003
 Will+0.003
 happily+0.003
Neg logit contributions
ances-0.008
glers-0.007
 sizes-0.007
boxes-0.007
ales-0.007
Token about
Token ID546
Feature activation-1.46e-02

Pos logit contributions
ances+0.136
glers+0.120
 sizes+0.120
boxes+0.117
ales+0.113
Neg logit contributions
 again-0.064
 blast-0.062
 Will-0.061
 finished-0.059
 once-0.059
Token who
Token ID508
Feature activation-1.90e-03

Pos logit contributions
 again+0.012
 once+0.010
<|endoftext|>+0.010
 Will+0.010
 happily+0.010
Neg logit contributions
ances-0.039
glers-0.034
 sizes-0.034
boxes-0.033
cut-0.033
Token was
Token ID373
Feature activation+3.59e-04

Pos logit contributions
 again+0.002
 once+0.002
 Will+0.002
 happily+0.002
<|endoftext|>+0.002
Neg logit contributions
ances-0.005
glers-0.004
 sizes-0.004
boxes-0.004
ales-0.004
Token bra
Token ID8290
Feature activation-1.64e-02

Pos logit contributions
ances+0.001
glers+0.001
 sizes+0.001
boxes+0.001
cut+0.001
Neg logit contributions
 again-0.000
 once-0.000
<|endoftext|>-0.000
 happily-0.000
 Will-0.000
Tokenver
Token ID332
Feature activation-9.74e-03

Pos logit contributions
 again+0.024
 once+0.022
 happily+0.021
 Will+0.021
<|endoftext|>+0.021
Neg logit contributions
ances-0.034
 sizes-0.030
glers-0.029
 crates-0.028
boxes-0.028
Token,
Token ID11
Feature activation-9.33e-03

Pos logit contributions
ances+0.022
boxes+0.020
rier+0.020
glers+0.020
cut+0.020
Neg logit contributions
 blast-0.015
 least-0.014
 finished-0.014
<|endoftext|>-0.014
 once-0.014
Token "
Token ID366
Feature activation+5.80e-03

Pos logit contributions
 blast+0.015
 imag+0.014
 least+0.014
ig+0.014
 finished+0.014
Neg logit contributions
ances-0.014
boxes-0.013
cut-0.013
line-0.012
 sizes-0.012
TokenIs
Token ID3792
Feature activation-2.63e-02

Pos logit contributions
ances+0.012
glers+0.010
 sizes+0.010
boxes+0.010
cut+0.010
Neg logit contributions
 again-0.008
 once-0.008
<|endoftext|>-0.008
 happily-0.007
 Will-0.007
Token that
Token ID326
Feature activation-2.01e-02

Pos logit contributions
 blast+0.030
 happily+0.029
<|endoftext|>+0.026
 again+0.025
<+0.025
Neg logit contributions
ances-0.052
 cans-0.048
glers-0.047
stones-0.047
 barrels-0.046
Token your
Token ID534
Feature activation-9.80e-03

Pos logit contributions
 blast+0.024
 happily+0.023
 Will+0.023
 finished+0.023
 imag+0.023
Neg logit contributions
ances-0.038
 sizes-0.034
 cans-0.033
rier-0.033
glers-0.033
Token coc
Token ID8954
Feature activation-7.03e-02

Pos logit contributions
 Will+0.015
 again+0.015
ez+0.014
 happily+0.014
<|endoftext|>+0.014
Neg logit contributions
ances-0.016
 sizes-0.013
 cans-0.012
 contents-0.012
 textures-0.012
Tokenoon
Token ID2049
Feature activation-1.11e-02

Pos logit contributions
 again+0.129
 Will+0.120
<|endoftext|>+0.119
 once+0.119
 blast+0.119
Neg logit contributions
ances-0.111
glers-0.093
 sizes-0.089
boxes-0.085
cut-0.084
Token,
Token ID11
Feature activation-1.74e-03

Pos logit contributions
 again+0.010
 Will+0.009
<|endoftext|>+0.009
 blast+0.009
 once+0.009
Neg logit contributions
ances-0.027
glers-0.024
 sizes-0.024
boxes-0.023
cut-0.023
Token butterfly
Token ID35113
Feature activation-3.32e-02

Pos logit contributions
 happily+0.002
 again+0.002
 imag+0.002
 finished+0.002
 blast+0.002
Neg logit contributions
ances-0.004
 cans-0.003
 sizes-0.003
boxes-0.003
ters-0.003
Token?"
Token ID1701
Feature activation-6.41e-03

Pos logit contributions
 again+0.036
 once+0.035
 blast+0.034
 Will+0.034
 happily+0.034
Neg logit contributions
ances-0.072
glers-0.066
 sizes-0.063
cut-0.061
boxes-0.061
Token The
Token ID383
Feature activation-1.43e-03

Pos logit contributions
 blast+0.007
 imag+0.007
 finished+0.006
ig+0.006
ez+0.006
Neg logit contributions
ances-0.013
 cans-0.012
line-0.012
bank-0.011
glers-0.011
Token you
Token ID345
Feature activation-1.59e-02

Pos logit contributions
 again+0.001
 happily+0.001
 once+0.001
 blast+0.001
 finished+0.001
Neg logit contributions
ances-0.002
glers-0.002
 sizes-0.002
boxes-0.002
cut-0.002
Token want
Token ID765
Feature activation-1.47e-03

Pos logit contributions
 again+0.027
 Will+0.025
 once+0.025
<|endoftext|>+0.025
 blast+0.024
Neg logit contributions
ances-0.027
 sizes-0.020
ales-0.020
cut-0.018
glers-0.018
Token to
Token ID284
Feature activation-1.31e-02

Pos logit contributions
 blast+0.001
 happily+0.001
 symp+0.001
ig+0.001
 finished+0.001
Neg logit contributions
ances-0.003
glers-0.003
 cans-0.003
 sizes-0.003
ales-0.003
Token see
Token ID766
Feature activation-2.30e-02

Pos logit contributions
 symp+0.017
 again+0.017
 Will+0.016
 happily+0.016
 once+0.016
Neg logit contributions
ances-0.024
glers-0.020
 sizes-0.020
ales-0.019
boxes-0.019
Token my
Token ID616
Feature activation-8.59e-03

Pos logit contributions
 blast+0.027
 finished+0.026
 happily+0.024
 again+0.023
<|endoftext|>+0.022
Neg logit contributions
ances-0.047
ales-0.042
 sizes-0.042
glers-0.041
stones-0.041
Token coc
Token ID8954
Feature activation-6.88e-02

Pos logit contributions
 Will+0.013
 blast+0.013
 again+0.012
 finished+0.012
<|endoftext|>+0.012
Neg logit contributions
ances-0.015
 sizes-0.013
stones-0.012
glers-0.012
 cans-0.012
Tokenoon
Token ID2049
Feature activation-1.46e-02

Pos logit contributions
 again+0.132
 Will+0.123
 once+0.122
 blast+0.122
<|endoftext|>+0.121
Neg logit contributions
ances-0.104
glers-0.085
 sizes-0.082
boxes-0.078
cut-0.078
Token,
Token ID11
Feature activation-6.58e-03

Pos logit contributions
 blast+0.014
 Will+0.013
<|endoftext|>+0.013
 again+0.013
 finished+0.012
Neg logit contributions
ances-0.034
glers-0.030
 sizes-0.030
boxes-0.029
cut-0.029
Token L
Token ID406
Feature activation-3.25e-02

Pos logit contributions
 again+0.008
 finished+0.008
 blast+0.007
 happily+0.007
 imag+0.007
Neg logit contributions
ances-0.014
 sizes-0.012
glers-0.012
boxes-0.011
 cans-0.011
Tokenila
Token ID10102
Feature activation-5.26e-02

Pos logit contributions
 again+0.057
 blast+0.055
 Will+0.054
 imag+0.054
 happily+0.054
Neg logit contributions
ances-0.056
glers-0.046
ales-0.043
boxes-0.041
ters-0.040
Token?"
Token ID1701
Feature activation-8.75e-03

Pos logit contributions
 blast+0.076
 imag+0.073
 finished+0.067
 Will+0.063
 smiles+0.061
Neg logit contributions
ances-0.102
cut-0.091
itive-0.085
glers-0.083
 sizes-0.082
Token,
Token ID11
Feature activation-1.89e-02

Pos logit contributions
 again+0.008
 once+0.007
<|endoftext|>+0.007
 Will+0.007
 blast+0.007
Neg logit contributions
ances-0.021
glers-0.019
 sizes-0.018
boxes-0.018
cut-0.018
Token just
Token ID655
Feature activation+5.48e-03

Pos logit contributions
 again+0.020
<|endoftext|>+0.019
 once+0.018
 Will+0.018
 happily+0.018
Neg logit contributions
ances-0.044
 sizes-0.039
glers-0.038
boxes-0.037
cut-0.037
Token like
Token ID588
Feature activation-2.82e-02

Pos logit contributions
ances+0.011
 sizes+0.010
glers+0.009
cut+0.009
ales+0.009
Neg logit contributions
 again-0.007
<|endoftext|>-0.007
 blast-0.007
 Will-0.007
 once-0.007
Token the
Token ID262
Feature activation-8.89e-03

Pos logit contributions
 again+0.028
<|endoftext|>+0.024
 once+0.024
 happily+0.023
 blast+0.023
Neg logit contributions
ances-0.069
 sizes-0.061
glers-0.060
boxes-0.059
cut-0.058
Token z
Token ID1976
Feature activation-2.77e-02

Pos logit contributions
 again+0.013
 happily+0.013
 Will+0.012
<|endoftext|>+0.012
 finished+0.012
Neg logit contributions
ances-0.016
 sizes-0.015
glers-0.013
cut-0.013
boxes-0.013
Tokenig
Token ID328
Feature activation-6.44e-02

Pos logit contributions
 Will+0.031
<|endoftext|>+0.028
 again+0.026
+0.024
 happily+0.023
Neg logit contributions
ances-0.067
stones-0.056
iest-0.056
ales-0.056
gs-0.055
Tokenzag
Token ID50183
Feature activation+1.31e-03

Pos logit contributions
 again+0.125
<|endoftext|>+0.117
 Will+0.117
 once+0.116
 blast+0.116
Neg logit contributions
ances-0.097
glers-0.077
 sizes-0.076
boxes-0.073
cut-0.072
Token.
Token ID13
Feature activation-2.03e-02

Pos logit contributions
ances+0.003
 sizes+0.002
glers+0.002
boxes+0.002
cut+0.002
Neg logit contributions
 again-0.002
 Will-0.002
<|endoftext|>-0.002
 once-0.001
 happily-0.001
Token<|endoftext|>
Token ID50256
Feature activation-1.07e-04

Pos logit contributions
 finished+0.021
 again+0.021
 rec+0.019
 str+0.019
 imag+0.019
Neg logit contributions
ances-0.042
 sizes-0.041
boxes-0.039
bank-0.038
handed-0.038
TokenOnce
Token ID7454
Feature activation+1.93e-03

Pos logit contributions
 blast+0.000
 str+0.000
 imag+0.000
 finished+0.000
 rec+0.000
Neg logit contributions
glers-0.000
boxes-0.000
ances-0.000
 cans-0.000
ters-0.000
Token there
Token ID612
Feature activation+7.84e-03

Pos logit contributions
ances+0.004
glers+0.004
 sizes+0.003
boxes+0.003
cases+0.003
Neg logit contributions
 blast-0.002
 again-0.002
 Will-0.002
 finished-0.002
 imag-0.002
Token't
Token ID470
Feature activation+2.87e-02

Pos logit contributions
 again+0.026
 once+0.024
<|endoftext|>+0.023
 Will+0.023
 happily+0.023
Neg logit contributions
ances-0.036
glers-0.031
 sizes-0.030
boxes-0.029
cut-0.029
Token want
Token ID765
Feature activation+6.44e-03

Pos logit contributions
ances+0.055
 sizes+0.045
ales+0.043
glers+0.042
limits+0.041
Neg logit contributions
 again-0.045
 once-0.041
<|endoftext|>-0.041
 Will-0.040
 happily-0.040
Token to
Token ID284
Feature activation+4.96e-03

Pos logit contributions
ances+0.012
 cans+0.011
 sizes+0.011
ales+0.010
glers+0.010
Neg logit contributions
 symp-0.008
 happily-0.007
ig-0.007
 blast-0.007
 Pret-0.006
Token be
Token ID307
Feature activation+1.21e-02

Pos logit contributions
ances+0.011
glers+0.010
 sizes+0.010
ales+0.009
boxes+0.009
Neg logit contributions
 again-0.006
 once-0.005
<|endoftext|>-0.005
 Will-0.005
 happily-0.005
Token a
Token ID257
Feature activation-1.67e-02

Pos logit contributions
ances+0.025
glers+0.022
 sizes+0.022
boxes+0.021
ales+0.020
Neg logit contributions
 again-0.016
 once-0.015
<|endoftext|>-0.015
 Will-0.014
 blast-0.014
Token coc
Token ID8954
Feature activation-6.40e-02

Pos logit contributions
 again+0.025
<|endoftext|>+0.025
 finished+0.025
 once+0.025
 smiles+0.024
Neg logit contributions
ances-0.027
 trigger-0.023
rier-0.023
glers-0.022
 sizes-0.022
Tokenoon
Token ID2049
Feature activation-3.55e-03

Pos logit contributions
 again+0.118
 once+0.110
<|endoftext|>+0.109
 Will+0.108
 happily+0.107
Neg logit contributions
ances-0.104
glers-0.086
 sizes-0.084
boxes-0.081
cut-0.079
Token anymore
Token ID7471
Feature activation-3.64e-03

Pos logit contributions
 again+0.004
 once+0.004
 happily+0.003
 Will+0.003
<|endoftext|>+0.003
Neg logit contributions
ances-0.009
glers-0.008
 sizes-0.007
boxes-0.007
 crates-0.007
Token!"
Token ID2474
Feature activation-1.45e-03

Pos logit contributions
 again+0.005
<|endoftext|>+0.005
 blast+0.005
 finished+0.005
 Will+0.005
Neg logit contributions
ances-0.007
 sizes-0.006
glers-0.006
boxes-0.006
cut-0.006
Token
Token ID198
Feature activation-5.84e-03

Pos logit contributions
 blast+0.001
ig+0.001
 finished+0.001
 imag+0.001
 again+0.001
Neg logit contributions
ances-0.003
 cans-0.003
limits-0.003
handed-0.003
bank-0.003
Token
Token ID198
Feature activation+5.10e-03

Pos logit contributions
 imag+0.005
 blast+0.005
 again+0.005
 finished+0.005
 Will+0.005
Neg logit contributions
ances-0.013
handed-0.012
glers-0.012
 cans-0.012
cut-0.011
Token,
Token ID11
Feature activation-6.89e-03

Pos logit contributions
 again+0.009
 once+0.008
<|endoftext|>+0.007
 Will+0.007
 happily+0.007
Neg logit contributions
ances-0.020
glers-0.017
 sizes-0.017
boxes-0.017
cut-0.016
Token sun
Token ID4252
Feature activation-2.97e-02

Pos logit contributions
 again+0.007
<|endoftext|>+0.006
 once+0.006
 Will+0.006
 happily+0.006
Neg logit contributions
ances-0.016
 sizes-0.014
glers-0.014
boxes-0.014
cut-0.013
Token!
Token ID0
Feature activation-2.73e-02

Pos logit contributions
 again+0.046
 once+0.042
<|endoftext|>+0.041
 Will+0.041
 happily+0.041
Neg logit contributions
ances-0.058
glers-0.049
 sizes-0.049
boxes-0.047
limits-0.047
Token Pour
Token ID39128
Feature activation-8.27e-03

Pos logit contributions
 again+0.026
 finished+0.024
 imag+0.022
 happily+0.021
 blast+0.021
Neg logit contributions
ances-0.064
 sizes-0.058
boxes-0.058
ters-0.056
glers-0.055
Token your
Token ID534
Feature activation+4.49e-03

Pos logit contributions
 again+0.008
<|endoftext|>+0.007
 blast+0.007
 Will+0.007
 once+0.007
Neg logit contributions
ances-0.020
 sizes-0.018
glers-0.018
boxes-0.017
 cans-0.017
Token rays
Token ID24823
Feature activation-6.31e-02

Pos logit contributions
ances+0.008
 sizes+0.007
 crates+0.007
boxes+0.007
 barrels+0.007
Neg logit contributions
 again-0.007
 Will-0.007
 once-0.006
 happily-0.006
<|endoftext|>-0.006
Token around
Token ID1088
Feature activation-2.52e-02

Pos logit contributions
 again+0.058
 once+0.050
<|endoftext|>+0.048
 happily+0.047
 Will+0.046
Neg logit contributions
ances-0.163
glers-0.144
 sizes-0.143
boxes-0.140
cut-0.137
Token the
Token ID262
Feature activation+2.58e-03

Pos logit contributions
 again+0.020
<|endoftext|>+0.019
 blast+0.019
 Will+0.018
 finished+0.018
Neg logit contributions
ances-0.063
glers-0.056
 sizes-0.055
boxes-0.054
ales-0.053
Token sky
Token ID6766
Feature activation-1.30e-02

Pos logit contributions
ances+0.005
glers+0.004
 sizes+0.004
boxes+0.004
cut+0.004
Neg logit contributions
 again-0.004
 once-0.004
<|endoftext|>-0.004
 Will-0.004
 happily-0.004
Token and
Token ID290
Feature activation-1.28e-02

Pos logit contributions
 again+0.010
<|endoftext|>+0.008
 once+0.008
 Will+0.008
 blast+0.008
Neg logit contributions
ances-0.035
glers-0.031
 sizes-0.031
boxes-0.030
cut-0.029
Token give
Token ID1577
Feature activation+5.57e-03

Pos logit contributions
 again+0.013
<|endoftext|>+0.012
 once+0.012
 Will+0.011
 blast+0.011
Neg logit contributions
ances-0.030
glers-0.027
 sizes-0.027
boxes-0.026
cut-0.025
Token to
Token ID284
Feature activation+4.02e-03

Pos logit contributions
ances+0.007
 sizes+0.006
boxes+0.006
 cans+0.006
glers+0.006
Neg logit contributions
 again-0.003
 once-0.003
 happily-0.002
 Will-0.002
 blast-0.002
Token eat
Token ID4483
Feature activation-1.29e-02

Pos logit contributions
ances+0.010
glers+0.009
 sizes+0.009
boxes+0.008
cases+0.008
Neg logit contributions
 again-0.004
<|endoftext|>-0.004
 once-0.004
 happily-0.004
 blast-0.004
Token leaves
Token ID5667
Feature activation-3.94e-02

Pos logit contributions
 again+0.013
 blast+0.012
 once+0.012
 Will+0.012
<|endoftext|>+0.011
Neg logit contributions
ances-0.031
glers-0.027
 sizes-0.027
boxes-0.026
ales-0.026
Token and
Token ID290
Feature activation-7.82e-03

Pos logit contributions
 again+0.028
 once+0.021
 happily+0.018
<|endoftext|>+0.018
 Will+0.017
Neg logit contributions
ances-0.114
glers-0.102
 sizes-0.101
boxes-0.099
limits-0.098
Token make
Token ID787
Feature activation-7.18e-03

Pos logit contributions
 again+0.012
 once+0.011
<|endoftext|>+0.011
 Will+0.011
 happily+0.011
Neg logit contributions
ances-0.015
glers-0.013
 sizes-0.013
boxes-0.012
cut-0.012
Token coc
Token ID8954
Feature activation-6.30e-02

Pos logit contributions
 again+0.008
 blast+0.007
 finished+0.007
 once+0.007
 Will+0.007
Neg logit contributions
ances-0.015
 sizes-0.014
glers-0.013
 cans-0.013
ales-0.013
Tokenoons
Token ID13022
Feature activation-1.44e-02

Pos logit contributions
 again+0.110
 once+0.103
<|endoftext|>+0.103
 happily+0.100
 Will+0.099
Neg logit contributions
ances-0.110
glers-0.091
 sizes-0.090
boxes-0.088
 crates-0.085
Token.
Token ID13
Feature activation-1.02e-02

Pos logit contributions
 again+0.014
 once+0.011
 happily+0.010
 finished+0.009
<|endoftext|>+0.009
Neg logit contributions
ances-0.039
glers-0.035
 sizes-0.034
ales-0.033
 trigger-0.033
Token Maybe
Token ID6674
Feature activation-1.07e-02

Pos logit contributions
 again+0.011
 once+0.010
<|endoftext|>+0.010
 happily+0.009
 Will+0.009
Neg logit contributions
ances-0.024
glers-0.021
 sizes-0.021
boxes-0.021
cut-0.020
Token one
Token ID530
Feature activation-1.88e-02

Pos logit contributions
 again+0.012
<|endoftext|>+0.011
 happily+0.011
 blast+0.011
 once+0.011
Neg logit contributions
ances-0.025
glers-0.021
 sizes-0.021
boxes-0.021
 cans-0.020
Token day
Token ID1110
Feature activation+1.54e-02

Pos logit contributions
 symp+0.029
joy+0.028
 again+0.027
 blast+0.027
 rec+0.026
Neg logit contributions
igator-0.030
cut-0.029
ances-0.029
igators-0.028
 sizes-0.027
Token a
Token ID257
Feature activation-1.74e-02

Pos logit contributions
 again+0.025
 once+0.022
<|endoftext|>+0.022
 Will+0.022
 blast+0.022
Neg logit contributions
ances-0.054
glers-0.048
 sizes-0.047
boxes-0.046
cut-0.045
Token cookie
Token ID19751
Feature activation-1.82e-02

Pos logit contributions
 again+0.026
 once+0.025
<|endoftext|>+0.024
 Will+0.024
 happily+0.024
Neg logit contributions
ances-0.031
glers-0.027
 sizes-0.027
cut-0.026
boxes-0.025
Token or
Token ID393
Feature activation+8.22e-03

Pos logit contributions
 again+0.019
 once+0.017
<|endoftext|>+0.017
 Will+0.017
 happily+0.016
Neg logit contributions
ances-0.043
glers-0.038
 sizes-0.038
boxes-0.037
cut-0.036
Token a
Token ID257
Feature activation-1.49e-02

Pos logit contributions
ances+0.020
glers+0.017
 sizes+0.017
boxes+0.017
ales+0.016
Neg logit contributions
 again-0.009
 once-0.008
 Will-0.008
<|endoftext|>-0.008
 happily-0.008
Token l
Token ID300
Feature activation-2.42e-02

Pos logit contributions
 again+0.022
 once+0.021
 Will+0.021
<|endoftext|>+0.021
 blast+0.020
Neg logit contributions
ances-0.026
 sizes-0.023
glers-0.023
cut-0.022
boxes-0.022
Tokenoll
Token ID692
Feature activation-6.28e-02

Pos logit contributions
 again+0.031
<|endoftext|>+0.029
 Will+0.028
 once+0.028
 blast+0.027
Neg logit contributions
ances-0.052
glers-0.045
boxes-0.043
ales-0.043
cut-0.043
Tokenipop
Token ID42800
Feature activation-3.04e-02

Pos logit contributions
 again+0.079
<|endoftext|>+0.073
 once+0.072
 blast+0.071
 Will+0.070
Neg logit contributions
ances-0.138
glers-0.119
 sizes-0.116
boxes-0.115
cut-0.114
Token.
Token ID13
Feature activation-7.96e-03

Pos logit contributions
 again+0.033
 once+0.029
<|endoftext|>+0.028
 Will+0.028
 happily+0.028
Neg logit contributions
ances-0.073
glers-0.064
 sizes-0.063
boxes-0.062
cut-0.061
Token
Token ID198
Feature activation-5.30e-03

Pos logit contributions
 again+0.010
ig+0.009
 blast+0.009
 finished+0.009
 imag+0.009
Neg logit contributions
ances-0.016
 sizes-0.015
cases-0.014
boxes-0.014
glers-0.014
Token
Token ID198
Feature activation+6.45e-03

Pos logit contributions
 imag+0.006
 least+0.005
 again+0.005
ig+0.005
 Will+0.005
Neg logit contributions
ances-0.011
glers-0.011
cases-0.010
cut-0.010
handed-0.010
TokenTom
Token ID13787
Feature activation-5.79e-03

Pos logit contributions
ances+0.012
glers+0.010
 sizes+0.010
boxes+0.010
cut+0.010
Neg logit contributions
 again-0.010
 Will-0.009
 once-0.009
 happily-0.009
 blast-0.009
Token there
Token ID612
Feature activation+3.44e-03

Pos logit contributions
 again+0.006
<|endoftext|>+0.006
 once+0.006
 blast+0.006
 happily+0.006
Neg logit contributions
ances-0.013
 sizes-0.012
glers-0.012
boxes-0.011
cut-0.011
Token is
Token ID318
Feature activation+1.08e-02

Pos logit contributions
ances+0.009
glers+0.008
 sizes+0.008
 trigger+0.008
boxes+0.008
Neg logit contributions
 again-0.003
 once-0.002
<|endoftext|>-0.002
 Will-0.002
 happily-0.002
Token nothing
Token ID2147
Feature activation+2.71e-02

Pos logit contributions
ances+0.027
glers+0.024
 sizes+0.023
boxes+0.023
cut+0.023
Neg logit contributions
 again-0.011
 once-0.009
<|endoftext|>-0.009
 Will-0.009
 happily-0.009
Token there
Token ID612
Feature activation-2.90e-03

Pos logit contributions
ances+0.072
glers+0.064
 sizes+0.063
boxes+0.062
ales+0.061
Neg logit contributions
 again-0.024
 once-0.020
<|endoftext|>-0.019
 Will-0.019
 happily-0.019
Token.
Token ID13
Feature activation-3.53e-02

Pos logit contributions
 again+0.003
 once+0.003
<|endoftext|>+0.003
 Will+0.003
 happily+0.003
Neg logit contributions
ances-0.007
glers-0.006
 sizes-0.006
boxes-0.006
cut-0.006
Token Isn
Token ID25110
Feature activation-6.21e-02

Pos logit contributions
 again+0.041
<|endoftext|>+0.036
 happily+0.036
 once+0.036
 finished+0.036
Neg logit contributions
ances-0.081
 sizes-0.071
boxes-0.069
glers-0.069
cut-0.067
Token't
Token ID470
Feature activation+1.03e-02

Pos logit contributions
 again+0.100
 happily+0.092
<|endoftext|>+0.091
 blast+0.091
 once+0.090
Neg logit contributions
ances-0.111
glers-0.095
 sizes-0.090
cases-0.087
limits-0.087
Token that
Token ID326
Feature activation-2.03e-02

Pos logit contributions
ances+0.028
glers+0.025
 sizes+0.025
boxes+0.024
ales+0.024
Neg logit contributions
 again-0.008
 once-0.007
<|endoftext|>-0.006
 Will-0.006
 happily-0.006
Token interesting
Token ID3499
Feature activation-8.30e-04

Pos logit contributions
 again+0.029
<|endoftext|>+0.026
 once+0.026
 Will+0.026
 happily+0.026
Neg logit contributions
ances-0.041
 sizes-0.035
glers-0.035
boxes-0.034
cut-0.033
Token?"
Token ID1701
Feature activation-3.37e-02

Pos logit contributions
 again+0.001
<|endoftext|>+0.001
 Will+0.001
 once+0.001
 blast+0.001
Neg logit contributions
ances-0.002
 sizes-0.001
glers-0.001
boxes-0.001
cut-0.001
Token
Token ID198
Feature activation-2.35e-02

Pos logit contributions
 finished+0.032
 blast+0.032
 again+0.030
 imag+0.027
ig+0.026
Neg logit contributions
ances-0.077
boxes-0.069
cases-0.066
 sizes-0.065
cut-0.065
Token cater
Token ID20825
Feature activation-3.61e-02

Pos logit contributions
 again+0.005
 happily+0.005
 finished+0.005
 Will+0.005
 blast+0.004
Neg logit contributions
ances-0.004
 sizes-0.004
glers-0.004
 cans-0.004
cut-0.003
Tokenpillar
Token ID41643
Feature activation-1.87e-02

Pos logit contributions
 again+0.061
<|endoftext|>+0.057
 blast+0.057
 finished+0.057
 happily+0.056
Neg logit contributions
ances-0.062
 sizes-0.052
glers-0.052
stones-0.049
ales-0.049
Token will
Token ID481
Feature activation+1.47e-03

Pos logit contributions
 again+0.017
<|endoftext|>+0.016
 blast+0.016
 Will+0.015
 once+0.015
Neg logit contributions
ances-0.045
 sizes-0.041
glers-0.040
boxes-0.038
cut-0.038
Token make
Token ID787
Feature activation-3.04e-03

Pos logit contributions
ances+0.003
glers+0.003
 sizes+0.003
boxes+0.003
cases+0.003
Neg logit contributions
 again-0.002
 once-0.002
<|endoftext|>-0.002
 happily-0.002
 finished-0.002
Token a
Token ID257
Feature activation+2.83e-03

Pos logit contributions
 again+0.004
 once+0.003
 blast+0.003
 Will+0.003
<|endoftext|>+0.003
Neg logit contributions
ances-0.007
 sizes-0.006
glers-0.006
boxes-0.005
cut-0.005
Token coc
Token ID8954
Feature activation-6.20e-02

Pos logit contributions
ances+0.005
glers+0.004
 sizes+0.004
rier+0.004
 trigger+0.004
Neg logit contributions
 Will-0.004
 once-0.004
 again-0.004
 finished-0.004
<|endoftext|>-0.004
Tokenoon
Token ID2049
Feature activation-4.40e-03

Pos logit contributions
 again+0.107
 once+0.100
<|endoftext|>+0.100
 happily+0.097
 Will+0.096
Neg logit contributions
ances-0.109
glers-0.090
 sizes-0.090
boxes-0.088
 crates-0.086
Token.
Token ID13
Feature activation+1.24e-02

Pos logit contributions
 again+0.004
 once+0.004
<|endoftext|>+0.003
 Will+0.003
 happily+0.003
Neg logit contributions
ances-0.011
glers-0.010
 sizes-0.010
boxes-0.010
cut-0.009
Token Then
Token ID3244
Feature activation+1.54e-02

Pos logit contributions
ances+0.020
 sizes+0.016
boxes+0.016
 cans+0.016
limits+0.016
Neg logit contributions
 again-0.022
 finished-0.021
 blast-0.020
 happily-0.020
<|endoftext|>-0.020
Token it
Token ID340
Feature activation-1.46e-03

Pos logit contributions
ances+0.031
glers+0.026
 sizes+0.026
boxes+0.025
limits+0.025
Neg logit contributions
 again-0.022
 once-0.020
<|endoftext|>-0.020
 Will-0.020
 happily-0.020
Token will
Token ID481
Feature activation+2.19e-03

Pos logit contributions
<|endoftext|>+0.002
 Will+0.002
iley+0.002
 blast+0.002
joy+0.002
Neg logit contributions
glers-0.002
 cans-0.002
 sizes-0.002
cases-0.002
ances-0.002
Token,
Token ID11
Feature activation-7.08e-03

Pos logit contributions
 again+0.003
 once+0.003
<|endoftext|>+0.003
 Will+0.003
 happily+0.003
Neg logit contributions
ances-0.008
glers-0.007
 sizes-0.007
boxes-0.007
cut-0.007
Token it
Token ID340
Feature activation-9.58e-03

Pos logit contributions
 again+0.006
<|endoftext|>+0.005
 once+0.005
 happily+0.005
 Will+0.005
Neg logit contributions
ances-0.018
glers-0.016
 sizes-0.016
boxes-0.016
cut-0.015
Token will
Token ID481
Feature activation-7.81e-03

Pos logit contributions
<|endoftext|>+0.010
 again+0.010
 blast+0.009
 Will+0.009
 happily+0.009
Neg logit contributions
ances-0.020
glers-0.019
 sizes-0.019
limits-0.018
 cans-0.018
Token make
Token ID787
Feature activation-6.79e-03

Pos logit contributions
 again+0.011
 Will+0.010
 once+0.010
<|endoftext|>+0.010
 happily+0.010
Neg logit contributions
ances-0.016
glers-0.013
 sizes-0.013
cut-0.013
boxes-0.012
Token a
Token ID257
Feature activation-2.30e-03

Pos logit contributions
 again+0.008
 blast+0.007
 finished+0.007
 Will+0.007
 happily+0.007
Neg logit contributions
ances-0.014
 sizes-0.013
glers-0.012
ales-0.012
ters-0.012
Token coc
Token ID8954
Feature activation-6.17e-02

Pos logit contributions
 Will+0.004
 once+0.004
 again+0.004
 finished+0.004
 imag+0.004
Neg logit contributions
ances-0.003
rier-0.003
 trigger-0.003
glers-0.003
 sizes-0.003
Tokenoon
Token ID2049
Feature activation-2.16e-03

Pos logit contributions
 again+0.113
 once+0.105
<|endoftext|>+0.104
 Will+0.103
 happily+0.103
Neg logit contributions
ances-0.102
glers-0.084
 sizes-0.082
boxes-0.079
cut-0.077
Token and
Token ID290
Feature activation-8.73e-03

Pos logit contributions
 again+0.002
 once+0.002
<|endoftext|>+0.002
 Will+0.002
 happily+0.002
Neg logit contributions
ances-0.005
glers-0.005
 sizes-0.005
boxes-0.005
cut-0.005
Token turn
Token ID1210
Feature activation+2.46e-03

Pos logit contributions
 again+0.012
 once+0.011
<|endoftext|>+0.011
 Will+0.011
 happily+0.011
Neg logit contributions
ances-0.018
 sizes-0.016
glers-0.015
boxes-0.014
 cans-0.014
Token into
Token ID656
Feature activation-1.35e-02

Pos logit contributions
ances+0.006
 sizes+0.005
glers+0.005
 cans+0.005
 trigger+0.005
Neg logit contributions
 blast-0.002
 Will-0.002
 again-0.002
<|endoftext|>-0.002
 finished-0.002
Token a
Token ID257
Feature activation-1.29e-02

Pos logit contributions
 again+0.015
 Will+0.014
 finished+0.014
 once+0.014
ossible+0.014
Neg logit contributions
ances-0.028
 sizes-0.025
 cans-0.024
 barrels-0.023
 crates-0.023
Token will
Token ID481
Feature activation+4.29e-03

Pos logit contributions
 again+0.020
 blast+0.017
<|endoftext|>+0.017
 once+0.017
 Will+0.017
Neg logit contributions
ances-0.035
glers-0.031
 sizes-0.031
cut-0.031
limits-0.030
Token not
Token ID407
Feature activation+2.21e-02

Pos logit contributions
ances+0.009
glers+0.008
 sizes+0.008
boxes+0.008
cut+0.008
Neg logit contributions
 again-0.006
 once-0.005
<|endoftext|>-0.005
 Will-0.005
 happily-0.005
Token touch
Token ID3638
Feature activation-2.63e-02

Pos logit contributions
ances+0.049
glers+0.043
 sizes+0.042
boxes+0.041
cut+0.040
Neg logit contributions
 again-0.028
 once-0.025
<|endoftext|>-0.025
 Will-0.024
 happily-0.024
Token the
Token ID262
Feature activation-6.99e-03

Pos logit contributions
 again+0.024
 once+0.020
<|endoftext|>+0.020
 Will+0.019
 happily+0.019
Neg logit contributions
ances-0.068
glers-0.061
 sizes-0.060
boxes-0.059
cut-0.058
Token ash
Token ID12530
Feature activation-2.55e-02

Pos logit contributions
 again+0.013
 once+0.012
<|endoftext|>+0.012
 Will+0.012
 blast+0.012
Neg logit contributions
ances-0.011
glers-0.009
 sizes-0.009
boxes-0.009
ales-0.009
Tokentr
Token ID2213
Feature activation-6.15e-02

Pos logit contributions
<|endoftext|>+0.037
 blast+0.037
 Will+0.037
 again+0.036
 happily+0.036
Neg logit contributions
ances-0.047
boxes-0.041
glers-0.040
 sizes-0.039
cut-0.039
Tokenay
Token ID323
Feature activation-3.35e-02

Pos logit contributions
<|endoftext|>+0.086
 Will+0.085
 happily+0.082
 blast+0.081
 whist+0.080
Neg logit contributions
ances-0.115
boxes-0.105
glers-0.103
cases-0.102
limits-0.095
Token again
Token ID757
Feature activation-4.01e-02

Pos logit contributions
 blast+0.039
 again+0.039
<|endoftext|>+0.039
 Will+0.037
 happily+0.037
Neg logit contributions
ances-0.072
glers-0.063
boxes-0.062
 sizes-0.061
cut-0.061
Token,"
Token ID553
Feature activation-3.57e-03

Pos logit contributions
 again+0.049
 once+0.043
<|endoftext|>+0.041
 happily+0.041
 Will+0.039
Neg logit contributions
ances-0.094
glers-0.083
 sizes-0.081
boxes-0.080
cut-0.079
Token Tim
Token ID5045
Feature activation+8.84e-05

Pos logit contributions
 again+0.005
<|endoftext|>+0.005
 once+0.005
 Will+0.005
 happily+0.005
Neg logit contributions
ances-0.007
glers-0.006
 sizes-0.006
boxes-0.006
ales-0.006
Token says
Token ID1139
Feature activation+1.25e-02

Pos logit contributions
ances+0.000
glers+0.000
 sizes+0.000
boxes+0.000
ales+0.000
Neg logit contributions
 again-0.000
 blast-0.000
 Will-0.000
 once-0.000
<|endoftext|>-0.000
Token and
Token ID290
Feature activation-1.16e-02

Pos logit contributions
 again+0.018
 once+0.016
<|endoftext|>+0.016
 Will+0.016
 happily+0.016
Neg logit contributions
ances-0.028
glers-0.024
 sizes-0.024
boxes-0.023
cut-0.023
Token c
Token ID269
Feature activation-3.46e-02

Pos logit contributions
 again+0.014
 once+0.013
<|endoftext|>+0.013
 Will+0.012
 happily+0.012
Neg logit contributions
ances-0.026
glers-0.022
 sizes-0.022
boxes-0.022
cut-0.021
Tokenumin
Token ID7230
Feature activation-4.31e-02

Pos logit contributions
 again+0.059
 Will+0.059
 blast+0.058
<|endoftext|>+0.057
 once+0.056
Neg logit contributions
ances-0.058
glers-0.049
cases-0.047
rier-0.046
ales-0.046
Token and
Token ID290
Feature activation-1.11e-02

Pos logit contributions
 again+0.032
 once+0.027
<|endoftext|>+0.026
 Will+0.025
 happily+0.025
Neg logit contributions
ances-0.118
glers-0.106
 sizes-0.104
boxes-0.102
cut-0.100
Token cor
Token ID1162
Feature activation-2.40e-02

Pos logit contributions
 again+0.013
 once+0.012
<|endoftext|>+0.012
 Will+0.012
 happily+0.012
Neg logit contributions
ances-0.025
glers-0.021
 sizes-0.021
boxes-0.021
cut-0.020
Tokeni
Token ID72
Feature activation-6.14e-02

Pos logit contributions
<|endoftext|>+0.044
 again+0.044
 once+0.043
 happily+0.041
 blast+0.040
Neg logit contributions
ances-0.036
ales-0.030
glers-0.030
 sizes-0.029
boxes-0.028
Tokenander
Token ID4066
Feature activation-4.04e-02

Pos logit contributions
 again+0.058
 blast+0.055
<|endoftext|>+0.054
 Will+0.051
 once+0.051
Neg logit contributions
ances-0.145
boxes-0.125
glers-0.125
cut-0.125
 sizes-0.125
Token and
Token ID290
Feature activation-1.09e-02

Pos logit contributions
 again+0.039
<|endoftext|>+0.035
 blast+0.034
 once+0.034
 Will+0.034
Neg logit contributions
ances-0.099
glers-0.086
 sizes-0.086
boxes-0.085
cut-0.084
Token card
Token ID2657
Feature activation-8.26e-03

Pos logit contributions
 again+0.013
 once+0.012
<|endoftext|>+0.012
 Will+0.011
 happily+0.011
Neg logit contributions
ances-0.024
glers-0.021
 sizes-0.021
boxes-0.020
cut-0.020
Tokenam
Token ID321
Feature activation+1.08e-02

Pos logit contributions
 again+0.009
 once+0.008
<|endoftext|>+0.008
 happily+0.007
 Will+0.007
Neg logit contributions
ances-0.020
glers-0.018
 sizes-0.017
ales-0.017
boxes-0.017
Tokenom
Token ID296
Feature activation-2.01e-03

Pos logit contributions
ances+0.025
glers+0.022
 sizes+0.022
boxes+0.021
cut+0.021
Neg logit contributions
 again-0.012
 once-0.011
<|endoftext|>-0.011
 Will-0.011
 happily-0.011
Token down
Token ID866
Feature activation-2.40e-02

Pos logit contributions
 again+0.042
 blast+0.042
 Will+0.042
<|endoftext|>+0.040
 finished+0.039
Neg logit contributions
ances-0.092
glers-0.081
boxes-0.080
 sizes-0.078
cut-0.078
Token and
Token ID290
Feature activation-1.50e-02

Pos logit contributions
 blast+0.021
 Will+0.020
<|endoftext|>+0.020
 again+0.019
 imag+0.018
Neg logit contributions
ances-0.057
glers-0.051
boxes-0.050
cut-0.049
 sizes-0.049
Token enjoyed
Token ID8359
Feature activation-3.46e-02

Pos logit contributions
 again+0.017
 blast+0.017
<|endoftext|>+0.017
 once+0.016
 Will+0.015
Neg logit contributions
ances-0.030
glers-0.028
 sizes-0.027
cases-0.026
limits-0.026
Token her
Token ID607
Feature activation+3.83e-04

Pos logit contributions
 again+0.034
 blast+0.033
ig+0.033
 Will+0.032
 imag+0.031
Neg logit contributions
ances-0.076
 sizes-0.069
glers-0.067
boxes-0.065
bank-0.063
Token popcorn
Token ID38914
Feature activation+3.64e-03

Pos logit contributions
ances+0.001
 sizes+0.001
glers+0.001
 contents+0.001
iest+0.001
Neg logit contributions
 again-0.001
 Will-0.001
 Tw-0.000
Tw-0.000
 least-0.000
Token coc
Token ID8954
Feature activation-6.11e-02

Pos logit contributions
ances+0.009
glers+0.008
 sizes+0.008
boxes+0.008
ales+0.007
Neg logit contributions
 again-0.004
 once-0.003
<|endoftext|>-0.003
 happily-0.003
 Will-0.003
Tokenoon
Token ID2049
Feature activation-7.57e-05

Pos logit contributions
 again+0.099
 once+0.093
<|endoftext|>+0.093
 happily+0.091
 Will+0.090
Neg logit contributions
ances-0.113
glers-0.094
 sizes-0.094
boxes-0.091
 trigger-0.090
Token.
Token ID13
Feature activation-1.83e-02

Pos logit contributions
 again+0.000
 once+0.000
 happily+0.000
<|endoftext|>+0.000
 Will+0.000
Neg logit contributions
ances-0.000
glers-0.000
 trigger-0.000
boxes-0.000
limits-0.000
Token<|endoftext|>
Token ID50256
Feature activation-1.18e-03

Pos logit contributions
 again+0.022
 finished+0.021
 blast+0.020
ig+0.019
 str+0.019
Neg logit contributions
ances-0.037
boxes-0.035
 sizes-0.034
bank-0.032
cases-0.032
TokenOnce
Token ID7454
Feature activation+2.09e-03

Pos logit contributions
 blast+0.001
 finished+0.001
 imag+0.001
 str+0.001
 again+0.001
Neg logit contributions
ances-0.002
boxes-0.002
glers-0.002
 cans-0.002
 sizes-0.002
Token upon
Token ID2402
Feature activation-8.96e-03

Pos logit contributions
ances+0.004
glers+0.004
 sizes+0.004
cases+0.003
boxes+0.003
Neg logit contributions
 blast-0.003
 again-0.003
 imag-0.002
 Will-0.002
 finished-0.002
Token too
Token ID1165
Feature activation+2.30e-02

Pos logit contributions
 again+0.003
 Will+0.003
 once+0.003
<|endoftext|>+0.003
 blast+0.003
Neg logit contributions
ances-0.005
 sizes-0.004
glers-0.004
boxes-0.004
cut-0.004
Token heavy
Token ID4334
Feature activation-8.91e-03

Pos logit contributions
ances+0.051
 sizes+0.045
boxes+0.045
glers+0.044
 trigger+0.042
Neg logit contributions
 again-0.031
 finished-0.029
 once-0.028
<|endoftext|>-0.028
 happily-0.026
Token for
Token ID329
Feature activation+1.05e-03

Pos logit contributions
 again+0.007
 once+0.006
<|endoftext|>+0.006
 Will+0.006
 happily+0.006
Neg logit contributions
ances-0.024
glers-0.021
 sizes-0.021
boxes-0.021
cut-0.020
Token you
Token ID345
Feature activation+4.66e-03

Pos logit contributions
ances+0.002
glers+0.002
ales+0.002
 sizes+0.002
 cans+0.002
Neg logit contributions
 again-0.001
 once-0.001
 happily-0.001
 blast-0.001
 Will-0.001
Token?
Token ID30
Feature activation-4.41e-02

Pos logit contributions
ances+0.011
glers+0.010
 sizes+0.009
boxes+0.009
ales+0.009
Neg logit contributions
 again-0.005
 blast-0.004
 Will-0.004
 once-0.004
 happily-0.004
Token Well
Token ID3894
Feature activation-6.11e-02

Pos logit contributions
 again+0.064
 finished+0.059
 happily+0.058
<|endoftext|>+0.056
 once+0.056
Neg logit contributions
ances-0.083
limits-0.072
 sizes-0.071
boxes-0.070
cut-0.070
Token,
Token ID11
Feature activation-1.31e-02

Pos logit contributions
 again+0.073
 Will+0.068
 once+0.067
 happily+0.065
Tw+0.063
Neg logit contributions
ances-0.134
glers-0.113
cut-0.111
boxes-0.107
 sizes-0.107
Token I
Token ID314
Feature activation-4.07e-02

Pos logit contributions
 again+0.016
<|endoftext|>+0.015
 once+0.015
 happily+0.015
 blast+0.014
Neg logit contributions
ances-0.027
 sizes-0.023
glers-0.023
boxes-0.023
cut-0.023
Token think
Token ID892
Feature activation-1.57e-02

Pos logit contributions
 again+0.049
<|endoftext|>+0.045
 Tw+0.044
Tw+0.043
 once+0.042
Neg logit contributions
ances-0.082
cut-0.072
glers-0.071
 sizes-0.068
limits-0.067
Token you
Token ID345
Feature activation-2.18e-02

Pos logit contributions
 again+0.021
<|endoftext|>+0.020
 happily+0.020
 blast+0.020
 once+0.019
Neg logit contributions
ances-0.031
 sizes-0.026
cases-0.025
glers-0.025
boxes-0.025
Token're
Token ID821
Feature activation-2.67e-03

Pos logit contributions
 again+0.022
<|endoftext|>+0.020
 once+0.019
 happily+0.019
 Will+0.018
Neg logit contributions
ances-0.053
 sizes-0.046
glers-0.046
boxes-0.045
cut-0.044
Token,
Token ID11
Feature activation-1.10e-02

Pos logit contributions
 again+0.001
 once+0.000
<|endoftext|>+0.000
 Will+0.000
 happily+0.000
Neg logit contributions
ances-0.001
glers-0.001
 sizes-0.001
boxes-0.001
cut-0.001
Token he
Token ID339
Feature activation-2.26e-03

Pos logit contributions
 again+0.015
 once+0.013
<|endoftext|>+0.013
 happily+0.013
 finished+0.012
Neg logit contributions
ances-0.025
 sizes-0.022
glers-0.022
boxes-0.021
ales-0.020
Token emerged
Token ID9349
Feature activation-1.92e-02

Pos logit contributions
 Will+0.003
 again+0.003
 imag+0.003
<|endoftext|>+0.003
 symp+0.003
Neg logit contributions
ances-0.004
glers-0.003
stones-0.003
 barrels-0.003
boxes-0.003
Token from
Token ID422
Feature activation-4.45e-03

Pos logit contributions
 again+0.024
 Will+0.022
<|endoftext|>+0.022
 once+0.021
 blast+0.021
Neg logit contributions
ances-0.042
glers-0.036
 sizes-0.036
boxes-0.035
cut-0.034
Token the
Token ID262
Feature activation-5.35e-03

Pos logit contributions
 again+0.005
 Will+0.004
 blast+0.004
<|endoftext|>+0.004
ig+0.004
Neg logit contributions
ances-0.009
glers-0.009
cut-0.008
 crates-0.008
 barrels-0.008
Token coc
Token ID8954
Feature activation-6.09e-02

Pos logit contributions
 Will+0.007
 again+0.007
 happily+0.007
 imag+0.006
 once+0.006
Neg logit contributions
ances-0.010
 cans-0.009
 sizes-0.009
stones-0.009
glers-0.009
Tokenoon
Token ID2049
Feature activation-6.09e-03

Pos logit contributions
 again+0.098
 once+0.091
<|endoftext|>+0.091
 happily+0.088
 Will+0.086
Neg logit contributions
ances-0.116
 sizes-0.098
glers-0.097
boxes-0.094
 crates-0.091
Token as
Token ID355
Feature activation+4.16e-03

Pos logit contributions
 again+0.004
 once+0.004
 happily+0.003
<|endoftext|>+0.003
 finished+0.003
Neg logit contributions
ances-0.018
glers-0.016
 sizes-0.016
boxes-0.015
 trigger-0.015
Token a
Token ID257
Feature activation-1.23e-02

Pos logit contributions
ances+0.010
glers+0.008
 sizes+0.008
boxes+0.008
ales+0.008
Neg logit contributions
 again-0.005
 once-0.004
<|endoftext|>-0.004
 Will-0.004
 happily-0.004
Token beautiful
Token ID4950
Feature activation-2.38e-02

Pos logit contributions
 again+0.019
 once+0.019
 Will+0.018
 imag+0.018
 smiles+0.017
Neg logit contributions
ances-0.021
 trigger-0.017
glers-0.017
cut-0.017
stones-0.017
Token butterfly
Token ID35113
Feature activation-2.33e-02

Pos logit contributions
 Will+0.040
 again+0.035
 imag+0.035
 least+0.035
ossible+0.035
Neg logit contributions
ances-0.038
 sizes-0.032
 contents-0.029
stones-0.029
 textures-0.029
Token you
Token ID345
Feature activation-4.36e-03

Pos logit contributions
ances+0.011
 sizes+0.009
glers+0.009
boxes+0.009
ales+0.009
Neg logit contributions
 again-0.008
 blast-0.007
 finished-0.007
 once-0.007
 happily-0.007
Token where
Token ID810
Feature activation-2.17e-02

Pos logit contributions
 again+0.004
 once+0.004
<|endoftext|>+0.004
 Will+0.004
 blast+0.004
Neg logit contributions
ances-0.011
glers-0.009
 sizes-0.009
boxes-0.009
cut-0.009
Token I
Token ID314
Feature activation-9.64e-03

Pos logit contributions
 again+0.027
 once+0.024
<|endoftext|>+0.024
 happily+0.024
 blast+0.024
Neg logit contributions
ances-0.047
glers-0.041
 sizes-0.041
boxes-0.040
cut-0.039
Token make
Token ID787
Feature activation-1.14e-02

Pos logit contributions
 again+0.011
 once+0.010
<|endoftext|>+0.010
 Will+0.010
 happily+0.009
Neg logit contributions
ances-0.022
glers-0.019
 sizes-0.019
cut-0.018
ales-0.018
Token my
Token ID616
Feature activation+2.01e-03

Pos logit contributions
 again+0.013
 once+0.011
<|endoftext|>+0.011
 blast+0.011
 happily+0.011
Neg logit contributions
ances-0.026
glers-0.023
 sizes-0.023
boxes-0.022
cut-0.022
Token coc
Token ID8954
Feature activation-6.09e-02

Pos logit contributions
ances+0.004
 sizes+0.003
glers+0.003
boxes+0.003
cut+0.003
Neg logit contributions
 Will-0.003
 again-0.003
 finished-0.003
 blast-0.003
<|endoftext|>-0.003
Tokenoon
Token ID2049
Feature activation-4.08e-03

Pos logit contributions
 again+0.107
 once+0.100
<|endoftext|>+0.099
 happily+0.098
 Will+0.097
Neg logit contributions
ances-0.105
glers-0.087
 sizes-0.086
boxes-0.083
 crates-0.080
Token."
Token ID526
Feature activation+6.35e-04

Pos logit contributions
 again+0.005
 once+0.004
<|endoftext|>+0.004
 happily+0.004
 Will+0.004
Neg logit contributions
ances-0.009
glers-0.008
 sizes-0.008
boxes-0.008
cut-0.008
Token
Token ID198
Feature activation-7.18e-03

Pos logit contributions
ances+0.001
boxes+0.001
glers+0.001
bank+0.001
 cans+0.001
Neg logit contributions
 finished-0.001
 blast-0.001
 imag-0.001
 again-0.001
 str-0.000
Token
Token ID198
Feature activation+2.69e-03

Pos logit contributions
 imag+0.007
 finished+0.007
 again+0.007
 least+0.006
 blast+0.006
Neg logit contributions
ances-0.016
glers-0.015
cut-0.014
handed-0.014
boxes-0.014
TokenL
Token ID43
Feature activation-1.43e-02

Pos logit contributions
ances+0.006
glers+0.005
 sizes+0.005
boxes+0.005
ales+0.005
Neg logit contributions
 again-0.003
<|endoftext|>-0.003
 once-0.003
 happily-0.003
 Will-0.003
Token,
Token ID11
Feature activation-1.24e-02

Pos logit contributions
ances+0.021
glers+0.018
 sizes+0.018
boxes+0.017
ales+0.017
Neg logit contributions
 again-0.008
 once-0.007
<|endoftext|>-0.006
 happily-0.006
 Will-0.006
Token he
Token ID339
Feature activation-4.18e-03

Pos logit contributions
 again+0.019
 once+0.018
 Will+0.017
<|endoftext|>+0.017
 happily+0.017
Neg logit contributions
ances-0.025
glers-0.021
 sizes-0.021
boxes-0.020
ales-0.020
Token spun
Token ID26843
Feature activation-1.01e-02

Pos logit contributions
 blast+0.005
 again+0.005
<|endoftext|>+0.005
 Will+0.005
 Tw+0.005
Neg logit contributions
ances-0.008
 sizes-0.007
bank-0.007
stones-0.007
glers-0.007
Token a
Token ID257
Feature activation-5.62e-03

Pos logit contributions
 again+0.010
 blast+0.010
 Will+0.009
<|endoftext|>+0.009
 finished+0.009
Neg logit contributions
ances-0.023
 sizes-0.021
glers-0.020
boxes-0.020
 crates-0.019
Token special
Token ID2041
Feature activation-2.24e-03

Pos logit contributions
 again+0.009
 Will+0.009
 once+0.009
 imag+0.008
<|endoftext|>+0.008
Neg logit contributions
ances-0.009
 sizes-0.008
glers-0.008
cut-0.008
 trigger-0.008
Token coc
Token ID8954
Feature activation-6.07e-02

Pos logit contributions
 Will+0.004
 again+0.004
 once+0.003
<|endoftext|>+0.003
 finished+0.003
Neg logit contributions
ances-0.003
 sizes-0.003
cut-0.003
bank-0.003
boxes-0.003
Tokenoon
Token ID2049
Feature activation-3.21e-03

Pos logit contributions
 again+0.101
 once+0.095
<|endoftext|>+0.094
 happily+0.092
 Will+0.090
Neg logit contributions
ances-0.112
glers-0.093
 sizes-0.093
boxes-0.090
 trigger-0.087
Token and
Token ID290
Feature activation-8.54e-03

Pos logit contributions
 again+0.003
 once+0.002
 happily+0.002
<|endoftext|>+0.002
 finished+0.002
Neg logit contributions
ances-0.009
glers-0.008
 sizes-0.008
boxes-0.008
 trigger-0.008
Token went
Token ID1816
Feature activation+7.30e-03

Pos logit contributions
 again+0.009
 once+0.008
<|endoftext|>+0.008
 Will+0.007
 happily+0.007
Neg logit contributions
ances-0.021
glers-0.018
 sizes-0.018
boxes-0.018
cut-0.017
Token to
Token ID284
Feature activation-9.02e-03

Pos logit contributions
ances+0.017
glers+0.016
boxes+0.015
 sizes+0.015
 crates+0.015
Neg logit contributions
 again-0.006
 Will-0.006
<|endoftext|>-0.006
 once-0.006
 blast-0.006
Token sleep
Token ID3993
Feature activation-1.71e-02

Pos logit contributions
 again+0.010
ig+0.010
text+0.010
 imag+0.010
Tw+0.010
Neg logit contributions
ances-0.015
bank-0.015
 paths-0.014
 barrels-0.014
 crates-0.014
Token passes
Token ID8318
Feature activation+1.08e-02

Pos logit contributions
 again+0.012
<|endoftext|>+0.012
 once+0.011
 Will+0.011
 blast+0.011
Neg logit contributions
ances-0.026
 sizes-0.024
glers-0.023
stones-0.022
boxes-0.022
Token so
Token ID523
Feature activation+8.67e-03

Pos logit contributions
ances+0.025
glers+0.022
 sizes+0.022
boxes+0.021
cut+0.021
Neg logit contributions
 again-0.012
 once-0.011
<|endoftext|>-0.011
 Will-0.011
 happily-0.011
Token grace
Token ID11542
Feature activation-2.05e-02

Pos logit contributions
ances+0.020
glers+0.017
 sizes+0.017
boxes+0.016
cut+0.016
Neg logit contributions
 again-0.010
 once-0.009
<|endoftext|>-0.009
 blast-0.009
 Will-0.009
Tokenfully
Token ID2759
Feature activation-1.94e-02

Pos logit contributions
 again+0.033
 blast+0.031
<|endoftext|>+0.031
 once+0.031
 Will+0.031
Neg logit contributions
ances-0.037
glers-0.030
 sizes-0.030
boxes-0.029
cut-0.029
Token?
Token ID30
Feature activation-4.65e-02

Pos logit contributions
 again+0.023
<|endoftext|>+0.021
 Will+0.021
 once+0.021
 blast+0.020
Neg logit contributions
ances-0.043
 sizes-0.037
glers-0.037
boxes-0.036
cut-0.035
Token Isn
Token ID25110
Feature activation-6.06e-02

Pos logit contributions
 again+0.059
 finished+0.057
 happily+0.053
<|endoftext|>+0.051
 once+0.051
Neg logit contributions
ances-0.091
 sizes-0.082
boxes-0.081
ters-0.081
cut-0.079
Token't
Token ID470
Feature activation+8.44e-03

Pos logit contributions
 again+0.103
 imag+0.098
 finished+0.097
 blast+0.097
 happily+0.096
Neg logit contributions
ances-0.096
glers-0.082
cases-0.076
 sizes-0.075
stones-0.075
Token it
Token ID340
Feature activation-5.03e-03

Pos logit contributions
ances+0.023
glers+0.020
 sizes+0.020
boxes+0.020
cut+0.019
Neg logit contributions
 again-0.007
 once-0.005
<|endoftext|>-0.005
 Will-0.005
 happily-0.005
Token amazing
Token ID4998
Feature activation-3.05e-02

Pos logit contributions
 again+0.007
 once+0.006
<|endoftext|>+0.006
 Will+0.006
 happily+0.006
Neg logit contributions
ances-0.011
glers-0.009
 sizes-0.009
boxes-0.009
cut-0.009
Token?"
Token ID1701
Feature activation-3.26e-02

Pos logit contributions
 blast+0.035
<|endoftext|>+0.033
 finished+0.032
 Will+0.030
 happily+0.030
Neg logit contributions
ances-0.066
 sizes-0.061
boxes-0.057
glers-0.056
rier-0.055
Token
Token ID198
Feature activation-1.94e-02

Pos logit contributions
 blast+0.043
 finished+0.042
ig+0.041
 least+0.040
 imag+0.038
Neg logit contributions
 sizes-0.045
bank-0.045
ances-0.044
boxes-0.044
ters-0.043
Token."
Token ID526
Feature activation+4.25e-03

Pos logit contributions
 again+0.030
 once+0.025
 happily+0.023
 Will+0.021
<|endoftext|>+0.020
Neg logit contributions
ances-0.066
glers-0.060
limits-0.057
handed-0.056
 crates-0.056
Token L
Token ID406
Feature activation-2.34e-02

Pos logit contributions
ances+0.010
glers+0.009
 cans+0.009
boxes+0.009
 sizes+0.009
Neg logit contributions
 finished-0.004
 blast-0.004
 imag-0.003
 again-0.003
 happily-0.003
Tokenila
Token ID10102
Feature activation-1.68e-02

Pos logit contributions
 again+0.047
 Will+0.046
 imag+0.045
 blast+0.045
 happily+0.044
Neg logit contributions
ances-0.033
glers-0.028
ales-0.024
ters-0.024
boxes-0.023
Token touched
Token ID12615
Feature activation-2.04e-02

Pos logit contributions
 blast+0.026
 again+0.025
 Will+0.025
<|endoftext|>+0.024
 imag+0.024
Neg logit contributions
ances-0.027
glers-0.023
cut-0.023
 sizes-0.022
 cans-0.022
Token the
Token ID262
Feature activation-8.75e-03

Pos logit contributions
 blast+0.020
 imag+0.019
 finished+0.018
 again+0.017
 happily+0.017
Neg logit contributions
ances-0.046
 cans-0.042
glers-0.042
 sizes-0.041
 trigger-0.041
Token coc
Token ID8954
Feature activation-6.05e-02

Pos logit contributions
 again+0.014
 Will+0.014
 once+0.013
 happily+0.013
 finished+0.013
Neg logit contributions
ances-0.015
 sizes-0.013
 trigger-0.013
glers-0.013
boxes-0.012
Tokenoon
Token ID2049
Feature activation-2.27e-03

Pos logit contributions
 again+0.086
 once+0.079
<|endoftext|>+0.078
 happily+0.076
 Will+0.076
Neg logit contributions
ances-0.125
glers-0.107
 sizes-0.106
boxes-0.103
 crates-0.101
Token and
Token ID290
Feature activation-6.95e-03

Pos logit contributions
 again+0.002
 once+0.001
<|endoftext|>+0.001
 Will+0.001
 happily+0.001
Neg logit contributions
ances-0.006
glers-0.005
 sizes-0.005
boxes-0.005
cut-0.005
Token felt
Token ID2936
Feature activation-1.17e-02

Pos logit contributions
 again+0.007
<|endoftext|>+0.007
 once+0.007
 blast+0.007
 Will+0.007
Neg logit contributions
ances-0.015
 sizes-0.014
glers-0.013
 cans-0.013
boxes-0.013
Token its
Token ID663
Feature activation+9.96e-03

Pos logit contributions
 again+0.013
 blast+0.012
 Will+0.012
 once+0.012
 finished+0.011
Neg logit contributions
ances-0.026
 sizes-0.023
glers-0.023
cut-0.022
limits-0.022
Token texture
Token ID11743
Feature activation+5.83e-03

Pos logit contributions
ances+0.019
 sizes+0.016
glers+0.016
boxes+0.015
cut+0.015
Neg logit contributions
 again-0.016
 once-0.014
<|endoftext|>-0.014
 Will-0.014
 happily-0.014
Token should
Token ID815
Feature activation+1.26e-02

Pos logit contributions
 again+0.032
 once+0.028
<|endoftext|>+0.027
 Will+0.027
 happily+0.027
Neg logit contributions
ances-0.077
glers-0.068
 sizes-0.068
boxes-0.066
cut-0.065
Token never
Token ID1239
Feature activation+9.29e-03

Pos logit contributions
ances+0.032
glers+0.029
boxes+0.028
cases+0.028
 sizes+0.028
Neg logit contributions
 again-0.012
<|endoftext|>-0.012
 blast-0.011
 once-0.011
 finished-0.010
Token touch
Token ID3638
Feature activation-2.73e-02

Pos logit contributions
ances+0.021
glers+0.018
 sizes+0.018
boxes+0.018
cut+0.017
Neg logit contributions
 again-0.011
 once-0.010
<|endoftext|>-0.010
 Will-0.010
 happily-0.010
Token an
Token ID281
Feature activation-1.59e-02

Pos logit contributions
 again+0.028
 once+0.024
<|endoftext|>+0.022
 Will+0.022
 happily+0.020
Neg logit contributions
ances-0.070
glers-0.063
 sizes-0.060
boxes-0.059
cases-0.059
Token ash
Token ID12530
Feature activation-2.44e-02

Pos logit contributions
 again+0.030
 once+0.029
<|endoftext|>+0.028
 Will+0.028
 happily+0.028
Neg logit contributions
ances-0.025
glers-0.021
 sizes-0.020
boxes-0.020
limits-0.019
Tokentr
Token ID2213
Feature activation-6.04e-02

Pos logit contributions
<|endoftext|>+0.037
 blast+0.036
 happily+0.035
 Will+0.035
 again+0.035
Neg logit contributions
ances-0.044
boxes-0.038
cut-0.037
glers-0.037
 trigger-0.036
Tokenay
Token ID323
Feature activation-3.98e-02

Pos logit contributions
<|endoftext|>+0.083
 Will+0.081
 happily+0.080
 blast+0.079
 whist+0.076
Neg logit contributions
ances-0.113
boxes-0.104
glers-0.101
cases-0.100
rier-0.096
Token."
Token ID526
Feature activation+2.38e-03

Pos logit contributions
 again+0.045
 blast+0.044
<|endoftext|>+0.044
 happily+0.042
 imag+0.042
Neg logit contributions
ances-0.088
glers-0.076
boxes-0.075
cut-0.075
 sizes-0.073
Token
Token ID198
Feature activation-5.73e-03

Pos logit contributions
ances+0.006
glers+0.006
 cans+0.006
boxes+0.005
 sizes+0.005
Neg logit contributions
 again-0.002
 blast-0.001
 finished-0.001
 imag-0.001
 str-0.001
Token
Token ID198
Feature activation+3.24e-03

Pos logit contributions
 imag+0.005
 again+0.004
 finished+0.004
 blast+0.004
 least+0.004
Neg logit contributions
ances-0.014
glers-0.013
cut-0.012
boxes-0.012
 cans-0.012
TokenS
Token ID50
Feature activation-1.11e-02

Pos logit contributions
ances+0.007
 sizes+0.006
glers+0.006
ales+0.005
boxes+0.005
Neg logit contributions
 again-0.005
<|endoftext|>-0.005
 once-0.004
 happily-0.004
 Will-0.004
Token day
Token ID1110
Feature activation+2.52e-02

Pos logit contributions
 again+0.001
 imag+0.001
 blast+0.001
 Will+0.001
 happily+0.001
Neg logit contributions
ances-0.001
glers-0.001
cut-0.001
 sizes-0.001
boxes-0.001
Token,
Token ID11
Feature activation+1.45e-02

Pos logit contributions
ances+0.043
glers+0.039
boxes+0.036
 cans+0.036
iest+0.036
Neg logit contributions
 again-0.034
 imag-0.034
 least-0.033
 symp-0.032
 blast-0.032
Token Lily
Token ID20037
Feature activation-7.30e-03

Pos logit contributions
ances+0.027
glers+0.024
 sizes+0.023
 cans+0.023
 trigger+0.023
Neg logit contributions
 imag-0.018
 again-0.018
ig-0.018
 blast-0.017
ez-0.016
Token and
Token ID290
Feature activation+1.16e-02

Pos logit contributions
 again+0.008
<|endoftext|>+0.008
 blast+0.007
 Will+0.007
 once+0.007
Neg logit contributions
ances-0.016
glers-0.014
 sizes-0.014
limits-0.013
 cans-0.013
Token her
Token ID607
Feature activation-2.81e-03

Pos logit contributions
ances+0.020
boxes+0.017
cut+0.016
 sizes+0.016
glers+0.016
Neg logit contributions
 blast-0.018
 again-0.016
ig-0.016
 pro-0.015
 finished-0.015
Token family
Token ID1641
Feature activation+4.53e-02

Pos logit contributions
 again+0.003
 blast+0.003
 once+0.003
<|endoftext|>+0.003
 Will+0.003
Neg logit contributions
ances-0.006
glers-0.006
 sizes-0.005
boxes-0.005
 cans-0.005
Token went
Token ID1816
Feature activation+1.30e-02

Pos logit contributions
ances+0.094
glers+0.087
boxes+0.078
 cans+0.078
cut+0.078
Neg logit contributions
 again-0.051
 Tw-0.049
 blast-0.048
 Will-0.048
<|endoftext|>-0.047
Token to
Token ID284
Feature activation-1.16e-02

Pos logit contributions
ances+0.030
glers+0.028
ales+0.026
boxes+0.026
 crates+0.026
Neg logit contributions
 again-0.011
 Will-0.011
 blast-0.011
 once-0.010
 finished-0.010
Token the
Token ID262
Feature activation+1.23e-02

Pos logit contributions
ig+0.020
ez+0.017
 again+0.016
 pro+0.016
 symp+0.016
Neg logit contributions
 abandoned-0.016
ances-0.016
 litter-0.016
 cans-0.015
 barrels-0.015
Token park
Token ID3952
Feature activation+8.01e-03

Pos logit contributions
ances+0.022
glers+0.020
 sizes+0.019
boxes+0.019
 crates+0.018
Neg logit contributions
 again-0.019
 once-0.018
 happily-0.018
 Will-0.018
<|endoftext|>-0.017
Token to
Token ID284
Feature activation-3.01e-03

Pos logit contributions
ances+0.019
glers+0.017
 sizes+0.017
boxes+0.016
ales+0.016
Neg logit contributions
 again-0.007
 Will-0.007
 blast-0.007
<|endoftext|>-0.007
 finished-0.007
Token left
Token ID1364
Feature activation-5.03e-03

Pos logit contributions
ances+0.032
glers+0.028
 sizes+0.028
cut+0.027
ales+0.027
Neg logit contributions
 again-0.016
 once-0.015
<|endoftext|>-0.015
 happily-0.014
 Will-0.014
Token it
Token ID340
Feature activation+1.33e-03

Pos logit contributions
 again+0.003
 once+0.003
<|endoftext|>+0.003
 Will+0.003
 happily+0.003
Neg logit contributions
ances-0.014
glers-0.013
 sizes-0.012
boxes-0.012
cut-0.012
Token here
Token ID994
Feature activation+1.13e-02

Pos logit contributions
ances+0.003
glers+0.003
 sizes+0.003
boxes+0.003
cut+0.003
Neg logit contributions
 again-0.001
 once-0.001
<|endoftext|>-0.001
 Will-0.001
 happily-0.001
Token and
Token ID290
Feature activation-4.60e-03

Pos logit contributions
ances+0.024
 sizes+0.021
glers+0.020
boxes+0.020
ales+0.020
Neg logit contributions
 again-0.014
 blast-0.013
 Will-0.013
<|endoftext|>-0.013
 finished-0.012
Token did
Token ID750
Feature activation-9.12e-03

Pos logit contributions
 again+0.005
 once+0.005
<|endoftext|>+0.004
 Will+0.004
 happily+0.004
Neg logit contributions
ances-0.011
glers-0.009
 sizes-0.009
boxes-0.009
limits-0.009
Token not
Token ID407
Feature activation+4.34e-02

Pos logit contributions
 again+0.015
 once+0.014
 Will+0.014
<|endoftext|>+0.013
 happily+0.013
Neg logit contributions
ances-0.017
glers-0.014
 sizes-0.014
boxes-0.013
limits-0.013
Token throw
Token ID3714
Feature activation+5.88e-03

Pos logit contributions
ances+0.103
boxes+0.095
glers+0.093
cases+0.092
 sizes+0.092
Neg logit contributions
 again-0.049
 blast-0.043
<|endoftext|>-0.043
 finished-0.043
 happily-0.042
Token it
Token ID340
Feature activation-6.58e-03

Pos logit contributions
ances+0.014
 sizes+0.013
glers+0.013
boxes+0.012
ales+0.012
Neg logit contributions
 again-0.005
 Will-0.005
 blast-0.005
 once-0.005
<|endoftext|>-0.005
Token away
Token ID1497
Feature activation+5.55e-03

Pos logit contributions
 again+0.007
 once+0.006
<|endoftext|>+0.006
 happily+0.006
 Will+0.006
Neg logit contributions
ances-0.016
glers-0.014
 sizes-0.014
boxes-0.013
cut-0.013
Token.
Token ID13
Feature activation-1.16e-02

Pos logit contributions
ances+0.013
glers+0.011
 sizes+0.011
limits+0.011
boxes+0.011
Neg logit contributions
 again-0.007
 once-0.006
 happily-0.006
<|endoftext|>-0.006
 Will-0.006
Token We
Token ID775
Feature activation+4.94e-03

Pos logit contributions
 again+0.011
 Will+0.010
 once+0.010
<|endoftext|>+0.010
 blast+0.009
Neg logit contributions
ances-0.029
glers-0.026
 sizes-0.025
boxes-0.025
cut-0.024
Token,
Token ID11
Feature activation-4.70e-03

Pos logit contributions
ances+0.083
glers+0.074
 sizes+0.074
boxes+0.072
cut+0.071
Neg logit contributions
 again-0.020
 once-0.016
<|endoftext|>-0.016
 Will-0.015
 happily-0.015
Token and
Token ID290
Feature activation-1.52e-03

Pos logit contributions
 again+0.006
 once+0.006
 happily+0.006
<|endoftext|>+0.006
 Will+0.005
Neg logit contributions
ances-0.010
glers-0.009
 sizes-0.009
boxes-0.009
 trigger-0.009
Token Lily
Token ID20037
Feature activation+1.13e-02

Pos logit contributions
 again+0.002
<|endoftext|>+0.002
 once+0.001
 Will+0.001
 happily+0.001
Neg logit contributions
ances-0.004
glers-0.003
boxes-0.003
ales-0.003
 trigger-0.003
Token was
Token ID373
Feature activation-6.94e-03

Pos logit contributions
ances+0.028
 sizes+0.025
glers+0.025
boxes+0.024
cut+0.024
Neg logit contributions
 again-0.010
 blast-0.009
 Will-0.009
 once-0.009
<|endoftext|>-0.009
Token a
Token ID257
Feature activation-1.17e-02

Pos logit contributions
 again+0.008
 blast+0.006
<|endoftext|>+0.006
 once+0.006
 Will+0.006
Neg logit contributions
ances-0.016
 sizes-0.014
glers-0.014
cut-0.013
limits-0.013
Token bit
Token ID1643
Feature activation+4.27e-02

Pos logit contributions
 again+0.021
 once+0.020
<|endoftext|>+0.019
ig+0.019
 finished+0.018
Neg logit contributions
ances-0.017
 sizes-0.014
cut-0.014
glers-0.014
rier-0.013
Token older
Token ID4697
Feature activation+3.51e-03

Pos logit contributions
ances+0.091
boxes+0.081
ales+0.075
glers+0.074
keeping+0.073
Neg logit contributions
 again-0.064
 happily-0.058
 Will-0.058
 once-0.055
 finished-0.053
Token.
Token ID13
Feature activation-1.50e-03

Pos logit contributions
ances+0.008
 sizes+0.007
glers+0.007
boxes+0.007
cut+0.007
Neg logit contributions
 again-0.004
<|endoftext|>-0.004
 once-0.003
 blast-0.003
 Will-0.003
Token They
Token ID1119
Feature activation+2.90e-02

Pos logit contributions
 again+0.002
 once+0.002
<|endoftext|>+0.002
 Will+0.002
 happily+0.002
Neg logit contributions
ances-0.003
glers-0.003
 sizes-0.003
boxes-0.003
cut-0.003
Token were
Token ID547
Feature activation+1.01e-02

Pos logit contributions
ances+0.065
glers+0.057
 sizes+0.057
cut+0.053
boxes+0.053
Neg logit contributions
 again-0.036
 once-0.032
<|endoftext|>-0.031
 blast-0.031
 Will-0.030
Token very
Token ID845
Feature activation+3.89e-02

Pos logit contributions
ances+0.024
glers+0.021
 sizes+0.021
boxes+0.020
cut+0.020
Neg logit contributions
 again-0.011
 once-0.010
<|endoftext|>-0.010
 Will-0.009
 blast-0.009
Token a
Token ID257
Feature activation-2.04e-02

Pos logit contributions
 again+0.004
ig+0.003
 blast+0.003
 finished+0.003
 imag+0.003
Neg logit contributions
ances-0.008
 sizes-0.007
glers-0.007
boxes-0.007
handed-0.007
Token time
Token ID640
Feature activation+9.50e-03

Pos logit contributions
 again+0.024
 Will+0.022
 Tw+0.020
 least+0.019
 once+0.018
Neg logit contributions
ances-0.044
 sizes-0.037
glers-0.035
ales-0.035
 barrels-0.034
Token,
Token ID11
Feature activation-1.26e-02

Pos logit contributions
ances+0.022
glers+0.019
 sizes+0.019
boxes+0.018
cut+0.018
Neg logit contributions
 again-0.010
 blast-0.009
<|endoftext|>-0.009
 Will-0.009
 once-0.009
Token there
Token ID612
Feature activation+3.54e-03

Pos logit contributions
 again+0.020
 blast+0.020
 imag+0.020
 Will+0.019
 finished+0.019
Neg logit contributions
ances-0.020
glers-0.017
cut-0.016
boxes-0.016
 sizes-0.016
Token were
Token ID547
Feature activation+2.25e-02

Pos logit contributions
ances+0.008
glers+0.007
 sizes+0.007
boxes+0.007
cut+0.007
Neg logit contributions
 again-0.004
 blast-0.003
 imag-0.003
 rec-0.003
 Will-0.003
Token two
Token ID734
Feature activation+4.67e-02

Pos logit contributions
ances+0.041
glers+0.037
 sizes+0.034
boxes+0.033
 cans+0.033
Neg logit contributions
 again-0.034
 happily-0.030
<|endoftext|>-0.029
 blast-0.029
 Will-0.029
Token friends
Token ID2460
Feature activation+3.84e-02

Pos logit contributions
ances+0.112
limits+0.103
boxes+0.100
keeping+0.100
 trigger+0.099
Neg logit contributions
 again-0.049
 happily-0.044
<|endoftext|>-0.043
 once-0.043
 finished-0.041
Token,
Token ID11
Feature activation-8.77e-03

Pos logit contributions
ances+0.097
glers+0.086
 sizes+0.084
boxes+0.082
cut+0.082
Neg logit contributions
 again-0.036
 once-0.031
 Will-0.031
<|endoftext|>-0.030
 blast-0.030
Token Jack
Token ID3619
Feature activation-1.13e-02

Pos logit contributions
 again+0.010
 once+0.009
 finished+0.008
<|endoftext|>+0.008
 happily+0.008
Neg logit contributions
ances-0.020
glers-0.018
 sizes-0.017
boxes-0.017
cut-0.016
Token and
Token ID290
Feature activation+4.45e-03

Pos logit contributions
 again+0.008
 blast+0.007
 Will+0.007
 once+0.007
<|endoftext|>+0.007
Neg logit contributions
ances-0.031
 sizes-0.027
glers-0.027
boxes-0.026
cut-0.026
Token Jill
Token ID22772
Feature activation+1.15e-02

Pos logit contributions
ances+0.012
glers+0.010
boxes+0.010
 trigger+0.010
ales+0.010
Neg logit contributions
 Will-0.004
 again-0.004
 happily-0.004
 once-0.004
<|endoftext|>-0.003
Token
Token ID198
Feature activation+6.56e-03

Pos logit contributions
 imag+0.000
 again+0.000
 least+0.000
 finished+0.000
 blast+0.000
Neg logit contributions
ances-0.001
glers-0.001
cut-0.001
handed-0.001
boxes-0.001
TokenThe
Token ID464
Feature activation+5.34e-03

Pos logit contributions
ances+0.012
glers+0.010
 sizes+0.010
cut+0.010
boxes+0.010
Neg logit contributions
 again-0.010
 once-0.009
 Will-0.009
 finished-0.009
 blast-0.009
Token little
Token ID1310
Feature activation+1.47e-03

Pos logit contributions
ances+0.011
glers+0.010
 sizes+0.009
boxes+0.009
cut+0.009
Neg logit contributions
 again-0.007
 Will-0.006
 once-0.006
 blast-0.006
 happily-0.006
Token girl
Token ID2576
Feature activation+7.28e-04

Pos logit contributions
ances+0.003
glers+0.003
ales+0.003
 sizes+0.003
cut+0.003
Neg logit contributions
 again-0.002
 once-0.002
 happily-0.002
 Will-0.002
<|endoftext|>-0.002
Token did
Token ID750
Feature activation-3.70e-03

Pos logit contributions
ances+0.001
glers+0.001
cases+0.001
 sizes+0.001
cut+0.001
Neg logit contributions
 blast-0.001
<|endoftext|>-0.001
 again-0.001
<-0.001
 Will-0.001
Token not
Token ID407
Feature activation+4.03e-02

Pos logit contributions
 again+0.005
 Will+0.005
 blast+0.005
ig+0.005
 imag+0.004
Neg logit contributions
ances-0.007
glers-0.006
 sizes-0.006
cases-0.006
limits-0.006
Token know
Token ID760
Feature activation-2.22e-03

Pos logit contributions
ances+0.091
glers+0.081
 sizes+0.079
boxes+0.079
cut+0.076
Neg logit contributions
 again-0.049
 once-0.044
<|endoftext|>-0.043
 Will-0.043
 happily-0.043
Token what
Token ID644
Feature activation-1.20e-02

Pos logit contributions
 again+0.002
 blast+0.002
<|endoftext|>+0.002
 once+0.002
 happily+0.002
Neg logit contributions
ances-0.006
glers-0.005
 sizes-0.005
boxes-0.005
cut-0.005
Token to
Token ID284
Feature activation+1.46e-02

Pos logit contributions
 again+0.013
 blast+0.012
 once+0.012
 happily+0.012
ig+0.011
Neg logit contributions
ances-0.027
 sizes-0.024
boxes-0.023
glers-0.023
cut-0.023
Token do
Token ID466
Feature activation-1.16e-02

Pos logit contributions
ances+0.040
 sizes+0.038
glers+0.037
 crates+0.037
cut+0.036
Neg logit contributions
 again-0.010
 once-0.009
<|endoftext|>-0.009
 finished-0.008
 Will-0.007
Token with
Token ID351
Feature activation-2.77e-03

Pos logit contributions
 again+0.012
 once+0.011
<|endoftext|>+0.010
 Will+0.010
 happily+0.010
Neg logit contributions
ances-0.028
glers-0.025
 sizes-0.025
boxes-0.024
cut-0.024
Token a
Token ID257
Feature activation-2.67e-02

Pos logit contributions
 again+0.005
ig+0.004
 blast+0.004
 finished+0.004
 imag+0.004
Neg logit contributions
ances-0.010
 sizes-0.009
glers-0.009
boxes-0.009
handed-0.009
Token time
Token ID640
Feature activation+6.29e-03

Pos logit contributions
 again+0.032
 Will+0.029
 Tw+0.026
 least+0.025
 rec+0.024
Neg logit contributions
ances-0.057
 sizes-0.048
glers-0.046
ales-0.045
ooky-0.045
Token there
Token ID612
Feature activation+6.86e-03

Pos logit contributions
ances+0.015
glers+0.013
 sizes+0.013
boxes+0.013
cut+0.012
Neg logit contributions
 again-0.007
 once-0.006
 blast-0.006
<|endoftext|>-0.006
 Will-0.006
Token was
Token ID373
Feature activation-2.62e-03

Pos logit contributions
ances+0.015
glers+0.014
 sizes+0.014
boxes+0.013
cut+0.013
Neg logit contributions
 again-0.007
 blast-0.007
 imag-0.006
 la-0.006
 rec-0.006
Token a
Token ID257
Feature activation-2.63e-02

Pos logit contributions
 again+0.003
 Will+0.002
ig+0.002
 finished+0.002
 blast+0.002
Neg logit contributions
ances-0.006
 sizes-0.005
glers-0.005
 cans-0.005
ales-0.005
Token cup
Token ID6508
Feature activation+1.92e-02

Pos logit contributions
 again+0.048
 once+0.046
 imag+0.044
 finished+0.042
 least+0.042
Neg logit contributions
ances-0.039
glers-0.034
cut-0.031
 sizes-0.031
 trigger-0.030
Token.
Token ID13
Feature activation-3.54e-03

Pos logit contributions
ances+0.048
 sizes+0.042
limits+0.041
 crates+0.040
ales+0.040
Neg logit contributions
 again-0.021
<|endoftext|>-0.019
 once-0.018
 happily-0.017
 finished-0.016
Token The
Token ID383
Feature activation+7.70e-03

Pos logit contributions
 again+0.003
 once+0.002
<|endoftext|>+0.002
 happily+0.002
 blast+0.002
Neg logit contributions
ances-0.009
glers-0.008
 sizes-0.008
boxes-0.008
ales-0.008
Token cup
Token ID6508
Feature activation+1.61e-02

Pos logit contributions
ances+0.018
glers+0.015
 sizes+0.015
boxes+0.015
cut+0.014
Neg logit contributions
 again-0.009
 once-0.008
<|endoftext|>-0.008
 Will-0.008
 happily-0.008
Token was
Token ID373
Feature activation-1.41e-03

Pos logit contributions
ances+0.043
glers+0.038
 sizes+0.037
boxes+0.037
ales+0.036
Neg logit contributions
 again-0.014
 once-0.012
<|endoftext|>-0.011
 Will-0.011
 happily-0.011
Token very
Token ID845
Feature activation+3.32e-02

Pos logit contributions
 again+0.002
 once+0.001
<|endoftext|>+0.001
 Will+0.001
 happily+0.001
Neg logit contributions
ances-0.003
glers-0.003
 sizes-0.003
boxes-0.003
cut-0.003
Token friendly
Token ID8030
Feature activation-5.38e-04

Pos logit contributions
 again+0.024
<|endoftext|>+0.023
 once+0.022
 Will+0.022
 blast+0.022
Neg logit contributions
ances-0.026
glers-0.022
 sizes-0.022
boxes-0.021
ales-0.021
Token knight
Token ID22062
Feature activation-4.43e-03

Pos logit contributions
 again+0.001
 once+0.001
<|endoftext|>+0.001
 Will+0.001
 happily+0.001
Neg logit contributions
ances-0.001
glers-0.001
 sizes-0.001
boxes-0.001
cut-0.001
Token appeared
Token ID4120
Feature activation-1.06e-02

Pos logit contributions
 again+0.005
 once+0.005
<|endoftext|>+0.004
 blast+0.004
 Will+0.004
Neg logit contributions
ances-0.010
glers-0.009
 sizes-0.009
cut-0.009
boxes-0.009
Token and
Token ID290
Feature activation-2.25e-02

Pos logit contributions
 again+0.012
 once+0.010
<|endoftext|>+0.010
 happily+0.010
 Will+0.009
Neg logit contributions
ances-0.026
glers-0.023
 sizes-0.023
boxes-0.022
limits-0.022
Token showed
Token ID3751
Feature activation+1.86e-02

Pos logit contributions
 again+0.032
 once+0.030
<|endoftext|>+0.029
 blast+0.028
 happily+0.028
Neg logit contributions
ances-0.045
glers-0.039
 sizes-0.038
boxes-0.037
cut-0.037
Token them
Token ID606
Feature activation+5.86e-03

Pos logit contributions
ances+0.048
glers+0.043
 sizes+0.042
boxes+0.041
cut+0.040
Neg logit contributions
 again-0.017
 once-0.014
<|endoftext|>-0.014
 Will-0.014
 happily-0.014
Token around
Token ID1088
Feature activation-1.51e-02

Pos logit contributions
ances+0.017
glers+0.015
 sizes+0.015
boxes+0.014
limits+0.014
Neg logit contributions
 again-0.005
<|endoftext|>-0.004
 once-0.004
 happily-0.004
 Will-0.003
Token the
Token ID262
Feature activation+1.53e-02

Pos logit contributions
 blast+0.014
 again+0.014
<|endoftext|>+0.013
 finished+0.013
 Will+0.013
Neg logit contributions
ances-0.035
 sizes-0.031
glers-0.031
boxes-0.031
cut-0.030
Token castle
Token ID16669
Feature activation+6.42e-03

Pos logit contributions
ances+0.032
glers+0.027
 sizes+0.027
boxes+0.026
ales+0.025
Neg logit contributions
 again-0.022
 once-0.020
<|endoftext|>-0.020
 Will-0.020
 blast-0.019
Token.
Token ID13
Feature activation+1.62e-03

Pos logit contributions
ances+0.017
glers+0.015
 trigger+0.015
ales+0.014
 sizes+0.014
Neg logit contributions
 again-0.007
 once-0.005
 happily-0.005
 finished-0.005
<|endoftext|>-0.005
Token Lily
Token ID20037
Feature activation-1.75e-02

Pos logit contributions
ances+0.003
glers+0.003
 sizes+0.003
boxes+0.003
cases+0.003
Neg logit contributions
 again-0.002
 once-0.002
 blast-0.002
<|endoftext|>-0.002
ig-0.002
TokenThe
Token ID464
Feature activation+1.21e-03

Pos logit contributions
ances+0.013
 sizes+0.011
glers+0.011
boxes+0.011
ales+0.011
Neg logit contributions
 again-0.008
<|endoftext|>-0.008
 happily-0.008
 once-0.008
 Will-0.007
Token big
Token ID1263
Feature activation+2.09e-03

Pos logit contributions
ances+0.003
ales+0.002
limits+0.002
 sizes+0.002
boxes+0.002
Neg logit contributions
 again-0.002
<|endoftext|>-0.002
 once-0.001
 Will-0.001
 happily-0.001
Token,
Token ID11
Feature activation-1.85e-02

Pos logit contributions
ances+0.005
limits+0.004
cases+0.004
boxes+0.004
 sizes+0.004
Neg logit contributions
 again-0.003
 once-0.003
 happily-0.003
<|endoftext|>-0.002
 Will-0.002
Token scary
Token ID14343
Feature activation+2.33e-02

Pos logit contributions
 again+0.031
 once+0.029
<|endoftext|>+0.029
 Will+0.028
 happily+0.028
Neg logit contributions
ances-0.034
glers-0.028
 sizes-0.028
boxes-0.027
ales-0.026
Token fish
Token ID5916
Feature activation+6.61e-03

Pos logit contributions
ances+0.052
boxes+0.044
limits+0.044
 sizes+0.044
glers+0.044
Neg logit contributions
 again-0.033
 once-0.030
 happily-0.028
<|endoftext|>-0.028
 Will-0.028
Token chased
Token ID26172
Feature activation+2.50e-02

Pos logit contributions
ances+0.016
glers+0.014
 sizes+0.014
boxes+0.013
limits+0.013
Neg logit contributions
 again-0.008
 once-0.007
<|endoftext|>-0.007
 happily-0.006
 Will-0.006
Token them
Token ID606
Feature activation+2.01e-02

Pos logit contributions
ances+0.049
ales+0.042
glers+0.041
 trigger+0.041
 crates+0.040
Neg logit contributions
 blast-0.032
 again-0.032
 finished-0.031
ig-0.030
<|endoftext|>-0.030
Token,
Token ID11
Feature activation-2.19e-02

Pos logit contributions
ances+0.054
glers+0.049
 sizes+0.047
boxes+0.047
ales+0.046
Neg logit contributions
 again-0.019
 once-0.015
 happily-0.015
<|endoftext|>-0.014
 Will-0.013
Token but
Token ID475
Feature activation-7.83e-03

Pos logit contributions
 again+0.026
 once+0.024
<|endoftext|>+0.023
 Will+0.023
 happily+0.023
Neg logit contributions
ances-0.050
glers-0.043
 sizes-0.043
boxes-0.042
cut-0.041
Token Nem
Token ID22547
Feature activation-1.46e-02

Pos logit contributions
 again+0.011
 blast+0.011
<|endoftext|>+0.010
 once+0.010
 Will+0.010
Neg logit contributions
ances-0.015
glers-0.013
 sizes-0.013
boxes-0.012
cut-0.012
Tokeno
Token ID78
Feature activation-1.79e-02

Pos logit contributions
 again+0.015
 once+0.013
 happily+0.012
<|endoftext|>+0.012
 Will+0.012
Neg logit contributions
ances-0.037
glers-0.032
 sizes-0.032
boxes-0.031
 crates-0.031
Token could
Token ID714
Feature activation-6.82e-03

Pos logit contributions
ances+0.007
glers+0.006
 sizes+0.006
ales+0.006
 trigger+0.006
Neg logit contributions
 again-0.003
 once-0.003
 happily-0.002
 Will-0.002
 finished-0.002
Token stay
Token ID2652
Feature activation+8.04e-03

Pos logit contributions
 again+0.008
 once+0.007
<|endoftext|>+0.007
 happily+0.007
 finished+0.007
Neg logit contributions
ances-0.016
 sizes-0.014
glers-0.014
boxes-0.014
 trigger-0.014
Token safe
Token ID3338
Feature activation-3.75e-03

Pos logit contributions
ances+0.019
glers+0.017
 sizes+0.017
boxes+0.017
cut+0.016
Neg logit contributions
 again-0.009
 once-0.008
<|endoftext|>-0.007
 Will-0.007
 happily-0.007
Token until
Token ID1566
Feature activation-1.69e-02

Pos logit contributions
 again+0.003
 once+0.003
 happily+0.002
<|endoftext|>+0.002
 Will+0.002
Neg logit contributions
ances-0.010
glers-0.009
 sizes-0.009
boxes-0.009
cut-0.009
Token she
Token ID673
Feature activation-1.47e-02

Pos logit contributions
 blast+0.019
 Will+0.019
Tw+0.019
<|endoftext|>+0.019
ig+0.018
Neg logit contributions
ances-0.034
glers-0.030
boxes-0.029
limits-0.029
 sizes-0.028
Token wanted
Token ID2227
Feature activation+6.74e-03

Pos logit contributions
<|endoftext|>+0.017
 again+0.016
 blast+0.016
 Will+0.015
Tw+0.015
Neg logit contributions
ances-0.030
glers-0.027
 sizes-0.026
cut-0.025
cases-0.025
Token to
Token ID284
Feature activation-4.34e-03

Pos logit contributions
ances+0.017
 sizes+0.015
glers+0.015
boxes+0.015
 cans+0.015
Neg logit contributions
 again-0.005
 blast-0.005
<|endoftext|>-0.005
 Will-0.004
 once-0.004
Token play
Token ID711
Feature activation-1.71e-02

Pos logit contributions
 again+0.005
 once+0.004
<|endoftext|>+0.004
 Will+0.004
 happily+0.004
Neg logit contributions
ances-0.010
glers-0.009
 sizes-0.009
boxes-0.008
cut-0.008
Token again
Token ID757
Feature activation-2.80e-02

Pos logit contributions
 imag+0.019
 Will+0.019
 blast+0.019
 finished+0.019
<|endoftext|>+0.018
Neg logit contributions
ances-0.035
boxes-0.030
glers-0.030
 sizes-0.030
 cans-0.030
Token.
Token ID13
Feature activation-8.91e-03

Pos logit contributions
 again+0.030
 blast+0.030
<|endoftext|>+0.029
 Will+0.029
 finished+0.028
Neg logit contributions
ances-0.063
 sizes-0.054
glers-0.053
boxes-0.052
ales-0.051
Token
Token ID198
Feature activation-2.08e-03

Pos logit contributions
 blast+0.012
ig+0.011
 whist+0.011
 finished+0.011
 imag+0.010
Neg logit contributions
ances-0.014
 sizes-0.014
boxes-0.014
handed-0.014
cases-0.014
Token She
Token ID1375
Feature activation+1.06e-02

Pos logit contributions
ances+0.026
boxes+0.021
 sizes+0.021
 cans+0.020
limits+0.020
Neg logit contributions
 again-0.033
 finished-0.031
 blast-0.031
 once-0.031
 Will-0.030
Token is
Token ID318
Feature activation+1.70e-03

Pos logit contributions
ances+0.021
glers+0.019
 cans+0.019
 sizes+0.019
cases+0.018
Neg logit contributions
 again-0.013
<|endoftext|>-0.012
 once-0.012
 Will-0.012
 blast-0.011
Token too
Token ID1165
Feature activation+2.95e-02

Pos logit contributions
ances+0.003
glers+0.002
 sizes+0.002
cut+0.002
limits+0.002
Neg logit contributions
 again-0.003
 Will-0.003
 blast-0.003
 once-0.003
<|endoftext|>-0.002
Token busy
Token ID8179
Feature activation+4.68e-03

Pos logit contributions
ances+0.063
boxes+0.058
glers+0.057
 sizes+0.056
stones+0.055
Neg logit contributions
 again-0.039
<|endoftext|>-0.038
 happily-0.038
 once-0.037
 finished-0.036
Token cutting
Token ID7720
Feature activation+2.01e-03

Pos logit contributions
ances+0.011
glers+0.009
 sizes+0.009
boxes+0.009
cut+0.009
Neg logit contributions
 again-0.005
<|endoftext|>-0.005
 Will-0.005
 once-0.005
 blast-0.005
Token.
Token ID13
Feature activation+1.38e-02

Pos logit contributions
ances+0.005
glers+0.005
 sizes+0.005
boxes+0.004
cut+0.004
Neg logit contributions
 again-0.002
 once-0.002
<|endoftext|>-0.002
 happily-0.002
 Will-0.001
Token She
Token ID1375
Feature activation+9.75e-03

Pos logit contributions
ances+0.021
 sizes+0.017
boxes+0.017
 cans+0.016
limits+0.016
Neg logit contributions
 again-0.026
 blast-0.024
 finished-0.024
 once-0.023
 happily-0.023
Token does
Token ID857
Feature activation-7.42e-03

Pos logit contributions
ances+0.019
 cans+0.017
glers+0.017
 sizes+0.017
cases+0.016
Neg logit contributions
 again-0.013
<|endoftext|>-0.012
 once-0.012
 Will-0.011
 happily-0.011
Token not
Token ID407
Feature activation+4.36e-02

Pos logit contributions
 again+0.013
 once+0.012
 Will+0.012
<|endoftext|>+0.012
 happily+0.012
Neg logit contributions
ances-0.012
glers-0.010
 sizes-0.010
cut-0.010
limits-0.009
Token see
Token ID766
Feature activation+2.69e-03

Pos logit contributions
ances+0.099
glers+0.087
 sizes+0.086
boxes+0.085
cut+0.082
Neg logit contributions
 again-0.054
 once-0.047
<|endoftext|>-0.047
 Will-0.047
 happily-0.047
Token that
Token ID326
Feature activation+8.48e-03

Pos logit contributions
ances+0.006
 sizes+0.006
glers+0.006
 cans+0.005
ales+0.005
Neg logit contributions
 again-0.002
 blast-0.002
 happily-0.002
<|endoftext|>-0.002
 once-0.002
Token it
Token ID340
Feature activation+3.82e-03

Pos logit contributions
 again+0.020
 once+0.018
<|endoftext|>+0.017
 Will+0.017
 happily+0.017
Neg logit contributions
ances-0.042
glers-0.037
 sizes-0.037
boxes-0.036
cut-0.035
Token changed
Token ID3421
Feature activation+1.47e-02

Pos logit contributions
ances+0.011
glers+0.010
 sizes+0.010
boxes+0.009
ales+0.009
Neg logit contributions
 again-0.003
 once-0.002
<|endoftext|>-0.002
 Will-0.002
 happily-0.002
Token.
Token ID13
Feature activation+7.11e-03

Pos logit contributions
ances+0.037
glers+0.033
 sizes+0.032
cut+0.031
boxes+0.031
Neg logit contributions
 again-0.015
 once-0.013
 happily-0.013
<|endoftext|>-0.013
 Will-0.012
Token Her
Token ID2332
Feature activation-6.00e-03

Pos logit contributions
ances+0.013
boxes+0.011
glers+0.011
limits+0.011
bank+0.011
Neg logit contributions
 again-0.010
 blast-0.010
 finished-0.009
 once-0.009
<|endoftext|>-0.009
Token mom
Token ID1995
Feature activation-6.57e-03

Pos logit contributions
 again+0.007
 blast+0.007
Tw+0.006
 finished+0.006
<|endoftext|>+0.006
Neg logit contributions
ances-0.013
glers-0.012
 sizes-0.011
boxes-0.011
cases-0.011
Tokenmy
Token ID1820
Feature activation-1.24e-02

Pos logit contributions
 again+0.009
 once+0.008
<|endoftext|>+0.008
 Will+0.008
 happily+0.008
Neg logit contributions
ances-0.014
glers-0.012
 sizes-0.012
boxes-0.012
ales-0.011
Token explained
Token ID4893
Feature activation+2.31e-02

Pos logit contributions
 again+0.017
 once+0.015
<|endoftext|>+0.015
 blast+0.015
 Will+0.015
Neg logit contributions
ances-0.026
glers-0.023
 sizes-0.022
boxes-0.022
cut-0.022
Token that
Token ID326
Feature activation+2.04e-04

Pos logit contributions
ances+0.056
glers+0.049
 sizes+0.048
boxes+0.047
cut+0.047
Neg logit contributions
 again-0.023
 once-0.021
 blast-0.021
<|endoftext|>-0.021
 Will-0.020
Token the
Token ID262
Feature activation+6.71e-03

Pos logit contributions
ances+0.001
glers+0.000
 sizes+0.000
boxes+0.000
cut+0.000
Neg logit contributions
 again-0.000
 once-0.000
<|endoftext|>-0.000
 happily-0.000
 Will-0.000
Token restaurant
Token ID7072
Feature activation+1.88e-02

Pos logit contributions
ances+0.015
glers+0.012
ales+0.012
cut+0.012
 sizes+0.012
Neg logit contributions
 again-0.010
 once-0.009
 Will-0.009
<|endoftext|>-0.008
 finished-0.008
Token got
Token ID1392
Feature activation+1.91e-02

Pos logit contributions
ances+0.052
glers+0.047
 sizes+0.046
boxes+0.045
ales+0.045
Neg logit contributions
 again-0.013
 once-0.011
<|endoftext|>-0.011
 Will-0.010
 happily-0.010
Token getting
Token ID1972
Feature activation+3.39e-02

Pos logit contributions
 again+0.011
 once+0.010
<|endoftext|>+0.010
 happily+0.009
 Will+0.009
Neg logit contributions
ances-0.018
glers-0.015
 sizes-0.015
boxes-0.015
ales-0.014
Token teased
Token ID35380
Feature activation+2.81e-02

Pos logit contributions
ances+0.072
 sizes+0.060
boxes+0.060
glers+0.059
ales+0.058
Neg logit contributions
 again-0.051
 once-0.046
<|endoftext|>-0.045
 happily-0.045
 Will-0.044
Token,
Token ID11
Feature activation-2.85e-03

Pos logit contributions
ances+0.071
glers+0.063
 sizes+0.062
boxes+0.061
cut+0.060
Neg logit contributions
 again-0.027
 once-0.024
<|endoftext|>-0.023
 Will-0.023
 happily-0.023
Token so
Token ID523
Feature activation+1.15e-02

Pos logit contributions
 again+0.003
 once+0.003
<|endoftext|>+0.003
 Will+0.003
 happily+0.003
Neg logit contributions
ances-0.007
glers-0.006
 sizes-0.006
boxes-0.006
ales-0.006
Token she
Token ID673
Feature activation-5.22e-03

Pos logit contributions
ances+0.023
glers+0.020
 sizes+0.019
boxes+0.019
ales+0.019
Neg logit contributions
 again-0.017
 once-0.016
<|endoftext|>-0.016
 happily-0.016
 Will-0.015
Token flew
Token ID13112
Feature activation+8.33e-04

Pos logit contributions
 again+0.007
 once+0.007
<|endoftext|>+0.007
 blast+0.007
 Will+0.006
Neg logit contributions
ances-0.010
glers-0.009
 sizes-0.009
cut-0.009
boxes-0.009
Token over
Token ID625
Feature activation-2.89e-02

Pos logit contributions
ances+0.002
 sizes+0.002
glers+0.002
boxes+0.002
ales+0.002
Neg logit contributions
 again-0.001
 once-0.001
 happily-0.001
<|endoftext|>-0.001
 Will-0.001
Token to
Token ID284
Feature activation-1.42e-02

Pos logit contributions
 blast+0.028
ig+0.026
 finished+0.026
 Will+0.025
<|endoftext|>+0.025
Neg logit contributions
ances-0.062
 cans-0.056
 barrels-0.055
glers-0.055
boxes-0.054
Token try
Token ID1949
Feature activation-1.01e-02

Pos logit contributions
 again+0.015
 once+0.014
 finished+0.014
 blast+0.013
<|endoftext|>+0.013
Neg logit contributions
ances-0.032
glers-0.028
cut-0.028
 sizes-0.028
bank-0.027
Token and
Token ID290
Feature activation-1.71e-02

Pos logit contributions
 blast+0.009
 finished+0.008
 Will+0.008
 imag+0.008
 again+0.007
Neg logit contributions
ances-0.024
glers-0.021
 sizes-0.021
bank-0.021
boxes-0.021
Token help
Token ID1037
Feature activation+1.34e-02

Pos logit contributions
 again+0.020
 once+0.018
<|endoftext|>+0.018
 Will+0.017
 happily+0.017
Neg logit contributions
ances-0.039
glers-0.034
 sizes-0.034
boxes-0.033
cut-0.033
Token the
Token ID262
Feature activation-5.11e-03

Pos logit contributions
ances+0.006
glers+0.005
cut+0.005
boxes+0.005
 sizes+0.005
Neg logit contributions
 blast-0.003
 finished-0.003
 again-0.003
<|endoftext|>-0.003
 imag-0.003
Token naughty
Token ID45128
Feature activation+1.87e-02

Pos logit contributions
 again+0.008
 once+0.007
<|endoftext|>+0.007
 Will+0.007
 happily+0.007
Neg logit contributions
ances-0.010
glers-0.009
 sizes-0.009
boxes-0.008
ales-0.008
Token boy
Token ID2933
Feature activation-7.15e-03

Pos logit contributions
ances+0.041
glers+0.036
 sizes+0.035
ales+0.035
boxes+0.035
Neg logit contributions
 again-0.027
 once-0.023
 happily-0.022
 Will-0.022
<|endoftext|>-0.022
Token.
Token ID13
Feature activation-1.40e-03

Pos logit contributions
 blast+0.008
 Will+0.008
 imag+0.008
 again+0.007
 finished+0.007
Neg logit contributions
ances-0.015
 sizes-0.013
glers-0.013
boxes-0.013
cut-0.013
Token From
Token ID3574
Feature activation-1.92e-02

Pos logit contributions
 again+0.002
 blast+0.002
 once+0.002
<|endoftext|>+0.002
 finished+0.002
Neg logit contributions
ances-0.003
glers-0.002
 sizes-0.002
boxes-0.002
cut-0.002
Token then
Token ID788
Feature activation-1.25e-02

Pos logit contributions
 again+0.016
 once+0.014
<|endoftext|>+0.013
 Will+0.013
 happily+0.013
Neg logit contributions
ances-0.051
glers-0.046
 sizes-0.045
boxes-0.044
cut-0.043
Token on
Token ID319
Feature activation-3.82e-02

Pos logit contributions
 again+0.015
 Will+0.014
 imag+0.013
abel+0.013
 smiles+0.013
Neg logit contributions
ances-0.025
boxes-0.022
cut-0.022
limits-0.021
glers-0.021
Token,
Token ID11
Feature activation-8.50e-03

Pos logit contributions
 again+0.036
 once+0.031
<|endoftext|>+0.030
 Will+0.029
 happily+0.029
Neg logit contributions
ances-0.099
glers-0.087
 sizes-0.087
boxes-0.084
ales-0.083
Token the
Token ID262
Feature activation-9.04e-03

Pos logit contributions
 again+0.010
 blast+0.009
<|endoftext|>+0.009
 finished+0.009
 once+0.009
Neg logit contributions
ances-0.018
glers-0.016
 sizes-0.016
cut-0.016
boxes-0.016
Token naughty
Token ID45128
Feature activation+1.15e-02

Pos logit contributions
 again+0.015
 Will+0.014
<|endoftext|>+0.014
 happily+0.014
 once+0.014
Neg logit contributions
ances-0.015
glers-0.013
 sizes-0.013
boxes-0.012
 cans-0.012
Token boy
Token ID2933
Feature activation-5.27e-03

Pos logit contributions
ances+0.024
glers+0.020
 sizes+0.020
boxes+0.019
ales+0.019
Neg logit contributions
 again-0.017
 once-0.016
<|endoftext|>-0.015
 Will-0.015
 happily-0.015
Token.
Token ID13
Feature activation+1.42e-02

Pos logit contributions
 again+0.011
 blast+0.011
 Will+0.011
<|endoftext|>+0.010
 finished+0.010
Neg logit contributions
ances-0.025
glers-0.023
 sizes-0.023
boxes-0.022
 cans-0.022
Token
Token ID198
Feature activation-2.28e-03

Pos logit contributions
ances+0.025
 cans+0.023
glers+0.023
cases+0.023
 crates+0.022
Neg logit contributions
 again-0.019
 blast-0.018
ig-0.017
 finished-0.017
 imag-0.017
Token
Token ID198
Feature activation+8.61e-03

Pos logit contributions
 imag+0.002
 again+0.002
 blast+0.002
 finished+0.002
ig+0.002
Neg logit contributions
ances-0.005
glers-0.005
handed-0.005
boxes-0.005
bank-0.005
TokenBen
Token ID11696
Feature activation+3.14e-03

Pos logit contributions
ances+0.021
 sizes+0.019
glers+0.019
boxes+0.018
ales+0.018
Neg logit contributions
<|endoftext|>-0.009
 again-0.009
 blast-0.008
 happily-0.008
 Will-0.008
Token followed
Token ID3940
Feature activation+1.66e-02

Pos logit contributions
ances+0.006
 sizes+0.006
glers+0.006
cut+0.006
 cans+0.006
Neg logit contributions
 again-0.004
<|endoftext|>-0.004
 Will-0.003
 blast-0.003
 once-0.003
Token Lily
Token ID20037
Feature activation-8.72e-03

Pos logit contributions
ances+0.038
glers+0.033
ales+0.032
 trigger+0.032
 crates+0.032
Neg logit contributions
 blast-0.016
 again-0.016
 once-0.015
 finished-0.015
 imag-0.015
Token.
Token ID13
Feature activation+2.54e-02

Pos logit contributions
 again+0.008
 once+0.007
<|endoftext|>+0.007
 happily+0.007
 Will+0.007
Neg logit contributions
ances-0.022
glers-0.020
 sizes-0.020
boxes-0.019
cut-0.019
Token He
Token ID679
Feature activation+9.11e-03

Pos logit contributions
ances+0.042
glers+0.035
 sizes+0.035
boxes+0.034
 crates+0.033
Neg logit contributions
 again-0.044
 once-0.041
<|endoftext|>-0.041
 blast-0.041
 finished-0.040
Token was
Token ID373
Feature activation-1.16e-02

Pos logit contributions
ances+0.017
 sizes+0.016
glers+0.015
cut+0.015
 cans+0.015
Neg logit contributions
 again-0.012
<|endoftext|>-0.011
 Will-0.011
 blast-0.011
 once-0.011
Token still
Token ID991
Feature activation+6.43e-03

Pos logit contributions
 again+0.014
 once+0.013
<|endoftext|>+0.013
 happily+0.013
 Will+0.012
Neg logit contributions
ances-0.026
glers-0.023
 sizes-0.022
boxes-0.022
ales-0.022
Token scared
Token ID12008
Feature activation+3.25e-02

Pos logit contributions
ances+0.013
glers+0.012
 sizes+0.011
boxes+0.011
ales+0.011
Neg logit contributions
 again-0.009
 once-0.008
<|endoftext|>-0.008
 happily-0.008
 Will-0.008
Token smiled
Token ID13541
Feature activation-1.48e-02

Pos logit contributions
ances+0.028
glers+0.025
 sizes+0.024
boxes+0.023
cut+0.023
Neg logit contributions
 again-0.016
 once-0.015
<|endoftext|>-0.014
 Will-0.014
 blast-0.014
Token and
Token ID290
Feature activation-5.50e-03

Pos logit contributions
 blast+0.013
 Will+0.012
 finished+0.011
<|endoftext|>+0.011
 again+0.011
Neg logit contributions
ances-0.035
boxes-0.031
 sizes-0.031
rier-0.030
glers-0.030
Token said
Token ID531
Feature activation+1.56e-02

Pos logit contributions
 again+0.007
 once+0.007
<|endoftext|>+0.006
 Will+0.006
 blast+0.006
Neg logit contributions
ances-0.011
glers-0.010
 sizes-0.010
boxes-0.010
 crates-0.009
Token,
Token ID11
Feature activation-7.09e-03

Pos logit contributions
ances+0.037
glers+0.033
 sizes+0.033
boxes+0.032
cut+0.032
Neg logit contributions
 again-0.016
 once-0.014
 blast-0.014
<|endoftext|>-0.014
 Will-0.014
Token "
Token ID366
Feature activation+1.10e-02

Pos logit contributions
 again+0.009
 blast+0.009
 imag+0.009
 once+0.008
ig+0.008
Neg logit contributions
ances-0.014
boxes-0.012
cut-0.012
 sizes-0.012
cases-0.012
TokenThat
Token ID2504
Feature activation-2.02e-02

Pos logit contributions
ances+0.026
glers+0.023
ales+0.022
cases+0.021
 trigger+0.021
Neg logit contributions
 again-0.012
 Will-0.012
 once-0.011
 blast-0.011
 happily-0.011
Token sounds
Token ID5238
Feature activation+7.32e-03

Pos logit contributions
 again+0.032
 once+0.029
<|endoftext|>+0.029
 Will+0.028
 happily+0.028
Neg logit contributions
ances-0.040
glers-0.033
 sizes-0.032
boxes-0.032
ales-0.031
Token great
Token ID1049
Feature activation-1.24e-02

Pos logit contributions
ances+0.014
 sizes+0.013
glers+0.012
boxes+0.011
 cans+0.011
Neg logit contributions
 again-0.010
 Will-0.010
 once-0.010
 happily-0.009
<|endoftext|>-0.009
Token,
Token ID11
Feature activation+8.05e-03

Pos logit contributions
 again+0.012
 once+0.010
<|endoftext|>+0.010
 Will+0.010
 happily+0.010
Neg logit contributions
ances-0.032
glers-0.028
 sizes-0.028
boxes-0.027
cut-0.027
Token Dad
Token ID17415
Feature activation-9.48e-03

Pos logit contributions
ances+0.019
 sizes+0.017
glers+0.016
 cans+0.016
boxes+0.016
Neg logit contributions
 again-0.008
<|endoftext|>-0.008
 once-0.007
 blast-0.007
 happily-0.007
Token.
Token ID13
Feature activation-9.14e-03

Pos logit contributions
 again+0.010
 once+0.008
 Will+0.008
 happily+0.008
 finished+0.007
Neg logit contributions
ances-0.024
glers-0.021
limits-0.021
 trigger-0.021
 sizes-0.021
Token.
Token ID13
Feature activation-1.69e-02

Pos logit contributions
 again+0.006
 once+0.005
 happily+0.005
<|endoftext|>+0.005
 finished+0.005
Neg logit contributions
ances-0.016
glers-0.014
 sizes-0.014
boxes-0.014
 trigger-0.014
Token She
Token ID1375
Feature activation-6.93e-03

Pos logit contributions
 finished+0.019
 blast+0.019
 again+0.018
 str+0.018
 imag+0.017
Neg logit contributions
ances-0.034
 sizes-0.032
boxes-0.031
handed-0.031
bank-0.030
Token always
Token ID1464
Feature activation+8.68e-03

Pos logit contributions
 blast+0.009
<|endoftext|>+0.009
 again+0.009
<+0.009
 Will+0.008
Neg logit contributions
ances-0.012
 sizes-0.011
cut-0.010
glers-0.010
bank-0.010
Token remembered
Token ID12086
Feature activation-1.90e-02

Pos logit contributions
ances+0.019
 sizes+0.017
glers+0.016
boxes+0.016
limits+0.016
Neg logit contributions
 again-0.010
<|endoftext|>-0.010
 once-0.009
 blast-0.009
 Will-0.009
Token to
Token ID284
Feature activation+4.99e-04

Pos logit contributions
 again+0.016
 blast+0.015
 Will+0.014
<|endoftext|>+0.014
 once+0.014
Neg logit contributions
ances-0.048
 sizes-0.042
glers-0.042
boxes-0.041
cut-0.040
Token enjoy
Token ID2883
Feature activation-3.88e-02

Pos logit contributions
ances+0.001
glers+0.001
 sizes+0.001
boxes+0.001
cut+0.001
Neg logit contributions
 again-0.001
 once-0.000
<|endoftext|>-0.000
 Will-0.000
 happily-0.000
Token the
Token ID262
Feature activation-2.15e-03

Pos logit contributions
 again+0.037
<|endoftext|>+0.032
 Will+0.032
 once+0.032
 blast+0.032
Neg logit contributions
ances-0.096
 sizes-0.084
glers-0.083
boxes-0.082
cut-0.080
Token little
Token ID1310
Feature activation+8.88e-04

Pos logit contributions
 again+0.003
 Will+0.003
<|endoftext|>+0.003
 once+0.003
 happily+0.003
Neg logit contributions
ances-0.004
glers-0.004
 sizes-0.004
boxes-0.003
cut-0.003
Token things
Token ID1243
Feature activation+1.64e-03

Pos logit contributions
ances+0.002
glers+0.001
ales+0.001
boxes+0.001
 sizes+0.001
Neg logit contributions
 again-0.001
 once-0.001
 happily-0.001
 Will-0.001
 finished-0.001
Token with
Token ID351
Feature activation+3.38e-03

Pos logit contributions
ances+0.004
glers+0.003
 sizes+0.003
 trigger+0.003
boxes+0.003
Neg logit contributions
 again-0.002
 once-0.002
 happily-0.002
<|endoftext|>-0.002
 finished-0.002
Token gratitude
Token ID24083
Feature activation-5.75e-03

Pos logit contributions
ances+0.009
glers+0.008
 sizes+0.007
boxes+0.007
cut+0.007
Neg logit contributions
 again-0.003
 once-0.003
<|endoftext|>-0.003
 Will-0.003
 happily-0.003
Token.
Token ID13
Feature activation+7.20e-03

Pos logit contributions
ances+0.056
glers+0.049
 sizes+0.049
boxes+0.047
ales+0.047
Neg logit contributions
 again-0.020
 once-0.017
<|endoftext|>-0.016
 happily-0.015
 finished-0.014
Token Joy
Token ID14087
Feature activation-1.66e-02

Pos logit contributions
ances+0.016
glers+0.013
 sizes+0.013
boxes+0.013
cut+0.013
Neg logit contributions
 again-0.010
 once-0.009
<|endoftext|>-0.009
 happily-0.009
 Will-0.009
Token knew
Token ID2993
Feature activation-2.78e-03

Pos logit contributions
 again+0.020
 blast+0.019
 once+0.018
<|endoftext|>+0.018
 Will+0.017
Neg logit contributions
ances-0.036
glers-0.031
cut-0.031
 sizes-0.031
boxes-0.029
Token that
Token ID326
Feature activation-1.86e-03

Pos logit contributions
 again+0.003
 once+0.003
<|endoftext|>+0.003
 happily+0.003
 Will+0.003
Neg logit contributions
ances-0.006
glers-0.006
 sizes-0.006
boxes-0.005
ales-0.005
Token she
Token ID673
Feature activation-6.21e-03

Pos logit contributions
 again+0.002
 once+0.002
<|endoftext|>+0.002
 Will+0.002
 happily+0.001
Neg logit contributions
ances-0.005
glers-0.004
 trigger-0.004
ales-0.004
 sizes-0.004
Token couldn
Token ID3521
Feature activation-4.20e-02

Pos logit contributions
 again+0.008
 blast+0.007
<|endoftext|>+0.007
 once+0.007
 Will+0.007
Neg logit contributions
ances-0.014
glers-0.012
 sizes-0.012
cut-0.011
boxes-0.011
Token't
Token ID470
Feature activation+2.33e-02

Pos logit contributions
 again+0.061
 once+0.056
<|endoftext|>+0.055
 Will+0.055
 happily+0.055
Neg logit contributions
ances-0.083
glers-0.072
 sizes-0.069
boxes-0.068
cut-0.067
Token do
Token ID466
Feature activation-1.88e-02

Pos logit contributions
ances+0.052
glers+0.045
 sizes+0.044
boxes+0.043
cut+0.042
Neg logit contributions
 again-0.029
 once-0.026
<|endoftext|>-0.026
 Will-0.026
 happily-0.025
Token it
Token ID340
Feature activation+1.30e-02

Pos logit contributions
 again+0.020
 once+0.018
<|endoftext|>+0.018
 Will+0.018
 happily+0.018
Neg logit contributions
ances-0.044
glers-0.039
 sizes-0.038
boxes-0.037
limits-0.037
Token on
Token ID319
Feature activation-8.12e-03

Pos logit contributions
ances+0.034
glers+0.030
 sizes+0.030
boxes+0.029
cut+0.029
Neg logit contributions
 again-0.013
 once-0.011
 happily-0.010
<|endoftext|>-0.010
 Will-0.010
Token her
Token ID607
Feature activation-1.00e-03

Pos logit contributions
 again+0.008
 once+0.007
<|endoftext|>+0.007
 happily+0.006
 Will+0.006
Neg logit contributions
ances-0.021
glers-0.019
 sizes-0.018
boxes-0.018
cases-0.018
Token careful
Token ID8161
Feature activation-3.91e-03

Pos logit contributions
 again+0.010
 once+0.009
<|endoftext|>+0.009
 Will+0.009
 happily+0.009
Neg logit contributions
ances-0.012
glers-0.010
 sizes-0.010
boxes-0.010
cut-0.010
Token it
Token ID340
Feature activation-5.52e-03

Pos logit contributions
 again+0.004
 once+0.004
 happily+0.003
<|endoftext|>+0.003
 Will+0.003
Neg logit contributions
ances-0.010
glers-0.009
 trigger-0.009
 sizes-0.009
ales-0.009
Token won
Token ID1839
Feature activation-1.98e-02

Pos logit contributions
 again+0.006
 once+0.005
<|endoftext|>+0.005
 Will+0.005
 happily+0.005
Neg logit contributions
ances-0.014
glers-0.012
 sizes-0.012
boxes-0.012
cut-0.012
Token't
Token ID470
Feature activation+2.69e-02

Pos logit contributions
 again+0.027
 once+0.025
<|endoftext|>+0.024
 Will+0.024
 happily+0.024
Neg logit contributions
ances-0.043
glers-0.037
 sizes-0.037
boxes-0.036
 crates-0.035
Token vanish
Token ID39572
Feature activation-7.02e-03

Pos logit contributions
ances+0.064
boxes+0.059
glers+0.058
 crates+0.058
 sizes+0.056
Neg logit contributions
 again-0.034
 once-0.028
 happily-0.028
<|endoftext|>-0.026
 finished-0.026
Token again
Token ID757
Feature activation-3.99e-02

Pos logit contributions
 again+0.008
 once+0.007
<|endoftext|>+0.007
 Will+0.007
 happily+0.007
Neg logit contributions
ances-0.017
glers-0.015
 sizes-0.014
boxes-0.014
cut-0.014
Token."
Token ID526
Feature activation-1.63e-02

Pos logit contributions
 again+0.052
 once+0.046
 happily+0.044
 Will+0.043
<|endoftext|>+0.043
Neg logit contributions
ances-0.090
glers-0.080
 sizes-0.077
boxes-0.076
cut-0.075
Token
Token ID198
Feature activation-1.06e-02

Pos logit contributions
 blast+0.015
 finished+0.014
 imag+0.014
ig+0.014
 str+0.014
Neg logit contributions
ances-0.036
bank-0.033
handed-0.033
limits-0.033
boxes-0.032
Token
Token ID198
Feature activation-1.91e-03

Pos logit contributions
 imag+0.011
 least+0.010
 str+0.010
 blast+0.009
 finished+0.009
Neg logit contributions
handed-0.022
ances-0.022
cut-0.021
limits-0.020
bank-0.020
TokenSo
Token ID2396
Feature activation+1.48e-02

Pos logit contributions
 again+0.003
 once+0.003
 Will+0.003
<|endoftext|>+0.003
 blast+0.003
Neg logit contributions
ances-0.003
glers-0.003
 sizes-0.003
boxes-0.003
cut-0.003
Token Lucy
Token ID22162
Feature activation-2.80e-02

Pos logit contributions
ances+0.034
glers+0.030
 sizes+0.030
boxes+0.029
cut+0.028
Neg logit contributions
 again-0.016
<|endoftext|>-0.015
 once-0.015
 blast-0.014
 Will-0.014
Token tree
Token ID5509
Feature activation-4.35e-03

Pos logit contributions
ances+0.013
glers+0.011
 sizes+0.011
boxes+0.010
cut+0.010
Neg logit contributions
 again-0.012
 once-0.011
<|endoftext|>-0.011
 happily-0.010
 Will-0.010
Token would
Token ID561
Feature activation-2.20e-03

Pos logit contributions
 blast+0.005
 Will+0.005
 again+0.005
<|endoftext|>+0.005
 least+0.004
Neg logit contributions
ances-0.009
glers-0.008
 sizes-0.008
cut-0.008
cases-0.007
Token sway
Token ID20009
Feature activation-1.87e-02

Pos logit contributions
 again+0.003
 once+0.003
 Will+0.003
ig+0.002
<|endoftext|>+0.002
Neg logit contributions
ances-0.004
glers-0.004
 sizes-0.004
cut-0.004
bank-0.004
Token back
Token ID736
Feature activation-2.76e-02

Pos logit contributions
 again+0.017
 once+0.014
<|endoftext|>+0.014
 Will+0.014
 happily+0.014
Neg logit contributions
ances-0.049
glers-0.043
 sizes-0.043
boxes-0.042
cut-0.041
Token and
Token ID290
Feature activation-4.95e-05

Pos logit contributions
 blast+0.024
<|endoftext|>+0.023
 finished+0.022
 Will+0.022
 again+0.021
Neg logit contributions
ances-0.068
 sizes-0.060
glers-0.060
boxes-0.059
cases-0.058
Token forth
Token ID6071
Feature activation-3.59e-02

Pos logit contributions
 blast+0.000
<|endoftext|>+0.000
 once+0.000
 again+0.000
 Will+0.000
Neg logit contributions
ances-0.000
 sizes-0.000
glers-0.000
cut-0.000
boxes-0.000
Token,
Token ID11
Feature activation-7.37e-03

Pos logit contributions
 blast+0.034
<|endoftext|>+0.034
 again+0.033
 Will+0.032
 finished+0.032
Neg logit contributions
ances-0.084
 sizes-0.073
boxes-0.071
glers-0.071
cut-0.070
Token while
Token ID981
Feature activation-8.75e-03

Pos logit contributions
<|endoftext|>+0.009
 again+0.009
 blast+0.009
 Will+0.008
ig+0.008
Neg logit contributions
ances-0.015
 sizes-0.013
glers-0.013
bank-0.012
cut-0.012
Token the
Token ID262
Feature activation+3.35e-03

Pos logit contributions
 again+0.008
 finished+0.008
 blast+0.008
 once+0.008
 imag+0.008
Neg logit contributions
ances-0.020
glers-0.018
 sizes-0.017
 cans-0.017
boxes-0.017
Token small
Token ID1402
Feature activation+9.18e-03

Pos logit contributions
ances+0.006
glers+0.005
 sizes+0.005
boxes+0.005
cut+0.005
Neg logit contributions
 again-0.006
 once-0.005
 Will-0.005
<|endoftext|>-0.005
 happily-0.005
Token bush
Token ID24484
Feature activation-1.04e-03

Pos logit contributions
ances+0.017
glers+0.015
 sizes+0.014
boxes+0.014
limits+0.014
Neg logit contributions
 again-0.015
 once-0.014
<|endoftext|>-0.014
 happily-0.013
 Will-0.013
Token a
Token ID257
Feature activation-1.23e-02

Pos logit contributions
 again+0.002
 once+0.002
<|endoftext|>+0.002
 happily+0.002
 Will+0.002
Neg logit contributions
ances-0.007
glers-0.006
ales-0.006
 sizes-0.006
boxes-0.006
Token wonderful
Token ID7932
Feature activation-1.08e-02

Pos logit contributions
 again+0.019
 once+0.017
<|endoftext|>+0.017
 Will+0.017
 finished+0.017
Neg logit contributions
ances-0.022
 sizes-0.019
glers-0.019
cut-0.018
boxes-0.018
Token thing
Token ID1517
Feature activation+6.42e-03

Pos logit contributions
 again+0.016
 Will+0.015
<|endoftext|>+0.015
 finished+0.015
 once+0.014
Neg logit contributions
ances-0.019
 sizes-0.017
boxes-0.015
cut-0.015
glers-0.015
Token.
Token ID13
Feature activation-1.14e-02

Pos logit contributions
ances+0.016
glers+0.014
 sizes+0.014
boxes+0.014
limits+0.013
Neg logit contributions
 again-0.007
 once-0.006
 Will-0.006
<|endoftext|>-0.006
 happily-0.005
Token He
Token ID679
Feature activation+4.71e-03

Pos logit contributions
 again+0.016
 blast+0.014
 once+0.014
 finished+0.014
 happily+0.014
Neg logit contributions
ances-0.024
 sizes-0.020
glers-0.020
boxes-0.020
ales-0.019
Token stood
Token ID6204
Feature activation-4.14e-02

Pos logit contributions
ances+0.008
 sizes+0.007
glers+0.007
stones+0.007
cases+0.007
Neg logit contributions
 again-0.006
 blast-0.006
<|endoftext|>-0.006
 once-0.006
 Pretty-0.006
Token there
Token ID612
Feature activation+1.16e-03

Pos logit contributions
 blast+0.057
 Will+0.053
 imag+0.050
text+0.050
 finished+0.049
Neg logit contributions
ances-0.075
rier-0.069
boxes-0.067
glers-0.067
 crates-0.066
Token,
Token ID11
Feature activation-3.42e-02

Pos logit contributions
ances+0.003
 sizes+0.003
glers+0.003
boxes+0.003
cut+0.003
Neg logit contributions
 again-0.001
 blast-0.001
<|endoftext|>-0.001
 Will-0.001
 once-0.001
Token ent
Token ID920
Feature activation-7.21e-03

Pos logit contributions
 again+0.037
<|endoftext|>+0.034
 blast+0.034
 once+0.033
 Will+0.033
Neg logit contributions
ances-0.076
 sizes-0.067
glers-0.065
cut-0.064
boxes-0.064
Tokenran
Token ID2596
Feature activation-1.64e-02

Pos logit contributions
 again+0.011
 once+0.011
<|endoftext|>+0.010
 Will+0.010
 blast+0.010
Neg logit contributions
ances-0.014
glers-0.011
 sizes-0.011
boxes-0.011
cut-0.011
Tokenced
Token ID771
Feature activation-1.91e-02

Pos logit contributions
 again+0.023
<|endoftext|>+0.021
 once+0.021
 blast+0.021
 Will+0.021
Neg logit contributions
ances-0.034
glers-0.029
 sizes-0.028
boxes-0.028
cut-0.027