TOP ACTIVATIONS
MAX = 3.880

We have different likes and disl ikes . We can be
with lots of cob we bs . She liked to spin
much and smiled as the bar ber sn ipped away .
and watched as the cater pill ars grew bigger and bigger
and fl apped them as hard as she could . The
almost as high as the laser . But as soon as
and being kind is what matters . <|endoftext|> One day ,
. As the hedge hog ran away , the
almost as tall as the iron . The
were as cool as a robot . Finally , he found
be as tall as the tree . The c actus tried
some with its friends as well . So , it z
time into exercise time as well . Joey and his mom
different directions with both com bs . Jack felt so cool
and down and waved as hard as she could with a
just as fast as the black automobile . The little red
closer and watched as the cord went all around the room
the same shape as the toy . Andy was so excited
- four , thirty - five , thirty - six ,
- four , forty - five , forty - six ,

MIN ACTIVATIONS
MIN = -4.048

said , " A br ac ad ab ra !" But
together , " A br ac ad ab ra !"
and say " A br ac ad ab ra !" and
She had entered a sand pit and it was getting bigger
around . " A br ac ad ab ra !"
said , " A br ac ad ab ra !" Suddenly
new knowledge he made gu ac am ole .
and said " A br ac ad ab ra !" but
said , " A br ac ad ab ra ! I
air . " A br ac ad ab ra !"
did ! They built the most beautiful castle with the clean
wand . " A br ac ad ab ra !"
saying , " A br ac ad ab ra !"
said , " A br ac ad ab ra , make
. Nut ty showed Bub bles a shiny shield that he
said , " A br ac ad ab ra ! I
said , " A br ac ad ab ra !" Suddenly
j ogged , the hung rier she got . Finally ,
her arrow . She unt ies the rope and puts it
. He was even hung rier than before . But then

INTERVAL 1.898 - 3.880
CONTAINS 0.176%

said it was ok as long as he trusted the vegetable
cord was to charge the robot 's battery . Tim my
the bag together and take turns playing with the mar bles
moon until he could feel its light on his face .
new haircut and thanked the bar ber . After

INTERVAL -0.084 - 1.898
CONTAINS 57.473%

her mom and dad to take away any bear toys or
be very careful when using the tools . Steve
when Jack arrived he had a big smile on his face
have some y ummy pie . The end . <|endoftext|> Once
, and chase each other in the garden .

INTERVAL -2.066 - -0.084
CONTAINS 41.972%

They are curious . They want to see what is in
needed to go to the bathroom . She went to the
with the doll some more . They do not lose her
cat always tries to catch the mouse , but the mouse
" How will that help ?" her mom wondered

INTERVAL -4.048 - -2.066
CONTAINS 0.380%

and the robot were the best of friends . <|endoftext|> Once
big river . He was so small and the river seemed
anything bad again . He found his happiness with the hug
. It had a picture of him inside . Tim thought
long time . She was so happy to have seen something
Token We
Token ID775
Feature activation+8.95e-01

Pos logit contributions
 seek+0.010
ney+0.008
umpy+0.007
 than+0.007
and+0.007
Neg logit contributions
 hearts-0.010
 walks-0.010
 stro-0.010
 spr-0.010
 hugs-0.010
Token have
Token ID423
Feature activation+8.92e-01

Pos logit contributions
 walks+0.059
 hearts+0.058
 tul+0.057
 strokes+0.057
 spr+0.057
Neg logit contributions
 seek-0.049
ney-0.040
umpy-0.038
 than-0.037
and-0.033
Token different
Token ID1180
Feature activation+7.60e-01

Pos logit contributions
 stro+0.058
ened+0.058
 tul+0.057
irs+0.057
 strokes+0.056
Neg logit contributions
 seek-0.046
 than-0.038
ney-0.035
umpy-0.033
and-0.032
Token likes
Token ID7832
Feature activation+1.35e+00

Pos logit contributions
 walks+0.040
 stro+0.039
 spr+0.039
 tul+0.038
 strokes+0.038
Neg logit contributions
 seek-0.051
ney-0.043
 than-0.042
umpy-0.041
and-0.038
Token and
Token ID290
Feature activation+3.44e-01

Pos logit contributions
 walks+0.076
 stro+0.076
 spr+0.076
 strokes+0.075
ened+0.075
Neg logit contributions
 seek-0.082
 than-0.068
ney-0.067
umpy-0.064
and-0.062
Token disl
Token ID19621
Feature activation+3.88e+00

Pos logit contributions
 walks+0.019
 tul+0.018
 spr+0.018
 stro+0.018
ened+0.018
Neg logit contributions
 seek-0.023
ney-0.018
 than-0.018
umpy-0.018
and-0.016
Tokenikes
Token ID7938
Feature activation+1.25e-01

Pos logit contributions
 walks+0.279
 hearts+0.274
 strokes+0.269
 bows+0.269
ened+0.268
Neg logit contributions
 seek-0.198
ney-0.151
umpy-0.137
Bob-0.117
 than-0.117
Token.
Token ID13
Feature activation-9.55e-02

Pos logit contributions
 walks+0.007
 hearts+0.007
 spr+0.007
 strokes+0.007
 hugs+0.007
Neg logit contributions
 seek-0.008
ney-0.006
umpy-0.006
 than-0.006
and-0.006
Token We
Token ID775
Feature activation+9.32e-01

Pos logit contributions
 seek+0.006
ney+0.005
umpy+0.004
 than+0.004
and+0.004
Neg logit contributions
 hearts-0.006
 walks-0.006
 stro-0.006
 hugs-0.006
 tul-0.006
Token can
Token ID460
Feature activation+1.34e-01

Pos logit contributions
 walks+0.065
 tul+0.064
 hearts+0.064
 strokes+0.063
ened+0.063
Neg logit contributions
 seek-0.047
ney-0.037
umpy-0.036
 than-0.035
and-0.029
Token be
Token ID307
Feature activation-8.40e-02

Pos logit contributions
ened+0.009
 walks+0.008
 stro+0.008
ience+0.008
 strokes+0.008
Neg logit contributions
 seek-0.008
ney-0.006
 than-0.006
umpy-0.006
?"-0.005
Token with
Token ID351
Feature activation+3.86e-01

Pos logit contributions
 seek+0.018
ney+0.014
umpy+0.014
and+0.012
 than+0.012
Neg logit contributions
 walks-0.025
 spr-0.025
 hearts-0.024
 hugs-0.024
 tul-0.024
Token lots
Token ID6041
Feature activation+7.55e-01

Pos logit contributions
ience+0.027
 stro+0.026
 walks+0.026
ened+0.025
ian+0.025
Neg logit contributions
 seek-0.020
ney-0.015
 than-0.014
 sunny-0.014
and-0.013
Token of
Token ID286
Feature activation-1.95e-01

Pos logit contributions
 walks+0.062
 spr+0.060
 stro+0.060
 tul+0.060
 hearts+0.060
Neg logit contributions
 seek-0.028
ney-0.022
umpy-0.021
 than-0.020
and-0.017
Token cob
Token ID22843
Feature activation+1.99e+00

Pos logit contributions
 seek+0.012
ney+0.010
umpy+0.009
 than+0.009
and+0.009
Neg logit contributions
 walks-0.011
 stro-0.011
 spr-0.011
ened-0.011
 hearts-0.011
Tokenwe
Token ID732
Feature activation+1.43e+00

Pos logit contributions
 walks+0.133
 spr+0.129
 tul+0.128
 hearts+0.128
 strokes+0.127
Neg logit contributions
 seek-0.109
ney-0.088
umpy-0.085
 than-0.079
and-0.074
Tokenbs
Token ID1443
Feature activation+3.66e+00

Pos logit contributions
 walks+0.054
 spr+0.053
 hearts+0.052
 tul+0.052
 strokes+0.051
Neg logit contributions
 seek-0.110
ney-0.104
umpy-0.100
 than-0.096
and-0.095
Token.
Token ID13
Feature activation+7.63e-02

Pos logit contributions
 walks+0.228
 hearts+0.221
 spr+0.220
 strokes+0.219
 tul+0.218
Neg logit contributions
 seek-0.212
ney-0.178
umpy-0.169
 than-0.162
and-0.152
Token She
Token ID1375
Feature activation+4.92e-01

Pos logit contributions
 walks+0.004
 hearts+0.004
 spr+0.004
 strokes+0.004
 tul+0.004
Neg logit contributions
 seek-0.005
ney-0.004
umpy-0.004
 than-0.004
and-0.004
Token liked
Token ID8288
Feature activation+2.59e-01

Pos logit contributions
 walks+0.033
 hearts+0.033
 strokes+0.032
ened+0.032
 tul+0.032
Neg logit contributions
 seek-0.026
ney-0.022
umpy-0.019
 than-0.019
and-0.017
Token to
Token ID284
Feature activation+3.58e-01

Pos logit contributions
 walks+0.017
 stro+0.016
ened+0.016
 strokes+0.016
 hearts+0.016
Neg logit contributions
 seek-0.014
ney-0.011
 than-0.011
umpy-0.010
and-0.010
Token spin
Token ID7906
Feature activation+2.57e-01

Pos logit contributions
ened+0.020
ience+0.020
 hearts+0.020
 walks+0.019
irs+0.019
Neg logit contributions
 seek-0.024
ney-0.019
 than-0.017
umpy-0.016
 sunny-0.015
Token much
Token ID881
Feature activation+4.88e-01

Pos logit contributions
 seek+0.029
ney+0.025
umpy+0.022
and+0.021
 than+0.021
Neg logit contributions
 walks-0.028
 hearts-0.027
 strokes-0.027
 hugs-0.026
ened-0.026
Token and
Token ID290
Feature activation+2.44e-01

Pos logit contributions
 walks+0.029
 hearts+0.028
 strokes+0.028
 spr+0.028
 hugs+0.028
Neg logit contributions
 seek-0.030
ney-0.026
umpy-0.024
 than-0.022
and-0.022
Token smiled
Token ID13541
Feature activation+9.77e-01

Pos logit contributions
 walks+0.015
 hearts+0.014
 hugs+0.014
ened+0.014
 strokes+0.014
Neg logit contributions
 seek-0.014
ney-0.012
umpy-0.012
 than-0.011
and-0.011
Token as
Token ID355
Feature activation+6.58e-01

Pos logit contributions
 walks+0.056
 hearts+0.055
 strokes+0.054
ened+0.054
 spr+0.054
Neg logit contributions
 seek-0.062
ney-0.053
umpy-0.050
and-0.046
 than-0.045
Token the
Token ID262
Feature activation+1.33e+00

Pos logit contributions
 walks+0.035
 hugs+0.034
 hearts+0.034
ened+0.034
 strokes+0.032
Neg logit contributions
 seek-0.045
ney-0.041
umpy-0.039
aking-0.036
and-0.036
Token bar
Token ID2318
Feature activation+3.55e+00

Pos logit contributions
 walks+0.064
 hearts+0.061
 hugs+0.060
 strokes+0.060
 er+0.059
Neg logit contributions
 seek-0.097
ney-0.086
umpy-0.081
 than-0.079
and-0.078
Tokenber
Token ID527
Feature activation+1.34e+00

Pos logit contributions
 walks+0.282
 stro+0.278
 hearts+0.277
ened+0.274
 spr+0.274
Neg logit contributions
 seek-0.152
ney-0.105
umpy-0.098
and-0.084
 than-0.082
Token sn
Token ID3013
Feature activation-3.82e-02

Pos logit contributions
 walks+0.072
 hearts+0.070
 spr+0.069
 tul+0.069
 strokes+0.069
Neg logit contributions
 seek-0.088
ney-0.076
umpy-0.072
 than-0.070
and-0.066
Tokenipped
Token ID3949
Feature activation+7.92e-01

Pos logit contributions
 seek+0.003
ney+0.002
umpy+0.002
 than+0.002
and+0.002
Neg logit contributions
 walks-0.002
 spr-0.002
 hearts-0.002
 tul-0.002
 strokes-0.002
Token away
Token ID1497
Feature activation+2.00e-01

Pos logit contributions
 walks+0.049
 hearts+0.047
 strokes+0.047
 spr+0.047
ened+0.046
Neg logit contributions
 seek-0.047
ney-0.039
umpy-0.038
 than-0.036
and-0.034
Token.
Token ID13
Feature activation-3.26e-02

Pos logit contributions
 walks+0.012
 hearts+0.012
 strokes+0.011
ened+0.011
 hugs+0.011
Neg logit contributions
 seek-0.012
ney-0.011
umpy-0.010
 than-0.009
and-0.009
Token and
Token ID290
Feature activation+1.62e-02

Pos logit contributions
 seek+0.007
ney+0.006
 than+0.006
umpy+0.005
and+0.005
Neg logit contributions
 tul-0.012
 walks-0.012
 spr-0.012
 stro-0.012
 hearts-0.012
Token watched
Token ID7342
Feature activation+1.37e+00

Pos logit contributions
 walks+0.001
 hearts+0.001
 spr+0.001
 strokes+0.001
 tul+0.001
Neg logit contributions
 seek-0.001
ney-0.001
umpy-0.001
 than-0.001
and-0.001
Token as
Token ID355
Feature activation+8.32e-01

Pos logit contributions
ience+0.085
 walks+0.084
 tul+0.084
 spr+0.083
 stro+0.083
Neg logit contributions
 seek-0.073
 than-0.058
ney-0.053
and-0.052
 but-0.050
Token the
Token ID262
Feature activation+1.59e+00

Pos logit contributions
 walks+0.047
 hearts+0.045
 spr+0.045
 strokes+0.045
 tul+0.044
Neg logit contributions
 seek-0.054
ney-0.046
umpy-0.043
 than-0.042
and-0.040
Token cater
Token ID20825
Feature activation+3.02e+00

Pos logit contributions
 stro+0.078
ened+0.077
irs+0.077
 walks+0.076
 strokes+0.074
Neg logit contributions
 seek-0.118
umpy-0.089
ney-0.089
 than-0.088
 sunny-0.084
Tokenpill
Token ID27215
Feature activation+3.55e+00

Pos logit contributions
 hearts+0.176
ened+0.163
 horror+0.163
 imag+0.161
 tul+0.159
Neg logit contributions
 seek-0.193
umpy-0.165
 than-0.154
ney-0.148
aking-0.143
Tokenars
Token ID945
Feature activation+1.71e+00

Pos logit contributions
 walks+0.280
 hearts+0.274
 spr+0.273
 tul+0.272
 strokes+0.272
Neg logit contributions
 seek-0.147
ney-0.114
umpy-0.106
 than-0.099
and-0.090
Token grew
Token ID6348
Feature activation+8.43e-01

Pos logit contributions
 walks+0.087
 hearts+0.085
 tul+0.085
 hugs+0.083
 er+0.082
Neg logit contributions
 seek-0.117
ney-0.097
 than-0.094
umpy-0.093
and-0.087
Token bigger
Token ID5749
Feature activation+7.71e-01

Pos logit contributions
 walks+0.052
 hearts+0.050
 spr+0.050
 stro+0.050
 strokes+0.050
Neg logit contributions
 seek-0.049
ney-0.041
umpy-0.039
 than-0.038
and-0.035
Token and
Token ID290
Feature activation+8.69e-01

Pos logit contributions
 walks+0.050
 hearts+0.048
 spr+0.048
 strokes+0.048
 tul+0.047
Neg logit contributions
 seek-0.044
ney-0.037
umpy-0.035
 than-0.032
and-0.031
Token bigger
Token ID5749
Feature activation+1.07e+00

Pos logit contributions
 walks+0.047
 spr+0.045
 hearts+0.045
 tul+0.045
 stro+0.044
Neg logit contributions
 seek-0.057
ney-0.051
umpy-0.049
 than-0.046
and-0.045
Token and
Token ID290
Feature activation+2.16e-01

Pos logit contributions
irs+0.059
 tul+0.057
ience+0.057
 walks+0.057
 hearts+0.057
Neg logit contributions
 seek-0.031
 than-0.026
ney-0.023
and-0.021
umpy-0.019
Token fl
Token ID781
Feature activation+1.76e+00

Pos logit contributions
 walks+0.010
 hearts+0.010
irs+0.010
 tul+0.010
 er+0.010
Neg logit contributions
 seek-0.016
ney-0.013
 than-0.012
umpy-0.011
 active-0.011
Tokenapped
Token ID6320
Feature activation+3.70e-01

Pos logit contributions
 walks+0.092
 tul+0.087
 hearts+0.085
 spr+0.084
 er+0.084
Neg logit contributions
 seek-0.121
ney-0.109
umpy-0.104
 than-0.099
iring-0.097
Token them
Token ID606
Feature activation+5.59e-01

Pos logit contributions
 walks+0.014
ience+0.013
 spr+0.013
 tul+0.013
 er+0.013
Neg logit contributions
 seek-0.031
 than-0.027
ney-0.025
umpy-0.025
and-0.025
Token as
Token ID355
Feature activation+1.08e+00

Pos logit contributions
irs+0.044
 tul+0.043
 er+0.042
 hearts+0.041
 spr+0.041
Neg logit contributions
 seek-0.024
 than-0.018
ney-0.017
umpy-0.015
and-0.015
Token hard
Token ID1327
Feature activation+3.54e+00

Pos logit contributions
 walks+0.058
 spr+0.056
 hearts+0.055
 stro+0.055
 tul+0.055
Neg logit contributions
 seek-0.073
ney-0.065
umpy-0.062
 than-0.057
and-0.056
Token as
Token ID355
Feature activation+1.38e+00

Pos logit contributions
irs+0.274
 cockpit+0.270
 paints+0.263
 tul+0.262
 cleans+0.262
Neg logit contributions
 than-0.147
 seek-0.125
 but-0.104
 except-0.091
 with-0.090
Token she
Token ID673
Feature activation+1.73e+00

Pos logit contributions
 walks+0.073
 hearts+0.070
 spr+0.069
 tul+0.069
 strokes+0.069
Neg logit contributions
 seek-0.094
ney-0.082
umpy-0.079
 than-0.075
and-0.072
Token could
Token ID714
Feature activation+6.29e-01

Pos logit contributions
irs+0.138
 hearts+0.133
 walks+0.133
 tul+0.131
 er+0.130
Neg logit contributions
 seek-0.072
ney-0.054
 than-0.047
umpy-0.044
and-0.039
Token.
Token ID13
Feature activation-6.43e-02

Pos logit contributions
 walks+0.039
 hearts+0.039
 tul+0.038
ened+0.037
 hugs+0.037
Neg logit contributions
 seek-0.037
ney-0.030
umpy-0.029
 than-0.028
and-0.026
Token The
Token ID383
Feature activation+3.03e-01

Pos logit contributions
 seek+0.004
ney+0.003
umpy+0.003
 than+0.003
and+0.003
Neg logit contributions
 walks-0.004
 spr-0.004
 tul-0.004
 hearts-0.004
ened-0.004
Token almost
Token ID2048
Feature activation-1.60e+00

Pos logit contributions
 walks+0.044
 spr+0.042
 strokes+0.041
 stro+0.041
 hearts+0.041
Neg logit contributions
 seek-0.043
ney-0.037
umpy-0.035
 than-0.032
and-0.032
Token as
Token ID355
Feature activation+2.85e-01

Pos logit contributions
 seek+0.094
ney+0.080
umpy+0.076
 than+0.072
and+0.068
Neg logit contributions
 walks-0.099
 hearts-0.095
 spr-0.095
 strokes-0.095
 tul-0.094
Token high
Token ID1029
Feature activation+2.35e+00

Pos logit contributions
 fl+0.015
 spr+0.015
 lov+0.014
 scrib+0.013
ened+0.013
Neg logit contributions
 seek-0.016
ney-0.016
asc-0.016
umpy-0.015
aghetti-0.014
Token as
Token ID355
Feature activation+7.69e-01

Pos logit contributions
 tul+0.127
 walks+0.127
 hearts+0.126
 hugs+0.126
 stro+0.125
Neg logit contributions
 seek-0.150
 than-0.127
ney-0.120
umpy-0.117
iring-0.107
Token the
Token ID262
Feature activation+1.56e+00

Pos logit contributions
 walks+0.050
 spr+0.047
 strokes+0.046
 hearts+0.046
ened+0.046
Neg logit contributions
 seek-0.041
ney-0.039
umpy-0.038
asc-0.033
aking-0.032
Token laser
Token ID12855
Feature activation+3.53e+00

Pos logit contributions
 stro+0.085
 walks+0.081
irs+0.077
 tul+0.076
 strokes+0.076
Neg logit contributions
 seek-0.110
ney-0.092
 than-0.087
umpy-0.085
 sunny-0.079
Token.
Token ID13
Feature activation+2.45e-01

Pos logit contributions
 tul+0.178
 hearts+0.177
 walks+0.174
 hugs+0.174
irs+0.170
Neg logit contributions
 seek-0.246
ney-0.209
 than-0.200
umpy-0.193
and-0.178
Token But
Token ID887
Feature activation+5.00e-01

Pos logit contributions
 walks+0.016
 hearts+0.015
 strokes+0.015
 hugs+0.014
ened+0.014
Neg logit contributions
 seek-0.015
ney-0.012
umpy-0.012
 than-0.011
and-0.010
Token as
Token ID355
Feature activation-4.87e-01

Pos logit contributions
 walks+0.027
ened+0.026
 hearts+0.026
 er+0.025
 stro+0.025
Neg logit contributions
 seek-0.033
ney-0.030
umpy-0.030
and-0.027
 than-0.027
Token soon
Token ID2582
Feature activation-4.83e-01

Pos logit contributions
umpy+0.033
ney+0.033
asc+0.031
aking+0.030
and+0.029
Neg logit contributions
 walks-0.025
 practice-0.022
 hearts-0.022
ened-0.022
 hugs-0.022
Token as
Token ID355
Feature activation+2.52e-01

Pos logit contributions
 seek+0.029
 than+0.023
umpy+0.022
ney+0.021
Bob+0.020
Neg logit contributions
 tul-0.029
 hearts-0.028
 spr-0.028
 strokes-0.028
 stro-0.028
Token and
Token ID290
Feature activation+4.40e-01

Pos logit contributions
 walks+0.046
 hearts+0.045
ened+0.044
 strokes+0.044
 spr+0.044
Neg logit contributions
 seek-0.051
ney-0.045
umpy-0.041
and-0.037
Bob-0.037
Token being
Token ID852
Feature activation+9.48e-01

Pos logit contributions
 walks+0.025
 tul+0.024
 spr+0.024
 hearts+0.024
 stro+0.024
Neg logit contributions
 seek-0.028
ney-0.024
umpy-0.023
 than-0.023
and-0.021
Token kind
Token ID1611
Feature activation+1.39e+00

Pos logit contributions
 walks+0.061
 hearts+0.059
 hugs+0.059
 strokes+0.058
 spr+0.058
Neg logit contributions
 seek-0.054
ney-0.046
umpy-0.041
and-0.038
 than-0.038
Token is
Token ID318
Feature activation+1.31e+00

Pos logit contributions
 hearts+0.075
 walks+0.074
 hugs+0.073
 strokes+0.072
ened+0.072
Neg logit contributions
 seek-0.096
ney-0.078
umpy-0.072
Bob-0.069
aghetti-0.068
Token what
Token ID644
Feature activation+1.25e+00

Pos logit contributions
 walks+0.092
 tul+0.089
 spr+0.089
 hearts+0.089
 strokes+0.089
Neg logit contributions
 seek-0.066
ney-0.053
umpy-0.050
 than-0.050
and-0.044
Token matters
Token ID6067
Feature activation+3.50e+00

Pos logit contributions
 walks+0.075
 tul+0.073
 spr+0.073
 stro+0.073
 hearts+0.072
Neg logit contributions
 seek-0.074
ney-0.062
umpy-0.059
 than-0.058
and-0.054
Token.
Token ID13
Feature activation+3.49e-02

Pos logit contributions
 hearts+0.183
 walks+0.183
 spr+0.178
 strokes+0.177
 hugs+0.177
Neg logit contributions
 seek-0.238
ney-0.198
umpy-0.190
Bob-0.178
and-0.178
Token<|endoftext|>
Token ID50256
Feature activation+9.92e-01

Pos logit contributions
 walks+0.002
 hearts+0.002
 stro+0.002
 strokes+0.002
 spr+0.002
Neg logit contributions
 seek-0.002
ney-0.002
umpy-0.002
 than-0.001
and-0.001
TokenOne
Token ID3198
Feature activation+1.55e-01

Pos logit contributions
 walks+0.048
 hugs+0.041
 hearts+0.039
 strokes+0.038
 spr+0.037
Neg logit contributions
 seek-0.077
rated-0.068
ney-0.065
 than-0.065
umpy-0.063
Token day
Token ID1110
Feature activation+3.32e-01

Pos logit contributions
 walks+0.010
 hearts+0.009
 spr+0.009
 er+0.009
 hugs+0.009
Neg logit contributions
 seek-0.009
ney-0.007
iring-0.007
umpy-0.007
and-0.007
Token,
Token ID11
Feature activation+1.12e-01

Pos logit contributions
 walks+0.020
 hearts+0.020
 spr+0.019
 strokes+0.019
 tul+0.019
Neg logit contributions
 seek-0.020
ney-0.016
umpy-0.016
 than-0.015
and-0.014
Token.
Token ID13
Feature activation-2.09e-01

Pos logit contributions
 seek+0.076
umpy+0.068
Bob+0.065
ney+0.065
aghetti+0.064
Neg logit contributions
 walks-0.042
 spr-0.041
 hearts-0.040
 strokes-0.040
 hugs-0.040
Token
Token ID198
Feature activation-4.36e-01

Pos logit contributions
 seek+0.013
ney+0.011
umpy+0.011
 than+0.010
iring+0.009
Neg logit contributions
 walks-0.012
 strokes-0.012
 stro-0.012
 hugs-0.012
 hearts-0.012
Token
Token ID198
Feature activation-1.32e-01

Pos logit contributions
 seek+0.033
 than+0.028
rated+0.028
ney+0.028
kins+0.027
Neg logit contributions
 walks-0.020
 hugs-0.017
irs-0.016
 strokes-0.016
 stro-0.016
TokenAs
Token ID1722
Feature activation+8.78e-01

Pos logit contributions
 seek+0.007
ney+0.007
kins+0.007
 than+0.007
kin+0.007
Neg logit contributions
 walks-0.007
 stro-0.006
 spr-0.006
 strokes-0.006
 bang-0.006
Token the
Token ID262
Feature activation+1.07e+00

Pos logit contributions
 walks+0.051
 hearts+0.049
 spr+0.047
 stro+0.046
 tul+0.044
Neg logit contributions
ney-0.053
 seek-0.051
umpy-0.051
and-0.046
rated-0.045
Token hedge
Token ID19859
Feature activation+3.48e+00

Pos logit contributions
 walks+0.061
 hearts+0.059
 spr+0.059
 tul+0.058
 strokes+0.058
Neg logit contributions
 seek-0.069
ney-0.059
umpy-0.056
 than-0.055
and-0.052
Tokenhog
Token ID31897
Feature activation+1.48e+00

Pos logit contributions
 walks+0.161
 tul+0.156
 hearts+0.155
 strokes+0.154
 stro+0.152
Neg logit contributions
 seek-0.258
ney-0.226
umpy-0.215
 than-0.207
and-0.195
Token ran
Token ID4966
Feature activation+4.95e-01

Pos logit contributions
 tul+0.079
irs+0.079
 hearts+0.076
 strokes+0.075
 er+0.073
Neg logit contributions
 seek-0.099
ney-0.083
umpy-0.076
 than-0.074
Bob-0.071
Token away
Token ID1497
Feature activation-2.63e-01

Pos logit contributions
 walks+0.030
 hearts+0.029
 spr+0.029
 strokes+0.028
 hugs+0.028
Neg logit contributions
 seek-0.030
ney-0.026
umpy-0.024
 than-0.023
and-0.022
Token,
Token ID11
Feature activation+2.53e-01

Pos logit contributions
 seek+0.019
ney+0.017
umpy+0.017
aghetti+0.016
and+0.015
Neg logit contributions
 walks-0.012
 spr-0.012
 strokes-0.012
 stro-0.012
ened-0.012
Token the
Token ID262
Feature activation-4.74e-01

Pos logit contributions
 walks+0.017
 hearts+0.016
 spr+0.016
 stro+0.016
 strokes+0.016
Neg logit contributions
 seek-0.014
ney-0.013
umpy-0.012
 than-0.011
and-0.011
Token almost
Token ID2048
Feature activation-1.65e+00

Pos logit contributions
 walks+0.027
 hearts+0.026
 strokes+0.026
 tul+0.025
 stro+0.025
Neg logit contributions
 seek-0.020
ney-0.016
umpy-0.015
 than-0.015
and-0.013
Token as
Token ID355
Feature activation+5.01e-01

Pos logit contributions
 seek+0.096
ney+0.077
 than+0.075
umpy+0.074
and+0.066
Neg logit contributions
 walks-0.103
 strokes-0.099
 hearts-0.099
 stro-0.099
 tul-0.099
Token tall
Token ID7331
Feature activation+2.12e+00

Pos logit contributions
 spr+0.026
 tul+0.025
 fl+0.025
 hearts+0.024
 walks+0.023
Neg logit contributions
ney-0.032
 seek-0.032
umpy-0.030
neys-0.026
asc-0.026
Token as
Token ID355
Feature activation+1.13e+00

Pos logit contributions
irs+0.140
 cockpit+0.133
 er+0.129
 tul+0.128
ian+0.128
Neg logit contributions
 seek-0.110
 than-0.104
 but-0.076
 except-0.075
umpy-0.072
Token the
Token ID262
Feature activation+1.77e+00

Pos logit contributions
 walks+0.055
 spr+0.055
 hearts+0.054
 tul+0.053
 strokes+0.052
Neg logit contributions
 seek-0.079
ney-0.074
umpy-0.071
and-0.063
aking-0.062
Token iron
Token ID6953
Feature activation+3.44e+00

Pos logit contributions
 walks+0.103
 stro+0.100
irs+0.098
ened+0.098
 strokes+0.097
Neg logit contributions
 seek-0.112
ney-0.090
 than-0.087
umpy-0.086
and-0.076
Token.
Token ID13
Feature activation+2.10e-01

Pos logit contributions
 walks+0.193
 hearts+0.186
 spr+0.183
 tul+0.183
 strokes+0.183
Neg logit contributions
 seek-0.221
ney-0.190
umpy-0.180
 than-0.176
and-0.164
Token
Token ID220
Feature activation+6.35e-01

Pos logit contributions
 walks+0.011
 hearts+0.010
 spr+0.010
 strokes+0.010
 stro+0.010
Neg logit contributions
 seek-0.015
ney-0.013
umpy-0.012
 than-0.012
and-0.011
Token
Token ID198
Feature activation-2.04e-01

Pos logit contributions
 walks+0.037
 strokes+0.036
 hugs+0.035
irs+0.035
 stro+0.035
Neg logit contributions
 seek-0.041
ney-0.032
 than-0.031
umpy-0.031
rated-0.029
Token
Token ID198
Feature activation-2.09e-02

Pos logit contributions
 seek+0.013
ney+0.011
 than+0.011
rated+0.011
umpy+0.011
Neg logit contributions
 walks-0.011
 stro-0.010
 strokes-0.010
irs-0.010
 hugs-0.010
TokenThe
Token ID464
Feature activation-2.71e-01

Pos logit contributions
 seek+0.001
ney+0.001
 than+0.001
umpy+0.001
neys+0.001
Neg logit contributions
 walks-0.001
 stro-0.001
 spr-0.001
 tul-0.001
 strokes-0.001
Token were
Token ID547
Feature activation+1.09e+00

Pos logit contributions
 seek+0.013
ney+0.011
umpy+0.010
 than+0.010
and+0.009
Neg logit contributions
 walks-0.013
 tul-0.013
 hearts-0.013
 spr-0.013
 strokes-0.013
Token as
Token ID355
Feature activation+9.03e-01

Pos logit contributions
 walks+0.066
 hearts+0.064
 spr+0.064
 tul+0.064
 strokes+0.064
Neg logit contributions
 seek-0.065
ney-0.054
umpy-0.052
 than-0.050
and-0.047
Token cool
Token ID3608
Feature activation+1.44e+00

Pos logit contributions
 walks+0.050
 hearts+0.049
 spr+0.048
 bows+0.048
 er+0.047
Neg logit contributions
 seek-0.055
ney-0.053
umpy-0.048
 than-0.044
asis-0.043
Token as
Token ID355
Feature activation+1.36e+00

Pos logit contributions
 tul+0.087
 spr+0.085
 walks+0.084
 stro+0.084
 strokes+0.082
Neg logit contributions
 seek-0.082
 than-0.071
ney-0.068
umpy-0.067
and-0.061
Token a
Token ID257
Feature activation+2.33e+00

Pos logit contributions
 walks+0.090
 hearts+0.088
 bows+0.085
 hugs+0.085
 spr+0.085
Neg logit contributions
 seek-0.074
ney-0.067
umpy-0.064
 than-0.055
and-0.053
Token robot
Token ID9379
Feature activation+3.44e+00

Pos logit contributions
 stro+0.135
 walks+0.135
irs+0.132
 spr+0.131
 tul+0.129
Neg logit contributions
 seek-0.149
ney-0.118
umpy-0.115
 than-0.113
 boy-0.100
Token.
Token ID13
Feature activation+2.47e-01

Pos logit contributions
 tul+0.222
 hearts+0.219
irs+0.216
 spr+0.215
 strokes+0.215
Neg logit contributions
 seek-0.190
ney-0.154
 than-0.151
umpy-0.141
and-0.128
Token Finally
Token ID9461
Feature activation-2.92e-02

Pos logit contributions
 walks+0.016
 hearts+0.016
 spr+0.016
 strokes+0.016
ened+0.016
Neg logit contributions
 seek-0.014
ney-0.011
umpy-0.011
 than-0.010
and-0.009
Token,
Token ID11
Feature activation+4.34e-01

Pos logit contributions
 seek+0.002
ney+0.001
umpy+0.001
 than+0.001
and+0.001
Neg logit contributions
 walks-0.002
 hearts-0.002
 spr-0.002
 tul-0.002
 strokes-0.002
Token he
Token ID339
Feature activation-2.43e-01

Pos logit contributions
 walks+0.024
 spr+0.023
 hearts+0.023
ened+0.023
 stro+0.023
Neg logit contributions
umpy-0.027
 seek-0.027
ney-0.026
and-0.024
 than-0.022
Token found
Token ID1043
Feature activation-1.61e+00

Pos logit contributions
 seek+0.018
ney+0.016
umpy+0.015
 than+0.015
and+0.015
Neg logit contributions
 walks-0.011
 stro-0.011
 spr-0.011
 er-0.011
 hugs-0.010
Token be
Token ID307
Feature activation-3.48e-02

Pos logit contributions
 walks+0.009
 hearts+0.008
 spr+0.008
 strokes+0.008
 stro+0.008
Neg logit contributions
 seek-0.010
ney-0.009
umpy-0.008
 than-0.008
and-0.008
Token as
Token ID355
Feature activation+3.41e-01

Pos logit contributions
 seek+0.002
ney+0.002
umpy+0.002
and+0.002
rated+0.002
Neg logit contributions
 walks-0.002
 spr-0.002
 bows-0.002
 stro-0.002
 tul-0.002
Token tall
Token ID7331
Feature activation+1.90e+00

Pos logit contributions
 spr+0.017
 tul+0.016
 fl+0.015
 er+0.014
 walks+0.014
Neg logit contributions
ney-0.023
 seek-0.023
umpy-0.021
rated-0.020
 rink-0.020
Token as
Token ID355
Feature activation+1.06e+00

Pos logit contributions
irs+0.113
ian+0.107
 strokes+0.106
ience+0.106
 tul+0.106
Neg logit contributions
 seek-0.112
 than-0.099
umpy-0.082
ney-0.078
 but-0.076
Token the
Token ID262
Feature activation+1.76e+00

Pos logit contributions
 spr+0.049
 walks+0.047
 tul+0.047
 hearts+0.046
 bows+0.044
Neg logit contributions
 seek-0.078
ney-0.075
umpy-0.074
aking-0.065
and-0.065
Token tree
Token ID5509
Feature activation+3.43e+00

Pos logit contributions
 walks+0.101
 hearts+0.097
 strokes+0.097
 stro+0.096
 spr+0.096
Neg logit contributions
 seek-0.112
ney-0.095
umpy-0.091
 than-0.088
and-0.082
Token.
Token ID13
Feature activation-9.22e-02

Pos logit contributions
 walks+0.200
 spr+0.196
 stro+0.190
 strokes+0.189
 hearts+0.189
Neg logit contributions
 seek-0.214
ney-0.181
umpy-0.177
 than-0.164
and-0.161
Token The
Token ID383
Feature activation+2.61e-01

Pos logit contributions
 seek+0.006
ney+0.005
umpy+0.005
 than+0.005
and+0.005
Neg logit contributions
 walks-0.005
 hearts-0.005
 spr-0.005
 strokes-0.005
 stro-0.005
Token c
Token ID269
Feature activation+2.88e-01

Pos logit contributions
 walks+0.014
ened+0.013
 strokes+0.013
 hearts+0.013
 stro+0.013
Neg logit contributions
 seek-0.018
ney-0.015
umpy-0.015
 than-0.014
and-0.014
Tokenactus
Token ID34144
Feature activation+6.13e-01

Pos logit contributions
 walks+0.018
ened+0.017
 hearts+0.017
 stro+0.017
 spr+0.017
Neg logit contributions
 seek-0.016
ney-0.015
umpy-0.014
 than-0.013
iring-0.013
Token tried
Token ID3088
Feature activation+3.94e-01

Pos logit contributions
 spr+0.032
 walks+0.031
 stro+0.030
 strokes+0.029
 hugs+0.029
Neg logit contributions
 seek-0.040
umpy-0.036
 than-0.034
ney-0.034
and-0.034
Token some
Token ID617
Feature activation+1.49e-01

Pos logit contributions
 seek+0.012
ney+0.012
umpy+0.011
rated+0.010
iring+0.010
Neg logit contributions
 walks-0.008
 hearts-0.007
 bows-0.007
 hugs-0.007
 strokes-0.007
Token with
Token ID351
Feature activation+4.50e-01

Pos logit contributions
 walks+0.010
 hugs+0.010
 hearts+0.009
 bows+0.009
 strokes+0.009
Neg logit contributions
 seek-0.008
ney-0.007
umpy-0.007
rated-0.006
asc-0.006
Token its
Token ID663
Feature activation+1.78e+00

Pos logit contributions
 walks+0.026
 spr+0.025
ened+0.025
 tul+0.025
 strokes+0.025
Neg logit contributions
 seek-0.027
ney-0.023
 than-0.022
umpy-0.021
and-0.020
Token friends
Token ID2460
Feature activation-1.96e-01

Pos logit contributions
 walks+0.097
 hearts+0.095
 hugs+0.092
 bows+0.091
 strokes+0.090
Neg logit contributions
 seek-0.111
umpy-0.102
ney-0.101
and-0.094
aghetti-0.093
Token as
Token ID355
Feature activation+1.12e+00

Pos logit contributions
 seek+0.012
ney+0.010
umpy+0.010
 than+0.009
iring+0.009
Neg logit contributions
 walks-0.012
 stro-0.011
 hearts-0.011
 hugs-0.011
 strokes-0.011
Token well
Token ID880
Feature activation+3.40e+00

Pos logit contributions
 walks+0.063
 hearts+0.061
 spr+0.060
 strokes+0.060
 stro+0.059
Neg logit contributions
 seek-0.072
ney-0.062
umpy-0.059
 than-0.056
and-0.054
Token.
Token ID13
Feature activation+4.12e-01

Pos logit contributions
 walks+0.223
 hearts+0.218
 tul+0.218
 spr+0.217
 strokes+0.216
Neg logit contributions
 seek-0.185
ney-0.148
umpy-0.143
 than-0.142
and-0.127
Token So
Token ID1406
Feature activation-2.00e-02

Pos logit contributions
 tul+0.029
 walks+0.029
 spr+0.028
ened+0.028
 er+0.028
Neg logit contributions
 seek-0.019
ney-0.017
umpy-0.015
 than-0.015
and-0.014
Token,
Token ID11
Feature activation+2.35e-01

Pos logit contributions
 seek+0.001
ney+0.001
umpy+0.001
 than+0.001
and+0.001
Neg logit contributions
 walks-0.001
 hearts-0.001
 spr-0.001
 strokes-0.001
 tul-0.001
Token it
Token ID340
Feature activation+2.78e-01

Pos logit contributions
 walks+0.013
 hearts+0.012
 spr+0.012
 stro+0.012
 strokes+0.012
Neg logit contributions
 seek-0.016
ney-0.014
umpy-0.013
 than-0.012
and-0.012
Token z
Token ID1976
Feature activation-3.20e-01

Pos logit contributions
irs+0.015
 hearts+0.015
 walks+0.015
 strokes+0.015
 tul+0.015
Neg logit contributions
 seek-0.018
ney-0.015
umpy-0.014
 than-0.014
aking-0.013
Token time
Token ID640
Feature activation-2.50e-01

Pos logit contributions
 walks+0.077
ened+0.077
 bl+0.075
 strokes+0.075
 hugs+0.075
Neg logit contributions
 seek-0.077
ney-0.071
umpy-0.065
 than-0.057
Bob-0.056
Token into
Token ID656
Feature activation+6.61e-01

Pos logit contributions
 seek+0.016
ney+0.014
umpy+0.013
neys+0.012
iring+0.012
Neg logit contributions
 walks-0.014
ened-0.013
 hugs-0.013
 hearts-0.013
 strokes-0.013
Token exercise
Token ID5517
Feature activation+4.90e-01

Pos logit contributions
 stro+0.041
 walks+0.041
 tul+0.040
 spr+0.040
ened+0.040
Neg logit contributions
 seek-0.038
ney-0.029
 than-0.028
umpy-0.027
and-0.026
Token time
Token ID640
Feature activation-1.46e-02

Pos logit contributions
 walks+0.028
 spr+0.027
 hearts+0.027
 strokes+0.026
 stro+0.026
Neg logit contributions
 seek-0.031
ney-0.027
umpy-0.026
 than-0.024
iring-0.023
Token as
Token ID355
Feature activation+1.14e+00

Pos logit contributions
 seek+0.001
ney+0.001
umpy+0.001
iring+0.001
neys+0.001
Neg logit contributions
 walks-0.001
 hearts-0.001
 strokes-0.001
ened-0.001
 spr-0.001
Token well
Token ID880
Feature activation+3.38e+00

Pos logit contributions
 walks+0.073
 hearts+0.070
 hugs+0.068
 spr+0.068
 tul+0.068
Neg logit contributions
 seek-0.062
ney-0.058
umpy-0.056
 than-0.048
rated-0.047
Token.
Token ID13
Feature activation+2.93e-01

Pos logit contributions
 walks+0.210
 hearts+0.204
 spr+0.203
 strokes+0.202
 tul+0.202
Neg logit contributions
 seek-0.196
ney-0.163
umpy-0.155
 than-0.150
and-0.140
Token Joey
Token ID26154
Feature activation-6.19e-01

Pos logit contributions
 walks+0.017
 spr+0.017
 strokes+0.016
 hearts+0.016
 stro+0.016
Neg logit contributions
 seek-0.019
ney-0.015
umpy-0.015
 than-0.015
kin-0.013
Token and
Token ID290
Feature activation+2.32e-01

Pos logit contributions
 seek+0.040
ney+0.034
umpy+0.032
 than+0.031
and+0.030
Neg logit contributions
 walks-0.035
 stro-0.034
 spr-0.033
 strokes-0.033
 hearts-0.033
Token his
Token ID465
Feature activation+7.53e-01

Pos logit contributions
 walks+0.016
 hearts+0.015
 spr+0.015
 strokes+0.015
 tul+0.015
Neg logit contributions
 seek-0.013
ney-0.010
umpy-0.010
 than-0.009
and-0.009
Token mom
Token ID1995
Feature activation-3.60e-02

Pos logit contributions
 walks+0.040
 tul+0.039
 spr+0.039
 stro+0.039
ened+0.038
Neg logit contributions
 seek-0.050
ney-0.042
umpy-0.041
 than-0.040
and-0.038
Token different
Token ID1180
Feature activation+1.70e-01

Pos logit contributions
 seek+0.008
ney+0.007
umpy+0.007
 than+0.007
and+0.006
Neg logit contributions
 walks-0.009
 stro-0.008
 spr-0.008
 tul-0.008
 hearts-0.008
Token directions
Token ID11678
Feature activation-2.12e-01

Pos logit contributions
 stro+0.009
 walks+0.009
ience+0.009
irs+0.009
 spr+0.009
Neg logit contributions
 seek-0.011
 than-0.010
ney-0.010
umpy-0.009
and-0.008
Token with
Token ID351
Feature activation+4.77e-01

Pos logit contributions
 seek+0.013
ney+0.011
umpy+0.011
 than+0.010
and+0.010
Neg logit contributions
 walks-0.012
 hearts-0.012
 strokes-0.012
ened-0.012
 spr-0.012
Token both
Token ID1111
Feature activation-2.24e-01

Pos logit contributions
 tul+0.031
 spr+0.031
 walks+0.030
 stro+0.030
ience+0.030
Neg logit contributions
 seek-0.025
 than-0.020
ney-0.018
umpy-0.017
and-0.017
Token com
Token ID401
Feature activation+7.06e-01

Pos logit contributions
 seek+0.013
ney+0.011
 than+0.011
umpy+0.011
and+0.010
Neg logit contributions
 walks-0.013
 stro-0.013
 spr-0.013
 tul-0.013
 hearts-0.012
Tokenbs
Token ID1443
Feature activation+3.37e+00

Pos logit contributions
 spr+0.043
 tul+0.042
 walks+0.042
 horror+0.041
ishly+0.040
Neg logit contributions
 than-0.041
umpy-0.037
 seek-0.035
ney-0.034
and-0.033
Token.
Token ID13
Feature activation+6.73e-02

Pos logit contributions
 walks+0.216
 tul+0.209
 hearts+0.209
 spr+0.209
 strokes+0.206
Neg logit contributions
 seek-0.187
ney-0.155
umpy-0.148
 than-0.147
and-0.134
Token Jack
Token ID3619
Feature activation-7.64e-01

Pos logit contributions
 walks+0.004
 strokes+0.004
 hearts+0.004
 hugs+0.004
ened+0.004
Neg logit contributions
 seek-0.004
ney-0.004
 than-0.003
umpy-0.003
rated-0.003
Token felt
Token ID2936
Feature activation-7.24e-02

Pos logit contributions
 seek+0.043
ney+0.036
 than+0.032
umpy+0.031
and+0.029
Neg logit contributions
irs-0.049
 walks-0.048
 hearts-0.048
 tul-0.047
 strokes-0.046
Token so
Token ID523
Feature activation-1.53e+00

Pos logit contributions
 seek+0.003
umpy+0.002
 than+0.002
ney+0.002
and+0.002
Neg logit contributions
 walks-0.006
 hearts-0.006
 tul-0.006
 spr-0.006
 strokes-0.005
Token cool
Token ID3608
Feature activation-8.29e-02

Pos logit contributions
 seek+0.077
ney+0.062
umpy+0.059
 than+0.056
and+0.052
Neg logit contributions
 walks-0.109
 hearts-0.105
 spr-0.105
 tul-0.105
 strokes-0.104
Token and
Token ID290
Feature activation+3.15e-01

Pos logit contributions
 walks+0.048
 spr+0.046
 hearts+0.046
 strokes+0.046
 tul+0.045
Neg logit contributions
 seek-0.041
ney-0.036
umpy-0.034
 than-0.030
and-0.030
Token down
Token ID866
Feature activation+8.78e-01

Pos logit contributions
 walks+0.021
 hearts+0.021
 spr+0.021
 strokes+0.021
 stro+0.020
Neg logit contributions
 seek-0.016
ney-0.014
umpy-0.013
 than-0.012
and-0.012
Token and
Token ID290
Feature activation+5.44e-01

Pos logit contributions
 walks+0.055
 hearts+0.053
 spr+0.053
 strokes+0.053
 tul+0.053
Neg logit contributions
 seek-0.051
ney-0.042
umpy-0.040
 than-0.039
and-0.036
Token waved
Token ID26834
Feature activation+9.39e-01

Pos logit contributions
 walks+0.032
 hearts+0.031
 spr+0.031
 tul+0.031
 strokes+0.031
Neg logit contributions
 seek-0.033
ney-0.028
umpy-0.026
 than-0.026
and-0.024
Token as
Token ID355
Feature activation+9.07e-01

Pos logit contributions
 walks+0.046
 tul+0.045
 spr+0.044
 er+0.044
irs+0.044
Neg logit contributions
 seek-0.066
 than-0.056
umpy-0.053
ney-0.053
and-0.051
Token hard
Token ID1327
Feature activation+3.34e+00

Pos logit contributions
 walks+0.045
 hearts+0.043
 spr+0.043
 tul+0.043
 strokes+0.043
Neg logit contributions
 seek-0.064
ney-0.057
umpy-0.054
 than-0.052
and-0.050
Token as
Token ID355
Feature activation+1.30e+00

Pos logit contributions
 cockpit+0.255
 cleans+0.253
ience+0.252
 hugs+0.250
irs+0.250
Neg logit contributions
 than-0.131
 seek-0.123
 but-0.106
 except-0.085
 there-0.080
Token she
Token ID673
Feature activation+1.81e+00

Pos logit contributions
 walks+0.062
 hearts+0.059
 spr+0.059
 tul+0.059
 strokes+0.058
Neg logit contributions
 seek-0.094
ney-0.084
umpy-0.081
 than-0.077
and-0.074
Token could
Token ID714
Feature activation+6.98e-01

Pos logit contributions
 walks+0.144
 hearts+0.142
irs+0.142
 tul+0.142
 strokes+0.139
Neg logit contributions
 seek-0.073
ney-0.053
umpy-0.046
 than-0.045
and-0.041
Token with
Token ID351
Feature activation+5.25e-01

Pos logit contributions
 walks+0.042
 spr+0.040
 hearts+0.040
 strokes+0.040
 stro+0.040
Neg logit contributions
 seek-0.042
ney-0.036
umpy-0.034
 than-0.033
and-0.031
Token a
Token ID257
Feature activation+1.16e+00

Pos logit contributions
ience+0.024
 spr+0.024
 tul+0.024
 stro+0.023
 walks+0.022
Neg logit contributions
 seek-0.037
 than-0.030
Bob-0.029
 but-0.028
 Sue-0.028
Token just
Token ID655
Feature activation-7.03e-01

Pos logit contributions
 walks+0.062
 spr+0.058
ened+0.058
 hearts+0.058
 strokes+0.057
Neg logit contributions
 seek-0.056
ney-0.050
umpy-0.047
 than-0.044
and-0.043
Token as
Token ID355
Feature activation+7.85e-01

Pos logit contributions
 seek+0.042
ney+0.036
umpy+0.034
 than+0.033
and+0.031
Neg logit contributions
 walks-0.042
 hearts-0.041
 spr-0.041
 strokes-0.041
 tul-0.041
Token fast
Token ID3049
Feature activation+2.46e+00

Pos logit contributions
 fl+0.043
 scrib+0.040
 bows+0.040
 str+0.039
 spr+0.039
Neg logit contributions
ney-0.045
 seek-0.043
umpy-0.043
asis-0.040
asc-0.040
Token as
Token ID355
Feature activation+1.24e+00

Pos logit contributions
 tul+0.185
irs+0.179
 lav+0.171
 stro+0.168
 spr+0.167
Neg logit contributions
 than-0.112
 seek-0.103
 but-0.083
 except-0.074
 yet-0.071
Token the
Token ID262
Feature activation+1.76e+00

Pos logit contributions
 walks+0.064
 hearts+0.063
 spr+0.062
 bows+0.061
 tul+0.060
Neg logit contributions
 seek-0.083
ney-0.077
umpy-0.075
and-0.065
aking-0.065
Token black
Token ID2042
Feature activation+3.33e+00

Pos logit contributions
 stro+0.117
irs+0.112
 tul+0.111
 spr+0.109
 strokes+0.109
Neg logit contributions
 seek-0.099
ney-0.080
umpy-0.078
 than-0.078
 sunny-0.072
Token automobile
Token ID27930
Feature activation+2.93e+00

Pos logit contributions
irs+0.203
 tul+0.203
 stro+0.199
ience+0.194
 lav+0.190
Neg logit contributions
ney-0.184
 seek-0.181
umpy-0.166
 than-0.164
 boy-0.152
Token.
Token ID13
Feature activation+1.30e-01

Pos logit contributions
 tul+0.175
 walks+0.173
 hearts+0.171
 strokes+0.170
 spr+0.169
Neg logit contributions
 seek-0.174
ney-0.151
 than-0.141
umpy-0.140
and-0.127
Token The
Token ID383
Feature activation+1.30e-01

Pos logit contributions
 walks+0.007
 tul+0.007
 spr+0.007
 hearts+0.007
 stro+0.007
Neg logit contributions
 seek-0.008
ney-0.007
umpy-0.007
 than-0.006
and-0.006
Token little
Token ID1310
Feature activation+1.33e-01

Pos logit contributions
 stro+0.009
irs+0.008
 tul+0.008
 strokes+0.008
 spr+0.008
Neg logit contributions
 seek-0.007
ney-0.006
umpy-0.006
 than-0.005
 sunny-0.005
Token red
Token ID2266
Feature activation+2.08e-01

Pos logit contributions
irs+0.009
 stro+0.008
 strokes+0.008
 tul+0.008
 walks+0.008
Neg logit contributions
 seek-0.008
ney-0.006
 than-0.006
 boy-0.006
umpy-0.006
Token closer
Token ID5699
Feature activation+1.29e-01

Pos logit contributions
 seek+0.008
ney+0.007
umpy+0.007
asis+0.007
aghetti+0.006
Neg logit contributions
 walks-0.007
ened-0.007
 hugs-0.007
 bows-0.007
 spr-0.007
Token and
Token ID290
Feature activation-6.59e-01

Pos logit contributions
 walks+0.007
 strokes+0.007
 spr+0.007
 hearts+0.007
 hugs+0.007
Neg logit contributions
 seek-0.009
ney-0.008
umpy-0.008
neys-0.006
asc-0.006
Token watched
Token ID7342
Feature activation+1.20e+00

Pos logit contributions
 seek+0.043
ney+0.037
umpy+0.036
 than+0.034
and+0.032
Neg logit contributions
 walks-0.037
 spr-0.035
 hearts-0.035
 stro-0.035
 tul-0.035
Token as
Token ID355
Feature activation+7.79e-01

Pos logit contributions
 walks+0.073
 hearts+0.071
 stro+0.071
 tul+0.070
 spr+0.070
Neg logit contributions
 seek-0.070
ney-0.057
 than-0.055
umpy-0.054
and-0.051
Token the
Token ID262
Feature activation+1.62e+00

Pos logit contributions
 walks+0.037
 spr+0.037
 hearts+0.036
 tul+0.035
 strokes+0.035
Neg logit contributions
 seek-0.057
ney-0.051
umpy-0.050
and-0.046
 than-0.046
Token cord
Token ID15050
Feature activation+3.33e+00

Pos logit contributions
 walks+0.081
 stro+0.079
 hearts+0.078
 tul+0.078
 spr+0.077
Neg logit contributions
 seek-0.117
ney-0.100
umpy-0.096
 than-0.093
and-0.085
Token went
Token ID1816
Feature activation+5.56e-01

Pos logit contributions
irs+0.176
 tul+0.169
 hearts+0.164
 hugs+0.161
 walks+0.161
Neg logit contributions
 seek-0.230
ney-0.191
 than-0.179
umpy-0.167
 but-0.157
Token all
Token ID477
Feature activation-3.05e-01

Pos logit contributions
 walks+0.041
 hearts+0.041
 stro+0.040
 tul+0.040
 strokes+0.040
Neg logit contributions
 seek-0.026
ney-0.020
 than-0.019
umpy-0.018
and-0.016
Token around
Token ID1088
Feature activation+1.64e+00

Pos logit contributions
 seek+0.017
ney+0.014
umpy+0.014
 than+0.013
and+0.012
Neg logit contributions
 walks-0.020
 hearts-0.019
 spr-0.019
ened-0.019
 strokes-0.019
Token the
Token ID262
Feature activation+4.80e-01

Pos logit contributions
 walks+0.100
 stro+0.099
 tul+0.098
 hearts+0.097
 strokes+0.097
Neg logit contributions
 seek-0.096
ney-0.079
 than-0.075
umpy-0.073
and-0.069
Token room
Token ID2119
Feature activation+7.40e-01

Pos logit contributions
 walks+0.031
 er+0.030
 hearts+0.029
 spr+0.029
 strokes+0.029
Neg logit contributions
 seek-0.027
umpy-0.022
ney-0.022
 than-0.021
emo-0.020
Token the
Token ID262
Feature activation+1.24e-01

Pos logit contributions
 stro+0.020
 walks+0.020
 tul+0.020
 spr+0.020
 hearts+0.020
Neg logit contributions
 seek-0.027
ney-0.023
 than-0.022
umpy-0.021
and-0.021
Token same
Token ID976
Feature activation-6.63e-01

Pos logit contributions
 walks+0.008
 stro+0.007
 spr+0.007
 strokes+0.007
 hearts+0.007
Neg logit contributions
 seek-0.007
ney-0.006
umpy-0.006
 than-0.006
and-0.005
Token shape
Token ID5485
Feature activation+1.51e+00

Pos logit contributions
 seek+0.045
ney+0.039
umpy+0.037
 than+0.036
and+0.034
Neg logit contributions
 walks-0.035
 hearts-0.033
 spr-0.033
 strokes-0.033
 tul-0.033
Token as
Token ID355
Feature activation+9.71e-01

Pos logit contributions
 walks+0.088
 hearts+0.087
 spr+0.086
 strokes+0.085
 tul+0.084
Neg logit contributions
 seek-0.093
ney-0.078
umpy-0.076
 than-0.071
and-0.068
Token the
Token ID262
Feature activation+1.46e+00

Pos logit contributions
 walks+0.055
 hearts+0.054
 spr+0.053
 strokes+0.053
 tul+0.052
Neg logit contributions
 seek-0.062
ney-0.055
umpy-0.053
 than-0.048
and-0.047
Token toy
Token ID13373
Feature activation+3.32e+00

Pos logit contributions
 walks+0.083
 stro+0.082
 spr+0.079
 tul+0.079
 strokes+0.079
Neg logit contributions
 seek-0.094
ney-0.079
umpy-0.075
 than-0.074
and-0.068
Token.
Token ID13
Feature activation+2.45e-02

Pos logit contributions
 walks+0.190
 hearts+0.182
 stro+0.182
 spr+0.182
 tul+0.182
Neg logit contributions
 seek-0.209
ney-0.179
umpy-0.169
 than-0.165
and-0.154
Token Andy
Token ID12382
Feature activation-4.06e-01

Pos logit contributions
 walks+0.002
 strokes+0.002
 spr+0.002
 hearts+0.002
 stro+0.002
Neg logit contributions
 seek-0.001
ney-0.001
umpy-0.001
 than-0.001
and-0.001
Token was
Token ID373
Feature activation+1.79e-02

Pos logit contributions
 seek+0.022
ney+0.019
umpy+0.017
 than+0.016
and+0.015
Neg logit contributions
 walks-0.027
 tul-0.026
 hearts-0.026
irs-0.026
 spr-0.026
Token so
Token ID523
Feature activation-1.90e+00

Pos logit contributions
 walks+0.001
 spr+0.001
 hearts+0.001
 stro+0.001
 strokes+0.001
Neg logit contributions
 seek-0.001
ney-0.001
umpy-0.001
 than-0.001
and-0.001
Token excited
Token ID6568
Feature activation+3.95e-01

Pos logit contributions
 seek+0.103
ney+0.088
umpy+0.077
 than+0.076
and+0.071
Neg logit contributions
 walks-0.123
 spr-0.123
 stro-0.122
 hugs-0.121
 strokes-0.120
Token-
Token ID12
Feature activation+1.16e+00

Pos logit contributions
 seek+0.005
ney+0.004
umpy+0.004
and+0.004
iring+0.003
Neg logit contributions
 walks-0.004
 hearts-0.004
 strokes-0.003
 tul-0.003
 er-0.003
Tokenfour
Token ID14337
Feature activation+2.37e+00

Pos logit contributions
 walks+0.066
 spr+0.064
 hearts+0.064
 strokes+0.063
ened+0.062
Neg logit contributions
 seek-0.074
ney-0.063
 than-0.060
umpy-0.060
and-0.058
Token,
Token ID11
Feature activation+2.16e+00

Pos logit contributions
 walks+0.129
 hearts+0.126
 strokes+0.124
 spr+0.123
 hugs+0.122
Neg logit contributions
 seek-0.158
ney-0.138
umpy-0.128
 than-0.120
and-0.120
Token thirty
Token ID12277
Feature activation+1.50e+00

Pos logit contributions
ened+0.097
 walks+0.096
irs+0.095
 stro+0.094
ience+0.089
Neg logit contributions
 seek-0.158
 than-0.135
ney-0.128
umpy-0.120
and-0.117
Token-
Token ID12
Feature activation+2.31e+00

Pos logit contributions
 walks+0.071
 hearts+0.070
 strokes+0.069
 er+0.068
 hugs+0.068
Neg logit contributions
 seek-0.108
ney-0.092
umpy-0.091
and-0.083
iring-0.082
Tokenfive
Token ID13261
Feature activation+3.32e+00

Pos logit contributions
 walks+0.119
 hearts+0.116
 spr+0.114
 strokes+0.112
ened+0.111
Neg logit contributions
 seek-0.161
ney-0.137
umpy-0.133
 than-0.132
and-0.127
Token,
Token ID11
Feature activation+1.80e+00

Pos logit contributions
 hearts+0.173
 walks+0.161
 hugs+0.161
 strokes+0.158
ened+0.154
Neg logit contributions
 seek-0.238
ney-0.208
umpy-0.204
rated-0.186
aghetti-0.182
Token thirty
Token ID12277
Feature activation+8.32e-01

Pos logit contributions
ened+0.079
 walks+0.078
irs+0.077
 stro+0.076
ience+0.073
Neg logit contributions
 seek-0.134
 than-0.115
ney-0.111
umpy-0.103
and-0.101
Token-
Token ID12
Feature activation+1.84e+00

Pos logit contributions
 walks+0.041
 hearts+0.041
 strokes+0.040
 er+0.039
 tul+0.039
Neg logit contributions
 seek-0.059
ney-0.051
umpy-0.050
and-0.045
iring-0.044
Tokensix
Token ID19412
Feature activation+7.65e-01

Pos logit contributions
 walks+0.094
 hearts+0.091
 spr+0.090
 strokes+0.089
ened+0.088
Neg logit contributions
 seek-0.129
ney-0.111
umpy-0.107
 than-0.106
and-0.102
Token,
Token ID11
Feature activation+9.87e-01

Pos logit contributions
 walks+0.048
 stro+0.047
irs+0.047
 spr+0.046
ened+0.046
Neg logit contributions
 seek-0.042
ney-0.035
 than-0.034
umpy-0.032
and-0.028
Token-
Token ID12
Feature activation+1.08e+00

Pos logit contributions
 seek+0.027
ney+0.023
umpy+0.023
and+0.020
rated+0.020
Neg logit contributions
 walks-0.019
 hearts-0.019
 strokes-0.018
 er-0.018
 hugs-0.018
Tokenfour
Token ID14337
Feature activation+2.40e+00

Pos logit contributions
 walks+0.060
 spr+0.058
 hearts+0.058
 strokes+0.057
ened+0.056
Neg logit contributions
 seek-0.071
ney-0.060
umpy-0.058
 than-0.057
and-0.055
Token,
Token ID11
Feature activation+2.17e+00

Pos logit contributions
 walks+0.133
 hearts+0.130
 strokes+0.127
 spr+0.127
 hugs+0.126
Neg logit contributions
 seek-0.158
ney-0.137
umpy-0.128
 than-0.120
and-0.119
Token forty
Token ID16571
Feature activation+1.23e+00

Pos logit contributions
 walks+0.094
ened+0.094
irs+0.093
 stro+0.092
 strokes+0.087
Neg logit contributions
 seek-0.163
 than-0.140
ney-0.133
umpy-0.125
and-0.123
Token-
Token ID12
Feature activation+2.33e+00

Pos logit contributions
 walks+0.057
 hearts+0.057
 strokes+0.056
 hugs+0.055
 er+0.055
Neg logit contributions
 seek-0.092
ney-0.078
umpy-0.076
and-0.069
rated-0.069
Tokenfive
Token ID13261
Feature activation+3.32e+00

Pos logit contributions
 walks+0.118
 hearts+0.114
 spr+0.113
 strokes+0.111
ened+0.110
Neg logit contributions
 seek-0.164
ney-0.140
umpy-0.136
 than-0.135
and-0.129
Token,
Token ID11
Feature activation+1.78e+00

Pos logit contributions
 hearts+0.175
 walks+0.166
 hugs+0.164
 strokes+0.162
ened+0.158
Neg logit contributions
 seek-0.234
ney-0.204
umpy-0.199
rated-0.180
aghetti-0.177
Token forty
Token ID16571
Feature activation+5.23e-01

Pos logit contributions
 walks+0.075
ened+0.075
irs+0.074
 stro+0.073
 strokes+0.070
Neg logit contributions
 seek-0.135
 than-0.116
ney-0.113
umpy-0.106
and-0.103
Token-
Token ID12
Feature activation+1.83e+00

Pos logit contributions
 walks+0.025
 hearts+0.025
 strokes+0.024
 hugs+0.024
 er+0.024
Neg logit contributions
 seek-0.038
ney-0.033
umpy-0.032
and-0.029
rated-0.029
Tokensix
Token ID19412
Feature activation+7.01e-01

Pos logit contributions
 walks+0.091
 hearts+0.088
 spr+0.087
 strokes+0.086
ened+0.085
Neg logit contributions
 seek-0.130
ney-0.112
umpy-0.108
 than-0.107
and-0.103
Token,
Token ID11
Feature activation+9.14e-01

Pos logit contributions
 walks+0.044
 stro+0.044
irs+0.043
 spr+0.043
 tul+0.042
Neg logit contributions
 seek-0.039
ney-0.032
 than-0.031
umpy-0.029
and-0.026
Token said
Token ID531
Feature activation-1.08e+00

Pos logit contributions
 walks+0.003
 hearts+0.003
 tul+0.003
irs+0.003
 strokes+0.003
Neg logit contributions
 seek-0.004
ney-0.003
 than-0.003
umpy-0.003
Bob-0.003
Token,
Token ID11
Feature activation-5.39e-01

Pos logit contributions
 seek+0.056
 than+0.044
ney+0.042
umpy+0.042
and+0.039
Neg logit contributions
 walks-0.070
irs-0.069
ened-0.068
 hearts-0.067
 bows-0.067
Token "
Token ID366
Feature activation-5.21e-01

Pos logit contributions
 seek+0.026
ney+0.021
 than+0.020
umpy+0.020
and+0.018
Neg logit contributions
 walks-0.037
ened-0.036
irs-0.036
 hearts-0.036
 bows-0.036
TokenA
Token ID32
Feature activation+3.06e-01

Pos logit contributions
 seek+0.030
ney+0.025
umpy+0.024
 than+0.023
and+0.022
Neg logit contributions
 walks-0.033
 hearts-0.032
 hugs-0.032
 strokes-0.032
ened-0.032
Tokenbr
Token ID1671
Feature activation-2.78e-01

Pos logit contributions
 walks+0.015
 hearts+0.015
 stro+0.015
ened+0.015
 tul+0.015
Neg logit contributions
 seek-0.022
ney-0.018
umpy-0.018
 than-0.017
and-0.016
Tokenac
Token ID330
Feature activation-4.05e+00

Pos logit contributions
 seek+0.010
ney+0.007
umpy+0.007
 than+0.006
and+0.006
Neg logit contributions
 walks-0.023
 hearts-0.023
 spr-0.023
 strokes-0.023
 tul-0.023
Tokenad
Token ID324
Feature activation-1.09e+00

Pos logit contributions
 seek+0.349
ney+0.307
umpy+0.298
 than+0.291
and+0.284
Neg logit contributions
 walks-0.144
 spr-0.134
 hearts-0.134
ened-0.133
 strokes-0.132
Tokenab
Token ID397
Feature activation-1.10e+00

Pos logit contributions
 seek+0.070
ney+0.060
umpy+0.057
 than+0.055
and+0.053
Neg logit contributions
 walks-0.061
 spr-0.059
 hearts-0.058
 strokes-0.058
 tul-0.058
Tokenra
Token ID430
Feature activation-1.41e+00

Pos logit contributions
 seek+0.085
ney+0.075
umpy+0.071
 than+0.070
and+0.066
Neg logit contributions
 walks-0.048
 hearts-0.046
 tul-0.045
 stro-0.045
 strokes-0.044
Token!"
Token ID2474
Feature activation-3.33e-01

Pos logit contributions
 seek+0.071
ney+0.056
umpy+0.054
 than+0.053
and+0.046
Neg logit contributions
 walks-0.098
 hearts-0.097
 tul-0.095
 spr-0.095
 strokes-0.095
Token But
Token ID887
Feature activation+5.15e-01

Pos logit contributions
 seek+0.018
ney+0.015
umpy+0.014
 than+0.013
rated+0.012
Neg logit contributions
 walks-0.023
 strokes-0.022
 stro-0.022
 spr-0.022
 hearts-0.022
Token together
Token ID1978
Feature activation+2.08e-02

Pos logit contributions
 seek+0.015
 than+0.011
ney+0.011
umpy+0.011
and+0.011
Neg logit contributions
 walks-0.020
irs-0.020
ened-0.020
 tul-0.019
 er-0.019
Token,
Token ID11
Feature activation+5.09e-02

Pos logit contributions
 tul+0.001
 walks+0.001
irs+0.001
 stro+0.001
 bows+0.001
Neg logit contributions
 seek-0.001
 than-0.001
ney-0.001
and-0.001
umpy-0.001
Token "
Token ID366
Feature activation-1.20e-01

Pos logit contributions
 walks+0.003
ened+0.003
 tul+0.003
 hearts+0.003
 er+0.003
Neg logit contributions
 seek-0.003
ney-0.002
 than-0.002
umpy-0.002
and-0.002
TokenA
Token ID32
Feature activation+4.64e-01

Pos logit contributions
 seek+0.006
ney+0.005
umpy+0.005
 than+0.005
and+0.004
Neg logit contributions
 walks-0.008
 hearts-0.008
 spr-0.008
 strokes-0.008
 tul-0.008
Tokenbr
Token ID1671
Feature activation-2.06e-02

Pos logit contributions
 walks+0.028
 hearts+0.028
ened+0.027
 stro+0.027
 tul+0.027
Neg logit contributions
 seek-0.028
ney-0.023
umpy-0.022
 than-0.022
and-0.020
Tokenac
Token ID330
Feature activation-3.98e+00

Pos logit contributions
 seek+0.001
ney+0.001
umpy+0.001
 than+0.001
and+0.000
Neg logit contributions
 walks-0.002
 hearts-0.002
 spr-0.002
 strokes-0.002
 tul-0.002
Tokenad
Token ID324
Feature activation-8.88e-01

Pos logit contributions
 seek+0.334
ney+0.296
umpy+0.286
 than+0.279
and+0.268
Neg logit contributions
 walks-0.150
 hearts-0.141
 spr-0.141
 strokes-0.139
 tul-0.139
Tokenab
Token ID397
Feature activation-1.10e+00

Pos logit contributions
 seek+0.056
ney+0.048
umpy+0.045
 than+0.044
and+0.042
Neg logit contributions
 walks-0.051
 hearts-0.049
 spr-0.049
 strokes-0.049
 tul-0.048
Tokenra
Token ID430
Feature activation-1.25e+00

Pos logit contributions
 seek+0.083
ney+0.073
 than+0.069
umpy+0.068
and+0.063
Neg logit contributions
 walks-0.050
 hearts-0.047
 tul-0.047
 stro-0.047
 spr-0.047
Token!"
Token ID2474
Feature activation-2.67e-01

Pos logit contributions
 seek+0.056
ney+0.044
 than+0.041
umpy+0.041
and+0.034
Neg logit contributions
 walks-0.092
 hearts-0.092
 tul-0.090
 spr-0.089
 strokes-0.089
Token
Token ID198
Feature activation+6.84e-02

Pos logit contributions
 seek+0.014
ney+0.011
umpy+0.010
 than+0.010
and+0.009
Neg logit contributions
 walks-0.019
 hearts-0.018
 strokes-0.018
 stro-0.018
 spr-0.018
Token and
Token ID290
Feature activation-3.33e-01

Pos logit contributions
 seek+0.031
ney+0.026
umpy+0.024
 than+0.023
and+0.022
Neg logit contributions
 walks-0.038
 hearts-0.037
 spr-0.036
 strokes-0.036
ened-0.036
Token say
Token ID910
Feature activation+1.39e-01

Pos logit contributions
 seek+0.023
ney+0.020
umpy+0.019
 than+0.019
and+0.017
Neg logit contributions
 walks-0.017
 hearts-0.016
 spr-0.016
 tul-0.016
 strokes-0.016
Token "
Token ID366
Feature activation+2.23e-01

Pos logit contributions
 walks+0.008
 spr+0.007
 hearts+0.007
 stro+0.007
 tul+0.007
Neg logit contributions
 seek-0.009
ney-0.008
umpy-0.007
 than-0.007
and-0.007
TokenA
Token ID32
Feature activation+6.94e-01

Pos logit contributions
ened+0.012
 walks+0.012
 bows+0.012
 hearts+0.012
 stro+0.012
Neg logit contributions
 seek-0.014
ney-0.012
umpy-0.011
neys-0.011
 than-0.011
Tokenbr
Token ID1671
Feature activation+3.21e-01

Pos logit contributions
 walks+0.036
 hearts+0.035
 stro+0.035
ened+0.034
 strokes+0.034
Neg logit contributions
 seek-0.049
ney-0.042
umpy-0.040
 than-0.039
and-0.037
Tokenac
Token ID330
Feature activation-3.95e+00

Pos logit contributions
 hearts+0.025
 walks+0.025
 strokes+0.025
 spr+0.025
 tul+0.025
Neg logit contributions
 seek-0.014
ney-0.010
umpy-0.009
 than-0.008
and-0.008
Tokenad
Token ID324
Feature activation-6.99e-01

Pos logit contributions
 seek+0.333
ney+0.292
umpy+0.284
 than+0.276
and+0.267
Neg logit contributions
 walks-0.147
 hearts-0.139
ened-0.136
 strokes-0.136
 spr-0.136
Tokenab
Token ID397
Feature activation-8.32e-01

Pos logit contributions
 seek+0.046
ney+0.039
umpy+0.037
 than+0.036
and+0.034
Neg logit contributions
 walks-0.038
 spr-0.036
 hearts-0.036
 strokes-0.036
 tul-0.036
Tokenra
Token ID430
Feature activation-9.78e-01

Pos logit contributions
 seek+0.063
ney+0.055
umpy+0.052
 than+0.052
and+0.048
Neg logit contributions
 walks-0.038
 hearts-0.036
 stro-0.035
 tul-0.035
 spr-0.035
Token!"
Token ID2474
Feature activation-2.38e-01

Pos logit contributions
 seek+0.038
ney+0.029
umpy+0.027
 than+0.025
and+0.022
Neg logit contributions
 walks-0.079
 hearts-0.077
 spr-0.077
 tul-0.077
 strokes-0.076
Token and
Token ID290
Feature activation-5.39e-01

Pos logit contributions
 seek+0.013
ney+0.011
umpy+0.010
 than+0.010
and+0.009
Neg logit contributions
 walks-0.016
 hearts-0.015
 strokes-0.015
 stro-0.015
 spr-0.015
Token She
Token ID1375
Feature activation+2.53e-01

Pos logit contributions
 seek+0.001
ney+0.001
umpy+0.001
 than+0.001
and+0.001
Neg logit contributions
 walks-0.001
 spr-0.001
 hearts-0.001
 strokes-0.001
 hugs-0.001
Token had
Token ID550
Feature activation-4.86e-01

Pos logit contributions
 walks+0.014
 hearts+0.014
 tul+0.014
 strokes+0.014
irs+0.014
Neg logit contributions
 seek-0.016
ney-0.014
umpy-0.013
 than-0.012
and-0.011
Token entered
Token ID5982
Feature activation-3.96e-01

Pos logit contributions
 seek+0.028
ney+0.021
 than+0.020
umpy+0.019
and+0.018
Neg logit contributions
 tul-0.031
irs-0.031
 hearts-0.031
ened-0.030
ience-0.030
Token a
Token ID257
Feature activation-3.19e-01

Pos logit contributions
 seek+0.026
ney+0.021
 than+0.020
umpy+0.020
and+0.019
Neg logit contributions
 walks-0.022
 hearts-0.021
 stro-0.021
 tul-0.021
 spr-0.021
Token sand
Token ID6450
Feature activation-1.58e+00

Pos logit contributions
 seek+0.018
ney+0.015
umpy+0.015
 than+0.014
and+0.013
Neg logit contributions
 walks-0.020
 hearts-0.019
 spr-0.019
 strokes-0.019
 tul-0.019
Tokenpit
Token ID15544
Feature activation-3.92e+00

Pos logit contributions
 seek+0.081
umpy+0.070
ney+0.068
and+0.061
rated+0.059
Neg logit contributions
 spr-0.109
 walks-0.105
 hearts-0.104
 tul-0.103
 er-0.102
Token and
Token ID290
Feature activation+1.19e-01

Pos logit contributions
 seek+0.213
ney+0.177
umpy+0.176
and+0.153
 than+0.150
Neg logit contributions
 spr-0.254
 walks-0.254
 hearts-0.244
 tul-0.244
 er-0.242
Token it
Token ID340
Feature activation+9.27e-01

Pos logit contributions
 walks+0.006
 spr+0.006
 stro+0.006
 hearts+0.006
 tul+0.006
Neg logit contributions
 seek-0.008
ney-0.007
umpy-0.007
 than-0.006
and-0.006
Token was
Token ID373
Feature activation+4.49e-01

Pos logit contributions
 walks+0.061
 spr+0.059
 hearts+0.059
 strokes+0.058
 tul+0.058
Neg logit contributions
 seek-0.051
ney-0.042
umpy-0.040
 than-0.038
and-0.036
Token getting
Token ID1972
Feature activation+2.00e-01

Pos logit contributions
 walks+0.028
 spr+0.027
 hearts+0.027
 tul+0.027
 strokes+0.026
Neg logit contributions
 seek-0.026
ney-0.023
umpy-0.022
 than-0.020
and-0.020
Token bigger
Token ID5749
Feature activation+1.02e+00

Pos logit contributions
 walks+0.013
 hearts+0.012
 strokes+0.012
 stro+0.012
 spr+0.012
Neg logit contributions
 seek-0.011
ney-0.010
umpy-0.009
 than-0.009
and-0.008
Token around
Token ID1088
Feature activation+1.11e+00

Pos logit contributions
 tul+0.100
irs+0.098
 stro+0.095
 spr+0.095
 hearts+0.095
Neg logit contributions
 seek-0.039
 than-0.029
ney-0.024
and-0.021
 but-0.020
Token.
Token ID13
Feature activation-1.96e-01

Pos logit contributions
ience+0.063
 stro+0.057
 spr+0.056
 walks+0.056
irs+0.055
Neg logit contributions
 seek-0.074
 than-0.062
ney-0.056
 but-0.055
and-0.055
Token "
Token ID366
Feature activation-3.34e-01

Pos logit contributions
 seek+0.009
ney+0.007
umpy+0.007
and+0.006
 than+0.006
Neg logit contributions
 walks-0.014
ened-0.014
ience-0.014
 hearts-0.014
 tul-0.014
TokenA
Token ID32
Feature activation+5.37e-01

Pos logit contributions
 seek+0.020
ney+0.017
umpy+0.016
 than+0.015
and+0.014
Neg logit contributions
 walks-0.020
 hearts-0.020
 spr-0.019
 tul-0.019
 stro-0.019
Tokenbr
Token ID1671
Feature activation+5.80e-02

Pos logit contributions
 walks+0.023
 hearts+0.022
 stro+0.021
 spr+0.021
ened+0.021
Neg logit contributions
 seek-0.042
ney-0.037
umpy-0.036
 than-0.035
and-0.033
Tokenac
Token ID330
Feature activation-3.84e+00

Pos logit contributions
 walks+0.005
 hearts+0.005
 spr+0.005
 strokes+0.005
 tul+0.005
Neg logit contributions
 seek-0.002
ney-0.002
umpy-0.001
 than-0.001
and-0.001
Tokenad
Token ID324
Feature activation-8.73e-01

Pos logit contributions
 seek+0.316
ney+0.281
umpy+0.271
 than+0.264
and+0.253
Neg logit contributions
 walks-0.150
 hearts-0.142
 spr-0.141
 stro-0.141
 tul-0.140
Tokenab
Token ID397
Feature activation-9.35e-01

Pos logit contributions
 seek+0.059
ney+0.050
umpy+0.048
 than+0.046
and+0.045
Neg logit contributions
 walks-0.047
 spr-0.045
 strokes-0.044
 hearts-0.044
 hugs-0.044
Tokenra
Token ID430
Feature activation-1.19e+00

Pos logit contributions
 seek+0.068
ney+0.059
 than+0.056
umpy+0.056
and+0.051
Neg logit contributions
 walks-0.044
 hearts-0.042
 stro-0.042
 tul-0.041
 spr-0.041
Token!"
Token ID2474
Feature activation-1.24e-01

Pos logit contributions
 seek+0.043
ney+0.030
umpy+0.028
 than+0.027
and+0.022
Neg logit contributions
 walks-0.100
 hearts-0.099
 tul-0.098
 spr-0.097
 strokes-0.097
Token
Token ID198
Feature activation-1.16e-02

Pos logit contributions
 seek+0.008
umpy+0.006
ney+0.006
 than+0.006
and+0.006
Neg logit contributions
 walks-0.007
 hearts-0.007
 tul-0.007
ened-0.007
ience-0.007
Token said
Token ID531
Feature activation+3.48e-01

Pos logit contributions
 seek+0.018
 than+0.013
ney+0.012
Bob+0.011
 boy+0.011
Neg logit contributions
irs-0.017
 hearts-0.016
 walks-0.016
ened-0.016
 tul-0.016
Token,
Token ID11
Feature activation+2.06e-01

Pos logit contributions
 walks+0.024
ened+0.024
 hearts+0.023
irs+0.023
 spr+0.023
Neg logit contributions
 seek-0.017
 than-0.012
ney-0.012
umpy-0.012
and-0.011
Token "
Token ID366
Feature activation+7.12e-02

Pos logit contributions
 walks+0.014
ened+0.014
 hearts+0.014
 er+0.014
irs+0.014
Neg logit contributions
 seek-0.010
ney-0.008
 than-0.008
umpy-0.008
and-0.007
TokenA
Token ID32
Feature activation+5.89e-01

Pos logit contributions
 walks+0.005
ened+0.005
 spr+0.005
 strokes+0.005
 hearts+0.005
Neg logit contributions
 seek-0.004
ney-0.003
umpy-0.003
 than-0.003
and-0.003
Tokenbr
Token ID1671
Feature activation+2.82e-01

Pos logit contributions
ened+0.031
 walks+0.031
 stro+0.030
 hearts+0.030
irs+0.029
Neg logit contributions
 seek-0.041
ney-0.034
umpy-0.033
 than-0.033
and-0.029
Tokenac
Token ID330
Feature activation-3.80e+00

Pos logit contributions
 walks+0.026
 stro+0.025
 spr+0.024
 strokes+0.024
 hugs+0.024
Neg logit contributions
 seek-0.009
ney-0.007
umpy-0.006
 than-0.005
and-0.004
Tokenad
Token ID324
Feature activation-7.69e-01

Pos logit contributions
 seek+0.294
ney+0.264
umpy+0.254
 than+0.249
and+0.232
Neg logit contributions
 walks-0.162
 stro-0.161
 tul-0.158
 spr-0.156
 strokes-0.155
Tokenab
Token ID397
Feature activation-7.77e-01

Pos logit contributions
 seek+0.049
ney+0.042
umpy+0.040
 than+0.038
and+0.036
Neg logit contributions
 walks-0.044
 hearts-0.042
 spr-0.042
 strokes-0.042
 tul-0.042
Tokenra
Token ID430
Feature activation-1.12e+00

Pos logit contributions
 seek+0.054
ney+0.047
 than+0.045
umpy+0.043
kin+0.040
Neg logit contributions
 walks-0.039
 hearts-0.037
 stro-0.037
 tul-0.036
 strokes-0.036
Token!"
Token ID2474
Feature activation-1.22e-01

Pos logit contributions
 seek+0.053
ney+0.040
 than+0.040
umpy+0.039
and+0.033
Neg logit contributions
 hearts-0.080
 walks-0.080
 tul-0.079
 spr-0.078
 strokes-0.077
Token Suddenly
Token ID24975
Feature activation+7.00e-01

Pos logit contributions
 seek+0.005
ney+0.004
umpy+0.004
 than+0.004
and+0.004
Neg logit contributions
 walks-0.009
ened-0.009
 spr-0.009
 tul-0.009
 hearts-0.009
Token new
Token ID649
Feature activation+3.44e-01

Pos logit contributions
 walks+0.032
 spr+0.031
 tul+0.030
 stro+0.030
 hearts+0.030
Neg logit contributions
 seek-0.041
ney-0.035
umpy-0.034
 than-0.033
and-0.031
Token knowledge
Token ID3725
Feature activation+7.54e-01

Pos logit contributions
 stro+0.019
 walks+0.019
 tul+0.018
irs+0.018
 spr+0.018
Neg logit contributions
 seek-0.022
ney-0.018
 than-0.017
umpy-0.016
 active-0.015
Token he
Token ID339
Feature activation-6.86e-01

Pos logit contributions
 walks+0.045
 hearts+0.044
 strokes+0.043
 spr+0.043
 hugs+0.043
Neg logit contributions
 seek-0.046
ney-0.039
umpy-0.037
 than-0.034
and-0.033
Token made
Token ID925
Feature activation-1.77e+00

Pos logit contributions
 seek+0.041
ney+0.034
umpy+0.033
 than+0.031
and+0.029
Neg logit contributions
 walks-0.042
 hearts-0.041
 spr-0.040
 strokes-0.040
 stro-0.040
Token gu
Token ID915
Feature activation-1.43e+00

Pos logit contributions
 seek+0.120
ney+0.105
umpy+0.102
 than+0.090
and+0.088
Neg logit contributions
 walks-0.097
 strokes-0.094
 hearts-0.092
 hugs-0.091
ened-0.090
Tokenac
Token ID330
Feature activation-3.77e+00

Pos logit contributions
 seek+0.069
umpy+0.048
ney+0.046
Bob+0.045
rated+0.043
Neg logit contributions
 hearts-0.103
 er-0.102
ened-0.099
 strokes-0.099
 hugs-0.099
Tokenam
Token ID321
Feature activation-1.36e+00

Pos logit contributions
 seek+0.279
ney+0.213
umpy+0.208
 than+0.200
 seeker+0.196
Neg logit contributions
 walks-0.190
 hearts-0.188
 hugs-0.185
ened-0.184
 strokes-0.182
Tokenole
Token ID2305
Feature activation-1.72e+00

Pos logit contributions
 seek+0.082
ney+0.067
umpy+0.065
and+0.059
 than+0.058
Neg logit contributions
 walks-0.081
 hearts-0.079
ened-0.079
 strokes-0.078
 hugs-0.077
Token.
Token ID13
Feature activation-1.53e-01

Pos logit contributions
 seek+0.106
ney+0.091
umpy+0.089
 than+0.078
iring+0.077
Neg logit contributions
 walks-0.102
 hearts-0.096
 strokes-0.096
 hugs-0.095
 spr-0.095
Token
Token ID198
Feature activation-3.28e-01

Pos logit contributions
 seek+0.009
ney+0.007
umpy+0.007
 than+0.007
and+0.006
Neg logit contributions
 walks-0.010
 hearts-0.009
 strokes-0.009
 spr-0.009
 stro-0.009
Token
Token ID198
Feature activation-3.71e-02

Pos logit contributions
 seek+0.019
ney+0.016
umpy+0.015
 than+0.015
and+0.013
Neg logit contributions
 walks-0.021
 strokes-0.020
 hearts-0.019
 stro-0.019
 spr-0.019
Token and
Token ID290
Feature activation-1.89e-01

Pos logit contributions
ience+0.023
irs+0.022
 stro+0.022
 walks+0.022
 strokes+0.022
Neg logit contributions
 seek-0.017
 than-0.014
ney-0.012
 but-0.011
and-0.011
Token said
Token ID531
Feature activation+9.18e-02

Pos logit contributions
 seek+0.013
ney+0.010
 than+0.010
umpy+0.009
and+0.009
Neg logit contributions
 walks-0.010
 tul-0.010
 hearts-0.010
 spr-0.010
 strokes-0.010
Token "
Token ID366
Feature activation+3.69e-02

Pos logit contributions
 walks+0.006
irs+0.006
 hearts+0.006
 spr+0.006
ened+0.006
Neg logit contributions
 seek-0.004
 than-0.003
ney-0.003
and-0.003
umpy-0.003
TokenA
Token ID32
Feature activation+6.61e-01

Pos logit contributions
 walks+0.002
 hearts+0.002
 spr+0.002
 tul+0.002
 stro+0.002
Neg logit contributions
 seek-0.002
ney-0.002
umpy-0.002
 than-0.002
and-0.002
Tokenbr
Token ID1671
Feature activation+2.35e-01

Pos logit contributions
 stro+0.036
 walks+0.035
 hearts+0.035
ened+0.034
ishly+0.034
Neg logit contributions
 seek-0.047
ney-0.036
umpy-0.036
 than-0.035
and-0.032
Tokenac
Token ID330
Feature activation-3.76e+00

Pos logit contributions
 walks+0.020
 hearts+0.020
 spr+0.020
 stro+0.020
 strokes+0.020
Neg logit contributions
 seek-0.008
ney-0.006
umpy-0.005
 than-0.005
and-0.004
Tokenad
Token ID324
Feature activation-7.13e-01

Pos logit contributions
 seek+0.296
ney+0.265
umpy+0.255
 than+0.248
and+0.235
Neg logit contributions
 walks-0.159
 stro-0.154
 spr-0.151
 strokes-0.151
 tul-0.150
Tokenab
Token ID397
Feature activation-7.00e-01

Pos logit contributions
 seek+0.045
ney+0.039
umpy+0.037
 than+0.035
and+0.034
Neg logit contributions
 walks-0.041
 hearts-0.039
 spr-0.039
 strokes-0.039
 tul-0.039
Tokenra
Token ID430
Feature activation-1.04e+00

Pos logit contributions
 seek+0.050
ney+0.043
 than+0.041
umpy+0.039
kin+0.037
Neg logit contributions
 walks-0.035
 stro-0.033
 hearts-0.033
 tul-0.032
 strokes-0.032
Token!"
Token ID2474
Feature activation-4.14e-03

Pos logit contributions
 seek+0.033
ney+0.022
umpy+0.020
 than+0.020
and+0.014
Neg logit contributions
 walks-0.092
 hearts-0.091
 tul-0.090
 spr-0.089
 strokes-0.089
Token but
Token ID475
Feature activation-8.05e-01

Pos logit contributions
 seek+0.000
ney+0.000
 than+0.000
and+0.000
umpy+0.000
Neg logit contributions
ience-0.000
 walks-0.000
 tul-0.000
ened-0.000
 spr-0.000
Token said
Token ID531
Feature activation+1.16e-01

Pos logit contributions
 seek+0.002
ney+0.001
 than+0.001
umpy+0.001
and+0.001
Neg logit contributions
 walks-0.001
 hearts-0.001
 spr-0.001
 tul-0.001
 strokes-0.001
Token,
Token ID11
Feature activation+2.29e-01

Pos logit contributions
 walks+0.007
 spr+0.007
 hearts+0.007
 stro+0.007
ened+0.007
Neg logit contributions
 seek-0.006
ney-0.005
 than-0.005
umpy-0.005
and-0.005
Token "
Token ID366
Feature activation-1.13e-01

Pos logit contributions
 walks+0.016
 hearts+0.015
ened+0.015
irs+0.015
 strokes+0.015
Neg logit contributions
 seek-0.011
ney-0.009
 than-0.009
umpy-0.008
and-0.008
TokenA
Token ID32
Feature activation+6.88e-01

Pos logit contributions
 seek+0.007
ney+0.006
umpy+0.006
 than+0.005
and+0.005
Neg logit contributions
 walks-0.007
 hearts-0.006
 spr-0.006
 tul-0.006
 strokes-0.006
Tokenbr
Token ID1671
Feature activation+1.62e-01

Pos logit contributions
 walks+0.040
 hearts+0.039
 stro+0.038
ened+0.038
 strokes+0.038
Neg logit contributions
 seek-0.044
ney-0.036
umpy-0.035
 than-0.033
and-0.032
Tokenac
Token ID330
Feature activation-3.75e+00

Pos logit contributions
 walks+0.014
 hearts+0.014
 spr+0.014
 strokes+0.014
 tul+0.014
Neg logit contributions
 seek-0.006
ney-0.004
umpy-0.004
 than-0.003
and-0.003
Tokenad
Token ID324
Feature activation-7.19e-01

Pos logit contributions
 seek+0.320
ney+0.284
umpy+0.274
 than+0.269
and+0.259
Neg logit contributions
 walks-0.135
 hearts-0.127
 spr-0.126
 strokes-0.124
 tul-0.123
Tokenab
Token ID397
Feature activation-8.05e-01

Pos logit contributions
 seek+0.050
ney+0.043
umpy+0.041
iring+0.039
 than+0.039
Neg logit contributions
 walks-0.036
 spr-0.035
 er-0.035
 tul-0.034
 hugs-0.034
Tokenra
Token ID430
Feature activation-1.07e+00

Pos logit contributions
 seek+0.062
ney+0.054
 than+0.051
umpy+0.051
and+0.047
Neg logit contributions
 walks-0.036
 hearts-0.034
 stro-0.033
 spr-0.033
 tul-0.033
Token!
Token ID0
Feature activation-7.02e-02

Pos logit contributions
 seek+0.036
ney+0.025
umpy+0.023
 than+0.022
and+0.018
Neg logit contributions
 walks-0.092
 hearts-0.091
 spr-0.090
 tul-0.090
 strokes-0.090
Token I
Token ID314
Feature activation+5.67e-02

Pos logit contributions
 seek+0.005
ney+0.004
 than+0.004
umpy+0.004
and+0.003
Neg logit contributions
 walks-0.004
 tul-0.004
 spr-0.004
 stro-0.004
 hearts-0.004
Token air
Token ID1633
Feature activation-3.84e-04

Pos logit contributions
 seek+0.014
ney+0.013
 than+0.011
umpy+0.011
 boy+0.010
Neg logit contributions
 stro-0.016
irs-0.015
 tul-0.015
ience-0.015
 walks-0.015
Token.
Token ID13
Feature activation-4.14e-01

Pos logit contributions
 seek+0.000
ney+0.000
umpy+0.000
 than+0.000
and+0.000
Neg logit contributions
 walks-0.000
 hearts-0.000
 tul-0.000
 stro-0.000
 spr-0.000
Token "
Token ID366
Feature activation-4.74e-01

Pos logit contributions
 seek+0.026
ney+0.022
umpy+0.021
 than+0.020
rated+0.019
Neg logit contributions
 strokes-0.025
 spr-0.025
 walks-0.025
 hearts-0.024
 hugs-0.023
TokenA
Token ID32
Feature activation+5.20e-01

Pos logit contributions
 seek+0.025
ney+0.021
umpy+0.020
 than+0.019
and+0.017
Neg logit contributions
 walks-0.031
 hearts-0.030
 spr-0.030
 tul-0.030
 stro-0.030
Tokenbr
Token ID1671
Feature activation+1.36e-01

Pos logit contributions
 stro+0.025
 walks+0.025
 hearts+0.025
ened+0.025
 strokes+0.024
Neg logit contributions
 seek-0.039
ney-0.033
umpy-0.031
 than-0.031
and-0.028
Tokenac
Token ID330
Feature activation-3.75e+00

Pos logit contributions
 walks+0.011
 hearts+0.011
 spr+0.011
 strokes+0.011
 tul+0.011
Neg logit contributions
 seek-0.005
ney-0.004
umpy-0.003
 than-0.003
and-0.003
Tokenad
Token ID324
Feature activation-8.82e-01

Pos logit contributions
 seek+0.297
ney+0.267
umpy+0.256
 than+0.250
and+0.234
Neg logit contributions
 walks-0.155
 stro-0.152
 tul-0.148
 hearts-0.147
 spr-0.146
Tokenab
Token ID397
Feature activation-7.41e-01

Pos logit contributions
 seek+0.055
ney+0.047
umpy+0.045
 than+0.044
and+0.041
Neg logit contributions
 walks-0.051
 hearts-0.049
 spr-0.049
 strokes-0.048
 tul-0.048
Tokenra
Token ID430
Feature activation-1.16e+00

Pos logit contributions
 seek+0.051
ney+0.045
 than+0.042
umpy+0.040
kin+0.039
Neg logit contributions
 walks-0.037
 stro-0.036
 hearts-0.035
 tul-0.035
 strokes-0.034
Token!"
Token ID2474
Feature activation-1.13e-01

Pos logit contributions
 seek+0.039
ney+0.027
 than+0.026
umpy+0.025
aking+0.018
Neg logit contributions
 hearts-0.098
 tul-0.096
 walks-0.096
 stro-0.095
 spr-0.095
Token
Token ID198
Feature activation-1.66e-02

Pos logit contributions
 seek+0.006
ney+0.006
umpy+0.005
 than+0.005
rated+0.005
Neg logit contributions
 walks-0.007
 strokes-0.007
 spr-0.007
 hugs-0.007
 stro-0.007
Token did
Token ID750
Feature activation+2.37e-02

Pos logit contributions
 seek+0.005
ney+0.004
umpy+0.004
 than+0.004
and+0.003
Neg logit contributions
 walks-0.004
 hearts-0.004
 strokes-0.003
 tul-0.003
ened-0.003
Token!
Token ID0
Feature activation-4.54e-01

Pos logit contributions
 walks+0.002
 hearts+0.002
 spr+0.002
 strokes+0.001
 tul+0.001
Neg logit contributions
 seek-0.001
ney-0.001
umpy-0.001
 than-0.001
and-0.001
Token They
Token ID1119
Feature activation+4.16e-01

Pos logit contributions
 seek+0.031
ney+0.027
 than+0.025
umpy+0.025
rated+0.023
Neg logit contributions
 hugs-0.024
 strokes-0.024
 hearts-0.024
 walks-0.024
 stro-0.023
Token built
Token ID3170
Feature activation-2.27e+00

Pos logit contributions
 walks+0.027
 strokes+0.026
irs+0.026
 hearts+0.026
chief+0.025
Neg logit contributions
 seek-0.024
ney-0.020
o-0.016
aking-0.015
umpy-0.015
Token the
Token ID262
Feature activation-1.43e+00

Pos logit contributions
 seek+0.143
ney+0.121
 than+0.113
umpy+0.113
and+0.105
Neg logit contributions
 walks-0.127
 stro-0.125
 tul-0.123
 spr-0.123
 strokes-0.122
Token most
Token ID749
Feature activation-3.73e+00

Pos logit contributions
 seek+0.093
ney+0.081
umpy+0.079
 than+0.076
and+0.073
Neg logit contributions
 walks-0.078
 hearts-0.077
 spr-0.077
 tul-0.075
 strokes-0.074
Token beautiful
Token ID4950
Feature activation-5.91e-01

Pos logit contributions
 seek+0.252
ney+0.215
umpy+0.212
rated+0.199
asc+0.192
Neg logit contributions
 spr-0.199
 hearts-0.194
 walks-0.193
 er-0.190
 tul-0.189
Token castle
Token ID16669
Feature activation-1.48e+00

Pos logit contributions
 seek+0.041
umpy+0.037
ney+0.037
and+0.034
iring+0.033
Neg logit contributions
 hearts-0.030
 walks-0.030
 spr-0.029
 strokes-0.028
 tul-0.028
Token with
Token ID351
Feature activation-3.58e-01

Pos logit contributions
 seek+0.088
umpy+0.081
ney+0.073
Bob+0.070
rated+0.069
Neg logit contributions
 walks-0.086
 hearts-0.082
 spr-0.081
 hugs-0.080
 bows-0.080
Token the
Token ID262
Feature activation-1.17e-01

Pos logit contributions
 seek+0.020
ney+0.016
 than+0.016
 sunny+0.014
neys+0.014
Neg logit contributions
ience-0.022
 stro-0.022
 walks-0.022
ian-0.022
chief-0.021
Token clean
Token ID3424
Feature activation+4.10e-01

Pos logit contributions
 seek+0.008
ney+0.007
umpy+0.007
 than+0.006
and+0.006
Neg logit contributions
 walks-0.006
 hearts-0.006
 spr-0.006
 tul-0.006
 strokes-0.006
Token wand
Token ID11569
Feature activation+4.32e-01

Pos logit contributions
 walks+0.004
 stro+0.004
 hearts+0.004
 spr+0.004
 tul+0.004
Neg logit contributions
 seek-0.006
ney-0.005
 than-0.005
umpy-0.004
and-0.004
Token.
Token ID13
Feature activation-3.82e-01

Pos logit contributions
 walks+0.026
 hearts+0.025
 spr+0.025
 strokes+0.025
 stro+0.025
Neg logit contributions
 seek-0.025
ney-0.021
umpy-0.020
 than-0.019
and-0.018
Token "
Token ID366
Feature activation-4.01e-01

Pos logit contributions
 seek+0.023
ney+0.019
 than+0.018
umpy+0.018
rated+0.017
Neg logit contributions
 strokes-0.024
 walks-0.024
 hearts-0.023
 hugs-0.023
 stro-0.023
TokenA
Token ID32
Feature activation+5.12e-01

Pos logit contributions
 seek+0.024
ney+0.020
umpy+0.019
 than+0.019
and+0.018
Neg logit contributions
 walks-0.024
 hearts-0.023
 spr-0.023
 strokes-0.023
 tul-0.023
Tokenbr
Token ID1671
Feature activation+8.38e-02

Pos logit contributions
 walks+0.021
 hearts+0.021
 spr+0.020
 stro+0.020
ened+0.020
Neg logit contributions
 seek-0.041
ney-0.035
umpy-0.034
 than-0.033
and-0.032
Tokenac
Token ID330
Feature activation-3.72e+00

Pos logit contributions
 walks+0.007
 spr+0.007
 hearts+0.007
 stro+0.007
 strokes+0.007
Neg logit contributions
 seek-0.003
ney-0.002
umpy-0.002
 than-0.002
and-0.002
Tokenad
Token ID324
Feature activation-8.76e-01

Pos logit contributions
 seek+0.298
ney+0.265
umpy+0.256
 than+0.250
and+0.235
Neg logit contributions
 walks-0.152
 stro-0.147
 hearts-0.144
 spr-0.144
 tul-0.144
Tokenab
Token ID397
Feature activation-8.13e-01

Pos logit contributions
 seek+0.058
ney+0.050
umpy+0.048
 than+0.046
and+0.044
Neg logit contributions
 walks-0.047
 spr-0.045
 hearts-0.045
 strokes-0.045
 tul-0.045
Tokenra
Token ID430
Feature activation-1.19e+00

Pos logit contributions
 seek+0.060
ney+0.052
 than+0.049
umpy+0.048
and+0.044
Neg logit contributions
 walks-0.038
 hearts-0.036
 stro-0.036
 tul-0.036
 spr-0.036
Token!"
Token ID2474
Feature activation-2.12e-01

Pos logit contributions
 seek+0.043
ney+0.030
umpy+0.028
 than+0.027
and+0.022
Neg logit contributions
 walks-0.099
 hearts-0.099
 tul-0.097
 spr-0.097
 strokes-0.097
Token
Token ID198
Feature activation-6.67e-02

Pos logit contributions
 seek+0.011
ney+0.009
umpy+0.008
 than+0.008
and+0.007
Neg logit contributions
 walks-0.015
 stro-0.015
 spr-0.015
 strokes-0.015
 hugs-0.015
Token saying
Token ID2282
Feature activation+1.29e-01

Pos logit contributions
 seek+0.003
ney+0.002
 than+0.002
and+0.002
 except+0.002
Neg logit contributions
irs-0.002
ience-0.002
 walks-0.002
 tul-0.002
 hearts-0.002
Token,
Token ID11
Feature activation+1.87e-01

Pos logit contributions
 walks+0.008
 spr+0.008
 hearts+0.008
 tul+0.008
ened+0.008
Neg logit contributions
 seek-0.007
ney-0.006
 than-0.006
umpy-0.006
and-0.005
Token "
Token ID366
Feature activation-6.65e-02

Pos logit contributions
irs+0.013
 walks+0.012
 hearts+0.012
ened+0.012
ience+0.012
Neg logit contributions
 seek-0.010
 than-0.008
ney-0.007
umpy-0.007
 but-0.007
TokenA
Token ID32
Feature activation+6.59e-01

Pos logit contributions
 seek+0.004
ney+0.003
umpy+0.003
 than+0.003
and+0.003
Neg logit contributions
 walks-0.004
 stro-0.004
 hearts-0.004
 spr-0.004
 tul-0.004
Tokenbr
Token ID1671
Feature activation+5.96e-02

Pos logit contributions
 walks+0.039
 hearts+0.037
 stro+0.037
 strokes+0.037
 spr+0.037
Neg logit contributions
 seek-0.041
ney-0.035
umpy-0.033
 than-0.032
and-0.030
Tokenac
Token ID330
Feature activation-3.71e+00

Pos logit contributions
 walks+0.005
 hearts+0.005
 spr+0.005
 strokes+0.005
 tul+0.005
Neg logit contributions
 seek-0.002
ney-0.002
umpy-0.001
 than-0.001
and-0.001
Tokenad
Token ID324
Feature activation-7.42e-01

Pos logit contributions
 seek+0.325
ney+0.286
umpy+0.278
 than+0.271
and+0.266
Neg logit contributions
 walks-0.126
 hearts-0.119
 spr-0.118
ened-0.117
 hugs-0.116
Tokenab
Token ID397
Feature activation-7.55e-01

Pos logit contributions
 seek+0.050
ney+0.042
umpy+0.040
 than+0.038
and+0.037
Neg logit contributions
 walks-0.040
 spr-0.038
 hearts-0.038
 strokes-0.038
 hugs-0.038
Tokenra
Token ID430
Feature activation-1.07e+00

Pos logit contributions
 seek+0.059
ney+0.052
umpy+0.050
 than+0.049
and+0.046
Neg logit contributions
 walks-0.032
 hearts-0.030
 spr-0.030
 tul-0.030
 strokes-0.030
Token!"
Token ID2474
Feature activation-1.21e-01

Pos logit contributions
 seek+0.049
ney+0.037
 than+0.035
umpy+0.035
and+0.029
Neg logit contributions
 hearts-0.079
 walks-0.079
 tul-0.078
 spr-0.077
 stro-0.077
Token
Token ID198
Feature activation+7.04e-02

Pos logit contributions
 seek+0.009
ney+0.007
rated+0.007
umpy+0.007
asc+0.007
Neg logit contributions
 strokes-0.007
 walks-0.006
 hugs-0.006
 hearts-0.006
 stro-0.006
Token said
Token ID531
Feature activation+1.81e-01

Pos logit contributions
 seek+0.031
ney+0.024
 than+0.024
and+0.022
umpy+0.022
Neg logit contributions
irs-0.025
 tul-0.025
 walks-0.025
 strokes-0.024
 spr-0.024
Token,
Token ID11
Feature activation+7.86e-02

Pos logit contributions
 walks+0.013
irs+0.013
 spr+0.013
ened+0.013
 stro+0.013
Neg logit contributions
 seek-0.008
 than-0.006
and-0.005
umpy-0.005
ney-0.005
Token "
Token ID366
Feature activation-8.53e-02

Pos logit contributions
 walks+0.005
irs+0.005
ened+0.005
 hearts+0.005
 er+0.005
Neg logit contributions
 seek-0.004
ney-0.003
 than-0.003
umpy-0.003
and-0.003
TokenA
Token ID32
Feature activation+5.73e-01

Pos logit contributions
 seek+0.004
ney+0.004
umpy+0.003
 than+0.003
and+0.003
Neg logit contributions
 walks-0.006
 hearts-0.006
 spr-0.006
 strokes-0.006
 tul-0.006
Tokenbr
Token ID1671
Feature activation+2.27e-01

Pos logit contributions
 walks+0.031
ened+0.031
 hearts+0.030
 stro+0.030
 spr+0.029
Neg logit contributions
 seek-0.040
ney-0.032
umpy-0.032
 than-0.030
and-0.029
Tokenac
Token ID330
Feature activation-3.71e+00

Pos logit contributions
 walks+0.020
 spr+0.019
 hearts+0.019
 strokes+0.019
 stro+0.019
Neg logit contributions
 seek-0.008
ney-0.006
umpy-0.005
 than-0.005
and-0.004
Tokenad
Token ID324
Feature activation-7.81e-01

Pos logit contributions
 seek+0.297
ney+0.264
umpy+0.256
 than+0.249
and+0.235
Neg logit contributions
 walks-0.151
 stro-0.145
 spr-0.144
 tul-0.143
 strokes-0.142
Tokenab
Token ID397
Feature activation-7.72e-01

Pos logit contributions
 seek+0.050
ney+0.042
umpy+0.040
 than+0.039
and+0.037
Neg logit contributions
 walks-0.044
 hearts-0.043
 spr-0.043
 strokes-0.042
 tul-0.042
Tokenra
Token ID430
Feature activation-1.11e+00

Pos logit contributions
 seek+0.056
ney+0.047
 than+0.045
umpy+0.044
kin+0.041
Neg logit contributions
 walks-0.038
 hearts-0.036
 tul-0.035
 stro-0.035
 spr-0.035
Token,
Token ID11
Feature activation-1.37e-01

Pos logit contributions
 seek+0.052
ney+0.040
 than+0.039
umpy+0.038
and+0.032
Neg logit contributions
 walks-0.080
 hearts-0.080
 tul-0.079
 spr-0.078
 strokes-0.078
Token make
Token ID787
Feature activation-4.82e-01

Pos logit contributions
 seek+0.008
 than+0.006
ney+0.006
umpy+0.005
 but+0.005
Neg logit contributions
ened-0.009
 walks-0.008
 stro-0.008
irs-0.008
 strokes-0.008
Token.
Token ID13
Feature activation+7.49e-02

Pos logit contributions
 seek+0.029
ney+0.022
 than+0.020
o+0.020
 but+0.019
Neg logit contributions
ened-0.055
 imag-0.054
 strokes-0.054
irs-0.054
 hearts-0.054
Token Nut
Token ID11959
Feature activation-4.76e-01

Pos logit contributions
 walks+0.005
 hearts+0.005
 spr+0.005
 strokes+0.005
 tul+0.005
Neg logit contributions
 seek-0.004
ney-0.003
umpy-0.003
 than-0.003
and-0.003
Tokenty
Token ID774
Feature activation-1.19e+00

Pos logit contributions
 seek+0.027
ney+0.023
umpy+0.022
 than+0.021
and+0.020
Neg logit contributions
 walks-0.031
 stro-0.029
 spr-0.029
 hugs-0.029
 strokes-0.029
Token showed
Token ID3751
Feature activation-3.40e-01

Pos logit contributions
 seek+0.067
ney+0.053
umpy+0.049
 than+0.048
Bob+0.046
Neg logit contributions
 hearts-0.076
 walks-0.075
irs-0.075
 tul-0.075
 strokes-0.074
Token Bub
Token ID19645
Feature activation-1.06e+00

Pos logit contributions
 seek+0.014
Bob+0.011
 than+0.011
iring+0.011
and+0.011
Neg logit contributions
 hearts-0.024
 tul-0.024
 spr-0.024
ience-0.023
 eyel-0.023
Tokenbles
Token ID7689
Feature activation-3.70e+00

Pos logit contributions
 seek+0.054
ney+0.046
 than+0.041
umpy+0.038
neys+0.037
Neg logit contributions
 walks-0.069
 er-0.069
 bows-0.068
 bl-0.068
 spr-0.067
Token a
Token ID257
Feature activation-9.36e-01

Pos logit contributions
 seek+0.214
 than+0.191
and+0.186
ney+0.177
umpy+0.173
Neg logit contributions
ened-0.209
 stro-0.208
ience-0.207
 walks-0.206
ian-0.206
Token shiny
Token ID22441
Feature activation-1.04e+00

Pos logit contributions
 seek+0.055
ney+0.047
umpy+0.045
 than+0.043
and+0.041
Neg logit contributions
 walks-0.057
 spr-0.055
 hearts-0.055
 strokes-0.055
 tul-0.055
Token shield
Token ID7614
Feature activation-1.45e+00

Pos logit contributions
 seek+0.062
 than+0.052
ney+0.050
umpy+0.047
iring+0.041
Neg logit contributions
 walks-0.065
irs-0.063
 stro-0.063
 hearts-0.062
 strokes-0.060
Token that
Token ID326
Feature activation-6.16e-01

Pos logit contributions
 seek+0.065
 than+0.055
ney+0.052
umpy+0.046
and+0.040
Neg logit contributions
irs-0.109
 hearts-0.108
 walks-0.107
 tul-0.107
 strokes-0.105
Token he
Token ID339
Feature activation-7.14e-01

Pos logit contributions
 seek+0.035
umpy+0.033
ney+0.032
 than+0.029
and+0.028
Neg logit contributions
 spr-0.037
 walks-0.037
 er-0.036
 stro-0.036
 bows-0.035
Token said
Token ID531
Feature activation+2.74e-01

Pos logit contributions
 walks+0.010
 spr+0.010
 hearts+0.010
 tul+0.009
 strokes+0.009
Neg logit contributions
 seek-0.013
ney-0.011
 than-0.010
umpy-0.010
and-0.010
Token,
Token ID11
Feature activation+2.22e-01

Pos logit contributions
 walks+0.018
 spr+0.017
 hearts+0.017
 stro+0.017
 strokes+0.017
Neg logit contributions
 seek-0.015
ney-0.012
 than-0.012
umpy-0.011
and-0.011
Token "
Token ID366
Feature activation-1.23e-01

Pos logit contributions
 walks+0.015
 hearts+0.014
ened+0.014
 strokes+0.014
 spr+0.014
Neg logit contributions
 seek-0.012
ney-0.009
 than-0.009
umpy-0.009
and-0.008
TokenA
Token ID32
Feature activation+6.68e-01

Pos logit contributions
 seek+0.007
ney+0.006
umpy+0.005
 than+0.005
and+0.005
Neg logit contributions
 walks-0.008
 spr-0.008
 hearts-0.008
 strokes-0.008
 stro-0.008
Tokenbr
Token ID1671
Feature activation+1.80e-01

Pos logit contributions
 walks+0.039
 hearts+0.038
ened+0.038
 stro+0.038
 spr+0.038
Neg logit contributions
 seek-0.043
umpy-0.034
ney-0.034
 than-0.032
and-0.030
Tokenac
Token ID330
Feature activation-3.70e+00

Pos logit contributions
 walks+0.016
 stro+0.016
 spr+0.015
 hearts+0.015
 strokes+0.015
Neg logit contributions
 seek-0.006
ney-0.004
umpy-0.004
 than-0.003
and-0.003
Tokenad
Token ID324
Feature activation-7.28e-01

Pos logit contributions
 seek+0.303
ney+0.270
umpy+0.261
 than+0.254
and+0.242
Neg logit contributions
 walks-0.145
 stro-0.137
 hearts-0.137
 spr-0.136
 strokes-0.136
Tokenab
Token ID397
Feature activation-7.22e-01

Pos logit contributions
 seek+0.049
ney+0.042
umpy+0.040
 than+0.038
iring+0.038
Neg logit contributions
 walks-0.038
 spr-0.036
 tul-0.036
 er-0.036
 hugs-0.036
Tokenra
Token ID430
Feature activation-1.08e+00

Pos logit contributions
 seek+0.054
ney+0.047
 than+0.044
umpy+0.044
and+0.040
Neg logit contributions
 walks-0.033
 hearts-0.032
 stro-0.031
 spr-0.031
 strokes-0.031
Token!
Token ID0
Feature activation-1.65e-02

Pos logit contributions
 seek+0.038
ney+0.028
umpy+0.025
 than+0.024
and+0.020
Neg logit contributions
 walks-0.092
 hearts-0.090
 spr-0.089
 tul-0.089
 strokes-0.089
Token I
Token ID314
Feature activation+2.97e-02

Pos logit contributions
 seek+0.001
ney+0.001
umpy+0.001
 than+0.001
and+0.001
Neg logit contributions
 walks-0.001
 tul-0.001
 stro-0.001
 hearts-0.001
 hugs-0.001
Token said
Token ID531
Feature activation+8.36e-02

Pos logit contributions
irs+0.000
 walks+0.000
 hearts+0.000
 tul+0.000
 eyel+0.000
Neg logit contributions
 seek-0.000
ney-0.000
 than-0.000
Bob-0.000
 boy-0.000
Token,
Token ID11
Feature activation+1.45e-01

Pos logit contributions
 walks+0.006
 hearts+0.006
irs+0.006
 spr+0.006
 er+0.006
Neg logit contributions
 seek-0.004
 than-0.003
and-0.003
ney-0.002
umpy-0.002
Token "
Token ID366
Feature activation-2.02e-01

Pos logit contributions
 walks+0.010
 hearts+0.010
ened+0.010
 bows+0.010
irs+0.010
Neg logit contributions
 seek-0.007
ney-0.005
 than-0.005
umpy-0.005
and-0.005
TokenA
Token ID32
Feature activation+6.40e-01

Pos logit contributions
 seek+0.011
ney+0.009
umpy+0.008
 than+0.008
and+0.007
Neg logit contributions
 walks-0.014
 strokes-0.013
 hearts-0.013
 spr-0.013
 hugs-0.013
Tokenbr
Token ID1671
Feature activation+7.67e-02

Pos logit contributions
 walks+0.032
 hearts+0.031
 stro+0.030
ened+0.030
 strokes+0.030
Neg logit contributions
 seek-0.046
ney-0.039
umpy-0.038
 than-0.037
and-0.035
Tokenac
Token ID330
Feature activation-3.68e+00

Pos logit contributions
 walks+0.007
 hearts+0.006
 spr+0.006
 strokes+0.006
 tul+0.006
Neg logit contributions
 seek-0.003
ney-0.002
umpy-0.002
 than-0.002
and-0.001
Tokenad
Token ID324
Feature activation-7.51e-01

Pos logit contributions
 seek+0.291
ney+0.261
umpy+0.253
 than+0.243
and+0.230
Neg logit contributions
 walks-0.153
 stro-0.148
 hearts-0.145
 strokes-0.144
 tul-0.144
Tokenab
Token ID397
Feature activation-7.02e-01

Pos logit contributions
 seek+0.048
ney+0.041
umpy+0.039
 than+0.038
and+0.036
Neg logit contributions
 walks-0.042
 spr-0.041
 hearts-0.041
 tul-0.040
 strokes-0.040
Tokenra
Token ID430
Feature activation-1.13e+00

Pos logit contributions
 seek+0.054
ney+0.047
umpy+0.044
 than+0.044
and+0.041
Neg logit contributions
 walks-0.031
 hearts-0.030
 strokes-0.029
 stro-0.029
 tul-0.029
Token!"
Token ID2474
Feature activation-6.82e-02

Pos logit contributions
 seek+0.048
ney+0.035
 than+0.033
umpy+0.033
and+0.027
Neg logit contributions
 hearts-0.088
 walks-0.087
 tul-0.085
 spr-0.085
 strokes-0.085
Token Suddenly
Token ID24975
Feature activation+7.74e-01

Pos logit contributions
 seek+0.003
ney+0.003
umpy+0.003
and+0.003
 than+0.003
Neg logit contributions
 walks-0.005
 tul-0.004
 hearts-0.004
 spr-0.004
ened-0.004
Token j
Token ID474
Feature activation+5.16e-01

Pos logit contributions
 walks+0.003
 stro+0.003
 spr+0.003
ened+0.003
 strokes+0.003
Neg logit contributions
 seek-0.003
ney-0.003
umpy-0.003
 than-0.003
and-0.003
Tokenogged
Token ID42545
Feature activation-6.35e-01

Pos logit contributions
 tul+0.027
 walks+0.027
 spr+0.026
 stro+0.026
 hearts+0.026
Neg logit contributions
 seek-0.034
umpy-0.032
ney-0.030
 than-0.029
and-0.028
Token,
Token ID11
Feature activation-2.23e-01

Pos logit contributions
 seek+0.037
ney+0.035
umpy+0.032
neys+0.029
aghetti+0.029
Neg logit contributions
 walks-0.038
 hearts-0.036
ened-0.035
 strokes-0.035
 bows-0.035
Token the
Token ID262
Feature activation-7.13e-01

Pos logit contributions
 seek+0.013
ney+0.011
umpy+0.011
 than+0.010
and+0.010
Neg logit contributions
 walks-0.014
 hearts-0.013
 spr-0.013
 strokes-0.013
 tul-0.013
Token hung
Token ID9174
Feature activation-1.37e+00

Pos logit contributions
 seek+0.047
ney+0.042
umpy+0.040
 than+0.039
and+0.038
Neg logit contributions
 walks-0.039
 hearts-0.037
 spr-0.037
 bows-0.036
 tul-0.036
Tokenrier
Token ID5277
Feature activation-3.67e+00

Pos logit contributions
 seek+0.080
ney+0.066
umpy+0.064
 than+0.063
and+0.057
Neg logit contributions
 walks-0.084
 hearts-0.081
 stro-0.080
 strokes-0.079
 hugs-0.079
Token she
Token ID673
Feature activation-9.24e-01

Pos logit contributions
 seek+0.203
ney+0.171
umpy+0.162
and+0.145
aghetti+0.143
Neg logit contributions
 walks-0.237
 spr-0.233
 hearts-0.232
 strokes-0.230
 tul-0.227
Token got
Token ID1392
Feature activation-4.58e-01

Pos logit contributions
 seek+0.060
ney+0.051
umpy+0.049
 than+0.047
and+0.045
Neg logit contributions
 walks-0.052
 hearts-0.049
 spr-0.049
 strokes-0.049
 stro-0.049
Token.
Token ID13
Feature activation-2.08e-01

Pos logit contributions
 seek+0.026
umpy+0.020
ney+0.020
 than+0.020
and+0.018
Neg logit contributions
 walks-0.029
 hearts-0.029
 tul-0.028
 spr-0.028
 er-0.028
Token Finally
Token ID9461
Feature activation-2.81e-01

Pos logit contributions
 seek+0.013
ney+0.012
umpy+0.011
 than+0.011
and+0.011
Neg logit contributions
 walks-0.011
 spr-0.011
 hearts-0.011
ened-0.011
 tul-0.011
Token,
Token ID11
Feature activation+1.69e-01

Pos logit contributions
 seek+0.016
 than+0.012
ney+0.012
umpy+0.011
Bob+0.010
Neg logit contributions
 walks-0.018
 hearts-0.017
 stro-0.017
 spr-0.017
 strokes-0.017
Token her
Token ID607
Feature activation+5.78e-03

Pos logit contributions
 seek+0.051
ney+0.044
 than+0.042
umpy+0.041
and+0.039
Neg logit contributions
 walks-0.053
 spr-0.053
 tul-0.052
ened-0.052
 hearts-0.051
Token arrow
Token ID15452
Feature activation+8.50e-01

Pos logit contributions
 tul+0.000
irs+0.000
 spr+0.000
 stro+0.000
 walks+0.000
Neg logit contributions
 seek-0.000
 than-0.000
ney-0.000
umpy-0.000
 boy-0.000
Token.
Token ID13
Feature activation-2.62e-01

Pos logit contributions
irs+0.065
ience+0.060
ened+0.060
 walks+0.059
 tul+0.059
Neg logit contributions
 seek-0.040
 than-0.032
ney-0.032
umpy-0.026
and-0.025
Token She
Token ID1375
Feature activation+3.23e-01

Pos logit contributions
 seek+0.016
ney+0.014
umpy+0.013
 than+0.013
and+0.012
Neg logit contributions
 walks-0.015
 hearts-0.015
ened-0.015
 spr-0.015
 tul-0.015
Token unt
Token ID1418
Feature activation-1.84e-01

Pos logit contributions
ened+0.018
urbed+0.018
irs+0.017
 breaths+0.017
chief+0.017
Neg logit contributions
 seek-0.018
 than-0.016
?"-0.014
ney-0.014
 boy-0.014
Tokenies
Token ID444
Feature activation-3.66e+00

Pos logit contributions
 seek+0.013
ney+0.011
umpy+0.011
 than+0.011
and+0.010
Neg logit contributions
 walks-0.009
 spr-0.009
 tul-0.009
 stro-0.009
 hearts-0.008
Token the
Token ID262
Feature activation-1.65e+00

Pos logit contributions
 seek+0.208
 than+0.179
ney+0.163
and+0.160
umpy+0.151
Neg logit contributions
 walks-0.220
 tul-0.217
 stro-0.214
ience-0.213
 spr-0.208
Token rope
Token ID17182
Feature activation-9.69e-01

Pos logit contributions
 seek+0.117
ney+0.101
 than+0.098
umpy+0.096
and+0.085
Neg logit contributions
 walks-0.081
 stro-0.080
 tul-0.079
 spr-0.077
 strokes-0.077
Token and
Token ID290
Feature activation-6.82e-01

Pos logit contributions
 seek+0.044
ney+0.036
 than+0.034
umpy+0.032
and+0.029
Neg logit contributions
 walks-0.072
 tul-0.070
 stro-0.070
 hearts-0.069
 spr-0.069
Token puts
Token ID7584
Feature activation+1.87e-02

Pos logit contributions
 seek+0.046
 than+0.037
 boy+0.033
 but+0.033
ney+0.032
Neg logit contributions
irs-0.033
ann-0.033
urbed-0.032
ened-0.032
ir-0.031
Token it
Token ID340
Feature activation+9.21e-01

Pos logit contributions
 spr+0.001
 stro+0.001
 tul+0.001
ience+0.001
 walks+0.001
Neg logit contributions
 seek-0.001
 than-0.001
Bob-0.001
ney-0.001
 there-0.001
Token.
Token ID13
Feature activation+1.28e-01

Pos logit contributions
 walks+0.044
 hearts+0.043
 spr+0.043
 strokes+0.042
ened+0.042
Neg logit contributions
 seek-0.046
ney-0.039
umpy-0.037
and-0.033
Bob-0.032
Token He
Token ID679
Feature activation-3.51e-01

Pos logit contributions
 walks+0.010
 hearts+0.010
 strokes+0.009
ened+0.009
 spr+0.009
Neg logit contributions
 seek-0.006
ney-0.004
umpy-0.004
 than-0.004
and-0.004
Token was
Token ID373
Feature activation-1.69e-01

Pos logit contributions
 seek+0.020
ney+0.016
umpy+0.015
 than+0.015
rated+0.013
Neg logit contributions
 tul-0.023
irs-0.022
 strokes-0.022
 hearts-0.022
 walks-0.022
Token even
Token ID772
Feature activation-1.61e+00

Pos logit contributions
 seek+0.008
ney+0.007
umpy+0.006
 than+0.006
and+0.006
Neg logit contributions
 walks-0.012
 strokes-0.012
 tul-0.012
 hearts-0.012
 bows-0.011
Token hung
Token ID9174
Feature activation-1.02e+00

Pos logit contributions
 seek+0.115
ney+0.099
umpy+0.095
 than+0.093
and+0.089
Neg logit contributions
 walks-0.078
 hearts-0.075
 strokes-0.074
 spr-0.074
 tul-0.074
Tokenrier
Token ID5277
Feature activation-3.65e+00

Pos logit contributions
 seek+0.062
ney+0.051
umpy+0.048
and+0.045
 than+0.045
Neg logit contributions
 walks-0.062
 spr-0.062
ened-0.061
 hearts-0.061
 strokes-0.061
Token than
Token ID621
Feature activation-1.28e+00

Pos logit contributions
 seek+0.251
umpy+0.204
ney+0.204
aghetti+0.198
Bob+0.198
Neg logit contributions
 hearts-0.186
 hugs-0.181
 spr-0.179
 walks-0.177
ened-0.175
Token before
Token ID878
Feature activation+7.31e-01

Pos logit contributions
 seek+0.090
ney+0.081
umpy+0.079
aking+0.070
aghetti+0.069
Neg logit contributions
 hearts-0.065
 walks-0.064
 hugs-0.061
 er-0.061
ened-0.061
Token.
Token ID13
Feature activation+5.44e-02

Pos logit contributions
 walks+0.040
 hearts+0.036
 spr+0.036
 strokes+0.036
ened+0.036
Neg logit contributions
 seek-0.051
ney-0.045
umpy-0.044
and-0.041
aking-0.040
Token But
Token ID887
Feature activation+3.59e-01

Pos logit contributions
 walks+0.004
 hearts+0.004
 strokes+0.004
 hugs+0.003
 spr+0.003
Neg logit contributions
 seek-0.003
ney-0.002
umpy-0.002
 than-0.002
and-0.002
Token then
Token ID788
Feature activation-5.36e-01

Pos logit contributions
 walks+0.020
 hearts+0.019
 spr+0.019
 stro+0.019
ened+0.019
Neg logit contributions
 seek-0.023
umpy-0.022
ney-0.021
and-0.019
aghetti-0.018
Token said
Token ID531
Feature activation-2.80e-01

Pos logit contributions
 seek+0.044
ney+0.038
umpy+0.036
 than+0.035
and+0.033
Neg logit contributions
 walks-0.036
 hearts-0.035
 spr-0.035
 tul-0.035
 strokes-0.035
Token it
Token ID340
Feature activation+1.61e-01

Pos logit contributions
 seek+0.018
ney+0.016
umpy+0.015
 than+0.015
and+0.014
Neg logit contributions
 walks-0.015
 hearts-0.015
 spr-0.015
 strokes-0.015
 tul-0.015
Token was
Token ID373
Feature activation+2.29e-01

Pos logit contributions
 walks+0.010
 spr+0.009
 hugs+0.009
 stro+0.009
ened+0.009
Neg logit contributions
 seek-0.010
ney-0.008
umpy-0.008
 than-0.007
and-0.007
Token ok
Token ID12876
Feature activation-1.35e+00

Pos logit contributions
 walks+0.013
 spr+0.013
 hearts+0.012
 er+0.012
 hugs+0.012
Neg logit contributions
 seek-0.014
ney-0.013
umpy-0.012
and-0.012
aking-0.011
Token as
Token ID355
Feature activation+3.78e-01

Pos logit contributions
 seek+0.104
umpy+0.094
ney+0.091
asc+0.087
asis+0.085
Neg logit contributions
 hugs-0.058
 hearts-0.058
 walks-0.057
ened-0.053
 strokes-0.052
Token long
Token ID890
Feature activation+1.94e+00

Pos logit contributions
 walks+0.020
 hearts+0.019
 hugs+0.019
 spr+0.018
 tul+0.018
Neg logit contributions
ney-0.026
 seek-0.025
umpy-0.024
aking-0.022
asc-0.021
Token as
Token ID355
Feature activation+1.20e+00

Pos logit contributions
 tul+0.114
 spr+0.111
 hearts+0.110
 strokes+0.109
irs+0.108
Neg logit contributions
 seek-0.115
 than-0.099
umpy-0.085
ney-0.084
Bob-0.080
Token he
Token ID339
Feature activation+1.26e+00

Pos logit contributions
 walks+0.067
 hearts+0.065
 hugs+0.063
 spr+0.060
ened+0.059
Neg logit contributions
ney-0.079
 seek-0.078
umpy-0.077
aking-0.067
asc-0.064
Token trusted
Token ID13467
Feature activation+6.23e-01

Pos logit contributions
 walks+0.080
 hearts+0.077
 spr+0.077
 tul+0.077
 strokes+0.077
Neg logit contributions
 seek-0.071
ney-0.060
umpy-0.056
 than-0.054
and-0.051
Token the
Token ID262
Feature activation+7.65e-02

Pos logit contributions
 walks+0.031
 hearts+0.029
 hugs+0.029
ened+0.027
 strokes+0.027
Neg logit contributions
 seek-0.046
umpy-0.043
ney-0.042
aking-0.038
asc-0.037
Token vegetable
Token ID20236
Feature activation+1.44e+00

Pos logit contributions
 walks+0.005
 hearts+0.005
 hugs+0.004
 spr+0.004
ened+0.004
Neg logit contributions
 seek-0.005
ney-0.004
umpy-0.004
and-0.004
 than-0.003
Token cord
Token ID15050
Feature activation+1.58e+00

Pos logit contributions
 seek+0.014
ney+0.012
umpy+0.011
 than+0.011
and+0.010
Neg logit contributions
 stro-0.013
 tul-0.012
 walks-0.012
 spr-0.012
 strokes-0.012
Token was
Token ID373
Feature activation+4.59e-01

Pos logit contributions
 walks+0.106
 tul+0.103
 hearts+0.103
 spr+0.102
 strokes+0.102
Neg logit contributions
 seek-0.084
ney-0.070
umpy-0.065
 than-0.063
and-0.057
Token to
Token ID284
Feature activation+2.86e-01

Pos logit contributions
 walks+0.032
 hearts+0.031
 spr+0.031
 strokes+0.031
 tul+0.031
Neg logit contributions
 seek-0.023
ney-0.019
umpy-0.018
 than-0.017
and-0.016
Token charge
Token ID3877
Feature activation-7.09e-01

Pos logit contributions
 walks+0.016
 strokes+0.016
 hearts+0.016
ience+0.016
ened+0.016
Neg logit contributions
 seek-0.020
ney-0.015
 than-0.015
umpy-0.014
 sunny-0.013
Token the
Token ID262
Feature activation+4.93e-02

Pos logit contributions
 seek+0.035
ney+0.028
umpy+0.026
 than+0.025
and+0.023
Neg logit contributions
 walks-0.051
 hearts-0.049
 strokes-0.049
 spr-0.049
 hugs-0.049
Token robot
Token ID9379
Feature activation+1.98e+00

Pos logit contributions
 stro+0.003
 walks+0.003
 tul+0.003
 spr+0.003
 strokes+0.003
Neg logit contributions
 seek-0.003
ney-0.003
 than-0.003
umpy-0.003
and-0.002
Token's
Token ID338
Feature activation+6.43e-01

Pos logit contributions
 walks+0.123
 tul+0.120
 hearts+0.119
 spr+0.119
 stro+0.118
Neg logit contributions
 seek-0.114
ney-0.096
umpy-0.091
 than-0.090
and-0.083
Token battery
Token ID6555
Feature activation+9.82e-01

Pos logit contributions
 stro+0.036
 tul+0.036
 walks+0.035
 spr+0.034
irs+0.034
Neg logit contributions
 seek-0.041
ney-0.035
 than-0.035
umpy-0.034
and-0.030
Token.
Token ID13
Feature activation-4.09e-02

Pos logit contributions
 walks+0.057
 hearts+0.055
 spr+0.055
 strokes+0.055
ened+0.055
Neg logit contributions
 seek-0.060
ney-0.051
umpy-0.049
 than-0.045
and-0.044
Token Tim
Token ID5045
Feature activation-7.84e-01

Pos logit contributions
 seek+0.002
ney+0.002
umpy+0.002
 than+0.002
and+0.002
Neg logit contributions
 walks-0.003
 hearts-0.003
 spr-0.003
 tul-0.003
 strokes-0.003
Tokenmy
Token ID1820
Feature activation-1.06e+00

Pos logit contributions
 seek+0.049
 than+0.038
umpy+0.038
ney+0.038
and+0.035
Neg logit contributions
 walks-0.046
 hugs-0.046
 er-0.045
ened-0.045
 hearts-0.044
Token the
Token ID262
Feature activation+6.09e-02

Pos logit contributions
 walks+0.046
 hearts+0.045
 spr+0.044
 strokes+0.044
 tul+0.044
Neg logit contributions
 seek-0.045
ney-0.038
umpy-0.036
 than-0.035
and-0.032
Token bag
Token ID6131
Feature activation-3.46e-01

Pos logit contributions
 walks+0.003
 hearts+0.003
 spr+0.003
 strokes+0.003
 tul+0.003
Neg logit contributions
 seek-0.004
ney-0.004
umpy-0.004
 than-0.004
and-0.003
Token together
Token ID1978
Feature activation+4.65e-01

Pos logit contributions
 seek+0.022
Dan+0.019
asc+0.018
Bob+0.018
ney+0.018
Neg logit contributions
 hugs-0.017
 hearts-0.016
 bl-0.016
 lov-0.015
 er-0.015
Token and
Token ID290
Feature activation+2.15e-01

Pos logit contributions
 hearts+0.027
 hugs+0.026
 walks+0.025
 spr+0.025
 strokes+0.025
Neg logit contributions
 seek-0.028
ney-0.023
umpy-0.023
asc-0.021
Jen-0.021
Token take
Token ID1011
Feature activation+1.21e+00

Pos logit contributions
 walks+0.012
 hearts+0.011
 spr+0.011
 stro+0.011
 hugs+0.011
Neg logit contributions
 seek-0.014
ney-0.013
umpy-0.012
 than-0.012
and-0.011
Token turns
Token ID4962
Feature activation+1.94e+00

Pos logit contributions
 spr+0.057
 tul+0.056
 stro+0.055
 walks+0.051
 er+0.050
Neg logit contributions
 seek-0.086
 than-0.073
ney-0.072
and-0.070
umpy-0.069
Token playing
Token ID2712
Feature activation+1.32e+00

Pos logit contributions
 walks+0.132
 hearts+0.130
 tul+0.130
 spr+0.130
ened+0.128
Neg logit contributions
 seek-0.100
ney-0.079
 than-0.077
umpy-0.076
and-0.070
Token with
Token ID351
Feature activation+8.07e-01

Pos logit contributions
 walks+0.093
 hearts+0.088
 hugs+0.086
 bows+0.086
 strokes+0.085
Neg logit contributions
 seek-0.066
ney-0.060
umpy-0.057
iring-0.050
rated-0.048
Token the
Token ID262
Feature activation+6.37e-01

Pos logit contributions
 walks+0.046
 stro+0.045
 spr+0.044
 strokes+0.044
 tul+0.044
Neg logit contributions
 seek-0.050
ney-0.041
 than-0.040
umpy-0.038
and-0.037
Token mar
Token ID1667
Feature activation+6.68e-01

Pos logit contributions
 walks+0.032
 hearts+0.031
 spr+0.030
 tul+0.030
 hugs+0.030
Neg logit contributions
 seek-0.045
ney-0.040
umpy-0.038
 than-0.036
and-0.036
Tokenbles
Token ID7689
Feature activation-1.55e+00

Pos logit contributions
 walks+0.050
 bows+0.048
 er+0.048
 hugs+0.048
 hearts+0.048
Neg logit contributions
 seek-0.030
ney-0.024
umpy-0.021
 than-0.021
rated-0.019
Token moon
Token ID8824
Feature activation+6.50e-01

Pos logit contributions
 seek+0.052
ney+0.045
umpy+0.044
 than+0.043
and+0.041
Neg logit contributions
 walks-0.025
 hearts-0.024
 spr-0.024
 strokes-0.024
 tul-0.023
Token until
Token ID1566
Feature activation+7.76e-01

Pos logit contributions
 walks+0.041
 tul+0.040
 hearts+0.040
 hugs+0.039
 strokes+0.039
Neg logit contributions
 seek-0.037
ney-0.031
 than-0.028
umpy-0.028
and-0.026
Token he
Token ID339
Feature activation+4.90e-01

Pos logit contributions
 walks+0.057
 tul+0.056
 strokes+0.056
 er+0.056
 stro+0.056
Neg logit contributions
 seek-0.036
ney-0.027
 than-0.025
umpy-0.024
Bob-0.022
Token could
Token ID714
Feature activation+2.27e-01

Pos logit contributions
 walks+0.028
 spr+0.027
 hearts+0.027
 stro+0.027
 strokes+0.027
Neg logit contributions
 seek-0.031
ney-0.026
umpy-0.025
 than-0.024
and-0.023
Token feel
Token ID1254
Feature activation+1.32e-01

Pos logit contributions
 walks+0.013
 spr+0.012
 strokes+0.012
 stro+0.012
 hearts+0.012
Neg logit contributions
 seek-0.015
ney-0.013
umpy-0.012
 than-0.012
and-0.011
Token its
Token ID663
Feature activation+2.10e+00

Pos logit contributions
 walks+0.008
ience+0.008
 er+0.008
 tul+0.008
 hearts+0.008
Neg logit contributions
 seek-0.007
 than-0.006
ney-0.006
umpy-0.006
and-0.005
Token light
Token ID1657
Feature activation+1.67e+00

Pos logit contributions
 walks+0.117
 stro+0.112
 tul+0.111
 hearts+0.111
 spr+0.111
Neg logit contributions
 seek-0.136
ney-0.117
umpy-0.111
 than-0.109
and-0.100
Token on
Token ID319
Feature activation+4.74e-01

Pos logit contributions
 walks+0.106
 hearts+0.102
 stro+0.102
 tul+0.102
 spr+0.101
Neg logit contributions
 seek-0.095
ney-0.080
umpy-0.075
 than-0.073
and-0.067
Token his
Token ID465
Feature activation+1.02e-01

Pos logit contributions
ience+0.035
 stro+0.035
 tul+0.034
 walks+0.034
 hugs+0.034
Neg logit contributions
 seek-0.021
 than-0.017
ney-0.016
umpy-0.014
Bob-0.013
Token face
Token ID1986
Feature activation+3.29e-01

Pos logit contributions
 walks+0.006
 stro+0.006
 spr+0.006
 tul+0.005
 strokes+0.005
Neg logit contributions
 seek-0.007
ney-0.006
 than-0.005
umpy-0.005
and-0.005
Token.
Token ID13
Feature activation-1.35e-01

Pos logit contributions
 walks+0.019
 spr+0.018
 hearts+0.018
 strokes+0.018
ened+0.018
Neg logit contributions
 seek-0.021
ney-0.018
umpy-0.017
 than-0.016
and-0.016
Token new
Token ID649
Feature activation+8.45e-01

Pos logit contributions
 walks+0.058
 tul+0.058
 spr+0.057
 stro+0.057
ened+0.057
Neg logit contributions
 seek-0.067
ney-0.057
 than-0.055
umpy-0.054
and-0.049
Token haircut
Token ID45548
Feature activation+5.24e-01

Pos logit contributions
 walks+0.035
irs+0.035
 spr+0.034
 stro+0.034
 tul+0.034
Neg logit contributions
 seek-0.066
ney-0.056
 than-0.056
umpy-0.054
and-0.050
Token and
Token ID290
Feature activation+3.33e-01

Pos logit contributions
 walks+0.028
 hearts+0.028
 er+0.028
 strokes+0.027
 hugs+0.027
Neg logit contributions
 seek-0.034
umpy-0.029
ney-0.029
Bob-0.026
aking-0.026
Token thanked
Token ID26280
Feature activation-2.76e-01

Pos logit contributions
 walks+0.018
 hugs+0.018
 stro+0.017
 hearts+0.017
 bows+0.017
Neg logit contributions
 seek-0.022
umpy-0.021
ney-0.021
and-0.019
aking-0.018
Token the
Token ID262
Feature activation+1.57e-01

Pos logit contributions
 seek+0.017
ney+0.015
umpy+0.014
 than+0.014
and+0.013
Neg logit contributions
 walks-0.016
 hearts-0.016
 spr-0.015
 strokes-0.015
 tul-0.015
Token bar
Token ID2318
Feature activation+2.42e+00

Pos logit contributions
 walks+0.007
 hearts+0.006
 hugs+0.006
 spr+0.006
 strokes+0.006
Neg logit contributions
 seek-0.012
ney-0.011
umpy-0.011
 than-0.010
and-0.010
Tokenber
Token ID527
Feature activation+3.74e-02

Pos logit contributions
 stro+0.169
 hearts+0.167
 hugs+0.166
 walks+0.164
 strokes+0.163
Neg logit contributions
 seek-0.124
ney-0.085
umpy-0.081
rated-0.080
Bob-0.071
Token.
Token ID13
Feature activation-1.13e-01

Pos logit contributions
 walks+0.002
 hearts+0.002
 strokes+0.002
 stro+0.002
 hugs+0.002
Neg logit contributions
 seek-0.002
ney-0.002
umpy-0.002
 than-0.002
and-0.002
Token
Token ID198
Feature activation-2.23e-01

Pos logit contributions
 seek+0.008
ney+0.006
umpy+0.006
 than+0.006
and+0.006
Neg logit contributions
 walks-0.006
 hugs-0.006
 hearts-0.006
 strokes-0.006
 stro-0.006
Token
Token ID198
Feature activation+4.51e-02

Pos logit contributions
 seek+0.017
 than+0.014
ney+0.014
umpy+0.013
rated+0.013
Neg logit contributions
 walks-0.010
irs-0.009
 hugs-0.009
 strokes-0.009
 stro-0.009
TokenAfter
Token ID3260
Feature activation+1.12e+00

Pos logit contributions
 walks+0.002
 spr+0.002
 er+0.002
 stro+0.002
 scrib+0.002
Neg logit contributions
 seek-0.003
ney-0.003
 than-0.002
umpy-0.002
neys-0.002
Token her
Token ID607
Feature activation-1.39e-01

Pos logit contributions
 seek+0.042
 than+0.036
umpy+0.035
and+0.034
ney+0.034
Neg logit contributions
 spr-0.023
ience-0.023
 stro-0.023
ened-0.022
 walks-0.022
Token mom
Token ID1995
Feature activation-9.96e-01

Pos logit contributions
 seek+0.009
ney+0.008
umpy+0.008
 than+0.007
and+0.007
Neg logit contributions
 walks-0.008
 hearts-0.007
 spr-0.007
 strokes-0.007
 tul-0.007
Token and
Token ID290
Feature activation+3.00e-01

Pos logit contributions
 seek+0.064
ney+0.052
umpy+0.051
 than+0.044
Bob+0.044
Neg logit contributions
 hearts-0.060
 hugs-0.060
 walks-0.059
 strokes-0.057
 er-0.056
Token dad
Token ID9955
Feature activation+3.06e-01

Pos logit contributions
 walks+0.019
 strokes+0.018
 spr+0.018
 tul+0.018
 hearts+0.018
Neg logit contributions
 seek-0.017
ney-0.014
 than-0.013
umpy-0.013
and-0.012
Token to
Token ID284
Feature activation+7.29e-01

Pos logit contributions
 walks+0.020
 hearts+0.019
 spr+0.019
 tul+0.019
 strokes+0.019
Neg logit contributions
 seek-0.017
ney-0.014
umpy-0.014
 than-0.013
and-0.012
Token take
Token ID1011
Feature activation+8.71e-01

Pos logit contributions
 walks+0.038
ened+0.038
 hearts+0.037
 spr+0.037
 tul+0.037
Neg logit contributions
 seek-0.049
ney-0.040
 than-0.038
umpy-0.038
and-0.035
Token away
Token ID1497
Feature activation+1.03e-02

Pos logit contributions
 spr+0.038
 tul+0.037
 stro+0.036
ience+0.034
 er+0.033
Neg logit contributions
 seek-0.063
and-0.054
 than-0.054
umpy-0.051
ney-0.051
Token any
Token ID597
Feature activation-7.54e-01

Pos logit contributions
 walks+0.000
 hearts+0.000
 strokes+0.000
 hugs+0.000
 stro+0.000
Neg logit contributions
 seek-0.001
ney-0.001
umpy-0.001
 than-0.001
and-0.001
Token bear
Token ID6842
Feature activation+9.31e-01

Pos logit contributions
 seek+0.045
ney+0.038
umpy+0.036
 than+0.035
and+0.033
Neg logit contributions
 walks-0.046
 hearts-0.044
 spr-0.044
 strokes-0.044
 tul-0.044
Token toys
Token ID14958
Feature activation-2.20e-02

Pos logit contributions
 walks+0.056
 stro+0.053
 hearts+0.053
 spr+0.053
 hugs+0.052
Neg logit contributions
 seek-0.056
ney-0.048
umpy-0.046
 than-0.042
iring-0.041
Token or
Token ID393
Feature activation-1.07e+00

Pos logit contributions
 seek+0.001
ney+0.001
umpy+0.001
rated+0.001
iring+0.001
Neg logit contributions
 walks-0.001
 spr-0.001
 hearts-0.001
 hugs-0.001
 stro-0.001
Token be
Token ID307
Feature activation+3.76e-02

Pos logit contributions
 stro+0.019
ted+0.019
 tul+0.019
 walks+0.018
ience+0.018
Neg logit contributions
 seek-0.023
ney-0.018
 than-0.017
umpy-0.016
 active-0.015
Token very
Token ID845
Feature activation-2.42e-01

Pos logit contributions
 walks+0.003
 tul+0.003
 spr+0.003
 stro+0.003
 strokes+0.003
Neg logit contributions
 seek-0.002
ney-0.001
 than-0.001
umpy-0.001
aking-0.001
Token careful
Token ID8161
Feature activation+3.06e-01

Pos logit contributions
 seek+0.010
ney+0.008
umpy+0.008
 than+0.007
and+0.006
Neg logit contributions
 walks-0.019
 hearts-0.018
 spr-0.018
 strokes-0.018
 tul-0.018
Token when
Token ID618
Feature activation+1.24e+00

Pos logit contributions
 walks+0.017
ened+0.017
 hearts+0.017
 hugs+0.016
 strokes+0.016
Neg logit contributions
 seek-0.019
ney-0.016
umpy-0.015
Bob-0.014
asis-0.014
Token using
Token ID1262
Feature activation+7.87e-01

Pos logit contributions
 walks+0.070
 hearts+0.067
 spr+0.066
 strokes+0.065
 tul+0.065
Neg logit contributions
 seek-0.080
ney-0.071
umpy-0.068
 than-0.063
and-0.061
Token the
Token ID262
Feature activation+7.80e-01

Pos logit contributions
 stro+0.051
 tul+0.050
 spr+0.048
ience+0.048
 walks+0.047
Neg logit contributions
 seek-0.041
 than-0.035
ney-0.031
 but-0.030
 there-0.029
Token tools
Token ID4899
Feature activation+1.09e+00

Pos logit contributions
 stro+0.044
 walks+0.042
 tul+0.042
irs+0.041
 spr+0.041
Neg logit contributions
 seek-0.050
ney-0.043
umpy-0.041
 than-0.041
and-0.036
Token.
Token ID13
Feature activation-1.22e-01

Pos logit contributions
 walks+0.064
 hearts+0.062
 spr+0.062
 strokes+0.061
ened+0.061
Neg logit contributions
 seek-0.067
ney-0.056
umpy-0.053
 than-0.050
and-0.048
Token
Token ID198
Feature activation-3.88e-01

Pos logit contributions
 seek+0.008
umpy+0.006
ney+0.006
 than+0.006
kin+0.006
Neg logit contributions
 walks-0.007
 hearts-0.007
 strokes-0.007
 hugs-0.007
 bows-0.007
Token
Token ID198
Feature activation-1.26e-01

Pos logit contributions
 seek+0.027
ney+0.022
rated+0.022
 than+0.022
kin+0.021
Neg logit contributions
 walks-0.020
irs-0.019
 strokes-0.018
 hugs-0.018
alled-0.017
TokenSteve
Token ID19206
Feature activation-2.35e-01

Pos logit contributions
 seek+0.007
ney+0.006
umpy+0.006
kin+0.006
tub+0.005
Neg logit contributions
 walks-0.008
 hearts-0.007
 strokes-0.007
 bows-0.007
alled-0.007
Token when
Token ID618
Feature activation+6.77e-01

Pos logit contributions
ney+0.064
 seek+0.062
umpy+0.062
asc+0.055
aking+0.055
Neg logit contributions
ened-0.046
 hearts-0.044
 walks-0.043
 hugs-0.042
 er-0.041
Token Jack
Token ID3619
Feature activation+4.08e-01

Pos logit contributions
 walks+0.037
 hearts+0.035
ened+0.034
 hugs+0.032
 strokes+0.032
Neg logit contributions
ney-0.046
umpy-0.044
 seek-0.043
and-0.040
aking-0.040
Token arrived
Token ID5284
Feature activation+6.75e-02

Pos logit contributions
 walks+0.020
 hearts+0.019
 spr+0.019
 strokes+0.018
 tul+0.018
Neg logit contributions
 seek-0.030
ney-0.026
umpy-0.025
 than-0.024
and-0.023
Token he
Token ID339
Feature activation-6.61e-01

Pos logit contributions
 walks+0.004
 hearts+0.004
 spr+0.004
 strokes+0.004
ened+0.004
Neg logit contributions
 seek-0.004
ney-0.004
umpy-0.003
 than-0.003
and-0.003
Token had
Token ID550
Feature activation-2.20e-01

Pos logit contributions
 seek+0.043
ney+0.037
umpy+0.036
 than+0.034
and+0.033
Neg logit contributions
 walks-0.037
 stro-0.035
 hearts-0.035
 spr-0.035
 hugs-0.034
Token a
Token ID257
Feature activation+4.89e-02

Pos logit contributions
 seek+0.013
ney+0.011
 than+0.010
umpy+0.010
and+0.009
Neg logit contributions
 tul-0.013
 spr-0.013
 strokes-0.013
 walks-0.013
 hearts-0.013
Token big
Token ID1263
Feature activation-3.01e-01

Pos logit contributions
ened+0.003
irs+0.003
ted+0.003
 stro+0.003
 strokes+0.003
Neg logit contributions
 seek-0.003
ney-0.002
 than-0.002
umpy-0.002
 sunny-0.002
Token smile
Token ID8212
Feature activation+6.03e-02

Pos logit contributions
 seek+0.020
ney+0.017
 than+0.017
umpy+0.015
 sunny+0.014
Neg logit contributions
irs-0.016
 stro-0.016
 tul-0.015
 strokes-0.015
 walks-0.015
Token on
Token ID319
Feature activation+5.74e-01

Pos logit contributions
 walks+0.004
 hearts+0.004
 spr+0.004
 strokes+0.004
ened+0.004
Neg logit contributions
 seek-0.003
ney-0.003
umpy-0.002
 than-0.002
and-0.002
Token his
Token ID465
Feature activation+1.21e-01

Pos logit contributions
 tul+0.044
 stro+0.043
 spr+0.043
 walks+0.043
ience+0.043
Neg logit contributions
 seek-0.025
 than-0.018
ney-0.018
umpy-0.017
Bob-0.014
Token face
Token ID1986
Feature activation+2.56e-01

Pos logit contributions
 walks+0.006
 hearts+0.006
 spr+0.006
 strokes+0.006
 tul+0.006
Neg logit contributions
 seek-0.009
ney-0.007
umpy-0.007
 than-0.007
and-0.007
Token have
Token ID423
Feature activation+9.65e-01

Pos logit contributions
ience+0.034
 stro+0.033
 hearts+0.033
ened+0.032
 bows+0.032
Neg logit contributions
 seek-0.036
ney-0.027
 than-0.026
umpy-0.025
 sunny-0.024
Token some
Token ID617
Feature activation+8.13e-01

Pos logit contributions
 walks+0.053
ened+0.053
 stro+0.053
 spr+0.053
 strokes+0.052
Neg logit contributions
 seek-0.062
ney-0.049
 than-0.048
umpy-0.045
Bob-0.043
Token y
Token ID331
Feature activation-1.40e-01

Pos logit contributions
 walks+0.048
 hearts+0.046
 spr+0.046
 strokes+0.046
 tul+0.046
Neg logit contributions
 seek-0.050
ney-0.043
umpy-0.041
 than-0.039
and-0.037
Tokenummy
Token ID13513
Feature activation-6.21e-01

Pos logit contributions
 seek+0.006
ney+0.005
umpy+0.005
 than+0.005
and+0.004
Neg logit contributions
 walks-0.010
 stro-0.010
 bows-0.010
 tul-0.010
 spr-0.010
Token pie
Token ID2508
Feature activation-9.63e-03

Pos logit contributions
 seek+0.040
ney+0.034
umpy+0.033
 than+0.031
and+0.030
Neg logit contributions
 walks-0.036
 hearts-0.035
 spr-0.034
 strokes-0.034
 hugs-0.034
Token.
Token ID13
Feature activation+5.26e-02

Pos logit contributions
 seek+0.001
umpy+0.001
ney+0.001
rated+0.001
iring+0.001
Neg logit contributions
 walks-0.000
 er-0.000
 hugs-0.000
 hearts-0.000
 tul-0.000
Token The
Token ID383
Feature activation+2.47e-01

Pos logit contributions
 hugs+0.003
 walks+0.002
 hearts+0.002
 strokes+0.002
 stro+0.002
Neg logit contributions
 seek-0.004
ney-0.003
umpy-0.003
 than-0.003
aking-0.003
Token end
Token ID886
Feature activation-2.80e-01

Pos logit contributions
 walks+0.013
 stro+0.013
ened+0.012
 strokes+0.012
 spr+0.012
Neg logit contributions
 seek-0.017
ney-0.015
umpy-0.014
 than-0.013
and-0.012
Token.
Token ID13
Feature activation-1.72e-01

Pos logit contributions
 seek+0.020
umpy+0.017
ney+0.017
iring+0.015
emo+0.015
Neg logit contributions
 walks-0.014
 strokes-0.014
 hugs-0.013
 spr-0.013
 hearts-0.013
Token<|endoftext|>
Token ID50256
Feature activation+9.98e-01

Pos logit contributions
 seek+0.012
umpy+0.010
ney+0.010
 than+0.010
rated+0.009
Neg logit contributions
 hugs-0.009
 hearts-0.009
 stro-0.009
 walks-0.008
 kisses-0.008
TokenOnce
Token ID7454
Feature activation-4.11e-03

Pos logit contributions
 walks+0.049
 hugs+0.044
 strokes+0.041
 hearts+0.041
ted+0.041
Neg logit contributions
 seek-0.075
 than-0.065
ney-0.063
rated-0.063
umpy-0.062
Token,
Token ID11
Feature activation-5.36e-03

Pos logit contributions
 seek+0.039
ney+0.036
umpy+0.034
 than+0.031
neys+0.030
Neg logit contributions
 walks-0.035
 spr-0.032
 bows-0.032
 strokes-0.032
 hugs-0.032
Token and
Token ID290
Feature activation-2.90e-01

Pos logit contributions
 seek+0.000
ney+0.000
umpy+0.000
iring+0.000
 than+0.000
Neg logit contributions
 walks-0.000
 hugs-0.000
 spr-0.000
 hearts-0.000
 stro-0.000
Token chase
Token ID15505
Feature activation+1.74e+00

Pos logit contributions
 seek+0.019
ney+0.018
umpy+0.018
 than+0.017
iring+0.017
Neg logit contributions
 walks-0.015
 hearts-0.014
 tul-0.014
 stro-0.014
 hugs-0.014
Token each
Token ID1123
Feature activation+2.34e-03

Pos logit contributions
 walks+0.086
 hearts+0.080
 strokes+0.078
 hugs+0.077
 bows+0.077
Neg logit contributions
 seek-0.131
ney-0.117
umpy-0.109
 than-0.101
iring-0.099
Token other
Token ID584
Feature activation+1.42e+00

Pos logit contributions
 walks+0.000
 stro+0.000
 spr+0.000
 tul+0.000
 hearts+0.000
Neg logit contributions
 seek-0.000
ney-0.000
umpy-0.000
 than-0.000
and-0.000
Token in
Token ID287
Feature activation+9.19e-01

Pos logit contributions
 walks+0.080
 hearts+0.073
 bows+0.073
 hugs+0.072
 er+0.072
Neg logit contributions
 seek-0.093
ney-0.081
umpy-0.079
iring-0.075
Jen-0.072
Token the
Token ID262
Feature activation+2.00e-01

Pos logit contributions
 walks+0.052
 stro+0.052
 spr+0.051
 tul+0.051
 strokes+0.051
Neg logit contributions
 seek-0.058
ney-0.048
umpy-0.046
 than-0.046
and-0.042
Token garden
Token ID11376
Feature activation+1.44e+00

Pos logit contributions
 stro+0.013
ened+0.013
 walks+0.012
 tul+0.012
irs+0.012
Neg logit contributions
 seek-0.011
ney-0.010
umpy-0.009
 than-0.008
aking-0.008
Token.
Token ID13
Feature activation-3.37e-02

Pos logit contributions
 walks+0.081
 hearts+0.079
 spr+0.078
 strokes+0.077
 hugs+0.076
Neg logit contributions
 seek-0.093
ney-0.079
umpy-0.077
 than-0.071
and-0.068
Token
Token ID198
Feature activation-3.11e-01

Pos logit contributions
 seek+0.002
umpy+0.002
ney+0.002
 than+0.002
iest+0.002
Neg logit contributions
 walks-0.002
 hearts-0.002
 hugs-0.002
 strokes-0.002
 stro-0.002
Token
Token ID198
Feature activation-2.81e-02

Pos logit contributions
 seek+0.021
ney+0.017
umpy+0.017
 than+0.017
rated+0.016
Neg logit contributions
 walks-0.017
 hugs-0.015
 strokes-0.015
irs-0.015
 hearts-0.014
Token They
Token ID1119
Feature activation+5.10e-01

Pos logit contributions
 seek+0.013
ney+0.011
umpy+0.011
 than+0.011
iest+0.010
Neg logit contributions
 walks-0.007
 spr-0.007
 hearts-0.007
 stro-0.007
 strokes-0.007
Token are
Token ID389
Feature activation+4.63e-01

Pos logit contributions
 walks+0.031
 hearts+0.031
 strokes+0.031
 tul+0.031
ened+0.031
Neg logit contributions
 seek-0.029
ney-0.024
umpy-0.023
 than-0.022
and-0.020
Token curious
Token ID11040
Feature activation+3.63e-01

Pos logit contributions
 walks+0.029
 spr+0.029
 hearts+0.028
 hugs+0.028
 tul+0.028
Neg logit contributions
 seek-0.027
ney-0.023
umpy-0.022
aghetti-0.021
and-0.020
Token.
Token ID13
Feature activation-2.09e-01

Pos logit contributions
 walks+0.023
 hearts+0.022
 hugs+0.021
 strokes+0.021
ened+0.021
Neg logit contributions
 seek-0.023
ney-0.019
umpy-0.018
 than-0.016
iring-0.016
Token They
Token ID1119
Feature activation+5.48e-01

Pos logit contributions
 seek+0.016
umpy+0.014
ney+0.014
rated+0.013
 than+0.013
Neg logit contributions
 hearts-0.010
 walks-0.009
 spr-0.009
 hugs-0.009
 stro-0.009
Token want
Token ID765
Feature activation-4.74e-01

Pos logit contributions
 hearts+0.038
 strokes+0.038
ened+0.037
 tul+0.037
 walks+0.037
Neg logit contributions
 seek-0.027
ney-0.021
umpy-0.020
 than-0.020
Bob-0.017
Token to
Token ID284
Feature activation+4.45e-01

Pos logit contributions
 seek+0.024
ney+0.020
 than+0.018
umpy+0.017
and+0.017
Neg logit contributions
 tul-0.032
 stro-0.032
ened-0.032
 walks-0.031
ience-0.031
Token see
Token ID766
Feature activation-1.23e+00

Pos logit contributions
 walks+0.025
 spr+0.024
 strokes+0.024
 hearts+0.023
 tul+0.023
Neg logit contributions
 seek-0.029
ney-0.025
umpy-0.024
 than-0.023
and-0.022
Token what
Token ID644
Feature activation+5.35e-01

Pos logit contributions
 seek+0.092
ney+0.080
umpy+0.078
 than+0.072
and+0.071
Neg logit contributions
 walks-0.060
 spr-0.056
 hearts-0.056
 hugs-0.056
 strokes-0.055
Token is
Token ID318
Feature activation+5.75e-01

Pos logit contributions
 walks+0.029
 hearts+0.027
 hugs+0.027
 spr+0.027
 bows+0.027
Neg logit contributions
 seek-0.037
umpy-0.032
ney-0.031
 than-0.029
and-0.029
Token in
Token ID287
Feature activation+5.67e-01

Pos logit contributions
 walks+0.038
 spr+0.037
 bows+0.036
 tul+0.036
 hearts+0.036
Neg logit contributions
 seek-0.032
ney-0.027
umpy-0.026
and-0.024
 than-0.023
Token needed
Token ID2622
Feature activation-3.49e-02

Pos logit contributions
 walks+0.031
 hugs+0.029
ened+0.029
 spr+0.029
 stro+0.029
Neg logit contributions
 seek-0.030
ney-0.025
umpy-0.024
 than-0.023
and-0.022
Token to
Token ID284
Feature activation+6.08e-01

Pos logit contributions
 seek+0.002
 than+0.001
ney+0.001
and+0.001
 late+0.001
Neg logit contributions
ience-0.003
ened-0.002
 tul-0.002
 spr-0.002
irs-0.002
Token go
Token ID467
Feature activation-6.01e-01

Pos logit contributions
 walks+0.031
 spr+0.029
 hearts+0.028
 strokes+0.028
 tul+0.028
Neg logit contributions
 seek-0.043
ney-0.038
umpy-0.037
 than-0.035
and-0.034
Token to
Token ID284
Feature activation+5.71e-01

Pos logit contributions
 seek+0.038
ney+0.033
umpy+0.032
 than+0.030
and+0.028
Neg logit contributions
 walks-0.036
 hearts-0.033
 hugs-0.033
 strokes-0.032
 spr-0.032
Token the
Token ID262
Feature activation-8.09e-01

Pos logit contributions
 stro+0.033
ened+0.033
 spr+0.032
 tul+0.032
 strokes+0.032
Neg logit contributions
 seek-0.036
ney-0.028
 than-0.027
umpy-0.027
and-0.025
Token bathroom
Token ID12436
Feature activation-1.59e-01

Pos logit contributions
 seek+0.051
ney+0.042
umpy+0.038
 sunny+0.036
 rink+0.036
Neg logit contributions
ened-0.053
 stro-0.049
 breaths-0.049
ience-0.048
 strokes-0.048
Token.
Token ID13
Feature activation-1.19e-01

Pos logit contributions
 seek+0.011
ney+0.009
umpy+0.009
Bob+0.008
 than+0.008
Neg logit contributions
 walks-0.009
 hearts-0.008
 spr-0.008
 strokes-0.008
 er-0.008
Token She
Token ID1375
Feature activation+2.63e-01

Pos logit contributions
 seek+0.008
ney+0.007
umpy+0.007
 than+0.006
and+0.006
Neg logit contributions
 walks-0.006
 stro-0.006
 tul-0.006
 spr-0.006
 hearts-0.006
Token went
Token ID1816
Feature activation-8.52e-01

Pos logit contributions
 tul+0.016
irs+0.016
 hearts+0.016
 strokes+0.015
 bows+0.015
Neg logit contributions
 seek-0.016
ney-0.014
umpy-0.012
 than-0.011
 active-0.011
Token to
Token ID284
Feature activation-9.65e-02

Pos logit contributions
 seek+0.047
ney+0.038
umpy+0.036
 than+0.035
and+0.033
Neg logit contributions
 walks-0.055
 hearts-0.055
 spr-0.054
 tul-0.054
ened-0.054
Token the
Token ID262
Feature activation-1.26e+00

Pos logit contributions
 seek+0.006
ney+0.005
 than+0.005
umpy+0.005
and+0.005
Neg logit contributions
ience-0.005
 hearts-0.005
ened-0.005
 stro-0.005
 tul-0.005
Token with
Token ID351
Feature activation-8.83e-01

Pos logit contributions
 walks+0.002
 hearts+0.002
 hugs+0.002
 strokes+0.002
 stro+0.002
Neg logit contributions
 seek-0.001
ney-0.001
umpy-0.001
rated-0.001
 than-0.001
Token the
Token ID262
Feature activation-4.99e-01

Pos logit contributions
 seek+0.051
ney+0.042
 than+0.041
umpy+0.038
and+0.037
Neg logit contributions
 tul-0.053
 stro-0.052
 walks-0.051
 spr-0.051
ened-0.050
Token doll
Token ID3654
Feature activation+2.88e-01

Pos logit contributions
 seek+0.032
ney+0.026
 than+0.025
umpy+0.025
 boy+0.022
Neg logit contributions
ened-0.028
 stro-0.028
 walks-0.028
irs-0.028
ience-0.027
Token some
Token ID617
Feature activation+8.43e-02

Pos logit contributions
 walks+0.019
 spr+0.018
 tul+0.018
ened+0.018
 strokes+0.018
Neg logit contributions
 seek-0.016
ney-0.013
umpy-0.012
 than-0.012
and-0.011
Token more
Token ID517
Feature activation-8.27e-01

Pos logit contributions
 stro+0.004
 tul+0.004
ened+0.004
irs+0.004
chief+0.004
Neg logit contributions
 seek-0.006
 than-0.005
ney-0.005
umpy-0.004
and-0.004
Token.
Token ID13
Feature activation-3.06e-01

Pos logit contributions
 seek+0.048
ney+0.040
umpy+0.038
 than+0.037
and+0.034
Neg logit contributions
 walks-0.052
 hearts-0.050
 spr-0.050
 tul-0.050
 strokes-0.050
Token They
Token ID1119
Feature activation+5.80e-01

Pos logit contributions
 seek+0.019
ney+0.016
umpy+0.016
 than+0.015
and+0.014
Neg logit contributions
 walks-0.018
 hearts-0.018
 strokes-0.018
 hugs-0.017
 spr-0.017
Token do
Token ID466
Feature activation-2.21e-01

Pos logit contributions
ened+0.038
 walks+0.036
 hearts+0.036
irs+0.036
 strokes+0.036
Neg logit contributions
 seek-0.033
ney-0.026
 than-0.024
umpy-0.024
neys-0.020
Token not
Token ID407
Feature activation-7.72e-01

Pos logit contributions
 seek+0.007
ney+0.005
 than+0.005
umpy+0.004
and+0.004
Neg logit contributions
 tul-0.019
ened-0.019
 stro-0.019
 spr-0.019
 walks-0.019
Token lose
Token ID4425
Feature activation-5.45e-01

Pos logit contributions
 seek+0.050
ney+0.043
umpy+0.041
 than+0.040
and+0.038
Neg logit contributions
 walks-0.043
 hearts-0.041
 spr-0.041
 strokes-0.041
 tul-0.041
Token her
Token ID607
Feature activation+3.26e-01

Pos logit contributions
 seek+0.038
ney+0.032
umpy+0.031
 than+0.030
and+0.029
Neg logit contributions
 walks-0.028
 spr-0.027
 hearts-0.027
 tul-0.027
 strokes-0.027
Token cat
Token ID3797
Feature activation+4.16e-01

Pos logit contributions
 walks+0.019
 stro+0.018
ened+0.018
 spr+0.018
 tul+0.018
Neg logit contributions
 seek-0.022
ney-0.018
umpy-0.018
 than-0.017
and-0.016
Token always
Token ID1464
Feature activation-1.55e+00

Pos logit contributions
 walks+0.025
ened+0.025
 hearts+0.025
 tul+0.025
 stro+0.025
Neg logit contributions
 seek-0.024
ney-0.020
 than-0.018
umpy-0.018
and-0.016
Token tries
Token ID8404
Feature activation+7.51e-01

Pos logit contributions
 seek+0.116
 than+0.098
ney+0.097
umpy+0.094
aking+0.087
Neg logit contributions
 hearts-0.065
 tul-0.065
ened-0.065
irs-0.065
 walks-0.063
Token to
Token ID284
Feature activation+5.43e-01

Pos logit contributions
 walks+0.047
 spr+0.046
 tul+0.045
 hearts+0.045
 stro+0.045
Neg logit contributions
 seek-0.043
ney-0.036
umpy-0.034
 than-0.034
and-0.031
Token catch
Token ID4929
Feature activation-2.34e-02

Pos logit contributions
 walks+0.028
 hearts+0.027
ened+0.027
 spr+0.027
 strokes+0.026
Neg logit contributions
 seek-0.038
ney-0.032
umpy-0.030
 than-0.030
and-0.028
Token the
Token ID262
Feature activation-5.33e-01

Pos logit contributions
 seek+0.002
ney+0.002
umpy+0.001
 than+0.001
and+0.001
Neg logit contributions
 walks-0.001
 hearts-0.001
 strokes-0.001
 hugs-0.001
 spr-0.001
Token mouse
Token ID10211
Feature activation+1.95e-01

Pos logit contributions
 seek+0.037
ney+0.032
umpy+0.030
 than+0.029
and+0.028
Neg logit contributions
 walks-0.028
 hearts-0.027
 strokes-0.026
 hugs-0.026
 spr-0.026
Token,
Token ID11
Feature activation-4.38e-02

Pos logit contributions
 walks+0.013
 hearts+0.012
 spr+0.012
 tul+0.012
 stro+0.012
Neg logit contributions
 seek-0.011
ney-0.009
umpy-0.009
 than-0.008
and-0.008
Token but
Token ID475
Feature activation-1.01e+00

Pos logit contributions
 seek+0.002
ney+0.001
umpy+0.001
 than+0.001
and+0.001
Neg logit contributions
 walks-0.004
 spr-0.004
 hearts-0.004
 stro-0.004
 strokes-0.004
Token the
Token ID262
Feature activation-3.55e-01

Pos logit contributions
ney+0.070
 seek+0.068
umpy+0.067
and+0.062
aking+0.058
Neg logit contributions
 hearts-0.051
 walks-0.050
 spr-0.049
 hugs-0.048
 er-0.048
Token mouse
Token ID10211
Feature activation+4.33e-01

Pos logit contributions
 seek+0.024
ney+0.020
umpy+0.019
 than+0.019
and+0.018
Neg logit contributions
 walks-0.019
 hearts-0.018
 spr-0.018
 strokes-0.018
 tul-0.018
Token
Token ID198
Feature activation+4.93e-01

Pos logit contributions
 walks+0.102
 strokes+0.099
 hugs+0.096
 spr+0.096
 hearts+0.095
Neg logit contributions
 seek-0.092
ney-0.072
umpy-0.068
 than-0.066
rated-0.060
Token
Token ID198
Feature activation+5.34e-01

Pos logit contributions
 walks+0.037
 hearts+0.036
 spr+0.035
 tul+0.035
 strokes+0.035
Neg logit contributions
 seek-0.023
ney-0.018
umpy-0.017
 than-0.016
and-0.015
Token"
Token ID1
Feature activation+1.15e+00

Pos logit contributions
ened+0.030
 walks+0.029
 spr+0.029
 tul+0.028
 stro+0.028
Neg logit contributions
 seek-0.036
ney-0.030
umpy-0.028
 than-0.028
and-0.027
TokenHow
Token ID2437
Feature activation+1.85e-01

Pos logit contributions
 walks+0.074
 tul+0.074
 spr+0.073
 stro+0.073
 hearts+0.073
Neg logit contributions
 seek-0.060
ney-0.053
umpy-0.050
 than-0.047
and-0.044
Token will
Token ID481
Feature activation-1.09e+00

Pos logit contributions
 spr+0.010
 walks+0.010
 hugs+0.010
 scrib+0.009
 strokes+0.009
Neg logit contributions
 seek-0.013
ney-0.010
keeping-0.010
Bob-0.010
 than-0.010
Token that
Token ID326
Feature activation-2.54e-01

Pos logit contributions
 seek+0.051
ney+0.039
 than+0.036
umpy+0.036
and+0.031
Neg logit contributions
 walks-0.082
 stro-0.079
 hearts-0.079
 strokes-0.079
 spr-0.078
Token help
Token ID1037
Feature activation+1.15e+00

Pos logit contributions
 seek+0.015
ney+0.014
umpy+0.014
and+0.014
asc+0.013
Neg logit contributions
 spr-0.014
 er-0.014
 comb-0.014
 bl-0.014
 walks-0.014
Token?"
Token ID1701
Feature activation+6.29e-01

Pos logit contributions
 er+0.055
 walks+0.054
 comb+0.053
 bows+0.052
 scrib+0.051
Neg logit contributions
 seek-0.078
ney-0.072
umpy-0.071
 than-0.065
asc-0.064
Token her
Token ID607
Feature activation+8.93e-01

Pos logit contributions
ience+0.037
irs+0.037
 tul+0.037
 walks+0.036
 spr+0.036
Neg logit contributions
 seek-0.038
 than-0.030
ney-0.029
umpy-0.027
and-0.026
Token mom
Token ID1995
Feature activation-5.54e-02

Pos logit contributions
 walks+0.052
 hearts+0.050
 spr+0.050
 strokes+0.050
 tul+0.049
Neg logit contributions
 seek-0.056
ney-0.048
umpy-0.045
 than-0.044
and-0.041
Token wondered
Token ID14028
Feature activation+7.72e-01

Pos logit contributions
 seek+0.004
ney+0.003
umpy+0.003
 than+0.003
and+0.003
Neg logit contributions
 walks-0.003
 hearts-0.003
 tul-0.003
 spr-0.003
 strokes-0.003
Token and
Token ID290
Feature activation+6.55e-01

Pos logit contributions
 seek+0.052
ney+0.044
umpy+0.041
 than+0.039
and+0.038
Neg logit contributions
 walks-0.058
 spr-0.056
 hearts-0.056
 stro-0.055
 strokes-0.055
Token the
Token ID262
Feature activation-2.15e-01

Pos logit contributions
 walks+0.039
 hearts+0.037
 spr+0.037
 strokes+0.037
 tul+0.037
Neg logit contributions
 seek-0.041
ney-0.034
umpy-0.033
 than-0.032
and-0.030
Token robot
Token ID9379
Feature activation+1.93e+00

Pos logit contributions
 seek+0.013
ney+0.011
umpy+0.011
 than+0.010
and+0.009
Neg logit contributions
 stro-0.013
 walks-0.013
 spr-0.013
 tul-0.013
 strokes-0.013
Token were
Token ID547
Feature activation+7.04e-01

Pos logit contributions
 tul+0.106
irs+0.105
ened+0.103
 walks+0.103
 strokes+0.102
Neg logit contributions
 seek-0.124
ney-0.108
 than-0.099
umpy-0.097
and-0.089
Token the
Token ID262
Feature activation+2.31e-01

Pos logit contributions
 walks+0.044
 hearts+0.042
 spr+0.042
 tul+0.042
 strokes+0.041
Neg logit contributions
 seek-0.042
ney-0.035
umpy-0.034
 than-0.032
and-0.030
Token best
Token ID1266
Feature activation-2.14e+00

Pos logit contributions
 walks+0.017
 hearts+0.017
 spr+0.017
 tul+0.017
 strokes+0.017
Neg logit contributions
 seek-0.011
ney-0.009
umpy-0.008
 than-0.007
and-0.007
Token of
Token ID286
Feature activation-1.07e+00

Pos logit contributions
 seek+0.096
ney+0.076
umpy+0.071
 than+0.067
and+0.061
Neg logit contributions
 walks-0.163
 hearts-0.159
 spr-0.159
 tul-0.158
 stro-0.158
Token friends
Token ID2460
Feature activation+3.83e-02

Pos logit contributions
 seek+0.047
ney+0.032
 than+0.032
umpy+0.031
and+0.028
Neg logit contributions
 spr-0.082
 stro-0.081
ened-0.081
 strokes-0.080
 walks-0.080
Token.
Token ID13
Feature activation-7.97e-02

Pos logit contributions
 walks+0.002
 hearts+0.002
 hugs+0.002
 stro+0.002
 strokes+0.002
Neg logit contributions
 seek-0.003
ney-0.002
umpy-0.002
rated-0.002
and-0.002
Token<|endoftext|>
Token ID50256
Feature activation+9.17e-01

Pos logit contributions
 seek+0.005
ney+0.004
umpy+0.004
 than+0.004
rated+0.004
Neg logit contributions
 hearts-0.005
 walks-0.005
 strokes-0.005
 hugs-0.005
 spr-0.005
TokenOnce
Token ID7454
Feature activation-2.26e-02

Pos logit contributions
 walks+0.045
 hugs+0.039
 hearts+0.037
 strokes+0.036
 spr+0.034
Neg logit contributions
 seek-0.070
rated-0.062
 than-0.059
umpy-0.058
ney-0.058
Token big
Token ID1263
Feature activation-5.30e-01

Pos logit contributions
 seek+0.000
ney+0.000
umpy+0.000
 than+0.000
and+0.000
Neg logit contributions
 walks-0.000
 spr-0.000
 tul-0.000
 hearts-0.000
 strokes-0.000
Token river
Token ID7850
Feature activation+1.53e-01

Pos logit contributions
 seek+0.032
ney+0.028
umpy+0.027
 than+0.025
and+0.025
Neg logit contributions
 walks-0.031
 spr-0.030
 hearts-0.030
 tul-0.030
 strokes-0.029
Token.
Token ID13
Feature activation-4.75e-02

Pos logit contributions
 walks+0.009
 spr+0.009
 hearts+0.009
 strokes+0.009
 stro+0.009
Neg logit contributions
 seek-0.009
ney-0.008
umpy-0.007
 than-0.007
and-0.007
Token He
Token ID679
Feature activation-3.84e-01

Pos logit contributions
 seek+0.003
ney+0.002
 than+0.002
umpy+0.002
kins+0.002
Neg logit contributions
 walks-0.003
 spr-0.003
 strokes-0.003
 hearts-0.003
 hugs-0.003
Token was
Token ID373
Feature activation-2.49e-01

Pos logit contributions
 seek+0.025
ney+0.021
umpy+0.021
 than+0.020
and+0.019
Neg logit contributions
 walks-0.022
 spr-0.021
 stro-0.021
 hearts-0.020
 strokes-0.020
Token so
Token ID523
Feature activation-2.10e+00

Pos logit contributions
 seek+0.014
ney+0.012
umpy+0.012
aghetti+0.011
and+0.011
Neg logit contributions
 walks-0.016
 spr-0.016
 stro-0.015
 tul-0.015
 hearts-0.015
Token small
Token ID1402
Feature activation+1.12e+00

Pos logit contributions
 seek+0.141
aghetti+0.126
ney+0.125
keeping+0.117
and+0.113
Neg logit contributions
 spr-0.106
 walks-0.097
 stro-0.095
 hugs-0.093
 fl-0.090
Token and
Token ID290
Feature activation+1.77e-01

Pos logit contributions
 walks+0.070
 spr+0.066
 hearts+0.064
ened+0.064
 hugs+0.064
Neg logit contributions
 seek-0.069
ney-0.059
umpy-0.057
and-0.050
rated-0.050
Token the
Token ID262
Feature activation+2.19e-01

Pos logit contributions
 walks+0.009
 spr+0.009
ened+0.009
 fl+0.009
 hugs+0.009
Neg logit contributions
umpy-0.010
and-0.010
ney-0.010
 seek-0.010
aghetti-0.009
Token river
Token ID7850
Feature activation+1.07e+00

Pos logit contributions
 walks+0.012
 spr+0.011
 hearts+0.011
 tul+0.011
 bows+0.011
Neg logit contributions
 seek-0.015
ney-0.013
umpy-0.012
and-0.012
 than-0.012
Token seemed
Token ID3947
Feature activation+5.07e-01

Pos logit contributions
 walks+0.062
 spr+0.061
ened+0.060
 strokes+0.059
 stro+0.059
Neg logit contributions
 seek-0.066
ney-0.055
umpy-0.054
and-0.050
 than-0.050
Token anything
Token ID1997
Feature activation-1.24e+00

Pos logit contributions
 seek+0.013
ney+0.011
umpy+0.011
 than+0.011
and+0.010
Neg logit contributions
 walks-0.012
 spr-0.012
 tul-0.012
 hearts-0.012
 strokes-0.011
Token bad
Token ID2089
Feature activation+3.68e-01

Pos logit contributions
 seek+0.072
ney+0.069
umpy+0.059
Jen+0.058
aghetti+0.056
Neg logit contributions
 walks-0.075
 er-0.072
ened-0.070
 hugs-0.070
 stro-0.070
Token again
Token ID757
Feature activation-2.72e-01

Pos logit contributions
 walks+0.022
 hearts+0.020
ened+0.020
 er+0.020
 bows+0.019
Neg logit contributions
ney-0.023
 seek-0.022
umpy-0.021
aking-0.019
neys-0.019
Token.
Token ID13
Feature activation-2.51e-01

Pos logit contributions
ney+0.020
 seek+0.020
asc+0.019
umpy+0.018
aghetti+0.018
Neg logit contributions
 walks-0.012
ened-0.010
 hearts-0.010
 strokes-0.009
 er-0.009
Token He
Token ID679
Feature activation-5.34e-01

Pos logit contributions
 seek+0.016
ney+0.012
umpy+0.012
 than+0.012
rated+0.012
Neg logit contributions
 hearts-0.015
 walks-0.015
 hugs-0.014
 strokes-0.014
 stro-0.014
Token found
Token ID1043
Feature activation-2.08e+00

Pos logit contributions
 seek+0.031
ney+0.026
umpy+0.024
 than+0.024
and+0.021
Neg logit contributions
 walks-0.033
 tul-0.032
 hearts-0.032
 strokes-0.032
 spr-0.032
Token his
Token ID465
Feature activation-8.66e-01

Pos logit contributions
 seek+0.133
ney+0.114
umpy+0.109
 than+0.106
and+0.101
Neg logit contributions
 walks-0.116
 spr-0.111
 hearts-0.111
 tul-0.111
 stro-0.110
Token happiness
Token ID12157
Feature activation-1.24e+00

Pos logit contributions
 seek+0.054
ney+0.046
umpy+0.044
 than+0.043
and+0.040
Neg logit contributions
 walks-0.050
 hearts-0.048
 spr-0.048
 strokes-0.048
 tul-0.047
Token with
Token ID351
Feature activation-5.79e-01

Pos logit contributions
 seek+0.072
ney+0.064
umpy+0.063
and+0.055
neys+0.053
Neg logit contributions
 walks-0.075
ened-0.074
 hearts-0.073
 strokes-0.073
 hugs-0.072
Token the
Token ID262
Feature activation-3.49e-01

Pos logit contributions
 seek+0.033
 than+0.027
ney+0.027
umpy+0.025
and+0.024
Neg logit contributions
 tul-0.035
 spr-0.035
 walks-0.034
 stro-0.033
 strokes-0.033
Token hug
Token ID16225
Feature activation+7.71e-01

Pos logit contributions
 seek+0.021
ney+0.018
umpy+0.017
 than+0.017
and+0.015
Neg logit contributions
 walks-0.021
 stro-0.020
 strokes-0.020
ened-0.020
 hearts-0.020
Token.
Token ID13
Feature activation-2.36e-01

Pos logit contributions
 seek+0.065
umpy+0.056
ney+0.053
asc+0.048
 than+0.048
Neg logit contributions
 walks-0.064
 er-0.062
 strokes-0.062
 hearts-0.061
 spr-0.061
Token It
Token ID632
Feature activation+8.23e-01

Pos logit contributions
 seek+0.012
ney+0.010
umpy+0.009
 than+0.009
and+0.008
Neg logit contributions
 walks-0.016
 strokes-0.016
 spr-0.016
 hearts-0.016
 hugs-0.016
Token had
Token ID550
Feature activation-4.37e-01

Pos logit contributions
 walks+0.051
 spr+0.050
 stro+0.049
 strokes+0.048
ened+0.048
Neg logit contributions
 seek-0.048
ney-0.041
umpy-0.040
 than-0.038
and-0.037
Token a
Token ID257
Feature activation-3.10e-02

Pos logit contributions
 seek+0.024
ney+0.020
 than+0.020
umpy+0.018
neys+0.017
Neg logit contributions
ience-0.026
 stro-0.026
irs-0.026
 tul-0.026
ishly-0.026
Token picture
Token ID4286
Feature activation-6.83e-02

Pos logit contributions
 seek+0.002
ney+0.002
 than+0.001
umpy+0.001
Bob+0.001
Neg logit contributions
 walks-0.002
ened-0.002
 stro-0.002
 hearts-0.002
 strokes-0.002
Token of
Token ID286
Feature activation-2.42e+00

Pos logit contributions
 seek+0.004
ney+0.003
umpy+0.003
 than+0.002
rated+0.002
Neg logit contributions
 walks-0.005
 er-0.005
 spr-0.005
 strokes-0.005
 hearts-0.005
Token him
Token ID683
Feature activation-1.27e+00

Pos logit contributions
 seek+0.150
ney+0.125
umpy+0.117
 than+0.117
and+0.109
Neg logit contributions
 walks-0.140
 spr-0.135
 stro-0.135
 strokes-0.134
 tul-0.134
Token inside
Token ID2641
Feature activation+3.34e-01

Pos logit contributions
 seek+0.076
ney+0.064
umpy+0.062
 than+0.056
and+0.056
Neg logit contributions
 walks-0.078
 hearts-0.075
 spr-0.074
 strokes-0.074
 stro-0.074
Token.
Token ID13
Feature activation-3.91e-01

Pos logit contributions
 walks+0.016
 hearts+0.016
 hugs+0.015
ened+0.015
 strokes+0.015
Neg logit contributions
 seek-0.025
ney-0.022
umpy-0.021
and-0.019
aking-0.019
Token Tim
Token ID5045
Feature activation-9.46e-01

Pos logit contributions
 seek+0.023
ney+0.018
 than+0.017
umpy+0.017
aking+0.016
Neg logit contributions
 walks-0.025
 spr-0.024
 strokes-0.024
 hugs-0.024
 hearts-0.024
Token thought
Token ID1807
Feature activation-3.71e-01

Pos logit contributions
 seek+0.058
ney+0.050
umpy+0.047
 than+0.046
and+0.043
Neg logit contributions
 walks-0.056
 hearts-0.054
 spr-0.053
 strokes-0.053
 stro-0.053
Token long
Token ID890
Feature activation-4.34e-02

Pos logit contributions
ened+0.041
 walks+0.041
 hearts+0.041
 strokes+0.040
 spr+0.040
Neg logit contributions
 seek-0.026
ney-0.018
umpy-0.017
 than-0.017
and-0.016
Token time
Token ID640
Feature activation-2.92e-01

Pos logit contributions
 seek+0.002
 than+0.001
ney+0.001
umpy+0.001
Bob+0.001
Neg logit contributions
 hearts-0.003
 stro-0.003
irs-0.003
 strokes-0.003
 spr-0.003
Token.
Token ID13
Feature activation-8.03e-02

Pos logit contributions
 seek+0.015
 than+0.012
ney+0.012
umpy+0.011
and+0.011
Neg logit contributions
 walks-0.020
ened-0.020
 spr-0.019
 strokes-0.019
 hearts-0.019
Token She
Token ID1375
Feature activation+3.32e-01

Pos logit contributions
 seek+0.005
ney+0.004
umpy+0.004
 than+0.004
and+0.004
Neg logit contributions
 walks-0.005
 spr-0.004
ened-0.004
 hearts-0.004
 tul-0.004
Token was
Token ID373
Feature activation-2.12e-01

Pos logit contributions
 walks+0.020
irs+0.020
 hearts+0.019
 tul+0.019
ened+0.019
Neg logit contributions
 seek-0.020
ney-0.016
 than-0.015
umpy-0.014
and-0.013
Token so
Token ID523
Feature activation-2.18e+00

Pos logit contributions
 seek+0.011
ney+0.009
umpy+0.008
 than+0.008
and+0.007
Neg logit contributions
 walks-0.015
 stro-0.014
 spr-0.014
 hearts-0.014
 tul-0.014
Token happy
Token ID3772
Feature activation+1.42e-01

Pos logit contributions
 seek+0.116
ney+0.099
umpy+0.088
 than+0.085
aghetti+0.082
Neg logit contributions
 walks-0.144
 stro-0.143
 spr-0.142
 hugs-0.141
 tul-0.140
Token to
Token ID284
Feature activation+9.54e-01

Pos logit contributions
 walks+0.008
 spr+0.007
 hearts+0.007
 strokes+0.007
 hugs+0.007
Neg logit contributions
 seek-0.009
ney-0.008
umpy-0.008
 than-0.007
and-0.007
Token have
Token ID423
Feature activation+6.29e-01

Pos logit contributions
 walks+0.063
 hearts+0.062
 strokes+0.061
ened+0.061
 spr+0.061
Neg logit contributions
 seek-0.053
ney-0.042
umpy-0.040
 than-0.039
and-0.034
Token seen
Token ID1775
Feature activation-1.72e+00

Pos logit contributions
 tul+0.037
 spr+0.036
 walks+0.036
 hearts+0.036
 strokes+0.035
Neg logit contributions
 seek-0.040
ney-0.031
 than-0.031
umpy-0.030
and-0.028
Token something
Token ID1223
Feature activation+5.22e-01

Pos logit contributions
 seek+0.103
 than+0.083
ney+0.082
umpy+0.078
and+0.076
Neg logit contributions
 walks-0.100
 spr-0.098
 tul-0.096
ened-0.096
 stro-0.096