TOP ACTIVATIONS
MAX = 8.640

know , her mum was fore shadow ing ... the next
When they arrived they were ast ounded , the land was
." Billy was ast ounded ! He used the
She could feel the masse use 's warm hands pressing
lived a little girl named Gab by . She woke up
The girl had been fore shadow ed . Even though
the heavy bag had been fore shadow ed , and that
friend , a dog called Tri x . Mill ie wanted
friend ?" The tou can nods its head .
yard . He had been fore shadow ed about this bad
day she had to balance prec ar iously on her thin
ipop , it 's a sav our y l oll ipop
The bad ending was fore shadow ed by Maggie 's
forest and he was known far and wide for being a
there was a girl named Gab by . She was three
was a little girl named Gab by who liked to roll
ushed and thanked the woman prof use ly . The woman
eyes . She had been fore shadow ed about the power
's okay , my d arl ings . I forgive you
there was a girl named Far ah . She liked playing

MIN ACTIVATIONS
MIN = -7.873

! Let â s ride on ! â
different colors and sizes . Some are red , some are
. Let â s bake the cake and celebrate
Tim my started to mis be have . He didn 't
cannot draw on everything . Some things are not for drawing
a lot of toys . Some are old and some are
swimming and spl ashing . Some fish were big and some
different colors and shapes . Some looked like stars , some
play in the pool . Some things are dangerous ." <|endoftext|>
of what you eat . Some things are not good for
their moms and dads . Some kids jump on a tr
trains at the station . Some are fast and some are
and shells and coral . Some of the coral was broken
things in the universe . Some of them are very far
and looked at them . Some of them were still dry
They see many flowers . Some are red , some are
many pictures and sculptures . Some are colorful , some are
of what you eat . Some things are not good for
fall on the ground . Some tomatoes break . They make
things on the floor . Some are broken . Some are

INTERVAL 4.512 - 8.640
CONTAINS 0.228%

my decided to take a walk in the woods . He
there was a little girl named Lily . She loved music
, there was a girl named Pip . She was only
there was a little girl named Lily . She loved to
Then he heard a voice from behind him . " Maybe

INTERVAL 0.383 - 4.512
CONTAINS 40.106%

. " Hello , kids , I am here
when you lose something , it 's important to keep looking
he filled Bob 's bag with blue ink .
, there was a cater pillar named Cindy . She loved
what his mom had told him . She said real knives

INTERVAL -3.745 - 0.383
CONTAINS 59.130%

meet a new friend . One day , Tom went to
ia opens her present next . She sees a big doll
no bananas were to be found ! Jack had a sad
put the bread on the ground with her makeup ! The
was brave to explore the backyard and go right up to

INTERVAL -7.873 - -3.745
CONTAINS 0.536%

. " Let 's give it a special hug
He then got up and started winding his way through the
climbed a mountain , and even flew to the moon .
handle of the cart and began to push it . The
like bells and ho of s ." Anna said .
Token know
Token ID760
Feature activation-6.79e-01

Pos logit contributions
 sle+0.023
 shortcut+0.021
 spe+0.020
 grizz+0.020
oc+0.019
Neg logit contributions
 herself-0.025
 themselves-0.021
ena-0.021
iece-0.020
 shapes-0.020
Token,
Token ID11
Feature activation-7.28e-01

Pos logit contributions
 herself+0.017
ena+0.017
mons+0.016
iece+0.015
 themselves+0.015
Neg logit contributions
 sle-0.016
 gru-0.015
 shortcut-0.015
 spe-0.014
run-0.014
Token her
Token ID607
Feature activation-1.63e-01

Pos logit contributions
 herself+0.018
ena+0.017
 themselves+0.016
 shapes+0.015
 brushes+0.015
Neg logit contributions
 sle-0.020
 spe-0.019
 shortcut-0.018
 grizz-0.018
 gru-0.017
Token mum
Token ID25682
Feature activation+2.76e-01

Pos logit contributions
 herself+0.004
ena+0.003
 themselves+0.003
iece+0.003
 shapes+0.003
Neg logit contributions
 sle-0.005
 shortcut-0.005
 spe-0.005
 grizz-0.005
 gru-0.005
Token was
Token ID373
Feature activation-5.41e-02

Pos logit contributions
 sle+0.007
 grizz+0.006
 spe+0.006
oc+0.006
 gru+0.006
Neg logit contributions
 herself-0.008
 themselves-0.007
 shapes-0.006
ena-0.006
iece-0.006
Token fore
Token ID1674
Feature activation+8.64e+00

Pos logit contributions
 herself+0.002
 themselves+0.001
 shapes+0.001
ena+0.001
 brushes+0.001
Neg logit contributions
 sle-0.001
 spe-0.001
 gru-0.001
 grizz-0.001
 shortcut-0.001
Tokenshadow
Token ID19106
Feature activation-1.03e+00

Pos logit contributions
 sle+0.159
 grizz+0.138
 spe+0.136
 nut+0.134
 polar+0.127
Neg logit contributions
 herself-0.316
ena-0.277
 themselves-0.269
ups-0.264
 when-0.259
Tokening
Token ID278
Feature activation-1.38e+00

Pos logit contributions
 herself+0.024
 brushes+0.020
 styles+0.020
 shapes+0.020
 themselves+0.020
Neg logit contributions
 sle-0.029
 victorious-0.027
 grizz-0.027
run-0.027
 gru-0.026
Token...
Token ID986
Feature activation-2.12e+00

Pos logit contributions
 herself+0.036
ena+0.033
 brushes+0.031
 shapes+0.030
 themselves+0.030
Neg logit contributions
 sle-0.036
 shortcut-0.032
 grizz-0.032
 spe-0.032
 gru-0.032
Tokenthe
Token ID1169
Feature activation-6.20e-01

Pos logit contributions
 herself+0.053
ena+0.048
 shapes+0.047
 themselves+0.046
 styles+0.045
Neg logit contributions
 sle-0.056
 spe-0.052
run-0.052
oc-0.049
 shortcut-0.049
Token next
Token ID1306
Feature activation+1.20e+00

Pos logit contributions
 herself+0.019
 themselves+0.017
ena+0.016
iece+0.015
 shapes+0.015
Neg logit contributions
 sle-0.014
 shortcut-0.014
 spe-0.013
 grizz-0.013
 tow-0.013
Token When
Token ID1649
Feature activation-2.21e+00

Pos logit contributions
 sle+0.019
 spe+0.017
 grizz+0.017
 shortcut+0.016
 gru+0.016
Neg logit contributions
 herself-0.019
 themselves-0.016
ena-0.016
 shapes-0.015
 brushes-0.015
Token they
Token ID484
Feature activation-1.38e+00

Pos logit contributions
 herself+0.054
ena+0.043
 themselves+0.040
 shapes+0.040
 styles+0.039
Neg logit contributions
 sle-0.070
 spe-0.063
 shortcut-0.063
 grizz-0.061
 gru-0.059
Token arrived
Token ID5284
Feature activation+1.39e+00

Pos logit contributions
 herself+0.035
ena+0.028
 themselves+0.027
 when+0.027
 her+0.027
Neg logit contributions
 sle-0.043
 grizz-0.039
 gru-0.039
 spe-0.037
 shortcut-0.036
Token they
Token ID484
Feature activation+3.79e-01

Pos logit contributions
 sle+0.039
 shortcut+0.034
 grizz+0.033
 spe+0.033
 gru+0.033
Neg logit contributions
 herself-0.039
ena-0.033
 shapes-0.031
 styles-0.031
 themselves-0.031
Token were
Token ID547
Feature activation-1.28e+00

Pos logit contributions
 sle+0.010
 spe+0.008
 grizz+0.008
 shortcut+0.008
oc+0.008
Neg logit contributions
 herself-0.012
ena-0.010
 themselves-0.010
 shapes-0.009
 styles-0.009
Token ast
Token ID6468
Feature activation+7.97e+00

Pos logit contributions
 herself+0.037
 themselves+0.031
ena+0.031
 styles+0.031
 shapes+0.030
Neg logit contributions
 sle-0.034
 grizz-0.030
 spe-0.030
 shortcut-0.030
 gru-0.029
Tokenounded
Token ID6302
Feature activation-7.66e-01

Pos logit contributions
 sle+0.152
 spe+0.127
 grizz+0.122
 polar+0.113
oc+0.112
Neg logit contributions
 herself-0.297
ena-0.262
 themselves-0.254
 brushes-0.252
 shapes-0.251
Token,
Token ID11
Feature activation-4.14e-01

Pos logit contributions
 herself+0.022
ena+0.018
 brushes+0.017
 styles+0.017
 shapes+0.017
Neg logit contributions
 sle-0.020
 spe-0.018
 shortcut-0.018
 grizz-0.017
 gru-0.017
Token the
Token ID262
Feature activation+1.14e+00

Pos logit contributions
 herself+0.011
ena+0.010
 styles+0.009
iece+0.009
ina+0.009
Neg logit contributions
 sle-0.011
 shortcut-0.010
 gru-0.010
 spe-0.010
 grizz-0.010
Token land
Token ID1956
Feature activation+9.07e-01

Pos logit contributions
 sle+0.030
 spe+0.026
 shortcut+0.025
 grizz+0.025
 hunter+0.024
Neg logit contributions
 herself-0.036
ena-0.031
 themselves-0.030
iece-0.029
 brushes-0.028
Token was
Token ID373
Feature activation-9.51e-01

Pos logit contributions
 sle+0.024
 spe+0.022
 shortcut+0.021
 grizz+0.021
 gru+0.020
Neg logit contributions
 herself-0.027
ena-0.021
 themselves-0.021
 styles-0.021
 shapes-0.020
Token."
Token ID526
Feature activation-1.79e+00

Pos logit contributions
 herself+0.044
ena+0.037
 themselves+0.036
 shapes+0.035
 brushes+0.034
Neg logit contributions
 sle-0.042
 spe-0.037
 shortcut-0.037
 grizz-0.036
oc-0.035
Token
Token ID198
Feature activation+6.00e-01

Pos logit contributions
 herself+0.054
 themselves+0.044
ena+0.044
 shapes+0.042
 brushes+0.041
Neg logit contributions
 sle-0.048
 spe-0.042
 grizz-0.041
 shortcut-0.041
 gru-0.039
Token
Token ID198
Feature activation+1.22e+00

Pos logit contributions
 sle+0.020
 spe+0.019
 grizz+0.018
 shortcut+0.017
 gru+0.017
Neg logit contributions
 herself-0.015
 themselves-0.012
ena-0.011
 when-0.010
 shapes-0.010
TokenBilly
Token ID49640
Feature activation-3.14e-01

Pos logit contributions
 sle+0.030
 spe+0.026
 grizz+0.026
 shortcut+0.025
 gru+0.024
Neg logit contributions
 herself-0.039
 themselves-0.033
ena-0.033
 shapes-0.031
 brushes-0.031
Token was
Token ID373
Feature activation-1.26e+00

Pos logit contributions
 herself+0.009
 themselves+0.008
iece+0.008
 styles+0.008
ena+0.008
Neg logit contributions
oc-0.007
 nut-0.006
 grizz-0.006
 sle-0.006
 spe-0.006
Token ast
Token ID6468
Feature activation+7.75e+00

Pos logit contributions
 herself+0.035
 brushes+0.031
ena+0.031
 styles+0.030
 shapes+0.030
Neg logit contributions
 sle-0.031
 gru-0.028
 spe-0.028
 grizz-0.028
 victorious-0.028
Tokenounded
Token ID6302
Feature activation-4.59e-01

Pos logit contributions
 sle+0.181
 spe+0.155
 grizz+0.150
 polar+0.146
 nut+0.137
Neg logit contributions
 herself-0.252
ena-0.226
 brushes-0.217
ups-0.214
 themselves-0.213
Token!
Token ID0
Feature activation+1.03e+00

Pos logit contributions
 herself+0.012
ena+0.011
 brushes+0.010
 shapes+0.010
 themselves+0.010
Neg logit contributions
 sle-0.012
 grizz-0.011
 spe-0.011
 shortcut-0.011
 gru-0.011
Token He
Token ID679
Feature activation+3.23e-01

Pos logit contributions
 sle+0.024
 spe+0.022
 shortcut+0.019
 grizz+0.019
use+0.019
Neg logit contributions
 herself-0.035
 themselves-0.029
ena-0.028
 shapes-0.028
 brushes-0.027
Token used
Token ID973
Feature activation-1.08e+00

Pos logit contributions
 sle+0.008
oc+0.008
 grizz+0.008
 spe+0.008
 shortcut+0.007
Neg logit contributions
 herself-0.009
 themselves-0.008
ena-0.008
iece-0.007
 styles-0.007
Token the
Token ID262
Feature activation+5.17e-01

Pos logit contributions
 herself+0.028
ena+0.024
 themselves+0.022
 shapes+0.022
 styles+0.022
Neg logit contributions
 sle-0.031
 spe-0.028
 shortcut-0.028
 grizz-0.028
 gru-0.026
Token
Token ID220
Feature activation-2.11e-01

Pos logit contributions
 sle+0.012
 grizz+0.011
 spe+0.011
 gru+0.010
run+0.010
Neg logit contributions
 herself-0.016
ena-0.015
 shapes-0.014
 themselves-0.014
 styles-0.014
Token She
Token ID1375
Feature activation+3.61e-01

Pos logit contributions
 herself+0.007
ena+0.006
 shapes+0.006
 styles+0.006
 themselves+0.006
Neg logit contributions
 sle-0.004
 shortcut-0.004
run-0.004
 grizz-0.004
 spe-0.004
Token could
Token ID714
Feature activation-2.88e+00

Pos logit contributions
 sle+0.009
 spe+0.009
oc+0.008
 grizz+0.008
 nut+0.008
Neg logit contributions
 herself-0.011
 themselves-0.009
iece-0.009
 shapes-0.009
ena-0.009
Token feel
Token ID1254
Feature activation-7.54e-02

Pos logit contributions
 herself+0.078
ena+0.071
 shapes+0.069
 themselves+0.069
iece+0.068
Neg logit contributions
 sle-0.077
 spe-0.067
 shortcut-0.065
 grizz-0.064
 nut-0.063
Token the
Token ID262
Feature activation+1.34e+00

Pos logit contributions
 herself+0.002
ena+0.002
 themselves+0.002
 shapes+0.002
 brushes+0.002
Neg logit contributions
 sle-0.002
 spe-0.002
 grizz-0.002
 shortcut-0.002
 gru-0.002
Token masse
Token ID41869
Feature activation+7.74e+00

Pos logit contributions
 sle+0.034
 spe+0.029
 shortcut+0.029
 grizz+0.028
 nut+0.027
Neg logit contributions
 herself-0.041
ena-0.038
 themselves-0.036
iece-0.034
 brushes-0.033
Tokenuse
Token ID1904
Feature activation+1.22e+00

Pos logit contributions
 sle+0.125
 shortcut+0.099
 grizz+0.091
 spe+0.090
 nut+0.087
Neg logit contributions
 herself-0.312
ena-0.270
 themselves-0.267
 styles-0.259
 shapes-0.258
Token's
Token ID338
Feature activation-2.09e+00

Pos logit contributions
 sle+0.036
 spe+0.032
oc+0.031
 grizz+0.031
 shortcut+0.031
Neg logit contributions
 herself-0.031
 themselves-0.026
ena-0.026
 brushes-0.024
 shapes-0.024
Token warm
Token ID5814
Feature activation+1.01e+00

Pos logit contributions
 herself+0.052
iece+0.046
ena+0.046
 themselves+0.044
 brushes+0.043
Neg logit contributions
 sle-0.064
 grizz-0.056
 spe-0.055
 polar-0.053
 shortcut-0.052
Token hands
Token ID2832
Feature activation+6.74e-01

Pos logit contributions
 sle+0.029
 grizz+0.025
 shortcut+0.024
 spe+0.024
 nut+0.023
Neg logit contributions
 herself-0.027
ena-0.025
 themselves-0.024
iece-0.022
 brushes-0.022
Token pressing
Token ID12273
Feature activation-9.65e-01

Pos logit contributions
 sle+0.018
 spe+0.016
 grizz+0.015
 shortcut+0.015
run+0.015
Neg logit contributions
 herself-0.018
ena-0.016
 brushes-0.015
 themselves-0.015
 shapes-0.015
Token lived
Token ID5615
Feature activation-1.66e-01

Pos logit contributions
 sle+0.043
 grizz+0.038
 spe+0.038
 shortcut+0.038
 gru+0.037
Neg logit contributions
 herself-0.027
ena-0.021
 themselves-0.021
 shapes-0.020
 brushes-0.018
Token a
Token ID257
Feature activation+1.48e+00

Pos logit contributions
 herself+0.003
ena+0.002
 themselves+0.002
 shapes+0.002
 styles+0.002
Neg logit contributions
 sle-0.006
 spe-0.006
 grizz-0.006
 shortcut-0.006
oc-0.006
Token little
Token ID1310
Feature activation+1.64e+00

Pos logit contributions
run+0.046
 sle+0.046
 shortcut+0.043
use+0.042
 grizz+0.042
Neg logit contributions
 herself-0.030
 shapes-0.028
ena-0.027
 when-0.026
 hair-0.025
Token girl
Token ID2576
Feature activation+2.10e+00

Pos logit contributions
 sle+0.049
 shortcut+0.045
 grizz+0.044
 gru+0.044
run+0.043
Neg logit contributions
 herself-0.040
 shapes-0.033
ena-0.032
 themselves-0.032
 brushes-0.031
Token named
Token ID3706
Feature activation+4.35e+00

Pos logit contributions
 sle+0.046
 shortcut+0.039
 grizz+0.039
 spe+0.039
 gru+0.037
Neg logit contributions
 herself-0.072
ena-0.062
 themselves-0.060
 shapes-0.059
 brushes-0.058
Token Gab
Token ID12300
Feature activation+7.73e+00

Pos logit contributions
 sle+0.124
 spe+0.111
 grizz+0.110
 shortcut+0.109
 gru+0.104
Neg logit contributions
 herself-0.116
ena-0.096
 themselves-0.095
 shapes-0.093
 styles-0.090
Tokenby
Token ID1525
Feature activation+2.48e+00

Pos logit contributions
 sle+0.251
 grizz+0.238
 shortcut+0.233
 gru+0.232
 spe+0.232
Neg logit contributions
 herself-0.183
ena-0.148
 themselves-0.137
 shapes-0.132
 styles-0.127
Token.
Token ID13
Feature activation+9.19e-01

Pos logit contributions
 sle+0.076
 spe+0.067
 grizz+0.067
oc+0.067
 shortcut+0.065
Neg logit contributions
 herself-0.063
 themselves-0.050
ena-0.050
 shapes-0.049
 brushes-0.048
Token She
Token ID1375
Feature activation+1.63e+00

Pos logit contributions
 sle+0.030
 grizz+0.028
 shortcut+0.027
 gru+0.027
run+0.027
Neg logit contributions
 herself-0.025
 themselves-0.018
 her-0.016
 shapes-0.016
ena-0.016
Token woke
Token ID19092
Feature activation+2.69e+00

Pos logit contributions
 sle+0.055
 spe+0.049
 grizz+0.049
 shortcut+0.049
oc+0.047
Neg logit contributions
 herself-0.037
ena-0.029
 themselves-0.028
 shapes-0.028
 brushes-0.026
Token up
Token ID510
Feature activation+4.73e-01

Pos logit contributions
 sle+0.072
 grizz+0.062
 spe+0.060
 shortcut+0.058
 nut+0.058
Neg logit contributions
 herself-0.072
ena-0.067
 themselves-0.063
 brushes-0.061
 shapes-0.061
Token
Token ID198
Feature activation+1.36e+00

Pos logit contributions
 sle+0.020
 spe+0.018
 grizz+0.017
 shortcut+0.017
 hunter+0.016
Neg logit contributions
 herself-0.018
 themselves-0.014
ena-0.014
 shapes-0.013
 brushes-0.012
TokenThe
Token ID464
Feature activation+4.45e-02

Pos logit contributions
 sle+0.041
 spe+0.037
 grizz+0.036
 shortcut+0.036
oc+0.035
Neg logit contributions
 herself-0.035
ena-0.028
 themselves-0.028
 shapes-0.027
 brushes-0.026
Token girl
Token ID2576
Feature activation-7.47e-01

Pos logit contributions
 sle+0.001
 grizz+0.001
 spe+0.001
 shortcut+0.001
 gru+0.001
Neg logit contributions
 herself-0.001
ena-0.001
 themselves-0.001
 shapes-0.001
 brushes-0.001
Token had
Token ID550
Feature activation-2.67e-01

Pos logit contributions
 herself+0.022
 themselves+0.018
iece+0.018
ena+0.018
 shapes+0.018
Neg logit contributions
 sle-0.020
 spe-0.018
oc-0.017
 grizz-0.017
 gru-0.016
Token been
Token ID587
Feature activation+1.59e+00

Pos logit contributions
 herself+0.007
ena+0.006
iece+0.006
 themselves+0.006
mons+0.006
Neg logit contributions
 sle-0.007
 spe-0.006
 victorious-0.006
 shortcut-0.006
 grizz-0.006
Token fore
Token ID1674
Feature activation+7.72e+00

Pos logit contributions
 sle+0.043
 spe+0.038
 grizz+0.038
 shortcut+0.037
oc+0.036
Neg logit contributions
 herself-0.046
ena-0.038
 themselves-0.038
 shapes-0.037
 brushes-0.036
Tokenshadow
Token ID19106
Feature activation-6.37e-01

Pos logit contributions
 sle+0.137
 grizz+0.118
 spe+0.117
 nut+0.109
 polar+0.108
Neg logit contributions
 herself-0.293
ena-0.255
 themselves-0.251
ups-0.243
 shapes-0.242
Tokened
Token ID276
Feature activation-3.37e-01

Pos logit contributions
 herself+0.013
 styles+0.011
 brushes+0.011
 shapes+0.011
 themselves+0.011
Neg logit contributions
 sle-0.020
 grizz-0.019
 victorious-0.019
 shortcut-0.019
 gru-0.019
Token.
Token ID13
Feature activation+1.06e+00

Pos logit contributions
 herself+0.009
ena+0.008
 themselves+0.008
 brushes+0.008
 shapes+0.007
Neg logit contributions
 sle-0.009
 shortcut-0.008
 grizz-0.008
 spe-0.008
 gru-0.008
Token Even
Token ID3412
Feature activation-2.06e+00

Pos logit contributions
 sle+0.030
 grizz+0.027
 spe+0.026
 shortcut+0.026
oc+0.026
Neg logit contributions
 herself-0.029
ena-0.024
 themselves-0.024
 shapes-0.023
 styles-0.022
Token though
Token ID996
Feature activation-1.57e+00

Pos logit contributions
 herself+0.058
ena+0.052
 themselves+0.051
iece+0.049
 brushes+0.048
Neg logit contributions
 sle-0.052
 spe-0.049
 grizz-0.047
 gru-0.046
 shortcut-0.045
Token the
Token ID262
Feature activation+1.18e+00

Pos logit contributions
 sle+0.005
 spe+0.005
 grizz+0.005
 shortcut+0.005
oc+0.005
Neg logit contributions
 herself-0.004
ena-0.004
 brushes-0.004
 themselves-0.004
 styles-0.003
Token heavy
Token ID4334
Feature activation+1.23e+00

Pos logit contributions
 sle+0.029
 spe+0.025
 grizz+0.025
 shortcut+0.024
oc+0.024
Neg logit contributions
 herself-0.037
ena-0.031
 shapes-0.031
 themselves-0.031
 brushes-0.030
Token bag
Token ID6131
Feature activation+1.61e+00

Pos logit contributions
 sle+0.027
 spe+0.023
 grizz+0.023
 shortcut+0.022
 gru+0.022
Neg logit contributions
 herself-0.040
 shapes-0.035
ena-0.034
 clothes-0.033
 themselves-0.033
Token had
Token ID550
Feature activation+1.76e+00

Pos logit contributions
 sle+0.047
 spe+0.042
 grizz+0.042
 shortcut+0.041
oc+0.040
Neg logit contributions
 herself-0.044
ena-0.035
 themselves-0.035
 shapes-0.034
 brushes-0.033
Token been
Token ID587
Feature activation+2.00e+00

Pos logit contributions
 sle+0.044
 shortcut+0.040
 spe+0.039
 grizz+0.038
 nut+0.037
Neg logit contributions
 herself-0.052
ena-0.044
 themselves-0.043
 shapes-0.042
 brushes-0.041
Token fore
Token ID1674
Feature activation+7.68e+00

Pos logit contributions
 sle+0.054
 grizz+0.046
 spe+0.046
 shortcut+0.046
oc+0.045
Neg logit contributions
 herself-0.059
ena-0.049
 themselves-0.049
 shapes-0.047
 brushes-0.045
Tokenshadow
Token ID19106
Feature activation-6.50e-01

Pos logit contributions
 sle+0.147
 grizz+0.130
 spe+0.129
 Nice+0.119
 polar+0.119
Neg logit contributions
 herself-0.279
ena-0.241
 themselves-0.239
ups-0.237
 when-0.234
Tokened
Token ID276
Feature activation-3.93e-01

Pos logit contributions
 herself+0.015
 brushes+0.012
 styles+0.012
 shapes+0.012
ena+0.012
Neg logit contributions
 sle-0.020
 grizz-0.018
 shortcut-0.018
run-0.018
 victorious-0.018
Token,
Token ID11
Feature activation-7.70e-01

Pos logit contributions
 herself+0.011
ena+0.009
 themselves+0.009
 shapes+0.009
 brushes+0.009
Neg logit contributions
 sle-0.011
 shortcut-0.010
 grizz-0.010
 spe-0.010
oc-0.009
Token and
Token ID290
Feature activation-7.86e-01

Pos logit contributions
 herself+0.020
ena+0.016
 themselves+0.016
 shapes+0.016
 brushes+0.015
Neg logit contributions
 sle-0.023
 spe-0.021
 grizz-0.020
 shortcut-0.020
oc-0.020
Token that
Token ID326
Feature activation+1.44e-01

Pos logit contributions
 herself+0.022
ena+0.020
 brushes+0.019
 styles+0.019
 shapes+0.018
Neg logit contributions
 sle-0.021
 grizz-0.019
 spe-0.018
oc-0.018
 shortcut-0.018
Token friend
Token ID1545
Feature activation+6.57e-01

Pos logit contributions
 sle+0.003
 shortcut+0.003
 spe+0.003
 grizz+0.002
oc+0.002
Neg logit contributions
 herself-0.004
 themselves-0.003
ena-0.003
iece-0.003
 shapes-0.003
Token,
Token ID11
Feature activation+6.07e-01

Pos logit contributions
 sle+0.016
 polar+0.015
 grizz+0.014
oc+0.014
 spe+0.014
Neg logit contributions
 herself-0.020
 shapes-0.017
ena-0.017
 styles-0.016
iece-0.016
Token a
Token ID257
Feature activation+1.07e+00

Pos logit contributions
 sle+0.017
 spe+0.015
 shortcut+0.014
 grizz+0.014
 polar+0.014
Neg logit contributions
 herself-0.017
ena-0.014
iece-0.014
 shapes-0.014
 themselves-0.014
Token dog
Token ID3290
Feature activation+2.98e+00

Pos logit contributions
 sle+0.029
 shortcut+0.026
 grizz+0.026
oc+0.025
 spe+0.025
Neg logit contributions
 herself-0.029
ena-0.024
 shapes-0.024
 brushes-0.023
 themselves-0.023
Token called
Token ID1444
Feature activation+4.63e+00

Pos logit contributions
 sle+0.078
 shortcut+0.070
 gru+0.067
 spe+0.067
 grizz+0.066
Neg logit contributions
 herself-0.089
ena-0.075
 themselves-0.074
 when-0.071
 shapes-0.069
Token Tri
Token ID7563
Feature activation+7.60e+00

Pos logit contributions
 sle+0.123
 grizz+0.107
 spe+0.106
 shortcut+0.106
 gru+0.102
Neg logit contributions
 herself-0.135
ena-0.114
 themselves-0.110
 shapes-0.110
 brushes-0.107
Tokenx
Token ID87
Feature activation+2.02e+00

Pos logit contributions
 sle+0.243
 spe+0.218
 grizz+0.218
 shortcut+0.215
 gru+0.211
Neg logit contributions
 herself-0.185
ena-0.150
 themselves-0.144
 shapes-0.139
 brushes-0.134
Token.
Token ID13
Feature activation+6.77e-01

Pos logit contributions
 sle+0.051
oc+0.042
 shortcut+0.042
 grizz+0.042
 polar+0.042
Neg logit contributions
 herself-0.059
 brushes-0.051
ena-0.051
 themselves-0.049
 shapes-0.049
Token Mill
Token ID9212
Feature activation+1.64e+00

Pos logit contributions
 sle+0.021
 spe+0.019
 grizz+0.019
 shortcut+0.019
 gru+0.018
Neg logit contributions
 herself-0.017
 themselves-0.014
ena-0.013
 shapes-0.013
 brushes-0.013
Tokenie
Token ID494
Feature activation-4.56e-01

Pos logit contributions
 sle+0.047
 grizz+0.041
 spe+0.041
 shortcut+0.040
oc+0.040
Neg logit contributions
 herself-0.045
ena-0.037
 themselves-0.036
 shapes-0.036
 brushes-0.034
Token wanted
Token ID2227
Feature activation+1.31e+00

Pos logit contributions
 herself+0.013
iece+0.011
 themselves+0.011
ena+0.010
 shapes+0.010
Neg logit contributions
 sle-0.012
 grizz-0.011
oc-0.011
 spe-0.011
 hunter-0.011
Token friend
Token ID1545
Feature activation+1.17e+00

Pos logit contributions
 sle+0.020
 shortcut+0.017
 spe+0.017
 nut+0.017
 grizz+0.017
Neg logit contributions
 herself-0.020
ena-0.016
iece-0.016
 themselves-0.015
 brushes-0.014
Token?"
Token ID1701
Feature activation-2.98e+00

Pos logit contributions
 sle+0.023
 spe+0.019
 grizz+0.019
 shortcut+0.019
oc+0.018
Neg logit contributions
 herself-0.043
ena-0.037
 themselves-0.036
 shapes-0.036
 brushes-0.035
Token
Token ID198
Feature activation-3.56e-01

Pos logit contributions
 herself+0.092
ena+0.079
 shapes+0.079
 styles+0.077
iece+0.077
Neg logit contributions
 sle-0.070
 spe-0.060
 grizz-0.060
 gru-0.059
oc-0.058
Token
Token ID198
Feature activation+1.64e-01

Pos logit contributions
 herself+0.010
ena+0.008
 themselves+0.008
 shapes+0.007
 brushes+0.007
Neg logit contributions
 sle-0.011
 spe-0.010
 shortcut-0.009
 grizz-0.009
 trove-0.009
TokenThe
Token ID464
Feature activation+6.07e-02

Pos logit contributions
 sle+0.005
 spe+0.004
 grizz+0.004
oc+0.004
 shortcut+0.004
Neg logit contributions
 herself-0.004
ena-0.003
 shapes-0.003
 themselves-0.003
 styles-0.003
Token tou
Token ID2819
Feature activation+7.60e+00

Pos logit contributions
 sle+0.001
 grizz+0.001
 shortcut+0.001
 spe+0.001
 hunter+0.001
Neg logit contributions
 herself-0.002
ena-0.002
 themselves-0.002
iece-0.002
 shapes-0.002
Tokencan
Token ID5171
Feature activation-1.61e+00

Pos logit contributions
 sle+0.202
 spe+0.174
 grizz+0.172
 shortcut+0.162
 gru+0.162
Neg logit contributions
 herself-0.237
ena-0.208
iece-0.196
 themselves-0.192
ina-0.183
Token nods
Token ID34833
Feature activation+8.81e-02

Pos logit contributions
 herself+0.043
 shapes+0.036
ena+0.035
 themselves+0.035
 styles+0.035
Neg logit contributions
 sle-0.045
 spe-0.039
 shortcut-0.039
 grizz-0.039
 gru-0.038
Token its
Token ID663
Feature activation+2.10e+00

Pos logit contributions
 sle+0.002
 spe+0.002
 gru+0.002
oc+0.002
 grizz+0.002
Neg logit contributions
 herself-0.002
 brushes-0.002
 shapes-0.002
ena-0.002
 styles-0.002
Token head
Token ID1182
Feature activation+1.79e+00

Pos logit contributions
 sle+0.061
 nut+0.052
 hunter+0.051
 spe+0.051
 grizz+0.051
Neg logit contributions
 herself-0.056
ena-0.047
 themselves-0.046
iece-0.046
 brushes-0.045
Token.
Token ID13
Feature activation+1.90e-01

Pos logit contributions
 sle+0.049
 spe+0.042
 gru+0.042
 grizz+0.041
oc+0.041
Neg logit contributions
 herself-0.048
 shapes-0.040
 brushes-0.040
ena-0.040
 themselves-0.040
Token yard
Token ID12699
Feature activation-1.02e+00

Pos logit contributions
 herself+0.092
ena+0.079
 themselves+0.078
 shapes+0.075
 brushes+0.074
Neg logit contributions
 sle-0.068
 grizz-0.057
 spe-0.057
 shortcut-0.057
oc-0.055
Token.
Token ID13
Feature activation+9.46e-01

Pos logit contributions
 herself+0.028
ena+0.024
 brushes+0.024
 styles+0.023
 themselves+0.023
Neg logit contributions
 sle-0.027
 grizz-0.025
 spe-0.024
oc-0.024
 shortcut-0.024
Token He
Token ID679
Feature activation+3.51e-01

Pos logit contributions
 sle+0.022
 spe+0.019
 shortcut+0.018
use+0.018
oc+0.017
Neg logit contributions
 herself-0.032
 themselves-0.027
ena-0.027
 shapes-0.025
 hair-0.025
Token had
Token ID550
Feature activation-1.82e-01

Pos logit contributions
 sle+0.009
 grizz+0.009
 spe+0.008
 shortcut+0.008
oc+0.008
Neg logit contributions
 herself-0.010
 themselves-0.008
 shapes-0.008
ena-0.008
 styles-0.008
Token been
Token ID587
Feature activation+1.64e+00

Pos logit contributions
 herself+0.005
ena+0.004
iece+0.004
ina+0.004
 styles+0.004
Neg logit contributions
 sle-0.005
 shortcut-0.004
 spe-0.004
 grizz-0.004
 victorious-0.004
Token fore
Token ID1674
Feature activation+7.57e+00

Pos logit contributions
 sle+0.046
 spe+0.040
 grizz+0.040
 shortcut+0.039
oc+0.039
Neg logit contributions
 herself-0.047
ena-0.039
 themselves-0.038
 shapes-0.037
 brushes-0.035
Tokenshadow
Token ID19106
Feature activation-9.72e-01

Pos logit contributions
 sle+0.139
 spe+0.123
 grizz+0.121
 polar+0.111
 Nice+0.111
Neg logit contributions
 herself-0.280
ena-0.243
 themselves-0.239
ups-0.237
 when-0.232
Tokened
Token ID276
Feature activation-4.22e-01

Pos logit contributions
 herself+0.020
 styles+0.015
 shapes+0.015
 themselves+0.015
 brushes+0.015
Neg logit contributions
 sle-0.033
 grizz-0.030
 shortcut-0.030
run-0.029
 gru-0.029
Token about
Token ID546
Feature activation+5.39e-01

Pos logit contributions
 herself+0.012
ena+0.010
 shapes+0.009
 brushes+0.009
 themselves+0.009
Neg logit contributions
 sle-0.012
 shortcut-0.011
 grizz-0.010
 spe-0.010
oc-0.010
Token this
Token ID428
Feature activation+4.30e-01

Pos logit contributions
 sle+0.016
 grizz+0.014
 spe+0.014
 shortcut+0.014
 gru+0.014
Neg logit contributions
 herself-0.013
ena-0.011
 brushes-0.011
iece-0.011
 shapes-0.011
Token bad
Token ID2089
Feature activation+1.53e+00

Pos logit contributions
 sle+0.012
 spe+0.010
 grizz+0.010
 shortcut+0.010
oc+0.010
Neg logit contributions
 herself-0.012
ena-0.010
 themselves-0.010
 shapes-0.010
 brushes-0.010
Token day
Token ID1110
Feature activation+5.32e-01

Pos logit contributions
 sle+0.020
 spe+0.018
 shortcut+0.018
 grizz+0.018
 nut+0.017
Neg logit contributions
 herself-0.018
ena-0.015
 themselves-0.015
 shapes-0.014
 styles-0.014
Token she
Token ID673
Feature activation+2.02e+00

Pos logit contributions
 sle+0.014
 spe+0.014
 grizz+0.011
 guarding+0.011
 Nice+0.011
Neg logit contributions
 herself-0.014
ena-0.013
iece-0.013
 brushes-0.013
 opinions-0.012
Token had
Token ID550
Feature activation+3.54e-01

Pos logit contributions
 sle+0.048
 spe+0.047
oc+0.042
 grizz+0.041
 shortcut+0.041
Neg logit contributions
 herself-0.063
 themselves-0.054
ena-0.054
 shapes-0.051
iece-0.051
Token to
Token ID284
Feature activation+1.06e+00

Pos logit contributions
 spe+0.008
 sle+0.008
 shady+0.007
 guarding+0.007
 shortcut+0.007
Neg logit contributions
 herself-0.010
iece-0.010
ena-0.009
 themselves-0.009
 styles-0.009
Token balance
Token ID5236
Feature activation-7.69e-01

Pos logit contributions
 sle+0.023
 spe+0.020
 nut+0.020
 shortcut+0.019
 tow+0.019
Neg logit contributions
 herself-0.031
ena-0.031
iece-0.029
ups-0.028
 shapes-0.028
Token prec
Token ID3718
Feature activation+7.56e+00

Pos logit contributions
ena+0.019
era+0.017
 brushes+0.017
mons+0.017
 herself+0.017
Neg logit contributions
 sle-0.019
 spe-0.017
 gru-0.017
 shortcut-0.016
 nut-0.016
Tokenar
Token ID283
Feature activation+5.43e-01

Pos logit contributions
 sle+0.176
 grizz+0.146
 polar+0.144
 shortcut+0.144
 spe+0.143
Neg logit contributions
 herself-0.240
ena-0.214
iece-0.194
 hair-0.194
 brushes-0.191
Tokeniously
Token ID6819
Feature activation+1.75e+00

Pos logit contributions
 sle+0.014
 spe+0.012
 guarding+0.012
 grizz+0.011
oc+0.011
Neg logit contributions
 herself-0.015
 themselves-0.013
 brushes-0.013
 styles-0.013
 shapes-0.012
Token on
Token ID319
Feature activation+1.79e+00

Pos logit contributions
 sle+0.049
 spe+0.044
 shortcut+0.042
 gru+0.041
 grizz+0.041
Neg logit contributions
 herself-0.047
ena-0.041
 themselves-0.039
iece-0.037
 shapes-0.037
Token her
Token ID607
Feature activation-2.01e+00

Pos logit contributions
 sle+0.057
 spe+0.053
 shortcut+0.050
 tow+0.048
 grizz+0.048
Neg logit contributions
 herself-0.033
iece-0.033
ena-0.032
mons-0.031
 themselves-0.031
Token thin
Token ID7888
Feature activation-3.61e-01

Pos logit contributions
 herself+0.053
ena+0.048
 themselves+0.046
iece+0.043
 brushes+0.043
Neg logit contributions
 sle-0.059
 spe-0.050
 nut-0.048
 shortcut-0.048
 grizz-0.045
Tokenipop
Token ID42800
Feature activation+6.38e-01

Pos logit contributions
 sle+0.087
 spe+0.074
 grizz+0.074
 shortcut+0.073
 gru+0.069
Neg logit contributions
 herself-0.148
ena-0.127
 themselves-0.123
 shapes-0.123
 brushes-0.119
Token,
Token ID11
Feature activation-3.86e-01

Pos logit contributions
 sle+0.014
oc+0.011
 spe+0.011
 grizz+0.011
 shortcut+0.011
Neg logit contributions
 herself-0.021
ena-0.019
 themselves-0.018
iece-0.018
 brushes-0.018
Token it
Token ID340
Feature activation+7.35e-01

Pos logit contributions
 herself+0.011
 themselves+0.009
ena+0.009
iece+0.009
 shapes+0.008
Neg logit contributions
 sle-0.011
 spe-0.009
 grizz-0.009
 shortcut-0.009
 gru-0.009
Token's
Token ID338
Feature activation-2.78e+00

Pos logit contributions
 sle+0.017
 spe+0.015
 grizz+0.015
 shortcut+0.014
oc+0.014
Neg logit contributions
 herself-0.024
ena-0.020
 themselves-0.020
iece-0.020
 styles-0.019
Token a
Token ID257
Feature activation+1.50e+00

Pos logit contributions
 herself+0.081
ena+0.069
 themselves+0.067
iece+0.065
 shapes+0.065
Neg logit contributions
 sle-0.074
 spe-0.065
 grizz-0.064
 shortcut-0.064
 gru-0.061
Token sav
Token ID6799
Feature activation+7.52e+00

Pos logit contributions
 sle+0.036
 grizz+0.031
 shortcut+0.031
 spe+0.031
oc+0.030
Neg logit contributions
 herself-0.046
 shapes-0.038
ena-0.038
 themselves-0.037
 styles-0.036
Tokenour
Token ID454
Feature activation+1.41e-01

Pos logit contributions
 sle+0.191
 spe+0.174
 shortcut+0.174
 grizz+0.168
 polar+0.165
Neg logit contributions
 herself-0.224
ena-0.195
iece-0.190
 hair-0.178
 styles-0.175
Tokeny
Token ID88
Feature activation+3.02e-01

Pos logit contributions
 sle+0.003
oc+0.003
 spe+0.003
 nut+0.003
 grizz+0.003
Neg logit contributions
 herself-0.004
 themselves-0.004
ena-0.004
 brushes-0.004
iece-0.004
Token l
Token ID300
Feature activation+3.02e+00

Pos logit contributions
 sle+0.007
 spe+0.006
 grizz+0.006
 shortcut+0.006
oc+0.006
Neg logit contributions
 herself-0.010
ena-0.008
 themselves-0.008
 shapes-0.008
 brushes-0.008
Tokenoll
Token ID692
Feature activation+5.06e+00

Pos logit contributions
 sle+0.096
 spe+0.085
 grizz+0.085
 shortcut+0.085
oc+0.082
Neg logit contributions
 herself-0.076
ena-0.061
 themselves-0.056
 shapes-0.056
iece-0.054
Tokenipop
Token ID42800
Feature activation+9.95e-01

Pos logit contributions
 sle+0.108
 grizz+0.093
 shortcut+0.093
 spe+0.092
 gru+0.088
Neg logit contributions
 herself-0.175
ena-0.149
 shapes-0.145
 themselves-0.144
 brushes-0.140
Token
Token ID198
Feature activation+1.46e+00

Pos logit contributions
 sle+0.021
 spe+0.018
 grizz+0.018
 shortcut+0.018
oc+0.017
Neg logit contributions
 herself-0.024
ena-0.020
 themselves-0.020
 shapes-0.019
 brushes-0.018
TokenThe
Token ID464
Feature activation+2.40e-01

Pos logit contributions
 sle+0.044
 gru+0.040
oc+0.039
 shortcut+0.039
 grizz+0.039
Neg logit contributions
 herself-0.035
ena-0.029
 shapes-0.029
 styles-0.028
 brushes-0.028
Token bad
Token ID2089
Feature activation+7.88e-01

Pos logit contributions
 sle+0.006
 grizz+0.005
 spe+0.005
 polar+0.005
 gru+0.005
Neg logit contributions
 herself-0.007
ena-0.006
 themselves-0.006
 styles-0.006
 brushes-0.006
Token ending
Token ID7464
Feature activation+8.40e-01

Pos logit contributions
 sle+0.017
 spe+0.015
 grizz+0.014
 polar+0.014
 gru+0.014
Neg logit contributions
 herself-0.027
ena-0.024
 themselves-0.023
 brushes-0.022
 shapes-0.021
Token was
Token ID373
Feature activation+1.66e-01

Pos logit contributions
 sle+0.022
 spe+0.020
 shortcut+0.020
 grizz+0.020
 gru+0.019
Neg logit contributions
 herself-0.024
ena-0.020
 themselves-0.019
 styles-0.019
 shapes-0.019
Token fore
Token ID1674
Feature activation+7.52e+00

Pos logit contributions
 sle+0.005
 spe+0.004
 grizz+0.004
 shortcut+0.004
 gru+0.004
Neg logit contributions
 herself-0.004
ena-0.004
 shapes-0.004
 brushes-0.004
 styles-0.004
Tokenshadow
Token ID19106
Feature activation-9.44e-01

Pos logit contributions
 sle+0.133
 grizz+0.113
 spe+0.112
 nut+0.101
 polar+0.100
Neg logit contributions
 herself-0.287
 themselves-0.250
ena-0.249
ups-0.241
iece-0.239
Tokened
Token ID276
Feature activation-4.74e-01

Pos logit contributions
 herself+0.022
 styles+0.018
 brushes+0.018
 shapes+0.017
 themselves+0.016
Neg logit contributions
 sle-0.028
 victorious-0.026
 grizz-0.026
run-0.026
 spe-0.026
Token by
Token ID416
Feature activation+3.25e-01

Pos logit contributions
 herself+0.013
ena+0.012
 brushes+0.011
 styles+0.011
 shapes+0.011
Neg logit contributions
 sle-0.012
 shortcut-0.011
 spe-0.011
 grizz-0.011
 gru-0.011
Token Maggie
Token ID33014
Feature activation-2.56e-01

Pos logit contributions
 sle+0.008
 spe+0.008
 grizz+0.007
 gru+0.007
 polar+0.007
Neg logit contributions
 herself-0.008
iece-0.008
 brushes-0.008
ena-0.008
 styles-0.008
Token's
Token ID338
Feature activation-1.29e+00

Pos logit contributions
 herself+0.007
ena+0.006
 shapes+0.006
 brushes+0.006
 themselves+0.006
Neg logit contributions
 sle-0.007
oc-0.006
 grizz-0.006
 spe-0.006
 shortcut-0.006
Token forest
Token ID8222
Feature activation+7.65e-02

Pos logit contributions
 herself+0.016
ena+0.013
 themselves+0.013
 shapes+0.013
 brushes+0.013
Neg logit contributions
 sle-0.011
 spe-0.010
 grizz-0.010
 shortcut-0.010
oc-0.009
Token and
Token ID290
Feature activation+4.47e-01

Pos logit contributions
 sle+0.002
oc+0.002
 guarding+0.002
 spe+0.002
 grizz+0.002
Neg logit contributions
 herself-0.002
ena-0.002
 brushes-0.002
 themselves-0.002
 styles-0.002
Token he
Token ID339
Feature activation+5.66e-01

Pos logit contributions
 sle+0.012
 spe+0.011
 grizz+0.011
 shortcut+0.010
 nut+0.010
Neg logit contributions
 herself-0.012
ena-0.011
 themselves-0.011
 brushes-0.010
iece-0.010
Token was
Token ID373
Feature activation-5.86e-01

Pos logit contributions
 sle+0.016
 spe+0.015
 grizz+0.014
oc+0.014
 shortcut+0.014
Neg logit contributions
 herself-0.015
ena-0.013
 themselves-0.012
 brushes-0.012
 shapes-0.012
Token known
Token ID1900
Feature activation+2.63e+00

Pos logit contributions
 herself+0.016
ena+0.013
 themselves+0.013
 shapes+0.012
 brushes+0.012
Neg logit contributions
 sle-0.017
 spe-0.015
 grizz-0.015
 shortcut-0.015
 gru-0.015
Token far
Token ID1290
Feature activation+7.51e+00

Pos logit contributions
 sle+0.079
 spe+0.070
 grizz+0.070
 shortcut+0.069
oc+0.068
Neg logit contributions
 herself-0.070
ena-0.056
 themselves-0.055
 shapes-0.054
 styles-0.051
Token and
Token ID290
Feature activation+1.98e+00

Pos logit contributions
 sle+0.237
 spe+0.213
 grizz+0.210
 shortcut+0.205
 tow+0.201
Neg logit contributions
 herself-0.180
 when-0.142
 shapes-0.140
ena-0.140
 themselves-0.138
Token wide
Token ID3094
Feature activation+1.09e+00

Pos logit contributions
 sle+0.048
 grizz+0.041
 spe+0.041
 shortcut+0.040
oc+0.039
Neg logit contributions
 herself-0.064
ena-0.053
 themselves-0.053
 shapes-0.053
 brushes-0.051
Token for
Token ID329
Feature activation-3.75e-01

Pos logit contributions
 sle+0.035
 grizz+0.031
 spe+0.031
 shortcut+0.031
oc+0.030
Neg logit contributions
 herself-0.026
ena-0.021
 themselves-0.021
 shapes-0.020
 styles-0.019
Token being
Token ID852
Feature activation+1.34e+00

Pos logit contributions
 herself+0.009
ena+0.007
 themselves+0.007
 brushes+0.007
 shapes+0.007
Neg logit contributions
 sle-0.012
 grizz-0.011
 spe-0.011
 shortcut-0.010
 polar-0.010
Token a
Token ID257
Feature activation+2.57e+00

Pos logit contributions
 sle+0.034
 spe+0.030
 grizz+0.030
 shortcut+0.030
 gru+0.028
Neg logit contributions
 herself-0.041
ena-0.034
 themselves-0.034
 shapes-0.033
 brushes-0.032
Token there
Token ID612
Feature activation+1.53e+00

Pos logit contributions
 sle+0.035
 spe+0.030
 grizz+0.030
 shortcut+0.030
 hunter+0.030
Neg logit contributions
 herself-0.019
 themselves-0.014
ena-0.013
 when-0.013
 shapes-0.012
Token was
Token ID373
Feature activation-9.29e-01

Pos logit contributions
 sle+0.051
 grizz+0.046
 spe+0.046
 shortcut+0.045
 gru+0.044
Neg logit contributions
 herself-0.035
ena-0.028
 themselves-0.027
 shapes-0.026
 brushes-0.025
Token a
Token ID257
Feature activation+1.36e+00

Pos logit contributions
 herself+0.019
 styles+0.017
ena+0.016
ups+0.016
iece+0.016
Neg logit contributions
 sle-0.029
 spe-0.027
 nut-0.027
 polar-0.027
 grizz-0.026
Token girl
Token ID2576
Feature activation+2.21e+00

Pos logit contributions
run+0.042
 sle+0.042
 shortcut+0.039
|+0.039
use+0.039
Neg logit contributions
 herself-0.026
ena-0.025
 shapes-0.024
 when-0.023
 hair-0.023
Token named
Token ID3706
Feature activation+4.56e+00

Pos logit contributions
 sle+0.045
 spe+0.038
 grizz+0.037
 shortcut+0.036
oc+0.036
Neg logit contributions
 herself-0.079
ena-0.068
 themselves-0.068
 shapes-0.066
 styles-0.065
Token Gab
Token ID12300
Feature activation+7.51e+00

Pos logit contributions
 sle+0.117
 spe+0.109
 grizz+0.106
 shortcut+0.105
|+0.103
Neg logit contributions
 herself-0.124
 themselves-0.107
ena-0.106
 shapes-0.104
iece-0.104
Tokenby
Token ID1525
Feature activation+2.16e+00

Pos logit contributions
 sle+0.247
 grizz+0.238
 shortcut+0.235
 gru+0.234
 spe+0.232
Neg logit contributions
 herself-0.175
ena-0.138
 themselves-0.125
 shapes-0.119
ina-0.119
Token.
Token ID13
Feature activation+8.54e-01

Pos logit contributions
 sle+0.062
oc+0.056
 grizz+0.054
 spe+0.054
 polar+0.053
Neg logit contributions
 herself-0.060
 themselves-0.049
 shapes-0.047
 brushes-0.047
 styles-0.046
Token She
Token ID1375
Feature activation+1.30e+00

Pos logit contributions
 sle+0.029
 grizz+0.027
 shortcut+0.026
run+0.026
 spe+0.026
Neg logit contributions
 herself-0.022
 themselves-0.015
 her-0.014
 shapes-0.014
 styles-0.013
Token was
Token ID373
Feature activation-5.25e-01

Pos logit contributions
 sle+0.040
 spe+0.036
 grizz+0.035
oc+0.035
 shortcut+0.035
Neg logit contributions
 herself-0.032
 themselves-0.027
ena-0.026
 shapes-0.025
iece-0.025
Token three
Token ID1115
Feature activation-2.85e+00

Pos logit contributions
 herself+0.013
 themselves+0.010
ena+0.010
 shapes+0.010
iece+0.010
Neg logit contributions
 sle-0.016
 spe-0.015
 grizz-0.015
 shortcut-0.014
 polar-0.014
Token was
Token ID373
Feature activation-1.11e+00

Pos logit contributions
 sle+0.049
 grizz+0.044
 spe+0.044
 shortcut+0.044
 gru+0.043
Neg logit contributions
 herself-0.032
ena-0.024
 themselves-0.024
 shapes-0.023
 brushes-0.022
Token a
Token ID257
Feature activation+1.31e+00

Pos logit contributions
 herself+0.022
 styles+0.019
iece+0.019
ena+0.019
ups+0.018
Neg logit contributions
 sle-0.035
 spe-0.033
 nut-0.032
 grizz-0.032
 polar-0.032
Token little
Token ID1310
Feature activation+1.55e+00

Pos logit contributions
run+0.041
 sle+0.040
 shortcut+0.038
|+0.038
use+0.038
Neg logit contributions
 herself-0.026
ena-0.024
 shapes-0.024
 when-0.023
 hair-0.022
Token girl
Token ID2576
Feature activation+2.20e+00

Pos logit contributions
 sle+0.047
 shortcut+0.043
 grizz+0.042
 gru+0.042
run+0.042
Neg logit contributions
 herself-0.036
ena-0.030
 shapes-0.030
 brushes-0.028
 themselves-0.028
Token named
Token ID3706
Feature activation+4.51e+00

Pos logit contributions
 sle+0.048
 shortcut+0.040
 grizz+0.040
 spe+0.040
 gru+0.038
Neg logit contributions
 herself-0.076
ena-0.065
 themselves-0.064
 shapes-0.063
 brushes-0.061
Token Gab
Token ID12300
Feature activation+7.49e+00

Pos logit contributions
 sle+0.126
 spe+0.114
 grizz+0.112
 shortcut+0.111
 gru+0.107
Neg logit contributions
 herself-0.119
ena-0.100
 themselves-0.100
 shapes-0.097
 styles-0.095
Tokenby
Token ID1525
Feature activation+2.15e+00

Pos logit contributions
 sle+0.238
 grizz+0.221
 shortcut+0.219
 spe+0.217
 gru+0.215
Neg logit contributions
 herself-0.184
ena-0.149
 themselves-0.140
 shapes-0.136
 styles-0.131
Token who
Token ID508
Feature activation+2.31e+00

Pos logit contributions
 sle+0.065
oc+0.058
 grizz+0.057
 spe+0.057
 shortcut+0.055
Neg logit contributions
 herself-0.056
 themselves-0.045
ena-0.044
 shapes-0.044
 brushes-0.042
Token liked
Token ID8288
Feature activation-1.17e+00

Pos logit contributions
 sle+0.085
 spe+0.076
 grizz+0.076
 shortcut+0.076
 gru+0.074
Neg logit contributions
 herself-0.047
ena-0.035
 shapes-0.033
 themselves-0.033
 brushes-0.031
Token to
Token ID284
Feature activation+1.85e+00

Pos logit contributions
 herself+0.036
ena+0.032
 themselves+0.031
iece+0.031
 shapes+0.029
Neg logit contributions
 sle-0.029
 spe-0.025
 shortcut-0.024
 grizz-0.024
 nut-0.023
Token roll
Token ID4836
Feature activation+5.35e-01

Pos logit contributions
 sle+0.043
 spe+0.037
 shortcut+0.037
 nut+0.037
 grizz+0.036
Neg logit contributions
 herself-0.057
ena-0.051
 themselves-0.050
iece-0.048
 shapes-0.047
Tokenushed
Token ID7474
Feature activation+5.38e-01

Pos logit contributions
 sle+0.065
oc+0.063
run+0.062
 grizz+0.059
 spe+0.057
Neg logit contributions
 herself-0.126
 themselves-0.108
 shapes-0.105
ena-0.103
iece-0.103
Token and
Token ID290
Feature activation+1.50e+00

Pos logit contributions
 sle+0.014
 gru+0.013
 grizz+0.012
 spe+0.012
 Roger+0.012
Neg logit contributions
 herself-0.014
ena-0.013
 brushes-0.013
 styles-0.013
 themselves-0.012
Token thanked
Token ID26280
Feature activation+3.74e+00

Pos logit contributions
 sle+0.038
 spe+0.037
 gru+0.035
 grizz+0.035
 nut+0.033
Neg logit contributions
 herself-0.037
iece-0.037
ena-0.036
 themselves-0.034
 shapes-0.033
Token the
Token ID262
Feature activation+2.31e+00

Pos logit contributions
 sle+0.117
 spe+0.104
 Roger+0.103
 grizz+0.102
 gru+0.101
Neg logit contributions
 herself-0.069
ena-0.066
mons-0.066
 brushes-0.065
 styles-0.062
Token woman
Token ID2415
Feature activation+3.36e+00

Pos logit contributions
 sle+0.060
 grizz+0.052
 spe+0.051
 shortcut+0.051
 gru+0.049
Neg logit contributions
 herself-0.069
 themselves-0.060
ena-0.059
iece-0.056
 shapes-0.056
Token prof
Token ID1534
Feature activation+7.48e+00

Pos logit contributions
 sle+0.098
 spe+0.085
 grizz+0.085
 shortcut+0.084
oc+0.083
Neg logit contributions
 herself-0.090
ena-0.075
 themselves-0.072
 shapes-0.071
 brushes-0.070
Tokenuse
Token ID1904
Feature activation+1.86e+00

Pos logit contributions
 sle+0.087
 shortcut+0.062
 grizz+0.061
 spe+0.060
 polar+0.051
Neg logit contributions
 herself-0.341
ena-0.300
 themselves-0.299
iece-0.285
 styles-0.283
Tokenly
Token ID306
Feature activation+8.86e-01

Pos logit contributions
 sle+0.046
oc+0.042
use+0.041
 gru+0.041
run+0.039
Neg logit contributions
 herself-0.050
 themselves-0.044
ena-0.043
 brushes-0.042
mons-0.040
Token.
Token ID13
Feature activation+8.54e-01

Pos logit contributions
 sle+0.024
 grizz+0.021
 spe+0.020
oc+0.020
 gru+0.020
Neg logit contributions
 herself-0.025
ena-0.021
 themselves-0.020
 brushes-0.020
 shapes-0.020
Token The
Token ID383
Feature activation+9.32e-01

Pos logit contributions
 sle+0.019
 grizz+0.017
 gru+0.017
 Nice+0.016
 spe+0.016
Neg logit contributions
 herself-0.026
ena-0.023
 shapes-0.023
 styles-0.023
 themselves-0.022
Token woman
Token ID2415
Feature activation+1.49e+00

Pos logit contributions
 sle+0.023
 grizz+0.019
 spe+0.019
 gru+0.019
 shortcut+0.019
Neg logit contributions
 herself-0.029
 themselves-0.026
 shapes-0.025
ena-0.024
iece-0.024
Token eyes
Token ID2951
Feature activation+3.25e+00

Pos logit contributions
 sle+0.069
 spe+0.063
 grizz+0.062
 shortcut+0.062
oc+0.061
Neg logit contributions
 herself-0.053
ena-0.042
 shapes-0.041
 themselves-0.041
 brushes-0.038
Token.
Token ID13
Feature activation+7.64e-01

Pos logit contributions
 sle+0.084
 spe+0.076
 grizz+0.074
 shortcut+0.073
oc+0.071
Neg logit contributions
 herself-0.094
ena-0.082
 themselves-0.079
 shapes-0.076
 brushes-0.076
Token She
Token ID1375
Feature activation+5.47e-01

Pos logit contributions
 Nice+0.015
 Want+0.014
run+0.014
 sle+0.014
joy+0.014
Neg logit contributions
 herself-0.022
ena-0.021
 styles-0.020
 themselves-0.020
 shapes-0.020
Token had
Token ID550
Feature activation-5.00e-01

Pos logit contributions
 sle+0.014
 spe+0.013
 grizz+0.013
 shortcut+0.013
oc+0.012
Neg logit contributions
 herself-0.016
 themselves-0.013
ena-0.013
iece-0.013
 shapes-0.012
Token been
Token ID587
Feature activation+1.32e+00

Pos logit contributions
 herself+0.014
 themselves+0.013
ena+0.012
mons+0.012
 styles+0.012
Neg logit contributions
 shortcut-0.011
 sle-0.011
 spe-0.011
 victorious-0.011
 grizz-0.010
Token fore
Token ID1674
Feature activation+7.45e+00

Pos logit contributions
 sle+0.034
 spe+0.030
 grizz+0.030
 shortcut+0.030
 gru+0.029
Neg logit contributions
 herself-0.039
 themselves-0.033
ena-0.033
 shapes-0.032
 brushes-0.032
Tokenshadow
Token ID19106
Feature activation-1.10e+00

Pos logit contributions
 sle+0.133
 spe+0.113
 grizz+0.113
 nut+0.107
 polar+0.103
Neg logit contributions
 herself-0.278
ena-0.245
 themselves-0.238
 shapes-0.233
ups-0.233
Tokened
Token ID276
Feature activation-4.64e-01

Pos logit contributions
 herself+0.026
 styles+0.022
 themselves+0.021
 brushes+0.021
 shapes+0.021
Neg logit contributions
 sle-0.031
run-0.030
 victorious-0.030
 grizz-0.029
 shortcut-0.029
Token about
Token ID546
Feature activation+3.78e-01

Pos logit contributions
 herself+0.012
ena+0.011
 brushes+0.011
 themselves+0.011
 shapes+0.011
Neg logit contributions
 sle-0.012
 shortcut-0.011
 grizz-0.010
 spe-0.010
 gru-0.010
Token the
Token ID262
Feature activation+9.43e-01

Pos logit contributions
 sle+0.010
 grizz+0.010
 spe+0.010
 guarding+0.009
 shortcut+0.009
Neg logit contributions
 herself-0.008
ena-0.008
iece-0.008
 brushes-0.008
 styles-0.007
Token power
Token ID1176
Feature activation+7.47e-01

Pos logit contributions
 sle+0.020
 spe+0.017
 shortcut+0.016
 grizz+0.016
oc+0.016
Neg logit contributions
 herself-0.032
ena-0.028
 shapes-0.028
 themselves-0.027
 styles-0.027
Token's
Token ID338
Feature activation-2.54e+00

Pos logit contributions
 sle+0.058
 shortcut+0.054
 grizz+0.052
oc+0.051
 nut+0.049
Neg logit contributions
 herself-0.074
iece-0.063
 themselves-0.063
ena-0.062
 brushes-0.058
Token okay
Token ID8788
Feature activation-9.27e-02

Pos logit contributions
 herself+0.081
ena+0.069
 themselves+0.068
 shapes+0.067
 brushes+0.066
Neg logit contributions
 sle-0.060
 shortcut-0.052
 grizz-0.051
 spe-0.051
 gru-0.049
Token,
Token ID11
Feature activation+9.17e-01

Pos logit contributions
 herself+0.002
ena+0.002
 themselves+0.002
 shapes+0.002
 styles+0.002
Neg logit contributions
 sle-0.003
 spe-0.003
 grizz-0.003
 shortcut-0.003
oc-0.003
Token my
Token ID616
Feature activation-3.39e-01

Pos logit contributions
 sle+0.033
 spe+0.030
 grizz+0.029
 shortcut+0.029
oc+0.029
Neg logit contributions
 herself-0.019
ena-0.015
 themselves-0.014
 shapes-0.014
 brushes-0.013
Token d
Token ID288
Feature activation+1.36e+00

Pos logit contributions
 herself+0.011
 themselves+0.009
iece+0.009
ena+0.009
 shapes+0.008
Neg logit contributions
 sle-0.009
oc-0.007
 grizz-0.007
 spe-0.007
 polar-0.007
Tokenarl
Token ID7063
Feature activation+7.35e+00

Pos logit contributions
 sle+0.012
 grizz+0.007
 shortcut+0.007
 spe+0.007
oc+0.006
Neg logit contributions
 herself-0.065
 themselves-0.058
 shapes-0.057
ena-0.057
 styles-0.056
Tokenings
Token ID654
Feature activation+2.31e+00

Pos logit contributions
 sle+0.212
 shortcut+0.185
 spe+0.184
 grizz+0.182
oc+0.181
Neg logit contributions
 herself-0.201
ena-0.165
 themselves-0.160
 shapes-0.156
 brushes-0.151
Token.
Token ID13
Feature activation+2.53e-01

Pos logit contributions
 sle+0.073
 grizz+0.065
 shortcut+0.065
 spe+0.065
oc+0.063
Neg logit contributions
 herself-0.056
ena-0.045
 themselves-0.044
 shapes-0.043
 brushes-0.041
Token I
Token ID314
Feature activation+1.26e+00

Pos logit contributions
 sle+0.005
 spe+0.005
 shortcut+0.005
 grizz+0.005
oc+0.004
Neg logit contributions
 herself-0.009
ena-0.008
 themselves-0.007
 shapes-0.007
 brushes-0.007
Token forgive
Token ID20927
Feature activation+5.61e-01

Pos logit contributions
 sle+0.028
oc+0.025
 spe+0.025
 grizz+0.024
 shortcut+0.024
Neg logit contributions
 herself-0.043
ena-0.036
 themselves-0.036
 shapes-0.035
iece-0.035
Token you
Token ID345
Feature activation+6.60e-01

Pos logit contributions
 sle+0.013
 grizz+0.010
 nut+0.010
 spe+0.010
oc+0.010
Neg logit contributions
 herself-0.017
 brushes-0.016
ena-0.016
iece-0.015
 styles-0.015
Token there
Token ID612
Feature activation+1.29e+00

Pos logit contributions
 sle+0.022
 grizz+0.019
 shortcut+0.018
 spe+0.018
oc+0.017
Neg logit contributions
 herself-0.020
 themselves-0.016
ena-0.016
 shapes-0.015
 when-0.015
Token was
Token ID373
Feature activation-1.20e+00

Pos logit contributions
 sle+0.042
 spe+0.037
 grizz+0.037
 shortcut+0.037
oc+0.036
Neg logit contributions
 herself-0.031
ena-0.024
 themselves-0.024
 shapes-0.023
 brushes-0.022
Token a
Token ID257
Feature activation+1.26e+00

Pos logit contributions
 herself+0.025
 styles+0.023
ups+0.023
iece+0.023
ena+0.022
Neg logit contributions
 sle-0.035
 spe-0.034
 nut-0.034
 polar-0.033
 grizz-0.033
Token girl
Token ID2576
Feature activation+2.04e+00

Pos logit contributions
 sle+0.038
run+0.038
 shortcut+0.036
use+0.036
|+0.036
Neg logit contributions
 herself-0.026
 shapes-0.023
ena-0.023
 when-0.022
 hair-0.022
Token named
Token ID3706
Feature activation+4.51e+00

Pos logit contributions
 sle+0.044
 grizz+0.037
 spe+0.037
 shortcut+0.036
oc+0.034
Neg logit contributions
 herself-0.071
ena-0.061
 themselves-0.060
 shapes-0.059
 brushes-0.057
Token Far
Token ID6755
Feature activation+7.30e+00

Pos logit contributions
 sle+0.118
 spe+0.108
 grizz+0.106
 shortcut+0.105
|+0.102
Neg logit contributions
 herself-0.122
 themselves-0.104
ena-0.104
 shapes-0.101
iece-0.101
Tokenah
Token ID993
Feature activation-1.20e-01

Pos logit contributions
 sle+0.190
 grizz+0.174
 spe+0.166
 shortcut+0.165
 gru+0.162
Neg logit contributions
 herself-0.226
ena-0.190
 themselves-0.180
 shapes-0.168
 brushes-0.167
Token.
Token ID13
Feature activation+8.96e-01

Pos logit contributions
 herself+0.003
 themselves+0.003
 shapes+0.003
 brushes+0.003
 styles+0.003
Neg logit contributions
 sle-0.003
oc-0.003
 spe-0.003
 grizz-0.003
 polar-0.003
Token She
Token ID1375
Feature activation+1.21e+00

Pos logit contributions
 sle+0.028
 grizz+0.025
 spe+0.025
 shortcut+0.025
 gru+0.025
Neg logit contributions
 herself-0.023
ena-0.017
 themselves-0.017
 shapes-0.017
 brushes-0.016
Token liked
Token ID8288
Feature activation-1.40e+00

Pos logit contributions
 sle+0.039
 spe+0.035
 grizz+0.035
 shortcut+0.034
oc+0.033
Neg logit contributions
 herself-0.030
ena-0.024
 themselves-0.023
 shapes-0.023
 brushes-0.022
Token playing
Token ID2712
Feature activation-1.35e+00

Pos logit contributions
 herself+0.045
ena+0.041
 themselves+0.040
iece+0.040
era+0.037
Neg logit contributions
 sle-0.032
 spe-0.028
 shortcut-0.026
 grizz-0.025
 nut-0.025
Token!
Token ID0
Feature activation-7.38e-01

Pos logit contributions
 sle+0.024
 shortcut+0.023
oc+0.022
 spe+0.022
|+0.022
Neg logit contributions
 herself-0.025
ena-0.022
 shapes-0.022
 themselves-0.022
 brushes-0.021
Token Let
Token ID3914
Feature activation-1.13e+00

Pos logit contributions
 herself+0.024
ena+0.021
 shapes+0.021
 themselves+0.020
 styles+0.020
Neg logit contributions
 sle-0.017
 grizz-0.014
oc-0.014
 spe-0.014
 shortcut-0.014
Tokenâ
Token ID22940
Feature activation-2.13e+00

Pos logit contributions
 herself+0.030
ena+0.029
 themselves+0.026
 shapes+0.026
 brushes+0.025
Neg logit contributions
 sle-0.029
 shortcut-0.026
 nut-0.025
use-0.025
oc-0.025
Token
Token ID26391
Feature activation-2.07e+00

Pos logit contributions
 herself+0.062
 shapes+0.052
ena+0.051
 themselves+0.051
 brushes+0.050
Neg logit contributions
 sle-0.056
 grizz-0.050
 shortcut-0.049
oc-0.049
 spe-0.047
Token
Token ID8151
Feature activation+8.11e-01

Pos logit contributions
 herself+0.066
ena+0.056
 themselves+0.055
 shapes+0.054
 brushes+0.052
Neg logit contributions
 sle-0.050
 spe-0.043
 grizz-0.043
 shortcut-0.042
 gru-0.040
Tokens
Token ID82
Feature activation-7.75e+00

Pos logit contributions
 sle+0.022
 spe+0.019
 grizz+0.019
 shortcut+0.019
 gru+0.018
Neg logit contributions
 herself-0.023
ena-0.019
 themselves-0.019
 shapes-0.018
 brushes-0.018
Token ride
Token ID6594
Feature activation+4.23e-01

Pos logit contributions
 herself+0.241
ena+0.223
iece+0.211
 styles+0.207
 shapes+0.204
Neg logit contributions
 sle-0.167
 shortcut-0.155
 nut-0.151
 tow-0.143
 hunter-0.140
Token on
Token ID319
Feature activation-6.69e-01

Pos logit contributions
 sle+0.012
 shortcut+0.010
oc+0.010
 grizz+0.010
 spe+0.010
Neg logit contributions
 herself-0.012
ena-0.010
 shapes-0.010
 brushes-0.009
 themselves-0.009
Token!
Token ID0
Feature activation-6.01e-01

Pos logit contributions
 herself+0.019
ena+0.017
 shapes+0.016
 themselves+0.016
iece+0.016
Neg logit contributions
 sle-0.018
 shortcut-0.015
 grizz-0.015
 spe-0.015
 gru-0.015
Tokenâ
Token ID22940
Feature activation-8.77e-01

Pos logit contributions
 herself+0.019
ena+0.016
 themselves+0.016
 shapes+0.015
 brushes+0.015
Neg logit contributions
 sle-0.015
 spe-0.013
 grizz-0.013
 shortcut-0.013
 gru-0.012
Token
Token ID26391
Feature activation-1.50e+00

Pos logit contributions
 herself+0.025
ena+0.021
 themselves+0.020
 shapes+0.020
 brushes+0.019
Neg logit contributions
 sle-0.024
 spe-0.021
 grizz-0.021
 shortcut-0.021
oc-0.020
Token different
Token ID1180
Feature activation-3.13e+00

Pos logit contributions
 herself+0.084
ena+0.074
 styles+0.070
 brushes+0.070
iece+0.069
Neg logit contributions
 sle-0.063
 spe-0.055
 grizz-0.053
 shortcut-0.053
 polar-0.052
Token colors
Token ID7577
Feature activation-2.30e+00

Pos logit contributions
 herself+0.085
ena+0.073
iece+0.069
 brushes+0.066
 themselves+0.064
Neg logit contributions
 sle-0.084
 spe-0.079
 polar-0.073
 grizz-0.073
 nut-0.072
Token and
Token ID290
Feature activation+2.37e-02

Pos logit contributions
 herself+0.063
ena+0.054
 themselves+0.050
 brushes+0.049
 styles+0.049
Neg logit contributions
 sle-0.063
 spe-0.057
 nut-0.053
 shortcut-0.053
 grizz-0.053
Token sizes
Token ID10620
Feature activation-5.34e+00

Pos logit contributions
 sle+0.001
 spe+0.001
 shortcut+0.001
 nut+0.001
 grizz+0.001
Neg logit contributions
 herself-0.001
ena-0.000
iece-0.000
 themselves-0.000
ina-0.000
Token.
Token ID13
Feature activation-7.14e-01

Pos logit contributions
 herself+0.147
ena+0.132
 brushes+0.119
iece+0.117
 styles+0.117
Neg logit contributions
 sle-0.135
 spe-0.125
 nut-0.119
 shortcut-0.116
 guarding-0.114
Token Some
Token ID2773
Feature activation-7.68e+00

Pos logit contributions
 herself+0.020
ena+0.017
 themselves+0.016
 shapes+0.016
 styles+0.016
Neg logit contributions
 sle-0.019
 spe-0.017
 grizz-0.017
 shortcut-0.017
oc-0.017
Token are
Token ID389
Feature activation-2.83e+00

Pos logit contributions
 herself+0.216
ena+0.191
 themselves+0.167
iece+0.165
ina+0.162
Neg logit contributions
 sle-0.206
 spe-0.190
 nut-0.182
 shortcut-0.181
 hunter-0.171
Token red
Token ID2266
Feature activation-2.49e+00

Pos logit contributions
 herself+0.094
ena+0.080
 themselves+0.077
 shapes+0.075
iece+0.075
Neg logit contributions
 sle-0.065
 spe-0.057
 shortcut-0.055
 grizz-0.054
 gru-0.052
Token,
Token ID11
Feature activation-1.15e+00

Pos logit contributions
 herself+0.076
ena+0.070
 themselves+0.060
iece+0.060
 brushes+0.060
Neg logit contributions
 sle-0.060
 spe-0.054
 nut-0.052
oc-0.049
 grizz-0.048
Token some
Token ID617
Feature activation-3.34e+00

Pos logit contributions
 herself+0.022
ena+0.019
ina+0.017
iece+0.016
 styles+0.016
Neg logit contributions
 spe-0.039
 sle-0.038
 nut-0.035
 grizz-0.035
 shortcut-0.035
Token are
Token ID389
Feature activation-2.10e+00

Pos logit contributions
 herself+0.089
ena+0.075
 themselves+0.070
 shapes+0.068
 styles+0.067
Neg logit contributions
 sle-0.098
 spe-0.088
 shortcut-0.086
 grizz-0.084
 nut-0.082
Token.
Token ID13
Feature activation-3.25e-01

Pos logit contributions
 sle+0.019
 spe+0.017
 grizz+0.017
 shortcut+0.017
oc+0.016
Neg logit contributions
 herself-0.024
ena-0.021
 themselves-0.020
 shapes-0.020
 brushes-0.019
Token Let
Token ID3914
Feature activation-1.09e+00

Pos logit contributions
 herself+0.010
ena+0.009
 shapes+0.009
 themselves+0.009
 styles+0.008
Neg logit contributions
 sle-0.008
 grizz-0.007
 spe-0.006
oc-0.006
 gru-0.006
Tokenâ
Token ID22940
Feature activation-2.04e+00

Pos logit contributions
 herself+0.030
ena+0.029
 themselves+0.026
 shapes+0.026
iece+0.025
Neg logit contributions
 sle-0.026
 nut-0.023
 shortcut-0.023
oc-0.023
use-0.023
Token
Token ID26391
Feature activation-1.99e+00

Pos logit contributions
 herself+0.059
 shapes+0.051
ena+0.049
 themselves+0.048
 brushes+0.048
Neg logit contributions
 sle-0.052
oc-0.047
 grizz-0.046
 shortcut-0.046
 gru-0.044
Token
Token ID8151
Feature activation+8.94e-01

Pos logit contributions
 herself+0.064
ena+0.054
 themselves+0.053
 shapes+0.052
 brushes+0.051
Neg logit contributions
 sle-0.048
 spe-0.041
 grizz-0.041
 shortcut-0.040
oc-0.039
Tokens
Token ID82
Feature activation-7.63e+00

Pos logit contributions
 sle+0.024
 spe+0.021
 grizz+0.021
 shortcut+0.021
oc+0.020
Neg logit contributions
 herself-0.026
ena-0.022
 themselves-0.021
 shapes-0.021
 brushes-0.020
Token bake
Token ID28450
Feature activation-2.32e+00

Pos logit contributions
 herself+0.237
ena+0.218
iece+0.210
 styles+0.204
 shapes+0.200
Neg logit contributions
 sle-0.163
 shortcut-0.150
 nut-0.150
 tow-0.142
 hunter-0.138
Token the
Token ID262
Feature activation-1.15e+00

Pos logit contributions
 herself+0.052
ena+0.047
 brushes+0.042
iece+0.042
ina+0.042
Neg logit contributions
 sle-0.069
oc-0.062
 nut-0.060
 shortcut-0.060
 spe-0.060
Token cake
Token ID12187
Feature activation-1.64e+00

Pos logit contributions
 herself+0.035
ena+0.030
 themselves+0.030
iece+0.027
 styles+0.026
Neg logit contributions
 sle-0.032
 spe-0.026
 grizz-0.026
oc-0.025
 nut-0.025
Token and
Token ID290
Feature activation-1.10e+00

Pos logit contributions
 herself+0.047
ena+0.042
iece+0.039
 themselves+0.039
 rhythms+0.038
Neg logit contributions
 sle-0.040
oc-0.035
 spe-0.034
 nut-0.033
 feast-0.032
Token celebrate
Token ID10648
Feature activation-1.56e+00

Pos logit contributions
 herself+0.032
ena+0.029
iece+0.028
 styles+0.026
 themselves+0.026
Neg logit contributions
 sle-0.027
 spe-0.025
 nut-0.023
oc-0.022
 grizz-0.022
Token Tim
Token ID5045
Feature activation-1.90e+00

Pos logit contributions
 herself+0.058
ena+0.054
 styles+0.053
 brushes+0.051
 shapes+0.051
Neg logit contributions
 sle-0.032
 nut-0.029
 shortcut-0.029
 hunter-0.029
 grizz-0.028
Tokenmy
Token ID1820
Feature activation-2.39e+00

Pos logit contributions
 herself+0.056
 themselves+0.049
 shapes+0.045
 styles+0.044
 brushes+0.043
Neg logit contributions
 sle-0.050
 grizz-0.046
 shortcut-0.044
 polar-0.043
 hunter-0.043
Token started
Token ID2067
Feature activation-3.59e+00

Pos logit contributions
 herself+0.078
 themselves+0.069
 styles+0.065
 shapes+0.064
 brushes+0.062
Neg logit contributions
 sle-0.056
 grizz-0.051
 hunter-0.050
 polar-0.049
oc-0.048
Token to
Token ID284
Feature activation+7.49e-01

Pos logit contributions
 herself+0.117
ena+0.106
 themselves+0.103
 shapes+0.100
iece+0.100
Neg logit contributions
 sle-0.079
 grizz-0.067
 shortcut-0.067
 spe-0.065
 guarding-0.062
Token mis
Token ID2984
Feature activation+1.15e+00

Pos logit contributions
 sle+0.017
 shortcut+0.015
 grizz+0.015
 spe+0.014
 polar+0.014
Neg logit contributions
 herself-0.024
ena-0.022
 themselves-0.021
 shapes-0.020
iece-0.020
Tokenbe
Token ID1350
Feature activation-7.57e+00

Pos logit contributions
 sle+0.035
 shortcut+0.031
 spe+0.031
oc+0.031
 grizz+0.030
Neg logit contributions
 herself-0.029
ena-0.023
 themselves-0.023
 shapes-0.023
 brushes-0.022
Tokenhave
Token ID14150
Feature activation-3.23e+00

Pos logit contributions
 herself+0.207
 themselves+0.176
mons+0.176
 brushes+0.176
ena+0.164
Neg logit contributions
 sle-0.172
 gru-0.157
run-0.157
 shortcut-0.156
oc-0.154
Token.
Token ID13
Feature activation+2.59e-01

Pos logit contributions
 herself+0.089
ena+0.080
 brushes+0.079
ina+0.079
 themselves+0.077
Neg logit contributions
 shortcut-0.074
 victorious-0.072
 sle-0.071
oc-0.068
 gru-0.068
Token He
Token ID679
Feature activation-2.83e-01

Pos logit contributions
 sle+0.005
 spe+0.004
 grizz+0.004
 shortcut+0.004
oc+0.004
Neg logit contributions
 herself-0.010
ena-0.008
 themselves-0.008
 shapes-0.008
 brushes-0.008
Token didn
Token ID1422
Feature activation-1.54e+00

Pos logit contributions
 herself+0.009
 themselves+0.008
iece+0.007
 shapes+0.007
 styles+0.007
Neg logit contributions
 sle-0.007
 grizz-0.006
 hunter-0.006
 polar-0.006
 spe-0.006
Token't
Token ID470
Feature activation+1.24e+00

Pos logit contributions
 herself+0.047
 themselves+0.038
ena+0.038
 shapes+0.036
 brushes+0.035
Neg logit contributions
 sle-0.041
 spe-0.036
 grizz-0.035
 shortcut-0.034
oc-0.034
Token cannot
Token ID2314
Feature activation-1.01e+00

Pos logit contributions
 herself+0.010
ena+0.009
 themselves+0.009
 shapes+0.009
iece+0.008
Neg logit contributions
 sle-0.005
 spe-0.004
 shortcut-0.004
 grizz-0.004
oc-0.004
Token draw
Token ID3197
Feature activation-1.96e+00

Pos logit contributions
 herself+0.028
 themselves+0.025
iece+0.025
ena+0.024
 styles+0.022
Neg logit contributions
 sle-0.023
use-0.022
 nut-0.022
 shortcut-0.022
 tow-0.021
Token on
Token ID319
Feature activation-6.44e-01

Pos logit contributions
 herself+0.051
ena+0.047
 brushes+0.043
 themselves+0.043
era+0.042
Neg logit contributions
 sle-0.049
 nut-0.044
oc-0.044
 grizz-0.043
 spe-0.043
Token everything
Token ID2279
Feature activation-1.48e+00

Pos logit contributions
 herself+0.017
ena+0.016
iece+0.015
ina+0.015
era+0.014
Neg logit contributions
 sle-0.016
 nut-0.015
 spe-0.014
oc-0.014
 grizz-0.013
Token.
Token ID13
Feature activation-2.63e-01

Pos logit contributions
 herself+0.039
ena+0.033
 themselves+0.031
 shapes+0.030
 brushes+0.030
Neg logit contributions
 sle-0.042
 shortcut-0.037
 spe-0.037
 grizz-0.036
oc-0.036
Token Some
Token ID2773
Feature activation-7.48e+00

Pos logit contributions
 herself+0.009
 shapes+0.008
 themselves+0.008
ena+0.008
 styles+0.008
Neg logit contributions
 sle-0.005
 grizz-0.005
oc-0.004
 spe-0.004
 gru-0.004
Token things
Token ID1243
Feature activation-2.24e+00

Pos logit contributions
 herself+0.206
ena+0.187
iece+0.164
 themselves+0.163
era+0.162
Neg logit contributions
 sle-0.183
 spe-0.176
 nut-0.169
use-0.167
 shortcut-0.163
Token are
Token ID389
Feature activation-1.40e+00

Pos logit contributions
 herself+0.065
ena+0.057
 themselves+0.053
iece+0.052
 shapes+0.052
Neg logit contributions
 sle-0.057
 shortcut-0.052
 spe-0.050
 nut-0.048
 grizz-0.048
Token not
Token ID407
Feature activation+2.78e+00

Pos logit contributions
 herself+0.044
ena+0.037
iece+0.037
 themselves+0.036
 styles+0.035
Neg logit contributions
 sle-0.033
 spe-0.030
 shortcut-0.029
 grizz-0.028
 nut-0.027
Token for
Token ID329
Feature activation-1.16e+00

Pos logit contributions
 sle+0.061
 spe+0.055
 shortcut+0.054
 nut+0.053
 guarding+0.052
Neg logit contributions
 herself-0.086
ena-0.073
iece-0.072
era-0.071
 themselves-0.069
Token drawing
Token ID8263
Feature activation-3.45e+00

Pos logit contributions
 herself+0.030
ena+0.027
iece+0.027
ina+0.025
era+0.025
Neg logit contributions
 sle-0.029
 guarding-0.025
 spe-0.025
 shortcut-0.025
 fishing-0.025
Token a
Token ID257
Feature activation-1.43e+00

Pos logit contributions
 herself+0.045
ina+0.042
iece+0.041
ena+0.039
ant+0.037
Neg logit contributions
 spe-0.034
 sle-0.033
 shady-0.031
 shortcut-0.028
 nut-0.028
Token lot
Token ID1256
Feature activation+8.16e-01

Pos logit contributions
 herself+0.036
 shapes+0.033
 brushes+0.032
ena+0.031
 hair+0.031
Neg logit contributions
 sle-0.039
 grizz-0.037
run-0.036
oc-0.035
 gru-0.035
Token of
Token ID286
Feature activation-5.21e-01

Pos logit contributions
 sle+0.009
 spe+0.008
 tow+0.008
oc+0.007
 shortcut+0.007
Neg logit contributions
 herself-0.033
ena-0.029
iece-0.029
 themselves-0.029
ina-0.029
Token toys
Token ID14958
Feature activation-3.22e+00

Pos logit contributions
 herself+0.015
ena+0.013
iece+0.013
 themselves+0.013
ina+0.013
Neg logit contributions
 sle-0.013
 spe-0.012
 shortcut-0.011
 grizz-0.011
 nut-0.011
Token.
Token ID13
Feature activation-7.33e-01

Pos logit contributions
 herself+0.094
ena+0.077
 styles+0.075
 shapes+0.074
 themselves+0.074
Neg logit contributions
 sle-0.079
 spe-0.075
 shortcut-0.070
 guarding-0.069
 nut-0.067
Token Some
Token ID2773
Feature activation-7.47e+00

Pos logit contributions
 herself+0.020
ena+0.016
 themselves+0.015
 shapes+0.015
 styles+0.015
Neg logit contributions
 sle-0.021
 spe-0.019
 shortcut-0.018
 grizz-0.018
oc-0.018
Token are
Token ID389
Feature activation-2.53e+00

Pos logit contributions
 herself+0.206
ina+0.185
ena+0.183
iece+0.174
era+0.165
Neg logit contributions
 sle-0.192
 spe-0.176
 nut-0.159
use-0.156
 shortcut-0.155
Token old
Token ID1468
Feature activation-1.15e+00

Pos logit contributions
 herself+0.081
iece+0.068
ena+0.067
 themselves+0.067
 styles+0.063
Neg logit contributions
 sle-0.060
 spe-0.054
 shortcut-0.050
 grizz-0.049
 gru-0.048
Token and
Token ID290
Feature activation+4.19e-01

Pos logit contributions
 herself+0.037
iece+0.032
ena+0.031
 themselves+0.029
ina+0.029
Neg logit contributions
 sle-0.027
 spe-0.024
 nut-0.020
use-0.020
 grizz-0.020
Token some
Token ID617
Feature activation-3.40e+00

Pos logit contributions
 sle+0.012
 spe+0.012
 shortcut+0.011
 grizz+0.011
 gru+0.011
Neg logit contributions
 herself-0.011
ena-0.009
iece-0.008
 themselves-0.008
ina-0.008
Token are
Token ID389
Feature activation-1.97e+00

Pos logit contributions
 herself+0.095
ena+0.083
iece+0.076
 themselves+0.075
ina+0.074
Neg logit contributions
 sle-0.094
 spe-0.084
 shortcut-0.080
 nut-0.078
oc-0.077
Token swimming
Token ID14899
Feature activation-1.86e-01

Pos logit contributions
 herself+0.062
ena+0.052
 styles+0.049
 themselves+0.049
 brushes+0.049
Neg logit contributions
 sle-0.059
 spe-0.051
 grizz-0.051
 shortcut-0.051
oc-0.050
Token and
Token ID290
Feature activation-1.20e+00

Pos logit contributions
 herself+0.005
ena+0.004
 themselves+0.004
 shapes+0.004
 brushes+0.004
Neg logit contributions
 sle-0.006
 spe-0.005
 grizz-0.005
 shortcut-0.005
oc-0.005
Token spl
Token ID4328
Feature activation-9.15e-01

Pos logit contributions
 herself+0.038
ena+0.035
 styles+0.031
ina+0.031
iece+0.031
Neg logit contributions
 sle-0.027
 spe-0.026
 grizz-0.024
 polar-0.024
 nut-0.024
Tokenashing
Token ID2140
Feature activation-3.08e+00

Pos logit contributions
 herself+0.029
ena+0.024
 shapes+0.023
 themselves+0.023
 styles+0.022
Neg logit contributions
 sle-0.022
oc-0.019
 spe-0.019
 shortcut-0.019
run-0.019
Token.
Token ID13
Feature activation-7.37e-01

Pos logit contributions
 herself+0.089
ena+0.085
 brushes+0.074
ina+0.072
 styles+0.071
Neg logit contributions
 sle-0.072
 spe-0.068
 nut-0.066
 shortcut-0.065
 gru-0.064
Token Some
Token ID2773
Feature activation-7.46e+00

Pos logit contributions
 herself+0.020
ena+0.016
 shapes+0.016
 themselves+0.016
 styles+0.015
Neg logit contributions
 sle-0.021
 spe-0.019
 grizz-0.019
 shortcut-0.018
oc-0.018
Token fish
Token ID5916
Feature activation-2.42e+00

Pos logit contributions
 herself+0.229
ena+0.195
 themselves+0.178
iece+0.176
 brushes+0.173
Neg logit contributions
 sle-0.189
 spe-0.173
 shortcut-0.164
 grizz-0.163
 nut-0.162
Token were
Token ID547
Feature activation-1.70e+00

Pos logit contributions
 herself+0.066
ena+0.054
 themselves+0.051
 shapes+0.050
 styles+0.050
Neg logit contributions
 sle-0.069
 spe-0.063
 grizz-0.061
 shortcut-0.061
oc-0.060
Token big
Token ID1263
Feature activation+1.49e-01

Pos logit contributions
 herself+0.056
ena+0.047
 themselves+0.047
 shapes+0.046
 brushes+0.044
Neg logit contributions
 sle-0.040
 grizz-0.034
 spe-0.034
 shortcut-0.034
oc-0.033
Token and
Token ID290
Feature activation+1.35e-01

Pos logit contributions
 sle+0.005
 shortcut+0.005
 grizz+0.004
 spe+0.004
 gru+0.004
Neg logit contributions
 herself-0.003
ena-0.003
 themselves-0.003
 shapes-0.003
 brushes-0.002
Token some
Token ID617
Feature activation-3.50e+00

Pos logit contributions
 sle+0.004
 spe+0.003
 grizz+0.003
 shortcut+0.003
 gru+0.003
Neg logit contributions
 herself-0.004
ena-0.003
 themselves-0.003
 shapes-0.003
 styles-0.003
Token different
Token ID1180
Feature activation-4.53e+00

Pos logit contributions
 sle+0.034
 spe+0.031
 shortcut+0.029
 grizz+0.029
 polar+0.028
Neg logit contributions
 herself-0.040
ena-0.033
 themselves-0.032
iece-0.031
 styles-0.030
Token colors
Token ID7577
Feature activation-2.94e+00

Pos logit contributions
 herself+0.114
ena+0.095
iece+0.085
Her+0.083
mons+0.079
Neg logit contributions
 sle-0.132
 spe-0.125
 grizz-0.111
 polar-0.111
oc-0.110
Token and
Token ID290
Feature activation-7.94e-02

Pos logit contributions
 herself+0.083
ena+0.074
 themselves+0.068
 styles+0.067
 brushes+0.066
Neg logit contributions
 sle-0.073
 spe-0.068
 shortcut-0.060
 guarding-0.060
 nut-0.060
Token shapes
Token ID15268
Feature activation-4.18e+00

Pos logit contributions
 herself+0.002
ena+0.002
iece+0.001
era+0.001
mons+0.001
Neg logit contributions
 sle-0.003
 spe-0.002
 shortcut-0.002
 polar-0.002
 grizz-0.002
Token.
Token ID13
Feature activation-9.65e-01

Pos logit contributions
 herself+0.120
ena+0.109
 brushes+0.098
 styles+0.098
 themselves+0.097
Neg logit contributions
 sle-0.101
 spe-0.097
 guarding-0.086
 shortcut-0.084
 nut-0.083
Token Some
Token ID2773
Feature activation-7.46e+00

Pos logit contributions
 herself+0.027
ena+0.022
 themselves+0.021
 shapes+0.021
 styles+0.021
Neg logit contributions
 sle-0.026
 spe-0.023
 grizz-0.023
 shortcut-0.023
oc-0.022
Token looked
Token ID3114
Feature activation+1.52e+00

Pos logit contributions
 herself+0.213
ena+0.170
iece+0.154
 themselves+0.153
ina+0.149
Neg logit contributions
 sle-0.209
 spe-0.202
 shortcut-0.180
 tow-0.179
oc-0.179
Token like
Token ID588
Feature activation-5.73e-01

Pos logit contributions
 sle+0.044
 spe+0.040
 grizz+0.038
 shortcut+0.038
 gru+0.037
Neg logit contributions
 herself-0.041
ena-0.034
 themselves-0.032
iece-0.030
 styles-0.030
Token stars
Token ID5788
Feature activation-9.04e-03

Pos logit contributions
 herself+0.016
ena+0.014
iece+0.013
 themselves+0.012
era+0.012
Neg logit contributions
 sle-0.016
 spe-0.015
 shortcut-0.014
 grizz-0.013
 polar-0.013
Token,
Token ID11
Feature activation-2.14e+00

Pos logit contributions
 herself+0.000
ena+0.000
 themselves+0.000
 styles+0.000
 brushes+0.000
Neg logit contributions
 sle-0.000
 spe-0.000
 shortcut-0.000
 grizz-0.000
oc-0.000
Token some
Token ID617
Feature activation-3.83e+00

Pos logit contributions
 herself+0.048
ena+0.044
ina+0.041
ups+0.039
era+0.038
Neg logit contributions
 sle-0.061
 spe-0.061
 grizz-0.053
 gru-0.052
 polar-0.051
Token play
Token ID711
Feature activation-1.62e+00

Pos logit contributions
 herself+0.001
ena+0.001
 themselves+0.001
 shapes+0.001
 brushes+0.001
Neg logit contributions
 sle-0.001
 spe-0.001
 grizz-0.001
 shortcut-0.001
oc-0.001
Token in
Token ID287
Feature activation-5.81e-01

Pos logit contributions
 herself+0.042
ena+0.035
 themselves+0.034
 shapes+0.033
 brushes+0.033
Neg logit contributions
 sle-0.048
 spe-0.043
 shortcut-0.043
 grizz-0.043
oc-0.042
Token the
Token ID262
Feature activation-1.61e+00

Pos logit contributions
 herself+0.014
ena+0.011
 themselves+0.011
 shapes+0.011
 brushes+0.010
Neg logit contributions
 sle-0.019
 grizz-0.017
 spe-0.017
 shortcut-0.017
oc-0.016
Token pool
Token ID5933
Feature activation-1.85e+00

Pos logit contributions
 herself+0.041
 shapes+0.038
 themselves+0.036
 clothes+0.035
 hair+0.034
Neg logit contributions
 sle-0.045
 spe-0.043
run-0.041
oc-0.041
 grizz-0.041
Token.
Token ID13
Feature activation-3.29e-01

Pos logit contributions
 herself+0.056
ena+0.048
 brushes+0.046
 themselves+0.046
 shapes+0.045
Neg logit contributions
 sle-0.045
oc-0.039
 shortcut-0.039
 spe-0.039
 grizz-0.038
Token Some
Token ID2773
Feature activation-7.44e+00

Pos logit contributions
 herself+0.010
ena+0.008
 themselves+0.008
 shapes+0.008
 brushes+0.008
Neg logit contributions
 sle-0.009
 spe-0.007
 shortcut-0.007
 grizz-0.007
 gru-0.007
Token things
Token ID1243
Feature activation-2.28e+00

Pos logit contributions
 herself+0.209
ena+0.179
iece+0.160
ina+0.159
 themselves+0.157
Neg logit contributions
 sle-0.203
 shortcut-0.186
 spe-0.185
 nut-0.182
oc-0.177
Token are
Token ID389
Feature activation-1.64e+00

Pos logit contributions
 herself+0.064
ena+0.055
 themselves+0.052
 shapes+0.051
iece+0.050
Neg logit contributions
 sle-0.062
 shortcut-0.055
 spe-0.055
 grizz-0.053
 nut-0.052
Token dangerous
Token ID4923
Feature activation-5.92e-01

Pos logit contributions
 herself+0.050
ena+0.042
 themselves+0.041
 shapes+0.040
 brushes+0.039
Neg logit contributions
 sle-0.043
 grizz-0.037
 spe-0.037
 shortcut-0.036
oc-0.035
Token."
Token ID526
Feature activation-2.22e+00

Pos logit contributions
 herself+0.016
ena+0.014
 themselves+0.013
 brushes+0.013
 shapes+0.013
Neg logit contributions
 sle-0.016
 shortcut-0.014
 grizz-0.014
 spe-0.014
oc-0.014
Token<|endoftext|>
Token ID50256
Feature activation+1.07e+00

Pos logit contributions
 herself+0.067
ena+0.055
 themselves+0.055
 shapes+0.053
 brushes+0.051
Neg logit contributions
 sle-0.059
 grizz-0.052
 spe-0.052
 shortcut-0.052
oc-0.049
Token of
Token ID286
Feature activation+1.17e+00

Pos logit contributions
 herself+0.060
ena+0.052
 brushes+0.049
 styles+0.048
 themselves+0.048
Neg logit contributions
 sle-0.066
 shortcut-0.058
 spe-0.057
 grizz-0.057
 gru-0.056
Token what
Token ID644
Feature activation+1.18e-01

Pos logit contributions
 sle+0.034
 grizz+0.030
 nut+0.030
 spe+0.030
 poisonous+0.029
Neg logit contributions
 herself-0.026
iece-0.025
ena-0.024
 brushes-0.023
 styles-0.021
Token you
Token ID345
Feature activation+8.28e-01

Pos logit contributions
 sle+0.003
 shortcut+0.002
 nut+0.002
 spe+0.002
use+0.002
Neg logit contributions
 herself-0.004
ena-0.003
iece-0.003
 themselves-0.003
 styles-0.003
Token eat
Token ID4483
Feature activation-1.18e+00

Pos logit contributions
 sle+0.021
 shortcut+0.019
 grizz+0.019
 spe+0.019
oc+0.018
Neg logit contributions
 herself-0.024
ena-0.021
 themselves-0.020
 shapes-0.020
 styles-0.019
Token.
Token ID13
Feature activation-6.09e-01

Pos logit contributions
 herself+0.032
ena+0.029
 brushes+0.027
iece+0.026
 shapes+0.026
Neg logit contributions
 sle-0.031
oc-0.027
 shortcut-0.026
 nut-0.026
 spe-0.026
Token Some
Token ID2773
Feature activation-7.41e+00

Pos logit contributions
 herself+0.018
ena+0.015
 themselves+0.015
 shapes+0.014
 brushes+0.014
Neg logit contributions
 sle-0.016
 shortcut-0.014
 spe-0.014
 grizz-0.014
 gru-0.013
Token things
Token ID1243
Feature activation-2.29e+00

Pos logit contributions
 herself+0.203
ena+0.187
iece+0.172
 themselves+0.166
era+0.159
Neg logit contributions
 sle-0.195
 nut-0.176
 spe-0.176
oc-0.176
 shortcut-0.171
Token are
Token ID389
Feature activation-1.44e+00

Pos logit contributions
 herself+0.066
ena+0.057
 themselves+0.053
 shapes+0.052
iece+0.052
Neg logit contributions
 sle-0.061
 shortcut-0.054
 spe-0.054
 grizz-0.052
 nut-0.050
Token not
Token ID407
Feature activation+2.45e+00

Pos logit contributions
 herself+0.041
ena+0.035
 themselves+0.033
iece+0.033
 shapes+0.033
Neg logit contributions
 sle-0.039
 spe-0.034
 grizz-0.033
 shortcut-0.033
 nut-0.032
Token good
Token ID922
Feature activation-4.74e-01

Pos logit contributions
 sle+0.062
 spe+0.055
 nut+0.055
 shortcut+0.054
 hunter+0.052
Neg logit contributions
 herself-0.071
ena-0.063
iece-0.060
 themselves-0.057
era-0.056
Token for
Token ID329
Feature activation-8.63e-01

Pos logit contributions
 herself+0.014
ena+0.013
 brushes+0.012
 themselves+0.012
iece+0.012
Neg logit contributions
 sle-0.011
 shortcut-0.009
 spe-0.009
 grizz-0.009
use-0.009
Token their
Token ID511
Feature activation-1.42e+00

Pos logit contributions
 herself+0.026
ena+0.024
ina+0.023
mons+0.023
iece+0.023
Neg logit contributions
 sle-0.024
 guarding-0.020
 shortcut-0.020
 spe-0.020
 grizz-0.020
Token moms
Token ID34015
Feature activation-7.69e-01

Pos logit contributions
 herself+0.034
ena+0.027
iece+0.027
 themselves+0.025
ina+0.024
Neg logit contributions
 sle-0.046
 shortcut-0.040
 spe-0.040
 nut-0.039
 grizz-0.039
Token and
Token ID290
Feature activation+8.52e-01

Pos logit contributions
 herself+0.020
ena+0.017
 brushes+0.017
 shapes+0.017
 themselves+0.017
Neg logit contributions
 sle-0.021
 spe-0.018
oc-0.017
 guarding-0.017
 shortcut-0.017
Token dads
Token ID47713
Feature activation+1.48e-01

Pos logit contributions
 sle+0.027
 shortcut+0.024
 spe+0.023
 nut+0.023
 hunter+0.023
Neg logit contributions
 herself-0.020
iece-0.016
ena-0.016
ina-0.015
 themselves-0.014
Token.
Token ID13
Feature activation-6.76e-01

Pos logit contributions
 sle+0.004
 guarding+0.003
 shortcut+0.003
 spe+0.003
oc+0.003
Neg logit contributions
 herself-0.004
ena-0.004
 brushes-0.003
 shapes-0.003
iece-0.003
Token Some
Token ID2773
Feature activation-7.39e+00

Pos logit contributions
 herself+0.019
ena+0.016
iece+0.016
 shapes+0.015
 themselves+0.015
Neg logit contributions
 sle-0.017
 grizz-0.016
 spe-0.015
 Nice-0.015
 tow-0.015
Token kids
Token ID3988
Feature activation-3.18e+00

Pos logit contributions
 herself+0.192
ena+0.160
iece+0.155
ina+0.151
 themselves+0.143
Neg logit contributions
 sle-0.202
 shortcut-0.189
 spe-0.182
 nut-0.178
 owns-0.171
Token jump
Token ID4391
Feature activation-2.75e-01

Pos logit contributions
 herself+0.094
ena+0.079
 themselves+0.078
iece+0.076
 shapes+0.075
Neg logit contributions
 sle-0.080
 shortcut-0.070
 spe-0.069
 grizz-0.068
use-0.065
Token on
Token ID319
Feature activation+6.73e-02

Pos logit contributions
 herself+0.008
ena+0.007
 styles+0.006
 themselves+0.006
 shapes+0.006
Neg logit contributions
 sle-0.007
 shortcut-0.007
 spe-0.007
 grizz-0.006
oc-0.006
Token a
Token ID257
Feature activation-2.26e+00

Pos logit contributions
 sle+0.002
 spe+0.002
 tow+0.002
 shortcut+0.001
 grizz+0.001
Neg logit contributions
 herself-0.002
ena-0.002
iece-0.002
ina-0.002
 styles-0.002
Token tr
Token ID491
Feature activation-2.91e-01

Pos logit contributions
 herself+0.069
 shapes+0.058
ena+0.058
 themselves+0.057
 brushes+0.056
Neg logit contributions
 sle-0.057
 grizz-0.051
 spe-0.049
 shortcut-0.048
oc-0.047
Token trains
Token ID13404
Feature activation-3.49e+00

Pos logit contributions
 herself+0.068
ena+0.050
 themselves+0.048
 shapes+0.045
 brushes+0.043
Neg logit contributions
 sle-0.127
 spe-0.114
 shortcut-0.114
 grizz-0.113
oc-0.111
Token at
Token ID379
Feature activation-2.12e+00

Pos logit contributions
 herself+0.096
ena+0.081
 shapes+0.078
 styles+0.077
 themselves+0.076
Neg logit contributions
 sle-0.095
 shortcut-0.086
 spe-0.080
 grizz-0.080
 guarding-0.079
Token the
Token ID262
Feature activation-9.03e-01

Pos logit contributions
 herself+0.054
ena+0.044
 themselves+0.042
 shapes+0.042
 brushes+0.041
Neg logit contributions
 sle-0.064
 shortcut-0.057
 spe-0.057
 grizz-0.056
oc-0.054
Token station
Token ID4429
Feature activation-1.75e+00

Pos logit contributions
 herself+0.030
 themselves+0.025
ena+0.025
 shapes+0.024
 brushes+0.024
Neg logit contributions
 sle-0.022
 shortcut-0.018
 grizz-0.018
 spe-0.018
oc-0.017
Token.
Token ID13
Feature activation-4.97e-01

Pos logit contributions
 herself+0.051
ena+0.041
 brushes+0.040
 styles+0.039
 themselves+0.039
Neg logit contributions
 sle-0.044
 shortcut-0.039
 gru-0.038
 spe-0.037
 tow-0.037
Token Some
Token ID2773
Feature activation-7.38e+00

Pos logit contributions
 herself+0.013
ena+0.011
 themselves+0.011
 shapes+0.010
 brushes+0.010
Neg logit contributions
 sle-0.015
 spe-0.013
 grizz-0.013
 shortcut-0.013
oc-0.012
Token are
Token ID389
Feature activation-2.84e+00

Pos logit contributions
 herself+0.189
 themselves+0.133
ena+0.133
iece+0.126
 shapes+0.123
Neg logit contributions
 sle-0.230
 shortcut-0.211
 spe-0.203
 tow-0.202
oc-0.200
Token fast
Token ID3049
Feature activation-5.03e-01

Pos logit contributions
 herself+0.098
iece+0.080
 themselves+0.079
ena+0.078
 styles+0.076
Neg logit contributions
 sle-0.064
 spe-0.056
 shortcut-0.055
 grizz-0.052
 gru-0.051
Token and
Token ID290
Feature activation+2.47e-04

Pos logit contributions
 herself+0.013
ena+0.011
 themselves+0.010
 shapes+0.010
 brushes+0.010
Neg logit contributions
 sle-0.015
 spe-0.014
 grizz-0.013
 shortcut-0.013
oc-0.013
Token some
Token ID617
Feature activation-3.41e+00

Pos logit contributions
 sle+0.000
 spe+0.000
 shortcut+0.000
 grizz+0.000
 gru+0.000
Neg logit contributions
 herself-0.000
ena-0.000
iece-0.000
 themselves-0.000
ina-0.000
Token are
Token ID389
Feature activation-2.26e+00

Pos logit contributions
 herself+0.092
ena+0.074
 themselves+0.071
 shapes+0.068
iece+0.067
Neg logit contributions
 sle-0.101
 shortcut-0.090
 spe-0.089
 grizz-0.085
oc-0.084
Token and
Token ID290
Feature activation-1.34e-01

Pos logit contributions
 herself+0.052
ena+0.043
 styles+0.040
 brushes+0.040
 themselves+0.039
Neg logit contributions
 sle-0.052
 spe-0.046
 shortcut-0.046
oc-0.045
 grizz-0.044
Token shells
Token ID19679
Feature activation+1.69e-01

Pos logit contributions
 herself+0.004
ena+0.003
ina+0.003
iece+0.003
 styles+0.003
Neg logit contributions
 sle-0.004
 spe-0.004
 polar-0.003
 nut-0.003
 grizz-0.003
Token and
Token ID290
Feature activation-9.46e-01

Pos logit contributions
 sle+0.005
 spe+0.004
 shortcut+0.004
 grizz+0.004
 nut+0.004
Neg logit contributions
 herself-0.004
ena-0.004
 styles-0.003
 shapes-0.003
 brushes-0.003
Token coral
Token ID29537
Feature activation-2.26e+00

Pos logit contributions
 herself+0.027
ena+0.023
ina+0.022
iece+0.021
era+0.020
Neg logit contributions
 sle-0.027
 spe-0.025
 polar-0.023
 nut-0.023
 hunter-0.023
Token.
Token ID13
Feature activation-6.07e-01

Pos logit contributions
 herself+0.059
ena+0.050
 themselves+0.045
 brushes+0.045
 styles+0.044
Neg logit contributions
 sle-0.067
 spe-0.061
 shortcut-0.059
 grizz-0.058
oc-0.057
Token Some
Token ID2773
Feature activation-7.38e+00

Pos logit contributions
 herself+0.017
 themselves+0.014
ena+0.014
 shapes+0.013
 brushes+0.013
Neg logit contributions
 sle-0.018
 grizz-0.016
 shortcut-0.015
 spe-0.015
 gru-0.015
Token of
Token ID286
Feature activation-7.11e-01

Pos logit contributions
 herself+0.242
ena+0.201
 themselves+0.188
 styles+0.186
iece+0.183
Neg logit contributions
 sle-0.173
 spe-0.157
 shortcut-0.154
 nut-0.148
 hunter-0.147
Token the
Token ID262
Feature activation-1.20e+00

Pos logit contributions
 herself+0.019
ena+0.017
iece+0.016
 styles+0.015
 brushes+0.015
Neg logit contributions
 sle-0.019
 spe-0.018
 shortcut-0.017
 polar-0.017
 tow-0.017
Token coral
Token ID29537
Feature activation-2.45e+00

Pos logit contributions
 herself+0.043
ena+0.037
 themselves+0.035
 styles+0.034
 brushes+0.033
Neg logit contributions
 sle-0.027
 shortcut-0.022
 spe-0.022
 hunter-0.022
 nut-0.021
Token was
Token ID373
Feature activation-1.14e+00

Pos logit contributions
 herself+0.070
ena+0.059
 themselves+0.054
 styles+0.053
 brushes+0.053
Neg logit contributions
 sle-0.067
 spe-0.060
 shortcut-0.059
oc-0.057
 grizz-0.057
Token broken
Token ID5445
Feature activation-6.05e-01

Pos logit contributions
 herself+0.036
ena+0.030
 themselves+0.030
 shapes+0.029
 brushes+0.028
Neg logit contributions
 sle-0.028
 grizz-0.024
 spe-0.024
 shortcut-0.024
oc-0.024
Token things
Token ID1243
Feature activation-2.42e+00

Pos logit contributions
 herself+0.122
ena+0.099
 themselves+0.091
 brushes+0.087
iece+0.083
Neg logit contributions
 sle-0.175
 spe-0.156
 shortcut-0.154
oc-0.151
 grizz-0.151
Token in
Token ID287
Feature activation-5.62e-02

Pos logit contributions
 herself+0.062
ena+0.051
 themselves+0.049
 shapes+0.048
 brushes+0.046
Neg logit contributions
 sle-0.073
 spe-0.065
 shortcut-0.064
 grizz-0.063
oc-0.061
Token the
Token ID262
Feature activation-1.80e+00

Pos logit contributions
 herself+0.002
ena+0.001
 themselves+0.001
 shapes+0.001
 brushes+0.001
Neg logit contributions
 sle-0.002
 spe-0.001
 grizz-0.001
 shortcut-0.001
oc-0.001
Token universe
Token ID6881
Feature activation-2.28e+00

Pos logit contributions
 herself+0.057
ena+0.048
 themselves+0.047
 shapes+0.047
 brushes+0.045
Neg logit contributions
 sle-0.044
 spe-0.038
 grizz-0.038
 shortcut-0.038
oc-0.036
Token.
Token ID13
Feature activation-5.70e-01

Pos logit contributions
 herself+0.062
ena+0.051
 themselves+0.049
 brushes+0.048
 shapes+0.047
Neg logit contributions
 sle-0.065
 spe-0.057
oc-0.056
 shortcut-0.056
 grizz-0.055
Token Some
Token ID2773
Feature activation-7.37e+00

Pos logit contributions
 herself+0.017
ena+0.014
 themselves+0.014
 shapes+0.014
 brushes+0.013
Neg logit contributions
 sle-0.015
 spe-0.013
 shortcut-0.013
 grizz-0.013
 gru-0.012
Token of
Token ID286
Feature activation-8.74e-01

Pos logit contributions
 herself+0.202
ena+0.171
 themselves+0.157
iece+0.150
era+0.149
Neg logit contributions
 sle-0.211
 spe-0.189
 shortcut-0.185
oc-0.180
 hunter-0.177
Token them
Token ID606
Feature activation-2.17e+00

Pos logit contributions
 herself+0.022
ena+0.019
iece+0.018
 brushes+0.017
 styles+0.017
Neg logit contributions
 sle-0.026
 spe-0.023
 shortcut-0.023
 grizz-0.022
 polar-0.022
Token are
Token ID389
Feature activation-2.09e+00

Pos logit contributions
 herself+0.067
ena+0.059
 themselves+0.056
iece+0.054
 styles+0.053
Neg logit contributions
 sle-0.052
 spe-0.045
 shortcut-0.044
 grizz-0.042
use-0.042
Token very
Token ID845
Feature activation+3.93e-01

Pos logit contributions
 herself+0.066
ena+0.056
 themselves+0.054
iece+0.054
 styles+0.053
Neg logit contributions
 sle-0.051
 spe-0.045
 grizz-0.043
 shortcut-0.042
 polar-0.041
Token far
Token ID1290
Feature activation+4.01e+00

Pos logit contributions
 sle+0.009
 spe+0.008
 grizz+0.008
 shortcut+0.008
oc+0.008
Neg logit contributions
 herself-0.013
ena-0.011
 themselves-0.011
 shapes-0.010
 brushes-0.010
Token and
Token ID290
Feature activation-9.60e-01

Pos logit contributions
 herself+0.090
ena+0.076
 styles+0.073
 themselves+0.072
 rhythms+0.071
Neg logit contributions
 sle-0.081
 spe-0.074
 shortcut-0.070
 grizz-0.070
oc-0.068
Token looked
Token ID3114
Feature activation+1.77e+00

Pos logit contributions
 herself+0.028
ina+0.024
ena+0.023
 styles+0.023
 rhythms+0.023
Neg logit contributions
 sle-0.024
 spe-0.022
 polar-0.020
 grizz-0.019
 guarding-0.019
Token at
Token ID379
Feature activation+9.51e-01

Pos logit contributions
 sle+0.057
 grizz+0.051
 shortcut+0.051
 spe+0.051
oc+0.050
Neg logit contributions
 herself-0.043
 themselves-0.034
 shapes-0.033
ena-0.033
 brushes-0.032
Token them
Token ID606
Feature activation-1.40e+00

Pos logit contributions
 sle+0.027
 spe+0.024
 polar+0.024
 tow+0.023
 shortcut+0.023
Neg logit contributions
 herself-0.022
ena-0.020
iece-0.020
 styles-0.018
ups-0.018
Token.
Token ID13
Feature activation-7.97e-01

Pos logit contributions
 herself+0.039
ena+0.037
 brushes+0.033
 styles+0.032
ina+0.032
Neg logit contributions
 sle-0.035
 spe-0.031
 shortcut-0.029
 guarding-0.029
oc-0.028
Token Some
Token ID2773
Feature activation-7.36e+00

Pos logit contributions
 herself+0.023
ena+0.020
 shapes+0.019
 styles+0.019
 themselves+0.019
Neg logit contributions
 sle-0.020
 spe-0.018
 grizz-0.018
 shortcut-0.018
 gru-0.017
Token of
Token ID286
Feature activation-6.25e-01

Pos logit contributions
 herself+0.235
ena+0.196
 themselves+0.190
iece+0.187
ina+0.186
Neg logit contributions
 sle-0.160
 spe-0.159
 tow-0.138
 shortcut-0.136
oc-0.135
Token them
Token ID606
Feature activation-1.95e+00

Pos logit contributions
 herself+0.016
ena+0.014
iece+0.014
 styles+0.013
 brushes+0.013
Neg logit contributions
 sle-0.017
 spe-0.015
 tow-0.015
 polar-0.015
 shortcut-0.014
Token were
Token ID547
Feature activation-1.59e+00

Pos logit contributions
 herself+0.061
ena+0.050
 themselves+0.050
iece+0.050
 styles+0.048
Neg logit contributions
 spe-0.043
 sle-0.043
 grizz-0.038
oc-0.038
 guarding-0.038
Token still
Token ID991
Feature activation-4.50e-01

Pos logit contributions
 herself+0.050
ena+0.041
 themselves+0.041
iece+0.040
 styles+0.040
Neg logit contributions
 sle-0.039
 spe-0.035
 grizz-0.033
 shortcut-0.032
 gru-0.032
Token dry
Token ID5894
Feature activation-1.09e+00

Pos logit contributions
 herself+0.014
ena+0.012
 themselves+0.011
iece+0.011
 styles+0.011
Neg logit contributions
 sle-0.011
 spe-0.010
 grizz-0.009
 polar-0.009
 gru-0.009
Token They
Token ID1119
Feature activation-1.21e+00

Pos logit contributions
 herself+0.016
ena+0.013
 themselves+0.012
 shapes+0.012
 brushes+0.012
Neg logit contributions
 sle-0.018
 spe-0.016
 shortcut-0.016
 grizz-0.016
 gru-0.016
Token see
Token ID766
Feature activation-3.92e-01

Pos logit contributions
 herself+0.036
ena+0.030
iece+0.029
 shapes+0.029
 styles+0.029
Neg logit contributions
 sle-0.030
 spe-0.027
 shortcut-0.026
 hunter-0.026
 grizz-0.026
Token many
Token ID867
Feature activation-3.24e+00

Pos logit contributions
 herself+0.010
ena+0.009
iece+0.008
ina+0.008
mons+0.008
Neg logit contributions
 sle-0.010
 polar-0.009
 spe-0.009
 gru-0.009
 grizz-0.009
Token flowers
Token ID12734
Feature activation-1.25e+00

Pos logit contributions
 herself+0.073
ena+0.057
 themselves+0.054
iece+0.052
 brushes+0.047
Neg logit contributions
 sle-0.111
 spe-0.101
 shortcut-0.099
 grizz-0.098
oc-0.095
Token.
Token ID13
Feature activation-5.63e-01

Pos logit contributions
 herself+0.034
ena+0.029
 themselves+0.027
 shapes+0.027
 styles+0.026
Neg logit contributions
 sle-0.036
 spe-0.032
 shortcut-0.031
 grizz-0.031
 nut-0.030
Token Some
Token ID2773
Feature activation-7.36e+00

Pos logit contributions
 herself+0.014
ena+0.012
 themselves+0.011
 shapes+0.011
 brushes+0.011
Neg logit contributions
 sle-0.017
 spe-0.015
 grizz-0.015
 shortcut-0.015
oc-0.015
Token are
Token ID389
Feature activation-3.05e+00

Pos logit contributions
 herself+0.164
ena+0.151
iece+0.143
 themselves+0.126
era+0.125
Neg logit contributions
 spe-0.229
 sle-0.223
 nut-0.199
 shortcut-0.197
use-0.191
Token red
Token ID2266
Feature activation-2.34e+00

Pos logit contributions
 herself+0.101
iece+0.090
ena+0.088
 themselves+0.085
 styles+0.082
Neg logit contributions
 sle-0.066
 spe-0.062
 shortcut-0.054
 guarding-0.053
 grizz-0.053
Token,
Token ID11
Feature activation-5.40e-01

Pos logit contributions
 herself+0.069
ena+0.068
iece+0.061
 themselves+0.058
 rhythms+0.058
Neg logit contributions
 sle-0.055
 spe-0.052
 nut-0.047
oc-0.044
use-0.043
Token some
Token ID617
Feature activation-2.83e+00

Pos logit contributions
 herself+0.012
ena+0.011
ina+0.010
iece+0.010
 styles+0.009
Neg logit contributions
 spe-0.017
 sle-0.016
 nut-0.014
 gru-0.014
 grizz-0.014
Token are
Token ID389
Feature activation-2.19e+00

Pos logit contributions
 herself+0.079
ena+0.072
iece+0.066
 themselves+0.064
ina+0.061
Neg logit contributions
 sle-0.075
 spe-0.071
 shortcut-0.064
 nut-0.064
 tow-0.060
Token many
Token ID867
Feature activation-3.27e+00

Pos logit contributions
 sle+0.011
 spe+0.010
 shortcut+0.010
 tow+0.010
 hunter+0.009
Neg logit contributions
 herself-0.009
ina-0.008
ena-0.007
 styles-0.007
iece-0.007
Token pictures
Token ID5986
Feature activation-2.51e+00

Pos logit contributions
 herself+0.062
ena+0.046
 themselves+0.044
iece+0.039
 shapes+0.039
Neg logit contributions
 sle-0.121
 spe-0.111
 shortcut-0.110
 grizz-0.109
oc-0.107
Token and
Token ID290
Feature activation-1.27e+00

Pos logit contributions
 herself+0.077
ena+0.066
 themselves+0.063
 styles+0.062
 shapes+0.061
Neg logit contributions
 sle-0.058
 spe-0.051
 shortcut-0.051
 tow-0.048
 guarding-0.048
Token sculptures
Token ID42645
Feature activation-2.98e+00

Pos logit contributions
 herself+0.028
ina+0.026
ena+0.024
bed+0.022
mons+0.022
Neg logit contributions
 sle-0.036
 spe-0.036
 shortcut-0.034
 gru-0.033
 victorious-0.032
Token.
Token ID13
Feature activation-6.76e-01

Pos logit contributions
 herself+0.080
ena+0.068
 themselves+0.062
 styles+0.061
iece+0.061
Neg logit contributions
 sle-0.082
 spe-0.074
 shortcut-0.072
 nut-0.069
 grizz-0.069
Token Some
Token ID2773
Feature activation-7.35e+00

Pos logit contributions
 herself+0.021
ena+0.017
ina+0.017
iece+0.016
 styles+0.016
Neg logit contributions
 sle-0.016
 spe-0.015
 shortcut-0.014
 grizz-0.014
 tow-0.014
Token are
Token ID389
Feature activation-2.71e+00

Pos logit contributions
 herself+0.180
ena+0.144
iece+0.133
 themselves+0.130
ina+0.128
Neg logit contributions
 sle-0.217
 spe-0.205
 shortcut-0.203
 nut-0.192
oc-0.191
Token colorful
Token ID20239
Feature activation-2.32e+00

Pos logit contributions
 herself+0.094
iece+0.078
ena+0.078
 themselves+0.077
 styles+0.074
Neg logit contributions
 sle-0.057
 spe-0.052
 shortcut-0.048
 grizz-0.046
 guarding-0.046
Token,
Token ID11
Feature activation-1.02e+00

Pos logit contributions
 herself+0.070
ena+0.061
iece+0.057
 themselves+0.056
era+0.053
Neg logit contributions
 sle-0.056
 spe-0.053
oc-0.047
 grizz-0.046
 nut-0.046
Token some
Token ID617
Feature activation-3.35e+00

Pos logit contributions
 herself+0.020
ina+0.018
ena+0.017
iece+0.016
 styles+0.015
Neg logit contributions
 spe-0.031
 sle-0.030
 shortcut-0.028
 gru-0.028
 nut-0.027
Token are
Token ID389
Feature activation-2.08e+00

Pos logit contributions
 herself+0.092
ena+0.078
 themselves+0.072
iece+0.071
ina+0.067
Neg logit contributions
 sle-0.093
 spe-0.085
 shortcut-0.083
 nut-0.079
 grizz-0.078
Token of
Token ID286
Feature activation+1.06e+00

Pos logit contributions
 herself+0.065
ena+0.058
 brushes+0.055
 styles+0.054
iece+0.051
Neg logit contributions
 sle-0.067
 shortcut-0.059
run-0.058
 spe-0.057
oc-0.057
Token what
Token ID644
Feature activation+1.33e-01

Pos logit contributions
 sle+0.030
 grizz+0.027
 nut+0.027
 poisonous+0.027
 shortcut+0.026
Neg logit contributions
 herself-0.024
ena-0.023
iece-0.022
 brushes-0.021
 styles-0.020
Token you
Token ID345
Feature activation+6.15e-01

Pos logit contributions
 sle+0.003
 shortcut+0.003
 nut+0.003
 spe+0.003
 hunter+0.003
Neg logit contributions
 herself-0.004
iece-0.004
ena-0.004
 themselves-0.003
 styles-0.003
Token eat
Token ID4483
Feature activation-1.08e+00

Pos logit contributions
 sle+0.016
 shortcut+0.014
 grizz+0.014
 spe+0.014
oc+0.014
Neg logit contributions
 herself-0.018
ena-0.015
 themselves-0.015
 shapes-0.014
 styles-0.014
Token.
Token ID13
Feature activation-6.09e-01

Pos logit contributions
 herself+0.029
ena+0.026
 brushes+0.024
iece+0.024
 styles+0.024
Neg logit contributions
 sle-0.028
oc-0.025
 shortcut-0.025
 nut-0.025
 spe-0.024
Token Some
Token ID2773
Feature activation-7.35e+00

Pos logit contributions
 herself+0.018
ena+0.015
 themselves+0.015
 shapes+0.014
 brushes+0.013
Neg logit contributions
 sle-0.017
 shortcut-0.015
 spe-0.015
 grizz-0.015
 gru-0.014
Token things
Token ID1243
Feature activation-2.23e+00

Pos logit contributions
 herself+0.203
ena+0.188
iece+0.171
 themselves+0.166
era+0.160
Neg logit contributions
 sle-0.190
 nut-0.174
 spe-0.172
oc-0.170
 shortcut-0.167
Token are
Token ID389
Feature activation-1.58e+00

Pos logit contributions
 herself+0.064
ena+0.055
 themselves+0.052
 shapes+0.051
 styles+0.050
Neg logit contributions
 sle-0.059
 shortcut-0.052
 spe-0.052
 grizz-0.051
 nut-0.049
Token not
Token ID407
Feature activation+2.52e+00

Pos logit contributions
 herself+0.046
ena+0.039
 themselves+0.037
iece+0.037
 shapes+0.036
Neg logit contributions
 sle-0.042
 spe-0.037
 grizz-0.036
 shortcut-0.036
 nut-0.035
Token good
Token ID922
Feature activation-4.74e-01

Pos logit contributions
 sle+0.065
 spe+0.059
 nut+0.058
 shortcut+0.057
 hunter+0.055
Neg logit contributions
 herself-0.071
ena-0.063
iece-0.060
 themselves-0.057
era-0.056
Token for
Token ID329
Feature activation-8.71e-01

Pos logit contributions
 herself+0.014
ena+0.013
 brushes+0.012
 themselves+0.011
 styles+0.011
Neg logit contributions
 sle-0.011
 spe-0.010
 shortcut-0.010
 grizz-0.010
 guarding-0.009
Token fall
Token ID2121
Feature activation+5.18e-02

Pos logit contributions
 herself+0.036
ena+0.031
 themselves+0.029
 shapes+0.029
 styles+0.028
Neg logit contributions
 sle-0.039
 spe-0.035
 shortcut-0.034
 grizz-0.033
oc-0.033
Token on
Token ID319
Feature activation-7.86e-02

Pos logit contributions
 sle+0.001
 spe+0.001
 shortcut+0.001
 grizz+0.001
 nut+0.001
Neg logit contributions
 herself-0.001
ena-0.001
 styles-0.001
 shapes-0.001
 themselves-0.001
Token the
Token ID262
Feature activation-1.07e+00

Pos logit contributions
 herself+0.002
ena+0.002
 styles+0.002
 brushes+0.002
iece+0.002
Neg logit contributions
 sle-0.002
 spe-0.002
 grizz-0.002
 tow-0.002
 shortcut-0.002
Token ground
Token ID2323
Feature activation-8.15e-01

Pos logit contributions
 herself+0.028
ena+0.024
 themselves+0.022
 styles+0.021
 shapes+0.020
Neg logit contributions
 sle-0.032
 spe-0.028
 shortcut-0.028
 nut-0.027
 grizz-0.027
Token.
Token ID13
Feature activation-6.06e-01

Pos logit contributions
 herself+0.023
ena+0.020
 styles+0.019
 brushes+0.018
 themselves+0.018
Neg logit contributions
 sle-0.021
 spe-0.019
 shortcut-0.018
 grizz-0.018
oc-0.018
Token Some
Token ID2773
Feature activation-7.35e+00

Pos logit contributions
 herself+0.018
ena+0.016
 styles+0.015
iece+0.015
 shapes+0.015
Neg logit contributions
 sle-0.014
 spe-0.013
 grizz-0.012
 tow-0.012
 gru-0.012
Token tomatoes
Token ID23972
Feature activation-1.47e+00

Pos logit contributions
 herself+0.200
ena+0.181
 themselves+0.158
 styles+0.156
ina+0.154
Neg logit contributions
 sle-0.190
 spe-0.183
 nut-0.174
 shortcut-0.167
use-0.167
Token break
Token ID2270
Feature activation-1.39e+00

Pos logit contributions
 herself+0.040
ena+0.034
 themselves+0.032
 shapes+0.032
 styles+0.031
Neg logit contributions
 sle-0.041
 spe-0.037
 shortcut-0.036
 grizz-0.035
 nut-0.035
Token.
Token ID13
Feature activation-6.71e-01

Pos logit contributions
 herself+0.037
ena+0.034
 styles+0.032
 rhythms+0.030
 brushes+0.030
Neg logit contributions
 sle-0.034
 spe-0.033
oc-0.030
run-0.030
 nut-0.030
Token They
Token ID1119
Feature activation-1.04e+00

Pos logit contributions
 herself+0.020
ena+0.017
 styles+0.016
 shapes+0.016
iece+0.015
Neg logit contributions
 sle-0.016
 spe-0.015
 grizz-0.015
oc-0.014
 gru-0.014
Token make
Token ID787
Feature activation-3.47e+00

Pos logit contributions
 herself+0.031
ena+0.027
 themselves+0.026
iece+0.026
 styles+0.025
Neg logit contributions
 sle-0.026
 spe-0.024
 nut-0.022
 grizz-0.021
 shortcut-0.021
Token things
Token ID1243
Feature activation-2.46e+00

Pos logit contributions
 herself+0.087
ena+0.068
 themselves+0.065
iece+0.062
ina+0.060
Neg logit contributions
 sle-0.122
 spe-0.112
 shortcut-0.107
 grizz-0.107
oc-0.104
Token on
Token ID319
Feature activation-5.83e-01

Pos logit contributions
 herself+0.064
ena+0.052
 themselves+0.051
 shapes+0.050
 styles+0.048
Neg logit contributions
 sle-0.074
 spe-0.065
 shortcut-0.065
 grizz-0.065
oc-0.062
Token the
Token ID262
Feature activation-2.27e+00

Pos logit contributions
 herself+0.015
ena+0.013
iece+0.012
 styles+0.012
 shapes+0.012
Neg logit contributions
 sle-0.017
 spe-0.015
 shortcut-0.015
 tow-0.014
 grizz-0.014
Token floor
Token ID4314
Feature activation-3.72e+00

Pos logit contributions
 herself+0.059
 themselves+0.046
ena+0.046
 styles+0.043
ina+0.042
Neg logit contributions
 sle-0.069
 shortcut-0.061
 spe-0.060
 grizz-0.058
 gru-0.058
Token.
Token ID13
Feature activation-9.22e-02

Pos logit contributions
 herself+0.104
ena+0.087
 styles+0.085
 themselves+0.084
 brushes+0.084
Neg logit contributions
 sle-0.090
 spe-0.081
 shortcut-0.080
 grizz-0.078
 guarding-0.078
Token Some
Token ID2773
Feature activation-7.35e+00

Pos logit contributions
 herself+0.003
ena+0.002
 themselves+0.002
 shapes+0.002
 styles+0.002
Neg logit contributions
 sle-0.002
 grizz-0.002
 spe-0.002
 shortcut-0.002
oc-0.002
Token are
Token ID389
Feature activation-2.68e+00

Pos logit contributions
 herself+0.197
ena+0.163
ina+0.155
iece+0.154
 themselves+0.150
Neg logit contributions
 sle-0.196
 spe-0.183
 shortcut-0.183
 tow-0.171
run-0.168
Token broken
Token ID5445
Feature activation-1.52e+00

Pos logit contributions
 herself+0.091
ena+0.077
iece+0.077
 themselves+0.076
 styles+0.074
Neg logit contributions
 sle-0.058
 spe-0.052
 shortcut-0.049
 grizz-0.048
 gru-0.047
Token.
Token ID13
Feature activation+1.28e-01

Pos logit contributions
 herself+0.044
ena+0.040
 styles+0.036
iece+0.036
era+0.036
Neg logit contributions
 sle-0.036
 spe-0.033
 grizz-0.030
 shortcut-0.030
 guarding-0.028
Token Some
Token ID2773
Feature activation-7.10e+00

Pos logit contributions
 sle+0.003
 grizz+0.003
 spe+0.003
 shortcut+0.003
oc+0.003
Neg logit contributions
 herself-0.004
ena-0.003
 themselves-0.003
 shapes-0.003
 styles-0.003
Token are
Token ID389
Feature activation-2.48e+00

Pos logit contributions
 herself+0.200
ena+0.173
iece+0.163
ina+0.162
era+0.155
Neg logit contributions
 sle-0.178
 spe-0.169
 shortcut-0.165
run-0.154
oc-0.154
Tokenmy
Token ID1820
Feature activation-1.20e+00

Pos logit contributions
 herself+0.006
 themselves+0.005
ena+0.005
 shapes+0.005
 styles+0.005
Neg logit contributions
 sle-0.007
 grizz-0.006
 shortcut-0.006
oc-0.006
 spe-0.006
Token decided
Token ID3066
Feature activation+1.32e+00

Pos logit contributions
 herself+0.033
 themselves+0.027
ena+0.027
 shapes+0.026
 styles+0.026
Neg logit contributions
 sle-0.033
oc-0.030
 grizz-0.030
 polar-0.029
 shortcut-0.028
Token to
Token ID284
Feature activation+1.81e+00

Pos logit contributions
 sle+0.031
 shortcut+0.028
 spe+0.027
 grizz+0.027
 gru+0.026
Neg logit contributions
 herself-0.041
ena-0.036
 themselves-0.035
 shapes-0.034
 styles-0.033
Token take
Token ID1011
Feature activation+3.32e+00

Pos logit contributions
 sle+0.039
 shortcut+0.035
 tow+0.033
 nut+0.033
 spe+0.033
Neg logit contributions
 herself-0.059
ena-0.053
iece-0.051
 themselves-0.050
 shapes-0.049
Token a
Token ID257
Feature activation+1.81e+00

Pos logit contributions
 sle+0.104
 shortcut+0.093
 polar+0.093
 grizz+0.092
 Roger+0.091
Neg logit contributions
 herself-0.073
ena-0.066
 shapes-0.062
iece-0.061
 brushes-0.061
Token walk
Token ID2513
Feature activation+4.74e+00

Pos logit contributions
oc+0.048
 sle+0.047
 spe+0.045
 grizz+0.045
 gru+0.044
Neg logit contributions
 herself-0.042
 clothes-0.038
 brushes-0.038
 when-0.035
 hair-0.035
Token in
Token ID287
Feature activation+1.12e+00

Pos logit contributions
 sle+0.142
 shortcut+0.125
 grizz+0.124
 spe+0.124
oc+0.122
Neg logit contributions
 herself-0.121
ena-0.102
 themselves-0.097
 shapes-0.096
 brushes-0.095
Token the
Token ID262
Feature activation+4.38e-02

Pos logit contributions
 sle+0.041
 grizz+0.039
 spe+0.038
oc+0.037
 shortcut+0.037
Neg logit contributions
 herself-0.023
 themselves-0.017
ena-0.015
 shapes-0.015
 her-0.015
Token woods
Token ID16479
Feature activation+3.53e-01

Pos logit contributions
 sle+0.001
 grizz+0.001
 shortcut+0.001
 spe+0.001
 gru+0.001
Neg logit contributions
 herself-0.001
ena-0.001
 themselves-0.001
 shapes-0.001
 styles-0.001
Token.
Token ID13
Feature activation+8.13e-01

Pos logit contributions
 sle+0.010
 shortcut+0.009
 grizz+0.009
 spe+0.009
 gru+0.009
Neg logit contributions
 herself-0.009
ena-0.008
 brushes-0.007
 shapes-0.007
 themselves-0.007
Token He
Token ID679
Feature activation+8.60e-02

Pos logit contributions
 sle+0.018
 spe+0.016
 grizz+0.015
use+0.014
 shortcut+0.014
Neg logit contributions
 herself-0.028
 themselves-0.024
 shapes-0.024
ena-0.023
 brushes-0.023
Token there
Token ID612
Feature activation+1.31e+00

Pos logit contributions
 sle+0.025
 grizz+0.022
 shortcut+0.022
 spe+0.021
oc+0.021
Neg logit contributions
 herself-0.023
ena-0.019
 themselves-0.019
 shapes-0.018
 when-0.017
Token was
Token ID373
Feature activation-1.02e+00

Pos logit contributions
 sle+0.043
 spe+0.038
 grizz+0.038
 shortcut+0.038
oc+0.037
Neg logit contributions
 herself-0.031
ena-0.025
 themselves-0.024
 shapes-0.023
 brushes-0.022
Token a
Token ID257
Feature activation+1.43e+00

Pos logit contributions
 herself+0.020
 styles+0.017
ena+0.016
iece+0.016
 shapes+0.015
Neg logit contributions
 sle-0.034
 spe-0.032
 nut-0.031
 grizz-0.031
 polar-0.031
Token little
Token ID1310
Feature activation+1.85e+00

Pos logit contributions
run+0.045
 sle+0.044
 shortcut+0.042
use+0.042
|+0.042
Neg logit contributions
 herself-0.028
ena-0.026
 shapes-0.026
 when-0.024
 hair-0.024
Token girl
Token ID2576
Feature activation+2.23e+00

Pos logit contributions
 sle+0.055
 shortcut+0.051
 grizz+0.050
 gru+0.049
run+0.049
Neg logit contributions
 herself-0.045
 shapes-0.037
ena-0.036
 themselves-0.035
 brushes-0.035
Token named
Token ID3706
Feature activation+4.54e+00

Pos logit contributions
 sle+0.048
 grizz+0.040
 spe+0.040
 shortcut+0.040
oc+0.038
Neg logit contributions
 herself-0.077
ena-0.067
 themselves-0.066
 shapes-0.064
 brushes-0.063
Token Lily
Token ID20037
Feature activation+1.88e+00

Pos logit contributions
 sle+0.141
 spe+0.125
 grizz+0.125
 shortcut+0.124
oc+0.120
Neg logit contributions
 herself-0.114
ena-0.092
 themselves-0.091
 shapes-0.088
 styles-0.085
Token.
Token ID13
Feature activation+8.58e-01

Pos logit contributions
 sle+0.057
 spe+0.051
oc+0.051
 grizz+0.050
 shortcut+0.050
Neg logit contributions
 herself-0.047
ena-0.037
 themselves-0.037
 shapes-0.037
 brushes-0.035
Token She
Token ID1375
Feature activation+1.71e+00

Pos logit contributions
 sle+0.029
 shortcut+0.026
 spe+0.026
 grizz+0.026
 gru+0.025
Neg logit contributions
 herself-0.020
ena-0.015
 themselves-0.015
 shapes-0.015
 brushes-0.014
Token loved
Token ID6151
Feature activation-1.12e+00

Pos logit contributions
 sle+0.059
 grizz+0.053
 spe+0.053
 shortcut+0.053
 gru+0.051
Neg logit contributions
 herself-0.038
ena-0.029
 themselves-0.028
 shapes-0.028
 brushes-0.026
Token music
Token ID2647
Feature activation+4.01e-02

Pos logit contributions
 herself+0.034
ena+0.030
 themselves+0.029
iece+0.028
 shapes+0.027
Neg logit contributions
 sle-0.029
 spe-0.025
 grizz-0.024
 shortcut-0.024
 nut-0.023
Token,
Token ID11
Feature activation+1.06e+00

Pos logit contributions
 sle+0.034
 spe+0.030
 grizz+0.030
 shortcut+0.030
oc+0.029
Neg logit contributions
 herself-0.032
ena-0.027
 themselves-0.026
 shapes-0.025
 brushes-0.025
Token there
Token ID612
Feature activation+1.46e+00

Pos logit contributions
 sle+0.026
 spe+0.022
 grizz+0.022
 shortcut+0.022
oc+0.021
Neg logit contributions
 herself-0.034
ena-0.029
 themselves-0.028
 shapes-0.028
 styles-0.027
Token was
Token ID373
Feature activation-1.01e+00

Pos logit contributions
 sle+0.032
oc+0.031
 tow+0.029
 spe+0.029
 polar+0.029
Neg logit contributions
 herself-0.043
 styles-0.038
 brushes-0.037
ena-0.036
 shapes-0.036
Token a
Token ID257
Feature activation+1.60e+00

Pos logit contributions
 herself+0.022
 styles+0.020
iece+0.018
ena+0.018
ups+0.018
Neg logit contributions
 sle-0.030
 spe-0.029
 nut-0.028
 grizz-0.028
 polar-0.028
Token girl
Token ID2576
Feature activation+2.46e+00

Pos logit contributions
run+0.048
 sle+0.047
 shortcut+0.044
use+0.044
|+0.044
Neg logit contributions
 herself-0.035
ena-0.031
 shapes-0.031
 hair-0.029
 when-0.029
Token named
Token ID3706
Feature activation+4.59e+00

Pos logit contributions
 sle+0.050
 spe+0.042
 grizz+0.041
 shortcut+0.040
oc+0.040
Neg logit contributions
 herself-0.088
ena-0.075
 themselves-0.075
 shapes-0.074
 brushes-0.072
Token Pip
Token ID25149
Feature activation+5.02e+00

Pos logit contributions
 sle+0.123
 spe+0.111
 grizz+0.109
 shortcut+0.108
|+0.105
Neg logit contributions
 herself-0.122
ena-0.104
 themselves-0.104
 shapes-0.103
 styles-0.102
Token.
Token ID13
Feature activation+1.26e+00

Pos logit contributions
 sle+0.143
oc+0.128
 spe+0.126
 grizz+0.125
 nut+0.123
Neg logit contributions
 herself-0.129
 themselves-0.106
ena-0.105
 brushes-0.105
 shapes-0.104
Token She
Token ID1375
Feature activation+1.78e+00

Pos logit contributions
 sle+0.043
 spe+0.039
 shortcut+0.038
 gru+0.038
 grizz+0.038
Neg logit contributions
 herself-0.031
 themselves-0.022
 shapes-0.020
 when-0.020
ena-0.020
Token was
Token ID373
Feature activation-5.10e-01

Pos logit contributions
 sle+0.055
 spe+0.049
 grizz+0.048
 shortcut+0.048
oc+0.047
Neg logit contributions
 herself-0.046
ena-0.037
 themselves-0.036
 shapes-0.035
 brushes-0.034
Token only
Token ID691
Feature activation+9.54e-01

Pos logit contributions
 herself+0.012
 themselves+0.010
ena+0.010
iece+0.010
 shapes+0.010
Neg logit contributions
 sle-0.016
 spe-0.014
 grizz-0.014
 polar-0.013
 shortcut-0.013
Token there
Token ID612
Feature activation+1.51e+00

Pos logit contributions
 sle+0.033
 grizz+0.030
 spe+0.028
 shortcut+0.028
oc+0.027
Neg logit contributions
 herself-0.025
 themselves-0.019
ena-0.019
 when-0.019
 shapes-0.018
Token was
Token ID373
Feature activation-6.43e-01

Pos logit contributions
 sle+0.058
 grizz+0.052
 gru+0.051
 shortcut+0.051
 spe+0.051
Neg logit contributions
 herself-0.030
ena-0.022
 themselves-0.022
 shapes-0.021
 when-0.019
Token a
Token ID257
Feature activation+1.69e+00

Pos logit contributions
 herself+0.013
 styles+0.011
ena+0.010
iece+0.010
 shapes+0.010
Neg logit contributions
 sle-0.021
 spe-0.020
 nut-0.019
 grizz-0.019
 polar-0.019
Token little
Token ID1310
Feature activation+1.86e+00

Pos logit contributions
run+0.053
 sle+0.052
|+0.050
 shortcut+0.050
use+0.050
Neg logit contributions
 herself-0.033
ena-0.031
 shapes-0.030
 when-0.028
 hair-0.028
Token girl
Token ID2576
Feature activation+2.45e+00

Pos logit contributions
 sle+0.057
 shortcut+0.053
 grizz+0.051
run+0.051
 gru+0.051
Neg logit contributions
 herself-0.043
 shapes-0.035
ena-0.035
 themselves-0.033
 brushes-0.033
Token named
Token ID3706
Feature activation+4.66e+00

Pos logit contributions
 sle+0.056
 shortcut+0.048
 grizz+0.047
 spe+0.047
 gru+0.045
Neg logit contributions
 herself-0.082
ena-0.071
 themselves-0.068
 shapes-0.067
 brushes-0.065
Token Lily
Token ID20037
Feature activation+1.85e+00

Pos logit contributions
 sle+0.149
 spe+0.133
 grizz+0.133
 shortcut+0.132
oc+0.128
Neg logit contributions
 herself-0.115
ena-0.092
 themselves-0.090
 shapes-0.087
 brushes-0.084
Token.
Token ID13
Feature activation+9.11e-01

Pos logit contributions
 sle+0.060
 spe+0.054
 grizz+0.054
 shortcut+0.053
oc+0.052
Neg logit contributions
 herself-0.044
ena-0.035
 themselves-0.034
 shapes-0.033
 brushes-0.032
Token She
Token ID1375
Feature activation+1.54e+00

Pos logit contributions
 sle+0.030
 spe+0.027
 grizz+0.027
 shortcut+0.027
oc+0.026
Neg logit contributions
 herself-0.021
ena-0.017
 themselves-0.016
 shapes-0.016
 brushes-0.015
Token loved
Token ID6151
Feature activation-1.06e+00

Pos logit contributions
 sle+0.051
 spe+0.046
 grizz+0.045
 shortcut+0.045
oc+0.044
Neg logit contributions
 herself-0.035
ena-0.028
 themselves-0.028
 shapes-0.027
 brushes-0.025
Token to
Token ID284
Feature activation+1.94e+00

Pos logit contributions
 herself+0.033
ena+0.029
iece+0.028
 themselves+0.028
 shapes+0.026
Neg logit contributions
 sle-0.027
 spe-0.023
 grizz-0.021
 shortcut-0.021
 nut-0.021
TokenThen
Token ID6423
Feature activation-2.51e+00

Pos logit contributions
 sle+0.034
 grizz+0.030
 spe+0.030
 shortcut+0.030
oc+0.029
Neg logit contributions
 herself-0.030
ena-0.025
 themselves-0.024
 shapes-0.024
 brushes-0.023
Token he
Token ID339
Feature activation+2.97e-01

Pos logit contributions
 herself+0.081
ena+0.071
 styles+0.068
 shapes+0.067
 themselves+0.067
Neg logit contributions
 sle-0.059
 grizz-0.052
 shortcut-0.051
 hunter-0.050
 gru-0.050
Token heard
Token ID2982
Feature activation+2.78e-02

Pos logit contributions
 sle+0.007
 grizz+0.007
 shortcut+0.006
 spe+0.006
 gru+0.006
Neg logit contributions
 herself-0.010
 themselves-0.008
 shapes-0.008
 styles-0.008
ena-0.007
Token a
Token ID257
Feature activation+3.83e-01

Pos logit contributions
 sle+0.001
 grizz+0.001
 gru+0.001
 shortcut+0.001
 hunter+0.001
Neg logit contributions
 herself-0.001
 styles-0.001
ena-0.001
 brushes-0.000
 themselves-0.000
Token voice
Token ID3809
Feature activation+3.10e+00

Pos logit contributions
 sle+0.012
 spe+0.010
|+0.010
 grizz+0.010
 shortcut+0.010
Neg logit contributions
 herself-0.009
 shapes-0.008
ena-0.007
 themselves-0.007
 brushes-0.007
Token from
Token ID422
Feature activation+4.81e+00

Pos logit contributions
 sle+0.099
 spe+0.090
 shortcut+0.088
 grizz+0.088
oc+0.086
Neg logit contributions
 herself-0.076
ena-0.060
 themselves-0.059
 shapes-0.055
iece-0.054
Token behind
Token ID2157
Feature activation+1.87e+00

Pos logit contributions
 sle+0.147
 spe+0.131
oc+0.131
 shortcut+0.130
 grizz+0.129
Neg logit contributions
 herself-0.127
 themselves-0.101
ena-0.099
 shapes-0.093
 brushes-0.088
Token him
Token ID683
Feature activation-4.75e-01

Pos logit contributions
 sle+0.071
 spe+0.065
 shortcut+0.065
 grizz+0.062
oc+0.062
Neg logit contributions
 herself-0.038
 themselves-0.028
ena-0.025
 her-0.024
 shapes-0.022
Token.
Token ID13
Feature activation+1.20e+00

Pos logit contributions
 herself+0.012
ena+0.010
 themselves+0.010
 shapes+0.009
 styles+0.009
Neg logit contributions
 sle-0.014
 grizz-0.013
 spe-0.013
 shortcut-0.012
oc-0.012
Token "
Token ID366
Feature activation-2.18e+00

Pos logit contributions
 sle+0.037
 spe+0.033
 shortcut+0.032
|+0.031
 polar+0.031
Neg logit contributions
 herself-0.031
 themselves-0.025
ena-0.024
 shapes-0.023
 when-0.023
TokenMaybe
Token ID13300
Feature activation+1.64e+00

Pos logit contributions
 herself+0.063
ena+0.052
 themselves+0.051
 shapes+0.051
 brushes+0.049
Neg logit contributions
 sle-0.059
 grizz-0.052
 spe-0.052
 shortcut-0.052
oc-0.050
Token.
Token ID13
Feature activation-3.99e-02

Pos logit contributions
 herself+0.010
ena+0.008
 themselves+0.008
 shapes+0.008
 styles+0.008
Neg logit contributions
 sle-0.011
 spe-0.010
 grizz-0.009
 shortcut-0.009
oc-0.009
Token
Token ID198
Feature activation-1.80e-02

Pos logit contributions
 herself+0.001
ena+0.001
 themselves+0.001
 shapes+0.001
 hair+0.001
Neg logit contributions
 sle-0.001
 shortcut-0.001
 spe-0.001
 grizz-0.001
 feast-0.001
Token
Token ID198
Feature activation+7.36e-01

Pos logit contributions
 herself+0.000
 themselves+0.000
ena+0.000
 shapes+0.000
 brushes+0.000
Neg logit contributions
 sle-0.001
 spe-0.001
 shortcut-0.001
 grizz-0.001
 gru-0.001
Token"
Token ID1
Feature activation-2.00e+00

Pos logit contributions
 sle+0.023
 spe+0.021
 grizz+0.021
 shortcut+0.020
 feast+0.020
Neg logit contributions
 herself-0.019
ena-0.015
 themselves-0.015
 shapes-0.014
 brushes-0.014
TokenHello
Token ID15496
Feature activation+2.00e+00

Pos logit contributions
 herself+0.062
ena+0.052
 themselves+0.051
 shapes+0.050
 brushes+0.048
Neg logit contributions
 sle-0.052
 spe-0.045
 grizz-0.044
 shortcut-0.044
oc-0.043
Token,
Token ID11
Feature activation+1.28e+00

Pos logit contributions
 sle+0.068
 grizz+0.061
 gru+0.061
 shortcut+0.061
 Amen+0.061
Neg logit contributions
 herself-0.041
ena-0.036
 shapes-0.034
 themselves-0.033
 when-0.032
Token kids
Token ID3988
Feature activation-4.09e-01

Pos logit contributions
|+0.047
 sle+0.046
 spe+0.046
 Amen+0.045
 grizz+0.044
Neg logit contributions
 herself-0.021
ena-0.018
 shapes-0.016
 two-0.015
 themselves-0.015
Token,
Token ID11
Feature activation+5.03e-01

Pos logit contributions
 herself+0.011
ena+0.010
 themselves+0.009
 shapes+0.009
 when+0.008
Neg logit contributions
 sle-0.011
 shortcut-0.010
 grizz-0.010
 spe-0.010
 gru-0.010
Token I
Token ID314
Feature activation+2.02e+00

Pos logit contributions
 sle+0.015
 grizz+0.014
 spe+0.014
oc+0.014
 gru+0.014
Neg logit contributions
 herself-0.012
ena-0.010
 themselves-0.010
 shapes-0.010
 when-0.010
Token am
Token ID716
Feature activation-3.65e-01

Pos logit contributions
 sle+0.054
 grizz+0.045
 spe+0.045
 shortcut+0.045
 gru+0.044
Neg logit contributions
 herself-0.060
ena-0.052
 themselves-0.049
 shapes-0.048
 brushes-0.047
Token here
Token ID994
Feature activation+2.36e+00

Pos logit contributions
 herself+0.009
ena+0.007
 themselves+0.007
 shapes+0.007
 hair+0.006
Neg logit contributions
 sle-0.011
 shortcut-0.010
 spe-0.010
 grizz-0.010
oc-0.010
Token when
Token ID618
Feature activation-2.82e-01

Pos logit contributions
 sle+0.016
 shortcut+0.014
 spe+0.014
 guarding+0.013
 grizz+0.013
Neg logit contributions
 herself-0.014
ena-0.014
iece-0.013
 brushes-0.013
 themselves-0.012
Token you
Token ID345
Feature activation-3.75e-01

Pos logit contributions
 herself+0.007
ena+0.007
iece+0.007
 brushes+0.006
 styles+0.006
Neg logit contributions
 sle-0.007
 shortcut-0.007
 spe-0.007
 guarding-0.006
 hunter-0.006
Token lose
Token ID4425
Feature activation-3.90e-01

Pos logit contributions
 herself+0.012
ena+0.010
 themselves+0.010
 shapes+0.009
 styles+0.009
Neg logit contributions
 sle-0.009
 grizz-0.008
 shortcut-0.008
 spe-0.008
oc-0.008
Token something
Token ID1223
Feature activation+4.34e-01

Pos logit contributions
 herself+0.010
ena+0.009
iece+0.008
 shapes+0.008
 themselves+0.008
Neg logit contributions
 sle-0.011
 shortcut-0.010
 grizz-0.010
 spe-0.009
 gru-0.009
Token,
Token ID11
Feature activation-8.59e-01

Pos logit contributions
 sle+0.013
 grizz+0.011
 spe+0.011
oc+0.011
 shortcut+0.011
Neg logit contributions
 herself-0.012
 themselves-0.009
 shapes-0.009
ena-0.009
 brushes-0.009
Token it
Token ID340
Feature activation+6.62e-01

Pos logit contributions
 herself+0.021
ena+0.021
iece+0.019
 styles+0.018
 brushes+0.018
Neg logit contributions
 sle-0.021
 spe-0.020
 shortcut-0.020
run-0.018
 Nice-0.018
Token's
Token ID338
Feature activation-3.11e+00

Pos logit contributions
 sle+0.019
 spe+0.017
 grizz+0.017
 shortcut+0.017
oc+0.016
Neg logit contributions
 herself-0.018
ena-0.015
 themselves-0.014
 shapes-0.014
 brushes-0.014
Token important
Token ID1593
Feature activation-1.24e+00

Pos logit contributions
 herself+0.087
ena+0.075
 themselves+0.071
 shapes+0.071
 brushes+0.071
Neg logit contributions
 sle-0.083
 shortcut-0.074
 grizz-0.074
 spe-0.073
 gru-0.071
Token to
Token ID284
Feature activation+1.24e+00

Pos logit contributions
 herself+0.040
ena+0.036
iece+0.035
 themselves+0.033
 brushes+0.033
Neg logit contributions
 sle-0.025
 shortcut-0.025
run-0.022
 nut-0.022
 spe-0.022
Token keep
Token ID1394
Feature activation+5.76e-01

Pos logit contributions
 sle+0.030
 shortcut+0.028
 nut+0.026
 grizz+0.026
 hunter+0.026
Neg logit contributions
 herself-0.034
ena-0.031
iece-0.030
 brushes-0.030
 shapes-0.030
Token looking
Token ID2045
Feature activation+2.32e+00

Pos logit contributions
 sle+0.016
 shortcut+0.015
 grizz+0.014
 Roger+0.014
 gru+0.014
Neg logit contributions
 herself-0.013
ena-0.013
iece-0.012
 brushes-0.012
era-0.012
Token he
Token ID339
Feature activation+2.75e+00

Pos logit contributions
 sle+0.073
 grizz+0.063
 shortcut+0.061
 spe+0.061
oc+0.061
Neg logit contributions
 herself-0.079
 themselves-0.064
ena-0.062
 shapes-0.062
 when-0.058
Token filled
Token ID5901
Feature activation-1.35e+00

Pos logit contributions
 sle+0.073
 spe+0.070
oc+0.068
 grizz+0.068
 polar+0.062
Neg logit contributions
 herself-0.080
 themselves-0.067
iece-0.063
 shapes-0.061
ena-0.060
Token Bob
Token ID5811
Feature activation-5.51e-01

Pos logit contributions
 herself+0.038
ena+0.035
 brushes+0.034
 styles+0.033
ina+0.033
Neg logit contributions
 sle-0.032
oc-0.029
 nut-0.029
 grizz-0.029
 gru-0.028
Token's
Token ID338
Feature activation-2.84e+00

Pos logit contributions
 herself+0.015
ena+0.014
 themselves+0.013
 brushes+0.012
 styles+0.012
Neg logit contributions
 sle-0.014
oc-0.013
 nut-0.012
 spe-0.012
 victorious-0.011
Token bag
Token ID6131
Feature activation+1.82e+00

Pos logit contributions
 herself+0.074
ena+0.064
 themselves+0.060
ina+0.055
iece+0.055
Neg logit contributions
 sle-0.083
 spe-0.073
 gru-0.072
oc-0.071
 grizz-0.071
Token with
Token ID351
Feature activation+4.64e-01

Pos logit contributions
 sle+0.058
 spe+0.051
 grizz+0.051
 shortcut+0.051
oc+0.049
Neg logit contributions
 herself-0.044
ena-0.035
 themselves-0.035
 shapes-0.034
 brushes-0.032
Token blue
Token ID4171
Feature activation-2.83e+00

Pos logit contributions
 sle+0.013
 spe+0.012
 tow+0.011
 nut+0.011
 polar+0.011
Neg logit contributions
ena-0.012
 herself-0.012
era-0.010
iece-0.010
ina-0.010
Token ink
Token ID16882
Feature activation-5.94e-01

Pos logit contributions
 herself+0.070
ena+0.070
 themselves+0.063
era+0.058
iece+0.057
Neg logit contributions
 sle-0.072
 spe-0.069
 grizz-0.064
oc-0.064
 freshly-0.063
Token.
Token ID13
Feature activation+3.95e-01

Pos logit contributions
 herself+0.017
ena+0.016
 themselves+0.015
 brushes+0.014
iece+0.014
Neg logit contributions
 sle-0.015
 spe-0.013
 nut-0.013
 trove-0.013
 grizz-0.013
Token
Token ID220
Feature activation-1.28e+00

Pos logit contributions
 sle+0.010
 spe+0.008
 shortcut+0.008
 grizz+0.008
oc+0.008
Neg logit contributions
 herself-0.013
 themselves-0.011
ena-0.011
 shapes-0.010
 brushes-0.010
Token
Token ID198
Feature activation+2.84e-01

Pos logit contributions
 herself+0.038
 themselves+0.031
ena+0.031
 shapes+0.030
 brushes+0.028
Neg logit contributions
 sle-0.035
 spe-0.031
 grizz-0.031
 shortcut-0.030
 gru-0.029
Token,
Token ID11
Feature activation+8.48e-01

Pos logit contributions
 sle+0.035
 spe+0.031
 grizz+0.031
 hunter+0.030
 shortcut+0.030
Neg logit contributions
 herself-0.017
 when-0.012
ena-0.012
 themselves-0.011
 her-0.011
Token there
Token ID612
Feature activation+1.44e+00

Pos logit contributions
 sle+0.028
 grizz+0.025
 shortcut+0.024
 spe+0.024
oc+0.023
Neg logit contributions
 herself-0.022
ena-0.017
 themselves-0.017
 when-0.016
 shapes-0.016
Token was
Token ID373
Feature activation-9.67e-01

Pos logit contributions
 sle+0.053
 grizz+0.047
 shortcut+0.047
 spe+0.047
 gru+0.046
Neg logit contributions
 herself-0.030
ena-0.023
 themselves-0.022
 shapes-0.022
 brushes-0.020
Token a
Token ID257
Feature activation+1.52e+00

Pos logit contributions
 herself+0.019
 styles+0.016
ena+0.016
iece+0.015
ups+0.015
Neg logit contributions
 sle-0.032
 spe-0.030
 nut-0.029
 grizz-0.029
 polar-0.029
Token cater
Token ID20825
Feature activation+3.48e+00

Pos logit contributions
run+0.047
 sle+0.047
|+0.044
 shortcut+0.044
use+0.044
Neg logit contributions
 herself-0.030
ena-0.028
 shapes-0.028
 when-0.026
 hair-0.026
Tokenpillar
Token ID41643
Feature activation+1.14e+00

Pos logit contributions
 sle+0.084
 shortcut+0.074
 spe+0.070
 grizz+0.069
 gru+0.068
Neg logit contributions
 herself-0.111
ena-0.094
 shapes-0.090
ina-0.087
 themselves-0.086
Token named
Token ID3706
Feature activation+4.66e+00

Pos logit contributions
 sle+0.029
 spe+0.025
 grizz+0.025
 shortcut+0.025
oc+0.024
Neg logit contributions
 herself-0.035
ena-0.029
 themselves-0.029
 shapes-0.028
 brushes-0.028
Token Cindy
Token ID39062
Feature activation+4.61e+00

Pos logit contributions
 sle+0.110
 grizz+0.096
 spe+0.095
 shortcut+0.093
oc+0.089
Neg logit contributions
 herself-0.149
ena-0.128
 themselves-0.126
 shapes-0.123
 styles-0.120
Token.
Token ID13
Feature activation+9.67e-01

Pos logit contributions
 sle+0.151
 grizz+0.136
 spe+0.135
 shortcut+0.135
 gru+0.130
Neg logit contributions
 herself-0.108
ena-0.085
 themselves-0.083
 shapes-0.081
 brushes-0.077
Token She
Token ID1375
Feature activation+1.36e+00

Pos logit contributions
 sle+0.033
 gru+0.031
oc+0.031
run+0.030
 grizz+0.030
Neg logit contributions
 herself-0.025
 her-0.016
 themselves-0.016
 hair-0.015
 styles-0.015
Token loved
Token ID6151
Feature activation-1.05e+00

Pos logit contributions
 sle+0.040
 spe+0.036
 grizz+0.036
 shortcut+0.035
oc+0.035
Neg logit contributions
 herself-0.035
ena-0.029
 themselves-0.029
 shapes-0.027
 styles-0.027
Token what
Token ID644
Feature activation+8.62e-01

Pos logit contributions
 herself+0.060
ena+0.054
 styles+0.050
iece+0.050
 brushes+0.050
Neg logit contributions
 sle-0.062
 grizz-0.055
 spe-0.054
 gru-0.054
 nut-0.053
Token his
Token ID465
Feature activation-3.58e-01

Pos logit contributions
 sle+0.019
 spe+0.017
 gru+0.017
 grizz+0.017
 shortcut+0.017
Neg logit contributions
 herself-0.028
ena-0.024
 themselves-0.024
iece-0.024
 styles-0.023
Token mom
Token ID1995
Feature activation+3.03e-01

Pos logit contributions
 herself+0.009
iece+0.007
 themselves+0.007
ena+0.007
 shapes+0.007
Neg logit contributions
 sle-0.011
 grizz-0.010
 shortcut-0.010
 nut-0.010
 spe-0.010
Token had
Token ID550
Feature activation+1.84e+00

Pos logit contributions
 sle+0.009
 grizz+0.008
 spe+0.008
oc+0.008
 shortcut+0.007
Neg logit contributions
 herself-0.008
 themselves-0.007
ena-0.006
 shapes-0.006
 styles-0.006
Token told
Token ID1297
Feature activation+1.06e+00

Pos logit contributions
 sle+0.047
 shortcut+0.043
use+0.043
 grizz+0.042
 spe+0.041
Neg logit contributions
 herself-0.048
 themselves-0.043
ena-0.042
 shapes-0.040
 styles-0.040
Token him
Token ID683
Feature activation+5.82e-01

Pos logit contributions
 sle+0.025
 shortcut+0.022
 grizz+0.022
 Roger+0.022
 gru+0.021
Neg logit contributions
 herself-0.028
ena-0.027
 brushes-0.027
 styles-0.026
 shapes-0.026
Token.
Token ID13
Feature activation+6.13e-01

Pos logit contributions
 sle+0.014
 shortcut+0.013
 grizz+0.012
 gru+0.012
run+0.012
Neg logit contributions
 herself-0.016
ena-0.015
 styles-0.014
 brushes-0.014
iece-0.014
Token She
Token ID1375
Feature activation+5.81e-01

Pos logit contributions
 sle+0.014
 grizz+0.012
 spe+0.012
 shortcut+0.012
oc+0.012
Neg logit contributions
 herself-0.020
ena-0.017
 themselves-0.017
 shapes-0.017
 styles-0.016
Token said
Token ID531
Feature activation+1.76e+00

Pos logit contributions
 sle+0.016
 grizz+0.015
 spe+0.015
oc+0.014
 hunter+0.014
Neg logit contributions
 herself-0.016
 themselves-0.013
iece-0.013
 shapes-0.013
 styles-0.013
Token real
Token ID1103
Feature activation+1.26e+00

Pos logit contributions
 sle+0.042
 spe+0.039
 grizz+0.038
 gru+0.037
 nut+0.036
Neg logit contributions
 herself-0.052
ena-0.048
 styles-0.046
 shapes-0.045
 brushes-0.044
Token knives
Token ID23110
Feature activation-7.07e-01

Pos logit contributions
 sle+0.035
 grizz+0.031
 polar+0.031
 hunter+0.030
 nut+0.029
Neg logit contributions
 herself-0.036
iece-0.032
 themselves-0.030
ena-0.029
 styles-0.026
Token meet
Token ID1826
Feature activation+2.71e+00

Pos logit contributions
 sle+0.050
 shortcut+0.043
 nut+0.043
 spe+0.042
 tow+0.041
Neg logit contributions
 herself-0.061
ena-0.055
 themselves-0.053
iece-0.053
 shapes-0.052
Token a
Token ID257
Feature activation+1.73e+00

Pos logit contributions
 sle+0.089
 grizz+0.079
 shortcut+0.078
 spe+0.078
oc+0.076
Neg logit contributions
 herself-0.063
ena-0.050
 themselves-0.049
 shapes-0.048
 brushes-0.046
Token new
Token ID649
Feature activation+1.05e+00

Pos logit contributions
 sle+0.045
run+0.043
 grizz+0.041
 gru+0.040
use+0.040
Neg logit contributions
 herself-0.044
 shapes-0.040
ena-0.039
 when-0.038
 hair-0.037
Token friend
Token ID1545
Feature activation+1.03e+00

Pos logit contributions
 sle+0.029
 shortcut+0.025
 grizz+0.025
 spe+0.025
oc+0.024
Neg logit contributions
 herself-0.030
 themselves-0.025
ena-0.025
 shapes-0.024
 brushes-0.024
Token.
Token ID13
Feature activation+7.35e-01

Pos logit contributions
 sle+0.028
 shortcut+0.025
 spe+0.024
 grizz+0.024
oc+0.024
Neg logit contributions
 herself-0.028
 themselves-0.023
 shapes-0.023
ena-0.023
 brushes-0.023
Token One
Token ID1881
Feature activation-1.72e+00

Pos logit contributions
 sle+0.021
 spe+0.019
 grizz+0.019
 shortcut+0.018
 gru+0.018
Neg logit contributions
 herself-0.021
ena-0.017
 themselves-0.017
 shapes-0.017
 brushes-0.016
Token day
Token ID1110
Feature activation+8.82e-02

Pos logit contributions
 herself+0.041
ena+0.033
 themselves+0.031
 shapes+0.031
 brushes+0.029
Neg logit contributions
 sle-0.056
 grizz-0.050
 spe-0.050
 shortcut-0.050
 gru-0.048
Token,
Token ID11
Feature activation-6.08e-01

Pos logit contributions
 sle+0.003
 grizz+0.002
 spe+0.002
 shortcut+0.002
oc+0.002
Neg logit contributions
 herself-0.002
ena-0.002
 themselves-0.002
 shapes-0.002
 brushes-0.002
Token Tom
Token ID4186
Feature activation-1.11e+00

Pos logit contributions
 herself+0.016
ena+0.013
 themselves+0.013
 styles+0.013
ina+0.013
Neg logit contributions
 sle-0.016
 shortcut-0.015
 spe-0.015
 nut-0.015
 tow-0.014
Token went
Token ID1816
Feature activation+3.37e+00

Pos logit contributions
 herself+0.028
ena+0.023
 themselves+0.022
 shapes+0.022
 brushes+0.021
Neg logit contributions
 sle-0.034
 spe-0.030
 grizz-0.030
 shortcut-0.030
 gru-0.029
Token to
Token ID284
Feature activation+3.40e-01

Pos logit contributions
 sle+0.100
 spe+0.090
 grizz+0.088
oc+0.087
 shortcut+0.085
Neg logit contributions
 herself-0.091
ena-0.073
 themselves-0.072
 shapes-0.070
 brushes-0.068
Tokenia
Token ID544
Feature activation-1.76e+00

Pos logit contributions
 herself+0.035
 themselves+0.028
ena+0.027
 shapes+0.027
 styles+0.026
Neg logit contributions
 sle-0.041
 grizz-0.036
 spe-0.036
 shortcut-0.035
oc-0.035
Token opens
Token ID9808
Feature activation+6.69e-01

Pos logit contributions
 herself+0.050
 themselves+0.042
ena+0.040
 shapes+0.040
iece+0.039
Neg logit contributions
 sle-0.047
 grizz-0.041
 spe-0.040
oc-0.040
 shortcut-0.039
Token her
Token ID607
Feature activation+9.79e-02

Pos logit contributions
 sle+0.019
 spe+0.018
 grizz+0.016
 tow+0.016
 nut+0.016
Neg logit contributions
 herself-0.016
ena-0.014
iece-0.014
 styles-0.014
 shapes-0.013
Token present
Token ID1944
Feature activation+1.73e+00

Pos logit contributions
 sle+0.003
 grizz+0.003
 spe+0.003
 shortcut+0.003
oc+0.003
Neg logit contributions
 herself-0.002
 brushes-0.002
 shapes-0.002
 hair-0.002
ena-0.002
Token next
Token ID1306
Feature activation+9.85e-01

Pos logit contributions
 sle+0.044
 spe+0.037
 trove+0.035
 nut+0.034
 gru+0.034
Neg logit contributions
 herself-0.050
 themselves-0.043
 rhythms-0.043
ena-0.042
 styles-0.042
Token.
Token ID13
Feature activation+3.58e-01

Pos logit contributions
 sle+0.028
 grizz+0.025
 spe+0.025
 shortcut+0.025
oc+0.025
Neg logit contributions
 herself-0.027
ena-0.022
 shapes-0.022
 themselves-0.022
 brushes-0.021
Token She
Token ID1375
Feature activation+3.32e-01

Pos logit contributions
 sle+0.011
 spe+0.010
 grizz+0.010
 shortcut+0.010
oc+0.010
Neg logit contributions
 herself-0.009
ena-0.007
 themselves-0.007
 shapes-0.007
 styles-0.007
Token sees
Token ID7224
Feature activation-7.32e-01

Pos logit contributions
 sle+0.009
 spe+0.008
 grizz+0.008
 shortcut+0.008
oc+0.007
Neg logit contributions
 herself-0.010
ena-0.008
 themselves-0.008
 shapes-0.008
 brushes-0.008
Token a
Token ID257
Feature activation+3.49e-01

Pos logit contributions
 herself+0.017
iece+0.016
ina+0.016
abies+0.016
ough+0.016
Neg logit contributions
 sle-0.018
 spe-0.016
 grizz-0.016
 polar-0.016
 nut-0.016
Token big
Token ID1263
Feature activation+1.95e+00

Pos logit contributions
 sle+0.009
 grizz+0.008
|+0.008
oc+0.008
run+0.008
Neg logit contributions
 herself-0.010
 shapes-0.009
ena-0.008
 brushes-0.008
 hair-0.008
Token doll
Token ID3654
Feature activation-2.54e-03

Pos logit contributions
 sle+0.052
run+0.048
|+0.048
 shortcut+0.048
oc+0.046
Neg logit contributions
 herself-0.051
 shapes-0.045
 hair-0.042
ena-0.042
 brushes-0.041
Token no
Token ID645
Feature activation-1.47e+00

Pos logit contributions
 sle+0.007
 grizz+0.006
 spe+0.006
 shortcut+0.006
oc+0.006
Neg logit contributions
 herself-0.007
ena-0.006
 themselves-0.006
 shapes-0.006
 styles-0.006
Token bananas
Token ID35484
Feature activation-1.68e+00

Pos logit contributions
 herself+0.039
 themselves+0.032
ena+0.032
 shapes+0.030
 brushes+0.029
Neg logit contributions
 sle-0.044
 spe-0.039
 grizz-0.039
 shortcut-0.039
 nut-0.038
Token were
Token ID547
Feature activation-5.57e-01

Pos logit contributions
 herself+0.043
ena+0.035
 themselves+0.034
 shapes+0.033
 brushes+0.032
Neg logit contributions
 sle-0.051
 spe-0.046
 grizz-0.045
 shortcut-0.045
oc-0.044
Token to
Token ID284
Feature activation+1.91e+00

Pos logit contributions
 herself+0.017
ena+0.014
 themselves+0.014
 shapes+0.013
 brushes+0.013
Neg logit contributions
 sle-0.015
 grizz-0.013
 spe-0.013
 shortcut-0.013
oc-0.012
Token be
Token ID307
Feature activation-1.44e-01

Pos logit contributions
 sle+0.046
 spe+0.040
 grizz+0.039
 shortcut+0.039
oc+0.037
Neg logit contributions
 herself-0.061
ena-0.052
 themselves-0.051
 shapes-0.050
 brushes-0.049
Token found
Token ID1043
Feature activation+2.26e-01

Pos logit contributions
 herself+0.005
ena+0.004
 themselves+0.004
 styles+0.004
 shapes+0.004
Neg logit contributions
 sle-0.003
 spe-0.003
 grizz-0.003
 gru-0.003
 shortcut-0.003
Token!
Token ID0
Feature activation+5.35e-02

Pos logit contributions
 sle+0.007
 spe+0.006
 grizz+0.006
 shortcut+0.006
oc+0.006
Neg logit contributions
 herself-0.006
ena-0.005
 themselves-0.005
 shapes-0.005
 brushes-0.005
Token Jack
Token ID3619
Feature activation-1.48e+00

Pos logit contributions
 sle+0.001
 spe+0.001
 grizz+0.001
 shortcut+0.001
oc+0.001
Neg logit contributions
 herself-0.002
ena-0.001
 themselves-0.001
 shapes-0.001
 brushes-0.001
Token had
Token ID550
Feature activation-6.36e-01

Pos logit contributions
 herself+0.042
 themselves+0.036
ena+0.035
 styles+0.034
iece+0.034
Neg logit contributions
 sle-0.039
 grizz-0.037
oc-0.036
 spe-0.035
 nut-0.034
Token a
Token ID257
Feature activation-6.13e-01

Pos logit contributions
 herself+0.016
ena+0.014
 themselves+0.013
iece+0.013
 shapes+0.013
Neg logit contributions
 sle-0.018
 spe-0.017
 grizz-0.017
 shortcut-0.016
 nut-0.016
Token sad
Token ID6507
Feature activation+2.95e-01

Pos logit contributions
 herself+0.018
 shapes+0.015
ena+0.015
 hair+0.015
 brushes+0.015
Neg logit contributions
 sle-0.015
 grizz-0.013
|-0.013
 shortcut-0.013
 spe-0.013
Token put
Token ID1234
Feature activation-7.37e-02

Pos logit contributions
 herself+0.010
iece+0.009
ena+0.009
 themselves+0.009
 styles+0.008
Neg logit contributions
 spe-0.007
 sle-0.006
 nut-0.006
 poisonous-0.006
 shortcut-0.006
Token the
Token ID262
Feature activation+1.38e+00

Pos logit contributions
ena+0.002
 herself+0.002
iece+0.002
 rhythms+0.001
era+0.001
Neg logit contributions
 sle-0.002
 spe-0.002
 nut-0.002
 poisonous-0.002
 grizz-0.002
Token bread
Token ID8509
Feature activation-1.21e-01

Pos logit contributions
 sle+0.036
 nut+0.032
 spe+0.032
 shortcut+0.031
 grizz+0.031
Neg logit contributions
 herself-0.040
ena-0.036
 themselves-0.036
iece-0.033
 rhythms-0.031
Token on
Token ID319
Feature activation+1.44e+00

Pos logit contributions
 herself+0.004
ena+0.004
 styles+0.003
 rhythms+0.003
 brushes+0.003
Neg logit contributions
 sle-0.003
 nut-0.002
 shortcut-0.002
 spe-0.002
 poisonous-0.002
Token the
Token ID262
Feature activation+1.41e-01

Pos logit contributions
 sle+0.040
 spe+0.037
 nut+0.037
 Roger+0.037
 grizz+0.036
Neg logit contributions
ena-0.029
 herself-0.028
iece-0.028
 styles-0.028
ups-0.028
Token ground
Token ID2323
Feature activation+9.03e-02

Pos logit contributions
 sle+0.004
 spe+0.004
 shortcut+0.004
 grizz+0.004
 nut+0.004
Neg logit contributions
 herself-0.004
ena-0.003
 themselves-0.003
 styles-0.003
 brushes-0.003
Token with
Token ID351
Feature activation+1.26e+00

Pos logit contributions
 sle+0.002
 spe+0.002
 grizz+0.002
 shortcut+0.002
 nut+0.002
Neg logit contributions
 herself-0.003
ena-0.002
 themselves-0.002
 styles-0.002
 shapes-0.002
Token her
Token ID607
Feature activation+2.44e-02

Pos logit contributions
 spe+0.039
 sle+0.038
 nut+0.037
 poisonous+0.035
 tow+0.035
Neg logit contributions
ena-0.026
ina-0.024
era-0.023
iece-0.022
 styles-0.022
Token makeup
Token ID16029
Feature activation+1.40e+00

Pos logit contributions
 sle+0.001
 spe+0.001
 nut+0.001
oc+0.001
 grizz+0.001
Neg logit contributions
 herself-0.001
ena-0.001
iece-0.000
 themselves-0.000
 styles-0.000
Token!
Token ID0
Feature activation+3.42e-01

Pos logit contributions
 sle+0.034
 spe+0.031
 shortcut+0.030
 trove+0.029
 nut+0.029
Neg logit contributions
 herself-0.040
ena-0.037
 themselves-0.037
 rhythms-0.034
 shapes-0.032
Token The
Token ID383
Feature activation+1.83e-02

Pos logit contributions
 sle+0.007
 grizz+0.007
 Nice+0.007
 Want+0.006
 gru+0.006
Neg logit contributions
 herself-0.010
 themselves-0.009
ena-0.009
 shapes-0.009
iece-0.009
Token was
Token ID373
Feature activation+6.70e-01

Pos logit contributions
 sle+0.050
 grizz+0.048
oc+0.047
 spe+0.047
 hunter+0.045
Neg logit contributions
 herself-0.079
 themselves-0.068
iece-0.068
 styles-0.066
 shapes-0.066
Token brave
Token ID14802
Feature activation+1.85e+00

Pos logit contributions
 sle+0.019
 spe+0.017
 grizz+0.017
 shortcut+0.017
oc+0.016
Neg logit contributions
 herself-0.019
ena-0.015
 themselves-0.015
 shapes-0.015
 brushes-0.014
Token to
Token ID284
Feature activation+2.32e+00

Pos logit contributions
 sle+0.054
 grizz+0.048
 spe+0.048
 shortcut+0.047
 gru+0.046
Neg logit contributions
 herself-0.048
ena-0.040
 themselves-0.039
 brushes-0.038
 shapes-0.037
Token explore
Token ID7301
Feature activation+8.28e-01

Pos logit contributions
 sle+0.055
 shortcut+0.049
 grizz+0.046
 spe+0.046
 tow+0.046
Neg logit contributions
 herself-0.070
ena-0.063
 shapes-0.061
 styles-0.060
 brushes-0.060
Token the
Token ID262
Feature activation-2.78e-01

Pos logit contributions
 sle+0.026
 spe+0.023
 grizz+0.023
 shortcut+0.023
oc+0.022
Neg logit contributions
 herself-0.020
ena-0.017
 themselves-0.016
 shapes-0.015
 brushes-0.015
Token backyard
Token ID24296
Feature activation-4.69e-01

Pos logit contributions
 herself+0.008
 shapes+0.007
ena+0.007
 themselves+0.007
 brushes+0.007
Neg logit contributions
 sle-0.007
 spe-0.006
 grizz-0.006
 shortcut-0.006
oc-0.006
Token and
Token ID290
Feature activation-8.72e-01

Pos logit contributions
 herself+0.012
ena+0.011
 brushes+0.011
 themselves+0.010
 shapes+0.010
Neg logit contributions
 sle-0.012
 spe-0.011
 grizz-0.011
oc-0.011
 shortcut-0.011
Token go
Token ID467
Feature activation+1.93e+00

Pos logit contributions
 herself+0.024
ena+0.022
iece+0.021
 shapes+0.020
 styles+0.020
Neg logit contributions
 sle-0.021
 shortcut-0.020
 spe-0.019
 grizz-0.019
 nut-0.019
Token right
Token ID826
Feature activation-1.37e-02

Pos logit contributions
 sle+0.061
 spe+0.057
 grizz+0.054
oc+0.054
 gru+0.052
Neg logit contributions
 herself-0.052
 themselves-0.039
ena-0.036
 shapes-0.035
 when-0.035
Token up
Token ID510
Feature activation-1.22e+00

Pos logit contributions
 herself+0.000
ena+0.000
 shapes+0.000
 brushes+0.000
 styles+0.000
Neg logit contributions
 sle-0.000
 shortcut-0.000
 gru-0.000
 grizz-0.000
 victorious-0.000
Token to
Token ID284
Feature activation+3.92e-01

Pos logit contributions
 herself+0.033
ena+0.029
 shapes+0.027
 themselves+0.027
 brushes+0.027
Neg logit contributions
 sle-0.034
 spe-0.030
 grizz-0.030
 shortcut-0.030
 gru-0.029
Token.
Token ID13
Feature activation+1.78e-01

Pos logit contributions
 herself+0.042
 themselves+0.036
ena+0.036
 shapes+0.034
 brushes+0.034
Neg logit contributions
 sle-0.049
 grizz-0.043
 gru-0.043
 spe-0.043
oc-0.042
Token
Token ID198
Feature activation+7.13e-01

Pos logit contributions
 sle+0.004
 grizz+0.003
 spe+0.003
 shortcut+0.003
 gru+0.003
Neg logit contributions
 herself-0.006
ena-0.005
 shapes-0.005
 themselves-0.005
 styles-0.005
Token
Token ID198
Feature activation+1.18e+00

Pos logit contributions
 sle+0.016
 grizz+0.014
 shortcut+0.014
oc+0.014
 spe+0.013
Neg logit contributions
 herself-0.023
ena-0.020
 shapes-0.020
 themselves-0.019
 styles-0.019
Token"
Token ID1
Feature activation-2.75e+00

Pos logit contributions
 sle+0.024
oc+0.023
 tow+0.022
 shortcut+0.021
 gru+0.020
Neg logit contributions
 herself-0.038
 shapes-0.034
 styles-0.033
 themselves-0.032
iece-0.032
TokenLet
Token ID5756
Feature activation-4.60e-01

Pos logit contributions
 herself+0.086
iece+0.079
 clothes+0.078
 hair+0.075
 faces+0.075
Neg logit contributions
 spe-0.054
Roger-0.047
 gru-0.045
 tow-0.045
 sle-0.044
Token's
Token ID338
Feature activation-5.28e+00

Pos logit contributions
 herself+0.012
ena+0.011
 themselves+0.010
 styles+0.010
iece+0.010
Neg logit contributions
 sle-0.013
 shortcut-0.011
oc-0.011
 nut-0.011
 spe-0.011
Token give
Token ID1577
Feature activation+1.66e+00

Pos logit contributions
 herself+0.165
ena+0.155
iece+0.148
 styles+0.143
 themselves+0.143
Neg logit contributions
 sle-0.116
 nut-0.103
 shortcut-0.100
 spe-0.097
 tow-0.094
Token it
Token ID340
Feature activation+5.42e-01

Pos logit contributions
 sle+0.037
oc+0.032
 nut+0.032
 spe+0.031
 hunter+0.028
Neg logit contributions
ena-0.048
 herself-0.047
iece-0.047
 styles-0.045
 rhythms-0.043
Token a
Token ID257
Feature activation+1.77e+00

Pos logit contributions
 sle+0.014
 nut+0.012
 shortcut+0.012
 spe+0.011
 feast+0.011
Neg logit contributions
 herself-0.015
ena-0.014
iece-0.013
ina-0.013
 styles-0.013
Token special
Token ID2041
Feature activation+1.31e+00

Pos logit contributions
 sle+0.048
 grizz+0.045
oc+0.042
 polar+0.041
 spe+0.041
Neg logit contributions
 herself-0.048
 shapes-0.041
 brushes-0.040
 hair-0.039
ena-0.037
Token hug
Token ID16225
Feature activation+1.96e+00

Pos logit contributions
 sle+0.034
 grizz+0.029
 spe+0.029
 shortcut+0.029
oc+0.028
Neg logit contributions
 herself-0.040
ena-0.033
 themselves-0.032
 shapes-0.032
 brushes-0.031
Token He
Token ID679
Feature activation-1.18e-01

Pos logit contributions
 herself+0.089
ena+0.080
 shapes+0.078
 themselves+0.077
 styles+0.076
Neg logit contributions
 sle-0.051
oc-0.043
 grizz-0.041
run-0.040
 shortcut-0.040
Token then
Token ID788
Feature activation-1.62e+00

Pos logit contributions
 herself+0.004
 themselves+0.003
iece+0.003
 shapes+0.003
ena+0.003
Neg logit contributions
 sle-0.003
 grizz-0.003
 spe-0.003
 shortcut-0.003
oc-0.002
Token got
Token ID1392
Feature activation-5.59e-01

Pos logit contributions
 herself+0.050
 themselves+0.044
ena+0.043
 shapes+0.042
 styles+0.042
Neg logit contributions
 sle-0.039
 shortcut-0.034
 grizz-0.033
 gru-0.033
 spe-0.032
Token up
Token ID510
Feature activation-7.25e-01

Pos logit contributions
 herself+0.015
ena+0.014
 styles+0.014
 shapes+0.013
iece+0.013
Neg logit contributions
 sle-0.015
 gru-0.013
 shortcut-0.013
 grizz-0.013
 polar-0.012
Token and
Token ID290
Feature activation-2.39e-01

Pos logit contributions
 herself+0.017
ena+0.013
 themselves+0.013
 shapes+0.013
 brushes+0.012
Neg logit contributions
 sle-0.024
 spe-0.021
 grizz-0.021
 shortcut-0.021
oc-0.021
Token started
Token ID2067
Feature activation-4.06e+00

Pos logit contributions
 herself+0.007
ena+0.006
 themselves+0.006
 shapes+0.006
 styles+0.006
Neg logit contributions
 sle-0.006
 shortcut-0.006
 grizz-0.005
 spe-0.005
 gru-0.005
Token winding
Token ID28967
Feature activation-1.81e+00

Pos logit contributions
 herself+0.138
ena+0.127
 themselves+0.121
 shapes+0.121
 styles+0.119
Neg logit contributions
 sle-0.084
 shortcut-0.069
 grizz-0.068
 spe-0.068
 gru-0.064
Token his
Token ID465
Feature activation-1.57e+00

Pos logit contributions
 herself+0.047
ena+0.043
 brushes+0.038
 themselves+0.038
 shapes+0.038
Neg logit contributions
 sle-0.050
 shortcut-0.045
 spe-0.045
 grizz-0.044
 gru-0.043
Token way
Token ID835
Feature activation+1.99e+00

Pos logit contributions
 herself+0.051
ena+0.043
 themselves+0.042
 shapes+0.042
 brushes+0.040
Neg logit contributions
 sle-0.037
 spe-0.032
 grizz-0.032
 shortcut-0.031
oc-0.031
Token through
Token ID832
Feature activation-2.28e+00

Pos logit contributions
 sle+0.056
 spe+0.050
 grizz+0.049
 shortcut+0.049
oc+0.048
Neg logit contributions
 herself-0.055
ena-0.045
 themselves-0.045
 shapes-0.044
 brushes-0.042
Token the
Token ID262
Feature activation-1.31e+00

Pos logit contributions
 herself+0.057
ena+0.048
 themselves+0.046
 shapes+0.045
 brushes+0.045
Neg logit contributions
 sle-0.069
 spe-0.062
 grizz-0.061
 shortcut-0.060
oc-0.058
Token climbed
Token ID19952
Feature activation+4.83e-02

Pos logit contributions
 herself+0.065
ena+0.053
 themselves+0.051
 shapes+0.051
 brushes+0.049
Neg logit contributions
 sle-0.068
 grizz-0.060
 shortcut-0.060
 spe-0.060
 gru-0.058
Token a
Token ID257
Feature activation-8.50e-01

Pos logit contributions
 sle+0.002
 grizz+0.001
 spe+0.001
 shortcut+0.001
oc+0.001
Neg logit contributions
 herself-0.001
 themselves-0.001
ena-0.001
 shapes-0.001
 brushes-0.001
Token mountain
Token ID8598
Feature activation+6.11e-01

Pos logit contributions
 herself+0.019
 shapes+0.018
 clothes+0.017
 brushes+0.017
 when+0.016
Neg logit contributions
run-0.023
 grizz-0.022
reetings-0.022
joy-0.022
 sle-0.021
Token,
Token ID11
Feature activation-2.45e+00

Pos logit contributions
 sle+0.019
 spe+0.017
 grizz+0.016
 shortcut+0.016
oc+0.016
Neg logit contributions
 herself-0.016
ena-0.013
 themselves-0.012
 shapes-0.012
 brushes-0.012
Token and
Token ID290
Feature activation-2.00e+00

Pos logit contributions
 herself+0.059
ena+0.047
 themselves+0.046
 shapes+0.045
 brushes+0.043
Neg logit contributions
 sle-0.079
 spe-0.071
 grizz-0.070
 shortcut-0.070
oc-0.068
Token even
Token ID772
Feature activation-3.79e+00

Pos logit contributions
 herself+0.057
ena+0.049
 styles+0.048
 opinions+0.047
 shapes+0.047
Neg logit contributions
 sle-0.052
 spe-0.048
 polar-0.045
 grizz-0.044
|-0.044
Token flew
Token ID13112
Feature activation+1.53e+00

Pos logit contributions
 herself+0.102
ena+0.084
 themselves+0.083
 shapes+0.079
 styles+0.079
Neg logit contributions
 sle-0.108
 spe-0.098
 grizz-0.094
 polar-0.093
 shortcut-0.093
Token to
Token ID284
Feature activation-4.39e-01

Pos logit contributions
 sle+0.040
 spe+0.036
 shortcut+0.035
 grizz+0.034
 gru+0.034
Neg logit contributions
 herself-0.043
ena-0.039
 brushes-0.036
 styles-0.035
 shapes-0.035
Token the
Token ID262
Feature activation+2.44e-01

Pos logit contributions
 herself+0.010
 themselves+0.008
ena+0.008
 shapes+0.007
 styles+0.007
Neg logit contributions
 sle-0.015
 grizz-0.013
 shortcut-0.013
 spe-0.013
 gru-0.013
Token moon
Token ID8824
Feature activation-2.09e+00

Pos logit contributions
 sle+0.007
 spe+0.006
 grizz+0.006
 shortcut+0.006
oc+0.006
Neg logit contributions
 herself-0.007
ena-0.005
 shapes-0.005
 themselves-0.005
 brushes-0.005
Token.
Token ID13
Feature activation-1.98e-01

Pos logit contributions
 herself+0.059
ena+0.049
 brushes+0.049
 themselves+0.048
 styles+0.047
Neg logit contributions
 sle-0.059
oc-0.050
 spe-0.049
 polar-0.048
 grizz-0.048
Token handle
Token ID5412
Feature activation-1.07e+00

Pos logit contributions
 sle+0.017
 grizz+0.015
 spe+0.014
 Amen+0.014
|+0.014
Neg logit contributions
 herself-0.020
 shapes-0.017
 brushes-0.017
ena-0.016
 clothes-0.016
Token of
Token ID286
Feature activation-9.08e-02

Pos logit contributions
 herself+0.032
ena+0.030
 themselves+0.027
 shapes+0.027
 styles+0.027
Neg logit contributions
 sle-0.026
 shortcut-0.022
 spe-0.021
run-0.021
 tow-0.020
Token the
Token ID262
Feature activation+3.96e-01

Pos logit contributions
 herself+0.002
ena+0.002
 themselves+0.002
 shapes+0.002
 styles+0.002
Neg logit contributions
 sle-0.003
 spe-0.003
 grizz-0.003
 shortcut-0.003
oc-0.002
Token cart
Token ID6383
Feature activation-4.24e-01

Pos logit contributions
 sle+0.010
 spe+0.009
 grizz+0.009
 Amen+0.008
|+0.008
Neg logit contributions
 herself-0.013
 shapes-0.010
 brushes-0.010
ena-0.010
 themselves-0.010
Token and
Token ID290
Feature activation-4.64e-01

Pos logit contributions
 herself+0.011
ena+0.010
 shapes+0.010
 themselves+0.010
 styles+0.010
Neg logit contributions
 sle-0.011
 shortcut-0.010
 grizz-0.010
oc-0.010
 spe-0.010
Token began
Token ID2540
Feature activation-3.75e+00

Pos logit contributions
 herself+0.013
ena+0.012
 themselves+0.011
 opinions+0.011
 styles+0.011
Neg logit contributions
 sle-0.012
 spe-0.011
 shortcut-0.010
 tow-0.010
 grizz-0.010
Token to
Token ID284
Feature activation+2.48e-01

Pos logit contributions
 herself+0.114
ena+0.104
 themselves+0.097
iece+0.096
 shapes+0.094
Neg logit contributions
 sle-0.090
 spe-0.080
 shortcut-0.076
 grizz-0.075
run-0.073
Token push
Token ID4574
Feature activation-9.29e-01

Pos logit contributions
 sle+0.006
 shortcut+0.005
 spe+0.005
 grizz+0.005
oc+0.005
Neg logit contributions
 herself-0.008
ena-0.007
 themselves-0.007
 shapes-0.006
 brushes-0.006
Token it
Token ID340
Feature activation-3.92e-02

Pos logit contributions
ena+0.026
 herself+0.024
 brushes+0.024
ina+0.023
mons+0.023
Neg logit contributions
 sle-0.020
 spe-0.019
oc-0.019
 shortcut-0.018
run-0.018
Token.
Token ID13
Feature activation+5.64e-02

Pos logit contributions
ena+0.001
 herself+0.001
 brushes+0.001
 themselves+0.001
 styles+0.001
Neg logit contributions
 shortcut-0.001
 sle-0.001
run-0.001
 spe-0.001
 victorious-0.001
Token The
Token ID383
Feature activation-3.38e-01

Pos logit contributions
 sle+0.001
 grizz+0.001
 spe+0.001
 shortcut+0.001
oc+0.001
Neg logit contributions
 herself-0.002
ena-0.002
 shapes-0.001
 themselves-0.001
 styles-0.001
Token like
Token ID588
Feature activation-8.24e-01

Pos logit contributions
 herself+0.061
 shapes+0.045
 themselves+0.045
ena+0.043
 brushes+0.040
Neg logit contributions
 sle-0.105
 grizz-0.097
oc-0.095
 spe-0.094
 shortcut-0.094
Token bells
Token ID30987
Feature activation-5.90e-01

Pos logit contributions
 herself+0.020
ena+0.016
 themselves+0.015
iece+0.015
 shapes+0.015
Neg logit contributions
 sle-0.026
 spe-0.024
 grizz-0.023
 shortcut-0.023
 nut-0.022
Token and
Token ID290
Feature activation-1.07e+00

Pos logit contributions
 herself+0.017
ena+0.014
 themselves+0.014
iece+0.013
 brushes+0.013
Neg logit contributions
 sle-0.016
 spe-0.014
oc-0.013
 grizz-0.013
 shortcut-0.013
Token ho
Token ID8169
Feature activation+2.55e+00

Pos logit contributions
 herself+0.028
ena+0.025
iece+0.024
ina+0.023
 themselves+0.023
Neg logit contributions
 sle-0.032
 spe-0.028
 grizz-0.027
 nut-0.026
 hunter-0.026
Tokenof
Token ID1659
Feature activation-1.57e+00

Pos logit contributions
 sle+0.068
 spe+0.058
 grizz+0.058
 shortcut+0.058
oc+0.056
Neg logit contributions
 herself-0.076
ena-0.063
 themselves-0.062
 shapes-0.062
 styles-0.060
Tokens
Token ID82
Feature activation-3.86e+00

Pos logit contributions
 herself+0.048
ena+0.041
 themselves+0.040
iece+0.040
 brushes+0.038
Neg logit contributions
 sle-0.040
oc-0.035
 spe-0.034
 nut-0.033
 shortcut-0.032
Token."
Token ID526
Feature activation-2.73e+00

Pos logit contributions
 herself+0.108
ena+0.095
 themselves+0.088
iece+0.088
 shapes+0.087
Neg logit contributions
 sle-0.104
 spe-0.089
 shortcut-0.088
 grizz-0.085
 nut-0.084
Token Anna
Token ID11735
Feature activation-2.54e+00

Pos logit contributions
 herself+0.066
ena+0.052
 themselves+0.051
 shapes+0.050
 styles+0.048
Neg logit contributions
 sle-0.088
 spe-0.078
 grizz-0.078
 shortcut-0.078
 gru-0.076
Token said
Token ID531
Feature activation+1.11e+00

Pos logit contributions
 herself+0.080
 themselves+0.070
iece+0.070
 styles+0.069
mons+0.067
Neg logit contributions
 sle-0.054
 grizz-0.051
oc-0.050
 gru-0.047
 spe-0.047
Token.
Token ID13
Feature activation+1.24e-01

Pos logit contributions
 sle+0.033
 spe+0.029
 grizz+0.029
 shortcut+0.029
 gru+0.029
Neg logit contributions
 herself-0.028
ena-0.023
 shapes-0.022
 themselves-0.022
 styles-0.022
Token
Token ID198
Feature activation+3.33e-01

Pos logit contributions
 sle+0.003
 spe+0.002
 grizz+0.002
 shortcut+0.002
 gru+0.002
Neg logit contributions
 herself-0.004
ena-0.003
 themselves-0.003
 shapes-0.003
 styles-0.003