TOP ACTIVATIONS
MAX = 0.165

in his pocket . " These are mine now . You
at the bee . " These are our bugs ! You
movie . " These are for you ," their
one to Tom . " These are for you . I
," Ben says . " These are pretty . Can we
and some chocolate . " These are the things we need
. He says , " These are things for the people
!" they say . " These are awesome !"
said his mummy . " These small , green things are
," Mom says . " These are your k ites .
them . " These toys are not safe .
says . " These are for you , kids
. She says , " These are for our son .
said . " These are treasures from the old
!" Sam said . " These are my bricks . I
the man says . " These are my fruits . You
!" they said . " These are the best vans ever
kind . " These are my books . I
," grandma said . " These are for you to keep
. He says , " These are mine ! Go away

MIN ACTIVATIONS
MIN = -0.184

was very careful to prevent anyone from finding out .
grass around the hill was wit hered and yellow . He
, " I 'll prevent anyone from making fun of you
, he would always prevent anyone from eating the tasty treats
by the door to prevent anyone from coming in and taking
hadn 't been won by anyone for a long long time
it and wouldn 't let anyone else play with it .
because danger can come to anyone . <|endoftext|> Once there was
your friends and never let anyone tease them . <|endoftext|> Once
so they wouldn 't bother anyone else . <|endoftext|> Once upon
D illy wouldn 't let anyone . She kept it safe
day . He never helped anyone in the village . One
taken but the park keeper wouldn 't listen . He kept
little girl was scared she wouldn 't come back . But
and he would never let anyone else get near it .
to prevent them from hurting anyone , but there were too
, " Why won 't anyone be my friend ?"
a safe place so it wouldn 't get lost again .
from the river so it wouldn 't get stuck on the
not to be seen by anyone . One

INTERVAL 0.078 - 0.165
CONTAINS 1.078%

felt happy . They liked their room and their decorations .
in the attic . We have to remove some things to
was big and fancy , with a little kitchen inside .
. What should I do with it ?" She
and had lots of fun with it in the backyard .

INTERVAL -0.010 - 0.078
CONTAINS 59.103%

Lily drew a picture of a cat . She showed it
are happy to see him . They want to show him
?" " I can 't find my chalk
and run in the park . One day , Max saw
and said , “ App l aud

INTERVAL -0.097 - -0.010
CONTAINS 39.488%

do . Lucy said yes and followed Jack .
street in a row , when the younger sister said to
tree and lay down for a nap . Suddenly , he
it now ?" His mom laughed and said , "
the deep woods . As she walked , she noticed something

INTERVAL -0.184 - -0.097
CONTAINS 0.331%

sunset was a sign of danger . A huge storm was
find it , but they couldn 't . Tim my was
on the ground . She didn 't know what it was
He was so excited and couldn 't wait to eat .
very obedient boy yesterday . Everywhere he went , he behaved
Token in
Token ID287
Feature activation+2.98e-02

Pos logit contributions
 collection+0.039
 wrists+0.038
 cand+0.038
 notebooks+0.036
 traded+0.035
Neg logit contributions
 reckless-0.047
 frightening-0.042
 afraid-0.042
 cal-0.041
 911-0.040
Token his
Token ID465
Feature activation+2.57e-02

Pos logit contributions
 collection+0.055
 wrists+0.054
 cand+0.054
 traded+0.053
 notebooks+0.051
Neg logit contributions
 reckless-0.044
 frightening-0.042
 cal-0.041
 911-0.040
Little-0.040
Token pocket
Token ID10000
Feature activation+3.57e-02

Pos logit contributions
 cand+0.029
 traded+0.028
 wrists+0.028
 collection+0.027
 extract+0.025
Neg logit contributions
 reckless-0.055
 cal-0.052
 gut-0.051
 911-0.051
 frightening-0.050
Token.
Token ID13
Feature activation+1.72e-02

Pos logit contributions
 collection+0.056
 wrists+0.055
 cand+0.054
 traded+0.051
 notebooks+0.051
Neg logit contributions
 reckless-0.066
 frightening-0.060
 cal-0.058
 afraid-0.058
 gut-0.057
Token "
Token ID366
Feature activation-1.06e-03

Pos logit contributions
 collection+0.027
 wrists+0.026
 cand+0.026
 notebooks+0.024
 traded+0.024
Neg logit contributions
 reckless-0.032
 frightening-0.029
 afraid-0.028
 cal-0.028
Little-0.028
TokenThese
Token ID4711
Feature activation+1.65e-01

Pos logit contributions
 reckless+0.002
Little+0.001
Down+0.001
 cal+0.001
 frightening+0.001
Neg logit contributions
 collection-0.002
 cand-0.002
 wrists-0.002
 notebooks-0.002
 mar-0.002
Token are
Token ID389
Feature activation+9.79e-02

Pos logit contributions
 wrists+0.234
 collection+0.230
 traded+0.219
 notebooks+0.216
 coloring+0.211
Neg logit contributions
 reckless-0.321
 cal-0.282
 frightening-0.277
 afraid-0.277
 gut-0.275
Token mine
Token ID6164
Feature activation+7.95e-02

Pos logit contributions
 collection+0.181
 wrists+0.179
 wrist+0.174
 notebooks+0.169
 extract+0.165
Neg logit contributions
 reckless-0.152
 frightening-0.136
 cal-0.133
 afraid-0.133
 evil-0.132
Token now
Token ID783
Feature activation+1.51e-02

Pos logit contributions
 collection+0.147
 wrists+0.144
 cand+0.142
 traded+0.135
 notebooks+0.135
Neg logit contributions
 reckless-0.127
 frightening-0.114
 afraid-0.110
 cal-0.109
 gut-0.106
Token.
Token ID13
Feature activation+1.65e-02

Pos logit contributions
 collection+0.029
 wrists+0.029
 cand+0.028
 notebooks+0.028
 traded+0.027
Neg logit contributions
 reckless-0.023
 frightening-0.020
 afraid-0.019
 cal-0.019
Little-0.019
Token You
Token ID921
Feature activation+4.08e-03

Pos logit contributions
 collection+0.035
 cand+0.034
 wrists+0.033
 notebooks+0.032
 sandwich+0.031
Neg logit contributions
 reckless-0.023
 911-0.019
 afraid-0.019
Little-0.018
 cal-0.018
Token at
Token ID379
Feature activation+2.57e-02

Pos logit contributions
 collection+0.001
 wrists+0.001
 cand+0.001
 notebooks+0.001
 traded+0.001
Neg logit contributions
 reckless-0.002
 frightening-0.002
 afraid-0.002
 cal-0.002
Little-0.002
Token the
Token ID262
Feature activation-1.69e-03

Pos logit contributions
 collection+0.046
 wrists+0.045
 cand+0.045
 traded+0.042
 notebooks+0.042
Neg logit contributions
 reckless-0.042
 frightening-0.038
 cal-0.037
 afraid-0.037
Little-0.036
Token bee
Token ID20697
Feature activation-5.90e-02

Pos logit contributions
 reckless+0.003
 frightening+0.003
 cal+0.003
 afraid+0.003
 beast+0.003
Neg logit contributions
 collection-0.002
 wrists-0.002
 cand-0.002
 traded-0.002
 notebooks-0.002
Token.
Token ID13
Feature activation+1.53e-02

Pos logit contributions
 reckless+0.106
 frightening+0.096
 afraid+0.094
 cal+0.093
Little+0.091
Neg logit contributions
 collection-0.096
 wrists-0.095
 cand-0.094
 notebooks-0.089
 traded-0.088
Token "
Token ID366
Feature activation-5.55e-04

Pos logit contributions
 collection+0.024
 wrists+0.023
 cand+0.023
 notebooks+0.021
 traded+0.021
Neg logit contributions
 reckless-0.029
 frightening-0.026
 afraid-0.026
 cal-0.026
Little-0.026
TokenThese
Token ID4711
Feature activation+1.64e-01

Pos logit contributions
 reckless+0.001
 frightening+0.001
 cal+0.001
 afraid+0.001
Little+0.001
Neg logit contributions
 collection-0.001
 wrists-0.001
 cand-0.001
 notebooks-0.001
 traded-0.001
Token are
Token ID389
Feature activation+1.01e-01

Pos logit contributions
 collection+0.248
 wrists+0.242
 cand+0.238
 traded+0.223
 notebooks+0.222
Neg logit contributions
 reckless-0.318
 frightening-0.291
 cal-0.282
 afraid-0.281
Little-0.277
Token our
Token ID674
Feature activation+1.11e-01

Pos logit contributions
 collection+0.191
 wrists+0.185
 wrist+0.177
 cand+0.177
 traded+0.176
Neg logit contributions
 reckless-0.161
 frightening-0.151
 afraid-0.147
 evil-0.141
 cal-0.141
Token bugs
Token ID11316
Feature activation-2.88e-02

Pos logit contributions
 wrists+0.140
 traded+0.137
 cand+0.135
 collection+0.134
 notebooks+0.125
Neg logit contributions
 reckless-0.227
 frightening-0.213
 cal-0.207
 afraid-0.207
 evil-0.206
Token!
Token ID0
Feature activation+2.07e-03

Pos logit contributions
 reckless+0.040
 frightening+0.035
 afraid+0.035
 cal+0.032
 outage+0.031
Neg logit contributions
 wrists-0.060
 collection-0.059
 cand-0.057
 notebooks-0.057
 traded-0.056
Token You
Token ID921
Feature activation+3.93e-03

Pos logit contributions
 collection+0.003
 cand+0.003
 wrists+0.003
 notebooks+0.003
 mar+0.003
Neg logit contributions
 reckless-0.004
 afraid-0.003
Little-0.003
 frightening-0.003
 911-0.003
Token movie
Token ID3807
Feature activation+3.53e-02

Pos logit contributions
 reckless+0.017
 frightening+0.016
 cal+0.016
 afraid+0.015
 beast+0.015
Neg logit contributions
 collection-0.011
 wrists-0.011
 cand-0.011
 traded-0.010
 notebooks-0.010
Token.
Token ID13
Feature activation+1.49e-02

Pos logit contributions
 collection+0.056
 wrists+0.056
 cand+0.055
 notebooks+0.051
 traded+0.051
Neg logit contributions
 reckless-0.064
 frightening-0.058
 afraid-0.057
 cal-0.057
Little-0.056
Token
Token ID198
Feature activation-1.45e-03

Pos logit contributions
 collection+0.023
 wrists+0.022
 cand+0.022
 notebooks+0.021
 traded+0.021
Neg logit contributions
 reckless-0.028
 frightening-0.026
 afraid-0.025
 cal-0.025
Little-0.025
Token
Token ID198
Feature activation+5.55e-04

Pos logit contributions
 reckless+0.003
 frightening+0.003
 unnecessary+0.003
 gut+0.003
 evil+0.003
Neg logit contributions
 collection-0.002
 wrists-0.001
 traded-0.001
 mar-0.001
 notebooks-0.001
Token"
Token ID1
Feature activation+4.31e-02

Pos logit contributions
 collection+0.001
 wrists+0.001
 cand+0.001
 traded+0.001
 mar+0.001
Neg logit contributions
 reckless-0.001
 frightening-0.001
 cal-0.001
 afraid-0.001
 gut-0.001
TokenThese
Token ID4711
Feature activation+1.61e-01

Pos logit contributions
 collection+0.085
 wrists+0.081
 cand+0.080
 notebooks+0.076
 traded+0.076
Neg logit contributions
 reckless-0.067
 frightening-0.059
Little-0.059
 cal-0.058
 afraid-0.057
Token are
Token ID389
Feature activation+9.58e-02

Pos logit contributions
 collection+0.203
 wrists+0.196
 cand+0.194
 traded+0.177
 notebooks+0.177
Neg logit contributions
 reckless-0.352
 frightening-0.326
 cal-0.316
 afraid-0.316
Little-0.312
Token for
Token ID329
Feature activation+1.72e-02

Pos logit contributions
 collection+0.179
 wrists+0.173
 wrist+0.165
 cand+0.165
 notebooks+0.162
Neg logit contributions
 reckless-0.154
 frightening-0.142
 afraid-0.139
 evil-0.135
 cal-0.134
Token you
Token ID345
Feature activation-6.81e-03

Pos logit contributions
 collection+0.029
 wrists+0.029
 cand+0.028
 traded+0.027
 notebooks+0.027
Neg logit contributions
 reckless-0.030
 frightening-0.027
 afraid-0.026
 cal-0.026
Little-0.026
Token,"
Token ID553
Feature activation-5.95e-03

Pos logit contributions
 reckless+0.012
 frightening+0.011
 cal+0.011
Little+0.010
 afraid+0.010
Neg logit contributions
 collection-0.012
 wrists-0.011
 cand-0.011
 traded-0.010
 mar-0.010
Token their
Token ID511
Feature activation+8.09e-02

Pos logit contributions
 reckless+0.010
 afraid+0.009
 frightening+0.009
 cal+0.009
Little+0.009
Neg logit contributions
 collection-0.011
 wrists-0.010
 cand-0.010
 traded-0.009
 notebooks-0.009
Token one
Token ID530
Feature activation+3.49e-04

Pos logit contributions
 collection+0.035
 cand+0.032
 wrists+0.032
 notebooks+0.032
 clubs+0.032
Neg logit contributions
 reckless-0.035
 cal-0.031
 afraid-0.031
 frightening-0.030
Little-0.029
Token to
Token ID284
Feature activation+4.50e-03

Pos logit contributions
 collection+0.001
 wrists+0.001
 cand+0.001
 traded+0.001
 notebooks+0.001
Neg logit contributions
 reckless-0.001
 cal-0.001
 frightening-0.001
 afraid-0.001
Little-0.001
Token Tom
Token ID4186
Feature activation+1.90e-02

Pos logit contributions
 collection+0.008
 cand+0.007
 notebooks+0.007
 wrists+0.007
 traded+0.007
Neg logit contributions
 reckless-0.008
 cal-0.007
Little-0.007
 frightening-0.007
 911-0.007
Token.
Token ID13
Feature activation+3.18e-02

Pos logit contributions
 cand+0.033
 wrists+0.033
 collection+0.033
 notebooks+0.032
 mar+0.031
Neg logit contributions
 reckless-0.030
 cal-0.028
 afraid-0.027
 911-0.026
 frightening-0.026
Token "
Token ID366
Feature activation+3.58e-03

Pos logit contributions
 collection+0.057
 cand+0.054
 wrists+0.053
 notebooks+0.051
 mar+0.050
Neg logit contributions
 reckless-0.053
 frightening-0.048
 cal-0.048
 afraid-0.048
 911-0.047
TokenThese
Token ID4711
Feature activation+1.61e-01

Pos logit contributions
 collection+0.007
 cand+0.007
 wrists+0.007
 notebooks+0.006
 mar+0.006
Neg logit contributions
 reckless-0.005
Little-0.005
Run-0.005
Down-0.005
 cal-0.005
Token are
Token ID389
Feature activation+9.38e-02

Pos logit contributions
 collection+0.238
 wrists+0.228
 notebooks+0.224
 traded+0.224
 cand+0.216
Neg logit contributions
 reckless-0.310
 cal-0.279
 frightening-0.277
 afraid-0.272
 gut-0.269
Token for
Token ID329
Feature activation+1.47e-02

Pos logit contributions
 collection+0.181
 wrists+0.174
 notebooks+0.171
 traded+0.169
 wrist+0.167
Neg logit contributions
 reckless-0.144
 frightening-0.130
 cal-0.126
 afraid-0.124
 evil-0.123
Token you
Token ID345
Feature activation-7.20e-03

Pos logit contributions
 collection+0.028
 wrists+0.027
 cand+0.026
 notebooks+0.026
 traded+0.026
Neg logit contributions
 reckless-0.023
 frightening-0.020
 afraid-0.020
 cal-0.020
 911-0.019
Token.
Token ID13
Feature activation+2.88e-02

Pos logit contributions
 reckless+0.013
 frightening+0.012
 cal+0.012
 afraid+0.012
Little+0.011
Neg logit contributions
 collection-0.012
 wrists-0.011
 cand-0.011
 traded-0.011
 notebooks-0.011
Token I
Token ID314
Feature activation+1.08e-02

Pos logit contributions
 collection+0.056
 cand+0.053
 wrists+0.052
 notebooks+0.051
 sandwich+0.049
Neg logit contributions
 reckless-0.045
 911-0.041
Little-0.039
 afraid-0.038
 gut-0.038
Token,"
Token ID553
Feature activation+2.22e-03

Pos logit contributions
 collection+0.077
 wrists+0.072
 cand+0.072
 mar+0.067
 traded+0.066
Neg logit contributions
 reckless-0.126
 frightening-0.117
 cal-0.113
 gut-0.113
 afraid-0.112
Token Ben
Token ID3932
Feature activation+6.59e-03

Pos logit contributions
 collection+0.004
 wrists+0.004
 cand+0.004
 extract+0.004
 traded+0.004
Neg logit contributions
 reckless-0.003
 afraid-0.003
 frightening-0.003
Run-0.003
Little-0.003
Token says
Token ID1139
Feature activation+4.72e-02

Pos logit contributions
 collection+0.010
 wrists+0.010
 cand+0.010
 traded+0.009
 notebooks+0.009
Neg logit contributions
 reckless-0.012
 frightening-0.011
 gut-0.011
Little-0.011
 cal-0.011
Token.
Token ID13
Feature activation+2.93e-02

Pos logit contributions
 collection+0.074
 wrists+0.072
 cand+0.071
 notebooks+0.067
 traded+0.066
Neg logit contributions
 reckless-0.090
 frightening-0.081
 afraid-0.080
 cal-0.080
 911-0.077
Token "
Token ID366
Feature activation+6.09e-03

Pos logit contributions
 collection+0.050
 wrists+0.048
 cand+0.048
 notebooks+0.045
 traded+0.044
Neg logit contributions
 reckless-0.052
 frightening-0.047
 afraid-0.046
 cal-0.046
 911-0.045
TokenThese
Token ID4711
Feature activation+1.59e-01

Pos logit contributions
 collection+0.013
 cand+0.012
 wrists+0.012
 mar+0.012
 notebooks+0.012
Neg logit contributions
 reckless-0.008
Little-0.007
 frightening-0.007
Down-0.007
 afraid-0.007
Token are
Token ID389
Feature activation+9.95e-02

Pos logit contributions
 wrists+0.275
 collection+0.266
uggets+0.264
 traded+0.259
 wrist+0.255
Neg logit contributions
 reckless-0.248
 frightening-0.227
 afraid-0.226
 evil-0.219
 cal-0.217
Token pretty
Token ID2495
Feature activation+7.82e-02

Pos logit contributions
 collection+0.182
 wrists+0.181
 wrist+0.180
ci+0.177
 traded+0.169
Neg logit contributions
 reckless-0.145
 frightening-0.138
 afraid-0.137
 evil-0.137
 cal-0.129
Token.
Token ID13
Feature activation+2.91e-02

Pos logit contributions
 collection+0.126
 traded+0.124
 wrists+0.121
 wrist+0.113
 notebooks+0.111
Neg logit contributions
 reckless-0.126
 frightening-0.121
 afraid-0.117
 cal-0.117
 911-0.114
Token Can
Token ID1680
Feature activation+1.05e-02

Pos logit contributions
 collection+0.053
 wrists+0.050
 cand+0.049
 notebooks+0.046
 mar+0.046
Neg logit contributions
 reckless-0.048
 911-0.044
 afraid-0.043
Little-0.042
 frightening-0.041
Token we
Token ID356
Feature activation+1.96e-02

Pos logit contributions
 collection+0.023
 wrists+0.022
 cand+0.021
 notebooks+0.021
 accessory+0.021
Neg logit contributions
 reckless-0.013
 911-0.011
 afraid-0.011
Little-0.010
 frightening-0.010
Token and
Token ID290
Feature activation-4.68e-03

Pos logit contributions
 collection+0.022
 wrists+0.021
 traded+0.021
 notebooks+0.020
 cand+0.020
Neg logit contributions
 reckless-0.020
 cal-0.018
 frightening-0.018
Little-0.018
 afraid-0.018
Token some
Token ID617
Feature activation+4.54e-02

Pos logit contributions
 reckless+0.008
 cal+0.008
 frightening+0.008
 afraid+0.008
Little+0.007
Neg logit contributions
 collection-0.008
 wrists-0.007
 traded-0.007
 cand-0.007
 notebooks-0.007
Token chocolate
Token ID11311
Feature activation+7.90e-02

Pos logit contributions
 collection+0.066
 wrists+0.065
 traded+0.061
 cand+0.059
 notebooks+0.059
Neg logit contributions
 reckless-0.087
 frightening-0.081
 cal-0.080
 afraid-0.078
Little-0.077
Token.
Token ID13
Feature activation+2.73e-02

Pos logit contributions
 traded+0.155
 notebooks+0.149
 collection+0.143
 clubs+0.142
 wrists+0.140
Neg logit contributions
 911-0.084
 worse-0.083
 though-0.080
 harder-0.079
 reckless-0.079
Token "
Token ID366
Feature activation+6.30e-03

Pos logit contributions
 collection+0.034
 wrists+0.033
 cand+0.033
 traded+0.030
 notebooks+0.030
Neg logit contributions
 reckless-0.060
 frightening-0.056
 cal-0.054
 afraid-0.054
 gut-0.053
TokenThese
Token ID4711
Feature activation+1.58e-01

Pos logit contributions
 collection+0.013
 wrists+0.012
 cand+0.011
 traded+0.011
 notebooks+0.011
Neg logit contributions
Little-0.009
 reckless-0.009
Down-0.008
Far-0.008
Run-0.008
Token are
Token ID389
Feature activation+9.69e-02

Pos logit contributions
 wrists+0.213
 collection+0.212
 traded+0.209
 notebooks+0.198
 wrist+0.192
Neg logit contributions
 reckless-0.312
 cal-0.288
 frightening-0.283
 afraid-0.281
 911-0.277
Token the
Token ID262
Feature activation+2.39e-02

Pos logit contributions
 wrists+0.170
 collection+0.170
 traded+0.169
 wrist+0.169
 notebooks+0.165
Neg logit contributions
 reckless-0.145
 cal-0.138
 frightening-0.136
 afraid-0.133
Little-0.130
Token things
Token ID1243
Feature activation-9.96e-03

Pos logit contributions
 wrists+0.028
 traded+0.027
 collection+0.026
 notebooks+0.025
 cand+0.024
Neg logit contributions
 reckless-0.052
 cal-0.049
 afraid-0.048
 frightening-0.048
 evil-0.047
Token we
Token ID356
Feature activation+3.66e-02

Pos logit contributions
 reckless+0.015
 frightening+0.013
 afraid+0.013
 cal+0.013
 911+0.013
Neg logit contributions
 wrists-0.019
 collection-0.019
 traded-0.018
 notebooks-0.018
 cand-0.018
Token need
Token ID761
Feature activation-1.16e-02

Pos logit contributions
 wrists+0.061
 collection+0.060
 cand+0.059
 notebooks+0.055
 wrist+0.055
Neg logit contributions
 reckless-0.065
 afraid-0.059
 frightening-0.058
 cal-0.055
 evil-0.054
Token.
Token ID13
Feature activation+2.74e-02

Pos logit contributions
 wrists+0.066
 collection+0.065
 cand+0.063
 notebooks+0.060
 mar+0.058
Neg logit contributions
 reckless-0.063
 frightening-0.056
 cal-0.056
 afraid-0.055
 911-0.054
Token He
Token ID679
Feature activation-1.55e-02

Pos logit contributions
 collection+0.038
 wrists+0.037
 cand+0.037
 traded+0.035
 mar+0.035
Neg logit contributions
 reckless-0.056
 frightening-0.052
 gut-0.049
 cal-0.049
 beast-0.049
Token says
Token ID1139
Feature activation+6.09e-02

Pos logit contributions
 reckless+0.027
 frightening+0.024
 cal+0.024
 afraid+0.024
 beast+0.023
Neg logit contributions
 collection-0.026
 wrists-0.026
 cand-0.025
 traded-0.024
 notebooks-0.024
Token,
Token ID11
Feature activation+2.46e-02

Pos logit contributions
 collection+0.088
 wrists+0.085
 cand+0.084
 traded+0.078
 notebooks+0.078
Neg logit contributions
 reckless-0.124
 frightening-0.114
 afraid-0.110
 cal-0.110
Little-0.109
Token "
Token ID366
Feature activation-2.48e-03

Pos logit contributions
 collection+0.032
 wrists+0.031
 cand+0.030
 coloring+0.028
 traded+0.028
Neg logit contributions
 reckless-0.053
 frightening-0.049
 unnecessary-0.047
Little-0.047
 cal-0.046
TokenThese
Token ID4711
Feature activation+1.58e-01

Pos logit contributions
 reckless+0.004
Little+0.004
Down+0.003
 cal+0.003
Run+0.003
Neg logit contributions
 collection-0.005
 wrists-0.005
 cand-0.004
 sandwich-0.004
 coloring-0.004
Token are
Token ID389
Feature activation+9.58e-02

Pos logit contributions
 wrists+0.235
 collection+0.224
 cand+0.212
 wrist+0.208
 traded+0.208
Neg logit contributions
 reckless-0.309
 911-0.281
 frightening-0.278
 afraid-0.275
 cal-0.273
Token things
Token ID1243
Feature activation-7.30e-03

Pos logit contributions
 wrists+0.169
 collection+0.162
 cand+0.157
 wrist+0.156
 notebooks+0.151
Neg logit contributions
 reckless-0.163
 frightening-0.148
 afraid-0.145
 cal-0.144
 911-0.143
Token for
Token ID329
Feature activation+1.15e-02

Pos logit contributions
 reckless+0.012
 frightening+0.010
 afraid+0.010
 cal+0.010
 911+0.010
Neg logit contributions
 wrists-0.014
 collection-0.013
 cand-0.013
 notebooks-0.012
 traded-0.012
Token the
Token ID262
Feature activation+1.01e-02

Pos logit contributions
 collection+0.021
 wrists+0.020
 cand+0.020
 traded+0.019
 notebooks+0.019
Neg logit contributions
 reckless-0.019
 frightening-0.017
 afraid-0.017
 cal-0.017
Little-0.016
Token people
Token ID661
Feature activation-5.44e-02

Pos logit contributions
 wrists+0.014
 collection+0.014
 cand+0.013
 notebooks+0.012
 traded+0.012
Neg logit contributions
 reckless-0.020
 frightening-0.019
 beast-0.019
 afraid-0.018
 cal-0.018
Token!"
Token ID2474
Feature activation-2.14e-03

Pos logit contributions
 collection+0.020
 wrists+0.020
 cand+0.020
 traded+0.018
 notebooks+0.018
Neg logit contributions
 reckless-0.019
 frightening-0.018
 cal-0.017
 afraid-0.017
Little-0.017
Token they
Token ID484
Feature activation+3.25e-02

Pos logit contributions
 reckless+0.004
 frightening+0.004
 afraid+0.004
 cal+0.004
Little+0.004
Neg logit contributions
 collection-0.003
 wrists-0.003
 cand-0.003
 notebooks-0.003
 traded-0.003
Token say
Token ID910
Feature activation+4.52e-02

Pos logit contributions
 collection+0.045
 wrists+0.044
 cand+0.043
 traded+0.040
 mar+0.039
Neg logit contributions
 reckless-0.068
 frightening-0.063
 cal-0.060
 afraid-0.060
 gut-0.060
Token.
Token ID13
Feature activation+2.54e-02

Pos logit contributions
 collection+0.072
 wrists+0.069
 cand+0.069
 notebooks+0.065
 mar+0.064
Neg logit contributions
 reckless-0.085
 frightening-0.076
 cal-0.076
 afraid-0.076
 911-0.074
Token "
Token ID366
Feature activation+3.98e-03

Pos logit contributions
 collection+0.041
 cand+0.040
 wrists+0.040
 notebooks+0.037
 traded+0.037
Neg logit contributions
 reckless-0.047
 frightening-0.043
 afraid-0.042
 cal-0.042
Little-0.041
TokenThese
Token ID4711
Feature activation+1.58e-01

Pos logit contributions
 collection+0.007
 cand+0.007
 wrists+0.007
 notebooks+0.007
 mar+0.006
Neg logit contributions
 reckless-0.007
Little-0.006
 frightening-0.006
 afraid-0.006
Down-0.006
Token are
Token ID389
Feature activation+9.51e-02

Pos logit contributions
 collection+0.225
 wrists+0.222
 traded+0.207
 cand+0.204
 notebooks+0.201
Neg logit contributions
 reckless-0.318
 frightening-0.286
 cal-0.283
 afraid-0.281
 evil-0.275
Token awesome
Token ID7427
Feature activation+6.84e-02

Pos logit contributions
 collection+0.192
 wrists+0.190
 wrist+0.182
 notebooks+0.177
 traded+0.177
Neg logit contributions
 reckless-0.139
 cal-0.126
 frightening-0.122
 afraid-0.120
 evil-0.118
Token!"
Token ID2474
Feature activation+7.30e-03

Pos logit contributions
 collection+0.122
 wrists+0.119
 cand+0.118
 traded+0.112
 notebooks+0.111
Neg logit contributions
 reckless-0.114
 frightening-0.103
 cal-0.100
 afraid-0.099
Little-0.097
Token
Token ID198
Feature activation-1.87e-02

Pos logit contributions
 collection+0.012
 wrists+0.011
 cand+0.011
 traded+0.011
 notebooks+0.011
Neg logit contributions
 reckless-0.014
 frightening-0.012
 beast-0.012
 cal-0.012
 afraid-0.012
Token
Token ID198
Feature activation-1.11e-02

Pos logit contributions
 reckless+0.040
 frightening+0.037
 beast+0.035
 unnecessary+0.035
 evil+0.035
Neg logit contributions
 wrists-0.024
 collection-0.024
 cand-0.023
 mar-0.023
 notebooks-0.022
Token said
Token ID531
Feature activation+3.07e-02

Pos logit contributions
 reckless+0.011
 afraid+0.010
 frightening+0.010
Little+0.010
 cal+0.010
Neg logit contributions
 collection-0.012
 wrists-0.011
 cand-0.011
 notebooks-0.010
 coloring-0.010
Token his
Token ID465
Feature activation+2.00e-02

Pos logit contributions
 collection+0.050
 wrists+0.049
 cand+0.049
 notebooks+0.046
 traded+0.045
Neg logit contributions
 reckless-0.056
 frightening-0.051
 afraid-0.050
 cal-0.050
 911-0.049
Token mummy
Token ID38567
Feature activation-3.65e-02

Pos logit contributions
 collection+0.028
 wrists+0.028
 cand+0.027
 traded+0.025
 notebooks+0.025
Neg logit contributions
 reckless-0.041
 frightening-0.037
 cal-0.036
 afraid-0.036
Little-0.036
Token.
Token ID13
Feature activation+1.59e-02

Pos logit contributions
 reckless+0.067
 cal+0.060
 frightening+0.060
 afraid+0.060
 911+0.057
Neg logit contributions
 collection-0.059
 wrists-0.058
 cand-0.057
 notebooks-0.055
 mar-0.054
Token "
Token ID366
Feature activation-3.67e-03

Pos logit contributions
 collection+0.029
 cand+0.027
 wrists+0.027
 notebooks+0.026
 mar+0.025
Neg logit contributions
 reckless-0.026
 frightening-0.024
Little-0.023
 afraid-0.023
 cal-0.023
TokenThese
Token ID4711
Feature activation+1.57e-01

Pos logit contributions
 reckless+0.006
 frightening+0.005
 cal+0.005
 afraid+0.005
 gut+0.005
Neg logit contributions
 collection-0.007
 wrists-0.007
 cand-0.006
 traded-0.006
 notebooks-0.006
Token small
Token ID1402
Feature activation+3.49e-02

Pos logit contributions
 collection+0.220
 wrists+0.215
 cand+0.211
 traded+0.198
 notebooks+0.197
Neg logit contributions
 reckless-0.322
 frightening-0.296
 afraid-0.287
 cal-0.287
 beast-0.282
Token,
Token ID11
Feature activation+4.10e-02

Pos logit contributions
 collection+0.051
 traded+0.050
 wrists+0.050
 cand+0.048
 notebooks+0.048
Neg logit contributions
 reckless-0.066
 frightening-0.061
 afraid-0.059
 beast-0.058
 cal-0.058
Token green
Token ID4077
Feature activation+7.38e-02

Pos logit contributions
 collection+0.070
 wrists+0.069
 cand+0.066
 traded+0.065
 notebooks+0.064
Neg logit contributions
 reckless-0.071
 frightening-0.065
 afraid-0.063
 cal-0.061
 911-0.061
Token things
Token ID1243
Feature activation+2.88e-04

Pos logit contributions
 collection+0.114
 wrists+0.113
 traded+0.111
 cand+0.110
 notebooks+0.108
Neg logit contributions
 reckless-0.132
 frightening-0.121
 beast-0.118
 afraid-0.117
 cal-0.116
Token are
Token ID389
Feature activation+7.47e-02

Pos logit contributions
 wrists+0.001
 collection+0.001
 cand+0.000
 notebooks+0.000
 wrist+0.000
Neg logit contributions
 reckless-0.000
 frightening-0.000
 afraid-0.000
 cal-0.000
 evil-0.000
Token,"
Token ID553
Feature activation-1.03e-02

Pos logit contributions
 reckless+0.013
 frightening+0.012
 afraid+0.012
 cal+0.011
Little+0.011
Neg logit contributions
 collection-0.015
 wrists-0.015
 cand-0.014
 traded-0.014
 notebooks-0.014
Token Mom
Token ID11254
Feature activation-1.05e-02

Pos logit contributions
 reckless+0.017
 cal+0.015
 afraid+0.015
 frightening+0.015
Little+0.015
Neg logit contributions
 collection-0.019
 wrists-0.018
 cand-0.017
 coloring-0.016
 notebooks-0.016
Token says
Token ID1139
Feature activation+4.18e-02

Pos logit contributions
 reckless+0.021
 frightening+0.019
 afraid+0.019
 cal+0.019
 gut+0.018
Neg logit contributions
 collection-0.015
 wrists-0.015
 cand-0.015
 traded-0.014
 notebooks-0.014
Token.
Token ID13
Feature activation+2.13e-02

Pos logit contributions
 collection+0.071
 wrists+0.068
 cand+0.066
 notebooks+0.065
 mar+0.063
Neg logit contributions
 reckless-0.075
 afraid-0.067
 cal-0.067
 frightening-0.066
 911-0.065
Token "
Token ID366
Feature activation-1.14e-03

Pos logit contributions
 collection+0.031
 wrists+0.031
 cand+0.030
 traded+0.028
 notebooks+0.028
Neg logit contributions
 reckless-0.042
 frightening-0.039
 cal-0.037
 afraid-0.037
 beast-0.037
TokenThese
Token ID4711
Feature activation+1.57e-01

Pos logit contributions
 reckless+0.002
 frightening+0.002
 cal+0.002
 afraid+0.002
 gut+0.002
Neg logit contributions
 collection-0.002
 wrists-0.002
 cand-0.002
 traded-0.002
 notebooks-0.002
Token are
Token ID389
Feature activation+9.71e-02

Pos logit contributions
 wrists+0.241
 collection+0.239
 traded+0.221
 wrist+0.217
uggets+0.215
Neg logit contributions
 reckless-0.295
 afraid-0.257
 frightening-0.257
 evil-0.256
 cal-0.253
Token your
Token ID534
Feature activation+7.60e-02

Pos logit contributions
 collection+0.193
 wrists+0.190
 wrist+0.184
 cand+0.175
 traded+0.174
Neg logit contributions
 reckless-0.144
 cal-0.127
 frightening-0.126
 afraid-0.123
 evil-0.123
Token k
Token ID479
Feature activation+5.59e-02

Pos logit contributions
 wrists+0.090
 traded+0.087
 collection+0.084
 cand+0.082
 notebooks+0.077
Neg logit contributions
 reckless-0.158
 frightening-0.148
 afraid-0.148
 evil-0.146
 gut-0.146
Tokenites
Token ID2737
Feature activation+6.34e-02

Pos logit contributions
 collection+0.109
 notebooks+0.109
 wrists+0.107
 traded+0.107
 cand+0.105
Neg logit contributions
 reckless-0.076
Little-0.059
 afraid-0.058
 gut-0.056
 beast-0.056
Token.
Token ID13
Feature activation+2.64e-02

Pos logit contributions
 wrists+0.117
 cand+0.110
 collection+0.110
 notebooks+0.110
 traded+0.106
Neg logit contributions
 reckless-0.102
 afraid-0.088
 frightening-0.083
 outage-0.083
 cal-0.083
Token them
Token ID606
Feature activation+3.29e-02

Pos logit contributions
 collection+0.018
 wrists+0.017
 cand+0.017
 traded+0.016
 notebooks+0.016
Neg logit contributions
 reckless-0.020
 frightening-0.018
 cal-0.018
Little-0.018
 911-0.018
Token.
Token ID13
Feature activation+2.51e-02

Pos logit contributions
 collection+0.057
 wrists+0.056
 cand+0.055
 traded+0.052
 notebooks+0.051
Neg logit contributions
 reckless-0.057
 frightening-0.051
 cal-0.050
 afraid-0.050
 911-0.049
Token
Token ID198
Feature activation-5.99e-03

Pos logit contributions
 collection+0.031
 wrists+0.031
 cand+0.031
 notebooks+0.029
 traded+0.029
Neg logit contributions
 reckless-0.053
 frightening-0.049
 beast-0.048
 unnecessary-0.048
 gut-0.048
Token
Token ID198
Feature activation-2.37e-03

Pos logit contributions
 reckless+0.014
 frightening+0.013
 unnecessary+0.013
 evil+0.013
 beast+0.012
Neg logit contributions
 collection-0.006
 cand-0.006
 wrists-0.006
 mar-0.006
 coloring-0.006
Token"
Token ID1
Feature activation+3.99e-02

Pos logit contributions
 reckless+0.004
 frightening+0.004
 afraid+0.003
 beast+0.003
 cal+0.003
Neg logit contributions
 collection-0.004
 wrists-0.004
 cand-0.004
 notebooks-0.004
 traded-0.004
TokenThese
Token ID4711
Feature activation+1.57e-01

Pos logit contributions
 collection+0.076
 wrists+0.072
 cand+0.072
 notebooks+0.067
 traded+0.067
Neg logit contributions
 reckless-0.065
 frightening-0.058
Little-0.056
 afraid-0.056
 cal-0.055
Token toys
Token ID14958
Feature activation+7.72e-02

Pos logit contributions
 collection+0.264
 wrists+0.263
uggets+0.249
 extract+0.247
 wrist+0.244
Neg logit contributions
 reckless-0.267
 afraid-0.236
 911-0.228
 frightening-0.225
 evil-0.220
Token are
Token ID389
Feature activation+8.59e-02

Pos logit contributions
 wrists+0.131
 collection+0.126
 notebooks+0.119
 cand+0.118
 wrist+0.116
Neg logit contributions
 reckless-0.136
 afraid-0.118
 frightening-0.118
 cal-0.116
 worse-0.112
Token not
Token ID407
Feature activation-4.76e-02

Pos logit contributions
 wrists+0.167
 collection+0.166
 wrist+0.163
 notebooks+0.152
 extract+0.152
Neg logit contributions
 reckless-0.131
 afraid-0.119
 frightening-0.119
 evil-0.113
 cal-0.109
Token safe
Token ID3338
Feature activation+2.24e-02

Pos logit contributions
 reckless+0.085
 afraid+0.076
 frightening+0.076
 evil+0.072
 911+0.071
Neg logit contributions
 wrists-0.082
 collection-0.082
 wrist-0.077
 notebooks-0.074
 extract-0.073
Token.
Token ID13
Feature activation+2.24e-02

Pos logit contributions
 collection+0.038
 wrists+0.037
 cand+0.036
 notebooks+0.035
 traded+0.034
Neg logit contributions
 reckless-0.040
 frightening-0.036
 afraid-0.035
 cal-0.034
 911-0.034
Token says
Token ID1139
Feature activation+4.26e-02

Pos logit contributions
 wrists+0.015
 collection+0.015
 cand+0.015
 traded+0.014
 notebooks+0.014
Neg logit contributions
 reckless-0.019
 frightening-0.017
 gut-0.017
Little-0.017
 cal-0.016
Token.
Token ID13
Feature activation+2.35e-02

Pos logit contributions
 collection+0.066
 cand+0.064
 wrists+0.063
 mar+0.059
 notebooks+0.059
Neg logit contributions
 reckless-0.082
 frightening-0.074
 afraid-0.073
 cal-0.072
 gut-0.071
Token\n
Token ID198
Feature activation-1.17e-02

Pos logit contributions
 collection+0.036
 wrists+0.035
 cand+0.035
 traded+0.033
 notebooks+0.032
Neg logit contributions
 reckless-0.045
 frightening-0.041
 cal-0.040
 afraid-0.040
Little-0.039
Token\n
Token ID198
Feature activation-6.71e-03

Pos logit contributions
 reckless+0.030
 frightening+0.028
 beast+0.028
 evil+0.027
 unnecessary+0.027
Neg logit contributions
 wrists-0.009
 collection-0.009
 mar-0.008
 cand-0.008
 traded-0.008
Token"
Token ID1
Feature activation+3.91e-02

Pos logit contributions
 reckless+0.012
 frightening+0.011
 beast+0.011
 afraid+0.010
 cal+0.010
Neg logit contributions
 wrists-0.011
 collection-0.011
 cand-0.010
 traded-0.010
 notebooks-0.010
TokenThese
Token ID4711
Feature activation+1.56e-01

Pos logit contributions
 collection+0.090
 cand+0.080
 wrists+0.077
 sandwich+0.077
 traded+0.076
Neg logit contributions
 reckless-0.051
Little-0.048
Down-0.047
Run-0.046
 frightening-0.043
Token are
Token ID389
Feature activation+9.44e-02

Pos logit contributions
 wrists+0.263
 collection+0.258
 traded+0.258
 wrist+0.249
uggets+0.247
Neg logit contributions
 reckless-0.243
 unnecessary-0.212
 cal-0.212
 afraid-0.205
 911-0.204
Token for
Token ID329
Feature activation+1.07e-02

Pos logit contributions
 wrists+0.161
 collection+0.161
 wrist+0.159
 traded+0.153
ci+0.147
Neg logit contributions
 reckless-0.144
 frightening-0.135
 afraid-0.134
 cal-0.134
 evil-0.133
Token you
Token ID345
Feature activation-8.20e-03

Pos logit contributions
 collection+0.016
 wrists+0.016
 traded+0.015
 notebooks+0.014
 cand+0.014
Neg logit contributions
 reckless-0.020
 frightening-0.018
 evil-0.018
 cal-0.017
 911-0.017
Token,
Token ID11
Feature activation+3.01e-02

Pos logit contributions
 reckless+0.015
 frightening+0.014
 afraid+0.013
 cal+0.013
Little+0.013
Neg logit contributions
 collection-0.013
 wrists-0.013
 cand-0.013
 traded-0.012
 notebooks-0.012
Token kids
Token ID3988
Feature activation+8.46e-03

Pos logit contributions
 collection+0.046
 wrists+0.044
 cand+0.043
 traded+0.041
 notebooks+0.041
Neg logit contributions
 reckless-0.058
 frightening-0.053
 afraid-0.052
 cal-0.052
Little-0.051
Token.
Token ID13
Feature activation+2.55e-02

Pos logit contributions
 wrists+0.057
 collection+0.057
 cand+0.054
 notebooks+0.053
 mar+0.053
Neg logit contributions
 reckless-0.039
 clearly-0.036
 cal-0.035
 afraid-0.035
 louder-0.035
Token She
Token ID1375
Feature activation-1.03e-02

Pos logit contributions
 wrists+0.034
 collection+0.033
 cand+0.033
 notebooks+0.030
 traded+0.030
Neg logit contributions
 reckless-0.055
 frightening-0.050
 beast-0.049
 gut-0.048
 cal-0.048
Token says
Token ID1139
Feature activation+6.34e-02

Pos logit contributions
 reckless+0.017
 cal+0.016
 frightening+0.015
 afraid+0.015
 evil+0.015
Neg logit contributions
 collection-0.018
 wrists-0.017
 cand-0.017
 traded-0.016
 notebooks-0.016
Token,
Token ID11
Feature activation+2.49e-02

Pos logit contributions
 collection+0.096
 wrists+0.094
 cand+0.090
 traded+0.087
 notebooks+0.085
Neg logit contributions
 reckless-0.122
 frightening-0.111
 cal-0.110
 afraid-0.109
 911-0.107
Token "
Token ID366
Feature activation-1.88e-03

Pos logit contributions
 collection+0.036
 wrists+0.035
 cand+0.035
 traded+0.032
 notebooks+0.032
Neg logit contributions
 reckless-0.050
 frightening-0.046
 cal-0.044
 afraid-0.044
 beast-0.044
TokenThese
Token ID4711
Feature activation+1.56e-01

Pos logit contributions
Little+0.003
 reckless+0.002
Sir+0.002
Run+0.002
Down+0.002
Neg logit contributions
 collection-0.004
 sandwich-0.004
 wrists-0.004
 coloring-0.003
 mar-0.003
Token are
Token ID389
Feature activation+9.42e-02

Pos logit contributions
 wrists+0.275
 collection+0.263
uggets+0.260
 wrist+0.257
 traded+0.252
Neg logit contributions
 reckless-0.239
 911-0.221
 afraid-0.208
 cal-0.205
 frightening-0.201
Token for
Token ID329
Feature activation+1.36e-02

Pos logit contributions
 wrists+0.181
 wrist+0.180
 collection+0.179
ci+0.170
 extract+0.167
Neg logit contributions
 reckless-0.131
 cal-0.117
 evil-0.117
 frightening-0.115
 afraid-0.115
Token our
Token ID674
Feature activation+9.99e-02

Pos logit contributions
 collection+0.025
 wrists+0.025
 cand+0.023
 traded+0.023
 notebooks+0.023
Neg logit contributions
 reckless-0.021
 frightening-0.018
 911-0.018
 afraid-0.018
 evil-0.018
Token son
Token ID3367
Feature activation-5.69e-03

Pos logit contributions
 wrists+0.113
 traded+0.106
 cand+0.101
 collection+0.101
 extract+0.096
Neg logit contributions
 reckless-0.207
 911-0.200
 frightening-0.199
 afraid-0.195
 gut-0.194
Token.
Token ID13
Feature activation+3.04e-02

Pos logit contributions
 reckless+0.009
 afraid+0.008
 cal+0.008
 911+0.007
 frightening+0.007
Neg logit contributions
 wrists-0.010
 collection-0.009
 cand-0.009
 notebooks-0.009
 extract-0.009
Token said
Token ID531
Feature activation+4.28e-02

Pos logit contributions
 collection+0.031
 wrists+0.030
 cand+0.029
 notebooks+0.028
 extract+0.028
Neg logit contributions
 reckless-0.028
 afraid-0.026
 frightening-0.025
 cal-0.024
Little-0.024
Token.
Token ID13
Feature activation+2.65e-02

Pos logit contributions
 collection+0.073
 cand+0.071
 wrists+0.071
 notebooks+0.068
 mar+0.067
Neg logit contributions
 reckless-0.075
 afraid-0.068
 frightening-0.066
 cal-0.066
 louder-0.065
Token\n
Token ID198
Feature activation-1.15e-02

Pos logit contributions
 collection+0.046
 cand+0.043
 wrists+0.042
 notebooks+0.041
 mar+0.040
Neg logit contributions
 reckless-0.047
 afraid-0.042
 frightening-0.042
 cal-0.042
 911-0.042
Token\n
Token ID198
Feature activation-7.53e-03

Pos logit contributions
 reckless+0.029
 frightening+0.027
 beast+0.027
 evil+0.027
 unnecessary+0.026
Neg logit contributions
 wrists-0.010
 traded-0.009
 mar-0.009
 cand-0.009
 collection-0.009
Token"
Token ID1
Feature activation+4.06e-02

Pos logit contributions
 reckless+0.013
 frightening+0.012
 beast+0.012
 afraid+0.011
 gut+0.011
Neg logit contributions
 wrists-0.013
 collection-0.012
 cand-0.012
 traded-0.012
 mar-0.012
TokenThese
Token ID4711
Feature activation+1.56e-01

Pos logit contributions
 collection+0.089
 sandwich+0.080
 cand+0.080
 wrists+0.078
 notebooks+0.076
Neg logit contributions
 reckless-0.056
Little-0.052
Down-0.052
 frightening-0.049
Run-0.049
Token are
Token ID389
Feature activation+9.48e-02

Pos logit contributions
 wrists+0.243
 collection+0.236
 traded+0.226
uggets+0.218
 notebooks+0.217
Neg logit contributions
 reckless-0.286
 afraid-0.254
 frightening-0.253
 911-0.251
 evil-0.248
Token treasures
Token ID29561
Feature activation+1.20e-02

Pos logit contributions
 wrists+0.179
 collection+0.174
 wrist+0.170
 traded+0.163
 notebooks+0.162
Neg logit contributions
 reckless-0.141
 frightening-0.131
 cal-0.129
 afraid-0.127
 evil-0.126
Token from
Token ID422
Feature activation+1.39e-02

Pos logit contributions
 wrists+0.024
 collection+0.022
 traded+0.022
 notebooks+0.021
 cand+0.021
Neg logit contributions
 reckless-0.018
 frightening-0.016
 afraid-0.015
 cal-0.015
 outage-0.014
Token the
Token ID262
Feature activation+1.66e-02

Pos logit contributions
 collection+0.025
 wrists+0.024
 cand+0.024
 traded+0.023
 notebooks+0.023
Neg logit contributions
 reckless-0.023
 frightening-0.021
 afraid-0.020
 cal-0.020
 911-0.019
Token old
Token ID1468
Feature activation-1.03e-02

Pos logit contributions
 wrists+0.025
 collection+0.025
 cand+0.024
 traded+0.024
 notebooks+0.023
Neg logit contributions
 reckless-0.031
 frightening-0.029
 beast-0.028
 afraid-0.028
 cal-0.028
Token!"
Token ID2474
Feature activation-4.78e-03

Pos logit contributions
 reckless+0.124
 frightening+0.113
 cal+0.109
 afraid+0.109
 gut+0.107
Neg logit contributions
 collection-0.111
 wrists-0.108
 cand-0.107
 traded-0.100
 notebooks-0.099
Token Sam
Token ID3409
Feature activation-2.44e-02

Pos logit contributions
 reckless+0.009
 frightening+0.008
 afraid+0.008
 cal+0.008
Little+0.008
Neg logit contributions
 collection-0.007
 wrists-0.007
 cand-0.007
 notebooks-0.007
 traded-0.007
Token said
Token ID531
Feature activation+3.03e-02

Pos logit contributions
 reckless+0.048
 frightening+0.044
 cal+0.042
 afraid+0.042
 gut+0.042
Neg logit contributions
 collection-0.036
 wrists-0.035
 cand-0.035
 traded-0.033
 notebooks-0.032
Token.
Token ID13
Feature activation+1.89e-02

Pos logit contributions
 collection+0.047
 wrists+0.046
 cand+0.045
 notebooks+0.043
 traded+0.042
Neg logit contributions
 reckless-0.058
 frightening-0.053
 afraid-0.052
 cal-0.052
 gut-0.050
Token "
Token ID366
Feature activation+5.91e-04

Pos logit contributions
 collection+0.030
 wrists+0.029
 cand+0.029
 traded+0.027
 mar+0.027
Neg logit contributions
 reckless-0.036
 frightening-0.032
 cal-0.031
 afraid-0.031
Little-0.031
TokenThese
Token ID4711
Feature activation+1.56e-01

Pos logit contributions
 wrists+0.001
 cand+0.001
 traded+0.001
 collection+0.001
 mar+0.001
Neg logit contributions
 reckless-0.001
 frightening-0.001
 cal-0.001
 gut-0.001
 beast-0.001
Token are
Token ID389
Feature activation+9.64e-02

Pos logit contributions
 collection+0.151
 cand+0.135
 wrists+0.131
 mar+0.122
 notebooks+0.120
Neg logit contributions
 reckless-0.391
 frightening-0.375
 cal-0.365
Little-0.364
 gut-0.363
Token my
Token ID616
Feature activation+6.89e-02

Pos logit contributions
 collection+0.170
 wrists+0.166
 cand+0.164
 notebooks+0.155
 traded+0.155
Neg logit contributions
 reckless-0.165
 frightening-0.149
 afraid-0.144
 cal-0.143
 911-0.139
Token bricks
Token ID28902
Feature activation-1.09e-03

Pos logit contributions
 collection+0.080
 wrists+0.076
 cand+0.075
 notebooks+0.068
 mar+0.068
Neg logit contributions
 reckless-0.159
 frightening-0.148
 cal-0.144
 afraid-0.143
Little-0.143
Token.
Token ID13
Feature activation+2.41e-02

Pos logit contributions
 reckless+0.002
 frightening+0.002
 afraid+0.002
 cal+0.002
 beast+0.002
Neg logit contributions
 wrists-0.002
 collection-0.002
 cand-0.002
 notebooks-0.002
 traded-0.002
Token I
Token ID314
Feature activation+7.83e-03

Pos logit contributions
 collection+0.043
 cand+0.040
 wrists+0.039
 notebooks+0.038
 sandwich+0.037
Neg logit contributions
 reckless-0.043
 911-0.038
 afraid-0.037
Little-0.036
 cal-0.035
Token the
Token ID262
Feature activation-2.18e-02

Pos logit contributions
 reckless+0.005
 frightening+0.005
 cal+0.005
 gut+0.005
 beast+0.004
Neg logit contributions
 collection-0.003
 wrists-0.003
 cand-0.003
 mar-0.003
 traded-0.003
Token man
Token ID582
Feature activation-5.46e-02

Pos logit contributions
 reckless+0.039
 frightening+0.035
 cal+0.034
 afraid+0.034
Little+0.034
Neg logit contributions
 collection-0.037
 wrists-0.036
 cand-0.036
 notebooks-0.033
 mar-0.033
Token says
Token ID1139
Feature activation+3.93e-02

Pos logit contributions
 reckless+0.102
 frightening+0.093
 gut+0.092
Little+0.091
 unnecessary+0.088
Neg logit contributions
 wrists-0.083
 collection-0.082
 cand-0.082
 traded-0.078
 mar-0.076
Token.
Token ID13
Feature activation+2.17e-02

Pos logit contributions
 collection+0.048
 wrists+0.047
 cand+0.047
 traded+0.043
 coloring+0.042
Neg logit contributions
 reckless-0.086
 frightening-0.080
Little-0.079
 beast-0.077
 cal-0.077
Token "
Token ID366
Feature activation+3.17e-05

Pos logit contributions
 collection+0.028
 wrists+0.027
 cand+0.027
 mar+0.026
 traded+0.026
Neg logit contributions
 reckless-0.045
 unnecessary-0.042
 frightening-0.042
Little-0.041
 gut-0.041
TokenThese
Token ID4711
Feature activation+1.56e-01

Pos logit contributions
 wrists+0.000
 cand+0.000
 traded+0.000
 mar+0.000
 collection+0.000
Neg logit contributions
 reckless-0.000
 frightening-0.000
 gut-0.000
 cal-0.000
 beast-0.000
Token are
Token ID389
Feature activation+9.50e-02

Pos logit contributions
 wrists+0.217
 collection+0.211
 traded+0.195
 cand+0.195
 wrist+0.191
Neg logit contributions
 reckless-0.320
 frightening-0.289
 afraid-0.287
 cal-0.284
 911-0.280
Token my
Token ID616
Feature activation+6.95e-02

Pos logit contributions
 collection+0.185
 wrists+0.184
 wrist+0.177
 cand+0.171
 notebooks+0.170
Neg logit contributions
 reckless-0.153
 frightening-0.135
 afraid-0.133
 evil-0.129
 cal-0.128
Token fruits
Token ID15921
Feature activation+2.52e-02

Pos logit contributions
 wrists+0.088
 collection+0.083
 traded+0.082
 cand+0.081
 notebooks+0.076
Neg logit contributions
 reckless-0.147
 frightening-0.137
 afraid-0.136
 cal-0.132
 gut-0.132
Token.
Token ID13
Feature activation+2.86e-02

Pos logit contributions
 wrists+0.044
 collection+0.043
 cand+0.041
 notebooks+0.041
 traded+0.039
Neg logit contributions
 reckless-0.045
 afraid-0.039
 frightening-0.038
 cal-0.037
 beast-0.037
Token You
Token ID921
Feature activation+3.26e-03

Pos logit contributions
 collection+0.050
 wrists+0.047
 coloring+0.043
 notebooks+0.043
 mar+0.043
Neg logit contributions
 reckless-0.051
 911-0.048
 afraid-0.044
 cal-0.042
Young-0.042
Token!"
Token ID2474
Feature activation-1.50e-03

Pos logit contributions
 reckless+0.059
 frightening+0.054
 gut+0.052
 unnecessary+0.052
 cal+0.051
Neg logit contributions
 collection-0.044
 cand-0.041
 sandwich-0.039
 wrists-0.039
 coloring-0.038
Token they
Token ID484
Feature activation+3.10e-02

Pos logit contributions
 reckless+0.003
 frightening+0.003
 afraid+0.003
 cal+0.003
Little+0.003
Neg logit contributions
 collection-0.002
 wrists-0.002
 cand-0.002
 traded-0.002
 notebooks-0.002
Token said
Token ID531
Feature activation+3.56e-02

Pos logit contributions
 collection+0.050
 wrists+0.048
 cand+0.048
 notebooks+0.045
 traded+0.045
Neg logit contributions
 reckless-0.057
 frightening-0.052
 afraid-0.051
 cal-0.050
Little-0.049
Token.
Token ID13
Feature activation+2.69e-02

Pos logit contributions
 collection+0.064
 cand+0.061
 wrists+0.060
 notebooks+0.059
 mar+0.057
Neg logit contributions
 reckless-0.061
 afraid-0.055
 louder-0.054
 cal-0.053
 frightening-0.053
Token "
Token ID366
Feature activation+4.91e-03

Pos logit contributions
 collection+0.042
 wrists+0.041
 cand+0.040
 traded+0.038
 notebooks+0.037
Neg logit contributions
 reckless-0.051
 frightening-0.047
 afraid-0.046
 cal-0.045
Little-0.045
TokenThese
Token ID4711
Feature activation+1.56e-01

Pos logit contributions
 collection+0.010
 cand+0.009
 wrists+0.009
 notebooks+0.008
 traded+0.008
Neg logit contributions
 reckless-0.008
Little-0.007
 frightening-0.007
 afraid-0.007
Down-0.007
Token are
Token ID389
Feature activation+9.73e-02

Pos logit contributions
 wrists+0.239
 collection+0.232
 cand+0.213
 wrist+0.211
 coloring+0.210
Neg logit contributions
 reckless-0.300
 frightening-0.269
 afraid-0.267
 911-0.259
 evil-0.258
Token the
Token ID262
Feature activation+2.27e-02

Pos logit contributions
 wrists+0.197
 collection+0.194
 wrist+0.191
 extract+0.182
 notebooks+0.181
Neg logit contributions
 reckless-0.126
 frightening-0.114
 afraid-0.112
 cal-0.112
 evil-0.112
Token best
Token ID1266
Feature activation+7.80e-02

Pos logit contributions
 wrists+0.035
 collection+0.033
 cand+0.032
 wrist+0.031
 extract+0.031
Neg logit contributions
 reckless-0.043
 frightening-0.040
 afraid-0.039
 evil-0.039
 cal-0.038
Token vans
Token ID47930
Feature activation-1.45e-02

Pos logit contributions
 wrists+0.134
 traded+0.116
 cand+0.116
 collection+0.115
 notebooks+0.115
Neg logit contributions
 reckless-0.127
 911-0.124
 frightening-0.122
 afraid-0.119
 lightning-0.119
Token ever
Token ID1683
Feature activation+1.92e-03

Pos logit contributions
 reckless+0.023
 afraid+0.020
 frightening+0.019
 cal+0.018
 outage+0.018
Neg logit contributions
 wrists-0.028
 collection-0.026
 notebooks-0.025
 cand-0.025
 wrist-0.024
Token kind
Token ID1611
Feature activation+1.19e-02

Pos logit contributions
 collection+0.019
 wrists+0.018
 cand+0.018
 notebooks+0.017
 traded+0.017
Neg logit contributions
 reckless-0.013
 frightening-0.011
 afraid-0.011
 cal-0.011
 evil-0.010
Token.
Token ID13
Feature activation+2.09e-02

Pos logit contributions
 collection+0.020
 wrists+0.020
 cand+0.019
 traded+0.018
 notebooks+0.018
Neg logit contributions
 reckless-0.022
 frightening-0.019
 afraid-0.019
 cal-0.019
 gut-0.018
Token\n
Token ID198
Feature activation-7.36e-03

Pos logit contributions
 collection+0.024
 traded+0.024
 cand+0.024
 mar+0.023
 notebooks+0.023
Neg logit contributions
 reckless-0.045
 frightening-0.042
 unnecessary-0.041
 beast-0.041
 gut-0.040
Token\n
Token ID198
Feature activation-3.29e-03

Pos logit contributions
 reckless+0.019
 unnecessary+0.017
 frightening+0.017
 evil+0.017
rified+0.017
Neg logit contributions
 mar-0.006
 collection-0.006
 traded-0.006
 cand-0.006
 notebooks-0.005
Token"
Token ID1
Feature activation+3.88e-02

Pos logit contributions
 reckless+0.006
 frightening+0.006
 unnecessary+0.005
 beast+0.005
 gut+0.005
Neg logit contributions
 collection-0.005
 traded-0.005
 cand-0.005
 mar-0.005
 notebooks-0.005
TokenThese
Token ID4711
Feature activation+1.56e-01

Pos logit contributions
 collection+0.082
 wrists+0.078
 cand+0.077
 notebooks+0.073
 traded+0.073
Neg logit contributions
 reckless-0.056
 frightening-0.049
Little-0.048
 cal-0.048
 afraid-0.047
Token are
Token ID389
Feature activation+9.37e-02

Pos logit contributions
 wrists+0.246
 collection+0.237
 traded+0.230
 wrist+0.224
uggets+0.222
Neg logit contributions
 reckless-0.283
 afraid-0.251
 frightening-0.248
 911-0.244
 evil-0.242
Token my
Token ID616
Feature activation+6.84e-02

Pos logit contributions
 collection+0.177
 wrists+0.176
 wrist+0.167
 cand+0.163
 notebooks+0.161
Neg logit contributions
 reckless-0.152
 frightening-0.136
 afraid-0.135
 evil-0.131
 cal-0.131
Token books
Token ID3835
Feature activation+4.40e-02

Pos logit contributions
 wrists+0.086
 traded+0.084
 collection+0.077
 cand+0.076
 extract+0.075
Neg logit contributions
 reckless-0.140
 frightening-0.133
 afraid-0.132
 gut-0.129
 911-0.128
Token.
Token ID13
Feature activation+2.88e-02

Pos logit contributions
 wrists+0.080
 collection+0.078
 cand+0.074
 notebooks+0.073
 mar+0.071
Neg logit contributions
 reckless-0.074
 frightening-0.065
 afraid-0.064
 cal-0.064
Little-0.060
Token I
Token ID314
Feature activation+7.70e-03

Pos logit contributions
 collection+0.048
 wrists+0.045
 cand+0.043
 notebooks+0.041
 mar+0.041
Neg logit contributions
 reckless-0.054
 911-0.049
 afraid-0.047
 cal-0.047
Little-0.046
Token,"
Token ID553
Feature activation-5.22e-03

Pos logit contributions
 wrists+0.085
 collection+0.085
 cand+0.084
 notebooks+0.082
 mar+0.078
Neg logit contributions
 reckless-0.046
 afraid-0.039
 cal-0.037
 frightening-0.036
Little-0.033
Token grandma
Token ID49890
Feature activation-2.91e-02

Pos logit contributions
 reckless+0.006
 afraid+0.006
 louder+0.006
Little+0.006
Run+0.006
Neg logit contributions
 collection-0.012
 extract-0.010
 cand-0.010
 wrists-0.010
 coloring-0.010
Token said
Token ID531
Feature activation+3.40e-02

Pos logit contributions
 reckless+0.050
 cal+0.046
 afraid+0.045
 frightening+0.045
 911+0.044
Neg logit contributions
 collection-0.051
 wrists-0.049
 cand-0.048
 notebooks-0.045
 mar-0.045
Token.
Token ID13
Feature activation+2.15e-02

Pos logit contributions
 collection+0.060
 wrists+0.057
 cand+0.056
 notebooks+0.055
 mar+0.053
Neg logit contributions
 reckless-0.059
 afraid-0.054
 frightening-0.052
 cal-0.051
 911-0.051
Token "
Token ID366
Feature activation+1.19e-04

Pos logit contributions
 wrists+0.028
 collection+0.028
 cand+0.028
 traded+0.026
 coloring+0.026
Neg logit contributions
 reckless-0.045
 frightening-0.042
 beast-0.041
 gut-0.040
 unnecessary-0.040
TokenThese
Token ID4711
Feature activation+1.56e-01

Pos logit contributions
 collection+0.000
 wrists+0.000
 cand+0.000
 notebooks+0.000
 traded+0.000
Neg logit contributions
 reckless-0.000
 frightening-0.000
 afraid-0.000
 cal-0.000
Little-0.000
Token are
Token ID389
Feature activation+9.58e-02

Pos logit contributions
 collection+0.259
uggets+0.258
 wrists+0.256
 extract+0.245
 crust+0.239
Neg logit contributions
 reckless-0.260
 afraid-0.231
 911-0.222
 evil-0.220
 frightening-0.219
Token for
Token ID329
Feature activation+1.34e-02

Pos logit contributions
 collection+0.194
 wrists+0.189
 wrist+0.187
 notebooks+0.177
 extract+0.176
Neg logit contributions
 reckless-0.136
 afraid-0.118
 frightening-0.117
 evil-0.113
 cal-0.112
Token you
Token ID345
Feature activation-7.75e-03

Pos logit contributions
 collection+0.025
 wrists+0.023
 cand+0.022
 notebooks+0.022
 traded+0.022
Neg logit contributions
 reckless-0.021
 frightening-0.019
 afraid-0.018
 evil-0.018
 911-0.018
Token to
Token ID284
Feature activation+8.57e-03

Pos logit contributions
 reckless+0.014
 frightening+0.013
 afraid+0.013
 cal+0.012
 911+0.012
Neg logit contributions
 collection-0.012
 wrists-0.012
 cand-0.012
 notebooks-0.011
 traded-0.011
Token keep
Token ID1394
Feature activation-6.30e-03

Pos logit contributions
 wrists+0.012
 collection+0.011
 cand+0.011
 notebooks+0.011
 extract+0.010
Neg logit contributions
 reckless-0.017
 frightening-0.016
 cal-0.016
 evil-0.015
 afraid-0.015
Token.
Token ID13
Feature activation+2.44e-02

Pos logit contributions
 reckless+0.208
 frightening+0.187
 afraid+0.185
 cal+0.180
 911+0.175
Neg logit contributions
 collection-0.187
 wrists-0.185
 cand-0.180
 notebooks-0.171
 traded-0.170
Token He
Token ID679
Feature activation-1.65e-02

Pos logit contributions
 collection+0.038
 wrists+0.037
 cand+0.037
 traded+0.035
 notebooks+0.034
Neg logit contributions
 reckless-0.046
 frightening-0.042
 cal-0.041
 afraid-0.040
 gut-0.040
Token says
Token ID1139
Feature activation+6.06e-02

Pos logit contributions
 reckless+0.028
 cal+0.025
 frightening+0.025
 afraid+0.025
 evil+0.024
Neg logit contributions
 collection-0.028
 wrists-0.028
 cand-0.027
 notebooks-0.026
 mar-0.025
Token,
Token ID11
Feature activation+2.49e-02

Pos logit contributions
 collection+0.091
 wrists+0.089
 cand+0.088
 traded+0.082
 notebooks+0.082
Neg logit contributions
 reckless-0.118
 frightening-0.108
 cal-0.105
 afraid-0.105
Little-0.103
Token "
Token ID366
Feature activation-3.49e-03

Pos logit contributions
 collection+0.037
 wrists+0.036
 cand+0.035
 traded+0.033
 notebooks+0.033
Neg logit contributions
 reckless-0.049
 frightening-0.045
 cal-0.044
 afraid-0.043
Little-0.043
TokenThese
Token ID4711
Feature activation+1.55e-01

Pos logit contributions
 reckless+0.006
 frightening+0.005
 cal+0.005
 afraid+0.005
Little+0.005
Neg logit contributions
 collection-0.007
 wrists-0.006
 cand-0.006
 notebooks-0.006
 traded-0.006
Token are
Token ID389
Feature activation+9.10e-02

Pos logit contributions
 wrists+0.257
 collection+0.249
uggets+0.239
 notebooks+0.239
 traded+0.238
Neg logit contributions
 reckless-0.267
 cal-0.234
 afraid-0.231
 911-0.229
 frightening-0.227
Token mine
Token ID6164
Feature activation+7.52e-02

Pos logit contributions
 collection+0.192
 wrists+0.190
 wrist+0.184
 extract+0.176
 cand+0.175
Neg logit contributions
 reckless-0.128
 evil-0.108
 frightening-0.108
 afraid-0.107
 cal-0.107
Token!
Token ID0
Feature activation+7.57e-03

Pos logit contributions
 collection+0.120
 wrists+0.118
 cand+0.117
 notebooks+0.110
 traded+0.110
Neg logit contributions
 reckless-0.139
 frightening-0.126
 afraid-0.122
 cal-0.121
 gut-0.119
Token Go
Token ID1514
Feature activation+6.37e-04

Pos logit contributions
 collection+0.016
 wrists+0.015
 cand+0.015
 notebooks+0.014
 mar+0.014
Neg logit contributions
 reckless-0.010
 911-0.009
 afraid-0.008
Little-0.008
 cal-0.008
Token away
Token ID1497
Feature activation+1.34e-02

Pos logit contributions
 collection+0.001
 wrists+0.001
 cand+0.001
 notebooks+0.001
 mar+0.001
Neg logit contributions
 reckless-0.001
 cal-0.001
 afraid-0.001
 frightening-0.001
 911-0.001
Token was
Token ID373
Feature activation+2.30e-03

Pos logit contributions
 reckless+0.047
 frightening+0.043
 afraid+0.043
 cal+0.042
 gut+0.041
Neg logit contributions
 collection-0.035
 wrists-0.035
 cand-0.035
 notebooks-0.032
 mar-0.032
Token very
Token ID845
Feature activation+2.42e-04

Pos logit contributions
 collection+0.004
 wrists+0.004
 cand+0.004
 traded+0.003
 notebooks+0.003
Neg logit contributions
 reckless-0.004
 frightening-0.004
 cal-0.004
Little-0.004
 afraid-0.004
Token careful
Token ID8161
Feature activation-1.35e-02

Pos logit contributions
 collection+0.000
 wrists+0.000
 traded+0.000
 cand+0.000
 mar+0.000
Neg logit contributions
 reckless-0.000
 beast-0.000
 911-0.000
Little-0.000
 frightening-0.000
Token to
Token ID284
Feature activation-2.52e-02

Pos logit contributions
 reckless+0.025
 frightening+0.023
 cal+0.022
Little+0.022
 beast+0.022
Neg logit contributions
 collection-0.022
 wrists-0.021
 cand-0.020
 coloring-0.019
 notebooks-0.019
Token prevent
Token ID2948
Feature activation-1.09e-01

Pos logit contributions
 reckless+0.051
 frightening+0.047
 cal+0.045
 afraid+0.045
Little+0.045
Neg logit contributions
 collection-0.037
 wrists-0.035
 cand-0.034
 traded-0.032
 mar-0.032
Token anyone
Token ID2687
Feature activation-1.84e-01

Pos logit contributions
 reckless+0.190
 frightening+0.169
Little+0.166
 cal+0.165
 beast+0.162
Neg logit contributions
 wrists-0.181
 collection-0.181
 cand-0.179
 mar-0.165
 notebooks-0.164
Token from
Token ID422
Feature activation-5.25e-02

Pos logit contributions
 reckless+0.317
 frightening+0.288
Little+0.287
 gut+0.284
 cal+0.283
Neg logit contributions
 collection-0.302
 cand-0.295
 wrists-0.289
 mar-0.274
 coloring-0.272
Token finding
Token ID4917
Feature activation+1.54e-03

Pos logit contributions
 reckless+0.103
 frightening+0.093
 cal+0.091
Little+0.091
 gut+0.089
Neg logit contributions
 collection-0.079
 wrists-0.077
 cand-0.075
 mar-0.070
 notebooks-0.069
Token out
Token ID503
Feature activation-3.07e-02

Pos logit contributions
 collection+0.003
 cand+0.003
 wrists+0.003
 mar+0.002
 notebooks+0.002
Neg logit contributions
 reckless-0.003
 frightening-0.002
Little-0.002
 afraid-0.002
 cal-0.002
Token.
Token ID13
Feature activation-7.04e-03

Pos logit contributions
 reckless+0.059
Little+0.054
 frightening+0.054
Run+0.053
 cal+0.052
Neg logit contributions
 collection-0.045
 cand-0.042
 wrists-0.041
 mar-0.039
 coloring-0.039
Token\n
Token ID198
Feature activation-3.08e-02

Pos logit contributions
 reckless+0.012
 frightening+0.011
Little+0.011
 cal+0.011
 afraid+0.011
Neg logit contributions
 collection-0.012
 cand-0.011
 wrists-0.011
 notebooks-0.011
 traded-0.011
Token grass
Token ID8701
Feature activation+2.31e-02

Pos logit contributions
 reckless+0.038
 frightening+0.034
 cal+0.033
 afraid+0.033
 911+0.033
Neg logit contributions
 collection-0.028
 wrists-0.027
 cand-0.027
 mar-0.025
 traded-0.024
Token around
Token ID1088
Feature activation-3.30e-02

Pos logit contributions
 wrists+0.033
 collection+0.033
 cand+0.032
 notebooks+0.030
 traded+0.029
Neg logit contributions
 reckless-0.046
 frightening-0.043
 afraid-0.042
 beast-0.041
 cal-0.041
Token the
Token ID262
Feature activation-2.08e-02

Pos logit contributions
 reckless+0.056
 frightening+0.050
 afraid+0.049
 cal+0.049
Little+0.048
Neg logit contributions
 collection-0.058
 wrists-0.057
 cand-0.056
 notebooks-0.053
 traded-0.053
Token hill
Token ID12788
Feature activation-1.48e-02

Pos logit contributions
 reckless+0.047
 frightening+0.043
 afraid+0.042
 outage+0.042
 911+0.042
Neg logit contributions
 collection-0.025
 wrists-0.024
 cand-0.023
 mar-0.022
 coloring-0.021
Token was
Token ID373
Feature activation-1.64e-02

Pos logit contributions
 reckless+0.027
 frightening+0.025
 afraid+0.025
 cal+0.024
 beast+0.024
Neg logit contributions
 wrists-0.024
 collection-0.023
 cand-0.023
 notebooks-0.022
 mar-0.021
Token wit
Token ID20868
Feature activation-1.83e-01

Pos logit contributions
 reckless+0.030
 frightening+0.027
Little+0.027
 cal+0.026
 gut+0.026
Neg logit contributions
 collection-0.026
 wrists-0.025
 cand-0.025
 traded-0.024
 mar-0.023
Tokenhered
Token ID6083
Feature activation+2.88e-02

Pos logit contributions
 reckless+0.338
 gut+0.308
 frightening+0.306
 afraid+0.297
 911+0.294
Neg logit contributions
 traded-0.256
 collection-0.250
 cand-0.246
 mar-0.242
 coloring-0.238
Token and
Token ID290
Feature activation-3.78e-02

Pos logit contributions
 collection+0.043
 wrists+0.041
 cand+0.041
 traded+0.039
 mar+0.039
Neg logit contributions
 reckless-0.056
 frightening-0.051
 cal-0.050
Little-0.049
 afraid-0.049
Token yellow
Token ID7872
Feature activation+4.68e-02

Pos logit contributions
 reckless+0.072
Little+0.069
 gut+0.066
 unnecessary+0.066
 frightening+0.065
Neg logit contributions
 collection-0.053
 traded-0.052
 cand-0.051
 wrists-0.049
 mar-0.049
Token.
Token ID13
Feature activation+2.26e-03

Pos logit contributions
 collection+0.075
 wrists+0.074
 cand+0.071
 notebooks+0.069
 traded+0.069
Neg logit contributions
 reckless-0.087
 frightening-0.079
 cal-0.076
 afraid-0.076
 beast-0.075
Token He
Token ID679
Feature activation-2.44e-02

Pos logit contributions
 cand+0.003
 collection+0.003
 wrists+0.003
 traded+0.003
 mar+0.003
Neg logit contributions
 reckless-0.005
 frightening-0.004
 cal-0.004
 unnecessary-0.004
 nearer-0.004
Token,
Token ID11
Feature activation+7.36e-03

Pos logit contributions
 collection+0.058
 wrists+0.055
 cand+0.055
 traded+0.051
 mar+0.050
Neg logit contributions
 reckless-0.094
 frightening-0.087
 cal-0.084
 afraid-0.084
Little-0.084
Token "
Token ID366
Feature activation-1.69e-02

Pos logit contributions
 collection+0.010
 wrists+0.010
 cand+0.009
 coloring+0.009
 traded+0.009
Neg logit contributions
 reckless-0.016
 frightening-0.014
 cal-0.014
 beast-0.014
Little-0.014
TokenI
Token ID40
Feature activation-1.85e-03

Pos logit contributions
 reckless+0.030
 gut+0.026
 frightening+0.026
 afraid+0.026
 911+0.026
Neg logit contributions
 wrists-0.028
 cand-0.027
 collection-0.027
 traded-0.027
 notebooks-0.026
Token'll
Token ID1183
Feature activation-1.91e-02

Pos logit contributions
 reckless+0.004
 911+0.003
 beast+0.003
 gut+0.003
 frightening+0.003
Neg logit contributions
 collection-0.003
 traded-0.002
 dress-0.002
 coloring-0.002
 mar-0.002
Token prevent
Token ID2948
Feature activation-1.06e-01

Pos logit contributions
 reckless+0.040
 frightening+0.037
 beast+0.035
 cal+0.035
 911+0.035
Neg logit contributions
 collection-0.026
 wrists-0.025
 cand-0.024
 traded-0.023
 coloring-0.022
Token anyone
Token ID2687
Feature activation-1.83e-01

Pos logit contributions
 reckless+0.186
 frightening+0.167
 gut+0.164
Little+0.161
 cal+0.161
Neg logit contributions
 wrists-0.175
 cand-0.175
 collection-0.173
 mar-0.163
 traded-0.158
Token from
Token ID422
Feature activation-5.20e-02

Pos logit contributions
 reckless+0.306
 gut+0.292
Little+0.282
 frightening+0.281
 beast+0.280
Neg logit contributions
 cand-0.294
 collection-0.294
 wrists-0.290
 coloring-0.278
 mar-0.270
Token making
Token ID1642
Feature activation+1.27e-02

Pos logit contributions
 reckless+0.108
Little+0.099
 gut+0.098
 frightening+0.098
 cal+0.097
Neg logit contributions
 wrists-0.068
 collection-0.068
 cand-0.064
 coloring-0.063
 mar-0.061
Token fun
Token ID1257
Feature activation+2.15e-02

Pos logit contributions
 wrists+0.020
 collection+0.020
 cand+0.020
 mar+0.019
 sandwich+0.018
Neg logit contributions
 reckless-0.023
 frightening-0.020
 gut-0.020
Little-0.020
Far-0.020
Token of
Token ID286
Feature activation+2.85e-03

Pos logit contributions
 collection+0.038
 cand+0.037
 wrists+0.036
 mar+0.034
 notebooks+0.034
Neg logit contributions
 reckless-0.037
 frightening-0.033
 gut-0.032
 afraid-0.032
 cal-0.032
Token you
Token ID345
Feature activation-4.14e-02

Pos logit contributions
 collection+0.005
 wrists+0.005
 cand+0.004
 mar+0.004
 notebooks+0.004
Neg logit contributions
 reckless-0.005
 frightening-0.005
 cal-0.005
 afraid-0.005
Little-0.005
Token,
Token ID11
Feature activation+2.33e-03

Pos logit contributions
 wrists+0.083
 cand+0.081
 collection+0.078
 traded+0.076
 notebooks+0.075
Neg logit contributions
 reckless-0.088
 frightening-0.079
 gut-0.077
 cal-0.077
Little-0.077
Token he
Token ID339
Feature activation-3.03e-02

Pos logit contributions
 wrists+0.004
 cand+0.004
 collection+0.004
 notebooks+0.004
 traded+0.004
Neg logit contributions
 reckless-0.004
 cal-0.004
 frightening-0.003
Little-0.003
 afraid-0.003
Token would
Token ID561
Feature activation-3.57e-02

Pos logit contributions
 reckless+0.056
 cal+0.052
 frightening+0.050
 afraid+0.050
Little+0.049
Neg logit contributions
 wrists-0.047
 cand-0.047
 collection-0.046
 notebooks-0.044
 mar-0.043
Token always
Token ID1464
Feature activation-8.68e-03

Pos logit contributions
 reckless+0.058
 cal+0.057
 gut+0.052
Little+0.052
 afraid+0.050
Neg logit contributions
 wrists-0.060
 cand-0.059
 notebooks-0.059
 clubs-0.055
 mar-0.055
Token prevent
Token ID2948
Feature activation-1.06e-01

Pos logit contributions
 reckless+0.013
 cal+0.013
 afraid+0.012
 fear+0.011
 frightening+0.011
Neg logit contributions
 wrists-0.016
 cand-0.016
 clubs-0.015
 notebooks-0.015
 mar-0.015
Token anyone
Token ID2687
Feature activation-1.82e-01

Pos logit contributions
 reckless+0.168
 frightening+0.151
 cal+0.145
 afraid+0.144
Little+0.143
Neg logit contributions
 collection-0.198
 wrists-0.194
 cand-0.193
 traded-0.182
 notebooks-0.181
Token from
Token ID422
Feature activation-4.97e-02

Pos logit contributions
 reckless+0.294
 frightening+0.265
 gut+0.257
Little+0.256
 cal+0.254
Neg logit contributions
 collection-0.325
 cand-0.318
 wrists-0.313
 mar-0.295
 traded-0.295
Token eating
Token ID6600
Feature activation+4.32e-02

Pos logit contributions
 reckless+0.100
 frightening+0.092
 afraid+0.090
 cal+0.089
 gut+0.087
Neg logit contributions
 collection-0.072
 wrists-0.071
 cand-0.070
 notebooks-0.065
 traded-0.065
Token the
Token ID262
Feature activation-2.13e-02

Pos logit contributions
 wrists+0.076
 collection+0.075
 cand+0.071
 traded+0.071
 notebooks+0.071
Neg logit contributions
 reckless-0.069
 cal-0.064
 frightening-0.064
 afraid-0.061
 911-0.060
Token tasty
Token ID25103
Feature activation+2.95e-02

Pos logit contributions
 frightening+0.038
 reckless+0.038
 cal+0.036
Little+0.035
 afraid+0.035
Neg logit contributions
 wrists-0.035
 collection-0.032
 traded-0.031
 notebooks-0.031
 wrist-0.030
Token treats
Token ID18432
Feature activation+5.40e-02

Pos logit contributions
 wrists+0.041
 traded+0.036
 notebooks+0.036
 wrist+0.035
 collection+0.033
Neg logit contributions
 cal-0.057
 frightening-0.057
 911-0.052
Little-0.052
 beast-0.051
Token by
Token ID416
Feature activation-1.18e-02

Pos logit contributions
 reckless+0.099
 frightening+0.090
 cal+0.089
 afraid+0.088
 911+0.087
Neg logit contributions
 wrists-0.082
 collection-0.082
 cand-0.080
 traded-0.074
 notebooks-0.074
Token the
Token ID262
Feature activation-5.23e-03

Pos logit contributions
 reckless+0.017
 frightening+0.015
 cal+0.015
 afraid+0.015
Little+0.014
Neg logit contributions
 collection-0.023
 cand-0.023
 wrists-0.022
 traded-0.022
 notebooks-0.022
Token door
Token ID3420
Feature activation-1.79e-02

Pos logit contributions
 reckless+0.011
 frightening+0.010
 afraid+0.009
 cal+0.009
Little+0.009
Neg logit contributions
 collection-0.008
 wrists-0.007
 cand-0.007
 notebooks-0.007
 mar-0.007
Token to
Token ID284
Feature activation-1.46e-02

Pos logit contributions
 reckless+0.030
 frightening+0.027
 cal+0.027
 afraid+0.026
Little+0.026
Neg logit contributions
 collection-0.031
 wrists-0.031
 cand-0.030
 notebooks-0.029
 traded-0.028
Token prevent
Token ID2948
Feature activation-1.04e-01

Pos logit contributions
 reckless+0.028
 frightening+0.026
 cal+0.025
 afraid+0.025
Little+0.025
Neg logit contributions
 collection-0.022
 wrists-0.022
 cand-0.021
 notebooks-0.020
 traded-0.020
Token anyone
Token ID2687
Feature activation-1.80e-01

Pos logit contributions
 reckless+0.182
 frightening+0.163
 cal+0.157
Little+0.157
 gut+0.157
Neg logit contributions
 collection-0.175
 cand-0.174
 wrists-0.173
 mar-0.161
 traded-0.159
Token from
Token ID422
Feature activation-4.59e-02

Pos logit contributions
 reckless+0.328
 frightening+0.299
 gut+0.294
Little+0.292
 beast+0.289
Neg logit contributions
 collection-0.284
 cand-0.277
 wrists-0.270
 coloring-0.254
 mar-0.254
Token coming
Token ID2406
Feature activation-1.90e-02

Pos logit contributions
 reckless+0.087
 frightening+0.079
 cal+0.077
Little+0.076
 afraid+0.076
Neg logit contributions
 collection-0.072
 wrists-0.070
 cand-0.069
 traded-0.064
 mar-0.064
Token in
Token ID287
Feature activation+1.63e-03

Pos logit contributions
 reckless+0.040
 frightening+0.037
Little+0.035
 beast+0.035
 cal+0.035
Neg logit contributions
 collection-0.026
 wrists-0.024
 cand-0.024
 traded-0.022
 mar-0.022
Token and
Token ID290
Feature activation-2.54e-02

Pos logit contributions
 collection+0.003
 wrists+0.003
 cand+0.002
 notebooks+0.002
 traded+0.002
Neg logit contributions
 reckless-0.003
 frightening-0.003
 afraid-0.003
 cal-0.003
Little-0.003
Token taking
Token ID2263
Feature activation-1.59e-02

Pos logit contributions
 reckless+0.047
 frightening+0.042
 cal+0.041
 afraid+0.041
 gut+0.040
Neg logit contributions
 collection-0.041
 wrists-0.041
 cand-0.040
 notebooks-0.038
 mar-0.037
Token hadn
Token ID8020
Feature activation-6.28e-02

Pos logit contributions
 reckless+0.019
 frightening+0.018
 afraid+0.018
 beast+0.017
 cal+0.017
Neg logit contributions
 wrists-0.016
 cand-0.015
 collection-0.015
 notebooks-0.015
 extract-0.014
Token't
Token ID470
Feature activation-7.45e-02

Pos logit contributions
 reckless+0.121
 cal+0.108
 gut+0.107
 frightening+0.106
 unnecessary+0.104
Neg logit contributions
 collection-0.098
 traded-0.094
 cand-0.093
 coloring-0.093
 mar-0.093
Token been
Token ID587
Feature activation-8.83e-03

Pos logit contributions
 reckless+0.140
 frightening+0.125
Little+0.123
 gut+0.123
 cal+0.122
Neg logit contributions
 collection-0.119
 cand-0.115
 wrists-0.111
 mar-0.110
 traded-0.109
Token won
Token ID1839
Feature activation-7.82e-02

Pos logit contributions
 reckless+0.016
 frightening+0.014
 afraid+0.014
 cal+0.014
 gut+0.013
Neg logit contributions
 collection-0.015
 wrists-0.015
 cand-0.014
 notebooks-0.014
 traded-0.014
Token by
Token ID416
Feature activation-5.64e-02

Pos logit contributions
 reckless+0.155
 cal+0.141
 frightening+0.141
Run+0.141
 unnecessary+0.139
Neg logit contributions
 collection-0.102
 coloring-0.098
 mar-0.097
 traded-0.093
 cand-0.090
Token anyone
Token ID2687
Feature activation-1.64e-01

Pos logit contributions
 reckless+0.110
 cal+0.100
 frightening+0.098
Little+0.098
Far+0.095
Neg logit contributions
 collection-0.086
 cand-0.085
 wrists-0.084
 mar-0.081
 coloring-0.077
Token for
Token ID329
Feature activation-1.29e-02

Pos logit contributions
 reckless+0.301
 frightening+0.275
 cal+0.267
Little+0.264
 gut+0.263
Neg logit contributions
 collection-0.263
 wrists-0.255
 cand-0.254
 traded-0.237
 mar-0.236
Token a
Token ID257
Feature activation-2.02e-02

Pos logit contributions
 reckless+0.023
 frightening+0.021
 afraid+0.020
 cal+0.020
 gut+0.019
Neg logit contributions
 collection-0.022
 wrists-0.021
 cand-0.021
 traded-0.020
 notebooks-0.020
Token long
Token ID890
Feature activation+7.77e-03

Pos logit contributions
 reckless+0.039
 frightening+0.037
 afraid+0.035
 cal+0.035
 beast+0.035
Neg logit contributions
 wrists-0.029
 collection-0.029
 cand-0.029
 notebooks-0.026
 traded-0.026
Token long
Token ID890
Feature activation+8.97e-03

Pos logit contributions
 wrists+0.015
 cand+0.015
 collection+0.014
 notebooks+0.014
 traded+0.014
Neg logit contributions
 reckless-0.011
 frightening-0.010
 beast-0.010
 afraid-0.010
 evil-0.009
Token time
Token ID640
Feature activation+3.77e-02

Pos logit contributions
 wrists+0.017
 cand+0.017
 collection+0.016
 notebooks+0.016
 traded+0.016
Neg logit contributions
 reckless-0.013
 frightening-0.012
 beast-0.012
 afraid-0.011
 evil-0.011
Token it
Token ID340
Feature activation-2.26e-03

Pos logit contributions
 collection+0.119
 wrists+0.112
 cand+0.112
 traded+0.108
 notebooks+0.105
Neg logit contributions
 reckless-0.143
 frightening-0.133
 afraid-0.129
 evil-0.126
 911-0.125
Token and
Token ID290
Feature activation-1.67e-02

Pos logit contributions
 reckless+0.004
 frightening+0.004
 afraid+0.003
 cal+0.003
 gut+0.003
Neg logit contributions
 wrists-0.004
 collection-0.004
 cand-0.004
 notebooks-0.004
 traded-0.004
Token wouldn
Token ID3636
Feature activation-1.43e-01

Pos logit contributions
 reckless+0.027
 afraid+0.024
 frightening+0.024
 evil+0.023
 cal+0.022
Neg logit contributions
 collection-0.030
 wrists-0.029
 cand-0.029
 notebooks-0.027
 sandwich-0.027
Token't
Token ID470
Feature activation-1.03e-01

Pos logit contributions
 reckless+0.281
 frightening+0.264
 urgent+0.255
 eerie+0.251
 unnecessary+0.251
Neg logit contributions
 collection-0.206
 mar-0.202
 traded-0.196
 cand-0.192
 wrists-0.187
Token let
Token ID1309
Feature activation-1.45e-02

Pos logit contributions
 reckless+0.184
 afraid+0.175
 frightening+0.166
 louder+0.164
 worse+0.164
Neg logit contributions
 wrists-0.164
 collection-0.161
 notebooks-0.154
 cand-0.150
ci-0.143
Token anyone
Token ID2687
Feature activation-1.62e-01

Pos logit contributions
 reckless+0.025
 frightening+0.023
 cal+0.022
 afraid+0.022
Little+0.022
Neg logit contributions
 collection-0.025
 wrists-0.024
 cand-0.024
 traded-0.023
 notebooks-0.022
Token else
Token ID2073
Feature activation-1.72e-02

Pos logit contributions
 reckless+0.250
 afraid+0.224
 frightening+0.223
 cal+0.211
 evil+0.203
Neg logit contributions
 collection-0.315
 wrists-0.311
 cand-0.302
 notebooks-0.286
 mar-0.280
Token play
Token ID711
Feature activation+2.83e-02

Pos logit contributions
 reckless+0.029
 afraid+0.027
 frightening+0.026
 worse+0.025
 clearly+0.025
Neg logit contributions
 wrists-0.031
 collection-0.029
 cand-0.028
 extract-0.027
 notebooks-0.027
Token with
Token ID351
Feature activation+6.01e-02

Pos logit contributions
 wrists+0.054
 collection+0.048
 cand+0.048
 notebooks+0.047
 extract+0.044
Neg logit contributions
 worse-0.041
 louder-0.040
 frightening-0.040
 loudly-0.040
 reckless-0.039
Token it
Token ID340
Feature activation-1.04e-02

Pos logit contributions
 collection+0.119
 wrists+0.115
 cand+0.115
 traded+0.111
 notebooks+0.110
Neg logit contributions
 reckless-0.086
 frightening-0.078
 afraid-0.075
 cal-0.074
 911-0.073
Token.
Token ID13
Feature activation-3.89e-03

Pos logit contributions
 reckless+0.019
 frightening+0.018
 afraid+0.017
 cal+0.017
 evil+0.016
Neg logit contributions
 wrists-0.017
 collection-0.016
 cand-0.016
 notebooks-0.015
 traded-0.015
Token because
Token ID780
Feature activation-1.94e-02

Pos logit contributions
 collection+0.011
 wrists+0.011
 cand+0.010
 mar+0.010
 traded+0.010
Neg logit contributions
 reckless-0.015
 frightening-0.014
Little-0.014
 afraid-0.014
 cal-0.014
Token danger
Token ID3514
Feature activation-9.33e-02

Pos logit contributions
 reckless+0.035
 cal+0.033
 frightening+0.033
Little+0.032
 gut+0.031
Neg logit contributions
 collection-0.030
 cand-0.029
 mar-0.028
 wrists-0.028
 traded-0.027
Token can
Token ID460
Feature activation-3.86e-02

Pos logit contributions
 reckless+0.177
 frightening+0.164
Little+0.159
 cal+0.159
 gut+0.158
Neg logit contributions
 collection-0.138
 cand-0.133
 wrists-0.131
 mar-0.125
 traded-0.125
Token come
Token ID1282
Feature activation-2.14e-02

Pos logit contributions
 reckless+0.081
 frightening+0.075
 afraid+0.073
 cal+0.072
Little+0.072
Neg logit contributions
 collection-0.051
 cand-0.049
 wrists-0.049
 traded-0.045
 mar-0.045
Token to
Token ID284
Feature activation-4.23e-02

Pos logit contributions
 reckless+0.039
 beast+0.036
Little+0.036
 frightening+0.035
 gut+0.035
Neg logit contributions
 collection-0.032
 cand-0.029
 traded-0.029
 mar-0.029
 clubs-0.029
Token anyone
Token ID2687
Feature activation-1.60e-01

Pos logit contributions
 reckless+0.078
 frightening+0.070
 afraid+0.068
Little+0.068
 cal+0.068
Neg logit contributions
 collection-0.066
 cand-0.064
 wrists-0.063
 mar-0.061
 traded-0.060
Token.
Token ID13
Feature activation+3.44e-03

Pos logit contributions
 reckless+0.331
 frightening+0.316
Little+0.311
 911+0.309
 gut+0.308
Neg logit contributions
 collection-0.205
 wrists-0.186
 cand-0.185
 mar-0.176
 traded-0.176
Token<|endoftext|>
Token ID50256
Feature activation-6.84e-02

Pos logit contributions
 collection+0.006
 wrists+0.006
 cand+0.006
 traded+0.005
 notebooks+0.005
Neg logit contributions
 reckless-0.006
 frightening-0.006
 cal-0.005
 afraid-0.005
 gut-0.005
TokenOnce
Token ID7454
Feature activation+2.92e-02

Pos logit contributions
 reckless+0.108
 frightening+0.096
 cal+0.092
 afraid+0.092
 gut+0.091
Neg logit contributions
 collection-0.128
 wrists-0.125
 cand-0.124
 traded-0.118
 notebooks-0.117
Token there
Token ID612
Feature activation+2.80e-02

Pos logit contributions
 collection+0.065
 mar+0.057
 coloring+0.056
 notebooks+0.056
 cand+0.055
Neg logit contributions
 reckless-0.043
 cal-0.037
 frightening-0.036
 eerie-0.036
 unnecessary-0.036
Token was
Token ID373
Feature activation-9.48e-03

Pos logit contributions
 wrists+0.043
 collection+0.041
 cand+0.041
 notebooks+0.040
 traded+0.037
Neg logit contributions
 reckless-0.052
 frightening-0.049
 afraid-0.048
Little-0.048
 cal-0.047
Token your
Token ID534
Feature activation+4.15e-02

Pos logit contributions
 reckless+0.015
 frightening+0.014
 afraid+0.013
 cal+0.013
 gut+0.013
Neg logit contributions
 collection-0.017
 wrists-0.016
 cand-0.016
 traded-0.015
 notebooks-0.015
Token friends
Token ID2460
Feature activation+3.26e-03

Pos logit contributions
 collection+0.062
 wrists+0.060
 cand+0.060
 traded+0.055
 notebooks+0.055
Neg logit contributions
 reckless-0.082
 frightening-0.075
 cal-0.072
 afraid-0.072
Little-0.072
Token and
Token ID290
Feature activation-1.06e-02

Pos logit contributions
 collection+0.005
 wrists+0.005
 cand+0.005
 notebooks+0.005
 traded+0.005
Neg logit contributions
 reckless-0.006
 frightening-0.005
 afraid-0.005
 cal-0.005
 gut-0.005
Token never
Token ID1239
Feature activation-1.02e-01

Pos logit contributions
 reckless+0.019
 gut+0.017
 frightening+0.017
 afraid+0.017
 cal+0.016
Neg logit contributions
 wrists-0.018
 collection-0.018
 cand-0.017
 notebooks-0.017
 mar-0.016
Token let
Token ID1309
Feature activation-7.10e-03

Pos logit contributions
 reckless+0.191
 frightening+0.174
 afraid+0.168
 cal+0.168
 gut+0.164
Neg logit contributions
 collection-0.162
 wrists-0.158
 cand-0.156
 notebooks-0.146
 traded-0.145
Token anyone
Token ID2687
Feature activation-1.60e-01

Pos logit contributions
 reckless+0.012
 frightening+0.011
 afraid+0.011
Little+0.011
 cal+0.011
Neg logit contributions
 collection-0.012
 cand-0.012
 wrists-0.011
 traded-0.011
 mar-0.011
Token tease
Token ID34691
Feature activation-3.47e-02

Pos logit contributions
 reckless+0.292
Little+0.272
 911+0.271
 frightening+0.268
 gut+0.268
Neg logit contributions
 collection-0.238
 cand-0.228
 traded-0.221
 wrists-0.219
 mar-0.216
Token them
Token ID606
Feature activation-1.01e-03

Pos logit contributions
 reckless+0.057
 frightening+0.051
 afraid+0.049
 cal+0.049
 gut+0.048
Neg logit contributions
 collection-0.064
 wrists-0.062
 cand-0.062
 traded-0.059
 notebooks-0.059
Token.
Token ID13
Feature activation+8.60e-03

Pos logit contributions
 reckless+0.002
 frightening+0.002
Little+0.002
 beast+0.002
 gut+0.002
Neg logit contributions
 collection-0.001
 cand-0.001
 wrists-0.001
 traded-0.001
 mar-0.001
Token<|endoftext|>
Token ID50256
Feature activation-6.72e-02

Pos logit contributions
 wrists+0.016
 collection+0.016
 cand+0.016
 notebooks+0.015
 traded+0.015
Neg logit contributions
 reckless-0.013
 frightening-0.012
 cal-0.012
 afraid-0.012
Little-0.012
TokenOnce
Token ID7454
Feature activation+3.11e-02

Pos logit contributions
 reckless+0.115
 frightening+0.104
 cal+0.100
 afraid+0.100
Little+0.098
Neg logit contributions
 collection-0.117
 wrists-0.114
 cand-0.113
 traded-0.106
 notebooks-0.106
Token so
Token ID523
Feature activation+2.33e-02

Pos logit contributions
 reckless+0.041
 frightening+0.037
 afraid+0.036
 cal+0.036
 beast+0.035
Neg logit contributions
 collection-0.035
 wrists-0.035
 cand-0.034
 traded-0.032
 notebooks-0.032
Token they
Token ID484
Feature activation+1.35e-02

Pos logit contributions
 collection+0.029
 wrists+0.028
 cand+0.028
 traded+0.026
 notebooks+0.025
Neg logit contributions
 reckless-0.051
 frightening-0.048
 cal-0.046
 afraid-0.046
 beast-0.046
Token wouldn
Token ID3636
Feature activation-1.46e-01

Pos logit contributions
 collection+0.022
 wrists+0.021
 cand+0.021
 traded+0.020
 notebooks+0.020
Neg logit contributions
 reckless-0.024
 frightening-0.022
 cal-0.021
 afraid-0.021
Little-0.021
Token't
Token ID470
Feature activation-1.01e-01

Pos logit contributions
 reckless+0.302
 frightening+0.289
 unnecessary+0.274
 urgent+0.269
 cal+0.269
Neg logit contributions
 collection-0.199
 traded-0.177
 dress-0.174
 mar-0.173
 wrists-0.172
Token bother
Token ID11393
Feature activation-4.90e-02

Pos logit contributions
 reckless+0.180
 frightening+0.163
 afraid+0.158
 cal+0.157
 gut+0.153
Neg logit contributions
 collection-0.170
 wrists-0.169
 cand-0.165
 notebooks-0.158
 traded-0.154
Token anyone
Token ID2687
Feature activation-1.60e-01

Pos logit contributions
 reckless+0.085
 frightening+0.078
 afraid+0.075
 911+0.075
 cal+0.074
Neg logit contributions
 collection-0.083
 cand-0.081
 wrists-0.081
 traded-0.077
 notebooks-0.076
Token else
Token ID2073
Feature activation-8.43e-03

Pos logit contributions
 reckless+0.269
 frightening+0.242
 gut+0.239
Little+0.237
 cal+0.234
Neg logit contributions
 collection-0.279
 cand-0.269
 wrists-0.267
 traded-0.253
 mar-0.249
Token.
Token ID13
Feature activation+7.25e-03

Pos logit contributions
 reckless+0.015
 frightening+0.014
 afraid+0.013
 cal+0.013
 evil+0.013
Neg logit contributions
 wrists-0.014
 collection-0.014
 cand-0.014
 notebooks-0.013
 mar-0.013
Token<|endoftext|>
Token ID50256
Feature activation-7.21e-02

Pos logit contributions
 wrists+0.013
 collection+0.013
 cand+0.013
 notebooks+0.012
 traded+0.012
Neg logit contributions
 reckless-0.012
 frightening-0.010
Little-0.010
 cal-0.010
 afraid-0.010
TokenOnce
Token ID7454
Feature activation+2.71e-02

Pos logit contributions
 reckless+0.126
 frightening+0.114
 cal+0.110
 afraid+0.110
 gut+0.108
Neg logit contributions
 collection-0.123
 wrists-0.120
 cand-0.119
 traded-0.112
 notebooks-0.111
Token upon
Token ID2402
Feature activation-7.34e-02

Pos logit contributions
 collection+0.062
 mar+0.055
 coloring+0.054
 cand+0.054
 notebooks+0.053
Neg logit contributions
 reckless-0.039
 cal-0.032
 frightening-0.031
 unnecessary-0.031
 beast-0.031
Token D
Token ID360
Feature activation-8.48e-03

Pos logit contributions
 reckless+0.045
 frightening+0.042
 afraid+0.042
 cal+0.039
 gut+0.039
Neg logit contributions
 wrists-0.048
 cand-0.046
 collection-0.046
 notebooks-0.046
 coloring-0.044
Tokenilly
Token ID6548
Feature activation+3.14e-03

Pos logit contributions
 reckless+0.014
 frightening+0.012
 afraid+0.012
 cal+0.012
 911+0.012
Neg logit contributions
 collection-0.016
 cand-0.015
 wrists-0.015
 notebooks-0.014
 traded-0.014
Token wouldn
Token ID3636
Feature activation-1.39e-01

Pos logit contributions
 collection+0.005
 cand+0.005
 wrists+0.005
 notebooks+0.005
 mar+0.004
Neg logit contributions
 reckless-0.006
 frightening-0.005
 afraid-0.005
 cal-0.005
 gut-0.005
Token't
Token ID470
Feature activation-9.65e-02

Pos logit contributions
 reckless+0.315
 frightening+0.295
 unnecessary+0.290
 urgent+0.288
 eerie+0.285
Neg logit contributions
 collection-0.145
�-0.135
 mar-0.134
 traded-0.130
 dress-0.126
Token let
Token ID1309
Feature activation-1.04e-02

Pos logit contributions
 reckless+0.167
 frightening+0.152
 afraid+0.148
 cal+0.146
 gut+0.142
Neg logit contributions
 collection-0.168
 wrists-0.166
 cand-0.163
 notebooks-0.155
 traded-0.152
Token anyone
Token ID2687
Feature activation-1.60e-01

Pos logit contributions
 reckless+0.018
 frightening+0.016
 cal+0.016
 afraid+0.016
 911+0.016
Neg logit contributions
 wrists-0.018
 collection-0.018
 cand-0.017
 notebooks-0.016
 traded-0.016
Token.
Token ID13
Feature activation+8.54e-03

Pos logit contributions
 reckless+0.262
 frightening+0.235
 gut+0.229
Little+0.229
 911+0.226
Neg logit contributions
 collection-0.284
 wrists-0.276
 cand-0.274
 traded-0.261
 mar-0.257
Token She
Token ID1375
Feature activation-1.83e-02

Pos logit contributions
 collection+0.015
 wrists+0.015
 cand+0.015
 notebooks+0.015
 sandwich+0.014
Neg logit contributions
 reckless-0.013
 cal-0.012
Little-0.012
 afraid-0.012
 frightening-0.012
Token kept
Token ID4030
Feature activation-1.63e-02

Pos logit contributions
 reckless+0.033
 frightening+0.030
 cal+0.030
 afraid+0.030
 gut+0.029
Neg logit contributions
 collection-0.029
 cand-0.029
 wrists-0.029
 notebooks-0.027
 coloring-0.026
Token it
Token ID340
Feature activation-2.94e-02

Pos logit contributions
 reckless+0.026
 afraid+0.025
 frightening+0.025
 cal+0.024
 louder+0.023
Neg logit contributions
 cand-0.030
 collection-0.030
 wrists-0.028
 notebooks-0.028
 traded-0.028
Token safe
Token ID3338
Feature activation+5.32e-03

Pos logit contributions
 reckless+0.047
 frightening+0.043
 cal+0.041
 afraid+0.041
 gut+0.040
Neg logit contributions
 wrists-0.054
 collection-0.053
 cand-0.053
 notebooks-0.050
 traded-0.050
Token day
Token ID1110
Feature activation+4.40e-04

Pos logit contributions
 reckless+0.047
 frightening+0.043
 gut+0.043
 cal+0.042
 beast+0.042
Neg logit contributions
 collection-0.017
 cand-0.016
 wrists-0.015
 traded-0.015
 mar-0.014
Token.
Token ID13
Feature activation-7.66e-03

Pos logit contributions
 wrists+0.001
 collection+0.001
 notebooks+0.001
 cand+0.001
 traded+0.001
Neg logit contributions
 reckless-0.001
 frightening-0.001
 afraid-0.001
 late-0.001
 evil-0.001
Token He
Token ID679
Feature activation-3.44e-02

Pos logit contributions
 reckless+0.013
 frightening+0.012
 afraid+0.012
 911+0.012
 cal+0.012
Neg logit contributions
 collection-0.013
 wrists-0.013
 cand-0.013
 mar-0.012
 notebooks-0.012
Token never
Token ID1239
Feature activation-9.58e-02

Pos logit contributions
 reckless+0.074
 frightening+0.069
 cal+0.067
 afraid+0.066
Little+0.066
Neg logit contributions
 collection-0.044
 wrists-0.043
 cand-0.043
 traded-0.039
 mar-0.039
Token helped
Token ID4193
Feature activation+3.10e-03

Pos logit contributions
 reckless+0.211
 911+0.202
 gut+0.202
Little+0.199
 frightening+0.198
Neg logit contributions
 collection-0.112
 cand-0.112
 traded-0.108
 mar-0.100
 wrists-0.099
Token anyone
Token ID2687
Feature activation-1.60e-01

Pos logit contributions
 collection+0.005
 cand+0.005
 wrists+0.005
 traded+0.005
 mar+0.005
Neg logit contributions
 reckless-0.006
 frightening-0.005
 cal-0.005
 afraid-0.005
Little-0.005
Token in
Token ID287
Feature activation+1.56e-02

Pos logit contributions
 reckless+0.286
 frightening+0.261
Little+0.255
 gut+0.251
 911+0.250
Neg logit contributions
 collection-0.260
 cand-0.252
 wrists-0.243
 mar-0.234
 coloring-0.232
Token the
Token ID262
Feature activation-1.23e-02

Pos logit contributions
 collection+0.027
 cand+0.026
 wrists+0.025
 mar+0.025
 notebooks+0.024
Neg logit contributions
 reckless-0.027
 frightening-0.024
Little-0.024
 cal-0.023
 gut-0.023
Token village
Token ID7404
Feature activation-4.36e-02

Pos logit contributions
 reckless+0.026
 frightening+0.024
 afraid+0.023
 cal+0.023
Little+0.023
Neg logit contributions
 wrists-0.017
 collection-0.017
 cand-0.016
 traded-0.015
 notebooks-0.015
Token.
Token ID13
Feature activation-7.16e-03

Pos logit contributions
 reckless+0.082
 frightening+0.075
 afraid+0.073
 cal+0.073
 gut+0.071
Neg logit contributions
 collection-0.068
 wrists-0.067
 cand-0.066
 notebooks-0.061
 traded-0.061
Token One
Token ID1881
Feature activation-3.61e-02

Pos logit contributions
 reckless+0.012
Little+0.011
 frightening+0.011
 911+0.011
 cal+0.011
Neg logit contributions
 wrists-0.013
 collection-0.012
 cand-0.012
 notebooks-0.011
 mar-0.011
Token taken
Token ID2077
Feature activation-2.17e-02

Pos logit contributions
 collection+0.081
 wrists+0.080
 cand+0.078
 notebooks+0.073
 mar+0.072
Neg logit contributions
 reckless-0.098
 frightening-0.089
 afraid-0.087
 cal-0.085
Little-0.084
Token but
Token ID475
Feature activation-3.98e-02

Pos logit contributions
 reckless+0.038
 frightening+0.035
 cal+0.034
 afraid+0.034
 911+0.033
Neg logit contributions
 wrists-0.037
 collection-0.037
 cand-0.035
 notebooks-0.033
 traded-0.033
Token the
Token ID262
Feature activation-7.25e-03

Pos logit contributions
 reckless+0.068
 frightening+0.062
 cal+0.060
 afraid+0.060
Little+0.059
Neg logit contributions
 collection-0.069
 wrists-0.068
 cand-0.067
 traded-0.063
 notebooks-0.063
Token park
Token ID3952
Feature activation+3.50e-02

Pos logit contributions
 reckless+0.013
 frightening+0.013
 afraid+0.013
 cal+0.012
 evil+0.012
Neg logit contributions
 collection-0.011
 wrists-0.011
 cand-0.011
 traded-0.010
 mar-0.010
Token keeper
Token ID28356
Feature activation-3.76e-02

Pos logit contributions
 wrists+0.063
 collection+0.059
 cand+0.058
 notebooks+0.058
 wrist+0.056
Neg logit contributions
 reckless-0.059
 frightening-0.056
 cal-0.053
 beast-0.052
 afraid-0.052
Token wouldn
Token ID3636
Feature activation-1.59e-01

Pos logit contributions
 reckless+0.073
 frightening+0.067
 cal+0.066
 afraid+0.066
Little+0.064
Neg logit contributions
 collection-0.057
 wrists-0.056
 cand-0.055
 notebooks-0.052
 mar-0.050
Token't
Token ID470
Feature activation-1.13e-01

Pos logit contributions
 reckless+0.337
 frightening+0.317
 unnecessary+0.306
 eerie+0.297
 afraid+0.297
Neg logit contributions
 collection-0.196
 traded-0.185
 cand-0.178
 mar-0.178
 wrists-0.174
Token listen
Token ID6004
Feature activation-1.03e-02

Pos logit contributions
 reckless+0.191
 frightening+0.173
 cal+0.166
 afraid+0.166
Little+0.163
Neg logit contributions
 collection-0.199
 wrists-0.195
 cand-0.193
 traded-0.182
 notebooks-0.181
Token.
Token ID13
Feature activation-1.23e-03

Pos logit contributions
 reckless+0.021
 frightening+0.019
 cal+0.018
 afraid+0.018
Little+0.018
Neg logit contributions
 collection-0.015
 wrists-0.015
 cand-0.014
 traded-0.013
 notebooks-0.013
Token He
Token ID679
Feature activation-2.82e-02

Pos logit contributions
 reckless+0.002
 frightening+0.002
 cal+0.002
 afraid+0.002
Little+0.002
Neg logit contributions
 collection-0.002
 wrists-0.002
 cand-0.002
 traded-0.002
 notebooks-0.002
Token kept
Token ID4030
Feature activation-1.66e-02

Pos logit contributions
 reckless+0.055
 frightening+0.050
 cal+0.049
 afraid+0.049
Little+0.048
Neg logit contributions
 collection-0.042
 wrists-0.041
 cand-0.041
 notebooks-0.038
 traded-0.038
Token little
Token ID1310
Feature activation-3.46e-02

Pos logit contributions
 reckless+0.031
 frightening+0.027
 cal+0.026
 afraid+0.026
 gut+0.026
Neg logit contributions
 collection-0.040
 cand-0.040
 wrists-0.039
 mar-0.037
 notebooks-0.037
Token girl
Token ID2576
Feature activation-1.28e-02

Pos logit contributions
 reckless+0.064
 frightening+0.059
 cal+0.057
 afraid+0.056
Little+0.056
Neg logit contributions
 collection-0.055
 wrists-0.054
 cand-0.053
 traded-0.050
 notebooks-0.050
Token was
Token ID373
Feature activation-4.42e-03

Pos logit contributions
 reckless+0.026
 frightening+0.024
 cal+0.023
 afraid+0.023
Little+0.022
Neg logit contributions
 collection-0.019
 wrists-0.018
 cand-0.018
 notebooks-0.017
 traded-0.017
Token scared
Token ID12008
Feature activation-7.97e-02

Pos logit contributions
 reckless+0.009
Little+0.008
 beast+0.008
 frightening+0.008
 gut+0.008
Neg logit contributions
 collection-0.006
 cand-0.006
 wrists-0.006
 traded-0.006
 coloring-0.005
Token she
Token ID673
Feature activation-5.11e-02

Pos logit contributions
 reckless+0.182
 beast+0.167
 frightening+0.167
Little+0.166
 gut+0.165
Neg logit contributions
 collection-0.087
 cand-0.083
 wrists-0.078
 coloring-0.074
 mar-0.073
Token wouldn
Token ID3636
Feature activation-1.59e-01

Pos logit contributions
 reckless+0.096
 frightening+0.087
Little+0.086
 cal+0.085
 911+0.085
Neg logit contributions
 collection-0.079
 wrists-0.075
 cand-0.075
 traded-0.074
 mar-0.070
Token't
Token ID470
Feature activation-1.12e-01

Pos logit contributions
 reckless+0.368
 frightening+0.353
 unnecessary+0.337
 eerie+0.332
 urgent+0.331
Neg logit contributions
 collection-0.162
 dress-0.148
 traded-0.144
 mar-0.142
�-0.139
Token come
Token ID1282
Feature activation-3.23e-02

Pos logit contributions
 reckless+0.218
 frightening+0.201
Little+0.194
 cal+0.193
 beast+0.191
Neg logit contributions
 collection-0.167
 cand-0.159
 wrists-0.158
 traded-0.151
 mar-0.146
Token back
Token ID736
Feature activation-2.54e-02

Pos logit contributions
 reckless+0.071
 frightening+0.065
Little+0.063
 cal+0.063
 beast+0.062
Neg logit contributions
 collection-0.041
 wrists-0.039
 cand-0.037
 traded-0.036
 coloring-0.035
Token.
Token ID13
Feature activation-1.21e-02

Pos logit contributions
 reckless+0.055
 frightening+0.050
 beast+0.048
Little+0.048
 cal+0.048
Neg logit contributions
 collection-0.034
 wrists-0.031
 cand-0.031
 coloring-0.029
 mar-0.029
Token But
Token ID887
Feature activation-4.29e-02

Pos logit contributions
 reckless+0.023
 frightening+0.022
 afraid+0.021
 cal+0.021
Little+0.021
Neg logit contributions
 collection-0.018
 wrists-0.018
 cand-0.018
 notebooks-0.016
 traded-0.016
Token and
Token ID290
Feature activation-1.35e-02

Pos logit contributions
 reckless+0.062
 afraid+0.058
 frightening+0.056
 clearly+0.052
 worse+0.052
Neg logit contributions
 wrists-0.071
 notebooks-0.071
 cand-0.071
 extract-0.069
 traded-0.069
Token he
Token ID339
Feature activation-2.42e-02

Pos logit contributions
 reckless+0.023
 cal+0.021
 frightening+0.021
 gut+0.021
 afraid+0.021
Neg logit contributions
 wrists-0.022
 collection-0.021
 cand-0.021
 notebooks-0.021
 clubs-0.020
Token would
Token ID561
Feature activation-3.47e-02

Pos logit contributions
 reckless+0.043
 cal+0.039
 afraid+0.039
 frightening+0.038
 gut+0.037
Neg logit contributions
 cand-0.038
 wrists-0.038
 collection-0.038
 notebooks-0.036
 coloring-0.035
Token never
Token ID1239
Feature activation-1.09e-01

Pos logit contributions
 reckless+0.062
 cal+0.059
 afraid+0.057
 gut+0.057
 frightening+0.057
Neg logit contributions
 cand-0.053
 wrists-0.051
 notebooks-0.051
 collection-0.050
 mar-0.047
Token let
Token ID1309
Feature activation-8.65e-03

Pos logit contributions
 reckless+0.169
 afraid+0.151
 frightening+0.150
 cal+0.149
 evil+0.141
Neg logit contributions
 wrists-0.207
 cand-0.202
 collection-0.202
 notebooks-0.195
 mar-0.187
Token anyone
Token ID2687
Feature activation-1.59e-01

Pos logit contributions
 reckless+0.015
 frightening+0.014
Little+0.013
 afraid+0.013
 gut+0.013
Neg logit contributions
 collection-0.015
 cand-0.014
 wrists-0.014
 traded-0.014
 mar-0.013
Token else
Token ID2073
Feature activation-1.32e-02

Pos logit contributions
 reckless+0.240
 frightening+0.214
 cal+0.205
 afraid+0.205
Little+0.200
Neg logit contributions
 collection-0.308
 wrists-0.302
 cand-0.300
 traded-0.283
 notebooks-0.283
Token get
Token ID651
Feature activation-2.22e-02

Pos logit contributions
 reckless+0.022
 frightening+0.021
 afraid+0.021
 cal+0.020
 evil+0.019
Neg logit contributions
 wrists-0.023
 collection-0.022
 cand-0.022
 notebooks-0.021
 extract-0.020
Token near
Token ID1474
Feature activation-3.30e-02

Pos logit contributions
 reckless+0.036
 frightening+0.033
 afraid+0.032
 cal+0.032
 evil+0.030
Neg logit contributions
 collection-0.040
 wrists-0.040
 cand-0.039
 notebooks-0.037
 traded-0.036
Token it
Token ID340
Feature activation-2.69e-02

Pos logit contributions
 reckless+0.050
 frightening+0.047
 cal+0.045
 afraid+0.044
 beast+0.044
Neg logit contributions
 collection-0.061
 wrists-0.060
 cand-0.060
 traded-0.058
 notebooks-0.058
Token.
Token ID13
Feature activation-3.80e-03

Pos logit contributions
 reckless+0.046
 frightening+0.043
 cal+0.042
 afraid+0.042
 evil+0.039
Neg logit contributions
 wrists-0.046
 cand-0.045
 collection-0.044
 notebooks-0.043
 traded-0.041
Token to
Token ID284
Feature activation-2.61e-02

Pos logit contributions
 reckless+0.005
 frightening+0.004
 afraid+0.004
 cal+0.004
 gut+0.004
Neg logit contributions
 collection-0.005
 wrists-0.005
 cand-0.005
 notebooks-0.005
 traded-0.005
Token prevent
Token ID2948
Feature activation-1.12e-01

Pos logit contributions
 reckless+0.050
 frightening+0.046
 afraid+0.044
 cal+0.044
Little+0.044
Neg logit contributions
 collection-0.040
 wrists-0.039
 cand-0.038
 traded-0.036
 notebooks-0.036
Token them
Token ID606
Feature activation-3.64e-02

Pos logit contributions
 reckless+0.193
 frightening+0.175
 cal+0.168
 afraid+0.168
Little+0.165
Neg logit contributions
 collection-0.194
 wrists-0.190
 cand-0.188
 traded-0.177
 notebooks-0.177
Token from
Token ID422
Feature activation-6.12e-02

Pos logit contributions
 reckless+0.046
 frightening+0.040
 cal+0.038
 afraid+0.038
Little+0.037
Neg logit contributions
 collection-0.079
 wrists-0.078
 cand-0.077
 traded-0.074
 notebooks-0.073
Token hurting
Token ID21530
Feature activation-2.00e-02

Pos logit contributions
 reckless+0.116
 frightening+0.105
 cal+0.103
Little+0.103
 gut+0.102
Neg logit contributions
 collection-0.095
 wrists-0.092
 cand-0.092
 mar-0.086
 traded-0.085
Token anyone
Token ID2687
Feature activation-1.59e-01

Pos logit contributions
 reckless+0.035
 frightening+0.032
 cal+0.031
 afraid+0.031
Little+0.030
Neg logit contributions
 collection-0.034
 wrists-0.033
 cand-0.033
 traded-0.031
 notebooks-0.031
Token,
Token ID11
Feature activation-9.86e-03

Pos logit contributions
 reckless+0.303
 frightening+0.278
Little+0.269
 cal+0.269
 gut+0.268
Neg logit contributions
 collection-0.240
 cand-0.233
 wrists-0.231
 mar-0.214
 traded-0.211
Token but
Token ID475
Feature activation-3.67e-02

Pos logit contributions
 reckless+0.016
 frightening+0.015
 cal+0.014
Little+0.014
 afraid+0.014
Neg logit contributions
 collection-0.017
 wrists-0.017
 cand-0.017
 traded-0.016
 mar-0.016
Token there
Token ID612
Feature activation+1.46e-02

Pos logit contributions
 reckless+0.059
 frightening+0.053
 afraid+0.053
 gut+0.050
 cal+0.050
Neg logit contributions
 wrists-0.070
 collection-0.068
 cand-0.067
 notebooks-0.065
 wrist-0.063
Token were
Token ID547
Feature activation+3.83e-02

Pos logit contributions
 wrists+0.023
 cand+0.022
 collection+0.022
 notebooks+0.021
 wrist+0.021
Neg logit contributions
 reckless-0.028
 frightening-0.025
 afraid-0.025
 cal-0.025
 gut-0.024
Token too
Token ID1165
Feature activation-2.56e-02

Pos logit contributions
 collection+0.065
 wrists+0.064
 cand+0.063
 notebooks+0.060
 traded+0.059
Neg logit contributions
 reckless-0.067
 frightening-0.061
 afraid-0.059
 cal-0.059
 gut-0.057
Token,
Token ID11
Feature activation-2.62e-03

Pos logit contributions
 reckless+0.093
 unnecessary+0.086
 gut+0.084
Little+0.082
Young+0.081
Neg logit contributions
 collection-0.048
 cand-0.047
 coloring-0.046
 mar-0.044
 tea-0.042
Token "
Token ID366
Feature activation-2.21e-02

Pos logit contributions
 reckless+0.005
 frightening+0.004
 gut+0.004
Little+0.004
 unnecessary+0.004
Neg logit contributions
 collection-0.004
 wrists-0.004
 cand-0.004
 traded-0.004
 mar-0.004
TokenWhy
Token ID5195
Feature activation-4.04e-02

Pos logit contributions
 reckless+0.038
 gut+0.037
 outage+0.036
 frightening+0.035
 unnecessary+0.035
Neg logit contributions
 traded-0.032
 cand-0.032
 wrists-0.031
 mar-0.030
 shared-0.030
Token won
Token ID1839
Feature activation-5.96e-02

Pos logit contributions
 reckless+0.094
 frightening+0.094
 cal+0.092
 unnecessary+0.092
Little+0.091
Neg logit contributions
 collection-0.037
 cand-0.033
 dress-0.032
 mar-0.032
 sandwich-0.032
Token't
Token ID470
Feature activation-9.22e-02

Pos logit contributions
 reckless+0.122
 gut+0.112
 unnecessary+0.112
 frightening+0.112
rig+0.110
Neg logit contributions
 collection-0.070
 mar-0.070
 cand-0.069
 dress-0.067
 traded-0.065
Token anyone
Token ID2687
Feature activation-1.59e-01

Pos logit contributions
 reckless+0.195
 unnecessary+0.177
 frightening+0.177
 gut+0.174
 afraid+0.172
Neg logit contributions
 collection-0.117
 cand-0.115
 traded-0.106
 dress-0.106
 mar-0.105
Token be
Token ID307
Feature activation+3.55e-02

Pos logit contributions
 reckless+0.293
 gut+0.281
Little+0.271
 911+0.270
 frightening+0.265
Neg logit contributions
 collection-0.230
 cand-0.228
 traded-0.220
 coloring-0.217
 sandwich-0.211
Token my
Token ID616
Feature activation+4.17e-02

Pos logit contributions
 collection+0.066
 wrists+0.065
 cand+0.064
 coloring+0.063
 traded+0.063
Neg logit contributions
 reckless-0.052
Little-0.048
 gut-0.047
 frightening-0.046
 beast-0.046
Token friend
Token ID1545
Feature activation-3.30e-02

Pos logit contributions
 collection+0.055
 cand+0.050
 wrists+0.049
 mar+0.047
 coloring+0.046
Neg logit contributions
 reckless-0.091
Little-0.084
 frightening-0.083
 cal-0.083
Down-0.081
Token?"
Token ID1701
Feature activation-4.70e-02

Pos logit contributions
 reckless+0.044
 frightening+0.039
 gut+0.038
 cal+0.037
 911+0.036
Neg logit contributions
 collection-0.069
 cand-0.067
 wrists-0.065
 traded-0.065
 coloring-0.064
Token\n
Token ID198
Feature activation-3.92e-02

Pos logit contributions
 reckless+0.086
 frightening+0.079
 afraid+0.076
 cal+0.075
 gut+0.075
Neg logit contributions
 collection-0.075
 wrists-0.074
 cand-0.073
 traded-0.069
 mar-0.068
Token a
Token ID257
Feature activation-2.92e-03

Pos logit contributions
 collection+0.042
 wrists+0.041
 cand+0.040
 traded+0.038
 notebooks+0.038
Neg logit contributions
 reckless-0.047
 frightening-0.043
 afraid-0.041
 cal-0.041
 gut-0.040
Token safe
Token ID3338
Feature activation+1.69e-02

Pos logit contributions
 reckless+0.005
 frightening+0.005
 cal+0.004
 afraid+0.004
 gut+0.004
Neg logit contributions
 wrists-0.005
 collection-0.005
 cand-0.005
 notebooks-0.005
 traded-0.005
Token place
Token ID1295
Feature activation+1.86e-02

Pos logit contributions
 wrists+0.024
 cand+0.023
 collection+0.023
 notebooks+0.022
 traded+0.022
Neg logit contributions
 reckless-0.033
 frightening-0.032
 gut-0.031
 cal-0.031
 afraid-0.031
Token so
Token ID523
Feature activation+2.42e-02

Pos logit contributions
 wrists+0.032
 cand+0.032
 collection+0.031
 notebooks+0.030
 traded+0.030
Neg logit contributions
 reckless-0.031
 frightening-0.028
 afraid-0.028
 cal-0.027
 gut-0.027
Token it
Token ID340
Feature activation-3.14e-02

Pos logit contributions
 collection+0.034
 wrists+0.031
 cand+0.031
 mar+0.028
 coloring+0.028
Neg logit contributions
 reckless-0.049
 beast-0.046
 cal-0.045
 frightening-0.045
 911-0.044
Token wouldn
Token ID3636
Feature activation-1.58e-01

Pos logit contributions
 reckless+0.059
 frightening+0.054
Little+0.053
 beast+0.052
 911+0.052
Neg logit contributions
 collection-0.050
 cand-0.046
 wrists-0.046
 traded-0.044
 mar-0.043
Token't
Token ID470
Feature activation-1.09e-01

Pos logit contributions
 reckless+0.328
 frightening+0.303
 unnecessary+0.294
 urgent+0.290
 gut+0.288
Neg logit contributions
 collection-0.200
 traded-0.183
 mar-0.183
 dress-0.181
 cand-0.177
Token get
Token ID651
Feature activation-3.97e-02

Pos logit contributions
 reckless+0.208
 frightening+0.188
Little+0.186
 beast+0.184
 gut+0.183
Neg logit contributions
 collection-0.169
 cand-0.162
 wrists-0.158
 traded-0.153
 mar-0.151
Token lost
Token ID2626
Feature activation-4.67e-02

Pos logit contributions
 reckless+0.062
 frightening+0.056
Little+0.054
 cal+0.053
 gut+0.053
Neg logit contributions
 collection-0.074
 cand-0.072
 wrists-0.072
 traded-0.069
 mar-0.068
Token again
Token ID757
Feature activation-9.07e-02

Pos logit contributions
 reckless+0.081
 frightening+0.073
 cal+0.070
Little+0.070
 afraid+0.070
Neg logit contributions
 collection-0.080
 wrists-0.078
 cand-0.078
 mar-0.073
 traded-0.072
Token.
Token ID13
Feature activation-1.99e-03

Pos logit contributions
 reckless+0.177
 frightening+0.163
 afraid+0.157
 cal+0.157
Little+0.154
Neg logit contributions
 collection-0.136
 wrists-0.133
 cand-0.131
 traded-0.123
 notebooks-0.122
Token from
Token ID422
Feature activation-1.88e-02

Pos logit contributions
 reckless+0.040
 frightening+0.035
 cal+0.034
 afraid+0.034
 911+0.033
Neg logit contributions
 wrists-0.060
 collection-0.058
 cand-0.058
 notebooks-0.057
 traded-0.055
Token the
Token ID262
Feature activation-7.59e-03

Pos logit contributions
 reckless+0.034
 frightening+0.030
 cal+0.030
 afraid+0.029
Little+0.029
Neg logit contributions
 collection-0.032
 wrists-0.031
 cand-0.030
 mar-0.029
 traded-0.029
Token river
Token ID7850
Feature activation-3.62e-02

Pos logit contributions
 reckless+0.015
 frightening+0.014
 cal+0.013
 afraid+0.013
Little+0.013
Neg logit contributions
 collection-0.011
 wrists-0.011
 cand-0.011
 mar-0.010
 traded-0.010
Token so
Token ID523
Feature activation+2.27e-02

Pos logit contributions
 reckless+0.064
 frightening+0.057
 afraid+0.057
 cal+0.055
 gut+0.054
Neg logit contributions
 wrists-0.061
 cand-0.059
 collection-0.059
 notebooks-0.058
 traded-0.056
Token it
Token ID340
Feature activation-3.19e-02

Pos logit contributions
 collection+0.033
 cand+0.031
 wrists+0.031
 traded+0.030
 mar+0.030
Neg logit contributions
 reckless-0.045
 cal-0.042
 frightening-0.042
 911-0.040
Little-0.040
Token wouldn
Token ID3636
Feature activation-1.58e-01

Pos logit contributions
 reckless+0.066
 frightening+0.060
Little+0.060
 911+0.058
 gut+0.058
Neg logit contributions
 collection-0.046
 wrists-0.042
 cand-0.042
 traded-0.042
 mar-0.041
Token't
Token ID470
Feature activation-1.09e-01

Pos logit contributions
 reckless+0.346
 frightening+0.324
 unnecessary+0.310
 eerie+0.306
 urgent+0.303
Neg logit contributions
 collection-0.188
 mar-0.180
 traded-0.177
 dress-0.169
 cand-0.164
Token get
Token ID651
Feature activation-3.94e-02

Pos logit contributions
 reckless+0.216
 frightening+0.195
Little+0.193
 gut+0.191
 cal+0.190
Neg logit contributions
 collection-0.161
 cand-0.153
 wrists-0.148
 traded-0.148
 mar-0.145
Token stuck
Token ID7819
Feature activation-3.22e-02

Pos logit contributions
 reckless+0.064
 frightening+0.057
Little+0.056
 gut+0.056
 beast+0.055
Neg logit contributions
 collection-0.071
 cand-0.070
 wrists-0.069
 traded-0.066
 mar-0.066
Token on
Token ID319
Feature activation+3.09e-02

Pos logit contributions
 reckless+0.069
 frightening+0.064
Little+0.063
 cal+0.063
 gut+0.062
Neg logit contributions
 collection-0.042
 wrists-0.039
 cand-0.039
 mar-0.037
 traded-0.037
Token the
Token ID262
Feature activation-1.82e-02

Pos logit contributions
 collection+0.054
 wrists+0.053
 cand+0.052
 mar+0.050
 traded+0.049
Neg logit contributions
 reckless-0.053
 frightening-0.047
 afraid-0.046
 cal-0.046
Little-0.045
Token not
Token ID407
Feature activation-7.20e-02

Pos logit contributions
 reckless+0.035
 cal+0.033
 frightening+0.033
Little+0.032
 beast+0.032
Neg logit contributions
 collection-0.029
 wrists-0.027
 cand-0.027
 coloring-0.026
 traded-0.025
Token to
Token ID284
Feature activation-2.14e-02

Pos logit contributions
 reckless+0.130
 frightening+0.119
Little+0.116
 cal+0.115
 gut+0.113
Neg logit contributions
 collection-0.118
 cand-0.114
 wrists-0.114
 traded-0.107
 mar-0.107
Token be
Token ID307
Feature activation+3.26e-02

Pos logit contributions
 reckless+0.039
 frightening+0.035
 afraid+0.034
 cal+0.034
 gut+0.033
Neg logit contributions
 collection-0.035
 wrists-0.035
 cand-0.035
 notebooks-0.033
 traded-0.032
Token seen
Token ID1775
Feature activation-6.07e-02

Pos logit contributions
 collection+0.062
 cand+0.060
 wrists+0.060
 traded+0.058
 mar+0.058
Neg logit contributions
 reckless-0.049
Little-0.044
 frightening-0.044
 cal-0.043
 gut-0.043
Token by
Token ID416
Feature activation-3.50e-02

Pos logit contributions
 reckless+0.128
Little+0.115
 frightening+0.115
Run+0.114
 outage+0.113
Neg logit contributions
 cand-0.081
 collection-0.080
 mar-0.076
 coloring-0.073
 wrists-0.070
Token anyone
Token ID2687
Feature activation-1.58e-01

Pos logit contributions
 reckless+0.064
 frightening+0.057
 cal+0.057
Little+0.056
 afraid+0.055
Neg logit contributions
 collection-0.057
 wrists-0.056
 cand-0.056
 mar-0.053
 notebooks-0.051
Token.
Token ID13
Feature activation-1.03e-02

Pos logit contributions
 reckless+0.293
 frightening+0.269
 gut+0.260
 cal+0.260
Little+0.259
Neg logit contributions
 collection-0.249
 cand-0.239
 wrists-0.237
 traded-0.224
 mar-0.223
Token
Token ID220
Feature activation-2.34e-04

Pos logit contributions
 reckless+0.016
 frightening+0.015
Little+0.015
 afraid+0.015
 cal+0.015
Neg logit contributions
 wrists-0.018
 cand-0.018
 collection-0.018
 notebooks-0.017
 mar-0.017
Token\n
Token ID198
Feature activation-3.37e-02

Pos logit contributions
 reckless+0.001
 frightening+0.000
 cal+0.000
 evil+0.000
 beast+0.000
Neg logit contributions
 collection-0.000
 traded-0.000
 wrists-0.000
 cand-0.000
 mar-0.000
Token\n
Token ID198
Feature activation-2.86e-02

Pos logit contributions
 reckless+0.076
 frightening+0.068
 cal+0.066
 evil+0.066
 unnecessary+0.066
Neg logit contributions
 collection-0.043
 wrists-0.040
 cand-0.040
 traded-0.038
 mar-0.038
TokenOne
Token ID3198
Feature activation-5.24e-03

Pos logit contributions
 reckless+0.056
 frightening+0.050
 cal+0.050
 unnecessary+0.049
 gut+0.049
Neg logit contributions
 collection-0.043
 traded-0.041
 cand-0.040
 wrists-0.040
 mar-0.039
Token felt
Token ID2936
Feature activation-3.55e-02

Pos logit contributions
 reckless+0.004
 cal+0.004
 afraid+0.003
 louder+0.003
 frightening+0.003
Neg logit contributions
 collection-0.004
 cand-0.004
uggets-0.004
 extract-0.004
 wrists-0.004
Token happy
Token ID3772
Feature activation+2.07e-02

Pos logit contributions
 reckless+0.060
 frightening+0.054
 afraid+0.052
Little+0.052
 cal+0.052
Neg logit contributions
 collection-0.062
 wrists-0.061
 cand-0.061
 traded-0.057
 notebooks-0.057
Token.
Token ID13
Feature activation+2.16e-02

Pos logit contributions
 wrists+0.038
 collection+0.037
 traded+0.035
 cand+0.035
 clubs+0.034
Neg logit contributions
 reckless-0.033
 afraid-0.030
 cal-0.030
 frightening-0.030
 louder-0.029
Token They
Token ID1119
Feature activation+5.67e-02

Pos logit contributions
 collection+0.026
 cand+0.024
 wrists+0.023
 traded+0.022
 notebooks+0.021
Neg logit contributions
 reckless-0.049
 afraid-0.046
 frightening-0.046
 cal-0.045
Down-0.045
Token liked
Token ID8288
Feature activation+1.03e-03

Pos logit contributions
 collection+0.087
 cand+0.084
 wrists+0.084
 notebooks+0.078
 coloring+0.076
Neg logit contributions
 reckless-0.107
 afraid-0.096
 frightening-0.095
 cal-0.095
Little-0.092
Token their
Token ID511
Feature activation+8.62e-02

Pos logit contributions
 collection+0.002
 wrists+0.002
 traded+0.002
 cand+0.002
 wrist+0.002
Neg logit contributions
 reckless-0.002
 frightening-0.002
 cal-0.002
 lightning-0.002
 afraid-0.002
Token room
Token ID2119
Feature activation+7.67e-02

Pos logit contributions
 wrists+0.104
 traded+0.104
 cand+0.093
 notebooks+0.092
 collection+0.092
Neg logit contributions
 reckless-0.169
 frightening-0.167
 lightning-0.165
 gut-0.164
 cal-0.163
Token and
Token ID290
Feature activation-6.69e-03

Pos logit contributions
 wrists+0.139
 collection+0.131
 cand+0.130
 traded+0.127
 notebooks+0.126
Neg logit contributions
 reckless-0.119
 cal-0.115
 afraid-0.107
 frightening-0.107
 louder-0.104
Token their
Token ID511
Feature activation+8.87e-02

Pos logit contributions
 reckless+0.013
 afraid+0.012
 frightening+0.012
 cal+0.011
 lightning+0.011
Neg logit contributions
 collection-0.009
 wrists-0.009
 cand-0.009
 extract-0.009
 traded-0.008
Token decorations
Token ID39011
Feature activation+4.35e-02

Pos logit contributions
 traded+0.104
 wrists+0.104
 cand+0.099
 collection+0.099
 notebooks+0.092
Neg logit contributions
 reckless-0.179
 frightening-0.172
 gut-0.170
 cal-0.169
 lightning-0.166
Token.
Token ID13
Feature activation+2.84e-02

Pos logit contributions
 wrists+0.085
 notebooks+0.082
 cand+0.080
 traded+0.080
 collection+0.079
Neg logit contributions
 cal-0.055
 reckless-0.055
 afraid-0.049
 worse-0.048
 clearly-0.048
Token in
Token ID287
Feature activation+2.53e-02

Pos logit contributions
 collection+0.049
 wrists+0.048
 cand+0.047
 traded+0.044
 notebooks+0.044
Neg logit contributions
 reckless-0.053
 frightening-0.049
 cal-0.047
 afraid-0.047
Little-0.046
Token the
Token ID262
Feature activation-9.49e-04

Pos logit contributions
 collection+0.042
 wrists+0.041
 cand+0.041
 mar+0.038
 notebooks+0.038
Neg logit contributions
 reckless-0.046
 frightening-0.040
 gut-0.039
 cal-0.039
 afraid-0.039
Token attic
Token ID46113
Feature activation-7.50e-03

Pos logit contributions
 reckless+0.002
 frightening+0.002
 cal+0.002
 afraid+0.002
 beast+0.002
Neg logit contributions
 collection-0.001
 wrists-0.001
 cand-0.001
 traded-0.001
 mar-0.001
Token.
Token ID13
Feature activation+1.08e-02

Pos logit contributions
 reckless+0.014
 frightening+0.013
 cal+0.013
Little+0.013
 afraid+0.013
Neg logit contributions
 collection-0.011
 wrists-0.011
 cand-0.011
 traded-0.010
 mar-0.010
Token We
Token ID775
Feature activation+5.16e-02

Pos logit contributions
 wrists+0.014
 cand+0.013
 collection+0.013
 traded+0.013
 coloring+0.012
Neg logit contributions
 reckless-0.022
 frightening-0.021
 gut-0.020
 beast-0.020
 cal-0.020
Token have
Token ID423
Feature activation+8.58e-02

Pos logit contributions
 collection+0.083
 wrists+0.080
 cand+0.079
 traded+0.076
 mar+0.074
Neg logit contributions
 reckless-0.095
 frightening-0.087
 cal-0.084
Little-0.083
 gut-0.083
Token to
Token ID284
Feature activation+1.31e-02

Pos logit contributions
 collection+0.129
 wrists+0.128
 cand+0.125
 notebooks+0.117
 traded+0.117
Neg logit contributions
 reckless-0.165
 frightening-0.153
Little-0.149
 cal-0.147
 beast-0.146
Token remove
Token ID4781
Feature activation-1.41e-03

Pos logit contributions
 collection+0.020
 wrists+0.019
 cand+0.019
 notebooks+0.018
 traded+0.018
Neg logit contributions
 reckless-0.025
 frightening-0.023
 cal-0.022
 afraid-0.022
Little-0.022
Token some
Token ID617
Feature activation+2.61e-02

Pos logit contributions
 reckless+0.003
 frightening+0.002
 afraid+0.002
 cal+0.002
 gut+0.002
Neg logit contributions
 wrists-0.002
 collection-0.002
 cand-0.002
 mar-0.002
 notebooks-0.002
Token things
Token ID1243
Feature activation-3.01e-02

Pos logit contributions
 cand+0.035
 collection+0.035
 wrists+0.033
 mar+0.032
 notebooks+0.031
Neg logit contributions
 reckless-0.056
 frightening-0.051
 afraid-0.050
 cal-0.049
 gut-0.049
Token to
Token ID284
Feature activation-8.55e-03

Pos logit contributions
 reckless+0.052
 frightening+0.048
 cal+0.046
 afraid+0.045
Little+0.045
Neg logit contributions
 collection-0.051
 wrists-0.050
 cand-0.050
 traded-0.046
 mar-0.046
Token was
Token ID373
Feature activation+2.40e-03

Pos logit contributions
 wrists+0.038
 notebooks+0.035
uggets+0.034
 wrist+0.034
 clubs+0.033
Neg logit contributions
 reckless-0.030
 afraid-0.029
 frightening-0.028
Little-0.028
 startled-0.027
Token big
Token ID1263
Feature activation-1.45e-02

Pos logit contributions
 collection+0.004
 wrists+0.004
 cand+0.004
 notebooks+0.003
 traded+0.003
Neg logit contributions
 reckless-0.005
 frightening-0.004
 afraid-0.004
 cal-0.004
Little-0.004
Token and
Token ID290
Feature activation-1.21e-02

Pos logit contributions
 reckless+0.022
 frightening+0.020
 cal+0.019
 afraid+0.019
 beast+0.018
Neg logit contributions
 wrists-0.028
 collection-0.027
 notebooks-0.026
 cand-0.026
 traded-0.026
Token fancy
Token ID14996
Feature activation+3.83e-02

Pos logit contributions
 reckless+0.023
 frightening+0.021
 afraid+0.021
 cal+0.020
Little+0.020
Neg logit contributions
 collection-0.019
 wrists-0.018
 cand-0.018
 notebooks-0.017
 traded-0.017
Token,
Token ID11
Feature activation+5.81e-03

Pos logit contributions
 notebooks+0.079
 traded+0.078
 wrists+0.077
 clubs+0.070
 collection+0.070
Neg logit contributions
 reckless-0.043
 lightning-0.040
 frightening-0.040
 outage-0.040
 though-0.040
Token with
Token ID351
Feature activation+9.01e-02

Pos logit contributions
 collection+0.013
 wrists+0.012
 notebooks+0.012
 cand+0.012
 traded+0.012
Neg logit contributions
 reckless-0.007
 frightening-0.007
 afraid-0.007
 louder-0.006
 cal-0.006
Token a
Token ID257
Feature activation+8.62e-03

Pos logit contributions
 collection+0.139
 wrists+0.131
 wrist+0.127
 traded+0.125
 cand+0.120
Neg logit contributions
 reckless-0.165
 frightening-0.154
 afraid-0.150
 cal-0.150
 lightning-0.150
Token little
Token ID1310
Feature activation-1.80e-02

Pos logit contributions
 collection+0.014
 wrists+0.014
 cand+0.014
 traded+0.014
 notebooks+0.013
Neg logit contributions
 reckless-0.014
 frightening-0.014
 afraid-0.013
 cal-0.013
Little-0.013
Token kitchen
Token ID9592
Feature activation+8.26e-02

Pos logit contributions
 reckless+0.034
 frightening+0.032
 afraid+0.032
 cal+0.032
 lightning+0.031
Neg logit contributions
 wrists-0.026
 collection-0.026
 traded-0.025
 notebooks-0.025
 cand-0.024
Token inside
Token ID2641
Feature activation+2.11e-02

Pos logit contributions
 traded+0.151
 wrists+0.148
 notebooks+0.143
 clubs+0.140
 mar+0.138
Neg logit contributions
 reckless-0.100
 loud-0.095
 frightening-0.093
 afraid-0.091
 cal-0.089
Token.
Token ID13
Feature activation+6.01e-04

Pos logit contributions
 wrists+0.039
 collection+0.039
 cand+0.038
 traded+0.037
 notebooks+0.037
Neg logit contributions
 reckless-0.032
 frightening-0.029
 afraid-0.029
Little-0.028
 cal-0.028
Token.
Token ID13
Feature activation+2.62e-03

Pos logit contributions
 collection+0.143
 wrists+0.141
 cand+0.139
 traded+0.132
 notebooks+0.132
Neg logit contributions
 reckless-0.129
 frightening-0.115
 afraid-0.111
 cal-0.111
 gut-0.108
Token What
Token ID1867
Feature activation+1.38e-02

Pos logit contributions
 collection+0.004
 wrists+0.004
 cand+0.004
 traded+0.003
 coloring+0.003
Neg logit contributions
 reckless-0.005
 frightening-0.005
 cal-0.005
 afraid-0.004
 unnecessary-0.004
Token should
Token ID815
Feature activation+1.10e-02

Pos logit contributions
 collection+0.022
 cand+0.022
 wrists+0.021
 notebooks+0.020
 traded+0.020
Neg logit contributions
 reckless-0.025
 frightening-0.023
 cal-0.023
 gut-0.022
 afraid-0.022
Token I
Token ID314
Feature activation-6.55e-03

Pos logit contributions
 wrists+0.023
 collection+0.023
 cand+0.023
 traded+0.022
 notebooks+0.022
Neg logit contributions
 reckless-0.014
 frightening-0.012
 evil-0.012
 afraid-0.012
 beast-0.012
Token do
Token ID466
Feature activation+4.76e-02

Pos logit contributions
 reckless+0.012
 frightening+0.011
 cal+0.011
 afraid+0.011
Little+0.011
Neg logit contributions
 collection-0.010
 wrists-0.010
 cand-0.010
 traded-0.009
 mar-0.009
Token with
Token ID351
Feature activation+8.36e-02

Pos logit contributions
 collection+0.084
 wrists+0.082
 cand+0.081
 traded+0.077
 notebooks+0.076
Neg logit contributions
 reckless-0.080
 frightening-0.073
 cal-0.070
 afraid-0.070
Little-0.069
Token it
Token ID340
Feature activation+1.84e-03

Pos logit contributions
 collection+0.128
 wrists+0.126
 coloring+0.119
 cand+0.118
 mar+0.118
Neg logit contributions
 reckless-0.168
 frightening-0.153
 cal-0.151
 afraid-0.147
 eerie-0.146
Token?"
Token ID1701
Feature activation-4.77e-02

Pos logit contributions
 collection+0.004
 cand+0.004
 wrists+0.004
 coloring+0.004
 traded+0.003
Neg logit contributions
 reckless-0.003
 frightening-0.002
 cal-0.002
 gut-0.002
 beast-0.002
Token\n
Token ID198
Feature activation-4.12e-02

Pos logit contributions
 reckless+0.088
 frightening+0.081
 afraid+0.078
 cal+0.078
Little+0.077
Neg logit contributions
 collection-0.077
 wrists-0.075
 cand-0.074
 traded-0.069
 notebooks-0.069
Token\n
Token ID198
Feature activation-3.33e-02

Pos logit contributions
 reckless+0.105
 frightening+0.094
 evil+0.092
 unnecessary+0.092
 beast+0.091
Neg logit contributions
 wrists-0.038
 collection-0.038
 coloring-0.037
 dress-0.037
 wrist-0.037
TokenShe
Token ID3347
Feature activation-1.98e-02

Pos logit contributions
 reckless+0.062
 frightening+0.057
 unnecessary+0.057
 beast+0.056
 cal+0.055
Neg logit contributions
 wrists-0.051
 collection-0.051
 cand-0.049
 wrist-0.048
 traded-0.047
Token and
Token ID290
Feature activation+4.56e-03

Pos logit contributions
 wrists+0.050
 collection+0.050
 cand+0.050
 traded+0.048
 notebooks+0.047
Neg logit contributions
 reckless-0.061
 frightening-0.056
 afraid-0.055
 gut-0.054
 cal-0.054
Token had
Token ID550
Feature activation+6.52e-02

Pos logit contributions
 collection+0.008
 cand+0.008
 wrists+0.007
 notebooks+0.007
 traded+0.007
Neg logit contributions
 reckless-0.008
 frightening-0.007
 afraid-0.007
 cal-0.007
 gut-0.007
Token lots
Token ID6041
Feature activation-2.16e-02

Pos logit contributions
 collection+0.117
 wrists+0.114
 cand+0.113
 notebooks+0.107
 traded+0.106
Neg logit contributions
 reckless-0.109
 frightening-0.098
 cal-0.095
 afraid-0.095
 gut-0.092
Token of
Token ID286
Feature activation+2.74e-02

Pos logit contributions
 reckless+0.027
 frightening+0.023
 afraid+0.022
Little+0.022
 911+0.022
Neg logit contributions
 collection-0.048
 cand-0.047
 wrists-0.046
 mar-0.044
 notebooks-0.044
Token fun
Token ID1257
Feature activation+4.90e-02

Pos logit contributions
 collection+0.043
 wrists+0.043
 cand+0.041
 traded+0.040
 notebooks+0.039
Neg logit contributions
 reckless-0.050
 frightening-0.048
 afraid-0.046
 cal-0.045
 gut-0.044
Token with
Token ID351
Feature activation+8.47e-02

Pos logit contributions
 wrists+0.085
 traded+0.083
 cand+0.080
 collection+0.079
 extract+0.076
Neg logit contributions
 reckless-0.079
 frightening-0.078
 afraid-0.076
 lightning-0.071
 storms-0.070
Token it
Token ID340
Feature activation+2.35e-03

Pos logit contributions
 collection+0.152
 wrists+0.149
 cand+0.147
 notebooks+0.138
 mar+0.138
Neg logit contributions
 reckless-0.141
 frightening-0.127
 cal-0.122
 afraid-0.122
Little-0.120
Token in
Token ID287
Feature activation+3.64e-02

Pos logit contributions
 wrists+0.004
 cand+0.004
 collection+0.004
 notebooks+0.004
 traded+0.004
Neg logit contributions
 reckless-0.004
 frightening-0.004
 afraid-0.004
 cal-0.004
 evil-0.004
Token the
Token ID262
Feature activation-3.27e-03

Pos logit contributions
 collection+0.068
 wrists+0.068
 cand+0.067
 traded+0.064
 notebooks+0.063
Neg logit contributions
 reckless-0.057
 frightening-0.052
 afraid-0.050
 cal-0.050
Little-0.049
Token backyard
Token ID24296
Feature activation+3.41e-02

Pos logit contributions
 reckless+0.006
 frightening+0.006
 cal+0.005
 beast+0.005
 afraid+0.005
Neg logit contributions
 wrists-0.006
 collection-0.005
 cand-0.005
 traded-0.005
 notebooks-0.005
Token.
Token ID13
Feature activation+9.51e-03

Pos logit contributions
 wrists+0.058
 cand+0.057
 collection+0.056
 notebooks+0.054
 traded+0.054
Neg logit contributions
 reckless-0.058
 afraid-0.052
 frightening-0.052
 cal-0.051
 gut-0.049
Token Lily
Token ID20037
Feature activation+2.44e-02

Pos logit contributions
 collection+0.020
 wrists+0.020
 cand+0.019
 traded+0.018
 clubs+0.018
Neg logit contributions
 reckless-0.014
 afraid-0.013
Little-0.013
 frightening-0.013
 911-0.013
Token drew
Token ID9859
Feature activation+3.47e-02

Pos logit contributions
 wrists+0.034
 collection+0.033
 cand+0.032
 notebooks+0.029
 mar+0.028
Neg logit contributions
 reckless-0.049
 afraid-0.045
 frightening-0.045
Little-0.044
 cal-0.043
Token a
Token ID257
Feature activation-6.94e-03

Pos logit contributions
 collection+0.070
 wrists+0.069
 cand+0.066
 traded+0.064
 notebooks+0.062
Neg logit contributions
 reckless-0.048
 frightening-0.044
 afraid-0.043
 evil-0.042
Little-0.042
Token picture
Token ID4286
Feature activation+1.28e-02

Pos logit contributions
 reckless+0.012
 frightening+0.012
 afraid+0.011
 evil+0.011
 beast+0.011
Neg logit contributions
 wrists-0.011
 cand-0.011
 collection-0.011
 traded-0.010
 notebooks-0.010
Token of
Token ID286
Feature activation+3.30e-02

Pos logit contributions
 wrists+0.036
 collection+0.034
 cand+0.033
 notebooks+0.032
 traded+0.032
Neg logit contributions
 reckless-0.008
 frightening-0.007
 afraid-0.007
 lightning-0.007
 louder-0.007
Token a
Token ID257
Feature activation-3.97e-03

Pos logit contributions
 collection+0.068
 wrists+0.064
 traded+0.064
 cand+0.062
 notebooks+0.060
Neg logit contributions
 reckless-0.042
 lightning-0.040
 frightening-0.040
 afraid-0.039
 evil-0.038
Token cat
Token ID3797
Feature activation-3.12e-02

Pos logit contributions
 reckless+0.007
 frightening+0.006
 afraid+0.006
 beast+0.006
 evil+0.006
Neg logit contributions
 wrists-0.007
 collection-0.006
 cand-0.006
 traded-0.006
 notebooks-0.006
Token.
Token ID13
Feature activation+6.84e-03

Pos logit contributions
 reckless+0.044
 afraid+0.042
 beast+0.038
 frightening+0.038
 though+0.038
Neg logit contributions
 wrists-0.062
 collection-0.058
 notebooks-0.056
 cand-0.056
 extract-0.054
Token She
Token ID1375
Feature activation-1.96e-02

Pos logit contributions
 collection+0.012
 wrists+0.012
 cand+0.012
 notebooks+0.011
 traded+0.011
Neg logit contributions
 reckless-0.011
 frightening-0.010
 afraid-0.010
Little-0.010
 cal-0.010
Token showed
Token ID3751
Feature activation+4.28e-02

Pos logit contributions
 reckless+0.035
 afraid+0.034
 frightening+0.033
 evil+0.031
Little+0.031
Neg logit contributions
 collection-0.028
 wrists-0.028
 cand-0.027
 clubs-0.026
 notebooks-0.025
Token it
Token ID340
Feature activation-6.49e-03

Pos logit contributions
 cand+0.055
 wrists+0.054
 collection+0.052
 traded+0.049
 mar+0.048
Neg logit contributions
 reckless-0.097
 frightening-0.087
 outage-0.085
 gut-0.085
Far-0.084
Token are
Token ID389
Feature activation+6.65e-02

Pos logit contributions
 collection+0.062
 wrists+0.059
 cand+0.059
 mar+0.057
 traded+0.057
Neg logit contributions
 reckless-0.120
 frightening-0.115
 unnecessary-0.109
Little-0.109
 beast-0.109
Token happy
Token ID3772
Feature activation+3.40e-02

Pos logit contributions
 collection+0.131
 wrists+0.129
 cand+0.128
 traded+0.123
 mar+0.122
Neg logit contributions
 reckless-0.096
 frightening-0.086
Little-0.083
 beast-0.081
 cal-0.080
Token to
Token ID284
Feature activation-7.00e-03

Pos logit contributions
 collection+0.050
 cand+0.049
 wrists+0.049
 mar+0.045
 notebooks+0.045
Neg logit contributions
 reckless-0.068
 frightening-0.062
Little-0.060
 afraid-0.060
 cal-0.060
Token see
Token ID766
Feature activation-2.60e-02

Pos logit contributions
 reckless+0.009
 frightening+0.008
 cal+0.007
Little+0.007
 afraid+0.007
Neg logit contributions
 collection-0.016
 wrists-0.015
 cand-0.015
 notebooks-0.015
 mar-0.014
Token him
Token ID683
Feature activation-3.49e-02

Pos logit contributions
 reckless+0.054
 frightening+0.050
 afraid+0.048
 gut+0.048
 beast+0.047
Neg logit contributions
 cand-0.036
 collection-0.036
 wrists-0.035
 traded-0.033
 mar-0.033
Token.
Token ID13
Feature activation+2.22e-02

Pos logit contributions
 reckless+0.067
 frightening+0.061
 cal+0.059
 afraid+0.059
Little+0.058
Neg logit contributions
 collection-0.054
 wrists-0.053
 cand-0.052
 traded-0.049
 notebooks-0.049
Token They
Token ID1119
Feature activation+5.32e-02

Pos logit contributions
 wrists+0.018
 collection+0.017
 cand+0.017
 traded+0.016
 notebooks+0.015
Neg logit contributions
 reckless-0.057
 frightening-0.054
 cal-0.053
 beast-0.052
 evil-0.052
Token want
Token ID765
Feature activation-1.22e-02

Pos logit contributions
 collection+0.072
 cand+0.069
 wrists+0.068
 traded+0.066
 mar+0.065
Neg logit contributions
 reckless-0.111
 frightening-0.104
Little-0.099
 beast-0.099
 gut-0.098
Token to
Token ID284
Feature activation-8.85e-03

Pos logit contributions
 reckless+0.023
 frightening+0.021
 afraid+0.020
 cal+0.020
Little+0.020
Neg logit contributions
 collection-0.019
 cand-0.019
 wrists-0.019
 notebooks-0.018
 traded-0.018
Token show
Token ID905
Feature activation+3.38e-02

Pos logit contributions
 reckless+0.018
 frightening+0.016
 cal+0.016
 afraid+0.016
Little+0.016
Neg logit contributions
 collection-0.013
 wrists-0.012
 cand-0.012
 notebooks-0.011
 traded-0.011
Token him
Token ID683
Feature activation-1.62e-02

Pos logit contributions
 cand+0.047
 wrists+0.046
 collection+0.046
 notebooks+0.043
 traded+0.043
Neg logit contributions
 reckless-0.070
 frightening-0.065
 afraid-0.063
 cal-0.061
 beast-0.061
Token?"
Token ID1701
Feature activation-4.05e-02

Pos logit contributions
 collection+0.056
 cand+0.053
 wrists+0.052
 traded+0.052
 coloring+0.051
Neg logit contributions
 reckless-0.034
 frightening-0.031
 gut-0.030
 cal-0.030
 unnecessary-0.029
Token
Token ID220
Feature activation-8.69e-03

Pos logit contributions
 reckless+0.064
 afraid+0.063
Little+0.063
 911+0.062
 frightening+0.061
Neg logit contributions
 collection-0.072
 wrists-0.066
 cand-0.065
 notebooks-0.065
 clubs-0.064
Token
Token ID198
Feature activation-3.54e-02

Pos logit contributions
 reckless+0.020
 frightening+0.019
 beast+0.019
 evil+0.018
 eerie+0.018
Neg logit contributions
 collection-0.010
 wrists-0.009
 cand-0.009
 coloring-0.009
 traded-0.009
Token
Token ID198
Feature activation-2.73e-02

Pos logit contributions
 reckless+0.070
 frightening+0.063
 beast+0.062
 afraid+0.061
 cal+0.061
Neg logit contributions
 collection-0.052
 wrists-0.052
 cand-0.051
 traded-0.049
 mar-0.048
Token"
Token ID1
Feature activation+3.55e-02

Pos logit contributions
 reckless+0.048
 frightening+0.044
 beast+0.042
 afraid+0.042
 cal+0.042
Neg logit contributions
 collection-0.046
 wrists-0.045
 cand-0.045
 traded-0.043
 mar-0.042
TokenI
Token ID40
Feature activation+2.92e-03

Pos logit contributions
 wrists+0.054
 cand+0.053
 collection+0.051
 mar+0.051
 traded+0.051
Neg logit contributions
 reckless-0.064
 911-0.060
 beast-0.060
 cal-0.060
 gut-0.059
Token can
Token ID460
Feature activation-4.85e-03

Pos logit contributions
 collection+0.005
 traded+0.004
 mar+0.004
 coloring+0.004
 drew+0.004
Neg logit contributions
 reckless-0.005
 911-0.005
 beast-0.005
 eerie-0.005
 unnecessary-0.005
Token't
Token ID470
Feature activation-6.34e-02

Pos logit contributions
 reckless+0.008
 frightening+0.007
 afraid+0.007
 cal+0.007
Little+0.007
Neg logit contributions
 collection-0.009
 wrists-0.009
 cand-0.009
 notebooks-0.008
 traded-0.008
Token find
Token ID1064
Feature activation-1.29e-02

Pos logit contributions
 reckless+0.127
 frightening+0.117
 cal+0.114
 afraid+0.112
 gut+0.112
Neg logit contributions
 collection-0.094
 cand-0.089
 wrists-0.089
 traded-0.083
 notebooks-0.082
Token my
Token ID616
Feature activation+5.65e-02

Pos logit contributions
 reckless+0.023
 frightening+0.020
 cal+0.020
 gut+0.019
 afraid+0.019
Neg logit contributions
 collection-0.022
 cand-0.022
 wrists-0.022
 mar-0.020
 notebooks-0.020
Token chalk
Token ID30860
Feature activation+5.10e-02

Pos logit contributions
 wrists+0.072
 cand+0.071
 traded+0.066
 collection+0.063
 extract+0.061
Neg logit contributions
 reckless-0.111
 frightening-0.109
 afraid-0.107
 lightning-0.105
 evil-0.104
Token and
Token ID290
Feature activation-1.79e-02

Pos logit contributions
 wrists+0.026
 collection+0.026
 cand+0.025
 notebooks+0.024
 traded+0.023
Neg logit contributions
 reckless-0.034
 frightening-0.031
 afraid-0.031
 cal-0.030
 911-0.030
Token run
Token ID1057
Feature activation-2.13e-02

Pos logit contributions
 reckless+0.036
 frightening+0.034
 afraid+0.034
 cal+0.033
 gut+0.032
Neg logit contributions
 collection-0.024
 cand-0.024
 wrists-0.024
 notebooks-0.022
 mar-0.021
Token in
Token ID287
Feature activation+1.40e-02

Pos logit contributions
 reckless+0.036
 frightening+0.033
 afraid+0.032
 cal+0.031
 gut+0.030
Neg logit contributions
 collection-0.037
 wrists-0.037
 cand-0.036
 notebooks-0.035
 traded-0.034
Token the
Token ID262
Feature activation-1.19e-02

Pos logit contributions
 collection+0.021
 wrists+0.020
 cand+0.019
 mar+0.019
 coloring+0.018
Neg logit contributions
 reckless-0.029
 frightening-0.025
 gut-0.024
 afraid-0.024
 beast-0.024
Token park
Token ID3952
Feature activation+1.89e-02

Pos logit contributions
 reckless+0.023
 frightening+0.022
 cal+0.021
 afraid+0.021
Little+0.021
Neg logit contributions
 collection-0.018
 wrists-0.018
 cand-0.017
 traded-0.016
 notebooks-0.016
Token.
Token ID13
Feature activation+1.37e-02

Pos logit contributions
 cand+0.033
 wrists+0.032
 collection+0.032
 notebooks+0.031
 traded+0.030
Neg logit contributions
 reckless-0.031
 afraid-0.028
 frightening-0.028
 cal-0.027
 louder-0.026
Token One
Token ID1881
Feature activation-2.05e-02

Pos logit contributions
 collection+0.021
 wrists+0.020
 cand+0.020
 notebooks+0.018
 traded+0.018
Neg logit contributions
 reckless-0.027
 frightening-0.024
 afraid-0.024
 cal-0.024
Little-0.023
Token day
Token ID1110
Feature activation+3.04e-02

Pos logit contributions
 reckless+0.056
 frightening+0.052
 cal+0.052
 gut+0.051
 911+0.051
Neg logit contributions
 collection-0.015
 cand-0.013
 wrists-0.012
 mar-0.011
 traded-0.011
Token,
Token ID11
Feature activation+1.83e-02

Pos logit contributions
 collection+0.040
 wrists+0.039
 cand+0.039
 traded+0.036
 mar+0.036
Neg logit contributions
 reckless-0.065
 frightening-0.060
 cal-0.058
 afraid-0.058
 gut-0.057
Token Max
Token ID5436
Feature activation+1.00e-02

Pos logit contributions
 collection+0.030
 wrists+0.030
 cand+0.029
 notebooks+0.027
 traded+0.027
Neg logit contributions
 reckless-0.033
 frightening-0.030
 afraid-0.029
Little-0.029
 cal-0.028
Token saw
Token ID2497
Feature activation-4.19e-02

Pos logit contributions
 collection+0.012
 traded+0.011
 cand+0.011
 wrists+0.011
 mar+0.010
Neg logit contributions
 reckless-0.023
 frightening-0.021
 gut-0.021
 911-0.021
Little-0.020
Token and
Token ID290
Feature activation-3.37e-02

Pos logit contributions
 collection+0.062
 cand+0.061
 notebooks+0.060
 wrists+0.060
 extract+0.056
Neg logit contributions
 reckless-0.051
 louder-0.047
 cal-0.047
 afraid-0.046
 frightening-0.046
Token said
Token ID531
Feature activation+3.76e-02

Pos logit contributions
 reckless+0.056
 cal+0.052
 frightening+0.051
 afraid+0.051
 louder+0.049
Neg logit contributions
 collection-0.058
 cand-0.056
 wrists-0.054
 notebooks-0.053
 extract-0.052
Token,
Token ID11
Feature activation+3.50e-03

Pos logit contributions
 collection+0.054
 wrists+0.053
 cand+0.052
 notebooks+0.048
 traded+0.048
Neg logit contributions
 reckless-0.076
 frightening-0.070
 cal-0.068
 afraid-0.068
 gut-0.067
Token �
Token ID6184
Feature activation-5.07e-03

Pos logit contributions
 collection+0.006
 wrists+0.006
 cand+0.006
 notebooks+0.005
 traded+0.005
Neg logit contributions
 reckless-0.006
 frightening-0.006
 afraid-0.006
 cal-0.006
Little-0.006
Token�
Token ID95
Feature activation+1.83e-02

Pos logit contributions
 reckless+0.009
 frightening+0.008
 cal+0.008
Little+0.008
 gut+0.008
Neg logit contributions
 cand-0.009
 collection-0.009
 wrists-0.008
 notebooks-0.008
 coloring-0.008
Token€
Token ID26391
Feature activation+3.98e-02

Pos logit contributions
 cand+0.034
 collection+0.033
 wrists+0.032
 notebooks+0.030
 mar+0.030
Neg logit contributions
 reckless-0.031
Little-0.028
 cal-0.028
 frightening-0.027
 beast-0.027
Token�
Token ID129
Feature activation+6.46e-04

Pos logit contributions
 collection+0.066
 wrists+0.063
 cand+0.061
 traded+0.059
 mar+0.058
Neg logit contributions
 reckless-0.072
 frightening-0.067
 cal-0.063
 afraid-0.063
 911-0.063
Token�
Token ID241
Feature activation+1.03e-02

Pos logit contributions
 collection+0.001
 cand+0.001
 wrists+0.001
 notebooks+0.001
 mar+0.001
Neg logit contributions
 reckless-0.001
 frightening-0.001
 cal-0.001
 afraid-0.001
Little-0.001
TokenApp
Token ID4677
Feature activation+1.45e-02

Pos logit contributions
 collection+0.018
 wrists+0.017
 cand+0.017
 traded+0.016
 notebooks+0.016
Neg logit contributions
 reckless-0.018
 frightening-0.016
 cal-0.015
 afraid-0.015
 gut-0.015
Tokenl
Token ID75
Feature activation-5.09e-02

Pos logit contributions
 collection+0.029
 cand+0.029
 wrists+0.028
 traded+0.027
 mar+0.027
Neg logit contributions
 reckless-0.021
 frightening-0.018
 afraid-0.017
 gut-0.017
 cal-0.017
Tokenaud
Token ID3885
Feature activation+4.10e-02

Pos logit contributions
 reckless+0.095
 frightening+0.087
 cal+0.084
 afraid+0.084
Little+0.082
Neg logit contributions
 collection-0.080
 wrists-0.078
 cand-0.077
 notebooks-0.072
 traded-0.072
Token do
Token ID466
Feature activation+2.35e-02

Pos logit contributions
 reckless+0.012
 frightening+0.011
 cal+0.011
 afraid+0.011
Little+0.011
Neg logit contributions
 collection-0.009
 wrists-0.009
 cand-0.009
 notebooks-0.008
 traded-0.008
Token.
Token ID13
Feature activation+6.69e-03

Pos logit contributions
 collection+0.036
 wrists+0.036
 cand+0.035
 traded+0.033
 notebooks+0.033
Neg logit contributions
 reckless-0.045
 frightening-0.041
 afraid-0.039
 cal-0.039
Little-0.039
Token Lucy
Token ID22162
Feature activation+2.91e-02

Pos logit contributions
 collection+0.010
 wrists+0.010
 cand+0.010
 traded+0.009
 notebooks+0.009
Neg logit contributions
 reckless-0.013
 frightening-0.012
 cal-0.011
 afraid-0.011
 gut-0.011
Token said
Token ID531
Feature activation+4.91e-02

Pos logit contributions
 collection+0.042
 wrists+0.041
 cand+0.041
 notebooks+0.038
 traded+0.037
Neg logit contributions
 reckless-0.058
 frightening-0.053
 cal-0.052
 afraid-0.051
Little-0.051
Token yes
Token ID3763
Feature activation+2.19e-02

Pos logit contributions
 collection+0.071
 cand+0.068
 wrists+0.068
 traded+0.063
 mar+0.062
Neg logit contributions
 reckless-0.098
 frightening-0.092
Little-0.088
 beast-0.088
 afraid-0.087
Token and
Token ID290
Feature activation-1.32e-02

Pos logit contributions
 collection+0.037
 wrists+0.037
 cand+0.036
 notebooks+0.035
 mar+0.034
Neg logit contributions
 reckless-0.038
 frightening-0.035
 afraid-0.034
 cal-0.034
Little-0.033
Token followed
Token ID3940
Feature activation-2.96e-02

Pos logit contributions
 reckless+0.023
 cal+0.021
 frightening+0.020
 afraid+0.020
Little+0.020
Neg logit contributions
 wrists-0.022
 collection-0.021
 cand-0.021
 notebooks-0.020
 wrist-0.019
Token Jack
Token ID3619
Feature activation-5.10e-05

Pos logit contributions
 reckless+0.045
 frightening+0.042
 911+0.041
 cal+0.041
Little+0.040
Neg logit contributions
 collection-0.055
 wrists-0.054
 cand-0.054
 notebooks-0.051
 traded-0.051
Token.
Token ID13
Feature activation+1.41e-02

Pos logit contributions
 reckless+0.000
 afraid+0.000
 frightening+0.000
 nearer+0.000
 cal+0.000
Neg logit contributions
 cand-0.000
 wrists-0.000
 notebooks-0.000
 collection-0.000
 traded-0.000
Token
Token ID198
Feature activation-1.77e-02

Pos logit contributions
 collection+0.024
 cand+0.023
 wrists+0.023
 notebooks+0.021
 mar+0.021
Neg logit contributions
 reckless-0.024
Little-0.023
 frightening-0.022
 911-0.022
Down-0.022
Token
Token ID198
Feature activation-1.35e-02

Pos logit contributions
 reckless+0.040
 frightening+0.036
 evil+0.035
 afraid+0.035
 gut+0.035
Neg logit contributions
 collection-0.021
 mar-0.020
 wrists-0.020
 traded-0.020
 cand-0.020
Token street
Token ID4675
Feature activation-6.13e-04

Pos logit contributions
 reckless+0.076
 frightening+0.070
 cal+0.068
 afraid+0.068
Little+0.067
Neg logit contributions
 collection-0.041
 wrists-0.040
 cand-0.039
 traded-0.036
 notebooks-0.035
Token in
Token ID287
Feature activation+2.31e-02

Pos logit contributions
 reckless+0.001
 frightening+0.001
 afraid+0.001
 cal+0.001
 gut+0.001
Neg logit contributions
 collection-0.001
 cand-0.001
 wrists-0.001
 notebooks-0.001
 traded-0.001
Token a
Token ID257
Feature activation-1.94e-02

Pos logit contributions
 collection+0.042
 wrists+0.041
 cand+0.041
 notebooks+0.038
 traded+0.038
Neg logit contributions
 reckless-0.038
 frightening-0.034
 cal-0.033
 afraid-0.032
 gut-0.032
Token row
Token ID5752
Feature activation+1.80e-02

Pos logit contributions
 reckless+0.031
 frightening+0.029
 cal+0.028
 afraid+0.027
Little+0.027
Neg logit contributions
 collection-0.036
 wrists-0.035
 cand-0.035
 notebooks-0.033
 traded-0.033
Token,
Token ID11
Feature activation+4.79e-03

Pos logit contributions
 collection+0.035
 wrists+0.034
 cand+0.034
 notebooks+0.032
 traded+0.032
Neg logit contributions
 reckless-0.027
 frightening-0.024
 afraid-0.023
 cal-0.023
Little-0.023
Token when
Token ID618
Feature activation-4.62e-02

Pos logit contributions
 collection+0.008
 cand+0.008
 wrists+0.008
 notebooks+0.008
 traded+0.007
Neg logit contributions
 reckless-0.008
 frightening-0.008
 afraid-0.008
 cal-0.007
Little-0.007
Token the
Token ID262
Feature activation-2.35e-02

Pos logit contributions
 reckless+0.092
 frightening+0.085
 cal+0.082
 afraid+0.082
Little+0.081
Neg logit contributions
 collection-0.067
 wrists-0.065
 cand-0.065
 traded-0.060
 notebooks-0.060
Token younger
Token ID7099
Feature activation-4.61e-02

Pos logit contributions
 reckless+0.033
 frightening+0.029
 cal+0.028
 afraid+0.028
Little+0.028
Neg logit contributions
 collection-0.047
 wrists-0.046
 cand-0.046
 mar-0.044
 traded-0.044
Token sister
Token ID6621
Feature activation-2.67e-02

Pos logit contributions
 reckless+0.083
 frightening+0.074
 afraid+0.073
 cal+0.072
 beast+0.071
Neg logit contributions
 collection-0.077
 wrists-0.075
 cand-0.074
 notebooks-0.070
 traded-0.070
Token said
Token ID531
Feature activation+3.60e-02

Pos logit contributions
 reckless+0.051
 frightening+0.047
 afraid+0.046
 cal+0.046
Little+0.045
Neg logit contributions
 collection-0.040
 wrists-0.040
 cand-0.039
 notebooks-0.036
 traded-0.036
Token to
Token ID284
Feature activation+1.51e-03

Pos logit contributions
 collection+0.057
 wrists+0.055
 cand+0.054
 notebooks+0.051
 traded+0.051
Neg logit contributions
 reckless-0.068
 frightening-0.062
 cal-0.060
 afraid-0.060
 gut-0.059
Token tree
Token ID5509
Feature activation-1.67e-02

Pos logit contributions
 reckless+0.027
 frightening+0.024
 afraid+0.023
 cal+0.023
 gut+0.023
Neg logit contributions
 collection-0.019
 wrists-0.019
 cand-0.019
 mar-0.017
 coloring-0.017
Token and
Token ID290
Feature activation-1.81e-02

Pos logit contributions
 reckless+0.028
 afraid+0.026
 frightening+0.025
 cal+0.025
 gut+0.024
Neg logit contributions
 wrists-0.028
 cand-0.028
 collection-0.028
 notebooks-0.027
 traded-0.026
Token lay
Token ID3830
Feature activation-2.83e-02

Pos logit contributions
 reckless+0.036
 frightening+0.033
 cal+0.032
 afraid+0.032
Little+0.031
Neg logit contributions
 collection-0.027
 wrists-0.026
 cand-0.026
 traded-0.024
 notebooks-0.024
Token down
Token ID866
Feature activation-2.68e-02

Pos logit contributions
 reckless+0.055
 frightening+0.051
 cal+0.049
Little+0.049
 beast+0.048
Neg logit contributions
 wrists-0.042
 collection-0.042
 cand-0.040
 traded-0.038
 mar-0.038
Token for
Token ID329
Feature activation-1.37e-02

Pos logit contributions
 reckless+0.059
 frightening+0.054
Little+0.052
 beast+0.052
 gut+0.052
Neg logit contributions
 collection-0.034
 wrists-0.033
 cand-0.032
 traded-0.030
 mar-0.030
Token a
Token ID257
Feature activation-2.06e-02

Pos logit contributions
 reckless+0.024
 frightening+0.022
 afraid+0.021
 cal+0.021
Little+0.021
Neg logit contributions
 collection-0.023
 wrists-0.023
 cand-0.023
 traded-0.021
 mar-0.021
Token nap
Token ID25422
Feature activation+4.30e-02

Pos logit contributions
 reckless+0.042
 frightening+0.037
 cal+0.036
 afraid+0.036
 911+0.036
Neg logit contributions
 collection-0.030
 cand-0.029
 wrists-0.028
 mar-0.027
 sandwich-0.026
Token.
Token ID13
Feature activation+2.49e-03

Pos logit contributions
 wrists+0.067
 collection+0.066
 cand+0.066
 traded+0.062
 notebooks+0.062
Neg logit contributions
 reckless-0.081
 frightening-0.073
 afraid-0.072
 gut-0.071
 cal-0.070
Token Suddenly
Token ID24975
Feature activation-4.10e-02

Pos logit contributions
 collection+0.004
 wrists+0.004
 cand+0.004
 traded+0.004
 mar+0.003
Neg logit contributions
 reckless-0.005
 frightening-0.004
 afraid-0.004
 cal-0.004
Little-0.004
Token,
Token ID11
Feature activation+4.16e-03

Pos logit contributions
 reckless+0.113
rig+0.109
 outage+0.106
umpy+0.103
 frightening+0.103
Neg logit contributions
 collection-0.030
 cand-0.029
 mar-0.028
 sandwich-0.022
 traded-0.022
Token he
Token ID339
Feature activation-2.88e-02

Pos logit contributions
 mar+0.005
 cand+0.004
 collection+0.004
 juice+0.004
 sandwich+0.004
Neg logit contributions
 reckless-0.010
 outage-0.009
rig-0.009
umpy-0.009
ending-0.009
Token it
Token ID340
Feature activation+9.74e-03

Pos logit contributions
 collection+0.056
 wrists+0.055
 cand+0.054
 mar+0.052
 notebooks+0.050
Neg logit contributions
 reckless-0.065
 frightening-0.058
 afraid-0.057
 cal-0.056
 gut-0.055
Token now
Token ID783
Feature activation+1.08e-02

Pos logit contributions
 collection+0.020
 cand+0.019
 wrists+0.018
 coloring+0.018
 sandwich+0.018
Neg logit contributions
 reckless-0.015
 frightening-0.013
 afraid-0.012
 gut-0.012
 cal-0.012
Token?"
Token ID1701
Feature activation-4.38e-02

Pos logit contributions
 collection+0.023
 wrists+0.021
 cand+0.021
 traded+0.021
 coloring+0.020
Neg logit contributions
 reckless-0.015
 frightening-0.013
 afraid-0.013
 gut-0.013
 cal-0.013
Token
Token ID198
Feature activation-3.53e-02

Pos logit contributions
 reckless+0.082
Little+0.077
 cal+0.077
 afraid+0.075
 frightening+0.075
Neg logit contributions
 collection-0.067
 cand-0.064
 wrists-0.064
 notebooks-0.060
 clubs-0.057
TokenHis
Token ID6653
Feature activation+1.46e-02

Pos logit contributions
 reckless+0.070
 frightening+0.065
 afraid+0.063
 unnecessary+0.062
 evil+0.061
Neg logit contributions
 collection-0.050
 wrists-0.050
 traded-0.049
 cand-0.048
 mar-0.048
Token mom
Token ID1995
Feature activation-9.97e-03

Pos logit contributions
 collection+0.023
 cand+0.023
 wrists+0.023
 traded+0.021
 notebooks+0.021
Neg logit contributions
 reckless-0.027
 frightening-0.025
 cal-0.024
 afraid-0.024
 gut-0.024
Token laughed
Token ID13818
Feature activation+4.23e-02

Pos logit contributions
 reckless+0.019
 frightening+0.017
 cal+0.017
Little+0.017
 afraid+0.017
Neg logit contributions
 collection-0.016
 wrists-0.015
 cand-0.015
 notebooks-0.014
 mar-0.014
Token and
Token ID290
Feature activation-1.66e-02

Pos logit contributions
 collection+0.076
 wrists+0.074
 cand+0.073
 notebooks+0.069
 traded+0.068
Neg logit contributions
 reckless-0.070
 frightening-0.064
 cal-0.062
 afraid-0.062
Little-0.060
Token said
Token ID531
Feature activation+4.70e-02

Pos logit contributions
 reckless+0.029
 cal+0.027
 frightening+0.026
Little+0.026
 afraid+0.026
Neg logit contributions
 collection-0.029
 cand-0.027
 wrists-0.027
 notebooks-0.026
 clubs-0.025
Token,
Token ID11
Feature activation+1.14e-02

Pos logit contributions
 collection+0.070
 wrists+0.069
 cand+0.067
 notebooks+0.064
 traded+0.063
Neg logit contributions
 reckless-0.092
 frightening-0.083
 cal-0.082
 afraid-0.081
 gut-0.080
Token "
Token ID366
Feature activation-9.60e-03

Pos logit contributions
 collection+0.018
 wrists+0.017
 cand+0.017
 notebooks+0.016
 traded+0.016
Neg logit contributions
 reckless-0.021
 frightening-0.019
 afraid-0.019
 cal-0.019
Little-0.019
Token the
Token ID262
Feature activation-1.70e-02

Pos logit contributions
 collection+0.024
 wrists+0.024
 cand+0.023
 notebooks+0.022
 traded+0.022
Neg logit contributions
 reckless-0.021
 frightening-0.018
 afraid-0.018
 cal-0.018
 gut-0.017
Token deep
Token ID2769
Feature activation-3.73e-02

Pos logit contributions
 reckless+0.038
 frightening+0.035
 afraid+0.034
 cal+0.034
 gut+0.034
Neg logit contributions
 collection-0.020
 wrists-0.019
 cand-0.019
 traded-0.018
 mar-0.018
Token woods
Token ID16479
Feature activation-5.58e-02

Pos logit contributions
 reckless+0.077
 frightening+0.068
 afraid+0.067
 911+0.066
 gut+0.066
Neg logit contributions
 collection-0.055
 wrists-0.051
 cand-0.051
 mar-0.048
 notebooks-0.048
Token.
Token ID13
Feature activation-6.04e-03

Pos logit contributions
 reckless+0.123
 frightening+0.114
Little+0.111
 cal+0.110
 gut+0.109
Neg logit contributions
 collection-0.070
 wrists-0.065
 cand-0.063
 traded-0.060
 coloring-0.059
Token As
Token ID1081
Feature activation-7.00e-02

Pos logit contributions
 reckless+0.012
 frightening+0.011
 cal+0.010
 afraid+0.010
Little+0.010
Neg logit contributions
 collection-0.009
 wrists-0.009
 cand-0.009
 traded-0.008
 notebooks-0.008
Token she
Token ID673
Feature activation-3.32e-02

Pos logit contributions
 reckless+0.177
 beast+0.155
 frightening+0.154
 gut+0.154
 outage+0.153
Neg logit contributions
 collection-0.072
 cand-0.067
 coloring-0.065
 mar-0.062
 tea-0.061
Token walked
Token ID6807
Feature activation-3.53e-02

Pos logit contributions
 reckless+0.079
 frightening+0.073
 afraid+0.070
 911+0.070
 cal+0.070
Neg logit contributions
 collection-0.038
 cand-0.035
 wrists-0.035
 mar-0.032
 traded-0.032
Token,
Token ID11
Feature activation-6.80e-03

Pos logit contributions
 reckless+0.080
 frightening+0.072
 cal+0.070
Little+0.070
 afraid+0.069
Neg logit contributions
 collection-0.044
 cand-0.041
 wrists-0.040
 mar-0.038
 traded-0.037
Token she
Token ID673
Feature activation-3.23e-02

Pos logit contributions
 reckless+0.016
 frightening+0.014
 outage+0.014
 cal+0.014
 gut+0.014
Neg logit contributions
 collection-0.008
 cand-0.008
 wrists-0.007
 coloring-0.007
 mar-0.007
Token noticed
Token ID6810
Feature activation-7.11e-02

Pos logit contributions
 reckless+0.073
 911+0.069
 unnecessary+0.067
 frightening+0.067
 outage+0.067
Neg logit contributions
 collection-0.041
 traded-0.038
 mar-0.035
 drew-0.034
 dress-0.034
Token something
Token ID1223
Feature activation-5.39e-02

Pos logit contributions
 reckless+0.156
umpy+0.146
 outage+0.141
rated+0.139
rig+0.139
Neg logit contributions
 cand-0.091
 mar-0.082
 collection-0.082
 dress-0.081
 wrists-0.079
Token sunset
Token ID26428
Feature activation-5.98e-03

Pos logit contributions
 wrists+0.103
 cand+0.099
 collection+0.099
 traded+0.096
 notebooks+0.095
Neg logit contributions
 reckless-0.101
 beast-0.096
 frightening-0.095
 gut-0.092
 cal-0.091
Token was
Token ID373
Feature activation-9.72e-03

Pos logit contributions
 reckless+0.011
 frightening+0.010
 cal+0.010
 beast+0.009
 afraid+0.009
Neg logit contributions
 wrists-0.010
 cand-0.009
 notebooks-0.009
 collection-0.009
 mar-0.009
Token a
Token ID257
Feature activation-9.58e-03

Pos logit contributions
 reckless+0.018
 frightening+0.016
 cal+0.016
 gut+0.016
Little+0.016
Neg logit contributions
 collection-0.016
 wrists-0.015
 cand-0.015
 traded-0.014
 mar-0.014
Token sign
Token ID1051
Feature activation+7.30e-03

Pos logit contributions
 reckless+0.016
 frightening+0.014
 cal+0.014
 afraid+0.014
Little+0.013
Neg logit contributions
 collection-0.017
 wrists-0.017
 cand-0.017
 traded-0.016
 notebooks-0.016
Token of
Token ID286
Feature activation+2.32e-02

Pos logit contributions
 wrists+0.014
 collection+0.014
 cand+0.013
 notebooks+0.013
 mar+0.013
Neg logit contributions
 reckless-0.011
 frightening-0.010
 cal-0.010
 afraid-0.010
 evil-0.009
Token danger
Token ID3514
Feature activation-1.08e-01

Pos logit contributions
 collection+0.043
 wrists+0.041
 cand+0.040
 traded+0.039
 notebooks+0.038
Neg logit contributions
 reckless-0.037
 frightening-0.035
 afraid-0.033
 cal-0.033
 evil-0.032
Token.
Token ID13
Feature activation+1.08e-02

Pos logit contributions
 reckless+0.210
 frightening+0.193
 afraid+0.186
 cal+0.186
Little+0.183
Neg logit contributions
 collection-0.162
 wrists-0.158
 cand-0.156
 traded-0.145
 notebooks-0.145
Token A
Token ID317
Feature activation-8.80e-03

Pos logit contributions
 collection+0.016
 wrists+0.016
 cand+0.016
 traded+0.015
 mar+0.014
Neg logit contributions
 reckless-0.021
 frightening-0.019
 cal-0.019
 afraid-0.019
 gut-0.018
Token huge
Token ID3236
Feature activation-2.89e-03

Pos logit contributions
 reckless+0.013
 frightening+0.011
 cal+0.011
Little+0.011
 gut+0.011
Neg logit contributions
 collection-0.018
 cand-0.017
 wrists-0.017
 mar-0.016
 traded-0.016
Token storm
Token ID6388
Feature activation-8.37e-02

Pos logit contributions
 reckless+0.004
 frightening+0.004
 beast+0.004
 afraid+0.004
 cal+0.004
Neg logit contributions
 wrists-0.005
 collection-0.005
 cand-0.005
 notebooks-0.005
 traded-0.005
Token was
Token ID373
Feature activation+3.23e-03

Pos logit contributions
 reckless+0.178
 gut+0.163
 unnecessary+0.160
 frightening+0.160
 911+0.158
Neg logit contributions
 collection-0.107
 cand-0.101
 traded-0.098
 wrists-0.097
 mar-0.095
Token find
Token ID1064
Feature activation-1.99e-02

Pos logit contributions
 reckless+0.082
 afraid+0.075
 frightening+0.075
 gut+0.073
 cal+0.073
Neg logit contributions
 wrists-0.078
 cand-0.076
 notebooks-0.073
 collection-0.073
 traded-0.069
Token it
Token ID340
Feature activation-1.76e-02

Pos logit contributions
 reckless+0.034
 frightening+0.031
 cal+0.030
 afraid+0.030
Little+0.029
Neg logit contributions
 collection-0.035
 wrists-0.034
 cand-0.034
 traded-0.032
 notebooks-0.032
Token,
Token ID11
Feature activation+1.22e-03

Pos logit contributions
 reckless+0.033
 frightening+0.030
 afraid+0.030
 cal+0.029
 gut+0.029
Neg logit contributions
 wrists-0.028
 collection-0.027
 cand-0.027
 notebooks-0.026
 traded-0.025
Token but
Token ID475
Feature activation-2.99e-02

Pos logit contributions
 collection+0.002
 wrists+0.002
 cand+0.002
 traded+0.002
 notebooks+0.002
Neg logit contributions
 reckless-0.002
 frightening-0.002
 afraid-0.002
 cal-0.002
Little-0.002
Token they
Token ID484
Feature activation+4.66e-02

Pos logit contributions
 reckless+0.051
 frightening+0.047
 afraid+0.046
 gut+0.043
 911+0.043
Neg logit contributions
 wrists-0.054
 collection-0.052
 cand-0.051
 notebooks-0.049
 wrist-0.048
Token couldn
Token ID3521
Feature activation-1.29e-01

Pos logit contributions
 wrists+0.086
 collection+0.086
 cand+0.085
 notebooks+0.081
 mar+0.078
Neg logit contributions
 reckless-0.075
 frightening-0.067
 afraid-0.065
 cal-0.063
 gut-0.062
Token't
Token ID470
Feature activation-8.63e-02

Pos logit contributions
 reckless+0.196
 frightening+0.174
 afraid+0.167
 cal+0.165
 gut+0.163
Neg logit contributions
 collection-0.252
 wrists-0.243
 cand-0.243
 traded-0.233
 mar-0.231
Token.
Token ID13
Feature activation+1.17e-02

Pos logit contributions
 reckless+0.178
 frightening+0.163
 cal+0.158
 afraid+0.158
Little+0.156
Neg logit contributions
 collection-0.123
 wrists-0.117
 cand-0.116
 traded-0.109
 mar-0.107
Token Tim
Token ID5045
Feature activation+6.35e-03

Pos logit contributions
 collection+0.020
 wrists+0.020
 notebooks+0.019
 cand+0.019
 wedding+0.018
Neg logit contributions
 reckless-0.019
 afraid-0.018
 frightening-0.018
 911-0.018
Little-0.017
Tokenmy
Token ID1820
Feature activation-2.07e-03

Pos logit contributions
 collection+0.009
 wrists+0.009
 cand+0.009
 notebooks+0.008
 traded+0.008
Neg logit contributions
 reckless-0.013
 frightening-0.012
 afraid-0.011
 cal-0.011
Little-0.011
Token was
Token ID373
Feature activation+7.93e-03

Pos logit contributions
 reckless+0.004
 frightening+0.004
 afraid+0.004
 cal+0.004
 beast+0.004
Neg logit contributions
 collection-0.003
 wrists-0.003
 cand-0.003
 notebooks-0.003
 coloring-0.003
Token on
Token ID319
Feature activation+3.58e-02

Pos logit contributions
 reckless+0.036
 frightening+0.033
 afraid+0.032
 beast+0.031
 cal+0.031
Neg logit contributions
 collection-0.031
 cand-0.030
 wrists-0.030
 traded-0.028
 notebooks-0.028
Token the
Token ID262
Feature activation-1.48e-02

Pos logit contributions
 collection+0.064
 wrists+0.063
 cand+0.063
 mar+0.059
 notebooks+0.058
Neg logit contributions
 reckless-0.060
 frightening-0.053
 afraid-0.052
 cal-0.051
 gut-0.051
Token ground
Token ID2323
Feature activation-2.24e-02

Pos logit contributions
 reckless+0.030
 frightening+0.027
 cal+0.026
 afraid+0.026
Little+0.026
Neg logit contributions
 collection-0.022
 wrists-0.021
 cand-0.021
 mar-0.019
 coloring-0.019
Token.
Token ID13
Feature activation+3.63e-03

Pos logit contributions
 reckless+0.045
 frightening+0.042
 cal+0.040
 afraid+0.040
Little+0.040
Neg logit contributions
 collection-0.032
 wrists-0.031
 cand-0.031
 traded-0.028
 notebooks-0.028
Token She
Token ID1375
Feature activation-2.19e-02

Pos logit contributions
 collection+0.006
 wrists+0.006
 cand+0.006
 notebooks+0.005
 traded+0.005
Neg logit contributions
 reckless-0.007
 frightening-0.006
 afraid-0.006
 cal-0.006
 gut-0.006
Token didn
Token ID1422
Feature activation-9.73e-02

Pos logit contributions
 reckless+0.047
 911+0.044
 unnecessary+0.043
 gut+0.043
 frightening+0.043
Neg logit contributions
 collection-0.028
 wrists-0.028
 cand-0.026
 traded-0.026
 mar-0.025
Token't
Token ID470
Feature activation-7.99e-02

Pos logit contributions
 reckless+0.206
 frightening+0.184
 unnecessary+0.180
 afraid+0.179
 gut+0.178
Neg logit contributions
 collection-0.120
 coloring-0.114
 mar-0.114
 cand-0.112
 juice-0.109
Token know
Token ID760
Feature activation-2.24e-02

Pos logit contributions
 reckless+0.114
Little+0.102
 gut+0.099
 frightening+0.099
 cal+0.096
Neg logit contributions
 collection-0.153
 cand-0.148
 traded-0.145
 wrists-0.143
 mar-0.143
Token what
Token ID644
Feature activation-1.46e-03

Pos logit contributions
 reckless+0.037
umpy+0.034
 booming+0.033
 urgent+0.033
 frightening+0.033
Neg logit contributions
 cand-0.040
 collection-0.039
 mar-0.037
 coloring-0.037
 wrists-0.037
Token it
Token ID340
Feature activation-1.35e-02

Pos logit contributions
 reckless+0.002
 frightening+0.002
 afraid+0.002
 cal+0.002
 beast+0.002
Neg logit contributions
 collection-0.003
 wrists-0.003
 cand-0.003
 traded-0.002
 notebooks-0.002
Token was
Token ID373
Feature activation+3.04e-03

Pos logit contributions
 reckless+0.033
 gut+0.031
 cal+0.030
 911+0.030
 unnecessary+0.030
Neg logit contributions
 collection-0.013
 cand-0.012
 mar-0.011
 traded-0.011
 coloring-0.011
Token He
Token ID679
Feature activation-2.71e-02

Pos logit contributions
 collection+0.001
 wrists+0.001
 cand+0.001
 notebooks+0.001
 traded+0.001
Neg logit contributions
 reckless-0.001
 frightening-0.001
Little-0.001
 afraid-0.001
 cal-0.001
Token was
Token ID373
Feature activation+5.78e-03

Pos logit contributions
 reckless+0.052
 frightening+0.047
 afraid+0.047
 cal+0.046
Little+0.045
Neg logit contributions
 collection-0.041
 wrists-0.040
 cand-0.040
 notebooks-0.037
 coloring-0.036
Token so
Token ID523
Feature activation+2.72e-02

Pos logit contributions
 collection+0.010
 wrists+0.009
 cand+0.009
 notebooks+0.009
 traded+0.009
Neg logit contributions
 reckless-0.011
 frightening-0.010
 afraid-0.009
 cal-0.009
 gut-0.009
Token excited
Token ID6568
Feature activation-2.72e-02

Pos logit contributions
 collection+0.044
 cand+0.042
 traded+0.041
 wrists+0.040
 mar+0.040
Neg logit contributions
 reckless-0.049
 911-0.046
 beast-0.046
 unnecessary-0.044
 frightening-0.044
Token and
Token ID290
Feature activation-2.34e-02

Pos logit contributions
 reckless+0.047
 frightening+0.043
 cal+0.041
 afraid+0.041
Little+0.041
Neg logit contributions
 collection-0.047
 wrists-0.045
 cand-0.045
 traded-0.042
 notebooks-0.042
Token couldn
Token ID3521
Feature activation-1.33e-01

Pos logit contributions
 reckless+0.040
 frightening+0.036
 afraid+0.035
 cal+0.035
 gut+0.034
Neg logit contributions
 collection-0.041
 wrists-0.040
 cand-0.040
 notebooks-0.037
 mar-0.037
Token't
Token ID470
Feature activation-9.23e-02

Pos logit contributions
 reckless+0.241
 frightening+0.217
 afraid+0.208
 gut+0.206
 unnecessary+0.205
Neg logit contributions
 collection-0.223
 cand-0.209
 traded-0.208
 wrists-0.202
 mar-0.201
Token wait
Token ID4043
Feature activation+8.18e-03

Pos logit contributions
 reckless+0.189
 frightening+0.175
 afraid+0.170
 cal+0.169
 gut+0.167
Neg logit contributions
 collection-0.129
 wrists-0.127
 cand-0.125
 notebooks-0.116
 mar-0.114
Token to
Token ID284
Feature activation-4.13e-02

Pos logit contributions
 collection+0.013
 wrists+0.012
 cand+0.012
 coloring+0.011
 mar+0.011
Neg logit contributions
 reckless-0.016
 frightening-0.014
 cal-0.014
Little-0.014
 afraid-0.014
Token eat
Token ID4483
Feature activation+2.89e-02

Pos logit contributions
 reckless+0.079
 frightening+0.073
 cal+0.071
 afraid+0.071
Little+0.069
Neg logit contributions
 collection-0.063
 wrists-0.062
 cand-0.061
 notebooks-0.057
 traded-0.057
Token.
Token ID13
Feature activation-3.84e-03

Pos logit contributions
 collection+0.053
 wrists+0.053
 cand+0.051
 traded+0.050
 notebooks+0.049
Neg logit contributions
 reckless-0.045
 frightening-0.041
 cal-0.041
 afraid-0.040
 911-0.039
Token very
Token ID845
Feature activation+9.28e-03

Pos logit contributions
 reckless+0.016
 frightening+0.014
 afraid+0.014
 cal+0.013
Little+0.013
Neg logit contributions
 wrists-0.020
 cand-0.020
 collection-0.020
 notebooks-0.019
 traded-0.018
Token obedient
Token ID41652
Feature activation+2.99e-03

Pos logit contributions
 collection+0.017
 wrists+0.016
 cand+0.016
 traded+0.016
 notebooks+0.016
Neg logit contributions
 reckless-0.015
 frightening-0.013
 cal-0.013
 beast-0.013
 911-0.013
Token boy
Token ID2933
Feature activation-1.66e-02

Pos logit contributions
 wrists+0.006
 collection+0.006
 cand+0.006
 notebooks+0.005
 traded+0.005
Neg logit contributions
 reckless-0.004
 frightening-0.004
 afraid-0.004
 cal-0.004
 gut-0.004
Token yesterday
Token ID7415
Feature activation-2.68e-02

Pos logit contributions
 reckless+0.023
 afraid+0.021
 though+0.019
 gut+0.019
 cal+0.019
Neg logit contributions
 wrists-0.034
 cand-0.033
 collection-0.032
 notebooks-0.032
 extract-0.030
Token.
Token ID13
Feature activation+1.40e-03

Pos logit contributions
 reckless+0.043
 afraid+0.039
 frightening+0.038
 cal+0.036
 gut+0.036
Neg logit contributions
 wrists-0.047
 collection-0.046
 cand-0.045
 notebooks-0.045
 traded-0.042
Token Everywhere
Token ID48201
Feature activation-1.05e-01

Pos logit contributions
 wrists+0.003
 cand+0.003
 collection+0.003
 notebooks+0.002
 mar+0.002
Neg logit contributions
 reckless-0.002
 frightening-0.002
 afraid-0.002
 cal-0.002
Little-0.002
Token he
Token ID339
Feature activation-4.78e-02

Pos logit contributions
 reckless+0.189
 frightening+0.171
 cal+0.166
 afraid+0.165
 gut+0.164
Neg logit contributions
 collection-0.175
 wrists-0.167
 cand-0.167
 traded-0.159
 mar-0.157
Token went
Token ID1816
Feature activation-3.68e-02

Pos logit contributions
 reckless+0.116
 frightening+0.107
 911+0.105
 gut+0.104
 afraid+0.104
Neg logit contributions
 collection-0.051
 cand-0.048
 wrists-0.047
 traded-0.045
 mar-0.043
Token,
Token ID11
Feature activation-8.42e-03

Pos logit contributions
 reckless+0.068
 frightening+0.062
 afraid+0.060
 cal+0.060
Little+0.060
Neg logit contributions
 wrists-0.059
 collection-0.059
 cand-0.057
 notebooks-0.053
 traded-0.052
Token he
Token ID339
Feature activation-4.35e-02

Pos logit contributions
 reckless+0.016
 frightening+0.014
 cal+0.014
 gut+0.014
 beast+0.014
Neg logit contributions
 collection-0.013
 cand-0.013
 wrists-0.012
 mar-0.012
 traded-0.012
Token behaved
Token ID39207
Feature activation+7.38e-03

Pos logit contributions
 reckless+0.092
 frightening+0.085
 cal+0.082
 afraid+0.082
Little+0.082
Neg logit contributions
 collection-0.058
 wrists-0.056
 cand-0.055
 traded-0.051
 notebooks-0.051