TOP ACTIVATIONS
MAX = 10.743

something new ! She thought av oc ados were the most
away . She learned that av oc ados are good for
something new ! She thought av oc ados were the most
. Did you know that av oc ados are very healthy
said , " I love av oc ados ! They are
still couldn 't find any av oc ados . Finally ,
girl who didn 't like av oc ados . She always
happy that it produced more av oc ados than ever before
but there were no other av oc ados .
" I will buy more av oc ados tomorrow . You
day on , Lily loved av oc ados and would ask
if she could request some av oc ados from the store
the tree if they were av oc ados , and they
and shared everything , including av oc ados . They were
on , whenever they wanted av oc ados , they would
time there was a little av ac ado . It was
Everyday he would take the av oc ados and scatter them
Maybe they could grow more av oc ados and share them
, and all the other av oc ados were very happy
. Mom says they are av oc ados . Sara wants

MIN ACTIVATIONS
MIN = -8.191

bang " and " p ow " and laugh .
om " and " p ow ". One day , they
back on his snow pl ow . He used his shovel
. " Me ow , me ow !" he
ch oo cho o cho o ". They both
" Me ow , me ow !" he says . He
. " Me ow !" the cat says .
. " The snow pl ow is here !"
shovel on my snow pl ow . I can lift him
goes " ch oo cho o cho o ".
pa carry things and pl ow the field .
" Me ow , me ow !" the dolls say .
" Me ow , me ow , Tom ! Can you
. " Me ow , me ow !" the
man . The snow pl ow came closer and closer .
" Me ow , me ow , I 'm a cat
!" " Me ow !" the cat says .
It is a snow pl ow . It is coming to
happened . The snow pl ow stopped . The driver got
" Me ow , me ow , I can help you

INTERVAL 6.010 - 10.743
CONTAINS 0.018%

his subjects learned an important lesson to never be jealous of
made sure to eat her av oc ados . She learned
were . There was one gard ener who was very patient
She learned an important lesson that day . Never leave
She was proud of how neatly she was hanging the flower

INTERVAL 1.276 - 6.010
CONTAINS 15.406%

toys , but not all of them . She tells them
was so excited to try and play it .
and the fish . He knew that the spells were not
stopped . From that day on , the little girl loved
spoon . She tried the cereal , and to her surprise

INTERVAL -3.458 - 1.276
CONTAINS 83.503%

, until he was out of breath .
so excited to play with the mel on in the water
in the sunshine . One day , Wh isk ers saw
sky and started to cry . Just then ,
it , he couldn 't . He asked all his friends

INTERVAL -8.191 - -3.458
CONTAINS 1.073%

a boy in the park who had no k ite .
. They wanted to pet him . They forgot that he
a big box of seeds that their dad gave them .
her house and not wander too far away . <|endoftext|> Once
tall tree and carefully picked him up . The superhero held
Token something
Token ID1223
Feature activation+1.94e+00

Pos logit contributions
oll+0.205
umm+0.200
atmeal+0.197
 bead+0.197
ware+0.194
Neg logit contributions
 scared-0.108
 worried-0.103
 sad-0.100
 afraid-0.091
 frightened-0.090
Token new
Token ID649
Feature activation+3.74e+00

Pos logit contributions
oll+0.109
 bead+0.107
der+0.107
oph+0.106
aces+0.105
Neg logit contributions
 scared-0.063
 worried-0.060
 sad-0.059
 frightened-0.055
 afraid-0.052
Token!
Token ID0
Feature activation+1.22e+00

Pos logit contributions
oph+0.218
oll+0.216
 av+0.214
der+0.208
atmeal+0.206
Neg logit contributions
 scared-0.108
 worried-0.098
 afraid-0.090
 voice-0.089
 frightened-0.089
Token She
Token ID1375
Feature activation+1.49e+00

Pos logit contributions
der+0.071
 av+0.071
ier+0.071
ils+0.070
oll+0.067
Neg logit contributions
 scared-0.030
 worried-0.028
 sad-0.027
 frightened-0.027
 shout-0.027
Token thought
Token ID1807
Feature activation+6.64e-01

Pos logit contributions
reating+0.093
der+0.087
ier+0.086
quet+0.085
 novels+0.085
Neg logit contributions
 scared-0.047
 worried-0.043
 sad-0.040
 frightened-0.038
 saw-0.036
Token av
Token ID1196
Feature activation+1.07e+01

Pos logit contributions
oph+0.034
der+0.033
assy+0.032
oll+0.032
umm+0.032
Neg logit contributions
 sad-0.024
 scared-0.023
 worried-0.022
 afraid-0.020
 frightened-0.020
Tokenoc
Token ID420
Feature activation+2.23e+00

Pos logit contributions
der+0.557
oll+0.525
oph+0.488
 av+0.485
ier+0.479
Neg logit contributions
 scared-0.383
 worried-0.368
 frightened-0.358
 sad-0.347
 afraid-0.343
Tokenados
Token ID22484
Feature activation+6.24e+00

Pos logit contributions
oll+0.054
der+0.053
 bead+0.050
oph+0.049
 mastered+0.047
Neg logit contributions
 scared-0.124
 worried-0.122
 frightened-0.120
 voice-0.119
 sad-0.118
Token were
Token ID547
Feature activation+1.93e+00

Pos logit contributions
der+0.350
oll+0.347
oph+0.334
ellery+0.320
 av+0.313
Neg logit contributions
 scared-0.211
 worried-0.191
 frightened-0.184
 sad-0.180
 afraid-0.170
Token the
Token ID262
Feature activation+1.54e+00

Pos logit contributions
der+0.114
oll+0.110
oph+0.105
ently+0.101
ware+0.099
Neg logit contributions
 scared-0.066
 worried-0.061
 sad-0.059
 frightened-0.058
 afraid-0.055
Token most
Token ID749
Feature activation+1.75e+00

Pos logit contributions
der+0.075
oll+0.071
oph+0.066
ils+0.065
 av+0.064
Neg logit contributions
 scared-0.062
 worried-0.060
 frightened-0.058
 sad-0.058
 afraid-0.056
Token away
Token ID1497
Feature activation-9.03e-01

Pos logit contributions
der+0.031
oll+0.029
 av+0.029
 bead+0.028
oph+0.027
Neg logit contributions
 scared-0.026
 worried-0.026
 frightened-0.026
 afraid-0.025
 shout-0.025
Token.
Token ID13
Feature activation+1.20e+00

Pos logit contributions
 scared+0.035
 worried+0.033
 sad+0.032
 frightened+0.031
 afraid+0.029
Neg logit contributions
oll-0.048
oph-0.047
der-0.045
ils-0.044
assy-0.044
Token She
Token ID1375
Feature activation+1.52e+00

Pos logit contributions
der+0.070
oll+0.068
ier+0.068
 av+0.065
ils+0.065
Neg logit contributions
 scared-0.034
 worried-0.032
 frightened-0.031
 sad-0.031
 shout-0.029
Token learned
Token ID4499
Feature activation+3.47e+00

Pos logit contributions
reating+0.089
quet+0.088
der+0.086
umm+0.085
igsaw+0.085
Neg logit contributions
 scared-0.053
 worried-0.048
 sad-0.047
 frightened-0.044
 confused-0.041
Token that
Token ID326
Feature activation-8.73e-01

Pos logit contributions
umm+0.204
oll+0.202
atmeal+0.199
oph+0.197
ware+0.195
Neg logit contributions
 scared-0.109
 sad-0.106
 worried-0.103
 frightened-0.091
 afraid-0.089
Token av
Token ID1196
Feature activation+1.07e+01

Pos logit contributions
 scared+0.033
 sad+0.032
 worried+0.032
 frightened+0.030
 worry+0.029
Neg logit contributions
oll-0.045
oph-0.044
der-0.044
ier-0.042
assy-0.042
Tokenoc
Token ID420
Feature activation+2.15e+00

Pos logit contributions
der+0.534
oll+0.506
oph+0.478
 av+0.469
ier+0.457
Neg logit contributions
 scared-0.401
 worried-0.388
 frightened-0.378
 sad-0.372
 afraid-0.363
Tokenados
Token ID22484
Feature activation+6.34e+00

Pos logit contributions
der+0.058
oll+0.057
oph+0.054
 bead+0.053
 mastered+0.053
Neg logit contributions
 scared-0.110
 worried-0.109
 voice-0.106
 sad-0.106
 frightened-0.106
Token are
Token ID389
Feature activation+2.06e+00

Pos logit contributions
der+0.356
oll+0.348
oph+0.343
 av+0.323
ellery+0.318
Neg logit contributions
 scared-0.209
 worried-0.196
 sad-0.189
 frightened-0.189
 afraid-0.179
Token good
Token ID922
Feature activation+3.35e+00

Pos logit contributions
oph+0.119
oll+0.115
der+0.114
lor+0.106
assy+0.106
Neg logit contributions
 scared-0.076
 worried-0.073
 sad-0.072
 afraid-0.067
 frightened-0.067
Token for
Token ID329
Feature activation+1.47e+00

Pos logit contributions
oph+0.173
oll+0.162
quet+0.161
der+0.158
 av+0.157
Neg logit contributions
 scared-0.132
 worried-0.125
 frightened-0.118
 afraid-0.118
 sad-0.115
Token something
Token ID1223
Feature activation+1.52e+00

Pos logit contributions
oll+0.178
umm+0.169
 bead+0.169
atmeal+0.169
der+0.167
Neg logit contributions
 scared-0.099
 worried-0.094
 sad-0.092
 frightened-0.084
 afraid-0.084
Token new
Token ID649
Feature activation+3.36e+00

Pos logit contributions
oll+0.085
der+0.084
 bead+0.083
oph+0.082
aces+0.080
Neg logit contributions
 scared-0.051
 worried-0.049
 sad-0.048
 frightened-0.045
 afraid-0.043
Token!
Token ID0
Feature activation+9.27e-01

Pos logit contributions
oph+0.194
oll+0.192
 av+0.188
der+0.184
atmeal+0.181
Neg logit contributions
 scared-0.101
 worried-0.092
 voice-0.085
 afraid-0.085
 sad-0.085
Token She
Token ID1375
Feature activation+1.12e+00

Pos logit contributions
der+0.051
ier+0.050
 av+0.050
ils+0.050
oll+0.049
Neg logit contributions
 scared-0.026
 worried-0.025
 sad-0.024
 shout-0.023
 frightened-0.023
Token thought
Token ID1807
Feature activation+2.24e-01

Pos logit contributions
reating+0.068
der+0.065
ier+0.064
oll+0.063
quet+0.062
Neg logit contributions
 scared-0.036
 worried-0.033
 sad-0.031
 frightened-0.030
 saw-0.028
Token av
Token ID1196
Feature activation+1.05e+01

Pos logit contributions
oph+0.011
der+0.011
oll+0.011
assy+0.011
 bead+0.011
Neg logit contributions
 sad-0.008
 scared-0.008
 worried-0.008
 frightened-0.007
 confused-0.007
Tokenoc
Token ID420
Feature activation+1.91e+00

Pos logit contributions
der+0.563
oll+0.527
oph+0.489
 av+0.484
ier+0.483
Neg logit contributions
 scared-0.363
 worried-0.347
 frightened-0.338
 sad-0.325
 afraid-0.323
Tokenados
Token ID22484
Feature activation+5.90e+00

Pos logit contributions
oll+0.047
der+0.044
 bead+0.043
oph+0.043
 mastered+0.040
Neg logit contributions
 scared-0.105
 worried-0.104
 frightened-0.102
 voice-0.101
 sad-0.100
Token were
Token ID547
Feature activation+1.46e+00

Pos logit contributions
der+0.328
oll+0.325
oph+0.313
ellery+0.298
 av+0.293
Neg logit contributions
 scared-0.203
 worried-0.185
 frightened-0.178
 sad-0.175
 afraid-0.165
Token the
Token ID262
Feature activation+1.15e+00

Pos logit contributions
der+0.085
oll+0.082
oph+0.079
ently+0.075
ware+0.074
Neg logit contributions
 scared-0.050
 worried-0.047
 sad-0.046
 frightened-0.044
 afraid-0.042
Token most
Token ID749
Feature activation+1.20e+00

Pos logit contributions
der+0.053
oll+0.051
oph+0.047
 av+0.046
 bead+0.046
Neg logit contributions
 scared-0.048
 worried-0.047
 frightened-0.046
 sad-0.045
 afraid-0.044
Token.
Token ID13
Feature activation+7.81e-01

Pos logit contributions
der+0.119
oll+0.112
 av+0.106
oph+0.105
ier+0.103
Neg logit contributions
 scared-0.117
 worried-0.115
 frightened-0.113
 sad-0.111
 afraid-0.109
Token Did
Token ID7731
Feature activation+8.10e-01

Pos logit contributions
der+0.038
oll+0.037
 bead+0.034
oph+0.034
 av+0.033
Neg logit contributions
 scared-0.030
 worried-0.029
 frightened-0.028
 sad-0.028
 afraid-0.027
Token you
Token ID345
Feature activation+1.74e-01

Pos logit contributions
der+0.039
oll+0.038
 av+0.035
oph+0.035
 bead+0.035
Neg logit contributions
 scared-0.030
 worried-0.030
 frightened-0.029
 sad-0.028
 afraid-0.028
Token know
Token ID760
Feature activation+1.70e+00

Pos logit contributions
der+0.009
oll+0.008
 mastered+0.008
 bead+0.008
 av+0.008
Neg logit contributions
 worried-0.005
 scared-0.005
 frightened-0.005
 barking-0.005
 afraid-0.005
Token that
Token ID326
Feature activation-1.04e+00

Pos logit contributions
oll+0.087
oph+0.087
der+0.082
assy+0.080
 mastered+0.080
Neg logit contributions
 scared-0.061
 sad-0.058
 worried-0.058
 afraid-0.055
 frightened-0.054
Token av
Token ID1196
Feature activation+9.96e+00

Pos logit contributions
 scared+0.039
 worried+0.036
 frightened+0.036
 sad+0.036
 afraid+0.034
Neg logit contributions
oll-0.054
der-0.054
oph-0.051
assy-0.049
ier-0.048
Tokenoc
Token ID420
Feature activation+1.64e+00

Pos logit contributions
der+0.545
oll+0.511
oph+0.479
ware+0.461
ier+0.459
Neg logit contributions
 scared-0.329
 worried-0.311
 frightened-0.303
 sad-0.300
 afraid-0.292
Tokenados
Token ID22484
Feature activation+5.28e+00

Pos logit contributions
oll+0.042
oph+0.040
 bead+0.040
der+0.040
 mastered+0.038
Neg logit contributions
 scared-0.087
 worried-0.086
 voice-0.084
 sad-0.084
 frightened-0.084
Token are
Token ID389
Feature activation+1.26e+00

Pos logit contributions
der+0.289
oll+0.280
oph+0.270
 av+0.258
 bead+0.257
Neg logit contributions
 scared-0.178
 worried-0.166
 frightened-0.161
 sad-0.158
 afraid-0.152
Token very
Token ID845
Feature activation-1.07e+00

Pos logit contributions
der+0.070
oll+0.069
oph+0.068
assy+0.063
 bead+0.063
Neg logit contributions
 scared-0.041
 worried-0.040
 sad-0.039
 frightened-0.037
 afraid-0.037
Token healthy
Token ID5448
Feature activation+1.36e+00

Pos logit contributions
 scared+0.042
 worried+0.041
 frightened+0.040
 sad+0.040
 afraid+0.039
Neg logit contributions
der-0.051
oll-0.049
oph-0.046
 av-0.045
 bead-0.045
Token said
Token ID531
Feature activation-3.38e-01

Pos logit contributions
der+0.079
oll+0.075
oph+0.072
 av+0.071
 bead+0.069
Neg logit contributions
 scared-0.049
 worried-0.048
 frightened-0.047
 afraid-0.045
 sad-0.045
Token,
Token ID11
Feature activation+2.13e+00

Pos logit contributions
 scared+0.013
 worried+0.013
 frightened+0.012
 sad+0.012
 afraid+0.012
Neg logit contributions
der-0.016
oll-0.016
oph-0.015
 av-0.015
 bead-0.014
Token "
Token ID366
Feature activation+1.17e+00

Pos logit contributions
der+0.103
oll+0.091
 av+0.088
oph+0.083
 bead+0.083
Neg logit contributions
 scared-0.082
 worried-0.080
 frightened-0.080
 panic-0.076
 afraid-0.076
TokenI
Token ID40
Feature activation-1.81e-01

Pos logit contributions
der+0.043
oph+0.039
oll+0.039
 bead+0.037
 av+0.037
Neg logit contributions
 scared-0.057
 worried-0.057
 shout-0.055
 sad-0.055
 frightened-0.055
Token love
Token ID1842
Feature activation+1.21e+00

Pos logit contributions
 boom+0.005
 barking+0.005
 voice+0.005
 frightened+0.005
 worried+0.005
Neg logit contributions
 mastered-0.009
der-0.009
oll-0.009
oph-0.008
 bead-0.008
Token av
Token ID1196
Feature activation+9.63e+00

Pos logit contributions
oll+0.055
der+0.054
oph+0.051
ier+0.050
 bead+0.048
Neg logit contributions
 scared-0.051
 worried-0.049
 sad-0.049
 frightened-0.047
 barking-0.045
Tokenoc
Token ID420
Feature activation+1.26e+00

Pos logit contributions
der+0.534
oll+0.494
oph+0.464
ware+0.457
ier+0.454
Neg logit contributions
 scared-0.314
 worried-0.300
 frightened-0.291
 sad-0.286
 afraid-0.279
Tokenados
Token ID22484
Feature activation+4.58e+00

Pos logit contributions
der+0.034
oll+0.033
 bead+0.031
oph+0.031
 mastered+0.029
Neg logit contributions
 scared-0.067
 worried-0.066
 frightened-0.065
 sad-0.064
 voice-0.063
Token!
Token ID0
Feature activation-6.37e-02

Pos logit contributions
oph+0.256
der+0.253
oll+0.250
ently+0.240
ellery+0.240
Neg logit contributions
 scared-0.136
 worried-0.123
 sad-0.120
 frightened-0.116
 afraid-0.115
Token They
Token ID1119
Feature activation-1.34e+00

Pos logit contributions
 scared+0.002
 worried+0.002
 frightened+0.002
 sad+0.002
 afraid+0.002
Neg logit contributions
der-0.003
oll-0.003
 av-0.003
oph-0.003
 bead-0.003
Token are
Token ID389
Feature activation+1.32e+00

Pos logit contributions
 scared+0.046
 worried+0.045
 frightened+0.043
 sad+0.043
 afraid+0.041
Neg logit contributions
der-0.070
oll-0.068
oph-0.064
 av-0.063
 bead-0.062
Token still
Token ID991
Feature activation-6.75e-01

Pos logit contributions
der+0.011
oll+0.011
oph+0.010
 av+0.010
 bead+0.010
Neg logit contributions
 scared-0.013
 worried-0.013
 frightened-0.013
 sad-0.012
 afraid-0.012
Token couldn
Token ID3521
Feature activation+1.87e+00

Pos logit contributions
 scared+0.027
 worried+0.027
 frightened+0.026
 sad+0.025
 afraid+0.025
Neg logit contributions
der-0.031
oll-0.030
oph-0.028
 av-0.028
 bead-0.028
Token't
Token ID470
Feature activation-1.26e+00

Pos logit contributions
der+0.111
oll+0.108
oph+0.104
ils+0.102
 av+0.102
Neg logit contributions
 scared-0.052
 worried-0.049
 frightened-0.048
 sad-0.047
 shout-0.044
Token find
Token ID1064
Feature activation+2.24e+00

Pos logit contributions
 scared+0.032
 worried+0.029
 frightened+0.028
 shout+0.028
 sad+0.027
Neg logit contributions
der-0.081
oll-0.079
oph-0.077
 av-0.076
ils-0.075
Token any
Token ID597
Feature activation+6.42e-01

Pos logit contributions
der+0.100
oll+0.094
oph+0.090
 av+0.088
 bead+0.086
Neg logit contributions
 scared-0.096
 worried-0.093
 frightened-0.091
 sad-0.089
 afraid-0.088
Token av
Token ID1196
Feature activation+9.50e+00

Pos logit contributions
der+0.028
oll+0.028
oph+0.027
assy+0.025
 av+0.025
Neg logit contributions
 scared-0.028
 worried-0.027
 frightened-0.027
 sad-0.026
 afraid-0.026
Tokenoc
Token ID420
Feature activation+1.17e+00

Pos logit contributions
der+0.549
oll+0.508
oph+0.483
ware+0.474
cake+0.466
Neg logit contributions
 scared-0.284
 worried-0.264
 frightened-0.259
 afraid-0.252
 sad-0.244
Tokenados
Token ID22484
Feature activation+4.56e+00

Pos logit contributions
oll+0.032
 bead+0.032
oph+0.031
der+0.031
 mastered+0.029
Neg logit contributions
 scared-0.058
 worried-0.058
 frightened-0.057
 sad-0.056
 voice-0.056
Token.
Token ID13
Feature activation+7.41e-01

Pos logit contributions
oph+0.240
der+0.239
oll+0.237
 bead+0.226
 av+0.220
Neg logit contributions
 scared-0.155
 worried-0.144
 frightened-0.135
 afraid-0.134
 sad-0.131
Token Finally
Token ID9461
Feature activation+2.07e+00

Pos logit contributions
der+0.035
oll+0.033
oph+0.032
 av+0.031
assy+0.030
Neg logit contributions
 scared-0.028
 worried-0.028
 frightened-0.026
 shout-0.026
 afraid-0.026
Token,
Token ID11
Feature activation+1.61e+00

Pos logit contributions
der+0.112
oll+0.107
oph+0.102
 av+0.101
 bead+0.100
Neg logit contributions
 scared-0.069
 worried-0.067
 frightened-0.064
 sad-0.064
 afraid-0.061
Token girl
Token ID2576
Feature activation-4.57e-02

Pos logit contributions
 scared+0.052
 worried+0.052
 shout+0.052
 afraid+0.051
 frightened+0.051
Neg logit contributions
 av-0.079
der-0.079
oll-0.078
 bead-0.078
oph-0.072
Token who
Token ID508
Feature activation-3.40e+00

Pos logit contributions
 scared+0.001
 worried+0.001
 frightened+0.001
 sad+0.001
 afraid+0.001
Neg logit contributions
der-0.002
oll-0.002
 av-0.002
 bead-0.002
oph-0.002
Token didn
Token ID1422
Feature activation+3.75e-01

Pos logit contributions
 worried+0.152
 barking+0.151
 shout+0.151
 scared+0.150
 frightened+0.149
Neg logit contributions
oll-0.119
der-0.108
 bead-0.105
ils-0.102
 tre-0.102
Token't
Token ID470
Feature activation-2.05e+00

Pos logit contributions
der+0.023
oll+0.022
oph+0.021
 av+0.021
ils+0.021
Neg logit contributions
 scared-0.010
 worried-0.009
 sad-0.009
 frightened-0.009
 shout-0.009
Token like
Token ID588
Feature activation-2.46e+00

Pos logit contributions
 scared+0.063
 worried+0.062
 frightened+0.060
 sad+0.058
 afraid+0.056
Neg logit contributions
der-0.112
oll-0.109
 av-0.101
 bead-0.101
oph-0.101
Token av
Token ID1196
Feature activation+9.46e+00

Pos logit contributions
 scared+0.103
 worried+0.100
 sad+0.097
 frightened+0.097
 afraid+0.093
Neg logit contributions
der-0.112
oll-0.111
oph-0.105
ier-0.102
 bead-0.101
Tokenoc
Token ID420
Feature activation+1.11e+00

Pos logit contributions
der+0.479
oll+0.438
oph+0.418
ware+0.409
ier+0.405
Neg logit contributions
 scared-0.361
 worried-0.345
 frightened-0.338
 sad-0.332
 afraid-0.322
Tokenados
Token ID22484
Feature activation+4.43e+00

Pos logit contributions
 bead+0.031
der+0.031
oll+0.030
Em+0.029
 neatly+0.029
Neg logit contributions
 scared-0.055
 worried-0.054
 frightened-0.053
 sad-0.053
 voice-0.052
Token.
Token ID13
Feature activation+9.12e-01

Pos logit contributions
der+0.241
oph+0.241
oll+0.230
 bead+0.223
ently+0.222
Neg logit contributions
 scared-0.152
 worried-0.142
 sad-0.138
 frightened-0.134
 afraid-0.129
Token She
Token ID1375
Feature activation+1.20e+00

Pos logit contributions
der+0.043
oll+0.042
oph+0.039
 av+0.039
 bead+0.038
Neg logit contributions
 scared-0.036
 worried-0.035
 frightened-0.034
 sad-0.034
 afraid-0.033
Token always
Token ID1464
Feature activation-1.92e-01

Pos logit contributions
der+0.065
oll+0.062
oph+0.059
reating+0.059
 av+0.058
Neg logit contributions
 scared-0.041
 worried-0.039
 frightened-0.038
 sad-0.038
 afraid-0.035
Token happy
Token ID3772
Feature activation+8.17e-01

Pos logit contributions
 scared+0.026
 worried+0.026
 frightened+0.025
 sad+0.024
 afraid+0.023
Neg logit contributions
der-0.048
oll-0.047
oph-0.044
 av-0.044
 bead-0.043
Token that
Token ID326
Feature activation-1.27e+00

Pos logit contributions
oll+0.040
der+0.037
oph+0.037
 bead+0.036
assy+0.036
Neg logit contributions
 scared-0.032
 worried-0.030
 frightened-0.029
 barking-0.029
 sad-0.028
Token it
Token ID340
Feature activation+1.51e+00

Pos logit contributions
 scared+0.044
 worried+0.041
 sad+0.041
 frightened+0.039
 afraid+0.037
Neg logit contributions
oll-0.071
der-0.071
oph-0.066
assy-0.064
aces-0.063
Token produced
Token ID4635
Feature activation+7.46e-02

Pos logit contributions
reating+0.090
der+0.086
oll+0.086
oph+0.082
ellery+0.079
Neg logit contributions
 scared-0.054
 worried-0.048
 frightened-0.047
 afraid-0.042
 sad-0.041
Token more
Token ID517
Feature activation+1.48e+00

Pos logit contributions
oll+0.004
der+0.004
oph+0.004
assy+0.003
reating+0.003
Neg logit contributions
 scared-0.003
 sad-0.003
 worried-0.003
 frightened-0.002
 afraid-0.002
Token av
Token ID1196
Feature activation+9.41e+00

Pos logit contributions
der+0.057
oll+0.057
oph+0.052
lor+0.049
ross+0.048
Neg logit contributions
 scared-0.076
 sad-0.073
 worried-0.072
 frightened-0.071
 confused-0.067
Tokenoc
Token ID420
Feature activation+1.11e+00

Pos logit contributions
der+0.558
oll+0.522
oph+0.498
ware+0.491
reating+0.490
Neg logit contributions
 scared-0.266
 worried-0.249
 frightened-0.244
 sad-0.234
 afraid-0.225
Tokenados
Token ID22484
Feature activation+4.49e+00

Pos logit contributions
oll+0.030
oph+0.029
 bead+0.028
der+0.027
 mastered+0.027
Neg logit contributions
 scared-0.057
 worried-0.057
 sad-0.056
 frightened-0.056
 voice-0.055
Token than
Token ID621
Feature activation-1.24e+00

Pos logit contributions
oll+0.234
der+0.233
oph+0.227
 bead+0.211
ests+0.210
Neg logit contributions
 scared-0.162
 worried-0.149
 frightened-0.142
 sad-0.140
 afraid-0.135
Token ever
Token ID1683
Feature activation-1.55e-01

Pos logit contributions
 scared+0.051
 sad+0.049
 worried+0.049
 frightened+0.047
 afraid+0.046
Neg logit contributions
oll-0.063
der-0.057
oph-0.057
assy-0.053
quet-0.053
Token before
Token ID878
Feature activation-5.35e-01

Pos logit contributions
 scared+0.007
 worried+0.007
 frightened+0.007
 afraid+0.007
 sad+0.007
Neg logit contributions
oll-0.007
der-0.006
oph-0.006
 bead-0.006
quet-0.006
Token but
Token ID475
Feature activation+2.63e-01

Pos logit contributions
der+0.009
 av+0.008
oll+0.008
 bead+0.008
 beads+0.007
Neg logit contributions
 worried-0.008
 scared-0.008
 frightened-0.008
 shout-0.008
 sad-0.008
Token there
Token ID612
Feature activation-5.20e-01

Pos logit contributions
der+0.011
oll+0.010
 av+0.010
 bead+0.010
oph+0.010
Neg logit contributions
 scared-0.011
 worried-0.011
 frightened-0.011
 shout-0.010
 sad-0.010
Token were
Token ID547
Feature activation+1.67e+00

Pos logit contributions
 scared+0.022
 worried+0.021
 frightened+0.021
 sad+0.020
 afraid+0.020
Neg logit contributions
der-0.023
oll-0.022
oph-0.021
 av-0.020
 bead-0.020
Token no
Token ID645
Feature activation-1.10e+00

Pos logit contributions
der+0.074
oll+0.070
 av+0.066
oph+0.065
 bead+0.065
Neg logit contributions
 scared-0.069
 worried-0.068
 frightened-0.066
 sad-0.064
 shout-0.064
Token other
Token ID584
Feature activation+1.56e+00

Pos logit contributions
 scared+0.044
 worried+0.042
 frightened+0.041
 sad+0.041
 afraid+0.039
Neg logit contributions
der-0.053
oll-0.052
oph-0.049
 bead-0.046
ils-0.046
Token av
Token ID1196
Feature activation+9.38e+00

Pos logit contributions
der+0.075
oll+0.074
oph+0.070
 bead+0.067
ier+0.065
Neg logit contributions
 scared-0.062
 worried-0.059
 frightened-0.058
 sad-0.057
 afraid-0.056
Tokenoc
Token ID420
Feature activation+1.03e+00

Pos logit contributions
der+0.472
oll+0.443
oph+0.423
 av+0.403
 bead+0.402
Neg logit contributions
 scared-0.354
 worried-0.342
 frightened-0.334
 sad-0.323
 afraid-0.321
Tokenados
Token ID22484
Feature activation+4.85e+00

Pos logit contributions
Em+0.032
oph+0.031
oll+0.031
 bead+0.030
der+0.029
Neg logit contributions
 scared-0.048
 worried-0.047
 voice-0.047
 frightened-0.046
 sorry-0.046
Token.
Token ID13
Feature activation+1.25e+00

Pos logit contributions
oph+0.251
der+0.251
oll+0.244
 bead+0.232
ently+0.227
Neg logit contributions
 scared-0.177
 worried-0.168
 frightened-0.160
 sad-0.158
 afraid-0.155
Token
Token ID198
Feature activation+1.30e+00

Pos logit contributions
der+0.054
 av+0.048
oph+0.045
oll+0.045
cake+0.044
Neg logit contributions
 worried-0.053
 scared-0.051
 worry-0.050
 frightened-0.049
 afraid-0.049
Token
Token ID198
Feature activation+1.16e+00

Pos logit contributions
der+0.045
oll+0.044
oph+0.043
ently+0.043
 mastered+0.042
Neg logit contributions
 worried-0.059
 scared-0.057
 frightened-0.057
 worry-0.056
 sad-0.055
Token "
Token ID366
Feature activation-1.34e-01

Pos logit contributions
der+0.001
oll+0.001
 av+0.001
cake+0.001
ier+0.001
Neg logit contributions
 scared-0.001
 panic-0.001
 frightened-0.001
 worried-0.001
 shout-0.001
TokenI
Token ID40
Feature activation-1.05e+00

Pos logit contributions
 scared+0.006
 sad+0.006
 worried+0.006
 frightened+0.006
 afraid+0.006
Neg logit contributions
der-0.005
oph-0.005
oll-0.005
 av-0.005
cake-0.005
Token will
Token ID481
Feature activation+1.56e-01

Pos logit contributions
 voice+0.031
 boom+0.030
 barking+0.030
 frightened+0.030
 sad+0.030
Neg logit contributions
 mastered-0.052
oll-0.051
der-0.048
oph-0.047
 bead-0.046
Token buy
Token ID2822
Feature activation+1.14e+00

Pos logit contributions
der+0.007
oll+0.007
 mastered+0.006
 av+0.006
oph+0.006
Neg logit contributions
 scared-0.006
 worried-0.006
 frightened-0.006
 afraid-0.006
 sad-0.005
Token more
Token ID517
Feature activation+1.50e+00

Pos logit contributions
der+0.060
oll+0.058
oph+0.055
 bead+0.053
 mastered+0.053
Neg logit contributions
 scared-0.039
 worried-0.038
 frightened-0.037
 sad-0.036
 afraid-0.035
Token av
Token ID1196
Feature activation+9.24e+00

Pos logit contributions
der+0.070
oph+0.067
oll+0.066
ier+0.061
ross+0.061
Neg logit contributions
 scared-0.062
 worried-0.059
 sad-0.058
 frightened-0.057
 shout-0.056
Tokenoc
Token ID420
Feature activation+4.94e-01

Pos logit contributions
der+0.587
oll+0.522
ware+0.501
oph+0.498
ently+0.487
Neg logit contributions
 scared-0.225
 worried-0.206
 frightened-0.198
 sad-0.194
 shout-0.192
Tokenados
Token ID22484
Feature activation+4.16e+00

Pos logit contributions
oll+0.013
der+0.013
oph+0.013
 bead+0.013
 mastered+0.012
Neg logit contributions
 scared-0.025
 worried-0.025
 voice-0.024
 frightened-0.024
 sad-0.024
Token tomorrow
Token ID9439
Feature activation-2.01e-01

Pos logit contributions
oph+0.234
der+0.229
oll+0.223
 mastered+0.217
ellery+0.217
Neg logit contributions
 scared-0.117
 worried-0.102
 afraid-0.097
 sad-0.096
 want-0.095
Token.
Token ID13
Feature activation-1.03e+00

Pos logit contributions
 scared+0.006
 worried+0.006
 sad+0.006
 shout+0.006
 frightened+0.006
Neg logit contributions
der-0.011
oll-0.011
 av-0.010
oph-0.010
 bead-0.010
Token You
Token ID921
Feature activation-1.67e+00

Pos logit contributions
 scared+0.036
 worried+0.036
 frightened+0.034
 sad+0.034
 afraid+0.033
Neg logit contributions
der-0.053
oll-0.051
 av-0.048
oph-0.047
 bead-0.047
Token day
Token ID1110
Feature activation+8.26e-02

Pos logit contributions
 scared+0.088
 worried+0.087
 frightened+0.084
 sad+0.083
 afraid+0.080
Neg logit contributions
der-0.109
oll-0.104
oph-0.098
 av-0.097
 bead-0.095
Token on
Token ID319
Feature activation+2.90e+00

Pos logit contributions
der+0.004
oph+0.004
 av+0.004
oll+0.003
ware+0.003
Neg logit contributions
 scared-0.004
 worried-0.003
 sad-0.003
 frightened-0.003
 afraid-0.003
Token,
Token ID11
Feature activation+1.40e+00

Pos logit contributions
der+0.212
oll+0.200
reating+0.195
ier+0.191
ellery+0.191
Neg logit contributions
 scared-0.055
 sad-0.055
 worried-0.052
 frightened-0.043
 voice-0.042
Token Lily
Token ID20037
Feature activation+2.43e+00

Pos logit contributions
der+0.091
oll+0.090
lor+0.087
ier+0.086
ently+0.085
Neg logit contributions
 sad-0.039
 scared-0.037
 worried-0.034
 frightened-0.031
 fearful-0.029
Token loved
Token ID6151
Feature activation+4.54e-01

Pos logit contributions
quet+0.148
reating+0.145
der+0.144
igsaw+0.140
oll+0.138
Neg logit contributions
 scared-0.080
 worried-0.076
 sad-0.071
 frightened-0.066
 afraid-0.062
Token av
Token ID1196
Feature activation+9.21e+00

Pos logit contributions
der+0.024
oph+0.024
oll+0.024
ier+0.023
lor+0.023
Neg logit contributions
 scared-0.017
 worried-0.017
 sad-0.016
 barking-0.016
 frightened-0.015
Tokenoc
Token ID420
Feature activation+9.33e-01

Pos logit contributions
der+0.500
oll+0.458
ier+0.429
ware+0.427
oph+0.426
Neg logit contributions
 scared-0.317
 worried-0.303
 frightened-0.294
 sad-0.290
 afraid-0.279
Tokenados
Token ID22484
Feature activation+4.76e+00

Pos logit contributions
der+0.028
oll+0.025
 bead+0.025
 mastered+0.024
Em+0.024
Neg logit contributions
 scared-0.045
 voice-0.045
 worried-0.045
 sorry-0.044
 sad-0.044
Token and
Token ID290
Feature activation+1.16e+00

Pos logit contributions
der+0.308
oph+0.306
ently+0.300
ross+0.295
ware+0.291
Neg logit contributions
 scared-0.127
 sad-0.103
 worried-0.099
 frightened-0.093
 afraid-0.088
Token would
Token ID561
Feature activation+1.27e+00

Pos logit contributions
quet+0.066
oll+0.066
igsaw+0.065
lor+0.065
oph+0.062
Neg logit contributions
 scared-0.040
 worried-0.037
 sad-0.037
 frightened-0.032
 afraid-0.032
Token ask
Token ID1265
Feature activation-1.17e+00

Pos logit contributions
der+0.054
oll+0.054
ier+0.051
oph+0.050
ently+0.048
Neg logit contributions
 scared-0.055
 worried-0.054
 sad-0.053
 shout-0.052
 yell-0.051
Token if
Token ID611
Feature activation-3.52e-01

Pos logit contributions
der+0.014
oll+0.013
oph+0.012
 av+0.012
 bead+0.012
Neg logit contributions
 scared-0.012
 worried-0.012
 frightened-0.012
 sad-0.012
 afraid-0.011
Token she
Token ID673
Feature activation+1.13e+00

Pos logit contributions
 scared+0.015
 worried+0.015
 sad+0.014
 frightened+0.014
 afraid+0.013
Neg logit contributions
der-0.016
oll-0.016
oph-0.014
 bead-0.014
aces-0.014
Token could
Token ID714
Feature activation-2.69e-01

Pos logit contributions
reating+0.067
der+0.065
quet+0.063
ently+0.062
ross+0.059
Neg logit contributions
 scared-0.038
 worried-0.035
 frightened-0.033
 sad-0.032
 afraid-0.031
Token request
Token ID2581
Feature activation-6.16e-01

Pos logit contributions
 scared+0.008
 sad+0.008
 worried+0.008
 shout+0.007
 yell+0.007
Neg logit contributions
der-0.016
oph-0.015
oll-0.015
ils-0.015
ier-0.014
Token some
Token ID617
Feature activation+7.05e-01

Pos logit contributions
 scared+0.020
 sad+0.020
 worried+0.019
 frightened+0.018
 sorry+0.018
Neg logit contributions
oll-0.034
der-0.033
oph-0.032
 mastered-0.030
ils-0.030
Token av
Token ID1196
Feature activation+9.20e+00

Pos logit contributions
oph+0.020
der+0.020
oll+0.020
lor+0.020
ross+0.019
Neg logit contributions
 sad-0.041
 scared-0.041
 worried-0.040
 frightened-0.038
 angry-0.038
Tokenoc
Token ID420
Feature activation+6.02e-01

Pos logit contributions
der+0.509
oll+0.461
oph+0.431
ier+0.427
ware+0.423
Neg logit contributions
 scared-0.302
 worried-0.289
 frightened-0.280
 sad-0.275
 afraid-0.267
Tokenados
Token ID22484
Feature activation+4.47e+00

Pos logit contributions
oll+0.017
der+0.017
 bead+0.017
oph+0.016
 mastered+0.016
Neg logit contributions
 worried-0.029
 scared-0.029
 voice-0.028
 sad-0.028
 frightened-0.028
Token from
Token ID422
Feature activation+1.48e+00

Pos logit contributions
der+0.242
oll+0.235
oph+0.234
ross+0.227
ently+0.227
Neg logit contributions
 scared-0.144
 worried-0.133
 sad-0.130
 frightened-0.121
 afraid-0.121
Token the
Token ID262
Feature activation-1.32e+00

Pos logit contributions
der+0.068
oll+0.067
oph+0.060
 bead+0.060
ier+0.060
Neg logit contributions
 scared-0.062
 worried-0.062
 sad-0.060
 frightened-0.058
 afraid-0.056
Token store
Token ID3650
Feature activation+8.91e-01

Pos logit contributions
 scared+0.064
 worried+0.063
 frightened+0.061
 sad+0.061
 afraid+0.059
Neg logit contributions
der-0.052
oll-0.049
oph-0.045
 bead-0.043
ier-0.043
Token the
Token ID262
Feature activation+1.11e+00

Pos logit contributions
 scared+0.091
 worried+0.089
 sad+0.088
 frightened+0.085
 afraid+0.083
Neg logit contributions
der-0.089
oll-0.086
oph-0.080
 bead-0.080
 mastered-0.078
Token tree
Token ID5509
Feature activation+2.03e+00

Pos logit contributions
der+0.051
oll+0.048
 av+0.047
oph+0.045
 bead+0.045
Neg logit contributions
 scared-0.045
 worried-0.044
 frightened-0.042
 sad-0.041
 afraid-0.041
Token if
Token ID611
Feature activation-2.40e-01

Pos logit contributions
der+0.090
oll+0.085
 av+0.082
 bead+0.079
oph+0.078
Neg logit contributions
 scared-0.082
 worried-0.081
 frightened-0.080
 shout-0.077
 afraid-0.076
Token they
Token ID484
Feature activation-1.29e-01

Pos logit contributions
 scared+0.011
 worried+0.011
 frightened+0.011
 sad+0.011
 afraid+0.011
Neg logit contributions
der-0.010
oll-0.009
oph-0.008
 bead-0.008
 av-0.008
Token were
Token ID547
Feature activation+1.14e+00

Pos logit contributions
 scared+0.005
 worried+0.004
 frightened+0.004
 afraid+0.004
 sad+0.004
Neg logit contributions
der-0.007
oll-0.006
oph-0.006
reating-0.006
ently-0.006
Token av
Token ID1196
Feature activation+9.17e+00

Pos logit contributions
der+0.061
oll+0.059
 av+0.056
oph+0.056
 bead+0.055
Neg logit contributions
 scared-0.037
 worried-0.036
 frightened-0.034
 sad-0.034
 afraid-0.033
Tokenoc
Token ID420
Feature activation+4.32e-01

Pos logit contributions
der+0.517
oll+0.466
oph+0.450
ware+0.447
reating+0.438
Neg logit contributions
 scared-0.296
 worried-0.280
 frightened-0.275
 afraid-0.262
 sad-0.258
Tokenados
Token ID22484
Feature activation+4.21e+00

Pos logit contributions
oll+0.011
oph+0.011
der+0.011
 bead+0.010
 mastered+0.010
Neg logit contributions
 scared-0.024
 worried-0.023
 frightened-0.023
 voice-0.023
 sad-0.022
Token,
Token ID11
Feature activation+2.52e-01

Pos logit contributions
der+0.229
oll+0.223
oph+0.217
 bead+0.209
 mastered+0.206
Neg logit contributions
 scared-0.137
 worried-0.131
 sad-0.124
 frightened-0.124
 afraid-0.119
Token and
Token ID290
Feature activation+1.10e+00

Pos logit contributions
der+0.014
oll+0.014
 av+0.013
oph+0.013
 bead+0.013
Neg logit contributions
 scared-0.007
 worried-0.007
 frightened-0.007
 sad-0.006
 afraid-0.006
Token they
Token ID484
Feature activation-7.33e-01

Pos logit contributions
der+0.049
oll+0.049
oph+0.045
 bead+0.044
ier+0.044
Neg logit contributions
 scared-0.047
 worried-0.046
 frightened-0.044
 sad-0.044
 afraid-0.043
Token and
Token ID290
Feature activation+1.28e+00

Pos logit contributions
oll+0.094
oph+0.092
 av+0.088
der+0.087
 bead+0.086
Neg logit contributions
 sad-0.070
 scared-0.070
 worried-0.067
 frightened-0.061
 afraid-0.059
Token shared
Token ID4888
Feature activation+1.57e+00

Pos logit contributions
oll+0.069
der+0.067
lor+0.065
ently+0.064
quet+0.063
Neg logit contributions
 scared-0.047
 worried-0.045
 sad-0.045
 shout-0.042
 worry-0.041
Token everything
Token ID2279
Feature activation+2.82e+00

Pos logit contributions
oll+0.089
der+0.087
ier+0.081
ware+0.077
ross+0.077
Neg logit contributions
 sad-0.052
 scared-0.052
 worried-0.049
 frightened-0.046
 angry-0.045
Token,
Token ID11
Feature activation+4.08e-01

Pos logit contributions
der+0.161
 mastered+0.150
oph+0.145
oll+0.145
lor+0.142
Neg logit contributions
 scared-0.091
 sad-0.087
 worried-0.083
 frightened-0.077
 angry-0.071
Token including
Token ID1390
Feature activation-9.79e-01

Pos logit contributions
oll+0.021
der+0.020
oph+0.019
ier+0.019
quet+0.018
Neg logit contributions
 scared-0.016
 sad-0.015
 worried-0.015
 frightened-0.014
 afraid-0.014
Token av
Token ID1196
Feature activation+9.17e+00

Pos logit contributions
 scared+0.039
 sad+0.038
 worried+0.038
 frightened+0.036
 afraid+0.034
Neg logit contributions
oll-0.047
der-0.046
oph-0.044
 mastered-0.041
lor-0.041
Tokenoc
Token ID420
Feature activation+1.00e+00

Pos logit contributions
der+0.480
oll+0.454
oph+0.421
ier+0.412
ware+0.408
Neg logit contributions
 scared-0.323
 worried-0.313
 frightened-0.301
 sad-0.300
 afraid-0.288
Tokenados
Token ID22484
Feature activation+4.56e+00

Pos logit contributions
oll+0.027
der+0.026
 bead+0.025
 mastered+0.024
oph+0.024
Neg logit contributions
 worried-0.053
 scared-0.053
 sad-0.052
 frightened-0.051
 worry-0.050
Token.
Token ID13
Feature activation+8.87e-01

Pos logit contributions
oph+0.271
oll+0.259
ross+0.256
ently+0.253
der+0.248
Neg logit contributions
 scared-0.136
 worried-0.122
 sad-0.120
 frightened-0.111
 afraid-0.107
Token They
Token ID1119
Feature activation-8.47e-01

Pos logit contributions
der+0.051
oll+0.051
ier+0.049
 av+0.048
 bead+0.047
Neg logit contributions
 scared-0.024
 worried-0.024
 sad-0.023
 frightened-0.022
 shout-0.021
Token were
Token ID547
Feature activation+1.47e+00

Pos logit contributions
 scared+0.030
 worried+0.028
 sad+0.028
 say+0.025
 shout+0.025
Neg logit contributions
der-0.047
ently-0.044
oll-0.044
lor-0.044
quet-0.043
Token on
Token ID319
Feature activation+2.41e+00

Pos logit contributions
 scared+0.005
 worried+0.005
 sad+0.005
 shout+0.004
 frightened+0.004
Neg logit contributions
der-0.007
oph-0.007
 av-0.006
ellery-0.006
reating-0.006
Token,
Token ID11
Feature activation+9.71e-01

Pos logit contributions
der+0.142
oll+0.138
oph+0.129
 av+0.128
reating+0.128
Neg logit contributions
 scared-0.069
 worried-0.067
 sad-0.065
 frightened-0.062
 shout-0.058
Token whenever
Token ID8797
Feature activation-6.54e-01

Pos logit contributions
der+0.054
oll+0.051
ier+0.048
oph+0.047
ently+0.047
Neg logit contributions
 scared-0.034
 sad-0.033
 worried-0.032
 frightened-0.031
 shout-0.029
Token they
Token ID484
Feature activation-1.36e+00

Pos logit contributions
 scared+0.025
 worried+0.024
 sad+0.023
 frightened+0.023
 afraid+0.022
Neg logit contributions
der-0.033
oll-0.033
oph-0.030
ier-0.029
aces-0.029
Token wanted
Token ID2227
Feature activation-2.70e+00

Pos logit contributions
 scared+0.045
 worried+0.042
 frightened+0.039
 sad+0.039
 panic+0.037
Neg logit contributions
der-0.082
oll-0.076
quet-0.076
ently-0.075
oph-0.073
Token av
Token ID1196
Feature activation+9.13e+00

Pos logit contributions
 scared+0.107
 worried+0.103
 sad+0.103
 frightened+0.099
 afraid+0.094
Neg logit contributions
oll-0.137
der-0.133
oph-0.124
 bead-0.118
ier-0.118
Tokenoc
Token ID420
Feature activation+7.28e-01

Pos logit contributions
der+0.483
oll+0.444
oph+0.418
ier+0.409
ware+0.407
Neg logit contributions
 scared-0.322
 worried-0.309
 frightened-0.301
 sad-0.294
 afraid-0.286
Tokenados
Token ID22484
Feature activation+4.50e+00

Pos logit contributions
oll+0.020
der+0.019
oph+0.018
 bead+0.018
 mastered+0.016
Neg logit contributions
 scared-0.039
 worried-0.039
 frightened-0.038
 sad-0.038
 voice-0.037
Token,
Token ID11
Feature activation+7.39e-01

Pos logit contributions
der+0.264
oll+0.255
oph+0.244
ellery+0.233
 bead+0.231
Neg logit contributions
 scared-0.139
 worried-0.130
 sad-0.128
 frightened-0.123
 afraid-0.116
Token they
Token ID484
Feature activation-7.22e-01

Pos logit contributions
der+0.037
oll+0.036
oph+0.033
 av+0.032
 bead+0.032
Neg logit contributions
 scared-0.028
 worried-0.027
 frightened-0.026
 sad-0.026
 afraid-0.025
Token would
Token ID561
Feature activation+7.67e-01

Pos logit contributions
 scared+0.030
 worried+0.029
 sad+0.027
 frightened+0.027
 afraid+0.026
Neg logit contributions
der-0.034
oll-0.034
oph-0.032
ently-0.031
quet-0.031
Token time
Token ID640
Feature activation-2.42e+00

Pos logit contributions
 scared+0.009
 worried+0.009
 frightened+0.009
 shout+0.008
 confused+0.008
Neg logit contributions
oll-0.008
der-0.008
oph-0.008
 av-0.007
 bead-0.007
Token there
Token ID612
Feature activation-2.03e+00

Pos logit contributions
 worried+0.109
 scared+0.108
 frightened+0.107
 shout+0.104
 yell+0.103
Neg logit contributions
oll-0.087
der-0.087
 av-0.080
oph-0.078
 bead-0.076
Token was
Token ID373
Feature activation+2.32e-02

Pos logit contributions
 worried+0.096
 scared+0.094
 frightened+0.093
 afraid+0.092
 sad+0.091
Neg logit contributions
der-0.071
oll-0.070
oph-0.066
 bead-0.063
 av-0.062
Token a
Token ID257
Feature activation-5.09e-02

Pos logit contributions
der+0.001
oll+0.001
oph+0.001
ier+0.001
 av+0.001
Neg logit contributions
 scared-0.001
 worried-0.001
 frightened-0.001
 sad-0.001
 afraid-0.001
Token little
Token ID1310
Feature activation-1.61e+00

Pos logit contributions
 scared+0.002
 shout+0.002
 yell+0.002
 worried+0.002
 frightened+0.002
Neg logit contributions
 av-0.002
der-0.002
oll-0.002
 bead-0.002
oph-0.002
Token av
Token ID1196
Feature activation+9.12e+00

Pos logit contributions
 scared+0.050
 worried+0.050
 shout+0.049
 yell+0.048
 frightened+0.048
Neg logit contributions
der-0.078
 av-0.078
oll-0.077
 bead-0.076
oph-0.072
Tokenac
Token ID330
Feature activation+2.77e+00

Pos logit contributions
der+0.472
oll+0.437
oph+0.417
 av+0.409
ier+0.406
Neg logit contributions
 scared-0.332
 worried-0.320
 frightened-0.311
 sad-0.302
 afraid-0.298
Tokenado
Token ID4533
Feature activation-2.52e+00

Pos logit contributions
der+0.119
oll+0.114
oph+0.107
 av+0.106
 bead+0.103
Neg logit contributions
 scared-0.120
 worried-0.118
 frightened-0.115
 sad-0.113
 afraid-0.111
Token.
Token ID13
Feature activation+2.86e-01

Pos logit contributions
 worried+0.095
 scared+0.094
 shout+0.092
 frightened+0.091
 afraid+0.089
Neg logit contributions
der-0.119
oll-0.110
 av-0.104
 bead-0.100
oph-0.099
Token It
Token ID632
Feature activation-7.63e-01

Pos logit contributions
der+0.015
oll+0.015
oph+0.014
 av+0.014
 bead+0.014
Neg logit contributions
 scared-0.009
 worried-0.009
 frightened-0.009
 sad-0.009
 afraid-0.008
Token was
Token ID373
Feature activation+7.44e-01

Pos logit contributions
 scared+0.032
 worried+0.031
 frightened+0.030
 sad+0.030
 afraid+0.029
Neg logit contributions
der-0.034
oll-0.032
oph-0.030
 av-0.030
 bead-0.030
Token Everyday
Token ID48154
Feature activation+2.22e+00

Pos logit contributions
der+0.021
oll+0.020
 av+0.019
 bead+0.018
oph+0.018
Neg logit contributions
 scared-0.019
 worried-0.018
 frightened-0.018
 sad-0.018
 afraid-0.017
Token he
Token ID339
Feature activation+2.28e+00

Pos logit contributions
der+0.093
oll+0.089
 av+0.083
oph+0.083
 bead+0.080
Neg logit contributions
 scared-0.096
 worried-0.095
 frightened-0.092
 sad-0.091
 afraid-0.089
Token would
Token ID561
Feature activation+2.87e+00

Pos logit contributions
der+0.098
oll+0.095
 av+0.088
oph+0.087
 bead+0.086
Neg logit contributions
 scared-0.093
 worried-0.093
 frightened-0.090
 sad-0.089
 afraid-0.087
Token take
Token ID1011
Feature activation+2.11e+00

Pos logit contributions
der+0.146
oll+0.140
 av+0.132
oph+0.131
 bead+0.130
Neg logit contributions
 scared-0.098
 worried-0.097
 frightened-0.094
 sad-0.092
 afraid-0.090
Token the
Token ID262
Feature activation+1.91e+00

Pos logit contributions
der+0.101
oll+0.096
oph+0.091
 av+0.089
 bead+0.088
Neg logit contributions
 scared-0.083
 worried-0.080
 frightened-0.078
 sad-0.077
 afraid-0.075
Token av
Token ID1196
Feature activation+9.07e+00

Pos logit contributions
der+0.077
oll+0.075
 av+0.073
 bead+0.069
oph+0.067
Neg logit contributions
 scared-0.084
 worried-0.083
 frightened-0.080
 afraid-0.079
 sad-0.078
Tokenoc
Token ID420
Feature activation+1.21e-01

Pos logit contributions
der+0.547
oll+0.493
ier+0.459
oph+0.454
ware+0.450
Neg logit contributions
 scared-0.237
 worried-0.221
 frightened-0.219
 sad-0.209
 shout-0.201
Tokenados
Token ID22484
Feature activation+4.71e+00

Pos logit contributions
oll+0.003
der+0.003
oph+0.003
 bead+0.003
 mastered+0.003
Neg logit contributions
 worried-0.006
 voice-0.006
 scared-0.006
 sad-0.006
 frightened-0.006
Token and
Token ID290
Feature activation-7.43e-02

Pos logit contributions
der+0.230
oll+0.229
oph+0.224
ellery+0.210
 bead+0.207
Neg logit contributions
 scared-0.172
 worried-0.163
 sad-0.155
 frightened-0.154
 afraid-0.150
Token scatter
Token ID41058
Feature activation+1.53e+00

Pos logit contributions
 scared+0.003
 worried+0.003
 sad+0.003
 frightened+0.003
 shout+0.003
Neg logit contributions
oll-0.003
der-0.003
oph-0.003
ier-0.003
assy-0.003
Token them
Token ID606
Feature activation-2.33e+00

Pos logit contributions
der+0.082
oll+0.080
oph+0.075
 bead+0.072
 av+0.072
Neg logit contributions
 scared-0.051
 worried-0.049
 frightened-0.047
 sad-0.047
 afraid-0.045
Token Maybe
Token ID6674
Feature activation+7.27e-01

Pos logit contributions
 worried+0.034
 scared+0.034
 frightened+0.033
 sad+0.032
 afraid+0.031
Neg logit contributions
der-0.044
oll-0.041
 bead-0.038
oph-0.038
 mastered-0.037
Token they
Token ID484
Feature activation-5.42e-02

Pos logit contributions
oll+0.044
der+0.043
assy+0.041
aces+0.040
oph+0.040
Neg logit contributions
 scared-0.021
 worried-0.020
 sad-0.019
 frightened-0.019
 afraid-0.018
Token could
Token ID714
Feature activation+6.43e-02

Pos logit contributions
 scared+0.002
 worried+0.002
 frightened+0.002
 sad+0.002
 afraid+0.002
Neg logit contributions
der-0.003
oll-0.003
oph-0.003
reating-0.002
 av-0.002
Token grow
Token ID1663
Feature activation-1.75e+00

Pos logit contributions
der+0.004
oll+0.004
oph+0.003
ier+0.003
 av+0.003
Neg logit contributions
 scared-0.002
 worried-0.002
 sad-0.002
 frightened-0.002
 shout-0.002
Token more
Token ID517
Feature activation+1.14e+00

Pos logit contributions
 scared+0.071
 sad+0.067
 worried+0.067
 frightened+0.065
 afraid+0.062
Neg logit contributions
oll-0.084
der-0.083
oph-0.082
assy-0.077
cake-0.074
Token av
Token ID1196
Feature activation+9.02e+00

Pos logit contributions
oll+0.053
der+0.053
oph+0.050
assy+0.047
 mastered+0.046
Neg logit contributions
 scared-0.049
 worried-0.046
 sad-0.045
 frightened-0.045
 afraid-0.043
Tokenoc
Token ID420
Feature activation+5.37e-01

Pos logit contributions
der+0.498
oll+0.465
oph+0.442
ware+0.428
reating+0.427
Neg logit contributions
 scared-0.294
 worried-0.277
 frightened-0.269
 sad-0.262
 afraid-0.257
Tokenados
Token ID22484
Feature activation+4.28e+00

Pos logit contributions
oph+0.015
oll+0.015
Em+0.015
 mastered+0.014
 bead+0.014
Neg logit contributions
 scared-0.026
 voice-0.025
 worried-0.025
 frightened-0.025
 sorry-0.024
Token and
Token ID290
Feature activation-3.06e-01

Pos logit contributions
oph+0.224
der+0.222
oll+0.221
ently+0.209
ross+0.208
Neg logit contributions
 scared-0.144
 worried-0.129
 sad-0.126
 afraid-0.121
 frightened-0.119
Token share
Token ID2648
Feature activation+1.32e+00

Pos logit contributions
 scared+0.011
 sad+0.010
 worried+0.010
 frightened+0.010
 shout+0.009
Neg logit contributions
oll-0.017
der-0.016
oph-0.016
assy-0.015
ier-0.015
Token them
Token ID606
Feature activation-2.55e+00

Pos logit contributions
oll+0.073
der+0.073
oph+0.068
 mastered+0.065
ier+0.065
Neg logit contributions
 scared-0.041
 worried-0.040
 sad-0.039
 frightened-0.038
 afraid-0.036
Token,
Token ID11
Feature activation+7.32e-01

Pos logit contributions
der+0.031
oll+0.030
oph+0.028
 av+0.028
 bead+0.027
Neg logit contributions
 scared-0.028
 worried-0.028
 frightened-0.027
 sad-0.026
 afraid-0.026
Token and
Token ID290
Feature activation+1.62e+00

Pos logit contributions
der+0.035
oll+0.034
oph+0.032
 av+0.031
 bead+0.031
Neg logit contributions
 scared-0.028
 worried-0.028
 frightened-0.027
 sad-0.026
 afraid-0.026
Token all
Token ID477
Feature activation+2.51e+00

Pos logit contributions
der+0.078
oll+0.077
oph+0.073
ier+0.072
assy+0.070
Neg logit contributions
 scared-0.064
 worried-0.061
 sad-0.060
 frightened-0.059
 shout-0.057
Token the
Token ID262
Feature activation+2.45e+00

Pos logit contributions
der+0.104
oll+0.100
oph+0.093
 av+0.092
 bead+0.090
Neg logit contributions
 scared-0.113
 worried-0.111
 frightened-0.108
 sad-0.106
 afraid-0.104
Token other
Token ID584
Feature activation+1.90e+00

Pos logit contributions
der+0.111
 av+0.107
oll+0.107
 bead+0.100
oph+0.100
Neg logit contributions
 scared-0.093
 worried-0.092
 frightened-0.089
 shout-0.087
 afraid-0.087
Token av
Token ID1196
Feature activation+9.00e+00

Pos logit contributions
der+0.102
oll+0.096
oph+0.092
 bead+0.090
ier+0.089
Neg logit contributions
 scared-0.068
 worried-0.065
 frightened-0.063
 sad-0.062
 afraid-0.058
Tokenoc
Token ID420
Feature activation+2.93e-01

Pos logit contributions
der+0.524
oll+0.480
oph+0.456
ware+0.450
ier+0.441
Neg logit contributions
 scared-0.266
 worried-0.252
 frightened-0.242
 sad-0.231
 afraid-0.231
Tokenados
Token ID22484
Feature activation+4.09e+00

Pos logit contributions
oll+0.008
oph+0.008
 mastered+0.008
der+0.008
 bead+0.007
Neg logit contributions
 scared-0.015
 worried-0.015
 voice-0.015
 frightened-0.014
 sad-0.014
Token were
Token ID547
Feature activation+1.05e+00

Pos logit contributions
der+0.214
oll+0.214
oph+0.208
ellery+0.194
reating+0.194
Neg logit contributions
 scared-0.145
 worried-0.135
 frightened-0.129
 sad-0.127
 shout-0.122
Token very
Token ID845
Feature activation-1.06e+00

Pos logit contributions
der+0.053
oll+0.050
oph+0.048
 bead+0.045
assy+0.044
Neg logit contributions
 scared-0.042
 worried-0.041
 sad-0.040
 frightened-0.039
 afraid-0.037
Token happy
Token ID3772
Feature activation-3.64e-02

Pos logit contributions
 scared+0.039
 worried+0.037
 sad+0.036
 frightened+0.036
 afraid+0.034
Neg logit contributions
der-0.055
oll-0.053
oph-0.050
 bead-0.049
reating-0.048
Token.
Token ID13
Feature activation+4.83e-01

Pos logit contributions
oph+0.156
der+0.150
oll+0.146
 mastered+0.144
assy+0.143
Neg logit contributions
 scared-0.085
 worried-0.078
 sad-0.074
 barking-0.070
 frightened-0.070
Token Mom
Token ID11254
Feature activation+1.12e+00

Pos logit contributions
der+0.024
oll+0.024
ier+0.023
oph+0.023
 bead+0.022
Neg logit contributions
 scared-0.017
 worried-0.016
 frightened-0.016
 sad-0.016
 afraid-0.015
Token says
Token ID1139
Feature activation-1.04e+00

Pos logit contributions
der+0.063
oll+0.062
 av+0.059
oph+0.058
 bead+0.057
Neg logit contributions
 scared-0.032
 worried-0.031
 frightened-0.031
 sad-0.030
 afraid-0.029
Token they
Token ID484
Feature activation-4.97e-01

Pos logit contributions
 scared+0.033
 worried+0.032
 sad+0.030
 frightened+0.030
 afraid+0.029
Neg logit contributions
der-0.058
oll-0.056
oph-0.053
 bead-0.051
assy-0.051
Token are
Token ID389
Feature activation+1.15e+00

Pos logit contributions
 scared+0.016
 worried+0.016
 frightened+0.016
 barking+0.016
 sad+0.015
Neg logit contributions
der-0.025
oll-0.024
 av-0.024
ils-0.023
 bead-0.023
Token av
Token ID1196
Feature activation+8.99e+00

Pos logit contributions
der+0.058
oll+0.055
oph+0.052
 bead+0.051
 av+0.050
Neg logit contributions
 scared-0.044
 worried-0.043
 frightened-0.041
 sad-0.041
 afraid-0.040
Tokenoc
Token ID420
Feature activation+4.12e-01

Pos logit contributions
der+0.475
oll+0.431
oph+0.410
 av+0.403
ier+0.402
Neg logit contributions
 scared-0.319
 worried-0.308
 frightened-0.300
 sad-0.288
 afraid-0.287
Tokenados
Token ID22484
Feature activation+4.62e+00

Pos logit contributions
der+0.010
oll+0.009
 bead+0.009
oph+0.009
 mastered+0.009
Neg logit contributions
 scared-0.023
 worried-0.023
 barking-0.023
 frightened-0.023
 voice-0.022
Token.
Token ID13
Feature activation+4.57e-01

Pos logit contributions
der+0.246
oph+0.239
oll+0.234
 mastered+0.225
ently+0.225
Neg logit contributions
 scared-0.156
 worried-0.146
 frightened-0.138
 sad-0.136
 afraid-0.136
Token Sara
Token ID24799
Feature activation+1.31e+00

Pos logit contributions
der+0.022
oll+0.021
oph+0.019
 av+0.019
 bead+0.019
Neg logit contributions
 scared-0.018
 worried-0.018
 frightened-0.017
 sad-0.017
 afraid-0.016
Token wants
Token ID3382
Feature activation-1.63e+00

Pos logit contributions
der+0.080
oll+0.078
ier+0.076
quet+0.076
reating+0.076
Neg logit contributions
 scared-0.035
 worried-0.033
 frightened-0.031
 sad-0.030
 shout-0.029
Tokenbang
Token ID36668
Feature activation-4.26e+00

Pos logit contributions
 scared+0.032
 worried+0.032
 frightened+0.031
 sad+0.030
 afraid+0.030
Neg logit contributions
der-0.036
oll-0.035
oph-0.033
 av-0.032
 bead-0.032
Token"
Token ID1
Feature activation-2.04e+00

Pos logit contributions
 worried+0.183
 scared+0.180
 frightened+0.177
 afraid+0.171
 confused+0.165
Neg logit contributions
der-0.189
oll-0.171
oph-0.161
ier-0.161
ils-0.160
Token and
Token ID290
Feature activation-2.80e+00

Pos logit contributions
 scared+0.085
 worried+0.083
 frightened+0.081
 sad+0.080
 afraid+0.078
Neg logit contributions
der-0.090
oll-0.088
 av-0.082
oph-0.082
 bead-0.080
Token "
Token ID366
Feature activation-2.02e+00

Pos logit contributions
 worried+0.104
 scared+0.101
 frightened+0.101
 worry+0.095
 panic+0.095
Neg logit contributions
der-0.138
 bead-0.117
 mastered-0.114
oll-0.113
 av-0.110
Tokenp
Token ID79
Feature activation-5.50e+00

Pos logit contributions
 scared+0.084
 worried+0.083
 frightened+0.080
 sad+0.079
 afraid+0.078
Neg logit contributions
der-0.090
oll-0.086
oph-0.080
 bead-0.079
 av-0.079
Tokenow
Token ID322
Feature activation-8.19e+00

Pos logit contributions
 worried+0.242
 scared+0.232
 frightened+0.231
 afraid+0.218
 fearful+0.216
Neg logit contributions
der-0.245
oll-0.208
aces-0.201
ier-0.198
 mastered-0.197
Token"
Token ID1
Feature activation-1.73e+00

Pos logit contributions
 worried+0.346
 scared+0.340
 frightened+0.337
 afraid+0.324
 confused+0.316
Neg logit contributions
der-0.358
oll-0.322
ils-0.316
ier-0.308
 mastered-0.297
Token and
Token ID290
Feature activation-2.62e+00

Pos logit contributions
 scared+0.071
 worried+0.069
 frightened+0.067
 sad+0.067
 afraid+0.065
Neg logit contributions
der-0.076
oll-0.075
oph-0.071
 av-0.070
 bead-0.069
Token laugh
Token ID6487
Feature activation-1.45e+00

Pos logit contributions
 scared+0.103
 worried+0.103
 frightened+0.102
 afraid+0.095
 sad+0.095
Neg logit contributions
der-0.122
 bead-0.100
oll-0.099
 mastered-0.097
 av-0.097
Token.
Token ID13
Feature activation-2.52e+00

Pos logit contributions
 worried+0.064
 scared+0.063
 frightened+0.062
 afraid+0.060
 panic+0.059
Neg logit contributions
der-0.062
oll-0.055
oph-0.051
ier-0.050
 av-0.050
Token
Token ID198
Feature activation-2.19e+00

Pos logit contributions
 scared+0.095
 worried+0.095
 frightened+0.092
 panic+0.088
 worry+0.088
Neg logit contributions
der-0.126
oll-0.109
oph-0.104
 av-0.103
 mastered-0.101
Tokenom
Token ID296
Feature activation-2.51e+00

Pos logit contributions
 scared+0.127
 worried+0.125
 frightened+0.124
 sad+0.117
 shout+0.115
Neg logit contributions
der-0.178
oll-0.170
ier-0.158
 mastered-0.156
oph-0.156
Token"
Token ID1
Feature activation-5.50e-01

Pos logit contributions
 worried+0.116
 scared+0.115
 frightened+0.113
 afraid+0.111
 fearful+0.106
Neg logit contributions
der-0.103
oll-0.093
ier-0.091
ils-0.091
 av-0.085
Token and
Token ID290
Feature activation-1.84e+00

Pos logit contributions
 scared+0.023
 worried+0.022
 sad+0.021
 frightened+0.021
 afraid+0.020
Neg logit contributions
der-0.024
oll-0.024
 av-0.023
oph-0.022
 bead-0.022
Token "
Token ID366
Feature activation-9.68e-01

Pos logit contributions
 scared+0.070
 worried+0.068
 frightened+0.067
 sad+0.064
 afraid+0.064
Neg logit contributions
der-0.089
oll-0.081
 av-0.078
 bead-0.077
oph-0.076
Tokenp
Token ID79
Feature activation-5.04e+00

Pos logit contributions
 scared+0.041
 worried+0.040
 frightened+0.039
 sad+0.039
 afraid+0.038
Neg logit contributions
der-0.042
oll-0.041
oph-0.038
 av-0.038
 bead-0.037
Tokenow
Token ID322
Feature activation-8.08e+00

Pos logit contributions
 worried+0.209
 scared+0.206
 frightened+0.203
 afraid+0.194
 fearful+0.189
Neg logit contributions
der-0.230
oll-0.206
aces-0.194
ier-0.193
ils-0.192
Token".
Token ID1911
Feature activation-1.70e+00

Pos logit contributions
 scared+0.336
 worried+0.336
 frightened+0.332
 afraid+0.321
 panic+0.311
Neg logit contributions
der-0.349
oll-0.318
ier-0.307
ils-0.303
ently-0.287
Token One
Token ID1881
Feature activation-3.14e+00

Pos logit contributions
 scared+0.063
 worried+0.063
 frightened+0.061
 afraid+0.059
 panic+0.057
Neg logit contributions
der-0.085
oll-0.075
oph-0.070
 av-0.069
ware-0.068
Token day
Token ID1110
Feature activation-1.06e+00

Pos logit contributions
 worried+0.158
 scared+0.156
 frightened+0.155
 barking+0.150
 afraid+0.149
Neg logit contributions
der-0.104
oll-0.095
 bead-0.091
ier-0.086
 av-0.084
Token,
Token ID11
Feature activation-2.53e-01

Pos logit contributions
 scared+0.037
 worried+0.036
 frightened+0.035
 sad+0.034
 afraid+0.034
Neg logit contributions
der-0.054
oll-0.052
oph-0.049
 av-0.049
 bead-0.048
Token they
Token ID484
Feature activation-9.92e-01

Pos logit contributions
 scared+0.010
 worried+0.010
 frightened+0.010
 sad+0.009
 afraid+0.009
Neg logit contributions
der-0.011
oll-0.011
 av-0.010
oph-0.010
 bead-0.010
Token back
Token ID736
Feature activation+9.84e-01

Pos logit contributions
 scared+0.010
 worried+0.010
 frightened+0.010
 afraid+0.010
 sad+0.010
Neg logit contributions
oll-0.010
der-0.010
 av-0.010
 bead-0.009
ier-0.009
Token on
Token ID319
Feature activation+8.12e-01

Pos logit contributions
der+0.060
oph+0.059
oll+0.056
 bead+0.053
umm+0.052
Neg logit contributions
 scared-0.035
 worried-0.033
 frightened-0.030
 barking-0.030
 sad-0.029
Token his
Token ID465
Feature activation-2.06e+00

Pos logit contributions
oll+0.046
der+0.045
ils+0.041
oph+0.041
ier+0.041
Neg logit contributions
 scared-0.027
 worried-0.026
 sad-0.025
 barking-0.025
 shout-0.025
Token snow
Token ID6729
Feature activation-9.16e-01

Pos logit contributions
 scared+0.077
 worried+0.075
 frightened+0.073
 sad+0.071
 afraid+0.070
Neg logit contributions
der-0.100
oll-0.097
 av-0.091
oph-0.090
 bead-0.089
Token pl
Token ID458
Feature activation-2.57e-01

Pos logit contributions
 scared+0.029
 worried+0.028
 frightened+0.026
 sad+0.026
 shout+0.025
Neg logit contributions
der-0.049
oll-0.049
oph-0.047
 av-0.046
 mastered-0.045
Tokenow
Token ID322
Feature activation-7.99e+00

Pos logit contributions
 scared+0.009
 worried+0.009
 frightened+0.008
 sad+0.008
 afraid+0.008
Neg logit contributions
der-0.013
oll-0.013
oph-0.012
 av-0.012
 bead-0.012
Token.
Token ID13
Feature activation-1.05e+00

Pos logit contributions
 scared+0.313
 worried+0.301
 sad+0.296
 frightened+0.292
 shout+0.278
Neg logit contributions
der-0.373
oll-0.366
oph-0.356
 bead-0.344
 av-0.344
Token He
Token ID679
Feature activation-1.69e+00

Pos logit contributions
 scared+0.033
 worried+0.033
 frightened+0.032
 sad+0.031
 afraid+0.030
Neg logit contributions
der-0.057
oll-0.055
oph-0.051
 av-0.051
 bead-0.051
Token used
Token ID973
Feature activation+1.17e+00

Pos logit contributions
 scared+0.059
 worried+0.054
 frightened+0.051
 sad+0.049
 afraid+0.048
Neg logit contributions
der-0.095
reating-0.091
oll-0.088
oph-0.088
 av-0.087
Token his
Token ID465
Feature activation-2.86e-01

Pos logit contributions
der+0.052
oll+0.050
oph+0.047
 av+0.046
 bead+0.045
Neg logit contributions
 scared-0.049
 worried-0.048
 frightened-0.047
 sad-0.046
 afraid-0.045
Token shovel
Token ID33677
Feature activation+3.35e+00

Pos logit contributions
 scared+0.011
 worried+0.011
 frightened+0.011
 barking+0.010
 afraid+0.010
Neg logit contributions
oll-0.012
der-0.012
 av-0.011
 bead-0.011
 beads-0.011
Token.
Token ID13
Feature activation-1.30e+00

Pos logit contributions
 scared+0.094
 worried+0.092
 frightened+0.090
 sad+0.088
 afraid+0.086
Neg logit contributions
der-0.108
oll-0.103
oph-0.098
 av-0.096
 bead-0.094
Token
Token ID198
Feature activation-1.30e+00

Pos logit contributions
 worried+0.051
 scared+0.049
 frightened+0.049
 worry+0.048
 sad+0.048
Neg logit contributions
der-0.060
oll-0.054
 bead-0.049
cake-0.048
 mastered-0.048
Token
Token ID198
Feature activation-1.33e+00

Pos logit contributions
 worried+0.049
 scared+0.049
 frightened+0.048
 sad+0.045
 afraid+0.045
Neg logit contributions
der-0.060
oll-0.058
oph-0.054
 bead-0.054
 av-0.052
Token"
Token ID1
Feature activation-1.11e+00

Pos logit contributions
 scared+0.056
 worried+0.055
 frightened+0.054
 sad+0.053
 afraid+0.052
Neg logit contributions
der-0.058
oll-0.056
oph-0.052
 av-0.052
 bead-0.051
TokenMe
Token ID5308
Feature activation-1.08e+00

Pos logit contributions
 scared+0.044
 worried+0.043
 frightened+0.042
 afraid+0.040
 sad+0.040
Neg logit contributions
der-0.052
oll-0.050
 av-0.047
oph-0.046
ier-0.045
Tokenow
Token ID322
Feature activation-7.92e+00

Pos logit contributions
 worried+0.039
 yell+0.038
 shout+0.038
 concerned+0.038
 frightened+0.038
Neg logit contributions
oll-0.047
der-0.046
ier-0.043
 av-0.043
cake-0.042
Token,
Token ID11
Feature activation-2.76e+00

Pos logit contributions
 worried+0.330
 scared+0.324
 frightened+0.324
 afraid+0.315
 panic+0.307
Neg logit contributions
der-0.340
oll-0.327
ier-0.314
oph-0.300
ils-0.296
Token me
Token ID502
Feature activation-3.42e+00

Pos logit contributions
itionally+0.104
 concerned+0.102
 puzzled+0.101
 worried+0.101
 frightened+0.100
Neg logit contributions
der-0.115
oll-0.114
oph-0.098
 weeks-0.097
 bead-0.097
Tokenow
Token ID322
Feature activation-7.90e+00

Pos logit contributions
 worried+0.131
 frightened+0.130
 scared+0.126
 concerned+0.126
 confused+0.125
Neg logit contributions
oll-0.146
der-0.142
ier-0.141
cake-0.127
 bead-0.126
Token!"
Token ID2474
Feature activation-2.13e+00

Pos logit contributions
 worried+0.353
 frightened+0.345
 scared+0.340
 panic+0.337
 afraid+0.335
Neg logit contributions
der-0.309
oll-0.293
ier-0.292
ils-0.264
oph-0.263
Token he
Token ID339
Feature activation-1.67e+00

Pos logit contributions
itionally+0.098
 worried+0.097
 concerned+0.096
 frightened+0.094
lings+0.092
Neg logit contributions
der-0.071
oll-0.061
 av-0.057
oph-0.054
 mastered-0.052
Tokench
Token ID354
Feature activation-4.48e+00

Pos logit contributions
der+0.003
oll+0.003
oph+0.003
 av+0.003
 bead+0.003
Neg logit contributions
 scared-0.003
 worried-0.003
 frightened-0.003
 sad-0.003
 afraid-0.003
Tokenoo
Token ID2238
Feature activation-5.71e+00

Pos logit contributions
 worried+0.167
 scared+0.167
 frightened+0.161
 sad+0.155
 afraid+0.154
Neg logit contributions
der-0.218
oll-0.210
ier-0.192
 bead-0.190
oph-0.190
Token cho
Token ID1727
Feature activation-3.87e+00

Pos logit contributions
 worried+0.243
 scared+0.239
 frightened+0.236
 afraid+0.230
 sad+0.223
Neg logit contributions
der-0.253
oll-0.232
ier-0.219
ils-0.203
ently-0.203
Tokeno
Token ID78
Feature activation-7.66e+00

Pos logit contributions
 scared+0.134
 worried+0.132
 frightened+0.127
 sad+0.123
 afraid+0.120
Neg logit contributions
der-0.202
oll-0.197
ier-0.181
oph-0.179
 av-0.177
Token cho
Token ID1727
Feature activation-4.32e+00

Pos logit contributions
 worried+0.330
 scared+0.323
 frightened+0.316
 afraid+0.310
 sad+0.303
Neg logit contributions
der-0.327
oll-0.316
ier-0.293
aces-0.271
ently-0.270
Tokeno
Token ID78
Feature activation-7.91e+00

Pos logit contributions
 scared+0.153
 worried+0.153
 frightened+0.147
 sad+0.142
 afraid+0.139
Neg logit contributions
der-0.221
oll-0.217
ier-0.201
oph-0.193
aces-0.193
Token".
Token ID1911
Feature activation-7.96e-01

Pos logit contributions
 worried+0.342
 scared+0.333
 frightened+0.326
 afraid+0.321
 confused+0.314
Neg logit contributions
der-0.338
oll-0.325
ier-0.306
ently-0.279
aces-0.278
Token
Token ID198
Feature activation-4.28e-01

Pos logit contributions
 scared+0.029
 worried+0.029
 frightened+0.028
 afraid+0.028
 sad+0.026
Neg logit contributions
der-0.039
oll-0.036
oph-0.034
 av-0.034
cake-0.032
Token
Token ID198
Feature activation-4.19e-01

Pos logit contributions
 scared+0.017
 worried+0.017
 frightened+0.017
 sad+0.016
 afraid+0.016
Neg logit contributions
der-0.019
oll-0.018
oph-0.017
assy-0.017
 av-0.017
TokenThey
Token ID2990
Feature activation-1.70e+00

Pos logit contributions
 scared+0.021
 worried+0.021
 frightened+0.020
 sad+0.020
 afraid+0.020
Neg logit contributions
der-0.014
oll-0.014
oph-0.013
 bead-0.012
 av-0.012
Token both
Token ID1111
Feature activation-1.75e+00

Pos logit contributions
 scared+0.062
 worried+0.061
 frightened+0.060
 afraid+0.058
 sad+0.058
Neg logit contributions
der-0.079
oll-0.077
oph-0.072
 av-0.072
 mastered-0.071
Token"
Token ID1
Feature activation-1.11e+00

Pos logit contributions
 scared+0.056
 worried+0.055
 frightened+0.054
 sad+0.053
 afraid+0.052
Neg logit contributions
der-0.058
oll-0.056
oph-0.052
 av-0.052
 bead-0.051
TokenMe
Token ID5308
Feature activation-1.08e+00

Pos logit contributions
 scared+0.044
 worried+0.043
 frightened+0.042
 afraid+0.040
 sad+0.040
Neg logit contributions
der-0.052
oll-0.050
 av-0.047
oph-0.046
ier-0.045
Tokenow
Token ID322
Feature activation-7.92e+00

Pos logit contributions
 worried+0.039
 yell+0.038
 shout+0.038
 concerned+0.038
 frightened+0.038
Neg logit contributions
oll-0.047
der-0.046
ier-0.043
 av-0.043
cake-0.042
Token,
Token ID11
Feature activation-2.76e+00

Pos logit contributions
 worried+0.330
 scared+0.324
 frightened+0.324
 afraid+0.315
 panic+0.307
Neg logit contributions
der-0.340
oll-0.327
ier-0.314
oph-0.300
ils-0.296
Token me
Token ID502
Feature activation-3.42e+00

Pos logit contributions
itionally+0.104
 concerned+0.102
 puzzled+0.101
 worried+0.101
 frightened+0.100
Neg logit contributions
der-0.115
oll-0.114
oph-0.098
 weeks-0.097
 bead-0.097
Tokenow
Token ID322
Feature activation-7.90e+00

Pos logit contributions
 worried+0.131
 frightened+0.130
 scared+0.126
 concerned+0.126
 confused+0.125
Neg logit contributions
oll-0.146
der-0.142
ier-0.141
cake-0.127
 bead-0.126
Token!"
Token ID2474
Feature activation-2.13e+00

Pos logit contributions
 worried+0.353
 frightened+0.345
 scared+0.340
 panic+0.337
 afraid+0.335
Neg logit contributions
der-0.309
oll-0.293
ier-0.292
ils-0.264
oph-0.263
Token he
Token ID339
Feature activation-1.67e+00

Pos logit contributions
itionally+0.098
 worried+0.097
 concerned+0.096
 frightened+0.094
lings+0.092
Neg logit contributions
der-0.071
oll-0.061
 av-0.057
oph-0.054
 mastered-0.052
Token says
Token ID1139
Feature activation-3.19e+00

Pos logit contributions
 worry+0.052
 worried+0.050
 sad+0.046
 frightened+0.046
 concerned+0.045
Neg logit contributions
der-0.085
oll-0.079
 brace-0.076
 mastered-0.074
aces-0.073
Token.
Token ID13
Feature activation-1.55e+00

Pos logit contributions
 worried+0.144
 scared+0.144
 frightened+0.142
 afraid+0.137
 panic+0.136
Neg logit contributions
der-0.128
oll-0.117
 av-0.108
ier-0.107
 bead-0.103
Token He
Token ID679
Feature activation-1.97e+00

Pos logit contributions
 worry+0.060
 worried+0.060
 panic+0.058
 scared+0.058
 frightened+0.058
Neg logit contributions
der-0.074
oll-0.062
 bead-0.057
 av-0.055
cake-0.055
Token.
Token ID13
Feature activation-1.39e+00

Pos logit contributions
der+0.073
oll+0.072
oph+0.070
 av+0.068
assy+0.065
Neg logit contributions
 scared-0.057
 worried-0.054
 sad-0.053
 frightened-0.053
 shout-0.051
Token
Token ID198
Feature activation-1.05e+00

Pos logit contributions
 scared+0.044
 worried+0.044
 frightened+0.042
 sad+0.041
 shout+0.040
Neg logit contributions
der-0.076
oll-0.073
 av-0.068
 bead-0.068
oph-0.068
Token
Token ID198
Feature activation-1.07e+00

Pos logit contributions
 worried+0.039
 scared+0.038
 frightened+0.038
 sad+0.036
 afraid+0.036
Neg logit contributions
der-0.050
oll-0.048
 bead-0.045
oph-0.045
 av-0.044
Token"
Token ID1
Feature activation-8.74e-01

Pos logit contributions
 scared+0.047
 worried+0.046
 frightened+0.045
 sad+0.044
 shout+0.044
Neg logit contributions
der-0.045
oll-0.043
 bead-0.040
oph-0.040
 av-0.039
TokenMe
Token ID5308
Feature activation-9.17e-01

Pos logit contributions
 scared+0.033
 worried+0.033
 frightened+0.032
 sad+0.031
 afraid+0.030
Neg logit contributions
der-0.042
oll-0.041
oph-0.038
 av-0.038
 bead-0.038
Tokenow
Token ID322
Feature activation-7.74e+00

Pos logit contributions
 worried+0.034
 yell+0.033
 concerned+0.033
 shout+0.033
itionally+0.033
Neg logit contributions
oll-0.040
der-0.037
 av-0.035
ier-0.035
 bead-0.035
Token!"
Token ID2474
Feature activation-2.06e+00

Pos logit contributions
 worried+0.342
 frightened+0.334
 scared+0.332
 afraid+0.328
 confused+0.318
Neg logit contributions
der-0.307
oll-0.296
ier-0.282
oph-0.272
ils-0.271
Token the
Token ID262
Feature activation-1.90e-01

Pos logit contributions
 worried+0.093
 barking+0.089
 frightened+0.089
 scared+0.089
 afraid+0.088
Neg logit contributions
der-0.078
oll-0.069
 av-0.068
 bead-0.068
oph-0.065
Token cat
Token ID3797
Feature activation-2.35e+00

Pos logit contributions
 worried+0.007
 scared+0.007
 afraid+0.007
 frightened+0.007
 confused+0.007
Neg logit contributions
der-0.008
 bead-0.008
 av-0.007
oll-0.007
 beads-0.007
Token says
Token ID1139
Feature activation-3.10e+00

Pos logit contributions
 worry+0.071
 worried+0.067
 panic+0.063
 frightened+0.063
 sad+0.062
Neg logit contributions
der-0.127
oll-0.115
 av-0.107
 bead-0.105
ils-0.104
Token.
Token ID13
Feature activation-1.50e+00

Pos logit contributions
 worried+0.143
 scared+0.142
 frightened+0.141
 panic+0.136
 afraid+0.136
Neg logit contributions
der-0.125
oll-0.111
ier-0.103
 av-0.102
 bead-0.098
Token.
Token ID13
Feature activation-1.21e+00

Pos logit contributions
 worried+0.201
 scared+0.197
 frightened+0.196
 afraid+0.193
 worry+0.189
Neg logit contributions
der-0.157
oll-0.143
 av-0.130
ier-0.123
 bead-0.121
Token "
Token ID366
Feature activation-7.23e-01

Pos logit contributions
 worried+0.048
 scared+0.047
 frightened+0.046
 sad+0.046
 worry+0.045
Neg logit contributions
der-0.056
oll-0.049
 av-0.046
 bead-0.045
oph-0.044
TokenThe
Token ID464
Feature activation-1.05e+00

Pos logit contributions
 worried+0.034
 scared+0.034
 sad+0.033
 frightened+0.032
 afraid+0.031
Neg logit contributions
der-0.028
oll-0.026
 bead-0.025
oph-0.024
 av-0.023
Token snow
Token ID6729
Feature activation+4.79e-01

Pos logit contributions
itionally+0.048
 concerned+0.048
 worried+0.047
 confused+0.047
 scared+0.046
Neg logit contributions
der-0.038
oll-0.032
 weeks-0.031
 bead-0.030
 months-0.030
Token pl
Token ID458
Feature activation+8.59e-01

Pos logit contributions
der+0.022
oll+0.021
ier+0.019
 bead+0.019
 av+0.019
Neg logit contributions
 worried-0.019
 scared-0.019
 frightened-0.019
 sad-0.018
 afraid-0.018
Tokenow
Token ID322
Feature activation-7.72e+00

Pos logit contributions
oll+0.050
der+0.048
ier+0.044
aces+0.043
 bead+0.043
Neg logit contributions
 scared-0.027
 worried-0.027
 frightened-0.025
 sad-0.025
 afraid-0.024
Token is
Token ID318
Feature activation-2.10e+00

Pos logit contributions
 worried+0.357
 frightened+0.343
 afraid+0.342
 scared+0.342
 confused+0.336
Neg logit contributions
der-0.287
oll-0.272
ier-0.269
ils-0.258
ently-0.245
Token here
Token ID994
Feature activation-3.70e+00

Pos logit contributions
 worried+0.075
 scared+0.074
 frightened+0.073
 shout+0.073
 yell+0.071
Neg logit contributions
der-0.095
oll-0.089
 av-0.084
ier-0.083
 bead-0.081
Token!"
Token ID2474
Feature activation-1.42e+00

Pos logit contributions
 worried+0.168
 scared+0.165
 frightened+0.163
 afraid+0.159
 sad+0.158
Neg logit contributions
der-0.143
oll-0.130
ier-0.123
oph-0.122
 av-0.120
Token
Token ID198
Feature activation-1.01e+00

Pos logit contributions
 worried+0.049
 scared+0.049
 frightened+0.048
 sad+0.047
 afraid+0.045
Neg logit contributions
der-0.074
oll-0.068
 av-0.064
oph-0.062
 bead-0.062
Token
Token ID198
Feature activation-1.04e+00

Pos logit contributions
 worried+0.037
 scared+0.037
 frightened+0.036
 sad+0.035
 worry+0.034
Neg logit contributions
der-0.049
oll-0.047
 bead-0.044
 av-0.043
oph-0.042
Token shovel
Token ID33677
Feature activation+3.90e+00

Pos logit contributions
der+0.034
 av+0.034
oll+0.033
 bead+0.033
 beads+0.030
Neg logit contributions
 scared-0.033
 worried-0.032
 frightened-0.032
 afraid-0.031
 sad-0.030
Token on
Token ID319
Feature activation+1.24e+00

Pos logit contributions
der+0.181
oll+0.175
oph+0.170
 av+0.164
 mastered+0.162
Neg logit contributions
 scared-0.152
 worried-0.149
 frightened-0.143
 sad-0.142
 afraid-0.137
Token my
Token ID616
Feature activation-1.22e+00

Pos logit contributions
oll+0.070
der+0.067
oph+0.065
assy+0.063
ier+0.062
Neg logit contributions
 scared-0.040
 sad-0.037
 worried-0.037
 frightened-0.036
 shout-0.035
Token snow
Token ID6729
Feature activation-4.71e-01

Pos logit contributions
 scared+0.042
 worried+0.041
 frightened+0.040
 sad+0.040
 shout+0.038
Neg logit contributions
der-0.063
oll-0.061
oph-0.058
 av-0.056
ier-0.056
Token pl
Token ID458
Feature activation+1.99e-01

Pos logit contributions
 scared+0.016
 worried+0.015
 frightened+0.015
 sad+0.015
 shout+0.014
Neg logit contributions
der-0.024
oll-0.024
oph-0.023
 av-0.022
 bead-0.022
Tokenow
Token ID322
Feature activation-7.71e+00

Pos logit contributions
der+0.008
oll+0.007
oph+0.007
 av+0.007
 bead+0.007
Neg logit contributions
 scared-0.009
 worried-0.009
 frightened-0.009
 sad-0.009
 shout-0.008
Token.
Token ID13
Feature activation-1.06e+00

Pos logit contributions
 scared+0.308
 worried+0.303
 frightened+0.294
 sad+0.288
 afraid+0.282
Neg logit contributions
der-0.356
oll-0.342
oph-0.319
 av-0.318
 bead-0.312
Token I
Token ID314
Feature activation-1.18e+00

Pos logit contributions
 scared+0.038
 worried+0.038
 frightened+0.037
 sad+0.036
 afraid+0.034
Neg logit contributions
der-0.054
oll-0.052
 bead-0.047
 av-0.047
oph-0.047
Token can
Token ID460
Feature activation+5.48e-01

Pos logit contributions
 scared+0.039
 worried+0.039
 frightened+0.038
 sad+0.037
 barking+0.036
Neg logit contributions
der-0.059
oll-0.059
 bead-0.055
 mastered-0.054
 av-0.054
Token lift
Token ID10303
Feature activation-5.05e-02

Pos logit contributions
der+0.028
oll+0.027
oph+0.025
 av+0.025
 bead+0.025
Neg logit contributions
 scared-0.019
 worried-0.019
 frightened-0.018
 sad-0.018
 afraid-0.018
Token him
Token ID683
Feature activation-4.32e+00

Pos logit contributions
 scared+0.002
 worried+0.002
 frightened+0.002
 sad+0.002
 afraid+0.002
Neg logit contributions
der-0.002
oll-0.002
 av-0.002
 bead-0.002
oph-0.002
Token goes
Token ID2925
Feature activation+5.70e-01

Pos logit contributions
 scared+0.005
 worried+0.005
 sad+0.004
 frightened+0.004
 shout+0.004
Neg logit contributions
der-0.007
oll-0.007
oph-0.006
 av-0.006
ier-0.006
Token "
Token ID366
Feature activation+7.47e-02

Pos logit contributions
der+0.025
oll+0.024
 av+0.022
oph+0.022
 bead+0.022
Neg logit contributions
 scared-0.023
 worried-0.023
 frightened-0.023
 sad-0.022
 afraid-0.022
Tokench
Token ID354
Feature activation-4.48e+00

Pos logit contributions
der+0.003
oll+0.003
oph+0.003
 av+0.003
 bead+0.003
Neg logit contributions
 scared-0.003
 worried-0.003
 frightened-0.003
 sad-0.003
 afraid-0.003
Tokenoo
Token ID2238
Feature activation-5.71e+00

Pos logit contributions
 worried+0.167
 scared+0.167
 frightened+0.161
 sad+0.155
 afraid+0.154
Neg logit contributions
der-0.218
oll-0.210
ier-0.192
 bead-0.190
oph-0.190
Token cho
Token ID1727
Feature activation-3.87e+00

Pos logit contributions
 worried+0.243
 scared+0.239
 frightened+0.236
 afraid+0.230
 sad+0.223
Neg logit contributions
der-0.253
oll-0.232
ier-0.219
ils-0.203
ently-0.203
Tokeno
Token ID78
Feature activation-7.66e+00

Pos logit contributions
 scared+0.134
 worried+0.132
 frightened+0.127
 sad+0.123
 afraid+0.120
Neg logit contributions
der-0.202
oll-0.197
ier-0.181
oph-0.179
 av-0.177
Token cho
Token ID1727
Feature activation-4.32e+00

Pos logit contributions
 worried+0.330
 scared+0.323
 frightened+0.316
 afraid+0.310
 sad+0.303
Neg logit contributions
der-0.327
oll-0.316
ier-0.293
aces-0.271
ently-0.270
Tokeno
Token ID78
Feature activation-7.91e+00

Pos logit contributions
 scared+0.153
 worried+0.153
 frightened+0.147
 sad+0.142
 afraid+0.139
Neg logit contributions
der-0.221
oll-0.217
ier-0.201
oph-0.193
aces-0.193
Token".
Token ID1911
Feature activation-7.96e-01

Pos logit contributions
 worried+0.342
 scared+0.333
 frightened+0.326
 afraid+0.321
 confused+0.314
Neg logit contributions
der-0.338
oll-0.325
ier-0.306
ently-0.279
aces-0.278
Token
Token ID198
Feature activation-4.28e-01

Pos logit contributions
 scared+0.029
 worried+0.029
 frightened+0.028
 afraid+0.028
 sad+0.026
Neg logit contributions
der-0.039
oll-0.036
oph-0.034
 av-0.034
cake-0.032
Token
Token ID198
Feature activation-4.19e-01

Pos logit contributions
 scared+0.017
 worried+0.017
 frightened+0.017
 sad+0.016
 afraid+0.016
Neg logit contributions
der-0.019
oll-0.018
oph-0.017
assy-0.017
 av-0.017
Tokenpa
Token ID8957
Feature activation-6.37e-01

Pos logit contributions
der+0.002
oll+0.002
oph+0.002
reating+0.002
 av+0.002
Neg logit contributions
 scared-0.003
 frightened-0.003
 worried-0.003
 sad-0.003
 afraid-0.003
Token carry
Token ID3283
Feature activation+1.96e-01

Pos logit contributions
 worried+0.027
 scared+0.027
 frightened+0.027
 afraid+0.025
 sad+0.025
Neg logit contributions
der-0.025
oll-0.025
 av-0.024
 bead-0.023
ils-0.023
Token things
Token ID1243
Feature activation+1.83e-02

Pos logit contributions
der+0.008
oll+0.008
 av+0.007
oph+0.007
 bead+0.007
Neg logit contributions
 scared-0.009
 worried-0.009
 frightened-0.008
 sad-0.008
 afraid-0.008
Token and
Token ID290
Feature activation-7.47e-01

Pos logit contributions
der+0.001
oph+0.001
oll+0.001
 av+0.001
 mastered+0.001
Neg logit contributions
 scared-0.001
 worried-0.001
 sad-0.001
 frightened-0.001
 shout-0.001
Token pl
Token ID458
Feature activation+1.51e+00

Pos logit contributions
 scared+0.025
 worried+0.025
 frightened+0.024
 sad+0.023
 afraid+0.023
Neg logit contributions
der-0.039
oll-0.038
oph-0.036
 av-0.035
 bead-0.035
Tokenow
Token ID322
Feature activation-7.61e+00

Pos logit contributions
oll+0.072
der+0.070
 av+0.064
ier+0.063
ellery+0.063
Neg logit contributions
 scared-0.059
 worried-0.058
 sad-0.057
 frightened-0.057
 afraid-0.054
Token the
Token ID262
Feature activation-1.89e+00

Pos logit contributions
 scared+0.320
 worried+0.316
 frightened+0.307
 sad+0.299
 afraid+0.296
Neg logit contributions
der-0.336
oll-0.320
 av-0.296
oph-0.293
 bead-0.289
Token field
Token ID2214
Feature activation-9.87e-01

Pos logit contributions
 scared+0.076
 worried+0.075
 frightened+0.073
 sad+0.072
 afraid+0.072
Neg logit contributions
der-0.084
oll-0.074
 av-0.072
ils-0.069
oph-0.069
Token.
Token ID13
Feature activation-1.44e-01

Pos logit contributions
 scared+0.039
 worried+0.038
 frightened+0.037
 sad+0.037
 afraid+0.036
Neg logit contributions
der-0.046
oll-0.044
oph-0.042
 av-0.041
 bead-0.040
Token
Token ID198
Feature activation-1.83e-01

Pos logit contributions
 scared+0.006
 worried+0.006
 frightened+0.006
 sad+0.005
 afraid+0.005
Neg logit contributions
der-0.007
oll-0.006
oph-0.006
 av-0.006
assy-0.006
Token
Token ID198
Feature activation-2.53e-01

Pos logit contributions
 scared+0.007
 worried+0.007
 frightened+0.007
 sad+0.007
 afraid+0.007
Neg logit contributions
der-0.008
oll-0.008
oph-0.008
 bead-0.007
reating-0.007
Token"
Token ID1
Feature activation-8.23e-01

Pos logit contributions
 scared+0.042
 worried+0.041
 frightened+0.040
 sad+0.039
 afraid+0.038
Neg logit contributions
der-0.044
oll-0.042
oph-0.039
 av-0.039
 bead-0.038
TokenMe
Token ID5308
Feature activation-9.04e-01

Pos logit contributions
 scared+0.034
 worried+0.034
 sad+0.033
 frightened+0.033
 afraid+0.032
Neg logit contributions
der-0.036
oll-0.034
 bead-0.032
oph-0.032
 av-0.032
Tokenow
Token ID322
Feature activation-7.49e+00

Pos logit contributions
 worried+0.032
 yell+0.032
 panic+0.031
 shout+0.031
itionally+0.031
Neg logit contributions
oll-0.040
der-0.040
ier-0.035
 av-0.035
aces-0.034
Token,
Token ID11
Feature activation-2.41e+00

Pos logit contributions
 worried+0.299
 frightened+0.298
 scared+0.297
 afraid+0.286
 panic+0.280
Neg logit contributions
der-0.339
oll-0.319
ier-0.304
ils-0.293
 av-0.292
Token me
Token ID502
Feature activation-3.10e+00

Pos logit contributions
 worried+0.089
 scared+0.089
 frightened+0.089
 shout+0.084
 afraid+0.083
Neg logit contributions
der-0.110
oll-0.103
 bead-0.093
oph-0.092
 av-0.092
Tokenow
Token ID322
Feature activation-7.57e+00

Pos logit contributions
 frightened+0.109
 worried+0.107
 concerned+0.103
 scared+0.103
 panic+0.103
Neg logit contributions
oll-0.139
der-0.134
ier-0.131
cake-0.120
aces-0.119
Token!"
Token ID2474
Feature activation-1.94e+00

Pos logit contributions
 worried+0.313
 frightened+0.311
 scared+0.309
 panic+0.302
 afraid+0.298
Neg logit contributions
der-0.329
oll-0.306
ier-0.299
ils-0.278
ently-0.277
Token the
Token ID262
Feature activation-1.88e-01

Pos logit contributions
 worried+0.077
 scared+0.077
 frightened+0.076
 afraid+0.073
 sad+0.072
Neg logit contributions
der-0.086
oll-0.080
oph-0.077
 av-0.077
 bead-0.075
Token dolls
Token ID36062
Feature activation+9.57e-01

Pos logit contributions
 worried+0.007
 scared+0.007
 afraid+0.007
 frightened+0.007
 confused+0.007
Neg logit contributions
der-0.008
 bead-0.008
 beads-0.007
 av-0.007
 bracelet-0.007
Token say
Token ID910
Feature activation-3.12e+00

Pos logit contributions
der+0.055
oll+0.051
 av+0.049
 bead+0.049
 mastered+0.048
Neg logit contributions
 worried-0.025
 frightened-0.025
 scared-0.025
 barking-0.024
 afraid-0.024
Token.
Token ID13
Feature activation-1.66e+00

Pos logit contributions
 scared+0.138
 worried+0.136
 frightened+0.135
 afraid+0.129
 sad+0.128
Neg logit contributions
der-0.133
oll-0.124
 av-0.112
ier-0.112
 bead-0.111
Token "
Token ID366
Feature activation-3.56e-01

Pos logit contributions
 scared+0.008
 frightened+0.008
 worried+0.008
 panic+0.007
 afraid+0.007
Neg logit contributions
der-0.009
oll-0.008
 av-0.008
 bead-0.007
cake-0.007
TokenMe
Token ID5308
Feature activation-5.41e-01

Pos logit contributions
 scared+0.016
 worried+0.015
 frightened+0.015
 sad+0.015
 afraid+0.015
Neg logit contributions
der-0.015
oll-0.014
oph-0.014
 bead-0.013
 av-0.013
Tokenow
Token ID322
Feature activation-7.34e+00

Pos logit contributions
itionally+0.019
 yell+0.019
 worried+0.019
 shout+0.019
 concerned+0.018
Neg logit contributions
oll-0.025
der-0.023
ier-0.022
cake-0.021
 av-0.021
Token,
Token ID11
Feature activation-1.90e+00

Pos logit contributions
 worried+0.318
 scared+0.311
 frightened+0.311
 afraid+0.303
 confused+0.301
Neg logit contributions
der-0.292
oll-0.282
ier-0.268
ils-0.256
oph-0.249
Token me
Token ID502
Feature activation-2.94e+00

Pos logit contributions
 scared+0.063
 worried+0.063
 frightened+0.062
 confused+0.059
 afraid+0.058
Neg logit contributions
der-0.094
oll-0.092
 av-0.085
oph-0.085
 bead-0.083
Tokenow
Token ID322
Feature activation-7.53e+00

Pos logit contributions
 worried+0.101
 frightened+0.100
 scared+0.098
 confused+0.098
 concerned+0.095
Neg logit contributions
oll-0.136
der-0.130
ier-0.128
cake-0.118
 bead-0.113
Token,
Token ID11
Feature activation-1.98e+00

Pos logit contributions
 worried+0.339
 frightened+0.330
 scared+0.328
 confused+0.324
 puzzled+0.321
Neg logit contributions
der-0.284
oll-0.264
ier-0.260
ils-0.238
ently-0.229
Token Tom
Token ID4186
Feature activation-9.19e-01

Pos logit contributions
 worried+0.065
 scared+0.065
 frightened+0.064
 confused+0.063
 afraid+0.061
Neg logit contributions
oll-0.096
der-0.096
 av-0.086
 bead-0.084
oph-0.084
Token!
Token ID0
Feature activation-1.09e+00

Pos logit contributions
 worried+0.042
 scared+0.042
 frightened+0.042
 sad+0.041
 shout+0.040
Neg logit contributions
der-0.034
oll-0.033
ier-0.030
 bead-0.029
 av-0.029
Token Can
Token ID1680
Feature activation-1.34e+00

Pos logit contributions
 worried+0.046
 scared+0.045
 sad+0.044
 frightened+0.044
 confused+0.042
Neg logit contributions
der-0.048
oll-0.047
 bead-0.041
cake-0.040
oph-0.040
Token you
Token ID345
Feature activation-7.59e-01

Pos logit contributions
 scared+0.063
 worried+0.063
 frightened+0.062
 afraid+0.060
 fearful+0.059
Neg logit contributions
der-0.049
oll-0.046
 bead-0.044
 av-0.043
oph-0.041
Token.
Token ID13
Feature activation-1.11e+00

Pos logit contributions
 scared+0.124
 worried+0.123
 frightened+0.122
 afraid+0.117
 sad+0.116
Neg logit contributions
der-0.115
oll-0.108
 av-0.101
ier-0.098
 bead-0.096
Token
Token ID198
Feature activation-9.15e-01

Pos logit contributions
 scared+0.037
 worried+0.037
 frightened+0.035
 sad+0.035
 afraid+0.034
Neg logit contributions
der-0.057
oll-0.055
 av-0.052
 bead-0.052
oph-0.052
Token
Token ID198
Feature activation-9.81e-01

Pos logit contributions
 worried+0.033
 scared+0.033
 frightened+0.032
 sad+0.030
 afraid+0.030
Neg logit contributions
der-0.044
oll-0.042
oph-0.041
 bead-0.040
 av-0.040
Token"
Token ID1
Feature activation-8.23e-01

Pos logit contributions
 scared+0.042
 worried+0.041
 frightened+0.040
 sad+0.039
 afraid+0.038
Neg logit contributions
der-0.044
oll-0.042
oph-0.039
 av-0.039
 bead-0.038
TokenMe
Token ID5308
Feature activation-9.04e-01

Pos logit contributions
 scared+0.034
 worried+0.034
 sad+0.033
 frightened+0.033
 afraid+0.032
Neg logit contributions
der-0.036
oll-0.034
 bead-0.032
oph-0.032
 av-0.032
Tokenow
Token ID322
Feature activation-7.49e+00

Pos logit contributions
 worried+0.032
 yell+0.032
 panic+0.031
 shout+0.031
itionally+0.031
Neg logit contributions
oll-0.040
der-0.040
ier-0.035
 av-0.035
aces-0.034
Token,
Token ID11
Feature activation-2.41e+00

Pos logit contributions
 worried+0.299
 frightened+0.298
 scared+0.297
 afraid+0.286
 panic+0.280
Neg logit contributions
der-0.339
oll-0.319
ier-0.304
ils-0.293
 av-0.292
Token me
Token ID502
Feature activation-3.10e+00

Pos logit contributions
 worried+0.089
 scared+0.089
 frightened+0.089
 shout+0.084
 afraid+0.083
Neg logit contributions
der-0.110
oll-0.103
 bead-0.093
oph-0.092
 av-0.092
Tokenow
Token ID322
Feature activation-7.57e+00

Pos logit contributions
 frightened+0.109
 worried+0.107
 concerned+0.103
 scared+0.103
 panic+0.103
Neg logit contributions
oll-0.139
der-0.134
ier-0.131
cake-0.120
aces-0.119
Token!"
Token ID2474
Feature activation-1.94e+00

Pos logit contributions
 worried+0.313
 frightened+0.311
 scared+0.309
 panic+0.302
 afraid+0.298
Neg logit contributions
der-0.329
oll-0.306
ier-0.299
ils-0.278
ently-0.277
Token the
Token ID262
Feature activation-1.88e-01

Pos logit contributions
 worried+0.077
 scared+0.077
 frightened+0.076
 afraid+0.073
 sad+0.072
Neg logit contributions
der-0.086
oll-0.080
oph-0.077
 av-0.077
 bead-0.075
Tokenman
Token ID805
Feature activation-4.47e+00

Pos logit contributions
 scared+0.048
 worried+0.047
 frightened+0.045
 sad+0.044
 shout+0.043
Neg logit contributions
der-0.066
oll-0.065
oph-0.062
 av-0.061
 bead-0.059
Token.
Token ID13
Feature activation-1.32e+00

Pos logit contributions
 scared+0.196
 worried+0.195
 frightened+0.190
 sad+0.186
 afraid+0.184
Neg logit contributions
der-0.185
oll-0.178
 av-0.163
 bead-0.162
oph-0.161
Token The
Token ID383
Feature activation-1.81e+00

Pos logit contributions
 worried+0.048
 scared+0.048
 frightened+0.046
 afraid+0.044
 sad+0.044
Neg logit contributions
der-0.067
oll-0.061
 bead-0.058
 av-0.057
oph-0.056
Token snow
Token ID6729
Feature activation+1.22e-01

Pos logit contributions
itionally+0.091
 afraid+0.083
 frightened+0.083
 scared+0.082
 concerned+0.082
Neg logit contributions
der-0.061
 bead-0.054
 beads-0.052
oll-0.051
 weeks-0.047
Token pl
Token ID458
Feature activation+3.97e-01

Pos logit contributions
der+0.006
oll+0.005
 bead+0.005
ier+0.005
 av+0.005
Neg logit contributions
 scared-0.005
 worried-0.005
 frightened-0.005
 sad-0.005
 afraid-0.005
Tokenow
Token ID322
Feature activation-7.47e+00

Pos logit contributions
der+0.019
oll+0.019
ier+0.017
 bead+0.017
oph+0.017
Neg logit contributions
 scared-0.015
 worried-0.015
 frightened-0.015
 sad-0.015
 afraid-0.014
Token came
Token ID1625
Feature activation-3.25e+00

Pos logit contributions
 scared+0.374
 worried+0.373
 frightened+0.363
 sad+0.357
 afraid+0.356
Neg logit contributions
der-0.270
oll-0.250
ier-0.225
 av-0.224
oph-0.223
Token closer
Token ID5699
Feature activation-3.29e+00

Pos logit contributions
 worried+0.132
 scared+0.129
 frightened+0.127
 afraid+0.126
 sad+0.124
Neg logit contributions
der-0.137
ier-0.125
oll-0.122
 bead-0.117
 av-0.116
Token and
Token ID290
Feature activation-1.72e+00

Pos logit contributions
 scared+0.145
 worried+0.142
 frightened+0.139
 sad+0.136
 afraid+0.134
Neg logit contributions
der-0.139
oll-0.132
oph-0.123
 av-0.122
 bead-0.120
Token closer
Token ID5699
Feature activation-2.09e+00

Pos logit contributions
 worried+0.039
 frightened+0.038
 scared+0.038
 voice+0.037
 panic+0.037
Neg logit contributions
der-0.102
 av-0.093
oll-0.092
 bead-0.092
cake-0.090
Token.
Token ID13
Feature activation-7.80e-01

Pos logit contributions
 scared+0.095
 worried+0.095
 frightened+0.093
 sad+0.091
 afraid+0.089
Neg logit contributions
der-0.086
oll-0.074
ier-0.070
 av-0.069
oph-0.069
Token"
Token ID1
Feature activation+5.37e-02

Pos logit contributions
der+0.005
oll+0.005
oph+0.005
 bead+0.004
 mastered+0.004
Neg logit contributions
 scared-0.008
 worried-0.008
 frightened-0.008
 sad-0.007
 shout-0.007
TokenMe
Token ID5308
Feature activation-5.65e-01

Pos logit contributions
der+0.002
oll+0.002
oph+0.002
 av+0.002
 bead+0.002
Neg logit contributions
 scared-0.002
 worried-0.002
 frightened-0.002
 sad-0.002
 afraid-0.002
Tokenow
Token ID322
Feature activation-7.35e+00

Pos logit contributions
 worried+0.019
 yell+0.019
 shout+0.019
 concerned+0.018
 panic+0.018
Neg logit contributions
oll-0.025
der-0.025
ier-0.023
cake-0.022
 av-0.022
Token,
Token ID11
Feature activation-1.88e+00

Pos logit contributions
 worried+0.325
 frightened+0.312
 scared+0.312
 afraid+0.308
 shout+0.305
Neg logit contributions
der-0.296
oll-0.270
ier-0.270
ils-0.256
oph-0.249
Token me
Token ID502
Feature activation-2.83e+00

Pos logit contributions
 worried+0.076
 confused+0.074
 frightened+0.074
 scared+0.073
 puzzled+0.072
Neg logit contributions
der-0.070
oll-0.067
oph-0.066
reating-0.061
cake-0.061
Tokenow
Token ID322
Feature activation-7.46e+00

Pos logit contributions
 worried+0.101
 concerned+0.101
 confused+0.099
 puzzled+0.097
 frightened+0.097
Neg logit contributions
oll-0.116
ier-0.116
der-0.112
cake-0.102
aces-0.100
Token,
Token ID11
Feature activation-2.03e+00

Pos logit contributions
 worried+0.339
 panic+0.325
 frightened+0.324
 scared+0.321
 confused+0.320
Neg logit contributions
der-0.290
ier-0.268
oll-0.261
ils-0.244
ently-0.236
Token I
Token ID314
Feature activation-1.72e+00

Pos logit contributions
 worried+0.077
 confused+0.076
 puzzled+0.075
 frightened+0.074
 scared+0.073
Neg logit contributions
der-0.080
oll-0.079
oph-0.074
cake-0.073
reating-0.072
Token'm
Token ID1101
Feature activation-1.16e+00

Pos logit contributions
 concerned+0.072
 confused+0.072
 worried+0.071
 puzzled+0.071
 sad+0.070
Neg logit contributions
oll-0.061
 mastered-0.061
ier-0.054
 bead-0.054
ils-0.053
Token a
Token ID257
Feature activation-3.06e-01

Pos logit contributions
 voice+0.047
 worry+0.046
 panic+0.044
 whine+0.044
 trigger+0.044
Neg logit contributions
oll-0.041
 av-0.038
der-0.038
 solving-0.037
 such-0.037
Token cat
Token ID3797
Feature activation-1.42e+00

Pos logit contributions
 shout+0.010
 worried+0.010
 scared+0.009
 confused+0.009
 yell+0.009
Neg logit contributions
der-0.014
 bead-0.014
oll-0.013
 av-0.013
 bracelet-0.013
Token!"
Token ID2474
Feature activation-1.64e+00

Pos logit contributions
 scared+0.029
 worried+0.029
 frightened+0.028
 sad+0.027
 afraid+0.027
Neg logit contributions
der-0.033
oll-0.031
 av-0.029
oph-0.029
ier-0.028
Token
Token ID198
Feature activation-1.33e+00

Pos logit contributions
 worried+0.061
 scared+0.060
 frightened+0.059
 worry+0.057
 panic+0.057
Neg logit contributions
der-0.080
oll-0.071
 av-0.067
 bead-0.067
oph-0.066
Token
Token ID198
Feature activation-1.35e+00

Pos logit contributions
 worried+0.059
 frightened+0.058
 scared+0.057
 worry+0.057
 panic+0.055
Neg logit contributions
der-0.051
oll-0.047
 bead-0.045
ently-0.044
oph-0.042
Token"
Token ID1
Feature activation-7.99e-01

Pos logit contributions
 worried+0.064
 scared+0.064
 frightened+0.063
 sad+0.061
 afraid+0.060
Neg logit contributions
der-0.050
oll-0.045
 bead-0.044
oph-0.043
 av-0.041
TokenMe
Token ID5308
Feature activation-8.05e-01

Pos logit contributions
 worried+0.036
 scared+0.036
 sad+0.034
 frightened+0.034
 afraid+0.033
Neg logit contributions
der-0.033
oll-0.029
 bead-0.029
oph-0.028
cake-0.027
Tokenow
Token ID322
Feature activation-7.43e+00

Pos logit contributions
 panic+0.032
 worried+0.032
 yell+0.032
 War+0.032
itionally+0.031
Neg logit contributions
oll-0.029
der-0.028
 av-0.027
quet-0.027
cake-0.027
Token!"
Token ID2474
Feature activation-2.03e+00

Pos logit contributions
 worried+0.331
 frightened+0.325
 scared+0.322
 afraid+0.318
 panic+0.314
Neg logit contributions
der-0.288
oll-0.276
ier-0.273
ils-0.259
oph-0.255
Token the
Token ID262
Feature activation-1.66e-01

Pos logit contributions
 worried+0.085
 scared+0.082
 frightened+0.082
 worry+0.080
 afraid+0.079
Neg logit contributions
der-0.084
oll-0.075
 av-0.073
oph-0.072
 bead-0.072
Token cat
Token ID3797
Feature activation-2.40e+00

Pos logit contributions
 scared+0.006
 worried+0.006
 afraid+0.006
itionally+0.006
 frightened+0.006
Neg logit contributions
der-0.007
 av-0.006
cake-0.006
oll-0.006
 bracelet-0.006
Token says
Token ID1139
Feature activation-3.28e+00

Pos logit contributions
 worried+0.065
 worry+0.062
 frightened+0.062
 scared+0.060
 sad+0.060
Neg logit contributions
der-0.139
oll-0.129
 av-0.123
 bead-0.121
ils-0.119
Token.
Token ID13
Feature activation-1.68e+00

Pos logit contributions
 worried+0.151
 scared+0.150
 frightened+0.149
 panic+0.144
 afraid+0.144
Neg logit contributions
der-0.131
oll-0.119
ier-0.110
 av-0.107
 bead-0.105
Token It
Token ID632
Feature activation-1.77e+00

Pos logit contributions
 worried+0.041
 scared+0.039
 sad+0.039
 frightened+0.037
 worry+0.037
Neg logit contributions
der-0.062
oll-0.056
 av-0.054
 bead-0.052
oph-0.051
Token is
Token ID318
Feature activation-5.73e-01

Pos logit contributions
 worry+0.092
 worried+0.088
 concerned+0.087
itionally+0.084
 frightened+0.083
Neg logit contributions
der-0.049
 mastered-0.039
 continues-0.036
oll-0.035
 bead-0.035
Token a
Token ID257
Feature activation-3.00e-01

Pos logit contributions
 worried+0.023
 shout+0.023
 scared+0.023
 frightened+0.022
 yell+0.022
Neg logit contributions
der-0.024
oll-0.022
 av-0.020
ier-0.019
oph-0.019
Token snow
Token ID6729
Feature activation+9.06e-01

Pos logit contributions
 scared+0.010
 confused+0.010
 worried+0.010
 frightened+0.010
 afraid+0.010
Neg logit contributions
der-0.013
 bead-0.012
 av-0.012
 puzzle-0.012
 bracelet-0.012
Token pl
Token ID458
Feature activation+1.26e+00

Pos logit contributions
der+0.043
oll+0.040
ier+0.037
 bead+0.037
 av+0.036
Neg logit contributions
 scared-0.036
 worried-0.036
 frightened-0.035
 sad-0.034
 afraid-0.034
Tokenow
Token ID322
Feature activation-7.39e+00

Pos logit contributions
der+0.062
oll+0.061
oph+0.056
 av+0.055
 bead+0.055
Neg logit contributions
 scared-0.047
 worried-0.046
 frightened-0.045
 sad-0.044
 afraid-0.043
Token.
Token ID13
Feature activation-1.41e+00

Pos logit contributions
 worried+0.369
 afraid+0.364
 frightened+0.360
 scared+0.359
 panic+0.351
Neg logit contributions
der-0.256
ier-0.238
oll-0.232
ently-0.213
ils-0.195
Token It
Token ID632
Feature activation-1.98e+00

Pos logit contributions
 worried+0.045
 scared+0.045
 sad+0.043
 frightened+0.042
 afraid+0.041
Neg logit contributions
der-0.077
oll-0.072
 av-0.068
oph-0.067
 bead-0.066
Token is
Token ID318
Feature activation-5.83e-01

Pos logit contributions
 worry+0.094
 worried+0.093
 concerned+0.090
 frightened+0.090
 afraid+0.088
Neg logit contributions
der-0.069
oll-0.059
 bead-0.057
 mastered-0.052
ware-0.052
Token coming
Token ID2406
Feature activation-2.72e+00

Pos logit contributions
 worried+0.021
 shout+0.021
 scared+0.021
 frightened+0.020
 yell+0.020
Neg logit contributions
der-0.027
oll-0.025
 av-0.023
 bead-0.023
ier-0.022
Token to
Token ID284
Feature activation-1.87e+00

Pos logit contributions
 worried+0.150
 concerned+0.145
 afraid+0.144
itionally+0.144
 puzzled+0.144
Neg logit contributions
der-0.069
reating-0.058
ier-0.055
oll-0.053
 neatly-0.052
Token happened
Token ID3022
Feature activation+5.02e-01

Pos logit contributions
der+0.101
oll+0.099
 av+0.092
oph+0.091
 bead+0.089
Neg logit contributions
 scared-0.114
 worried-0.111
 frightened-0.108
 sad-0.105
 afraid-0.104
Token.
Token ID13
Feature activation-8.44e-01

Pos logit contributions
der+0.021
oll+0.019
 av+0.018
ier+0.018
 bead+0.017
Neg logit contributions
 scared-0.022
 worried-0.022
 frightened-0.021
 sad-0.021
 shout-0.021
Token The
Token ID383
Feature activation-1.48e+00

Pos logit contributions
 scared+0.028
 worried+0.027
 frightened+0.026
 sad+0.026
 afraid+0.025
Neg logit contributions
der-0.045
oll-0.044
oph-0.041
 av-0.041
 bead-0.040
Token snow
Token ID6729
Feature activation+3.37e-01

Pos logit contributions
itionally+0.064
 shout+0.063
 afraid+0.060
 yell+0.060
 barking+0.060
Neg logit contributions
der-0.058
oll-0.051
 av-0.050
 weeks-0.049
 bead-0.049
Token pl
Token ID458
Feature activation+7.79e-01

Pos logit contributions
der+0.015
oll+0.014
 bead+0.013
ier+0.013
 av+0.013
Neg logit contributions
 worried-0.014
 scared-0.014
 frightened-0.013
 sad-0.013
 afraid-0.013
Tokenow
Token ID322
Feature activation-7.39e+00

Pos logit contributions
der+0.041
oll+0.040
oph+0.037
ier+0.037
 av+0.037
Neg logit contributions
 scared-0.027
 worried-0.026
 frightened-0.025
 sad-0.025
 afraid-0.024
Token stopped
Token ID5025
Feature activation-1.41e+00

Pos logit contributions
 worried+0.356
 scared+0.352
 frightened+0.343
 afraid+0.342
 shout+0.339
Neg logit contributions
der-0.280
oll-0.252
ier-0.238
ils-0.231
oph-0.228
Token.
Token ID13
Feature activation-1.23e+00

Pos logit contributions
 scared+0.063
 worried+0.063
 frightened+0.062
 shout+0.061
 sad+0.060
Neg logit contributions
der-0.055
 av-0.048
oll-0.048
ier-0.047
 bead-0.045
Token The
Token ID383
Feature activation-1.60e+00

Pos logit contributions
 scared+0.038
 worried+0.038
 frightened+0.036
 sad+0.035
 afraid+0.034
Neg logit contributions
der-0.069
oll-0.066
oph-0.062
 av-0.062
 bead-0.061
Token driver
Token ID4639
Feature activation-4.08e+00

Pos logit contributions
itionally+0.071
 shout+0.069
 afraid+0.068
 worried+0.068
 scared+0.067
Neg logit contributions
der-0.060
oll-0.052
 weeks-0.051
 bead-0.051
 av-0.050
Token got
Token ID1392
Feature activation-2.34e-01

Pos logit contributions
 scared+0.153
 worried+0.151
 frightened+0.147
 sad+0.144
 afraid+0.142
Neg logit contributions
der-0.194
oll-0.186
oph-0.173
 av-0.172
 bead-0.170
Token "
Token ID366
Feature activation-3.05e-01

Pos logit contributions
der+0.004
oll+0.004
 av+0.003
 bead+0.003
ier+0.003
Neg logit contributions
 scared-0.003
 worried-0.003
 frightened-0.003
 sad-0.003
 shout-0.003
TokenMe
Token ID5308
Feature activation-5.23e-01

Pos logit contributions
 scared+0.013
 worried+0.013
 frightened+0.012
 sad+0.012
 afraid+0.012
Neg logit contributions
der-0.013
oll-0.013
oph-0.012
 av-0.012
 bead-0.012
Tokenow
Token ID322
Feature activation-7.22e+00

Pos logit contributions
 yell+0.018
 shout+0.018
 panic+0.017
 worried+0.017
itionally+0.017
Neg logit contributions
oll-0.025
der-0.023
ier-0.022
cake-0.021
 av-0.021
Token,
Token ID11
Feature activation-1.88e+00

Pos logit contributions
 worried+0.303
 scared+0.300
 frightened+0.296
 afraid+0.286
 shout+0.282
Neg logit contributions
der-0.314
oll-0.295
ier-0.278
ils-0.270
 av-0.262
Token me
Token ID502
Feature activation-2.85e+00

Pos logit contributions
 worried+0.066
 scared+0.066
 frightened+0.065
 afraid+0.061
 confused+0.061
Neg logit contributions
der-0.090
oll-0.087
 av-0.081
 bead-0.080
oph-0.080
Tokenow
Token ID322
Feature activation-7.38e+00

Pos logit contributions
 worried+0.091
 frightened+0.089
 scared+0.088
 confused+0.086
 sad+0.085
Neg logit contributions
oll-0.141
der-0.138
ier-0.134
cake-0.124
aces-0.123
Token,
Token ID11
Feature activation-2.19e+00

Pos logit contributions
 worried+0.330
 scared+0.322
 frightened+0.321
 afraid+0.311
 panic+0.311
Neg logit contributions
der-0.299
oll-0.272
ier-0.268
ils-0.250
ently-0.240
Token I
Token ID314
Feature activation-1.85e+00

Pos logit contributions
 worried+0.075
 scared+0.075
 frightened+0.073
 confused+0.070
 afraid+0.070
Neg logit contributions
der-0.106
oll-0.103
 av-0.095
 bead-0.093
oph-0.092
Token can
Token ID460
Feature activation+9.17e-02

Pos logit contributions
 sad+0.074
 worried+0.073
 confused+0.073
 frightened+0.072
 barking+0.072
Neg logit contributions
oll-0.074
der-0.069
 mastered-0.068
 bead-0.066
ils-0.065
Token help
Token ID1037
Feature activation+7.12e-01

Pos logit contributions
der+0.005
oll+0.005
 av+0.004
 bead+0.004
oph+0.004
Neg logit contributions
 scared-0.003
 worried-0.003
 frightened-0.003
 sad-0.003
 afraid-0.003
Token you
Token ID345
Feature activation-1.49e+00

Pos logit contributions
der+0.032
oll+0.029
 av+0.029
 bead+0.028
reating+0.027
Neg logit contributions
 worried-0.029
 scared-0.029
 frightened-0.028
 sad-0.027
 afraid-0.027
Token his
Token ID465
Feature activation+8.72e-02

Pos logit contributions
der+0.058
oll+0.057
oph+0.055
 av+0.051
 bead+0.051
Neg logit contributions
 scared-0.050
 worried-0.048
 sad-0.047
 frightened-0.046
 shout-0.045
Token subjects
Token ID7481
Feature activation+9.28e-01

Pos logit contributions
der+0.004
oll+0.004
 av+0.003
oph+0.003
 bead+0.003
Neg logit contributions
 scared-0.004
 worried-0.004
 frightened-0.004
 sad-0.004
 afraid-0.004
Token learned
Token ID4499
Feature activation+3.07e+00

Pos logit contributions
der+0.043
oll+0.042
oph+0.041
 av+0.039
reating+0.038
Neg logit contributions
 scared-0.039
 worried-0.037
 sad-0.036
 frightened-0.035
 afraid-0.034
Token an
Token ID281
Feature activation+2.17e+00

Pos logit contributions
oll+0.168
oph+0.162
atmeal+0.160
quet+0.157
 bead+0.155
Neg logit contributions
 scared-0.106
 sad-0.101
 worried-0.099
 afraid-0.091
 frightened-0.091
Token important
Token ID1593
Feature activation+1.87e+00

Pos logit contributions
der+0.096
oll+0.092
oph+0.092
 bead+0.088
 av+0.087
Neg logit contributions
 scared-0.091
 worried-0.089
 sad-0.089
 frightened-0.086
 afraid-0.085
Token lesson
Token ID11483
Feature activation+6.16e+00

Pos logit contributions
der+0.056
oll+0.055
oph+0.050
 av+0.049
 bead+0.046
Neg logit contributions
 scared-0.105
 worried-0.103
 frightened-0.101
 sad-0.100
 afraid-0.098
Token to
Token ID284
Feature activation+1.61e+00

Pos logit contributions
oph+0.328
 av+0.315
 bead+0.313
oll+0.310
atmeal+0.305
Neg logit contributions
 scared-0.199
 worried-0.188
 sad-0.179
 afraid-0.179
 barking-0.178
Token never
Token ID1239
Feature activation-6.71e-01

Pos logit contributions
der+0.086
oll+0.084
oph+0.078
 av+0.077
 bead+0.076
Neg logit contributions
 scared-0.055
 worried-0.054
 frightened-0.051
 sad-0.051
 shout-0.051
Token be
Token ID307
Feature activation-1.80e-01

Pos logit contributions
 scared+0.021
 worried+0.020
 frightened+0.020
 sad+0.019
 afraid+0.019
Neg logit contributions
der-0.037
oll-0.036
oph-0.034
 av-0.034
 bead-0.033
Token jealous
Token ID19354
Feature activation-3.34e+00

Pos logit contributions
 scared+0.004
 worried+0.004
 frightened+0.004
 shout+0.004
 sad+0.004
Neg logit contributions
der-0.011
oll-0.010
 av-0.010
oph-0.010
 bead-0.010
Token of
Token ID286
Feature activation+1.52e+00

Pos logit contributions
 scared+0.148
 sad+0.139
 worried+0.138
 frightened+0.137
 afraid+0.133
Neg logit contributions
oph-0.145
oll-0.140
quet-0.138
ils-0.136
atmeal-0.135
Token made
Token ID925
Feature activation-9.70e-01

Pos logit contributions
 scared+0.031
 worried+0.031
 frightened+0.028
 sad+0.028
 afraid+0.028
Neg logit contributions
der-0.047
oll-0.046
oph-0.045
quet-0.044
 av-0.043
Token sure
Token ID1654
Feature activation+7.64e-01

Pos logit contributions
 scared+0.041
 worried+0.040
 frightened+0.039
 sad+0.038
 afraid+0.037
Neg logit contributions
der-0.043
oll-0.041
oph-0.038
 av-0.038
 bead-0.038
Token to
Token ID284
Feature activation+8.30e-01

Pos logit contributions
oll+0.038
der+0.036
oph+0.035
quet+0.034
 bead+0.033
Neg logit contributions
 scared-0.030
 worried-0.029
 sad-0.029
 frightened-0.027
 afraid-0.026
Token eat
Token ID4483
Feature activation+8.87e-01

Pos logit contributions
der+0.047
oll+0.046
oph+0.044
 av+0.042
ier+0.042
Neg logit contributions
 scared-0.026
 worried-0.025
 shout-0.025
 sad-0.024
 yell-0.024
Token her
Token ID607
Feature activation+7.79e-01

Pos logit contributions
der+0.042
oll+0.042
oph+0.039
 mastered+0.039
assy+0.038
Neg logit contributions
 scared-0.034
 sad-0.034
 worried-0.033
 frightened-0.031
 angry-0.030
Token av
Token ID1196
Feature activation+7.97e+00

Pos logit contributions
der+0.029
oll+0.028
oph+0.027
ross+0.026
ier+0.025
Neg logit contributions
 scared-0.040
 sad-0.040
 worried-0.039
 frightened-0.038
 worry-0.038
Tokenoc
Token ID420
Feature activation-7.35e-01

Pos logit contributions
der+0.464
oll+0.430
ier+0.399
oph+0.393
ware+0.389
Neg logit contributions
 scared-0.237
 worried-0.225
 frightened-0.219
 sad-0.217
 afraid-0.202
Tokenados
Token ID22484
Feature activation+3.52e+00

Pos logit contributions
 voice+0.038
 worried+0.038
 scared+0.038
 sad+0.037
 frightened+0.037
Neg logit contributions
oll-0.020
oph-0.019
der-0.019
 bead-0.019
 mastered-0.019
Token.
Token ID13
Feature activation+1.00e-01

Pos logit contributions
oph+0.216
der+0.214
oll+0.212
ross+0.212
ellery+0.203
Neg logit contributions
 scared-0.087
 sad-0.076
 worried-0.074
 frightened-0.066
 confused-0.061
Token She
Token ID1375
Feature activation+5.86e-01

Pos logit contributions
ier+0.006
ils+0.006
der+0.006
oll+0.006
quet+0.006
Neg logit contributions
 sad-0.002
 scared-0.002
 worried-0.002
 frightened-0.002
 barking-0.002
Token learned
Token ID4499
Feature activation+2.80e+00

Pos logit contributions
quet+0.042
reating+0.040
igsaw+0.039
umm+0.038
ier+0.037
Neg logit contributions
 scared-0.015
 worried-0.014
 saw-0.014
 sad-0.013
 confused-0.012
Token were
Token ID547
Feature activation+8.94e-01

Pos logit contributions
 scared+0.024
 worried+0.023
 frightened+0.022
 sad+0.022
 shout+0.021
Neg logit contributions
der-0.041
oll-0.040
oph-0.037
 bead-0.037
 av-0.037
Token.
Token ID13
Feature activation+1.03e+00

Pos logit contributions
der+0.045
oll+0.043
oph+0.041
 bead+0.040
 av+0.038
Neg logit contributions
 scared-0.036
 worried-0.035
 frightened-0.033
 sad-0.033
 afraid-0.032
Token There
Token ID1318
Feature activation-2.21e-01

Pos logit contributions
der+0.048
oll+0.046
oph+0.043
 av+0.043
 bead+0.042
Neg logit contributions
 scared-0.041
 worried-0.040
 frightened-0.039
 sad-0.038
 afraid-0.038
Token was
Token ID373
Feature activation+6.81e-01

Pos logit contributions
 scared+0.009
 worried+0.009
 frightened+0.009
 afraid+0.009
 sad+0.009
Neg logit contributions
der-0.009
oll-0.009
 av-0.008
oph-0.008
 bead-0.008
Token one
Token ID530
Feature activation-1.37e+00

Pos logit contributions
der+0.034
oll+0.033
oph+0.031
 av+0.030
 bead+0.030
Neg logit contributions
 scared-0.026
 worried-0.025
 frightened-0.024
 sad-0.024
 afraid-0.023
Token gard
Token ID16985
Feature activation+6.45e+00

Pos logit contributions
 scared+0.066
 worried+0.065
 frightened+0.064
 sad+0.063
 afraid+0.062
Neg logit contributions
der-0.052
oll-0.050
oph-0.045
 av-0.045
 bead-0.044
Tokenener
Token ID877
Feature activation+8.80e-01

Pos logit contributions
der+0.271
oll+0.261
ier+0.248
ellery+0.242
 av+0.238
Neg logit contributions
 scared-0.284
 shout-0.275
 sad-0.273
 frightened-0.272
 worried-0.269
Token who
Token ID508
Feature activation-2.54e+00

Pos logit contributions
der+0.037
oll+0.036
 av+0.034
oph+0.032
 bead+0.032
Neg logit contributions
 scared-0.037
 worried-0.036
 frightened-0.036
 sad-0.035
 afraid-0.035
Token was
Token ID373
Feature activation+1.26e+00

Pos logit contributions
 scared+0.096
 worried+0.093
 frightened+0.090
 sad+0.087
 afraid+0.085
Neg logit contributions
der-0.128
oll-0.122
oph-0.115
 av-0.113
 bead-0.112
Token very
Token ID845
Feature activation-2.93e-01

Pos logit contributions
der+0.061
oll+0.059
oph+0.056
 av+0.054
 bead+0.054
Neg logit contributions
 scared-0.049
 worried-0.048
 frightened-0.046
 sad-0.046
 afraid-0.045
Token patient
Token ID5827
Feature activation+2.13e+00

Pos logit contributions
 scared+0.009
 worried+0.008
 frightened+0.008
 shout+0.008
 barking+0.008
Neg logit contributions
der-0.016
oll-0.015
 av-0.015
oph-0.014
ier-0.014
Token
Token ID198
Feature activation+9.18e-01

Pos logit contributions
der+0.065
oll+0.062
 av+0.059
oph+0.059
ier+0.058
Neg logit contributions
 scared-0.026
 worried-0.025
 frightened-0.024
 sad-0.023
 shout-0.022
TokenShe
Token ID3347
Feature activation+9.67e-01

Pos logit contributions
der+0.046
oll+0.043
ier+0.042
 av+0.042
ware+0.041
Neg logit contributions
 scared-0.035
 worried-0.034
 frightened-0.033
 shout-0.032
 sad-0.032
Token learned
Token ID4499
Feature activation+3.33e+00

Pos logit contributions
der+0.056
reating+0.056
 av+0.054
ier+0.054
umm+0.052
Neg logit contributions
 scared-0.032
 worried-0.030
 sad-0.028
 frightened-0.028
 confused-0.026
Token an
Token ID281
Feature activation+2.40e+00

Pos logit contributions
atmeal+0.185
oll+0.185
umm+0.181
 bead+0.179
ier+0.176
Neg logit contributions
 scared-0.109
 worried-0.104
 sad-0.103
 frightened-0.093
 voice-0.091
Token important
Token ID1593
Feature activation+2.07e+00

Pos logit contributions
der+0.112
ware+0.109
oph+0.108
 bead+0.108
oll+0.106
Neg logit contributions
 scared-0.093
 sad-0.092
 worried-0.090
 frightened-0.088
 afraid-0.085
Token lesson
Token ID11483
Feature activation+6.50e+00

Pos logit contributions
der+0.062
oll+0.059
 av+0.053
oph+0.052
 bead+0.050
Neg logit contributions
 scared-0.118
 worried-0.117
 frightened-0.114
 sad-0.113
 afraid-0.111
Token that
Token ID326
Feature activation-1.11e+00

Pos logit contributions
atmeal+0.351
 av+0.349
 bead+0.344
oph+0.336
oll+0.336
Neg logit contributions
 scared-0.207
 worried-0.199
 afraid-0.187
 barking-0.182
 sad-0.179
Token day
Token ID1110
Feature activation+9.45e-01

Pos logit contributions
 scared+0.040
 worried+0.039
 sad+0.037
 frightened+0.037
 afraid+0.035
Neg logit contributions
der-0.059
oll-0.057
oph-0.054
 av-0.053
ier-0.052
Token.
Token ID13
Feature activation+1.12e+00

Pos logit contributions
 av+0.050
oph+0.049
der+0.049
 bead+0.049
oll+0.048
Neg logit contributions
 scared-0.032
 worried-0.031
 afraid-0.028
 sad-0.028
 frightened-0.028
Token Never
Token ID7236
Feature activation-3.27e-01

Pos logit contributions
der+0.060
ils+0.059
ier+0.059
oll+0.059
 av+0.057
Neg logit contributions
 scared-0.034
 worried-0.033
 frightened-0.031
 sad-0.031
 afraid-0.031
Token leave
Token ID2666
Feature activation-1.61e-01

Pos logit contributions
 scared+0.010
 worried+0.010
 frightened+0.009
 sad+0.009
 afraid+0.009
Neg logit contributions
der-0.018
oll-0.018
oph-0.017
 av-0.017
 bead-0.017
Token She
Token ID1375
Feature activation+6.69e-01

Pos logit contributions
der+0.028
oll+0.027
 av+0.025
oph+0.025
 bead+0.025
Neg logit contributions
 scared-0.019
 worried-0.018
 frightened-0.018
 sad-0.017
 afraid-0.017
Token was
Token ID373
Feature activation+3.00e-01

Pos logit contributions
der+0.040
reating+0.039
oll+0.039
ier+0.038
quet+0.038
Neg logit contributions
 scared-0.021
 worried-0.019
 sad-0.018
 frightened-0.018
 confused-0.017
Token proud
Token ID6613
Feature activation+6.16e-01

Pos logit contributions
oll+0.014
der+0.014
 av+0.013
 mastered+0.012
 bead+0.012
Neg logit contributions
 shout-0.010
 scared-0.010
 yell-0.010
 worried-0.010
 frightened-0.010
Token of
Token ID286
Feature activation+2.49e+00

Pos logit contributions
oll+0.025
der+0.024
oph+0.024
ils+0.024
ently+0.023
Neg logit contributions
 scared-0.029
 worried-0.028
 frightened-0.027
 sad-0.026
 afraid-0.026
Token how
Token ID703
Feature activation+1.21e+00

Pos logit contributions
oph+0.123
oll+0.123
der+0.123
cake+0.117
ier+0.115
Neg logit contributions
 scared-0.094
 worried-0.093
 sad-0.090
 afraid-0.086
 barking-0.085
Token neatly
Token ID29776
Feature activation+6.22e+00

Pos logit contributions
der+0.060
oll+0.058
oph+0.054
 bead+0.053
ier+0.052
Neg logit contributions
 scared-0.047
 worried-0.046
 frightened-0.045
 sad-0.044
 afraid-0.043
Token she
Token ID673
Feature activation+1.96e+00

Pos logit contributions
oll+0.344
der+0.337
reating+0.329
ests+0.322
assy+0.318
Neg logit contributions
 scared-0.217
 worried-0.199
 sad-0.195
 frightened-0.187
 confused-0.175
Token was
Token ID373
Feature activation+7.73e-01

Pos logit contributions
der+0.131
reating+0.126
ently+0.117
igsaw+0.115
quet+0.115
Neg logit contributions
 scared-0.060
 worried-0.052
 frightened-0.051
 saw-0.049
 afraid-0.044
Token hanging
Token ID10938
Feature activation-1.58e-01

Pos logit contributions
der+0.041
oph+0.038
oll+0.037
 bead+0.035
umm+0.034
Neg logit contributions
 scared-0.031
 worried-0.030
 frightened-0.028
 sad-0.028
 afraid-0.028
Token the
Token ID262
Feature activation-6.26e-01

Pos logit contributions
 scared+0.007
 worried+0.007
 sad+0.006
 frightened+0.006
 afraid+0.006
Neg logit contributions
der-0.007
oll-0.007
oph-0.007
ils-0.006
assy-0.006
Token flower
Token ID15061
Feature activation-6.37e-02

Pos logit contributions
 scared+0.026
 worried+0.024
 frightened+0.024
 sad+0.024
 afraid+0.023
Neg logit contributions
der-0.030
oll-0.029
oph-0.027
reating-0.026
 mastered-0.026
Token toys
Token ID14958
Feature activation+8.98e-01

Pos logit contributions
 scared+0.031
 worried+0.030
 frightened+0.030
 afraid+0.029
 sad+0.029
Neg logit contributions
der-0.026
oll-0.025
 av-0.024
 bead-0.023
oph-0.022
Token,
Token ID11
Feature activation-7.10e-01

Pos logit contributions
der+0.046
oll+0.045
oph+0.045
ellery+0.041
 mastered+0.041
Neg logit contributions
 scared-0.033
 worried-0.032
 sad-0.031
 frightened-0.030
 afraid-0.028
Token but
Token ID475
Feature activation-8.98e-01

Pos logit contributions
 scared+0.023
 worried+0.023
 frightened+0.022
 sad+0.021
 afraid+0.021
Neg logit contributions
der-0.038
oll-0.036
 av-0.034
oph-0.034
 bead-0.034
Token not
Token ID407
Feature activation-2.14e+00

Pos logit contributions
 scared+0.036
 worried+0.036
 sad+0.034
 frightened+0.034
 afraid+0.033
Neg logit contributions
der-0.042
oll-0.041
oph-0.039
 av-0.037
 bead-0.037
Token all
Token ID477
Feature activation+1.39e+00

Pos logit contributions
 scared+0.088
 worried+0.086
 frightened+0.083
 sad+0.082
 afraid+0.080
Neg logit contributions
der-0.098
oll-0.094
oph-0.089
 av-0.087
 bead-0.086
Token of
Token ID286
Feature activation+2.19e+00

Pos logit contributions
der+0.061
oll+0.059
oph+0.055
 av+0.055
 bead+0.054
Neg logit contributions
 scared-0.059
 worried-0.057
 frightened-0.056
 sad-0.055
 afraid-0.054
Token them
Token ID606
Feature activation-2.10e+00

Pos logit contributions
der+0.113
oll+0.111
oph+0.105
 bead+0.102
 av+0.101
Neg logit contributions
 scared-0.075
 worried-0.074
 sad-0.070
 frightened-0.070
 afraid-0.068
Token.
Token ID13
Feature activation-4.58e-01

Pos logit contributions
 scared+0.086
 worried+0.084
 frightened+0.082
 sad+0.080
 afraid+0.079
Neg logit contributions
der-0.094
oll-0.090
 av-0.083
oph-0.083
 bead-0.082
Token She
Token ID1375
Feature activation-5.25e-03

Pos logit contributions
 scared+0.019
 worried+0.018
 frightened+0.018
 sad+0.017
 afraid+0.017
Neg logit contributions
der-0.021
oll-0.020
 av-0.018
oph-0.018
 bead-0.018
Token tells
Token ID4952
Feature activation+2.08e-02

Pos logit contributions
 scared+0.000
 worried+0.000
 frightened+0.000
 sad+0.000
 afraid+0.000
Neg logit contributions
der-0.000
oll-0.000
oph-0.000
 av-0.000
 bead-0.000
Token them
Token ID606
Feature activation-1.32e+00

Pos logit contributions
der+0.001
oll+0.001
oph+0.001
 av+0.001
reating+0.001
Neg logit contributions
 scared-0.001
 worried-0.001
 frightened-0.001
 afraid-0.001
 sad-0.001
Token was
Token ID373
Feature activation+1.53e+00

Pos logit contributions
der+0.073
oll+0.071
oph+0.067
 av+0.067
 bead+0.066
Neg logit contributions
 scared-0.049
 worried-0.048
 frightened-0.046
 sad-0.045
 afraid-0.044
Token so
Token ID523
Feature activation+4.45e-01

Pos logit contributions
oll+0.073
der+0.071
 av+0.065
 mastered+0.063
ier+0.063
Neg logit contributions
 shout-0.051
 scared-0.050
 yell-0.050
 worried-0.049
 frightened-0.049
Token excited
Token ID6568
Feature activation+2.32e+00

Pos logit contributions
oll+0.024
der+0.023
 av+0.022
 mastered+0.021
ier+0.021
Neg logit contributions
 barking-0.012
 shout-0.011
 yell-0.011
 voice-0.011
 worried-0.011
Token to
Token ID284
Feature activation+2.36e+00

Pos logit contributions
der+0.093
oll+0.088
oph+0.083
 av+0.082
 bead+0.079
Neg logit contributions
 scared-0.107
 worried-0.105
 frightened-0.103
 sad-0.101
 afraid-0.099
Token try
Token ID1949
Feature activation+1.92e+00

Pos logit contributions
der+0.152
oll+0.147
oph+0.139
 bead+0.137
 av+0.137
Neg logit contributions
 scared-0.059
 worried-0.055
 shout-0.054
 frightened-0.053
 sad-0.052
Token and
Token ID290
Feature activation+2.21e+00

Pos logit contributions
der+0.099
oll+0.099
oph+0.092
 bead+0.091
umm+0.088
Neg logit contributions
 scared-0.069
 worried-0.068
 sad-0.065
 frightened-0.063
 barking-0.062
Token play
Token ID711
Feature activation-3.36e-01

Pos logit contributions
der+0.132
oll+0.129
oph+0.121
ier+0.119
 bead+0.116
Neg logit contributions
 scared-0.068
 shout-0.064
 worried-0.063
 frightened-0.061
 yell-0.061
Token it
Token ID340
Feature activation+1.43e+00

Pos logit contributions
 scared+0.013
 worried+0.012
 frightened+0.012
 sad+0.012
 shout+0.012
Neg logit contributions
der-0.016
oll-0.016
oph-0.016
assy-0.015
 av-0.014
Token.
Token ID13
Feature activation+1.42e+00

Pos logit contributions
der+0.070
oll+0.069
oph+0.068
 av+0.065
assy+0.064
Neg logit contributions
 scared-0.056
 sad-0.052
 worried-0.052
 frightened-0.051
 shout-0.048
Token
Token ID198
Feature activation+1.06e+00

Pos logit contributions
der+0.069
oll+0.066
oph+0.064
 av+0.062
 bead+0.060
Neg logit contributions
 scared-0.053
 worried-0.053
 frightened-0.050
 sad-0.049
 afraid-0.049
Token
Token ID198
Feature activation+8.51e-01

Pos logit contributions
oll+0.049
der+0.049
oph+0.047
assy+0.044
 mastered+0.044
Neg logit contributions
 scared-0.039
 worried-0.039
 frightened-0.038
 sad-0.037
 afraid-0.036
Token and
Token ID290
Feature activation-1.22e+00

Pos logit contributions
 worried+0.046
 scared+0.046
 frightened+0.045
 afraid+0.044
 sad+0.042
Neg logit contributions
der-0.041
oll-0.039
 bead-0.035
 av-0.035
 mastered-0.035
Token the
Token ID262
Feature activation-2.27e-01

Pos logit contributions
 scared+0.043
 worried+0.043
 frightened+0.041
 sad+0.040
 afraid+0.039
Neg logit contributions
der-0.062
oll-0.060
oph-0.056
 av-0.056
 bead-0.055
Token fish
Token ID5916
Feature activation-2.62e+00

Pos logit contributions
 scared+0.009
 worried+0.009
 frightened+0.008
 afraid+0.008
 sad+0.008
Neg logit contributions
der-0.010
 av-0.010
 bead-0.009
oll-0.009
 beads-0.009
Token.
Token ID13
Feature activation-3.21e-01

Pos logit contributions
 scared+0.103
 worried+0.102
 frightened+0.099
 sad+0.096
 afraid+0.095
Neg logit contributions
der-0.122
oll-0.116
 av-0.108
 bead-0.107
oph-0.107
Token He
Token ID679
Feature activation-1.31e+00

Pos logit contributions
 scared+0.010
 worried+0.010
 frightened+0.009
 sad+0.009
 afraid+0.009
Neg logit contributions
der-0.018
oll-0.017
oph-0.016
 av-0.016
 bead-0.016
Token knew
Token ID2993
Feature activation+1.94e+00

Pos logit contributions
 scared+0.046
 worried+0.044
 frightened+0.042
 sad+0.041
 afraid+0.040
Neg logit contributions
der-0.069
oll-0.068
 av-0.065
reating-0.064
oph-0.064
Token that
Token ID326
Feature activation-2.00e+00

Pos logit contributions
oph+0.096
oll+0.093
der+0.093
assy+0.091
quet+0.090
Neg logit contributions
 scared-0.072
 worried-0.069
 sad-0.066
 afraid-0.065
 frightened-0.064
Token the
Token ID262
Feature activation+8.92e-01

Pos logit contributions
 scared+0.079
 worried+0.075
 frightened+0.073
 sad+0.072
 afraid+0.071
Neg logit contributions
der-0.096
oll-0.095
oph-0.093
assy-0.087
ier-0.087
Token spells
Token ID10377
Feature activation+3.15e+00

Pos logit contributions
der+0.032
 bead+0.031
 av+0.029
oll+0.029
 beads+0.027
Neg logit contributions
 scared-0.040
 worried-0.040
 frightened-0.039
 barking-0.039
 sad-0.038
Token were
Token ID547
Feature activation+7.92e-01

Pos logit contributions
der+0.152
oll+0.148
oph+0.145
 av+0.139
ellery+0.135
Neg logit contributions
 scared-0.123
 worried-0.118
 frightened-0.114
 sad-0.114
 afraid-0.109
Token not
Token ID407
Feature activation-2.36e+00

Pos logit contributions
der+0.039
oll+0.038
oph+0.037
 av+0.035
 bead+0.035
Neg logit contributions
 scared-0.030
 worried-0.029
 frightened-0.028
 sad-0.028
 afraid-0.027
Token stopped
Token ID5025
Feature activation-7.55e-01

Pos logit contributions
 scared+0.126
 worried+0.119
 sad+0.116
 frightened+0.115
 afraid+0.107
Neg logit contributions
oll-0.159
der-0.158
oph-0.156
 av-0.152
ellery-0.148
Token.
Token ID13
Feature activation+2.55e-01

Pos logit contributions
 scared+0.030
 worried+0.029
 frightened+0.029
 shout+0.029
 afraid+0.028
Neg logit contributions
der-0.033
oll-0.030
 av-0.030
ier-0.028
 bead-0.028
Token From
Token ID3574
Feature activation+2.89e+00

Pos logit contributions
der+0.013
oll+0.013
oph+0.012
 av+0.012
 bead+0.012
Neg logit contributions
 scared-0.009
 worried-0.008
 frightened-0.008
 sad-0.008
 afraid-0.008
Token that
Token ID326
Feature activation-2.51e+00

Pos logit contributions
der+0.113
oll+0.107
oph+0.099
 av+0.098
 bead+0.095
Neg logit contributions
 scared-0.134
 worried-0.132
 frightened-0.131
 sad-0.128
 shout-0.126
Token day
Token ID1110
Feature activation-1.56e-01

Pos logit contributions
 scared+0.103
 worried+0.101
 frightened+0.099
 sad+0.096
 afraid+0.095
Neg logit contributions
der-0.113
oll-0.109
oph-0.102
 av-0.101
 bead-0.100
Token on
Token ID319
Feature activation+2.59e+00

Pos logit contributions
 scared+0.006
 worried+0.006
 sad+0.006
 frightened+0.006
 afraid+0.005
Neg logit contributions
der-0.008
oph-0.008
ware-0.007
 av-0.007
reating-0.007
Token,
Token ID11
Feature activation+1.11e+00

Pos logit contributions
der+0.169
oll+0.160
reating+0.155
oph+0.153
ellery+0.151
Neg logit contributions
 scared-0.065
 worried-0.062
 sad-0.061
 frightened-0.056
 voice-0.053
Token the
Token ID262
Feature activation+1.15e+00

Pos logit contributions
der+0.066
oll+0.064
ier+0.061
ils+0.059
ently+0.059
Neg logit contributions
 scared-0.036
 sad-0.034
 worried-0.033
 frightened-0.032
 fearful-0.029
Token little
Token ID1310
Feature activation-1.85e+00

Pos logit contributions
der+0.061
oll+0.059
 av+0.057
 bead+0.055
oph+0.055
Neg logit contributions
 scared-0.036
 worried-0.034
 shout-0.034
 frightened-0.033
 barking-0.033
Token girl
Token ID2576
Feature activation+2.06e-02

Pos logit contributions
 scared+0.053
 worried+0.052
 frightened+0.050
 sad+0.049
 afraid+0.048
Neg logit contributions
der-0.106
oll-0.102
 av-0.098
oph-0.098
 bead-0.096
Token loved
Token ID6151
Feature activation+1.60e-01

Pos logit contributions
quet+0.001
reating+0.001
der+0.001
oph+0.001
 av+0.001
Neg logit contributions
 scared-0.001
 worried-0.001
 sad-0.001
 frightened-0.001
 afraid-0.001
Token spoon
Token ID24556
Feature activation+1.82e+00

Pos logit contributions
der+0.009
oll+0.009
 av+0.009
 bead+0.008
oph+0.008
Neg logit contributions
 scared-0.009
 worried-0.009
 frightened-0.009
 afraid-0.009
 shout-0.009
Token.
Token ID13
Feature activation+8.57e-01

Pos logit contributions
der+0.086
oll+0.083
oph+0.082
 av+0.079
 mastered+0.078
Neg logit contributions
 scared-0.070
 worried-0.068
 sad-0.066
 frightened-0.065
 shout-0.062
Token She
Token ID1375
Feature activation+1.30e+00

Pos logit contributions
der+0.043
oll+0.043
oph+0.040
 av+0.040
ier+0.039
Neg logit contributions
 scared-0.030
 worried-0.029
 frightened-0.028
 sad-0.028
 shout-0.027
Token tried
Token ID3088
Feature activation+7.19e-01

Pos logit contributions
reating+0.072
der+0.071
oll+0.070
ier+0.068
quet+0.065
Neg logit contributions
 scared-0.045
 worried-0.042
 frightened-0.040
 sad-0.039
 confused-0.037
Token the
Token ID262
Feature activation+1.67e+00

Pos logit contributions
oll+0.032
der+0.032
oph+0.030
ils+0.029
ier+0.028
Neg logit contributions
 scared-0.031
 worried-0.031
 sad-0.030
 frightened-0.029
 barking-0.028
Token cereal
Token ID33158
Feature activation+4.80e+00

Pos logit contributions
der+0.070
oll+0.067
oph+0.062
 av+0.062
 bead+0.061
Neg logit contributions
 scared-0.074
 worried-0.073
 frightened-0.071
 sad-0.070
 afraid-0.069
Token,
Token ID11
Feature activation+8.77e-01

Pos logit contributions
oll+0.266
der+0.264
oph+0.258
ellery+0.257
assy+0.255
Neg logit contributions
 scared-0.154
 sad-0.146
 worried-0.141
 frightened-0.133
 shout-0.125
Token and
Token ID290
Feature activation+1.46e+00

Pos logit contributions
oll+0.055
der+0.054
oph+0.053
ils+0.051
assy+0.051
Neg logit contributions
 scared-0.022
 worried-0.022
 sad-0.021
 frightened-0.020
 afraid-0.018
Token to
Token ID284
Feature activation+6.20e-01

Pos logit contributions
oll+0.076
der+0.076
oph+0.070
reating+0.069
lor+0.069
Neg logit contributions
 scared-0.057
 worried-0.054
 sad-0.052
 frightened-0.051
 confused-0.049
Token her
Token ID607
Feature activation+1.07e+00

Pos logit contributions
der+0.031
oll+0.031
oph+0.029
ier+0.028
ils+0.028
Neg logit contributions
 scared-0.023
 worried-0.022
 shout-0.022
 sad-0.022
 yell-0.022
Token surprise
Token ID5975
Feature activation+2.01e+00

Pos logit contributions
der+0.058
oll+0.055
oph+0.053
ier+0.053
reating+0.052
Neg logit contributions
 scared-0.037
 worried-0.035
 sad-0.034
 frightened-0.034
 voice-0.032
Token,
Token ID11
Feature activation+2.88e-01

Pos logit contributions
 scared+0.000
 worried+0.000
 frightened+0.000
 sad+0.000
 afraid+0.000
Neg logit contributions
oll-0.000
der-0.000
oph-0.000
 av-0.000
 bead-0.000
Token until
Token ID1566
Feature activation+8.10e-01

Pos logit contributions
der+0.013
oll+0.012
 av+0.011
oph+0.011
 mastered+0.011
Neg logit contributions
 scared-0.011
 worried-0.011
 frightened-0.011
 afraid-0.011
 sad-0.010
Token he
Token ID339
Feature activation-6.70e-02

Pos logit contributions
der+0.042
oll+0.041
 bead+0.038
oph+0.038
ils+0.038
Neg logit contributions
 scared-0.030
 worried-0.029
 sad-0.027
 frightened-0.027
 shout-0.026
Token was
Token ID373
Feature activation+4.02e-01

Pos logit contributions
 scared+0.002
 worried+0.002
 frightened+0.002
 sad+0.002
 afraid+0.002
Neg logit contributions
der-0.004
oll-0.003
 av-0.003
 bead-0.003
reating-0.003
Token out
Token ID503
Feature activation+7.47e-01

Pos logit contributions
der+0.022
oll+0.022
oph+0.020
 av+0.020
 bead+0.020
Neg logit contributions
 scared-0.013
 worried-0.012
 frightened-0.012
 sad-0.012
 afraid-0.011
Token of
Token ID286
Feature activation+5.19e-01

Pos logit contributions
der+0.025
oll+0.024
oph+0.022
 av+0.022
 bead+0.021
Neg logit contributions
 scared-0.040
 worried-0.039
 frightened-0.038
 sad-0.038
 afraid-0.037
Token breath
Token ID8033
Feature activation+2.28e-01

Pos logit contributions
der+0.027
oll+0.027
oph+0.025
 av+0.025
 bead+0.024
Neg logit contributions
 scared-0.018
 worried-0.018
 frightened-0.017
 sad-0.017
 afraid-0.016
Token.
Token ID13
Feature activation+7.75e-01

Pos logit contributions
der+0.011
oll+0.011
 av+0.010
oph+0.010
 bead+0.010
Neg logit contributions
 scared-0.009
 worried-0.009
 frightened-0.008
 sad-0.008
 afraid-0.008
Token
Token ID220
Feature activation-1.43e+00

Pos logit contributions
der+0.041
oll+0.039
oph+0.037
 av+0.037
 mastered+0.036
Neg logit contributions
 scared-0.026
 worried-0.025
 frightened-0.024
 sad-0.024
 afraid-0.023
Token
Token ID198
Feature activation+7.99e-01

Pos logit contributions
 scared+0.041
 worried+0.040
 frightened+0.038
 sad+0.038
 afraid+0.036
Neg logit contributions
der-0.082
oll-0.079
oph-0.076
 av-0.075
 bead-0.075
Token
Token ID198
Feature activation+6.97e-01

Pos logit contributions
der+0.043
oll+0.042
oph+0.040
 av+0.039
 bead+0.039
Neg logit contributions
 scared-0.025
 worried-0.025
 frightened-0.024
 sad-0.023
 afraid-0.023
Token so
Token ID523
Feature activation-7.17e-01

Pos logit contributions
oll+0.024
 insepar+0.024
 solving+0.023
 av+0.022
der+0.022
Neg logit contributions
 trigger-0.032
awk-0.031
Run-0.029
 yell-0.029
 boom-0.029
Token excited
Token ID6568
Feature activation+9.51e-01

Pos logit contributions
 voice+0.020
 barking+0.020
 yell+0.020
 shout+0.019
 worried+0.019
Neg logit contributions
oll-0.037
der-0.035
 av-0.033
 bead-0.032
 mastered-0.032
Token to
Token ID284
Feature activation+1.14e+00

Pos logit contributions
der+0.039
oll+0.037
oph+0.034
 av+0.034
 bead+0.034
Neg logit contributions
 scared-0.043
 worried-0.043
 frightened-0.042
 sad-0.041
 afraid-0.040
Token play
Token ID711
Feature activation-1.11e+00

Pos logit contributions
der+0.061
oll+0.059
 av+0.056
oph+0.056
 bead+0.056
Neg logit contributions
 scared-0.035
 worried-0.035
 frightened-0.033
 sad-0.032
 afraid-0.032
Token with
Token ID351
Feature activation-4.44e-01

Pos logit contributions
 scared+0.049
 worried+0.048
 frightened+0.047
 sad+0.046
 afraid+0.045
Neg logit contributions
der-0.047
oll-0.045
oph-0.042
 av-0.042
 bead-0.041
Token the
Token ID262
Feature activation+4.90e-02

Pos logit contributions
 scared+0.016
 worried+0.016
 frightened+0.015
 sad+0.015
 afraid+0.015
Neg logit contributions
der-0.023
oll-0.022
oph-0.021
 av-0.020
 bead-0.020
Token mel
Token ID7758
Feature activation+3.00e+00

Pos logit contributions
der+0.001
 av+0.001
oll+0.001
oph+0.001
 bead+0.001
Neg logit contributions
 scared-0.003
 worried-0.003
 afraid-0.003
 frightened-0.003
 shout-0.003
Tokenon
Token ID261
Feature activation+3.63e-01

Pos logit contributions
der+0.102
oll+0.082
oph+0.078
 mastered+0.074
ils+0.074
Neg logit contributions
 scared-0.162
 worried-0.156
 frightened-0.154
 sad-0.150
 shout-0.148
Token in
Token ID287
Feature activation-9.97e-01

Pos logit contributions
der+0.017
oll+0.017
oph+0.016
 av+0.016
 bead+0.015
Neg logit contributions
 scared-0.014
 worried-0.014
 frightened-0.013
 sad-0.013
 afraid-0.013
Token the
Token ID262
Feature activation-1.80e+00

Pos logit contributions
 worried+0.049
 scared+0.048
 frightened+0.047
 sad+0.047
 afraid+0.046
Neg logit contributions
der-0.035
oll-0.032
 bead-0.032
oph-0.032
 mastered-0.030
Token water
Token ID1660
Feature activation-2.74e+00

Pos logit contributions
 worry+0.070
 worried+0.068
 afraid+0.068
 scared+0.066
 frightened+0.065
Neg logit contributions
oll-0.073
der-0.073
 bead-0.069
ils-0.068
 mastered-0.068
Token in
Token ID287
Feature activation+1.24e+00

Pos logit contributions
der+0.052
oll+0.050
oph+0.046
 av+0.046
 bead+0.045
Neg logit contributions
 scared-0.048
 worried-0.047
 frightened-0.046
 sad-0.045
 afraid-0.044
Token the
Token ID262
Feature activation-1.11e-01

Pos logit contributions
der+0.056
oll+0.054
oph+0.050
 av+0.049
 bead+0.048
Neg logit contributions
 scared-0.053
 worried-0.051
 frightened-0.050
 sad-0.049
 afraid-0.048
Token sunshine
Token ID34488
Feature activation+1.34e-01

Pos logit contributions
 scared+0.005
 worried+0.004
 frightened+0.004
 sad+0.004
 afraid+0.004
Neg logit contributions
der-0.005
oll-0.005
oph-0.004
 av-0.004
 bead-0.004
Token.
Token ID13
Feature activation+1.79e+00

Pos logit contributions
der+0.006
oll+0.006
oph+0.006
 av+0.006
 bead+0.006
Neg logit contributions
 scared-0.005
 worried-0.005
 frightened-0.005
 sad-0.005
 afraid-0.005
Token One
Token ID1881
Feature activation-1.51e+00

Pos logit contributions
der+0.085
oll+0.081
oph+0.077
 av+0.076
 bead+0.073
Neg logit contributions
 worried-0.069
 scared-0.068
 frightened-0.065
 sad-0.064
 afraid-0.063
Token day
Token ID1110
Feature activation+4.73e-01

Pos logit contributions
 worried+0.063
 scared+0.063
 frightened+0.061
 shout+0.060
 afraid+0.059
Neg logit contributions
der-0.062
oll-0.061
oph-0.056
 av-0.056
 bead-0.056
Token,
Token ID11
Feature activation+1.38e+00

Pos logit contributions
der+0.024
oll+0.023
oph+0.022
 av+0.022
 bead+0.021
Neg logit contributions
 scared-0.017
 worried-0.016
 frightened-0.016
 sad-0.016
 afraid-0.015
Token Wh
Token ID854
Feature activation+5.94e-01

Pos logit contributions
der+0.055
oll+0.052
 av+0.051
oph+0.050
 bead+0.048
Neg logit contributions
 scared-0.060
 worried-0.060
 frightened-0.058
 sad-0.058
 afraid-0.057
Tokenisk
Token ID1984
Feature activation+1.82e+00

Pos logit contributions
der+0.022
oll+0.022
oph+0.020
 av+0.019
 bead+0.019
Neg logit contributions
 scared-0.029
 worried-0.029
 frightened-0.028
 sad-0.027
 afraid-0.027
Tokeners
Token ID364
Feature activation-1.88e-01

Pos logit contributions
der+0.082
oll+0.076
umm+0.075
oph+0.072
 av+0.069
Neg logit contributions
 scared-0.072
 worried-0.066
 frightened-0.064
 afraid-0.064
 barking-0.063
Token saw
Token ID2497
Feature activation-1.03e+00

Pos logit contributions
 scared+0.006
 worried+0.006
 frightened+0.006
 sad+0.006
 afraid+0.006
Neg logit contributions
der-0.009
oll-0.009
oph-0.009
 av-0.009
 mastered-0.008
Token sky
Token ID6766
Feature activation-1.89e+00

Pos logit contributions
 worried+0.001
 shout+0.001
 afraid+0.001
 scared+0.001
 barking+0.001
Neg logit contributions
der-0.001
 av-0.001
cake-0.001
oll-0.001
ier-0.001
Token and
Token ID290
Feature activation+3.10e-01

Pos logit contributions
 scared+0.081
 worried+0.080
 frightened+0.078
 sad+0.076
 afraid+0.075
Neg logit contributions
der-0.082
oll-0.077
oph-0.072
 av-0.071
 bead-0.070
Token started
Token ID2067
Feature activation+1.27e-01

Pos logit contributions
der+0.015
oph+0.014
 mastered+0.013
 neatly+0.013
 av+0.013
Neg logit contributions
 barking-0.009
 yell-0.009
 War-0.009
 shout-0.009
 voice-0.008
Token to
Token ID284
Feature activation+2.53e-01

Pos logit contributions
der+0.004
 av+0.003
 solving+0.003
 mastered+0.003
 neatly+0.003
Neg logit contributions
 scared-0.006
 frightened-0.006
 worried-0.006
 sad-0.006
 shout-0.006
Token cry
Token ID3960
Feature activation-2.01e+00

Pos logit contributions
der+0.013
 av+0.012
 solving+0.012
 memor+0.012
 mastered+0.011
Neg logit contributions
 worried-0.006
 War-0.006
 sad-0.006
 frightened-0.006
 scared-0.005
Token.
Token ID13
Feature activation+7.24e-01

Pos logit contributions
 scared+0.085
 worried+0.084
 frightened+0.082
 shout+0.080
 afraid+0.080
Neg logit contributions
der-0.089
oll-0.081
 av-0.074
ier-0.073
oph-0.073
Token
Token ID198
Feature activation+6.25e-01

Pos logit contributions
der+0.037
oll+0.034
oph+0.033
 av+0.032
cake+0.032
Neg logit contributions
 worried-0.025
 scared-0.024
 frightened-0.023
 sad-0.023
 afraid-0.022
Token
Token ID198
Feature activation+4.77e-01

Pos logit contributions
der+0.030
oll+0.029
oph+0.028
 av+0.027
ently+0.027
Neg logit contributions
 scared-0.022
 worried-0.022
 frightened-0.021
 sad-0.021
 afraid-0.020
TokenJust
Token ID5703
Feature activation-5.40e-01

Pos logit contributions
der+0.018
oll+0.017
oph+0.016
 av+0.016
 bead+0.016
Neg logit contributions
 scared-0.022
 worried-0.022
 frightened-0.021
 sad-0.021
 afraid-0.021
Token then
Token ID788
Feature activation-1.25e-01

Pos logit contributions
 scared+0.025
 worried+0.024
 frightened+0.024
 sad+0.023
 afraid+0.023
Neg logit contributions
der-0.021
oll-0.020
oph-0.018
 av-0.018
 bead-0.018
Token,
Token ID11
Feature activation+5.29e-01

Pos logit contributions
 worried+0.005
 frightened+0.005
 shout+0.005
 yell+0.005
 scared+0.005
Neg logit contributions
der-0.005
oll-0.004
 av-0.004
oph-0.004
 mastered-0.004
Token it
Token ID340
Feature activation-8.59e-01

Pos logit contributions
der+0.066
oll+0.063
 av+0.060
oph+0.060
 bead+0.059
Neg logit contributions
 scared-0.054
 worried-0.053
 frightened-0.052
 sad-0.051
 afraid-0.050
Token,
Token ID11
Feature activation+3.27e-01

Pos logit contributions
 scared+0.032
 worried+0.032
 frightened+0.031
 afraid+0.030
 sad+0.029
Neg logit contributions
der-0.041
oll-0.040
 av-0.037
oph-0.037
 bead-0.037
Token he
Token ID339
Feature activation-5.37e-01

Pos logit contributions
der+0.015
oll+0.015
oph+0.014
 av+0.014
 bead+0.013
Neg logit contributions
 scared-0.013
 worried-0.013
 frightened-0.012
 sad-0.012
 afraid-0.012
Token couldn
Token ID3521
Feature activation+1.08e+00

Pos logit contributions
 scared+0.018
 worried+0.017
 frightened+0.017
 sad+0.016
 afraid+0.016
Neg logit contributions
der-0.029
oll-0.028
oph-0.026
 av-0.026
reating-0.026
Token't
Token ID470
Feature activation-2.02e+00

Pos logit contributions
der+0.071
oph+0.069
oll+0.068
ils+0.068
ellery+0.067
Neg logit contributions
 scared-0.024
 sad-0.020
 frightened-0.020
 shout-0.020
 worried-0.020
Token.
Token ID13
Feature activation+9.15e-01

Pos logit contributions
 scared+0.055
 worried+0.049
 frightened+0.049
 shout+0.049
 sad+0.046
Neg logit contributions
der-0.126
oll-0.122
oph-0.120
 av-0.119
reating-0.116
Token He
Token ID679
Feature activation-2.10e-01

Pos logit contributions
der+0.050
oll+0.048
oph+0.045
 av+0.045
 bead+0.045
Neg logit contributions
 scared-0.029
 worried-0.029
 frightened-0.028
 sad-0.027
 afraid-0.026
Token asked
Token ID1965
Feature activation-2.09e+00

Pos logit contributions
 scared+0.008
 worried+0.007
 frightened+0.007
 sad+0.007
 afraid+0.007
Neg logit contributions
der-0.011
oll-0.011
oph-0.010
 av-0.010
reating-0.010
Token all
Token ID477
Feature activation+1.88e+00

Pos logit contributions
 scared+0.084
 worried+0.081
 sad+0.080
 frightened+0.078
 afraid+0.076
Neg logit contributions
der-0.100
oll-0.098
oph-0.094
 bead-0.089
assy-0.088
Token his
Token ID465
Feature activation+9.40e-01

Pos logit contributions
der+0.064
oll+0.061
 av+0.058
 bead+0.056
reating+0.055
Neg logit contributions
 scared-0.093
 worried-0.093
 frightened-0.091
 afraid-0.088
 sad-0.088
Token friends
Token ID2460
Feature activation-8.06e-02

Pos logit contributions
oll+0.033
der+0.033
 av+0.031
 bead+0.029
ils+0.028
Neg logit contributions
 worried-0.043
 scared-0.043
 shout-0.042
 frightened-0.042
 boom-0.042
Token a
Token ID257
Feature activation-5.98e-02

Pos logit contributions
 scared+0.030
 worried+0.030
 frightened+0.029
 sad+0.028
 afraid+0.028
Neg logit contributions
der-0.037
oll-0.036
oph-0.033
 av-0.033
 bead-0.033
Token boy
Token ID2933
Feature activation-1.07e+00

Pos logit contributions
 scared+0.002
 worried+0.002
 frightened+0.002
 shout+0.002
 afraid+0.002
Neg logit contributions
der-0.003
 av-0.003
 bead-0.003
oph-0.003
oll-0.003
Token in
Token ID287
Feature activation+1.90e-01

Pos logit contributions
 frightened+0.044
 worried+0.044
 scared+0.043
 afraid+0.042
 sad+0.042
Neg logit contributions
oll-0.039
der-0.038
 av-0.037
ellery-0.036
 solving-0.036
Token the
Token ID262
Feature activation-1.22e+00

Pos logit contributions
der+0.010
oll+0.009
oph+0.009
ier+0.009
 av+0.008
Neg logit contributions
 scared-0.007
 worried-0.007
 frightened-0.007
 sad-0.007
 afraid-0.006
Token park
Token ID3952
Feature activation-3.45e-01

Pos logit contributions
 scared+0.052
 worried+0.051
 frightened+0.050
 sad+0.049
 afraid+0.048
Neg logit contributions
der-0.054
oll-0.052
oph-0.048
 av-0.048
 bead-0.047
Token who
Token ID508
Feature activation-3.46e+00

Pos logit contributions
 scared+0.015
 worried+0.015
 frightened+0.015
 afraid+0.015
 sad+0.015
Neg logit contributions
der-0.013
oll-0.011
 av-0.011
 bead-0.011
oph-0.011
Token had
Token ID550
Feature activation+1.01e+00

Pos logit contributions
 scared+0.130
 worried+0.128
 frightened+0.124
 sad+0.122
 afraid+0.119
Neg logit contributions
der-0.165
oll-0.159
oph-0.150
 av-0.149
 bead-0.146
Token no
Token ID645
Feature activation-1.39e+00

Pos logit contributions
der+0.053
oll+0.052
oph+0.049
 av+0.048
 bead+0.047
Neg logit contributions
 scared-0.036
 worried-0.035
 frightened-0.033
 sad-0.033
 afraid-0.032
Token k
Token ID479
Feature activation+7.51e-01

Pos logit contributions
 scared+0.054
 worried+0.052
 sad+0.052
 frightened+0.051
 shout+0.050
Neg logit contributions
der-0.067
oll-0.066
oph-0.064
ier-0.060
ils-0.059
Tokenite
Token ID578
Feature activation+9.19e-01

Pos logit contributions
der+0.033
oll+0.031
oph+0.029
 av+0.028
ils+0.027
Neg logit contributions
 scared-0.033
 worried-0.032
 frightened-0.032
 sad-0.031
 afraid-0.030
Token.
Token ID13
Feature activation+9.86e-01

Pos logit contributions
der+0.044
oll+0.044
oph+0.043
 av+0.040
assy+0.039
Neg logit contributions
 scared-0.036
 worried-0.034
 sad-0.033
 frightened-0.033
 shout-0.032
Token.
Token ID13
Feature activation-5.32e-01

Pos logit contributions
der+0.102
oll+0.098
oph+0.092
 av+0.091
 bead+0.090
Neg logit contributions
 scared-0.096
 worried-0.094
 frightened-0.091
 sad-0.090
 afraid-0.088
Token They
Token ID1119
Feature activation-1.98e+00

Pos logit contributions
 worried+0.025
 scared+0.024
 worry+0.023
 sad+0.023
 frightened+0.023
Neg logit contributions
der-0.020
oll-0.015
 novels-0.015
oph-0.015
 mastered-0.015
Token wanted
Token ID2227
Feature activation-1.82e+00

Pos logit contributions
 scared+0.069
 worried+0.068
 frightened+0.065
 sad+0.064
 afraid+0.062
Neg logit contributions
der-0.101
oll-0.098
oph-0.092
 av-0.092
 bead-0.090
Token to
Token ID284
Feature activation+6.72e-01

Pos logit contributions
 scared+0.087
 worried+0.086
 frightened+0.084
 sad+0.083
 afraid+0.081
Neg logit contributions
der-0.069
oll-0.065
 av-0.060
oph-0.060
 bead-0.058
Token pet
Token ID4273
Feature activation-2.29e+00

Pos logit contributions
der+0.041
oll+0.039
oph+0.037
 av+0.037
 bead+0.037
Neg logit contributions
 scared-0.017
 worried-0.017
 frightened-0.016
 sad-0.016
 afraid-0.015
Token him
Token ID683
Feature activation-3.63e+00

Pos logit contributions
 worried+0.102
 scared+0.100
 frightened+0.099
 sad+0.098
 afraid+0.097
Neg logit contributions
der-0.089
 av-0.083
oll-0.079
reating-0.078
ils-0.076
Token.
Token ID13
Feature activation-7.85e-01

Pos logit contributions
 scared+0.145
 worried+0.144
 frightened+0.140
 sad+0.136
 afraid+0.134
Neg logit contributions
der-0.166
oll-0.159
 av-0.148
oph-0.147
 bead-0.145
Token They
Token ID1119
Feature activation-2.12e+00

Pos logit contributions
 worried+0.039
 sad+0.038
 scared+0.037
 frightened+0.037
 panic+0.037
Neg logit contributions
der-0.024
oph-0.018
oll-0.018
 novels-0.017
 Tues-0.017
Token forgot
Token ID16453
Feature activation+2.99e+00

Pos logit contributions
 scared+0.074
 worried+0.073
 frightened+0.071
 sad+0.069
 afraid+0.067
Neg logit contributions
der-0.108
oll-0.104
oph-0.098
 av-0.097
 bead-0.096
Token that
Token ID326
Feature activation-2.35e+00

Pos logit contributions
der+0.106
oll+0.106
oph+0.100
 bead+0.097
 av+0.094
Neg logit contributions
 scared-0.152
 worried-0.149
 sad-0.144
 frightened-0.143
 afraid-0.141
Token he
Token ID339
Feature activation-1.36e-01

Pos logit contributions
 scared+0.095
 worried+0.095
 frightened+0.093
 sad+0.090
 afraid+0.089
Neg logit contributions
der-0.103
oll-0.098
 av-0.092
 bead-0.090
oph-0.089
Token a
Token ID257
Feature activation+6.03e-02

Pos logit contributions
der+0.044
oll+0.042
oph+0.040
 av+0.039
 bead+0.039
Neg logit contributions
 scared-0.036
 worried-0.035
 frightened-0.034
 sad-0.033
 afraid-0.032
Token big
Token ID1263
Feature activation+3.51e-01

Pos logit contributions
oll+0.003
 av+0.003
 bead+0.003
 bracelet+0.002
der+0.002
Neg logit contributions
 scared-0.002
 frightened-0.002
 worried-0.002
 afraid-0.002
 shout-0.002
Token box
Token ID3091
Feature activation+3.12e+00

Pos logit contributions
oll+0.014
der+0.014
 av+0.014
 bead+0.014
oph+0.013
Neg logit contributions
 scared-0.015
 frightened-0.014
 worried-0.014
 afraid-0.014
 shout-0.013
Token of
Token ID286
Feature activation+6.12e-01

Pos logit contributions
der+0.118
oll+0.112
oph+0.104
 av+0.103
 bead+0.100
Neg logit contributions
 scared-0.152
 worried-0.150
 frightened-0.145
 sad-0.144
 afraid-0.141
Token seeds
Token ID11904
Feature activation+1.74e+00

Pos logit contributions
der+0.026
oll+0.024
oph+0.023
 av+0.023
 bead+0.022
Neg logit contributions
 scared-0.027
 worried-0.027
 frightened-0.026
 sad-0.026
 afraid-0.025
Token that
Token ID326
Feature activation-3.86e+00

Pos logit contributions
der+0.078
oll+0.075
oph+0.072
 av+0.069
 bead+0.068
Neg logit contributions
 scared-0.073
 worried-0.071
 frightened-0.068
 sad-0.068
 afraid-0.066
Token their
Token ID511
Feature activation-5.43e-01

Pos logit contributions
 scared+0.143
 worried+0.138
 frightened+0.134
 sad+0.131
 afraid+0.127
Neg logit contributions
der-0.196
oll-0.186
oph-0.176
 av-0.174
 bead-0.171
Token dad
Token ID9955
Feature activation-8.93e-01

Pos logit contributions
 scared+0.019
 worried+0.018
 frightened+0.018
 afraid+0.017
 sad+0.017
Neg logit contributions
der-0.027
oll-0.027
 av-0.025
oph-0.025
 bead-0.025
Token gave
Token ID2921
Feature activation-7.74e-01

Pos logit contributions
 scared+0.032
 worried+0.031
 frightened+0.030
 sad+0.030
 afraid+0.029
Neg logit contributions
der-0.046
oll-0.043
oph-0.041
 av-0.041
 bead-0.040
Token them
Token ID606
Feature activation-2.99e+00

Pos logit contributions
 scared+0.030
 worried+0.030
 frightened+0.029
 afraid+0.028
 sad+0.028
Neg logit contributions
der-0.035
oll-0.034
 av-0.033
ils-0.032
oph-0.031
Token.
Token ID13
Feature activation+2.38e-01

Pos logit contributions
 scared+0.131
 worried+0.128
 frightened+0.124
 sad+0.123
 afraid+0.119
Neg logit contributions
der-0.130
oll-0.124
oph-0.118
 bead-0.112
 av-0.112
Token her
Token ID607
Feature activation-9.61e-01

Pos logit contributions
 scared+0.083
 worried+0.082
 sad+0.079
 frightened+0.079
 afraid+0.076
Neg logit contributions
der-0.087
oll-0.085
oph-0.079
 bead-0.077
 av-0.076
Token house
Token ID2156
Feature activation-8.55e-01

Pos logit contributions
 scared+0.037
 worried+0.036
 frightened+0.035
 sad+0.034
 afraid+0.033
Neg logit contributions
der-0.046
oll-0.045
oph-0.042
 av-0.042
 bead-0.041
Token and
Token ID290
Feature activation-2.26e-02

Pos logit contributions
 scared+0.033
 worried+0.033
 sad+0.032
 frightened+0.031
 afraid+0.030
Neg logit contributions
der-0.041
oll-0.041
oph-0.040
 av-0.038
 bead-0.038
Token not
Token ID407
Feature activation-2.57e+00

Pos logit contributions
 scared+0.001
 worried+0.001
 frightened+0.001
 sad+0.001
 afraid+0.001
Neg logit contributions
der-0.001
oll-0.001
oph-0.001
 av-0.001
 bead-0.001
Token wander
Token ID27776
Feature activation+5.85e-02

Pos logit contributions
 worried+0.082
 scared+0.081
 frightened+0.079
 sad+0.076
 afraid+0.074
Neg logit contributions
der-0.133
oll-0.125
 av-0.122
 bead-0.119
oph-0.115
Token too
Token ID1165
Feature activation-4.35e+00

Pos logit contributions
der+0.003
oll+0.003
 av+0.002
ier+0.002
 bead+0.002
Neg logit contributions
 scared-0.002
 worried-0.002
 frightened-0.002
 shout-0.002
 sad-0.002
Token far
Token ID1290
Feature activation-1.30e+00

Pos logit contributions
 worried+0.151
 scared+0.151
 shout+0.147
 frightened+0.146
 yell+0.143
Neg logit contributions
der-0.211
oll-0.198
 av-0.189
ier-0.185
 bead-0.185
Token away
Token ID1497
Feature activation-6.63e-01

Pos logit contributions
 scared+0.049
 worried+0.048
 frightened+0.047
 sad+0.046
 afraid+0.045
Neg logit contributions
der-0.063
oll-0.060
 av-0.056
oph-0.056
 bead-0.055
Token.
Token ID13
Feature activation+5.51e-01

Pos logit contributions
 scared+0.025
 worried+0.023
 sad+0.023
 frightened+0.022
 afraid+0.022
Neg logit contributions
oll-0.035
oph-0.035
der-0.033
assy-0.032
cake-0.032
Token<|endoftext|>
Token ID50256
Feature activation-3.06e-01

Pos logit contributions
der+0.030
oll+0.029
oph+0.028
 av+0.027
 bead+0.027
Neg logit contributions
 scared-0.017
 worried-0.017
 frightened-0.016
 sad-0.016
 afraid-0.016
TokenOnce
Token ID7454
Feature activation-2.27e+00

Pos logit contributions
 scared+0.009
 worried+0.008
 frightened+0.008
 sad+0.008
 afraid+0.007
Neg logit contributions
der-0.018
oll-0.017
oph-0.016
 av-0.016
 bead-0.016
Token tall
Token ID7331
Feature activation-1.41e+00

Pos logit contributions
 scared+0.033
 worried+0.033
 frightened+0.032
 sad+0.031
 fearful+0.030
Neg logit contributions
der-0.036
oll-0.035
oph-0.034
ellery-0.032
ils-0.032
Token tree
Token ID5509
Feature activation+6.13e-01

Pos logit contributions
 scared+0.056
 worried+0.052
 voice+0.052
 frightened+0.052
 sad+0.051
Neg logit contributions
oll-0.073
der-0.067
oph-0.065
ellery-0.063
reating-0.062
Token and
Token ID290
Feature activation-4.89e-01

Pos logit contributions
der+0.028
oll+0.027
oph+0.025
 av+0.025
 bead+0.025
Neg logit contributions
 scared-0.025
 worried-0.025
 frightened-0.024
 sad-0.024
 afraid-0.023
Token carefully
Token ID7773
Feature activation+1.57e+00

Pos logit contributions
 scared+0.018
 worried+0.017
 frightened+0.017
 sad+0.016
 afraid+0.016
Neg logit contributions
der-0.025
oll-0.025
oph-0.023
 av-0.023
 bead-0.023
Token picked
Token ID6497
Feature activation+9.23e-02

Pos logit contributions
der+0.079
oll+0.077
oph+0.073
reating+0.071
 av+0.070
Neg logit contributions
 scared-0.060
 worried-0.058
 frightened-0.055
 sad-0.054
 shout-0.053
Token him
Token ID683
Feature activation-3.77e+00

Pos logit contributions
der+0.005
oll+0.005
oph+0.004
 av+0.004
 bead+0.004
Neg logit contributions
 scared-0.003
 worried-0.003
 frightened-0.003
 sad-0.003
 afraid-0.003
Token up
Token ID510
Feature activation-6.22e-01

Pos logit contributions
 scared+0.097
 worried+0.089
 frightened+0.086
 sad+0.086
 afraid+0.077
Neg logit contributions
der-0.233
oll-0.225
oph-0.223
 bead-0.213
 av-0.211
Token.
Token ID13
Feature activation+1.39e-01

Pos logit contributions
 scared+0.021
 worried+0.020
 sad+0.019
 frightened+0.019
 confused+0.018
Neg logit contributions
oll-0.034
der-0.033
oph-0.032
ils-0.031
reating-0.031
Token The
Token ID383
Feature activation-6.72e-01

Pos logit contributions
der+0.007
oll+0.007
oph+0.007
 av+0.007
 bead+0.007
Neg logit contributions
 scared-0.005
 worried-0.005
 frightened-0.004
 sad-0.004
 afraid-0.004
Token superhero
Token ID17343
Feature activation+1.11e+00

Pos logit contributions
 scared+0.027
 worried+0.027
 frightened+0.026
 sad+0.026
 afraid+0.025
Neg logit contributions
der-0.031
oll-0.029
 av-0.027
oph-0.027
 bead-0.027
Token held
Token ID2714
Feature activation+4.36e-01

Pos logit contributions
der+0.053
oll+0.051
oph+0.047
 av+0.047
 bead+0.046
Neg logit contributions
 scared-0.044
 worried-0.043
 frightened-0.041
 sad-0.040
 afraid-0.039