TOP ACTIVATIONS
MAX = 0.126

monkey continued to go to talent shows and applaud for their
and would often go to talent shows together . They would
hard work , you can accomplish anything ." The
of creativity , you can accomplish anything ! <|endoftext|> Once there
hat ! Everyone in the audience applauded him too . Tim
worked together , they could accomplish anything . <|endoftext|> Once upon
alone . We can always accomplish more when we work together
your brain , you can accomplish anything . <|endoftext|> Once upon
. After a few lessons , Anna was a pro
some help , she could accomplish great things . <|endoftext|> Once
proud of ourselves when we accomplish something new . <|endoftext|> Once
to do many more heroic deeds and never gave up no
and dedication , you can achieve great things ! <|endoftext|> Once
End of Story . <|endoftext|> L illy had
got cle vere r and cle vere r . Eventually the
girl learned that she could accomplish anything with the help of
try hard , you can achieve anything . <|endoftext|> Once upon
ily went to the piano lessons every week and practiced very
. Everyone was able to accomplish their tasks and the event
keep trying , you can achieve anything . <|endoftext|> Tom my

MIN ACTIVATIONS
MIN = -0.140

examine it with the magn ifying glass ?" "
the necklace and the magn ifying glass and told him that
the necklace and the magn ifying glass and started to examine
hat and grabbed her magn ifying glass . She was ready
and handed him the magn ifying glass . He
. Lily had a magn ifying glass that she found in
, handing him the magn ifying glass . She looked around
a flashlight and a magn ifying glass to examine their finds
. They had a magn ifying glass and a small box
borrowed her mom 's magn ifying glass to take a closer
said , holding the magn ifying glass over a black ant
She had a big magn ifying glass that made things look
He pointed his magn ifying glass at different classmates and
cried , dropping the magn ifying glass . "
it she had a magn ifying glass , a tiny torch
flashlight , and a magn ifying glass . As
the watch with the magn ifying glass . He saw the
He took out a magn ifying glass and a notebook and
that the zipper had become unst uck . She was relieved
He brought his magn ifying glass and looked closely .

INTERVAL 0.060 - 0.126
CONTAINS 0.384%

thought it was time to teach her something important .
The moral of the story is : Listen to your
listen . The stories were so fun and funny
The moral of the story is that it 's important
games and see who can win . One day

INTERVAL -0.007 - 0.060
CONTAINS 62.535%

. Have a nice day . Bye bye ." She walks
wander outside his home and explore the hills and forests nearby
lot of fun making noises and pretending to be animals .
in one piece . All right ?" John nodded
said , " You were quite brave for a little pup

INTERVAL -0.073 - -0.007
CONTAINS 37.044%

have fun . He was the happiest re ind eer in
and colorful ribbon . <|endoftext|> Once upon a time , there
in the park . Lily and Ben want to go on
, and the flowers began to bloom ! The friends smiled
stopped crying and said , " It 's okay , bird

INTERVAL -0.140 - -0.073
CONTAINS 0.037%

. He picked up the big bag of soil and placed
She used her black past el to draw a big cat
. It was brown and wil ted and disgusting . She
in a huge green field full of other grapes . The
some of her hair had fallen into the cookie dough .
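
The four INTERVAL blocks above look like an equal-width split of the range from MIN = -0.140 to MAX = 0.126, and the CONTAINS percentages appear to partition all sampled activations. The sketch below checks both readings using only the numbers printed above. The equal-width assumption, the bin count of four, and the names `bin_edges`, `MIN_ACT`, `MAX_ACT`, `N_BINS` and `shares` are illustrative and are not taken from the tool that produced this dump.

```python
# Values copied from the dashboard text above.
MIN_ACT = -0.140
MAX_ACT = 0.126
N_BINS = 4  # assumption: four equal-width intervals


def bin_edges(lo, hi, n):
    """Return n + 1 equally spaced edges from lo to hi."""
    width = (hi - lo) / n
    return [lo + i * width for i in range(n + 1)]


edges = bin_edges(MIN_ACT, MAX_ACT, N_BINS)

# CONTAINS percentages as listed, from the highest interval to the lowest.
shares = [0.384, 62.535, 37.044, 0.037]
total_share = sum(shares)

# Print each interval so it can be compared with the INTERVAL lines by eye.
for lo, hi in zip(edges[:-1], edges[1:]):
    print(f"{lo:+.4f} .. {hi:+.4f}")
print(f"total share: {total_share:.3f}%")
```

Under these assumptions the computed edges agree with the printed interval boundaries to within rounding, and the shares sum to 100%. This suggests that well over 99% of activations fall in the two middle intervals, while the TOP and MIN example lists are drawn from thin tails.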
Token monkey
Token ID21657
Feature activation+2.72e-02

Pos logit contributions
izes+0.047
 phrases+0.042
 languages+0.040
 listened+0.040
ative+0.039
Neg logit contributions
 bisc-0.053
 spe-0.051
 pitcher-0.049
etchup-0.048
 snowball-0.048
Token continued
Token ID3767
Feature activation+7.46e-03

Pos logit contributions
 phrases+0.043
ative+0.042
 professor+0.042
izes+0.041
 languages+0.039
Neg logit contributions
 spe-0.047
 bisc-0.045
 bur-0.044
 snowball-0.044
 pitcher-0.043
Token to
Token ID284
Feature activation+3.65e-02

Pos logit contributions
izes+0.013
 phrases+0.012
ative+0.012
 languages+0.011
 listened+0.011
Neg logit contributions
 spe-0.013
etchup-0.012
 pitcher-0.012
 bisc-0.012
 snowball-0.012
Token go
Token ID467
Feature activation-7.17e-03

Pos logit contributions
izes+0.065
ative+0.064
 phrases+0.062
 professor+0.061
 languages+0.061
Neg logit contributions
 spe-0.063
 bur-0.058
 bisc-0.057
 extingu-0.057
 pitcher-0.056
Token to
Token ID284
Feature activation+1.99e-02

Pos logit contributions
 spe+0.013
 bisc+0.013
etchup+0.013
 pitcher+0.013
 funnel+0.012
Neg logit contributions
 phrases-0.011
izes-0.011
 listened-0.011
 languages-0.011
ative-0.011
Token talent
Token ID7401
Feature activation+1.26e-01

Pos logit contributions
izes+0.032
ative+0.031
 phrases+0.030
 listened+0.029
 languages+0.028
Neg logit contributions
 spe-0.036
 pitcher-0.035
 bisc-0.035
 snowball-0.034
etchup-0.034
Token shows
Token ID2523
Feature activation+2.82e-02

Pos logit contributions
 phrases+0.208
 listened+0.197
 languages+0.184
izes+0.181
ered+0.180
Neg logit contributions
 pic-0.175
 spe-0.174
etchup-0.172
 pitcher-0.169
 cups-0.162
Token and
Token ID290
Feature activation+1.98e-02

Pos logit contributions
lder+0.047
mber+0.045
izes+0.043
 listened+0.042
working+0.041
Neg logit contributions
 extingu-0.038
 snowball-0.037
 bisc-0.036
 towels-0.035
 bur-0.035
Token applaud
Token ID22517
Feature activation+3.07e-02

Pos logit contributions
izes+0.028
ative+0.027
 phrases+0.024
 professor+0.024
 questions+0.023
Neg logit contributions
 spe-0.039
 bisc-0.038
 pitcher-0.036
 bur-0.036
etchup-0.035
Token for
Token ID329
Feature activation+3.44e-02

Pos logit contributions
izes+0.049
 listened+0.049
 obey+0.046
 learn+0.045
ager+0.045
Neg logit contributions
ksh-0.049
ifying-0.048
umbs-0.048
 spe-0.047
 extingu-0.047
Token their
Token ID511
Feature activation+1.41e-02

Pos logit contributions
izes+0.052
ative+0.046
 phrases+0.043
 listened+0.043
 nodded+0.041
Neg logit contributions
 spe-0.069
etchup-0.063
 snowball-0.061
 bisc-0.060
 bur-0.060
Token and
Token ID290
Feature activation+1.38e-02

Pos logit contributions
izes+0.080
 phrases+0.074
 questions+0.068
mber+0.067
 elders+0.067
Neg logit contributions
 spe-0.077
 snowball-0.069
 bisc-0.069
etchup-0.068
 extingu-0.067
Token would
Token ID561
Feature activation+2.67e-02

Pos logit contributions
izes+0.022
ative+0.020
 languages+0.019
 phrases+0.019
 questions+0.019
Neg logit contributions
 spe-0.029
 bisc-0.025
etchup-0.025
 pitcher-0.024
 snowball-0.023
Token often
Token ID1690
Feature activation+4.13e-02

Pos logit contributions
izes+0.026
 phrases+0.023
ative+0.023
 languages+0.022
 professor+0.021
Neg logit contributions
 spe-0.069
 bisc-0.063
 pitcher-0.063
 snowball-0.063
 bur-0.062
Token go
Token ID467
Feature activation-5.25e-03

Pos logit contributions
izes+0.067
ative+0.063
 phrases+0.063
 languages+0.060
 questions+0.059
Neg logit contributions
 spe-0.078
 bisc-0.074
 pitcher-0.071
 extingu-0.069
etchup-0.068
Token to
Token ID284
Feature activation+2.11e-02

Pos logit contributions
 spe+0.011
 bisc+0.011
 pitcher+0.010
etchup+0.010
 extingu+0.010
Neg logit contributions
izes-0.007
 phrases-0.007
 listened-0.006
 stories-0.006
 questions-0.006
Token talent
Token ID7401
Feature activation+1.16e-01

Pos logit contributions
izes+0.027
 phrases+0.025
ative+0.025
 listened+0.025
 languages+0.024
Neg logit contributions
 spe-0.045
 bisc-0.044
etchup-0.044
 snowball-0.043
 pitcher-0.043
Token shows
Token ID2523
Feature activation+2.70e-02

Pos logit contributions
 phrases+0.179
 listened+0.165
izes+0.160
 languages+0.159
 nodded+0.151
Neg logit contributions
etchup-0.190
 spe-0.189
 pic-0.187
 pitcher-0.179
 bisc-0.176
Token together
Token ID1978
Feature activation+3.08e-02

Pos logit contributions
mber+0.045
lder+0.045
izes+0.044
 listened+0.041
ager+0.040
Neg logit contributions
 extingu-0.036
 snowball-0.036
 spe-0.035
etchup-0.035
 bisc-0.035
Token.
Token ID13
Feature activation+1.86e-02

Pos logit contributions
izes+0.052
 phrases+0.046
 elders+0.045
 professor+0.045
mber+0.044
Neg logit contributions
 spe-0.053
 bisc-0.048
 snowball-0.048
etchup-0.048
 extingu-0.048
Token They
Token ID1119
Feature activation+2.44e-02

Pos logit contributions
 Everyone+0.022
 everyone+0.020
 Everybody+0.019
 stories+0.019
 learn+0.018
Neg logit contributions
 spe-0.045
 bisc-0.042
 extingu-0.042
 pitcher-0.041
etchup-0.041
Token would
Token ID561
Feature activation+1.26e-02

Pos logit contributions
izes+0.041
 phrases+0.039
 questions+0.036
ently+0.036
 elders+0.036
Neg logit contributions
 spe-0.045
 bur-0.037
 pitcher-0.035
etchup-0.035
 bisc-0.035
Token hard
Token ID1327
Feature activation+4.64e-03

Pos logit contributions
ative+0.018
izes+0.017
itive+0.017
 languages+0.017
 professor+0.017
Neg logit contributions
 spe-0.024
 bisc-0.024
etchup-0.022
 bur-0.022
 pitcher-0.021
Token work
Token ID670
Feature activation+6.54e-02

Pos logit contributions
izes+0.007
ative+0.006
 languages+0.006
 professor+0.006
 phrases+0.006
Neg logit contributions
 spe-0.009
etchup-0.008
 extingu-0.008
 bisc-0.008
 pitcher-0.008
Token,
Token ID11
Feature activation+2.08e-02

Pos logit contributions
 phrases+0.106
 communicate+0.102
 professor+0.101
 languages+0.100
 questions+0.100
Neg logit contributions
 spe-0.111
etchup-0.102
 pitcher-0.098
 funnel-0.096
 snowball-0.095
Token you
Token ID345
Feature activation+3.51e-02

Pos logit contributions
izes+0.031
 phrases+0.030
ative+0.030
 languages+0.029
 professor+0.029
Neg logit contributions
 spe-0.038
 bisc-0.037
 pitcher-0.036
 snowball-0.036
etchup-0.035
Token can
Token ID460
Feature activation+3.53e-03

Pos logit contributions
 phrases+0.071
 languages+0.070
izes+0.069
 stories+0.067
 questions+0.066
Neg logit contributions
 spe-0.048
 pitcher-0.041
umbs-0.041
 bisc-0.040
 funnel-0.040
Token accomplish
Token ID9989
Feature activation+1.12e-01

Pos logit contributions
 languages+0.005
 phrases+0.005
izes+0.005
 stories+0.005
 questions+0.005
Neg logit contributions
 spe-0.006
 pitcher-0.006
 bur-0.006
 bisc-0.006
 snowball-0.006
Token anything
Token ID1997
Feature activation+7.29e-02

Pos logit contributions
ative+0.188
 listened+0.187
 phrases+0.186
izes+0.184
 professor+0.181
Neg logit contributions
 spe-0.180
 snowball-0.177
etchup-0.148
 pitcher-0.148
 bisc-0.146
Token."
Token ID526
Feature activation+2.91e-02

Pos logit contributions
 phrases+0.132
 nodded+0.126
 listened+0.123
izes+0.120
 professor+0.119
Neg logit contributions
 spe-0.109
etchup-0.090
 pitcher-0.087
 snowball-0.085
umbs-0.083
Token
Token ID198
Feature activation+7.95e-03

Pos logit contributions
izes+0.051
 phrases+0.047
 listened+0.046
 stories+0.046
ative+0.046
Neg logit contributions
 bisc-0.050
etchup-0.048
 spe-0.048
 pitcher-0.048
 snowball-0.047
Token
Token ID198
Feature activation+7.72e-03

Pos logit contributions
izes+0.011
 phrases+0.011
 listened+0.011
 questions+0.011
 stories+0.011
Neg logit contributions
 spe-0.015
 bisc-0.015
 pitcher-0.015
etchup-0.015
 extingu-0.014
TokenThe
Token ID464
Feature activation+4.70e-02

Pos logit contributions
izes+0.014
 phrases+0.014
 stories+0.014
 languages+0.014
 questions+0.014
Neg logit contributions
 bisc-0.012
 snowball-0.011
 pitcher-0.011
 spe-0.011
etchup-0.011
Token of
Token ID286
Feature activation+4.41e-03

Pos logit contributions
etchup+0.008
 spe+0.008
 bisc+0.008
 pitcher+0.008
 funnel+0.008
Neg logit contributions
izes-0.006
 phrases-0.006
ative-0.006
 listened-0.006
 languages-0.006
Token creativity
Token ID16389
Feature activation+7.11e-02

Pos logit contributions
ative+0.006
izes+0.005
 phrases+0.005
 listened+0.005
 languages+0.005
Neg logit contributions
 bisc-0.009
etchup-0.009
 pitcher-0.009
 spe-0.009
 extingu-0.009
Token,
Token ID11
Feature activation+1.90e-02

Pos logit contributions
izes+0.097
 phrases+0.095
 professor+0.091
 listened+0.090
lder+0.090
Neg logit contributions
 extingu-0.126
 bisc-0.126
 spe-0.125
etchup-0.125
 pitcher-0.124
Token you
Token ID345
Feature activation+3.24e-02

Pos logit contributions
izes+0.025
ative+0.024
 phrases+0.023
 listened+0.023
 professor+0.023
Neg logit contributions
 spe-0.038
 bisc-0.036
etchup-0.036
 pitcher-0.036
 funnel-0.035
Token can
Token ID460
Feature activation+3.24e-03

Pos logit contributions
 phrases+0.066
 languages+0.064
izes+0.064
mber+0.063
 stories+0.062
Neg logit contributions
 spe-0.043
 bur-0.035
 funnel-0.035
etchup-0.034
 sponge-0.033
Token accomplish
Token ID9989
Feature activation+1.11e-01

Pos logit contributions
 languages+0.005
 phrases+0.005
izes+0.004
 questions+0.004
 stories+0.004
Neg logit contributions
 spe-0.006
 pitcher-0.006
 bisc-0.006
 bur-0.006
 extingu-0.006
Token anything
Token ID1997
Feature activation+6.96e-02

Pos logit contributions
 listened+0.182
ative+0.179
 phrases+0.178
bye+0.174
izes+0.174
Neg logit contributions
 snowball-0.178
 spe-0.176
 pitcher-0.158
etchup-0.157
 bisc-0.156
Token!
Token ID0
Feature activation+2.77e-02

Pos logit contributions
 phrases+0.117
 nodded+0.111
 listened+0.108
izes+0.108
 professor+0.106
Neg logit contributions
 spe-0.114
etchup-0.099
 pitcher-0.096
 bisc-0.090
 snowball-0.089
Token<|endoftext|>
Token ID50256
Feature activation-1.15e-02

Pos logit contributions
 Everyone+0.033
 learn+0.033
 everyone+0.032
 phrases+0.032
 listened+0.032
Neg logit contributions
 spe-0.061
 bisc-0.059
 pitcher-0.058
 snowball-0.057
etchup-0.056
TokenOnce
Token ID7454
Feature activation-3.27e-02

Pos logit contributions
 spe+0.022
 bisc+0.021
 extingu+0.021
 pitcher+0.021
etchup+0.021
Neg logit contributions
ative-0.016
 listened-0.016
 everyone-0.016
 Everyone-0.016
 questions-0.016
Token there
Token ID612
Feature activation+1.04e-03

Pos logit contributions
 spe+0.055
 bisc+0.055
etchup+0.053
 pitcher+0.053
 extingu+0.051
Neg logit contributions
izes-0.053
 phrases-0.053
 listened-0.052
 stories-0.052
 questions-0.051
Token hat
Token ID6877
Feature activation-5.76e-02

Pos logit contributions
 bisc+0.036
 spe+0.036
etchup+0.035
 pitcher+0.035
 extingu+0.035
Neg logit contributions
izes-0.021
 phrases-0.020
ative-0.019
 languages-0.019
 questions-0.019
Token!
Token ID0
Feature activation+3.35e-03

Pos logit contributions
 bisc+0.113
 spe+0.105
 pitcher+0.104
 funnel+0.102
umbs+0.101
Neg logit contributions
 phrases-0.086
 languages-0.082
 professor-0.081
ative-0.079
 questions-0.079
Token Everyone
Token ID11075
Feature activation+6.00e-02

Pos logit contributions
izes+0.005
 phrases+0.005
 listened+0.005
ative+0.005
 stories+0.005
Neg logit contributions
 bisc-0.006
 spe-0.006
 pitcher-0.006
etchup-0.006
 funnel-0.006
Token in
Token ID287
Feature activation+2.91e-02

Pos logit contributions
 phrases+0.098
ative+0.094
 professor+0.092
isdom+0.091
 questions+0.090
Neg logit contributions
 spe-0.088
etchup-0.084
 spilled-0.076
 pitcher-0.074
 snowball-0.074
Token the
Token ID262
Feature activation+4.67e-02

Pos logit contributions
izes+0.040
 phrases+0.038
ative+0.036
 listened+0.035
 questions+0.034
Neg logit contributions
 spe-0.062
etchup-0.057
 bisc-0.057
 pitcher-0.057
 snowball-0.056
Token audience
Token ID5386
Feature activation+1.10e-01

Pos logit contributions
izes+0.080
 phrases+0.080
ative+0.078
 listened+0.077
 questions+0.074
Neg logit contributions
 bisc-0.077
 spe-0.076
 pitcher-0.075
etchup-0.073
 snowball-0.073
Token applauded
Token ID38911
Feature activation+3.46e-02

Pos logit contributions
 professor+0.164
ative+0.164
 phrases+0.163
izes+0.154
 questions+0.152
Neg logit contributions
 spe-0.176
 pitcher-0.169
etchup-0.168
 funnel-0.164
 cups-0.158
Token him
Token ID683
Feature activation-1.13e-02

Pos logit contributions
izes+0.059
Deal+0.055
 listened+0.054
 phrases+0.053
 professor+0.053
Neg logit contributions
 spe-0.050
 extingu-0.049
ksh-0.044
 cups-0.044
 bisc-0.044
Token too
Token ID1165
Feature activation+7.81e-03

Pos logit contributions
 bisc+0.019
 snowball+0.018
umbs+0.017
 spe+0.017
etchup+0.017
Neg logit contributions
izes-0.018
 phrases-0.017
 listened-0.017
ative-0.017
Deal-0.016
Token.
Token ID13
Feature activation+1.04e-02

Pos logit contributions
izes+0.014
 phrases+0.014
 languages+0.013
 professor+0.013
ative+0.013
Neg logit contributions
 spe-0.012
umbs-0.011
 snowball-0.011
 pitcher-0.011
 bisc-0.010
Token Tim
Token ID5045
Feature activation-5.48e-03

Pos logit contributions
izes+0.017
 listened+0.016
 phrases+0.016
 stories+0.016
ative+0.016
Neg logit contributions
 bisc-0.017
etchup-0.017
 spe-0.017
 pitcher-0.017
 funnel-0.017
Token worked
Token ID3111
Feature activation+5.18e-02

Pos logit contributions
ative+0.026
izes+0.026
 professor+0.025
mber+0.024
ya+0.024
Neg logit contributions
 spe-0.036
etchup-0.033
 bisc-0.033
 pitcher-0.033
 bur-0.032
Token together
Token ID1978
Feature activation+3.68e-02

Pos logit contributions
lder+0.088
 professor+0.086
 musician+0.085
 phrases+0.084
ibrarian+0.084
Neg logit contributions
 spe-0.079
 snowball-0.077
etchup-0.075
 funnel-0.071
 extingu-0.071
Token,
Token ID11
Feature activation+2.59e-02

Pos logit contributions
izes+0.055
 phrases+0.051
 professor+0.049
 languages+0.049
 elders+0.049
Neg logit contributions
 spe-0.071
 extingu-0.065
etchup-0.065
 bisc-0.064
 pitcher-0.064
Token they
Token ID484
Feature activation+2.00e-02

Pos logit contributions
izes+0.036
ative+0.033
 phrases+0.033
 languages+0.032
 professor+0.032
Neg logit contributions
 spe-0.054
etchup-0.050
 bisc-0.050
 pitcher-0.049
 extingu-0.048
Token could
Token ID714
Feature activation-2.61e-03

Pos logit contributions
izes+0.033
 phrases+0.032
 languages+0.032
mber+0.032
 stories+0.032
Neg logit contributions
 spe-0.034
 bur-0.029
 extingu-0.028
 funnel-0.028
 pitcher-0.027
Token accomplish
Token ID9989
Feature activation+1.09e-01

Pos logit contributions
 spe+0.005
 pitcher+0.005
 snowball+0.005
 bur+0.005
 extingu+0.005
Neg logit contributions
izes-0.004
 languages-0.004
 phrases-0.004
mber-0.004
ative-0.004
Token anything
Token ID1997
Feature activation+6.80e-02

Pos logit contributions
ative+0.172
 phrases+0.170
 listened+0.169
izes+0.168
lder+0.168
Neg logit contributions
 spe-0.182
 snowball-0.179
etchup-0.155
 funnel-0.154
 pitcher-0.154
Token.
Token ID13
Feature activation+1.93e-02

Pos logit contributions
 phrases+0.103
izes+0.099
 professor+0.093
ative+0.093
 nodded+0.093
Neg logit contributions
 spe-0.129
etchup-0.110
 extingu-0.105
 pitcher-0.104
 bisc-0.103
Token<|endoftext|>
Token ID50256
Feature activation-1.46e-02

Pos logit contributions
 phrases+0.024
 everyone+0.024
 stories+0.024
 listened+0.024
 Everyone+0.024
Neg logit contributions
 spe-0.040
 bisc-0.039
 pitcher-0.038
etchup-0.038
 extingu-0.037
TokenOnce
Token ID7454
Feature activation-3.52e-02

Pos logit contributions
 spe+0.029
 bisc+0.029
 pitcher+0.028
 extingu+0.028
etchup+0.027
Neg logit contributions
 everyone-0.019
 listened-0.019
 Everyone-0.019
ative-0.019
 questions-0.019
Token upon
Token ID2402
Feature activation+2.24e-02

Pos logit contributions
 spe+0.058
 bisc+0.057
etchup+0.056
 pitcher+0.056
 snowball+0.054
Neg logit contributions
izes-0.059
 phrases-0.059
 listened-0.057
 stories-0.057
ative-0.056
Token alone
Token ID3436
Feature activation-1.23e-02

Pos logit contributions
 phrases+0.053
izes+0.052
 listened+0.052
mber+0.051
 professor+0.050
Neg logit contributions
 spe-0.071
 extingu-0.068
etchup-0.067
 pitcher-0.066
 snowball-0.066
Token.
Token ID13
Feature activation+9.29e-03

Pos logit contributions
 spe+0.023
etchup+0.022
 extingu+0.022
 pitcher+0.021
 funnel+0.020
Neg logit contributions
izes-0.017
 phrases-0.017
mber-0.016
 stories-0.016
 listened-0.016
Token We
Token ID775
Feature activation-6.37e-03

Pos logit contributions
izes+0.015
 phrases+0.014
 stories+0.014
 listened+0.014
 questions+0.014
Neg logit contributions
 spe-0.016
etchup-0.016
 bisc-0.016
 pitcher-0.015
 funnel-0.015
Token can
Token ID460
Feature activation-3.32e-03

Pos logit contributions
 spe+0.010
 funnel+0.009
 pitcher+0.009
 bisc+0.009
 bur+0.009
Neg logit contributions
 phrases-0.011
 stories-0.011
 languages-0.010
izes-0.010
 songs-0.010
Token always
Token ID1464
Feature activation+5.52e-02

Pos logit contributions
 spe+0.006
 pitcher+0.006
 funnel+0.006
 extingu+0.006
 bur+0.006
Neg logit contributions
 languages-0.004
izes-0.004
 phrases-0.004
 stories-0.004
 questions-0.004
Token accomplish
Token ID9989
Feature activation+1.08e-01

Pos logit contributions
mber+0.085
ative+0.085
izes+0.084
 phrases+0.079
 languages+0.078
Neg logit contributions
 spe-0.094
 extingu-0.091
 bisc-0.087
 funnel-0.087
 pitcher-0.086
Token more
Token ID517
Feature activation+5.71e-02

Pos logit contributions
ative+0.174
 listened+0.171
bye+0.169
mber+0.169
 nodded+0.169
Neg logit contributions
 spe-0.174
 snowball-0.172
etchup-0.151
 funnel-0.151
 extingu-0.151
Token when
Token ID618
Feature activation+2.12e-02

Pos logit contributions
 listened+0.083
izes+0.081
 nodded+0.081
ative+0.079
ibrarian+0.079
Neg logit contributions
 spe-0.101
 snowball-0.094
 extingu-0.090
etchup-0.089
 ditch-0.089
Token we
Token ID356
Feature activation-1.09e-03

Pos logit contributions
izes+0.037
ative+0.036
 phrases+0.035
 languages+0.034
 listened+0.034
Neg logit contributions
 spe-0.036
etchup-0.033
 pitcher-0.032
 funnel-0.032
 extingu-0.030
Token work
Token ID670
Feature activation+7.69e-02

Pos logit contributions
 pitcher+0.002
 funnel+0.002
 spe+0.002
 sponge+0.002
 bisc+0.002
Neg logit contributions
mber-0.002
izes-0.002
ative-0.002
 stories-0.002
ya-0.002
Token together
Token ID1978
Feature activation+3.87e-02

Pos logit contributions
 phrases+0.130
mber+0.128
 nodded+0.126
 listened+0.126
 languages+0.125
Neg logit contributions
 spe-0.116
etchup-0.110
 funnel-0.109
 extingu-0.108
 pitcher-0.107
Token your
Token ID534
Feature activation+1.25e-02

Pos logit contributions
izes+0.037
ative+0.036
 phrases+0.034
 listened+0.034
 professor+0.033
Neg logit contributions
 spe-0.053
etchup-0.051
 bisc-0.051
 bur-0.050
 snowball-0.050
Token brain
Token ID3632
Feature activation+3.57e-03

Pos logit contributions
izes+0.019
 phrases+0.019
 listened+0.018
ative+0.018
master+0.017
Neg logit contributions
 spe-0.022
 bur-0.021
etchup-0.020
 pitcher-0.019
 funnel-0.019
Token,
Token ID11
Feature activation+5.92e-03

Pos logit contributions
 phrases+0.005
izes+0.005
ative+0.005
 stories+0.005
 professor+0.005
Neg logit contributions
etchup-0.006
 spe-0.006
 pitcher-0.006
 bisc-0.006
 bur-0.006
Token you
Token ID345
Feature activation+2.59e-02

Pos logit contributions
izes+0.009
ative+0.009
 phrases+0.008
 professor+0.008
 languages+0.008
Neg logit contributions
 spe-0.011
 bur-0.010
etchup-0.009
 pitcher-0.009
 bisc-0.009
Token can
Token ID460
Feature activation-2.71e-03

Pos logit contributions
 phrases+0.053
 languages+0.052
 stories+0.052
itive+0.051
ya+0.051
Neg logit contributions
 spe-0.031
 bur-0.027
 funnel-0.024
 bisc-0.024
 pitcher-0.023
Token accomplish
Token ID9989
Feature activation+1.08e-01

Pos logit contributions
 bur+0.005
 spe+0.005
 pitcher+0.004
+0.004
 bisc+0.004
Neg logit contributions
 languages-0.004
 stories-0.004
 phrases-0.004
izes-0.004
 questions-0.004
Token anything
Token ID1997
Feature activation+6.55e-02

Pos logit contributions
ative+0.187
 listened+0.187
bye+0.185
lder+0.185
 professor+0.181
Neg logit contributions
 snowball-0.156
 spe-0.153
etchup-0.133
 bisc-0.129
 pitcher-0.125
Token.
Token ID13
Feature activation+2.16e-02

Pos logit contributions
 phrases+0.105
 listened+0.098
 nodded+0.097
izes+0.096
 professor+0.095
Neg logit contributions
 spe-0.111
etchup-0.097
 pitcher-0.092
 bur-0.089
 snowball-0.088
Token<|endoftext|>
Token ID50256
Feature activation-1.48e-02

Pos logit contributions
izes+0.032
 phrases+0.032
 stories+0.031
 listened+0.031
 questions+0.031
Neg logit contributions
 spe-0.040
 bisc-0.039
etchup-0.039
 pitcher-0.038
 extingu-0.037
TokenOnce
Token ID7454
Feature activation-3.47e-02

Pos logit contributions
 spe+0.026
 bisc+0.025
 pitcher+0.025
 extingu+0.024
etchup+0.024
Neg logit contributions
 listened-0.023
izes-0.023
ative-0.023
 questions-0.023
 everyone-0.023
Token upon
Token ID2402
Feature activation+1.88e-02

Pos logit contributions
 spe+0.053
 bisc+0.052
etchup+0.052
 pitcher+0.051
 snowball+0.049
Neg logit contributions
 phrases-0.062
izes-0.061
 listened-0.060
ative-0.059
 stories-0.059
Token
Token ID26391
Feature activation+1.36e-02

Pos logit contributions
 stories+0.023
 phrases+0.022
 questions+0.022
 teachers+0.021
 languages+0.021
Neg logit contributions
 bisc-0.023
etchup-0.022
 pitcher-0.021
 funnel-0.020
ksh-0.019
Token.
Token ID13
Feature activation+9.27e-03

Pos logit contributions
izes+0.020
 phrases+0.020
 stories+0.019
 questions+0.019
ative+0.019
Neg logit contributions
 spe-0.026
 bisc-0.026
etchup-0.026
 pitcher-0.025
 snowball-0.024
Token After
Token ID2293
Feature activation+2.56e-02

Pos logit contributions
izes+0.013
 phrases+0.013
 listened+0.013
 stories+0.013
 questions+0.013
Neg logit contributions
 spe-0.017
 bisc-0.017
etchup-0.017
 pitcher-0.017
 extingu-0.016
Token a
Token ID257
Feature activation+2.51e-02

Pos logit contributions
izes+0.039
ative+0.037
 languages+0.035
 phrases+0.035
 professor+0.034
Neg logit contributions
 spe-0.048
etchup-0.046
 bisc-0.046
 pitcher-0.045
 funnel-0.044
Token few
Token ID1178
Feature activation+3.74e-02

Pos logit contributions
izes+0.042
 phrases+0.041
ative+0.040
 languages+0.040
 questions+0.038
Neg logit contributions
 bisc-0.044
 spe-0.044
 pitcher-0.043
 snowball-0.043
etchup-0.040
Token lessons
Token ID11658
Feature activation+1.08e-01

Pos logit contributions
ative+0.092
 phrases+0.089
izes+0.088
 professor+0.088
yer+0.086
Neg logit contributions
 spe-0.035
 snowball-0.031
etchup-0.031
 bisc-0.029
 pitcher-0.029
Token,
Token ID11
Feature activation+2.06e-02

Pos logit contributions
 advisor+0.143
ative+0.142
izes+0.141
 phrases+0.136
 professor+0.133
Neg logit contributions
 spe-0.210
etchup-0.204
 pitcher-0.195
 funnel-0.186
 umb-0.186
Token Anna
Token ID11735
Feature activation+7.70e-03

Pos logit contributions
izes+0.037
ative+0.037
 phrases+0.035
 languages+0.035
 listened+0.033
Neg logit contributions
 spe-0.031
 pitcher-0.031
 bisc-0.030
etchup-0.029
 snowball-0.027
Token was
Token ID373
Feature activation+1.69e-02

Pos logit contributions
ative+0.011
 languages+0.011
 phrases+0.011
izes+0.011
 stories+0.011
Neg logit contributions
 spe-0.014
 bisc-0.013
etchup-0.013
 pitcher-0.013
 bur-0.013
Token a
Token ID257
Feature activation+1.41e-03

Pos logit contributions
 languages+0.027
 phrases+0.025
izes+0.025
 stories+0.024
 deeds+0.024
Neg logit contributions
 spe-0.030
 pitcher-0.029
etchup-0.029
 snowball-0.027
 funnel-0.027
Token pro
Token ID386
Feature activation+2.57e-02

Pos logit contributions
izes+0.002
 languages+0.002
 phrases+0.002
ative+0.002
ently+0.002
Neg logit contributions
 spe-0.003
 pitcher-0.002
 snowball-0.002
 bisc-0.002
 funnel-0.002
Token some
Token ID617
Feature activation-7.03e-03

Pos logit contributions
izes+0.002
ative+0.002
yer+0.002
mber+0.002
 nodded+0.002
Neg logit contributions
 spe-0.003
etchup-0.003
 bisc-0.003
 bur-0.002
 umb-0.002
Token help
Token ID1037
Feature activation+1.32e-03

Pos logit contributions
 spe+0.014
 bisc+0.013
etchup+0.013
 snowball+0.012
 pitcher+0.012
Neg logit contributions
izes-0.010
ative-0.009
yer-0.009
master-0.009
mber-0.009
Token,
Token ID11
Feature activation+1.18e-02

Pos logit contributions
izes+0.002
 nodded+0.002
 phrases+0.002
mber+0.002
ya+0.002
Neg logit contributions
 spe-0.002
 bisc-0.002
 extingu-0.002
 pitcher-0.002
etchup-0.002
Token she
Token ID673
Feature activation-1.29e-03

Pos logit contributions
izes+0.015
ative+0.014
 phrases+0.013
 listened+0.012
 professor+0.012
Neg logit contributions
 spe-0.026
etchup-0.024
 bisc-0.024
 pitcher-0.023
 snowball-0.022
Token could
Token ID714
Feature activation-8.04e-03

Pos logit contributions
 spe+0.002
etchup+0.002
 bur+0.002
 bisc+0.002
 pitcher+0.002
Neg logit contributions
 languages-0.002
 stories-0.002
 phrases-0.002
 questions-0.002
ative-0.002
Token accomplish
Token ID9989
Feature activation+1.06e-01

Pos logit contributions
 spe+0.013
 bur+0.013
 pitcher+0.013
 extingu+0.012
+0.012
Neg logit contributions
mber-0.013
izes-0.012
 languages-0.012
 phrases-0.012
ative-0.011
Token great
Token ID1049
Feature activation+2.59e-02

Pos logit contributions
bye+0.197
lder+0.195
ative+0.195
 phrases+0.192
 listened+0.191
Neg logit contributions
 spe-0.142
 snowball-0.136
 bisc-0.112
 pic-0.111
etchup-0.111
Token things
Token ID1243
Feature activation+8.37e-02

Pos logit contributions
ibrarian+0.039
izes+0.039
mber+0.038
master+0.038
ative+0.036
Neg logit contributions
 spe-0.041
 snowball-0.038
 pic-0.037
 ditch-0.035
 bur-0.035
Token.
Token ID13
Feature activation+1.96e-02

Pos logit contributions
 phrases+0.143
mber+0.138
izes+0.137
 elders+0.135
 Everyone+0.132
Neg logit contributions
 spe-0.124
 extingu-0.112
etchup-0.109
 bisc-0.108
 bur-0.102
Token<|endoftext|>
Token ID50256
Feature activation-1.66e-02

Pos logit contributions
izes+0.035
 phrases+0.034
 listened+0.033
 stories+0.032
 questions+0.032
Neg logit contributions
 bisc-0.031
etchup-0.031
 spe-0.031
 pitcher-0.030
umbs-0.029
TokenOnce
Token ID7454
Feature activation-3.63e-02

Pos logit contributions
 spe+0.031
 bisc+0.029
 pitcher+0.029
 extingu+0.029
etchup+0.028
Neg logit contributions
 listened-0.024
ative-0.024
 everyone-0.024
 Everyone-0.024
 questions-0.024
Token proud
Token ID6613
Feature activation+1.57e-02

Pos logit contributions
 phrases+0.055
izes+0.053
 languages+0.051
mber+0.051
 professor+0.050
Neg logit contributions
 spe-0.083
etchup-0.075
 extingu-0.072
 pitcher-0.072
 funnel-0.070
Token of
Token ID286
Feature activation+1.75e-03

Pos logit contributions
izes+0.021
 phrases+0.020
 listened+0.020
 stories+0.019
 languages+0.019
Neg logit contributions
 spe-0.032
etchup-0.031
 bisc-0.030
 pitcher-0.030
 extingu-0.030
Token ourselves
Token ID6731
Feature activation+2.57e-02

Pos logit contributions
 nodded+0.003
mber+0.003
 professor+0.003
master+0.003
 phrases+0.003
Neg logit contributions
 spe-0.002
 extingu-0.002
 vinegar-0.002
etchup-0.002
 funnel-0.002
Token when
Token ID618
Feature activation+1.59e-02

Pos logit contributions
izes+0.041
 listened+0.040
mber+0.039
 phrases+0.039
 professor+0.039
Neg logit contributions
 spe-0.041
 extingu-0.040
 pitcher-0.039
 funnel-0.037
etchup-0.037
Token we
Token ID356
Feature activation-1.28e-02

Pos logit contributions
ative+0.027
izes+0.027
 phrases+0.026
 professor+0.026
 nodded+0.025
Neg logit contributions
 spe-0.026
etchup-0.025
 pitcher-0.024
 funnel-0.023
 bisc-0.022
Token accomplish
Token ID9989
Feature activation+1.06e-01

Pos logit contributions
 bur+0.018
 spe+0.018
 bisc+0.018
 pitcher+0.017
etchup+0.017
Neg logit contributions
mber-0.023
izes-0.023
ative-0.023
ya-0.022
 languages-0.022
Token something
Token ID1223
Feature activation+4.81e-02

Pos logit contributions
 phrases+0.167
ative+0.165
 professor+0.165
bye+0.164
mber+0.160
Neg logit contributions
 snowball-0.170
 spe-0.170
 pitcher-0.157
etchup-0.157
 funnel-0.155
Token new
Token ID649
Feature activation+1.14e-02

Pos logit contributions
 phrases+0.071
ative+0.067
 professor+0.066
 listened+0.066
 nodded+0.066
Neg logit contributions
 spe-0.088
etchup-0.081
 pitcher-0.080
 extingu-0.077
 bisc-0.076
Token.
Token ID13
Feature activation+2.11e-02

Pos logit contributions
mber+0.018
 phrases+0.017
izes+0.017
 listened+0.017
lder+0.017
Neg logit contributions
 spe-0.016
etchup-0.016
 extingu-0.016
 bisc-0.016
 snowball-0.015
Token<|endoftext|>
Token ID50256
Feature activation-1.74e-02

Pos logit contributions
 Everyone+0.022
 everyone+0.021
 learn+0.020
 Everybody+0.018
 kind+0.018
Neg logit contributions
 spe-0.052
 bisc-0.050
 pitcher-0.049
 snowball-0.049
 extingu-0.049
TokenOnce
Token ID7454
Feature activation-3.79e-02

Pos logit contributions
 spe+0.040
 extingu+0.038
 bisc+0.038
 pitcher+0.038
etchup+0.037
Neg logit contributions
 Everyone-0.019
 everyone-0.019
 listened-0.018
ative-0.018
 questions-0.018
Token to
Token ID284
Feature activation+3.31e-02

Pos logit contributions
izes+0.033
ative+0.030
 phrases+0.030
itive+0.028
 listened+0.028
Neg logit contributions
 spe-0.052
 bur-0.050
 snowball-0.050
 bisc-0.049
 extingu-0.046
Token do
Token ID466
Feature activation+5.93e-02

Pos logit contributions
izes+0.057
ative+0.056
 phrases+0.054
 languages+0.052
mber+0.051
Neg logit contributions
 spe-0.056
 bisc-0.053
 bur-0.053
 extingu-0.053
 pitcher-0.051
Token many
Token ID867
Feature activation+6.91e-02

Pos logit contributions
ative+0.098
izes+0.096
bye+0.095
 professor+0.092
 phrases+0.090
Neg logit contributions
 snowball-0.098
 mud-0.089
 bisc-0.086
 spe-0.084
etchup-0.084
Token more
Token ID517
Feature activation+5.65e-02

Pos logit contributions
ative+0.104
izes+0.103
lder+0.099
 professor+0.098
 phrases+0.097
Neg logit contributions
 spe-0.125
 snowball-0.121
 pic-0.108
etchup-0.106
 mud-0.105
Token heroic
Token ID23523
Feature activation+4.75e-02

Pos logit contributions
izes+0.088
ative+0.082
 listened+0.082
 phrases+0.078
mber+0.076
Neg logit contributions
 spe-0.107
 snowball-0.100
 bisc-0.092
etchup-0.091
 bur-0.089
Token deeds
Token ID23777
Feature activation+1.06e-01

Pos logit contributions
izes+0.070
 listened+0.063
mber+0.063
ative+0.062
master+0.062
Neg logit contributions
 spe-0.089
 bur-0.075
etchup-0.075
 bisc-0.074
 extingu-0.074
Token and
Token ID290
Feature activation+3.20e-02

Pos logit contributions
izes+0.177
 phrases+0.172
mber+0.168
 elders+0.167
lder+0.166
Neg logit contributions
 spe-0.158
 extingu-0.146
 bisc-0.137
etchup-0.132
 funnel-0.129
Token never
Token ID1239
Feature activation-6.99e-03

Pos logit contributions
ative+0.042
izes+0.040
lder+0.039
yer+0.038
itive+0.038
Neg logit contributions
 spe-0.062
 bisc-0.057
 bur-0.055
etchup-0.053
 pitcher-0.051
Token gave
Token ID2921
Feature activation-1.26e-02

Pos logit contributions
 spe+0.013
 bisc+0.011
 pitcher+0.011
 bur+0.011
etchup+0.011
Neg logit contributions
izes-0.012
ative-0.011
 stories-0.011
 languages-0.011
 questions-0.010
Token up
Token ID510
Feature activation+1.31e-02

Pos logit contributions
 snowball+0.023
 spe+0.023
etchup+0.023
 bisc+0.022
 bur+0.021
Neg logit contributions
izes-0.018
 phrases-0.017
ative-0.017
 listened-0.017
 professor-0.016
Token no
Token ID645
Feature activation-3.36e-02

Pos logit contributions
izes+0.020
 phrases+0.019
mber+0.018
 listened+0.018
ative+0.017
Neg logit contributions
 spe-0.024
 bisc-0.022
etchup-0.022
 snowball-0.021
 pitcher-0.021
Token and
Token ID290
Feature activation+2.68e-02

Pos logit contributions
 phrases+0.093
 languages+0.089
 professor+0.086
 jokes+0.085
 stories+0.085
Neg logit contributions
 spe-0.134
etchup-0.126
 pitcher-0.123
 extingu-0.122
 funnel-0.121
Token dedication
Token ID22445
Feature activation+5.32e-02

Pos logit contributions
ative+0.039
itive+0.038
yer+0.038
 professor+0.037
master+0.037
Neg logit contributions
 spe-0.046
 bisc-0.043
etchup-0.043
 pitcher-0.042
 bur-0.042
Token,
Token ID11
Feature activation+2.75e-02

Pos logit contributions
 languages+0.069
 professor+0.069
 phrases+0.069
izes+0.069
lder+0.068
Neg logit contributions
 spe-0.095
etchup-0.090
 extingu-0.088
 sponge-0.087
 bisc-0.086
Token you
Token ID345
Feature activation+3.62e-02

Pos logit contributions
izes+0.035
ative+0.033
 phrases+0.033
 listened+0.032
 languages+0.032
Neg logit contributions
 spe-0.057
etchup-0.055
 bisc-0.054
 pitcher-0.053
 funnel-0.052
Token can
Token ID460
Feature activation+6.05e-03

Pos logit contributions
 languages+0.074
 phrases+0.074
 stories+0.072
izes+0.071
 jokes+0.070
Neg logit contributions
 spe-0.046
 bur-0.037
 funnel-0.035
 pitcher-0.034
etchup-0.034
Token achieve
Token ID4620
Feature activation+1.05e-01

Pos logit contributions
 languages+0.009
 stories+0.008
 phrases+0.008
izes+0.008
 questions+0.008
Neg logit contributions
 spe-0.011
 pitcher-0.011
 bur-0.010
-0.010
 funnel-0.010
Token great
Token ID1049
Feature activation+4.55e-02

Pos logit contributions
 listened+0.167
ative+0.164
 phrases+0.162
lder+0.159
 professor+0.159
Neg logit contributions
 snowball-0.170
 spe-0.168
 bisc-0.151
 pitcher-0.148
etchup-0.147
Token things
Token ID1243
Feature activation+9.30e-02

Pos logit contributions
ibrarian+0.070
izes+0.067
 listened+0.066
isy+0.066
mber+0.065
Neg logit contributions
 spe-0.071
 snowball-0.067
 pic-0.066
 pitcher-0.065
 gaps-0.064
Token!
Token ID0
Feature activation+2.94e-02

Pos logit contributions
 phrases+0.157
mber+0.151
izes+0.147
 stories+0.146
 languages+0.145
Neg logit contributions
 spe-0.140
 extingu-0.129
etchup-0.128
 pitcher-0.123
 bisc-0.118
Token<|endoftext|>
Token ID50256
Feature activation-1.19e-02

Pos logit contributions
izes+0.041
 phrases+0.041
 listened+0.040
 questions+0.040
 stories+0.040
Neg logit contributions
 spe-0.057
 bisc-0.056
 pitcher-0.054
etchup-0.054
 extingu-0.053
TokenOnce
Token ID7454
Feature activation-3.24e-02

Pos logit contributions
 spe+0.021
 bisc+0.020
 pitcher+0.020
 extingu+0.019
etchup+0.019
Neg logit contributions
 listened-0.019
ative-0.019
izes-0.019
 questions-0.019
 phrases-0.018
Token
Token ID220
Feature activation+3.12e-03

Pos logit contributions
 Everyone+0.013
 phrases+0.013
 everyone+0.013
 stories+0.013
 listened+0.012
Neg logit contributions
 spe-0.026
 bisc-0.025
etchup-0.024
 pitcher-0.024
 extingu-0.024
Token
Token ID628
Feature activation+2.16e-02

Pos logit contributions
izes+0.004
 phrases+0.004
 listened+0.004
 questions+0.004
 stories+0.004
Neg logit contributions
 spe-0.006
 bisc-0.006
 pitcher-0.006
etchup-0.006
 extingu-0.006
Token
Token ID198
Feature activation+6.99e-03

Pos logit contributions
 everyone+0.032
 Everyone+0.031
 learn+0.030
 listened+0.030
 kind+0.029
Neg logit contributions
 spe-0.042
 bisc-0.040
etchup-0.039
 pitcher-0.039
 extingu-0.039
TokenEnd
Token ID12915
Feature activation+1.43e-02

Pos logit contributions
 listened+0.008
 learn+0.008
izes+0.008
 kind+0.007
 everyone+0.007
Neg logit contributions
 spe-0.017
 extingu-0.016
 bisc-0.015
 pitcher-0.015
etchup-0.015
Token of
Token ID286
Feature activation+2.59e-02

Pos logit contributions
 phrases+0.021
izes+0.021
 languages+0.020
 stories+0.019
 questions+0.019
Neg logit contributions
 bisc-0.027
 spe-0.027
etchup-0.027
 pitcher-0.026
 snowball-0.025
Token Story
Token ID8362
Feature activation+1.05e-01

Pos logit contributions
izes+0.034
 phrases+0.034
 listened+0.033
 stories+0.033
 questions+0.033
Neg logit contributions
 spe-0.053
 bisc-0.052
etchup-0.051
 pitcher-0.050
 extingu-0.049
Token.
Token ID13
Feature activation+2.48e-02

Pos logit contributions
 phrases+0.153
 questions+0.135
 listen+0.131
 jokes+0.127
itive+0.125
Neg logit contributions
umbs-0.190
 funnel-0.185
 pitcher-0.184
etchup-0.182
 spe-0.177
Token<|endoftext|>
Token ID50256
Feature activation-1.44e-02

Pos logit contributions
 Everyone+0.029
 everyone+0.028
 listened+0.028
 phrases+0.028
ative+0.028
Neg logit contributions
 spe-0.054
 bisc-0.052
 pitcher-0.052
etchup-0.051
 funnel-0.051
TokenL
Token ID43
Feature activation+1.90e-02

Pos logit contributions
 spe+0.029
 extingu+0.028
 bisc+0.028
 pitcher+0.027
etchup+0.026
Neg logit contributions
 Everyone-0.019
 everyone-0.019
 listened-0.019
ative-0.019
 questions-0.018
Tokenilly
Token ID6548
Feature activation+3.78e-02

Pos logit contributions
izes+0.023
 phrases+0.022
 stories+0.022
 questions+0.022
 listened+0.022
Neg logit contributions
 spe-0.042
 bisc-0.041
etchup-0.040
 pitcher-0.040
 extingu-0.039
Token had
Token ID550
Feature activation-2.11e-02

Pos logit contributions
 phrases+0.057
izes+0.057
ative+0.056
 questions+0.055
 languages+0.054
Neg logit contributions
 spe-0.071
 bisc-0.070
 bur-0.068
etchup-0.068
 pitcher-0.067
Token got
Token ID1392
Feature activation-5.02e-03

Pos logit contributions
ative+0.008
 languages+0.008
 phrases+0.008
 questions+0.008
izes+0.008
Neg logit contributions
 bur-0.009
 spe-0.009
 bisc-0.009
 snowball-0.008
 pitcher-0.008
Token cle
Token ID1190
Feature activation+6.99e-02

Pos logit contributions
 spe+0.009
 bisc+0.009
etchup+0.008
 pitcher+0.008
 snowball+0.008
Neg logit contributions
 phrases-0.008
izes-0.008
 languages-0.008
ative-0.008
 stories-0.007
Tokenvere
Token ID4119
Feature activation+3.21e-02

Pos logit contributions
 stories+0.106
 phrases+0.104
 languages+0.104
 listened+0.101
 jokes+0.098
Neg logit contributions
 pitcher-0.123
umbs-0.123
 spe-0.122
 funnel-0.121
 bisc-0.121
Tokenr
Token ID81
Feature activation+5.65e-02

Pos logit contributions
izes+0.042
 phrases+0.041
 stories+0.040
 listened+0.040
ative+0.039
Neg logit contributions
 spe-0.065
 bisc-0.064
etchup-0.063
 extingu-0.062
 pitcher-0.062
Token and
Token ID290
Feature activation+1.85e-02

Pos logit contributions
 phrases+0.080
izes+0.080
 listened+0.076
 questions+0.075
 professor+0.074
Neg logit contributions
 spe-0.108
umbs-0.103
 bisc-0.100
 pitcher-0.099
 extingu-0.099
Token cle
Token ID1190
Feature activation+1.05e-01

Pos logit contributions
ative+0.025
 languages+0.023
 phrases+0.023
 Everyone+0.022
 questions+0.022
Neg logit contributions
 spe-0.039
 bisc-0.037
 bur-0.035
 pitcher-0.035
etchup-0.035
Tokenvere
Token ID4119
Feature activation+4.19e-02

Pos logit contributions
 stories+0.181
 jokes+0.171
 languages+0.169
 phrases+0.168
 listens+0.167
Neg logit contributions
umbs-0.157
 spe-0.153
 funnel-0.152
 pitcher-0.149
 afloat-0.144
Tokenr
Token ID81
Feature activation+6.79e-02

Pos logit contributions
izes+0.063
 stories+0.062
 phrases+0.061
 listened+0.059
ative+0.059
Neg logit contributions
 spe-0.076
etchup-0.073
 extingu-0.072
 bisc-0.071
 snowball-0.070
Token.
Token ID13
Feature activation+1.70e-02

Pos logit contributions
 phrases+0.099
 listened+0.097
izes+0.097
 professor+0.096
Deal+0.095
Neg logit contributions
 spe-0.124
umbs-0.119
 extingu-0.111
 pitcher-0.110
 funnel-0.108
Token Eventually
Token ID16178
Feature activation+2.57e-02

Pos logit contributions
izes+0.029
 listened+0.028
 stories+0.028
 phrases+0.027
 questions+0.026
Neg logit contributions
 spe-0.028
etchup-0.028
 pitcher-0.028
 bisc-0.028
 funnel-0.027
Token the
Token ID262
Feature activation+3.00e-02

Pos logit contributions
 languages+0.039
izes+0.037
 phrases+0.037
 cultures+0.035
ative+0.034
Neg logit contributions
 spe-0.052
 funnel-0.046
 bisc-0.045
 pitcher-0.045
etchup-0.044
Token girl
Token ID2576
Feature activation+2.98e-02

Pos logit contributions
 bisc+0.052
 spe+0.052
 pitcher+0.051
etchup+0.050
 snowball+0.049
Neg logit contributions
izes-0.037
 phrases-0.035
ative-0.034
 listened-0.034
 stories-0.034
Token learned
Token ID4499
Feature activation+5.74e-02

Pos logit contributions
ative+0.046
 phrases+0.043
 questions+0.042
 languages+0.042
izes+0.042
Neg logit contributions
 spe-0.055
 bur-0.053
 bisc-0.051
etchup-0.051
 pitcher-0.051
Token that
Token ID326
Feature activation+5.96e-04

Pos logit contributions
izes+0.107
 leader+0.096
itive+0.094
lder+0.094
ative+0.093
Neg logit contributions
 umb-0.077
 spe-0.073
 bur-0.072
 bisc-0.071
 snowball-0.070
Token she
Token ID673
Feature activation-4.78e-03

Pos logit contributions
izes+0.001
ative+0.001
 phrases+0.001
 professor+0.001
yer+0.001
Neg logit contributions
 spe-0.001
 pitcher-0.001
etchup-0.001
 bisc-0.001
 bur-0.001
Token could
Token ID714
Feature activation-6.35e-03

Pos logit contributions
 spe+0.007
etchup+0.007
 bisc+0.007
 bur+0.006
 snowball+0.006
Neg logit contributions
 phrases-0.008
 stories-0.008
 questions-0.008
 languages-0.008
ative-0.008
Token accomplish
Token ID9989
Feature activation+1.04e-01

Pos logit contributions
 spe+0.012
 pitcher+0.011
 bisc+0.011
 snowball+0.011
 bur+0.011
Neg logit contributions
izes-0.009
 phrases-0.009
 languages-0.009
 stories-0.009
 questions-0.009
Token anything
Token ID1997
Feature activation+6.77e-02

Pos logit contributions
ative+0.177
 phrases+0.174
izes+0.172
bye+0.171
lder+0.170
Neg logit contributions
 spe-0.161
 snowball-0.160
 bisc-0.136
etchup-0.134
 pic-0.133
Token with
Token ID351
Feature activation+8.23e-03

Pos logit contributions
 phrases+0.109
izes+0.100
 nodded+0.097
ative+0.097
 listened+0.096
Neg logit contributions
 spe-0.121
etchup-0.102
 pitcher-0.096
 bisc-0.095
 snowball-0.095
Token the
Token ID262
Feature activation-7.02e-03

Pos logit contributions
izes+0.011
ative+0.010
 phrases+0.010
 professor+0.010
 listened+0.010
Neg logit contributions
 spe-0.016
etchup-0.016
 bisc-0.015
 bur-0.015
 snowball-0.015
Token help
Token ID1037
Feature activation+8.28e-03

Pos logit contributions
 bisc+0.012
 pitcher+0.012
 spe+0.012
 snowball+0.011
etchup+0.011
Neg logit contributions
izes-0.012
 phrases-0.012
 listened-0.011
ative-0.011
 stories-0.011
Token of
Token ID286
Feature activation-1.18e-02

Pos logit contributions
izes+0.012
 phrases+0.012
 listened+0.012
 nodded+0.011
 professor+0.011
Neg logit contributions
 bisc-0.015
 spe-0.015
 pitcher-0.014
umbs-0.014
etchup-0.014
Token try
Token ID1949
Feature activation+3.65e-02

Pos logit contributions
 stories+0.052
izes+0.051
 languages+0.051
ative+0.051
 phrases+0.051
Neg logit contributions
 spe-0.049
 bur-0.046
 bisc-0.045
 pitcher-0.044
umbs-0.042
Token hard
Token ID1327
Feature activation+1.62e-03

Pos logit contributions
izes+0.058
ative+0.056
 listened+0.056
 phrases+0.055
 nodded+0.054
Neg logit contributions
 spe-0.056
 bur-0.055
etchup-0.055
 snowball-0.055
 pitcher-0.054
Token,
Token ID11
Feature activation+1.49e-02

Pos logit contributions
izes+0.002
 listened+0.002
 phrases+0.002
ative+0.002
 stories+0.002
Neg logit contributions
etchup-0.003
 spe-0.003
 extingu-0.003
 pitcher-0.003
 bisc-0.003
Token you
Token ID345
Feature activation+3.48e-02

Pos logit contributions
ative+0.021
izes+0.020
 professor+0.019
 phrases+0.019
 languages+0.019
Neg logit contributions
 spe-0.029
 pitcher-0.027
etchup-0.027
 bisc-0.027
 bur-0.026
Token can
Token ID460
Feature activation+3.28e-03

Pos logit contributions
 stories+0.067
 languages+0.066
 phrases+0.066
 songs+0.065
ative+0.064
Neg logit contributions
 spe-0.043
 bur-0.040
 pitcher-0.038
 bisc-0.036
 funnel-0.035
Token achieve
Token ID4620
Feature activation+1.04e-01

Pos logit contributions
 languages+0.005
 stories+0.005
 phrases+0.004
izes+0.004
 questions+0.004
Neg logit contributions
 spe-0.006
 pitcher-0.006
 bur-0.006
 funnel-0.006
 bisc-0.006
Token anything
Token ID1997
Feature activation+8.11e-02

Pos logit contributions
 listened+0.159
ative+0.159
 phrases+0.157
 professor+0.154
izes+0.153
Neg logit contributions
 spe-0.174
 snowball-0.170
 bisc-0.157
etchup-0.156
 pitcher-0.154
Token.
Token ID13
Feature activation+2.49e-02

Pos logit contributions
 phrases+0.124
 listened+0.118
 nodded+0.115
ative+0.115
izes+0.114
Neg logit contributions
 spe-0.144
etchup-0.129
 pitcher-0.125
 bur-0.118
 snowball-0.113
Token<|endoftext|>
Token ID50256
Feature activation-1.52e-02

Pos logit contributions
 phrases+0.034
izes+0.034
 questions+0.033
 listened+0.033
 stories+0.033
Neg logit contributions
 spe-0.049
 bisc-0.048
 pitcher-0.047
etchup-0.046
 extingu-0.046
TokenOnce
Token ID7454
Feature activation-3.68e-02

Pos logit contributions
 spe+0.027
 bisc+0.026
 pitcher+0.025
 extingu+0.025
etchup+0.025
Neg logit contributions
 listened-0.023
ative-0.023
 questions-0.023
 everyone-0.023
izes-0.023
Token upon
Token ID2402
Feature activation+1.90e-02

Pos logit contributions
 spe+0.059
 bisc+0.058
etchup+0.057
 pitcher+0.057
 snowball+0.055
Neg logit contributions
 phrases-0.063
izes-0.063
 listened-0.061
 stories-0.061
ative-0.060
Tokenily
Token ID813
Feature activation-6.84e-03

Pos logit contributions
 phrases+0.016
izes+0.016
 listened+0.015
 stories+0.015
 questions+0.015
Neg logit contributions
 bisc-0.029
 spe-0.029
 pitcher-0.029
etchup-0.029
 snowball-0.028
Token went
Token ID1816
Feature activation-8.14e-03

Pos logit contributions
 bur+0.012
 bisc+0.012
 spe+0.012
 pitcher+0.011
 snowball+0.011
Neg logit contributions
ative-0.011
izes-0.011
 questions-0.010
 phrases-0.010
 languages-0.010
Token to
Token ID284
Feature activation+1.68e-02

Pos logit contributions
 bisc+0.013
etchup+0.012
 pitcher+0.012
 extingu+0.012
 snowball+0.012
Neg logit contributions
 phrases-0.014
 languages-0.014
izes-0.014
ative-0.013
 stories-0.013
Token the
Token ID262
Feature activation+3.68e-02

Pos logit contributions
izes+0.023
ative+0.022
 phrases+0.021
 languages+0.020
 listened+0.020
Neg logit contributions
 bisc-0.034
 spe-0.034
 extingu-0.033
 pitcher-0.033
etchup-0.033
Token piano
Token ID19132
Feature activation+5.86e-02

Pos logit contributions
 phrases+0.050
izes+0.049
 languages+0.049
ative+0.048
 questions+0.048
Neg logit contributions
 pitcher-0.070
 bisc-0.068
 funnel-0.066
 spe-0.065
etchup-0.063
Token lessons
Token ID11658
Feature activation+1.04e-01

Pos logit contributions
izes+0.108
 phrases+0.100
ative+0.098
 languages+0.097
 knows+0.095
Neg logit contributions
 pitcher-0.087
 bisc-0.083
 bur-0.082
 funnel-0.082
 extingu-0.082
Token every
Token ID790
Feature activation-1.28e-04

Pos logit contributions
izes+0.173
 phrases+0.164
ative+0.161
 elders+0.159
 advisor+0.159
Neg logit contributions
 spe-0.167
etchup-0.153
 funnel-0.148
 pitcher-0.147
 bisc-0.141
Token week
Token ID1285
Feature activation+1.07e-02

Pos logit contributions
 spe+0.000
 bisc+0.000
 snowball+0.000
 pitcher+0.000
etchup+0.000
Neg logit contributions
izes-0.000
 phrases-0.000
ative-0.000
 stories-0.000
 languages-0.000
Token and
Token ID290
Feature activation+2.14e-02

Pos logit contributions
izes+0.016
ative+0.014
 phrases+0.013
 listened+0.013
 languages+0.013
Neg logit contributions
 spe-0.022
 bisc-0.021
 pitcher-0.020
etchup-0.020
 funnel-0.019
Token practiced
Token ID19893
Feature activation+8.51e-02

Pos logit contributions
ative+0.037
izes+0.033
 languages+0.033
itive+0.031
ently+0.031
Neg logit contributions
 spe-0.036
etchup-0.032
 bur-0.032
 bisc-0.032
 pitcher-0.032
Token very
Token ID845
Feature activation+3.94e-02

Pos logit contributions
izes+0.130
mber+0.125
ative+0.124
bye+0.124
 elders+0.121
Neg logit contributions
 spe-0.156
 snowball-0.145
 funnel-0.136
 extingu-0.135
etchup-0.134
Token.
Token ID13
Feature activation+1.15e-02

Pos logit contributions
izes+0.045
 phrases+0.043
 languages+0.040
 listened+0.037
 questions+0.037
Neg logit contributions
 bisc-0.051
 pitcher-0.051
 spe-0.050
 extingu-0.049
etchup-0.049
Token Everyone
Token ID11075
Feature activation+5.71e-02

Pos logit contributions
izes+0.016
 phrases+0.016
 listened+0.015
 stories+0.015
 questions+0.015
Neg logit contributions
 spe-0.022
 bisc-0.022
etchup-0.022
 pitcher-0.022
 extingu-0.021
Token was
Token ID373
Feature activation+2.26e-02

Pos logit contributions
 phrases+0.104
isdom+0.101
 questions+0.097
 languages+0.097
izes+0.096
Neg logit contributions
etchup-0.074
 spe-0.073
 soaked-0.067
 pitcher-0.066
 spilled-0.063
Token able
Token ID1498
Feature activation-3.43e-02

Pos logit contributions
 phrases+0.031
izes+0.031
 languages+0.030
 questions+0.029
 stories+0.028
Neg logit contributions
 spe-0.046
 pitcher-0.044
etchup-0.043
 bisc-0.043
 extingu-0.041
Token to
Token ID284
Feature activation+2.25e-02

Pos logit contributions
 spe+0.062
 bisc+0.060
etchup+0.059
 pitcher+0.059
 snowball+0.056
Neg logit contributions
izes-0.055
 phrases-0.052
ative-0.051
 languages-0.051
 questions-0.049
Token accomplish
Token ID9989
Feature activation+1.04e-01

Pos logit contributions
 languages+0.037
izes+0.036
 phrases+0.035
 stories+0.034
 questions+0.034
Neg logit contributions
 pitcher-0.037
 extingu-0.037
 spe-0.037
 bisc-0.036
 sponge-0.035
Token their
Token ID511
Feature activation+5.09e-02

Pos logit contributions
bye+0.159
ative+0.155
 listened+0.153
 phrases+0.153
izes+0.152
Neg logit contributions
 spe-0.172
 snowball-0.172
 pitcher-0.159
 bisc-0.158
etchup-0.157
Token tasks
Token ID8861
Feature activation+7.10e-02

Pos logit contributions
izes+0.072
 nodded+0.069
mber+0.068
ative+0.067
His+0.067
Neg logit contributions
 pitcher-0.088
 spe-0.085
 bur-0.079
 umb-0.079
 funnel-0.077
Token and
Token ID290
Feature activation+3.04e-02

Pos logit contributions
 phrases+0.109
 languages+0.105
 jokes+0.103
izes+0.102
 stories+0.100
Neg logit contributions
 spe-0.120
 pitcher-0.120
etchup-0.112
 extingu-0.111
 bisc-0.111
Token the
Token ID262
Feature activation+2.34e-02

Pos logit contributions
ative+0.042
izes+0.041
itive+0.041
 languages+0.039
 phrases+0.038
Neg logit contributions
 spe-0.058
 pitcher-0.052
 bisc-0.050
etchup-0.050
 bur-0.049
Token event
Token ID1785
Feature activation+4.71e-02

Pos logit contributions
izes+0.032
 phrases+0.029
ative+0.028
isdom+0.028
ently+0.027
Neg logit contributions
 pitcher-0.048
 bisc-0.048
 spe-0.046
 funnel-0.042
etchup-0.042
Token keep
Token ID1394
Feature activation+2.30e-02

Pos logit contributions
izes+0.051
 languages+0.050
 phrases+0.050
 stories+0.050
ative+0.049
Neg logit contributions
 spe-0.050
 bisc-0.048
 pitcher-0.047
etchup-0.046
umbs-0.045
Token trying
Token ID2111
Feature activation+3.74e-02

Pos logit contributions
 phrases+0.033
ative+0.033
izes+0.033
 languages+0.032
 listened+0.031
Neg logit contributions
 spe-0.042
etchup-0.042
 bisc-0.041
 snowball-0.040
 pitcher-0.040
Token,
Token ID11
Feature activation+1.00e-02

Pos logit contributions
 phrases+0.050
izes+0.050
 listened+0.050
ative+0.050
 professor+0.049
Neg logit contributions
 spe-0.068
etchup-0.067
 snowball-0.067
 bisc-0.067
 pitcher-0.067
Token you
Token ID345
Feature activation+2.78e-02

Pos logit contributions
ative+0.015
izes+0.014
 professor+0.014
 phrases+0.014
 languages+0.014
Neg logit contributions
 spe-0.017
 pitcher-0.017
etchup-0.016
 bisc-0.016
 bur-0.016
Token can
Token ID460
Feature activation-4.82e-03

Pos logit contributions
 phrases+0.056
 languages+0.056
 stories+0.055
ya+0.054
mber+0.054
Neg logit contributions
 spe-0.033
 pitcher-0.029
 bisc-0.029
etchup-0.028
 bur-0.028
Token achieve
Token ID4620
Feature activation+1.03e-01

Pos logit contributions
 spe+0.009
 pitcher+0.008
 bisc+0.008
 bur+0.008
 funnel+0.008
Neg logit contributions
 languages-0.007
 stories-0.007
 phrases-0.007
izes-0.007
 questions-0.007
Token anything
Token ID1997
Feature activation+7.99e-02

Pos logit contributions
 listened+0.156
ative+0.156
 phrases+0.154
 professor+0.152
izes+0.147
Neg logit contributions
 snowball-0.173
 spe-0.172
 bisc-0.157
etchup-0.156
 pitcher-0.155
Token.
Token ID13
Feature activation+2.06e-02

Pos logit contributions
 phrases+0.124
ative+0.116
 listened+0.115
 professor+0.113
 nodded+0.112
Neg logit contributions
 spe-0.139
etchup-0.123
 pitcher-0.118
 snowball-0.112
 bur-0.111
Token<|endoftext|>
Token ID50256
Feature activation-1.38e-02

Pos logit contributions
izes+0.032
 phrases+0.031
 stories+0.030
 listened+0.030
 questions+0.030
Neg logit contributions
 spe-0.037
 bisc-0.037
etchup-0.036
 pitcher-0.036
 extingu-0.035
TokenTom
Token ID13787
Feature activation+1.24e-02

Pos logit contributions
 spe+0.022
 bisc+0.021
 pitcher+0.021
etchup+0.020
 extingu+0.020
Neg logit contributions
izes-0.024
 phrases-0.024
 listened-0.024
 questions-0.024
 stories-0.024
Tokenmy
Token ID1820
Feature activation+2.46e-02

Pos logit contributions
izes+0.025
 phrases+0.025
ative+0.023
 languages+0.023
 stories+0.023
Neg logit contributions
 bisc-0.017
etchup-0.017
 snowball-0.016
 pitcher-0.016
 spe-0.016
Token examine
Token ID10716
Feature activation+2.23e-02

Pos logit contributions
 spe+0.034
 bisc+0.034
 pitcher+0.032
etchup+0.032
 extingu+0.032
Neg logit contributions
izes-0.025
 phrases-0.024
 stories-0.024
 languages-0.023
 listened-0.023
Token it
Token ID340
Feature activation-1.44e-02

Pos logit contributions
 phrases+0.036
izes+0.035
ative+0.034
 listened+0.034
 professor+0.033
Neg logit contributions
 spe-0.040
etchup-0.038
 bisc-0.038
 snowball-0.037
 funnel-0.036
Token with
Token ID351
Feature activation-9.25e-03

Pos logit contributions
 bisc+0.028
 spe+0.028
 pitcher+0.027
 funnel+0.027
etchup+0.027
Neg logit contributions
 phrases-0.021
 languages-0.020
ative-0.018
 listened-0.018
 stories-0.018
Token the
Token ID262
Feature activation-2.54e-02

Pos logit contributions
 spe+0.020
 bisc+0.019
etchup+0.019
 pitcher+0.018
 extingu+0.018
Neg logit contributions
izes-0.012
 phrases-0.012
ative-0.011
 languages-0.011
 listened-0.011
Token magn
Token ID7842
Feature activation-3.77e-02

Pos logit contributions
 bisc+0.047
 spe+0.045
 pitcher+0.043
etchup+0.043
 funnel+0.043
Neg logit contributions
izes-0.041
 phrases-0.039
ative-0.039
 listened-0.039
 languages-0.038
Tokenifying
Token ID4035
Feature activation-1.40e-01

Pos logit contributions
 bisc+0.031
 spe+0.029
etchup+0.028
 snowball+0.028
 pitcher+0.028
Neg logit contributions
ative-0.094
izes-0.093
 questions-0.091
 phrases-0.091
 languages-0.091
Token glass
Token ID5405
Feature activation-5.48e-02

Pos logit contributions
 spe+0.118
etchup+0.117
 bisc+0.114
 extingu+0.112
 snowball+0.109
Neg logit contributions
 kind-0.325
 stories-0.323
 learn-0.318
 Everyone-0.314
 everyone-0.314
Token?"
Token ID1701
Feature activation+2.03e-02

Pos logit contributions
 bisc+0.122
 spe+0.121
 pitcher+0.120
etchup+0.118
 funnel+0.116
Neg logit contributions
 phrases-0.063
izes-0.060
 listened-0.059
 stories-0.057
 questions-0.057
Token
Token ID198
Feature activation-1.06e-02

Pos logit contributions
 everyone+0.024
 Everyone+0.024
 kind+0.022
 listened+0.022
 questions+0.022
Neg logit contributions
 spe-0.046
 extingu-0.046
 bisc-0.044
 funnel-0.043
umbs-0.043
Token
Token ID198
Feature activation-1.00e-02

Pos logit contributions
 spe+0.025
 bisc+0.024
 extingu+0.024
 pitcher+0.024
etchup+0.023
Neg logit contributions
 Everyone-0.011
 everyone-0.011
 kind-0.011
izes-0.011
 listened-0.010
Token"
Token ID1
Feature activation-2.45e-02

Pos logit contributions
 spe+0.022
 bisc+0.022
etchup+0.022
 pitcher+0.021
 extingu+0.021
Neg logit contributions
izes-0.011
 phrases-0.011
 listened-0.011
 stories-0.011
 questions-0.011
Token the
Token ID262
Feature activation-2.23e-02

Pos logit contributions
etchup+0.015
 extingu+0.015
 bisc+0.015
 snowball+0.015
 pitcher+0.015
Neg logit contributions
izes-0.010
 phrases-0.009
ative-0.009
 professor-0.009
 listened-0.008
Token necklace
Token ID38987
Feature activation-2.82e-02

Pos logit contributions
etchup+0.038
 bisc+0.037
 extingu+0.037
 spe+0.036
 snowball+0.036
Neg logit contributions
 stories-0.037
 kind-0.036
 languages-0.036
 questions-0.036
izes-0.036
Token and
Token ID290
Feature activation-8.07e-03

Pos logit contributions
 bisc+0.057
 spe+0.056
 pitcher+0.054
 extingu+0.054
etchup+0.053
Neg logit contributions
 phrases-0.039
izes-0.038
 questions-0.037
 languages-0.036
 listened-0.035
Token the
Token ID262
Feature activation-9.31e-03

Pos logit contributions
 bisc+0.017
 spe+0.017
etchup+0.016
 pitcher+0.016
 extingu+0.016
Neg logit contributions
izes-0.010
ative-0.010
 phrases-0.009
 languages-0.009
 questions-0.009
Token magn
Token ID7842
Feature activation-3.57e-02

Pos logit contributions
 spe+0.015
 bisc+0.015
etchup+0.015
 pitcher+0.014
 extingu+0.014
Neg logit contributions
izes-0.016
 phrases-0.016
 stories-0.015
 questions-0.015
 listened-0.015
Tokenifying
Token ID4035
Feature activation-1.39e-01

Pos logit contributions
 bisc+0.038
 snowball+0.035
 pitcher+0.034
etchup+0.033
 spe+0.033
Neg logit contributions
ative-0.083
izes-0.081
 phrases-0.079
 languages-0.078
 everyone-0.077
Token glass
Token ID5405
Feature activation-4.82e-02

Pos logit contributions
 bisc+0.128
 snowball+0.126
etchup+0.126
ksh+0.125
 extingu+0.124
Neg logit contributions
 kind-0.313
 stories-0.309
 everyone-0.307
 Everyone-0.302
 learn-0.300
Token and
Token ID290
Feature activation-7.80e-03

Pos logit contributions
 spe+0.112
 bisc+0.111
etchup+0.109
 pitcher+0.108
 extingu+0.106
Neg logit contributions
izes-0.048
 phrases-0.047
 listened-0.046
 stories-0.046
 questions-0.045
Token told
Token ID1297
Feature activation+1.31e-03

Pos logit contributions
 spe+0.016
 bisc+0.016
 pitcher+0.015
etchup+0.015
 extingu+0.015
Neg logit contributions
izes-0.010
ative-0.010
 phrases-0.010
 questions-0.010
 languages-0.009
Token him
Token ID683
Feature activation+2.29e-03

Pos logit contributions
izes+0.002
ative+0.002
 listened+0.002
 phrases+0.002
 professor+0.002
Neg logit contributions
 bisc-0.002
 snowball-0.002
 spe-0.002
 pitcher-0.002
etchup-0.002
Token that
Token ID326
Feature activation-1.69e-02

Pos logit contributions
izes+0.004
ative+0.004
 musician+0.004
ising+0.004
itive+0.003
Neg logit contributions
 spe-0.003
 bisc-0.003
 cups-0.003
etchup-0.003
 pitcher-0.003
Token the
Token ID262
Feature activation-1.76e-02

Pos logit contributions
 spe+0.003
 bisc+0.003
etchup+0.003
 pitcher+0.003
 snowball+0.003
Neg logit contributions
izes-0.001
 phrases-0.001
ative-0.001
 listened-0.001
 languages-0.001
Token necklace
Token ID38987
Feature activation-2.22e-02

Pos logit contributions
ksh+0.031
umbs+0.031
etchup+0.030
 extingu+0.030
 spe+0.030
Neg logit contributions
 kind-0.027
 stories-0.027
 people-0.026
 others-0.025
 questions-0.025
Token and
Token ID290
Feature activation+1.46e-03

Pos logit contributions
 bisc+0.048
 spe+0.048
 pitcher+0.046
etchup+0.046
 extingu+0.046
Neg logit contributions
 phrases-0.026
izes-0.026
 questions-0.025
 listened-0.025
 languages-0.025
Token the
Token ID262
Feature activation-5.57e-03

Pos logit contributions
izes+0.002
 phrases+0.002
 listened+0.002
 stories+0.002
 questions+0.002
Neg logit contributions
 spe-0.003
 bisc-0.003
etchup-0.003
 pitcher-0.003
 extingu-0.003
Token magn
Token ID7842
Feature activation-2.74e-02

Pos logit contributions
 spe+0.009
 bisc+0.009
etchup+0.009
 pitcher+0.009
 extingu+0.009
Neg logit contributions
izes-0.009
 stories-0.009
 phrases-0.009
 questions-0.009
 listened-0.009
Tokenifying
Token ID4035
Feature activation-1.37e-01

Pos logit contributions
 bisc+0.034
 snowball+0.032
 pitcher+0.032
etchup+0.031
 spe+0.029
Neg logit contributions
ative-0.059
izes-0.055
fully-0.054
ly-0.054
 listened-0.053
Token glass
Token ID5405
Feature activation-3.90e-02

Pos logit contributions
etchup+0.137
ksh+0.135
 bisc+0.134
 extingu+0.133
 spe+0.131
Neg logit contributions
 kind-0.290
 stories-0.290
 everyone-0.289
 Everyone-0.287
 learn-0.283
Token and
Token ID290
Feature activation-4.27e-03

Pos logit contributions
 spe+0.094
 bisc+0.093
etchup+0.092
 pitcher+0.090
 snowball+0.090
Neg logit contributions
izes-0.034
 stories-0.034
 listened-0.033
 phrases-0.033
 questions-0.032
Token started
Token ID2067
Feature activation-1.91e-03

Pos logit contributions
 spe+0.008
 bisc+0.008
 pitcher+0.008
etchup+0.008
 extingu+0.008
Neg logit contributions
izes-0.006
 phrases-0.006
ative-0.006
 questions-0.006
 languages-0.006
Token to
Token ID284
Feature activation+7.34e-03

Pos logit contributions
 bisc+0.004
 pitcher+0.004
 spe+0.004
umbs+0.004
 funnel+0.004
Neg logit contributions
 stories-0.002
 questions-0.002
 listened-0.002
 phrases-0.002
 learn-0.002
Token examine
Token ID10716
Feature activation+3.00e-02

Pos logit contributions
izes+0.011
 phrases+0.011
ative+0.011
 questions+0.010
 languages+0.010
Neg logit contributions
 spe-0.014
 bisc-0.014
 pitcher-0.013
etchup-0.013
 extingu-0.013
Token hat
Token ID6877
Feature activation-2.69e-02

Pos logit contributions
 spe+0.028
 bisc+0.028
 pitcher+0.027
etchup+0.027
 snowball+0.026
Neg logit contributions
izes-0.027
 phrases-0.027
 listened-0.026
ative-0.026
 languages-0.026
Token and
Token ID290
Feature activation-1.20e-02

Pos logit contributions
 bisc+0.051
 pitcher+0.048
 spe+0.047
 extingu+0.046
 snowball+0.046
Neg logit contributions
 phrases-0.042
izes-0.040
 languages-0.040
 questions-0.038
 elders-0.037
Token grabbed
Token ID13176
Feature activation+1.42e-02

Pos logit contributions
 bisc+0.024
 spe+0.023
 pitcher+0.022
etchup+0.022
 snowball+0.022
Neg logit contributions
ative-0.017
izes-0.017
 phrases-0.017
 languages-0.016
 questions-0.016
Token her
Token ID607
Feature activation-4.58e-02

Pos logit contributions
izes+0.013
 phrases+0.013
 listened+0.013
 stories+0.012
 questions+0.012
Neg logit contributions
 spe-0.034
 bisc-0.034
etchup-0.033
 pitcher-0.033
 extingu-0.033
Token magn
Token ID7842
Feature activation-3.72e-02

Pos logit contributions
 spe+0.076
 bisc+0.075
etchup+0.072
 pitcher+0.072
 extingu+0.070
Neg logit contributions
izes-0.077
 phrases-0.077
 listened-0.075
 stories-0.075
 questions-0.075
Tokenifying
Token ID4035
Feature activation-1.35e-01

Pos logit contributions
 bisc+0.041
etchup+0.040
 spe+0.040
 snowball+0.039
 pitcher+0.038
Neg logit contributions
ative-0.079
ly-0.075
 kindly-0.074
 confidently-0.073
fully-0.073
Token glass
Token ID5405
Feature activation-5.17e-02

Pos logit contributions
 spe+0.133
etchup+0.130
 snowball+0.122
 bisc+0.121
 extingu+0.120
Neg logit contributions
 kind-0.299
 stories-0.297
 Everyone-0.289
 learn-0.288
 everyone-0.285
Token.
Token ID13
Feature activation+3.77e-03

Pos logit contributions
 spe+0.124
 bisc+0.121
etchup+0.119
 funnel+0.117
 pitcher+0.117
Neg logit contributions
 stories-0.046
 questions-0.044
izes-0.044
ative-0.043
 kind-0.042
Token She
Token ID1375
Feature activation-1.59e-03

Pos logit contributions
 phrases+0.004
 questions+0.004
 Everyone+0.004
izes+0.004
 stories+0.004
Neg logit contributions
 spe-0.009
 bisc-0.009
 pitcher-0.008
etchup-0.008
 extingu-0.008
Token was
Token ID373
Feature activation+2.06e-02

Pos logit contributions
 spe+0.003
 bisc+0.003
etchup+0.003
 pitcher+0.003
 snowball+0.003
Neg logit contributions
izes-0.002
 phrases-0.002
ative-0.002
 questions-0.002
 languages-0.002
Token ready
Token ID3492
Feature activation-2.36e-02

Pos logit contributions
 phrases+0.022
izes+0.021
 languages+0.021
ative+0.020
 questions+0.020
Neg logit contributions
 spe-0.047
 bisc-0.046
 pitcher-0.046
etchup-0.046
 snowball-0.045
Token and
Token ID290
Feature activation+1.83e-02

Pos logit contributions
 bisc+0.013
 spe+0.010
 snowball+0.010
 poop+0.010
 pitcher+0.010
Neg logit contributions
ative-0.016
 phrases-0.016
izes-0.016
 Everybody-0.015
 languages-0.015
Token handed
Token ID10158
Feature activation-1.83e-02

Pos logit contributions
 languages+0.028
izes+0.027
 questions+0.026
 phrases+0.026
 stories+0.026
Neg logit contributions
 bisc-0.034
 spe-0.034
 pitcher-0.034
etchup-0.033
 extingu-0.031
Token him
Token ID683
Feature activation-1.96e-02

Pos logit contributions
 bisc+0.037
 spe+0.036
etchup+0.036
 pitcher+0.035
 snowball+0.035
Neg logit contributions
izes-0.024
 phrases-0.024
 listened-0.023
ative-0.023
 stories-0.023
Token the
Token ID262
Feature activation-3.28e-02

Pos logit contributions
 bisc+0.039
etchup+0.038
 extingu+0.037
 spe+0.037
 pitcher+0.037
Neg logit contributions
izes-0.028
 phrases-0.027
ative-0.025
 languages-0.025
 listened-0.025
Token magn
Token ID7842
Feature activation-3.90e-02

Pos logit contributions
etchup+0.056
 extingu+0.055
 snowball+0.055
umbs+0.055
 bisc+0.054
Neg logit contributions
 stories-0.053
 kind-0.053
 questions-0.052
izes-0.051
 languages-0.051
Tokenifying
Token ID4035
Feature activation-1.34e-01

Pos logit contributions
 bisc+0.044
 snowball+0.042
etchup+0.039
 pitcher+0.039
 spe+0.038
Neg logit contributions
ative-0.088
izes-0.085
 languages-0.083
 phrases-0.083
 stories-0.081
Token glass
Token ID5405
Feature activation-5.46e-02

Pos logit contributions
 bisc+0.112
 snowball+0.108
 spe+0.107
etchup+0.107
 extingu+0.105
Neg logit contributions
 stories-0.317
 kind-0.317
 everyone-0.311
 learn-0.309
 Everyone-0.309
Token.
Token ID13
Feature activation+5.39e-03

Pos logit contributions
 spe+0.129
 bisc+0.128
etchup+0.125
 pitcher+0.124
 snowball+0.123
Neg logit contributions
izes-0.051
 stories-0.050
 phrases-0.050
 questions-0.049
 listened-0.049
Token
Token ID198
Feature activation-1.06e-02

Pos logit contributions
izes+0.008
 phrases+0.008
 listened+0.008
 stories+0.008
 questions+0.008
Neg logit contributions
 spe-0.010
 bisc-0.010
etchup-0.010
 pitcher-0.009
 extingu-0.009
Token
Token ID198
Feature activation-9.08e-03

Pos logit contributions
 spe+0.022
 bisc+0.021
 pitcher+0.020
etchup+0.020
 extingu+0.020
Neg logit contributions
izes-0.014
 phrases-0.014
 questions-0.014
 listened-0.014
 stories-0.014
TokenHe
Token ID1544
Feature activation+1.72e-02

Pos logit contributions
 pitcher+0.013
 bisc+0.013
etchup+0.013
 spe+0.012
 funnel+0.012
Neg logit contributions
 phrases-0.017
 stories-0.017
 languages-0.017
izes-0.017
ative-0.016
Token.
Token ID13
Feature activation-1.26e-02

Pos logit contributions
 bisc+0.014
 spe+0.014
etchup+0.014
 pitcher+0.014
 extingu+0.013
Neg logit contributions
izes-0.009
 phrases-0.008
 listened-0.008
 languages-0.008
 questions-0.008
Token Lily
Token ID20037
Feature activation-1.00e-02

Pos logit contributions
 spe+0.035
 extingu+0.033
 bisc+0.031
etchup+0.031
 umb+0.029
Neg logit contributions
 Everyone-0.012
 everyone-0.011
 questions-0.010
 Everybody-0.010
 nodded-0.008
Token had
Token ID550
Feature activation-2.72e-02

Pos logit contributions
 spe+0.020
 bisc+0.020
etchup+0.019
 pitcher+0.019
 snowball+0.019
Neg logit contributions
izes-0.013
ative-0.013
 phrases-0.013
 questions-0.013
 stories-0.012
Token a
Token ID257
Feature activation-5.64e-03

Pos logit contributions
 spe+0.059
 bisc+0.058
etchup+0.056
 pitcher+0.055
 snowball+0.055
Neg logit contributions
izes-0.035
 phrases-0.032
ative-0.031
 languages-0.030
 questions-0.029
Token magn
Token ID7842
Feature activation-3.54e-02

Pos logit contributions
 spe+0.011
 bisc+0.010
etchup+0.010
 pitcher+0.010
 extingu+0.010
Neg logit contributions
izes-0.008
 phrases-0.008
 listened-0.008
 stories-0.008
 questions-0.008
Tokenifying
Token ID4035
Feature activation-1.34e-01

Pos logit contributions
 bisc+0.030
 pitcher+0.028
etchup+0.027
 spe+0.027
 snowball+0.027
Neg logit contributions
ative-0.090
 phrases-0.086
 questions-0.086
izes-0.085
 languages-0.085
Token glass
Token ID5405
Feature activation-5.14e-02

Pos logit contributions
etchup+0.139
 extingu+0.129
 bisc+0.128
 spe+0.126
ksh+0.126
Neg logit contributions
 kind-0.296
 stories-0.295
 learn-0.289
 everyone-0.287
 Everyone-0.284
Token that
Token ID326
Feature activation-5.69e-02

Pos logit contributions
 spe+0.112
 bisc+0.112
 pitcher+0.109
etchup+0.108
 extingu+0.106
Neg logit contributions
izes-0.060
 phrases-0.059
 listened-0.057
 questions-0.056
 stories-0.056
Token she
Token ID673
Feature activation-4.41e-02

Pos logit contributions
 bisc+0.116
 spe+0.110
 pitcher+0.107
etchup+0.105
 bur+0.103
Neg logit contributions
izes-0.080
 phrases-0.079
 languages-0.077
ative-0.076
 musician-0.071
Token found
Token ID1043
Feature activation-3.06e-02

Pos logit contributions
 spe+0.083
 bisc+0.079
etchup+0.078
 pitcher+0.072
 bur+0.072
Neg logit contributions
 phrases-0.066
izes-0.066
ative-0.064
 languages-0.064
 stories-0.062
Token in
Token ID287
Feature activation-2.32e-02

Pos logit contributions
 spe+0.061
 bisc+0.061
etchup+0.058
 extingu+0.056
 pitcher+0.055
Neg logit contributions
izes-0.043
 phrases-0.041
 listened-0.040
 listen-0.039
 learn-0.038
Token,
Token ID11
Feature activation-1.49e-02

Pos logit contributions
 spe+0.001
 bisc+0.001
 pitcher+0.001
 extingu+0.001
 sponge+0.001
Neg logit contributions
 languages-0.001
 professor-0.001
 phrases-0.001
 cultures-0.001
izes-0.001
Token handing
Token ID22786
Feature activation-9.81e-03

Pos logit contributions
 spe+0.029
 pitcher+0.027
 bisc+0.026
etchup+0.026
 extingu+0.026
Neg logit contributions
 languages-0.024
izes-0.023
 phrases-0.023
 stories-0.022
ative-0.022
Token him
Token ID683
Feature activation-1.73e-02

Pos logit contributions
 spe+0.026
 pitcher+0.025
 bisc+0.024
 funnel+0.024
etchup+0.024
Neg logit contributions
 everyone-0.008
 stories-0.008
 questions-0.007
 listened-0.007
 listen-0.007
Token the
Token ID262
Feature activation-2.87e-02

Pos logit contributions
 bisc+0.037
etchup+0.036
 snowball+0.036
 spe+0.036
 extingu+0.035
Neg logit contributions
izes-0.022
 phrases-0.021
ative-0.019
 languages-0.019
 professor-0.019
Token magn
Token ID7842
Feature activation-3.87e-02

Pos logit contributions
 bisc+0.049
 spe+0.046
 snowball+0.046
 pitcher+0.046
etchup+0.045
Neg logit contributions
izes-0.050
 phrases-0.049
 listened-0.048
ative-0.048
 languages-0.047
Tokenifying
Token ID4035
Feature activation-1.33e-01

Pos logit contributions
 bisc+0.022
 spe+0.022
 pitcher+0.019
etchup+0.019
 snowball+0.018
Neg logit contributions
izes-0.106
ative-0.105
 phrases-0.105
 listened-0.104
 questions-0.104
Token glass
Token ID5405
Feature activation-5.16e-02

Pos logit contributions
 spe+0.109
 bisc+0.106
etchup+0.101
 extingu+0.100
 snowball+0.095
Neg logit contributions
 stories-0.317
 kind-0.313
 everyone-0.308
izes-0.308
 learn-0.306
Token.
Token ID13
Feature activation+2.49e-04

Pos logit contributions
 spe+0.113
 bisc+0.113
 pitcher+0.110
etchup+0.109
 extingu+0.108
Neg logit contributions
izes-0.059
 phrases-0.059
 listened-0.056
 questions-0.056
 stories-0.056
Token She
Token ID1375
Feature activation+2.48e-03

Pos logit contributions
 Everyone+0.000
 everyone+0.000
 phrases+0.000
 questions+0.000
 learn+0.000
Neg logit contributions
 spe-0.001
 bisc-0.001
 extingu-0.001
etchup-0.001
 snowball-0.001
Token looked
Token ID3114
Feature activation+2.07e-02

Pos logit contributions
ative+0.003
 languages+0.003
 questions+0.003
izes+0.003
 phrases+0.003
Neg logit contributions
 spe-0.005
 bisc-0.005
 pitcher-0.005
etchup-0.005
 snowball-0.005
Token around
Token ID1088
Feature activation+9.09e-03

Pos logit contributions
 phrases+0.028
ative+0.026
izes+0.026
 languages+0.025
 listened+0.025
Neg logit contributions
 spe-0.043
 bisc-0.042
etchup-0.041
 snowball-0.041
 pitcher-0.040
Token a
Token ID257
Feature activation-1.71e-03

Pos logit contributions
etchup+0.068
 pitcher+0.068
 bisc+0.068
 extingu+0.067
 spe+0.067
Neg logit contributions
 stories-0.024
 listened-0.023
 questions-0.022
 kind-0.022
 everyone-0.021
Token flashlight
Token ID38371
Feature activation-1.81e-02

Pos logit contributions
umbs+0.003
etchup+0.003
 spe+0.003
 extingu+0.003
 bisc+0.003
Neg logit contributions
 kind-0.002
 stories-0.002
 questions-0.002
izes-0.002
 listened-0.002
Token and
Token ID290
Feature activation-2.44e-02

Pos logit contributions
 spe+0.039
 bisc+0.038
 extingu+0.038
 pitcher+0.037
 funnel+0.037
Neg logit contributions
izes-0.023
 phrases-0.022
 questions-0.021
 languages-0.021
 listened-0.020
Token a
Token ID257
Feature activation-5.61e-03

Pos logit contributions
 spe+0.050
 bisc+0.050
 pitcher+0.047
etchup+0.047
 funnel+0.047
Neg logit contributions
izes-0.033
ative-0.031
 phrases-0.031
 languages-0.029
 questions-0.029
Token magn
Token ID7842
Feature activation-3.16e-02

Pos logit contributions
 spe+0.009
 bisc+0.009
etchup+0.009
 pitcher+0.009
 extingu+0.009
Neg logit contributions
izes-0.009
 phrases-0.009
 stories-0.009
 questions-0.009
 listened-0.009
Tokenifying
Token ID4035
Feature activation-1.31e-01

Pos logit contributions
 bisc+0.029
etchup+0.027
 pitcher+0.027
 spe+0.026
 snowball+0.025
Neg logit contributions
ative-0.079
izes-0.075
 questions-0.075
 phrases-0.074
 languages-0.073
Token glass
Token ID5405
Feature activation-4.82e-02

Pos logit contributions
etchup+0.161
 extingu+0.150
 bisc+0.149
ksh+0.147
 spe+0.139
Neg logit contributions
 kind-0.266
 stories-0.265
 everyone-0.258
 sound-0.256
 learn-0.254
Token to
Token ID284
Feature activation-3.58e-02

Pos logit contributions
 spe+0.104
 bisc+0.104
etchup+0.101
 pitcher+0.101
 extingu+0.099
Neg logit contributions
izes-0.056
 phrases-0.055
 listened-0.054
 stories-0.053
 questions-0.053
Token examine
Token ID10716
Feature activation+8.13e-03

Pos logit contributions
 spe+0.063
 bisc+0.062
 snowball+0.057
 funnel+0.057
 extingu+0.057
Neg logit contributions
izes-0.060
 phrases-0.059
ative-0.059
 languages-0.056
 listened-0.055
Token their
Token ID511
Feature activation-2.42e-02

Pos logit contributions
izes+0.010
 phrases+0.010
 listened+0.010
ative+0.009
 learn+0.009
Neg logit contributions
 spe-0.017
 bisc-0.017
etchup-0.016
 snowball-0.016
 extingu-0.016
Token finds
Token ID7228
Feature activation-1.30e-02

Pos logit contributions
 spe+0.045
 bisc+0.045
etchup+0.043
 pitcher+0.043
 extingu+0.042
Neg logit contributions
izes-0.036
 phrases-0.035
 listened-0.035
ative-0.034
 questions-0.034
Token.
Token ID13
Feature activation-9.89e-03

Pos logit contributions
 spe+0.002
 bisc+0.002
etchup+0.002
 pitcher+0.001
 extingu+0.001
Neg logit contributions
izes-0.001
 phrases-0.001
 listened-0.001
ative-0.001
 stories-0.001
Token They
Token ID1119
Feature activation+9.51e-03

Pos logit contributions
 spe+0.028
etchup+0.027
 bisc+0.027
ifier+0.026
umbs+0.026
Neg logit contributions
 Everyone-0.008
 everyone-0.007
 loved-0.007
 often-0.006
:-0.006
Token had
Token ID550
Feature activation-2.64e-02

Pos logit contributions
izes+0.015
ative+0.014
 phrases+0.014
 stories+0.014
 languages+0.014
Neg logit contributions
 spe-0.018
 bisc-0.017
 pitcher-0.017
etchup-0.016
 funnel-0.016
Token a
Token ID257
Feature activation-7.00e-04

Pos logit contributions
 spe+0.061
 bisc+0.061
etchup+0.059
 pitcher+0.059
 extingu+0.058
Neg logit contributions
izes-0.027
 phrases-0.026
 listened-0.026
 stories-0.026
 questions-0.026
Token magn
Token ID7842
Feature activation-3.33e-02

Pos logit contributions
umbs+0.001
etchup+0.001
 spe+0.001
 extingu+0.001
 bisc+0.001
Neg logit contributions
 kind-0.001
 stories-0.001
 questions-0.001
 listened-0.001
izes-0.001
Tokenifying
Token ID4035
Feature activation-1.30e-01

Pos logit contributions
 bisc+0.029
etchup+0.026
 spe+0.026
 pitcher+0.026
 snowball+0.025
Neg logit contributions
ative-0.084
 questions-0.080
 phrases-0.080
izes-0.079
 languages-0.079
Token glass
Token ID5405
Feature activation-4.96e-02

Pos logit contributions
etchup+0.149
 extingu+0.147
 bisc+0.140
ksh+0.137
 spe+0.136
Neg logit contributions
 kind-0.272
 stories-0.270
 learn-0.264
 everyone-0.263
 Everyone-0.258
Token and
Token ID290
Feature activation-2.24e-02

Pos logit contributions
 spe+0.100
 bisc+0.099
etchup+0.096
 pitcher+0.096
 extingu+0.094
Neg logit contributions
izes-0.065
 phrases-0.064
 listened-0.062
 stories-0.062
 questions-0.062
Token a
Token ID257
Feature activation-7.22e-03

Pos logit contributions
 spe+0.047
 bisc+0.047
 pitcher+0.045
etchup+0.044
 extingu+0.043
Neg logit contributions
izes-0.030
ative-0.028
 phrases-0.028
 questions-0.027
 languages-0.027
Token small
Token ID1402
Feature activation-4.02e-02

Pos logit contributions
 spe+0.012
 bisc+0.011
etchup+0.011
 pitcher+0.011
 extingu+0.011
Neg logit contributions
izes-0.012
 phrases-0.012
 stories-0.012
 questions-0.012
 listened-0.012
Token box
Token ID3091
Feature activation-1.97e-02

Pos logit contributions
 spe+0.060
 bisc+0.059
etchup+0.059
 pitcher+0.057
 extingu+0.057
Neg logit contributions
 phrases-0.071
 stories-0.071
izes-0.070
 questions-0.070
 listened-0.069
Token borrowed
Token ID22546
Feature activation+4.95e-04

Pos logit contributions
izes+0.006
ative+0.006
 languages+0.006
 phrases+0.006
 questions+0.006
Neg logit contributions
 spe-0.009
 bisc-0.009
 snowball-0.009
 bur-0.009
 pitcher-0.008
Token her
Token ID607
Feature activation-2.47e-02

Pos logit contributions
 phrases+0.000
 stories+0.000
 questions+0.000
izes+0.000
 everyone+0.000
Neg logit contributions
 spe-0.001
 bisc-0.001
 pitcher-0.001
etchup-0.001
 funnel-0.001
Token mom
Token ID1995
Feature activation-5.59e-02

Pos logit contributions
 spe+0.043
 bisc+0.042
etchup+0.041
 pitcher+0.041
 extingu+0.040
Neg logit contributions
izes-0.040
 phrases-0.039
 stories-0.039
 questions-0.039
 listened-0.039
Token's
Token ID338
Feature activation-2.62e-02

Pos logit contributions
 bisc+0.109
 spe+0.109
etchup+0.105
 pitcher+0.104
 extingu+0.103
Neg logit contributions
izes-0.079
 phrases-0.077
 listened-0.075
 questions-0.074
 stories-0.074
Token magn
Token ID7842
Feature activation-3.93e-02

Pos logit contributions
 spe+0.044
 bisc+0.043
etchup+0.043
 pitcher+0.043
 funnel+0.041
Neg logit contributions
 phrases-0.043
 stories-0.042
 questions-0.042
izes-0.042
 listened-0.041
Tokenifying
Token ID4035
Feature activation-1.29e-01

Pos logit contributions
 bisc+0.039
 snowball+0.037
etchup+0.036
 pitcher+0.036
 spe+0.035
Neg logit contributions
ative-0.093
 phrases-0.088
 languages-0.086
 listened-0.086
 questions-0.086
Token glass
Token ID5405
Feature activation-5.20e-02

Pos logit contributions
etchup+0.123
 spe+0.123
 bisc+0.120
 snowball+0.120
ksh+0.118
Neg logit contributions
 kind-0.292
 stories-0.285
 everyone-0.278
 learn-0.277
 wise-0.276
Token to
Token ID284
Feature activation-2.75e-02

Pos logit contributions
 spe+0.111
 bisc+0.110
etchup+0.108
 pitcher+0.107
 extingu+0.105
Neg logit contributions
izes-0.061
 phrases-0.061
 stories-0.059
 listened-0.059
 questions-0.059
Token take
Token ID1011
Feature activation-4.23e-03

Pos logit contributions
 spe+0.054
 bisc+0.053
 extingu+0.049
 pitcher+0.049
etchup+0.049
Neg logit contributions
izes-0.041
ative-0.039
 phrases-0.039
 languages-0.038
 listened-0.036
Token a
Token ID257
Feature activation-8.32e-03

Pos logit contributions
 spe+0.010
 bisc+0.010
etchup+0.010
 pitcher+0.010
 extingu+0.010
Neg logit contributions
izes-0.004
 phrases-0.004
 listened-0.004
 stories-0.004
 questions-0.004
Token closer
Token ID5699
Feature activation-2.10e-02

Pos logit contributions
etchup+0.016
ays+0.016
ksh+0.015
icer+0.015
 extingu+0.015
Neg logit contributions
 listen-0.012
 kind-0.012
 learn-0.012
 questions-0.011
 stories-0.011
Token said
Token ID531
Feature activation+9.38e-03

Pos logit contributions
izes+0.018
ative+0.018
 stories+0.016
 languages+0.016
 phrases+0.016
Neg logit contributions
 bisc-0.033
 spe-0.032
 pitcher-0.032
etchup-0.031
 extingu-0.031
Token,
Token ID11
Feature activation-3.77e-03

Pos logit contributions
izes+0.012
 languages+0.012
 phrases+0.012
 stories+0.011
ative+0.011
Neg logit contributions
 spe-0.019
 extingu-0.019
 bisc-0.019
 pitcher-0.018
etchup-0.018
Token holding
Token ID4769
Feature activation+8.94e-03

Pos logit contributions
 spe+0.007
 pitcher+0.006
 bisc+0.006
 extingu+0.006
etchup+0.006
Neg logit contributions
izes-0.006
 languages-0.006
 phrases-0.006
ative-0.006
 questions-0.006
Token the
Token ID262
Feature activation-1.90e-02

Pos logit contributions
izes+0.008
 phrases+0.008
 stories+0.008
 listened+0.008
 questions+0.008
Neg logit contributions
 spe-0.022
 bisc-0.021
 pitcher-0.021
etchup-0.021
 extingu-0.020
Token magn
Token ID7842
Feature activation-3.22e-02

Pos logit contributions
 spe+0.029
 bisc+0.028
etchup+0.028
 pitcher+0.027
 extingu+0.027
Neg logit contributions
 phrases-0.034
izes-0.034
 questions-0.034
 stories-0.034
 listened-0.033
Tokenifying
Token ID4035
Feature activation-1.27e-01

Pos logit contributions
 bisc+0.032
etchup+0.028
 spe+0.028
 pitcher+0.027
 snowball+0.025
Neg logit contributions
ative-0.078
izes-0.072
 phrases-0.070
 professor-0.070
 kindly-0.070
Token glass
Token ID5405
Feature activation-4.70e-02

Pos logit contributions
 spe+0.113
 bisc+0.112
etchup+0.110
 extingu+0.104
ksh+0.103
Neg logit contributions
 kind-0.290
 stories-0.288
 everyone-0.283
 Everyone-0.283
 learn-0.283
Token over
Token ID625
Feature activation-3.77e-02

Pos logit contributions
 bisc+0.109
 spe+0.109
etchup+0.107
 pitcher+0.105
 extingu+0.104
Neg logit contributions
izes-0.045
 stories-0.044
 phrases-0.044
 listened-0.043
 questions-0.043
Token a
Token ID257
Feature activation-3.09e-02

Pos logit contributions
 bisc+0.099
 spe+0.098
 pitcher+0.096
etchup+0.096
 funnel+0.095
Neg logit contributions
 questions-0.026
 listened-0.026
 everyone-0.025
 phrases-0.025
 stories-0.025
Token black
Token ID2042
Feature activation-6.32e-02

Pos logit contributions
 spe+0.050
etchup+0.050
 bisc+0.050
 extingu+0.048
 pitcher+0.048
Neg logit contributions
izes-0.050
 stories-0.049
 phrases-0.049
 kind-0.049
 questions-0.049
Token ant
Token ID1885
Feature activation-2.51e-02

Pos logit contributions
etchup+0.121
 spe+0.121
 bisc+0.116
 pitcher+0.114
 funnel+0.112
Neg logit contributions
 stories-0.086
 phrases-0.086
 listened-0.085
 questions-0.085
ative-0.085
Token She
Token ID1375
Feature activation-1.66e-03

Pos logit contributions
 spe+0.009
 bisc+0.008
 extingu+0.008
etchup+0.008
 funnel+0.008
Neg logit contributions
 Everyone-0.004
 questions-0.004
 everyone-0.004
 stories-0.004
 learn-0.004
Token had
Token ID550
Feature activation-2.41e-02

Pos logit contributions
 spe+0.003
 bisc+0.003
etchup+0.003
 pitcher+0.003
 snowball+0.003
Neg logit contributions
ative-0.002
izes-0.002
 phrases-0.002
 languages-0.002
 stories-0.002
Token a
Token ID257
Feature activation+3.30e-03

Pos logit contributions
 spe+0.055
 bisc+0.055
etchup+0.053
 pitcher+0.053
 extingu+0.052
Neg logit contributions
izes-0.025
 phrases-0.025
 listened-0.024
 stories-0.024
 questions-0.024
Token big
Token ID1263
Feature activation-3.55e-02

Pos logit contributions
 kind+0.004
 stories+0.004
 questions+0.004
 listened+0.004
 phrases+0.004
Neg logit contributions
umbs-0.007
etchup-0.007
 bisc-0.007
 spe-0.007
 extingu-0.007
Token magn
Token ID7842
Feature activation-2.90e-02

Pos logit contributions
 spe+0.070
 bisc+0.070
etchup+0.068
 pitcher+0.067
 extingu+0.067
Neg logit contributions
izes-0.047
 phrases-0.047
 stories-0.047
 questions-0.046
 listened-0.046
Tokenifying
Token ID4035
Feature activation-1.26e-01

Pos logit contributions
 bisc+0.023
 spe+0.021
 pitcher+0.021
etchup+0.021
 snowball+0.020
Neg logit contributions
ative-0.074
 phrases-0.073
 questions-0.072
izes-0.072
 languages-0.071
Token glass
Token ID5405
Feature activation-4.45e-02

Pos logit contributions
etchup+0.132
 bisc+0.128
 extingu+0.125
 spe+0.122
 snowball+0.117
Neg logit contributions
 kind-0.276
 stories-0.276
 everyone-0.269
 learn-0.268
 Everyone-0.264
Token that
Token ID326
Feature activation-5.49e-02

Pos logit contributions
 spe+0.096
 bisc+0.095
 pitcher+0.093
etchup+0.093
 extingu+0.091
Neg logit contributions
izes-0.053
 phrases-0.052
 listened-0.051
 questions-0.050
 stories-0.050
Token made
Token ID925
Feature activation-1.86e-02

Pos logit contributions
 spe+0.106
 bisc+0.102
etchup+0.097
 pitcher+0.096
 funnel+0.092
Neg logit contributions
izes-0.083
 phrases-0.082
ative-0.079
 languages-0.079
 professor-0.075
Token things
Token ID1243
Feature activation+4.78e-02

Pos logit contributions
 spe+0.040
 bisc+0.039
 pitcher+0.037
etchup+0.037
 extingu+0.037
Neg logit contributions
izes-0.024
 listened-0.021
 phrases-0.021
ative-0.021
 professor-0.021
Token look
Token ID804
Feature activation-1.68e-02

Pos logit contributions
izes+0.072
 phrases+0.070
ative+0.068
 professor+0.067
 languages+0.067
Neg logit contributions
 spe-0.090
 bisc-0.085
 extingu-0.084
 pitcher-0.083
 funnel-0.082
Token
Token ID198
Feature activation-1.10e-02

Pos logit contributions
 spe+0.018
 bisc+0.018
 snowball+0.018
etchup+0.018
 pitcher+0.017
Neg logit contributions
izes-0.019
 phrases-0.019
 stories-0.018
ative-0.018
 listened-0.018
TokenHe
Token ID1544
Feature activation+2.07e-02

Pos logit contributions
 snowball+0.017
 bisc+0.016
 spe+0.015
etchup+0.015
 extingu+0.015
Neg logit contributions
 stories-0.020
izes-0.020
 questions-0.019
 phrases-0.019
 jokes-0.019
Token pointed
Token ID6235
Feature activation-3.98e-03

Pos logit contributions
ative+0.034
 languages+0.034
izes+0.033
 phrases+0.031
 stories+0.031
Neg logit contributions
 spe-0.036
 bisc-0.036
 pitcher-0.035
 snowball-0.033
 extingu-0.033
Token his
Token ID465
Feature activation-3.29e-02

Pos logit contributions
 spe+0.007
 bisc+0.007
 snowball+0.007
 extingu+0.007
etchup+0.007
Neg logit contributions
izes-0.006
 phrases-0.006
 listened-0.006
ative-0.005
 languages-0.005
Token magn
Token ID7842
Feature activation-3.49e-02

Pos logit contributions
 bisc+0.053
 spe+0.053
etchup+0.051
 pitcher+0.051
 extingu+0.049
Neg logit contributions
izes-0.058
 phrases-0.057
 listened-0.056
ative-0.055
 languages-0.054
Tokenifying
Token ID4035
Feature activation-1.25e-01

Pos logit contributions
 bisc+0.029
 spe+0.027
etchup+0.026
 pitcher+0.026
 snowball+0.025
Neg logit contributions
ative-0.087
izes-0.085
 phrases-0.085
 questions-0.083
 listened-0.083
Token glass
Token ID5405
Feature activation-4.87e-02

Pos logit contributions
 spe+0.117
 bisc+0.112
etchup+0.109
 snowball+0.105
 extingu+0.102
Neg logit contributions
 stories-0.290
izes-0.286
 kind-0.286
 questions-0.284
 phrases-0.283
Token at
Token ID379
Feature activation-4.14e-02

Pos logit contributions
 spe+0.095
 bisc+0.094
 pitcher+0.092
etchup+0.092
 extingu+0.090
Neg logit contributions
izes-0.068
 phrases-0.067
 listened-0.065
 stories-0.064
 questions-0.064
Token different
Token ID1180
Feature activation+3.25e-02

Pos logit contributions
 spe+0.090
 bisc+0.087
 extingu+0.086
 snowball+0.086
etchup+0.086
Neg logit contributions
izes-0.052
 phrases-0.048
 listened-0.045
ative-0.045
 languages-0.043
Token classmates
Token ID28999
Feature activation+5.58e-02

Pos logit contributions
izes+0.053
 listened+0.049
ative+0.049
 listen+0.045
 professor+0.044
Neg logit contributions
 spe-0.059
 bisc-0.058
 extingu-0.055
 pitcher-0.055
etchup-0.054
Token and
Token ID290
Feature activation+3.07e-03

Pos logit contributions
izes+0.086
ative+0.079
 phrases+0.077
mber+0.075
 listened+0.074
Neg logit contributions
 bisc-0.098
etchup-0.096
 spe-0.096
 extingu-0.093
 pitcher-0.093
Token cried
Token ID16896
Feature activation-1.99e-02

Pos logit contributions
ative+0.021
izes+0.021
 stories+0.020
 phrases+0.019
 languages+0.019
Neg logit contributions
 spe-0.029
 bisc-0.028
 pitcher-0.028
etchup-0.028
 funnel-0.028
Token,
Token ID11
Feature activation-4.63e-03

Pos logit contributions
 spe+0.052
 bisc+0.051
etchup+0.050
 pitcher+0.050
 funnel+0.049
Neg logit contributions
 questions-0.015
 phrases-0.014
 listened-0.014
izes-0.014
 everyone-0.014
Token dropping
Token ID12047
Feature activation+2.32e-03

Pos logit contributions
 spe+0.008
 snowball+0.007
 extingu+0.007
 bisc+0.007
 pitcher+0.007
Neg logit contributions
izes-0.008
 phrases-0.008
 languages-0.008
ative-0.008
 stories-0.007
Token the
Token ID262
Feature activation-2.57e-02

Pos logit contributions
izes+0.002
 phrases+0.002
 listened+0.002
 stories+0.002
 questions+0.002
Neg logit contributions
 spe-0.006
 bisc-0.006
etchup-0.005
 pitcher-0.005
 extingu-0.005
Token magn
Token ID7842
Feature activation-3.60e-02

Pos logit contributions
 spe+0.040
 bisc+0.038
etchup+0.038
 pitcher+0.037
 extingu+0.036
Neg logit contributions
 questions-0.046
 phrases-0.045
 stories-0.045
izes-0.045
 listened-0.045
Tokenifying
Token ID4035
Feature activation-1.25e-01

Pos logit contributions
 bisc+0.038
 spe+0.033
etchup+0.033
 pitcher+0.032
 snowball+0.031
Neg logit contributions
ative-0.086
izes-0.079
ly-0.078
fully-0.077
 professor-0.077
Token glass
Token ID5405
Feature activation-5.03e-02

Pos logit contributions
 bisc+0.122
 spe+0.121
etchup+0.118
ksh+0.115
 snowball+0.111
Neg logit contributions
 kind-0.274
 Everyone-0.271
 stories-0.271
 everyone-0.270
ative-0.266
Token.
Token ID13
Feature activation-1.30e-02

Pos logit contributions
 spe+0.110
 bisc+0.109
etchup+0.106
 pitcher+0.106
 extingu+0.104
Neg logit contributions
izes-0.058
 phrases-0.056
 listened-0.055
 stories-0.055
 questions-0.055
Token
Token ID198
Feature activation-1.98e-02

Pos logit contributions
 spe+0.028
 bisc+0.028
etchup+0.027
 pitcher+0.027
 extingu+0.027
Neg logit contributions
izes-0.015
 phrases-0.015
 listened-0.015
 questions-0.015
 stories-0.015
Token
Token ID198
Feature activation-1.59e-02

Pos logit contributions
 spe+0.037
 bisc+0.037
etchup+0.036
 pitcher+0.036
 extingu+0.035
Neg logit contributions
izes-0.029
 phrases-0.029
 listened-0.028
 stories-0.028
 questions-0.028
Token"
Token ID1
Feature activation-3.41e-02

Pos logit contributions
 bisc+0.032
 pitcher+0.032
etchup+0.032
 funnel+0.030
 snowball+0.030
Neg logit contributions
 stories-0.021
 phrases-0.021
 questions-0.021
 languages-0.021
ative-0.020
Token it
Token ID340
Feature activation+6.23e-03

Pos logit contributions
izes+0.003
 phrases+0.003
 listened+0.003
 stories+0.003
 questions+0.003
Neg logit contributions
 spe-0.010
 bisc-0.009
etchup-0.009
 pitcher-0.009
 extingu-0.009
Token she
Token ID673
Feature activation-6.11e-03

Pos logit contributions
 languages+0.008
 phrases+0.008
ative+0.007
 questions+0.007
 jokes+0.007
Neg logit contributions
 bisc-0.013
 spe-0.013
 bur-0.013
 pitcher-0.013
etchup-0.012
Token had
Token ID550
Feature activation-3.80e-02

Pos logit contributions
 spe+0.011
 bisc+0.011
etchup+0.011
 pitcher+0.011
 funnel+0.010
Neg logit contributions
 phrases-0.009
izes-0.009
ative-0.009
 questions-0.009
 languages-0.009
Token a
Token ID257
Feature activation-6.93e-03

Pos logit contributions
 spe+0.081
 bisc+0.080
 pitcher+0.076
etchup+0.076
 snowball+0.075
Neg logit contributions
izes-0.048
 phrases-0.046
ative-0.045
 questions-0.045
 languages-0.044
Token magn
Token ID7842
Feature activation-2.77e-02

Pos logit contributions
etchup+0.014
 spe+0.014
 bisc+0.014
 extingu+0.014
 pitcher+0.014
Neg logit contributions
izes-0.009
 stories-0.008
 phrases-0.008
 kind-0.008
 listened-0.008
Tokenifying
Token ID4035
Feature activation-1.23e-01

Pos logit contributions
 bisc+0.021
 spe+0.019
etchup+0.019
 pitcher+0.019
 snowball+0.018
Neg logit contributions
ative-0.072
izes-0.071
 phrases-0.070
 listened-0.070
 questions-0.069
Token glass
Token ID5405
Feature activation-4.79e-02

Pos logit contributions
etchup+0.135
 bisc+0.125
 extingu+0.125
 spe+0.121
ksh+0.118
Neg logit contributions
 kind-0.266
 stories-0.260
 everyone-0.256
 learn-0.255
 Everyone-0.254
Token,
Token ID11
Feature activation-1.02e-02

Pos logit contributions
 spe+0.104
 bisc+0.103
 pitcher+0.100
etchup+0.100
 extingu+0.098
Neg logit contributions
izes-0.057
 phrases-0.056
 listened-0.054
 questions-0.054
 stories-0.053
Token a
Token ID257
Feature activation-1.33e-02

Pos logit contributions
 spe+0.022
 bisc+0.021
 pitcher+0.020
etchup+0.020
 snowball+0.020
Neg logit contributions
izes-0.013
 phrases-0.013
ative-0.012
 languages-0.012
 questions-0.012
Token tiny
Token ID7009
Feature activation-3.09e-02

Pos logit contributions
 spe+0.024
 bisc+0.024
 pitcher+0.023
etchup+0.023
 snowball+0.023
Neg logit contributions
izes-0.020
 phrases-0.020
 listened-0.020
 stories-0.019
 questions-0.019
Token torch
Token ID28034
Feature activation-1.11e-03

Pos logit contributions
etchup+0.053
 bisc+0.053
 spe+0.051
umbs+0.051
 pitcher+0.051
Neg logit contributions
 kind-0.047
 stories-0.047
 phrases-0.047
 questions-0.046
 listened-0.045
Token flashlight
Token ID38371
Feature activation-2.68e-02

Pos logit contributions
 spe+0.020
 bisc+0.020
 pitcher+0.020
etchup+0.019
 snowball+0.019
Neg logit contributions
izes-0.021
 phrases-0.020
 listened-0.020
 stories-0.020
 questions-0.020
Token,
Token ID11
Feature activation-2.95e-02

Pos logit contributions
 spe+0.056
 bisc+0.054
 extingu+0.053
 funnel+0.052
 pitcher+0.052
Neg logit contributions
 phrases-0.035
izes-0.035
 questions-0.034
 languages-0.033
ative-0.032
Token and
Token ID290
Feature activation-9.41e-03

Pos logit contributions
 spe+0.060
 bisc+0.059
etchup+0.056
 snowball+0.056
 extingu+0.056
Neg logit contributions
izes-0.041
 phrases-0.039
ative-0.039
 languages-0.037
 listened-0.037
Token a
Token ID257
Feature activation-8.67e-03

Pos logit contributions
 spe+0.019
 bisc+0.018
 pitcher+0.017
etchup+0.017
 funnel+0.017
Neg logit contributions
izes-0.013
ative-0.013
 phrases-0.012
 languages-0.012
 questions-0.011
Token magn
Token ID7842
Feature activation-3.64e-02

Pos logit contributions
 bisc+0.014
 spe+0.014
 pitcher+0.014
etchup+0.014
 snowball+0.013
Neg logit contributions
izes-0.015
 phrases-0.014
 listened-0.014
ative-0.014
 questions-0.014
Tokenifying
Token ID4035
Feature activation-1.22e-01

Pos logit contributions
 bisc+0.029
 spe+0.028
 pitcher+0.027
etchup+0.026
 snowball+0.025
Neg logit contributions
izes-0.092
 phrases-0.091
ative-0.091
 questions-0.090
 listened-0.090
Token glass
Token ID5405
Feature activation-4.29e-02

Pos logit contributions
etchup+0.136
 spe+0.134
 bisc+0.134
 extingu+0.126
ksh+0.120
Neg logit contributions
 stories-0.251
 everyone-0.249
 kind-0.248
 Everyone-0.247
 listened-0.244
Token.
Token ID13
Feature activation-5.27e-03

Pos logit contributions
 spe+0.092
 bisc+0.092
 pitcher+0.090
etchup+0.089
 funnel+0.087
Neg logit contributions
izes-0.051
 phrases-0.051
 listened-0.049
 questions-0.049
 stories-0.048
Token
Token ID198
Feature activation-1.59e-02

Pos logit contributions
 spe+0.010
 bisc+0.010
etchup+0.010
 pitcher+0.010
 extingu+0.009
Neg logit contributions
izes-0.007
 phrases-0.007
 listened-0.007
 stories-0.007
 questions-0.007
Token
Token ID198
Feature activation-1.30e-02

Pos logit contributions
 spe+0.033
 bisc+0.032
 pitcher+0.031
 extingu+0.031
etchup+0.031
Neg logit contributions
izes-0.021
 questions-0.020
 phrases-0.020
 listened-0.020
 stories-0.020
TokenAs
Token ID1722
Feature activation+1.48e-03

Pos logit contributions
 bisc+0.027
 pitcher+0.026
etchup+0.026
 spe+0.026
 funnel+0.025
Neg logit contributions
 phrases-0.016
 stories-0.016
izes-0.016
 questions-0.015
 languages-0.015
Token the
Token ID262
Feature activation+6.84e-03

Pos logit contributions
 spe+0.036
 bisc+0.036
 pitcher+0.035
etchup+0.035
 funnel+0.034
Neg logit contributions
 everyone-0.011
 stories-0.009
 questions-0.009
 phrases-0.009
 languages-0.009
Token watch
Token ID2342
Feature activation+9.99e-03

Pos logit contributions
izes+0.011
 phrases+0.011
ative+0.011
 listened+0.011
 languages+0.011
Neg logit contributions
 bisc-0.012
 spe-0.012
 pitcher-0.012
etchup-0.011
 funnel-0.011
Token with
Token ID351
Feature activation+2.94e-03

Pos logit contributions
izes+0.014
 phrases+0.013
 listened+0.013
 elders+0.012
 languages+0.012
Neg logit contributions
 bisc-0.020
 spe-0.020
 pitcher-0.019
 funnel-0.019
umbs-0.019
Token the
Token ID262
Feature activation-1.61e-03

Pos logit contributions
 stories+0.002
 questions+0.002
 phrases+0.002
 everyone+0.002
izes+0.002
Neg logit contributions
 bisc-0.007
 spe-0.007
etchup-0.007
 pitcher-0.007
 funnel-0.007
Token magn
Token ID7842
Feature activation-2.16e-02

Pos logit contributions
 bisc+0.003
 spe+0.003
 pitcher+0.003
etchup+0.003
 funnel+0.003
Neg logit contributions
izes-0.003
ative-0.002
 listened-0.002
 phrases-0.002
 languages-0.002
Tokenifying
Token ID4035
Feature activation-1.20e-01

Pos logit contributions
 bisc+0.014
 spe+0.014
etchup+0.013
 pitcher+0.013
 snowball+0.012
Neg logit contributions
ative-0.057
izes-0.057
 phrases-0.056
 listened-0.056
 questions-0.055
Token glass
Token ID5405
Feature activation-3.35e-02

Pos logit contributions
 spe+0.106
 bisc+0.104
etchup+0.102
 extingu+0.099
 snowball+0.095
Neg logit contributions
 stories-0.277
 kind-0.275
 Everyone-0.271
 everyone-0.270
 learn-0.269
Token.
Token ID13
Feature activation-2.27e-03

Pos logit contributions
 spe+0.070
 bisc+0.070
 pitcher+0.068
etchup+0.067
 extingu+0.066
Neg logit contributions
izes-0.043
 phrases-0.042
 listened-0.040
 questions-0.040
ative-0.039
Token He
Token ID679
Feature activation+1.15e-02

Pos logit contributions
 spe+0.005
 bisc+0.005
etchup+0.005
 pitcher+0.005
 extingu+0.005
Neg logit contributions
 phrases-0.002
 questions-0.002
 stories-0.002
izes-0.002
 listened-0.002
Token saw
Token ID2497
Feature activation-1.44e-02

Pos logit contributions
ative+0.019
izes+0.019
 phrases+0.018
 languages+0.018
 stories+0.018
Neg logit contributions
 spe-0.020
 bisc-0.020
etchup-0.019
 pitcher-0.019
 snowball-0.018
Token the
Token ID262
Feature activation-5.91e-03

Pos logit contributions
 spe+0.033
 bisc+0.033
etchup+0.032
 pitcher+0.032
 extingu+0.032
Neg logit contributions
izes-0.015
 phrases-0.015
 listened-0.014
 stories-0.014
 questions-0.014
Token He
Token ID679
Feature activation+3.20e-02

Pos logit contributions
izes+0.041
itive+0.037
ative+0.036
ising+0.035
ya+0.034
Neg logit contributions
 bisc-0.031
 snowball-0.030
 extingu-0.028
umbs-0.028
 funnel-0.028
Token took
Token ID1718
Feature activation+1.42e-02

Pos logit contributions
 languages+0.047
izes+0.047
ative+0.047
 phrases+0.045
 questions+0.045
Neg logit contributions
 spe-0.059
 bisc-0.058
 pitcher-0.055
 snowball-0.054
 extingu-0.053
Token out
Token ID503
Feature activation+5.53e-03

Pos logit contributions
izes+0.022
 phrases+0.021
itive+0.020
 listened+0.020
 languages+0.020
Neg logit contributions
 snowball-0.025
 bisc-0.024
 spe-0.024
 extingu-0.023
etchup-0.022
Token a
Token ID257
Feature activation-1.17e-03

Pos logit contributions
izes+0.008
 phrases+0.007
ative+0.006
 listened+0.006
Deal+0.006
Neg logit contributions
 snowball-0.012
 spe-0.012
 extingu-0.011
 bisc-0.011
etchup-0.011
Token magn
Token ID7842
Feature activation-1.82e-02

Pos logit contributions
 spe+0.002
 bisc+0.002
 pitcher+0.002
etchup+0.002
 snowball+0.002
Neg logit contributions
izes-0.002
 phrases-0.002
 listened-0.002
ative-0.002
 languages-0.002
Tokenifying
Token ID4035
Feature activation-1.16e-01

Pos logit contributions
 bisc+0.012
 spe+0.012
etchup+0.011
 pitcher+0.011
 snowball+0.010
Neg logit contributions
ative-0.048
izes-0.048
 phrases-0.048
 questions-0.047
 listened-0.047
Token glass
Token ID5405
Feature activation-3.88e-02

Pos logit contributions
 spe+0.114
 bisc+0.113
etchup+0.113
 extingu+0.104
 snowball+0.104
Neg logit contributions
 stories-0.257
 kind-0.255
 everyone-0.251
 listened-0.251
ative-0.250
Token and
Token ID290
Feature activation-2.13e-03

Pos logit contributions
 spe+0.081
 bisc+0.080
etchup+0.078
 pitcher+0.078
 extingu+0.076
Neg logit contributions
izes-0.048
 phrases-0.048
 listened-0.047
 stories-0.046
 questions-0.046
Token a
Token ID257
Feature activation+1.98e-03

Pos logit contributions
 bisc+0.004
 spe+0.004
 bur+0.004
 pitcher+0.003
 extingu+0.003
Neg logit contributions
ative-0.003
 languages-0.003
izes-0.003
 questions-0.003
 phrases-0.003
Token notebook
Token ID20922
Feature activation+1.73e-03

Pos logit contributions
izes+0.003
 phrases+0.003
 listened+0.003
 languages+0.003
ative+0.003
Neg logit contributions
 spe-0.003
 bisc-0.003
 pitcher-0.003
etchup-0.003
 snowball-0.003
Token and
Token ID290
Feature activation-2.03e-03

Pos logit contributions
 phrases+0.002
izes+0.002
 languages+0.002
 questions+0.002
 listened+0.002
Neg logit contributions
 bisc-0.004
 spe-0.004
 pitcher-0.003
 extingu-0.003
 funnel-0.003
Token that
Token ID326
Feature activation-2.86e-03

Pos logit contributions
izes+0.052
 leader+0.051
 phrases+0.050
ative+0.050
 languages+0.049
Neg logit contributions
 spe-0.064
 umb-0.059
etchup-0.058
 snowball-0.057
 bisc-0.057
Token the
Token ID262
Feature activation+2.18e-03

Pos logit contributions
 spe+0.006
 bisc+0.006
etchup+0.006
 pitcher+0.006
 snowball+0.006
Neg logit contributions
izes-0.003
 phrases-0.003
ative-0.003
 questions-0.003
 listened-0.003
Token zipper
Token ID48992
Feature activation-2.48e-02

Pos logit contributions
izes+0.003
 phrases+0.003
 listened+0.003
 stories+0.003
 questions+0.003
Neg logit contributions
 spe-0.004
 bisc-0.004
 pitcher-0.004
etchup-0.004
 extingu-0.004
Token had
Token ID550
Feature activation-5.84e-02

Pos logit contributions
 spe+0.044
 bisc+0.041
etchup+0.041
 funnel+0.040
 pitcher+0.039
Neg logit contributions
 phrases-0.039
 questions-0.038
 stories-0.038
 languages-0.037
 jokes-0.037
Token become
Token ID1716
Feature activation+8.95e-03

Pos logit contributions
 spe+0.097
 bisc+0.095
 pitcher+0.088
etchup+0.085
 bur+0.083
Neg logit contributions
izes-0.102
 phrases-0.095
 languages-0.095
 questions-0.094
 professor-0.094
Token unst
Token ID15014
Feature activation-1.14e-01

Pos logit contributions
izes+0.013
 phrases+0.013
 listened+0.013
 stories+0.013
 questions+0.013
Neg logit contributions
 bisc-0.017
 spe-0.017
etchup-0.016
 pitcher-0.016
 extingu-0.016
Tokenuck
Token ID1347
Feature activation-1.95e-02

Pos logit contributions
 spe+0.224
 snowball+0.209
 bisc+0.204
 funnel+0.203
 extingu+0.199
Neg logit contributions
izes-0.158
ising-0.157
ative-0.157
 phrases-0.151
 Everyone-0.151
Token.
Token ID13
Feature activation+1.31e-02

Pos logit contributions
 spe+0.036
etchup+0.035
 bisc+0.034
 pitcher+0.033
 extingu+0.033
Neg logit contributions
 phrases-0.030
izes-0.028
 professor-0.028
 languages-0.027
 learn-0.026
Token She
Token ID1375
Feature activation+8.65e-03

Pos logit contributions
 phrases+0.015
izes+0.015
 listened+0.015
 stories+0.015
 questions+0.015
Neg logit contributions
 spe-0.029
 bisc-0.028
etchup-0.027
 pitcher-0.027
 extingu-0.027
Token was
Token ID373
Feature activation+2.40e-02

Pos logit contributions
 questions+0.011
ative+0.010
 phrases+0.010
 languages+0.010
izes+0.010
Neg logit contributions
 spe-0.019
 bisc-0.018
 bur-0.018
 snowball-0.017
 pitcher-0.017
Token relieved
Token ID22598
Feature activation-1.86e-02

Pos logit contributions
 phrases+0.026
izes+0.025
 questions+0.024
 languages+0.024
 stories+0.024
Neg logit contributions
 spe-0.055
 bisc-0.054
 pitcher-0.053
etchup-0.053
 snowball-0.052
Token
Token ID198
Feature activation-1.10e-02

Pos logit contributions
 spe+0.028
 bisc+0.028
 pitcher+0.027
 extingu+0.027
etchup+0.027
Neg logit contributions
izes-0.016
 listened-0.016
 questions-0.016
 everyone-0.015
 learn-0.015
TokenHe
Token ID1544
Feature activation+1.76e-02

Pos logit contributions
 snowball+0.020
 bisc+0.019
 pitcher+0.018
 ditch+0.017
 funnel+0.017
Neg logit contributions
 phrases-0.018
 stories-0.017
izes-0.017
 languages-0.017
 questions-0.017
Token brought
Token ID3181
Feature activation-1.15e-02

Pos logit contributions
izes+0.027
ative+0.027
 phrases+0.027
 languages+0.027
 stories+0.026
Neg logit contributions
 spe-0.032
 bisc-0.031
 snowball-0.030
 pitcher-0.029
etchup-0.029
Token his
Token ID465
Feature activation-1.13e-02

Pos logit contributions
 spe+0.028
 bisc+0.028
 pitcher+0.027
etchup+0.027
 extingu+0.027
Neg logit contributions
izes-0.010
 listened-0.010
 stories-0.010
 phrases-0.010
 questions-0.010
Token magn
Token ID7842
Feature activation-2.68e-02

Pos logit contributions
 spe+0.021
 bisc+0.021
etchup+0.020
 pitcher+0.020
 extingu+0.020
Neg logit contributions
izes-0.017
 phrases-0.016
 listened-0.016
 stories-0.016
 questions-0.016
Tokenifying
Token ID4035
Feature activation-1.14e-01

Pos logit contributions
 bisc+0.028
etchup+0.026
 spe+0.026
 pitcher+0.024
 snowball+0.024
Neg logit contributions
ative-0.063
 kindly-0.058
 listened-0.058
izes-0.058
 phrases-0.058
Token glass
Token ID5405
Feature activation-3.57e-02

Pos logit contributions
 spe+0.124
etchup+0.121
 bisc+0.121
 snowball+0.116
ksh+0.115
Neg logit contributions
 kind-0.241
 stories-0.236
 everyone-0.231
 learn-0.230
ative-0.230
Token and
Token ID290
Feature activation-1.25e-02

Pos logit contributions
 spe+0.072
 bisc+0.071
etchup+0.070
 pitcher+0.069
 snowball+0.068
Neg logit contributions
izes-0.045
 stories-0.044
 questions-0.044
 listened-0.044
 phrases-0.044
Token looked
Token ID3114
Feature activation+2.74e-02

Pos logit contributions
 bisc+0.024
 spe+0.024
etchup+0.023
 pitcher+0.023
 snowball+0.023
Neg logit contributions
izes-0.018
ative-0.018
 phrases-0.017
 languages-0.017
 questions-0.017
Token closely
Token ID7173
Feature activation+3.42e-03

Pos logit contributions
 phrases+0.036
izes+0.036
 listened+0.034
ative+0.034
 languages+0.034
Neg logit contributions
 spe-0.056
 bisc-0.055
etchup-0.054
 pitcher-0.053
 snowball-0.053
Token.
Token ID13
Feature activation+1.65e-03

Pos logit contributions
izes+0.005
 phrases+0.005
 professor+0.004
 languages+0.004
ative+0.004
Neg logit contributions
 spe-0.007
 bisc-0.007
etchup-0.006
 extingu-0.006
 snowball-0.006
Token thought
Token ID1807
Feature activation-4.69e-04

Pos logit contributions
 spe+0.050
 bisc+0.049
etchup+0.048
 pitcher+0.047
 snowball+0.045
Neg logit contributions
 phrases-0.037
izes-0.037
 languages-0.037
ative-0.036
 stories-0.035
Token it
Token ID340
Feature activation-2.15e-02

Pos logit contributions
 spe+0.001
 bisc+0.001
etchup+0.001
 snowball+0.001
 pitcher+0.001
Neg logit contributions
 phrases-0.001
izes-0.001
ative-0.001
 languages-0.001
 leader-0.001
Token was
Token ID373
Feature activation-3.76e-02

Pos logit contributions
 spe+0.041
 bisc+0.039
 snowball+0.038
 funnel+0.036
 pitcher+0.036
Neg logit contributions
 phrases-0.032
izes-0.032
 questions-0.030
 languages-0.030
 professor-0.030
Token time
Token ID640
Feature activation-4.51e-02

Pos logit contributions
 spe+0.078
 bisc+0.076
 pitcher+0.075
etchup+0.073
 snowball+0.073
Neg logit contributions
izes-0.050
 phrases-0.049
ative-0.046
 languages-0.046
 listened-0.045
Token to
Token ID284
Feature activation-7.61e-03

Pos logit contributions
 bisc+0.073
etchup+0.069
 spe+0.068
 pitcher+0.068
 funnel+0.068
Neg logit contributions
 phrases-0.079
izes-0.077
ative-0.076
 languages-0.076
 professor-0.072
Token teach
Token ID4545
Feature activation+6.26e-02

Pos logit contributions
 bisc+0.015
 pitcher+0.014
 spe+0.014
 extingu+0.014
 snowball+0.013
Neg logit contributions
 phrases-0.011
izes-0.011
ative-0.011
 languages-0.011
 professor-0.010
Token her
Token ID607
Feature activation+7.30e-03

Pos logit contributions
izes+0.086
ative+0.079
 professor+0.076
 leader+0.074
 listened+0.073
Neg logit contributions
 bisc-0.118
etchup-0.113
 pitcher-0.113
umbs-0.112
 cups-0.111
Token something
Token ID1223
Feature activation+2.61e-02

Pos logit contributions
izes+0.011
 professor+0.010
 phrases+0.010
ative+0.009
 languages+0.009
Neg logit contributions
 pitcher-0.013
 bisc-0.013
 snowball-0.013
etchup-0.013
 towel-0.012
Token important
Token ID1593
Feature activation-2.11e-02

Pos logit contributions
 phrases+0.039
izes+0.038
 professor+0.036
 languages+0.035
 listened+0.035
Neg logit contributions
 pitcher-0.046
 bisc-0.046
 spe-0.045
 snowball-0.045
etchup-0.045
Token.
Token ID13
Feature activation+8.83e-03

Pos logit contributions
 bisc+0.032
 extingu+0.029
etchup+0.029
 spe+0.027
 pitcher+0.026
Neg logit contributions
izes-0.036
 phrases-0.034
 Everybody-0.033
 musician-0.033
 professor-0.033
Token
Token ID198
Feature activation-1.26e-02

Pos logit contributions
izes+0.015
 phrases+0.014
ative+0.014
 listened+0.014
 stories+0.014
Neg logit contributions
 bisc-0.014
 pitcher-0.014
etchup-0.014
 spe-0.014
 funnel-0.014
Token
Token ID198
Feature activation-3.31e-03

Pos logit contributions
 spe+0.009
 bisc+0.009
 pitcher+0.009
etchup+0.009
 extingu+0.009
Neg logit contributions
izes-0.006
 phrases-0.006
 stories-0.006
 questions-0.006
 listened-0.006
TokenThe
Token ID464
Feature activation+4.54e-02

Pos logit contributions
 bisc+0.006
etchup+0.005
 pitcher+0.005
 spe+0.005
 snowball+0.005
Neg logit contributions
 phrases-0.005
izes-0.005
 stories-0.005
 questions-0.005
 languages-0.005
Token moral
Token ID6573
Feature activation+3.67e-02

Pos logit contributions
izes+0.049
 phrases+0.049
 listened+0.047
 languages+0.045
ative+0.045
Neg logit contributions
 pitcher-0.103
 bisc-0.103
 spe-0.101
 funnel-0.101
etchup-0.099
Token of
Token ID286
Feature activation+2.84e-02

Pos logit contributions
izes+0.058
itive+0.056
 phrases+0.055
 listened+0.053
 jokes+0.052
Neg logit contributions
etchup-0.065
 spe-0.063
 bisc-0.063
 pitcher-0.060
 funnel-0.060
Token the
Token ID262
Feature activation+6.27e-02

Pos logit contributions
izes+0.026
 phrases+0.026
 listened+0.025
 stories+0.025
 questions+0.024
Neg logit contributions
 spe-0.069
 bisc-0.068
etchup-0.067
 pitcher-0.066
 extingu-0.065
Token story
Token ID1621
Feature activation+6.35e-02

Pos logit contributions
izes+0.048
 phrases+0.045
ative+0.043
 listened+0.042
 languages+0.039
Neg logit contributions
 bisc-0.164
 spe-0.163
 pitcher-0.160
etchup-0.157
 snowball-0.157
Token is
Token ID318
Feature activation+2.39e-02

Pos logit contributions
itive+0.121
ya+0.119
 phrases+0.115
 communicate+0.111
 musician+0.111
Neg logit contributions
etchup-0.078
 funnel-0.077
 pitcher-0.074
 spe-0.074
 glass-0.072
Token:
Token ID25
Feature activation+6.21e-03

Pos logit contributions
itive+0.030
 phrases+0.029
izes+0.029
 nodded+0.027
master+0.027
Neg logit contributions
etchup-0.047
 pitcher-0.045
 funnel-0.044
 bur-0.044
 bisc-0.042
Token Listen
Token ID20600
Feature activation+3.59e-02

Pos logit contributions
izes+0.011
ative+0.010
 phrases+0.010
 listened+0.010
 professor+0.010
Neg logit contributions
etchup-0.010
 spe-0.010
 funnel-0.009
 pitcher-0.009
 bisc-0.009
Token to
Token ID284
Feature activation+2.60e-02

Pos logit contributions
itive+0.082
thel+0.079
 phrases+0.079
 Everybody+0.079
Deal+0.079
Neg logit contributions
 bur-0.024
 cup-0.019
 funnel-0.018
onna-0.018
 snowball-0.018
Token your
Token ID534
Feature activation+4.36e-02

Pos logit contributions
 nodded+0.046
ative+0.045
izes+0.044
master+0.044
 agreed+0.041
Neg logit contributions
 bur-0.039
 mud-0.036
 spe-0.036
 pic-0.035
 Mitt-0.035
Token listen
Token ID6004
Feature activation+3.96e-02

Pos logit contributions
ying+0.048
izes+0.048
ative+0.048
ently+0.048
mber+0.046
Neg logit contributions
 spe-0.028
 bisc-0.023
 extingu-0.022
 bur-0.022
 drown-0.021
Token.
Token ID13
Feature activation+1.27e-02

Pos logit contributions
 phrases+0.082
Deal+0.081
 professor+0.081
 languages+0.079
thel+0.079
Neg logit contributions
 spe-0.035
 extingu-0.031
 snowball-0.031
 funnel-0.030
 bisc-0.030
Token
Token ID198
Feature activation-6.62e-03

Pos logit contributions
izes+0.019
 phrases+0.018
 listened+0.018
 stories+0.018
 questions+0.018
Neg logit contributions
 spe-0.024
 bisc-0.024
etchup-0.023
 pitcher-0.023
 funnel-0.022
Token
Token ID198
Feature activation-4.23e-03

Pos logit contributions
 spe+0.013
 bisc+0.013
 pitcher+0.013
etchup+0.012
 extingu+0.012
Neg logit contributions
izes-0.009
 phrases-0.009
 questions-0.009
 listened-0.009
 stories-0.009
TokenThe
Token ID464
Feature activation+3.63e-02

Pos logit contributions
 pitcher+0.008
 snowball+0.007
etchup+0.007
 bisc+0.007
 funnel+0.007
Neg logit contributions
 stories-0.006
 phrases-0.006
 languages-0.006
izes-0.006
 questions-0.006
Token stories
Token ID3923
Feature activation+7.42e-02

Pos logit contributions
 phrases+0.051
ative+0.048
izes+0.047
ently+0.047
isdom+0.047
Neg logit contributions
 bisc-0.071
 spe-0.071
 snowball-0.069
 funnel-0.069
 pitcher-0.068
Token were
Token ID547
Feature activation-7.95e-03

Pos logit contributions
 elders+0.122
 phrases+0.120
ya+0.119
 questions+0.118
isdom+0.118
Neg logit contributions
 spe-0.116
umbs-0.112
 bisc-0.110
 pitcher-0.110
 funnel-0.109
Token so
Token ID523
Feature activation-5.52e-04

Pos logit contributions
 spe+0.016
 bisc+0.015
 snowball+0.015
 pitcher+0.014
etchup+0.014
Neg logit contributions
izes-0.011
 phrases-0.010
 professor-0.009
 languages-0.009
ative-0.009
Token fun
Token ID1257
Feature activation+1.89e-02

Pos logit contributions
 spe+0.001
etchup+0.001
 pitcher+0.001
 pic+0.001
 bisc+0.001
Neg logit contributions
ently-0.001
itive-0.001
 phrases-0.001
izes-0.001
fully-0.001
Token and
Token ID290
Feature activation+9.03e-03

Pos logit contributions
izes+0.029
 phrases+0.028
ative+0.026
 languages+0.026
 elders+0.026
Neg logit contributions
 spe-0.033
 bisc-0.032
etchup-0.032
 snowball-0.031
 pitcher-0.030
Token funny
Token ID8258
Feature activation+3.27e-02

Pos logit contributions
izes+0.012
ative+0.012
ently+0.011
 phrases+0.011
 professor+0.010
Neg logit contributions
 spe-0.019
 bisc-0.017
etchup-0.016
 snowball-0.016
 pitcher-0.016
Token
Token ID198
Feature activation-2.38e-03

Pos logit contributions
 spe+0.011
 bisc+0.010
 extingu+0.010
 pitcher+0.010
etchup+0.010
Neg logit contributions
 everyone-0.006
izes-0.006
 Everyone-0.006
 listened-0.005
 learn-0.005
TokenThe
Token ID464
Feature activation+4.27e-02

Pos logit contributions
 spe+0.005
 bisc+0.005
etchup+0.005
 pitcher+0.005
 extingu+0.005
Neg logit contributions
izes-0.003
 phrases-0.003
 listened-0.003
 stories-0.003
 questions-0.003
Token moral
Token ID6573
Feature activation+3.69e-02

Pos logit contributions
izes+0.037
 phrases+0.037
 listened+0.034
 questions+0.034
ative+0.033
Neg logit contributions
 bisc-0.106
 pitcher-0.105
 spe-0.104
etchup-0.102
 funnel-0.101
Token of
Token ID286
Feature activation+1.83e-02

Pos logit contributions
izes+0.064
itive+0.062
 phrases+0.062
 questions+0.057
 jokes+0.057
Neg logit contributions
etchup-0.058
 bisc-0.057
 spe-0.056
 pitcher-0.054
umbs-0.053
Token the
Token ID262
Feature activation+6.17e-02

Pos logit contributions
 listened+0.013
 everyone+0.013
 stories+0.013
izes+0.013
 learn+0.012
Neg logit contributions
 spe-0.048
 bisc-0.048
 pitcher-0.047
etchup-0.046
 funnel-0.046
Token story
Token ID1621
Feature activation+6.09e-02

Pos logit contributions
izes+0.046
 phrases+0.043
ative+0.040
 listened+0.039
 questions+0.038
Neg logit contributions
 bisc-0.162
 spe-0.160
 pitcher-0.157
etchup-0.154
 funnel-0.153
Token is
Token ID318
Feature activation+2.89e-02

Pos logit contributions
itive+0.106
ya+0.106
 phrases+0.105
 questions+0.102
mber+0.100
Neg logit contributions
 spe-0.081
 pitcher-0.080
 funnel-0.080
etchup-0.079
 snowball-0.077
Token that
Token ID326
Feature activation-7.11e-03

Pos logit contributions
izes+0.040
 phrases+0.038
 languages+0.038
 stories+0.037
itive+0.037
Neg logit contributions
etchup-0.055
 pitcher-0.053
 spe-0.051
 bur-0.051
 funnel-0.050
Token it
Token ID340
Feature activation+2.65e-03

Pos logit contributions
 spe+0.014
 bisc+0.014
 pitcher+0.014
etchup+0.014
 snowball+0.013
Neg logit contributions
izes-0.009
 phrases-0.009
ative-0.009
 questions-0.009
 stories-0.009
Token's
Token ID338
Feature activation-1.70e-02

Pos logit contributions
 stories+0.005
 songs+0.005
 phrases+0.005
 professor+0.005
 questions+0.004
Neg logit contributions
etchup-0.004
 bur-0.003
 pitcher-0.003
 funnel-0.003
 spe-0.003
Token important
Token ID1593
Feature activation-2.40e-02

Pos logit contributions
 spe+0.033
 pitcher+0.033
 bisc+0.033
 snowball+0.032
etchup+0.032
Neg logit contributions
 phrases-0.020
 languages-0.019
izes-0.019
 Everyone-0.018
 professor-0.018
Token games
Token ID1830
Feature activation+4.61e-02

Pos logit contributions
izes+0.057
 listened+0.056
ative+0.056
 professor+0.052
lder+0.051
Neg logit contributions
 spe-0.063
 snowball-0.062
 bisc-0.061
 cups-0.061
etchup-0.060
Token and
Token ID290
Feature activation-2.27e-03

Pos logit contributions
izes+0.063
 phrases+0.063
 listened+0.060
mber+0.059
ative+0.059
Neg logit contributions
 spe-0.087
 bisc-0.087
 pitcher-0.085
etchup-0.085
 extingu-0.083
Token see
Token ID766
Feature activation-8.03e-03

Pos logit contributions
 spe+0.004
 bisc+0.004
 pitcher+0.004
etchup+0.004
 funnel+0.004
Neg logit contributions
ative-0.003
izes-0.003
mber-0.003
 professor-0.003
ently-0.003
Token who
Token ID508
Feature activation-5.60e-03

Pos logit contributions
 spe+0.016
 snowball+0.016
 bisc+0.016
etchup+0.016
 extingu+0.015
Neg logit contributions
izes-0.011
 listened-0.010
ative-0.010
 phrases-0.010
mber-0.009
Token can
Token ID460
Feature activation-1.36e-02

Pos logit contributions
etchup+0.009
 spe+0.008
 pitcher+0.007
 funnel+0.007
 cups+0.007
Neg logit contributions
itive-0.011
ative-0.010
 phrases-0.010
ently-0.010
mber-0.010
Token win
Token ID1592
Feature activation+6.11e-02

Pos logit contributions
 spe+0.024
 pitcher+0.023
 funnel+0.022
 bisc+0.022
 snowball+0.022
Neg logit contributions
 phrases-0.021
ative-0.021
 languages-0.021
izes-0.021
mber-0.021
Token.
Token ID13
Feature activation-5.28e-03

Pos logit contributions
izes+0.088
 listened+0.086
ative+0.086
 phrases+0.084
 nodded+0.084
Neg logit contributions
 spe-0.112
etchup-0.108
 snowball-0.102
 pitcher-0.101
 funnel-0.099
Token
Token ID198
Feature activation-1.97e-02

Pos logit contributions
 spe+0.012
 bisc+0.012
 extingu+0.011
 pitcher+0.011
etchup+0.011
Neg logit contributions
 Everyone-0.006
 questions-0.006
 everyone-0.005
 languages-0.005
 Everybody-0.005
Token
Token ID198
Feature activation-1.61e-02

Pos logit contributions
 spe+0.046
 bisc+0.045
 extingu+0.045
 pitcher+0.044
etchup+0.042
Neg logit contributions
 questions-0.021
 learn-0.021
 everyone-0.020
izes-0.020
 kind-0.020
TokenOne
Token ID3198
Feature activation-1.84e-02

Pos logit contributions
 bisc+0.029
 spe+0.029
etchup+0.028
 pitcher+0.028
 snowball+0.028
Neg logit contributions
 phrases-0.025
izes-0.024
 stories-0.024
ative-0.024
 questions-0.023
Token day
Token ID1110
Feature activation+2.41e-03

Pos logit contributions
etchup+0.026
 snowball+0.026
 pitcher+0.026
 spe+0.024
 cup+0.024
Neg logit contributions
 phrases-0.039
izes-0.036
ently-0.034
gging-0.033
ative-0.033
Token.
Token ID13
Feature activation+6.20e-03

Pos logit contributions
 bisc+0.079
 spe+0.078
etchup+0.077
 pitcher+0.075
 snowball+0.074
Neg logit contributions
 phrases-0.041
izes-0.040
 languages-0.036
 listened-0.036
ative-0.035
Token Have
Token ID8192
Feature activation+7.57e-03

Pos logit contributions
izes+0.010
 phrases+0.010
 stories+0.010
 questions+0.009
 listened+0.009
Neg logit contributions
 spe-0.011
 bisc-0.011
etchup-0.011
 extingu-0.010
 pitcher-0.010
Token a
Token ID257
Feature activation+3.23e-02

Pos logit contributions
izes+0.010
 phrases+0.010
ative+0.009
 languages+0.009
 questions+0.009
Neg logit contributions
 bisc-0.016
 spe-0.015
etchup-0.015
 pitcher-0.015
 snowball-0.015
Token nice
Token ID3621
Feature activation-7.20e-04

Pos logit contributions
 phrases+0.033
izes+0.032
 languages+0.031
ative+0.030
 questions+0.029
Neg logit contributions
 bisc-0.077
 spe-0.077
 pitcher-0.074
 snowball-0.074
etchup-0.073
Token day
Token ID1110
Feature activation+1.28e-02

Pos logit contributions
 bisc+0.002
 spe+0.002
 pitcher+0.002
 snowball+0.002
etchup+0.002
Neg logit contributions
izes-0.001
 phrases-0.001
ative-0.001
 languages-0.001
 questions-0.001
Token.
Token ID13
Feature activation+6.28e-03

Pos logit contributions
 phrases+0.019
izes+0.017
 languages+0.017
 stories+0.016
 professor+0.016
Neg logit contributions
 spe-0.025
etchup-0.025
 bisc-0.025
 pitcher-0.024
 funnel-0.024
Token Bye
Token ID47843
Feature activation-9.02e-03

Pos logit contributions
izes+0.008
 phrases+0.008
 listened+0.008
 stories+0.008
 questions+0.008
Neg logit contributions
 spe-0.012
 bisc-0.012
 pitcher-0.012
etchup-0.012
 snowball-0.012
Token bye
Token ID33847
Feature activation-1.60e-02

Pos logit contributions
 spe+0.020
 bisc+0.020
etchup+0.020
 pitcher+0.019
 extingu+0.019
Neg logit contributions
 phrases-0.010
izes-0.010
 languages-0.010
ative-0.009
 stories-0.009
Token."
Token ID526
Feature activation+1.25e-02

Pos logit contributions
 bisc+0.031
 spe+0.031
etchup+0.031
 pitcher+0.030
 extingu+0.030
Neg logit contributions
 phrases-0.022
izes-0.022
 languages-0.020
 listened-0.020
 stories-0.020
Token She
Token ID1375
Feature activation+1.65e-02

Pos logit contributions
izes+0.025
ative+0.023
 phrases+0.023
 listened+0.023
 questions+0.022
Neg logit contributions
 bisc-0.018
 spe-0.018
 pitcher-0.017
etchup-0.017
 snowball-0.017
Token walks
Token ID11114
Feature activation+2.19e-02

Pos logit contributions
 phrases+0.023
izes+0.023
ative+0.023
 languages+0.022
 listened+0.022
Neg logit contributions
 bisc-0.032
 spe-0.032
etchup-0.031
 pitcher-0.030
 snowball-0.030
Token wander
Token ID27776
Feature activation+1.74e-02

Pos logit contributions
izes+0.027
ative+0.027
 phrases+0.026
 languages+0.025
mber+0.024
Neg logit contributions
 spe-0.023
 bur-0.021
 bisc-0.021
 extingu-0.021
 snowball-0.020
Token outside
Token ID2354
Feature activation+1.31e-02

Pos logit contributions
 phrases+0.029
izes+0.028
 listened+0.027
ative+0.027
 elders+0.026
Neg logit contributions
 spe-0.030
 bisc-0.029
 snowball-0.029
etchup-0.027
 extingu-0.027
Token his
Token ID465
Feature activation-1.75e-02

Pos logit contributions
izes+0.022
 phrases+0.021
mber+0.020
 listened+0.020
ative+0.019
Neg logit contributions
 spe-0.023
 snowball-0.022
 extingu-0.022
 funnel-0.022
etchup-0.022
Token home
Token ID1363
Feature activation-1.06e-02

Pos logit contributions
 spe+0.028
 bisc+0.028
etchup+0.027
 pitcher+0.026
 extingu+0.026
Neg logit contributions
izes-0.031
 stories-0.030
 phrases-0.030
 questions-0.030
 listened-0.030
Token and
Token ID290
Feature activation+1.33e-02

Pos logit contributions
 spe+0.018
 extingu+0.017
 bisc+0.017
 bur+0.017
 pitcher+0.017
Neg logit contributions
 phrases-0.017
izes-0.017
mber-0.017
 Everybody-0.016
 Everyone-0.016
Token explore
Token ID7301
Feature activation+3.78e-02

Pos logit contributions
ative+0.024
 phrases+0.022
izes+0.022
ently+0.021
 languages+0.021
Neg logit contributions
 bur-0.021
 spe-0.021
 bisc-0.021
 snowball-0.019
 funnel-0.019
Token the
Token ID262
Feature activation-2.87e-03

Pos logit contributions
izes+0.048
 phrases+0.046
 listened+0.044
ative+0.043
 leader+0.041
Neg logit contributions
 spe-0.080
 snowball-0.078
 bisc-0.078
etchup-0.075
 extingu-0.074
Token hills
Token ID18639
Feature activation+2.59e-02

Pos logit contributions
 spe+0.006
 bisc+0.005
etchup+0.005
 pitcher+0.005
 extingu+0.005
Neg logit contributions
izes-0.004
 phrases-0.004
 listened-0.004
 stories-0.004
 questions-0.004
Token and
Token ID290
Feature activation+2.72e-03

Pos logit contributions
izes+0.038
 phrases+0.038
 listened+0.035
mber+0.035
 elders+0.034
Neg logit contributions
 spe-0.051
etchup-0.046
 funnel-0.046
 snowball-0.046
 extingu-0.046
Token forests
Token ID17039
Feature activation+3.50e-02

Pos logit contributions
izes+0.004
ative+0.004
 phrases+0.004
 languages+0.004
 listened+0.004
Neg logit contributions
 spe-0.005
 bisc-0.005
etchup-0.004
 snowball-0.004
 pitcher-0.004
Token nearby
Token ID6716
Feature activation-2.03e-02

Pos logit contributions
izes+0.054
 phrases+0.052
mber+0.051
 elders+0.049
isdom+0.049
Neg logit contributions
 spe-0.068
 bisc-0.060
 snowball-0.060
 extingu-0.059
etchup-0.057
Token lot
Token ID1256
Feature activation-1.95e-02

Pos logit contributions
umbs+0.019
etchup+0.018
 extingu+0.018
 spe+0.018
 bisc+0.018
Neg logit contributions
 kind-0.010
 stories-0.009
 questions-0.009
izes-0.009
 leader-0.009
Token of
Token ID286
Feature activation-1.56e-02

Pos logit contributions
 bisc+0.037
etchup+0.037
 spe+0.036
 pitcher+0.036
 extingu+0.035
Neg logit contributions
izes-0.029
 phrases-0.028
 listened-0.027
 languages-0.026
 stories-0.025
Token fun
Token ID1257
Feature activation+2.31e-02

Pos logit contributions
 spe+0.032
 bisc+0.031
etchup+0.031
 pitcher+0.031
 extingu+0.030
Neg logit contributions
izes-0.020
 phrases-0.020
 stories-0.020
 questions-0.020
 listened-0.020
Token making
Token ID1642
Feature activation+2.36e-02

Pos logit contributions
izes+0.028
 phrases+0.028
ative+0.027
 professor+0.027
 listened+0.027
Neg logit contributions
 spe-0.048
etchup-0.048
 bisc-0.047
 pitcher-0.047
 funnel-0.047
Token noises
Token ID26782
Feature activation+4.51e-02

Pos logit contributions
izes+0.033
 listened+0.031
ative+0.031
 phrases+0.030
 professor+0.029
Neg logit contributions
 bisc-0.047
 snowball-0.046
 spe-0.046
 pitcher-0.046
etchup-0.045
Token and
Token ID290
Feature activation-4.63e-03

Pos logit contributions
izes+0.067
 phrases+0.065
 elders+0.062
mber+0.062
 languages+0.061
Neg logit contributions
 spe-0.085
 bisc-0.083
 pitcher-0.082
 snowball-0.082
 extingu-0.082
Token pretending
Token ID23662
Feature activation+2.50e-02

Pos logit contributions
 spe+0.008
 bisc+0.007
etchup+0.007
 extingu+0.007
 pitcher+0.007
Neg logit contributions
izes-0.008
ative-0.008
 professor-0.007
mber-0.007
 phrases-0.007
Token to
Token ID284
Feature activation-1.89e-02

Pos logit contributions
izes+0.042
 phrases+0.040
ative+0.039
 elders+0.038
 professor+0.038
Neg logit contributions
 spe-0.042
 snowball-0.040
 bisc-0.040
 pitcher-0.038
 funnel-0.038
Token be
Token ID307
Feature activation-9.57e-03

Pos logit contributions
 spe+0.037
 bisc+0.036
 pitcher+0.035
 snowball+0.035
 extingu+0.034
Neg logit contributions
izes-0.027
ative-0.027
 phrases-0.026
 questions-0.025
 listened-0.024
Token animals
Token ID4695
Feature activation+2.11e-02

Pos logit contributions
 spe+0.020
 bisc+0.019
 snowball+0.018
 extingu+0.018
etchup+0.018
Neg logit contributions
izes-0.014
 phrases-0.012
 listened-0.012
ative-0.011
 questions-0.011
Token.
Token ID13
Feature activation-6.08e-03

Pos logit contributions
izes+0.030
 phrases+0.029
mber+0.027
 listened+0.027
 elders+0.026
Neg logit contributions
 spe-0.042
 bisc-0.041
 snowball-0.040
etchup-0.039
 extingu-0.039
Token in
Token ID287
Feature activation-6.71e-03

Pos logit contributions
izes+0.021
 phrases+0.021
 listened+0.021
 stories+0.021
 questions+0.021
Neg logit contributions
 spe-0.032
 bisc-0.032
etchup-0.031
 pitcher-0.031
 extingu-0.030
Token one
Token ID530
Feature activation-9.02e-04

Pos logit contributions
 bisc+0.018
 spe+0.017
etchup+0.017
 pitcher+0.017
umbs+0.017
Neg logit contributions
 everyone-0.005
 stories-0.005
 kind-0.005
 learn-0.005
 questions-0.005
Token piece
Token ID3704
Feature activation-2.34e-02

Pos logit contributions
 spe+0.002
 bisc+0.002
etchup+0.002
 extingu+0.002
 pitcher+0.002
Neg logit contributions
izes-0.001
 questions-0.001
 kind-0.001
 stories-0.001
 languages-0.001
Token.
Token ID13
Feature activation-3.07e-03

Pos logit contributions
 pitcher+0.041
 bisc+0.041
 spe+0.041
etchup+0.041
umbs+0.041
Neg logit contributions
 phrases-0.038
izes-0.035
 listened-0.035
 languages-0.034
 professor-0.034
Token All
Token ID1439
Feature activation+1.79e-02

Pos logit contributions
 spe+0.006
 bisc+0.006
etchup+0.006
 pitcher+0.006
 snowball+0.005
Neg logit contributions
 phrases-0.004
 Everyone-0.004
 questions-0.004
izes-0.004
 everyone-0.004
Tokenright
Token ID3506
Feature activation+1.02e-02

Pos logit contributions
 phrases+0.027
 professor+0.025
izes+0.025
ative+0.024
 stories+0.024
Neg logit contributions
 spe-0.035
etchup-0.034
 snowball-0.033
 pitcher-0.032
 funnel-0.032
Token?"
Token ID1701
Feature activation+2.33e-02

Pos logit contributions
 phrases+0.017
izes+0.017
ative+0.016
 listened+0.016
 stories+0.016
Neg logit contributions
 spe-0.018
etchup-0.017
 pitcher-0.017
 bisc-0.017
 extingu-0.016
Token
Token ID198
Feature activation-5.11e-03

Pos logit contributions
izes+0.048
ative+0.045
 phrases+0.045
 stories+0.044
 languages+0.043
Neg logit contributions
 snowball-0.030
 spe-0.029
 bisc-0.029
 extingu-0.029
etchup-0.028
Token
Token ID198
Feature activation-4.66e-03

Pos logit contributions
 bisc+0.008
 spe+0.008
etchup+0.008
 snowball+0.008
 pitcher+0.008
Neg logit contributions
izes-0.009
 phrases-0.009
 stories-0.009
 languages-0.008
ative-0.008
TokenJohn
Token ID7554
Feature activation+1.83e-02

Pos logit contributions
 snowball+0.007
Uh+0.006
 pitcher+0.006
 extingu+0.006
 bisc+0.006
Neg logit contributions
izes-0.008
 stories-0.008
 phrases-0.008
 questions-0.008
 languages-0.008
Token nodded
Token ID14464
Feature activation+4.76e-02

Pos logit contributions
ative+0.030
 languages+0.029
izes+0.029
 professor+0.029
 stories+0.029
Neg logit contributions
 spe-0.030
 snowball-0.030
 bisc-0.029
 bur-0.029
 pitcher-0.028
Token said
Token ID531
Feature activation-6.18e-03

Pos logit contributions
 languages+0.008
 questions+0.008
izes+0.008
ative+0.008
 phrases+0.007
Neg logit contributions
 bisc-0.011
 spe-0.010
 bur-0.009
etchup-0.009
 pitcher-0.009
Token,
Token ID11
Feature activation-2.23e-02

Pos logit contributions
 spe+0.012
 bisc+0.010
 extingu+0.009
 pitcher+0.009
 snowball+0.009
Neg logit contributions
 languages-0.010
izes-0.010
 phrases-0.009
 communicate-0.009
 elders-0.009
Token "
Token ID366
Feature activation-2.64e-02

Pos logit contributions
 spe+0.041
 bisc+0.038
 extingu+0.037
 funnel+0.037
 pitcher+0.037
Neg logit contributions
 languages-0.037
izes-0.037
 phrases-0.035
ative-0.034
 stories-0.033
TokenYou
Token ID1639
Feature activation+2.77e-02

Pos logit contributions
 spe+0.047
 bisc+0.047
etchup+0.046
 pitcher+0.044
 snowball+0.044
Neg logit contributions
izes-0.042
 phrases-0.041
 stories-0.041
 questions-0.040
 languages-0.039
Token were
Token ID547
Feature activation+2.05e-02

Pos logit contributions
izes+0.060
 phrases+0.058
 languages+0.057
ently+0.057
 professor+0.055
Neg logit contributions
 spe-0.032
 bisc-0.031
 pitcher-0.027
 funnel-0.027
 sponge-0.026
Token quite
Token ID2407
Feature activation+3.43e-02

Pos logit contributions
izes+0.036
 phrases+0.035
 languages+0.032
 elders+0.030
 professor+0.030
Neg logit contributions
 spe-0.037
 bisc-0.033
etchup-0.032
 snowball-0.032
 pitcher-0.032
Token brave
Token ID14802
Feature activation+7.37e-03

Pos logit contributions
izes+0.050
 phrases+0.048
 languages+0.045
ently+0.045
ative+0.041
Neg logit contributions
 spe-0.070
 bisc-0.065
etchup-0.065
 pitcher-0.064
 snowball-0.060
Token for
Token ID329
Feature activation+5.51e-02

Pos logit contributions
izes+0.011
 phrases+0.010
 listened+0.009
 learn+0.009
 communicate+0.009
Neg logit contributions
 spe-0.015
 bisc-0.014
etchup-0.014
 snowball-0.014
 pitcher-0.014
Token a
Token ID257
Feature activation+2.57e-02

Pos logit contributions
izes+0.088
ative+0.083
 nodded+0.082
 phrases+0.081
master+0.079
Neg logit contributions
 spe-0.097
 bur-0.090
etchup-0.086
 mud-0.084
 bisc-0.083
Token little
Token ID1310
Feature activation-2.00e-02

Pos logit contributions
izes+0.039
 languages+0.036
 phrases+0.036
ative+0.035
 stories+0.033
Neg logit contributions
 spe-0.052
 bisc-0.050
 snowball-0.049
 pitcher-0.049
etchup-0.046
Token pup
Token ID15552
Feature activation+2.27e-02

Pos logit contributions
 spe+0.040
 bisc+0.039
 pitcher+0.038
etchup+0.038
 snowball+0.037
Neg logit contributions
izes-0.028
 phrases-0.027
 languages-0.027
 stories-0.027
 listened-0.027
Token have
Token ID423
Feature activation-6.98e-03

Pos logit contributions
ative+0.001
izes+0.001
ently+0.001
 phrases+0.001
ying+0.001
Neg logit contributions
 spe-0.001
 bisc-0.001
 snowball-0.001
 bur-0.001
 extingu-0.001
Token fun
Token ID1257
Feature activation+1.57e-02

Pos logit contributions
 spe+0.015
 bisc+0.015
etchup+0.015
 pitcher+0.015
 extingu+0.014
Neg logit contributions
 stories-0.008
 listened-0.008
 phrases-0.008
izes-0.008
 questions-0.008
Token.
Token ID13
Feature activation+6.74e-03

Pos logit contributions
izes+0.018
 phrases+0.018
 listened+0.016
ative+0.016
 questions+0.016
Neg logit contributions
 spe-0.035
 bisc-0.034
 snowball-0.034
etchup-0.034
 pitcher-0.033
Token He
Token ID679
Feature activation+1.62e-02

Pos logit contributions
 Everyone+0.008
 questions+0.007
 everyone+0.007
 stories+0.007
 learn+0.007
Neg logit contributions
 spe-0.016
 pitcher-0.015
 bisc-0.015
 extingu-0.015
etchup-0.015
Token was
Token ID373
Feature activation+1.71e-02

Pos logit contributions
 phrases+0.025
ative+0.025
 languages+0.023
izes+0.023
 Everyone+0.022
Neg logit contributions
 spe-0.033
 snowball-0.030
 bur-0.029
etchup-0.028
 bisc-0.027
Token the
Token ID262
Feature activation-7.53e-03

Pos logit contributions
 phrases+0.021
izes+0.021
 languages+0.020
 questions+0.017
ently+0.017
Neg logit contributions
 spe-0.039
 snowball-0.037
etchup-0.037
 pitcher-0.035
 bisc-0.035
Token happiest
Token ID49414
Feature activation+2.35e-02

Pos logit contributions
 spe+0.016
 bisc+0.015
 snowball+0.015
 extingu+0.014
 pitcher+0.014
Neg logit contributions
 phrases-0.011
izes-0.011
 languages-0.010
 listened-0.009
 stories-0.009
Token re
Token ID302
Feature activation+7.21e-03

Pos logit contributions
izes+0.045
 phrases+0.045
 learn+0.042
 languages+0.042
 questions+0.041
Neg logit contributions
 spe-0.039
 snowball-0.034
 bisc-0.034
etchup-0.033
 extingu-0.033
Tokenind
Token ID521
Feature activation-1.08e-02

Pos logit contributions
 phrases+0.014
 stories+0.013
 learn+0.013
 languages+0.013
 questions+0.013
Neg logit contributions
 spe-0.012
etchup-0.011
 extingu-0.010
icer-0.010
 snowball-0.010
Tokeneer
Token ID28153
Feature activation+2.25e-02

Pos logit contributions
 spe+0.016
etchup+0.015
 extingu+0.014
 snowball+0.014
umbs+0.013
Neg logit contributions
 phrases-0.023
izes-0.022
 learn-0.021
 languages-0.021
 stories-0.020
Token in
Token ID287
Feature activation+1.18e-02

Pos logit contributions
 phrases+0.043
 languages+0.040
izes+0.039
 questions+0.038
 stories+0.036
Neg logit contributions
 spe-0.039
etchup-0.037
 bisc-0.033
 snowball-0.033
umbs-0.031
Token and
Token ID290
Feature activation+8.84e-03

Pos logit contributions
izes+0.011
 phrases+0.010
 languages+0.010
 questions+0.010
 elders+0.010
Neg logit contributions
 spe-0.013
 bisc-0.012
 pitcher-0.012
umbs-0.012
 extingu-0.011
Token colorful
Token ID20239
Feature activation+5.62e-03

Pos logit contributions
izes+0.011
 questions+0.011
ya+0.011
ative+0.011
yer+0.011
Neg logit contributions
 spe-0.017
 bisc-0.016
 bur-0.016
etchup-0.015
 pitcher-0.015
Token ribbon
Token ID29092
Feature activation-8.15e-04

Pos logit contributions
izes+0.011
yer+0.010
 listened+0.010
ative+0.010
mber+0.010
Neg logit contributions
 spe-0.009
 bisc-0.008
etchup-0.007
 snowball-0.007
 umb-0.007
Token.
Token ID13
Feature activation+1.71e-02

Pos logit contributions
 spe+0.002
 bisc+0.001
 extingu+0.001
umbs+0.001
etchup+0.001
Neg logit contributions
izes-0.001
 questions-0.001
 phrases-0.001
mber-0.001
 languages-0.001
Token<|endoftext|>
Token ID50256
Feature activation-1.84e-02

Pos logit contributions
 everyone+0.021
 Everyone+0.020
 stories+0.020
ative+0.020
 listened+0.020
Neg logit contributions
 spe-0.036
 bisc-0.036
 pitcher-0.035
etchup-0.034
 extingu-0.034
TokenOnce
Token ID7454
Feature activation-3.91e-02

Pos logit contributions
 spe+0.045
 pitcher+0.043
 extingu+0.043
 bisc+0.043
etchup+0.042
Neg logit contributions
 everyone-0.018
 Everyone-0.018
 listened-0.017
ative-0.016
 kind-0.016
Token upon
Token ID2402
Feature activation+1.80e-02

Pos logit contributions
 spe+0.074
 bisc+0.074
 extingu+0.070
 pitcher+0.070
etchup+0.070
Neg logit contributions
izes-0.057
 questions-0.056
 everyone-0.055
 listened-0.055
 stories-0.055
Token a
Token ID257
Feature activation+3.49e-02

Pos logit contributions
izes+0.017
 phrases+0.017
 listened+0.017
 stories+0.017
 questions+0.016
Neg logit contributions
 spe-0.043
 bisc-0.042
etchup-0.041
 pitcher-0.041
 extingu-0.041
Token time
Token ID640
Feature activation-1.34e-02

Pos logit contributions
 kind+0.034
 often+0.033
 listened+0.032
 stories+0.031
 listen+0.031
Neg logit contributions
 spe-0.085
 extingu-0.084
etchup-0.083
 funnel-0.082
 bisc-0.082
Token,
Token ID11
Feature activation-1.11e-02

Pos logit contributions
 bisc+0.039
 spe+0.039
 pitcher+0.038
etchup+0.038
 extingu+0.037
Neg logit contributions
 listened-0.006
izes-0.006
 questions-0.005
 stories-0.005
 everyone-0.005
Token there
Token ID612
Feature activation-1.22e-02

Pos logit contributions
 bisc+0.025
 spe+0.023
 extingu+0.023
 pitcher+0.023
umbs+0.023
Neg logit contributions
 kind-0.014
 listened-0.014
 everyone-0.013
 loved-0.013
 often-0.012
Token in
Token ID287
Feature activation-3.18e-02

Pos logit contributions
 pitcher+0.027
etchup+0.027
 spe+0.027
 funnel+0.026
 bisc+0.026
Neg logit contributions
 phrases-0.019
 questions-0.017
ative-0.017
izes-0.017
 languages-0.017
Token the
Token ID262
Feature activation-6.37e-03

Pos logit contributions
 bisc+0.086
 spe+0.084
 pitcher+0.082
etchup+0.082
 extingu+0.081
Neg logit contributions
 everyone-0.022
 listened-0.020
 stories-0.020
izes-0.020
 languages-0.019
Token park
Token ID3952
Feature activation+1.70e-02

Pos logit contributions
 spe+0.013
 extingu+0.012
etchup+0.012
 bisc+0.012
umbs+0.012
Neg logit contributions
 stories-0.009
 kind-0.008
izes-0.008
 learn-0.008
 listened-0.008
Token.
Token ID13
Feature activation-7.77e-03

Pos logit contributions
 phrases+0.027
ative+0.025
izes+0.025
 professor+0.025
 questions+0.024
Neg logit contributions
 pitcher-0.031
 bisc-0.030
 spe-0.030
 funnel-0.030
etchup-0.029
Token Lily
Token ID20037
Feature activation-4.77e-03

Pos logit contributions
 spe+0.016
 bisc+0.016
 extingu+0.016
 pitcher+0.015
 bur+0.015
Neg logit contributions
 Everyone-0.010
 everyone-0.009
 questions-0.009
 Everybody-0.009
 kind-0.009
Token and
Token ID290
Feature activation-1.77e-02

Pos logit contributions
etchup+0.009
 spe+0.009
 snowball+0.009
 pitcher+0.009
 bisc+0.009
Neg logit contributions
 phrases-0.007
ative-0.007
izes-0.006
 stories-0.006
 languages-0.006
Token Ben
Token ID3932
Feature activation+8.95e-03

Pos logit contributions
 spe+0.024
 bisc+0.024
etchup+0.023
 pitcher+0.023
 snowball+0.022
Neg logit contributions
izes-0.037
ative-0.035
 phrases-0.034
 listened-0.033
 languages-0.033
Token want
Token ID765
Feature activation-3.62e-02

Pos logit contributions
 phrases+0.016
ative+0.016
izes+0.016
mber+0.015
ently+0.015
Neg logit contributions
 spe-0.014
 bisc-0.014
 pitcher-0.013
 snowball-0.013
 funnel-0.013
Token to
Token ID284
Feature activation+1.44e-02

Pos logit contributions
 spe+0.079
 bisc+0.078
etchup+0.076
 pitcher+0.076
 extingu+0.074
Neg logit contributions
 listened-0.043
 stories-0.042
 questions-0.042
 phrases-0.042
izes-0.041
Token go
Token ID467
Feature activation-1.70e-02

Pos logit contributions
izes+0.021
 phrases+0.021
ative+0.020
 languages+0.020
 stories+0.020
Neg logit contributions
 bisc-0.027
 spe-0.027
 pitcher-0.027
 snowball-0.026
etchup-0.026
Token on
Token ID319
Feature activation+1.52e-02

Pos logit contributions
 spe+0.036
 bisc+0.036
 pitcher+0.035
 extingu+0.034
etchup+0.034
Neg logit contributions
izes-0.021
 listened-0.020
 questions-0.020
 learn-0.020
 stories-0.020
Token,
Token ID11
Feature activation-1.37e-02

Pos logit contributions
 spe+0.073
 bisc+0.072
etchup+0.070
 pitcher+0.070
 extingu+0.069
Neg logit contributions
izes-0.037
 phrases-0.036
 listened-0.035
 questions-0.035
 stories-0.034
Token and
Token ID290
Feature activation-4.74e-05

Pos logit contributions
 bisc+0.030
 spe+0.030
etchup+0.030
 pitcher+0.029
 extingu+0.029
Neg logit contributions
izes-0.015
 listened-0.015
 phrases-0.015
 stories-0.015
 questions-0.014
Token the
Token ID262
Feature activation+3.32e-03

Pos logit contributions
 spe+0.000
 bisc+0.000
etchup+0.000
 pitcher+0.000
 snowball+0.000
Neg logit contributions
izes-0.000
 phrases-0.000
ative-0.000
 questions-0.000
 languages-0.000
Token flowers
Token ID12734
Feature activation+1.53e-02

Pos logit contributions
izes+0.005
 phrases+0.005
ative+0.005
 listened+0.005
 questions+0.004
Neg logit contributions
 spe-0.006
 bisc-0.006
 pitcher-0.006
etchup-0.006
 snowball-0.006
Token began
Token ID2540
Feature activation-1.57e-02

Pos logit contributions
 phrases+0.027
 languages+0.027
 professor+0.026
 questions+0.026
izes+0.026
Neg logit contributions
 spe-0.027
 pitcher-0.024
 bisc-0.023
etchup-0.023
 funnel-0.023
Token to
Token ID284
Feature activation-2.14e-02

Pos logit contributions
 bisc+0.034
 spe+0.033
 pitcher+0.032
etchup+0.032
 snowball+0.032
Neg logit contributions
 phrases-0.018
izes-0.018
 listened-0.018
 stories-0.018
 questions-0.018
Token bloom
Token ID29955
Feature activation-3.66e-03

Pos logit contributions
 spe+0.040
 bisc+0.037
 pitcher+0.036
 extingu+0.036
 funnel+0.036
Neg logit contributions
izes-0.034
 phrases-0.034
 languages-0.033
ative-0.033
 questions-0.033
Token!
Token ID0
Feature activation+1.21e-02

Pos logit contributions
 spe+0.008
 bisc+0.008
 pitcher+0.007
umbs+0.007
 snowball+0.007
Neg logit contributions
 phrases-0.005
izes-0.005
 languages-0.004
 questions-0.004
 listened-0.004
Token The
Token ID383
Feature activation+2.78e-02

Pos logit contributions
 phrases+0.016
izes+0.015
 stories+0.015
 listened+0.015
 questions+0.015
Neg logit contributions
 spe-0.025
 bisc-0.024
etchup-0.024
 pitcher-0.024
 extingu-0.023
Token friends
Token ID2460
Feature activation+5.92e-02

Pos logit contributions
izes+0.038
 phrases+0.037
 listened+0.036
 stories+0.036
 questions+0.036
Neg logit contributions
 spe-0.055
 bisc-0.055
 pitcher-0.053
etchup-0.053
 snowball-0.052
Token smiled
Token ID13541
Feature activation+1.92e-02

Pos logit contributions
izes+0.110
 phrases+0.108
 languages+0.105
 professor+0.105
 questions+0.104
Neg logit contributions
 spe-0.095
 pitcher-0.082
 bur-0.081
etchup-0.079
 funnel-0.079
Token stopped
Token ID5025
Feature activation-7.95e-03

Pos logit contributions
 phrases+0.052
ative+0.051
 languages+0.051
 cultures+0.051
 questions+0.050
Neg logit contributions
 spe-0.064
 snowball-0.061
 bur-0.061
 pitcher-0.061
 bisc-0.060
Token crying
Token ID13774
Feature activation+4.11e-03

Pos logit contributions
 spe+0.014
 bisc+0.014
etchup+0.013
 pitcher+0.013
 extingu+0.013
Neg logit contributions
izes-0.013
 phrases-0.013
 listened-0.012
 stories-0.012
 questions-0.012
Token and
Token ID290
Feature activation+3.35e-03

Pos logit contributions
 phrases+0.008
izes+0.007
 languages+0.007
 professor+0.007
 talents+0.007
Neg logit contributions
 spe-0.006
 extingu-0.006
 pitcher-0.006
 snowball-0.006
umbs-0.006
Token said
Token ID531
Feature activation-8.61e-03

Pos logit contributions
ative+0.006
 languages+0.006
 phrases+0.005
izes+0.005
 questions+0.005
Neg logit contributions
 spe-0.006
 bur-0.006
etchup-0.005
 bisc-0.005
 pitcher-0.005
Token,
Token ID11
Feature activation-2.87e-02

Pos logit contributions
 spe+0.016
 bisc+0.014
 pitcher+0.014
etchup+0.013
 snowball+0.013
Neg logit contributions
 languages-0.014
 phrases-0.013
 professor-0.013
 cultures-0.013
izes-0.012
Token "
Token ID366
Feature activation-2.85e-02

Pos logit contributions
 spe+0.050
 funnel+0.047
 extingu+0.047
 pitcher+0.047
 bisc+0.046
Neg logit contributions
 languages-0.048
izes-0.047
 phrases-0.046
ative-0.044
 stories-0.044
TokenIt
Token ID1026
Feature activation-1.55e-02

Pos logit contributions
 spe+0.045
 bisc+0.043
etchup+0.043
 pitcher+0.042
 snowball+0.041
Neg logit contributions
izes-0.051
 phrases-0.051
 stories-0.051
 questions-0.050
 languages-0.049
Token's
Token ID338
Feature activation-2.37e-02

Pos logit contributions
 snowball+0.022
 spe+0.021
etchup+0.021
 bur+0.021
 bisc+0.020
Neg logit contributions
 professor-0.028
 phrases-0.027
 languages-0.026
ative-0.026
 questions-0.026
Token okay
Token ID8788
Feature activation-5.13e-02

Pos logit contributions
 spe+0.038
 bisc+0.037
etchup+0.036
 snowball+0.036
 pitcher+0.036
Neg logit contributions
 phrases-0.042
izes-0.041
 languages-0.040
ative-0.039
 questions-0.039
Token,
Token ID11
Feature activation-2.01e-02

Pos logit contributions
 spe+0.114
 snowball+0.108
etchup+0.107
 bisc+0.106
 pitcher+0.103
Neg logit contributions
izes-0.064
 phrases-0.062
ative-0.058
 languages-0.057
 professor-0.055
Token bird
Token ID6512
Feature activation+6.60e-03

Pos logit contributions
 spe+0.040
 pitcher+0.036
 bisc+0.036
 extingu+0.035
 snowball+0.035
Neg logit contributions
izes-0.030
 phrases-0.029
ative-0.028
 listened-0.028
 questions-0.027
Token.
Token ID13
Feature activation+5.22e-03

Pos logit contributions
 phrases+0.069
izes+0.066
 languages+0.064
ative+0.063
 professor+0.063
Neg logit contributions
 spe-0.093
 pitcher-0.091
 bisc-0.091
etchup-0.090
 extingu-0.089
Token He
Token ID679
Feature activation+1.91e-02

Pos logit contributions
izes+0.008
 phrases+0.008
 listened+0.007
 stories+0.007
 questions+0.007
Neg logit contributions
 spe-0.010
 bisc-0.010
 pitcher-0.009
etchup-0.009
 extingu-0.009
Token picked
Token ID6497
Feature activation-6.41e-04

Pos logit contributions
ative+0.027
 languages+0.027
 phrases+0.026
izes+0.025
 stories+0.024
Neg logit contributions
 bisc-0.039
 spe-0.039
 pitcher-0.038
etchup-0.037
 snowball-0.036
Token up
Token ID510
Feature activation+1.33e-02

Pos logit contributions
 bisc+0.001
 spe+0.001
etchup+0.001
 pitcher+0.001
 snowball+0.001
Neg logit contributions
izes-0.001
 phrases-0.001
 listened-0.001
 languages-0.001
ative-0.001
Token the
Token ID262
Feature activation-1.97e-02

Pos logit contributions
izes+0.019
 phrases+0.018
ative+0.018
 listened+0.016
 languages+0.016
Neg logit contributions
 bisc-0.028
 snowball-0.027
 spe-0.027
 pitcher-0.027
etchup-0.026
Token big
Token ID1263
Feature activation-7.57e-02

Pos logit contributions
 spe+0.027
 bisc+0.027
 pitcher+0.026
etchup+0.025
 extingu+0.025
Neg logit contributions
izes-0.039
 phrases-0.039
 listened-0.038
ative-0.037
 stories-0.037
Token bag
Token ID6131
Feature activation-3.65e-02

Pos logit contributions
 bisc+0.129
 spe+0.127
etchup+0.125
 extingu+0.122
 pitcher+0.121
Neg logit contributions
izes-0.119
 stories-0.119
 questions-0.119
 phrases-0.119
 listened-0.118
Token of
Token ID286
Feature activation-4.01e-02

Pos logit contributions
 bisc+0.069
 pitcher+0.068
 spe+0.065
umbs+0.065
 extingu+0.065
Neg logit contributions
 phrases-0.056
 languages-0.054
izes-0.054
 questions-0.052
 stories-0.050
Token soil
Token ID9260
Feature activation-8.87e-03

Pos logit contributions
 spe+0.060
 bisc+0.059
etchup+0.057
 pitcher+0.057
 extingu+0.055
Neg logit contributions
izes-0.075
 phrases-0.074
 listened-0.072
ative-0.072
 stories-0.072
Token and
Token ID290
Feature activation+1.90e-03

Pos logit contributions
 spe+0.019
 bisc+0.018
 pitcher+0.018
 extingu+0.018
etchup+0.018
Neg logit contributions
 phrases-0.011
izes-0.011
 languages-0.011
 questions-0.011
ative-0.011
Token placed
Token ID4624
Feature activation+7.21e-03

Pos logit contributions
izes+0.003
ative+0.003
 phrases+0.003
 languages+0.003
 questions+0.003
Neg logit contributions
 bisc-0.004
 spe-0.003
 pitcher-0.003
etchup-0.003
 extingu-0.003
Token She
Token ID1375
Feature activation+4.01e-03

Pos logit contributions
 spe+0.006
 extingu+0.005
 bisc+0.005
etchup+0.005
 snowball+0.005
Neg logit contributions
 Everyone-0.003
 everyone-0.002
 Everybody-0.002
 stories-0.002
 learn-0.002
Token used
Token ID973
Feature activation-2.63e-03

Pos logit contributions
ative+0.006
izes+0.006
 phrases+0.006
 questions+0.006
 languages+0.006
Neg logit contributions
 spe-0.007
 pitcher-0.007
 bisc-0.007
etchup-0.006
 funnel-0.006
Token her
Token ID607
Feature activation-2.37e-02

Pos logit contributions
 bisc+0.007
 spe+0.006
etchup+0.006
 pitcher+0.006
 extingu+0.006
Neg logit contributions
 stories-0.002
 questions-0.002
 phrases-0.002
 everyone-0.002
 languages-0.002
Token black
Token ID2042
Feature activation-6.93e-02

Pos logit contributions
 spe+0.040
 bisc+0.039
 pitcher+0.038
etchup+0.038
 funnel+0.036
Neg logit contributions
izes-0.040
 phrases-0.039
 listened-0.038
ative-0.038
 stories-0.038
Token past
Token ID1613
Feature activation-2.79e-02

Pos logit contributions
 bisc+0.107
 spe+0.106
 pitcher+0.102
etchup+0.100
 funnel+0.099
Neg logit contributions
izes-0.127
 phrases-0.123
 listened-0.121
ative-0.120
 questions-0.118
Tokenel
Token ID417
Feature activation-8.12e-02

Pos logit contributions
 bisc+0.046
 spe+0.045
 pitcher+0.043
etchup+0.043
 extingu+0.042
Neg logit contributions
 questions-0.048
 phrases-0.048
 languages-0.047
 stories-0.047
izes-0.047
Token to
Token ID284
Feature activation-1.37e-02

Pos logit contributions
 spe+0.139
 bisc+0.139
etchup+0.132
 pitcher+0.130
 funnel+0.130
Neg logit contributions
izes-0.135
 phrases-0.130
 listened-0.126
ative-0.126
 professor-0.125
Token draw
Token ID3197
Feature activation+3.34e-02

Pos logit contributions
 spe+0.021
 bisc+0.020
 pitcher+0.019
 funnel+0.019
 extingu+0.018
Neg logit contributions
izes-0.027
ative-0.026
 phrases-0.025
mber-0.024
 languages-0.024
Token a
Token ID257
Feature activation-2.12e-02

Pos logit contributions
izes+0.049
 listened+0.047
 phrases+0.047
 elders+0.045
 obey+0.045
Neg logit contributions
 spe-0.063
 bisc-0.062
 snowball-0.059
 funnel-0.058
 pitcher-0.058
Token big
Token ID1263
Feature activation-5.55e-02

Pos logit contributions
 spe+0.036
 bisc+0.036
etchup+0.035
 pitcher+0.034
 extingu+0.034
Neg logit contributions
izes-0.034
 phrases-0.034
 stories-0.034
 listened-0.033
 questions-0.033
Token cat
Token ID3797
Feature activation-2.43e-02

Pos logit contributions
etchup+0.109
 bisc+0.107
 spe+0.107
 extingu+0.105
umbs+0.104
Neg logit contributions
 stories-0.076
 phrases-0.073
 questions-0.072
 kind-0.072
 languages-0.071
Token.
Token ID13
Feature activation-2.08e-03

Pos logit contributions
 spe+0.067
 bisc+0.066
 pitcher+0.064
etchup+0.064
 extingu+0.063
Neg logit contributions
izes-0.028
 phrases-0.027
 listened-0.026
 questions-0.026
 stories-0.025
Token It
Token ID632
Feature activation-1.23e-02

Pos logit contributions
 spe+0.004
 bisc+0.004
etchup+0.004
 extingu+0.004
 snowball+0.004
Neg logit contributions
 phrases-0.002
 stories-0.002
 questions-0.002
izes-0.002
 Everyone-0.002
Token was
Token ID373
Feature activation-1.45e-02

Pos logit contributions
 spe+0.028
 bisc+0.028
etchup+0.027
 pitcher+0.027
 extingu+0.027
Neg logit contributions
izes-0.013
 phrases-0.013
 listened-0.013
 stories-0.012
 questions-0.012
Token brown
Token ID7586
Feature activation-6.21e-02

Pos logit contributions
 bisc+0.032
etchup+0.031
umbs+0.031
 pitcher+0.031
 extingu+0.030
Neg logit contributions
 kind-0.016
 stories-0.016
 everyone-0.016
 listened-0.016
 questions-0.015
Token and
Token ID290
Feature activation-3.15e-02

Pos logit contributions
 spe+0.143
 bisc+0.142
etchup+0.139
 pitcher+0.139
 extingu+0.136
Neg logit contributions
izes-0.063
 phrases-0.062
 listened-0.061
 stories-0.061
 questions-0.060
Token wil
Token ID39716
Feature activation-8.92e-02

Pos logit contributions
 bisc+0.057
 spe+0.055
 pitcher+0.054
etchup+0.054
 extingu+0.054
Neg logit contributions
 listened-0.048
izes-0.047
 stories-0.047
 phrases-0.046
 kind-0.046
Tokented
Token ID1513
Feature activation-3.46e-02

Pos logit contributions
 pitcher+0.171
 bisc+0.170
 spe+0.169
 funnel+0.166
 extingu+0.161
Neg logit contributions
 listened-0.125
izes-0.122
 everyone-0.121
 phrases-0.120
ative-0.119
Token and
Token ID290
Feature activation-2.26e-02

Pos logit contributions
 spe+0.066
 bisc+0.065
 snowball+0.064
umbs+0.064
 pitcher+0.063
Neg logit contributions
izes-0.051
 phrases-0.050
 professor-0.047
 listened-0.045
 languages-0.045
Token disgusting
Token ID23374
Feature activation-2.38e-02

Pos logit contributions
 spe+0.042
 bisc+0.040
etchup+0.040
 pitcher+0.039
 snowball+0.038
Neg logit contributions
izes-0.035
 phrases-0.035
ative-0.034
 languages-0.034
 questions-0.033
Token.
Token ID13
Feature activation-6.29e-04

Pos logit contributions
 spe+0.056
 bisc+0.055
 pitcher+0.054
etchup+0.054
 snowball+0.053
Neg logit contributions
izes-0.024
 phrases-0.024
 listened-0.023
 questions-0.022
ative-0.022
Token She
Token ID1375
Feature activation-2.94e-03

Pos logit contributions
 spe+0.001
 bisc+0.001
etchup+0.001
 extingu+0.001
 snowball+0.001
Neg logit contributions
 Everyone-0.001
 everyone-0.001
 questions-0.001
 phrases-0.001
 stories-0.001
Token in
Token ID287
Feature activation-3.51e-02

Pos logit contributions
 spe+0.114
 pitcher+0.113
 bisc+0.111
etchup+0.111
 extingu+0.104
Neg logit contributions
 phrases-0.088
izes-0.088
 languages-0.085
 listened-0.083
 stories-0.083
Token a
Token ID257
Feature activation-1.42e-02

Pos logit contributions
etchup+0.084
 bisc+0.084
 spe+0.084
 pitcher+0.082
 funnel+0.082
Neg logit contributions
 stories-0.031
 everyone-0.030
 kind-0.030
 phrases-0.030
 listened-0.029
Token huge
Token ID3236
Feature activation-2.82e-02

Pos logit contributions
etchup+0.025
 extingu+0.025
 spe+0.024
 bisc+0.024
umbs+0.024
Neg logit contributions
 kind-0.022
 stories-0.022
 learn-0.021
 questions-0.021
izes-0.021
Token green
Token ID4077
Feature activation-4.85e-02

Pos logit contributions
 bisc+0.045
 spe+0.044
etchup+0.044
 extingu+0.043
 pitcher+0.042
Neg logit contributions
 stories-0.047
 phrases-0.047
izes-0.046
 kind-0.046
 listened-0.046
Token field
Token ID2214
Feature activation-4.36e-03

Pos logit contributions
 spe+0.072
 bisc+0.071
etchup+0.069
 pitcher+0.068
 extingu+0.067
Neg logit contributions
izes-0.088
 phrases-0.088
 stories-0.087
 listened-0.087
 questions-0.086
Token full
Token ID1336
Feature activation-7.44e-02

Pos logit contributions
 spe+0.009
 bisc+0.009
 pitcher+0.009
etchup+0.009
 funnel+0.009
Neg logit contributions
izes-0.005
 phrases-0.005
 questions-0.005
 languages-0.005
 listened-0.005
Token of
Token ID286
Feature activation-3.61e-02

Pos logit contributions
 spe+0.160
 bisc+0.159
 pitcher+0.155
etchup+0.153
 extingu+0.150
Neg logit contributions
izes-0.092
 phrases-0.089
 listened-0.086
 questions-0.084
 stories-0.084
Token other
Token ID584
Feature activation-5.19e-03

Pos logit contributions
 spe+0.059
 bisc+0.059
etchup+0.057
 pitcher+0.056
 extingu+0.055
Neg logit contributions
izes-0.062
 phrases-0.061
 listened-0.059
 stories-0.059
 questions-0.059
Token grapes
Token ID36656
Feature activation-3.24e-03

Pos logit contributions
 spe+0.010
 bisc+0.010
etchup+0.009
 pitcher+0.009
 extingu+0.009
Neg logit contributions
izes-0.007
 phrases-0.007
 stories-0.007
 questions-0.007
 listened-0.007
Token.
Token ID13
Feature activation+8.78e-03

Pos logit contributions
 spe+0.008
 bisc+0.007
etchup+0.007
 pitcher+0.007
 extingu+0.007
Neg logit contributions
izes-0.003
 phrases-0.003
 listened-0.003
 stories-0.003
 questions-0.003
Token The
Token ID383
Feature activation+2.42e-02

Pos logit contributions
 phrases+0.011
 Everyone+0.011
 everyone+0.011
 questions+0.011
izes+0.011
Neg logit contributions
 spe-0.018
 bisc-0.018
 extingu-0.017
etchup-0.017
 pitcher-0.017
Token some
Token ID617
Feature activation+3.01e-03

Pos logit contributions
 spe+0.019
 pitcher+0.019
 bisc+0.019
etchup+0.018
 extingu+0.018
Neg logit contributions
izes-0.012
 phrases-0.012
ative-0.011
 languages-0.011
 questions-0.011
Token of
Token ID286
Feature activation-4.07e-02

Pos logit contributions
izes+0.006
 phrases+0.005
ative+0.005
 professor+0.005
 languages+0.005
Neg logit contributions
 spe-0.005
 pitcher-0.005
etchup-0.005
 bisc-0.005
 funnel-0.004
Token her
Token ID607
Feature activation-2.52e-02

Pos logit contributions
 spe+0.089
etchup+0.087
 pitcher+0.087
 bisc+0.087
 extingu+0.084
Neg logit contributions
izes-0.051
 phrases-0.047
ative-0.046
 listened-0.045
 languages-0.043
Token hair
Token ID4190
Feature activation-2.78e-04

Pos logit contributions
 spe+0.045
 bisc+0.045
 pitcher+0.044
etchup+0.044
 extingu+0.041
Neg logit contributions
izes-0.041
 phrases-0.040
ative-0.039
 listened-0.038
 languages-0.038
Token had
Token ID550
Feature activation-6.73e-02

Pos logit contributions
 spe+0.001
 pitcher+0.001
 bisc+0.001
etchup+0.001
 funnel+0.001
Neg logit contributions
izes-0.000
 phrases-0.000
 questions-0.000
 professor-0.000
 languages-0.000
Token fallen
Token ID9292
Feature activation-8.45e-02

Pos logit contributions
 spe+0.112
 bisc+0.111
etchup+0.107
 pitcher+0.107
 extingu+0.104
Neg logit contributions
izes-0.112
 phrases-0.111
 listened-0.109
 stories-0.108
 questions-0.108
Token into
Token ID656
Feature activation-2.11e-02

Pos logit contributions
 bisc+0.181
 spe+0.177
etchup+0.176
 pitcher+0.171
 snowball+0.166
Neg logit contributions
 stories-0.097
izes-0.095
 everyone-0.093
 listened-0.093
 questions-0.093
Token the
Token ID262
Feature activation-2.73e-02

Pos logit contributions
 bisc+0.046
 spe+0.045
etchup+0.044
 pitcher+0.043
 extingu+0.043
Neg logit contributions
 stories-0.024
 everyone-0.024
 questions-0.024
 kind-0.023
 listened-0.023
Token cookie
Token ID19751
Feature activation-2.50e-02

Pos logit contributions
 spe+0.039
 bisc+0.038
etchup+0.037
 extingu+0.036
 snowball+0.035
Neg logit contributions
 stories-0.051
izes-0.050
 questions-0.050
 listened-0.050
 kind-0.049
Token dough
Token ID15756
Feature activation-6.16e-03

Pos logit contributions
 spe+0.052
 bisc+0.052
etchup+0.050
 pitcher+0.050
 extingu+0.049
Neg logit contributions
izes-0.031
 phrases-0.031
 listened-0.030
 stories-0.030
 questions-0.030
Token.
Token ID13
Feature activation+2.02e-03

Pos logit contributions
 spe+0.015
 bisc+0.014
 pitcher+0.014
etchup+0.014
 extingu+0.014
Neg logit contributions
izes-0.006
 phrases-0.006
 listened-0.006
 stories-0.006
 questions-0.006