TOP ACTIVATIONS
MAX = 4.953

, but it was all fore shadow ed by the kind
big tree with all her bird friends . One day ,
they both have pencil - c ray ons and they draw
needle and thread and carefully se wed Mr . Fl uffy
circles and play with his bird ie friends . One day
silence and enjoyed all his bird adventures . The end .
sing and play with her bird friends . One
s and his book of bird names . He sits on
around and play with his bird friends . One day ,
around and play with his bird ie friends . One day
needle and thread and carefully se wed the hole shut .
a needle and thread and se wed the t eddy bear
thread from her bag and se wed up Lucy 's dress
got some colorful fabric and se wed a beautiful dress for
off to play with her bird ie friends . The end
watched as her mom carefully se wed Ted 's arm back
in the garden and have pic n ics together .
be fun to put her bird in the cage and pretend
watched as his mom carefully se wed the stitch in the
scary hospital with all the x - rays . <|endoftext|> Once

MIN ACTIVATIONS
MIN = -5.621

wanted to see the or nam ents up close . They
said . " The or nam ents are so pretty !
to look at the or nam ents on the tree .
more careful . The or nam ents are not toys .
to me . The or nam ents were very important to
They reached for the or nam ents and touched them softly
" Hello , Mister P ige on !" The
a letter about the or nam ents and asked for one
branches . But the or nam ents were slippery and some
They reached for the or nam ents and tried to take
here and explore the or nam ents ." Da
to help sort the or nam ents . She picked up
had not touched the or nam ents . They hoped Mom
on it . The or nam ents were shiny and colorful
she could carry the or nam ents back with her .
she could carry the or nam ents to the place .
he especially liked the or nam ents he kept around .
She learned that the or nam ents were not for playing
Be careful , these or nam ents are very special .
the lock and the or nam ents . "

INTERVAL 2.309 - 4.953
CONTAINS 0.385%

him . " Be careful ! There might be
's mom took a photo of him when he fell .
her goal ! <|endoftext|> J imm ie was scared . He
sudden he heard a quiet voice calling : " Exc use
day warm and happy . So he grabbed the sack ,

INTERVAL -0.334 - 2.309
CONTAINS 67.816%

two friends laughed and celebrated their victory . From then on
time , there was a little girl named Lily . She
and put on a new dress ," said Lily 's mom
is good at making sounds . L ily opens
funny noises . L ila and Ben cl apped and laughed

INTERVAL -2.977 - -0.334
CONTAINS 31.777%

curious eyes and finally decided he would try the caul iflower
to go home . Now !" L ily and
mom heard her and asked , " What 's wrong ,
But the sheep just kept sleeping and didn 't wake
smile on her face , dreaming about all the cards she

INTERVAL -5.621 - -2.977
CONTAINS 0.021%

and said , " S oph ie , no matter how
have a tum my ac he . You just don 't
on . Mrs Smith opened the door and said
Mom my said " G abby , don 't be afraid
. The books he lf came crashing down on
Token,
Token ID11
Feature activation+7.54e-02

Pos logit contributions
 Tweet+0.032
 Madd+0.031
 Fif+0.029
 Suz+0.029
 Bird+0.028
Neg logit contributions
pron-0.027
 plate-0.026
 forks-0.025
 slice-0.025
 fork-0.024
Token but
Token ID475
Feature activation+4.28e-01

Pos logit contributions
 Tweet+0.003
 Madd+0.003
add+0.003
 Fif+0.003
 Kerry+0.003
Neg logit contributions
 plate-0.003
pron-0.003
 forks-0.003
 fork-0.003
 slice-0.003
Token it
Token ID340
Feature activation-1.66e-01

Pos logit contributions
 Tweet+0.022
 Madd+0.022
add+0.021
rig+0.021
 Bird+0.020
Neg logit contributions
 plate-0.012
 forks-0.012
lf-0.012
 fork-0.012
pron-0.012
Token was
Token ID373
Feature activation+1.05e+00

Pos logit contributions
pron+0.007
lf+0.007
 plate+0.007
 fork+0.007
 forks+0.006
Neg logit contributions
 Tweet-0.007
 Madd-0.007
 Dawn-0.006
 Fif-0.006
 Suz-0.006
Token all
Token ID477
Feature activation-1.45e+00

Pos logit contributions
 Tweet+0.041
 Madd+0.041
 Suz+0.037
 Dawn+0.037
 Fif+0.036
Neg logit contributions
pron-0.048
 plate-0.046
 slice-0.046
 forks-0.045
wy-0.044
Token fore
Token ID1674
Feature activation+4.56e+00

Pos logit contributions
pron+0.054
 plate+0.053
 slice+0.052
 forks+0.052
 fork+0.049
Neg logit contributions
 Madd-0.067
 Tweet-0.066
 Dawn-0.063
 Fif-0.062
 Kerry-0.062
Tokenshadow
Token ID19106
Feature activation-9.99e-01

Pos logit contributions
 Tweet+0.219
 Madd+0.203
 Bird+0.196
 Suz+0.193
 shelter+0.193
Neg logit contributions
 forks-0.174
 plate-0.173
pron-0.173
 fork-0.166
 slice-0.164
Tokened
Token ID276
Feature activation+5.52e-01

Pos logit contributions
pron+0.031
 forks+0.030
wy+0.029
 plate+0.029
 slice+0.029
Neg logit contributions
 Tweet-0.050
 Madd-0.050
 Fif-0.047
 shelter-0.046
 Suz-0.046
Token by
Token ID416
Feature activation-3.26e-01

Pos logit contributions
 Madd+0.026
 Tweet+0.026
 Suz+0.024
 Ph+0.024
 Dawn+0.024
Neg logit contributions
 forks-0.019
 plate-0.018
pron-0.018
 fork-0.018
 slice-0.017
Token the
Token ID262
Feature activation+4.03e-01

Pos logit contributions
 forks+0.011
 plate+0.011
pron+0.011
 slice+0.011
 fork+0.011
Neg logit contributions
 Tweet-0.015
 Madd-0.015
 Kerry-0.014
 shelter-0.014
 Dawn-0.014
Token kind
Token ID1611
Feature activation+6.33e-02

Pos logit contributions
 Tweet+0.019
 Madd+0.019
 Suz+0.018
 Dawn+0.017
 Bird+0.017
Neg logit contributions
 plate-0.016
 forks-0.015
pron-0.015
 slice-0.015
 fork-0.015
Token big
Token ID1263
Feature activation+6.74e-01

Pos logit contributions
pron+0.008
 forks+0.008
 plate+0.008
 slice+0.007
wy+0.007
Neg logit contributions
 Tweet-0.007
 Madd-0.007
 Bird-0.006
 Suz-0.006
 shelter-0.006
Token tree
Token ID5509
Feature activation+1.22e+00

Pos logit contributions
 Tweet+0.026
 Madd+0.024
 Bird+0.023
 Suz+0.022
 shelter+0.022
Neg logit contributions
pron-0.032
 forks-0.032
 plate-0.031
 slice-0.031
wy-0.030
Token with
Token ID351
Feature activation-2.77e-01

Pos logit contributions
 Tweet+0.048
 Madd+0.044
 Bird+0.042
 shelter+0.041
 Suz+0.041
Neg logit contributions
pron-0.057
 forks-0.057
 plate-0.056
 slice-0.054
wy-0.054
Token all
Token ID477
Feature activation-1.51e+00

Pos logit contributions
pron+0.013
 plate+0.012
 forks+0.012
wy+0.012
 slice+0.012
Neg logit contributions
 Tweet-0.013
 Madd-0.011
 Bird-0.011
 Suz-0.011
 shelter-0.010
Token her
Token ID607
Feature activation+6.35e-01

Pos logit contributions
pron+0.067
 forks+0.065
 plate+0.065
 slice+0.063
wy+0.062
Neg logit contributions
 Tweet-0.066
 Madd-0.062
 Bird-0.058
 Suz-0.058
 shelter-0.056
Token bird
Token ID6512
Feature activation+4.46e+00

Pos logit contributions
 Tweet+0.025
 Madd+0.025
 Suz+0.023
 Dawn+0.023
 Ph+0.023
Neg logit contributions
 forks-0.028
 plate-0.028
pron-0.028
 slice-0.027
 fork-0.027
Token friends
Token ID2460
Feature activation+6.88e-01

Pos logit contributions
 Madd+0.183
 Tweet+0.181
 Suz+0.169
 Ph+0.167
 Dawn+0.167
Neg logit contributions
pron-0.191
 plate-0.188
 forks-0.186
 slice-0.184
 fork-0.177
Token.
Token ID13
Feature activation+5.05e-01

Pos logit contributions
 Tweet+0.030
 Madd+0.028
 Bird+0.027
 Suz+0.026
 shelter+0.026
Neg logit contributions
pron-0.030
 forks-0.029
 plate-0.029
 slice-0.028
wy-0.027
Token One
Token ID1881
Feature activation+1.68e-01

Pos logit contributions
 rou+0.024
 regret+0.023
rig+0.023
 cal+0.023
 worse+0.022
Neg logit contributions
ossible-0.017
>-0.014
 plate-0.014
 forks-0.014
 slice-0.013
Token day
Token ID1110
Feature activation-1.05e+00

Pos logit contributions
 Madd+0.009
 Tweet+0.009
 Dawn+0.009
 Kel+0.008
 Beth+0.008
Neg logit contributions
pron-0.005
 plate-0.005
 slice-0.005
 fork-0.005
ossible-0.005
Token,
Token ID11
Feature activation+2.44e-01

Pos logit contributions
pron+0.045
 forks+0.044
 plate+0.044
 slice+0.043
wy+0.042
Neg logit contributions
 Tweet-0.046
 Madd-0.043
 Bird-0.040
 Suz-0.040
 shelter-0.039
Token they
Token ID484
Feature activation-8.42e-01

Pos logit contributions
 Tweet+0.046
 Madd+0.043
 Bird+0.041
 Suz+0.040
 shelter+0.039
Neg logit contributions
pron-0.052
 plate-0.049
wy-0.049
 slice-0.048
 forks-0.048
Token both
Token ID1111
Feature activation-2.57e+00

Pos logit contributions
pron+0.044
 plate+0.039
ucer+0.039
lf+0.039
Adam+0.038
Neg logit contributions
 Tweet-0.028
 shelter-0.024
 Madd-0.023
 Bird-0.023
 Suz-0.022
Token have
Token ID423
Feature activation+1.50e-02

Pos logit contributions
pron+0.126
 tripod+0.117
ucer+0.116
 plate+0.114
lf+0.113
Neg logit contributions
 Tweet-0.090
 Suz-0.079
 regret-0.078
 Madd-0.078
 Bird-0.078
Token pencil
Token ID21613
Feature activation-6.61e-01

Pos logit contributions
 Tweet+0.001
 Madd+0.001
 shelter+0.001
 Suz+0.000
 Bird+0.000
Neg logit contributions
pron-0.001
 plate-0.001
wy-0.001
ucer-0.001
 tripod-0.001
Token-
Token ID12
Feature activation+3.54e+00

Pos logit contributions
pron+0.025
 plate+0.024
 forks+0.024
 slice+0.023
wy+0.023
Neg logit contributions
 Tweet-0.032
 Madd-0.030
 Bird-0.029
 Suz-0.029
 shelter-0.028
Tokenc
Token ID66
Feature activation+4.45e+00

Pos logit contributions
 Tweet+0.181
 Madd+0.170
 Bird+0.162
 Suz+0.162
 shelter+0.157
Neg logit contributions
pron-0.127
 plate-0.124
 forks-0.122
 slice-0.116
 fork-0.115
Tokenray
Token ID2433
Feature activation+2.81e-01

Pos logit contributions
 Tweet+0.228
 Madd+0.224
 Bird+0.209
 Dawn+0.208
 shelter+0.208
Neg logit contributions
pron-0.150
 forks-0.147
 plate-0.145
 slice-0.144
wy-0.139
Tokenons
Token ID684
Feature activation-8.23e-02

Pos logit contributions
 Tweet+0.012
 Madd+0.012
 Bird+0.011
 Suz+0.011
 shelter+0.011
Neg logit contributions
pron-0.012
 forks-0.011
 plate-0.011
 slice-0.011
wy-0.011
Token and
Token ID290
Feature activation+6.91e-01

Pos logit contributions
pron+0.004
 forks+0.004
 plate+0.004
 slice+0.004
wy+0.003
Neg logit contributions
 Tweet-0.003
 Madd-0.003
 Bird-0.003
 Suz-0.003
 shelter-0.003
Token they
Token ID484
Feature activation+3.32e-01

Pos logit contributions
 Tweet+0.035
 Madd+0.032
 Bird+0.031
 Suz+0.031
 shelter+0.031
Neg logit contributions
pron-0.025
 plate-0.023
wy-0.023
 forks-0.022
ucer-0.022
Token draw
Token ID3197
Feature activation+6.68e-01

Pos logit contributions
 Tweet+0.013
 Bird+0.011
 Madd+0.011
 shelter+0.011
 Suz+0.011
Neg logit contributions
pron-0.015
 plate-0.014
 forks-0.014
lf-0.014
wy-0.014
Token needle
Token ID17598
Feature activation+1.34e+00

Pos logit contributions
 forks+0.000
pron+0.000
 plate+0.000
wy+0.000
 slice+0.000
Neg logit contributions
 Tweet-0.000
 Madd-0.000
 Bird-0.000
 Suz-0.000
 shelter-0.000
Token and
Token ID290
Feature activation+2.79e-01

Pos logit contributions
 Tweet+0.053
 Madd+0.051
 Suz+0.047
 Bird+0.047
 Dawn+0.045
Neg logit contributions
pron-0.062
 plate-0.061
 forks-0.061
 slice-0.059
 fork-0.058
Token thread
Token ID4704
Feature activation+1.34e+00

Pos logit contributions
 Tweet+0.013
 Madd+0.012
 Bird+0.011
 Suz+0.011
 Dawn+0.011
Neg logit contributions
pron-0.011
 forks-0.011
 plate-0.011
 slice-0.011
 fork-0.011
Token and
Token ID290
Feature activation+4.38e-01

Pos logit contributions
 Tweet+0.059
 Madd+0.058
 Bird+0.054
 Suz+0.054
 Fif+0.053
Neg logit contributions
pron-0.053
 plate-0.052
 forks-0.052
 slice-0.050
wy-0.048
Token carefully
Token ID7773
Feature activation+1.79e-01

Pos logit contributions
 Tweet+0.020
 Madd+0.020
 Bird+0.018
 Dawn+0.018
 Suz+0.018
Neg logit contributions
pron-0.017
 forks-0.017
 plate-0.017
 slice-0.017
 fork-0.016
Token se
Token ID384
Feature activation+4.39e+00

Pos logit contributions
 Tweet+0.008
 Madd+0.008
 Bird+0.008
 Suz+0.008
 Dawn+0.007
Neg logit contributions
pron-0.007
 forks-0.007
 plate-0.007
 slice-0.007
 fork-0.006
Tokenwed
Token ID19103
Feature activation-1.14e+00

Pos logit contributions
 Tweet+0.253
 Madd+0.240
 Suz+0.231
 Bird+0.230
 shelter+0.227
Neg logit contributions
pron-0.126
 plate-0.123
 forks-0.123
 slice-0.114
 fork-0.112
Token Mr
Token ID1770
Feature activation-9.99e-01

Pos logit contributions
 forks+0.040
 plate+0.038
pron+0.038
wy+0.036
 slice+0.035
Neg logit contributions
 Tweet-0.055
 Madd-0.054
 Bird-0.052
 shelter-0.051
 Bees-0.051
Token.
Token ID13
Feature activation-1.47e+00

Pos logit contributions
 plate+0.033
 forks+0.031
 slice+0.031
 fork+0.031
pron+0.031
Neg logit contributions
 Madd-0.054
 Tweet-0.052
 Suz-0.050
 Dawn-0.050
 Ph-0.049
Token Fl
Token ID1610
Feature activation+7.16e-01

Pos logit contributions
pron+0.078
 plate+0.077
 forks+0.077
 slice+0.075
 fork+0.073
Neg logit contributions
 Tweet-0.048
 Madd-0.045
 Suz-0.041
 Bird-0.041
 shelter-0.040
Tokenuffy
Token ID15352
Feature activation+6.19e-01

Pos logit contributions
 Tweet+0.025
 Madd+0.024
 Suz+0.022
 Bird+0.021
 Dawn+0.021
Neg logit contributions
pron-0.037
 plate-0.036
 forks-0.036
 slice-0.035
 fork-0.035
Token circles
Token ID13332
Feature activation+4.75e-01

Pos logit contributions
 plate+0.037
 forks+0.037
 slice+0.035
 fork+0.034
pron+0.034
Neg logit contributions
 Madd-0.058
 Tweet-0.057
 Dawn-0.054
 Suz-0.053
 Kel-0.052
Token and
Token ID290
Feature activation-1.11e-01

Pos logit contributions
 Tweet+0.020
 Madd+0.019
 Bird+0.018
 Suz+0.018
 shelter+0.017
Neg logit contributions
pron-0.021
 forks-0.021
 plate-0.021
 slice-0.020
 fork-0.019
Token play
Token ID711
Feature activation-4.45e-02

Pos logit contributions
pron+0.005
 forks+0.005
 plate+0.005
 slice+0.005
 fork+0.005
Neg logit contributions
 Tweet-0.004
 Madd-0.004
 Bird-0.004
 Suz-0.004
 shelter-0.004
Token with
Token ID351
Feature activation-3.04e-01

Pos logit contributions
pron+0.002
 forks+0.002
 plate+0.002
 slice+0.002
wy+0.002
Neg logit contributions
 Tweet-0.002
 Madd-0.002
 Bird-0.002
 Suz-0.002
 shelter-0.002
Token his
Token ID465
Feature activation+5.31e-02

Pos logit contributions
 forks+0.010
pron+0.010
 plate+0.010
 slice+0.010
 fork+0.009
Neg logit contributions
 Tweet-0.015
 Madd-0.015
 Suz-0.014
 Bird-0.014
 Dawn-0.014
Token bird
Token ID6512
Feature activation+4.37e+00

Pos logit contributions
 Tweet+0.002
 Madd+0.002
 Suz+0.002
 Bird+0.002
 Dawn+0.002
Neg logit contributions
pron-0.003
 plate-0.002
 forks-0.002
 slice-0.002
 fork-0.002
Tokenie
Token ID494
Feature activation+1.84e+00

Pos logit contributions
 Madd+0.168
 Tweet+0.167
 Suz+0.154
 Dawn+0.151
 Bird+0.149
Neg logit contributions
pron-0.200
 plate-0.195
 forks-0.193
 slice-0.191
 fork-0.185
Token friends
Token ID2460
Feature activation+6.50e-01

Pos logit contributions
 Tweet+0.080
 Madd+0.073
 Bird+0.071
 shelter+0.069
 Bug+0.068
Neg logit contributions
pron-0.080
 forks-0.078
 plate-0.077
 slice-0.076
wy-0.075
Token.
Token ID13
Feature activation+5.63e-01

Pos logit contributions
 Tweet+0.023
 Madd+0.020
 Bird+0.020
 shelter+0.019
 Bug+0.018
Neg logit contributions
pron-0.035
 forks-0.034
wy-0.033
 plate-0.033
 slice-0.033
Token One
Token ID1881
Feature activation+1.04e-01

Pos logit contributions
 Madd+0.026
 Suz+0.024
 shelter+0.024
 rou+0.023
rig+0.023
Neg logit contributions
ossible-0.019
pron-0.018
 plate-0.018
 slice-0.018
 forks-0.017
Token day
Token ID1110
Feature activation-1.12e+00

Pos logit contributions
 Tweet+0.005
 Madd+0.005
 Dawn+0.005
 Suz+0.005
 Bird+0.005
Neg logit contributions
pron-0.003
 plate-0.003
 slice-0.003
 forks-0.003
ossible-0.003
Token silence
Token ID9550
Feature activation-8.68e-01

Pos logit contributions
 plate+0.028
pron+0.028
 slice+0.027
 forks+0.027
 fork+0.027
Neg logit contributions
 Madd-0.021
 Tweet-0.021
 Suz-0.019
 Dawn-0.018
 Fif-0.018
Token and
Token ID290
Feature activation+1.03e-02

Pos logit contributions
pron+0.039
 plate+0.039
 forks+0.038
 slice+0.038
 fork+0.037
Neg logit contributions
 Tweet-0.036
 Madd-0.034
 Bird-0.031
 Suz-0.031
 Dawn-0.031
Token enjoyed
Token ID8359
Feature activation-2.13e+00

Pos logit contributions
 Tweet+0.000
 Madd+0.000
 Dawn+0.000
 Suz+0.000
 Bird+0.000
Neg logit contributions
 forks-0.000
pron-0.000
 plate-0.000
 slice-0.000
 fork-0.000
Token all
Token ID477
Feature activation-1.60e+00

Pos logit contributions
pron+0.081
 forks+0.080
 plate+0.079
 slice+0.076
 fork+0.074
Neg logit contributions
 Tweet-0.102
 Madd-0.097
 Suz-0.091
 Bird-0.091
 shelter-0.089
Token his
Token ID465
Feature activation+6.55e-01

Pos logit contributions
 forks+0.051
 plate+0.050
pron+0.050
 slice+0.049
 fork+0.047
Neg logit contributions
 Tweet-0.083
 Madd-0.079
 Bird-0.075
 Dawn-0.074
 Suz-0.073
Token bird
Token ID6512
Feature activation+4.34e+00

Pos logit contributions
 Tweet+0.030
 Madd+0.029
 Ph+0.028
 Dawn+0.028
 Suz+0.028
Neg logit contributions
 forks-0.026
pron-0.025
 plate-0.025
 fork-0.025
 slice-0.025
Token adventures
Token ID17545
Feature activation+1.46e+00

Pos logit contributions
 Madd+0.204
 Tweet+0.203
 Ph+0.194
 Dawn+0.192
 Fif+0.191
Neg logit contributions
 plate-0.157
pron-0.156
 slice-0.155
 forks-0.150
lf-0.144
Token.
Token ID13
Feature activation+3.49e-01

Pos logit contributions
 Tweet+0.071
 Madd+0.070
 Suz+0.065
 Bird+0.065
 Fif+0.064
Neg logit contributions
pron-0.052
 plate-0.052
 forks-0.051
 slice-0.049
lf-0.049
Token The
Token ID383
Feature activation+1.57e+00

Pos logit contributions
 Tweet+0.013
 Madd+0.013
 shelter+0.012
 Bird+0.012
 Suz+0.012
Neg logit contributions
pron-0.016
 plate-0.016
 forks-0.016
 slice-0.015
ossible-0.015
Token end
Token ID886
Feature activation-1.25e+00

Pos logit contributions
 Tweet+0.073
 Madd+0.071
 Suz+0.067
 Dawn+0.064
 Fif+0.064
Neg logit contributions
pron-0.064
 plate-0.063
 slice-0.063
 forks-0.062
 fork-0.060
Token.
Token ID13
Feature activation+2.38e-01

Pos logit contributions
pron+0.040
 plate+0.038
 slice+0.037
 fork+0.036
 forks+0.036
Neg logit contributions
 Tweet-0.068
 Madd-0.067
 Dawn-0.063
 Ph-0.062
 Fif-0.061
Token sing
Token ID1702
Feature activation+1.28e+00

Pos logit contributions
pron+0.020
 plate+0.020
 slice+0.019
 forks+0.019
 fork+0.019
Neg logit contributions
 Tweet-0.019
 Madd-0.019
 Suz-0.017
 Dawn-0.017
 Bird-0.017
Token and
Token ID290
Feature activation+1.22e-01

Pos logit contributions
 Tweet+0.053
 Madd+0.050
 Bird+0.047
 Suz+0.047
 shelter+0.046
Neg logit contributions
pron-0.057
 forks-0.056
 plate-0.056
 slice-0.054
 fork-0.053
Token play
Token ID711
Feature activation+1.81e-01

Pos logit contributions
 Tweet+0.005
 Madd+0.005
 Suz+0.005
 Bird+0.005
 shelter+0.004
Neg logit contributions
pron-0.005
 forks-0.005
 plate-0.005
 slice-0.005
 fork-0.005
Token with
Token ID351
Feature activation-1.52e-01

Pos logit contributions
 Tweet+0.008
 Madd+0.007
 Bird+0.007
 shelter+0.007
 Suz+0.007
Neg logit contributions
pron-0.008
 forks-0.008
 plate-0.008
 slice-0.008
wy-0.008
Token her
Token ID607
Feature activation+9.82e-02

Pos logit contributions
 forks+0.005
 plate+0.005
 fork+0.004
lf+0.004
 slice+0.004
Neg logit contributions
 Madd-0.007
 rou-0.007
rig-0.007
 Suz-0.007
 Tweet-0.007
Token bird
Token ID6512
Feature activation+4.33e+00

Pos logit contributions
 Tweet+0.004
 Madd+0.003
 Bird+0.003
 Suz+0.003
 shelter+0.003
Neg logit contributions
pron-0.005
 forks-0.005
 plate-0.005
 slice-0.005
wy-0.005
Token friends
Token ID2460
Feature activation+5.90e-01

Pos logit contributions
 Madd+0.177
 Tweet+0.170
 Suz+0.163
 Dawn+0.161
 Ph+0.161
Neg logit contributions
pron-0.189
 plate-0.183
 forks-0.179
 slice-0.178
lf-0.174
Token.
Token ID13
Feature activation+8.21e-01

Pos logit contributions
 Tweet+0.023
 Madd+0.021
 Bird+0.020
 shelter+0.019
 Bug+0.019
Neg logit contributions
pron-0.029
 forks-0.028
 slice-0.028
wy-0.028
 plate-0.028
Token
Token ID198
Feature activation+3.85e-01

Pos logit contributions
 rou+0.038
 Madd+0.036
 cal+0.036
rig+0.035
 regret+0.035
Neg logit contributions
ossible-0.030
>-0.025
 plate-0.024
 slice-0.024
pron-0.024
Token
Token ID198
Feature activation+9.17e-02

Pos logit contributions
 Tweet+0.015
 Madd+0.015
 Suz+0.014
 Bird+0.014
 Dawn+0.013
Neg logit contributions
pron-0.017
 plate-0.017
ossible-0.016
 slice-0.016
 forks-0.016
TokenOne
Token ID3198
Feature activation-4.48e-01

Pos logit contributions
 Madd+0.005
 Tweet+0.005
 Suz+0.005
 Dawn+0.005
 Bird+0.005
Neg logit contributions
pron-0.003
ossible-0.002
 slice-0.002
 plate-0.002
lf-0.002
Tokens
Token ID82
Feature activation+9.87e-01

Pos logit contributions
 Tweet+0.042
 Madd+0.041
 Suz+0.039
 Bird+0.039
 shelter+0.038
Neg logit contributions
pron-0.025
 forks-0.025
 plate-0.025
 slice-0.024
 fork-0.023
Token and
Token ID290
Feature activation+8.49e-01

Pos logit contributions
 Tweet+0.042
 Madd+0.040
 Suz+0.038
 Bird+0.036
 Dawn+0.036
Neg logit contributions
 plate-0.043
 forks-0.042
pron-0.042
 slice-0.041
 fork-0.040
Token his
Token ID465
Feature activation+9.10e-01

Pos logit contributions
 Tweet+0.040
 Madd+0.038
 Bird+0.036
 Suz+0.036
 shelter+0.035
Neg logit contributions
pron-0.033
 plate-0.032
 forks-0.032
 slice-0.031
wy-0.031
Token book
Token ID1492
Feature activation+1.61e+00

Pos logit contributions
 Tweet+0.045
 Madd+0.044
 Suz+0.042
 Bird+0.041
 Bug+0.040
Neg logit contributions
 plate-0.033
 forks-0.032
pron-0.032
 slice-0.031
 fork-0.031
Token of
Token ID286
Feature activation+2.18e+00

Pos logit contributions
 Tweet+0.077
 Madd+0.076
 Suz+0.071
 Bird+0.070
 Dawn+0.069
Neg logit contributions
 plate-0.061
pron-0.061
 forks-0.060
 slice-0.058
 fork-0.057
Token bird
Token ID6512
Feature activation+4.31e+00

Pos logit contributions
 Madd+0.101
 Tweet+0.098
 Suz+0.095
rig+0.094
 Bug+0.091
Neg logit contributions
 forks-0.085
 plate-0.082
 fork-0.077
 slice-0.077
 dice-0.076
Token names
Token ID3891
Feature activation+3.85e-01

Pos logit contributions
 Madd+0.182
 Tweet+0.175
 Suz+0.167
 Dawn+0.164
 Ph+0.164
Neg logit contributions
 plate-0.184
 forks-0.179
pron-0.176
 slice-0.176
 fork-0.169
Token.
Token ID13
Feature activation+6.90e-01

Pos logit contributions
 Tweet+0.017
 Madd+0.016
 Bird+0.015
 Suz+0.015
 shelter+0.015
Neg logit contributions
pron-0.016
 forks-0.016
 plate-0.016
 slice-0.015
wy-0.015
Token He
Token ID679
Feature activation+1.72e-01

Pos logit contributions
 Tweet+0.027
 Madd+0.026
 Bird+0.024
 Suz+0.024
 shelter+0.023
Neg logit contributions
pron-0.033
 forks-0.032
 plate-0.032
 slice-0.031
wy-0.030
Token sits
Token ID10718
Feature activation-7.33e-01

Pos logit contributions
 Tweet+0.009
 Madd+0.009
 Suz+0.009
 Dawn+0.008
 Bird+0.008
Neg logit contributions
 forks-0.006
 plate-0.006
 fork-0.006
pron-0.005
 slice-0.005
Token on
Token ID319
Feature activation-8.66e-01

Pos logit contributions
 forks+0.032
 plate+0.031
pron+0.031
 slice+0.030
wy+0.030
Neg logit contributions
 Tweet-0.032
 Madd-0.028
 Bird-0.028
 Suz-0.027
 shelter-0.027
Token around
Token ID1088
Feature activation-1.10e+00

Pos logit contributions
 Madd+0.105
 Tweet+0.100
 Suz+0.099
 Ph+0.097
 Dawn+0.096
Neg logit contributions
 plate-0.067
pron-0.066
 forks-0.066
 slice-0.064
 fork-0.064
Token and
Token ID290
Feature activation-2.31e-01

Pos logit contributions
pron+0.044
 plate+0.043
 forks+0.043
 slice+0.041
 fork+0.041
Neg logit contributions
 Tweet-0.049
 Madd-0.048
 Suz-0.045
 Dawn-0.044
 Bird-0.044
Token play
Token ID711
Feature activation-2.10e-02

Pos logit contributions
pron+0.011
 forks+0.011
 plate+0.010
 slice+0.010
wy+0.010
Neg logit contributions
 Tweet-0.009
 Madd-0.009
 Bird-0.008
 Suz-0.008
 shelter-0.008
Token with
Token ID351
Feature activation-2.42e-01

Pos logit contributions
pron+0.001
 forks+0.001
 plate+0.001
 slice+0.001
wy+0.001
Neg logit contributions
 Tweet-0.001
 Madd-0.001
 Bird-0.001
 Suz-0.001
 shelter-0.001
Token his
Token ID465
Feature activation+1.72e-01

Pos logit contributions
 forks+0.008
 plate+0.008
pron+0.007
 slice+0.007
 fork+0.007
Neg logit contributions
 Madd-0.012
 Tweet-0.012
 Suz-0.012
 Dawn-0.012
 Bird-0.011
Token bird
Token ID6512
Feature activation+4.29e+00

Pos logit contributions
 Madd+0.007
 Tweet+0.007
 Suz+0.006
 Ph+0.006
 Dawn+0.006
Neg logit contributions
pron-0.008
 plate-0.008
 forks-0.008
 slice-0.007
 fork-0.007
Token friends
Token ID2460
Feature activation+6.26e-01

Pos logit contributions
 Madd+0.166
 Tweet+0.164
 Suz+0.155
 Dawn+0.150
 Ph+0.148
Neg logit contributions
pron-0.195
 plate-0.191
 forks-0.189
 slice-0.186
 fork-0.181
Token.
Token ID13
Feature activation+6.28e-01

Pos logit contributions
 Tweet+0.023
 Madd+0.020
 Bird+0.020
 Bug+0.019
 shelter+0.019
Neg logit contributions
pron-0.033
 forks-0.031
 plate-0.031
 slice-0.031
wy-0.031
Token One
Token ID1881
Feature activation+2.29e-01

Pos logit contributions
 Madd+0.028
 Suz+0.026
 shelter+0.026
rig+0.025
 cal+0.025
Neg logit contributions
ossible-0.022
 plate-0.021
 slice-0.020
 forks-0.020
pron-0.020
Token day
Token ID1110
Feature activation-1.02e+00

Pos logit contributions
 Tweet+0.012
 Madd+0.012
 Dawn+0.011
 Suz+0.011
 Bird+0.011
Neg logit contributions
pron-0.008
 plate-0.007
 slice-0.007
 forks-0.007
 fork-0.007
Token,
Token ID11
Feature activation+2.65e-01

Pos logit contributions
pron+0.049
 plate+0.048
 forks+0.048
 slice+0.046
wy+0.046
Neg logit contributions
 Tweet-0.042
 Madd-0.038
 Bird-0.036
 Bug-0.035
 shelter-0.035
Token around
Token ID1088
Feature activation-9.26e-01

Pos logit contributions
 Madd+0.107
 Tweet+0.104
 Suz+0.101
 Dawn+0.098
 Ph+0.097
Neg logit contributions
pron-0.076
 plate-0.076
 forks-0.075
 slice-0.073
 fork-0.072
Token and
Token ID290
Feature activation-2.75e-01

Pos logit contributions
pron+0.037
 plate+0.037
 forks+0.037
 slice+0.035
 fork+0.035
Neg logit contributions
 Tweet-0.041
 Madd-0.040
 Suz-0.038
 Bird-0.037
 Dawn-0.037
Token play
Token ID711
Feature activation+1.31e-01

Pos logit contributions
pron+0.013
 forks+0.012
 plate+0.012
 slice+0.012
wy+0.012
Neg logit contributions
 Tweet-0.011
 Madd-0.010
 Bird-0.010
 Suz-0.010
 shelter-0.010
Token with
Token ID351
Feature activation-2.74e-01

Pos logit contributions
 Tweet+0.006
 Madd+0.005
 Bird+0.005
 shelter+0.005
 Bug+0.005
Neg logit contributions
pron-0.006
 forks-0.006
 plate-0.006
 slice-0.006
wy-0.006
Token his
Token ID465
Feature activation-5.00e-03

Pos logit contributions
 forks+0.009
 plate+0.009
pron+0.009
 fork+0.008
 slice+0.008
Neg logit contributions
 Madd-0.014
 Tweet-0.014
 Suz-0.013
 Dawn-0.013
 Bird-0.013
Token bird
Token ID6512
Feature activation+4.26e+00

Pos logit contributions
pron+0.000
 plate+0.000
 forks+0.000
 slice+0.000
 fork+0.000
Neg logit contributions
 Tweet-0.000
 Madd-0.000
 Suz-0.000
 Dawn-0.000
 Ph-0.000
Tokenie
Token ID494
Feature activation+1.92e+00

Pos logit contributions
 Madd+0.169
 Tweet+0.162
 Suz+0.157
 Dawn+0.154
 Ph+0.153
Neg logit contributions
pron-0.190
 plate-0.184
 forks-0.181
 slice-0.181
 fork-0.175
Token friends
Token ID2460
Feature activation+5.41e-01

Pos logit contributions
 Tweet+0.088
 Madd+0.084
 Suz+0.079
 Bird+0.079
 shelter+0.077
Neg logit contributions
pron-0.078
 forks-0.076
 plate-0.076
 slice-0.074
 fork-0.072
Token.
Token ID13
Feature activation+3.58e-01

Pos logit contributions
 Tweet+0.021
 Madd+0.019
 Bird+0.019
 Bug+0.018
 shelter+0.018
Neg logit contributions
pron-0.027
 forks-0.026
 plate-0.026
 slice-0.025
wy-0.025
Token One
Token ID1881
Feature activation+6.17e-02

Pos logit contributions
 Madd+0.016
rig+0.016
 rou+0.016
 cal+0.016
 regret+0.016
Neg logit contributions
ossible-0.012
 plate-0.011
pron-0.011
 slice-0.010
 forks-0.010
Token day
Token ID1110
Feature activation-1.12e+00

Pos logit contributions
 Tweet+0.003
 Madd+0.003
 Dawn+0.003
 Suz+0.003
 Kel+0.003
Neg logit contributions
pron-0.002
 plate-0.002
 slice-0.002
ossible-0.002
 fork-0.002
Token needle
Token ID17598
Feature activation+1.61e+00

Pos logit contributions
 Tweet+0.020
 Madd+0.019
 Bird+0.019
 Cro+0.018
 Suz+0.018
Neg logit contributions
 forks-0.018
 plate-0.018
 fork-0.017
pron-0.016
 slice-0.016
Token and
Token ID290
Feature activation+1.32e-01

Pos logit contributions
 Tweet+0.072
 Madd+0.070
 Fif+0.065
 Suz+0.065
 Dawn+0.065
Neg logit contributions
 plate-0.064
 forks-0.063
pron-0.062
 slice-0.061
 fork-0.059
Token thread
Token ID4704
Feature activation+1.30e+00

Pos logit contributions
 Tweet+0.006
 Madd+0.006
 Dawn+0.005
 Fif+0.005
 Ph+0.005
Neg logit contributions
 forks-0.005
 plate-0.005
pron-0.005
 slice-0.005
 fork-0.005
Token and
Token ID290
Feature activation+3.57e-01

Pos logit contributions
 Tweet+0.061
 Madd+0.060
 Fif+0.056
 Suz+0.056
 Bird+0.056
Neg logit contributions
 forks-0.047
 plate-0.047
pron-0.046
 slice-0.045
 fork-0.042
Token carefully
Token ID7773
Feature activation+3.26e-02

Pos logit contributions
 Tweet+0.016
 Madd+0.016
 Dawn+0.015
 Bird+0.015
 Fif+0.015
Neg logit contributions
 forks-0.014
 plate-0.014
pron-0.014
 slice-0.013
 fork-0.013
Token se
Token ID384
Feature activation+4.26e+00

Pos logit contributions
 Tweet+0.002
 Madd+0.002
 Bird+0.001
 Suz+0.001
 Dawn+0.001
Neg logit contributions
pron-0.001
 forks-0.001
 plate-0.001
 slice-0.001
 fork-0.001
Tokenwed
Token ID19103
Feature activation-1.31e+00

Pos logit contributions
 Tweet+0.237
 Madd+0.223
 Suz+0.216
 Bird+0.213
 shelter+0.211
Neg logit contributions
pron-0.131
 plate-0.130
 forks-0.129
 fork-0.119
 slice-0.118
Token the
Token ID262
Feature activation+3.35e-01

Pos logit contributions
 forks+0.043
 plate+0.040
 fork+0.038
pron+0.038
wy+0.038
Neg logit contributions
 Tweet-0.066
 Madd-0.065
 Bird-0.064
 Bees-0.063
 Fif-0.062
Token hole
Token ID7604
Feature activation-5.76e-01

Pos logit contributions
 Tweet+0.014
 Madd+0.013
 Suz+0.012
 Bird+0.012
 Dawn+0.012
Neg logit contributions
pron-0.015
 plate-0.015
 forks-0.015
 slice-0.014
 fork-0.014
Token shut
Token ID4423
Feature activation+1.38e-01

Pos logit contributions
 forks+0.022
 plate+0.022
pron+0.021
 slice+0.021
 fork+0.020
Neg logit contributions
 Tweet-0.028
 Madd-0.026
 Bird-0.025
 Bug-0.024
 Dawn-0.024
Token.
Token ID13
Feature activation+1.06e-01

Pos logit contributions
 Tweet+0.006
 Madd+0.006
 Bird+0.006
 Suz+0.006
 shelter+0.006
Neg logit contributions
pron-0.005
 forks-0.005
 plate-0.005
 slice-0.005
 fork-0.005
Token a
Token ID257
Feature activation-1.30e-02

Pos logit contributions
pron+0.009
 forks+0.008
 plate+0.008
 slice+0.008
 fork+0.008
Neg logit contributions
 Tweet-0.009
 Madd-0.008
 Bird-0.008
 Suz-0.008
 shelter-0.007
Token needle
Token ID17598
Feature activation+1.38e+00

Pos logit contributions
pron+0.001
 forks+0.001
 plate+0.000
wy+0.000
 slice+0.000
Neg logit contributions
 Tweet-0.001
 Madd-0.001
 Bird-0.001
 Suz-0.001
 shelter-0.001
Token and
Token ID290
Feature activation+2.92e-01

Pos logit contributions
 Tweet+0.055
 Madd+0.053
 Suz+0.048
 Bird+0.048
 Dawn+0.047
Neg logit contributions
pron-0.063
 plate-0.063
 forks-0.062
 slice-0.061
 fork-0.059
Token thread
Token ID4704
Feature activation+1.18e+00

Pos logit contributions
 Tweet+0.013
 Madd+0.013
 Dawn+0.012
 Suz+0.012
 Bird+0.012
Neg logit contributions
 forks-0.012
pron-0.012
 plate-0.011
 slice-0.011
 fork-0.011
Token and
Token ID290
Feature activation+4.14e-01

Pos logit contributions
 Madd+0.054
 Tweet+0.054
 Fif+0.050
 Dawn+0.050
 Suz+0.050
Neg logit contributions
 plate-0.042
 forks-0.042
pron-0.042
 slice-0.041
ucer-0.039
Token se
Token ID384
Feature activation+4.26e+00

Pos logit contributions
 Tweet+0.019
 Madd+0.019
 Dawn+0.018
 Fif+0.017
 Suz+0.017
Neg logit contributions
 forks-0.016
pron-0.016
 plate-0.016
 slice-0.015
 fork-0.015
Tokenwed
Token ID19103
Feature activation-1.16e+00

Pos logit contributions
 Tweet+0.233
 Madd+0.215
 Suz+0.211
 Bird+0.206
 shelter+0.206
Neg logit contributions
pron-0.137
 plate-0.136
 forks-0.135
 fork-0.124
 slice-0.121
Token the
Token ID262
Feature activation+5.89e-01

Pos logit contributions
 forks+0.039
 plate+0.037
pron+0.037
wy+0.036
 slice+0.036
Neg logit contributions
 Tweet-0.057
 Madd-0.056
 Bird-0.053
 Suz-0.053
 shelter-0.053
Token t
Token ID256
Feature activation-1.10e+00

Pos logit contributions
 Tweet+0.030
 Madd+0.029
 Suz+0.027
 Bird+0.027
 Dawn+0.026
Neg logit contributions
pron-0.021
 plate-0.021
 forks-0.021
 slice-0.020
 fork-0.020
Tokeneddy
Token ID21874
Feature activation+7.49e-02

Pos logit contributions
pron+0.054
 forks+0.054
 plate+0.054
 slice+0.053
 dice+0.051
Neg logit contributions
 Tweet-0.040
 Madd-0.037
 Suz-0.034
 Bird-0.034
 shelter-0.033
Token bear
Token ID6842
Feature activation+1.54e+00

Pos logit contributions
 Tweet+0.003
 Madd+0.003
 Bird+0.003
 Bug+0.002
 Suz+0.002
Neg logit contributions
pron-0.004
 forks-0.004
 plate-0.004
 slice-0.004
 tripod-0.003
Token thread
Token ID4704
Feature activation+1.16e+00

Pos logit contributions
 Tweet+0.008
 Madd+0.008
 Bird+0.007
 Suz+0.007
 Dawn+0.007
Neg logit contributions
pron-0.007
 forks-0.007
 plate-0.007
 slice-0.007
 fork-0.007
Token from
Token ID422
Feature activation+4.77e-01

Pos logit contributions
 Tweet+0.054
 Madd+0.052
 Bird+0.049
 Suz+0.048
 Fif+0.048
Neg logit contributions
pron-0.044
 plate-0.043
 forks-0.043
 slice-0.042
 fork-0.040
Token her
Token ID607
Feature activation-1.99e-01

Pos logit contributions
 Tweet+0.021
 Madd+0.020
 Bird+0.018
 Suz+0.018
 shelter+0.018
Neg logit contributions
pron-0.021
 forks-0.020
 plate-0.020
 slice-0.019
wy-0.019
Token bag
Token ID6131
Feature activation-2.08e-01

Pos logit contributions
pron+0.007
 forks+0.006
 plate+0.006
 slice+0.006
 fork+0.006
Neg logit contributions
 Tweet-0.011
 Madd-0.010
 Bird-0.010
 Suz-0.010
 shelter-0.009
Token and
Token ID290
Feature activation+4.48e-01

Pos logit contributions
pron+0.009
 plate+0.008
 forks+0.008
 slice+0.008
 fork+0.008
Neg logit contributions
 Tweet-0.009
 Madd-0.009
 Bird-0.008
 Suz-0.008
 shelter-0.008
Token se
Token ID384
Feature activation+4.24e+00

Pos logit contributions
 Tweet+0.020
 Madd+0.019
 Bird+0.018
 Suz+0.018
 Dawn+0.018
Neg logit contributions
pron-0.018
 forks-0.018
 plate-0.018
 slice-0.017
 fork-0.017
Tokenwed
Token ID19103
Feature activation-1.04e+00

Pos logit contributions
 Tweet+0.241
 Madd+0.229
 Suz+0.220
 Bird+0.218
 shelter+0.215
Neg logit contributions
pron-0.126
 forks-0.124
 plate-0.123
 slice-0.115
 fork-0.113
Token up
Token ID510
Feature activation+1.03e+00

Pos logit contributions
 forks+0.038
 plate+0.037
pron+0.037
 slice+0.035
wy+0.035
Neg logit contributions
 Tweet-0.049
 Madd-0.047
 Bird-0.045
 shelter-0.045
 Suz-0.044
Token Lucy
Token ID22162
Feature activation+1.25e-01

Pos logit contributions
 Tweet+0.048
 Madd+0.048
rig+0.046
 Kang+0.046
 Bird+0.046
Neg logit contributions
 forks-0.036
 plate-0.035
 slice-0.033
pron-0.032
 fork-0.032
Token's
Token ID338
Feature activation+1.05e+00

Pos logit contributions
 Tweet+0.006
 Madd+0.005
 Bird+0.005
 Suz+0.005
 shelter+0.005
Neg logit contributions
pron-0.005
 forks-0.005
 plate-0.005
 slice-0.005
 fork-0.005
Token dress
Token ID6576
Feature activation+1.43e+00

Pos logit contributions
 Tweet+0.049
 Madd+0.047
 Suz+0.044
 Bird+0.044
 Dawn+0.043
Neg logit contributions
pron-0.042
 plate-0.041
 forks-0.041
 slice-0.039
 fork-0.039
Token got
Token ID1392
Feature activation-4.17e-01

Pos logit contributions
 Tweet+0.019
 Madd+0.019
 Bird+0.017
 Suz+0.017
 Dawn+0.017
Neg logit contributions
pron-0.016
 forks-0.016
 plate-0.015
 slice-0.015
 fork-0.015
Token some
Token ID617
Feature activation+3.55e-01

Pos logit contributions
 forks+0.016
 plate+0.016
pron+0.015
 slice+0.015
 fork+0.015
Neg logit contributions
 Tweet-0.020
 Madd-0.019
 Bird-0.018
 Bug-0.018
 Suz-0.018
Token colorful
Token ID20239
Feature activation-5.37e-01

Pos logit contributions
 Tweet+0.018
 Bird+0.017
 Madd+0.017
 Fif+0.017
 Bug+0.016
Neg logit contributions
 forks-0.013
pron-0.013
 plate-0.013
 slice-0.012
 fork-0.012
Token fabric
Token ID9664
Feature activation-5.89e-01

Pos logit contributions
 forks+0.020
pron+0.019
 plate+0.019
 slice+0.019
 fork+0.018
Neg logit contributions
 Tweet-0.027
 Madd-0.026
 Bird-0.024
 Suz-0.024
 Dawn-0.024
Token and
Token ID290
Feature activation+4.45e-01

Pos logit contributions
pron+0.026
 plate+0.026
 forks+0.026
 slice+0.025
 fork+0.025
Neg logit contributions
 Tweet-0.024
 Madd-0.023
 Bird-0.022
 Suz-0.021
 Bug-0.021
Token se
Token ID384
Feature activation+4.23e+00

Pos logit contributions
 Tweet+0.021
 Madd+0.020
 Bird+0.019
 Suz+0.019
 Dawn+0.018
Neg logit contributions
pron-0.017
 forks-0.017
 plate-0.017
 slice-0.017
 fork-0.016
Tokenwed
Token ID19103
Feature activation-1.17e+00

Pos logit contributions
 Tweet+0.234
 Madd+0.221
 Suz+0.213
 Bird+0.210
 shelter+0.208
Neg logit contributions
pron-0.130
 plate-0.130
 forks-0.130
 fork-0.119
 slice-0.119
Token a
Token ID257
Feature activation+1.96e-01

Pos logit contributions
 forks+0.039
pron+0.037
 plate+0.037
 dice+0.035
 slice+0.035
Neg logit contributions
 Tweet-0.059
 Madd-0.057
 Bird-0.056
 shelter-0.055
 Bug-0.055
Token beautiful
Token ID4950
Feature activation+6.16e-01

Pos logit contributions
 Tweet+0.008
 Madd+0.008
 shelter+0.007
 Bird+0.007
 Suz+0.007
Neg logit contributions
 forks-0.009
wy-0.009
 plate-0.008
pron-0.008
ossible-0.008
Token dress
Token ID6576
Feature activation+6.24e-01

Pos logit contributions
 Tweet+0.028
 Madd+0.027
 shelter+0.025
 Suz+0.025
 Bird+0.025
Neg logit contributions
 forks-0.025
wy-0.024
 plate-0.024
pron-0.024
 slice-0.023
Token for
Token ID329
Feature activation-2.00e-01

Pos logit contributions
 Tweet+0.028
 Madd+0.027
 Bird+0.026
 Suz+0.025
 Bug+0.025
Neg logit contributions
pron-0.025
 plate-0.025
 forks-0.025
 slice-0.024
 fork-0.023
Token off
Token ID572
Feature activation-1.67e+00

Pos logit contributions
 Tweet+0.059
 Madd+0.058
 Bird+0.055
 Suz+0.055
 Bug+0.054
Neg logit contributions
pron-0.032
 plate-0.031
 forks-0.031
 slice-0.030
 fork-0.028
Token to
Token ID284
Feature activation+2.43e-01

Pos logit contributions
pron+0.068
 plate+0.067
 forks+0.066
 slice+0.065
 fork+0.063
Neg logit contributions
 Tweet-0.074
 Madd-0.071
 Bird-0.066
 Suz-0.066
 shelter-0.065
Token play
Token ID711
Feature activation-3.07e-01

Pos logit contributions
 Tweet+0.012
 Madd+0.012
 Suz+0.011
 Bird+0.011
 Dawn+0.011
Neg logit contributions
 plate-0.008
pron-0.008
 forks-0.008
 slice-0.008
 fork-0.008
Token with
Token ID351
Feature activation-3.90e-01

Pos logit contributions
 plate+0.012
pron+0.011
 slice+0.011
 forks+0.011
lf+0.011
Neg logit contributions
 Madd-0.014
 Tweet-0.014
 Suz-0.013
 Kel-0.013
 Bird-0.013
Token her
Token ID607
Feature activation+1.08e-01

Pos logit contributions
 slice+0.010
 plate+0.009
 forks+0.009
 mine+0.008
 fork+0.008
Neg logit contributions
rig-0.023
 rou-0.020
 Kang-0.019
 Bees-0.019
 dam-0.019
Token bird
Token ID6512
Feature activation+4.23e+00

Pos logit contributions
 Madd+0.004
 Tweet+0.004
 Suz+0.004
 Ph+0.004
 Kel+0.004
Neg logit contributions
 plate-0.005
pron-0.005
 forks-0.005
 slice-0.005
 fork-0.005
Tokenie
Token ID494
Feature activation+2.09e+00

Pos logit contributions
 Madd+0.181
 Ph+0.172
 Kel+0.169
 Tweet+0.168
 Dawn+0.167
Neg logit contributions
 plate-0.170
pron-0.168
 slice-0.167
 forks-0.155
lf-0.155
Token friends
Token ID2460
Feature activation+6.75e-01

Pos logit contributions
 Tweet+0.095
 Madd+0.094
 Suz+0.088
 Bird+0.085
 Dawn+0.085
Neg logit contributions
 plate-0.083
pron-0.083
 forks-0.081
 slice-0.079
 fork-0.077
Token.
Token ID13
Feature activation+5.02e-01

Pos logit contributions
 Tweet+0.029
 Madd+0.027
 Bird+0.026
 Suz+0.025
 shelter+0.025
Neg logit contributions
pron-0.030
 forks-0.029
 plate-0.028
 slice-0.028
wy-0.028
Token The
Token ID383
Feature activation+1.56e+00

Pos logit contributions
 rou+0.022
 dam+0.021
 Madd+0.021
 darker+0.020
 shelter+0.020
Neg logit contributions
ossible-0.018
>-0.016
�-0.015
 plate-0.015
 slice-0.015
Token end
Token ID886
Feature activation-1.07e+00

Pos logit contributions
 Tweet+0.090
 Madd+0.088
 Suz+0.084
 Bird+0.082
 Dawn+0.081
Neg logit contributions
pron-0.045
 plate-0.044
 forks-0.043
 slice-0.042
 fork-0.040
Token watched
Token ID7342
Feature activation-9.26e-01

Pos logit contributions
 Tweet+0.027
 Madd+0.026
 Bird+0.024
 Suz+0.024
 shelter+0.023
Neg logit contributions
pron-0.031
 plate-0.031
 forks-0.031
 slice-0.030
 fork-0.029
Token as
Token ID355
Feature activation-1.82e+00

Pos logit contributions
 forks+0.033
 plate+0.033
pron+0.032
 slice+0.031
 fork+0.030
Neg logit contributions
 Tweet-0.044
 Madd-0.044
 Dawn-0.043
 Suz-0.041
 shelter-0.041
Token her
Token ID607
Feature activation+7.37e-02

Pos logit contributions
 forks+0.070
 plate+0.068
pron+0.067
 fork+0.065
lf+0.063
Neg logit contributions
 Tweet-0.083
 Madd-0.080
 shelter-0.077
 Bird-0.077
 Dawn-0.077
Token mom
Token ID1995
Feature activation+1.25e+00

Pos logit contributions
 Tweet+0.004
 Madd+0.004
 Bird+0.003
 Suz+0.003
 shelter+0.003
Neg logit contributions
pron-0.003
 forks-0.003
 plate-0.003
 slice-0.003
wy-0.002
Token carefully
Token ID7773
Feature activation+7.93e-02

Pos logit contributions
 Madd+0.070
 Fif+0.067
 Tweet+0.067
 Dawn+0.066
 Ph+0.063
Neg logit contributions
 slice-0.038
pron-0.038
 plate-0.036
lf-0.035
 forks-0.035
Token se
Token ID384
Feature activation+4.23e+00

Pos logit contributions
 Tweet+0.004
 Madd+0.004
 Suz+0.004
 Bird+0.004
 Dawn+0.004
Neg logit contributions
pron-0.003
 plate-0.003
 forks-0.003
 slice-0.003
 fork-0.003
Tokenwed
Token ID19103
Feature activation-1.19e+00

Pos logit contributions
 Tweet+0.225
 Madd+0.210
 Suz+0.202
 Bird+0.200
 shelter+0.198
Neg logit contributions
 forks-0.140
pron-0.138
 plate-0.138
 fork-0.128
 slice-0.127
Token Ted
Token ID11396
Feature activation+1.06e+00

Pos logit contributions
 forks+0.043
 plate+0.041
 fork+0.039
pron+0.038
 dice+0.037
Neg logit contributions
 Tweet-0.058
 Madd-0.057
 Ph-0.055
 Dawn-0.055
 Bird-0.055
Token's
Token ID338
Feature activation+1.16e+00

Pos logit contributions
 Tweet+0.052
 Madd+0.051
 Suz+0.048
 Dawn+0.048
 Bird+0.047
Neg logit contributions
 plate-0.039
 forks-0.038
pron-0.038
 slice-0.038
 fork-0.036
Token arm
Token ID3211
Feature activation+1.22e+00

Pos logit contributions
 Tweet+0.052
 Madd+0.051
 Suz+0.048
 Bird+0.047
 Dawn+0.047
Neg logit contributions
pron-0.048
 plate-0.047
 forks-0.047
 slice-0.045
 fork-0.044
Token back
Token ID736
Feature activation+8.01e-01

Pos logit contributions
 Madd+0.063
 Dawn+0.060
 Tweet+0.059
 Fif+0.059
 Suz+0.058
Neg logit contributions
 plate-0.041
pron-0.040
 forks-0.039
 slice-0.038
 fork-0.037
Token in
Token ID287
Feature activation-1.33e+00

Pos logit contributions
pron+0.021
 plate+0.021
 forks+0.021
 slice+0.020
 fork+0.019
Neg logit contributions
 Tweet-0.023
 Madd-0.023
 Suz-0.021
 Bird-0.021
 Dawn-0.021
Token the
Token ID262
Feature activation+2.82e-01

Pos logit contributions
 plate+0.050
 forks+0.050
pron+0.049
 slice+0.048
 fork+0.046
Neg logit contributions
 Tweet-0.064
 Madd-0.062
 Bird-0.058
 Suz-0.057
 Dawn-0.057
Token garden
Token ID11376
Feature activation-7.94e-01

Pos logit contributions
 Tweet+0.013
 Madd+0.012
 Bird+0.011
 Suz+0.011
 Dawn+0.011
Neg logit contributions
pron-0.011
 plate-0.011
 forks-0.011
 slice-0.011
 fork-0.010
Token and
Token ID290
Feature activation+1.70e-02

Pos logit contributions
pron+0.033
 forks+0.032
 plate+0.032
 slice+0.031
 fork+0.030
Neg logit contributions
 Tweet-0.036
 Madd-0.034
 Suz-0.032
 Bird-0.032
 shelter-0.031
Token have
Token ID423
Feature activation+8.65e-02

Pos logit contributions
 Tweet+0.001
 Madd+0.001
 Bird+0.001
 Suz+0.001
 Dawn+0.001
Neg logit contributions
pron-0.001
 forks-0.001
 plate-0.001
 slice-0.001
 fork-0.001
Token pic
Token ID8301
Feature activation+4.23e+00

Pos logit contributions
 Tweet+0.004
 Madd+0.004
 Dawn+0.003
 Kel+0.003
 Suz+0.003
Neg logit contributions
 forks-0.004
 plate-0.004
 slice-0.004
 apr-0.004
pron-0.004
Tokenn
Token ID77
Feature activation+4.02e-01

Pos logit contributions
 Tweet+0.133
 Madd+0.115
 Bird+0.108
 shelter+0.107
 Bug+0.106
Neg logit contributions
 forks-0.243
 plate-0.236
 fork-0.235
pron-0.232
 slice-0.223
Tokenics
Token ID873
Feature activation+1.27e+00

Pos logit contributions
 Tweet+0.016
 Madd+0.015
 Suz+0.014
 Bird+0.014
 shelter+0.014
Neg logit contributions
 forks-0.019
pron-0.019
 plate-0.018
 slice-0.018
 fork-0.018
Token together
Token ID1978
Feature activation-1.08e-01

Pos logit contributions
 Tweet+0.055
 Madd+0.051
 Bird+0.048
 shelter+0.048
 Suz+0.048
Neg logit contributions
pron-0.055
 forks-0.053
 plate-0.053
 slice-0.051
wy-0.050
Token.
Token ID13
Feature activation+7.16e-01

Pos logit contributions
pron+0.005
 forks+0.005
wy+0.005
 plate+0.005
 dice+0.005
Neg logit contributions
 Tweet-0.004
 Madd-0.003
 Bird-0.003
 shelter-0.003
 Bug-0.003
Token
Token ID220
Feature activation-8.48e-01

Pos logit contributions
 Tweet+0.029
 Madd+0.028
 Suz+0.026
 Dawn+0.025
 shelter+0.025
Neg logit contributions
 plate-0.031
pron-0.031
 forks-0.031
 slice-0.031
ossible-0.030
Token be
Token ID307
Feature activation+6.72e-01

Pos logit contributions
 Madd+0.093
 Tweet+0.091
 Dawn+0.088
 Suz+0.088
 Kel+0.087
Neg logit contributions
 plate-0.066
 slice-0.064
 forks-0.061
pron-0.061
 fork-0.058
Token fun
Token ID1257
Feature activation+8.40e-01

Pos logit contributions
 Madd+0.028
 Tweet+0.027
 Suz+0.025
 Dawn+0.025
 Kel+0.024
Neg logit contributions
 forks-0.029
 plate-0.029
pron-0.029
wy-0.028
 slice-0.028
Token to
Token ID284
Feature activation-2.97e-01

Pos logit contributions
 Tweet+0.039
 Madd+0.038
 Bird+0.035
 Dawn+0.035
 Suz+0.035
Neg logit contributions
 plate-0.033
 forks-0.033
pron-0.032
 slice-0.031
 fork-0.031
Token put
Token ID1234
Feature activation+7.53e-02

Pos logit contributions
 plate+0.010
 slice+0.010
 forks+0.010
pron+0.010
 fork+0.009
Neg logit contributions
 Madd-0.015
 Tweet-0.014
 Dawn-0.014
 Suz-0.014
 Kel-0.013
Token her
Token ID607
Feature activation+6.66e-02

Pos logit contributions
 Tweet+0.004
 Madd+0.003
 Suz+0.003
 Bird+0.003
 shelter+0.003
Neg logit contributions
pron-0.003
 forks-0.003
 plate-0.003
 slice-0.003
wy-0.003
Token bird
Token ID6512
Feature activation+4.21e+00

Pos logit contributions
 Tweet+0.003
 Bird+0.003
 shelter+0.003
 Madd+0.002
 Bug+0.002
Neg logit contributions
pron-0.003
Adam-0.003
 forks-0.003
wy-0.003
lf-0.003
Token in
Token ID287
Feature activation-5.11e-01

Pos logit contributions
 Tweet+0.157
 Madd+0.142
 Bird+0.135
 shelter+0.133
 Suz+0.132
Neg logit contributions
pron-0.210
 forks-0.205
 plate-0.203
 slice-0.197
 fork-0.194
Token the
Token ID262
Feature activation+1.01e+00

Pos logit contributions
 plate+0.020
pron+0.020
 forks+0.020
 slice+0.019
 fork+0.018
Neg logit contributions
 Tweet-0.024
 Madd-0.023
 Bird-0.021
 Suz-0.021
 Bug-0.021
Token cage
Token ID17186
Feature activation+1.97e+00

Pos logit contributions
 Tweet+0.041
 shelter+0.037
 Madd+0.037
 Bird+0.037
 Suz+0.034
Neg logit contributions
pron-0.047
 forks-0.046
 plate-0.045
wy-0.044
 slice-0.044
Token and
Token ID290
Feature activation+6.20e-02

Pos logit contributions
 Tweet+0.086
 Madd+0.082
 Bird+0.077
 Suz+0.077
 shelter+0.075
Neg logit contributions
pron-0.084
 forks-0.082
 plate-0.082
 slice-0.079
wy-0.077
Token pretend
Token ID16614
Feature activation+8.59e-01

Pos logit contributions
 Tweet+0.003
 Madd+0.003
 Suz+0.003
 Dawn+0.003
 Bird+0.003
Neg logit contributions
 forks-0.002
 plate-0.002
pron-0.002
 slice-0.002
 fork-0.002
Token watched
Token ID7342
Feature activation-1.08e+00

Pos logit contributions
pron+0.015
 plate+0.014
 forks+0.014
 slice+0.014
 fork+0.014
Neg logit contributions
 Tweet-0.012
 Madd-0.012
 Bird-0.011
 Suz-0.011
 shelter-0.011
Token as
Token ID355
Feature activation-1.97e+00

Pos logit contributions
pron+0.037
 plate+0.036
 forks+0.036
 slice+0.035
lf+0.034
Neg logit contributions
 Tweet-0.054
 Madd-0.053
 Bird-0.051
 Dawn-0.051
 shelter-0.050
Token his
Token ID465
Feature activation-8.93e-02

Pos logit contributions
 forks+0.062
pron+0.062
 plate+0.060
 fork+0.059
lf+0.058
Neg logit contributions
 Tweet-0.102
 Madd-0.100
 Bird-0.097
 shelter-0.096
 Dawn-0.095
Token mom
Token ID1995
Feature activation+1.19e+00

Pos logit contributions
pron+0.003
 forks+0.003
 plate+0.003
 slice+0.003
 fork+0.003
Neg logit contributions
 Tweet-0.005
 Madd-0.004
 Bird-0.004
 Suz-0.004
 shelter-0.004
Token carefully
Token ID7773
Feature activation+1.56e-01

Pos logit contributions
 Tweet+0.064
 Madd+0.063
 Fif+0.060
 Dawn+0.060
 Bug+0.057
Neg logit contributions
pron-0.041
 slice-0.038
 forks-0.037
 plate-0.037
lf-0.037
Token se
Token ID384
Feature activation+4.20e+00

Pos logit contributions
 Tweet+0.008
 Madd+0.008
 Bird+0.007
 Dawn+0.007
 Suz+0.007
Neg logit contributions
pron-0.006
 forks-0.006
 plate-0.006
 slice-0.006
 fork-0.005
Tokenwed
Token ID19103
Feature activation-1.27e+00

Pos logit contributions
 Tweet+0.225
 Madd+0.213
 Suz+0.204
 Bird+0.202
 shelter+0.199
Neg logit contributions
pron-0.136
 forks-0.135
 plate-0.134
 slice-0.126
 fork-0.125
Token the
Token ID262
Feature activation+4.70e-01

Pos logit contributions
 forks+0.041
 plate+0.037
 fork+0.036
pron+0.035
wy+0.035
Neg logit contributions
 Bees-0.064
 Tweet-0.064
 Madd-0.063
rig-0.062
 Bird-0.062
Token stitch
Token ID24695
Feature activation+1.61e+00

Pos logit contributions
 Tweet+0.021
 shelter+0.020
 Madd+0.019
 Bird+0.019
 Bug+0.018
Neg logit contributions
pron-0.019
 slice-0.019
 forks-0.018
ossible-0.018
Adam-0.018
Token in
Token ID287
Feature activation-3.47e-01

Pos logit contributions
 Tweet+0.077
 Madd+0.074
 Dawn+0.070
 Bird+0.070
 Fif+0.070
Neg logit contributions
 plate-0.061
pron-0.060
 forks-0.060
 slice-0.057
 fork-0.056
Token the
Token ID262
Feature activation+8.22e-01

Pos logit contributions
pron+0.012
 plate+0.012
 forks+0.012
 slice+0.012
 fork+0.012
Neg logit contributions
 Tweet-0.017
 Madd-0.017
 Bird-0.016
 Suz-0.016
 shelter-0.015
Token scary
Token ID14343
Feature activation+1.02e+00

Pos logit contributions
pron+0.006
 plate+0.006
 forks+0.005
 fork+0.005
 slice+0.005
Neg logit contributions
 Tweet-0.005
 Madd-0.005
 Suz-0.005
 Dawn-0.005
 Ph-0.005
Token hospital
Token ID4436
Feature activation+1.12e+00

Pos logit contributions
 Tweet+0.044
 Madd+0.043
 Suz+0.041
 Dawn+0.039
 Fif+0.039
Neg logit contributions
 plate-0.044
pron-0.043
 forks-0.042
 fork-0.041
lf-0.040
Token with
Token ID351
Feature activation+9.36e-02

Pos logit contributions
 Tweet+0.056
 Madd+0.055
 Suz+0.051
 Bird+0.050
 Dawn+0.050
Neg logit contributions
 plate-0.040
pron-0.040
 forks-0.040
 slice-0.038
 fork-0.037
Token all
Token ID477
Feature activation-1.33e+00

Pos logit contributions
 Tweet+0.005
 Madd+0.005
 Suz+0.004
 Bird+0.004
 Dawn+0.004
Neg logit contributions
 forks-0.003
 plate-0.003
pron-0.003
 slice-0.003
 fork-0.003
Token the
Token ID262
Feature activation+1.36e+00

Pos logit contributions
 forks+0.047
 plate+0.047
pron+0.046
 slice+0.044
 fork+0.044
Neg logit contributions
 Tweet-0.066
 Madd-0.063
 Bird-0.060
 Suz-0.059
 Dawn-0.058
Token x
Token ID2124
Feature activation+4.20e+00

Pos logit contributions
 Tweet+0.064
 Madd+0.063
 Suz+0.061
 Dawn+0.057
 Ph+0.057
Neg logit contributions
 forks-0.054
 plate-0.054
pron-0.053
 fork-0.050
wy-0.048
Token-
Token ID12
Feature activation+2.11e+00

Pos logit contributions
 Tweet+0.191
 Madd+0.183
 Suz+0.170
 Bird+0.168
 Dawn+0.166
Neg logit contributions
pron-0.172
 plate-0.169
 forks-0.168
 slice-0.163
wy-0.159
Tokenrays
Token ID20477
Feature activation+2.81e-01

Pos logit contributions
 Tweet+0.103
 Madd+0.101
 Suz+0.093
 Bird+0.093
 Dawn+0.092
Neg logit contributions
pron-0.078
 forks-0.074
 plate-0.073
 slice-0.072
wy-0.071
Token.
Token ID13
Feature activation+4.77e-01

Pos logit contributions
 Tweet+0.012
 Madd+0.011
 Bird+0.010
 Suz+0.010
 shelter+0.010
Neg logit contributions
pron-0.012
 forks-0.012
 plate-0.012
 slice-0.012
wy-0.011
Token<|endoftext|>
Token ID50256
Feature activation-3.79e-01

Pos logit contributions
 Tweet+0.019
 Madd+0.018
 shelter+0.017
 Suz+0.017
 Bird+0.016
Neg logit contributions
pron-0.020
 plate-0.020
 forks-0.020
ossible-0.020
 slice-0.020
TokenOnce
Token ID7454
Feature activation+4.27e-01

Pos logit contributions
 plate+0.015
pron+0.015
 forks+0.015
 fork+0.014
ossible+0.014
Neg logit contributions
 Tweet-0.017
 Madd-0.016
 Suz-0.015
 Dawn-0.014
 shelter-0.014
Token wanted
Token ID2227
Feature activation-1.59e+00

Pos logit contributions
pron+0.035
 forks+0.034
 plate+0.034
 slice+0.033
 fork+0.032
Neg logit contributions
 Tweet-0.041
 Madd-0.039
 Suz-0.036
 Bird-0.036
 shelter-0.035
Token to
Token ID284
Feature activation-7.09e-01

Pos logit contributions
 forks+0.050
 plate+0.045
 slice+0.043
 fork+0.041
 plates+0.041
Neg logit contributions
rig-0.089
 Madd-0.086
 Tweet-0.085
 Ph-0.084
 Dawn-0.083
Token see
Token ID766
Feature activation-3.22e-01

Pos logit contributions
pron+0.027
 plate+0.027
 forks+0.026
 slice+0.026
 fork+0.025
Neg logit contributions
 Tweet-0.033
 Madd-0.032
 Bird-0.030
 Suz-0.030
 Bug-0.029
Token the
Token ID262
Feature activation-1.24e+00

Pos logit contributions
 forks+0.011
 plate+0.010
 slice+0.010
 fork+0.009
 apr+0.009
Neg logit contributions
 Tweet-0.017
 Madd-0.017
 Bug-0.016
 Suz-0.016
 Dawn-0.016
Token or
Token ID393
Feature activation-9.35e-01

Pos logit contributions
 plate+0.052
pron+0.052
 forks+0.051
 slice+0.049
 fork+0.049
Neg logit contributions
 Tweet-0.053
 Madd-0.053
 Suz-0.051
 Dawn-0.048
 Bird-0.048
Tokennam
Token ID7402
Feature activation-5.62e+00

Pos logit contributions
pron+0.031
 plate+0.029
 forks+0.029
wy+0.028
 slice+0.028
Neg logit contributions
 Tweet-0.051
 Madd-0.048
 Bird-0.046
 Suz-0.046
 shelter-0.045
Tokenents
Token ID658
Feature activation-2.47e+00

Pos logit contributions
pron+0.275
 forks+0.261
wy+0.258
 fork+0.257
 plate+0.257
Neg logit contributions
 Tweet-0.210
 Madd-0.189
 Suz-0.181
 Bird-0.174
 shelter-0.166
Token up
Token ID510
Feature activation-3.21e-01

Pos logit contributions
 plate+0.093
 forks+0.091
 slice+0.091
pron+0.090
 fork+0.084
Neg logit contributions
 Tweet-0.118
 Madd-0.115
 Bird-0.107
 Suz-0.107
 Dawn-0.107
Token close
Token ID1969
Feature activation-2.47e-01

Pos logit contributions
pron+0.013
 plate+0.013
 forks+0.013
 slice+0.012
 fork+0.012
Neg logit contributions
 Tweet-0.015
 Madd-0.014
 Bird-0.013
 Suz-0.013
 shelter-0.013
Token.
Token ID13
Feature activation+1.49e-01

Pos logit contributions
 forks+0.009
 plate+0.009
pron+0.009
 slice+0.008
 fork+0.008
Neg logit contributions
 Tweet-0.012
 Madd-0.012
 Bird-0.011
 Suz-0.011
 Bug-0.011
Token They
Token ID1119
Feature activation+1.28e-02

Pos logit contributions
 Tweet+0.005
 Madd+0.004
 Bird+0.004
 Suz+0.004
 shelter+0.004
Neg logit contributions
pron-0.008
 forks-0.008
 plate-0.008
 slice-0.008
wy-0.008
Token said
Token ID531
Feature activation-7.41e-01

Pos logit contributions
 Tweet+0.026
 Madd+0.025
 Bird+0.023
 Suz+0.023
 shelter+0.022
Neg logit contributions
pron-0.030
 plate-0.029
 forks-0.029
 slice-0.028
 fork-0.028
Token.
Token ID13
Feature activation+2.13e-02

Pos logit contributions
 plate+0.028
pron+0.027
 forks+0.027
 slice+0.027
 fork+0.025
Neg logit contributions
 Tweet-0.036
 Madd-0.034
 Bird-0.032
 Suz-0.032
 shelter-0.032
Token "
Token ID366
Feature activation+8.54e-01

Pos logit contributions
 Tweet+0.001
 Madd+0.001
 Suz+0.001
 shelter+0.001
 Bird+0.001
Neg logit contributions
 plate-0.001
pron-0.001
 forks-0.001
 slice-0.001
 fork-0.001
TokenThe
Token ID464
Feature activation+8.18e-01

Pos logit contributions
 Tweet+0.034
 Madd+0.031
 Bird+0.030
 Suz+0.029
 shelter+0.028
Neg logit contributions
pron-0.040
 forks-0.039
 plate-0.039
wy-0.038
 slice-0.037
Token or
Token ID393
Feature activation-5.25e-01

Pos logit contributions
 Tweet+0.033
 Bird+0.029
 Madd+0.028
 shelter+0.028
 Bug+0.027
Neg logit contributions
pron-0.040
 forks-0.039
wy-0.038
 plate-0.038
 slice-0.038
Tokennam
Token ID7402
Feature activation-5.41e+00

Pos logit contributions
pron+0.014
wy+0.013
ucer+0.012
ater+0.012
 plate+0.012
Neg logit contributions
 Tweet-0.033
 Madd-0.030
 Bird-0.030
 Fif-0.028
 Bug-0.028
Tokenents
Token ID658
Feature activation-2.43e+00

Pos logit contributions
pron+0.276
 forks+0.259
 fork+0.256
 plate+0.256
wy+0.252
Neg logit contributions
 Tweet-0.204
 Madd-0.177
 Suz-0.172
 Bird-0.164
 Kerry-0.157
Token are
Token ID389
Feature activation-2.35e-01

Pos logit contributions
pron+0.105
 forks+0.102
 plate+0.101
 slice+0.097
 fork+0.096
Neg logit contributions
 Tweet-0.106
 Madd-0.099
 Bird-0.093
 Suz-0.093
 shelter-0.091
Token so
Token ID523
Feature activation+2.51e-01

Pos logit contributions
pron+0.011
 plate+0.011
 forks+0.011
 slice+0.010
wy+0.010
Neg logit contributions
 Tweet-0.010
 Madd-0.009
 Bird-0.008
 Suz-0.008
 shelter-0.008
Token pretty
Token ID2495
Feature activation-4.46e-01

Pos logit contributions
 Tweet+0.010
 Madd+0.009
 Bird+0.008
 Suz+0.008
 Fif+0.008
Neg logit contributions
pron-0.012
 plate-0.012
 forks-0.012
wy-0.012
 fork-0.012
Token!
Token ID0
Feature activation-6.95e-01

Pos logit contributions
pron+0.019
 forks+0.017
 plate+0.017
 slice+0.017
wy+0.017
Neg logit contributions
 Tweet-0.020
 Madd-0.019
 Bird-0.018
 shelter-0.018
 Suz-0.017
Token to
Token ID284
Feature activation-2.39e-01

Pos logit contributions
pron+0.053
 forks+0.052
 plate+0.052
 slice+0.050
wy+0.049
Neg logit contributions
 Tweet-0.057
 Madd-0.054
 Bird-0.051
 Suz-0.051
 shelter-0.050
Token look
Token ID804
Feature activation+1.28e+00

Pos logit contributions
pron+0.010
 forks+0.010
 plate+0.009
wy+0.009
 slice+0.009
Neg logit contributions
 Tweet-0.011
 Madd-0.010
 Suz-0.010
 Bird-0.010
 shelter-0.009
Token at
Token ID379
Feature activation-5.42e-01

Pos logit contributions
 Tweet+0.035
 Madd+0.033
 shelter+0.031
 Suz+0.029
 Bunny+0.028
Neg logit contributions
pron-0.075
 tripod-0.073
lf-0.071
 slice-0.070
Adam-0.070
Token the
Token ID262
Feature activation-2.71e-01

Pos logit contributions
pron+0.021
 forks+0.021
 plate+0.021
 slice+0.020
 fork+0.020
Neg logit contributions
 Tweet-0.025
 Madd-0.024
 Bird-0.023
 Suz-0.023
 shelter-0.022
Token or
Token ID393
Feature activation-2.89e-01

Pos logit contributions
pron+0.011
 forks+0.011
 plate+0.011
 slice+0.010
 fork+0.010
Neg logit contributions
 Tweet-0.012
 Madd-0.012
 Suz-0.011
 Bird-0.011
 Dawn-0.011
Tokennam
Token ID7402
Feature activation-5.36e+00

Pos logit contributions
pron+0.009
wy+0.008
 forks+0.008
 plate+0.008
ucer+0.008
Neg logit contributions
 Tweet-0.016
 Madd-0.015
 Bird-0.015
 Suz-0.014
 Fif-0.014
Tokenents
Token ID658
Feature activation-2.27e+00

Pos logit contributions
pron+0.266
 forks+0.258
wy+0.254
 plate+0.254
 fork+0.252
Neg logit contributions
 Tweet-0.183
 Madd-0.160
 Suz-0.156
 Bird-0.144
 Twe-0.139
Token on
Token ID319
Feature activation-5.86e-01

Pos logit contributions
pron+0.094
 forks+0.092
 plate+0.091
 slice+0.089
wy+0.086
Neg logit contributions
 Tweet-0.101
 Madd-0.096
 Bird-0.090
 Suz-0.090
 shelter-0.088
Token the
Token ID262
Feature activation+1.42e-01

Pos logit contributions
pron+0.028
wy+0.026
ucer+0.026
 slice+0.026
 forks+0.025
Neg logit contributions
 Tweet-0.023
 Madd-0.021
 Bird-0.020
 shelter-0.020
 Bug-0.019
Token tree
Token ID5509
Feature activation+8.33e-01

Pos logit contributions
 Tweet+0.007
 Madd+0.007
 Bird+0.007
 shelter+0.007
 Bug+0.006
Neg logit contributions
pron-0.005
 forks-0.005
 slice-0.005
wy-0.005
 plate-0.004
Token.
Token ID13
Feature activation+2.91e-02

Pos logit contributions
 Tweet+0.041
 Madd+0.040
 Suz+0.037
 Dawn+0.037
 Bird+0.037
Neg logit contributions
 plate-0.031
pron-0.031
 forks-0.030
 slice-0.030
 fork-0.028
Token more
Token ID517
Feature activation+1.11e+00

Pos logit contributions
 Tweet+0.041
 Madd+0.039
 Bird+0.036
 Suz+0.036
 shelter+0.035
Neg logit contributions
pron-0.040
 forks-0.039
 plate-0.039
 slice-0.038
wy-0.037
Token careful
Token ID8161
Feature activation-1.61e+00

Pos logit contributions
 Tweet+0.055
 Madd+0.053
 Suz+0.050
 Bird+0.049
 shelter+0.048
Neg logit contributions
pron-0.042
 plate-0.041
 forks-0.041
 slice-0.039
wy-0.038
Token.
Token ID13
Feature activation-1.82e-01

Pos logit contributions
pron+0.078
 forks+0.076
 plate+0.076
 slice+0.074
wy+0.073
Neg logit contributions
 Tweet-0.060
 Madd-0.056
 Bird-0.053
 Suz-0.052
 shelter-0.052
Token The
Token ID383
Feature activation+9.14e-01

Pos logit contributions
ossible+0.006
 forks+0.006
pron+0.006
 plate+0.006
 slice+0.006
Neg logit contributions
 Tweet-0.009
 Madd-0.009
 shelter-0.008
 rou-0.008
 rust-0.008
Token or
Token ID393
Feature activation-6.09e-01

Pos logit contributions
 Tweet+0.042
 Madd+0.040
 Suz+0.038
 Bird+0.036
 Dawn+0.036
Neg logit contributions
pron-0.038
 plate-0.038
 forks-0.037
 slice-0.036
 fork-0.036
Tokennam
Token ID7402
Feature activation-5.32e+00

Pos logit contributions
pron+0.022
 forks+0.022
 plate+0.021
 slice+0.021
 fork+0.020
Neg logit contributions
 Tweet-0.030
 Madd-0.029
 Bird-0.027
 Suz-0.027
 shelter-0.027
Tokenents
Token ID658
Feature activation-2.41e+00

Pos logit contributions
pron+0.268
 forks+0.258
 plate+0.255
 fork+0.254
 apr+0.250
Neg logit contributions
 Tweet-0.188
 Madd-0.162
 Suz-0.161
 Bird-0.151
 shelter-0.144
Token are
Token ID389
Feature activation+3.20e-02

Pos logit contributions
pron+0.097
 plate+0.096
 forks+0.095
 slice+0.093
wy+0.090
Neg logit contributions
 Tweet-0.108
 Madd-0.104
 Suz-0.097
 Bird-0.096
 Dawn-0.095
Token not
Token ID407
Feature activation-1.42e+00

Pos logit contributions
 Tweet+0.001
 Madd+0.001
 Suz+0.001
 Bird+0.001
 Dawn+0.001
Neg logit contributions
pron-0.001
 forks-0.001
 plate-0.001
 slice-0.001
wy-0.001
Token toys
Token ID14958
Feature activation-1.39e+00

Pos logit contributions
pron+0.061
 forks+0.060
 plate+0.060
 slice+0.058
 fork+0.057
Neg logit contributions
 Tweet-0.061
 Madd-0.058
 Suz-0.054
 Bird-0.054
 shelter-0.053
Token.
Token ID13
Feature activation-7.03e-02

Pos logit contributions
pron+0.067
 forks+0.065
 plate+0.064
 slice+0.063
wy+0.062
Neg logit contributions
 Tweet-0.054
 Madd-0.048
 shelter-0.047
 Bird-0.046
 Bug-0.045
Token to
Token ID284
Feature activation+7.02e-01

Pos logit contributions
 Tweet+0.027
 Madd+0.024
 Bird+0.023
 shelter+0.023
 Suz+0.022
Neg logit contributions
pron-0.040
 forks-0.040
 plate-0.039
wy-0.039
 slice-0.039
Token me
Token ID502
Feature activation+2.27e+00

Pos logit contributions
 Tweet+0.035
 Madd+0.034
 Bird+0.032
 Suz+0.032
 shelter+0.031
Neg logit contributions
pron-0.025
 forks-0.025
 plate-0.025
 slice-0.024
 fork-0.023
Token.
Token ID13
Feature activation-4.14e-01

Pos logit contributions
 Tweet+0.104
 Madd+0.099
 Suz+0.093
 Bird+0.092
 shelter+0.090
Neg logit contributions
pron-0.092
 plate-0.091
 forks-0.090
 slice-0.087
 fork-0.085
Token The
Token ID383
Feature activation+8.65e-01

Pos logit contributions
 plate+0.016
pron+0.016
 forks+0.016
 slice+0.016
ossible+0.016
Neg logit contributions
 Tweet-0.018
 Madd-0.018
 Suz-0.016
 shelter-0.016
 Dawn-0.016
Token or
Token ID393
Feature activation-7.55e-01

Pos logit contributions
 Tweet+0.040
 Madd+0.039
 Suz+0.037
 Dawn+0.035
 Beth+0.035
Neg logit contributions
 plate-0.035
pron-0.035
 forks-0.034
 slice-0.033
 fork-0.033
Tokennam
Token ID7402
Feature activation-5.19e+00

Pos logit contributions
pron+0.031
 plate+0.030
 forks+0.030
 slice+0.029
wy+0.028
Neg logit contributions
 Tweet-0.035
 Madd-0.033
 Bird-0.031
 Suz-0.031
 shelter-0.030
Tokenents
Token ID658
Feature activation-2.36e+00

Pos logit contributions
pron+0.274
 forks+0.267
 fork+0.266
 plate+0.263
 apr+0.256
Neg logit contributions
 Tweet-0.161
 Suz-0.130
 Madd-0.127
 Bird-0.125
 Cro-0.118
Token were
Token ID547
Feature activation-2.34e-01

Pos logit contributions
pron+0.091
 plate+0.089
 forks+0.089
 slice+0.087
wy+0.084
Neg logit contributions
 Tweet-0.111
 Madd-0.107
 Suz-0.101
 Bird-0.099
 Dawn-0.098
Token very
Token ID845
Feature activation-1.44e-01

Pos logit contributions
pron+0.011
 forks+0.011
 plate+0.011
 slice+0.010
 fork+0.010
Neg logit contributions
 Tweet-0.009
 Madd-0.009
 Bird-0.008
 Suz-0.008
 shelter-0.008
Token important
Token ID1593
Feature activation+2.63e-01

Pos logit contributions
pron+0.007
 forks+0.007
 plate+0.007
 slice+0.007
 fork+0.006
Neg logit contributions
 Tweet-0.006
 Madd-0.005
 Bird-0.005
 Suz-0.005
 shelter-0.005
Token to
Token ID284
Feature activation-1.93e-01

Pos logit contributions
 Tweet+0.010
 Madd+0.009
 Bird+0.009
 shelter+0.009
 Suz+0.009
Neg logit contributions
pron-0.013
 plate-0.012
 forks-0.012
wy-0.012
 slice-0.012
Token They
Token ID1119
Feature activation-1.32e-01

Pos logit contributions
 forks+0.015
 plate+0.015
pron+0.014
 slice+0.014
 tripod+0.014
Neg logit contributions
 Tweet-0.011
 Madd-0.010
 Suz-0.010
 Bug-0.009
 Dawn-0.009
Token reached
Token ID4251
Feature activation-8.00e-01

Pos logit contributions
pron+0.005
 plate+0.005
 forks+0.005
 slice+0.005
 fork+0.005
Neg logit contributions
 Tweet-0.006
 Madd-0.006
 Suz-0.005
 Bird-0.005
 shelter-0.005
Token for
Token ID329
Feature activation-4.48e-01

Pos logit contributions
 forks+0.029
 plate+0.028
pron+0.028
 slice+0.027
 fork+0.026
Neg logit contributions
 Tweet-0.041
 Madd-0.039
 Bird-0.037
 Suz-0.037
 Bug-0.036
Token the
Token ID262
Feature activation-2.85e-01

Pos logit contributions
pron+0.020
 plate+0.019
 forks+0.019
wy+0.018
 slice+0.018
Neg logit contributions
 Tweet-0.019
 Madd-0.018
 Bird-0.017
 shelter-0.017
 Suz-0.017
Token or
Token ID393
Feature activation-4.33e-01

Pos logit contributions
pron+0.012
 forks+0.011
 plate+0.011
wy+0.011
 slice+0.011
Neg logit contributions
 Tweet-0.013
 Madd-0.012
 Bird-0.012
 shelter-0.011
 Bug-0.011
Tokennam
Token ID7402
Feature activation-5.18e+00

Pos logit contributions
pron+0.009
wy+0.008
 plate+0.008
 forks+0.007
ater+0.007
Neg logit contributions
 Tweet-0.028
 Madd-0.027
 Bird-0.027
 Suz-0.026
 shelter-0.026
Tokenents
Token ID658
Feature activation-2.44e+00

Pos logit contributions
pron+0.263
 forks+0.248
 fork+0.247
 plate+0.246
wy+0.246
Neg logit contributions
 Tweet-0.178
 Madd-0.160
 Suz-0.154
 Bird-0.146
 shelter-0.140
Token and
Token ID290
Feature activation-2.84e-01

Pos logit contributions
 plate+0.100
 forks+0.098
pron+0.097
 slice+0.096
 fork+0.091
Neg logit contributions
 Tweet-0.109
 Madd-0.105
 Suz-0.098
 Dawn-0.097
 Bird-0.097
Token touched
Token ID12615
Feature activation-6.75e-01

Pos logit contributions
pron+0.012
 plate+0.011
 forks+0.011
wy+0.011
 slice+0.011
Neg logit contributions
 Tweet-0.013
 Madd-0.012
 Bird-0.011
 Suz-0.011
 shelter-0.011
Token them
Token ID606
Feature activation-8.86e-02

Pos logit contributions
 forks+0.023
 plate+0.023
 slice+0.021
pron+0.020
 fork+0.020
Neg logit contributions
 Tweet-0.033
 Madd-0.033
 Suz-0.032
rig-0.031
 dam-0.030
Token softly
Token ID26625
Feature activation+1.70e+00

Pos logit contributions
 plate+0.003
 forks+0.003
pron+0.003
 slice+0.003
 fork+0.003
Neg logit contributions
 Tweet-0.004
 Madd-0.004
 Suz-0.004
 Dawn-0.004
 Bird-0.004
Token "
Token ID366
Feature activation+7.16e-01

Pos logit contributions
 Tweet+0.012
 Madd+0.012
 Bird+0.011
 Suz+0.011
 Bug+0.010
Neg logit contributions
pron-0.012
 forks-0.012
 plate-0.012
wy-0.012
 slice-0.011
TokenHello
Token ID15496
Feature activation+4.76e-01

Pos logit contributions
 Tweet+0.027
 Madd+0.025
 Bird+0.024
 Suz+0.023
 Dawn+0.022
Neg logit contributions
 forks-0.035
 plate-0.035
pron-0.035
 slice-0.034
 fork-0.033
Token,
Token ID11
Feature activation-1.43e+00

Pos logit contributions
 Tweet+0.022
 Madd+0.021
 Suz+0.019
 Bird+0.019
 shelter+0.019
Neg logit contributions
pron-0.019
 plate-0.019
 forks-0.019
 slice-0.018
 fork-0.018
Token Mister
Token ID43438
Feature activation-1.48e+00

Pos logit contributions
pron+0.060
 plate+0.060
 forks+0.059
 slice+0.058
 fork+0.056
Neg logit contributions
 Tweet-0.062
 Madd-0.060
 Suz-0.055
 Bird-0.055
 shelter-0.054
Token P
Token ID350
Feature activation-8.63e-01

Pos logit contributions
pron+0.060
 forks+0.058
 plate+0.057
 slice+0.055
wy+0.054
Neg logit contributions
 Tweet-0.070
 Madd-0.064
 Bird-0.064
 Suz-0.062
 Bug-0.060
Tokenige
Token ID10045
Feature activation-5.15e+00

Pos logit contributions
 forks+0.025
pron+0.025
 plate+0.024
 fork+0.023
 slice+0.023
Neg logit contributions
 Tweet-0.049
 Madd-0.046
 Suz-0.045
 Bird-0.044
 Dawn-0.044
Tokenon
Token ID261
Feature activation-1.85e+00

Pos logit contributions
 apr+0.184
 forks+0.174
pron+0.164
 plate+0.158
ifying+0.156
Neg logit contributions
 Tweet-0.275
 Bird-0.252
 Suz-0.240
 Madd-0.235
 Cro-0.235
Token!"
Token ID2474
Feature activation-1.17e+00

Pos logit contributions
pron+0.064
 forks+0.063
 plate+0.060
wy+0.059
 slice+0.058
Neg logit contributions
 Tweet-0.095
 Madd-0.090
 Bird-0.088
 Suz-0.086
 shelter-0.085
Token
Token ID198
Feature activation-2.40e-01

Pos logit contributions
 plate+0.047
 forks+0.046
pron+0.045
 slice+0.045
ossible+0.044
Neg logit contributions
 Tweet-0.049
 Madd-0.048
 shelter-0.047
 Kel-0.045
 Dawn-0.044
Token
Token ID198
Feature activation-4.40e-01

Pos logit contributions
pron+0.012
 plate+0.011
 forks+0.011
 slice+0.011
 fork+0.011
Neg logit contributions
 Tweet-0.009
 Madd-0.008
 Suz-0.008
 Bird-0.008
 shelter-0.008
TokenThe
Token ID464
Feature activation+1.38e+00

Pos logit contributions
 forks+0.022
 plate+0.021
wy+0.021
pron+0.021
 fork+0.020
Neg logit contributions
 Tweet-0.017
 Madd-0.016
 Bird-0.015
 Suz-0.014
 Bug-0.014
Token a
Token ID257
Feature activation+2.36e-01

Pos logit contributions
pron+0.021
wy+0.020
 forks+0.020
 plate+0.020
 slice+0.020
Neg logit contributions
 Tweet-0.017
 Madd-0.016
 Bird-0.015
 shelter-0.015
 Suz-0.015
Token letter
Token ID3850
Feature activation-2.21e-01

Pos logit contributions
 Tweet+0.008
 shelter+0.008
 Bird+0.008
 rust+0.007
rig+0.007
Neg logit contributions
wy-0.011
 forks-0.010
ater-0.010
ucer-0.009
annah-0.009
Token about
Token ID546
Feature activation-1.16e-01

Pos logit contributions
pron+0.009
 plate+0.009
 forks+0.009
 slice+0.009
 fork+0.008
Neg logit contributions
 Tweet-0.010
 Madd-0.010
 Suz-0.009
 Bird-0.009
 Bug-0.009
Token the
Token ID262
Feature activation-2.89e-01

Pos logit contributions
 forks+0.004
 plate+0.004
 slice+0.004
pron+0.004
 fork+0.004
Neg logit contributions
 Tweet-0.005
 Madd-0.005
 Suz-0.005
 Dawn-0.005
 Ph-0.005
Token or
Token ID393
Feature activation-2.50e-01

Pos logit contributions
pron+0.012
 plate+0.012
 forks+0.012
 slice+0.012
 fork+0.012
Neg logit contributions
 Tweet-0.013
 Madd-0.012
 Suz-0.011
 Bird-0.011
 Dawn-0.011
Tokennam
Token ID7402
Feature activation-5.15e+00

Pos logit contributions
pron+0.009
 plate+0.008
 forks+0.008
 slice+0.008
wy+0.008
Neg logit contributions
 Tweet-0.013
 Madd-0.012
 Bird-0.012
 Suz-0.011
 shelter-0.011
Tokenents
Token ID658
Feature activation-2.24e+00

Pos logit contributions
pron+0.266
 forks+0.257
wy+0.254
 fork+0.253
 apr+0.253
Neg logit contributions
 Tweet-0.162
 Madd-0.143
 Suz-0.139
 Bird-0.128
 Twe-0.123
Token and
Token ID290
Feature activation-2.07e-01

Pos logit contributions
pron+0.090
 plate+0.089
 forks+0.089
 slice+0.087
 fork+0.083
Neg logit contributions
 Tweet-0.103
 Madd-0.099
 Suz-0.092
 Bird-0.092
 Dawn-0.090
Token asked
Token ID1965
Feature activation-2.04e-02

Pos logit contributions
 forks+0.008
pron+0.008
 plate+0.008
 slice+0.008
 fork+0.008
Neg logit contributions
 Tweet-0.010
 Madd-0.009
 Suz-0.009
 Bird-0.009
 shelter-0.008
Token for
Token ID329
Feature activation-8.49e-01

Pos logit contributions
pron+0.001
 forks+0.001
 plate+0.001
 slice+0.001
 fork+0.001
Neg logit contributions
 Tweet-0.001
 Madd-0.001
 Bird-0.001
 Suz-0.001
 shelter-0.001
Token one
Token ID530
Feature activation-3.26e-01

Pos logit contributions
pron+0.043
wy+0.043
ater+0.041
ucer+0.041
 plate+0.040
Neg logit contributions
 Tweet-0.030
 Bird-0.027
 shelter-0.027
 Madd-0.026
 Suz-0.024
Token branches
Token ID13737
Feature activation-5.30e-01

Pos logit contributions
 Tweet+0.034
 Bird+0.030
 Bug+0.030
 Madd+0.029
 shelter+0.029
Neg logit contributions
pron-0.037
wy-0.036
 forks-0.035
ucer-0.035
ossible-0.035
Token.
Token ID13
Feature activation+7.26e-01

Pos logit contributions
 plate+0.020
pron+0.019
 forks+0.019
 slice+0.019
 fork+0.018
Neg logit contributions
 Tweet-0.025
 Madd-0.024
 Bird-0.023
 Dawn-0.022
 Suz-0.022
Token But
Token ID887
Feature activation+1.27e+00

Pos logit contributions
 Tweet+0.027
 Madd+0.025
 Bird+0.024
 Suz+0.024
 shelter+0.023
Neg logit contributions
pron-0.035
 forks-0.035
 plate-0.035
 slice-0.034
 fork-0.033
Token the
Token ID262
Feature activation-2.95e-01

Pos logit contributions
 Tweet+0.054
 Madd+0.052
 Suz+0.048
 Bird+0.048
 shelter+0.047
Neg logit contributions
pron-0.057
 forks-0.055
 plate-0.055
 slice-0.053
wy-0.053
Token or
Token ID393
Feature activation-2.21e-01

Pos logit contributions
pron+0.014
wy+0.013
 forks+0.013
 tripod+0.013
 plate+0.013
Neg logit contributions
 Tweet-0.012
 Bird-0.011
 shelter-0.011
 Madd-0.011
 Bug-0.010
Tokennam
Token ID7402
Feature activation-5.14e+00

Pos logit contributions
pron+0.006
wy+0.005
ater+0.005
 plate+0.005
 forks+0.005
Neg logit contributions
 Tweet-0.013
 Madd-0.013
 Bird-0.012
 Suz-0.012
 shelter-0.012
Tokenents
Token ID658
Feature activation-2.36e+00

Pos logit contributions
pron+0.252
 forks+0.236
 plate+0.233
 fork+0.232
wy+0.230
Neg logit contributions
 Tweet-0.201
 Madd-0.183
 Suz-0.177
 Bird-0.166
 shelter-0.160
Token were
Token ID547
Feature activation-3.33e-01

Pos logit contributions
 plate+0.097
 slice+0.095
pron+0.095
 forks+0.095
lf+0.090
Neg logit contributions
 Tweet-0.100
 Madd-0.096
 Dawn-0.090
 Bird-0.089
 Bug-0.088
Token slippery
Token ID32911
Feature activation-2.58e-01

Pos logit contributions
pron+0.017
 forks+0.017
 plate+0.016
 slice+0.016
 fork+0.016
Neg logit contributions
 Tweet-0.012
 Madd-0.011
 Bird-0.010
 Suz-0.010
 shelter-0.010
Token and
Token ID290
Feature activation-9.35e-01

Pos logit contributions
 forks+0.010
 plate+0.010
pron+0.010
 fork+0.009
 slice+0.009
Neg logit contributions
 Tweet-0.012
 Madd-0.012
 Suz-0.011
 Bird-0.011
 Dawn-0.011
Token some
Token ID617
Feature activation+5.96e-01

Pos logit contributions
 forks+0.039
pron+0.039
 plate+0.039
 slice+0.038
 fork+0.037
Neg logit contributions
 Tweet-0.040
 Madd-0.039
 Suz-0.036
 Bird-0.036
 shelter-0.036
Token They
Token ID1119
Feature activation+1.36e-01

Pos logit contributions
 Tweet+0.020
 Madd+0.019
 Bird+0.018
 Suz+0.018
 shelter+0.017
Neg logit contributions
pron-0.026
 forks-0.026
 plate-0.026
 slice-0.025
wy-0.025
Token reached
Token ID4251
Feature activation-6.21e-01

Pos logit contributions
 Tweet+0.006
 Madd+0.006
 Bird+0.005
 Suz+0.005
 shelter+0.005
Neg logit contributions
pron-0.006
 forks-0.006
 plate-0.006
 slice-0.005
wy-0.005
Token for
Token ID329
Feature activation-4.16e-01

Pos logit contributions
pron+0.025
 forks+0.024
 plate+0.024
 slice+0.023
 fork+0.023
Neg logit contributions
 Tweet-0.029
 Madd-0.028
 Bird-0.026
 Suz-0.026
 shelter-0.025
Token the
Token ID262
Feature activation-2.86e-01

Pos logit contributions
pron+0.018
 plate+0.017
 forks+0.017
 slice+0.017
wy+0.017
Neg logit contributions
 Tweet-0.018
 Madd-0.017
 Bird-0.016
 Suz-0.016
 shelter-0.016
Token or
Token ID393
Feature activation-3.09e-01

Pos logit contributions
pron+0.012
 forks+0.012
wy+0.012
 plate+0.012
 slice+0.011
Neg logit contributions
 Tweet-0.013
 Madd-0.012
 Bird-0.011
 shelter-0.011
 Bug-0.011
Tokennam
Token ID7402
Feature activation-5.12e+00

Pos logit contributions
pron+0.009
wy+0.008
ater+0.008
 plate+0.008
ossible+0.008
Neg logit contributions
 Tweet-0.018
 Madd-0.017
 Bird-0.017
 Suz-0.017
 shelter-0.016
Tokenents
Token ID658
Feature activation-2.31e+00

Pos logit contributions
pron+0.257
wy+0.241
 forks+0.241
 fork+0.240
 plate+0.238
Neg logit contributions
 Tweet-0.184
 Madd-0.170
 Suz-0.164
 Bird-0.149
 shelter-0.145
Token and
Token ID290
Feature activation-5.28e-02

Pos logit contributions
 plate+0.096
pron+0.095
 forks+0.094
 slice+0.092
 fork+0.088
Neg logit contributions
 Tweet-0.102
 Madd-0.096
 Bird-0.090
 Suz-0.089
 Dawn-0.088
Token tried
Token ID3088
Feature activation-9.60e-02

Pos logit contributions
pron+0.002
 plate+0.002
 forks+0.002
wy+0.002
 slice+0.002
Neg logit contributions
 Tweet-0.002
 Madd-0.002
 Bird-0.002
 Suz-0.002
 shelter-0.002
Token to
Token ID284
Feature activation+5.39e-01

Pos logit contributions
 forks+0.003
 plate+0.003
 slice+0.003
pron+0.003
 fork+0.003
Neg logit contributions
 Tweet-0.005
 Madd-0.005
 Ph-0.004
 Bug-0.004
 Suz-0.004
Token take
Token ID1011
Feature activation+2.13e-01

Pos logit contributions
 Tweet+0.026
 Madd+0.025
 Suz+0.024
 Bird+0.024
 shelter+0.023
Neg logit contributions
pron-0.020
 plate-0.020
 forks-0.019
 slice-0.019
 fork-0.018
Token here
Token ID994
Feature activation+1.13e+00

Pos logit contributions
 Tweet+0.040
 Madd+0.038
 Bird+0.036
 Suz+0.036
 shelter+0.035
Neg logit contributions
pron-0.038
 plate-0.037
 forks-0.037
 slice-0.036
 fork-0.034
Token and
Token ID290
Feature activation-6.30e-01

Pos logit contributions
 Tweet+0.042
 Madd+0.039
 Bird+0.037
 shelter+0.036
 Suz+0.036
Neg logit contributions
pron-0.055
 forks-0.053
 plate-0.052
 slice-0.052
wy-0.052
Token explore
Token ID7301
Feature activation-3.41e-02

Pos logit contributions
pron+0.024
 plate+0.024
 forks+0.024
 slice+0.023
 fork+0.022
Neg logit contributions
 Tweet-0.029
 Madd-0.028
 Suz-0.026
 Bird-0.026
 Dawn-0.025
Token the
Token ID262
Feature activation-2.92e-01

Pos logit contributions
pron+0.001
 forks+0.001
 plate+0.001
 slice+0.001
 fork+0.001
Neg logit contributions
 Tweet-0.002
 Madd-0.002
 Bird-0.001
 Suz-0.001
 shelter-0.001
Token or
Token ID393
Feature activation-1.60e-01

Pos logit contributions
pron+0.014
 plate+0.014
 forks+0.013
 slice+0.013
 fork+0.013
Neg logit contributions
 Tweet-0.012
 Madd-0.011
 Suz-0.010
 Bird-0.010
 Bug-0.010
Tokennam
Token ID7402
Feature activation-5.11e+00

Pos logit contributions
 forks+0.006
pron+0.006
 plate+0.006
 slice+0.006
 fork+0.006
Neg logit contributions
 Tweet-0.007
 Madd-0.007
 Suz-0.007
 Bird-0.007
 shelter-0.006
Tokenents
Token ID658
Feature activation-2.30e+00

Pos logit contributions
pron+0.263
 forks+0.255
wy+0.253
 plate+0.250
 fork+0.250
Neg logit contributions
 Tweet-0.173
 Madd-0.157
 Suz-0.156
 Bird-0.140
 Dawn-0.134
Token."
Token ID526
Feature activation-4.01e-01

Pos logit contributions
pron+0.094
 forks+0.091
 plate+0.091
 slice+0.088
 fork+0.086
Neg logit contributions
 Tweet-0.105
 Madd-0.099
 Bird-0.094
 Suz-0.093
 shelter-0.092
Token
Token ID198
Feature activation-1.52e-01

Pos logit contributions
 plate+0.017
 forks+0.017
pron+0.017
 slice+0.017
 fork+0.016
Neg logit contributions
 Tweet-0.017
 Madd-0.015
 Suz-0.015
 Bird-0.015
 shelter-0.015
Token
Token ID198
Feature activation-3.40e-01

Pos logit contributions
pron+0.007
 plate+0.007
 forks+0.007
 slice+0.007
 fork+0.007
Neg logit contributions
 Tweet-0.006
 Madd-0.005
 Suz-0.005
 Bird-0.005
 shelter-0.005
TokenDa
Token ID26531
Feature activation+3.96e-01

Pos logit contributions
 forks+0.014
pron+0.014
 plate+0.014
 slice+0.013
wy+0.013
Neg logit contributions
 Tweet-0.015
 Madd-0.015
 Bird-0.014
 Suz-0.013
 shelter-0.013
Token to
Token ID284
Feature activation-2.89e-01

Pos logit contributions
 plate+0.045
pron+0.044
 forks+0.044
 slice+0.043
 fork+0.042
Neg logit contributions
 Tweet-0.050
 Madd-0.047
 Bird-0.045
 Bug-0.044
 shelter-0.044
Token help
Token ID1037
Feature activation+1.50e+00

Pos logit contributions
pron+0.012
 forks+0.012
 plate+0.012
 slice+0.012
wy+0.012
Neg logit contributions
 Tweet-0.013
 Madd-0.012
 Suz-0.011
 Bird-0.011
 shelter-0.011
Token sort
Token ID3297
Feature activation-8.30e-02

Pos logit contributions
add+0.083
 Tweet+0.082
 Madd+0.081
utt+0.077
 Fif+0.077
Neg logit contributions
 plate-0.048
 forks-0.040
pron-0.040
lf-0.039
 fork-0.038
Token the
Token ID262
Feature activation+5.98e-02

Pos logit contributions
 forks+0.003
 plate+0.003
 slice+0.003
 fork+0.003
pron+0.003
Neg logit contributions
 Tweet-0.004
 Bird-0.004
 Madd-0.004
 Bug-0.004
 Fif-0.004
Token or
Token ID393
Feature activation+1.46e-01

Pos logit contributions
 Tweet+0.003
 Madd+0.003
 Bird+0.003
 Suz+0.003
 Bug+0.003
Neg logit contributions
 forks-0.002
 plate-0.002
pron-0.002
 slice-0.002
 fork-0.002
Tokennam
Token ID7402
Feature activation-5.09e+00

Pos logit contributions
 Tweet+0.008
 Madd+0.008
 Suz+0.007
 shelter+0.007
 Bird+0.007
Neg logit contributions
pron-0.005
 plate-0.005
wy-0.005
 slice-0.004
 forks-0.004
Tokenents
Token ID658
Feature activation-2.08e+00

Pos logit contributions
pron+0.252
wy+0.242
 fork+0.241
 forks+0.240
 plate+0.240
Neg logit contributions
 Tweet-0.181
 Madd-0.165
 Suz-0.157
 Dawn-0.143
 Bird-0.140
Token.
Token ID13
Feature activation-8.22e-02

Pos logit contributions
 plate+0.078
pron+0.077
 forks+0.076
 slice+0.075
 fork+0.070
Neg logit contributions
 Tweet-0.099
 Madd-0.095
 Bird-0.091
 Bug-0.088
 Suz-0.088
Token She
Token ID1375
Feature activation+1.95e-01

Pos logit contributions
 forks+0.004
 plate+0.004
pron+0.004
ossible+0.004
 slice+0.004
Neg logit contributions
 Tweet-0.003
 Madd-0.003
 shelter-0.002
 Bird-0.002
 Suz-0.002
Token picked
Token ID6497
Feature activation-2.70e-01

Pos logit contributions
 Tweet+0.009
 Madd+0.009
 Bird+0.009
 Suz+0.008
 Bug+0.008
Neg logit contributions
pron-0.008
 forks-0.007
 plate-0.007
 slice-0.007
 fork-0.007
Token up
Token ID510
Feature activation+1.27e+00

Pos logit contributions
 forks+0.010
 plate+0.009
 slice+0.009
pron+0.009
wy+0.009
Neg logit contributions
 Tweet-0.013
 Madd-0.013
 Suz-0.013
 Bird-0.012
 shelter-0.012
Token had
Token ID550
Feature activation-1.00e+00

Pos logit contributions
pron+0.017
 forks+0.017
 plate+0.017
 slice+0.016
 fork+0.016
Neg logit contributions
 Tweet-0.022
 Madd-0.021
 Bird-0.020
 Suz-0.020
 shelter-0.019
Token not
Token ID407
Feature activation-8.46e-01

Pos logit contributions
pron+0.044
 plate+0.043
 forks+0.043
 fork+0.041
 slice+0.041
Neg logit contributions
 Tweet-0.043
 Madd-0.039
 Bird-0.038
 shelter-0.037
 Suz-0.036
Token touched
Token ID12615
Feature activation-4.89e-01

Pos logit contributions
pron+0.033
 forks+0.033
 plate+0.033
 slice+0.032
 fork+0.031
Neg logit contributions
 Tweet-0.039
 Madd-0.038
 Suz-0.035
 Bird-0.035
 Dawn-0.034
Token the
Token ID262
Feature activation-6.46e-01

Pos logit contributions
 forks+0.017
 plate+0.017
 slice+0.016
pron+0.016
 fork+0.016
Neg logit contributions
 Madd-0.024
 Tweet-0.024
 Suz-0.023
 Dawn-0.022
 Ph-0.022
Token or
Token ID393
Feature activation-5.80e-01

Pos logit contributions
pron+0.030
 forks+0.029
wy+0.028
 plate+0.028
 slice+0.028
Neg logit contributions
 Tweet-0.027
 Madd-0.024
 Bird-0.024
 shelter-0.023
 Bug-0.022
Tokennam
Token ID7402
Feature activation-5.09e+00

Pos logit contributions
pron+0.024
 plate+0.023
 forks+0.022
wy+0.022
 slice+0.022
Neg logit contributions
 Tweet-0.027
 Madd-0.024
 Bird-0.024
 Suz-0.023
 shelter-0.023
Tokenents
Token ID658
Feature activation-2.13e+00

Pos logit contributions
pron+0.268
 forks+0.255
 apr+0.254
 fork+0.252
wy+0.252
Neg logit contributions
 Tweet-0.164
 Madd-0.138
 Suz-0.134
 Bird-0.130
 Twe-0.123
Token.
Token ID13
Feature activation+1.53e-01

Pos logit contributions
pron+0.091
 forks+0.089
 plate+0.089
 slice+0.087
 fork+0.084
Neg logit contributions
 Tweet-0.092
 Madd-0.087
 Bird-0.082
 Suz-0.081
 shelter-0.079
Token They
Token ID1119
Feature activation+1.25e-01

Pos logit contributions
 Tweet+0.006
 Madd+0.005
 Suz+0.005
 Dawn+0.005
 Bird+0.005
Neg logit contributions
 forks-0.007
 plate-0.007
pron-0.007
 Adam-0.007
 slice-0.007
Token hoped
Token ID10719
Feature activation+6.04e-01

Pos logit contributions
 Tweet+0.006
 Madd+0.006
 Suz+0.005
 Bird+0.005
 Dawn+0.005
Neg logit contributions
 plate-0.005
 forks-0.005
pron-0.005
 slice-0.005
 fork-0.005
Token Mom
Token ID11254
Feature activation-8.16e-01

Pos logit contributions
 Tweet+0.032
rig+0.031
 Madd+0.031
 Dawn+0.030
 Suz+0.029
Neg logit contributions
 forks-0.020
 plate-0.019
 Adam-0.018
 slice-0.017
pron-0.017
Token on
Token ID319
Feature activation-5.32e-01

Pos logit contributions
pron+0.048
 plate+0.048
 forks+0.048
 slice+0.047
 fork+0.044
Neg logit contributions
 Tweet-0.062
 Madd-0.060
 Bird-0.056
 Suz-0.056
 Dawn-0.055
Token it
Token ID340
Feature activation+3.44e-02

Pos logit contributions
pron+0.020
 forks+0.019
 plate+0.019
 slice+0.019
wy+0.019
Neg logit contributions
 Tweet-0.026
 Madd-0.025
 Bird-0.024
 Suz-0.023
 shelter-0.023
Token.
Token ID13
Feature activation+4.04e-01

Pos logit contributions
 Tweet+0.002
 Madd+0.002
 Suz+0.002
 Dawn+0.002
 Bird+0.002
Neg logit contributions
 plate-0.001
 forks-0.001
pron-0.001
 slice-0.001
 fork-0.001
Token The
Token ID383
Feature activation+1.66e+00

Pos logit contributions
 Tweet+0.016
 Madd+0.015
 Bird+0.015
 shelter+0.014
 Suz+0.014
Neg logit contributions
 plate-0.018
pron-0.018
 forks-0.018
 slice-0.017
ossible-0.017
Token or
Token ID393
Feature activation-1.79e-01

Pos logit contributions
 Tweet+0.064
 Bird+0.056
 shelter+0.055
 Madd+0.055
 Bug+0.053
Neg logit contributions
pron-0.083
wy-0.081
 forks-0.080
 plate-0.078
ucer-0.077
Tokennam
Token ID7402
Feature activation-5.03e+00

Pos logit contributions
pron+0.005
 forks+0.004
 plate+0.004
 slice+0.004
wy+0.004
Neg logit contributions
 Tweet-0.011
 Madd-0.010
 Bird-0.010
 Suz-0.010
 shelter-0.010
Tokenents
Token ID658
Feature activation-2.05e+00

Pos logit contributions
pron+0.250
 forks+0.236
 plate+0.232
 fork+0.231
wy+0.229
Neg logit contributions
 Tweet-0.196
 Madd-0.173
 Suz-0.169
 Bird-0.160
 Kerry-0.152
Token were
Token ID547
Feature activation-1.39e-01

Pos logit contributions
 slice+0.087
 plate+0.086
pron+0.086
 forks+0.086
lf+0.081
Neg logit contributions
 Tweet-0.085
 Madd-0.084
 Dawn-0.077
 Bird-0.077
 Bug-0.076
Token shiny
Token ID22441
Feature activation-2.62e-01

Pos logit contributions
pron+0.006
 forks+0.006
 plate+0.006
 slice+0.006
 fork+0.006
Neg logit contributions
 Tweet-0.006
 Madd-0.005
 Suz-0.005
 Bird-0.005
 shelter-0.005
Token and
Token ID290
Feature activation-6.62e-01

Pos logit contributions
 forks+0.011
 plate+0.011
pron+0.011
 slice+0.011
 fork+0.011
Neg logit contributions
 Tweet-0.011
 Madd-0.011
 Suz-0.010
 Bird-0.010
 Dawn-0.010
Token colorful
Token ID20239
Feature activation-3.46e-01

Pos logit contributions
pron+0.027
 forks+0.027
 plate+0.027
 slice+0.026
 fork+0.025
Neg logit contributions
 Tweet-0.030
 Madd-0.028
 Bird-0.027
 Suz-0.026
 shelter-0.026
Token she
Token ID673
Feature activation+3.26e-01

Pos logit contributions
 Tweet+0.004
 Madd+0.004
rig+0.004
 Bird+0.004
 Bug+0.004
Neg logit contributions
 forks-0.004
 plate-0.004
pron-0.004
lf-0.004
 fork-0.004
Token could
Token ID714
Feature activation+8.30e-01

Pos logit contributions
 Tweet+0.015
 Madd+0.015
 Suz+0.014
 Bird+0.014
 Dawn+0.013
Neg logit contributions
pron-0.013
 forks-0.013
 plate-0.013
 slice-0.012
wy-0.012
Token carry
Token ID3283
Feature activation+1.87e-01

Pos logit contributions
 Tweet+0.038
 Madd+0.036
 Bird+0.034
 Suz+0.034
 shelter+0.033
Neg logit contributions
pron-0.033
 plate-0.033
 forks-0.032
 slice-0.032
 fork-0.031
Token the
Token ID262
Feature activation-6.04e-02

Pos logit contributions
 Tweet+0.009
 Madd+0.009
 Suz+0.008
 Bird+0.008
 Fif+0.008
Neg logit contributions
 forks-0.007
 plate-0.007
pron-0.007
 slice-0.006
 fork-0.006
Token or
Token ID393
Feature activation-3.40e-01

Pos logit contributions
pron+0.003
 forks+0.003
 plate+0.003
wy+0.003
 slice+0.003
Neg logit contributions
 Tweet-0.003
 Madd-0.002
 Bird-0.002
 shelter-0.002
 Bug-0.002
Tokennam
Token ID7402
Feature activation-4.98e+00

Pos logit contributions
pron+0.009
 plate+0.009
 forks+0.009
 slice+0.009
wy+0.009
Neg logit contributions
 Tweet-0.020
 Madd-0.019
 Bird-0.018
 Suz-0.018
 shelter-0.018
Tokenents
Token ID658
Feature activation-2.32e+00

Pos logit contributions
pron+0.244
 forks+0.236
wy+0.234
 fork+0.232
 plate+0.230
Neg logit contributions
 Tweet-0.184
 Madd-0.169
 Suz-0.161
 Bird-0.153
 shelter-0.146
Token back
Token ID736
Feature activation-6.19e-02

Pos logit contributions
pron+0.093
 plate+0.091
 forks+0.091
 slice+0.088
 fork+0.085
Neg logit contributions
 Tweet-0.106
 Madd-0.101
 Bird-0.095
 Suz-0.094
 shelter-0.092
Token with
Token ID351
Feature activation-1.92e-01

Pos logit contributions
 forks+0.002
 plate+0.002
pron+0.002
 slice+0.002
 fork+0.002
Neg logit contributions
 Tweet-0.003
 Madd-0.003
 Bird-0.003
 Suz-0.003
 Dawn-0.003
Token her
Token ID607
Feature activation+3.07e-03

Pos logit contributions
pron+0.008
 forks+0.008
 plate+0.008
 slice+0.007
 fork+0.007
Neg logit contributions
 Tweet-0.009
 Madd-0.008
 Suz-0.008
 Bird-0.008
 Bug-0.008
Token.
Token ID13
Feature activation+2.88e-01

Pos logit contributions
 Tweet+0.000
 Madd+0.000
 Bird+0.000
 shelter+0.000
 Suz+0.000
Neg logit contributions
pron-0.000
wy-0.000
 forks-0.000
 plate-0.000
 slice-0.000
Token she
Token ID673
Feature activation-3.50e-02

Pos logit contributions
 forks+0.041
 plate+0.041
 fork+0.038
 Adam+0.038
 slice+0.037
Neg logit contributions
rig-0.068
 Tweet-0.066
add-0.066
 dam-0.065
 rou-0.064
Token could
Token ID714
Feature activation+6.83e-01

Pos logit contributions
pron+0.001
 forks+0.001
 plate+0.001
 slice+0.001
 fork+0.001
Neg logit contributions
 Tweet-0.002
 Madd-0.001
 Bird-0.001
 Suz-0.001
 shelter-0.001
Token carry
Token ID3283
Feature activation+2.31e-01

Pos logit contributions
 Tweet+0.029
 Madd+0.027
 Bird+0.025
 Suz+0.025
 shelter+0.025
Neg logit contributions
 forks-0.030
pron-0.030
 plate-0.030
wy-0.029
 slice-0.029
Token the
Token ID262
Feature activation+2.98e-02

Pos logit contributions
 Tweet+0.012
 Madd+0.011
 Suz+0.011
rig+0.011
 Bird+0.011
Neg logit contributions
 plate-0.008
 forks-0.008
 slice-0.007
pron-0.007
 apr-0.007
Token or
Token ID393
Feature activation-2.17e-01

Pos logit contributions
 Tweet+0.001
 Madd+0.001
 Bird+0.001
 shelter+0.001
 Bug+0.001
Neg logit contributions
pron-0.001
 forks-0.001
wy-0.001
 plate-0.001
 slice-0.001
Tokennam
Token ID7402
Feature activation-4.93e+00

Pos logit contributions
pron+0.007
 forks+0.007
 plate+0.007
 slice+0.007
wy+0.007
Neg logit contributions
 Tweet-0.012
 Madd-0.011
 Bird-0.010
 Suz-0.010
 shelter-0.010
Tokenents
Token ID658
Feature activation-2.30e+00

Pos logit contributions
pron+0.237
 forks+0.229
wy+0.227
 fork+0.224
 plate+0.223
Neg logit contributions
 Tweet-0.189
 Madd-0.176
 Suz-0.168
 Bird-0.158
 shelter-0.150
Token to
Token ID284
Feature activation-2.53e-02

Pos logit contributions
pron+0.091
 plate+0.090
 forks+0.089
 slice+0.087
 fork+0.083
Neg logit contributions
 Tweet-0.105
 Madd-0.101
 Bird-0.095
 Suz-0.094
 Dawn-0.092
Token the
Token ID262
Feature activation-3.41e-01

Pos logit contributions
pron+0.001
 plate+0.001
 forks+0.001
 slice+0.001
 fork+0.001
Neg logit contributions
 Tweet-0.001
 Madd-0.001
 Bird-0.001
 Suz-0.001
 Bug-0.001
Token place
Token ID1295
Feature activation+1.77e-02

Pos logit contributions
pron+0.015
 forks+0.014
wy+0.014
 plate+0.014
 slice+0.014
Neg logit contributions
 Tweet-0.015
 Madd-0.014
 shelter-0.013
 Bird-0.013
 Bug-0.013
Token.
Token ID13
Feature activation+4.89e-01

Pos logit contributions
 Tweet+0.001
 Madd+0.001
 Bird+0.001
 Bug+0.001
 Fif+0.001
Neg logit contributions
 plate-0.001
pron-0.001
 forks-0.001
 slice-0.001
 fork-0.001
Token he
Token ID339
Feature activation-1.34e-01

Pos logit contributions
pron+0.031
 plate+0.030
 forks+0.030
 slice+0.029
wy+0.029
Neg logit contributions
 Tweet-0.028
 Madd-0.027
 Suz-0.025
 Bird-0.025
 shelter-0.024
Token especially
Token ID2592
Feature activation-2.57e-01

Pos logit contributions
pron+0.006
 plate+0.006
 forks+0.006
 slice+0.005
 fork+0.005
Neg logit contributions
 Tweet-0.006
 Madd-0.005
 Dawn-0.005
 Suz-0.005
 Bird-0.005
Token liked
Token ID8288
Feature activation-1.50e+00

Pos logit contributions
pron+0.010
 forks+0.009
 plate+0.009
 slice+0.009
wy+0.009
Neg logit contributions
 Tweet-0.013
 Madd-0.012
 Suz-0.012
 Bird-0.011
 Dawn-0.011
Token the
Token ID262
Feature activation-7.25e-01

Pos logit contributions
 forks+0.061
pron+0.059
 plate+0.059
 slice+0.056
 fork+0.056
Neg logit contributions
 Tweet-0.068
 Madd-0.066
 Suz-0.062
 Dawn-0.062
 Bird-0.061
Token or
Token ID393
Feature activation-4.31e-01

Pos logit contributions
 plate+0.031
pron+0.031
 forks+0.030
 fork+0.030
 slice+0.029
Neg logit contributions
 Tweet-0.033
 Madd-0.032
 Suz-0.029
 Dawn-0.029
 Fif-0.029
Tokennam
Token ID7402
Feature activation-4.87e+00

Pos logit contributions
pron+0.017
 forks+0.017
 plate+0.017
 slice+0.016
wy+0.016
Neg logit contributions
 Tweet-0.020
 Madd-0.019
 Bird-0.018
 Suz-0.018
 shelter-0.018
Tokenents
Token ID658
Feature activation-1.95e+00

Pos logit contributions
pron+0.244
 forks+0.239
 plate+0.236
 fork+0.235
wy+0.230
Neg logit contributions
 Tweet-0.161
 Madd-0.142
 Suz-0.142
 Twe-0.128
 Kel-0.125
Token he
Token ID339
Feature activation-7.41e-01

Pos logit contributions
pron+0.075
 plate+0.074
 forks+0.074
 slice+0.072
wy+0.069
Neg logit contributions
 Tweet-0.092
 Madd-0.089
 Suz-0.083
 Bird-0.083
 Dawn-0.081
Token kept
Token ID4030
Feature activation-4.23e-02

Pos logit contributions
pron+0.031
 plate+0.030
 forks+0.030
 slice+0.029
wy+0.028
Neg logit contributions
 Tweet-0.033
 Madd-0.032
 Suz-0.030
 Bird-0.030
 shelter-0.029
Token around
Token ID1088
Feature activation-1.31e+00

Pos logit contributions
pron+0.002
 forks+0.002
 plate+0.002
 slice+0.002
 fork+0.002
Neg logit contributions
 Tweet-0.002
 Madd-0.002
 Bird-0.002
 Suz-0.002
 shelter-0.002
Token.
Token ID13
Feature activation+2.02e-01

Pos logit contributions
pron+0.057
 forks+0.055
 plate+0.055
 slice+0.054
wy+0.053
Neg logit contributions
 Tweet-0.057
 Madd-0.053
 Bird-0.050
 Suz-0.050
 shelter-0.049
Token She
Token ID1375
Feature activation+6.94e-01

Pos logit contributions
 Madd+0.006
 Tweet+0.006
 shelter+0.005
 Suz+0.005
 Bird+0.005
Neg logit contributions
 forks-0.009
pron-0.009
 plate-0.009
ossible-0.009
 slice-0.009
Token learned
Token ID4499
Feature activation-7.27e-01

Pos logit contributions
 Madd+0.035
 Tweet+0.034
 Dawn+0.032
 Suz+0.032
 Kerry+0.031
Neg logit contributions
 plate-0.026
 slice-0.025
 forks-0.025
pron-0.025
 fork-0.024
Token that
Token ID326
Feature activation+1.06e+00

Pos logit contributions
 forks+0.024
 plate+0.024
pron+0.023
ossible+0.023
 slice+0.023
Neg logit contributions
 Madd-0.036
 Tweet-0.035
 Suz-0.034
 Bird-0.033
 Bug-0.033
Token the
Token ID262
Feature activation+2.24e-01

Pos logit contributions
rig+0.053
 Kerry+0.048
 Dawn+0.048
add+0.048
 Madd+0.047
Neg logit contributions
 forks-0.040
 plate-0.036
 fork-0.035
lf-0.035
 slice-0.034
Token or
Token ID393
Feature activation-3.07e-02

Pos logit contributions
 Tweet+0.010
 Madd+0.010
 Suz+0.010
 Dawn+0.009
 Beth+0.009
Neg logit contributions
 plate-0.009
 forks-0.009
pron-0.009
 fork-0.009
 slice-0.009
Tokennam
Token ID7402
Feature activation-4.85e+00

Pos logit contributions
 forks+0.001
pron+0.001
 plate+0.001
 slice+0.001
 fork+0.001
Neg logit contributions
 Tweet-0.001
 Madd-0.001
 Suz-0.001
 Bird-0.001
 shelter-0.001
Tokenents
Token ID658
Feature activation-2.01e+00

Pos logit contributions
pron+0.226
 forks+0.214
 plate+0.213
 fork+0.209
wy+0.207
Neg logit contributions
 Tweet-0.197
 Madd-0.181
 Suz-0.175
 Bird-0.163
 shelter-0.159
Token were
Token ID547
Feature activation-1.01e-01

Pos logit contributions
 forks+0.066
 slice+0.065
 plate+0.065
pron+0.063
wy+0.060
Neg logit contributions
 Madd-0.101
 Tweet-0.100
 Dawn-0.096
 Kerry-0.094
 Fif-0.094
Token not
Token ID407
Feature activation-1.37e+00

Pos logit contributions
 forks+0.004
 slice+0.004
pron+0.004
wy+0.004
 plate+0.004
Neg logit contributions
 Madd-0.005
 Suz-0.004
 Tweet-0.004
 Ph-0.004
 Dawn-0.004
Token for
Token ID329
Feature activation-7.00e-01

Pos logit contributions
 forks+0.053
 slice+0.050
 plate+0.050
pron+0.049
 fork+0.048
Neg logit contributions
 Madd-0.063
 Suz-0.058
 Dawn-0.058
 Tweet-0.058
 Kerry-0.055
Token playing
Token ID2712
Feature activation-9.25e-01

Pos logit contributions
 forks+0.027
 plate+0.026
pron+0.026
 slice+0.024
 fork+0.024
Neg logit contributions
 Tweet-0.033
 Madd-0.032
 Suz-0.030
 Bird-0.030
 Dawn-0.029
TokenBe
Token ID3856
Feature activation+2.25e+00

Pos logit contributions
 Tweet+0.005
 Madd+0.005
 Bird+0.005
 Suz+0.005
 shelter+0.004
Neg logit contributions
pron-0.006
 plate-0.006
 forks-0.006
 slice-0.006
 fork-0.006
Token careful
Token ID8161
Feature activation-1.42e+00

Pos logit contributions
 Tweet+0.095
 Madd+0.088
 Bird+0.086
 shelter+0.084
 Suz+0.083
Neg logit contributions
pron-0.098
 plate-0.098
 forks-0.098
 slice-0.093
 fork-0.092
Token,
Token ID11
Feature activation-1.04e+00

Pos logit contributions
pron+0.061
 forks+0.059
 plate+0.059
 slice+0.058
 fork+0.056
Neg logit contributions
 Tweet-0.062
 Madd-0.059
 Bird-0.055
 Suz-0.055
 shelter-0.053
Token these
Token ID777
Feature activation-7.78e-01

Pos logit contributions
 forks+0.044
pron+0.044
 plate+0.044
 slice+0.042
 fork+0.041
Neg logit contributions
 Tweet-0.045
 Madd-0.043
 Suz-0.040
 Bird-0.040
 shelter-0.039
Token or
Token ID393
Feature activation-2.93e-01

Pos logit contributions
 forks+0.028
 plate+0.027
pron+0.027
 slice+0.026
 fork+0.025
Neg logit contributions
 Tweet-0.039
 Madd-0.038
 Suz-0.036
 Dawn-0.034
 Bird-0.034
Tokennam
Token ID7402
Feature activation-4.83e+00

Pos logit contributions
pron+0.011
 plate+0.011
 forks+0.011
 slice+0.010
wy+0.010
Neg logit contributions
 Tweet-0.014
 Madd-0.014
 Bird-0.013
 Suz-0.013
 shelter-0.013
Tokenents
Token ID658
Feature activation-1.86e+00

Pos logit contributions
pron+0.261
 plate+0.245
 fork+0.243
 forks+0.240
 apr+0.239
Neg logit contributions
 Tweet-0.154
 Madd-0.131
 Suz-0.126
 Bird-0.125
 Kerry-0.117
Token are
Token ID389
Feature activation+1.97e-02

Pos logit contributions
pron+0.076
 forks+0.075
 plate+0.075
 slice+0.073
wy+0.071
Neg logit contributions
 Tweet-0.083
 Madd-0.079
 Suz-0.074
 Bird-0.074
 shelter-0.072
Token very
Token ID845
Feature activation-1.71e-01

Pos logit contributions
 Tweet+0.001
 Madd+0.001
 Suz+0.001
 Bird+0.001
 shelter+0.001
Neg logit contributions
pron-0.001
 forks-0.001
 plate-0.001
 slice-0.001
wy-0.001
Token special
Token ID2041
Feature activation+6.21e-01

Pos logit contributions
pron+0.008
 plate+0.008
 forks+0.008
 slice+0.008
 fork+0.008
Neg logit contributions
 Tweet-0.007
 Madd-0.006
 Bird-0.006
 Suz-0.006
 shelter-0.006
Token.
Token ID13
Feature activation-8.34e-01

Pos logit contributions
 Tweet+0.029
 Madd+0.027
 Suz+0.026
 Bird+0.026
 Dawn+0.025
Neg logit contributions
 forks-0.025
pron-0.025
 plate-0.025
 slice-0.023
 fork-0.023
Token the
Token ID262
Feature activation+5.28e-01

Pos logit contributions
pron+0.016
wy+0.014
lf+0.014
 plate+0.014
ater+0.014
Neg logit contributions
 Tweet-0.013
 Madd-0.011
 shelter-0.011
 Suz-0.011
 Bird-0.011
Token lock
Token ID5793
Feature activation-7.35e-01

Pos logit contributions
 Tweet+0.026
 Madd+0.025
 shelter+0.024
 Bird+0.024
 Suz+0.024
Neg logit contributions
pron-0.019
wy-0.018
 forks-0.017
lf-0.017
 slice-0.017
Token and
Token ID290
Feature activation+4.01e-01

Pos logit contributions
pron+0.033
 forks+0.032
 plate+0.032
 slice+0.031
wy+0.031
Neg logit contributions
 Tweet-0.030
 Madd-0.028
 Suz-0.027
 Bird-0.027
 shelter-0.026
Token the
Token ID262
Feature activation+9.35e-01

Pos logit contributions
 Tweet+0.017
 Madd+0.016
 Suz+0.015
 Bird+0.015
 shelter+0.015
Neg logit contributions
pron-0.018
 plate-0.017
 forks-0.017
wy-0.016
 slice-0.016
Token or
Token ID393
Feature activation+3.57e-01

Pos logit contributions
 Tweet+0.047
 Madd+0.044
 shelter+0.043
 Bird+0.043
 Suz+0.042
Neg logit contributions
pron-0.033
wy-0.032
 forks-0.031
 plate-0.031
 slice-0.030
Tokennam
Token ID7402
Feature activation-4.81e+00

Pos logit contributions
 Tweet+0.015
 Madd+0.014
 Suz+0.013
 Bird+0.013
 shelter+0.013
Neg logit contributions
pron-0.016
wy-0.015
 plate-0.015
ater-0.015
 slice-0.014
Tokenents
Token ID658
Feature activation-1.90e+00

Pos logit contributions
pron+0.261
wy+0.242
 forks+0.238
 apr+0.235
 plate+0.234
Neg logit contributions
 Tweet-0.152
 Suz-0.138
 Madd-0.134
 shelter-0.117
 Bird-0.116
Token.
Token ID13
Feature activation+2.69e-01

Pos logit contributions
pron+0.082
 forks+0.080
 plate+0.079
 slice+0.077
wy+0.076
Neg logit contributions
 Tweet-0.082
 Madd-0.077
 Suz-0.073
 Bird-0.073
 shelter-0.071
Token
Token ID198
Feature activation+4.71e-02

Pos logit contributions
 Tweet+0.011
 Madd+0.011
 shelter+0.010
 Bug+0.010
 Bird+0.010
Neg logit contributions
pron-0.012
 forks-0.012
 plate-0.012
 slice-0.011
 fork-0.011
Token
Token ID198
Feature activation-1.61e-01

Pos logit contributions
 Tweet+0.002
 Madd+0.002
 Bird+0.001
 Suz+0.001
 shelter+0.001
Neg logit contributions
pron-0.002
 forks-0.002
 plate-0.002
 slice-0.002
 fork-0.002
Token"
Token ID1
Feature activation-1.30e+00

Pos logit contributions
 plate+0.007
wy+0.007
pron+0.007
 fork+0.007
 forks+0.007
Neg logit contributions
 Tweet-0.006
Su-0.006
 Dog-0.006
 Suz-0.006
 Kel-0.006
Token him
Token ID683
Feature activation+2.66e-01

Pos logit contributions
 plate+0.007
pron+0.007
 forks+0.007
 slice+0.007
 fork+0.007
Neg logit contributions
 Tweet-0.009
 Madd-0.009
 Suz-0.008
 Bird-0.008
 shelter-0.008
Token.
Token ID13
Feature activation+2.56e-01

Pos logit contributions
 Tweet+0.014
 Madd+0.014
 Dawn+0.013
 shelter+0.013
 Bug+0.013
Neg logit contributions
 plate-0.008
 forks-0.008
 fork-0.007
 slice-0.007
pron-0.007
Token
Token ID198
Feature activation+1.83e-01

Pos logit contributions
 Tweet+0.010
 Madd+0.009
 Bird+0.009
 Suz+0.008
 shelter+0.008
Neg logit contributions
pron-0.012
 forks-0.012
 plate-0.012
 slice-0.012
wy-0.012
Token
Token ID198
Feature activation-2.50e-02

Pos logit contributions
 Tweet+0.007
 Madd+0.006
 Bird+0.006
 Suz+0.006
 shelter+0.006
Neg logit contributions
pron-0.009
 plate-0.009
 forks-0.009
 slice-0.009
 fork-0.008
Token"
Token ID1
Feature activation-1.26e+00

Pos logit contributions
 forks+0.001
wy+0.001
 slice+0.001
nam+0.001
 apr+0.001
Neg logit contributions
 Tweet-0.001
 Dog-0.001
 Madd-0.001
 Bunny-0.001
 shelter-0.001
TokenBe
Token ID3856
Feature activation+2.38e+00

Pos logit contributions
pron+0.056
 forks+0.056
 plate+0.055
 slice+0.054
wy+0.053
Neg logit contributions
 Tweet-0.052
 Madd-0.051
 Bird-0.048
 Suz-0.047
 Beth-0.045
Token careful
Token ID8161
Feature activation-1.30e+00

Pos logit contributions
 Tweet+0.116
 Madd+0.110
 Bird+0.104
 Suz+0.104
 shelter+0.102
Neg logit contributions
pron-0.090
 forks-0.089
 plate-0.089
 slice-0.085
 fork-0.083
Token!
Token ID0
Feature activation-7.81e-01

Pos logit contributions
pron+0.060
 forks+0.059
wy+0.058
 plate+0.057
 slice+0.057
Neg logit contributions
 Tweet-0.054
 Madd-0.051
 Bird-0.049
 Suz-0.049
 shelter-0.045
Token There
Token ID1318
Feature activation+1.30e+00

Pos logit contributions
 plate+0.030
 forks+0.030
pron+0.029
 slice+0.028
 fork+0.028
Neg logit contributions
 Tweet-0.035
 Madd-0.034
 shelter-0.032
 Dawn-0.031
 Suz-0.031
Token might
Token ID1244
Feature activation+4.65e-01

Pos logit contributions
 Madd+0.068
 Tweet+0.067
 Dawn+0.065
 Fif+0.063
 Suz+0.062
Neg logit contributions
 plate-0.040
 forks-0.040
 fork-0.037
 slice-0.035
 dice-0.035
Token be
Token ID307
Feature activation-2.89e-01

Pos logit contributions
 Madd+0.023
 Tweet+0.022
 Dawn+0.022
 Suz+0.022
add+0.021
Neg logit contributions
 plate-0.015
 forks-0.014
 slice-0.014
 fork-0.014
lf-0.013
Token's
Token ID338
Feature activation-3.78e-01

Pos logit contributions
 plate+0.017
pron+0.016
 forks+0.016
 fork+0.015
lf+0.015
Neg logit contributions
 Tweet-0.023
 Madd-0.023
 Suz-0.022
 Dawn-0.021
 shelter-0.020
Token mom
Token ID1995
Feature activation+1.91e-01

Pos logit contributions
pron+0.013
 plate+0.013
 forks+0.013
 slice+0.012
 fork+0.012
Neg logit contributions
 Tweet-0.020
 Madd-0.019
 Suz-0.018
 Bird-0.018
 Bug-0.017
Token took
Token ID1718
Feature activation+1.43e-01

Pos logit contributions
 Tweet+0.010
 Madd+0.010
 Fif+0.009
 Dawn+0.009
 Bug+0.009
Neg logit contributions
pron-0.007
 plate-0.006
 slice-0.006
 forks-0.006
lf-0.006
Token a
Token ID257
Feature activation+3.24e-03

Pos logit contributions
 Tweet+0.007
 Madd+0.007
 Bees+0.007
rig+0.006
 Bird+0.006
Neg logit contributions
 forks-0.005
 plate-0.005
 slice-0.005
 apr-0.004
 dice-0.004
Token photo
Token ID4590
Feature activation-9.12e-01

Pos logit contributions
 Tweet+0.000
 Madd+0.000
 Bird+0.000
 Suz+0.000
 shelter+0.000
Neg logit contributions
pron-0.000
 plate-0.000
 forks-0.000
 slice-0.000
 fork-0.000
Token of
Token ID286
Feature activation+2.33e+00

Pos logit contributions
 plate+0.045
pron+0.045
 forks+0.044
 slice+0.043
 fork+0.042
Neg logit contributions
 Tweet-0.034
 Madd-0.032
 Bird-0.029
 Suz-0.029
 Dawn-0.029
Token him
Token ID683
Feature activation+8.49e-01

Pos logit contributions
 Tweet+0.094
 Madd+0.093
add+0.086
rig+0.086
 Dawn+0.085
Neg logit contributions
 plate-0.099
 forks-0.098
pron-0.096
 slice-0.095
 fork-0.093
Token when
Token ID618
Feature activation+1.54e-01

Pos logit contributions
 Tweet+0.040
 Madd+0.039
 Suz+0.036
 Bird+0.035
 Dawn+0.035
Neg logit contributions
 plate-0.034
pron-0.033
 forks-0.032
 slice-0.031
 fork-0.031
Token he
Token ID339
Feature activation-2.46e-02

Pos logit contributions
 Tweet+0.007
 Madd+0.007
rig+0.007
 Bird+0.007
 shelter+0.007
Neg logit contributions
pron-0.005
 forks-0.005
 plate-0.005
lf-0.005
 slice-0.005
Token fell
Token ID3214
Feature activation+3.38e-01

Pos logit contributions
pron+0.001
 plate+0.001
 slice+0.001
 forks+0.001
lf+0.001
Neg logit contributions
 Tweet-0.001
 Madd-0.001
 Fif-0.001
 Suz-0.001
 Bug-0.001
Token.
Token ID13
Feature activation-1.17e-01

Pos logit contributions
 Tweet+0.017
 Madd+0.017
 Bird+0.016
 Suz+0.016
 Dawn+0.015
Neg logit contributions
pron-0.012
 plate-0.011
 forks-0.011
 slice-0.011
 fork-0.010
Token her
Token ID607
Feature activation-3.31e-01

Pos logit contributions
pron+0.033
 plate+0.032
 forks+0.032
 slice+0.031
 fork+0.030
Neg logit contributions
 Tweet-0.035
 Madd-0.033
 Bird-0.031
 Suz-0.031
 Bug-0.030
Token goal
Token ID3061
Feature activation-5.78e-01

Pos logit contributions
pron+0.012
 plate+0.012
 forks+0.012
 slice+0.012
 fork+0.012
Neg logit contributions
 Tweet-0.016
 Madd-0.016
 Bird-0.015
 Suz-0.015
 Bug-0.014
Token!
Token ID0
Feature activation+1.51e-01

Pos logit contributions
pron+0.024
 forks+0.024
 plate+0.024
 slice+0.023
 fork+0.022
Neg logit contributions
 Tweet-0.026
 Madd-0.024
 Bird-0.023
 Suz-0.023
 shelter-0.022
Token<|endoftext|>
Token ID50256
Feature activation-4.99e-01

Pos logit contributions
 Tweet+0.006
 Madd+0.005
 Bird+0.005
 Suz+0.005
 shelter+0.005
Neg logit contributions
pron-0.007
 plate-0.007
 forks-0.007
 slice-0.007
 fork-0.007
TokenJ
Token ID41
Feature activation-1.16e+00

Pos logit contributions
 plate+0.019
 forks+0.019
pron+0.019
 fork+0.018
 slice+0.018
Neg logit contributions
 Tweet-0.023
 Madd-0.022
 Suz-0.020
 Bird-0.020
 shelter-0.020
Tokenimm
Token ID8608
Feature activation+2.80e+00

Pos logit contributions
pron+0.045
 forks+0.043
 plate+0.042
 slice+0.042
lf+0.042
Neg logit contributions
 Madd-0.053
 Tweet-0.052
 Suz-0.051
 Dawn-0.048
 Beth-0.047
Tokenie
Token ID494
Feature activation+9.08e-01

Pos logit contributions
 Madd+0.079
 Suz+0.075
 Tweet+0.074
 Dawn+0.066
 Beth+0.065
Neg logit contributions
pron-0.162
 plate-0.157
 slice-0.156
epad-0.150
 tripod-0.148
Token was
Token ID373
Feature activation+2.87e-01

Pos logit contributions
 Tweet+0.032
 Madd+0.030
 Bird+0.027
 Suz+0.027
 shelter+0.026
Neg logit contributions
pron-0.046
 forks-0.046
 plate-0.046
 slice-0.044
wy-0.043
Token scared
Token ID12008
Feature activation+7.28e-01

Pos logit contributions
 Tweet+0.013
 Madd+0.012
 Bird+0.011
 shelter+0.011
 Fif+0.011
Neg logit contributions
pron-0.012
 plate-0.012
 forks-0.011
 slice-0.011
 fork-0.011
Token.
Token ID13
Feature activation+2.97e-01

Pos logit contributions
 Tweet+0.033
 Madd+0.032
 Bird+0.030
 Fif+0.029
 Suz+0.029
Neg logit contributions
 plate-0.029
pron-0.029
 forks-0.029
 slice-0.027
 fork-0.027
Token He
Token ID679
Feature activation-3.43e-01

Pos logit contributions
 Tweet+0.011
 Madd+0.010
 shelter+0.010
 Bird+0.010
 Bug+0.010
Neg logit contributions
 forks-0.014
 plate-0.014
pron-0.014
 fork-0.013
 Adam-0.013
Token sudden
Token ID4802
Feature activation+7.44e-01

Pos logit contributions
 Tweet+0.030
 Madd+0.028
 Suz+0.027
 Bird+0.026
 Ph+0.026
Neg logit contributions
pron-0.030
 plate-0.029
 slice-0.027
 forks-0.027
 fork-0.027
Token he
Token ID339
Feature activation-3.05e-01

Pos logit contributions
 Tweet+0.036
 Madd+0.036
rig+0.034
 Dawn+0.034
 Suz+0.033
Neg logit contributions
 forks-0.026
 plate-0.024
 fork-0.024
lf-0.024
pron-0.024
Token heard
Token ID2982
Feature activation+4.39e-01

Pos logit contributions
 plate+0.010
pron+0.009
 forks+0.009
 fork+0.009
 slice+0.009
Neg logit contributions
 Tweet-0.017
 Madd-0.017
 Suz-0.015
 Kel-0.015
 Bird-0.015
Token a
Token ID257
Feature activation-2.58e-01

Pos logit contributions
 Madd+0.022
 Tweet+0.022
 Dawn+0.021
 Fif+0.020
 Bug+0.020
Neg logit contributions
 plate-0.014
lf-0.014
 forks-0.014
 fork-0.014
pron-0.013
Token quiet
Token ID5897
Feature activation+2.02e-01

Pos logit contributions
 plate+0.011
pron+0.010
 fork+0.010
 slice+0.010
lf+0.009
Neg logit contributions
 Tweet-0.012
 Madd-0.011
 Suz-0.011
 Dawn-0.011
 Ph-0.011
Token voice
Token ID3809
Feature activation+2.37e+00

Pos logit contributions
 Tweet+0.008
 Madd+0.008
add+0.007
 Dawn+0.007
 Fif+0.007
Neg logit contributions
 plate-0.009
pron-0.009
lf-0.009
 slice-0.008
 fork-0.008
Token calling
Token ID4585
Feature activation+4.14e-01

Pos logit contributions
 Tweet+0.120
 Madd+0.117
 Bug+0.109
 Dawn+0.107
add+0.107
Neg logit contributions
lf-0.075
 plate-0.074
 slice-0.073
ged-0.068
 tripod-0.067
Token:
Token ID25
Feature activation-5.22e-02

Pos logit contributions
 Tweet+0.019
 Madd+0.018
 Bird+0.017
 Suz+0.017
 shelter+0.017
Neg logit contributions
pron-0.016
 plate-0.016
 forks-0.016
 slice-0.015
 fork-0.015
Token "
Token ID366
Feature activation+5.13e-01

Pos logit contributions
pron+0.002
 plate+0.002
 forks+0.002
 slice+0.002
 fork+0.002
Neg logit contributions
 Tweet-0.003
 Madd-0.003
 shelter-0.003
 Suz-0.002
 Dawn-0.002
TokenExc
Token ID40127
Feature activation+1.98e+00

Pos logit contributions
 Tweet+0.022
 Madd+0.021
 Suz+0.020
 Bird+0.020
 shelter+0.019
Neg logit contributions
pron-0.022
 plate-0.021
 forks-0.021
 slice-0.021
wy-0.020
Tokenuse
Token ID1904
Feature activation-1.25e-01

Pos logit contributions
 Tweet+0.126
 Madd+0.120
 Bird+0.116
 Suz+0.116
 shelter+0.114
Neg logit contributions
pron-0.045
 forks-0.043
 plate-0.043
 slice-0.040
 fork-0.039
Token day
Token ID1110
Feature activation+1.92e-01

Pos logit contributions
 Tweet+0.049
 Madd+0.046
 Bird+0.045
 Bug+0.044
 Suz+0.043
Neg logit contributions
 plate-0.039
 slice-0.039
 forks-0.038
pron-0.036
 fork-0.035
Token warm
Token ID5814
Feature activation+7.49e-01

Pos logit contributions
 Tweet+0.008
 Madd+0.008
 Suz+0.007
 Bird+0.007
 shelter+0.007
Neg logit contributions
pron-0.009
 forks-0.008
 plate-0.008
wy-0.008
 slice-0.008
Token and
Token ID290
Feature activation+1.13e-01

Pos logit contributions
 Tweet+0.037
 Madd+0.036
 Bird+0.034
 Suz+0.033
 Dawn+0.033
Neg logit contributions
 plate-0.027
 forks-0.027
pron-0.026
 slice-0.025
 fork-0.025
Token happy
Token ID3772
Feature activation-2.87e-02

Pos logit contributions
 Tweet+0.005
 Madd+0.005
 Bird+0.005
 Suz+0.004
 Bug+0.004
Neg logit contributions
pron-0.005
 forks-0.005
 plate-0.005
 slice-0.005
 fork-0.004
Token.
Token ID13
Feature activation+6.96e-01

Pos logit contributions
 forks+0.001
 plate+0.001
pron+0.001
 slice+0.001
 fork+0.001
Neg logit contributions
 Tweet-0.001
 Madd-0.001
 Bird-0.001
 Suz-0.001
 Bug-0.001
Token So
Token ID1406
Feature activation+2.45e+00

Pos logit contributions
 Tweet+0.028
 Madd+0.026
 shelter+0.026
 Bird+0.025
 Suz+0.024
Neg logit contributions
 forks-0.030
 plate-0.029
ossible-0.029
 Adam-0.029
 fork-0.028
Token he
Token ID339
Feature activation+2.72e-01

Pos logit contributions
 Tweet+0.119
 Madd+0.114
 Bird+0.108
 shelter+0.107
 Suz+0.107
Neg logit contributions
 forks-0.086
 plate-0.084
pron-0.082
 fork-0.081
 slice-0.080
Token grabbed
Token ID13176
Feature activation-8.07e-02

Pos logit contributions
 Tweet+0.013
 Madd+0.013
 Suz+0.012
 Bird+0.012
 Kel+0.012
Neg logit contributions
pron-0.010
 plate-0.010
 forks-0.010
 slice-0.010
 fork-0.009
Token the
Token ID262
Feature activation+5.37e-01

Pos logit contributions
pron+0.004
 plate+0.003
 forks+0.003
 slice+0.003
lf+0.003
Neg logit contributions
 Tweet-0.003
 Madd-0.003
 Suz-0.003
 Bird-0.003
 shelter-0.003
Token sack
Token ID23704
Feature activation-1.61e-01

Pos logit contributions
 Tweet+0.023
 Madd+0.022
 Suz+0.020
 Bird+0.020
 shelter+0.020
Neg logit contributions
pron-0.023
lf-0.023
Adam-0.022
wy-0.022
 tripod-0.022
Token,
Token ID11
Feature activation+4.09e-01

Pos logit contributions
pron+0.008
 forks+0.008
 plate+0.008
 slice+0.008
wy+0.008
Neg logit contributions
 Tweet-0.006
 Madd-0.005
 Bird-0.005
 Suz-0.005
 shelter-0.005
Token two
Token ID734
Feature activation-3.98e-01

Pos logit contributions
 Madd+0.042
 Tweet+0.041
 shelter+0.038
 Bird+0.038
 Suz+0.037
Neg logit contributions
ucer-0.036
pron-0.036
 tripod-0.036
wy-0.036
ater-0.035
Token friends
Token ID2460
Feature activation-1.03e+00

Pos logit contributions
 forks+0.018
pron+0.018
 plate+0.018
 slice+0.017
 fork+0.017
Neg logit contributions
 Tweet-0.017
 Madd-0.015
 Bird-0.015
 Suz-0.014
 shelter-0.014
Token laughed
Token ID13818
Feature activation-1.10e+00

Pos logit contributions
pron+0.047
 plate+0.045
 forks+0.044
wy+0.043
 slice+0.043
Neg logit contributions
 Tweet-0.042
 Madd-0.041
 Suz-0.039
 Bird-0.038
 shelter-0.037
Token and
Token ID290
Feature activation-5.60e-02

Pos logit contributions
pron+0.048
 plate+0.048
 forks+0.048
 slice+0.046
 fork+0.045
Neg logit contributions
 Tweet-0.046
 Madd-0.043
 Bird-0.041
 Suz-0.041
 shelter-0.040
Token celebrated
Token ID13943
Feature activation-1.29e+00

Pos logit contributions
pron+0.002
 forks+0.002
 plate+0.002
 slice+0.002
 fork+0.002
Neg logit contributions
 Tweet-0.003
 Madd-0.002
 Bird-0.002
 shelter-0.002
 Suz-0.002
Token their
Token ID511
Feature activation-2.97e-01

Pos logit contributions
 forks+0.049
 plate+0.048
pron+0.048
 slice+0.046
 fork+0.045
Neg logit contributions
 Tweet-0.063
 Madd-0.059
 Bird-0.057
 Bug-0.056
 Suz-0.055
Token victory
Token ID5373
Feature activation-1.31e-01

Pos logit contributions
pron+0.012
 forks+0.012
 plate+0.012
 slice+0.012
 fork+0.011
Neg logit contributions
 Tweet-0.014
 Madd-0.013
 Bird-0.012
 Suz-0.012
 shelter-0.012
Token.
Token ID13
Feature activation+3.74e-01

Pos logit contributions
pron+0.006
 forks+0.005
 plate+0.005
 slice+0.005
wy+0.005
Neg logit contributions
 Tweet-0.006
 Madd-0.006
 Bird-0.005
 Suz-0.005
 shelter-0.005
Token From
Token ID3574
Feature activation-8.03e-02

Pos logit contributions
 Tweet+0.016
 Madd+0.015
 Bird+0.014
 shelter+0.014
 Suz+0.014
Neg logit contributions
pron-0.016
 forks-0.016
 plate-0.016
 slice-0.015
 fork-0.015
Token then
Token ID788
Feature activation+4.20e-01

Pos logit contributions
pron+0.003
 forks+0.003
 plate+0.003
 slice+0.003
 fork+0.003
Neg logit contributions
 Tweet-0.004
 Madd-0.004
 Bird-0.004
 Suz-0.004
 shelter-0.004
Token on
Token ID319
Feature activation-4.96e-01

Pos logit contributions
 Tweet+0.019
 Madd+0.018
 Bird+0.017
 Suz+0.017
 shelter+0.017
Neg logit contributions
pron-0.017
 forks-0.017
 plate-0.016
 slice-0.016
 fork-0.016
Token time
Token ID640
Feature activation+3.36e-01

Pos logit contributions
 Tweet+0.023
 Madd+0.022
 Bird+0.020
 Suz+0.020
 shelter+0.020
Neg logit contributions
 forks-0.026
pron-0.026
 plate-0.025
 slice-0.024
wy-0.024
Token,
Token ID11
Feature activation+5.79e-01

Pos logit contributions
 Tweet+0.015
 Madd+0.015
 Bird+0.014
 Suz+0.014
 shelter+0.014
Neg logit contributions
pron-0.013
 forks-0.013
 plate-0.013
 slice-0.012
 fork-0.012
Token there
Token ID612
Feature activation+1.68e+00

Pos logit contributions
 Tweet+0.026
 Madd+0.025
 Bird+0.023
 Suz+0.023
 shelter+0.023
Neg logit contributions
pron-0.023
 forks-0.023
 plate-0.023
 slice-0.022
 fork-0.021
Token was
Token ID373
Feature activation+6.18e-01

Pos logit contributions
 Tweet+0.066
rig+0.064
 rou+0.062
 Madd+0.061
 Dawn+0.061
Neg logit contributions
lf-0.071
 plate-0.069
 fork-0.068
 forks-0.066
pron-0.066
Token a
Token ID257
Feature activation-2.32e-01

Pos logit contributions
 Tweet+0.029
 Madd+0.028
 Suz+0.027
 Bird+0.027
 Dawn+0.026
Neg logit contributions
 plate-0.023
 forks-0.022
pron-0.022
 slice-0.022
 fork-0.021
Token little
Token ID1310
Feature activation+1.22e+00

Pos logit contributions
pron+0.010
 forks+0.010
 plate+0.010
 slice+0.010
wy+0.009
Neg logit contributions
 Tweet-0.010
 Madd-0.009
 Bird-0.009
 Suz-0.009
 shelter-0.009
Token girl
Token ID2576
Feature activation+9.68e-01

Pos logit contributions
 Tweet+0.051
 Madd+0.049
 Suz+0.046
 Bird+0.045
 Ph+0.044
Neg logit contributions
 plate-0.054
pron-0.054
 forks-0.053
 slice-0.052
 fork-0.051
Token named
Token ID3706
Feature activation+8.92e-01

Pos logit contributions
 Tweet+0.027
 Madd+0.025
 Bird+0.022
 Suz+0.022
 shelter+0.021
Neg logit contributions
pron-0.057
 plate-0.056
 forks-0.056
 slice-0.055
 fork-0.053
Token Lily
Token ID20037
Feature activation+6.79e-01

Pos logit contributions
 Tweet+0.029
 Madd+0.027
 Suz+0.025
 Bird+0.024
 shelter+0.023
Neg logit contributions
pron-0.050
 forks-0.049
 plate-0.048
wy-0.048
 slice-0.047
Token.
Token ID13
Feature activation+5.92e-01

Pos logit contributions
 Tweet+0.026
 Madd+0.023
 Bird+0.023
 Suz+0.023
 shelter+0.022
Neg logit contributions
pron-0.034
 forks-0.033
 plate-0.032
 dice-0.031
wy-0.031
Token She
Token ID1375
Feature activation+1.68e-01

Pos logit contributions
 Tweet+0.015
 Madd+0.014
 Bird+0.013
 Suz+0.012
 shelter+0.012
Neg logit contributions
pron-0.036
 forks-0.035
 plate-0.035
 slice-0.034
 fork-0.034
Token and
Token ID290
Feature activation+1.39e-01

Pos logit contributions
 Tweet+0.044
 Madd+0.042
 Bird+0.040
 Suz+0.039
 shelter+0.039
Neg logit contributions
pron-0.037
 forks-0.036
 plate-0.035
 slice-0.034
wy-0.034
Token put
Token ID1234
Feature activation+1.25e-01

Pos logit contributions
 Tweet+0.006
 Madd+0.006
 Bird+0.006
 Suz+0.006
 Dawn+0.006
Neg logit contributions
 forks-0.005
pron-0.005
 plate-0.005
 slice-0.005
 fork-0.005
Token on
Token ID319
Feature activation-3.51e-01

Pos logit contributions
 Tweet+0.006
 Madd+0.005
 Bird+0.005
 Suz+0.005
 shelter+0.005
Neg logit contributions
pron-0.005
 forks-0.005
 plate-0.005
 slice-0.005
 fork-0.005
Token a
Token ID257
Feature activation-6.75e-02

Pos logit contributions
pron+0.015
 forks+0.015
 plate+0.014
 slice+0.014
 fork+0.014
Neg logit contributions
 Tweet-0.015
 Madd-0.015
 Bird-0.014
 Suz-0.014
 shelter-0.013
Token new
Token ID649
Feature activation+1.75e+00

Pos logit contributions
 forks+0.003
wy+0.003
pron+0.003
 slice+0.003
 plate+0.003
Neg logit contributions
 Tweet-0.003
 Madd-0.003
 shelter-0.003
 Bird-0.002
 Suz-0.002
Token dress
Token ID6576
Feature activation+6.93e-01

Pos logit contributions
 Tweet+0.084
 Madd+0.081
 Bird+0.076
 Suz+0.075
 shelter+0.075
Neg logit contributions
pron-0.066
 forks-0.064
 plate-0.064
 slice-0.062
wy-0.062
Token,"
Token ID553
Feature activation-1.03e-01

Pos logit contributions
 Tweet+0.028
 Madd+0.026
 Bird+0.025
 Suz+0.025
 shelter+0.024
Neg logit contributions
pron-0.031
 forks-0.031
 plate-0.030
 slice-0.030
wy-0.029
Token said
Token ID531
Feature activation-5.87e-01

Pos logit contributions
pron+0.004
ossible+0.003
 plate+0.003
 slice+0.003
lf+0.003
Neg logit contributions
 Madd-0.005
rig-0.005
 dam-0.005
 shelter-0.005
add-0.005
Token Lily
Token ID20037
Feature activation+7.81e-01

Pos logit contributions
pron+0.025
 plate+0.024
 forks+0.024
 slice+0.023
 fork+0.023
Neg logit contributions
rig-0.023
 Madd-0.021
 shelter-0.020
 Tweet-0.020
 rou-0.020
Token's
Token ID338
Feature activation+3.17e-01

Pos logit contributions
 Tweet+0.034
 Madd+0.032
 Bird+0.030
 Suz+0.030
 shelter+0.028
Neg logit contributions
pron-0.034
 forks-0.033
 plate-0.033
 slice-0.032
wy-0.032
Token mom
Token ID1995
Feature activation+1.07e+00

Pos logit contributions
 Tweet+0.016
 Madd+0.016
 Suz+0.015
 Bird+0.015
 shelter+0.015
Neg logit contributions
pron-0.011
 plate-0.011
 forks-0.011
 fork-0.010
 slice-0.010
Token is
Token ID318
Feature activation+1.36e+00

Pos logit contributions
 Madd+0.000
 Tweet+0.000
 Suz+0.000
 Dawn+0.000
 Ph+0.000
Neg logit contributions
 forks-0.000
wy-0.000
 plate-0.000
 fork-0.000
pron-0.000
Token good
Token ID922
Feature activation-2.00e-01

Pos logit contributions
 Tweet+0.052
 Madd+0.047
 Bird+0.045
 Suz+0.043
 shelter+0.043
Neg logit contributions
pron-0.067
 plate-0.065
 forks-0.065
 slice-0.064
 fork-0.062
Token at
Token ID379
Feature activation-1.21e+00

Pos logit contributions
pron+0.008
 forks+0.008
 plate+0.008
 slice+0.008
wy+0.008
Neg logit contributions
 Tweet-0.009
 Madd-0.008
 Bird-0.008
 Suz-0.008
 shelter-0.008
Token making
Token ID1642
Feature activation-4.96e-01

Pos logit contributions
pron+0.049
 forks+0.048
 plate+0.048
 slice+0.046
wy+0.046
Neg logit contributions
 Tweet-0.055
 Madd-0.052
 Bird-0.050
 Suz-0.049
 shelter-0.048
Token sounds
Token ID5238
Feature activation-5.25e-01

Pos logit contributions
 plate+0.020
 forks+0.020
 slice+0.019
 fork+0.018
 dice+0.018
Neg logit contributions
 Madd-0.022
 Tweet-0.022
 Bug-0.020
 Dawn-0.020
 Suz-0.020
Token.
Token ID13
Feature activation-1.67e-01

Pos logit contributions
 forks+0.021
 plate+0.020
pron+0.020
 slice+0.020
 fork+0.019
Neg logit contributions
 Tweet-0.024
 Madd-0.023
 Suz-0.022
 Bird-0.022
 Dawn-0.022
Token
Token ID198
Feature activation-1.60e-01

Pos logit contributions
pron+0.008
 forks+0.008
 plate+0.007
 slice+0.007
 fork+0.007
Neg logit contributions
 Tweet-0.006
 Madd-0.006
 Suz-0.006
 shelter-0.006
 Dawn-0.006
Token
Token ID198
Feature activation-2.95e-01

Pos logit contributions
pron+0.007
 plate+0.007
 forks+0.007
 slice+0.007
ossible+0.007
Neg logit contributions
 Tweet-0.006
 Madd-0.006
 Bird-0.006
 Suz-0.006
 Dawn-0.005
TokenL
Token ID43
Feature activation-8.18e-03

Pos logit contributions
 forks+0.013
 plate+0.013
pron+0.013
 fork+0.013
 slice+0.012
Neg logit contributions
 Tweet-0.013
 Madd-0.011
 shelter-0.011
 Bird-0.011
 Bug-0.010
Tokenily
Token ID813
Feature activation+3.79e-01

Pos logit contributions
pron+0.000
 plate+0.000
 forks+0.000
 slice+0.000
wy+0.000
Neg logit contributions
 Tweet-0.000
 Madd-0.000
 Bird-0.000
 Suz-0.000
 shelter-0.000
Token opens
Token ID9808
Feature activation-3.29e-01

Pos logit contributions
 Tweet+0.017
 Madd+0.017
 Suz+0.016
 Bird+0.015
 Dawn+0.015
Neg logit contributions
 forks-0.016
 plate-0.016
pron-0.015
 slice-0.015
 fork-0.015
Token funny
Token ID8258
Feature activation+2.58e-01

Pos logit contributions
 forks+0.015
 plate+0.015
 slice+0.014
 fork+0.014
 plates+0.013
Neg logit contributions
 Tweet-0.019
 Bug-0.019
 Madd-0.019
 Dawn-0.018
 Kerry-0.018
Token noises
Token ID26782
Feature activation+9.49e-01

Pos logit contributions
 Tweet+0.011
 Madd+0.011
 Bug+0.010
 Bird+0.010
 Suz+0.010
Neg logit contributions
 forks-0.011
 plate-0.011
 slice-0.010
pron-0.010
 fork-0.010
Token.
Token ID13
Feature activation+3.47e-01

Pos logit contributions
 Tweet+0.041
 Madd+0.038
 Bird+0.036
 Suz+0.035
 shelter+0.035
Neg logit contributions
pron-0.041
 forks-0.040
 plate-0.040
 slice-0.039
wy-0.038
Token L
Token ID406
Feature activation+1.25e-01

Pos logit contributions
 Tweet+0.014
 Madd+0.014
 Suz+0.013
 shelter+0.013
 Bird+0.013
Neg logit contributions
 forks-0.015
 plate-0.015
pron-0.015
 fork-0.014
 slice-0.014
Tokenila
Token ID10102
Feature activation+3.09e-02

Pos logit contributions
 Tweet+0.006
 Madd+0.005
 Bird+0.005
 shelter+0.005
 Suz+0.005
Neg logit contributions
pron-0.005
 plate-0.005
 forks-0.005
 slice-0.005
wy-0.005
Token and
Token ID290
Feature activation-6.22e-02

Pos logit contributions
 Tweet+0.001
 Madd+0.001
 Suz+0.001
 Bird+0.001
 Dawn+0.001
Neg logit contributions
pron-0.001
 plate-0.001
 forks-0.001
 slice-0.001
 fork-0.001
Token Ben
Token ID3932
Feature activation+3.02e-01

Pos logit contributions
pron+0.003
lf+0.003
 slice+0.003
wy+0.003
ater+0.003
Neg logit contributions
 Tweet-0.003
 Bird-0.002
 Bunny-0.002
 Madd-0.002
 Dog-0.002
Token cl
Token ID537
Feature activation-1.76e-02

Pos logit contributions
 Tweet+0.012
 Madd+0.012
 Bird+0.011
 shelter+0.011
 Suz+0.011
Neg logit contributions
pron-0.014
 forks-0.013
 plate-0.013
 slice-0.013
wy-0.013
Tokenapped
Token ID6320
Feature activation+8.68e-01

Pos logit contributions
pron+0.001
 forks+0.001
 plate+0.001
 slice+0.001
 fork+0.001
Neg logit contributions
 Tweet-0.001
 Madd-0.001
 Suz-0.001
 Bird-0.001
 shelter-0.001
Token and
Token ID290
Feature activation+4.48e-01

Pos logit contributions
 Tweet+0.033
 Madd+0.030
 Bird+0.028
 Suz+0.028
 shelter+0.028
Neg logit contributions
pron-0.043
 forks-0.042
 plate-0.041
 slice-0.040
wy-0.040
Token laughed
Token ID13818
Feature activation-6.54e-01

Pos logit contributions
 Tweet+0.022
 Madd+0.021
 Suz+0.020
 Bird+0.020
 Bug+0.020
Neg logit contributions
 forks-0.016
pron-0.016
 plate-0.016
 slice-0.015
 fork-0.015
Token curious
Token ID11040
Feature activation-7.78e-01

Pos logit contributions
pron+0.033
wy+0.032
 forks+0.031
 plate+0.031
 slice+0.030
Neg logit contributions
 Tweet-0.029
 Madd-0.028
 shelter-0.026
 Suz-0.026
 Bird-0.026
Token eyes
Token ID2951
Feature activation-3.64e-01

Pos logit contributions
 forks+0.029
 plate+0.029
pron+0.028
 slice+0.027
 fork+0.027
Neg logit contributions
 Tweet-0.039
 Madd-0.037
 Bird-0.035
 Suz-0.034
 Dawn-0.034
Token and
Token ID290
Feature activation-2.95e-01

Pos logit contributions
pron+0.015
 plate+0.015
 forks+0.015
 slice+0.014
 fork+0.014
Neg logit contributions
 Tweet-0.016
 Madd-0.016
 Suz-0.014
 Bird-0.014
 shelter-0.014
Token finally
Token ID3443
Feature activation+8.54e-01

Pos logit contributions
pron+0.012
 forks+0.012
 plate+0.012
 slice+0.011
 fork+0.011
Neg logit contributions
 Tweet-0.013
 Madd-0.013
 Bird-0.012
 Suz-0.012
 shelter-0.012
Token decided
Token ID3066
Feature activation-7.32e-01

Pos logit contributions
 Tweet+0.035
 Madd+0.033
 Suz+0.031
 Bird+0.031
 shelter+0.030
Neg logit contributions
pron-0.042
 plate-0.039
 forks-0.039
wy-0.038
 slice-0.037
Token he
Token ID339
Feature activation-8.60e-01

Pos logit contributions
pron+0.031
 forks+0.031
 plate+0.030
 slice+0.030
 fork+0.029
Neg logit contributions
 Tweet-0.032
 Madd-0.030
 Bird-0.028
 Suz-0.028
 shelter-0.027
Token would
Token ID561
Feature activation+1.06e+00

Pos logit contributions
pron+0.033
 plate+0.032
 forks+0.032
 slice+0.032
 fork+0.031
Neg logit contributions
 Tweet-0.040
 Madd-0.039
 Suz-0.036
 shelter-0.035
 Bird-0.035
Token try
Token ID1949
Feature activation+6.80e-01

Pos logit contributions
 Tweet+0.049
 Madd+0.047
 Suz+0.044
 Bird+0.043
 Dawn+0.043
Neg logit contributions
pron-0.041
 plate-0.041
 slice-0.040
 forks-0.040
 fork-0.039
Token the
Token ID262
Feature activation-1.27e-01

Pos logit contributions
 Tweet+0.030
 Madd+0.029
 Suz+0.027
 Bird+0.027
 Bug+0.026
Neg logit contributions
pron-0.029
 plate-0.028
 forks-0.028
 slice-0.027
 fork-0.027
Token caul
Token ID46087
Feature activation-1.86e+00

Pos logit contributions
 plate+0.005
 fork+0.005
pron+0.005
 forks+0.005
 slice+0.005
Neg logit contributions
 Tweet-0.006
 Madd-0.006
 Suz-0.006
 Beth-0.005
 Dawn-0.005
Tokeniflower
Token ID42642
Feature activation-1.84e+00

Pos logit contributions
 forks+0.037
 plate+0.036
pron+0.035
 slice+0.033
 tripod+0.033
Neg logit contributions
 Tweet-0.120
 Madd-0.116
 Bird-0.112
 Bug-0.111
 shelter-0.109
Token to
Token ID284
Feature activation+7.27e-01

Pos logit contributions
 Tweet+0.005
 Madd+0.005
 shelter+0.005
 Bird+0.005
 Suz+0.004
Neg logit contributions
pron-0.008
 plate-0.007
wy-0.007
 forks-0.007
lf-0.007
Token go
Token ID467
Feature activation+1.45e+00

Pos logit contributions
 Tweet+0.034
 Madd+0.033
 Suz+0.031
 Bird+0.031
 Bug+0.029
Neg logit contributions
pron-0.028
 plate-0.028
 forks-0.028
 slice-0.027
 fork-0.026
Token home
Token ID1363
Feature activation-1.55e+00

Pos logit contributions
 Tweet+0.067
 Madd+0.064
 Bird+0.060
 Suz+0.060
 shelter+0.058
Neg logit contributions
pron-0.058
 forks-0.057
 plate-0.057
 slice-0.055
 fork-0.053
Token.
Token ID13
Feature activation-1.01e-01

Pos logit contributions
pron+0.081
 forks+0.077
 slice+0.076
wy+0.076
 plate+0.076
Neg logit contributions
 Tweet-0.052
 Madd-0.048
 shelter-0.047
 Bird-0.046
 Suz-0.045
Token Now
Token ID2735
Feature activation+2.86e-01

Pos logit contributions
 forks+0.004
 plate+0.004
pron+0.004
 slice+0.004
ossible+0.004
Neg logit contributions
 Tweet-0.005
 Madd-0.005
 Suz-0.004
 shelter-0.004
 Dawn-0.004
Token!"
Token ID2474
Feature activation-4.40e-01

Pos logit contributions
 Tweet+0.012
 Madd+0.011
 Bird+0.011
 shelter+0.011
 Suz+0.011
Neg logit contributions
pron-0.013
 forks-0.012
 plate-0.012
wy-0.012
 slice-0.012
Token
Token ID198
Feature activation+1.98e-01

Pos logit contributions
 plate+0.017
ossible+0.017
 forks+0.016
 Adam+0.016
pron+0.016
Neg logit contributions
 Tweet-0.019
 shelter-0.018
 Madd-0.018
 dam-0.018
add-0.018
Token
Token ID198
Feature activation-1.70e-02

Pos logit contributions
 Tweet+0.009
 Madd+0.008
 Suz+0.008
 shelter+0.008
 Bird+0.008
Neg logit contributions
pron-0.008
 plate-0.008
ossible-0.008
 forks-0.007
 slice-0.007
TokenL
Token ID43
Feature activation+3.34e-01

Pos logit contributions
pron+0.001
 forks+0.001
 plate+0.001
 slice+0.001
 fork+0.001
Neg logit contributions
 Tweet-0.001
 Madd-0.001
 Bird-0.001
 Suz-0.001
 shelter-0.001
Tokenily
Token ID813
Feature activation+7.18e-01

Pos logit contributions
 Tweet+0.014
 Madd+0.013
 Suz+0.012
 Bird+0.012
 shelter+0.012
Neg logit contributions
pron-0.015
 plate-0.015
 forks-0.015
 slice-0.015
 fork-0.014
Token and
Token ID290
Feature activation+1.78e-01

Pos logit contributions
 Tweet+0.032
 Madd+0.031
 Suz+0.029
 Bird+0.028
 Dawn+0.028
Neg logit contributions
 plate-0.031
 forks-0.031
pron-0.029
 fork-0.029
 slice-0.029
Token mom
Token ID1995
Feature activation+3.53e-01

Pos logit contributions
 Tweet+0.039
 Madd+0.037
 Ph+0.036
 Bug+0.036
 Suz+0.035
Neg logit contributions
 plate-0.018
 fork-0.017
 forks-0.017
pron-0.017
 slice-0.016
Token heard
Token ID2982
Feature activation+5.05e-01

Pos logit contributions
 Tweet+0.018
 Madd+0.017
 Fif+0.016
 Dawn+0.016
 shelter+0.016
Neg logit contributions
pron-0.012
 plate-0.012
 slice-0.012
 forks-0.012
lf-0.011
Token her
Token ID607
Feature activation-6.04e-01

Pos logit contributions
 Tweet+0.026
 Madd+0.026
 dam+0.025
 Dawn+0.025
 rou+0.025
Neg logit contributions
 plate-0.015
 forks-0.015
 slice-0.014
pron-0.013
 apr-0.013
Token and
Token ID290
Feature activation-7.25e-01

Pos logit contributions
 plate+0.023
 slice+0.021
 forks+0.021
 fork+0.020
ossible+0.019
Neg logit contributions
 Tweet-0.031
 Madd-0.029
 Bug-0.028
 Dawn-0.027
 Fif-0.027
Token asked
Token ID1965
Feature activation+1.27e-01

Pos logit contributions
 forks+0.026
 plate+0.025
pron+0.025
 slice+0.025
lf+0.024
Neg logit contributions
 Tweet-0.034
 Madd-0.032
 shelter-0.031
 Fif-0.030
rig-0.030
Token,
Token ID11
Feature activation-6.71e-01

Pos logit contributions
 Tweet+0.006
 Madd+0.006
rig+0.006
 rou+0.006
 Bug+0.006
Neg logit contributions
 forks-0.004
 plate-0.004
pron-0.004
 slice-0.004
 fork-0.004
Token "
Token ID366
Feature activation-1.13e-01

Pos logit contributions
pron+0.024
 plate+0.023
 forks+0.023
 slice+0.022
 fork+0.021
Neg logit contributions
 Tweet-0.033
 Madd-0.032
 shelter-0.030
 Suz-0.030
 Bird-0.030
TokenWhat
Token ID2061
Feature activation+1.36e+00

Pos logit contributions
pron+0.005
 forks+0.005
 plate+0.005
 slice+0.005
wy+0.005
Neg logit contributions
 Tweet-0.005
 Madd-0.005
 Bird-0.004
 Suz-0.004
 shelter-0.004
Token's
Token ID338
Feature activation-2.86e-01

Pos logit contributions
 Tweet+0.066
 Madd+0.065
 Dawn+0.061
 Suz+0.059
 Kerry+0.059
Neg logit contributions
 forks-0.052
 plate-0.052
pron-0.050
 slice-0.050
 fork-0.050
Token wrong
Token ID2642
Feature activation+1.25e+00

Pos logit contributions
pron+0.013
 forks+0.013
 plate+0.013
 slice+0.012
wy+0.012
Neg logit contributions
 Tweet-0.012
 Madd-0.011
 Suz-0.010
 Bird-0.010
 shelter-0.010
Token,
Token ID11
Feature activation-9.02e-01

Pos logit contributions
 Tweet+0.064
 Madd+0.060
add+0.058
 Bird+0.057
 Bug+0.056
Neg logit contributions
 plate-0.043
pron-0.041
 forks-0.040
 fork-0.039
 slice-0.038
Token
Token ID198
Feature activation-2.54e-01

Pos logit contributions
pron+0.005
 plate+0.005
ossible+0.005
 slice+0.004
 forks+0.004
Neg logit contributions
 Tweet-0.005
 Madd-0.004
 Suz-0.004
 shelter-0.004
 Bird-0.004
TokenBut
Token ID1537
Feature activation-4.69e-01

Pos logit contributions
pron+0.010
 plate+0.009
 forks+0.009
 slice+0.009
lf+0.009
Neg logit contributions
 Tweet-0.012
 Madd-0.012
 Suz-0.011
 Bird-0.011
 shelter-0.011
Token the
Token ID262
Feature activation-6.81e-01

Pos logit contributions
pron+0.020
 plate+0.019
 forks+0.019
 slice+0.019
 fork+0.018
Neg logit contributions
 Tweet-0.020
 Madd-0.019
 Suz-0.018
 Bird-0.018
 shelter-0.018
Token sheep
Token ID15900
Feature activation-2.37e-02

Pos logit contributions
pron+0.029
 plate+0.029
 forks+0.027
 fork+0.027
 slice+0.026
Neg logit contributions
 Tweet-0.030
 Madd-0.028
 Suz-0.028
 Fif-0.026
 Kel-0.025
Token just
Token ID655
Feature activation-9.12e-01

Pos logit contributions
 plate+0.001
pron+0.001
 forks+0.001
 fork+0.001
wy+0.001
Neg logit contributions
 Tweet-0.001
 Madd-0.001
 Suz-0.001
 Dawn-0.001
 Bird-0.001
Token kept
Token ID4030
Feature activation-4.22e-01

Pos logit contributions
pron+0.035
 forks+0.035
 plate+0.034
 slice+0.033
wy+0.033
Neg logit contributions
 Tweet-0.041
 Madd-0.039
 shelter-0.037
 Suz-0.036
 Dawn-0.036
Token sleeping
Token ID11029
Feature activation+4.37e-01

Pos logit contributions
pron+0.015
 plate+0.015
 forks+0.015
 slice+0.014
 fork+0.014
Neg logit contributions
 Tweet-0.019
 Madd-0.019
 Suz-0.018
 Bees-0.018
 Dawn-0.017
Token and
Token ID290
Feature activation-6.71e-01

Pos logit contributions
 Tweet+0.020
 Madd+0.019
 Suz+0.018
 Bird+0.017
 Dawn+0.017
Neg logit contributions
pron-0.018
 plate-0.018
 forks-0.018
 slice-0.017
 fork-0.016
Token didn
Token ID1422
Feature activation-8.88e-02

Pos logit contributions
pron+0.026
 forks+0.026
 plate+0.026
 slice+0.025
 fork+0.024
Neg logit contributions
 Tweet-0.028
 Madd-0.027
rig-0.025
 Dawn-0.025
 Suz-0.025
Token't
Token ID470
Feature activation-6.62e-01

Pos logit contributions
pron+0.002
 plate+0.002
 slice+0.002
ossible+0.002
lf+0.002
Neg logit contributions
 Tweet-0.005
 Madd-0.005
 Suz-0.005
 Bird-0.005
 shelter-0.005
Token wake
Token ID7765
Feature activation+2.58e+00

Pos logit contributions
 plate+0.025
pron+0.025
 slice+0.024
 forks+0.024
 fork+0.023
Neg logit contributions
 Tweet-0.030
 Madd-0.029
 Suz-0.027
 Bird-0.027
 Dawn-0.026
Token smile
Token ID8212
Feature activation-1.32e+00

Pos logit contributions
pron+0.033
 plate+0.032
 slice+0.031
 forks+0.031
 fork+0.030
Neg logit contributions
 Tweet-0.037
 Madd-0.035
 Suz-0.033
 Bird-0.033
 Dawn-0.032
Token on
Token ID319
Feature activation+1.94e-01

Pos logit contributions
 forks+0.067
 apr+0.062
wy+0.062
 dice+0.062
pron+0.062
Neg logit contributions
 Tweet-0.048
 Madd-0.043
 Bird-0.042
 Suz-0.041
 shelter-0.040
Token her
Token ID607
Feature activation-1.79e-01

Pos logit contributions
 Tweet+0.008
 Madd+0.007
 Bird+0.007
 Suz+0.007
 Fif+0.006
Neg logit contributions
pron-0.009
 forks-0.009
wy-0.009
 plate-0.009
ucer-0.009
Token face
Token ID1986
Feature activation+3.99e-01

Pos logit contributions
pron+0.006
wy+0.006
 forks+0.006
 slice+0.006
 dice+0.006
Neg logit contributions
 Tweet-0.009
 Madd-0.008
 Bird-0.008
 Suz-0.008
 shelter-0.008
Token,
Token ID11
Feature activation+4.39e-01

Pos logit contributions
 Tweet+0.018
 Madd+0.017
 Bird+0.016
 Suz+0.016
 shelter+0.016
Neg logit contributions
pron-0.016
 plate-0.016
 forks-0.016
 slice-0.015
 fork-0.015
Token dreaming
Token ID28368
Feature activation-8.08e-01

Pos logit contributions
 Tweet+0.019
 Madd+0.019
 shelter+0.017
 Suz+0.017
 Bird+0.017
Neg logit contributions
pron-0.019
 plate-0.018
 forks-0.017
 slice-0.017
 fork-0.017
Token about
Token ID546
Feature activation-4.94e-01

Pos logit contributions
pron+0.042
 plate+0.041
 forks+0.041
 slice+0.039
 fork+0.039
Neg logit contributions
 Tweet-0.028
 Madd-0.025
 Bird-0.024
 Suz-0.023
 Dawn-0.023
Token all
Token ID477
Feature activation-2.31e+00

Pos logit contributions
 forks+0.017
 plate+0.017
pron+0.016
 fork+0.016
 slice+0.015
Neg logit contributions
 Tweet-0.024
 Madd-0.023
add-0.022
 Suz-0.022
 Kerry-0.022
Token the
Token ID262
Feature activation+4.31e-01

Pos logit contributions
 plate+0.072
 forks+0.072
pron+0.070
 fork+0.066
 slice+0.066
Neg logit contributions
 Tweet-0.122
 Madd-0.115
add-0.111
 Bird-0.111
 Dawn-0.108
Token cards
Token ID4116
Feature activation-8.23e-01

Pos logit contributions
 Tweet+0.021
 Madd+0.019
 Suz+0.019
 Kerry+0.018
 Ph+0.018
Neg logit contributions
 forks-0.017
 plate-0.017
 fork-0.017
pron-0.017
 slice-0.015
Token she
Token ID673
Feature activation+3.13e-02

Pos logit contributions
pron+0.037
 forks+0.036
 plate+0.036
 slice+0.035
 fork+0.034
Neg logit contributions
 Tweet-0.034
 Madd-0.032
 Bird-0.030
 Suz-0.030
 Bug-0.029
Token and
Token ID290
Feature activation-3.43e-01

Pos logit contributions
 plate+0.047
 forks+0.045
pron+0.044
 fork+0.043
lf+0.043
Neg logit contributions
 Tweet-0.065
 Madd-0.064
 Dawn-0.059
 Suz-0.059
 Fif-0.058
Token said
Token ID531
Feature activation-8.90e-01

Pos logit contributions
pron+0.013
 forks+0.013
 plate+0.013
 slice+0.012
lf+0.012
Neg logit contributions
 Tweet-0.016
 Madd-0.015
 shelter-0.015
 Bird-0.014
 Bug-0.014
Token,
Token ID11
Feature activation-2.70e-01

Pos logit contributions
pron+0.033
 plate+0.033
 forks+0.032
 slice+0.032
 fork+0.030
Neg logit contributions
 Tweet-0.042
 Madd-0.040
 Bird-0.038
 Suz-0.038
 shelter-0.038
Token "
Token ID366
Feature activation+3.70e-01

Pos logit contributions
pron+0.010
 plate+0.009
 forks+0.009
 slice+0.009
 fork+0.009
Neg logit contributions
 Tweet-0.013
 Madd-0.013
 shelter-0.012
 Suz-0.012
 Bird-0.012
TokenS
Token ID50
Feature activation-7.29e-02

Pos logit contributions
 Tweet+0.015
 Madd+0.014
 Dawn+0.013
 Suz+0.013
 Bird+0.013
Neg logit contributions
 forks-0.017
pron-0.017
 plate-0.017
 slice-0.016
 fork-0.016
Tokenoph
Token ID2522
Feature activation-3.04e+00

Pos logit contributions
pron+0.003
 forks+0.003
 plate+0.003
 fork+0.002
 slice+0.002
Neg logit contributions
 Tweet-0.004
 Madd-0.003
 Dawn-0.003
 Suz-0.003
 Ph-0.003
Tokenie
Token ID494
Feature activation+8.16e-01

Pos logit contributions
pron+0.162
 forks+0.158
 plate+0.158
 dice+0.154
 slice+0.154
Neg logit contributions
 Tweet-0.100
 Madd-0.095
 Suz-0.087
 Bird-0.085
 shelter-0.083
Token,
Token ID11
Feature activation-8.78e-01

Pos logit contributions
 Tweet+0.037
 Madd+0.035
 Bird+0.033
 Suz+0.033
 shelter+0.032
Neg logit contributions
pron-0.033
 plate-0.032
 forks-0.032
 slice-0.031
 fork-0.030
Token no
Token ID645
Feature activation-5.63e-01

Pos logit contributions
pron+0.033
 forks+0.033
 plate+0.033
 slice+0.032
 fork+0.031
Neg logit contributions
 Tweet-0.043
 Madd-0.040
 Bird-0.038
 Suz-0.038
 Dawn-0.037
Token matter
Token ID2300
Feature activation-3.99e-01

Pos logit contributions
pron+0.023
 forks+0.022
 plate+0.022
 slice+0.021
wy+0.021
Neg logit contributions
 Tweet-0.027
 Madd-0.025
 Suz-0.024
 Bird-0.024
 shelter-0.024
Token how
Token ID703
Feature activation-6.55e-01

Pos logit contributions
pron+0.017
 forks+0.017
 plate+0.017
 slice+0.016
 fork+0.016
Neg logit contributions
 Tweet-0.017
 Madd-0.016
 Bird-0.015
 Suz-0.015
 shelter-0.015
Token have
Token ID423
Feature activation-4.71e-01

Pos logit contributions
 plate+0.025
pron+0.025
 slice+0.024
 forks+0.024
wy+0.023
Neg logit contributions
 Tweet-0.032
 Madd-0.031
 Suz-0.029
 Bird-0.028
 Dawn-0.028
Token a
Token ID257
Feature activation+8.05e-02

Pos logit contributions
pron+0.021
 forks+0.021
 plate+0.021
 slice+0.020
 fork+0.020
Neg logit contributions
 Tweet-0.019
 Madd-0.018
 Bird-0.017
 Suz-0.017
 shelter-0.017
Token tum
Token ID11814
Feature activation-2.57e-01

Pos logit contributions
 Tweet+0.003
 Madd+0.003
 Suz+0.003
 Bird+0.003
 shelter+0.003
Neg logit contributions
pron-0.004
 plate-0.003
 forks-0.003
 slice-0.003
 fork-0.003
Tokenmy
Token ID1820
Feature activation-7.97e-01

Pos logit contributions
pron+0.011
 forks+0.010
 plate+0.010
 slice+0.010
 fork+0.010
Neg logit contributions
 Tweet-0.012
 Madd-0.011
 Bird-0.010
 Suz-0.010
 shelter-0.010
Token ac
Token ID936
Feature activation-3.10e-01

Pos logit contributions
pron+0.035
 plate+0.035
 forks+0.034
 slice+0.033
wy+0.032
Neg logit contributions
 Tweet-0.034
 Madd-0.033
 Suz-0.031
 Bird-0.030
 Dawn-0.030
Tokenhe
Token ID258
Feature activation-3.53e+00

Pos logit contributions
 forks+0.009
pron+0.008
 plate+0.008
 slice+0.008
 fork+0.008
Neg logit contributions
 Tweet-0.018
 Madd-0.017
 Bird-0.016
 Bug-0.016
 shelter-0.016
Token.
Token ID13
Feature activation-1.90e-01

Pos logit contributions
 forks+0.153
pron+0.151
 slice+0.148
wy+0.147
 dice+0.146
Neg logit contributions
 Tweet-0.143
 Madd-0.128
 Bird-0.127
 shelter-0.124
 Suz-0.122
Token You
Token ID921
Feature activation-2.05e-01

Pos logit contributions
 forks+0.006
 plate+0.006
pron+0.006
 fork+0.006
ossible+0.005
Neg logit contributions
 Tweet-0.009
 Madd-0.009
 shelter-0.009
rig-0.009
 Dawn-0.009
Token just
Token ID655
Feature activation-6.06e-02

Pos logit contributions
 forks+0.007
 plate+0.007
wy+0.007
pron+0.007
 fork+0.007
Neg logit contributions
 Madd-0.010
 Tweet-0.010
 Suz-0.009
 Dawn-0.009
 Ph-0.009
Token don
Token ID836
Feature activation+7.17e-01

Pos logit contributions
 forks+0.002
 plate+0.002
pron+0.002
 slice+0.002
wy+0.002
Neg logit contributions
 Tweet-0.003
 Madd-0.003
 Suz-0.002
 Dawn-0.002
 Bird-0.002
Token't
Token ID470
Feature activation-3.39e-01

Pos logit contributions
 Tweet+0.032
 Madd+0.030
 Bird+0.028
 Suz+0.028
 shelter+0.027
Neg logit contributions
pron-0.030
 forks-0.030
 plate-0.029
 slice-0.029
 fork-0.028
Token on
Token ID319
Feature activation-1.37e+00

Pos logit contributions
 Tweet+0.001
 Madd+0.001
 Bird+0.001
 Suz+0.001
 Fif+0.001
Neg logit contributions
pron-0.000
 plate-0.000
 forks-0.000
 slice-0.000
wy-0.000
Token.
Token ID13
Feature activation-2.90e-01

Pos logit contributions
pron+0.055
 plate+0.054
 forks+0.054
 slice+0.052
 fork+0.051
Neg logit contributions
 Tweet-0.063
 Madd-0.060
 Bird-0.056
 Suz-0.056
 shelter-0.055
Token
Token ID198
Feature activation-1.21e-01

Pos logit contributions
 plate+0.013
pron+0.013
 forks+0.012
 slice+0.012
 Adam+0.012
Neg logit contributions
 Tweet-0.012
 Madd-0.011
 shelter-0.010
 Suz-0.010
 Bird-0.010
Token
Token ID198
Feature activation-3.13e-01

Pos logit contributions
pron+0.006
 plate+0.006
 forks+0.006
 slice+0.006
 fork+0.006
Neg logit contributions
 Tweet-0.004
 Madd-0.004
 Bird-0.004
 Suz-0.004
 shelter-0.004
TokenMrs
Token ID27034
Feature activation-2.24e+00

Pos logit contributions
 forks+0.013
 plate+0.012
 fork+0.012
wy+0.011
 slice+0.011
Neg logit contributions
 Tweet-0.014
 Madd-0.014
 Bird-0.013
 Beth-0.013
 shelter-0.013
Token Smith
Token ID4176
Feature activation-3.01e+00

Pos logit contributions
 forks+0.081
 plate+0.081
pron+0.078
 slice+0.076
 fork+0.075
Neg logit contributions
 Tweet-0.110
 Madd-0.108
 Suz-0.101
 shelter-0.098
 Dawn-0.098
Token opened
Token ID4721
Feature activation+2.43e-01

Pos logit contributions
 plate+0.119
pron+0.118
 slice+0.117
 forks+0.116
lf+0.113
Neg logit contributions
 Tweet-0.140
 Madd-0.135
 Dawn-0.124
 Suz-0.123
 shelter-0.122
Token the
Token ID262
Feature activation+3.86e-01

Pos logit contributions
 Tweet+0.009
 Madd+0.009
 Suz+0.008
 Bird+0.008
 Beth+0.008
Neg logit contributions
pron-0.012
wy-0.012
 forks-0.011
 slice-0.011
ater-0.011
Token door
Token ID3420
Feature activation+2.00e-01

Pos logit contributions
 Tweet+0.019
 Madd+0.018
 Bird+0.017
 Suz+0.017
 shelter+0.017
Neg logit contributions
pron-0.014
 forks-0.014
 plate-0.014
wy-0.014
 slice-0.013
Token and
Token ID290
Feature activation-5.74e-01

Pos logit contributions
 Tweet+0.008
 Madd+0.008
 Bird+0.007
 Bug+0.007
 Suz+0.007
Neg logit contributions
pron-0.009
 plate-0.009
 forks-0.009
 slice-0.009
 fork-0.008
Token said
Token ID531
Feature activation-6.93e-01

Pos logit contributions
pron+0.026
wy+0.026
 plate+0.025
ater+0.025
 forks+0.025
Neg logit contributions
 Tweet-0.024
 Madd-0.022
 Suz-0.022
 Bird-0.021
 Beth-0.021
TokenMom
Token ID29252
Feature activation-6.88e-01

Pos logit contributions
 forks+0.005
wy+0.005
 plate+0.005
pron+0.005
 fork+0.005
Neg logit contributions
 Tweet-0.004
 Madd-0.004
 Bird-0.003
 Dawn-0.003
 shelter-0.003
Tokenmy
Token ID1820
Feature activation+3.60e-02

Pos logit contributions
pron+0.032
 forks+0.031
 plate+0.031
wy+0.030
 dice+0.030
Neg logit contributions
 Tweet-0.027
 Madd-0.025
 Suz-0.024
 Bird-0.024
 Beth-0.023
Token said
Token ID531
Feature activation-1.15e+00

Pos logit contributions
 Tweet+0.002
 Madd+0.002
 shelter+0.002
 Suz+0.001
 Dawn+0.001
Neg logit contributions
pron-0.001
 plate-0.001
 forks-0.001
 slice-0.001
 fork-0.001
Token "
Token ID366
Feature activation+4.91e-01

Pos logit contributions
pron+0.048
 forks+0.048
 plate+0.047
 slice+0.046
wy+0.046
Neg logit contributions
 Tweet-0.051
 Madd-0.048
 Bird-0.045
 Suz-0.045
 shelter-0.044
TokenG
Token ID38
Feature activation-2.54e+00

Pos logit contributions
 Tweet+0.025
 shelter+0.024
 Madd+0.023
 dam+0.023
 Bird+0.023
Neg logit contributions
pron-0.018
ater-0.016
Adam-0.015
 plate-0.014
wy-0.014
Tokenabby
Token ID42457
Feature activation-3.87e+00

Pos logit contributions
 plate+0.079
pron+0.078
 forks+0.077
 slice+0.074
 fork+0.073
Neg logit contributions
 Tweet-0.140
 Madd-0.134
 Suz-0.129
 Bird-0.127
 Dawn-0.125
Token,
Token ID11
Feature activation-1.15e+00

Pos logit contributions
pron+0.160
 forks+0.158
 plate+0.157
 slice+0.152
 fork+0.148
Neg logit contributions
 Tweet-0.172
 Madd-0.164
 Suz-0.154
 Bird-0.153
 shelter-0.149
Token don
Token ID836
Feature activation-4.13e-01

Pos logit contributions
 forks+0.037
 plate+0.036
pron+0.035
 fork+0.033
 slice+0.032
Neg logit contributions
 Tweet-0.059
 Madd-0.056
add-0.054
 dam-0.054
rig-0.054
Token't
Token ID470
Feature activation-1.47e+00

Pos logit contributions
pron+0.016
 forks+0.016
 plate+0.016
 slice+0.015
wy+0.015
Neg logit contributions
 Tweet-0.019
 Madd-0.018
 Bird-0.017
 Suz-0.017
 shelter-0.017
Token be
Token ID307
Feature activation-3.03e-01

Pos logit contributions
 plate+0.067
 slice+0.065
 forks+0.065
pron+0.065
 fork+0.063
Neg logit contributions
 Tweet-0.056
 Madd-0.054
 Suz-0.050
 Bird-0.049
 shelter-0.049
Token afraid
Token ID7787
Feature activation+7.71e-01

Pos logit contributions
 plate+0.016
pron+0.016
 forks+0.016
 slice+0.016
 fork+0.016
Neg logit contributions
 Tweet-0.009
 Madd-0.009
 Suz-0.008
 Bird-0.008
 shelter-0.007
Token.
Token ID13
Feature activation-5.39e-01

Pos logit contributions
pron+0.007
 plate+0.007
 forks+0.007
 slice+0.006
 fork+0.006
Neg logit contributions
 Tweet-0.006
 Madd-0.006
 Suz-0.006
 Bird-0.006
 shelter-0.006
Token
Token ID198
Feature activation-3.67e-01

Pos logit contributions
pron+0.022
 plate+0.022
 forks+0.021
 Adam+0.020
 slice+0.020
Neg logit contributions
 shelter-0.021
 Tweet-0.020
 Madd-0.020
 ost-0.020
 Bird-0.019
Token
Token ID198
Feature activation-4.91e-01

Pos logit contributions
pron+0.017
 plate+0.016
 forks+0.016
 slice+0.016
ossible+0.016
Neg logit contributions
 Tweet-0.014
 Madd-0.014
 Bird-0.013
 shelter-0.013
 Suz-0.012
TokenThe
Token ID464
Feature activation+9.11e-01

Pos logit contributions
 forks+0.016
 plate+0.016
pron+0.015
 fork+0.015
 slice+0.014
Neg logit contributions
 Tweet-0.027
 Madd-0.026
 Suz-0.025
 Bird-0.025
 Beth-0.024
Token books
Token ID3835
Feature activation-2.20e+00

Pos logit contributions
 Tweet+0.041
 Madd+0.038
 Bird+0.037
 shelter+0.037
 Suz+0.036
Neg logit contributions
pron-0.036
wy-0.035
 forks-0.034
 slice-0.034
 plate-0.034
Tokenhe
Token ID258
Feature activation-3.27e+00

Pos logit contributions
pron+0.091
 forks+0.091
 plate+0.089
wy+0.086
 slice+0.086
Neg logit contributions
 Tweet-0.098
 Madd-0.091
 Suz-0.086
 Bird-0.086
 shelter-0.085
Tokenlf
Token ID1652
Feature activation-1.21e+00

Pos logit contributions
wy+0.045
 forks+0.041
pron+0.040
 slice+0.039
ossible+0.038
Neg logit contributions
 Tweet-0.227
 Madd-0.220
 Suz-0.215
 shelter-0.214
 Bird-0.213
Token came
Token ID1625
Feature activation+9.34e-01

Pos logit contributions
pron+0.052
 plate+0.052
 forks+0.052
lf+0.050
 slice+0.050
Neg logit contributions
 Tweet-0.052
 Madd-0.049
 Bird-0.046
 Fif-0.046
 Dawn-0.045
Token crashing
Token ID21899
Feature activation-6.73e-01

Pos logit contributions
 Tweet+0.045
 Madd+0.044
 Dawn+0.041
 Fif+0.041
 Suz+0.040
Neg logit contributions
pron-0.033
 plate-0.033
 forks-0.033
 slice-0.031
 fork-0.030
Token down
Token ID866
Feature activation-2.10e+00

Pos logit contributions
pron+0.027
 forks+0.026
 plate+0.026
 slice+0.025
wy+0.024
Neg logit contributions
 Tweet-0.031
 Madd-0.030
 Bird-0.028
 Suz-0.028
 shelter-0.027
Token on
Token ID319
Feature activation-9.86e-01

Pos logit contributions
 plate+0.073
 forks+0.073
pron+0.070
 slice+0.069
 dice+0.068
Neg logit contributions
 Tweet-0.102
 Madd-0.100
 Dawn-0.093
 Bird-0.092
 Fif-0.092