TOP ACTIVATIONS
MAX = 0.183

Everyone smiled and agreed . In the game
so they asked Bob . One day ,
wouldn 't allow it . Sam my was
â She asked . To her surprise
of a new idea . He decided to
with his friends instead . So Peter and
wouldn 't let him . " Mom ,
important ! â . They took the
safe behind Jack â s shield . <|endoftext|> One
â she said . But Peter wasn
â she said . Tim grabbed the
without her mom â s help . When she
they would support him . They opened the
nothing seemed quite right . Then one day
promising to protect him . Suddenly , they
mom 's favourite bowl . He found a
it would be safe . " We want
to beat Gray â s prices , because everyone
t willing to share . June pulled the
but she said no . M um said that

MIN ACTIVATIONS
MIN = -0.153

feel better . Lily felt a little bit better but still
." Tim and Lily looked around the movie place
was so amazing ! Everywhere it sang , it made people
near the clouds . Everywhere it flew , it left a
around the world . Everywhere it went , it made people
be okay . Lily felt a little weak because she was
happy and free , and it made Lily smile .
!" said Lily generously . They looked everywhere but couldn 't
what it was , so she asked her mom .
started to spread . Everywhere it went , something happened .
a candy . Lily felt a little better and went back
play with it . Everywhere she went , she felt the
cat named Jazz . Everywhere she went , she was always
that little seed . Everywhere it went , it made flowers
loved to sing . Everywhere it went , it would sing
to be beautiful , so it tried to make itself very
a gentle lung . Everywhere it went , it made its
was not hurt , but it was scared . The cat
any water anywhere . Everywhere she looked , it was all
it could not . Everywhere it looked there was too much

INTERVAL 0.099 - 0.183
CONTAINS 1.436%

harsh it may seem . <|endoftext|> Once there was a girl
the different things in the lab . But when the smell
named it Purple y . <|endoftext|> M olly the rabbit was
of her big discovery . <|endoftext|> Once upon a time there
up into a tree . The floods kept

INTERVAL 0.015 - 0.099
CONTAINS 36.873%

afraid the old man had come with bad news . But
and made it their new friend . They had so much
Tom my was very happy and he played with the
slower than the other jets . Anna and Ben are curious
, " Look , Ben . This is a boat .

INTERVAL -0.069 - 0.015
CONTAINS 54.135%

was very thirsty . He saw a big mel on ,
was to drink from a glass without using his paws .
Now Emma knew to keep her dresses away from her dog
play together , but it was hard to talk .
took him to the doctor 's office . The doctor said

INTERVAL -0.153 - -0.069
CONTAINS 7.557%

the panc ake mix and they made pancakes together . Lily
Sam was relieved , but he never forgot how brave he
dog were playing together . They liked to play and run
Lily to the bathroom . She cleaned her knee and put
but she was happy . She liked the cold air on
Token Everyone
Token ID11075
Feature activation-3.69e-02

Pos logit contributions
fulness+0.114
and+0.110
lights+0.102
ions+0.101
reating+0.097
Neg logit contributions
 saw-0.125
 spit-0.125
 picked-0.124
 pet-0.124
 observed-0.123
Token smiled
Token ID13541
Feature activation+3.33e-02

Pos logit contributions
 forgot+0.040
 remembered+0.039
 sne+0.036
 wish+0.036
 blew+0.036
Neg logit contributions
 This-0.084
 If-0.083
'.-0.081
and-0.080
".-0.079
Token and
Token ID290
Feature activation-2.30e-02

Pos logit contributions
 This+0.064
and+0.064
ion+0.063
 If+0.063
ions+0.059
Neg logit contributions
 remembered-0.059
 forgot-0.052
 saw-0.051
 bit-0.050
 sne-0.050
Token agreed
Token ID4987
Feature activation+6.50e-02

Pos logit contributions
 forgot+0.024
 remembered+0.023
 blew+0.021
 sne+0.021
 bit+0.021
Neg logit contributions
 This-0.055
 If-0.055
'.-0.053
and-0.053
".-0.052
Token.
Token ID13
Feature activation+8.24e-02

Pos logit contributions
ays+0.100
 If+0.097
anes+0.092
 Today+0.088
 This+0.088
Neg logit contributions
 bit-0.124
 forgot-0.122
 sne-0.121
 knew-0.118
 remembered-0.118
Token
Token ID220
Feature activation+1.83e-01

Pos logit contributions
fulness+0.122
rum+0.111
and+0.109
reating+0.108
ions+0.102
Neg logit contributions
 sw-0.141
 saw-0.141
 threw-0.137
 picked-0.134
 went-0.134
Token
Token ID198
Feature activation+7.89e-02

Pos logit contributions
keeping+0.342
overs+0.333
 lately+0.332
rier+0.328
umm+0.325
Neg logit contributions
 saw-0.184
 also-0.164
 lost-0.160
 felt-0.152
 met-0.148
Token
Token ID198
Feature activation+1.17e-03

Pos logit contributions
ment+0.101
ados+0.101
 anymore+0.100
ometers+0.098
overs+0.097
Neg logit contributions
 saw-0.140
 se-0.132
 forgot-0.128
 met-0.127
 sw-0.127
TokenIn
Token ID818
Feature activation-6.71e-04

Pos logit contributions
 This+0.002
 If+0.002
'.+0.002
".+0.002
and+0.002
Neg logit contributions
 forgot-0.002
 remembered-0.002
 blew-0.002
 bit-0.002
 dove-0.002
Token the
Token ID262
Feature activation-3.23e-02

Pos logit contributions
 remembered+0.001
 blew+0.001
 sne+0.001
 forgot+0.001
 bit+0.001
Neg logit contributions
and-0.001
 If-0.001
 This-0.001
'.-0.001
".-0.001
Token game
Token ID983
Feature activation+4.12e-02

Pos logit contributions
 forgot+0.039
 remembered+0.038
 blew+0.035
 sne+0.035
 wish+0.033
Neg logit contributions
 This-0.072
 If-0.071
'.-0.069
".-0.068
and-0.067
Token so
Token ID523
Feature activation+2.71e-02

Pos logit contributions
'.+0.077
.'+0.076
 This+0.073
 If+0.073
and+0.072
Neg logit contributions
 forgot-0.082
 sne-0.081
 remembered-0.080
 sw-0.078
 bit-0.077
Token they
Token ID484
Feature activation-5.74e-02

Pos logit contributions
'.+0.047
.'+0.046
and+0.045
 If+0.043
.'"+0.043
Neg logit contributions
 sw-0.049
 forgot-0.048
 dove-0.047
 remembered-0.047
 ate-0.046
Token asked
Token ID1965
Feature activation+6.03e-02

Pos logit contributions
 forgot+0.056
 remembered+0.053
 sne+0.052
 wish+0.051
 bit+0.051
Neg logit contributions
 This-0.129
 If-0.128
'.-0.127
".-0.123
.'-0.123
Token Bob
Token ID5811
Feature activation+8.47e-02

Pos logit contributions
uts+0.102
ula+0.098
 This+0.097
and+0.097
ays+0.095
Neg logit contributions
 sw-0.106
 remembered-0.102
 saw-0.102
 ate-0.100
 knew-0.100
Token.
Token ID13
Feature activation+8.30e-02

Pos logit contributions
 If+0.124
and+0.119
 This+0.119
uts+0.119
ula+0.119
Neg logit contributions
 remembered-0.156
 saw-0.150
 forgot-0.145
 sne-0.143
 ate-0.140
Token
Token ID220
Feature activation+1.80e-01

Pos logit contributions
ula+0.148
fulness+0.143
and+0.142
uts+0.128
overs+0.127
Neg logit contributions
 sw-0.131
 saw-0.125
 bark-0.124
 ate-0.123
 dove-0.120
Token
Token ID198
Feature activation+6.84e-02

Pos logit contributions
keeping+0.321
ados+0.312
overs+0.294
fulness+0.287
rs+0.284
Neg logit contributions
 saw-0.208
 also-0.196
 lost-0.177
 knew-0.171
 met-0.165
Token
Token ID198
Feature activation+1.79e-02

Pos logit contributions
anks+0.099
overs+0.092
fulness+0.089
ados+0.088
unes+0.087
Neg logit contributions
 saw-0.122
 dove-0.121
 sw-0.118
 also-0.111
 swoop-0.111
TokenOne
Token ID3198
Feature activation-1.42e-02

Pos logit contributions
 This+0.035
 If+0.034
and+0.034
fulness+0.034
overs+0.033
Neg logit contributions
 dove-0.022
 sw-0.022
 remembered-0.022
 blew-0.021
 chased-0.021
Token day
Token ID1110
Feature activation+1.21e-02

Pos logit contributions
 forgot+0.019
 remembered+0.019
 sne+0.018
 wish+0.018
 bit+0.018
Neg logit contributions
 If-0.029
 This-0.029
'.-0.028
and-0.028
".-0.027
Token,
Token ID11
Feature activation+1.23e-02

Pos logit contributions
and+0.018
 If+0.018
 This+0.017
'.+0.017
.'+0.017
Neg logit contributions
 remembered-0.024
 forgot-0.023
 sw-0.023
 bit-0.023
 blew-0.023
Token wouldn
Token ID3636
Feature activation+6.15e-02

Pos logit contributions
 wish+0.061
 blew+0.060
 buzz+0.059
 forgot+0.059
 remembered+0.058
Neg logit contributions
 If-0.119
-0.117
 This-0.117
'.-0.115
".-0.113
Token't
Token ID470
Feature activation-1.12e-02

Pos logit contributions
and+0.123
 This+0.109
ment+0.108
 If+0.108
boxes+0.104
Neg logit contributions
 hurt-0.095
 sne-0.094
 blew-0.086
 hit-0.085
 dropped-0.084
Token allow
Token ID1249
Feature activation+5.49e-02

Pos logit contributions
 forgot+0.015
 remembered+0.014
 sne+0.013
 bit+0.013
 wish+0.013
Neg logit contributions
 This-0.025
 If-0.024
and-0.024
'.-0.023
.'-0.023
Token it
Token ID340
Feature activation+8.51e-02

Pos logit contributions
 If+0.093
and+0.093
'.+0.092
.'"+0.091
 This+0.089
Neg logit contributions
 sne-0.099
 pin-0.092
 hurt-0.091
 cut-0.089
 sc-0.088
Token.
Token ID13
Feature activation+7.10e-02

Pos logit contributions
 If+0.136
 Here+0.133
 Playing+0.132
aps+0.132
 With+0.130
Neg logit contributions
 hurt-0.154
 dropped-0.144
 got-0.142
 forgot-0.140
 left-0.139
Token
Token ID220
Feature activation+1.78e-01

Pos logit contributions
fulness+0.117
".+0.109
ions+0.105
.'"+0.104
and+0.100
Neg logit contributions
 saw-0.127
 remembered-0.122
 observed-0.120
 felt-0.118
 forgot-0.117
Token
Token ID198
Feature activation+8.91e-02

Pos logit contributions
rs+0.298
keeping+0.296
overs+0.290
umm+0.283
.'"+0.282
Neg logit contributions
 saw-0.209
 felt-0.190
 knew-0.187
 also-0.186
 forgot-0.184
Token
Token ID198
Feature activation-2.57e-03

Pos logit contributions
fulness+0.120
ions+0.117
rs+0.116
der+0.116
ment+0.115
Neg logit contributions
 saw-0.154
 forgot-0.143
 met-0.138
 remembered-0.137
 knew-0.137
TokenSam
Token ID16305
Feature activation-3.17e-02

Pos logit contributions
 forgot+0.004
 remembered+0.004
 blew+0.004
 sne+0.004
 bit+0.004
Neg logit contributions
 This-0.005
 If-0.004
".-0.004
'.-0.004
.'-0.004
Tokenmy
Token ID1820
Feature activation-6.58e-02

Pos logit contributions
 forgot+0.052
 remembered+0.050
 blew+0.049
 sne+0.048
 wish+0.048
Neg logit contributions
 This-0.056
 If-0.055
'.-0.053
and-0.052
".-0.051
Token was
Token ID373
Feature activation-3.93e-02

Pos logit contributions
 wish+0.076
 forgot+0.070
 buzz+0.069
 blew+0.068
 dove+0.067
Neg logit contributions
 This-0.134
 If-0.134
-0.133
'.-0.130
 She-0.129
Tokenâ
Token ID22940
Feature activation+7.61e-03

Pos logit contributions
ently+0.235
ings+0.222
 anymore+0.221
fulness+0.220
eways+0.219
Neg logit contributions
 dove-0.179
 bit-0.176
 zoom-0.176
 put-0.174
 wish-0.171
Token
Token ID26391
Feature activation+6.22e-02

Pos logit contributions
and+0.014
 If+0.014
uts+0.013
 This+0.013
".+0.012
Neg logit contributions
 forgot-0.011
 remembered-0.011
 blew-0.011
 ate-0.011
 bit-0.010
Token She
Token ID1375
Feature activation-9.89e-03

Pos logit contributions
fulness+0.087
 If+0.084
and+0.084
ion+0.077
fully+0.075
Neg logit contributions
 forgot-0.133
 saw-0.125
 remembered-0.121
 dove-0.118
 ate-0.116
Token asked
Token ID1965
Feature activation+7.65e-02

Pos logit contributions
 forgot+0.012
 remembered+0.012
 sne+0.011
 blew+0.011
 bit+0.011
Neg logit contributions
 This-0.022
 If-0.022
and-0.022
'.-0.021
.'-0.021
Token.
Token ID13
Feature activation+1.52e-01

Pos logit contributions
uts+0.129
 This+0.123
and+0.118
 If+0.113
 Today+0.112
Neg logit contributions
 sne-0.132
 remembered-0.126
 knew-0.124
 saw-0.124
 h-0.123
Token
Token ID220
Feature activation+1.78e-01

Pos logit contributions
fulness+0.242
ently+0.239
ings+0.232
 anymore+0.230
anks+0.222
Neg logit contributions
 saw-0.236
 put-0.223
 pet-0.222
 climbed-0.220
 rubbed-0.216
Token
Token ID198
Feature activation+9.46e-02

Pos logit contributions
keeping+0.303
overs+0.287
fulness+0.285
ados+0.281
edly+0.272
Neg logit contributions
 saw-0.190
 also-0.172
 climbed-0.170
 knew-0.168
 felt-0.166
Token
Token ID198
Feature activation+3.89e-02

Pos logit contributions
 anymore+0.157
anks+0.149
ays+0.142
unes+0.136
ados+0.134
Neg logit contributions
 saw-0.155
 dove-0.152
 blew-0.148
lly-0.146
 se-0.146
TokenTo
Token ID2514
Feature activation+6.12e-02

Pos logit contributions
 anymore+0.079
 This+0.075
anks+0.075
 If+0.073
fulness+0.073
Neg logit contributions
 blew-0.049
 dove-0.049
 remembered-0.048
 chased-0.048
 forgot-0.048
Token her
Token ID607
Feature activation-3.50e-03

Pos logit contributions
and+0.127
 If+0.120
uts+0.116
 This+0.111
 That+0.109
Neg logit contributions
 sne-0.090
 forgot-0.084
 wish-0.083
 put-0.083
 cut-0.080
Token surprise
Token ID5975
Feature activation+6.59e-02

Pos logit contributions
 wish+0.004
 remembered+0.004
 forgot+0.004
 sne+0.004
 bit+0.004
Neg logit contributions
 If-0.008
 This-0.008
and-0.008
'.-0.008
".-0.008
Token of
Token ID286
Feature activation+2.49e-03

Pos logit contributions
'.+0.059
.'+0.057
 This+0.057
.'"+0.056
 If+0.056
Neg logit contributions
 sne-0.069
 cut-0.066
 dropped-0.065
 pin-0.065
 forgot-0.065
Token a
Token ID257
Feature activation-3.38e-02

Pos logit contributions
 This+0.004
'.+0.004
 If+0.004
and+0.004
.'+0.004
Neg logit contributions
 remembered-0.004
 forgot-0.004
 sne-0.004
 blew-0.004
 wish-0.004
Token new
Token ID649
Feature activation-2.10e-02

Pos logit contributions
 forgot+0.028
 remembered+0.026
 blew+0.024
 sne+0.023
 ate+0.022
Neg logit contributions
 This-0.087
 If-0.087
'.-0.085
".-0.083
.'-0.083
Token idea
Token ID2126
Feature activation+4.78e-02

Pos logit contributions
 forgot+0.022
 remembered+0.021
 sne+0.020
 wish+0.019
 blew+0.019
Neg logit contributions
 This-0.050
 If-0.050
'.-0.048
and-0.048
".-0.047
Token.
Token ID13
Feature activation+7.34e-02

Pos logit contributions
 If+0.065
 This+0.063
and+0.059
ays+0.057
angers+0.056
Neg logit contributions
 remembered-0.096
 knew-0.094
 did-0.094
 tried-0.092
 saw-0.092
Token
Token ID220
Feature activation+1.77e-01

Pos logit contributions
fulness+0.145
 anymore+0.137
 ourselves+0.131
ings+0.130
ions+0.130
Neg logit contributions
 tried-0.090
 looked-0.086
 saw-0.085
 felt-0.082
 took-0.081
Token
Token ID198
Feature activation+9.61e-02

Pos logit contributions
overs+0.330
keeping+0.328
ados+0.325
ment+0.315
rs+0.306
Neg logit contributions
 saw-0.185
 lost-0.165
 also-0.161
 felt-0.156
 knew-0.152
Token
Token ID198
Feature activation+9.95e-03

Pos logit contributions
unes+0.154
ados+0.153
overs+0.149
ays+0.146
der+0.145
Neg logit contributions
 saw-0.152
 met-0.132
 knew-0.131
 cast-0.129
 tried-0.129
TokenHe
Token ID1544
Feature activation-8.84e-02

Pos logit contributions
 This+0.018
.'+0.017
'.+0.017
".+0.017
 anymore+0.017
Neg logit contributions
 forgot-0.014
 remembered-0.014
 blew-0.014
 bit-0.013
 cut-0.013
Token decided
Token ID3066
Feature activation+1.58e-02

Pos logit contributions
 wish+0.094
 forgot+0.083
 boo+0.079
 forget+0.079
inge+0.078
Neg logit contributions
-0.179
.-0.173
'.-0.170
 This-0.168
 If-0.166
Token to
Token ID284
Feature activation-3.22e-02

Pos logit contributions
'.+0.023
 If+0.023
.'+0.022
 Today+0.022
 This+0.022
Neg logit contributions
 remembered-0.032
 forgot-0.032
 saw-0.030
 dropped-0.030
 blew-0.030
Token with
Token ID351
Feature activation+6.73e-02

Pos logit contributions
 If+0.107
uts+0.107
 This+0.103
and+0.103
ula+0.101
Neg logit contributions
 bit-0.163
 sne-0.156
 wish-0.155
 ate-0.152
 remembered-0.151
Token his
Token ID465
Feature activation+2.02e-02

Pos logit contributions
and+0.123
but+0.121
 If+0.119
uts+0.119
 This+0.118
Neg logit contributions
 sne-0.102
 wish-0.100
 bit-0.098
 cut-0.097
 sw-0.093
Token friends
Token ID2460
Feature activation+1.03e-01

Pos logit contributions
 If+0.047
 This+0.045
and+0.045
'.+0.043
.'+0.042
Neg logit contributions
 wish-0.023
 sne-0.023
 remembered-0.023
 bit-0.022
 forgot-0.022
Token instead
Token ID2427
Feature activation+1.04e-01

Pos logit contributions
and+0.115
 If+0.107
uts+0.105
 This+0.102
angers+0.095
Neg logit contributions
 remembered-0.233
 bit-0.232
 forgot-0.222
 ate-0.219
 blew-0.219
Token.
Token ID13
Feature activation+1.48e-01

Pos logit contributions
and+0.116
 If+0.107
 This+0.098
but+0.094
angers+0.092
Neg logit contributions
 bit-0.227
 sne-0.223
 blew-0.222
 forgot-0.221
 remembered-0.221
Token
Token ID220
Feature activation+1.77e-01

Pos logit contributions
ings+0.212
arling+0.197
fulness+0.196
ently+0.196
ment+0.185
Neg logit contributions
 ate-0.210
 zoom-0.210
 sw-0.210
 remembered-0.207
 threw-0.204
Token
Token ID198
Feature activation+7.23e-02

Pos logit contributions
 anymore+0.251
 lately+0.243
fulness+0.241
keeping+0.231
ados+0.225
Neg logit contributions
 saw-0.223
 met-0.220
 dropped-0.202
 loved-0.202
 also-0.198
Token
Token ID198
Feature activation+2.37e-02

Pos logit contributions
 anymore+0.092
unes+0.091
anks+0.086
uts+0.085
ays+0.084
Neg logit contributions
 saw-0.136
 knew-0.128
 remembered-0.128
 met-0.128
 sw-0.128
TokenSo
Token ID2396
Feature activation+4.19e-02

Pos logit contributions
 This+0.044
 If+0.042
 anymore+0.041
".+0.041
.'+0.040
Neg logit contributions
 remembered-0.034
 blew-0.032
 forgot-0.032
 dove-0.031
 chased-0.031
Token Peter
Token ID5613
Feature activation-2.08e-02

Pos logit contributions
and+0.072
 If+0.065
uts+0.064
 This+0.061
.'+0.061
Neg logit contributions
 remembered-0.078
 forgot-0.078
 bit-0.075
 sw-0.074
 sne-0.072
Token and
Token ID290
Feature activation+8.10e-03

Pos logit contributions
 forgot+0.029
 remembered+0.029
 blew+0.027
 sne+0.027
 bit+0.026
Neg logit contributions
 This-0.042
 If-0.042
'.-0.041
and-0.040
".-0.040
Token wouldn
Token ID3636
Feature activation+4.94e-02

Pos logit contributions
 forgot+0.030
 remembered+0.029
 blew+0.026
 sne+0.026
 wish+0.026
Neg logit contributions
 This-0.072
 If-0.071
'.-0.070
".-0.068
and-0.068
Token't
Token ID470
Feature activation-1.37e-02

Pos logit contributions
and+0.091
 If+0.085
 This+0.083
boxes+0.079
aris+0.078
Neg logit contributions
 sne-0.080
 hurt-0.079
 blew-0.073
 cut-0.073
 dropped-0.072
Token let
Token ID1309
Feature activation+3.57e-02

Pos logit contributions
 forgot+0.015
 remembered+0.014
 sne+0.014
 wish+0.013
 bit+0.013
Neg logit contributions
 This-0.033
 If-0.032
'.-0.031
and-0.031
.'-0.030
Token him
Token ID683
Feature activation+2.40e-02

Pos logit contributions
and+0.060
aps+0.053
 If+0.053
uts+0.052
but+0.051
Neg logit contributions
 sne-0.066
 hurt-0.064
 dropped-0.062
 wish-0.061
 remembered-0.060
Token.
Token ID13
Feature activation+3.94e-02

Pos logit contributions
 If+0.058
 This+0.055
 That+0.052
 By+0.052
 You+0.051
Neg logit contributions
 sne-0.029
 forgot-0.027
 wish-0.026
 hurt-0.026
 remembered-0.025
Token
Token ID220
Feature activation+1.76e-01

Pos logit contributions
and+0.050
uts+0.045
ions+0.045
ion+0.044
ula+0.043
Neg logit contributions
 remembered-0.086
 saw-0.083
 sw-0.083
 forgot-0.081
 wished-0.080
Token
Token ID198
Feature activation+7.16e-02

Pos logit contributions
rs+0.303
overs+0.291
ados+0.286
keeping+0.283
ting+0.276
Neg logit contributions
 saw-0.184
 lost-0.180
 also-0.176
 felt-0.172
 forgot-0.159
Token
Token ID198
Feature activation-9.55e-03

Pos logit contributions
ion+0.085
ions+0.084
ays+0.082
unes+0.081
 anymore+0.081
Neg logit contributions
 saw-0.146
 met-0.145
 remembered-0.142
 forgot-0.134
 lost-0.132
Token"
Token ID1
Feature activation-2.01e-02

Pos logit contributions
 remembered+0.017
 forgot+0.017
 blew+0.016
 bit+0.015
 sne+0.015
Neg logit contributions
 This-0.016
 If-0.015
and-0.015
'.-0.014
".-0.014
TokenMom
Token ID29252
Feature activation+3.30e-02

Pos logit contributions
 forgot+0.032
 remembered+0.031
 blew+0.030
 sne+0.029
 bit+0.029
Neg logit contributions
 This-0.038
 If-0.037
'.-0.035
".-0.035
and-0.035
Token,
Token ID11
Feature activation-1.32e-02

Pos logit contributions
and+0.062
 If+0.058
 This+0.058
uts+0.057
'.+0.053
Neg logit contributions
 remembered-0.058
 forgot-0.057
 saw-0.054
 sne-0.054
 cut-0.052
Token important
Token ID1593
Feature activation+9.03e-02

Pos logit contributions
 If+0.023
and+0.023
 This+0.023
'.+0.022
.'+0.021
Neg logit contributions
 forgot-0.018
 bit-0.018
 remembered-0.017
 blew-0.017
 sne-0.017
Token!
Token ID0
Feature activation+1.24e-01

Pos logit contributions
and+0.147
 If+0.140
 This+0.136
 Soon+0.127
 By+0.124
Neg logit contributions
 forgot-0.157
 remembered-0.156
 blew-0.154
 did-0.150
 knew-0.149
Tokenâ
Token ID22940
Feature activation-3.20e-03

Pos logit contributions
ings+0.209
ently+0.201
fulness+0.191
 anymore+0.183
uts+0.181
Neg logit contributions
 remembered-0.173
 sne-0.167
 bit-0.166
 saw-0.165
 threw-0.165
Token
Token ID26391
Feature activation+5.31e-02

Pos logit contributions
 forgot+0.005
 remembered+0.005
 blew+0.005
 ate+0.005
 bit+0.005
Neg logit contributions
 If-0.005
and-0.005
 This-0.005
uts-0.005
".-0.005
Token.
Token ID13
Feature activation+1.26e-01

Pos logit contributions
 If+0.075
fulness+0.074
and+0.071
ion+0.068
fully+0.065
Neg logit contributions
 forgot-0.110
 remembered-0.101
 saw-0.100
 ate-0.096
 sne-0.093
Token
Token ID220
Feature activation+1.75e-01

Pos logit contributions
ings+0.196
ently+0.186
fulness+0.172
uts+0.172
 anymore+0.171
Neg logit contributions
 remembered-0.191
 sne-0.182
 bit-0.179
 threw-0.175
 switched-0.174
Token
Token ID198
Feature activation+8.96e-02

Pos logit contributions
keeping+0.266
edly+0.261
fulness+0.257
fully+0.255
overs+0.255
Neg logit contributions
 also-0.182
 saw-0.171
 forgot-0.171
 gifted-0.163
 lost-0.162
Token
Token ID198
Feature activation+1.43e-02

Pos logit contributions
 anymore+0.136
anks+0.134
ados+0.132
rier+0.132
unes+0.130
Neg logit contributions
 saw-0.141
lly-0.132
 se-0.128
 also-0.127
 met-0.127
TokenThey
Token ID2990
Feature activation-5.49e-02

Pos logit contributions
 This+0.028
 If+0.027
".+0.025
 anymore+0.025
'.+0.025
Neg logit contributions
 remembered-0.020
 forgot-0.020
 blew-0.019
 bit-0.018
 sne-0.018
Token took
Token ID1718
Feature activation+3.69e-04

Pos logit contributions
 forgot+0.046
 remembered+0.042
 wish+0.041
 blew+0.041
 sne+0.040
Neg logit contributions
 This-0.131
'.-0.130
 If-0.129
.'-0.125
".-0.125
Token the
Token ID262
Feature activation-4.37e-02

Pos logit contributions
 If+0.001
and+0.001
 This+0.001
'.+0.001
.'+0.001
Neg logit contributions
 sne-0.001
 bit-0.001
 forgot-0.001
 remembered-0.001
 dropped-0.001
Token safe
Token ID3338
Feature activation+6.92e-02

Pos logit contributions
 forgot+0.022
 remembered+0.021
 sne+0.020
 blew+0.020
 wish+0.020
Neg logit contributions
 This-0.034
 If-0.034
'.-0.033
.'-0.032
".-0.032
Token behind
Token ID2157
Feature activation+8.08e-02

Pos logit contributions
 If+0.109
 This+0.107
ula+0.089
and+0.089
 These+0.087
Neg logit contributions
 sw-0.136
 forgot-0.133
 remembered-0.132
 bit-0.129
 fl-0.127
Token Jack
Token ID3619
Feature activation+6.82e-02

Pos logit contributions
 If+0.159
but+0.142
 This+0.140
and+0.136
uts+0.135
Neg logit contributions
 sne-0.130
 sw-0.125
 spark-0.124
 lost-0.124
 c-0.123
Tokenâ
Token ID22940
Feature activation+1.01e-02

Pos logit contributions
 If+0.124
 This+0.108
and+0.105
but+0.102
uts+0.099
Neg logit contributions
 sne-0.128
 forgot-0.126
 remembered-0.122
 ate-0.119
 bit-0.119
Token
Token ID26391
Feature activation+5.78e-02

Pos logit contributions
 If+0.019
and+0.019
uts+0.018
 This+0.017
".+0.016
Neg logit contributions
 forgot-0.015
 remembered-0.014
 blew-0.014
 sne-0.014
 ate-0.014
Token
Token ID8151
Feature activation+1.74e-01

Pos logit contributions
 If+0.109
ion+0.087
fulness+0.087
and+0.086
ions+0.082
Neg logit contributions
 forgot-0.111
 saw-0.097
 sne-0.097
 remembered-0.096
 ate-0.093
Tokens
Token ID82
Feature activation+3.53e-02

Pos logit contributions
 If+0.248
 This+0.240
 Do+0.228
 What+0.223
 itself+0.221
Neg logit contributions
 forgot-0.276
 knew-0.254
 remembered-0.245
 laughed-0.244
 saw-0.243
Token shield
Token ID7614
Feature activation+1.12e-01

Pos logit contributions
 If+0.067
 This+0.065
and+0.059
'.+0.059
 That+0.059
Neg logit contributions
 wish-0.060
 sne-0.057
 bit-0.055
 dream-0.054
 forgot-0.054
Token.
Token ID13
Feature activation+1.32e-01

Pos logit contributions
 If+0.149
 This+0.146
and+0.124
 That+0.124
ula+0.118
Neg logit contributions
 bit-0.239
 forgot-0.236
 remembered-0.235
 ate-0.226
 fl-0.225
Token<|endoftext|>
Token ID50256
Feature activation+1.02e-01

Pos logit contributions
ings+0.223
rier+0.221
ently+0.216
arling+0.200
fully+0.188
Neg logit contributions
 forgot-0.184
 knew-0.182
 remembered-0.171
 saw-0.170
 had-0.170
TokenOne
Token ID3198
Feature activation-1.22e-02

Pos logit contributions
fulness+0.213
rier+0.213
uts+0.212
 If+0.208
and+0.203
Neg logit contributions
 saw-0.142
 met-0.122
 did-0.116
 looked-0.114
 had-0.114
Tokenâ
Token ID22940
Feature activation-1.47e-02

Pos logit contributions
ings+0.145
ently+0.140
fulness+0.134
 anymore+0.132
uts+0.121
Neg logit contributions
 sne-0.173
 remembered-0.171
 bit-0.164
 cut-0.162
 threw-0.162
Token
Token ID26391
Feature activation+4.39e-02

Pos logit contributions
 forgot+0.026
 remembered+0.025
 blew+0.024
 bit+0.024
 sne+0.024
Neg logit contributions
 If-0.024
 This-0.024
and-0.023
'.-0.022
".-0.022
Token she
Token ID673
Feature activation-6.37e-02

Pos logit contributions
 If+0.063
and+0.061
fulness+0.060
ion+0.059
ula+0.054
Neg logit contributions
 forgot-0.092
 remembered-0.086
 saw-0.084
 ate-0.079
 sne-0.079
Token said
Token ID531
Feature activation+3.89e-02

Pos logit contributions
 remembered+0.061
 forgot+0.060
 blew+0.056
 dove+0.055
 wish+0.054
Neg logit contributions
 This-0.143
'.-0.141
".-0.140
 If-0.140
.'-0.135
Token.
Token ID13
Feature activation+1.22e-01

Pos logit contributions
and+0.064
 If+0.061
.'+0.058
'.+0.057
.'"+0.055
Neg logit contributions
 sne-0.074
 forgot-0.071
 remembered-0.070
 dropped-0.070
 bit-0.068
Token
Token ID220
Feature activation+1.74e-01

Pos logit contributions
 anymore+0.217
ently+0.199
ings+0.199
fulness+0.195
uts+0.182
Neg logit contributions
 saw-0.165
 remembered-0.164
 tried-0.157
 switched-0.157
 rubbed-0.155
Token
Token ID198
Feature activation+8.55e-02

Pos logit contributions
overs+0.300
keeping+0.288
rs+0.280
ados+0.277
edly+0.277
Neg logit contributions
 saw-0.160
 also-0.156
 lost-0.150
 climbed-0.150
 felt-0.147
Token
Token ID198
Feature activation+1.35e-02

Pos logit contributions
 anymore+0.141
unes+0.132
ados+0.128
rier+0.128
anks+0.128
Neg logit contributions
 saw-0.137
 blew-0.132
lly-0.126
 sw-0.125
 knew-0.124
TokenBut
Token ID1537
Feature activation+6.38e-03

Pos logit contributions
 This+0.025
 anymore+0.024
 If+0.024
".+0.023
.'+0.023
Neg logit contributions
 remembered-0.020
 blew-0.020
 forgot-0.019
 chased-0.018
 saw-0.018
Token Peter
Token ID5613
Feature activation-1.79e-02

Pos logit contributions
and+0.010
 This+0.010
 If+0.010
uts+0.009
'.+0.009
Neg logit contributions
 remembered-0.012
 forgot-0.012
 blew-0.012
 bit-0.011
 sne-0.011
Token wasn
Token ID2492
Feature activation-4.12e-03

Pos logit contributions
 forgot+0.018
 remembered+0.017
 sne+0.016
 blew+0.016
 bit+0.015
Neg logit contributions
 This-0.044
 If-0.044
'.-0.042
and-0.042
".-0.041
Tokenâ
Token ID22940
Feature activation-1.22e-02

Pos logit contributions
ently+0.138
ings+0.133
 anymore+0.124
fulness+0.123
eways+0.117
Neg logit contributions
 bit-0.154
 ate-0.142
 threw-0.142
 dove-0.141
 cut-0.140
Token
Token ID26391
Feature activation+2.69e-02

Pos logit contributions
 forgot+0.021
 remembered+0.020
 blew+0.020
 bit+0.020
 ate+0.019
Neg logit contributions
 If-0.020
 This-0.020
and-0.019
'.-0.019
".-0.018
Token she
Token ID673
Feature activation-5.17e-02

Pos logit contributions
 If+0.041
and+0.040
fulness+0.038
ion+0.035
 This+0.035
Neg logit contributions
 forgot-0.052
 remembered-0.049
 ate-0.047
 saw-0.047
 bit-0.046
Token said
Token ID531
Feature activation+5.22e-02

Pos logit contributions
 forgot+0.057
 remembered+0.057
 blew+0.054
 sne+0.053
 dove+0.050
Neg logit contributions
 This-0.113
 If-0.112
'.-0.111
".-0.110
.'-0.107
Token.
Token ID13
Feature activation+1.32e-01

Pos logit contributions
and+0.095
 If+0.085
keeping+0.082
 Today+0.079
 By+0.079
Neg logit contributions
 sne-0.088
 bit-0.087
 ate-0.087
 remembered-0.086
 forgot-0.086
Token
Token ID220
Feature activation+1.73e-01

Pos logit contributions
ently+0.217
fulness+0.214
ings+0.201
 anymore+0.200
ment+0.190
Neg logit contributions
 threw-0.181
 bit-0.179
 sw-0.175
 p-0.174
 saw-0.173
Token
Token ID198
Feature activation+7.77e-02

Pos logit contributions
keeping+0.308
overs+0.305
ment+0.296
edly+0.294
ados+0.290
Neg logit contributions
 saw-0.162
 also-0.160
 climbed-0.143
 did-0.143
 took-0.140
Token
Token ID198
Feature activation+3.94e-03

Pos logit contributions
rier+0.124
ados+0.116
 anymore+0.116
overs+0.115
ment+0.112
Neg logit contributions
 saw-0.131
 sw-0.123
 blew-0.121
lly-0.118
 dove-0.118
TokenTim
Token ID14967
Feature activation-7.27e-03

Pos logit contributions
 This+0.008
 If+0.008
'.+0.007
.'+0.007
".+0.007
Neg logit contributions
 forgot-0.005
 blew-0.005
 remembered-0.005
 chased-0.005
 bit-0.005
Token grabbed
Token ID13176
Feature activation-1.46e-02

Pos logit contributions
 forgot+0.012
 remembered+0.012
 blew+0.011
 sne+0.011
 bit+0.011
Neg logit contributions
 This-0.013
 If-0.013
'.-0.013
and-0.013
.'-0.012
Token the
Token ID262
Feature activation-2.18e-02

Pos logit contributions
 forgot+0.027
 remembered+0.027
 sne+0.027
 blew+0.026
 bit+0.026
Neg logit contributions
 If-0.021
 This-0.021
and-0.021
'.-0.021
.'-0.020
Token without
Token ID1231
Feature activation+2.27e-02

Pos logit contributions
 This+0.163
 If+0.162
 Today+0.159
 Those+0.159
 Do+0.157
Neg logit contributions
 bit-0.129
 packed-0.124
 buried-0.123
 cut-0.122
 left-0.121
Token her
Token ID607
Feature activation+1.37e-02

Pos logit contributions
 This+0.045
 If+0.043
and+0.043
'.+0.041
 Today+0.041
Neg logit contributions
 sne-0.037
 remembered-0.035
 forgot-0.034
 pin-0.034
 bit-0.034
Token mom
Token ID1995
Feature activation+6.41e-02

Pos logit contributions
 If+0.029
and+0.028
 This+0.028
'.+0.026
 That+0.026
Neg logit contributions
 wish-0.019
 sne-0.019
 bit-0.018
 forgot-0.018
 remembered-0.017
Tokenâ
Token ID22940
Feature activation+9.19e-03

Pos logit contributions
eway+0.128
eways+0.127
 If+0.126
 Everyone+0.123
 This+0.122
Neg logit contributions
 bit-0.092
 forgot-0.092
 cut-0.091
 remembered-0.086
 sne-0.086
Token
Token ID26391
Feature activation+6.00e-02

Pos logit contributions
and+0.017
 If+0.017
uts+0.016
 This+0.016
".+0.015
Neg logit contributions
 forgot-0.013
 blew-0.013
 remembered-0.013
 sne-0.013
 bit-0.013
Token
Token ID8151
Feature activation+1.73e-01

Pos logit contributions
 If+0.104
ion+0.089
ions+0.085
and+0.085
fulness+0.084
Neg logit contributions
 forgot-0.119
 remembered-0.103
 sne-0.103
 dove-0.100
 saw-0.100
Tokens
Token ID82
Feature activation+4.81e-02

Pos logit contributions
ment+0.277
 itself+0.276
ures+0.264
eway+0.252
 Grad+0.249
Neg logit contributions
 forgot-0.236
 hurt-0.223
 bit-0.216
 got-0.215
 naughty-0.214
Token help
Token ID1037
Feature activation+8.31e-02

Pos logit contributions
 This+0.072
 If+0.071
and+0.069
but+0.066
 That+0.066
Neg logit contributions
 wish-0.089
 sne-0.087
 bit-0.087
 remembered-0.084
 forgot-0.083
Token.
Token ID13
Feature activation+1.18e-01

Pos logit contributions
ays+0.125
 This+0.123
anes+0.122
aris+0.118
eways+0.117
Neg logit contributions
 sne-0.158
 hurt-0.156
 cut-0.155
 put-0.154
 bit-0.151
Token When
Token ID1649
Feature activation+2.30e-02

Pos logit contributions
 anymore+0.171
eways+0.167
ently+0.167
ings+0.163
fulness+0.163
Neg logit contributions
 c-0.161
 forgot-0.160
 put-0.160
 rubbed-0.159
 remembered-0.158
Token she
Token ID673
Feature activation-4.95e-02

Pos logit contributions
and+0.039
 If+0.038
.'+0.035
'.+0.034
 This+0.034
Neg logit contributions
 forgot-0.043
 sne-0.043
 blew-0.041
 remembered-0.040
 wish-0.040
Token they
Token ID484
Feature activation-4.73e-02

Pos logit contributions
and+0.042
.'+0.042
'.+0.041
 If+0.041
 This+0.040
Neg logit contributions
 remembered-0.052
 sne-0.052
 sw-0.050
 forgot-0.049
 dropped-0.049
Token would
Token ID561
Feature activation+1.36e-02

Pos logit contributions
 forgot+0.059
 remembered+0.057
 blew+0.056
 dove+0.055
 bit+0.054
Neg logit contributions
 If-0.098
 This-0.097
'.-0.093
-0.091
".-0.090
Token support
Token ID1104
Feature activation+4.36e-02

Pos logit contributions
and+0.036
 This+0.035
 If+0.035
'.+0.033
 That+0.033
Neg logit contributions
 sne-0.012
 forgot-0.012
 wish-0.011
 remembered-0.011
 bit-0.011
Token him
Token ID683
Feature activation+7.87e-02

Pos logit contributions
uts+0.072
but+0.067
unes+0.066
 This+0.066
and+0.063
Neg logit contributions
 sne-0.081
 remembered-0.076
 bit-0.075
 wish-0.075
 forgot-0.074
Token.
Token ID13
Feature activation+5.75e-02

Pos logit contributions
 This+0.101
ula+0.092
 If+0.091
 You+0.089
 These+0.087
Neg logit contributions
 bit-0.165
 wish-0.160
 hurt-0.157
 sne-0.152
 promise-0.152
Token
Token ID220
Feature activation+1.73e-01

Pos logit contributions
and+0.088
ion+0.086
ions+0.086
uts+0.080
eway+0.076
Neg logit contributions
 remembered-0.117
 saw-0.111
 forgot-0.111
 realized-0.104
 felt-0.103
Token
Token ID198
Feature activation+7.43e-02

Pos logit contributions
ados+0.324
keeping+0.316
overs+0.304
umm+0.300
ment+0.297
Neg logit contributions
 saw-0.180
 also-0.160
 lost-0.157
 felt-0.156
 forgot-0.151
Token
Token ID198
Feature activation-3.58e-03

Pos logit contributions
unes+0.111
ados+0.095
ment+0.094
ions+0.093
keeping+0.091
Neg logit contributions
 saw-0.142
 remembered-0.135
 forgot-0.133
 blew-0.130
 learned-0.127
TokenThey
Token ID2990
Feature activation-7.71e-02

Pos logit contributions
 remembered+0.006
 forgot+0.006
 blew+0.006
 bit+0.006
 sne+0.006
Neg logit contributions
 This-0.006
 If-0.006
and-0.006
".-0.006
'.-0.006
Token opened
Token ID4721
Feature activation+2.70e-02

Pos logit contributions
 wish+0.076
 forgot+0.074
 boo+0.070
 dove+0.065
 feather+0.064
Neg logit contributions
-0.159
 If-0.158
'.-0.156
.-0.155
 This-0.152
Token the
Token ID262
Feature activation-2.73e-02

Pos logit contributions
and+0.057
uts+0.052
 If+0.050
 This+0.050
'.+0.049
Neg logit contributions
 sw-0.038
 fl-0.037
 bit-0.037
 sne-0.037
 saw-0.037
Token nothing
Token ID2147
Feature activation-1.32e-02

Pos logit contributions
'.+0.025
 If+0.025
.'+0.024
 This+0.024
".+0.023
Neg logit contributions
 forgot-0.030
 remembered-0.029
 sne-0.028
 saw-0.028
 ate-0.027
Token seemed
Token ID3947
Feature activation+2.56e-02

Pos logit contributions
 forgot+0.015
 remembered+0.014
 blew+0.013
 bit+0.013
 sne+0.013
Neg logit contributions
 This-0.032
 If-0.031
'.-0.030
.'-0.030
and-0.030
Token quite
Token ID2407
Feature activation+1.52e-02

Pos logit contributions
 If+0.039
 This+0.039
and+0.037
'.+0.036
 That+0.035
Neg logit contributions
 forgot-0.051
 remembered-0.050
 bit-0.049
 lost-0.048
 saw-0.047
Token right
Token ID826
Feature activation+6.63e-02

Pos logit contributions
 This+0.034
and+0.033
 If+0.033
'.+0.032
uts+0.031
Neg logit contributions
 forgot-0.020
 remembered-0.020
 bit-0.018
 dropped-0.018
 blew-0.018
Token.
Token ID13
Feature activation+9.30e-02

Pos logit contributions
 This+0.105
 Everyone+0.096
and+0.095
 If+0.094
uts+0.088
Neg logit contributions
 remembered-0.149
 saw-0.135
 forgot-0.134
 did-0.131
 sw-0.128
Token
Token ID220
Feature activation+1.73e-01

Pos logit contributions
uts+0.160
ment+0.147
fulness+0.142
ings+0.139
rier+0.138
Neg logit contributions
 saw-0.139
 looked-0.125
 tried-0.119
 found-0.119
 felt-0.117
Token
Token ID198
Feature activation+9.22e-02

Pos logit contributions
keeping+0.308
ados+0.300
ting+0.290
rs+0.287
overs+0.284
Neg logit contributions
 saw-0.195
 felt-0.171
 also-0.167
 found-0.158
 did-0.158
Token
Token ID198
Feature activation+1.53e-02

Pos logit contributions
ment+0.140
unes+0.139
anks+0.138
ados+0.136
ancing+0.130
Neg logit contributions
 saw-0.162
 met-0.141
 also-0.136
 found-0.134
 observed-0.132
TokenThen
Token ID6423
Feature activation+4.37e-03

Pos logit contributions
 This+0.029
 If+0.028
".+0.028
uts+0.027
'.+0.027
Neg logit contributions
 remembered-0.021
 forgot-0.021
 saw-0.020
 bit-0.018
 blew-0.018
Token one
Token ID530
Feature activation-4.28e-02

Pos logit contributions
and+0.007
 If+0.007
 This+0.007
uts+0.006
'.+0.006
Neg logit contributions
 remembered-0.009
 forgot-0.009
 bit-0.008
 sne-0.008
 saw-0.008
Token day
Token ID1110
Feature activation-1.44e-02

Pos logit contributions
 forgot+0.075
 remembered+0.072
 blew+0.069
 sne+0.067
 dropped+0.066
Neg logit contributions
 This-0.072
 If-0.069
'.-0.067
and-0.066
-0.064
Token promising
Token ID11781
Feature activation+5.71e-02

Pos logit contributions
'.+0.002
 This+0.002
 If+0.002
.'+0.002
".+0.002
Neg logit contributions
 remembered-0.003
 forgot-0.003
 sne-0.003
 blew-0.003
 bit-0.003
Token to
Token ID284
Feature activation-2.19e-02

Pos logit contributions
'.+0.094
.'+0.093
.'"+0.090
 If+0.087
 This+0.085
Neg logit contributions
 forgot-0.105
 remembered-0.105
 sne-0.103
 saw-0.101
 lost-0.100
Token protect
Token ID1805
Feature activation+6.22e-02

Pos logit contributions
 forgot+0.024
 remembered+0.023
 sne+0.021
 blew+0.021
 wish+0.021
Neg logit contributions
 This-0.051
 If-0.051
'.-0.049
and-0.049
".-0.048
Token him
Token ID683
Feature activation+6.75e-02

Pos logit contributions
uts+0.105
 Today+0.104
 This+0.102
 By+0.101
'.+0.100
Neg logit contributions
 pin-0.102
 sne-0.099
 hurt-0.099
 sw-0.098
 remembered-0.093
Token.
Token ID13
Feature activation+7.95e-02

Pos logit contributions
 This+0.100
 If+0.099
 You+0.087
 That+0.085
 By+0.084
Neg logit contributions
 bit-0.132
 wish-0.131
 sne-0.129
 hurt-0.127
 forgot-0.125
Token
Token ID220
Feature activation+1.72e-01

Pos logit contributions
fulness+0.141
rier+0.128
and+0.125
.'+0.122
ment+0.121
Neg logit contributions
 saw-0.137
 observed-0.124
 forgot-0.122
 remembered-0.121
 threw-0.121
Token
Token ID198
Feature activation+9.66e-02

Pos logit contributions
keeping+0.291
overs+0.281
fulness+0.276
ment+0.275
.'+0.268
Neg logit contributions
 saw-0.208
 also-0.182
 forgot-0.181
 lost-0.178
 met-0.177
Token
Token ID198
Feature activation-9.29e-03

Pos logit contributions
ment+0.141
 anymore+0.141
rier+0.141
unes+0.136
overs+0.133
Neg logit contributions
 saw-0.171
 met-0.156
 blew-0.150
 forgot-0.149
 threw-0.148
TokenSuddenly
Token ID38582
Feature activation-7.56e-03

Pos logit contributions
 forgot+0.016
 remembered+0.016
 blew+0.015
 bit+0.015
 sne+0.015
Neg logit contributions
 This-0.015
 If-0.015
'.-0.015
".-0.014
.'-0.014
Token,
Token ID11
Feature activation-2.54e-02

Pos logit contributions
 remembered+0.017
 forgot+0.016
 blew+0.016
 sne+0.016
 dropped+0.015
Neg logit contributions
and-0.010
 If-0.010
 This-0.010
'.-0.009
.'-0.009
Token they
Token ID484
Feature activation-5.08e-02

Pos logit contributions
 forgot+0.058
 remembered+0.057
 blew+0.055
 sne+0.055
 bit+0.054
Neg logit contributions
 This-0.029
 If-0.029
'.-0.027
and-0.026
".-0.026
Token mom
Token ID1995
Feature activation+4.93e-02

Pos logit contributions
 forgot+0.019
 remembered+0.019
 sne+0.018
 wish+0.018
 bit+0.017
Neg logit contributions
 This-0.041
 If-0.041
'.-0.040
and-0.039
".-0.039
Token's
Token ID338
Feature activation-3.06e-03

Pos logit contributions
 If+0.104
 This+0.098
and+0.092
'.+0.092
 By+0.091
Neg logit contributions
 remembered-0.069
 cut-0.068
 bit-0.068
 forgot-0.068
 sne-0.064
Token favourite
Token ID12507
Feature activation-3.73e-03

Pos logit contributions
 remembered+0.003
 sne+0.003
 wish+0.003
 forgot+0.003
 bit+0.003
Neg logit contributions
 This-0.007
 If-0.007
and-0.007
'.-0.007
".-0.007
Token bowl
Token ID9396
Feature activation+7.52e-02

Pos logit contributions
 remembered+0.004
 bit+0.004
 forgot+0.004
 sne+0.004
 wish+0.004
Neg logit contributions
 If-0.008
 This-0.008
and-0.008
'.-0.008
 That-0.008
Token.
Token ID13
Feature activation+8.51e-02

Pos logit contributions
 If+0.117
 This+0.108
and+0.099
'.+0.097
 By+0.096
Neg logit contributions
 bit-0.147
 buried-0.145
 cut-0.145
 forgot-0.143
 ate-0.142
Token
Token ID220
Feature activation+1.72e-01

Pos logit contributions
fulness+0.142
arling+0.139
eways+0.137
eway+0.135
ently+0.134
Neg logit contributions
 sne-0.118
 t-0.117
 cut-0.117
 looked-0.115
 saw-0.113
Token
Token ID198
Feature activation+8.03e-02

Pos logit contributions
keeping+0.303
ados+0.296
rs+0.296
ting+0.289
overs+0.286
Neg logit contributions
 saw-0.168
 felt-0.161
 also-0.156
 lost-0.150
 loved-0.139
Token
Token ID198
Feature activation+1.62e-02

Pos logit contributions
unes+0.130
anks+0.129
 anymore+0.123
ados+0.120
ays+0.120
Neg logit contributions
 saw-0.125
 packed-0.117
 gentle-0.114
 cut-0.113
 put-0.113
TokenHe
Token ID1544
Feature activation-7.92e-02

Pos logit contributions
 This+0.028
 If+0.026
anks+0.026
uts+0.025
unes+0.025
Neg logit contributions
 remembered-0.024
 bit-0.024
 forgot-0.023
 packed-0.023
 blew-0.023
Token found
Token ID1043
Feature activation+7.83e-03

Pos logit contributions
 forgot+0.074
 wish+0.071
 buzz+0.067
 dove+0.065
 flood+0.062
Neg logit contributions
-0.170
'.-0.166
 This-0.165
.-0.164
 If-0.164
Token a
Token ID257
Feature activation-6.87e-02

Pos logit contributions
'.+0.014
 This+0.014
and+0.013
 If+0.013
.'+0.013
Neg logit contributions
 remembered-0.013
 sne-0.012
 cut-0.012
 blew-0.012
 dropped-0.012
Token it
Token ID340
Feature activation-3.63e-02

Pos logit contributions
'.+0.054
 This+0.054
.'+0.053
 If+0.052
and+0.052
Neg logit contributions
 forgot-0.063
 remembered-0.058
 dropped-0.055
 hurt-0.054
 saw-0.054
Token would
Token ID561
Feature activation-2.01e-03

Pos logit contributions
 forgot+0.038
 remembered+0.037
 blew+0.034
 sne+0.034
 wish+0.033
Neg logit contributions
 This-0.085
 If-0.083
'.-0.081
".-0.080
and-0.080
Token be
Token ID307
Feature activation-1.26e-03

Pos logit contributions
 forgot+0.002
 remembered+0.002
 sne+0.002
 wish+0.002
 bit+0.002
Neg logit contributions
 This-0.005
 If-0.005
and-0.005
'.-0.005
.'-0.005
Token safe
Token ID3338
Feature activation+8.51e-02

Pos logit contributions
 forgot+0.002
 remembered+0.002
 sne+0.001
 blew+0.001
 bit+0.001
Neg logit contributions
 This-0.003
 If-0.003
'.-0.003
and-0.003
.'-0.003
Token.
Token ID13
Feature activation+6.96e-02

Pos logit contributions
 This+0.159
 If+0.150
 Everyone+0.142
 These+0.138
and+0.137
Neg logit contributions
 remembered-0.141
 forgot-0.137
 lost-0.136
 bit-0.136
 buried-0.134
Token
Token ID220
Feature activation+1.71e-01

Pos logit contributions
ions+0.115
and+0.111
fulness+0.110
ectar+0.108
ment+0.101
Neg logit contributions
 saw-0.129
 forgot-0.117
 threw-0.117
 looked-0.114
 did-0.112
Token
Token ID198
Feature activation+9.21e-02

Pos logit contributions
ados+0.318
keeping+0.306
ting+0.298
overs+0.295
umm+0.291
Neg logit contributions
 saw-0.192
 also-0.171
 felt-0.160
 did-0.157
 forgot-0.157
Token
Token ID198
Feature activation+1.31e-02

Pos logit contributions
ados+0.147
unes+0.139
ment+0.137
ions+0.132
anks+0.132
Neg logit contributions
 saw-0.157
 forgot-0.140
 also-0.138
 packed-0.137
 did-0.136
Token"
Token ID1
Feature activation-1.76e-03

Pos logit contributions
 This+0.023
 If+0.022
and+0.021
'.+0.021
.'+0.021
Neg logit contributions
 forgot-0.020
 remembered-0.020
 blew-0.019
 packed-0.019
 saw-0.019
TokenWe
Token ID1135
Feature activation-5.28e-02

Pos logit contributions
 remembered+0.003
 forgot+0.002
 blew+0.002
 bit+0.002
 sne+0.002
Neg logit contributions
 This-0.003
 If-0.003
'.-0.003
".-0.003
.'-0.003
Token want
Token ID765
Feature activation+2.23e-02

Pos logit contributions
 forgot+0.073
 remembered+0.070
 blew+0.068
 dove+0.067
 sne+0.067
Neg logit contributions
'.-0.102
 If-0.101
 This-0.101
".-0.098
.'-0.097
Token to
Token ID284
Feature activation-6.40e-03

Pos logit contributions
 If+0.121
 This+0.116
 You+0.116
and+0.112
'.+0.111
Neg logit contributions
 bit-0.112
 hurt-0.112
 pin-0.112
 remembered-0.112
 forgot-0.110
Token beat
Token ID4405
Feature activation+6.17e-02

Pos logit contributions
 forgot+0.006
 remembered+0.006
 sne+0.006
 wish+0.005
 bit+0.005
Neg logit contributions
 If-0.016
 This-0.016
and-0.016
'.-0.015
 That-0.015
Token Gray
Token ID12723
Feature activation+8.23e-02

Pos logit contributions
 Today+0.095
 This+0.094
and+0.093
 If+0.091
but+0.090
Neg logit contributions
 sne-0.111
 hurt-0.110
 bit-0.108
 pin-0.108
 lost-0.104
Tokenâ
Token ID22940
Feature activation+3.75e-03

Pos logit contributions
 If+0.130
 This+0.117
and+0.113
ions+0.111
 By+0.111
Neg logit contributions
 bit-0.153
 sne-0.143
 forgot-0.141
 remembered-0.140
 ate-0.137
Token
Token ID26391
Feature activation+6.06e-02

Pos logit contributions
and+0.007
 If+0.007
 This+0.007
uts+0.006
".+0.006
Neg logit contributions
 forgot-0.006
 remembered-0.006
 sne-0.005
 bit-0.005
 blew-0.005
Token
Token ID8151
Feature activation+1.70e-01

Pos logit contributions
 If+0.105
ion+0.104
ions+0.102
fulness+0.095
and+0.092
Neg logit contributions
 forgot-0.115
 remembered-0.102
 sne-0.099
 chased-0.097
 saw-0.096
Tokens
Token ID82
Feature activation+5.56e-02

Pos logit contributions
ment+0.251
ures+0.239
 This+0.234
 If+0.233
 What+0.229
Neg logit contributions
 forgot-0.244
 hurt-0.241
 sc-0.236
 bit-0.235
 spoiled-0.224
Token prices
Token ID4536
Feature activation+9.38e-02

Pos logit contributions
 This+0.086
 If+0.082
ays+0.079
and+0.078
'.+0.077
Neg logit contributions
 bit-0.098
 wish-0.097
 plan-0.096
 hurt-0.095
 sne-0.094
Token,
Token ID11
Feature activation+5.83e-02

Pos logit contributions
 This+0.132
 If+0.126
 Today+0.124
and+0.111
 That+0.110
Neg logit contributions
 bit-0.192
 forgot-0.184
 remembered-0.180
 kept-0.179
 left-0.179
Token because
Token ID780
Feature activation+4.34e-02

Pos logit contributions
 This+0.096
and+0.094
'.+0.094
 If+0.093
.'+0.093
Neg logit contributions
 forgot-0.103
 bit-0.102
 sne-0.101
 remembered-0.100
 chased-0.094
Token everyone
Token ID2506
Feature activation-3.21e-02

Pos logit contributions
and+0.072
.'+0.070
'.+0.069
 This+0.069
 If+0.067
Neg logit contributions
 forgot-0.074
 remembered-0.074
 blew-0.073
 sne-0.073
 bit-0.071
Tokent
Token ID83
Feature activation+7.02e-03

Pos logit contributions
 itself+0.185
 If+0.179
 Then+0.177
 For+0.177
 This+0.177
Neg logit contributions
 forgot-0.123
 bit-0.110
 dropped-0.109
 hurt-0.106
 ate-0.103
Token willing
Token ID4684
Feature activation+2.93e-02

Pos logit contributions
 This+0.015
 If+0.014
'.+0.014
and+0.014
.'+0.014
Neg logit contributions
 remembered-0.010
 forgot-0.010
 dropped-0.009
 sne-0.009
 hurt-0.009
Token to
Token ID284
Feature activation-1.90e-02

Pos logit contributions
 This+0.042
 If+0.039
'.+0.037
 That+0.036
 Today+0.036
Neg logit contributions
 forgot-0.065
 remembered-0.064
 bit-0.061
 dropped-0.061
 lost-0.058
Token share
Token ID2648
Feature activation+6.76e-02

Pos logit contributions
 forgot+0.020
 remembered+0.019
 sne+0.018
 blew+0.018
 wish+0.017
Neg logit contributions
 This-0.045
 If-0.045
and-0.043
'.-0.043
.'-0.042
Token.
Token ID13
Feature activation+7.12e-02

Pos logit contributions
 This+0.123
 By+0.121
 Today+0.119
 Eventually+0.115
anes+0.114
Neg logit contributions
 pin-0.104
 comb-0.102
 cut-0.100
 bit-0.099
 broken-0.099
Token
Token ID220
Feature activation+1.70e-01

Pos logit contributions
fulness+0.123
overs+0.100
rier+0.099
reating+0.098
.'+0.098
Neg logit contributions
 saw-0.129
 forgot-0.121
 remembered-0.121
 tried-0.114
 felt-0.114
Token
Token ID198
Feature activation+8.02e-02

Pos logit contributions
keeping+0.304
ados+0.299
overs+0.293
rs+0.285
umm+0.282
Neg logit contributions
 saw-0.189
 also-0.172
 felt-0.172
 forgot-0.171
 did-0.164
Token
Token ID198
Feature activation+8.00e-03

Pos logit contributions
unes+0.111
overs+0.108
 anymore+0.108
ment+0.105
ados+0.105
Neg logit contributions
 saw-0.147
 remembered-0.140
 forgot-0.136
lly-0.135
 knew-0.135
TokenJune
Token ID15749
Feature activation-5.03e-02

Pos logit contributions
 This+0.015
 If+0.014
.'+0.013
".+0.013
'.+0.013
Neg logit contributions
 remembered-0.013
 forgot-0.013
 blew-0.012
 chased-0.012
 dove-0.012
Token pulled
Token ID5954
Feature activation+1.87e-02

Pos logit contributions
 forgot+0.060
 wish+0.059
 remembered+0.057
 blew+0.055
 sne+0.055
Neg logit contributions
 This-0.105
 If-0.103
'.-0.102
".-0.098
-0.098
Token the
Token ID262
Feature activation-4.24e-02

Pos logit contributions
'.+0.034
uts+0.034
.'+0.034
".+0.034
and+0.034
Neg logit contributions
 sne-0.029
 pin-0.029
 bit-0.028
 cut-0.027
 tug-0.027
Token but
Token ID475
Feature activation+2.40e-03

Pos logit contributions
 If+0.089
 This+0.086
and+0.083
 Everyone+0.076
 By+0.075
Neg logit contributions
 bit-0.126
 remembered-0.121
 ate-0.118
 forgot-0.117
 cut-0.116
Token she
Token ID673
Feature activation-6.21e-02

Pos logit contributions
and+0.004
 This+0.004
 If+0.003
'.+0.003
.'+0.003
Neg logit contributions
 forgot-0.005
 remembered-0.005
 blew-0.005
 sne-0.005
 bit-0.004
Token said
Token ID531
Feature activation+3.11e-02

Pos logit contributions
 blew+0.062
 wish+0.062
 buzz+0.061
 forgot+0.059
 bit+0.058
Neg logit contributions
-0.129
 If-0.127
 This-0.127
'.-0.125
".-0.122
Token no
Token ID645
Feature activation+3.65e-02

Pos logit contributions
and+0.065
 If+0.061
.'+0.059
'.+0.058
 This+0.058
Neg logit contributions
 remembered-0.047
 forgot-0.047
 sne-0.045
 dropped-0.044
 ate-0.043
Token.
Token ID13
Feature activation+8.10e-02

Pos logit contributions
 If+0.049
 This+0.046
and+0.040
ions+0.040
lor+0.040
Neg logit contributions
 sne-0.081
 bit-0.077
 wish-0.075
 forgot-0.075
 cut-0.074
Token
Token ID220
Feature activation+1.70e-01

Pos logit contributions
fulness+0.113
smart+0.109
ions+0.108
eway+0.106
and+0.103
Neg logit contributions
 sne-0.142
 went-0.142
 looked-0.141
 saw-0.141
 spoiled-0.140
Token
Token ID198
Feature activation+7.67e-02

Pos logit contributions
ados+0.316
keeping+0.307
rs+0.296
ting+0.288
umm+0.287
Neg logit contributions
 saw-0.165
 also-0.164
 felt-0.156
 lost-0.140
 knew-0.137
TokenM
Token ID44
Feature activation-1.37e-02

Pos logit contributions
ados+0.100
anks+0.097
fulness+0.096
ions+0.094
 anymore+0.093
Neg logit contributions
 saw-0.136
 blew-0.131
 spoiled-0.126
 dropped-0.125
 picked-0.124
Tokenum
Token ID388
Feature activation-7.03e-02

Pos logit contributions
 forgot+0.028
 remembered+0.028
 wish+0.027
 sne+0.027
 blew+0.027
Neg logit contributions
 This-0.018
 If-0.018
'.-0.017
and-0.017
".-0.016
Token said
Token ID531
Feature activation+1.09e-02

Pos logit contributions
 wish+0.086
 forgot+0.078
 buzz+0.070
 dove+0.069
 remembered+0.069
Neg logit contributions
-0.140
 If-0.139
 This-0.139
'.-0.136
 She-0.134
Token that
Token ID326
Feature activation+1.14e-02

Pos logit contributions
and+0.016
 If+0.016
 This+0.015
'.+0.015
.'+0.015
Neg logit contributions
 remembered-0.022
 forgot-0.022
 sne-0.022
 dropped-0.021
 blew-0.021
Token feel
Token ID1254
Feature activation-5.18e-02

Pos logit contributions
 forgot+0.056
 remembered+0.056
 blew+0.052
 dove+0.050
 bit+0.049
Neg logit contributions
 This-0.090
 If-0.088
'.-0.086
".-0.085
and-0.084
Token better
Token ID1365
Feature activation+3.07e-02

Pos logit contributions
 forgot+0.081
 remembered+0.079
 dove+0.076
 blew+0.076
 sne+0.075
Neg logit contributions
 If-0.092
 This-0.089
'.-0.088
".-0.088
-0.087
Token.
Token ID13
Feature activation+2.24e-02

Pos logit contributions
 This+0.044
 If+0.043
 Everywhere+0.039
 These+0.037
anes+0.037
Neg logit contributions
 forgot-0.068
 bit-0.068
 remembered-0.067
 hurt-0.065
 sne-0.062
Token Lily
Token ID20037
Feature activation-1.28e-01

Pos logit contributions
and+0.027
'.+0.026
.'+0.026
fulness+0.025
".+0.025
Neg logit contributions
 forgot-0.048
 remembered-0.047
 ate-0.045
 saw-0.045
 blew-0.045
Token felt
Token ID2936
Feature activation-8.20e-02

Pos logit contributions
 flood+0.221
 holy+0.204
Uh+0.204
 spe+0.203
 thrill+0.195
Neg logit contributions
-0.232
.-0.193
 She-0.193
 So-0.177
 -0.175
Token a
Token ID257
Feature activation-1.53e-01

Pos logit contributions
 forgot+0.140
 blew+0.139
 dove+0.135
 buzz+0.133
 remembered+0.131
Neg logit contributions
-0.140
 If-0.138
 This-0.132
 She-0.127
'.-0.121
Token little
Token ID1310
Feature activation-1.20e-01

Pos logit contributions
 swung+0.235
 chased+0.234
 met+0.233
 forgot+0.228
 swoop+0.228
Neg logit contributions
-0.259
.-0.230
'.-0.221
 This-0.221
".-0.218
Token bit
Token ID1643
Feature activation-7.18e-02

Pos logit contributions
lly+0.239
ep+0.235
 forgot+0.230
 blew+0.220
 swung+0.219
Neg logit contributions
-0.168
 If-0.164
.-0.160
 She-0.157
 That-0.153
Token better
Token ID1365
Feature activation-1.79e-03

Pos logit contributions
 forgot+0.114
 remembered+0.110
 blew+0.108
 dove+0.106
 chased+0.103
Neg logit contributions
 If-0.132
-0.121
 This-0.120
 She-0.117
'.-0.117
Token but
Token ID475
Feature activation-7.91e-02

Pos logit contributions
 remembered+0.004
 forgot+0.004
 bit+0.004
 sne+0.004
 blew+0.004
Neg logit contributions
 This-0.003
 If-0.003
and-0.002
'.-0.002
.'-0.002
Token still
Token ID991
Feature activation-3.62e-02

Pos logit contributions
 swoop+0.128
 buzz+0.126
 sne+0.126
 bit+0.124
 wish+0.121
Neg logit contributions
-0.129
 She-0.122
 That-0.117
 This-0.117
 If-0.116
Token."
Token ID526
Feature activation+3.46e-02

Pos logit contributions
 If+0.010
and+0.010
 This+0.009
'.+0.009
.'+0.009
Neg logit contributions
 bit-0.024
 forgot-0.023
 sne-0.022
 blew-0.022
 ate-0.022
Token
Token ID198
Feature activation+3.90e-02

Pos logit contributions
fulness+0.063
.'+0.062
rier+0.061
'.+0.060
".+0.057
Neg logit contributions
 ate-0.058
 played-0.057
 saw-0.055
 threw-0.054
 forgot-0.053
Token
Token ID198
Feature activation-3.57e-02

Pos logit contributions
'.+0.038
.'+0.036
 If+0.036
rier+0.034
overs+0.033
Neg logit contributions
 saw-0.089
 forgot-0.084
 ate-0.083
 knew-0.082
 remembered-0.082
TokenTim
Token ID14967
Feature activation-9.15e-02

Pos logit contributions
 forgot+0.072
 remembered+0.071
 wish+0.068
 sne+0.068
 blew+0.066
Neg logit contributions
 This-0.052
 If-0.050
'.-0.048
and-0.047
".-0.047
Token and
Token ID290
Feature activation-3.35e-02

Pos logit contributions
 wish+0.148
 flood+0.144
inge+0.139
Uh+0.139
 groom+0.139
Neg logit contributions
-0.156
.-0.145
 She-0.145
 They-0.140
 This-0.136
Token Lily
Token ID20037
Feature activation-1.52e-01

Pos logit contributions
 forgot+0.085
 remembered+0.084
 wish+0.082
 blew+0.081
 sne+0.081
Neg logit contributions
 This-0.030
 If-0.029
'.-0.025
and-0.024
".-0.024
Token looked
Token ID3114
Feature activation-8.00e-02

Pos logit contributions
 cruel+0.249
 painful+0.248
 crow+0.247
 annoying+0.246
 eagle+0.243
Neg logit contributions
 They-0.251
 She-0.245
.-0.236
-0.232
 So-0.211
Token around
Token ID1088
Feature activation+7.45e-03

Pos logit contributions
 forgot+0.211
 blew+0.210
 groom+0.205
Ta+0.202
uch+0.201
Neg logit contributions
.-0.056
-0.050
 If-0.048
 They-0.048
 This-0.046
Token the
Token ID262
Feature activation-4.73e-02

Pos logit contributions
 If+0.012
uts+0.012
'.+0.012
 This+0.012
".+0.012
Neg logit contributions
 saw-0.013
 sne-0.013
 remembered-0.013
 bit-0.013
 sw-0.012
Token movie
Token ID3807
Feature activation+2.11e-02

Pos logit contributions
 forgot+0.075
 remembered+0.074
 blew+0.068
 dropped+0.065
 sne+0.065
Neg logit contributions
 This-0.094
 If-0.088
-0.087
and-0.085
".-0.083
Token place
Token ID1295
Feature activation+3.63e-02

Pos logit contributions
 If+0.033
'.+0.033
.'+0.032
".+0.031
 This+0.031
Neg logit contributions
 forgot-0.039
 saw-0.038
 bit-0.038
 ate-0.037
 remembered-0.037
Token was
Token ID373
Feature activation-2.89e-02

Pos logit contributions
 remembered+0.081
 forgot+0.081
 sne+0.078
 dove+0.074
 boo+0.074
Neg logit contributions
".-0.133
 This-0.133
'.-0.130
-0.130
.-0.129
Token so
Token ID523
Feature activation-4.45e-02

Pos logit contributions
 forgot+0.038
 remembered+0.037
 blew+0.035
 sne+0.035
 bit+0.034
Neg logit contributions
 This-0.060
 If-0.060
'.-0.058
".-0.057
and-0.057
Token amazing
Token ID4998
Feature activation+6.14e-02

Pos logit contributions
 remembered+0.045
 forgot+0.044
 sne+0.040
 bit+0.040
 blew+0.039
Neg logit contributions
 This-0.105
 If-0.103
".-0.100
-0.099
'.-0.098
Token!
Token ID0
Feature activation+6.20e-02

Pos logit contributions
 If+0.112
and+0.105
 Today+0.103
 This+0.097
umbs+0.094
Neg logit contributions
 sw-0.101
 fl-0.099
 bit-0.097
 dove-0.097
 made-0.096
Token Everywhere
Token ID48201
Feature activation-3.07e-02

Pos logit contributions
fulness+0.101
and+0.097
ings+0.095
ancing+0.094
arling+0.094
Neg logit contributions
 saw-0.100
 looked-0.096
 dove-0.094
 gazed-0.091
 sw-0.091
Token it
Token ID340
Feature activation-1.51e-01

Pos logit contributions
 forgot+0.064
 remembered+0.063
 sne+0.061
 blew+0.061
 wish+0.060
Neg logit contributions
 This-0.041
 If-0.039
'.-0.038
".-0.036
.'-0.036
Token sang
Token ID25889
Feature activation+2.54e-02

Pos logit contributions
inge+0.211
ige+0.206
 argument+0.203
 stolen+0.203
 punished+0.202
Neg logit contributions
-0.256
.-0.245
 She-0.217
 They-0.203
 whenever-0.203
Token,
Token ID11
Feature activation-8.79e-04

Pos logit contributions
 If+0.043
 This+0.042
'.+0.042
.'+0.041
uts+0.040
Neg logit contributions
 remembered-0.043
 sw-0.042
 buzz-0.042
 saw-0.042
 fl-0.041
Token it
Token ID340
Feature activation-8.05e-02

Pos logit contributions
 forgot+0.002
 sne+0.002
 remembered+0.002
 blew+0.002
 dove+0.002
Neg logit contributions
'.-0.001
 If-0.001
 This-0.001
.'-0.001
and-0.001
Token made
Token ID925
Feature activation-6.37e-03

Pos logit contributions
 wish+0.066
 forgot+0.064
 remembered+0.063
 scrape+0.062
 boo+0.062
Neg logit contributions
-0.180
.-0.176
".-0.175
 This-0.174
 If-0.170
Token people
Token ID661
Feature activation-4.22e-02

Pos logit contributions
 sne+0.011
 forgot+0.011
 wish+0.011
 remembered+0.011
 blew+0.011
Neg logit contributions
 This-0.010
'.-0.010
 If-0.010
.'-0.010
and-0.010
Token near
Token ID1474
Feature activation-7.54e-03

Pos logit contributions
 forgot+0.039
 remembered+0.038
 sne+0.037
 blew+0.037
 bit+0.036
Neg logit contributions
 This-0.036
 If-0.036
'.-0.035
and-0.033
".-0.033
Token the
Token ID262
Feature activation-3.55e-02

Pos logit contributions
 sne+0.013
 remembered+0.013
 wish+0.012
 forgot+0.012
 sw+0.012
Neg logit contributions
 This-0.013
'.-0.013
 If-0.012
.'-0.012
and-0.012
Token clouds
Token ID15114
Feature activation+1.15e-02

Pos logit contributions
 forgot+0.054
 remembered+0.052
 blew+0.049
 sne+0.048
 bit+0.047
Neg logit contributions
 This-0.068
 If-0.067
'.-0.065
".-0.064
.'-0.063
Token.
Token ID13
Feature activation+4.55e-02

Pos logit contributions
 If+0.016
 This+0.015
'.+0.014
and+0.014
 That+0.013
Neg logit contributions
 blew-0.023
 sne-0.023
 bit-0.023
 sw-0.023
 remembered-0.023
Token Everywhere
Token ID48201
Feature activation-3.16e-02

Pos logit contributions
keeping+0.080
ula+0.075
uts+0.074
and+0.073
ings+0.073
Neg logit contributions
 saw-0.072
 ch-0.068
 dove-0.068
 dream-0.067
 looked-0.067
Token it
Token ID340
Feature activation-1.50e-01

Pos logit contributions
 forgot+0.068
 remembered+0.066
 sne+0.064
 blew+0.064
 wish+0.063
Neg logit contributions
 This-0.040
 If-0.039
'.-0.037
".-0.036
.'-0.035
Token flew
Token ID13112
Feature activation-2.49e-03

Pos logit contributions
inge+0.208
 stolen+0.204
 boo+0.203
 punished+0.203
ige+0.201
Neg logit contributions
-0.265
.-0.245
 She-0.212
."-0.207
 whenever-0.205
Token,
Token ID11
Feature activation-9.53e-03

Pos logit contributions
 forgot+0.005
 remembered+0.005
 blew+0.005
 sw+0.005
 saw+0.005
Neg logit contributions
and-0.003
'.-0.003
 If-0.003
 This-0.003
.'-0.003
Token it
Token ID340
Feature activation-1.03e-01

Pos logit contributions
 forgot+0.018
 remembered+0.017
 sne+0.017
 blew+0.017
 dove+0.017
Neg logit contributions
 This-0.015
 If-0.015
'.-0.015
.'-0.014
and-0.014
Token left
Token ID1364
Feature activation+2.56e-02

Pos logit contributions
 wish+0.108
 boo+0.101
Ta+0.099
 spit+0.097
 scrape+0.096
Neg logit contributions
-0.217
.-0.204
".-0.191
."-0.190
 She-0.188
Token a
Token ID257
Feature activation-7.59e-02

Pos logit contributions
 This+0.045
'.+0.044
uts+0.043
 Today+0.042
 If+0.042
Neg logit contributions
 sw-0.044
 remembered-0.043
 saw-0.043
 bit-0.042
 forgot-0.040
Token around
Token ID1088
Feature activation+3.79e-02

Pos logit contributions
 This+0.028
 If+0.027
and+0.027
'.+0.026
uts+0.026
Neg logit contributions
 buzz-0.027
 sw-0.027
 sne-0.026
 bit-0.026
 remembered-0.026
Token the
Token ID262
Feature activation-4.75e-02

Pos logit contributions
uts+0.089
 If+0.082
and+0.080
 This+0.075
umbs+0.075
Neg logit contributions
 sw-0.051
 sne-0.050
 buzz-0.049
 saw-0.048
 ch-0.047
Token world
Token ID995
Feature activation+4.51e-02

Pos logit contributions
 forgot+0.073
 remembered+0.072
 blew+0.067
 sne+0.065
 ate+0.064
Neg logit contributions
 This-0.088
 If-0.086
".-0.086
'.-0.086
.'-0.084
Token.
Token ID13
Feature activation+4.62e-02

Pos logit contributions
 If+0.063
 This+0.057
'.+0.054
 Today+0.052
 That+0.051
Neg logit contributions
 bit-0.094
 buzz-0.091
 spark-0.089
 ate-0.089
 saw-0.089
Token Everywhere
Token ID48201
Feature activation-1.69e-02

Pos logit contributions
ings+0.066
fulness+0.063
.'+0.063
 anymore+0.061
ently+0.060
Neg logit contributions
 saw-0.078
 buzz-0.078
 ate-0.077
 tried-0.077
 sang-0.075
Token it
Token ID340
Feature activation-1.50e-01

Pos logit contributions
 forgot+0.035
 remembered+0.034
 blew+0.033
 sne+0.032
 bit+0.032
Neg logit contributions
 This-0.023
 If-0.023
and-0.023
'.-0.022
.'-0.021
Token went
Token ID1816
Feature activation+1.89e-02

Pos logit contributions
 boo+0.195
 charged+0.187
inge+0.187
 scrape+0.184
 punished+0.181
Neg logit contributions
-0.273
.-0.257
 She-0.250
 before-0.221
."-0.218
Token,
Token ID11
Feature activation+7.64e-03

Pos logit contributions
and+0.033
 If+0.032
uts+0.031
'.+0.030
.'+0.030
Neg logit contributions
 saw-0.033
 sw-0.032
 flew-0.031
 dove-0.030
 remembered-0.030
Token it
Token ID340
Feature activation-9.28e-02

Pos logit contributions
 If+0.014
'.+0.014
 This+0.014
.'+0.013
and+0.013
Neg logit contributions
 forgot-0.013
 sne-0.012
 dove-0.012
 remembered-0.012
 blew-0.012
Token made
Token ID925
Feature activation-4.43e-03

Pos logit contributions
 wish+0.082
 boo+0.079
 tug+0.075
 scrape+0.073
 sne+0.072
Neg logit contributions
-0.205
.-0.198
".-0.196
 This-0.193
 She-0.193
Token people
Token ID661
Feature activation-3.45e-02

Pos logit contributions
 sne+0.008
 forgot+0.007
 wish+0.007
 remembered+0.007
 buzz+0.007
Neg logit contributions
 This-0.008
'.-0.007
 If-0.007
.'-0.007
and-0.007
Token be
Token ID307
Feature activation-1.87e-02

Pos logit contributions
 forgot+0.049
 remembered+0.049
 blew+0.045
 dove+0.045
 bit+0.044
Neg logit contributions
 This-0.091
 If-0.089
'.-0.087
".-0.087
-0.085
Token okay
Token ID8788
Feature activation+4.62e-02

Pos logit contributions
 forgot+0.035
 remembered+0.035
 blew+0.033
 sne+0.033
 bit+0.032
Neg logit contributions
 This-0.030
 If-0.029
'.-0.028
and-0.027
".-0.027
Token.
Token ID13
Feature activation+1.70e-02

Pos logit contributions
 This+0.073
anes+0.070
 Everyone+0.068
 If+0.067
ays+0.066
Neg logit contributions
 remembered-0.093
 forgot-0.090
 sne-0.086
 bit-0.085
 lost-0.085
Token Lily
Token ID20037
Feature activation-1.24e-01

Pos logit contributions
'.+0.024
.'+0.024
and+0.024
".+0.023
uts+0.022
Neg logit contributions
 forgot-0.034
 ate-0.032
 remembered-0.032
 bit-0.031
 saw-0.031
Token felt
Token ID2936
Feature activation-7.60e-02

Pos logit contributions
 flood+0.190
Uh+0.174
 ferry+0.169
 holy+0.167
 trace+0.162
Neg logit contributions
-0.239
.-0.211
 She-0.205
 So-0.190
 "-0.189
Token a
Token ID257
Feature activation-1.48e-01

Pos logit contributions
 blew+0.132
 forgot+0.130
 remembered+0.125
 dove+0.124
 buzz+0.124
Neg logit contributions
-0.128
 If-0.128
 This-0.121
 She-0.117
 That-0.110
Token little
Token ID1310
Feature activation-1.20e-01

Pos logit contributions
 swung+0.239
 met+0.239
 swoop+0.236
 groom+0.235
 chased+0.235
Neg logit contributions
-0.236
.-0.208
 And-0.201
 This-0.199
'.-0.195
Token weak
Token ID4939
Feature activation-4.18e-02

Pos logit contributions
lly+0.221
ep+0.221
 forgot+0.213
 blew+0.205
lf+0.205
Neg logit contributions
-0.180
 If-0.179
.-0.173
 She-0.171
 That-0.166
Token because
Token ID780
Feature activation-1.53e-02

Pos logit contributions
 forgot+0.073
 remembered+0.072
 blew+0.068
 ate+0.066
 wish+0.066
Neg logit contributions
 This-0.068
 If-0.067
'.-0.064
".-0.063
and-0.063
Token she
Token ID673
Feature activation-1.19e-01

Pos logit contributions
 forgot+0.033
 remembered+0.033
 blew+0.031
 sne+0.031
 wish+0.031
Neg logit contributions
 This-0.020
 If-0.019
'.-0.019
.'-0.018
".-0.018
Token was
Token ID373
Feature activation-4.93e-02

Pos logit contributions
 ferry+0.194
 groom+0.193
 island+0.183
 flood+0.180
 flag+0.179
Neg logit contributions
-0.211
 She-0.192
 That-0.189
.-0.182
 It-0.180
Token happy
Token ID3772
Feature activation+1.83e-02

Pos logit contributions
 forgot+0.064
 remembered+0.064
 bit+0.058
 sne+0.058
 blew+0.055
Neg logit contributions
-0.112
 This-0.112
 If-0.110
 She-0.105
and-0.105
Token and
Token ID290
Feature activation-7.24e-02

Pos logit contributions
 If+0.033
'.+0.030
 This+0.029
.'+0.028
 Today+0.027
Neg logit contributions
 saw-0.032
 forgot-0.032
 remembered-0.032
 dove-0.031
 bit-0.031
Token free
Token ID1479
Feature activation+4.08e-02

Pos logit contributions
 wish+0.074
 sne+0.071
ited+0.070
 sadd+0.069
 remembered+0.066
Neg logit contributions
-0.157
 This-0.156
 She-0.153
".-0.150
 If-0.149
Token,
Token ID11
Feature activation+7.66e-03

Pos logit contributions
 If+0.062
 Today+0.057
 This+0.057
and+0.056
'.+0.054
Neg logit contributions
 sw-0.081
 bit-0.079
 flew-0.078
 buzz-0.078
 ch-0.078
Token and
Token ID290
Feature activation-4.75e-02

Pos logit contributions
'.+0.012
 If+0.012
 This+0.011
.'+0.011
and+0.011
Neg logit contributions
 forgot-0.015
 sne-0.015
 blew-0.015
 remembered-0.015
 bit-0.014
Token it
Token ID340
Feature activation-1.47e-01

Pos logit contributions
 wish+0.114
 remembered+0.114
 forgot+0.113
 sne+0.112
 blew+0.111
Neg logit contributions
 This-0.039
 If-0.038
-0.034
 She-0.033
".-0.033
Token made
Token ID925
Feature activation-2.30e-02

Pos logit contributions
wy+0.194
affle+0.192
Ta+0.191
 promise+0.189
prise+0.184
Neg logit contributions
-0.278
.-0.250
 "-0.234
 She-0.225
 with-0.220
Token Lily
Token ID20037
Feature activation-4.39e-02

Pos logit contributions
 forgot+0.049
 remembered+0.049
 sne+0.047
 blew+0.047
 wish+0.046
Neg logit contributions
 This-0.029
 If-0.029
'.-0.028
".-0.026
and-0.026
Token smile
Token ID8212
Feature activation+1.20e-02

Pos logit contributions
 forgot+0.049
 remembered+0.048
 blew+0.045
 bit+0.042
 sne+0.042
Neg logit contributions
 This-0.100
 If-0.098
and-0.096
'.-0.095
".-0.095
Token.
Token ID13
Feature activation+3.64e-02

Pos logit contributions
 If+0.013
 This+0.012
and+0.011
'.+0.010
ion+0.010
Neg logit contributions
 remembered-0.030
 sne-0.029
 forgot-0.029
 bit-0.028
 blew-0.028
Token
Token ID198
Feature activation+3.37e-02

Pos logit contributions
ions+0.046
fulness+0.045
'.+0.042
.'+0.042
and+0.041
Neg logit contributions
 forgot-0.079
 saw-0.079
 ate-0.074
 remembered-0.073
 felt-0.073
Token!"
Token ID2474
Feature activation+2.69e-02

Pos logit contributions
 This+0.024
 If+0.023
'.+0.022
 That+0.021
and+0.021
Neg logit contributions
 forgot-0.043
 remembered-0.042
 hurt-0.041
 bit-0.041
 buried-0.041
Token said
Token ID531
Feature activation-4.79e-03

Pos logit contributions
'.+0.030
.'+0.030
 If+0.028
ions+0.027
fulness+0.026
Neg logit contributions
 forgot-0.061
 saw-0.061
 ate-0.060
 remembered-0.059
 felt-0.059
Token Lily
Token ID20037
Feature activation-2.76e-02

Pos logit contributions
 remembered+0.010
 forgot+0.010
 sne+0.010
 blew+0.010
 bit+0.010
Neg logit contributions
and-0.006
 If-0.006
'.-0.006
 This-0.006
.'-0.006
Token generously
Token ID42088
Feature activation-1.33e-02

Pos logit contributions
 forgot+0.062
 remembered+0.061
 blew+0.059
 sne+0.058
 bit+0.058
Neg logit contributions
 This-0.033
 If-0.032
'.-0.030
and-0.029
".-0.029
Token.
Token ID13
Feature activation+5.68e-02

Pos logit contributions
 forgot+0.033
 remembered+0.033
 sne+0.032
 blew+0.032
 bit+0.032
Neg logit contributions
 This-0.012
 If-0.012
'.-0.011
and-0.011
.'-0.011
Token They
Token ID1119
Feature activation-1.47e-01

Pos logit contributions
fulness+0.082
.'+0.079
ion+0.076
'.+0.075
and+0.072
Neg logit contributions
 saw-0.113
 ate-0.111
 forgot-0.110
 played-0.109
 threw-0.108
Token looked
Token ID3114
Feature activation-7.22e-02

Pos logit contributions
 crow+0.246
 eagle+0.245
 bully+0.243
 raven+0.235
 joke+0.233
Neg logit contributions
 They-0.242
-0.238
 She-0.228
.-0.226
 together-0.214
Token everywhere
Token ID8347
Feature activation+1.18e-02

Pos logit contributions
 blew+0.175
 forgot+0.173
 sne+0.171
 remembered+0.167
 rocked+0.167
Neg logit contributions
 This-0.066
 If-0.065
-0.063
 They-0.060
.-0.058
Token but
Token ID475
Feature activation-2.98e-02

Pos logit contributions
 This+0.017
 If+0.017
'.+0.017
.'+0.016
".+0.015
Neg logit contributions
 forgot-0.025
 remembered-0.024
 bit-0.023
 saw-0.023
 sne-0.022
Token couldn
Token ID3521
Feature activation+1.23e-02

Pos logit contributions
 forgot+0.050
 remembered+0.049
 sne+0.047
 blew+0.047
 wish+0.046
Neg logit contributions
 This-0.052
 If-0.051
'.-0.049
and-0.048
".-0.047
Token't
Token ID470
Feature activation-2.80e-02

Pos logit contributions
and+0.018
 If+0.017
.'+0.016
'.+0.016
 This+0.015
Neg logit contributions
 blew-0.023
 forgot-0.023
 sne-0.023
 hurt-0.023
 remembered-0.023
Token what
Token ID644
Feature activation+1.02e-03

Pos logit contributions
 This+0.047
'.+0.046
 If+0.044
.'+0.044
 Today+0.043
Neg logit contributions
 pin-0.040
 remembered-0.040
 sne-0.039
 forgot-0.039
 blew-0.039
Token it
Token ID340
Feature activation-6.13e-02

Pos logit contributions
 If+0.002
'.+0.002
.'+0.002
and+0.002
 This+0.002
Neg logit contributions
 remembered-0.002
 forgot-0.002
 pin-0.002
 blew-0.002
 sne-0.002
Token was
Token ID373
Feature activation-6.74e-03

Pos logit contributions
 remembered+0.092
 forgot+0.091
 wish+0.091
 groom+0.090
 dove+0.090
Neg logit contributions
 This-0.110
-0.106
 If-0.101
 She-0.101
 That-0.100
Token,
Token ID11
Feature activation+5.27e-03

Pos logit contributions
 forgot+0.016
 remembered+0.015
 blew+0.015
 sne+0.015
 bit+0.014
Neg logit contributions
 If-0.008
 This-0.008
'.-0.007
and-0.007
.'-0.007
Token so
Token ID523
Feature activation-4.91e-02

Pos logit contributions
'.+0.008
 This+0.008
.'+0.007
".+0.007
 If+0.007
Neg logit contributions
 forgot-0.011
 sne-0.010
 remembered-0.010
 bit-0.010
 wished-0.010
Token she
Token ID673
Feature activation-1.47e-01

Pos logit contributions
 forgot+0.124
 blew+0.122
 sne+0.122
 remembered+0.121
 bit+0.120
Neg logit contributions
 This-0.036
-0.033
 She-0.033
 If-0.032
".-0.027
Token asked
Token ID1965
Feature activation+4.05e-04

Pos logit contributions
 thrill+0.257
 merry+0.247
 flood+0.235
 connection+0.235
 happiest+0.234
Neg logit contributions
-0.206
.-0.187
 They-0.182
 She-0.180
 "-0.170
Token her
Token ID607
Feature activation-4.64e-02

Pos logit contributions
'.+0.001
 This+0.001
and+0.001
.'+0.001
uts+0.001
Neg logit contributions
 remembered-0.001
 sne-0.001
 dropped-0.001
 forgot-0.001
 saw-0.001
Token mom
Token ID1995
Feature activation+4.10e-02

Pos logit contributions
 forgot+0.087
 blew+0.083
 remembered+0.083
 rocked+0.082
 bit+0.081
Neg logit contributions
 This-0.070
 If-0.069
-0.068
".-0.064
and-0.063
Token.
Token ID13
Feature activation+3.01e-02

Pos logit contributions
uts+0.097
 This+0.090
'.+0.088
 Everyone+0.086
keeping+0.085
Neg logit contributions
 saw-0.046
 remembered-0.045
 cut-0.044
 forgot-0.039
 sne-0.039
Token
Token ID198
Feature activation+5.06e-02

Pos logit contributions
and+0.032
ions+0.028
.'+0.028
fulness+0.028
 If+0.027
Neg logit contributions
 saw-0.070
 blew-0.067
 remembered-0.067
 forgot-0.066
 cut-0.066
Token started
Token ID2067
Feature activation-8.98e-03

Pos logit contributions
 forgot+0.051
 wish+0.049
 remembered+0.049
 bit+0.045
 sne+0.045
Neg logit contributions
 This-0.134
 If-0.128
'.-0.128
-0.127
".-0.126
Token to
Token ID284
Feature activation-5.78e-02

Pos logit contributions
 sne+0.021
 forgot+0.021
 remembered+0.021
 blew+0.020
 bit+0.020
Neg logit contributions
 If-0.010
 This-0.010
'.-0.010
.'-0.009
".-0.009
Token spread
Token ID4104
Feature activation-3.65e-03

Pos logit contributions
 remembered+0.062
 forgot+0.062
 dove+0.050
 regretted+0.049
 ate+0.049
Neg logit contributions
 This-0.133
 If-0.129
'.-0.127
".-0.126
.'-0.124
Token.
Token ID13
Feature activation+4.56e-02

Pos logit contributions
 remembered+0.006
 sne+0.006
 bit+0.006
 forgot+0.006
 wish+0.005
Neg logit contributions
 This-0.007
 If-0.007
'.-0.006
and-0.006
uts-0.006
Token Everywhere
Token ID48201
Feature activation-3.01e-02

Pos logit contributions
fulness+0.072
and+0.070
ently+0.068
.'+0.068
ings+0.065
Neg logit contributions
 cut-0.078
 saw-0.077
 sne-0.076
 looked-0.076
 ate-0.075
Token it
Token ID340
Feature activation-1.46e-01

Pos logit contributions
 forgot+0.066
 remembered+0.065
 sne+0.063
 blew+0.063
 wish+0.062
Neg logit contributions
 This-0.037
 If-0.036
'.-0.034
".-0.033
.'-0.032
Token went
Token ID1816
Feature activation+4.50e-03

Pos logit contributions
ige+0.203
 traitor+0.198
 promise+0.197
inge+0.197
ited+0.196
Neg logit contributions
-0.227
.-0.217
 They-0.191
 She-0.189
 He-0.185
Token,
Token ID11
Feature activation+6.68e-03

Pos logit contributions
and+0.007
 If+0.007
'.+0.007
.'+0.006
uts+0.006
Neg logit contributions
 remembered-0.008
 saw-0.008
 forgot-0.008
 blew-0.008
 sw-0.008
Token something
Token ID1223
Feature activation-8.99e-02

Pos logit contributions
 If+0.011
 This+0.011
'.+0.011
and+0.010
.'+0.010
Neg logit contributions
 sne-0.012
 forgot-0.012
 blew-0.012
 dove-0.012
 remembered-0.011
Token happened
Token ID3022
Feature activation+5.80e-02

Pos logit contributions
ited+0.116
 remembered+0.115
 forgot+0.108
 wish+0.106
 groom+0.102
Neg logit contributions
.-0.166
-0.165
 This-0.158
and-0.158
 If-0.150
Token.
Token ID13
Feature activation+5.99e-02

Pos logit contributions
 If+0.078
 This+0.077
 Everyone+0.073
 These+0.072
 Those+0.070
Neg logit contributions
 remembered-0.125
 lost-0.121
 fl-0.121
 saw-0.121
 sw-0.118
Token a
Token ID257
Feature activation-6.80e-02

Pos logit contributions
 remembered+0.005
 forgot+0.005
 sne+0.005
 bit+0.005
 blew+0.005
Neg logit contributions
 This-0.005
 If-0.005
'.-0.004
and-0.004
.'-0.004
Token candy
Token ID18550
Feature activation+7.73e-03

Pos logit contributions
 forgot+0.092
 remembered+0.088
 wished+0.080
 chased+0.080
 blew+0.079
Neg logit contributions
 This-0.140
 If-0.134
-0.134
'.-0.130
".-0.129
Token.
Token ID13
Feature activation+4.36e-02

Pos logit contributions
 If+0.010
 This+0.009
'.+0.009
and+0.009
.'+0.008
Neg logit contributions
 bit-0.017
 pin-0.016
 forgot-0.016
 cut-0.016
 dove-0.016
Token Lily
Token ID20037
Feature activation-1.10e-01

Pos logit contributions
ions+0.075
.'+0.075
'.+0.073
ings+0.071
fulness+0.071
Neg logit contributions
 forgot-0.075
 saw-0.071
 ate-0.071
 remembered-0.065
 threw-0.065
Token felt
Token ID2936
Feature activation-6.70e-02

Pos logit contributions
 wish+0.172
 flood+0.165
Ta+0.156
 comet+0.154
 feather+0.154
Neg logit contributions
-0.190
.-0.169
 She-0.165
 They-0.157
 "-0.156
Token a
Token ID257
Feature activation-1.46e-01

Pos logit contributions
 blew+0.097
 forgot+0.095
 remembered+0.090
 dove+0.088
 buzz+0.087
Neg logit contributions
 If-0.133
-0.130
 This-0.129
 She-0.121
 That-0.117
Token little
Token ID1310
Feature activation-1.07e-01

Pos logit contributions
 met+0.232
 swung+0.229
 soared+0.227
 chased+0.225
 remembered+0.223
Neg logit contributions
-0.230
.-0.207
 This-0.199
'.-0.193
 And-0.192
Token better
Token ID1365
Feature activation-2.60e-02

Pos logit contributions
ep+0.183
lly+0.175
 blew+0.174
 forgot+0.174
 remembered+0.172
Neg logit contributions
 If-0.173
-0.171
.-0.168
 This-0.165
 That-0.162
Token and
Token ID290
Feature activation-7.73e-02

Pos logit contributions
 forgot+0.061
 remembered+0.060
 blew+0.058
 sne+0.058
 bit+0.057
Neg logit contributions
 This-0.029
 If-0.028
'.-0.026
and-0.025
".-0.025
Token went
Token ID1816
Feature activation-1.52e-02

Pos logit contributions
 wish+0.094
 feather+0.086
 buzz+0.084
 sne+0.083
ited+0.083
Neg logit contributions
-0.150
 She-0.145
 This-0.141
.-0.139
 If-0.138
Token back
Token ID736
Feature activation+1.04e-02

Pos logit contributions
 forgot+0.030
 remembered+0.029
 sne+0.028
 bit+0.028
 blew+0.028
Neg logit contributions
 If-0.022
 This-0.022
'.-0.021
.'-0.020
".-0.020
Token play
Token ID711
Feature activation-9.40e-03

Pos logit contributions
 forgot+0.079
 remembered+0.078
ited+0.076
 blew+0.076
 regretted+0.074
Neg logit contributions
 This-0.126
-0.120
 If-0.117
 She-0.115
'.-0.114
Token with
Token ID351
Feature activation-1.00e-02

Pos logit contributions
 forgot+0.020
 remembered+0.020
 sne+0.020
 buzz+0.020
 blew+0.020
Neg logit contributions
 If-0.011
 This-0.011
'.-0.011
.'-0.010
and-0.010
Token it
Token ID340
Feature activation+3.78e-03

Pos logit contributions
 forgot+0.019
 remembered+0.019
 wish+0.018
 dropped+0.018
 sne+0.018
Neg logit contributions
 This-0.015
'.-0.015
 If-0.015
.'-0.014
and-0.014
Token.
Token ID13
Feature activation+2.73e-02

Pos logit contributions
 This+0.004
 If+0.004
'.+0.004
.'+0.004
and+0.004
Neg logit contributions
 forgot-0.009
 bit-0.009
 remembered-0.009
 fl-0.008
 lost-0.008
Token Everywhere
Token ID48201
Feature activation-3.32e-02

Pos logit contributions
.'+0.028
ion+0.028
ions+0.028
'.+0.028
".+0.026
Neg logit contributions
 saw-0.064
 forgot-0.062
 remembered-0.060
 observed-0.060
 looked-0.058
Token she
Token ID673
Feature activation-1.45e-01

Pos logit contributions
 forgot+0.079
 remembered+0.077
 sne+0.076
 blew+0.075
 wish+0.075
Neg logit contributions
 This-0.034
 If-0.033
'.-0.031
".-0.030
.'-0.029
Token went
Token ID1816
Feature activation-3.30e-03

Pos logit contributions
 promise+0.225
 wish+0.224
inge+0.219
cod+0.214
 cruel+0.211
Neg logit contributions
 She-0.228
-0.212
 He-0.212
.-0.206
 They-0.204
Token,
Token ID11
Feature activation-5.99e-03

Pos logit contributions
 forgot+0.007
 remembered+0.007
 saw+0.006
 sne+0.006
 blew+0.006
Neg logit contributions
 If-0.004
'.-0.004
.'-0.004
and-0.004
".-0.004
Token she
Token ID673
Feature activation-1.07e-01

Pos logit contributions
 forgot+0.013
 remembered+0.012
 sne+0.012
 blew+0.012
 saw+0.012
Neg logit contributions
 If-0.008
'.-0.008
 This-0.008
.'-0.008
".-0.008
Token felt
Token ID2936
Feature activation-3.31e-02

Pos logit contributions
 wish+0.130
Ta+0.110
 promise+0.110
Uh+0.107
 spit+0.104
Neg logit contributions
-0.211
.-0.197
 She-0.193
and-0.185
 "-0.180
Token the
Token ID262
Feature activation-5.81e-02

Pos logit contributions
 forgot+0.045
 remembered+0.044
 blew+0.041
 sne+0.041
 bit+0.040
Neg logit contributions
 This-0.068
 If-0.068
'.-0.065
and-0.064
".-0.064
Token cat
Token ID3797
Feature activation+1.75e-02

Pos logit contributions
 forgot+0.043
 remembered+0.041
 blew+0.038
 sne+0.036
 wish+0.035
Neg logit contributions
 This-0.091
 If-0.089
'.-0.087
".-0.086
.'-0.085
Token named
Token ID3706
Feature activation+2.05e-02

Pos logit contributions
 If+0.048
and+0.045
 This+0.045
'.+0.044
.'+0.044
Neg logit contributions
 sne-0.012
 sw-0.012
 ate-0.012
 remembered-0.011
 bit-0.011
Token Jazz
Token ID21406
Feature activation+4.01e-02

Pos logit contributions
and+0.033
 If+0.031
but+0.030
.'+0.030
fulness+0.030
Neg logit contributions
 sne-0.038
 dove-0.036
 buzz-0.035
 cut-0.035
 forgot-0.035
Token.
Token ID13
Feature activation+2.16e-03

Pos logit contributions
 If+0.056
 This+0.048
 By+0.046
 Today+0.046
 For+0.045
Neg logit contributions
 saw-0.084
 forgot-0.084
 sne-0.083
 sw-0.082
 fl-0.082
Token Everywhere
Token ID48201
Feature activation-2.13e-02

Pos logit contributions
'.+0.003
.'+0.003
 If+0.003
and+0.002
".+0.002
Neg logit contributions
 remembered-0.005
 forgot-0.005
 sne-0.004
 blew-0.004
 saw-0.004
Token she
Token ID673
Feature activation-1.45e-01

Pos logit contributions
 forgot+0.045
 remembered+0.044
 blew+0.042
 sne+0.042
 wish+0.042
Neg logit contributions
 This-0.028
 If-0.028
'.-0.027
and-0.026
.'-0.025
Token went
Token ID1816
Feature activation+9.35e-03

Pos logit contributions
inge+0.250
 promise+0.238
 forget+0.213
 argument+0.211
 disgusted+0.211
Neg logit contributions
.-0.227
-0.221
 He-0.214
 She-0.213
 They-0.210
Token,
Token ID11
Feature activation+8.42e-03

Pos logit contributions
'.+0.014
.'+0.014
 If+0.013
and+0.013
".+0.013
Neg logit contributions
 remembered-0.017
 forgot-0.017
 saw-0.017
 sne-0.017
 dove-0.017
Token she
Token ID673
Feature activation-9.97e-02

Pos logit contributions
'.+0.013
.'+0.013
 If+0.013
".+0.012
 This+0.012
Neg logit contributions
 forgot-0.016
 remembered-0.016
 sne-0.016
 saw-0.015
 wish-0.015
Token was
Token ID373
Feature activation-4.56e-02

Pos logit contributions
Uh+0.114
 wish+0.113
inge+0.111
Ta+0.109
 spit+0.109
Neg logit contributions
-0.198
.-0.187
 She-0.182
 This-0.181
 They-0.175
Token always
Token ID1464
Feature activation-1.81e-02

Pos logit contributions
 forgot+0.065
 wish+0.061
 remembered+0.061
 blew+0.060
 bit+0.059
Neg logit contributions
 This-0.089
 If-0.087
-0.082
and-0.081
".-0.081
Token that
Token ID326
Feature activation-2.73e-02

Pos logit contributions
 This+0.005
 If+0.005
'.+0.005
and+0.005
.'+0.005
Neg logit contributions
 remembered-0.006
 forgot-0.006
 sne-0.006
 wish-0.005
 bit-0.005
Token little
Token ID1310
Feature activation-3.05e-02

Pos logit contributions
 forgot+0.037
 remembered+0.037
 blew+0.034
 sne+0.034
 bit+0.034
Neg logit contributions
 This-0.056
 If-0.055
'.-0.054
and-0.052
".-0.052
Token seed
Token ID9403
Feature activation+9.23e-03

Pos logit contributions
 forgot+0.041
 remembered+0.039
 blew+0.037
 sne+0.037
 wish+0.036
Neg logit contributions
 This-0.063
 If-0.063
'.-0.061
".-0.060
and-0.059
Token.
Token ID13
Feature activation+6.73e-02

Pos logit contributions
 If+0.018
 This+0.018
and+0.017
'.+0.017
.'+0.016
Neg logit contributions
 forgot-0.014
 remembered-0.014
 blew-0.014
 bit-0.013
 sne-0.013
Token Everywhere
Token ID48201
Feature activation-1.53e-02

Pos logit contributions
uts+0.082
.'+0.074
ings+0.074
fulness+0.073
 anymore+0.072
Neg logit contributions
 planted-0.110
 wish-0.107
 cut-0.107
 believed-0.106
 ate-0.103
Token it
Token ID340
Feature activation-1.45e-01

Pos logit contributions
 forgot+0.034
 remembered+0.033
 blew+0.032
 sne+0.031
 bit+0.031
Neg logit contributions
 If-0.019
 This-0.019
and-0.018
'.-0.018
.'-0.017
Token went
Token ID1816
Feature activation+3.76e-03

Pos logit contributions
 boo+0.185
Ta+0.174
 wish+0.167
 stolen+0.166
 merry+0.165
Neg logit contributions
-0.253
.-0.240
 She-0.229
 They-0.217
 whenever-0.216
Token,
Token ID11
Feature activation-2.36e-03

Pos logit contributions
 If+0.006
and+0.006
'.+0.005
 This+0.005
.'+0.005
Neg logit contributions
 forgot-0.007
 remembered-0.007
 saw-0.007
 blew-0.007
 dove-0.007
Token it
Token ID340
Feature activation-9.22e-02

Pos logit contributions
 forgot+0.004
 remembered+0.004
 sne+0.004
 blew+0.004
 wish+0.004
Neg logit contributions
 This-0.004
 If-0.004
'.-0.004
.'-0.004
and-0.004
Token made
Token ID925
Feature activation-5.23e-03

Pos logit contributions
 wish+0.090
 boo+0.082
Ta+0.079
 remembered+0.078
 sne+0.076
Neg logit contributions
-0.191
.-0.187
".-0.184
 This-0.182
 She-0.181
Token flowers
Token ID12734
Feature activation-2.27e-02

Pos logit contributions
 forgot+0.009
 sne+0.009
 remembered+0.009
 wish+0.009
 buzz+0.009
Neg logit contributions
 This-0.009
 If-0.009
'.-0.008
.'-0.008
".-0.008
Token loved
Token ID6151
Feature activation-3.47e-02

Pos logit contributions
 forgot+0.149
ited+0.147
 groom+0.146
 punished+0.144
 remembered+0.143
Neg logit contributions
-0.149
 This-0.132
.-0.131
 She-0.125
".-0.120
Token to
Token ID284
Feature activation-6.13e-02

Pos logit contributions
 forgot+0.095
 remembered+0.094
 blew+0.090
 wish+0.090
 sne+0.089
Neg logit contributions
 This-0.025
 If-0.023
and-0.021
'.-0.020
".-0.020
Token sing
Token ID1702
Feature activation+1.18e-02

Pos logit contributions
 forgot+0.074
 remembered+0.069
 regretted+0.063
 bit+0.059
 ruined+0.056
Neg logit contributions
 This-0.143
".-0.135
-0.133
'.-0.132
 If-0.130
Token.
Token ID13
Feature activation+4.00e-02

Pos logit contributions
 If+0.017
 This+0.017
'.+0.016
and+0.016
 That+0.015
Neg logit contributions
 fl-0.022
 sne-0.022
 buzz-0.022
 remembered-0.022
 ch-0.022
Token Everywhere
Token ID48201
Feature activation-2.84e-02

Pos logit contributions
ula+0.063
arling+0.061
keeping+0.060
and+0.060
uts+0.058
Neg logit contributions
 cut-0.070
 sne-0.067
 saw-0.066
 wish-0.066
 ch-0.065
Token it
Token ID340
Feature activation-1.45e-01

Pos logit contributions
 forgot+0.061
 remembered+0.060
 blew+0.058
 sne+0.058
 bit+0.057
Neg logit contributions
 This-0.036
 If-0.035
'.-0.034
".-0.032
and-0.032
Token went
Token ID1816
Feature activation-2.42e-03

Pos logit contributions
inge+0.192
 stolen+0.189
ige+0.185
 argument+0.184
Ta+0.184
Neg logit contributions
-0.260
.-0.246
 She-0.219
."-0.216
 They-0.206
Token,
Token ID11
Feature activation-3.56e-03

Pos logit contributions
 forgot+0.005
 remembered+0.005
 sw+0.005
 blew+0.005
 sne+0.005
Neg logit contributions
and-0.003
'.-0.003
 If-0.003
.'-0.003
 This-0.003
Token it
Token ID340
Feature activation-9.28e-02

Pos logit contributions
 forgot+0.007
 sne+0.006
 remembered+0.006
 dove+0.006
 blew+0.006
Neg logit contributions
'.-0.006
 If-0.006
 This-0.006
.'-0.005
and-0.005
Token would
Token ID561
Feature activation-4.51e-02

Pos logit contributions
 wish+0.091
Ta+0.083
 tug+0.082
ited+0.080
 stolen+0.080
Neg logit contributions
-0.200
.-0.192
".-0.185
."-0.183
 This-0.181
Token sing
Token ID1702
Feature activation+2.77e-02

Pos logit contributions
 forgot+0.042
 remembered+0.041
 blew+0.035
 bit+0.034
 dropped+0.033
Neg logit contributions
 This-0.110
 If-0.109
".-0.106
'.-0.106
.'-0.104
Token to
Token ID284
Feature activation-6.82e-02

Pos logit contributions
 forgot+0.051
 remembered+0.051
 sne+0.049
 blew+0.049
 bit+0.049
Neg logit contributions
 This-0.019
 If-0.019
'.-0.018
.'-0.017
".-0.017
Token be
Token ID307
Feature activation-6.06e-02

Pos logit contributions
 forgot+0.098
 remembered+0.098
 regretted+0.091
 blew+0.089
ited+0.085
Neg logit contributions
 This-0.135
".-0.133
-0.130
'.-0.129
 If-0.127
Token beautiful
Token ID4950
Feature activation-5.26e-03

Pos logit contributions
 forgot+0.093
 blew+0.089
 remembered+0.085
 regretted+0.082
ited+0.082
Neg logit contributions
".-0.107
 If-0.104
 This-0.103
-0.100
.-0.100
Token,
Token ID11
Feature activation-4.71e-03

Pos logit contributions
 sne+0.012
 remembered+0.012
 forgot+0.012
 bit+0.012
 wish+0.012
Neg logit contributions
 This-0.006
 If-0.006
'.-0.005
and-0.005
 That-0.005
Token so
Token ID523
Feature activation-1.20e-02

Pos logit contributions
 sne+0.009
 forgot+0.009
 bit+0.009
 remembered+0.009
 dove+0.009
Neg logit contributions
 This-0.007
'.-0.007
 If-0.007
.'-0.007
and-0.006
Token it
Token ID340
Feature activation-1.44e-01

Pos logit contributions
 forgot+0.025
 remembered+0.025
 blew+0.024
 sne+0.024
 wish+0.024
Neg logit contributions
 This-0.017
'.-0.016
 If-0.016
and-0.016
.'-0.015
Token tried
Token ID3088
Feature activation+3.19e-04

Pos logit contributions
inge+0.158
 UFO+0.157
 argument+0.157
 quarrel+0.157
Ta+0.154
Neg logit contributions
-0.258
.-0.250
 She-0.227
."-0.224
 They-0.221
Token to
Token ID284
Feature activation-6.53e-02

Pos logit contributions
 This+0.000
'.+0.000
 If+0.000
.'+0.000
 You+0.000
Neg logit contributions
 remembered-0.001
 sne-0.001
 forgot-0.001
 bit-0.001
 hurt-0.001
Token make
Token ID787
Feature activation-1.40e-02

Pos logit contributions
 forgot+0.083
 remembered+0.081
 regretted+0.076
 blew+0.069
 ate+0.067
Neg logit contributions
".-0.133
 If-0.131
'.-0.130
.'-0.130
 This-0.130
Token itself
Token ID2346
Feature activation-4.20e-02

Pos logit contributions
 forgot+0.026
 sne+0.026
 remembered+0.026
 wish+0.025
 blew+0.025
Neg logit contributions
 This-0.021
 If-0.021
'.-0.020
and-0.020
.'-0.019
Token very
Token ID845
Feature activation-6.78e-02

Pos logit contributions
 forgot+0.050
 remembered+0.049
 blew+0.045
 bit+0.043
 sne+0.043
Neg logit contributions
 This-0.094
 If-0.091
'.-0.089
".-0.089
and-0.089
Token a
Token ID257
Feature activation-1.18e-01

Pos logit contributions
 forgot+0.065
 remembered+0.063
 blew+0.060
 sne+0.059
 bit+0.059
Neg logit contributions
 This-0.047
 If-0.045
'.-0.043
and-0.042
".-0.042
Token gentle
Token ID10296
Feature activation-2.89e-02

Pos logit contributions
 forgot+0.208
ited+0.207
 struggled+0.206
 soared+0.203
 rocked+0.201
Neg logit contributions
-0.186
 "-0.171
.-0.162
 They-0.151
 And-0.150
Token lung
Token ID12317
Feature activation+2.03e-03

Pos logit contributions
 forgot+0.030
 remembered+0.029
 blew+0.027
 sne+0.026
 wish+0.026
Neg logit contributions
 This-0.068
 If-0.068
'.-0.066
".-0.065
and-0.065
Token.
Token ID13
Feature activation+3.31e-02

Pos logit contributions
 If+0.002
 This+0.002
and+0.002
'.+0.002
.'+0.002
Neg logit contributions
 forgot-0.005
 bit-0.005
 blew-0.005
 remembered-0.005
 sne-0.005
Token Everywhere
Token ID48201
Feature activation-2.73e-02

Pos logit contributions
and+0.045
but+0.039
.'+0.037
ula+0.036
arling+0.036
Neg logit contributions
 cut-0.067
 sne-0.067
 ate-0.066
 wish-0.065
 blew-0.064
Token it
Token ID340
Feature activation-1.44e-01

Pos logit contributions
 forgot+0.059
 remembered+0.058
 blew+0.056
 sne+0.056
 bit+0.055
Neg logit contributions
 This-0.034
 If-0.034
'.-0.032
and-0.031
".-0.031
Token went
Token ID1816
Feature activation+5.74e-03

Pos logit contributions
ige+0.202
Ta+0.201
 argument+0.200
 promise+0.189
ited+0.187
Neg logit contributions
-0.247
.-0.232
 He-0.221
."-0.204
 They-0.201
Token,
Token ID11
Feature activation+7.34e-03

Pos logit contributions
and+0.009
 If+0.008
'.+0.008
.'+0.008
but+0.008
Neg logit contributions
 remembered-0.011
 sw-0.011
 blew-0.011
 forgot-0.010
 saw-0.010
Token it
Token ID340
Feature activation-1.01e-01

Pos logit contributions
and+0.011
 If+0.011
'.+0.011
 This+0.011
.'+0.011
Neg logit contributions
 forgot-0.013
 sne-0.013
 remembered-0.013
 blew-0.013
 dove-0.013
Token made
Token ID925
Feature activation-1.47e-02

Pos logit contributions
Ta+0.107
 wish+0.105
Uh+0.100
ited+0.097
 forgot+0.093
Neg logit contributions
-0.205
.-0.196
".-0.184
."-0.183
 This-0.182
Token its
Token ID663
Feature activation-6.04e-02

Pos logit contributions
 forgot+0.026
 sne+0.025
 remembered+0.025
 blew+0.025
 wish+0.025
Neg logit contributions
 This-0.024
 If-0.024
'.-0.024
.'-0.023
and-0.023
Token was
Token ID373
Feature activation-8.92e-02

Pos logit contributions
 groom+0.154
 wish+0.138
Ta+0.134
 wave+0.132
ited+0.130
Neg logit contributions
-0.165
.-0.161
 She-0.152
 This-0.149
 If-0.144
Token not
Token ID407
Feature activation-3.79e-02

Pos logit contributions
ited+0.159
 forgot+0.156
 groom+0.143
 remembered+0.141
 rocked+0.141
Neg logit contributions
 If-0.144
 This-0.140
-0.139
 She-0.133
.-0.129
Token hurt
Token ID5938
Feature activation+2.73e-02

Pos logit contributions
 forgot+0.044
 remembered+0.042
 blew+0.040
 wish+0.040
 bit+0.040
Neg logit contributions
 If-0.083
 This-0.083
and-0.080
".-0.080
'.-0.079
Token,
Token ID11
Feature activation+1.20e-02

Pos logit contributions
 This+0.044
'.+0.043
'."+0.040
.'+0.040
 Today+0.039
Neg logit contributions
 hurt-0.053
 lost-0.050
 bit-0.050
 spoiled-0.048
 mean-0.048
Token but
Token ID475
Feature activation-3.63e-02

Pos logit contributions
'.+0.017
 This+0.016
.'+0.016
and+0.016
 If+0.015
Neg logit contributions
 forgot-0.025
 bit-0.025
 sne-0.024
 remembered-0.024
 blew-0.024
Token it
Token ID340
Feature activation-1.43e-01

Pos logit contributions
 forgot+0.083
 remembered+0.081
 blew+0.079
 sne+0.079
 wish+0.079
Neg logit contributions
 This-0.040
 If-0.039
-0.035
'.-0.035
".-0.034
Token was
Token ID373
Feature activation-5.85e-02

Pos logit contributions
 groom+0.310
 ferry+0.293
Ta+0.286
 bride+0.286
 wedding+0.280
Neg logit contributions
-0.175
.-0.170
 She-0.134
 "-0.127
 He-0.126
Token scared
Token ID12008
Feature activation+3.60e-02

Pos logit contributions
 forgot+0.102
 wish+0.098
 dove+0.096
 remembered+0.094
 blew+0.093
Neg logit contributions
 If-0.092
-0.084
 This-0.083
.-0.082
".-0.082
Token.
Token ID13
Feature activation+3.59e-02

Pos logit contributions
lor+0.058
keeping+0.057
angers+0.057
 This+0.057
'.+0.057
Neg logit contributions
 hurt-0.068
 mean-0.067
 sne-0.066
 bit-0.064
 naughty-0.063
Token The
Token ID383
Feature activation-8.06e-02

Pos logit contributions
fulness+0.050
reating+0.049
.'+0.048
ions+0.047
and+0.046
Neg logit contributions
 saw-0.074
 wished-0.069
 looked-0.069
 felt-0.069
 knew-0.069
Token cat
Token ID3797
Feature activation-9.62e-02

Pos logit contributions
ited+0.155
 forgot+0.155
 remembered+0.149
 regretted+0.139
 soared+0.139
Neg logit contributions
-0.122
 This-0.112
 "-0.109
 If-0.107
.-0.106
Token any
Token ID597
Feature activation-3.46e-02

Pos logit contributions
 forgot+0.042
 remembered+0.042
 sne+0.040
 blew+0.040
 wish+0.040
Neg logit contributions
 This-0.027
 If-0.027
'.-0.026
and-0.025
.'-0.025
Token water
Token ID1660
Feature activation+4.88e-02

Pos logit contributions
 forgot+0.043
 remembered+0.042
 blew+0.039
 sne+0.038
 ate+0.037
Neg logit contributions
 This-0.074
 If-0.074
'.-0.072
".-0.072
and-0.071
Token anywhere
Token ID6609
Feature activation+2.01e-02

Pos logit contributions
 If+0.076
 This+0.070
and+0.070
'.+0.066
 For+0.063
Neg logit contributions
 forgot-0.101
 sw-0.099
 bit-0.099
 buried-0.097
 ate-0.096
Token.
Token ID13
Feature activation+4.21e-02

Pos logit contributions
 If+0.027
 This+0.024
 Today+0.022
'.+0.022
 That+0.020
Neg logit contributions
 forgot-0.049
 bit-0.046
 lost-0.045
 saw-0.045
 hurt-0.045
Token Everywhere
Token ID48201
Feature activation-3.37e-02

Pos logit contributions
fulness+0.060
rier+0.056
uts+0.051
".+0.050
.'+0.050
Neg logit contributions
 saw-0.081
 looked-0.077
 observed-0.076
 knew-0.076
 felt-0.075
Token she
Token ID673
Feature activation-1.43e-01

Pos logit contributions
 forgot+0.081
 remembered+0.079
 sne+0.077
 blew+0.077
 wish+0.077
Neg logit contributions
 This-0.034
 If-0.032
'.-0.030
".-0.029
.'-0.028
Token looked
Token ID3114
Feature activation-3.14e-02

Pos logit contributions
 cruel+0.212
cod+0.205
inge+0.205
 wish+0.204
 itch+0.200
Neg logit contributions
 She-0.217
-0.217
 They-0.215
.-0.210
 He-0.209
Token,
Token ID11
Feature activation-2.06e-02

Pos logit contributions
 forgot+0.073
 remembered+0.072
 blew+0.069
 sne+0.069
 wish+0.069
Neg logit contributions
 This-0.034
 If-0.033
'.-0.031
and-0.030
".-0.030
Token it
Token ID340
Feature activation-9.30e-02

Pos logit contributions
 forgot+0.046
 remembered+0.045
 sne+0.044
 blew+0.043
 bit+0.043
Neg logit contributions
 This-0.025
 If-0.025
'.-0.024
".-0.023
.'-0.023
Token was
Token ID373
Feature activation-3.10e-02

Pos logit contributions
 wish+0.138
Ta+0.121
 boo+0.121
ited+0.114
 pigeon+0.114
Neg logit contributions
-0.167
.-0.163
 She-0.151
 They-0.147
 "-0.143
Token all
Token ID477
Feature activation-2.76e-02

Pos logit contributions
 forgot+0.040
 remembered+0.039
 blew+0.036
 sne+0.036
 wish+0.036
Neg logit contributions
 This-0.066
 If-0.065
'.-0.063
".-0.062
and-0.062
Token it
Token ID340
Feature activation-9.11e-02

Pos logit contributions
 forgot+0.064
 remembered+0.063
 blew+0.061
 sne+0.061
 bit+0.060
Neg logit contributions
 This-0.033
 If-0.032
'.-0.030
".-0.029
and-0.029
Token could
Token ID714
Feature activation-4.52e-02

Pos logit contributions
 groom+0.157
Ta+0.154
 wave+0.141
 wish+0.140
ited+0.138
Neg logit contributions
-0.133
.-0.128
 If-0.120
 before-0.111
 anymore-0.111
Token not
Token ID407
Feature activation-3.28e-02

Pos logit contributions
 remembered+0.073
 forgot+0.072
 blew+0.067
 ate+0.066
 bit+0.066
Neg logit contributions
 This-0.078
 If-0.077
'.-0.073
-0.073
".-0.073
Token.
Token ID13
Feature activation+3.53e-02

Pos logit contributions
 forgot+0.045
 remembered+0.044
 blew+0.041
 sne+0.040
 bit+0.040
Neg logit contributions
 This-0.066
 If-0.066
'.-0.065
".-0.064
.'-0.063
Token Everywhere
Token ID48201
Feature activation-3.52e-02

Pos logit contributions
and+0.042
fulness+0.042
.'+0.039
but+0.038
".+0.038
Neg logit contributions
 sne-0.069
 bit-0.069
 remembered-0.067
 saw-0.066
 wished-0.066
Token it
Token ID340
Feature activation-1.43e-01

Pos logit contributions
 forgot+0.080
 remembered+0.078
 sne+0.076
 wish+0.075
 blew+0.075
Neg logit contributions
 This-0.040
 If-0.038
'.-0.036
".-0.035
.'-0.034
Token looked
Token ID3114
Feature activation+3.05e-04

Pos logit contributions
ige+0.193
Ta+0.188
 groom+0.186
inge+0.184
 traitor+0.180
Neg logit contributions
-0.229
.-0.213
 He-0.198
 She-0.197
 But-0.189
Token there
Token ID612
Feature activation-1.18e-02

Pos logit contributions
 This+0.000
 If+0.000
and+0.000
'.+0.000
.'+0.000
Neg logit contributions
 forgot-0.001
 remembered-0.001
 blew-0.001
 saw-0.001
 dropped-0.001
Token was
Token ID373
Feature activation-1.91e-02

Pos logit contributions
 forgot+0.014
 remembered+0.014
 blew+0.013
 sne+0.013
 dropped+0.012
Neg logit contributions
 This-0.027
 If-0.026
'.-0.026
and-0.026
.'-0.025
Token too
Token ID1165
Feature activation-2.29e-02

Pos logit contributions
 forgot+0.032
 remembered+0.032
 blew+0.031
 sne+0.031
 bit+0.030
Neg logit contributions
 This-0.033
 If-0.033
'.-0.032
and-0.031
.'-0.031
Token much
Token ID881
Feature activation-1.87e-02

Pos logit contributions
 forgot+0.027
 remembered+0.027
 blew+0.025
 sne+0.025
 bit+0.024
Neg logit contributions
 This-0.052
 If-0.051
'.-0.050
and-0.049
".-0.048
Token harsh
Token ID11859
Feature activation+5.95e-02

Pos logit contributions
 This+0.054
and+0.053
 If+0.053
but+0.050
'.+0.050
Neg logit contributions
 forgot-0.036
 remembered-0.034
 lost-0.033
 hurt-0.033
 wish-0.032
Token it
Token ID340
Feature activation-4.60e-03

Pos logit contributions
and+0.108
 If+0.102
 This+0.101
but+0.096
'.+0.094
Neg logit contributions
 forgot-0.099
 blew-0.097
 bit-0.095
 remembered-0.094
 hurt-0.094
Token may
Token ID743
Feature activation+2.31e-02

Pos logit contributions
 forgot+0.006
 remembered+0.006
 sne+0.006
 blew+0.006
 bit+0.006
Neg logit contributions
 If-0.010
 This-0.010
and-0.009
'.-0.009
.'-0.009
Token seem
Token ID1283
Feature activation+4.73e-02

Pos logit contributions
 If+0.057
and+0.057
 This+0.055
'.+0.053
 For+0.052
Neg logit contributions
 forgot-0.024
 hurt-0.023
 sne-0.023
 wish-0.022
 remembered-0.021
Token.
Token ID13
Feature activation+8.29e-02

Pos logit contributions
 If+0.061
 This+0.057
and+0.057
ions+0.053
but+0.052
Neg logit contributions
 forgot-0.107
 bit-0.104
 sne-0.102
 hurt-0.101
 remembered-0.100
Token<|endoftext|>
Token ID50256
Feature activation+1.01e-01

Pos logit contributions
 anymore+0.115
arling+0.115
ula+0.114
angers+0.114
ings+0.113
Neg logit contributions
 wish-0.130
 knew-0.130
 remembered-0.128
 dream-0.122
 learned-0.120
TokenOnce
Token ID7454
Feature activation+3.27e-02

Pos logit contributions
uts+0.184
rier+0.181
fulness+0.173
oses+0.173
but+0.171
Neg logit contributions
 saw-0.170
 had-0.152
 met-0.149
 looked-0.147
 did-0.146
Token there
Token ID612
Feature activation+2.39e-03

Pos logit contributions
'.+0.061
 This+0.059
 If+0.059
and+0.058
.'+0.057
Neg logit contributions
 forgot-0.052
 saw-0.051
 remembered-0.050
 lost-0.046
 wish-0.046
Token was
Token ID373
Feature activation-1.66e-02

Pos logit contributions
 If+0.006
'.+0.006
 This+0.006
and+0.006
.'+0.006
Neg logit contributions
 forgot-0.002
 remembered-0.002
 saw-0.002
 sne-0.002
 wish-0.002
Token a
Token ID257
Feature activation-8.99e-02

Pos logit contributions
 forgot+0.029
 remembered+0.028
 sne+0.028
 blew+0.027
 wish+0.027
Neg logit contributions
 This-0.027
 If-0.027
'.-0.026
".-0.025
.'-0.025
Token girl
Token ID2576
Feature activation-2.15e-02

Pos logit contributions
 forgot+0.140
 remembered+0.135
 soared+0.130
 tricked+0.128
 struggled+0.128
Neg logit contributions
-0.167
.-0.147
 This-0.146
".-0.145
 "-0.144
Token the
Token ID262
Feature activation-1.26e-02

Pos logit contributions
uts+0.076
and+0.073
 If+0.072
'.+0.071
 This+0.071
Neg logit contributions
 sw-0.055
 sc-0.054
 forgot-0.053
 sne-0.053
 wave-0.053
Token different
Token ID1180
Feature activation+1.96e-02

Pos logit contributions
 forgot+0.017
 remembered+0.017
 sne+0.016
 wish+0.016
 bit+0.016
Neg logit contributions
 This-0.025
 If-0.025
'.-0.025
.'-0.024
and-0.024
Token things
Token ID1243
Feature activation+3.91e-02

Pos logit contributions
 This+0.040
 If+0.040
'.+0.039
uts+0.038
.'+0.038
Neg logit contributions
 forgot-0.025
 remembered-0.024
 bit-0.024
 wish-0.024
 sne-0.024
Token in
Token ID287
Feature activation+3.58e-02

Pos logit contributions
 If+0.068
 This+0.062
'.+0.059
and+0.059
.'"+0.059
Neg logit contributions
 forgot-0.074
 blew-0.070
 remembered-0.070
 sne-0.069
 sw-0.069
Token the
Token ID262
Feature activation+8.44e-03

Pos logit contributions
and+0.072
uts+0.068
but+0.068
 This+0.066
 If+0.066
Neg logit contributions
 remembered-0.051
 sne-0.051
 saw-0.051
 wish-0.050
 lost-0.049
Token lab
Token ID2248
Feature activation+1.03e-01

Pos logit contributions
uts+0.019
 If+0.018
and+0.018
 This+0.018
'.+0.017
Neg logit contributions
 forgot-0.009
 remembered-0.009
 bit-0.009
 saw-0.009
 wish-0.009
Token.
Token ID13
Feature activation+8.80e-02

Pos logit contributions
 If+0.147
 This+0.130
and+0.125
'.+0.116
.'"+0.116
Neg logit contributions
 forgot-0.220
 saw-0.216
 bit-0.213
 remembered-0.207
 sne-0.205
Token But
Token ID887
Feature activation+1.14e-02

Pos logit contributions
fulness+0.128
ently+0.128
.'"+0.126
.'+0.126
uts+0.123
Neg logit contributions
 saw-0.145
 forgot-0.140
 sw-0.136
 felt-0.136
 remembered-0.135
Token when
Token ID618
Feature activation+2.43e-02

Pos logit contributions
and+0.019
 If+0.019
'.+0.019
 This+0.019
.'+0.019
Neg logit contributions
 forgot-0.021
 remembered-0.021
 sne-0.020
 blew-0.019
 bit-0.019
Token the
Token ID262
Feature activation-1.92e-02

Pos logit contributions
and+0.045
 If+0.042
.'+0.039
 This+0.038
'.+0.038
Neg logit contributions
 forgot-0.046
 sne-0.044
 remembered-0.044
 blew-0.043
 sw-0.042
Token smell
Token ID8508
Feature activation-4.33e-02

Pos logit contributions
 forgot+0.028
 remembered+0.027
 sne+0.026
 blew+0.026
 bit+0.026
Neg logit contributions
 This-0.037
 If-0.037
'.-0.036
and-0.036
.'-0.035
Token named
Token ID3706
Feature activation+4.74e-02

Pos logit contributions
 forgot+0.039
 remembered+0.038
 wish+0.036
 sne+0.036
 blew+0.035
Neg logit contributions
 This-0.107
 If-0.106
'.-0.105
".-0.104
.'-0.102
Token it
Token ID340
Feature activation+1.83e-02

Pos logit contributions
'.+0.079
and+0.079
 If+0.079
.'+0.078
but+0.076
Neg logit contributions
 sne-0.075
 ate-0.075
 chased-0.074
 forgot-0.073
 sw-0.073
Token Purple
Token ID17265
Feature activation+5.16e-02

Pos logit contributions
 If+0.044
 This+0.042
'.+0.042
.'+0.041
and+0.041
Neg logit contributions
 remembered-0.022
 forgot-0.022
 blew-0.021
 sne-0.020
 sw-0.020
Tokeny
Token ID88
Feature activation+4.48e-02

Pos logit contributions
 If+0.087
and+0.079
 This+0.076
.'+0.072
 Today+0.071
Neg logit contributions
 sw-0.096
 dove-0.096
 forgot-0.096
 remembered-0.095
 blew-0.094
Token.
Token ID13
Feature activation+1.11e-01

Pos logit contributions
 If+0.082
and+0.081
 This+0.076
'.+0.070
.'+0.070
Neg logit contributions
 remembered-0.080
 forgot-0.079
 blew-0.075
 ate-0.073
 sne-0.073
Token<|endoftext|>
Token ID50256
Feature activation+9.96e-02

Pos logit contributions
rier+0.173
ings+0.161
keeping+0.160
ment+0.159
and+0.155
Neg logit contributions
 saw-0.166
 remembered-0.164
 played-0.161
 knew-0.159
 met-0.158
TokenM
Token ID44
Feature activation-1.18e-02

Pos logit contributions
rier+0.171
fulness+0.164
oses+0.157
uts+0.157
 If+0.154
Neg logit contributions
 saw-0.185
 met-0.165
 lost-0.162
 looked-0.161
 had-0.160
Tokenolly
Token ID5098
Feature activation-1.15e-02

Pos logit contributions
 forgot+0.021
 remembered+0.021
 wish+0.020
 sne+0.020
 blew+0.019
Neg logit contributions
 If-0.019
 This-0.019
and-0.018
'.-0.018
".-0.018
Token the
Token ID262
Feature activation-3.21e-03

Pos logit contributions
 forgot+0.021
 remembered+0.021
 blew+0.020
 sne+0.020
 bit+0.019
Neg logit contributions
 If-0.019
 This-0.019
and-0.018
'.-0.017
.'-0.017
Token rabbit
Token ID22746
Feature activation+1.42e-02

Pos logit contributions
 dove+0.003
 sne+0.003
 blew+0.003
 forgot+0.003
 remembered+0.003
Neg logit contributions
 If-0.008
 This-0.007
and-0.007
'.-0.007
.'-0.007
Token was
Token ID373
Feature activation-1.16e-02

Pos logit contributions
 If+0.037
and+0.036
 This+0.035
'.+0.035
.'+0.034
Neg logit contributions
 remembered-0.014
 forgot-0.014
 ate-0.012
 saw-0.012
 blew-0.012
Token of
Token ID286
Feature activation+2.62e-02

Pos logit contributions
 This+0.051
 If+0.049
'.+0.047
.'+0.044
'."+0.044
Neg logit contributions
 bit-0.051
 forgot-0.050
 remembered-0.050
 sne-0.049
 saw-0.049
Token her
Token ID607
Feature activation-3.52e-02

Pos logit contributions
 This+0.047
'.+0.046
uts+0.046
umbs+0.046
and+0.045
Neg logit contributions
 remembered-0.040
 sc-0.039
 knew-0.038
 pin-0.038
 fl-0.037
Token big
Token ID1263
Feature activation-4.65e-02

Pos logit contributions
 forgot+0.043
 remembered+0.042
 blew+0.039
 sne+0.038
 bit+0.037
Neg logit contributions
 This-0.078
 If-0.075
'.-0.075
".-0.073
.'-0.073
Token discovery
Token ID9412
Feature activation+6.67e-02

Pos logit contributions
 forgot+0.061
 remembered+0.058
 blew+0.055
 ate+0.054
 dropped+0.053
Neg logit contributions
 This-0.098
'.-0.095
 If-0.093
".-0.093
.'-0.092
Token.
Token ID13
Feature activation+8.68e-02

Pos logit contributions
 If+0.114
 This+0.100
 Today+0.093
and+0.090
arl+0.090
Neg logit contributions
 bit-0.127
 forgot-0.112
 pin-0.112
 knew-0.111
 sc-0.111
Token<|endoftext|>
Token ID50256
Feature activation+1.04e-01

Pos logit contributions
rier+0.159
fulness+0.144
 anymore+0.139
lights+0.132
ings+0.131
Neg logit contributions
 knew-0.128
 saw-0.115
 remembered-0.113
 learned-0.112
 wished-0.111
TokenOnce
Token ID7454
Feature activation+4.11e-02

Pos logit contributions
rier+0.201
fulness+0.197
uts+0.191
oses+0.188
 If+0.184
Neg logit contributions
 saw-0.167
 met-0.146
 had-0.145
 looked-0.142
 did-0.141
Token upon
Token ID2402
Feature activation-1.17e-02

Pos logit contributions
'.+0.079
 This+0.078
 If+0.077
and+0.076
.'+0.074
Neg logit contributions
 forgot-0.063
 saw-0.062
 remembered-0.060
 lost-0.055
 knew-0.055
Token a
Token ID257
Feature activation-3.69e-02

Pos logit contributions
 forgot+0.020
 remembered+0.020
 sne+0.019
 wish+0.019
 blew+0.018
Neg logit contributions
 This-0.020
 If-0.020
'.-0.019
.'-0.018
".-0.018
Token time
Token ID640
Feature activation+4.15e-02

Pos logit contributions
 forgot+0.064
 remembered+0.063
 blew+0.061
 sne+0.059
 wish+0.058
Neg logit contributions
 This-0.062
 If-0.060
'.-0.059
and-0.057
.'-0.057
Token there
Token ID612
Feature activation-1.35e-02

Pos logit contributions
'.+0.077
.'+0.077
 If+0.076
keeping+0.075
.'"+0.074
Neg logit contributions
 saw-0.061
 remembered-0.059
 forgot-0.058
 sne-0.055
 sw-0.054
Token up
Token ID510
Feature activation+4.91e-02

Pos logit contributions
 This+0.046
 If+0.045
 That+0.040
 You+0.040
 Everyone+0.039
Neg logit contributions
 bit-0.053
 buried-0.050
 sw-0.047
 sne-0.047
 hurt-0.046
Token into
Token ID656
Feature activation+5.31e-03

Pos logit contributions
 If+0.085
 This+0.085
 Those+0.084
 These+0.084
 That+0.082
Neg logit contributions
 bit-0.090
 buried-0.086
 hurt-0.085
 lost-0.084
 hit-0.083
Token a
Token ID257
Feature activation-4.65e-02

Pos logit contributions
 This+0.010
 If+0.009
and+0.009
'.+0.009
uts+0.009
Neg logit contributions
 sne-0.009
 remembered-0.008
 bit-0.008
 sw-0.008
 blew-0.008
Token tree
Token ID5509
Feature activation+4.68e-02

Pos logit contributions
 forgot+0.055
 remembered+0.054
 blew+0.051
 sne+0.047
 ate+0.047
Neg logit contributions
 This-0.101
'.-0.100
".-0.099
 If-0.099
.'-0.098
Token.
Token ID13
Feature activation+6.47e-02

Pos logit contributions
 If+0.057
 This+0.053
fulness+0.048
 You+0.047
 That+0.047
Neg logit contributions
 bit-0.112
 sne-0.107
 cut-0.106
 buried-0.104
 hurt-0.102
Token
Token ID220
Feature activation+1.22e-01

Pos logit contributions
 anymore+0.116
ings+0.107
ula+0.107
uts+0.106
reating+0.104
Neg logit contributions
 ate-0.097
 saw-0.091
 looked-0.090
 climbed-0.087
 knew-0.086
Token
Token ID198
Feature activation+4.09e-02

Pos logit contributions
fulness+0.208
reating+0.203
 lately+0.200
overs+0.194
lights+0.193
Neg logit contributions
 saw-0.151
 climbed-0.149
 lost-0.143
 forgot-0.142
 dropped-0.140
Token
Token ID198
Feature activation-2.54e-02

Pos logit contributions
 anymore+0.047
'.+0.045
unes+0.044
.'+0.044
overs+0.044
Neg logit contributions
 saw-0.086
 ate-0.081
 buzz-0.080
 knew-0.079
 forgot-0.079
TokenThe
Token ID464
Feature activation-7.36e-02

Pos logit contributions
 forgot+0.048
 remembered+0.047
 blew+0.045
 sne+0.044
 bit+0.044
Neg logit contributions
 This-0.039
 If-0.039
'.-0.037
and-0.036
".-0.036
Token floods
Token ID27283
Feature activation-2.40e-02

Pos logit contributions
 forgot+0.124
 remembered+0.118
 rocked+0.118
 pretended+0.116
 decorated+0.115
Neg logit contributions
-0.114
".-0.111
 If-0.110
 This-0.108
.-0.105
Token kept
Token ID4030
Feature activation+3.50e-05

Pos logit contributions
 forgot+0.032
 remembered+0.031
 blew+0.030
 sne+0.029
 bit+0.029
Neg logit contributions
 This-0.050
 If-0.049
'.-0.048
and-0.047
".-0.047
Token afraid
Token ID7787
Feature activation+5.77e-02

Pos logit contributions
 forgot+0.036
 remembered+0.035
 blew+0.033
 sne+0.033
 bit+0.032
Neg logit contributions
 This-0.058
 If-0.057
'.-0.055
and-0.054
".-0.054
Token the
Token ID262
Feature activation-4.10e-02

Pos logit contributions
'.+0.104
.'+0.101
 This+0.099
 If+0.099
.'"+0.098
Neg logit contributions
 sne-0.100
 forgot-0.093
 saw-0.093
 hurt-0.092
 bit-0.092
Token old
Token ID1468
Feature activation-6.68e-03

Pos logit contributions
 forgot+0.050
 remembered+0.048
 blew+0.044
 sne+0.042
 ate+0.042
Neg logit contributions
 This-0.090
 If-0.088
'.-0.086
".-0.085
.'-0.084
Token man
Token ID582
Feature activation+2.26e-02

Pos logit contributions
 forgot+0.006
 remembered+0.006
 sne+0.006
 blew+0.006
 dove+0.006
Neg logit contributions
 If-0.017
 This-0.017
and-0.017
'.-0.016
.'-0.016
Token had
Token ID550
Feature activation-1.44e-02

Pos logit contributions
 If+0.048
 This+0.048
'.+0.045
and+0.045
.'+0.044
Neg logit contributions
 forgot-0.033
 remembered-0.032
 bit-0.030
 sne-0.030
 ate-0.029
Token come
Token ID1282
Feature activation+5.51e-02

Pos logit contributions
 forgot+0.021
 remembered+0.020
 sne+0.019
 blew+0.019
 dropped+0.018
Neg logit contributions
 This-0.029
 If-0.029
'.-0.028
and-0.028
.'-0.028
Token with
Token ID351
Feature activation+2.90e-02

Pos logit contributions
 This+0.106
 If+0.101
 These+0.098
ions+0.096
 By+0.095
Neg logit contributions
 lost-0.087
 remembered-0.085
 hurt-0.083
 sne-0.080
 dropped-0.079
Token bad
Token ID2089
Feature activation-6.18e-03

Pos logit contributions
uts+0.045
and+0.045
 This+0.044
but+0.043
 If+0.042
Neg logit contributions
 sne-0.056
 wish-0.054
 remembered-0.054
 lost-0.054
 dropped-0.053
Token news
Token ID1705
Feature activation+7.83e-02

Pos logit contributions
 sne+0.009
 remembered+0.009
 forgot+0.009
 wish+0.009
 bit+0.008
Neg logit contributions
 If-0.012
 This-0.012
'.-0.012
and-0.011
.'-0.011
Token.
Token ID13
Feature activation+8.92e-02

Pos logit contributions
 If+0.105
 This+0.101
 Today+0.088
and+0.086
 That+0.085
Neg logit contributions
 bit-0.173
 forgot-0.168
 sne-0.164
 remembered-0.162
 saw-0.161
Token But
Token ID887
Feature activation+4.72e-03

Pos logit contributions
fulness+0.141
ions+0.133
ancing+0.131
ings+0.127
and+0.123
Neg logit contributions
 forgot-0.140
 saw-0.138
 knew-0.135
 wished-0.135
 bit-0.131
Token and
Token ID290
Feature activation-3.02e-02

Pos logit contributions
 If+0.039
.'+0.037
'.+0.037
 This+0.037
and+0.036
Neg logit contributions
 remembered-0.042
 forgot-0.041
 ate-0.040
 sne-0.039
 blew-0.039
Token made
Token ID925
Feature activation+1.19e-02

Pos logit contributions
 forgot+0.041
 remembered+0.040
 blew+0.038
 sne+0.037
 wish+0.037
Neg logit contributions
 This-0.062
 If-0.061
'.-0.059
".-0.058
and-0.058
Token it
Token ID340
Feature activation-1.85e-02

Pos logit contributions
'.+0.020
 This+0.020
 If+0.020
.'+0.019
".+0.019
Neg logit contributions
 sne-0.020
 wish-0.020
 dropped-0.019
 forgot-0.019
 sw-0.018
Token their
Token ID511
Feature activation-4.80e-02

Pos logit contributions
 forgot+0.026
 remembered+0.025
 sne+0.024
 wish+0.023
 blew+0.023
Neg logit contributions
 This-0.037
 If-0.037
'.-0.037
.'-0.035
".-0.035
Token new
Token ID649
Feature activation-1.55e-02

Pos logit contributions
 forgot+0.069
 remembered+0.069
 blew+0.067
 dropped+0.062
 buzz+0.061
Neg logit contributions
 This-0.094
 If-0.092
and-0.088
'.-0.087
".-0.087
Token friend
Token ID1545
Feature activation+6.95e-02

Pos logit contributions
 forgot+0.017
 remembered+0.017
 wish+0.016
 sne+0.016
 bit+0.016
Neg logit contributions
 If-0.036
 This-0.035
'.-0.035
.'-0.034
and-0.033
Token.
Token ID13
Feature activation+6.21e-02

Pos logit contributions
 If+0.104
 This+0.090
and+0.090
 By+0.085
.'+0.084
Neg logit contributions
 ate-0.140
 remembered-0.140
 sne-0.139
 forgot-0.139
 bit-0.137
Token They
Token ID1119
Feature activation-7.40e-02

Pos logit contributions
and+0.100
.'+0.094
ently+0.090
fulness+0.086
ing+0.085
Neg logit contributions
 sw-0.100
 ate-0.094
 wished-0.094
 threw-0.093
 knew-0.092
Token had
Token ID550
Feature activation-2.22e-02

Pos logit contributions
 wish+0.076
 boo+0.074
 sne+0.074
 bit+0.072
ited+0.072
Neg logit contributions
-0.147
.-0.142
 They-0.141
".-0.140
'.-0.139
Token so
Token ID523
Feature activation-3.57e-02

Pos logit contributions
 forgot+0.038
 remembered+0.037
 sne+0.035
 blew+0.035
 bit+0.035
Neg logit contributions
 This-0.039
 If-0.038
'.-0.037
and-0.036
.'-0.036
Token much
Token ID881
Feature activation-3.42e-02

Pos logit contributions
 forgot+0.081
 remembered+0.080
 blew+0.078
 sne+0.077
 bit+0.076
Neg logit contributions
 This-0.041
 If-0.040
and-0.037
'.-0.037
".-0.036
Token
Token ID198
Feature activation-1.96e-02

Pos logit contributions
 If+0.043
'.+0.042
 This+0.041
.'+0.040
".+0.040
Neg logit contributions
 forgot-0.049
 saw-0.049
 wish-0.047
 sne-0.047
 dove-0.047
TokenTom
Token ID13787
Feature activation-4.51e-02

Pos logit contributions
 forgot+0.035
 remembered+0.034
 blew+0.033
 sne+0.032
 bit+0.032
Neg logit contributions
 This-0.032
 If-0.032
'.-0.031
".-0.030
.'-0.029
Tokenmy
Token ID1820
Feature activation-9.17e-02

Pos logit contributions
 forgot+0.082
 wish+0.081
 remembered+0.079
 blew+0.079
 sne+0.078
Neg logit contributions
 This-0.066
 If-0.065
'.-0.063
and-0.062
".-0.061
Token was
Token ID373
Feature activation-2.84e-02

Pos logit contributions
 wish+0.157
 matter+0.150
 flood+0.149
ited+0.147
Uh+0.145
Neg logit contributions
-0.144
 This-0.126
.-0.125
 They-0.123
 She-0.121
Token very
Token ID845
Feature activation-4.45e-02

Pos logit contributions
 forgot+0.033
 remembered+0.032
 blew+0.030
 sne+0.030
 bit+0.029
Neg logit contributions
 This-0.064
 If-0.063
'.-0.061
and-0.060
".-0.060
Token happy
Token ID3772
Feature activation+2.28e-02

Pos logit contributions
 forgot+0.054
 remembered+0.054
 blew+0.054
 sne+0.053
 bit+0.051
Neg logit contributions
 If-0.093
 This-0.092
-0.089
and-0.088
".-0.085
Token and
Token ID290
Feature activation-5.47e-02

Pos logit contributions
 If+0.044
'.+0.042
 This+0.041
.'+0.040
'."+0.039
Neg logit contributions
 remembered-0.036
 sne-0.036
 forgot-0.036
 saw-0.036
 bit-0.035
Token he
Token ID339
Feature activation-8.37e-02

Pos logit contributions
 remembered+0.080
 wish+0.080
 sne+0.079
 blew+0.078
 forgot+0.077
Neg logit contributions
 This-0.099
-0.093
 If-0.090
 They-0.089
 She-0.089
Token played
Token ID2826
Feature activation+1.49e-02

Pos logit contributions
 wish+0.093
 matter+0.085
ited+0.085
 sne+0.083
Uh+0.082
Neg logit contributions
-0.174
 This-0.163
 They-0.162
 She-0.161
.-0.159
Token with
Token ID351
Feature activation-1.14e-03

Pos logit contributions
'.+0.022
 This+0.021
 If+0.021
.'+0.021
uts+0.020
Neg logit contributions
 sne-0.028
 pin-0.027
 buzz-0.027
 sw-0.027
 ate-0.027
Token the
Token ID262
Feature activation-4.20e-02

Pos logit contributions
 forgot+0.002
 remembered+0.002
 sne+0.002
 dropped+0.002
 bit+0.002
Neg logit contributions
 This-0.002
'.-0.002
 If-0.002
.'-0.002
and-0.002
Token slower
Token ID13611
Feature activation+3.95e-02

Pos logit contributions
 forgot+0.029
 remembered+0.028
 blew+0.027
 sne+0.027
 bit+0.027
Neg logit contributions
 This-0.035
 If-0.034
'.-0.034
and-0.033
.'-0.033
Token than
Token ID621
Feature activation+4.70e-02

Pos logit contributions
 If+0.062
and+0.060
 This+0.060
'.+0.058
 By+0.054
Neg logit contributions
 fl-0.077
 sne-0.076
 bit-0.076
 sw-0.076
 blew-0.075
Token the
Token ID262
Feature activation-1.81e-02

Pos logit contributions
'.+0.082
 This+0.081
.'+0.079
 If+0.079
 Today+0.077
Neg logit contributions
 forgot-0.083
 sne-0.079
 dropped-0.078
 lost-0.077
 bit-0.077
Token other
Token ID584
Feature activation-1.21e-02

Pos logit contributions
 forgot+0.021
 remembered+0.020
 blew+0.019
 sne+0.019
 bit+0.018
Neg logit contributions
 This-0.042
 If-0.041
and-0.040
'.-0.040
.'-0.039
Token jets
Token ID20792
Feature activation+3.95e-02

Pos logit contributions
 forgot+0.018
 remembered+0.017
 blew+0.017
 sne+0.016
 bit+0.016
Neg logit contributions
 This-0.024
 If-0.024
and-0.023
'.-0.023
.'-0.022
Token.
Token ID13
Feature activation+5.00e-02

Pos logit contributions
 If+0.046
and+0.045
 This+0.043
'.+0.037
 Today+0.036
Neg logit contributions
 hurt-0.092
 forgot-0.091
 blew-0.090
 sne-0.089
 bit-0.088
Token Anna
Token ID11735
Feature activation-3.44e-02

Pos logit contributions
fulness+0.099
ancing+0.095
and+0.095
keeping+0.094
.'+0.093
Neg logit contributions
 saw-0.066
 sne-0.061
 dropped-0.061
 forgot-0.061
 spit-0.061
Token and
Token ID290
Feature activation-4.13e-03

Pos logit contributions
 forgot+0.078
 remembered+0.076
 blew+0.073
 sne+0.073
 wish+0.073
Neg logit contributions
 This-0.039
 If-0.039
'.-0.036
".-0.035
and-0.035
Token Ben
Token ID3932
Feature activation-7.43e-02

Pos logit contributions
 sne+0.009
 remembered+0.008
 forgot+0.008
 dove+0.008
 blew+0.008
Neg logit contributions
and-0.006
 If-0.006
'.-0.006
 This-0.006
".-0.006
Token are
Token ID389
Feature activation-4.50e-02

Pos logit contributions
 remembered+0.108
 forgot+0.108
 regretted+0.102
 blew+0.101
 sne+0.099
Neg logit contributions
'.-0.132
 If-0.131
 This-0.130
.'-0.126
-0.126
Token curious
Token ID11040
Feature activation+3.97e-02

Pos logit contributions
 forgot+0.057
 remembered+0.054
 blew+0.053
 bit+0.052
 wish+0.051
Neg logit contributions
 If-0.095
 This-0.093
-0.089
".-0.088
and-0.087
Token,
Token ID11
Feature activation+5.04e-02

Pos logit contributions
'.+0.051
.'+0.050
and+0.048
 If+0.046
".+0.045
Neg logit contributions
 sne-0.049
 remembered-0.048
 sw-0.046
 dove-0.045
 pin-0.045
Token "
Token ID366
Feature activation-1.52e-02

Pos logit contributions
.'+0.076
and+0.075
'.+0.073
keeping+0.069
ings+0.068
Neg logit contributions
 remembered-0.103
 blew-0.099
 ate-0.095
 saw-0.095
 sne-0.095
TokenLook
Token ID8567
Feature activation+1.05e-02

Pos logit contributions
 forgot+0.022
 remembered+0.021
 blew+0.021
 sne+0.020
 bit+0.020
Neg logit contributions
 This-0.030
 If-0.030
'.-0.028
".-0.028
and-0.028
Token,
Token ID11
Feature activation+5.33e-03

Pos logit contributions
 If+0.016
and+0.016
 This+0.015
'.+0.014
 By+0.014
Neg logit contributions
 remembered-0.020
 forgot-0.020
 sne-0.020
 wish-0.019
 dropped-0.019
Token Ben
Token ID3932
Feature activation+1.44e-02

Pos logit contributions
and+0.008
'.+0.007
.'+0.007
 If+0.007
 This+0.007
Neg logit contributions
 sne-0.011
 forgot-0.011
 wish-0.011
 remembered-0.010
 dove-0.010
Token.
Token ID13
Feature activation+4.85e-02

Pos logit contributions
and+0.024
 If+0.021
 This+0.021
but+0.020
uts+0.019
Neg logit contributions
 remembered-0.031
 forgot-0.031
 bit-0.028
 sne-0.028
 saw-0.028
Token This
Token ID770
Feature activation-2.14e-02

Pos logit contributions
and+0.078
ently+0.067
.'+0.063
fulness+0.062
ings+0.062
Neg logit contributions
 wish-0.086
 dream-0.082
 cut-0.081
 dove-0.080
 ate-0.078
Token is
Token ID318
Feature activation-2.40e-03

Pos logit contributions
 forgot+0.034
 remembered+0.033
 blew+0.032
 sne+0.032
 wish+0.031
Neg logit contributions
 If-0.039
 This-0.039
and-0.038
'.-0.038
.'-0.037
Token a
Token ID257
Feature activation-5.59e-02

Pos logit contributions
 forgot+0.004
 remembered+0.004
 blew+0.004
 sne+0.004
 bit+0.004
Neg logit contributions
and-0.005
 This-0.004
'.-0.004
.'-0.004
 If-0.004
Token boat
Token ID8848
Feature activation+3.47e-02

Pos logit contributions
 forgot+0.066
 remembered+0.062
 blew+0.056
 ate+0.056
 chased+0.055
Neg logit contributions
 This-0.122
".-0.119
'.-0.118
 If-0.118
.'-0.116
Token.
Token ID13
Feature activation+5.80e-02

Pos logit contributions
 If+0.045
 This+0.041
and+0.040
 Today+0.036
'.+0.035
Neg logit contributions
 bit-0.080
 cut-0.075
 forgot-0.075
 blew-0.075
 ate-0.075
Token was
Token ID373
Feature activation-6.31e-03

Pos logit contributions
 If+0.013
 This+0.013
and+0.012
'.+0.012
.'+0.012
Neg logit contributions
 remembered-0.006
 forgot-0.005
 sne-0.005
 blew-0.005
 ate-0.005
Token very
Token ID845
Feature activation-3.34e-02

Pos logit contributions
 remembered+0.008
 forgot+0.008
 sne+0.008
 blew+0.008
 bit+0.007
Neg logit contributions
 This-0.013
 If-0.013
'.-0.013
and-0.013
.'-0.013
Token thirsty
Token ID47124
Feature activation+3.77e-02

Pos logit contributions
 forgot+0.039
 remembered+0.038
 blew+0.035
 sne+0.034
 bit+0.034
Neg logit contributions
 This-0.075
 If-0.074
'.-0.072
".-0.071
and-0.070
Token.
Token ID13
Feature activation+5.00e-02

Pos logit contributions
 If+0.047
 This+0.046
and+0.039
 For+0.035
 That+0.034
Neg logit contributions
 bit-0.089
 sne-0.087
 hurt-0.084
 sw-0.083
 cut-0.083
Token He
Token ID679
Feature activation-6.66e-02

Pos logit contributions
ula+0.068
ions+0.066
fulness+0.063
.'+0.062
'.+0.061
Neg logit contributions
 saw-0.099
 sw-0.096
 ate-0.096
 threw-0.095
 climbed-0.095
Token saw
Token ID2497
Feature activation-3.04e-02

Pos logit contributions
 forgot+0.056
 remembered+0.052
 blew+0.052
 wish+0.051
 buzz+0.047
Neg logit contributions
 This-0.154
 If-0.151
-0.150
'.-0.149
 She-0.148
Token a
Token ID257
Feature activation-9.12e-02

Pos logit contributions
 forgot+0.061
 remembered+0.060
 blew+0.057
 sne+0.057
 bit+0.056
Neg logit contributions
 This-0.043
 If-0.042
'.-0.040
and-0.039
".-0.039
Token big
Token ID1263
Feature activation-6.95e-02

Pos logit contributions
 forgot+0.131
 remembered+0.127
 believed+0.111
 chased+0.111
 swung+0.108
Neg logit contributions
-0.168
".-0.166
 This-0.164
'.-0.164
.'-0.159
Token mel
Token ID7758
Feature activation-3.38e-02

Pos logit contributions
 forgot+0.130
 remembered+0.121
 blew+0.111
 dropped+0.108
 wished+0.108
Neg logit contributions
 This-0.113
".-0.104
'.-0.104
 If-0.101
.'-0.100
Tokenon
Token ID261
Feature activation+7.55e-03

Pos logit contributions
 forgot+0.076
 remembered+0.074
 blew+0.071
 sne+0.071
 wish+0.071
Neg logit contributions
 This-0.041
 If-0.039
'.-0.038
".-0.037
and-0.037
Token,
Token ID11
Feature activation-1.09e-02

Pos logit contributions
 If+0.013
 This+0.013
'.+0.012
and+0.012
.'+0.012
Neg logit contributions
 blew-0.013
 remembered-0.013
 forgot-0.013
 sne-0.013
 ate-0.013
Token was
Token ID373
Feature activation-2.62e-02

Pos logit contributions
 forgot+0.060
 remembered+0.059
 wish+0.057
 sne+0.056
 dove+0.056
Neg logit contributions
 This-0.106
 If-0.104
'.-0.101
".-0.100
-0.097
Token to
Token ID284
Feature activation-3.74e-02

Pos logit contributions
 forgot+0.049
 remembered+0.048
 blew+0.046
 sne+0.046
 bit+0.045
Neg logit contributions
 This-0.041
 If-0.040
'.-0.039
and-0.038
".-0.038
Token drink
Token ID4144
Feature activation+1.42e-02

Pos logit contributions
 forgot+0.043
 remembered+0.042
 blew+0.038
 bit+0.038
 sne+0.037
Neg logit contributions
 This-0.085
 If-0.083
'.-0.081
".-0.080
.'-0.079
Token from
Token ID422
Feature activation+1.00e-02

Pos logit contributions
 If+0.029
 This+0.028
'.+0.028
and+0.028
.'+0.027
Neg logit contributions
 bit-0.019
 sne-0.019
 hurt-0.018
 cut-0.018
 blew-0.018
Token a
Token ID257
Feature activation-4.41e-02

Pos logit contributions
 If+0.018
 This+0.018
and+0.018
'.+0.017
uts+0.017
Neg logit contributions
 sne-0.016
 bit-0.016
 remembered-0.016
 forgot-0.015
 hurt-0.015
Token glass
Token ID5405
Feature activation+1.16e-02

Pos logit contributions
 forgot+0.057
 remembered+0.055
 blew+0.051
 sne+0.048
 ate+0.047
Neg logit contributions
 This-0.094
 If-0.092
'.-0.091
".-0.090
and-0.089
Token without
Token ID1231
Feature activation-3.93e-03

Pos logit contributions
 If+0.021
 This+0.021
'.+0.019
 That+0.019
and+0.019
Neg logit contributions
 bit-0.020
 forgot-0.019
 cut-0.018
 dove-0.018
 sne-0.018
Token using
Token ID1262
Feature activation+3.75e-02

Pos logit contributions
 remembered+0.007
 forgot+0.007
 sne+0.007
 bit+0.007
 blew+0.007
Neg logit contributions
 This-0.006
 If-0.006
'.-0.006
and-0.006
.'-0.006
Token his
Token ID465
Feature activation-2.71e-02

Pos logit contributions
and+0.071
but+0.068
 If+0.066
 This+0.066
'.+0.066
Neg logit contributions
 sne-0.059
 cut-0.058
 hurt-0.057
 bit-0.056
 dropped-0.056
Token paws
Token ID49193
Feature activation+9.12e-02

Pos logit contributions
 forgot+0.037
 remembered+0.036
 blew+0.034
 sne+0.033
 bit+0.033
Neg logit contributions
 This-0.056
 If-0.055
'.-0.054
and-0.052
".-0.052
Token.
Token ID13
Feature activation+5.87e-02

Pos logit contributions
 This+0.142
 If+0.140
 You+0.137
 Today+0.134
 Everyone+0.133
Neg logit contributions
 hurt-0.172
 forgot-0.171
 bit-0.169
 cut-0.168
 lost-0.165
Token Now
Token ID2735
Feature activation+1.44e-02

Pos logit contributions
and+0.105
uts+0.101
ings+0.100
but+0.100
unes+0.094
Neg logit contributions
 forgot-0.087
 missed-0.085
 remembered-0.084
 bit-0.082
 wished-0.081
Token Emma
Token ID18966
Feature activation-1.66e-02

Pos logit contributions
and+0.023
 If+0.021
 This+0.020
but+0.019
uts+0.019
Neg logit contributions
 forgot-0.030
 remembered-0.029
 bit-0.027
 ate-0.027
 blew-0.027
Token knew
Token ID2993
Feature activation+3.48e-02

Pos logit contributions
 forgot+0.019
 remembered+0.019
 blew+0.017
 sne+0.017
 bit+0.016
Neg logit contributions
 This-0.038
 If-0.038
and-0.037
'.-0.037
.'-0.036
Token to
Token ID284
Feature activation-1.91e-02

Pos logit contributions
 This+0.062
 If+0.062
and+0.061
'.+0.060
 Today+0.059
Neg logit contributions
 forgot-0.058
 remembered-0.058
 wish-0.053
 pin-0.053
 lost-0.052
Token keep
Token ID1394
Feature activation-7.11e-03

Pos logit contributions
 forgot+0.020
 remembered+0.019
 sne+0.018
 wish+0.018
 blew+0.017
Neg logit contributions
 This-0.046
 If-0.046
and-0.044
'.-0.044
.'-0.043
Token her
Token ID607
Feature activation-3.81e-02

Pos logit contributions
 remembered+0.013
 sne+0.013
 forgot+0.013
 blew+0.013
 pin+0.013
Neg logit contributions
 If-0.011
 This-0.011
and-0.011
'.-0.011
.'-0.010
Token dresses
Token ID27309
Feature activation+3.80e-02

Pos logit contributions
 forgot+0.044
 remembered+0.043
 blew+0.041
 sne+0.039
 ate+0.039
Neg logit contributions
 This-0.084
 If-0.083
'.-0.082
".-0.081
.'-0.080
Token away
Token ID1497
Feature activation+7.53e-02

Pos logit contributions
 If+0.068
 This+0.066
 That+0.061
 You+0.059
 For+0.058
Neg logit contributions
 forgot-0.068
 bit-0.065
 remembered-0.064
 decorated-0.063
 spark-0.062
Token from
Token ID422
Feature activation+1.97e-02

Pos logit contributions
 This+0.134
 If+0.133
 Those+0.118
 These+0.117
 That+0.117
Neg logit contributions
 bit-0.135
 forgot-0.133
 remembered-0.129
 spark-0.128
 fit-0.124
Token her
Token ID607
Feature activation-6.98e-03

Pos logit contributions
 This+0.037
 If+0.036
and+0.035
uts+0.034
'.+0.034
Neg logit contributions
 remembered-0.032
 sne-0.032
 forgot-0.032
 wish-0.031
 lost-0.031
Token dog
Token ID3290
Feature activation+6.07e-02

Pos logit contributions
 forgot+0.010
 remembered+0.010
 wish+0.009
 sne+0.009
 bit+0.009
Neg logit contributions
 If-0.015
 This-0.015
'.-0.014
and-0.014
 That-0.013
Token play
Token ID711
Feature activation+2.31e-02

Pos logit contributions
 forgot+0.025
 remembered+0.024
 sne+0.023
 blew+0.023
 wish+0.023
Neg logit contributions
 This-0.052
 If-0.051
'.-0.050
and-0.050
".-0.049
Token together
Token ID1978
Feature activation+7.53e-02

Pos logit contributions
 If+0.036
 This+0.036
'.+0.035
and+0.034
uts+0.034
Neg logit contributions
 pin-0.040
 sne-0.040
 saw-0.040
 wish-0.040
 hurt-0.039
Token,
Token ID11
Feature activation+2.73e-02

Pos logit contributions
 This+0.127
 If+0.122
 Those+0.116
 That+0.114
arl+0.112
Neg logit contributions
 bit-0.126
 cut-0.125
 sne-0.125
 fit-0.123
 fl-0.122
Token but
Token ID475
Feature activation-5.12e-03

Pos logit contributions
'.+0.041
.'+0.039
 This+0.038
".+0.038
 If+0.037
Neg logit contributions
 forgot-0.055
 sne-0.055
 bit-0.053
 pin-0.052
 wish-0.051
Token it
Token ID340
Feature activation-9.29e-02

Pos logit contributions
 forgot+0.011
 remembered+0.011
 sne+0.011
 blew+0.011
 bit+0.011
Neg logit contributions
'.-0.007
 This-0.006
.'-0.006
and-0.006
 If-0.006
Token was
Token ID373
Feature activation-2.21e-02

Pos logit contributions
 groom+0.156
Ta+0.155
 wish+0.151
 remembered+0.147
ited+0.147
Neg logit contributions
-0.125
.-0.121
 If-0.118
 They-0.113
 This-0.112
Token hard
Token ID1327
Feature activation+3.00e-02

Pos logit contributions
 forgot+0.040
 remembered+0.039
 blew+0.037
 sne+0.037
 bit+0.037
Neg logit contributions
 This-0.037
 If-0.036
'.-0.035
and-0.034
.'-0.034
Token to
Token ID284
Feature activation-7.16e-03

Pos logit contributions
 This+0.040
 If+0.039
'.+0.038
and+0.036
 Everyone+0.035
Neg logit contributions
 sne-0.060
 forgot-0.060
 remembered-0.059
 wish-0.058
 lost-0.058
Token talk
Token ID1561
Feature activation+7.09e-02

Pos logit contributions
 forgot+0.007
 sne+0.006
 remembered+0.006
 blew+0.006
 wish+0.006
Neg logit contributions
 This-0.018
 If-0.018
and-0.018
'.-0.017
 That-0.017
Token.
Token ID13
Feature activation+8.27e-02

Pos logit contributions
 This+0.100
 These+0.094
 Those+0.093
ummy+0.091
overs+0.091
Neg logit contributions
 remembered-0.137
 forgot-0.136
 did-0.135
 mean-0.134
 bit-0.133
Token
Token ID198
Feature activation+5.37e-02

Pos logit contributions
fulness+0.155
ancing+0.146
fully+0.139
ently+0.136
ing+0.136
Neg logit contributions
 saw-0.115
 did-0.109
 threw-0.107
 wished-0.107
 wondered-0.105
Token took
Token ID1718
Feature activation+2.71e-02

Pos logit contributions
 This+0.021
and+0.021
 If+0.021
'.+0.020
uts+0.020
Neg logit contributions
 remembered-0.009
 forgot-0.009
 sne-0.008
 blew-0.008
 saw-0.008
Token him
Token ID683
Feature activation+3.59e-02

Pos logit contributions
uts+0.041
and+0.040
but+0.037
 This+0.036
 Today+0.036
Neg logit contributions
 sne-0.053
 dropped-0.050
 cut-0.050
 remembered-0.049
 bit-0.049
Token to
Token ID284
Feature activation+8.81e-03

Pos logit contributions
 If+0.055
 This+0.054
uts+0.053
 You+0.052
and+0.051
Neg logit contributions
 remembered-0.062
 dropped-0.062
 sne-0.062
 lost-0.062
 bit-0.062
Token the
Token ID262
Feature activation-2.67e-02

Pos logit contributions
and+0.016
uts+0.016
 If+0.016
 This+0.016
'.+0.015
Neg logit contributions
 sne-0.015
 remembered-0.014
 cut-0.013
 wish-0.013
 forgot-0.013
Token doctor
Token ID6253
Feature activation+6.66e-02

Pos logit contributions
 forgot+0.036
 remembered+0.035
 blew+0.033
 sne+0.033
 bit+0.032
Neg logit contributions
 This-0.055
 If-0.054
'.-0.053
and-0.052
".-0.051
Token's
Token ID338
Feature activation+1.01e-03

Pos logit contributions
fulness+0.110
uts+0.109
 If+0.107
ula+0.105
 This+0.101
Neg logit contributions
 sne-0.124
 saw-0.124
 bit-0.122
 cut-0.120
 remembered-0.116
Token office
Token ID2607
Feature activation+7.36e-02

Pos logit contributions
 This+0.002
 If+0.002
'.+0.002
uts+0.002
and+0.002
Neg logit contributions
 sne-0.001
 remembered-0.001
 wish-0.001
 bit-0.001
 forgot-0.001
Token.
Token ID13
Feature activation+8.38e-02

Pos logit contributions
 If+0.098
 This+0.094
and+0.090
 Today+0.089
irs+0.088
Neg logit contributions
 saw-0.153
 bit-0.152
 remembered-0.150
 cut-0.148
 forgot-0.146
Token The
Token ID383
Feature activation-4.89e-02

Pos logit contributions
fulness+0.130
ings+0.124
ently+0.123
and+0.116
ancing+0.115
Neg logit contributions
 saw-0.151
 sne-0.150
 bit-0.150
 forgot-0.149
 ate-0.148
Token doctor
Token ID6253
Feature activation-5.25e-02

Pos logit contributions
 forgot+0.085
 remembered+0.082
 blew+0.078
 ate+0.076
 chased+0.075
Neg logit contributions
 This-0.080
 If-0.080
'.-0.076
".-0.074
and-0.073
Token said
Token ID531
Feature activation+3.36e-02

Pos logit contributions
 forgot+0.052
 remembered+0.050
 wish+0.048
 dove+0.047
 blew+0.046
Neg logit contributions
 This-0.119
 If-0.118
'.-0.115
".-0.113
-0.112
Token the
Token ID262
Feature activation-4.41e-02

Pos logit contributions
 forgot+0.064
 remembered+0.062
 blew+0.060
 sne+0.059
 wish+0.058
Neg logit contributions
 This-0.045
 If-0.045
'.-0.043
".-0.041
.'-0.041
Token panc
Token ID22154
Feature activation-2.40e-02

Pos logit contributions
 forgot+0.064
 remembered+0.062
 blew+0.056
 ate+0.055
 bit+0.054
Neg logit contributions
 This-0.087
 If-0.087
'.-0.082
".-0.081
and-0.080
Tokenake
Token ID539
Feature activation+3.20e-02

Pos logit contributions
 forgot+0.052
 remembered+0.051
 sne+0.049
 blew+0.049
 bit+0.048
Neg logit contributions
 This-0.030
 If-0.030
'.-0.028
and-0.027
".-0.027
Token mix
Token ID5022
Feature activation+4.17e-02

Pos logit contributions
 If+0.044
 This+0.040
and+0.039
'.+0.035
.'+0.034
Neg logit contributions
 sne-0.072
 blew-0.072
 cut-0.070
 melted-0.070
 bit-0.069
Token and
Token ID290
Feature activation-4.44e-02

Pos logit contributions
 This+0.065
 If+0.064
 Today+0.058
'.+0.054
 By+0.054
Neg logit contributions
 bit-0.083
 melted-0.080
 blew-0.079
 sne-0.079
 cut-0.078
Token they
Token ID484
Feature activation-8.16e-02

Pos logit contributions
 forgot+0.049
 remembered+0.049
 wish+0.047
 dove+0.045
 sne+0.044
Neg logit contributions
 This-0.096
 If-0.095
'.-0.091
".-0.090
and-0.090
Token made
Token ID925
Feature activation+1.25e-02

Pos logit contributions
 wish+0.094
 dove+0.091
 boo+0.087
 feather+0.086
 buzz+0.082
Neg logit contributions
-0.161
.-0.156
'.-0.149
 together-0.146
 She-0.143
Token pancakes
Token ID47008
Feature activation+2.66e-02

Pos logit contributions
 This+0.022
 If+0.022
'.+0.022
.'+0.021
".+0.021
Neg logit contributions
 sne-0.021
 wish-0.019
 dropped-0.019
 blew-0.018
 forgot-0.018
Token together
Token ID1978
Feature activation+3.86e-02

Pos logit contributions
 If+0.039
 This+0.037
'.+0.032
 That+0.032
 You+0.032
Neg logit contributions
 sne-0.056
 forgot-0.055
 ate-0.055
 cut-0.054
 blew-0.054
Token.
Token ID13
Feature activation+5.90e-02

Pos logit contributions
 If+0.049
 This+0.045
 You+0.042
 That+0.039
 Today+0.038
Neg logit contributions
 bit-0.090
 forgot-0.090
 ate-0.090
 cut-0.088
 blew-0.087
Token Lily
Token ID20037
Feature activation-9.54e-02

Pos logit contributions
ings+0.101
.'+0.096
fulness+0.095
ions+0.092
'.+0.091
Neg logit contributions
 forgot-0.105
 ate-0.102
 drank-0.093
 threw-0.091
 saw-0.090
Token Sam
Token ID3409
Feature activation-5.02e-02

Pos logit contributions
.'+0.106
and+0.106
fulness+0.100
".+0.096
.'"+0.096
Neg logit contributions
 forgot-0.107
 saw-0.103
 remembered-0.103
 threw-0.099
 realized-0.099
Token was
Token ID373
Feature activation-2.34e-02

Pos logit contributions
 forgot+0.057
 wish+0.055
 remembered+0.055
 sne+0.052
 blew+0.052
Neg logit contributions
 This-0.106
 If-0.104
'.-0.103
".-0.102
-0.100
Token relieved
Token ID22598
Feature activation+5.01e-02

Pos logit contributions
 forgot+0.036
 remembered+0.035
 sne+0.033
 blew+0.033
 bit+0.032
Neg logit contributions
 This-0.045
 If-0.044
'.-0.043
and-0.042
".-0.042
Token,
Token ID11
Feature activation+9.78e-03

Pos logit contributions
 If+0.098
 This+0.094
ays+0.093
and+0.093
ugs+0.092
Neg logit contributions
 remembered-0.078
 sne-0.076
 knew-0.075
 forgot-0.074
 bit-0.074
Token but
Token ID475
Feature activation-1.98e-02

Pos logit contributions
 This+0.015
 If+0.015
'.+0.015
.'+0.014
".+0.014
Neg logit contributions
 forgot-0.020
 remembered-0.019
 sne-0.019
 bit-0.019
 blew-0.018
Token he
Token ID339
Feature activation-7.41e-02

Pos logit contributions
 forgot+0.040
 remembered+0.040
 blew+0.038
 sne+0.038
 bit+0.037
Neg logit contributions
 This-0.028
 If-0.028
'.-0.027
".-0.026
.'-0.026
Token never
Token ID1239
Feature activation-6.93e-02

Pos logit contributions
 sne+0.079
 wish+0.078
 blew+0.077
 forgot+0.076
 boo+0.076
Neg logit contributions
-0.147
'.-0.139
 If-0.136
.-0.133
".-0.133
Token forgot
Token ID16453
Feature activation+4.31e-02

Pos logit contributions
 wish+0.043
 stuffed+0.043
inge+0.040
 comb+0.040
 groom+0.039
Neg logit contributions
-0.165
 This-0.164
.-0.161
".-0.159
and-0.159
Token how
Token ID703
Feature activation+1.07e-02

Pos logit contributions
 Today+0.080
 If+0.080
 This+0.079
and+0.079
'.+0.078
Neg logit contributions
 remembered-0.065
 forgot-0.065
 sne-0.064
 hurt-0.062
 sw-0.061
Token brave
Token ID14802
Feature activation+3.66e-02

Pos logit contributions
 If+0.025
 This+0.025
'.+0.024
 Today+0.024
and+0.024
Neg logit contributions
 forgot-0.013
 remembered-0.012
 hurt-0.011
 sne-0.011
 dropped-0.011
Token he
Token ID339
Feature activation-6.26e-02

Pos logit contributions
 If+0.067
'.+0.065
.'+0.065
 This+0.065
and+0.065
Neg logit contributions
 sne-0.063
 dove-0.062
 forgot-0.062
 sw-0.061
 remembered-0.060
Token dog
Token ID3290
Feature activation-4.47e-02

Pos logit contributions
 forgot+0.063
 remembered+0.060
 blew+0.056
 bit+0.052
 ate+0.052
Neg logit contributions
 This-0.102
'.-0.095
 If-0.095
".-0.094
-0.093
Token were
Token ID547
Feature activation-2.48e-02

Pos logit contributions
 forgot+0.060
 remembered+0.058
 blew+0.054
 wish+0.053
 sne+0.052
Neg logit contributions
 This-0.094
 If-0.089
'.-0.088
".-0.087
-0.087
Token playing
Token ID2712
Feature activation-2.25e-02

Pos logit contributions
 forgot+0.043
 remembered+0.042
 sne+0.040
 blew+0.040
 bit+0.039
Neg logit contributions
 This-0.042
 If-0.042
'.-0.040
and-0.039
".-0.039
Token together
Token ID1978
Feature activation+2.21e-02

Pos logit contributions
 forgot+0.047
 remembered+0.046
 sne+0.045
 blew+0.045
 bit+0.044
Neg logit contributions
 This-0.029
 If-0.029
'.-0.028
and-0.027
".-0.027
Token.
Token ID13
Feature activation+3.61e-03

Pos logit contributions
 This+0.036
 If+0.035
uts+0.032
and+0.032
 That+0.032
Neg logit contributions
 bit-0.041
 sne-0.039
 fl-0.039
 blew-0.038
 cut-0.038
Token They
Token ID1119
Feature activation-9.15e-02

Pos logit contributions
and+0.004
.'+0.004
'.+0.004
".+0.003
 If+0.003
Neg logit contributions
 forgot-0.009
 ate-0.009
 remembered-0.009
 saw-0.009
 sne-0.009
Token liked
Token ID8288
Feature activation-1.42e-02

Pos logit contributions
 wish+0.136
ited+0.131
 scrape+0.126
inge+0.126
 promise+0.122
Neg logit contributions
-0.167
 They-0.164
.-0.160
 She-0.155
 "-0.146
Token to
Token ID284
Feature activation-6.22e-02

Pos logit contributions
 forgot+0.034
 remembered+0.034
 sne+0.033
 blew+0.033
 bit+0.033
Neg logit contributions
'.-0.014
 If-0.014
 This-0.014
.'-0.013
".-0.013
Token play
Token ID711
Feature activation-1.88e-02

Pos logit contributions
 remembered+0.083
 forgot+0.082
 regretted+0.082
 blew+0.076
 wished+0.071
Neg logit contributions
 This-0.127
 If-0.118
".-0.117
'.-0.116
-0.116
Token and
Token ID290
Feature activation-5.99e-02

Pos logit contributions
 forgot+0.038
 remembered+0.038
 sne+0.037
 blew+0.036
 bit+0.036
Neg logit contributions
 This-0.026
 If-0.026
'.-0.025
and-0.024
.'-0.023
Token run
Token ID1057
Feature activation-5.88e-03

Pos logit contributions
 remembered+0.075
 forgot+0.071
 blew+0.069
 wish+0.069
ited+0.068
Neg logit contributions
 This-0.127
 If-0.118
 They-0.116
-0.115
".-0.112
Token Lily
Token ID20037
Feature activation+2.89e-02

Pos logit contributions
uts+0.020
and+0.019
'.+0.019
.'+0.019
but+0.019
Neg logit contributions
 sne-0.022
 dropped-0.021
 bit-0.021
 cut-0.020
 remembered-0.020
Token to
Token ID284
Feature activation+8.00e-03

Pos logit contributions
 If+0.045
'.+0.044
 This+0.044
".+0.043
fulness+0.043
Neg logit contributions
 bit-0.048
 cut-0.048
 sne-0.048
 dropped-0.047
 lost-0.047
Token the
Token ID262
Feature activation-3.15e-02

Pos logit contributions
 If+0.014
 This+0.014
uts+0.014
and+0.014
'.+0.014
Neg logit contributions
 sne-0.013
 cut-0.012
 wish-0.012
 remembered-0.012
 forgot-0.012
Token bathroom
Token ID12436
Feature activation+5.59e-02

Pos logit contributions
 forgot+0.047
 remembered+0.045
 blew+0.043
 sne+0.042
 bit+0.042
Neg logit contributions
 This-0.061
 If-0.060
'.-0.058
".-0.057
and-0.057
Token.
Token ID13
Feature activation+3.13e-02

Pos logit contributions
 This+0.079
 If+0.078
 You+0.073
 That+0.071
'.+0.070
Neg logit contributions
 bit-0.122
 cut-0.116
 forgot-0.115
 saw-0.115
 sne-0.113
Token She
Token ID1375
Feature activation-1.08e-01

Pos logit contributions
.'+0.042
'.+0.041
".+0.041
fulness+0.041
and+0.038
Neg logit contributions
 saw-0.069
 forgot-0.066
 dropped-0.064
 remembered-0.064
 smelled-0.064
Token cleaned
Token ID20750
Feature activation+2.15e-02

Pos logit contributions
 matter+0.155
 wish+0.153
ited+0.148
 urge+0.144
 itch+0.144
Neg logit contributions
-0.210
 "-0.176
 She-0.170
 They-0.170
.-0.165
Token her
Token ID607
Feature activation-3.65e-02

Pos logit contributions
and+0.041
'.+0.041
.'+0.040
uts+0.039
 This+0.038
Neg logit contributions
 pin-0.032
 dropped-0.032
 sne-0.031
 hurt-0.030
 spilled-0.030
Token knee
Token ID10329
Feature activation+2.07e-02

Pos logit contributions
 forgot+0.049
 remembered+0.047
 blew+0.044
 sne+0.042
 ate+0.042
Neg logit contributions
 This-0.077
 If-0.075
'.-0.073
".-0.072
.'-0.071
Token and
Token ID290
Feature activation-7.54e-02

Pos logit contributions
'.+0.036
 If+0.035
 This+0.035
.'+0.034
 You+0.032
Neg logit contributions
 bit-0.036
 forgot-0.036
 hurt-0.035
 cut-0.035
 remembered-0.034
Token put
Token ID1234
Feature activation+2.54e-02

Pos logit contributions
 wish+0.086
ited+0.079
 swoop+0.074
 forgot+0.073
 remembered+0.072
Neg logit contributions
-0.151
 She-0.141
 If-0.140
.-0.140
 This-0.139
Token but
Token ID475
Feature activation-1.93e-02

Pos logit contributions
 forgot+0.011
 sne+0.011
 remembered+0.011
 blew+0.011
 bit+0.011
Neg logit contributions
'.-0.007
 If-0.007
 This-0.007
.'-0.007
".-0.007
Token she
Token ID673
Feature activation-9.29e-02

Pos logit contributions
 forgot+0.044
 remembered+0.043
 blew+0.041
 sne+0.041
 bit+0.040
Neg logit contributions
 This-0.023
 If-0.023
'.-0.023
and-0.022
.'-0.021
Token was
Token ID373
Feature activation-6.09e-02

Pos logit contributions
 groom+0.143
 swoop+0.138
 scar+0.137
 feather+0.137
 wave+0.136
Neg logit contributions
-0.164
.-0.143
 She-0.135
and-0.126
 This-0.126
Token happy
Token ID3772
Feature activation+1.50e-02

Pos logit contributions
 forgot+0.093
 blew+0.090
 wish+0.089
 remembered+0.088
ited+0.087
Neg logit contributions
 If-0.111
 This-0.108
-0.106
 She-0.103
 That-0.098
Token.
Token ID13
Feature activation+3.34e-02

Pos logit contributions
 If+0.024
 This+0.021
'.+0.020
 Those+0.018
.'+0.018
Neg logit contributions
 remembered-0.030
 sne-0.030
 saw-0.030
 forgot-0.030
 bit-0.029
Token She
Token ID1375
Feature activation-1.01e-01

Pos logit contributions
.'+0.041
fulness+0.040
'.+0.039
ions+0.037
ently+0.037
Neg logit contributions
 saw-0.071
 forgot-0.068
 ate-0.066
 smelled-0.066
 remembered-0.064
Token liked
Token ID8288
Feature activation-2.10e-02

Pos logit contributions
inge+0.126
Uh+0.118
ited+0.117
 wish+0.116
 trace+0.113
Neg logit contributions
-0.203
 She-0.181
.-0.177
 They-0.177
 He-0.165
Token the
Token ID262
Feature activation-4.80e-02

Pos logit contributions
 forgot+0.052
 remembered+0.051
 sne+0.050
 blew+0.050
 bit+0.049
Neg logit contributions
 This-0.020
 If-0.020
'.-0.019
.'-0.018
".-0.017
Token cold
Token ID4692
Feature activation-1.05e-02

Pos logit contributions
 forgot+0.075
 remembered+0.073
 blew+0.067
 dropped+0.065
 bit+0.065
Neg logit contributions
 This-0.090
 If-0.087
".-0.086
-0.086
'.-0.084
Token air
Token ID1633
Feature activation+2.36e-02

Pos logit contributions
 forgot+0.015
 sne+0.015
 remembered+0.014
 bit+0.014
 blew+0.014
Neg logit contributions
 If-0.021
 This-0.021
'.-0.020
.'-0.020
and-0.020
Token on
Token ID319
Feature activation+1.89e-02

Pos logit contributions
 If+0.037
 This+0.035
and+0.033
'.+0.032
.'+0.030
Neg logit contributions
 sne-0.047
 bit-0.047
 forgot-0.047
 blew-0.046
 ate-0.045