TOP ACTIVATIONS
MAX = 0.394

would take out a j igsaw puzzle and try to solve
It was a big j igsaw puzzle and she wanted to
box and saw a j igsaw puzzle inside . He paused
. The ant was sad when she finished
The lazy ant replied , " I don
But it was a brave ant , so it set off
day a boss y little ant came along and decided he
The hard working ant said , " If you
Once , there was an ant . He was very hungry
The lazy ant thought for a moment and
there was a brave little ant . He wanted to go
! â The ant asked ,
Once , there was an ant who was very curious .
time there was a little ant called Tom . He was
back and watched as the ant broke free from the box
So she grabbed the ant with her free hand and
off ." The ant was a bit annoyed ,
! â said the ant . <|endoftext|> M ama was
food . The ant couldn 't believe it .
, there was a little ant who loved to crawl everywhere

MIN ACTIVATIONS
MIN = -0.374

the owner smiled . " Looks like you two are friends
laughed and said , " Looks like that apple made you
smiled and said , " Looks like you like the wardrobe
's mom my . " Looks like the wave destroyed all
table . " Looks like mom has some mail
smiled . " Looks like the frog likes you
felt bad and wished he hadn 't done that . He
. The neighbor said they hadn 't seen the toy but
She was glad that she hadn 't sne ez ed this
was sad and wished she hadn 't reversed the books .
see the meat , he hadn 't eaten in a long
seen any onions but they hadn 't . After
the boy replied that he hadn 't . But then ,
in the garden that she hadn 't seen before . It
to his seat . He hadn 't been too worried about
. He was happy he hadn 't been hurt . <|endoftext|>
toys and books that he hadn 't tried anything new .
sad . He wished he hadn 't given away his special
were so surprised . They hadn 't thought she was awake
. He was glad he hadn 't stayed afraid behind the

INTERVAL 0.202 - 0.394
CONTAINS 4.570%

She saw some glue on the floor . She thought it
stop slapping and to share the toys . She says slapping
Mom and went back to the garden . They took their
When he entered the living room , he saw
do it . He dropped the balls every time . But

INTERVAL 0.010 - 0.202
CONTAINS 49.853%

John and his mom finished their shopping and went home .
to the elevator . <|endoftext|> Tom and Sam are friends .
. Tim my felt bad and decided to share his tools
the park with his mom . He saw a big monkey
a time , there was a little boy named Tim my

INTERVAL -0.182 - 0.010
CONTAINS 34.639%

waiting for him . John opened the lid and
draw something too ?" Ben says . " Sure
ball in the park . She threw the ball high up
Anna was so happy to spend the rainy day outside watching
was proud and happy . She sang a song with them

INTERVAL -0.374 - -0.182
CONTAINS 10.937%

old came along . She stared at the hook in amaz
how sad Lily was and decided to give her his favorite
Hey D ucky , why don 't you come swim with
was just hungry . Please don 't bite me !" he
anything to eat . It felt very sad and lonely .
Token would
Token ID561
Feature activation-2.55e-01

Pos logit contributions
itates+0.129
 hadn+0.115
Looks+0.110
vere+0.109
bled+0.108
Neg logit contributions
 oak-0.260
 newspaper-0.244
 cane-0.244
 pine-0.244
 goat-0.242
Token take
Token ID1011
Feature activation-1.55e-01

Pos logit contributions
itates+0.497
 hadn+0.462
Looks+0.454
bled+0.449
vere+0.444
Neg logit contributions
 oak-0.438
 cane-0.398
 newspaper-0.397
 pine-0.396
 pony-0.388
Token out
Token ID503
Feature activation+2.70e-02

Pos logit contributions
 never+0.204
bled+0.204
itates+0.202
hes+0.202
 not+0.202
Neg logit contributions
 oak-0.317
 goat-0.305
 pigeon-0.288
 bull-0.287
 newspaper-0.281
Token a
Token ID257
Feature activation+1.10e-01

Pos logit contributions
 goat+0.055
 oak+0.054
 pigeon+0.053
round+0.053
affe+0.052
Neg logit contributions
 also-0.036
 not-0.033
Looks-0.032
hes-0.032
bled-0.031
Token j
Token ID474
Feature activation-2.29e-02

Pos logit contributions
 goat+0.161
 oak+0.159
round+0.150
ican+0.146
 bull+0.144
Neg logit contributions
itates-0.201
'd-0.200
vere-0.200
 hadn-0.200
 also-0.197
Tokenigsaw
Token ID45636
Feature activation+3.94e-01

Pos logit contributions
vere+0.028
itates+0.027
 refreshed+0.027
izes+0.025
 accomplished+0.024
Neg logit contributions
 goat-0.049
 oak-0.047
 pony-0.047
 newspaper-0.046
 bull-0.046
Token puzzle
Token ID15027
Feature activation+2.09e-01

Pos logit contributions
 oak+0.496
 goat+0.495
 pine+0.451
 pigeon+0.428
 bull+0.422
Neg logit contributions
itates-0.859
vere-0.857
 also-0.809
 hadn-0.809
izes-0.799
Token and
Token ID290
Feature activation+8.74e-02

Pos logit contributions
 oak+0.370
 goat+0.358
 bull+0.344
 pony+0.339
 pigeon+0.339
Neg logit contributions
 also-0.342
vere-0.332
 not-0.328
 never-0.327
 hadn-0.311
Token try
Token ID1949
Feature activation-1.46e-01

Pos logit contributions
 oak+0.156
 cane+0.142
 pine+0.142
 newspaper+0.142
 goat+0.136
Neg logit contributions
itates-0.172
 hadn-0.154
Looks-0.153
bled-0.153
vere-0.152
Token to
Token ID284
Feature activation+1.09e-01

Pos logit contributions
itates+0.315
 hadn+0.299
vere+0.290
Looks+0.288
bled+0.286
Neg logit contributions
 oak-0.211
 goat-0.189
 newspaper-0.187
 cane-0.187
 pine-0.186
Token solve
Token ID8494
Feature activation-1.91e-03

Pos logit contributions
 oak+0.198
 cane+0.180
 newspaper+0.179
 pine+0.179
igsaw+0.176
Neg logit contributions
itates-0.205
 hadn-0.190
Looks-0.186
bled-0.184
vere-0.177
Token It
Token ID632
Feature activation-1.39e-01

Pos logit contributions
 oak+0.138
 newspaper+0.126
 cane+0.125
igsaw+0.125
 pine+0.124
Neg logit contributions
itates-0.183
 hadn-0.174
vere-0.169
bled-0.169
Looks-0.168
Token was
Token ID373
Feature activation-2.30e-01

Pos logit contributions
itates+0.221
vere+0.198
bled+0.194
Looks+0.185
izes+0.181
Neg logit contributions
 oak-0.292
 pine-0.273
 cane-0.268
 pony-0.268
 newspaper-0.267
Token a
Token ID257
Feature activation+1.09e-01

Pos logit contributions
itates+0.488
 hadn+0.463
Looks+0.448
vere+0.444
bled+0.444
Neg logit contributions
 oak-0.346
 newspaper-0.312
 cane-0.311
 pine-0.310
 goat-0.309
Token big
Token ID1263
Feature activation+1.70e-01

Pos logit contributions
 oak+0.169
igsaw+0.151
 goat+0.151
 newspaper+0.149
 cane+0.149
Neg logit contributions
itates-0.216
 hadn-0.214
 also-0.199
Looks-0.198
vere-0.197
Token j
Token ID474
Feature activation-2.27e-02

Pos logit contributions
 oak+0.185
igsaw+0.173
 goat+0.171
 newspaper+0.161
 cane+0.161
Neg logit contributions
 hadn-0.377
 also-0.367
'd-0.352
itates-0.351
 definitely-0.351
Tokenigsaw
Token ID45636
Feature activation+3.91e-01

Pos logit contributions
itates+0.026
vere+0.025
 refreshed+0.024
 hadn+0.022
 certainly+0.022
Neg logit contributions
 goat-0.049
 oak-0.049
 newspaper-0.047
 pine-0.047
 cane-0.047
Token puzzle
Token ID15027
Feature activation+2.08e-01

Pos logit contributions
 oak+0.480
 goat+0.449
 pine+0.433
 bull+0.396
 cane+0.384
Neg logit contributions
 hadn-0.881
itates-0.875
vere-0.858
'd-0.843
 also-0.839
Token and
Token ID290
Feature activation+8.86e-02

Pos logit contributions
 oak+0.330
 goat+0.305
 pine+0.299
 cane+0.292
 bull+0.291
Neg logit contributions
 hadn-0.396
itates-0.390
vere-0.380
 also-0.380
'd-0.367
Token she
Token ID673
Feature activation-9.84e-02

Pos logit contributions
 oak+0.123
 newspaper+0.119
 cane+0.115
 pine+0.113
 pony+0.112
Neg logit contributions
itates-0.200
vere-0.184
bled-0.175
izes-0.172
eking-0.171
Token wanted
Token ID2227
Feature activation-2.49e-01

Pos logit contributions
itates+0.154
vere+0.149
Looks+0.145
bled+0.144
 Deal+0.142
Neg logit contributions
 oak-0.198
 cane-0.187
 pine-0.184
 pony-0.181
 newspaper-0.180
Token to
Token ID284
Feature activation+1.10e-01

Pos logit contributions
itates+0.582
 hadn+0.561
Looks+0.539
vere+0.537
bled+0.534
Neg logit contributions
 oak-0.317
 goat-0.278
 cane-0.276
 pine-0.276
 newspaper-0.275
Token box
Token ID3091
Feature activation+9.89e-02

Pos logit contributions
 oak+0.271
 pine+0.236
 goat+0.236
 newspaper+0.233
 cane+0.232
Neg logit contributions
itates-0.490
 hadn-0.471
Looks-0.451
bled-0.447
vere-0.445
Token and
Token ID290
Feature activation+8.51e-02

Pos logit contributions
 oak+0.096
 newspaper+0.081
 cane+0.081
 pine+0.080
 goat+0.080
Neg logit contributions
itates-0.262
 hadn-0.252
Looks-0.245
vere-0.244
bled-0.244
Token saw
Token ID2497
Feature activation-1.20e-01

Pos logit contributions
 oak+0.165
 pine+0.154
 newspaper+0.153
 cane+0.153
 goat+0.151
Neg logit contributions
itates-0.145
 hadn-0.131
vere-0.130
Looks-0.128
bled-0.124
Token a
Token ID257
Feature activation+1.07e-01

Pos logit contributions
itates+0.248
 hadn+0.236
Looks+0.228
bled+0.227
vere+0.226
Neg logit contributions
 oak-0.188
 newspaper-0.170
 cane-0.170
 goat-0.169
 pine-0.168
Token j
Token ID474
Feature activation-2.28e-02

Pos logit contributions
 oak+0.143
 newspaper+0.125
 pine+0.125
 cane+0.125
 goat+0.124
Neg logit contributions
itates-0.241
 hadn-0.232
Looks-0.223
vere-0.221
bled-0.219
Tokenigsaw
Token ID45636
Feature activation+3.88e-01

Pos logit contributions
itates+0.036
vere+0.033
 hadn+0.032
bled+0.032
 refreshed+0.032
Neg logit contributions
 oak-0.044
 goat-0.042
 newspaper-0.042
 pony-0.041
 cane-0.041
Token puzzle
Token ID15027
Feature activation+2.04e-01

Pos logit contributions
 oak+0.462
 goat+0.424
 pine+0.407
 bull+0.380
 pony+0.378
Neg logit contributions
itates-0.916
 hadn-0.907
vere-0.873
 also-0.857
Looks-0.845
Token inside
Token ID2641
Feature activation+9.23e-02

Pos logit contributions
 oak+0.273
 newspaper+0.244
 cane+0.243
 pine+0.242
 goat+0.239
Neg logit contributions
itates-0.470
 hadn-0.442
Looks-0.433
bled-0.427
vere-0.426
Token.
Token ID13
Feature activation+8.53e-02

Pos logit contributions
 oak+0.091
 newspaper+0.081
 pine+0.079
 cane+0.076
 goat+0.072
Neg logit contributions
itates-0.248
Looks-0.227
vere-0.227
bled-0.226
 hadn-0.224
Token He
Token ID679
Feature activation-1.55e-01

Pos logit contributions
 oak+0.137
 newspaper+0.125
 pine+0.124
 cane+0.123
 goat+0.122
Neg logit contributions
itates-0.174
 hadn-0.163
Looks-0.159
vere-0.158
bled-0.157
Token paused
Token ID24487
Feature activation-1.60e-01

Pos logit contributions
itates+0.249
vere+0.228
Looks+0.219
izes+0.208
bled+0.207
Neg logit contributions
 oak-0.318
 pine-0.305
 goat-0.296
 newspaper-0.296
 cane-0.295
Token.
Token ID13
Feature activation+9.05e-02

Pos logit contributions
 oak+0.001
igsaw+0.001
 goat+0.001
 cane+0.001
 pine+0.001
Neg logit contributions
itates-0.001
 hadn-0.001
vere-0.001
Looks-0.001
 also-0.001
Token
Token ID220
Feature activation-2.00e-01

Pos logit contributions
 oak+0.166
 newspaper+0.152
 pine+0.152
 cane+0.152
 goat+0.151
Neg logit contributions
itates-0.163
 hadn-0.152
Looks-0.146
bled-0.146
vere-0.145
Token
Token ID198
Feature activation-6.01e-02

Pos logit contributions
itates+0.386
 hadn+0.367
Looks+0.357
bled+0.352
vere+0.347
Neg logit contributions
 oak-0.329
 pine-0.303
 newspaper-0.302
 cane-0.300
igsaw-0.299
Token
Token ID198
Feature activation-5.85e-02

Pos logit contributions
itates+0.126
bled+0.121
 hadn+0.120
vere+0.115
eking+0.114
Neg logit contributions
 oak-0.086
 pine-0.084
 pony-0.076
 newspaper-0.076
igsaw-0.076
TokenThe
Token ID464
Feature activation+1.04e-01

Pos logit contributions
itates+0.114
bled+0.106
 hadn+0.106
vere+0.103
izes+0.098
Neg logit contributions
 oak-0.097
 pine-0.092
 goat-0.088
 pony-0.088
 newspaper-0.088
Token ant
Token ID1885
Feature activation+3.36e-01

Pos logit contributions
 oak+0.161
 cane+0.147
 newspaper+0.147
igsaw+0.146
 goat+0.145
Neg logit contributions
itates-0.210
 hadn-0.202
Looks-0.194
vere-0.191
bled-0.189
Token was
Token ID373
Feature activation-2.26e-01

Pos logit contributions
 oak+0.746
 pine+0.696
 newspaper+0.694
 cane+0.693
 goat+0.692
Neg logit contributions
itates-0.476
 hadn-0.429
Looks-0.416
vere-0.414
bled-0.411
Token sad
Token ID6507
Feature activation-5.73e-02

Pos logit contributions
itates+0.381
vere+0.342
Looks+0.342
 hadn+0.328
bled+0.328
Neg logit contributions
 oak-0.442
 pine-0.419
 pony-0.417
 goat-0.404
 cane-0.404
Token when
Token ID618
Feature activation+4.26e-02

Pos logit contributions
itates+0.149
 hadn+0.143
Looks+0.139
vere+0.139
bled+0.138
Neg logit contributions
 oak-0.058
 newspaper-0.050
 cane-0.050
 pine-0.049
 goat-0.049
Token she
Token ID673
Feature activation-9.18e-02

Pos logit contributions
 oak+0.066
 pine+0.060
 cane+0.059
 newspaper+0.059
 goat+0.059
Neg logit contributions
itates-0.090
 hadn-0.084
Looks-0.081
vere-0.081
bled-0.081
Token finished
Token ID5201
Feature activation-2.23e-03

Pos logit contributions
itates+0.153
bled+0.142
vere+0.140
Looks+0.139
 hadn+0.127
Neg logit contributions
 oak-0.184
 pine-0.172
 cane-0.171
 newspaper-0.169
 pony-0.168
Token
Token ID220
Feature activation-2.07e-01

Pos logit contributions
 oak+0.019
 newspaper+0.017
 pine+0.017
 cane+0.017
 goat+0.017
Neg logit contributions
itates-0.019
 hadn-0.018
Looks-0.018
vere-0.017
bled-0.017
Token
Token ID198
Feature activation-6.82e-02

Pos logit contributions
itates+0.424
 hadn+0.413
Looks+0.405
bled+0.393
vere+0.391
Neg logit contributions
 oak-0.300
 newspaper-0.280
 goat-0.274
 pine-0.274
igsaw-0.273
Token
Token ID198
Feature activation-6.64e-02

Pos logit contributions
itates+0.133
 hadn+0.126
bled+0.122
vere+0.121
Looks+0.121
Neg logit contributions
 oak-0.112
 pine-0.104
 newspaper-0.103
 goat-0.102
 cane-0.101
TokenThe
Token ID464
Feature activation+9.96e-02

Pos logit contributions
itates+0.127
 hadn+0.119
bled+0.116
vere+0.114
Looks+0.113
Neg logit contributions
 oak-0.113
 newspaper-0.104
 pine-0.103
 goat-0.103
 cane-0.103
Token lazy
Token ID16931
Feature activation+1.10e-01

Pos logit contributions
 oak+0.123
 newspaper+0.109
 cane+0.108
 pine+0.108
 goat+0.107
Neg logit contributions
itates-0.236
 hadn-0.226
Looks-0.219
vere-0.218
bled-0.217
Token ant
Token ID1885
Feature activation+3.36e-01

Pos logit contributions
 oak+0.112
igsaw+0.108
 newspaper+0.101
 pine+0.096
 cane+0.095
Neg logit contributions
 hadn-0.257
itates-0.250
 also-0.244
'd-0.239
bled-0.238
Token replied
Token ID8712
Feature activation-1.72e-01

Pos logit contributions
 oak+0.721
 pine+0.672
 cane+0.669
 goat+0.668
 newspaper+0.667
Neg logit contributions
itates-0.499
vere-0.450
Looks-0.444
bled-0.440
 hadn-0.431
Token,
Token ID11
Feature activation+1.49e-01

Pos logit contributions
itates+0.483
 hadn+0.464
Looks+0.453
vere+0.450
bled+0.450
Neg logit contributions
 oak-0.142
 newspaper-0.116
 cane-0.115
 pine-0.115
 goat-0.114
Token "
Token ID366
Feature activation-6.70e-02

Pos logit contributions
 oak+0.250
 goat+0.230
 cane+0.229
 pine+0.228
 newspaper+0.227
Neg logit contributions
itates-0.293
 hadn-0.269
Looks-0.267
vere-0.264
bled-0.260
TokenI
Token ID40
Feature activation-1.90e-01

Pos logit contributions
itates+0.101
bled+0.098
vere+0.097
 hadn+0.094
eking+0.094
Neg logit contributions
 oak-0.129
 goat-0.126
 pigeon-0.126
 pine-0.124
 newspaper-0.123
Token don
Token ID836
Feature activation-2.32e-01

Pos logit contributions
bled+0.339
eking+0.307
bling+0.307
itates+0.301
vere+0.299
Neg logit contributions
 oak-0.349
 pine-0.339
 cane-0.332
 dog-0.329
 pigeon-0.329
Token But
Token ID887
Feature activation-9.82e-02

Pos logit contributions
 oak+0.163
 newspaper+0.150
 cane+0.150
 pine+0.149
 goat+0.149
Neg logit contributions
itates-0.162
 hadn-0.152
Looks-0.146
vere-0.145
bled-0.144
Token it
Token ID340
Feature activation+4.30e-03

Pos logit contributions
itates+0.201
 hadn+0.185
vere+0.183
Looks+0.182
bled+0.179
Neg logit contributions
 oak-0.159
 pine-0.145
 goat-0.144
 cane-0.144
 newspaper-0.143
Token was
Token ID373
Feature activation-2.27e-01

Pos logit contributions
 oak+0.009
 pine+0.009
 goat+0.009
 cane+0.008
 animal+0.008
Neg logit contributions
itates-0.007
vere-0.006
Looks-0.006
 Deal-0.006
bled-0.006
Token a
Token ID257
Feature activation+1.16e-01

Pos logit contributions
itates+0.412
Looks+0.372
vere+0.362
bled+0.359
izes+0.343
Neg logit contributions
 oak-0.422
 pine-0.402
 cane-0.398
 newspaper-0.388
 goat-0.383
Token brave
Token ID14802
Feature activation+4.62e-02

Pos logit contributions
 oak+0.194
 newspaper+0.175
 pine+0.175
 cane+0.174
igsaw+0.174
Neg logit contributions
itates-0.224
 hadn-0.215
Looks-0.203
vere-0.203
bled-0.199
Token ant
Token ID1885
Feature activation+3.35e-01

Pos logit contributions
 oak+0.059
igsaw+0.057
ados+0.055
 newspaper+0.054
 magazines+0.053
Neg logit contributions
 also-0.090
 hadn-0.089
'd-0.087
 ok-0.085
 suppose-0.084
Token,
Token ID11
Feature activation+1.55e-01

Pos logit contributions
 oak+0.483
igsaw+0.461
 newspaper+0.448
 cane+0.437
 pine+0.427
Neg logit contributions
 hadn-0.694
itates-0.684
 also-0.643
vere-0.637
'd-0.635
Token so
Token ID523
Feature activation-1.00e-01

Pos logit contributions
 oak+0.268
 goat+0.247
 pine+0.246
 newspaper+0.242
 cane+0.241
Neg logit contributions
itates-0.313
bled-0.283
vere-0.279
Looks-0.274
izes-0.264
Token it
Token ID340
Feature activation+3.09e-03

Pos logit contributions
itates+0.217
vere+0.192
Looks+0.192
 hadn+0.191
bled+0.189
Neg logit contributions
 oak-0.159
 pine-0.146
 goat-0.143
 cane-0.140
 newspaper-0.138
Token set
Token ID900
Feature activation-4.79e-02

Pos logit contributions
 oak+0.006
 pine+0.006
 goat+0.006
 cane+0.006
 horse+0.006
Neg logit contributions
itates-0.005
vere-0.005
Looks-0.005
 Deal-0.005
bled-0.005
Token off
Token ID572
Feature activation+4.23e-02

Pos logit contributions
itates+0.089
 hadn+0.084
Looks+0.081
vere+0.080
bled+0.080
Neg logit contributions
 oak-0.085
 newspaper-0.078
 cane-0.078
 pine-0.078
 goat-0.077
Token day
Token ID1110
Feature activation+1.12e-01

Pos logit contributions
itates+0.036
 hadn+0.035
 also+0.033
Looks+0.033
 refreshed+0.032
Neg logit contributions
 oak-0.025
 cane-0.023
 newspaper-0.023
 goat-0.023
igsaw-0.022
Token a
Token ID257
Feature activation+1.21e-01

Pos logit contributions
 oak+0.158
 newspaper+0.137
 automobile+0.137
igsaw+0.134
 cane+0.133
Neg logit contributions
 hadn-0.244
itates-0.230
Looks-0.227
bled-0.226
 also-0.225
Token boss
Token ID6478
Feature activation+8.76e-02

Pos logit contributions
 oak+0.179
igsaw+0.159
 cane+0.157
 pine+0.156
 newspaper+0.155
Neg logit contributions
itates-0.247
 hadn-0.244
vere-0.231
Looks-0.228
 also-0.228
Tokeny
Token ID88
Feature activation+1.35e-01

Pos logit contributions
 oak+0.109
igsaw+0.102
 newspaper+0.099
 cane+0.098
 pine+0.097
Neg logit contributions
itates-0.201
 hadn-0.200
 also-0.188
'd-0.187
Looks-0.182
Token little
Token ID1310
Feature activation+1.97e-01

Pos logit contributions
 oak+0.147
igsaw+0.133
 newspaper+0.130
 pine+0.128
 cane+0.127
Neg logit contributions
 hadn-0.316
itates-0.306
'd-0.296
 also-0.293
bled-0.286
Token ant
Token ID1885
Feature activation+3.34e-01

Pos logit contributions
 oak+0.237
igsaw+0.230
 newspaper+0.218
 pine+0.209
 cane+0.208
Neg logit contributions
itates-0.419
 hadn-0.414
 also-0.401
Looks-0.400
vere-0.391
Token came
Token ID1625
Feature activation-1.51e-01

Pos logit contributions
 oak+0.714
 goat+0.662
 pine+0.662
 newspaper+0.661
 cane+0.661
Neg logit contributions
itates-0.503
 hadn-0.457
Looks-0.443
vere-0.440
bled-0.438
Token along
Token ID1863
Feature activation+4.14e-02

Pos logit contributions
itates+0.337
 hadn+0.323
Looks+0.317
vere+0.312
bled+0.309
Neg logit contributions
 oak-0.214
 pine-0.193
 goat-0.190
 cane-0.188
 ant-0.183
Token and
Token ID290
Feature activation+8.38e-02

Pos logit contributions
 oak+0.060
 newspaper+0.054
 pony+0.054
igsaw+0.054
 pine+0.054
Neg logit contributions
 hadn-0.085
itates-0.085
 also-0.081
vere-0.078
Looks-0.078
Token decided
Token ID3066
Feature activation-1.97e-01

Pos logit contributions
 oak+0.163
 pine+0.149
 goat+0.148
 cane+0.144
 pony+0.142
Neg logit contributions
itates-0.147
vere-0.139
bled-0.133
 Deal-0.131
Looks-0.127
Token he
Token ID339
Feature activation-1.04e-01

Pos logit contributions
itates+0.514
 hadn+0.483
Looks+0.480
bled+0.473
vere+0.472
Neg logit contributions
 oak-0.208
 goat-0.178
 newspaper-0.178
 cane-0.178
 pine-0.176
Token
Token ID198
Feature activation-6.06e-02

Pos logit contributions
itates+0.393
 hadn+0.376
Looks+0.366
bled+0.357
vere+0.353
Neg logit contributions
 oak-0.323
 pine-0.295
 newspaper-0.294
 cane-0.293
igsaw-0.292
Token
Token ID198
Feature activation-5.96e-02

Pos logit contributions
itates+0.120
 hadn+0.114
bled+0.110
Looks+0.109
vere+0.108
Neg logit contributions
 oak-0.098
 pine-0.091
 newspaper-0.089
 goat-0.088
 cane-0.088
TokenThe
Token ID464
Feature activation+1.03e-01

Pos logit contributions
itates+0.114
 hadn+0.107
bled+0.103
vere+0.102
Looks+0.102
Neg logit contributions
 oak-0.103
 pine-0.094
 newspaper-0.094
 cane-0.093
 goat-0.093
Token hard
Token ID1327
Feature activation+4.48e-02

Pos logit contributions
 oak+0.144
 newspaper+0.130
 cane+0.130
igsaw+0.128
 goat+0.128
Neg logit contributions
itates-0.225
 hadn-0.215
Looks-0.209
bled-0.206
vere-0.206
Tokenworking
Token ID16090
Feature activation+2.46e-01

Pos logit contributions
 newspaper+0.044
 oak+0.044
igsaw+0.043
 magazine+0.042
 automobile+0.040
Neg logit contributions
 also-0.100
 hadn-0.098
'd-0.095
itates-0.095
bled-0.094
Token ant
Token ID1885
Feature activation+3.34e-01

Pos logit contributions
igsaw+0.264
 oak+0.259
 newspaper+0.241
 cane+0.227
 magazine+0.221
Neg logit contributions
 hadn-0.578
itates-0.573
 also-0.558
'd-0.545
bled-0.545
Token said
Token ID531
Feature activation-1.82e-01

Pos logit contributions
 oak+0.736
 pine+0.687
 goat+0.685
 cane+0.684
 newspaper+0.683
Neg logit contributions
itates-0.481
vere-0.425
Looks-0.420
 hadn-0.416
bled-0.415
Token,
Token ID11
Feature activation+1.52e-01

Pos logit contributions
itates+0.512
 hadn+0.494
bled+0.483
vere+0.480
Looks+0.477
Neg logit contributions
 oak-0.147
 newspaper-0.121
 pine-0.118
 cane-0.117
 goat-0.116
Token "
Token ID366
Feature activation-6.74e-02

Pos logit contributions
 oak+0.233
 goat+0.217
 cane+0.214
 pony+0.214
 pine+0.212
Neg logit contributions
itates-0.318
Looks-0.295
 hadn-0.292
vere-0.287
bled-0.281
TokenIf
Token ID1532
Feature activation-8.58e-02

Pos logit contributions
itates+0.103
bled+0.099
vere+0.097
 hadn+0.094
 refreshed+0.092
Neg logit contributions
 oak-0.131
 goat-0.128
 cane-0.126
 pigeon-0.126
 newspaper-0.126
Token you
Token ID345
Feature activation-1.39e-01

Pos logit contributions
itates+0.135
 hadn+0.118
Looks+0.117
bled+0.116
vere+0.113
Neg logit contributions
 oak-0.182
 pine-0.170
 newspaper-0.169
 goat-0.169
 cane-0.169
TokenOnce
Token ID7454
Feature activation+1.69e-02

Pos logit contributions
 oak+0.083
 newspaper+0.077
 goat+0.076
 cane+0.076
 pine+0.075
Neg logit contributions
itates-0.087
 hadn-0.081
Looks-0.077
vere-0.076
bled-0.076
Token,
Token ID11
Feature activation+1.62e-01

Pos logit contributions
 oak+0.037
 newspaper+0.035
 cane+0.034
 pine+0.034
 goat+0.034
Neg logit contributions
itates-0.024
 hadn-0.022
Looks-0.021
vere-0.021
bled-0.021
Token there
Token ID612
Feature activation+1.17e-02

Pos logit contributions
 oak+0.266
 pine+0.244
 pony+0.239
 cane+0.238
 goat+0.237
Neg logit contributions
itates-0.315
vere-0.298
 hadn-0.296
Looks-0.296
bled-0.286
Token was
Token ID373
Feature activation-2.26e-01

Pos logit contributions
 oak+0.025
 pine+0.023
 newspaper+0.023
 cane+0.023
 goat+0.023
Neg logit contributions
itates-0.018
vere-0.016
Looks-0.016
bled-0.016
izes-0.015
Token an
Token ID281
Feature activation+9.82e-02

Pos logit contributions
itates+0.398
 also+0.384
 hadn+0.384
Looks+0.378
bled+0.371
Neg logit contributions
 oak-0.401
 cane-0.370
igsaw-0.367
 goat-0.359
 newspaper-0.359
Token ant
Token ID1885
Feature activation+3.34e-01

Pos logit contributions
 cane+0.161
igsaw+0.160
con+0.160
cone+0.149
 newspaper+0.147
Neg logit contributions
 also-0.174
 hadn-0.144
itates-0.143
 certainly-0.141
year-0.141
Token.
Token ID13
Feature activation+9.55e-02

Pos logit contributions
igsaw+0.599
 oak+0.578
 newspaper+0.563
 cane+0.551
 goat+0.535
Neg logit contributions
itates-0.570
 hadn-0.566
 also-0.553
 never-0.517
vere-0.499
Token He
Token ID679
Feature activation-1.42e-01

Pos logit contributions
 pony+0.169
 cane+0.168
 goat+0.166
 newspaper+0.166
 pigeon+0.158
Neg logit contributions
 hadn-0.156
itates-0.152
 also-0.148
Looks-0.141
 definitely-0.138
Token was
Token ID373
Feature activation-2.26e-01

Pos logit contributions
itates+0.193
vere+0.171
Looks+0.170
 hadn+0.167
bled+0.166
Neg logit contributions
 oak-0.328
 pine-0.311
 cane-0.306
 goat-0.305
 newspaper-0.305
Token very
Token ID845
Feature activation-8.89e-02

Pos logit contributions
itates+0.301
 hadn+0.276
Looks+0.264
vere+0.259
bled+0.256
Neg logit contributions
 oak-0.525
 pine-0.492
 goat-0.488
 cane-0.486
 newspaper-0.485
Token hungry
Token ID14720
Feature activation-1.20e-02

Pos logit contributions
itates+0.168
 hadn+0.164
Looks+0.161
vere+0.153
bled+0.152
Neg logit contributions
 oak-0.158
 pine-0.148
 ant-0.141
 goat-0.140
 cane-0.140
Token
Token ID220
Feature activation-2.00e-01

Pos logit contributions
 oak+0.061
 newspaper+0.059
 cane+0.058
igsaw+0.058
 goat+0.058
Neg logit contributions
itates-0.028
 hadn-0.026
Looks-0.025
 refreshed-0.024
 also-0.023
Token
Token ID198
Feature activation-6.14e-02

Pos logit contributions
itates+0.375
 hadn+0.355
Looks+0.344
bled+0.337
vere+0.336
Neg logit contributions
 oak-0.348
 pine-0.318
 newspaper-0.318
 cane-0.318
 goat-0.315
Token
Token ID198
Feature activation-6.04e-02

Pos logit contributions
itates+0.106
 hadn+0.100
Looks+0.096
vere+0.095
bled+0.094
Neg logit contributions
 oak-0.116
 newspaper-0.108
 cane-0.107
 goat-0.107
 pine-0.106
TokenThe
Token ID464
Feature activation+1.03e-01

Pos logit contributions
itates+0.105
 hadn+0.098
Looks+0.097
vere+0.095
 also+0.091
Neg logit contributions
 oak-0.114
 newspaper-0.105
 cane-0.105
 goat-0.104
igsaw-0.104
Token lazy
Token ID16931
Feature activation+1.11e-01

Pos logit contributions
 oak+0.105
 newspaper+0.094
igsaw+0.093
 cane+0.092
 goat+0.091
Neg logit contributions
itates-0.258
 hadn-0.248
Looks-0.241
 also-0.240
vere-0.239
Token ant
Token ID1885
Feature activation+3.34e-01

Pos logit contributions
igsaw+0.101
 magazine+0.089
 newspaper+0.089
 automobile+0.084
 magazines+0.082
Neg logit contributions
 also-0.263
 hadn-0.253
itates-0.248
 still-0.240
 not-0.240
Token thought
Token ID1807
Feature activation-1.69e-01

Pos logit contributions
 oak+0.720
 pine+0.672
 cane+0.671
 newspaper+0.669
 goat+0.669
Neg logit contributions
itates-0.489
 hadn-0.435
vere-0.433
Looks-0.433
bled-0.427
Token for
Token ID329
Feature activation-6.03e-02

Pos logit contributions
itates+0.306
 hadn+0.292
vere+0.279
Looks+0.277
bled+0.277
Neg logit contributions
 oak-0.303
 newspaper-0.277
 goat-0.277
 cane-0.276
 pine-0.275
Token a
Token ID257
Feature activation+1.13e-01

Pos logit contributions
itates+0.147
 hadn+0.134
vere+0.132
bled+0.132
Looks+0.131
Neg logit contributions
 oak-0.075
 newspaper-0.070
 pine-0.068
 cane-0.067
igsaw-0.066
Token moment
Token ID2589
Feature activation-3.48e-03

Pos logit contributions
 oak+0.193
 newspaper+0.176
 cane+0.175
 pine+0.175
 goat+0.175
Neg logit contributions
itates-0.218
 hadn-0.206
Looks-0.199
vere-0.197
bled-0.196
Token and
Token ID290
Feature activation+8.92e-02

Pos logit contributions
 hadn+0.008
itates+0.008
 also+0.008
vere+0.008
bled+0.008
Neg logit contributions
 oak-0.004
 pine-0.004
 goat-0.003
igsaw-0.003
 newspaper-0.003
Token there
Token ID612
Feature activation+1.29e-02

Pos logit contributions
 oak+0.258
 pine+0.237
 cane+0.234
 newspaper+0.234
 goat+0.233
Neg logit contributions
itates-0.297
 hadn-0.283
Looks-0.278
vere-0.278
bled-0.268
Token was
Token ID373
Feature activation-2.26e-01

Pos logit contributions
 oak+0.028
 pine+0.027
 newspaper+0.026
 cane+0.026
 goat+0.026
Neg logit contributions
itates-0.018
vere-0.017
Looks-0.016
 hadn-0.016
bled-0.016
Token a
Token ID257
Feature activation+1.16e-01

Pos logit contributions
 also+0.348
itates+0.334
Looks+0.330
 hadn+0.327
bled+0.316
Neg logit contributions
 oak-0.453
 cane-0.421
igsaw-0.416
 goat-0.403
 pony-0.400
Token brave
Token ID14802
Feature activation+4.35e-02

Pos logit contributions
igsaw+0.175
 oak+0.167
 newspaper+0.153
ados+0.151
 newspapers+0.148
Neg logit contributions
itates-0.229
 also-0.228
 hadn-0.217
'd-0.202
 sorry-0.202
Token little
Token ID1310
Feature activation+1.97e-01

Pos logit contributions
igsaw+0.063
ados+0.061
 newspapers+0.059
orns+0.057
 newspaper+0.056
Neg logit contributions
 also-0.077
 sorry-0.071
'd-0.069
 still-0.065
 hadn-0.065
Token ant
Token ID1885
Feature activation+3.33e-01

Pos logit contributions
igsaw+0.331
growth+0.318
con+0.281
 arrest+0.280
ados+0.275
Neg logit contributions
 also-0.312
 sorry-0.302
itates-0.299
 refreshed-0.268
 still-0.267
Token.
Token ID13
Feature activation+9.25e-02

Pos logit contributions
igsaw+0.605
 oak+0.580
 newspaper+0.562
 cane+0.553
 goat+0.528
Neg logit contributions
itates-0.580
 hadn-0.571
 also-0.552
 never-0.521
'd-0.513
Token He
Token ID679
Feature activation-1.45e-01

Pos logit contributions
 pony+0.180
 goat+0.174
 cane+0.171
 pigeon+0.167
 bike+0.165
Neg logit contributions
 hadn-0.130
itates-0.129
 also-0.124
 definitely-0.123
 certainly-0.120
Token wanted
Token ID2227
Feature activation-2.45e-01

Pos logit contributions
itates+0.217
vere+0.199
Looks+0.194
bled+0.192
 hadn+0.186
Neg logit contributions
 oak-0.316
 pine-0.300
 goat-0.294
 cane-0.291
 newspaper-0.289
Token to
Token ID284
Feature activation+1.15e-01

Pos logit contributions
 also+0.443
 not+0.437
 never+0.431
itates+0.417
 hadn+0.417
Neg logit contributions
igsaw-0.414
 oak-0.391
 automobile-0.360
affe-0.360
 pony-0.357
Token go
Token ID467
Feature activation-1.11e-01

Pos logit contributions
igsaw+0.238
 newspaper+0.227
 oak+0.225
 pony+0.225
 bull+0.223
Neg logit contributions
 also-0.153
 not-0.146
 actually-0.145
 hadn-0.141
 never-0.141
Token!
Token ID0
Feature activation+8.47e-02

Pos logit contributions
itates+0.179
 okay+0.175
 hadn+0.174
 also+0.170
 OK+0.167
Neg logit contributions
igsaw-0.150
 oak-0.144
 cane-0.130
 newspaper-0.130
 automobile-0.126
Tokenâ
Token ID22940
Feature activation-5.73e-02

Pos logit contributions
 oak+0.191
 cane+0.182
 newspaper+0.181
 goat+0.176
igsaw+0.175
Neg logit contributions
itates-0.116
 hadn-0.106
Looks-0.102
vere-0.095
 suppose-0.093
Token
Token ID26391
Feature activation+2.71e-02

Pos logit contributions
itates+0.091
 hadn+0.083
Looks+0.081
vere+0.078
izes+0.076
Neg logit contributions
 oak-0.117
 cane-0.110
 pony-0.109
 goat-0.108
 pine-0.108
Token
Token ID198
Feature activation-5.61e-02

Pos logit contributions
 oak+0.051
 pony+0.049
 cane+0.048
igsaw+0.047
 automobile+0.047
Neg logit contributions
itates-0.046
 hadn-0.043
vere-0.040
'd-0.039
 never-0.038
TokenThe
Token ID464
Feature activation+1.04e-01

Pos logit contributions
itates+0.108
 hadn+0.103
bled+0.101
Looks+0.098
vere+0.097
Neg logit contributions
 oak-0.093
 pine-0.087
 newspaper-0.086
 cane-0.084
 goat-0.084
Token ant
Token ID1885
Feature activation+3.33e-01

Pos logit contributions
igsaw+0.114
 oak+0.112
 cane+0.105
 pony+0.103
 newspaper+0.098
Neg logit contributions
itates-0.235
 also-0.230
 hadn-0.228
Looks-0.221
 suppose-0.221
Token asked
Token ID1965
Feature activation-1.48e-01

Pos logit contributions
 oak+0.732
 newspaper+0.683
 pine+0.682
 goat+0.680
 cane+0.680
Neg logit contributions
itates-0.480
 hadn-0.431
Looks-0.424
bled-0.421
vere-0.419
Token,
Token ID11
Feature activation+1.57e-01

Pos logit contributions
itates+0.330
 hadn+0.322
vere+0.316
 also+0.314
 suppose+0.311
Neg logit contributions
igsaw-0.181
 oak-0.180
 newspaper-0.164
 cane-0.164
 goat-0.157
Token
Token ID6184
Feature activation+4.32e-02

Pos logit contributions
 oak+0.278
 newspaper+0.255
 pine+0.255
 cane+0.254
 goat+0.254
Neg logit contributions
itates-0.292
 hadn-0.274
Looks-0.266
vere-0.263
bled-0.263
Token
Token ID95
Feature activation+7.65e-02

Pos logit contributions
 oak+0.072
 newspaper+0.069
 pony+0.067
 cane+0.067
 pine+0.067
Neg logit contributions
itates-0.079
vere-0.072
Looks-0.068
 hadn-0.068
izes-0.066
Token
Token ID26391
Feature activation+2.72e-02

Pos logit contributions
 oak+0.139
 pony+0.130
 cane+0.130
 pine+0.129
 goat+0.127
Neg logit contributions
itates-0.137
 hadn-0.126
Looks-0.123
vere-0.120
 certainly-0.119
TokenOnce
Token ID7454
Feature activation+2.15e-02

Pos logit contributions
 oak+0.087
 newspaper+0.079
 cane+0.079
 pine+0.079
 goat+0.078
Neg logit contributions
itates-0.106
 hadn-0.100
Looks-0.097
vere-0.096
bled-0.096
Token,
Token ID11
Feature activation+1.62e-01

Pos logit contributions
 oak+0.047
 newspaper+0.044
 cane+0.044
 pine+0.044
 goat+0.044
Neg logit contributions
itates-0.031
 hadn-0.028
Looks-0.027
vere-0.027
bled-0.027
Token there
Token ID612
Feature activation+1.34e-02

Pos logit contributions
 oak+0.259
 pine+0.237
 pony+0.231
 goat+0.228
 cane+0.228
Neg logit contributions
itates-0.316
vere-0.305
Looks-0.301
 hadn-0.294
bled-0.289
Token was
Token ID373
Feature activation-2.25e-01

Pos logit contributions
 oak+0.028
 pine+0.026
 newspaper+0.026
 cane+0.026
 goat+0.026
Neg logit contributions
itates-0.021
vere-0.019
Looks-0.018
bled-0.018
izes-0.018
Token an
Token ID281
Feature activation+9.82e-02

Pos logit contributions
 also+0.362
itates+0.358
Looks+0.352
 hadn+0.348
 certainly+0.331
Neg logit contributions
 oak-0.426
igsaw-0.397
 cane-0.393
 goat-0.382
 newspaper-0.377
Token ant
Token ID1885
Feature activation+3.33e-01

Pos logit contributions
igsaw+0.171
con+0.168
 cane+0.167
cone+0.157
 newspaper+0.154
Neg logit contributions
 also-0.169
 hadn-0.139
itates-0.138
 glad-0.138
year-0.136
Token who
Token ID508
Feature activation-3.30e-02

Pos logit contributions
igsaw+0.569
 oak+0.569
 newspaper+0.546
 cane+0.537
 goat+0.523
Neg logit contributions
itates-0.601
 hadn-0.582
 also-0.559
Looks-0.529
vere-0.529
Token was
Token ID373
Feature activation-2.25e-01

Pos logit contributions
itates+0.051
vere+0.050
bled+0.050
Looks+0.045
 Deal+0.045
Neg logit contributions
 oak-0.071
 pine-0.067
 cane-0.065
 goat-0.064
 newspaper-0.064
Token very
Token ID845
Feature activation-8.62e-02

Pos logit contributions
itates+0.317
 hadn+0.287
Looks+0.284
bled+0.281
vere+0.281
Neg logit contributions
 oak-0.517
 pine-0.486
 goat-0.478
 cane-0.470
 pony-0.469
Token curious
Token ID11040
Feature activation-3.11e-02

Pos logit contributions
itates+0.153
 hadn+0.147
Looks+0.143
bled+0.139
vere+0.139
Neg logit contributions
 oak-0.167
 pine-0.155
 ant-0.149
 goat-0.149
 cane-0.147
Token.
Token ID13
Feature activation+9.66e-02

Pos logit contributions
 hadn+0.062
itates+0.060
 also+0.057
'd+0.057
 certainly+0.056
Neg logit contributions
 oak-0.048
igsaw-0.046
 goat-0.042
 pine-0.042
 newspaper-0.042
Token time
Token ID640
Feature activation+1.14e-01

Pos logit contributions
 oak+0.198
 goat+0.177
 cane+0.175
 newspaper+0.174
 pine+0.172
Neg logit contributions
itates-0.215
 hadn-0.206
 also-0.195
Looks-0.195
vere-0.195
Token there
Token ID612
Feature activation+1.19e-02

Pos logit contributions
 oak+0.121
 newspaper+0.111
 goat+0.110
 cane+0.110
 pine+0.103
Neg logit contributions
itates-0.296
 hadn-0.288
vere-0.278
 also-0.271
bled-0.271
Token was
Token ID373
Feature activation-2.28e-01

Pos logit contributions
 oak+0.026
 pine+0.024
 newspaper+0.024
 cane+0.024
 goat+0.024
Neg logit contributions
itates-0.017
vere-0.015
Looks-0.015
bled-0.015
 hadn-0.015
Token a
Token ID257
Feature activation+1.15e-01

Pos logit contributions
 also+0.364
itates+0.363
 hadn+0.361
Looks+0.356
bled+0.350
Neg logit contributions
 oak-0.428
 cane-0.390
igsaw-0.389
 newspaper-0.378
 pony-0.375
Token little
Token ID1310
Feature activation+1.97e-01

Pos logit contributions
igsaw+0.171
 oak+0.165
ados+0.150
 newspaper+0.148
 newspapers+0.145
Neg logit contributions
 also-0.230
itates-0.229
 hadn-0.218
'd-0.206
 refreshed-0.204
Token ant
Token ID1885
Feature activation+3.33e-01

Pos logit contributions
igsaw+0.349
growth+0.330
con+0.301
 newspaper+0.289
 newspapers+0.287
Neg logit contributions
 also-0.312
 sorry-0.289
itates-0.284
 still-0.267
We-0.266
Token called
Token ID1444
Feature activation-9.87e-02

Pos logit contributions
igsaw+0.615
 oak+0.580
 newspaper+0.568
 cane+0.552
 goat+0.529
Neg logit contributions
itates-0.569
 hadn-0.564
 also-0.559
 never-0.521
'd-0.502
Token Tom
Token ID4186
Feature activation+6.93e-02

Pos logit contributions
 not+0.160
 hadn+0.154
 ok+0.150
 never+0.148
 also+0.145
Neg logit contributions
 pony-0.163
 cane-0.163
 bike-0.158
 oak-0.158
 automobile-0.152
Token.
Token ID13
Feature activation+9.26e-02

Pos logit contributions
igsaw+0.134
 oak+0.127
 pony+0.117
 newspaper+0.116
 cane+0.115
Neg logit contributions
 also-0.122
 hadn-0.119
itates-0.117
 never-0.112
'd-0.108
Token He
Token ID679
Feature activation-1.46e-01

Pos logit contributions
 oak+0.144
 pony+0.139
 goat+0.137
igsaw+0.134
 cane+0.132
Neg logit contributions
itates-0.184
 hadn-0.174
Looks-0.171
 also-0.171
 definitely-0.166
Token was
Token ID373
Feature activation-2.28e-01

Pos logit contributions
itates+0.208
vere+0.190
Looks+0.185
bled+0.184
 hadn+0.181
Neg logit contributions
 oak-0.326
 pine-0.310
 cane-0.305
 goat-0.304
 newspaper-0.301
Token back
Token ID736
Feature activation+8.10e-02

Pos logit contributions
itates+0.185
 hadn+0.173
bled+0.167
Looks+0.167
vere+0.165
Neg logit contributions
 oak-0.190
 newspaper-0.176
 cane-0.174
 goat-0.174
 pine-0.174
Token and
Token ID290
Feature activation+8.69e-02

Pos logit contributions
 oak+0.136
affe+0.136
 pigeon+0.135
 goat+0.135
igsaw+0.134
Neg logit contributions
itates-0.135
 also-0.129
 hadn-0.124
 never-0.122
 suppose-0.120
Token watched
Token ID7342
Feature activation-1.58e-01

Pos logit contributions
 oak+0.175
 pine+0.163
 goat+0.160
 cane+0.158
 pony+0.158
Neg logit contributions
itates-0.148
vere-0.134
 hadn-0.129
Looks-0.128
bled-0.126
Token as
Token ID355
Feature activation+5.05e-02

Pos logit contributions
itates+0.345
 hadn+0.335
bled+0.330
vere+0.324
Looks+0.319
Neg logit contributions
 oak-0.222
 newspaper-0.202
 cane-0.198
 goat-0.195
 pine-0.195
Token the
Token ID262
Feature activation+2.10e-01

Pos logit contributions
 oak+0.056
 pine+0.049
 cane+0.048
 newspaper+0.047
 goat+0.046
Neg logit contributions
itates-0.130
 hadn-0.122
Looks-0.120
vere-0.119
bled-0.118
Token ant
Token ID1885
Feature activation+3.33e-01

Pos logit contributions
 oak+0.124
 newspaper+0.098
igsaw+0.097
 cane+0.096
 pine+0.090
Neg logit contributions
itates-0.623
 hadn-0.601
Looks-0.590
bled-0.584
vere-0.583
Token broke
Token ID6265
Feature activation-3.02e-02

Pos logit contributions
 oak+0.669
 pine+0.628
 goat+0.614
 cane+0.605
 pony+0.605
Neg logit contributions
itates-0.570
vere-0.503
Looks-0.497
bled-0.479
 hadn-0.471
Token free
Token ID1479
Feature activation-2.59e-02

Pos logit contributions
bled+0.054
itates+0.053
 hadn+0.053
 also+0.050
Looks+0.049
Neg logit contributions
 oak-0.052
 newspaper-0.050
igsaw-0.049
 goat-0.048
 pony-0.048
Token from
Token ID422
Feature activation+5.06e-02

Pos logit contributions
 hadn+0.049
itates+0.047
bled+0.047
vere+0.045
 also+0.045
Neg logit contributions
 oak-0.040
 newspaper-0.038
igsaw-0.038
 pony-0.037
 goat-0.037
Token the
Token ID262
Feature activation+2.08e-01

Pos logit contributions
 oak+0.067
 newspaper+0.061
 cane+0.060
 goat+0.060
igsaw+0.060
Neg logit contributions
itates-0.114
 hadn-0.109
bled-0.106
Looks-0.106
vere-0.104
Token box
Token ID3091
Feature activation+1.01e-01

Pos logit contributions
 oak+0.187
igsaw+0.165
 newspaper+0.163
 cane+0.161
 pony+0.160
Neg logit contributions
itates-0.534
 hadn-0.521
vere-0.500
Looks-0.500
 also-0.500
Token
Token ID198
Feature activation-6.27e-02

Pos logit contributions
itates+0.133
 hadn+0.128
bled+0.127
Looks+0.125
vere+0.121
Neg logit contributions
 oak-0.096
 pine-0.089
 newspaper-0.086
igsaw-0.085
 cane-0.085
TokenSo
Token ID2396
Feature activation-1.10e-01

Pos logit contributions
itates+0.109
 hadn+0.102
bled+0.098
Looks+0.097
vere+0.097
Neg logit contributions
 oak-0.120
 newspaper-0.110
 pine-0.110
 cane-0.110
 goat-0.109
Token she
Token ID673
Feature activation-9.58e-02

Pos logit contributions
 hadn+0.229
itates+0.226
 also+0.216
Looks+0.215
bled+0.211
Neg logit contributions
 oak-0.156
 newspaper-0.148
 cane-0.145
 goat-0.143
igsaw-0.143
Token grabbed
Token ID13176
Feature activation-6.72e-02

Pos logit contributions
itates+0.146
vere+0.131
Looks+0.127
 hadn+0.127
bled+0.125
Neg logit contributions
 oak-0.206
 goat-0.194
 pine-0.194
 cane-0.192
 newspaper-0.189
Token the
Token ID262
Feature activation+2.11e-01

Pos logit contributions
bled+0.113
 also+0.109
Looks+0.109
 hadn+0.105
itates+0.103
Neg logit contributions
 oak-0.120
 goat-0.112
 newspaper-0.109
 automobile-0.109
affe-0.109
Token ant
Token ID1885
Feature activation+3.33e-01

Pos logit contributions
 oak+0.267
igsaw+0.254
 newspaper+0.233
 automobile+0.228
 goat+0.224
Neg logit contributions
itates-0.457
 hadn-0.449
 also-0.444
bled-0.432
 refreshed-0.423
Token with
Token ID351
Feature activation-4.35e-02

Pos logit contributions
igsaw+0.521
 newspaper+0.491
 oak+0.490
 goat+0.475
round+0.473
Neg logit contributions
 also-0.563
 never-0.547
 hadn-0.531
 not-0.515
 refreshed-0.514
Token her
Token ID607
Feature activation+1.01e-01

Pos logit contributions
itates+0.088
 hadn+0.087
Looks+0.084
bled+0.083
vere+0.082
Neg logit contributions
 oak-0.067
 newspaper-0.060
 goat-0.060
igsaw-0.059
 cane-0.059
Token free
Token ID1479
Feature activation-2.39e-02

Pos logit contributions
 oak+0.156
igsaw+0.143
 newspaper+0.138
 goat+0.133
 automobile+0.132
Neg logit contributions
 hadn-0.206
itates-0.197
 also-0.194
bled-0.191
vere-0.187
Token hand
Token ID1021
Feature activation+9.21e-02

Pos logit contributions
 hadn+0.044
bled+0.043
 certainly+0.042
 definitely+0.042
 also+0.042
Neg logit contributions
 oak-0.034
igsaw-0.034
 newspaper-0.033
 bull-0.031
 goat-0.030
Token and
Token ID290
Feature activation+7.89e-02

Pos logit contributions
 oak+0.141
igsaw+0.132
 bull+0.127
 pine+0.125
 newspaper+0.124
Neg logit contributions
 hadn-0.183
itates-0.174
bled-0.172
 also-0.171
 never-0.166
Token off
Token ID572
Feature activation+4.19e-02

Pos logit contributions
 oak+0.125
 newspaper+0.114
 cane+0.113
 pine+0.113
 goat+0.113
Neg logit contributions
itates-0.154
 hadn-0.146
vere-0.140
Looks-0.140
bled-0.138
Token."
Token ID526
Feature activation+3.29e-02

Pos logit contributions
 oak+0.077
 newspaper+0.073
 goat+0.072
 cane+0.072
igsaw+0.071
Neg logit contributions
itates-0.070
 hadn-0.068
vere-0.064
Looks-0.063
bled-0.063
Token
Token ID198
Feature activation-5.89e-02

Pos logit contributions
 oak+0.064
 newspaper+0.060
 cane+0.059
 pine+0.059
 goat+0.059
Neg logit contributions
itates-0.055
 hadn-0.051
Looks-0.049
vere-0.049
bled-0.048
Token
Token ID198
Feature activation-5.81e-02

Pos logit contributions
itates+0.116
 hadn+0.110
Looks+0.107
bled+0.105
vere+0.103
Neg logit contributions
 oak-0.097
 pine-0.090
 goat-0.089
 newspaper-0.088
 cane-0.088
TokenThe
Token ID464
Feature activation+1.02e-01

Pos logit contributions
itates+0.113
 hadn+0.108
bled+0.103
Looks+0.103
vere+0.102
Neg logit contributions
 oak-0.098
 pine-0.090
 goat-0.090
 cane-0.089
 newspaper-0.089
Token ant
Token ID1885
Feature activation+3.33e-01

Pos logit contributions
 oak+0.059
igsaw+0.055
 newspaper+0.050
 cane+0.048
 pony+0.044
Neg logit contributions
itates-0.298
 hadn-0.289
 also-0.283
Looks-0.281
bled-0.281
Token was
Token ID373
Feature activation-2.29e-01

Pos logit contributions
 oak+0.717
 pine+0.669
 goat+0.668
 cane+0.662
 newspaper+0.659
Neg logit contributions
itates-0.496
vere-0.440
Looks-0.433
bled-0.428
 hadn-0.427
Token a
Token ID257
Feature activation+1.13e-01

Pos logit contributions
itates+0.402
Looks+0.394
vere+0.367
izes+0.360
'd+0.355
Neg logit contributions
 oak-0.376
 ant-0.368
 pine-0.361
 pony-0.359
 animal-0.352
Token bit
Token ID1643
Feature activation-3.00e-03

Pos logit contributions
 oak+0.179
 newspaper+0.162
 pine+0.160
 cane+0.160
igsaw+0.159
Neg logit contributions
itates-0.231
 hadn-0.222
Looks-0.211
vere-0.209
bled-0.205
Token annoyed
Token ID25602
Feature activation-1.05e-01

Pos logit contributions
itates+0.004
 hadn+0.004
bled+0.004
vere+0.004
Looks+0.004
Neg logit contributions
 oak-0.007
 pine-0.006
 goat-0.006
 cane-0.006
 ant-0.006
Token,
Token ID11
Feature activation+1.51e-01

Pos logit contributions
itates+0.245
 hadn+0.234
vere+0.226
Looks+0.225
 also+0.224
Neg logit contributions
 oak-0.140
 newspaper-0.123
 pine-0.122
 cane-0.122
igsaw-0.118
Token!
Token ID0
Feature activation+8.37e-02

Pos logit contributions
itates+0.454
 hadn+0.419
Looks+0.411
vere+0.407
bled+0.406
Neg logit contributions
 oak-0.381
 newspaper-0.348
 pine-0.348
 cane-0.346
 goat-0.345
Tokenâ
Token ID22940
Feature activation-5.50e-02

Pos logit contributions
 oak+0.157
 goat+0.147
 pine+0.146
 newspaper+0.144
 cane+0.144
Neg logit contributions
itates-0.145
bled-0.134
 hadn-0.134
vere-0.129
Looks-0.129
Token
Token ID26391
Feature activation+2.93e-02

Pos logit contributions
itates+0.091
 hadn+0.084
vere+0.081
Looks+0.081
bled+0.080
Neg logit contributions
 oak-0.108
 newspaper-0.100
igsaw-0.100
 goat-0.099
 pine-0.099
Token said
Token ID531
Feature activation-1.85e-01

Pos logit contributions
 oak+0.054
 newspaper+0.049
 cane+0.049
 pine+0.049
 pony+0.049
Neg logit contributions
itates-0.052
 hadn-0.050
vere-0.047
Looks-0.046
 certainly-0.045
Token the
Token ID262
Feature activation+2.11e-01

Pos logit contributions
itates+0.409
Looks+0.373
 hadn+0.372
bled+0.353
vere+0.352
Neg logit contributions
 oak-0.274
 pony-0.253
 pine-0.250
 goat-0.247
 cane-0.246
Token ant
Token ID1885
Feature activation+3.33e-01

Pos logit contributions
 oak+0.334
 cane+0.305
 newspaper+0.305
 pine+0.301
 goat+0.301
Neg logit contributions
itates-0.442
 hadn-0.416
Looks-0.403
bled-0.401
vere-0.396
Token.
Token ID13
Feature activation+9.29e-02

Pos logit contributions
 oak+0.677
 newspaper+0.629
 cane+0.627
 pine+0.624
 goat+0.624
Neg logit contributions
itates-0.523
 hadn-0.490
Looks-0.465
vere-0.459
bled-0.457
Token<|endoftext|>
Token ID50256
Feature activation+5.40e-02

Pos logit contributions
 oak+0.168
 pine+0.156
 goat+0.154
 cane+0.153
 newspaper+0.153
Neg logit contributions
itates-0.168
 hadn-0.156
bled-0.153
vere-0.151
Looks-0.150
TokenM
Token ID44
Feature activation+6.69e-02

Pos logit contributions
 oak+0.083
 newspaper+0.075
 pine+0.075
 cane+0.075
 goat+0.075
Neg logit contributions
itates-0.112
 hadn-0.106
Looks-0.103
vere-0.102
bled-0.102
Tokenama
Token ID1689
Feature activation+1.39e-01

Pos logit contributions
 oak+0.105
 pine+0.095
 newspaper+0.095
 cane+0.094
 goat+0.094
Neg logit contributions
itates-0.139
 hadn-0.130
Looks-0.127
bled-0.126
vere-0.125
Token was
Token ID373
Feature activation-2.27e-01

Pos logit contributions
 oak+0.211
igsaw+0.207
 newspaper+0.198
 cane+0.198
 pony+0.188
Neg logit contributions
 hadn-0.277
itates-0.271
 also-0.264
'd-0.256
bled-0.250
Token food
Token ID2057
Feature activation+1.83e-01

Pos logit contributions
igsaw+0.107
 pony+0.094
 oak+0.092
 newspaper+0.090
affe+0.089
Neg logit contributions
 actually-0.158
 hadn-0.158
 also-0.155
bled-0.155
itates-0.154
Token.
Token ID13
Feature activation+8.52e-02

Pos logit contributions
 oak+0.252
igsaw+0.225
 newspaper+0.225
 goat+0.224
 pine+0.223
Neg logit contributions
itates-0.398
 hadn-0.390
vere-0.371
bled-0.369
Looks-0.366
Token
Token ID198
Feature activation-5.94e-02

Pos logit contributions
 oak+0.166
 newspaper+0.154
 cane+0.153
 goat+0.152
 pony+0.152
Neg logit contributions
itates-0.143
 hadn-0.137
Looks-0.129
vere-0.126
bled-0.125
Token
Token ID198
Feature activation-5.74e-02

Pos logit contributions
itates+0.117
 hadn+0.111
bled+0.109
Looks+0.108
vere+0.106
Neg logit contributions
 oak-0.096
 pine-0.089
 newspaper-0.088
 cane-0.087
 goat-0.086
TokenThe
Token ID464
Feature activation+1.03e-01

Pos logit contributions
itates+0.090
 hadn+0.084
Looks+0.081
vere+0.079
bled+0.077
Neg logit contributions
 oak-0.120
 newspaper-0.111
 cane-0.111
 goat-0.110
 pine-0.110
Token ant
Token ID1885
Feature activation+3.33e-01

Pos logit contributions
 oak+0.104
igsaw+0.091
 newspaper+0.091
 cane+0.089
 pine+0.087
Neg logit contributions
itates-0.262
 hadn-0.255
Looks-0.246
bled-0.244
vere-0.242
Token couldn
Token ID3521
Feature activation-2.31e-01

Pos logit contributions
 oak+0.724
 pine+0.680
 goat+0.667
 cane+0.659
 pony+0.654
Neg logit contributions
itates-0.503
vere-0.456
Looks-0.436
bled-0.434
 hadn-0.419
Token't
Token ID470
Feature activation-1.97e-01

Pos logit contributions
itates+0.272
 hadn+0.241
Looks+0.233
bled+0.226
vere+0.221
Neg logit contributions
 oak-0.568
 pine-0.530
 cane-0.530
igsaw-0.529
 newspaper-0.527
Token believe
Token ID1975
Feature activation-2.97e-02

Pos logit contributions
itates+0.321
 hadn+0.301
Looks+0.287
vere+0.285
bled+0.282
Neg logit contributions
 oak-0.391
 newspaper-0.366
 cane-0.363
 goat-0.361
 pine-0.360
Token it
Token ID340
Feature activation+2.61e-03

Pos logit contributions
itates+0.053
 hadn+0.052
bled+0.049
Looks+0.047
 also+0.047
Neg logit contributions
 oak-0.053
 newspaper-0.051
 goat-0.049
igsaw-0.049
 cane-0.049
Token.
Token ID13
Feature activation+8.98e-02

Pos logit contributions
 oak+0.003
 cane+0.002
 newspaper+0.002
 pine+0.002
 goat+0.002
Neg logit contributions
itates-0.007
Looks-0.006
 hadn-0.006
vere-0.006
bled-0.006
Token,
Token ID11
Feature activation+1.53e-01

Pos logit contributions
 oak+0.130
 goat+0.124
 newspaper+0.124
 cane+0.122
 pony+0.117
Neg logit contributions
itates-0.262
 hadn-0.255
vere-0.246
 also-0.243
 not-0.242
Token there
Token ID612
Feature activation+6.86e-03

Pos logit contributions
 oak+0.245
 pine+0.226
 cane+0.222
 newspaper+0.221
 goat+0.220
Neg logit contributions
itates-0.302
 hadn-0.285
vere-0.283
Looks-0.282
bled-0.272
Token was
Token ID373
Feature activation-2.33e-01

Pos logit contributions
 oak+0.015
 pine+0.014
 newspaper+0.014
 cane+0.014
 goat+0.014
Neg logit contributions
itates-0.010
vere-0.009
Looks-0.009
 hadn-0.008
bled-0.008
Token a
Token ID257
Feature activation+1.08e-01

Pos logit contributions
 also+0.345
Looks+0.329
 hadn+0.321
itates+0.320
 refreshed+0.314
Neg logit contributions
 oak-0.479
 cane-0.442
igsaw-0.439
 pony-0.428
 goat-0.426
Token little
Token ID1310
Feature activation+1.92e-01

Pos logit contributions
igsaw+0.161
 oak+0.156
 newspaper+0.141
 cane+0.137
ados+0.136
Neg logit contributions
itates-0.215
 also-0.212
 hadn-0.204
 refreshed-0.192
 sorry-0.191
Token ant
Token ID1885
Feature activation+3.33e-01

Pos logit contributions
igsaw+0.337
growth+0.318
con+0.297
oline+0.282
 newspaper+0.279
Neg logit contributions
 also-0.299
 sorry-0.292
itates-0.269
 refreshed-0.256
 still-0.254
Token who
Token ID508
Feature activation-3.95e-02

Pos logit contributions
igsaw+0.627
 oak+0.585
 newspaper+0.581
 cane+0.564
 pony+0.541
Neg logit contributions
itates-0.560
 also-0.544
 hadn-0.544
 never-0.509
vere-0.483
Token loved
Token ID6151
Feature activation-1.73e-01

Pos logit contributions
itates+0.065
vere+0.063
bled+0.063
 Deal+0.059
Looks+0.058
Neg logit contributions
 oak-0.081
 pine-0.077
 cane-0.074
 newspaper-0.073
 goat-0.072
Token to
Token ID284
Feature activation+1.11e-01

Pos logit contributions
 also+0.289
 not+0.289
 hadn+0.287
 never+0.282
itates+0.277
Neg logit contributions
igsaw-0.298
affe-0.255
 oak-0.253
 cane-0.235
 automobile-0.234
Token crawl
Token ID27318
Feature activation-3.72e-02

Pos logit contributions
igsaw+0.212
 goat+0.197
 newspaper+0.197
 automobile+0.192
 oak+0.191
Neg logit contributions
 also-0.157
 not-0.149
 refreshed-0.146
'd-0.146
 certainly-0.144
Token everywhere
Token ID8347
Feature activation-1.53e-02

Pos logit contributions
itates+0.048
 also+0.046
vere+0.045
 refreshed+0.045
 never+0.044
Neg logit contributions
 goat-0.078
 pigeon-0.077
 newspaper-0.077
igsaw-0.076
 bull-0.074
Token the
Token ID262
Feature activation+2.03e-01

Pos logit contributions
 oak+0.153
 pine+0.144
 goat+0.143
 pony+0.141
 cane+0.140
Neg logit contributions
itates-0.134
vere-0.121
 hadn-0.115
Looks-0.115
bled-0.113
Token owner
Token ID4870
Feature activation+1.28e-01

Pos logit contributions
 oak+0.081
 newspaper+0.054
 cane+0.053
 goat+0.052
 pine+0.050
Neg logit contributions
itates-0.661
 hadn-0.637
Looks-0.624
vere-0.621
bled-0.621
Token smiled
Token ID13541
Feature activation-1.41e-01

Pos logit contributions
 oak+0.296
 newspaper+0.277
 cane+0.276
 pine+0.275
 goat+0.275
Neg logit contributions
itates-0.169
 hadn-0.155
Looks-0.146
bled-0.144
vere-0.144
Token.
Token ID13
Feature activation+7.33e-02

Pos logit contributions
itates+0.319
 hadn+0.304
Looks+0.294
bled+0.293
vere+0.292
Neg logit contributions
 oak-0.192
 newspaper-0.172
 cane-0.170
 pine-0.170
 goat-0.169
Token "
Token ID366
Feature activation-8.40e-02

Pos logit contributions
 oak+0.119
 pine+0.109
 newspaper+0.106
 goat+0.105
 bull+0.104
Neg logit contributions
itates-0.148
vere-0.141
 hadn-0.136
bled-0.135
Looks-0.134
TokenLooks
Token ID41102
Feature activation-3.70e-01

Pos logit contributions
itates+0.133
vere+0.127
bled+0.127
izes+0.125
 hadn+0.121
Neg logit contributions
 oak-0.163
 pine-0.158
 pony-0.156
 goat-0.155
 pigeon-0.152
Token like
Token ID588
Feature activation-8.90e-02

Pos logit contributions
itates+0.771
bled+0.716
 hadn+0.715
Looks+0.696
vere+0.688
Neg logit contributions
 oak-0.573
 newspaper-0.527
 goat-0.526
 pine-0.524
 cane-0.522
Token you
Token ID345
Feature activation-1.52e-01

Pos logit contributions
itates+0.168
 hadn+0.150
bled+0.148
Looks+0.148
vere+0.144
Neg logit contributions
 oak-0.164
 goat-0.153
 pine-0.152
 newspaper-0.151
 cane-0.150
Token two
Token ID734
Feature activation+5.85e-02

Pos logit contributions
itates+0.254
vere+0.240
Looks+0.239
bled+0.238
 hadn+0.234
Neg logit contributions
 oak-0.300
 goat-0.275
 pine-0.271
 newspaper-0.268
 pony-0.266
Token are
Token ID389
Feature activation-2.50e-01

Pos logit contributions
 oak+0.094
 newspaper+0.085
 cane+0.085
 goat+0.085
 pine+0.085
Neg logit contributions
itates-0.119
 hadn-0.112
Looks-0.109
bled-0.108
vere-0.107
Token friends
Token ID2460
Feature activation+6.09e-02

Pos logit contributions
itates+0.531
 hadn+0.502
Looks+0.488
bled+0.482
vere+0.482
Neg logit contributions
 oak-0.381
 newspaper-0.342
 pine-0.342
 goat-0.342
 cane-0.341
Token laughed
Token ID13818
Feature activation-1.45e-01

Pos logit contributions
itates+0.026
vere+0.024
Looks+0.023
bled+0.023
 hadn+0.022
Neg logit contributions
 oak-0.036
 goat-0.034
 pony-0.034
 pine-0.033
 cane-0.033
Token and
Token ID290
Feature activation+7.87e-02

Pos logit contributions
itates+0.362
 hadn+0.347
bled+0.336
Looks+0.334
vere+0.333
Neg logit contributions
 oak-0.170
 pine-0.146
igsaw-0.146
 newspaper-0.144
 cane-0.141
Token said
Token ID531
Feature activation-1.95e-01

Pos logit contributions
 oak+0.153
 pine+0.144
 pony+0.143
 goat+0.142
 cane+0.138
Neg logit contributions
itates-0.135
vere-0.126
bled-0.119
 hadn-0.115
Looks-0.115
Token,
Token ID11
Feature activation+1.44e-01

Pos logit contributions
itates+0.516
 hadn+0.499
vere+0.483
bled+0.482
Looks+0.480
Neg logit contributions
 oak-0.189
 newspaper-0.160
 cane-0.159
 pine-0.158
 goat-0.157
Token "
Token ID366
Feature activation-7.65e-02

Pos logit contributions
 oak+0.200
 goat+0.186
 pony+0.186
 newspaper+0.181
 pine+0.181
Neg logit contributions
itates-0.319
Looks-0.295
 hadn-0.289
vere-0.287
bled-0.286
TokenLooks
Token ID41102
Feature activation-3.66e-01

Pos logit contributions
itates+0.124
bled+0.119
vere+0.117
 hadn+0.114
izes+0.110
Neg logit contributions
 oak-0.145
 newspaper-0.139
 pony-0.138
 goat-0.137
 pine-0.137
Token like
Token ID588
Feature activation-8.58e-02

Pos logit contributions
itates+0.753
 hadn+0.701
bled+0.698
Looks+0.683
vere+0.672
Neg logit contributions
 oak-0.574
 newspaper-0.530
 pine-0.524
 cane-0.524
 goat-0.523
Token that
Token ID326
Feature activation+6.87e-02

Pos logit contributions
itates+0.146
 hadn+0.132
Looks+0.129
bled+0.128
vere+0.126
Neg logit contributions
 oak-0.169
 newspaper-0.158
 cane-0.156
 pine-0.156
 goat-0.156
Token apple
Token ID17180
Feature activation+2.53e-01

Pos logit contributions
 oak+0.091
 goat+0.083
igsaw+0.083
 pine+0.082
 cane+0.081
Neg logit contributions
itates-0.148
 hadn-0.145
 certainly-0.138
 also-0.137
vere-0.137
Token made
Token ID925
Feature activation-1.60e-01

Pos logit contributions
 oak+0.433
igsaw+0.399
 cane+0.398
 pine+0.395
 newspaper+0.395
Neg logit contributions
itates-0.468
 hadn-0.454
vere-0.426
 also-0.423
Looks-0.419
Token you
Token ID345
Feature activation-1.50e-01

Pos logit contributions
itates+0.232
 hadn+0.216
Looks+0.205
vere+0.205
bled+0.203
Neg logit contributions
 oak-0.346
 cane-0.323
 pine-0.322
 newspaper-0.321
 goat-0.321
Token smiled
Token ID13541
Feature activation-1.34e-01

Pos logit contributions
 oak+0.292
 pine+0.274
 newspaper+0.272
 goat+0.272
 cane+0.270
Neg logit contributions
itates-0.186
 hadn-0.163
vere-0.163
Looks-0.160
bled-0.155
Token and
Token ID290
Feature activation+8.18e-02

Pos logit contributions
itates+0.332
 hadn+0.317
bled+0.308
Looks+0.305
vere+0.302
Neg logit contributions
 oak-0.157
 newspaper-0.137
 pine-0.135
igsaw-0.134
 goat-0.133
Token said
Token ID531
Feature activation-1.91e-01

Pos logit contributions
 oak+0.170
 pine+0.159
 goat+0.159
 newspaper+0.158
 pony+0.157
Neg logit contributions
itates-0.130
vere-0.116
 hadn-0.115
Looks-0.112
bled-0.109
Token,
Token ID11
Feature activation+1.43e-01

Pos logit contributions
itates+0.568
 hadn+0.546
Looks+0.535
vere+0.531
bled+0.530
Neg logit contributions
 oak-0.127
 newspaper-0.099
 cane-0.098
 pine-0.097
 goat-0.096
Token "
Token ID366
Feature activation-7.84e-02

Pos logit contributions
 oak+0.188
 pony+0.172
 goat+0.172
 pine+0.169
 cane+0.168
Neg logit contributions
itates-0.326
Looks-0.307
vere-0.299
 hadn-0.295
izes-0.289
TokenLooks
Token ID41102
Feature activation-3.65e-01

Pos logit contributions
itates+0.118
bled+0.108
vere+0.107
 hadn+0.106
izes+0.105
Neg logit contributions
 oak-0.164
 newspaper-0.157
 pine-0.156
 goat-0.156
 pony-0.154
Token like
Token ID588
Feature activation-8.49e-02

Pos logit contributions
itates+0.753
 hadn+0.702
bled+0.693
Looks+0.682
vere+0.675
Neg logit contributions
 oak-0.573
 newspaper-0.529
 pine-0.525
 cane-0.523
 goat-0.523
Token you
Token ID345
Feature activation-1.49e-01

Pos logit contributions
itates+0.170
bled+0.147
 hadn+0.146
Looks+0.146
vere+0.145
Neg logit contributions
 oak-0.149
 newspaper-0.141
 pine-0.139
 goat-0.138
 pony-0.136
Token like
Token ID588
Feature activation-8.53e-02

Pos logit contributions
itates+0.309
vere+0.292
Looks+0.288
bled+0.286
 hadn+0.282
Neg logit contributions
 oak-0.236
 goat-0.214
 newspaper-0.213
 pine-0.212
 cane-0.206
Token the
Token ID262
Feature activation+2.05e-01

Pos logit contributions
itates+0.159
 hadn+0.152
Looks+0.145
vere+0.144
bled+0.144
Neg logit contributions
 oak-0.148
 cane-0.135
 newspaper-0.135
 pine-0.134
 goat-0.134
Token wardrobe
Token ID35593
Feature activation+1.19e-01

Pos logit contributions
 oak+0.344
 newspaper+0.314
 cane+0.314
 pine+0.313
 goat+0.313
Neg logit contributions
itates-0.398
 hadn-0.378
Looks-0.364
vere-0.360
bled-0.356
Token's
Token ID338
Feature activation-7.83e-02

Pos logit contributions
 oak+0.051
 newspaper+0.046
 pine+0.045
 cane+0.045
igsaw+0.045
Neg logit contributions
itates-0.089
 hadn-0.085
bled-0.082
vere-0.082
Looks-0.081
Token mom
Token ID1995
Feature activation+1.35e-01

Pos logit contributions
Looks+0.213
itates+0.210
izes+0.204
 known+0.201
vere+0.200
Neg logit contributions
 bike-0.063
 oak-0.062
 dog-0.061
 newspaper-0.058
 cane-0.058
Tokenmy
Token ID1820
Feature activation-1.38e-02

Pos logit contributions
 oak+0.192
 pine+0.173
 cane+0.170
 pony+0.169
 goat+0.169
Neg logit contributions
itates-0.295
Looks-0.278
 hadn-0.273
bled-0.272
vere-0.271
Token.
Token ID13
Feature activation+8.27e-02

Pos logit contributions
itates+0.036
Looks+0.034
 hadn+0.033
vere+0.033
bled+0.032
Neg logit contributions
 oak-0.014
 cane-0.012
 pony-0.012
 pine-0.012
 goat-0.012
Token "
Token ID366
Feature activation-7.61e-02

Pos logit contributions
 oak+0.125
 goat+0.113
 cane+0.112
 pony+0.112
 newspaper+0.112
Neg logit contributions
itates-0.171
Looks-0.162
bled-0.159
vere-0.159
izes-0.158
TokenLooks
Token ID41102
Feature activation-3.65e-01

Pos logit contributions
vere+0.096
itates+0.095
bled+0.091
izes+0.090
 hadn+0.088
Neg logit contributions
 oak-0.166
 newspaper-0.158
 goat-0.158
 pigeon-0.155
 cane-0.155
Token like
Token ID588
Feature activation-8.60e-02

Pos logit contributions
itates+0.785
bled+0.740
Looks+0.703
 hadn+0.701
vere+0.681
Neg logit contributions
 oak-0.542
 goat-0.508
 newspaper-0.501
 cane-0.497
 pine-0.493
Token the
Token ID262
Feature activation+2.05e-01

Pos logit contributions
itates+0.168
 hadn+0.154
Looks+0.151
bled+0.148
vere+0.148
Neg logit contributions
 oak-0.149
 newspaper-0.137
 goat-0.135
 pine-0.135
 cane-0.135
Token wave
Token ID6769
Feature activation-2.89e-02

Pos logit contributions
 oak+0.261
 pine+0.233
 goat+0.232
 newspaper+0.231
 cane+0.224
Neg logit contributions
itates-0.507
 hadn-0.470
bled-0.461
vere-0.458
Looks-0.457
Token destroyed
Token ID6572
Feature activation-1.50e-02

Pos logit contributions
itates+0.060
 hadn+0.057
Looks+0.054
vere+0.054
bled+0.054
Neg logit contributions
 oak-0.045
 newspaper-0.040
 cane-0.040
 pine-0.040
 goat-0.040
Token all
Token ID477
Feature activation-2.94e-02

Pos logit contributions
 hadn+0.025
itates+0.024
 also+0.024
 certainly+0.023
bled+0.023
Neg logit contributions
 oak-0.026
 cane-0.025
igsaw-0.024
 goat-0.024
affe-0.024
Token table
Token ID3084
Feature activation+8.56e-02

Pos logit contributions
 oak+0.304
 pine+0.272
 cane+0.270
 goat+0.269
 newspaper+0.269
Neg logit contributions
itates-0.464
 hadn-0.443
Looks-0.426
vere-0.426
bled-0.423
Token.
Token ID13
Feature activation+8.91e-02

Pos logit contributions
 oak+0.093
 newspaper+0.082
 cane+0.081
 pine+0.080
 goat+0.077
Neg logit contributions
itates-0.218
Looks-0.206
 hadn-0.205
vere-0.200
bled-0.200
Token
Token ID198
Feature activation-6.40e-02

Pos logit contributions
 oak+0.173
 pony+0.159
 cane+0.158
 goat+0.157
 pine+0.157
Neg logit contributions
itates-0.153
 hadn-0.142
Looks-0.135
vere-0.133
bled-0.131
Token
Token ID198
Feature activation-6.27e-02

Pos logit contributions
itates+0.129
 hadn+0.125
Looks+0.122
bled+0.122
vere+0.118
Neg logit contributions
 oak-0.100
 pine-0.093
 newspaper-0.090
 cane-0.089
 goat-0.088
Token"
Token ID1
Feature activation+4.98e-02

Pos logit contributions
itates+0.134
 hadn+0.130
bled+0.126
vere+0.124
Looks+0.124
Neg logit contributions
 oak-0.091
 pine-0.083
 newspaper-0.082
 cane-0.081
 goat-0.080
TokenLooks
Token ID41102
Feature activation-3.65e-01

Pos logit contributions
 oak+0.102
 newspaper+0.095
 cane+0.094
 pine+0.094
 goat+0.094
Neg logit contributions
itates-0.080
 hadn-0.074
Looks-0.071
vere-0.070
bled-0.070
Token like
Token ID588
Feature activation-8.48e-02

Pos logit contributions
itates+0.766
 hadn+0.713
bled+0.712
Looks+0.694
vere+0.679
Neg logit contributions
 oak-0.558
 newspaper-0.522
 goat-0.519
 cane-0.512
 pine-0.511
Token mom
Token ID1995
Feature activation+1.36e-01

Pos logit contributions
itates+0.167
 hadn+0.160
Looks+0.153
vere+0.153
bled+0.152
Neg logit contributions
 oak-0.138
 cane-0.126
 pine-0.125
 newspaper-0.125
 goat-0.124
Token has
Token ID468
Feature activation-2.52e-01

Pos logit contributions
 oak+0.182
igsaw+0.170
 cane+0.167
 pine+0.161
 newspaper+0.160
Neg logit contributions
itates-0.307
 hadn-0.293
 also-0.285
vere-0.277
Looks-0.277
Token some
Token ID617
Feature activation+1.68e-02

Pos logit contributions
itates+0.517
 hadn+0.484
Looks+0.471
vere+0.467
bled+0.465
Neg logit contributions
 oak-0.400
 newspaper-0.365
 pine-0.362
 cane-0.362
 goat-0.361
Token mail
Token ID6920
Feature activation+5.99e-02

Pos logit contributions
 oak+0.023
 cane+0.021
 goat+0.020
 pine+0.020
 newspaper+0.020
Neg logit contributions
itates-0.037
 hadn-0.036
Looks-0.035
vere-0.035
bled-0.034
Token smiled
Token ID13541
Feature activation-1.30e-01

Pos logit contributions
 oak+0.104
 pine+0.097
 newspaper+0.097
 cane+0.097
 goat+0.096
Neg logit contributions
itates-0.079
 hadn-0.073
Looks-0.071
vere-0.071
bled-0.070
Token.
Token ID13
Feature activation+8.47e-02

Pos logit contributions
itates+0.328
 hadn+0.312
bled+0.303
Looks+0.302
vere+0.300
Neg logit contributions
 oak-0.147
 newspaper-0.127
igsaw-0.126
 pine-0.124
 goat-0.123
Token
Token ID198
Feature activation-6.66e-02

Pos logit contributions
 oak+0.137
 newspaper+0.126
 pine+0.125
 cane+0.125
 goat+0.124
Neg logit contributions
itates-0.167
 hadn-0.159
Looks-0.155
bled-0.155
vere-0.155
Token
Token ID198
Feature activation-6.46e-02

Pos logit contributions
itates+0.136
 hadn+0.131
Looks+0.128
bled+0.128
vere+0.126
Neg logit contributions
 oak-0.102
 pine-0.094
 newspaper-0.093
 cane-0.090
igsaw-0.089
Token"
Token ID1
Feature activation+4.89e-02

Pos logit contributions
itates+0.146
 hadn+0.143
bled+0.139
vere+0.138
Looks+0.135
Neg logit contributions
 oak-0.083
 pine-0.075
 newspaper-0.075
 cane-0.073
 goat-0.073
TokenLooks
Token ID41102
Feature activation-3.63e-01

Pos logit contributions
 oak+0.095
 goat+0.092
 pony+0.092
 pigeon+0.090
 pine+0.090
Neg logit contributions
itates-0.080
 hadn-0.077
vere-0.074
bled-0.073
eking-0.070
Token like
Token ID588
Feature activation-8.43e-02

Pos logit contributions
itates+0.743
 hadn+0.694
bled+0.690
Looks+0.671
vere+0.660
Neg logit contributions
 oak-0.574
 newspaper-0.531
 goat-0.530
 cane-0.528
 pine-0.527
Token the
Token ID262
Feature activation+2.12e-01

Pos logit contributions
itates+0.131
 hadn+0.126
vere+0.119
Looks+0.119
bled+0.117
Neg logit contributions
 oak-0.168
 cane-0.157
 newspaper-0.156
igsaw-0.156
 pine-0.154
Token frog
Token ID21264
Feature activation+1.54e-01

Pos logit contributions
 oak+0.203
 pine+0.171
 newspaper+0.169
 cane+0.169
 goat+0.169
Neg logit contributions
itates-0.572
 hadn-0.550
Looks-0.535
bled-0.533
vere-0.532
Token likes
Token ID7832
Feature activation-1.56e-01

Pos logit contributions
 oak+0.209
igsaw+0.190
 newspaper+0.188
 cane+0.187
 pine+0.184
Neg logit contributions
itates-0.341
 hadn-0.328
vere-0.313
Looks-0.311
 also-0.309
Token you
Token ID345
Feature activation-1.45e-01

Pos logit contributions
itates+0.266
 hadn+0.259
vere+0.249
 also+0.245
Looks+0.242
Neg logit contributions
 oak-0.276
igsaw-0.266
 cane-0.255
 newspaper-0.253
 automobile-0.246
Token felt
Token ID2936
Feature activation-2.40e-01

Pos logit contributions
itates+0.036
vere+0.035
Looks+0.032
bled+0.031
 hadn+0.031
Neg logit contributions
 oak-0.057
 goat-0.055
 pine-0.053
 cane-0.053
 newspaper-0.053
Token bad
Token ID2089
Feature activation+6.32e-02

Pos logit contributions
itates+0.422
 hadn+0.386
Looks+0.384
'd+0.381
bled+0.381
Neg logit contributions
 oak-0.440
 goat-0.411
 cane-0.407
 puppy-0.407
 dog-0.406
Token and
Token ID290
Feature activation+8.07e-02

Pos logit contributions
 oak+0.065
 newspaper+0.056
 cane+0.056
 goat+0.055
 pine+0.055
Neg logit contributions
itates-0.165
 hadn-0.158
Looks-0.154
vere-0.153
bled-0.152
Token wished
Token ID16555
Feature activation-2.12e-01

Pos logit contributions
 oak+0.164
 goat+0.158
 dog+0.153
 puppy+0.151
 pigeon+0.149
Neg logit contributions
itates-0.140
vere-0.129
 Deal-0.126
eking-0.116
ider-0.115
Token he
Token ID339
Feature activation-1.14e-01

Pos logit contributions
itates+0.465
Looks+0.418
 hadn+0.416
vere+0.405
bled+0.402
Neg logit contributions
 oak-0.320
 newspaper-0.290
 goat-0.289
 cane-0.287
 pine-0.285
Token hadn
Token ID8020
Feature activation-3.52e-01

Pos logit contributions
vere+0.167
 Deal+0.165
itates+0.159
 refreshed+0.149
Looks+0.146
Neg logit contributions
 oak-0.231
 puppy-0.227
 goat-0.227
 dog-0.225
 cane-0.222
Token't
Token ID470
Feature activation-2.11e-01

Pos logit contributions
itates+0.393
 hadn+0.342
Looks+0.335
bled+0.320
vere+0.312
Neg logit contributions
 oak-0.887
 pine-0.831
 cane-0.831
 newspaper-0.831
 goat-0.826
Token done
Token ID1760
Feature activation-1.58e-01

Pos logit contributions
itates+0.381
vere+0.335
bled+0.331
Looks+0.320
 hadn+0.317
Neg logit contributions
 oak-0.410
 pine-0.380
 newspaper-0.371
 goat-0.371
 pigeon-0.368
Token that
Token ID326
Feature activation+6.25e-02

Pos logit contributions
itates+0.298
 hadn+0.282
Looks+0.271
vere+0.269
bled+0.269
Neg logit contributions
 oak-0.275
 newspaper-0.251
 cane-0.251
 pine-0.250
 goat-0.249
Token.
Token ID13
Feature activation+7.79e-02

Pos logit contributions
 oak+0.107
igsaw+0.103
 pony+0.103
 goat+0.103
 cane+0.102
Neg logit contributions
 hadn-0.107
itates-0.105
vere-0.099
 refreshed-0.097
 also-0.097
Token He
Token ID679
Feature activation-1.54e-01

Pos logit contributions
 oak+0.131
 newspaper+0.120
 goat+0.119
 pine+0.118
 cane+0.118
Neg logit contributions
itates-0.154
 hadn-0.143
vere-0.141
bled-0.140
Looks-0.140
Token.
Token ID13
Feature activation+8.04e-02

Pos logit contributions
 oak+0.140
 newspaper+0.124
 cane+0.123
 pine+0.123
 goat+0.122
Neg logit contributions
itates-0.248
 hadn-0.234
Looks-0.230
vere-0.227
bled-0.226
Token The
Token ID383
Feature activation+9.53e-02

Pos logit contributions
 oak+0.148
 cane+0.140
 pony+0.139
 pine+0.137
 newspaper+0.136
Neg logit contributions
itates-0.142
 hadn-0.137
Looks-0.129
 also-0.122
 certainly-0.121
Token neighbor
Token ID4780
Feature activation+2.38e-01

Pos logit contributions
 oak+0.081
 cane+0.069
igsaw+0.068
 pine+0.068
 newspaper+0.066
Neg logit contributions
itates-0.258
 hadn-0.250
Looks-0.244
bled-0.241
vere-0.241
Token said
Token ID531
Feature activation-1.94e-01

Pos logit contributions
 oak+0.534
 pine+0.498
 goat+0.497
 newspaper+0.496
 cane+0.495
Neg logit contributions
itates-0.334
 hadn-0.294
Looks-0.292
vere-0.291
bled-0.288
Token they
Token ID484
Feature activation-1.40e-01

Pos logit contributions
 hadn+0.426
bled+0.424
itates+0.420
vere+0.408
 ok+0.404
Neg logit contributions
 oak-0.251
 newspaper-0.234
igsaw-0.230
 cane-0.230
 pine-0.225
Token hadn
Token ID8020
Feature activation-3.50e-01

Pos logit contributions
itates+0.240
vere+0.230
Looks+0.228
eking+0.224
 Deal+0.222
Neg logit contributions
 oak-0.252
 goat-0.243
 puppy-0.242
 cane-0.242
 automobile-0.237
Token't
Token ID470
Feature activation-2.11e-01

Pos logit contributions
itates+0.378
 hadn+0.327
Looks+0.320
bled+0.310
vere+0.302
Neg logit contributions
 oak-0.899
 newspaper-0.843
 cane-0.842
 pine-0.842
 goat-0.840
Token seen
Token ID1775
Feature activation-6.05e-02

Pos logit contributions
itates+0.397
 hadn+0.357
Looks+0.355
vere+0.355
bled+0.355
Neg logit contributions
 oak-0.382
 pine-0.350
 newspaper-0.348
 goat-0.348
 cane-0.345
Token the
Token ID262
Feature activation+2.06e-01

Pos logit contributions
 hadn+0.092
 actually+0.092
 ok+0.089
 also+0.089
 alright+0.087
Neg logit contributions
 oak-0.114
igsaw-0.110
 cane-0.107
 newspaper-0.105
round-0.104
Token toy
Token ID13373
Feature activation+2.41e-01

Pos logit contributions
 oak+0.210
igsaw+0.201
 pine+0.191
 cane+0.181
 pony+0.178
Neg logit contributions
 hadn-0.488
itates-0.477
 also-0.466
'd-0.452
 actually-0.451
Token but
Token ID475
Feature activation-1.00e-01

Pos logit contributions
 oak+0.350
igsaw+0.344
 pine+0.330
 newspaper+0.322
 cane+0.319
Neg logit contributions
 hadn-0.501
itates-0.492
 also-0.461
'd-0.459
bled-0.457
Token She
Token ID1375
Feature activation-1.55e-01

Pos logit contributions
 oak+0.143
 newspaper+0.132
 cane+0.132
 goat+0.132
 pine+0.131
Neg logit contributions
itates-0.124
 hadn-0.117
Looks-0.112
vere-0.109
bled-0.109
Token was
Token ID373
Feature activation-2.42e-01

Pos logit contributions
itates+0.248
vere+0.226
Looks+0.204
bled+0.201
eking+0.193
Neg logit contributions
 oak-0.341
 pine-0.331
 goat-0.321
 cane-0.316
 pony-0.316
Token glad
Token ID9675
Feature activation-1.57e-01

Pos logit contributions
itates+0.445
vere+0.410
Looks+0.394
izes+0.380
bled+0.361
Neg logit contributions
 oak-0.431
 pony-0.415
 pine-0.414
 ant-0.393
 apple-0.387
Token that
Token ID326
Feature activation+6.27e-02

Pos logit contributions
itates+0.315
 hadn+0.302
Looks+0.288
vere+0.286
bled+0.285
Neg logit contributions
 oak-0.252
 newspaper-0.230
 cane-0.229
 goat-0.227
 pine-0.227
Token she
Token ID673
Feature activation-1.08e-01

Pos logit contributions
 oak+0.097
 pine+0.087
 cane+0.087
 newspaper+0.086
 pony+0.086
Neg logit contributions
itates-0.137
Looks-0.122
bled-0.122
 hadn-0.121
vere-0.121
Token hadn
Token ID8020
Feature activation-3.50e-01

Pos logit contributions
vere+0.171
itates+0.170
 Deal+0.165
Looks+0.158
bled+0.150
Neg logit contributions
 oak-0.206
 goat-0.205
 puppy-0.199
 cane-0.199
 pony-0.197
Token't
Token ID470
Feature activation-2.09e-01

Pos logit contributions
itates+0.345
 hadn+0.311
Looks+0.284
vere+0.284
bled+0.278
Neg logit contributions
 oak-0.921
 newspaper-0.872
 cane-0.869
 pine-0.868
 goat-0.866
Token sne
Token ID10505
Feature activation-5.70e-02

Pos logit contributions
itates+0.302
bled+0.256
vere+0.256
Looks+0.244
 hadn+0.244
Neg logit contributions
 oak-0.488
 pine-0.454
 goat-0.449
 newspaper-0.445
 cane-0.443
Tokenez
Token ID8471
Feature activation+3.11e-02

Pos logit contributions
itates+0.110
 hadn+0.104
Looks+0.100
bled+0.100
vere+0.100
Neg logit contributions
 oak-0.096
 newspaper-0.088
 cane-0.088
 pine-0.088
igsaw-0.087
Tokened
Token ID276
Feature activation-2.86e-02

Pos logit contributions
 oak+0.056
 newspaper+0.051
 pine+0.051
 goat+0.051
 cane+0.051
Neg logit contributions
itates-0.058
 hadn-0.054
bled-0.053
vere-0.052
Looks-0.052
Token this
Token ID428
Feature activation+1.07e-01

Pos logit contributions
itates+0.057
 hadn+0.056
vere+0.052
Looks+0.052
bled+0.052
Neg logit contributions
 oak-0.045
 newspaper-0.041
 goat-0.041
 pine-0.041
igsaw-0.040
Token was
Token ID373
Feature activation-2.38e-01

Pos logit contributions
 oak+0.043
 cat+0.042
 dog+0.042
 puppy+0.041
 goat+0.041
Neg logit contributions
vere-0.045
itates-0.043
 Deal-0.041
Looks-0.041
bled-0.038
Token sad
Token ID6507
Feature activation-7.05e-02

Pos logit contributions
Looks+0.432
itates+0.421
'd+0.404
vere+0.394
 hadn+0.382
Neg logit contributions
 oak-0.401
 puppy-0.390
 pony-0.381
 automobile-0.378
 cat-0.377
Token and
Token ID290
Feature activation+8.06e-02

Pos logit contributions
itates+0.188
 hadn+0.180
Looks+0.178
bled+0.176
vere+0.175
Neg logit contributions
 oak-0.068
 cane-0.058
 newspaper-0.058
 pine-0.057
 goat-0.056
Token wished
Token ID16555
Feature activation-2.11e-01

Pos logit contributions
 oak+0.151
 dog+0.141
 puppy+0.137
 cat+0.137
 pony+0.133
Neg logit contributions
vere-0.143
itates-0.139
 Deal-0.135
Looks-0.128
 certainly-0.127
Token she
Token ID673
Feature activation-1.02e-01

Pos logit contributions
itates+0.477
Looks+0.430
 hadn+0.418
vere+0.416
bled+0.409
Neg logit contributions
 oak-0.296
 newspaper-0.274
 pony-0.270
 cane-0.269
 pine-0.263
Token hadn
Token ID8020
Feature activation-3.49e-01

Pos logit contributions
 Deal+0.151
itates+0.148
vere+0.147
Looks+0.143
 refreshed+0.141
Neg logit contributions
 cane-0.198
 puppy-0.196
 bike-0.194
 oak-0.194
 dog-0.193
Token't
Token ID470
Feature activation-2.09e-01

Pos logit contributions
itates+0.402
Looks+0.348
 hadn+0.342
bled+0.327
vere+0.319
Neg logit contributions
 oak-0.865
 cane-0.812
 pine-0.810
 newspaper-0.810
igsaw-0.801
Token reversed
Token ID17687
Feature activation-5.17e-02

Pos logit contributions
itates+0.367
Looks+0.326
bled+0.325
vere+0.315
 hadn+0.307
Neg logit contributions
 oak-0.401
 pine-0.373
 newspaper-0.373
 cane-0.370
 bike-0.370
Token the
Token ID262
Feature activation+2.03e-01

Pos logit contributions
itates+0.097
 hadn+0.094
 also+0.090
vere+0.088
bled+0.088
Neg logit contributions
 oak-0.088
 goat-0.083
 bull-0.080
 pigeon-0.078
 pine-0.077
Token books
Token ID3835
Feature activation+1.75e-01

Pos logit contributions
 oak+0.209
 goat+0.197
 pigeon+0.183
igsaw+0.178
 bull+0.171
Neg logit contributions
itates-0.519
 hadn-0.503
vere-0.492
Looks-0.473
'd-0.472
Token.
Token ID13
Feature activation+8.30e-02

Pos logit contributions
 oak+0.268
 newspaper+0.242
 cane+0.241
 pine+0.241
 goat+0.241
Neg logit contributions
itates-0.365
 hadn-0.346
Looks-0.334
vere-0.332
bled-0.331
Token see
Token ID766
Feature activation-1.42e-01

Pos logit contributions
 oak+0.206
 pine+0.185
 goat+0.184
 cane+0.181
 newspaper+0.180
Neg logit contributions
itates-0.192
Looks-0.174
 hadn-0.173
vere-0.169
bled-0.169
Token the
Token ID262
Feature activation+2.02e-01

Pos logit contributions
itates+0.279
 hadn+0.273
Looks+0.264
 also+0.263
vere+0.262
Neg logit contributions
 oak-0.223
igsaw-0.207
 cane-0.201
 newspaper-0.200
 goat-0.199
Token meat
Token ID6174
Feature activation+1.29e-01

Pos logit contributions
 oak+0.219
 newspaper+0.190
 cane+0.188
 pine+0.188
 goat+0.187
Neg logit contributions
itates-0.520
 hadn-0.497
Looks-0.484
vere-0.481
bled-0.481
Token,
Token ID11
Feature activation+1.41e-01

Pos logit contributions
 oak+0.154
 cane+0.139
 pine+0.138
 newspaper+0.137
 goat+0.136
Neg logit contributions
itates-0.318
Looks-0.298
 hadn-0.291
vere-0.288
bled-0.277
Token he
Token ID339
Feature activation-1.15e-01

Pos logit contributions
 oak+0.200
 newspaper+0.182
 pine+0.180
 goat+0.179
 cane+0.179
Neg logit contributions
itates-0.323
vere-0.294
bled-0.294
 hadn-0.292
Looks-0.292
Token hadn
Token ID8020
Feature activation-3.49e-01

Pos logit contributions
itates+0.188
vere+0.174
Looks+0.166
bled+0.165
 Deal+0.154
Neg logit contributions
 oak-0.239
 goat-0.227
 cane-0.227
 pine-0.225
 pony-0.222
Token't
Token ID470
Feature activation-2.08e-01

Pos logit contributions
itates+0.360
 hadn+0.317
Looks+0.299
bled+0.292
vere+0.290
Neg logit contributions
 oak-0.907
 newspaper-0.855
 cane-0.853
 pine-0.853
 goat-0.849
Token eaten
Token ID17065
Feature activation-4.15e-02

Pos logit contributions
itates+0.315
bled+0.275
vere+0.275
 hadn+0.273
Looks+0.270
Neg logit contributions
 oak-0.456
 pine-0.425
 newspaper-0.424
 goat-0.422
 cane-0.420
Token in
Token ID287
Feature activation+3.86e-02

Pos logit contributions
itates+0.081
 hadn+0.075
Looks+0.074
vere+0.073
bled+0.071
Neg logit contributions
 oak-0.070
 pine-0.065
 cane-0.064
 newspaper-0.064
 goat-0.063
Token a
Token ID257
Feature activation+1.01e-01

Pos logit contributions
 oak+0.058
 goat+0.052
 newspaper+0.052
 cane+0.052
 pony+0.052
Neg logit contributions
itates-0.080
 hadn-0.078
Looks-0.074
bled-0.074
vere-0.073
Token long
Token ID890
Feature activation+1.03e-01

Pos logit contributions
 oak+0.150
 newspaper+0.136
 cane+0.135
 pine+0.135
 goat+0.134
Neg logit contributions
itates-0.216
 hadn-0.204
Looks-0.198
bled-0.197
vere-0.196
Token seen
Token ID1775
Feature activation-7.96e-02

Pos logit contributions
 oak+0.222
 newspaper+0.207
 cane+0.206
 pine+0.206
 goat+0.206
Neg logit contributions
itates-0.143
 hadn-0.131
Looks-0.126
vere-0.124
bled-0.123
Token any
Token ID597
Feature activation+1.05e-01

Pos logit contributions
 hadn+0.153
itates+0.152
bled+0.144
Looks+0.142
vere+0.141
Neg logit contributions
 oak-0.126
igsaw-0.118
 cane-0.115
 newspaper-0.115
 goat-0.114
Token onions
Token ID24681
Feature activation+1.56e-01

Pos logit contributions
 oak+0.167
igsaw+0.156
 newspaper+0.153
 cane+0.153
 goat+0.151
Neg logit contributions
itates-0.196
 hadn-0.196
Looks-0.185
vere-0.185
 also-0.182
Token but
Token ID475
Feature activation-1.16e-01

Pos logit contributions
 oak+0.240
 cane+0.216
 newspaper+0.215
 goat+0.214
 pine+0.214
Neg logit contributions
itates-0.327
 hadn-0.304
Looks-0.300
vere-0.295
bled-0.293
Token they
Token ID484
Feature activation-1.42e-01

Pos logit contributions
itates+0.220
 hadn+0.205
Looks+0.199
vere+0.197
bled+0.196
Neg logit contributions
 oak-0.202
 newspaper-0.185
 cane-0.184
 pine-0.184
 goat-0.183
Token hadn
Token ID8020
Feature activation-3.48e-01

Pos logit contributions
itates+0.184
Looks+0.168
vere+0.158
bled+0.151
 refreshed+0.138
Neg logit contributions
 oak-0.343
 goat-0.319
 cane-0.319
 pine-0.311
 pigeon-0.309
Token't
Token ID470
Feature activation-2.06e-01

Pos logit contributions
itates+0.435
Looks+0.381
bled+0.362
 hadn+0.356
izes+0.345
Neg logit contributions
 oak-0.840
 pine-0.781
 cane-0.780
 newspaper-0.774
igsaw-0.773
Token.
Token ID13
Feature activation+8.72e-02

Pos logit contributions
itates+0.359
 hadn+0.329
Looks+0.321
bled+0.317
vere+0.317
Neg logit contributions
 oak-0.398
 pine-0.367
 cane-0.366
 newspaper-0.366
 goat-0.364
Token
Token ID198
Feature activation-6.23e-02

Pos logit contributions
 oak+0.147
 goat+0.134
 newspaper+0.134
 pine+0.133
igsaw+0.132
Neg logit contributions
itates-0.166
bled-0.155
 hadn-0.153
vere-0.153
Looks-0.151
Token
Token ID198
Feature activation-6.12e-02

Pos logit contributions
itates+0.126
bled+0.117
 hadn+0.116
Looks+0.114
vere+0.114
Neg logit contributions
 oak-0.098
 pine-0.092
 newspaper-0.090
 goat-0.089
 cane-0.087
TokenAfter
Token ID3260
Feature activation-1.37e-01

Pos logit contributions
itates+0.096
bled+0.088
 hadn+0.088
vere+0.086
 refreshed+0.083
Neg logit contributions
 oak-0.122
 pine-0.115
 newspaper-0.115
 goat-0.114
igsaw-0.113
Token the
Token ID262
Feature activation+2.07e-01

Pos logit contributions
 oak+0.123
 goat+0.111
 pine+0.111
 pony+0.110
 newspaper+0.109
Neg logit contributions
itates-0.172
vere-0.158
Looks-0.150
bled-0.148
 hadn-0.147
Token boy
Token ID2933
Feature activation+7.44e-02

Pos logit contributions
 oak+0.220
 newspaper+0.189
 cane+0.189
 goat+0.187
 pine+0.186
Neg logit contributions
itates-0.530
 hadn-0.510
Looks-0.495
vere-0.489
bled-0.488
Token replied
Token ID8712
Feature activation-1.85e-01

Pos logit contributions
 oak+0.117
 goat+0.114
 pony+0.114
 puppy+0.113
 pine+0.112
Neg logit contributions
itates-0.149
vere-0.139
Looks-0.138
 Deal-0.134
bled-0.133
Token that
Token ID326
Feature activation+6.60e-02

Pos logit contributions
itates+0.516
Looks+0.485
 hadn+0.483
vere+0.470
bled+0.464
Neg logit contributions
 oak-0.158
 pony-0.134
 cane-0.134
 newspaper-0.131
 goat-0.131
Token he
Token ID339
Feature activation-1.14e-01

Pos logit contributions
 oak+0.104
 pine+0.094
 newspaper+0.093
 cane+0.092
 goat+0.092
Neg logit contributions
itates-0.142
vere-0.127
Looks-0.126
 hadn-0.126
bled-0.126
Token hadn
Token ID8020
Feature activation-3.48e-01

Pos logit contributions
itates+0.187
vere+0.184
Looks+0.179
bled+0.176
bling+0.175
Neg logit contributions
 goat-0.217
 puppy-0.217
 cane-0.216
 oak-0.213
 cat-0.211
Token't
Token ID470
Feature activation-2.08e-01

Pos logit contributions
itates+0.401
Looks+0.342
 hadn+0.339
bled+0.332
vere+0.321
Neg logit contributions
 oak-0.874
 cane-0.817
 pine-0.814
 newspaper-0.813
 goat-0.810
Token.
Token ID13
Feature activation+7.65e-02

Pos logit contributions
itates+0.343
vere+0.299
Looks+0.294
bled+0.293
 hadn+0.280
Neg logit contributions
 oak-0.442
 pine-0.412
 goat-0.405
 cane-0.404
 newspaper-0.402
Token But
Token ID887
Feature activation-1.08e-01

Pos logit contributions
 oak+0.119
 cane+0.109
 newspaper+0.108
 goat+0.108
 pine+0.108
Neg logit contributions
itates-0.159
 hadn-0.152
Looks-0.146
vere-0.141
bled-0.141
Token then
Token ID788
Feature activation-1.21e-01

Pos logit contributions
itates+0.237
 hadn+0.226
Looks+0.219
vere+0.216
bled+0.216
Neg logit contributions
 oak-0.154
 newspaper-0.138
 cane-0.138
 pine-0.137
 goat-0.137
Token,
Token ID11
Feature activation+1.42e-01

Pos logit contributions
 hadn+0.287
itates+0.281
Looks+0.270
 also+0.268
bled+0.263
Neg logit contributions
 oak-0.141
igsaw-0.130
 newspaper-0.130
 cane-0.125
 bull-0.122
Token in
Token ID287
Feature activation+2.97e-02

Pos logit contributions
 oak+0.141
 newspaper+0.122
 pine+0.120
 goat+0.120
 cane+0.119
Neg logit contributions
itates-0.267
Looks-0.251
 hadn-0.250
vere-0.245
bled-0.245
Token the
Token ID262
Feature activation+1.92e-01

Pos logit contributions
 oak+0.038
 goat+0.035
 newspaper+0.034
 cane+0.034
 pony+0.033
Neg logit contributions
itates-0.068
 hadn-0.066
Looks-0.064
vere-0.063
 also-0.062
Token garden
Token ID11376
Feature activation+1.18e-01

Pos logit contributions
 oak+0.248
 cane+0.222
 newspaper+0.221
 goat+0.219
 pine+0.219
Neg logit contributions
itates-0.442
 hadn-0.422
Looks-0.411
vere-0.405
bled-0.403
Token that
Token ID326
Feature activation+5.59e-02

Pos logit contributions
 oak+0.120
 pine+0.100
 newspaper+0.099
 cane+0.096
 goat+0.094
Neg logit contributions
itates-0.318
Looks-0.297
 hadn-0.293
vere-0.291
bled-0.291
Token she
Token ID673
Feature activation-1.07e-01

Pos logit contributions
 oak+0.098
 pine+0.088
 newspaper+0.088
 cane+0.087
 goat+0.086
Neg logit contributions
itates-0.111
Looks-0.099
bled-0.098
vere-0.098
 hadn-0.097
Token hadn
Token ID8020
Feature activation-3.48e-01

Pos logit contributions
itates+0.187
bled+0.181
Looks+0.179
vere+0.175
 Deal+0.168
Neg logit contributions
 oak-0.193
 cane-0.177
 pine-0.176
 dog-0.175
 pony-0.174
Token't
Token ID470
Feature activation-2.07e-01

Pos logit contributions
itates+0.424
Looks+0.378
bled+0.355
 hadn+0.352
vere+0.337
Neg logit contributions
 oak-0.846
 cane-0.785
 pine-0.784
 newspaper-0.781
 pony-0.777
Token seen
Token ID1775
Feature activation-5.77e-02

Pos logit contributions
itates+0.398
Looks+0.359
 hadn+0.357
bled+0.356
vere+0.352
Neg logit contributions
 oak-0.369
 pine-0.337
 newspaper-0.336
 cane-0.333
 goat-0.329
Token before
Token ID878
Feature activation+5.75e-02

Pos logit contributions
itates+0.108
 hadn+0.106
bled+0.100
vere+0.099
Looks+0.098
Neg logit contributions
 oak-0.097
 goat-0.090
 cane-0.089
igsaw-0.089
 newspaper-0.089
Token.
Token ID13
Feature activation+8.49e-02

Pos logit contributions
 oak+0.062
 newspaper+0.053
 cane+0.053
 pine+0.053
 goat+0.052
Neg logit contributions
itates-0.149
 hadn-0.139
Looks-0.138
bled-0.137
vere-0.135
Token It
Token ID632
Feature activation-1.37e-01

Pos logit contributions
 oak+0.149
 newspaper+0.137
 cane+0.136
 pine+0.136
 goat+0.135
Neg logit contributions
itates-0.158
 hadn-0.148
Looks-0.143
bled-0.142
vere-0.142
Token to
Token ID284
Feature activation+1.08e-01

Pos logit contributions
 oak+0.106
affe+0.102
 goat+0.101
 pigeon+0.099
 bull+0.097
Neg logit contributions
itates-0.126
 hadn-0.119
 also-0.118
 refreshed-0.114
vere-0.112
Token his
Token ID465
Feature activation+9.23e-02

Pos logit contributions
 oak+0.163
 newspaper+0.147
 cane+0.147
 pine+0.147
 goat+0.145
Neg logit contributions
itates-0.231
 hadn-0.218
Looks-0.212
vere-0.210
bled-0.210
Token seat
Token ID5852
Feature activation+1.03e-01

Pos logit contributions
 oak+0.135
 goat+0.124
 newspaper+0.121
 pine+0.120
 pony+0.119
Neg logit contributions
itates-0.186
 hadn-0.185
vere-0.176
 also-0.176
bled-0.173
Token.
Token ID13
Feature activation+8.57e-02

Pos logit contributions
 oak+0.150
 goat+0.140
 pine+0.135
igsaw+0.134
 newspaper+0.134
Neg logit contributions
itates-0.213
 hadn-0.207
bled-0.199
vere-0.197
 also-0.196
Token He
Token ID679
Feature activation-1.53e-01

Pos logit contributions
 oak+0.138
 newspaper+0.126
 pine+0.125
 cane+0.124
igsaw+0.124
Neg logit contributions
itates-0.173
 hadn-0.163
bled-0.160
vere-0.160
Looks-0.159
Token hadn
Token ID8020
Feature activation-3.48e-01

Pos logit contributions
itates+0.253
vere+0.232
Looks+0.221
bled+0.221
 Deal+0.203
Neg logit contributions
 oak-0.316
 pine-0.305
 cane-0.297
 goat-0.293
 pony-0.291
Token't
Token ID470
Feature activation-2.08e-01

Pos logit contributions
itates+0.377
 hadn+0.326
Looks+0.319
bled+0.311
vere+0.302
Neg logit contributions
 oak-0.891
 pine-0.837
 newspaper-0.835
 cane-0.835
 goat-0.829
Token been
Token ID587
Feature activation-2.21e-01

Pos logit contributions
itates+0.298
bled+0.259
vere+0.256
Looks+0.251
 hadn+0.249
Neg logit contributions
 oak-0.471
 pine-0.441
 newspaper-0.439
 cane-0.437
 goat-0.435
Token too
Token ID1165
Feature activation-1.17e-01

Pos logit contributions
itates+0.332
Looks+0.297
 hadn+0.297
bled+0.288
vere+0.288
Neg logit contributions
 oak-0.466
 pine-0.438
 cane-0.436
 newspaper-0.436
 goat-0.433
Token worried
Token ID7960
Feature activation-9.73e-02

Pos logit contributions
itates+0.166
 hadn+0.153
Looks+0.143
vere+0.140
 also+0.140
Neg logit contributions
 oak-0.259
 newspaper-0.241
 goat-0.241
 cane-0.239
 pony-0.239
Token about
Token ID546
Feature activation-1.20e-01

Pos logit contributions
itates+0.186
 hadn+0.175
Looks+0.170
vere+0.168
bled+0.167
Neg logit contributions
 oak-0.166
 newspaper-0.152
 cane-0.152
 pine-0.151
 goat-0.150
Token.
Token ID13
Feature activation+7.67e-02

Pos logit contributions
 oak+0.219
 pine+0.192
 cane+0.189
 newspaper+0.185
 pony+0.184
Neg logit contributions
itates-0.461
Looks-0.438
 hadn-0.435
bled-0.422
vere-0.419
Token He
Token ID679
Feature activation-1.56e-01

Pos logit contributions
 oak+0.111
 newspaper+0.101
 pine+0.100
igsaw+0.097
 cane+0.096
Neg logit contributions
itates-0.164
bled-0.157
vere-0.156
Looks-0.153
 hadn-0.151
Token was
Token ID373
Feature activation-2.39e-01

Pos logit contributions
itates+0.266
vere+0.249
Looks+0.234
bled+0.225
 Deal+0.220
Neg logit contributions
 oak-0.321
 pine-0.315
 cane-0.298
 goat-0.297
 horse-0.295
Token happy
Token ID3772
Feature activation-8.99e-02

Pos logit contributions
itates+0.421
Looks+0.400
vere+0.396
izes+0.364
its+0.355
Neg logit contributions
 oak-0.427
 pony-0.418
 pine-0.415
 puppy-0.400
 horse-0.394
Token he
Token ID339
Feature activation-1.14e-01

Pos logit contributions
itates+0.215
 hadn+0.203
Looks+0.200
vere+0.197
bled+0.197
Neg logit contributions
 oak-0.113
 pine-0.100
 cane-0.100
 newspaper-0.100
 goat-0.099
Token hadn
Token ID8020
Feature activation-3.48e-01

Pos logit contributions
itates+0.164
Looks+0.152
vere+0.151
 Deal+0.147
bled+0.144
Neg logit contributions
 oak-0.239
 goat-0.230
 cane-0.229
 pony-0.224
 puppy-0.224
Token't
Token ID470
Feature activation-2.08e-01

Pos logit contributions
itates+0.396
 hadn+0.350
Looks+0.336
bled+0.326
vere+0.320
Neg logit contributions
 oak-0.872
 cane-0.818
 pine-0.818
 newspaper-0.817
 goat-0.814
Token been
Token ID587
Feature activation-2.20e-01

Pos logit contributions
itates+0.304
bled+0.263
vere+0.260
Looks+0.257
 hadn+0.257
Neg logit contributions
 oak-0.468
 pine-0.436
 goat-0.432
 cane-0.428
 newspaper-0.427
Token hurt
Token ID5938
Feature activation-1.50e-02

Pos logit contributions
itates+0.352
 hadn+0.323
Looks+0.318
bled+0.310
vere+0.308
Neg logit contributions
 oak-0.444
 pine-0.415
 goat-0.414
 cane-0.411
 newspaper-0.409
Token.
Token ID13
Feature activation+7.77e-02

Pos logit contributions
itates+0.030
 hadn+0.029
Looks+0.028
vere+0.028
bled+0.027
Neg logit contributions
 oak-0.024
 newspaper-0.022
 cane-0.022
 goat-0.022
 pine-0.022
Token<|endoftext|>
Token ID50256
Feature activation+3.73e-02

Pos logit contributions
 oak+0.111
 pine+0.101
 newspaper+0.101
igsaw+0.099
 cane+0.096
Neg logit contributions
itates-0.167
bled-0.162
vere-0.158
Looks-0.155
izes-0.154
Token toys
Token ID14958
Feature activation+1.28e-01

Pos logit contributions
 oak+0.069
 pine+0.064
 goat+0.062
 bull+0.060
 pony+0.058
Neg logit contributions
 hadn-0.100
itates-0.097
vere-0.093
'd-0.092
izes-0.091
Token and
Token ID290
Feature activation+8.18e-02

Pos logit contributions
 oak+0.177
 goat+0.159
 pine+0.159
 newspaper+0.157
 pony+0.154
Neg logit contributions
itates-0.280
 hadn-0.275
vere-0.260
bled-0.258
 also-0.256
Token books
Token ID3835
Feature activation+1.79e-01

Pos logit contributions
 oak+0.159
 newspaper+0.149
 cane+0.146
 pine+0.145
 bike+0.143
Neg logit contributions
itates-0.149
vere-0.130
Looks-0.125
bled-0.124
 Deal-0.120
Token that
Token ID326
Feature activation+6.61e-02

Pos logit contributions
 oak+0.264
 cane+0.240
 newspaper+0.239
 pine+0.233
igsaw+0.232
Neg logit contributions
itates-0.381
Looks-0.356
 hadn-0.350
vere-0.343
bled-0.343
Token he
Token ID339
Feature activation-1.12e-01

Pos logit contributions
 oak+0.120
 newspaper+0.109
 cane+0.109
 pine+0.109
 goat+0.107
Neg logit contributions
itates-0.125
Looks-0.110
bled-0.110
vere-0.109
 hadn-0.108
Token hadn
Token ID8020
Feature activation-3.48e-01

Pos logit contributions
vere+0.162
itates+0.161
 Deal+0.150
Looks+0.150
bled+0.144
Neg logit contributions
 oak-0.232
 cane-0.228
 goat-0.222
 dog-0.219
 puppy-0.219
Token't
Token ID470
Feature activation-2.09e-01

Pos logit contributions
itates+0.415
Looks+0.362
 hadn+0.356
bled+0.349
vere+0.332
Neg logit contributions
 oak-0.850
 cane-0.794
 pine-0.792
 newspaper-0.791
igsaw-0.791
Token tried
Token ID3088
Feature activation-1.72e-01

Pos logit contributions
itates+0.307
bled+0.259
vere+0.257
Looks+0.257
izes+0.236
Neg logit contributions
 oak-0.473
 pine-0.445
 newspaper-0.442
 cane-0.441
 goat-0.432
Token anything
Token ID1997
Feature activation+2.87e-02

Pos logit contributions
itates+0.387
 hadn+0.376
vere+0.359
Looks+0.356
bled+0.354
Neg logit contributions
 oak-0.234
 goat-0.208
 pine-0.204
 newspaper-0.204
 cane-0.202
Token new
Token ID649
Feature activation+9.71e-02

Pos logit contributions
 oak+0.042
 cane+0.038
 newspaper+0.038
 pine+0.038
 goat+0.037
Neg logit contributions
itates-0.062
 hadn-0.058
Looks-0.058
bled-0.057
vere-0.056
Token.
Token ID13
Feature activation+8.01e-02

Pos logit contributions
 oak+0.136
 goat+0.120
 newspaper+0.120
 pine+0.120
 cane+0.119
Neg logit contributions
itates-0.216
 hadn-0.208
vere-0.200
Looks-0.198
bled-0.197
Token sad
Token ID6507
Feature activation-6.89e-02

Pos logit contributions
itates+0.422
Looks+0.384
 hadn+0.374
'd+0.370
vere+0.369
Neg logit contributions
 oak-0.428
 cane-0.412
 pine-0.395
 newspaper-0.390
 olive-0.389
Token.
Token ID13
Feature activation+8.19e-02

Pos logit contributions
itates+0.172
 hadn+0.165
Looks+0.163
bled+0.160
vere+0.160
Neg logit contributions
 oak-0.077
 cane-0.068
 newspaper-0.068
 pine-0.067
 goat-0.066
Token He
Token ID679
Feature activation-1.54e-01

Pos logit contributions
 oak+0.127
 newspaper+0.115
igsaw+0.113
 cane+0.112
 pine+0.112
Neg logit contributions
itates-0.172
vere-0.158
Looks-0.158
 hadn-0.158
bled-0.157
Token wished
Token ID16555
Feature activation-2.08e-01

Pos logit contributions
vere+0.293
bled+0.278
bling+0.266
itates+0.262
Looks+0.246
Neg logit contributions
 oak-0.289
 puppy-0.279
 goat-0.278
 cane-0.276
 pine-0.267
Token he
Token ID339
Feature activation-1.10e-01

Pos logit contributions
itates+0.431
 hadn+0.398
Looks+0.390
vere+0.386
bled+0.382
Neg logit contributions
 oak-0.333
 newspaper-0.301
 pine-0.300
 cane-0.299
 goat-0.298
Token hadn
Token ID8020
Feature activation-3.48e-01

Pos logit contributions
vere+0.167
itates+0.164
 Deal+0.158
Looks+0.154
izes+0.150
Neg logit contributions
 oak-0.219
 puppy-0.214
 cane-0.214
 goat-0.210
 dog-0.208
Token't
Token ID470
Feature activation-2.07e-01

Pos logit contributions
itates+0.351
 hadn+0.309
Looks+0.291
vere+0.282
bled+0.282
Neg logit contributions
 oak-0.914
 newspaper-0.861
 cane-0.860
 pine-0.859
 goat-0.856
Token given
Token ID1813
Feature activation-9.18e-02

Pos logit contributions
itates+0.350
vere+0.304
bled+0.301
Looks+0.300
izes+0.288
Neg logit contributions
 oak-0.429
 pine-0.403
 newspaper-0.392
 cane-0.391
 goat-0.385
Token away
Token ID1497
Feature activation+9.87e-02

Pos logit contributions
 hadn+0.135
itates+0.132
bled+0.131
 also+0.130
 never+0.125
Neg logit contributions
 oak-0.182
 pony-0.171
igsaw-0.170
 bull-0.169
 cane-0.168
Token his
Token ID465
Feature activation+9.21e-02

Pos logit contributions
igsaw+0.182
 pony+0.175
 cane+0.173
 oak+0.170
 newspaper+0.169
Neg logit contributions
 also-0.150
 hadn-0.150
 not-0.150
 never-0.143
 actually-0.140
Token special
Token ID2041
Feature activation+6.34e-02

Pos logit contributions
 oak+0.125
 newspaper+0.116
igsaw+0.114
 cane+0.114
 pony+0.112
Neg logit contributions
itates-0.200
 hadn-0.196
bled-0.186
vere-0.185
Looks-0.183
Token were
Token ID547
Feature activation-2.65e-01

Pos logit contributions
 oak+0.258
 goat+0.234
 pine+0.234
 pony+0.229
 newspaper+0.229
Neg logit contributions
itates-0.233
vere-0.215
Looks-0.210
bled-0.200
 Deal-0.196
Token so
Token ID523
Feature activation-1.10e-01

Pos logit contributions
Looks+0.451
itates+0.432
'd+0.413
 hadn+0.401
bled+0.389
Neg logit contributions
 oak-0.474
 puppy-0.465
 pony-0.459
 dog-0.459
 goat-0.452
Token surprised
Token ID6655
Feature activation-1.52e-01

Pos logit contributions
Looks+0.176
'd+0.164
 hadn+0.161
 definitely+0.153
itates+0.152
Neg logit contributions
 oak-0.204
 pigeon-0.197
 dog-0.195
 ant-0.190
 goat-0.188
Token.
Token ID13
Feature activation+7.93e-02

Pos logit contributions
itates+0.304
 hadn+0.295
 also+0.281
vere+0.275
Looks+0.274
Neg logit contributions
 oak-0.248
 cane-0.226
igsaw-0.226
 newspaper-0.221
 pine-0.220
Token They
Token ID1119
Feature activation-2.05e-01

Pos logit contributions
 oak+0.140
 newspaper+0.129
 goat+0.127
 pine+0.127
 cane+0.127
Neg logit contributions
itates-0.146
 hadn-0.138
vere-0.134
bled-0.134
Looks-0.134
Token hadn
Token ID8020
Feature activation-3.48e-01

Pos logit contributions
itates+0.358
vere+0.330
bled+0.326
Looks+0.322
 Deal+0.308
Neg logit contributions
 oak-0.410
 goat-0.371
 pine-0.365
 puppy-0.354
 newspaper-0.353
Token't
Token ID470
Feature activation-2.07e-01

Pos logit contributions
itates+0.370
 hadn+0.320
Looks+0.313
bled+0.302
vere+0.297
Neg logit contributions
 oak-0.894
 newspaper-0.839
 pine-0.838
 cane-0.838
 goat-0.833
Token thought
Token ID1807
Feature activation-1.82e-01

Pos logit contributions
itates+0.307
 hadn+0.295
Looks+0.273
vere+0.269
bled+0.266
Neg logit contributions
 oak-0.438
 cane-0.410
 newspaper-0.408
igsaw-0.407
 goat-0.407
Token she
Token ID673
Feature activation-1.05e-01

Pos logit contributions
itates+0.363
 hadn+0.341
Looks+0.332
vere+0.327
bled+0.327
Neg logit contributions
 oak-0.298
 newspaper-0.272
 cane-0.271
 pine-0.270
 goat-0.269
Token was
Token ID373
Feature activation-2.41e-01

Pos logit contributions
itates+0.138
Looks+0.127
vere+0.127
bled+0.124
 refreshed+0.118
Neg logit contributions
 oak-0.243
 newspaper-0.234
 goat-0.234
 cane-0.232
 pine-0.229
Token awake
Token ID21693
Feature activation-6.88e-03

Pos logit contributions
itates+0.410
Looks+0.376
vere+0.360
bled+0.358
 hadn+0.357
Neg logit contributions
 oak-0.468
 goat-0.438
 newspaper-0.437
 pony-0.436
 cane-0.434
Token.
Token ID13
Feature activation+7.70e-02

Pos logit contributions
 oak+0.112
 newspaper+0.104
igsaw+0.104
 pine+0.102
 cane+0.101
Neg logit contributions
 hadn-0.130
itates-0.121
 refreshed-0.121
bled-0.120
'd-0.117
Token He
Token ID679
Feature activation-1.57e-01

Pos logit contributions
igsaw+0.111
 oak+0.105
 newspaper+0.102
 cane+0.101
 pine+0.099
Neg logit contributions
itates-0.157
vere-0.154
eking-0.154
izes-0.153
bled-0.150
Token was
Token ID373
Feature activation-2.41e-01

Pos logit contributions
itates+0.246
vere+0.230
bled+0.218
Looks+0.211
 Deal+0.204
Neg logit contributions
 oak-0.327
 pine-0.317
 goat-0.310
 pony-0.310
 cane-0.307
Token glad
Token ID9675
Feature activation-1.58e-01

Pos logit contributions
itates+0.419
vere+0.388
Looks+0.370
izes+0.361
 hadn+0.359
Neg logit contributions
 oak-0.440
 pony-0.429
 pine-0.423
 goat-0.412
 ant-0.409
Token he
Token ID339
Feature activation-1.15e-01

Pos logit contributions
itates+0.316
 hadn+0.312
Looks+0.292
bled+0.290
vere+0.289
Neg logit contributions
 oak-0.248
 newspaper-0.226
igsaw-0.224
 cane-0.222
 pine-0.220
Token hadn
Token ID8020
Feature activation-3.48e-01

Pos logit contributions
itates+0.172
vere+0.163
Looks+0.157
 Deal+0.156
bled+0.154
Neg logit contributions
 oak-0.228
 goat-0.224
 pony-0.223
 cane-0.223
 puppy-0.219
Token't
Token ID470
Feature activation-2.09e-01

Pos logit contributions
itates+0.402
 hadn+0.352
Looks+0.343
bled+0.331
vere+0.325
Neg logit contributions
 oak-0.863
 cane-0.811
 newspaper-0.809
 pine-0.808
 goat-0.806
Token stayed
Token ID9658
Feature activation-1.99e-01

Pos logit contributions
itates+0.301
bled+0.260
vere+0.258
Looks+0.253
 hadn+0.252
Neg logit contributions
 oak-0.475
 pine-0.443
 goat-0.438
 newspaper-0.436
 cane-0.435
Token afraid
Token ID7787
Feature activation-1.16e-01

Pos logit contributions
itates+0.378
 hadn+0.355
Looks+0.345
vere+0.339
bled+0.338
Neg logit contributions
 oak-0.343
 cane-0.316
 newspaper-0.314
 goat-0.313
 pine-0.312
Token behind
Token ID2157
Feature activation+8.36e-02

Pos logit contributions
itates+0.250
 hadn+0.238
Looks+0.230
vere+0.229
bled+0.227
Neg logit contributions
 oak-0.168
 newspaper-0.152
 cane-0.151
 pine-0.151
 goat-0.150
Token the
Token ID262
Feature activation+2.00e-01

Pos logit contributions
 oak+0.127
 newspaper+0.116
 goat+0.115
igsaw+0.115
 cane+0.114
Neg logit contributions
 hadn-0.167
itates-0.166
bled-0.156
vere-0.156
Looks-0.155
Token She
Token ID1375
Feature activation-1.43e-01

Pos logit contributions
 oak+0.137
 newspaper+0.125
 cane+0.125
 pine+0.125
 goat+0.124
Neg logit contributions
itates-0.145
 hadn-0.136
Looks-0.132
vere-0.131
bled-0.131
Token saw
Token ID2497
Feature activation-1.18e-01

Pos logit contributions
itates+0.246
vere+0.227
bled+0.227
Looks+0.216
bling+0.197
Neg logit contributions
 oak-0.286
 pine-0.272
 cane-0.267
 newspaper-0.263
 pony-0.262
Token some
Token ID617
Feature activation+1.76e-02

Pos logit contributions
itates+0.210
 hadn+0.209
bled+0.203
 also+0.203
Looks+0.200
Neg logit contributions
 oak-0.193
igsaw-0.184
 cane-0.175
 goat-0.174
 newspaper-0.171
Token glue
Token ID22749
Feature activation+8.53e-02

Pos logit contributions
 oak+0.025
igsaw+0.023
 goat+0.023
 cane+0.023
 pine+0.023
Neg logit contributions
itates-0.036
 hadn-0.035
Looks-0.034
vere-0.034
bled-0.034
Token on
Token ID319
Feature activation+3.06e-02

Pos logit contributions
 oak+0.108
igsaw+0.097
 pine+0.096
 newspaper+0.096
 goat+0.096
Neg logit contributions
itates-0.193
 hadn-0.191
bled-0.184
 also-0.181
vere-0.179
Token the
Token ID262
Feature activation+2.04e-01

Pos logit contributions
 oak+0.057
affe+0.056
 goat+0.055
round+0.054
 pony+0.053
Neg logit contributions
 also-0.049
 not-0.045
 never-0.044
 actually-0.043
 hadn-0.043
Token floor
Token ID4314
Feature activation+2.80e-02

Pos logit contributions
 oak+0.302
igsaw+0.277
 pine+0.271
 goat+0.267
 pony+0.260
Neg logit contributions
 hadn-0.403
itates-0.401
vere-0.386
bled-0.379
 also-0.378
Token.
Token ID13
Feature activation+7.79e-02

Pos logit contributions
 oak+0.027
 newspaper+0.023
 cane+0.023
 pine+0.023
 goat+0.022
Neg logit contributions
itates-0.075
 hadn-0.071
Looks-0.070
vere-0.069
bled-0.069
Token She
Token ID1375
Feature activation-1.46e-01

Pos logit contributions
 oak+0.150
 cane+0.141
 goat+0.140
 pony+0.139
 newspaper+0.139
Neg logit contributions
itates-0.128
 hadn-0.121
Looks-0.116
vere-0.111
 also-0.111
Token thought
Token ID1807
Feature activation-1.79e-01

Pos logit contributions
itates+0.265
vere+0.253
 refreshed+0.230
bled+0.229
 Deal+0.228
Neg logit contributions
 oak-0.262
 pine-0.259
 newspaper-0.249
 cane-0.249
 goat-0.247
Token it
Token ID340
Feature activation-6.65e-03

Pos logit contributions
itates+0.304
 hadn+0.300
bled+0.284
vere+0.283
Looks+0.277
Neg logit contributions
 oak-0.335
 goat-0.308
 pine-0.304
 newspaper-0.304
 cane-0.304
Token stop
Token ID2245
Feature activation-8.39e-02

Pos logit contributions
 oak+0.217
 pine+0.200
 newspaper+0.199
 goat+0.199
 cane+0.198
Neg logit contributions
itates-0.185
 hadn-0.166
Looks-0.163
bled-0.161
vere-0.160
Token slapping
Token ID48966
Feature activation-6.33e-02

Pos logit contributions
itates+0.185
 hadn+0.174
Looks+0.172
vere+0.169
bled+0.167
Neg logit contributions
 oak-0.119
 goat-0.108
 pine-0.108
 newspaper-0.107
 cane-0.107
Token and
Token ID290
Feature activation+8.48e-02

Pos logit contributions
itates+0.126
 hadn+0.115
 also+0.115
bled+0.114
izes+0.112
Neg logit contributions
 oak-0.106
igsaw-0.094
 pine-0.092
 pony-0.090
 goat-0.089
Token to
Token ID284
Feature activation+1.08e-01

Pos logit contributions
 oak+0.158
 newspaper+0.148
 cane+0.147
 pine+0.146
 bike+0.146
Neg logit contributions
itates-0.151
 hadn-0.138
Looks-0.137
bled-0.137
vere-0.133
Token share
Token ID2648
Feature activation-1.11e-01

Pos logit contributions
 oak+0.150
 pine+0.134
 goat+0.134
 newspaper+0.133
 cane+0.132
Neg logit contributions
itates-0.249
 hadn-0.234
Looks-0.227
vere-0.227
bled-0.225
Token the
Token ID262
Feature activation+2.09e-01

Pos logit contributions
itates+0.199
 also+0.191
 not+0.183
 hadn+0.183
Looks+0.176
Neg logit contributions
 oak-0.204
igsaw-0.188
 goat-0.183
 bull-0.177
 pony-0.175
Token toys
Token ID14958
Feature activation+1.26e-01

Pos logit contributions
 oak+0.253
 goat+0.224
igsaw+0.222
 cane+0.221
 pine+0.219
Neg logit contributions
itates-0.488
 hadn-0.472
vere-0.458
Looks-0.458
 also-0.448
Token.
Token ID13
Feature activation+7.83e-02

Pos logit contributions
 oak+0.201
 goat+0.187
 pine+0.182
 newspaper+0.181
 bull+0.179
Neg logit contributions
itates-0.254
 also-0.239
 hadn-0.236
vere-0.233
bled-0.227
Token She
Token ID1375
Feature activation-1.52e-01

Pos logit contributions
 oak+0.125
 newspaper+0.115
 cane+0.114
 pine+0.113
 goat+0.113
Neg logit contributions
itates-0.157
 hadn-0.151
Looks-0.146
bled-0.145
vere-0.144
Token says
Token ID1139
Feature activation-1.77e-01

Pos logit contributions
itates+0.266
bled+0.257
 hadn+0.253
vere+0.247
Looks+0.239
Neg logit contributions
 oak-0.283
 pine-0.275
 newspaper-0.268
 cane-0.268
 pony-0.266
Token slapping
Token ID48966
Feature activation-6.42e-02

Pos logit contributions
itates+0.280
 hadn+0.266
bled+0.252
vere+0.251
Looks+0.245
Neg logit contributions
 oak-0.359
 newspaper-0.333
 cane-0.332
igsaw-0.331
 pine-0.331
Token Mom
Token ID11254
Feature activation+4.19e-02

Pos logit contributions
 also+0.155
bled+0.154
itates+0.147
 still+0.144
 okay+0.143
Neg logit contributions
igsaw-0.236
 oak-0.222
cone-0.207
con-0.205
 cane-0.202
Token and
Token ID290
Feature activation+8.58e-02

Pos logit contributions
 oak+0.063
igsaw+0.062
 goat+0.054
 newspaper+0.054
 cane+0.054
Neg logit contributions
itates-0.087
 also-0.083
 refreshed-0.080
 hadn-0.080
bled-0.077
Token went
Token ID1816
Feature activation-1.06e-01

Pos logit contributions
 oak+0.196
igsaw+0.185
 newspaper+0.181
 cane+0.180
 pine+0.180
Neg logit contributions
itates-0.112
 hadn-0.108
Looks-0.100
bled-0.099
 also-0.098
Token back
Token ID736
Feature activation+7.00e-02

Pos logit contributions
itates+0.227
bled+0.205
vere+0.203
 hadn+0.203
 refreshed+0.201
Neg logit contributions
 oak-0.158
igsaw-0.145
 newspaper-0.144
 goat-0.142
 cane-0.141
Token to
Token ID284
Feature activation+1.08e-01

Pos logit contributions
affe+0.157
igsaw+0.142
 pigeon+0.140
 oak+0.137
 goat+0.137
Neg logit contributions
itates-0.099
 also-0.094
 not-0.089
 never-0.088
 still-0.086
Token the
Token ID262
Feature activation+2.10e-01

Pos logit contributions
 oak+0.171
igsaw+0.162
 cane+0.159
 newspaper+0.158
 goat+0.157
Neg logit contributions
itates-0.205
 hadn-0.200
 also-0.196
 refreshed-0.192
bled-0.190
Token garden
Token ID11376
Feature activation+1.22e-01

Pos logit contributions
 oak+0.249
 newspaper+0.218
 cane+0.218
 pine+0.216
 goat+0.215
Neg logit contributions
itates-0.514
 hadn-0.491
Looks-0.478
vere-0.474
bled-0.473
Token.
Token ID13
Feature activation+8.51e-02

Pos logit contributions
 oak+0.180
igsaw+0.174
 newspaper+0.169
 goat+0.168
 cane+0.167
Neg logit contributions
itates-0.247
 hadn-0.235
 also-0.231
 refreshed-0.228
vere-0.224
Token They
Token ID1119
Feature activation-2.04e-01

Pos logit contributions
 oak+0.167
 cane+0.155
 goat+0.154
 newspaper+0.154
 pony+0.153
Neg logit contributions
itates-0.144
 hadn-0.134
Looks-0.129
bled-0.124
vere-0.124
Token took
Token ID1718
Feature activation-1.66e-01

Pos logit contributions
itates+0.316
vere+0.295
Looks+0.273
bled+0.272
izes+0.264
Neg logit contributions
 oak-0.442
 pine-0.417
 goat-0.409
 cane-0.409
 newspaper-0.403
Token their
Token ID511
Feature activation+5.40e-02

Pos logit contributions
bled+0.232
 refreshed+0.225
 also+0.222
hes+0.222
itates+0.222
Neg logit contributions
 oak-0.337
igsaw-0.311
 cane-0.311
 newspaper-0.309
cone-0.308
Token
Token ID198
Feature activation-6.31e-02

Pos logit contributions
 oak+0.142
 newspaper+0.130
 pine+0.130
 cane+0.129
 goat+0.129
Neg logit contributions
itates-0.143
 hadn-0.134
Looks-0.130
vere-0.129
bled-0.129
Token
Token ID198
Feature activation-6.21e-02

Pos logit contributions
itates+0.121
 hadn+0.115
Looks+0.112
bled+0.112
vere+0.110
Neg logit contributions
 oak-0.106
 pine-0.098
 newspaper-0.098
 cane-0.096
 goat-0.096
TokenWhen
Token ID2215
Feature activation-6.53e-02

Pos logit contributions
itates+0.119
 hadn+0.113
bled+0.108
Looks+0.108
vere+0.108
Neg logit contributions
 oak-0.107
 pine-0.098
 newspaper-0.098
 goat-0.097
 cane-0.097
Token he
Token ID339
Feature activation-1.10e-01

Pos logit contributions
itates+0.126
 hadn+0.120
Looks+0.116
bled+0.114
vere+0.112
Neg logit contributions
 oak-0.109
igsaw-0.102
 newspaper-0.102
 cane-0.101
 goat-0.099
Token entered
Token ID5982
Feature activation-2.30e-02

Pos logit contributions
itates+0.188
 hadn+0.174
Looks+0.169
vere+0.169
bled+0.168
Neg logit contributions
 oak-0.215
 pine-0.198
 newspaper-0.198
 cane-0.197
 goat-0.196
Token the
Token ID262
Feature activation+2.06e-01

Pos logit contributions
 also+0.038
 refreshed+0.038
itates+0.037
 never+0.037
bled+0.037
Neg logit contributions
 goat-0.042
affe-0.041
 pigeon-0.041
 oak-0.039
 newspaper-0.038
Token living
Token ID2877
Feature activation-5.41e-02

Pos logit contributions
 goat+0.278
 oak+0.272
 cane+0.269
 pony+0.263
igsaw+0.258
Neg logit contributions
itates-0.449
 also-0.419
 hadn-0.412
 refreshed-0.401
vere-0.393
Token room
Token ID2119
Feature activation+6.24e-02

Pos logit contributions
itates+0.113
 hadn+0.108
Looks+0.104
vere+0.103
bled+0.103
Neg logit contributions
 oak-0.083
 newspaper-0.075
 cane-0.075
 pine-0.075
 goat-0.074
Token,
Token ID11
Feature activation+1.44e-01

Pos logit contributions
 oak+0.045
 newspaper+0.036
 cane+0.036
 goat+0.036
 pine+0.035
Neg logit contributions
itates-0.181
 hadn-0.174
Looks-0.169
vere-0.169
bled-0.169
Token he
Token ID339
Feature activation-1.11e-01

Pos logit contributions
 oak+0.229
 pine+0.207
 newspaper+0.206
 cane+0.206
 goat+0.205
Neg logit contributions
itates-0.294
 hadn-0.276
Looks-0.268
vere-0.267
bled-0.265
Token saw
Token ID2497
Feature activation-1.21e-01

Pos logit contributions
itates+0.258
vere+0.257
Looks+0.248
 Deal+0.245
bled+0.244
Neg logit contributions
 oak-0.146
 pine-0.138
 cane-0.129
 goat-0.127
 newspaper-0.124
Token do
Token ID466
Feature activation-1.52e-01

Pos logit contributions
itates+0.390
 hadn+0.366
Looks+0.356
bled+0.352
vere+0.350
Neg logit contributions
 oak-0.328
 cane-0.298
 newspaper-0.298
 pine-0.297
 goat-0.295
Token it
Token ID340
Feature activation+4.44e-03

Pos logit contributions
itates+0.305
 hadn+0.281
Looks+0.277
bled+0.272
vere+0.269
Neg logit contributions
 oak-0.246
 cane-0.228
 newspaper-0.226
 pine-0.226
 pony-0.222
Token.
Token ID13
Feature activation+8.92e-02

Pos logit contributions
 oak+0.006
 newspaper+0.005
 cane+0.005
 pine+0.005
 goat+0.005
Neg logit contributions
itates-0.010
 hadn-0.010
Looks-0.010
bled-0.010
vere-0.010
Token He
Token ID679
Feature activation-1.41e-01

Pos logit contributions
 oak+0.152
 pine+0.141
 goat+0.138
 cane+0.138
 newspaper+0.134
Neg logit contributions
itates-0.173
 hadn-0.161
Looks-0.156
bled-0.155
vere-0.155
Token dropped
Token ID5710
Feature activation-4.47e-02

Pos logit contributions
itates+0.212
vere+0.195
Looks+0.185
bled+0.181
 refreshed+0.172
Neg logit contributions
 oak-0.304
 pine-0.289
 cane-0.289
 goat-0.288
 newspaper-0.280
Token the
Token ID262
Feature activation+2.13e-01

Pos logit contributions
 hadn+0.069
 also+0.069
bled+0.069
itates+0.066
Looks+0.066
Neg logit contributions
 oak-0.080
igsaw-0.075
 goat-0.074
 pigeon-0.073
affe-0.073
Token balls
Token ID11333
Feature activation+1.82e-01

Pos logit contributions
 oak+0.295
 pine+0.259
 goat+0.259
igsaw+0.258
 newspaper+0.256
Neg logit contributions
itates-0.462
 hadn-0.449
vere-0.440
bled-0.428
Looks-0.426
Token every
Token ID790
Feature activation+6.72e-02

Pos logit contributions
 oak+0.219
 goat+0.197
 pine+0.195
 newspaper+0.194
 pony+0.190
Neg logit contributions
itates-0.430
 hadn-0.421
bled-0.405
vere-0.404
 also-0.399
Token time
Token ID640
Feature activation+1.14e-01

Pos logit contributions
 oak+0.102
 pine+0.091
 goat+0.090
 newspaper+0.089
igsaw+0.089
Neg logit contributions
 hadn-0.135
itates-0.135
 also-0.126
'd-0.126
vere-0.124
Token.
Token ID13
Feature activation+8.78e-02

Pos logit contributions
 oak+0.135
 newspaper+0.118
 goat+0.117
igsaw+0.116
 cane+0.116
Neg logit contributions
 hadn-0.272
itates-0.271
bled-0.257
vere-0.256
Looks-0.253
Token But
Token ID887
Feature activation-9.81e-02

Pos logit contributions
 oak+0.149
 pine+0.139
 goat+0.135
 cane+0.135
 newspaper+0.131
Neg logit contributions
itates-0.172
 hadn-0.160
Looks-0.155
vere-0.154
bled-0.154
TokenJohn
Token ID7554
Feature activation+4.56e-02

Pos logit contributions
itates+0.112
bled+0.102
 hadn+0.102
vere+0.100
Looks+0.096
Neg logit contributions
 oak-0.106
 pine-0.100
 newspaper-0.099
 goat-0.098
 pony-0.097
Token and
Token ID290
Feature activation+8.59e-02

Pos logit contributions
 oak+0.090
 pine+0.084
 goat+0.083
 cane+0.082
 newspaper+0.082
Neg logit contributions
itates-0.077
vere-0.071
Looks-0.068
 hadn-0.067
bled-0.067
Token his
Token ID465
Feature activation+9.72e-02

Pos logit contributions
 oak+0.142
 newspaper+0.139
igsaw+0.135
 cane+0.133
cone+0.133
Neg logit contributions
 hadn-0.151
 also-0.144
 known-0.142
 vowed-0.138
 never-0.136
Token mom
Token ID1995
Feature activation+1.39e-01

Pos logit contributions
 oak+0.167
 pine+0.154
igsaw+0.154
 newspaper+0.153
 cane+0.150
Neg logit contributions
itates-0.171
 hadn-0.171
bled-0.163
vere-0.162
 also-0.157
Token finished
Token ID5201
Feature activation-3.25e-03

Pos logit contributions
 oak+0.267
 pine+0.245
 newspaper+0.243
 goat+0.242
 cane+0.240
Neg logit contributions
itates-0.248
vere-0.225
Looks-0.219
 hadn-0.218
bled-0.215
Token their
Token ID511
Feature activation+5.67e-02

Pos logit contributions
itates+0.007
 hadn+0.006
Looks+0.006
bled+0.006
vere+0.006
Neg logit contributions
 oak-0.005
 goat-0.005
 cane-0.005
 newspaper-0.005
 pine-0.005
Token shopping
Token ID9735
Feature activation+8.75e-02

Pos logit contributions
 oak+0.090
 newspaper+0.081
 cane+0.081
 pine+0.081
 goat+0.081
Neg logit contributions
itates-0.116
 hadn-0.109
Looks-0.106
vere-0.105
bled-0.105
Token and
Token ID290
Feature activation+8.37e-02

Pos logit contributions
 oak+0.107
 goat+0.094
 pine+0.094
 cane+0.094
 newspaper+0.094
Neg logit contributions
itates-0.206
 hadn-0.199
vere-0.192
bled-0.192
Looks-0.191
Token went
Token ID1816
Feature activation-1.05e-01

Pos logit contributions
 oak+0.158
 newspaper+0.145
 goat+0.145
 pine+0.145
 cane+0.143
Neg logit contributions
itates-0.153
vere-0.137
Looks-0.132
 hadn-0.131
bled-0.129
Token home
Token ID1363
Feature activation+1.96e-01

Pos logit contributions
itates+0.240
 hadn+0.228
Looks+0.221
vere+0.220
bled+0.220
Neg logit contributions
 oak-0.140
 newspaper-0.124
 cane-0.123
 pine-0.123
 goat-0.123
Token.
Token ID13
Feature activation+8.61e-02

Pos logit contributions
 oak+0.246
 cane+0.217
 pine+0.217
 newspaper+0.215
 pony+0.212
Neg logit contributions
itates-0.466
Looks-0.439
 hadn-0.438
bled-0.422
vere-0.420
Token to
Token ID284
Feature activation+1.13e-01

Pos logit contributions
itates+0.211
 hadn+0.192
vere+0.190
bled+0.190
Looks+0.188
Neg logit contributions
 oak-0.166
 newspaper-0.151
igsaw-0.149
 goat-0.148
 cane-0.148
Token the
Token ID262
Feature activation+2.10e-01

Pos logit contributions
 oak+0.174
igsaw+0.163
 newspaper+0.160
 cane+0.156
 goat+0.156
Neg logit contributions
itates-0.221
 hadn-0.211
 also-0.209
 refreshed-0.206
vere-0.203
Token elevator
Token ID20932
Feature activation+1.12e-01

Pos logit contributions
 oak+0.265
igsaw+0.247
 cane+0.236
 goat+0.235
 newspaper+0.231
Neg logit contributions
itates-0.473
 hadn-0.449
 also-0.436
Looks-0.434
vere-0.432
Token.
Token ID13
Feature activation+8.96e-02

Pos logit contributions
 oak+0.165
 goat+0.154
 newspaper+0.151
 cane+0.150
igsaw+0.149
Neg logit contributions
itates-0.239
 hadn-0.223
vere-0.217
 also-0.215
Looks-0.212
Token<|endoftext|>
Token ID50256
Feature activation+4.43e-02

Pos logit contributions
 oak+0.174
 newspaper+0.160
 cane+0.159
igsaw+0.158
 goat+0.158
Neg logit contributions
itates-0.153
 hadn-0.143
Looks-0.137
vere-0.134
bled-0.132
TokenTom
Token ID13787
Feature activation+8.29e-02

Pos logit contributions
 oak+0.082
 goat+0.076
 newspaper+0.075
 cane+0.075
 pony+0.075
Neg logit contributions
itates-0.080
 hadn-0.073
Looks-0.070
vere-0.069
bled-0.068
Token and
Token ID290
Feature activation+8.70e-02

Pos logit contributions
 oak+0.134
igsaw+0.132
 newspaper+0.121
 pony+0.121
 cane+0.120
Neg logit contributions
itates-0.164
 hadn-0.161
 also-0.157
'd-0.152
 certainly-0.145
Token Sam
Token ID3409
Feature activation+7.09e-02

Pos logit contributions
 oak+0.136
igsaw+0.134
 newspaper+0.125
 cane+0.123
 pine+0.122
Neg logit contributions
 hadn-0.167
itates-0.166
 also-0.161
Looks-0.153
'd-0.152
Token are
Token ID389
Feature activation-2.39e-01

Pos logit contributions
 oak+0.163
igsaw+0.159
 newspaper+0.156
 cane+0.155
 goat+0.154
Neg logit contributions
itates-0.088
 hadn-0.086
 also-0.077
Looks-0.076
'd-0.075
Token friends
Token ID2460
Feature activation+7.08e-02

Pos logit contributions
itates+0.537
Looks+0.505
 hadn+0.501
vere+0.478
bled+0.472
Neg logit contributions
 oak-0.340
 pine-0.314
 newspaper-0.313
 pony-0.310
 goat-0.310
Token.
Token ID13
Feature activation+8.95e-02

Pos logit contributions
 oak+0.091
 newspaper+0.081
 cane+0.080
 pine+0.080
 goat+0.080
Neg logit contributions
itates-0.166
 hadn-0.158
Looks-0.154
vere-0.152
bled-0.152
Token.
Token ID13
Feature activation+8.47e-02

Pos logit contributions
itates+0.247
 hadn+0.236
bled+0.227
vere+0.226
Looks+0.223
Neg logit contributions
 oak-0.170
 newspaper-0.151
 pine-0.151
igsaw-0.149
 goat-0.149
Token Tim
Token ID5045
Feature activation+3.35e-02

Pos logit contributions
 oak+0.120
 newspaper+0.107
 cane+0.106
 goat+0.106
 pine+0.106
Neg logit contributions
itates-0.189
 hadn-0.178
Looks-0.174
vere-0.173
bled-0.172
Tokenmy
Token ID1820
Feature activation-2.33e-02

Pos logit contributions
 oak+0.047
 newspaper+0.042
 goat+0.042
 cane+0.041
 pine+0.041
Neg logit contributions
itates-0.075
vere-0.072
bled-0.070
Looks-0.070
 hadn-0.070
Token felt
Token ID2936
Feature activation-2.30e-01

Pos logit contributions
itates+0.040
vere+0.040
Looks+0.038
bled+0.037
 Deal+0.035
Neg logit contributions
 oak-0.044
 goat-0.043
 cane-0.041
 pony-0.041
 puppy-0.041
Token bad
Token ID2089
Feature activation+6.84e-02

Pos logit contributions
itates+0.428
vere+0.386
Looks+0.385
'd+0.384
bled+0.383
Neg logit contributions
 oak-0.406
 puppy-0.382
 cane-0.380
 dog-0.373
 goat-0.372
Token and
Token ID290
Feature activation+8.51e-02

Pos logit contributions
 oak+0.067
 newspaper+0.060
 cane+0.060
 goat+0.059
 pine+0.056
Neg logit contributions
itates-0.179
 hadn-0.171
Looks-0.170
vere-0.168
bled-0.168
Token decided
Token ID3066
Feature activation-2.00e-01

Pos logit contributions
 oak+0.169
 goat+0.158
 puppy+0.155
 dog+0.153
 pine+0.152
Neg logit contributions
itates-0.158
vere-0.146
 Deal-0.137
bling-0.128
bled-0.126
Token to
Token ID284
Feature activation+1.11e-01

Pos logit contributions
itates+0.523
 hadn+0.493
Looks+0.489
vere+0.484
bled+0.482
Neg logit contributions
 oak-0.209
 newspaper-0.181
 cane-0.180
 goat-0.179
 pine-0.175
Token share
Token ID2648
Feature activation-1.09e-01

Pos logit contributions
 oak+0.193
 automobile+0.171
 cane+0.169
 goat+0.169
 puppy+0.167
Neg logit contributions
itates-0.225
Looks-0.204
 hadn-0.198
bled-0.198
vere-0.196
Token his
Token ID465
Feature activation+9.70e-02

Pos logit contributions
 also+0.177
 hadn+0.175
 not+0.173
Looks+0.173
itates+0.172
Neg logit contributions
 oak-0.198
 pony-0.188
igsaw-0.187
 goat-0.186
affe-0.185
Token tools
Token ID4899
Feature activation+1.41e-01

Pos logit contributions
 oak+0.122
 pony+0.115
 pigeon+0.112
round+0.110
 goat+0.108
Neg logit contributions
 hadn-0.206
bled-0.194
itates-0.191
 also-0.190
vere-0.188
Token the
Token ID262
Feature activation+2.16e-01

Pos logit contributions
affe+0.207
igsaw+0.204
 oak+0.196
cone+0.188
 cane+0.187
Neg logit contributions
 also-0.195
 not-0.183
 known-0.178
 hadn-0.177
 never-0.175
Token park
Token ID3952
Feature activation+1.19e-01

Pos logit contributions
affe+0.339
igsaw+0.325
 oak+0.299
 cane+0.289
andel+0.288
Neg logit contributions
 also-0.413
itates-0.391
 not-0.367
 never-0.366
 refreshed-0.358
Token with
Token ID351
Feature activation-4.43e-02

Pos logit contributions
 oak+0.206
igsaw+0.193
 cane+0.191
 goat+0.191
 newspaper+0.190
Neg logit contributions
itates-0.226
 hadn-0.205
 also-0.201
vere-0.198
Looks-0.196
Token his
Token ID465
Feature activation+1.01e-01

Pos logit contributions
itates+0.080
 also+0.078
Looks+0.076
 refreshed+0.076
bled+0.075
Neg logit contributions
 oak-0.074
igsaw-0.072
 goat-0.069
 newspaper-0.068
 cane-0.067
Token mom
Token ID1995
Feature activation+1.35e-01

Pos logit contributions
igsaw+0.194
round+0.183
 oak+0.181
affe+0.179
andel+0.172
Neg logit contributions
 also-0.160
itates-0.150
 not-0.149
 hadn-0.145
 still-0.144
Token.
Token ID13
Feature activation+8.68e-02

Pos logit contributions
igsaw+0.259
 oak+0.248
 pigeon+0.226
 cane+0.226
affe+0.225
Neg logit contributions
 also-0.233
itates-0.213
 hadn-0.206
 still-0.199
'd-0.198
Token He
Token ID679
Feature activation-1.45e-01

Pos logit contributions
 oak+0.182
igsaw+0.171
 goat+0.169
 pony+0.167
 cane+0.165
Neg logit contributions
itates-0.136
 hadn-0.117
 also-0.115
Looks-0.111
bled-0.109
Token saw
Token ID2497
Feature activation-1.16e-01

Pos logit contributions
itates+0.273
vere+0.263
Looks+0.259
bled+0.256
 hadn+0.243
Neg logit contributions
 oak-0.253
 pine-0.239
 goat-0.233
 cane-0.233
 newspaper-0.229
Token a
Token ID257
Feature activation+1.13e-01

Pos logit contributions
itates+0.196
 also+0.191
Looks+0.185
bled+0.184
 hadn+0.183
Neg logit contributions
igsaw-0.213
 oak-0.203
 cane-0.193
 newspaper-0.185
 goat-0.184
Token big
Token ID1263
Feature activation+1.72e-01

Pos logit contributions
igsaw+0.183
 oak+0.173
ados+0.165
affe+0.155
 cane+0.154
Neg logit contributions
itates-0.211
 also-0.204
 hadn-0.194
vere-0.184
'd-0.182
Token monkey
Token ID21657
Feature activation+1.96e-01

Pos logit contributions
igsaw+0.249
growth+0.242
andel+0.214
con+0.208
affe+0.204
Neg logit contributions
 also-0.326
 sorry-0.297
 still-0.278
 not-0.270
itates-0.264
Token a
Token ID257
Feature activation+1.07e-01

Pos logit contributions
itates+0.260
 hadn+0.247
Looks+0.239
vere+0.239
bled+0.238
Neg logit contributions
 oak-0.166
 newspaper-0.151
 pine-0.150
 cane-0.149
 goat-0.147
Token time
Token ID640
Feature activation+1.08e-01

Pos logit contributions
 oak+0.187
 goat+0.167
 cane+0.165
 pine+0.163
 newspaper+0.163
Neg logit contributions
itates-0.203
 hadn-0.195
 also-0.184
Looks-0.184
vere-0.184
Token,
Token ID11
Feature activation+1.51e-01

Pos logit contributions
 oak+0.125
 goat+0.119
 newspaper+0.117
 cane+0.116
 pony+0.109
Neg logit contributions
itates-0.268
 hadn-0.259
vere-0.251
 also-0.247
 not-0.244
Token there
Token ID612
Feature activation+7.86e-03

Pos logit contributions
 oak+0.252
 pine+0.232
 cane+0.229
 newspaper+0.229
 goat+0.227
Neg logit contributions
itates-0.290
 hadn-0.276
Looks-0.270
vere-0.270
bled-0.261
Token was
Token ID373
Feature activation-2.31e-01

Pos logit contributions
 oak+0.018
 pine+0.017
 newspaper+0.017
 cane+0.017
 goat+0.017
Neg logit contributions
itates-0.010
 hadn-0.009
Looks-0.009
vere-0.009
bled-0.009
Token a
Token ID257
Feature activation+1.10e-01

Pos logit contributions
 also+0.347
itates+0.333
Looks+0.327
 refreshed+0.319
 hadn+0.319
Neg logit contributions
 oak-0.466
 cane-0.428
igsaw-0.428
 goat-0.414
 pony-0.409
Token little
Token ID1310
Feature activation+1.92e-01

Pos logit contributions
igsaw+0.168
 oak+0.160
 newspaper+0.145
ados+0.143
 newspapers+0.141
Neg logit contributions
itates-0.215
 also-0.215
 hadn-0.203
'd-0.192
 refreshed-0.192
Token boy
Token ID2933
Feature activation+8.17e-02

Pos logit contributions
igsaw+0.358
growth+0.344
con+0.320
 newspapers+0.303
oline+0.296
Neg logit contributions
 also-0.285
 sorry-0.266
itates-0.261
 refreshed-0.245
 still-0.240
Token named
Token ID3706
Feature activation+4.13e-02

Pos logit contributions
igsaw+0.183
 oak+0.171
 newspaper+0.166
 cane+0.164
 pony+0.159
Neg logit contributions
 also-0.115
itates-0.109
 never-0.109
 hadn-0.105
 not-0.100
Token Tim
Token ID5045
Feature activation+2.84e-02

Pos logit contributions
 oak+0.055
 cane+0.051
 newspaper+0.050
igsaw+0.050
 goat+0.049
Neg logit contributions
itates-0.095
 hadn-0.090
 also-0.087
bled-0.086
Looks-0.086
Tokenmy
Token ID1820
Feature activation-2.26e-02

Pos logit contributions
 oak+0.054
igsaw+0.053
 newspaper+0.050
 cane+0.050
 pony+0.050
Neg logit contributions
itates-0.048
 hadn-0.046
 also-0.045
'd-0.042
Looks-0.041
Token waiting
Token ID4953
Feature activation-1.33e-01

Pos logit contributions
itates+0.375
Looks+0.352
vere+0.328
 hadn+0.323
izes+0.303
Neg logit contributions
 oak-0.485
 pine-0.469
 pony-0.467
 cane-0.457
 goat-0.447
Token for
Token ID329
Feature activation-6.99e-02

Pos logit contributions
itates+0.308
Looks+0.288
 hadn+0.277
vere+0.271
bled+0.265
Neg logit contributions
 oak-0.179
 pine-0.166
 cane-0.164
 newspaper-0.163
 pony-0.159
Token him
Token ID683
Feature activation-2.92e-02

Pos logit contributions
 hadn+0.120
itates+0.117
bled+0.113
Looks+0.113
vere+0.111
Neg logit contributions
 oak-0.127
 cane-0.118
igsaw-0.116
 newspaper-0.115
 goat-0.114
Token.
Token ID13
Feature activation+8.25e-02

Pos logit contributions
itates+0.066
 hadn+0.065
vere+0.061
bled+0.061
Looks+0.061
Neg logit contributions
 oak-0.038
 newspaper-0.034
 goat-0.034
igsaw-0.034
 pine-0.034
Token
Token ID220
Feature activation-2.06e-01

Pos logit contributions
 oak+0.120
 newspaper+0.111
 pine+0.109
 cane+0.108
 goat+0.107
Neg logit contributions
itates-0.178
 hadn-0.166
bled-0.166
Looks-0.164
vere-0.164
Token
Token ID198
Feature activation-6.75e-02

Pos logit contributions
itates+0.405
 hadn+0.384
Looks+0.378
bled+0.365
vere+0.363
Neg logit contributions
 oak-0.334
 pine-0.307
 newspaper-0.303
 cane-0.302
igsaw-0.300
TokenJohn
Token ID7554
Feature activation+4.02e-02

Pos logit contributions
itates+0.142
 hadn+0.136
Looks+0.136
bled+0.133
vere+0.130
Neg logit contributions
 oak-0.093
 pine-0.088
 newspaper-0.088
 cane-0.083
igsaw-0.083
Token opened
Token ID4721
Feature activation-6.74e-02

Pos logit contributions
 oak+0.084
 pine+0.079
 goat+0.078
 cane+0.078
 newspaper+0.077
Neg logit contributions
itates-0.062
 hadn-0.056
vere-0.056
Looks-0.056
bled-0.054
Token the
Token ID262
Feature activation+2.04e-01

Pos logit contributions
 also+0.097
bled+0.093
 not+0.093
 certainly+0.092
 still+0.091
Neg logit contributions
 oak-0.133
igsaw-0.129
affe-0.126
 goat-0.126
 newspaper-0.124
Token lid
Token ID19789
Feature activation+1.44e-03

Pos logit contributions
 oak+0.289
igsaw+0.263
 goat+0.255
 pine+0.253
 cane+0.250
Neg logit contributions
itates-0.438
 hadn-0.411
 also-0.397
vere-0.396
bled-0.391
Token and
Token ID290
Feature activation+8.09e-02

Pos logit contributions
 oak+0.001
 goat+0.001
 newspaper+0.001
 pine+0.001
 cane+0.001
Neg logit contributions
itates-0.004
 hadn-0.004
bled-0.003
vere-0.003
Looks-0.003
Token draw
Token ID3197
Feature activation-1.55e-02

Pos logit contributions
itates+0.371
 hadn+0.347
Looks+0.330
vere+0.330
bled+0.326
Neg logit contributions
 oak-0.405
 newspaper-0.373
 pine-0.372
 goat-0.372
 cane-0.372
Token something
Token ID1223
Feature activation+1.14e-02

Pos logit contributions
itates+0.027
bled+0.025
vere+0.025
 also+0.024
 hadn+0.024
Neg logit contributions
 oak-0.029
 goat-0.026
igsaw-0.026
 cane-0.026
 pine-0.026
Token too
Token ID1165
Feature activation-1.11e-01

Pos logit contributions
 oak+0.019
 newspaper+0.018
 cane+0.018
 pine+0.018
 goat+0.017
Neg logit contributions
itates-0.022
 hadn-0.021
Looks-0.020
bled-0.020
vere-0.020
Token?"
Token ID1701
Feature activation+1.58e-02

Pos logit contributions
itates+0.194
 hadn+0.173
 also+0.171
vere+0.168
bled+0.163
Neg logit contributions
 oak-0.211
 goat-0.194
 cane-0.193
 pine-0.192
igsaw-0.191
Token Ben
Token ID3932
Feature activation+4.89e-02

Pos logit contributions
igsaw+0.030
 oak+0.028
 newspaper+0.027
 automobile+0.026
 magazine+0.026
Neg logit contributions
 also-0.027
itates-0.026
 hadn-0.025
 refreshed-0.025
 not-0.024
Token says
Token ID1139
Feature activation-1.72e-01

Pos logit contributions
 oak+0.111
 newspaper+0.103
 pine+0.102
 cane+0.102
 goat+0.102
Neg logit contributions
itates-0.067
 hadn-0.061
Looks-0.058
vere-0.057
bled-0.056
Token.
Token ID13
Feature activation+9.02e-02

Pos logit contributions
itates+0.429
 hadn+0.415
Looks+0.408
bled+0.395
vere+0.394
Neg logit contributions
 oak-0.190
 cane-0.169
 newspaper-0.166
 pine-0.166
 pony-0.163
Token
Token ID198
Feature activation-6.30e-02

Pos logit contributions
 oak+0.163
 newspaper+0.150
 cane+0.150
 pine+0.150
 goat+0.149
Neg logit contributions
itates-0.164
 hadn-0.155
Looks-0.149
vere-0.147
bled-0.147
Token
Token ID198
Feature activation-6.17e-02

Pos logit contributions
itates+0.123
 hadn+0.119
Looks+0.115
bled+0.113
vere+0.112
Neg logit contributions
 oak-0.104
 pine-0.097
 newspaper-0.094
 cane-0.094
 goat-0.093
Token"
Token ID1
Feature activation+5.11e-02

Pos logit contributions
itates+0.130
 hadn+0.120
Looks+0.118
bled+0.117
vere+0.116
Neg logit contributions
 oak-0.097
 newspaper-0.088
igsaw-0.088
 cane-0.087
 goat-0.086
TokenSure
Token ID19457
Feature activation-1.24e-01

Pos logit contributions
 oak+0.093
 newspaper+0.091
 cane+0.091
 goat+0.089
 pine+0.089
Neg logit contributions
 hadn-0.086
itates-0.084
bled-0.082
vere-0.078
eking-0.077
Token ball
Token ID2613
Feature activation+1.86e-01

Pos logit contributions
 oak+0.170
igsaw+0.147
 pine+0.144
 goat+0.139
 newspaper+0.136
Neg logit contributions
itates-0.556
 hadn-0.531
 also-0.526
Looks-0.521
vere-0.518
Token in
Token ID287
Feature activation+3.81e-02

Pos logit contributions
 oak+0.273
 newspaper+0.248
 goat+0.246
igsaw+0.245
 cane+0.245
Neg logit contributions
itates-0.399
 hadn-0.378
vere-0.363
Looks-0.363
bled-0.362
Token the
Token ID262
Feature activation+2.04e-01

Pos logit contributions
 goat+0.061
 oak+0.061
affe+0.059
 cane+0.058
 pony+0.058
Neg logit contributions
 also-0.071
itates-0.070
 hadn-0.067
Looks-0.067
 never-0.066
Token park
Token ID3952
Feature activation+1.13e-01

Pos logit contributions
 oak+0.265
igsaw+0.244
 goat+0.241
 cane+0.240
 pony+0.236
Neg logit contributions
itates-0.441
 also-0.420
Looks-0.410
 hadn-0.407
 certainly-0.407
Token.
Token ID13
Feature activation+8.18e-02

Pos logit contributions
 oak+0.137
 newspaper+0.120
 pine+0.119
 cane+0.118
 goat+0.118
Neg logit contributions
itates-0.276
 hadn-0.264
Looks-0.258
vere-0.256
bled-0.255
Token She
Token ID1375
Feature activation-1.44e-01

Pos logit contributions
 oak+0.156
 goat+0.146
 cane+0.145
 newspaper+0.144
 pine+0.144
Neg logit contributions
itates-0.142
 hadn-0.130
Looks-0.124
 also-0.120
vere-0.119
Token threw
Token ID9617
Feature activation-1.01e-01

Pos logit contributions
itates+0.251
vere+0.248
 Deal+0.238
Looks+0.234
bled+0.226
Neg logit contributions
 oak-0.271
 pine-0.257
 goat-0.256
 dog-0.252
 cane-0.250
Token the
Token ID262
Feature activation+2.03e-01

Pos logit contributions
 also+0.131
itates+0.125
 never+0.120
 not+0.115
Looks+0.112
Neg logit contributions
igsaw-0.226
 oak-0.214
 goat-0.204
 extingu-0.195
 cane-0.193
Token ball
Token ID2613
Feature activation+1.85e-01

Pos logit contributions
 oak+0.180
igsaw+0.159
 pine+0.156
 goat+0.151
 pony+0.149
Neg logit contributions
itates-0.536
 hadn-0.516
 also-0.504
bled-0.501
Looks-0.497
Token high
Token ID1029
Feature activation+1.20e-01

Pos logit contributions
 oak+0.315
igsaw+0.296
 goat+0.295
 newspaper+0.295
 cane+0.286
Neg logit contributions
itates-0.356
 hadn-0.328
bled-0.320
 also-0.317
vere-0.314
Token up
Token ID510
Feature activation-1.17e-02

Pos logit contributions
 oak+0.221
igsaw+0.217
 newspaper+0.210
 goat+0.210
 pony+0.204
Neg logit contributions
itates-0.210
bled-0.193
 also-0.190
 never-0.185
 hadn-0.180
Token Anna
Token ID11735
Feature activation+4.63e-02

Pos logit contributions
 newspaper+0.111
 cane+0.109
 oak+0.109
igsaw+0.107
 pine+0.105
Neg logit contributions
itates-0.163
 hadn-0.162
vere-0.161
bled-0.160
eking-0.159
Token was
Token ID373
Feature activation-2.39e-01

Pos logit contributions
 oak+0.095
 pine+0.091
 cane+0.090
 goat+0.089
 pony+0.089
Neg logit contributions
itates-0.072
vere-0.066
Looks-0.066
bled-0.063
 hadn-0.062
Token so
Token ID523
Feature activation-1.13e-01

Pos logit contributions
itates+0.442
vere+0.417
Looks+0.410
izes+0.396
 hadn+0.393
Neg logit contributions
 oak-0.399
 pony-0.392
 pine-0.386
 ant-0.365
 newspaper-0.364
Token happy
Token ID3772
Feature activation-8.99e-02

Pos logit contributions
itates+0.193
 hadn+0.185
Looks+0.184
vere+0.182
bled+0.180
Neg logit contributions
 oak-0.191
 pine-0.188
 pigeon-0.181
 pony-0.179
 newspaper-0.174
Token to
Token ID284
Feature activation+1.05e-01

Pos logit contributions
itates+0.198
 hadn+0.195
 also+0.185
vere+0.182
Looks+0.181
Neg logit contributions
 oak-0.123
igsaw-0.111
 cane-0.108
 goat-0.108
 newspaper-0.107
Token spend
Token ID4341
Feature activation-1.44e-01

Pos logit contributions
 oak+0.205
 pine+0.187
 goat+0.179
 pony+0.177
 cane+0.177
Neg logit contributions
itates-0.193
Looks-0.175
 hadn-0.172
vere-0.170
bled-0.169
Token the
Token ID262
Feature activation+2.05e-01

Pos logit contributions
itates+0.298
 hadn+0.286
Looks+0.276
bled+0.273
vere+0.271
Neg logit contributions
 oak-0.225
 goat-0.201
 newspaper-0.200
 cane-0.199
 pine-0.199
Token rainy
Token ID37259
Feature activation+1.24e-01

Pos logit contributions
 oak+0.305
 newspaper+0.274
 cane+0.273
 pine+0.272
 goat+0.271
Neg logit contributions
itates-0.442
 hadn-0.419
Looks-0.406
vere-0.403
bled-0.401
Token day
Token ID1110
Feature activation+1.02e-01

Pos logit contributions
 oak+0.224
 goat+0.206
 cane+0.199
 pigeon+0.198
 pine+0.197
Neg logit contributions
itates-0.212
 refreshed-0.209
 hadn-0.206
 also-0.195
Looks-0.195
Token outside
Token ID2354
Feature activation+1.19e-01

Pos logit contributions
 oak+0.187
 newspaper+0.172
 pine+0.171
 goat+0.171
 cane+0.171
Neg logit contributions
itates-0.182
 hadn-0.174
vere-0.166
Looks-0.165
bled-0.163
Token watching
Token ID4964
Feature activation-1.09e-01

Pos logit contributions
 oak+0.163
 newspaper+0.147
 goat+0.146
 cane+0.146
 pine+0.145
Neg logit contributions
itates-0.266
 hadn-0.256
Looks-0.245
vere-0.245
bled-0.243
Token was
Token ID373
Feature activation-2.41e-01

Pos logit contributions
 oak+0.081
 pony+0.078
 goat+0.077
 pine+0.077
 newspaper+0.076
Neg logit contributions
itates-0.064
vere-0.063
Looks-0.061
bled-0.059
 hadn-0.057
Token proud
Token ID6613
Feature activation-1.14e-01

Pos logit contributions
 hadn+0.363
Looks+0.353
itates+0.344
'd+0.332
vere+0.322
Neg logit contributions
 pony-0.484
 oak-0.472
 goat-0.455
 pigeon-0.454
 dog-0.453
Token and
Token ID290
Feature activation+8.29e-02

Pos logit contributions
itates+0.253
 hadn+0.239
Looks+0.234
bled+0.232
vere+0.231
Neg logit contributions
 oak-0.164
 newspaper-0.148
 pine-0.148
 cane-0.148
 goat-0.147
Token happy
Token ID3772
Feature activation-9.26e-02

Pos logit contributions
 dog+0.151
 oak+0.150
 goat+0.148
 newspaper+0.148
 puppy+0.147
Neg logit contributions
itates-0.135
eking-0.126
vere-0.125
Looks-0.124
bled-0.122
Token.
Token ID13
Feature activation+7.66e-02

Pos logit contributions
itates+0.216
 hadn+0.205
Looks+0.199
vere+0.198
bled+0.197
Neg logit contributions
 oak-0.123
 newspaper-0.109
 cane-0.108
 pine-0.108
 goat-0.108
Token She
Token ID1375
Feature activation-1.55e-01

Pos logit contributions
 oak+0.138
 newspaper+0.126
 cane+0.126
 pine+0.125
 goat+0.125
Neg logit contributions
itates-0.141
 hadn-0.133
Looks-0.128
vere-0.126
bled-0.125
Token sang
Token ID25889
Feature activation-1.81e-01

Pos logit contributions
itates+0.252
vere+0.249
bled+0.239
Looks+0.232
eking+0.229
Neg logit contributions
 oak-0.305
 goat-0.300
 pony-0.293
 pine-0.292
 dog-0.289
Token a
Token ID257
Feature activation+1.01e-01

Pos logit contributions
itates+0.364
 hadn+0.335
bled+0.333
vere+0.326
Looks+0.322
Neg logit contributions
 oak-0.307
 pine-0.275
 goat-0.268
igsaw-0.267
 newspaper-0.267
Token song
Token ID3496
Feature activation+9.67e-02

Pos logit contributions
 oak+0.159
 newspaper+0.146
 cane+0.145
 pine+0.145
 goat+0.143
Neg logit contributions
itates-0.208
 hadn-0.198
Looks-0.191
bled-0.190
vere-0.188
Token with
Token ID351
Feature activation-4.85e-02

Pos logit contributions
 oak+0.146
 newspaper+0.131
 cane+0.131
 pine+0.131
 goat+0.130
Neg logit contributions
itates-0.205
 hadn-0.194
Looks-0.188
vere-0.186
bled-0.186
Token them
Token ID606
Feature activation-4.96e-02

Pos logit contributions
 hadn+0.092
Looks+0.091
itates+0.090
 also+0.089
bled+0.088
Neg logit contributions
 oak-0.076
 cane-0.068
 pine-0.068
igsaw-0.067
 goat-0.066
Tokenold
Token ID727
Feature activation+1.02e-01

Pos logit contributions
 oak+0.357
 newspaper+0.324
 pine+0.323
 cane+0.321
 goat+0.320
Neg logit contributions
itates-0.445
 hadn-0.421
Looks-0.406
vere-0.403
bled-0.402
Token came
Token ID1625
Feature activation-1.58e-01

Pos logit contributions
 oak+0.163
igsaw+0.160
 cane+0.152
 newspaper+0.152
 pine+0.147
Neg logit contributions
itates-0.198
 hadn-0.195
Looks-0.182
'd-0.181
 also-0.179
Token along
Token ID1863
Feature activation+4.07e-02

Pos logit contributions
itates+0.329
 hadn+0.311
Looks+0.300
bled+0.298
vere+0.298
Neg logit contributions
 oak-0.247
 newspaper-0.224
 cane-0.223
 pine-0.222
 goat-0.221
Token.
Token ID13
Feature activation+8.62e-02

Pos logit contributions
 oak+0.063
igsaw+0.057
 pine+0.056
 newspaper+0.056
 pony+0.056
Neg logit contributions
itates-0.080
 hadn-0.080
 also-0.079
 never-0.074
 refreshed-0.073
Token She
Token ID1375
Feature activation-1.43e-01

Pos logit contributions
 oak+0.172
 pony+0.164
 goat+0.164
 cane+0.161
 pine+0.160
Neg logit contributions
itates-0.129
 hadn-0.124
 also-0.112
Looks-0.112
 certainly-0.110
Token stared
Token ID18484
Feature activation-2.06e-01

Pos logit contributions
itates+0.270
vere+0.267
Looks+0.247
bled+0.246
 Deal+0.236
Neg logit contributions
 oak-0.259
 pine-0.245
 goat-0.241
 cane-0.240
 newspaper-0.237
Token at
Token ID379
Feature activation+5.80e-02

Pos logit contributions
itates+0.373
bled+0.352
 hadn+0.339
 refreshed+0.330
 also+0.329
Neg logit contributions
 oak-0.371
igsaw-0.345
 newspaper-0.343
 goat-0.337
 pine-0.334
Token the
Token ID262
Feature activation+2.10e-01

Pos logit contributions
igsaw+0.090
 oak+0.087
 goat+0.083
affe+0.082
 newspaper+0.082
Neg logit contributions
 also-0.110
 hadn-0.105
bled-0.102
itates-0.102
 never-0.100
Token hook
Token ID8011
Feature activation+1.05e-01

Pos logit contributions
 oak+0.274
igsaw+0.242
 newspaper+0.237
 cane+0.236
 pine+0.235
Neg logit contributions
itates-0.485
 hadn-0.464
Looks-0.448
bled-0.446
vere-0.445
Token in
Token ID287
Feature activation+4.44e-02

Pos logit contributions
 oak+0.111
 newspaper+0.095
 cane+0.095
 pine+0.095
 goat+0.094
Neg logit contributions
itates-0.271
 hadn-0.259
Looks-0.252
vere-0.251
bled-0.250
Token amaz
Token ID40642
Feature activation-2.01e-02

Pos logit contributions
 oak+0.079
 goat+0.073
 pony+0.072
igsaw+0.071
 cane+0.071
Neg logit contributions
itates-0.077
 hadn-0.076
Looks-0.075
 also-0.072
vere-0.070
Token how
Token ID703
Feature activation+6.31e-03

Pos logit contributions
itates+0.231
 hadn+0.229
bled+0.221
 also+0.221
Looks+0.219
Neg logit contributions
 oak-0.186
igsaw-0.173
 cane-0.172
 goat-0.170
 newspaper-0.170
Token sad
Token ID6507
Feature activation-6.48e-02

Pos logit contributions
 oak+0.011
 puppy+0.010
 olive+0.010
igsaw+0.010
 dog+0.010
Neg logit contributions
itates-0.011
 Deal-0.011
vere-0.011
Looks-0.010
 suppose-0.010
Token Lily
Token ID20037
Feature activation+3.27e-02

Pos logit contributions
 hadn+0.096
 also+0.090
 not+0.086
 never+0.083
 actually+0.082
Neg logit contributions
igsaw-0.132
 oak-0.127
 newspaper-0.123
 cane-0.121
 bull-0.118
Token was
Token ID373
Feature activation-2.33e-01

Pos logit contributions
 oak+0.071
 cane+0.067
 newspaper+0.066
 goat+0.066
 pony+0.066
Neg logit contributions
itates-0.049
vere-0.045
Looks-0.044
bled-0.043
izes-0.040
Token and
Token ID290
Feature activation+8.77e-02

Pos logit contributions
itates+0.680
Looks+0.646
bled+0.635
 hadn+0.634
vere+0.631
Neg logit contributions
 oak-0.181
 newspaper-0.145
 goat-0.142
 cane-0.141
 pine-0.134
Token decided
Token ID3066
Feature activation-2.01e-01

Pos logit contributions
 oak+0.177
 goat+0.161
 pine+0.160
 pony+0.159
 puppy+0.157
Neg logit contributions
itates-0.158
vere-0.156
 Deal-0.142
Looks-0.135
bled-0.131
Token to
Token ID284
Feature activation+1.10e-01

Pos logit contributions
itates+0.565
Looks+0.534
bled+0.527
vere+0.524
 hadn+0.516
Neg logit contributions
 oak-0.181
 cane-0.153
 pony-0.152
 newspaper-0.150
 goat-0.150
Token give
Token ID1577
Feature activation-8.82e-02

Pos logit contributions
 oak+0.195
 cane+0.170
 pine+0.169
 goat+0.167
 pony+0.165
Neg logit contributions
itates-0.226
Looks-0.206
bled-0.200
 hadn-0.196
vere-0.196
Token her
Token ID607
Feature activation+9.94e-02

Pos logit contributions
itates+0.183
 hadn+0.172
Looks+0.167
vere+0.166
bled+0.165
Neg logit contributions
 oak-0.139
 newspaper-0.126
 cane-0.126
 pine-0.126
 goat-0.125
Token his
Token ID465
Feature activation+9.64e-02

Pos logit contributions
 oak+0.185
 bull+0.181
 goat+0.179
igsaw+0.171
affe+0.169
Neg logit contributions
 hadn-0.154
 also-0.151
itates-0.150
 not-0.143
 refreshed-0.141
Token favorite
Token ID4004
Feature activation+6.54e-02

Pos logit contributions
 oak+0.125
 goat+0.114
 bull+0.111
 newspaper+0.110
 pine+0.110
Neg logit contributions
itates-0.215
 hadn-0.212
vere-0.199
 also-0.197
bled-0.196
TokenHey
Token ID10814
Feature activation-5.15e-02

Pos logit contributions
itates+0.114
 hadn+0.106
vere+0.104
bled+0.103
izes+0.096
Neg logit contributions
 oak-0.137
 goat-0.129
 pine-0.127
 pony-0.124
 cane-0.124
Token D
Token ID360
Feature activation+7.77e-02

Pos logit contributions
itates+0.116
 hadn+0.115
bled+0.109
Looks+0.108
 ok+0.108
Neg logit contributions
 newspaper-0.063
 oak-0.062
igsaw-0.061
 cane-0.059
 automobile-0.056
Tokenucky
Token ID5309
Feature activation+9.27e-02

Pos logit contributions
 oak+0.125
 newspaper+0.118
 cane+0.116
 pine+0.114
 pony+0.114
Neg logit contributions
itates-0.155
 hadn-0.145
Looks-0.141
bled-0.141
vere-0.139
Token,
Token ID11
Feature activation+1.55e-01

Pos logit contributions
igsaw+0.121
 newspaper+0.114
 oak+0.113
 cane+0.108
 automobile+0.106
Neg logit contributions
 hadn-0.195
itates-0.190
 also-0.188
 ok-0.187
'd-0.186
Token why
Token ID1521
Feature activation-6.91e-02

Pos logit contributions
 oak+0.274
 pine+0.249
 goat+0.249
 newspaper+0.244
 cane+0.244
Neg logit contributions
itates-0.296
 hadn-0.274
Looks-0.267
vere-0.265
bled-0.265
Token don
Token ID836
Feature activation-2.35e-01

Pos logit contributions
itates+0.103
bled+0.093
Looks+0.091
 hadn+0.090
vere+0.088
Neg logit contributions
 oak-0.149
 pine-0.139
 cane-0.138
 newspaper-0.138
 goat-0.138
Token't
Token ID470
Feature activation-1.98e-01

Pos logit contributions
itates+0.286
 hadn+0.255
Looks+0.246
bled+0.236
vere+0.227
Neg logit contributions
 oak-0.562
 goat-0.528
 cane-0.527
 pony-0.519
 newspaper-0.519
Token you
Token ID345
Feature activation-1.41e-01

Pos logit contributions
itates+0.206
 hadn+0.194
vere+0.188
 also+0.180
Looks+0.178
Neg logit contributions
 oak-0.487
 newspaper-0.479
 cane-0.471
igsaw-0.470
 pine-0.465
Token come
Token ID1282
Feature activation-1.88e-01

Pos logit contributions
itates+0.263
Looks+0.247
 hadn+0.239
bled+0.238
vere+0.230
Neg logit contributions
 oak-0.257
 goat-0.238
 cane-0.235
 pine-0.234
 pony-0.229
Token swim
Token ID9422
Feature activation-2.43e-02

Pos logit contributions
itates+0.401
 hadn+0.375
bled+0.360
vere+0.360
Looks+0.359
Neg logit contributions
 oak-0.283
 newspaper-0.261
 cane-0.257
igsaw-0.255
 pine-0.253
Token with
Token ID351
Feature activation-4.13e-02

Pos logit contributions
itates+0.047
 hadn+0.043
vere+0.042
bled+0.042
Looks+0.041
Neg logit contributions
 oak-0.042
 newspaper-0.038
 pine-0.038
 cane-0.038
 goat-0.037
Token was
Token ID373
Feature activation-2.30e-01

Pos logit contributions
bled+0.329
itates+0.317
vere+0.304
bling+0.296
 refreshed+0.290
Neg logit contributions
 oak-0.436
 pine-0.427
 goat-0.421
 dog-0.417
 cane-0.416
Token just
Token ID655
Feature activation-1.56e-01

Pos logit contributions
itates+0.339
 hadn+0.303
bled+0.300
Looks+0.300
vere+0.288
Neg logit contributions
 oak-0.499
 goat-0.476
 pine-0.474
 cane-0.471
 pony-0.470
Token hungry
Token ID14720
Feature activation-1.27e-02

Pos logit contributions
itates+0.271
 hadn+0.243
Looks+0.242
vere+0.238
bled+0.238
Neg logit contributions
 oak-0.304
 pine-0.283
 goat-0.280
 cane-0.278
 newspaper-0.278
Token.
Token ID13
Feature activation+9.34e-02

Pos logit contributions
itates+0.027
 hadn+0.026
vere+0.025
 also+0.025
 refreshed+0.024
Neg logit contributions
 oak-0.018
igsaw-0.016
 goat-0.016
 cane-0.016
 newspaper-0.016
Token Please
Token ID4222
Feature activation-1.81e-01

Pos logit contributions
 oak+0.199
 newspaper+0.185
 pine+0.185
 cane+0.185
 goat+0.185
Neg logit contributions
itates-0.140
 hadn-0.129
Looks-0.124
bled-0.123
vere-0.122
Token don
Token ID836
Feature activation-2.35e-01

Pos logit contributions
itates+0.296
 hadn+0.277
Looks+0.264
vere+0.262
bled+0.261
Neg logit contributions
 oak-0.361
 newspaper-0.335
 cane-0.333
 pine-0.333
 goat-0.332
Token't
Token ID470
Feature activation-2.01e-01

Pos logit contributions
itates+0.258
 hadn+0.231
Looks+0.220
bled+0.210
vere+0.204
Neg logit contributions
 oak-0.594
 cane-0.559
 newspaper-0.557
 goat-0.555
 pine-0.553
Token bite
Token ID13197
Feature activation+7.77e-02

Pos logit contributions
itates+0.308
 hadn+0.294
vere+0.277
Looks+0.271
 also+0.269
Neg logit contributions
 oak-0.404
 newspaper-0.386
igsaw-0.386
 cane-0.379
 pine-0.376
Token me
Token ID502
Feature activation+6.45e-03

Pos logit contributions
igsaw+0.175
 oak+0.170
 cane+0.159
 automobile+0.158
 pony+0.158
Neg logit contributions
itates-0.090
 also-0.087
 hadn-0.087
vere-0.085
 ok-0.082
Token!"
Token ID2474
Feature activation+1.79e-02

Pos logit contributions
igsaw+0.011
 oak+0.011
 goat+0.010
 newspaper+0.010
 cane+0.010
Neg logit contributions
itates-0.011
vere-0.010
 also-0.010
 hadn-0.010
izes-0.010
Token he
Token ID339
Feature activation-1.08e-01

Pos logit contributions
 oak+0.031
 newspaper+0.029
igsaw+0.029
 cane+0.029
 pine+0.028
Neg logit contributions
itates-0.033
 hadn-0.032
Looks-0.030
vere-0.030
 also-0.029
Token anything
Token ID1997
Feature activation+3.15e-02

Pos logit contributions
 oak+0.119
 cane+0.108
 newspaper+0.107
 pine+0.106
 goat+0.103
Neg logit contributions
itates-0.109
Looks-0.102
bled-0.098
vere-0.097
 hadn-0.093
Token to
Token ID284
Feature activation+1.12e-01

Pos logit contributions
 oak+0.044
 cane+0.039
 pine+0.039
 newspaper+0.039
 goat+0.038
Neg logit contributions
itates-0.071
Looks-0.067
bled-0.065
 hadn-0.065
vere-0.064
Token eat
Token ID4483
Feature activation-3.84e-02

Pos logit contributions
 oak+0.211
 newspaper+0.201
 goat+0.199
 cane+0.198
igsaw+0.198
Neg logit contributions
itates-0.183
 hadn-0.175
vere-0.166
 also-0.164
Looks-0.163
Token.
Token ID13
Feature activation+8.66e-02

Pos logit contributions
itates+0.081
 hadn+0.079
vere+0.075
bled+0.075
Looks+0.074
Neg logit contributions
 oak-0.056
 goat-0.050
igsaw-0.050
 newspaper-0.050
 cane-0.050
Token It
Token ID632
Feature activation-1.38e-01

Pos logit contributions
 oak+0.147
 goat+0.136
 cane+0.136
 pine+0.135
 newspaper+0.134
Neg logit contributions
itates-0.165
 hadn-0.158
Looks-0.150
vere-0.149
 also-0.145
Token felt
Token ID2936
Feature activation-2.30e-01

Pos logit contributions
vere+0.245
bled+0.242
itates+0.229
bling+0.229
 Deal+0.221
Neg logit contributions
 oak-0.267
 pine-0.255
 animal-0.248
 cat-0.246
 cane-0.242
Token very
Token ID845
Feature activation-9.30e-02

Pos logit contributions
itates+0.412
Looks+0.380
vere+0.368
bled+0.362
 hadn+0.353
Neg logit contributions
 oak-0.441
 cane-0.402
 pine-0.402
 goat-0.393
 olive-0.392
Token sad
Token ID6507
Feature activation-6.34e-02

Pos logit contributions
itates+0.186
Looks+0.185
'd+0.171
bled+0.170
 hadn+0.170
Neg logit contributions
 oak-0.155
 ant-0.137
 olive-0.136
 pine-0.136
 cane-0.130
Token and
Token ID290
Feature activation+9.03e-02

Pos logit contributions
itates+0.163
 hadn+0.156
Looks+0.151
vere+0.151
bled+0.150
Neg logit contributions
 oak-0.067
 newspaper-0.058
 cane-0.057
 pine-0.057
 goat-0.057
Token lonely
Token ID21757
Feature activation-4.46e-03

Pos logit contributions
 oak+0.169
 dog+0.146
 cane+0.144
 pine+0.143
 goat+0.139
Neg logit contributions
vere-0.170
itates-0.162
bled-0.157
 Deal-0.154
Looks-0.153
Token.
Token ID13
Feature activation+8.63e-02

Pos logit contributions
itates+0.010
 hadn+0.010
vere+0.009
bled+0.009
Looks+0.009
Neg logit contributions
 oak-0.006
igsaw-0.006
 newspaper-0.006
 goat-0.006
 pine-0.006