TOP ACTIVATIONS
MAX = 60.302

bad . It hurt me ." Anna said .
hats ! You ruined them !" The dog heard
. It has oxygen inside ." <|endoftext|> Tom and Lily liked
away . Go away now ." Jen and Tom
learned that courage is important . They learned that soccer is
. Let 's play again ." They take their boats and
spaceship !" Ben said . " We can fly to the
a spaceship !" Ben said . " We can fly to
Can we go outside now ?" Anna and Ben asked .
and books ?" " Yes , mom ," Anna
You 're welcome , duck !" Mia says .
stops near their house . " Look , Anna
, I will help you ." said Sam . He took
ball . They are clever . They know how to share
but they were more careful . <|endoftext|> Anna and Ben were
with mom 's necklace again . They still played in the
We pulled the knot ." M ama sigh s
your toys and books ?" " Yes , mom
The fireworks will start soon ." L ily and
. Go away now ." Jen and Tom were

MIN ACTIVATIONS
MIN = -162.982

his head out of his bur row and saw something new
reward . It was ch ocol ates and sweets !
children put on their bath ro bes and ran around in
I â m sorry Jane . I
Everyday he would take the av oc ados and scatter them
't know the rh in oc eros was wild . Can
That â s a special necklace ,
It â s almost just as beautiful
It â s a magic book !
her mom my watched in amaz ement as the machine flew
That â s right , Emma .
That â s the gas from the
he would take the av oc ados and scatter them around
Let â s go find something else
Let â s not think about the
Let â s explore and have some
Let â s play a joke .
Let â s have a stretching race
Please don â t eat me , â
Who â s there ? â

INTERVAL 4.481 - 60.302
CONTAINS 48.806%

like to organize them . They left them all over the
then , Tim 's mom came into the room . She
But you have to be careful on the slide . It
a lot of fun . They agreed that the funny man
Lu cy : " But why , mum ? I want

INTERVAL -51.340 - 4.481
CONTAINS 44.437%

One day , a big dinosaur was in the park .
upon a time , there was a little girl named Lucy
all the bugs and butterflies I want ." She
pretty and decided to stay a while , but then she
people on the other side . Fl uffy

INTERVAL -107.161 - -51.340
CONTAINS 6.746%

pet shop . She asked her mom to buy it for
asks . The man smiles . He nods .
boy , why don 't you come and visit us ?"
Tim my smiled and went home , dreaming of his next
dreamed of being one . One day , her mom my

INTERVAL -162.982 - -107.161
CONTAINS 0.010%

Let â s have a stretching race
her mom my watched in amaz ement as the machine flew
give to the rh in oc eros . Lily walked to
't know the rh in oc eros was wild . Can
Who â s there ? â
Token bad
Token ID2089
Feature activation+4.72e+01

Pos logit contributions
 meant+0.242
Don+0.238
 said+0.220
 needed+0.212
 whispered+0.208
Neg logit contributions
igs-0.217
ig-0.212
ric-0.202
ap-0.188
 Majesty-0.187
Token.
Token ID13
Feature activation+5.37e+01

Pos logit contributions
Don+0.231
 meant+0.215
books+0.191
 said+0.190
 whispered+0.189
Neg logit contributions
igs-0.235
ig-0.224
 our-0.220
ric-0.213
ap-0.210
Token It
Token ID632
Feature activation+5.04e+01

Pos logit contributions
Don+0.218
 meant+0.208
 She+0.184
 said+0.182
 needed+0.175
Neg logit contributions
igs-0.332
ig-0.321
ric-0.310
 victory-0.285
 Majesty-0.283
Token hurt
Token ID5938
Feature activation+4.79e+01

Pos logit contributions
Don+0.177
books+0.166
 meant+0.158
jar+0.150
Today+0.149
Neg logit contributions
 our-0.289
igs-0.284
ric-0.284
ig-0.281
 arrest-0.270
Token me
Token ID502
Feature activation+4.79e+01

Pos logit contributions
Don+0.316
 meant+0.311
 said+0.288
 needed+0.284
 whispered+0.281
Neg logit contributions
igs-0.154
ig-0.145
ric-0.142
 our-0.135
ap-0.125
Token."
Token ID526
Feature activation+5.84e+01

Pos logit contributions
Don+0.222
 meant+0.211
 said+0.188
 needed+0.184
 whispered+0.179
Neg logit contributions
igs-0.253
ig-0.247
ric-0.240
 our-0.233
ap-0.227
Token Anna
Token ID11735
Feature activation+5.06e+01

Pos logit contributions
 meant+0.288
Don+0.283
books+0.259
 needed+0.239
 whispered+0.238
Neg logit contributions
 our-0.275
 my-0.252
 your-0.249
igs-0.249
 victory-0.246
Token said
Token ID531
Feature activation+5.22e+01

Pos logit contributions
books+0.182
Today+0.169
jar+0.167
Don+0.149
Wow+0.127
Neg logit contributions
 our-0.295
ig-0.291
ap-0.286
 my-0.283
e-0.278
Token.
Token ID13
Feature activation+5.84e+01

Pos logit contributions
Don+0.169
 meant+0.165
 said+0.145
 needed+0.136
 whispered+0.132
Neg logit contributions
igs-0.347
ric-0.335
ig-0.335
 our-0.317
 Majesty-0.311
Token
Token ID198
Feature activation+5.93e+01

Pos logit contributions
 meant+0.265
books+0.238
 needed+0.238
 cost+0.235
jar+0.234
Neg logit contributions
 your-0.258
 our-0.257
ap-0.252
 my-0.250
igs-0.227
Token
Token ID198
Feature activation+5.91e+01

Pos logit contributions
 meant+0.203
Don+0.192
 needed+0.168
 whispered+0.153
books+0.152
Neg logit contributions
igs-0.386
ig-0.366
 our-0.364
ric-0.363
ap-0.346
Token hats
Token ID23910
Feature activation+4.21e+01

Pos logit contributions
 meant+0.229
Don+0.226
 said+0.220
 whispered+0.208
 needed+0.205
Neg logit contributions
igs-0.199
ric-0.195
ig-0.189
ap-0.176
chie-0.172
Token!
Token ID0
Feature activation+5.36e+01

Pos logit contributions
 meant+0.226
Don+0.220
 said+0.215
 needed+0.202
 whispered+0.197
Neg logit contributions
igs-0.193
ric-0.180
ig-0.177
 Majesty-0.162
ocol-0.154
Token You
Token ID921
Feature activation+5.17e+01

Pos logit contributions
Don+0.230
 meant+0.226
 said+0.201
 needed+0.194
 whispered+0.194
Neg logit contributions
igs-0.291
ric-0.281
ig-0.280
 our-0.273
ap-0.263
Token ruined
Token ID20484
Feature activation+4.72e+01

Pos logit contributions
 meant+0.228
Don+0.226
 said+0.207
 needed+0.200
 whispered+0.192
Neg logit contributions
igs-0.286
ig-0.274
ric-0.271
 Majesty-0.250
 our-0.249
Token them
Token ID606
Feature activation+4.75e+01

Pos logit contributions
 said+0.274
 meant+0.262
Don+0.245
 whispered+0.236
 sounded+0.231
Neg logit contributions
ig-0.182
ode-0.175
igs-0.167
ench-0.165
earned-0.161
Token!"
Token ID2474
Feature activation+5.80e+01

Pos logit contributions
Don+0.318
 meant+0.309
 said+0.285
 needed+0.283
 whispered+0.278
Neg logit contributions
igs-0.152
ig-0.143
ric-0.137
 our-0.131
ap-0.124
Token
Token ID198
Feature activation+5.80e+01

Pos logit contributions
chy+0.315
 meant+0.312
Don+0.312
Today+0.306
books+0.300
Neg logit contributions
 our-0.246
 your-0.213
 my-0.203
 arrest-0.202
 victory-0.190
Token
Token ID198
Feature activation+5.79e+01

Pos logit contributions
 meant+0.160
Don+0.159
 needed+0.126
 said+0.126
 whispered+0.119
Neg logit contributions
igs-0.406
ig-0.391
ric-0.388
 our-0.380
ap-0.369
TokenThe
Token ID464
Feature activation+5.35e+01

Pos logit contributions
Don+0.225
 meant+0.213
 said+0.192
 needed+0.181
 whispered+0.178
Neg logit contributions
igs-0.355
ig-0.345
ric-0.341
 our-0.319
ap-0.316
Token dog
Token ID3290
Feature activation+5.05e+01

Pos logit contributions
Don+0.274
 meant+0.273
books+0.252
 needed+0.244
 whispered+0.241
Neg logit contributions
 our-0.222
igs-0.220
 victory-0.214
ig-0.211
 arrest-0.208
Token heard
Token ID2982
Feature activation+5.20e+01

Pos logit contributions
Today+0.237
jar+0.236
books+0.220
 meant+0.212
Don+0.212
Neg logit contributions
 dive-0.215
ig-0.215
 escape-0.213
 our-0.211
ap-0.208
Token.
Token ID13
Feature activation+5.28e+01

Pos logit contributions
 meant+0.147
 said+0.131
 She+0.122
Don+0.120
 needed+0.112
Neg logit contributions
 Majesty-0.315
 arrest-0.315
igs-0.309
ig-0.308
ocol-0.308
Token It
Token ID632
Feature activation+5.01e+01

Pos logit contributions
Don+0.192
 meant+0.184
 She+0.165
 said+0.157
 needed+0.148
Neg logit contributions
igs-0.342
ig-0.330
ric-0.309
chie-0.299
immers-0.297
Token has
Token ID468
Feature activation+4.92e+01

Pos logit contributions
Don+0.219
 meant+0.217
 said+0.195
 needed+0.189
 whispered+0.183
Neg logit contributions
igs-0.276
ig-0.265
ric-0.261
 our-0.245
ap-0.243
Token oxygen
Token ID11863
Feature activation+4.45e+01

Pos logit contributions
 meant+0.301
Don+0.299
 said+0.281
 needed+0.272
 whispered+0.268
Neg logit contributions
igs-0.183
ig-0.177
ric-0.170
ap-0.152
 Majesty-0.150
Token inside
Token ID2641
Feature activation+4.71e+01

Pos logit contributions
Don+0.206
 meant+0.200
 said+0.180
 needed+0.175
 whispered+0.172
Neg logit contributions
igs-0.232
ig-0.223
ric-0.220
 our-0.209
ap-0.203
Token."
Token ID526
Feature activation+5.79e+01

Pos logit contributions
Don+0.255
 meant+0.241
 said+0.219
 needed+0.214
 whispered+0.213
Neg logit contributions
igs-0.214
ig-0.203
ric-0.201
 our-0.200
ap-0.183
Token<|endoftext|>
Token ID50256
Feature activation+5.53e+01

Pos logit contributions
 meant+0.214
Don+0.208
 needed+0.182
books+0.180
chy+0.176
Neg logit contributions
 our-0.344
igs-0.340
ap-0.316
ric-0.316
ig-0.316
TokenTom
Token ID13787
Feature activation+5.00e+01

Pos logit contributions
Don+0.188
 meant+0.182
 whispered+0.154
 said+0.153
 needed+0.153
Neg logit contributions
igs-0.345
ig-0.339
ric-0.332
 our-0.321
ap-0.320
Token and
Token ID290
Feature activation+5.35e+01

Pos logit contributions
Don+0.252
jar+0.225
 meant+0.219
books+0.204
Grand+0.200
Neg logit contributions
ig-0.236
ric-0.231
igs-0.230
 our-0.228
ap-0.216
Token Lily
Token ID20037
Feature activation+5.14e+01

Pos logit contributions
 said+0.217
Don+0.210
 meant+0.210
 wanted+0.210
 She+0.198
Neg logit contributions
igs-0.377
ig-0.366
ric-0.355
immers-0.331
ocol-0.329
Token liked
Token ID8288
Feature activation+5.16e+01

Pos logit contributions
Don+0.202
jar+0.183
 meant+0.181
books+0.170
Grand+0.160
Neg logit contributions
ig-0.283
 our-0.280
igs-0.279
ap-0.277
ric-0.267
Token away
Token ID1497
Feature activation+4.44e+01

Pos logit contributions
Don+0.270
 meant+0.263
 said+0.250
 needed+0.243
 whispered+0.234
Neg logit contributions
igs-0.211
ig-0.193
ric-0.193
 Majesty-0.177
ocol-0.170
Token.
Token ID13
Feature activation+5.31e+01

Pos logit contributions
 meant+0.225
 said+0.217
Don+0.211
 needed+0.194
 whispered+0.192
Neg logit contributions
igs-0.217
ric-0.215
ig-0.211
pieces-0.205
latable-0.193
Token Go
Token ID1514
Feature activation+4.89e+01

Pos logit contributions
Don+0.199
 meant+0.188
 She+0.169
 said+0.164
 needed+0.158
Neg logit contributions
igs-0.367
ig-0.355
ric-0.328
 Majesty-0.310
pieces-0.307
Token away
Token ID1497
Feature activation+4.58e+01

Pos logit contributions
Don+0.264
 meant+0.254
 said+0.247
 needed+0.238
 whispered+0.226
Neg logit contributions
igs-0.240
ric-0.216
ig-0.215
 Majesty-0.204
ocol-0.200
Token now
Token ID783
Feature activation+4.86e+01

Pos logit contributions
 meant+0.218
 said+0.213
Don+0.197
 needed+0.185
 whispered+0.185
Neg logit contributions
ric-0.241
igs-0.236
pieces-0.235
ig-0.231
latable-0.220
Token."
Token ID526
Feature activation+5.78e+01

Pos logit contributions
Don+0.326
 meant+0.310
books+0.284
 wondered+0.283
 needed+0.282
Neg logit contributions
igs-0.153
 our-0.147
ig-0.147
ap-0.136
 arrest-0.133
Token
Token ID198
Feature activation+5.74e+01

Pos logit contributions
 meant+0.214
Don+0.203
books+0.194
jar+0.180
chy+0.179
Neg logit contributions
 our-0.330
igs-0.326
ric-0.323
ap-0.305
 arrest-0.304
Token
Token ID198
Feature activation+5.72e+01

Pos logit contributions
 meant+0.203
Don+0.188
 needed+0.169
books+0.162
 whispered+0.160
Neg logit contributions
igs-0.358
ric-0.349
 our-0.339
ig-0.336
 arrest-0.318
TokenJen
Token ID44875
Feature activation+4.93e+01

Pos logit contributions
 meant+0.261
Don+0.238
 needed+0.227
books+0.221
 whispered+0.220
Neg logit contributions
igs-0.299
ric-0.291
 our-0.281
ig-0.279
 arrest-0.271
Token and
Token ID290
Feature activation+5.48e+01

Pos logit contributions
Don+0.222
books+0.219
 meant+0.216
jar+0.213
Today+0.211
Neg logit contributions
ig-0.227
ric-0.212
igs-0.203
ap-0.202
 our-0.200
Token Tom
Token ID4186
Feature activation+5.14e+01

Pos logit contributions
Don+0.204
 meant+0.200
 needed+0.169
 said+0.169
 whispered+0.161
Neg logit contributions
igs-0.334
ric-0.324
ig-0.324
 our-0.313
ap-0.304
Token learned
Token ID4499
Feature activation+5.18e+01

Pos logit contributions
Don+0.236
books+0.227
jar+0.226
Grand+0.213
book+0.203
Neg logit contributions
 our-0.245
 victory-0.241
ric-0.237
ig-0.230
igs-0.228
Token that
Token ID326
Feature activation+5.22e+01

Pos logit contributions
 meant+0.297
Don+0.289
 said+0.281
 needed+0.259
 whispered+0.256
Neg logit contributions
igs-0.231
ig-0.231
ric-0.216
ocol-0.204
 Majesty-0.203
Token courage
Token ID11917
Feature activation+5.01e+01

Pos logit contributions
Don+0.310
 meant+0.298
 said+0.276
 whispered+0.272
jar+0.271
Neg logit contributions
igs-0.196
 our-0.186
ig-0.184
ric-0.183
 victory-0.176
Token is
Token ID318
Feature activation+4.95e+01

Pos logit contributions
Don+0.238
 meant+0.220
 said+0.199
 whispered+0.196
jar+0.196
Neg logit contributions
igs-0.250
ig-0.244
ric-0.242
 our-0.237
ap-0.228
Token important
Token ID1593
Feature activation+5.04e+01

Pos logit contributions
Don+0.264
 meant+0.257
 said+0.235
 needed+0.230
 whispered+0.226
Neg logit contributions
igs-0.223
ig-0.211
ric-0.209
 our-0.199
ap-0.191
Token.
Token ID13
Feature activation+5.78e+01

Pos logit contributions
Don+0.182
 meant+0.180
 said+0.157
 needed+0.151
 whispered+0.145
Neg logit contributions
igs-0.315
ig-0.305
ric-0.303
 our-0.285
ap-0.280
Token They
Token ID1119
Feature activation+5.63e+01

Pos logit contributions
Don+0.181
 meant+0.174
 said+0.149
 whispered+0.148
 needed+0.145
Neg logit contributions
ric-0.372
igs-0.368
 our-0.367
ig-0.355
ap-0.350
Token learned
Token ID4499
Feature activation+5.51e+01

Pos logit contributions
Don+0.250
books+0.240
jar+0.237
Grand+0.225
 meant+0.214
Neg logit contributions
 our-0.262
 victory-0.258
ric-0.251
ig-0.247
igs-0.243
Token that
Token ID326
Feature activation+5.55e+01

Pos logit contributions
 meant+0.322
Don+0.315
 said+0.304
 needed+0.283
 whispered+0.280
Neg logit contributions
igs-0.238
ig-0.237
ric-0.222
 Majesty-0.206
ocol-0.205
Token soccer
Token ID11783
Feature activation+5.08e+01

Pos logit contributions
Don+0.329
 meant+0.313
 said+0.291
jar+0.290
books+0.289
Neg logit contributions
igs-0.205
 our-0.201
ig-0.193
ric-0.192
 victory-0.191
Token is
Token ID318
Feature activation+5.32e+01

Pos logit contributions
Don+0.251
 meant+0.231
jar+0.214
Grand+0.213
 whispered+0.208
Neg logit contributions
igs-0.244
ig-0.240
 our-0.234
ric-0.230
ap-0.226
Token.
Token ID13
Feature activation+5.35e+01

Pos logit contributions
 meant+0.146
Don+0.142
 said+0.135
 needed+0.120
 whispered+0.118
Neg logit contributions
igs-0.314
ric-0.313
ig-0.302
chie-0.291
latable-0.287
Token Let
Token ID3914
Feature activation+5.35e+01

Pos logit contributions
Don+0.196
 meant+0.181
 said+0.165
 She+0.160
Wow+0.151
Neg logit contributions
igs-0.365
ig-0.338
ric-0.326
chie-0.309
ocol-0.309
Token's
Token ID338
Feature activation+4.83e+01

Pos logit contributions
 meant+0.221
Don+0.214
 said+0.211
 needed+0.184
 whispered+0.181
Neg logit contributions
igs-0.326
ric-0.321
ig-0.303
ocol-0.294
chie-0.291
Token play
Token ID711
Feature activation+5.00e+01

Pos logit contributions
 said+0.140
 went+0.130
 wanted+0.117
 had+0.111
 wondered+0.106
Neg logit contributions
ocol-0.378
igs-0.353
chie-0.348
eros-0.341
earned-0.335
Token again
Token ID757
Feature activation+4.92e+01

Pos logit contributions
Don+0.222
 said+0.221
 meant+0.216
 whispered+0.194
 sounded+0.189
Neg logit contributions
ric-0.279
igs-0.268
ocol-0.249
ig-0.249
chie-0.245
Token."
Token ID526
Feature activation+5.78e+01

Pos logit contributions
 meant+0.187
Don+0.185
 said+0.176
 needed+0.160
 whispered+0.158
Neg logit contributions
igs-0.293
ric-0.293
ig-0.283
chie-0.270
latable-0.266
Token They
Token ID1119
Feature activation+5.53e+01

Pos logit contributions
 meant+0.209
Don+0.204
 needed+0.164
jar+0.161
books+0.160
Neg logit contributions
igs-0.362
 our-0.350
ig-0.344
ap-0.342
ric-0.340
Token take
Token ID1011
Feature activation+5.04e+01

Pos logit contributions
 said+0.262
Don+0.252
 meant+0.249
 whispered+0.236
 needed+0.232
Neg logit contributions
igs-0.287
ric-0.277
ig-0.264
ocol-0.262
eros-0.258
Token their
Token ID511
Feature activation+4.96e+01

Pos logit contributions
 said+0.193
 went+0.180
 meant+0.179
 She+0.166
 cried+0.150
Neg logit contributions
ric-0.355
 Majesty-0.314
ocol-0.304
ode-0.303
chie-0.302
Token boats
Token ID16659
Feature activation+4.68e+01

Pos logit contributions
Don+0.224
 meant+0.223
 said+0.216
 needed+0.204
 whispered+0.195
Neg logit contributions
igs-0.258
ric-0.255
 Majesty-0.241
ig-0.241
ocol-0.228
Token and
Token ID290
Feature activation+5.47e+01

Pos logit contributions
 said+0.127
 meant+0.120
Don+0.115
 whispered+0.101
 needed+0.099
Neg logit contributions
ric-0.332
 Majesty-0.328
igs-0.328
ocol-0.314
ig-0.304
Token spaceship
Token ID40663
Feature activation+4.43e+01

Pos logit contributions
Don+0.277
 meant+0.270
 said+0.240
 needed+0.235
 whispered+0.233
Neg logit contributions
igs-0.197
ig-0.194
 our-0.183
 victory-0.179
ric-0.179
Token!"
Token ID2474
Feature activation+5.73e+01

Pos logit contributions
 meant+0.234
 said+0.224
Don+0.211
 whispered+0.204
 needed+0.204
Neg logit contributions
igs-0.208
ric-0.200
ig-0.193
latable-0.176
ocol-0.172
Token Ben
Token ID3932
Feature activation+4.98e+01

Pos logit contributions
Don+0.224
 meant+0.221
books+0.188
jar+0.186
 needed+0.183
Neg logit contributions
igs-0.327
ig-0.315
 our-0.315
ap-0.297
 arrest-0.294
Token said
Token ID531
Feature activation+5.13e+01

Pos logit contributions
Don+0.180
jar+0.167
Today+0.136
Wow+0.132
Welcome+0.128
Neg logit contributions
ey-0.272
ap-0.272
oy-0.271
 how-0.266
ig-0.263
Token.
Token ID13
Feature activation+5.76e+01

Pos logit contributions
Don+0.209
 meant+0.195
 needed+0.164
 said+0.164
jar+0.160
Neg logit contributions
igs-0.292
ig-0.286
ap-0.276
 our-0.275
ric-0.271
Token "
Token ID366
Feature activation+5.76e+01

Pos logit contributions
 meant+0.257
Don+0.242
chy+0.225
jar+0.220
 needed+0.218
Neg logit contributions
 our-0.270
 my-0.260
igs-0.259
ap-0.248
 your-0.240
TokenWe
Token ID1135
Feature activation+5.61e+01

Pos logit contributions
 meant+0.217
 said+0.194
Don+0.191
 needed+0.183
 whispered+0.181
Neg logit contributions
igs-0.333
ric-0.333
ig-0.318
 our-0.316
ap-0.310
Token can
Token ID460
Feature activation+5.29e+01

Pos logit contributions
 meant+0.253
Don+0.248
 said+0.228
 needed+0.223
 whispered+0.214
Neg logit contributions
igs-0.300
ig-0.287
ric-0.283
 our-0.266
ap-0.259
Token fly
Token ID6129
Feature activation+4.92e+01

Pos logit contributions
 meant+0.281
 said+0.280
 whispered+0.260
 wondered+0.258
Don+0.257
Neg logit contributions
igs-0.231
ig-0.224
ric-0.213
ocol-0.207
 risks-0.192
Token to
Token ID284
Feature activation+4.97e+01

Pos logit contributions
 meant+0.217
 said+0.208
Don+0.208
 whispered+0.189
 needed+0.188
Neg logit contributions
igs-0.252
ig-0.245
ric-0.240
 Majesty-0.234
ocol-0.233
Token the
Token ID262
Feature activation+4.86e+01

Pos logit contributions
 said+0.207
 meant+0.205
Don+0.197
 whispered+0.189
 needed+0.187
Neg logit contributions
ig-0.282
igs-0.274
ric-0.258
ode-0.250
immers-0.246
Token a
Token ID257
Feature activation+4.80e+01

Pos logit contributions
 meant+0.288
 said+0.276
Don+0.264
 needed+0.259
 whispered+0.259
Neg logit contributions
ig-0.187
igs-0.183
ric-0.178
chie-0.161
amental-0.159
Token spaceship
Token ID40663
Feature activation+4.43e+01

Pos logit contributions
Don+0.277
 meant+0.270
 said+0.240
 needed+0.235
 whispered+0.233
Neg logit contributions
igs-0.197
ig-0.194
 our-0.183
 victory-0.179
ric-0.179
Token!"
Token ID2474
Feature activation+5.73e+01

Pos logit contributions
 meant+0.234
 said+0.224
Don+0.211
 whispered+0.204
 needed+0.204
Neg logit contributions
igs-0.208
ric-0.200
ig-0.193
latable-0.176
ocol-0.172
Token Ben
Token ID3932
Feature activation+4.98e+01

Pos logit contributions
Don+0.224
 meant+0.221
books+0.188
jar+0.186
 needed+0.183
Neg logit contributions
igs-0.327
ig-0.315
 our-0.315
ap-0.297
 arrest-0.294
Token said
Token ID531
Feature activation+5.13e+01

Pos logit contributions
Don+0.180
jar+0.167
Today+0.136
Wow+0.132
Welcome+0.128
Neg logit contributions
ey-0.272
ap-0.272
oy-0.271
 how-0.266
ig-0.263
Token.
Token ID13
Feature activation+5.76e+01

Pos logit contributions
Don+0.209
 meant+0.195
 needed+0.164
 said+0.164
jar+0.160
Neg logit contributions
igs-0.292
ig-0.286
ap-0.276
 our-0.275
ric-0.271
Token "
Token ID366
Feature activation+5.76e+01

Pos logit contributions
 meant+0.257
Don+0.242
chy+0.225
jar+0.220
 needed+0.218
Neg logit contributions
 our-0.270
 my-0.260
igs-0.259
ap-0.248
 your-0.240
TokenWe
Token ID1135
Feature activation+5.61e+01

Pos logit contributions
 meant+0.217
 said+0.194
Don+0.191
 needed+0.183
 whispered+0.181
Neg logit contributions
igs-0.333
ric-0.333
ig-0.318
 our-0.316
ap-0.310
Token can
Token ID460
Feature activation+5.29e+01

Pos logit contributions
 meant+0.253
Don+0.248
 said+0.228
 needed+0.223
 whispered+0.214
Neg logit contributions
igs-0.300
ig-0.287
ric-0.283
 our-0.266
ap-0.259
Token fly
Token ID6129
Feature activation+4.92e+01

Pos logit contributions
 meant+0.281
 said+0.280
 whispered+0.260
 wondered+0.258
Don+0.257
Neg logit contributions
igs-0.231
ig-0.224
ric-0.213
ocol-0.207
 risks-0.192
Token to
Token ID284
Feature activation+4.97e+01

Pos logit contributions
 meant+0.217
 said+0.208
Don+0.208
 whispered+0.189
 needed+0.188
Neg logit contributions
igs-0.252
ig-0.245
ric-0.240
 Majesty-0.234
ocol-0.233
TokenCan
Token ID6090
Feature activation+5.56e+01

Pos logit contributions
 meant+0.216
 needed+0.215
chy+0.198
 She+0.189
 said+0.186
Neg logit contributions
 our-0.276
amental-0.248
 blast-0.242
 victory-0.241
 ourselves-0.235
Token we
Token ID356
Feature activation+5.21e+01

Pos logit contributions
Don+0.260
 said+0.246
 needed+0.228
 She+0.226
 meant+0.223
Neg logit contributions
igs-0.337
ric-0.322
ig-0.313
 Majesty-0.306
immers-0.305
Token go
Token ID467
Feature activation+4.74e+01

Pos logit contributions
 said+0.238
 meant+0.223
 needed+0.210
 She+0.208
 wanted+0.204
Neg logit contributions
igs-0.310
 Majesty-0.287
immers-0.283
ric-0.281
eros-0.278
Token outside
Token ID2354
Feature activation+4.46e+01

Pos logit contributions
 said+0.202
 whispered+0.170
 She+0.167
 meant+0.163
 needed+0.162
Neg logit contributions
igs-0.284
ric-0.280
ocol-0.279
 Majesty-0.276
chie-0.257
Token now
Token ID783
Feature activation+4.94e+01

Pos logit contributions
 meant+0.193
 said+0.190
Don+0.185
 needed+0.169
 whispered+0.167
Neg logit contributions
igs-0.247
ric-0.236
ig-0.227
 Majesty-0.214
chie-0.211
Token?"
Token ID1701
Feature activation+5.76e+01

Pos logit contributions
 said+0.227
 meant+0.225
 whispered+0.197
 She+0.193
Don+0.192
Neg logit contributions
ric-0.278
igs-0.265
chie-0.252
pieces-0.242
ig-0.238
Token Anna
Token ID11735
Feature activation+5.13e+01

Pos logit contributions
Don+0.276
 meant+0.272
books+0.255
 needed+0.236
jar+0.235
Neg logit contributions
 our-0.288
igs-0.258
 my-0.258
 your-0.254
ap-0.247
Token and
Token ID290
Feature activation+5.50e+01

Pos logit contributions
Don+0.220
jar+0.196
books+0.189
Wow+0.185
chy+0.181
Neg logit contributions
ig-0.246
ey-0.242
 my-0.239
 our-0.238
ap-0.236
Token Ben
Token ID3932
Feature activation+5.14e+01

Pos logit contributions
Don+0.184
 meant+0.179
 said+0.151
 needed+0.148
 whispered+0.141
Neg logit contributions
igs-0.357
ig-0.347
ric-0.342
 our-0.331
ap-0.324
Token asked
Token ID1965
Feature activation+5.35e+01

Pos logit contributions
Don+0.230
Me+0.195
jar+0.195
Well+0.182
Wow+0.182
Neg logit contributions
ap-0.250
ig-0.231
 breath-0.229
ey-0.228
 dive-0.227
Token.
Token ID13
Feature activation+5.90e+01

Pos logit contributions
Don+0.241
 meant+0.227
 needed+0.195
jar+0.195
Well+0.188
Neg logit contributions
igs-0.286
ap-0.283
ig-0.282
 our-0.276
ric-0.261
Token and
Token ID290
Feature activation+5.08e+01

Pos logit contributions
 meant+0.200
 said+0.196
Don+0.192
 needed+0.172
 whispered+0.170
Neg logit contributions
igs-0.241
ric-0.237
ig-0.228
 Majesty-0.223
 arrest-0.222
Token books
Token ID3835
Feature activation+4.35e+01

Pos logit contributions
 said+0.264
 meant+0.242
Don+0.233
 whispered+0.225
 needed+0.216
Neg logit contributions
ig-0.283
igs-0.270
ric-0.267
chie-0.266
latable-0.248
Token?"
Token ID1701
Feature activation+5.60e+01

Pos logit contributions
Don+0.262
 meant+0.254
 said+0.232
 needed+0.231
 whispered+0.227
Neg logit contributions
igs-0.164
ig-0.157
ric-0.152
 our-0.148
ap-0.141
Token
Token ID198
Feature activation+5.74e+01

Pos logit contributions
 meant+0.262
books+0.226
 needed+0.224
Don+0.223
chy+0.220
Neg logit contributions
 our-0.272
igs-0.263
 your-0.260
ap-0.255
 my-0.248
Token
Token ID198
Feature activation+5.71e+01

Pos logit contributions
 meant+0.213
Don+0.199
 needed+0.176
 whispered+0.160
books+0.159
Neg logit contributions
igs-0.360
 our-0.343
ig-0.342
ric-0.333
ap-0.323
Token"
Token ID1
Feature activation+5.76e+01

Pos logit contributions
 meant+0.256
Don+0.229
 needed+0.220
 said+0.199
 whispered+0.198
Neg logit contributions
igs-0.321
ig-0.311
 our-0.309
ap-0.289
ric-0.287
TokenYes
Token ID5297
Feature activation+5.35e+01

Pos logit contributions
 meant+0.294
 needed+0.262
chy+0.242
 whispered+0.238
books+0.237
Neg logit contributions
 our-0.248
 victory-0.242
igs-0.240
ap-0.227
ig-0.227
Token,
Token ID11
Feature activation+5.15e+01

Pos logit contributions
 meant+0.257
Don+0.254
 said+0.252
 needed+0.229
 whispered+0.226
Neg logit contributions
ric-0.266
igs-0.263
ig-0.246
 Majesty-0.228
 arrest-0.227
Token mom
Token ID1995
Feature activation+5.29e+01

Pos logit contributions
 said+0.225
 meant+0.222
Don+0.222
 whispered+0.192
 needed+0.188
Neg logit contributions
igs-0.314
ric-0.310
ig-0.285
immers-0.275
gy-0.271
Token,"
Token ID553
Feature activation+5.84e+01

Pos logit contributions
 meant+0.255
Don+0.248
 said+0.239
 needed+0.223
 whispered+0.219
Neg logit contributions
igs-0.271
ric-0.257
ig-0.256
 our-0.234
 Majesty-0.233
Token Anna
Token ID11735
Feature activation+5.26e+01

Pos logit contributions
 meant+0.318
Don+0.309
books+0.301
 needed+0.287
chy+0.278
Neg logit contributions
 our-0.285
 my-0.254
 your-0.250
igs-0.233
ap-0.228
TokenYou
Token ID1639
Feature activation+5.05e+01

Pos logit contributions
 meant+0.163
Don+0.146
 said+0.133
 needed+0.131
 whispered+0.127
Neg logit contributions
igs-0.355
ric-0.345
ig-0.342
 our-0.337
 victory-0.327
Token're
Token ID821
Feature activation+4.48e+01

Pos logit contributions
Don+0.240
 meant+0.239
 said+0.217
 needed+0.210
 whispered+0.205
Neg logit contributions
igs-0.260
ig-0.249
ric-0.245
 our-0.229
ap-0.224
Token welcome
Token ID7062
Feature activation+4.70e+01

Pos logit contributions
 meant+0.267
Don+0.265
 said+0.247
 needed+0.237
 whispered+0.234
Neg logit contributions
igs-0.179
ig-0.172
ric-0.162
 arrest-0.147
 Majesty-0.144
Token,
Token ID11
Feature activation+4.85e+01

Pos logit contributions
 said+0.206
 meant+0.201
Don+0.197
 She+0.188
 whispered+0.177
Neg logit contributions
ric-0.266
igs-0.261
ig-0.242
pieces-0.226
latable-0.221
Token duck
Token ID22045
Feature activation+4.54e+01

Pos logit contributions
 meant+0.227
Don+0.223
 said+0.220
 whispered+0.194
 needed+0.187
Neg logit contributions
igs-0.259
ric-0.254
ig-0.248
 Majesty-0.223
chie-0.222
Token!"
Token ID2474
Feature activation+5.75e+01

Pos logit contributions
 meant+0.235
 said+0.228
Don+0.227
 needed+0.211
 whispered+0.206
Neg logit contributions
igs-0.210
ric-0.196
ig-0.194
 arrest-0.181
 Majesty-0.178
Token Mia
Token ID32189
Feature activation+4.92e+01

Pos logit contributions
Don+0.297
 meant+0.291
 needed+0.258
 said+0.255
 whispered+0.251
Neg logit contributions
igs-0.268
ig-0.252
ric-0.247
 our-0.246
 victory-0.233
Token says
Token ID1139
Feature activation+5.10e+01

Pos logit contributions
Don+0.139
 meant+0.130
 needed+0.105
 said+0.101
books+0.096
Neg logit contributions
igs-0.345
ig-0.338
ric-0.334
 our-0.325
ap-0.318
Token.
Token ID13
Feature activation+5.73e+01

Pos logit contributions
Don+0.158
 meant+0.153
 said+0.137
 needed+0.126
 whispered+0.124
Neg logit contributions
igs-0.344
ric-0.334
ig-0.332
 our-0.313
 Majesty-0.309
Token
Token ID198
Feature activation+5.86e+01

Pos logit contributions
 meant+0.293
 needed+0.253
chy+0.246
jar+0.244
books+0.244
Neg logit contributions
 our-0.261
igs-0.245
 your-0.243
 my-0.238
ap-0.238
Token
Token ID198
Feature activation+5.85e+01

Pos logit contributions
 meant+0.166
Don+0.165
 needed+0.133
 said+0.130
 whispered+0.124
Neg logit contributions
igs-0.411
ig-0.394
ric-0.393
 our-0.382
ap-0.370
Token stops
Token ID9911
Feature activation+4.50e+01

Pos logit contributions
Don+0.239
jar+0.214
books+0.205
 meant+0.189
Grand+0.188
Neg logit contributions
 our-0.204
ric-0.200
igs-0.188
ig-0.188
 arrest-0.184
Token near
Token ID1474
Feature activation+4.65e+01

Pos logit contributions
Don+0.257
 meant+0.242
books+0.224
Well+0.220
jar+0.219
Neg logit contributions
ric-0.189
igs-0.185
 our-0.185
ig-0.175
ap-0.167
Token their
Token ID511
Feature activation+4.56e+01

Pos logit contributions
Don+0.329
 meant+0.306
jar+0.288
Well+0.287
 needed+0.286
Neg logit contributions
 our-0.142
ric-0.122
igs-0.121
ap-0.114
 victory-0.107
Token house
Token ID2156
Feature activation+4.30e+01

Pos logit contributions
Don+0.205
 meant+0.204
 said+0.191
 needed+0.185
 whispered+0.176
Neg logit contributions
igs-0.243
ig-0.230
ric-0.222
ap-0.205
 our-0.203
Token.
Token ID13
Feature activation+5.64e+01

Pos logit contributions
Don+0.262
Wow+0.223
Well+0.217
 meant+0.210
Uh+0.205
Neg logit contributions
ap-0.180
ric-0.176
 our-0.166
igs-0.159
ig-0.154
Token
Token ID198
Feature activation+5.75e+01

Pos logit contributions
Don+0.223
 meant+0.222
 needed+0.188
jar+0.182
 whispered+0.180
Neg logit contributions
 our-0.315
igs-0.308
ap-0.295
ric-0.286
ig-0.285
Token
Token ID198
Feature activation+5.71e+01

Pos logit contributions
 meant+0.213
Don+0.204
 needed+0.182
 whispered+0.171
 wondered+0.165
Neg logit contributions
igs-0.342
 our-0.335
ig-0.318
ric-0.314
ap-0.313
Token"
Token ID1
Feature activation+5.67e+01

Pos logit contributions
 meant+0.284
Don+0.279
 needed+0.250
 said+0.250
 whispered+0.243
Neg logit contributions
igs-0.276
ig-0.258
 our-0.257
ric-0.245
ap-0.244
TokenLook
Token ID8567
Feature activation+5.04e+01

Pos logit contributions
 meant+0.255
 needed+0.250
chy+0.244
 de+0.233
 whispered+0.230
Neg logit contributions
amental-0.250
 our-0.234
ode-0.225
latable-0.222
kel-0.220
Token,
Token ID11
Feature activation+5.17e+01

Pos logit contributions
 said+0.203
 meant+0.184
 whispered+0.166
Don+0.166
 needed+0.165
Neg logit contributions
ric-0.302
igs-0.291
 arrest-0.277
chie-0.268
ode-0.263
Token Anna
Token ID11735
Feature activation+5.19e+01

Pos logit contributions
 meant+0.169
 said+0.168
Don+0.159
 needed+0.137
 whispered+0.137
Neg logit contributions
igs-0.360
ig-0.352
ric-0.349
chie-0.331
ode-0.317
Token,
Token ID11
Feature activation+4.80e+01

Pos logit contributions
Don+0.186
 meant+0.175
 said+0.171
 She+0.164
 whispered+0.152
Neg logit contributions
igs-0.291
ric-0.281
ocol-0.262
 Majesty-0.260
ig-0.258
Token I
Token ID314
Feature activation+5.06e+01

Pos logit contributions
Don+0.182
 said+0.176
 meant+0.167
 wanted+0.153
 whispered+0.141
Neg logit contributions
igs-0.338
ric-0.321
ig-0.309
igsaw-0.301
immers-0.291
Token will
Token ID481
Feature activation+4.92e+01

Pos logit contributions
 meant+0.230
Don+0.222
 said+0.219
 needed+0.202
 whispered+0.198
Neg logit contributions
igs-0.293
ric-0.270
ig-0.266
 Majesty-0.254
ocol-0.250
Token help
Token ID1037
Feature activation+4.86e+01

Pos logit contributions
 said+0.236
 meant+0.233
 whispered+0.218
Don+0.213
 needed+0.204
Neg logit contributions
igs-0.254
ig-0.250
ric-0.244
pieces-0.234
chie-0.228
Token you
Token ID345
Feature activation+4.92e+01

Pos logit contributions
 said+0.281
 meant+0.276
Don+0.274
 She+0.253
 whispered+0.253
Neg logit contributions
ric-0.196
ig-0.194
igs-0.192
latable-0.183
 Majesty-0.179
Token."
Token ID526
Feature activation+5.75e+01

Pos logit contributions
 meant+0.196
 said+0.183
 whispered+0.172
Don+0.171
 She+0.164
Neg logit contributions
igs-0.283
ric-0.281
ig-0.280
ocol-0.267
latable-0.264
Token said
Token ID531
Feature activation+5.13e+01

Pos logit contributions
Don+0.192
 meant+0.186
books+0.184
jar+0.180
Today+0.163
Neg logit contributions
igs-0.341
 our-0.336
ig-0.321
 victory-0.314
ap-0.309
Token Sam
Token ID3409
Feature activation+5.05e+01

Pos logit contributions
Don+0.265
 meant+0.247
jar+0.232
books+0.226
Today+0.226
Neg logit contributions
igs-0.231
ig-0.223
 our-0.216
ap-0.208
 victory-0.205
Token.
Token ID13
Feature activation+5.75e+01

Pos logit contributions
Don+0.240
 meant+0.219
jar+0.214
Wow+0.209
Today+0.204
Neg logit contributions
ig-0.247
igs-0.226
ap-0.222
ric-0.221
 our-0.216
Token He
Token ID679
Feature activation+5.43e+01

Pos logit contributions
 meant+0.255
books+0.245
jar+0.244
 faded+0.241
chy+0.240
Neg logit contributions
 our-0.264
 my-0.248
igs-0.242
 your-0.238
 arrest-0.225
Token took
Token ID1718
Feature activation+5.08e+01

Pos logit contributions
Don+0.195
 meant+0.184
books+0.158
jar+0.156
Today+0.153
Neg logit contributions
igs-0.329
ig-0.326
ric-0.304
 our-0.297
ap-0.291
Token ball
Token ID2613
Feature activation+4.34e+01

Pos logit contributions
Don+0.218
 meant+0.210
 said+0.201
 needed+0.189
 whispered+0.187
Neg logit contributions
ric-0.234
igs-0.234
ig-0.218
 Majesty-0.212
 our-0.209
Token.
Token ID13
Feature activation+5.59e+01

Pos logit contributions
Don+0.205
 meant+0.191
 whispered+0.164
 wondered+0.164
 needed+0.162
Neg logit contributions
igs-0.234
ig-0.233
ap-0.225
 our-0.217
ric-0.213
Token They
Token ID1119
Feature activation+5.33e+01

Pos logit contributions
Don+0.186
 meant+0.183
 said+0.149
 needed+0.148
 whispered+0.146
Neg logit contributions
igs-0.359
ric-0.353
ig-0.349
 our-0.348
ap-0.345
Token are
Token ID389
Feature activation+4.85e+01

Pos logit contributions
Don+0.316
 meant+0.311
 said+0.290
 needed+0.283
 whispered+0.277
Neg logit contributions
igs-0.209
ig-0.198
ric-0.194
 our-0.178
ap-0.173
Token clever
Token ID14169
Feature activation+4.80e+01

Pos logit contributions
 meant+0.198
Don+0.195
 said+0.177
 needed+0.166
 whispered+0.162
Neg logit contributions
igs-0.279
ig-0.275
ric-0.271
ocol-0.250
 arrest-0.250
Token.
Token ID13
Feature activation+5.75e+01

Pos logit contributions
 meant+0.184
Don+0.184
 said+0.163
 needed+0.156
 whispered+0.151
Neg logit contributions
igs-0.287
ric-0.279
ig-0.277
 our-0.259
ap-0.254
Token They
Token ID1119
Feature activation+5.51e+01

Pos logit contributions
Don+0.190
 meant+0.187
 needed+0.152
 said+0.151
 whispered+0.150
Neg logit contributions
igs-0.365
ric-0.362
 our-0.361
ig-0.357
ap-0.356
Token know
Token ID760
Feature activation+5.21e+01

Pos logit contributions
Don+0.320
 meant+0.314
 said+0.288
 needed+0.282
 whispered+0.278
Neg logit contributions
igs-0.221
ig-0.211
ric-0.208
 our-0.193
ap-0.187
Token how
Token ID703
Feature activation+5.31e+01

Pos logit contributions
 meant+0.328
Don+0.327
 said+0.312
 needed+0.296
 whispered+0.292
Neg logit contributions
igs-0.186
ric-0.182
ig-0.178
chie-0.158
 arrest-0.153
Token to
Token ID284
Feature activation+5.25e+01

Pos logit contributions
Don+0.224
 meant+0.203
 needed+0.202
 wanted+0.199
 said+0.198
Neg logit contributions
ric-0.316
ig-0.310
ode-0.310
ocol-0.296
 Majesty-0.295
Token share
Token ID2648
Feature activation+5.29e+01

Pos logit contributions
 meant+0.220
 said+0.218
Don+0.209
 wanted+0.200
 She+0.197
Neg logit contributions
ric-0.275
 Majesty-0.273
ocol-0.270
igs-0.261
eros-0.261
Token but
Token ID475
Feature activation+5.44e+01

Pos logit contributions
Don+0.200
 meant+0.196
 said+0.171
 needed+0.167
books+0.164
Neg logit contributions
igs-0.272
ig-0.261
 our-0.260
ric-0.259
 victory-0.245
Token they
Token ID484
Feature activation+5.27e+01

Pos logit contributions
Don+0.232
 meant+0.228
 said+0.204
 needed+0.197
 whispered+0.191
Neg logit contributions
igs-0.308
ig-0.297
ric-0.293
 our-0.274
ap-0.270
Token were
Token ID547
Feature activation+4.86e+01

Pos logit contributions
Don+0.214
jar+0.196
 meant+0.193
books+0.193
Grand+0.191
Neg logit contributions
ig-0.262
ric-0.262
igs-0.254
 our-0.250
 victory-0.244
Token more
Token ID517
Feature activation+4.78e+01

Pos logit contributions
 meant+0.175
Don+0.167
 said+0.155
 needed+0.145
 whispered+0.141
Neg logit contributions
igs-0.310
ig-0.308
ric-0.306
 arrest-0.282
ocol-0.280
Token careful
Token ID8161
Feature activation+4.92e+01

Pos logit contributions
Don+0.288
 meant+0.274
books+0.251
Wow+0.250
 needed+0.250
Neg logit contributions
igs-0.191
ig-0.171
 our-0.161
ap-0.154
ric-0.150
Token.
Token ID13
Feature activation+5.75e+01

Pos logit contributions
 meant+0.165
Don+0.158
 said+0.141
 needed+0.135
 She+0.124
Neg logit contributions
igs-0.331
ig-0.323
ric-0.315
 Majesty-0.307
ocol-0.300
Token<|endoftext|>
Token ID50256
Feature activation+5.60e+01

Pos logit contributions
 meant+0.214
Don+0.204
 spoke+0.189
 whispered+0.186
 said+0.185
Neg logit contributions
 our-0.332
ric-0.322
ap-0.303
igs-0.302
 victory-0.287
TokenAnna
Token ID31160
Feature activation+5.08e+01

Pos logit contributions
Don+0.218
 meant+0.210
 wondered+0.188
 whispered+0.187
 said+0.180
Neg logit contributions
ig-0.317
igs-0.313
ric-0.305
ap-0.298
 our-0.294
Token and
Token ID290
Feature activation+5.50e+01

Pos logit contributions
Don+0.239
jar+0.214
Wow+0.205
books+0.198
 meant+0.196
Neg logit contributions
ig-0.246
e-0.233
 arrest-0.229
 our-0.223
igs-0.220
Token Ben
Token ID3932
Feature activation+5.16e+01

Pos logit contributions
Don+0.181
 meant+0.179
 said+0.175
 She+0.153
 needed+0.153
Neg logit contributions
igs-0.393
ig-0.374
ric-0.373
chie-0.350
 Majesty-0.342
Token were
Token ID547
Feature activation+5.18e+01

Pos logit contributions
Don+0.193
jar+0.190
books+0.176
 meant+0.166
Grand+0.162
Neg logit contributions
 our-0.275
 arrest-0.269
ig-0.258
ap-0.258
igs-0.251
Token with
Token ID351
Feature activation+4.76e+01

Pos logit contributions
 meant+0.213
Don+0.212
 said+0.193
 She+0.187
books+0.179
Neg logit contributions
ric-0.276
ig-0.258
igs-0.256
chie-0.246
ode-0.234
Token mom
Token ID1995
Feature activation+4.75e+01

Pos logit contributions
 meant+0.247
Don+0.242
 said+0.227
 needed+0.223
 whispered+0.212
Neg logit contributions
ig-0.220
igs-0.217
ric-0.210
 Majesty-0.194
chie-0.194
Token's
Token ID338
Feature activation+4.66e+01

Pos logit contributions
Don+0.270
 meant+0.255
 said+0.233
 needed+0.232
 whispered+0.230
Neg logit contributions
ig-0.188
igs-0.184
 our-0.179
ric-0.179
ap-0.168
Token necklace
Token ID38987
Feature activation+4.55e+01

Pos logit contributions
Don+0.283
 meant+0.280
 said+0.257
 needed+0.252
 whispered+0.248
Neg logit contributions
igs-0.176
ig-0.167
ric-0.162
 our-0.155
ap-0.146
Token again
Token ID757
Feature activation+4.89e+01

Pos logit contributions
Don+0.258
 meant+0.246
 said+0.224
 needed+0.222
 wondered+0.220
Neg logit contributions
igs-0.193
ig-0.190
 our-0.172
ric-0.171
ap-0.168
Token.
Token ID13
Feature activation+5.74e+01

Pos logit contributions
Don+0.229
 meant+0.206
Wow+0.185
Well+0.180
 needed+0.180
Neg logit contributions
igs-0.249
 our-0.248
ig-0.244
ap-0.233
 victory-0.224
Token They
Token ID1119
Feature activation+5.56e+01

Pos logit contributions
Don+0.195
 meant+0.191
 whispered+0.172
 spoke+0.170
 said+0.169
Neg logit contributions
 our-0.349
ric-0.340
igs-0.318
ap-0.312
 ourselves-0.304
Token still
Token ID991
Feature activation+5.05e+01

Pos logit contributions
jar+0.268
Don+0.268
books+0.260
oled+0.250
book+0.243
Neg logit contributions
ric-0.215
e-0.212
 least-0.208
 our-0.205
 stay-0.205
Token played
Token ID2826
Feature activation+5.30e+01

Pos logit contributions
Don+0.176
jar+0.156
books+0.155
Grand+0.148
Today+0.146
Neg logit contributions
ric-0.298
 our-0.290
ap-0.275
e-0.270
igs-0.270
Token in
Token ID287
Feature activation+5.04e+01

Pos logit contributions
Don+0.280
 meant+0.274
 said+0.253
 needed+0.244
 whispered+0.240
Neg logit contributions
igs-0.241
ric-0.233
ig-0.232
 our-0.210
 Majesty-0.204
Token the
Token ID262
Feature activation+4.84e+01

Pos logit contributions
Don+0.278
 meant+0.275
 said+0.254
 needed+0.247
 whispered+0.242
Neg logit contributions
igs-0.217
ig-0.209
ric-0.204
 our-0.184
ap-0.184
Token We
Token ID775
Feature activation+5.17e+01

Pos logit contributions
Don+0.248
 meant+0.242
 said+0.218
 needed+0.210
 whispered+0.205
Neg logit contributions
igs-0.280
ig-0.270
ric-0.262
 our-0.237
 Majesty-0.237
Token pulled
Token ID5954
Feature activation+4.62e+01

Pos logit contributions
Don+0.231
jar+0.207
 meant+0.201
Grand+0.197
books+0.197
Neg logit contributions
ig-0.259
igs-0.257
ric-0.252
 our-0.249
ap-0.249
Token the
Token ID262
Feature activation+4.51e+01

Pos logit contributions
Don+0.280
 meant+0.274
 said+0.252
 needed+0.248
 whispered+0.244
Neg logit contributions
igs-0.176
ig-0.167
ric-0.164
 our-0.155
ap-0.147
Token knot
Token ID29654
Feature activation+4.18e+01

Pos logit contributions
Don+0.237
 meant+0.228
 wondered+0.213
 needed+0.207
 suggested+0.205
Neg logit contributions
ig-0.200
igs-0.197
 our-0.195
 victory-0.193
ric-0.180
Token."
Token ID526
Feature activation+5.69e+01

Pos logit contributions
Don+0.186
 meant+0.175
 said+0.155
 needed+0.153
 whispered+0.148
Neg logit contributions
igs-0.229
ig-0.222
 our-0.214
ric-0.214
ap-0.206
Token
Token ID198
Feature activation+5.74e+01

Pos logit contributions
Don+0.175
 meant+0.174
 needed+0.142
books+0.138
 said+0.136
Neg logit contributions
igs-0.380
ig-0.362
 our-0.360
ric-0.359
ap-0.345
Token
Token ID198
Feature activation+5.72e+01

Pos logit contributions
 meant+0.201
Don+0.197
 needed+0.167
books+0.161
 whispered+0.154
Neg logit contributions
igs-0.365
 our-0.353
ig-0.344
ric-0.341
ap-0.331
TokenM
Token ID44
Feature activation+5.24e+01

Pos logit contributions
Don+0.200
 meant+0.190
 said+0.171
 needed+0.159
 whispered+0.156
Neg logit contributions
igs-0.367
ig-0.357
ric-0.357
 our-0.332
 Majesty-0.331
Tokenama
Token ID1689
Feature activation+5.08e+01

Pos logit contributions
Don+0.332
 meant+0.311
 needed+0.285
 said+0.284
 wondered+0.281
Neg logit contributions
igs-0.207
ig-0.197
ric-0.181
ap-0.170
 our-0.164
Token sigh
Token ID19680
Feature activation+4.90e+01

Pos logit contributions
Don+0.252
 meant+0.241
 needed+0.216
books+0.214
jar+0.212
Neg logit contributions
igs-0.245
ig-0.240
ric-0.234
 our-0.224
ap-0.219
Tokens
Token ID82
Feature activation+4.79e+01

Pos logit contributions
Don+0.238
jar+0.216
books+0.206
 meant+0.206
 wondered+0.195
Neg logit contributions
igs-0.241
ig-0.240
ap-0.238
 our-0.235
ric-0.216
Token your
Token ID534
Feature activation+4.57e+01

Pos logit contributions
 meant+0.248
 said+0.241
Don+0.241
 whispered+0.219
 needed+0.211
Neg logit contributions
igs-0.233
ig-0.226
ric-0.225
chie-0.214
 Majesty-0.212
Token toys
Token ID14958
Feature activation+4.47e+01

Pos logit contributions
Don+0.212
 meant+0.209
 said+0.190
 needed+0.184
 whispered+0.180
Neg logit contributions
igs-0.237
ric-0.229
ig-0.228
 our-0.208
ap-0.207
Token and
Token ID290
Feature activation+5.08e+01

Pos logit contributions
 meant+0.200
 said+0.196
Don+0.192
 needed+0.172
 whispered+0.170
Neg logit contributions
igs-0.241
ric-0.237
ig-0.228
 Majesty-0.223
 arrest-0.222
Token books
Token ID3835
Feature activation+4.35e+01

Pos logit contributions
 said+0.264
 meant+0.242
Don+0.233
 whispered+0.225
 needed+0.216
Neg logit contributions
ig-0.283
igs-0.270
ric-0.267
chie-0.266
latable-0.248
Token?"
Token ID1701
Feature activation+5.60e+01

Pos logit contributions
Don+0.262
 meant+0.254
 said+0.232
 needed+0.231
 whispered+0.227
Neg logit contributions
igs-0.164
ig-0.157
ric-0.152
 our-0.148
ap-0.141
Token
Token ID198
Feature activation+5.74e+01

Pos logit contributions
 meant+0.262
books+0.226
 needed+0.224
Don+0.223
chy+0.220
Neg logit contributions
 our-0.272
igs-0.263
 your-0.260
ap-0.255
 my-0.248
Token
Token ID198
Feature activation+5.71e+01

Pos logit contributions
 meant+0.213
Don+0.199
 needed+0.176
 whispered+0.160
books+0.159
Neg logit contributions
igs-0.360
 our-0.343
ig-0.342
ric-0.333
ap-0.323
Token"
Token ID1
Feature activation+5.76e+01

Pos logit contributions
 meant+0.256
Don+0.229
 needed+0.220
 said+0.199
 whispered+0.198
Neg logit contributions
igs-0.321
ig-0.311
 our-0.309
ap-0.289
ric-0.287
TokenYes
Token ID5297
Feature activation+5.35e+01

Pos logit contributions
 meant+0.294
 needed+0.262
chy+0.242
 whispered+0.238
books+0.237
Neg logit contributions
 our-0.248
 victory-0.242
igs-0.240
ap-0.227
ig-0.227
Token,
Token ID11
Feature activation+5.15e+01

Pos logit contributions
 meant+0.257
Don+0.254
 said+0.252
 needed+0.229
 whispered+0.226
Neg logit contributions
ric-0.266
igs-0.263
ig-0.246
 Majesty-0.228
 arrest-0.227
Token mom
Token ID1995
Feature activation+5.29e+01

Pos logit contributions
 said+0.225
 meant+0.222
Don+0.222
 whispered+0.192
 needed+0.188
Neg logit contributions
igs-0.314
ric-0.310
ig-0.285
immers-0.275
gy-0.271
Token The
Token ID383
Feature activation+4.87e+01

Pos logit contributions
Don+0.191
 meant+0.180
 She+0.159
 said+0.155
 needed+0.149
Neg logit contributions
igs-0.337
ig-0.323
ric-0.297
 Majesty-0.288
 victory-0.287
Token fireworks
Token ID26056
Feature activation+4.26e+01

Pos logit contributions
Don+0.254
 meant+0.246
 said+0.223
 whispered+0.221
 needed+0.213
Neg logit contributions
igs-0.228
ric-0.220
ig-0.213
 our-0.211
 victory-0.206
Token will
Token ID481
Feature activation+4.89e+01

Pos logit contributions
Don+0.228
 meant+0.205
Grand+0.188
books+0.187
 She+0.185
Neg logit contributions
igs-0.196
ig-0.194
ric-0.192
 our-0.187
ap-0.177
Token start
Token ID923
Feature activation+4.52e+01

Pos logit contributions
Don+0.317
 meant+0.309
 said+0.286
 needed+0.281
 whispered+0.277
Neg logit contributions
igs-0.165
ric-0.154
ig-0.152
 our-0.144
ap-0.133
Token soon
Token ID2582
Feature activation+4.74e+01

Pos logit contributions
Don+0.229
 meant+0.226
 said+0.206
 needed+0.201
 whispered+0.196
Neg logit contributions
igs-0.214
ig-0.205
ric-0.202
 our-0.188
ap-0.184
Token."
Token ID526
Feature activation+5.74e+01

Pos logit contributions
Don+0.248
 meant+0.230
 said+0.205
 needed+0.204
books+0.201
Neg logit contributions
igs-0.227
ig-0.216
 our-0.210
ric-0.207
ap-0.195
Token
Token ID198
Feature activation+5.67e+01

Pos logit contributions
 meant+0.231
Don+0.230
books+0.193
 needed+0.192
 whispered+0.192
Neg logit contributions
igs-0.321
 our-0.313
ig-0.309
ric-0.297
ap-0.294
Token
Token ID198
Feature activation+5.65e+01

Pos logit contributions
 meant+0.219
Don+0.196
 needed+0.182
 spoke+0.168
books+0.166
Neg logit contributions
igs-0.343
 our-0.330
ig-0.323
ric-0.309
ap-0.306
TokenL
Token ID43
Feature activation+4.76e+01

Pos logit contributions
 meant+0.198
Don+0.169
 needed+0.160
 said+0.151
 whispered+0.147
Neg logit contributions
igs-0.367
ig-0.354
 our-0.348
ap-0.331
ric-0.330
Tokenily
Token ID813
Feature activation+5.09e+01

Pos logit contributions
 meant+0.106
Don+0.093
 said+0.078
 needed+0.066
books+0.063
Neg logit contributions
ric-0.376
igs-0.371
ig-0.356
 Majesty-0.346
ode-0.336
Token and
Token ID290
Feature activation+5.45e+01

Pos logit contributions
Don+0.272
 meant+0.247
jar+0.242
Wow+0.237
books+0.234
Neg logit contributions
ig-0.222
igs-0.207
ap-0.190
ric-0.183
ode-0.181
Token.
Token ID13
Feature activation+5.31e+01

Pos logit contributions
 meant+0.225
 said+0.217
Don+0.211
 needed+0.194
 whispered+0.192
Neg logit contributions
igs-0.217
ric-0.215
ig-0.211
pieces-0.205
latable-0.193
Token Go
Token ID1514
Feature activation+4.89e+01

Pos logit contributions
Don+0.199
 meant+0.188
 She+0.169
 said+0.164
 needed+0.158
Neg logit contributions
igs-0.367
ig-0.355
ric-0.328
 Majesty-0.310
pieces-0.307
Token away
Token ID1497
Feature activation+4.58e+01

Pos logit contributions
Don+0.264
 meant+0.254
 said+0.247
 needed+0.238
 whispered+0.226
Neg logit contributions
igs-0.240
ric-0.216
ig-0.215
 Majesty-0.204
ocol-0.200
Token now
Token ID783
Feature activation+4.86e+01

Pos logit contributions
 meant+0.218
 said+0.213
Don+0.197
 needed+0.185
 whispered+0.185
Neg logit contributions
ric-0.241
igs-0.236
pieces-0.235
ig-0.231
latable-0.220
Token."
Token ID526
Feature activation+5.78e+01

Pos logit contributions
Don+0.326
 meant+0.310
books+0.284
 wondered+0.283
 needed+0.282
Neg logit contributions
igs-0.153
 our-0.147
ig-0.147
ap-0.136
 arrest-0.133
Token
Token ID198
Feature activation+5.74e+01

Pos logit contributions
 meant+0.214
Don+0.203
books+0.194
jar+0.180
chy+0.179
Neg logit contributions
 our-0.330
igs-0.326
ric-0.323
ap-0.305
 arrest-0.304
Token
Token ID198
Feature activation+5.72e+01

Pos logit contributions
 meant+0.203
Don+0.188
 needed+0.169
books+0.162
 whispered+0.160
Neg logit contributions
igs-0.358
ric-0.349
 our-0.339
ig-0.336
 arrest-0.318
TokenJen
Token ID44875
Feature activation+4.93e+01

Pos logit contributions
 meant+0.261
Don+0.238
 needed+0.227
books+0.221
 whispered+0.220
Neg logit contributions
igs-0.299
ric-0.291
 our-0.281
ig-0.279
 arrest-0.271
Token and
Token ID290
Feature activation+5.48e+01

Pos logit contributions
Don+0.222
books+0.219
 meant+0.216
jar+0.213
Today+0.211
Neg logit contributions
ig-0.227
ric-0.212
igs-0.203
ap-0.202
 our-0.200
Token Tom
Token ID4186
Feature activation+5.14e+01

Pos logit contributions
Don+0.204
 meant+0.200
 needed+0.169
 said+0.169
 whispered+0.161
Neg logit contributions
igs-0.334
ric-0.324
ig-0.324
 our-0.313
ap-0.304
Token were
Token ID547
Feature activation+5.06e+01

Pos logit contributions
books+0.176
 meant+0.175
jar+0.174
Don+0.169
Today+0.168
Neg logit contributions
ric-0.287
ap-0.285
ig-0.281
 our-0.270
igs-0.262
Token his
Token ID465
Feature activation-9.75e+01

Pos logit contributions
 meant+0.222
Don+0.215
 said+0.208
 whispered+0.200
 needed+0.194
Neg logit contributions
ric-0.191
igs-0.191
latable-0.190
ig-0.183
 arrest-0.172
Token head
Token ID1182
Feature activation-9.13e+01

Pos logit contributions
igs+0.440
ig+0.420
ric+0.413
 our+0.384
ap+0.375
Neg logit contributions
Don-0.515
 meant-0.508
 said-0.467
 needed-0.455
 whispered-0.446
Token out
Token ID503
Feature activation-8.61e+01

Pos logit contributions
ig+0.407
ric+0.395
ap+0.392
 our+0.379
igs+0.379
Neg logit contributions
Don-0.472
 meant-0.420
Wow-0.410
Well-0.384
jar-0.384
Token of
Token ID286
Feature activation-8.08e+01

Pos logit contributions
igs+0.388
ig+0.367
 our+0.360
ric+0.354
ap+0.343
Neg logit contributions
Don-0.465
 meant-0.435
books-0.383
 said-0.382
 needed-0.380
Token his
Token ID465
Feature activation-8.34e+01

Pos logit contributions
igs+0.381
ig+0.380
ric+0.372
ap+0.343
 Majesty+0.337
Neg logit contributions
 meant-0.406
Don-0.403
 said-0.380
 needed-0.360
 whispered-0.356
Token bur
Token ID4356
Feature activation-1.16e+02

Pos logit contributions
igs+0.389
ric+0.367
ig+0.362
 our+0.329
ap+0.324
Neg logit contributions
 meant-0.421
Don-0.417
 said-0.400
 needed-0.388
 whispered-0.378
Tokenrow
Token ID808
Feature activation-8.97e+01

Pos logit contributions
 victory+0.659
latable+0.630
pieces+0.615
 arrest+0.609
 our+0.608
Neg logit contributions
 meant-0.480
 needed-0.442
 said-0.437
 whispered-0.430
Don-0.395
Token and
Token ID290
Feature activation-6.35e+01

Pos logit contributions
igs+0.502
ig+0.486
ric+0.474
 our+0.451
ap+0.444
Neg logit contributions
Don-0.383
 meant-0.365
 said-0.320
 needed-0.310
 whispered-0.307
Token saw
Token ID2497
Feature activation-6.76e+01

Pos logit contributions
igs+0.372
ig+0.358
ric+0.355
 our+0.345
ap+0.332
Neg logit contributions
Don-0.247
 meant-0.238
 said-0.202
 needed-0.198
books-0.195
Token something
Token ID1223
Feature activation-6.57e+01

Pos logit contributions
igs+0.261
ig+0.247
ric+0.243
 our+0.226
ap+0.216
Neg logit contributions
Don-0.403
 meant-0.395
 said-0.363
 needed-0.357
 whispered-0.352
Token new
Token ID649
Feature activation-6.83e+01

Pos logit contributions
igs+0.264
 our+0.255
 victory+0.254
ig+0.249
ric+0.235
Neg logit contributions
Don-0.356
 meant-0.312
 She-0.300
Wow-0.300
jar-0.298
Token reward
Token ID6721
Feature activation-9.35e+01

Pos logit contributions
igs+0.526
ig+0.493
ric+0.488
 our+0.448
 Majesty+0.443
Neg logit contributions
 meant-0.479
Don-0.467
 said-0.441
 needed-0.432
 whispered-0.407
Token.
Token ID13
Feature activation-5.71e+01

Pos logit contributions
igs+0.462
ig+0.444
ric+0.435
 our+0.423
ap+0.405
Neg logit contributions
Don-0.454
 meant-0.435
 said-0.388
 needed-0.382
 whispered-0.377
Token It
Token ID632
Feature activation-6.42e+01

Pos logit contributions
 our+0.305
ric+0.302
ap+0.295
igs+0.289
ig+0.289
Neg logit contributions
 meant-0.231
Don-0.218
 needed-0.189
 whispered-0.189
books-0.188
Token was
Token ID373
Feature activation-6.21e+01

Pos logit contributions
ig+0.352
 our+0.344
igs+0.343
ric+0.328
 Majesty+0.324
Neg logit contributions
Don-0.254
books-0.225
jar-0.220
Grand-0.213
 meant-0.213
Token ch
Token ID442
Feature activation-7.16e+01

Pos logit contributions
 our+0.233
igs+0.225
 victory+0.213
ric+0.210
 my+0.206
Neg logit contributions
Don-0.377
 meant-0.334
Well-0.318
books-0.317
jar-0.314
Tokenocol
Token ID4668
Feature activation-1.15e+02

Pos logit contributions
ric+0.229
igs+0.229
 our+0.212
ig+0.206
 Majesty+0.205
Neg logit contributions
Don-0.453
 meant-0.451
 needed-0.412
 She-0.410
 said-0.410
Tokenates
Token ID689
Feature activation-7.40e+01

Pos logit contributions
latable+0.390
 our+0.367
igs+0.360
 victory+0.359
 Majesty+0.338
Neg logit contributions
 meant-0.684
Don-0.644
books-0.614
 said-0.612
 whispered-0.599
Token and
Token ID290
Feature activation-5.60e+01

Pos logit contributions
igs+0.321
ig+0.317
ric+0.306
 our+0.295
ap+0.288
Neg logit contributions
Don-0.410
 meant-0.383
 said-0.346
 needed-0.341
 whispered-0.340
Token sweets
Token ID42402
Feature activation-6.74e+01

Pos logit contributions
igs+0.287
ric+0.273
ig+0.273
 our+0.264
 Majesty+0.249
Neg logit contributions
Don-0.262
 meant-0.252
 said-0.223
 needed-0.217
 whispered-0.216
Token!
Token ID0
Feature activation-4.46e+01

Pos logit contributions
ig+0.269
igs+0.269
 our+0.255
ric+0.253
 victory+0.242
Neg logit contributions
Don-0.382
 meant-0.333
Wow-0.313
Grand-0.310
Well-0.306
Token
Token ID198
Feature activation-4.39e+01

Pos logit contributions
 our+0.230
igs+0.227
ric+0.227
ig+0.223
ap+0.219
Neg logit contributions
 meant-0.188
Don-0.180
 needed-0.162
books-0.160
 whispered-0.153
Token children
Token ID1751
Feature activation-8.09e+01

Pos logit contributions
Don+0.188
books+0.166
 meant+0.161
jar+0.155
Wow+0.152
Neg logit contributions
ig-0.319
 our-0.316
igs-0.305
ric-0.298
 my-0.289
Token put
Token ID1234
Feature activation-7.44e+01

Pos logit contributions
ric+0.443
ig+0.439
ap+0.434
igs+0.432
 our+0.424
Neg logit contributions
Don-0.337
 meant-0.304
jar-0.278
books-0.270
Wow-0.259
Token on
Token ID319
Feature activation-7.50e+01

Pos logit contributions
igs+0.324
ig+0.309
ric+0.303
 our+0.286
ap+0.276
Neg logit contributions
Don-0.408
 meant-0.399
 said-0.364
 needed-0.357
 whispered-0.351
Token their
Token ID511
Feature activation-8.05e+01

Pos logit contributions
igs+0.359
ig+0.352
ric+0.343
 Majesty+0.310
ap+0.308
Neg logit contributions
Don-0.373
 meant-0.369
 said-0.345
 needed-0.326
 whispered-0.324
Token bath
Token ID7837
Feature activation-8.09e+01

Pos logit contributions
igs+0.341
ig+0.323
ric+0.321
 our+0.290
ap+0.287
Neg logit contributions
Don-0.445
 meant-0.439
 said-0.410
 needed-0.400
 whispered-0.392
Tokenro
Token ID305
Feature activation-1.14e+02

Pos logit contributions
igs+0.476
ig+0.463
ric+0.450
 our+0.431
ap+0.427
Neg logit contributions
Don-0.330
 meant-0.314
 said-0.271
 needed-0.264
 whispered-0.260
Tokenbes
Token ID12636
Feature activation-9.23e+01

Pos logit contributions
igs+0.568
ric+0.551
ig+0.539
ode+0.539
pieces+0.538
Neg logit contributions
 said-0.489
 meant-0.484
Don-0.476
 whispered-0.455
books-0.446
Token and
Token ID290
Feature activation-6.16e+01

Pos logit contributions
ap+0.325
 our+0.320
ig+0.319
igs+0.293
 your+0.280
Neg logit contributions
Don-0.569
 meant-0.486
Wow-0.486
Uh-0.481
 wondered-0.477
Token ran
Token ID4966
Feature activation-6.47e+01

Pos logit contributions
igs+0.354
ig+0.343
 our+0.333
ric+0.331
ap+0.317
Neg logit contributions
Don-0.245
 meant-0.233
books-0.200
jar-0.193
 needed-0.191
Token around
Token ID1088
Feature activation-6.40e+01

Pos logit contributions
igs+0.240
ig+0.229
ric+0.222
 our+0.211
ap+0.201
Neg logit contributions
Don-0.395
 meant-0.389
 said-0.356
 needed-0.351
 whispered-0.345
Token in
Token ID287
Feature activation-6.21e+01

Pos logit contributions
 our+0.222
igs+0.214
ig+0.204
 victory+0.201
ap+0.200
Neg logit contributions
Don-0.403
 meant-0.399
 needed-0.349
 wondered-0.346
 whispered-0.346
Token
Token ID129
Feature activation-1.25e+02

Pos logit contributions
Don+0.170
jar+0.164
Grand+0.156
books+0.149
 meant+0.145
Neg logit contributions
ap-0.269
ig-0.265
igs-0.241
ish-0.238
 our-0.235
Token
Token ID241
Feature activation-9.93e+01

Pos logit contributions
igs+0.539
ig+0.515
ric+0.477
 our+0.450
ap+0.438
Neg logit contributions
Don-0.715
 meant-0.692
 said-0.653
 She-0.622
 whispered-0.614
TokenI
Token ID40
Feature activation-7.97e+01

Pos logit contributions
ric+0.657
igs+0.654
ig+0.635
 our+0.616
ap+0.610
Neg logit contributions
 meant-0.287
Don-0.287
 said-0.245
 whispered-0.238
 needed-0.237
Tokenâ
Token ID22940
Feature activation-9.61e+01

Pos logit contributions
igs+0.352
ig+0.349
ric+0.340
ap+0.321
 our+0.319
Neg logit contributions
Don-0.415
 meant-0.397
 said-0.356
books-0.352
 needed-0.351
Token
Token ID26391
Feature activation-9.55e+01

Pos logit contributions
igs+0.424
ig+0.404
ric+0.398
 our+0.371
ap+0.359
Neg logit contributions
Don-0.525
 meant-0.516
 said-0.473
 needed-0.461
 whispered-0.453
Token
Token ID8151
Feature activation-1.13e+02

Pos logit contributions
igs+0.489
ig+0.457
ric+0.447
 our+0.431
 victory+0.419
Neg logit contributions
Don-0.460
 meant-0.442
 said-0.421
 needed-0.399
 She-0.389
Tokenm
Token ID76
Feature activation-8.72e+01

Pos logit contributions
ric+0.595
ig+0.553
ap+0.524
igs+0.508
chie+0.490
Neg logit contributions
Don-0.515
 meant-0.466
 wondered-0.443
 whispered-0.439
jar-0.436
Token sorry
Token ID7926
Feature activation-7.83e+01

Pos logit contributions
igs+0.437
ig+0.421
ric+0.419
 our+0.391
ap+0.385
Neg logit contributions
Don-0.425
 meant-0.410
 said-0.369
 needed-0.362
 whispered-0.356
Token Jane
Token ID12091
Feature activation-7.87e+01

Pos logit contributions
igs+0.390
ig+0.377
ric+0.370
 our+0.356
ap+0.348
Neg logit contributions
Don-0.378
 meant-0.364
 said-0.327
 needed-0.324
 whispered-0.318
Token.
Token ID13
Feature activation-5.66e+01

Pos logit contributions
ig+0.416
ric+0.387
 our+0.384
igs+0.380
ap+0.380
Neg logit contributions
Don-0.357
jar-0.306
books-0.299
Wow-0.295
 meant-0.292
Token I
Token ID314
Feature activation-6.11e+01

Pos logit contributions
ric+0.292
igs+0.288
ig+0.283
 our+0.279
ap+0.270
Neg logit contributions
 meant-0.247
Don-0.241
 needed-0.217
 said-0.214
 whispered-0.211
Token Everyday
Token ID48154
Feature activation-8.74e+01

Pos logit contributions
Don+0.227
jar+0.204
 meant+0.193
Grand+0.186
Dead+0.184
Neg logit contributions
ig-0.223
ric-0.221
 our-0.218
igs-0.216
ap-0.209
Token he
Token ID339
Feature activation-7.39e+01

Pos logit contributions
igs+0.505
ig+0.491
ric+0.480
 our+0.470
ap+0.453
Neg logit contributions
Don-0.350
 meant-0.334
 said-0.288
 needed-0.281
 whispered-0.278
Token would
Token ID561
Feature activation-6.95e+01

Pos logit contributions
ig+0.462
igs+0.453
ric+0.440
 our+0.432
 Majesty+0.432
Neg logit contributions
Don-0.247
 meant-0.208
jar-0.202
books-0.193
Wow-0.189
Token take
Token ID1011
Feature activation-6.72e+01

Pos logit contributions
igs+0.301
ig+0.284
ric+0.276
 our+0.250
ocol+0.240
Neg logit contributions
 meant-0.389
Don-0.380
 said-0.366
 needed-0.354
 whispered-0.347
Token the
Token ID262
Feature activation-6.94e+01

Pos logit contributions
ric+0.362
igs+0.341
ig+0.330
ode+0.313
chie+0.302
Neg logit contributions
 meant-0.307
Don-0.297
 said-0.293
 needed-0.268
 whispered-0.267
Token av
Token ID1196
Feature activation-1.13e+02

Pos logit contributions
igs+0.308
ig+0.298
ric+0.290
 our+0.275
 victory+0.267
Neg logit contributions
Don-0.374
 meant-0.363
 said-0.330
 needed-0.322
 whispered-0.318
Tokenoc
Token ID420
Feature activation-1.11e+02

Pos logit contributions
ric+0.669
latable+0.669
 our+0.642
igs+0.638
 Majesty+0.637
Neg logit contributions
 meant-0.380
Don-0.374
Today-0.366
 said-0.350
 needed-0.342
Tokenados
Token ID22484
Feature activation-8.04e+01

Pos logit contributions
latable+0.443
igs+0.421
 dive+0.406
 our+0.399
ig+0.385
Neg logit contributions
 meant-0.584
Don-0.548
 needed-0.543
 said-0.536
 She-0.512
Token and
Token ID290
Feature activation-6.00e+01

Pos logit contributions
igs+0.312
ig+0.301
ric+0.289
 our+0.282
ap+0.267
Neg logit contributions
Don-0.475
 meant-0.455
 said-0.413
 needed-0.407
 wondered-0.406
Token scatter
Token ID41058
Feature activation-6.73e+01

Pos logit contributions
ig+0.304
igs+0.299
ric+0.294
ap+0.266
latable+0.263
Neg logit contributions
 meant-0.297
Don-0.291
 said-0.282
 needed-0.276
 whispered-0.255
Token them
Token ID606
Feature activation-6.44e+01

Pos logit contributions
igs+0.257
ig+0.243
ric+0.239
 our+0.222
ap+0.213
Neg logit contributions
Don-0.404
 meant-0.397
 said-0.367
 needed-0.359
 whispered-0.354
Token't
Token ID470
Feature activation-8.29e+01

Pos logit contributions
Don+0.248
 meant+0.244
 said+0.224
 needed+0.217
 whispered+0.213
Neg logit contributions
igs-0.220
ig-0.210
ric-0.209
 our-0.191
ap-0.188
Token know
Token ID760
Feature activation-7.47e+01

Pos logit contributions
igs+0.443
ig+0.425
ric+0.414
latable+0.374
amental+0.368
Neg logit contributions
 meant-0.417
Don-0.393
 said-0.391
 needed-0.370
 whispered-0.358
Token the
Token ID262
Feature activation-7.28e+01

Pos logit contributions
igs+0.284
ig+0.273
ric+0.267
ap+0.235
chie+0.231
Neg logit contributions
 meant-0.454
Don-0.445
 said-0.425
 needed-0.410
 whispered-0.401
Token rh
Token ID9529
Feature activation-1.01e+02

Pos logit contributions
igs+0.359
ig+0.346
ric+0.343
 our+0.324
ap+0.313
Neg logit contributions
Don-0.353
 meant-0.343
 said-0.311
 needed-0.302
 whispered-0.298
Tokenin
Token ID259
Feature activation-9.01e+01

Pos logit contributions
igs+0.486
latable+0.465
 Majesty+0.449
ric+0.449
ig+0.434
Neg logit contributions
 meant-0.486
Don-0.478
 needed-0.458
 said-0.454
 whispered-0.403
Tokenoc
Token ID420
Feature activation-1.12e+02

Pos logit contributions
ig+0.315
ric+0.309
igs+0.305
 our+0.283
 victory+0.280
Neg logit contributions
Don-0.551
 meant-0.513
 She-0.478
jar-0.475
 whispered-0.471
Tokeneros
Token ID27498
Feature activation-8.38e+01

Pos logit contributions
igs+0.349
latable+0.292
ig+0.278
 our+0.271
ric+0.259
Neg logit contributions
 meant-0.725
Don-0.689
 needed-0.681
 said-0.681
 man-0.624
Token was
Token ID373
Feature activation-6.87e+01

Pos logit contributions
ig+0.465
 our+0.461
igs+0.453
ric+0.451
 victory+0.441
Neg logit contributions
Don-0.325
jar-0.275
Grand-0.267
Wow-0.263
books-0.262
Token wild
Token ID4295
Feature activation-7.18e+01

Pos logit contributions
igs+0.324
ric+0.312
ig+0.307
 our+0.304
 victory+0.288
Neg logit contributions
Don-0.345
 meant-0.324
 said-0.296
 whispered-0.287
books-0.286
Token.
Token ID13
Feature activation-5.01e+01

Pos logit contributions
igs+0.430
ig+0.412
ric+0.407
 our+0.383
ap+0.377
Neg logit contributions
 meant-0.284
Don-0.279
 said-0.248
 needed-0.240
 whispered-0.229
Token Can
Token ID1680
Feature activation-5.47e+01

Pos logit contributions
ric+0.264
 our+0.261
 my+0.248
ap+0.243
igs+0.240
Neg logit contributions
 meant-0.207
Don-0.193
 needed-0.175
chy-0.175
books-0.175
Token
Token ID129
Feature activation-1.25e+02

Pos logit contributions
 meant+0.183
Don+0.177
 said+0.158
 needed+0.155
 She+0.153
Neg logit contributions
igs-0.243
ric-0.236
ig-0.232
 Majesty-0.219
 our-0.218
Token
Token ID241
Feature activation-9.93e+01

Pos logit contributions
igs+0.539
ig+0.515
ric+0.477
 our+0.450
ap+0.438
Neg logit contributions
Don-0.715
 meant-0.692
 said-0.653
 She-0.622
 whispered-0.614
TokenThat
Token ID2504
Feature activation-7.53e+01

Pos logit contributions
ric+0.657
igs+0.654
ig+0.635
 our+0.616
ap+0.610
Neg logit contributions
 meant-0.287
Don-0.287
 said-0.245
 whispered-0.238
 needed-0.237
Tokenâ
Token ID22940
Feature activation-9.31e+01

Pos logit contributions
igs+0.413
ig+0.397
ric+0.389
 our+0.365
ap+0.360
Neg logit contributions
 meant-0.330
Don-0.328
 said-0.295
 needed-0.284
 whispered-0.275
Token
Token ID26391
Feature activation-9.31e+01

Pos logit contributions
igs+0.404
ig+0.390
ric+0.384
 our+0.354
ap+0.352
Neg logit contributions
Don-0.509
 meant-0.496
 said-0.457
 needed-0.447
 whispered-0.442
Token
Token ID8151
Feature activation-1.11e+02

Pos logit contributions
igs+0.481
ig+0.459
ric+0.451
 our+0.430
ap+0.414
Neg logit contributions
Don-0.436
 meant-0.424
 said-0.390
 needed-0.376
 whispered-0.364
Tokens
Token ID82
Feature activation-8.38e+01

Pos logit contributions
ric+0.585
ig+0.550
ap+0.514
igs+0.507
 our+0.476
Neg logit contributions
Don-0.518
 meant-0.455
jar-0.451
 whispered-0.450
 wondered-0.446
Token a
Token ID257
Feature activation-7.81e+01

Pos logit contributions
igs+0.428
ig+0.412
ric+0.394
 arrest+0.378
 our+0.371
Neg logit contributions
 meant-0.400
Don-0.391
 said-0.361
 needed-0.350
 She-0.333
Token special
Token ID2041
Feature activation-7.99e+01

Pos logit contributions
igs+0.401
ig+0.388
ric+0.381
 our+0.361
ap+0.352
Neg logit contributions
Don-0.365
 meant-0.357
 said-0.321
 needed-0.311
 whispered-0.306
Token necklace
Token ID38987
Feature activation-7.91e+01

Pos logit contributions
igs+0.371
ig+0.353
ric+0.350
 our+0.327
ap+0.318
Neg logit contributions
Don-0.412
 meant-0.407
 said-0.373
 needed-0.363
 whispered-0.355
Token,
Token ID11
Feature activation-6.18e+01

Pos logit contributions
ig+0.348
igs+0.345
 our+0.323
ric+0.316
ap+0.301
Neg logit contributions
Don-0.434
 meant-0.382
 wondered-0.353
Wow-0.350
jar-0.345
Token
Token ID129
Feature activation-1.25e+02

Pos logit contributions
 meant+0.287
Don+0.283
 needed+0.251
 whispered+0.246
 wondered+0.241
Neg logit contributions
igs-0.249
ric-0.241
ig-0.239
 our-0.230
ap-0.215
Token
Token ID241
Feature activation-9.93e+01

Pos logit contributions
igs+0.539
ig+0.515
ric+0.477
 our+0.450
ap+0.438
Neg logit contributions
Don-0.715
 meant-0.692
 said-0.653
 She-0.622
 whispered-0.614
TokenIt
Token ID1026
Feature activation-7.78e+01

Pos logit contributions
ric+0.657
igs+0.654
ig+0.635
 our+0.616
ap+0.610
Neg logit contributions
 meant-0.287
Don-0.287
 said-0.245
 whispered-0.238
 needed-0.237
Tokenâ
Token ID22940
Feature activation-9.33e+01

Pos logit contributions
igs+0.408
ig+0.394
ric+0.389
 our+0.369
ap+0.359
Neg logit contributions
Don-0.356
 meant-0.345
 said-0.311
 needed-0.302
 whispered-0.297
Token
Token ID26391
Feature activation-9.31e+01

Pos logit contributions
igs+0.416
ig+0.402
ric+0.398
 our+0.367
ap+0.364
Neg logit contributions
Don-0.498
 meant-0.489
 said-0.447
 needed-0.438
 whispered-0.431
Token
Token ID8151
Feature activation-1.11e+02

Pos logit contributions
igs+0.473
ig+0.448
ric+0.440
 our+0.420
 victory+0.404
Neg logit contributions
Don-0.444
 meant-0.432
 said-0.400
 needed-0.385
 whispered-0.374
Tokens
Token ID82
Feature activation-8.37e+01

Pos logit contributions
ric+0.595
ig+0.550
ap+0.509
igs+0.497
chie+0.471
Neg logit contributions
Don-0.512
 meant-0.458
 wondered-0.449
 whispered-0.441
jar-0.438
Token almost
Token ID2048
Feature activation-7.88e+01

Pos logit contributions
igs+0.404
ig+0.388
ric+0.383
 our+0.360
ap+0.350
Neg logit contributions
Don-0.418
 meant-0.410
 said-0.372
 needed-0.362
 whispered-0.356
Token just
Token ID655
Feature activation-7.53e+01

Pos logit contributions
igs+0.344
ig+0.336
ric+0.333
 our+0.330
ap+0.315
Neg logit contributions
Don-0.425
 meant-0.394
books-0.362
jar-0.353
Grand-0.350
Token as
Token ID355
Feature activation-6.71e+01

Pos logit contributions
igs+0.344
ig+0.333
ric+0.332
 our+0.316
ap+0.304
Neg logit contributions
Don-0.390
 meant-0.374
 said-0.336
 needed-0.330
books-0.328
Token beautiful
Token ID4950
Feature activation-7.43e+01

Pos logit contributions
igs+0.318
 our+0.298
ig+0.296
ric+0.285
 victory+0.275
Neg logit contributions
Don-0.337
 meant-0.325
books-0.287
 whispered-0.285
 said-0.285
Token
Token ID129
Feature activation-1.25e+02

Pos logit contributions
Don+0.136
 meant+0.132
 said+0.112
 needed+0.107
 whispered+0.103
Neg logit contributions
igs-0.304
ig-0.295
ric-0.292
 our-0.279
ap-0.274
Token
Token ID241
Feature activation-9.93e+01

Pos logit contributions
igs+0.539
ig+0.515
ric+0.477
 our+0.450
ap+0.438
Neg logit contributions
Don-0.715
 meant-0.692
 said-0.653
 She-0.622
 whispered-0.614
TokenIt
Token ID1026
Feature activation-7.78e+01

Pos logit contributions
ric+0.657
igs+0.654
ig+0.635
 our+0.616
ap+0.610
Neg logit contributions
 meant-0.287
Don-0.287
 said-0.245
 whispered-0.238
 needed-0.237
Tokenâ
Token ID22940
Feature activation-9.33e+01

Pos logit contributions
igs+0.408
ig+0.394
ric+0.389
 our+0.369
ap+0.359
Neg logit contributions
Don-0.356
 meant-0.345
 said-0.311
 needed-0.302
 whispered-0.297
Token
Token ID26391
Feature activation-9.31e+01

Pos logit contributions
igs+0.416
ig+0.402
ric+0.398
 our+0.367
ap+0.364
Neg logit contributions
Don-0.498
 meant-0.489
 said-0.447
 needed-0.438
 whispered-0.431
Token
Token ID8151
Feature activation-1.11e+02

Pos logit contributions
igs+0.473
ig+0.448
ric+0.440
 our+0.420
 victory+0.404
Neg logit contributions
Don-0.444
 meant-0.432
 said-0.400
 needed-0.385
 whispered-0.374
Tokens
Token ID82
Feature activation-8.37e+01

Pos logit contributions
ric+0.595
ig+0.550
ap+0.509
igs+0.497
chie+0.471
Neg logit contributions
Don-0.512
 meant-0.458
 wondered-0.449
 whispered-0.441
jar-0.438
Token a
Token ID257
Feature activation-7.78e+01

Pos logit contributions
igs+0.404
ig+0.388
ric+0.383
 our+0.360
ap+0.350
Neg logit contributions
Don-0.418
 meant-0.410
 said-0.372
 needed-0.362
 whispered-0.356
Token magic
Token ID5536
Feature activation-8.11e+01

Pos logit contributions
igs+0.391
ig+0.383
ric+0.374
 our+0.356
ap+0.345
Neg logit contributions
Don-0.367
 meant-0.360
 said-0.320
 needed-0.311
 whispered-0.308
Token book
Token ID1492
Feature activation-7.87e+01

Pos logit contributions
igs+0.363
ig+0.355
ric+0.339
 our+0.320
ap+0.312
Neg logit contributions
Don-0.441
 meant-0.421
 said-0.383
 needed-0.374
 whispered-0.369
Token!
Token ID0
Feature activation-5.45e+01

Pos logit contributions
ig+0.306
igs+0.301
 our+0.296
ap+0.283
 my+0.275
Neg logit contributions
Don-0.473
Wow-0.392
 wondered-0.376
 meant-0.372
Well-0.363
Token her
Token ID607
Feature activation-9.45e+01

Pos logit contributions
Don+0.177
 meant+0.170
 said+0.150
 needed+0.145
 whispered+0.142
Neg logit contributions
igs-0.256
ig-0.247
ric-0.244
 our-0.234
ap-0.229
Token mom
Token ID1995
Feature activation-8.10e+01

Pos logit contributions
igs+0.432
ig+0.416
ric+0.410
 our+0.387
ap+0.375
Neg logit contributions
Don-0.497
 meant-0.484
 said-0.436
 needed-0.426
 whispered-0.421
Tokenmy
Token ID1820
Feature activation-8.85e+01

Pos logit contributions
ric+0.426
ig+0.415
ap+0.403
 our+0.398
igs+0.395
Neg logit contributions
Don-0.359
 meant-0.311
books-0.300
Wow-0.299
jar-0.292
Token watched
Token ID7342
Feature activation-7.30e+01

Pos logit contributions
ric+0.446
ig+0.439
ap+0.430
 our+0.428
igs+0.421
Neg logit contributions
Don-0.398
Wow-0.345
 meant-0.332
books-0.332
jar-0.329
Token in
Token ID287
Feature activation-6.94e+01

Pos logit contributions
igs+0.326
ig+0.308
ric+0.304
 our+0.303
ap+0.289
Neg logit contributions
Don-0.391
 meant-0.380
 needed-0.339
 said-0.339
 whispered-0.333
Token amaz
Token ID40642
Feature activation-1.11e+02

Pos logit contributions
igs+0.290
ig+0.276
ric+0.276
 our+0.268
ap+0.251
Neg logit contributions
Don-0.387
 meant-0.379
 said-0.343
 needed-0.340
 whispered-0.334
Tokenement
Token ID972
Feature activation-8.29e+01

Pos logit contributions
ric+0.762
igs+0.739
 arrest+0.704
 dive+0.696
ode+0.679
Neg logit contributions
 meant-0.340
 said-0.333
Don-0.279
 She-0.275
 answered-0.254
Token as
Token ID355
Feature activation-6.66e+01

Pos logit contributions
igs+0.469
ig+0.452
ric+0.447
 our+0.422
ap+0.412
Neg logit contributions
Don-0.344
 meant-0.339
 said-0.304
 needed-0.292
 whispered-0.285
Token the
Token ID262
Feature activation-6.89e+01

Pos logit contributions
igs+0.354
ig+0.338
 our+0.336
ric+0.328
 victory+0.320
Neg logit contributions
Don-0.292
 meant-0.279
 needed-0.242
 said-0.242
 whispered-0.239
Token machine
Token ID4572
Feature activation-7.35e+01

Pos logit contributions
igs+0.314
ig+0.305
ric+0.297
 our+0.285
 victory+0.283
Neg logit contributions
Don-0.356
 meant-0.343
 said-0.304
 needed-0.303
 whispered-0.298
Token flew
Token ID13112
Feature activation-6.71e+01

Pos logit contributions
ig+0.364
ap+0.354
 our+0.351
ric+0.338
 victory+0.337
Neg logit contributions
Don-0.335
Wow-0.285
jar-0.271
Grand-0.267
Today-0.264
Token
Token ID129
Feature activation-1.15e+02

Pos logit contributions
igs+0.754
ig+0.705
ric+0.684
 our+0.671
ode+0.649
Neg logit contributions
Don-0.480
 meant-0.457
 said-0.436
 needed-0.401
 She-0.392
Token
Token ID241
Feature activation-9.51e+01

Pos logit contributions
igs+0.493
ig+0.471
ric+0.470
 our+0.434
ap+0.422
Neg logit contributions
Don-0.632
 meant-0.623
 said-0.568
 needed-0.561
 whispered-0.547
TokenThat
Token ID2504
Feature activation-7.52e+01

Pos logit contributions
ric+0.607
igs+0.587
ig+0.573
ap+0.557
 our+0.556
Neg logit contributions
 meant-0.300
Don-0.288
 said-0.259
 whispered-0.258
 needed-0.250
Tokenâ
Token ID22940
Feature activation-9.26e+01

Pos logit contributions
igs+0.417
ig+0.401
ric+0.396
 our+0.374
ap+0.366
Neg logit contributions
Don-0.323
 meant-0.320
 said-0.286
 needed-0.276
 whispered-0.268
Token
Token ID26391
Feature activation-9.26e+01

Pos logit contributions
igs+0.402
ig+0.390
ric+0.385
 our+0.352
ap+0.352
Neg logit contributions
Don-0.502
 meant-0.490
 said-0.451
 needed-0.442
 whispered-0.437
Token
Token ID8151
Feature activation-1.11e+02

Pos logit contributions
igs+0.465
ig+0.446
ric+0.439
 our+0.415
ap+0.403
Neg logit contributions
Don-0.445
 meant-0.435
 said-0.395
 needed-0.384
 whispered-0.375
Tokens
Token ID82
Feature activation-8.31e+01

Pos logit contributions
ric+0.594
ig+0.555
ap+0.521
igs+0.521
 our+0.486
Neg logit contributions
Don-0.498
 meant-0.444
 whispered-0.443
 wondered-0.433
jar-0.432
Token right
Token ID826
Feature activation-7.94e+01

Pos logit contributions
igs+0.422
ig+0.407
ric+0.385
 arrest+0.374
 Majesty+0.365
Neg logit contributions
 meant-0.401
Don-0.391
 said-0.359
 needed-0.351
 She-0.335
Token,
Token ID11
Feature activation-6.66e+01

Pos logit contributions
igs+0.364
ig+0.355
ric+0.351
 our+0.334
ap+0.321
Neg logit contributions
Don-0.415
 meant-0.392
 said-0.358
 needed-0.354
 whispered-0.352
Token Emma
Token ID18966
Feature activation-7.28e+01

Pos logit contributions
igs+0.306
ig+0.293
ric+0.289
 our+0.271
ap+0.263
Neg logit contributions
Don-0.347
 meant-0.340
 said-0.310
 needed-0.303
 whispered-0.297
Token.
Token ID13
Feature activation-5.51e+01

Pos logit contributions
ig+0.330
igs+0.320
ric+0.316
 our+0.314
ap+0.309
Neg logit contributions
Don-0.372
jar-0.322
 meant-0.318
Wow-0.312
books-0.309
Token
Token ID129
Feature activation-1.15e+02

Pos logit contributions
igs+0.754
ig+0.705
ric+0.684
 our+0.671
ode+0.649
Neg logit contributions
Don-0.480
 meant-0.457
 said-0.436
 needed-0.401
 She-0.392
Token
Token ID241
Feature activation-9.51e+01

Pos logit contributions
igs+0.493
ig+0.471
ric+0.470
 our+0.434
ap+0.422
Neg logit contributions
Don-0.632
 meant-0.623
 said-0.568
 needed-0.561
 whispered-0.547
TokenThat
Token ID2504
Feature activation-7.52e+01

Pos logit contributions
ric+0.607
igs+0.587
ig+0.573
ap+0.557
 our+0.556
Neg logit contributions
 meant-0.300
Don-0.288
 said-0.259
 whispered-0.258
 needed-0.250
Tokenâ
Token ID22940
Feature activation-9.26e+01

Pos logit contributions
igs+0.417
ig+0.401
ric+0.396
 our+0.374
ap+0.366
Neg logit contributions
Don-0.323
 meant-0.320
 said-0.286
 needed-0.276
 whispered-0.268
Token
Token ID26391
Feature activation-9.26e+01

Pos logit contributions
igs+0.402
ig+0.390
ric+0.385
 our+0.352
ap+0.352
Neg logit contributions
Don-0.502
 meant-0.490
 said-0.451
 needed-0.442
 whispered-0.437
Token
Token ID8151
Feature activation-1.11e+02

Pos logit contributions
igs+0.465
ig+0.446
ric+0.439
 our+0.415
ap+0.403
Neg logit contributions
Don-0.445
 meant-0.435
 said-0.395
 needed-0.384
 whispered-0.375
Tokens
Token ID82
Feature activation-8.31e+01

Pos logit contributions
ric+0.594
ig+0.555
ap+0.521
igs+0.521
 our+0.486
Neg logit contributions
Don-0.498
 meant-0.444
 whispered-0.443
 wondered-0.433
jar-0.432
Token the
Token ID262
Feature activation-7.89e+01

Pos logit contributions
igs+0.422
ig+0.407
ric+0.385
 arrest+0.374
 Majesty+0.365
Neg logit contributions
 meant-0.401
Don-0.391
 said-0.359
 needed-0.351
 She-0.335
Token gas
Token ID3623
Feature activation-8.65e+01

Pos logit contributions
igs+0.388
ig+0.376
ric+0.374
 our+0.349
ap+0.340
Neg logit contributions
Don-0.384
 meant-0.375
 said-0.337
 needed-0.329
 whispered-0.326
Token from
Token ID422
Feature activation-6.98e+01

Pos logit contributions
igs+0.428
ric+0.416
ig+0.416
 our+0.390
ap+0.380
Neg logit contributions
Don-0.425
 meant-0.403
 said-0.362
 whispered-0.352
 needed-0.351
Token the
Token ID262
Feature activation-7.00e+01

Pos logit contributions
igs+0.273
ig+0.260
ric+0.254
 our+0.232
ap+0.227
Neg logit contributions
Don-0.413
 meant-0.407
 said-0.377
 needed-0.368
 whispered-0.361
Token he
Token ID339
Feature activation-7.39e+01

Pos logit contributions
igs+0.505
ig+0.491
ric+0.480
 our+0.470
ap+0.453
Neg logit contributions
Don-0.350
 meant-0.334
 said-0.288
 needed-0.281
 whispered-0.278
Token would
Token ID561
Feature activation-6.95e+01

Pos logit contributions
ig+0.462
igs+0.453
ric+0.440
 our+0.432
 Majesty+0.432
Neg logit contributions
Don-0.247
 meant-0.208
jar-0.202
books-0.193
Wow-0.189
Token take
Token ID1011
Feature activation-6.72e+01

Pos logit contributions
igs+0.301
ig+0.284
ric+0.276
 our+0.250
ocol+0.240
Neg logit contributions
 meant-0.389
Don-0.380
 said-0.366
 needed-0.354
 whispered-0.347
Token the
Token ID262
Feature activation-6.94e+01

Pos logit contributions
ric+0.362
igs+0.341
ig+0.330
ode+0.313
chie+0.302
Neg logit contributions
 meant-0.307
Don-0.297
 said-0.293
 needed-0.268
 whispered-0.267
Token av
Token ID1196
Feature activation-1.13e+02

Pos logit contributions
igs+0.308
ig+0.298
ric+0.290
 our+0.275
 victory+0.267
Neg logit contributions
Don-0.374
 meant-0.363
 said-0.330
 needed-0.322
 whispered-0.318
Tokenoc
Token ID420
Feature activation-1.11e+02

Pos logit contributions
ric+0.669
latable+0.669
 our+0.642
igs+0.638
 Majesty+0.637
Neg logit contributions
 meant-0.380
Don-0.374
Today-0.366
 said-0.350
 needed-0.342
Tokenados
Token ID22484
Feature activation-8.04e+01

Pos logit contributions
latable+0.443
igs+0.421
 dive+0.406
 our+0.399
ig+0.385
Neg logit contributions
 meant-0.584
Don-0.548
 needed-0.543
 said-0.536
 She-0.512
Token and
Token ID290
Feature activation-6.00e+01

Pos logit contributions
igs+0.312
ig+0.301
ric+0.289
 our+0.282
ap+0.267
Neg logit contributions
Don-0.475
 meant-0.455
 said-0.413
 needed-0.407
 wondered-0.406
Token scatter
Token ID41058
Feature activation-6.73e+01

Pos logit contributions
ig+0.304
igs+0.299
ric+0.294
ap+0.266
latable+0.263
Neg logit contributions
 meant-0.297
Don-0.291
 said-0.282
 needed-0.276
 whispered-0.255
Token them
Token ID606
Feature activation-6.44e+01

Pos logit contributions
igs+0.257
ig+0.243
ric+0.239
 our+0.222
ap+0.213
Neg logit contributions
Don-0.404
 meant-0.397
 said-0.367
 needed-0.359
 whispered-0.354
Token around
Token ID1088
Feature activation-6.26e+01

Pos logit contributions
igs+0.254
ig+0.243
 our+0.227
ric+0.226
 victory+0.217
Neg logit contributions
Don-0.392
 meant-0.369
 said-0.331
Grand-0.330
jar-0.330
Token
Token ID129
Feature activation-1.25e+02

Pos logit contributions
Don+0.271
 meant+0.254
 wondered+0.242
 whispered+0.237
 widened+0.236
Neg logit contributions
ig-0.199
 our-0.197
 victory-0.185
ap-0.174
 your-0.170
Token
Token ID241
Feature activation-9.93e+01

Pos logit contributions
igs+0.539
ig+0.515
ric+0.477
 our+0.450
ap+0.438
Neg logit contributions
Don-0.715
 meant-0.692
 said-0.653
 She-0.622
 whispered-0.614
TokenLet
Token ID5756
Feature activation-7.52e+01

Pos logit contributions
ric+0.657
igs+0.654
ig+0.635
 our+0.616
ap+0.610
Neg logit contributions
 meant-0.287
Don-0.287
 said-0.245
 whispered-0.238
 needed-0.237
Tokenâ
Token ID22940
Feature activation-9.07e+01

Pos logit contributions
igs+0.459
ric+0.432
ig+0.420
ocol+0.386
 Majesty+0.385
Neg logit contributions
 meant-0.333
 said-0.318
Don-0.312
 needed-0.277
 whispered-0.271
Token
Token ID26391
Feature activation-9.10e+01

Pos logit contributions
igs+0.387
ig+0.358
ric+0.348
 our+0.334
 Majesty+0.320
Neg logit contributions
Don-0.516
 meant-0.512
 said-0.471
 needed-0.454
 whispered-0.448
Token
Token ID8151
Feature activation-1.10e+02

Pos logit contributions
igs+0.534
ig+0.502
ric+0.487
 our+0.476
ocol+0.468
Neg logit contributions
Don-0.368
 meant-0.353
 said-0.340
 needed-0.314
 She-0.308
Tokens
Token ID82
Feature activation-8.22e+01

Pos logit contributions
ric+0.594
ig+0.570
igs+0.536
ap+0.529
 our+0.507
Neg logit contributions
Don-0.476
 meant-0.442
 wondered-0.409
 whispered-0.403
 faded-0.394
Token go
Token ID467
Feature activation-7.51e+01

Pos logit contributions
igs+0.395
ig+0.364
ric+0.359
ocol+0.351
latable+0.345
Neg logit contributions
 meant-0.435
 said-0.427
 needed-0.386
Don-0.382
 She-0.378
Token find
Token ID1064
Feature activation-7.44e+01

Pos logit contributions
igs+0.393
ric+0.386
 Majesty+0.358
ig+0.357
ode+0.355
Neg logit contributions
 said-0.343
 meant-0.338
Don-0.327
 whispered-0.305
 She-0.302
Token something
Token ID1223
Feature activation-7.03e+01

Pos logit contributions
ric+0.351
igs+0.336
ig+0.329
ap+0.302
ode+0.297
Neg logit contributions
 meant-0.401
 said-0.392
Don-0.383
 whispered-0.353
 needed-0.348
Token else
Token ID2073
Feature activation-7.68e+01

Pos logit contributions
igs+0.340
ig+0.321
 our+0.317
ric+0.312
 victory+0.293
Neg logit contributions
Don-0.357
 meant-0.329
 said-0.296
 whispered-0.293
jar-0.291
Token
Token ID129
Feature activation-1.15e+02

Pos logit contributions
igs+0.754
ig+0.705
ric+0.684
 our+0.671
ode+0.649
Neg logit contributions
Don-0.480
 meant-0.457
 said-0.436
 needed-0.401
 She-0.392
Token
Token ID241
Feature activation-9.51e+01

Pos logit contributions
igs+0.493
ig+0.471
ric+0.470
 our+0.434
ap+0.422
Neg logit contributions
Don-0.632
 meant-0.623
 said-0.568
 needed-0.561
 whispered-0.547
TokenLet
Token ID5756
Feature activation-7.56e+01

Pos logit contributions
ric+0.607
igs+0.587
ig+0.573
ap+0.557
 our+0.556
Neg logit contributions
 meant-0.300
Don-0.288
 said-0.259
 whispered-0.258
 needed-0.250
Tokenâ
Token ID22940
Feature activation-9.03e+01

Pos logit contributions
igs+0.466
ric+0.441
ig+0.425
ocol+0.391
 Majesty+0.381
Neg logit contributions
 meant-0.339
 said-0.322
Don-0.319
 needed-0.278
 wanted-0.276
Token
Token ID26391
Feature activation-9.07e+01

Pos logit contributions
igs+0.404
ig+0.380
ric+0.371
 our+0.352
 Majesty+0.336
Neg logit contributions
Don-0.492
 meant-0.485
 said-0.445
 needed-0.431
 whispered-0.424
Token
Token ID8151
Feature activation-1.09e+02

Pos logit contributions
igs+0.495
ig+0.465
ric+0.452
 our+0.439
 victory+0.424
Neg logit contributions
Don-0.404
 meant-0.390
 said-0.368
 needed-0.349
 She-0.340
Tokens
Token ID82
Feature activation-8.17e+01

Pos logit contributions
ric+0.596
ig+0.571
igs+0.543
ap+0.535
 our+0.512
Neg logit contributions
Don-0.467
 meant-0.436
 wondered-0.401
 whispered-0.401
jar-0.384
Token not
Token ID407
Feature activation-7.35e+01

Pos logit contributions
igs+0.393
ig+0.360
ocol+0.352
ric+0.349
latable+0.335
Neg logit contributions
 meant-0.437
 said-0.423
 She-0.386
 needed-0.384
Don-0.381
Token think
Token ID892
Feature activation-7.28e+01

Pos logit contributions
igs+0.390
ig+0.371
ric+0.346
ocol+0.324
latable+0.323
Neg logit contributions
 meant-0.374
Don-0.354
 said-0.335
 needed-0.327
 wanted-0.312
Token about
Token ID546
Feature activation-6.53e+01

Pos logit contributions
igs+0.326
ig+0.312
ric+0.308
 our+0.286
ap+0.279
Neg logit contributions
Don-0.388
 meant-0.382
 said-0.350
 needed-0.341
 whispered-0.334
Token the
Token ID262
Feature activation-6.76e+01

Pos logit contributions
igs+0.276
ig+0.274
ric+0.250
chie+0.234
ap+0.233
Neg logit contributions
 meant-0.369
Don-0.356
 said-0.346
 needed-0.324
 whispered-0.322
Token
Token ID129
Feature activation-1.15e+02

Pos logit contributions
igs+0.754
ig+0.705
ric+0.684
 our+0.671
ode+0.649
Neg logit contributions
Don-0.480
 meant-0.457
 said-0.436
 needed-0.401
 She-0.392
Token
Token ID241
Feature activation-9.51e+01

Pos logit contributions
igs+0.493
ig+0.471
ric+0.470
 our+0.434
ap+0.422
Neg logit contributions
Don-0.632
 meant-0.623
 said-0.568
 needed-0.561
 whispered-0.547
TokenLet
Token ID5756
Feature activation-7.56e+01

Pos logit contributions
ric+0.607
igs+0.587
ig+0.573
ap+0.557
 our+0.556
Neg logit contributions
 meant-0.300
Don-0.288
 said-0.259
 whispered-0.258
 needed-0.250
Tokenâ
Token ID22940
Feature activation-9.03e+01

Pos logit contributions
igs+0.466
ric+0.441
ig+0.425
ocol+0.391
 Majesty+0.381
Neg logit contributions
 meant-0.339
 said-0.322
Don-0.319
 needed-0.278
 wanted-0.276
Token
Token ID26391
Feature activation-9.07e+01

Pos logit contributions
igs+0.404
ig+0.380
ric+0.371
 our+0.352
 Majesty+0.336
Neg logit contributions
Don-0.492
 meant-0.485
 said-0.445
 needed-0.431
 whispered-0.424
Token
Token ID8151
Feature activation-1.09e+02

Pos logit contributions
igs+0.495
ig+0.465
ric+0.452
 our+0.439
 victory+0.424
Neg logit contributions
Don-0.404
 meant-0.390
 said-0.368
 needed-0.349
 She-0.340
Tokens
Token ID82
Feature activation-8.17e+01

Pos logit contributions
ric+0.596
ig+0.571
igs+0.543
ap+0.535
 our+0.512
Neg logit contributions
Don-0.467
 meant-0.436
 wondered-0.401
 whispered-0.401
jar-0.384
Token explore
Token ID7301
Feature activation-7.95e+01

Pos logit contributions
igs+0.393
ig+0.360
ocol+0.352
ric+0.349
latable+0.335
Neg logit contributions
 meant-0.437
 said-0.423
 She-0.386
 needed-0.384
Don-0.381
Token and
Token ID290
Feature activation-6.17e+01

Pos logit contributions
ric+0.366
igs+0.365
ig+0.353
 Majesty+0.336
 arrest+0.328
Neg logit contributions
 meant-0.401
Don-0.400
 said-0.383
 whispered-0.353
 needed-0.353
Token have
Token ID423
Feature activation-6.41e+01

Pos logit contributions
igs+0.350
ric+0.331
ig+0.326
ocol+0.303
latable+0.297
Neg logit contributions
 said-0.288
 meant-0.280
Don-0.274
 needed-0.257
 whispered-0.254
Token some
Token ID617
Feature activation-6.59e+01

Pos logit contributions
igs+0.328
ric+0.328
ig+0.326
ap+0.295
ocol+0.284
Neg logit contributions
Don-0.310
 meant-0.308
 said-0.293
 needed-0.273
 whispered-0.269
Token
Token ID129
Feature activation-1.15e+02

Pos logit contributions
igs+0.754
ig+0.705
ric+0.684
 our+0.671
ode+0.649
Neg logit contributions
Don-0.480
 meant-0.457
 said-0.436
 needed-0.401
 She-0.392
Token
Token ID241
Feature activation-9.51e+01

Pos logit contributions
igs+0.493
ig+0.471
ric+0.470
 our+0.434
ap+0.422
Neg logit contributions
Don-0.632
 meant-0.623
 said-0.568
 needed-0.561
 whispered-0.547
TokenLet
Token ID5756
Feature activation-7.56e+01

Pos logit contributions
ric+0.607
igs+0.587
ig+0.573
ap+0.557
 our+0.556
Neg logit contributions
 meant-0.300
Don-0.288
 said-0.259
 whispered-0.258
 needed-0.250
Tokenâ
Token ID22940
Feature activation-9.03e+01

Pos logit contributions
igs+0.466
ric+0.441
ig+0.425
ocol+0.391
 Majesty+0.381
Neg logit contributions
 meant-0.339
 said-0.322
Don-0.319
 needed-0.278
 wanted-0.276
Token
Token ID26391
Feature activation-9.07e+01

Pos logit contributions
igs+0.404
ig+0.380
ric+0.371
 our+0.352
 Majesty+0.336
Neg logit contributions
Don-0.492
 meant-0.485
 said-0.445
 needed-0.431
 whispered-0.424
Token
Token ID8151
Feature activation-1.09e+02

Pos logit contributions
igs+0.495
ig+0.465
ric+0.452
 our+0.439
 victory+0.424
Neg logit contributions
Don-0.404
 meant-0.390
 said-0.368
 needed-0.349
 She-0.340
Tokens
Token ID82
Feature activation-8.17e+01

Pos logit contributions
ric+0.596
ig+0.571
igs+0.543
ap+0.535
 our+0.512
Neg logit contributions
Don-0.467
 meant-0.436
 wondered-0.401
 whispered-0.401
jar-0.384
Token play
Token ID711
Feature activation-7.72e+01

Pos logit contributions
igs+0.393
ig+0.360
ocol+0.352
ric+0.349
latable+0.335
Neg logit contributions
 meant-0.437
 said-0.423
 She-0.386
 needed-0.384
Don-0.381
Token a
Token ID257
Feature activation-7.31e+01

Pos logit contributions
igs+0.368
ric+0.364
ig+0.344
 Majesty+0.321
 arrest+0.319
Neg logit contributions
Don-0.386
 meant-0.376
 said-0.363
 whispered-0.332
 needed-0.328
Token joke
Token ID9707
Feature activation-7.63e+01

Pos logit contributions
igs+0.371
ric+0.354
ig+0.346
 our+0.322
 arrest+0.321
Neg logit contributions
Don-0.346
 meant-0.334
 said-0.315
 needed-0.302
 whispered-0.295
Token.
Token ID13
Feature activation-5.28e+01

Pos logit contributions
 our+0.286
ig+0.280
ap+0.276
igs+0.275
 victory+0.248
Neg logit contributions
Don-0.462
 wondered-0.395
jar-0.391
 meant-0.390
 needed-0.388
Token
Token ID129
Feature activation-1.15e+02

Pos logit contributions
igs+0.754
ig+0.705
ric+0.684
 our+0.671
ode+0.649
Neg logit contributions
Don-0.480
 meant-0.457
 said-0.436
 needed-0.401
 She-0.392
Token
Token ID241
Feature activation-9.51e+01

Pos logit contributions
igs+0.493
ig+0.471
ric+0.470
 our+0.434
ap+0.422
Neg logit contributions
Don-0.632
 meant-0.623
 said-0.568
 needed-0.561
 whispered-0.547
TokenLet
Token ID5756
Feature activation-7.56e+01

Pos logit contributions
ric+0.607
igs+0.587
ig+0.573
ap+0.557
 our+0.556
Neg logit contributions
 meant-0.300
Don-0.288
 said-0.259
 whispered-0.258
 needed-0.250
Tokenâ
Token ID22940
Feature activation-9.03e+01

Pos logit contributions
igs+0.466
ric+0.441
ig+0.425
ocol+0.391
 Majesty+0.381
Neg logit contributions
 meant-0.339
 said-0.322
Don-0.319
 needed-0.278
 wanted-0.276
Token
Token ID26391
Feature activation-9.07e+01

Pos logit contributions
igs+0.404
ig+0.380
ric+0.371
 our+0.352
 Majesty+0.336
Neg logit contributions
Don-0.492
 meant-0.485
 said-0.445
 needed-0.431
 whispered-0.424
Token
Token ID8151
Feature activation-1.09e+02

Pos logit contributions
igs+0.495
ig+0.465
ric+0.452
 our+0.439
 victory+0.424
Neg logit contributions
Don-0.404
 meant-0.390
 said-0.368
 needed-0.349
 She-0.340
Tokens
Token ID82
Feature activation-8.17e+01

Pos logit contributions
ric+0.596
ig+0.571
igs+0.543
ap+0.535
 our+0.512
Neg logit contributions
Don-0.467
 meant-0.436
 wondered-0.401
 whispered-0.401
jar-0.384
Token have
Token ID423
Feature activation-7.38e+01

Pos logit contributions
igs+0.393
ig+0.360
ocol+0.352
ric+0.349
latable+0.335
Neg logit contributions
 meant-0.437
 said-0.423
 She-0.386
 needed-0.384
Don-0.381
Token a
Token ID257
Feature activation-7.31e+01

Pos logit contributions
ric+0.357
igs+0.355
ig+0.352
ap+0.310
 Majesty+0.300
Neg logit contributions
 meant-0.379
Don-0.377
 said-0.369
 needed-0.339
 whispered-0.335
Token stretching
Token ID20880
Feature activation-7.88e+01

Pos logit contributions
igs+0.364
ric+0.346
ig+0.339
 our+0.313
 arrest+0.310
Neg logit contributions
Don-0.355
 meant-0.350
 said-0.326
 needed-0.315
 whispered-0.307
Token race
Token ID3234
Feature activation-7.27e+01

Pos logit contributions
igs+0.372
ric+0.349
ig+0.336
 Majesty+0.332
 arrest+0.330
Neg logit contributions
 meant-0.394
Don-0.382
 said-0.370
 needed-0.353
 whispered-0.340
Token
Token ID241
Feature activation-9.93e+01

Pos logit contributions
igs+0.539
ig+0.515
ric+0.477
 our+0.450
ap+0.438
Neg logit contributions
Don-0.715
 meant-0.692
 said-0.653
 She-0.622
 whispered-0.614
TokenPlease
Token ID5492
Feature activation-8.23e+01

Pos logit contributions
ric+0.657
igs+0.654
ig+0.635
 our+0.616
ap+0.610
Neg logit contributions
 meant-0.287
Don-0.287
 said-0.245
 whispered-0.238
 needed-0.237
Token don
Token ID836
Feature activation-8.26e+01

Pos logit contributions
igs+0.363
ig+0.343
ric+0.336
 our+0.308
 Majesty+0.300
Neg logit contributions
Don-0.450
 meant-0.447
 said-0.416
 needed-0.401
 whispered-0.393
Tokenâ
Token ID22940
Feature activation-9.23e+01

Pos logit contributions
igs+0.470
ig+0.463
ric+0.457
 Majesty+0.453
ode+0.421
Neg logit contributions
 meant-0.360
Don-0.350
 said-0.322
 She-0.313
 needed-0.299
Token
Token ID26391
Feature activation-9.14e+01

Pos logit contributions
igs+0.418
ig+0.398
ric+0.392
 our+0.367
ap+0.354
Neg logit contributions
Don-0.494
 meant-0.486
 said-0.445
 needed-0.433
 whispered-0.425
Token
Token ID8151
Feature activation-1.09e+02

Pos logit contributions
igs+0.492
ig+0.472
ric+0.465
 our+0.441
ap+0.428
Neg logit contributions
Don-0.410
 meant-0.400
 said-0.362
 needed-0.350
 whispered-0.341
Tokent
Token ID83
Feature activation-8.11e+01

Pos logit contributions
ric+0.574
ig+0.561
ap+0.532
igs+0.524
 our+0.507
Neg logit contributions
Don-0.480
 meant-0.439
 wondered-0.420
 whispered-0.411
jar-0.400
Token eat
Token ID4483
Feature activation-8.10e+01

Pos logit contributions
igs+0.352
ig+0.337
ric+0.333
 our+0.311
ap+0.302
Neg logit contributions
Don-0.446
 meant-0.434
 said-0.398
 needed-0.389
 whispered-0.384
Token me
Token ID502
Feature activation-7.79e+01

Pos logit contributions
igs+0.300
ig+0.289
ric+0.287
 Majesty+0.256
ap+0.254
Neg logit contributions
 meant-0.488
Don-0.485
 said-0.456
 needed-0.437
 whispered-0.428
Token,
Token ID11
Feature activation-6.38e+01

Pos logit contributions
igs+0.359
ig+0.342
ric+0.339
 our+0.315
ap+0.306
Neg logit contributions
Don-0.405
 meant-0.400
 said-0.367
 needed-0.357
 whispered-0.350
Tokenâ
Token ID22940
Feature activation-7.80e+01

Pos logit contributions
igs+0.311
ig+0.299
ric+0.294
 our+0.278
ap+0.270
Neg logit contributions
Don-0.315
 meant-0.309
 said-0.280
 needed-0.273
 whispered-0.267
Token
Token ID129
Feature activation-1.15e+02

Pos logit contributions
igs+0.754
ig+0.705
ric+0.684
 our+0.671
ode+0.649
Neg logit contributions
Don-0.480
 meant-0.457
 said-0.436
 needed-0.401
 She-0.392
Token
Token ID241
Feature activation-9.51e+01

Pos logit contributions
igs+0.493
ig+0.471
ric+0.470
 our+0.434
ap+0.422
Neg logit contributions
Don-0.632
 meant-0.623
 said-0.568
 needed-0.561
 whispered-0.547
TokenWho
Token ID8241
Feature activation-7.65e+01

Pos logit contributions
ric+0.607
igs+0.587
ig+0.573
ap+0.557
 our+0.556
Neg logit contributions
 meant-0.300
Don-0.288
 said-0.259
 whispered-0.258
 needed-0.250
Tokenâ
Token ID22940
Feature activation-8.98e+01

Pos logit contributions
igs+0.445
ig+0.426
ric+0.423
 our+0.394
 Majesty+0.386
Neg logit contributions
 meant-0.313
Don-0.309
 said-0.280
 needed-0.269
 whispered-0.259
Token
Token ID26391
Feature activation-9.07e+01

Pos logit contributions
igs+0.372
ig+0.357
ric+0.353
 our+0.324
ap+0.318
Neg logit contributions
Don-0.510
 meant-0.500
 said-0.460
 needed-0.450
 whispered-0.443
Token
Token ID8151
Feature activation-1.09e+02

Pos logit contributions
igs+0.481
ig+0.464
ric+0.460
 our+0.433
ap+0.425
Neg logit contributions
Don-0.408
 meant-0.399
 said-0.357
 needed-0.347
 whispered-0.340
Tokens
Token ID82
Feature activation-8.19e+01

Pos logit contributions
ric+0.615
ig+0.567
igs+0.541
ap+0.535
 our+0.506
Neg logit contributions
Don-0.466
 meant-0.407
 whispered-0.390
jar-0.387
 wondered-0.387
Token there
Token ID612
Feature activation-6.95e+01

Pos logit contributions
igs+0.359
ig+0.339
ric+0.323
 our+0.299
 arrest+0.292
Neg logit contributions
 meant-0.460
Don-0.444
 said-0.425
 needed-0.411
 whispered-0.396
Token?
Token ID30
Feature activation-5.91e+01

Pos logit contributions
ric+0.246
igs+0.240
ig+0.239
 our+0.232
ap+0.220
Neg logit contributions
Don-0.434
 meant-0.399
 whispered-0.371
 wondered-0.365
 needed-0.365
Tokenâ
Token ID22940
Feature activation-7.45e+01

Pos logit contributions
ric+0.305
igs+0.295
ig+0.292
 our+0.283
ap+0.279
Neg logit contributions
 meant-0.253
Don-0.249
 said-0.223
 needed-0.219
 whispered-0.218
Token
Token ID26391
Feature activation-7.50e+01

Pos logit contributions
ric+0.285
ig+0.282
igs+0.273
ap+0.249
chie+0.232
Neg logit contributions
Don-0.442
 meant-0.421
 needed-0.392
 said-0.383
jar-0.378
Token like
Token ID588
Feature activation+3.58e+01

Pos logit contributions
 meant+0.199
Don+0.197
 said+0.182
 needed+0.177
 whispered+0.174
Neg logit contributions
igs-0.168
ig-0.162
ric-0.159
 our-0.144
 Majesty-0.143
Token to
Token ID284
Feature activation+3.54e+01

Pos logit contributions
Don+0.216
 meant+0.212
 said+0.196
 needed+0.191
 whispered+0.189
Neg logit contributions
igs-0.137
ig-0.130
ric-0.128
 our-0.119
ap-0.114
Token organize
Token ID16481
Feature activation+3.41e+01

Pos logit contributions
 meant+0.214
Don+0.205
 said+0.203
 needed+0.193
 whispered+0.191
Neg logit contributions
igs-0.136
ig-0.135
ric-0.128
 Majesty-0.111
ode-0.105
Token them
Token ID606
Feature activation+3.39e+01

Pos logit contributions
 meant+0.174
Don+0.169
 said+0.160
 needed+0.153
 whispered+0.150
Neg logit contributions
igs-0.160
ig-0.157
ric-0.155
 Majesty-0.140
ode-0.138
Token.
Token ID13
Feature activation+4.49e+01

Pos logit contributions
Don+0.180
 meant+0.152
Wow+0.143
Well+0.143
 wondered+0.140
Neg logit contributions
igs-0.169
 our-0.167
ig-0.161
ap-0.155
 victory-0.154
Token They
Token ID1119
Feature activation+4.22e+01

Pos logit contributions
 meant+0.157
Don+0.154
 whispered+0.129
 needed+0.127
jar+0.127
Neg logit contributions
igs-0.275
ric-0.265
ig-0.264
 our-0.263
ap-0.257
Token left
Token ID1364
Feature activation+3.64e+01

Pos logit contributions
Don+0.176
jar+0.160
Grand+0.142
Today+0.140
books+0.138
Neg logit contributions
ric-0.214
ig-0.213
ap-0.213
igs-0.213
 dive-0.210
Token them
Token ID606
Feature activation+3.71e+01

Pos logit contributions
Don+0.222
 meant+0.213
 said+0.193
 needed+0.191
 whispered+0.190
Neg logit contributions
igs-0.139
 our-0.132
ric-0.128
ig-0.128
 victory-0.119
Token all
Token ID477
Feature activation+3.34e+01

Pos logit contributions
Don+0.233
 meant+0.218
 said+0.196
books+0.196
Wow+0.195
Neg logit contributions
igs-0.140
ig-0.138
 our-0.129
ric-0.123
ap-0.122
Token over
Token ID625
Feature activation+3.59e+01

Pos logit contributions
Don+0.209
 meant+0.184
Grand+0.176
 whispered+0.174
jar+0.173
Neg logit contributions
 our-0.131
igs-0.119
ric-0.119
ig-0.115
ap-0.113
Token the
Token ID262
Feature activation+3.58e+01

Pos logit contributions
Don+0.216
 meant+0.211
 said+0.191
 whispered+0.188
 needed+0.186
Neg logit contributions
igs-0.141
ig-0.133
 our-0.130
ric-0.125
ap-0.119
Token then
Token ID788
Feature activation+2.46e+01

Pos logit contributions
Don+0.142
 meant+0.140
 said+0.130
 needed+0.126
 whispered+0.124
Neg logit contributions
igs-0.104
ric-0.098
ig-0.098
 our-0.089
 Majesty-0.087
Token,
Token ID11
Feature activation+2.53e+01

Pos logit contributions
 meant+0.122
Don+0.122
 said+0.115
 needed+0.110
 whispered+0.106
Neg logit contributions
igs-0.122
ric-0.120
ig-0.115
 Majesty-0.105
ode-0.105
Token Tim
Token ID5045
Feature activation+2.37e+01

Pos logit contributions
Don+0.119
 meant+0.118
 said+0.112
 needed+0.106
 whispered+0.099
Neg logit contributions
igs-0.139
ig-0.136
ric-0.135
 Majesty-0.124
ode-0.121
Token's
Token ID338
Feature activation+2.33e+01

Pos logit contributions
Don+0.109
jar+0.100
books+0.098
Wow+0.096
 meant+0.096
Neg logit contributions
ig-0.126
igs-0.111
ode-0.109
ap-0.105
 arrest-0.105
Token mom
Token ID1995
Feature activation+2.49e+01

Pos logit contributions
 meant+0.103
 said+0.096
Don+0.093
 needed+0.092
 whispered+0.087
Neg logit contributions
ric-0.131
igs-0.130
ig-0.119
 our-0.114
chie-0.112
Token came
Token ID1625
Feature activation+2.20e+01

Pos logit contributions
Don+0.100
books+0.091
 meant+0.087
jar+0.086
Wow+0.086
Neg logit contributions
ig-0.138
igs-0.127
ap-0.123
 arrest-0.122
 our-0.120
Token into
Token ID656
Feature activation+2.13e+01

Pos logit contributions
 meant+0.087
Don+0.086
 said+0.084
 She+0.080
 whispered+0.078
Neg logit contributions
igs-0.132
 Majesty-0.130
ric-0.126
ig-0.125
latable-0.117
Token the
Token ID262
Feature activation+2.14e+01

Pos logit contributions
Don+0.125
 meant+0.124
 said+0.113
 needed+0.112
 whispered+0.110
Neg logit contributions
igs-0.083
ig-0.078
ric-0.076
 our-0.074
 arrest-0.070
Token room
Token ID2119
Feature activation+1.91e+01

Pos logit contributions
Don+0.106
 meant+0.104
 said+0.095
 needed+0.092
 whispered+0.090
Neg logit contributions
igs-0.105
ig-0.101
ric-0.099
 our-0.093
ap-0.091
Token.
Token ID13
Feature activation+3.38e+01

Pos logit contributions
Don+0.102
 meant+0.097
Wow+0.089
Well+0.082
books+0.080
Neg logit contributions
ap-0.081
ig-0.079
igs-0.079
 arrest-0.078
 our-0.075
Token She
Token ID1375
Feature activation+3.09e+01

Pos logit contributions
books+0.111
Don+0.111
Today+0.109
 meant+0.109
jar+0.101
Neg logit contributions
 my-0.170
 our-0.169
 your-0.164
 how-0.157
igs-0.156
Token But
Token ID887
Feature activation+4.57e+01

Pos logit contributions
Don+0.175
 meant+0.160
 She+0.155
 said+0.137
 needed+0.136
Neg logit contributions
igs-0.292
ric-0.279
ig-0.278
immers-0.263
chie-0.262
Token you
Token ID345
Feature activation+4.36e+01

Pos logit contributions
Don+0.235
 meant+0.226
 said+0.221
 needed+0.198
 She+0.191
Neg logit contributions
latable-0.233
immers-0.233
igs-0.231
pieces-0.227
ric-0.225
Token have
Token ID423
Feature activation+4.28e+01

Pos logit contributions
 meant+0.219
 said+0.202
 needed+0.197
Don+0.195
 whispered+0.178
Neg logit contributions
igs-0.236
ig-0.232
 Majesty-0.225
ric-0.213
immers-0.213
Token to
Token ID284
Feature activation+4.13e+01

Pos logit contributions
 meant+0.206
Don+0.206
 said+0.199
 needed+0.194
 whispered+0.176
Neg logit contributions
igs-0.220
ig-0.217
ric-0.211
 Majesty-0.202
immers-0.201
Token be
Token ID307
Feature activation+3.97e+01

Pos logit contributions
 meant+0.191
 said+0.189
Don+0.172
 whispered+0.169
 She+0.166
Neg logit contributions
pieces-0.212
igs-0.211
ric-0.206
eros-0.202
ig-0.202
Token careful
Token ID8161
Feature activation+4.13e+01

Pos logit contributions
Don+0.155
 meant+0.151
 said+0.148
 whispered+0.136
 wondered+0.127
Neg logit contributions
ric-0.256
ig-0.238
igsaw-0.235
latable-0.231
igs-0.229
Token on
Token ID319
Feature activation+4.17e+01

Pos logit contributions
 meant+0.125
Don+0.118
 said+0.104
 She+0.104
 needed+0.099
Neg logit contributions
igs-0.284
ig-0.278
ric-0.278
 Majesty-0.272
ocol-0.268
Token the
Token ID262
Feature activation+3.89e+01

Pos logit contributions
 meant+0.232
Don+0.223
 said+0.213
 She+0.207
 needed+0.199
Neg logit contributions
ig-0.165
ric-0.162
igs-0.159
chie-0.156
ode-0.152
Token slide
Token ID10649
Feature activation+3.82e+01

Pos logit contributions
Don+0.202
 meant+0.197
 said+0.181
 needed+0.176
 whispered+0.173
Neg logit contributions
igs-0.181
ig-0.174
ric-0.172
 our-0.158
ap-0.157
Token.
Token ID13
Feature activation+4.92e+01

Pos logit contributions
Don+0.221
Wow+0.192
 meant+0.187
 wondered+0.187
Grand+0.186
Neg logit contributions
ig-0.164
 our-0.161
igs-0.157
ap-0.153
 your-0.144
Token It
Token ID632
Feature activation+4.64e+01

Pos logit contributions
Don+0.170
 meant+0.160
 She+0.149
 said+0.129
 needed+0.126
Neg logit contributions
igs-0.324
ig-0.316
ric-0.303
chie-0.292
 victory-0.291
Token a
Token ID257
Feature activation+1.58e+01

Pos logit contributions
Don+0.102
 meant+0.098
jar+0.089
 needed+0.088
 said+0.087
Neg logit contributions
igs-0.059
 our-0.058
ric-0.055
ig-0.054
 victory-0.051
Token lot
Token ID1256
Feature activation+1.21e+01

Pos logit contributions
Don+0.078
 meant+0.076
 said+0.069
 needed+0.067
 whispered+0.066
Neg logit contributions
igs-0.078
ig-0.075
ric-0.074
 our-0.071
ap-0.068
Token of
Token ID286
Feature activation+1.17e+01

Pos logit contributions
Don+0.070
 meant+0.069
 said+0.063
 needed+0.062
 whispered+0.061
Neg logit contributions
igs-0.050
ig-0.047
ric-0.046
 our-0.043
ap-0.042
Token fun
Token ID1257
Feature activation+1.36e+01

Pos logit contributions
Don+0.055
 meant+0.054
 said+0.048
 needed+0.047
 whispered+0.046
Neg logit contributions
igs-0.061
ig-0.058
ric-0.057
 our-0.055
ap-0.053
Token.
Token ID13
Feature activation+2.50e+01

Pos logit contributions
 meant+0.038
 said+0.037
Don+0.036
 needed+0.032
 wanted+0.031
Neg logit contributions
ric-0.091
igs-0.091
chie-0.089
ig-0.088
 Majesty-0.085
Token They
Token ID1119
Feature activation+2.22e+01

Pos logit contributions
Don+0.085
 meant+0.084
 said+0.072
 whispered+0.072
 spoke+0.070
Neg logit contributions
 our-0.154
ric-0.149
igs-0.148
ap-0.147
ig-0.141
Token agreed
Token ID4987
Feature activation+2.02e+01

Pos logit contributions
Don+0.097
jar+0.092
Today+0.089
 meant+0.089
books+0.088
Neg logit contributions
e-0.099
 our-0.090
 arrest-0.090
ric-0.087
 dive-0.086
Token that
Token ID326
Feature activation+2.17e+01

Pos logit contributions
Don+0.132
 meant+0.128
 needed+0.116
 said+0.115
jar+0.114
Neg logit contributions
igs-0.066
 our-0.062
ig-0.062
ric-0.060
ap-0.056
Token the
Token ID262
Feature activation+1.86e+01

Pos logit contributions
Don+0.122
 meant+0.114
books+0.113
jar+0.113
Wow+0.106
Neg logit contributions
 our-0.091
igs-0.079
 victory-0.077
ap-0.076
ric-0.076
Token funny
Token ID8258
Feature activation+1.55e+01

Pos logit contributions
Don+0.101
 meant+0.099
 said+0.089
 whispered+0.088
 needed+0.088
Neg logit contributions
igs-0.079
ig-0.077
ric-0.076
 our-0.074
 victory-0.073
Token man
Token ID582
Feature activation+1.74e+01

Pos logit contributions
Don+0.076
 meant+0.074
 said+0.067
 needed+0.066
 whispered+0.064
Neg logit contributions
igs-0.076
ig-0.073
ric-0.072
 our-0.068
ap-0.066
TokenLu
Token ID25596
Feature activation+3.68e+01

Pos logit contributions
 meant+0.223
Don+0.207
 needed+0.198
 said+0.190
 whispered+0.189
Neg logit contributions
igs-0.206
 our-0.198
ig-0.193
ap-0.182
ric-0.179
Tokency
Token ID948
Feature activation+3.48e+01

Pos logit contributions
Don+0.226
 meant+0.221
 said+0.202
 needed+0.201
books+0.196
Neg logit contributions
ig-0.132
igs-0.128
ric-0.121
 our-0.113
ap-0.112
Token:
Token ID25
Feature activation+4.04e+01

Pos logit contributions
Don+0.145
 meant+0.132
jar+0.126
books+0.117
Wow+0.115
Neg logit contributions
ig-0.195
igs-0.191
ric-0.185
ap-0.184
 Majesty-0.176
Token "
Token ID366
Feature activation+4.51e+01

Pos logit contributions
 meant+0.150
jar+0.132
Don+0.131
 whispered+0.128
books+0.123
Neg logit contributions
ric-0.224
 our-0.212
ap-0.211
 Majesty-0.208
 victory-0.203
TokenBut
Token ID1537
Feature activation+4.35e+01

Pos logit contributions
 meant+0.150
Don+0.141
 said+0.130
 needed+0.126
 whispered+0.121
Neg logit contributions
igs-0.287
ric-0.278
ig-0.277
 our-0.266
ap-0.259
Token why
Token ID1521
Feature activation+4.29e+01

Pos logit contributions
 meant+0.199
 said+0.191
 needed+0.175
 wanted+0.174
Don+0.171
Neg logit contributions
igs-0.245
immers-0.234
pieces-0.229
 arrest-0.228
ric-0.228
Token,
Token ID11
Feature activation+4.19e+01

Pos logit contributions
 meant+0.214
Don+0.214
 said+0.195
 needed+0.189
 whispered+0.184
Neg logit contributions
igs-0.208
ig-0.199
ric-0.195
 our-0.181
 Majesty-0.178
Token mum
Token ID25682
Feature activation+4.19e+01

Pos logit contributions
 meant+0.209
 said+0.193
Don+0.185
 needed+0.176
 wanted+0.172
Neg logit contributions
igs-0.235
ric-0.225
ig-0.220
immers-0.219
chie-0.218
Token?
Token ID30
Feature activation+4.65e+01

Pos logit contributions
Don+0.235
 meant+0.233
 said+0.214
 needed+0.209
 whispered+0.205
Neg logit contributions
igs-0.176
ig-0.167
ric-0.165
 our-0.153
ap-0.148
Token I
Token ID314
Feature activation+4.51e+01

Pos logit contributions
Don+0.220
 meant+0.218
 said+0.197
 needed+0.190
 whispered+0.189
Neg logit contributions
igs-0.230
ric-0.223
ig-0.223
 our-0.211
ap-0.207
Token want
Token ID765
Feature activation+4.48e+01

Pos logit contributions
 meant+0.203
Don+0.200
 said+0.185
 needed+0.180
 whispered+0.172
Neg logit contributions
igs-0.251
ig-0.237
ric-0.236
 our-0.221
 Majesty-0.220
TokenOne
Token ID3198
Feature activation-1.03e+01

Pos logit contributions
ric+0.036
 our+0.036
ig+0.036
igs+0.035
ap+0.034
Neg logit contributions
Don-0.029
 meant-0.028
 wondered-0.025
 whispered-0.025
 needed-0.024
Token day
Token ID1110
Feature activation-1.31e+01

Pos logit contributions
igs+0.065
ig+0.063
ric+0.062
 our+0.059
ap+0.058
Neg logit contributions
Don-0.037
 meant-0.036
 said-0.031
 needed-0.030
 whispered-0.029
Token,
Token ID11
Feature activation-1.00e+01

Pos logit contributions
igs+0.054
ig+0.052
ric+0.051
 our+0.048
ap+0.046
Neg logit contributions
Don-0.075
 meant-0.073
 said-0.067
 needed-0.066
 whispered-0.065
Token a
Token ID257
Feature activation-1.10e+01

Pos logit contributions
igs+0.041
ig+0.039
ric+0.039
 our+0.036
ap+0.035
Neg logit contributions
Don-0.057
 meant-0.056
 said-0.052
 needed-0.051
 whispered-0.050
Token big
Token ID1263
Feature activation-1.50e+01

Pos logit contributions
igs+0.050
ig+0.049
ric+0.047
 our+0.046
ap+0.045
Neg logit contributions
 meant-0.056
Don-0.056
 said-0.049
 whispered-0.048
 needed-0.048
Token dinosaur
Token ID28257
Feature activation-1.78e+01

Pos logit contributions
igs+0.096
ric+0.094
ig+0.087
pieces+0.087
chie+0.086
Neg logit contributions
 meant-0.051
 said-0.049
Don-0.048
 needed-0.045
 She-0.041
Token was
Token ID373
Feature activation-1.25e+01

Pos logit contributions
igs+0.081
ig+0.080
ric+0.076
 our+0.073
ap+0.072
Neg logit contributions
Don-0.094
 meant-0.091
 said-0.081
 whispered-0.079
 needed-0.078
Token in
Token ID287
Feature activation-1.13e+01

Pos logit contributions
igs+0.069
ig+0.069
ric+0.069
ode+0.064
 Majesty+0.063
Neg logit contributions
 meant-0.053
 said-0.051
Don-0.050
 needed-0.049
 whispered-0.045
Token the
Token ID262
Feature activation-1.34e+01

Pos logit contributions
ig+0.064
igs+0.062
ric+0.060
immers+0.057
ap+0.057
Neg logit contributions
Don-0.050
 meant-0.050
 said-0.045
 She-0.044
 needed-0.043
Token park
Token ID3952
Feature activation-1.60e+01

Pos logit contributions
igs+0.072
ig+0.068
ric+0.065
chie+0.060
ap+0.060
Neg logit contributions
Don-0.060
 meant-0.059
 said-0.055
 needed-0.054
 She-0.051
Token.
Token ID13
Feature activation+6.14e-01

Pos logit contributions
igs+0.072
ig+0.069
ric+0.068
 our+0.065
ap+0.065
Neg logit contributions
Don-0.085
 meant-0.083
 said-0.074
 whispered-0.072
 needed-0.072
Token upon
Token ID2402
Feature activation-1.64e+01

Pos logit contributions
igs+0.050
ig+0.049
ric+0.048
 our+0.046
ap+0.044
Neg logit contributions
Don-0.039
 meant-0.038
 said-0.034
 needed-0.033
 whispered-0.032
Token a
Token ID257
Feature activation-1.31e+01

Pos logit contributions
ig+0.102
igs+0.102
ric+0.095
 Majesty+0.092
ocol+0.090
Neg logit contributions
Don-0.074
 meant-0.069
 said-0.065
 She-0.063
 needed-0.060
Token time
Token ID640
Feature activation-1.33e+01

Pos logit contributions
igs+0.079
ric+0.077
ig+0.074
 Majesty+0.070
 our+0.070
Neg logit contributions
Don-0.051
 meant-0.049
 said-0.045
 needed-0.043
Today-0.041
Token,
Token ID11
Feature activation-1.07e+01

Pos logit contributions
igs+0.074
ric+0.071
ode+0.067
ig+0.067
 Majesty+0.067
Neg logit contributions
Don-0.057
 meant-0.056
 said-0.054
 She-0.052
 needed-0.051
Token there
Token ID612
Feature activation-9.18e+00

Pos logit contributions
igs+0.058
ric+0.055
ig+0.054
 Majesty+0.054
 arrest+0.053
Neg logit contributions
Don-0.051
 meant-0.049
 said-0.045
 needed-0.044
 She-0.043
Token was
Token ID373
Feature activation-1.26e+01

Pos logit contributions
ig+0.052
ric+0.051
igs+0.051
 our+0.051
ap+0.049
Neg logit contributions
Don-0.035
 meant-0.032
 whispered-0.030
jar-0.029
Grand-0.027
Token a
Token ID257
Feature activation-1.07e+01

Pos logit contributions
igs+0.069
ig+0.068
ric+0.067
 Majesty+0.063
chie+0.062
Neg logit contributions
Don-0.061
 meant-0.059
 said-0.054
 She-0.052
 needed-0.051
Token little
Token ID1310
Feature activation-1.48e+01

Pos logit contributions
igs+0.051
ig+0.050
ric+0.048
 our+0.046
ap+0.045
Neg logit contributions
Don-0.054
 meant-0.053
 said-0.047
 needed-0.046
 whispered-0.046
Token girl
Token ID2576
Feature activation-1.36e+01

Pos logit contributions
igs+0.105
ric+0.105
ig+0.099
 Majesty+0.096
 arrest+0.095
Neg logit contributions
Don-0.040
 meant-0.039
 said-0.036
 needed-0.033
 whispered-0.030
Token named
Token ID3706
Feature activation-8.99e+00

Pos logit contributions
igs+0.075
ric+0.072
ig+0.069
 Majesty+0.068
 victory+0.067
Neg logit contributions
Don-0.060
 meant-0.059
 said-0.058
 needed-0.055
 whispered-0.053
Token Lucy
Token ID22162
Feature activation-1.17e+01

Pos logit contributions
igs+0.052
ig+0.051
 our+0.049
ric+0.049
ap+0.047
Neg logit contributions
Don-0.036
 meant-0.035
 said-0.030
 needed-0.029
 whispered-0.029
Token all
Token ID477
Feature activation-5.49e+01

Pos logit contributions
ric+0.250
ig+0.240
igs+0.239
latable+0.224
 Majesty+0.219
Neg logit contributions
Don-0.299
 said-0.291
 meant-0.287
 whispered-0.258
 She-0.254
Token the
Token ID262
Feature activation-5.21e+01

Pos logit contributions
 Majesty+0.293
igs+0.284
ig+0.279
ode+0.273
ric+0.269
Neg logit contributions
 meant-0.266
 said-0.258
Don-0.242
 needed-0.223
 wanted-0.222
Token bugs
Token ID11316
Feature activation-5.73e+01

Pos logit contributions
igs+0.278
ric+0.274
ig+0.272
 arrest+0.244
 Majesty+0.244
Neg logit contributions
 meant-0.234
Don-0.228
 said-0.214
 needed-0.207
 whispered-0.196
Token and
Token ID290
Feature activation-4.46e+01

Pos logit contributions
igs+0.286
ig+0.275
ric+0.271
 our+0.256
ap+0.249
Neg logit contributions
Don-0.276
 meant-0.270
 said-0.244
 needed-0.238
 whispered-0.233
Token butterflies
Token ID44571
Feature activation-5.41e+01

Pos logit contributions
ig+0.247
igs+0.226
latable+0.217
ric+0.212
immers+0.208
Neg logit contributions
 said-0.222
Don-0.216
 meant-0.211
 needed-0.207
 wanted-0.204
Token I
Token ID314
Feature activation-4.46e+01

Pos logit contributions
igs+0.285
ig+0.274
ric+0.272
 our+0.255
ap+0.249
Neg logit contributions
Don-0.245
 meant-0.240
 said-0.218
 needed-0.210
 whispered-0.206
Token want
Token ID765
Feature activation-4.55e+01

Pos logit contributions
igs+0.214
ig+0.206
ric+0.203
 our+0.190
ap+0.186
Neg logit contributions
Don-0.224
 meant-0.219
 said-0.199
 needed-0.194
 whispered-0.190
Token."
Token ID526
Feature activation-3.12e+01

Pos logit contributions
igs+0.169
ig+0.161
ric+0.158
 our+0.143
ap+0.140
Neg logit contributions
Don-0.278
 meant-0.274
 said-0.254
 needed-0.248
 whispered-0.244
Token
Token ID198
Feature activation-3.10e+01

Pos logit contributions
 our+0.161
ap+0.156
ric+0.156
igs+0.156
ig+0.155
Neg logit contributions
 meant-0.139
Don-0.125
 needed-0.113
chy-0.112
books-0.112
Token
Token ID198
Feature activation-3.10e+01

Pos logit contributions
igs+0.179
 our+0.169
ig+0.169
ric+0.169
ap+0.159
Neg logit contributions
 meant-0.128
Don-0.114
 needed-0.110
 faded-0.102
 smelled-0.096
TokenShe
Token ID3347
Feature activation-3.50e+01

Pos logit contributions
igs+0.179
ig+0.172
 our+0.167
ric+0.167
ap+0.163
Neg logit contributions
 meant-0.129
Don-0.113
 needed-0.111
 said-0.103
 whispered-0.099
Token pretty
Token ID2495
Feature activation-4.01e+01

Pos logit contributions
igs+0.181
ig+0.169
 our+0.164
ric+0.163
 Majesty+0.156
Neg logit contributions
Don-0.185
 meant-0.178
 said-0.158
 needed-0.155
 whispered-0.153
Token and
Token ID290
Feature activation-2.84e+01

Pos logit contributions
igs+0.249
ric+0.242
ig+0.238
 our+0.223
ap+0.223
Neg logit contributions
 meant-0.149
Don-0.145
 said-0.131
 needed-0.125
 whispered-0.120
Token decided
Token ID3066
Feature activation-3.05e+01

Pos logit contributions
igs+0.181
ig+0.173
ric+0.171
 our+0.170
 victory+0.162
Neg logit contributions
Don-0.093
 meant-0.089
 said-0.073
books-0.071
jar-0.071
Token to
Token ID284
Feature activation-2.98e+01

Pos logit contributions
igs+0.126
ig+0.120
ric+0.119
 our+0.106
ap+0.106
Neg logit contributions
Don-0.177
 meant-0.174
 said-0.162
 needed-0.157
 whispered-0.154
Token stay
Token ID2652
Feature activation-3.19e+01

Pos logit contributions
igs+0.167
ig+0.157
ric+0.154
 unnecessary+0.149
ishing+0.147
Neg logit contributions
 meant-0.131
 said-0.131
 She-0.126
 wanted-0.121
 wondered-0.120
Token a
Token ID257
Feature activation-3.15e+01

Pos logit contributions
ric+0.173
igs+0.164
ig+0.157
 arrest+0.154
 Majesty+0.151
Neg logit contributions
Don-0.141
 meant-0.141
 said-0.136
 whispered-0.130
 needed-0.127
Token while
Token ID981
Feature activation-2.94e+01

Pos logit contributions
igs+0.170
ric+0.163
ig+0.158
 our+0.145
ap+0.145
Neg logit contributions
Don-0.141
 meant-0.132
 said-0.127
 needed-0.124
 whispered-0.120
Token,
Token ID11
Feature activation-2.61e+01

Pos logit contributions
ig+0.125
igs+0.125
 our+0.124
ap+0.120
 victory+0.111
Neg logit contributions
Don-0.157
 meant-0.152
Wow-0.140
books-0.136
Well-0.129
Token but
Token ID475
Feature activation-2.19e+01

Pos logit contributions
igs+0.144
ig+0.139
ric+0.137
 our+0.130
ap+0.127
Neg logit contributions
Don-0.113
 meant-0.111
 said-0.099
 needed-0.096
 whispered-0.093
Token then
Token ID788
Feature activation-2.50e+01

Pos logit contributions
igs+0.115
ig+0.110
ric+0.109
 our+0.101
ap+0.099
Neg logit contributions
Don-0.103
 meant-0.101
 said-0.092
 needed-0.089
 whispered-0.086
Token she
Token ID673
Feature activation-2.37e+01

Pos logit contributions
igs+0.145
ig+0.144
 our+0.137
ric+0.132
ap+0.132
Neg logit contributions
Don-0.095
 meant-0.091
books-0.080
jar-0.078
 needed-0.074
Token people
Token ID661
Feature activation-2.10e+01

Pos logit contributions
igs+0.118
ig+0.115
ric+0.114
 our+0.107
ap+0.105
Neg logit contributions
Don-0.105
 meant-0.103
 said-0.093
 needed-0.090
 whispered-0.088
Token on
Token ID319
Feature activation-1.88e+01

Pos logit contributions
igs+0.096
ig+0.093
ric+0.093
 our+0.086
ap+0.084
Neg logit contributions
Don-0.109
 meant-0.106
 said-0.096
 needed-0.094
 whispered-0.093
Token the
Token ID262
Feature activation-2.03e+01

Pos logit contributions
igs+0.085
ig+0.085
ric+0.081
ap+0.076
 Majesty+0.076
Neg logit contributions
Don-0.099
 meant-0.098
 said-0.090
 needed-0.087
 She-0.085
Token other
Token ID584
Feature activation-2.33e+01

Pos logit contributions
igs+0.103
ig+0.101
ric+0.099
ocol+0.093
 Majesty+0.092
Neg logit contributions
 meant-0.092
Don-0.089
 said-0.086
 needed-0.084
 whispered-0.080
Token side
Token ID1735
Feature activation-2.20e+01

Pos logit contributions
igs+0.099
ig+0.093
ric+0.090
 our+0.084
 Majesty+0.083
Neg logit contributions
 meant-0.130
Don-0.129
 said-0.121
 needed-0.119
 whispered-0.115
Token.
Token ID13
Feature activation-5.56e+00

Pos logit contributions
ig+0.095
igs+0.092
ric+0.090
ap+0.086
 our+0.086
Neg logit contributions
Don-0.124
 meant-0.111
Wow-0.103
jar-0.100
 whispered-0.100
Token
Token ID220
Feature activation-1.50e+01

Pos logit contributions
igs+0.030
ric+0.029
ig+0.028
 our+0.028
latable+0.026
Neg logit contributions
 meant-0.024
Don-0.023
books-0.020
 whispered-0.020
 wondered-0.019
Token
Token ID198
Feature activation-5.57e+00

Pos logit contributions
igs+0.092
ric+0.091
ig+0.087
 our+0.085
ap+0.083
Neg logit contributions
Don-0.053
 meant-0.052
 whispered-0.044
 needed-0.044
 spoke-0.043
Token
Token ID198
Feature activation-5.78e+00

Pos logit contributions
igs+0.034
ig+0.032
ric+0.032
 our+0.031
ap+0.030
Neg logit contributions
 meant-0.021
Don-0.019
 needed-0.017
 whispered-0.016
 said-0.016
TokenFl
Token ID7414
Feature activation-2.15e+01

Pos logit contributions
igs+0.030
ig+0.028
ric+0.027
 our+0.026
ap+0.025
Neg logit contributions
 meant-0.028
Don-0.027
 said-0.024
 needed-0.024
 whispered-0.023
Tokenuffy
Token ID15352
Feature activation-1.51e+01

Pos logit contributions
igs+0.085
ig+0.082
ric+0.080
ap+0.073
 our+0.071
Neg logit contributions
Don-0.131
 meant-0.129
 said-0.118
 needed-0.115
 whispered-0.114
Token pet
Token ID4273
Feature activation-8.43e+01

Pos logit contributions
igs+0.407
ig+0.393
ric+0.389
 our+0.367
ap+0.355
Neg logit contributions
Don-0.404
 meant-0.395
 said-0.356
 needed-0.347
 whispered-0.341
Token shop
Token ID6128
Feature activation-7.76e+01

Pos logit contributions
ig+0.391
ric+0.346
 our+0.344
igs+0.337
ap+0.334
Neg logit contributions
Don-0.430
 meant-0.391
books-0.372
Grand-0.366
 wondered-0.359
Token.
Token ID13
Feature activation-5.08e+01

Pos logit contributions
ig+0.322
ric+0.316
 our+0.312
igs+0.310
ap+0.300
Neg logit contributions
Don-0.424
 meant-0.379
Wow-0.365
books-0.346
jar-0.344
Token She
Token ID1375
Feature activation-5.58e+01

Pos logit contributions
ric+0.267
 our+0.267
igs+0.256
 my+0.255
ap+0.253
Neg logit contributions
 meant-0.206
Don-0.192
 needed-0.172
chy-0.170
 faded-0.170
Token asked
Token ID1965
Feature activation-5.58e+01

Pos logit contributions
igs+0.338
ig+0.335
ric+0.329
 our+0.321
 Majesty+0.316
Neg logit contributions
Don-0.194
 meant-0.174
jar-0.161
books-0.160
Wow-0.150
Token her
Token ID607
Feature activation-5.92e+01

Pos logit contributions
igs+0.237
 our+0.235
ig+0.228
ap+0.220
ric+0.216
Neg logit contributions
Don-0.302
 meant-0.280
jar-0.268
 needed-0.252
Today-0.252
Token mom
Token ID1995
Feature activation-5.76e+01

Pos logit contributions
igs+0.279
ig+0.270
ric+0.265
 our+0.253
ap+0.244
Neg logit contributions
Don-0.304
 meant-0.293
 said-0.262
 needed-0.259
 whispered-0.253
Token to
Token ID284
Feature activation-5.48e+01

Pos logit contributions
igs+0.279
ig+0.269
ric+0.266
 our+0.251
ap+0.245
Neg logit contributions
Don-0.286
 meant-0.278
 said-0.251
 needed-0.245
 whispered-0.241
Token buy
Token ID2822
Feature activation-5.81e+01

Pos logit contributions
igs+0.283
ig+0.273
ric+0.257
 Majesty+0.240
immers+0.223
Neg logit contributions
 meant-0.279
 said-0.278
Don-0.273
 whispered-0.254
 She-0.246
Token it
Token ID340
Feature activation-5.57e+01

Pos logit contributions
igs+0.248
ig+0.238
ric+0.235
 our+0.213
ap+0.212
Neg logit contributions
Don-0.323
 meant-0.320
 said-0.296
 needed-0.286
 whispered-0.282
Token for
Token ID329
Feature activation-5.32e+01

Pos logit contributions
 our+0.219
igs+0.217
ig+0.214
 my+0.191
 your+0.190
Neg logit contributions
Don-0.343
 meant-0.296
 wondered-0.291
Wow-0.285
Well-0.280
Token asks
Token ID7893
Feature activation-5.66e+01

Pos logit contributions
ap+0.367
 our+0.355
ig+0.348
 my+0.346
 dive+0.345
Neg logit contributions
jar-0.232
Don-0.228
books-0.195
Wow-0.185
Today-0.176
Token.
Token ID13
Feature activation-4.46e+01

Pos logit contributions
igs+0.317
ig+0.311
ric+0.299
 our+0.299
ap+0.296
Neg logit contributions
Don-0.235
 meant-0.225
 needed-0.192
 said-0.192
 whispered-0.187
Token
Token ID198
Feature activation-4.53e+01

Pos logit contributions
 our+0.212
ap+0.197
 my+0.196
 your+0.191
ig+0.180
Neg logit contributions
 meant-0.215
Don-0.187
 needed-0.187
chy-0.187
books-0.173
Token
Token ID198
Feature activation-4.49e+01

Pos logit contributions
igs+0.300
ig+0.288
 our+0.287
ric+0.286
ap+0.275
Neg logit contributions
 meant-0.145
Don-0.139
 needed-0.117
 said-0.110
 whispered-0.110
TokenThe
Token ID464
Feature activation-4.98e+01

Pos logit contributions
igs+0.238
ig+0.229
ric+0.225
 our+0.218
ap+0.211
Neg logit contributions
 meant-0.202
Don-0.201
 said-0.177
 needed-0.176
 whispered-0.170
Token man
Token ID582
Feature activation-5.53e+01

Pos logit contributions
igs+0.253
ig+0.249
ric+0.242
 victory+0.240
 our+0.238
Neg logit contributions
Don-0.225
 meant-0.219
books-0.189
 needed-0.186
 said-0.184
Token smiles
Token ID21845
Feature activation-5.40e+01

Pos logit contributions
ig+0.275
igs+0.262
ap+0.259
ric+0.256
 our+0.250
Neg logit contributions
Don-0.269
jar-0.240
 meant-0.230
books-0.227
Wow-0.214
Token.
Token ID13
Feature activation-3.68e+01

Pos logit contributions
ap+0.193
 our+0.178
 victory+0.169
ig+0.164
 how+0.161
Neg logit contributions
Don-0.316
jar-0.303
Well-0.281
books-0.280
Wow-0.275
Token He
Token ID679
Feature activation-4.49e+01

Pos logit contributions
 my+0.171
 our+0.169
 your+0.164
ric+0.161
ap+0.158
Neg logit contributions
 meant-0.149
Don-0.148
chy-0.144
 needed-0.142
books-0.139
Token nods
Token ID34833
Feature activation-4.89e+01

Pos logit contributions
igs+0.266
ig+0.262
ric+0.248
 our+0.248
 Majesty+0.240
Neg logit contributions
Don-0.177
 meant-0.150
jar-0.147
books-0.143
Wow-0.134
Token.
Token ID13
Feature activation-3.36e+01

Pos logit contributions
 our+0.220
ig+0.210
ap+0.208
igs+0.201
ric+0.197
Neg logit contributions
Don-0.245
 meant-0.237
jar-0.222
 needed-0.213
books-0.206
Token boy
Token ID2933
Feature activation-6.36e+01

Pos logit contributions
ric+0.446
igs+0.434
ig+0.398
pieces+0.377
latable+0.376
Neg logit contributions
 meant-0.233
 said-0.233
Don-0.231
 needed-0.197
 wanted-0.189
Token,
Token ID11
Feature activation-5.46e+01

Pos logit contributions
igs+0.325
ric+0.310
ig+0.303
 Majesty+0.275
 victory+0.273
Neg logit contributions
 meant-0.305
Don-0.301
 said-0.289
 needed-0.271
 whispered-0.264
Token why
Token ID1521
Feature activation-5.56e+01

Pos logit contributions
igs+0.308
ig+0.287
ric+0.273
immers+0.253
 Majesty+0.248
Neg logit contributions
Don-0.261
 meant-0.256
 said-0.248
 wanted-0.233
 needed-0.225
Token don
Token ID836
Feature activation-5.92e+01

Pos logit contributions
igs+0.375
ig+0.361
ric+0.344
 Majesty+0.342
chie+0.340
Neg logit contributions
 meant-0.199
Don-0.179
 said-0.178
 needed-0.160
 wanted-0.149
Token't
Token ID470
Feature activation-5.69e+01

Pos logit contributions
 Majesty+0.373
ig+0.370
igs+0.369
ric+0.355
ode+0.348
Neg logit contributions
 meant-0.231
Don-0.231
 said-0.205
 She-0.200
 needed-0.193
Token you
Token ID345
Feature activation-5.53e+01

Pos logit contributions
igs+0.280
ig+0.265
ric+0.265
 Majesty+0.248
latable+0.243
Neg logit contributions
 said-0.304
Don-0.295
 meant-0.295
 She-0.273
 needed-0.269
Token come
Token ID1282
Feature activation-5.64e+01

Pos logit contributions
igs+0.202
ig+0.196
ric+0.195
 our+0.183
ap+0.182
Neg logit contributions
Don-0.340
 meant-0.324
 said-0.295
 needed-0.292
 whispered-0.288
Token and
Token ID290
Feature activation-4.87e+01

Pos logit contributions
igs+0.364
ig+0.325
ocol+0.321
 Majesty+0.313
chie+0.309
Neg logit contributions
 meant-0.207
 said-0.195
 She-0.188
Don-0.182
 whispered-0.175
Token visit
Token ID3187
Feature activation-5.29e+01

Pos logit contributions
igs+0.326
ig+0.325
ocol+0.286
ishing+0.275
ric+0.274
Neg logit contributions
 said-0.222
Don-0.196
 needed-0.181
 wanted-0.180
 whispered-0.179
Token us
Token ID514
Feature activation-5.48e+01

Pos logit contributions
igs+0.253
ric+0.247
ig+0.243
chie+0.220
 Majesty+0.216
Neg logit contributions
 meant-0.269
 said-0.267
Don-0.264
 whispered-0.244
 needed-0.238
Token?"
Token ID1701
Feature activation-3.87e+01

Pos logit contributions
igs+0.206
ig+0.197
ric+0.193
 our+0.180
ap+0.174
Neg logit contributions
Don-0.332
 meant-0.324
 said-0.299
 needed-0.293
 whispered-0.288
Token Tim
Token ID5045
Feature activation-6.48e+01

Pos logit contributions
ric+0.293
 our+0.291
ap+0.285
igs+0.281
ig+0.278
Neg logit contributions
 meant-0.217
Don-0.204
books-0.177
 needed-0.176
 whispered-0.176
Tokenmy
Token ID1820
Feature activation-7.03e+01

Pos logit contributions
ig+0.313
ric+0.294
igs+0.286
 our+0.268
ap+0.267
Neg logit contributions
Don-0.323
 meant-0.289
jar-0.289
books-0.287
Wow-0.271
Token smiled
Token ID13541
Feature activation-6.57e+01

Pos logit contributions
igs+0.437
ig+0.436
ric+0.423
 our+0.408
ap+0.403
Neg logit contributions
Don-0.242
 meant-0.222
jar-0.197
books-0.193
Wow-0.179
Token and
Token ID290
Feature activation-5.35e+01

Pos logit contributions
 our+0.294
igs+0.292
ig+0.291
ap+0.286
ric+0.284
Neg logit contributions
Don-0.326
 meant-0.312
jar-0.285
books-0.279
 needed-0.278
Token went
Token ID1816
Feature activation-5.66e+01

Pos logit contributions
igs+0.419
ig+0.408
ric+0.406
 our+0.389
ap+0.384
Neg logit contributions
Don-0.109
 meant-0.104
 said-0.083
 needed-0.075
 whispered-0.071
Token home
Token ID1363
Feature activation-6.00e+01

Pos logit contributions
ric+0.312
igs+0.292
ocol+0.282
chie+0.279
ig+0.275
Neg logit contributions
 said-0.260
 whispered-0.240
Don-0.237
 She-0.236
 meant-0.229
Token,
Token ID11
Feature activation-4.94e+01

Pos logit contributions
igs+0.290
ig+0.277
ric+0.272
 our+0.260
ap+0.253
Neg logit contributions
Don-0.301
 meant-0.294
 said-0.259
 needed-0.256
 whispered-0.249
Token dreaming
Token ID28368
Feature activation-5.03e+01

Pos logit contributions
 our+0.213
igs+0.209
ric+0.204
ig+0.204
 arrest+0.191
Neg logit contributions
 meant-0.256
Don-0.239
books-0.231
Wow-0.215
 needed-0.206
Token of
Token ID286
Feature activation-5.28e+01

Pos logit contributions
igs+0.181
ig+0.174
ric+0.159
 our+0.156
 victory+0.154
Neg logit contributions
Don-0.315
 meant-0.304
 said-0.278
 whispered-0.275
 wondered-0.275
Token his
Token ID465
Feature activation-5.47e+01

Pos logit contributions
igs+0.229
ig+0.221
ric+0.216
 our+0.197
ap+0.196
Neg logit contributions
Don-0.291
 meant-0.288
 said-0.266
 needed-0.258
 whispered-0.253
Token next
Token ID1306
Feature activation-5.15e+01

Pos logit contributions
igs+0.224
ig+0.219
ric+0.206
 victory+0.199
 our+0.199
Neg logit contributions
Don-0.319
 meant-0.308
 said-0.276
 She-0.272
 whispered-0.269
Token dreamed
Token ID27947
Feature activation-6.64e+01

Pos logit contributions
igs+0.389
ig+0.368
ric+0.363
latable+0.332
ode+0.327
Neg logit contributions
 meant-0.385
Don-0.377
 said-0.357
 needed-0.347
 whispered-0.331
Token of
Token ID286
Feature activation-6.74e+01

Pos logit contributions
igs+0.253
ig+0.243
ric+0.229
 our+0.222
ap+0.212
Neg logit contributions
Don-0.399
 meant-0.387
 said-0.353
 whispered-0.346
 needed-0.345
Token being
Token ID852
Feature activation-6.88e+01

Pos logit contributions
igs+0.295
ig+0.288
ric+0.275
ap+0.255
chie+0.248
Neg logit contributions
Don-0.364
 meant-0.364
 said-0.338
 needed-0.326
 whispered-0.319
Token one
Token ID530
Feature activation-6.84e+01

Pos logit contributions
igs+0.345
ig+0.333
ric+0.329
 our+0.321
ap+0.306
Neg logit contributions
Don-0.323
 meant-0.311
 said-0.280
 needed-0.271
books-0.268
Token.
Token ID13
Feature activation-4.87e+01

Pos logit contributions
igs+0.349
ig+0.337
ric+0.331
 our+0.314
ap+0.306
Neg logit contributions
Don-0.323
 meant-0.316
 said-0.285
 needed-0.277
 whispered-0.272
Token One
Token ID1881
Feature activation-5.30e+01

Pos logit contributions
ric+0.273
 our+0.259
ig+0.257
igs+0.254
ap+0.251
Neg logit contributions
 meant-0.195
Don-0.182
 whispered-0.160
books-0.158
 smelled-0.158
Token day
Token ID1110
Feature activation-5.52e+01

Pos logit contributions
ig+0.289
igs+0.283
ric+0.271
 our+0.267
ap+0.259
Neg logit contributions
Don-0.227
 meant-0.210
books-0.187
 said-0.184
 whispered-0.182
Token,
Token ID11
Feature activation-5.02e+01

Pos logit contributions
igs+0.211
ig+0.203
 our+0.194
ric+0.186
ap+0.182
Neg logit contributions
Don-0.327
 meant-0.320
 needed-0.286
 said-0.285
 whispered-0.281
Token her
Token ID607
Feature activation-5.47e+01

Pos logit contributions
igs+0.285
ig+0.274
ric+0.271
 our+0.258
ap+0.252
Neg logit contributions
Don-0.209
 meant-0.204
 said-0.181
 needed-0.176
 whispered-0.172
Token mom
Token ID1995
Feature activation-5.50e+01

Pos logit contributions
igs+0.244
ig+0.240
ric+0.229
 our+0.220
ap+0.213
Neg logit contributions
Don-0.294
 meant-0.284
 said-0.255
 needed-0.250
 whispered-0.247
Tokenmy
Token ID1820
Feature activation-6.27e+01

Pos logit contributions
 our+0.269
ig+0.264
ric+0.259
 my+0.256
ap+0.250
Neg logit contributions
Don-0.244
books-0.222
jar-0.206
Wow-0.206
Dead-0.198
Token
Token ID129
Feature activation-1.15e+02

Pos logit contributions
igs+0.754
ig+0.705
ric+0.684
 our+0.671
ode+0.649
Neg logit contributions
Don-0.480
 meant-0.457
 said-0.436
 needed-0.401
 She-0.392
Token
Token ID241
Feature activation-9.51e+01

Pos logit contributions
igs+0.493
ig+0.471
ric+0.470
 our+0.434
ap+0.422
Neg logit contributions
Don-0.632
 meant-0.623
 said-0.568
 needed-0.561
 whispered-0.547
TokenLet
Token ID5756
Feature activation-7.56e+01

Pos logit contributions
ric+0.607
igs+0.587
ig+0.573
ap+0.557
 our+0.556
Neg logit contributions
 meant-0.300
Don-0.288
 said-0.259
 whispered-0.258
 needed-0.250
Tokenâ
Token ID22940
Feature activation-9.03e+01

Pos logit contributions
igs+0.466
ric+0.441
ig+0.425
ocol+0.391
 Majesty+0.381
Neg logit contributions
 meant-0.339
 said-0.322
Don-0.319
 needed-0.278
 wanted-0.276
Token
Token ID26391
Feature activation-9.07e+01

Pos logit contributions
igs+0.404
ig+0.380
ric+0.371
 our+0.352
 Majesty+0.336
Neg logit contributions
Don-0.492
 meant-0.485
 said-0.445
 needed-0.431
 whispered-0.424
Token
Token ID8151
Feature activation-1.09e+02

Pos logit contributions
igs+0.495
ig+0.465
ric+0.452
 our+0.439
 victory+0.424
Neg logit contributions
Don-0.404
 meant-0.390
 said-0.368
 needed-0.349
 She-0.340
Tokens
Token ID82
Feature activation-8.17e+01

Pos logit contributions
ric+0.596
ig+0.571
igs+0.543
ap+0.535
 our+0.512
Neg logit contributions
Don-0.467
 meant-0.436
 wondered-0.401
 whispered-0.401
jar-0.384
Token have
Token ID423
Feature activation-7.38e+01

Pos logit contributions
igs+0.393
ig+0.360
ocol+0.352
ric+0.349
latable+0.335
Neg logit contributions
 meant-0.437
 said-0.423
 She-0.386
 needed-0.384
Don-0.381
Token a
Token ID257
Feature activation-7.31e+01

Pos logit contributions
ric+0.357
igs+0.355
ig+0.352
ap+0.310
 Majesty+0.300
Neg logit contributions
 meant-0.379
Don-0.377
 said-0.369
 needed-0.339
 whispered-0.335
Token stretching
Token ID20880
Feature activation-7.88e+01

Pos logit contributions
igs+0.364
ric+0.346
ig+0.339
 our+0.313
 arrest+0.310
Neg logit contributions
Don-0.355
 meant-0.350
 said-0.326
 needed-0.315
 whispered-0.307
Token race
Token ID3234
Feature activation-7.27e+01

Pos logit contributions
igs+0.372
ric+0.349
ig+0.336
 Majesty+0.332
 arrest+0.330
Neg logit contributions
 meant-0.394
Don-0.382
 said-0.370
 needed-0.353
 whispered-0.340
Token her
Token ID607
Feature activation-9.45e+01

Pos logit contributions
Don+0.177
 meant+0.170
 said+0.150
 needed+0.145
 whispered+0.142
Neg logit contributions
igs-0.256
ig-0.247
ric-0.244
 our-0.234
ap-0.229
Token mom
Token ID1995
Feature activation-8.10e+01

Pos logit contributions
igs+0.432
ig+0.416
ric+0.410
 our+0.387
ap+0.375
Neg logit contributions
Don-0.497
 meant-0.484
 said-0.436
 needed-0.426
 whispered-0.421
Tokenmy
Token ID1820
Feature activation-8.85e+01

Pos logit contributions
ric+0.426
ig+0.415
ap+0.403
 our+0.398
igs+0.395
Neg logit contributions
Don-0.359
 meant-0.311
books-0.300
Wow-0.299
jar-0.292
Token watched
Token ID7342
Feature activation-7.30e+01

Pos logit contributions
ric+0.446
ig+0.439
ap+0.430
 our+0.428
igs+0.421
Neg logit contributions
Don-0.398
Wow-0.345
 meant-0.332
books-0.332
jar-0.329
Token in
Token ID287
Feature activation-6.94e+01

Pos logit contributions
igs+0.326
ig+0.308
ric+0.304
 our+0.303
ap+0.289
Neg logit contributions
Don-0.391
 meant-0.380
 needed-0.339
 said-0.339
 whispered-0.333
Token amaz
Token ID40642
Feature activation-1.11e+02

Pos logit contributions
igs+0.290
ig+0.276
ric+0.276
 our+0.268
ap+0.251
Neg logit contributions
Don-0.387
 meant-0.379
 said-0.343
 needed-0.340
 whispered-0.334
Tokenement
Token ID972
Feature activation-8.29e+01

Pos logit contributions
ric+0.762
igs+0.739
 arrest+0.704
 dive+0.696
ode+0.679
Neg logit contributions
 meant-0.340
 said-0.333
Don-0.279
 She-0.275
 answered-0.254
Token as
Token ID355
Feature activation-6.66e+01

Pos logit contributions
igs+0.469
ig+0.452
ric+0.447
 our+0.422
ap+0.412
Neg logit contributions
Don-0.344
 meant-0.339
 said-0.304
 needed-0.292
 whispered-0.285
Token the
Token ID262
Feature activation-6.89e+01

Pos logit contributions
igs+0.354
ig+0.338
 our+0.336
ric+0.328
 victory+0.320
Neg logit contributions
Don-0.292
 meant-0.279
 needed-0.242
 said-0.242
 whispered-0.239
Token machine
Token ID4572
Feature activation-7.35e+01

Pos logit contributions
igs+0.314
ig+0.305
ric+0.297
 our+0.285
 victory+0.283
Neg logit contributions
Don-0.356
 meant-0.343
 said-0.304
 needed-0.303
 whispered-0.298
Token flew
Token ID13112
Feature activation-6.71e+01

Pos logit contributions
ig+0.364
ap+0.354
 our+0.351
ric+0.338
 victory+0.337
Neg logit contributions
Don-0.335
Wow-0.285
jar-0.271
Grand-0.267
Today-0.264
Token give
Token ID1577
Feature activation-7.37e+01

Pos logit contributions
ig+0.354
igs+0.337
ric+0.315
immers+0.298
 Majesty+0.297
Neg logit contributions
 meant-0.378
 said-0.373
Don-0.370
 needed-0.345
 whispered-0.340
Token to
Token ID284
Feature activation-6.78e+01

Pos logit contributions
igs+0.334
ig+0.331
ric+0.325
ap+0.298
 Majesty+0.297
Neg logit contributions
Don-0.391
 meant-0.389
 said-0.371
 needed-0.353
 whispered-0.341
Token the
Token ID262
Feature activation-7.25e+01

Pos logit contributions
ig+0.384
igs+0.362
ric+0.350
ode+0.344
ap+0.342
Neg logit contributions
 said-0.282
 meant-0.277
Don-0.263
 She-0.251
 needed-0.250
Token rh
Token ID9529
Feature activation-9.75e+01

Pos logit contributions
igs+0.321
ig+0.305
ric+0.300
 our+0.283
 victory+0.276
Neg logit contributions
Don-0.391
 meant-0.381
 said-0.342
 needed-0.335
books-0.334
Tokenin
Token ID259
Feature activation-8.67e+01

Pos logit contributions
igs+0.458
latable+0.451
ric+0.433
 Majesty+0.425
ig+0.420
Neg logit contributions
Don-0.459
 meant-0.441
 said-0.440
 needed-0.421
 whispered-0.389
Tokenoc
Token ID420
Feature activation-1.09e+02

Pos logit contributions
igs+0.379
ig+0.359
ric+0.354
 our+0.328
ap+0.320
Neg logit contributions
Don-0.474
 meant-0.468
 said-0.433
 needed-0.421
 whispered-0.412
Tokeneros
Token ID27498
Feature activation-8.22e+01

Pos logit contributions
igs+0.324
latable+0.279
ig+0.263
 our+0.253
 dive+0.244
Neg logit contributions
 meant-0.667
Don-0.654
 said-0.644
 needed-0.631
 man-0.601
Token.
Token ID13
Feature activation-5.32e+01

Pos logit contributions
igs+0.405
ig+0.389
ric+0.383
 our+0.362
ap+0.352
Neg logit contributions
Don-0.402
 meant-0.392
 said-0.355
 needed-0.345
 whispered-0.339
Token Lily
Token ID20037
Feature activation-6.43e+01

Pos logit contributions
ric+0.267
 our+0.266
igs+0.251
 Majesty+0.250
 my+0.250
Neg logit contributions
 meant-0.236
Don-0.214
 needed-0.200
 faded-0.199
chy-0.197
Token walked
Token ID6807
Feature activation-6.49e+01

Pos logit contributions
ig+0.355
igs+0.352
ric+0.337
 our+0.331
 Majesty+0.328
Neg logit contributions
Don-0.265
jar-0.239
 meant-0.232
books-0.222
Wow-0.219
Token to
Token ID284
Feature activation-5.98e+01

Pos logit contributions
igs+0.234
ig+0.226
 our+0.224
ap+0.207
 victory+0.205
Neg logit contributions
 meant-0.389
Don-0.382
 needed-0.349
jar-0.335
 said-0.332
Token't
Token ID470
Feature activation-8.29e+01

Pos logit contributions
Don+0.248
 meant+0.244
 said+0.224
 needed+0.217
 whispered+0.213
Neg logit contributions
igs-0.220
ig-0.210
ric-0.209
 our-0.191
ap-0.188
Token know
Token ID760
Feature activation-7.47e+01

Pos logit contributions
igs+0.443
ig+0.425
ric+0.414
latable+0.374
amental+0.368
Neg logit contributions
 meant-0.417
Don-0.393
 said-0.391
 needed-0.370
 whispered-0.358
Token the
Token ID262
Feature activation-7.28e+01

Pos logit contributions
igs+0.284
ig+0.273
ric+0.267
ap+0.235
chie+0.231
Neg logit contributions
 meant-0.454
Don-0.445
 said-0.425
 needed-0.410
 whispered-0.401
Token rh
Token ID9529
Feature activation-1.01e+02

Pos logit contributions
igs+0.359
ig+0.346
ric+0.343
 our+0.324
ap+0.313
Neg logit contributions
Don-0.353
 meant-0.343
 said-0.311
 needed-0.302
 whispered-0.298
Tokenin
Token ID259
Feature activation-9.01e+01

Pos logit contributions
igs+0.486
latable+0.465
 Majesty+0.449
ric+0.449
ig+0.434
Neg logit contributions
 meant-0.486
Don-0.478
 needed-0.458
 said-0.454
 whispered-0.403
Tokenoc
Token ID420
Feature activation-1.12e+02

Pos logit contributions
ig+0.315
ric+0.309
igs+0.305
 our+0.283
 victory+0.280
Neg logit contributions
Don-0.551
 meant-0.513
 She-0.478
jar-0.475
 whispered-0.471
Tokeneros
Token ID27498
Feature activation-8.38e+01

Pos logit contributions
igs+0.349
latable+0.292
ig+0.278
 our+0.271
ric+0.259
Neg logit contributions
 meant-0.725
Don-0.689
 needed-0.681
 said-0.681
 man-0.624
Token was
Token ID373
Feature activation-6.87e+01

Pos logit contributions
ig+0.465
 our+0.461
igs+0.453
ric+0.451
 victory+0.441
Neg logit contributions
Don-0.325
jar-0.275
Grand-0.267
Wow-0.263
books-0.262
Token wild
Token ID4295
Feature activation-7.18e+01

Pos logit contributions
igs+0.324
ric+0.312
ig+0.307
 our+0.304
 victory+0.288
Neg logit contributions
Don-0.345
 meant-0.324
 said-0.296
 whispered-0.287
books-0.286
Token.
Token ID13
Feature activation-5.01e+01

Pos logit contributions
igs+0.430
ig+0.412
ric+0.407
 our+0.383
ap+0.377
Neg logit contributions
 meant-0.284
Don-0.279
 said-0.248
 needed-0.240
 whispered-0.229
Token Can
Token ID1680
Feature activation-5.47e+01

Pos logit contributions
ric+0.264
 our+0.261
 my+0.248
ap+0.243
igs+0.240
Neg logit contributions
 meant-0.207
Don-0.193
 needed-0.175
chy-0.175
books-0.175
Token
Token ID129
Feature activation-1.15e+02

Pos logit contributions
igs+0.754
ig+0.705
ric+0.684
 our+0.671
ode+0.649
Neg logit contributions
Don-0.480
 meant-0.457
 said-0.436
 needed-0.401
 She-0.392
Token
Token ID241
Feature activation-9.51e+01

Pos logit contributions
igs+0.493
ig+0.471
ric+0.470
 our+0.434
ap+0.422
Neg logit contributions
Don-0.632
 meant-0.623
 said-0.568
 needed-0.561
 whispered-0.547
TokenWho
Token ID8241
Feature activation-7.65e+01

Pos logit contributions
ric+0.607
igs+0.587
ig+0.573
ap+0.557
 our+0.556
Neg logit contributions
 meant-0.300
Don-0.288
 said-0.259
 whispered-0.258
 needed-0.250
Tokenâ
Token ID22940
Feature activation-8.98e+01

Pos logit contributions
igs+0.445
ig+0.426
ric+0.423
 our+0.394
 Majesty+0.386
Neg logit contributions
 meant-0.313
Don-0.309
 said-0.280
 needed-0.269
 whispered-0.259
Token
Token ID26391
Feature activation-9.07e+01

Pos logit contributions
igs+0.372
ig+0.357
ric+0.353
 our+0.324
ap+0.318
Neg logit contributions
Don-0.510
 meant-0.500
 said-0.460
 needed-0.450
 whispered-0.443
Token
Token ID8151
Feature activation-1.09e+02

Pos logit contributions
igs+0.481
ig+0.464
ric+0.460
 our+0.433
ap+0.425
Neg logit contributions
Don-0.408
 meant-0.399
 said-0.357
 needed-0.347
 whispered-0.340
Tokens
Token ID82
Feature activation-8.19e+01

Pos logit contributions
ric+0.615
ig+0.567
igs+0.541
ap+0.535
 our+0.506
Neg logit contributions
Don-0.466
 meant-0.407
 whispered-0.390
jar-0.387
 wondered-0.387
Token there
Token ID612
Feature activation-6.95e+01

Pos logit contributions
igs+0.359
ig+0.339
ric+0.323
 our+0.299
 arrest+0.292
Neg logit contributions
 meant-0.460
Don-0.444
 said-0.425
 needed-0.411
 whispered-0.396
Token?
Token ID30
Feature activation-5.91e+01

Pos logit contributions
ric+0.246
igs+0.240
ig+0.239
 our+0.232
ap+0.220
Neg logit contributions
Don-0.434
 meant-0.399
 whispered-0.371
 wondered-0.365
 needed-0.365
Tokenâ
Token ID22940
Feature activation-7.45e+01

Pos logit contributions
ric+0.305
igs+0.295
ig+0.292
 our+0.283
ap+0.279
Neg logit contributions
 meant-0.253
Don-0.249
 said-0.223
 needed-0.219
 whispered-0.218
Token
Token ID26391
Feature activation-7.50e+01

Pos logit contributions
ric+0.285
ig+0.282
igs+0.273
ap+0.249
chie+0.232
Neg logit contributions
Don-0.442
 meant-0.421
 needed-0.392
 said-0.383
jar-0.378