TOP ACTIVATIONS
MAX = 0.545

When Daisy put the magn ifying glass down , the cater
John chose some carrots and cuc umbers to take home .
those in need â even <|endoftext|> One day
I â m sorry Jane . I
reward . It was ch ocol ates and sweets !
quiet compared to the noisy se ag ull s outside .
- a - bo o to make him laugh . Benny
It â s almost just as beautiful
It â s a magic book !
like lettuce , tomatoes and cuc umbers . He smiled as
That â s a special necklace ,
by one 1 , 2 ,
The town spe ople had to find a
Let â s go find something else
't know the rh in oc eros was wild . Can
Suddenly , Bug gy heard a loud noise .
Everyday he would take the av oc ados and scatter them
Who â s there ? â
That â s right , Emma .
That â s the gas from the

MIN ACTIVATIONS
MIN = -0.243

help and waited for the ambulance . L ily
them the umbrella and the coat . She said , "
his magical experience with the fog . <|endoftext|> Once there was
" Yes , sir !" Lily said .
. He thinks maybe the hat is in the barn .
. As the fog filled the port , everything
Bob . The elf said , " G reetings
John and Sarah helped the ambulance driver until the baby was
Tim my watched as the ambulance drove away , hoping the
The little girl thanked the elf and waved goodbye as <|endoftext|>
and flew her to the waterfall . It was even brighter
Lucy smiled and knew the taxi driver had been right .
sorry . We rang the bell and made the park keeper
felt better and thanked the dentist . When she left the
mom and showed her the crown . Mom was
her leg and called an ambulance . He told Tom to
turns and listen to the bell next time .
fun . They hoped the car ride would be over soon
down and watched as the bus drove around . Everywhere she
Everyone wanted to share the treasure together . Now , the

INTERVAL 0.348 - 0.545
CONTAINS 0.419%

John chose some carrots and cuc umbers to take home .
Suddenly , Bug gy heard a loud noise .
Let â s go find something else
on the table in the living room . It is big
Let â s explore and have some

INTERVAL 0.151 - 0.348
CONTAINS 11.441%

So he put the banana in a bowl of water and
He loved to store food in his tunnel for the winter
it . She held onto the truck and hugged it tight
she decided to bake a pie . She put the ac
other . They were glad they had a park . <|endoftext|>

INTERVAL -0.046 - 0.151
CONTAINS 46.616%

perfect ," said her grandma . " Would you like to
and opened the zip . They saw many animals . There
you whistle , Lily ?" Tom asked . "
into the garden . She used the magnet to pick up
of her finger . L ily 's mom hears

INTERVAL -0.243 - -0.046
CONTAINS 41.524%

as much as you want . And the puppy is yours
Bunny . She is very small and soft . She does
and relax ." So , Benny took a deep
with his zoom cars quietly and waits for his mom .
started to panic because she didn 't want to lose her
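
The four interval boundaries listed above appear to be equal-width bins spanning MIN (-0.243) to MAX (0.545). This is an inference from the listed numbers, not something the dump states. The minimal sketch below recomputes the edges under that assumption and checks that the CONTAINS percentages sum to 100; the function name `equal_width_edges` and the variable names are hypothetical.

```python
# Sketch only: assumes the dashboard splits [MIN, MAX] into four equal-width bins.
def equal_width_edges(lo, hi, n_bins):
    """Return n_bins + 1 bin edges from lo to hi, rounded to 3 decimals."""
    width = (hi - lo) / n_bins
    return [round(lo + i * width, 3) for i in range(n_bins + 1)]

# MIN and MAX as listed in the dump.
edges = equal_width_edges(-0.243, 0.545, 4)

# CONTAINS percentages as listed, ordered from lowest interval to highest.
shares = [41.524, 46.616, 11.441, 0.419]
total_share = sum(shares)

print(edges)
print(round(total_share, 3))
```

If the assumption holds, the printed edges should match the interval boundaries in the dump (-0.243, -0.046, 0.151, 0.348, 0.545), and the shares should account for every sampled token.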
Token When
Token ID1649
Feature activation+2.73e-01

Pos logit contributions
 our+0.461
 my+0.437
 your+0.436
ap+0.369
 exercise+0.365
Neg logit contributions
books-0.347
 meant-0.339
 whispered-0.313
jar-0.301
 faded-0.293
Token Daisy
Token ID40355
Feature activation+2.50e-01

Pos logit contributions
 our+0.421
 my+0.385
 your+0.380
ap+0.349
 arrest+0.349
Neg logit contributions
 meant-0.338
books-0.331
 whispered-0.324
 spoke-0.304
 said-0.302
Token put
Token ID1234
Feature activation+2.53e-01

Pos logit contributions
 our+0.399
 my+0.388
 your+0.385
 dive+0.363
 practice+0.350
Neg logit contributions
books-0.320
jar-0.285
lessness-0.278
post-0.264
 meant-0.261
Token the
Token ID262
Feature activation+2.71e-01

Pos logit contributions
 our+0.352
 my+0.314
 your+0.312
 arrest+0.287
ap+0.281
Neg logit contributions
 meant-0.361
books-0.358
 whispered-0.346
 said-0.320
 spoke-0.319
Token magn
Token ID7842
Feature activation+2.55e-01

Pos logit contributions
 our+0.385
 my+0.352
 your+0.351
 exercise+0.308
 arrest+0.299
Neg logit contributions
books-0.387
 meant-0.383
 whispered-0.365
jar-0.343
 spoke-0.343
Tokenifying
Token ID4035
Feature activation+3.90e-01

Pos logit contributions
 our+0.256
 my+0.229
 your+0.222
ap+0.185
 dive+0.176
Neg logit contributions
books-0.465
 meant-0.462
 whispered-0.444
Don-0.427
jar-0.423
Token glass
Token ID5405
Feature activation+2.03e-01

Pos logit contributions
 our+0.612
 exercise+0.578
 your+0.555
ap+0.541
 arrest+0.540
Neg logit contributions
 meant-0.413
books-0.406
 whispered-0.394
 said-0.393
jar-0.387
Token down
Token ID866
Feature activation+2.21e-01

Pos logit contributions
 our+0.252
 my+0.236
 your+0.233
ap+0.184
 dive+0.181
Neg logit contributions
books-0.324
 meant-0.318
Don-0.308
 whispered-0.303
jar-0.296
Token,
Token ID11
Feature activation+2.46e-01

Pos logit contributions
 our+0.267
 my+0.249
 your+0.248
 these+0.185
 victory+0.184
Neg logit contributions
books-0.381
 meant-0.367
 whispered-0.354
jar-0.340
 spoke-0.337
Token the
Token ID262
Feature activation+2.36e-01

Pos logit contributions
 our+0.517
ric+0.477
 my+0.475
ap+0.473
 your+0.468
Neg logit contributions
 meant-0.179
 whispered-0.163
books-0.152
 said-0.151
 spoke-0.145
Token cater
Token ID20825
Feature activation+2.09e-01

Pos logit contributions
 our+0.334
 my+0.313
 your+0.307
 exercise+0.275
 victory+0.270
Neg logit contributions
books-0.335
 meant-0.327
 whispered-0.304
Don-0.290
 spoke-0.287
Token John
Token ID1757
Feature activation+3.40e-01

Pos logit contributions
jar+0.239
books+0.227
post+0.205
book+0.202
ooky+0.197
Neg logit contributions
 our-0.286
 spend-0.281
 practice-0.278
ap-0.272
 how-0.269
Token chose
Token ID7690
Feature activation+3.01e-01

Pos logit contributions
 our+0.632
 my+0.588
 your+0.587
ap+0.529
 arrest+0.529
Neg logit contributions
books-0.334
 meant-0.319
 whispered-0.295
jar-0.288
Don-0.270
Token some
Token ID617
Feature activation+2.99e-01

Pos logit contributions
 our+0.325
 my+0.285
 your+0.283
 arrest+0.238
 exercise+0.237
Neg logit contributions
 meant-0.525
books-0.521
 whispered-0.505
 spoke-0.477
jar-0.474
Token carrots
Token ID34397
Feature activation+2.48e-01

Pos logit contributions
 our+0.413
 my+0.375
 your+0.374
 exercise+0.327
 arrest+0.317
Neg logit contributions
books-0.433
 meant-0.428
 whispered-0.411
jar-0.390
 spoke-0.386
Token and
Token ID290
Feature activation+2.96e-01

Pos logit contributions
 our+0.360
 your+0.329
 my+0.327
 exercise+0.283
ap+0.280
Neg logit contributions
books-0.346
 meant-0.336
 whispered-0.324
jar-0.314
 spoke-0.306
Token cuc
Token ID38421
Feature activation+3.87e-01

Pos logit contributions
 our+0.469
 my+0.429
 your+0.425
 arrest+0.392
ap+0.391
Neg logit contributions
 meant-0.370
 whispered-0.356
books-0.348
 said-0.336
 spoke-0.328
Tokenumbers
Token ID17024
Feature activation+2.05e-01

Pos logit contributions
 our+0.689
 my+0.638
 your+0.624
 arrest+0.582
 exercise+0.577
Neg logit contributions
books-0.418
 meant-0.405
 whispered-0.379
l-0.360
book-0.357
Token to
Token ID284
Feature activation+2.86e-01

Pos logit contributions
 our+0.306
 your+0.283
 my+0.277
 dive+0.235
 exercise+0.233
Neg logit contributions
books-0.278
jar-0.270
 meant-0.262
 whispered-0.261
Don-0.257
Token take
Token ID1011
Feature activation+2.41e-01

Pos logit contributions
 our+0.335
 my+0.290
ap+0.289
 your+0.279
 arrest+0.275
Neg logit contributions
 meant-0.427
 said-0.424
 whispered-0.418
books-0.411
 spoke-0.389
Token home
Token ID1363
Feature activation+2.04e-01

Pos logit contributions
 our+0.269
 my+0.238
 your+0.236
 exercise+0.201
 arrest+0.201
Neg logit contributions
books-0.408
 meant-0.408
 whispered-0.392
 spoke-0.370
 said-0.367
Token.
Token ID13
Feature activation+2.06e-01

Pos logit contributions
 our+0.350
 my+0.326
 your+0.322
 arrest+0.298
 exercise+0.294
Neg logit contributions
books-0.221
 meant-0.219
 whispered-0.207
 said-0.189
l-0.187
Token those
Token ID883
Feature activation+4.31e-01

Pos logit contributions
books+0.173
 meant+0.171
 whispered+0.167
 spoke+0.152
 widened+0.152
Neg logit contributions
 our-0.265
 my-0.244
 your-0.244
 exercise-0.213
 arrest-0.212
Token in
Token ID287
Feature activation+4.02e-01

Pos logit contributions
 our+0.585
 my+0.530
 your+0.523
 arrest+0.480
 exercise+0.473
Neg logit contributions
 meant-0.622
books-0.599
 whispered-0.585
 said-0.551
 spoke-0.541
Token need
Token ID761
Feature activation+3.33e-01

Pos logit contributions
 our+0.416
 my+0.361
 your+0.356
 exercise+0.330
ap+0.321
Neg logit contributions
 meant-0.687
books-0.666
 whispered-0.651
 said-0.617
jar-0.611
Tokenâ
Token ID22940
Feature activation+3.20e-01

Pos logit contributions
 our+0.362
 my+0.316
 your+0.311
 arrest+0.278
ap+0.277
Neg logit contributions
 meant-0.569
books-0.557
 whispered-0.544
 said-0.513
 spoke-0.512
Token
Token ID26391
Feature activation+3.21e-01

Pos logit contributions
 our+0.441
 my+0.403
 your+0.403
 exercise+0.360
ap+0.356
Neg logit contributions
books-0.458
 meant-0.450
 whispered-0.439
jar-0.417
 spoke-0.410
Token
Token ID447
Feature activation+3.71e-01

Pos logit contributions
 our+0.477
 my+0.436
 your+0.435
 exercise+0.382
 arrest+0.380
Neg logit contributions
books-0.430
 meant-0.430
 whispered-0.410
jar-0.384
 spoke-0.381
Token
Token ID251
Feature activation+2.57e-01

Pos logit contributions
 our+0.323
 my+0.299
 your+0.298
ap+0.266
 practice+0.236
Neg logit contributions
books-0.712
 meant-0.710
 whispered-0.657
book-0.642
 spoke-0.622
Tokeneven
Token ID10197
Feature activation+2.42e-01

Pos logit contributions
 our+0.423
 your+0.409
 my+0.404
ap+0.349
 exercise+0.341
Neg logit contributions
books-0.321
 meant-0.296
jar-0.282
 whispered-0.263
 faded-0.261
Token<|endoftext|>
Token ID50256
Feature activation+2.22e-01

Pos logit contributions
 our+0.405
 my+0.372
 your+0.371
 exercise+0.333
 arrest+0.331
Neg logit contributions
 meant-0.283
books-0.281
 whispered-0.268
 spoke-0.246
jar-0.244
TokenOne
Token ID3198
Feature activation+2.15e-01

Pos logit contributions
 our+0.343
 your+0.330
 my+0.328
ap+0.285
 exercise+0.283
Neg logit contributions
books-0.256
 meant-0.255
 whispered-0.250
jar-0.237
 faded-0.234
Token day
Token ID1110
Feature activation+2.23e-01

Pos logit contributions
 our+0.369
 my+0.343
 your+0.342
 exercise+0.309
ap+0.303
Neg logit contributions
books-0.241
 meant-0.237
 whispered-0.225
jar-0.209
 spoke-0.206
Token
Token ID129
Feature activation+4.75e-01

Pos logit contributions
books+0.238
jar+0.230
post+0.201
 faded+0.184
 meant+0.183
Neg logit contributions
 our-0.329
ap-0.316
 your-0.309
 my-0.298
 how-0.294
Token
Token ID241
Feature activation+4.01e-01

Pos logit contributions
 our+0.664
 my+0.590
 your+0.586
 dive+0.530
 exercise+0.529
Neg logit contributions
books-0.671
 meant-0.667
 whispered-0.643
 said-0.601
Don-0.598
TokenI
Token ID40
Feature activation+3.44e-01

Pos logit contributions
 our+0.744
 your+0.692
 my+0.692
 exercise+0.627
ap+0.619
Neg logit contributions
books-0.390
 meant-0.386
 whispered-0.367
jar-0.333
 spoke-0.331
Tokenâ
Token ID22940
Feature activation+3.38e-01

Pos logit contributions
 our+0.435
 your+0.404
 my+0.397
 exercise+0.348
ap+0.347
Neg logit contributions
books-0.544
 meant-0.505
jar-0.491
 whispered-0.490
post-0.471
Token
Token ID26391
Feature activation+3.37e-01

Pos logit contributions
 our+0.464
 my+0.420
 your+0.418
 exercise+0.363
 arrest+0.360
Neg logit contributions
books-0.497
 meant-0.495
 whispered-0.475
jar-0.446
 spoke-0.445
Token
Token ID8151
Feature activation+3.68e-01

Pos logit contributions
 our+0.514
 my+0.469
 your+0.466
 exercise+0.418
 victory+0.408
Neg logit contributions
books-0.443
 meant-0.434
 whispered-0.419
 spoke-0.392
 said-0.390
Tokenm
Token ID76
Feature activation+2.91e-01

Pos logit contributions
 our+0.580
 your+0.563
 my+0.549
ric+0.521
ap+0.518
Neg logit contributions
 whispered-0.394
jar-0.386
 faded-0.373
books-0.368
 meant-0.366
Token sorry
Token ID7926
Feature activation+2.66e-01

Pos logit contributions
 our+0.438
 my+0.402
 your+0.401
 exercise+0.351
ap+0.349
Neg logit contributions
books-0.390
 meant-0.384
 whispered-0.368
jar-0.348
 spoke-0.344
Token Jane
Token ID12091
Feature activation+2.30e-01

Pos logit contributions
 our+0.422
 my+0.388
 your+0.387
 exercise+0.343
ap+0.338
Neg logit contributions
books-0.334
 meant-0.332
 whispered-0.316
jar-0.295
 spoke-0.293
Token.
Token ID13
Feature activation+2.34e-01

Pos logit contributions
 our+0.371
 your+0.353
 my+0.352
ap+0.303
 exercise+0.295
Neg logit contributions
books-0.302
jar-0.277
post-0.253
Don-0.250
 widened-0.239
Token I
Token ID314
Feature activation+2.35e-01

Pos logit contributions
 our+0.327
 my+0.302
 your+0.299
 exercise+0.254
ap+0.253
Neg logit contributions
books-0.338
 meant-0.335
 whispered-0.320
 spoke-0.301
jar-0.300
Token reward
Token ID6721
Feature activation+3.27e-01

Pos logit contributions
 our+0.563
 my+0.510
 your+0.509
 exercise+0.461
 arrest+0.458
Neg logit contributions
 meant-0.480
books-0.455
 whispered-0.451
 said-0.421
 spoke-0.414
Token.
Token ID13
Feature activation+2.70e-01

Pos logit contributions
 our+0.486
 my+0.444
 your+0.444
 exercise+0.385
ap+0.377
Neg logit contributions
books-0.444
 meant-0.437
 whispered-0.419
jar-0.400
 spoke-0.392
Token It
Token ID632
Feature activation+2.47e-01

Pos logit contributions
 our+0.416
 my+0.395
 your+0.389
ap+0.329
 exercise+0.318
Neg logit contributions
books-0.357
 meant-0.353
 whispered-0.332
jar-0.312
l-0.311
Token was
Token ID373
Feature activation+2.62e-01

Pos logit contributions
 our+0.433
 my+0.415
 your+0.404
 dive+0.355
 exercise+0.349
Neg logit contributions
books-0.300
jar-0.260
oled-0.246
book-0.235
post-0.234
Token ch
Token ID442
Feature activation+2.39e-01

Pos logit contributions
 our+0.306
 my+0.283
 your+0.277
 victory+0.213
 exercise+0.213
Neg logit contributions
books-0.462
 whispered-0.432
 meant-0.430
jar-0.422
Don-0.419
Tokenocol
Token ID4668
Feature activation+3.67e-01

Pos logit contributions
 our+0.292
 your+0.256
 my+0.254
 exercise+0.223
 arrest+0.217
Neg logit contributions
 meant-0.390
books-0.385
 whispered-0.372
l-0.352
jar-0.350
Tokenates
Token ID689
Feature activation+2.12e-01

Pos logit contributions
 our+0.481
 my+0.431
 your+0.426
 exercise+0.394
 dive+0.374
Neg logit contributions
books-0.564
 meant-0.551
 whispered-0.525
book-0.504
 spoke-0.491
Token and
Token ID290
Feature activation+2.56e-01

Pos logit contributions
 our+0.289
 my+0.266
 your+0.266
ap+0.224
 exercise+0.221
Neg logit contributions
books-0.313
 meant-0.307
 whispered-0.299
Don-0.291
jar-0.289
Token sweets
Token ID42402
Feature activation+1.76e-01

Pos logit contributions
 our+0.384
 my+0.352
 your+0.351
 exercise+0.307
 arrest+0.302
Neg logit contributions
books-0.343
 meant-0.339
 whispered-0.323
jar-0.305
 spoke-0.300
Token!
Token ID0
Feature activation+2.10e-01

Pos logit contributions
 our+0.225
 my+0.209
 your+0.208
 exercise+0.178
 how+0.169
Neg logit contributions
books-0.254
Don-0.254
 whispered-0.248
 meant-0.246
jar-0.243
Token
Token ID198
Feature activation+1.83e-01

Pos logit contributions
 our+0.311
 my+0.292
 your+0.289
ap+0.241
 arrest+0.239
Neg logit contributions
books-0.294
 meant-0.287
 whispered-0.269
jar-0.257
 spoke-0.255
Token quiet
Token ID5897
Feature activation+3.32e-01

Pos logit contributions
 meant+0.169
 whispered+0.161
books+0.161
 said+0.150
jar+0.148
Neg logit contributions
 our-0.218
 your-0.197
 my-0.194
 arrest-0.180
ap-0.178
Token compared
Token ID3688
Feature activation+3.44e-01

Pos logit contributions
 our+0.586
 my+0.543
 your+0.542
 exercise+0.488
 arrest+0.483
Neg logit contributions
books-0.353
 meant-0.349
 whispered-0.329
jar-0.304
 spoke-0.301
Token to
Token ID284
Feature activation+3.26e-01

Pos logit contributions
 our+0.420
 my+0.374
 your+0.369
 exercise+0.329
ap+0.328
Neg logit contributions
 meant-0.544
books-0.536
 whispered-0.519
 spoke-0.486
 said-0.485
Token the
Token ID262
Feature activation+3.12e-01

Pos logit contributions
 our+0.362
 my+0.320
 your+0.316
 arrest+0.266
ap+0.265
Neg logit contributions
 meant-0.555
books-0.551
 whispered-0.534
 spoke-0.504
jar-0.500
Token noisy
Token ID31210
Feature activation+2.60e-01

Pos logit contributions
 our+0.455
 my+0.416
 your+0.414
 exercise+0.367
 arrest+0.356
Neg logit contributions
books-0.431
 meant-0.424
 whispered-0.407
 spoke-0.381
jar-0.380
Token se
Token ID384
Feature activation+3.64e-01

Pos logit contributions
 our+0.378
 my+0.345
 your+0.344
 exercise+0.304
 arrest+0.298
Neg logit contributions
books-0.359
 meant-0.355
 whispered-0.341
jar-0.321
 spoke-0.317
Tokenag
Token ID363
Feature activation+2.53e-01

Pos logit contributions
 our+0.550
 my+0.504
 your+0.490
 arrest+0.444
 dive+0.430
Neg logit contributions
books-0.498
 meant-0.486
 whispered-0.460
l-0.453
book-0.439
Tokenull
Token ID724
Feature activation+2.07e-01

Pos logit contributions
 our+0.410
 my+0.379
 your+0.376
 exercise+0.340
 arrest+0.336
Neg logit contributions
books-0.305
 meant-0.303
 whispered-0.285
 spoke-0.264
jar-0.261
Tokens
Token ID82
Feature activation+2.68e-01

Pos logit contributions
 our+0.355
 my+0.327
 your+0.327
ap+0.288
 exercise+0.286
Neg logit contributions
books-0.244
 meant-0.229
jar-0.218
 whispered-0.214
Don-0.205
Token outside
Token ID2354
Feature activation+2.38e-01

Pos logit contributions
 our+0.400
 your+0.366
 my+0.359
ap+0.330
 victory+0.314
Neg logit contributions
books-0.379
 meant-0.355
jar-0.343
 whispered-0.336
Don-0.330
Token.
Token ID13
Feature activation+2.15e-01

Pos logit contributions
 our+0.383
 your+0.358
 my+0.352
ap+0.296
 exercise+0.293
Neg logit contributions
books-0.317
 whispered-0.297
 meant-0.296
jar-0.283
Don-0.276
Token-
Token ID12
Feature activation+5.07e-01

Pos logit contributions
 said+0.252
books+0.244
l+0.237
 whispered+0.234
 meant+0.224
Neg logit contributions
latable-0.129
ric-0.110
igsaw-0.110
 backup-0.110
 arrest-0.107
Tokena
Token ID64
Feature activation+3.63e-01

Pos logit contributions
 our+0.832
 my+0.735
 your+0.730
ap+0.720
 dive+0.709
Neg logit contributions
 meant-0.611
 whispered-0.585
books-0.576
Don-0.556
l-0.555
Token-
Token ID12
Feature activation+4.19e-01

Pos logit contributions
 our+0.541
 my+0.510
 your+0.507
ap+0.463
 arrest+0.452
Neg logit contributions
books-0.496
 meant-0.457
jar-0.453
Don-0.430
 whispered-0.414
Tokenbo
Token ID2127
Feature activation+4.00e-01

Pos logit contributions
 our+0.670
 your+0.589
 my+0.587
 exercise+0.572
ap+0.559
Neg logit contributions
 meant-0.506
 whispered-0.499
books-0.488
 spoke-0.460
 said-0.460
Tokeno
Token ID78
Feature activation+3.10e-01

Pos logit contributions
 our+0.444
ric+0.433
 your+0.421
 my+0.398
ap+0.376
Neg logit contributions
books-0.679
jar-0.654
 widened-0.599
Don-0.594
 meant-0.593
Token to
Token ID284
Feature activation+3.64e-01

Pos logit contributions
 our+0.453
 my+0.433
 your+0.430
ric+0.407
ap+0.382
Neg logit contributions
books-0.436
jar-0.415
Don-0.368
 meant-0.366
post-0.361
Token make
Token ID787
Feature activation+3.11e-01

Pos logit contributions
igs+0.405
 our+0.392
latable+0.355
ric+0.350
ap+0.339
Neg logit contributions
 said-0.551
 whispered-0.550
 meant-0.540
 spoke-0.503
books-0.472
Token him
Token ID683
Feature activation+2.58e-01

Pos logit contributions
 our+0.361
 my+0.324
 your+0.314
 arrest+0.299
ap+0.293
Neg logit contributions
 meant-0.491
books-0.475
 whispered-0.471
 said-0.454
l-0.444
Token laugh
Token ID6487
Feature activation+2.54e-01

Pos logit contributions
 our+0.351
 my+0.319
 your+0.314
 arrest+0.290
ap+0.269
Neg logit contributions
 meant-0.373
books-0.358
 whispered-0.358
 said-0.333
 spoke-0.330
Token.
Token ID13
Feature activation+2.32e-01

Pos logit contributions
 our+0.378
 my+0.355
 your+0.351
 how+0.286
ap+0.285
Neg logit contributions
jar-0.373
books-0.351
Don-0.344
 whispered-0.343
 spoke-0.340
Token Benny
Token ID44275
Feature activation+2.07e-01

Pos logit contributions
 our+0.363
 my+0.347
 your+0.342
ap+0.286
 exercise+0.284
Neg logit contributions
books-0.302
 meant-0.301
 whispered-0.279
jar-0.266
 spoke-0.266
Token
Token ID129
Feature activation+4.75e-01

Pos logit contributions
 meant+0.249
books+0.242
 whispered+0.234
 spoke+0.222
jar+0.220
Neg logit contributions
 our-0.236
 your-0.214
 my-0.211
 exercise-0.185
ap-0.175
Token
Token ID241
Feature activation+4.01e-01

Pos logit contributions
 our+0.664
 my+0.590
 your+0.586
 dive+0.530
 exercise+0.529
Neg logit contributions
books-0.671
 meant-0.667
 whispered-0.643
 said-0.601
Don-0.598
TokenIt
Token ID1026
Feature activation+3.05e-01

Pos logit contributions
 our+0.744
 your+0.692
 my+0.692
 exercise+0.627
ap+0.619
Neg logit contributions
books-0.390
 meant-0.386
 whispered-0.367
jar-0.333
 spoke-0.331
Tokenâ
Token ID22940
Feature activation+3.28e-01

Pos logit contributions
 our+0.484
 my+0.445
 your+0.444
 exercise+0.394
ap+0.388
Neg logit contributions
books-0.379
 meant-0.375
 whispered-0.358
jar-0.335
 spoke-0.331
Token
Token ID26391
Feature activation+3.30e-01

Pos logit contributions
 our+0.448
 my+0.408
 your+0.407
 exercise+0.355
ap+0.354
Neg logit contributions
books-0.480
 meant-0.475
 whispered-0.460
jar-0.434
 spoke-0.431
Token
Token ID8151
Feature activation+3.63e-01

Pos logit contributions
 our+0.497
 my+0.454
 your+0.451
 exercise+0.401
 arrest+0.393
Neg logit contributions
books-0.437
 meant-0.432
 whispered-0.413
jar-0.386
 spoke-0.385
Tokens
Token ID82
Feature activation+3.12e-01

Pos logit contributions
 our+0.566
 your+0.549
 my+0.532
ric+0.528
ap+0.509
Neg logit contributions
 whispered-0.397
jar-0.389
 faded-0.370
books-0.367
 wished-0.365
Token almost
Token ID2048
Feature activation+2.75e-01

Pos logit contributions
 our+0.432
 my+0.391
 your+0.390
 exercise+0.340
 arrest+0.339
Neg logit contributions
 meant-0.449
books-0.448
 whispered-0.429
jar-0.402
 spoke-0.401
Token just
Token ID655
Feature activation+2.83e-01

Pos logit contributions
 our+0.372
 my+0.342
 your+0.342
 exercise+0.288
 dive+0.283
Neg logit contributions
books-0.425
 meant-0.404
 whispered-0.389
jar-0.381
Don-0.372
Token as
Token ID355
Feature activation+2.90e-01

Pos logit contributions
 our+0.380
 my+0.346
 your+0.346
 exercise+0.296
ap+0.289
Neg logit contributions
books-0.429
 meant-0.417
 whispered-0.400
jar-0.385
 spoke-0.376
Token beautiful
Token ID4950
Feature activation+2.58e-01

Pos logit contributions
 our+0.402
 your+0.371
 my+0.369
 exercise+0.316
 dive+0.299
Neg logit contributions
books-0.438
 meant-0.422
 whispered-0.408
jar-0.388
 spoke-0.386
Token
Token ID129
Feature activation+4.75e-01

Pos logit contributions
books+0.174
 meant+0.160
 whispered+0.147
jar+0.144
 spoke+0.134
Neg logit contributions
 our-0.309
 my-0.292
 your-0.290
 arrest-0.256
 exercise-0.255
Token
Token ID241
Feature activation+4.01e-01

Pos logit contributions
 our+0.664
 my+0.590
 your+0.586
 dive+0.530
 exercise+0.529
Neg logit contributions
books-0.671
 meant-0.667
 whispered-0.643
 said-0.601
Don-0.598
TokenIt
Token ID1026
Feature activation+3.05e-01

Pos logit contributions
 our+0.744
 your+0.692
 my+0.692
 exercise+0.627
ap+0.619
Neg logit contributions
books-0.390
 meant-0.386
 whispered-0.367
jar-0.333
 spoke-0.331
Tokenâ
Token ID22940
Feature activation+3.28e-01

Pos logit contributions
 our+0.484
 my+0.445
 your+0.444
 exercise+0.394
ap+0.388
Neg logit contributions
books-0.379
 meant-0.375
 whispered-0.358
jar-0.335
 spoke-0.331
Token
Token ID26391
Feature activation+3.30e-01

Pos logit contributions
 our+0.448
 my+0.408
 your+0.407
 exercise+0.355
ap+0.354
Neg logit contributions
books-0.480
 meant-0.475
 whispered-0.460
jar-0.434
 spoke-0.431
Token
Token ID8151
Feature activation+3.63e-01

Pos logit contributions
 our+0.497
 my+0.454
 your+0.451
 exercise+0.401
 arrest+0.393
Neg logit contributions
books-0.437
 meant-0.432
 whispered-0.413
jar-0.386
 spoke-0.385
Tokens
Token ID82
Feature activation+3.12e-01

Pos logit contributions
 our+0.566
 your+0.549
 my+0.532
ric+0.528
ap+0.509
Neg logit contributions
 whispered-0.397
jar-0.389
 faded-0.370
books-0.367
 wished-0.365
Token a
Token ID257
Feature activation+3.01e-01

Pos logit contributions
 our+0.432
 my+0.391
 your+0.390
 exercise+0.340
 arrest+0.339
Neg logit contributions
 meant-0.449
books-0.448
 whispered-0.429
jar-0.402
 spoke-0.401
Token magic
Token ID5536
Feature activation+2.35e-01

Pos logit contributions
 our+0.448
 my+0.411
 your+0.409
 exercise+0.363
 dive+0.352
Neg logit contributions
books-0.405
 meant-0.403
 whispered-0.384
 spoke-0.357
jar-0.357
Token book
Token ID1492
Feature activation+2.10e-01

Pos logit contributions
 our+0.331
 my+0.306
 your+0.304
 exercise+0.266
 dive+0.260
Neg logit contributions
books-0.329
 meant-0.327
 whispered-0.314
Don-0.301
jar-0.295
Token!
Token ID0
Feature activation+2.39e-01

Pos logit contributions
 our+0.280
 my+0.272
 your+0.268
 exercise+0.218
 how+0.214
Neg logit contributions
Don-0.305
books-0.291
 whispered-0.283
 widened-0.280
jar-0.278
Token like
Token ID588
Feature activation+3.16e-01

Pos logit contributions
 our+0.450
 your+0.412
 my+0.410
 exercise+0.348
 dive+0.338
Neg logit contributions
books-0.433
 meant-0.411
 whispered-0.404
jar-0.395
Don-0.386
Token lettuce
Token ID39406
Feature activation+2.68e-01

Pos logit contributions
 our+0.385
 my+0.345
 your+0.343
 exercise+0.291
 arrest+0.286
Neg logit contributions
books-0.510
 meant-0.507
 whispered-0.489
jar-0.464
 spoke-0.461
Token,
Token ID11
Feature activation+2.84e-01

Pos logit contributions
 our+0.379
 your+0.346
 my+0.344
 exercise+0.294
 dive+0.285
Neg logit contributions
books-0.392
 meant-0.372
 whispered-0.364
jar-0.363
Don-0.353
Token tomatoes
Token ID23972
Feature activation+2.50e-01

Pos logit contributions
 our+0.396
 my+0.355
 your+0.353
 arrest+0.347
ap+0.346
Neg logit contributions
 meant-0.396
 whispered-0.368
books-0.366
 said-0.353
l-0.344
Token and
Token ID290
Feature activation+2.68e-01

Pos logit contributions
 our+0.321
 your+0.295
 my+0.291
 exercise+0.237
 dive+0.233
Neg logit contributions
books-0.403
jar-0.381
 meant-0.374
Don-0.373
 whispered-0.368
Token cuc
Token ID38421
Feature activation+3.60e-01

Pos logit contributions
 our+0.374
ap+0.373
latable+0.359
 arrest+0.345
 victory+0.336
Neg logit contributions
 meant-0.348
 whispered-0.329
 said-0.328
 needed-0.310
l-0.302
Tokenumbers
Token ID17024
Feature activation+1.86e-01

Pos logit contributions
 our+0.641
 my+0.597
 your+0.582
 exercise+0.540
 arrest+0.539
Neg logit contributions
books-0.384
 meant-0.377
 whispered-0.351
l-0.333
book-0.325
Token.
Token ID13
Feature activation+2.13e-01

Pos logit contributions
 our+0.301
 your+0.281
 my+0.273
ap+0.226
 these+0.226
Neg logit contributions
books-0.237
jar-0.234
Don-0.225
 whispered-0.221
 meant-0.220
Token He
Token ID679
Feature activation+2.03e-01

Pos logit contributions
 our+0.359
 my+0.336
 your+0.335
ap+0.290
 exercise+0.289
Neg logit contributions
books-0.249
 meant-0.247
 whispered-0.233
jar-0.218
 spoke-0.218
Token smiled
Token ID13541
Feature activation+1.83e-01

Pos logit contributions
 our+0.375
 my+0.350
 your+0.350
 exercise+0.314
 dive+0.310
Neg logit contributions
books-0.205
 meant-0.196
 whispered-0.183
jar-0.179
Don-0.168
Token as
Token ID355
Feature activation+2.25e-01

Pos logit contributions
 our+0.278
 my+0.259
 your+0.256
ap+0.220
 exercise+0.214
Neg logit contributions
books-0.252
 meant-0.241
 whispered-0.231
jar-0.230
 spoke-0.219
Token
Token ID129
Feature activation+4.75e-01

Pos logit contributions
 meant+0.201
books+0.197
 whispered+0.182
jar+0.173
 spoke+0.169
Neg logit contributions
 our-0.275
 your-0.249
 my-0.249
ap-0.224
 arrest-0.222
Token
Token ID241
Feature activation+4.01e-01

Pos logit contributions
 our+0.664
 my+0.590
 your+0.586
 dive+0.530
 exercise+0.529
Neg logit contributions
books-0.671
 meant-0.667
 whispered-0.643
 said-0.601
Don-0.598
TokenThat
Token ID2504
Feature activation+2.96e-01

Pos logit contributions
 our+0.744
 your+0.692
 my+0.692
 exercise+0.627
ap+0.619
Neg logit contributions
books-0.390
 meant-0.386
 whispered-0.367
jar-0.333
 spoke-0.331
Tokenâ
Token ID22940
Feature activation+3.22e-01

Pos logit contributions
 our+0.475
 my+0.434
 your+0.433
 arrest+0.406
ap+0.391
Neg logit contributions
 meant-0.366
books-0.347
 whispered-0.334
 said-0.311
 spoke-0.308
Token
Token ID26391
Feature activation+3.25e-01

Pos logit contributions
 our+0.419
 my+0.381
 your+0.380
 exercise+0.333
ap+0.328
Neg logit contributions
books-0.488
 meant-0.482
 whispered-0.470
jar-0.446
 spoke-0.441
Token
Token ID8151
Feature activation+3.59e-01

Pos logit contributions
 our+0.497
 my+0.454
 your+0.452
 exercise+0.402
 arrest+0.394
Neg logit contributions
books-0.427
 meant-0.420
 whispered-0.400
jar-0.374
 spoke-0.373
Tokens
Token ID82
Feature activation+3.09e-01

Pos logit contributions
 our+0.572
 your+0.553
 my+0.533
ric+0.513
ap+0.512
Neg logit contributions
 whispered-0.394
jar-0.389
 faded-0.363
books-0.356
 wished-0.349
Token a
Token ID257
Feature activation+3.00e-01

Pos logit contributions
 our+0.447
 my+0.401
 your+0.399
 arrest+0.386
 dive+0.361
Neg logit contributions
 meant-0.429
books-0.411
 whispered-0.390
 said-0.371
 spoke-0.358
Token special
Token ID2041
Feature activation+2.65e-01

Pos logit contributions
 our+0.450
 my+0.410
 your+0.409
 arrest+0.359
 exercise+0.359
Neg logit contributions
books-0.397
 meant-0.396
 whispered-0.378
jar-0.353
 spoke-0.351
Token necklace
Token ID38987
Feature activation+2.10e-01

Pos logit contributions
 our+0.375
 my+0.338
 your+0.336
 arrest+0.297
 exercise+0.293
Neg logit contributions
 meant-0.374
books-0.373
 whispered-0.357
jar-0.335
 spoke-0.333
Token,
Token ID11
Feature activation+2.74e-01

Pos logit contributions
 our+0.290
 my+0.274
 your+0.271
 exercise+0.224
ap+0.214
Neg logit contributions
books-0.303
 meant-0.286
Don-0.284
 whispered-0.280
jar-0.280
Token by
Token ID416
Feature activation+3.89e-01

Pos logit contributions
 meant+0.207
 whispered+0.193
books+0.189
 faded+0.186
 spoke+0.184
Neg logit contributions
 our-0.250
 your-0.233
 my-0.228
ap-0.196
 exercise-0.195
Token one
Token ID530
Feature activation+3.27e-01

Pos logit contributions
 our+0.382
latable+0.325
 my+0.324
 your+0.319
ap+0.315
Neg logit contributions
 meant-0.658
 whispered-0.625
books-0.618
 said-0.604
 spoke-0.577
Token
Token ID6184
Feature activation+2.92e-01

Pos logit contributions
 our+0.467
 my+0.427
 your+0.424
 exercise+0.371
ap+0.365
Neg logit contributions
books-0.460
 meant-0.456
 whispered-0.436
jar-0.410
 spoke-0.408
Token
Token ID95
Feature activation+2.55e-01

Pos logit contributions
 our+0.420
 my+0.378
 your+0.378
 exercise+0.331
 arrest+0.330
Neg logit contributions
books-0.409
 meant-0.402
 whispered-0.388
jar-0.365
 spoke-0.361
Token
Token ID26391
Feature activation+3.08e-01

Pos logit contributions
 our+0.334
 my+0.317
 your+0.313
 exercise+0.271
ap+0.269
Neg logit contributions
 meant-0.377
books-0.365
 whispered-0.364
 spoke-0.344
jar-0.340
Token
Token ID447
Feature activation+3.59e-01

Pos logit contributions
 our+0.548
 my+0.513
 your+0.512
ap+0.460
 arrest+0.458
Neg logit contributions
 meant-0.324
books-0.313
 whispered-0.298
jar-0.274
 spoke-0.270
Token
Token ID250
Feature activation+2.53e-01

Pos logit contributions
 our+0.293
 my+0.273
 your+0.272
ap+0.235
 practice+0.213
Neg logit contributions
 meant-0.735
books-0.687
 whispered-0.669
 spoke-0.635
 faded-0.631
Token 1
Token ID352
Feature activation+1.92e-01

Pos logit contributions
 our+0.400
 my+0.382
 your+0.379
 exercise+0.344
 practice+0.341
Neg logit contributions
 meant-0.288
books-0.287
 whispered-0.283
 spoke-0.261
 said-0.253
Token,
Token ID11
Feature activation+2.60e-01

Pos logit contributions
 our+0.255
 my+0.234
 your+0.232
 exercise+0.198
ap+0.196
Neg logit contributions
books-0.291
 meant-0.282
 whispered-0.275
jar-0.271
Don-0.263
Token 2
Token ID362
Feature activation+1.92e-01

Pos logit contributions
 our+0.348
 my+0.311
 your+0.311
 arrest+0.275
ap+0.272
Neg logit contributions
 meant-0.382
books-0.378
 whispered-0.365
 spoke-0.340
 said-0.339
Token,
Token ID11
Feature activation+2.50e-01

Pos logit contributions
 our+0.253
 my+0.230
 your+0.229
 exercise+0.197
 arrest+0.192
Neg logit contributions
books-0.291
 meant-0.289
 whispered-0.277
jar-0.265
 spoke-0.261
Token
Token ID220
Feature activation+3.59e-01

Pos logit contributions
 meant+0.192
books+0.192
l+0.176
jar+0.163
 whispered+0.162
Neg logit contributions
 our-0.230
 your-0.207
 my-0.207
 Majesty-0.196
ap-0.191
Token
Token ID198
Feature activation+3.07e-01

Pos logit contributions
 our+0.538
 your+0.512
 my+0.499
ap+0.471
ric+0.421
Neg logit contributions
 meant-0.446
books-0.435
 whispered-0.435
jar-0.424
 spoke-0.412
Token
Token ID198
Feature activation+2.79e-01

Pos logit contributions
 our+0.487
 my+0.459
 your+0.459
ap+0.387
 exercise+0.382
Neg logit contributions
 meant-0.403
books-0.378
 whispered-0.357
 faded-0.350
 widened-0.339
TokenThe
Token ID464
Feature activation+2.85e-01

Pos logit contributions
 our+0.435
 my+0.404
 your+0.403
 exercise+0.349
ap+0.346
Neg logit contributions
 meant-0.373
books-0.352
 whispered-0.339
 faded-0.317
 spoke-0.314
Token town
Token ID3240
Feature activation+2.44e-01

Pos logit contributions
 our+0.425
 my+0.389
 your+0.386
 exercise+0.345
 victory+0.337
Neg logit contributions
books-0.390
 meant-0.384
 whispered-0.363
 spoke-0.338
jar-0.338
Tokenspe
Token ID4125
Feature activation+3.58e-01

Pos logit contributions
 our+0.419
 my+0.385
 your+0.384
ap+0.376
 victory+0.352
Neg logit contributions
books-0.273
jar-0.248
 meant-0.233
Don-0.226
 whispered-0.223
Tokenople
Token ID643
Feature activation+2.32e-01

Pos logit contributions
 our+0.420
 my+0.373
 your+0.371
 arrest+0.311
ap+0.311
Neg logit contributions
books-0.596
 meant-0.591
 whispered-0.570
jar-0.547
 spoke-0.536
Token had
Token ID550
Feature activation+2.77e-01

Pos logit contributions
 our+0.421
 your+0.391
ap+0.387
 my+0.382
 spend+0.377
Neg logit contributions
books-0.223
jar-0.221
Don-0.191
post-0.188
 meant-0.186
Token to
Token ID284
Feature activation+2.88e-01

Pos logit contributions
 our+0.351
 my+0.314
 your+0.313
 exercise+0.267
 arrest+0.263
Neg logit contributions
books-0.439
 meant-0.431
 whispered-0.417
jar-0.403
 spoke-0.391
Token find
Token ID1064
Feature activation+2.38e-01

Pos logit contributions
 our+0.299
 my+0.272
 your+0.260
igs+0.243
ric+0.239
Neg logit contributions
 meant-0.503
 whispered-0.476
books-0.466
 said-0.462
 spoke-0.449
Token a
Token ID257
Feature activation+2.51e-01

Pos logit contributions
 our+0.251
 my+0.225
 your+0.220
 arrest+0.193
 exercise+0.191
Neg logit contributions
 meant-0.417
books-0.403
 whispered-0.397
 spoke-0.376
 said-0.376
Token
Token ID129
Feature activation+4.75e-01

Pos logit contributions
books+0.199
 widened+0.194
 whispered+0.191
 meant+0.190
jar+0.181
Neg logit contributions
 our-0.182
 your-0.169
 my-0.168
 job-0.141
 exercise-0.141
Token
Token ID241
Feature activation+4.01e-01

Pos logit contributions
 our+0.664
 my+0.590
 your+0.586
 dive+0.530
 exercise+0.529
Neg logit contributions
books-0.671
 meant-0.667
 whispered-0.643
 said-0.601
Don-0.598
TokenLet
Token ID5756
Feature activation+3.17e-01

Pos logit contributions
 our+0.744
 your+0.692
 my+0.692
 exercise+0.627
ap+0.619
Neg logit contributions
books-0.390
 meant-0.386
 whispered-0.367
jar-0.333
 spoke-0.331
Tokenâ
Token ID22940
Feature activation+3.23e-01

Pos logit contributions
 our+0.509
 my+0.461
 your+0.454
ric+0.451
 arrest+0.451
Neg logit contributions
 meant-0.389
 said-0.355
 whispered-0.354
 spoke-0.326
books-0.323
Token
Token ID26391
Feature activation+3.23e-01

Pos logit contributions
 our+0.440
 my+0.395
 your+0.390
 arrest+0.330
 victory+0.321
Neg logit contributions
 meant-0.491
books-0.486
 whispered-0.464
 spoke-0.438
 said-0.433
Token
Token ID8151
Feature activation+3.58e-01

Pos logit contributions
 our+0.541
 my+0.502
 your+0.497
 exercise+0.450
 victory+0.441
Neg logit contributions
books-0.375
 meant-0.364
 whispered-0.349
 said-0.329
 spoke-0.325
Tokens
Token ID82
Feature activation+3.11e-01

Pos logit contributions
 our+0.585
 your+0.555
 my+0.537
ap+0.513
ric+0.511
Neg logit contributions
 whispered-0.374
jar-0.360
 meant-0.360
 faded-0.357
books-0.350
Token go
Token ID467
Feature activation+2.79e-01

Pos logit contributions
 our+0.373
 arrest+0.330
latable+0.328
 my+0.328
 your+0.312
Neg logit contributions
 meant-0.492
 said-0.470
 whispered-0.460
 spoke-0.424
books-0.413
Token find
Token ID1064
Feature activation+2.71e-01

Pos logit contributions
 our+0.400
 arrest+0.362
 my+0.358
 your+0.348
ric+0.341
Neg logit contributions
 whispered-0.353
 meant-0.352
 said-0.344
books-0.331
l-0.331
Token something
Token ID1223
Feature activation+2.65e-01

Pos logit contributions
 our+0.263
ap+0.249
ric+0.248
 arrest+0.244
 dive+0.240
Neg logit contributions
 meant-0.459
 whispered-0.443
 said-0.436
books-0.415
l-0.406
Token else
Token ID2073
Feature activation+2.52e-01

Pos logit contributions
 our+0.391
 your+0.357
 my+0.357
 exercise+0.313
 dive+0.301
Neg logit contributions
books-0.368
 meant-0.357
 whispered-0.348
jar-0.335
Don-0.327
Token't
Token ID470
Feature activation+4.16e-01

Pos logit contributions
 meant+0.263
books+0.261
 whispered+0.251
 spoke+0.236
 said+0.235
Neg logit contributions
 our-0.242
 my-0.217
 your-0.215
 arrest-0.195
 exercise-0.192
Token know
Token ID760
Feature activation+2.92e-01

Pos logit contributions
 our+0.562
 my+0.506
 your+0.495
latable+0.465
 arrest+0.453
Neg logit contributions
 meant-0.622
 whispered-0.591
books-0.568
 said-0.562
 spoke-0.551
Token the
Token ID262
Feature activation+3.14e-01

Pos logit contributions
 our+0.276
 my+0.237
 your+0.230
ap+0.217
 arrest+0.215
Neg logit contributions
 meant-0.530
books-0.513
 whispered-0.506
 said-0.483
 spoke-0.478
Token rh
Token ID9529
Feature activation+3.12e-01

Pos logit contributions
 our+0.467
 my+0.428
 your+0.426
 exercise+0.379
 victory+0.368
Neg logit contributions
books-0.424
 meant-0.414
 whispered-0.398
jar-0.372
 spoke-0.372
Tokenin
Token ID259
Feature activation+2.58e-01

Pos logit contributions
 our+0.510
 my+0.472
 your+0.468
 exercise+0.431
 dive+0.414
Neg logit contributions
 meant-0.367
books-0.363
 whispered-0.343
l-0.324
 spoke-0.319
Tokenoc
Token ID420
Feature activation+3.57e-01

Pos logit contributions
 our+0.379
 my+0.353
 your+0.344
 victory+0.313
 exercise+0.311
Neg logit contributions
books-0.347
jar-0.321
 whispered-0.315
 meant-0.310
 widened-0.309
Tokeneros
Token ID27498
Feature activation+2.23e-01

Pos logit contributions
 our+0.416
 your+0.367
 my+0.359
 exercise+0.342
 dive+0.327
Neg logit contributions
 meant-0.567
books-0.535
 whispered-0.524
l-0.522
 spoke-0.510
Token was
Token ID373
Feature activation+2.66e-01

Pos logit contributions
 our+0.397
 your+0.369
 my+0.368
 exercise+0.329
 arrest+0.325
Neg logit contributions
books-0.252
jar-0.232
post-0.206
Don-0.203
oled-0.201
Token wild
Token ID4295
Feature activation+2.38e-01

Pos logit contributions
 our+0.378
 my+0.347
 your+0.345
 exercise+0.295
 arrest+0.288
Neg logit contributions
books-0.386
 meant-0.371
 whispered-0.361
jar-0.344
Don-0.337
Token.
Token ID13
Feature activation+2.22e-01

Pos logit contributions
 our+0.405
 my+0.374
 your+0.373
 exercise+0.334
 arrest+0.332
Neg logit contributions
 meant-0.269
books-0.266
 whispered-0.252
 spoke-0.231
jar-0.230
Token Can
Token ID1680
Feature activation+2.24e-01

Pos logit contributions
 our+0.329
 my+0.316
 your+0.310
ap+0.249
 arrest+0.246
Neg logit contributions
books-0.312
 meant-0.303
 whispered-0.285
 faded-0.274
l-0.272
Token
Token ID198
Feature activation+3.35e-01

Pos logit contributions
books+0.273
jar+0.260
ooky+0.237
post+0.227
book+0.226
Neg logit contributions
 our-0.306
 my-0.295
 your-0.290
 how-0.288
 dive-0.273
Token
Token ID198
Feature activation+2.98e-01

Pos logit contributions
 our+0.586
 your+0.545
 my+0.544
ap+0.478
 exercise+0.476
Neg logit contributions
 meant-0.380
books-0.371
 whispered-0.347
 spoke-0.320
jar-0.315
TokenSuddenly
Token ID38582
Feature activation+2.58e-01

Pos logit contributions
 our+0.506
 my+0.468
 your+0.468
 exercise+0.414
 arrest+0.412
Neg logit contributions
 meant-0.349
books-0.342
 whispered-0.323
 spoke-0.298
jar-0.294
Token,
Token ID11
Feature activation+3.03e-01

Pos logit contributions
 our+0.351
 my+0.316
 your+0.312
 exercise+0.283
 arrest+0.278
Neg logit contributions
 meant-0.371
books-0.362
 whispered-0.361
 spoke-0.339
 said-0.338
Token Bug
Token ID15217
Feature activation+2.44e-01

Pos logit contributions
 our+0.414
 my+0.373
 your+0.368
ap+0.357
ric+0.349
Neg logit contributions
 meant-0.419
 whispered-0.411
books-0.397
 said-0.391
 spoke-0.384
Tokengy
Token ID1360
Feature activation+3.56e-01

Pos logit contributions
 our+0.344
 my+0.311
 your+0.305
 dive+0.275
ap+0.268
Neg logit contributions
books-0.370
jar-0.319
book-0.308
 meant-0.301
post-0.296
Token heard
Token ID2982
Feature activation+2.48e-01

Pos logit contributions
 our+0.651
 your+0.607
 my+0.603
 arrest+0.570
 dive+0.553
Neg logit contributions
books-0.371
jar-0.323
 meant-0.304
Don-0.286
book-0.284
Token a
Token ID257
Feature activation+2.61e-01

Pos logit contributions
 our+0.260
 my+0.224
 your+0.224
ap+0.190
 exercise+0.186
Neg logit contributions
 meant-0.441
books-0.437
 whispered-0.425
 spoke-0.403
jar-0.400
Token loud
Token ID7812
Feature activation+2.20e-01

Pos logit contributions
 our+0.390
 my+0.361
 your+0.353
 dive+0.312
 exercise+0.312
Neg logit contributions
books-0.366
 meant-0.357
 whispered-0.331
book-0.316
 spoke-0.306
Token noise
Token ID7838
Feature activation+1.89e-01

Pos logit contributions
 our+0.356
 my+0.326
 your+0.326
 exercise+0.291
 arrest+0.289
Neg logit contributions
 meant-0.266
books-0.265
 whispered-0.253
 spoke-0.233
jar-0.233
Token.
Token ID13
Feature activation+2.02e-01

Pos logit contributions
 our+0.270
 your+0.247
 my+0.247
 dive+0.234
 exercise+0.230
Neg logit contributions
books-0.259
 meant-0.233
 whispered-0.231
jar-0.228
Don-0.220
Token Everyday
Token ID48154
Feature activation+3.53e-01

Pos logit contributions
books+0.185
jar+0.181
post+0.174
book+0.164
 meant+0.161
Neg logit contributions
 our-0.204
 your-0.189
 my-0.187
 spend-0.174
 practice-0.169
Token he
Token ID339
Feature activation+3.15e-01

Pos logit contributions
 our+0.548
 my+0.502
 your+0.501
 exercise+0.444
 arrest+0.439
Neg logit contributions
books-0.450
 meant-0.448
 whispered-0.426
jar-0.397
 spoke-0.395
Token would
Token ID561
Feature activation+3.20e-01

Pos logit contributions
 our+0.576
 your+0.553
 my+0.549
 dive+0.512
 spend+0.509
Neg logit contributions
books-0.321
jar-0.290
oled-0.263
 meant-0.263
post-0.262
Token take
Token ID1011
Feature activation+2.83e-01

Pos logit contributions
 our+0.359
 my+0.312
 your+0.307
 arrest+0.260
ap+0.250
Neg logit contributions
 meant-0.546
 whispered-0.532
books-0.529
 spoke-0.502
 said-0.502
Token the
Token ID262
Feature activation+2.98e-01

Pos logit contributions
 our+0.329
ric+0.300
 my+0.291
ap+0.283
 your+0.279
Neg logit contributions
 meant-0.426
books-0.421
 whispered-0.413
 said-0.393
 spoke-0.387
Token av
Token ID1196
Feature activation+3.55e-01

Pos logit contributions
 our+0.406
 my+0.371
 your+0.368
 exercise+0.328
 victory+0.318
Neg logit contributions
books-0.443
 meant-0.435
 whispered-0.416
jar-0.393
 spoke-0.391
Tokenoc
Token ID420
Feature activation+3.50e-01

Pos logit contributions
 our+0.760
 your+0.698
 my+0.696
 exercise+0.659
 dive+0.657
Neg logit contributions
books-0.255
 meant-0.232
 whispered-0.209
l-0.200
book-0.190
Tokenados
Token ID22484
Feature activation+2.37e-01

Pos logit contributions
 our+0.469
 your+0.409
 my+0.403
 dive+0.401
 exercise+0.397
Neg logit contributions
 meant-0.493
books-0.478
l-0.460
 whispered-0.452
 spoke-0.437
Token and
Token ID290
Feature activation+2.64e-01

Pos logit contributions
 our+0.264
 your+0.235
 my+0.233
 exercise+0.192
ap+0.186
Neg logit contributions
books-0.407
 meant-0.403
 whispered-0.390
jar-0.380
 spoke-0.372
Token scatter
Token ID41058
Feature activation+2.20e-01

Pos logit contributions
 our+0.360
 my+0.322
 your+0.320
ap+0.310
latable+0.308
Neg logit contributions
 meant-0.383
 whispered-0.363
 said-0.354
books-0.352
 spoke-0.346
Token them
Token ID606
Feature activation+1.91e-01

Pos logit contributions
 our+0.239
 my+0.211
 your+0.210
 exercise+0.172
 arrest+0.168
Neg logit contributions
books-0.385
 meant-0.382
 whispered-0.371
jar-0.355
 spoke-0.351
Token
Token ID129
Feature activation+4.22e-01

Pos logit contributions
 our+0.775
 my+0.703
 your+0.701
 exercise+0.644
 arrest+0.627
Neg logit contributions
books-0.527
 meant-0.520
 whispered-0.506
 said-0.473
 spoke-0.460
Token
Token ID241
Feature activation+3.78e-01

Pos logit contributions
 our+0.569
 my+0.519
 your+0.516
 exercise+0.445
 arrest+0.441
Neg logit contributions
 meant-0.623
books-0.622
 whispered-0.596
jar-0.563
 spoke-0.560
TokenWho
Token ID8241
Feature activation+3.04e-01

Pos logit contributions
 our+0.660
 your+0.620
 my+0.616
 exercise+0.561
ap+0.550
Neg logit contributions
books-0.396
 meant-0.392
 whispered-0.383
 spoke-0.350
jar-0.347
Tokenâ
Token ID22940
Feature activation+3.14e-01

Pos logit contributions
 our+0.505
 my+0.465
 your+0.461
 arrest+0.416
 exercise+0.415
Neg logit contributions
 meant-0.358
books-0.350
 whispered-0.335
 spoke-0.308
 said-0.304
Token
Token ID26391
Feature activation+3.20e-01

Pos logit contributions
 our+0.398
 my+0.358
 your+0.357
 exercise+0.308
ap+0.302
Neg logit contributions
books-0.492
 meant-0.489
 whispered-0.471
jar-0.446
 spoke-0.442
Token
Token ID8151
Feature activation+3.55e-01

Pos logit contributions
 our+0.484
 my+0.444
 your+0.443
 exercise+0.391
 arrest+0.388
Neg logit contributions
books-0.418
 meant-0.417
 whispered-0.397
jar-0.371
 spoke-0.368
Tokens
Token ID82
Feature activation+3.08e-01

Pos logit contributions
 our+0.605
 your+0.587
 my+0.573
ric+0.551
ap+0.538
Neg logit contributions
 whispered-0.339
jar-0.331
books-0.322
 faded-0.317
 meant-0.304
Token there
Token ID612
Feature activation+2.60e-01

Pos logit contributions
 our+0.365
 my+0.315
 your+0.311
 arrest+0.291
 victory+0.270
Neg logit contributions
 meant-0.517
books-0.481
 whispered-0.478
 said-0.464
 spoke-0.455
Token?
Token ID30
Feature activation+2.49e-01

Pos logit contributions
 our+0.285
 your+0.260
 my+0.259
 exercise+0.208
ap+0.200
Neg logit contributions
books-0.462
 meant-0.443
 whispered-0.442
jar-0.427
Don-0.423
Tokenâ
Token ID22940
Feature activation+2.36e-01

Pos logit contributions
 our+0.349
 my+0.329
 your+0.326
ap+0.274
 exercise+0.271
Neg logit contributions
books-0.354
 meant-0.347
 whispered-0.335
 spoke-0.313
jar-0.312
Token
Token ID26391
Feature activation+2.47e-01

Pos logit contributions
 our+0.250
 my+0.239
 your+0.235
 exercise+0.209
ap+0.205
Neg logit contributions
books-0.385
 meant-0.380
 whispered-0.371
jar-0.367
 faded-0.363
Token
Token ID129
Feature activation+4.22e-01

Pos logit contributions
 our+0.775
 my+0.703
 your+0.701
 exercise+0.644
 arrest+0.627
Neg logit contributions
books-0.527
 meant-0.520
 whispered-0.506
 said-0.473
 spoke-0.460
Token
Token ID241
Feature activation+3.78e-01

Pos logit contributions
 our+0.569
 my+0.519
 your+0.516
 exercise+0.445
 arrest+0.441
Neg logit contributions
 meant-0.623
books-0.622
 whispered-0.596
jar-0.563
 spoke-0.560
TokenThat
Token ID2504
Feature activation+2.88e-01

Pos logit contributions
 our+0.660
 your+0.620
 my+0.616
 exercise+0.561
ap+0.550
Neg logit contributions
books-0.396
 meant-0.392
 whispered-0.383
 spoke-0.350
jar-0.347
Tokenâ
Token ID22940
Feature activation+3.13e-01

Pos logit contributions
 our+0.473
 my+0.431
 your+0.430
 arrest+0.397
ap+0.385
Neg logit contributions
 meant-0.348
books-0.334
 whispered-0.319
 spoke-0.295
 said-0.293
Token
Token ID26391
Feature activation+3.18e-01

Pos logit contributions
 our+0.404
 my+0.369
 your+0.367
 exercise+0.321
ap+0.316
Neg logit contributions
books-0.475
 meant-0.470
 whispered-0.458
jar-0.435
 spoke-0.430
Token
Token ID8151
Feature activation+3.53e-01

Pos logit contributions
 our+0.473
 my+0.431
 your+0.429
 exercise+0.379
 arrest+0.374
Neg logit contributions
books-0.427
 meant-0.424
 whispered-0.405
jar-0.379
 spoke-0.377
Tokens
Token ID82
Feature activation+3.03e-01

Pos logit contributions
 our+0.571
 your+0.551
 my+0.535
ric+0.513
ap+0.509
Neg logit contributions
 whispered-0.386
jar-0.369
books-0.346
 faded-0.345
 meant-0.337
Token right
Token ID826
Feature activation+2.69e-01

Pos logit contributions
 our+0.428
 my+0.379
 your+0.379
 arrest+0.372
 dive+0.344
Neg logit contributions
 meant-0.429
books-0.407
 whispered-0.384
 said-0.368
jar-0.356
Token,
Token ID11
Feature activation+2.88e-01

Pos logit contributions
 our+0.384
 my+0.352
 your+0.352
 exercise+0.303
ap+0.297
Neg logit contributions
books-0.383
 meant-0.374
 whispered-0.365
jar-0.345
 spoke-0.344
Token Emma
Token ID18966
Feature activation+2.33e-01

Pos logit contributions
 our+0.346
 my+0.304
 your+0.303
 arrest+0.268
ap+0.265
Neg logit contributions
 meant-0.462
books-0.452
 whispered-0.441
 said-0.415
jar-0.414
Token.
Token ID13
Feature activation+2.26e-01

Pos logit contributions
 our+0.345
 your+0.328
 my+0.325
 exercise+0.277
ap+0.274
Neg logit contributions
books-0.325
jar-0.302
 meant-0.286
Don-0.282
 whispered-0.280
Token
Token ID129
Feature activation+4.22e-01

Pos logit contributions
 our+0.775
 my+0.703
 your+0.701
 exercise+0.644
 arrest+0.627
Neg logit contributions
books-0.527
 meant-0.520
 whispered-0.506
 said-0.473
 spoke-0.460
Token
Token ID241
Feature activation+3.78e-01

Pos logit contributions
 our+0.569
 my+0.519
 your+0.516
 exercise+0.445
 arrest+0.441
Neg logit contributions
 meant-0.623
books-0.622
 whispered-0.596
jar-0.563
 spoke-0.560
TokenThat
Token ID2504
Feature activation+2.88e-01

Pos logit contributions
 our+0.660
 your+0.620
 my+0.616
 exercise+0.561
ap+0.550
Neg logit contributions
books-0.396
 meant-0.392
 whispered-0.383
 spoke-0.350
jar-0.347
Tokenâ
Token ID22940
Feature activation+3.13e-01

Pos logit contributions
 our+0.473
 my+0.431
 your+0.430
 arrest+0.397
ap+0.385
Neg logit contributions
 meant-0.348
books-0.334
 whispered-0.319
 spoke-0.295
 said-0.293
Token
Token ID26391
Feature activation+3.18e-01

Pos logit contributions
 our+0.404
 my+0.369
 your+0.367
 exercise+0.321
ap+0.316
Neg logit contributions
books-0.475
 meant-0.470
 whispered-0.458
jar-0.435
 spoke-0.430
Token
Token ID8151
Feature activation+3.53e-01

Pos logit contributions
 our+0.473
 my+0.431
 your+0.429
 exercise+0.379
 arrest+0.374
Neg logit contributions
books-0.427
 meant-0.424
 whispered-0.405
jar-0.379
 spoke-0.377
Tokens
Token ID82
Feature activation+3.03e-01

Pos logit contributions
 our+0.571
 your+0.551
 my+0.535
ric+0.513
ap+0.509
Neg logit contributions
 whispered-0.386
jar-0.369
books-0.346
 faded-0.345
 meant-0.337
Token the
Token ID262
Feature activation+2.99e-01

Pos logit contributions
 our+0.428
 my+0.379
 your+0.379
 arrest+0.372
 dive+0.344
Neg logit contributions
 meant-0.429
books-0.407
 whispered-0.384
 said-0.368
jar-0.356
Token gas
Token ID3623
Feature activation+2.23e-01

Pos logit contributions
 our+0.435
 my+0.399
 your+0.397
 exercise+0.352
 arrest+0.341
Neg logit contributions
books-0.412
 meant-0.407
 whispered-0.390
jar-0.366
 spoke-0.364
Token from
Token ID422
Feature activation+3.03e-01

Pos logit contributions
 our+0.338
 my+0.307
 your+0.307
 exercise+0.271
ap+0.268
Neg logit contributions
books-0.298
 meant-0.291
 whispered-0.281
jar-0.266
Don-0.262
Token the
Token ID262
Feature activation+2.63e-01

Pos logit contributions
 our+0.296
 my+0.250
 your+0.248
ap+0.222
 arrest+0.216
Neg logit contributions
 meant-0.543
books-0.527
 whispered-0.524
 said-0.496
 spoke-0.491
Token help
Token ID1037
Feature activation-1.36e-01

Pos logit contributions
 whispered+0.195
 meant+0.195
 said+0.193
 spoke+0.187
books+0.186
Neg logit contributions
ric-0.090
ap-0.087
 our-0.084
 my-0.074
latable-0.073
Token and
Token ID290
Feature activation-1.02e-01

Pos logit contributions
books+0.229
jar+0.218
 whispered+0.216
 faded+0.216
 spoke+0.214
Neg logit contributions
 our-0.169
 your-0.155
 my-0.145
 help-0.129
 how-0.121
Token waited
Token ID13488
Feature activation-1.68e-01

Pos logit contributions
 meant+0.105
books+0.101
 whispered+0.100
 spoke+0.091
 said+0.090
Neg logit contributions
 our-0.184
 my-0.171
 your-0.170
 exercise-0.156
ap-0.156
Token for
Token ID329
Feature activation-1.12e-01

Pos logit contributions
 meant+0.261
books+0.256
 whispered+0.241
 spoke+0.238
Don+0.234
Neg logit contributions
 our-0.233
 your-0.214
 my-0.207
ap-0.179
 exercise-0.176
Token the
Token ID262
Feature activation-1.30e-01

Pos logit contributions
 meant+0.165
books+0.164
 whispered+0.163
 said+0.156
jar+0.153
Neg logit contributions
 our-0.127
ap-0.121
 my-0.115
 your-0.114
ric-0.112
Token ambulance
Token ID22536
Feature activation-2.36e-01

Pos logit contributions
books+0.180
 meant+0.177
 whispered+0.169
 spoke+0.158
jar+0.155
Neg logit contributions
 our-0.193
 my-0.176
 your-0.175
 exercise-0.157
 arrest-0.155
Token.
Token ID13
Feature activation-1.42e-01

Pos logit contributions
books+0.395
 widened+0.370
jar+0.368
 meant+0.367
 whispered+0.363
Neg logit contributions
 our-0.274
 your-0.236
 my-0.223
 help-0.214
 arrest-0.205
Token
Token ID198
Feature activation-1.60e-01

Pos logit contributions
books+0.219
 meant+0.213
 whispered+0.203
 spoke+0.195
book+0.191
Neg logit contributions
 our-0.203
 your-0.183
 my-0.179
 arrest-0.142
 backup-0.136
Token
Token ID198
Feature activation-1.57e-01

Pos logit contributions
 meant+0.225
books+0.218
 whispered+0.203
 faded+0.198
 spoke+0.196
Neg logit contributions
 our-0.251
 your-0.227
 my-0.218
 arrest-0.180
 explore-0.174
TokenL
Token ID43
Feature activation-1.77e-01

Pos logit contributions
 meant+0.229
books+0.226
 whispered+0.211
 spoke+0.196
jar+0.193
Neg logit contributions
 our-0.235
 your-0.213
 my-0.211
 arrest-0.182
 exercise-0.176
Tokenily
Token ID813
Feature activation-1.82e-01

Pos logit contributions
 meant+0.109
books+0.106
 whispered+0.089
jar+0.081
book+0.079
Neg logit contributions
 our-0.399
 my-0.371
 your-0.370
 exercise-0.353
 dive-0.352
Token them
Token ID606
Feature activation-1.82e-01

Pos logit contributions
 meant+0.211
 whispered+0.205
 said+0.199
books+0.196
l+0.189
Neg logit contributions
 our-0.188
 arrest-0.171
 dive-0.170
ap-0.169
 my-0.168
Token the
Token ID262
Feature activation-1.27e-01

Pos logit contributions
 meant+0.276
 whispered+0.276
books+0.270
 said+0.264
 spoke+0.254
Neg logit contributions
 our-0.206
 my-0.184
 your-0.183
 arrest-0.181
ap-0.175
Token umbrella
Token ID25510
Feature activation-2.23e-01

Pos logit contributions
books+0.233
 meant+0.223
 whispered+0.216
 spoke+0.208
jar+0.207
Neg logit contributions
 our-0.136
 your-0.118
 my-0.117
 exercise-0.099
 victory-0.093
Token and
Token ID290
Feature activation-1.08e-01

Pos logit contributions
books+0.295
 meant+0.286
 whispered+0.272
jar+0.260
 spoke+0.256
Neg logit contributions
 our-0.344
 your-0.315
 my-0.315
 exercise-0.278
ap-0.273
Token the
Token ID262
Feature activation-1.31e-01

Pos logit contributions
 meant+0.131
 whispered+0.128
 said+0.120
books+0.116
 spoke+0.114
Neg logit contributions
 our-0.175
 my-0.161
 your-0.160
ap-0.154
ric-0.150
Token coat
Token ID13209
Feature activation-2.33e-01

Pos logit contributions
books+0.225
 meant+0.216
 whispered+0.207
jar+0.198
 spoke+0.197
Neg logit contributions
 our-0.157
 your-0.140
 my-0.139
 exercise-0.120
 arrest-0.115
Token.
Token ID13
Feature activation-1.47e-01

Pos logit contributions
books+0.377
Wow+0.352
Today+0.345
jar+0.344
Uh+0.341
Neg logit contributions
 our-0.280
 your-0.256
 help-0.246
 how-0.240
 my-0.239
Token She
Token ID1375
Feature activation-1.61e-01

Pos logit contributions
 meant+0.186
books+0.185
 spoke+0.172
 whispered+0.172
ooky+0.168
Neg logit contributions
 our-0.239
 your-0.234
 my-0.228
 exercise-0.177
ap-0.177
Token said
Token ID531
Feature activation-1.86e-01

Pos logit contributions
books+0.159
jar+0.129
oled+0.121
book+0.121
post+0.117
Neg logit contributions
 our-0.312
 my-0.292
 your-0.290
 how-0.269
 exercise-0.263
Token,
Token ID11
Feature activation-1.25e-01

Pos logit contributions
 meant+0.265
books+0.260
 whispered+0.255
 spoke+0.239
 said+0.239
Neg logit contributions
 our-0.252
 my-0.228
 your-0.224
 arrest-0.208
 exercise-0.203
Token "
Token ID366
Feature activation-1.84e-01

Pos logit contributions
ooky+0.236
 meant+0.214
books+0.213
able+0.200
jar+0.197
Neg logit contributions
 our-0.138
 your-0.134
 my-0.128
 how-0.111
 help-0.098
Token his
Token ID465
Feature activation-1.41e-01

Pos logit contributions
 meant+0.146
 whispered+0.138
 said+0.134
books+0.129
 wished+0.124
Neg logit contributions
 our-0.146
ap-0.140
ric-0.133
 my-0.128
 your-0.126
Token magical
Token ID10883
Feature activation-1.76e-01

Pos logit contributions
 meant+0.210
books+0.208
 whispered+0.204
 spoke+0.190
jar+0.188
Neg logit contributions
 our-0.189
 my-0.171
 your-0.170
 arrest-0.149
 exercise-0.146
Token experience
Token ID1998
Feature activation-1.78e-01

Pos logit contributions
 meant+0.254
 whispered+0.248
books+0.248
 spoke+0.231
 faded+0.228
Neg logit contributions
 our-0.244
 my-0.220
 your-0.217
 arrest-0.194
ap-0.188
Token with
Token ID351
Feature activation-1.12e-01

Pos logit contributions
books+0.249
 meant+0.245
 whispered+0.236
jar+0.231
Don+0.229
Neg logit contributions
 our-0.252
 your-0.230
 my-0.229
 victory-0.203
 exercise-0.201
Token the
Token ID262
Feature activation-1.35e-01

Pos logit contributions
 meant+0.192
books+0.188
 whispered+0.184
 said+0.174
 spoke+0.174
Neg logit contributions
 our-0.123
 my-0.107
 your-0.106
ap-0.095
 exercise-0.094
Token fog
Token ID19558
Feature activation-2.33e-01

Pos logit contributions
books+0.193
 meant+0.187
 whispered+0.178
jar+0.169
Don+0.167
Neg logit contributions
 our-0.193
 my-0.176
 your-0.176
 exercise-0.157
 victory-0.153
Token.
Token ID13
Feature activation-1.44e-01

Pos logit contributions
books+0.303
 meant+0.296
 whispered+0.284
jar+0.270
Don+0.267
Neg logit contributions
 our-0.355
 my-0.324
 your-0.323
 exercise-0.286
ap-0.284
Token<|endoftext|>
Token ID50256
Feature activation-1.62e-01

Pos logit contributions
books+0.143
 meant+0.142
 whispered+0.134
jar+0.122
 spoke+0.121
Neg logit contributions
 our-0.267
 my-0.249
 your-0.248
 exercise-0.223
 arrest-0.222
TokenOnce
Token ID7454
Feature activation-1.36e-01

Pos logit contributions
 whispered+0.226
 meant+0.220
books+0.216
 faded+0.211
 wil+0.211
Neg logit contributions
 our-0.210
 your-0.197
 my-0.195
 such-0.164
ap-0.163
Token there
Token ID612
Feature activation-1.59e-01

Pos logit contributions
books+0.153
 meant+0.152
 whispered+0.144
jar+0.132
 spoke+0.132
Neg logit contributions
 our-0.231
 my-0.213
 your-0.212
 exercise-0.192
 arrest-0.192
Token was
Token ID373
Feature activation-1.29e-01

Pos logit contributions
books+0.182
jar+0.172
 whispered+0.168
 meant+0.162
book+0.155
Neg logit contributions
 our-0.272
 your-0.258
 my-0.254
 dive-0.219
 exercise-0.217
Token
Token ID198
Feature activation-1.53e-01

Pos logit contributions
 meant+0.206
 wil+0.188
books+0.187
ooky+0.185
jar+0.184
Neg logit contributions
 our-0.173
 your-0.168
 my-0.167
 join-0.139
ap-0.137
Token
Token ID198
Feature activation-1.51e-01

Pos logit contributions
 meant+0.170
books+0.163
 whispered+0.155
jar+0.145
 spoke+0.144
Neg logit contributions
 our-0.274
 your-0.256
 my-0.255
ap-0.222
 exercise-0.218
Token"
Token ID1
Feature activation-1.52e-01

Pos logit contributions
books+0.180
 meant+0.174
 whispered+0.168
jar+0.156
Don+0.154
Neg logit contributions
 our-0.249
 my-0.228
 your-0.227
 arrest-0.207
 exercise-0.207
TokenYes
Token ID5297
Feature activation-1.75e-01

Pos logit contributions
books+0.193
 meant+0.192
 whispered+0.182
jar+0.171
book+0.165
Neg logit contributions
 our-0.240
 my-0.221
 your-0.219
 exercise-0.198
 dive-0.193
Token,
Token ID11
Feature activation-1.21e-01

Pos logit contributions
books+0.220
 meant+0.219
 whispered+0.211
 said+0.209
l+0.202
Neg logit contributions
 our-0.245
 your-0.223
ric-0.219
 my-0.218
 exercise-0.217
Token sir
Token ID15967
Feature activation-2.32e-01

Pos logit contributions
 meant+0.185
books+0.177
 whispered+0.177
 said+0.170
 spoke+0.165
Neg logit contributions
 our-0.151
 your-0.135
 my-0.133
ap-0.128
ric-0.125
Token!"
Token ID2474
Feature activation-1.53e-01

Pos logit contributions
 meant+0.297
books+0.286
 whispered+0.277
 said+0.264
 spoke+0.255
Neg logit contributions
 our-0.356
 my-0.325
 your-0.325
 exercise-0.299
 arrest-0.298
Token Lily
Token ID20037
Feature activation-1.88e-01

Pos logit contributions
books+0.216
 meant+0.211
 whispered+0.197
jar+0.195
Don+0.187
Neg logit contributions
 our-0.225
 my-0.206
 your-0.205
 exercise-0.175
 dive-0.172
Token said
Token ID531
Feature activation-1.81e-01

Pos logit contributions
jar+0.152
books+0.144
lessness+0.119
Don+0.115
post+0.111
Neg logit contributions
 our-0.392
 my-0.372
 your-0.364
 practice-0.336
ap-0.334
Token.
Token ID13
Feature activation-1.50e-01

Pos logit contributions
books+0.207
 meant+0.205
 whispered+0.187
jar+0.184
Don+0.179
Neg logit contributions
 our-0.309
 your-0.286
 my-0.283
ap-0.262
 exercise-0.253
Token
Token ID198
Feature activation-1.69e-01

Pos logit contributions
 meant+0.224
books+0.212
jar+0.201
ooky+0.199
l+0.197
Neg logit contributions
 our-0.213
 your-0.205
 my-0.199
ap-0.169
 arrest-0.160
Token.
Token ID13
Feature activation-1.34e-01

Pos logit contributions
 meant+0.249
books+0.247
 whispered+0.238
 spoke+0.220
jar+0.218
Neg logit contributions
 our-0.270
 my-0.246
 your-0.243
 arrest-0.216
 exercise-0.214
Token He
Token ID679
Feature activation-1.49e-01

Pos logit contributions
books+0.146
 meant+0.142
 whispered+0.138
jar+0.129
 spoke+0.129
Neg logit contributions
 our-0.235
 my-0.225
 your-0.221
 exercise-0.197
ap-0.192
Token thinks
Token ID6834
Feature activation-1.75e-01

Pos logit contributions
 meant+0.158
 whispered+0.152
books+0.147
 said+0.137
 spoke+0.137
Neg logit contributions
 our-0.265
 my-0.244
 your-0.244
 arrest-0.222
 exercise-0.220
Token maybe
Token ID3863
Feature activation-1.62e-01

Pos logit contributions
books+0.271
 meant+0.269
 whispered+0.260
jar+0.247
 spoke+0.245
Neg logit contributions
 our-0.228
 your-0.208
 my-0.207
 exercise-0.178
ap-0.171
Token the
Token ID262
Feature activation-1.29e-01

Pos logit contributions
 meant+0.192
 whispered+0.184
books+0.181
 said+0.171
 spoke+0.169
Neg logit contributions
 our-0.265
 my-0.241
 your-0.236
 arrest-0.232
ric-0.229
Token hat
Token ID6877
Feature activation-2.32e-01

Pos logit contributions
books+0.220
 meant+0.205
oled+0.204
 widened+0.196
jar+0.192
Neg logit contributions
 our-0.153
 my-0.144
 your-0.141
 exercise-0.126
 dive-0.113
Token is
Token ID318
Feature activation-1.36e-01

Pos logit contributions
books+0.397
jar+0.370
oled+0.354
book+0.340
Don+0.337
Neg logit contributions
 our-0.303
 your-0.280
 my-0.277
 exercise-0.225
 dive-0.217
Token in
Token ID287
Feature activation-9.56e-02

Pos logit contributions
books+0.219
 meant+0.202
 whispered+0.199
jar+0.198
Don+0.197
Neg logit contributions
 our-0.183
 my-0.168
 your-0.168
 exercise-0.137
 victory-0.128
Token the
Token ID262
Feature activation-1.19e-01

Pos logit contributions
books+0.180
 meant+0.178
 whispered+0.173
 spoke+0.166
jar+0.165
Neg logit contributions
 our-0.095
 my-0.083
 your-0.083
 exercise-0.065
 arrest-0.064
Token barn
Token ID25203
Feature activation-1.97e-01

Pos logit contributions
books+0.179
 meant+0.178
 whispered+0.168
 spoke+0.159
jar+0.154
Neg logit contributions
 our-0.168
 your-0.153
 my-0.153
 exercise-0.133
 arrest-0.128
Token.
Token ID13
Feature activation-1.45e-01

Pos logit contributions
books+0.257
Don+0.256
 meant+0.239
 whispered+0.237
oled+0.237
Neg logit contributions
 our-0.302
 your-0.282
 my-0.277
ap-0.243
 exercise-0.243
Token.
Token ID13
Feature activation-1.43e-01

Pos logit contributions
books+0.160
 meant+0.159
 whispered+0.149
jar+0.136
 spoke+0.135
Neg logit contributions
 our-0.304
 my-0.283
 your-0.282
 exercise-0.256
 arrest-0.254
Token
Token ID198
Feature activation-1.58e-01

Pos logit contributions
 meant+0.179
books+0.179
 whispered+0.168
jar+0.160
 faded+0.157
Neg logit contributions
 our-0.230
 your-0.208
 my-0.208
ap-0.185
 dive-0.174
Token
Token ID198
Feature activation-1.55e-01

Pos logit contributions
 meant+0.209
books+0.195
 whispered+0.192
 faded+0.191
jar+0.182
Neg logit contributions
 our-0.248
 your-0.227
 my-0.222
ap-0.191
 dive-0.185
TokenAs
Token ID1722
Feature activation-1.48e-01

Pos logit contributions
 meant+0.236
books+0.226
 whispered+0.223
 spoke+0.205
 faded+0.203
Neg logit contributions
 our-0.212
 my-0.192
 your-0.190
 exercise-0.165
 arrest-0.162
Token the
Token ID262
Feature activation-1.25e-01

Pos logit contributions
books+0.230
 meant+0.227
 whispered+0.218
jar+0.206
 spoke+0.205
Neg logit contributions
 our-0.197
 my-0.176
 your-0.176
 exercise-0.151
 arrest-0.148
Token fog
Token ID19558
Feature activation-2.31e-01

Pos logit contributions
 meant+0.167
books+0.163
 whispered+0.159
 spoke+0.150
jar+0.148
Neg logit contributions
 our-0.185
 your-0.172
 my-0.170
ap-0.150
 arrest-0.149
Token filled
Token ID5901
Feature activation-1.32e-01

Pos logit contributions
books+0.237
 meant+0.222
 whispered+0.206
jar+0.202
book+0.190
Neg logit contributions
 our-0.421
 my-0.387
 your-0.385
 dive-0.358
 exercise-0.355
Token the
Token ID262
Feature activation-1.25e-01

Pos logit contributions
 meant+0.218
books+0.210
 whispered+0.207
 said+0.197
 spoke+0.192
Neg logit contributions
 our-0.143
 my-0.133
 your-0.129
 arrest-0.127
 spend-0.116
Token port
Token ID2493
Feature activation-1.93e-01

Pos logit contributions
 meant+0.151
books+0.142
 whispered+0.141
jar+0.131
 faded+0.130
Neg logit contributions
 our-0.195
 your-0.182
 my-0.181
 arrest-0.169
 exercise-0.166
Token,
Token ID11
Feature activation-1.18e-01

Pos logit contributions
books+0.331
 meant+0.316
jar+0.302
 whispered+0.301
Don+0.295
Neg logit contributions
 our-0.227
 your-0.204
 my-0.202
ap-0.174
 dive-0.173
Token everything
Token ID2279
Feature activation-1.85e-01

Pos logit contributions
 meant+0.199
books+0.198
 whispered+0.192
 spoke+0.182
jar+0.181
Neg logit contributions
 our-0.134
 my-0.120
 your-0.119
 exercise-0.100
 arrest-0.100
Token Bob
Token ID5811
Feature activation-2.00e-01

Pos logit contributions
books+0.310
 meant+0.303
 whispered+0.294
jar+0.286
 spoke+0.278
Neg logit contributions
 our-0.190
 my-0.169
 your-0.167
 exercise-0.136
 arrest-0.133
Token.
Token ID13
Feature activation-1.41e-01

Pos logit contributions
books+0.257
 meant+0.242
jar+0.237
 whispered+0.224
Don+0.220
Neg logit contributions
 our-0.316
 my-0.299
 your-0.298
ap-0.257
 dive-0.256
Token
Token ID198
Feature activation-1.58e-01

Pos logit contributions
books+0.240
jar+0.224
 meant+0.218
 faded+0.218
book+0.217
Neg logit contributions
 our-0.168
 my-0.164
 your-0.156
 dive-0.122
 exercise-0.119
Token
Token ID198
Feature activation-1.56e-01

Pos logit contributions
 meant+0.182
books+0.180
 whispered+0.170
 faded+0.166
jar+0.165
Neg logit contributions
 our-0.270
 my-0.252
 your-0.251
 exercise-0.216
 dive-0.215
TokenThe
Token ID464
Feature activation-1.46e-01

Pos logit contributions
books+0.204
 meant+0.204
 whispered+0.194
jar+0.182
 spoke+0.180
Neg logit contributions
 our-0.239
 my-0.220
 your-0.218
 exercise-0.193
 arrest-0.191
Token elf
Token ID23878
Feature activation-2.31e-01

Pos logit contributions
books+0.216
 meant+0.205
 whispered+0.193
post+0.187
jar+0.187
Neg logit contributions
 our-0.196
 my-0.193
 your-0.181
 exercise-0.162
 victory-0.161
Token said
Token ID531
Feature activation-1.81e-01

Pos logit contributions
books+0.278
jar+0.235
post+0.218
Today+0.217
lessness+0.216
Neg logit contributions
 our-0.384
 my-0.367
 your-0.354
 dive-0.351
 arrest-0.330
Token,
Token ID11
Feature activation-1.22e-01

Pos logit contributions
 meant+0.271
books+0.269
 whispered+0.260
 spoke+0.245
 said+0.242
Neg logit contributions
 our-0.239
 my-0.216
 your-0.214
 exercise-0.188
 arrest-0.188
Token "
Token ID366
Feature activation-1.75e-01

Pos logit contributions
books+0.188
ooky+0.181
jar+0.180
 meant+0.173
 faded+0.165
Neg logit contributions
 our-0.161
 my-0.154
 your-0.153
 how-0.132
 exercise-0.118
TokenG
Token ID38
Feature activation-2.00e-01

Pos logit contributions
books+0.216
 meant+0.198
 whispered+0.195
jar+0.184
book+0.182
Neg logit contributions
 our-0.281
 my-0.268
 your-0.260
 arrest-0.244
 myself-0.231
Tokenreetings
Token ID46648
Feature activation-1.83e-01

Pos logit contributions
books+0.244
 meant+0.240
 whispered+0.224
l+0.211
jar+0.210
Neg logit contributions
 our-0.329
 my-0.299
 your-0.298
 exercise-0.271
 arrest-0.266
TokenJohn
Token ID7554
Feature activation-1.82e-01

Pos logit contributions
 meant+0.195
books+0.192
 whispered+0.186
jar+0.172
 spoke+0.171
Neg logit contributions
 our-0.229
 my-0.210
 your-0.208
 arrest-0.181
 exercise-0.181
Token and
Token ID290
Feature activation-1.00e-01

Pos logit contributions
books+0.261
jar+0.243
book+0.221
post+0.215
lessness+0.206
Neg logit contributions
 our-0.273
 my-0.262
 your-0.249
 arrest-0.223
 how-0.222
Token Sarah
Token ID10490
Feature activation-1.76e-01

Pos logit contributions
 meant+0.157
books+0.152
 whispered+0.148
 said+0.141
 spoke+0.141
Neg logit contributions
 our-0.129
 your-0.114
 my-0.113
ap-0.106
 exercise-0.104
Token helped
Token ID4193
Feature activation-1.46e-01

Pos logit contributions
books+0.272
jar+0.242
book+0.238
post+0.233
ooky+0.227
Neg logit contributions
 spend-0.236
 our-0.235
 help-0.229
 how-0.219
 practice-0.214
Token the
Token ID262
Feature activation-1.23e-01

Pos logit contributions
 meant+0.265
books+0.262
 whispered+0.254
 said+0.241
 spoke+0.240
Neg logit contributions
 our-0.146
 my-0.130
 your-0.128
 exercise-0.108
 arrest-0.107
Token ambulance
Token ID22536
Feature activation-2.30e-01

Pos logit contributions
 meant+0.163
books+0.158
 whispered+0.153
 spoke+0.142
jar+0.141
Neg logit contributions
 our-0.186
 your-0.170
 my-0.169
ap-0.151
 exercise-0.150
Token driver
Token ID4639
Feature activation-2.13e-01

Pos logit contributions
books+0.345
 meant+0.334
 whispered+0.318
jar+0.311
Don+0.305
Neg logit contributions
 our-0.312
 your-0.281
 my-0.276
 exercise-0.250
ap-0.249
Token until
Token ID1566
Feature activation-1.43e-01

Pos logit contributions
books+0.305
 meant+0.300
 whispered+0.287
jar+0.273
 spoke+0.269
Neg logit contributions
 our-0.302
 my-0.274
 your-0.274
 exercise-0.239
ap-0.236
Token the
Token ID262
Feature activation-1.37e-01

Pos logit contributions
books+0.249
 meant+0.242
 whispered+0.234
jar+0.226
Don+0.221
Neg logit contributions
 our-0.169
 your-0.148
 my-0.146
 exercise-0.121
 victory-0.119
Token baby
Token ID5156
Feature activation-2.19e-01

Pos logit contributions
books+0.211
 meant+0.205
 whispered+0.199
 spoke+0.187
jar+0.186
Neg logit contributions
 our-0.183
 my-0.166
 your-0.164
 exercise-0.145
 arrest-0.142
Token was
Token ID373
Feature activation-1.37e-01

Pos logit contributions
books+0.285
book+0.235
jar+0.232
post+0.223
 meant+0.216
Neg logit contributions
 our-0.371
 your-0.332
 my-0.331
 arrest-0.326
 exercise-0.301
Token Tim
Token ID5045
Feature activation-1.80e-01

Pos logit contributions
 meant+0.153
books+0.148
 whispered+0.147
 said+0.140
 spoke+0.139
Neg logit contributions
 our-0.121
 my-0.109
 your-0.108
ap-0.096
 exercise-0.095
Tokenmy
Token ID1820
Feature activation-1.66e-01

Pos logit contributions
books+0.304
jar+0.289
post+0.272
 meant+0.269
l+0.264
Neg logit contributions
 our-0.216
 my-0.191
 your-0.189
 spend-0.186
ap-0.184
Token watched
Token ID7342
Feature activation-1.50e-01

Pos logit contributions
books+0.206
jar+0.194
 meant+0.179
post+0.179
book+0.175
Neg logit contributions
 our-0.262
 spend-0.235
 your-0.232
 my-0.231
ap-0.229
Token as
Token ID355
Feature activation-1.10e-01

Pos logit contributions
 meant+0.271
books+0.269
 whispered+0.261
 spoke+0.247
jar+0.247
Neg logit contributions
 our-0.150
 my-0.132
 your-0.130
 arrest-0.107
 exercise-0.107
Token the
Token ID262
Feature activation-1.27e-01

Pos logit contributions
 meant+0.176
books+0.165
 whispered+0.164
l+0.161
 said+0.159
Neg logit contributions
 our-0.125
 my-0.118
ap-0.114
 your-0.113
ric-0.108
Token ambulance
Token ID22536
Feature activation-2.30e-01

Pos logit contributions
books+0.174
 meant+0.160
 whispered+0.157
jar+0.148
book+0.147
Neg logit contributions
 our-0.191
 my-0.176
 your-0.175
 exercise-0.161
 arrest-0.161
Token drove
Token ID10357
Feature activation-1.62e-01

Pos logit contributions
books+0.332
Today+0.294
jar+0.290
book+0.286
Wow+0.285
Neg logit contributions
 our-0.286
 spend-0.266
 help-0.264
 arrest-0.260
ap-0.254
Token away
Token ID1497
Feature activation-1.44e-01

Pos logit contributions
books+0.319
 meant+0.308
 whispered+0.301
jar+0.297
 spoke+0.287
Neg logit contributions
 our-0.166
 your-0.141
 my-0.134
 victory-0.101
 those-0.092
Token,
Token ID11
Feature activation-1.13e-01

Pos logit contributions
books+0.230
 whispered+0.212
 meant+0.212
Don+0.210
jar+0.208
Neg logit contributions
 our-0.194
 your-0.166
 my-0.161
 victory-0.149
ap-0.140
Token hoping
Token ID7725
Feature activation-1.71e-01

Pos logit contributions
books+0.198
 meant+0.184
book+0.177
l+0.173
ooky+0.173
Neg logit contributions
 our-0.145
 your-0.128
 my-0.127
 arrest-0.105
 exercise-0.099
Token the
Token ID262
Feature activation-1.37e-01

Pos logit contributions
 meant+0.282
 whispered+0.267
books+0.267
 said+0.257
 spoke+0.252
Neg logit contributions
 our-0.196
 my-0.177
 your-0.171
ap-0.163
 arrest-0.159
TokenThe
Token ID464
Feature activation-1.35e-01

Pos logit contributions
 meant+0.200
books+0.195
 whispered+0.190
 spoke+0.177
jar+0.170
Neg logit contributions
 our-0.214
 my-0.198
 your-0.197
 exercise-0.174
 arrest-0.168
Token little
Token ID1310
Feature activation-1.48e-01

Pos logit contributions
books+0.214
 meant+0.208
 whispered+0.199
 spoke+0.186
jar+0.186
Neg logit contributions
 our-0.168
 my-0.160
 your-0.153
 exercise-0.137
 victory-0.135
Token girl
Token ID2576
Feature activation-1.85e-01

Pos logit contributions
books+0.146
 meant+0.136
 whispered+0.127
jar+0.117
 spoke+0.115
Neg logit contributions
 our-0.279
 my-0.267
 your-0.263
 exercise-0.238
 dive-0.234
Token thanked
Token ID26280
Feature activation-1.48e-01

Pos logit contributions
books+0.213
 meant+0.182
jar+0.176
book+0.163
Don+0.161
Neg logit contributions
 our-0.334
 my-0.316
 your-0.309
 dive-0.282
ap-0.277
Token the
Token ID262
Feature activation-1.24e-01

Pos logit contributions
 meant+0.247
books+0.246
 whispered+0.241
 said+0.226
 spoke+0.226
Neg logit contributions
 our-0.166
 my-0.144
 your-0.142
ap-0.124
 arrest-0.124
Token elf
Token ID23878
Feature activation-2.30e-01

Pos logit contributions
books+0.179
 meant+0.173
 whispered+0.161
oled+0.155
jar+0.155
Neg logit contributions
 my-0.165
 our-0.162
 your-0.157
 exercise-0.140
 victory-0.138
Token and
Token ID290
Feature activation-9.53e-02

Pos logit contributions
 meant+0.311
books+0.309
 whispered+0.302
 spoke+0.279
 said+0.278
Neg logit contributions
 our-0.332
 my-0.302
 your-0.300
 arrest-0.268
 exercise-0.267
Token waved
Token ID26834
Feature activation-1.64e-01

Pos logit contributions
books+0.101
 meant+0.091
jar+0.082
 whispered+0.080
book+0.077
Neg logit contributions
 our-0.177
 your-0.166
 my-0.166
 exercise-0.147
 dive-0.146
Token goodbye
Token ID24829
Feature activation-1.60e-01

Pos logit contributions
 meant+0.247
books+0.246
 whispered+0.240
 spoke+0.224
 said+0.222
Neg logit contributions
 our-0.215
 my-0.192
 your-0.191
 exercise-0.169
 arrest-0.168
Token as
Token ID355
Feature activation-1.13e-01

Pos logit contributions
 meant+0.251
books+0.251
 whispered+0.233
jar+0.227
 spoke+0.222
Neg logit contributions
 our-0.208
 your-0.188
 my-0.187
ap-0.158
 arrest-0.156
Token<|endoftext|>
Token ID50256
Feature activation-1.61e-01

Pos logit contributions
books+0.145
 meant+0.142
 whispered+0.135
jar+0.126
 spoke+0.125
Neg logit contributions
 our-0.180
 your-0.166
 my-0.165
 exercise-0.145
 arrest-0.142
Token and
Token ID290
Feature activation-9.69e-02

Pos logit contributions
books+0.273
 meant+0.264
jar+0.249
 whispered+0.248
Don+0.244
Neg logit contributions
 our-0.260
 your-0.236
 my-0.233
ap-0.208
 exercise-0.205
Token flew
Token ID13112
Feature activation-1.55e-01

Pos logit contributions
books+0.101
 meant+0.095
 whispered+0.087
jar+0.086
Don+0.081
Neg logit contributions
 our-0.178
 my-0.164
 your-0.164
 exercise-0.148
 dive-0.146
Token her
Token ID607
Feature activation-1.36e-01

Pos logit contributions
books+0.251
 meant+0.249
 whispered+0.239
jar+0.230
 spoke+0.225
Neg logit contributions
 our-0.192
 my-0.172
 your-0.171
 exercise-0.143
ap-0.141
Token to
Token ID284
Feature activation-9.48e-02

Pos logit contributions
books+0.225
 meant+0.223
 whispered+0.212
jar+0.203
 spoke+0.200
Neg logit contributions
 our-0.165
 my-0.147
 your-0.146
 arrest-0.122
 exercise-0.122
Token the
Token ID262
Feature activation-1.18e-01

Pos logit contributions
 meant+0.163
books+0.163
 whispered+0.161
 spoke+0.151
jar+0.150
Neg logit contributions
 our-0.100
 my-0.088
 your-0.087
ap-0.076
 exercise-0.074
Token waterfall
Token ID42467
Feature activation-2.30e-01

Pos logit contributions
books+0.171
 meant+0.168
 whispered+0.159
 spoke+0.149
jar+0.149
Neg logit contributions
 our-0.170
 my-0.156
 your-0.154
 exercise-0.135
 dive-0.132
Token.
Token ID13
Feature activation-1.42e-01

Pos logit contributions
books+0.251
 meant+0.249
 whispered+0.228
Don+0.224
jar+0.219
Neg logit contributions
 our-0.400
 your-0.370
 my-0.367
ap-0.338
 dive-0.331
Token It
Token ID632
Feature activation-1.65e-01

Pos logit contributions
books+0.178
 meant+0.175
 whispered+0.162
 spoke+0.155
jar+0.151
Neg logit contributions
 our-0.228
 my-0.211
 your-0.210
ap-0.177
 exercise-0.175
Token was
Token ID373
Feature activation-1.29e-01

Pos logit contributions
books+0.208
jar+0.184
book+0.178
oled+0.177
post+0.169
Neg logit contributions
 our-0.275
 my-0.256
 your-0.244
 dive-0.236
 how-0.231
Token even
Token ID772
Feature activation-1.19e-01

Pos logit contributions
books+0.232
 meant+0.216
 whispered+0.215
jar+0.213
boards+0.209
Neg logit contributions
 our-0.148
 my-0.133
 your-0.130
 victory-0.098
 exercise-0.098
Token brighter
Token ID26083
Feature activation-1.67e-01

Pos logit contributions
 meant+0.191
books+0.190
 whispered+0.184
jar+0.173
 spoke+0.173
Neg logit contributions
 our-0.147
 my-0.131
 your-0.131
 exercise-0.112
 arrest-0.112
Token Lucy
Token ID22162
Feature activation-1.77e-01

Pos logit contributions
 meant+0.216
books+0.215
 whispered+0.202
 spoke+0.193
l+0.192
Neg logit contributions
 our-0.205
 my-0.192
 your-0.190
ap-0.154
 exercise-0.152
Token smiled
Token ID13541
Feature activation-1.53e-01

Pos logit contributions
books+0.182
 meant+0.161
jar+0.160
post+0.142
Don+0.142
Neg logit contributions
 our-0.333
 my-0.316
 your-0.310
ap-0.280
 dive-0.276
Token and
Token ID290
Feature activation-9.99e-02

Pos logit contributions
books+0.218
 meant+0.206
 whispered+0.196
jar+0.191
 spoke+0.189
Neg logit contributions
 our-0.224
 my-0.210
 your-0.207
ap-0.182
 victory-0.174
Token knew
Token ID2993
Feature activation-1.52e-01

Pos logit contributions
books+0.074
 meant+0.066
jar+0.058
 whispered+0.057
l+0.052
Neg logit contributions
 our-0.215
 your-0.203
 my-0.203
 exercise-0.183
ap-0.182
Token the
Token ID262
Feature activation-1.26e-01

Pos logit contributions
 meant+0.220
books+0.210
 whispered+0.210
 said+0.198
 spoke+0.194
Neg logit contributions
 our-0.212
 my-0.189
 your-0.183
 arrest-0.179
 dive-0.173
Token taxi
Token ID17536
Feature activation-2.29e-01

Pos logit contributions
books+0.191
oled+0.182
 whispered+0.177
 gasped+0.171
book+0.170
Neg logit contributions
 our-0.173
 my-0.165
 your-0.158
 victory-0.153
 job-0.147
Token driver
Token ID4639
Feature activation-2.17e-01

Pos logit contributions
books+0.253
jar+0.245
oled+0.233
 widened+0.228
 whispered+0.221
Neg logit contributions
 our-0.368
 my-0.351
 your-0.344
 how-0.320
 victory-0.319
Token had
Token ID550
Feature activation-1.12e-01

Pos logit contributions
books+0.235
jar+0.225
oled+0.202
 whispered+0.199
 widened+0.197
Neg logit contributions
 our-0.373
 my-0.356
 your-0.355
 how-0.324
 exercise-0.316
Token been
Token ID587
Feature activation-1.26e-01

Pos logit contributions
jar+0.168
books+0.168
oled+0.160
l+0.155
 widened+0.153
Neg logit contributions
 our-0.156
 your-0.149
 my-0.148
 practice-0.120
 exercise-0.119
Token right
Token ID826
Feature activation-1.37e-01

Pos logit contributions
books+0.197
 meant+0.194
 whispered+0.186
jar+0.179
 spoke+0.175
Neg logit contributions
 our-0.163
 my-0.148
 your-0.147
 exercise-0.125
ap-0.121
Token.
Token ID13
Feature activation-1.42e-01

Pos logit contributions
books+0.197
 meant+0.189
 whispered+0.186
 spoke+0.179
jar+0.178
Neg logit contributions
 our-0.194
 your-0.181
 my-0.181
 exercise-0.147
 victory-0.145
Token sorry
Token ID7926
Feature activation-1.64e-01

Pos logit contributions
 meant+0.167
 whispered+0.161
 said+0.156
books+0.154
jar+0.146
Neg logit contributions
 our-0.156
 my-0.142
 your-0.141
 exercise-0.138
ric-0.136
Token.
Token ID13
Feature activation-1.37e-01

Pos logit contributions
 meant+0.166
 said+0.164
 whispered+0.148
l+0.140
books+0.133
Neg logit contributions
 arrest-0.269
ric-0.266
latable-0.257
 our-0.253
 Wed-0.248
Token We
Token ID775
Feature activation-1.38e-01

Pos logit contributions
 meant+0.200
 whispered+0.188
Don+0.187
 said+0.181
books+0.178
Neg logit contributions
 our-0.165
 exercise-0.158
 victory-0.156
igs-0.151
 my-0.150
Token rang
Token ID28077
Feature activation-1.74e-01

Pos logit contributions
books+0.146
jar+0.124
 meant+0.122
 whispered+0.118
post+0.116
Neg logit contributions
 our-0.257
 your-0.238
 my-0.236
 arrest-0.215
 exercise-0.214
Token the
Token ID262
Feature activation-1.28e-01

Pos logit contributions
 meant+0.308
 whispered+0.300
books+0.298
 said+0.288
l+0.283
Neg logit contributions
 our-0.156
 my-0.135
 your-0.133
 arrest-0.128
ap-0.126
Token bell
Token ID8966
Feature activation-2.29e-01

Pos logit contributions
books+0.190
 meant+0.184
 whispered+0.176
oled+0.168
 widened+0.168
Neg logit contributions
 our-0.181
 your-0.165
 my-0.164
 victory-0.149
 exercise-0.144
Token and
Token ID290
Feature activation-1.04e-01

Pos logit contributions
books+0.337
Don+0.313
 meant+0.308
oled+0.305
jar+0.305
Neg logit contributions
 our-0.344
 your-0.314
 my-0.310
ap-0.261
 exercise-0.253
Token made
Token ID925
Feature activation-1.22e-01

Pos logit contributions
 meant+0.165
 whispered+0.163
 said+0.159
 spoke+0.153
books+0.145
Neg logit contributions
 our-0.121
latable-0.112
ap-0.109
igs-0.107
 my-0.107
Token the
Token ID262
Feature activation-1.24e-01

Pos logit contributions
 meant+0.196
 whispered+0.189
books+0.187
 said+0.185
l+0.178
Neg logit contributions
 our-0.128
 my-0.115
 your-0.114
 arrest-0.111
ap-0.110
Token park
Token ID3952
Feature activation-1.92e-01

Pos logit contributions
books+0.196
 meant+0.189
 whispered+0.180
oled+0.178
 spoke+0.173
Neg logit contributions
 our-0.166
 your-0.151
 my-0.149
 victory-0.134
 exercise-0.132
Token keeper
Token ID28356
Feature activation-2.08e-01

Pos logit contributions
books+0.277
 meant+0.266
 whispered+0.257
jar+0.247
Don+0.246
Neg logit contributions
 our-0.286
 my-0.258
 your-0.258
 exercise-0.217
 victory-0.216
Token felt
Token ID2936
Feature activation-1.21e-01

Pos logit contributions
books+0.234
jar+0.217
Don+0.194
post+0.193
oled+0.192
Neg logit contributions
 our-0.322
 my-0.307
 your-0.299
 how-0.282
 practice-0.275
Token better
Token ID1365
Feature activation-1.39e-01

Pos logit contributions
books+0.146
 meant+0.140
 whispered+0.135
jar+0.128
 spoke+0.125
Neg logit contributions
 our-0.201
 my-0.187
 your-0.187
 exercise-0.166
 victory-0.162
Token and
Token ID290
Feature activation-9.64e-02

Pos logit contributions
books+0.193
 meant+0.188
 whispered+0.178
 spoke+0.170
jar+0.168
Neg logit contributions
 our-0.206
 your-0.190
 my-0.189
 arrest-0.165
 exercise-0.164
Token thanked
Token ID26280
Feature activation-1.57e-01

Pos logit contributions
books+0.107
 meant+0.096
jar+0.090
 whispered+0.090
book+0.086
Neg logit contributions
 our-0.173
 my-0.162
 your-0.161
 exercise-0.144
 victory-0.139
Token the
Token ID262
Feature activation-1.29e-01

Pos logit contributions
 meant+0.233
books+0.233
 whispered+0.225
 said+0.213
 spoke+0.212
Neg logit contributions
 our-0.200
 my-0.175
 your-0.174
ap-0.165
 dive-0.159
Token dentist
Token ID38408
Feature activation-2.29e-01

Pos logit contributions
books+0.218
 meant+0.202
 widened+0.200
oled+0.197
 whispered+0.197
Neg logit contributions
 my-0.152
 your-0.149
 our-0.147
 exercise-0.133
 help-0.126
Token.
Token ID13
Feature activation-1.41e-01

Pos logit contributions
 meant+0.287
books+0.286
Don+0.276
jar+0.268
 whispered+0.267
Neg logit contributions
 our-0.350
 your-0.320
 my-0.319
ap-0.289
 exercise-0.283
Token When
Token ID1649
Feature activation-1.39e-01

Pos logit contributions
 meant+0.192
books+0.189
 faded+0.179
jar+0.175
 whispered+0.174
Neg logit contributions
 our-0.207
 your-0.200
 my-0.199
 exercise-0.163
 how-0.163
Token she
Token ID673
Feature activation-1.49e-01

Pos logit contributions
books+0.212
 meant+0.208
 whispered+0.199
jar+0.186
book+0.184
Neg logit contributions
 our-0.196
 your-0.180
 my-0.176
 exercise-0.153
 arrest-0.146
Token left
Token ID1364
Feature activation-1.44e-01

Pos logit contributions
books+0.204
 meant+0.186
jar+0.180
 whispered+0.178
post+0.177
Neg logit contributions
 our-0.216
 my-0.204
 your-0.201
 practice-0.178
 spend-0.177
Token the
Token ID262
Feature activation-1.24e-01

Pos logit contributions
books+0.274
 meant+0.267
 whispered+0.258
Don+0.248
jar+0.248
Neg logit contributions
 our-0.145
 your-0.137
 my-0.133
 arrest-0.099
 exercise-0.097
Token mom
Token ID1995
Feature activation-1.81e-01

Pos logit contributions
 whispered+0.135
 said+0.130
 meant+0.124
 gasped+0.123
books+0.122
Neg logit contributions
 our-0.176
 your-0.170
 gonna-0.168
 my-0.167
 exercise-0.164
Token and
Token ID290
Feature activation-1.05e-01

Pos logit contributions
 meant+0.205
books+0.204
 whispered+0.195
 spoke+0.178
jar+0.177
Neg logit contributions
 our-0.307
 my-0.283
 your-0.282
 exercise-0.255
 arrest-0.253
Token showed
Token ID3751
Feature activation-1.41e-01

Pos logit contributions
 meant+0.071
 whispered+0.070
books+0.065
 said+0.062
 spoke+0.059
Neg logit contributions
 our-0.225
 my-0.212
 your-0.211
 exercise-0.201
ap-0.199
Token her
Token ID607
Feature activation-1.43e-01

Pos logit contributions
 meant+0.184
 whispered+0.176
books+0.171
 said+0.166
 spoke+0.159
Neg logit contributions
 our-0.203
 my-0.186
ric-0.183
 arrest-0.181
 your-0.180
Token the
Token ID262
Feature activation-1.26e-01

Pos logit contributions
 meant+0.200
 whispered+0.196
 said+0.192
l+0.186
books+0.181
Neg logit contributions
 arrest-0.172
 our-0.165
 gonna-0.162
ap-0.160
latable-0.159
Token crown
Token ID12389
Feature activation-2.29e-01

Pos logit contributions
books+0.195
 meant+0.194
 whispered+0.183
jar+0.175
 spoke+0.174
Neg logit contributions
 our-0.166
 my-0.152
 your-0.151
 exercise-0.126
 arrest-0.126
Token.
Token ID13
Feature activation-1.43e-01

Pos logit contributions
books+0.283
Don+0.278
 meant+0.272
jar+0.261
 whispered+0.256
Neg logit contributions
 our-0.362
 your-0.327
 my-0.324
 how-0.285
 dive-0.283
Token
Token ID198
Feature activation-1.62e-01

Pos logit contributions
books+0.181
 meant+0.179
 whispered+0.167
jar+0.164
 spoke+0.158
Neg logit contributions
 our-0.231
 your-0.215
 my-0.211
ap-0.184
 dive-0.177
Token
Token ID198
Feature activation-1.60e-01

Pos logit contributions
 meant+0.216
books+0.201
 faded+0.194
 whispered+0.194
 spoke+0.188
Neg logit contributions
 our-0.260
 your-0.242
 my-0.234
 dive-0.193
ap-0.192
TokenMom
Token ID29252
Feature activation-1.84e-01

Pos logit contributions
books+0.196
 meant+0.196
 whispered+0.186
jar+0.172
 spoke+0.172
Neg logit contributions
 our-0.257
 my-0.237
 your-0.236
 exercise-0.210
 arrest-0.208
Token was
Token ID373
Feature activation-1.34e-01

Pos logit contributions
books+0.207
jar+0.184
Don+0.175
 meant+0.171
post+0.167
Neg logit contributions
 our-0.318
 my-0.302
 your-0.288
 how-0.275
 dive-0.264
Token her
Token ID607
Feature activation-1.37e-01

Pos logit contributions
books+0.140
 meant+0.124
jar+0.122
book+0.117
 whispered+0.116
Neg logit contributions
 our-0.156
 my-0.146
 your-0.145
 how-0.122
 arrest-0.121
Token leg
Token ID1232
Feature activation-1.92e-01

Pos logit contributions
 meant+0.167
 whispered+0.158
books+0.150
 said+0.149
l+0.145
Neg logit contributions
 our-0.224
 my-0.209
 your-0.204
 Wed-0.181
 myself-0.179
Token and
Token ID290
Feature activation-9.91e-02

Pos logit contributions
books+0.303
jar+0.278
book+0.270
Wow+0.269
Dear+0.267
Neg logit contributions
 our-0.220
 your-0.211
 my-0.201
 help-0.201
 how-0.199
Token called
Token ID1444
Feature activation-1.49e-01

Pos logit contributions
books+0.120
jar+0.099
book+0.095
 meant+0.093
Today+0.093
Neg logit contributions
 our-0.176
 your-0.166
 my-0.166
 how-0.147
 arrest-0.144
Token an
Token ID281
Feature activation-1.28e-01

Pos logit contributions
books+0.248
 meant+0.247
 whispered+0.238
jar+0.226
 spoke+0.225
Neg logit contributions
 our-0.175
 my-0.155
 your-0.155
 exercise-0.130
 arrest-0.129
Token ambulance
Token ID22536
Feature activation-2.29e-01

Pos logit contributions
books+0.173
 meant+0.164
 whispered+0.158
jar+0.153
 spoke+0.150
Neg logit contributions
 our-0.195
 my-0.181
 your-0.179
 exercise-0.158
 arrest-0.154
Token.
Token ID13
Feature activation-1.42e-01

Pos logit contributions
books+0.290
 meant+0.269
 whispered+0.257
jar+0.247
 widened+0.241
Neg logit contributions
 our-0.366
 your-0.332
 my-0.330
 arrest-0.309
ap-0.294
Token He
Token ID679
Feature activation-1.56e-01

Pos logit contributions
books+0.209
 meant+0.201
 whispered+0.191
 spoke+0.182
jar+0.182
Neg logit contributions
 our-0.204
 my-0.197
 your-0.191
 arrest-0.156
 these-0.146
Token told
Token ID1297
Feature activation-1.72e-01

Pos logit contributions
books+0.195
book+0.165
jar+0.163
post+0.155
Today+0.148
Neg logit contributions
 our-0.268
 my-0.255
 your-0.241
 how-0.223
 arrest-0.220
Token Tom
Token ID4186
Feature activation-1.90e-01

Pos logit contributions
 meant+0.190
 whispered+0.178
books+0.178
 said+0.174
l+0.167
Neg logit contributions
 our-0.279
ap-0.261
ric-0.256
 my-0.250
 dive-0.250
Token to
Token ID284
Feature activation-1.11e-01

Pos logit contributions
 meant+0.305
 whispered+0.289
 said+0.285
books+0.276
 spoke+0.273
Neg logit contributions
 our-0.232
 my-0.207
 your-0.200
ap-0.196
 dive-0.188
Token turns
Token ID4962
Feature activation-1.53e-01

Pos logit contributions
 meant+0.197
 whispered+0.187
books+0.184
 said+0.181
 spoke+0.169
Neg logit contributions
ric-0.151
 our-0.151
 my-0.144
 dive-0.137
latable-0.137
Token and
Token ID290
Feature activation-1.05e-01

Pos logit contributions
books+0.228
 meant+0.220
 whispered+0.216
Don+0.215
jar+0.215
Neg logit contributions
 our-0.209
 your-0.189
 my-0.179
 exercise-0.167
 arrest-0.164
Token listen
Token ID6004
Feature activation-1.55e-01

Pos logit contributions
 said+0.163
 whispered+0.160
 meant+0.146
 spoke+0.145
 wondered+0.144
Neg logit contributions
igs-0.133
ric-0.131
latable-0.131
igsaw-0.124
ocol-0.120
Token to
Token ID284
Feature activation-1.09e-01

Pos logit contributions
 meant+0.329
books+0.326
jar+0.319
 whispered+0.311
 widened+0.308
Neg logit contributions
 our-0.126
 your-0.107
 my-0.097
 how-0.068
 exercise-0.066
Token the
Token ID262
Feature activation-1.25e-01

Pos logit contributions
 meant+0.178
books+0.174
 whispered+0.173
 said+0.163
Don+0.161
Neg logit contributions
 our-0.114
 my-0.104
 your-0.100
 arrest-0.094
 dive-0.091
Token bell
Token ID8966
Feature activation-2.29e-01

Pos logit contributions
books+0.213
 meant+0.202
oled+0.198
book+0.194
 whispered+0.194
Neg logit contributions
 our-0.154
 your-0.137
 my-0.136
 exercise-0.132
 victory-0.118
Token next
Token ID1306
Feature activation-1.52e-01

Pos logit contributions
books+0.303
 meant+0.285
jar+0.278
Don+0.276
 whispered+0.275
Neg logit contributions
 our-0.357
 your-0.328
 my-0.317
ap-0.295
 exercise-0.290
Token time
Token ID640
Feature activation-1.49e-01

Pos logit contributions
books+0.178
 meant+0.176
 whispered+0.165
jar+0.154
 spoke+0.151
Neg logit contributions
 our-0.254
 my-0.235
 your-0.235
 arrest-0.212
 exercise-0.209
Token.
Token ID13
Feature activation-1.46e-01

Pos logit contributions
books+0.209
 meant+0.200
jar+0.191
Don+0.191
 whispered+0.189
Neg logit contributions
 our-0.234
 your-0.215
 my-0.201
ap-0.179
 exercise-0.174
Token
Token ID198
Feature activation-1.64e-01

Pos logit contributions
books+0.200
 meant+0.196
 whispered+0.188
 spoke+0.187
l+0.177
Neg logit contributions
 our-0.236
 your-0.214
 my-0.209
ap-0.173
 exercise-0.169
Token
Token ID198
Feature activation-1.62e-01

Pos logit contributions
 meant+0.251
books+0.244
 whispered+0.231
 spoke+0.228
 faded+0.225
Neg logit contributions
 our-0.238
 your-0.214
 my-0.204
 exercise-0.169
ap-0.163
Token fun
Token ID1257
Feature activation-1.61e-01

Pos logit contributions
books+0.225
jar+0.210
oled+0.210
 meant+0.209
 whispered+0.205
Neg logit contributions
 our-0.172
 your-0.151
 my-0.151
 exercise-0.119
 victory-0.117
Token.
Token ID13
Feature activation-1.43e-01

Pos logit contributions
books+0.256
 meant+0.249
 whispered+0.240
jar+0.235
Don+0.234
Neg logit contributions
 our-0.215
 your-0.194
 my-0.192
 exercise-0.160
ap-0.157
Token They
Token ID1119
Feature activation-1.47e-01

Pos logit contributions
books+0.154
 meant+0.152
 whispered+0.144
 spoke+0.136
jar+0.131
Neg logit contributions
 our-0.259
 your-0.238
 my-0.237
 exercise-0.210
ap-0.210
Token hoped
Token ID10719
Feature activation-1.81e-01

Pos logit contributions
books+0.209
jar+0.196
ooky+0.191
book+0.190
oled+0.184
Neg logit contributions
 spend-0.201
 our-0.192
 practice-0.185
 dive-0.184
 stay-0.180
Token the
Token ID262
Feature activation-1.37e-01

Pos logit contributions
 meant+0.275
books+0.264
 whispered+0.264
 said+0.250
 spoke+0.249
Neg logit contributions
 our-0.226
 my-0.209
 your-0.202
 arrest-0.189
ap-0.182
Token car
Token ID1097
Feature activation-2.28e-01

Pos logit contributions
books+0.214
 meant+0.207
 whispered+0.198
oled+0.196
 spoke+0.189
Neg logit contributions
 our-0.187
 my-0.168
 your-0.163
 victory-0.158
 exercise-0.158
Token ride
Token ID6594
Feature activation-2.00e-01

Pos logit contributions
oled+0.313
jar+0.303
books+0.302
book+0.274
Wow+0.271
Neg logit contributions
 our-0.335
 your-0.301
 my-0.293
 how-0.284
 exercise-0.272
Token would
Token ID561
Feature activation-1.46e-01

Pos logit contributions
jar+0.313
oled+0.311
books+0.301
Don+0.289
 widened+0.287
Neg logit contributions
 our-0.279
 your-0.248
 my-0.228
 victory-0.198
ap-0.197
Token be
Token ID307
Feature activation-1.42e-01

Pos logit contributions
books+0.306
 meant+0.300
 whispered+0.285
book+0.280
oled+0.279
Neg logit contributions
 our-0.124
 your-0.106
 my-0.104
 dive-0.081
 spend-0.078
Token over
Token ID625
Feature activation-1.29e-01

Pos logit contributions
books+0.241
jar+0.228
 meant+0.225
oled+0.223
Don+0.220
Neg logit contributions
 our-0.185
 your-0.163
 my-0.161
 exercise-0.127
 victory-0.127
Token soon
Token ID2582
Feature activation-1.76e-01

Pos logit contributions
books+0.208
 meant+0.200
 whispered+0.191
oled+0.190
jar+0.188
Neg logit contributions
 our-0.185
 your-0.167
 my-0.160
 victory-0.129
 exercise-0.129
Token down
Token ID866
Feature activation-1.38e-01

Pos logit contributions
books+0.236
 meant+0.231
 whispered+0.217
 spoke+0.210
jar+0.207
Neg logit contributions
 our-0.206
 your-0.194
 my-0.192
ap-0.156
 exercise-0.156
Token and
Token ID290
Feature activation-9.75e-02

Pos logit contributions
books+0.239
 meant+0.231
 whispered+0.216
 widened+0.212
oled+0.211
Neg logit contributions
 our-0.165
 your-0.163
 my-0.160
ap-0.115
 how-0.115
Token watched
Token ID7342
Feature activation-1.54e-01

Pos logit contributions
books+0.097
 meant+0.092
jar+0.082
 whispered+0.078
book+0.078
Neg logit contributions
 our-0.183
 your-0.173
 my-0.172
 exercise-0.155
 dive-0.154
Token as
Token ID355
Feature activation-1.14e-01

Pos logit contributions
books+0.268
 meant+0.267
 whispered+0.258
jar+0.244
 spoke+0.244
Neg logit contributions
 our-0.168
 my-0.148
 your-0.147
 exercise-0.123
 arrest-0.122
Token the
Token ID262
Feature activation-1.30e-01

Pos logit contributions
books+0.192
 meant+0.190
 whispered+0.184
jar+0.176
 spoke+0.174
Neg logit contributions
 our-0.133
 your-0.122
 my-0.119
 exercise-0.100
 victory-0.097
Token bus
Token ID1323
Feature activation-2.28e-01

Pos logit contributions
books+0.217
 meant+0.214
oled+0.207
 whispered+0.201
 widened+0.198
Neg logit contributions
 our-0.150
 my-0.144
 your-0.136
 exercise-0.132
 victory-0.125
Token drove
Token ID10357
Feature activation-1.65e-01

Pos logit contributions
jar+0.304
Don+0.296
books+0.296
 wil+0.279
lessness+0.277
Neg logit contributions
 race-0.268
 spend-0.265
 victory-0.255
ap-0.252
 join-0.252
Token around
Token ID1088
Feature activation-1.38e-01

Pos logit contributions
 meant+0.282
books+0.274
oled+0.272
jar+0.271
 whispered+0.269
Neg logit contributions
 our-0.185
 your-0.180
 my-0.175
 victory-0.131
 those-0.127
Token.
Token ID13
Feature activation-1.45e-01

Pos logit contributions
 meant+0.271
 whispered+0.261
books+0.261
jar+0.249
oled+0.248
Neg logit contributions
 our-0.132
 your-0.122
 my-0.111
 explore-0.090
 join-0.089
Token Everywhere
Token ID48201
Feature activation-1.60e-01

Pos logit contributions
books+0.183
 meant+0.181
 whispered+0.173
jar+0.164
 spoke+0.159
Neg logit contributions
 our-0.227
 my-0.209
 your-0.209
 exercise-0.187
ap-0.185
Token she
Token ID673
Feature activation-1.65e-01

Pos logit contributions
 meant+0.138
books+0.133
 whispered+0.127
 spoke+0.114
 said+0.110
Neg logit contributions
 our-0.317
 my-0.296
 your-0.294
 exercise-0.272
 arrest-0.271
Token Everyone
Token ID11075
Feature activation-1.88e-01

Pos logit contributions
books+0.170
 meant+0.169
 whispered+0.160
jar+0.154
 spoke+0.147
Neg logit contributions
 our-0.236
 my-0.216
 your-0.215
ap-0.190
 exercise-0.188
Token wanted
Token ID2227
Feature activation-1.62e-01

Pos logit contributions
books+0.212
jar+0.201
post+0.182
 meant+0.181
book+0.169
Neg logit contributions
 our-0.345
 my-0.319
 your-0.318
 dive-0.281
 how-0.280
Token to
Token ID284
Feature activation-1.08e-01

Pos logit contributions
 meant+0.289
books+0.285
 whispered+0.278
 spoke+0.265
 said+0.263
Neg logit contributions
 our-0.165
 my-0.145
 your-0.143
ap-0.126
 exercise-0.123
Token share
Token ID2648
Feature activation-1.48e-01

Pos logit contributions
 meant+0.161
 said+0.161
books+0.160
 whispered+0.159
 spoke+0.147
Neg logit contributions
ric-0.132
 our-0.129
igs-0.119
ap-0.111
 your-0.111
Token the
Token ID262
Feature activation-1.31e-01

Pos logit contributions
books+0.261
 meant+0.260
 whispered+0.251
jar+0.242
 spoke+0.239
Neg logit contributions
 our-0.164
 my-0.144
 your-0.144
 exercise-0.116
 arrest-0.114
Token treasure
Token ID14068
Feature activation-2.28e-01

Pos logit contributions
books+0.235
 meant+0.233
 whispered+0.224
jar+0.216
Don+0.215
Neg logit contributions
 our-0.142
 my-0.128
 your-0.125
 victory-0.102
 exercise-0.100
Token together
Token ID1978
Feature activation-1.55e-01

Pos logit contributions
books+0.287
 meant+0.286
 whispered+0.273
 spoke+0.252
jar+0.252
Neg logit contributions
 our-0.357
 my-0.327
 your-0.327
 exercise-0.291
 arrest-0.289
Token.
Token ID13
Feature activation-1.54e-01

Pos logit contributions
 meant+0.208
books+0.207
Don+0.203
 whispered+0.203
jar+0.201
Neg logit contributions
 our-0.233
 my-0.215
 your-0.215
 victory-0.185
ap-0.183
Token Now
Token ID2735
Feature activation-1.72e-01

Pos logit contributions
books+0.190
 meant+0.188
 whispered+0.178
jar+0.171
 spoke+0.164
Neg logit contributions
 our-0.252
 my-0.231
 your-0.230
 exercise-0.202
ap-0.202
Token,
Token ID11
Feature activation-1.31e-01

Pos logit contributions
 meant+0.254
books+0.249
 whispered+0.242
 spoke+0.228
 said+0.227
Neg logit contributions
 our-0.230
 my-0.209
 your-0.207
 exercise-0.184
 arrest-0.182
Token the
Token ID262
Feature activation-1.42e-01

Pos logit contributions
 meant+0.202
books+0.193
 whispered+0.191
 said+0.183
 spoke+0.182
Neg logit contributions
 our-0.162
 my-0.147
 your-0.145
latable-0.131
 arrest-0.131
Token John
Token ID1757
Feature activation+3.40e-01

Pos logit contributions
jar+0.239
books+0.227
post+0.205
book+0.202
ooky+0.197
Neg logit contributions
 our-0.286
 spend-0.281
 practice-0.278
ap-0.272
 how-0.269
Token chose
Token ID7690
Feature activation+3.01e-01

Pos logit contributions
 our+0.632
 my+0.588
 your+0.587
ap+0.529
 arrest+0.529
Neg logit contributions
books-0.334
 meant-0.319
 whispered-0.295
jar-0.288
Don-0.270
Token some
Token ID617
Feature activation+2.99e-01

Pos logit contributions
 our+0.325
 my+0.285
 your+0.283
 arrest+0.238
 exercise+0.237
Neg logit contributions
 meant-0.525
books-0.521
 whispered-0.505
 spoke-0.477
jar-0.474
Token carrots
Token ID34397
Feature activation+2.48e-01

Pos logit contributions
 our+0.413
 my+0.375
 your+0.374
 exercise+0.327
 arrest+0.317
Neg logit contributions
books-0.433
 meant-0.428
 whispered-0.411
jar-0.390
 spoke-0.386
Token and
Token ID290
Feature activation+2.96e-01

Pos logit contributions
 our+0.360
 your+0.329
 my+0.327
 exercise+0.283
ap+0.280
Neg logit contributions
books-0.346
 meant-0.336
 whispered-0.324
jar-0.314
 spoke-0.306
Token cuc
Token ID38421
Feature activation+3.87e-01

Pos logit contributions
 our+0.469
 my+0.429
 your+0.425
 arrest+0.392
ap+0.391
Neg logit contributions
 meant-0.370
 whispered-0.356
books-0.348
 said-0.336
 spoke-0.328
Tokenumbers
Token ID17024
Feature activation+2.05e-01

Pos logit contributions
 our+0.689
 my+0.638
 your+0.624
 arrest+0.582
 exercise+0.577
Neg logit contributions
books-0.418
 meant-0.405
 whispered-0.379
l-0.360
book-0.357
Token to
Token ID284
Feature activation+2.86e-01

Pos logit contributions
 our+0.306
 your+0.283
 my+0.277
 dive+0.235
 exercise+0.233
Neg logit contributions
books-0.278
jar-0.270
 meant-0.262
 whispered-0.261
Don-0.257
Token take
Token ID1011
Feature activation+2.41e-01

Pos logit contributions
 our+0.335
 my+0.290
ap+0.289
 your+0.279
 arrest+0.275
Neg logit contributions
 meant-0.427
 said-0.424
 whispered-0.418
books-0.411
 spoke-0.389
Token home
Token ID1363
Feature activation+2.04e-01

Pos logit contributions
 our+0.269
 my+0.238
 your+0.236
 exercise+0.201
 arrest+0.201
Neg logit contributions
books-0.408
 meant-0.408
 whispered-0.392
 spoke-0.370
 said-0.367
Token.
Token ID13
Feature activation+2.06e-01

Pos logit contributions
 our+0.350
 my+0.326
 your+0.322
 arrest+0.298
 exercise+0.294
Neg logit contributions
books-0.221
 meant-0.219
 whispered-0.207
 said-0.189
l-0.187
Token
Token ID198
Feature activation+3.35e-01

Pos logit contributions
books+0.273
jar+0.260
ooky+0.237
post+0.227
book+0.226
Neg logit contributions
 our-0.306
 my-0.295
 your-0.290
 how-0.288
 dive-0.273
Token
Token ID198
Feature activation+2.98e-01

Pos logit contributions
 our+0.586
 your+0.545
 my+0.544
ap+0.478
 exercise+0.476
Neg logit contributions
 meant-0.380
books-0.371
 whispered-0.347
 spoke-0.320
jar-0.315
TokenSuddenly
Token ID38582
Feature activation+2.58e-01

Pos logit contributions
 our+0.506
 my+0.468
 your+0.468
 exercise+0.414
 arrest+0.412
Neg logit contributions
 meant-0.349
books-0.342
 whispered-0.323
 spoke-0.298
jar-0.294
Token,
Token ID11
Feature activation+3.03e-01

Pos logit contributions
 our+0.351
 my+0.316
 your+0.312
 exercise+0.283
 arrest+0.278
Neg logit contributions
 meant-0.371
books-0.362
 whispered-0.361
 spoke-0.339
 said-0.338
Token Bug
Token ID15217
Feature activation+2.44e-01

Pos logit contributions
 our+0.414
 my+0.373
 your+0.368
ap+0.357
ric+0.349
Neg logit contributions
 meant-0.419
 whispered-0.411
books-0.397
 said-0.391
 spoke-0.384
Tokengy
Token ID1360
Feature activation+3.56e-01

Pos logit contributions
 our+0.344
 my+0.311
 your+0.305
 dive+0.275
ap+0.268
Neg logit contributions
books-0.370
jar-0.319
book-0.308
 meant-0.301
post-0.296
Token heard
Token ID2982
Feature activation+2.48e-01

Pos logit contributions
 our+0.651
 your+0.607
 my+0.603
 arrest+0.570
 dive+0.553
Neg logit contributions
books-0.371
jar-0.323
 meant-0.304
Don-0.286
book-0.284
Token a
Token ID257
Feature activation+2.61e-01

Pos logit contributions
 our+0.260
 my+0.224
 your+0.224
ap+0.190
 exercise+0.186
Neg logit contributions
 meant-0.441
books-0.437
 whispered-0.425
 spoke-0.403
jar-0.400
Token loud
Token ID7812
Feature activation+2.20e-01

Pos logit contributions
 our+0.390
 my+0.361
 your+0.353
 dive+0.312
 exercise+0.312
Neg logit contributions
books-0.366
 meant-0.357
 whispered-0.331
book-0.316
 spoke-0.306
Token noise
Token ID7838
Feature activation+1.89e-01

Pos logit contributions
 our+0.356
 my+0.326
 your+0.326
 exercise+0.291
 arrest+0.289
Neg logit contributions
 meant-0.266
books-0.265
 whispered-0.253
 spoke-0.233
jar-0.233
Token.
Token ID13
Feature activation+2.02e-01

Pos logit contributions
 our+0.270
 your+0.247
 my+0.247
 dive+0.234
 exercise+0.230
Neg logit contributions
books-0.259
 meant-0.233
 whispered-0.231
jar-0.228
Don-0.220
Token
Token ID129
Feature activation+4.75e-01

Pos logit contributions
books+0.199
 widened+0.194
 whispered+0.191
 meant+0.190
jar+0.181
Neg logit contributions
 our-0.182
 your-0.169
 my-0.168
 job-0.141
 exercise-0.141
Token
Token ID241
Feature activation+4.01e-01

Pos logit contributions
 our+0.664
 my+0.590
 your+0.586
 dive+0.530
 exercise+0.529
Neg logit contributions
books-0.671
 meant-0.667
 whispered-0.643
 said-0.601
Don-0.598
TokenLet
Token ID5756
Feature activation+3.17e-01

Pos logit contributions
 our+0.744
 your+0.692
 my+0.692
 exercise+0.627
ap+0.619
Neg logit contributions
books-0.390
 meant-0.386
 whispered-0.367
jar-0.333
 spoke-0.331
Tokenâ
Token ID22940
Feature activation+3.23e-01

Pos logit contributions
 our+0.509
 my+0.461
 your+0.454
ric+0.451
 arrest+0.451
Neg logit contributions
 meant-0.389
 said-0.355
 whispered-0.354
 spoke-0.326
books-0.323
Token
Token ID26391
Feature activation+3.23e-01

Pos logit contributions
 our+0.440
 my+0.395
 your+0.390
 arrest+0.330
 victory+0.321
Neg logit contributions
 meant-0.491
books-0.486
 whispered-0.464
 spoke-0.438
 said-0.433
Token
Token ID8151
Feature activation+3.58e-01

Pos logit contributions
 our+0.541
 my+0.502
 your+0.497
 exercise+0.450
 victory+0.441
Neg logit contributions
books-0.375
 meant-0.364
 whispered-0.349
 said-0.329
 spoke-0.325
Tokens
Token ID82
Feature activation+3.11e-01

Pos logit contributions
 our+0.585
 your+0.555
 my+0.537
ap+0.513
ric+0.511
Neg logit contributions
 whispered-0.374
jar-0.360
 meant-0.360
 faded-0.357
books-0.350
Token go
Token ID467
Feature activation+2.79e-01

Pos logit contributions
 our+0.373
 arrest+0.330
latable+0.328
 my+0.328
 your+0.312
Neg logit contributions
 meant-0.492
 said-0.470
 whispered-0.460
 spoke-0.424
books-0.413
Token find
Token ID1064
Feature activation+2.71e-01

Pos logit contributions
 our+0.400
 arrest+0.362
 my+0.358
 your+0.348
ric+0.341
Neg logit contributions
 whispered-0.353
 meant-0.352
 said-0.344
books-0.331
l-0.331
Token something
Token ID1223
Feature activation+2.65e-01

Pos logit contributions
 our+0.263
ap+0.249
ric+0.248
 arrest+0.244
 dive+0.240
Neg logit contributions
 meant-0.459
 whispered-0.443
 said-0.436
books-0.415
l-0.406
Token else
Token ID2073
Feature activation+2.52e-01

Pos logit contributions
 our+0.391
 your+0.357
 my+0.357
 exercise+0.313
 dive+0.301
Neg logit contributions
books-0.368
 meant-0.357
 whispered-0.348
jar-0.335
Don-0.327
Token on
Token ID319
Feature activation+4.11e-01

Pos logit contributions
 meant+0.232
books+0.231
 whispered+0.224
 spoke+0.212
 said+0.212
Neg logit contributions
 our-0.158
 my-0.145
 your-0.140
ap-0.119
 exercise-0.119
Token the
Token ID262
Feature activation+3.84e-01

Pos logit contributions
 our+0.516
 my+0.461
 your+0.454
ap+0.405
 arrest+0.405
Neg logit contributions
books-0.626
 meant-0.625
 whispered-0.599
 said-0.561
jar-0.558
Token table
Token ID3084
Feature activation+2.91e-01

Pos logit contributions
 our+0.611
 my+0.560
 your+0.559
ap+0.506
 arrest+0.498
Neg logit contributions
books-0.455
 meant-0.453
 whispered-0.436
jar-0.408
l-0.401
Token in
Token ID287
Feature activation+3.55e-01

Pos logit contributions
 our+0.446
 your+0.422
 my+0.420
ap+0.361
 exercise+0.355
Neg logit contributions
books-0.363
 meant-0.357
Don-0.355
 whispered-0.347
jar-0.334
Token the
Token ID262
Feature activation+3.29e-01

Pos logit contributions
 our+0.344
 my+0.304
 your+0.301
 arrest+0.223
 exercise+0.216
Neg logit contributions
books-0.681
 meant-0.677
 whispered-0.658
 spoke-0.634
jar-0.627
Token living
Token ID2877
Feature activation+3.50e-01

Pos logit contributions
 our+0.473
 my+0.434
 your+0.433
 exercise+0.374
 arrest+0.364
Neg logit contributions
books-0.474
 meant-0.470
 whispered-0.446
 spoke-0.420
jar-0.415
Token room
Token ID2119
Feature activation+2.72e-01

Pos logit contributions
 our+0.506
 my+0.459
 exercise+0.453
 your+0.453
 arrest+0.443
Neg logit contributions
books-0.447
 meant-0.441
 whispered-0.425
 said-0.403
book-0.398
Token.
Token ID13
Feature activation+2.48e-01

Pos logit contributions
 our+0.419
 your+0.395
 my+0.389
ap+0.341
 exercise+0.322
Neg logit contributions
books-0.343
 meant-0.342
Don-0.341
 whispered-0.323
jar-0.318
Token It
Token ID632
Feature activation+2.20e-01

Pos logit contributions
 our+0.375
 my+0.359
 your+0.355
 exercise+0.290
ap+0.290
Neg logit contributions
 meant-0.330
books-0.328
 whispered-0.307
 faded-0.294
 spoke-0.292
Token is
Token ID318
Feature activation+2.43e-01

Pos logit contributions
 our+0.342
 my+0.328
 your+0.317
 dive+0.280
 exercise+0.271
Neg logit contributions
books-0.309
jar-0.271
oled-0.271
post-0.254
Don-0.252
Token big
Token ID1263
Feature activation+2.48e-01

Pos logit contributions
 our+0.318
 my+0.292
 your+0.290
 exercise+0.235
 victory+0.217
Neg logit contributions
books-0.397
 whispered-0.373
 meant-0.370
 spoke-0.368
oled-0.367
Token
Token ID129
Feature activation+4.22e-01

Pos logit contributions
 our+0.775
 my+0.703
 your+0.701
 exercise+0.644
 arrest+0.627
Neg logit contributions
books-0.527
 meant-0.520
 whispered-0.506
 said-0.473
 spoke-0.460
Token
Token ID241
Feature activation+3.78e-01

Pos logit contributions
 our+0.569
 my+0.519
 your+0.516
 exercise+0.445
 arrest+0.441
Neg logit contributions
 meant-0.623
books-0.622
 whispered-0.596
jar-0.563
 spoke-0.560
TokenLet
Token ID5756
Feature activation+3.09e-01

Pos logit contributions
 our+0.660
 your+0.620
 my+0.616
 exercise+0.561
ap+0.550
Neg logit contributions
books-0.396
 meant-0.392
 whispered-0.383
 spoke-0.350
jar-0.347
Tokenâ
Token ID22940
Feature activation+3.13e-01

Pos logit contributions
 our+0.481
ric+0.435
igs+0.431
 my+0.429
 arrest+0.424
Neg logit contributions
 meant-0.396
 said-0.361
 whispered-0.358
 spoke-0.333
books-0.326
Token
Token ID26391
Feature activation+3.16e-01

Pos logit contributions
 our+0.436
 my+0.391
 your+0.388
 arrest+0.333
 exercise+0.324
Neg logit contributions
 meant-0.462
books-0.460
 whispered-0.438
 spoke-0.413
 said-0.407
Token
Token ID8151
Feature activation+3.52e-01

Pos logit contributions
 our+0.500
 my+0.460
 your+0.456
 exercise+0.408
 victory+0.399
Neg logit contributions
books-0.395
 meant-0.386
 whispered-0.370
 spoke-0.347
 said-0.347
Tokens
Token ID82
Feature activation+3.05e-01

Pos logit contributions
 our+0.585
 your+0.553
 my+0.537
ap+0.515
ric+0.509
Neg logit contributions
 whispered-0.367
 meant-0.349
jar-0.345
books-0.341
 faded-0.340
Token explore
Token ID7301
Feature activation+2.67e-01

Pos logit contributions
 our+0.345
latable+0.304
 my+0.300
 arrest+0.295
 your+0.284
Neg logit contributions
 meant-0.496
 said-0.468
 whispered-0.458
 spoke-0.426
books-0.417
Token and
Token ID290
Feature activation+2.80e-01

Pos logit contributions
 our+0.294
 my+0.261
 arrest+0.257
 your+0.253
ap+0.245
Neg logit contributions
 meant-0.428
books-0.423
 whispered-0.416
 said-0.397
l-0.385
Token have
Token ID423
Feature activation+2.64e-01

Pos logit contributions
 our+0.381
 my+0.342
 your+0.331
latable+0.328
ric+0.323
Neg logit contributions
 whispered-0.398
 meant-0.396
 said-0.395
 spoke-0.365
 wondered-0.341
Token some
Token ID617
Feature activation+2.32e-01

Pos logit contributions
 our+0.327
 my+0.290
ap+0.288
 your+0.287
ric+0.276
Neg logit contributions
 meant-0.401
books-0.390
 whispered-0.389
 said-0.370
 spoke-0.361
Token So
Token ID1406
Feature activation+1.21e-01

Pos logit contributions
 our+0.236
 my+0.225
 your+0.221
 exercise+0.184
ap+0.184
Neg logit contributions
books-0.178
 meant-0.174
 whispered-0.162
 faded-0.156
 spoke-0.155
Token he
Token ID339
Feature activation+1.37e-01

Pos logit contributions
 our+0.177
 my+0.161
 your+0.160
 exercise+0.143
 arrest+0.143
Neg logit contributions
 meant-0.164
books-0.161
 whispered-0.158
 said-0.147
 spoke-0.147
Token put
Token ID1234
Feature activation+1.38e-01

Pos logit contributions
 our+0.232
 my+0.219
 your+0.214
 dive+0.202
 exercise+0.199
Neg logit contributions
books-0.161
jar-0.141
 meant-0.135
lessness-0.132
oled-0.130
Token the
Token ID262
Feature activation+1.57e-01

Pos logit contributions
 our+0.144
 my+0.126
 your+0.125
 exercise+0.102
 arrest+0.100
Neg logit contributions
books-0.250
 meant-0.249
 whispered-0.240
jar-0.229
 spoke-0.228
Token banana
Token ID25996
Feature activation+8.26e-02

Pos logit contributions
 our+0.181
 my+0.166
 your+0.162
 exercise+0.137
 arrest+0.132
Neg logit contributions
books-0.266
 meant-0.264
 whispered-0.256
jar-0.243
 spoke-0.243
Token in
Token ID287
Feature activation+1.82e-01

Pos logit contributions
 our+0.109
 my+0.098
 your+0.098
 exercise+0.084
 arrest+0.083
Neg logit contributions
books-0.126
 meant-0.125
 whispered-0.120
jar-0.114
 spoke-0.113
Token a
Token ID257
Feature activation+1.55e-01

Pos logit contributions
 our+0.164
 your+0.143
 my+0.142
 arrest+0.099
 exercise+0.099
Neg logit contributions
 meant-0.368
books-0.367
 whispered-0.356
 spoke-0.350
oled-0.339
Token bowl
Token ID9396
Feature activation+8.83e-02

Pos logit contributions
 our+0.240
 my+0.221
 your+0.220
 exercise+0.195
 arrest+0.192
Neg logit contributions
 meant-0.199
books-0.198
 whispered-0.188
 spoke-0.175
jar-0.175
Token of
Token ID286
Feature activation+1.82e-01

Pos logit contributions
 our+0.126
 your+0.118
 my+0.117
ap+0.102
 dive+0.098
Neg logit contributions
books-0.126
 meant-0.120
 whispered-0.114
Don-0.114
 spoke-0.113
Token water
Token ID1660
Feature activation+9.50e-02

Pos logit contributions
 our+0.243
 my+0.220
 your+0.219
 exercise+0.189
 arrest+0.185
Neg logit contributions
books-0.277
 meant-0.276
 whispered-0.264
jar-0.250
 spoke-0.249
Token and
Token ID290
Feature activation+1.54e-01

Pos logit contributions
 our+0.133
 your+0.123
 my+0.120
 dive+0.102
 how+0.100
Neg logit contributions
books-0.143
 meant-0.133
jar-0.130
Don-0.130
 whispered-0.127
Token He
Token ID679
Feature activation+1.47e-01

Pos logit contributions
 our+0.262
 my+0.243
 your+0.241
 exercise+0.216
 dive+0.211
Neg logit contributions
books-0.192
 meant-0.189
 whispered-0.181
jar-0.169
 spoke-0.168
Token loved
Token ID6151
Feature activation+1.56e-01

Pos logit contributions
 our+0.240
 my+0.224
 your+0.223
 dive+0.217
 explore+0.215
Neg logit contributions
books-0.181
oled-0.173
jar-0.171
book-0.167
post-0.156
Token to
Token ID284
Feature activation+1.84e-01

Pos logit contributions
 our+0.181
ap+0.170
 my+0.158
 your+0.156
latable+0.146
Neg logit contributions
 meant-0.244
books-0.234
 whispered-0.232
 said-0.222
 spoke-0.221
Token store
Token ID3650
Feature activation+1.26e-01

Pos logit contributions
 our+0.197
 my+0.171
 your+0.158
igsaw+0.155
ric+0.151
Neg logit contributions
 whispered-0.309
 meant-0.295
 said-0.290
books-0.288
 wondered-0.287
Token food
Token ID2057
Feature activation+9.99e-02

Pos logit contributions
 our+0.159
 my+0.143
 your+0.142
 arrest+0.123
ap+0.123
Neg logit contributions
 meant-0.196
books-0.195
 whispered-0.188
 spoke-0.176
 said-0.174
Token in
Token ID287
Feature activation+1.89e-01

Pos logit contributions
 our+0.134
 your+0.122
 my+0.121
 exercise+0.106
 dive+0.100
Neg logit contributions
books-0.152
 meant-0.149
 whispered-0.144
jar-0.141
Don-0.140
Token his
Token ID465
Feature activation+1.52e-01

Pos logit contributions
 our+0.194
 my+0.167
 your+0.167
ap+0.150
 exercise+0.147
Neg logit contributions
 meant-0.335
books-0.332
 whispered-0.319
jar-0.305
 said-0.300
Token tunnel
Token ID13275
Feature activation+1.08e-01

Pos logit contributions
 our+0.223
 my+0.204
 your+0.201
 arrest+0.191
 exercise+0.187
Neg logit contributions
 whispered-0.183
 meant-0.181
books-0.179
 said-0.171
 needed-0.163
Token for
Token ID329
Feature activation+1.58e-01

Pos logit contributions
 our+0.157
 your+0.148
 my+0.146
ap+0.130
 exercise+0.128
Neg logit contributions
books-0.141
 meant-0.141
Don-0.138
 whispered-0.133
jar-0.133
Token the
Token ID262
Feature activation+1.44e-01

Pos logit contributions
 our+0.180
 my+0.159
 your+0.158
ap+0.142
 exercise+0.132
Neg logit contributions
books-0.266
 meant-0.263
 whispered-0.254
 spoke-0.239
jar-0.238
Token winter
Token ID7374
Feature activation+7.57e-02

Pos logit contributions
 our+0.175
 my+0.160
 exercise+0.155
 your+0.154
 victory+0.148
Neg logit contributions
books-0.252
oled-0.238
 widened-0.235
 meant-0.234
book-0.232
Token it
Token ID340
Feature activation+2.23e-01

Pos logit contributions
 our+0.320
 my+0.288
 arrest+0.286
latable+0.284
ap+0.275
Neg logit contributions
 whispered-0.370
 meant-0.368
 said-0.355
books-0.342
 spoke-0.331
Token.
Token ID13
Feature activation+2.18e-01

Pos logit contributions
 our+0.279
 my+0.264
 your+0.261
 dive+0.228
 exercise+0.227
Neg logit contributions
books-0.333
 meant-0.328
Don-0.324
oled-0.316
jar-0.313
Token She
Token ID1375
Feature activation+2.05e-01

Pos logit contributions
 our+0.361
 my+0.344
 your+0.342
ap+0.293
 arrest+0.291
Neg logit contributions
 meant-0.254
books-0.252
 whispered-0.234
jar-0.217
 spoke-0.217
Token held
Token ID2714
Feature activation+1.85e-01

Pos logit contributions
 our+0.395
 my+0.371
 your+0.370
 exercise+0.335
 dive+0.334
Neg logit contributions
books-0.188
 meant-0.177
jar-0.158
 whispered-0.157
Don-0.145
Token onto
Token ID4291
Feature activation+2.22e-01

Pos logit contributions
 our+0.243
 my+0.215
ric+0.210
 arrest+0.209
 your+0.208
Neg logit contributions
 meant-0.266
 whispered-0.264
books-0.254
 said-0.250
 spoke-0.238
Token the
Token ID262
Feature activation+2.17e-01

Pos logit contributions
 our+0.260
 my+0.227
 your+0.221
ap+0.211
ric+0.204
Neg logit contributions
 meant-0.349
books-0.346
 whispered-0.344
 said-0.322
 spoke-0.317
Token truck
Token ID7779
Feature activation+1.37e-01

Pos logit contributions
 our+0.315
 my+0.285
 your+0.284
 exercise+0.249
 arrest+0.248
Neg logit contributions
 meant-0.296
books-0.296
 whispered-0.285
jar-0.268
 spoke-0.264
Token and
Token ID290
Feature activation+2.26e-01

Pos logit contributions
 our+0.211
 my+0.193
 your+0.193
 exercise+0.171
ap+0.169
Neg logit contributions
books-0.178
 meant-0.177
 whispered-0.168
jar-0.157
 spoke-0.156
Token hugged
Token ID41130
Feature activation+1.54e-01

Pos logit contributions
 our+0.405
 your+0.376
 my+0.376
 exercise+0.339
 arrest+0.335
Neg logit contributions
books-0.239
 meant-0.232
 whispered-0.213
jar-0.204
 spoke-0.194
Token it
Token ID340
Feature activation+1.53e-01

Pos logit contributions
 our+0.184
 my+0.163
 your+0.161
 exercise+0.139
 arrest+0.139
Neg logit contributions
books-0.252
 meant-0.250
 whispered-0.242
 spoke-0.227
Don-0.226
Token tight
Token ID5381
Feature activation+1.63e-01

Pos logit contributions
 our+0.195
 my+0.183
 your+0.180
ap+0.160
 victory+0.158
Neg logit contributions
books-0.240
 meant-0.239
Don-0.224
jar-0.222
 whispered-0.220
Token she
Token ID673
Feature activation+3.62e-01

Pos logit contributions
books+0.207
 meant+0.201
 whispered+0.196
jar+0.189
l+0.182
Neg logit contributions
 our-0.205
 my-0.188
 your-0.186
 arrest-0.160
 exercise-0.160
Token decided
Token ID3066
Feature activation+3.05e-01

Pos logit contributions
 our+0.681
 your+0.652
 my+0.643
 spend+0.580
ap+0.574
Neg logit contributions
books-0.372
jar-0.306
 meant-0.304
post-0.279
Don-0.278
Token to
Token ID284
Feature activation+3.32e-01

Pos logit contributions
 our+0.346
 my+0.302
 your+0.293
ap+0.273
 arrest+0.265
Neg logit contributions
 meant-0.510
 whispered-0.498
books-0.497
 said-0.475
 spoke-0.472
Token bake
Token ID28450
Feature activation+2.48e-01

Pos logit contributions
 our+0.404
ric+0.390
igs+0.369
 my+0.352
latable+0.339
Neg logit contributions
 meant-0.492
 said-0.480
 whispered-0.479
 spoke-0.438
books-0.421
Token a
Token ID257
Feature activation+2.89e-01

Pos logit contributions
 our+0.279
 your+0.249
 my+0.248
 exercise+0.205
ap+0.195
Neg logit contributions
books-0.426
 meant-0.426
 whispered-0.413
jar-0.398
 spoke-0.395
Token pie
Token ID2508
Feature activation+2.36e-01

Pos logit contributions
 our+0.384
 my+0.348
 your+0.346
 exercise+0.300
ap+0.291
Neg logit contributions
 meant-0.436
books-0.434
 whispered-0.415
 spoke-0.390
jar-0.388
Token.
Token ID13
Feature activation+2.25e-01

Pos logit contributions
 our+0.327
 my+0.297
 your+0.297
 exercise+0.259
ap+0.254
Neg logit contributions
books-0.340
 meant-0.339
 whispered-0.326
 spoke-0.308
jar-0.308
Token She
Token ID1375
Feature activation+2.15e-01

Pos logit contributions
 our+0.382
 my+0.367
 your+0.364
 exercise+0.314
 arrest+0.310
Neg logit contributions
 meant-0.251
books-0.249
 whispered-0.231
 faded-0.217
 spoke-0.216
Token put
Token ID1234
Feature activation+2.20e-01

Pos logit contributions
 our+0.363
 your+0.341
 my+0.340
 spend+0.303
 exercise+0.301
Neg logit contributions
books-0.240
jar-0.219
 meant-0.217
Don-0.200
oled-0.198
Token the
Token ID262
Feature activation+2.42e-01

Pos logit contributions
 our+0.239
 my+0.210
 your+0.208
 exercise+0.177
 arrest+0.176
Neg logit contributions
 meant-0.383
books-0.382
 whispered-0.368
 spoke-0.348
jar-0.347
Token ac
Token ID936
Feature activation+2.20e-01

Pos logit contributions
 our+0.303
 my+0.271
 your+0.271
 arrest+0.235
 exercise+0.234
Neg logit contributions
 meant-0.379
books-0.375
 whispered-0.361
 said-0.340
jar-0.340
Token other
Token ID584
Feature activation+2.96e-01

Pos logit contributions
 our+0.483
 your+0.443
 my+0.435
 exercise+0.396
 victory+0.389
Neg logit contributions
books-0.466
 meant-0.444
jar-0.433
 whispered-0.427
Don-0.407
Token.
Token ID13
Feature activation+2.59e-01

Pos logit contributions
 our+0.436
 your+0.395
 my+0.390
 exercise+0.344
ap+0.337
Neg logit contributions
books-0.415
 meant-0.401
jar-0.389
 whispered-0.385
Don-0.369
Token They
Token ID1119
Feature activation+2.50e-01

Pos logit contributions
 our+0.419
 my+0.392
 your+0.390
ap+0.334
 exercise+0.327
Neg logit contributions
 meant-0.323
books-0.323
 whispered-0.303
 spoke-0.285
jar-0.285
Token were
Token ID547
Feature activation+2.44e-01

Pos logit contributions
 our+0.415
 your+0.386
 my+0.375
 spend+0.374
 practice+0.359
Neg logit contributions
books-0.294
jar-0.275
 meant-0.246
book-0.245
ooky-0.242
Token glad
Token ID9675
Feature activation+2.24e-01

Pos logit contributions
 our+0.379
 my+0.347
 your+0.346
 exercise+0.307
 arrest+0.305
Neg logit contributions
books-0.312
 meant-0.312
 whispered-0.297
jar-0.276
 spoke-0.275
Token they
Token ID484
Feature activation+2.17e-01

Pos logit contributions
 our+0.314
 my+0.288
 your+0.287
ap+0.262
 exercise+0.261
Neg logit contributions
 meant-0.308
 whispered-0.296
books-0.292
 said-0.285
 spoke-0.276
Token had
Token ID550
Feature activation+2.35e-01

Pos logit contributions
 our+0.375
 your+0.344
 my+0.335
 spend+0.332
 such+0.305
Neg logit contributions
books-0.243
jar-0.229
l-0.228
post-0.209
boards-0.202
Token a
Token ID257
Feature activation+2.36e-01

Pos logit contributions
 our+0.325
 your+0.291
 my+0.288
 victory+0.244
 exercise+0.243
Neg logit contributions
books-0.360
jar-0.347
 meant-0.346
 whispered-0.334
l-0.328
Token park
Token ID3952
Feature activation+1.76e-01

Pos logit contributions
 our+0.331
 my+0.302
 your+0.300
 exercise+0.262
 arrest+0.260
Neg logit contributions
books-0.334
 meant-0.333
 whispered-0.319
jar-0.299
 spoke-0.298
Token.
Token ID13
Feature activation+1.92e-01

Pos logit contributions
 our+0.268
 your+0.241
 my+0.240
 exercise+0.214
ap+0.213
Neg logit contributions
books-0.233
 meant-0.231
 whispered-0.221
jar-0.211
Don-0.210
Token<|endoftext|>
Token ID50256
Feature activation+1.76e-01

Pos logit contributions
 our+0.310
 my+0.283
 your+0.283
ap+0.233
 exercise+0.228
Neg logit contributions
books-0.250
 meant-0.247
 whispered-0.237
 spoke-0.231
l-0.222
Token perfect
Token ID2818
Feature activation-2.54e-02

Pos logit contributions
books+0.010
jar+0.010
Don+0.010
 whispered+0.010
 spoke+0.009
Neg logit contributions
 our-0.009
 your-0.008
 my-0.008
 those-0.006
 these-0.006
Token,"
Token ID553
Feature activation-3.31e-02

Pos logit contributions
books+0.043
Don+0.041
jar+0.040
 meant+0.040
 whispered+0.040
Neg logit contributions
 our-0.032
 my-0.029
 your-0.029
 exercise-0.022
 these-0.021
Token said
Token ID531
Feature activation-5.16e-02

Pos logit contributions
books+0.037
oled+0.032
jar+0.031
ooky+0.030
 meant+0.030
Neg logit contributions
 our-0.060
 my-0.060
 your-0.060
 exercise-0.049
 these-0.048
Token her
Token ID607
Feature activation-2.24e-02

Pos logit contributions
books+0.074
 meant+0.072
 whispered+0.069
jar+0.067
Don+0.065
Neg logit contributions
 our-0.074
 my-0.068
 your-0.068
 exercise-0.058
ap-0.056
Token grandma
Token ID49890
Feature activation-7.96e-02

Pos logit contributions
books+0.042
Don+0.042
jar+0.040
 spoke+0.040
 meant+0.040
Neg logit contributions
 our-0.021
 my-0.019
 your-0.018
 job-0.015
ap-0.013
Token.
Token ID13
Feature activation-2.58e-02

Pos logit contributions
books+0.096
 meant+0.093
jar+0.092
Don+0.091
 whispered+0.085
Neg logit contributions
 our-0.120
 your-0.113
 my-0.113
ap-0.104
 how-0.102
Token "
Token ID366
Feature activation-4.85e-02

Pos logit contributions
 meant+0.039
books+0.038
 whispered+0.036
ooky+0.036
jar+0.036
Neg logit contributions
 our-0.034
 my-0.034
 your-0.033
 exercise-0.024
 spend-0.024
TokenWould
Token ID17353
Feature activation-6.15e-02

Pos logit contributions
 meant+0.060
 whispered+0.058
books+0.057
 spoke+0.054
 faded+0.052
Neg logit contributions
 our-0.074
 my-0.070
 your-0.070
 exercise-0.061
 myself-0.060
Token you
Token ID345
Feature activation-3.22e-02

Pos logit contributions
books+0.125
 meant+0.124
 whispered+0.121
 said+0.119
 spoke+0.118
Neg logit contributions
 our-0.044
 my-0.034
 your-0.034
ap-0.032
ric-0.032
Token like
Token ID588
Feature activation-4.97e-05

Pos logit contributions
 meant+0.033
books+0.032
 whispered+0.031
 said+0.028
book+0.027
Neg logit contributions
 our-0.058
 arrest-0.054
 my-0.053
 your-0.052
ric-0.051
Token to
Token ID284
Feature activation+6.98e-03

Pos logit contributions
 meant+0.000
books+0.000
 whispered+0.000
 said+0.000
 spoke+0.000
Neg logit contributions
 our-0.000
 my-0.000
 your-0.000
 arrest-0.000
ap-0.000
Token and
Token ID290
Feature activation+1.45e-01

Pos logit contributions
 our+0.076
 your+0.067
 my+0.066
 dive+0.063
ap+0.063
Neg logit contributions
 meant-0.090
Don-0.090
books-0.089
oled-0.083
jar-0.081
Token opened
Token ID4721
Feature activation+1.09e-01

Pos logit contributions
 our+0.282
 my+0.263
 your+0.263
 exercise+0.238
 arrest+0.237
Neg logit contributions
books-0.131
 meant-0.129
 whispered-0.118
jar-0.109
 spoke-0.106
Token the
Token ID262
Feature activation+1.19e-01

Pos logit contributions
 our+0.122
 my+0.112
 exercise+0.110
 your+0.109
 arrest+0.107
Neg logit contributions
 meant-0.172
 whispered-0.170
books-0.168
 said-0.162
l-0.158
Token zip
Token ID19974
Feature activation+5.26e-02

Pos logit contributions
 our+0.194
 my+0.179
 your+0.177
 exercise+0.154
 dive+0.153
Neg logit contributions
books-0.149
 meant-0.148
 whispered-0.139
 spoke-0.132
Don-0.131
Token.
Token ID13
Feature activation+9.54e-02

Pos logit contributions
 our+0.084
 your+0.077
 my+0.076
ap+0.071
 dive+0.071
Neg logit contributions
books-0.063
Don-0.062
 meant-0.062
jar-0.059
 whispered-0.057
Token They
Token ID1119
Feature activation+9.15e-02

Pos logit contributions
 our+0.134
 your+0.120
 my+0.119
 dive+0.104
ap+0.101
Neg logit contributions
 meant-0.139
books-0.137
 whispered-0.131
 spoke-0.126
jar-0.124
Token saw
Token ID2497
Feature activation+9.21e-02

Pos logit contributions
 our+0.126
 dive+0.119
 your+0.114
 my+0.111
 spend+0.108
Neg logit contributions
books-0.129
jar-0.122
 meant-0.116
ooky-0.115
book-0.112
Token many
Token ID867
Feature activation+7.85e-02

Pos logit contributions
 our+0.102
 your+0.089
 my+0.088
 exercise+0.072
 victory+0.069
Neg logit contributions
books-0.166
 meant-0.160
 whispered-0.154
jar-0.153
 spoke-0.148
Token animals
Token ID4695
Feature activation+3.99e-02

Pos logit contributions
 our+0.102
 my+0.092
 your+0.092
 exercise+0.079
 dive+0.075
Neg logit contributions
books-0.124
 meant-0.121
 whispered-0.115
jar-0.115
Don-0.113
Token.
Token ID13
Feature activation+7.96e-02

Pos logit contributions
 our+0.054
 your+0.048
 my+0.047
 dive+0.043
 exercise+0.043
Neg logit contributions
books-0.061
 meant-0.058
jar-0.056
 whispered-0.055
Don-0.055
Token There
Token ID1318
Feature activation+7.10e-02

Pos logit contributions
 our+0.104
 my+0.090
 your+0.089
 join+0.078
 these+0.076
Neg logit contributions
 meant-0.121
books-0.119
 whispered-0.118
jar-0.117
 spoke-0.113
Token you
Token ID345
Feature activation+1.57e-01

Pos logit contributions
latable+0.181
igsaw+0.166
ap+0.161
 arrest+0.159
 victory+0.159
Neg logit contributions
 said-0.222
 meant-0.210
 whispered-0.206
Don-0.205
 needed-0.199
Token whistle
Token ID16121
Feature activation+1.14e-01

Pos logit contributions
 our+0.188
ric+0.179
igsaw+0.173
 Wed+0.169
 arrest+0.164
Neg logit contributions
 said-0.249
 whispered-0.247
 meant-0.240
 spoke-0.218
 answered-0.207
Token,
Token ID11
Feature activation+1.79e-01

Pos logit contributions
 our+0.137
 arrest+0.124
 my+0.118
 your+0.117
ric+0.107
Neg logit contributions
 meant-0.179
books-0.173
 whispered-0.171
 said-0.163
 spoke-0.158
Token Lily
Token ID20037
Feature activation+1.10e-01

Pos logit contributions
 our+0.242
 arrest+0.219
 your+0.218
 my+0.214
ap+0.208
Neg logit contributions
 meant-0.247
 whispered-0.235
books-0.230
 said-0.225
jar-0.217
Token?"
Token ID1701
Feature activation+1.33e-01

Pos logit contributions
 our+0.096
 your+0.088
 my+0.087
 exercise+0.063
ap+0.063
Neg logit contributions
books-0.220
 meant-0.206
jar-0.205
Don-0.202
 whispered-0.202
Token Tom
Token ID4186
Feature activation+1.09e-01

Pos logit contributions
 our+0.154
 your+0.150
 my+0.143
 dive+0.100
ap+0.100
Neg logit contributions
books-0.258
jar-0.234
 meant-0.229
ooky-0.227
book-0.224
Token asked
Token ID1965
Feature activation+1.11e-01

Pos logit contributions
 my+0.155
 your+0.146
 our+0.142
 practice+0.129
 dive+0.126
Neg logit contributions
jar-0.163
lessness-0.156
books-0.155
post-0.141
ooky-0.136
Token.
Token ID13
Feature activation+1.24e-01

Pos logit contributions
 our+0.167
 your+0.159
 my+0.155
ap+0.141
 exercise+0.129
Neg logit contributions
books-0.148
jar-0.145
 meant-0.138
Don-0.133
 whispered-0.130
Token
Token ID198
Feature activation+1.03e-01

Pos logit contributions
 my+0.141
 our+0.139
 your+0.131
 how+0.109
 join+0.105
Neg logit contributions
ooky-0.204
books-0.198
hed-0.191
 meant-0.191
 wil-0.190
Token
Token ID198
Feature activation+1.03e-01

Pos logit contributions
 our+0.180
 your+0.169
 my+0.168
ap+0.143
 exercise+0.139
Neg logit contributions
 meant-0.120
books-0.118
 whispered-0.111
 spoke-0.104
jar-0.103
Token"
Token ID1
Feature activation+1.10e-01

Pos logit contributions
 our+0.170
 my+0.156
 your+0.155
 exercise+0.141
 arrest+0.140
Neg logit contributions
books-0.122
 meant-0.121
 whispered-0.116
jar-0.107
 spoke-0.106
Token into
Token ID656
Feature activation+1.80e-01

Pos logit contributions
 our+0.174
 my+0.155
 your+0.154
 exercise+0.130
 arrest+0.130
Neg logit contributions
 meant-0.242
books-0.241
 whispered-0.234
 spoke-0.221
jar-0.220
Token the
Token ID262
Feature activation+1.39e-01

Pos logit contributions
 our+0.227
ap+0.213
ric+0.208
 my+0.202
 your+0.196
Neg logit contributions
 whispered-0.251
books-0.247
 meant-0.247
 gasped-0.243
 said-0.239
Token garden
Token ID11376
Feature activation+8.79e-02

Pos logit contributions
 our+0.217
 your+0.195
 my+0.194
 arrest+0.181
ap+0.175
Neg logit contributions
 meant-0.169
books-0.168
 whispered-0.163
jar-0.155
 said-0.151
Token.
Token ID13
Feature activation+1.04e-01

Pos logit contributions
 our+0.136
 your+0.126
 my+0.125
ap+0.113
 exercise+0.113
Neg logit contributions
 meant-0.112
books-0.112
 whispered-0.102
Don-0.098
jar-0.098
Token She
Token ID1375
Feature activation+9.21e-02

Pos logit contributions
 our+0.190
 my+0.177
 your+0.176
 exercise+0.159
 arrest+0.158
Neg logit contributions
books-0.106
 meant-0.105
 whispered-0.099
jar-0.090
 spoke-0.090
Token used
Token ID973
Feature activation+1.16e-01

Pos logit contributions
 our+0.159
 your+0.149
 my+0.148
 exercise+0.137
 dive+0.136
Neg logit contributions
books-0.104
 meant-0.095
jar-0.091
book-0.084
post-0.084
Token the
Token ID262
Feature activation+1.17e-01

Pos logit contributions
 our+0.156
 my+0.139
 your+0.138
ap+0.126
 arrest+0.122
Neg logit contributions
 meant-0.173
books-0.170
 whispered-0.165
 spoke-0.154
 said-0.154
Token magnet
Token ID19972
Feature activation+5.01e-02

Pos logit contributions
 our+0.152
 my+0.137
 your+0.136
 exercise+0.118
 victory+0.115
Neg logit contributions
books-0.182
 meant-0.179
 whispered-0.173
jar-0.164
 spoke-0.163
Token to
Token ID284
Feature activation+1.35e-01

Pos logit contributions
 our+0.047
 my+0.041
 your+0.041
 exercise+0.033
 arrest+0.032
Neg logit contributions
books-0.095
 meant-0.094
 whispered-0.091
jar-0.088
 spoke-0.086
Token pick
Token ID2298
Feature activation+8.39e-02

Pos logit contributions
 our+0.161
 my+0.144
 your+0.138
ap+0.122
 Wed+0.120
Neg logit contributions
books-0.212
 whispered-0.211
 meant-0.209
 said-0.199
 spoke-0.193
Token up
Token ID510
Feature activation+1.18e-01

Pos logit contributions
 our+0.083
 my+0.073
 your+0.072
 exercise+0.057
 arrest+0.055
Neg logit contributions
books-0.156
 meant-0.154
 whispered-0.150
jar-0.145
 spoke-0.143
Token of
Token ID286
Feature activation+1.18e-01

Pos logit contributions
 our+0.105
 my+0.096
 your+0.094
 arrest+0.073
ap+0.072
Neg logit contributions
books-0.138
 whispered-0.128
 meant-0.127
 spoke-0.122
Don-0.121
Token her
Token ID607
Feature activation+5.96e-02

Pos logit contributions
 our+0.141
 my+0.125
 your+0.124
 exercise+0.109
ap+0.109
Neg logit contributions
 meant-0.191
books-0.189
 whispered-0.183
jar-0.174
 spoke-0.172
Token finger
Token ID7660
Feature activation+1.89e-02

Pos logit contributions
 our+0.078
 my+0.070
 your+0.070
 exercise+0.064
 victory+0.061
Neg logit contributions
 meant-0.088
 whispered-0.085
books-0.084
 said-0.080
jar-0.077
Token.
Token ID13
Feature activation+5.18e-02

Pos logit contributions
 our+0.027
 your+0.026
 my+0.025
ap+0.022
 arrest+0.021
Neg logit contributions
books-0.028
Don-0.025
jar-0.024
 meant-0.024
Wow-0.024
Token
Token ID198
Feature activation+3.53e-02

Pos logit contributions
 your+0.071
 my+0.067
 our+0.065
 math+0.048
 Wed+0.047
Neg logit contributions
books-0.078
 meant-0.077
 whispered-0.075
 spoke-0.075
ited-0.073
Token
Token ID198
Feature activation+3.68e-02

Pos logit contributions
 our+0.058
 your+0.055
 my+0.053
 exercise+0.045
 arrest+0.044
Neg logit contributions
 meant-0.045
books-0.045
 whispered-0.041
 spoke-0.040
 faded-0.039
TokenL
Token ID43
Feature activation+2.99e-02

Pos logit contributions
 our+0.072
 my+0.067
 your+0.066
 exercise+0.062
 victory+0.062
Neg logit contributions
books-0.031
 meant-0.029
 whispered-0.029
Don-0.027
jar-0.026
Tokenily
Token ID813
Feature activation+1.92e-02

Pos logit contributions
 our+0.065
 my+0.061
 your+0.060
 exercise+0.056
 arrest+0.056
Neg logit contributions
 meant-0.020
books-0.020
 whispered-0.018
jar-0.016
 spoke-0.015
Token's
Token ID338
Feature activation+7.60e-02

Pos logit contributions
 our+0.033
 my+0.031
 your+0.031
 exercise+0.028
 arrest+0.027
Neg logit contributions
 meant-0.021
books-0.021
 whispered-0.020
 spoke-0.018
jar-0.018
Token mom
Token ID1995
Feature activation+9.10e-03

Pos logit contributions
 our+0.115
 my+0.105
 your+0.103
latable+0.092
 Wed+0.091
Neg logit contributions
 meant-0.096
 whispered-0.095
books-0.092
 said-0.089
 faded-0.088
Token hears
Token ID23180
Feature activation+5.31e-02

Pos logit contributions
 our+0.016
 my+0.015
 your+0.015
 exercise+0.013
 arrest+0.013
Neg logit contributions
books-0.010
 meant-0.010
 whispered-0.009
jar-0.009
 spoke-0.009
Token as
Token ID355
Feature activation-7.83e-02

Pos logit contributions
Don+0.266
 spoke+0.256
jar+0.255
books+0.255
 widened+0.254
Neg logit contributions
 our-0.185
 my-0.172
 your-0.170
 how-0.124
 exercise-0.123
Token much
Token ID881
Feature activation-9.95e-02

Pos logit contributions
books+0.094
 meant+0.091
 whispered+0.088
 said+0.084
jar+0.080
Neg logit contributions
 our-0.111
ric-0.106
latable-0.104
 arrest-0.103
 my-0.103
Token as
Token ID355
Feature activation-8.20e-02

Pos logit contributions
 meant+0.151
books+0.150
 whispered+0.145
 said+0.136
jar+0.134
Neg logit contributions
 our-0.129
 my-0.115
 your-0.114
ap-0.101
 arrest-0.100
Token you
Token ID345
Feature activation-1.19e-01

Pos logit contributions
 meant+0.143
books+0.142
 whispered+0.140
 said+0.134
jar+0.131
Neg logit contributions
 our-0.075
 my-0.065
 your-0.063
ric-0.063
 arrest-0.062
Token want
Token ID765
Feature activation-1.14e-01

Pos logit contributions
jar+0.207
books+0.206
Don+0.205
 meant+0.196
post+0.195
Neg logit contributions
 our-0.124
 your-0.109
 my-0.109
 join-0.108
 practice-0.107
Token.
Token ID13
Feature activation-1.14e-01

Pos logit contributions
 meant+0.164
books+0.164
 whispered+0.156
jar+0.155
Don+0.154
Neg logit contributions
 our-0.172
 your-0.158
 my-0.158
 exercise-0.129
 victory-0.123
Token And
Token ID843
Feature activation-1.14e-01

Pos logit contributions
 meant+0.126
books+0.121
 whispered+0.116
Don+0.113
jar+0.107
Neg logit contributions
 our-0.187
 your-0.170
 my-0.169
 arrest-0.168
igs-0.164
Token the
Token ID262
Feature activation-9.84e-02

Pos logit contributions
 meant+0.184
 said+0.178
 whispered+0.178
books+0.166
Don+0.159
Neg logit contributions
 arrest-0.121
 our-0.116
ric-0.113
latable-0.112
igs-0.104
Token puppy
Token ID26188
Feature activation-1.89e-01

Pos logit contributions
books+0.158
 meant+0.154
 whispered+0.150
 spoke+0.143
 widened+0.141
Neg logit contributions
 our-0.126
 my-0.115
 your-0.113
 exercise-0.103
 victory-0.098
Token is
Token ID318
Feature activation-1.06e-01

Pos logit contributions
books+0.307
jar+0.288
 meant+0.273
Don+0.267
book+0.266
Neg logit contributions
 our-0.276
 my-0.257
 your-0.255
 exercise-0.204
 arrest-0.198
Token yours
Token ID12431
Feature activation-1.71e-01

Pos logit contributions
 meant+0.200
 whispered+0.193
books+0.192
 said+0.182
 spoke+0.178
Neg logit contributions
 our-0.089
 your-0.075
 my-0.074
 arrest-0.073
ap-0.071
Token Bunny
Token ID28683
Feature activation-1.94e-01

Pos logit contributions
 meant+0.216
books+0.203
 whispered+0.199
 said+0.185
 spoke+0.183
Neg logit contributions
 our-0.263
 your-0.239
 my-0.235
 exercise-0.225
 arrest-0.225
Token.
Token ID13
Feature activation-1.34e-01

Pos logit contributions
books+0.241
 meant+0.238
 whispered+0.227
jar+0.212
 spoke+0.211
Neg logit contributions
 our-0.312
 my-0.286
 your-0.285
 exercise-0.253
 arrest-0.250
Token She
Token ID1375
Feature activation-1.45e-01

Pos logit contributions
books+0.172
 meant+0.168
 whispered+0.161
 spoke+0.150
jar+0.149
Neg logit contributions
 our-0.215
 my-0.198
 your-0.197
 arrest-0.172
 exercise-0.170
Token is
Token ID318
Feature activation-1.14e-01

Pos logit contributions
books+0.164
 meant+0.160
 whispered+0.151
jar+0.142
 spoke+0.139
Neg logit contributions
 our-0.248
 my-0.229
 your-0.228
 exercise-0.204
 arrest-0.204
Token very
Token ID845
Feature activation-9.50e-02

Pos logit contributions
 meant+0.155
 whispered+0.145
books+0.136
 said+0.136
 written+0.130
Neg logit contributions
 our-0.151
ap-0.145
 spend-0.138
ric-0.137
 dive-0.136
Token small
Token ID1402
Feature activation-1.42e-01

Pos logit contributions
books+0.112
 meant+0.111
 whispered+0.105
jar+0.098
 spoke+0.097
Neg logit contributions
 our-0.159
 my-0.147
 your-0.146
 exercise-0.130
 arrest-0.129
Token and
Token ID290
Feature activation-1.02e-01

Pos logit contributions
books+0.165
 meant+0.159
 whispered+0.152
jar+0.142
 spoke+0.142
Neg logit contributions
 our-0.242
 my-0.225
 your-0.224
 exercise-0.199
 arrest-0.198
Token soft
Token ID2705
Feature activation-1.54e-01

Pos logit contributions
 meant+0.114
books+0.113
 whispered+0.108
 spoke+0.099
jar+0.099
Neg logit contributions
 our-0.173
 my-0.159
 your-0.159
 exercise-0.143
 arrest-0.143
Token.
Token ID13
Feature activation-1.38e-01

Pos logit contributions
books+0.169
 meant+0.161
 whispered+0.153
jar+0.144
 spoke+0.143
Neg logit contributions
 our-0.273
 my-0.255
 your-0.254
 exercise-0.226
 arrest-0.222
Token She
Token ID1375
Feature activation-1.50e-01

Pos logit contributions
 meant+0.122
books+0.120
 whispered+0.113
jar+0.101
 spoke+0.100
Neg logit contributions
 our-0.266
 my-0.247
 your-0.247
 exercise-0.228
 arrest-0.225
Token does
Token ID857
Feature activation-1.28e-01

Pos logit contributions
books+0.186
 meant+0.172
 whispered+0.163
jar+0.161
Don+0.154
Neg logit contributions
 our-0.247
 my-0.229
 your-0.228
 exercise-0.202
 arrest-0.202
Token and
Token ID290
Feature activation-9.81e-03

Pos logit contributions
books+0.103
 meant+0.098
 whispered+0.095
Don+0.093
 spoke+0.093
Neg logit contributions
 our-0.094
 my-0.087
 your-0.087
ap-0.070
 arrest-0.070
Token relax
Token ID8960
Feature activation-9.19e-02

Pos logit contributions
 said+0.016
 whispered+0.016
 meant+0.015
 wished+0.015
 needed+0.015
Neg logit contributions
ric-0.011
latable-0.011
igsaw-0.011
 our-0.011
ap-0.010
Token."
Token ID526
Feature activation-7.23e-02

Pos logit contributions
books+0.134
 meant+0.131
 whispered+0.126
 spoke+0.120
jar+0.119
Neg logit contributions
 our-0.132
 my-0.120
 your-0.120
 exercise-0.102
ap-0.099
Token
Token ID198
Feature activation-7.16e-02

Pos logit contributions
books+0.115
 meant+0.114
 whispered+0.107
jar+0.104
 spoke+0.101
Neg logit contributions
 our-0.093
 your-0.085
 my-0.085
 exercise-0.071
ap-0.070
Token
Token ID198
Feature activation-6.95e-02

Pos logit contributions
 meant+0.091
books+0.086
 whispered+0.082
 faded+0.080
 spoke+0.078
Neg logit contributions
 our-0.116
 your-0.108
 my-0.107
 exercise-0.091
ap-0.090
TokenSo
Token ID2396
Feature activation-7.17e-02

Pos logit contributions
 meant+0.077
books+0.076
 whispered+0.072
jar+0.066
 spoke+0.066
Neg logit contributions
 our-0.122
 my-0.113
 your-0.113
 exercise-0.101
 arrest-0.100
Token,
Token ID11
Feature activation-2.37e-02

Pos logit contributions
 meant+0.088
 said+0.086
 whispered+0.084
 needed+0.079
 spoke+0.078
Neg logit contributions
ric-0.106
 our-0.103
latable-0.098
 arrest-0.097
 exercise-0.092
Token Benny
Token ID44275
Feature activation-9.67e-02

Pos logit contributions
 meant+0.034
 said+0.033
 whispered+0.033
Don+0.032
 spoke+0.031
Neg logit contributions
 our-0.030
ric-0.028
latable-0.026
 arrest-0.026
 my-0.026
Token took
Token ID1718
Feature activation-6.05e-02

Pos logit contributions
books+0.126
jar+0.112
 meant+0.104
book+0.103
post+0.101
Neg logit contributions
 our-0.154
 my-0.146
 your-0.146
 exercise-0.136
 dive-0.132
Token a
Token ID257
Feature activation-3.51e-02

Pos logit contributions
 meant+0.078
 whispered+0.071
 said+0.070
 spoke+0.063
 cried+0.063
Neg logit contributions
ric-0.096
ap-0.082
 our-0.079
 yeah-0.079
latable-0.079
Token deep
Token ID2769
Feature activation-5.15e-02

Pos logit contributions
books+0.044
 meant+0.044
 whispered+0.043
jar+0.040
 spoke+0.040
Neg logit contributions
 our-0.055
 my-0.050
 your-0.050
 arrest-0.045
ap-0.045
Token with
Token ID351
Feature activation-1.03e-01

Pos logit contributions
 meant+0.271
books+0.263
 whispered+0.259
 spoke+0.248
jar+0.246
Neg logit contributions
 our-0.179
 your-0.164
 my-0.161
 exercise-0.134
 victory-0.132
Token his
Token ID465
Feature activation-1.39e-01

Pos logit contributions
 meant+0.127
 whispered+0.118
 said+0.115
books+0.113
l+0.111
Neg logit contributions
ap-0.148
 our-0.141
 dive-0.134
ric-0.134
 spend-0.130
Token zoom
Token ID19792
Feature activation-1.90e-01

Pos logit contributions
books+0.231
 meant+0.227
 whispered+0.215
Don+0.210
book+0.206
Neg logit contributions
 our-0.179
 my-0.158
 your-0.158
 victory-0.137
 exercise-0.133
Token cars
Token ID5006
Feature activation-2.03e-01

Pos logit contributions
 meant+0.276
books+0.276
 whispered+0.266
jar+0.256
Don+0.252
Neg logit contributions
 our-0.266
 your-0.245
 my-0.243
ap-0.209
 exercise-0.209
Token quietly
Token ID12703
Feature activation-1.79e-01

Pos logit contributions
books+0.336
 meant+0.335
Don+0.324
jar+0.320
 whispered+0.317
Neg logit contributions
 our-0.246
 your-0.231
 my-0.219
 victory-0.196
ap-0.190
Token and
Token ID290
Feature activation-1.07e-01

Pos logit contributions
books+0.251
 meant+0.245
Don+0.237
 whispered+0.235
jar+0.230
Neg logit contributions
 our-0.264
 your-0.241
 my-0.238
ap-0.211
 exercise-0.205
Token waits
Token ID28364
Feature activation-1.78e-01

Pos logit contributions
 whispered+0.150
 meant+0.150
 said+0.139
books+0.138
 spoke+0.136
Neg logit contributions
 our-0.143
 my-0.130
 your-0.127
latable-0.126
 arrest-0.121
Token for
Token ID329
Feature activation-1.17e-01

Pos logit contributions
books+0.240
 meant+0.233
 whispered+0.227
jar+0.214
 said+0.209
Neg logit contributions
 our-0.257
 my-0.233
 your-0.229
 arrest-0.214
 exercise-0.208
Token his
Token ID465
Feature activation-1.49e-01

Pos logit contributions
books+0.183
 meant+0.181
 whispered+0.176
 said+0.166
jar+0.164
Neg logit contributions
 our-0.136
 my-0.119
 your-0.118
ap-0.118
ric-0.110
Token mom
Token ID1995
Feature activation-1.96e-01

Pos logit contributions
books+0.263
 meant+0.257
 whispered+0.244
Don+0.240
book+0.237
Neg logit contributions
 our-0.176
 my-0.154
 your-0.153
 victory-0.137
 exercise-0.134
Token.
Token ID13
Feature activation-1.47e-01

Pos logit contributions
books+0.397
 widened+0.384
jar+0.382
 meant+0.373
Don+0.371
Neg logit contributions
 our-0.179
 your-0.158
 my-0.150
 how-0.107
 exercise-0.105
Token started
Token ID2067
Feature activation-9.87e-02

Pos logit contributions
books+0.163
jar+0.147
 meant+0.144
book+0.136
post+0.131
Neg logit contributions
 our-0.273
 my-0.257
 your-0.256
 exercise-0.233
 dive-0.228
Token to
Token ID284
Feature activation-6.42e-02

Pos logit contributions
 meant+0.135
 whispered+0.130
 said+0.126
 spoke+0.126
Don+0.123
Neg logit contributions
 our-0.140
 my-0.124
 your-0.120
ap-0.119
 Majesty-0.117
Token panic
Token ID13619
Feature activation-1.43e-01

Pos logit contributions
 meant+0.081
 whispered+0.079
books+0.074
 said+0.073
 spoke+0.071
Neg logit contributions
 our-0.102
 my-0.093
 your-0.092
ap-0.083
 arrest-0.083
Token because
Token ID780
Feature activation-1.04e-01

Pos logit contributions
books+0.185
 meant+0.181
 whispered+0.174
 spoke+0.163
jar+0.163
Neg logit contributions
 our-0.225
 my-0.208
 your-0.207
 exercise-0.185
ap-0.178
Token she
Token ID673
Feature activation-1.32e-01

Pos logit contributions
 meant+0.096
books+0.095
 whispered+0.089
jar+0.080
 spoke+0.080
Neg logit contributions
 our-0.198
 my-0.184
 your-0.183
 exercise-0.168
ap-0.167
Token didn
Token ID1422
Feature activation-1.31e-01

Pos logit contributions
books+0.117
 meant+0.113
 whispered+0.105
jar+0.098
 spoke+0.093
Neg logit contributions
 our-0.259
 my-0.243
 your-0.241
 exercise-0.220
 arrest-0.218
Token't
Token ID470
Feature activation-1.06e-01

Pos logit contributions
 meant+0.185
books+0.176
l+0.164
 whispered+0.162
Don+0.157
Neg logit contributions
 our-0.183
 your-0.163
 my-0.163
ap-0.153
 Majesty-0.152
Token want
Token ID765
Feature activation-1.27e-01

Pos logit contributions
 meant+0.154
books+0.150
 whispered+0.144
 spoke+0.133
 said+0.133
Neg logit contributions
 our-0.150
 my-0.135
 your-0.133
 arrest-0.121
ap-0.120
Token to
Token ID284
Feature activation-8.82e-02

Pos logit contributions
 meant+0.210
books+0.201
 whispered+0.198
 said+0.187
 spoke+0.187
Neg logit contributions
 our-0.145
 my-0.124
 your-0.122
ap-0.122
ric-0.113
Token lose
Token ID4425
Feature activation-1.31e-01

Pos logit contributions
 meant+0.125
 whispered+0.119
 said+0.119
books+0.115
 wondered+0.108
Neg logit contributions
 our-0.106
igsaw-0.101
ric-0.095
igs-0.094
 my-0.094
Token her
Token ID607
Feature activation-1.28e-01

Pos logit contributions
 meant+0.183
 whispered+0.175
books+0.171
 said+0.162
Don+0.159
Neg logit contributions
 our-0.169
ap-0.157
latable-0.155
 dive-0.153
ric-0.151