TOP ACTIVATIONS
MAX = 4.099

forever known as the most depend able cook in the swamp
proud . The End . <|endoftext|> Once upon
time . The boy enjoyed the pastry and
. When the muff ins were ready , Tim
ries . The baker handed the boy a
not use . The baker was very upset .
wanted to see the magic muff in . The
my smiled and cut the muff in into smaller pieces .
. The foolish muff in was not happy .
: Rot is when something spo ils and gets bad .
not tried to get the muff in . They wished they
." <|endoftext|> Jimmy was a depend able young boy . Whenever
! The foolish muff in was very pleased .
and happy . Down on the ground , there
gifts . The j olly man gives them each
L ily looked at the muff in . It was still
boy was so relieved ! The baker was so patient even
flew Sally up to the muff in . Sally took a
, something special happened . The baker set out to bake
wait to eat the tasty muff ins . <|endoftext|> Once upon

MIN ACTIVATIONS
MIN = -3.854

different neck l aces , brace lets , and rings .
big , white pup with floppy ears and a curly tail
a little rabbit named Bun . Bun had big ears that
ch ubby bunny named Bun . Bun loved to eat carrots
. Fl oppy had big floppy ears and a little pink
her flashlight and the butterfly 's wings were so colorful !
M illy and her new friend played together until M
play with lasers . They had red lasers and blue lasers
a silly rabbit named Bun . Bun liked to hop and
small , brown puppy with floppy ears and a w agging
was a bunny named Bun . Bun loved to eat cel
a little girl named Beth . Beth was nice , but
, friendly brown dog with floppy ears . The dog started
was a girl named Beth . Beth wanted to go to
swim suits on . Lily 's mom said , " Be
One day , Sandy and her owners went for a walk
she could have it . Her friend said yes , and
a little girl called Beth . She was very intelligent ,
." So , Kitty and Dog gy curled up next to
around the house . He had so much fun playing with

INTERVAL 2.110 - 4.099
CONTAINS 0.678%

big and he ran inside to get some .
drawing pictures . One day , she met a
by and knocked over the old man 's plate . All
my tube . Go away !" L ily and
Come with me and I â € ™ ll show you

INTERVAL 0.122 - 2.110
CONTAINS 42.578%

saw was a very troubled squirrel . The squirrel was hopping
a time , there was a little boy named Tim my
see their sp ade and bucket . They are frozen .
got angry and tried to grab the pipe . Ben held
. Tim my loved to play outside in the park .

INTERVAL -1.866 - 0.122
CONTAINS 55.788%

of the spicy plant . They had to give up on
Whe e , it is like a flag !"
upon a time , there was a little fish named Fin
they knew . They felt safe and happy . They saw
. The sky was clear and the sun was shining .

INTERVAL -3.854 - -1.866
CONTAINS 0.956%

was out of sight . His mom held his hand and
. It was a fake pig ! It was moving really
. He gave Sam some envelop es and magazines .
. One day , Lily 's mom my noticed that the
In this park , there were two friends , a rock
Token forever
Token ID8097
Feature activation+1.21e+00

Pos logit contributions
ore+3.113
aster+3.059
oker+3.004
eling+2.986
 restaurant+2.771
Neg logit contributions
 prope-1.694
 pads-1.617
aters-1.575
 Dog-1.574
 Tess-1.546
Token known
Token ID1900
Feature activation-3.68e-01

Pos logit contributions
aster+2.350
ore+2.307
eling+2.266
oker+2.228
roads+2.102
Neg logit contributions
 pads-1.703
 prope-1.699
aters-1.663
lets-1.660
 tail-1.607
Token as
Token ID355
Feature activation+1.45e+00

Pos logit contributions
 prope+0.537
aters+0.528
 pads+0.514
lets+0.504
phones+0.496
Neg logit contributions
aster-0.716
ore-0.687
eling-0.680
oker-0.657
 baker-0.632
Token the
Token ID262
Feature activation+4.71e-01

Pos logit contributions
aster+2.416
ore+2.360
eling+2.286
oker+2.180
roads+2.065
Neg logit contributions
 prope-2.497
 Tess-2.435
 Dog-2.417
 pads-2.372
aters-2.358
Token most
Token ID749
Feature activation+1.00e+00

Pos logit contributions
aster+0.877
ore+0.848
eling+0.834
oker+0.801
 restaurant+0.749
Neg logit contributions
 prope-0.733
aters-0.712
 pads-0.699
lets-0.688
phones-0.675
Token depend
Token ID4745
Feature activation+4.10e+00

Pos logit contributions
aster+1.916
ore+1.818
eling+1.806
oker+1.709
roads+1.627
Neg logit contributions
 prope-1.540
aters-1.475
 pads-1.460
lets-1.447
 Tess-1.384
Tokenable
Token ID540
Feature activation+5.05e-01

Pos logit contributions
eling+5.241
aster+5.030
ulk+4.341
 salsa+4.237
 oven+4.190
Neg logit contributions
 pads-9.100
lets-9.094
un-9.057
aters-8.757
 Tess-8.665
Token cook
Token ID4255
Feature activation+1.65e+00

Pos logit contributions
aster+0.966
eling+0.925
ore+0.923
oker+0.878
 baker+0.826
Neg logit contributions
 prope-0.756
aters-0.738
 pads-0.723
lets-0.717
 Tess-0.697
Token in
Token ID287
Feature activation+1.50e+00

Pos logit contributions
aster+3.009
eling+2.951
ore+2.869
oker+2.700
atches+2.530
Neg logit contributions
 pads-2.558
aters-2.503
lets-2.502
 prope-2.484
phones-2.408
Token the
Token ID262
Feature activation+1.68e+00

Pos logit contributions
aster+2.302
ore+2.296
eling+2.241
oker+2.203
 baker+2.051
Neg logit contributions
 prope-2.686
aters-2.599
 Tess-2.534
 pads-2.518
 Dog-2.507
Token swamp
Token ID26837
Feature activation+1.56e+00

Pos logit contributions
aster+3.193
eling+3.028
ore+2.954
oker+2.921
ying+2.691
Neg logit contributions
 prope-2.483
aters-2.472
 pads-2.447
 Dog-2.418
 tail-2.396
Token proud
Token ID6613
Feature activation+2.03e-02

Pos logit contributions
aster+2.721
ore+2.603
eling+2.565
oker+2.440
roads+2.374
Neg logit contributions
 prope-2.474
lets-2.289
aters-2.278
 pads-2.272
 Tess-2.245
Token.
Token ID13
Feature activation+1.11e+00

Pos logit contributions
aster+0.035
ore+0.033
eling+0.033
oker+0.032
 baker+0.030
Neg logit contributions
 prope-0.035
aters-0.034
 pads-0.033
lets-0.033
phones-0.032
Token
Token ID220
Feature activation+7.65e-01

Pos logit contributions
aster+2.155
eling+2.115
ore+2.043
oker+1.981
 restaurant+1.907
Neg logit contributions
aters-1.603
 prope-1.599
 pads-1.575
 Tess-1.529
lets-1.529
Token
Token ID198
Feature activation+1.08e+00

Pos logit contributions
aster+1.481
eling+1.436
ore+1.404
oker+1.367
 restaurant+1.310
Neg logit contributions
aters-1.094
 prope-1.094
 pads-1.070
lets-1.047
 Tess-1.033
Token
Token ID198
Feature activation+1.15e+00

Pos logit contributions
aster+2.149
eling+2.109
ore+2.049
oker+2.015
 restaurant+1.924
Neg logit contributions
 prope-1.474
aters-1.435
 pads-1.416
Snow-1.398
Brown-1.393
TokenThe
Token ID464
Feature activation+4.09e+00

Pos logit contributions
aster+2.420
ore+2.404
eling+2.372
oker+2.279
 restaurant+2.207
Neg logit contributions
 prope-1.415
aters-1.360
Snow-1.346
Brown-1.346
 pads-1.346
Token End
Token ID5268
Feature activation+9.45e-01

Pos logit contributions
ore+8.952
oker+7.931
aster+7.661
eling+7.545
atches+7.458
Neg logit contributions
 no-6.840
 tail-6.410
 aunt-6.344
 prope-6.147
 owners-6.144
Token.
Token ID13
Feature activation+4.80e-02

Pos logit contributions
aster+1.784
eling+1.749
ore+1.683
oker+1.643
 baker+1.590
Neg logit contributions
 prope-1.381
aters-1.373
 pads-1.339
lets-1.320
phones-1.292
Token<|endoftext|>
Token ID50256
Feature activation+1.25e+00

Pos logit contributions
aster+0.087
ore+0.084
eling+0.082
oker+0.080
 baker+0.076
Neg logit contributions
 prope-0.077
aters-0.074
 pads-0.073
lets-0.072
phones-0.071
TokenOnce
Token ID7454
Feature activation+9.94e-01

Pos logit contributions
aster+1.564
eling+1.516
oker+1.365
ore+1.332
 baker+1.261
Neg logit contributions
aters-2.714
 pads-2.662
 prope-2.656
 Dog-2.594
 Tess-2.590
Token upon
Token ID2402
Feature activation-6.94e-01

Pos logit contributions
aster+1.878
eling+1.800
ore+1.749
oker+1.706
 baker+1.655
Neg logit contributions
 prope-1.475
aters-1.445
 pads-1.423
lets-1.401
 Tess-1.396
Token time
Token ID640
Feature activation-2.97e-02

Pos logit contributions
aster+0.414
ore+0.397
eling+0.392
oker+0.374
 baker+0.352
Neg logit contributions
 prope-0.496
aters-0.484
 pads-0.477
lets-0.471
phones-0.463
Token.
Token ID13
Feature activation+9.97e-01

Pos logit contributions
 prope+0.049
aters+0.048
 pads+0.047
lets+0.046
phones+0.046
Neg logit contributions
aster-0.052
ore-0.050
eling-0.049
oker-0.047
 restaurant-0.045
Token
Token ID220
Feature activation+8.83e-01

Pos logit contributions
aster+1.958
eling+1.932
ore+1.859
oker+1.825
 restaurant+1.733
Neg logit contributions
 prope-1.414
aters-1.408
 pads-1.378
 Tess-1.346
lets-1.346
Token
Token ID198
Feature activation+7.53e-01

Pos logit contributions
aster+1.734
eling+1.684
ore+1.634
oker+1.605
 restaurant+1.535
Neg logit contributions
aters-1.243
 prope-1.236
 pads-1.219
lets-1.186
 Tess-1.170
Token
Token ID198
Feature activation+6.45e-01

Pos logit contributions
aster+1.456
eling+1.429
ore+1.391
oker+1.363
 restaurant+1.299
Neg logit contributions
 prope-1.079
aters-1.049
 pads-1.033
Snow-1.005
lets-1.005
TokenThe
Token ID464
Feature activation+4.06e+00

Pos logit contributions
aster+1.316
ore+1.290
eling+1.287
oker+1.235
 restaurant+1.189
Neg logit contributions
 prope-0.863
aters-0.829
 pads-0.817
lets-0.800
Snow-0.798
Token boy
Token ID2933
Feature activation+6.29e-01

Pos logit contributions
ore+9.613
aster+8.691
oker+8.682
eling+8.596
ulk+8.276
Neg logit contributions
 no-5.969
 tail-5.786
 aunt-5.752
 neck-5.527
 scarf-5.448
Token enjoyed
Token ID8359
Feature activation+1.37e+00

Pos logit contributions
aster+1.196
ore+1.160
eling+1.139
oker+1.116
 restaurant+1.046
Neg logit contributions
 prope-0.938
aters-0.902
 pads-0.897
lets-0.894
phones-0.860
Token the
Token ID262
Feature activation+3.03e+00

Pos logit contributions
ore+2.209
aster+2.176
oker+2.140
eling+2.115
 restaurant+1.875
Neg logit contributions
 prope-2.453
 Dog-2.292
 Tess-2.287
aters-2.285
 pads-2.270
Token pastry
Token ID41738
Feature activation+8.94e-01

Pos logit contributions
ore+5.861
aster+5.771
eling+5.745
oker+5.546
ulk+5.052
Neg logit contributions
 tail-4.559
 pads-4.472
 scarf-4.440
aters-4.411
lets-4.394
Token and
Token ID290
Feature activation+6.73e-01

Pos logit contributions
aster+1.860
eling+1.810
ore+1.799
oker+1.737
 restaurant+1.630
Neg logit contributions
 prope-1.150
aters-1.129
 pads-1.124
lets-1.103
phones-1.066
Token.
Token ID13
Feature activation+1.90e-01

Pos logit contributions
aster+2.973
eling+2.949
ore+2.946
oker+2.909
roads+2.668
Neg logit contributions
aters-2.194
 prope-2.175
 pads-2.149
lets-2.132
Snow-2.047
Token
Token ID198
Feature activation+2.21e-01

Pos logit contributions
aster+0.352
ore+0.337
eling+0.336
oker+0.323
 baker+0.307
Neg logit contributions
 prope-0.296
aters-0.288
 pads-0.283
lets-0.279
phones-0.274
Token
Token ID198
Feature activation-9.62e-01

Pos logit contributions
aster+0.406
ore+0.389
eling+0.388
oker+0.373
 baker+0.355
Neg logit contributions
 prope-0.341
aters-0.332
 pads-0.326
lets-0.321
phones-0.316
TokenWhen
Token ID2215
Feature activation+5.56e-01

Pos logit contributions
 prope+1.554
aters+1.501
 pads+1.448
lets+1.437
phones+1.415
Neg logit contributions
aster-1.857
ore-1.715
eling-1.684
oker-1.637
 restaurant-1.583
Token the
Token ID262
Feature activation+2.71e+00

Pos logit contributions
aster+1.000
ore+0.976
eling+0.972
oker+0.936
 baker+0.883
Neg logit contributions
 prope-0.876
aters-0.848
 pads-0.833
lets-0.824
 Tess-0.823
Token muff
Token ID27563
Feature activation+4.05e+00

Pos logit contributions
aster+4.288
eling+4.204
ore+4.179
oker+4.138
roads+3.866
Neg logit contributions
 prope-4.904
 aunt-4.843
aters-4.730
Snow-4.719
 no-4.707
Tokenins
Token ID1040
Feature activation+2.29e+00

Pos logit contributions
aster+5.191
roads+4.317
ore+4.288
oker+4.283
eling+4.259
Neg logit contributions
lets-9.153
aters-8.853
un-8.513
 forever-8.225
 pads-8.215
Token were
Token ID547
Feature activation+2.06e+00

Pos logit contributions
aster+5.189
eling+5.050
oker+4.986
ore+4.898
roads+4.662
Neg logit contributions
lets-2.596
 prope-2.495
aters-2.487
 pads-2.431
 Dog-2.351
Token ready
Token ID3492
Feature activation+5.59e-01

Pos logit contributions
ore+3.355
aster+3.322
oker+3.266
eling+3.148
roads+2.989
Neg logit contributions
 prope-3.577
aters-3.467
 pads-3.406
 no-3.386
 forever-3.334
Token,
Token ID11
Feature activation+2.08e-01

Pos logit contributions
aster+1.201
eling+1.161
ore+1.156
oker+1.119
 baker+1.070
Neg logit contributions
 prope-0.688
aters-0.666
 pads-0.655
lets-0.644
 Tess-0.635
Token Tim
Token ID5045
Feature activation+4.25e-01

Pos logit contributions
aster+0.400
ore+0.385
eling+0.382
oker+0.368
 baker+0.351
Neg logit contributions
 prope-0.308
aters-0.298
 pads-0.293
lets-0.289
phones-0.284
Tokenries
Token ID1678
Feature activation+1.15e+00

Pos logit contributions
eling+2.860
oker+2.833
aster+2.821
ore+2.784
 salsa+2.306
Neg logit contributions
aters-5.020
lets-4.823
 pads-4.663
phones-4.613
 prope-4.549
Token.
Token ID13
Feature activation+2.45e+00

Pos logit contributions
aster+2.225
eling+2.166
ore+2.123
oker+2.065
 baker+1.925
Neg logit contributions
 pads-1.641
aters-1.629
 prope-1.611
lets-1.604
phones-1.549
Token
Token ID220
Feature activation+8.09e-01

Pos logit contributions
eling+5.640
aster+5.244
oker+5.172
ore+4.995
ying+4.844
Neg logit contributions
 Dog-3.054
 Tess-3.006
aters-2.931
 Brown-2.926
lets-2.880
Token
Token ID198
Feature activation+1.21e+00

Pos logit contributions
aster+1.576
eling+1.529
ore+1.487
oker+1.459
 restaurant+1.389
Neg logit contributions
aters-1.150
 prope-1.141
 pads-1.125
lets-1.101
 Tess-1.079
Token
Token ID198
Feature activation+1.04e+00

Pos logit contributions
eling+2.420
aster+2.386
ore+2.315
oker+2.282
 restaurant+2.177
Neg logit contributions
 prope-1.608
aters-1.565
Brown-1.549
Snow-1.541
 pads-1.534
TokenThe
Token ID464
Feature activation+4.04e+00

Pos logit contributions
aster+2.153
eling+2.149
ore+2.147
oker+2.057
 restaurant+1.971
Neg logit contributions
 prope-1.315
aters-1.259
Brown-1.250
Snow-1.242
 pads-1.234
Token baker
Token ID46412
Feature activation+9.27e-01

Pos logit contributions
ore+8.482
aster+7.774
oker+7.722
eling+7.462
ulk+7.173
Neg logit contributions
 no-6.835
 aunt-6.656
 neck-6.474
 owners-6.438
 mate-6.434
Token handed
Token ID10158
Feature activation+1.54e+00

Pos logit contributions
aster+1.691
ore+1.649
eling+1.614
oker+1.584
 restaurant+1.466
Neg logit contributions
 prope-1.438
lets-1.402
aters-1.382
 pads-1.372
 Tess-1.337
Token the
Token ID262
Feature activation+2.01e+00

Pos logit contributions
aster+2.586
eling+2.538
oker+2.504
ore+2.496
 restaurant+2.247
Neg logit contributions
 prope-2.549
 Dog-2.534
 Tess-2.532
lets-2.523
 pads-2.520
Token boy
Token ID2933
Feature activation+1.32e+00

Pos logit contributions
aster+3.771
ore+3.729
eling+3.600
oker+3.520
roads+3.154
Neg logit contributions
lets-3.210
 pads-3.199
 prope-3.188
 neck-3.114
aters-3.089
Token a
Token ID257
Feature activation+1.06e+00

Pos logit contributions
aster+2.654
eling+2.637
ore+2.601
oker+2.579
roads+2.349
Neg logit contributions
lets-1.749
 pads-1.748
aters-1.718
 prope-1.717
phones-1.647
Token not
Token ID407
Feature activation+1.85e-01

Pos logit contributions
aster+0.620
ore+0.601
eling+0.596
oker+0.577
 restaurant+0.551
Neg logit contributions
 prope-0.437
aters-0.421
 pads-0.412
lets-0.406
phones-0.400
Token use
Token ID779
Feature activation+2.20e-01

Pos logit contributions
aster+0.302
ore+0.289
eling+0.286
oker+0.274
 baker+0.260
Neg logit contributions
 prope-0.326
aters-0.317
 pads-0.312
lets-0.308
phones-0.304
Token.
Token ID13
Feature activation+7.65e-01

Pos logit contributions
aster+0.387
ore+0.371
eling+0.368
oker+0.354
 restaurant+0.336
Neg logit contributions
 prope-0.359
aters-0.350
 pads-0.344
lets-0.339
phones-0.334
Token
Token ID198
Feature activation+3.57e-01

Pos logit contributions
aster+1.515
eling+1.483
ore+1.430
oker+1.411
 restaurant+1.334
Neg logit contributions
 prope-1.079
aters-1.064
 pads-1.043
 Tess-1.034
lets-1.033
Token
Token ID198
Feature activation-3.24e-02

Pos logit contributions
aster+0.667
eling+0.642
ore+0.637
oker+0.616
 baker+0.586
Neg logit contributions
 prope-0.541
aters-0.526
 pads-0.516
lets-0.508
 Tess-0.501
TokenThe
Token ID464
Feature activation+4.03e+00

Pos logit contributions
 prope+0.046
aters+0.045
 pads+0.044
lets+0.043
phones+0.042
Neg logit contributions
aster-0.065
ore-0.062
eling-0.062
oker-0.059
 baker-0.057
Token baker
Token ID46412
Feature activation+1.27e+00

Pos logit contributions
ore+7.640
eling+6.875
aster+6.733
oker+6.707
roads+6.579
Neg logit contributions
 no-7.470
 tail-7.091
 aunt-6.935
 mate-6.892
 owners-6.846
Token was
Token ID373
Feature activation+1.23e+00

Pos logit contributions
aster+2.410
ore+2.362
eling+2.298
oker+2.282
 restaurant+2.115
Neg logit contributions
 prope-1.886
lets-1.833
aters-1.765
 pads-1.761
 Tess-1.720
Token very
Token ID845
Feature activation+3.11e+00

Pos logit contributions
aster+2.644
ore+2.621
oker+2.550
eling+2.504
 restaurant+2.352
Neg logit contributions
 prope-1.560
 pads-1.390
 Dog-1.387
aters-1.380
 Tess-1.378
Token upset
Token ID9247
Feature activation-6.94e-02

Pos logit contributions
aster+6.167
ore+5.903
roads+5.489
oker+5.437
eling+5.417
Neg logit contributions
 prope-4.949
 no-4.850
 undone-4.724
un-4.619
 tail-4.585
Token.
Token ID13
Feature activation+1.11e+00

Pos logit contributions
 prope+0.106
aters+0.103
 pads+0.101
lets+0.099
phones+0.098
Neg logit contributions
aster-0.131
ore-0.127
eling-0.125
oker-0.120
 restaurant-0.115
Token wanted
Token ID2227
Feature activation+6.28e-01

Pos logit contributions
aster+1.325
ore+1.285
eling+1.260
oker+1.221
 restaurant+1.152
Neg logit contributions
 prope-0.924
lets-0.891
aters-0.890
 pads-0.888
 Tess-0.853
Token to
Token ID284
Feature activation-8.55e-03

Pos logit contributions
aster+1.225
ore+1.193
eling+1.188
oker+1.164
 baker+1.074
Neg logit contributions
 prope-0.898
aters-0.865
 pads-0.855
lets-0.842
phones-0.833
Token see
Token ID766
Feature activation+3.16e-01

Pos logit contributions
 prope+0.015
aters+0.015
 pads+0.015
lets+0.015
phones+0.014
Neg logit contributions
aster-0.014
ore-0.013
eling-0.013
oker-0.012
 baker-0.012
Token the
Token ID262
Feature activation+2.32e+00

Pos logit contributions
aster+0.560
ore+0.538
eling+0.532
oker+0.517
 restaurant+0.485
Neg logit contributions
 prope-0.514
aters-0.497
 pads-0.488
lets-0.484
phones-0.476
Token magic
Token ID5536
Feature activation+2.54e+00

Pos logit contributions
aster+4.927
oker+4.595
ore+4.593
eling+4.571
roads+4.227
Neg logit contributions
 prope-3.268
Snow-2.971
 neck-2.942
Brown-2.931
 tail-2.926
Token muff
Token ID27563
Feature activation+4.03e+00

Pos logit contributions
eling+4.328
aster+4.285
oker+3.894
ore+3.774
roads+3.532
Neg logit contributions
 prope-4.575
lets-4.461
aters-4.351
 Brown-4.344
 scarf-4.318
Tokenin
Token ID259
Feature activation+1.92e+00

Pos logit contributions
aster+5.236
oker+4.151
roads+4.047
eling+4.045
ore+3.846
Neg logit contributions
lets-9.668
ocular-9.103
aters-8.945
 forever-8.868
un-8.857
Token.
Token ID13
Feature activation-8.30e-02

Pos logit contributions
aster+3.847
eling+3.648
ore+3.624
oker+3.576
 baker+3.350
Neg logit contributions
 pads-2.742
lets-2.728
 prope-2.622
aters-2.597
phones-2.562
Token
Token ID198
Feature activation+3.50e-01

Pos logit contributions
 prope+0.135
aters+0.130
 pads+0.127
lets+0.126
phones+0.124
Neg logit contributions
aster-0.149
ore-0.144
eling-0.140
oker-0.135
 baker-0.130
Token
Token ID198
Feature activation-6.43e-01

Pos logit contributions
aster+0.653
eling+0.627
ore+0.622
oker+0.601
 baker+0.574
Neg logit contributions
 prope-0.528
aters-0.515
 pads-0.506
lets-0.498
 Tess-0.489
TokenThe
Token ID464
Feature activation+2.58e+00

Pos logit contributions
 prope+1.124
aters+1.080
 pads+1.064
lets+1.048
phones+1.040
Neg logit contributions
aster-1.110
ore-1.063
eling-1.042
oker-0.989
 restaurant-0.973
Tokenmy
Token ID1820
Feature activation-5.33e-01

Pos logit contributions
 prope+2.001
aters+1.876
 pads+1.802
phones+1.750
lets+1.712
Neg logit contributions
aster-2.567
eling-2.510
ore-2.469
 baker-2.296
oker-2.274
Token smiled
Token ID13541
Feature activation-1.44e+00

Pos logit contributions
 prope+0.969
aters+0.951
 pads+0.922
lets+0.902
phones+0.898
Neg logit contributions
aster-0.855
eling-0.819
ore-0.818
oker-0.756
 baker-0.740
Token and
Token ID290
Feature activation+5.69e-01

Pos logit contributions
 prope+2.482
phones+2.429
aters+2.375
Brown+2.332
lets+2.323
Neg logit contributions
ore-2.400
aster-2.375
eling-2.256
 early-2.189
 restaurant-2.181
Token cut
Token ID2005
Feature activation+2.57e-01

Pos logit contributions
aster+0.942
ore+0.905
eling+0.883
oker+0.861
 restaurant+0.809
Neg logit contributions
 prope-1.001
aters-0.961
 pads-0.960
lets-0.956
 Tess-0.947
Token the
Token ID262
Feature activation+5.71e-01

Pos logit contributions
aster+0.394
ore+0.374
eling+0.372
oker+0.355
 baker+0.335
Neg logit contributions
 prope-0.480
aters-0.469
 pads-0.463
lets-0.457
phones-0.450
Token muff
Token ID27563
Feature activation+4.01e+00

Pos logit contributions
aster+1.104
ore+1.064
eling+1.060
oker+1.024
 baker+0.964
Neg logit contributions
 prope-0.854
aters-0.809
 pads-0.802
lets-0.783
phones-0.772
Tokenin
Token ID259
Feature activation+1.64e+00

Pos logit contributions
aster+4.910
 restaurant+4.385
roads+4.252
oker+4.072
eling+3.854
Neg logit contributions
lets-9.749
aters-9.136
un-8.985
 pads-8.915
ocular-8.914
Token into
Token ID656
Feature activation+6.46e-01

Pos logit contributions
aster+3.124
eling+3.042
oker+2.977
ore+2.936
 baker+2.817
Neg logit contributions
 pads-2.419
 prope-2.398
lets-2.398
aters-2.355
phones-2.274
Token smaller
Token ID4833
Feature activation+1.00e+00

Pos logit contributions
aster+1.038
ore+1.015
eling+1.002
oker+0.963
 restaurant+0.905
Neg logit contributions
 prope-1.153
aters-1.107
 pads-1.083
lets-1.078
 Tess-1.064
Token pieces
Token ID5207
Feature activation-1.31e-01

Pos logit contributions
aster+1.898
ore+1.806
eling+1.805
oker+1.780
 restaurant+1.657
Neg logit contributions
 prope-1.543
aters-1.516
 pads-1.472
lets-1.469
 neck-1.401
Token.
Token ID13
Feature activation-4.75e-01

Pos logit contributions
 prope+0.224
aters+0.218
 pads+0.213
lets+0.211
phones+0.209
Neg logit contributions
aster-0.221
ore-0.212
eling-0.208
oker-0.200
 baker-0.189
Token.
Token ID13
Feature activation+1.09e+00

Pos logit contributions
aster+0.268
ore+0.257
eling+0.255
oker+0.245
 baker+0.234
Neg logit contributions
 prope-0.226
aters-0.220
 pads-0.216
lets-0.213
phones-0.209
Token
Token ID198
Feature activation+7.07e-01

Pos logit contributions
eling+2.051
aster+2.050
ore+1.945
oker+1.917
 restaurant+1.794
Neg logit contributions
 prope-1.663
aters-1.636
 Tess-1.595
 Dog-1.587
 pads-1.586
Token
Token ID198
Feature activation+4.35e-01

Pos logit contributions
aster+1.368
eling+1.344
ore+1.297
oker+1.278
 baker+1.214
Neg logit contributions
 prope-1.006
aters-0.983
 pads-0.959
 Tess-0.944
lets-0.941
TokenThe
Token ID464
Feature activation+2.96e+00

Pos logit contributions
aster+0.883
ore+0.861
eling+0.853
oker+0.824
 restaurant+0.786
Neg logit contributions
 prope-0.594
aters-0.573
 pads-0.562
lets-0.554
Snow-0.544
Token foolish
Token ID19538
Feature activation+2.77e+00

Pos logit contributions
aster+6.272
ore+6.174
oker+6.010
eling+5.998
roads+5.595
Neg logit contributions
 prope-4.299
 neck-4.163
Brown-4.085
Snow-4.001
 Brown-4.000
Token muff
Token ID27563
Feature activation+3.98e+00

Pos logit contributions
aster+5.227
eling+5.042
oker+4.908
ore+4.858
roads+4.630
Neg logit contributions
 prope-4.631
 neck-4.427
 mate-4.306
 tail-4.290
 Brown-4.280
Tokenin
Token ID259
Feature activation+1.56e+00

Pos logit contributions
aster+4.956
eling+3.601
 salsa+3.570
roads+3.559
 restaurant+3.514
Neg logit contributions
lets-9.907
un-9.472
Brown-8.923
aters-8.913
 forever-8.896
Token was
Token ID373
Feature activation+6.41e-01

Pos logit contributions
aster+3.024
ore+2.913
eling+2.889
oker+2.859
 baker+2.631
Neg logit contributions
lets-2.315
 prope-2.269
 pads-2.266
aters-2.189
 neck-2.134
Token not
Token ID407
Feature activation+7.94e-01

Pos logit contributions
aster+1.286
ore+1.249
eling+1.221
oker+1.203
 restaurant+1.130
Neg logit contributions
 prope-0.906
 pads-0.852
aters-0.851
lets-0.832
 Tess-0.809
Token happy
Token ID3772
Feature activation+5.03e-01

Pos logit contributions
aster+1.590
ore+1.548
eling+1.516
oker+1.473
 restaurant+1.397
Neg logit contributions
 prope-1.137
aters-1.070
 pads-1.070
lets-1.057
 Tess-1.024
Token.
Token ID13
Feature activation+1.12e+00

Pos logit contributions
aster+0.967
ore+0.924
eling+0.920
oker+0.883
 baker+0.841
Neg logit contributions
 prope-0.740
aters-0.728
 pads-0.720
lets-0.709
phones-0.689
Token:
Token ID25
Feature activation+1.23e+00

Pos logit contributions
aster+1.430
ore+1.375
eling+1.334
oker+1.325
 restaurant+1.221
Neg logit contributions
 prope-1.391
aters-1.349
lets-1.343
 pads-1.332
 Tess-1.314
Token Rot
Token ID18481
Feature activation+1.37e+00

Pos logit contributions
aster+2.764
eling+2.704
ore+2.647
oker+2.602
 restaurant+2.441
Neg logit contributions
 prope-1.443
 Tess-1.339
 Dog-1.331
Brown-1.295
aters-1.276
Token is
Token ID318
Feature activation+5.42e-01

Pos logit contributions
aster+2.302
ore+2.080
oker+2.079
eling+2.060
 baker+1.973
Neg logit contributions
aters-2.356
 prope-2.354
 pads-2.342
lets-2.320
 Tess-2.274
Token when
Token ID618
Feature activation-2.65e-01

Pos logit contributions
aster+0.983
ore+0.944
eling+0.929
oker+0.907
 restaurant+0.853
Neg logit contributions
 prope-0.864
aters-0.835
 pads-0.825
lets-0.816
phones-0.800
Token something
Token ID1223
Feature activation+1.83e+00

Pos logit contributions
 prope+0.445
aters+0.432
 pads+0.424
lets+0.419
phones+0.411
Neg logit contributions
aster-0.456
ore-0.436
eling-0.433
oker-0.413
 restaurant-0.394
Token spo
Token ID15106
Feature activation+3.97e+00

Pos logit contributions
aster+3.449
ore+3.164
eling+3.159
oker+3.128
 baker+2.965
Neg logit contributions
 prope-2.867
 pads-2.780
lets-2.711
 Dog-2.657
aters-2.645
Tokenils
Token ID4487
Feature activation+8.76e-01

Pos logit contributions
aster+4.800
 baker+4.397
ore+4.363
 restaurant+4.338
oker+4.304
Neg logit contributions
aters-9.764
 pads-9.064
lets-8.777
tails-8.692
un-8.571
Token and
Token ID290
Feature activation+1.09e+00

Pos logit contributions
aster+1.637
eling+1.565
ore+1.565
oker+1.506
 baker+1.451
Neg logit contributions
 prope-1.311
aters-1.286
 pads-1.267
lets-1.248
 Tess-1.227
Token gets
Token ID3011
Feature activation+1.15e+00

Pos logit contributions
aster+2.014
ore+1.936
eling+1.856
oker+1.847
 baker+1.756
Neg logit contributions
 prope-1.721
 pads-1.668
lets-1.640
aters-1.639
phones-1.590
Token bad
Token ID2089
Feature activation+8.42e-01

Pos logit contributions
aster+2.111
ore+2.013
eling+1.964
oker+1.962
 baker+1.834
Neg logit contributions
 prope-1.816
 pads-1.735
aters-1.727
lets-1.714
 Tess-1.680
Token.
Token ID13
Feature activation-1.03e-02

Pos logit contributions
aster+1.560
eling+1.478
ore+1.470
oker+1.416
 baker+1.349
Neg logit contributions
 prope-1.302
aters-1.292
 pads-1.283
lets-1.258
phones-1.227
Token not
Token ID407
Feature activation-1.28e-02

Pos logit contributions
 prope+0.444
aters+0.435
 pads+0.417
lets+0.412
phones+0.407
Neg logit contributions
aster-0.580
eling-0.549
ore-0.545
oker-0.524
 restaurant-0.510
Token tried
Token ID3088
Feature activation+3.36e-01

Pos logit contributions
 prope+0.020
aters+0.019
 pads+0.019
lets+0.019
phones+0.018
Neg logit contributions
aster-0.024
ore-0.023
eling-0.023
oker-0.022
 restaurant-0.021
Token to
Token ID284
Feature activation-1.52e-01

Pos logit contributions
aster+0.642
ore+0.617
eling+0.614
oker+0.592
 baker+0.563
Neg logit contributions
 prope-0.503
aters-0.489
 pads-0.482
lets-0.474
phones-0.465
Token get
Token ID651
Feature activation+1.76e+00

Pos logit contributions
 prope+0.249
aters+0.243
 pads+0.240
lets+0.235
phones+0.231
Neg logit contributions
aster-0.274
ore-0.259
eling-0.258
oker-0.247
 restaurant-0.236
Token the
Token ID262
Feature activation+2.03e+00

Pos logit contributions
aster+3.389
oker+3.268
ore+3.262
eling+3.192
roads+3.005
Neg logit contributions
 prope-2.580
phones-2.492
 Dog-2.489
 pads-2.488
aters-2.466
Token muff
Token ID27563
Feature activation+3.97e+00

Pos logit contributions
aster+3.024
eling+2.830
ore+2.753
oker+2.720
roads+2.477
Neg logit contributions
 prope-4.109
aters-3.890
Snow-3.807
 scarf-3.801
 Brown-3.781
Tokenin
Token ID259
Feature activation+1.79e+00

Pos logit contributions
aster+4.675
roads+3.825
eling+3.783
oker+3.773
 restaurant+3.440
Neg logit contributions
lets-9.623
aters-9.385
ocular-9.236
un-9.126
Snow-9.001
Token.
Token ID13
Feature activation-8.22e-01

Pos logit contributions
aster+3.545
eling+3.544
ore+3.357
oker+3.310
 baker+3.097
Neg logit contributions
 pads-2.502
 prope-2.471
aters-2.443
lets-2.441
 Dog-2.324
Token They
Token ID1119
Feature activation-1.07e+00

Pos logit contributions
 prope+1.249
aters+1.200
lets+1.175
 pads+1.174
phones+1.154
Neg logit contributions
aster-1.571
ore-1.531
oker-1.432
eling-1.430
 restaurant-1.399
Token wished
Token ID16555
Feature activation+4.91e-01

Pos logit contributions
aters+1.843
 prope+1.775
 pads+1.730
phones+1.708
lets+1.708
Neg logit contributions
aster-1.849
eling-1.801
ore-1.691
oker-1.604
 restaurant-1.598
Token they
Token ID484
Feature activation-1.16e+00

Pos logit contributions
aster+0.874
ore+0.843
eling+0.840
oker+0.807
 baker+0.761
Neg logit contributions
 prope-0.792
aters-0.763
 pads-0.754
lets-0.742
phones-0.733
Token."
Token ID526
Feature activation-7.66e-01

Pos logit contributions
aster+0.539
ore+0.517
eling+0.513
oker+0.494
 baker+0.469
Neg logit contributions
 prope-0.475
aters-0.462
 pads-0.454
lets-0.448
phones-0.440
Token<|endoftext|>
Token ID50256
Feature activation+1.11e+00

Pos logit contributions
 prope+1.120
aters+1.034
 pads+1.016
lets+0.998
 mate+0.991
Neg logit contributions
aster-1.529
ore-1.471
eling-1.414
oker-1.392
 baker-1.344
TokenJimmy
Token ID40335
Feature activation-8.18e-02

Pos logit contributions
aster+1.525
eling+1.461
oker+1.344
ore+1.328
 baker+1.251
Neg logit contributions
aters-2.267
 pads-2.236
 prope-2.233
 Tess-2.174
 Dog-2.171
Token was
Token ID373
Feature activation+3.10e-01

Pos logit contributions
 prope+0.117
aters+0.113
 pads+0.110
lets+0.108
phones+0.107
Neg logit contributions
aster-0.161
ore-0.155
eling-0.153
oker-0.147
 restaurant-0.142
Token a
Token ID257
Feature activation+3.34e-01

Pos logit contributions
aster+0.590
ore+0.567
eling+0.564
oker+0.545
 restaurant+0.516
Neg logit contributions
 prope-0.466
aters-0.452
 pads-0.444
lets-0.434
phones-0.428
Token depend
Token ID4745
Feature activation+3.93e+00

Pos logit contributions
aster+0.643
ore+0.621
eling+0.616
oker+0.593
 restaurant+0.561
Neg logit contributions
 prope-0.493
aters-0.472
 pads-0.465
lets-0.458
phones-0.448
Tokenable
Token ID540
Feature activation+1.88e-01

Pos logit contributions
eling+5.164
aster+4.933
 salsa+4.616
oons+4.043
 restaurant+3.997
Neg logit contributions
un-9.003
 pads-8.536
aters-8.290
lets-8.246
 Tess-8.172
Token young
Token ID1862
Feature activation+4.98e-01

Pos logit contributions
aster+0.320
ore+0.306
eling+0.304
oker+0.291
 baker+0.275
Neg logit contributions
 prope-0.317
aters-0.308
 pads-0.303
lets-0.300
phones-0.295
Token boy
Token ID2933
Feature activation-1.21e-01

Pos logit contributions
aster+0.625
ore+0.582
eling+0.581
oker+0.539
 restaurant+0.488
Neg logit contributions
 prope-1.079
aters-1.050
 pads-1.036
lets-1.029
 Tess-1.013
Token.
Token ID13
Feature activation+6.18e-01

Pos logit contributions
 prope+0.200
aters+0.193
 pads+0.189
lets+0.187
phones+0.184
Neg logit contributions
aster-0.212
ore-0.204
eling-0.200
oker-0.192
 baker-0.184
Token Whenever
Token ID21326
Feature activation+1.01e+00

Pos logit contributions
aster+1.185
eling+1.139
ore+1.132
oker+1.082
 baker+1.030
Neg logit contributions
 prope-0.921
aters-0.912
 pads-0.890
lets-0.864
 Tess-0.844
Token!
Token ID0
Feature activation+1.13e+00

Pos logit contributions
aster+1.675
ore+1.605
eling+1.593
oker+1.550
 baker+1.474
Neg logit contributions
 prope-1.100
 pads-1.068
aters-1.066
lets-1.043
phones-0.987
Token
Token ID198
Feature activation+6.13e-01

Pos logit contributions
eling+2.165
aster+2.155
ore+2.049
oker+2.010
 restaurant+1.918
Neg logit contributions
 prope-1.660
aters-1.643
 pads-1.596
 Tess-1.588
 Dog-1.568
Token
Token ID198
Feature activation+3.35e-01

Pos logit contributions
aster+1.168
eling+1.138
ore+1.116
oker+1.088
 restaurant+1.035
Neg logit contributions
 prope-0.899
aters-0.874
 pads-0.852
lets-0.837
 Tess-0.832
TokenThe
Token ID464
Feature activation+2.39e+00

Pos logit contributions
aster+0.670
ore+0.650
eling+0.644
oker+0.621
 restaurant+0.595
Neg logit contributions
 prope-0.469
aters-0.453
 pads-0.444
lets-0.438
phones-0.429
Token foolish
Token ID19538
Feature activation+2.42e+00

Pos logit contributions
aster+5.133
ore+5.100
eling+4.893
oker+4.864
roads+4.508
Neg logit contributions
 prope-3.322
 neck-3.088
 tail-3.041
 scarf-3.003
Brown-2.997
Token muff
Token ID27563
Feature activation+3.93e+00

Pos logit contributions
aster+4.468
eling+4.384
ore+4.233
oker+4.091
roads+3.947
Neg logit contributions
 prope-4.003
 tail-3.780
 neck-3.773
lets-3.748
 mate-3.721
Tokenin
Token ID259
Feature activation+1.53e+00

Pos logit contributions
aster+4.947
 restaurant+3.722
ore+3.699
oker+3.604
 salsa+3.593
Neg logit contributions
lets-9.696
un-9.083
aters-8.729
tails-8.694
 forever-8.660
Token was
Token ID373
Feature activation+6.63e-01

Pos logit contributions
aster+2.970
ore+2.859
eling+2.821
oker+2.785
 restaurant+2.600
Neg logit contributions
lets-2.236
 prope-2.208
 pads-2.194
aters-2.133
phones-2.066
Token very
Token ID845
Feature activation+2.50e+00

Pos logit contributions
aster+1.436
ore+1.404
eling+1.372
oker+1.351
 restaurant+1.281
Neg logit contributions
 prope-0.822
 pads-0.774
aters-0.769
lets-0.754
phones-0.726
Token pleased
Token ID10607
Feature activation+2.06e-01

Pos logit contributions
aster+5.252
ore+5.038
eling+4.833
roads+4.659
oker+4.649
Neg logit contributions
 prope-3.606
 tail-3.422
 neck-3.385
lets-3.358
 no-3.339
Token.
Token ID13
Feature activation+9.07e-01

Pos logit contributions
aster+0.371
ore+0.355
eling+0.353
oker+0.338
 baker+0.322
Neg logit contributions
 prope-0.330
aters-0.322
 pads-0.316
lets-0.313
phones-0.307
Token and
Token ID290
Feature activation-1.30e+00

Pos logit contributions
 prope+1.631
aters+1.509
lets+1.501
phones+1.492
 pads+1.490
Neg logit contributions
ore-1.824
aster-1.775
eling-1.670
 restaurant-1.652
 baker-1.626
Token happy
Token ID3772
Feature activation-7.65e-01

Pos logit contributions
 prope+1.980
aters+1.965
phones+1.907
lets+1.889
 pads+1.841
Neg logit contributions
aster-2.446
eling-2.367
ore-2.344
oker-2.187
 restaurant-2.182
Token.
Token ID13
Feature activation-9.15e-01

Pos logit contributions
 prope+1.427
aters+1.369
 pads+1.345
phones+1.329
lets+1.328
Neg logit contributions
ore-1.189
aster-1.184
eling-1.108
oker-1.073
 restaurant-1.069
Token
Token ID198
Feature activation-2.87e-01

Pos logit contributions
 prope+1.551
aters+1.472
lets+1.460
 pads+1.448
phones+1.431
Neg logit contributions
aster-1.620
ore-1.572
eling-1.439
 restaurant-1.437
oker-1.428
Token
Token ID198
Feature activation-1.27e+00

Pos logit contributions
 prope+0.483
aters+0.468
 pads+0.459
lets+0.456
phones+0.449
Neg logit contributions
aster-0.501
ore-0.485
eling-0.466
oker-0.450
 restaurant-0.435
TokenDown
Token ID8048
Feature activation+3.89e+00

Pos logit contributions
 prope+1.869
aters+1.795
lets+1.732
phones+1.696
 pads+1.693
Neg logit contributions
aster-2.652
ore-2.474
eling-2.394
 restaurant-2.383
oker-2.369
Token on
Token ID319
Feature activation+3.85e-01

Pos logit contributions
oker+8.382
aster+8.349
 salsa+7.972
eling+7.826
aughter+7.781
Neg logit contributions
 prope-5.502
 pads-5.177
 Dog-5.056
 Ch-4.922
Snow-4.915
Token the
Token ID262
Feature activation-2.84e-01

Pos logit contributions
aster+0.550
ore+0.525
eling+0.522
oker+0.500
 baker+0.463
Neg logit contributions
 prope-0.763
aters-0.740
 pads-0.729
lets-0.718
phones-0.715
Token ground
Token ID2323
Feature activation+1.22e+00

Pos logit contributions
 prope+0.494
aters+0.488
lets+0.478
 pads+0.477
phones+0.466
Neg logit contributions
aster-0.479
ore-0.454
eling-0.446
oker-0.422
 restaurant-0.415
Token,
Token ID11
Feature activation+7.71e-01

Pos logit contributions
aster+2.703
eling+2.634
oker+2.561
ore+2.534
 baker+2.460
Neg logit contributions
 prope-1.376
 pads-1.365
aters-1.319
lets-1.276
 Dog-1.263
Token there
Token ID612
Feature activation-1.28e-02

Pos logit contributions
aster+1.391
ore+1.325
eling+1.320
oker+1.279
 baker+1.195
Neg logit contributions
 prope-1.258
 pads-1.184
aters-1.182
 Tess-1.159
lets-1.153
Token gifts
Token ID13201
Feature activation-6.89e-01

Pos logit contributions
 prope+0.436
aters+0.427
 pads+0.417
lets+0.412
phones+0.409
Neg logit contributions
aster-0.402
ore-0.379
eling-0.374
oker-0.359
 restaurant-0.348
Token.
Token ID13
Feature activation-4.74e-01

Pos logit contributions
 prope+1.260
aters+1.209
 pads+1.186
lets+1.167
phones+1.167
Neg logit contributions
aster-1.082
ore-1.062
eling-1.009
oker-0.986
 restaurant-0.957
Token
Token ID198
Feature activation+1.13e-02

Pos logit contributions
 prope+0.783
aters+0.747
 pads+0.735
lets+0.732
phones+0.724
Neg logit contributions
aster-0.847
ore-0.824
oker-0.769
eling-0.768
 baker-0.739
Token
Token ID198
Feature activation-8.53e-02

Pos logit contributions
aster+0.020
ore+0.020
eling+0.019
oker+0.019
 restaurant+0.018
Neg logit contributions
 prope-0.018
aters-0.018
 pads-0.017
lets-0.017
phones-0.017
TokenThe
Token ID464
Feature activation+2.00e+00

Pos logit contributions
 prope+0.120
aters+0.116
 pads+0.113
lets+0.112
phones+0.110
Neg logit contributions
aster-0.173
ore-0.166
eling-0.164
oker-0.159
 baker-0.152
Token j
Token ID474
Feature activation+3.88e+00

Pos logit contributions
ore+3.231
aster+3.180
eling+3.168
oker+2.878
ying+2.677
Neg logit contributions
 prope-3.717
 scarf-3.558
 pads-3.552
 Dog-3.540
 tail-3.518
Tokenolly
Token ID5098
Feature activation+2.18e+00

Pos logit contributions
eling+5.631
roads+4.529
 ang+4.487
aster+4.352
ore+3.919
Neg logit contributions
aters-8.898
 pads-8.373
 mate-8.180
lets-8.032
un-7.877
Token man
Token ID582
Feature activation+1.96e+00

Pos logit contributions
ore+3.176
eling+3.154
aster+3.073
oker+2.751
roads+2.633
Neg logit contributions
 pads-4.423
 scarf-4.284
lets-4.271
 tail-4.265
 prope-4.238
Token gives
Token ID3607
Feature activation+8.05e-01

Pos logit contributions
aster+3.270
ore+3.118
oker+3.028
eling+3.004
roads+2.644
Neg logit contributions
lets-3.521
 pads-3.516
 prope-3.513
aters-3.460
phones-3.309
Token them
Token ID606
Feature activation-7.25e-01

Pos logit contributions
aster+1.504
ore+1.442
eling+1.437
oker+1.393
 baker+1.305
Neg logit contributions
 prope-1.222
aters-1.221
 pads-1.203
lets-1.182
 Tess-1.164
Token each
Token ID1123
Feature activation+3.90e-01

Pos logit contributions
 prope+1.212
aters+1.153
 pads+1.123
lets+1.097
phones+1.094
Neg logit contributions
aster-1.287
ore-1.224
eling-1.191
 restaurant-1.149
oker-1.139
TokenL
Token ID43
Feature activation+2.38e+00

Pos logit contributions
aster+1.585
eling+1.554
ore+1.549
oker+1.479
 baker+1.431
Neg logit contributions
 prope-1.004
 pads-0.976
aters-0.971
Snow-0.958
lets-0.943
Tokenily
Token ID813
Feature activation+3.23e-02

Pos logit contributions
aster+4.082
eling+4.071
ore+3.991
 restaurant+3.719
roads+3.662
Neg logit contributions
 pads-3.883
aters-3.862
 mate-3.811
lets-3.703
 prope-3.663
Token looked
Token ID3114
Feature activation+1.12e+00

Pos logit contributions
aster+0.056
ore+0.054
eling+0.053
oker+0.051
 baker+0.048
Neg logit contributions
 prope-0.054
aters-0.053
 pads-0.052
lets-0.051
phones-0.050
Token at
Token ID379
Feature activation+1.85e-01

Pos logit contributions
aster+1.823
eling+1.625
oker+1.624
ore+1.618
 baker+1.491
Neg logit contributions
 prope-2.081
 pads-2.067
aters-2.019
lets-2.001
 tail-1.940
Token the
Token ID262
Feature activation+2.60e+00

Pos logit contributions
aster+0.308
ore+0.295
eling+0.292
oker+0.280
 baker+0.265
Neg logit contributions
 prope-0.322
aters-0.314
 pads-0.309
lets-0.304
phones-0.300
Token muff
Token ID27563
Feature activation+3.88e+00

Pos logit contributions
aster+4.561
eling+4.438
oker+4.426
ore+4.409
roads+4.026
Neg logit contributions
 prope-4.648
 scarf-4.346
 neck-4.319
 tail-4.287
aters-4.245
Tokenin
Token ID259
Feature activation+1.64e+00

Pos logit contributions
aster+4.336
roads+3.486
oker+3.460
 restaurant+3.403
eling+3.317
Neg logit contributions
lets-9.287
aters-9.211
un-8.879
 pads-8.834
 scarf-8.801
Token.
Token ID13
Feature activation+8.40e-01

Pos logit contributions
aster+3.246
eling+3.145
oker+3.096
ore+3.056
 baker+2.904
Neg logit contributions
 pads-2.334
 prope-2.332
lets-2.295
aters-2.293
 neck-2.193
Token It
Token ID632
Feature activation+2.75e-01

Pos logit contributions
aster+1.836
eling+1.806
ore+1.744
oker+1.702
 restaurant+1.634
Neg logit contributions
 prope-1.027
aters-1.006
 pads-0.977
lets-0.949
 Tess-0.943
Token was
Token ID373
Feature activation-2.43e-01

Pos logit contributions
aster+0.502
ore+0.483
eling+0.476
oker+0.461
 baker+0.438
Neg logit contributions
 prope-0.436
aters-0.422
 pads-0.417
lets-0.411
phones-0.402
Token still
Token ID991
Feature activation+5.30e-01

Pos logit contributions
 prope+0.361
aters+0.353
 pads+0.343
lets+0.343
phones+0.339
Neg logit contributions
aster-0.467
eling-0.444
ore-0.442
oker-0.422
 restaurant-0.409
Token boy
Token ID2933
Feature activation+4.86e-01

Pos logit contributions
eling+8.671
aster+8.590
ore+8.572
oker+8.121
roads+7.904
Neg logit contributions
 tail-3.625
 prope-3.546
 mate-3.422
 aunt-3.421
 no-3.396
Token was
Token ID373
Feature activation-1.95e-02

Pos logit contributions
aster+0.885
ore+0.850
eling+0.840
oker+0.814
 baker+0.766
Neg logit contributions
 prope-0.765
aters-0.737
 pads-0.733
lets-0.726
phones-0.708
Token so
Token ID523
Feature activation+1.22e+00

Pos logit contributions
 prope+0.029
aters+0.028
 pads+0.028
lets+0.028
phones+0.027
Neg logit contributions
aster-0.037
ore-0.036
eling-0.035
oker-0.034
 baker-0.033
Token relieved
Token ID22598
Feature activation+2.45e-01

Pos logit contributions
aster+2.279
eling+2.127
ore+2.122
oker+2.048
 restaurant+1.960
Neg logit contributions
 prope-1.956
 Dog-1.818
 pads-1.818
aters-1.805
 Tess-1.784
Token!
Token ID0
Feature activation+8.77e-01

Pos logit contributions
aster+0.462
ore+0.443
eling+0.440
oker+0.425
 baker+0.405
Neg logit contributions
 prope-0.370
aters-0.360
 pads-0.354
lets-0.349
phones-0.343
Token The
Token ID383
Feature activation+3.86e+00

Pos logit contributions
aster+1.614
eling+1.566
ore+1.508
oker+1.460
 restaurant+1.402
Neg logit contributions
 prope-1.377
aters-1.364
 pads-1.339
lets-1.296
 Tess-1.295
Token baker
Token ID46412
Feature activation+6.99e-01

Pos logit contributions
eling+8.643
aster+8.528
ore+8.342
oker+7.999
ying+7.809
Neg logit contributions
 tail-5.340
 prope-5.274
 aunt-5.152
 scarf-5.056
 no-5.032
Token was
Token ID373
Feature activation+4.90e-02

Pos logit contributions
aster+1.267
ore+1.221
eling+1.204
oker+1.170
 restaurant+1.101
Neg logit contributions
 prope-1.104
aters-1.057
lets-1.053
 pads-1.047
phones-1.018
Token so
Token ID523
Feature activation+1.26e+00

Pos logit contributions
aster+0.096
ore+0.092
eling+0.091
oker+0.088
 baker+0.084
Neg logit contributions
 prope-0.071
aters-0.069
 pads-0.068
lets-0.067
phones-0.065
Token patient
Token ID5827
Feature activation+3.56e-01

Pos logit contributions
aster+2.345
ore+2.191
eling+2.188
oker+2.103
 restaurant+2.006
Neg logit contributions
 prope-2.022
 Dog-1.869
 pads-1.854
aters-1.851
 Tess-1.849
Token even
Token ID772
Feature activation+2.46e+00

Pos logit contributions
aster+0.718
ore+0.689
eling+0.684
oker+0.662
 baker+0.632
Neg logit contributions
 prope-0.492
aters-0.475
 pads-0.468
lets-0.459
phones-0.450
Token flew
Token ID13112
Feature activation-5.26e-01

Pos logit contributions
aster+0.840
ore+0.809
eling+0.797
oker+0.779
 restaurant+0.737
Neg logit contributions
 prope-0.639
aters-0.612
 pads-0.611
lets-0.606
phones-0.583
Token Sally
Token ID25737
Feature activation-1.65e-02

Pos logit contributions
 prope+1.118
aters+1.103
 pads+1.077
phones+1.071
lets+1.068
Neg logit contributions
aster-0.659
ore-0.646
eling-0.612
oker-0.582
 baker-0.554
Token up
Token ID510
Feature activation+1.88e+00

Pos logit contributions
 prope+0.028
aters+0.027
 pads+0.027
lets+0.026
phones+0.026
Neg logit contributions
aster-0.028
ore-0.027
eling-0.026
oker-0.025
 restaurant-0.024
Token to
Token ID284
Feature activation+9.30e-01

Pos logit contributions
aster+4.150
eling+4.069
oker+3.935
ore+3.813
 salsa+3.622
Neg logit contributions
 prope-2.501
aters-2.280
 tail-2.262
 pads-2.228
 neck-2.216
Token the
Token ID262
Feature activation+1.97e+00

Pos logit contributions
aster+1.599
ore+1.575
eling+1.557
oker+1.518
 baker+1.401
Neg logit contributions
 prope-1.505
 Tess-1.468
aters-1.466
 Dog-1.448
 pads-1.448
Token muff
Token ID27563
Feature activation+3.86e+00

Pos logit contributions
aster+3.730
eling+3.584
oker+3.556
ore+3.528
ying+3.244
Neg logit contributions
 prope-3.253
 neck-2.990
aters-2.935
 tail-2.934
 pads-2.895
Tokenin
Token ID259
Feature activation+1.35e+00

Pos logit contributions
aster+4.256
oker+3.767
 salsa+3.435
eling+3.392
 restaurant+3.297
Neg logit contributions
lets-9.370
aters-9.189
un-9.002
tails-8.731
 pads-8.683
Token.
Token ID13
Feature activation+9.61e-01

Pos logit contributions
aster+2.755
eling+2.718
oker+2.632
ore+2.591
 baker+2.419
Neg logit contributions
 prope-1.787
 pads-1.787
aters-1.784
lets-1.777
phones-1.682
Token Sally
Token ID25737
Feature activation+2.28e-01

Pos logit contributions
aster+2.056
eling+2.017
ore+1.967
oker+1.898
 restaurant+1.829
Neg logit contributions
 prope-1.220
aters-1.209
 pads-1.184
 Tess-1.160
lets-1.159
Token took
Token ID1718
Feature activation+9.44e-01

Pos logit contributions
aster+0.410
ore+0.393
eling+0.388
oker+0.376
 restaurant+0.356
Neg logit contributions
 prope-0.365
aters-0.354
 pads-0.349
lets-0.345
phones-0.338
Token a
Token ID257
Feature activation+1.05e+00

Pos logit contributions
aster+1.874
eling+1.852
ore+1.834
oker+1.781
 restaurant+1.700
Neg logit contributions
 prope-1.294
aters-1.249
 Tess-1.239
 pads-1.236
lets-1.233
Token,
Token ID11
Feature activation+1.68e+00

Pos logit contributions
aster+1.413
eling+1.385
ore+1.365
oker+1.334
 baker+1.265
Neg logit contributions
 prope-0.781
aters-0.744
 pads-0.730
lets-0.721
 Dog-0.711
Token something
Token ID1223
Feature activation+2.34e+00

Pos logit contributions
eling+3.009
ore+3.007
aster+2.949
oker+2.941
ying+2.701
Neg logit contributions
 prope-2.641
 Dog-2.594
 Tess-2.538
 Bun-2.535
 Brown-2.508
Token special
Token ID2041
Feature activation+1.10e+00

Pos logit contributions
aster+4.903
ore+4.815
eling+4.704
oker+4.620
roads+4.311
Neg logit contributions
 Bun-3.016
 prope-3.010
 scarf-2.964
 Dog-2.929
 Tess-2.900
Token happened
Token ID3022
Feature activation+4.96e-01

Pos logit contributions
aster+1.693
eling+1.622
ore+1.576
oker+1.500
 baker+1.404
Neg logit contributions
 prope-2.006
aters-1.967
 pads-1.966
lets-1.952
 Tess-1.915
Token.
Token ID13
Feature activation+1.10e+00

Pos logit contributions
aster+0.908
eling+0.866
ore+0.863
oker+0.838
 baker+0.798
Neg logit contributions
 prope-0.772
aters-0.755
 pads-0.751
lets-0.734
phones-0.723
Token The
Token ID383
Feature activation+3.86e+00

Pos logit contributions
eling+2.140
aster+2.111
ore+2.032
oker+2.005
roads+1.844
Neg logit contributions
 prope-1.588
aters-1.540
 Tess-1.498
 Dog-1.491
 pads-1.491
Token baker
Token ID46412
Feature activation+1.31e+00

Pos logit contributions
eling+7.988
ore+7.979
aster+7.748
oker+7.729
roads+7.284
Neg logit contributions
 tail-5.866
 aunt-5.775
 prope-5.734
 scarf-5.709
 neck-5.593
Token set
Token ID900
Feature activation+1.66e+00

Pos logit contributions
aster+2.513
ore+2.439
eling+2.401
oker+2.374
 restaurant+2.185
Neg logit contributions
lets-1.901
 prope-1.895
aters-1.827
 pads-1.819
 Tess-1.773
Token out
Token ID503
Feature activation+1.79e+00

Pos logit contributions
eling+3.081
aster+3.040
ore+3.010
oker+2.999
 restaurant+2.726
Neg logit contributions
 prope-2.481
 Dog-2.387
lets-2.374
 Tess-2.368
aters-2.361
Token to
Token ID284
Feature activation+2.20e+00

Pos logit contributions
aster+3.327
eling+3.301
oker+3.292
ore+3.204
 baker+3.069
Neg logit contributions
 prope-2.611
aters-2.600
 pads-2.492
phones-2.484
lets-2.478
Token bake
Token ID28450
Feature activation+1.41e+00

Pos logit contributions
ore+4.091
oker+3.953
eling+3.829
ying+3.669
aster+3.612
Neg logit contributions
 prope-3.442
 Tess-3.382
 Dog-3.341
 mate-3.196
 scarf-3.188
Token wait
Token ID4043
Feature activation+8.24e-02

Pos logit contributions
aster+0.545
ore+0.525
eling+0.518
oker+0.496
 restaurant+0.471
Neg logit contributions
 prope-0.586
aters-0.566
 pads-0.557
lets-0.552
phones-0.547
Token to
Token ID284
Feature activation+1.91e-01

Pos logit contributions
aster+0.146
ore+0.140
eling+0.139
oker+0.133
 baker+0.127
Neg logit contributions
 prope-0.134
aters-0.130
 pads-0.128
lets-0.126
phones-0.124
Token eat
Token ID4483
Feature activation+1.44e+00

Pos logit contributions
aster+0.319
ore+0.307
eling+0.303
oker+0.291
 baker+0.275
Neg logit contributions
 prope-0.332
aters-0.323
 pads-0.317
lets-0.314
phones-0.310
Token the
Token ID262
Feature activation+2.28e+00

Pos logit contributions
aster+2.519
ore+2.452
oker+2.414
eling+2.413
 baker+2.224
Neg logit contributions
aters-2.313
 prope-2.294
 pads-2.253
 Tess-2.214
lets-2.174
Token tasty
Token ID25103
Feature activation+2.58e+00

Pos logit contributions
aster+4.139
ore+3.844
oker+3.803
eling+3.791
roads+3.470
Neg logit contributions
 prope-3.927
aters-3.641
 scarf-3.629
Brown-3.591
Snow-3.586
Token muff
Token ID27563
Feature activation+3.85e+00

Pos logit contributions
aster+4.061
eling+3.850
oker+3.734
ore+3.641
roads+3.451
Neg logit contributions
aters-4.855
 prope-4.796
 scarf-4.771
 Brown-4.729
 neck-4.724
Tokenins
Token ID1040
Feature activation+1.08e+00

Pos logit contributions
aster+4.566
eling+3.736
roads+3.616
oker+3.453
atches+3.405
Neg logit contributions
lets-9.551
aters-9.143
 pads-8.814
un-8.716
ocular-8.625
Token.
Token ID13
Feature activation-3.86e-01

Pos logit contributions
aster+2.030
eling+1.949
ore+1.910
oker+1.840
 restaurant+1.747
Neg logit contributions
 pads-1.616
 prope-1.603
aters-1.602
lets-1.593
phones-1.532
Token<|endoftext|>
Token ID50256
Feature activation+1.57e+00

Pos logit contributions
 prope+0.601
aters+0.564
 pads+0.556
lets+0.556
phones+0.547
Neg logit contributions
aster-0.723
ore-0.703
eling-0.676
oker-0.663
 baker-0.635
TokenOnce
Token ID7454
Feature activation+8.63e-01

Pos logit contributions
aster+1.735
eling+1.648
oker+1.485
ore+1.361
 baker+1.350
Neg logit contributions
aters-3.672
 pads-3.631
 Dog-3.573
 prope-3.570
Snow-3.535
Token upon
Token ID2402
Feature activation-5.60e-01

Pos logit contributions
aster+1.605
eling+1.538
ore+1.507
oker+1.462
 baker+1.406
Neg logit contributions
 prope-1.296
aters-1.271
 pads-1.256
lets-1.234
 Tess-1.232
Token different
Token ID1180
Feature activation-8.64e-01

Pos logit contributions
 prope+0.120
aters+0.117
 pads+0.115
lets+0.114
phones+0.112
Neg logit contributions
aster-0.100
ore-0.094
eling-0.093
oker-0.088
 restaurant-0.084
Token neck
Token ID7393
Feature activation-1.49e+00

Pos logit contributions
 prope+1.405
aters+1.360
 pads+1.327
Snow+1.323
 Dog+1.318
Neg logit contributions
aster-1.499
eling-1.456
ore-1.420
 restaurant-1.350
oker-1.334
Tokenl
Token ID75
Feature activation-1.35e+00

Pos logit contributions
 prope+2.459
aters+2.387
Brown+2.263
 Dog+2.251
 Tess+2.245
Neg logit contributions
aster-2.660
ore-2.591
eling-2.463
 restaurant-2.323
 baker-2.299
Tokenaces
Token ID2114
Feature activation-2.06e+00

Pos logit contributions
 prope+1.793
 Tess+1.634
Snow+1.622
aters+1.617
Brown+1.600
Neg logit contributions
aster-2.819
ore-2.773
eling-2.659
oker-2.583
 baker-2.510
Token,
Token ID11
Feature activation-1.44e+00

Pos logit contributions
 prope+3.604
aters+3.413
Brown+3.402
Snow+3.336
 Dog+3.332
Neg logit contributions
ore-3.555
aster-3.442
eling-3.180
he-3.159
ying-3.114
Token brace
Token ID22581
Feature activation-3.85e+00

Pos logit contributions
aters+2.139
 prope+2.066
un+1.959
Brown+1.947
phones+1.947
Neg logit contributions
aster-2.787
eling-2.624
ore-2.454
 baker-2.423
 restaurant-2.421
Tokenlets
Token ID5289
Feature activation-1.87e+00

Pos logit contributions
 prope+3.299
 Dog+3.283
Snow+2.714
Brown+2.617
 Tess+2.486
Neg logit contributions
ore-9.989
led-9.701
aster-9.489
eling-9.261
oker-9.135
Token,
Token ID11
Feature activation-1.91e+00

Pos logit contributions
 prope+3.223
aters+3.074
Brown+3.018
Snow+2.986
Wh+2.971
Neg logit contributions
ore-3.150
aster-3.126
eling-2.920
 baker-2.839
oker-2.772
Token and
Token ID290
Feature activation-8.01e-01

Pos logit contributions
aters+3.027
 prope+2.955
Brown+2.822
un+2.821
 Tess+2.798
Neg logit contributions
aster-3.490
eling-3.205
 baker-3.051
ore-3.040
 restaurant-3.014
Token rings
Token ID13917
Feature activation-1.91e+00

Pos logit contributions
 prope+1.196
aters+1.192
 pads+1.106
Snow+1.102
phones+1.098
Neg logit contributions
aster-1.522
eling-1.427
ore-1.408
oker-1.364
 baker-1.335
Token.
Token ID13
Feature activation-3.42e-01

Pos logit contributions
 prope+3.650
aters+3.551
Brown+3.448
 pads+3.389
Snow+3.379
Neg logit contributions
aster-2.761
ore-2.735
eling-2.635
 baker-2.428
 restaurant-2.419
Token big
Token ID1263
Feature activation-8.72e-01

Pos logit contributions
aters+1.846
lets+1.777
 prope+1.759
 pads+1.755
phones+1.729
Neg logit contributions
aster-1.937
ore-1.881
oker-1.787
 restaurant-1.781
eling-1.773
Token,
Token ID11
Feature activation-5.89e-01

Pos logit contributions
aters+1.315
 prope+1.315
lets+1.291
 pads+1.282
phones+1.272
Neg logit contributions
aster-1.627
ore-1.564
eling-1.517
oker-1.492
 restaurant-1.475
Token white
Token ID2330
Feature activation-1.54e+00

Pos logit contributions
aters+0.913
 prope+0.909
lets+0.884
 pads+0.867
phones+0.866
Neg logit contributions
aster-1.074
ore-1.025
eling-1.013
oker-0.977
 restaurant-0.950
Token pup
Token ID15552
Feature activation-1.35e+00

Pos logit contributions
 prope+2.433
aters+2.373
lets+2.371
 pads+2.322
phones+2.319
Neg logit contributions
aster-2.834
ore-2.824
 restaurant-2.631
oker-2.561
 baker-2.514
Token with
Token ID351
Feature activation-2.97e+00

Pos logit contributions
 prope+2.537
aters+2.448
lets+2.327
 pads+2.320
phones+2.317
Neg logit contributions
ore-2.113
aster-2.084
eling-1.928
oker-1.873
 restaurant-1.851
Token floppy
Token ID49415
Feature activation-3.82e+00

Pos logit contributions
aters+5.744
lets+5.387
 prope+5.346
phones+5.240
Brown+5.061
Neg logit contributions
aster-4.705
ore-4.557
eling-4.491
 salsa-4.172
 restaurant-4.025
Token ears
Token ID11368
Feature activation-2.55e+00

Pos logit contributions
Brown+5.179
 mate+5.131
 prope+5.130
Wh+5.025
aters+4.940
Neg logit contributions
eling-7.594
aster-7.520
ore-7.430
 salsa-6.980
 restaurant-6.940
Token and
Token ID290
Feature activation-2.47e+00

Pos logit contributions
 prope+4.853
Brown+4.760
 Tess+4.741
Wh+4.737
aters+4.699
Neg logit contributions
ore-3.947
aster-3.711
eling-3.441
 salsa-3.203
 restaurant-3.185
Token a
Token ID257
Feature activation-1.96e+00

Pos logit contributions
aters+4.011
 prope+3.833
Brown+3.705
phones+3.692
lets+3.669
Neg logit contributions
aster-4.473
eling-4.237
ore-4.195
 salsa-4.115
 restaurant-3.971
Token curly
Token ID45731
Feature activation-2.06e+00

Pos logit contributions
aters+3.247
lets+3.029
phones+3.021
 pads+2.981
 prope+2.953
Neg logit contributions
aster-3.429
ore-3.427
oker-3.189
eling-3.188
 restaurant-3.152
Token tail
Token ID7894
Feature activation-1.98e+00

Pos logit contributions
aters+2.198
 prope+2.171
phones+2.108
Brown+2.079
ocular+2.067
Neg logit contributions
aster-4.812
ore-4.548
eling-4.437
oker-4.259
 salsa-4.223
Token a
Token ID257
Feature activation+2.30e-02

Pos logit contributions
 prope+1.912
lets+1.851
aters+1.838
phones+1.834
 pads+1.780
Neg logit contributions
aster-2.388
ore-2.181
eling-2.098
 restaurant-2.080
 baker-2.073
Token little
Token ID1310
Feature activation+3.80e-02

Pos logit contributions
aster+0.045
ore+0.043
eling+0.043
oker+0.042
 baker+0.040
Neg logit contributions
 prope-0.033
aters-0.032
 pads-0.031
lets-0.031
phones-0.030
Token rabbit
Token ID22746
Feature activation-7.56e-01

Pos logit contributions
aster+0.048
ore+0.046
eling+0.045
oker+0.042
 baker+0.040
Neg logit contributions
 prope-0.081
aters-0.080
 pads-0.079
lets-0.078
phones-0.077
Token named
Token ID3706
Feature activation-1.89e+00

Pos logit contributions
 prope+1.055
aters+1.026
 pads+0.986
phones+0.971
lets+0.961
Neg logit contributions
aster-1.556
ore-1.523
eling-1.442
oker-1.396
 restaurant-1.376
Token Bun
Token ID28515
Feature activation-1.95e+00

Pos logit contributions
 prope+2.903
aters+2.600
lets+2.460
phones+2.428
 pads+2.403
Neg logit contributions
aster-3.958
ore-3.807
eling-3.600
 restaurant-3.540
 salsa-3.470
Token.
Token ID13
Feature activation-3.82e+00

Pos logit contributions
 prope+3.896
aters+3.684
phones+3.674
lets+3.461
 Tess+3.420
Neg logit contributions
aster-2.938
ore-2.904
eling-2.811
in-2.540
oker-2.473
Token Bun
Token ID28515
Feature activation-1.72e+00

Pos logit contributions
lets+6.100
 prope+5.631
 mate+5.560
phones+5.509
 pads+5.376
Neg logit contributions
ore-7.778
he-6.843
-6.777
aster-6.678
eling-6.625
Token had
Token ID550
Feature activation-3.22e+00

Pos logit contributions
 prope+2.784
aters+2.685
phones+2.639
lets+2.468
 mate+2.443
Neg logit contributions
ore-3.163
eling-3.103
aster-3.096
 restaurant-2.817
oker-2.809
Token big
Token ID1263
Feature activation-2.45e+00

Pos logit contributions
aters+6.645
lets+6.366
 prope+6.177
phones+6.177
ocular+5.745
Neg logit contributions
aster-5.147
ore-4.545
 baker-4.225
 restaurant-4.146
 salsa-4.128
Token ears
Token ID11368
Feature activation-2.40e+00

Pos logit contributions
Snow+3.856
phones+3.822
Brown+3.786
 prope+3.775
Wh+3.696
Neg logit contributions
eling-4.260
aster-4.247
ore-4.138
 restaurant-4.125
 salsa-3.974
Token that
Token ID326
Feature activation-1.66e+00

Pos logit contributions
 prope+4.767
aters+4.660
phones+4.522
 Tess+4.444
Wh+4.420
Neg logit contributions
ore-3.729
aster-3.527
 baker-3.148
eling-3.108
 restaurant-3.095
Token ch
Token ID442
Feature activation+1.40e+00

Pos logit contributions
aster+0.429
ore+0.414
eling+0.410
oker+0.396
 restaurant+0.377
Neg logit contributions
 prope-0.298
aters-0.287
 pads-0.281
lets-0.277
phones-0.271
Tokenubby
Token ID38393
Feature activation-2.90e-01

Pos logit contributions
aster+2.201
eling+2.126
ore+2.016
oker+1.926
 restaurant+1.879
Neg logit contributions
 prope-2.625
aters-2.494
 mate-2.428
un-2.426
lets-2.414
Token bunny
Token ID44915
Feature activation-7.70e-01

Pos logit contributions
 prope+0.421
aters+0.412
 pads+0.405
lets+0.396
phones+0.396
Neg logit contributions
aster-0.561
ore-0.542
eling-0.528
oker-0.515
 baker-0.505
Token named
Token ID3706
Feature activation-1.75e+00

Pos logit contributions
 prope+1.083
aters+1.052
 pads+1.010
phones+1.001
lets+0.979
Neg logit contributions
aster-1.568
ore-1.552
eling-1.458
oker-1.419
 restaurant-1.399
Token Bun
Token ID28515
Feature activation-1.91e+00

Pos logit contributions
 prope+2.725
aters+2.491
lets+2.401
phones+2.361
 pads+2.331
Neg logit contributions
aster-3.548
ore-3.412
eling-3.203
 restaurant-3.185
 baker-3.107
Token.
Token ID13
Feature activation-3.80e+00

Pos logit contributions
 prope+3.967
phones+3.773
aters+3.772
lets+3.558
 Tess+3.546
Neg logit contributions
aster-2.706
ore-2.689
eling-2.591
in-2.298
 restaurant-2.276
Token Bun
Token ID28515
Feature activation-1.50e+00

Pos logit contributions
lets+5.687
 prope+5.120
 mate+5.114
 pads+5.044
phones+5.037
Neg logit contributions
ore-8.334
-7.332
he-7.329
aster-7.152
eling-7.032
Token loved
Token ID6151
Feature activation-2.02e-01

Pos logit contributions
 prope+2.432
aters+2.357
phones+2.336
lets+2.201
 mate+2.141
Neg logit contributions
ore-2.710
aster-2.678
eling-2.672
 restaurant-2.434
oker-2.411
Token to
Token ID284
Feature activation-5.58e-01

Pos logit contributions
 prope+0.319
aters+0.309
 pads+0.301
lets+0.300
phones+0.295
Neg logit contributions
aster-0.369
ore-0.354
eling-0.348
oker-0.332
 restaurant-0.324
Token eat
Token ID4483
Feature activation+7.14e-01

Pos logit contributions
 prope+0.903
aters+0.877
 pads+0.861
lets+0.858
phones+0.842
Neg logit contributions
aster-1.035
ore-0.962
eling-0.948
oker-0.902
 restaurant-0.888
Token carrots
Token ID34397
Feature activation+1.35e-01

Pos logit contributions
aster+1.292
ore+1.229
eling+1.228
oker+1.209
 baker+1.113
Neg logit contributions
 prope-1.137
aters-1.119
 pads-1.104
lets-1.075
 Dog-1.052
Token.
Token ID13
Feature activation-1.30e+00

Pos logit contributions
 prope+1.349
aters+1.302
phones+1.265
 pads+1.250
lets+1.234
Neg logit contributions
aster-1.100
ore-1.086
eling-1.020
oker-0.978
 restaurant-0.949
Token Fl
Token ID1610
Feature activation-6.93e-01

Pos logit contributions
 prope+1.856
phones+1.749
lets+1.746
aters+1.699
 pads+1.686
Neg logit contributions
ore-2.609
aster-2.558
eling-2.431
 restaurant-2.406
oker-2.350
Tokenoppy
Token ID26696
Feature activation-9.16e-01

Pos logit contributions
 prope+1.389
aters+1.302
phones+1.297
 pads+1.295
lets+1.279
Neg logit contributions
ore-0.995
aster-0.973
eling-0.937
oker-0.908
 restaurant-0.856
Token had
Token ID550
Feature activation-3.34e+00

Pos logit contributions
 prope+1.412
aters+1.371
phones+1.337
 pads+1.276
lets+1.275
Neg logit contributions
aster-1.689
ore-1.677
eling-1.626
 restaurant-1.560
oker-1.549
Token big
Token ID1263
Feature activation-2.34e+00

Pos logit contributions
aters+6.990
 prope+6.741
lets+6.730
phones+6.672
ocular+6.136
Neg logit contributions
aster-4.929
ore-4.667
 restaurant-4.349
 baker-4.244
 salsa-4.233
Token floppy
Token ID49415
Feature activation-3.79e+00

Pos logit contributions
phones+3.670
Snow+3.652
Brown+3.644
Wh+3.595
 prope+3.572
Neg logit contributions
eling-4.053
aster-4.043
ore-4.018
 restaurant-3.990
 salsa-3.783
Token ears
Token ID11368
Feature activation-2.87e+00

Pos logit contributions
Wh+5.090
 prope+5.019
 undone+5.016
aters+5.016
ocular+4.964
Neg logit contributions
 restaurant-7.420
aster-7.370
eling-7.226
ore-7.144
 early-6.946
Token and
Token ID290
Feature activation-2.79e+00

Pos logit contributions
 prope+5.893
aters+5.743
phones+5.618
Brown+5.580
Wh+5.561
Neg logit contributions
ore-4.452
aster-3.955
 baker-3.618
eling-3.598
 restaurant-3.594
Token a
Token ID257
Feature activation-2.50e+00

Pos logit contributions
aters+5.345
 prope+5.165
phones+4.978
 undone+4.713
 mate+4.618
Neg logit contributions
aster-4.544
ore-4.153
 salsa-4.097
eling-4.081
 baker-4.034
Token little
Token ID1310
Feature activation-1.73e+00

Pos logit contributions
aters+4.494
phones+4.295
lets+4.091
 pads+4.050
 prope+3.889
Neg logit contributions
aster-4.174
ore-4.136
 baker-4.135
 restaurant-4.046
 salsa-3.913
Token pink
Token ID11398
Feature activation-2.68e+00

Pos logit contributions
phones+2.713
aters+2.681
 prope+2.656
Brown+2.590
Wh+2.582
Neg logit contributions
ore-3.108
aster-2.959
 restaurant-2.905
 baker-2.852
oker-2.795
Token her
Token ID607
Feature activation-2.33e+00

Pos logit contributions
aster+0.325
ore+0.311
eling+0.309
oker+0.296
 baker+0.279
Neg logit contributions
 prope-0.347
aters-0.338
 pads-0.332
lets-0.328
phones-0.324
Token flashlight
Token ID38371
Feature activation-1.33e+00

Pos logit contributions
aters+2.784
 pads+2.645
lets+2.615
Snow+2.515
 prope+2.513
Neg logit contributions
ore-5.283
aster-5.065
eling-4.850
oker-4.695
 restaurant-4.639
Token and
Token ID290
Feature activation-6.40e-01

Pos logit contributions
 prope+2.155
aters+2.124
Snow+2.045
Brown+2.027
 Tess+2.010
Neg logit contributions
ore-2.298
aster-2.257
eling-2.117
 early-2.054
oker-2.036
Token the
Token ID262
Feature activation-1.28e+00

Pos logit contributions
 prope+1.090
aters+1.089
 pads+1.028
lets+1.011
Brown+1.011
Neg logit contributions
aster-1.085
eling-1.032
ore-1.024
oker-0.954
 restaurant-0.930
Token butterfly
Token ID35113
Feature activation-1.25e+00

Pos logit contributions
aters+1.950
 prope+1.875
lets+1.815
 pads+1.811
phones+1.786
Neg logit contributions
aster-2.400
ore-2.325
eling-2.288
oker-2.244
 baker-2.200
Token's
Token ID338
Feature activation-3.76e+00

Pos logit contributions
 prope+2.089
aters+2.086
Brown+1.965
 Tess+1.938
phones+1.933
Neg logit contributions
aster-2.108
ore-2.097
eling-2.027
oker-1.877
 baker-1.840
Token wings
Token ID12098
Feature activation-1.30e+00

Pos logit contributions
aters+6.778
Wh+6.727
Brown+6.556
Nut+6.474
lets+6.446
Neg logit contributions
ore-6.735
aster-5.787
 early-5.555
eling-5.455
 oven-5.355
Token were
Token ID547
Feature activation-2.20e+00

Pos logit contributions
aters+2.375
 prope+2.360
phones+2.278
Brown+2.274
 pads+2.240
Neg logit contributions
ore-1.988
aster-1.987
eling-1.932
oker-1.758
 baker-1.659
Token so
Token ID523
Feature activation-5.82e-01

Pos logit contributions
aters+3.699
phones+3.625
 prope+3.574
Brown+3.570
Wh+3.511
Neg logit contributions
eling-3.713
aster-3.681
ore-3.592
 early-3.263
oker-3.246
Token colorful
Token ID20239
Feature activation-1.27e+00

Pos logit contributions
 prope+0.900
aters+0.898
 pads+0.869
phones+0.867
lets+0.854
Neg logit contributions
aster-1.059
ore-1.049
eling-1.025
oker-0.992
 baker-0.945
Token!
Token ID0
Feature activation+2.12e-01

Pos logit contributions
 prope+2.398
aters+2.275
Brown+2.245
 Tess+2.229
phones+2.224
Neg logit contributions
ore-1.949
aster-1.897
eling-1.767
oker-1.729
 restaurant-1.713
Token
Token ID198
Feature activation-4.37e-01

Pos logit contributions
 prope+0.383
aters+0.371
 pads+0.364
lets+0.361
phones+0.357
Neg logit contributions
aster-0.405
ore-0.391
eling-0.381
oker-0.366
 restaurant-0.349
TokenM
Token ID44
Feature activation+1.46e+00

Pos logit contributions
 prope+0.721
aters+0.695
 pads+0.681
lets+0.674
phones+0.672
Neg logit contributions
aster-0.796
ore-0.757
eling-0.750
oker-0.720
 restaurant-0.682
Tokenilly
Token ID6548
Feature activation-1.23e+00

Pos logit contributions
aster+2.459
ore+2.323
eling+2.288
oker+2.224
roads+2.177
Neg logit contributions
aters-2.499
 pads-2.378
 Tess-2.374
 prope-2.352
lets-2.342
Token and
Token ID290
Feature activation+5.91e-01

Pos logit contributions
aters+2.027
 prope+2.011
 pads+1.886
phones+1.869
lets+1.856
Neg logit contributions
aster-2.155
eling-2.083
ore-2.066
 baker-1.910
oker-1.907
Token her
Token ID607
Feature activation-3.11e+00

Pos logit contributions
aster+1.003
ore+0.974
eling+0.953
oker+0.935
 baker+0.870
Neg logit contributions
 prope-1.007
aters-0.975
 pads-0.960
 Tess-0.955
lets-0.951
Token new
Token ID649
Feature activation-3.75e+00

Pos logit contributions
aters+4.618
lets+4.505
 prope+4.495
 pads+4.406
 mate+4.206
Neg logit contributions
aster-5.742
ore-5.713
 restaurant-5.616
eling-5.583
 baker-5.360
Token friend
Token ID1545
Feature activation-1.38e+00

Pos logit contributions
 prope+6.168
 pads+6.166
phones+5.995
Wh+5.965
aters+5.921
Neg logit contributions
 restaurant-6.699
ore-6.694
 baker-6.280
 healer-6.039
 early-5.952
Token played
Token ID2826
Feature activation+5.75e-01

Pos logit contributions
 prope+2.245
aters+2.171
phones+2.046
Snow+2.008
Brown+2.006
Neg logit contributions
aster-2.508
ore-2.386
eling-2.376
oker-2.262
 restaurant-2.237
Token together
Token ID1978
Feature activation+2.49e-01

Pos logit contributions
aster+1.062
eling+1.015
ore+1.014
oker+0.971
 baker+0.922
Neg logit contributions
 prope-0.897
aters-0.867
 pads-0.858
lets-0.844
phones-0.828
Token until
Token ID1566
Feature activation+5.92e-01

Pos logit contributions
aster+0.476
ore+0.458
eling+0.454
oker+0.438
 baker+0.417
Neg logit contributions
 prope-0.370
aters-0.359
 pads-0.353
lets-0.348
phones-0.340
Token M
Token ID337
Feature activation+7.67e-01

Pos logit contributions
aster+0.947
ore+0.927
eling+0.919
oker+0.886
 baker+0.828
Neg logit contributions
 prope-1.048
aters-1.011
 pads-1.001
 Tess-0.998
lets-0.981
Token play
Token ID711
Feature activation+8.04e-01

Pos logit contributions
 prope+0.997
aters+0.961
 pads+0.950
lets+0.930
phones+0.914
Neg logit contributions
aster-1.078
ore-1.005
eling-0.968
oker-0.953
 restaurant-0.924
Token with
Token ID351
Feature activation+1.69e-01

Pos logit contributions
aster+1.411
eling+1.364
ore+1.331
oker+1.290
 baker+1.214
Neg logit contributions
 prope-1.333
aters-1.301
 pads-1.280
lets-1.265
phones-1.243
Token lasers
Token ID34628
Feature activation-3.46e-01

Pos logit contributions
aster+0.318
ore+0.306
eling+0.303
oker+0.292
 baker+0.278
Neg logit contributions
 prope-0.258
aters-0.250
 pads-0.246
lets-0.242
phones-0.238
Token.
Token ID13
Feature activation+5.22e-01

Pos logit contributions
 prope+0.606
aters+0.587
 pads+0.574
lets+0.569
phones+0.568
Neg logit contributions
aster-0.565
ore-0.551
eling-0.530
oker-0.515
 restaurant-0.495
Token They
Token ID1119
Feature activation-2.24e+00

Pos logit contributions
aster+1.143
eling+1.115
ore+1.111
oker+1.069
 baker+1.025
Neg logit contributions
aters-0.622
 prope-0.622
 pads-0.606
lets-0.588
 Tess-0.566
Token had
Token ID550
Feature activation-3.74e+00

Pos logit contributions
aters+3.845
 prope+3.778
phones+3.615
 pads+3.516
Snow+3.437
Neg logit contributions
aster-3.890
eling-3.657
ore-3.473
oker-3.465
 restaurant-3.441
Token red
Token ID2266
Feature activation-1.96e+00

Pos logit contributions
aters+9.417
phones+8.665
lets+8.585
Brown+8.575
ker+8.473
Neg logit contributions
aster-4.224
 restaurant-4.104
 baker-3.293
 contest-3.293
 early-3.250
Token lasers
Token ID34628
Feature activation-1.60e+00

Pos logit contributions
 prope+3.185
Snow+3.184
 Tess+3.143
Brown+3.115
 Dog+3.098
Neg logit contributions
aster-3.365
ore-3.318
eling-3.041
 restaurant-2.983
oker-2.940
Token and
Token ID290
Feature activation-2.96e+00

Pos logit contributions
 prope+2.635
aters+2.567
 Tess+2.495
Snow+2.490
Brown+2.475
Neg logit contributions
ore-2.763
aster-2.677
oker-2.525
eling-2.511
 baker-2.474
Token blue
Token ID4171
Feature activation-2.08e+00

Pos logit contributions
aters+5.380
Snow+4.690
 Tess+4.665
 prope+4.655
Brown+4.633
Neg logit contributions
aster-5.049
ore-4.551
eling-4.502
 salsa-4.372
oker-4.330
Token lasers
Token ID34628
Feature activation-1.29e+00

Pos logit contributions
Snow+3.611
 Tess+3.594
 prope+3.566
Brown+3.553
 Dog+3.539
Neg logit contributions
ore-3.472
aster-3.307
eling-3.042
oker-2.997
 restaurant-2.974
Token a
Token ID257
Feature activation+1.74e-01

Pos logit contributions
 prope+1.304
aters+1.265
lets+1.256
phones+1.241
 pads+1.226
Neg logit contributions
aster-1.748
ore-1.625
eling-1.577
oker-1.528
 baker-1.526
Token silly
Token ID14397
Feature activation+1.54e-01

Pos logit contributions
aster+0.345
ore+0.333
eling+0.330
oker+0.318
 baker+0.303
Neg logit contributions
 prope-0.245
aters-0.236
 pads-0.232
lets-0.228
phones-0.224
Token rabbit
Token ID22746
Feature activation-2.89e-01

Pos logit contributions
aster+0.296
ore+0.285
eling+0.283
oker+0.272
 baker+0.259
Neg logit contributions
 prope-0.227
aters-0.220
 pads-0.216
lets-0.213
phones-0.209
Token named
Token ID3706
Feature activation-1.76e+00

Pos logit contributions
 prope+0.376
aters+0.363
 pads+0.352
lets+0.345
phones+0.342
Neg logit contributions
aster-0.614
ore-0.597
eling-0.579
oker-0.562
 restaurant-0.546
Token Bun
Token ID28515
Feature activation-1.86e+00

Pos logit contributions
 prope+2.752
aters+2.512
lets+2.407
phones+2.348
 pads+2.334
Neg logit contributions
aster-3.602
ore-3.388
eling-3.225
 restaurant-3.217
 salsa-3.157
Token.
Token ID13
Feature activation-3.73e+00

Pos logit contributions
 prope+3.826
aters+3.625
phones+3.612
lets+3.420
 Tess+3.402
Neg logit contributions
aster-2.688
ore-2.650
eling-2.552
in-2.266
 restaurant-2.262
Token Bun
Token ID28515
Feature activation-1.35e+00

Pos logit contributions
lets+6.289
 pads+5.705
 prope+5.687
phones+5.608
 mate+5.586
Neg logit contributions
ore-7.422
aster-6.310
-6.300
he-6.299
eling-6.168
Token liked
Token ID8288
Feature activation-2.01e-01

Pos logit contributions
 prope+2.190
aters+2.115
phones+2.070
lets+1.969
 pads+1.924
Neg logit contributions
ore-2.448
aster-2.434
eling-2.410
oker-2.209
 restaurant-2.200
Token to
Token ID284
Feature activation+1.70e-01

Pos logit contributions
 prope+0.317
aters+0.310
 pads+0.301
lets+0.299
phones+0.294
Neg logit contributions
aster-0.367
ore-0.352
eling-0.345
oker-0.331
 restaurant-0.322
Token hop
Token ID1725
Feature activation+2.77e-01

Pos logit contributions
aster+0.315
ore+0.303
eling+0.300
oker+0.289
 baker+0.276
Neg logit contributions
 prope-0.264
aters-0.257
 pads-0.252
lets-0.248
phones-0.244
Token and
Token ID290
Feature activation-8.87e-02

Pos logit contributions
aster+0.463
eling+0.439
ore+0.438
oker+0.418
 baker+0.397
Neg logit contributions
 prope-0.482
aters-0.470
 pads-0.464
lets-0.457
phones-0.449
Token small
Token ID1402
Feature activation-4.16e-01

Pos logit contributions
aters+0.740
 prope+0.736
 pads+0.714
lets+0.711
phones+0.705
Neg logit contributions
aster-0.852
ore-0.813
eling-0.787
oker-0.777
 restaurant-0.761
Token,
Token ID11
Feature activation-6.74e-01

Pos logit contributions
 prope+0.705
aters+0.690
 pads+0.677
lets+0.672
phones+0.670
Neg logit contributions
aster-0.703
ore-0.681
eling-0.652
oker-0.642
 restaurant-0.627
Token brown
Token ID7586
Feature activation-9.90e-01

Pos logit contributions
 prope+0.998
aters+0.996
lets+0.956
phones+0.943
 pads+0.920
Neg logit contributions
aster-1.296
ore-1.220
eling-1.211
oker-1.169
 restaurant-1.148
Token puppy
Token ID26188
Feature activation-1.73e+00

Pos logit contributions
 prope+1.550
aters+1.508
lets+1.487
 pads+1.483
phones+1.480
Neg logit contributions
aster-1.815
ore-1.768
 restaurant-1.666
 baker-1.636
eling-1.630
Token with
Token ID351
Feature activation-2.85e+00

Pos logit contributions
 prope+3.186
aters+3.057
phones+2.895
 pads+2.847
lets+2.846
Neg logit contributions
ore-2.861
aster-2.799
eling-2.548
 restaurant-2.542
 baker-2.511
Token floppy
Token ID49415
Feature activation-3.72e+00

Pos logit contributions
aters+5.478
 prope+5.230
lets+5.197
phones+5.057
 pads+4.867
Neg logit contributions
aster-4.501
ore-4.307
eling-4.197
 salsa-3.973
 baker-3.818
Token ears
Token ID11368
Feature activation-2.62e+00

Pos logit contributions
 prope+5.144
Brown+5.096
ocular+5.055
aters+5.001
Wh+4.902
Neg logit contributions
aster-7.391
eling-7.045
ore-6.886
 salsa-6.749
 restaurant-6.713
Token and
Token ID290
Feature activation-2.34e+00

Pos logit contributions
 prope+5.116
Brown+4.932
 Tess+4.911
aters+4.896
phones+4.869
Neg logit contributions
ore-4.056
aster-3.889
eling-3.409
 salsa-3.355
 baker-3.331
Token a
Token ID257
Feature activation-2.32e+00

Pos logit contributions
aters+3.938
 prope+3.766
phones+3.674
Brown+3.598
lets+3.598
Neg logit contributions
aster-4.184
ore-3.792
eling-3.786
 salsa-3.784
 restaurant-3.627
Token w
Token ID266
Feature activation+7.11e-02

Pos logit contributions
aters+3.640
phones+3.516
lets+3.411
 prope+3.314
 Tess+3.309
Neg logit contributions
aster-4.189
ore-4.118
 restaurant-3.871
 salsa-3.856
 baker-3.850
Tokenagging
Token ID16406
Feature activation-2.04e+00

Pos logit contributions
aster+0.117
ore+0.112
eling+0.111
oker+0.106
 baker+0.100
Neg logit contributions
 prope-0.126
aters-0.122
 pads-0.120
lets-0.119
phones-0.117
Token was
Token ID373
Feature activation-9.44e-01

Pos logit contributions
 prope+0.641
aters+0.623
 pads+0.611
lets+0.608
phones+0.606
Neg logit contributions
aster-0.593
ore-0.572
eling-0.556
oker-0.531
 restaurant-0.514
Token a
Token ID257
Feature activation+5.45e-02

Pos logit contributions
 prope+1.416
aters+1.371
lets+1.371
phones+1.352
 pads+1.335
Neg logit contributions
aster-1.872
ore-1.728
eling-1.681
 restaurant-1.634
 baker-1.633
Token bunny
Token ID44915
Feature activation-5.29e-01

Pos logit contributions
aster+0.107
ore+0.103
eling+0.102
oker+0.099
 baker+0.095
Neg logit contributions
 prope-0.078
aters-0.076
 pads-0.074
lets-0.073
phones-0.072
Token named
Token ID3706
Feature activation-1.71e+00

Pos logit contributions
 prope+0.704
aters+0.684
 pads+0.660
phones+0.649
lets+0.642
Neg logit contributions
aster-1.114
ore-1.093
eling-1.044
oker-1.015
 restaurant-0.996
Token Bun
Token ID28515
Feature activation-1.94e+00

Pos logit contributions
 prope+2.576
aters+2.337
lets+2.271
phones+2.231
 pads+2.202
Neg logit contributions
aster-3.515
ore-3.427
eling-3.209
 restaurant-3.191
 baker-3.100
Token.
Token ID13
Feature activation-3.68e+00

Pos logit contributions
 prope+3.863
phones+3.673
aters+3.668
lets+3.470
 Tess+3.416
Neg logit contributions
aster-2.905
ore-2.885
eling-2.797
in-2.512
 restaurant-2.486
Token Bun
Token ID28515
Feature activation-1.40e+00

Pos logit contributions
lets+5.854
 mate+5.399
 prope+5.375
phones+5.233
 pads+5.217
Neg logit contributions
ore-7.537
aster-6.518
he-6.514
eling-6.452
-6.433
Token loved
Token ID6151
Feature activation-7.16e-02

Pos logit contributions
 prope+2.258
aters+2.181
phones+2.154
lets+2.050
 mate+1.995
Neg logit contributions
ore-2.550
aster-2.516
eling-2.512
 restaurant-2.297
oker-2.277
Token to
Token ID284
Feature activation-5.66e-01

Pos logit contributions
 prope+0.111
aters+0.108
 pads+0.106
lets+0.105
phones+0.103
Neg logit contributions
aster-0.132
ore-0.127
eling-0.125
oker-0.120
 restaurant-0.116
Token eat
Token ID4483
Feature activation+7.66e-01

Pos logit contributions
 prope+0.944
aters+0.917
 pads+0.903
lets+0.899
phones+0.883
Neg logit contributions
aster-1.020
ore-0.944
eling-0.929
oker-0.884
 restaurant-0.871
Token cel
Token ID18725
Feature activation+6.60e-01

Pos logit contributions
aster+1.381
ore+1.312
eling+1.311
oker+1.293
 baker+1.185
Neg logit contributions
 prope-1.225
aters-1.208
 pads-1.189
lets-1.157
 Tess-1.134
Token a
Token ID257
Feature activation+9.12e-02

Pos logit contributions
 prope+1.408
aters+1.360
lets+1.352
phones+1.342
 pads+1.318
Neg logit contributions
aster-1.859
ore-1.724
eling-1.668
 baker-1.626
 restaurant-1.624
Token little
Token ID1310
Feature activation+1.07e-01

Pos logit contributions
aster+0.182
ore+0.175
eling+0.173
oker+0.167
 baker+0.160
Neg logit contributions
 prope-0.128
aters-0.124
 pads-0.122
lets-0.120
phones-0.118
Token girl
Token ID2576
Feature activation-7.93e-01

Pos logit contributions
aster+0.139
ore+0.132
eling+0.130
oker+0.123
 baker+0.114
Neg logit contributions
 prope-0.224
aters-0.219
 pads-0.216
lets-0.214
phones-0.211
Token named
Token ID3706
Feature activation-7.23e-01

Pos logit contributions
 prope+1.095
aters+1.060
 pads+1.029
phones+1.005
lets+0.996
Neg logit contributions
aster-1.667
ore-1.595
eling-1.530
oker-1.479
 baker-1.460
Token Beth
Token ID19223
Feature activation-1.01e+00

Pos logit contributions
 prope+1.372
aters+1.310
 pads+1.277
lets+1.256
phones+1.253
Neg logit contributions
aster-1.159
ore-1.101
eling-1.048
oker-1.021
 baker-1.002
Token.
Token ID13
Feature activation-3.67e+00

Pos logit contributions
 prope+1.910
aters+1.835
phones+1.810
 pads+1.805
lets+1.764
Neg logit contributions
aster-1.582
ore-1.502
eling-1.481
oker-1.348
 baker-1.346
Token Beth
Token ID19223
Feature activation-4.67e-01

Pos logit contributions
 prope+8.162
lets+7.832
 mate+7.704
 pads+7.675
phones+7.426
Neg logit contributions
ore-5.476
he-4.526
aster-4.288
-4.284
 early-4.233
Token was
Token ID373
Feature activation-2.11e-01

Pos logit contributions
 prope+0.675
aters+0.650
 pads+0.633
phones+0.623
lets+0.623
Neg logit contributions
aster-0.916
ore-0.885
eling-0.876
oker-0.831
 restaurant-0.811
Token nice
Token ID3621
Feature activation-9.57e-01

Pos logit contributions
 prope+0.301
aters+0.294
 pads+0.288
lets+0.285
phones+0.278
Neg logit contributions
aster-0.418
ore-0.402
eling-0.398
oker-0.381
 baker-0.372
Token,
Token ID11
Feature activation-1.32e+00

Pos logit contributions
 prope+1.475
 pads+1.359
aters+1.351
phones+1.342
lets+1.325
Neg logit contributions
ore-1.821
aster-1.806
 restaurant-1.715
 baker-1.711
oker-1.680
Token but
Token ID475
Feature activation-9.74e-01

Pos logit contributions
 prope+1.936
aters+1.891
lets+1.862
phones+1.827
tails+1.804
Neg logit contributions
aster-2.649
ore-2.485
 restaurant-2.399
eling-2.389
 baker-2.382
Token,
Token ID11
Feature activation+6.70e-02

Pos logit contributions
 prope+0.173
aters+0.170
 pads+0.168
lets+0.165
phones+0.162
Neg logit contributions
aster-0.211
ore-0.203
eling-0.199
oker-0.193
 restaurant-0.186
Token friendly
Token ID8030
Feature activation-3.61e-01

Pos logit contributions
aster+0.126
ore+0.121
eling+0.120
oker+0.116
 restaurant+0.111
Neg logit contributions
 prope-0.101
aters-0.098
 pads-0.096
lets-0.095
phones-0.093
Token brown
Token ID7586
Feature activation-5.46e-01

Pos logit contributions
 prope+0.560
aters+0.555
 pads+0.550
lets+0.537
phones+0.529
Neg logit contributions
aster-0.662
ore-0.640
eling-0.615
oker-0.604
 restaurant-0.590
Token dog
Token ID3290
Feature activation-9.76e-01

Pos logit contributions
aters+0.799
 prope+0.797
 pads+0.793
lets+0.771
phones+0.754
Neg logit contributions
aster-1.054
ore-1.021
eling-0.971
oker-0.952
 restaurant-0.944
Token with
Token ID351
Feature activation-2.55e+00

Pos logit contributions
 prope+1.652
aters+1.615
 pads+1.550
lets+1.529
phones+1.507
Neg logit contributions
aster-1.704
ore-1.697
eling-1.602
oker-1.513
 restaurant-1.506
Token floppy
Token ID49415
Feature activation-3.63e+00

Pos logit contributions
aters+4.774
lets+4.589
 pads+4.438
phones+4.430
 prope+4.416
Neg logit contributions
aster-4.271
ore-4.004
eling-3.969
 restaurant-3.731
 baker-3.706
Token ears
Token ID11368
Feature activation-2.45e+00

Pos logit contributions
aters+4.452
Brown+4.431
 prope+4.429
Wh+4.400
Snow+4.342
Neg logit contributions
aster-7.689
ore-7.417
eling-7.365
 restaurant-7.170
 salsa-6.920
Token.
Token ID13
Feature activation+6.79e-01

Pos logit contributions
 prope+4.567
Brown+4.493
aters+4.482
Wh+4.459
 Tess+4.428
Neg logit contributions
ore-3.899
aster-3.683
eling-3.396
 restaurant-3.208
 baker-3.188
Token The
Token ID383
Feature activation-4.27e-01

Pos logit contributions
aster+1.244
eling+1.215
ore+1.172
oker+1.148
 baker+1.080
Neg logit contributions
 prope-1.068
aters-1.039
 pads-1.029
 Tess-1.012
 Dog-1.012
Token dog
Token ID3290
Feature activation-1.39e+00

Pos logit contributions
 prope+0.699
aters+0.690
lets+0.672
 pads+0.668
phones+0.655
Neg logit contributions
aster-0.762
ore-0.733
eling-0.714
oker-0.692
 restaurant-0.680
Token started
Token ID2067
Feature activation-9.50e-01

Pos logit contributions
aters+2.597
 prope+2.546
 pads+2.389
phones+2.372
lets+2.323
Neg logit contributions
eling-2.134
aster-2.124
ore-2.090
 restaurant-1.942
 baker-1.909
Token was
Token ID373
Feature activation-9.82e-01

Pos logit contributions
 prope+0.329
aters+0.321
 pads+0.315
lets+0.311
phones+0.309
Neg logit contributions
aster-0.319
ore-0.307
eling-0.301
oker-0.288
 restaurant-0.276
Token a
Token ID257
Feature activation+6.96e-02

Pos logit contributions
 prope+1.488
aters+1.429
lets+1.428
phones+1.415
 pads+1.376
Neg logit contributions
aster-1.949
ore-1.805
eling-1.757
oker-1.692
 restaurant-1.683
Token girl
Token ID2576
Feature activation-9.41e-01

Pos logit contributions
aster+0.139
ore+0.134
eling+0.132
oker+0.128
 baker+0.123
Neg logit contributions
 prope-0.098
aters-0.095
 pads-0.093
lets-0.092
phones-0.090
Token named
Token ID3706
Feature activation-9.37e-01

Pos logit contributions
 prope+1.328
aters+1.288
 pads+1.242
phones+1.220
lets+1.213
Neg logit contributions
aster-1.969
ore-1.878
eling-1.789
oker-1.720
 baker-1.710
Token Beth
Token ID19223
Feature activation-1.10e+00

Pos logit contributions
 prope+1.838
aters+1.741
 pads+1.692
lets+1.670
phones+1.666
Neg logit contributions
aster-1.475
ore-1.385
eling-1.296
oker-1.285
 baker-1.266
Token.
Token ID13
Feature activation-3.62e+00

Pos logit contributions
 prope+2.064
aters+1.989
phones+1.961
 pads+1.939
lets+1.909
Neg logit contributions
aster-1.735
ore-1.642
eling-1.621
 baker-1.470
oker-1.470
Token Beth
Token ID19223
Feature activation-5.28e-01

Pos logit contributions
 prope+7.997
lets+7.651
 mate+7.577
 pads+7.307
phones+7.206
Neg logit contributions
ore-5.429
he-4.547
aster-4.477
-4.393
 early-4.393
Token wanted
Token ID2227
Feature activation-6.45e-02

Pos logit contributions
 prope+0.766
aters+0.739
 pads+0.717
phones+0.709
lets+0.707
Neg logit contributions
aster-1.034
ore-0.998
eling-0.990
oker-0.935
 restaurant-0.916
Token to
Token ID284
Feature activation-2.43e-01

Pos logit contributions
 prope+0.100
aters+0.098
 pads+0.095
lets+0.095
phones+0.093
Neg logit contributions
aster-0.120
ore-0.115
eling-0.113
oker-0.109
 restaurant-0.105
Token go
Token ID467
Feature activation+3.20e-01

Pos logit contributions
 prope+0.408
aters+0.396
 pads+0.392
lets+0.386
phones+0.377
Neg logit contributions
aster-0.431
ore-0.404
eling-0.398
oker-0.384
 restaurant-0.369
Token to
Token ID284
Feature activation+9.08e-01

Pos logit contributions
aster+0.542
eling+0.517
ore+0.516
oker+0.496
 baker+0.469
Neg logit contributions
 prope-0.543
aters-0.532
 pads-0.524
lets-0.515
phones-0.511
Token swim
Token ID9422
Feature activation-6.00e-01

Pos logit contributions
 prope+2.011
aters+1.929
lets+1.907
 Tess+1.898
Brown+1.868
Neg logit contributions
ore-1.809
aster-1.797
eling-1.716
 restaurant-1.690
oker-1.654
Tokensuits
Token ID16554
Feature activation-1.30e+00

Pos logit contributions
 prope+0.948
aters+0.913
 Tess+0.889
lets+0.875
phones+0.872
Neg logit contributions
ore-1.092
aster-1.082
eling-1.030
oker-1.021
 restaurant-0.992
Token on
Token ID319
Feature activation-6.64e-01

Pos logit contributions
 prope+2.435
aters+2.277
Brown+2.277
lets+2.245
 Tess+2.240
Neg logit contributions
ore-2.043
aster-2.030
 restaurant-1.866
eling-1.857
oker-1.824
Token.
Token ID13
Feature activation-7.34e-01

Pos logit contributions
 prope+1.486
 pads+1.464
lets+1.459
aters+1.457
phones+1.439
Neg logit contributions
aster-0.807
ore-0.732
eling-0.713
oker-0.644
 restaurant-0.641
Token Lily
Token ID20037
Feature activation-8.41e-01

Pos logit contributions
 prope+1.109
aters+1.042
 pads+1.016
lets+1.008
phones+0.990
Neg logit contributions
aster-1.404
ore-1.354
eling-1.305
oker-1.278
 baker-1.273
Token's
Token ID338
Feature activation-3.62e+00

Pos logit contributions
 prope+1.466
aters+1.426
phones+1.368
lets+1.353
 Tess+1.334
Neg logit contributions
aster-1.407
ore-1.362
eling-1.357
 restaurant-1.260
oker-1.257
Token mom
Token ID1995
Feature activation-1.81e+00

Pos logit contributions
 prope+6.532
lets+6.440
Brown+6.390
aters+6.210
 Tess+6.204
Neg logit contributions
 restaurant-6.480
ore-5.989
aster-5.867
 early-5.681
 baker-5.610
Token said
Token ID531
Feature activation-8.07e-01

Pos logit contributions
 prope+3.103
aters+2.788
phones+2.696
lets+2.612
 pads+2.607
Neg logit contributions
aster-3.363
eling-3.326
ore-3.205
 restaurant-3.028
 baker-2.970
Token,
Token ID11
Feature activation-7.11e-01

Pos logit contributions
aters+1.283
 prope+1.268
 pads+1.229
phones+1.203
lets+1.200
Neg logit contributions
aster-1.486
ore-1.440
eling-1.405
oker-1.355
 baker-1.335
Token "
Token ID366
Feature activation-7.41e-01

Pos logit contributions
 prope+1.052
aters+1.028
 pads+0.989
lets+0.973
phones+0.955
Neg logit contributions
aster-1.380
ore-1.331
eling-1.307
 baker-1.249
oker-1.235
TokenBe
Token ID3856
Feature activation+2.03e-01

Pos logit contributions
 prope+1.349
aters+1.290
 pads+1.245
lets+1.244
phones+1.236
Neg logit contributions
aster-1.231
ore-1.155
eling-1.138
oker-1.110
 baker-1.055
TokenOne
Token ID3198
Feature activation+1.05e+00

Pos logit contributions
 prope+0.177
aters+0.168
 pads+0.162
lets+0.161
phones+0.158
Neg logit contributions
aster-0.372
ore-0.358
eling-0.354
oker-0.344
 restaurant-0.333
Token day
Token ID1110
Feature activation-3.08e-01

Pos logit contributions
eling+1.327
aster+1.324
ore+1.267
oker+1.202
 baker+1.120
Neg logit contributions
 pads-2.162
aters-2.146
 prope-2.140
 Dog-2.101
lets-2.089
Token,
Token ID11
Feature activation-4.76e-01

Pos logit contributions
 prope+0.450
aters+0.431
 pads+0.419
lets+0.413
phones+0.407
Neg logit contributions
aster-0.614
ore-0.593
eling-0.580
oker-0.564
 restaurant-0.540
Token Sandy
Token ID17581
Feature activation-2.23e-02

Pos logit contributions
 prope+0.813
aters+0.779
 pads+0.757
lets+0.750
phones+0.742
Neg logit contributions
aster-0.838
ore-0.804
eling-0.789
oker-0.763
 restaurant-0.725
Token and
Token ID290
Feature activation-2.62e-01

Pos logit contributions
 prope+0.036
aters+0.034
 pads+0.034
lets+0.033
phones+0.033
Neg logit contributions
aster-0.040
ore-0.039
eling-0.038
oker-0.037
 restaurant-0.035
Token her
Token ID607
Feature activation-3.61e+00

Pos logit contributions
 prope+0.481
aters+0.468
 pads+0.457
lets+0.454
phones+0.447
Neg logit contributions
aster-0.422
ore-0.398
eling-0.395
oker-0.373
 restaurant-0.361
Token owners
Token ID4393
Feature activation-6.37e-01

Pos logit contributions
 prope+5.906
aters+5.486
lets+5.011
phones+4.875
Brown+4.813
Neg logit contributions
 restaurant-6.806
aster-6.615
ore-6.611
 early-6.502
 baker-6.403
Token went
Token ID1816
Feature activation+8.05e-01

Pos logit contributions
 prope+0.988
aters+0.955
 pads+0.923
lets+0.901
phones+0.899
Neg logit contributions
aster-1.199
ore-1.154
eling-1.151
oker-1.097
 restaurant-1.065
Token for
Token ID329
Feature activation-9.75e-01

Pos logit contributions
aster+1.506
eling+1.447
ore+1.404
oker+1.384
 baker+1.304
Neg logit contributions
 prope-1.238
aters-1.216
 pads-1.214
phones-1.181
 Tess-1.178
Token a
Token ID257
Feature activation-9.87e-01

Pos logit contributions
 prope+1.465
aters+1.460
lets+1.435
 pads+1.421
phones+1.333
Neg logit contributions
aster-1.940
eling-1.778
ore-1.759
 restaurant-1.673
oker-1.649
Token walk
Token ID2513
Feature activation+1.30e-01

Pos logit contributions
aters+1.813
 prope+1.795
lets+1.704
 pads+1.690
phones+1.632
Neg logit contributions
aster-1.644
ore-1.540
eling-1.471
 restaurant-1.461
oker-1.434
Token she
Token ID673
Feature activation-1.05e+00

Pos logit contributions
 prope+0.360
aters+0.354
 pads+0.346
lets+0.341
phones+0.336
Neg logit contributions
aster-0.284
ore-0.269
eling-0.266
oker-0.253
 baker-0.240
Token could
Token ID714
Feature activation-1.00e+00

Pos logit contributions
 prope+1.440
aters+1.390
phones+1.273
 pads+1.268
 Tess+1.254
Neg logit contributions
aster-2.171
ore-2.144
eling-2.077
oker-2.040
 baker-1.980
Token have
Token ID423
Feature activation-9.28e-01

Pos logit contributions
 prope+1.819
aters+1.720
 pads+1.701
lets+1.675
Brown+1.658
Neg logit contributions
aster-1.704
ore-1.609
eling-1.513
oker-1.491
 restaurant-1.411
Token it
Token ID340
Feature activation-5.25e-01

Pos logit contributions
 prope+1.755
aters+1.699
 pads+1.638
lets+1.635
phones+1.624
Neg logit contributions
aster-1.480
eling-1.358
ore-1.319
oker-1.261
 restaurant-1.228
Token.
Token ID13
Feature activation-1.31e-01

Pos logit contributions
 prope+0.921
aters+0.874
 pads+0.852
lets+0.849
phones+0.845
Neg logit contributions
aster-0.882
ore-0.845
eling-0.821
oker-0.801
 restaurant-0.767
Token Her
Token ID2332
Feature activation-3.61e+00

Pos logit contributions
 prope+0.179
aters+0.172
 pads+0.169
lets+0.166
phones+0.163
Neg logit contributions
aster-0.268
ore-0.261
eling-0.254
oker-0.247
 baker-0.238
Token friend
Token ID1545
Feature activation-3.61e-01

Pos logit contributions
 prope+6.964
Brown+6.259
Snow+6.214
aters+6.207
phones+6.193
Neg logit contributions
 baker-5.644
ore-5.551
 restaurant-5.531
 answers-5.219
aster-5.207
Token said
Token ID531
Feature activation-9.29e-01

Pos logit contributions
 prope+0.703
aters+0.686
 pads+0.668
phones+0.657
lets+0.655
Neg logit contributions
aster-0.530
eling-0.501
ore-0.500
oker-0.474
 baker-0.452
Token yes
Token ID3763
Feature activation-7.02e-01

Pos logit contributions
aters+1.040
 prope+0.995
 pads+0.965
lets+0.936
phones+0.924
Neg logit contributions
aster-2.202
ore-2.102
eling-2.063
oker-2.024
 baker-1.993
Token,
Token ID11
Feature activation-4.86e-01

Pos logit contributions
 prope+1.105
aters+1.046
 pads+1.003
phones+0.989
lets+0.984
Neg logit contributions
aster-1.328
ore-1.290
eling-1.257
oker-1.214
 restaurant-1.181
Token and
Token ID290
Feature activation-6.52e-02

Pos logit contributions
 prope+0.578
aters+0.557
 pads+0.532
lets+0.526
phones+0.518
Neg logit contributions
aster-1.088
ore-1.047
eling-1.034
oker-1.004
 baker-0.976
Token a
Token ID257
Feature activation+1.25e-01

Pos logit contributions
 prope+1.510
aters+1.478
lets+1.466
phones+1.441
 pads+1.425
Neg logit contributions
aster-1.939
ore-1.795
eling-1.743
 baker-1.695
 restaurant-1.691
Token little
Token ID1310
Feature activation+1.69e-01

Pos logit contributions
aster+0.250
ore+0.241
eling+0.239
oker+0.231
 baker+0.221
Neg logit contributions
 prope-0.175
aters-0.170
 pads-0.166
lets-0.164
phones-0.161
Token girl
Token ID2576
Feature activation-8.56e-01

Pos logit contributions
aster+0.231
ore+0.218
eling+0.216
oker+0.204
 restaurant+0.190
Neg logit contributions
 prope-0.344
aters-0.336
 pads-0.332
lets-0.328
phones-0.324
Token called
Token ID1444
Feature activation-9.18e-01

Pos logit contributions
 prope+1.191
aters+1.159
 pads+1.122
phones+1.094
lets+1.086
Neg logit contributions
aster-1.795
ore-1.715
eling-1.644
oker-1.585
 baker-1.570
Token Beth
Token ID19223
Feature activation-1.22e+00

Pos logit contributions
 prope+1.782
aters+1.714
 pads+1.697
lets+1.653
phones+1.617
Neg logit contributions
aster-1.446
ore-1.370
oker-1.272
eling-1.245
 baker-1.220
Token.
Token ID13
Feature activation-3.60e+00

Pos logit contributions
 prope+2.349
aters+2.240
phones+2.211
 pads+2.207
lets+2.156
Neg logit contributions
aster-1.871
ore-1.797
eling-1.745
 baker-1.611
oker-1.583
Token She
Token ID1375
Feature activation-6.14e-01

Pos logit contributions
 prope+7.814
lets+7.685
 mate+7.499
 pads+7.461
phones+7.157
Neg logit contributions
ore-5.459
he-4.506
-4.446
 early-4.292
aster-4.288
Token was
Token ID373
Feature activation-2.83e-01

Pos logit contributions
 prope+0.868
aters+0.866
lets+0.813
 pads+0.813
phones+0.807
Neg logit contributions
aster-1.239
ore-1.192
eling-1.173
oker-1.119
 restaurant-1.105
Token very
Token ID845
Feature activation+1.41e+00

Pos logit contributions
 prope+0.408
aters+0.399
 pads+0.391
lets+0.387
phones+0.377
Neg logit contributions
aster-0.556
ore-0.534
eling-0.530
oker-0.506
 baker-0.498
Token intelligent
Token ID12661
Feature activation-2.28e+00

Pos logit contributions
aster+2.818
eling+2.656
ore+2.649
oker+2.531
roads+2.478
Neg logit contributions
 prope-2.004
 Tess-1.897
 Dog-1.889
aters-1.887
Snow-1.872
Token,
Token ID11
Feature activation-1.42e+00

Pos logit contributions
 prope+4.420
phones+4.084
Wh+4.052
Brown+4.048
Snow+3.995
Neg logit contributions
ore-3.650
 baker-3.503
 restaurant-3.467
aster-3.399
 early-3.222
Token."
Token ID526
Feature activation-1.55e+00

Pos logit contributions
aster+0.684
ore+0.653
eling+0.651
oker+0.627
 baker+0.595
Neg logit contributions
 prope-0.567
aters-0.553
 pads-0.543
lets-0.536
phones-0.525
Token So
Token ID1406
Feature activation-1.82e+00

Pos logit contributions
 prope+1.837
aters+1.641
 mate+1.606
 pads+1.567
lets+1.539
Neg logit contributions
ore-3.496
aster-3.494
eling-3.211
 baker-3.180
oker-3.143
Token,
Token ID11
Feature activation-1.38e+00

Pos logit contributions
 prope+3.353
aters+3.335
 pads+3.202
lets+3.048
phones+2.998
Neg logit contributions
aster-2.911
ore-2.791
eling-2.735
oker-2.546
 baker-2.533
Token Kitty
Token ID21168
Feature activation-9.69e-01

Pos logit contributions
 prope+2.142
aters+2.066
 pads+1.982
lets+1.939
 mate+1.876
Neg logit contributions
aster-2.647
ore-2.487
eling-2.441
 baker-2.319
oker-2.295
Token and
Token ID290
Feature activation-1.73e+00

Pos logit contributions
 prope+1.508
aters+1.436
phones+1.384
lets+1.342
 pads+1.336
Neg logit contributions
aster-1.829
ore-1.754
eling-1.718
oker-1.615
 restaurant-1.613
Token Dog
Token ID8532
Feature activation-3.60e+00

Pos logit contributions
 prope+1.091
aters+1.069
lets+0.908
 pads+0.903
 mate+0.785
Neg logit contributions
aster-4.969
eling-4.753
ore-4.598
 baker-4.441
 restaurant-4.389
Tokengy
Token ID1360
Feature activation-1.21e+00

Pos logit contributions
 prope+5.173
aters+4.473
 pads+3.922
 silk+3.895
Snow+3.862
Neg logit contributions
ore-7.751
aster-7.350
eling-7.301
led-6.732
in-6.526
Token curled
Token ID41108
Feature activation-4.11e-01

Pos logit contributions
 prope+2.118
aters+2.074
phones+1.976
 Tess+1.948
lets+1.936
Neg logit contributions
aster-2.072
eling-1.945
ore-1.879
 restaurant-1.800
oker-1.749
Token up
Token ID510
Feature activation+4.59e-01

Pos logit contributions
 prope+0.989
aters+0.979
 pads+0.965
lets+0.958
phones+0.943
Neg logit contributions
aster-0.398
ore-0.378
eling-0.367
oker-0.321
 restaurant-0.309
Token next
Token ID1306
Feature activation-8.40e-01

Pos logit contributions
aster+0.784
eling+0.742
ore+0.741
oker+0.719
 baker+0.672
Neg logit contributions
 prope-0.787
aters-0.763
 pads-0.750
lets-0.738
phones-0.727
Token to
Token ID284
Feature activation-1.15e+00

Pos logit contributions
 prope+1.440
aters+1.396
lets+1.383
 pads+1.376
phones+1.345
Neg logit contributions
aster-1.437
ore-1.411
eling-1.329
 restaurant-1.282
oker-1.267
Token around
Token ID1088
Feature activation+2.37e-01

Pos logit contributions
 prope+1.640
aters+1.638
 pads+1.594
lets+1.586
 Tess+1.567
Neg logit contributions
aster-1.144
ore-1.134
eling-1.075
oker-1.028
 restaurant-0.961
Token the
Token ID262
Feature activation-8.95e-01

Pos logit contributions
aster+0.375
ore+0.356
eling+0.353
oker+0.339
 baker+0.320
Neg logit contributions
 prope-0.436
aters-0.423
 pads-0.416
lets-0.410
phones-0.406
Token house
Token ID2156
Feature activation+5.53e-02

Pos logit contributions
lets+1.715
 prope+1.701
aters+1.696
 pads+1.670
phones+1.668
Neg logit contributions
aster-1.321
ore-1.263
eling-1.189
 restaurant-1.164
oker-1.145
Token.
Token ID13
Feature activation-1.48e+00

Pos logit contributions
aster+0.093
ore+0.089
eling+0.088
oker+0.084
 restaurant+0.080
Neg logit contributions
 prope-0.095
aters-0.092
 pads-0.091
lets-0.090
phones-0.088
Token He
Token ID679
Feature activation-1.73e+00

Pos logit contributions
lets+2.589
 prope+2.586
aters+2.522
 pads+2.519
phones+2.489
Neg logit contributions
ore-2.389
aster-2.368
eling-2.086
oker-2.082
 restaurant-2.079
Token had
Token ID550
Feature activation-3.60e+00

Pos logit contributions
aters+2.571
phones+2.361
 prope+2.353
 Tess+2.349
lets+2.335
Neg logit contributions
aster-3.288
eling-3.250
ore-3.190
 restaurant-2.976
oker-2.973
Token so
Token ID523
Feature activation-7.80e-01

Pos logit contributions
aters+8.291
lets+7.822
phones+7.510
 Tess+7.324
 prope+7.225
Neg logit contributions
aster-4.546
 restaurant-4.095
 cream-3.906
eling-3.900
 early-3.874
Token much
Token ID881
Feature activation-9.88e-01

Pos logit contributions
 prope+1.147
aters+1.142
phones+1.105
lets+1.098
 pads+1.079
Neg logit contributions
aster-1.521
ore-1.453
eling-1.406
oker-1.380
 restaurant-1.313
Token fun
Token ID1257
Feature activation-4.32e-01

Pos logit contributions
 prope+1.447
lets+1.429
aters+1.409
phones+1.373
 pads+1.357
Neg logit contributions
aster-1.933
ore-1.853
eling-1.794
oker-1.773
 restaurant-1.716
Token playing
Token ID2712
Feature activation+4.59e-01

Pos logit contributions
 prope+0.719
aters+0.711
 pads+0.693
lets+0.691
phones+0.671
Neg logit contributions
aster-0.751
ore-0.730
eling-0.715
oker-0.680
 restaurant-0.647
Token with
Token ID351
Feature activation-1.73e+00

Pos logit contributions
aster+0.754
eling+0.718
ore+0.716
oker+0.687
 baker+0.648
Neg logit contributions
 prope-0.817
aters-0.784
 pads-0.775
lets-0.763
phones-0.758
Token big
Token ID1263
Feature activation+1.41e-01

Pos logit contributions
aster+1.839
ore+1.776
eling+1.735
oker+1.730
 baker+1.590
Neg logit contributions
 prope-1.499
aters-1.417
 pads-1.392
 Tess-1.368
lets-1.362
Token and
Token ID290
Feature activation+9.34e-01

Pos logit contributions
aster+0.289
ore+0.278
eling+0.276
oker+0.267
 baker+0.255
Neg logit contributions
 prope-0.192
aters-0.186
 pads-0.182
lets-0.179
phones-0.176
Token he
Token ID339
Feature activation-5.00e-01

Pos logit contributions
aster+1.800
ore+1.784
eling+1.715
oker+1.696
 baker+1.572
Neg logit contributions
 prope-1.387
 pads-1.327
lets-1.294
aters-1.291
 Tess-1.289
Token ran
Token ID4966
Feature activation+2.65e-01

Pos logit contributions
 prope+0.855
aters+0.844
 pads+0.811
lets+0.797
phones+0.797
Neg logit contributions
aster-0.848
eling-0.813
ore-0.803
oker-0.750
 restaurant-0.745
Token inside
Token ID2641
Feature activation+1.80e+00

Pos logit contributions
aster+0.468
ore+0.447
eling+0.446
oker+0.428
 baker+0.404
Neg logit contributions
 prope-0.433
aters-0.421
 pads-0.416
lets-0.409
phones-0.404
Token to
Token ID284
Feature activation+2.21e+00

Pos logit contributions
eling+3.813
aster+3.777
oker+3.721
ore+3.624
 baker+3.442
Neg logit contributions
 prope-2.127
aters-2.091
 Dog-2.055
 Tess-2.046
 forever-2.000
Token get
Token ID651
Feature activation+1.99e+00

Pos logit contributions
ore+4.362
oker+4.228
eling+4.171
aster+4.049
ying+3.924
Neg logit contributions
 Dog-3.080
 Tess-3.064
 prope-3.022
 aunt-2.902
 no-2.897
Token some
Token ID617
Feature activation+2.30e+00

Pos logit contributions
aster+4.358
eling+4.358
oker+4.349
ore+4.318
roads+3.982
Neg logit contributions
 prope-2.273
 Dog-2.214
 Tess-2.197
 Snow-2.183
 Brown-2.162
Token.
Token ID13
Feature activation+3.92e-01

Pos logit contributions
aster+3.984
oker+3.839
eling+3.807
ore+3.791
roads+3.456
Neg logit contributions
aters-3.833
 prope-3.712
 pads-3.672
 Dog-3.612
 wool-3.596
Token
Token ID220
Feature activation+1.40e-01

Pos logit contributions
aster+0.739
eling+0.715
ore+0.707
oker+0.684
 baker+0.646
Neg logit contributions
 prope-0.590
aters-0.578
 pads-0.567
lets-0.556
 Tess-0.549
Token
Token ID198
Feature activation-3.71e-01

Pos logit contributions
aster+0.255
ore+0.245
eling+0.243
oker+0.234
 baker+0.223
Neg logit contributions
 prope-0.219
aters-0.213
 pads-0.210
lets-0.207
phones-0.203
Token drawing
Token ID8263
Feature activation-1.15e-01

Pos logit contributions
 prope+0.266
aters+0.261
 pads+0.255
lets+0.252
phones+0.248
Neg logit contributions
aster-0.279
ore-0.266
eling-0.264
oker-0.252
 restaurant-0.242
Token pictures
Token ID5986
Feature activation-6.85e-01

Pos logit contributions
 prope+0.188
aters+0.182
 pads+0.178
lets+0.176
phones+0.174
Neg logit contributions
aster-0.204
ore-0.195
eling-0.192
oker-0.185
 restaurant-0.177
Token.
Token ID13
Feature activation+8.37e-02

Pos logit contributions
 prope+1.248
aters+1.185
 pads+1.159
lets+1.149
phones+1.142
Neg logit contributions
aster-1.091
ore-1.059
eling-0.998
oker-0.985
 baker-0.951
Token
Token ID198
Feature activation+3.93e-01

Pos logit contributions
aster+0.161
ore+0.155
eling+0.153
oker+0.148
 baker+0.142
Neg logit contributions
 prope-0.125
aters-0.121
 pads-0.118
lets-0.117
phones-0.115
Token
Token ID198
Feature activation-2.95e-01

Pos logit contributions
aster+0.735
eling+0.709
ore+0.702
oker+0.680
 baker+0.646
Neg logit contributions
 prope-0.589
aters-0.575
 pads-0.565
lets-0.555
 Tess-0.546
TokenOne
Token ID3198
Feature activation+2.49e+00

Pos logit contributions
 prope+0.380
aters+0.366
 pads+0.356
lets+0.351
phones+0.344
Neg logit contributions
aster-0.641
ore-0.611
eling-0.606
oker-0.585
 baker-0.567
Token day
Token ID1110
Feature activation-6.96e-02

Pos logit contributions
eling+4.181
ore+3.805
oker+3.684
roads+3.674
aster+3.660
Neg logit contributions
 Dog-4.298
 tail-4.295
 pads-4.287
 aunt-4.261
Snow-4.247
Token,
Token ID11
Feature activation+4.75e-01

Pos logit contributions
 prope+0.096
aters+0.093
 pads+0.091
lets+0.090
phones+0.088
Neg logit contributions
aster-0.142
ore-0.137
eling-0.135
oker-0.131
 baker-0.126
Token she
Token ID673
Feature activation+8.20e-01

Pos logit contributions
aster+0.816
ore+0.790
eling+0.782
oker+0.753
 restaurant+0.704
Neg logit contributions
 prope-0.795
aters-0.770
 pads-0.758
lets-0.753
 Tess-0.748
Token met
Token ID1138
Feature activation-1.12e-01

Pos logit contributions
aster+1.580
ore+1.524
eling+1.498
oker+1.462
 baker+1.385
Neg logit contributions
 prope-1.208
aters-1.174
 pads-1.171
lets-1.156
 Tess-1.115
Token a
Token ID257
Feature activation+6.37e-01

Pos logit contributions
 prope+0.155
aters+0.149
 pads+0.146
lets+0.143
phones+0.141
Neg logit contributions
aster-0.228
ore-0.219
eling-0.216
oker-0.209
 restaurant-0.202
Token by
Token ID416
Feature activation+8.24e-01

Pos logit contributions
aster+1.101
eling+1.053
ore+1.031
oker+1.005
 baker+0.953
Neg logit contributions
 prope-1.010
aters-0.988
 pads-0.979
lets-0.957
phones-0.943
Token and
Token ID290
Feature activation+8.61e-01

Pos logit contributions
aster+1.715
eling+1.644
ore+1.628
oker+1.610
 baker+1.537
Neg logit contributions
 prope-1.092
aters-1.049
 pads-1.027
lets-1.010
 Dog-0.992
Token knocked
Token ID13642
Feature activation+1.81e+00

Pos logit contributions
aster+1.598
ore+1.515
eling+1.479
oker+1.472
 baker+1.361
Neg logit contributions
 prope-1.346
 pads-1.298
lets-1.280
aters-1.272
 Tess-1.267
Token over
Token ID625
Feature activation+5.13e-01

Pos logit contributions
aster+3.277
eling+3.248
oker+3.223
ore+3.057
 baker+3.017
Neg logit contributions
 pads-2.815
aters-2.744
 Dog-2.713
 prope-2.704
 Tess-2.618
Token the
Token ID262
Feature activation+2.76e+00

Pos logit contributions
aster+0.835
ore+0.801
eling+0.800
oker+0.768
 baker+0.716
Neg logit contributions
 prope-0.904
aters-0.884
 pads-0.872
lets-0.862
phones-0.846
Token old
Token ID1468
Feature activation+2.28e+00

Pos logit contributions
eling+5.882
aster+5.669
oker+5.496
ore+5.423
ying+5.108
Neg logit contributions
 tail-3.998
 pads-3.979
 neck-3.848
 scarf-3.838
 prope-3.794
Token man
Token ID582
Feature activation+1.41e+00

Pos logit contributions
eling+3.272
aster+2.943
ore+2.809
oker+2.794
ying+2.457
Neg logit contributions
 scarf-4.786
 prope-4.779
 neck-4.763
 tail-4.751
 pads-4.732
Token's
Token ID338
Feature activation+1.98e+00

Pos logit contributions
aster+2.892
eling+2.818
oker+2.788
ore+2.714
 baker+2.631
Neg logit contributions
 pads-1.875
aters-1.860
lets-1.838
 prope-1.811
phones-1.779
Token plate
Token ID7480
Feature activation+2.91e-01

Pos logit contributions
eling+3.599
aster+3.547
ore+3.414
oker+3.366
roads+3.117
Neg logit contributions
 tail-3.277
 pads-3.232
 neck-3.198
 prope-3.175
aters-3.170
Token.
Token ID13
Feature activation+1.62e+00

Pos logit contributions
aster+0.517
eling+0.494
ore+0.494
oker+0.472
 baker+0.449
Neg logit contributions
 prope-0.471
aters-0.460
 pads-0.454
lets-0.446
phones-0.438
Token All
Token ID1439
Feature activation+1.88e+00

Pos logit contributions
eling+3.252
aster+3.182
oker+3.061
ore+2.981
ying+2.818
Neg logit contributions
 Dog-2.340
 prope-2.329
 Tess-2.285
 Bun-2.244
 Brown-2.230
Token my
Token ID616
Feature activation-1.45e+00

Pos logit contributions
 prope+0.761
aters+0.751
 pads+0.720
lets+0.717
phones+0.693
Neg logit contributions
aster-0.939
eling-0.889
ore-0.883
oker-0.846
 restaurant-0.817
Token tube
Token ID12403
Feature activation-1.97e-01

Pos logit contributions
 prope+2.450
aters+2.437
lets+2.407
Brown+2.404
 pads+2.358
Neg logit contributions
ore-2.476
aster-2.431
oker-2.309
 restaurant-2.248
eling-2.224
Token.
Token ID13
Feature activation-6.51e-01

Pos logit contributions
 prope+0.300
aters+0.288
 pads+0.281
lets+0.278
phones+0.274
Neg logit contributions
aster-0.369
ore-0.358
eling-0.351
oker-0.341
 restaurant-0.325
Token Go
Token ID1514
Feature activation-4.79e-01

Pos logit contributions
 prope+1.055
aters+1.017
 pads+1.011
lets+0.986
phones+0.957
Neg logit contributions
aster-1.178
ore-1.148
oker-1.071
eling-1.069
 baker-1.011
Token away
Token ID1497
Feature activation-5.90e-01

Pos logit contributions
 prope+1.085
aters+1.055
 pads+1.035
lets+1.028
phones+1.004
Neg logit contributions
aster-0.569
ore-0.549
eling-0.510
oker-0.480
 restaurant-0.451
Token!"
Token ID2474
Feature activation+2.35e+00

Pos logit contributions
 prope+0.906
aters+0.877
 pads+0.868
lets+0.853
phones+0.845
Neg logit contributions
aster-1.103
ore-1.093
eling-1.042
oker-1.026
 restaurant-0.991
Token
Token ID198
Feature activation+5.71e-01

Pos logit contributions
eling+5.536
aster+5.306
oker+5.076
ore+5.021
 baker+4.864
Neg logit contributions
 Tess-2.617
 Dog-2.608
 Bun-2.600
 Snow-2.575
 pads-2.561
Token
Token ID198
Feature activation+5.98e-01

Pos logit contributions
aster+1.097
eling+1.064
ore+1.044
oker+1.015
 baker+0.974
Neg logit contributions
 prope-0.836
aters-0.815
 pads-0.799
lets-0.781
Snow-0.780
TokenL
Token ID43
Feature activation+2.18e+00

Pos logit contributions
aster+1.200
ore+1.175
eling+1.159
oker+1.111
 baker+1.070
Neg logit contributions
 prope-0.831
aters-0.803
 pads-0.795
lets-0.780
Snow-0.779
Tokenily
Token ID813
Feature activation+2.58e-02

Pos logit contributions
aster+3.618
eling+3.523
ore+3.445
roads+3.220
 baker+3.211
Neg logit contributions
aters-3.746
 pads-3.623
 mate-3.575
lets-3.532
un-3.530
Token and
Token ID290
Feature activation+1.60e+00

Pos logit contributions
aster+0.051
ore+0.049
eling+0.049
oker+0.047
 baker+0.045
Neg logit contributions
 prope-0.037
aters-0.036
 pads-0.035
lets-0.034
phones-0.034
Token Come
Token ID7911
Feature activation-6.70e-01

Pos logit contributions
 prope+1.162
aters+1.103
 pads+1.096
lets+1.094
phones+1.065
Neg logit contributions
aster-1.125
ore-1.097
eling-1.034
oker-1.027
 baker-0.983
Token with
Token ID351
Feature activation-3.03e-01

Pos logit contributions
 prope+1.302
aters+1.252
 pads+1.240
lets+1.215
phones+1.176
Neg logit contributions
aster-1.012
ore-0.984
eling-0.910
oker-0.883
 baker-0.863
Token me
Token ID502
Feature activation-2.77e-01

Pos logit contributions
 prope+0.504
aters+0.486
 pads+0.475
lets+0.474
phones+0.465
Neg logit contributions
aster-0.547
ore-0.516
eling-0.512
oker-0.487
 restaurant-0.470
Token and
Token ID290
Feature activation+7.39e-01

Pos logit contributions
 prope+0.408
aters+0.391
 pads+0.382
lets+0.376
phones+0.369
Neg logit contributions
aster-0.540
ore-0.524
eling-0.510
oker-0.495
 restaurant-0.475
Token I
Token ID314
Feature activation-6.09e-01

Pos logit contributions
aster+1.416
eling+1.366
ore+1.364
oker+1.309
 restaurant+1.251
Neg logit contributions
 prope-1.077
 pads-1.037
aters-1.030
 Tess-1.019
lets-1.015
Tokenâ
Token ID22940
Feature activation+2.34e+00

Pos logit contributions
 prope+0.914
aters+0.882
 pads+0.858
phones+0.840
lets+0.839
Neg logit contributions
aster-1.170
ore-1.142
eling-1.109
oker-1.081
 restaurant-1.040
Token€
Token ID26391
Feature activation-1.50e+00

Pos logit contributions
eling+3.669
aster+3.621
ore+3.560
 baker+3.504
 restaurant+3.502
Neg logit contributions
aters-4.100
 pads-4.039
 Tess-4.021
 Dog-3.964
Snow-3.934
Tokenâ„¢
Token ID8151
Feature activation-1.06e+00

Pos logit contributions
 prope+2.310
aters+2.104
lets+1.977
Brown+1.970
 pads+1.954
Neg logit contributions
aster-2.933
ore-2.862
oker-2.730
eling-2.682
 restaurant-2.647
Tokenll
Token ID297
Feature activation-1.58e+00

Pos logit contributions
 prope+2.247
aters+2.107
lets+2.068
phones+2.057
 pads+2.053
Neg logit contributions
aster-1.466
ore-1.455
oker-1.316
eling-1.306
 baker-1.217
Token show
Token ID905
Feature activation-6.01e-01

Pos logit contributions
 prope+2.933
 pads+2.862
aters+2.822
lets+2.762
phones+2.674
Neg logit contributions
aster-2.609
eling-2.468
ore-2.410
oker-2.184
 restaurant-2.127
Token you
Token ID345
Feature activation-4.97e-01

Pos logit contributions
 prope+1.233
aters+1.201
 pads+1.181
lets+1.170
phones+1.137
Neg logit contributions
aster-0.838
ore-0.804
eling-0.784
oker-0.730
 baker-0.680
Token saw
Token ID2497
Feature activation-1.29e-01

Pos logit contributions
aster+1.961
ore+1.867
eling+1.802
oker+1.797
 baker+1.724
Neg logit contributions
 prope-1.518
 pads-1.470
aters-1.468
 Tess-1.451
lets-1.447
Token was
Token ID373
Feature activation+7.93e-01

Pos logit contributions
 prope+0.220
aters+0.214
 pads+0.210
lets+0.207
phones+0.203
Neg logit contributions
aster-0.220
ore-0.212
eling-0.209
oker-0.199
 restaurant-0.190
Token a
Token ID257
Feature activation-4.88e-01

Pos logit contributions
aster+1.386
ore+1.315
eling+1.313
oker+1.286
 baker+1.200
Neg logit contributions
 prope-1.331
aters-1.257
 pads-1.240
 Tess-1.236
phones-1.226
Token very
Token ID845
Feature activation+2.38e+00

Pos logit contributions
aters+0.764
 prope+0.752
 pads+0.748
lets+0.733
phones+0.716
Neg logit contributions
aster-0.890
ore-0.860
eling-0.835
oker-0.813
 restaurant-0.801
Token troubled
Token ID17840
Feature activation+8.78e-01

Pos logit contributions
aster+5.165
eling+5.064
ore+4.976
oker+4.657
roads+4.547
Neg logit contributions
 prope-3.049
 mate-2.823
 tail-2.819
 Dog-2.747
 aunt-2.705
Token squirrel
Token ID33039
Feature activation+9.35e-01

Pos logit contributions
aster+1.641
eling+1.598
ore+1.557
oker+1.507
 baker+1.409
Neg logit contributions
 prope-1.357
aters-1.285
lets-1.250
 Tess-1.246
 pads-1.245
Token.
Token ID13
Feature activation-1.25e-01

Pos logit contributions
aster+1.682
ore+1.597
eling+1.589
oker+1.568
 baker+1.499
Neg logit contributions
 prope-1.451
 pads-1.440
aters-1.430
lets-1.401
phones-1.395
Token The
Token ID383
Feature activation-5.38e-01

Pos logit contributions
 prope+0.184
aters+0.179
 pads+0.174
lets+0.172
phones+0.169
Neg logit contributions
aster-0.239
ore-0.232
eling-0.227
oker-0.220
 restaurant-0.211
Token squirrel
Token ID33039
Feature activation-4.96e-01

Pos logit contributions
 prope+0.634
aters+0.634
lets+0.609
 pads+0.607
phones+0.580
Neg logit contributions
aster-1.186
ore-1.160
eling-1.153
oker-1.107
 baker-1.093
Token was
Token ID373
Feature activation-9.34e-01

Pos logit contributions
 prope+0.903
aters+0.885
 pads+0.851
lets+0.836
phones+0.835
Neg logit contributions
aster-0.778
ore-0.759
eling-0.758
oker-0.700
 restaurant-0.678
Token hopping
Token ID47153
Feature activation-8.62e-01

Pos logit contributions
aters+1.555
 prope+1.546
lets+1.502
 pads+1.494
phones+1.475
Neg logit contributions
aster-1.601
ore-1.563
eling-1.549
 restaurant-1.427
oker-1.427
Token a
Token ID257
Feature activation-4.50e-01

Pos logit contributions
 prope+0.910
aters+0.892
lets+0.860
phones+0.853
 pads+0.849
Neg logit contributions
aster-1.249
ore-1.173
eling-1.153
oker-1.106
 restaurant-1.097
Token time
Token ID640
Feature activation-9.80e-01

Pos logit contributions
 prope+0.913
aters+0.883
phones+0.870
lets+0.868
 pads+0.864
Neg logit contributions
aster-0.642
ore-0.592
eling-0.584
oker-0.559
 restaurant-0.536
Token,
Token ID11
Feature activation+4.83e-01

Pos logit contributions
 prope+1.579
aters+1.529
 pads+1.485
lets+1.480
phones+1.468
Neg logit contributions
aster-1.800
ore-1.789
oker-1.671
eling-1.669
 restaurant-1.608
Token there
Token ID612
Feature activation-1.41e-01

Pos logit contributions
aster+0.846
eling+0.807
ore+0.803
oker+0.772
 baker+0.729
Neg logit contributions
 prope-0.802
aters-0.773
 pads-0.768
lets-0.759
 Tess-0.739
Token was
Token ID373
Feature activation-8.28e-01

Pos logit contributions
 prope+0.243
aters+0.236
 pads+0.232
lets+0.230
phones+0.228
Neg logit contributions
aster-0.238
ore-0.229
eling-0.225
oker-0.215
 restaurant-0.207
Token a
Token ID257
Feature activation+2.27e-01

Pos logit contributions
 prope+1.232
aters+1.184
lets+1.178
phones+1.171
 pads+1.153
Neg logit contributions
aster-1.651
ore-1.528
eling-1.486
 restaurant-1.448
 baker-1.443
Token little
Token ID1310
Feature activation+1.91e-01

Pos logit contributions
aster+0.452
ore+0.436
eling+0.433
oker+0.417
 restaurant+0.397
Neg logit contributions
 prope-0.320
aters-0.308
 pads-0.302
lets-0.298
phones-0.291
Token boy
Token ID2933
Feature activation-6.76e-02

Pos logit contributions
aster+0.261
ore+0.246
eling+0.244
oker+0.230
 restaurant+0.214
Neg logit contributions
 prope-0.390
aters-0.381
 pads-0.376
lets-0.372
phones-0.367
Token named
Token ID3706
Feature activation-4.41e-01

Pos logit contributions
 prope+0.082
aters+0.079
 pads+0.077
lets+0.075
phones+0.074
Neg logit contributions
aster-0.150
ore-0.145
eling-0.143
oker-0.138
 restaurant-0.133
Token Tim
Token ID5045
Feature activation-3.11e-01

Pos logit contributions
 prope+0.569
aters+0.540
 pads+0.520
lets+0.518
phones+0.506
Neg logit contributions
aster-0.972
ore-0.934
eling-0.911
oker-0.889
 restaurant-0.875
Tokenmy
Token ID1820
Feature activation-7.07e-01

Pos logit contributions
 prope+0.405
aters+0.383
phones+0.371
 pads+0.370
lets+0.367
Neg logit contributions
aster-0.666
ore-0.640
eling-0.635
oker-0.608
 restaurant-0.591
Token see
Token ID766
Feature activation-1.84e+00

Pos logit contributions
 prope+0.294
aters+0.288
 pads+0.281
lets+0.279
phones+0.275
Neg logit contributions
aster-0.334
ore-0.317
eling-0.317
oker-0.301
 restaurant-0.290
Token their
Token ID511
Feature activation-4.98e-01

Pos logit contributions
aters+3.584
lets+3.578
phones+3.509
 prope+3.418
 pads+3.413
Neg logit contributions
aster-2.829
eling-2.633
ore-2.624
 restaurant-2.548
 salsa-2.395
Token sp
Token ID599
Feature activation+7.12e-01

Pos logit contributions
 prope+0.730
aters+0.709
 pads+0.696
lets+0.686
phones+0.679
Neg logit contributions
aster-0.950
ore-0.923
eling-0.906
oker-0.875
 restaurant-0.868
Tokenade
Token ID671
Feature activation+3.06e-01

Pos logit contributions
aster+0.955
ore+0.879
eling+0.877
oker+0.836
 baker+0.793
Neg logit contributions
aters-1.466
 prope-1.460
 pads-1.424
lets-1.407
phones-1.388
Token and
Token ID290
Feature activation+3.85e-01

Pos logit contributions
aster+0.535
ore+0.513
eling+0.508
oker+0.490
 baker+0.464
Neg logit contributions
 prope-0.503
aters-0.491
 pads-0.485
lets-0.476
phones-0.468
Token bucket
Token ID19236
Feature activation+2.26e-01

Pos logit contributions
aster+0.684
ore+0.666
eling+0.652
oker+0.633
 baker+0.597
Neg logit contributions
 prope-0.628
aters-0.605
 pads-0.600
lets-0.590
phones-0.578
Token.
Token ID13
Feature activation-9.87e-01

Pos logit contributions
aster+0.395
ore+0.378
eling+0.375
oker+0.360
 baker+0.342
Neg logit contributions
 prope-0.374
aters-0.364
 pads-0.359
lets-0.353
phones-0.347
Token They
Token ID1119
Feature activation-1.40e-01

Pos logit contributions
 prope+1.595
aters+1.529
 pads+1.528
lets+1.507
phones+1.476
Neg logit contributions
aster-1.802
ore-1.781
oker-1.625
eling-1.578
 baker-1.574
Token are
Token ID389
Feature activation-7.42e-01

Pos logit contributions
 prope+0.226
aters+0.222
 pads+0.217
lets+0.214
phones+0.211
Neg logit contributions
aster-0.251
eling-0.239
ore-0.239
oker-0.227
 restaurant-0.218
Token frozen
Token ID12912
Feature activation-1.76e-01

Pos logit contributions
 prope+1.264
aters+1.245
phones+1.224
lets+1.218
 pads+1.207
Neg logit contributions
aster-1.263
eling-1.206
ore-1.178
oker-1.120
 baker-1.114
Token.
Token ID13
Feature activation-1.18e+00

Pos logit contributions
 prope+0.304
aters+0.296
 pads+0.291
lets+0.289
phones+0.285
Neg logit contributions
aster-0.293
ore-0.282
eling-0.279
oker-0.264
 restaurant-0.255
Token got
Token ID1392
Feature activation+2.28e+00

Pos logit contributions
aster+1.130
ore+1.087
eling+1.052
oker+1.035
 baker+0.960
Neg logit contributions
 prope-1.116
aters-1.084
 pads-1.082
lets-1.082
phones-1.045
Token angry
Token ID7954
Feature activation+6.33e-01

Pos logit contributions
aster+3.689
oker+3.487
eling+3.426
ore+3.397
 restaurant+3.134
Neg logit contributions
 prope-4.268
 no-4.081
 Dog-4.057
 Tess-3.998
un-3.996
Token and
Token ID290
Feature activation+2.14e+00

Pos logit contributions
aster+1.362
eling+1.306
ore+1.298
oker+1.265
 baker+1.213
Neg logit contributions
 prope-0.786
aters-0.774
 pads-0.764
lets-0.749
phones-0.723
Token tried
Token ID3088
Feature activation+9.51e-01

Pos logit contributions
aster+4.354
ore+4.263
oker+3.991
eling+3.859
 restaurant+3.855
Neg logit contributions
 prope-3.068
 pads-3.067
 Dog-3.022
 Tess-2.987
lets-2.977
Token to
Token ID284
Feature activation+1.00e+00

Pos logit contributions
aster+1.947
eling+1.926
ore+1.889
oker+1.854
 baker+1.731
Neg logit contributions
 prope-1.258
 pads-1.221
aters-1.218
lets-1.200
 Dog-1.179
Token grab
Token ID5552
Feature activation+8.88e-01

Pos logit contributions
ore+1.677
aster+1.664
eling+1.593
oker+1.562
 restaurant+1.479
Neg logit contributions
 prope-1.689
aters-1.631
 Tess-1.606
 Dog-1.603
lets-1.599
Token the
Token ID262
Feature activation-4.77e-01

Pos logit contributions
aster+1.411
eling+1.370
ore+1.366
oker+1.319
 restaurant+1.244
Neg logit contributions
 prope-1.582
aters-1.558
 pads-1.527
lets-1.512
 Tess-1.498
Token pipe
Token ID12656
Feature activation+8.17e-01

Pos logit contributions
 prope+0.739
aters+0.728
 pads+0.705
lets+0.696
phones+0.684
Neg logit contributions
aster-0.881
ore-0.858
eling-0.833
oker-0.810
 baker-0.771
Token.
Token ID13
Feature activation+4.95e-01

Pos logit contributions
aster+1.518
eling+1.480
ore+1.433
oker+1.379
 restaurant+1.324
Neg logit contributions
 prope-1.254
aters-1.228
 pads-1.219
lets-1.209
phones-1.166
Token Ben
Token ID3932
Feature activation+6.65e-01

Pos logit contributions
aster+0.994
eling+0.961
ore+0.953
oker+0.921
 restaurant+0.879
Neg logit contributions
 prope-0.691
aters-0.672
 pads-0.658
lets-0.644
 Tess-0.640
Token held
Token ID2714
Feature activation+1.67e+00

Pos logit contributions
aster+1.211
ore+1.159
eling+1.132
oker+1.108
 restaurant+1.046
Neg logit contributions
 prope-1.059
 pads-1.031
lets-1.015
aters-1.014
phones-0.983
Token.
Token ID13
Feature activation-8.32e-01

Pos logit contributions
 prope+1.798
aters+1.762
phones+1.700
 pads+1.697
lets+1.675
Neg logit contributions
aster-1.518
ore-1.431
eling-1.366
oker-1.270
 restaurant-1.261
Token Tim
Token ID5045
Feature activation-3.54e-01

Pos logit contributions
 prope+1.165
lets+1.072
phones+1.068
 pads+1.064
aters+1.060
Neg logit contributions
aster-1.734
ore-1.656
eling-1.595
oker-1.569
 restaurant-1.552
Tokenmy
Token ID1820
Feature activation-8.56e-01

Pos logit contributions
 prope+0.447
aters+0.422
phones+0.409
 pads+0.407
lets+0.406
Neg logit contributions
aster-0.768
ore-0.738
eling-0.737
oker-0.703
 restaurant-0.687
Token loved
Token ID6151
Feature activation+6.28e-01

Pos logit contributions
 prope+1.256
aters+1.231
phones+1.176
lets+1.151
 pads+1.145
Neg logit contributions
aster-1.704
ore-1.623
eling-1.573
oker-1.514
 restaurant-1.514
Token to
Token ID284
Feature activation-3.45e-01

Pos logit contributions
aster+1.218
ore+1.185
eling+1.181
oker+1.151
 baker+1.064
Neg logit contributions
 prope-0.910
aters-0.882
 pads-0.869
lets-0.848
phones-0.839
Token play
Token ID711
Feature activation+6.44e-01

Pos logit contributions
 prope+0.588
aters+0.572
 pads+0.564
lets+0.557
phones+0.544
Neg logit contributions
aster-0.606
ore-0.562
eling-0.554
oker-0.532
 restaurant-0.516
Token outside
Token ID2354
Feature activation+4.22e-01

Pos logit contributions
aster+1.056
eling+1.022
ore+1.008
oker+0.973
 baker+0.905
Neg logit contributions
 prope-1.131
aters-1.099
 pads-1.078
lets-1.065
phones-1.060
Token in
Token ID287
Feature activation-4.76e-01

Pos logit contributions
aster+0.706
eling+0.673
ore+0.672
oker+0.647
 baker+0.606
Neg logit contributions
 prope-0.733
aters-0.715
 pads-0.705
lets-0.691
phones-0.682
Token the
Token ID262
Feature activation-4.29e-01

Pos logit contributions
 prope+1.057
aters+1.032
lets+1.027
 pads+1.025
phones+1.007
Neg logit contributions
aster-0.598
ore-0.534
eling-0.530
oker-0.484
 restaurant-0.475
Token park
Token ID3952
Feature activation+1.10e+00

Pos logit contributions
 prope+0.673
aters+0.645
lets+0.637
phones+0.630
 pads+0.625
Neg logit contributions
aster-0.806
ore-0.768
eling-0.747
oker-0.723
 restaurant-0.715
Token.
Token ID13
Feature activation-6.53e-01

Pos logit contributions
aster+2.066
eling+2.048
ore+1.974
oker+1.949
 baker+1.822
Neg logit contributions
 pads-1.641
aters-1.624
 prope-1.615
lets-1.577
 tail-1.532
Token of
Token ID286
Feature activation+9.98e-02

Pos logit contributions
 prope+0.026
aters+0.026
 pads+0.025
lets+0.025
phones+0.025
Neg logit contributions
aster-0.025
ore-0.024
eling-0.023
oker-0.022
 baker-0.021
Token the
Token ID262
Feature activation-1.70e-02

Pos logit contributions
aster+0.151
ore+0.143
eling+0.142
oker+0.135
 baker+0.127
Neg logit contributions
 prope-0.189
aters-0.185
 pads-0.182
lets-0.180
phones-0.177
Token spicy
Token ID26880
Feature activation+1.10e+00

Pos logit contributions
 prope+0.028
aters+0.027
 pads+0.026
lets+0.026
phones+0.026
Neg logit contributions
aster-0.030
ore-0.029
eling-0.029
oker-0.028
 baker-0.026
Token plant
Token ID4618
Feature activation+5.36e-01

Pos logit contributions
aster+1.770
ore+1.672
eling+1.670
oker+1.611
roads+1.482
Neg logit contributions
 prope-1.958
aters-1.927
 pads-1.906
lets-1.878
phones-1.826
Token.
Token ID13
Feature activation-3.86e-01

Pos logit contributions
aster+0.994
ore+0.951
eling+0.946
oker+0.909
 baker+0.867
Neg logit contributions
 prope-0.821
aters-0.798
 pads-0.794
lets-0.778
phones-0.761
Token They
Token ID1119
Feature activation-1.32e+00

Pos logit contributions
 prope+0.623
aters+0.596
 pads+0.586
lets+0.577
phones+0.565
Neg logit contributions
aster-0.698
ore-0.675
eling-0.654
oker-0.638
 baker-0.611
Token had
Token ID550
Feature activation-1.20e+00

Pos logit contributions
aters+2.022
 prope+1.936
 pads+1.894
Snow+1.826
phones+1.824
Neg logit contributions
aster-2.497
eling-2.478
ore-2.385
oker-2.233
 baker-2.192
Token to
Token ID284
Feature activation-3.04e-01

Pos logit contributions
aters+1.908
 prope+1.900
 pads+1.811
lets+1.804
phones+1.794
Neg logit contributions
aster-2.164
eling-2.060
ore-2.016
oker-1.912
 restaurant-1.896
Token give
Token ID1577
Feature activation+5.69e-01

Pos logit contributions
 prope+0.519
aters+0.510
 pads+0.505
lets+0.495
phones+0.486
Neg logit contributions
aster-0.521
eling-0.490
ore-0.485
oker-0.463
 restaurant-0.441
Token up
Token ID510
Feature activation+7.80e-03

Pos logit contributions
aster+0.952
ore+0.905
eling+0.903
oker+0.871
 baker+0.817
Neg logit contributions
 prope-0.979
aters-0.954
 pads-0.945
lets-0.939
 Tess-0.922
Token on
Token ID319
Feature activation-4.54e-01

Pos logit contributions
aster+0.015
ore+0.014
eling+0.014
oker+0.013
 restaurant+0.013
Neg logit contributions
 prope-0.012
aters-0.012
 pads-0.011
lets-0.011
phones-0.011
TokenWhe
Token ID10842
Feature activation+1.27e+00

Pos logit contributions
aster+0.917
eling+0.889
ore+0.889
oker+0.851
 restaurant+0.816
Neg logit contributions
 prope-0.779
aters-0.750
 pads-0.746
lets-0.731
Snow-0.722
Tokene
Token ID68
Feature activation-5.01e-01

Pos logit contributions
aster+2.265
ore+2.203
eling+2.188
oker+2.145
 baker+2.081
Neg logit contributions
aters-1.930
 prope-1.926
 pads-1.848
lets-1.833
un-1.783
Token,
Token ID11
Feature activation+2.36e-01

Pos logit contributions
 prope+0.695
aters+0.676
 pads+0.655
lets+0.640
phones+0.639
Neg logit contributions
aster-1.015
ore-0.984
eling-0.955
oker-0.929
 baker-0.896
Token it
Token ID340
Feature activation+5.21e-01

Pos logit contributions
aster+0.403
ore+0.388
eling+0.383
oker+0.368
 baker+0.349
Neg logit contributions
 prope-0.400
aters-0.387
 pads-0.382
lets-0.377
phones-0.371
Token is
Token ID318
Feature activation-2.08e+00

Pos logit contributions
aster+0.839
ore+0.789
eling+0.773
oker+0.752
 baker+0.711
Neg logit contributions
 prope-0.943
aters-0.916
 pads-0.909
lets-0.898
phones-0.879
Token like
Token ID588
Feature activation-1.07e+00

Pos logit contributions
aters+3.337
phones+3.336
 prope+3.296
lets+3.264
Brown+3.225
Neg logit contributions
aster-3.734
eling-3.568
ore-3.419
 restaurant-3.301
 salsa-3.266
Token a
Token ID257
Feature activation-9.17e-01

Pos logit contributions
aters+1.791
 prope+1.780
lets+1.748
 pads+1.711
phones+1.698
Neg logit contributions
aster-1.912
eling-1.746
ore-1.742
oker-1.680
 restaurant-1.641
Token flag
Token ID6056
Feature activation-7.43e-01

Pos logit contributions
aters+1.497
 pads+1.442
 prope+1.438
phones+1.432
lets+1.419
Neg logit contributions
aster-1.638
ore-1.552
oker-1.500
eling-1.496
 baker-1.474
Token!"
Token ID2474
Feature activation+1.38e+00

Pos logit contributions
 prope+1.067
aters+1.025
 pads+0.995
phones+0.982
 Tess+0.981
Neg logit contributions
aster-1.461
ore-1.417
eling-1.366
oker-1.351
 baker-1.312
Token
Token ID198
Feature activation+5.55e-01

Pos logit contributions
eling+2.880
aster+2.834
ore+2.748
oker+2.716
 baker+2.529
Neg logit contributions
 prope-1.849
 pads-1.787
aters-1.765
Snow-1.757
 Dog-1.744
Token
Token ID198
Feature activation+4.83e-01

Pos logit contributions
aster+1.063
eling+1.027
ore+1.008
oker+0.984
 baker+0.941
Neg logit contributions
 prope-0.818
aters-0.798
 pads-0.783
lets-0.769
Snow-0.757
Token upon
Token ID2402
Feature activation-6.61e-01

Pos logit contributions
aster+0.836
eling+0.797
ore+0.793
oker+0.760
 baker+0.727
Neg logit contributions
 prope-0.752
aters-0.734
 pads-0.724
lets-0.714
 Tess-0.700
Token a
Token ID257
Feature activation-2.80e-01

Pos logit contributions
 prope+0.973
aters+0.952
lets+0.915
phones+0.907
 pads+0.904
Neg logit contributions
aster-1.320
ore-1.235
eling-1.224
oker-1.164
 restaurant-1.159
Token time
Token ID640
Feature activation-1.21e+00

Pos logit contributions
 prope+0.557
aters+0.541
lets+0.529
 pads+0.529
phones+0.528
Neg logit contributions
aster-0.405
ore-0.377
eling-0.373
oker-0.355
 restaurant-0.339
Token,
Token ID11
Feature activation+8.32e-02

Pos logit contributions
 prope+2.022
aters+1.976
 pads+1.916
lets+1.901
phones+1.880
Neg logit contributions
ore-2.181
aster-2.167
oker-2.015
eling-2.003
 restaurant-1.944
Token there
Token ID612
Feature activation-2.65e-01

Pos logit contributions
aster+0.138
ore+0.132
eling+0.130
oker+0.125
 baker+0.118
Neg logit contributions
 prope-0.146
aters-0.142
 pads-0.140
lets-0.138
phones-0.136
Token was
Token ID373
Feature activation-9.48e-01

Pos logit contributions
 prope+0.466
aters+0.453
 pads+0.444
lets+0.441
phones+0.439
Neg logit contributions
aster-0.441
ore-0.425
eling-0.415
oker-0.396
 restaurant-0.383
Token a
Token ID257
Feature activation+2.37e-01

Pos logit contributions
 prope+1.438
aters+1.392
lets+1.381
phones+1.365
 pads+1.347
Neg logit contributions
aster-1.884
ore-1.722
eling-1.681
 restaurant-1.641
 baker-1.637
Token little
Token ID1310
Feature activation+1.23e-01

Pos logit contributions
aster+0.474
ore+0.458
eling+0.455
oker+0.438
 restaurant+0.416
Neg logit contributions
 prope-0.332
aters-0.319
 pads-0.313
lets-0.309
phones-0.302
Token fish
Token ID5916
Feature activation-8.87e-02

Pos logit contributions
aster+0.172
ore+0.163
eling+0.161
oker+0.153
 baker+0.143
Neg logit contributions
 prope-0.246
aters-0.241
 pads-0.237
lets-0.235
phones-0.232
Token named
Token ID3706
Feature activation-1.06e+00

Pos logit contributions
 prope+0.111
aters+0.107
 pads+0.104
lets+0.102
phones+0.101
Neg logit contributions
aster-0.191
ore-0.186
eling-0.183
oker-0.177
 restaurant-0.171
Token Fin
Token ID4463
Feature activation-5.27e-01

Pos logit contributions
 prope+1.869
aters+1.850
 pads+1.753
lets+1.752
phones+1.741
Neg logit contributions
aster-1.829
ore-1.755
eling-1.653
 restaurant-1.600
oker-1.583
Token they
Token ID484
Feature activation-8.71e-01

Pos logit contributions
aster+0.834
ore+0.791
eling+0.791
oker+0.755
 baker+0.720
Neg logit contributions
 prope-0.786
aters-0.762
 pads-0.758
lets-0.745
phones-0.732
Token knew
Token ID2993
Feature activation-5.29e-01

Pos logit contributions
aters+1.350
 prope+1.333
 pads+1.249
lets+1.244
phones+1.230
Neg logit contributions
aster-1.633
eling-1.567
ore-1.549
oker-1.485
 restaurant-1.432
Token.
Token ID13
Feature activation-3.54e-01

Pos logit contributions
 prope+0.918
aters+0.898
 pads+0.865
lets+0.859
phones+0.848
Neg logit contributions
aster-0.894
ore-0.853
eling-0.838
oker-0.804
 restaurant-0.770
Token They
Token ID1119
Feature activation-1.27e+00

Pos logit contributions
 prope+0.511
aters+0.493
 pads+0.482
lets+0.478
phones+0.468
Neg logit contributions
aster-0.699
ore-0.676
eling-0.650
oker-0.641
 restaurant-0.618
Token felt
Token ID2936
Feature activation-2.03e-01

Pos logit contributions
aters+2.271
 prope+2.187
 pads+2.104
phones+2.073
lets+2.062
Neg logit contributions
aster-2.128
eling-2.075
ore-1.938
oker-1.866
 restaurant-1.850
Token safe
Token ID3338
Feature activation-3.81e-01

Pos logit contributions
 prope+0.361
aters+0.358
 pads+0.348
lets+0.348
phones+0.342
Neg logit contributions
aster-0.331
ore-0.315
eling-0.314
oker-0.297
 restaurant-0.283
Token and
Token ID290
Feature activation+1.09e-01

Pos logit contributions
 prope+0.562
aters+0.550
 pads+0.531
lets+0.529
phones+0.522
Neg logit contributions
aster-0.724
ore-0.708
eling-0.691
oker-0.667
 restaurant-0.648
Token happy
Token ID3772
Feature activation-2.05e-02

Pos logit contributions
aster+0.203
ore+0.195
eling+0.193
oker+0.186
 baker+0.177
Neg logit contributions
 prope-0.168
aters-0.163
 pads-0.160
lets-0.158
phones-0.155
Token.
Token ID13
Feature activation-1.33e+00

Pos logit contributions
 prope+0.035
aters+0.034
 pads+0.033
lets+0.033
phones+0.032
Neg logit contributions
aster-0.035
ore-0.034
eling-0.033
oker-0.032
 restaurant-0.030
Token They
Token ID1119
Feature activation-1.33e+00

Pos logit contributions
 prope+2.250
aters+2.155
lets+2.097
phones+2.087
 pads+2.084
Neg logit contributions
aster-2.324
ore-2.301
oker-2.122
eling-2.047
 restaurant-2.045
Token saw
Token ID2497
Feature activation-3.84e-01

Pos logit contributions
aters+2.425
 prope+2.355
phones+2.280
 pads+2.264
lets+2.230
Neg logit contributions
aster-2.139
eling-2.074
ore-1.947
oker-1.915
 restaurant-1.873
Token.
Token ID13
Feature activation+5.14e-01

Pos logit contributions
aster+0.380
ore+0.363
eling+0.361
oker+0.346
 baker+0.329
Neg logit contributions
 prope-0.357
aters-0.348
 pads-0.342
lets-0.337
phones-0.332
Token The
Token ID383
Feature activation+8.41e-01

Pos logit contributions
aster+1.058
eling+1.022
ore+1.015
oker+0.980
 restaurant+0.934
Neg logit contributions
 prope-0.692
aters-0.676
 pads-0.663
lets-0.656
 Tess-0.641
Token sky
Token ID6766
Feature activation+2.84e-01

Pos logit contributions
aster+1.425
ore+1.397
eling+1.375
oker+1.319
 baker+1.201
Neg logit contributions
 prope-1.423
 pads-1.373
aters-1.358
lets-1.342
phones-1.327
Token was
Token ID373
Feature activation-1.54e+00

Pos logit contributions
aster+0.513
ore+0.491
eling+0.486
oker+0.469
 restaurant+0.444
Neg logit contributions
 prope-0.456
aters-0.442
 pads-0.435
lets-0.431
phones-0.422
Token clear
Token ID1598
Feature activation-1.17e+00

Pos logit contributions
lets+2.606
phones+2.546
aters+2.544
 prope+2.543
 pads+2.511
Neg logit contributions
aster-2.701
eling-2.488
ore-2.453
 restaurant-2.411
 baker-2.369
Token and
Token ID290
Feature activation-7.80e-01

Pos logit contributions
 prope+1.824
phones+1.704
aters+1.703
lets+1.700
 pads+1.693
Neg logit contributions
ore-2.144
aster-2.143
eling-2.035
 restaurant-1.997
oker-1.962
Token the
Token ID262
Feature activation+2.90e-01

Pos logit contributions
aters+1.335
 prope+1.332
lets+1.284
 pads+1.283
phones+1.270
Neg logit contributions
aster-1.312
eling-1.253
ore-1.229
oker-1.168
 restaurant-1.143
Token sun
Token ID4252
Feature activation-9.55e-02

Pos logit contributions
aster+0.567
ore+0.548
eling+0.542
oker+0.523
 baker+0.496
Neg logit contributions
 prope-0.421
aters-0.406
 pads-0.401
lets-0.393
phones-0.386
Token was
Token ID373
Feature activation-1.54e+00

Pos logit contributions
 prope+0.157
aters+0.152
 pads+0.149
lets+0.147
phones+0.145
Neg logit contributions
aster-0.168
ore-0.162
eling-0.160
oker-0.152
 baker-0.146
Token shining
Token ID22751
Feature activation-1.25e+00

Pos logit contributions
lets+2.491
aters+2.443
 prope+2.429
phones+2.422
 pads+2.411
Neg logit contributions
aster-2.756
eling-2.642
ore-2.550
 restaurant-2.447
 early-2.435
Token.
Token ID13
Feature activation+2.83e-01

Pos logit contributions
 prope+2.220
aters+2.159
 pads+2.154
phones+2.149
lets+2.145
Neg logit contributions
aster-1.964
ore-1.951
eling-1.875
 restaurant-1.796
 baker-1.742
Token was
Token ID373
Feature activation-1.94e+00

Pos logit contributions
 prope+0.882
aters+0.870
lets+0.842
 pads+0.838
phones+0.833
Neg logit contributions
aster-0.819
ore-0.798
eling-0.787
oker-0.742
 restaurant-0.713
Token out
Token ID503
Feature activation-3.78e-02

Pos logit contributions
lets+4.284
phones+4.206
 pads+4.128
aters+4.109
 prope+4.080
Neg logit contributions
aster-2.536
eling-2.334
ore-2.304
 restaurant-2.100
 early-2.021
Token of
Token ID286
Feature activation-4.71e-01

Pos logit contributions
 prope+0.066
aters+0.064
 pads+0.064
lets+0.063
phones+0.062
Neg logit contributions
aster-0.063
ore-0.061
eling-0.060
oker-0.057
 restaurant-0.054
Token sight
Token ID6504
Feature activation+1.26e-01

Pos logit contributions
 prope+0.851
 pads+0.845
aters+0.844
lets+0.842
phones+0.815
Neg logit contributions
aster-0.761
ore-0.707
eling-0.704
oker-0.659
 restaurant-0.648
Token.
Token ID13
Feature activation+1.87e-01

Pos logit contributions
aster+0.219
ore+0.210
eling+0.207
oker+0.199
 baker+0.189
Neg logit contributions
 prope-0.211
aters-0.205
 pads-0.202
lets-0.199
phones-0.196
Token His
Token ID2399
Feature activation-1.92e+00

Pos logit contributions
aster+0.365
ore+0.351
eling+0.349
oker+0.336
 baker+0.320
Neg logit contributions
 prope-0.273
aters-0.265
 pads-0.259
lets-0.255
phones-0.251
Token mom
Token ID1995
Feature activation-1.94e+00

Pos logit contributions
aters+2.736
lets+2.666
 prope+2.635
 pads+2.576
Brown+2.556
Neg logit contributions
aster-3.893
ore-3.803
eling-3.617
 restaurant-3.599
oker-3.431
Token held
Token ID2714
Feature activation+4.04e-01

Pos logit contributions
 prope+3.993
aters+3.990
 pads+3.810
phones+3.704
lets+3.658
Neg logit contributions
aster-2.742
eling-2.725
ore-2.610
 baker-2.367
 restaurant-2.298
Token his
Token ID465
Feature activation-1.10e+00

Pos logit contributions
aster+0.673
ore+0.642
eling+0.638
oker+0.618
 restaurant+0.581
Neg logit contributions
 prope-0.708
aters-0.686
 pads-0.673
lets-0.666
phones-0.661
Token hand
Token ID1021
Feature activation-2.24e+00

Pos logit contributions
aters+1.726
 prope+1.668
lets+1.665
 pads+1.640
 Tess+1.590
Neg logit contributions
aster-2.024
ore-1.953
eling-1.898
oker-1.792
 restaurant-1.786
Token and
Token ID290
Feature activation-5.32e-01

Pos logit contributions
 prope+4.074
aters+4.038
lets+3.816
 pads+3.727
Brown+3.725
Neg logit contributions
ore-3.649
aster-3.622
eling-3.221
 early-3.158
 restaurant-3.016
Token.
Token ID13
Feature activation+3.75e-01

Pos logit contributions
 prope+0.013
aters+0.012
 pads+0.012
lets+0.012
phones+0.012
Neg logit contributions
aster-0.013
ore-0.012
eling-0.012
oker-0.012
 restaurant-0.011
Token It
Token ID632
Feature activation-7.92e-01

Pos logit contributions
aster+0.727
eling+0.703
ore+0.697
oker+0.672
 baker+0.637
Neg logit contributions
 prope-0.550
aters-0.532
 pads-0.523
lets-0.514
phones-0.507
Token was
Token ID373
Feature activation-1.23e+00

Pos logit contributions
aters+1.491
 prope+1.485
lets+1.413
 pads+1.411
phones+1.406
Neg logit contributions
aster-1.206
eling-1.179
ore-1.172
oker-1.075
 restaurant-1.050
Token a
Token ID257
Feature activation-5.95e-01

Pos logit contributions
lets+1.951
aters+1.942
 prope+1.933
 pads+1.888
phones+1.879
Neg logit contributions
aster-2.287
eling-2.112
ore-2.103
 restaurant-2.035
 salsa-2.002
Token fake
Token ID8390
Feature activation-4.85e-01

Pos logit contributions
aters+0.954
 prope+0.934
 pads+0.928
lets+0.916
phones+0.899
Neg logit contributions
aster-1.064
ore-1.028
eling-0.987
oker-0.976
 restaurant-0.961
Token pig
Token ID12967
Feature activation-2.00e+00

Pos logit contributions
 prope+0.742
aters+0.734
 pads+0.717
lets+0.704
phones+0.697
Neg logit contributions
aster-0.895
ore-0.869
eling-0.846
oker-0.815
 baker-0.798
Token!
Token ID0
Feature activation+6.00e-01

Pos logit contributions
 prope+3.463
aters+3.288
Brown+3.024
phones+3.013
 pads+3.012
Neg logit contributions
ore-3.587
aster-3.498
eling-3.348
oker-3.193
 restaurant-3.133
Token It
Token ID632
Feature activation-6.70e-01

Pos logit contributions
aster+1.096
eling+1.067
ore+1.041
oker+1.009
 restaurant+0.947
Neg logit contributions
 prope-0.948
aters-0.922
 pads-0.911
lets-0.894
 Tess-0.887
Token was
Token ID373
Feature activation-1.46e+00

Pos logit contributions
aters+1.191
 prope+1.183
 pads+1.131
phones+1.121
lets+1.117
Neg logit contributions
aster-1.083
eling-1.052
ore-1.041
oker-0.970
 restaurant-0.939
Token moving
Token ID3867
Feature activation-1.37e+00

Pos logit contributions
aters+2.413
 prope+2.364
lets+2.340
phones+2.318
 pads+2.298
Neg logit contributions
aster-2.579
eling-2.455
ore-2.417
 salsa-2.301
 restaurant-2.274
Token really
Token ID1107
Feature activation-5.88e-01

Pos logit contributions
aters+2.684
 prope+2.643
lets+2.548
phones+2.531
 pads+2.529
Neg logit contributions
ore-1.945
aster-1.900
eling-1.868
oker-1.698
 restaurant-1.685
Token.
Token ID13
Feature activation-1.51e-01

Pos logit contributions
 prope+1.374
phones+1.333
aters+1.331
 Tess+1.295
lets+1.294
Neg logit contributions
aster-1.338
ore-1.330
eling-1.267
oker-1.214
 restaurant-1.198
Token He
Token ID679
Feature activation+2.27e-01

Pos logit contributions
 prope+0.257
aters+0.250
 pads+0.245
lets+0.243
phones+0.240
Neg logit contributions
aster-0.257
ore-0.247
eling-0.240
oker-0.233
 baker-0.221
Token gave
Token ID2921
Feature activation-4.86e-01

Pos logit contributions
aster+0.403
ore+0.386
eling+0.381
oker+0.368
 baker+0.348
Neg logit contributions
 prope-0.370
aters-0.358
 pads-0.354
lets-0.350
phones-0.343
Token Sam
Token ID3409
Feature activation-2.14e-01

Pos logit contributions
 prope+0.836
aters+0.811
 pads+0.788
lets+0.778
phones+0.772
Neg logit contributions
aster-0.843
ore-0.802
eling-0.784
oker-0.751
 restaurant-0.717
Token some
Token ID617
Feature activation+6.92e-01

Pos logit contributions
 prope+0.321
aters+0.310
 pads+0.300
lets+0.297
phones+0.294
Neg logit contributions
aster-0.412
ore-0.395
eling-0.387
oker-0.371
 restaurant-0.360
Token envelop
Token ID16441
Feature activation-2.48e+00

Pos logit contributions
aster+1.145
eling+1.109
ore+1.104
oker+1.072
 restaurant+0.990
Neg logit contributions
 prope-1.206
aters-1.162
 pads-1.145
lets-1.127
phones-1.116
Tokenes
Token ID274
Feature activation-4.93e-01

Pos logit contributions
 prope+4.671
aters+4.330
 pads+4.263
Snow+4.204
 Tess+4.183
Neg logit contributions
ore-3.930
aster-3.878
eling-3.562
in-3.554
led-3.471
Token and
Token ID290
Feature activation-5.99e-01

Pos logit contributions
 prope+0.716
aters+0.692
 pads+0.666
lets+0.657
phones+0.655
Neg logit contributions
aster-0.971
ore-0.943
eling-0.906
oker-0.872
 restaurant-0.859
Token magazines
Token ID16695
Feature activation+2.03e-01

Pos logit contributions
 prope+0.955
aters+0.946
 pads+0.903
lets+0.893
phones+0.885
Neg logit contributions
aster-1.095
eling-1.008
ore-1.006
oker-0.967
 baker-0.944
Token.
Token ID13
Feature activation-6.66e-01

Pos logit contributions
aster+0.362
ore+0.347
eling+0.345
oker+0.331
 baker+0.314
Neg logit contributions
 prope-0.330
aters-0.320
 pads-0.316
lets-0.311
phones-0.306
Token
Token ID198
Feature activation+1.18e-02

Pos logit contributions
 prope+1.207
aters+1.166
lets+1.143
 pads+1.136
phones+1.131
Neg logit contributions
aster-1.092
ore-1.041
eling-0.974
oker-0.974
 restaurant-0.926
Token.
Token ID13
Feature activation-6.53e-01

Pos logit contributions
 prope+2.469
aters+2.397
 pads+2.359
lets+2.301
phones+2.301
Neg logit contributions
ore-1.845
aster-1.819
eling-1.603
 baker-1.602
oker-1.596
Token One
Token ID1881
Feature activation+3.06e-01

Pos logit contributions
 prope+0.918
aters+0.843
lets+0.829
 pads+0.828
phones+0.827
Neg logit contributions
aster-1.342
ore-1.293
eling-1.238
oker-1.207
 restaurant-1.204
Token day
Token ID1110
Feature activation-2.56e-01

Pos logit contributions
aster+0.355
eling+0.334
ore+0.334
oker+0.310
 baker+0.285
Neg logit contributions
 prope-0.684
aters-0.672
 pads-0.665
lets-0.657
phones-0.646
Token,
Token ID11
Feature activation+2.57e-01

Pos logit contributions
 prope+0.366
aters+0.353
 pads+0.345
lets+0.339
phones+0.334
Neg logit contributions
aster-0.515
ore-0.498
eling-0.484
oker-0.471
 baker-0.453
Token Lily
Token ID20037
Feature activation-5.71e-01

Pos logit contributions
aster+0.430
ore+0.413
eling+0.409
oker+0.392
 restaurant+0.369
Neg logit contributions
 prope-0.447
aters-0.435
 pads-0.429
lets-0.424
phones-0.418
Token's
Token ID338
Feature activation-2.09e+00

Pos logit contributions
 prope+0.926
aters+0.885
phones+0.855
lets+0.845
 pads+0.844
Neg logit contributions
aster-1.032
ore-0.991
eling-0.973
oker-0.931
 restaurant-0.916
Token mom
Token ID1995
Feature activation-5.05e-01

Pos logit contributions
 prope+3.249
 pads+3.115
lets+3.107
aters+3.092
phones+2.915
Neg logit contributions
ore-3.990
 baker-3.916
 restaurant-3.913
aster-3.908
oker-3.687
Tokenmy
Token ID1820
Feature activation-5.76e-01

Pos logit contributions
 prope+0.911
aters+0.862
 pads+0.843
phones+0.834
lets+0.828
Neg logit contributions
aster-0.842
ore-0.796
eling-0.793
oker-0.750
 baker-0.731
Token noticed
Token ID6810
Feature activation+5.25e-01

Pos logit contributions
 prope+1.105
aters+1.056
 pads+1.031
phones+1.018
lets+1.014
Neg logit contributions
aster-0.884
ore-0.838
eling-0.828
oker-0.778
 baker-0.756
Token that
Token ID326
Feature activation+9.56e-01

Pos logit contributions
aster+0.950
ore+0.920
eling+0.911
oker+0.893
 baker+0.836
Neg logit contributions
 prope-0.821
aters-0.806
 pads-0.790
lets-0.777
 Tess-0.774
Token the
Token ID262
Feature activation+1.16e-01

Pos logit contributions
aster+1.712
eling+1.690
ore+1.679
oker+1.607
 baker+1.480
Neg logit contributions
 prope-1.535
aters-1.468
 Tess-1.458
 pads-1.443
Snow-1.419
Token In
Token ID554
Feature activation-7.97e-01

Pos logit contributions
 prope+0.967
aters+0.914
lets+0.898
 pads+0.898
phones+0.878
Neg logit contributions
aster-1.250
ore-1.190
oker-1.128
eling-1.127
 restaurant-1.113
Token this
Token ID428
Feature activation+8.64e-01

Pos logit contributions
 prope+1.794
aters+1.766
lets+1.765
 pads+1.746
phones+1.734
Neg logit contributions
aster-0.970
ore-0.856
eling-0.847
oker-0.782
 restaurant-0.774
Token park
Token ID3952
Feature activation+5.41e-01

Pos logit contributions
aster+1.482
eling+1.428
ore+1.396
oker+1.337
 baker+1.263
Neg logit contributions
 prope-1.454
aters-1.434
 pads-1.416
lets-1.382
 Dog-1.372
Token,
Token ID11
Feature activation-9.38e-01

Pos logit contributions
aster+1.171
eling+1.133
ore+1.130
oker+1.088
 baker+1.042
Neg logit contributions
 prope-0.660
aters-0.636
 pads-0.631
lets-0.620
 Dog-0.598
Token there
Token ID612
Feature activation-5.44e-01

Pos logit contributions
 prope+1.794
aters+1.793
 pads+1.723
lets+1.697
phones+1.676
Neg logit contributions
aster-1.412
ore-1.337
eling-1.297
oker-1.259
 restaurant-1.247
Token were
Token ID547
Feature activation-1.99e+00

Pos logit contributions
 prope+0.927
aters+0.898
phones+0.877
 pads+0.875
lets+0.870
Neg logit contributions
aster-0.942
ore-0.907
eling-0.890
oker-0.845
 restaurant-0.832
Token two
Token ID734
Feature activation-1.10e+00

Pos logit contributions
 prope+3.936
aters+3.878
phones+3.813
lets+3.779
 Tess+3.620
Neg logit contributions
aster-2.898
 restaurant-2.858
 baker-2.762
ore-2.660
eling-2.600
Token friends
Token ID2460
Feature activation-1.28e+00

Pos logit contributions
 prope+2.049
aters+1.964
phones+1.945
lets+1.913
Brown+1.901
Neg logit contributions
aster-1.685
eling-1.664
ore-1.600
 restaurant-1.568
oker-1.559
Token,
Token ID11
Feature activation-1.15e+00

Pos logit contributions
 prope+2.176
 pads+2.032
aters+2.032
Snow+1.952
phones+1.939
Neg logit contributions
aster-2.335
ore-2.322
eling-2.183
oker-2.117
 restaurant-2.085
Token a
Token ID257
Feature activation-3.61e-01

Pos logit contributions
 prope+1.770
aters+1.701
lets+1.654
 pads+1.639
phones+1.626
Neg logit contributions
aster-2.176
ore-2.046
eling-1.974
oker-1.968
 restaurant-1.966
Token rock
Token ID3881
Feature activation+3.30e-01

Pos logit contributions
 prope+0.619
aters+0.615
 pads+0.598
lets+0.592
phones+0.589
Neg logit contributions
aster-0.601
ore-0.573
eling-0.557
oker-0.549
 baker-0.535