TOP ACTIVATIONS
MAX = 4.099

forever known as the most depend able cook in the swamp
proud . The End . <|endoftext|> Once upon
time . The boy enjoyed the pastry and
. When the muff ins were ready , Tim
ries . The baker handed the boy a
not use . The baker was very upset .
wanted to see the magic muff in . The
my smiled and cut the muff in into smaller pieces .
. The foolish muff in was not happy .
: Rot is when something spo ils and gets bad .
not tried to get the muff in . They wished they
." <|endoftext|> Jimmy was a depend able young boy . Whenever
! The foolish muff in was very pleased .
and happy . Down on the ground , there
gifts . The j olly man gives them each
L ily looked at the muff in . It was still
boy was so relieved ! The baker was so patient even
flew Sally up to the muff in . Sally took a
, something special happened . The baker set out to bake
wait to eat the tasty muff ins . <|endoftext|> Once upon

MIN ACTIVATIONS
MIN = -3.854

different neck l aces , brace lets , and rings .
big , white pup with floppy ears and a curly tail
a little rabbit named Bun . Bun had big ears that
ch ubby bunny named Bun . Bun loved to eat carrots
. Fl oppy had big floppy ears and a little pink
her flashlight and the butterfly 's wings were so colorful !
M illy and her new friend played together until M
play with lasers . They had red lasers and blue lasers
a silly rabbit named Bun . Bun liked to hop and
small , brown puppy with floppy ears and a w agging
was a bunny named Bun . Bun loved to eat cel
a little girl named Beth . Beth was nice , but
, friendly brown dog with floppy ears . The dog started
was a girl named Beth . Beth wanted to go to
swim suits on . Lily 's mom said , " Be
One day , Sandy and her owners went for a walk
she could have it . Her friend said yes , and
a little girl called Beth . She was very intelligent ,
." So , Kitty and Dog gy curled up next to
around the house . He had so much fun playing with

INTERVAL 2.110 - 4.099
CONTAINS 0.678%

Lily 's family sat down to eat . Everyone said the
energy . Now , no one will ever have to pay
jam and the bee said ," Okay , now I 'll
to go to the bathroom really bad . She ran to
cook with his mom in the kitchen . One day ,

INTERVAL 0.122 - 2.110
CONTAINS 42.578%

had to be careful of the cat . The cat was
little girl named Lily . She had a pink dress that
that he was still able to play with his mar bles
ons , scissors and glue . They are very creative .
surfing and he felt regret that he couldn 't do it

INTERVAL -1.866 - 0.122
CONTAINS 55.788%

had a shiny bag and it made her feel very happy
air got thinner and thinner . They started getting dizz y
Tom says sorry to Sam . They hug and make up
were looking for . But the poster said the dog had
visit their neighbor , Mrs . Lee . Mrs . Lee

INTERVAL -3.854 - -1.866
CONTAINS 0.956%

park . The oak tree has many leaves and ac orns
decided to follow him . They went on an adventure together
, and it was singing in the rain . The dove
girl named Lily . She had a toy tank that she
happy and said , " Mom my , burying the rock
Token forever
Token ID8097
Feature activation+1.21e+00

Pos logit contributions
ore+3.113
aster+3.059
oker+3.004
eling+2.986
 restaurant+2.771
Neg logit contributions
 prope-1.694
 pads-1.617
aters-1.575
 Dog-1.574
 Tess-1.546
Token known
Token ID1900
Feature activation-3.68e-01

Pos logit contributions
aster+2.350
ore+2.307
eling+2.266
oker+2.228
roads+2.102
Neg logit contributions
 pads-1.703
 prope-1.699
aters-1.663
lets-1.660
 tail-1.607
Token as
Token ID355
Feature activation+1.45e+00

Pos logit contributions
 prope+0.537
aters+0.528
 pads+0.514
lets+0.504
phones+0.496
Neg logit contributions
aster-0.716
ore-0.687
eling-0.680
oker-0.657
 baker-0.632
Token the
Token ID262
Feature activation+4.71e-01

Pos logit contributions
aster+2.416
ore+2.360
eling+2.286
oker+2.180
roads+2.065
Neg logit contributions
 prope-2.497
 Tess-2.435
 Dog-2.417
 pads-2.372
aters-2.358
Token most
Token ID749
Feature activation+1.00e+00

Pos logit contributions
aster+0.877
ore+0.848
eling+0.834
oker+0.801
 restaurant+0.749
Neg logit contributions
 prope-0.733
aters-0.712
 pads-0.699
lets-0.688
phones-0.675
Token depend
Token ID4745
Feature activation+4.10e+00

Pos logit contributions
aster+1.916
ore+1.818
eling+1.806
oker+1.709
roads+1.627
Neg logit contributions
 prope-1.540
aters-1.475
 pads-1.460
lets-1.447
 Tess-1.384
Tokenable
Token ID540
Feature activation+5.05e-01

Pos logit contributions
eling+5.241
aster+5.030
ulk+4.341
 salsa+4.237
 oven+4.190
Neg logit contributions
 pads-9.100
lets-9.094
un-9.057
aters-8.757
 Tess-8.665
Token cook
Token ID4255
Feature activation+1.65e+00

Pos logit contributions
aster+0.966
eling+0.925
ore+0.923
oker+0.878
 baker+0.826
Neg logit contributions
 prope-0.756
aters-0.738
 pads-0.723
lets-0.717
 Tess-0.697
Token in
Token ID287
Feature activation+1.50e+00

Pos logit contributions
aster+3.009
eling+2.951
ore+2.869
oker+2.700
atches+2.530
Neg logit contributions
 pads-2.558
aters-2.503
lets-2.502
 prope-2.484
phones-2.408
Token the
Token ID262
Feature activation+1.68e+00

Pos logit contributions
aster+2.302
ore+2.296
eling+2.241
oker+2.203
 baker+2.051
Neg logit contributions
 prope-2.686
aters-2.599
 Tess-2.534
 pads-2.518
 Dog-2.507
Token swamp
Token ID26837
Feature activation+1.56e+00

Pos logit contributions
aster+3.193
eling+3.028
ore+2.954
oker+2.921
ying+2.691
Neg logit contributions
 prope-2.483
aters-2.472
 pads-2.447
 Dog-2.418
 tail-2.396
Token proud
Token ID6613
Feature activation+2.03e-02

Pos logit contributions
aster+2.721
ore+2.603
eling+2.565
oker+2.440
roads+2.374
Neg logit contributions
 prope-2.474
lets-2.289
aters-2.278
 pads-2.272
 Tess-2.245
Token.
Token ID13
Feature activation+1.11e+00

Pos logit contributions
aster+0.035
ore+0.033
eling+0.033
oker+0.032
 baker+0.030
Neg logit contributions
 prope-0.035
aters-0.034
 pads-0.033
lets-0.033
phones-0.032
Token
Token ID220
Feature activation+7.65e-01

Pos logit contributions
aster+2.155
eling+2.115
ore+2.043
oker+1.981
 restaurant+1.907
Neg logit contributions
aters-1.603
 prope-1.599
 pads-1.575
 Tess-1.529
lets-1.529
Token
Token ID198
Feature activation+1.08e+00

Pos logit contributions
aster+1.481
eling+1.436
ore+1.404
oker+1.367
 restaurant+1.310
Neg logit contributions
aters-1.094
 prope-1.094
 pads-1.070
lets-1.047
 Tess-1.033
Token
Token ID198
Feature activation+1.15e+00

Pos logit contributions
aster+2.149
eling+2.109
ore+2.049
oker+2.015
 restaurant+1.924
Neg logit contributions
 prope-1.474
aters-1.435
 pads-1.416
Snow-1.398
Brown-1.393
TokenThe
Token ID464
Feature activation+4.09e+00

Pos logit contributions
aster+2.420
ore+2.404
eling+2.372
oker+2.279
 restaurant+2.207
Neg logit contributions
 prope-1.415
aters-1.360
Snow-1.346
Brown-1.346
 pads-1.346
Token End
Token ID5268
Feature activation+9.45e-01

Pos logit contributions
ore+8.952
oker+7.931
aster+7.661
eling+7.545
atches+7.458
Neg logit contributions
 no-6.840
 tail-6.410
 aunt-6.344
 prope-6.147
 owners-6.144
Token.
Token ID13
Feature activation+4.80e-02

Pos logit contributions
aster+1.784
eling+1.749
ore+1.683
oker+1.643
 baker+1.590
Neg logit contributions
 prope-1.381
aters-1.373
 pads-1.339
lets-1.320
phones-1.292
Token<|endoftext|>
Token ID50256
Feature activation+1.25e+00

Pos logit contributions
aster+0.087
ore+0.084
eling+0.082
oker+0.080
 baker+0.076
Neg logit contributions
 prope-0.077
aters-0.074
 pads-0.073
lets-0.072
phones-0.071
TokenOnce
Token ID7454
Feature activation+9.94e-01

Pos logit contributions
aster+1.564
eling+1.516
oker+1.365
ore+1.332
 baker+1.261
Neg logit contributions
aters-2.714
 pads-2.662
 prope-2.656
 Dog-2.594
 Tess-2.590
Token upon
Token ID2402
Feature activation-6.94e-01

Pos logit contributions
aster+1.878
eling+1.800
ore+1.749
oker+1.706
 baker+1.655
Neg logit contributions
 prope-1.475
aters-1.445
 pads-1.423
lets-1.401
 Tess-1.396
Token time
Token ID640
Feature activation-2.97e-02

Pos logit contributions
aster+0.414
ore+0.397
eling+0.392
oker+0.374
 baker+0.352
Neg logit contributions
 prope-0.496
aters-0.484
 pads-0.477
lets-0.471
phones-0.463
Token.
Token ID13
Feature activation+9.97e-01

Pos logit contributions
 prope+0.049
aters+0.048
 pads+0.047
lets+0.046
phones+0.046
Neg logit contributions
aster-0.052
ore-0.050
eling-0.049
oker-0.047
 restaurant-0.045
Token
Token ID220
Feature activation+8.83e-01

Pos logit contributions
aster+1.958
eling+1.932
ore+1.859
oker+1.825
 restaurant+1.733
Neg logit contributions
 prope-1.414
aters-1.408
 pads-1.378
 Tess-1.346
lets-1.346
Token
Token ID198
Feature activation+7.53e-01

Pos logit contributions
aster+1.734
eling+1.684
ore+1.634
oker+1.605
 restaurant+1.535
Neg logit contributions
aters-1.243
 prope-1.236
 pads-1.219
lets-1.186
 Tess-1.170
Token
Token ID198
Feature activation+6.45e-01

Pos logit contributions
aster+1.456
eling+1.429
ore+1.391
oker+1.363
 restaurant+1.299
Neg logit contributions
 prope-1.079
aters-1.049
 pads-1.033
Snow-1.005
lets-1.005
TokenThe
Token ID464
Feature activation+4.06e+00

Pos logit contributions
aster+1.316
ore+1.290
eling+1.287
oker+1.235
 restaurant+1.189
Neg logit contributions
 prope-0.863
aters-0.829
 pads-0.817
lets-0.800
Snow-0.798
Token boy
Token ID2933
Feature activation+6.29e-01

Pos logit contributions
ore+9.613
aster+8.691
oker+8.682
eling+8.596
ulk+8.276
Neg logit contributions
 no-5.969
 tail-5.786
 aunt-5.752
 neck-5.527
 scarf-5.448
Token enjoyed
Token ID8359
Feature activation+1.37e+00

Pos logit contributions
aster+1.196
ore+1.160
eling+1.139
oker+1.116
 restaurant+1.046
Neg logit contributions
 prope-0.938
aters-0.902
 pads-0.897
lets-0.894
phones-0.860
Token the
Token ID262
Feature activation+3.03e+00

Pos logit contributions
ore+2.209
aster+2.176
oker+2.140
eling+2.115
 restaurant+1.875
Neg logit contributions
 prope-2.453
 Dog-2.292
 Tess-2.287
aters-2.285
 pads-2.270
Token pastry
Token ID41738
Feature activation+8.94e-01

Pos logit contributions
ore+5.861
aster+5.771
eling+5.745
oker+5.546
ulk+5.052
Neg logit contributions
 tail-4.559
 pads-4.472
 scarf-4.440
aters-4.411
lets-4.394
Token and
Token ID290
Feature activation+6.73e-01

Pos logit contributions
aster+1.860
eling+1.810
ore+1.799
oker+1.737
 restaurant+1.630
Neg logit contributions
 prope-1.150
aters-1.129
 pads-1.124
lets-1.103
phones-1.066
Token.
Token ID13
Feature activation+1.90e-01

Pos logit contributions
aster+2.973
eling+2.949
ore+2.946
oker+2.909
roads+2.668
Neg logit contributions
aters-2.194
 prope-2.175
 pads-2.149
lets-2.132
Snow-2.047
Token
Token ID198
Feature activation+2.21e-01

Pos logit contributions
aster+0.352
ore+0.337
eling+0.336
oker+0.323
 baker+0.307
Neg logit contributions
 prope-0.296
aters-0.288
 pads-0.283
lets-0.279
phones-0.274
Token
Token ID198
Feature activation-9.62e-01

Pos logit contributions
aster+0.406
ore+0.389
eling+0.388
oker+0.373
 baker+0.355
Neg logit contributions
 prope-0.341
aters-0.332
 pads-0.326
lets-0.321
phones-0.316
TokenWhen
Token ID2215
Feature activation+5.56e-01

Pos logit contributions
 prope+1.554
aters+1.501
 pads+1.448
lets+1.437
phones+1.415
Neg logit contributions
aster-1.857
ore-1.715
eling-1.684
oker-1.637
 restaurant-1.583
Token the
Token ID262
Feature activation+2.71e+00

Pos logit contributions
aster+1.000
ore+0.976
eling+0.972
oker+0.936
 baker+0.883
Neg logit contributions
 prope-0.876
aters-0.848
 pads-0.833
lets-0.824
 Tess-0.823
Token muff
Token ID27563
Feature activation+4.05e+00

Pos logit contributions
aster+4.288
eling+4.204
ore+4.179
oker+4.138
roads+3.866
Neg logit contributions
 prope-4.904
 aunt-4.843
aters-4.730
Snow-4.719
 no-4.707
Tokenins
Token ID1040
Feature activation+2.29e+00

Pos logit contributions
aster+5.191
roads+4.317
ore+4.288
oker+4.283
eling+4.259
Neg logit contributions
lets-9.153
aters-8.853
un-8.513
 forever-8.225
 pads-8.215
Token were
Token ID547
Feature activation+2.06e+00

Pos logit contributions
aster+5.189
eling+5.050
oker+4.986
ore+4.898
roads+4.662
Neg logit contributions
lets-2.596
 prope-2.495
aters-2.487
 pads-2.431
 Dog-2.351
Token ready
Token ID3492
Feature activation+5.59e-01

Pos logit contributions
ore+3.355
aster+3.322
oker+3.266
eling+3.148
roads+2.989
Neg logit contributions
 prope-3.577
aters-3.467
 pads-3.406
 no-3.386
 forever-3.334
Token,
Token ID11
Feature activation+2.08e-01

Pos logit contributions
aster+1.201
eling+1.161
ore+1.156
oker+1.119
 baker+1.070
Neg logit contributions
 prope-0.688
aters-0.666
 pads-0.655
lets-0.644
 Tess-0.635
Token Tim
Token ID5045
Feature activation+4.25e-01

Pos logit contributions
aster+0.400
ore+0.385
eling+0.382
oker+0.368
 baker+0.351
Neg logit contributions
 prope-0.308
aters-0.298
 pads-0.293
lets-0.289
phones-0.284
Tokenries
Token ID1678
Feature activation+1.15e+00

Pos logit contributions
eling+2.860
oker+2.833
aster+2.821
ore+2.784
 salsa+2.306
Neg logit contributions
aters-5.020
lets-4.823
 pads-4.663
phones-4.613
 prope-4.549
Token.
Token ID13
Feature activation+2.45e+00

Pos logit contributions
aster+2.225
eling+2.166
ore+2.123
oker+2.065
 baker+1.925
Neg logit contributions
 pads-1.641
aters-1.629
 prope-1.611
lets-1.604
phones-1.549
Token
Token ID220
Feature activation+8.09e-01

Pos logit contributions
eling+5.640
aster+5.244
oker+5.172
ore+4.995
ying+4.844
Neg logit contributions
 Dog-3.054
 Tess-3.006
aters-2.931
 Brown-2.926
lets-2.880
Token
Token ID198
Feature activation+1.21e+00

Pos logit contributions
aster+1.576
eling+1.529
ore+1.487
oker+1.459
 restaurant+1.389
Neg logit contributions
aters-1.150
 prope-1.141
 pads-1.125
lets-1.101
 Tess-1.079
Token
Token ID198
Feature activation+1.04e+00

Pos logit contributions
eling+2.420
aster+2.386
ore+2.315
oker+2.282
 restaurant+2.177
Neg logit contributions
 prope-1.608
aters-1.565
Brown-1.549
Snow-1.541
 pads-1.534
TokenThe
Token ID464
Feature activation+4.04e+00

Pos logit contributions
aster+2.153
eling+2.149
ore+2.147
oker+2.057
 restaurant+1.971
Neg logit contributions
 prope-1.315
aters-1.259
Brown-1.250
Snow-1.242
 pads-1.234
Token baker
Token ID46412
Feature activation+9.27e-01

Pos logit contributions
ore+8.482
aster+7.774
oker+7.722
eling+7.462
ulk+7.173
Neg logit contributions
 no-6.835
 aunt-6.656
 neck-6.474
 owners-6.438
 mate-6.434
Token handed
Token ID10158
Feature activation+1.54e+00

Pos logit contributions
aster+1.691
ore+1.649
eling+1.614
oker+1.584
 restaurant+1.466
Neg logit contributions
 prope-1.438
lets-1.402
aters-1.382
 pads-1.372
 Tess-1.337
Token the
Token ID262
Feature activation+2.01e+00

Pos logit contributions
aster+2.586
eling+2.538
oker+2.504
ore+2.496
 restaurant+2.247
Neg logit contributions
 prope-2.549
 Dog-2.534
 Tess-2.532
lets-2.523
 pads-2.520
Token boy
Token ID2933
Feature activation+1.32e+00

Pos logit contributions
aster+3.771
ore+3.729
eling+3.600
oker+3.520
roads+3.154
Neg logit contributions
lets-3.210
 pads-3.199
 prope-3.188
 neck-3.114
aters-3.089
Token a
Token ID257
Feature activation+1.06e+00

Pos logit contributions
aster+2.654
eling+2.637
ore+2.601
oker+2.579
roads+2.349
Neg logit contributions
lets-1.749
 pads-1.748
aters-1.718
 prope-1.717
phones-1.647
Token not
Token ID407
Feature activation+1.85e-01

Pos logit contributions
aster+0.620
ore+0.601
eling+0.596
oker+0.577
 restaurant+0.551
Neg logit contributions
 prope-0.437
aters-0.421
 pads-0.412
lets-0.406
phones-0.400
Token use
Token ID779
Feature activation+2.20e-01

Pos logit contributions
aster+0.302
ore+0.289
eling+0.286
oker+0.274
 baker+0.260
Neg logit contributions
 prope-0.326
aters-0.317
 pads-0.312
lets-0.308
phones-0.304
Token.
Token ID13
Feature activation+7.65e-01

Pos logit contributions
aster+0.387
ore+0.371
eling+0.368
oker+0.354
 restaurant+0.336
Neg logit contributions
 prope-0.359
aters-0.350
 pads-0.344
lets-0.339
phones-0.334
Token
Token ID198
Feature activation+3.57e-01

Pos logit contributions
aster+1.515
eling+1.483
ore+1.430
oker+1.411
 restaurant+1.334
Neg logit contributions
 prope-1.079
aters-1.064
 pads-1.043
 Tess-1.034
lets-1.033
Token
Token ID198
Feature activation-3.24e-02

Pos logit contributions
aster+0.667
eling+0.642
ore+0.637
oker+0.616
 baker+0.586
Neg logit contributions
 prope-0.541
aters-0.526
 pads-0.516
lets-0.508
 Tess-0.501
TokenThe
Token ID464
Feature activation+4.03e+00

Pos logit contributions
 prope+0.046
aters+0.045
 pads+0.044
lets+0.043
phones+0.042
Neg logit contributions
aster-0.065
ore-0.062
eling-0.062
oker-0.059
 baker-0.057
Token baker
Token ID46412
Feature activation+1.27e+00

Pos logit contributions
ore+7.640
eling+6.875
aster+6.733
oker+6.707
roads+6.579
Neg logit contributions
 no-7.470
 tail-7.091
 aunt-6.935
 mate-6.892
 owners-6.846
Token was
Token ID373
Feature activation+1.23e+00

Pos logit contributions
aster+2.410
ore+2.362
eling+2.298
oker+2.282
 restaurant+2.115
Neg logit contributions
 prope-1.886
lets-1.833
aters-1.765
 pads-1.761
 Tess-1.720
Token very
Token ID845
Feature activation+3.11e+00

Pos logit contributions
aster+2.644
ore+2.621
oker+2.550
eling+2.504
 restaurant+2.352
Neg logit contributions
 prope-1.560
 pads-1.390
 Dog-1.387
aters-1.380
 Tess-1.378
Token upset
Token ID9247
Feature activation-6.94e-02

Pos logit contributions
aster+6.167
ore+5.903
roads+5.489
oker+5.437
eling+5.417
Neg logit contributions
 prope-4.949
 no-4.850
 undone-4.724
un-4.619
 tail-4.585
Token.
Token ID13
Feature activation+1.11e+00

Pos logit contributions
 prope+0.106
aters+0.103
 pads+0.101
lets+0.099
phones+0.098
Neg logit contributions
aster-0.131
ore-0.127
eling-0.125
oker-0.120
 restaurant-0.115
Token wanted
Token ID2227
Feature activation+6.28e-01

Pos logit contributions
aster+1.325
ore+1.285
eling+1.260
oker+1.221
 restaurant+1.152
Neg logit contributions
 prope-0.924
lets-0.891
aters-0.890
 pads-0.888
 Tess-0.853
Token to
Token ID284
Feature activation-8.55e-03

Pos logit contributions
aster+1.225
ore+1.193
eling+1.188
oker+1.164
 baker+1.074
Neg logit contributions
 prope-0.898
aters-0.865
 pads-0.855
lets-0.842
phones-0.833
Token see
Token ID766
Feature activation+3.16e-01

Pos logit contributions
 prope+0.015
aters+0.015
 pads+0.015
lets+0.015
phones+0.014
Neg logit contributions
aster-0.014
ore-0.013
eling-0.013
oker-0.012
 baker-0.012
Token the
Token ID262
Feature activation+2.32e+00

Pos logit contributions
aster+0.560
ore+0.538
eling+0.532
oker+0.517
 restaurant+0.485
Neg logit contributions
 prope-0.514
aters-0.497
 pads-0.488
lets-0.484
phones-0.476
Token magic
Token ID5536
Feature activation+2.54e+00

Pos logit contributions
aster+4.927
oker+4.595
ore+4.593
eling+4.571
roads+4.227
Neg logit contributions
 prope-3.268
Snow-2.971
 neck-2.942
Brown-2.931
 tail-2.926
Token muff
Token ID27563
Feature activation+4.03e+00

Pos logit contributions
eling+4.328
aster+4.285
oker+3.894
ore+3.774
roads+3.532
Neg logit contributions
 prope-4.575
lets-4.461
aters-4.351
 Brown-4.344
 scarf-4.318
Tokenin
Token ID259
Feature activation+1.92e+00

Pos logit contributions
aster+5.236
oker+4.151
roads+4.047
eling+4.045
ore+3.846
Neg logit contributions
lets-9.668
ocular-9.103
aters-8.945
 forever-8.868
un-8.857
Token.
Token ID13
Feature activation-8.30e-02

Pos logit contributions
aster+3.847
eling+3.648
ore+3.624
oker+3.576
 baker+3.350
Neg logit contributions
 pads-2.742
lets-2.728
 prope-2.622
aters-2.597
phones-2.562
Token
Token ID198
Feature activation+3.50e-01

Pos logit contributions
 prope+0.135
aters+0.130
 pads+0.127
lets+0.126
phones+0.124
Neg logit contributions
aster-0.149
ore-0.144
eling-0.140
oker-0.135
 baker-0.130
Token
Token ID198
Feature activation-6.43e-01

Pos logit contributions
aster+0.653
eling+0.627
ore+0.622
oker+0.601
 baker+0.574
Neg logit contributions
 prope-0.528
aters-0.515
 pads-0.506
lets-0.498
 Tess-0.489
TokenThe
Token ID464
Feature activation+2.58e+00

Pos logit contributions
 prope+1.124
aters+1.080
 pads+1.064
lets+1.048
phones+1.040
Neg logit contributions
aster-1.110
ore-1.063
eling-1.042
oker-0.989
 restaurant-0.973
Tokenmy
Token ID1820
Feature activation-5.33e-01

Pos logit contributions
 prope+2.001
aters+1.876
 pads+1.802
phones+1.750
lets+1.712
Neg logit contributions
aster-2.567
eling-2.510
ore-2.469
 baker-2.296
oker-2.274
Token smiled
Token ID13541
Feature activation-1.44e+00

Pos logit contributions
 prope+0.969
aters+0.951
 pads+0.922
lets+0.902
phones+0.898
Neg logit contributions
aster-0.855
eling-0.819
ore-0.818
oker-0.756
 baker-0.740
Token and
Token ID290
Feature activation+5.69e-01

Pos logit contributions
 prope+2.482
phones+2.429
aters+2.375
Brown+2.332
lets+2.323
Neg logit contributions
ore-2.400
aster-2.375
eling-2.256
 early-2.189
 restaurant-2.181
Token cut
Token ID2005
Feature activation+2.57e-01

Pos logit contributions
aster+0.942
ore+0.905
eling+0.883
oker+0.861
 restaurant+0.809
Neg logit contributions
 prope-1.001
aters-0.961
 pads-0.960
lets-0.956
 Tess-0.947
Token the
Token ID262
Feature activation+5.71e-01

Pos logit contributions
aster+0.394
ore+0.374
eling+0.372
oker+0.355
 baker+0.335
Neg logit contributions
 prope-0.480
aters-0.469
 pads-0.463
lets-0.457
phones-0.450
Token muff
Token ID27563
Feature activation+4.01e+00

Pos logit contributions
aster+1.104
ore+1.064
eling+1.060
oker+1.024
 baker+0.964
Neg logit contributions
 prope-0.854
aters-0.809
 pads-0.802
lets-0.783
phones-0.772
Tokenin
Token ID259
Feature activation+1.64e+00

Pos logit contributions
aster+4.910
 restaurant+4.385
roads+4.252
oker+4.072
eling+3.854
Neg logit contributions
lets-9.749
aters-9.136
un-8.985
 pads-8.915
ocular-8.914
Token into
Token ID656
Feature activation+6.46e-01

Pos logit contributions
aster+3.124
eling+3.042
oker+2.977
ore+2.936
 baker+2.817
Neg logit contributions
 pads-2.419
 prope-2.398
lets-2.398
aters-2.355
phones-2.274
Token smaller
Token ID4833
Feature activation+1.00e+00

Pos logit contributions
aster+1.038
ore+1.015
eling+1.002
oker+0.963
 restaurant+0.905
Neg logit contributions
 prope-1.153
aters-1.107
 pads-1.083
lets-1.078
 Tess-1.064
Token pieces
Token ID5207
Feature activation-1.31e-01

Pos logit contributions
aster+1.898
ore+1.806
eling+1.805
oker+1.780
 restaurant+1.657
Neg logit contributions
 prope-1.543
aters-1.516
 pads-1.472
lets-1.469
 neck-1.401
Token.
Token ID13
Feature activation-4.75e-01

Pos logit contributions
 prope+0.224
aters+0.218
 pads+0.213
lets+0.211
phones+0.209
Neg logit contributions
aster-0.221
ore-0.212
eling-0.208
oker-0.200
 baker-0.189
Token.
Token ID13
Feature activation+1.09e+00

Pos logit contributions
aster+0.268
ore+0.257
eling+0.255
oker+0.245
 baker+0.234
Neg logit contributions
 prope-0.226
aters-0.220
 pads-0.216
lets-0.213
phones-0.209
Token
Token ID198
Feature activation+7.07e-01

Pos logit contributions
eling+2.051
aster+2.050
ore+1.945
oker+1.917
 restaurant+1.794
Neg logit contributions
 prope-1.663
aters-1.636
 Tess-1.595
 Dog-1.587
 pads-1.586
Token
Token ID198
Feature activation+4.35e-01

Pos logit contributions
aster+1.368
eling+1.344
ore+1.297
oker+1.278
 baker+1.214
Neg logit contributions
 prope-1.006
aters-0.983
 pads-0.959
 Tess-0.944
lets-0.941
TokenThe
Token ID464
Feature activation+2.96e+00

Pos logit contributions
aster+0.883
ore+0.861
eling+0.853
oker+0.824
 restaurant+0.786
Neg logit contributions
 prope-0.594
aters-0.573
 pads-0.562
lets-0.554
Snow-0.544
Token foolish
Token ID19538
Feature activation+2.77e+00

Pos logit contributions
aster+6.272
ore+6.174
oker+6.010
eling+5.998
roads+5.595
Neg logit contributions
 prope-4.299
 neck-4.163
Brown-4.085
Snow-4.001
 Brown-4.000
Token muff
Token ID27563
Feature activation+3.98e+00

Pos logit contributions
aster+5.227
eling+5.042
oker+4.908
ore+4.858
roads+4.630
Neg logit contributions
 prope-4.631
 neck-4.427
 mate-4.306
 tail-4.290
 Brown-4.280
Tokenin
Token ID259
Feature activation+1.56e+00

Pos logit contributions
aster+4.956
eling+3.601
 salsa+3.570
roads+3.559
 restaurant+3.514
Neg logit contributions
lets-9.907
un-9.472
Brown-8.923
aters-8.913
 forever-8.896
Token was
Token ID373
Feature activation+6.41e-01

Pos logit contributions
aster+3.024
ore+2.913
eling+2.889
oker+2.859
 baker+2.631
Neg logit contributions
lets-2.315
 prope-2.269
 pads-2.266
aters-2.189
 neck-2.134
Token not
Token ID407
Feature activation+7.94e-01

Pos logit contributions
aster+1.286
ore+1.249
eling+1.221
oker+1.203
 restaurant+1.130
Neg logit contributions
 prope-0.906
 pads-0.852
aters-0.851
lets-0.832
 Tess-0.809
Token happy
Token ID3772
Feature activation+5.03e-01

Pos logit contributions
aster+1.590
ore+1.548
eling+1.516
oker+1.473
 restaurant+1.397
Neg logit contributions
 prope-1.137
aters-1.070
 pads-1.070
lets-1.057
 Tess-1.024
Token.
Token ID13
Feature activation+1.12e+00

Pos logit contributions
aster+0.967
ore+0.924
eling+0.920
oker+0.883
 baker+0.841
Neg logit contributions
 prope-0.740
aters-0.728
 pads-0.720
lets-0.709
phones-0.689
Token:
Token ID25
Feature activation+1.23e+00

Pos logit contributions
aster+1.430
ore+1.375
eling+1.334
oker+1.325
 restaurant+1.221
Neg logit contributions
 prope-1.391
aters-1.349
lets-1.343
 pads-1.332
 Tess-1.314
Token Rot
Token ID18481
Feature activation+1.37e+00

Pos logit contributions
aster+2.764
eling+2.704
ore+2.647
oker+2.602
 restaurant+2.441
Neg logit contributions
 prope-1.443
 Tess-1.339
 Dog-1.331
Brown-1.295
aters-1.276
Token is
Token ID318
Feature activation+5.42e-01

Pos logit contributions
aster+2.302
ore+2.080
oker+2.079
eling+2.060
 baker+1.973
Neg logit contributions
aters-2.356
 prope-2.354
 pads-2.342
lets-2.320
 Tess-2.274
Token when
Token ID618
Feature activation-2.65e-01

Pos logit contributions
aster+0.983
ore+0.944
eling+0.929
oker+0.907
 restaurant+0.853
Neg logit contributions
 prope-0.864
aters-0.835
 pads-0.825
lets-0.816
phones-0.800
Token something
Token ID1223
Feature activation+1.83e+00

Pos logit contributions
 prope+0.445
aters+0.432
 pads+0.424
lets+0.419
phones+0.411
Neg logit contributions
aster-0.456
ore-0.436
eling-0.433
oker-0.413
 restaurant-0.394
Token spo
Token ID15106
Feature activation+3.97e+00

Pos logit contributions
aster+3.449
ore+3.164
eling+3.159
oker+3.128
 baker+2.965
Neg logit contributions
 prope-2.867
 pads-2.780
lets-2.711
 Dog-2.657
aters-2.645
Tokenils
Token ID4487
Feature activation+8.76e-01

Pos logit contributions
aster+4.800
 baker+4.397
ore+4.363
 restaurant+4.338
oker+4.304
Neg logit contributions
aters-9.764
 pads-9.064
lets-8.777
tails-8.692
un-8.571
Token and
Token ID290
Feature activation+1.09e+00

Pos logit contributions
aster+1.637
eling+1.565
ore+1.565
oker+1.506
 baker+1.451
Neg logit contributions
 prope-1.311
aters-1.286
 pads-1.267
lets-1.248
 Tess-1.227
Token gets
Token ID3011
Feature activation+1.15e+00

Pos logit contributions
aster+2.014
ore+1.936
eling+1.856
oker+1.847
 baker+1.756
Neg logit contributions
 prope-1.721
 pads-1.668
lets-1.640
aters-1.639
phones-1.590
Token bad
Token ID2089
Feature activation+8.42e-01

Pos logit contributions
aster+2.111
ore+2.013
eling+1.964
oker+1.962
 baker+1.834
Neg logit contributions
 prope-1.816
 pads-1.735
aters-1.727
lets-1.714
 Tess-1.680
Token.
Token ID13
Feature activation-1.03e-02

Pos logit contributions
aster+1.560
eling+1.478
ore+1.470
oker+1.416
 baker+1.349
Neg logit contributions
 prope-1.302
aters-1.292
 pads-1.283
lets-1.258
phones-1.227
Token not
Token ID407
Feature activation-1.28e-02

Pos logit contributions
 prope+0.444
aters+0.435
 pads+0.417
lets+0.412
phones+0.407
Neg logit contributions
aster-0.580
eling-0.549
ore-0.545
oker-0.524
 restaurant-0.510
Token tried
Token ID3088
Feature activation+3.36e-01

Pos logit contributions
 prope+0.020
aters+0.019
 pads+0.019
lets+0.019
phones+0.018
Neg logit contributions
aster-0.024
ore-0.023
eling-0.023
oker-0.022
 restaurant-0.021
Token to
Token ID284
Feature activation-1.52e-01

Pos logit contributions
aster+0.642
ore+0.617
eling+0.614
oker+0.592
 baker+0.563
Neg logit contributions
 prope-0.503
aters-0.489
 pads-0.482
lets-0.474
phones-0.465
Token get
Token ID651
Feature activation+1.76e+00

Pos logit contributions
 prope+0.249
aters+0.243
 pads+0.240
lets+0.235
phones+0.231
Neg logit contributions
aster-0.274
ore-0.259
eling-0.258
oker-0.247
 restaurant-0.236
Token the
Token ID262
Feature activation+2.03e+00

Pos logit contributions
aster+3.389
oker+3.268
ore+3.262
eling+3.192
roads+3.005
Neg logit contributions
 prope-2.580
phones-2.492
 Dog-2.489
 pads-2.488
aters-2.466
Token muff
Token ID27563
Feature activation+3.97e+00

Pos logit contributions
aster+3.024
eling+2.830
ore+2.753
oker+2.720
roads+2.477
Neg logit contributions
 prope-4.109
aters-3.890
Snow-3.807
 scarf-3.801
 Brown-3.781
Tokenin
Token ID259
Feature activation+1.79e+00

Pos logit contributions
aster+4.675
roads+3.825
eling+3.783
oker+3.773
 restaurant+3.440
Neg logit contributions
lets-9.623
aters-9.385
ocular-9.236
un-9.126
Snow-9.001
Token.
Token ID13
Feature activation-8.22e-01

Pos logit contributions
aster+3.545
eling+3.544
ore+3.357
oker+3.310
 baker+3.097
Neg logit contributions
 pads-2.502
 prope-2.471
aters-2.443
lets-2.441
 Dog-2.324
Token They
Token ID1119
Feature activation-1.07e+00

Pos logit contributions
 prope+1.249
aters+1.200
lets+1.175
 pads+1.174
phones+1.154
Neg logit contributions
aster-1.571
ore-1.531
oker-1.432
eling-1.430
 restaurant-1.399
Token wished
Token ID16555
Feature activation+4.91e-01

Pos logit contributions
aters+1.843
 prope+1.775
 pads+1.730
phones+1.708
lets+1.708
Neg logit contributions
aster-1.849
eling-1.801
ore-1.691
oker-1.604
 restaurant-1.598
Token they
Token ID484
Feature activation-1.16e+00

Pos logit contributions
aster+0.874
ore+0.843
eling+0.840
oker+0.807
 baker+0.761
Neg logit contributions
 prope-0.792
aters-0.763
 pads-0.754
lets-0.742
phones-0.733
Token."
Token ID526
Feature activation-7.66e-01

Pos logit contributions
aster+0.539
ore+0.517
eling+0.513
oker+0.494
 baker+0.469
Neg logit contributions
 prope-0.475
aters-0.462
 pads-0.454
lets-0.448
phones-0.440
Token<|endoftext|>
Token ID50256
Feature activation+1.11e+00

Pos logit contributions
 prope+1.120
aters+1.034
 pads+1.016
lets+0.998
 mate+0.991
Neg logit contributions
aster-1.529
ore-1.471
eling-1.414
oker-1.392
 baker-1.344
TokenJimmy
Token ID40335
Feature activation-8.18e-02

Pos logit contributions
aster+1.525
eling+1.461
oker+1.344
ore+1.328
 baker+1.251
Neg logit contributions
aters-2.267
 pads-2.236
 prope-2.233
 Tess-2.174
 Dog-2.171
Token was
Token ID373
Feature activation+3.10e-01

Pos logit contributions
 prope+0.117
aters+0.113
 pads+0.110
lets+0.108
phones+0.107
Neg logit contributions
aster-0.161
ore-0.155
eling-0.153
oker-0.147
 restaurant-0.142
Token a
Token ID257
Feature activation+3.34e-01

Pos logit contributions
aster+0.590
ore+0.567
eling+0.564
oker+0.545
 restaurant+0.516
Neg logit contributions
 prope-0.466
aters-0.452
 pads-0.444
lets-0.434
phones-0.428
Token depend
Token ID4745
Feature activation+3.93e+00

Pos logit contributions
aster+0.643
ore+0.621
eling+0.616
oker+0.593
 restaurant+0.561
Neg logit contributions
 prope-0.493
aters-0.472
 pads-0.465
lets-0.458
phones-0.448
Tokenable
Token ID540
Feature activation+1.88e-01

Pos logit contributions
eling+5.164
aster+4.933
 salsa+4.616
oons+4.043
 restaurant+3.997
Neg logit contributions
un-9.003
 pads-8.536
aters-8.290
lets-8.246
 Tess-8.172
Token young
Token ID1862
Feature activation+4.98e-01

Pos logit contributions
aster+0.320
ore+0.306
eling+0.304
oker+0.291
 baker+0.275
Neg logit contributions
 prope-0.317
aters-0.308
 pads-0.303
lets-0.300
phones-0.295
Token boy
Token ID2933
Feature activation-1.21e-01

Pos logit contributions
aster+0.625
ore+0.582
eling+0.581
oker+0.539
 restaurant+0.488
Neg logit contributions
 prope-1.079
aters-1.050
 pads-1.036
lets-1.029
 Tess-1.013
Token.
Token ID13
Feature activation+6.18e-01

Pos logit contributions
 prope+0.200
aters+0.193
 pads+0.189
lets+0.187
phones+0.184
Neg logit contributions
aster-0.212
ore-0.204
eling-0.200
oker-0.192
 baker-0.184
Token Whenever
Token ID21326
Feature activation+1.01e+00

Pos logit contributions
aster+1.185
eling+1.139
ore+1.132
oker+1.082
 baker+1.030
Neg logit contributions
 prope-0.921
aters-0.912
 pads-0.890
lets-0.864
 Tess-0.844
Token!
Token ID0
Feature activation+1.13e+00

Pos logit contributions
aster+1.675
ore+1.605
eling+1.593
oker+1.550
 baker+1.474
Neg logit contributions
 prope-1.100
 pads-1.068
aters-1.066
lets-1.043
phones-0.987
Token
Token ID198
Feature activation+6.13e-01

Pos logit contributions
eling+2.165
aster+2.155
ore+2.049
oker+2.010
 restaurant+1.918
Neg logit contributions
 prope-1.660
aters-1.643
 pads-1.596
 Tess-1.588
 Dog-1.568
Token
Token ID198
Feature activation+3.35e-01

Pos logit contributions
aster+1.168
eling+1.138
ore+1.116
oker+1.088
 restaurant+1.035
Neg logit contributions
 prope-0.899
aters-0.874
 pads-0.852
lets-0.837
 Tess-0.832
TokenThe
Token ID464
Feature activation+2.39e+00

Pos logit contributions
aster+0.670
ore+0.650
eling+0.644
oker+0.621
 restaurant+0.595
Neg logit contributions
 prope-0.469
aters-0.453
 pads-0.444
lets-0.438
phones-0.429
Token foolish
Token ID19538
Feature activation+2.42e+00

Pos logit contributions
aster+5.133
ore+5.100
eling+4.893
oker+4.864
roads+4.508
Neg logit contributions
 prope-3.322
 neck-3.088
 tail-3.041
 scarf-3.003
Brown-2.997
Token muff
Token ID27563
Feature activation+3.93e+00

Pos logit contributions
aster+4.468
eling+4.384
ore+4.233
oker+4.091
roads+3.947
Neg logit contributions
 prope-4.003
 tail-3.780
 neck-3.773
lets-3.748
 mate-3.721
Tokenin
Token ID259
Feature activation+1.53e+00

Pos logit contributions
aster+4.947
 restaurant+3.722
ore+3.699
oker+3.604
 salsa+3.593
Neg logit contributions
lets-9.696
un-9.083
aters-8.729
tails-8.694
 forever-8.660
Token was
Token ID373
Feature activation+6.63e-01

Pos logit contributions
aster+2.970
ore+2.859
eling+2.821
oker+2.785
 restaurant+2.600
Neg logit contributions
lets-2.236
 prope-2.208
 pads-2.194
aters-2.133
phones-2.066
Token very
Token ID845
Feature activation+2.50e+00

Pos logit contributions
aster+1.436
ore+1.404
eling+1.372
oker+1.351
 restaurant+1.281
Neg logit contributions
 prope-0.822
 pads-0.774
aters-0.769
lets-0.754
phones-0.726
Token pleased
Token ID10607
Feature activation+2.06e-01

Pos logit contributions
aster+5.252
ore+5.038
eling+4.833
roads+4.659
oker+4.649
Neg logit contributions
 prope-3.606
 tail-3.422
 neck-3.385
lets-3.358
 no-3.339
Token.
Token ID13
Feature activation+9.07e-01

Pos logit contributions
aster+0.371
ore+0.355
eling+0.353
oker+0.338
 baker+0.322
Neg logit contributions
 prope-0.330
aters-0.322
 pads-0.316
lets-0.313
phones-0.307
Token and
Token ID290
Feature activation-1.30e+00

Pos logit contributions
 prope+1.631
aters+1.509
lets+1.501
phones+1.492
 pads+1.490
Neg logit contributions
ore-1.824
aster-1.775
eling-1.670
 restaurant-1.652
 baker-1.626
Token happy
Token ID3772
Feature activation-7.65e-01

Pos logit contributions
 prope+1.980
aters+1.965
phones+1.907
lets+1.889
 pads+1.841
Neg logit contributions
aster-2.446
eling-2.367
ore-2.344
oker-2.187
 restaurant-2.182
Token.
Token ID13
Feature activation-9.15e-01

Pos logit contributions
 prope+1.427
aters+1.369
 pads+1.345
phones+1.329
lets+1.328
Neg logit contributions
ore-1.189
aster-1.184
eling-1.108
oker-1.073
 restaurant-1.069
Token
Token ID198
Feature activation-2.87e-01

Pos logit contributions
 prope+1.551
aters+1.472
lets+1.460
 pads+1.448
phones+1.431
Neg logit contributions
aster-1.620
ore-1.572
eling-1.439
 restaurant-1.437
oker-1.428
Token
Token ID198
Feature activation-1.27e+00

Pos logit contributions
 prope+0.483
aters+0.468
 pads+0.459
lets+0.456
phones+0.449
Neg logit contributions
aster-0.501
ore-0.485
eling-0.466
oker-0.450
 restaurant-0.435
TokenDown
Token ID8048
Feature activation+3.89e+00

Pos logit contributions
 prope+1.869
aters+1.795
lets+1.732
phones+1.696
 pads+1.693
Neg logit contributions
aster-2.652
ore-2.474
eling-2.394
 restaurant-2.383
oker-2.369
Token on
Token ID319
Feature activation+3.85e-01

Pos logit contributions
oker+8.382
aster+8.349
 salsa+7.972
eling+7.826
aughter+7.781
Neg logit contributions
 prope-5.502
 pads-5.177
 Dog-5.056
 Ch-4.922
Snow-4.915
Token the
Token ID262
Feature activation-2.84e-01

Pos logit contributions
aster+0.550
ore+0.525
eling+0.522
oker+0.500
 baker+0.463
Neg logit contributions
 prope-0.763
aters-0.740
 pads-0.729
lets-0.718
phones-0.715
Token ground
Token ID2323
Feature activation+1.22e+00

Pos logit contributions
 prope+0.494
aters+0.488
lets+0.478
 pads+0.477
phones+0.466
Neg logit contributions
aster-0.479
ore-0.454
eling-0.446
oker-0.422
 restaurant-0.415
Token,
Token ID11
Feature activation+7.71e-01

Pos logit contributions
aster+2.703
eling+2.634
oker+2.561
ore+2.534
 baker+2.460
Neg logit contributions
 prope-1.376
 pads-1.365
aters-1.319
lets-1.276
 Dog-1.263
Token there
Token ID612
Feature activation-1.28e-02

Pos logit contributions
aster+1.391
ore+1.325
eling+1.320
oker+1.279
 baker+1.195
Neg logit contributions
 prope-1.258
 pads-1.184
aters-1.182
 Tess-1.159
lets-1.153
Token gifts
Token ID13201
Feature activation-6.89e-01

Pos logit contributions
 prope+0.436
aters+0.427
 pads+0.417
lets+0.412
phones+0.409
Neg logit contributions
aster-0.402
ore-0.379
eling-0.374
oker-0.359
 restaurant-0.348
Token.
Token ID13
Feature activation-4.74e-01

Pos logit contributions
 prope+1.260
aters+1.209
 pads+1.186
lets+1.167
phones+1.167
Neg logit contributions
aster-1.082
ore-1.062
eling-1.009
oker-0.986
 restaurant-0.957
Token
Token ID198
Feature activation+1.13e-02

Pos logit contributions
 prope+0.783
aters+0.747
 pads+0.735
lets+0.732
phones+0.724
Neg logit contributions
aster-0.847
ore-0.824
oker-0.769
eling-0.768
 baker-0.739
Token
Token ID198
Feature activation-8.53e-02

Pos logit contributions
aster+0.020
ore+0.020
eling+0.019
oker+0.019
 restaurant+0.018
Neg logit contributions
 prope-0.018
aters-0.018
 pads-0.017
lets-0.017
phones-0.017
TokenThe
Token ID464
Feature activation+2.00e+00

Pos logit contributions
 prope+0.120
aters+0.116
 pads+0.113
lets+0.112
phones+0.110
Neg logit contributions
aster-0.173
ore-0.166
eling-0.164
oker-0.159
 baker-0.152
Token j
Token ID474
Feature activation+3.88e+00

Pos logit contributions
ore+3.231
aster+3.180
eling+3.168
oker+2.878
ying+2.677
Neg logit contributions
 prope-3.717
 scarf-3.558
 pads-3.552
 Dog-3.540
 tail-3.518
Tokenolly
Token ID5098
Feature activation+2.18e+00

Pos logit contributions
eling+5.631
roads+4.529
 ang+4.487
aster+4.352
ore+3.919
Neg logit contributions
aters-8.898
 pads-8.373
 mate-8.180
lets-8.032
un-7.877
Token man
Token ID582
Feature activation+1.96e+00

Pos logit contributions
ore+3.176
eling+3.154
aster+3.073
oker+2.751
roads+2.633
Neg logit contributions
 pads-4.423
 scarf-4.284
lets-4.271
 tail-4.265
 prope-4.238
Token gives
Token ID3607
Feature activation+8.05e-01

Pos logit contributions
aster+3.270
ore+3.118
oker+3.028
eling+3.004
roads+2.644
Neg logit contributions
lets-3.521
 pads-3.516
 prope-3.513
aters-3.460
phones-3.309
Token them
Token ID606
Feature activation-7.25e-01

Pos logit contributions
aster+1.504
ore+1.442
eling+1.437
oker+1.393
 baker+1.305
Neg logit contributions
 prope-1.222
aters-1.221
 pads-1.203
lets-1.182
 Tess-1.164
Token each
Token ID1123
Feature activation+3.90e-01

Pos logit contributions
 prope+1.212
aters+1.153
 pads+1.123
lets+1.097
phones+1.094
Neg logit contributions
aster-1.287
ore-1.224
eling-1.191
 restaurant-1.149
oker-1.139
TokenL
Token ID43
Feature activation+2.38e+00

Pos logit contributions
aster+1.585
eling+1.554
ore+1.549
oker+1.479
 baker+1.431
Neg logit contributions
 prope-1.004
 pads-0.976
aters-0.971
Snow-0.958
lets-0.943
Tokenily
Token ID813
Feature activation+3.23e-02

Pos logit contributions
aster+4.082
eling+4.071
ore+3.991
 restaurant+3.719
roads+3.662
Neg logit contributions
 pads-3.883
aters-3.862
 mate-3.811
lets-3.703
 prope-3.663
Token looked
Token ID3114
Feature activation+1.12e+00

Pos logit contributions
aster+0.056
ore+0.054
eling+0.053
oker+0.051
 baker+0.048
Neg logit contributions
 prope-0.054
aters-0.053
 pads-0.052
lets-0.051
phones-0.050
Token at
Token ID379
Feature activation+1.85e-01

Pos logit contributions
aster+1.823
eling+1.625
oker+1.624
ore+1.618
 baker+1.491
Neg logit contributions
 prope-2.081
 pads-2.067
aters-2.019
lets-2.001
 tail-1.940
Token the
Token ID262
Feature activation+2.60e+00

Pos logit contributions
aster+0.308
ore+0.295
eling+0.292
oker+0.280
 baker+0.265
Neg logit contributions
 prope-0.322
aters-0.314
 pads-0.309
lets-0.304
phones-0.300
Token muff
Token ID27563
Feature activation+3.88e+00

Pos logit contributions
aster+4.561
eling+4.438
oker+4.426
ore+4.409
roads+4.026
Neg logit contributions
 prope-4.648
 scarf-4.346
 neck-4.319
 tail-4.287
aters-4.245
Tokenin
Token ID259
Feature activation+1.64e+00

Pos logit contributions
aster+4.336
roads+3.486
oker+3.460
 restaurant+3.403
eling+3.317
Neg logit contributions
lets-9.287
aters-9.211
un-8.879
 pads-8.834
 scarf-8.801
Token.
Token ID13
Feature activation+8.40e-01

Pos logit contributions
aster+3.246
eling+3.145
oker+3.096
ore+3.056
 baker+2.904
Neg logit contributions
 pads-2.334
 prope-2.332
lets-2.295
aters-2.293
 neck-2.193
Token It
Token ID632
Feature activation+2.75e-01

Pos logit contributions
aster+1.836
eling+1.806
ore+1.744
oker+1.702
 restaurant+1.634
Neg logit contributions
 prope-1.027
aters-1.006
 pads-0.977
lets-0.949
 Tess-0.943
Token was
Token ID373
Feature activation-2.43e-01

Pos logit contributions
aster+0.502
ore+0.483
eling+0.476
oker+0.461
 baker+0.438
Neg logit contributions
 prope-0.436
aters-0.422
 pads-0.417
lets-0.411
phones-0.402
Token still
Token ID991
Feature activation+5.30e-01

Pos logit contributions
 prope+0.361
aters+0.353
 pads+0.343
lets+0.343
phones+0.339
Neg logit contributions
aster-0.467
eling-0.444
ore-0.442
oker-0.422
 restaurant-0.409
Token boy
Token ID2933
Feature activation+4.86e-01

Pos logit contributions
eling+8.671
aster+8.590
ore+8.572
oker+8.121
roads+7.904
Neg logit contributions
 tail-3.625
 prope-3.546
 mate-3.422
 aunt-3.421
 no-3.396
Token was
Token ID373
Feature activation-1.95e-02

Pos logit contributions
aster+0.885
ore+0.850
eling+0.840
oker+0.814
 baker+0.766
Neg logit contributions
 prope-0.765
aters-0.737
 pads-0.733
lets-0.726
phones-0.708
Token so
Token ID523
Feature activation+1.22e+00

Pos logit contributions
 prope+0.029
aters+0.028
 pads+0.028
lets+0.028
phones+0.027
Neg logit contributions
aster-0.037
ore-0.036
eling-0.035
oker-0.034
 baker-0.033
Token relieved
Token ID22598
Feature activation+2.45e-01

Pos logit contributions
aster+2.279
eling+2.127
ore+2.122
oker+2.048
 restaurant+1.960
Neg logit contributions
 prope-1.956
 Dog-1.818
 pads-1.818
aters-1.805
 Tess-1.784
Token!
Token ID0
Feature activation+8.77e-01

Pos logit contributions
aster+0.462
ore+0.443
eling+0.440
oker+0.425
 baker+0.405
Neg logit contributions
 prope-0.370
aters-0.360
 pads-0.354
lets-0.349
phones-0.343
Token The
Token ID383
Feature activation+3.86e+00

Pos logit contributions
aster+1.614
eling+1.566
ore+1.508
oker+1.460
 restaurant+1.402
Neg logit contributions
 prope-1.377
aters-1.364
 pads-1.339
lets-1.296
 Tess-1.295
Token baker
Token ID46412
Feature activation+6.99e-01

Pos logit contributions
eling+8.643
aster+8.528
ore+8.342
oker+7.999
ying+7.809
Neg logit contributions
 tail-5.340
 prope-5.274
 aunt-5.152
 scarf-5.056
 no-5.032
Token was
Token ID373
Feature activation+4.90e-02

Pos logit contributions
aster+1.267
ore+1.221
eling+1.204
oker+1.170
 restaurant+1.101
Neg logit contributions
 prope-1.104
aters-1.057
lets-1.053
 pads-1.047
phones-1.018
Token so
Token ID523
Feature activation+1.26e+00

Pos logit contributions
aster+0.096
ore+0.092
eling+0.091
oker+0.088
 baker+0.084
Neg logit contributions
 prope-0.071
aters-0.069
 pads-0.068
lets-0.067
phones-0.065
Token patient
Token ID5827
Feature activation+3.56e-01

Pos logit contributions
aster+2.345
ore+2.191
eling+2.188
oker+2.103
 restaurant+2.006
Neg logit contributions
 prope-2.022
 Dog-1.869
 pads-1.854
aters-1.851
 Tess-1.849
Token even
Token ID772
Feature activation+2.46e+00

Pos logit contributions
aster+0.718
ore+0.689
eling+0.684
oker+0.662
 baker+0.632
Neg logit contributions
 prope-0.492
aters-0.475
 pads-0.468
lets-0.459
phones-0.450
Token flew
Token ID13112
Feature activation-5.26e-01

Pos logit contributions
aster+0.840
ore+0.809
eling+0.797
oker+0.779
 restaurant+0.737
Neg logit contributions
 prope-0.639
aters-0.612
 pads-0.611
lets-0.606
phones-0.583
Token Sally
Token ID25737
Feature activation-1.65e-02

Pos logit contributions
 prope+1.118
aters+1.103
 pads+1.077
phones+1.071
lets+1.068
Neg logit contributions
aster-0.659
ore-0.646
eling-0.612
oker-0.582
 baker-0.554
Token up
Token ID510
Feature activation+1.88e+00

Pos logit contributions
 prope+0.028
aters+0.027
 pads+0.027
lets+0.026
phones+0.026
Neg logit contributions
aster-0.028
ore-0.027
eling-0.026
oker-0.025
 restaurant-0.024
Token to
Token ID284
Feature activation+9.30e-01

Pos logit contributions
aster+4.150
eling+4.069
oker+3.935
ore+3.813
 salsa+3.622
Neg logit contributions
 prope-2.501
aters-2.280
 tail-2.262
 pads-2.228
 neck-2.216
Token the
Token ID262
Feature activation+1.97e+00

Pos logit contributions
aster+1.599
ore+1.575
eling+1.557
oker+1.518
 baker+1.401
Neg logit contributions
 prope-1.505
 Tess-1.468
aters-1.466
 Dog-1.448
 pads-1.448
Token muff
Token ID27563
Feature activation+3.86e+00

Pos logit contributions
aster+3.730
eling+3.584
oker+3.556
ore+3.528
ying+3.244
Neg logit contributions
 prope-3.253
 neck-2.990
aters-2.935
 tail-2.934
 pads-2.895
Tokenin
Token ID259
Feature activation+1.35e+00

Pos logit contributions
aster+4.256
oker+3.767
 salsa+3.435
eling+3.392
 restaurant+3.297
Neg logit contributions
lets-9.370
aters-9.189
un-9.002
tails-8.731
 pads-8.683
Token.
Token ID13
Feature activation+9.61e-01

Pos logit contributions
aster+2.755
eling+2.718
oker+2.632
ore+2.591
 baker+2.419
Neg logit contributions
 prope-1.787
 pads-1.787
aters-1.784
lets-1.777
phones-1.682
Token Sally
Token ID25737
Feature activation+2.28e-01

Pos logit contributions
aster+2.056
eling+2.017
ore+1.967
oker+1.898
 restaurant+1.829
Neg logit contributions
 prope-1.220
aters-1.209
 pads-1.184
 Tess-1.160
lets-1.159
Token took
Token ID1718
Feature activation+9.44e-01

Pos logit contributions
aster+0.410
ore+0.393
eling+0.388
oker+0.376
 restaurant+0.356
Neg logit contributions
 prope-0.365
aters-0.354
 pads-0.349
lets-0.345
phones-0.338
Token a
Token ID257
Feature activation+1.05e+00

Pos logit contributions
aster+1.874
eling+1.852
ore+1.834
oker+1.781
 restaurant+1.700
Neg logit contributions
 prope-1.294
aters-1.249
 Tess-1.239
 pads-1.236
lets-1.233
Token,
Token ID11
Feature activation+1.68e+00

Pos logit contributions
aster+1.413
eling+1.385
ore+1.365
oker+1.334
 baker+1.265
Neg logit contributions
 prope-0.781
aters-0.744
 pads-0.730
lets-0.721
 Dog-0.711
Token something
Token ID1223
Feature activation+2.34e+00

Pos logit contributions
eling+3.009
ore+3.007
aster+2.949
oker+2.941
ying+2.701
Neg logit contributions
 prope-2.641
 Dog-2.594
 Tess-2.538
 Bun-2.535
 Brown-2.508
Token special
Token ID2041
Feature activation+1.10e+00

Pos logit contributions
aster+4.903
ore+4.815
eling+4.704
oker+4.620
roads+4.311
Neg logit contributions
 Bun-3.016
 prope-3.010
 scarf-2.964
 Dog-2.929
 Tess-2.900
Token happened
Token ID3022
Feature activation+4.96e-01

Pos logit contributions
aster+1.693
eling+1.622
ore+1.576
oker+1.500
 baker+1.404
Neg logit contributions
 prope-2.006
aters-1.967
 pads-1.966
lets-1.952
 Tess-1.915
Token.
Token ID13
Feature activation+1.10e+00

Pos logit contributions
aster+0.908
eling+0.866
ore+0.863
oker+0.838
 baker+0.798
Neg logit contributions
 prope-0.772
aters-0.755
 pads-0.751
lets-0.734
phones-0.723
Token The
Token ID383
Feature activation+3.86e+00

Pos logit contributions
eling+2.140
aster+2.111
ore+2.032
oker+2.005
roads+1.844
Neg logit contributions
 prope-1.588
aters-1.540
 Tess-1.498
 Dog-1.491
 pads-1.491
Token baker
Token ID46412
Feature activation+1.31e+00

Pos logit contributions
eling+7.988
ore+7.979
aster+7.748
oker+7.729
roads+7.284
Neg logit contributions
 tail-5.866
 aunt-5.775
 prope-5.734
 scarf-5.709
 neck-5.593
Token set
Token ID900
Feature activation+1.66e+00

Pos logit contributions
aster+2.513
ore+2.439
eling+2.401
oker+2.374
 restaurant+2.185
Neg logit contributions
lets-1.901
 prope-1.895
aters-1.827
 pads-1.819
 Tess-1.773
Token out
Token ID503
Feature activation+1.79e+00

Pos logit contributions
eling+3.081
aster+3.040
ore+3.010
oker+2.999
 restaurant+2.726
Neg logit contributions
 prope-2.481
 Dog-2.387
lets-2.374
 Tess-2.368
aters-2.361
Token to
Token ID284
Feature activation+2.20e+00

Pos logit contributions
aster+3.327
eling+3.301
oker+3.292
ore+3.204
 baker+3.069
Neg logit contributions
 prope-2.611
aters-2.600
 pads-2.492
phones-2.484
lets-2.478
Token bake
Token ID28450
Feature activation+1.41e+00

Pos logit contributions
ore+4.091
oker+3.953
eling+3.829
ying+3.669
aster+3.612
Neg logit contributions
 prope-3.442
 Tess-3.382
 Dog-3.341
 mate-3.196
 scarf-3.188
Token wait
Token ID4043
Feature activation+8.24e-02

Pos logit contributions
aster+0.545
ore+0.525
eling+0.518
oker+0.496
 restaurant+0.471
Neg logit contributions
 prope-0.586
aters-0.566
 pads-0.557
lets-0.552
phones-0.547
Token to
Token ID284
Feature activation+1.91e-01

Pos logit contributions
aster+0.146
ore+0.140
eling+0.139
oker+0.133
 baker+0.127
Neg logit contributions
 prope-0.134
aters-0.130
 pads-0.128
lets-0.126
phones-0.124
Token eat
Token ID4483
Feature activation+1.44e+00

Pos logit contributions
aster+0.319
ore+0.307
eling+0.303
oker+0.291
 baker+0.275
Neg logit contributions
 prope-0.332
aters-0.323
 pads-0.317
lets-0.314
phones-0.310
Token the
Token ID262
Feature activation+2.28e+00

Pos logit contributions
aster+2.519
ore+2.452
oker+2.414
eling+2.413
 baker+2.224
Neg logit contributions
aters-2.313
 prope-2.294
 pads-2.253
 Tess-2.214
lets-2.174
Token tasty
Token ID25103
Feature activation+2.58e+00

Pos logit contributions
aster+4.139
ore+3.844
oker+3.803
eling+3.791
roads+3.470
Neg logit contributions
 prope-3.927
aters-3.641
 scarf-3.629
Brown-3.591
Snow-3.586
Token muff
Token ID27563
Feature activation+3.85e+00

Pos logit contributions
aster+4.061
eling+3.850
oker+3.734
ore+3.641
roads+3.451
Neg logit contributions
aters-4.855
 prope-4.796
 scarf-4.771
 Brown-4.729
 neck-4.724
Tokenins
Token ID1040
Feature activation+1.08e+00

Pos logit contributions
aster+4.566
eling+3.736
roads+3.616
oker+3.453
atches+3.405
Neg logit contributions
lets-9.551
aters-9.143
 pads-8.814
un-8.716
ocular-8.625
Token.
Token ID13
Feature activation-3.86e-01

Pos logit contributions
aster+2.030
eling+1.949
ore+1.910
oker+1.840
 restaurant+1.747
Neg logit contributions
 pads-1.616
 prope-1.603
aters-1.602
lets-1.593
phones-1.532
Token<|endoftext|>
Token ID50256
Feature activation+1.57e+00

Pos logit contributions
 prope+0.601
aters+0.564
 pads+0.556
lets+0.556
phones+0.547
Neg logit contributions
aster-0.723
ore-0.703
eling-0.676
oker-0.663
 baker-0.635
TokenOnce
Token ID7454
Feature activation+8.63e-01

Pos logit contributions
aster+1.735
eling+1.648
oker+1.485
ore+1.361
 baker+1.350
Neg logit contributions
aters-3.672
 pads-3.631
 Dog-3.573
 prope-3.570
Snow-3.535
Token upon
Token ID2402
Feature activation-5.60e-01

Pos logit contributions
aster+1.605
eling+1.538
ore+1.507
oker+1.462
 baker+1.406
Neg logit contributions
 prope-1.296
aters-1.271
 pads-1.256
lets-1.234
 Tess-1.232
Token different
Token ID1180
Feature activation-8.64e-01

Pos logit contributions
 prope+0.120
aters+0.117
 pads+0.115
lets+0.114
phones+0.112
Neg logit contributions
aster-0.100
ore-0.094
eling-0.093
oker-0.088
 restaurant-0.084
Token neck
Token ID7393
Feature activation-1.49e+00

Pos logit contributions
 prope+1.405
aters+1.360
 pads+1.327
Snow+1.323
 Dog+1.318
Neg logit contributions
aster-1.499
eling-1.456
ore-1.420
 restaurant-1.350
oker-1.334
Tokenl
Token ID75
Feature activation-1.35e+00

Pos logit contributions
 prope+2.459
aters+2.387
Brown+2.263
 Dog+2.251
 Tess+2.245
Neg logit contributions
aster-2.660
ore-2.591
eling-2.463
 restaurant-2.323
 baker-2.299
Tokenaces
Token ID2114
Feature activation-2.06e+00

Pos logit contributions
 prope+1.793
 Tess+1.634
Snow+1.622
aters+1.617
Brown+1.600
Neg logit contributions
aster-2.819
ore-2.773
eling-2.659
oker-2.583
 baker-2.510
Token,
Token ID11
Feature activation-1.44e+00

Pos logit contributions
 prope+3.604
aters+3.413
Brown+3.402
Snow+3.336
 Dog+3.332
Neg logit contributions
ore-3.555
aster-3.442
eling-3.180
he-3.159
ying-3.114
Token brace
Token ID22581
Feature activation-3.85e+00

Pos logit contributions
aters+2.139
 prope+2.066
un+1.959
Brown+1.947
phones+1.947
Neg logit contributions
aster-2.787
eling-2.624
ore-2.454
 baker-2.423
 restaurant-2.421
Tokenlets
Token ID5289
Feature activation-1.87e+00

Pos logit contributions
 prope+3.299
 Dog+3.283
Snow+2.714
Brown+2.617
 Tess+2.486
Neg logit contributions
ore-9.989
led-9.701
aster-9.489
eling-9.261
oker-9.135
Token,
Token ID11
Feature activation-1.91e+00

Pos logit contributions
 prope+3.223
aters+3.074
Brown+3.018
Snow+2.986
Wh+2.971
Neg logit contributions
ore-3.150
aster-3.126
eling-2.920
 baker-2.839
oker-2.772
Token and
Token ID290
Feature activation-8.01e-01

Pos logit contributions
aters+3.027
 prope+2.955
Brown+2.822
un+2.821
 Tess+2.798
Neg logit contributions
aster-3.490
eling-3.205
 baker-3.051
ore-3.040
 restaurant-3.014
Token rings
Token ID13917
Feature activation-1.91e+00

Pos logit contributions
 prope+1.196
aters+1.192
 pads+1.106
Snow+1.102
phones+1.098
Neg logit contributions
aster-1.522
eling-1.427
ore-1.408
oker-1.364
 baker-1.335
Token.
Token ID13
Feature activation-3.42e-01

Pos logit contributions
 prope+3.650
aters+3.551
Brown+3.448
 pads+3.389
Snow+3.379
Neg logit contributions
aster-2.761
ore-2.735
eling-2.635
 baker-2.428
 restaurant-2.419
Token big
Token ID1263
Feature activation-8.72e-01

Pos logit contributions
aters+1.846
lets+1.777
 prope+1.759
 pads+1.755
phones+1.729
Neg logit contributions
aster-1.937
ore-1.881
oker-1.787
 restaurant-1.781
eling-1.773
Token,
Token ID11
Feature activation-5.89e-01

Pos logit contributions
aters+1.315
 prope+1.315
lets+1.291
 pads+1.282
phones+1.272
Neg logit contributions
aster-1.627
ore-1.564
eling-1.517
oker-1.492
 restaurant-1.475
Token white
Token ID2330
Feature activation-1.54e+00

Pos logit contributions
aters+0.913
 prope+0.909
lets+0.884
 pads+0.867
phones+0.866
Neg logit contributions
aster-1.074
ore-1.025
eling-1.013
oker-0.977
 restaurant-0.950
Token pup
Token ID15552
Feature activation-1.35e+00

Pos logit contributions
 prope+2.433
aters+2.373
lets+2.371
 pads+2.322
phones+2.319
Neg logit contributions
aster-2.834
ore-2.824
 restaurant-2.631
oker-2.561
 baker-2.514
Token with
Token ID351
Feature activation-2.97e+00

Pos logit contributions
 prope+2.537
aters+2.448
lets+2.327
 pads+2.320
phones+2.317
Neg logit contributions
ore-2.113
aster-2.084
eling-1.928
oker-1.873
 restaurant-1.851
Token floppy
Token ID49415
Feature activation-3.82e+00

Pos logit contributions
aters+5.744
lets+5.387
 prope+5.346
phones+5.240
Brown+5.061
Neg logit contributions
aster-4.705
ore-4.557
eling-4.491
 salsa-4.172
 restaurant-4.025
Token ears
Token ID11368
Feature activation-2.55e+00

Pos logit contributions
Brown+5.179
 mate+5.131
 prope+5.130
Wh+5.025
aters+4.940
Neg logit contributions
eling-7.594
aster-7.520
ore-7.430
 salsa-6.980
 restaurant-6.940
Token and
Token ID290
Feature activation-2.47e+00

Pos logit contributions
 prope+4.853
Brown+4.760
 Tess+4.741
Wh+4.737
aters+4.699
Neg logit contributions
ore-3.947
aster-3.711
eling-3.441
 salsa-3.203
 restaurant-3.185
Token a
Token ID257
Feature activation-1.96e+00

Pos logit contributions
aters+4.011
 prope+3.833
Brown+3.705
phones+3.692
lets+3.669
Neg logit contributions
aster-4.473
eling-4.237
ore-4.195
 salsa-4.115
 restaurant-3.971
Token curly
Token ID45731
Feature activation-2.06e+00

Pos logit contributions
aters+3.247
lets+3.029
phones+3.021
 pads+2.981
 prope+2.953
Neg logit contributions
aster-3.429
ore-3.427
oker-3.189
eling-3.188
 restaurant-3.152
Token tail
Token ID7894
Feature activation-1.98e+00

Pos logit contributions
aters+2.198
 prope+2.171
phones+2.108
Brown+2.079
ocular+2.067
Neg logit contributions
aster-4.812
ore-4.548
eling-4.437
oker-4.259
 salsa-4.223
Token a
Token ID257
Feature activation+2.30e-02

Pos logit contributions
 prope+1.912
lets+1.851
aters+1.838
phones+1.834
 pads+1.780
Neg logit contributions
aster-2.388
ore-2.181
eling-2.098
 restaurant-2.080
 baker-2.073
Token little
Token ID1310
Feature activation+3.80e-02

Pos logit contributions
aster+0.045
ore+0.043
eling+0.043
oker+0.042
 baker+0.040
Neg logit contributions
 prope-0.033
aters-0.032
 pads-0.031
lets-0.031
phones-0.030
Token rabbit
Token ID22746
Feature activation-7.56e-01

Pos logit contributions
aster+0.048
ore+0.046
eling+0.045
oker+0.042
 baker+0.040
Neg logit contributions
 prope-0.081
aters-0.080
 pads-0.079
lets-0.078
phones-0.077
Token named
Token ID3706
Feature activation-1.89e+00

Pos logit contributions
 prope+1.055
aters+1.026
 pads+0.986
phones+0.971
lets+0.961
Neg logit contributions
aster-1.556
ore-1.523
eling-1.442
oker-1.396
 restaurant-1.376
Token Bun
Token ID28515
Feature activation-1.95e+00

Pos logit contributions
 prope+2.903
aters+2.600
lets+2.460
phones+2.428
 pads+2.403
Neg logit contributions
aster-3.958
ore-3.807
eling-3.600
 restaurant-3.540
 salsa-3.470
Token.
Token ID13
Feature activation-3.82e+00

Pos logit contributions
 prope+3.896
aters+3.684
phones+3.674
lets+3.461
 Tess+3.420
Neg logit contributions
aster-2.938
ore-2.904
eling-2.811
in-2.540
oker-2.473
Token Bun
Token ID28515
Feature activation-1.72e+00

Pos logit contributions
lets+6.100
 prope+5.631
 mate+5.560
phones+5.509
 pads+5.376
Neg logit contributions
ore-7.778
he-6.843
-6.777
aster-6.678
eling-6.625
Token had
Token ID550
Feature activation-3.22e+00

Pos logit contributions
 prope+2.784
aters+2.685
phones+2.639
lets+2.468
 mate+2.443
Neg logit contributions
ore-3.163
eling-3.103
aster-3.096
 restaurant-2.817
oker-2.809
Token big
Token ID1263
Feature activation-2.45e+00

Pos logit contributions
aters+6.645
lets+6.366
 prope+6.177
phones+6.177
ocular+5.745
Neg logit contributions
aster-5.147
ore-4.545
 baker-4.225
 restaurant-4.146
 salsa-4.128
Token ears
Token ID11368
Feature activation-2.40e+00

Pos logit contributions
Snow+3.856
phones+3.822
Brown+3.786
 prope+3.775
Wh+3.696
Neg logit contributions
eling-4.260
aster-4.247
ore-4.138
 restaurant-4.125
 salsa-3.974
Token that
Token ID326
Feature activation-1.66e+00

Pos logit contributions
 prope+4.767
aters+4.660
phones+4.522
 Tess+4.444
Wh+4.420
Neg logit contributions
ore-3.729
aster-3.527
 baker-3.148
eling-3.108
 restaurant-3.095
Token ch
Token ID442
Feature activation+1.40e+00

Pos logit contributions
aster+0.429
ore+0.414
eling+0.410
oker+0.396
 restaurant+0.377
Neg logit contributions
 prope-0.298
aters-0.287
 pads-0.281
lets-0.277
phones-0.271
Tokenubby
Token ID38393
Feature activation-2.90e-01

Pos logit contributions
aster+2.201
eling+2.126
ore+2.016
oker+1.926
 restaurant+1.879
Neg logit contributions
 prope-2.625
aters-2.494
 mate-2.428
un-2.426
lets-2.414
Token bunny
Token ID44915
Feature activation-7.70e-01

Pos logit contributions
 prope+0.421
aters+0.412
 pads+0.405
lets+0.396
phones+0.396
Neg logit contributions
aster-0.561
ore-0.542
eling-0.528
oker-0.515
 baker-0.505
Token named
Token ID3706
Feature activation-1.75e+00

Pos logit contributions
 prope+1.083
aters+1.052
 pads+1.010
phones+1.001
lets+0.979
Neg logit contributions
aster-1.568
ore-1.552
eling-1.458
oker-1.419
 restaurant-1.399
Token Bun
Token ID28515
Feature activation-1.91e+00

Pos logit contributions
 prope+2.725
aters+2.491
lets+2.401
phones+2.361
 pads+2.331
Neg logit contributions
aster-3.548
ore-3.412
eling-3.203
 restaurant-3.185
 baker-3.107
Token.
Token ID13
Feature activation-3.80e+00

Pos logit contributions
 prope+3.967
phones+3.773
aters+3.772
lets+3.558
 Tess+3.546
Neg logit contributions
aster-2.706
ore-2.689
eling-2.591
in-2.298
 restaurant-2.276
Token Bun
Token ID28515
Feature activation-1.50e+00

Pos logit contributions
lets+5.687
 prope+5.120
 mate+5.114
 pads+5.044
phones+5.037
Neg logit contributions
ore-8.334
-7.332
he-7.329
aster-7.152
eling-7.032
Token loved
Token ID6151
Feature activation-2.02e-01

Pos logit contributions
 prope+2.432
aters+2.357
phones+2.336
lets+2.201
 mate+2.141
Neg logit contributions
ore-2.710
aster-2.678
eling-2.672
 restaurant-2.434
oker-2.411
Token to
Token ID284
Feature activation-5.58e-01

Pos logit contributions
 prope+0.319
aters+0.309
 pads+0.301
lets+0.300
phones+0.295
Neg logit contributions
aster-0.369
ore-0.354
eling-0.348
oker-0.332
 restaurant-0.324
Token eat
Token ID4483
Feature activation+7.14e-01

Pos logit contributions
 prope+0.903
aters+0.877
 pads+0.861
lets+0.858
phones+0.842
Neg logit contributions
aster-1.035
ore-0.962
eling-0.948
oker-0.902
 restaurant-0.888
Token carrots
Token ID34397
Feature activation+1.35e-01

Pos logit contributions
aster+1.292
ore+1.229
eling+1.228
oker+1.209
 baker+1.113
Neg logit contributions
 prope-1.137
aters-1.119
 pads-1.104
lets-1.075
 Dog-1.052
Token.
Token ID13
Feature activation-1.30e+00

Pos logit contributions
 prope+1.349
aters+1.302
phones+1.265
 pads+1.250
lets+1.234
Neg logit contributions
aster-1.100
ore-1.086
eling-1.020
oker-0.978
 restaurant-0.949
Token Fl
Token ID1610
Feature activation-6.93e-01

Pos logit contributions
 prope+1.856
phones+1.749
lets+1.746
aters+1.699
 pads+1.686
Neg logit contributions
ore-2.609
aster-2.558
eling-2.431
 restaurant-2.406
oker-2.350
Tokenoppy
Token ID26696
Feature activation-9.16e-01

Pos logit contributions
 prope+1.389
aters+1.302
phones+1.297
 pads+1.295
lets+1.279
Neg logit contributions
ore-0.995
aster-0.973
eling-0.937
oker-0.908
 restaurant-0.856
Token had
Token ID550
Feature activation-3.34e+00

Pos logit contributions
 prope+1.412
aters+1.371
phones+1.337
 pads+1.276
lets+1.275
Neg logit contributions
aster-1.689
ore-1.677
eling-1.626
 restaurant-1.560
oker-1.549
Token big
Token ID1263
Feature activation-2.34e+00

Pos logit contributions
aters+6.990
 prope+6.741
lets+6.730
phones+6.672
ocular+6.136
Neg logit contributions
aster-4.929
ore-4.667
 restaurant-4.349
 baker-4.244
 salsa-4.233
Token floppy
Token ID49415
Feature activation-3.79e+00

Pos logit contributions
phones+3.670
Snow+3.652
Brown+3.644
Wh+3.595
 prope+3.572
Neg logit contributions
eling-4.053
aster-4.043
ore-4.018
 restaurant-3.990
 salsa-3.783
Token ears
Token ID11368
Feature activation-2.87e+00

Pos logit contributions
Wh+5.090
 prope+5.019
 undone+5.016
aters+5.016
ocular+4.964
Neg logit contributions
 restaurant-7.420
aster-7.370
eling-7.226
ore-7.144
 early-6.946
Token and
Token ID290
Feature activation-2.79e+00

Pos logit contributions
 prope+5.893
aters+5.743
phones+5.618
Brown+5.580
Wh+5.561
Neg logit contributions
ore-4.452
aster-3.955
 baker-3.618
eling-3.598
 restaurant-3.594
Token a
Token ID257
Feature activation-2.50e+00

Pos logit contributions
aters+5.345
 prope+5.165
phones+4.978
 undone+4.713
 mate+4.618
Neg logit contributions
aster-4.544
ore-4.153
 salsa-4.097
eling-4.081
 baker-4.034
Token little
Token ID1310
Feature activation-1.73e+00

Pos logit contributions
aters+4.494
phones+4.295
lets+4.091
 pads+4.050
 prope+3.889
Neg logit contributions
aster-4.174
ore-4.136
 baker-4.135
 restaurant-4.046
 salsa-3.913
Token pink
Token ID11398
Feature activation-2.68e+00

Pos logit contributions
phones+2.713
aters+2.681
 prope+2.656
Brown+2.590
Wh+2.582
Neg logit contributions
ore-3.108
aster-2.959
 restaurant-2.905
 baker-2.852
oker-2.795
Token her
Token ID607
Feature activation-2.33e+00

Pos logit contributions
aster+0.325
ore+0.311
eling+0.309
oker+0.296
 baker+0.279
Neg logit contributions
 prope-0.347
aters-0.338
 pads-0.332
lets-0.328
phones-0.324
Token flashlight
Token ID38371
Feature activation-1.33e+00

Pos logit contributions
aters+2.784
 pads+2.645
lets+2.615
Snow+2.515
 prope+2.513
Neg logit contributions
ore-5.283
aster-5.065
eling-4.850
oker-4.695
 restaurant-4.639
Token and
Token ID290
Feature activation-6.40e-01

Pos logit contributions
 prope+2.155
aters+2.124
Snow+2.045
Brown+2.027
 Tess+2.010
Neg logit contributions
ore-2.298
aster-2.257
eling-2.117
 early-2.054
oker-2.036
Token the
Token ID262
Feature activation-1.28e+00

Pos logit contributions
 prope+1.090
aters+1.089
 pads+1.028
lets+1.011
Brown+1.011
Neg logit contributions
aster-1.085
eling-1.032
ore-1.024
oker-0.954
 restaurant-0.930
Token butterfly
Token ID35113
Feature activation-1.25e+00

Pos logit contributions
aters+1.950
 prope+1.875
lets+1.815
 pads+1.811
phones+1.786
Neg logit contributions
aster-2.400
ore-2.325
eling-2.288
oker-2.244
 baker-2.200
Token's
Token ID338
Feature activation-3.76e+00

Pos logit contributions
 prope+2.089
aters+2.086
Brown+1.965
 Tess+1.938
phones+1.933
Neg logit contributions
aster-2.108
ore-2.097
eling-2.027
oker-1.877
 baker-1.840
Token wings
Token ID12098
Feature activation-1.30e+00

Pos logit contributions
aters+6.778
Wh+6.727
Brown+6.556
Nut+6.474
lets+6.446
Neg logit contributions
ore-6.735
aster-5.787
 early-5.555
eling-5.455
 oven-5.355
Token were
Token ID547
Feature activation-2.20e+00

Pos logit contributions
aters+2.375
 prope+2.360
phones+2.278
Brown+2.274
 pads+2.240
Neg logit contributions
ore-1.988
aster-1.987
eling-1.932
oker-1.758
 baker-1.659
Token so
Token ID523
Feature activation-5.82e-01

Pos logit contributions
aters+3.699
phones+3.625
 prope+3.574
Brown+3.570
Wh+3.511
Neg logit contributions
eling-3.713
aster-3.681
ore-3.592
 early-3.263
oker-3.246
Token colorful
Token ID20239
Feature activation-1.27e+00

Pos logit contributions
 prope+0.900
aters+0.898
 pads+0.869
phones+0.867
lets+0.854
Neg logit contributions
aster-1.059
ore-1.049
eling-1.025
oker-0.992
 baker-0.945
Token!
Token ID0
Feature activation+2.12e-01

Pos logit contributions
 prope+2.398
aters+2.275
Brown+2.245
 Tess+2.229
phones+2.224
Neg logit contributions
ore-1.949
aster-1.897
eling-1.767
oker-1.729
 restaurant-1.713
Token
Token ID198
Feature activation-4.37e-01

Pos logit contributions
 prope+0.383
aters+0.371
 pads+0.364
lets+0.361
phones+0.357
Neg logit contributions
aster-0.405
ore-0.391
eling-0.381
oker-0.366
 restaurant-0.349
TokenM
Token ID44
Feature activation+1.46e+00

Pos logit contributions
 prope+0.721
aters+0.695
 pads+0.681
lets+0.674
phones+0.672
Neg logit contributions
aster-0.796
ore-0.757
eling-0.750
oker-0.720
 restaurant-0.682
Tokenilly
Token ID6548
Feature activation-1.23e+00

Pos logit contributions
aster+2.459
ore+2.323
eling+2.288
oker+2.224
roads+2.177
Neg logit contributions
aters-2.499
 pads-2.378
 Tess-2.374
 prope-2.352
lets-2.342
Token and
Token ID290
Feature activation+5.91e-01

Pos logit contributions
aters+2.027
 prope+2.011
 pads+1.886
phones+1.869
lets+1.856
Neg logit contributions
aster-2.155
eling-2.083
ore-2.066
 baker-1.910
oker-1.907
Token her
Token ID607
Feature activation-3.11e+00

Pos logit contributions
aster+1.003
ore+0.974
eling+0.953
oker+0.935
 baker+0.870
Neg logit contributions
 prope-1.007
aters-0.975
 pads-0.960
 Tess-0.955
lets-0.951
Token new
Token ID649
Feature activation-3.75e+00

Pos logit contributions
aters+4.618
lets+4.505
 prope+4.495
 pads+4.406
 mate+4.206
Neg logit contributions
aster-5.742
ore-5.713
 restaurant-5.616
eling-5.583
 baker-5.360
Token friend
Token ID1545
Feature activation-1.38e+00

Pos logit contributions
 prope+6.168
 pads+6.166
phones+5.995
Wh+5.965
aters+5.921
Neg logit contributions
 restaurant-6.699
ore-6.694
 baker-6.280
 healer-6.039
 early-5.952
Token played
Token ID2826
Feature activation+5.75e-01

Pos logit contributions
 prope+2.245
aters+2.171
phones+2.046
Snow+2.008
Brown+2.006
Neg logit contributions
aster-2.508
ore-2.386
eling-2.376
oker-2.262
 restaurant-2.237
Token together
Token ID1978
Feature activation+2.49e-01

Pos logit contributions
aster+1.062
eling+1.015
ore+1.014
oker+0.971
 baker+0.922
Neg logit contributions
 prope-0.897
aters-0.867
 pads-0.858
lets-0.844
phones-0.828
Token until
Token ID1566
Feature activation+5.92e-01

Pos logit contributions
aster+0.476
ore+0.458
eling+0.454
oker+0.438
 baker+0.417
Neg logit contributions
 prope-0.370
aters-0.359
 pads-0.353
lets-0.348
phones-0.340
Token M
Token ID337
Feature activation+7.67e-01

Pos logit contributions
aster+0.947
ore+0.927
eling+0.919
oker+0.886
 baker+0.828
Neg logit contributions
 prope-1.048
aters-1.011
 pads-1.001
 Tess-0.998
lets-0.981
Token play
Token ID711
Feature activation+8.04e-01

Pos logit contributions
 prope+0.997
aters+0.961
 pads+0.950
lets+0.930
phones+0.914
Neg logit contributions
aster-1.078
ore-1.005
eling-0.968
oker-0.953
 restaurant-0.924
Token with
Token ID351
Feature activation+1.69e-01

Pos logit contributions
aster+1.411
eling+1.364
ore+1.331
oker+1.290
 baker+1.214
Neg logit contributions
 prope-1.333
aters-1.301
 pads-1.280
lets-1.265
phones-1.243
Token lasers
Token ID34628
Feature activation-3.46e-01

Pos logit contributions
aster+0.318
ore+0.306
eling+0.303
oker+0.292
 baker+0.278
Neg logit contributions
 prope-0.258
aters-0.250
 pads-0.246
lets-0.242
phones-0.238
Token.
Token ID13
Feature activation+5.22e-01

Pos logit contributions
 prope+0.606
aters+0.587
 pads+0.574
lets+0.569
phones+0.568
Neg logit contributions
aster-0.565
ore-0.551
eling-0.530
oker-0.515
 restaurant-0.495
Token They
Token ID1119
Feature activation-2.24e+00

Pos logit contributions
aster+1.143
eling+1.115
ore+1.111
oker+1.069
 baker+1.025
Neg logit contributions
aters-0.622
 prope-0.622
 pads-0.606
lets-0.588
 Tess-0.566
Token had
Token ID550
Feature activation-3.74e+00

Pos logit contributions
aters+3.845
 prope+3.778
phones+3.615
 pads+3.516
Snow+3.437
Neg logit contributions
aster-3.890
eling-3.657
ore-3.473
oker-3.465
 restaurant-3.441
Token red
Token ID2266
Feature activation-1.96e+00

Pos logit contributions
aters+9.417
phones+8.665
lets+8.585
Brown+8.575
ker+8.473
Neg logit contributions
aster-4.224
 restaurant-4.104
 baker-3.293
 contest-3.293
 early-3.250
Token lasers
Token ID34628
Feature activation-1.60e+00

Pos logit contributions
 prope+3.185
Snow+3.184
 Tess+3.143
Brown+3.115
 Dog+3.098
Neg logit contributions
aster-3.365
ore-3.318
eling-3.041
 restaurant-2.983
oker-2.940
Token and
Token ID290
Feature activation-2.96e+00

Pos logit contributions
 prope+2.635
aters+2.567
 Tess+2.495
Snow+2.490
Brown+2.475
Neg logit contributions
ore-2.763
aster-2.677
oker-2.525
eling-2.511
 baker-2.474
Token blue
Token ID4171
Feature activation-2.08e+00

Pos logit contributions
aters+5.380
Snow+4.690
 Tess+4.665
 prope+4.655
Brown+4.633
Neg logit contributions
aster-5.049
ore-4.551
eling-4.502
 salsa-4.372
oker-4.330
Token lasers
Token ID34628
Feature activation-1.29e+00

Pos logit contributions
Snow+3.611
 Tess+3.594
 prope+3.566
Brown+3.553
 Dog+3.539
Neg logit contributions
ore-3.472
aster-3.307
eling-3.042
oker-2.997
 restaurant-2.974
Token a
Token ID257
Feature activation+1.74e-01

Pos logit contributions
 prope+1.304
aters+1.265
lets+1.256
phones+1.241
 pads+1.226
Neg logit contributions
aster-1.748
ore-1.625
eling-1.577
oker-1.528
 baker-1.526
Token silly
Token ID14397
Feature activation+1.54e-01

Pos logit contributions
aster+0.345
ore+0.333
eling+0.330
oker+0.318
 baker+0.303
Neg logit contributions
 prope-0.245
aters-0.236
 pads-0.232
lets-0.228
phones-0.224
Token rabbit
Token ID22746
Feature activation-2.89e-01

Pos logit contributions
aster+0.296
ore+0.285
eling+0.283
oker+0.272
 baker+0.259
Neg logit contributions
 prope-0.227
aters-0.220
 pads-0.216
lets-0.213
phones-0.209
Token named
Token ID3706
Feature activation-1.76e+00

Pos logit contributions
 prope+0.376
aters+0.363
 pads+0.352
lets+0.345
phones+0.342
Neg logit contributions
aster-0.614
ore-0.597
eling-0.579
oker-0.562
 restaurant-0.546
Token Bun
Token ID28515
Feature activation-1.86e+00

Pos logit contributions
 prope+2.752
aters+2.512
lets+2.407
phones+2.348
 pads+2.334
Neg logit contributions
aster-3.602
ore-3.388
eling-3.225
 restaurant-3.217
 salsa-3.157
Token.
Token ID13
Feature activation-3.73e+00

Pos logit contributions
 prope+3.826
aters+3.625
phones+3.612
lets+3.420
 Tess+3.402
Neg logit contributions
aster-2.688
ore-2.650
eling-2.552
in-2.266
 restaurant-2.262
Token Bun
Token ID28515
Feature activation-1.35e+00

Pos logit contributions
lets+6.289
 pads+5.705
 prope+5.687
phones+5.608
 mate+5.586
Neg logit contributions
ore-7.422
aster-6.310
-6.300
he-6.299
eling-6.168
Token liked
Token ID8288
Feature activation-2.01e-01

Pos logit contributions
 prope+2.190
aters+2.115
phones+2.070
lets+1.969
 pads+1.924
Neg logit contributions
ore-2.448
aster-2.434
eling-2.410
oker-2.209
 restaurant-2.200
Token to
Token ID284
Feature activation+1.70e-01

Pos logit contributions
 prope+0.317
aters+0.310
 pads+0.301
lets+0.299
phones+0.294
Neg logit contributions
aster-0.367
ore-0.352
eling-0.345
oker-0.331
 restaurant-0.322
Token hop
Token ID1725
Feature activation+2.77e-01

Pos logit contributions
aster+0.315
ore+0.303
eling+0.300
oker+0.289
 baker+0.276
Neg logit contributions
 prope-0.264
aters-0.257
 pads-0.252
lets-0.248
phones-0.244
Token and
Token ID290
Feature activation-8.87e-02

Pos logit contributions
aster+0.463
eling+0.439
ore+0.438
oker+0.418
 baker+0.397
Neg logit contributions
 prope-0.482
aters-0.470
 pads-0.464
lets-0.457
phones-0.449
Token small
Token ID1402
Feature activation-4.16e-01

Pos logit contributions
aters+0.740
 prope+0.736
 pads+0.714
lets+0.711
phones+0.705
Neg logit contributions
aster-0.852
ore-0.813
eling-0.787
oker-0.777
 restaurant-0.761
Token,
Token ID11
Feature activation-6.74e-01

Pos logit contributions
 prope+0.705
aters+0.690
 pads+0.677
lets+0.672
phones+0.670
Neg logit contributions
aster-0.703
ore-0.681
eling-0.652
oker-0.642
 restaurant-0.627
Token brown
Token ID7586
Feature activation-9.90e-01

Pos logit contributions
 prope+0.998
aters+0.996
lets+0.956
phones+0.943
 pads+0.920
Neg logit contributions
aster-1.296
ore-1.220
eling-1.211
oker-1.169
 restaurant-1.148
Token puppy
Token ID26188
Feature activation-1.73e+00

Pos logit contributions
 prope+1.550
aters+1.508
lets+1.487
 pads+1.483
phones+1.480
Neg logit contributions
aster-1.815
ore-1.768
 restaurant-1.666
 baker-1.636
eling-1.630
Token with
Token ID351
Feature activation-2.85e+00

Pos logit contributions
 prope+3.186
aters+3.057
phones+2.895
 pads+2.847
lets+2.846
Neg logit contributions
ore-2.861
aster-2.799
eling-2.548
 restaurant-2.542
 baker-2.511
Token floppy
Token ID49415
Feature activation-3.72e+00

Pos logit contributions
aters+5.478
 prope+5.230
lets+5.197
phones+5.057
 pads+4.867
Neg logit contributions
aster-4.501
ore-4.307
eling-4.197
 salsa-3.973
 baker-3.818
Token ears
Token ID11368
Feature activation-2.62e+00

Pos logit contributions
 prope+5.144
Brown+5.096
ocular+5.055
aters+5.001
Wh+4.902
Neg logit contributions
aster-7.391
eling-7.045
ore-6.886
 salsa-6.749
 restaurant-6.713
Token and
Token ID290
Feature activation-2.34e+00

Pos logit contributions
 prope+5.116
Brown+4.932
 Tess+4.911
aters+4.896
phones+4.869
Neg logit contributions
ore-4.056
aster-3.889
eling-3.409
 salsa-3.355
 baker-3.331
Token a
Token ID257
Feature activation-2.32e+00

Pos logit contributions
aters+3.938
 prope+3.766
phones+3.674
Brown+3.598
lets+3.598
Neg logit contributions
aster-4.184
ore-3.792
eling-3.786
 salsa-3.784
 restaurant-3.627
Token w
Token ID266
Feature activation+7.11e-02

Pos logit contributions
aters+3.640
phones+3.516
lets+3.411
 prope+3.314
 Tess+3.309
Neg logit contributions
aster-4.189
ore-4.118
 restaurant-3.871
 salsa-3.856
 baker-3.850
Tokenagging
Token ID16406
Feature activation-2.04e+00

Pos logit contributions
aster+0.117
ore+0.112
eling+0.111
oker+0.106
 baker+0.100
Neg logit contributions
 prope-0.126
aters-0.122
 pads-0.120
lets-0.119
phones-0.117
Token was
Token ID373
Feature activation-9.44e-01

Pos logit contributions
 prope+0.641
aters+0.623
 pads+0.611
lets+0.608
phones+0.606
Neg logit contributions
aster-0.593
ore-0.572
eling-0.556
oker-0.531
 restaurant-0.514
Token a
Token ID257
Feature activation+5.45e-02

Pos logit contributions
 prope+1.416
aters+1.371
lets+1.371
phones+1.352
 pads+1.335
Neg logit contributions
aster-1.872
ore-1.728
eling-1.681
 restaurant-1.634
 baker-1.633
Token bunny
Token ID44915
Feature activation-5.29e-01

Pos logit contributions
aster+0.107
ore+0.103
eling+0.102
oker+0.099
 baker+0.095
Neg logit contributions
 prope-0.078
aters-0.076
 pads-0.074
lets-0.073
phones-0.072
Token named
Token ID3706
Feature activation-1.71e+00

Pos logit contributions
 prope+0.704
aters+0.684
 pads+0.660
phones+0.649
lets+0.642
Neg logit contributions
aster-1.114
ore-1.093
eling-1.044
oker-1.015
 restaurant-0.996
Token Bun
Token ID28515
Feature activation-1.94e+00

Pos logit contributions
 prope+2.576
aters+2.337
lets+2.271
phones+2.231
 pads+2.202
Neg logit contributions
aster-3.515
ore-3.427
eling-3.209
 restaurant-3.191
 baker-3.100
Token.
Token ID13
Feature activation-3.68e+00

Pos logit contributions
 prope+3.863
phones+3.673
aters+3.668
lets+3.470
 Tess+3.416
Neg logit contributions
aster-2.905
ore-2.885
eling-2.797
in-2.512
 restaurant-2.486
Token Bun
Token ID28515
Feature activation-1.40e+00

Pos logit contributions
lets+5.854
 mate+5.399
 prope+5.375
phones+5.233
 pads+5.217
Neg logit contributions
ore-7.537
aster-6.518
he-6.514
eling-6.452
-6.433
Token loved
Token ID6151
Feature activation-7.16e-02

Pos logit contributions
 prope+2.258
aters+2.181
phones+2.154
lets+2.050
 mate+1.995
Neg logit contributions
ore-2.550
aster-2.516
eling-2.512
 restaurant-2.297
oker-2.277
Token to
Token ID284
Feature activation-5.66e-01

Pos logit contributions
 prope+0.111
aters+0.108
 pads+0.106
lets+0.105
phones+0.103
Neg logit contributions
aster-0.132
ore-0.127
eling-0.125
oker-0.120
 restaurant-0.116
Token eat
Token ID4483
Feature activation+7.66e-01

Pos logit contributions
 prope+0.944
aters+0.917
 pads+0.903
lets+0.899
phones+0.883
Neg logit contributions
aster-1.020
ore-0.944
eling-0.929
oker-0.884
 restaurant-0.871
Token cel
Token ID18725
Feature activation+6.60e-01

Pos logit contributions
aster+1.381
ore+1.312
eling+1.311
oker+1.293
 baker+1.185
Neg logit contributions
 prope-1.225
aters-1.208
 pads-1.189
lets-1.157
 Tess-1.134
Token a
Token ID257
Feature activation+9.12e-02

Pos logit contributions
 prope+1.408
aters+1.360
lets+1.352
phones+1.342
 pads+1.318
Neg logit contributions
aster-1.859
ore-1.724
eling-1.668
 baker-1.626
 restaurant-1.624
Token little
Token ID1310
Feature activation+1.07e-01

Pos logit contributions
aster+0.182
ore+0.175
eling+0.173
oker+0.167
 baker+0.160
Neg logit contributions
 prope-0.128
aters-0.124
 pads-0.122
lets-0.120
phones-0.118
Token girl
Token ID2576
Feature activation-7.93e-01

Pos logit contributions
aster+0.139
ore+0.132
eling+0.130
oker+0.123
 baker+0.114
Neg logit contributions
 prope-0.224
aters-0.219
 pads-0.216
lets-0.214
phones-0.211
Token named
Token ID3706
Feature activation-7.23e-01

Pos logit contributions
 prope+1.095
aters+1.060
 pads+1.029
phones+1.005
lets+0.996
Neg logit contributions
aster-1.667
ore-1.595
eling-1.530
oker-1.479
 baker-1.460
Token Beth
Token ID19223
Feature activation-1.01e+00

Pos logit contributions
 prope+1.372
aters+1.310
 pads+1.277
lets+1.256
phones+1.253
Neg logit contributions
aster-1.159
ore-1.101
eling-1.048
oker-1.021
 baker-1.002
Token.
Token ID13
Feature activation-3.67e+00

Pos logit contributions
 prope+1.910
aters+1.835
phones+1.810
 pads+1.805
lets+1.764
Neg logit contributions
aster-1.582
ore-1.502
eling-1.481
oker-1.348
 baker-1.346
Token Beth
Token ID19223
Feature activation-4.67e-01

Pos logit contributions
 prope+8.162
lets+7.832
 mate+7.704
 pads+7.675
phones+7.426
Neg logit contributions
ore-5.476
he-4.526
aster-4.288
-4.284
 early-4.233
Token was
Token ID373
Feature activation-2.11e-01

Pos logit contributions
 prope+0.675
aters+0.650
 pads+0.633
phones+0.623
lets+0.623
Neg logit contributions
aster-0.916
ore-0.885
eling-0.876
oker-0.831
 restaurant-0.811
Token nice
Token ID3621
Feature activation-9.57e-01

Pos logit contributions
 prope+0.301
aters+0.294
 pads+0.288
lets+0.285
phones+0.278
Neg logit contributions
aster-0.418
ore-0.402
eling-0.398
oker-0.381
 baker-0.372
Token,
Token ID11
Feature activation-1.32e+00

Pos logit contributions
 prope+1.475
 pads+1.359
aters+1.351
phones+1.342
lets+1.325
Neg logit contributions
ore-1.821
aster-1.806
 restaurant-1.715
 baker-1.711
oker-1.680
Token but
Token ID475
Feature activation-9.74e-01

Pos logit contributions
 prope+1.936
aters+1.891
lets+1.862
phones+1.827
tails+1.804
Neg logit contributions
aster-2.649
ore-2.485
 restaurant-2.399
eling-2.389
 baker-2.382
Token,
Token ID11
Feature activation+6.70e-02

Pos logit contributions
 prope+0.173
aters+0.170
 pads+0.168
lets+0.165
phones+0.162
Neg logit contributions
aster-0.211
ore-0.203
eling-0.199
oker-0.193
 restaurant-0.186
Token friendly
Token ID8030
Feature activation-3.61e-01

Pos logit contributions
aster+0.126
ore+0.121
eling+0.120
oker+0.116
 restaurant+0.111
Neg logit contributions
 prope-0.101
aters-0.098
 pads-0.096
lets-0.095
phones-0.093
Token brown
Token ID7586
Feature activation-5.46e-01

Pos logit contributions
 prope+0.560
aters+0.555
 pads+0.550
lets+0.537
phones+0.529
Neg logit contributions
aster-0.662
ore-0.640
eling-0.615
oker-0.604
 restaurant-0.590
Token dog
Token ID3290
Feature activation-9.76e-01

Pos logit contributions
aters+0.799
 prope+0.797
 pads+0.793
lets+0.771
phones+0.754
Neg logit contributions
aster-1.054
ore-1.021
eling-0.971
oker-0.952
 restaurant-0.944
Token with
Token ID351
Feature activation-2.55e+00

Pos logit contributions
 prope+1.652
aters+1.615
 pads+1.550
lets+1.529
phones+1.507
Neg logit contributions
aster-1.704
ore-1.697
eling-1.602
oker-1.513
 restaurant-1.506
Token floppy
Token ID49415
Feature activation-3.63e+00

Pos logit contributions
aters+4.774
lets+4.589
 pads+4.438
phones+4.430
 prope+4.416
Neg logit contributions
aster-4.271
ore-4.004
eling-3.969
 restaurant-3.731
 baker-3.706
Token ears
Token ID11368
Feature activation-2.45e+00

Pos logit contributions
aters+4.452
Brown+4.431
 prope+4.429
Wh+4.400
Snow+4.342
Neg logit contributions
aster-7.689
ore-7.417
eling-7.365
 restaurant-7.170
 salsa-6.920
Token.
Token ID13
Feature activation+6.79e-01

Pos logit contributions
 prope+4.567
Brown+4.493
aters+4.482
Wh+4.459
 Tess+4.428
Neg logit contributions
ore-3.899
aster-3.683
eling-3.396
 restaurant-3.208
 baker-3.188
Token The
Token ID383
Feature activation-4.27e-01

Pos logit contributions
aster+1.244
eling+1.215
ore+1.172
oker+1.148
 baker+1.080
Neg logit contributions
 prope-1.068
aters-1.039
 pads-1.029
 Tess-1.012
 Dog-1.012
Token dog
Token ID3290
Feature activation-1.39e+00

Pos logit contributions
 prope+0.699
aters+0.690
lets+0.672
 pads+0.668
phones+0.655
Neg logit contributions
aster-0.762
ore-0.733
eling-0.714
oker-0.692
 restaurant-0.680
Token started
Token ID2067
Feature activation-9.50e-01

Pos logit contributions
aters+2.597
 prope+2.546
 pads+2.389
phones+2.372
lets+2.323
Neg logit contributions
eling-2.134
aster-2.124
ore-2.090
 restaurant-1.942
 baker-1.909
Token was
Token ID373
Feature activation-9.82e-01

Pos logit contributions
 prope+0.329
aters+0.321
 pads+0.315
lets+0.311
phones+0.309
Neg logit contributions
aster-0.319
ore-0.307
eling-0.301
oker-0.288
 restaurant-0.276
Token a
Token ID257
Feature activation+6.96e-02

Pos logit contributions
 prope+1.488
aters+1.429
lets+1.428
phones+1.415
 pads+1.376
Neg logit contributions
aster-1.949
ore-1.805
eling-1.757
oker-1.692
 restaurant-1.683
Token girl
Token ID2576
Feature activation-9.41e-01

Pos logit contributions
aster+0.139
ore+0.134
eling+0.132
oker+0.128
 baker+0.123
Neg logit contributions
 prope-0.098
aters-0.095
 pads-0.093
lets-0.092
phones-0.090
Token named
Token ID3706
Feature activation-9.37e-01

Pos logit contributions
 prope+1.328
aters+1.288
 pads+1.242
phones+1.220
lets+1.213
Neg logit contributions
aster-1.969
ore-1.878
eling-1.789
oker-1.720
 baker-1.710
Token Beth
Token ID19223
Feature activation-1.10e+00

Pos logit contributions
 prope+1.838
aters+1.741
 pads+1.692
lets+1.670
phones+1.666
Neg logit contributions
aster-1.475
ore-1.385
eling-1.296
oker-1.285
 baker-1.266
Token.
Token ID13
Feature activation-3.62e+00

Pos logit contributions
 prope+2.064
aters+1.989
phones+1.961
 pads+1.939
lets+1.909
Neg logit contributions
aster-1.735
ore-1.642
eling-1.621
 baker-1.470
oker-1.470
Token Beth
Token ID19223
Feature activation-5.28e-01

Pos logit contributions
 prope+7.997
lets+7.651
 mate+7.577
 pads+7.307
phones+7.206
Neg logit contributions
ore-5.429
he-4.547
aster-4.477
-4.393
 early-4.393
Token wanted
Token ID2227
Feature activation-6.45e-02

Pos logit contributions
 prope+0.766
aters+0.739
 pads+0.717
phones+0.709
lets+0.707
Neg logit contributions
aster-1.034
ore-0.998
eling-0.990
oker-0.935
 restaurant-0.916
Token to
Token ID284
Feature activation-2.43e-01

Pos logit contributions
 prope+0.100
aters+0.098
 pads+0.095
lets+0.095
phones+0.093
Neg logit contributions
aster-0.120
ore-0.115
eling-0.113
oker-0.109
 restaurant-0.105
Token go
Token ID467
Feature activation+3.20e-01

Pos logit contributions
 prope+0.408
aters+0.396
 pads+0.392
lets+0.386
phones+0.377
Neg logit contributions
aster-0.431
ore-0.404
eling-0.398
oker-0.384
 restaurant-0.369
Token to
Token ID284
Feature activation+9.08e-01

Pos logit contributions
aster+0.542
eling+0.517
ore+0.516
oker+0.496
 baker+0.469
Neg logit contributions
 prope-0.543
aters-0.532
 pads-0.524
lets-0.515
phones-0.511
Token swim
Token ID9422
Feature activation-6.00e-01

Pos logit contributions
 prope+2.011
aters+1.929
lets+1.907
 Tess+1.898
Brown+1.868
Neg logit contributions
ore-1.809
aster-1.797
eling-1.716
 restaurant-1.690
oker-1.654
Tokensuits
Token ID16554
Feature activation-1.30e+00

Pos logit contributions
 prope+0.948
aters+0.913
 Tess+0.889
lets+0.875
phones+0.872
Neg logit contributions
ore-1.092
aster-1.082
eling-1.030
oker-1.021
 restaurant-0.992
Token on
Token ID319
Feature activation-6.64e-01

Pos logit contributions
 prope+2.435
aters+2.277
Brown+2.277
lets+2.245
 Tess+2.240
Neg logit contributions
ore-2.043
aster-2.030
 restaurant-1.866
eling-1.857
oker-1.824
Token.
Token ID13
Feature activation-7.34e-01

Pos logit contributions
 prope+1.486
 pads+1.464
lets+1.459
aters+1.457
phones+1.439
Neg logit contributions
aster-0.807
ore-0.732
eling-0.713
oker-0.644
 restaurant-0.641
Token Lily
Token ID20037
Feature activation-8.41e-01

Pos logit contributions
 prope+1.109
aters+1.042
 pads+1.016
lets+1.008
phones+0.990
Neg logit contributions
aster-1.404
ore-1.354
eling-1.305
oker-1.278
 baker-1.273
Token's
Token ID338
Feature activation-3.62e+00

Pos logit contributions
 prope+1.466
aters+1.426
phones+1.368
lets+1.353
 Tess+1.334
Neg logit contributions
aster-1.407
ore-1.362
eling-1.357
 restaurant-1.260
oker-1.257
Token mom
Token ID1995
Feature activation-1.81e+00

Pos logit contributions
 prope+6.532
lets+6.440
Brown+6.390
aters+6.210
 Tess+6.204
Neg logit contributions
 restaurant-6.480
ore-5.989
aster-5.867
 early-5.681
 baker-5.610
Token said
Token ID531
Feature activation-8.07e-01

Pos logit contributions
 prope+3.103
aters+2.788
phones+2.696
lets+2.612
 pads+2.607
Neg logit contributions
aster-3.363
eling-3.326
ore-3.205
 restaurant-3.028
 baker-2.970
Token,
Token ID11
Feature activation-7.11e-01

Pos logit contributions
aters+1.283
 prope+1.268
 pads+1.229
phones+1.203
lets+1.200
Neg logit contributions
aster-1.486
ore-1.440
eling-1.405
oker-1.355
 baker-1.335
Token "
Token ID366
Feature activation-7.41e-01

Pos logit contributions
 prope+1.052
aters+1.028
 pads+0.989
lets+0.973
phones+0.955
Neg logit contributions
aster-1.380
ore-1.331
eling-1.307
 baker-1.249
oker-1.235
TokenBe
Token ID3856
Feature activation+2.03e-01

Pos logit contributions
 prope+1.349
aters+1.290
 pads+1.245
lets+1.244
phones+1.236
Neg logit contributions
aster-1.231
ore-1.155
eling-1.138
oker-1.110
 baker-1.055
TokenOne
Token ID3198
Feature activation+1.05e+00

Pos logit contributions
 prope+0.177
aters+0.168
 pads+0.162
lets+0.161
phones+0.158
Neg logit contributions
aster-0.372
ore-0.358
eling-0.354
oker-0.344
 restaurant-0.333
Token day
Token ID1110
Feature activation-3.08e-01

Pos logit contributions
eling+1.327
aster+1.324
ore+1.267
oker+1.202
 baker+1.120
Neg logit contributions
 pads-2.162
aters-2.146
 prope-2.140
 Dog-2.101
lets-2.089
Token,
Token ID11
Feature activation-4.76e-01

Pos logit contributions
 prope+0.450
aters+0.431
 pads+0.419
lets+0.413
phones+0.407
Neg logit contributions
aster-0.614
ore-0.593
eling-0.580
oker-0.564
 restaurant-0.540
Token Sandy
Token ID17581
Feature activation-2.23e-02

Pos logit contributions
 prope+0.813
aters+0.779
 pads+0.757
lets+0.750
phones+0.742
Neg logit contributions
aster-0.838
ore-0.804
eling-0.789
oker-0.763
 restaurant-0.725
Token and
Token ID290
Feature activation-2.62e-01

Pos logit contributions
 prope+0.036
aters+0.034
 pads+0.034
lets+0.033
phones+0.033
Neg logit contributions
aster-0.040
ore-0.039
eling-0.038
oker-0.037
 restaurant-0.035
Token her
Token ID607
Feature activation-3.61e+00

Pos logit contributions
 prope+0.481
aters+0.468
 pads+0.457
lets+0.454
phones+0.447
Neg logit contributions
aster-0.422
ore-0.398
eling-0.395
oker-0.373
 restaurant-0.361
Token owners
Token ID4393
Feature activation-6.37e-01

Pos logit contributions
 prope+5.906
aters+5.486
lets+5.011
phones+4.875
Brown+4.813
Neg logit contributions
 restaurant-6.806
aster-6.615
ore-6.611
 early-6.502
 baker-6.403
Token went
Token ID1816
Feature activation+8.05e-01

Pos logit contributions
 prope+0.988
aters+0.955
 pads+0.923
lets+0.901
phones+0.899
Neg logit contributions
aster-1.199
ore-1.154
eling-1.151
oker-1.097
 restaurant-1.065
Token for
Token ID329
Feature activation-9.75e-01

Pos logit contributions
aster+1.506
eling+1.447
ore+1.404
oker+1.384
 baker+1.304
Neg logit contributions
 prope-1.238
aters-1.216
 pads-1.214
phones-1.181
 Tess-1.178
Token a
Token ID257
Feature activation-9.87e-01

Pos logit contributions
 prope+1.465
aters+1.460
lets+1.435
 pads+1.421
phones+1.333
Neg logit contributions
aster-1.940
eling-1.778
ore-1.759
 restaurant-1.673
oker-1.649
Token walk
Token ID2513
Feature activation+1.30e-01

Pos logit contributions
aters+1.813
 prope+1.795
lets+1.704
 pads+1.690
phones+1.632
Neg logit contributions
aster-1.644
ore-1.540
eling-1.471
 restaurant-1.461
oker-1.434
Token she
Token ID673
Feature activation-1.05e+00

Pos logit contributions
 prope+0.360
aters+0.354
 pads+0.346
lets+0.341
phones+0.336
Neg logit contributions
aster-0.284
ore-0.269
eling-0.266
oker-0.253
 baker-0.240
Token could
Token ID714
Feature activation-1.00e+00

Pos logit contributions
 prope+1.440
aters+1.390
phones+1.273
 pads+1.268
 Tess+1.254
Neg logit contributions
aster-2.171
ore-2.144
eling-2.077
oker-2.040
 baker-1.980
Token have
Token ID423
Feature activation-9.28e-01

Pos logit contributions
 prope+1.819
aters+1.720
 pads+1.701
lets+1.675
Brown+1.658
Neg logit contributions
aster-1.704
ore-1.609
eling-1.513
oker-1.491
 restaurant-1.411
Token it
Token ID340
Feature activation-5.25e-01

Pos logit contributions
 prope+1.755
aters+1.699
 pads+1.638
lets+1.635
phones+1.624
Neg logit contributions
aster-1.480
eling-1.358
ore-1.319
oker-1.261
 restaurant-1.228
Token.
Token ID13
Feature activation-1.31e-01

Pos logit contributions
 prope+0.921
aters+0.874
 pads+0.852
lets+0.849
phones+0.845
Neg logit contributions
aster-0.882
ore-0.845
eling-0.821
oker-0.801
 restaurant-0.767
Token Her
Token ID2332
Feature activation-3.61e+00

Pos logit contributions
 prope+0.179
aters+0.172
 pads+0.169
lets+0.166
phones+0.163
Neg logit contributions
aster-0.268
ore-0.261
eling-0.254
oker-0.247
 baker-0.238
Token friend
Token ID1545
Feature activation-3.61e-01

Pos logit contributions
 prope+6.964
Brown+6.259
Snow+6.214
aters+6.207
phones+6.193
Neg logit contributions
 baker-5.644
ore-5.551
 restaurant-5.531
 answers-5.219
aster-5.207
Token said
Token ID531
Feature activation-9.29e-01

Pos logit contributions
 prope+0.703
aters+0.686
 pads+0.668
phones+0.657
lets+0.655
Neg logit contributions
aster-0.530
eling-0.501
ore-0.500
oker-0.474
 baker-0.452
Token yes
Token ID3763
Feature activation-7.02e-01

Pos logit contributions
aters+1.040
 prope+0.995
 pads+0.965
lets+0.936
phones+0.924
Neg logit contributions
aster-2.202
ore-2.102
eling-2.063
oker-2.024
 baker-1.993
Token,
Token ID11
Feature activation-4.86e-01

Pos logit contributions
 prope+1.105
aters+1.046
 pads+1.003
phones+0.989
lets+0.984
Neg logit contributions
aster-1.328
ore-1.290
eling-1.257
oker-1.214
 restaurant-1.181
Token and
Token ID290
Feature activation-6.52e-02

Pos logit contributions
 prope+0.578
aters+0.557
 pads+0.532
lets+0.526
phones+0.518
Neg logit contributions
aster-1.088
ore-1.047
eling-1.034
oker-1.004
 baker-0.976
Token a
Token ID257
Feature activation+1.25e-01

Pos logit contributions
 prope+1.510
aters+1.478
lets+1.466
phones+1.441
 pads+1.425
Neg logit contributions
aster-1.939
ore-1.795
eling-1.743
 baker-1.695
 restaurant-1.691
Token little
Token ID1310
Feature activation+1.69e-01

Pos logit contributions
aster+0.250
ore+0.241
eling+0.239
oker+0.231
 baker+0.221
Neg logit contributions
 prope-0.175
aters-0.170
 pads-0.166
lets-0.164
phones-0.161
Token girl
Token ID2576
Feature activation-8.56e-01

Pos logit contributions
aster+0.231
ore+0.218
eling+0.216
oker+0.204
 restaurant+0.190
Neg logit contributions
 prope-0.344
aters-0.336
 pads-0.332
lets-0.328
phones-0.324
Token called
Token ID1444
Feature activation-9.18e-01

Pos logit contributions
 prope+1.191
aters+1.159
 pads+1.122
phones+1.094
lets+1.086
Neg logit contributions
aster-1.795
ore-1.715
eling-1.644
oker-1.585
 baker-1.570
Token Beth
Token ID19223
Feature activation-1.22e+00

Pos logit contributions
 prope+1.782
aters+1.714
 pads+1.697
lets+1.653
phones+1.617
Neg logit contributions
aster-1.446
ore-1.370
oker-1.272
eling-1.245
 baker-1.220
Token.
Token ID13
Feature activation-3.60e+00

Pos logit contributions
 prope+2.349
aters+2.240
phones+2.211
 pads+2.207
lets+2.156
Neg logit contributions
aster-1.871
ore-1.797
eling-1.745
 baker-1.611
oker-1.583
Token She
Token ID1375
Feature activation-6.14e-01

Pos logit contributions
 prope+7.814
lets+7.685
 mate+7.499
 pads+7.461
phones+7.157
Neg logit contributions
ore-5.459
he-4.506
-4.446
 early-4.292
aster-4.288
Token was
Token ID373
Feature activation-2.83e-01

Pos logit contributions
 prope+0.868
aters+0.866
lets+0.813
 pads+0.813
phones+0.807
Neg logit contributions
aster-1.239
ore-1.192
eling-1.173
oker-1.119
 restaurant-1.105
Token very
Token ID845
Feature activation+1.41e+00

Pos logit contributions
 prope+0.408
aters+0.399
 pads+0.391
lets+0.387
phones+0.377
Neg logit contributions
aster-0.556
ore-0.534
eling-0.530
oker-0.506
 baker-0.498
Token intelligent
Token ID12661
Feature activation-2.28e+00

Pos logit contributions
aster+2.818
eling+2.656
ore+2.649
oker+2.531
roads+2.478
Neg logit contributions
 prope-2.004
 Tess-1.897
 Dog-1.889
aters-1.887
Snow-1.872
Token,
Token ID11
Feature activation-1.42e+00

Pos logit contributions
 prope+4.420
phones+4.084
Wh+4.052
Brown+4.048
Snow+3.995
Neg logit contributions
ore-3.650
 baker-3.503
 restaurant-3.467
aster-3.399
 early-3.222
Token."
Token ID526
Feature activation-1.55e+00

Pos logit contributions
aster+0.684
ore+0.653
eling+0.651
oker+0.627
 baker+0.595
Neg logit contributions
 prope-0.567
aters-0.553
 pads-0.543
lets-0.536
phones-0.525
Token So
Token ID1406
Feature activation-1.82e+00

Pos logit contributions
 prope+1.837
aters+1.641
 mate+1.606
 pads+1.567
lets+1.539
Neg logit contributions
ore-3.496
aster-3.494
eling-3.211
 baker-3.180
oker-3.143
Token,
Token ID11
Feature activation-1.38e+00

Pos logit contributions
 prope+3.353
aters+3.335
 pads+3.202
lets+3.048
phones+2.998
Neg logit contributions
aster-2.911
ore-2.791
eling-2.735
oker-2.546
 baker-2.533
Token Kitty
Token ID21168
Feature activation-9.69e-01

Pos logit contributions
 prope+2.142
aters+2.066
 pads+1.982
lets+1.939
 mate+1.876
Neg logit contributions
aster-2.647
ore-2.487
eling-2.441
 baker-2.319
oker-2.295
Token and
Token ID290
Feature activation-1.73e+00

Pos logit contributions
 prope+1.508
aters+1.436
phones+1.384
lets+1.342
 pads+1.336
Neg logit contributions
aster-1.829
ore-1.754
eling-1.718
oker-1.615
 restaurant-1.613
Token Dog
Token ID8532
Feature activation-3.60e+00

Pos logit contributions
 prope+1.091
aters+1.069
lets+0.908
 pads+0.903
 mate+0.785
Neg logit contributions
aster-4.969
eling-4.753
ore-4.598
 baker-4.441
 restaurant-4.389
Tokengy
Token ID1360
Feature activation-1.21e+00

Pos logit contributions
 prope+5.173
aters+4.473
 pads+3.922
 silk+3.895
Snow+3.862
Neg logit contributions
ore-7.751
aster-7.350
eling-7.301
led-6.732
in-6.526
Token curled
Token ID41108
Feature activation-4.11e-01

Pos logit contributions
 prope+2.118
aters+2.074
phones+1.976
 Tess+1.948
lets+1.936
Neg logit contributions
aster-2.072
eling-1.945
ore-1.879
 restaurant-1.800
oker-1.749
Token up
Token ID510
Feature activation+4.59e-01

Pos logit contributions
 prope+0.989
aters+0.979
 pads+0.965
lets+0.958
phones+0.943
Neg logit contributions
aster-0.398
ore-0.378
eling-0.367
oker-0.321
 restaurant-0.309
Token next
Token ID1306
Feature activation-8.40e-01

Pos logit contributions
aster+0.784
eling+0.742
ore+0.741
oker+0.719
 baker+0.672
Neg logit contributions
 prope-0.787
aters-0.763
 pads-0.750
lets-0.738
phones-0.727
Token to
Token ID284
Feature activation-1.15e+00

Pos logit contributions
 prope+1.440
aters+1.396
lets+1.383
 pads+1.376
phones+1.345
Neg logit contributions
aster-1.437
ore-1.411
eling-1.329
 restaurant-1.282
oker-1.267
Token around
Token ID1088
Feature activation+2.37e-01

Pos logit contributions
 prope+1.640
aters+1.638
 pads+1.594
lets+1.586
 Tess+1.567
Neg logit contributions
aster-1.144
ore-1.134
eling-1.075
oker-1.028
 restaurant-0.961
Token the
Token ID262
Feature activation-8.95e-01

Pos logit contributions
aster+0.375
ore+0.356
eling+0.353
oker+0.339
 baker+0.320
Neg logit contributions
 prope-0.436
aters-0.423
 pads-0.416
lets-0.410
phones-0.406
Token house
Token ID2156
Feature activation+5.53e-02

Pos logit contributions
lets+1.715
 prope+1.701
aters+1.696
 pads+1.670
phones+1.668
Neg logit contributions
aster-1.321
ore-1.263
eling-1.189
 restaurant-1.164
oker-1.145
Token.
Token ID13
Feature activation-1.48e+00

Pos logit contributions
aster+0.093
ore+0.089
eling+0.088
oker+0.084
 restaurant+0.080
Neg logit contributions
 prope-0.095
aters-0.092
 pads-0.091
lets-0.090
phones-0.088
Token He
Token ID679
Feature activation-1.73e+00

Pos logit contributions
lets+2.589
 prope+2.586
aters+2.522
 pads+2.519
phones+2.489
Neg logit contributions
ore-2.389
aster-2.368
eling-2.086
oker-2.082
 restaurant-2.079
Token had
Token ID550
Feature activation-3.60e+00

Pos logit contributions
aters+2.571
phones+2.361
 prope+2.353
 Tess+2.349
lets+2.335
Neg logit contributions
aster-3.288
eling-3.250
ore-3.190
 restaurant-2.976
oker-2.973
Token so
Token ID523
Feature activation-7.80e-01

Pos logit contributions
aters+8.291
lets+7.822
phones+7.510
 Tess+7.324
 prope+7.225
Neg logit contributions
aster-4.546
 restaurant-4.095
 cream-3.906
eling-3.900
 early-3.874
Token much
Token ID881
Feature activation-9.88e-01

Pos logit contributions
 prope+1.147
aters+1.142
phones+1.105
lets+1.098
 pads+1.079
Neg logit contributions
aster-1.521
ore-1.453
eling-1.406
oker-1.380
 restaurant-1.313
Token fun
Token ID1257
Feature activation-4.32e-01

Pos logit contributions
 prope+1.447
lets+1.429
aters+1.409
phones+1.373
 pads+1.357
Neg logit contributions
aster-1.933
ore-1.853
eling-1.794
oker-1.773
 restaurant-1.716
Token playing
Token ID2712
Feature activation+4.59e-01

Pos logit contributions
 prope+0.719
aters+0.711
 pads+0.693
lets+0.691
phones+0.671
Neg logit contributions
aster-0.751
ore-0.730
eling-0.715
oker-0.680
 restaurant-0.647
Token with
Token ID351
Feature activation-1.73e+00

Pos logit contributions
aster+0.754
eling+0.718
ore+0.716
oker+0.687
 baker+0.648
Neg logit contributions
 prope-0.817
aters-0.784
 pads-0.775
lets-0.763
phones-0.758
Token Lily
Token ID20037
Feature activation+6.30e-01

Pos logit contributions
aster+0.541
ore+0.523
eling+0.517
oker+0.495
 restaurant+0.468
Neg logit contributions
 prope-0.494
aters-0.480
 pads-0.471
lets-0.467
 Tess-0.460
Token's
Token ID338
Feature activation-6.05e-01

Pos logit contributions
aster+1.170
ore+1.133
eling+1.116
oker+1.091
 baker+1.019
Neg logit contributions
 prope-0.960
aters-0.944
 pads-0.943
lets-0.928
 Tess-0.893
Token family
Token ID1641
Feature activation+1.43e+00

Pos logit contributions
 prope+0.767
aters+0.727
 pads+0.717
lets+0.705
phones+0.699
Neg logit contributions
aster-1.326
ore-1.270
eling-1.247
 restaurant-1.212
oker-1.212
Token sat
Token ID3332
Feature activation+2.94e+00

Pos logit contributions
aster+2.760
ore+2.676
eling+2.666
oker+2.557
roads+2.452
Neg logit contributions
aters-2.030
lets-2.029
 prope-2.023
 pads-2.007
 Tess-1.987
Token down
Token ID866
Feature activation+3.71e+00

Pos logit contributions
aster+6.036
oker+5.617
eling+5.166
ore+5.148
 salsa+4.850
Neg logit contributions
aters-4.883
 pads-4.686
 prope-4.623
 neck-4.619
 tail-4.605
Token to
Token ID284
Feature activation+2.12e+00

Pos logit contributions
eling+10.037
aster+9.757
oker+9.706
roads+8.925
 salsa+8.559
Neg logit contributions
aters-3.406
 Brown-3.380
 forever-3.320
 Snow-3.264
 Tess-3.185
Token eat
Token ID4483
Feature activation+2.30e+00

Pos logit contributions
oker+3.658
ore+3.619
ying+3.364
eling+3.274
roads+3.250
Neg logit contributions
 prope-3.508
 Tess-3.443
 Dog-3.395
 Snow-3.387
 scarf-3.368
Token.
Token ID13
Feature activation-3.37e-01

Pos logit contributions
oker+4.646
aster+4.557
ore+4.489
eling+4.486
roads+4.330
Neg logit contributions
aters-3.151
 pads-3.067
 wool-3.049
 scarf-3.043
 Snow-3.024
Token Everyone
Token ID11075
Feature activation+2.90e+00

Pos logit contributions
 prope+0.519
aters+0.495
 pads+0.485
lets+0.483
phones+0.474
Neg logit contributions
aster-0.634
ore-0.610
eling-0.593
oker-0.578
 restaurant-0.563
Token said
Token ID531
Feature activation+5.23e-02

Pos logit contributions
aster+6.881
oker+6.506
eling+6.417
ore+6.397
roads+6.177
Neg logit contributions
lets-3.236
 pads-3.114
 Bun-2.962
 prope-2.914
 Tess-2.862
Token the
Token ID262
Feature activation+1.38e+00

Pos logit contributions
aster+0.092
ore+0.088
eling+0.087
oker+0.083
 baker+0.080
Neg logit contributions
 prope-0.087
aters-0.085
 pads-0.083
lets-0.082
phones-0.081
Token energy
Token ID2568
Feature activation+3.43e-01

Pos logit contributions
aster+1.364
ore+1.303
eling+1.295
oker+1.254
 baker+1.193
Neg logit contributions
 prope-1.410
aters-1.381
 pads-1.370
lets-1.345
phones-1.329
Token.
Token ID13
Feature activation-4.13e-01

Pos logit contributions
aster+0.675
ore+0.649
eling+0.645
oker+0.621
 baker+0.595
Neg logit contributions
 prope-0.489
aters-0.475
 pads-0.468
lets-0.459
phones-0.450
Token Now
Token ID2735
Feature activation-4.98e-01

Pos logit contributions
 prope+0.667
aters+0.634
 pads+0.621
lets+0.617
phones+0.611
Neg logit contributions
aster-0.746
ore-0.718
eling-0.693
oker-0.679
 baker-0.649
Token,
Token ID11
Feature activation-2.29e-01

Pos logit contributions
 prope+0.882
aters+0.855
 pads+0.831
lets+0.815
phones+0.810
Neg logit contributions
aster-0.824
ore-0.781
eling-0.770
oker-0.745
 restaurant-0.709
Token no
Token ID645
Feature activation-1.04e-01

Pos logit contributions
 prope+0.390
aters+0.380
 pads+0.370
lets+0.364
phones+0.359
Neg logit contributions
aster-0.394
ore-0.373
eling-0.368
oker-0.355
 restaurant-0.339
Token one
Token ID530
Feature activation+2.36e+00

Pos logit contributions
 prope+0.161
aters+0.156
 pads+0.153
lets+0.151
phones+0.149
Neg logit contributions
aster-0.195
ore-0.186
eling-0.185
oker-0.178
 restaurant-0.170
Token will
Token ID481
Feature activation-1.50e-01

Pos logit contributions
aster+5.468
eling+5.439
ore+5.244
oker+5.099
 salsa+4.975
Neg logit contributions
 pads-2.584
 prope-2.475
lets-2.380
 forever-2.335
 no-2.245
Token ever
Token ID1683
Feature activation+1.07e+00

Pos logit contributions
 prope+0.239
aters+0.235
 pads+0.230
lets+0.226
phones+0.222
Neg logit contributions
aster-0.272
ore-0.260
eling-0.258
oker-0.248
 baker-0.235
Token have
Token ID423
Feature activation+9.82e-02

Pos logit contributions
aster+2.118
ore+2.084
eling+2.003
oker+1.971
 baker+1.904
Neg logit contributions
 prope-1.476
 pads-1.461
lets-1.419
aters-1.405
phones-1.360
Token to
Token ID284
Feature activation+8.13e-02

Pos logit contributions
aster+0.188
ore+0.180
eling+0.179
oker+0.172
 baker+0.164
Neg logit contributions
 prope-0.147
aters-0.142
 pads-0.139
lets-0.137
phones-0.135
Token pay
Token ID1414
Feature activation+8.44e-01

Pos logit contributions
aster+0.146
ore+0.139
eling+0.138
oker+0.133
 baker+0.126
Neg logit contributions
 prope-0.131
aters-0.127
 pads-0.125
lets-0.123
phones-0.121
Token jam
Token ID16853
Feature activation-1.62e-01

Pos logit contributions
aster+2.269
ore+2.249
eling+2.176
oker+2.139
roads+2.006
Neg logit contributions
 pads-2.047
 prope-2.027
aters-1.992
lets-1.972
 tail-1.945
Token and
Token ID290
Feature activation+9.82e-01

Pos logit contributions
 prope+0.242
aters+0.233
 pads+0.226
lets+0.225
phones+0.222
Neg logit contributions
aster-0.309
ore-0.298
eling-0.291
oker-0.283
 baker-0.271
Token the
Token ID262
Feature activation+1.61e+00

Pos logit contributions
aster+1.766
ore+1.748
eling+1.695
oker+1.664
 restaurant+1.542
Neg logit contributions
 prope-1.546
 pads-1.518
lets-1.497
aters-1.483
 Tess-1.443
Token bee
Token ID20697
Feature activation+1.83e-01

Pos logit contributions
aster+2.654
ore+2.651
eling+2.531
oker+2.490
roads+2.307
Neg logit contributions
 prope-2.756
 pads-2.726
 tail-2.668
 scarf-2.651
 neck-2.621
Token said
Token ID531
Feature activation+7.34e-01

Pos logit contributions
aster+0.329
ore+0.315
eling+0.312
oker+0.301
 baker+0.286
Neg logit contributions
 prope-0.291
aters-0.283
 pads-0.278
lets-0.275
phones-0.270
Token,"
Token ID553
Feature activation+2.48e+00

Pos logit contributions
aster+1.518
eling+1.473
ore+1.456
oker+1.404
 restaurant+1.345
Neg logit contributions
 prope-0.991
aters-0.919
lets-0.918
 pads-0.917
 Tess-0.887
Token Okay
Token ID16805
Feature activation+2.05e+00

Pos logit contributions
aster+5.116
ore+5.106
eling+5.073
oker+4.887
ying+4.704
Neg logit contributions
 prope-3.343
Brown-3.129
 Bun-3.125
 Tess-3.078
Snow-3.072
Token,
Token ID11
Feature activation+8.67e-01

Pos logit contributions
aster+4.063
eling+3.980
ore+3.718
oker+3.635
 restaurant+3.556
Neg logit contributions
aters-2.852
lets-2.828
 Tess-2.820
 prope-2.796
 Bun-2.786
Token now
Token ID783
Feature activation+9.97e-01

Pos logit contributions
aster+1.758
ore+1.713
eling+1.707
oker+1.641
 restaurant+1.565
Neg logit contributions
 prope-1.185
aters-1.109
 pads-1.092
lets-1.089
 Tess-1.085
Token I
Token ID314
Feature activation+8.43e-01

Pos logit contributions
aster+1.539
ore+1.498
eling+1.468
oker+1.387
 baker+1.305
Neg logit contributions
 prope-1.820
 pads-1.799
lets-1.782
aters-1.772
 Tess-1.736
Token'll
Token ID1183
Feature activation-4.37e-01

Pos logit contributions
aster+1.692
eling+1.621
ore+1.616
oker+1.547
 baker+1.494
Neg logit contributions
 prope-1.156
aters-1.130
 pads-1.120
lets-1.112
phones-1.056
Token to
Token ID284
Feature activation+4.52e-01

Pos logit contributions
 prope+1.760
aters+1.750
lets+1.700
phones+1.647
 pads+1.629
Neg logit contributions
aster-2.184
ore-2.039
eling-1.946
 restaurant-1.928
 baker-1.893
Token go
Token ID467
Feature activation+8.79e-01

Pos logit contributions
aster+0.891
ore+0.879
eling+0.866
oker+0.835
 baker+0.799
Neg logit contributions
 prope-0.631
aters-0.612
 pads-0.596
lets-0.585
phones-0.577
Token to
Token ID284
Feature activation+1.29e+00

Pos logit contributions
aster+1.677
eling+1.641
ore+1.585
oker+1.579
 baker+1.489
Neg logit contributions
 prope-1.276
aters-1.264
 pads-1.259
phones-1.228
 Dog-1.211
Token the
Token ID262
Feature activation+2.80e-01

Pos logit contributions
ore+2.168
eling+2.131
oker+2.106
aster+2.093
 baker+1.932
Neg logit contributions
 prope-2.195
 Tess-2.118
 Dog-2.111
aters-2.075
 Bun-2.056
Token bathroom
Token ID12436
Feature activation+5.26e-01

Pos logit contributions
aster+0.401
ore+0.381
eling+0.379
oker+0.360
 baker+0.333
Neg logit contributions
 prope-0.550
aters-0.535
 pads-0.528
lets-0.521
phones-0.517
Token really
Token ID1107
Feature activation+2.13e+00

Pos logit contributions
aster+1.032
eling+0.992
ore+0.990
oker+0.958
 baker+0.909
Neg logit contributions
 prope-0.743
aters-0.729
 pads-0.720
lets-0.702
phones-0.695
Token bad
Token ID2089
Feature activation+9.13e-01

Pos logit contributions
aster+3.310
ore+3.222
oker+3.173
eling+3.157
 baker+3.016
Neg logit contributions
 prope-3.824
Snow-3.799
 pads-3.757
aters-3.638
un-3.599
Token.
Token ID13
Feature activation+1.39e-01

Pos logit contributions
aster+1.687
eling+1.627
ore+1.606
oker+1.559
 baker+1.473
Neg logit contributions
aters-1.390
 prope-1.384
 pads-1.372
lets-1.323
phones-1.310
Token She
Token ID1375
Feature activation+8.46e-01

Pos logit contributions
aster+0.281
ore+0.271
eling+0.269
oker+0.260
 baker+0.249
Neg logit contributions
 prope-0.192
aters-0.186
 pads-0.183
lets-0.180
phones-0.176
Token ran
Token ID4966
Feature activation-1.35e-01

Pos logit contributions
aster+1.659
ore+1.606
oker+1.565
eling+1.548
 baker+1.475
Neg logit contributions
 prope-1.223
 pads-1.175
aters-1.162
lets-1.140
 Tess-1.129
Token to
Token ID284
Feature activation+8.54e-01

Pos logit contributions
 prope+0.230
aters+0.224
 pads+0.218
lets+0.217
phones+0.212
Neg logit contributions
aster-0.234
ore-0.225
eling-0.220
oker-0.211
 restaurant-0.203
Token cook
Token ID4255
Feature activation+2.02e+00

Pos logit contributions
 prope+0.699
aters+0.680
 pads+0.671
lets+0.663
phones+0.647
Neg logit contributions
aster-0.728
ore-0.672
eling-0.663
oker-0.636
 restaurant-0.619
Token with
Token ID351
Feature activation+5.19e-01

Pos logit contributions
aster+3.542
eling+3.522
oker+3.506
ore+3.485
roads+3.191
Neg logit contributions
aters-3.260
 pads-3.134
 neck-3.085
lets-3.077
 scarf-3.048
Token his
Token ID465
Feature activation+6.99e-01

Pos logit contributions
aster+0.710
ore+0.693
eling+0.675
oker+0.657
 baker+0.598
Neg logit contributions
 prope-1.039
aters-1.024
 pads-1.004
lets-0.993
 Tess-0.980
Token mom
Token ID1995
Feature activation+2.17e-01

Pos logit contributions
aster+1.486
eling+1.432
ore+1.429
oker+1.388
roads+1.311
Neg logit contributions
 prope-0.896
aters-0.878
 pads-0.858
lets-0.840
phones-0.823
Token in
Token ID287
Feature activation+1.35e+00

Pos logit contributions
aster+0.466
ore+0.451
eling+0.448
oker+0.434
 baker+0.415
Neg logit contributions
 prope-0.272
aters-0.265
 pads-0.259
lets-0.254
phones-0.248
Token the
Token ID262
Feature activation+2.35e+00

Pos logit contributions
ore+2.113
eling+2.057
aster+2.047
oker+2.036
 baker+1.863
Neg logit contributions
 prope-2.426
aters-2.364
 Tess-2.334
 Dog-2.305
Snow-2.246
Token kitchen
Token ID9592
Feature activation+1.23e+00

Pos logit contributions
eling+4.093
oker+3.924
aster+3.875
ore+3.756
ying+3.524
Neg logit contributions
 pads-3.935
 Dog-3.935
 wool-3.921
 scarf-3.891
 Tess-3.866
Token.
Token ID13
Feature activation-3.69e-01

Pos logit contributions
aster+2.360
eling+2.293
ore+2.276
oker+2.241
roads+2.062
Neg logit contributions
aters-1.780
 pads-1.734
 prope-1.722
lets-1.702
phones-1.642
Token One
Token ID1881
Feature activation+2.16e+00

Pos logit contributions
 prope+0.553
aters+0.517
 pads+0.515
lets+0.511
phones+0.506
Neg logit contributions
aster-0.719
ore-0.688
eling-0.664
oker-0.647
 restaurant-0.635
Token day
Token ID1110
Feature activation-4.47e-01

Pos logit contributions
eling+3.361
oker+3.027
ore+2.987
aster+2.976
roads+2.928
Neg logit contributions
 pads-3.905
 Dog-3.894
 tail-3.831
aters-3.828
Snow-3.827
Token,
Token ID11
Feature activation-3.76e-01

Pos logit contributions
 prope+0.662
aters+0.637
 pads+0.622
lets+0.611
phones+0.602
Neg logit contributions
aster-0.885
ore-0.854
eling-0.821
oker-0.803
 baker-0.773
Token had
Token ID550
Feature activation-2.43e-01

Pos logit contributions
aster+0.712
ore+0.688
eling+0.679
oker+0.660
 baker+0.630
Neg logit contributions
 prope-0.441
aters-0.422
 pads-0.418
lets-0.412
phones-0.400
Token to
Token ID284
Feature activation+2.62e-01

Pos logit contributions
 prope+0.360
aters+0.354
 pads+0.340
lets+0.337
phones+0.334
Neg logit contributions
aster-0.468
ore-0.446
eling-0.442
oker-0.423
 restaurant-0.415
Token be
Token ID307
Feature activation+1.03e+00

Pos logit contributions
aster+0.460
ore+0.447
eling+0.439
oker+0.424
 baker+0.402
Neg logit contributions
 prope-0.425
aters-0.413
 pads-0.406
lets-0.401
phones-0.394
Token careful
Token ID8161
Feature activation-8.67e-01

Pos logit contributions
aster+2.168
ore+2.155
eling+2.062
oker+2.057
 baker+1.917
Neg logit contributions
 prope-1.321
 pads-1.317
aters-1.267
lets-1.259
 tail-1.226
Token of
Token ID286
Feature activation+8.66e-01

Pos logit contributions
 prope+1.499
aters+1.425
 Tess+1.377
lets+1.375
 pads+1.370
Neg logit contributions
aster-1.502
ore-1.461
eling-1.398
oker-1.336
 restaurant-1.301
Token the
Token ID262
Feature activation+1.13e+00

Pos logit contributions
aster+1.449
ore+1.411
eling+1.376
oker+1.368
 baker+1.268
Neg logit contributions
 prope-1.470
 pads-1.444
aters-1.430
lets-1.414
 tail-1.364
Token cat
Token ID3797
Feature activation+3.07e-01

Pos logit contributions
aster+1.971
eling+1.898
ore+1.885
oker+1.802
roads+1.675
Neg logit contributions
 prope-1.869
 pads-1.867
aters-1.842
lets-1.803
 tail-1.768
Token.
Token ID13
Feature activation+4.91e-01

Pos logit contributions
aster+0.566
ore+0.543
eling+0.540
oker+0.520
 baker+0.494
Neg logit contributions
 prope-0.475
aters-0.462
 pads-0.458
lets-0.451
phones-0.441
Token The
Token ID383
Feature activation+2.04e-01

Pos logit contributions
aster+0.999
eling+0.972
ore+0.969
oker+0.929
 baker+0.885
Neg logit contributions
 prope-0.660
aters-0.655
 pads-0.639
lets-0.624
 Tess-0.612
Token cat
Token ID3797
Feature activation+4.19e-01

Pos logit contributions
aster+0.464
ore+0.450
eling+0.447
oker+0.433
 restaurant+0.415
Neg logit contributions
 prope-0.232
aters-0.223
 pads-0.218
lets-0.213
phones-0.208
Token was
Token ID373
Feature activation+9.56e-01

Pos logit contributions
aster+0.804
ore+0.774
eling+0.764
oker+0.744
 baker+0.698
Neg logit contributions
 prope-0.621
aters-0.597
 pads-0.597
lets-0.589
phones-0.573
Token little
Token ID1310
Feature activation+5.59e-01

Pos logit contributions
aster+1.545
eling+1.513
ore+1.511
oker+1.440
roads+1.370
Neg logit contributions
 prope-1.022
aters-0.948
 pads-0.938
lets-0.929
Snow-0.927
Token girl
Token ID2576
Feature activation-4.42e-01

Pos logit contributions
aster+0.759
eling+0.724
ore+0.713
oker+0.668
 restaurant+0.609
Neg logit contributions
 prope-1.149
aters-1.119
 pads-1.100
lets-1.092
 Tess-1.075
Token named
Token ID3706
Feature activation-6.03e-01

Pos logit contributions
 prope+0.574
aters+0.552
 pads+0.537
lets+0.522
phones+0.521
Neg logit contributions
aster-0.954
ore-0.917
eling-0.888
oker-0.866
 baker-0.843
Token Lily
Token ID20037
Feature activation-7.23e-01

Pos logit contributions
 prope+1.131
aters+1.086
 pads+1.061
lets+1.043
phones+1.032
Neg logit contributions
aster-0.976
ore-0.916
eling-0.877
oker-0.860
 baker-0.835
Token.
Token ID13
Feature activation-9.27e-01

Pos logit contributions
 prope+1.348
aters+1.303
phones+1.272
 pads+1.256
lets+1.246
Neg logit contributions
aster-1.161
ore-1.108
eling-1.048
oker-1.000
 restaurant-0.974
Token She
Token ID1375
Feature activation+1.39e-01

Pos logit contributions
 prope+1.569
phones+1.440
 pads+1.426
lets+1.425
aters+1.419
Neg logit contributions
aster-1.666
ore-1.594
oker-1.514
eling-1.484
 restaurant-1.464
Token had
Token ID550
Feature activation-1.91e+00

Pos logit contributions
aster+0.302
ore+0.292
eling+0.290
oker+0.280
 baker+0.269
Neg logit contributions
 prope-0.173
aters-0.167
 pads-0.163
lets-0.160
phones-0.157
Token a
Token ID257
Feature activation-7.22e-02

Pos logit contributions
aters+3.282
 prope+3.260
lets+3.143
phones+3.031
 pads+2.938
Neg logit contributions
aster-3.437
ore-3.053
 restaurant-3.030
 baker-3.002
eling-2.924
Token pink
Token ID11398
Feature activation-1.20e+00

Pos logit contributions
 prope+0.102
aters+0.099
 pads+0.097
lets+0.096
phones+0.094
Neg logit contributions
aster-0.143
ore-0.138
eling-0.136
oker-0.132
 baker-0.128
Token dress
Token ID6576
Feature activation-5.98e-01

Pos logit contributions
 prope+1.519
aters+1.476
 pads+1.472
lets+1.435
phones+1.411
Neg logit contributions
aster-2.542
ore-2.456
 restaurant-2.353
 baker-2.346
oker-2.282
Token that
Token ID326
Feature activation-2.46e-01

Pos logit contributions
 prope+1.078
aters+1.047
 pads+1.023
lets+1.009
 Tess+0.987
Neg logit contributions
aster-0.977
ore-0.945
eling-0.890
oker-0.870
 baker-0.850
Token that
Token ID326
Feature activation+5.87e-02

Pos logit contributions
aster+0.139
ore+0.134
eling+0.132
oker+0.128
 restaurant+0.122
Neg logit contributions
 prope-0.116
aters-0.113
 pads-0.111
lets-0.109
phones-0.107
Token he
Token ID339
Feature activation-3.16e-02

Pos logit contributions
aster+0.077
ore+0.072
eling+0.071
oker+0.067
 restaurant+0.063
Neg logit contributions
 prope-0.124
aters-0.121
 pads-0.120
lets-0.119
phones-0.117
Token was
Token ID373
Feature activation-1.34e-01

Pos logit contributions
 prope+0.045
aters+0.044
 pads+0.042
lets+0.042
phones+0.041
Neg logit contributions
aster-0.063
ore-0.060
eling-0.060
oker-0.058
 baker-0.055
Token still
Token ID991
Feature activation+7.92e-01

Pos logit contributions
 prope+0.182
aters+0.178
 pads+0.174
lets+0.171
phones+0.167
Neg logit contributions
aster-0.275
ore-0.263
eling-0.262
oker-0.253
 baker-0.244
Token able
Token ID1498
Feature activation-2.29e-01

Pos logit contributions
aster+1.434
ore+1.385
eling+1.366
oker+1.314
 restaurant+1.238
Neg logit contributions
 prope-1.273
aters-1.209
 pads-1.207
lets-1.180
 Tess-1.162
Token to
Token ID284
Feature activation+1.84e-01

Pos logit contributions
 prope+0.366
aters+0.356
 pads+0.344
lets+0.342
phones+0.337
Neg logit contributions
aster-0.415
ore-0.399
eling-0.393
oker-0.379
 restaurant-0.364
Token play
Token ID711
Feature activation+1.30e+00

Pos logit contributions
aster+0.313
ore+0.301
eling+0.297
oker+0.286
 baker+0.271
Neg logit contributions
 prope-0.312
aters-0.303
 pads-0.298
lets-0.295
phones-0.290
Token with
Token ID351
Feature activation-5.76e-02

Pos logit contributions
aster+2.283
eling+2.276
ore+2.205
oker+2.129
 baker+2.001
Neg logit contributions
 prope-2.097
aters-1.991
 pads-1.982
 Bun-1.954
 tail-1.949
Token his
Token ID465
Feature activation-2.38e-01

Pos logit contributions
 prope+0.117
aters+0.115
 pads+0.113
lets+0.112
phones+0.110
Neg logit contributions
aster-0.080
ore-0.075
eling-0.075
oker-0.071
 restaurant-0.066
Token mar
Token ID1667
Feature activation+2.38e-01

Pos logit contributions
 prope+0.322
aters+0.313
 pads+0.305
lets+0.302
phones+0.296
Neg logit contributions
aster-0.484
ore-0.466
eling-0.462
oker-0.447
 restaurant-0.434
Tokenbles
Token ID7689
Feature activation+1.41e-02

Pos logit contributions
aster+0.503
ore+0.483
eling+0.480
oker+0.463
 baker+0.445
Neg logit contributions
 prope-0.309
aters-0.300
 pads-0.293
lets-0.289
phones-0.282
Tokenons
Token ID684
Feature activation-1.25e+00

Pos logit contributions
 prope+0.114
aters+0.111
 pads+0.110
lets+0.109
phones+0.108
Neg logit contributions
aster-0.053
ore-0.049
eling-0.049
oker-0.045
 baker-0.041
Token,
Token ID11
Feature activation-1.60e+00

Pos logit contributions
 prope+2.016
aters+1.883
phones+1.842
Brown+1.834
Snow+1.819
Neg logit contributions
aster-2.226
ore-2.206
oker-2.058
eling-2.025
 baker-2.015
Token scissors
Token ID41786
Feature activation-1.32e+00

Pos logit contributions
 prope+2.493
aters+2.456
Brown+2.347
 Tess+2.337
Snow+2.329
Neg logit contributions
aster-3.016
eling-2.669
ore-2.649
 salsa-2.549
 restaurant-2.539
Token and
Token ID290
Feature activation-1.40e+00

Pos logit contributions
 prope+2.086
aters+1.982
 pads+1.926
Brown+1.924
Snow+1.916
Neg logit contributions
aster-2.395
ore-2.345
eling-2.279
oker-2.162
 baker-2.105
Token glue
Token ID22749
Feature activation-4.76e-01

Pos logit contributions
aters+2.349
 prope+2.285
 Dog+2.214
 pads+2.199
 Tess+2.198
Neg logit contributions
aster-2.473
eling-2.226
ore-2.139
oker-2.053
 salsa-2.049
Token.
Token ID13
Feature activation+1.42e-01

Pos logit contributions
 prope+0.810
aters+0.777
 pads+0.760
phones+0.750
Brown+0.747
Neg logit contributions
aster-0.808
ore-0.778
eling-0.756
oker-0.726
 restaurant-0.692
Token They
Token ID1119
Feature activation-6.92e-02

Pos logit contributions
aster+0.291
ore+0.281
eling+0.278
oker+0.269
 baker+0.257
Neg logit contributions
 prope-0.192
aters-0.186
 pads-0.182
lets-0.179
phones-0.176
Token are
Token ID389
Feature activation-4.26e-01

Pos logit contributions
 prope+0.105
aters+0.103
 pads+0.100
lets+0.098
phones+0.097
Neg logit contributions
aster-0.130
ore-0.124
eling-0.124
oker-0.119
 baker-0.114
Token very
Token ID845
Feature activation+1.43e+00

Pos logit contributions
 prope+0.657
aters+0.640
 pads+0.629
phones+0.617
lets+0.614
Neg logit contributions
aster-0.794
eling-0.757
ore-0.746
oker-0.716
 baker-0.710
Token creative
Token ID7325
Feature activation-8.55e-01

Pos logit contributions
aster+2.916
ore+2.792
eling+2.752
roads+2.630
oker+2.592
Neg logit contributions
 prope-2.036
lets-1.961
aters-1.922
 pads-1.909
 Tess-1.832
Token.
Token ID13
Feature activation-1.34e-01

Pos logit contributions
 prope+1.456
aters+1.365
 pads+1.351
phones+1.348
 Tess+1.341
Neg logit contributions
aster-1.434
ore-1.425
eling-1.381
oker-1.360
 restaurant-1.318
Token surfing
Token ID36254
Feature activation+1.91e+00

Pos logit contributions
aster+2.343
ore+2.233
eling+2.200
oker+2.182
 baker+2.063
Neg logit contributions
 prope-1.480
 pads-1.451
aters-1.437
lets-1.396
phones-1.384
Token and
Token ID290
Feature activation-7.42e-02

Pos logit contributions
aster+3.649
eling+3.501
ore+3.407
oker+3.311
 baker+3.241
Neg logit contributions
 pads-2.955
aters-2.787
 prope-2.778
lets-2.699
phones-2.645
Token he
Token ID339
Feature activation-2.45e-02

Pos logit contributions
 prope+0.132
aters+0.129
 pads+0.126
lets+0.125
phones+0.123
Neg logit contributions
aster-0.121
ore-0.115
eling-0.114
oker-0.109
 restaurant-0.104
Token felt
Token ID2936
Feature activation+9.54e-01

Pos logit contributions
 prope+0.038
aters+0.038
 pads+0.037
lets+0.036
phones+0.036
Neg logit contributions
aster-0.045
ore-0.043
eling-0.043
oker-0.041
 restaurant-0.039
Token regret
Token ID13721
Feature activation-1.95e-01

Pos logit contributions
aster+1.703
ore+1.631
eling+1.575
oker+1.571
 baker+1.478
Neg logit contributions
 prope-1.574
 pads-1.457
aters-1.451
 Dog-1.419
 Tess-1.412
Token that
Token ID326
Feature activation+3.16e-01

Pos logit contributions
 prope+0.333
aters+0.326
 pads+0.320
lets+0.317
phones+0.311
Neg logit contributions
aster-0.330
ore-0.318
eling-0.314
oker-0.298
 restaurant-0.284
Token he
Token ID339
Feature activation+5.17e-01

Pos logit contributions
aster+0.457
ore+0.436
eling+0.430
oker+0.408
 baker+0.383
Neg logit contributions
 prope-0.618
aters-0.600
 pads-0.595
lets-0.587
phones-0.580
Token couldn
Token ID3521
Feature activation+1.24e+00

Pos logit contributions
aster+1.088
ore+1.042
eling+1.022
oker+1.005
 baker+0.968
Neg logit contributions
 prope-0.674
 pads-0.646
aters-0.645
lets-0.631
phones-0.618
Token't
Token ID470
Feature activation+5.76e-01

Pos logit contributions
aster+2.687
eling+2.625
ore+2.584
oker+2.555
 baker+2.494
Neg logit contributions
 prope-1.462
aters-1.440
 pads-1.368
un-1.357
Snow-1.340
Token do
Token ID466
Feature activation+1.60e+00

Pos logit contributions
aster+0.979
ore+0.947
eling+0.937
oker+0.904
 baker+0.866
Neg logit contributions
 prope-0.965
aters-0.928
 pads-0.918
lets-0.903
phones-0.896
Token it
Token ID340
Feature activation+1.16e+00

Pos logit contributions
aster+2.970
ore+2.941
oker+2.828
eling+2.789
 baker+2.776
Neg logit contributions
 prope-2.366
 pads-2.245
phones-2.215
aters-2.198
lets-2.166
Token had
Token ID550
Feature activation-1.76e+00

Pos logit contributions
aster+0.382
ore+0.369
eling+0.366
oker+0.354
 baker+0.340
Neg logit contributions
 prope-0.217
aters-0.209
 pads-0.205
lets-0.201
phones-0.196
Token a
Token ID257
Feature activation-1.72e-01

Pos logit contributions
aters+3.004
 prope+2.964
lets+2.898
phones+2.766
 pads+2.718
Neg logit contributions
aster-3.211
ore-2.850
 baker-2.797
 restaurant-2.791
eling-2.750
Token shiny
Token ID22441
Feature activation-8.08e-02

Pos logit contributions
 prope+0.248
aters+0.244
 pads+0.238
lets+0.234
phones+0.230
Neg logit contributions
aster-0.335
ore-0.322
eling-0.316
oker-0.308
 baker-0.300
Token bag
Token ID6131
Feature activation-5.41e-01

Pos logit contributions
 prope+0.107
aters+0.104
 pads+0.102
lets+0.100
phones+0.098
Neg logit contributions
aster-0.167
ore-0.161
eling-0.159
oker-0.154
 baker-0.149
Token and
Token ID290
Feature activation-1.44e+00

Pos logit contributions
 prope+1.013
aters+0.971
 pads+0.949
lets+0.942
phones+0.930
Neg logit contributions
aster-0.851
ore-0.829
eling-0.775
oker-0.764
 restaurant-0.731
Token it
Token ID340
Feature activation-4.35e-02

Pos logit contributions
 prope+2.596
aters+2.505
 pads+2.345
phones+2.340
 mate+2.309
Neg logit contributions
aster-2.393
eling-2.253
ore-2.209
 baker-2.130
oker-2.091
Token made
Token ID925
Feature activation-1.36e-01

Pos logit contributions
 prope+0.070
aters+0.068
 pads+0.067
lets+0.066
phones+0.065
Neg logit contributions
aster-0.078
ore-0.075
eling-0.074
oker-0.071
 restaurant-0.068
Token her
Token ID607
Feature activation-6.42e-01

Pos logit contributions
 prope+0.242
aters+0.234
 pads+0.231
lets+0.228
phones+0.224
Neg logit contributions
aster-0.225
ore-0.215
eling-0.211
oker-0.202
 restaurant-0.192
Token feel
Token ID1254
Feature activation+8.18e-03

Pos logit contributions
 prope+1.030
aters+1.003
 pads+0.965
lets+0.965
phones+0.961
Neg logit contributions
aster-1.150
ore-1.099
eling-1.096
 restaurant-1.034
oker-1.033
Token very
Token ID845
Feature activation+1.11e+00

Pos logit contributions
aster+0.016
ore+0.015
eling+0.015
oker+0.014
 restaurant+0.014
Neg logit contributions
 prope-0.012
aters-0.012
 pads-0.011
lets-0.011
phones-0.011
Token happy
Token ID3772
Feature activation-5.53e-01

Pos logit contributions
aster+2.280
ore+2.149
eling+2.114
oker+2.055
 baker+1.997
Neg logit contributions
 prope-1.538
 pads-1.455
aters-1.442
lets-1.416
Snow-1.386
Token air
Token ID1633
Feature activation+4.67e-01

Pos logit contributions
 prope+0.152
aters+0.150
 pads+0.146
lets+0.145
phones+0.142
Neg logit contributions
aster-0.176
ore-0.169
eling-0.167
oker-0.161
 baker-0.154
Token got
Token ID1392
Feature activation-7.63e-01

Pos logit contributions
aster+0.868
ore+0.833
eling+0.822
oker+0.799
 baker+0.761
Neg logit contributions
 prope-0.718
aters-0.689
 pads-0.685
lets-0.671
phones-0.656
Token thinner
Token ID30125
Feature activation-2.21e-01

Pos logit contributions
aters+1.475
 prope+1.474
lets+1.445
 pads+1.443
phones+1.424
Neg logit contributions
aster-1.091
eling-1.058
ore-1.032
oker-0.942
 baker-0.924
Token and
Token ID290
Feature activation+4.43e-02

Pos logit contributions
 prope+0.307
aters+0.298
 pads+0.289
lets+0.288
phones+0.284
Neg logit contributions
aster-0.442
ore-0.426
eling-0.423
oker-0.406
 restaurant-0.391
Token thinner
Token ID30125
Feature activation-1.57e-01

Pos logit contributions
aster+0.078
ore+0.074
eling+0.074
oker+0.071
 baker+0.067
Neg logit contributions
 prope-0.073
aters-0.071
 pads-0.070
lets-0.069
phones-0.068
Token.
Token ID13
Feature activation-6.82e-02

Pos logit contributions
 prope+0.246
aters+0.240
 pads+0.234
lets+0.232
phones+0.229
Neg logit contributions
aster-0.287
ore-0.275
eling-0.273
oker-0.261
 restaurant-0.249
Token They
Token ID1119
Feature activation-4.25e-01

Pos logit contributions
 prope+0.109
aters+0.105
 pads+0.103
lets+0.102
phones+0.100
Neg logit contributions
aster-0.124
ore-0.119
eling-0.116
oker-0.113
 baker-0.107
Token started
Token ID2067
Feature activation+1.80e-01

Pos logit contributions
aters+0.590
 prope+0.589
 pads+0.561
phones+0.552
lets+0.552
Neg logit contributions
aster-0.853
eling-0.817
ore-0.806
oker-0.768
 baker-0.744
Token getting
Token ID1972
Feature activation+3.75e-01

Pos logit contributions
aster+0.336
ore+0.323
eling+0.320
oker+0.309
 baker+0.294
Neg logit contributions
 prope-0.275
aters-0.267
 pads-0.262
lets-0.258
phones-0.254
Token dizz
Token ID29527
Feature activation-5.54e-01

Pos logit contributions
aster+0.598
ore+0.570
eling+0.560
oker+0.542
 restaurant+0.507
Neg logit contributions
 prope-0.682
aters-0.662
 pads-0.653
lets-0.643
phones-0.634
Tokeny
Token ID88
Feature activation-1.53e-01

Pos logit contributions
 prope+0.989
aters+0.939
lets+0.930
 pads+0.929
phones+0.928
Neg logit contributions
aster-0.894
ore-0.881
eling-0.861
oker-0.817
 restaurant-0.762
Token Tom
Token ID4186
Feature activation+1.48e+00

Pos logit contributions
 prope+1.261
aters+1.222
lets+1.204
 pads+1.200
phones+1.188
Neg logit contributions
aster-1.136
ore-1.081
oker-1.017
eling-0.996
 baker-0.972
Token says
Token ID1139
Feature activation-6.01e-01

Pos logit contributions
aster+2.424
ore+2.266
oker+2.236
eling+2.133
 baker+1.983
Neg logit contributions
 prope-2.679
 pads-2.634
aters-2.599
lets-2.593
 Dog-2.535
Token sorry
Token ID7926
Feature activation-3.48e-01

Pos logit contributions
aters+1.018
 prope+1.004
 pads+0.983
lets+0.962
phones+0.959
Neg logit contributions
aster-1.049
ore-1.002
eling-0.974
oker-0.954
 baker-0.925
Token to
Token ID284
Feature activation+6.33e-01

Pos logit contributions
 prope+0.569
aters+0.548
 pads+0.529
phones+0.525
lets+0.522
Neg logit contributions
aster-0.623
ore-0.607
eling-0.584
oker-0.567
 restaurant-0.551
Token Sam
Token ID3409
Feature activation+8.48e-01

Pos logit contributions
ore+1.175
aster+1.175
eling+1.144
oker+1.126
 baker+1.063
Neg logit contributions
 prope-0.953
aters-0.915
 pads-0.902
lets-0.891
phones-0.882
Token.
Token ID13
Feature activation-8.44e-01

Pos logit contributions
aster+1.658
eling+1.588
ore+1.573
oker+1.545
 baker+1.453
Neg logit contributions
 prope-1.217
aters-1.203
 pads-1.202
lets-1.177
 Dog-1.124
Token They
Token ID1119
Feature activation-2.60e-01

Pos logit contributions
 prope+1.580
aters+1.522
lets+1.500
 pads+1.491
phones+1.489
Neg logit contributions
aster-1.352
ore-1.284
oker-1.211
eling-1.165
 baker-1.154
Token hug
Token ID16225
Feature activation-9.77e-01

Pos logit contributions
 prope+0.424
aters+0.416
 pads+0.406
lets+0.400
phones+0.396
Neg logit contributions
aster-0.462
eling-0.438
ore-0.438
oker-0.416
 restaurant-0.401
Token and
Token ID290
Feature activation+2.92e-01

Pos logit contributions
 prope+1.709
aters+1.658
 pads+1.608
lets+1.588
phones+1.581
Neg logit contributions
aster-1.730
ore-1.599
eling-1.542
oker-1.458
 restaurant-1.452
Token make
Token ID787
Feature activation+9.45e-01

Pos logit contributions
aster+0.516
ore+0.500
eling+0.493
oker+0.474
 baker+0.450
Neg logit contributions
 prope-0.478
aters-0.462
 pads-0.456
lets-0.450
phones-0.442
Token up
Token ID510
Feature activation+1.47e+00

Pos logit contributions
aster+1.676
ore+1.628
eling+1.611
oker+1.582
 baker+1.500
Neg logit contributions
 prope-1.501
aters-1.470
 pads-1.460
lets-1.428
 Dog-1.386
Token were
Token ID547
Feature activation-1.33e+00

Pos logit contributions
 prope+2.107
aters+2.101
 pads+1.994
phones+1.915
lets+1.913
Neg logit contributions
aster-2.301
eling-2.292
ore-2.206
oker-2.060
 restaurant-2.025
Token looking
Token ID2045
Feature activation-9.81e-02

Pos logit contributions
 prope+2.259
aters+2.225
 pads+2.142
Snow+2.128
lets+2.116
Neg logit contributions
aster-2.210
eling-2.185
ore-2.169
oker-2.013
 baker-2.009
Token for
Token ID329
Feature activation-9.84e-01

Pos logit contributions
 prope+0.191
aters+0.187
 pads+0.183
lets+0.181
phones+0.180
Neg logit contributions
aster-0.142
ore-0.137
eling-0.134
oker-0.128
 restaurant-0.121
Token.
Token ID13
Feature activation+1.25e-01

Pos logit contributions
 prope+1.818
aters+1.782
 pads+1.732
lets+1.705
Brown+1.696
Neg logit contributions
aster-1.571
ore-1.476
eling-1.475
oker-1.384
 restaurant-1.371
Token But
Token ID887
Feature activation+4.21e-02

Pos logit contributions
aster+0.235
ore+0.226
eling+0.224
oker+0.216
 baker+0.206
Neg logit contributions
 prope-0.191
aters-0.185
 pads-0.182
lets-0.179
phones-0.176
Token the
Token ID262
Feature activation-1.38e-01

Pos logit contributions
aster+0.076
ore+0.073
eling+0.072
oker+0.069
 baker+0.066
Neg logit contributions
 prope-0.067
aters-0.066
 pads-0.064
lets-0.063
phones-0.062
Token poster
Token ID11968
Feature activation+4.42e-01

Pos logit contributions
 prope+0.234
aters+0.229
 pads+0.224
lets+0.222
phones+0.217
Neg logit contributions
aster-0.240
ore-0.230
eling-0.226
oker-0.219
 restaurant-0.209
Token said
Token ID531
Feature activation-6.42e-01

Pos logit contributions
aster+0.828
ore+0.795
eling+0.784
oker+0.754
 restaurant+0.720
Neg logit contributions
 prope-0.679
aters-0.651
 pads-0.648
lets-0.641
phones-0.627
Token the
Token ID262
Feature activation+4.96e-03

Pos logit contributions
aters+1.032
 prope+1.018
 pads+1.000
lets+0.963
phones+0.947
Neg logit contributions
aster-1.169
ore-1.117
eling-1.084
oker-1.057
 baker-1.036
Token dog
Token ID3290
Feature activation-5.09e-01

Pos logit contributions
aster+0.009
ore+0.008
eling+0.008
oker+0.008
 restaurant+0.008
Neg logit contributions
 prope-0.008
aters-0.008
 pads-0.008
lets-0.008
phones-0.008
Token had
Token ID550
Feature activation-8.03e-01

Pos logit contributions
 prope+0.831
aters+0.815
 pads+0.776
lets+0.758
phones+0.758
Neg logit contributions
aster-0.893
ore-0.868
eling-0.859
oker-0.820
 baker-0.788
Token visit
Token ID3187
Feature activation+4.74e-01

Pos logit contributions
aster+1.475
eling+1.462
ore+1.456
oker+1.428
 baker+1.306
Neg logit contributions
 prope-1.635
aters-1.595
 Tess-1.579
 Dog-1.562
 pads-1.554
Token their
Token ID511
Feature activation+2.42e-02

Pos logit contributions
aster+0.860
eling+0.834
ore+0.832
oker+0.805
 baker+0.760
Neg logit contributions
 prope-0.740
aters-0.725
 pads-0.712
lets-0.702
phones-0.692
Token neighbor
Token ID4780
Feature activation+6.66e-01

Pos logit contributions
aster+0.046
ore+0.044
eling+0.044
oker+0.042
 restaurant+0.041
Neg logit contributions
 prope-0.036
aters-0.035
 pads-0.034
lets-0.034
phones-0.033
Token,
Token ID11
Feature activation-1.25e+00

Pos logit contributions
aster+1.313
eling+1.279
ore+1.263
oker+1.224
 baker+1.149
Neg logit contributions
aters-0.934
 pads-0.924
 prope-0.917
lets-0.905
phones-0.881
Token Mrs
Token ID9074
Feature activation+3.82e-01

Pos logit contributions
 prope+2.144
aters+2.096
lets+1.955
 pads+1.944
phones+1.896
Neg logit contributions
aster-2.262
ore-2.047
 baker-1.998
 restaurant-1.984
oker-1.984
Token.
Token ID13
Feature activation-1.14e+00

Pos logit contributions
aster+0.676
ore+0.648
eling+0.646
oker+0.621
 baker+0.583
Neg logit contributions
 prope-0.619
aters-0.610
 pads-0.599
lets-0.593
phones-0.579
Token Lee
Token ID5741
Feature activation+6.06e-01

Pos logit contributions
 prope+1.670
 pads+1.568
aters+1.554
lets+1.539
phones+1.490
Neg logit contributions
aster-2.251
ore-2.112
 restaurant-2.053
eling-2.053
 baker-2.046
Token.
Token ID13
Feature activation-6.18e-01

Pos logit contributions
aster+1.282
eling+1.235
ore+1.229
oker+1.193
 baker+1.123
Neg logit contributions
 prope-0.766
aters-0.753
 pads-0.748
lets-0.744
 Tess-0.724
Token Mrs
Token ID9074
Feature activation+6.40e-01

Pos logit contributions
 prope+1.073
aters+1.009
 pads+1.003
lets+0.992
phones+0.977
Neg logit contributions
aster-1.062
ore-0.998
eling-0.962
oker-0.938
 restaurant-0.933
Token.
Token ID13
Feature activation-1.19e+00

Pos logit contributions
aster+1.171
eling+1.122
ore+1.112
oker+1.075
 baker+0.999
Neg logit contributions
aters-0.998
 prope-0.997
 pads-0.978
lets-0.972
 Tess-0.944
Token Lee
Token ID5741
Feature activation+5.75e-01

Pos logit contributions
 prope+2.047
 pads+1.925
aters+1.888
lets+1.874
phones+1.825
Neg logit contributions
aster-2.070
ore-1.955
 restaurant-1.902
 baker-1.881
eling-1.874
Token park
Token ID3952
Feature activation+7.15e-01

Pos logit contributions
 prope+0.343
aters+0.335
lets+0.328
 pads+0.326
phones+0.323
Neg logit contributions
aster-0.285
ore-0.269
eling-0.265
oker-0.252
 restaurant-0.243
Token.
Token ID13
Feature activation+7.30e-01

Pos logit contributions
aster+1.315
eling+1.261
ore+1.259
oker+1.211
 baker+1.146
Neg logit contributions
 prope-1.107
 pads-1.094
aters-1.088
lets-1.069
phones-1.027
Token The
Token ID383
Feature activation-9.49e-01

Pos logit contributions
aster+1.334
eling+1.307
ore+1.283
oker+1.246
 baker+1.162
Neg logit contributions
 prope-1.140
aters-1.134
 pads-1.109
lets-1.075
 Dog-1.060
Token oak
Token ID26210
Feature activation+1.48e+00

Pos logit contributions
 prope+1.550
aters+1.528
lets+1.468
 pads+1.465
phones+1.456
Neg logit contributions
aster-1.665
ore-1.592
eling-1.579
 baker-1.494
oker-1.490
Token tree
Token ID5509
Feature activation+1.69e+00

Pos logit contributions
aster+2.800
ore+2.624
eling+2.600
oker+2.583
 restaurant+2.341
Neg logit contributions
 pads-2.343
 prope-2.341
aters-2.231
lets-2.218
 Dog-2.168
Token has
Token ID468
Feature activation-2.39e+00

Pos logit contributions
aster+2.958
ore+2.702
oker+2.686
eling+2.663
 baker+2.441
Neg logit contributions
 pads-3.000
 prope-2.941
lets-2.918
aters-2.863
 neck-2.753
Token many
Token ID867
Feature activation-7.53e-01

Pos logit contributions
aters+4.941
phones+4.718
 prope+4.697
lets+4.626
Brown+4.620
Neg logit contributions
aster-3.385
 restaurant-3.009
eling-2.985
 baker-2.966
ore-2.871
Token leaves
Token ID5667
Feature activation-3.76e-01

Pos logit contributions
 prope+1.054
aters+1.011
phones+0.993
 Tess+0.980
 pads+0.970
Neg logit contributions
aster-1.473
eling-1.454
ore-1.425
oker-1.351
 restaurant-1.344
Token and
Token ID290
Feature activation-1.20e+00

Pos logit contributions
 prope+0.638
aters+0.617
 pads+0.595
lets+0.595
phones+0.593
Neg logit contributions
aster-0.639
ore-0.624
eling-0.607
oker-0.578
 restaurant-0.554
Token ac
Token ID936
Feature activation-1.15e-01

Pos logit contributions
aters+1.902
 prope+1.876
 Tess+1.786
phones+1.779
un+1.772
Neg logit contributions
aster-2.192
eling-2.096
ore-2.004
 restaurant-1.947
 baker-1.920
Tokenorns
Token ID19942
Feature activation+2.62e-01

Pos logit contributions
 prope+0.235
aters+0.228
 pads+0.224
lets+0.223
phones+0.221
Neg logit contributions
aster-0.157
ore-0.150
eling-0.147
oker-0.138
 baker-0.130
Token decided
Token ID3066
Feature activation-4.63e-01

Pos logit contributions
aters+1.562
 prope+1.529
phones+1.453
 pads+1.451
lets+1.429
Neg logit contributions
aster-1.487
eling-1.439
ore-1.401
oker-1.345
 restaurant-1.314
Token to
Token ID284
Feature activation-9.31e-01

Pos logit contributions
aters+0.770
 prope+0.762
 pads+0.728
lets+0.716
phones+0.702
Neg logit contributions
aster-0.809
ore-0.780
eling-0.766
oker-0.730
 restaurant-0.709
Token follow
Token ID1061
Feature activation+6.38e-01

Pos logit contributions
aters+1.656
 pads+1.641
 prope+1.638
lets+1.569
phones+1.533
Neg logit contributions
aster-1.553
ore-1.456
eling-1.440
oker-1.365
 baker-1.315
Token him
Token ID683
Feature activation-1.16e+00

Pos logit contributions
aster+1.006
eling+0.964
ore+0.958
oker+0.934
 baker+0.881
Neg logit contributions
 prope-1.161
aters-1.113
 pads-1.103
lets-1.093
 Tess-1.091
Token.
Token ID13
Feature activation+6.91e-01

Pos logit contributions
 prope+2.188
aters+2.135
 Tess+2.028
Snow+2.025
Brown+2.024
Neg logit contributions
ore-1.766
aster-1.724
eling-1.633
oker-1.539
 restaurant-1.507
Token They
Token ID1119
Feature activation-1.99e+00

Pos logit contributions
aster+1.321
eling+1.275
ore+1.248
oker+1.217
 baker+1.150
Neg logit contributions
 prope-1.029
aters-1.015
 pads-1.012
lets-0.988
 Tess-0.980
Token went
Token ID1816
Feature activation-8.05e-01

Pos logit contributions
aters+3.817
 prope+3.670
 pads+3.471
phones+3.460
Brown+3.437
Neg logit contributions
aster-3.093
eling-3.068
ore-2.847
 restaurant-2.648
roads-2.582
Token on
Token ID319
Feature activation-6.46e-01

Pos logit contributions
 prope+1.562
aters+1.526
lets+1.473
 pads+1.472
Brown+1.429
Neg logit contributions
aster-1.184
ore-1.180
eling-1.095
oker-1.053
 restaurant-1.040
Token an
Token ID281
Feature activation-1.02e+00

Pos logit contributions
 prope+1.090
aters+1.089
 pads+1.055
lets+1.054
phones+1.020
Neg logit contributions
aster-1.113
ore-1.067
eling-1.037
 restaurant-0.989
oker-0.975
Token adventure
Token ID8855
Feature activation-3.25e-01

Pos logit contributions
 prope+1.563
aters+1.541
lets+1.507
 pads+1.482
phones+1.479
Neg logit contributions
aster-1.966
eling-1.856
ore-1.760
 restaurant-1.689
oker-1.678
Token together
Token ID1978
Feature activation-5.51e-01

Pos logit contributions
 prope+0.495
aters+0.483
 pads+0.467
lets+0.464
phones+0.454
Neg logit contributions
aster-0.612
ore-0.594
eling-0.577
oker-0.560
 restaurant-0.541
Token,
Token ID11
Feature activation-1.81e+00

Pos logit contributions
 prope+2.627
aters+2.611
 pads+2.453
phones+2.445
lets+2.429
Neg logit contributions
aster-2.702
ore-2.688
eling-2.532
oker-2.345
 restaurant-2.339
Token and
Token ID290
Feature activation-1.33e+00

Pos logit contributions
aters+2.915
 prope+2.842
lets+2.684
phones+2.666
Brown+2.653
Neg logit contributions
aster-3.236
eling-3.183
ore-3.178
 restaurant-2.857
 baker-2.851
Token it
Token ID340
Feature activation-7.46e-01

Pos logit contributions
aters+1.934
 prope+1.875
 pads+1.738
lets+1.737
phones+1.710
Neg logit contributions
aster-2.647
eling-2.506
ore-2.487
oker-2.380
 restaurant-2.347
Token was
Token ID373
Feature activation-2.28e+00

Pos logit contributions
aters+1.204
 prope+1.195
phones+1.142
 pads+1.132
lets+1.118
Neg logit contributions
aster-1.309
eling-1.288
ore-1.271
oker-1.187
 baker-1.151
Token singing
Token ID13777
Feature activation-2.11e+00

Pos logit contributions
phones+3.743
aters+3.677
lets+3.622
 prope+3.619
 pads+3.533
Neg logit contributions
aster-4.031
eling-3.954
ore-3.888
 baker-3.602
 early-3.522
Token in
Token ID287
Feature activation-2.03e+00

Pos logit contributions
aters+3.908
 prope+3.851
phones+3.809
 pads+3.741
lets+3.684
Neg logit contributions
ore-3.406
aster-3.278
 early-3.052
 restaurant-3.041
eling-2.959
Token the
Token ID262
Feature activation-1.09e+00

Pos logit contributions
lets+4.523
phones+4.473
 pads+4.466
aters+4.455
 prope+4.437
Neg logit contributions
aster-2.643
eling-2.322
ore-2.200
 restaurant-2.116
oker-1.960
Token rain
Token ID6290
Feature activation-1.10e+00

Pos logit contributions
aters+1.807
 prope+1.803
lets+1.795
phones+1.732
 pads+1.724
Neg logit contributions
aster-1.903
ore-1.798
eling-1.784
 restaurant-1.687
 baker-1.663
Token.
Token ID13
Feature activation-8.79e-01

Pos logit contributions
 prope+2.050
aters+1.945
lets+1.920
phones+1.910
 Tess+1.897
Neg logit contributions
aster-1.690
ore-1.686
eling-1.593
oker-1.519
 restaurant-1.477
Token The
Token ID383
Feature activation-1.90e+00

Pos logit contributions
 prope+0.966
aters+0.917
lets+0.866
 pads+0.854
Snow+0.817
Neg logit contributions
aster-2.039
ore-1.978
eling-1.916
oker-1.883
 restaurant-1.836
Token dove
Token ID38862
Feature activation-8.49e-01

Pos logit contributions
aters+3.041
lets+2.949
 pads+2.910
phones+2.867
 prope+2.815
Neg logit contributions
aster-3.555
eling-3.354
 baker-3.339
ore-3.287
 restaurant-3.184
Token girl
Token ID2576
Feature activation-7.54e-01

Pos logit contributions
aster+0.264
ore+0.249
eling+0.247
oker+0.232
 restaurant+0.215
Neg logit contributions
 prope-0.426
aters-0.416
 pads-0.410
lets-0.406
phones-0.401
Token named
Token ID3706
Feature activation-5.60e-01

Pos logit contributions
 prope+1.036
aters+1.000
 pads+0.969
phones+0.946
lets+0.938
Neg logit contributions
aster-1.593
ore-1.520
eling-1.466
oker-1.412
 baker-1.395
Token Lily
Token ID20037
Feature activation-1.28e+00

Pos logit contributions
 prope+1.037
aters+0.994
 pads+0.972
lets+0.957
phones+0.951
Neg logit contributions
aster-0.915
ore-0.870
eling-0.840
oker-0.810
 baker-0.790
Token.
Token ID13
Feature activation-1.65e+00

Pos logit contributions
 prope+2.569
aters+2.466
phones+2.453
 pads+2.353
lets+2.344
Neg logit contributions
aster-1.936
ore-1.823
eling-1.738
 restaurant-1.590
oker-1.571
Token She
Token ID1375
Feature activation-4.65e-01

Pos logit contributions
 prope+3.045
phones+2.831
lets+2.754
 pads+2.721
tails+2.673
Neg logit contributions
aster-2.776
ore-2.714
oker-2.488
 restaurant-2.469
 baker-2.434
Token had
Token ID550
Feature activation-2.36e+00

Pos logit contributions
 prope+0.638
aters+0.629
 pads+0.596
lets+0.594
phones+0.593
Neg logit contributions
aster-0.959
ore-0.922
eling-0.911
oker-0.871
 restaurant-0.853
Token a
Token ID257
Feature activation-5.15e-01

Pos logit contributions
aters+4.294
lets+4.164
 prope+4.146
phones+3.947
 pads+3.886
Neg logit contributions
aster-4.114
 baker-3.598
 restaurant-3.519
ore-3.516
eling-3.402
Token toy
Token ID13373
Feature activation-1.26e+00

Pos logit contributions
 prope+0.745
aters+0.744
 pads+0.727
lets+0.712
phones+0.701
Neg logit contributions
aster-0.992
ore-0.955
eling-0.927
oker-0.908
 baker-0.906
Token tank
Token ID6873
Feature activation+2.41e-01

Pos logit contributions
 prope+2.073
 pads+2.062
aters+2.060
lets+1.978
Snow+1.892
Neg logit contributions
ore-2.161
aster-2.155
 baker-1.986
oker-1.943
eling-1.937
Token that
Token ID326
Feature activation-4.09e-01

Pos logit contributions
aster+0.424
ore+0.405
eling+0.404
oker+0.386
 baker+0.366
Neg logit contributions
 prope-0.395
aters-0.386
 pads-0.379
lets-0.375
phones-0.368
Token she
Token ID673
Feature activation-1.80e-01

Pos logit contributions
 prope+0.790
aters+0.772
 pads+0.750
lets+0.743
phones+0.736
Neg logit contributions
aster-0.611
ore-0.585
eling-0.565
oker-0.546
 baker-0.524
Token happy
Token ID3772
Feature activation+2.06e-01

Pos logit contributions
aster+1.295
eling+1.231
ore+1.228
oker+1.170
 restaurant+1.119
Neg logit contributions
 prope-1.096
aters-1.038
 pads-1.026
lets-1.020
 Tess-0.997
Token and
Token ID290
Feature activation-6.03e-01

Pos logit contributions
aster+0.401
ore+0.384
eling+0.382
oker+0.368
 baker+0.351
Neg logit contributions
 prope-0.300
aters-0.292
 pads-0.287
lets-0.283
phones-0.278
Token said
Token ID531
Feature activation+9.53e-02

Pos logit contributions
 prope+1.130
aters+1.117
 pads+1.064
phones+1.055
lets+1.050
Neg logit contributions
aster-0.925
ore-0.886
eling-0.871
oker-0.835
 baker-0.801
Token,
Token ID11
Feature activation-1.63e-01

Pos logit contributions
aster+0.195
ore+0.188
eling+0.186
oker+0.180
 baker+0.173
Neg logit contributions
 prope-0.130
aters-0.126
 pads-0.123
lets-0.121
phones-0.119
Token "
Token ID366
Feature activation-2.25e+00

Pos logit contributions
 prope+0.217
aters+0.213
 pads+0.207
lets+0.203
phones+0.199
Neg logit contributions
aster-0.337
ore-0.326
eling-0.319
oker-0.309
 baker-0.301
TokenMom
Token ID29252
Feature activation-2.49e+00

Pos logit contributions
 prope+4.839
 mate+4.613
 pads+4.581
aters+4.512
phones+4.496
Neg logit contributions
aster-3.026
ore-2.697
 baker-2.665
oker-2.578
eling-2.575
Tokenmy
Token ID1820
Feature activation-1.59e+00

Pos logit contributions
 prope+4.904
Snow+4.212
phones+4.186
 pads+4.172
Wh+4.089
Neg logit contributions
aster-4.139
ore-3.962
eling-3.861
 restaurant-3.778
 baker-3.734
Token,
Token ID11
Feature activation-6.18e-01

Pos logit contributions
 prope+2.761
 pads+2.507
aters+2.498
phones+2.419
Snow+2.392
Neg logit contributions
aster-2.807
ore-2.767
eling-2.687
oker-2.596
 baker-2.587
Token burying
Token ID48360
Feature activation+5.12e-01

Pos logit contributions
aters+1.053
 prope+1.044
 pads+1.028
lets+0.989
phones+0.974
Neg logit contributions
aster-1.064
ore-1.018
eling-0.983
oker-0.955
 restaurant-0.931
Token the
Token ID262
Feature activation+3.13e-01

Pos logit contributions
aster+0.819
ore+0.778
eling+0.770
oker+0.747
 baker+0.702
Neg logit contributions
 prope-0.922
aters-0.900
 pads-0.883
lets-0.877
 Tess-0.863
Token rock
Token ID3881
Feature activation+7.77e-01

Pos logit contributions
aster+0.547
ore+0.522
eling+0.516
oker+0.497
 baker+0.469
Neg logit contributions
 prope-0.531
aters-0.510
 pads-0.504
lets-0.497
phones-0.488