TOP ACTIVATIONS
MAX = 544.175

the farmer approached , the goat panicked and quickly ran away
was very frustrated with the goat delaying his bed time and
were able to buy the puppy ! <|endoftext|> Once upon a
time there was a black goat who loved to delay his
he had to protect the puppy . So , he stood
like Tim my or the puppy . Tim my felt a
scared him away . The puppy 's family was so grateful
, Tim my and the puppy became best friends and went
she didn 't notice her puppy , Am igo , rush
a time there was a puppy called Poo chy . He
the woods , and the puppy never bit her again .
. " K itty , look at my new
. <|endoftext|> M olly the puppy was outside playing in the
. He had a little puppy named Spot . Tim my
loved to spoil her little grandson . One day ,
was so proud of her grandson and gave him a big
It 's okay ". The puppy w agged his tail and
?" Emma asked . The puppy just looked at her with
started to cry . The puppy then looked very sorry and
his head . " Oh puppy , why did you bite

MIN ACTIVATIONS
MIN = 544.175

smiles and says " Well done , pirates ! You are
to see them with her own eyes . She asked R
man would appear at the top of the hill . He
to do things on her own . One day , she
on the seat at the top . He looked very rude
make a game of their own . They put
hands and shouted " Well done !" All the
. When the story was done , the little girl felt
. When the jobs were done , the shop keeper said
After he was done , Jack decided to see
with a beautiful flag on top . Jack wanted to get
magazine ? I have my own magazines ." He
to regret what he had done . He apologized to the
food and when they were done , Lily felt proud of
park , knowing they had done something good . <|endoftext|> One
the good deed they had done . <|endoftext|> M andy was
so tired but he was done ! He made the biggest
and they went to the top of a hill . From
surprised by what she had done . She started
Finally , Jack reached the top of the vine . He

INTERVAL 544.174 - 544.174
CONTAINS 0.000%

Token the
Token ID262
Feature activation+5.44e+02

Pos logit contributions
 and+2.618
,+2.475
.+2.414
 the+2.196
 a+2.046
Neg logit contributions
lor-7.341
umm-7.243
 former-7.191
arians-7.183
ocol-7.080
Token farmer
Token ID18739
Feature activation+5.44e+02

Pos logit contributions
 and+6.408
,+6.371
.+6.205
 the+6.178
 a+5.916
Neg logit contributions
lor-3.691
umm-3.653
arians-3.536
 former-3.445
ocol-3.402
Token approached
Token ID10448
Feature activation+5.44e+02

Pos logit contributions
 and+4.286
,+4.255
.+4.124
 the+4.058
 a+3.835
Neg logit contributions
lor-5.732
umm-5.593
arians-5.574
 former-5.505
ocol-5.416
Token,
Token ID11
Feature activation+5.44e+02

Pos logit contributions
 and+1.760
,+1.644
.+1.582
 the+1.404
 a+1.253
Neg logit contributions
lor-8.050
arians-8.010
ocol-7.973
umm-7.955
 former-7.868
Token the
Token ID262
Feature activation+5.44e+02

Pos logit contributions
 and+2.259
,+2.157
.+2.038
 the+1.803
 a+1.581
Neg logit contributions
lor-7.664
umm-7.594
arians-7.539
ocol-7.493
 former-7.476
Token goat
Token ID26917
Feature activation+5.44e+02

Pos logit contributions
,+6.441
 and+6.430
.+6.254
 the+6.219
 a+5.954
Neg logit contributions
lor-3.604
umm-3.581
arians-3.492
 former-3.424
ocol-3.343
Token panicked
Token ID38225
Feature activation+5.44e+02

Pos logit contributions
 and+4.370
,+4.312
.+4.175
 the+4.151
 a+3.917
Neg logit contributions
lor-5.649
arians-5.484
umm-5.440
 former-5.419
osity-5.314
Token and
Token ID290
Feature activation+5.44e+02

Pos logit contributions
 and+1.307
,+1.294
 the+1.162
.+1.082
 a+0.925
Neg logit contributions
lor-8.391
umm-8.348
arians-8.293
ocol-8.221
 former-8.200
Token quickly
Token ID2952
Feature activation+5.44e+02

Pos logit contributions
 and+4.358
,+4.262
.+4.119
 the+3.997
 a+3.798
Neg logit contributions
lor-5.627
arians-5.500
umm-5.496
 former-5.457
rol-5.402
Token ran
Token ID4966
Feature activation+5.44e+02

Pos logit contributions
 and+4.755
,+4.660
 the+4.503
.+4.490
 a+4.263
Neg logit contributions
lor-5.285
umm-5.127
arians-5.096
 former-5.055
ocol-5.012
Token away
Token ID1497
Feature activation+5.44e+02

Pos logit contributions
 and+3.630
,+3.588
 the+3.377
.+3.371
 a+3.110
Neg logit contributions
lor-6.296
umm-6.190
arians-6.158
 former-6.114
rol-6.103
Token was
Token ID373
Feature activation+5.44e+02

Pos logit contributions
 and+3.784
,+3.760
.+3.623
 the+3.539
 a+3.293
Neg logit contributions
lor-6.220
arians-6.052
 former-6.022
umm-6.021
rol-5.923
Token very
Token ID845
Feature activation+5.44e+02

Pos logit contributions
 and+3.808
,+3.764
.+3.613
 the+3.491
 a+3.201
Neg logit contributions
lor-6.192
umm-6.047
 former-6.037
arians-6.020
rol-5.999
Token frustrated
Token ID14718
Feature activation+5.44e+02

Pos logit contributions
 and+4.292
,+4.196
.+4.050
 the+3.995
 a+3.738
Neg logit contributions
lor-5.680
 former-5.657
umm-5.643
ocol-5.559
arians-5.552
Token with
Token ID351
Feature activation+5.44e+02

Pos logit contributions
 and+1.851
,+1.799
 the+1.770
.+1.624
 a+1.521
Neg logit contributions
umm-7.780
lor-7.766
arians-7.710
osity-7.633
 former-7.613
Token the
Token ID262
Feature activation+5.44e+02

Pos logit contributions
 and+2.591
,+2.530
.+2.362
 the+2.138
 a+1.940
Neg logit contributions
lor-7.378
ocol-7.282
arians-7.253
umm-7.233
 former-7.213
Token goat
Token ID26917
Feature activation+5.44e+02

Pos logit contributions
 and+5.224
,+5.194
.+4.987
 the+4.975
 a+4.706
Neg logit contributions
lor-4.878
umm-4.865
arians-4.747
 former-4.739
ocol-4.642
Token delaying
Token ID35221
Feature activation+5.44e+02

Pos logit contributions
 and+2.137
,+2.065
 the+1.966
.+1.886
 a+1.720
Neg logit contributions
lor-7.732
arians-7.624
umm-7.567
 former-7.539
osity-7.495
Token his
Token ID465
Feature activation+5.44e+02

Pos logit contributions
 and+2.905
,+2.845
.+2.642
 the+2.588
 a+2.387
Neg logit contributions
lor-6.976
arians-6.879
ocol-6.849
umm-6.842
 former-6.828
Token bed
Token ID3996
Feature activation+5.44e+02

Pos logit contributions
 and+5.398
,+5.324
 the+5.164
.+5.131
 a+4.884
Neg logit contributions
lor-4.644
umm-4.600
arians-4.519
 former-4.513
ocol-4.430
Tokentime
Token ID2435
Feature activation+5.44e+02

Pos logit contributions
 and+3.704
,+3.647
 the+3.571
.+3.441
 a+3.310
Neg logit contributions
lor-6.273
umm-6.139
arians-6.078
 former-6.045
ides-5.952
Token and
Token ID290
Feature activation+5.44e+02

Pos logit contributions
 and+3.441
,+3.358
 the+3.246
.+3.142
 a+2.998
Neg logit contributions
lor-6.543
umm-6.383
arians-6.354
 former-6.318
ides-6.256
Token were
Token ID547
Feature activation+5.44e+02

Pos logit contributions
 and+4.166
,+4.115
.+3.942
 the+3.862
 a+3.650
Neg logit contributions
lor-5.885
umm-5.733
arians-5.731
rol-5.641
osity-5.624
Token able
Token ID1498
Feature activation+5.44e+02

Pos logit contributions
 and+5.414
,+5.338
.+5.134
 the+5.079
 a+4.840
Neg logit contributions
lor-4.651
umm-4.446
arians-4.442
 former-4.413
rol-4.359
Token to
Token ID284
Feature activation+5.44e+02

Pos logit contributions
 and+1.256
,+1.185
.+0.951
 the+0.911
 a+0.674
Neg logit contributions
lor-8.749
umm-8.643
 former-8.581
osity-8.539
arians-8.538
Token buy
Token ID2822
Feature activation+5.44e+02

Pos logit contributions
 and+4.778
,+4.683
 the+4.505
.+4.495
 a+4.254
Neg logit contributions
lor-5.275
 former-5.193
arians-5.103
umm-5.088
ides-4.997
Token the
Token ID262
Feature activation+5.44e+02

Pos logit contributions
 and+3.307
,+3.247
.+3.022
 the+2.907
 a+2.666
Neg logit contributions
lor-6.661
umm-6.617
 former-6.504
arians-6.499
ocol-6.484
Token puppy
Token ID26188
Feature activation+5.44e+02

Pos logit contributions
 and+5.100
,+5.058
.+4.867
 the+4.843
 a+4.562
Neg logit contributions
lor-4.913
umm-4.910
 former-4.829
arians-4.792
ocol-4.674
Token!
Token ID0
Feature activation+5.44e+02

Pos logit contributions
 and+2.503
,+2.461
 the+2.280
.+2.247
 a+2.003
Neg logit contributions
lor-7.464
umm-7.281
arians-7.275
 former-7.269
rol-7.156
Token<|endoftext|>
Token ID50256
Feature activation+5.44e+02

Pos logit contributions
 and+2.640
,+2.586
.+2.380
 the+2.347
 a+2.132
Neg logit contributions
lor-7.471
umm-7.362
arians-7.260
 former-7.235
osity-7.166
TokenOnce
Token ID7454
Feature activation+5.44e+02

Pos logit contributions
 and+3.397
,+3.349
 the+3.112
.+3.080
 a+2.879
Neg logit contributions
lor-6.682
umm-6.415
arians-6.325
rol-6.313
ocol-6.220
Token upon
Token ID2402
Feature activation+5.44e+02

Pos logit contributions
 and+4.853
,+4.685
.+4.551
 the+4.518
 a+4.381
Neg logit contributions
lor-5.072
umm-4.904
osity-4.882
ocol-4.869
rol-4.825
Token a
Token ID257
Feature activation+5.44e+02

Pos logit contributions
,+0.833
 and+0.771
.+0.499
 the+0.465
 a+0.012
Neg logit contributions
lor-9.128
 former-8.872
umm-8.798
arians-8.772
ocol-8.739
Token time
Token ID640
Feature activation+5.44e+02

Pos logit contributions
 and+3.097
,+3.035
 the+2.925
.+2.875
 a+2.618
Neg logit contributions
lor-6.883
umm-6.800
 former-6.713
arians-6.670
ocol-6.623
Token there
Token ID612
Feature activation+5.44e+02

Pos logit contributions
 and+0.328
.+0.094
 the+0.068
,+0.056
 a-0.177
Neg logit contributions
lor-9.592
umm-9.428
ocol-9.396
rol-9.346
arians-9.345
Token was
Token ID373
Feature activation+5.44e+02

Pos logit contributions
 and+1.747
,+1.600
.+1.492
 the+1.413
 a+1.242
Neg logit contributions
lor-8.157
umm-8.034
rol-7.930
ocol-7.870
arians-7.861
Token a
Token ID257
Feature activation+5.44e+02

Pos logit contributions
 and+0.856
,+0.831
.+0.638
 the+0.556
 a+0.173
Neg logit contributions
lor-9.056
umm-9.005
 former-8.954
ocol-8.918
arians-8.852
Token black
Token ID2042
Feature activation+5.44e+02

Pos logit contributions
 and+3.114
,+3.040
.+2.897
 the+2.877
 a+2.622
Neg logit contributions
umm-6.801
lor-6.790
arians-6.696
 former-6.676
ocol-6.617
Token goat
Token ID26917
Feature activation+5.44e+02

Pos logit contributions
 and+4.935
,+4.885
 the+4.786
.+4.726
 a+4.514
Neg logit contributions
lor-5.007
umm-4.998
arians-4.846
 former-4.796
ocol-4.796
Token who
Token ID508
Feature activation+5.44e+02

Pos logit contributions
 and+3.490
,+3.423
 the+3.303
.+3.154
 a+3.065
Neg logit contributions
lor-6.396
arians-6.186
umm-6.170
 former-6.095
osity-6.073
Token loved
Token ID6151
Feature activation+5.44e+02

Pos logit contributions
 and+4.390
,+4.353
 the+4.142
.+4.103
 a+3.856
Neg logit contributions
lor-5.637
umm-5.455
 former-5.449
arians-5.407
rol-5.349
Token to
Token ID284
Feature activation+5.44e+02

Pos logit contributions
 and+2.308
,+2.288
.+2.061
 the+2.002
 a+1.769
Neg logit contributions
lor-7.668
umm-7.622
 former-7.531
arians-7.521
ocol-7.506
Token delay
Token ID5711
Feature activation+5.44e+02

Pos logit contributions
 and+4.536
,+4.450
.+4.264
 the+4.247
 a+3.976
Neg logit contributions
lor-5.482
umm-5.364
 former-5.330
arians-5.307
ocol-5.305
Token his
Token ID465
Feature activation+5.44e+02

Pos logit contributions
 and+4.317
,+4.267
.+4.031
 the+4.024
 a+3.821
Neg logit contributions
lor-5.553
umm-5.533
 former-5.517
ocol-5.503
arians-5.458
Token he
Token ID339
Feature activation+5.44e+02

Pos logit contributions
 and+2.553
,+2.458
.+2.256
 the+2.144
 a+1.936
Neg logit contributions
lor-7.465
umm-7.264
 former-7.260
arians-7.260
ocol-7.203
Token had
Token ID550
Feature activation+5.44e+02

Pos logit contributions
 and+4.189
,+4.173
 the+3.986
.+3.961
 a+3.710
Neg logit contributions
lor-5.796
arians-5.684
umm-5.654
 former-5.585
ides-5.516
Token to
Token ID284
Feature activation+5.44e+02

Pos logit contributions
 and+2.723
,+2.657
.+2.443
 the+2.406
 a+2.117
Neg logit contributions
lor-7.334
 former-7.178
umm-7.144
arians-7.132
ocol-7.057
Token protect
Token ID1805
Feature activation+5.44e+02

Pos logit contributions
 and+4.559
,+4.464
.+4.271
 the+4.270
 a+4.002
Neg logit contributions
lor-5.483
 former-5.396
arians-5.329
umm-5.326
ocol-5.225
Token the
Token ID262
Feature activation+5.44e+02

Pos logit contributions
 and+3.420
,+3.364
.+3.111
 the+2.997
 a+2.806
Neg logit contributions
lor-6.520
umm-6.470
ocol-6.423
arians-6.420
 former-6.406
Token puppy
Token ID26188
Feature activation+5.44e+02

Pos logit contributions
 and+5.104
,+5.098
 the+4.903
.+4.868
 a+4.613
Neg logit contributions
umm-4.920
lor-4.914
arians-4.831
 former-4.805
ocol-4.652
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
 and+2.496
,+2.462
 the+2.289
.+2.215
 a+2.031
Neg logit contributions
lor-7.431
arians-7.289
umm-7.272
 former-7.271
rol-7.145
Token So
Token ID1406
Feature activation+5.44e+02

Pos logit contributions
 and+3.657
,+3.606
.+3.411
 the+3.390
 a+3.164
Neg logit contributions
lor-6.407
umm-6.351
 former-6.269
arians-6.225
rol-6.142
Token,
Token ID11
Feature activation+5.44e+02

Pos logit contributions
 and+1.569
.+1.378
,+1.322
 the+1.224
 a+1.109
Neg logit contributions
umm-8.238
lor-8.151
 former-8.099
arians-8.032
bek-8.016
Token he
Token ID339
Feature activation+5.44e+02

Pos logit contributions
 and+2.831
,+2.747
.+2.597
 the+2.413
 a+2.226
Neg logit contributions
umm-7.052
 former-6.937
lor-6.933
arians-6.882
ocol-6.834
Token stood
Token ID6204
Feature activation+5.44e+02

Pos logit contributions
 and+4.119
,+4.118
.+3.967
 the+3.915
 a+3.654
Neg logit contributions
lor-5.894
arians-5.749
 former-5.695
umm-5.688
rol-5.575
Token like
Token ID588
Feature activation+5.44e+02

Pos logit contributions
 and+4.571
,+4.498
 the+4.284
.+4.264
 a+4.038
Neg logit contributions
lor-5.395
umm-5.306
ocol-5.285
arians-5.268
ides-5.223
Token Tim
Token ID5045
Feature activation+5.44e+02

Pos logit contributions
 and+3.924
,+3.855
.+3.639
 the+3.556
 a+3.366
Neg logit contributions
lor-6.053
umm-5.994
arians-5.936
ocol-5.898
 former-5.861
Tokenmy
Token ID1820
Feature activation+5.44e+02

Pos logit contributions
 and+3.757
,+3.708
 the+3.573
.+3.450
 a+3.335
Neg logit contributions
lor-6.146
arians-6.020
umm-6.004
rol-5.933
ocol-5.910
Token or
Token ID393
Feature activation+5.44e+02

Pos logit contributions
 and+2.741
,+2.677
 the+2.544
.+2.426
 a+2.302
Neg logit contributions
lor-7.151
arians-7.046
umm-7.019
 former-6.941
ides-6.923
Token the
Token ID262
Feature activation+5.44e+02

Pos logit contributions
 and+4.017
,+3.912
.+3.741
 the+3.631
 a+3.423
Neg logit contributions
lor-5.933
umm-5.887
arians-5.864
 former-5.832
ocol-5.790
Token puppy
Token ID26188
Feature activation+5.44e+02

Pos logit contributions
 and+5.281
,+5.223
.+5.025
 the+5.015
 a+4.730
Neg logit contributions
lor-4.769
umm-4.729
arians-4.636
 former-4.621
ocol-4.546
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
 and+1.676
,+1.597
 the+1.457
.+1.330
 a+1.214
Neg logit contributions
lor-8.255
arians-8.159
umm-8.104
 former-8.025
rol-7.991
Token Tim
Token ID5045
Feature activation+5.44e+02

Pos logit contributions
 and+3.458
,+3.398
.+3.213
 the+3.176
 a+2.946
Neg logit contributions
lor-6.576
umm-6.529
arians-6.458
 former-6.452
osity-6.340
Tokenmy
Token ID1820
Feature activation+5.44e+02

Pos logit contributions
,+4.026
 and+4.012
 the+3.869
.+3.859
 a+3.645
Neg logit contributions
lor-5.904
arians-5.763
umm-5.742
 former-5.717
rol-5.660
Token felt
Token ID2936
Feature activation+5.44e+02

Pos logit contributions
 and+3.719
,+3.704
.+3.586
 the+3.554
 a+3.325
Neg logit contributions
lor-6.178
arians-6.082
 former-5.986
umm-5.945
ides-5.856
Token a
Token ID257
Feature activation+5.44e+02

Pos logit contributions
 and+4.274
,+4.195
.+4.027
 the+3.920
 a+3.633
Neg logit contributions
lor-5.563
 former-5.515
ocol-5.508
umm-5.498
arians-5.411
Token scared
Token ID12008
Feature activation+5.44e+02

Pos logit contributions
 and+4.339
,+4.287
.+4.129
 the+3.982
 a+3.771
Neg logit contributions
lor-5.616
 former-5.538
arians-5.514
umm-5.485
bek-5.405
Token him
Token ID683
Feature activation+5.44e+02

Pos logit contributions
 and+3.170
,+3.076
.+2.857
 the+2.807
 a+2.631
Neg logit contributions
umm-6.765
ocol-6.641
lor-6.638
arians-6.567
 former-6.540
Token away
Token ID1497
Feature activation+5.44e+02

Pos logit contributions
 and+2.005
,+1.918
 the+1.814
.+1.663
 a+1.539
Neg logit contributions
lor-7.932
arians-7.748
 former-7.698
osity-7.666
ides-7.664
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
 and+1.348
,+1.244
 the+1.101
.+0.994
 a+0.864
Neg logit contributions
lor-8.532
umm-8.506
 former-8.427
arians-8.407
osity-8.326
Token The
Token ID383
Feature activation+5.44e+02

Pos logit contributions
 and+3.487
,+3.425
.+3.240
 the+3.204
 a+2.985
Neg logit contributions
lor-6.549
umm-6.505
 former-6.413
arians-6.406
osity-6.308
Token puppy
Token ID26188
Feature activation+5.44e+02

Pos logit contributions
 and+4.607
,+4.599
.+4.401
 the+4.362
 a+4.055
Neg logit contributions
lor-5.425
umm-5.401
arians-5.330
 former-5.280
ocol-5.157
Token's
Token ID338
Feature activation+5.44e+02

Pos logit contributions
 and+4.043
,+4.003
.+3.891
 the+3.828
 a+3.604
Neg logit contributions
lor-5.913
arians-5.792
umm-5.706
 former-5.690
rol-5.590
Token family
Token ID1641
Feature activation+5.44e+02

Pos logit contributions
 and+5.260
,+5.239
 the+5.056
.+5.051
 a+4.734
Neg logit contributions
lor-4.775
umm-4.690
 former-4.682
arians-4.643
osity-4.491
Token was
Token ID373
Feature activation+5.44e+02

Pos logit contributions
 and+3.520
,+3.469
.+3.353
 the+3.298
 a+3.115
Neg logit contributions
lor-6.426
arians-6.294
umm-6.254
rol-6.159
 former-6.143
Token so
Token ID523
Feature activation+5.44e+02

Pos logit contributions
 and+3.934
,+3.883
.+3.724
 the+3.626
 a+3.346
Neg logit contributions
lor-6.068
 former-5.908
arians-5.871
umm-5.840
rol-5.828
Token grateful
Token ID14066
Feature activation+5.44e+02

Pos logit contributions
 and+4.267
,+4.199
.+4.032
 the+3.994
 a+3.733
Neg logit contributions
lor-5.723
 former-5.632
umm-5.625
arians-5.576
osity-5.556
Token,
Token ID11
Feature activation+5.44e+02

Pos logit contributions
 and+0.293
.+0.094
,+0.026
 the-0.016
 a-0.133
Neg logit contributions
lor-9.640
umm-9.605
arians-9.492
ocol-9.488
 former-9.384
Token Tim
Token ID5045
Feature activation+5.44e+02

Pos logit contributions
 and+3.095
,+2.968
.+2.870
 the+2.640
 a+2.537
Neg logit contributions
umm-6.829
 former-6.664
arians-6.613
ocol-6.611
osity-6.577
Tokenmy
Token ID1820
Feature activation+5.44e+02

Pos logit contributions
,+4.013
 and+4.011
.+3.868
 the+3.821
 a+3.622
Neg logit contributions
lor-5.974
umm-5.859
arians-5.774
 former-5.747
osity-5.706
Token and
Token ID290
Feature activation+5.44e+02

Pos logit contributions
 and+3.641
,+3.614
.+3.527
 the+3.455
 a+3.259
Neg logit contributions
lor-6.298
umm-6.128
arians-6.091
 former-6.042
osity-5.985
Token the
Token ID262
Feature activation+5.44e+02

Pos logit contributions
 and+2.706
,+2.626
.+2.480
 the+2.262
 a+2.163
Neg logit contributions
umm-7.313
lor-7.239
 former-7.208
arians-7.174
ocol-7.149
Token puppy
Token ID26188
Feature activation+5.44e+02

Pos logit contributions
 and+5.027
,+5.005
.+4.812
 the+4.760
 a+4.448
Neg logit contributions
lor-5.033
umm-4.997
arians-4.915
 former-4.880
osity-4.733
Token became
Token ID2627
Feature activation+5.44e+02

Pos logit contributions
 and+3.868
,+3.802
.+3.670
 the+3.634
 a+3.429
Neg logit contributions
lor-6.052
arians-5.977
umm-5.956
 former-5.909
osity-5.828
Token best
Token ID1266
Feature activation+5.44e+02

Pos logit contributions
 and+3.673
,+3.582
.+3.431
 the+3.290
 a+3.063
Neg logit contributions
umm-6.265
lor-6.248
 former-6.202
arians-6.190
ocol-6.078
Token friends
Token ID2460
Feature activation+5.44e+02

Pos logit contributions
 and+3.308
,+3.258
 the+3.132
.+3.099
 a+2.879
Neg logit contributions
umm-6.679
lor-6.672
arians-6.589
 former-6.534
ocol-6.493
Token and
Token ID290
Feature activation+5.44e+02

Pos logit contributions
 and+2.047
,+2.017
 the+1.918
.+1.811
 a+1.690
Neg logit contributions
lor-7.804
umm-7.638
arians-7.622
 former-7.484
osity-7.461
Token went
Token ID1816
Feature activation+5.44e+02

Pos logit contributions
 and+4.002
,+3.933
.+3.822
 the+3.655
 a+3.472
Neg logit contributions
umm-5.858
 former-5.845
lor-5.818
arians-5.803
osity-5.755
Token she
Token ID673
Feature activation+5.44e+02

Pos logit contributions
 and+3.223
,+3.125
.+2.989
 the+2.876
 a+2.695
Neg logit contributions
umm-6.771
lor-6.749
 former-6.624
arians-6.600
osity-6.547
Token didn
Token ID1422
Feature activation+5.44e+02

Pos logit contributions
 and+4.708
,+4.668
.+4.503
 the+4.460
 a+4.239
Neg logit contributions
lor-5.302
arians-5.172
umm-5.148
 former-5.121
rol-5.033
Token't
Token ID470
Feature activation+5.44e+02

Pos logit contributions
 and+4.535
,+4.475
.+4.277
 the+4.275
 a+3.993
Neg logit contributions
lor-5.506
umm-5.353
arians-5.315
 former-5.243
rol-5.205
Token notice
Token ID4003
Feature activation+5.44e+02

Pos logit contributions
 and+4.954
,+4.914
 the+4.707
.+4.683
 a+4.471
Neg logit contributions
lor-4.984
umm-4.970
arians-4.904
 former-4.898
ocol-4.832
Token her
Token ID607
Feature activation+5.44e+02

Pos logit contributions
 and+3.111
,+3.075
.+2.838
 the+2.808
 a+2.599
Neg logit contributions
lor-6.790
umm-6.769
 former-6.692
arians-6.673
osity-6.626
Token puppy
Token ID26188
Feature activation+5.44e+02

Pos logit contributions
 and+5.105
,+5.043
 the+4.858
.+4.844
 a+4.610
Neg logit contributions
lor-4.880
umm-4.874
 former-4.833
arians-4.817
ocol-4.689
Token,
Token ID11
Feature activation+5.44e+02

Pos logit contributions
 and+4.207
,+4.133
 the+3.985
.+3.941
 a+3.747
Neg logit contributions
lor-5.781
arians-5.682
umm-5.648
 former-5.562
osity-5.515
Token Am
Token ID1703
Feature activation+5.44e+02

Pos logit contributions
 and+3.775
,+3.767
.+3.601
 the+3.525
 a+3.297
Neg logit contributions
lor-6.156
arians-6.021
umm-6.019
 former-5.970
osity-5.928
Tokenigo
Token ID14031
Feature activation+5.44e+02

Pos logit contributions
 and+6.856
,+6.767
 the+6.623
.+6.583
 a+6.387
Neg logit contributions
lor-3.322
umm-3.207
arians-3.145
ocol-3.082
 former-3.053
Token,
Token ID11
Feature activation+5.44e+02

Pos logit contributions
 and+5.007
,+4.945
 the+4.814
.+4.761
 a+4.581
Neg logit contributions
lor-5.006
arians-4.855
umm-4.834
ocol-4.724
 former-4.708
Token rush
Token ID10484
Feature activation+5.44e+02

Pos logit contributions
 and+4.477
,+4.442
.+4.306
 the+4.231
 a+4.011
Neg logit contributions
lor-5.502
arians-5.357
umm-5.342
osity-5.291
 former-5.253
Token a
Token ID257
Feature activation+5.44e+02

Pos logit contributions
,+0.858
 and+0.792
.+0.532
 the+0.476
 a+0.012
Neg logit contributions
lor-9.120
 former-8.914
umm-8.848
ocol-8.770
arians-8.767
Token time
Token ID640
Feature activation+5.44e+02

Pos logit contributions
 and+3.092
,+3.030
 the+2.913
.+2.884
 a+2.621
Neg logit contributions
lor-6.906
umm-6.846
 former-6.725
arians-6.700
ocol-6.658
Token there
Token ID612
Feature activation+5.44e+02

Pos logit contributions
 and+0.284
.+0.064
 the+0.047
,+0.015
 a-0.205
Neg logit contributions
lor-9.646
umm-9.507
ocol-9.426
osity-9.408
arians-9.399
Token was
Token ID373
Feature activation+5.44e+02

Pos logit contributions
 and+1.716
,+1.567
.+1.483
 the+1.393
 a+1.227
Neg logit contributions
lor-8.202
umm-8.057
rol-7.948
arians-7.855
ocol-7.848
Token a
Token ID257
Feature activation+5.44e+02

Pos logit contributions
 and+1.154
,+1.128
.+0.963
 the+0.877
 a+0.504
Neg logit contributions
lor-8.783
umm-8.733
 former-8.643
ocol-8.615
arians-8.561
Token puppy
Token ID26188
Feature activation+5.44e+02

Pos logit contributions
 and+2.918
,+2.854
.+2.719
 the+2.695
 a+2.434
Neg logit contributions
umm-6.992
lor-6.972
arians-6.875
 former-6.844
ocol-6.791
Token called
Token ID1444
Feature activation+5.44e+02

Pos logit contributions
 and+4.039
,+3.948
 the+3.867
.+3.736
 a+3.624
Neg logit contributions
lor-5.797
arians-5.643
umm-5.585
 former-5.482
ocol-5.465
Token Poo
Token ID41778
Feature activation+5.44e+02

Pos logit contributions
 and+4.515
,+4.369
.+4.228
 the+4.153
 a+3.887
Neg logit contributions
lor-5.577
umm-5.435
arians-5.414
ocol-5.366
 former-5.298
Tokenchy
Token ID29658
Feature activation+5.44e+02

Pos logit contributions
 and+6.931
,+6.883
 the+6.726
.+6.628
 a+6.426
Neg logit contributions
lor-3.161
umm-3.039
arians-2.972
 former-2.961
osity-2.894
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
 and+4.132
,+4.076
 the+3.949
.+3.846
 a+3.617
Neg logit contributions
lor-5.824
umm-5.738
arians-5.634
 former-5.618
ides-5.502
Token He
Token ID679
Feature activation+5.44e+02

Pos logit contributions
 and+5.457
,+5.364
.+5.210
 the+5.177
 a+4.927
Neg logit contributions
lor-4.674
umm-4.590
arians-4.454
 former-4.449
ocol-4.405
Token the
Token ID262
Feature activation+5.44e+02

Pos logit contributions
 and+0.739
,+0.696
.+0.505
 the+0.232
 a+0.060
Neg logit contributions
 former-9.117
lor-9.066
ocol-9.009
umm-8.997
arians-8.997
Token woods
Token ID16479
Feature activation+5.44e+02

Pos logit contributions
 and+4.513
,+4.485
.+4.300
 the+4.231
 a+3.986
Neg logit contributions
lor-5.552
umm-5.544
 former-5.450
arians-5.341
ocol-5.281
Token,
Token ID11
Feature activation+5.44e+02

Pos logit contributions
 and+2.337
,+2.274
 the+2.183
.+2.122
 a+1.948
Neg logit contributions
lor-7.417
umm-7.377
 former-7.368
arians-7.351
rol-7.221
Token and
Token ID290
Feature activation+5.44e+02

Pos logit contributions
,+4.573
 and+4.542
.+4.478
 the+4.337
 a+4.136
Neg logit contributions
lor-5.313
umm-5.270
arians-5.203
 former-5.197
osity-5.149
Token the
Token ID262
Feature activation+5.44e+02

Pos logit contributions
 and+4.309
,+4.237
.+4.151
 the+3.934
 a+3.781
Neg logit contributions
lor-5.640
umm-5.638
 former-5.592
arians-5.536
osity-5.522
Token puppy
Token ID26188
Feature activation+5.44e+02

Pos logit contributions
 and+5.297
,+5.272
.+5.078
 the+5.049
 a+4.771
Neg logit contributions
umm-4.759
lor-4.726
arians-4.632
 former-4.625
ocol-4.489
Token never
Token ID1239
Feature activation+5.44e+02

Pos logit contributions
 and+3.473
,+3.401
 the+3.255
.+3.248
 a+3.047
Neg logit contributions
lor-6.478
arians-6.357
umm-6.328
 former-6.281
rol-6.185
Token bit
Token ID1643
Feature activation+5.44e+02

Pos logit contributions
 and+5.020
,+4.908
.+4.753
 the+4.729
 a+4.513
Neg logit contributions
lor-4.943
umm-4.936
 former-4.831
osity-4.825
arians-4.797
Token her
Token ID607
Feature activation+5.44e+02

Pos logit contributions
 and+4.589
,+4.543
.+4.329
 the+4.275
 a+4.099
Neg logit contributions
umm-5.393
lor-5.326
 former-5.253
ocol-5.235
rol-5.231
Token again
Token ID757
Feature activation+5.44e+02

Pos logit contributions
 and+4.522
,+4.493
 the+4.349
.+4.263
 a+4.070
Neg logit contributions
lor-5.392
 former-5.367
umm-5.365
arians-5.314
ocol-5.131
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
 and+1.306
,+1.245
 the+1.123
.+0.963
 a+0.900
Neg logit contributions
lor-8.551
 former-8.527
umm-8.458
arians-8.380
ocol-8.297
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
 and+0.697
,+0.571
 the+0.468
.+0.325
 a+0.290
Neg logit contributions
lor-9.186
arians-9.118
osity-8.982
umm-8.942
ocol-8.941
Token
Token ID198
Feature activation+5.44e+02

Pos logit contributions
 and+2.668
,+2.576
.+2.439
 the+2.424
 a+2.197
Neg logit contributions
lor-7.321
umm-7.170
arians-7.154
rol-7.053
 former-7.031
Token
Token ID198
Feature activation+5.44e+02

Pos logit contributions
 and+1.590
,+1.531
 the+1.350
.+1.324
 a+1.114
Neg logit contributions
lor-8.450
arians-8.105
umm-8.089
rol-8.081
ides-8.035
Token"
Token ID1
Feature activation+5.44e+02

Pos logit contributions
 and+6.286
,+6.222
.+6.066
 the+6.054
 a+5.797
Neg logit contributions
lor-3.808
umm-3.648
arians-3.641
 former-3.530
rol-3.496
TokenK
Token ID42
Feature activation+5.44e+02

Pos logit contributions
 and+6.374
,+6.319
.+6.139
 the+6.105
 a+5.853
Neg logit contributions
lor-3.771
umm-3.691
arians-3.599
 former-3.540
ocol-3.502
Tokenitty
Token ID9760
Feature activation+5.44e+02

Pos logit contributions
 and+7.905
,+7.831
 the+7.686
.+7.671
 a+7.437
Neg logit contributions
lor-2.240
umm-2.140
arians-2.055
ocol-1.999
 former-1.965
Token,
Token ID11
Feature activation+5.44e+02

Pos logit contributions
 and+1.729
,+1.573
 the+1.502
.+1.467
 a+1.268
Neg logit contributions
lor-8.225
umm-8.116
arians-8.072
 former-7.964
ocol-7.955
Token look
Token ID804
Feature activation+5.44e+02

Pos logit contributions
 and+3.571
,+3.502
.+3.394
 the+3.263
 a+3.013
Neg logit contributions
lor-6.479
umm-6.344
arians-6.277
ocol-6.221
osity-6.205
Token at
Token ID379
Feature activation+5.44e+02

Pos logit contributions
 and+2.372
,+2.281
.+2.134
 the+2.108
 a+1.877
Neg logit contributions
lor-7.486
ocol-7.439
umm-7.432
arians-7.334
rol-7.332
Token my
Token ID616
Feature activation+5.44e+02

Pos logit contributions
 and+3.652
,+3.580
.+3.429
 the+3.228
 a+3.045
Neg logit contributions
lor-6.415
arians-6.271
umm-6.257
ocol-6.171
 former-6.145
Token new
Token ID649
Feature activation+5.44e+02

Pos logit contributions
 and+5.485
,+5.446
 the+5.265
.+5.257
 a+4.968
Neg logit contributions
lor-4.491
umm-4.463
 former-4.376
arians-4.356
ocol-4.231
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
 and+2.176
,+2.124
 the+2.010
.+1.918
 a+1.808
Neg logit contributions
lor-7.746
arians-7.613
umm-7.606
 former-7.468
osity-7.459
Token<|endoftext|>
Token ID50256
Feature activation+5.44e+02

Pos logit contributions
 and+4.231
,+4.191
.+3.987
 the+3.961
 a+3.733
Neg logit contributions
lor-5.877
umm-5.807
 former-5.651
arians-5.650
osity-5.586
TokenM
Token ID44
Feature activation+5.44e+02

Pos logit contributions
 and+4.412
,+4.360
 the+4.140
.+4.100
 a+3.898
Neg logit contributions
lor-5.630
umm-5.439
rol-5.303
arians-5.302
ocol-5.234
Tokenolly
Token ID5098
Feature activation+5.44e+02

Pos logit contributions
 and+6.805
,+6.703
 the+6.540
.+6.536
 a+6.282
Neg logit contributions
lor-3.349
umm-3.280
arians-3.140
 former-3.140
ocol-3.109
Token the
Token ID262
Feature activation+5.44e+02

Pos logit contributions
 and+2.345
,+2.314
 the+2.185
.+2.180
 a+1.957
Neg logit contributions
lor-7.546
umm-7.423
arians-7.374
 former-7.304
osity-7.242
Token puppy
Token ID26188
Feature activation+5.44e+02

Pos logit contributions
 and+4.829
,+4.779
.+4.581
 the+4.553
 a+4.313
Neg logit contributions
umm-5.232
lor-5.193
arians-5.083
 former-5.045
ocol-5.036
Token was
Token ID373
Feature activation+5.44e+02

Pos logit contributions
 and+3.042
,+3.001
.+2.864
 the+2.851
 a+2.610
Neg logit contributions
lor-6.868
arians-6.712
umm-6.697
 former-6.646
rol-6.545
Token outside
Token ID2354
Feature activation+5.44e+02

Pos logit contributions
 and+3.572
,+3.514
.+3.381
 the+3.295
 a+2.979
Neg logit contributions
lor-6.368
umm-6.304
 former-6.299
rol-6.210
arians-6.191
Token playing
Token ID2712
Feature activation+5.44e+02

Pos logit contributions
 and+3.017
,+2.946
.+2.786
 the+2.774
 a+2.561
Neg logit contributions
umm-6.774
 former-6.704
lor-6.678
ocol-6.629
osity-6.597
Token in
Token ID287
Feature activation+5.44e+02

Pos logit contributions
 and+1.715
,+1.679
 the+1.500
.+1.440
 a+1.225
Neg logit contributions
lor-8.019
umm-7.952
 former-7.905
arians-7.894
ocol-7.847
Token the
Token ID262
Feature activation+5.44e+02

Pos logit contributions
 and+0.918
,+0.868
.+0.663
 the+0.457
 a+0.279
Neg logit contributions
lor-8.856
umm-8.838
ocol-8.808
 former-8.777
arians-8.698
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
 and+0.777
,+0.765
 the+0.606
.+0.432
 a+0.334
Neg logit contributions
lor-9.012
umm-8.898
arians-8.825
 former-8.807
osity-8.795
Token He
Token ID679
Feature activation+5.44e+02

Pos logit contributions
 and+3.083
,+2.984
.+2.833
 the+2.823
 a+2.600
Neg logit contributions
umm-6.998
lor-6.835
 former-6.763
arians-6.740
ocol-6.740
Token had
Token ID550
Feature activation+5.44e+02

Pos logit contributions
 and+3.308
,+3.264
.+3.081
 the+3.070
 a+2.766
Neg logit contributions
lor-6.646
umm-6.485
arians-6.444
 former-6.425
osity-6.339
Token a
Token ID257
Feature activation+5.44e+02

Pos logit contributions
,+1.028
 and+1.020
.+0.802
 the+0.654
 a+0.251
Neg logit contributions
lor-9.021
umm-8.882
 former-8.835
arians-8.739
rol-8.709
Token little
Token ID1310
Feature activation+5.44e+02

Pos logit contributions
 and+4.183
,+4.100
.+3.969
 the+3.948
 a+3.633
Neg logit contributions
lor-5.824
umm-5.801
 former-5.733
arians-5.680
ocol-5.613
Token puppy
Token ID26188
Feature activation+5.44e+02

Pos logit contributions
 and+4.763
,+4.679
 the+4.580
.+4.552
 a+4.271
Neg logit contributions
umm-5.205
lor-5.143
 former-5.085
ocol-5.051
arians-5.044
Token named
Token ID3706
Feature activation+5.44e+02

Pos logit contributions
 and+4.065
,+3.978
 the+3.901
.+3.813
 a+3.603
Neg logit contributions
lor-5.830
arians-5.710
umm-5.654
 former-5.578
ocol-5.545
Token Spot
Token ID15899
Feature activation+5.44e+02

Pos logit contributions
 and+4.453
,+4.351
.+4.155
 the+4.081
 a+3.818
Neg logit contributions
lor-5.609
umm-5.509
arians-5.503
 former-5.415
osity-5.341
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
 and+1.736
,+1.646
 the+1.507
.+1.391
 a+1.211
Neg logit contributions
lor-8.157
arians-8.037
 former-7.910
umm-7.903
osity-7.874
Token Tim
Token ID5045
Feature activation+5.44e+02

Pos logit contributions
 and+3.852
,+3.769
.+3.623
 the+3.550
 a+3.348
Neg logit contributions
lor-6.128
umm-6.124
arians-5.998
 former-5.979
ocol-5.936
Tokenmy
Token ID1820
Feature activation+5.44e+02

Pos logit contributions
,+3.938
 and+3.925
 the+3.769
.+3.759
 a+3.521
Neg logit contributions
lor-5.938
arians-5.761
umm-5.742
 former-5.724
rol-5.655
Token loved
Token ID6151
Feature activation+5.44e+02

Pos logit contributions
 and+4.200
,+4.143
.+3.921
 the+3.920
 a+3.646
Neg logit contributions
lor-5.799
umm-5.678
 former-5.621
arians-5.617
osity-5.521
Token to
Token ID284
Feature activation+5.44e+02

Pos logit contributions
 and+3.631
,+3.604
.+3.374
 the+3.315
 a+3.082
Neg logit contributions
umm-6.355
lor-6.322
 former-6.231
arians-6.216
ocol-6.204
Token spoil
Token ID20851
Feature activation+5.44e+02

Pos logit contributions
 and+4.916
,+4.821
.+4.642
 the+4.600
 a+4.334
Neg logit contributions
lor-5.101
umm-5.014
 former-4.982
arians-4.938
ocol-4.935
Token her
Token ID607
Feature activation+5.44e+02

Pos logit contributions
 and+3.598
,+3.511
.+3.254
 the+3.236
 a+3.041
Neg logit contributions
umm-6.340
lor-6.337
ocol-6.307
 former-6.265
arians-6.226
Token little
Token ID1310
Feature activation+5.44e+02

Pos logit contributions
 and+5.208
,+5.150
 the+4.993
.+4.921
 a+4.714
Neg logit contributions
lor-4.774
umm-4.764
 former-4.721
arians-4.700
ocol-4.572
Token grandson
Token ID31845
Feature activation+5.44e+02

Pos logit contributions
 and+4.918
,+4.832
 the+4.723
.+4.665
 a+4.462
Neg logit contributions
umm-5.078
lor-4.995
arians-4.939
 former-4.920
ocol-4.857
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
 and+2.091
,+2.021
 the+1.866
.+1.779
 a+1.583
Neg logit contributions
lor-7.869
umm-7.765
arians-7.749
 former-7.673
ocol-7.623
Token
Token ID198
Feature activation+5.44e+02

Pos logit contributions
 and+3.663
,+3.594
.+3.401
 the+3.373
 a+3.153
Neg logit contributions
lor-6.411
umm-6.386
 former-6.258
arians-6.247
osity-6.184
TokenOne
Token ID3198
Feature activation+5.44e+02

Pos logit contributions
 and+1.597
,+1.552
 the+1.336
.+1.319
 a+1.103
Neg logit contributions
lor-8.459
umm-8.181
arians-8.135
 former-8.075
rol-8.074
Token day
Token ID1110
Feature activation+5.44e+02

Pos logit contributions
 and+3.611
,+3.573
.+3.387
 the+3.343
 a+3.059
Neg logit contributions
umm-6.283
lor-6.211
 former-6.088
ocol-6.064
osity-6.025
Token,
Token ID11
Feature activation+5.44e+02

Pos logit contributions
 and+0.295
.+0.040
,+0.002
 the-0.048
 a-0.248
Neg logit contributions
lor-9.587
umm-9.528
ocol-9.520
arians-9.417
osity-9.397
Token was
Token ID373
Feature activation+5.44e+02

Pos logit contributions
 and+4.695
,+4.676
.+4.546
 the+4.518
 a+4.295
Neg logit contributions
lor-5.278
umm-5.171
arians-5.105
 former-5.065
osity-4.986
Token so
Token ID523
Feature activation+5.44e+02

Pos logit contributions
 and+3.878
,+3.809
.+3.671
 the+3.530
 a+3.267
Neg logit contributions
lor-6.166
umm-6.005
 former-5.974
rol-5.929
arians-5.897
Token proud
Token ID6613
Feature activation+5.44e+02

Pos logit contributions
 and+4.939
,+4.837
.+4.717
 the+4.609
 a+4.362
Neg logit contributions
lor-5.101
umm-5.082
 former-4.949
osity-4.906
ocol-4.904
Token of
Token ID286
Feature activation+5.44e+02

Pos logit contributions
 and+2.380
,+2.346
 the+2.223
.+2.150
 a+2.028
Neg logit contributions
umm-7.488
lor-7.446
arians-7.249
ocol-7.241
osity-7.238
Token her
Token ID607
Feature activation+5.44e+02

Pos logit contributions
 and+3.827
,+3.714
.+3.525
 the+3.375
 a+3.206
Neg logit contributions
lor-6.241
ocol-6.171
umm-6.145
arians-6.098
 former-6.095
Token grandson
Token ID31845
Feature activation+5.44e+02

Pos logit contributions
 and+5.468
,+5.397
 the+5.252
.+5.223
 a+4.978
Neg logit contributions
umm-4.554
lor-4.541
arians-4.445
 former-4.427
ocol-4.367
Token and
Token ID290
Feature activation+5.44e+02

Pos logit contributions
 and+3.009
,+2.955
 the+2.788
.+2.776
 a+2.544
Neg logit contributions
lor-6.932
umm-6.815
arians-6.772
 former-6.648
osity-6.637
Token gave
Token ID2921
Feature activation+5.44e+02

Pos logit contributions
 and+4.314
,+4.227
.+4.097
 the+3.982
 a+3.767
Neg logit contributions
lor-5.657
umm-5.605
arians-5.510
 former-5.509
osity-5.462
Token him
Token ID683
Feature activation+5.44e+02

Pos logit contributions
 and+3.233
,+3.169
.+2.955
 the+2.787
 a+2.601
Neg logit contributions
lor-6.844
umm-6.747
ocol-6.662
arians-6.620
 former-6.568
Token a
Token ID257
Feature activation+5.44e+02

Pos logit contributions
 and+2.734
,+2.677
.+2.473
 the+2.399
 a+2.131
Neg logit contributions
lor-7.311
umm-7.209
 former-7.109
arians-7.087
ocol-7.075
Token big
Token ID1263
Feature activation+5.44e+02

Pos logit contributions
 and+3.961
,+3.893
.+3.739
 the+3.696
 a+3.421
Neg logit contributions
lor-6.092
umm-6.003
 former-5.872
arians-5.843
ocol-5.816
TokenIt
Token ID1026
Feature activation+5.44e+02

Pos logit contributions
 and+5.820
,+5.780
.+5.584
 the+5.557
 a+5.305
Neg logit contributions
lor-4.338
umm-4.253
arians-4.154
 former-4.119
ocol-4.056
Token's
Token ID338
Feature activation+5.44e+02

Pos logit contributions
 and+3.092
,+3.002
.+2.850
 the+2.829
 a+2.565
Neg logit contributions
lor-6.968
umm-6.837
arians-6.804
 former-6.745
osity-6.726
Token okay
Token ID8788
Feature activation+5.44e+02

Pos logit contributions
 and+5.894
,+5.810
.+5.638
 the+5.573
 a+5.271
Neg logit contributions
lor-4.150
 former-4.110
umm-4.100
arians-4.021
ocol-3.982
Token".
Token ID1911
Feature activation+5.44e+02

Pos logit contributions
 and+1.799
,+1.645
 the+1.568
.+1.487
 a+1.303
Neg logit contributions
lor-8.114
umm-8.057
arians-8.018
 former-7.938
osity-7.913
Token The
Token ID383
Feature activation+5.44e+02

Pos logit contributions
 and+4.785
,+4.747
.+4.566
 the+4.551
 a+4.297
Neg logit contributions
lor-5.216
umm-5.163
arians-5.103
 former-5.078
osity-4.987
Token puppy
Token ID26188
Feature activation+5.44e+02

Pos logit contributions
 and+5.345
,+5.316
.+5.127
 the+5.107
 a+4.814
Neg logit contributions
umm-4.699
lor-4.692
arians-4.591
 former-4.555
ocol-4.438
Token w
Token ID266
Feature activation+5.44e+02

Pos logit contributions
 and+4.183
,+4.140
.+4.008
 the+3.984
 a+3.736
Neg logit contributions
lor-5.775
arians-5.635
umm-5.633
 former-5.593
osity-5.470
Tokenagged
Token ID14655
Feature activation+5.44e+02

Pos logit contributions
 and+8.457
,+8.388
.+8.212
 the+8.190
 a+7.943
Neg logit contributions
lor-1.656
umm-1.578
arians-1.485
 former-1.442
ocol-1.405
Token his
Token ID465
Feature activation+5.44e+02

Pos logit contributions
 and+3.207
,+3.180
.+3.053
 the+2.949
 a+2.696
Neg logit contributions
lor-6.667
umm-6.598
 former-6.579
osity-6.508
arians-6.487
Token tail
Token ID7894
Feature activation+5.44e+02

Pos logit contributions
 and+5.724
,+5.708
.+5.541
 the+5.533
 a+5.240
Neg logit contributions
lor-4.253
umm-4.188
 former-4.157
arians-4.129
ocol-4.046
Token and
Token ID290
Feature activation+5.44e+02

Pos logit contributions
 and+1.364
,+1.346
 the+1.287
.+1.199
 a+0.968
Neg logit contributions
lor-8.413
umm-8.343
 former-8.260
arians-8.240
osity-8.168
Token?"
Token ID1701
Feature activation+5.44e+02

Pos logit contributions
 and+3.984
,+3.912
.+3.790
 the+3.787
 a+3.525
Neg logit contributions
lor-6.004
umm-5.829
arians-5.792
 former-5.787
ocol-5.669
Token Emma
Token ID18966
Feature activation+5.44e+02

Pos logit contributions
 and+4.588
,+4.520
.+4.341
 the+4.235
 a+4.062
Neg logit contributions
lor-5.363
umm-5.234
arians-5.189
 former-5.162
ocol-5.074
Token asked
Token ID1965
Feature activation+5.44e+02

Pos logit contributions
 and+3.543
,+3.472
.+3.330
 the+3.325
 a+3.119
Neg logit contributions
lor-6.373
arians-6.235
umm-6.148
 former-6.119
osity-6.045
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
 and+1.703
,+1.559
 the+1.397
.+1.340
 a+1.193
Neg logit contributions
lor-8.108
arians-8.028
umm-7.974
 former-7.931
ocol-7.928
Token The
Token ID383
Feature activation+5.44e+02

Pos logit contributions
 and+3.700
,+3.637
.+3.451
 the+3.407
 a+3.180
Neg logit contributions
lor-6.330
umm-6.221
arians-6.184
 former-6.137
osity-6.076
Token puppy
Token ID26188
Feature activation+5.44e+02

Pos logit contributions
 and+5.369
,+5.342
.+5.141
 the+5.116
 a+4.817
Neg logit contributions
lor-4.688
umm-4.670
arians-4.569
 former-4.549
ocol-4.406
Token just
Token ID655
Feature activation+5.44e+02

Pos logit contributions
 and+4.372
,+4.308
.+4.188
 the+4.161
 a+3.914
Neg logit contributions
lor-5.588
arians-5.440
umm-5.412
 former-5.385
osity-5.271
Token looked
Token ID3114
Feature activation+5.44e+02

Pos logit contributions
 and+5.342
,+5.285
.+5.104
 the+5.100
 a+4.828
Neg logit contributions
lor-4.587
umm-4.531
 former-4.469
arians-4.456
osity-4.423
Token at
Token ID379
Feature activation+5.44e+02

Pos logit contributions
 and+3.569
,+3.519
.+3.360
 the+3.360
 a+3.105
Neg logit contributions
lor-6.277
umm-6.221
 former-6.210
ocol-6.158
rol-6.110
Token her
Token ID607
Feature activation+5.44e+02

Pos logit contributions
 and+4.647
,+4.609
.+4.409
 the+4.266
 a+4.084
Neg logit contributions
lor-5.291
umm-5.204
arians-5.147
 former-5.109
ocol-5.063
Token with
Token ID351
Feature activation+5.44e+02

Pos logit contributions
 and+2.851
,+2.848
 the+2.691
.+2.639
 a+2.448
Neg logit contributions
lor-7.039
arians-6.942
umm-6.937
 former-6.937
ocol-6.777
Token started
Token ID2067
Feature activation+5.44e+02

Pos logit contributions
 and+4.579
,+4.507
.+4.353
 the+4.254
 a+4.040
Neg logit contributions
lor-5.379
umm-5.269
 former-5.251
arians-5.243
ocol-5.151
Token to
Token ID284
Feature activation+5.44e+02

Pos logit contributions
 and+3.601
,+3.539
.+3.323
 the+3.299
 a+3.063
Neg logit contributions
lor-6.326
 former-6.230
umm-6.194
arians-6.162
ocol-6.142
Token cry
Token ID3960
Feature activation+5.44e+02

Pos logit contributions
 and+5.459
,+5.369
.+5.190
 the+5.169
 a+4.931
Neg logit contributions
lor-4.521
 former-4.445
umm-4.443
arians-4.408
ocol-4.330
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
 and+0.794
,+0.676
 the+0.636
.+0.418
 a+0.352
Neg logit contributions
lor-9.038
umm-9.003
 former-8.970
arians-8.909
osity-8.870
Token The
Token ID383
Feature activation+5.44e+02

Pos logit contributions
 and+4.609
,+4.538
.+4.362
 the+4.336
 a+4.087
Neg logit contributions
lor-5.462
umm-5.380
 former-5.312
arians-5.299
osity-5.208
Token puppy
Token ID26188
Feature activation+5.44e+02

Pos logit contributions
 and+5.071
,+5.026
.+4.823
 the+4.807
 a+4.526
Neg logit contributions
lor-4.976
umm-4.961
 former-4.865
arians-4.863
ocol-4.795
Token then
Token ID788
Feature activation+5.44e+02

Pos logit contributions
 and+3.905
,+3.860
.+3.733
 the+3.697
 a+3.465
Neg logit contributions
lor-6.057
arians-5.906
umm-5.862
 former-5.829
rol-5.737
Token looked
Token ID3114
Feature activation+5.44e+02

Pos logit contributions
 and+4.651
,+4.580
.+4.442
 the+4.372
 a+4.136
Neg logit contributions
lor-5.338
umm-5.180
arians-5.152
 former-5.147
ocol-5.081
Token very
Token ID845
Feature activation+5.44e+02

Pos logit contributions
 and+3.209
,+3.170
.+3.027
 the+2.988
 a+2.752
Neg logit contributions
lor-6.638
umm-6.588
 former-6.552
ocol-6.516
rol-6.506
Token sorry
Token ID7926
Feature activation+5.44e+02

Pos logit contributions
 and+4.663
,+4.552
.+4.379
 the+4.370
 a+4.134
Neg logit contributions
umm-5.315
 former-5.282
lor-5.264
ocol-5.246
arians-5.172
Token and
Token ID290
Feature activation+5.44e+02

Pos logit contributions
 and+1.342
,+1.310
 the+1.258
.+1.125
 a+0.975
Neg logit contributions
lor-8.432
umm-8.331
 former-8.237
osity-8.232
arians-8.231
Token his
Token ID465
Feature activation+5.44e+02

Pos logit contributions
 and+2.825
,+2.736
 the+2.580
.+2.517
 a+2.353
Neg logit contributions
lor-7.043
 former-6.884
umm-6.872
osity-6.848
arians-6.827
Token head
Token ID1182
Feature activation+5.44e+02

Pos logit contributions
 and+5.181
,+5.159
 the+4.992
.+4.933
 a+4.691
Neg logit contributions
lor-4.858
umm-4.812
 former-4.782
arians-4.757
ocol-4.598
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
 and+2.328
,+2.264
 the+2.183
.+2.008
 a+1.886
Neg logit contributions
lor-7.591
umm-7.398
 former-7.385
rol-7.372
arians-7.361
Token "
Token ID366
Feature activation+5.44e+02

Pos logit contributions
 and+4.680
,+4.628
.+4.437
 the+4.396
 a+4.149
Neg logit contributions
lor-5.420
umm-5.317
 former-5.234
arians-5.226
osity-5.124
TokenOh
Token ID5812
Feature activation+5.44e+02

Pos logit contributions
 and+5.982
,+5.932
.+5.744
 the+5.716
 a+5.464
Neg logit contributions
lor-4.131
umm-4.041
arians-3.945
 former-3.917
ocol-3.873
Token puppy
Token ID26188
Feature activation+5.44e+02

Pos logit contributions
 and+3.827
,+3.600
.+3.532
 the+3.469
 a+3.180
Neg logit contributions
lor-6.348
umm-6.225
arians-6.091
ocol-6.083
 former-6.075
Token,
Token ID11
Feature activation+5.44e+02

Pos logit contributions
 and+2.643
,+2.479
 the+2.389
.+2.349
 a+2.146
Neg logit contributions
lor-7.381
umm-7.262
arians-7.222
ocol-7.092
 former-7.079
Token why
Token ID1521
Feature activation+5.44e+02

Pos logit contributions
 and+4.059
,+3.960
.+3.833
 the+3.711
 a+3.473
Neg logit contributions
lor-6.018
umm-5.904
arians-5.808
ocol-5.798
osity-5.769
Token did
Token ID750
Feature activation+5.44e+02

Pos logit contributions
 and+4.077
,+3.945
.+3.801
 the+3.742
 a+3.509
Neg logit contributions
lor-5.920
umm-5.798
arians-5.768
osity-5.747
ocol-5.715
Token you
Token ID345
Feature activation+5.44e+02

Pos logit contributions
 and+3.491
,+3.313
.+3.177
 the+3.019
 a+2.800
Neg logit contributions
lor-6.589
arians-6.449
umm-6.445
ocol-6.387
 former-6.340
Token bite
Token ID13197
Feature activation+5.44e+02

Pos logit contributions
 and+4.634
,+4.558
.+4.398
 the+4.398
 a+4.142
Neg logit contributions
lor-5.343
umm-5.251
 former-5.223
arians-5.203
osity-5.123
Token smiles
Token ID21845
Feature activation+5.44e+02

Pos logit contributions
 and+4.501
,+4.467
.+4.335
 the+4.277
 a+4.049
Neg logit contributions
lor-5.487
arians-5.367
umm-5.365
ocol-5.287
 former-5.276
Token and
Token ID290
Feature activation+5.44e+02

Pos logit contributions
,+0.439
 and+0.411
 the+0.314
.+0.218
 a+0.016
Neg logit contributions
lor-9.284
umm-9.179
osity-9.137
arians-9.123
 former-9.059
Token says
Token ID1139
Feature activation+5.44e+02

Pos logit contributions
 and+4.696
,+4.652
.+4.490
 the+4.385
 a+4.163
Neg logit contributions
lor-5.252
umm-5.217
arians-5.152
 former-5.094
ocol-5.082
Token "
Token ID366
Feature activation+5.44e+02

Pos logit contributions
 and+0.911
,+0.677
.+0.617
 the+0.616
 a+0.398
Neg logit contributions
lor-9.068
umm-8.901
ocol-8.900
arians-8.865
osity-8.855
TokenWell
Token ID5779
Feature activation+5.44e+02

Pos logit contributions
 and+6.221
,+6.190
.+5.983
 the+5.960
 a+5.711
Neg logit contributions
lor-3.931
umm-3.869
arians-3.753
 former-3.714
ocol-3.663
Token done
Token ID1760
Feature activation+5.44e+02

Pos logit contributions
 and+4.212
,+4.002
.+3.968
 the+3.875
 a+3.660
Neg logit contributions
lor-5.812
umm-5.708
ocol-5.680
arians-5.667
ides-5.585
Token,
Token ID11
Feature activation+5.44e+02

Pos logit contributions
 and+2.135
,+1.938
 the+1.833
.+1.772
 a+1.571
Neg logit contributions
lor-7.902
umm-7.795
arians-7.794
ocol-7.662
 former-7.656
Token pirates
Token ID27516
Feature activation+5.44e+02

Pos logit contributions
 and+3.768
,+3.682
.+3.587
 the+3.466
 a+3.272
Neg logit contributions
umm-6.219
lor-6.211
arians-6.127
 former-6.041
ocol-6.029
Token!
Token ID0
Feature activation+5.44e+02

Pos logit contributions
 and+2.835
,+2.748
 the+2.591
.+2.557
 a+2.366
Neg logit contributions
lor-7.176
arians-7.009
umm-7.002
ocol-6.885
 former-6.872
Token You
Token ID921
Feature activation+5.44e+02

Pos logit contributions
 and+5.704
,+5.626
.+5.451
 the+5.418
 a+5.171
Neg logit contributions
lor-4.422
umm-4.336
arians-4.245
 former-4.177
ocol-4.177
Token are
Token ID389
Feature activation+5.44e+02

Pos logit contributions
 and+4.308
,+4.236
.+4.076
 the+4.043
 a+3.773
Neg logit contributions
lor-5.777
umm-5.675
arians-5.621
 former-5.562
ocol-5.516
Token to
Token ID284
Feature activation+5.44e+02

Pos logit contributions
 and+1.568
,+1.515
.+1.320
 the+1.209
 a+0.958
Neg logit contributions
lor-8.447
umm-8.245
arians-8.243
 former-8.229
ocol-8.155
Token see
Token ID766
Feature activation+5.44e+02

Pos logit contributions
 and+4.288
,+4.205
.+4.031
 the+4.004
 a+3.756
Neg logit contributions
lor-5.704
 former-5.603
arians-5.560
umm-5.525
osity-5.419
Token them
Token ID606
Feature activation+5.44e+02

Pos logit contributions
 and+3.846
,+3.796
.+3.591
 the+3.443
 a+3.245
Neg logit contributions
lor-6.140
arians-5.947
ocol-5.935
 former-5.882
umm-5.873
Token with
Token ID351
Feature activation+5.44e+02

Pos logit contributions
 and+2.853
,+2.807
 the+2.650
.+2.628
 a+2.430
Neg logit contributions
lor-6.954
arians-6.882
 former-6.859
umm-6.806
osity-6.745
Token her
Token ID607
Feature activation+5.44e+02

Pos logit contributions
 and+3.673
,+3.598
.+3.415
 the+3.290
 a+3.071
Neg logit contributions
lor-6.290
ocol-6.246
umm-6.183
arians-6.178
 former-6.175
Token own
Token ID898
Feature activation+5.44e+02

Pos logit contributions
 and+4.508
,+4.427
 the+4.275
.+4.220
 a+4.032
Neg logit contributions
lor-5.475
arians-5.326
 former-5.305
umm-5.304
ocol-5.261
Token eyes
Token ID2951
Feature activation+5.44e+02

Pos logit contributions
 and+5.439
,+5.358
 the+5.210
.+5.174
 a+4.963
Neg logit contributions
lor-4.579
umm-4.433
arians-4.396
 former-4.392
ocol-4.387
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
 and+0.498
,+0.451
 the+0.429
.+0.254
 a+0.158
Neg logit contributions
lor-9.296
umm-9.103
arians-9.068
 former-9.063
ides-9.004
Token She
Token ID1375
Feature activation+5.44e+02

Pos logit contributions
 and+3.467
,+3.409
 the+3.215
.+3.203
 a+2.962
Neg logit contributions
lor-6.663
umm-6.471
arians-6.439
 former-6.385
osity-6.325
Token asked
Token ID1965
Feature activation+5.44e+02

Pos logit contributions
,+4.102
 and+4.083
.+3.934
 the+3.902
 a+3.621
Neg logit contributions
lor-5.898
arians-5.757
 former-5.704
umm-5.659
ides-5.544
Token R
Token ID371
Feature activation+5.44e+02

Pos logit contributions
 and+5.695
,+5.584
.+5.441
 the+5.343
 a+5.135
Neg logit contributions
lor-4.348
arians-4.178
ocol-4.113
umm-4.101
 former-4.036
Token man
Token ID582
Feature activation+5.44e+02

Pos logit contributions
 and+3.952
,+3.856
.+3.706
 the+3.688
 a+3.426
Neg logit contributions
lor-6.083
umm-6.045
arians-5.911
 former-5.911
ocol-5.829
Token would
Token ID561
Feature activation+5.44e+02

Pos logit contributions
 and+3.731
,+3.690
 the+3.546
.+3.524
 a+3.344
Neg logit contributions
lor-6.191
arians-6.041
umm-6.003
ocol-5.948
 former-5.920
Token appear
Token ID1656
Feature activation+5.44e+02

Pos logit contributions
 and+4.368
,+4.324
.+4.145
 the+4.121
 a+3.871
Neg logit contributions
lor-5.636
umm-5.505
arians-5.473
 former-5.466
ocol-5.415
Token at
Token ID379
Feature activation+5.44e+02

Pos logit contributions
 and+2.468
,+2.456
 the+2.258
.+2.203
 a+2.002
Neg logit contributions
lor-7.472
umm-7.292
 former-7.231
arians-7.212
rol-7.194
Token the
Token ID262
Feature activation+5.44e+02

Pos logit contributions
 and+2.671
,+2.632
.+2.395
 the+2.251
 a+2.043
Neg logit contributions
lor-7.337
umm-7.220
 former-7.199
arians-7.126
ocol-7.118
Token top
Token ID1353
Feature activation+5.44e+02

Pos logit contributions
 and+5.456
,+5.400
.+5.207
 the+5.159
 a+4.904
Neg logit contributions
lor-4.611
umm-4.604
arians-4.439
 former-4.425
vation-4.293
Token of
Token ID286
Feature activation+5.44e+02

Pos logit contributions
 and+2.659
,+2.617
 the+2.466
.+2.367
 a+2.320
Neg logit contributions
lor-7.246
umm-7.239
arians-7.040
 former-6.959
ocol-6.949
Token the
Token ID262
Feature activation+5.44e+02

Pos logit contributions
 and+1.498
,+1.468
.+1.257
 the+1.044
 a+0.836
Neg logit contributions
lor-8.471
umm-8.418
ocol-8.407
 former-8.345
arians-8.299
Token hill
Token ID12788
Feature activation+5.44e+02

Pos logit contributions
 and+5.904
,+5.830
.+5.670
 the+5.627
 a+5.396
Neg logit contributions
umm-4.176
lor-4.143
arians-3.979
 former-3.962
ocol-3.866
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
 and+1.085
,+1.032
 the+0.908
.+0.768
 a+0.724
Neg logit contributions
lor-8.807
umm-8.686
arians-8.617
rol-8.582
 former-8.566
Token He
Token ID679
Feature activation+5.44e+02

Pos logit contributions
 and+3.488
,+3.440
.+3.243
 the+3.202
 a+2.972
Neg logit contributions
lor-6.572
umm-6.530
arians-6.379
 former-6.369
osity-6.307
Token to
Token ID284
Feature activation+5.44e+02

Pos logit contributions
 and+2.300
,+2.233
.+2.067
 the+1.946
 a+1.679
Neg logit contributions
lor-7.624
umm-7.593
 former-7.522
arians-7.507
ocol-7.474
Token do
Token ID466
Feature activation+5.44e+02

Pos logit contributions
 and+4.363
,+4.258
 the+4.075
.+4.068
 a+3.815
Neg logit contributions
lor-5.649
umm-5.570
arians-5.540
 former-5.522
ides-5.390
Token things
Token ID1243
Feature activation+5.44e+02

Pos logit contributions
 and+3.412
,+3.309
.+3.122
 the+3.044
 a+2.769
Neg logit contributions
lor-6.635
 former-6.441
arians-6.424
umm-6.396
ocol-6.369
Token on
Token ID319
Feature activation+5.44e+02

Pos logit contributions
 and+2.559
,+2.447
 the+2.304
.+2.284
 a+2.078
Neg logit contributions
lor-7.293
umm-7.242
arians-7.181
 former-7.130
ocol-7.084
Token her
Token ID607
Feature activation+5.44e+02

Pos logit contributions
 and+4.389
,+4.305
.+4.089
 the+4.048
 a+3.825
Neg logit contributions
lor-5.597
umm-5.542
 former-5.477
arians-5.476
ocol-5.434
Token own
Token ID898
Feature activation+5.44e+02

Pos logit contributions
 and+5.320
,+5.242
 the+5.104
.+5.036
 a+4.840
Neg logit contributions
lor-4.661
umm-4.626
 former-4.619
arians-4.608
ocol-4.428
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
 and+3.224
,+3.138
 the+3.021
.+2.912
 a+2.779
Neg logit contributions
lor-6.750
umm-6.667
arians-6.625
 former-6.613
ides-6.474
Token One
Token ID1881
Feature activation+5.44e+02

Pos logit contributions
 and+3.970
,+3.894
 the+3.704
.+3.702
 a+3.492
Neg logit contributions
umm-6.030
lor-6.018
arians-5.875
 former-5.862
osity-5.820
Token day
Token ID1110
Feature activation+5.44e+02

Pos logit contributions
 and+3.173
,+3.106
 the+2.957
.+2.947
 a+2.696
Neg logit contributions
umm-6.749
lor-6.711
 former-6.611
arians-6.526
ocol-6.524
Token,
Token ID11
Feature activation+5.44e+02

Pos logit contributions
 and+0.798
,+0.545
.+0.511
 the+0.490
 a+0.251
Neg logit contributions
lor-9.071
umm-9.023
ocol-8.945
arians-8.892
osity-8.888
Token she
Token ID673
Feature activation+5.44e+02

Pos logit contributions
 and+4.317
,+4.198
.+4.078
 the+3.935
 a+3.709
Neg logit contributions
umm-5.589
lor-5.507
arians-5.398
osity-5.389
ocol-5.371
Token on
Token ID319
Feature activation+5.44e+02

Pos logit contributions
,+2.122
 and+2.117
 the+1.882
.+1.878
 a+1.616
Neg logit contributions
lor-7.843
umm-7.664
 former-7.610
arians-7.609
rol-7.605
Token the
Token ID262
Feature activation+5.44e+02

Pos logit contributions
,+1.007
 and+0.997
.+0.769
 the+0.601
 a+0.357
Neg logit contributions
lor-8.952
umm-8.745
arians-8.721
 former-8.702
ocol-8.667
Token seat
Token ID5852
Feature activation+5.44e+02

Pos logit contributions
 and+4.815
,+4.804
.+4.578
 the+4.541
 a+4.327
Neg logit contributions
lor-5.202
umm-5.125
arians-5.025
 former-4.934
vation-4.932
Token at
Token ID379
Feature activation+5.44e+02

Pos logit contributions
 and+3.956
,+3.924
 the+3.809
.+3.653
 a+3.556
Neg logit contributions
lor-6.038
umm-5.849
arians-5.789
rol-5.782
 former-5.744
Token the
Token ID262
Feature activation+5.44e+02

Pos logit contributions
 and+2.173
,+2.139
.+1.896
 the+1.726
 a+1.557
Neg logit contributions
lor-7.798
umm-7.669
 former-7.632
arians-7.575
rol-7.528
Token top
Token ID1353
Feature activation+5.44e+02

Pos logit contributions
 and+5.042
,+4.994
.+4.774
 the+4.725
 a+4.466
Neg logit contributions
lor-5.045
umm-4.965
arians-4.847
 former-4.836
vation-4.722
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
 and+2.601
,+2.570
 the+2.413
.+2.292
 a+2.227
Neg logit contributions
lor-7.297
umm-7.290
arians-7.078
 former-7.033
rol-7.026
Token He
Token ID679
Feature activation+5.44e+02

Pos logit contributions
 and+3.396
,+3.347
.+3.166
 the+3.115
 a+2.883
Neg logit contributions
lor-6.664
umm-6.585
arians-6.466
 former-6.460
ocol-6.390
Token looked
Token ID3114
Feature activation+5.44e+02

Pos logit contributions
 and+3.764
,+3.736
.+3.569
 the+3.514
 a+3.274
Neg logit contributions
lor-6.196
arians-6.089
umm-6.033
 former-6.032
ocol-5.953
Token very
Token ID845
Feature activation+5.44e+02

Pos logit contributions
 and+2.918
,+2.875
.+2.757
 the+2.684
 a+2.449
Neg logit contributions
lor-6.865
umm-6.857
ocol-6.823
 former-6.795
rol-6.787
Token rude
Token ID22066
Feature activation+5.44e+02

Pos logit contributions
 and+4.245
,+4.136
.+3.976
 the+3.961
 a+3.753
Neg logit contributions
umm-5.714
ocol-5.655
 former-5.640
lor-5.637
arians-5.570
Token make
Token ID787
Feature activation+5.44e+02

Pos logit contributions
 and+4.622
,+4.554
.+4.386
 the+4.343
 a+4.107
Neg logit contributions
lor-5.412
umm-5.374
arians-5.300
 former-5.291
ocol-5.189
Token a
Token ID257
Feature activation+5.44e+02

Pos logit contributions
 and+2.760
,+2.693
.+2.471
 the+2.358
 a+2.094
Neg logit contributions
lor-7.247
umm-7.111
 former-7.079
arians-7.064
ocol-7.043
Token game
Token ID983
Feature activation+5.44e+02

Pos logit contributions
 and+5.187
,+5.106
.+4.945
 the+4.924
 a+4.652
Neg logit contributions
lor-4.883
umm-4.836
 former-4.757
arians-4.723
ocol-4.640
Token of
Token ID286
Feature activation+5.44e+02

Pos logit contributions
 and+2.299
,+2.240
 the+2.113
.+2.037
 a+1.832
Neg logit contributions
lor-7.633
umm-7.558
arians-7.491
 former-7.407
rol-7.399
Token their
Token ID511
Feature activation+5.44e+02

Pos logit contributions
 and+5.113
,+5.039
.+4.853
 the+4.778
 a+4.535
Neg logit contributions
lor-4.903
umm-4.843
arians-4.783
 former-4.783
ocol-4.771
Token own
Token ID898
Feature activation+5.44e+02

Pos logit contributions
 and+5.286
,+5.211
.+5.040
 the+5.021
 a+4.770
Neg logit contributions
umm-4.764
lor-4.752
arians-4.675
 former-4.674
ocol-4.583
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
 and+5.146
,+5.071
 the+4.934
.+4.872
 a+4.662
Neg logit contributions
lor-4.885
umm-4.804
arians-4.755
 former-4.746
ocol-4.633
Token
Token ID198
Feature activation+5.44e+02

Pos logit contributions
 and+4.692
,+4.646
.+4.452
 the+4.419
 a+4.172
Neg logit contributions
lor-5.426
umm-5.363
arians-5.246
 former-5.230
osity-5.182
Token
Token ID198
Feature activation+5.44e+02

Pos logit contributions
 and+1.584
,+1.560
.+1.358
 the+1.341
 a+1.098
Neg logit contributions
lor-8.512
arians-8.198
umm-8.189
rol-8.166
 former-8.156
TokenThey
Token ID2990
Feature activation+5.44e+02

Pos logit contributions
 and+5.431
,+5.370
.+5.193
 the+5.168
 a+4.918
Neg logit contributions
lor-4.686
umm-4.591
arians-4.511
 former-4.466
ocol-4.423
Token put
Token ID1234
Feature activation+5.44e+02

Pos logit contributions
 and+4.809
,+4.748
.+4.597
 the+4.522
 a+4.295
Neg logit contributions
lor-5.170
umm-5.107
arians-5.087
 former-5.015
ocol-4.992
Token hands
Token ID2832
Feature activation+5.44e+02

Pos logit contributions
 and+5.067
,+5.016
 the+4.837
.+4.830
 a+4.559
Neg logit contributions
lor-5.031
umm-4.963
arians-4.857
 former-4.845
ocol-4.795
Token and
Token ID290
Feature activation+5.44e+02

Pos logit contributions
 and+1.661
,+1.642
 the+1.551
.+1.406
 a+1.263
Neg logit contributions
lor-8.161
umm-8.089
 former-7.977
arians-7.962
rol-7.940
Token shouted
Token ID17293
Feature activation+5.44e+02

Pos logit contributions
 and+4.819
,+4.738
.+4.586
 the+4.473
 a+4.284
Neg logit contributions
lor-5.143
umm-5.079
 former-5.037
arians-5.003
osity-4.928
Token "
Token ID366
Feature activation+5.44e+02

Pos logit contributions
 and+3.206
,+3.070
 the+2.948
.+2.912
 a+2.698
Neg logit contributions
lor-6.791
umm-6.690
arians-6.594
ocol-6.576
osity-6.571
TokenWell
Token ID5779
Feature activation+5.44e+02

Pos logit contributions
 and+6.846
,+6.798
.+6.603
 the+6.584
 a+6.331
Neg logit contributions
lor-3.261
umm-3.203
arians-3.086
 former-3.043
ocol-3.008
Token done
Token ID1760
Feature activation+5.44e+02

Pos logit contributions
 and+4.998
,+4.840
.+4.754
 the+4.690
 a+4.450
Neg logit contributions
lor-5.132
umm-5.009
arians-4.917
ocol-4.910
rol-4.851
Token!"
Token ID2474
Feature activation+5.44e+02

Pos logit contributions
 and+4.482
,+4.321
.+4.191
 the+4.154
 a+3.909
Neg logit contributions
lor-5.622
umm-5.511
arians-5.447
ocol-5.360
 former-5.350
Token
Token ID198
Feature activation+5.44e+02

Pos logit contributions
 and+3.875
,+3.857
.+3.652
 the+3.632
 a+3.398
Neg logit contributions
lor-6.165
umm-6.050
arians-5.965
 former-5.938
osity-5.862
Token
Token ID198
Feature activation+5.44e+02

Pos logit contributions
 and+3.332
,+3.286
.+3.087
 the+3.054
 a+2.812
Neg logit contributions
lor-6.831
umm-6.646
arians-6.597
 former-6.568
ocol-6.517
TokenAll
Token ID3237
Feature activation+5.44e+02

Pos logit contributions
 and+4.770
,+4.706
.+4.529
 the+4.506
 a+4.257
Neg logit contributions
lor-5.349
umm-5.259
arians-5.174
 former-5.131
ocol-5.094
Token the
Token ID262
Feature activation+5.44e+02

Pos logit contributions
 and+3.848
,+3.769
.+3.620
 the+3.475
 a+3.317
Neg logit contributions
lor-6.186
umm-6.160
ocol-6.014
rol-5.994
osity-5.978
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
 and+4.954
,+4.880
 the+4.738
.+4.634
 a+4.470
Neg logit contributions
lor-5.156
arians-4.924
umm-4.884
ides-4.814
 former-4.807
Token When
Token ID1649
Feature activation+5.44e+02

Pos logit contributions
 and+4.352
,+4.295
.+4.114
 the+4.096
 a+3.842
Neg logit contributions
lor-5.759
umm-5.654
arians-5.567
 former-5.525
ocol-5.475
Token the
Token ID262
Feature activation+5.44e+02

Pos logit contributions
 and+4.343
,+4.188
.+4.057
 the+3.943
 a+3.752
Neg logit contributions
lor-5.774
umm-5.687
arians-5.549
 former-5.520
ocol-5.504
Token story
Token ID1621
Feature activation+5.44e+02

Pos logit contributions
 and+5.331
,+5.238
.+5.060
 the+5.057
 a+4.783
Neg logit contributions
lor-4.820
umm-4.793
arians-4.626
 former-4.582
ocol-4.539
Token was
Token ID373
Feature activation+5.44e+02

Pos logit contributions
 and+4.378
,+4.247
.+4.159
 the+4.113
 a+3.900
Neg logit contributions
lor-5.647
umm-5.582
arians-5.464
ocol-5.400
vation-5.377
Token done
Token ID1760
Feature activation+5.44e+02

Pos logit contributions
 and+4.513
,+4.427
.+4.295
 the+4.203
 a+3.951
Neg logit contributions
lor-5.530
umm-5.444
rol-5.339
 former-5.334
ocol-5.307
Token,
Token ID11
Feature activation+5.44e+02

Pos logit contributions
 and+1.809
,+1.657
.+1.562
 the+1.547
 a+1.329
Neg logit contributions
lor-8.165
umm-8.031
arians-7.984
ocol-7.950
 former-7.891
Token the
Token ID262
Feature activation+5.44e+02

Pos logit contributions
 and+5.478
,+5.405
.+5.248
 the+5.103
 a+4.905
Neg logit contributions
lor-4.554
umm-4.453
arians-4.359
 former-4.347
ocol-4.297
Token little
Token ID1310
Feature activation+5.44e+02

Pos logit contributions
 and+5.380
,+5.311
.+5.125
 the+5.110
 a+4.831
Neg logit contributions
lor-4.737
umm-4.700
arians-4.572
 former-4.557
ocol-4.466
Token girl
Token ID2576
Feature activation+5.44e+02

Pos logit contributions
 and+4.111
,+4.038
.+3.879
 the+3.850
 a+3.598
Neg logit contributions
umm-5.905
lor-5.857
arians-5.729
 former-5.716
ocol-5.651
Token felt
Token ID2936
Feature activation+5.44e+02

Pos logit contributions
 and+3.731
,+3.695
 the+3.550
.+3.543
 a+3.309
Neg logit contributions
lor-6.249
arians-6.076
umm-6.075
 former-6.024
rol-5.910
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
 and+3.656
,+3.592
 the+3.453
.+3.406
 a+3.232
Neg logit contributions
lor-6.277
umm-6.181
arians-6.125
 former-6.080
osity-6.002
Token When
Token ID1649
Feature activation+5.44e+02

Pos logit contributions
 and+5.246
,+5.208
.+5.010
 the+4.994
 a+4.732
Neg logit contributions
umm-4.838
lor-4.826
 former-4.680
arians-4.667
osity-4.610
Token the
Token ID262
Feature activation+5.44e+02

Pos logit contributions
 and+2.822
,+2.677
.+2.554
 the+2.394
 a+2.213
Neg logit contributions
umm-7.249
lor-7.167
 former-7.026
arians-6.994
ocol-6.981
Token jobs
Token ID3946
Feature activation+5.44e+02

Pos logit contributions
 and+5.360
,+5.280
.+5.107
 the+5.088
 a+4.822
Neg logit contributions
umm-4.814
lor-4.799
arians-4.642
 former-4.570
ocol-4.551
Token were
Token ID547
Feature activation+5.44e+02

Pos logit contributions
 and+4.318
,+4.217
.+4.104
 the+4.035
 a+3.870
Neg logit contributions
lor-5.664
umm-5.630
arians-5.519
ocol-5.444
rol-5.423
Token done
Token ID1760
Feature activation+5.44e+02

Pos logit contributions
 and+4.805
,+4.731
.+4.580
 the+4.516
 a+4.273
Neg logit contributions
lor-5.200
umm-5.157
arians-5.052
 former-5.043
ocol-5.016
Token,
Token ID11
Feature activation+5.44e+02

Pos logit contributions
 and+1.612
,+1.462
.+1.386
 the+1.356
 a+1.137
Neg logit contributions
lor-8.327
umm-8.291
arians-8.174
ocol-8.160
 former-8.121
Token the
Token ID262
Feature activation+5.44e+02

Pos logit contributions
 and+3.795
,+3.743
.+3.594
 the+3.416
 a+3.222
Neg logit contributions
umm-6.179
lor-6.069
 former-5.983
arians-5.955
osity-5.896
Token shop
Token ID6128
Feature activation+5.44e+02

Pos logit contributions
 and+4.865
,+4.795
.+4.619
 the+4.589
 a+4.327
Neg logit contributions
umm-5.289
lor-5.276
arians-5.151
 former-5.084
ocol-5.039
Tokenkeeper
Token ID13884
Feature activation+5.44e+02

Pos logit contributions
 and+7.121
,+7.051
.+6.909
 the+6.900
 a+6.660
Neg logit contributions
lor-2.922
umm-2.863
arians-2.776
 former-2.678
osity-2.606
Token said
Token ID531
Feature activation+5.44e+02

Pos logit contributions
 and+4.354
,+4.321
.+4.185
 the+4.154
 a+3.931
Neg logit contributions
lor-5.636
umm-5.515
arians-5.484
 former-5.383
ocol-5.309
Token
Token ID198
Feature activation+5.44e+02

Pos logit contributions
 and+4.834
,+4.775
.+4.587
 the+4.567
 a+4.312
Neg logit contributions
lor-5.257
umm-5.254
 former-5.054
arians-5.035
osity-5.026
Token
Token ID198
Feature activation+5.44e+02

Pos logit contributions
 and+2.782
,+2.745
.+2.533
 the+2.509
 a+2.275
Neg logit contributions
lor-7.285
umm-7.024
arians-6.957
 former-6.955
rol-6.931
TokenAfter
Token ID3260
Feature activation+5.44e+02

Pos logit contributions
 and+5.881
,+5.834
.+5.653
 the+5.626
 a+5.372
Neg logit contributions
lor-4.244
umm-4.157
arians-4.058
 former-4.023
ocol-3.965
Token he
Token ID339
Feature activation+5.44e+02

Pos logit contributions
 and+4.208
,+4.068
.+3.931
 the+3.802
 a+3.541
Neg logit contributions
lor-5.790
umm-5.743
ocol-5.674
 former-5.612
arians-5.599
Token was
Token ID373
Feature activation+5.44e+02

Pos logit contributions
 and+4.459
,+4.419
.+4.284
 the+4.219
 a+3.988
Neg logit contributions
lor-5.533
umm-5.460
 former-5.353
arians-5.328
ocol-5.313
Token done
Token ID1760
Feature activation+5.44e+02

Pos logit contributions
 and+4.918
,+4.801
.+4.671
 the+4.597
 a+4.339
Neg logit contributions
lor-5.139
umm-5.051
 former-4.954
ocol-4.939
arians-4.908
Token,
Token ID11
Feature activation+5.44e+02

Pos logit contributions
 and+4.611
,+4.444
.+4.378
 the+4.297
 a+4.076
Neg logit contributions
lor-5.414
umm-5.307
ocol-5.250
arians-5.188
 former-5.158
Token Jack
Token ID3619
Feature activation+5.44e+02

Pos logit contributions
 and+3.670
,+3.589
.+3.465
 the+3.254
 a+3.061
Neg logit contributions
umm-6.250
lor-6.122
 former-6.085
arians-6.022
osity-6.014
Token decided
Token ID3066
Feature activation+5.44e+02

Pos logit contributions
 and+3.720
,+3.714
.+3.549
 the+3.546
 a+3.303
Neg logit contributions
lor-6.251
umm-6.025
arians-6.010
 former-5.975
ides-5.869
Token to
Token ID284
Feature activation+5.44e+02

Pos logit contributions
 and+1.385
,+1.339
.+1.135
 the+1.082
 a+0.870
Neg logit contributions
lor-8.583
umm-8.357
 former-8.314
arians-8.282
osity-8.280
Token see
Token ID766
Feature activation+5.44e+02

Pos logit contributions
 and+4.139
,+4.061
.+3.908
 the+3.885
 a+3.638
Neg logit contributions
lor-5.895
umm-5.824
 former-5.766
arians-5.706
ocol-5.642
Token with
Token ID351
Feature activation+5.44e+02

Pos logit contributions
 and+2.733
,+2.673
 the+2.569
.+2.447
 a+2.341
Neg logit contributions
lor-7.140
umm-7.101
arians-6.969
ocol-6.927
rol-6.908
Token a
Token ID257
Feature activation+5.44e+02

Pos logit contributions
 and+3.932
,+3.892
.+3.707
 the+3.578
 a+3.297
Neg logit contributions
umm-6.035
lor-6.025
 former-5.919
ocol-5.894
arians-5.862
Token beautiful
Token ID4950
Feature activation+5.44e+02

Pos logit contributions
 and+5.080
,+5.009
.+4.854
 the+4.846
 a+4.570
Neg logit contributions
lor-4.951
umm-4.883
 former-4.784
arians-4.773
ocol-4.706
Token flag
Token ID6056
Feature activation+5.44e+02

Pos logit contributions
 and+5.178
,+5.104
.+4.974
 the+4.957
 a+4.708
Neg logit contributions
lor-4.777
umm-4.774
 former-4.636
ocol-4.600
arians-4.574
Token on
Token ID319
Feature activation+5.44e+02

Pos logit contributions
 and+3.136
,+3.089
 the+2.989
.+2.843
 a+2.738
Neg logit contributions
lor-6.851
umm-6.714
arians-6.651
osity-6.555
ocol-6.546
Token top
Token ID1353
Feature activation+5.44e+02

Pos logit contributions
 and+2.762
,+2.710
.+2.482
 the+2.376
 a+2.194
Neg logit contributions
lor-7.156
umm-7.106
ocol-7.008
 former-6.989
arians-6.984
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
 and+1.616
,+1.586
 the+1.450
.+1.292
 a+1.257
Neg logit contributions
lor-8.311
umm-8.280
arians-8.067
ocol-8.050
ides-8.024
Token Jack
Token ID3619
Feature activation+5.44e+02

Pos logit contributions
 and+4.106
,+4.025
.+3.870
 the+3.832
 a+3.583
Neg logit contributions
umm-5.960
lor-5.957
 former-5.784
arians-5.783
osity-5.741
Token wanted
Token ID2227
Feature activation+5.44e+02

Pos logit contributions
 and+3.602
,+3.590
.+3.474
 the+3.454
 a+3.207
Neg logit contributions
lor-6.332
arians-6.186
umm-6.147
 former-6.086
osity-6.054
Token to
Token ID284
Feature activation+5.44e+02

Pos logit contributions
 and+1.175
,+1.142
.+0.994
 the+0.830
 a+0.604
Neg logit contributions
lor-8.752
umm-8.593
arians-8.561
osity-8.526
 former-8.513
Token get
Token ID651
Feature activation+5.44e+02

Pos logit contributions
 and+4.118
,+4.049
.+3.878
 the+3.811
 a+3.583
Neg logit contributions
lor-5.838
umm-5.772
arians-5.708
 former-5.697
osity-5.646
Token magazine
Token ID7093
Feature activation+5.44e+02

Pos logit contributions
 and+5.926
,+5.851
.+5.676
 the+5.649
 a+5.387
Neg logit contributions
lor-4.090
umm-4.021
arians-3.999
ocol-3.926
 former-3.898
Token?
Token ID30
Feature activation+5.44e+02

Pos logit contributions
 and+2.624
,+2.569
 the+2.440
.+2.411
 a+2.168
Neg logit contributions
lor-7.299
umm-7.165
 former-7.105
arians-7.089
osity-6.943
Token I
Token ID314
Feature activation+5.44e+02

Pos logit contributions
 and+5.325
,+5.273
.+5.084
 the+5.070
 a+4.791
Neg logit contributions
lor-4.756
umm-4.661
arians-4.615
 former-4.527
ocol-4.523
Token have
Token ID423
Feature activation+5.44e+02

Pos logit contributions
 and+4.361
,+4.335
 the+4.181
.+4.126
 a+3.889
Neg logit contributions
lor-5.581
 former-5.474
arians-5.462
umm-5.434
ocol-5.408
Token my
Token ID616
Feature activation+5.44e+02

Pos logit contributions
 and+3.105
,+3.047
.+2.830
 the+2.747
 a+2.461
Neg logit contributions
lor-6.955
umm-6.756
 former-6.752
arians-6.745
ocol-6.742
Token own
Token ID898
Feature activation+5.44e+02

Pos logit contributions
 and+4.897
,+4.831
.+4.645
 the+4.645
 a+4.358
Neg logit contributions
lor-5.049
umm-5.024
arians-5.005
 former-4.986
ocol-4.912
Token magazines
Token ID16695
Feature activation+5.44e+02

Pos logit contributions
 and+5.093
,+5.021
 the+4.861
.+4.833
 a+4.600
Neg logit contributions
lor-4.914
arians-4.781
umm-4.762
 former-4.720
ocol-4.652
Token."
Token ID526
Feature activation+5.44e+02

Pos logit contributions
 and+2.178
,+2.134
 the+2.036
.+1.929
 a+1.765
Neg logit contributions
lor-7.678
umm-7.556
arians-7.508
 former-7.427
ides-7.355
Token
Token ID198
Feature activation+5.44e+02

Pos logit contributions
 and+1.731
,+1.699
 the+1.509
.+1.483
 a+1.280
Neg logit contributions
lor-8.223
umm-8.080
 former-7.976
arians-7.913
ocol-7.894
Token
Token ID198
Feature activation+5.44e+02

Pos logit contributions
 and+1.654
,+1.623
 the+1.359
.+1.340
 a+1.161
Neg logit contributions
lor-8.334
 former-8.051
umm-8.045
arians-8.013
ocol-8.002
TokenHe
Token ID1544
Feature activation+5.44e+02

Pos logit contributions
 and+4.825
,+4.790
.+4.571
 the+4.565
 a+4.322
Neg logit contributions
lor-5.323
umm-5.148
arians-5.126
 former-5.115
ocol-5.035
Token to
Token ID284
Feature activation+5.44e+02

Pos logit contributions
 and+3.380
,+3.296
.+3.113
 the+3.089
 a+2.863
Neg logit contributions
lor-6.556
 former-6.410
umm-6.404
osity-6.354
arians-6.346
Token regret
Token ID13721
Feature activation+5.44e+02

Pos logit contributions
 and+5.185
,+5.092
.+4.942
 the+4.884
 a+4.670
Neg logit contributions
lor-4.768
 former-4.718
umm-4.714
arians-4.619
osity-4.570
Token what
Token ID644
Feature activation+5.44e+02

Pos logit contributions
 and+4.353
,+4.257
.+4.034
 the+4.012
 a+3.834
Neg logit contributions
lor-5.612
arians-5.502
ocol-5.402
umm-5.384
osity-5.383
Token he
Token ID339
Feature activation+5.44e+02

Pos logit contributions
 and+2.638
,+2.483
.+2.321
 the+2.200
 a+2.012
Neg logit contributions
lor-7.400
umm-7.258
ocol-7.195
osity-7.195
arians-7.189
Token had
Token ID550
Feature activation+5.44e+02

Pos logit contributions
,+3.575
 and+3.569
 the+3.345
.+3.345
 a+3.091
Neg logit contributions
lor-6.442
umm-6.295
arians-6.282
 former-6.206
ides-6.165
Token done
Token ID1760
Feature activation+5.44e+02

Pos logit contributions
 and+5.043
,+5.010
 the+4.825
.+4.780
 a+4.566
Neg logit contributions
lor-4.994
umm-4.795
arians-4.745
 former-4.741
ides-4.717
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
 and+1.045
,+0.998
 the+0.843
.+0.725
 a+0.665
Neg logit contributions
lor-8.782
umm-8.658
arians-8.570
ocol-8.528
ides-8.522
Token He
Token ID679
Feature activation+5.44e+02

Pos logit contributions
 and+4.430
,+4.372
.+4.196
 the+4.167
 a+3.917
Neg logit contributions
lor-5.686
umm-5.627
 former-5.514
arians-5.492
osity-5.441
Token apologized
Token ID22893
Feature activation+5.44e+02

Pos logit contributions
,+4.494
 and+4.475
.+4.325
 the+4.266
 a+4.032
Neg logit contributions
lor-5.500
arians-5.325
 former-5.325
umm-5.280
bek-5.185
Token to
Token ID284
Feature activation+5.44e+02

Pos logit contributions
 and+1.233
,+1.231
 the+1.117
.+1.103
 a+0.896
Neg logit contributions
lor-8.501
 former-8.331
arians-8.315
umm-8.314
osity-8.273
Token the
Token ID262
Feature activation+5.44e+02

Pos logit contributions
 and+0.973
,+0.947
.+0.751
 the+0.503
 a+0.419
Neg logit contributions
lor-8.904
arians-8.881
ocol-8.862
 former-8.835
umm-8.833
Token food
Token ID2057
Feature activation+5.44e+02

Pos logit contributions
 and+5.480
,+5.423
.+5.253
 the+5.236
 a+4.983
Neg logit contributions
lor-4.601
umm-4.584
arians-4.457
ocol-4.420
 former-4.399
Token and
Token ID290
Feature activation+5.44e+02

Pos logit contributions
 and+2.932
,+2.877
 the+2.757
.+2.693
 a+2.547
Neg logit contributions
lor-6.962
umm-6.960
arians-6.852
 former-6.757
rol-6.674
Token when
Token ID618
Feature activation+5.44e+02

Pos logit contributions
 and+4.392
,+4.356
.+4.218
 the+4.033
 a+3.837
Neg logit contributions
 former-5.561
umm-5.553
arians-5.527
lor-5.512
ocol-5.385
Token they
Token ID484
Feature activation+5.44e+02

Pos logit contributions
 and+2.943
,+2.797
.+2.646
 the+2.512
 a+2.299
Neg logit contributions
lor-7.099
umm-7.065
arians-7.018
 former-6.984
ocol-6.914
Token were
Token ID547
Feature activation+5.44e+02

Pos logit contributions
 and+4.068
,+3.948
.+3.810
 the+3.760
 a+3.533
Neg logit contributions
lor-5.978
umm-5.932
arians-5.810
 former-5.780
ocol-5.773
Token done
Token ID1760
Feature activation+5.44e+02

Pos logit contributions
 and+4.972
,+4.876
.+4.716
 the+4.681
 a+4.417
Neg logit contributions
lor-5.049
umm-4.989
arians-4.918
 former-4.905
ocol-4.888
Token,
Token ID11
Feature activation+5.44e+02

Pos logit contributions
 and+3.337
,+3.174
 the+3.045
.+3.040
 a+2.827
Neg logit contributions
lor-6.620
umm-6.542
arians-6.471
ocol-6.464
 former-6.425
Token Lily
Token ID20037
Feature activation+5.44e+02

Pos logit contributions
 and+2.718
,+2.643
.+2.492
 the+2.315
 a+2.094
Neg logit contributions
lor-7.245
umm-7.206
 former-7.171
arians-7.147
ocol-7.020
Token felt
Token ID2936
Feature activation+5.44e+02

Pos logit contributions
 and+3.795
,+3.771
 the+3.632
.+3.608
 a+3.383
Neg logit contributions
lor-6.146
arians-6.022
umm-5.977
 former-5.916
osity-5.862
Token proud
Token ID6613
Feature activation+5.44e+02

Pos logit contributions
 and+3.902
,+3.809
.+3.618
 the+3.540
 a+3.265
Neg logit contributions
lor-6.050
umm-6.028
 former-5.950
ocol-5.917
arians-5.847
Token of
Token ID286
Feature activation+5.44e+02

Pos logit contributions
,+2.258
 and+2.245
 the+2.147
.+1.995
 a+1.898
Neg logit contributions
umm-7.572
lor-7.554
arians-7.348
 former-7.319
osity-7.312
Token park
Token ID3952
Feature activation+5.44e+02

Pos logit contributions
 and+4.733
,+4.666
.+4.492
 the+4.414
 a+4.166
Neg logit contributions
lor-5.412
umm-5.325
 former-5.280
arians-5.142
vation-5.046
Token,
Token ID11
Feature activation+5.44e+02

Pos logit contributions
 and+1.875
,+1.790
 the+1.699
.+1.677
 a+1.497
Neg logit contributions
lor-7.896
 former-7.837
umm-7.797
arians-7.787
osity-7.645
Token knowing
Token ID6970
Feature activation+5.44e+02

Pos logit contributions
,+3.532
 and+3.496
.+3.434
 the+3.286
 a+3.112
Neg logit contributions
lor-6.320
umm-6.224
 former-6.208
arians-6.200
osity-6.143
Token they
Token ID484
Feature activation+5.44e+02

Pos logit contributions
 and+2.282
,+2.179
.+2.049
 the+1.887
 a+1.707
Neg logit contributions
lor-7.675
 former-7.559
umm-7.551
osity-7.470
arians-7.469
Token had
Token ID550
Feature activation+5.44e+02

Pos logit contributions
 and+3.791
,+3.770
.+3.564
 the+3.509
 a+3.293
Neg logit contributions
lor-6.221
arians-6.079
umm-6.054
 former-5.972
osity-5.965
Token done
Token ID1760
Feature activation+5.44e+02

Pos logit contributions
 and+4.432
,+4.370
.+4.167
 the+4.091
 a+3.808
Neg logit contributions
lor-5.664
umm-5.436
arians-5.421
 former-5.402
ides-5.296
Token something
Token ID1223
Feature activation+5.44e+02

Pos logit contributions
 and+3.174
,+3.123
.+2.902
 the+2.816
 a+2.537
Neg logit contributions
lor-6.921
 former-6.655
arians-6.641
umm-6.616
ocol-6.494
Token good
Token ID922
Feature activation+5.44e+02

Pos logit contributions
 and+3.756
,+3.646
 the+3.447
.+3.444
 a+3.218
Neg logit contributions
lor-6.239
umm-6.093
arians-6.026
 former-6.017
osity-5.957
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
 and+2.213
,+2.147
 the+2.055
.+1.937
 a+1.831
Neg logit contributions
lor-7.757
umm-7.657
arians-7.587
osity-7.460
 former-7.444
Token<|endoftext|>
Token ID50256
Feature activation+5.44e+02

Pos logit contributions
 and+2.400
,+2.361
.+2.162
 the+2.106
 a+1.880
Neg logit contributions
lor-7.668
umm-7.566
 former-7.424
arians-7.422
rol-7.382
TokenOne
Token ID3198
Feature activation+5.44e+02

Pos logit contributions
 and+1.925
,+1.882
 the+1.638
.+1.590
 a+1.382
Neg logit contributions
lor-8.115
umm-7.806
arians-7.722
rol-7.710
 former-7.622
Token the
Token ID262
Feature activation+5.44e+02

Pos logit contributions
 and+3.531
,+3.462
.+3.279
 the+3.116
 a+2.917
Neg logit contributions
lor-6.497
umm-6.451
ocol-6.419
 former-6.381
arians-6.356
Token good
Token ID922
Feature activation+5.44e+02

Pos logit contributions
 and+6.222
,+6.198
.+6.013
 the+5.960
 a+5.691
Neg logit contributions
lor-3.812
umm-3.764
arians-3.636
 former-3.603
ocol-3.558
Token deed
Token ID28637
Feature activation+5.44e+02

Pos logit contributions
 and+6.421
,+6.368
.+6.205
 the+6.188
 a+5.947
Neg logit contributions
lor-3.601
umm-3.566
arians-3.401
ocol-3.393
 former-3.348
Token they
Token ID484
Feature activation+5.44e+02

Pos logit contributions
 and+2.693
,+2.619
 the+2.455
.+2.421
 a+2.265
Neg logit contributions
lor-7.277
umm-7.175
arians-7.122
ocol-7.026
osity-7.019
Token had
Token ID550
Feature activation+5.44e+02

Pos logit contributions
 and+4.764
,+4.718
.+4.522
 the+4.461
 a+4.260
Neg logit contributions
lor-5.296
arians-5.172
umm-5.169
ocol-5.082
 former-5.056
Token done
Token ID1760
Feature activation+5.44e+02

Pos logit contributions
 and+4.793
,+4.732
 the+4.545
.+4.511
 a+4.287
Neg logit contributions
lor-5.252
arians-5.035
umm-5.030
 former-4.978
ides-4.975
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
 and+2.227
,+2.157
 the+1.943
.+1.899
 a+1.710
Neg logit contributions
lor-7.779
umm-7.585
arians-7.548
 former-7.540
ocol-7.449
Token<|endoftext|>
Token ID50256
Feature activation+5.44e+02

Pos logit contributions
 and+3.564
,+3.517
.+3.315
 the+3.268
 a+3.061
Neg logit contributions
lor-6.566
umm-6.447
arians-6.320
 former-6.318
osity-6.265
TokenM
Token ID44
Feature activation+5.44e+02

Pos logit contributions
 and+4.231
,+4.172
 the+3.954
.+3.921
 a+3.731
Neg logit contributions
lor-5.851
umm-5.616
rol-5.506
arians-5.497
ocol-5.418
Tokenandy
Token ID10757
Feature activation+5.44e+02

Pos logit contributions
 and+6.754
,+6.660
.+6.496
 the+6.493
 a+6.246
Neg logit contributions
lor-3.404
umm-3.319
arians-3.184
 former-3.183
ocol-3.154
Token was
Token ID373
Feature activation+5.44e+02

Pos logit contributions
 and+2.803
,+2.765
.+2.631
 the+2.607
 a+2.409
Neg logit contributions
lor-7.092
umm-6.941
arians-6.910
 former-6.863
ocol-6.792
Token so
Token ID523
Feature activation+5.44e+02

Pos logit contributions
 and+5.308
,+5.232
.+5.065
 the+4.989
 a+4.730
Neg logit contributions
lor-4.723
umm-4.575
 former-4.556
arians-4.550
rol-4.512
Token tired
Token ID10032
Feature activation+5.44e+02

Pos logit contributions
 and+4.290
,+4.204
.+4.055
 the+3.967
 a+3.707
Neg logit contributions
umm-5.733
lor-5.727
 former-5.669
ocol-5.589
osity-5.587
Token but
Token ID475
Feature activation+5.44e+02

Pos logit contributions
 and+2.043
,+1.972
 the+1.872
.+1.766
 a+1.682
Neg logit contributions
umm-7.890
lor-7.800
arians-7.595
 former-7.525
rol-7.510
Token he
Token ID339
Feature activation+5.44e+02

Pos logit contributions
 and+4.070
,+4.009
.+3.883
 the+3.720
 a+3.508
Neg logit contributions
umm-5.889
lor-5.878
 former-5.839
osity-5.744
ocol-5.681
Token was
Token ID373
Feature activation+5.44e+02

Pos logit contributions
 and+4.169
,+4.140
.+3.962
 the+3.936
 a+3.694
Neg logit contributions
lor-5.870
umm-5.723
arians-5.670
 former-5.665
rol-5.548
Token done
Token ID1760
Feature activation+5.44e+02

Pos logit contributions
 and+4.868
,+4.792
.+4.607
 the+4.533
 a+4.269
Neg logit contributions
lor-5.175
 former-4.999
umm-4.998
arians-4.941
rol-4.918
Token!
Token ID0
Feature activation+5.44e+02

Pos logit contributions
 and+2.310
,+2.219
 the+2.048
.+1.990
 a+1.785
Neg logit contributions
lor-7.682
umm-7.482
 former-7.467
arians-7.405
ocol-7.393
Token He
Token ID679
Feature activation+5.44e+02

Pos logit contributions
 and+4.622
,+4.576
 the+4.369
.+4.355
 a+4.105
Neg logit contributions
umm-5.470
lor-5.460
 former-5.305
osity-5.237
arians-5.200
Token made
Token ID925
Feature activation+5.44e+02

Pos logit contributions
 and+4.189
,+4.176
.+4.016
 the+3.963
 a+3.718
Neg logit contributions
lor-5.836
umm-5.690
arians-5.652
 former-5.647
rol-5.525
Token the
Token ID262
Feature activation+5.44e+02

Pos logit contributions
 and+3.215
,+3.183
.+2.978
 the+2.844
 a+2.569
Neg logit contributions
lor-6.839
umm-6.688
 former-6.627
rol-6.559
ocol-6.558
Token biggest
Token ID4094
Feature activation+5.44e+02

Pos logit contributions
 and+5.434
,+5.391
.+5.208
 the+5.184
 a+4.917
Neg logit contributions
umm-4.646
lor-4.630
 former-4.551
arians-4.510
osity-4.378
Token and
Token ID290
Feature activation+5.44e+02

Pos logit contributions
,+1.807
 and+1.796
 the+1.683
.+1.645
 a+1.411
Neg logit contributions
lor-7.973
arians-7.825
umm-7.807
 former-7.768
rol-7.755
Token they
Token ID484
Feature activation+5.44e+02

Pos logit contributions
 and+4.218
,+4.155
.+4.039
 the+3.898
 a+3.671
Neg logit contributions
lor-5.738
arians-5.608
umm-5.607
 former-5.534
osity-5.494
Token went
Token ID1816
Feature activation+5.44e+02

Pos logit contributions
 and+3.966
,+3.895
.+3.738
 the+3.682
 a+3.435
Neg logit contributions
lor-5.978
umm-5.855
arians-5.845
 former-5.802
rol-5.786
Token to
Token ID284
Feature activation+5.44e+02

Pos logit contributions
 and+2.622
,+2.540
 the+2.386
.+2.346
 a+2.132
Neg logit contributions
lor-7.211
arians-7.079
 former-7.053
umm-7.040
rol-7.033
Token the
Token ID262
Feature activation+5.44e+02

Pos logit contributions
 and+2.045
,+1.963
.+1.770
 the+1.546
 a+1.359
Neg logit contributions
 former-7.784
lor-7.784
arians-7.703
umm-7.680
ocol-7.670
Token top
Token ID1353
Feature activation+5.44e+02

Pos logit contributions
 and+5.593
,+5.518
.+5.326
 the+5.273
 a+5.002
Neg logit contributions
lor-4.475
umm-4.428
 former-4.369
arians-4.306
rol-4.162
Token of
Token ID286
Feature activation+5.44e+02

Pos logit contributions
 and+2.650
,+2.583
 the+2.426
.+2.317
 a+2.260
Neg logit contributions
umm-7.229
lor-7.228
rol-7.013
arians-7.009
 former-7.000
Token a
Token ID257
Feature activation+5.44e+02

Pos logit contributions
 and+0.786
,+0.741
.+0.556
 the+0.264
 a+0.058
Neg logit contributions
lor-9.100
umm-9.085
arians-9.068
 former-9.068
ocol-9.066
Token hill
Token ID12788
Feature activation+5.44e+02

Pos logit contributions
 and+4.962
,+4.850
.+4.694
 the+4.689
 a+4.434
Neg logit contributions
lor-5.082
umm-5.016
 former-4.956
arians-4.898
ocol-4.772
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
 and+0.707
,+0.661
 the+0.549
 a+0.344
.+0.339
Neg logit contributions
lor-9.098
umm-9.040
rol-8.979
 former-8.927
arians-8.922
Token From
Token ID3574
Feature activation+5.44e+02

Pos logit contributions
 and+5.109
,+5.068
.+4.907
 the+4.859
 a+4.584
Neg logit contributions
lor-4.886
umm-4.872
 former-4.759
arians-4.752
osity-4.698
Token surprised
Token ID6655
Feature activation+5.44e+02

Pos logit contributions
 and+4.137
,+4.043
.+3.887
 the+3.840
 a+3.565
Neg logit contributions
lor-5.865
umm-5.840
 former-5.820
ocol-5.749
arians-5.732
Token by
Token ID416
Feature activation+5.44e+02

Pos logit contributions
 and+1.626
,+1.581
 the+1.456
.+1.366
 a+1.295
Neg logit contributions
lor-8.188
umm-8.187
arians-8.027
osity-7.965
ocol-7.934
Token what
Token ID644
Feature activation+5.44e+02

Pos logit contributions
 and+4.181
,+4.125
.+3.935
 the+3.803
 a+3.618
Neg logit contributions
lor-5.825
umm-5.788
arians-5.733
ocol-5.726
 former-5.717
Token she
Token ID673
Feature activation+5.44e+02

Pos logit contributions
 and+4.128
,+4.031
.+3.867
 the+3.791
 a+3.566
Neg logit contributions
lor-5.947
umm-5.826
arians-5.754
 former-5.677
osity-5.662
Token had
Token ID550
Feature activation+5.44e+02

Pos logit contributions
 and+3.607
,+3.576
.+3.401
 the+3.360
 a+3.125
Neg logit contributions
lor-6.436
umm-6.350
arians-6.316
 former-6.227
ocol-6.183
Token done
Token ID1760
Feature activation+5.44e+02

Pos logit contributions
 and+5.076
,+5.020
 the+4.812
.+4.811
 a+4.549
Neg logit contributions
lor-5.009
umm-4.829
 former-4.803
arians-4.800
ides-4.729
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
 and+1.711
,+1.653
 the+1.502
.+1.425
 a+1.281
Neg logit contributions
lor-8.141
umm-7.990
arians-7.941
 former-7.923
ocol-7.878
Token
Token ID198
Feature activation+5.44e+02

Pos logit contributions
 and+3.958
,+3.889
.+3.710
 the+3.681
 a+3.427
Neg logit contributions
lor-6.173
umm-6.061
arians-5.984
 former-5.931
osity-5.872
Token
Token ID198
Feature activation+5.44e+02

Pos logit contributions
 and+1.646
,+1.597
.+1.380
 the+1.355
 a+1.114
Neg logit contributions
lor-8.483
umm-8.221
arians-8.221
 former-8.168
ocol-8.128
TokenShe
Token ID3347
Feature activation+5.44e+02

Pos logit contributions
 and+5.573
,+5.510
.+5.332
 the+5.308
 a+5.056
Neg logit contributions
lor-4.566
umm-4.455
arians-4.386
 former-4.334
ocol-4.295
Token started
Token ID2067
Feature activation+5.44e+02

Pos logit contributions
 and+3.990
,+3.980
.+3.822
 the+3.778
 a+3.525
Neg logit contributions
lor-5.994
arians-5.857
umm-5.836
 former-5.831
ocol-5.699
TokenFinally
Token ID11158
Feature activation+5.44e+02

Pos logit contributions
 and+5.755
,+5.710
.+5.529
 the+5.507
 a+5.240
Neg logit contributions
lor-4.358
umm-4.264
arians-4.166
 former-4.146
osity-4.071
Token,
Token ID11
Feature activation+5.44e+02

Pos logit contributions
 and+0.332
.+0.049
,+0.030
 the-0.038
 a-0.239
Neg logit contributions
umm-9.542
lor-9.539
ocol-9.425
arians-9.348
osity-9.285
Token Jack
Token ID3619
Feature activation+5.44e+02

Pos logit contributions
 and+3.568
,+3.460
.+3.355
 the+3.166
 a+2.934
Neg logit contributions
umm-6.371
lor-6.241
 former-6.171
arians-6.124
ocol-6.099
Token reached
Token ID4251
Feature activation+5.44e+02

Pos logit contributions
 and+3.972
,+3.948
.+3.814
 the+3.803
 a+3.571
Neg logit contributions
lor-5.972
umm-5.769
arians-5.748
 former-5.655
rol-5.650
Token the
Token ID262
Feature activation+5.44e+02

Pos logit contributions
 and+0.991
,+0.883
.+0.718
 the+0.550
 a+0.331
Neg logit contributions
lor-8.916
umm-8.801
 former-8.786
ocol-8.720
arians-8.698
Token top
Token ID1353
Feature activation+5.44e+02

Pos logit contributions
 and+5.804
,+5.741
.+5.558
 the+5.509
 a+5.242
Neg logit contributions
umm-4.268
lor-4.197
 former-4.099
arians-4.080
osity-3.986
Token of
Token ID286
Feature activation+5.44e+02

Pos logit contributions
 and+2.536
,+2.509
 the+2.368
.+2.266
 a+2.177
Neg logit contributions
umm-7.351
lor-7.239
arians-7.046
 former-7.028
rol-7.008
Token the
Token ID262
Feature activation+5.44e+02

Pos logit contributions
 and+0.951
,+0.919
.+0.756
 the+0.453
 a+0.263
Neg logit contributions
umm-8.902
ocol-8.865
lor-8.854
 former-8.804
arians-8.734
Token vine
Token ID17793
Feature activation+5.44e+02

Pos logit contributions
 and+6.387
,+6.322
.+6.168
 the+6.132
 a+5.898
Neg logit contributions
umm-3.665
lor-3.596
arians-3.459
 former-3.449
osity-3.358
Token.
Token ID13
Feature activation+5.44e+02

Pos logit contributions
 and+1.633
,+1.593
 the+1.515
.+1.347
 a+1.238
Neg logit contributions
lor-8.277
umm-8.198
arians-8.038
 former-8.034
rol-7.996
Token He
Token ID679
Feature activation+5.44e+02

Pos logit contributions
 and+4.139
,+4.082
.+3.903
 the+3.876
 a+3.595
Neg logit contributions
umm-5.943
lor-5.925
 former-5.764
arians-5.764
osity-5.724