TOP ACTIVATIONS
MAX = 55.022

Tom went to his bed and hid under the covers .
Tom went to his bed and hid under the covers
. Tom went to his bed and hid under
Tom went to his bed and hid under the
talking , table ?" The table did not say anything either
you talking , table ?" The table did not say anything
, table ?" The table did not say anything either .
table ?" The table did not say anything either . Tom
either . Tom was scared . Tom went to
anything either . Tom was scared . Tom went
not say anything either . Tom was scared .
say anything either . Tom was scared . Tom
Tom was scared . Tom went to his bed
. Tom was scared . Tom went to his
was scared . Tom went to his bed and
scared . Tom went to his bed and hid
did not say anything either . Tom was scared .
table did not say anything either . Tom was scared .
?" The table did not say anything either . Tom was
The table did not say anything either . Tom was scared

MIN ACTIVATIONS
MIN = 55.022

to create her own ash tr ay . She went to
, he saw an ash tr ay on the table .
. " B OO !! !" The ship
the ancient eye . <|endoftext|> E lla was having a special
were very happy . <|endoftext|> E lla loved to visit her
. " AAAA H !" Tom and Anna scream
The sign said " ST OP ". " Let
a book . E ma was disappointed . Reading
tr ay . The ash tr ay fell down and broke
hit her mom 's ash tr ay . The ash tr
named Bobby found the ash tr ay by the window .
Suddenly , the ash tr ay spoke , “
Did you destroy my ash tr ay ?" Lily felt bad
and saw the broken ash tr ay . She was upset
cleaned up the broken ash tr ay and Lily learned to
also visiting Grand ma . Exc ited ly Tony asked Sarah
tr ay . The ash tr ay lived in a house
there was an ordinary ash tr ay . The ash tr
eat honey from the be eh ive . One day ,
they see a big ash tr ay near a bench .

INTERVAL 55.022 - 55.022
CONTAINS 0.000%

TokenTom
Token ID13787
Feature activation+5.50e+01

Pos logit contributions
 and+3.687
,+3.562
.+3.397
 the+3.319
 a+3.225
Neg logit contributions
umm-4.882
lor-4.844
ocol-4.760
arians-4.719
 former-4.711
Token went
Token ID1816
Feature activation+5.50e+01

Pos logit contributions
 and+2.963
,+2.892
.+2.778
 the+2.711
 a+2.614
Neg logit contributions
umm-5.348
lor-5.343
arians-5.275
 former-5.215
ocol-5.205
Token to
Token ID284
Feature activation+5.50e+01

Pos logit contributions
 and+1.863
,+1.752
.+1.589
 the+1.515
 a+1.428
Neg logit contributions
umm-6.603
lor-6.542
ocol-6.493
 former-6.459
arians-6.451
Token his
Token ID465
Feature activation+5.50e+01

Pos logit contributions
 and+2.406
,+2.288
.+2.117
 the+1.958
 a+1.899
Neg logit contributions
umm-6.076
ocol-6.020
lor-5.976
 former-5.965
arians-5.918
Token bed
Token ID3996
Feature activation+5.50e+01

Pos logit contributions
 and+3.796
,+3.682
.+3.525
 the+3.457
 a+3.353
Neg logit contributions
umm-4.731
 former-4.621
lor-4.576
arians-4.555
ocol-4.543
Token and
Token ID290
Feature activation+5.50e+01

Pos logit contributions
 and+1.139
,+1.038
 the+0.892
.+0.868
 a+0.783
Neg logit contributions
umm-7.257
lor-7.218
 former-7.136
arians-7.077
ocol-7.042
Token hid
Token ID24519
Feature activation+5.50e+01

Pos logit contributions
 and+3.590
,+3.484
.+3.332
 the+3.198
 a+3.108
Neg logit contributions
umm-4.811
lor-4.796
 former-4.755
ocol-4.728
arians-4.696
Token under
Token ID739
Feature activation+5.50e+01

Pos logit contributions
 and+2.948
,+2.847
.+2.618
 the+2.548
 a+2.489
Neg logit contributions
umm-5.514
ocol-5.477
lor-5.387
arians-5.334
 former-5.329
Token the
Token ID262
Feature activation+5.50e+01

Pos logit contributions
 and+1.576
,+1.456
.+1.272
 the+1.108
 a+1.023
Neg logit contributions
umm-6.921
ocol-6.871
lor-6.809
 former-6.769
arians-6.698
Token covers
Token ID8698
Feature activation+5.50e+01

Pos logit contributions
 and+4.449
,+4.353
.+4.164
 the+4.122
 a+4.045
Neg logit contributions
umm-4.056
ocol-3.946
lor-3.928
 former-3.845
arians-3.803
Token.
Token ID13
Feature activation+5.50e+01

Pos logit contributions
 and+0.788
,+0.680
.+0.454
 the+0.450
 a+0.390
Neg logit contributions
umm-7.681
lor-7.604
ocol-7.542
 former-7.447
arians-7.443
Token
Token ID198
Feature activation+5.50e+01

Pos logit contributions
 and+1.412
,+1.293
.+1.098
 the+1.020
 a+0.941
Neg logit contributions
lor-7.041
umm-7.024
ocol-6.946
 former-6.898
arians-6.858
TokenTom
Token ID13787
Feature activation+5.50e+01

Pos logit contributions
 and+3.687
,+3.562
.+3.397
 the+3.319
 a+3.225
Neg logit contributions
umm-4.882
lor-4.844
ocol-4.760
arians-4.719
 former-4.711
Token went
Token ID1816
Feature activation+5.50e+01

Pos logit contributions
 and+2.963
,+2.892
.+2.778
 the+2.711
 a+2.614
Neg logit contributions
umm-5.348
lor-5.343
arians-5.275
 former-5.215
ocol-5.205
Token to
Token ID284
Feature activation+5.50e+01

Pos logit contributions
 and+1.863
,+1.752
.+1.589
 the+1.515
 a+1.428
Neg logit contributions
umm-6.603
lor-6.542
ocol-6.493
 former-6.459
arians-6.451
Token his
Token ID465
Feature activation+5.50e+01

Pos logit contributions
 and+2.406
,+2.288
.+2.117
 the+1.958
 a+1.899
Neg logit contributions
umm-6.076
ocol-6.020
lor-5.976
 former-5.965
arians-5.918
Token bed
Token ID3996
Feature activation+5.50e+01

Pos logit contributions
 and+3.796
,+3.682
.+3.525
 the+3.457
 a+3.353
Neg logit contributions
umm-4.731
 former-4.621
lor-4.576
arians-4.555
ocol-4.543
Token and
Token ID290
Feature activation+5.50e+01

Pos logit contributions
 and+1.139
,+1.038
 the+0.892
.+0.868
 a+0.783
Neg logit contributions
umm-7.257
lor-7.218
 former-7.136
arians-7.077
ocol-7.042
Token hid
Token ID24519
Feature activation+5.50e+01

Pos logit contributions
 and+3.590
,+3.484
.+3.332
 the+3.198
 a+3.108
Neg logit contributions
umm-4.811
lor-4.796
 former-4.755
ocol-4.728
arians-4.696
Token under
Token ID739
Feature activation+5.50e+01

Pos logit contributions
 and+2.948
,+2.847
.+2.618
 the+2.548
 a+2.489
Neg logit contributions
umm-5.514
ocol-5.477
lor-5.387
arians-5.334
 former-5.329
Token the
Token ID262
Feature activation+5.50e+01

Pos logit contributions
 and+1.576
,+1.456
.+1.272
 the+1.108
 a+1.023
Neg logit contributions
umm-6.921
ocol-6.871
lor-6.809
 former-6.769
arians-6.698
Token covers
Token ID8698
Feature activation+5.50e+01

Pos logit contributions
 and+4.449
,+4.353
.+4.164
 the+4.122
 a+4.045
Neg logit contributions
umm-4.056
ocol-3.946
lor-3.928
 former-3.845
arians-3.803
Token.
Token ID13
Feature activation+5.50e+01

Pos logit contributions
 and+0.530
,+0.416
.+0.244
 the+0.242
 a+0.178
Neg logit contributions
umm-7.916
ocol-7.783
lor-7.716
arians-7.661
osity-7.659
Token
Token ID198
Feature activation+5.50e+01

Pos logit contributions
 and+3.090
,+2.970
.+2.784
 the+2.704
 a+2.605
Neg logit contributions
umm-5.474
lor-5.356
ocol-5.303
 former-5.286
arians-5.252
Token
Token ID198
Feature activation+5.50e+01

Pos logit contributions
 and+1.412
,+1.293
.+1.098
 the+1.020
 a+0.941
Neg logit contributions
lor-7.041
umm-7.024
ocol-6.946
 former-6.898
arians-6.858
TokenTom
Token ID13787
Feature activation+5.50e+01

Pos logit contributions
 and+3.687
,+3.562
.+3.397
 the+3.319
 a+3.225
Neg logit contributions
umm-4.882
lor-4.844
ocol-4.760
arians-4.719
 former-4.711
Token went
Token ID1816
Feature activation+5.50e+01

Pos logit contributions
 and+2.963
,+2.892
.+2.778
 the+2.711
 a+2.614
Neg logit contributions
umm-5.348
lor-5.343
arians-5.275
 former-5.215
ocol-5.205
Token to
Token ID284
Feature activation+5.50e+01

Pos logit contributions
 and+1.863
,+1.752
.+1.589
 the+1.515
 a+1.428
Neg logit contributions
umm-6.603
lor-6.542
ocol-6.493
 former-6.459
arians-6.451
Token his
Token ID465
Feature activation+5.50e+01

Pos logit contributions
 and+2.406
,+2.288
.+2.117
 the+1.958
 a+1.899
Neg logit contributions
umm-6.076
ocol-6.020
lor-5.976
 former-5.965
arians-5.918
Token bed
Token ID3996
Feature activation+5.50e+01

Pos logit contributions
 and+3.796
,+3.682
.+3.525
 the+3.457
 a+3.353
Neg logit contributions
umm-4.731
 former-4.621
lor-4.576
arians-4.555
ocol-4.543
Token and
Token ID290
Feature activation+5.50e+01

Pos logit contributions
 and+1.139
,+1.038
 the+0.892
.+0.868
 a+0.783
Neg logit contributions
umm-7.257
lor-7.218
 former-7.136
arians-7.077
ocol-7.042
Token hid
Token ID24519
Feature activation+5.50e+01

Pos logit contributions
 and+3.590
,+3.484
.+3.332
 the+3.198
 a+3.108
Neg logit contributions
umm-4.811
lor-4.796
 former-4.755
ocol-4.728
arians-4.696
Token under
Token ID739
Feature activation+5.50e+01

Pos logit contributions
 and+2.948
,+2.847
.+2.618
 the+2.548
 a+2.489
Neg logit contributions
umm-5.514
ocol-5.477
lor-5.387
arians-5.334
 former-5.329
Token
Token ID198
Feature activation+5.50e+01

Pos logit contributions
 and+3.090
,+2.970
.+2.784
 the+2.704
 a+2.605
Neg logit contributions
umm-5.474
lor-5.356
ocol-5.303
 former-5.286
arians-5.252
Token
Token ID198
Feature activation+5.50e+01

Pos logit contributions
 and+1.412
,+1.293
.+1.098
 the+1.020
 a+0.941
Neg logit contributions
lor-7.041
umm-7.024
ocol-6.946
 former-6.898
arians-6.858
TokenTom
Token ID13787
Feature activation+5.50e+01

Pos logit contributions
 and+3.687
,+3.562
.+3.397
 the+3.319
 a+3.225
Neg logit contributions
umm-4.882
lor-4.844
ocol-4.760
arians-4.719
 former-4.711
Token went
Token ID1816
Feature activation+5.50e+01

Pos logit contributions
 and+2.963
,+2.892
.+2.778
 the+2.711
 a+2.614
Neg logit contributions
umm-5.348
lor-5.343
arians-5.275
 former-5.215
ocol-5.205
Token to
Token ID284
Feature activation+5.50e+01

Pos logit contributions
 and+1.863
,+1.752
.+1.589
 the+1.515
 a+1.428
Neg logit contributions
umm-6.603
lor-6.542
ocol-6.493
 former-6.459
arians-6.451
Token his
Token ID465
Feature activation+5.50e+01

Pos logit contributions
 and+2.406
,+2.288
.+2.117
 the+1.958
 a+1.899
Neg logit contributions
umm-6.076
ocol-6.020
lor-5.976
 former-5.965
arians-5.918
Token bed
Token ID3996
Feature activation+5.50e+01

Pos logit contributions
 and+3.796
,+3.682
.+3.525
 the+3.457
 a+3.353
Neg logit contributions
umm-4.731
 former-4.621
lor-4.576
arians-4.555
ocol-4.543
Token and
Token ID290
Feature activation+5.50e+01

Pos logit contributions
 and+1.139
,+1.038
 the+0.892
.+0.868
 a+0.783
Neg logit contributions
umm-7.257
lor-7.218
 former-7.136
arians-7.077
ocol-7.042
Token hid
Token ID24519
Feature activation+5.50e+01

Pos logit contributions
 and+3.590
,+3.484
.+3.332
 the+3.198
 a+3.108
Neg logit contributions
umm-4.811
lor-4.796
 former-4.755
ocol-4.728
arians-4.696
Token under
Token ID739
Feature activation+5.50e+01

Pos logit contributions
 and+2.948
,+2.847
.+2.618
 the+2.548
 a+2.489
Neg logit contributions
umm-5.514
ocol-5.477
lor-5.387
arians-5.334
 former-5.329
Token the
Token ID262
Feature activation+5.50e+01

Pos logit contributions
 and+1.576
,+1.456
.+1.272
 the+1.108
 a+1.023
Neg logit contributions
umm-6.921
ocol-6.871
lor-6.809
 former-6.769
arians-6.698
Token talking
Token ID3375
Feature activation+5.50e+01

Pos logit contributions
 and+3.769
,+3.621
.+3.469
 the+3.427
 a+3.317
Neg logit contributions
umm-4.672
lor-4.617
ocol-4.582
 former-4.572
arians-4.534
Token,
Token ID11
Feature activation+5.50e+01

Pos logit contributions
 and+2.798
,+2.700
.+2.519
 the+2.499
 a+2.384
Neg logit contributions
umm-5.631
lor-5.565
 former-5.507
ocol-5.488
arians-5.468
Token table
Token ID3084
Feature activation+5.50e+01

Pos logit contributions
 and+3.745
,+3.653
.+3.526
 the+3.412
 a+3.331
Neg logit contributions
umm-4.714
ocol-4.595
 former-4.582
lor-4.571
arians-4.538
Token?"
Token ID1701
Feature activation+5.50e+01

Pos logit contributions
 and+2.163
,+2.045
.+1.893
 the+1.892
 a+1.782
Neg logit contributions
umm-6.283
lor-6.172
ocol-6.126
 former-6.089
arians-6.067
Token The
Token ID383
Feature activation+5.50e+01

Pos logit contributions
 and+3.157
,+3.027
.+2.874
 the+2.762
 a+2.693
Neg logit contributions
umm-5.294
ocol-5.226
lor-5.208
 former-5.169
arians-5.127
Token table
Token ID3084
Feature activation+5.50e+01

Pos logit contributions
 and+4.237
,+4.129
.+3.956
 the+3.874
 a+3.767
Neg logit contributions
umm-4.312
ocol-4.202
 former-4.137
lor-4.128
arians-4.097
Token did
Token ID750
Feature activation+5.50e+01

Pos logit contributions
 and+3.767
,+3.652
.+3.507
 the+3.465
 a+3.360
Neg logit contributions
umm-4.683
lor-4.598
ocol-4.544
 former-4.491
arians-4.473
Token not
Token ID407
Feature activation+5.50e+01

Pos logit contributions
 and+2.385
,+2.255
.+2.096
 the+2.031
 a+1.925
Neg logit contributions
umm-6.051
lor-5.998
ocol-5.971
 former-5.955
arians-5.896
Token say
Token ID910
Feature activation+5.50e+01

Pos logit contributions
 and+3.928
,+3.808
.+3.643
 the+3.586
 a+3.479
Neg logit contributions
umm-4.599
ocol-4.497
lor-4.441
 former-4.426
arians-4.395
Token anything
Token ID1997
Feature activation+5.50e+01

Pos logit contributions
 and+4.183
,+4.003
.+3.863
 the+3.778
 a+3.681
Neg logit contributions
umm-4.334
lor-4.286
ocol-4.250
osity-4.151
 former-4.151
Token either
Token ID2035
Feature activation+5.50e+01

Pos logit contributions
 and+1.879
,+1.754
.+1.579
 the+1.551
 a+1.473
Neg logit contributions
umm-6.560
lor-6.471
ocol-6.446
 former-6.397
arians-6.397
Token you
Token ID345
Feature activation+5.50e+01

Pos logit contributions
 and+3.252
,+3.147
.+3.013
 the+2.930
 a+2.863
Neg logit contributions
umm-5.173
lor-5.075
arians-5.061
ocol-5.038
 former-5.002
Token talking
Token ID3375
Feature activation+5.50e+01

Pos logit contributions
 and+3.769
,+3.621
.+3.469
 the+3.427
 a+3.317
Neg logit contributions
umm-4.672
lor-4.617
ocol-4.582
 former-4.572
arians-4.534
Token,
Token ID11
Feature activation+5.50e+01

Pos logit contributions
 and+2.798
,+2.700
.+2.519
 the+2.499
 a+2.384
Neg logit contributions
umm-5.631
lor-5.565
 former-5.507
ocol-5.488
arians-5.468
Token table
Token ID3084
Feature activation+5.50e+01

Pos logit contributions
 and+3.745
,+3.653
.+3.526
 the+3.412
 a+3.331
Neg logit contributions
umm-4.714
ocol-4.595
 former-4.582
lor-4.571
arians-4.538
Token?"
Token ID1701
Feature activation+5.50e+01

Pos logit contributions
 and+2.163
,+2.045
.+1.893
 the+1.892
 a+1.782
Neg logit contributions
umm-6.283
lor-6.172
ocol-6.126
 former-6.089
arians-6.067
Token The
Token ID383
Feature activation+5.50e+01

Pos logit contributions
 and+3.157
,+3.027
.+2.874
 the+2.762
 a+2.693
Neg logit contributions
umm-5.294
ocol-5.226
lor-5.208
 former-5.169
arians-5.127
Token table
Token ID3084
Feature activation+5.50e+01

Pos logit contributions
 and+4.237
,+4.129
.+3.956
 the+3.874
 a+3.767
Neg logit contributions
umm-4.312
ocol-4.202
 former-4.137
lor-4.128
arians-4.097
Token did
Token ID750
Feature activation+5.50e+01

Pos logit contributions
 and+3.767
,+3.652
.+3.507
 the+3.465
 a+3.360
Neg logit contributions
umm-4.683
lor-4.598
ocol-4.544
 former-4.491
arians-4.473
Token not
Token ID407
Feature activation+5.50e+01

Pos logit contributions
 and+2.385
,+2.255
.+2.096
 the+2.031
 a+1.925
Neg logit contributions
umm-6.051
lor-5.998
ocol-5.971
 former-5.955
arians-5.896
Token say
Token ID910
Feature activation+5.50e+01

Pos logit contributions
 and+3.928
,+3.808
.+3.643
 the+3.586
 a+3.479
Neg logit contributions
umm-4.599
ocol-4.497
lor-4.441
 former-4.426
arians-4.395
Token anything
Token ID1997
Feature activation+5.50e+01

Pos logit contributions
 and+4.183
,+4.003
.+3.863
 the+3.778
 a+3.681
Neg logit contributions
umm-4.334
lor-4.286
ocol-4.250
osity-4.151
 former-4.151
Token,
Token ID11
Feature activation+5.50e+01

Pos logit contributions
 and+2.798
,+2.700
.+2.519
 the+2.499
 a+2.384
Neg logit contributions
umm-5.631
lor-5.565
 former-5.507
ocol-5.488
arians-5.468
Token table
Token ID3084
Feature activation+5.50e+01

Pos logit contributions
 and+3.745
,+3.653
.+3.526
 the+3.412
 a+3.331
Neg logit contributions
umm-4.714
ocol-4.595
 former-4.582
lor-4.571
arians-4.538
Token?"
Token ID1701
Feature activation+5.50e+01

Pos logit contributions
 and+2.163
,+2.045
.+1.893
 the+1.892
 a+1.782
Neg logit contributions
umm-6.283
lor-6.172
ocol-6.126
 former-6.089
arians-6.067
Token The
Token ID383
Feature activation+5.50e+01

Pos logit contributions
 and+3.157
,+3.027
.+2.874
 the+2.762
 a+2.693
Neg logit contributions
umm-5.294
ocol-5.226
lor-5.208
 former-5.169
arians-5.127
Token table
Token ID3084
Feature activation+5.50e+01

Pos logit contributions
 and+4.237
,+4.129
.+3.956
 the+3.874
 a+3.767
Neg logit contributions
umm-4.312
ocol-4.202
 former-4.137
lor-4.128
arians-4.097
Token did
Token ID750
Feature activation+5.50e+01

Pos logit contributions
 and+3.767
,+3.652
.+3.507
 the+3.465
 a+3.360
Neg logit contributions
umm-4.683
lor-4.598
ocol-4.544
 former-4.491
arians-4.473
Token not
Token ID407
Feature activation+5.50e+01

Pos logit contributions
 and+2.385
,+2.255
.+2.096
 the+2.031
 a+1.925
Neg logit contributions
umm-6.051
lor-5.998
ocol-5.971
 former-5.955
arians-5.896
Token say
Token ID910
Feature activation+5.50e+01

Pos logit contributions
 and+3.928
,+3.808
.+3.643
 the+3.586
 a+3.479
Neg logit contributions
umm-4.599
ocol-4.497
lor-4.441
 former-4.426
arians-4.395
Token anything
Token ID1997
Feature activation+5.50e+01

Pos logit contributions
 and+4.183
,+4.003
.+3.863
 the+3.778
 a+3.681
Neg logit contributions
umm-4.334
lor-4.286
ocol-4.250
osity-4.151
 former-4.151
Token either
Token ID2035
Feature activation+5.50e+01

Pos logit contributions
 and+1.879
,+1.754
.+1.579
 the+1.551
 a+1.473
Neg logit contributions
umm-6.560
lor-6.471
ocol-6.446
 former-6.397
arians-6.397
Token.
Token ID13
Feature activation+5.50e+01

Pos logit contributions
 and+0.913
,+0.778
.+0.608
 the+0.606
 a+0.525
Neg logit contributions
umm-7.529
lor-7.435
ocol-7.407
osity-7.325
arians-7.323
Token table
Token ID3084
Feature activation+5.50e+01

Pos logit contributions
 and+3.745
,+3.653
.+3.526
 the+3.412
 a+3.331
Neg logit contributions
umm-4.714
ocol-4.595
 former-4.582
lor-4.571
arians-4.538
Token?"
Token ID1701
Feature activation+5.50e+01

Pos logit contributions
 and+2.163
,+2.045
.+1.893
 the+1.892
 a+1.782
Neg logit contributions
umm-6.283
lor-6.172
ocol-6.126
 former-6.089
arians-6.067
Token The
Token ID383
Feature activation+5.50e+01

Pos logit contributions
 and+3.157
,+3.027
.+2.874
 the+2.762
 a+2.693
Neg logit contributions
umm-5.294
ocol-5.226
lor-5.208
 former-5.169
arians-5.127
Token table
Token ID3084
Feature activation+5.50e+01

Pos logit contributions
 and+4.237
,+4.129
.+3.956
 the+3.874
 a+3.767
Neg logit contributions
umm-4.312
ocol-4.202
 former-4.137
lor-4.128
arians-4.097
Token did
Token ID750
Feature activation+5.50e+01

Pos logit contributions
 and+3.767
,+3.652
.+3.507
 the+3.465
 a+3.360
Neg logit contributions
umm-4.683
lor-4.598
ocol-4.544
 former-4.491
arians-4.473
Token not
Token ID407
Feature activation+5.50e+01

Pos logit contributions
 and+2.385
,+2.255
.+2.096
 the+2.031
 a+1.925
Neg logit contributions
umm-6.051
lor-5.998
ocol-5.971
 former-5.955
arians-5.896
Token say
Token ID910
Feature activation+5.50e+01

Pos logit contributions
 and+3.928
,+3.808
.+3.643
 the+3.586
 a+3.479
Neg logit contributions
umm-4.599
ocol-4.497
lor-4.441
 former-4.426
arians-4.395
Token anything
Token ID1997
Feature activation+5.50e+01

Pos logit contributions
 and+4.183
,+4.003
.+3.863
 the+3.778
 a+3.681
Neg logit contributions
umm-4.334
lor-4.286
ocol-4.250
osity-4.151
 former-4.151
Token either
Token ID2035
Feature activation+5.50e+01

Pos logit contributions
 and+1.879
,+1.754
.+1.579
 the+1.551
 a+1.473
Neg logit contributions
umm-6.560
lor-6.471
ocol-6.446
 former-6.397
arians-6.397
Token.
Token ID13
Feature activation+5.50e+01

Pos logit contributions
 and+0.913
,+0.778
.+0.608
 the+0.606
 a+0.525
Neg logit contributions
umm-7.529
lor-7.435
ocol-7.407
osity-7.325
arians-7.323
Token Tom
Token ID4186
Feature activation+5.50e+01

Pos logit contributions
 and+2.972
,+2.861
.+2.681
 the+2.591
 a+2.495
Neg logit contributions
umm-5.549
lor-5.485
ocol-5.433
 former-5.405
arians-5.364
Token either
Token ID2035
Feature activation+5.50e+01

Pos logit contributions
 and+1.879
,+1.754
.+1.579
 the+1.551
 a+1.473
Neg logit contributions
umm-6.560
lor-6.471
ocol-6.446
 former-6.397
arians-6.397
Token.
Token ID13
Feature activation+5.50e+01

Pos logit contributions
 and+0.913
,+0.778
.+0.608
 the+0.606
 a+0.525
Neg logit contributions
umm-7.529
lor-7.435
ocol-7.407
osity-7.325
arians-7.323
Token Tom
Token ID4186
Feature activation+5.50e+01

Pos logit contributions
 and+2.972
,+2.861
.+2.681
 the+2.591
 a+2.495
Neg logit contributions
umm-5.549
lor-5.485
ocol-5.433
 former-5.405
arians-5.364
Token was
Token ID373
Feature activation+5.50e+01

Pos logit contributions
 and+3.005
,+2.934
.+2.806
 the+2.736
 a+2.639
Neg logit contributions
umm-5.344
lor-5.304
arians-5.239
 former-5.221
ocol-5.202
Token scared
Token ID12008
Feature activation+5.50e+01

Pos logit contributions
 and+3.713
,+3.613
.+3.442
 the+3.330
 a+3.196
Neg logit contributions
umm-4.723
ocol-4.659
 former-4.649
lor-4.636
arians-4.572
Token.
Token ID13
Feature activation+5.50e+01

Pos logit contributions
 and+0.530
,+0.416
.+0.244
 the+0.242
 a+0.178
Neg logit contributions
umm-7.916
ocol-7.783
lor-7.716
arians-7.661
osity-7.659
Token
Token ID198
Feature activation+5.50e+01

Pos logit contributions
 and+3.090
,+2.970
.+2.784
 the+2.704
 a+2.605
Neg logit contributions
umm-5.474
lor-5.356
ocol-5.303
 former-5.286
arians-5.252
Token
Token ID198
Feature activation+5.50e+01

Pos logit contributions
 and+1.412
,+1.293
.+1.098
 the+1.020
 a+0.941
Neg logit contributions
lor-7.041
umm-7.024
ocol-6.946
 former-6.898
arians-6.858
TokenTom
Token ID13787
Feature activation+5.50e+01

Pos logit contributions
 and+3.687
,+3.562
.+3.397
 the+3.319
 a+3.225
Neg logit contributions
umm-4.882
lor-4.844
ocol-4.760
arians-4.719
 former-4.711
Token went
Token ID1816
Feature activation+5.50e+01

Pos logit contributions
 and+2.963
,+2.892
.+2.778
 the+2.711
 a+2.614
Neg logit contributions
umm-5.348
lor-5.343
arians-5.275
 former-5.215
ocol-5.205
Token to
Token ID284
Feature activation+5.50e+01

Pos logit contributions
 and+1.863
,+1.752
.+1.589
 the+1.515
 a+1.428
Neg logit contributions
umm-6.603
lor-6.542
ocol-6.493
 former-6.459
arians-6.451
Token anything
Token ID1997
Feature activation+5.50e+01

Pos logit contributions
 and+4.183
,+4.003
.+3.863
 the+3.778
 a+3.681
Neg logit contributions
umm-4.334
lor-4.286
ocol-4.250
osity-4.151
 former-4.151
Token either
Token ID2035
Feature activation+5.50e+01

Pos logit contributions
 and+1.879
,+1.754
.+1.579
 the+1.551
 a+1.473
Neg logit contributions
umm-6.560
lor-6.471
ocol-6.446
 former-6.397
arians-6.397
Token.
Token ID13
Feature activation+5.50e+01

Pos logit contributions
 and+0.913
,+0.778
.+0.608
 the+0.606
 a+0.525
Neg logit contributions
umm-7.529
lor-7.435
ocol-7.407
osity-7.325
arians-7.323
Token Tom
Token ID4186
Feature activation+5.50e+01

Pos logit contributions
 and+2.972
,+2.861
.+2.681
 the+2.591
 a+2.495
Neg logit contributions
umm-5.549
lor-5.485
ocol-5.433
 former-5.405
arians-5.364
Token was
Token ID373
Feature activation+5.50e+01

Pos logit contributions
 and+3.005
,+2.934
.+2.806
 the+2.736
 a+2.639
Neg logit contributions
umm-5.344
lor-5.304
arians-5.239
 former-5.221
ocol-5.202
Token scared
Token ID12008
Feature activation+5.50e+01

Pos logit contributions
 and+3.713
,+3.613
.+3.442
 the+3.330
 a+3.196
Neg logit contributions
umm-4.723
ocol-4.659
 former-4.649
lor-4.636
arians-4.572
Token.
Token ID13
Feature activation+5.50e+01

Pos logit contributions
 and+0.530
,+0.416
.+0.244
 the+0.242
 a+0.178
Neg logit contributions
umm-7.916
ocol-7.783
lor-7.716
arians-7.661
osity-7.659
Token
Token ID198
Feature activation+5.50e+01

Pos logit contributions
 and+3.090
,+2.970
.+2.784
 the+2.704
 a+2.605
Neg logit contributions
umm-5.474
lor-5.356
ocol-5.303
 former-5.286
arians-5.252
Token
Token ID198
Feature activation+5.50e+01

Pos logit contributions
 and+1.412
,+1.293
.+1.098
 the+1.020
 a+0.941
Neg logit contributions
lor-7.041
umm-7.024
ocol-6.946
 former-6.898
arians-6.858
TokenTom
Token ID13787
Feature activation+5.50e+01

Pos logit contributions
 and+3.687
,+3.562
.+3.397
 the+3.319
 a+3.225
Neg logit contributions
umm-4.882
lor-4.844
ocol-4.760
arians-4.719
 former-4.711
Token went
Token ID1816
Feature activation+5.50e+01

Pos logit contributions
 and+2.963
,+2.892
.+2.778
 the+2.711
 a+2.614
Neg logit contributions
umm-5.348
lor-5.343
arians-5.275
 former-5.215
ocol-5.205
Token not
Token ID407
Feature activation+5.50e+01

Pos logit contributions
 and+2.385
,+2.255
.+2.096
 the+2.031
 a+1.925
Neg logit contributions
umm-6.051
lor-5.998
ocol-5.971
 former-5.955
arians-5.896
Token say
Token ID910
Feature activation+5.50e+01

Pos logit contributions
 and+3.928
,+3.808
.+3.643
 the+3.586
 a+3.479
Neg logit contributions
umm-4.599
ocol-4.497
lor-4.441
 former-4.426
arians-4.395
Token anything
Token ID1997
Feature activation+5.50e+01

Pos logit contributions
 and+4.183
,+4.003
.+3.863
 the+3.778
 a+3.681
Neg logit contributions
umm-4.334
lor-4.286
ocol-4.250
osity-4.151
 former-4.151
Token either
Token ID2035
Feature activation+5.50e+01

Pos logit contributions
 and+1.879
,+1.754
.+1.579
 the+1.551
 a+1.473
Neg logit contributions
umm-6.560
lor-6.471
ocol-6.446
 former-6.397
arians-6.397
Token.
Token ID13
Feature activation+5.50e+01

Pos logit contributions
 and+0.913
,+0.778
.+0.608
 the+0.606
 a+0.525
Neg logit contributions
umm-7.529
lor-7.435
ocol-7.407
osity-7.325
arians-7.323
Token Tom
Token ID4186
Feature activation+5.50e+01

Pos logit contributions
 and+2.972
,+2.861
.+2.681
 the+2.591
 a+2.495
Neg logit contributions
umm-5.549
lor-5.485
ocol-5.433
 former-5.405
arians-5.364
Token was
Token ID373
Feature activation+5.50e+01

Pos logit contributions
 and+3.005
,+2.934
.+2.806
 the+2.736
 a+2.639
Neg logit contributions
umm-5.344
lor-5.304
arians-5.239
 former-5.221
ocol-5.202
Token scared
Token ID12008
Feature activation+5.50e+01

Pos logit contributions
 and+3.713
,+3.613
.+3.442
 the+3.330
 a+3.196
Neg logit contributions
umm-4.723
ocol-4.659
 former-4.649
lor-4.636
arians-4.572
Token.
Token ID13
Feature activation+5.50e+01

Pos logit contributions
 and+0.530
,+0.416
.+0.244
 the+0.242
 a+0.178
Neg logit contributions
umm-7.916
ocol-7.783
lor-7.716
arians-7.661
osity-7.659
Token
Token ID198
Feature activation+5.50e+01

Pos logit contributions
 and+3.090
,+2.970
.+2.784
 the+2.704
 a+2.605
Neg logit contributions
umm-5.474
lor-5.356
ocol-5.303
 former-5.286
arians-5.252
Token
Token ID198
Feature activation+5.50e+01

Pos logit contributions
 and+1.412
,+1.293
.+1.098
 the+1.020
 a+0.941
Neg logit contributions
lor-7.041
umm-7.024
ocol-6.946
 former-6.898
arians-6.858
Token say
Token ID910
Feature activation+5.50e+01

Pos logit contributions
 and+3.928
,+3.808
.+3.643
 the+3.586
 a+3.479
Neg logit contributions
umm-4.599
ocol-4.497
lor-4.441
 former-4.426
arians-4.395
Token anything
Token ID1997
Feature activation+5.50e+01

Pos logit contributions
 and+4.183
,+4.003
.+3.863
 the+3.778
 a+3.681
Neg logit contributions
umm-4.334
lor-4.286
ocol-4.250
osity-4.151
 former-4.151
Token either
Token ID2035
Feature activation+5.50e+01

Pos logit contributions
 and+1.879
,+1.754
.+1.579
 the+1.551
 a+1.473
Neg logit contributions
umm-6.560
lor-6.471
ocol-6.446
 former-6.397
arians-6.397
Token.
Token ID13
Feature activation+5.50e+01

Pos logit contributions
 and+0.913
,+0.778
.+0.608
 the+0.606
 a+0.525
Neg logit contributions
umm-7.529
lor-7.435
ocol-7.407
osity-7.325
arians-7.323
Token Tom
Token ID4186
Feature activation+5.50e+01

Pos logit contributions
 and+2.972
,+2.861
.+2.681
 the+2.591
 a+2.495
Neg logit contributions
umm-5.549
lor-5.485
ocol-5.433
 former-5.405
arians-5.364
Token was
Token ID373
Feature activation+5.50e+01

Pos logit contributions
 and+3.005
,+2.934
.+2.806
 the+2.736
 a+2.639
Neg logit contributions
umm-5.344
lor-5.304
arians-5.239
 former-5.221
ocol-5.202
Token scared
Token ID12008
Feature activation+5.50e+01

Pos logit contributions
 and+3.713
,+3.613
.+3.442
 the+3.330
 a+3.196
Neg logit contributions
umm-4.723
ocol-4.659
 former-4.649
lor-4.636
arians-4.572
Token.
Token ID13
Feature activation+5.50e+01

Pos logit contributions
 and+0.530
,+0.416
.+0.244
 the+0.242
 a+0.178
Neg logit contributions
umm-7.916
ocol-7.783
lor-7.716
arians-7.661
osity-7.659
Token
Token ID198
Feature activation+5.50e+01

Pos logit contributions
 and+3.090
,+2.970
.+2.784
 the+2.704
 a+2.605
Neg logit contributions
umm-5.474
lor-5.356
ocol-5.303
 former-5.286
arians-5.252
Token
Token ID198
Feature activation+5.50e+01

Pos logit contributions
 and+1.412
,+1.293
.+1.098
 the+1.020
 a+0.941
Neg logit contributions
lor-7.041
umm-7.024
ocol-6.946
 former-6.898
arians-6.858
TokenTom
Token ID13787
Feature activation+5.50e+01

Pos logit contributions
 and+3.687
,+3.562
.+3.397
 the+3.319
 a+3.225
Neg logit contributions
umm-4.882
lor-4.844
ocol-4.760
arians-4.719
 former-4.711
Token Tom
Token ID4186
Feature activation+5.50e+01

Pos logit contributions
 and+2.972
,+2.861
.+2.681
 the+2.591
 a+2.495
Neg logit contributions
umm-5.549
lor-5.485
ocol-5.433
 former-5.405
arians-5.364
Token was
Token ID373
Feature activation+5.50e+01

Pos logit contributions
 and+3.005
,+2.934
.+2.806
 the+2.736
 a+2.639
Neg logit contributions
umm-5.344
lor-5.304
arians-5.239
 former-5.221
ocol-5.202
Token scared
Token ID12008
Feature activation+5.50e+01

Pos logit contributions
 and+3.713
,+3.613
.+3.442
 the+3.330
 a+3.196
Neg logit contributions
umm-4.723
ocol-4.659
 former-4.649
lor-4.636
arians-4.572
Token.
Token ID13
Feature activation+5.50e+01

Pos logit contributions
 and+0.530
,+0.416
.+0.244
 the+0.242
 a+0.178
Neg logit contributions
umm-7.916
ocol-7.783
lor-7.716
arians-7.661
osity-7.659
Token
Token ID198
Feature activation+5.50e+01

Pos logit contributions
 and+3.090
,+2.970
.+2.784
 the+2.704
 a+2.605
Neg logit contributions
umm-5.474
lor-5.356
ocol-5.303
 former-5.286
arians-5.252
Token
Token ID198
Feature activation+5.50e+01

Pos logit contributions
 and+1.412
,+1.293
.+1.098
 the+1.020
 a+0.941
Neg logit contributions
lor-7.041
umm-7.024
ocol-6.946
 former-6.898
arians-6.858
TokenTom
Token ID13787
Feature activation+5.50e+01

Pos logit contributions
 and+3.687
,+3.562
.+3.397
 the+3.319
 a+3.225
Neg logit contributions
umm-4.882
lor-4.844
ocol-4.760
arians-4.719
 former-4.711
Token went
Token ID1816
Feature activation+5.50e+01

Pos logit contributions
 and+2.963
,+2.892
.+2.778
 the+2.711
 a+2.614
Neg logit contributions
umm-5.348
lor-5.343
arians-5.275
 former-5.215
ocol-5.205
Token to
Token ID284
Feature activation+5.50e+01

Pos logit contributions
 and+1.863
,+1.752
.+1.589
 the+1.515
 a+1.428
Neg logit contributions
umm-6.603
lor-6.542
ocol-6.493
 former-6.459
arians-6.451
Token his
Token ID465
Feature activation+5.50e+01

Pos logit contributions
 and+2.406
,+2.288
.+2.117
 the+1.958
 a+1.899
Neg logit contributions
umm-6.076
ocol-6.020
lor-5.976
 former-5.965
arians-5.918
Token bed
Token ID3996
Feature activation+5.50e+01

Pos logit contributions
 and+3.796
,+3.682
.+3.525
 the+3.457
 a+3.353
Neg logit contributions
umm-4.731
 former-4.621
lor-4.576
arians-4.555
ocol-4.543
Token.
Token ID13
Feature activation+5.50e+01

Pos logit contributions
 and+0.913
,+0.778
.+0.608
 the+0.606
 a+0.525
Neg logit contributions
umm-7.529
lor-7.435
ocol-7.407
osity-7.325
arians-7.323
Token Tom
Token ID4186
Feature activation+5.50e+01

Pos logit contributions
 and+2.972
,+2.861
.+2.681
 the+2.591
 a+2.495
Neg logit contributions
umm-5.549
lor-5.485
ocol-5.433
 former-5.405
arians-5.364
Token was
Token ID373
Feature activation+5.50e+01

Pos logit contributions
 and+3.005
,+2.934
.+2.806
 the+2.736
 a+2.639
Neg logit contributions
umm-5.344
lor-5.304
arians-5.239
 former-5.221
ocol-5.202
Token scared
Token ID12008
Feature activation+5.50e+01

Pos logit contributions
 and+3.713
,+3.613
.+3.442
 the+3.330
 a+3.196
Neg logit contributions
umm-4.723
ocol-4.659
 former-4.649
lor-4.636
arians-4.572
Token.
Token ID13
Feature activation+5.50e+01

Pos logit contributions
 and+0.530
,+0.416
.+0.244
 the+0.242
 a+0.178
Neg logit contributions
umm-7.916
ocol-7.783
lor-7.716
arians-7.661
osity-7.659
Token
Token ID198
Feature activation+5.50e+01

Pos logit contributions
 and+3.090
,+2.970
.+2.784
 the+2.704
 a+2.605
Neg logit contributions
umm-5.474
lor-5.356
ocol-5.303
 former-5.286
arians-5.252
Token
Token ID198
Feature activation+5.50e+01

Pos logit contributions
 and+1.412
,+1.293
.+1.098
 the+1.020
 a+0.941
Neg logit contributions
lor-7.041
umm-7.024
ocol-6.946
 former-6.898
arians-6.858
TokenTom
Token ID13787
Feature activation+5.50e+01

Pos logit contributions
 and+3.687
,+3.562
.+3.397
 the+3.319
 a+3.225
Neg logit contributions
umm-4.882
lor-4.844
ocol-4.760
arians-4.719
 former-4.711
Token went
Token ID1816
Feature activation+5.50e+01

Pos logit contributions
 and+2.963
,+2.892
.+2.778
 the+2.711
 a+2.614
Neg logit contributions
umm-5.348
lor-5.343
arians-5.275
 former-5.215
ocol-5.205
Token to
Token ID284
Feature activation+5.50e+01

Pos logit contributions
 and+1.863
,+1.752
.+1.589
 the+1.515
 a+1.428
Neg logit contributions
umm-6.603
lor-6.542
ocol-6.493
 former-6.459
arians-6.451
Token his
Token ID465
Feature activation+5.50e+01

Pos logit contributions
 and+2.406
,+2.288
.+2.117
 the+1.958
 a+1.899
Neg logit contributions
umm-6.076
ocol-6.020
lor-5.976
 former-5.965
arians-5.918
Token was
Token ID373
Feature activation+5.50e+01

Pos logit contributions
 and+3.005
,+2.934
.+2.806
 the+2.736
 a+2.639
Neg logit contributions
umm-5.344
lor-5.304
arians-5.239
 former-5.221
ocol-5.202
Token scared
Token ID12008
Feature activation+5.50e+01

Pos logit contributions
 and+3.713
,+3.613
.+3.442
 the+3.330
 a+3.196
Neg logit contributions
umm-4.723
ocol-4.659
 former-4.649
lor-4.636
arians-4.572
Token.
Token ID13
Feature activation+5.50e+01

Pos logit contributions
 and+0.530
,+0.416
.+0.244
 the+0.242
 a+0.178
Neg logit contributions
umm-7.916
ocol-7.783
lor-7.716
arians-7.661
osity-7.659
Token
Token ID198
Feature activation+5.50e+01

Pos logit contributions
 and+3.090
,+2.970
.+2.784
 the+2.704
 a+2.605
Neg logit contributions
umm-5.474
lor-5.356
ocol-5.303
 former-5.286
arians-5.252
Token
Token ID198
Feature activation+5.50e+01

Pos logit contributions
 and+1.412
,+1.293
.+1.098
 the+1.020
 a+0.941
Neg logit contributions
lor-7.041
umm-7.024
ocol-6.946
 former-6.898
arians-6.858
TokenTom
Token ID13787
Feature activation+5.50e+01

Pos logit contributions
 and+3.687
,+3.562
.+3.397
 the+3.319
 a+3.225
Neg logit contributions
umm-4.882
lor-4.844
ocol-4.760
arians-4.719
 former-4.711
Token went
Token ID1816
Feature activation+5.50e+01

Pos logit contributions
 and+2.963
,+2.892
.+2.778
 the+2.711
 a+2.614
Neg logit contributions
umm-5.348
lor-5.343
arians-5.275
 former-5.215
ocol-5.205
Token to
Token ID284
Feature activation+5.50e+01

Pos logit contributions
 and+1.863
,+1.752
.+1.589
 the+1.515
 a+1.428
Neg logit contributions
umm-6.603
lor-6.542
ocol-6.493
 former-6.459
arians-6.451
Token his
Token ID465
Feature activation+5.50e+01

Pos logit contributions
 and+2.406
,+2.288
.+2.117
 the+1.958
 a+1.899
Neg logit contributions
umm-6.076
ocol-6.020
lor-5.976
 former-5.965
arians-5.918
Token bed
Token ID3996
Feature activation+5.50e+01

Pos logit contributions
 and+3.796
,+3.682
.+3.525
 the+3.457
 a+3.353
Neg logit contributions
umm-4.731
 former-4.621
lor-4.576
arians-4.555
ocol-4.543
Token and
Token ID290
Feature activation+5.50e+01

Pos logit contributions
 and+1.139
,+1.038
 the+0.892
.+0.868
 a+0.783
Neg logit contributions
umm-7.257
lor-7.218
 former-7.136
arians-7.077
ocol-7.042
Token scared
Token ID12008
Feature activation+5.50e+01

Pos logit contributions
 and+3.713
,+3.613
.+3.442
 the+3.330
 a+3.196
Neg logit contributions
umm-4.723
ocol-4.659
 former-4.649
lor-4.636
arians-4.572
Token.
Token ID13
Feature activation+5.50e+01

Pos logit contributions
 and+0.530
,+0.416
.+0.244
 the+0.242
 a+0.178
Neg logit contributions
umm-7.916
ocol-7.783
lor-7.716
arians-7.661
osity-7.659
Token
Token ID198
Feature activation+5.50e+01

Pos logit contributions
 and+3.090
,+2.970
.+2.784
 the+2.704
 a+2.605
Neg logit contributions
umm-5.474
lor-5.356
ocol-5.303
 former-5.286
arians-5.252
Token
Token ID198
Feature activation+5.50e+01

Pos logit contributions
 and+1.412
,+1.293
.+1.098
 the+1.020
 a+0.941
Neg logit contributions
lor-7.041
umm-7.024
ocol-6.946
 former-6.898
arians-6.858
TokenTom
Token ID13787
Feature activation+5.50e+01

Pos logit contributions
 and+3.687
,+3.562
.+3.397
 the+3.319
 a+3.225
Neg logit contributions
umm-4.882
lor-4.844
ocol-4.760
arians-4.719
 former-4.711
Token went
Token ID1816
Feature activation+5.50e+01

Pos logit contributions
 and+2.963
,+2.892
.+2.778
 the+2.711
 a+2.614
Neg logit contributions
umm-5.348
lor-5.343
arians-5.275
 former-5.215
ocol-5.205
Token to
Token ID284
Feature activation+5.50e+01

Pos logit contributions
 and+1.863
,+1.752
.+1.589
 the+1.515
 a+1.428
Neg logit contributions
umm-6.603
lor-6.542
ocol-6.493
 former-6.459
arians-6.451
Token his
Token ID465
Feature activation+5.50e+01

Pos logit contributions
 and+2.406
,+2.288
.+2.117
 the+1.958
 a+1.899
Neg logit contributions
umm-6.076
ocol-6.020
lor-5.976
 former-5.965
arians-5.918
Token bed
Token ID3996
Feature activation+5.50e+01

Pos logit contributions
 and+3.796
,+3.682
.+3.525
 the+3.457
 a+3.353
Neg logit contributions
umm-4.731
 former-4.621
lor-4.576
arians-4.555
ocol-4.543
Token and
Token ID290
Feature activation+5.50e+01

Pos logit contributions
 and+1.139
,+1.038
 the+0.892
.+0.868
 a+0.783
Neg logit contributions
umm-7.257
lor-7.218
 former-7.136
arians-7.077
ocol-7.042
Token hid
Token ID24519
Feature activation+5.50e+01

Pos logit contributions
 and+3.590
,+3.484
.+3.332
 the+3.198
 a+3.108
Neg logit contributions
umm-4.811
lor-4.796
 former-4.755
ocol-4.728
arians-4.696
Token did
Token ID750
Feature activation+5.50e+01

Pos logit contributions
 and+3.767
,+3.652
.+3.507
 the+3.465
 a+3.360
Neg logit contributions
umm-4.683
lor-4.598
ocol-4.544
 former-4.491
arians-4.473
Token not
Token ID407
Feature activation+5.50e+01

Pos logit contributions
 and+2.385
,+2.255
.+2.096
 the+2.031
 a+1.925
Neg logit contributions
umm-6.051
lor-5.998
ocol-5.971
 former-5.955
arians-5.896
Token say
Token ID910
Feature activation+5.50e+01

Pos logit contributions
 and+3.928
,+3.808
.+3.643
 the+3.586
 a+3.479
Neg logit contributions
umm-4.599
ocol-4.497
lor-4.441
 former-4.426
arians-4.395
Token anything
Token ID1997
Feature activation+5.50e+01

Pos logit contributions
 and+4.183
,+4.003
.+3.863
 the+3.778
 a+3.681
Neg logit contributions
umm-4.334
lor-4.286
ocol-4.250
osity-4.151
 former-4.151
Token either
Token ID2035
Feature activation+5.50e+01

Pos logit contributions
 and+1.879
,+1.754
.+1.579
 the+1.551
 a+1.473
Neg logit contributions
umm-6.560
lor-6.471
ocol-6.446
 former-6.397
arians-6.397
Token.
Token ID13
Feature activation+5.50e+01

Pos logit contributions
 and+0.913
,+0.778
.+0.608
 the+0.606
 a+0.525
Neg logit contributions
umm-7.529
lor-7.435
ocol-7.407
osity-7.325
arians-7.323
Token Tom
Token ID4186
Feature activation+5.50e+01

Pos logit contributions
 and+2.972
,+2.861
.+2.681
 the+2.591
 a+2.495
Neg logit contributions
umm-5.549
lor-5.485
ocol-5.433
 former-5.405
arians-5.364
Token was
Token ID373
Feature activation+5.50e+01

Pos logit contributions
 and+3.005
,+2.934
.+2.806
 the+2.736
 a+2.639
Neg logit contributions
umm-5.344
lor-5.304
arians-5.239
 former-5.221
ocol-5.202
Token scared
Token ID12008
Feature activation+5.50e+01

Pos logit contributions
 and+3.713
,+3.613
.+3.442
 the+3.330
 a+3.196
Neg logit contributions
umm-4.723
ocol-4.659
 former-4.649
lor-4.636
arians-4.572
Token.
Token ID13
Feature activation+5.50e+01

Pos logit contributions
 and+0.530
,+0.416
.+0.244
 the+0.242
 a+0.178
Neg logit contributions
umm-7.916
ocol-7.783
lor-7.716
arians-7.661
osity-7.659
Token
Token ID198
Feature activation+5.50e+01

Pos logit contributions
 and+3.090
,+2.970
.+2.784
 the+2.704
 a+2.605
Neg logit contributions
umm-5.474
lor-5.356
ocol-5.303
 former-5.286
arians-5.252
Token table
Token ID3084
Feature activation+5.50e+01

Pos logit contributions
 and+4.237
,+4.129
.+3.956
 the+3.874
 a+3.767
Neg logit contributions
umm-4.312
ocol-4.202
 former-4.137
lor-4.128
arians-4.097
Token did
Token ID750
Feature activation+5.50e+01

Pos logit contributions
 and+3.767
,+3.652
.+3.507
 the+3.465
 a+3.360
Neg logit contributions
umm-4.683
lor-4.598
ocol-4.544
 former-4.491
arians-4.473
Token not
Token ID407
Feature activation+5.50e+01

Pos logit contributions
 and+2.385
,+2.255
.+2.096
 the+2.031
 a+1.925
Neg logit contributions
umm-6.051
lor-5.998
ocol-5.971
 former-5.955
arians-5.896
Token say
Token ID910
Feature activation+5.50e+01

Pos logit contributions
 and+3.928
,+3.808
.+3.643
 the+3.586
 a+3.479
Neg logit contributions
umm-4.599
ocol-4.497
lor-4.441
 former-4.426
arians-4.395
Token anything
Token ID1997
Feature activation+5.50e+01

Pos logit contributions
 and+4.183
,+4.003
.+3.863
 the+3.778
 a+3.681
Neg logit contributions
umm-4.334
lor-4.286
ocol-4.250
osity-4.151
 former-4.151
Token either
Token ID2035
Feature activation+5.50e+01

Pos logit contributions
 and+1.879
,+1.754
.+1.579
 the+1.551
 a+1.473
Neg logit contributions
umm-6.560
lor-6.471
ocol-6.446
 former-6.397
arians-6.397
Token.
Token ID13
Feature activation+5.50e+01

Pos logit contributions
 and+0.913
,+0.778
.+0.608
 the+0.606
 a+0.525
Neg logit contributions
umm-7.529
lor-7.435
ocol-7.407
osity-7.325
arians-7.323
Token Tom
Token ID4186
Feature activation+5.50e+01

Pos logit contributions
 and+2.972
,+2.861
.+2.681
 the+2.591
 a+2.495
Neg logit contributions
umm-5.549
lor-5.485
ocol-5.433
 former-5.405
arians-5.364
Token was
Token ID373
Feature activation+5.50e+01

Pos logit contributions
 and+3.005
,+2.934
.+2.806
 the+2.736
 a+2.639
Neg logit contributions
umm-5.344
lor-5.304
arians-5.239
 former-5.221
ocol-5.202
Token scared
Token ID12008
Feature activation+5.50e+01

Pos logit contributions
 and+3.713
,+3.613
.+3.442
 the+3.330
 a+3.196
Neg logit contributions
umm-4.723
ocol-4.659
 former-4.649
lor-4.636
arians-4.572
Token.
Token ID13
Feature activation+5.50e+01

Pos logit contributions
 and+0.530
,+0.416
.+0.244
 the+0.242
 a+0.178
Neg logit contributions
umm-7.916
ocol-7.783
lor-7.716
arians-7.661
osity-7.659
Token?"
Token ID1701
Feature activation+5.50e+01

Pos logit contributions
 and+2.163
,+2.045
.+1.893
 the+1.892
 a+1.782
Neg logit contributions
umm-6.283
lor-6.172
ocol-6.126
 former-6.089
arians-6.067
Token The
Token ID383
Feature activation+5.50e+01

Pos logit contributions
 and+3.157
,+3.027
.+2.874
 the+2.762
 a+2.693
Neg logit contributions
umm-5.294
ocol-5.226
lor-5.208
 former-5.169
arians-5.127
Token table
Token ID3084
Feature activation+5.50e+01

Pos logit contributions
 and+4.237
,+4.129
.+3.956
 the+3.874
 a+3.767
Neg logit contributions
umm-4.312
ocol-4.202
 former-4.137
lor-4.128
arians-4.097
Token did
Token ID750
Feature activation+5.50e+01

Pos logit contributions
 and+3.767
,+3.652
.+3.507
 the+3.465
 a+3.360
Neg logit contributions
umm-4.683
lor-4.598
ocol-4.544
 former-4.491
arians-4.473
Token not
Token ID407
Feature activation+5.50e+01

Pos logit contributions
 and+2.385
,+2.255
.+2.096
 the+2.031
 a+1.925
Neg logit contributions
umm-6.051
lor-5.998
ocol-5.971
 former-5.955
arians-5.896
Token say
Token ID910
Feature activation+5.50e+01

Pos logit contributions
 and+3.928
,+3.808
.+3.643
 the+3.586
 a+3.479
Neg logit contributions
umm-4.599
ocol-4.497
lor-4.441
 former-4.426
arians-4.395
Token anything
Token ID1997
Feature activation+5.50e+01

Pos logit contributions
 and+4.183
,+4.003
.+3.863
 the+3.778
 a+3.681
Neg logit contributions
umm-4.334
lor-4.286
ocol-4.250
osity-4.151
 former-4.151
Token either
Token ID2035
Feature activation+5.50e+01

Pos logit contributions
 and+1.879
,+1.754
.+1.579
 the+1.551
 a+1.473
Neg logit contributions
umm-6.560
lor-6.471
ocol-6.446
 former-6.397
arians-6.397
Token.
Token ID13
Feature activation+5.50e+01

Pos logit contributions
 and+0.913
,+0.778
.+0.608
 the+0.606
 a+0.525
Neg logit contributions
umm-7.529
lor-7.435
ocol-7.407
osity-7.325
arians-7.323
Token Tom
Token ID4186
Feature activation+5.50e+01

Pos logit contributions
 and+2.972
,+2.861
.+2.681
 the+2.591
 a+2.495
Neg logit contributions
umm-5.549
lor-5.485
ocol-5.433
 former-5.405
arians-5.364
Token was
Token ID373
Feature activation+5.50e+01

Pos logit contributions
 and+3.005
,+2.934
.+2.806
 the+2.736
 a+2.639
Neg logit contributions
umm-5.344
lor-5.304
arians-5.239
 former-5.221
ocol-5.202
Token The
Token ID383
Feature activation+5.50e+01

Pos logit contributions
 and+3.157
,+3.027
.+2.874
 the+2.762
 a+2.693
Neg logit contributions
umm-5.294
ocol-5.226
lor-5.208
 former-5.169
arians-5.127
Token table
Token ID3084
Feature activation+5.50e+01

Pos logit contributions
 and+4.237
,+4.129
.+3.956
 the+3.874
 a+3.767
Neg logit contributions
umm-4.312
ocol-4.202
 former-4.137
lor-4.128
arians-4.097
Token did
Token ID750
Feature activation+5.50e+01

Pos logit contributions
 and+3.767
,+3.652
.+3.507
 the+3.465
 a+3.360
Neg logit contributions
umm-4.683
lor-4.598
ocol-4.544
 former-4.491
arians-4.473
Token not
Token ID407
Feature activation+5.50e+01

Pos logit contributions
 and+2.385
,+2.255
.+2.096
 the+2.031
 a+1.925
Neg logit contributions
umm-6.051
lor-5.998
ocol-5.971
 former-5.955
arians-5.896
Token say
Token ID910
Feature activation+5.50e+01

Pos logit contributions
 and+3.928
,+3.808
.+3.643
 the+3.586
 a+3.479
Neg logit contributions
umm-4.599
ocol-4.497
lor-4.441
 former-4.426
arians-4.395
Token anything
Token ID1997
Feature activation+5.50e+01

Pos logit contributions
 and+4.183
,+4.003
.+3.863
 the+3.778
 a+3.681
Neg logit contributions
umm-4.334
lor-4.286
ocol-4.250
osity-4.151
 former-4.151
Token either
Token ID2035
Feature activation+5.50e+01

Pos logit contributions
 and+1.879
,+1.754
.+1.579
 the+1.551
 a+1.473
Neg logit contributions
umm-6.560
lor-6.471
ocol-6.446
 former-6.397
arians-6.397
Token.
Token ID13
Feature activation+5.50e+01

Pos logit contributions
 and+0.913
,+0.778
.+0.608
 the+0.606
 a+0.525
Neg logit contributions
umm-7.529
lor-7.435
ocol-7.407
osity-7.325
arians-7.323
Token Tom
Token ID4186
Feature activation+5.50e+01

Pos logit contributions
 and+2.972
,+2.861
.+2.681
 the+2.591
 a+2.495
Neg logit contributions
umm-5.549
lor-5.485
ocol-5.433
 former-5.405
arians-5.364
Token was
Token ID373
Feature activation+5.50e+01

Pos logit contributions
 and+3.005
,+2.934
.+2.806
 the+2.736
 a+2.639
Neg logit contributions
umm-5.344
lor-5.304
arians-5.239
 former-5.221
ocol-5.202
Token scared
Token ID12008
Feature activation+5.50e+01

Pos logit contributions
 and+3.713
,+3.613
.+3.442
 the+3.330
 a+3.196
Neg logit contributions
umm-4.723
ocol-4.659
 former-4.649
lor-4.636
arians-4.572
Token to
Token ID284
Feature activation+5.50e+01

Pos logit contributions
 and+1.207
,+1.060
.+0.929
 the+0.828
 a+0.732
Neg logit contributions
umm-7.192
lor-7.177
osity-7.085
 former-7.077
ocol-7.076
Token create
Token ID2251
Feature activation+5.50e+01

Pos logit contributions
 and+3.689
,+3.543
.+3.397
 the+3.286
 a+3.193
Neg logit contributions
umm-4.840
lor-4.752
 former-4.741
ocol-4.702
arians-4.659
Token her
Token ID607
Feature activation+5.50e+01

Pos logit contributions
 and+2.296
,+2.165
.+1.944
 the+1.865
 a+1.723
Neg logit contributions
umm-6.167
ocol-6.151
lor-6.101
 former-6.077
arians-5.972
Token own
Token ID898
Feature activation+5.50e+01

Pos logit contributions
 and+4.339
,+4.219
.+4.042
 the+3.979
 a+3.851
Neg logit contributions
umm-4.203
 former-4.139
ocol-4.058
lor-4.057
arians-4.012
Token ash
Token ID12530
Feature activation+5.50e+01

Pos logit contributions
 and+4.914
,+4.791
.+4.606
 the+4.578
 a+4.465
Neg logit contributions
umm-3.630
 former-3.514
ocol-3.501
lor-3.483
arians-3.419
Tokentr
Token ID2213
Feature activation+5.50e+01

Pos logit contributions
 and+6.576
,+6.458
 the+6.323
.+6.242
 a+6.175
Neg logit contributions
umm-1.998
lor-1.860
 former-1.782
arians-1.779
ocol-1.712
Tokenay
Token ID323
Feature activation+5.50e+01

Pos logit contributions
 and+5.526
,+5.385
.+5.231
 the+5.209
 a+5.094
Neg logit contributions
umm-3.130
lor-2.987
ocol-2.919
 former-2.874
rol-2.853
Token.
Token ID13
Feature activation+5.50e+01

Pos logit contributions
 and+2.385
,+2.255
 the+2.065
.+2.063
 a+1.961
Neg logit contributions
umm-6.052
lor-6.026
arians-5.962
ocol-5.945
 former-5.892
Token She
Token ID1375
Feature activation+5.50e+01

Pos logit contributions
 and+3.208
,+3.091
.+2.924
 the+2.842
 a+2.746
Neg logit contributions
umm-5.369
lor-5.251
 former-5.197
ocol-5.181
osity-5.163
Token went
Token ID1816
Feature activation+5.50e+01

Pos logit contributions
 and+3.609
,+3.520
.+3.381
 the+3.290
 a+3.181
Neg logit contributions
umm-4.835
lor-4.772
 former-4.738
ocol-4.738
arians-4.728
Token to
Token ID284
Feature activation+5.50e+01

Pos logit contributions
 and+1.740
,+1.654
.+1.535
 the+1.420
 a+1.334
Neg logit contributions
umm-6.622
lor-6.554
ocol-6.533
 former-6.496
osity-6.489
Token,
Token ID11
Feature activation+5.50e+01

Pos logit contributions
 and+0.329
,+0.157
.+0.050
 the-0.039
 a-0.130
Neg logit contributions
umm-8.211
ocol-8.076
lor-8.071
osity-7.980
arians-7.977
Token he
Token ID339
Feature activation+5.50e+01

Pos logit contributions
 and+3.982
,+3.846
.+3.717
 the+3.603
 a+3.513
Neg logit contributions
umm-4.573
ocol-4.410
lor-4.407
arians-4.335
osity-4.334
Token saw
Token ID2497
Feature activation+5.50e+01

Pos logit contributions
 and+2.893
,+2.805
.+2.671
 the+2.579
 a+2.462
Neg logit contributions
umm-5.533
lor-5.471
 former-5.431
ocol-5.391
arians-5.357
Token an
Token ID281
Feature activation+5.50e+01

Pos logit contributions
 and+1.078
,+0.944
.+0.803
 the+0.612
 a+0.469
Neg logit contributions
umm-7.435
lor-7.349
ocol-7.339
 former-7.285
osity-7.237
Token ash
Token ID12530
Feature activation+5.50e+01

Pos logit contributions
 and+4.301
,+4.195
.+4.064
 the+3.977
 a+3.857
Neg logit contributions
umm-4.272
ocol-4.132
 former-4.072
lor-4.023
arians-4.019
Tokentr
Token ID2213
Feature activation+5.50e+01

Pos logit contributions
 and+6.562
,+6.460
 the+6.290
.+6.267
 a+6.167
Neg logit contributions
umm-2.003
lor-1.855
ocol-1.752
arians-1.750
 former-1.731
Tokenay
Token ID323
Feature activation+5.50e+01

Pos logit contributions
 and+5.480
,+5.346
.+5.205
 the+5.151
 a+5.042
Neg logit contributions
umm-3.158
lor-3.006
ocol-2.968
 former-2.889
arians-2.882
Token on
Token ID319
Feature activation+5.50e+01

Pos logit contributions
 and+2.223
,+2.123
.+1.943
 the+1.899
 a+1.813
Neg logit contributions
umm-6.208
lor-6.167
ocol-6.101
arians-6.077
osity-6.059
Token the
Token ID262
Feature activation+5.50e+01

Pos logit contributions
 and+0.731
,+0.622
.+0.460
 the+0.287
 a+0.212
Neg logit contributions
umm-7.785
ocol-7.667
lor-7.642
 former-7.597
arians-7.555
Token table
Token ID3084
Feature activation+5.50e+01

Pos logit contributions
 and+4.677
,+4.585
.+4.405
 the+4.303
 a+4.219
Neg logit contributions
umm-3.882
lor-3.728
 former-3.687
arians-3.673
ocol-3.646
Token.
Token ID13
Feature activation+5.50e+01

Pos logit contributions
 and+0.833
,+0.737
 the+0.552
.+0.507
 a+0.440
Neg logit contributions
umm-7.618
lor-7.537
ocol-7.430
 former-7.401
arians-7.385
Token.
Token ID13
Feature activation+5.50e+01

Pos logit contributions
 and+4.769
,+4.651
.+4.482
 the+4.459
 a+4.401
Neg logit contributions
umm-3.753
lor-3.596
ocol-3.551
 former-3.528
arians-3.466
Token
Token ID220
Feature activation+5.50e+01

Pos logit contributions
 and+3.656
,+3.553
.+3.396
 the+3.272
 a+3.170
Neg logit contributions
umm-4.877
lor-4.798
ocol-4.728
 former-4.725
arians-4.698
Token
Token ID198
Feature activation+5.50e+01

Pos logit contributions
 and+1.638
,+1.544
.+1.333
 the+1.269
 a+1.169
Neg logit contributions
lor-6.845
umm-6.790
ocol-6.708
 former-6.689
osity-6.682
Token"
Token ID1
Feature activation+5.50e+01

Pos logit contributions
 and+3.174
,+3.075
.+2.905
 the+2.808
 a+2.696
Neg logit contributions
lor-5.396
umm-5.299
ocol-5.224
 former-5.210
arians-5.200
TokenB
Token ID33
Feature activation+5.50e+01

Pos logit contributions
 and+5.482
,+5.425
.+5.226
 the+5.101
 a+4.994
Neg logit contributions
umm-3.162
lor-3.050
ocol-2.999
 former-2.986
arians-2.972
TokenOO
Token ID6684
Feature activation+5.50e+01

Pos logit contributions
 and+6.967
,+6.856
.+6.702
 the+6.696
 a+6.563
Neg logit contributions
umm-1.691
lor-1.596
ocol-1.544
osity-1.529
 former-1.474
Token!!
Token ID3228
Feature activation+5.50e+01

Pos logit contributions
 and+5.372
,+5.217
.+5.082
 the+4.999
 a+4.899
Neg logit contributions
umm-3.234
ocol-3.124
lor-3.119
arians-3.019
ubb-2.989
Token!"
Token ID2474
Feature activation+5.50e+01

Pos logit contributions
 and+3.527
,+3.398
.+3.252
 the+3.222
 a+3.112
Neg logit contributions
umm-4.952
lor-4.882
ocol-4.754
 former-4.750
arians-4.743
Token
Token ID198
Feature activation+5.50e+01

Pos logit contributions
 and+3.345
,+3.221
.+3.059
 the+2.931
 a+2.838
Neg logit contributions
umm-5.108
lor-5.050
 former-4.991
ocol-4.988
arians-4.941
TokenThe
Token ID464
Feature activation+5.50e+01

Pos logit contributions
 and+4.307
,+4.192
.+4.006
 the+3.933
 a+3.821
Neg logit contributions
lor-4.274
umm-4.143
 former-4.068
ocol-4.067
arians-4.059
Token ship
Token ID4074
Feature activation+5.50e+01

Pos logit contributions
 and+5.173
,+5.057
.+4.901
 the+4.805
 a+4.692
Neg logit contributions
umm-3.419
lor-3.277
ocol-3.255
 former-3.218
arians-3.205
Token the
Token ID262
Feature activation+5.50e+01

Pos logit contributions
 and+1.785
,+1.670
.+1.486
 the+1.326
 a+1.280
Neg logit contributions
umm-6.712
ocol-6.665
lor-6.638
 former-6.536
arians-6.502
Token ancient
Token ID6156
Feature activation+5.50e+01

Pos logit contributions
 and+4.222
,+4.117
.+3.950
 the+3.871
 a+3.769
Neg logit contributions
umm-4.387
ocol-4.235
 former-4.234
lor-4.204
osity-4.125
Token eye
Token ID4151
Feature activation+5.50e+01

Pos logit contributions
 and+4.452
,+4.334
.+4.183
 the+4.146
 a+4.055
Neg logit contributions
umm-4.051
ocol-3.931
 former-3.907
lor-3.899
osity-3.836
Token.
Token ID13
Feature activation+5.50e+01

Pos logit contributions
 and+2.597
,+2.478
.+2.296
 the+2.268
 a+2.188
Neg logit contributions
umm-5.834
lor-5.815
ocol-5.739
osity-5.656
arians-5.646
Token<|endoftext|>
Token ID50256
Feature activation+5.50e+01

Pos logit contributions
 and+2.539
,+2.431
.+2.246
 the+2.154
 a+2.081
Neg logit contributions
umm-6.071
lor-5.984
ocol-5.874
 former-5.861
osity-5.840
TokenE
Token ID36
Feature activation+5.50e+01

Pos logit contributions
 and+3.804
,+3.683
.+3.484
 the+3.435
 a+3.349
Neg logit contributions
umm-4.673
lor-4.649
ocol-4.521
arians-4.440
rol-4.440
Tokenlla
Token ID8466
Feature activation+5.50e+01

Pos logit contributions
 and+6.464
,+6.334
.+6.172
 the+6.088
 a+6.017
Neg logit contributions
umm-2.094
lor-2.071
ocol-1.998
arians-1.943
 former-1.913
Token was
Token ID373
Feature activation+5.50e+01

Pos logit contributions
 and+2.531
,+2.445
.+2.313
 the+2.262
 a+2.186
Neg logit contributions
umm-5.784
lor-5.743
ocol-5.686
 former-5.648
arians-5.645
Token having
Token ID1719
Feature activation+5.50e+01

Pos logit contributions
 and+2.971
,+2.876
.+2.749
 the+2.610
 a+2.456
Neg logit contributions
umm-5.479
ocol-5.399
 former-5.389
lor-5.358
rol-5.312
Token a
Token ID257
Feature activation+5.50e+01

Pos logit contributions
 and+1.467
,+1.357
.+1.173
 the+1.078
 a+0.914
Neg logit contributions
umm-7.010
 former-6.894
lor-6.887
ocol-6.854
arians-6.775
Token special
Token ID2041
Feature activation+5.50e+01

Pos logit contributions
 and+3.707
,+3.561
.+3.437
 the+3.371
 a+3.228
Neg logit contributions
umm-4.820
 former-4.678
ocol-4.664
lor-4.651
arians-4.591
Token were
Token ID547
Feature activation+5.50e+01

Pos logit contributions
 and+3.359
,+3.250
.+3.101
 the+2.990
 a+2.909
Neg logit contributions
umm-5.099
lor-5.019
 former-5.002
arians-4.967
ocol-4.964
Token very
Token ID845
Feature activation+5.50e+01

Pos logit contributions
 and+2.628
,+2.506
.+2.361
 the+2.203
 a+2.116
Neg logit contributions
umm-5.856
lor-5.757
 former-5.737
ocol-5.725
arians-5.692
Token happy
Token ID3772
Feature activation+5.50e+01

Pos logit contributions
 and+2.689
,+2.545
.+2.401
 the+2.307
 a+2.218
Neg logit contributions
umm-5.865
ocol-5.751
 former-5.734
lor-5.716
osity-5.670
Token.
Token ID13
Feature activation+5.50e+01

Pos logit contributions
 and+0.584
,+0.488
 the+0.299
.+0.297
 a+0.206
Neg logit contributions
umm-7.880
lor-7.712
 former-7.695
ocol-7.679
arians-7.671
Token<|endoftext|>
Token ID50256
Feature activation+5.50e+01

Pos logit contributions
 and+2.480
,+2.363
.+2.189
 the+2.114
 a+2.015
Neg logit contributions
umm-6.104
lor-5.970
 former-5.941
ocol-5.935
arians-5.887
TokenE
Token ID36
Feature activation+5.50e+01

Pos logit contributions
 and+3.710
,+3.600
.+3.395
 the+3.349
 a+3.256
Neg logit contributions
umm-4.755
lor-4.729
ocol-4.627
arians-4.535
 former-4.507
Tokenlla
Token ID8466
Feature activation+5.50e+01

Pos logit contributions
 and+6.468
,+6.335
.+6.171
 the+6.102
 a+6.027
Neg logit contributions
umm-2.089
lor-2.029
ocol-1.974
arians-1.928
 former-1.898
Token loved
Token ID6151
Feature activation+5.50e+01

Pos logit contributions
 and+2.239
,+2.144
.+2.014
 the+1.968
 a+1.889
Neg logit contributions
umm-6.096
lor-6.030
ocol-5.973
arians-5.952
 former-5.929
Token to
Token ID284
Feature activation+5.50e+01

Pos logit contributions
 and+3.205
,+3.102
.+2.937
 the+2.793
 a+2.729
Neg logit contributions
umm-5.330
ocol-5.222
lor-5.168
 former-5.135
arians-5.107
Token visit
Token ID3187
Feature activation+5.50e+01

Pos logit contributions
 and+3.510
,+3.371
.+3.229
 the+3.108
 a+3.024
Neg logit contributions
umm-4.953
ocol-4.903
lor-4.839
 former-4.826
osity-4.778
Token her
Token ID607
Feature activation+5.50e+01

Pos logit contributions
 and+2.004
,+1.870
.+1.728
 the+1.574
 a+1.520
Neg logit contributions
umm-6.459
ocol-6.407
 former-6.342
lor-6.321
arians-6.310
Token.
Token ID13
Feature activation+5.50e+01

Pos logit contributions
 and+1.086
,+0.991
.+0.802
 the+0.781
 a+0.692
Neg logit contributions
umm-7.331
ocol-7.220
lor-7.187
arians-7.152
osity-7.116
Token
Token ID198
Feature activation+5.50e+01

Pos logit contributions
 and+1.931
,+1.835
.+1.666
 the+1.547
 a+1.448
Neg logit contributions
umm-6.569
lor-6.521
ocol-6.422
arians-6.399
 former-6.380
Token
Token ID198
Feature activation+5.50e+01

Pos logit contributions
 and+1.391
,+1.281
.+1.104
 the+1.022
 a+0.947
Neg logit contributions
lor-7.005
umm-6.978
ocol-6.889
arians-6.866
 former-6.839
Token"
Token ID1
Feature activation+5.50e+01

Pos logit contributions
 and+3.444
,+3.325
.+3.180
 the+3.067
 a+2.986
Neg logit contributions
umm-5.081
lor-5.030
ocol-4.972
arians-4.962
vation-4.880
TokenAAAA
Token ID17922
Feature activation+5.50e+01

Pos logit contributions
 and+5.926
,+5.848
.+5.656
 the+5.538
 a+5.429
Neg logit contributions
umm-2.719
lor-2.581
ocol-2.573
 former-2.529
arians-2.514
TokenH
Token ID39
Feature activation+5.50e+01

Pos logit contributions
 and+6.163
,+6.039
.+5.886
 the+5.798
 a+5.700
Neg logit contributions
umm-2.435
lor-2.348
ocol-2.290
arians-2.247
 former-2.230
Token!"
Token ID2474
Feature activation+5.50e+01

Pos logit contributions
 and+5.865
,+5.657
.+5.550
 the+5.526
 a+5.442
Neg logit contributions
umm-2.694
lor-2.661
ocol-2.590
arians-2.505
vation-2.475
Token Tom
Token ID4186
Feature activation+5.50e+01

Pos logit contributions
 and+3.037
,+2.910
.+2.761
 the+2.655
 a+2.567
Neg logit contributions
umm-5.505
lor-5.387
ocol-5.376
arians-5.322
 former-5.318
Token and
Token ID290
Feature activation+5.50e+01

Pos logit contributions
 and+1.936
,+1.888
.+1.742
 the+1.714
 a+1.612
Neg logit contributions
umm-6.400
lor-6.328
arians-6.292
ocol-6.232
 former-6.219
Token Anna
Token ID11735
Feature activation+5.50e+01

Pos logit contributions
 and+3.375
,+3.256
.+3.102
 the+2.967
 a+2.887
Neg logit contributions
umm-5.191
lor-5.079
ocol-5.069
arians-5.016
 former-4.997
Token scream
Token ID8196
Feature activation+5.50e+01

Pos logit contributions
 and+4.134
,+4.026
.+3.885
 the+3.825
 a+3.725
Neg logit contributions
umm-4.261
lor-4.183
arians-4.162
ocol-4.150
 former-4.115
Token The
Token ID383
Feature activation+5.50e+01

Pos logit contributions
 and+2.947
,+2.813
.+2.669
 the+2.564
 a+2.500
Neg logit contributions
umm-5.599
ocol-5.450
lor-5.419
 former-5.354
arians-5.339
Token sign
Token ID1051
Feature activation+5.50e+01

Pos logit contributions
 and+4.890
,+4.797
.+4.645
 the+4.535
 a+4.431
Neg logit contributions
umm-3.601
lor-3.484
ocol-3.468
arians-3.418
vation-3.389
Token said
Token ID531
Feature activation+5.50e+01

Pos logit contributions
 and+2.407
,+2.295
.+2.147
 the+2.071
 a+1.994
Neg logit contributions
umm-6.045
lor-5.928
ocol-5.887
arians-5.868
 former-5.821
Token "
Token ID366
Feature activation+5.50e+01

Pos logit contributions
 and+2.029
,+1.839
.+1.764
 the+1.637
 a+1.547
Neg logit contributions
umm-6.428
lor-6.408
ocol-6.355
osity-6.284
arians-6.276
TokenST
Token ID2257
Feature activation+5.50e+01

Pos logit contributions
 and+5.947
,+5.881
.+5.674
 the+5.598
 a+5.492
Neg logit contributions
umm-2.637
lor-2.514
ocol-2.476
arians-2.442
 former-2.442
TokenOP
Token ID3185
Feature activation+5.50e+01

Pos logit contributions
 and+7.446
,+7.321
.+7.169
 the+7.131
 a+6.993
Neg logit contributions
umm-1.214
lor-1.114
ocol-1.110
arians-1.026
 former-1.017
Token".
Token ID1911
Feature activation+5.50e+01

Pos logit contributions
 and+5.186
,+5.037
.+4.896
 the+4.893
 a+4.781
Neg logit contributions
umm-3.381
lor-3.319
ocol-3.245
arians-3.214
 former-3.144
Token
Token ID198
Feature activation+5.50e+01

Pos logit contributions
 and+3.350
,+3.220
.+3.062
 the+2.966
 a+2.885
Neg logit contributions
umm-5.178
ocol-5.022
lor-5.014
 former-4.977
rol-4.925
Token
Token ID198
Feature activation+5.50e+01

Pos logit contributions
 and+1.369
,+1.242
.+1.073
 the+1.004
 a+0.929
Neg logit contributions
umm-7.054
lor-7.050
ocol-6.958
 former-6.888
arians-6.884
Token"
Token ID1
Feature activation+5.50e+01

Pos logit contributions
 and+3.222
,+3.079
.+2.938
 the+2.861
 a+2.773
Neg logit contributions
umm-5.326
lor-5.228
ocol-5.213
 former-5.142
arians-5.136
TokenLet
Token ID5756
Feature activation+5.50e+01

Pos logit contributions
 and+5.436
,+5.343
.+5.171
 the+5.054
 a+4.946
Neg logit contributions
umm-3.236
lor-3.092
ocol-3.057
arians-3.023
 former-3.015
Token a
Token ID257
Feature activation+5.50e+01

Pos logit contributions
 and+2.836
,+2.729
.+2.544
 the+2.434
 a+2.326
Neg logit contributions
umm-5.607
lor-5.539
 former-5.531
ocol-5.527
arians-5.425
Token book
Token ID1492
Feature activation+5.50e+01

Pos logit contributions
 and+3.978
,+3.861
.+3.723
 the+3.637
 a+3.491
Neg logit contributions
umm-4.503
lor-4.435
ocol-4.412
 former-4.388
osity-4.293
Token.
Token ID13
Feature activation+5.50e+01

Pos logit contributions
 and+2.158
,+2.061
 the+1.912
.+1.879
 a+1.767
Neg logit contributions
umm-6.242
lor-6.200
 former-6.108
arians-6.060
ocol-6.033
Token
Token ID198
Feature activation+5.50e+01

Pos logit contributions
 and+3.422
,+3.311
.+3.146
 the+3.061
 a+2.972
Neg logit contributions
umm-5.103
lor-5.014
ocol-4.973
 former-4.964
arians-4.938
Token
Token ID198
Feature activation+5.50e+01

Pos logit contributions
 and+1.628
,+1.546
.+1.348
 the+1.259
 a+1.183
Neg logit contributions
lor-6.844
umm-6.778
ocol-6.756
 former-6.702
arians-6.666
TokenE
Token ID36
Feature activation+5.50e+01

Pos logit contributions
 and+4.391
,+4.295
.+4.132
 the+4.027
 a+3.929
Neg logit contributions
umm-4.142
lor-4.130
ocol-4.071
 former-4.029
arians-4.022
Tokenma
Token ID2611
Feature activation+5.50e+01

Pos logit contributions
 and+6.525
,+6.390
.+6.233
 the+6.179
 a+6.097
Neg logit contributions
umm-2.089
lor-1.997
ocol-1.961
arians-1.918
 former-1.882
Token was
Token ID373
Feature activation+5.50e+01

Pos logit contributions
 and+3.152
,+3.079
.+2.965
 the+2.889
 a+2.813
Neg logit contributions
umm-5.187
lor-5.156
arians-5.118
ocol-5.050
 former-5.035
Token disappointed
Token ID11679
Feature activation+5.50e+01

Pos logit contributions
 and+2.925
,+2.784
.+2.645
 the+2.518
 a+2.403
Neg logit contributions
umm-5.576
lor-5.508
ocol-5.487
 former-5.462
arians-5.422
Token.
Token ID13
Feature activation+5.50e+01

Pos logit contributions
 and+1.025
,+0.895
 the+0.743
.+0.742
 a+0.661
Neg logit contributions
umm-7.430
ocol-7.260
 former-7.246
lor-7.236
arians-7.195
Token Reading
Token ID11725
Feature activation+5.50e+01

Pos logit contributions
 and+2.991
,+2.867
.+2.715
 the+2.661
 a+2.583
Neg logit contributions
umm-5.544
lor-5.379
 former-5.374
osity-5.369
ocol-5.357
Tokentr
Token ID2213
Feature activation+5.50e+01

Pos logit contributions
 and+6.150
,+6.047
 the+5.878
.+5.844
 a+5.752
Neg logit contributions
umm-2.378
lor-2.248
ocol-2.138
 former-2.128
arians-2.121
Tokenay
Token ID323
Feature activation+5.50e+01

Pos logit contributions
 and+5.511
,+5.378
.+5.232
 the+5.207
 a+5.095
Neg logit contributions
umm-3.115
lor-2.968
ocol-2.924
 former-2.849
rol-2.840
Token.
Token ID13
Feature activation+5.50e+01

Pos logit contributions
 and+1.476
,+1.354
 the+1.211
.+1.195
 a+1.110
Neg logit contributions
umm-6.913
lor-6.841
ocol-6.767
arians-6.747
osity-6.707
Token The
Token ID383
Feature activation+5.50e+01

Pos logit contributions
 and+3.306
,+3.182
.+3.022
 the+2.912
 a+2.831
Neg logit contributions
umm-5.262
lor-5.178
 former-5.114
ocol-5.097
arians-5.034
Token ash
Token ID12530
Feature activation+5.50e+01

Pos logit contributions
 and+4.227
,+4.121
.+3.954
 the+3.883
 a+3.759
Neg logit contributions
umm-4.339
ocol-4.229
 former-4.183
lor-4.143
arians-4.081
Tokentr
Token ID2213
Feature activation+5.50e+01

Pos logit contributions
 and+6.339
,+6.242
.+6.087
 the+6.038
 a+5.921
Neg logit contributions
umm-2.173
lor-2.052
ocol-1.945
arians-1.934
 former-1.932
Tokenay
Token ID323
Feature activation+5.50e+01

Pos logit contributions
 and+5.489
,+5.361
.+5.230
 the+5.167
 a+5.058
Neg logit contributions
umm-3.139
lor-2.974
ocol-2.960
 former-2.870
rol-2.861
Token fell
Token ID3214
Feature activation+5.50e+01

Pos logit contributions
 and+3.657
,+3.546
.+3.407
 the+3.326
 a+3.237
Neg logit contributions
umm-4.777
lor-4.729
ocol-4.685
arians-4.639
osity-4.602
Token down
Token ID866
Feature activation+5.50e+01

Pos logit contributions
 and+2.628
,+2.533
.+2.399
 the+2.340
 a+2.234
Neg logit contributions
umm-5.777
lor-5.646
osity-5.583
ocol-5.581
 former-5.573
Token and
Token ID290
Feature activation+5.50e+01

Pos logit contributions
 and+0.716
,+0.614
.+0.481
 the+0.399
 a+0.320
Neg logit contributions
umm-7.726
lor-7.571
ocol-7.546
 former-7.535
arians-7.491
Token broke
Token ID6265
Feature activation+5.50e+01

Pos logit contributions
 and+3.269
,+3.135
.+2.981
 the+2.830
 a+2.767
Neg logit contributions
umm-5.286
lor-5.169
ocol-5.141
 former-5.107
arians-5.065
Token hit
Token ID2277
Feature activation+5.50e+01

Pos logit contributions
 and+4.250
,+4.113
.+3.941
 the+3.850
 a+3.733
Neg logit contributions
umm-4.187
lor-4.168
ocol-4.137
 former-4.056
arians-4.042
Token her
Token ID607
Feature activation+5.50e+01

Pos logit contributions
 and+1.600
,+1.489
.+1.302
 the+1.124
 a+1.044
Neg logit contributions
umm-6.865
ocol-6.823
lor-6.733
 former-6.696
arians-6.652
Token mom
Token ID1995
Feature activation+5.50e+01

Pos logit contributions
 and+4.576
,+4.482
.+4.308
 the+4.285
 a+4.159
Neg logit contributions
umm-3.867
lor-3.747
 former-3.729
arians-3.680
ocol-3.676
Token's
Token ID338
Feature activation+5.50e+01

Pos logit contributions
 and+3.075
,+2.983
.+2.782
 the+2.778
 a+2.704
Neg logit contributions
umm-5.288
lor-5.234
ocol-5.219
arians-5.147
rol-5.115
Token ash
Token ID12530
Feature activation+5.50e+01

Pos logit contributions
 and+4.647
,+4.556
.+4.371
 the+4.329
 a+4.202
Neg logit contributions
umm-3.859
 former-3.742
ocol-3.725
lor-3.695
arians-3.643
Tokentr
Token ID2213
Feature activation+5.50e+01

Pos logit contributions
 and+6.150
,+6.047
 the+5.878
.+5.844
 a+5.752
Neg logit contributions
umm-2.378
lor-2.248
ocol-2.138
 former-2.128
arians-2.121
Tokenay
Token ID323
Feature activation+5.50e+01

Pos logit contributions
 and+5.511
,+5.378
.+5.232
 the+5.207
 a+5.095
Neg logit contributions
umm-3.115
lor-2.968
ocol-2.924
 former-2.849
rol-2.840
Token.
Token ID13
Feature activation+5.50e+01

Pos logit contributions
 and+1.476
,+1.354
 the+1.211
.+1.195
 a+1.110
Neg logit contributions
umm-6.913
lor-6.841
ocol-6.767
arians-6.747
osity-6.707
Token The
Token ID383
Feature activation+5.50e+01

Pos logit contributions
 and+3.306
,+3.182
.+3.022
 the+2.912
 a+2.831
Neg logit contributions
umm-5.262
lor-5.178
 former-5.114
ocol-5.097
arians-5.034
Token ash
Token ID12530
Feature activation+5.50e+01

Pos logit contributions
 and+4.227
,+4.121
.+3.954
 the+3.883
 a+3.759
Neg logit contributions
umm-4.339
ocol-4.229
 former-4.183
lor-4.143
arians-4.081
Tokentr
Token ID2213
Feature activation+5.50e+01

Pos logit contributions
 and+6.339
,+6.242
.+6.087
 the+6.038
 a+5.921
Neg logit contributions
umm-2.173
lor-2.052
ocol-1.945
arians-1.934
 former-1.932
Token named
Token ID3706
Feature activation+5.50e+01

Pos logit contributions
 and+3.243
,+3.124
.+3.001
 the+2.950
 a+2.857
Neg logit contributions
umm-5.172
lor-5.076
ocol-5.073
arians-4.998
 former-4.977
Token Bobby
Token ID17155
Feature activation+5.50e+01

Pos logit contributions
 and+3.566
,+3.383
.+3.246
 the+3.083
 a+3.017
Neg logit contributions
umm-5.044
ocol-4.923
lor-4.898
osity-4.854
arians-4.838
Token found
Token ID1043
Feature activation+5.50e+01

Pos logit contributions
 and+2.898
,+2.800
.+2.648
 the+2.609
 a+2.527
Neg logit contributions
umm-5.453
lor-5.426
ocol-5.356
arians-5.335
 former-5.309
Token the
Token ID262
Feature activation+5.50e+01

Pos logit contributions
 and+1.678
,+1.528
.+1.370
 the+1.198
 a+1.071
Neg logit contributions
umm-6.864
ocol-6.792
lor-6.792
rol-6.669
 former-6.652
Token ash
Token ID12530
Feature activation+5.50e+01

Pos logit contributions
 and+4.197
,+4.076
.+3.915
 the+3.847
 a+3.721
Neg logit contributions
umm-4.397
 former-4.228
ocol-4.227
lor-4.188
osity-4.096
Tokentr
Token ID2213
Feature activation+5.50e+01

Pos logit contributions
 and+6.664
,+6.561
 the+6.438
.+6.372
 a+6.296
Neg logit contributions
umm-1.862
lor-1.712
 former-1.604
arians-1.582
ocol-1.566
Tokenay
Token ID323
Feature activation+5.50e+01

Pos logit contributions
 and+5.503
,+5.370
.+5.230
 the+5.195
 a+5.085
Neg logit contributions
umm-3.137
lor-2.978
ocol-2.950
 former-2.860
rol-2.859
Token by
Token ID416
Feature activation+5.50e+01

Pos logit contributions
 and+2.210
,+2.108
.+1.937
 the+1.917
 a+1.824
Neg logit contributions
umm-6.169
lor-6.106
ocol-6.074
arians-6.038
osity-5.997
Token the
Token ID262
Feature activation+5.50e+01

Pos logit contributions
 and+2.985
,+2.886
.+2.690
 the+2.545
 a+2.466
Neg logit contributions
umm-5.485
 former-5.435
lor-5.405
ocol-5.385
arians-5.302
Token window
Token ID4324
Feature activation+5.50e+01

Pos logit contributions
 and+4.627
,+4.508
.+4.318
 the+4.230
 a+4.136
Neg logit contributions
umm-3.989
 former-3.845
ocol-3.774
lor-3.758
arians-3.752
Token.
Token ID13
Feature activation+5.50e+01

Pos logit contributions
 and+1.368
,+1.254
 the+1.080
.+1.043
 a+0.995
Neg logit contributions
umm-7.124
lor-7.007
 former-6.913
ocol-6.904
osity-6.880
Token\n
Token ID198
Feature activation+5.50e+01

Pos logit contributions
 and+1.425
,+1.315
.+1.144
 the+1.084
 a+0.991
Neg logit contributions
lor-7.009
umm-7.003
ocol-6.889
 former-6.865
arians-6.848
TokenSuddenly
Token ID38582
Feature activation+5.50e+01

Pos logit contributions
 and+4.916
,+4.809
.+4.660
 the+4.576
 a+4.473
Neg logit contributions
umm-3.645
lor-3.561
ocol-3.524
 former-3.495
arians-3.460
Token,
Token ID11
Feature activation+5.50e+01

Pos logit contributions
 and+0.377
,+0.162
.+0.117
 the-0.016
 a-0.110
Neg logit contributions
umm-8.081
ocol-8.005
lor-7.962
 former-7.889
arians-7.872
Token the
Token ID262
Feature activation+5.50e+01

Pos logit contributions
 and+4.160
,+4.001
.+3.880
 the+3.726
 a+3.625
Neg logit contributions
umm-4.352
ocol-4.243
lor-4.206
 former-4.182
arians-4.134
Token ash
Token ID12530
Feature activation+5.50e+01

Pos logit contributions
 and+4.236
,+4.117
.+3.955
 the+3.885
 a+3.783
Neg logit contributions
umm-4.368
ocol-4.180
 former-4.161
lor-4.121
osity-4.073
Tokentr
Token ID2213
Feature activation+5.50e+01

Pos logit contributions
 and+6.604
,+6.506
.+6.333
 the+6.322
 a+6.199
Neg logit contributions
umm-1.928
lor-1.775
 former-1.693
arians-1.678
osity-1.668
Tokenay
Token ID323
Feature activation+5.50e+01

Pos logit contributions
 and+5.527
,+5.395
.+5.265
 the+5.205
 a+5.095
Neg logit contributions
umm-3.132
lor-2.958
ocol-2.950
 former-2.855
osity-2.846
Token spoke
Token ID5158
Feature activation+5.50e+01

Pos logit contributions
 and+4.026
,+3.927
.+3.772
 the+3.715
 a+3.610
Neg logit contributions
umm-4.439
lor-4.374
ocol-4.309
arians-4.296
osity-4.255
Token,
Token ID11
Feature activation+5.50e+01

Pos logit contributions
 and+2.348
,+2.236
.+2.051
 the+2.043
 a+1.896
Neg logit contributions
umm-6.065
lor-5.982
 former-5.933
osity-5.923
ocol-5.897
Token �
Token ID6184
Feature activation+5.50e+01

Pos logit contributions
 and+2.911
,+2.856
.+2.691
 the+2.593
 a+2.509
Neg logit contributions
umm-5.428
lor-5.353
ocol-5.344
osity-5.330
 former-5.320
Token�
Token ID95
Feature activation+5.50e+01

Pos logit contributions
 and+6.522
,+6.363
.+6.209
 the+6.160
 a+6.053
Neg logit contributions
umm-2.158
lor-2.036
ocol-2.029
arians-1.944
 former-1.923
TokenDid
Token ID11633
Feature activation+5.50e+01

Pos logit contributions
 and+4.904
,+4.866
.+4.675
 the+4.548
 a+4.452
Neg logit contributions
umm-3.678
lor-3.605
ocol-3.556
 former-3.529
arians-3.497
Token you
Token ID345
Feature activation+5.50e+01

Pos logit contributions
 and+3.013
,+2.843
.+2.728
 the+2.562
 a+2.509
Neg logit contributions
umm-5.470
ocol-5.400
lor-5.372
arians-5.353
osity-5.311
Token destroy
Token ID4117
Feature activation+5.50e+01

Pos logit contributions
 and+3.681
,+3.563
.+3.408
 the+3.335
 a+3.213
Neg logit contributions
umm-4.812
ocol-4.719
 former-4.715
lor-4.652
osity-4.630
Token my
Token ID616
Feature activation+5.50e+01

Pos logit contributions
 and+2.724
,+2.581
.+2.444
 the+2.254
 a+2.200
Neg logit contributions
umm-5.738
ocol-5.726
 former-5.586
lor-5.578
arians-5.563
Token ash
Token ID12530
Feature activation+5.50e+01

Pos logit contributions
 and+4.383
,+4.259
.+4.083
 the+4.060
 a+3.945
Neg logit contributions
umm-4.214
 former-4.091
ocol-4.059
lor-3.971
arians-3.942
Tokentr
Token ID2213
Feature activation+5.50e+01

Pos logit contributions
 and+6.671
,+6.532
 the+6.392
.+6.375
 a+6.265
Neg logit contributions
umm-1.897
lor-1.754
 former-1.692
arians-1.664
ocol-1.631
Tokenay
Token ID323
Feature activation+5.50e+01

Pos logit contributions
 and+5.522
,+5.379
.+5.238
 the+5.211
 a+5.105
Neg logit contributions
umm-3.110
lor-2.955
ocol-2.917
 former-2.871
arians-2.837
Token?"
Token ID1701
Feature activation+5.50e+01

Pos logit contributions
 and+3.556
,+3.413
.+3.277
 the+3.242
 a+3.150
Neg logit contributions
umm-4.943
lor-4.839
arians-4.769
ocol-4.763
 former-4.722
Token Lily
Token ID20037
Feature activation+5.50e+01

Pos logit contributions
 and+2.251
,+2.115
.+1.968
 the+1.897
 a+1.823
Neg logit contributions
umm-6.215
lor-6.128
 former-6.095
ocol-6.084
arians-6.029
Token felt
Token ID2936
Feature activation+5.50e+01

Pos logit contributions
 and+3.309
,+3.192
.+3.085
 the+3.027
 a+2.943
Neg logit contributions
umm-5.072
lor-4.987
arians-4.943
 former-4.923
osity-4.862
Token bad
Token ID2089
Feature activation+5.50e+01

Pos logit contributions
 and+3.897
,+3.763
.+3.617
 the+3.506
 a+3.390
Neg logit contributions
umm-4.594
ocol-4.501
lor-4.471
 former-4.446
ubb-4.366
Token and
Token ID290
Feature activation+5.50e+01

Pos logit contributions
 and+0.315
,+0.235
 the+0.079
.+0.078
 a-0.009
Neg logit contributions
umm-8.058
lor-7.927
 former-7.906
arians-7.862
ocol-7.851
Token saw
Token ID2497
Feature activation+5.50e+01

Pos logit contributions
 and+2.771
,+2.656
.+2.512
 the+2.358
 a+2.303
Neg logit contributions
umm-5.648
lor-5.579
ocol-5.571
 former-5.546
arians-5.521
Token the
Token ID262
Feature activation+5.50e+01

Pos logit contributions
 and+2.003
,+1.878
.+1.714
 the+1.553
 a+1.496
Neg logit contributions
umm-6.496
ocol-6.448
lor-6.389
 former-6.333
osity-6.330
Token broken
Token ID5445
Feature activation+5.50e+01

Pos logit contributions
 and+4.144
,+4.044
.+3.869
 the+3.802
 a+3.684
Neg logit contributions
umm-4.450
ocol-4.316
 former-4.264
lor-4.250
arians-4.171
Token ash
Token ID12530
Feature activation+5.50e+01

Pos logit contributions
 and+4.239
,+4.103
.+3.948
 the+3.897
 a+3.798
Neg logit contributions
umm-4.306
ocol-4.187
 former-4.102
lor-4.101
osity-4.010
Tokentr
Token ID2213
Feature activation+5.50e+01

Pos logit contributions
 and+6.619
,+6.524
 the+6.357
.+6.322
 a+6.237
Neg logit contributions
umm-1.911
lor-1.750
 former-1.659
arians-1.651
ocol-1.645
Tokenay
Token ID323
Feature activation+5.50e+01

Pos logit contributions
 and+5.497
,+5.370
.+5.219
 the+5.184
 a+5.078
Neg logit contributions
umm-3.131
lor-2.975
ocol-2.944
 former-2.870
arians-2.846
Token.
Token ID13
Feature activation+5.50e+01

Pos logit contributions
 and+2.236
,+2.132
.+1.947
 the+1.933
 a+1.856
Neg logit contributions
umm-6.189
lor-6.084
ocol-6.052
arians-6.039
osity-6.002
Token She
Token ID1375
Feature activation+5.50e+01

Pos logit contributions
 and+2.896
,+2.782
.+2.643
 the+2.556
 a+2.475
Neg logit contributions
umm-5.637
lor-5.475
 former-5.457
ocol-5.435
osity-5.428
Token was
Token ID373
Feature activation+5.50e+01

Pos logit contributions
 and+2.991
,+2.903
.+2.789
 the+2.643
 a+2.576
Neg logit contributions
umm-5.402
lor-5.340
ocol-5.303
arians-5.277
 former-5.264
Token upset
Token ID9247
Feature activation+5.50e+01

Pos logit contributions
 and+3.298
,+3.190
.+3.045
 the+2.915
 a+2.799
Neg logit contributions
umm-5.176
ocol-5.090
lor-5.077
 former-5.063
arians-5.022
Token cleaned
Token ID20750
Feature activation+5.50e+01

Pos logit contributions
 and+3.895
,+3.797
.+3.648
 the+3.528
 a+3.435
Neg logit contributions
umm-4.568
lor-4.438
ocol-4.434
 former-4.417
osity-4.371
Token up
Token ID510
Feature activation+5.50e+01

Pos logit contributions
 and+2.101
,+2.004
.+1.878
 the+1.713
 a+1.668
Neg logit contributions
umm-6.381
ocol-6.224
lor-6.210
 former-6.199
arians-6.162
Token the
Token ID262
Feature activation+5.50e+01

Pos logit contributions
 and+1.261
,+1.168
.+1.031
 the+0.864
 a+0.815
Neg logit contributions
umm-7.189
ocol-7.070
 former-7.062
lor-7.007
arians-6.968
Token broken
Token ID5445
Feature activation+5.50e+01

Pos logit contributions
 and+4.276
,+4.180
.+4.015
 the+3.944
 a+3.849
Neg logit contributions
umm-4.297
ocol-4.157
 former-4.126
lor-4.095
arians-4.020
Token ash
Token ID12530
Feature activation+5.50e+01

Pos logit contributions
 and+4.272
,+4.154
.+4.013
 the+3.962
 a+3.868
Neg logit contributions
umm-4.204
ocol-4.111
 former-4.041
lor-4.020
arians-3.946
Tokentr
Token ID2213
Feature activation+5.50e+01

Pos logit contributions
 and+6.575
,+6.487
 the+6.338
.+6.320
 a+6.235
Neg logit contributions
umm-1.901
lor-1.743
 former-1.690
arians-1.660
osity-1.606
Tokenay
Token ID323
Feature activation+5.50e+01

Pos logit contributions
 and+5.497
,+5.375
.+5.232
 the+5.187
 a+5.096
Neg logit contributions
umm-3.106
lor-2.944
ocol-2.917
 former-2.861
arians-2.829
Token and
Token ID290
Feature activation+5.50e+01

Pos logit contributions
 and+2.000
,+1.915
.+1.772
 the+1.720
 a+1.665
Neg logit contributions
umm-6.390
lor-6.281
arians-6.241
ocol-6.214
osity-6.186
Token Lily
Token ID20037
Feature activation+5.50e+01

Pos logit contributions
 and+3.433
,+3.339
.+3.207
 the+3.027
 a+2.970
Neg logit contributions
umm-5.065
 former-4.939
ocol-4.899
lor-4.888
arians-4.857
Token learned
Token ID4499
Feature activation+5.50e+01

Pos logit contributions
 and+3.168
,+3.082
.+2.936
 the+2.874
 a+2.781
Neg logit contributions
umm-5.200
lor-5.137
arians-5.072
 former-5.062
ocol-5.014
Token to
Token ID284
Feature activation+5.50e+01

Pos logit contributions
 and+1.451
,+1.366
.+1.177
 the+1.087
 a+0.970
Neg logit contributions
umm-6.967
lor-6.938
 former-6.911
ocol-6.828
osity-6.806
Token also
Token ID635
Feature activation+5.50e+01

Pos logit contributions
 and+3.684
,+3.570
.+3.405
 the+3.290
 a+3.180
Neg logit contributions
umm-4.783
lor-4.715
ocol-4.694
 former-4.682
arians-4.611
Token visiting
Token ID10013
Feature activation+5.50e+01

Pos logit contributions
 and+3.633
,+3.517
.+3.360
 the+3.222
 a+3.113
Neg logit contributions
umm-4.844
ocol-4.773
lor-4.733
 former-4.732
ubb-4.664
Token Grand
Token ID5675
Feature activation+5.50e+01

Pos logit contributions
 and+3.235
,+3.129
.+2.925
 the+2.833
 a+2.775
Neg logit contributions
umm-5.227
ocol-5.141
lor-5.102
 former-5.088
arians-5.051
Tokenma
Token ID2611
Feature activation+5.50e+01

Pos logit contributions
 and+5.432
,+5.310
.+5.151
 the+5.088
 a+5.015
Neg logit contributions
umm-3.194
lor-3.100
ocol-3.065
arians-3.009
 former-2.973
Token.
Token ID13
Feature activation+5.50e+01

Pos logit contributions
 and+1.768
,+1.666
.+1.463
 the+1.458
 a+1.380
Neg logit contributions
umm-6.653
lor-6.587
 former-6.537
ocol-6.532
arians-6.521
Token Exc
Token ID25268
Feature activation+5.50e+01

Pos logit contributions
 and+3.531
,+3.407
.+3.259
 the+3.167
 a+3.091
Neg logit contributions
umm-5.017
lor-4.906
 former-4.858
ocol-4.846
osity-4.829
Tokenited
Token ID863
Feature activation+5.50e+01

Pos logit contributions
 and+7.863
,+7.739
.+7.587
 the+7.504
 a+7.423
Neg logit contributions
umm-0.752
ocol-0.644
lor-0.635
 former-0.575
arians-0.556
Tokenly
Token ID306
Feature activation+5.50e+01

Pos logit contributions
 and+4.590
,+4.443
.+4.321
 the+4.256
 a+4.204
Neg logit contributions
umm-3.925
ocol-3.785
lor-3.739
osity-3.687
 former-3.675
Token Tony
Token ID8832
Feature activation+5.50e+01

Pos logit contributions
 and+2.801
,+2.636
.+2.554
 the+2.438
 a+2.377
Neg logit contributions
umm-5.666
lor-5.605
ocol-5.554
arians-5.529
osity-5.495
Token asked
Token ID1965
Feature activation+5.50e+01

Pos logit contributions
 and+3.520
,+3.390
.+3.263
 the+3.183
 a+3.115
Neg logit contributions
umm-4.869
lor-4.861
ocol-4.774
arians-4.773
osity-4.730
Token Sarah
Token ID10490
Feature activation+5.50e+01

Pos logit contributions
 and+3.882
,+3.713
.+3.599
 the+3.495
 a+3.450
Neg logit contributions
umm-4.548
ocol-4.476
lor-4.474
osity-4.400
arians-4.398
Tokentr
Token ID2213
Feature activation+5.50e+01

Pos logit contributions
 and+6.374
,+6.244
 the+6.130
.+6.045
 a+5.990
Neg logit contributions
umm-2.198
lor-2.065
 former-1.937
ocol-1.932
arians-1.930
Tokenay
Token ID323
Feature activation+5.50e+01

Pos logit contributions
 and+5.552
,+5.413
.+5.267
 the+5.244
 a+5.126
Neg logit contributions
umm-3.103
lor-2.949
ocol-2.905
rol-2.844
 former-2.825
Token.
Token ID13
Feature activation+5.50e+01

Pos logit contributions
 and+2.882
,+2.758
 the+2.612
.+2.596
 a+2.500
Neg logit contributions
umm-5.551
lor-5.507
ocol-5.440
arians-5.393
osity-5.371
Token The
Token ID383
Feature activation+5.50e+01

Pos logit contributions
 and+3.550
,+3.429
.+3.274
 the+3.162
 a+3.049
Neg logit contributions
umm-5.054
lor-4.928
ocol-4.900
 former-4.831
osity-4.799
Token ash
Token ID12530
Feature activation+5.50e+01

Pos logit contributions
 and+4.310
,+4.181
.+4.011
 the+3.957
 a+3.843
Neg logit contributions
umm-4.314
ocol-4.169
 former-4.123
lor-4.070
arians-3.996
Tokentr
Token ID2213
Feature activation+5.50e+01

Pos logit contributions
 and+6.562
,+6.455
 the+6.303
.+6.295
 a+6.174
Neg logit contributions
umm-1.935
lor-1.820
 former-1.702
arians-1.700
ocol-1.681
Tokenay
Token ID323
Feature activation+5.50e+01

Pos logit contributions
 and+5.519
,+5.388
.+5.250
 the+5.203
 a+5.090
Neg logit contributions
umm-3.130
lor-2.961
ocol-2.943
rol-2.858
 former-2.854
Token lived
Token ID5615
Feature activation+5.50e+01

Pos logit contributions
 and+3.123
,+3.010
.+2.870
 the+2.815
 a+2.720
Neg logit contributions
umm-5.294
lor-5.257
ocol-5.215
arians-5.155
osity-5.112
Token in
Token ID287
Feature activation+5.50e+01

Pos logit contributions
 and+1.929
,+1.841
.+1.662
 the+1.584
 a+1.427
Neg logit contributions
umm-6.494
lor-6.439
 former-6.391
ocol-6.388
rol-6.333
Token a
Token ID257
Feature activation+5.50e+01

Pos logit contributions
 and+1.053
,+0.940
.+0.765
 the+0.582
 a+0.483
Neg logit contributions
umm-7.473
lor-7.396
ocol-7.349
 former-7.332
arians-7.235
Token house
Token ID2156
Feature activation+5.50e+01

Pos logit contributions
 and+3.667
,+3.501
.+3.369
 the+3.282
 a+3.190
Neg logit contributions
umm-4.931
lor-4.795
 former-4.776
ocol-4.765
arians-4.676
Token there
Token ID612
Feature activation+5.50e+01

Pos logit contributions
 and+2.005
,+1.852
.+1.751
 the+1.583
 a+1.484
Neg logit contributions
umm-6.471
ocol-6.377
lor-6.376
osity-6.321
 former-6.305
Token was
Token ID373
Feature activation+5.50e+01

Pos logit contributions
 and+1.610
,+1.458
.+1.359
 the+1.256
 a+1.165
Neg logit contributions
umm-6.831
lor-6.780
ocol-6.669
 former-6.630
arians-6.585
Token an
Token ID281
Feature activation+5.50e+01

Pos logit contributions
 and+0.846
,+0.744
.+0.581
 the+0.436
 a+0.254
Neg logit contributions
umm-7.655
lor-7.546
 former-7.542
ocol-7.528
osity-7.409
Token ordinary
Token ID8850
Feature activation+5.50e+01

Pos logit contributions
 and+4.095
,+3.979
.+3.857
 the+3.765
 a+3.665
Neg logit contributions
umm-4.463
ocol-4.333
 former-4.291
arians-4.208
lor-4.183
Token ash
Token ID12530
Feature activation+5.50e+01

Pos logit contributions
 and+4.511
,+4.369
.+4.253
 the+4.225
 a+4.111
Neg logit contributions
umm-4.010
 former-3.840
ocol-3.834
lor-3.782
arians-3.740
Tokentr
Token ID2213
Feature activation+5.50e+01

Pos logit contributions
 and+6.374
,+6.244
 the+6.130
.+6.045
 a+5.990
Neg logit contributions
umm-2.198
lor-2.065
 former-1.937
ocol-1.932
arians-1.930
Tokenay
Token ID323
Feature activation+5.50e+01

Pos logit contributions
 and+5.552
,+5.413
.+5.267
 the+5.244
 a+5.126
Neg logit contributions
umm-3.103
lor-2.949
ocol-2.905
rol-2.844
 former-2.825
Token.
Token ID13
Feature activation+5.50e+01

Pos logit contributions
 and+2.882
,+2.758
 the+2.612
.+2.596
 a+2.500
Neg logit contributions
umm-5.551
lor-5.507
ocol-5.440
arians-5.393
osity-5.371
Token The
Token ID383
Feature activation+5.50e+01

Pos logit contributions
 and+3.550
,+3.429
.+3.274
 the+3.162
 a+3.049
Neg logit contributions
umm-5.054
lor-4.928
ocol-4.900
 former-4.831
osity-4.799
Token ash
Token ID12530
Feature activation+5.50e+01

Pos logit contributions
 and+4.310
,+4.181
.+4.011
 the+3.957
 a+3.843
Neg logit contributions
umm-4.314
ocol-4.169
 former-4.123
lor-4.070
arians-3.996
Tokentr
Token ID2213
Feature activation+5.50e+01

Pos logit contributions
 and+6.562
,+6.455
 the+6.303
.+6.295
 a+6.174
Neg logit contributions
umm-1.935
lor-1.820
 former-1.702
arians-1.700
ocol-1.681
Token eat
Token ID4483
Feature activation+5.50e+01

Pos logit contributions
 and+3.439
,+3.291
.+3.148
 the+3.054
 a+2.958
Neg logit contributions
umm-5.036
ocol-4.994
 former-4.943
lor-4.930
arians-4.848
Token honey
Token ID12498
Feature activation+5.50e+01

Pos logit contributions
 and+4.262
,+4.138
.+4.001
 the+3.887
 a+3.825
Neg logit contributions
umm-4.211
ocol-4.138
lor-4.025
 former-3.990
arians-3.929
Token from
Token ID422
Feature activation+5.50e+01

Pos logit contributions
 and+1.829
,+1.711
 the+1.538
.+1.535
 a+1.431
Neg logit contributions
umm-6.655
lor-6.543
ocol-6.451
arians-6.446
 former-6.441
Token the
Token ID262
Feature activation+5.50e+01

Pos logit contributions
 and+3.151
,+3.019
.+2.852
 the+2.690
 a+2.627
Neg logit contributions
umm-5.396
ocol-5.331
 former-5.270
lor-5.207
arians-5.143
Token be
Token ID307
Feature activation+5.50e+01

Pos logit contributions
 and+4.148
,+4.011
.+3.846
 the+3.781
 a+3.681
Neg logit contributions
umm-4.526
lor-4.299
ocol-4.297
 former-4.293
arians-4.208
Tokeneh
Token ID17231
Feature activation+5.50e+01

Pos logit contributions
 and+6.677
,+6.554
.+6.356
 the+6.349
 a+6.221
Neg logit contributions
umm-1.806
lor-1.761
ocol-1.759
rim-1.648
 former-1.608
Tokenive
Token ID425
Feature activation+5.50e+01

Pos logit contributions
 and+7.021
,+6.888
 the+6.728
.+6.723
 a+6.613
Neg logit contributions
umm-1.683
lor-1.486
ocol-1.450
arians-1.404
 former-1.384
Token.
Token ID13
Feature activation+5.50e+01

Pos logit contributions
 and+1.419
,+1.322
 the+1.125
.+1.103
 a+1.033
Neg logit contributions
umm-7.017
lor-6.953
 former-6.861
ocol-6.860
arians-6.825
Token One
Token ID1881
Feature activation+5.50e+01

Pos logit contributions
 and+3.408
,+3.294
.+3.123
 the+3.033
 a+2.950
Neg logit contributions
umm-5.182
lor-5.021
ocol-5.005
 former-4.976
osity-4.911
Token day
Token ID1110
Feature activation+5.50e+01

Pos logit contributions
 and+2.890
,+2.765
.+2.626
 the+2.537
 a+2.442
Neg logit contributions
umm-5.627
lor-5.460
ocol-5.449
 former-5.415
osity-5.353
Token,
Token ID11
Feature activation+5.50e+01

Pos logit contributions
 and+0.204
,+0.002
.-0.077
 the-0.174
 a-0.261
Neg logit contributions
umm-8.298
ocol-8.213
lor-8.171
osity-8.086
arians-8.078
Token they
Token ID484
Feature activation+5.50e+01

Pos logit contributions
 and+2.348
,+2.202
.+2.079
 the+1.953
 a+1.856
Neg logit contributions
umm-6.161
ocol-6.032
lor-6.002
 former-5.955
osity-5.944
Token see
Token ID766
Feature activation+5.50e+01

Pos logit contributions
 and+2.828
,+2.721
.+2.595
 the+2.462
 a+2.391
Neg logit contributions
umm-5.655
ocol-5.545
lor-5.510
 former-5.491
arians-5.468
Token a
Token ID257
Feature activation+5.50e+01

Pos logit contributions
 and+0.756
,+0.635
.+0.479
 the+0.308
 a+0.143
Neg logit contributions
umm-7.610
lor-7.578
ocol-7.522
 former-7.489
arians-7.426
Token big
Token ID1263
Feature activation+5.50e+01

Pos logit contributions
 and+3.154
,+3.016
.+2.893
 the+2.793
 a+2.690
Neg logit contributions
umm-5.396
lor-5.249
ocol-5.215
 former-5.187
arians-5.160
Token ash
Token ID12530
Feature activation+5.50e+01

Pos logit contributions
 and+4.286
,+4.127
.+4.061
 the+3.941
 a+3.879
Neg logit contributions
umm-4.234
ocol-4.093
lor-4.004
 former-3.970
arians-3.967
Tokentr
Token ID2213
Feature activation+5.50e+01

Pos logit contributions
 and+6.438
,+6.326
 the+6.153
.+6.126
 a+6.030
Neg logit contributions
umm-2.135
lor-1.995
ocol-1.911
arians-1.876
 former-1.856
Tokenay
Token ID323
Feature activation+5.50e+01

Pos logit contributions
 and+5.808
,+5.674
.+5.525
 the+5.484
 a+5.372
Neg logit contributions
umm-2.829
lor-2.691
ocol-2.652
 former-2.564
arians-2.558
Token near
Token ID1474
Feature activation+5.50e+01

Pos logit contributions
 and+2.339
,+2.232
.+2.049
 the+2.019
 a+1.928
Neg logit contributions
umm-6.126
lor-6.069
ocol-6.036
arians-5.980
osity-5.945
Token a
Token ID257
Feature activation+5.50e+01

Pos logit contributions
 and+1.402
,+1.296
.+1.083
 the+0.928
 a+0.839
Neg logit contributions
umm-7.072
ocol-7.017
lor-6.965
 former-6.951
arians-6.875
Token bench
Token ID7624
Feature activation+5.50e+01

Pos logit contributions
 and+3.853
,+3.700
.+3.548
 the+3.452
 a+3.376
Neg logit contributions
umm-4.683
lor-4.586
ocol-4.548
 former-4.481
arians-4.468
Token.
Token ID13
Feature activation+5.50e+01

Pos logit contributions
 and+0.473
,+0.361
 the+0.162
.+0.139
 a+0.080
Neg logit contributions
umm-7.963
lor-7.877
ocol-7.819
arians-7.752
osity-7.749