TOP ACTIVATIONS
MAX = 1.000

Tom went to his bed and hid under the covers .
Tom went to his bed and hid under the covers
. Tom went to his bed and hid under
Tom went to his bed and hid under the
talking , table ?" The table did not say anything either
you talking , table ?" The table did not say anything
, table ?" The table did not say anything either .
table ?" The table did not say anything either . Tom
either . Tom was scared . Tom went to
anything either . Tom was scared . Tom went
not say anything either . Tom was scared .
say anything either . Tom was scared . Tom
Tom was scared . Tom went to his bed
. Tom was scared . Tom went to his
was scared . Tom went to his bed and
scared . Tom went to his bed and hid
did not say anything either . Tom was scared .
table did not say anything either . Tom was scared .
?" The table did not say anything either . Tom was
The table did not say anything either . Tom was scared

MIN ACTIVATIONS
MIN = 1.000

they all had great fun z ooming around together .
loud . " O w , ow , ow
can be !" H ilda was scared and tried
away . Fe eling frustrated and angry ,
scissors sn ip and sn ip and sn ip . The
ride in his raft . Exc ited , Lily jumped in
and sn ip and sn ip . The teacher followed the
. He tried to un plug the mixer , but he
and had fun pinch lot ion . It felt slim y
were very happy . <|endoftext|> E lla loved to visit her
. She saw another l oll ipop and it was bigger
fairy who was making l oll ip ops for all the
can be successful . â He was excited and felt
Ell y . E lly was very quiet and
some cookies and ice - cream inside ! They were so
long there was more fresh caul iflower to enjoy !
she noticed that the l oll ipop was gone . She
. " AAAA H !" Tom and Anna scream
. But she didn â t forget the memory
to worry . I â m so glad you

INTERVAL 1.000 - 1.000
CONTAINS 0.000%

TokenTom
Token ID13787
Feature activation+1.00e+00

Pos logit contributions
 and+3.687
,+3.562
.+3.397
 the+3.319
 a+3.225
Neg logit contributions
umm-4.882
lor-4.844
ocol-4.760
arians-4.719
 former-4.711
Token went
Token ID1816
Feature activation+1.00e+00

Pos logit contributions
 and+2.963
,+2.892
.+2.778
 the+2.711
 a+2.614
Neg logit contributions
umm-5.348
lor-5.343
arians-5.275
 former-5.215
ocol-5.205
Token to
Token ID284
Feature activation+1.00e+00

Pos logit contributions
 and+1.863
,+1.752
.+1.589
 the+1.515
 a+1.428
Neg logit contributions
umm-6.603
lor-6.542
ocol-6.493
 former-6.459
arians-6.451
Token his
Token ID465
Feature activation+1.00e+00

Pos logit contributions
 and+2.406
,+2.288
.+2.117
 the+1.958
 a+1.899
Neg logit contributions
umm-6.076
ocol-6.020
lor-5.976
 former-5.965
arians-5.918
Token bed
Token ID3996
Feature activation+1.00e+00

Pos logit contributions
 and+3.796
,+3.682
.+3.525
 the+3.457
 a+3.353
Neg logit contributions
umm-4.731
 former-4.621
lor-4.576
arians-4.555
ocol-4.543
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
 and+1.139
,+1.038
 the+0.892
.+0.868
 a+0.783
Neg logit contributions
umm-7.257
lor-7.218
 former-7.136
arians-7.077
ocol-7.042
Token hid
Token ID24519
Feature activation+1.00e+00

Pos logit contributions
 and+3.590
,+3.484
.+3.332
 the+3.198
 a+3.108
Neg logit contributions
umm-4.811
lor-4.796
 former-4.755
ocol-4.728
arians-4.696
Token under
Token ID739
Feature activation+1.00e+00

Pos logit contributions
 and+2.948
,+2.847
.+2.618
 the+2.548
 a+2.489
Neg logit contributions
umm-5.514
ocol-5.477
lor-5.387
arians-5.334
 former-5.329
Token the
Token ID262
Feature activation+1.00e+00

Pos logit contributions
 and+1.576
,+1.456
.+1.272
 the+1.108
 a+1.023
Neg logit contributions
umm-6.921
ocol-6.871
lor-6.809
 former-6.769
arians-6.698
Token covers
Token ID8698
Feature activation+1.00e+00

Pos logit contributions
 and+4.449
,+4.353
.+4.164
 the+4.122
 a+4.045
Neg logit contributions
umm-4.056
ocol-3.946
lor-3.928
 former-3.845
arians-3.803
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.788
,+0.680
.+0.454
 the+0.450
 a+0.390
Neg logit contributions
umm-7.681
lor-7.604
ocol-7.542
 former-7.447
arians-7.443
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.412
,+1.293
.+1.098
 the+1.020
 a+0.941
Neg logit contributions
lor-7.041
umm-7.024
ocol-6.946
 former-6.898
arians-6.858
TokenTom
Token ID13787
Feature activation+1.00e+00

Pos logit contributions
 and+3.687
,+3.562
.+3.397
 the+3.319
 a+3.225
Neg logit contributions
umm-4.882
lor-4.844
ocol-4.760
arians-4.719
 former-4.711
Token went
Token ID1816
Feature activation+1.00e+00

Pos logit contributions
 and+2.963
,+2.892
.+2.778
 the+2.711
 a+2.614
Neg logit contributions
umm-5.348
lor-5.343
arians-5.275
 former-5.215
ocol-5.205
Token to
Token ID284
Feature activation+1.00e+00

Pos logit contributions
 and+1.863
,+1.752
.+1.589
 the+1.515
 a+1.428
Neg logit contributions
umm-6.603
lor-6.542
ocol-6.493
 former-6.459
arians-6.451
Token his
Token ID465
Feature activation+1.00e+00

Pos logit contributions
 and+2.406
,+2.288
.+2.117
 the+1.958
 a+1.899
Neg logit contributions
umm-6.076
ocol-6.020
lor-5.976
 former-5.965
arians-5.918
Token bed
Token ID3996
Feature activation+1.00e+00

Pos logit contributions
 and+3.796
,+3.682
.+3.525
 the+3.457
 a+3.353
Neg logit contributions
umm-4.731
 former-4.621
lor-4.576
arians-4.555
ocol-4.543
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
 and+1.139
,+1.038
 the+0.892
.+0.868
 a+0.783
Neg logit contributions
umm-7.257
lor-7.218
 former-7.136
arians-7.077
ocol-7.042
Token hid
Token ID24519
Feature activation+1.00e+00

Pos logit contributions
 and+3.590
,+3.484
.+3.332
 the+3.198
 a+3.108
Neg logit contributions
umm-4.811
lor-4.796
 former-4.755
ocol-4.728
arians-4.696
Token under
Token ID739
Feature activation+1.00e+00

Pos logit contributions
 and+2.948
,+2.847
.+2.618
 the+2.548
 a+2.489
Neg logit contributions
umm-5.514
ocol-5.477
lor-5.387
arians-5.334
 former-5.329
Token the
Token ID262
Feature activation+1.00e+00

Pos logit contributions
 and+1.576
,+1.456
.+1.272
 the+1.108
 a+1.023
Neg logit contributions
umm-6.921
ocol-6.871
lor-6.809
 former-6.769
arians-6.698
Token covers
Token ID8698
Feature activation+1.00e+00

Pos logit contributions
 and+4.449
,+4.353
.+4.164
 the+4.122
 a+4.045
Neg logit contributions
umm-4.056
ocol-3.946
lor-3.928
 former-3.845
arians-3.803
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.530
,+0.416
.+0.244
 the+0.242
 a+0.178
Neg logit contributions
umm-7.916
ocol-7.783
lor-7.716
arians-7.661
osity-7.659
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+3.090
,+2.970
.+2.784
 the+2.704
 a+2.605
Neg logit contributions
umm-5.474
lor-5.356
ocol-5.303
 former-5.286
arians-5.252
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.412
,+1.293
.+1.098
 the+1.020
 a+0.941
Neg logit contributions
lor-7.041
umm-7.024
ocol-6.946
 former-6.898
arians-6.858
TokenTom
Token ID13787
Feature activation+1.00e+00

Pos logit contributions
 and+3.687
,+3.562
.+3.397
 the+3.319
 a+3.225
Neg logit contributions
umm-4.882
lor-4.844
ocol-4.760
arians-4.719
 former-4.711
Token went
Token ID1816
Feature activation+1.00e+00

Pos logit contributions
 and+2.963
,+2.892
.+2.778
 the+2.711
 a+2.614
Neg logit contributions
umm-5.348
lor-5.343
arians-5.275
 former-5.215
ocol-5.205
Token to
Token ID284
Feature activation+1.00e+00

Pos logit contributions
 and+1.863
,+1.752
.+1.589
 the+1.515
 a+1.428
Neg logit contributions
umm-6.603
lor-6.542
ocol-6.493
 former-6.459
arians-6.451
Token his
Token ID465
Feature activation+1.00e+00

Pos logit contributions
 and+2.406
,+2.288
.+2.117
 the+1.958
 a+1.899
Neg logit contributions
umm-6.076
ocol-6.020
lor-5.976
 former-5.965
arians-5.918
Token bed
Token ID3996
Feature activation+1.00e+00

Pos logit contributions
 and+3.796
,+3.682
.+3.525
 the+3.457
 a+3.353
Neg logit contributions
umm-4.731
 former-4.621
lor-4.576
arians-4.555
ocol-4.543
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
 and+1.139
,+1.038
 the+0.892
.+0.868
 a+0.783
Neg logit contributions
umm-7.257
lor-7.218
 former-7.136
arians-7.077
ocol-7.042
Token hid
Token ID24519
Feature activation+1.00e+00

Pos logit contributions
 and+3.590
,+3.484
.+3.332
 the+3.198
 a+3.108
Neg logit contributions
umm-4.811
lor-4.796
 former-4.755
ocol-4.728
arians-4.696
Token under
Token ID739
Feature activation+1.00e+00

Pos logit contributions
 and+2.948
,+2.847
.+2.618
 the+2.548
 a+2.489
Neg logit contributions
umm-5.514
ocol-5.477
lor-5.387
arians-5.334
 former-5.329
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+3.090
,+2.970
.+2.784
 the+2.704
 a+2.605
Neg logit contributions
umm-5.474
lor-5.356
ocol-5.303
 former-5.286
arians-5.252
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.412
,+1.293
.+1.098
 the+1.020
 a+0.941
Neg logit contributions
lor-7.041
umm-7.024
ocol-6.946
 former-6.898
arians-6.858
TokenTom
Token ID13787
Feature activation+1.00e+00

Pos logit contributions
 and+3.687
,+3.562
.+3.397
 the+3.319
 a+3.225
Neg logit contributions
umm-4.882
lor-4.844
ocol-4.760
arians-4.719
 former-4.711
Token went
Token ID1816
Feature activation+1.00e+00

Pos logit contributions
 and+2.963
,+2.892
.+2.778
 the+2.711
 a+2.614
Neg logit contributions
umm-5.348
lor-5.343
arians-5.275
 former-5.215
ocol-5.205
Token to
Token ID284
Feature activation+1.00e+00

Pos logit contributions
 and+1.863
,+1.752
.+1.589
 the+1.515
 a+1.428
Neg logit contributions
umm-6.603
lor-6.542
ocol-6.493
 former-6.459
arians-6.451
Token his
Token ID465
Feature activation+1.00e+00

Pos logit contributions
 and+2.406
,+2.288
.+2.117
 the+1.958
 a+1.899
Neg logit contributions
umm-6.076
ocol-6.020
lor-5.976
 former-5.965
arians-5.918
Token bed
Token ID3996
Feature activation+1.00e+00

Pos logit contributions
 and+3.796
,+3.682
.+3.525
 the+3.457
 a+3.353
Neg logit contributions
umm-4.731
 former-4.621
lor-4.576
arians-4.555
ocol-4.543
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
 and+1.139
,+1.038
 the+0.892
.+0.868
 a+0.783
Neg logit contributions
umm-7.257
lor-7.218
 former-7.136
arians-7.077
ocol-7.042
Token hid
Token ID24519
Feature activation+1.00e+00

Pos logit contributions
 and+3.590
,+3.484
.+3.332
 the+3.198
 a+3.108
Neg logit contributions
umm-4.811
lor-4.796
 former-4.755
ocol-4.728
arians-4.696
Token under
Token ID739
Feature activation+1.00e+00

Pos logit contributions
 and+2.948
,+2.847
.+2.618
 the+2.548
 a+2.489
Neg logit contributions
umm-5.514
ocol-5.477
lor-5.387
arians-5.334
 former-5.329
Token the
Token ID262
Feature activation+1.00e+00

Pos logit contributions
 and+1.576
,+1.456
.+1.272
 the+1.108
 a+1.023
Neg logit contributions
umm-6.921
ocol-6.871
lor-6.809
 former-6.769
arians-6.698
Token talking
Token ID3375
Feature activation+1.00e+00

Pos logit contributions
 and+3.769
,+3.621
.+3.469
 the+3.427
 a+3.317
Neg logit contributions
umm-4.672
lor-4.617
ocol-4.582
 former-4.572
arians-4.534
Token,
Token ID11
Feature activation+1.00e+00

Pos logit contributions
 and+2.798
,+2.700
.+2.519
 the+2.499
 a+2.384
Neg logit contributions
umm-5.631
lor-5.565
 former-5.507
ocol-5.488
arians-5.468
Token table
Token ID3084
Feature activation+1.00e+00

Pos logit contributions
 and+3.745
,+3.653
.+3.526
 the+3.412
 a+3.331
Neg logit contributions
umm-4.714
ocol-4.595
 former-4.582
lor-4.571
arians-4.538
Token?"
Token ID1701
Feature activation+1.00e+00

Pos logit contributions
 and+2.163
,+2.045
.+1.893
 the+1.892
 a+1.782
Neg logit contributions
umm-6.283
lor-6.172
ocol-6.126
 former-6.089
arians-6.067
Token The
Token ID383
Feature activation+1.00e+00

Pos logit contributions
 and+3.157
,+3.027
.+2.874
 the+2.762
 a+2.693
Neg logit contributions
umm-5.294
ocol-5.226
lor-5.208
 former-5.169
arians-5.127
Token table
Token ID3084
Feature activation+1.00e+00

Pos logit contributions
 and+4.237
,+4.129
.+3.956
 the+3.874
 a+3.767
Neg logit contributions
umm-4.312
ocol-4.202
 former-4.137
lor-4.128
arians-4.097
Token did
Token ID750
Feature activation+1.00e+00

Pos logit contributions
 and+3.767
,+3.652
.+3.507
 the+3.465
 a+3.360
Neg logit contributions
umm-4.683
lor-4.598
ocol-4.544
 former-4.491
arians-4.473
Token not
Token ID407
Feature activation+1.00e+00

Pos logit contributions
 and+2.385
,+2.255
.+2.096
 the+2.031
 a+1.925
Neg logit contributions
umm-6.051
lor-5.998
ocol-5.971
 former-5.955
arians-5.896
Token say
Token ID910
Feature activation+1.00e+00

Pos logit contributions
 and+3.928
,+3.808
.+3.643
 the+3.586
 a+3.479
Neg logit contributions
umm-4.599
ocol-4.497
lor-4.441
 former-4.426
arians-4.395
Token anything
Token ID1997
Feature activation+1.00e+00

Pos logit contributions
 and+4.183
,+4.003
.+3.863
 the+3.778
 a+3.681
Neg logit contributions
umm-4.334
lor-4.286
ocol-4.250
osity-4.151
 former-4.151
Token either
Token ID2035
Feature activation+1.00e+00

Pos logit contributions
 and+1.879
,+1.754
.+1.579
 the+1.551
 a+1.473
Neg logit contributions
umm-6.560
lor-6.471
ocol-6.446
 former-6.397
arians-6.397
Token you
Token ID345
Feature activation+1.00e+00

Pos logit contributions
 and+3.252
,+3.147
.+3.013
 the+2.930
 a+2.863
Neg logit contributions
umm-5.173
lor-5.075
arians-5.061
ocol-5.038
 former-5.002
Token talking
Token ID3375
Feature activation+1.00e+00

Pos logit contributions
 and+3.769
,+3.621
.+3.469
 the+3.427
 a+3.317
Neg logit contributions
umm-4.672
lor-4.617
ocol-4.582
 former-4.572
arians-4.534
Token,
Token ID11
Feature activation+1.00e+00

Pos logit contributions
 and+2.798
,+2.700
.+2.519
 the+2.499
 a+2.384
Neg logit contributions
umm-5.631
lor-5.565
 former-5.507
ocol-5.488
arians-5.468
Token table
Token ID3084
Feature activation+1.00e+00

Pos logit contributions
 and+3.745
,+3.653
.+3.526
 the+3.412
 a+3.331
Neg logit contributions
umm-4.714
ocol-4.595
 former-4.582
lor-4.571
arians-4.538
Token?"
Token ID1701
Feature activation+1.00e+00

Pos logit contributions
 and+2.163
,+2.045
.+1.893
 the+1.892
 a+1.782
Neg logit contributions
umm-6.283
lor-6.172
ocol-6.126
 former-6.089
arians-6.067
Token The
Token ID383
Feature activation+1.00e+00

Pos logit contributions
 and+3.157
,+3.027
.+2.874
 the+2.762
 a+2.693
Neg logit contributions
umm-5.294
ocol-5.226
lor-5.208
 former-5.169
arians-5.127
Token table
Token ID3084
Feature activation+1.00e+00

Pos logit contributions
 and+4.237
,+4.129
.+3.956
 the+3.874
 a+3.767
Neg logit contributions
umm-4.312
ocol-4.202
 former-4.137
lor-4.128
arians-4.097
Token did
Token ID750
Feature activation+1.00e+00

Pos logit contributions
 and+3.767
,+3.652
.+3.507
 the+3.465
 a+3.360
Neg logit contributions
umm-4.683
lor-4.598
ocol-4.544
 former-4.491
arians-4.473
Token not
Token ID407
Feature activation+1.00e+00

Pos logit contributions
 and+2.385
,+2.255
.+2.096
 the+2.031
 a+1.925
Neg logit contributions
umm-6.051
lor-5.998
ocol-5.971
 former-5.955
arians-5.896
Token say
Token ID910
Feature activation+1.00e+00

Pos logit contributions
 and+3.928
,+3.808
.+3.643
 the+3.586
 a+3.479
Neg logit contributions
umm-4.599
ocol-4.497
lor-4.441
 former-4.426
arians-4.395
Token anything
Token ID1997
Feature activation+1.00e+00

Pos logit contributions
 and+4.183
,+4.003
.+3.863
 the+3.778
 a+3.681
Neg logit contributions
umm-4.334
lor-4.286
ocol-4.250
osity-4.151
 former-4.151
Token,
Token ID11
Feature activation+1.00e+00

Pos logit contributions
 and+2.798
,+2.700
.+2.519
 the+2.499
 a+2.384
Neg logit contributions
umm-5.631
lor-5.565
 former-5.507
ocol-5.488
arians-5.468
Token table
Token ID3084
Feature activation+1.00e+00

Pos logit contributions
 and+3.745
,+3.653
.+3.526
 the+3.412
 a+3.331
Neg logit contributions
umm-4.714
ocol-4.595
 former-4.582
lor-4.571
arians-4.538
Token?"
Token ID1701
Feature activation+1.00e+00

Pos logit contributions
 and+2.163
,+2.045
.+1.893
 the+1.892
 a+1.782
Neg logit contributions
umm-6.283
lor-6.172
ocol-6.126
 former-6.089
arians-6.067
Token The
Token ID383
Feature activation+1.00e+00

Pos logit contributions
 and+3.157
,+3.027
.+2.874
 the+2.762
 a+2.693
Neg logit contributions
umm-5.294
ocol-5.226
lor-5.208
 former-5.169
arians-5.127
Token table
Token ID3084
Feature activation+1.00e+00

Pos logit contributions
 and+4.237
,+4.129
.+3.956
 the+3.874
 a+3.767
Neg logit contributions
umm-4.312
ocol-4.202
 former-4.137
lor-4.128
arians-4.097
Token did
Token ID750
Feature activation+1.00e+00

Pos logit contributions
 and+3.767
,+3.652
.+3.507
 the+3.465
 a+3.360
Neg logit contributions
umm-4.683
lor-4.598
ocol-4.544
 former-4.491
arians-4.473
Token not
Token ID407
Feature activation+1.00e+00

Pos logit contributions
 and+2.385
,+2.255
.+2.096
 the+2.031
 a+1.925
Neg logit contributions
umm-6.051
lor-5.998
ocol-5.971
 former-5.955
arians-5.896
Token say
Token ID910
Feature activation+1.00e+00

Pos logit contributions
 and+3.928
,+3.808
.+3.643
 the+3.586
 a+3.479
Neg logit contributions
umm-4.599
ocol-4.497
lor-4.441
 former-4.426
arians-4.395
Token anything
Token ID1997
Feature activation+1.00e+00

Pos logit contributions
 and+4.183
,+4.003
.+3.863
 the+3.778
 a+3.681
Neg logit contributions
umm-4.334
lor-4.286
ocol-4.250
osity-4.151
 former-4.151
Token either
Token ID2035
Feature activation+1.00e+00

Pos logit contributions
 and+1.879
,+1.754
.+1.579
 the+1.551
 a+1.473
Neg logit contributions
umm-6.560
lor-6.471
ocol-6.446
 former-6.397
arians-6.397
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.913
,+0.778
.+0.608
 the+0.606
 a+0.525
Neg logit contributions
umm-7.529
lor-7.435
ocol-7.407
osity-7.325
arians-7.323
Token table
Token ID3084
Feature activation+1.00e+00

Pos logit contributions
 and+3.745
,+3.653
.+3.526
 the+3.412
 a+3.331
Neg logit contributions
umm-4.714
ocol-4.595
 former-4.582
lor-4.571
arians-4.538
Token?"
Token ID1701
Feature activation+1.00e+00

Pos logit contributions
 and+2.163
,+2.045
.+1.893
 the+1.892
 a+1.782
Neg logit contributions
umm-6.283
lor-6.172
ocol-6.126
 former-6.089
arians-6.067
Token The
Token ID383
Feature activation+1.00e+00

Pos logit contributions
 and+3.157
,+3.027
.+2.874
 the+2.762
 a+2.693
Neg logit contributions
umm-5.294
ocol-5.226
lor-5.208
 former-5.169
arians-5.127
Token table
Token ID3084
Feature activation+1.00e+00

Pos logit contributions
 and+4.237
,+4.129
.+3.956
 the+3.874
 a+3.767
Neg logit contributions
umm-4.312
ocol-4.202
 former-4.137
lor-4.128
arians-4.097
Token did
Token ID750
Feature activation+1.00e+00

Pos logit contributions
 and+3.767
,+3.652
.+3.507
 the+3.465
 a+3.360
Neg logit contributions
umm-4.683
lor-4.598
ocol-4.544
 former-4.491
arians-4.473
Token not
Token ID407
Feature activation+1.00e+00

Pos logit contributions
 and+2.385
,+2.255
.+2.096
 the+2.031
 a+1.925
Neg logit contributions
umm-6.051
lor-5.998
ocol-5.971
 former-5.955
arians-5.896
Token say
Token ID910
Feature activation+1.00e+00

Pos logit contributions
 and+3.928
,+3.808
.+3.643
 the+3.586
 a+3.479
Neg logit contributions
umm-4.599
ocol-4.497
lor-4.441
 former-4.426
arians-4.395
Token anything
Token ID1997
Feature activation+1.00e+00

Pos logit contributions
 and+4.183
,+4.003
.+3.863
 the+3.778
 a+3.681
Neg logit contributions
umm-4.334
lor-4.286
ocol-4.250
osity-4.151
 former-4.151
Token either
Token ID2035
Feature activation+1.00e+00

Pos logit contributions
 and+1.879
,+1.754
.+1.579
 the+1.551
 a+1.473
Neg logit contributions
umm-6.560
lor-6.471
ocol-6.446
 former-6.397
arians-6.397
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.913
,+0.778
.+0.608
 the+0.606
 a+0.525
Neg logit contributions
umm-7.529
lor-7.435
ocol-7.407
osity-7.325
arians-7.323
Token Tom
Token ID4186
Feature activation+1.00e+00

Pos logit contributions
 and+2.972
,+2.861
.+2.681
 the+2.591
 a+2.495
Neg logit contributions
umm-5.549
lor-5.485
ocol-5.433
 former-5.405
arians-5.364
Token either
Token ID2035
Feature activation+1.00e+00

Pos logit contributions
 and+1.879
,+1.754
.+1.579
 the+1.551
 a+1.473
Neg logit contributions
umm-6.560
lor-6.471
ocol-6.446
 former-6.397
arians-6.397
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.913
,+0.778
.+0.608
 the+0.606
 a+0.525
Neg logit contributions
umm-7.529
lor-7.435
ocol-7.407
osity-7.325
arians-7.323
Token Tom
Token ID4186
Feature activation+1.00e+00

Pos logit contributions
 and+2.972
,+2.861
.+2.681
 the+2.591
 a+2.495
Neg logit contributions
umm-5.549
lor-5.485
ocol-5.433
 former-5.405
arians-5.364
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
 and+3.005
,+2.934
.+2.806
 the+2.736
 a+2.639
Neg logit contributions
umm-5.344
lor-5.304
arians-5.239
 former-5.221
ocol-5.202
Token scared
Token ID12008
Feature activation+1.00e+00

Pos logit contributions
 and+3.713
,+3.613
.+3.442
 the+3.330
 a+3.196
Neg logit contributions
umm-4.723
ocol-4.659
 former-4.649
lor-4.636
arians-4.572
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.530
,+0.416
.+0.244
 the+0.242
 a+0.178
Neg logit contributions
umm-7.916
ocol-7.783
lor-7.716
arians-7.661
osity-7.659
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+3.090
,+2.970
.+2.784
 the+2.704
 a+2.605
Neg logit contributions
umm-5.474
lor-5.356
ocol-5.303
 former-5.286
arians-5.252
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.412
,+1.293
.+1.098
 the+1.020
 a+0.941
Neg logit contributions
lor-7.041
umm-7.024
ocol-6.946
 former-6.898
arians-6.858
TokenTom
Token ID13787
Feature activation+1.00e+00

Pos logit contributions
 and+3.687
,+3.562
.+3.397
 the+3.319
 a+3.225
Neg logit contributions
umm-4.882
lor-4.844
ocol-4.760
arians-4.719
 former-4.711
Token went
Token ID1816
Feature activation+1.00e+00

Pos logit contributions
 and+2.963
,+2.892
.+2.778
 the+2.711
 a+2.614
Neg logit contributions
umm-5.348
lor-5.343
arians-5.275
 former-5.215
ocol-5.205
Token to
Token ID284
Feature activation+1.00e+00

Pos logit contributions
 and+1.863
,+1.752
.+1.589
 the+1.515
 a+1.428
Neg logit contributions
umm-6.603
lor-6.542
ocol-6.493
 former-6.459
arians-6.451
Token anything
Token ID1997
Feature activation+1.00e+00

Pos logit contributions
 and+4.183
,+4.003
.+3.863
 the+3.778
 a+3.681
Neg logit contributions
umm-4.334
lor-4.286
ocol-4.250
osity-4.151
 former-4.151
Token either
Token ID2035
Feature activation+1.00e+00

Pos logit contributions
 and+1.879
,+1.754
.+1.579
 the+1.551
 a+1.473
Neg logit contributions
umm-6.560
lor-6.471
ocol-6.446
 former-6.397
arians-6.397
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.913
,+0.778
.+0.608
 the+0.606
 a+0.525
Neg logit contributions
umm-7.529
lor-7.435
ocol-7.407
osity-7.325
arians-7.323
Token Tom
Token ID4186
Feature activation+1.00e+00

Pos logit contributions
 and+2.972
,+2.861
.+2.681
 the+2.591
 a+2.495
Neg logit contributions
umm-5.549
lor-5.485
ocol-5.433
 former-5.405
arians-5.364
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
 and+3.005
,+2.934
.+2.806
 the+2.736
 a+2.639
Neg logit contributions
umm-5.344
lor-5.304
arians-5.239
 former-5.221
ocol-5.202
Token scared
Token ID12008
Feature activation+1.00e+00

Pos logit contributions
 and+3.713
,+3.613
.+3.442
 the+3.330
 a+3.196
Neg logit contributions
umm-4.723
ocol-4.659
 former-4.649
lor-4.636
arians-4.572
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.530
,+0.416
.+0.244
 the+0.242
 a+0.178
Neg logit contributions
umm-7.916
ocol-7.783
lor-7.716
arians-7.661
osity-7.659
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+3.090
,+2.970
.+2.784
 the+2.704
 a+2.605
Neg logit contributions
umm-5.474
lor-5.356
ocol-5.303
 former-5.286
arians-5.252
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.412
,+1.293
.+1.098
 the+1.020
 a+0.941
Neg logit contributions
lor-7.041
umm-7.024
ocol-6.946
 former-6.898
arians-6.858
TokenTom
Token ID13787
Feature activation+1.00e+00

Pos logit contributions
 and+3.687
,+3.562
.+3.397
 the+3.319
 a+3.225
Neg logit contributions
umm-4.882
lor-4.844
ocol-4.760
arians-4.719
 former-4.711
Token went
Token ID1816
Feature activation+1.00e+00

Pos logit contributions
 and+2.963
,+2.892
.+2.778
 the+2.711
 a+2.614
Neg logit contributions
umm-5.348
lor-5.343
arians-5.275
 former-5.215
ocol-5.205
Token not
Token ID407
Feature activation+1.00e+00

Pos logit contributions
 and+2.385
,+2.255
.+2.096
 the+2.031
 a+1.925
Neg logit contributions
umm-6.051
lor-5.998
ocol-5.971
 former-5.955
arians-5.896
Token say
Token ID910
Feature activation+1.00e+00

Pos logit contributions
 and+3.928
,+3.808
.+3.643
 the+3.586
 a+3.479
Neg logit contributions
umm-4.599
ocol-4.497
lor-4.441
 former-4.426
arians-4.395
Token anything
Token ID1997
Feature activation+1.00e+00

Pos logit contributions
 and+4.183
,+4.003
.+3.863
 the+3.778
 a+3.681
Neg logit contributions
umm-4.334
lor-4.286
ocol-4.250
osity-4.151
 former-4.151
Token either
Token ID2035
Feature activation+1.00e+00

Pos logit contributions
 and+1.879
,+1.754
.+1.579
 the+1.551
 a+1.473
Neg logit contributions
umm-6.560
lor-6.471
ocol-6.446
 former-6.397
arians-6.397
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.913
,+0.778
.+0.608
 the+0.606
 a+0.525
Neg logit contributions
umm-7.529
lor-7.435
ocol-7.407
osity-7.325
arians-7.323
Token Tom
Token ID4186
Feature activation+1.00e+00

Pos logit contributions
 and+2.972
,+2.861
.+2.681
 the+2.591
 a+2.495
Neg logit contributions
umm-5.549
lor-5.485
ocol-5.433
 former-5.405
arians-5.364
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
 and+3.005
,+2.934
.+2.806
 the+2.736
 a+2.639
Neg logit contributions
umm-5.344
lor-5.304
arians-5.239
 former-5.221
ocol-5.202
Token scared
Token ID12008
Feature activation+1.00e+00

Pos logit contributions
 and+3.713
,+3.613
.+3.442
 the+3.330
 a+3.196
Neg logit contributions
umm-4.723
ocol-4.659
 former-4.649
lor-4.636
arians-4.572
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.530
,+0.416
.+0.244
 the+0.242
 a+0.178
Neg logit contributions
umm-7.916
ocol-7.783
lor-7.716
arians-7.661
osity-7.659
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+3.090
,+2.970
.+2.784
 the+2.704
 a+2.605
Neg logit contributions
umm-5.474
lor-5.356
ocol-5.303
 former-5.286
arians-5.252
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.412
,+1.293
.+1.098
 the+1.020
 a+0.941
Neg logit contributions
lor-7.041
umm-7.024
ocol-6.946
 former-6.898
arians-6.858
Token say
Token ID910
Feature activation+1.00e+00

Pos logit contributions
 and+3.928
,+3.808
.+3.643
 the+3.586
 a+3.479
Neg logit contributions
umm-4.599
ocol-4.497
lor-4.441
 former-4.426
arians-4.395
Token anything
Token ID1997
Feature activation+1.00e+00

Pos logit contributions
 and+4.183
,+4.003
.+3.863
 the+3.778
 a+3.681
Neg logit contributions
umm-4.334
lor-4.286
ocol-4.250
osity-4.151
 former-4.151
Token either
Token ID2035
Feature activation+1.00e+00

Pos logit contributions
 and+1.879
,+1.754
.+1.579
 the+1.551
 a+1.473
Neg logit contributions
umm-6.560
lor-6.471
ocol-6.446
 former-6.397
arians-6.397
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.913
,+0.778
.+0.608
 the+0.606
 a+0.525
Neg logit contributions
umm-7.529
lor-7.435
ocol-7.407
osity-7.325
arians-7.323
Token Tom
Token ID4186
Feature activation+1.00e+00

Pos logit contributions
 and+2.972
,+2.861
.+2.681
 the+2.591
 a+2.495
Neg logit contributions
umm-5.549
lor-5.485
ocol-5.433
 former-5.405
arians-5.364
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
 and+3.005
,+2.934
.+2.806
 the+2.736
 a+2.639
Neg logit contributions
umm-5.344
lor-5.304
arians-5.239
 former-5.221
ocol-5.202
Token scared
Token ID12008
Feature activation+1.00e+00

Pos logit contributions
 and+3.713
,+3.613
.+3.442
 the+3.330
 a+3.196
Neg logit contributions
umm-4.723
ocol-4.659
 former-4.649
lor-4.636
arians-4.572
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.530
,+0.416
.+0.244
 the+0.242
 a+0.178
Neg logit contributions
umm-7.916
ocol-7.783
lor-7.716
arians-7.661
osity-7.659
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+3.090
,+2.970
.+2.784
 the+2.704
 a+2.605
Neg logit contributions
umm-5.474
lor-5.356
ocol-5.303
 former-5.286
arians-5.252
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.412
,+1.293
.+1.098
 the+1.020
 a+0.941
Neg logit contributions
lor-7.041
umm-7.024
ocol-6.946
 former-6.898
arians-6.858
TokenTom
Token ID13787
Feature activation+1.00e+00

Pos logit contributions
 and+3.687
,+3.562
.+3.397
 the+3.319
 a+3.225
Neg logit contributions
umm-4.882
lor-4.844
ocol-4.760
arians-4.719
 former-4.711
Token Tom
Token ID4186
Feature activation+1.00e+00

Pos logit contributions
 and+2.972
,+2.861
.+2.681
 the+2.591
 a+2.495
Neg logit contributions
umm-5.549
lor-5.485
ocol-5.433
 former-5.405
arians-5.364
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
 and+3.005
,+2.934
.+2.806
 the+2.736
 a+2.639
Neg logit contributions
umm-5.344
lor-5.304
arians-5.239
 former-5.221
ocol-5.202
Token scared
Token ID12008
Feature activation+1.00e+00

Pos logit contributions
 and+3.713
,+3.613
.+3.442
 the+3.330
 a+3.196
Neg logit contributions
umm-4.723
ocol-4.659
 former-4.649
lor-4.636
arians-4.572
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.530
,+0.416
.+0.244
 the+0.242
 a+0.178
Neg logit contributions
umm-7.916
ocol-7.783
lor-7.716
arians-7.661
osity-7.659
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+3.090
,+2.970
.+2.784
 the+2.704
 a+2.605
Neg logit contributions
umm-5.474
lor-5.356
ocol-5.303
 former-5.286
arians-5.252
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.412
,+1.293
.+1.098
 the+1.020
 a+0.941
Neg logit contributions
lor-7.041
umm-7.024
ocol-6.946
 former-6.898
arians-6.858
TokenTom
Token ID13787
Feature activation+1.00e+00

Pos logit contributions
 and+3.687
,+3.562
.+3.397
 the+3.319
 a+3.225
Neg logit contributions
umm-4.882
lor-4.844
ocol-4.760
arians-4.719
 former-4.711
Token went
Token ID1816
Feature activation+1.00e+00

Pos logit contributions
 and+2.963
,+2.892
.+2.778
 the+2.711
 a+2.614
Neg logit contributions
umm-5.348
lor-5.343
arians-5.275
 former-5.215
ocol-5.205
Token to
Token ID284
Feature activation+1.00e+00

Pos logit contributions
 and+1.863
,+1.752
.+1.589
 the+1.515
 a+1.428
Neg logit contributions
umm-6.603
lor-6.542
ocol-6.493
 former-6.459
arians-6.451
Token his
Token ID465
Feature activation+1.00e+00

Pos logit contributions
 and+2.406
,+2.288
.+2.117
 the+1.958
 a+1.899
Neg logit contributions
umm-6.076
ocol-6.020
lor-5.976
 former-5.965
arians-5.918
Token bed
Token ID3996
Feature activation+1.00e+00

Pos logit contributions
 and+3.796
,+3.682
.+3.525
 the+3.457
 a+3.353
Neg logit contributions
umm-4.731
 former-4.621
lor-4.576
arians-4.555
ocol-4.543
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.913
,+0.778
.+0.608
 the+0.606
 a+0.525
Neg logit contributions
umm-7.529
lor-7.435
ocol-7.407
osity-7.325
arians-7.323
Token Tom
Token ID4186
Feature activation+1.00e+00

Pos logit contributions
 and+2.972
,+2.861
.+2.681
 the+2.591
 a+2.495
Neg logit contributions
umm-5.549
lor-5.485
ocol-5.433
 former-5.405
arians-5.364
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
 and+3.005
,+2.934
.+2.806
 the+2.736
 a+2.639
Neg logit contributions
umm-5.344
lor-5.304
arians-5.239
 former-5.221
ocol-5.202
Token scared
Token ID12008
Feature activation+1.00e+00

Pos logit contributions
 and+3.713
,+3.613
.+3.442
 the+3.330
 a+3.196
Neg logit contributions
umm-4.723
ocol-4.659
 former-4.649
lor-4.636
arians-4.572
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.530
,+0.416
.+0.244
 the+0.242
 a+0.178
Neg logit contributions
umm-7.916
ocol-7.783
lor-7.716
arians-7.661
osity-7.659
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+3.090
,+2.970
.+2.784
 the+2.704
 a+2.605
Neg logit contributions
umm-5.474
lor-5.356
ocol-5.303
 former-5.286
arians-5.252
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.412
,+1.293
.+1.098
 the+1.020
 a+0.941
Neg logit contributions
lor-7.041
umm-7.024
ocol-6.946
 former-6.898
arians-6.858
TokenTom
Token ID13787
Feature activation+1.00e+00

Pos logit contributions
 and+3.687
,+3.562
.+3.397
 the+3.319
 a+3.225
Neg logit contributions
umm-4.882
lor-4.844
ocol-4.760
arians-4.719
 former-4.711
Token went
Token ID1816
Feature activation+1.00e+00

Pos logit contributions
 and+2.963
,+2.892
.+2.778
 the+2.711
 a+2.614
Neg logit contributions
umm-5.348
lor-5.343
arians-5.275
 former-5.215
ocol-5.205
Token to
Token ID284
Feature activation+1.00e+00

Pos logit contributions
 and+1.863
,+1.752
.+1.589
 the+1.515
 a+1.428
Neg logit contributions
umm-6.603
lor-6.542
ocol-6.493
 former-6.459
arians-6.451
Token his
Token ID465
Feature activation+1.00e+00

Pos logit contributions
 and+2.406
,+2.288
.+2.117
 the+1.958
 a+1.899
Neg logit contributions
umm-6.076
ocol-6.020
lor-5.976
 former-5.965
arians-5.918
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
 and+3.005
,+2.934
.+2.806
 the+2.736
 a+2.639
Neg logit contributions
umm-5.344
lor-5.304
arians-5.239
 former-5.221
ocol-5.202
Token scared
Token ID12008
Feature activation+1.00e+00

Pos logit contributions
 and+3.713
,+3.613
.+3.442
 the+3.330
 a+3.196
Neg logit contributions
umm-4.723
ocol-4.659
 former-4.649
lor-4.636
arians-4.572
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.530
,+0.416
.+0.244
 the+0.242
 a+0.178
Neg logit contributions
umm-7.916
ocol-7.783
lor-7.716
arians-7.661
osity-7.659
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+3.090
,+2.970
.+2.784
 the+2.704
 a+2.605
Neg logit contributions
umm-5.474
lor-5.356
ocol-5.303
 former-5.286
arians-5.252
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.412
,+1.293
.+1.098
 the+1.020
 a+0.941
Neg logit contributions
lor-7.041
umm-7.024
ocol-6.946
 former-6.898
arians-6.858
TokenTom
Token ID13787
Feature activation+1.00e+00

Pos logit contributions
 and+3.687
,+3.562
.+3.397
 the+3.319
 a+3.225
Neg logit contributions
umm-4.882
lor-4.844
ocol-4.760
arians-4.719
 former-4.711
Token went
Token ID1816
Feature activation+1.00e+00

Pos logit contributions
 and+2.963
,+2.892
.+2.778
 the+2.711
 a+2.614
Neg logit contributions
umm-5.348
lor-5.343
arians-5.275
 former-5.215
ocol-5.205
Token to
Token ID284
Feature activation+1.00e+00

Pos logit contributions
 and+1.863
,+1.752
.+1.589
 the+1.515
 a+1.428
Neg logit contributions
umm-6.603
lor-6.542
ocol-6.493
 former-6.459
arians-6.451
Token his
Token ID465
Feature activation+1.00e+00

Pos logit contributions
 and+2.406
,+2.288
.+2.117
 the+1.958
 a+1.899
Neg logit contributions
umm-6.076
ocol-6.020
lor-5.976
 former-5.965
arians-5.918
Token bed
Token ID3996
Feature activation+1.00e+00

Pos logit contributions
 and+3.796
,+3.682
.+3.525
 the+3.457
 a+3.353
Neg logit contributions
umm-4.731
 former-4.621
lor-4.576
arians-4.555
ocol-4.543
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
 and+1.139
,+1.038
 the+0.892
.+0.868
 a+0.783
Neg logit contributions
umm-7.257
lor-7.218
 former-7.136
arians-7.077
ocol-7.042
Token scared
Token ID12008
Feature activation+1.00e+00

Pos logit contributions
 and+3.713
,+3.613
.+3.442
 the+3.330
 a+3.196
Neg logit contributions
umm-4.723
ocol-4.659
 former-4.649
lor-4.636
arians-4.572
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.530
,+0.416
.+0.244
 the+0.242
 a+0.178
Neg logit contributions
umm-7.916
ocol-7.783
lor-7.716
arians-7.661
osity-7.659
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+3.090
,+2.970
.+2.784
 the+2.704
 a+2.605
Neg logit contributions
umm-5.474
lor-5.356
ocol-5.303
 former-5.286
arians-5.252
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.412
,+1.293
.+1.098
 the+1.020
 a+0.941
Neg logit contributions
lor-7.041
umm-7.024
ocol-6.946
 former-6.898
arians-6.858
TokenTom
Token ID13787
Feature activation+1.00e+00

Pos logit contributions
 and+3.687
,+3.562
.+3.397
 the+3.319
 a+3.225
Neg logit contributions
umm-4.882
lor-4.844
ocol-4.760
arians-4.719
 former-4.711
Token went
Token ID1816
Feature activation+1.00e+00

Pos logit contributions
 and+2.963
,+2.892
.+2.778
 the+2.711
 a+2.614
Neg logit contributions
umm-5.348
lor-5.343
arians-5.275
 former-5.215
ocol-5.205
Token to
Token ID284
Feature activation+1.00e+00

Pos logit contributions
 and+1.863
,+1.752
.+1.589
 the+1.515
 a+1.428
Neg logit contributions
umm-6.603
lor-6.542
ocol-6.493
 former-6.459
arians-6.451
Token his
Token ID465
Feature activation+1.00e+00

Pos logit contributions
 and+2.406
,+2.288
.+2.117
 the+1.958
 a+1.899
Neg logit contributions
umm-6.076
ocol-6.020
lor-5.976
 former-5.965
arians-5.918
Token bed
Token ID3996
Feature activation+1.00e+00

Pos logit contributions
 and+3.796
,+3.682
.+3.525
 the+3.457
 a+3.353
Neg logit contributions
umm-4.731
 former-4.621
lor-4.576
arians-4.555
ocol-4.543
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
 and+1.139
,+1.038
 the+0.892
.+0.868
 a+0.783
Neg logit contributions
umm-7.257
lor-7.218
 former-7.136
arians-7.077
ocol-7.042
Token hid
Token ID24519
Feature activation+1.00e+00

Pos logit contributions
 and+3.590
,+3.484
.+3.332
 the+3.198
 a+3.108
Neg logit contributions
umm-4.811
lor-4.796
 former-4.755
ocol-4.728
arians-4.696
Token did
Token ID750
Feature activation+1.00e+00

Pos logit contributions
 and+3.767
,+3.652
.+3.507
 the+3.465
 a+3.360
Neg logit contributions
umm-4.683
lor-4.598
ocol-4.544
 former-4.491
arians-4.473
Token not
Token ID407
Feature activation+1.00e+00

Pos logit contributions
 and+2.385
,+2.255
.+2.096
 the+2.031
 a+1.925
Neg logit contributions
umm-6.051
lor-5.998
ocol-5.971
 former-5.955
arians-5.896
Token say
Token ID910
Feature activation+1.00e+00

Pos logit contributions
 and+3.928
,+3.808
.+3.643
 the+3.586
 a+3.479
Neg logit contributions
umm-4.599
ocol-4.497
lor-4.441
 former-4.426
arians-4.395
Token anything
Token ID1997
Feature activation+1.00e+00

Pos logit contributions
 and+4.183
,+4.003
.+3.863
 the+3.778
 a+3.681
Neg logit contributions
umm-4.334
lor-4.286
ocol-4.250
osity-4.151
 former-4.151
Token either
Token ID2035
Feature activation+1.00e+00

Pos logit contributions
 and+1.879
,+1.754
.+1.579
 the+1.551
 a+1.473
Neg logit contributions
umm-6.560
lor-6.471
ocol-6.446
 former-6.397
arians-6.397
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.913
,+0.778
.+0.608
 the+0.606
 a+0.525
Neg logit contributions
umm-7.529
lor-7.435
ocol-7.407
osity-7.325
arians-7.323
Token Tom
Token ID4186
Feature activation+1.00e+00

Pos logit contributions
 and+2.972
,+2.861
.+2.681
 the+2.591
 a+2.495
Neg logit contributions
umm-5.549
lor-5.485
ocol-5.433
 former-5.405
arians-5.364
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
 and+3.005
,+2.934
.+2.806
 the+2.736
 a+2.639
Neg logit contributions
umm-5.344
lor-5.304
arians-5.239
 former-5.221
ocol-5.202
Token scared
Token ID12008
Feature activation+1.00e+00

Pos logit contributions
 and+3.713
,+3.613
.+3.442
 the+3.330
 a+3.196
Neg logit contributions
umm-4.723
ocol-4.659
 former-4.649
lor-4.636
arians-4.572
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.530
,+0.416
.+0.244
 the+0.242
 a+0.178
Neg logit contributions
umm-7.916
ocol-7.783
lor-7.716
arians-7.661
osity-7.659
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+3.090
,+2.970
.+2.784
 the+2.704
 a+2.605
Neg logit contributions
umm-5.474
lor-5.356
ocol-5.303
 former-5.286
arians-5.252
Token table
Token ID3084
Feature activation+1.00e+00

Pos logit contributions
 and+4.237
,+4.129
.+3.956
 the+3.874
 a+3.767
Neg logit contributions
umm-4.312
ocol-4.202
 former-4.137
lor-4.128
arians-4.097
Token did
Token ID750
Feature activation+1.00e+00

Pos logit contributions
 and+3.767
,+3.652
.+3.507
 the+3.465
 a+3.360
Neg logit contributions
umm-4.683
lor-4.598
ocol-4.544
 former-4.491
arians-4.473
Token not
Token ID407
Feature activation+1.00e+00

Pos logit contributions
 and+2.385
,+2.255
.+2.096
 the+2.031
 a+1.925
Neg logit contributions
umm-6.051
lor-5.998
ocol-5.971
 former-5.955
arians-5.896
Token say
Token ID910
Feature activation+1.00e+00

Pos logit contributions
 and+3.928
,+3.808
.+3.643
 the+3.586
 a+3.479
Neg logit contributions
umm-4.599
ocol-4.497
lor-4.441
 former-4.426
arians-4.395
Token anything
Token ID1997
Feature activation+1.00e+00

Pos logit contributions
 and+4.183
,+4.003
.+3.863
 the+3.778
 a+3.681
Neg logit contributions
umm-4.334
lor-4.286
ocol-4.250
osity-4.151
 former-4.151
Token either
Token ID2035
Feature activation+1.00e+00

Pos logit contributions
 and+1.879
,+1.754
.+1.579
 the+1.551
 a+1.473
Neg logit contributions
umm-6.560
lor-6.471
ocol-6.446
 former-6.397
arians-6.397
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.913
,+0.778
.+0.608
 the+0.606
 a+0.525
Neg logit contributions
umm-7.529
lor-7.435
ocol-7.407
osity-7.325
arians-7.323
Token Tom
Token ID4186
Feature activation+1.00e+00

Pos logit contributions
 and+2.972
,+2.861
.+2.681
 the+2.591
 a+2.495
Neg logit contributions
umm-5.549
lor-5.485
ocol-5.433
 former-5.405
arians-5.364
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
 and+3.005
,+2.934
.+2.806
 the+2.736
 a+2.639
Neg logit contributions
umm-5.344
lor-5.304
arians-5.239
 former-5.221
ocol-5.202
Token scared
Token ID12008
Feature activation+1.00e+00

Pos logit contributions
 and+3.713
,+3.613
.+3.442
 the+3.330
 a+3.196
Neg logit contributions
umm-4.723
ocol-4.659
 former-4.649
lor-4.636
arians-4.572
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.530
,+0.416
.+0.244
 the+0.242
 a+0.178
Neg logit contributions
umm-7.916
ocol-7.783
lor-7.716
arians-7.661
osity-7.659
Token?"
Token ID1701
Feature activation+1.00e+00

Pos logit contributions
 and+2.163
,+2.045
.+1.893
 the+1.892
 a+1.782
Neg logit contributions
umm-6.283
lor-6.172
ocol-6.126
 former-6.089
arians-6.067
Token The
Token ID383
Feature activation+1.00e+00

Pos logit contributions
 and+3.157
,+3.027
.+2.874
 the+2.762
 a+2.693
Neg logit contributions
umm-5.294
ocol-5.226
lor-5.208
 former-5.169
arians-5.127
Token table
Token ID3084
Feature activation+1.00e+00

Pos logit contributions
 and+4.237
,+4.129
.+3.956
 the+3.874
 a+3.767
Neg logit contributions
umm-4.312
ocol-4.202
 former-4.137
lor-4.128
arians-4.097
Token did
Token ID750
Feature activation+1.00e+00

Pos logit contributions
 and+3.767
,+3.652
.+3.507
 the+3.465
 a+3.360
Neg logit contributions
umm-4.683
lor-4.598
ocol-4.544
 former-4.491
arians-4.473
Token not
Token ID407
Feature activation+1.00e+00

Pos logit contributions
 and+2.385
,+2.255
.+2.096
 the+2.031
 a+1.925
Neg logit contributions
umm-6.051
lor-5.998
ocol-5.971
 former-5.955
arians-5.896
Token say
Token ID910
Feature activation+1.00e+00

Pos logit contributions
 and+3.928
,+3.808
.+3.643
 the+3.586
 a+3.479
Neg logit contributions
umm-4.599
ocol-4.497
lor-4.441
 former-4.426
arians-4.395
Token anything
Token ID1997
Feature activation+1.00e+00

Pos logit contributions
 and+4.183
,+4.003
.+3.863
 the+3.778
 a+3.681
Neg logit contributions
umm-4.334
lor-4.286
ocol-4.250
osity-4.151
 former-4.151
Token either
Token ID2035
Feature activation+1.00e+00

Pos logit contributions
 and+1.879
,+1.754
.+1.579
 the+1.551
 a+1.473
Neg logit contributions
umm-6.560
lor-6.471
ocol-6.446
 former-6.397
arians-6.397
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.913
,+0.778
.+0.608
 the+0.606
 a+0.525
Neg logit contributions
umm-7.529
lor-7.435
ocol-7.407
osity-7.325
arians-7.323
Token Tom
Token ID4186
Feature activation+1.00e+00

Pos logit contributions
 and+2.972
,+2.861
.+2.681
 the+2.591
 a+2.495
Neg logit contributions
umm-5.549
lor-5.485
ocol-5.433
 former-5.405
arians-5.364
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
 and+3.005
,+2.934
.+2.806
 the+2.736
 a+2.639
Neg logit contributions
umm-5.344
lor-5.304
arians-5.239
 former-5.221
ocol-5.202
Token The
Token ID383
Feature activation+1.00e+00

Pos logit contributions
 and+3.157
,+3.027
.+2.874
 the+2.762
 a+2.693
Neg logit contributions
umm-5.294
ocol-5.226
lor-5.208
 former-5.169
arians-5.127
Token table
Token ID3084
Feature activation+1.00e+00

Pos logit contributions
 and+4.237
,+4.129
.+3.956
 the+3.874
 a+3.767
Neg logit contributions
umm-4.312
ocol-4.202
 former-4.137
lor-4.128
arians-4.097
Token did
Token ID750
Feature activation+1.00e+00

Pos logit contributions
 and+3.767
,+3.652
.+3.507
 the+3.465
 a+3.360
Neg logit contributions
umm-4.683
lor-4.598
ocol-4.544
 former-4.491
arians-4.473
Token not
Token ID407
Feature activation+1.00e+00

Pos logit contributions
 and+2.385
,+2.255
.+2.096
 the+2.031
 a+1.925
Neg logit contributions
umm-6.051
lor-5.998
ocol-5.971
 former-5.955
arians-5.896
Token say
Token ID910
Feature activation+1.00e+00

Pos logit contributions
 and+3.928
,+3.808
.+3.643
 the+3.586
 a+3.479
Neg logit contributions
umm-4.599
ocol-4.497
lor-4.441
 former-4.426
arians-4.395
Token anything
Token ID1997
Feature activation+1.00e+00

Pos logit contributions
 and+4.183
,+4.003
.+3.863
 the+3.778
 a+3.681
Neg logit contributions
umm-4.334
lor-4.286
ocol-4.250
osity-4.151
 former-4.151
Token either
Token ID2035
Feature activation+1.00e+00

Pos logit contributions
 and+1.879
,+1.754
.+1.579
 the+1.551
 a+1.473
Neg logit contributions
umm-6.560
lor-6.471
ocol-6.446
 former-6.397
arians-6.397
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.913
,+0.778
.+0.608
 the+0.606
 a+0.525
Neg logit contributions
umm-7.529
lor-7.435
ocol-7.407
osity-7.325
arians-7.323
Token Tom
Token ID4186
Feature activation+1.00e+00

Pos logit contributions
 and+2.972
,+2.861
.+2.681
 the+2.591
 a+2.495
Neg logit contributions
umm-5.549
lor-5.485
ocol-5.433
 former-5.405
arians-5.364
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
 and+3.005
,+2.934
.+2.806
 the+2.736
 a+2.639
Neg logit contributions
umm-5.344
lor-5.304
arians-5.239
 former-5.221
ocol-5.202
Token scared
Token ID12008
Feature activation+1.00e+00

Pos logit contributions
 and+3.713
,+3.613
.+3.442
 the+3.330
 a+3.196
Neg logit contributions
umm-4.723
ocol-4.659
 former-4.649
lor-4.636
arians-4.572
Token they
Token ID484
Feature activation+1.00e+00

Pos logit contributions
 and+4.355
,+4.204
.+4.077
 the+3.920
 a+3.825
Neg logit contributions
umm-4.170
ocol-4.059
lor-4.054
 former-4.015
arians-4.004
Token all
Token ID477
Feature activation+1.00e+00

Pos logit contributions
 and+3.359
,+3.246
.+3.110
 the+3.007
 a+2.928
Neg logit contributions
umm-5.081
lor-5.012
ocol-4.974
 former-4.951
arians-4.946
Token had
Token ID550
Feature activation+1.00e+00

Pos logit contributions
 and+3.902
,+3.773
.+3.617
 the+3.502
 a+3.459
Neg logit contributions
umm-4.624
ocol-4.508
lor-4.488
 former-4.466
osity-4.446
Token great
Token ID1049
Feature activation+1.00e+00

Pos logit contributions
 and+2.940
,+2.825
.+2.662
 the+2.536
 a+2.420
Neg logit contributions
umm-5.542
lor-5.471
 former-5.443
ocol-5.419
arians-5.358
Token fun
Token ID1257
Feature activation+1.00e+00

Pos logit contributions
 and+4.383
,+4.264
.+4.109
 the+4.029
 a+3.916
Neg logit contributions
umm-4.146
ocol-4.067
lor-4.016
 former-3.984
arians-3.935
Token z
Token ID1976
Feature activation+1.00e+00

Pos logit contributions
 and+3.105
,+2.999
.+2.825
 the+2.809
 a+2.721
Neg logit contributions
umm-5.357
ocol-5.270
lor-5.217
arians-5.186
 former-5.184
Tokenooming
Token ID30602
Feature activation+1.00e+00

Pos logit contributions
 and+7.030
,+6.887
.+6.745
 the+6.688
 a+6.589
Neg logit contributions
umm-1.581
ocol-1.546
lor-1.534
arians-1.452
 former-1.380
Token around
Token ID1088
Feature activation+1.00e+00

Pos logit contributions
 and+3.366
,+3.272
.+3.097
 the+3.034
 a+2.972
Neg logit contributions
umm-5.087
lor-4.943
ocol-4.911
arians-4.908
 former-4.905
Token together
Token ID1978
Feature activation+1.00e+00

Pos logit contributions
 and+1.725
,+1.614
.+1.447
 the+1.345
 a+1.314
Neg logit contributions
umm-6.685
ocol-6.605
arians-6.563
lor-6.549
 former-6.508
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.879
,+0.754
.+0.574
 the+0.565
 a+0.509
Neg logit contributions
umm-7.549
lor-7.406
ocol-7.388
arians-7.382
 former-7.362
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+4.273
,+4.143
.+4.004
 the+3.900
 a+3.818
Neg logit contributions
umm-4.269
lor-4.200
arians-4.134
ocol-4.120
 former-4.094
Token loud
Token ID7812
Feature activation+1.00e+00

Pos logit contributions
 and+2.444
,+2.307
.+2.152
 the+2.089
 a+1.992
Neg logit contributions
umm-6.032
ocol-5.940
lor-5.860
 former-5.849
osity-5.836
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+1.085
,+0.973
.+0.798
 the+0.781
 a+0.685
Neg logit contributions
umm-7.302
ocol-7.232
lor-7.208
arians-7.119
 former-7.098
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+2.279
,+2.142
.+1.989
 the+1.906
 a+1.794
Neg logit contributions
umm-6.189
lor-6.102
ocol-6.036
 former-6.020
osity-6.020
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.417
,+1.286
.+1.096
 the+1.029
 a+0.947
Neg logit contributions
lor-6.971
umm-6.956
ocol-6.919
 former-6.851
arians-6.833
Token"
Token ID1
Feature activation+1.00e+00

Pos logit contributions
 and+3.371
,+3.223
.+3.073
 the+2.999
 a+2.903
Neg logit contributions
umm-5.120
lor-5.107
ocol-5.071
arians-5.005
 former-4.975
TokenO
Token ID46
Feature activation+1.00e+00

Pos logit contributions
 and+5.284
,+5.189
.+5.012
 the+4.904
 a+4.810
Neg logit contributions
umm-3.359
lor-3.242
ocol-3.222
 former-3.164
arians-3.155
Tokenw
Token ID86
Feature activation+1.00e+00

Pos logit contributions
 and+6.495
,+6.328
.+6.175
 the+6.173
 a+6.084
Neg logit contributions
umm-2.138
lor-2.052
ocol-1.998
arians-1.922
 former-1.895
Token,
Token ID11
Feature activation+1.00e+00

Pos logit contributions
 and+1.417
,+1.186
.+1.113
 the+1.082
 a+1.004
Neg logit contributions
umm-7.030
lor-6.979
ocol-6.978
arians-6.869
osity-6.849
Token ow
Token ID12334
Feature activation+1.00e+00

Pos logit contributions
 and+5.269
,+5.120
.+4.990
 the+4.854
 a+4.776
Neg logit contributions
umm-3.287
ocol-3.208
lor-3.174
arians-3.064
osity-3.052
Token,
Token ID11
Feature activation+1.00e+00

Pos logit contributions
 and+0.471
,+0.241
 the+0.140
.+0.139
 a+0.078
Neg logit contributions
umm-8.052
ocol-7.960
lor-7.872
arians-7.825
osity-7.804
Token ow
Token ID12334
Feature activation+1.00e+00

Pos logit contributions
 and+5.277
,+5.139
.+5.007
 the+4.862
 a+4.783
Neg logit contributions
umm-3.265
ocol-3.187
lor-3.131
arians-3.022
osity-3.020
Token can
Token ID460
Feature activation+1.00e+00

Pos logit contributions
 and+3.682
,+3.558
.+3.413
 the+3.380
 a+3.279
Neg logit contributions
umm-4.726
ocol-4.631
 former-4.626
arians-4.606
lor-4.595
Token be
Token ID307
Feature activation+1.00e+00

Pos logit contributions
 and+3.205
,+3.066
 the+2.925
.+2.909
 a+2.831
Neg logit contributions
umm-5.249
lor-5.134
 former-5.128
arians-5.115
ocol-5.094
Token!"
Token ID2474
Feature activation+1.00e+00

Pos logit contributions
 and+2.874
,+2.725
.+2.582
 the+2.521
 a+2.423
Neg logit contributions
umm-5.533
 former-5.464
lor-5.452
ocol-5.449
arians-5.421
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+2.588
,+2.481
.+2.320
 the+2.220
 a+2.178
Neg logit contributions
umm-5.845
lor-5.771
ocol-5.726
arians-5.689
 former-5.683
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+2.072
,+1.985
.+1.782
 the+1.694
 a+1.626
Neg logit contributions
lor-6.430
umm-6.373
ocol-6.308
 former-6.261
arians-6.238
TokenH
Token ID39
Feature activation+1.00e+00

Pos logit contributions
 and+4.945
,+4.862
.+4.684
 the+4.589
 a+4.492
Neg logit contributions
umm-3.617
lor-3.610
 former-3.497
arians-3.495
ocol-3.492
Tokenilda
Token ID27281
Feature activation+1.00e+00

Pos logit contributions
 and+6.981
,+6.825
.+6.676
 the+6.657
 a+6.578
Neg logit contributions
umm-1.640
lor-1.577
ocol-1.576
 former-1.491
arians-1.489
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
 and+3.596
,+3.506
.+3.390
 the+3.327
 a+3.256
Neg logit contributions
umm-4.768
lor-4.740
arians-4.686
 former-4.657
ocol-4.608
Token scared
Token ID12008
Feature activation+1.00e+00

Pos logit contributions
 and+3.602
,+3.488
.+3.347
 the+3.204
 a+3.083
Neg logit contributions
umm-4.857
 former-4.774
lor-4.768
ocol-4.762
arians-4.695
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
 and+1.085
,+0.953
 the+0.804
.+0.799
 a+0.720
Neg logit contributions
umm-7.395
ocol-7.241
lor-7.188
osity-7.173
 former-7.158
Token tried
Token ID3088
Feature activation+1.00e+00

Pos logit contributions
 and+3.670
,+3.535
.+3.398
 the+3.262
 a+3.205
Neg logit contributions
umm-4.810
ocol-4.713
 former-4.700
lor-4.690
arians-4.658
Token away
Token ID1497
Feature activation+1.00e+00

Pos logit contributions
 and+3.705
,+3.572
.+3.401
 the+3.289
 a+3.195
Neg logit contributions
umm-4.801
ocol-4.745
lor-4.671
 former-4.662
ubb-4.636
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+1.267
,+1.150
 the+0.949
.+0.948
 a+0.851
Neg logit contributions
umm-7.143
 former-7.029
lor-7.002
ocol-7.002
osity-6.958
Token
Token ID220
Feature activation+1.00e+00

Pos logit contributions
 and+3.237
,+3.132
.+2.961
 the+2.844
 a+2.771
Neg logit contributions
umm-5.349
lor-5.252
 former-5.203
arians-5.143
ocol-5.139
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.445
,+1.337
.+1.124
 the+1.082
 a+0.990
Neg logit contributions
lor-7.010
umm-6.983
ocol-6.880
arians-6.847
 former-6.841
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.740
,+1.642
.+1.462
 the+1.366
 a+1.283
Neg logit contributions
lor-6.783
umm-6.713
 former-6.614
arians-6.606
ocol-6.600
TokenFe
Token ID14304
Feature activation+1.00e+00

Pos logit contributions
 and+4.877
,+4.795
.+4.621
 the+4.507
 a+4.402
Neg logit contributions
lor-3.718
umm-3.696
 former-3.614
arians-3.579
ocol-3.577
Tokeneling
Token ID10809
Feature activation+1.00e+00

Pos logit contributions
 and+7.464
,+7.315
.+7.183
 the+7.063
 a+7.024
Neg logit contributions
umm-1.114
ocol-1.090
lor-1.057
arians-0.988
 former-0.963
Token frustrated
Token ID14718
Feature activation+1.00e+00

Pos logit contributions
 and+3.690
,+3.551
.+3.416
 the+3.277
 a+3.180
Neg logit contributions
umm-4.777
ocol-4.702
lor-4.676
 former-4.636
ubb-4.623
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
 and+0.671
,+0.524
.+0.425
 the+0.372
 a+0.310
Neg logit contributions
umm-7.769
ocol-7.634
lor-7.597
arians-7.596
 former-7.537
Token angry
Token ID7954
Feature activation+1.00e+00

Pos logit contributions
 and+3.980
,+3.838
.+3.700
 the+3.559
 a+3.506
Neg logit contributions
umm-4.552
 former-4.443
ocol-4.425
lor-4.419
osity-4.368
Token,
Token ID11
Feature activation+1.00e+00

Pos logit contributions
 and+1.017
,+0.850
.+0.741
 the+0.680
 a+0.598
Neg logit contributions
umm-7.439
lor-7.320
ocol-7.298
arians-7.265
osity-7.235
Token scissors
Token ID41786
Feature activation+1.00e+00

Pos logit contributions
 and+4.401
,+4.305
.+4.115
 the+4.055
 a+3.941
Neg logit contributions
umm-4.187
ocol-4.015
lor-4.002
 former-3.983
arians-3.958
Token sn
Token ID3013
Feature activation+1.00e+00

Pos logit contributions
 and+2.731
,+2.644
.+2.480
 the+2.452
 a+2.378
Neg logit contributions
umm-5.624
lor-5.555
ocol-5.539
arians-5.485
osity-5.471
Tokenip
Token ID541
Feature activation+1.00e+00

Pos logit contributions
 and+6.707
,+6.595
 the+6.425
.+6.422
 a+6.328
Neg logit contributions
umm-1.991
lor-1.934
ocol-1.909
 former-1.810
arians-1.794
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
 and+3.836
,+3.753
.+3.572
 the+3.562
 a+3.486
Neg logit contributions
umm-4.615
lor-4.555
arians-4.478
ocol-4.431
vation-4.408
Token sn
Token ID3013
Feature activation+1.00e+00

Pos logit contributions
 and+3.768
,+3.649
.+3.504
 the+3.344
 a+3.294
Neg logit contributions
umm-4.710
ocol-4.683
lor-4.579
arians-4.558
 former-4.524
Tokenip
Token ID541
Feature activation+1.00e+00

Pos logit contributions
 and+6.706
,+6.570
.+6.387
 the+6.386
 a+6.286
Neg logit contributions
umm-1.996
lor-1.945
ocol-1.926
 former-1.832
arians-1.809
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
 and+4.725
,+4.630
 the+4.434
.+4.431
 a+4.351
Neg logit contributions
umm-3.742
lor-3.698
arians-3.616
ocol-3.583
vation-3.535
Token sn
Token ID3013
Feature activation+1.00e+00

Pos logit contributions
 and+4.135
,+4.007
.+3.856
 the+3.710
 a+3.659
Neg logit contributions
umm-4.369
ocol-4.334
lor-4.219
arians-4.210
 former-4.189
Tokenip
Token ID541
Feature activation+1.00e+00

Pos logit contributions
 and+6.685
,+6.543
 the+6.360
.+6.353
 a+6.259
Neg logit contributions
umm-2.013
lor-1.947
ocol-1.938
 former-1.847
arians-1.809
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+5.273
,+5.171
 the+4.975
.+4.967
 a+4.891
Neg logit contributions
umm-3.211
lor-3.170
arians-3.083
ocol-3.058
vation-3.000
Token The
Token ID383
Feature activation+1.00e+00

Pos logit contributions
 and+2.989
,+2.889
.+2.731
 the+2.627
 a+2.548
Neg logit contributions
umm-5.571
ocol-5.405
lor-5.401
 former-5.385
arians-5.342
Token ride
Token ID6594
Feature activation+1.00e+00

Pos logit contributions
 and+4.097
,+3.943
.+3.801
 the+3.732
 a+3.634
Neg logit contributions
umm-4.435
lor-4.339
ocol-4.312
 former-4.302
arians-4.221
Token in
Token ID287
Feature activation+1.00e+00

Pos logit contributions
 and+1.169
,+1.056
 the+0.857
.+0.853
 a+0.769
Neg logit contributions
umm-7.278
lor-7.165
ocol-7.095
 former-7.095
arians-7.095
Token his
Token ID465
Feature activation+1.00e+00

Pos logit contributions
 and+1.253
,+1.142
.+0.941
 the+0.800
 a+0.728
Neg logit contributions
umm-7.250
ocol-7.180
lor-7.148
 former-7.132
arians-7.040
Token raft
Token ID31812
Feature activation+1.00e+00

Pos logit contributions
 and+4.713
,+4.569
.+4.429
 the+4.341
 a+4.261
Neg logit contributions
umm-3.869
 former-3.731
lor-3.726
ocol-3.709
arians-3.621
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+1.944
,+1.837
 the+1.680
.+1.609
 a+1.595
Neg logit contributions
umm-6.523
lor-6.433
 former-6.350
arians-6.324
rol-6.272
Token Exc
Token ID25268
Feature activation+1.00e+00

Pos logit contributions
 and+2.602
,+2.497
.+2.333
 the+2.240
 a+2.167
Neg logit contributions
umm-5.963
lor-5.825
 former-5.777
ocol-5.745
osity-5.714
Tokenited
Token ID863
Feature activation+1.00e+00

Pos logit contributions
 and+7.779
,+7.659
.+7.507
 the+7.422
 a+7.337
Neg logit contributions
umm-0.816
ocol-0.699
lor-0.698
 former-0.633
arians-0.611
Token,
Token ID11
Feature activation+1.00e+00

Pos logit contributions
 and+4.509
,+4.373
.+4.255
 the+4.190
 a+4.143
Neg logit contributions
umm-3.993
ocol-3.838
lor-3.825
osity-3.739
arians-3.732
Token Lily
Token ID20037
Feature activation+1.00e+00

Pos logit contributions
 and+2.092
,+1.963
.+1.824
 the+1.680
 a+1.625
Neg logit contributions
umm-6.404
ocol-6.275
lor-6.257
 former-6.215
arians-6.210
Token jumped
Token ID11687
Feature activation+1.00e+00

Pos logit contributions
 and+3.119
,+3.044
.+2.942
 the+2.848
 a+2.774
Neg logit contributions
umm-5.241
lor-5.225
arians-5.154
 former-5.096
osity-5.083
Token in
Token ID287
Feature activation+1.00e+00

Pos logit contributions
 and+2.529
,+2.437
.+2.287
 the+2.202
 a+2.117
Neg logit contributions
umm-5.939
lor-5.807
ocol-5.798
 former-5.728
rol-5.700
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
 and+3.836
,+3.753
.+3.572
 the+3.562
 a+3.486
Neg logit contributions
umm-4.615
lor-4.555
arians-4.478
ocol-4.431
vation-4.408
Token sn
Token ID3013
Feature activation+1.00e+00

Pos logit contributions
 and+3.768
,+3.649
.+3.504
 the+3.344
 a+3.294
Neg logit contributions
umm-4.710
ocol-4.683
lor-4.579
arians-4.558
 former-4.524
Tokenip
Token ID541
Feature activation+1.00e+00

Pos logit contributions
 and+6.706
,+6.570
.+6.387
 the+6.386
 a+6.286
Neg logit contributions
umm-1.996
lor-1.945
ocol-1.926
 former-1.832
arians-1.809
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
 and+4.725
,+4.630
 the+4.434
.+4.431
 a+4.351
Neg logit contributions
umm-3.742
lor-3.698
arians-3.616
ocol-3.583
vation-3.535
Token sn
Token ID3013
Feature activation+1.00e+00

Pos logit contributions
 and+4.135
,+4.007
.+3.856
 the+3.710
 a+3.659
Neg logit contributions
umm-4.369
ocol-4.334
lor-4.219
arians-4.210
 former-4.189
Tokenip
Token ID541
Feature activation+1.00e+00

Pos logit contributions
 and+6.685
,+6.543
 the+6.360
.+6.353
 a+6.259
Neg logit contributions
umm-2.013
lor-1.947
ocol-1.938
 former-1.847
arians-1.809
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+5.273
,+5.171
 the+4.975
.+4.967
 a+4.891
Neg logit contributions
umm-3.211
lor-3.170
arians-3.083
ocol-3.058
vation-3.000
Token The
Token ID383
Feature activation+1.00e+00

Pos logit contributions
 and+2.989
,+2.889
.+2.731
 the+2.627
 a+2.548
Neg logit contributions
umm-5.571
ocol-5.405
lor-5.401
 former-5.385
arians-5.342
Token teacher
Token ID4701
Feature activation+1.00e+00

Pos logit contributions
 and+4.594
,+4.489
.+4.319
 the+4.244
 a+4.128
Neg logit contributions
umm-3.999
ocol-3.853
lor-3.825
 former-3.797
arians-3.792
Token followed
Token ID3940
Feature activation+1.00e+00

Pos logit contributions
 and+3.558
,+3.466
.+3.350
 the+3.260
 a+3.178
Neg logit contributions
umm-4.842
lor-4.771
ocol-4.747
arians-4.737
 former-4.663
Token the
Token ID262
Feature activation+1.00e+00

Pos logit contributions
 and+1.980
,+1.870
.+1.707
 the+1.575
 a+1.536
Neg logit contributions
umm-6.443
ocol-6.398
lor-6.320
arians-6.310
 former-6.270
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.420
,+0.289
 the+0.073
.+0.054
 a-0.015
Neg logit contributions
umm-8.009
ocol-7.882
lor-7.876
 former-7.791
arians-7.786
Token He
Token ID679
Feature activation+1.00e+00

Pos logit contributions
 and+3.109
,+3.007
.+2.844
 the+2.744
 a+2.658
Neg logit contributions
umm-5.444
ocol-5.313
lor-5.309
 former-5.253
arians-5.199
Token tried
Token ID3088
Feature activation+1.00e+00

Pos logit contributions
 and+3.293
,+3.242
.+3.109
 the+2.970
 a+2.881
Neg logit contributions
umm-5.108
lor-5.053
ocol-5.024
arians-5.003
 former-4.985
Token to
Token ID284
Feature activation+1.00e+00

Pos logit contributions
 and+1.043
,+0.949
.+0.817
 the+0.684
 a+0.606
Neg logit contributions
umm-7.313
lor-7.252
 former-7.222
ocol-7.204
osity-7.192
Token un
Token ID555
Feature activation+1.00e+00

Pos logit contributions
 and+3.908
,+3.822
.+3.656
 the+3.508
 a+3.430
Neg logit contributions
umm-4.585
ocol-4.492
 former-4.468
arians-4.439
lor-4.434
Tokenplug
Token ID16875
Feature activation+1.00e+00

Pos logit contributions
 and+6.289
,+6.167
.+6.029
 the+5.935
 a+5.850
Neg logit contributions
umm-2.309
lor-2.214
ocol-2.175
arians-2.152
 former-2.117
Token the
Token ID262
Feature activation+1.00e+00

Pos logit contributions
 and+5.031
,+4.909
.+4.767
 the+4.653
 a+4.625
Neg logit contributions
umm-3.404
ocol-3.383
lor-3.361
arians-3.325
vation-3.211
Token mixer
Token ID33938
Feature activation+1.00e+00

Pos logit contributions
 and+5.458
,+5.353
.+5.162
 the+5.093
 a+5.016
Neg logit contributions
umm-3.173
ocol-3.005
lor-2.966
arians-2.911
 former-2.878
Token,
Token ID11
Feature activation+1.00e+00

Pos logit contributions
 and+1.297
,+1.158
.+1.029
 the+0.997
 a+0.930
Neg logit contributions
umm-7.163
lor-7.025
arians-6.962
ocol-6.919
 former-6.915
Token but
Token ID475
Feature activation+1.00e+00

Pos logit contributions
 and+2.240
,+2.176
.+2.040
 the+1.892
 a+1.835
Neg logit contributions
umm-6.140
ocol-6.094
lor-6.023
arians-6.017
 former-6.009
Token he
Token ID339
Feature activation+1.00e+00

Pos logit contributions
 and+1.857
,+1.711
.+1.600
 the+1.408
 a+1.361
Neg logit contributions
umm-6.672
ocol-6.581
lor-6.560
arians-6.488
 former-6.481
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
 and+1.303
,+1.170
.+1.003
 the+0.926
 a+0.882
Neg logit contributions
umm-7.109
ocol-7.019
lor-6.980
 former-6.947
arians-6.884
Token had
Token ID550
Feature activation+1.00e+00

Pos logit contributions
 and+3.767
,+3.640
.+3.498
 the+3.349
 a+3.266
Neg logit contributions
umm-4.721
lor-4.640
ocol-4.611
 former-4.602
arians-4.560
Token fun
Token ID1257
Feature activation+1.00e+00

Pos logit contributions
 and+3.431
,+3.309
.+3.134
 the+2.995
 a+2.866
Neg logit contributions
umm-5.090
lor-5.032
 former-5.021
ocol-4.988
arians-4.892
Token pinch
Token ID25394
Feature activation+1.00e+00

Pos logit contributions
 and+3.070
,+2.941
.+2.758
 the+2.741
 a+2.645
Neg logit contributions
umm-5.436
ocol-5.331
lor-5.318
 former-5.305
arians-5.214
Token lot
Token ID1256
Feature activation+1.00e+00

Pos logit contributions
 and+2.449
,+2.327
.+2.141
 the+2.050
 a+1.977
Neg logit contributions
umm-6.041
ocol-5.939
lor-5.882
 former-5.827
arians-5.808
Tokenion
Token ID295
Feature activation+1.00e+00

Pos logit contributions
 and+6.764
,+6.660
.+6.468
 the+6.409
 a+6.312
Neg logit contributions
umm-1.721
ocol-1.703
lor-1.630
osity-1.541
ubb-1.524
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+2.122
,+2.001
.+1.809
 the+1.808
 a+1.733
Neg logit contributions
umm-6.362
lor-6.231
arians-6.197
ocol-6.161
osity-6.124
Token It
Token ID632
Feature activation+1.00e+00

Pos logit contributions
 and+3.392
,+3.280
.+3.126
 the+3.027
 a+2.929
Neg logit contributions
umm-5.172
lor-5.066
 former-5.033
ocol-5.002
arians-4.959
Token felt
Token ID2936
Feature activation+1.00e+00

Pos logit contributions
 and+3.148
,+3.057
.+2.902
 the+2.797
 a+2.704
Neg logit contributions
umm-5.384
lor-5.279
 former-5.268
arians-5.198
ocol-5.197
Token slim
Token ID18862
Feature activation+1.00e+00

Pos logit contributions
 and+3.213
,+3.076
.+2.938
 the+2.801
 a+2.698
Neg logit contributions
umm-5.317
ocol-5.229
 former-5.186
lor-5.146
ubb-5.108
Tokeny
Token ID88
Feature activation+1.00e+00

Pos logit contributions
 and+4.204
,+4.150
.+4.012
 the+3.973
 a+3.834
Neg logit contributions
umm-4.289
ocol-4.091
lor-4.050
rol-3.995
osity-3.994
Token were
Token ID547
Feature activation+1.00e+00

Pos logit contributions
 and+3.359
,+3.250
.+3.101
 the+2.990
 a+2.909
Neg logit contributions
umm-5.099
lor-5.019
 former-5.002
arians-4.967
ocol-4.964
Token very
Token ID845
Feature activation+1.00e+00

Pos logit contributions
 and+2.628
,+2.506
.+2.361
 the+2.203
 a+2.116
Neg logit contributions
umm-5.856
lor-5.757
 former-5.737
ocol-5.725
arians-5.692
Token happy
Token ID3772
Feature activation+1.00e+00

Pos logit contributions
 and+2.689
,+2.545
.+2.401
 the+2.307
 a+2.218
Neg logit contributions
umm-5.865
ocol-5.751
 former-5.734
lor-5.716
osity-5.670
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.584
,+0.488
 the+0.299
.+0.297
 a+0.206
Neg logit contributions
umm-7.880
lor-7.712
 former-7.695
ocol-7.679
arians-7.671
Token<|endoftext|>
Token ID50256
Feature activation+1.00e+00

Pos logit contributions
 and+2.480
,+2.363
.+2.189
 the+2.114
 a+2.015
Neg logit contributions
umm-6.104
lor-5.970
 former-5.941
ocol-5.935
arians-5.887
TokenE
Token ID36
Feature activation+1.00e+00

Pos logit contributions
 and+3.710
,+3.600
.+3.395
 the+3.349
 a+3.256
Neg logit contributions
umm-4.755
lor-4.729
ocol-4.627
arians-4.535
 former-4.507
Tokenlla
Token ID8466
Feature activation+1.00e+00

Pos logit contributions
 and+6.468
,+6.335
.+6.171
 the+6.102
 a+6.027
Neg logit contributions
umm-2.089
lor-2.029
ocol-1.974
arians-1.928
 former-1.898
Token loved
Token ID6151
Feature activation+1.00e+00

Pos logit contributions
 and+2.239
,+2.144
.+2.014
 the+1.968
 a+1.889
Neg logit contributions
umm-6.096
lor-6.030
ocol-5.973
arians-5.952
 former-5.929
Token to
Token ID284
Feature activation+1.00e+00

Pos logit contributions
 and+3.205
,+3.102
.+2.937
 the+2.793
 a+2.729
Neg logit contributions
umm-5.330
ocol-5.222
lor-5.168
 former-5.135
arians-5.107
Token visit
Token ID3187
Feature activation+1.00e+00

Pos logit contributions
 and+3.510
,+3.371
.+3.229
 the+3.108
 a+3.024
Neg logit contributions
umm-4.953
ocol-4.903
lor-4.839
 former-4.826
osity-4.778
Token her
Token ID607
Feature activation+1.00e+00

Pos logit contributions
 and+2.004
,+1.870
.+1.728
 the+1.574
 a+1.520
Neg logit contributions
umm-6.459
ocol-6.407
 former-6.342
lor-6.321
arians-6.310
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+1.082
,+0.946
.+0.774
 the+0.771
 a+0.695
Neg logit contributions
umm-7.332
lor-7.241
 former-7.233
arians-7.149
ocol-7.114
Token She
Token ID1375
Feature activation+1.00e+00

Pos logit contributions
 and+3.780
,+3.696
.+3.538
 the+3.446
 a+3.339
Neg logit contributions
umm-4.852
lor-4.685
 former-4.666
osity-4.660
ocol-4.568
Token saw
Token ID2497
Feature activation+1.00e+00

Pos logit contributions
 and+3.347
,+3.264
.+3.163
 the+3.016
 a+2.951
Neg logit contributions
lor-5.045
umm-5.039
 former-5.006
arians-4.918
ocol-4.892
Token another
Token ID1194
Feature activation+1.00e+00

Pos logit contributions
 and+2.111
,+2.007
.+1.886
 the+1.662
 a+1.555
Neg logit contributions
umm-6.381
lor-6.299
 former-6.252
ocol-6.250
osity-6.168
Token l
Token ID300
Feature activation+1.00e+00

Pos logit contributions
 and+3.566
,+3.458
.+3.319
 the+3.284
 a+3.162
Neg logit contributions
umm-4.962
lor-4.788
 former-4.772
ocol-4.770
arians-4.692
Tokenoll
Token ID692
Feature activation+1.00e+00

Pos logit contributions
 and+7.035
,+6.913
.+6.754
 the+6.722
 a+6.583
Neg logit contributions
umm-1.617
lor-1.580
ocol-1.538
ubb-1.379
 former-1.374
Tokenipop
Token ID42800
Feature activation+1.00e+00

Pos logit contributions
 and+7.076
,+6.912
.+6.803
 the+6.753
 a+6.672
Neg logit contributions
umm-1.466
ocol-1.430
lor-1.407
arians-1.276
ubb-1.235
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
 and+2.079
,+1.975
.+1.831
 the+1.814
 a+1.711
Neg logit contributions
umm-6.353
lor-6.254
arians-6.161
ocol-6.151
 former-6.129
Token it
Token ID340
Feature activation+1.00e+00

Pos logit contributions
 and+3.189
,+3.089
.+2.953
 the+2.788
 a+2.693
Neg logit contributions
umm-5.253
lor-5.171
 former-5.165
ocol-5.118
arians-5.097
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
 and+2.907
,+2.798
.+2.643
 the+2.553
 a+2.466
Neg logit contributions
umm-5.605
lor-5.494
 former-5.435
ocol-5.397
arians-5.394
Token bigger
Token ID5749
Feature activation+1.00e+00

Pos logit contributions
 and+3.067
,+2.949
.+2.789
 the+2.677
 a+2.557
Neg logit contributions
umm-5.441
ocol-5.332
 former-5.323
lor-5.297
rol-5.196
Token fairy
Token ID25607
Feature activation+1.00e+00

Pos logit contributions
 and+4.038
,+3.901
.+3.777
 the+3.683
 a+3.579
Neg logit contributions
umm-4.538
lor-4.394
 former-4.387
ocol-4.352
arians-4.305
Token who
Token ID508
Feature activation+1.00e+00

Pos logit contributions
 and+4.375
,+4.251
.+4.093
 the+4.070
 a+3.978
Neg logit contributions
umm-4.087
lor-4.066
arians-3.953
 former-3.941
ocol-3.937
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
 and+3.915
,+3.809
.+3.658
 the+3.527
 a+3.462
Neg logit contributions
umm-4.567
lor-4.515
ocol-4.474
 former-4.430
arians-4.419
Token making
Token ID1642
Feature activation+1.00e+00

Pos logit contributions
 and+3.985
,+3.865
.+3.697
 the+3.583
 a+3.477
Neg logit contributions
umm-4.464
lor-4.443
ocol-4.403
 former-4.394
arians-4.333
Token l
Token ID300
Feature activation+1.00e+00

Pos logit contributions
 and+3.155
,+3.033
.+2.848
 the+2.729
 a+2.627
Neg logit contributions
umm-5.379
ocol-5.276
lor-5.261
 former-5.245
arians-5.140
Tokenoll
Token ID692
Feature activation+1.00e+00

Pos logit contributions
 and+7.236
,+7.115
.+6.923
 the+6.911
 a+6.759
Neg logit contributions
umm-1.491
lor-1.422
ocol-1.417
ubb-1.279
 former-1.278
Tokenip
Token ID541
Feature activation+1.00e+00

Pos logit contributions
 and+6.375
,+6.210
.+6.066
 the+6.014
 a+5.930
Neg logit contributions
umm-2.201
ocol-2.163
lor-2.137
arians-2.027
ubb-1.991
Tokenops
Token ID2840
Feature activation+1.00e+00

Pos logit contributions
 and+6.658
,+6.531
.+6.358
 the+6.330
 a+6.240
Neg logit contributions
umm-1.915
lor-1.848
ocol-1.764
arians-1.756
 former-1.717
Token for
Token ID329
Feature activation+1.00e+00

Pos logit contributions
 and+2.918
,+2.811
.+2.629
 the+2.627
 a+2.534
Neg logit contributions
umm-5.561
lor-5.449
ocol-5.408
 former-5.371
arians-5.330
Token all
Token ID477
Feature activation+1.00e+00

Pos logit contributions
 and+3.107
,+2.968
.+2.790
 the+2.636
 a+2.553
Neg logit contributions
ocol-5.422
umm-5.418
lor-5.343
 former-5.341
arians-5.304
Token the
Token ID262
Feature activation+1.00e+00

Pos logit contributions
 and+3.088
,+2.972
.+2.790
 the+2.684
 a+2.645
Neg logit contributions
umm-5.420
ocol-5.357
lor-5.314
 former-5.273
osity-5.244
Token can
Token ID460
Feature activation+1.00e+00

Pos logit contributions
 and+3.971
,+3.850
.+3.697
 the+3.655
 a+3.547
Neg logit contributions
umm-4.488
lor-4.420
ocol-4.410
 former-4.404
arians-4.302
Token be
Token ID307
Feature activation+1.00e+00

Pos logit contributions
 and+3.580
,+3.454
.+3.298
 the+3.247
 a+3.146
Neg logit contributions
umm-4.884
ocol-4.805
lor-4.805
 former-4.774
arians-4.769
Token successful
Token ID4388
Feature activation+1.00e+00

Pos logit contributions
 and+3.518
,+3.378
.+3.208
 the+3.099
 a+2.983
Neg logit contributions
umm-4.940
ocol-4.925
 former-4.887
lor-4.851
arians-4.807
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+2.591
,+2.473
.+2.312
 the+2.283
 a+2.180
Neg logit contributions
umm-5.925
ocol-5.764
lor-5.743
 former-5.730
arians-5.728
Tokenâ
Token ID22940
Feature activation+1.00e+00

Pos logit contributions
 and+4.748
,+4.628
.+4.486
 the+4.391
 a+4.272
Neg logit contributions
umm-3.848
ocol-3.727
lor-3.715
arians-3.669
 former-3.666
Token
Token ID26391
Feature activation+1.00e+00

Pos logit contributions
 and+6.728
,+6.588
.+6.431
 the+6.356
 a+6.300
Neg logit contributions
umm-1.929
lor-1.810
ocol-1.769
arians-1.705
vation-1.702
Token He
Token ID679
Feature activation+1.00e+00

Pos logit contributions
 and+5.295
,+5.133
.+5.004
 the+4.957
 a+4.876
Neg logit contributions
umm-3.216
lor-3.210
arians-3.095
ocol-3.079
 former-3.019
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
 and+3.870
,+3.757
.+3.607
 the+3.562
 a+3.452
Neg logit contributions
umm-4.599
lor-4.520
ocol-4.482
arians-4.460
 former-4.456
Token excited
Token ID6568
Feature activation+1.00e+00

Pos logit contributions
 and+3.324
,+3.198
.+3.041
 the+2.901
 a+2.797
Neg logit contributions
umm-5.171
ocol-5.123
 former-5.081
lor-5.073
arians-4.989
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
 and+1.499
,+1.376
 the+1.209
.+1.207
 a+1.129
Neg logit contributions
umm-6.981
lor-6.839
ocol-6.781
 former-6.766
arians-6.741
Token felt
Token ID2936
Feature activation+1.00e+00

Pos logit contributions
 and+3.664
,+3.524
.+3.383
 the+3.262
 a+3.190
Neg logit contributions
umm-4.853
 former-4.729
ocol-4.728
lor-4.710
osity-4.680
Token Ell
Token ID7122
Feature activation+1.00e+00

Pos logit contributions
 and+4.154
,+4.008
.+3.849
 the+3.718
 a+3.630
Neg logit contributions
umm-4.290
lor-4.236
ocol-4.220
arians-4.192
osity-4.164
Tokeny
Token ID88
Feature activation+1.00e+00

Pos logit contributions
 and+5.215
,+5.105
.+4.912
 the+4.903
 a+4.795
Neg logit contributions
lor-3.264
umm-3.242
ocol-3.203
arians-3.149
 former-3.137
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+2.291
,+2.140
 the+1.984
.+1.970
 a+1.867
Neg logit contributions
umm-6.114
lor-6.100
 former-5.993
arians-5.977
ocol-5.969
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+3.454
,+3.307
.+3.155
 the+3.090
 a+2.985
Neg logit contributions
umm-5.110
lor-5.014
ocol-4.960
 former-4.918
osity-4.902
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.666
,+1.548
.+1.374
 the+1.298
 a+1.190
Neg logit contributions
lor-6.821
umm-6.779
ocol-6.705
 former-6.684
rol-6.634
TokenE
Token ID36
Feature activation+1.00e+00

Pos logit contributions
 and+4.853
,+4.733
.+4.573
 the+4.497
 a+4.381
Neg logit contributions
umm-3.679
lor-3.669
ocol-3.595
 former-3.562
arians-3.520
Tokenlly
Token ID12810
Feature activation+1.00e+00

Pos logit contributions
 and+6.515
,+6.380
.+6.228
 the+6.177
 a+6.095
Neg logit contributions
umm-2.081
lor-1.979
ocol-1.949
arians-1.893
 former-1.889
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
 and+3.134
,+3.016
.+2.905
 the+2.857
 a+2.756
Neg logit contributions
umm-5.243
lor-5.205
 former-5.147
arians-5.131
ocol-5.097
Token very
Token ID845
Feature activation+1.00e+00

Pos logit contributions
 and+3.083
,+2.948
.+2.847
 the+2.669
 a+2.535
Neg logit contributions
umm-5.376
 former-5.324
lor-5.304
ocol-5.303
arians-5.230
Token quiet
Token ID5897
Feature activation+1.00e+00

Pos logit contributions
 and+3.748
,+3.587
.+3.464
 the+3.354
 a+3.253
Neg logit contributions
umm-4.812
ocol-4.726
 former-4.705
lor-4.635
osity-4.627
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
 and+0.847
,+0.734
 the+0.599
.+0.586
 a+0.491
Neg logit contributions
umm-7.545
lor-7.420
 former-7.371
arians-7.352
ocol-7.347
Token some
Token ID617
Feature activation+1.00e+00

Pos logit contributions
 and+1.873
,+1.748
.+1.590
 the+1.423
 a+1.316
Neg logit contributions
umm-6.645
ocol-6.516
lor-6.506
 former-6.435
rol-6.432
Token cookies
Token ID14746
Feature activation+1.00e+00

Pos logit contributions
 and+4.091
,+3.982
.+3.819
 the+3.739
 a+3.624
Neg logit contributions
umm-4.420
ocol-4.363
 former-4.255
lor-4.229
arians-4.207
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
 and+1.409
,+1.303
.+1.156
 the+1.141
 a+1.057
Neg logit contributions
umm-6.999
lor-6.890
ocol-6.848
arians-6.824
 former-6.761
Token ice
Token ID4771
Feature activation+1.00e+00

Pos logit contributions
 and+4.332
,+4.237
.+4.090
 the+3.921
 a+3.831
Neg logit contributions
umm-4.159
ocol-4.037
 former-4.022
arians-3.991
lor-3.985
Token-
Token ID12
Feature activation+1.00e+00

Pos logit contributions
 and+4.848
,+4.748
.+4.566
 the+4.550
 a+4.457
Neg logit contributions
umm-3.657
lor-3.537
ocol-3.533
arians-3.458
 former-3.417
Tokencream
Token ID36277
Feature activation+1.00e+00

Pos logit contributions
 and+7.002
,+6.911
.+6.733
 the+6.620
 a+6.517
Neg logit contributions
umm-1.602
lor-1.489
ocol-1.480
arians-1.422
 former-1.420
Token inside
Token ID2641
Feature activation+1.00e+00

Pos logit contributions
 and+3.317
,+3.194
.+3.023
 the+3.008
 a+2.928
Neg logit contributions
umm-5.206
lor-5.030
ocol-4.987
arians-4.986
rol-4.926
Token!
Token ID0
Feature activation+1.00e+00

Pos logit contributions
 and+1.082
,+0.952
.+0.761
 the+0.696
 a+0.656
Neg logit contributions
umm-7.385
ocol-7.269
lor-7.229
 former-7.189
arians-7.154
Token They
Token ID1119
Feature activation+1.00e+00

Pos logit contributions
 and+3.293
,+3.175
.+3.016
 the+2.913
 a+2.829
Neg logit contributions
umm-5.286
lor-5.146
ocol-5.132
 former-5.109
arians-5.080
Token were
Token ID547
Feature activation+1.00e+00

Pos logit contributions
 and+3.369
,+3.266
.+3.129
 the+3.000
 a+2.932
Neg logit contributions
umm-5.084
ocol-4.970
lor-4.957
 former-4.955
arians-4.931
Token so
Token ID523
Feature activation+1.00e+00

Pos logit contributions
 and+2.343
,+2.228
.+2.095
 the+1.951
 a+1.862
Neg logit contributions
umm-6.201
ocol-6.063
lor-6.027
 former-6.019
rol-5.979
Token long
Token ID890
Feature activation+1.00e+00

Pos logit contributions
 and+3.675
,+3.520
.+3.383
 the+3.273
 a+3.199
Neg logit contributions
umm-4.881
lor-4.758
ocol-4.746
 former-4.729
arians-4.650
Token there
Token ID612
Feature activation+1.00e+00

Pos logit contributions
 and+1.812
,+1.659
.+1.543
 the+1.445
 a+1.378
Neg logit contributions
umm-6.750
ocol-6.629
lor-6.625
arians-6.541
 former-6.529
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
 and+1.921
,+1.747
.+1.644
 the+1.559
 a+1.494
Neg logit contributions
umm-6.589
lor-6.470
ocol-6.421
arians-6.372
 former-6.371
Token more
Token ID517
Feature activation+1.00e+00

Pos logit contributions
 and+3.506
,+3.358
.+3.215
 the+3.057
 a+2.952
Neg logit contributions
umm-5.069
 former-4.938
ocol-4.927
lor-4.904
ubb-4.800
Token fresh
Token ID4713
Feature activation+1.00e+00

Pos logit contributions
 and+4.581
,+4.475
.+4.315
 the+4.256
 a+4.160
Neg logit contributions
umm-4.025
ocol-3.888
 former-3.824
lor-3.807
ubb-3.738
Token caul
Token ID46087
Feature activation+1.00e+00

Pos logit contributions
 and+4.153
,+4.025
.+3.900
 the+3.821
 a+3.767
Neg logit contributions
umm-4.458
ocol-4.273
lor-4.194
 former-4.143
arians-4.121
Tokeniflower
Token ID42642
Feature activation+1.00e+00

Pos logit contributions
 and+7.102
,+6.992
.+6.811
 the+6.772
 a+6.690
Neg logit contributions
umm-1.576
lor-1.479
ocol-1.437
arians-1.343
 former-1.334
Token to
Token ID284
Feature activation+1.00e+00

Pos logit contributions
 and+2.344
,+2.223
.+2.057
 the+2.032
 a+1.954
Neg logit contributions
umm-6.147
lor-6.020
ocol-5.980
 former-5.922
arians-5.903
Token enjoy
Token ID2883
Feature activation+1.00e+00

Pos logit contributions
 and+4.117
,+3.971
.+3.802
 the+3.696
 a+3.629
Neg logit contributions
umm-4.461
ocol-4.372
 former-4.344
lor-4.310
ubb-4.199
Token!
Token ID0
Feature activation+1.00e+00

Pos logit contributions
 and+2.142
,+2.017
.+1.825
 the+1.746
 a+1.690
Neg logit contributions
umm-6.357
ocol-6.295
lor-6.233
 former-6.216
arians-6.141
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+2.341
,+2.232
.+2.063
 the+1.965
 a+1.886
Neg logit contributions
umm-6.257
lor-6.158
ocol-6.059
 former-6.034
osity-6.016
Token she
Token ID673
Feature activation+1.00e+00

Pos logit contributions
 and+2.779
,+2.613
.+2.496
 the+2.336
 a+2.252
Neg logit contributions
umm-5.718
lor-5.676
ocol-5.607
 former-5.603
osity-5.537
Token noticed
Token ID6810
Feature activation+1.00e+00

Pos logit contributions
 and+3.411
,+3.309
.+3.165
 the+3.083
 a+2.981
Neg logit contributions
umm-4.963
lor-4.957
 former-4.938
ocol-4.874
arians-4.831
Token that
Token ID326
Feature activation+1.00e+00

Pos logit contributions
 and+1.977
,+1.855
.+1.696
 the+1.568
 a+1.443
Neg logit contributions
umm-6.445
lor-6.412
ocol-6.371
 former-6.347
osity-6.240
Token the
Token ID262
Feature activation+1.00e+00

Pos logit contributions
 and+2.466
,+2.346
.+2.207
 the+2.055
 a+1.985
Neg logit contributions
umm-6.027
ocol-5.912
 former-5.911
lor-5.906
osity-5.786
Token l
Token ID300
Feature activation+1.00e+00

Pos logit contributions
 and+4.548
,+4.455
.+4.270
 the+4.212
 a+4.090
Neg logit contributions
umm-4.054
ocol-3.879
 former-3.876
lor-3.839
arians-3.792
Tokenoll
Token ID692
Feature activation+1.00e+00

Pos logit contributions
 and+6.313
,+6.196
.+6.013
 the+5.997
 a+5.857
Neg logit contributions
umm-2.358
lor-2.319
ocol-2.270
arians-2.135
 former-2.133
Tokenipop
Token ID42800
Feature activation+1.00e+00

Pos logit contributions
 and+7.062
,+6.911
.+6.793
 the+6.747
 a+6.666
Neg logit contributions
umm-1.494
ocol-1.447
lor-1.426
arians-1.287
 former-1.250
Token was
Token ID373
Feature activation+1.00e+00

Pos logit contributions
 and+2.960
,+2.853
.+2.699
 the+2.668
 a+2.572
Neg logit contributions
umm-5.514
lor-5.423
arians-5.326
ocol-5.317
 former-5.294
Token gone
Token ID3750
Feature activation+1.00e+00

Pos logit contributions
 and+3.917
,+3.811
.+3.654
 the+3.554
 a+3.431
Neg logit contributions
umm-4.592
ocol-4.492
 former-4.472
lor-4.461
osity-4.339
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+1.726
,+1.617
 the+1.466
.+1.428
 a+1.353
Neg logit contributions
umm-6.681
lor-6.582
 former-6.565
ocol-6.498
arians-6.433
Token She
Token ID1375
Feature activation+1.00e+00

Pos logit contributions
 and+3.782
,+3.656
.+3.506
 the+3.404
 a+3.301
Neg logit contributions
umm-4.839
lor-4.759
 former-4.672
ocol-4.656
osity-4.613
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+1.086
,+0.991
.+0.802
 the+0.781
 a+0.692
Neg logit contributions
umm-7.331
ocol-7.220
lor-7.187
arians-7.152
osity-7.116
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.931
,+1.835
.+1.666
 the+1.547
 a+1.448
Neg logit contributions
umm-6.569
lor-6.521
ocol-6.422
arians-6.399
 former-6.380
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
 and+1.391
,+1.281
.+1.104
 the+1.022
 a+0.947
Neg logit contributions
lor-7.005
umm-6.978
ocol-6.889
arians-6.866
 former-6.839
Token"
Token ID1
Feature activation+1.00e+00

Pos logit contributions
 and+3.444
,+3.325
.+3.180
 the+3.067
 a+2.986
Neg logit contributions
umm-5.081
lor-5.030
ocol-4.972
arians-4.962
vation-4.880
TokenAAAA
Token ID17922
Feature activation+1.00e+00

Pos logit contributions
 and+5.926
,+5.848
.+5.656
 the+5.538
 a+5.429
Neg logit contributions
umm-2.719
lor-2.581
ocol-2.573
 former-2.529
arians-2.514
TokenH
Token ID39
Feature activation+1.00e+00

Pos logit contributions
 and+6.163
,+6.039
.+5.886
 the+5.798
 a+5.700
Neg logit contributions
umm-2.435
lor-2.348
ocol-2.290
arians-2.247
 former-2.230
Token!"
Token ID2474
Feature activation+1.00e+00

Pos logit contributions
 and+5.865
,+5.657
.+5.550
 the+5.526
 a+5.442
Neg logit contributions
umm-2.694
lor-2.661
ocol-2.590
arians-2.505
vation-2.475
Token Tom
Token ID4186
Feature activation+1.00e+00

Pos logit contributions
 and+3.037
,+2.910
.+2.761
 the+2.655
 a+2.567
Neg logit contributions
umm-5.505
lor-5.387
ocol-5.376
arians-5.322
 former-5.318
Token and
Token ID290
Feature activation+1.00e+00

Pos logit contributions
 and+1.936
,+1.888
.+1.742
 the+1.714
 a+1.612
Neg logit contributions
umm-6.400
lor-6.328
arians-6.292
ocol-6.232
 former-6.219
Token Anna
Token ID11735
Feature activation+1.00e+00

Pos logit contributions
 and+3.375
,+3.256
.+3.102
 the+2.967
 a+2.887
Neg logit contributions
umm-5.191
lor-5.079
ocol-5.069
arians-5.016
 former-4.997
Token scream
Token ID8196
Feature activation+1.00e+00

Pos logit contributions
 and+4.134
,+4.026
.+3.885
 the+3.825
 a+3.725
Neg logit contributions
umm-4.261
lor-4.183
arians-4.162
ocol-4.150
 former-4.115
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+0.914
,+0.804
 the+0.653
.+0.631
 a+0.554
Neg logit contributions
umm-7.515
 former-7.348
ocol-7.338
lor-7.337
arians-7.273
Token But
Token ID887
Feature activation+1.00e+00

Pos logit contributions
 and+3.114
,+2.989
.+2.824
 the+2.759
 a+2.675
Neg logit contributions
umm-5.414
lor-5.300
 former-5.273
ocol-5.244
osity-5.221
Token she
Token ID673
Feature activation+1.00e+00

Pos logit contributions
 and+2.460
,+2.307
.+2.214
 the+2.044
 a+1.986
Neg logit contributions
umm-6.004
ocol-5.907
 former-5.885
lor-5.859
arians-5.812
Token didn
Token ID1422
Feature activation+1.00e+00

Pos logit contributions
 and+3.388
,+3.297
.+3.158
 the+3.084
 a+2.985
Neg logit contributions
umm-5.015
lor-4.955
 former-4.926
arians-4.887
ocol-4.872
Tokenâ
Token ID22940
Feature activation+1.00e+00

Pos logit contributions
 and+3.956
,+3.829
.+3.668
 the+3.584
 a+3.462
Neg logit contributions
umm-4.543
lor-4.488
ocol-4.428
 former-4.379
arians-4.379
Token
Token ID26391
Feature activation+1.00e+00

Pos logit contributions
 and+6.963
,+6.844
.+6.669
 the+6.613
 a+6.535
Neg logit contributions
umm-1.696
lor-1.567
ocol-1.521
 former-1.492
arians-1.471
Token
Token ID8151
Feature activation+1.00e+00

Pos logit contributions
 and+5.696
,+5.553
.+5.387
 the+5.376
 a+5.269
Neg logit contributions
lor-2.851
umm-2.800
arians-2.725
 former-2.654
osity-2.642
Tokent
Token ID83
Feature activation+1.00e+00

Pos logit contributions
 and+5.471
,+5.328
.+5.161
 the+5.149
 a+5.064
Neg logit contributions
umm-3.019
lor-2.980
rol-2.885
ocol-2.875
arians-2.854
Token forget
Token ID6044
Feature activation+1.00e+00

Pos logit contributions
 and+4.065
,+3.940
.+3.758
 the+3.736
 a+3.648
Neg logit contributions
umm-4.400
ocol-4.326
lor-4.289
 former-4.276
osity-4.250
Token the
Token ID262
Feature activation+1.00e+00

Pos logit contributions
 and+1.913
,+1.781
.+1.635
 the+1.544
 a+1.479
Neg logit contributions
umm-6.475
lor-6.435
ocol-6.404
 former-6.352
osity-6.352
Token memory
Token ID4088
Feature activation+1.00e+00

Pos logit contributions
 and+4.813
,+4.700
.+4.528
 the+4.469
 a+4.347
Neg logit contributions
umm-3.785
ocol-3.617
 former-3.610
lor-3.590
arians-3.524
Token to
Token ID284
Feature activation+1.00e+00

Pos logit contributions
 and+2.731
,+2.620
.+2.460
 the+2.369
 a+2.275
Neg logit contributions
umm-5.718
osity-5.630
 former-5.626
ocol-5.620
lor-5.592
Token worry
Token ID5490
Feature activation+1.00e+00

Pos logit contributions
 and+3.949
,+3.797
.+3.623
 the+3.591
 a+3.510
Neg logit contributions
umm-4.547
 former-4.416
ocol-4.397
lor-4.394
arians-4.362
Token.
Token ID13
Feature activation+1.00e+00

Pos logit contributions
 and+2.681
,+2.540
.+2.385
 the+2.379
 a+2.313
Neg logit contributions
umm-5.756
lor-5.630
ocol-5.626
osity-5.579
arians-5.573
Token I
Token ID314
Feature activation+1.00e+00

Pos logit contributions
 and+4.776
,+4.647
.+4.506
 the+4.412
 a+4.307
Neg logit contributions
umm-3.825
lor-3.732
ocol-3.688
 former-3.653
arians-3.638
Tokenâ
Token ID22940
Feature activation+1.00e+00

Pos logit contributions
 and+4.171
,+4.072
.+3.913
 the+3.894
 a+3.762
Neg logit contributions
umm-4.252
lor-4.221
 former-4.206
ocol-4.139
arians-4.100
Token
Token ID26391
Feature activation+1.00e+00

Pos logit contributions
 and+6.498
,+6.359
.+6.205
 the+6.136
 a+6.055
Neg logit contributions
umm-2.182
lor-2.041
ocol-2.012
 former-1.952
arians-1.931
Token
Token ID8151
Feature activation+1.00e+00

Pos logit contributions
 and+6.118
,+5.959
.+5.830
 the+5.793
 a+5.694
Neg logit contributions
umm-2.410
lor-2.392
arians-2.283
ocol-2.246
 former-2.225
Tokenm
Token ID76
Feature activation+1.00e+00

Pos logit contributions
 and+4.531
,+4.378
.+4.235
 the+4.214
 a+4.144
Neg logit contributions
umm-3.953
lor-3.890
ocol-3.804
rol-3.790
arians-3.774
Token so
Token ID523
Feature activation+1.00e+00

Pos logit contributions
 and+4.874
,+4.741
.+4.580
 the+4.536
 a+4.412
Neg logit contributions
umm-3.609
ocol-3.573
lor-3.573
 former-3.503
osity-3.493
Token glad
Token ID9675
Feature activation+1.00e+00

Pos logit contributions
 and+3.840
,+3.651
.+3.512
 the+3.430
 a+3.346
Neg logit contributions
umm-4.710
ocol-4.610
lor-4.587
 former-4.575
osity-4.533
Token you
Token ID345
Feature activation+1.00e+00

Pos logit contributions
 and+2.829
,+2.689
.+2.543
 the+2.457
 a+2.389
Neg logit contributions
umm-5.626
lor-5.565
ocol-5.503
 former-5.457
arians-5.447