TOP ACTIVATIONS
MAX = -1.000

family . The family was very poor but they prayed hard
and was very happy . <|endoftext|> Once upon a time ,
wood !" Grand pa stepped in and said
use the screw and was very happy . <|endoftext|> Once upon
Tom went to his bed and hid under the covers
talking , table ?" The table did not say anything either
It was coming from his closet ! Tom opened the closet
closet ! Tom opened the closet and saw a big monster
. All the animals were very scared and kept quiet .
apple . The lion was very happy and thanked the monkey
the toy fell from the table and broke . The woman
broke . The woman was very sad . She
Ben , but you are very funny . Can you say
smile . The animals were very happy to have made the
they all have fun . <|endoftext|> Anna is a big girl
tree with lots of cher ries on it . She wanted
. She liked the toy very much . It was a
The end . <|endoftext|> A woman had a broken
every night !" Tom was very scared and cried . From
night 's sleep again . <|endoftext|> Once upon a time ,

MIN ACTIVATIONS
MIN = -1.000

your bike ." The unknown person was happy and said ,
, " I can help you fix your bike ." The
He went to the unknown person and said , " I
person and said , " I can help you fix your
, " I will scare you every night !" Tom was
Tom and said , " I will scare you every night
scared and cried . From that day on , Tom was
and cried . From that day on , Tom was always
, Tom saw an unknown person who needed help . The
Tom . Tom liked to help others . One day ,
a good night 's sleep again . <|endoftext|> Once upon a
, there was a kind person named Tom . Tom liked
needed help . The unknown person had a flat tire on
an unknown person who needed help . The unknown person had
tire on a bike . Tom wanted to lend
on a bike . Tom wanted to lend a
never had a good night 's sleep again . <|endoftext|> Once
and he never had a good night 's sleep again .
in his room , and he never had a good night
his room , and he never had a good night 's

INTERVAL -1.000 to -1.000
CONTAINS 0.000%

Token family
Token ID1641
Feature activation-1.00e+00

Pos logit contributions
 and+4.349
,+4.225
.+4.180
 the+4.039
 a+3.972
Neg logit contributions
umm-4.999
lor-4.673
osity-4.632
arians-4.588
ocol-4.573
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+2.802
,+2.691
.+2.567
 the+2.451
 a+2.400
Neg logit contributions
umm-6.528
lor-6.312
osity-6.228
vation-6.195
rol-6.154
Token The
Token ID383
Feature activation-1.00e+00

Pos logit contributions
 and+3.568
,+3.453
.+3.355
 the+3.204
 a+3.132
Neg logit contributions
umm-5.864
lor-5.656
osity-5.561
vation-5.551
arians-5.494
Token family
Token ID1641
Feature activation-1.00e+00

Pos logit contributions
 and+3.515
,+3.427
.+3.314
 the+3.149
 a+3.057
Neg logit contributions
umm-5.914
lor-5.618
osity-5.538
vation-5.526
arians-5.470
Token was
Token ID373
Feature activation-1.00e+00

Pos logit contributions
 and+3.319
,+3.206
.+3.104
 the+2.957
 a+2.881
Neg logit contributions
umm-6.098
lor-5.896
osity-5.795
vation-5.789
arians-5.733
Token very
Token ID845
Feature activation-1.00e+00

Pos logit contributions
 and+2.278
,+2.155
.+2.081
 the+1.882
 a+1.795
Neg logit contributions
umm-7.102
lor-6.846
osity-6.810
vation-6.764
rol-6.740
Token poor
Token ID3595
Feature activation-1.00e+00

Pos logit contributions
 and+3.588
,+3.464
.+3.370
 the+3.217
 a+3.148
Neg logit contributions
umm-5.837
lor-5.607
osity-5.538
vation-5.523
arians-5.461
Token but
Token ID475
Feature activation-1.00e+00

Pos logit contributions
 and+0.769
,+0.715
 the+0.568
.+0.559
 a+0.480
Neg logit contributions
umm-8.389
lor-8.067
arians-8.022
osity-8.013
 former-7.957
Token they
Token ID484
Feature activation-1.00e+00

Pos logit contributions
 and+3.174
,+3.067
.+2.984
 the+2.766
 a+2.701
Neg logit contributions
umm-6.132
lor-5.860
osity-5.858
vation-5.814
arians-5.753
Token prayed
Token ID32168
Feature activation-1.00e+00

Pos logit contributions
 and+3.627
,+3.514
.+3.416
 the+3.258
 a+3.195
Neg logit contributions
umm-5.720
lor-5.515
osity-5.456
vation-5.434
arians-5.383
Token hard
Token ID1327
Feature activation-1.00e+00

Pos logit contributions
 and+2.094
,+2.027
.+1.851
 the+1.791
 a+1.688
Neg logit contributions
umm-6.959
osity-6.836
lor-6.814
vation-6.737
 former-6.695
Token and
Token ID290
Feature activation-1.00e+00

Pos logit contributions
 and+4.566
,+4.478
.+4.372
 the+4.319
 a+4.224
Neg logit contributions
umm-4.600
lor-4.449
osity-4.400
rol-4.321
vation-4.298
Token was
Token ID373
Feature activation-1.00e+00

Pos logit contributions
 and+4.008
,+3.943
.+3.888
 the+3.577
 a+3.553
Neg logit contributions
umm-5.260
osity-5.144
lor-5.023
 former-4.984
arians-4.953
Token very
Token ID845
Feature activation-1.00e+00

Pos logit contributions
 and+4.250
,+4.143
.+4.025
 the+3.799
 a+3.701
Neg logit contributions
umm-5.018
lor-4.882
osity-4.832
ubb-4.758
vation-4.756
Token happy
Token ID3772
Feature activation-1.00e+00

Pos logit contributions
 and+4.554
,+4.416
.+4.307
 the+4.140
 a+4.062
Neg logit contributions
umm-4.850
osity-4.639
ubb-4.569
lor-4.556
 former-4.531
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+1.518
,+1.424
.+1.257
 the+1.234
 a+1.185
Neg logit contributions
umm-7.834
osity-7.494
lor-7.491
arians-7.369
vation-7.352
Token<|endoftext|>
Token ID50256
Feature activation-1.00e+00

Pos logit contributions
 and+2.487
,+2.380
.+2.265
 the+2.107
 a+2.050
Neg logit contributions
umm-6.933
lor-6.744
osity-6.648
vation-6.588
rol-6.538
TokenOnce
Token ID7454
Feature activation-1.00e+00

Pos logit contributions
 and+3.021
,+2.919
.+2.765
 the+2.645
 a+2.586
Neg logit contributions
umm-6.267
lor-6.165
vation-5.991
rol-5.958
osity-5.920
Token upon
Token ID2402
Feature activation-1.00e+00

Pos logit contributions
 and+4.256
,+4.085
.+4.007
 the+3.853
 a+3.853
Neg logit contributions
umm-4.978
lor-4.825
osity-4.784
vation-4.695
rol-4.684
Token a
Token ID257
Feature activation-1.00e+00

Pos logit contributions
 and+0.651
,+0.640
.+0.412
 the+0.255
 a+0.010
Neg logit contributions
lor-8.385
umm-8.373
vation-8.211
osity-8.140
rol-8.135
Token time
Token ID640
Feature activation-1.00e+00

Pos logit contributions
 and+2.999
,+2.896
.+2.799
 the+2.716
 a+2.596
Neg logit contributions
umm-6.320
lor-6.125
vation-6.007
osity-5.958
rol-5.911
Token,
Token ID11
Feature activation-1.00e+00

Pos logit contributions
 and+0.302
.+0.099
,+0.021
 the-0.060
 a-0.119
Neg logit contributions
umm-8.912
lor-8.762
osity-8.696
rol-8.616
reetings-8.613
Token wood
Token ID4898
Feature activation-1.00e+00

Pos logit contributions
 and+5.188
,+5.052
.+4.942
 the+4.786
 a+4.688
Neg logit contributions
umm-4.302
lor-3.989
vation-3.911
osity-3.857
 former-3.844
Token!"
Token ID2474
Feature activation-1.00e+00

Pos logit contributions
 and+3.591
,+3.501
.+3.372
 the+3.338
 a+3.296
Neg logit contributions
umm-5.667
lor-5.471
vation-5.325
osity-5.295
arians-5.287
Token (space)
Token ID220
Feature activation-1.00e+00

Pos logit contributions
 and+3.387
,+3.263
.+3.160
 the+2.976
 a+2.965
Neg logit contributions
umm-5.869
lor-5.753
vation-5.612
osity-5.581
rol-5.555
Token (newline)
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+1.739
,+1.636
.+1.469
 the+1.356
 a+1.301
Neg logit contributions
umm-7.566
lor-7.543
osity-7.344
vation-7.336
rol-7.323
Token (newline)
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+3.829
,+3.731
.+3.592
 the+3.442
 a+3.383
Neg logit contributions
umm-5.535
lor-5.480
vation-5.310
osity-5.246
rol-5.242
TokenGrand
Token ID23581
Feature activation-1.00e+00

Pos logit contributions
 and+5.715
,+5.623
.+5.501
 the+5.346
 a+5.265
Neg logit contributions
umm-3.706
lor-3.611
vation-3.465
osity-3.422
arians-3.406
Tokenpa
Token ID8957
Feature activation-1.00e+00

Pos logit contributions
 and+5.559
,+5.458
.+5.339
 the+5.270
 a+5.203
Neg logit contributions
umm-3.911
lor-3.735
vation-3.601
osity-3.564
arians-3.545
Token stepped
Token ID10764
Feature activation-1.00e+00

Pos logit contributions
 and+4.311
,+4.216
.+4.163
 the+4.012
 a+3.972
Neg logit contributions
umm-4.888
lor-4.769
osity-4.672
vation-4.665
arians-4.611
Token in
Token ID287
Feature activation-1.00e+00

Pos logit contributions
 and+3.425
,+3.355
.+3.240
 the+3.090
 a+3.013
Neg logit contributions
umm-5.901
lor-5.726
rol-5.671
vation-5.594
osity-5.579
Token and
Token ID290
Feature activation-1.00e+00

Pos logit contributions
 and+2.408
,+2.342
.+2.229
 the+2.049
 a+2.028
Neg logit contributions
umm-6.752
lor-6.491
ocol-6.436
vation-6.414
osity-6.381
Token said
Token ID531
Feature activation-1.00e+00

Pos logit contributions
 and+4.029
,+3.920
.+3.837
 a+3.554
 the+3.554
Neg logit contributions
umm-5.226
lor-5.092
vation-5.049
osity-5.020
rol-5.005
Token use
Token ID779
Feature activation-1.00e+00

Pos logit contributions
 and+4.159
,+4.046
.+3.952
 the+3.846
 a+3.760
Neg logit contributions
umm-5.151
lor-4.938
osity-4.867
vation-4.851
 former-4.847
Token the
Token ID262
Feature activation-1.00e+00

Pos logit contributions
 and+3.681
,+3.580
.+3.518
 the+3.209
 a+3.142
Neg logit contributions
umm-5.363
ocol-5.258
lor-5.183
ubb-5.183
osity-5.178
Token screw
Token ID9580
Feature activation-1.00e+00

Pos logit contributions
 and+5.590
,+5.487
.+5.385
 the+5.273
 a+5.152
Neg logit contributions
umm-3.762
lor-3.461
osity-3.450
ocol-3.408
vation-3.403
Token and
Token ID290
Feature activation-1.00e+00

Pos logit contributions
 and+4.566
,+4.478
.+4.372
 the+4.319
 a+4.224
Neg logit contributions
umm-4.600
lor-4.449
osity-4.400
rol-4.321
vation-4.298
Token was
Token ID373
Feature activation-1.00e+00

Pos logit contributions
 and+4.008
,+3.943
.+3.888
 the+3.577
 a+3.553
Neg logit contributions
umm-5.260
osity-5.144
lor-5.023
 former-4.984
arians-4.953
Token very
Token ID845
Feature activation-1.00e+00

Pos logit contributions
 and+4.250
,+4.143
.+4.025
 the+3.799
 a+3.701
Neg logit contributions
umm-5.018
lor-4.882
osity-4.832
ubb-4.758
vation-4.756
Token happy
Token ID3772
Feature activation-1.00e+00

Pos logit contributions
 and+4.554
,+4.416
.+4.307
 the+4.140
 a+4.062
Neg logit contributions
umm-4.850
osity-4.639
ubb-4.569
lor-4.556
 former-4.531
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+1.518
,+1.424
.+1.257
 the+1.234
 a+1.185
Neg logit contributions
umm-7.834
osity-7.494
lor-7.491
arians-7.369
vation-7.352
Token<|endoftext|>
Token ID50256
Feature activation-1.00e+00

Pos logit contributions
 and+2.487
,+2.380
.+2.265
 the+2.107
 a+2.050
Neg logit contributions
umm-6.933
lor-6.744
osity-6.648
vation-6.588
rol-6.538
TokenOnce
Token ID7454
Feature activation-1.00e+00

Pos logit contributions
 and+3.021
,+2.919
.+2.765
 the+2.645
 a+2.586
Neg logit contributions
umm-6.267
lor-6.165
vation-5.991
rol-5.958
osity-5.920
Token upon
Token ID2402
Feature activation-1.00e+00

Pos logit contributions
 and+4.256
,+4.085
.+4.007
 the+3.853
 a+3.853
Neg logit contributions
umm-4.978
lor-4.825
osity-4.784
vation-4.695
rol-4.684
Token (newline)
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+1.600
,+1.489
.+1.355
 the+1.209
 a+1.148
Neg logit contributions
umm-7.709
lor-7.610
vation-7.463
osity-7.428
rol-7.407
TokenTom
Token ID13787
Feature activation-1.00e+00

Pos logit contributions
 and+4.087
,+3.972
.+3.870
 the+3.721
 a+3.649
Neg logit contributions
umm-5.363
lor-5.172
vation-5.065
osity-5.062
arians-5.001
Token went
Token ID1816
Feature activation-1.00e+00

Pos logit contributions
 and+3.295
,+3.251
.+3.204
 the+3.077
 a+3.000
Neg logit contributions
umm-5.752
lor-5.704
vation-5.587
arians-5.565
osity-5.531
Token to
Token ID284
Feature activation-1.00e+00

Pos logit contributions
 and+2.074
,+1.974
.+1.859
 the+1.723
 a+1.659
Neg logit contributions
umm-7.175
lor-7.051
vation-6.971
osity-6.939
rol-6.920
Token his
Token ID465
Feature activation-1.00e+00

Pos logit contributions
 and+2.788
,+2.683
.+2.542
 a+2.231
 the+2.229
Neg logit contributions
umm-6.401
vation-6.255
lor-6.233
ocol-6.172
osity-6.149
Token bed
Token ID3996
Feature activation-1.00e+00

Pos logit contributions
 and+4.208
,+4.109
.+3.999
 the+3.878
 a+3.786
Neg logit contributions
umm-5.101
lor-4.826
 former-4.790
arians-4.783
vation-4.753
Token and
Token ID290
Feature activation-1.00e+00

Pos logit contributions
 and+1.411
,+1.336
 the+1.254
.+1.200
 a+1.151
Neg logit contributions
umm-7.681
lor-7.624
vation-7.448
arians-7.356
 former-7.348
Token hid
Token ID24519
Feature activation-1.00e+00

Pos logit contributions
 and+4.014
,+3.926
.+3.827
 the+3.586
 a+3.520
Neg logit contributions
umm-5.094
lor-5.071
vation-4.981
osity-4.911
arians-4.882
Token under
Token ID739
Feature activation-1.00e+00

Pos logit contributions
 and+3.293
,+3.220
.+2.962
 the+2.839
 a+2.839
Neg logit contributions
umm-5.873
ocol-5.674
lor-5.649
rol-5.575
ubb-5.566
Token the
Token ID262
Feature activation-1.00e+00

Pos logit contributions
 and+1.761
,+1.651
.+1.488
 the+1.182
 a+1.126
Neg logit contributions
umm-7.484
lor-7.290
ocol-7.250
ubb-7.162
vation-7.159
Token covers
Token ID8698
Feature activation-1.00e+00

Pos logit contributions
 and+4.863
,+4.781
.+4.637
 the+4.536
 a+4.484
Neg logit contributions
umm-4.469
lor-4.250
vation-4.197
osity-4.113
ocol-4.097
Token talking
Token ID3375
Feature activation-1.00e+00

Pos logit contributions
 and+4.249
,+4.090
.+3.988
 the+3.908
 a+3.808
Neg logit contributions
umm-4.965
lor-4.865
vation-4.793
osity-4.771
reetings-4.730
Token,
Token ID11
Feature activation-1.00e+00

Pos logit contributions
 and+3.191
,+3.118
.+2.967
 the+2.926
 a+2.816
Neg logit contributions
umm-5.998
lor-5.880
osity-5.804
vation-5.741
arians-5.697
Token table
Token ID3084
Feature activation-1.00e+00

Pos logit contributions
 and+4.143
,+4.079
.+4.020
 the+3.818
 a+3.767
Neg logit contributions
umm-5.100
lor-4.852
osity-4.846
vation-4.785
arians-4.774
Token?"
Token ID1701
Feature activation-1.00e+00

Pos logit contributions
 and+2.507
,+2.398
.+2.297
 the+2.275
 a+2.177
Neg logit contributions
umm-6.730
lor-6.534
vation-6.412
osity-6.339
arians-6.339
Token The
Token ID383
Feature activation-1.00e+00

Pos logit contributions
 and+3.559
,+3.431
.+3.330
 the+3.135
 a+3.101
Neg logit contributions
umm-5.672
lor-5.515
osity-5.413
vation-5.409
ocol-5.375
Token table
Token ID3084
Feature activation-1.00e+00

Pos logit contributions
 and+4.738
,+4.642
.+4.514
 the+4.365
 a+4.274
Neg logit contributions
umm-4.652
lor-4.357
vation-4.334
osity-4.315
ocol-4.284
Token did
Token ID750
Feature activation-1.00e+00

Pos logit contributions
 and+4.244
,+4.138
.+4.047
 the+3.950
 a+3.862
Neg logit contributions
umm-5.020
lor-4.857
vation-4.789
osity-4.700
rol-4.671
Token not
Token ID407
Feature activation-1.00e+00

Pos logit contributions
 and+2.739
,+2.611
.+2.500
 the+2.379
 a+2.286
Neg logit contributions
umm-6.457
lor-6.352
vation-6.300
osity-6.237
arians-6.163
Token say
Token ID910
Feature activation-1.00e+00

Pos logit contributions
 and+4.356
,+4.245
.+4.125
 the+4.016
 a+3.922
Neg logit contributions
umm-4.976
vation-4.750
lor-4.709
osity-4.708
ubb-4.630
Token anything
Token ID1997
Feature activation-1.00e+00

Pos logit contributions
 and+4.550
,+4.340
.+4.260
 the+4.108
 a+4.029
Neg logit contributions
umm-4.769
lor-4.679
osity-4.553
vation-4.535
ubb-4.457
Token either
Token ID2035
Feature activation-1.00e+00

Pos logit contributions
 and+2.142
,+2.019
.+1.865
 the+1.843
 a+1.807
Neg logit contributions
umm-6.970
lor-6.833
osity-6.777
vation-6.754
arians-6.694
Token It
Token ID632
Feature activation-1.00e+00

Pos logit contributions
 and+3.611
,+3.505
.+3.389
 the+3.228
 a+3.138
Neg logit contributions
umm-5.817
lor-5.619
osity-5.522
vation-5.471
arians-5.423
Token was
Token ID373
Feature activation-1.00e+00

Pos logit contributions
 and+3.662
,+3.559
.+3.454
 the+3.290
 a+3.218
Neg logit contributions
umm-5.710
lor-5.524
vation-5.421
osity-5.407
rol-5.356
Token coming
Token ID2406
Feature activation-1.00e+00

Pos logit contributions
 and+2.911
,+2.819
.+2.656
 the+2.425
 a+2.330
Neg logit contributions
umm-6.324
ocol-6.071
lor-6.060
osity-6.055
 former-6.025
Token from
Token ID422
Feature activation-1.00e+00

Pos logit contributions
 and+2.438
,+2.362
.+2.218
 the+2.128
 a+2.034
Neg logit contributions
umm-6.829
lor-6.523
osity-6.510
rol-6.481
vation-6.446
Token his
Token ID465
Feature activation-1.00e+00

Pos logit contributions
 and+2.770
,+2.725
.+2.521
 the+2.291
 a+2.238
Neg logit contributions
umm-6.401
ocol-6.167
lor-6.123
vation-6.081
 former-6.069
Token closet
Token ID21615
Feature activation-1.00e+00

Pos logit contributions
 and+4.550
,+4.427
.+4.299
 the+4.249
 a+4.130
Neg logit contributions
umm-4.798
lor-4.512
 former-4.466
arians-4.453
vation-4.436
Token!
Token ID0
Feature activation-1.00e+00

Pos logit contributions
 and+2.110
,+2.012
 the+1.938
 a+1.841
.+1.839
Neg logit contributions
umm-7.119
lor-7.021
arians-6.790
vation-6.773
osity-6.759
Token Tom
Token ID4186
Feature activation-1.00e+00

Pos logit contributions
 and+3.377
,+3.281
.+3.164
 the+2.987
 a+2.907
Neg logit contributions
umm-6.035
lor-5.856
osity-5.744
vation-5.706
rol-5.660
Token opened
Token ID4721
Feature activation-1.00e+00

Pos logit contributions
 and+3.418
,+3.360
.+3.317
 the+3.168
 a+3.086
Neg logit contributions
umm-5.643
lor-5.629
vation-5.485
arians-5.457
osity-5.448
Token the
Token ID262
Feature activation-1.00e+00

Pos logit contributions
 and+1.252
,+1.141
.+1.045
 a+0.794
 the+0.794
Neg logit contributions
umm-7.984
lor-7.841
osity-7.710
ocol-7.675
rol-7.667
Token closet
Token ID21615
Feature activation-1.00e+00

Pos logit contributions
 and+4.863
,+4.755
.+4.632
 the+4.519
 a+4.437
Neg logit contributions
umm-4.521
lor-4.344
vation-4.221
osity-4.220
ocol-4.142
Token closet
Token ID21615
Feature activation-1.00e+00

Pos logit contributions
 and+4.550
,+4.427
.+4.299
 the+4.249
 a+4.130
Neg logit contributions
umm-4.798
lor-4.512
 former-4.466
arians-4.453
vation-4.436
Token!
Token ID0
Feature activation-1.00e+00

Pos logit contributions
 and+2.110
,+2.012
 the+1.938
 a+1.841
.+1.839
Neg logit contributions
umm-7.119
lor-7.021
arians-6.790
vation-6.773
osity-6.759
Token Tom
Token ID4186
Feature activation-1.00e+00

Pos logit contributions
 and+3.377
,+3.281
.+3.164
 the+2.987
 a+2.907
Neg logit contributions
umm-6.035
lor-5.856
osity-5.744
vation-5.706
rol-5.660
Token opened
Token ID4721
Feature activation-1.00e+00

Pos logit contributions
 and+3.418
,+3.360
.+3.317
 the+3.168
 a+3.086
Neg logit contributions
umm-5.643
lor-5.629
vation-5.485
arians-5.457
osity-5.448
Token the
Token ID262
Feature activation-1.00e+00

Pos logit contributions
 and+1.252
,+1.141
.+1.045
 a+0.794
 the+0.794
Neg logit contributions
umm-7.984
lor-7.841
osity-7.710
ocol-7.675
rol-7.667
Token closet
Token ID21615
Feature activation-1.00e+00

Pos logit contributions
 and+4.863
,+4.755
.+4.632
 the+4.519
 a+4.437
Neg logit contributions
umm-4.521
lor-4.344
vation-4.221
osity-4.220
ocol-4.142
Token and
Token ID290
Feature activation-1.00e+00

Pos logit contributions
 and+2.552
,+2.486
.+2.384
 the+2.355
 a+2.298
Neg logit contributions
umm-6.623
lor-6.514
osity-6.319
vation-6.314
rol-6.255
Token saw
Token ID2497
Feature activation-1.00e+00

Pos logit contributions
 and+3.273
,+3.187
.+3.111
 the+2.850
 a+2.803
Neg logit contributions
umm-5.931
lor-5.846
vation-5.744
osity-5.722
rol-5.683
Token a
Token ID257
Feature activation-1.00e+00

Pos logit contributions
 and+2.363
,+2.217
.+2.080
 the+1.791
 a+1.702
Neg logit contributions
umm-7.035
lor-6.785
ubb-6.713
osity-6.711
rol-6.701
Token big
Token ID1263
Feature activation-1.00e+00

Pos logit contributions
 and+4.065
,+3.936
.+3.860
 the+3.720
 a+3.631
Neg logit contributions
umm-5.310
lor-5.101
vation-5.010
osity-4.996
arians-4.925
Token monster
Token ID9234
Feature activation-1.00e+00

Pos logit contributions
 and+4.650
,+4.510
.+4.489
 the+4.322
 a+4.275
Neg logit contributions
umm-4.685
lor-4.378
ocol-4.330
vation-4.319
osity-4.301
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+1.801
,+1.676
 the+1.547
.+1.525
 a+1.462
Neg logit contributions
umm-7.273
lor-7.187
arians-7.072
osity-7.033
vation-7.027
Token All
Token ID1439
Feature activation-1.00e+00

Pos logit contributions
 and+3.988
,+3.886
.+3.765
 the+3.606
 a+3.540
Neg logit contributions
umm-5.473
lor-5.245
osity-5.147
vation-5.109
arians-5.105
Token the
Token ID262
Feature activation-1.00e+00

Pos logit contributions
 and+1.800
,+1.669
.+1.583
 a+1.361
 the+1.336
Neg logit contributions
umm-7.534
osity-7.267
lor-7.263
vation-7.236
ubb-7.219
Token animals
Token ID4695
Feature activation-1.00e+00

Pos logit contributions
 and+4.237
,+4.141
.+4.056
 the+3.855
 a+3.786
Neg logit contributions
umm-5.182
arians-4.906
vation-4.888
lor-4.880
osity-4.820
Token were
Token ID547
Feature activation-1.00e+00

Pos logit contributions
 and+3.887
,+3.791
.+3.742
 the+3.566
 a+3.559
Neg logit contributions
umm-5.224
lor-5.088
vation-5.079
osity-5.077
rol-5.021
Token very
Token ID845
Feature activation-1.00e+00

Pos logit contributions
 and+3.961
,+3.866
.+3.774
 the+3.549
 a+3.461
Neg logit contributions
umm-5.193
lor-5.049
osity-5.029
rol-5.024
arians-4.989
Token scared
Token ID12008
Feature activation-1.00e+00

Pos logit contributions
 and+4.163
,+4.030
.+3.941
 the+3.781
 a+3.715
Neg logit contributions
umm-5.192
osity-4.940
ubb-4.911
vation-4.887
lor-4.882
Token and
Token ID290
Feature activation-1.00e+00

Pos logit contributions
 and+0.829
,+0.734
.+0.655
 a+0.628
 the+0.618
Neg logit contributions
umm-8.229
osity-7.920
lor-7.853
rol-7.847
arians-7.826
Token kept
Token ID4030
Feature activation-1.00e+00

Pos logit contributions
 and+4.330
,+4.182
.+4.091
 a+3.843
 the+3.838
Neg logit contributions
umm-4.922
osity-4.770
lor-4.718
rol-4.698
ubb-4.686
Token quiet
Token ID5897
Feature activation-1.00e+00

Pos logit contributions
 and+4.409
,+4.331
.+4.188
 the+3.894
 a+3.874
Neg logit contributions
umm-4.715
osity-4.577
lor-4.571
ocol-4.556
ubb-4.535
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+2.050
,+1.939
 the+1.838
.+1.787
 a+1.767
Neg logit contributions
umm-7.210
lor-6.866
osity-6.808
vation-6.780
arians-6.779
Token apple
Token ID17180
Feature activation-1.00e+00

Pos logit contributions
 and+5.098
,+4.958
.+4.881
 the+4.855
 a+4.694
Neg logit contributions
umm-4.256
lor-4.016
ocol-3.872
osity-3.868
vation-3.855
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+2.133
,+2.038
 the+1.904
.+1.848
 a+1.781
Neg logit contributions
umm-7.113
lor-6.894
vation-6.733
osity-6.704
rol-6.699
Token The
Token ID383
Feature activation-1.00e+00

Pos logit contributions
 and+4.081
,+3.999
.+3.894
 the+3.709
 a+3.648
Neg logit contributions
umm-5.363
lor-5.144
osity-5.067
vation-4.998
arians-4.991
Token lion
Token ID18744
Feature activation-1.00e+00

Pos logit contributions
 and+5.022
,+4.910
.+4.810
 the+4.619
 a+4.523
Neg logit contributions
umm-4.433
lor-4.151
vation-4.131
arians-4.111
osity-4.091
Token was
Token ID373
Feature activation-1.00e+00

Pos logit contributions
 and+3.772
,+3.707
.+3.668
 the+3.497
 a+3.432
Neg logit contributions
umm-5.389
lor-5.289
vation-5.170
arians-5.157
osity-5.113
Token very
Token ID845
Feature activation-1.00e+00

Pos logit contributions
 and+3.465
,+3.370
.+3.312
 the+3.035
 a+2.923
Neg logit contributions
umm-5.694
lor-5.620
rol-5.561
osity-5.517
vation-5.492
Token happy
Token ID3772
Feature activation-1.00e+00

Pos logit contributions
 and+4.009
,+3.861
.+3.802
 the+3.615
 a+3.545
Neg logit contributions
umm-5.348
osity-5.102
lor-5.099
vation-5.075
ubb-5.060
Token and
Token ID290
Feature activation-1.00e+00

Pos logit contributions
 and+0.643
,+0.578
.+0.483
 the+0.443
 a+0.407
Neg logit contributions
umm-8.425
lor-8.211
osity-8.156
arians-8.064
vation-8.033
Token thanked
Token ID26280
Feature activation-1.00e+00

Pos logit contributions
 and+4.513
,+4.396
.+4.341
 the+4.081
 a+4.069
Neg logit contributions
umm-4.790
lor-4.586
vation-4.530
osity-4.530
arians-4.464
Token the
Token ID262
Feature activation-1.00e+00

Pos logit contributions
 and+2.222
,+2.112
.+1.979
 a+1.727
 the+1.692
Neg logit contributions
umm-7.080
lor-6.916
arians-6.815
ocol-6.796
vation-6.779
Token monkey
Token ID21657
Feature activation-1.00e+00

Pos logit contributions
 and+4.871
,+4.764
.+4.633
 the+4.470
 a+4.365
Neg logit contributions
umm-4.597
lor-4.326
vation-4.267
osity-4.246
arians-4.243
Token the
Token ID262
Feature activation-1.00e+00

Pos logit contributions
 and+1.988
,+1.843
.+1.794
 the+1.445
 a+1.383
Neg logit contributions
umm-7.253
lor-7.065
osity-6.996
vation-6.969
arians-6.960
Token toy
Token ID13373
Feature activation-1.00e+00

Pos logit contributions
 and+4.496
,+4.469
.+4.329
 the+4.163
 a+4.047
Neg logit contributions
umm-4.868
lor-4.576
arians-4.525
vation-4.516
osity-4.508
Token fell
Token ID3214
Feature activation-1.00e+00

Pos logit contributions
 and+3.946
,+3.850
.+3.768
 the+3.636
 a+3.549
Neg logit contributions
umm-5.336
lor-5.208
vation-5.076
arians-5.066
osity-5.042
Token from
Token ID422
Feature activation-1.00e+00

Pos logit contributions
 and+2.871
,+2.817
.+2.690
 the+2.629
 a+2.524
Neg logit contributions
umm-6.251
lor-6.055
osity-6.051
rol-6.040
plets-5.936
Token the
Token ID262
Feature activation-1.00e+00

Pos logit contributions
 and+2.896
,+2.820
.+2.657
 the+2.389
 a+2.338
Neg logit contributions
umm-6.298
ocol-6.074
 former-6.044
osity-5.964
ubb-5.963
Token table
Token ID3084
Feature activation-1.00e+00

Pos logit contributions
 and+4.946
,+4.841
.+4.704
 the+4.560
 a+4.472
Neg logit contributions
umm-4.521
lor-4.244
vation-4.139
osity-4.125
arians-4.089
Token and
Token ID290
Feature activation-1.00e+00

Pos logit contributions
 and+1.419
,+1.340
 the+1.207
.+1.175
 a+1.106
Neg logit contributions
umm-7.744
lor-7.604
vation-7.463
rol-7.430
osity-7.405
Token broke
Token ID6265
Feature activation-1.00e+00

Pos logit contributions
 and+4.164
,+4.034
.+3.937
 the+3.665
 a+3.644
Neg logit contributions
umm-5.143
lor-4.952
osity-4.901
vation-4.882
rol-4.865
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+2.141
,+2.069
.+1.779
 the+1.698
 a+1.646
Neg logit contributions
umm-7.082
lor-6.818
rol-6.811
osity-6.746
plets-6.727
Token The
Token ID383
Feature activation-1.00e+00

Pos logit contributions
 and+3.489
,+3.375
.+3.256
 the+3.053
 a+2.997
Neg logit contributions
umm-5.944
lor-5.728
osity-5.617
vation-5.591
rol-5.554
Token woman
Token ID2415
Feature activation-1.00e+00

Pos logit contributions
 and+3.923
,+3.892
.+3.754
 the+3.591
 a+3.479
Neg logit contributions
umm-5.453
lor-5.142
vation-5.124
arians-5.092
osity-5.087
Token broke
Token ID6265
Feature activation-1.00e+00

Pos logit contributions
 and+4.164
,+4.034
.+3.937
 the+3.665
 a+3.644
Neg logit contributions
umm-5.143
lor-4.952
osity-4.901
vation-4.882
rol-4.865
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+2.141
,+2.069
.+1.779
 the+1.698
 a+1.646
Neg logit contributions
umm-7.082
lor-6.818
rol-6.811
osity-6.746
plets-6.727
Token The
Token ID383
Feature activation-1.00e+00

Pos logit contributions
 and+3.489
,+3.375
.+3.256
 the+3.053
 a+2.997
Neg logit contributions
umm-5.944
lor-5.728
osity-5.617
vation-5.591
rol-5.554
Token woman
Token ID2415
Feature activation-1.00e+00

Pos logit contributions
 and+3.923
,+3.892
.+3.754
 the+3.591
 a+3.479
Neg logit contributions
umm-5.453
lor-5.142
vation-5.124
arians-5.092
osity-5.087
Token was
Token ID373
Feature activation-1.00e+00

Pos logit contributions
 and+2.888
,+2.824
.+2.738
 the+2.589
 a+2.540
Neg logit contributions
umm-6.238
lor-6.114
vation-6.046
rol-5.995
osity-5.994
Token very
Token ID845
Feature activation-1.00e+00

Pos logit contributions
 and+3.187
,+3.087
.+2.995
 the+2.764
 a+2.644
Neg logit contributions
umm-6.046
lor-5.890
rol-5.829
osity-5.810
vation-5.783
Token sad
Token ID6507
Feature activation-1.00e+00

Pos logit contributions
 and+3.814
,+3.629
.+3.556
 the+3.391
 a+3.324
Neg logit contributions
umm-5.533
ubb-5.304
osity-5.292
vation-5.272
ocol-5.267
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.354
,+0.318
 the+0.280
 a+0.165
.+0.108
Neg logit contributions
umm-8.571
lor-8.264
osity-8.256
arians-8.165
 former-8.147
Token (newline)
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+3.254
,+3.137
.+3.020
 the+2.856
 a+2.818
Neg logit contributions
umm-6.141
lor-5.907
osity-5.818
vation-5.750
rol-5.740
Token (newline)
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+1.635
,+1.525
.+1.406
 the+1.253
 a+1.190
Neg logit contributions
umm-7.695
lor-7.582
vation-7.458
osity-7.421
rol-7.376
TokenShe
Token ID3347
Feature activation-1.00e+00

Pos logit contributions
 and+4.302
,+4.188
.+4.085
 the+3.930
 a+3.860
Neg logit contributions
umm-5.149
lor-4.960
vation-4.861
osity-4.852
arians-4.795
Token Ben
Token ID3932
Feature activation-1.00e+00

Pos logit contributions
 and+4.399
,+4.337
.+4.238
 the+4.005
 a+3.895
Neg logit contributions
umm-4.792
lor-4.564
osity-4.556
vation-4.554
arians-4.502
Token,
Token ID11
Feature activation-1.00e+00

Pos logit contributions
 and+3.455
,+3.382
.+3.241
 the+3.233
 a+3.132
Neg logit contributions
umm-5.595
lor-5.533
vation-5.460
rol-5.396
arians-5.372
Token but
Token ID475
Feature activation-1.00e+00

Pos logit contributions
 and+3.299
,+3.196
.+3.104
 the+2.939
 a+2.878
Neg logit contributions
umm-5.826
osity-5.665
lor-5.655
vation-5.584
rol-5.573
Token you
Token ID345
Feature activation-1.00e+00

Pos logit contributions
 and+2.923
,+2.735
.+2.732
 a+2.455
 the+2.453
Neg logit contributions
umm-6.308
lor-6.125
osity-6.106
ubb-6.011
vation-6.002
Token are
Token ID389
Feature activation-1.00e+00

Pos logit contributions
 and+4.192
,+4.085
.+4.017
 the+3.860
 a+3.791
Neg logit contributions
umm-5.047
lor-4.941
vation-4.857
osity-4.823
arians-4.814
Token very
Token ID845
Feature activation-1.00e+00

Pos logit contributions
 and+3.728
,+3.634
.+3.504
 the+3.284
 a+3.182
Neg logit contributions
umm-5.524
lor-5.331
osity-5.304
rol-5.283
vation-5.283
Token funny
Token ID8258
Feature activation-1.00e+00

Pos logit contributions
 and+4.154
,+3.996
.+3.879
 the+3.736
 a+3.653
Neg logit contributions
umm-5.265
lor-5.008
vation-4.977
ubb-4.977
osity-4.954
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+1.797
,+1.718
 the+1.613
.+1.556
 a+1.489
Neg logit contributions
umm-7.373
lor-7.168
osity-7.042
rol-7.026
arians-7.025
Token Can
Token ID1680
Feature activation-1.00e+00

Pos logit contributions
 and+5.128
,+5.013
.+4.913
 the+4.761
 a+4.685
Neg logit contributions
umm-4.285
lor-4.076
osity-3.983
vation-3.963
arians-3.914
Token you
Token ID345
Feature activation-1.00e+00

Pos logit contributions
 and+3.721
,+3.569
.+3.499
 the+3.315
 a+3.268
Neg logit contributions
umm-5.654
lor-5.476
osity-5.382
vation-5.374
arians-5.340
Token say
Token ID910
Feature activation-1.00e+00

Pos logit contributions
 and+4.085
,+3.941
.+3.871
 the+3.773
 a+3.671
Neg logit contributions
umm-5.152
lor-4.997
vation-4.947
osity-4.903
arians-4.882
Token smile
Token ID8212
Feature activation-1.00e+00

Pos logit contributions
 and+3.032
,+2.930
 the+2.789
.+2.778
 a+2.682
Neg logit contributions
umm-6.127
lor-6.038
osity-5.893
vation-5.890
arians-5.875
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+1.558
,+1.439
 the+1.320
.+1.262
 a+1.216
Neg logit contributions
umm-7.633
lor-7.507
osity-7.395
vation-7.344
rol-7.301
Token The
Token ID383
Feature activation-1.00e+00

Pos logit contributions
 and+3.720
,+3.631
.+3.506
 the+3.338
 a+3.296
Neg logit contributions
umm-5.689
lor-5.518
osity-5.414
vation-5.353
rol-5.340
Token animals
Token ID4695
Feature activation-1.00e+00

Pos logit contributions
 and+5.045
,+4.939
.+4.852
 the+4.656
 a+4.560
Neg logit contributions
umm-4.408
lor-4.156
vation-4.111
arians-4.072
osity-4.066
Token were
Token ID547
Feature activation-1.00e+00

Pos logit contributions
 and+4.370
,+4.280
.+4.241
 the+4.032
 a+4.023
Neg logit contributions
umm-4.810
lor-4.662
vation-4.622
osity-4.600
rol-4.560
Token very
Token ID845
Feature activation-1.00e+00

Pos logit contributions
 and+3.184
,+3.057
.+3.009
 the+2.742
 a+2.678
Neg logit contributions
umm-6.070
lor-5.939
rol-5.866
osity-5.822
vation-5.811
Token happy
Token ID3772
Feature activation-1.00e+00

Pos logit contributions
 and+3.596
,+3.456
.+3.391
 the+3.210
 a+3.148
Neg logit contributions
umm-5.790
lor-5.545
osity-5.523
vation-5.506
ubb-5.458
Token to
Token ID284
Feature activation-1.00e+00

Pos logit contributions
 and+0.929
,+0.864
.+0.761
 the+0.706
 a+0.663
Neg logit contributions
umm-8.200
lor-7.950
osity-7.921
arians-7.812
vation-7.811
Token have
Token ID423
Feature activation-1.00e+00

Pos logit contributions
 and+3.832
,+3.691
.+3.599
 the+3.446
 a+3.384
Neg logit contributions
umm-5.531
lor-5.350
vation-5.285
osity-5.243
arians-5.189
Token made
Token ID925
Feature activation-1.00e+00

Pos logit contributions
 and+4.143
,+4.005
.+3.863
 the+3.631
 a+3.525
Neg logit contributions
umm-5.101
lor-5.054
vation-4.889
arians-4.806
rol-4.784
Token the
Token ID262
Feature activation-1.00e+00

Pos logit contributions
 and+2.774
,+2.715
.+2.557
 the+2.326
 a+2.241
Neg logit contributions
umm-6.438
lor-6.257
vation-6.166
rol-6.081
osity-6.036
Token they
Token ID484
Feature activation-1.00e+00

Pos logit contributions
 and+5.379
,+5.300
.+5.208
 a+4.982
 the+4.979
Neg logit contributions
umm-3.893
lor-3.672
vation-3.652
osity-3.637
arians-3.599
Token all
Token ID477
Feature activation-1.00e+00

Pos logit contributions
 and+3.906
,+3.795
.+3.692
 the+3.536
 a+3.477
Neg logit contributions
umm-5.451
lor-5.254
vation-5.170
osity-5.166
arians-5.110
Token have
Token ID423
Feature activation-1.00e+00

Pos logit contributions
 and+4.019
,+3.905
.+3.804
 the+3.652
 a+3.584
Neg logit contributions
umm-5.412
lor-5.207
osity-5.116
vation-5.108
arians-5.042
Token fun
Token ID1257
Feature activation-1.00e+00

Pos logit contributions
 and+3.338
,+3.242
.+3.105
 the+2.880
 a+2.729
Neg logit contributions
umm-5.837
lor-5.747
 former-5.601
rol-5.554
osity-5.541
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+2.341
,+2.220
 the+2.061
.+2.059
 a+1.980
Neg logit contributions
umm-6.911
lor-6.661
osity-6.624
arians-6.603
ocol-6.595
Token<|endoftext|>
Token ID50256
Feature activation-1.00e+00

Pos logit contributions
 and+2.959
,+2.854
.+2.750
 the+2.586
 a+2.515
Neg logit contributions
umm-6.486
lor-6.272
osity-6.183
vation-6.160
arians-6.100
TokenAnna
Token ID31160
Feature activation-1.00e+00

Pos logit contributions
 and+3.182
,+3.068
.+2.945
 the+2.809
 a+2.740
Neg logit contributions
umm-6.143
lor-6.001
vation-5.847
osity-5.809
rol-5.795
Token is
Token ID318
Feature activation-1.00e+00

Pos logit contributions
 and+3.492
,+3.393
.+3.324
 the+3.174
 a+3.120
Neg logit contributions
umm-5.764
lor-5.598
vation-5.479
osity-5.466
arians-5.415
Token a
Token ID257
Feature activation-1.00e+00

Pos logit contributions
 and+3.236
,+3.121
.+3.046
 the+2.855
 a+2.752
Neg logit contributions
umm-6.128
lor-5.899
osity-5.851
vation-5.811
rol-5.794
Token big
Token ID1263
Feature activation-1.00e+00

Pos logit contributions
 and+3.237
,+3.118
.+3.029
 the+2.876
 a+2.800
Neg logit contributions
umm-6.143
lor-5.908
osity-5.828
vation-5.814
arians-5.760
Token girl
Token ID2576
Feature activation-1.00e+00

Pos logit contributions
 and+4.014
,+3.896
.+3.842
 the+3.680
 a+3.625
Neg logit contributions
umm-5.325
lor-5.036
osity-4.981
vation-4.947
arians-4.924
Token tree
Token ID5509
Feature activation-1.00e+00

Pos logit contributions
 and+3.419
,+3.320
.+3.213
 the+3.097
 a+3.047
Neg logit contributions
umm-5.959
lor-5.690
vation-5.598
osity-5.566
plets-5.548
Token with
Token ID351
Feature activation-1.00e+00

Pos logit contributions
 and+2.149
,+2.070
.+1.936
 the+1.865
 a+1.801
Neg logit contributions
umm-7.101
lor-6.886
osity-6.801
vation-6.763
rol-6.742
Token lots
Token ID6041
Feature activation-1.00e+00

Pos logit contributions
 and+3.135
,+3.058
.+2.930
 the+2.674
 a+2.540
Neg logit contributions
umm-6.285
lor-5.981
rol-5.922
osity-5.898
ocol-5.883
Token of
Token ID286
Feature activation-1.00e+00

Pos logit contributions
 and+2.598
,+2.514
.+2.397
 the+2.297
 a+2.241
Neg logit contributions
umm-6.786
lor-6.484
osity-6.425
ocol-6.394
vation-6.382
Token cher
Token ID21382
Feature activation-1.00e+00

Pos logit contributions
 and+5.383
,+5.292
.+5.174
 the+5.002
 a+4.928
Neg logit contributions
umm-4.004
lor-3.647
ocol-3.622
rol-3.610
osity-3.565
Tokenries
Token ID1678
Feature activation-1.00e+00

Pos logit contributions
 and+7.186
,+7.062
.+6.969
 the+6.831
 a+6.802
Neg logit contributions
umm-2.266
lor-2.033
ocol-1.897
osity-1.887
rol-1.886
Token on
Token ID319
Feature activation-1.00e+00

Pos logit contributions
 and+1.240
,+1.163
 the+0.984
.+0.966
 a+0.927
Neg logit contributions
umm-7.988
lor-7.687
plets-7.644
osity-7.636
rol-7.634
Token it
Token ID340
Feature activation-1.00e+00

Pos logit contributions
 and+3.213
,+3.114
.+2.951
 the+2.776
 a+2.741
Neg logit contributions
umm-6.095
lor-5.891
vation-5.714
arians-5.698
osity-5.685
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.434
,+0.332
 the+0.162
 a+0.093
.+0.032
Neg logit contributions
umm-8.677
lor-8.518
osity-8.386
vation-8.359
rol-8.359
Token She
Token ID1375
Feature activation-1.00e+00

Pos logit contributions
 and+3.224
,+3.115
.+3.019
 the+2.854
 a+2.803
Neg logit contributions
umm-6.178
lor-5.960
osity-5.860
vation-5.821
rol-5.781
Token wanted
Token ID2227
Feature activation-1.00e+00

Pos logit contributions
 and+3.261
,+3.176
.+3.104
 the+2.916
 a+2.845
Neg logit contributions
umm-6.035
lor-5.843
osity-5.757
arians-5.721
vation-5.715
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+3.361
,+3.243
 the+3.108
.+3.105
 a+3.007
Neg logit contributions
umm-5.876
lor-5.704
arians-5.595
osity-5.575
vation-5.557
Token She
Token ID1375
Feature activation-1.00e+00

Pos logit contributions
 and+3.578
,+3.465
.+3.355
 the+3.197
 a+3.137
Neg logit contributions
umm-5.860
lor-5.622
osity-5.553
vation-5.491
rol-5.465
Token liked
Token ID8288
Feature activation-1.00e+00

Pos logit contributions
 and+3.002
,+2.953
.+2.819
 the+2.707
 a+2.601
Neg logit contributions
umm-6.196
lor-6.071
vation-5.960
osity-5.935
arians-5.925
Token the
Token ID262
Feature activation-1.00e+00

Pos logit contributions
 and+2.316
,+2.261
.+2.153
 the+1.860
 a+1.844
Neg logit contributions
umm-6.960
osity-6.719
ocol-6.661
lor-6.636
ubb-6.589
Token toy
Token ID13373
Feature activation-1.00e+00

Pos logit contributions
 and+4.942
,+4.858
.+4.733
 the+4.599
 a+4.465
Neg logit contributions
umm-4.504
lor-4.166
 former-4.097
osity-4.094
vation-4.079
Token very
Token ID845
Feature activation-1.00e+00

Pos logit contributions
 and+3.465
,+3.363
.+3.245
 the+3.200
 a+3.119
Neg logit contributions
umm-5.719
lor-5.496
arians-5.466
osity-5.408
vation-5.385
Token much
Token ID881
Feature activation-1.00e+00

Pos logit contributions
 and+3.373
,+3.232
.+3.093
 the+3.026
 a+2.911
Neg logit contributions
umm-5.999
lor-5.782
vation-5.699
osity-5.644
rol-5.638
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.401
,+0.311
 the+0.186
 a+0.149
.+0.142
Neg logit contributions
umm-8.705
lor-8.405
arians-8.350
vation-8.310
osity-8.284
Token It
Token ID632
Feature activation-1.00e+00

Pos logit contributions
 and+3.804
,+3.707
.+3.583
 the+3.426
 a+3.362
Neg logit contributions
umm-5.647
lor-5.373
osity-5.305
vation-5.243
ocol-5.232
Token was
Token ID373
Feature activation-1.00e+00

Pos logit contributions
 and+3.277
,+3.184
.+3.062
 the+2.913
 a+2.840
Neg logit contributions
umm-6.024
lor-5.848
vation-5.744
osity-5.741
rol-5.730
Token a
Token ID257
Feature activation-1.00e+00

Pos logit contributions
 and+3.053
,+2.934
.+2.846
 the+2.591
 a+2.477
Neg logit contributions
umm-6.152
osity-5.940
ocol-5.923
lor-5.913
 former-5.904
Token\n
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+3.254
,+3.117
.+3.008
 the+2.863
 a+2.777
Neg logit contributions
umm-6.130
lor-5.949
osity-5.867
arians-5.788
vation-5.773
Token\n
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+1.796
,+1.682
.+1.530
 the+1.393
 a+1.319
Neg logit contributions
umm-7.416
lor-7.401
vation-7.249
osity-7.231
reetings-7.201
TokenThe
Token ID464
Feature activation-1.00e+00

Pos logit contributions
 and+4.511
,+4.389
.+4.285
 the+4.139
 a+4.063
Neg logit contributions
umm-4.861
lor-4.742
vation-4.640
osity-4.614
arians-4.566
Token end
Token ID886
Feature activation-1.00e+00

Pos logit contributions
 and+4.613
,+4.505
.+4.385
 the+4.242
 a+4.142
Neg logit contributions
umm-4.812
lor-4.545
vation-4.503
osity-4.476
arians-4.447
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.529
,+0.379
.+0.213
 the+0.186
 a+0.110
Neg logit contributions
umm-8.829
lor-8.617
vation-8.528
osity-8.510
arians-8.441
Token<|endoftext|>
Token ID50256
Feature activation-1.00e+00

Pos logit contributions
 and+2.935
,+2.823
.+2.716
 the+2.560
 a+2.489
Neg logit contributions
umm-6.492
lor-6.307
osity-6.203
vation-6.179
arians-6.120
TokenA
Token ID32
Feature activation-1.00e+00

Pos logit contributions
 and+4.334
,+4.219
.+4.093
 the+3.963
 a+3.894
Neg logit contributions
umm-5.023
lor-4.873
vation-4.734
osity-4.686
rol-4.670
Token woman
Token ID2415
Feature activation-1.00e+00

Pos logit contributions
 and+4.145
,+4.014
.+3.928
 the+3.793
 a+3.723
Neg logit contributions
umm-5.292
lor-5.074
vation-4.982
osity-4.977
arians-4.895
Token had
Token ID550
Feature activation-1.00e+00

Pos logit contributions
 and+3.176
,+3.108
.+3.040
 the+2.906
 a+2.863
Neg logit contributions
umm-5.952
lor-5.846
vation-5.738
osity-5.709
arians-5.676
Token a
Token ID257
Feature activation-1.00e+00

Pos logit contributions
 and+3.198
,+3.112
.+2.998
 the+2.814
 a+2.637
Neg logit contributions
umm-6.020
lor-6.012
rol-5.849
ocol-5.804
vation-5.789
Token broken
Token ID5445
Feature activation-1.00e+00

Pos logit contributions
 and+4.587
,+4.422
.+4.394
 the+4.251
 a+4.117
Neg logit contributions
umm-4.798
lor-4.515
osity-4.443
vation-4.440
 former-4.424
Token every
Token ID790
Feature activation-1.00e+00

Pos logit contributions
 and+3.002
,+2.876
 the+2.759
.+2.745
 a+2.638
Neg logit contributions
umm-6.049
lor-6.041
osity-5.895
rol-5.887
vation-5.881
Token night
Token ID1755
Feature activation-1.00e+00

Pos logit contributions
 and+3.908
,+3.785
.+3.674
 the+3.552
 a+3.457
Neg logit contributions
umm-5.460
lor-5.275
vation-5.199
osity-5.106
rol-5.073
Token!"
Token ID2474
Feature activation-1.00e+00

Pos logit contributions
 and+2.517
,+2.385
 the+2.250
.+2.219
 a+2.189
Neg logit contributions
umm-6.761
lor-6.552
vation-6.500
arians-6.378
osity-6.369
Token Tom
Token ID4186
Feature activation-1.00e+00

Pos logit contributions
 and+2.441
,+2.369
.+2.252
 the+2.106
 a+2.060
Neg logit contributions
umm-6.812
lor-6.633
vation-6.559
osity-6.495
rol-6.445
Token was
Token ID373
Feature activation-1.00e+00

Pos logit contributions
 and+3.491
,+3.422
.+3.374
 the+3.237
 a+3.135
Neg logit contributions
umm-5.604
lor-5.530
vation-5.406
arians-5.379
osity-5.365
Token very
Token ID845
Feature activation-1.00e+00

Pos logit contributions
 and+3.748
,+3.648
.+3.531
 the+3.327
 a+3.212
Neg logit contributions
umm-5.413
lor-5.320
rol-5.221
vation-5.206
osity-5.200
Token scared
Token ID12008
Feature activation-1.00e+00

Pos logit contributions
 and+3.776
,+3.617
.+3.541
 the+3.397
 a+3.322
Neg logit contributions
umm-5.513
lor-5.311
osity-5.289
ubb-5.287
vation-5.278
Token and
Token ID290
Feature activation-1.00e+00

Pos logit contributions
 and+0.291
,+0.222
 the+0.126
 a+0.120
.+0.078
Neg logit contributions
umm-8.675
lor-8.340
osity-8.330
ocol-8.294
ubb-8.284
Token cried
Token ID16896
Feature activation-1.00e+00

Pos logit contributions
 and+4.067
,+3.939
.+3.834
 the+3.625
 a+3.587
Neg logit contributions
umm-5.143
lor-5.013
osity-4.970
vation-4.953
rol-4.914
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.660
,+0.507
 the+0.389
.+0.331
 a+0.228
Neg logit contributions
umm-8.411
osity-8.140
ubb-8.118
lor-8.070
rol-8.039
Token From
Token ID3574
Feature activation-1.00e+00

Pos logit contributions
 and+3.192
,+3.077
.+2.968
 the+2.806
 a+2.741
Neg logit contributions
umm-6.239
lor-6.030
osity-5.937
vation-5.913
arians-5.869
Token night
Token ID1755
Feature activation-1.00e+00

Pos logit contributions
 and+4.956
,+4.843
.+4.763
 the+4.717
 a+4.554
Neg logit contributions
umm-4.393
lor-4.134
osity-4.053
vation-4.041
 former-4.005
Token's
Token ID338
Feature activation-1.00e+00

Pos logit contributions
 and+3.213
,+3.100
.+2.933
 the+2.926
 a+2.864
Neg logit contributions
umm-6.141
lor-5.889
vation-5.851
osity-5.724
plets-5.693
Token sleep
Token ID3993
Feature activation-1.00e+00

Pos logit contributions
 and+5.503
,+5.396
.+5.228
 the+5.204
 a+5.065
Neg logit contributions
umm-3.825
lor-3.576
 former-3.567
osity-3.528
vation-3.506
Token again
Token ID757
Feature activation-1.00e+00

Pos logit contributions
 and+3.354
,+3.245
 the+3.064
.+3.041
 a+2.980
Neg logit contributions
umm-5.952
lor-5.843
vation-5.702
arians-5.614
osity-5.555
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.516
,+0.404
 the+0.238
 a+0.212
.+0.086
Neg logit contributions
umm-8.552
lor-8.454
 former-8.319
vation-8.303
rim-8.219
Token<|endoftext|>
Token ID50256
Feature activation-1.00e+00

Pos logit contributions
 and+1.638
,+1.523
.+1.425
 the+1.277
 a+1.205
Neg logit contributions
umm-7.784
lor-7.579
osity-7.482
vation-7.474
arians-7.417
TokenOnce
Token ID7454
Feature activation-1.00e+00

Pos logit contributions
 and+2.072
,+1.971
.+1.834
 the+1.700
 a+1.633
Neg logit contributions
umm-7.240
lor-7.114
vation-6.966
osity-6.908
rol-6.905
Token upon
Token ID2402
Feature activation-1.00e+00

Pos logit contributions
 and+4.143
,+3.988
.+3.913
 the+3.762
 a+3.735
Neg logit contributions
umm-5.123
lor-4.966
osity-4.894
vation-4.841
rol-4.804
Token a
Token ID257
Feature activation-1.00e+00

Pos logit contributions
 and+0.605
,+0.574
.+0.378
 the+0.217
 a+0.027
Neg logit contributions
umm-8.506
lor-8.451
vation-8.307
osity-8.245
rol-8.220
Token time
Token ID640
Feature activation-1.00e+00

Pos logit contributions
 and+2.984
,+2.873
.+2.779
 the+2.683
 a+2.573
Neg logit contributions
umm-6.339
lor-6.160
vation-6.041
osity-5.991
rol-5.945
Token,
Token ID11
Feature activation-1.00e+00

Pos logit contributions
 and+0.265
.+0.056
,+0.020
 the-0.088
 a-0.155
Neg logit contributions
umm-8.971
lor-8.833
osity-8.735
vation-8.669
rol-8.663
Token your
Token ID534
Feature activation-1.00e+00

Pos logit contributions
 and+2.770
,+2.589
.+2.498
 a+2.208
 the+2.148
Neg logit contributions
umm-6.339
lor-6.263
ocol-6.239
arians-6.234
vation-6.129
Token bike
Token ID7161
Feature activation-1.00e+00

Pos logit contributions
 and+5.288
,+5.161
.+5.006
 the+4.929
 a+4.879
Neg logit contributions
umm-4.022
 former-3.762
lor-3.714
osity-3.691
arians-3.680
Token."
Token ID526
Feature activation-1.00e+00

Pos logit contributions
 and+2.872
,+2.704
 the+2.609
.+2.581
 a+2.531
Neg logit contributions
umm-6.287
lor-6.146
arians-6.067
vation-6.035
osity-5.947
Token The
Token ID383
Feature activation-1.00e+00

Pos logit contributions
 and+3.217
,+3.110
.+3.005
 the+2.845
 a+2.781
Neg logit contributions
umm-6.173
lor-5.944
vation-5.871
osity-5.860
arians-5.812
Token unknown
Token ID6439
Feature activation-1.00e+00

Pos logit contributions
 and+3.934
,+3.810
.+3.704
 the+3.536
 a+3.465
Neg logit contributions
umm-5.506
lor-5.296
osity-5.210
vation-5.204
arians-5.134
Token person
Token ID1048
Feature activation-1.00e+00

Pos logit contributions
 and+4.103
,+3.963
.+3.875
 the+3.744
 a+3.685
Neg logit contributions
umm-5.248
lor-5.030
osity-4.969
vation-4.924
arians-4.861
Token was
Token ID373
Feature activation-1.00e+00

Pos logit contributions
 and+3.406
,+3.297
.+3.215
 the+3.075
 a+3.012
Neg logit contributions
umm-5.874
lor-5.688
vation-5.613
osity-5.612
arians-5.574
Token happy
Token ID3772
Feature activation-1.00e+00

Pos logit contributions
 and+3.133
,+3.004
.+2.933
 the+2.688
 a+2.565
Neg logit contributions
umm-6.050
lor-5.911
osity-5.873
rol-5.814
vation-5.813
Token and
Token ID290
Feature activation-1.00e+00

Pos logit contributions
 and+0.918
,+0.847
.+0.718
 the+0.656
 a+0.607
Neg logit contributions
umm-8.201
lor-7.983
osity-7.939
arians-7.841
vation-7.825
Token said
Token ID531
Feature activation-1.00e+00

Pos logit contributions
 and+3.620
,+3.505
.+3.445
 the+3.205
 a+3.173
Neg logit contributions
umm-5.701
osity-5.455
lor-5.448
vation-5.412
arians-5.390
Token,
Token ID11
Feature activation-1.00e+00

Pos logit contributions
 and+1.399
,+1.150
.+1.140
 the+0.976
 a+0.911
Neg logit contributions
umm-7.946
lor-7.766
osity-7.695
vation-7.626
arians-7.587
Token,
Token ID11
Feature activation-1.00e+00

Pos logit contributions
 and+1.798
,+1.551
.+1.548
 the+1.384
 a+1.310
Neg logit contributions
umm-7.516
lor-7.365
osity-7.302
vation-7.206
arians-7.183
Token "
Token ID366
Feature activation-1.00e+00

Pos logit contributions
 and+2.615
,+2.487
.+2.419
 the+2.286
 a+2.203
Neg logit contributions
umm-6.517
osity-6.404
lor-6.369
arians-6.298
plets-6.220
TokenI
Token ID40
Feature activation-1.00e+00

Pos logit contributions
 and+5.969
,+5.885
.+5.766
 the+5.600
 a+5.531
Neg logit contributions
umm-3.516
lor-3.316
vation-3.220
osity-3.181
arians-3.144
Token can
Token ID460
Feature activation-1.00e+00

Pos logit contributions
 and+4.364
,+4.240
.+4.158
 the+4.029
 a+3.930
Neg logit contributions
umm-5.019
lor-4.831
osity-4.734
vation-4.729
arians-4.659
Token help
Token ID1037
Feature activation-1.00e+00

Pos logit contributions
 and+4.348
,+4.216
.+4.129
 the+3.995
 a+3.914
Neg logit contributions
umm-4.985
lor-4.808
vation-4.746
osity-4.710
arians-4.702
Token you
Token ID345
Feature activation-1.00e+00

Pos logit contributions
 and+3.148
,+2.954
.+2.847
 a+2.663
 the+2.644
Neg logit contributions
umm-5.972
arians-5.900
vation-5.834
lor-5.823
ocol-5.738
Token fix
Token ID4259
Feature activation-1.00e+00

Pos logit contributions
 and+4.055
,+3.887
.+3.782
 the+3.764
 a+3.675
Neg logit contributions
umm-5.126
lor-5.038
osity-4.922
vation-4.922
arians-4.899
Token your
Token ID534
Feature activation-1.00e+00

Pos logit contributions
 and+2.770
,+2.589
.+2.498
 a+2.208
 the+2.148
Neg logit contributions
umm-6.339
lor-6.263
ocol-6.239
arians-6.234
vation-6.129
Token bike
Token ID7161
Feature activation-1.00e+00

Pos logit contributions
 and+5.288
,+5.161
.+5.006
 the+4.929
 a+4.879
Neg logit contributions
umm-4.022
 former-3.762
lor-3.714
osity-3.691
arians-3.680
Token."
Token ID526
Feature activation-1.00e+00

Pos logit contributions
 and+2.872
,+2.704
 the+2.609
.+2.581
 a+2.531
Neg logit contributions
umm-6.287
lor-6.146
arians-6.067
vation-6.035
osity-5.947
Token The
Token ID383
Feature activation-1.00e+00

Pos logit contributions
 and+3.217
,+3.110
.+3.005
 the+2.845
 a+2.781
Neg logit contributions
umm-6.173
lor-5.944
vation-5.871
osity-5.860
arians-5.812
Token He
Token ID679
Feature activation-1.00e+00

Pos logit contributions
 and+3.543
,+3.435
.+3.339
 the+3.176
 a+3.122
Neg logit contributions
umm-5.858
lor-5.553
osity-5.527
arians-5.479
vation-5.470
Token went
Token ID1816
Feature activation-1.00e+00

Pos logit contributions
 and+3.351
,+3.276
.+3.204
 the+3.016
 a+2.930
Neg logit contributions
umm-5.887
lor-5.765
arians-5.652
vation-5.627
osity-5.614
Token to
Token ID284
Feature activation-1.00e+00

Pos logit contributions
 and+1.751
,+1.662
.+1.587
 the+1.408
 a+1.329
Neg logit contributions
umm-7.467
lor-7.325
osity-7.259
vation-7.257
arians-7.195
Token the
Token ID262
Feature activation-1.00e+00

Pos logit contributions
 and+2.098
,+2.004
.+1.873
 the+1.525
 a+1.524
Neg logit contributions
umm-7.074
vation-6.942
lor-6.895
osity-6.872
arians-6.854
Token unknown
Token ID6439
Feature activation-1.00e+00

Pos logit contributions
 and+4.707
,+4.576
.+4.458
 the+4.245
 a+4.167
Neg logit contributions
umm-4.794
lor-4.552
osity-4.467
vation-4.442
arians-4.402
Token person
Token ID1048
Feature activation-1.00e+00

Pos logit contributions
 and+4.155
,+4.032
.+3.925
 the+3.844
 a+3.799
Neg logit contributions
umm-5.145
lor-4.904
osity-4.803
vation-4.773
arians-4.766
Token and
Token ID290
Feature activation-1.00e+00

Pos logit contributions
 and+1.918
,+1.835
.+1.755
 the+1.681
 a+1.661
Neg logit contributions
umm-6.944
lor-6.917
osity-6.856
arians-6.841
vation-6.806
Token said
Token ID531
Feature activation-1.00e+00

Pos logit contributions
 and+3.348
,+3.231
.+3.178
 a+2.842
 the+2.841
Neg logit contributions
umm-5.825
vation-5.665
lor-5.665
arians-5.646
osity-5.645
Token,
Token ID11
Feature activation-1.00e+00

Pos logit contributions
 and+1.798
,+1.551
.+1.548
 the+1.384
 a+1.310
Neg logit contributions
umm-7.516
lor-7.365
osity-7.302
vation-7.206
arians-7.183
Token "
Token ID366
Feature activation-1.00e+00

Pos logit contributions
 and+2.615
,+2.487
.+2.419
 the+2.286
 a+2.203
Neg logit contributions
umm-6.517
osity-6.404
lor-6.369
arians-6.298
plets-6.220
TokenI
Token ID40
Feature activation-1.00e+00

Pos logit contributions
 and+5.969
,+5.885
.+5.766
 the+5.600
 a+5.531
Neg logit contributions
umm-3.516
lor-3.316
vation-3.220
osity-3.181
arians-3.144
Token person
Token ID1048
Feature activation-1.00e+00

Pos logit contributions
 and+4.155
,+4.032
.+3.925
 the+3.844
 a+3.799
Neg logit contributions
umm-5.145
lor-4.904
osity-4.803
vation-4.773
arians-4.766
Token and
Token ID290
Feature activation-1.00e+00

Pos logit contributions
 and+1.918
,+1.835
.+1.755
 the+1.681
 a+1.661
Neg logit contributions
umm-6.944
lor-6.917
osity-6.856
arians-6.841
vation-6.806
Token said
Token ID531
Feature activation-1.00e+00

Pos logit contributions
 and+3.348
,+3.231
.+3.178
 a+2.842
 the+2.841
Neg logit contributions
umm-5.825
vation-5.665
lor-5.665
arians-5.646
osity-5.645
Token,
Token ID11
Feature activation-1.00e+00

Pos logit contributions
 and+1.798
,+1.551
.+1.548
 the+1.384
 a+1.310
Neg logit contributions
umm-7.516
lor-7.365
osity-7.302
vation-7.206
arians-7.183
Token "
Token ID366
Feature activation-1.00e+00

Pos logit contributions
 and+2.615
,+2.487
.+2.419
 the+2.286
 a+2.203
Neg logit contributions
umm-6.517
osity-6.404
lor-6.369
arians-6.298
plets-6.220
TokenI
Token ID40
Feature activation-1.00e+00

Pos logit contributions
 and+5.969
,+5.885
.+5.766
 the+5.600
 a+5.531
Neg logit contributions
umm-3.516
lor-3.316
vation-3.220
osity-3.181
arians-3.144
Token can
Token ID460
Feature activation-1.00e+00

Pos logit contributions
 and+4.364
,+4.240
.+4.158
 the+4.029
 a+3.930
Neg logit contributions
umm-5.019
lor-4.831
osity-4.734
vation-4.729
arians-4.659
Token help
Token ID1037
Feature activation-1.00e+00

Pos logit contributions
 and+4.348
,+4.216
.+4.129
 the+3.995
 a+3.914
Neg logit contributions
umm-4.985
lor-4.808
vation-4.746
osity-4.710
arians-4.702
Token you
Token ID345
Feature activation-1.00e+00

Pos logit contributions
 and+3.148
,+2.954
.+2.847
 a+2.663
 the+2.644
Neg logit contributions
umm-5.972
arians-5.900
vation-5.834
lor-5.823
ocol-5.738
Token fix
Token ID4259
Feature activation-1.00e+00

Pos logit contributions
 and+4.055
,+3.887
.+3.782
 the+3.764
 a+3.675
Neg logit contributions
umm-5.126
lor-5.038
osity-4.922
vation-4.922
arians-4.899
Token your
Token ID534
Feature activation-1.00e+00

Pos logit contributions
 and+2.770
,+2.589
.+2.498
 a+2.208
 the+2.148
Neg logit contributions
umm-6.339
lor-6.263
ocol-6.239
arians-6.234
vation-6.129
Token,
Token ID11
Feature activation-1.00e+00

Pos logit contributions
 and+1.183
,+0.937
.+0.930
 the+0.777
 a+0.720
Neg logit contributions
umm-8.092
lor-7.987
osity-7.906
ocol-7.810
ubb-7.787
Token "
Token ID366
Feature activation-1.00e+00

Pos logit contributions
 and+2.584
,+2.479
.+2.397
 the+2.246
 a+2.208
Neg logit contributions
umm-6.535
lor-6.435
osity-6.434
arians-6.291
plets-6.284
TokenI
Token ID40
Feature activation-1.00e+00

Pos logit contributions
 and+6.337
,+6.226
.+6.123
 the+5.971
 a+5.899
Neg logit contributions
umm-3.131
lor-2.927
osity-2.825
vation-2.823
arians-2.761
Token will
Token ID481
Feature activation-1.00e+00

Pos logit contributions
 and+4.764
,+4.640
.+4.556
 the+4.417
 a+4.330
Neg logit contributions
umm-4.629
lor-4.451
osity-4.345
vation-4.344
rol-4.287
Token scare
Token ID19437
Feature activation-1.00e+00

Pos logit contributions
 and+4.047
,+3.893
.+3.797
 the+3.674
 a+3.588
Neg logit contributions
umm-5.234
lor-5.117
vation-5.028
osity-5.013
arians-4.983
Token you
Token ID345
Feature activation-1.00e+00

Pos logit contributions
 and+3.169
,+3.052
.+2.940
 a+2.683
 the+2.672
Neg logit contributions
umm-6.085
lor-5.918
vation-5.838
ocol-5.790
rol-5.778
Token every
Token ID790
Feature activation-1.00e+00

Pos logit contributions
 and+3.002
,+2.876
 the+2.759
.+2.745
 a+2.638
Neg logit contributions
umm-6.049
lor-6.041
osity-5.895
rol-5.887
vation-5.881
Token night
Token ID1755
Feature activation-1.00e+00

Pos logit contributions
 and+3.908
,+3.785
.+3.674
 the+3.552
 a+3.457
Neg logit contributions
umm-5.460
lor-5.275
vation-5.199
osity-5.106
rol-5.073
Token!"
Token ID2474
Feature activation-1.00e+00

Pos logit contributions
 and+2.517
,+2.385
 the+2.250
.+2.219
 a+2.189
Neg logit contributions
umm-6.761
lor-6.552
vation-6.500
arians-6.378
osity-6.369
Token Tom
Token ID4186
Feature activation-1.00e+00

Pos logit contributions
 and+2.441
,+2.369
.+2.252
 the+2.106
 a+2.060
Neg logit contributions
umm-6.812
lor-6.633
vation-6.559
osity-6.495
rol-6.445
Token was
Token ID373
Feature activation-1.00e+00

Pos logit contributions
 and+3.491
,+3.422
.+3.374
 the+3.237
 a+3.135
Neg logit contributions
umm-5.604
lor-5.530
vation-5.406
arians-5.379
osity-5.365
Token Tom
Token ID4186
Feature activation-1.00e+00

Pos logit contributions
 and+2.910
,+2.847
.+2.720
 a+2.437
 the+2.406
Neg logit contributions
umm-6.343
lor-6.151
vation-6.040
ocol-6.005
arians-5.991
Token and
Token ID290
Feature activation-1.00e+00

Pos logit contributions
 and+0.521
,+0.518
 the+0.408
.+0.356
 a+0.335
Neg logit contributions
lor-8.331
umm-8.322
vation-8.203
osity-8.159
arians-8.133
Token said
Token ID531
Feature activation-1.00e+00

Pos logit contributions
 and+3.847
,+3.747
.+3.665
 the+3.371
 a+3.336
Neg logit contributions
umm-5.412
lor-5.177
osity-5.163
vation-5.105
arians-5.082
Token,
Token ID11
Feature activation-1.00e+00

Pos logit contributions
 and+1.183
,+0.937
.+0.930
 the+0.777
 a+0.720
Neg logit contributions
umm-8.092
lor-7.987
osity-7.906
ocol-7.810
ubb-7.787
Token "
Token ID366
Feature activation-1.00e+00

Pos logit contributions
 and+2.584
,+2.479
.+2.397
 the+2.246
 a+2.208
Neg logit contributions
umm-6.535
lor-6.435
osity-6.434
arians-6.291
plets-6.284
TokenI
Token ID40
Feature activation-1.00e+00

Pos logit contributions
 and+6.337
,+6.226
.+6.123
 the+5.971
 a+5.899
Neg logit contributions
umm-3.131
lor-2.927
osity-2.825
vation-2.823
arians-2.761
Token will
Token ID481
Feature activation-1.00e+00

Pos logit contributions
 and+4.764
,+4.640
.+4.556
 the+4.417
 a+4.330
Neg logit contributions
umm-4.629
lor-4.451
osity-4.345
vation-4.344
rol-4.287
Token scare
Token ID19437
Feature activation-1.00e+00

Pos logit contributions
 and+4.047
,+3.893
.+3.797
 the+3.674
 a+3.588
Neg logit contributions
umm-5.234
lor-5.117
vation-5.028
osity-5.013
arians-4.983
Token you
Token ID345
Feature activation-1.00e+00

Pos logit contributions
 and+3.169
,+3.052
.+2.940
 a+2.683
 the+2.672
Neg logit contributions
umm-6.085
lor-5.918
vation-5.838
ocol-5.790
rol-5.778
Token every
Token ID790
Feature activation-1.00e+00

Pos logit contributions
 and+3.002
,+2.876
 the+2.759
.+2.745
 a+2.638
Neg logit contributions
umm-6.049
lor-6.041
osity-5.895
rol-5.887
vation-5.881
Token night
Token ID1755
Feature activation-1.00e+00

Pos logit contributions
 and+3.908
,+3.785
.+3.674
 the+3.552
 a+3.457
Neg logit contributions
umm-5.460
lor-5.275
vation-5.199
osity-5.106
rol-5.073
Token scared
Token ID12008
Feature activation-1.00e+00

Pos logit contributions
 and+3.776
,+3.617
.+3.541
 the+3.397
 a+3.322
Neg logit contributions
umm-5.513
lor-5.311
osity-5.289
ubb-5.287
vation-5.278
Token and
Token ID290
Feature activation-1.00e+00

Pos logit contributions
 and+0.291
,+0.222
 the+0.126
 a+0.120
.+0.078
Neg logit contributions
umm-8.675
lor-8.340
osity-8.330
ocol-8.294
ubb-8.284
Token cried
Token ID16896
Feature activation-1.00e+00

Pos logit contributions
 and+4.067
,+3.939
.+3.834
 the+3.625
 a+3.587
Neg logit contributions
umm-5.143
lor-5.013
osity-4.970
vation-4.953
rol-4.914
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.660
,+0.507
 the+0.389
.+0.331
 a+0.228
Neg logit contributions
umm-8.411
osity-8.140
ubb-8.118
lor-8.070
rol-8.039
Token From
Token ID3574
Feature activation-1.00e+00

Pos logit contributions
 and+3.192
,+3.077
.+2.968
 the+2.806
 a+2.741
Neg logit contributions
umm-6.239
lor-6.030
osity-5.937
vation-5.913
arians-5.869
Token that
Token ID326
Feature activation-1.00e+00

Pos logit contributions
 and+1.967
,+1.869
.+1.767
 the+1.549
 a+1.485
Neg logit contributions
umm-7.336
lor-7.152
vation-7.057
osity-7.051
arians-7.014
Token day
Token ID1110
Feature activation-1.00e+00

Pos logit contributions
 and+3.146
,+3.024
.+2.937
 the+2.785
 a+2.716
Neg logit contributions
umm-6.293
lor-6.093
vation-5.996
osity-5.989
arians-5.911
Token on
Token ID319
Feature activation-1.00e+00

Pos logit contributions
 and+2.109
,+1.970
.+1.891
 the+1.746
 a+1.680
Neg logit contributions
umm-7.264
lor-7.084
vation-6.979
osity-6.976
arians-6.908
Token,
Token ID11
Feature activation-1.00e+00

Pos logit contributions
 and+0.345
,+0.143
.+0.139
 a-0.055
 the-0.056
Neg logit contributions
umm-8.969
lor-8.811
vation-8.669
arians-8.621
osity-8.599
Token Tom
Token ID4186
Feature activation-1.00e+00

Pos logit contributions
 and+2.905
,+2.781
.+2.712
 the+2.433
 a+2.405
Neg logit contributions
umm-6.407
osity-6.115
lor-6.099
vation-6.096
arians-6.058
Token was
Token ID373
Feature activation-1.00e+00

Pos logit contributions
 and+3.298
,+3.211
.+3.146
 the+2.990
 a+2.929
Neg logit contributions
umm-5.923
lor-5.816
osity-5.693
vation-5.674
arians-5.648
Token and
Token ID290
Feature activation-1.00e+00

Pos logit contributions
 and+0.291
,+0.222
 the+0.126
 a+0.120
.+0.078
Neg logit contributions
umm-8.675
lor-8.340
osity-8.330
ocol-8.294
ubb-8.284
Token cried
Token ID16896
Feature activation-1.00e+00

Pos logit contributions
 and+4.067
,+3.939
.+3.834
 the+3.625
 a+3.587
Neg logit contributions
umm-5.143
lor-5.013
osity-4.970
vation-4.953
rol-4.914
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.660
,+0.507
 the+0.389
.+0.331
 a+0.228
Neg logit contributions
umm-8.411
osity-8.140
ubb-8.118
lor-8.070
rol-8.039
Token From
Token ID3574
Feature activation-1.00e+00

Pos logit contributions
 and+3.192
,+3.077
.+2.968
 the+2.806
 a+2.741
Neg logit contributions
umm-6.239
lor-6.030
osity-5.937
vation-5.913
arians-5.869
Token that
Token ID326
Feature activation-1.00e+00

Pos logit contributions
 and+1.967
,+1.869
.+1.767
 the+1.549
 a+1.485
Neg logit contributions
umm-7.336
lor-7.152
vation-7.057
osity-7.051
arians-7.014
Token day
Token ID1110
Feature activation-1.00e+00

Pos logit contributions
 and+3.146
,+3.024
.+2.937
 the+2.785
 a+2.716
Neg logit contributions
umm-6.293
lor-6.093
vation-5.996
osity-5.989
arians-5.911
Token on
Token ID319
Feature activation-1.00e+00

Pos logit contributions
 and+2.109
,+1.970
.+1.891
 the+1.746
 a+1.680
Neg logit contributions
umm-7.264
lor-7.084
vation-6.979
osity-6.976
arians-6.908
Token,
Token ID11
Feature activation-1.00e+00

Pos logit contributions
 and+0.345
,+0.143
.+0.139
 a-0.055
 the-0.056
Neg logit contributions
umm-8.969
lor-8.811
vation-8.669
arians-8.621
osity-8.599
Token Tom
Token ID4186
Feature activation-1.00e+00

Pos logit contributions
 and+2.905
,+2.781
.+2.712
 the+2.433
 a+2.405
Neg logit contributions
umm-6.407
osity-6.115
lor-6.099
vation-6.096
arians-6.058
Token was
Token ID373
Feature activation-1.00e+00

Pos logit contributions
 and+3.298
,+3.211
.+3.146
 the+2.990
 a+2.929
Neg logit contributions
umm-5.923
lor-5.816
osity-5.693
vation-5.674
arians-5.648
Token always
Token ID1464
Feature activation-1.00e+00

Pos logit contributions
 and+3.575
,+3.453
.+3.345
 the+3.119
 a+3.025
Neg logit contributions
umm-5.722
lor-5.634
osity-5.518
vation-5.513
rol-5.465
Token,
Token ID11
Feature activation-1.00e+00

Pos logit contributions
 and+0.190
,+0.010
.-0.034
 the-0.180
 a-0.251
Neg logit contributions
umm-9.149
lor-8.942
osity-8.868
vation-8.847
rol-8.776
Token Tom
Token ID4186
Feature activation-1.00e+00

Pos logit contributions
 and+2.534
,+2.393
.+2.318
 the+2.102
 a+2.026
Neg logit contributions
umm-6.809
lor-6.550
osity-6.519
vation-6.511
arians-6.441
Token saw
Token ID2497
Feature activation-1.00e+00

Pos logit contributions
 and+3.121
,+3.047
.+2.992
 the+2.840
 a+2.763
Neg logit contributions
umm-6.025
lor-5.892
osity-5.764
vation-5.763
arians-5.737
Token an
Token ID281
Feature activation-1.00e+00

Pos logit contributions
 and+1.001
,+0.875
.+0.785
 the+0.440
 a+0.310
Neg logit contributions
umm-8.286
lor-8.098
osity-8.059
vation-7.952
ocol-7.950
Token unknown
Token ID6439
Feature activation-1.00e+00

Pos logit contributions
 and+4.504
,+4.402
.+4.322
 the+4.165
 a+4.068
Neg logit contributions
umm-4.903
lor-4.577
osity-4.536
vation-4.515
ocol-4.502
Token person
Token ID1048
Feature activation-1.00e+00

Pos logit contributions
 and+4.877
,+4.762
.+4.662
 the+4.568
 a+4.483
Neg logit contributions
umm-4.478
lor-4.186
osity-4.132
vation-4.081
ocol-4.041
Token who
Token ID508
Feature activation-1.00e+00

Pos logit contributions
 and+2.310
,+2.219
.+2.051
 the+1.996
 a+1.959
Neg logit contributions
umm-6.758
lor-6.671
osity-6.600
vation-6.591
arians-6.552
Token needed
Token ID2622
Feature activation-1.00e+00

Pos logit contributions
 and+3.674
,+3.578
.+3.474
 the+3.293
 a+3.248
Neg logit contributions
umm-5.573
lor-5.388
osity-5.354
vation-5.322
arians-5.298
Token help
Token ID1037
Feature activation-1.00e+00

Pos logit contributions
 and+2.985
,+2.818
.+2.684
 the+2.442
 a+2.294
Neg logit contributions
umm-6.196
lor-6.072
ocol-6.050
vation-5.960
osity-5.951
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+1.249
,+1.089
.+0.857
 the+0.836
 a+0.777
Neg logit contributions
umm-7.622
ocol-7.506
arians-7.501
lor-7.441
osity-7.432
Token The
Token ID383
Feature activation-1.00e+00

Pos logit contributions
 and+3.341
,+3.236
.+3.126
 the+2.948
 a+2.899
Neg logit contributions
umm-6.036
lor-5.833
osity-5.734
vation-5.723
arians-5.668
Token Tom
Token ID4186
Feature activation-1.00e+00

Pos logit contributions
 and+3.310
,+3.112
.+3.031
 the+2.786
 a+2.713
Neg logit contributions
umm-6.112
lor-5.837
osity-5.827
arians-5.777
vation-5.761
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.460
,+0.377
.+0.185
 the+0.163
 a+0.095
Neg logit contributions
lor-8.502
umm-8.501
osity-8.352
vation-8.278
rim-8.269
Token Tom
Token ID4186
Feature activation-1.00e+00

Pos logit contributions
 and+2.886
,+2.775
.+2.673
 the+2.527
 a+2.454
Neg logit contributions
umm-6.545
lor-6.364
vation-6.250
osity-6.249
arians-6.190
Token liked
Token ID8288
Feature activation-1.00e+00

Pos logit contributions
 and+3.373
,+3.274
.+3.187
 the+3.045
 a+2.965
Neg logit contributions
umm-5.879
lor-5.709
osity-5.603
vation-5.594
arians-5.549
Token to
Token ID284
Feature activation-1.00e+00

Pos logit contributions
 and+1.278
,+1.210
.+1.111
 the+0.877
 a+0.815
Neg logit contributions
umm-8.038
osity-7.767
lor-7.760
vation-7.652
ocol-7.645
Token help
Token ID1037
Feature activation-1.00e+00

Pos logit contributions
 and+3.920
,+3.799
.+3.682
 the+3.536
 a+3.435
Neg logit contributions
umm-5.404
lor-5.202
vation-5.149
osity-5.123
arians-5.068
Token others
Token ID1854
Feature activation-1.00e+00

Pos logit contributions
 and+3.128
,+3.027
.+2.865
 the+2.659
 a+2.644
Neg logit contributions
umm-6.185
lor-5.841
ocol-5.815
arians-5.794
rim-5.792
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+1.490
,+1.380
.+1.237
 the+1.186
 a+1.141
Neg logit contributions
umm-7.597
lor-7.452
osity-7.439
arians-7.391
vation-7.375
Token One
Token ID1881
Feature activation-1.00e+00

Pos logit contributions
 and+3.614
,+3.502
.+3.393
 the+3.247
 a+3.176
Neg logit contributions
umm-5.811
lor-5.588
osity-5.500
vation-5.484
arians-5.431
Token day
Token ID1110
Feature activation-1.00e+00

Pos logit contributions
 and+3.172
,+3.070
.+2.965
 the+2.815
 a+2.730
Neg logit contributions
umm-6.215
lor-5.967
vation-5.904
osity-5.892
rol-5.789
Token,
Token ID11
Feature activation-1.00e+00

Pos logit contributions
 and+0.190
,+0.010
.-0.034
 the-0.180
 a-0.251
Neg logit contributions
umm-9.149
lor-8.942
osity-8.868
vation-8.847
rol-8.776
Token a
Token ID257
Feature activation-1.00e+00

Pos logit contributions
 and+2.811
,+2.661
.+2.488
 the+2.241
 a+2.109
Neg logit contributions
umm-6.392
lor-6.328
osity-6.213
rol-6.164
vation-6.152
Token good
Token ID922
Feature activation-1.00e+00

Pos logit contributions
 and+5.774
,+5.607
.+5.570
 the+5.459
 a+5.321
Neg logit contributions
umm-3.579
lor-3.402
osity-3.315
vation-3.295
 former-3.265
Token night
Token ID1755
Feature activation-1.00e+00

Pos logit contributions
 and+4.956
,+4.843
.+4.763
 the+4.717
 a+4.554
Neg logit contributions
umm-4.393
lor-4.134
osity-4.053
vation-4.041
 former-4.005
Token's
Token ID338
Feature activation-1.00e+00

Pos logit contributions
 and+3.213
,+3.100
.+2.933
 the+2.926
 a+2.864
Neg logit contributions
umm-6.141
lor-5.889
vation-5.851
osity-5.724
plets-5.693
Token sleep
Token ID3993
Feature activation-1.00e+00

Pos logit contributions
 and+5.503
,+5.396
.+5.228
 the+5.204
 a+5.065
Neg logit contributions
umm-3.825
lor-3.576
 former-3.567
osity-3.528
vation-3.506
Token again
Token ID757
Feature activation-1.00e+00

Pos logit contributions
 and+3.354
,+3.245
 the+3.064
.+3.041
 a+2.980
Neg logit contributions
umm-5.952
lor-5.843
vation-5.702
arians-5.614
osity-5.555
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.516
,+0.404
 the+0.238
 a+0.212
.+0.086
Neg logit contributions
umm-8.552
lor-8.454
 former-8.319
vation-8.303
rim-8.219
Token<|endoftext|>
Token ID50256
Feature activation-1.00e+00

Pos logit contributions
 and+1.638
,+1.523
.+1.425
 the+1.277
 a+1.205
Neg logit contributions
umm-7.784
lor-7.579
osity-7.482
vation-7.474
arians-7.417
TokenOnce
Token ID7454
Feature activation-1.00e+00

Pos logit contributions
 and+2.072
,+1.971
.+1.834
 the+1.700
 a+1.633
Neg logit contributions
umm-7.240
lor-7.114
vation-6.966
osity-6.908
rol-6.905
Token upon
Token ID2402
Feature activation-1.00e+00

Pos logit contributions
 and+4.143
,+3.988
.+3.913
 the+3.762
 a+3.735
Neg logit contributions
umm-5.123
lor-4.966
osity-4.894
vation-4.841
rol-4.804
Token a
Token ID257
Feature activation-1.00e+00

Pos logit contributions
 and+0.605
,+0.574
.+0.378
 the+0.217
 a+0.027
Neg logit contributions
umm-8.506
lor-8.451
vation-8.307
osity-8.245
rol-8.220
Token,
Token ID11
Feature activation-1.00e+00

Pos logit contributions
 and+0.265
.+0.056
,+0.020
 the-0.088
 a-0.155
Neg logit contributions
umm-8.971
lor-8.833
osity-8.735
vation-8.669
rol-8.663
Token there
Token ID612
Feature activation-1.00e+00

Pos logit contributions
 and+1.940
,+1.790
.+1.731
 the+1.546
 a+1.459
Neg logit contributions
umm-7.329
lor-7.147
osity-7.099
vation-7.024
rol-7.018
Token was
Token ID373
Feature activation-1.00e+00

Pos logit contributions
 and+1.614
,+1.449
.+1.407
 the+1.221
 a+1.190
Neg logit contributions
umm-7.644
lor-7.480
osity-7.328
vation-7.322
rol-7.315
Token a
Token ID257
Feature activation-1.00e+00

Pos logit contributions
 and+0.626
,+0.539
.+0.442
 the+0.229
 a+0.079
Neg logit contributions
umm-8.702
lor-8.470
osity-8.361
vation-8.346
rol-8.336
Token kind
Token ID1611
Feature activation-1.00e+00

Pos logit contributions
 and+2.975
,+2.860
.+2.768
 the+2.616
 a+2.543
Neg logit contributions
umm-6.405
lor-6.182
osity-6.097
vation-6.085
arians-6.034
Token person
Token ID1048
Feature activation-1.00e+00

Pos logit contributions
 and+4.255
,+4.146
.+4.069
 the+3.933
 a+3.868
Neg logit contributions
umm-5.098
lor-4.856
osity-4.781
vation-4.727
arians-4.690
Token named
Token ID3706
Feature activation-1.00e+00

Pos logit contributions
 and+3.705
,+3.589
.+3.468
 the+3.367
 a+3.309
Neg logit contributions
umm-5.580
lor-5.420
osity-5.317
vation-5.285
arians-5.251
Token Tom
Token ID4186
Feature activation-1.00e+00

Pos logit contributions
 and+3.310
,+3.112
.+3.031
 the+2.786
 a+2.713
Neg logit contributions
umm-6.112
lor-5.837
osity-5.827
arians-5.777
vation-5.761
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.460
,+0.377
.+0.185
 the+0.163
 a+0.095
Neg logit contributions
lor-8.502
umm-8.501
osity-8.352
vation-8.278
rim-8.269
Token Tom
Token ID4186
Feature activation-1.00e+00

Pos logit contributions
 and+2.886
,+2.775
.+2.673
 the+2.527
 a+2.454
Neg logit contributions
umm-6.545
lor-6.364
vation-6.250
osity-6.249
arians-6.190
Token liked
Token ID8288
Feature activation-1.00e+00

Pos logit contributions
 and+3.373
,+3.274
.+3.187
 the+3.045
 a+2.965
Neg logit contributions
umm-5.879
lor-5.709
osity-5.603
vation-5.594
arians-5.549
Token needed
Token ID2622
Feature activation-1.00e+00

Pos logit contributions
 and+3.674
,+3.578
.+3.474
 the+3.293
 a+3.248
Neg logit contributions
umm-5.573
lor-5.388
osity-5.354
vation-5.322
arians-5.298
Token help
Token ID1037
Feature activation-1.00e+00

Pos logit contributions
 and+2.985
,+2.818
.+2.684
 the+2.442
 a+2.294
Neg logit contributions
umm-6.196
lor-6.072
ocol-6.050
vation-5.960
osity-5.951
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+1.249
,+1.089
.+0.857
 the+0.836
 a+0.777
Neg logit contributions
umm-7.622
ocol-7.506
arians-7.501
lor-7.441
osity-7.432
Token The
Token ID383
Feature activation-1.00e+00

Pos logit contributions
 and+3.341
,+3.236
.+3.126
 the+2.948
 a+2.899
Neg logit contributions
umm-6.036
lor-5.833
osity-5.734
vation-5.723
arians-5.668
Token unknown
Token ID6439
Feature activation-1.00e+00

Pos logit contributions
 and+4.270
,+4.136
.+4.017
 the+3.860
 a+3.786
Neg logit contributions
umm-5.176
lor-4.965
vation-4.889
osity-4.871
arians-4.802
Token person
Token ID1048
Feature activation-1.00e+00

Pos logit contributions
 and+4.193
,+4.052
.+3.952
 the+3.841
 a+3.774
Neg logit contributions
umm-5.131
lor-4.911
vation-4.854
osity-4.837
arians-4.759
Token had
Token ID550
Feature activation-1.00e+00

Pos logit contributions
 and+3.744
,+3.616
.+3.538
 the+3.397
 a+3.338
Neg logit contributions
umm-5.400
lor-5.260
osity-5.241
vation-5.194
arians-5.193
Token a
Token ID257
Feature activation-1.00e+00

Pos logit contributions
 and+3.468
,+3.399
.+3.233
 the+2.957
 a+2.803
Neg logit contributions
umm-5.676
lor-5.665
osity-5.526
 former-5.502
vation-5.451
Token flat
Token ID6228
Feature activation-1.00e+00

Pos logit contributions
 and+4.291
,+4.133
.+4.096
 the+3.938
 a+3.794
Neg logit contributions
umm-5.051
lor-4.846
osity-4.769
 former-4.734
vation-4.730
Token tire
Token ID15867
Feature activation-1.00e+00

Pos logit contributions
 and+5.136
,+4.993
.+4.978
 the+4.870
 a+4.771
Neg logit contributions
umm-4.164
lor-3.939
osity-3.807
ocol-3.787
 former-3.781
Token on
Token ID319
Feature activation-1.00e+00

Pos logit contributions
 and+1.790
,+1.699
 the+1.591
.+1.528
 a+1.472
Neg logit contributions
umm-7.248
lor-7.060
osity-7.003
arians-6.981
vation-6.956
Token an
Token ID281
Feature activation-1.00e+00

Pos logit contributions
 and+1.001
,+0.875
.+0.785
 the+0.440
 a+0.310
Neg logit contributions
umm-8.286
lor-8.098
osity-8.059
vation-7.952
ocol-7.950
Token unknown
Token ID6439
Feature activation-1.00e+00

Pos logit contributions
 and+4.504
,+4.402
.+4.322
 the+4.165
 a+4.068
Neg logit contributions
umm-4.903
lor-4.577
osity-4.536
vation-4.515
ocol-4.502
Token person
Token ID1048
Feature activation-1.00e+00

Pos logit contributions
 and+4.877
,+4.762
.+4.662
 the+4.568
 a+4.483
Neg logit contributions
umm-4.478
lor-4.186
osity-4.132
vation-4.081
ocol-4.041
Token who
Token ID508
Feature activation-1.00e+00

Pos logit contributions
 and+2.310
,+2.219
.+2.051
 the+1.996
 a+1.959
Neg logit contributions
umm-6.758
lor-6.671
osity-6.600
vation-6.591
arians-6.552
Token needed
Token ID2622
Feature activation-1.00e+00

Pos logit contributions
 and+3.674
,+3.578
.+3.474
 the+3.293
 a+3.248
Neg logit contributions
umm-5.573
lor-5.388
osity-5.354
vation-5.322
arians-5.298
Token help
Token ID1037
Feature activation-1.00e+00

Pos logit contributions
 and+2.985
,+2.818
.+2.684
 the+2.442
 a+2.294
Neg logit contributions
umm-6.196
lor-6.072
ocol-6.050
vation-5.960
osity-5.951
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+1.249
,+1.089
.+0.857
 the+0.836
 a+0.777
Neg logit contributions
umm-7.622
ocol-7.506
arians-7.501
lor-7.441
osity-7.432
Token The
Token ID383
Feature activation-1.00e+00

Pos logit contributions
 and+3.341
,+3.236
.+3.126
 the+2.948
 a+2.899
Neg logit contributions
umm-6.036
lor-5.833
osity-5.734
vation-5.723
arians-5.668
Token unknown
Token ID6439
Feature activation-1.00e+00

Pos logit contributions
 and+4.270
,+4.136
.+4.017
 the+3.860
 a+3.786
Neg logit contributions
umm-5.176
lor-4.965
vation-4.889
osity-4.871
arians-4.802
Token person
Token ID1048
Feature activation-1.00e+00

Pos logit contributions
 and+4.193
,+4.052
.+3.952
 the+3.841
 a+3.774
Neg logit contributions
umm-5.131
lor-4.911
vation-4.854
osity-4.837
arians-4.759
Token had
Token ID550
Feature activation-1.00e+00

Pos logit contributions
 and+3.744
,+3.616
.+3.538
 the+3.397
 a+3.338
Neg logit contributions
umm-5.400
lor-5.260
osity-5.241
vation-5.194
arians-5.193
Token tire
Token ID15867
Feature activation-1.00e+00

Pos logit contributions
 and+5.136
,+4.993
.+4.978
 the+4.870
 a+4.771
Neg logit contributions
umm-4.164
lor-3.939
osity-3.807
ocol-3.787
 former-3.781
Token on
Token ID319
Feature activation-1.00e+00

Pos logit contributions
 and+1.790
,+1.699
 the+1.591
.+1.528
 a+1.472
Neg logit contributions
umm-7.248
lor-7.060
osity-7.003
arians-6.981
vation-6.956
Token a
Token ID257
Feature activation-1.00e+00

Pos logit contributions
 and+2.282
,+2.156
.+2.007
 the+1.708
 a+1.682
Neg logit contributions
umm-6.888
lor-6.732
ocol-6.677
arians-6.647
osity-6.636
Token bike
Token ID7161
Feature activation-1.00e+00

Pos logit contributions
 and+4.957
,+4.827
.+4.708
 the+4.606
 a+4.549
Neg logit contributions
umm-4.323
lor-4.191
vation-4.030
arians-3.984
osity-3.954
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.795
,+0.705
 the+0.581
.+0.514
 a+0.512
Neg logit contributions
umm-8.256
lor-8.183
vation-8.073
osity-8.058
arians-8.035
Token\n
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+2.660
,+2.553
.+2.454
 the+2.287
 a+2.234
Neg logit contributions
umm-6.716
lor-6.514
vation-6.414
osity-6.412
arians-6.361
Token\n
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+1.537
,+1.438
.+1.320
 the+1.183
 a+1.121
Neg logit contributions
umm-7.711
lor-7.607
vation-7.466
osity-7.417
rol-7.402
TokenTom
Token ID13787
Feature activation-1.00e+00

Pos logit contributions
 and+4.667
,+4.548
.+4.449
 the+4.301
 a+4.229
Neg logit contributions
umm-4.825
lor-4.607
osity-4.525
vation-4.496
arians-4.446
Token wanted
Token ID2227
Feature activation-1.00e+00

Pos logit contributions
 and+3.099
,+3.024
.+2.989
 the+2.844
 a+2.760
Neg logit contributions
umm-5.915
lor-5.856
arians-5.783
osity-5.745
vation-5.700
Token to
Token ID284
Feature activation-1.00e+00

Pos logit contributions
 and+1.186
,+1.083
.+1.000
 the+0.699
 a+0.649
Neg logit contributions
umm-7.986
lor-7.830
osity-7.767
vation-7.731
arians-7.720
Token lend
Token ID22096
Feature activation-1.00e+00

Pos logit contributions
 and+3.518
,+3.385
.+3.287
 the+3.103
 a+3.041
Neg logit contributions
umm-5.825
lor-5.611
vation-5.571
osity-5.540
arians-5.494
Token on
Token ID319
Feature activation-1.00e+00

Pos logit contributions
 and+1.790
,+1.699
 the+1.591
.+1.528
 a+1.472
Neg logit contributions
umm-7.248
lor-7.060
osity-7.003
arians-6.981
vation-6.956
Token a
Token ID257
Feature activation-1.00e+00

Pos logit contributions
 and+2.282
,+2.156
.+2.007
 the+1.708
 a+1.682
Neg logit contributions
umm-6.888
lor-6.732
ocol-6.677
arians-6.647
osity-6.636
Token bike
Token ID7161
Feature activation-1.00e+00

Pos logit contributions
 and+4.957
,+4.827
.+4.708
 the+4.606
 a+4.549
Neg logit contributions
umm-4.323
lor-4.191
vation-4.030
arians-3.984
osity-3.954
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.795
,+0.705
 the+0.581
.+0.514
 a+0.512
Neg logit contributions
umm-8.256
lor-8.183
vation-8.073
osity-8.058
arians-8.035
Token\n
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+2.660
,+2.553
.+2.454
 the+2.287
 a+2.234
Neg logit contributions
umm-6.716
lor-6.514
vation-6.414
osity-6.412
arians-6.361
Token\n
Token ID198
Feature activation-1.00e+00

Pos logit contributions
 and+1.537
,+1.438
.+1.320
 the+1.183
 a+1.121
Neg logit contributions
umm-7.711
lor-7.607
vation-7.466
osity-7.417
rol-7.402
TokenTom
Token ID13787
Feature activation-1.00e+00

Pos logit contributions
 and+4.667
,+4.548
.+4.449
 the+4.301
 a+4.229
Neg logit contributions
umm-4.825
lor-4.607
osity-4.525
vation-4.496
arians-4.446
Token wanted
Token ID2227
Feature activation-1.00e+00

Pos logit contributions
 and+3.099
,+3.024
.+2.989
 the+2.844
 a+2.760
Neg logit contributions
umm-5.915
lor-5.856
arians-5.783
osity-5.745
vation-5.700
Token to
Token ID284
Feature activation-1.00e+00

Pos logit contributions
 and+1.186
,+1.083
.+1.000
 the+0.699
 a+0.649
Neg logit contributions
umm-7.986
lor-7.830
osity-7.767
vation-7.731
arians-7.720
Token lend
Token ID22096
Feature activation-1.00e+00

Pos logit contributions
 and+3.518
,+3.385
.+3.287
 the+3.103
 a+3.041
Neg logit contributions
umm-5.825
lor-5.611
vation-5.571
osity-5.540
arians-5.494
Token a
Token ID257
Feature activation-1.00e+00

Pos logit contributions
 and+1.281
,+1.117
.+1.041
 the+0.696
 a+0.644
Neg logit contributions
umm-7.846
lor-7.639
arians-7.636
osity-7.583
ocol-7.580
Token never
Token ID1239
Feature activation-1.00e+00

Pos logit contributions
 and+3.806
,+3.722
.+3.627
 the+3.475
 a+3.404
Neg logit contributions
umm-5.492
lor-5.356
osity-5.238
vation-5.226
arians-5.202
Token had
Token ID550
Feature activation-1.00e+00

Pos logit contributions
 and+4.292
,+4.136
.+4.042
 the+3.906
 a+3.835
Neg logit contributions
umm-4.988
osity-4.832
lor-4.786
vation-4.726
reetings-4.692
Token a
Token ID257
Feature activation-1.00e+00

Pos logit contributions
 and+2.811
,+2.661
.+2.488
 the+2.241
 a+2.109
Neg logit contributions
umm-6.392
lor-6.328
osity-6.213
rol-6.164
vation-6.152
Token good
Token ID922
Feature activation-1.00e+00

Pos logit contributions
 and+5.774
,+5.607
.+5.570
 the+5.459
 a+5.321
Neg logit contributions
umm-3.579
lor-3.402
osity-3.315
vation-3.295
 former-3.265
Token night
Token ID1755
Feature activation-1.00e+00

Pos logit contributions
 and+4.956
,+4.843
.+4.763
 the+4.717
 a+4.554
Neg logit contributions
umm-4.393
lor-4.134
osity-4.053
vation-4.041
 former-4.005
Token's
Token ID338
Feature activation-1.00e+00

Pos logit contributions
 and+3.213
,+3.100
.+2.933
 the+2.926
 a+2.864
Neg logit contributions
umm-6.141
lor-5.889
vation-5.851
osity-5.724
plets-5.693
Token sleep
Token ID3993
Feature activation-1.00e+00

Pos logit contributions
 and+5.503
,+5.396
.+5.228
 the+5.204
 a+5.065
Neg logit contributions
umm-3.825
lor-3.576
 former-3.567
osity-3.528
vation-3.506
Token again
Token ID757
Feature activation-1.00e+00

Pos logit contributions
 and+3.354
,+3.245
 the+3.064
.+3.041
 a+2.980
Neg logit contributions
umm-5.952
lor-5.843
vation-5.702
arians-5.614
osity-5.555
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.516
,+0.404
 the+0.238
 a+0.212
.+0.086
Neg logit contributions
umm-8.552
lor-8.454
 former-8.319
vation-8.303
rim-8.219
Token<|endoftext|>
Token ID50256
Feature activation-1.00e+00

Pos logit contributions
 and+1.638
,+1.523
.+1.425
 the+1.277
 a+1.205
Neg logit contributions
umm-7.784
lor-7.579
osity-7.482
vation-7.474
arians-7.417
TokenOnce
Token ID7454
Feature activation-1.00e+00

Pos logit contributions
 and+2.072
,+1.971
.+1.834
 the+1.700
 a+1.633
Neg logit contributions
umm-7.240
lor-7.114
vation-6.966
osity-6.908
rol-6.905
Token and
Token ID290
Feature activation-1.00e+00

Pos logit contributions
,+2.542
 and+2.511
.+2.461
 the+2.226
 a+2.191
Neg logit contributions
umm-6.531
lor-6.331
osity-6.320
arians-6.259
vation-6.241
Token he
Token ID339
Feature activation-1.00e+00

Pos logit contributions
 and+2.939
,+2.856
.+2.794
 the+2.488
 a+2.455
Neg logit contributions
umm-6.271
lor-6.091
osity-6.065
arians-5.963
vation-5.961
Token never
Token ID1239
Feature activation-1.00e+00

Pos logit contributions
 and+3.806
,+3.722
.+3.627
 the+3.475
 a+3.404
Neg logit contributions
umm-5.492
lor-5.356
osity-5.238
vation-5.226
arians-5.202
Token had
Token ID550
Feature activation-1.00e+00

Pos logit contributions
 and+4.292
,+4.136
.+4.042
 the+3.906
 a+3.835
Neg logit contributions
umm-4.988
osity-4.832
lor-4.786
vation-4.726
reetings-4.692
Token a
Token ID257
Feature activation-1.00e+00

Pos logit contributions
 and+2.811
,+2.661
.+2.488
 the+2.241
 a+2.109
Neg logit contributions
umm-6.392
lor-6.328
osity-6.213
rol-6.164
vation-6.152
Token good
Token ID922
Feature activation-1.00e+00

Pos logit contributions
 and+5.774
,+5.607
.+5.570
 the+5.459
 a+5.321
Neg logit contributions
umm-3.579
lor-3.402
osity-3.315
vation-3.295
 former-3.265
Token night
Token ID1755
Feature activation-1.00e+00

Pos logit contributions
 and+4.956
,+4.843
.+4.763
 the+4.717
 a+4.554
Neg logit contributions
umm-4.393
lor-4.134
osity-4.053
vation-4.041
 former-4.005
Token's
Token ID338
Feature activation-1.00e+00

Pos logit contributions
 and+3.213
,+3.100
.+2.933
 the+2.926
 a+2.864
Neg logit contributions
umm-6.141
lor-5.889
vation-5.851
osity-5.724
plets-5.693
Token sleep
Token ID3993
Feature activation-1.00e+00

Pos logit contributions
 and+5.503
,+5.396
.+5.228
 the+5.204
 a+5.065
Neg logit contributions
umm-3.825
lor-3.576
 former-3.567
osity-3.528
vation-3.506
Token again
Token ID757
Feature activation-1.00e+00

Pos logit contributions
 and+3.354
,+3.245
 the+3.064
.+3.041
 a+2.980
Neg logit contributions
umm-5.952
lor-5.843
vation-5.702
arians-5.614
osity-5.555
Token.
Token ID13
Feature activation-1.00e+00

Pos logit contributions
 and+0.516
,+0.404
 the+0.238
 a+0.212
.+0.086
Neg logit contributions
umm-8.552
lor-8.454
 former-8.319
vation-8.303
rim-8.219
Token in
Token ID287
Feature activation-1.00e+00

Pos logit contributions
 and+1.828
,+1.754
.+1.642
 the+1.556
 a+1.545
Neg logit contributions
umm-7.279
osity-6.988
lor-6.920
rol-6.912
ubb-6.900
Token his
Token ID465
Feature activation-1.00e+00

Pos logit contributions
 and+3.776
,+3.633
.+3.498
 the+3.214
 a+3.188
Neg logit contributions
umm-5.479
lor-5.292
ocol-5.283
ubb-5.264
 former-5.205
Token room
Token ID2119
Feature activation-1.00e+00

Pos logit contributions
 and+5.205
,+5.010
.+4.931
 the+4.835
 a+4.753
Neg logit contributions
umm-4.188
 former-3.922
lor-3.896
vation-3.805
ocol-3.776
Token,
Token ID11
Feature activation-1.00e+00

Pos logit contributions
 and+1.466
,+1.376
 the+1.265
.+1.226
 a+1.168
Neg logit contributions
umm-7.597
lor-7.479
arians-7.351
osity-7.332
 former-7.331
Token and
Token ID290
Feature activation-1.00e+00

Pos logit contributions
,+2.542
 and+2.511
.+2.461
 the+2.226
 a+2.191
Neg logit contributions
umm-6.531
lor-6.331
osity-6.320
arians-6.259
vation-6.241
Token he
Token ID339
Feature activation-1.00e+00

Pos logit contributions
 and+2.939
,+2.856
.+2.794
 the+2.488
 a+2.455
Neg logit contributions
umm-6.271
lor-6.091
osity-6.065
arians-5.963
vation-5.961
Token never
Token ID1239
Feature activation-1.00e+00

Pos logit contributions
 and+3.806
,+3.722
.+3.627
 the+3.475
 a+3.404
Neg logit contributions
umm-5.492
lor-5.356
osity-5.238
vation-5.226
arians-5.202
Token had
Token ID550
Feature activation-1.00e+00

Pos logit contributions
 and+4.292
,+4.136
.+4.042
 the+3.906
 a+3.835
Neg logit contributions
umm-4.988
osity-4.832
lor-4.786
vation-4.726
reetings-4.692
Token a
Token ID257
Feature activation-1.00e+00

Pos logit contributions
 and+2.811
,+2.661
.+2.488
 the+2.241
 a+2.109
Neg logit contributions
umm-6.392
lor-6.328
osity-6.213
rol-6.164
vation-6.152
Token good
Token ID922
Feature activation-1.00e+00

Pos logit contributions
 and+5.774
,+5.607
.+5.570
 the+5.459
 a+5.321
Neg logit contributions
umm-3.579
lor-3.402
osity-3.315
vation-3.295
 former-3.265
Token night
Token ID1755
Feature activation-1.00e+00

Pos logit contributions
 and+4.956
,+4.843
.+4.763
 the+4.717
 a+4.554
Neg logit contributions
umm-4.393
lor-4.134
osity-4.053
vation-4.041
 former-4.005
Token his
Token ID465
Feature activation-1.00e+00

Pos logit contributions
 and+3.776
,+3.633
.+3.498
 the+3.214
 a+3.188
Neg logit contributions
umm-5.479
lor-5.292
ocol-5.283
ubb-5.264
 former-5.205
Token room
Token ID2119
Feature activation-1.00e+00

Pos logit contributions
 and+5.205
,+5.010
.+4.931
 the+4.835
 a+4.753
Neg logit contributions
umm-4.188
 former-3.922
lor-3.896
vation-3.805
ocol-3.776
Token,
Token ID11
Feature activation-1.00e+00

Pos logit contributions
 and+1.466
,+1.376
 the+1.265
.+1.226
 a+1.168
Neg logit contributions
umm-7.597
lor-7.479
arians-7.351
osity-7.332
 former-7.331
Token and
Token ID290
Feature activation-1.00e+00

Pos logit contributions
,+2.542
 and+2.511
.+2.461
 the+2.226
 a+2.191
Neg logit contributions
umm-6.531
lor-6.331
osity-6.320
arians-6.259
vation-6.241
Token he
Token ID339
Feature activation-1.00e+00

Pos logit contributions
 and+2.939
,+2.856
.+2.794
 the+2.488
 a+2.455
Neg logit contributions
umm-6.271
lor-6.091
osity-6.065
arians-5.963
vation-5.961
Token never
Token ID1239
Feature activation-1.00e+00

Pos logit contributions
 and+3.806
,+3.722
.+3.627
 the+3.475
 a+3.404
Neg logit contributions
umm-5.492
lor-5.356
osity-5.238
vation-5.226
arians-5.202
Token had
Token ID550
Feature activation-1.00e+00

Pos logit contributions
 and+4.292
,+4.136
.+4.042
 the+3.906
 a+3.835
Neg logit contributions
umm-4.988
osity-4.832
lor-4.786
vation-4.726
reetings-4.692
Token a
Token ID257
Feature activation-1.00e+00

Pos logit contributions
 and+2.811
,+2.661
.+2.488
 the+2.241
 a+2.109
Neg logit contributions
umm-6.392
lor-6.328
osity-6.213
rol-6.164
vation-6.152
Token good
Token ID922
Feature activation-1.00e+00

Pos logit contributions
 and+5.774
,+5.607
.+5.570
 the+5.459
 a+5.321
Neg logit contributions
umm-3.579
lor-3.402
osity-3.315
vation-3.295
 former-3.265
Token night
Token ID1755
Feature activation-1.00e+00

Pos logit contributions
 and+4.956
,+4.843
.+4.763
 the+4.717
 a+4.554
Neg logit contributions
umm-4.393
lor-4.134
osity-4.053
vation-4.041
 former-4.005
Token's
Token ID338
Feature activation-1.00e+00

Pos logit contributions
 and+3.213
,+3.100
.+2.933
 the+2.926
 a+2.864
Neg logit contributions
umm-6.141
lor-5.889
vation-5.851
osity-5.724
plets-5.693