TOP ACTIVATIONS
MAX = -100.267

road has many cars and trucks . The cars and trucks
and finally found the car parked in front of a house
hammer , nails and colorful boxes . He placed the boxes
boxes . He placed the boxes on the roof and made
broke . The woman was very sad . She
. She liked the toy very much . It was a
Ben , but you are very funny . Can you say
tree with lots of cher ries on it . She wanted
share and have fun . <|endoftext|> Anna and Ben are twins
the window and saw cherry trees growing all around . She
the tools , on the boxes , on the cans ,
, Ben , it is very pretty !" Lily said ,
And that was enough . <|endoftext|> Tom and Anna are friends
Anna and Ben are very happy . They run to
the house to make it cozy . " Look
green . It has sharp teeth and a for ked tongue
" Like scissors or knives or guns ," Dad says .
in our pockets or our bags ." " Like
family . The family was very poor but they prayed hard
and Dad check in their bags and get their tickets .

MIN ACTIVATIONS
MIN = -100.267

woman . I have a gift for you . Look ."
't cry , woman . I have a gift for you
I love my toy ." The man said ,
love my toy ." The man said , "
in his room , and he never had a good night
scared and cried . From that day on , Tom was
his room , and he never had a good night 's
and he never had a good night 's sleep again .
Thank you , Tom !" Together , Tom and
and said , " Thank you , Tom !"
, " I can help you fix your bike ." The
your bike ." The unknown person was happy and said ,
, Tom and the unknown person fixed the bike . The
you , Tom !" Together , Tom and the
sounds and move . But one day , the toy fell
can 't fix it . I love my toy ."
He went to the unknown person and said , " I
needed help . The unknown person had a flat tire on
, there was a kind person named Tom . Tom liked
, Tom saw an unknown person who needed help . The

INTERVAL -100.267 - -100.267
CONTAINS 0.000%

Token road
Token ID2975
Feature activation-1.00e+02

Pos logit contributions
 and+4.832
,+4.748
.+4.643
 the+4.449
 a+4.366
Neg logit contributions
umm-4.552
lor-4.269
vation-4.239
osity-4.217
arians-4.200
Token has
Token ID468
Feature activation-1.00e+02

Pos logit contributions
 and+3.580
,+3.467
.+3.374
 the+3.226
 a+3.172
Neg logit contributions
umm-5.775
lor-5.561
osity-5.487
vation-5.485
arians-5.421
Token many
Token ID867
Feature activation-1.00e+02

Pos logit contributions
 and+3.258
,+3.193
.+3.078
 the+2.839
 a+2.710
Neg logit contributions
umm-6.076
lor-5.873
osity-5.792
rol-5.753
vation-5.733
Token cars
Token ID5006
Feature activation-1.00e+02

Pos logit contributions
 and+4.966
,+4.882
.+4.787
 the+4.666
 a+4.621
Neg logit contributions
umm-4.294
arians-3.986
lor-3.932
osity-3.925
vation-3.912
Token and
Token ID290
Feature activation-1.00e+02

Pos logit contributions
 and+1.795
,+1.724
.+1.598
 the+1.567
 a+1.483
Neg logit contributions
umm-7.304
osity-7.096
lor-7.080
vation-7.080
rol-7.053
Token trucks
Token ID13960
Feature activation-1.00e+02

Pos logit contributions
 and+5.852
,+5.775
.+5.682
 the+5.414
 a+5.309
Neg logit contributions
umm-3.421
lor-3.160
osity-3.136
arians-3.135
plets-3.112
Token.
Token ID13
Feature activation-1.00e+02

Pos logit contributions
 and+1.905
,+1.826
.+1.657
 the+1.615
 a+1.551
Neg logit contributions
umm-7.252
osity-7.045
lor-7.033
vation-7.022
rol-6.990
Token The
Token ID383
Feature activation-1.00e+02

Pos logit contributions
 and+3.355
,+3.236
.+3.138
 the+2.964
 a+2.899
Neg logit contributions
umm-6.115
lor-5.788
osity-5.752
rol-5.719
arians-5.695
Token cars
Token ID5006
Feature activation-1.00e+02

Pos logit contributions
 and+4.518
,+4.425
.+4.314
 the+4.126
 a+4.052
Neg logit contributions
umm-4.868
lor-4.572
vation-4.561
arians-4.551
osity-4.533
Token and
Token ID290
Feature activation-1.00e+02

Pos logit contributions
 and+3.512
,+3.419
.+3.333
 the+3.172
 a+3.122
Neg logit contributions
umm-5.781
lor-5.560
vation-5.524
osity-5.515
rol-5.456
Token trucks
Token ID13960
Feature activation-1.00e+02

Pos logit contributions
 and+5.403
,+5.292
.+5.241
 a+4.896
 the+4.873
Neg logit contributions
umm-3.859
osity-3.611
arians-3.576
rol-3.570
ocol-3.561
Token and
Token ID290
Feature activation-1.00e+02

Pos logit contributions
 and+2.946
,+2.830
.+2.786
 the+2.750
 a+2.706
Neg logit contributions
umm-6.178
osity-5.966
lor-5.880
arians-5.873
rol-5.844
Token finally
Token ID3443
Feature activation-1.00e+02

Pos logit contributions
 and+4.095
,+3.971
.+3.948
 the+3.629
 a+3.617
Neg logit contributions
umm-5.100
osity-5.004
vation-4.927
lor-4.927
rol-4.893
Token found
Token ID1043
Feature activation-1.00e+02

Pos logit contributions
 and+3.598
.+3.372
,+3.370
 the+3.172
 a+3.132
Neg logit contributions
umm-5.582
lor-5.547
osity-5.467
vation-5.421
rol-5.377
Token the
Token ID262
Feature activation-1.00e+02

Pos logit contributions
 and+2.546
,+2.388
.+2.253
 the+1.931
 a+1.834
Neg logit contributions
umm-6.808
lor-6.667
rol-6.619
osity-6.527
ubb-6.497
Token car
Token ID1097
Feature activation-1.00e+02

Pos logit contributions
 and+5.022
,+4.936
.+4.805
 the+4.670
 a+4.549
Neg logit contributions
umm-4.466
lor-4.105
osity-4.063
 former-4.059
vation-4.038
Token parked
Token ID19584
Feature activation-1.00e+02

Pos logit contributions
 and+2.436
,+2.333
 the+2.161
.+2.148
 a+2.093
Neg logit contributions
umm-6.722
lor-6.622
rol-6.547
vation-6.519
osity-6.495
Token in
Token ID287
Feature activation-1.00e+02

Pos logit contributions
 and+2.942
,+2.868
.+2.685
 the+2.550
 a+2.482
Neg logit contributions
umm-6.228
lor-6.052
rol-6.006
osity-5.912
vation-5.904
Token front
Token ID2166
Feature activation-1.00e+02

Pos logit contributions
 and+1.941
,+1.896
.+1.718
 the+1.453
 a+1.406
Neg logit contributions
umm-7.204
lor-7.059
vation-6.917
 former-6.891
rol-6.884
Token of
Token ID286
Feature activation-1.00e+02

Pos logit contributions
 and+2.786
,+2.681
.+2.542
 the+2.471
 a+2.424
Neg logit contributions
umm-6.547
lor-6.291
rol-6.215
osity-6.205
vation-6.198
Token a
Token ID257
Feature activation-1.00e+02

Pos logit contributions
 and+2.066
,+1.978
.+1.788
 the+1.398
 a+1.355
Neg logit contributions
umm-7.197
ocol-7.046
lor-7.014
rol-6.953
osity-6.921
Token house
Token ID2156
Feature activation-1.00e+02

Pos logit contributions
 and+4.579
,+4.430
.+4.356
 the+4.209
 a+4.126
Neg logit contributions
umm-4.813
lor-4.575
vation-4.505
osity-4.448
arians-4.433
Token hammer
Token ID15554
Feature activation-1.00e+02

Pos logit contributions
 and+4.784
,+4.630
.+4.604
 the+4.494
 a+4.363
Neg logit contributions
umm-4.487
lor-4.300
vation-4.222
osity-4.196
ocol-4.161
Token,
Token ID11
Feature activation-1.00e+02

Pos logit contributions
 and+1.667
,+1.574
.+1.542
 the+1.450
 a+1.374
Neg logit contributions
umm-7.484
osity-7.252
lor-7.241
rol-7.173
plets-7.098
Token nails
Token ID23361
Feature activation-1.00e+02

Pos logit contributions
,+4.147
 and+4.142
.+4.099
 the+3.788
 a+3.629
Neg logit contributions
umm-4.988
osity-4.753
plets-4.732
ocol-4.723
lor-4.705
Token and
Token ID290
Feature activation-1.00e+02

Pos logit contributions
 and+1.798
,+1.667
.+1.656
 the+1.557
 a+1.516
Neg logit contributions
umm-7.283
osity-7.107
lor-7.080
rol-7.025
ocol-6.965
Token colorful
Token ID20239
Feature activation-1.00e+02

Pos logit contributions
 and+4.828
,+4.789
.+4.733
 the+4.414
 a+4.294
Neg logit contributions
umm-4.343
osity-4.080
plets-4.072
lor-4.025
vation-4.020
Token boxes
Token ID10559
Feature activation-1.00e+02

Pos logit contributions
 and+5.226
,+5.116
.+5.036
 the+4.948
 a+4.849
Neg logit contributions
umm-4.039
ocol-3.799
osity-3.694
lor-3.680
plets-3.677
Token.
Token ID13
Feature activation-1.00e+02

Pos logit contributions
 and+0.982
,+0.851
 the+0.724
.+0.719
 a+0.666
Neg logit contributions
umm-8.189
osity-7.933
lor-7.914
rol-7.884
vation-7.843
Token He
Token ID679
Feature activation-1.00e+02

Pos logit contributions
 and+4.126
,+4.045
.+3.915
 the+3.733
 a+3.674
Neg logit contributions
umm-5.373
lor-5.077
osity-4.995
rol-4.977
vation-4.952
Token placed
Token ID4624
Feature activation-1.00e+02

Pos logit contributions
 and+4.470
,+4.437
.+4.359
 the+4.168
 a+4.077
Neg logit contributions
umm-4.743
lor-4.561
arians-4.476
osity-4.466
vation-4.461
Token the
Token ID262
Feature activation-1.00e+02

Pos logit contributions
 and+2.435
,+2.359
.+2.271
 the+1.887
 a+1.861
Neg logit contributions
umm-6.867
ocol-6.629
rol-6.568
ubb-6.533
osity-6.520
Token boxes
Token ID10559
Feature activation-1.00e+02

Pos logit contributions
 and+5.393
,+5.304
.+5.196
 the+5.053
 a+4.963
Neg logit contributions
umm-4.044
lor-3.671
osity-3.657
 former-3.632
ocol-3.632
Token boxes
Token ID10559
Feature activation-1.00e+02

Pos logit contributions
 and+5.226
,+5.116
.+5.036
 the+4.948
 a+4.849
Neg logit contributions
umm-4.039
ocol-3.799
osity-3.694
lor-3.680
plets-3.677
Token.
Token ID13
Feature activation-1.00e+02

Pos logit contributions
 and+0.982
,+0.851
 the+0.724
.+0.719
 a+0.666
Neg logit contributions
umm-8.189
osity-7.933
lor-7.914
rol-7.884
vation-7.843
Token He
Token ID679
Feature activation-1.00e+02

Pos logit contributions
 and+4.126
,+4.045
.+3.915
 the+3.733
 a+3.674
Neg logit contributions
umm-5.373
lor-5.077
osity-4.995
rol-4.977
vation-4.952
Token placed
Token ID4624
Feature activation-1.00e+02

Pos logit contributions
 and+4.470
,+4.437
.+4.359
 the+4.168
 a+4.077
Neg logit contributions
umm-4.743
lor-4.561
arians-4.476
osity-4.466
vation-4.461
Token the
Token ID262
Feature activation-1.00e+02

Pos logit contributions
 and+2.435
,+2.359
.+2.271
 the+1.887
 a+1.861
Neg logit contributions
umm-6.867
ocol-6.629
rol-6.568
ubb-6.533
osity-6.520
Token boxes
Token ID10559
Feature activation-1.00e+02

Pos logit contributions
 and+5.393
,+5.304
.+5.196
 the+5.053
 a+4.963
Neg logit contributions
umm-4.044
lor-3.671
osity-3.657
 former-3.632
ocol-3.632
Token on
Token ID319
Feature activation-1.00e+02

Pos logit contributions
 and+2.188
,+2.094
.+2.035
 the+1.880
 a+1.838
Neg logit contributions
umm-7.017
lor-6.741
osity-6.734
vation-6.715
rol-6.691
Token the
Token ID262
Feature activation-1.00e+02

Pos logit contributions
 and+1.622
,+1.542
.+1.448
 the+1.169
 a+1.162
Neg logit contributions
umm-7.547
lor-7.259
arians-7.183
ocol-7.170
osity-7.155
Token roof
Token ID9753
Feature activation-1.00e+02

Pos logit contributions
 and+5.833
,+5.758
.+5.607
 the+5.506
 a+5.441
Neg logit contributions
umm-3.554
lor-3.193
arians-3.189
vation-3.151
 former-3.143
Token and
Token ID290
Feature activation-1.00e+02

Pos logit contributions
 and+1.798
,+1.703
.+1.622
 the+1.555
 a+1.535
Neg logit contributions
umm-7.400
lor-7.106
rol-7.084
osity-7.072
vation-7.047
Token made
Token ID925
Feature activation-1.00e+02

Pos logit contributions
 and+4.200
,+4.140
.+4.094
 the+3.746
 a+3.719
Neg logit contributions
umm-5.045
osity-4.764
vation-4.761
lor-4.756
arians-4.691
Token broke
Token ID6265
Feature activation-1.00e+02

Pos logit contributions
 and+4.164
,+4.034
.+3.937
 the+3.665
 a+3.644
Neg logit contributions
umm-5.143
lor-4.952
osity-4.901
vation-4.882
rol-4.865
Token.
Token ID13
Feature activation-1.00e+02

Pos logit contributions
 and+2.141
,+2.069
.+1.779
 the+1.698
 a+1.646
Neg logit contributions
umm-7.082
lor-6.818
rol-6.811
osity-6.746
plets-6.727
Token The
Token ID383
Feature activation-1.00e+02

Pos logit contributions
 and+3.489
,+3.375
.+3.256
 the+3.053
 a+2.997
Neg logit contributions
umm-5.944
lor-5.728
osity-5.617
vation-5.591
rol-5.554
Token woman
Token ID2415
Feature activation-1.00e+02

Pos logit contributions
 and+3.923
,+3.892
.+3.754
 the+3.591
 a+3.479
Neg logit contributions
umm-5.453
lor-5.142
vation-5.124
arians-5.092
osity-5.087
Token was
Token ID373
Feature activation-1.00e+02

Pos logit contributions
 and+2.888
,+2.824
.+2.738
 the+2.589
 a+2.540
Neg logit contributions
umm-6.238
lor-6.114
vation-6.046
rol-5.995
osity-5.994
Token very
Token ID845
Feature activation-1.00e+02

Pos logit contributions
 and+3.187
,+3.087
.+2.995
 the+2.764
 a+2.644
Neg logit contributions
umm-6.046
lor-5.890
rol-5.829
osity-5.810
vation-5.783
Token sad
Token ID6507
Feature activation-1.00e+02

Pos logit contributions
 and+3.814
,+3.629
.+3.556
 the+3.391
 a+3.324
Neg logit contributions
umm-5.533
ubb-5.304
osity-5.292
vation-5.272
ocol-5.267
Token.
Token ID13
Feature activation-1.00e+02

Pos logit contributions
 and+0.354
,+0.318
 the+0.280
 a+0.165
.+0.108
Neg logit contributions
umm-8.571
lor-8.264
osity-8.256
arians-8.165
 former-8.147
Token
Token ID198
Feature activation-1.00e+02

Pos logit contributions
 and+3.254
,+3.137
.+3.020
 the+2.856
 a+2.818
Neg logit contributions
umm-6.141
lor-5.907
osity-5.818
vation-5.750
rol-5.740
Token
Token ID198
Feature activation-1.00e+02

Pos logit contributions
 and+1.635
,+1.525
.+1.406
 the+1.253
 a+1.190
Neg logit contributions
umm-7.695
lor-7.582
vation-7.458
osity-7.421
rol-7.376
TokenShe
Token ID3347
Feature activation-1.00e+02

Pos logit contributions
 and+4.302
,+4.188
.+4.085
 the+3.930
 a+3.860
Neg logit contributions
umm-5.149
lor-4.960
vation-4.861
osity-4.852
arians-4.795
Token.
Token ID13
Feature activation-1.00e+02

Pos logit contributions
 and+3.361
,+3.243
 the+3.108
.+3.105
 a+3.007
Neg logit contributions
umm-5.876
lor-5.704
arians-5.595
osity-5.575
vation-5.557
Token She
Token ID1375
Feature activation-1.00e+02

Pos logit contributions
 and+3.578
,+3.465
.+3.355
 the+3.197
 a+3.137
Neg logit contributions
umm-5.860
lor-5.622
osity-5.553
vation-5.491
rol-5.465
Token liked
Token ID8288
Feature activation-1.00e+02

Pos logit contributions
 and+3.002
,+2.953
.+2.819
 the+2.707
 a+2.601
Neg logit contributions
umm-6.196
lor-6.071
vation-5.960
osity-5.935
arians-5.925
Token the
Token ID262
Feature activation-1.00e+02

Pos logit contributions
 and+2.316
,+2.261
.+2.153
 the+1.860
 a+1.844
Neg logit contributions
umm-6.960
osity-6.719
ocol-6.661
lor-6.636
ubb-6.589
Token toy
Token ID13373
Feature activation-1.00e+02

Pos logit contributions
 and+4.942
,+4.858
.+4.733
 the+4.599
 a+4.465
Neg logit contributions
umm-4.504
lor-4.166
 former-4.097
osity-4.094
vation-4.079
Token very
Token ID845
Feature activation-1.00e+02

Pos logit contributions
 and+3.465
,+3.363
.+3.245
 the+3.200
 a+3.119
Neg logit contributions
umm-5.719
lor-5.496
arians-5.466
osity-5.408
vation-5.385
Token much
Token ID881
Feature activation-1.00e+02

Pos logit contributions
 and+3.373
,+3.232
.+3.093
 the+3.026
 a+2.911
Neg logit contributions
umm-5.999
lor-5.782
vation-5.699
osity-5.644
rol-5.638
Token.
Token ID13
Feature activation-1.00e+02

Pos logit contributions
 and+0.401
,+0.311
 the+0.186
 a+0.149
.+0.142
Neg logit contributions
umm-8.705
lor-8.405
arians-8.350
vation-8.310
osity-8.284
Token It
Token ID632
Feature activation-1.00e+02

Pos logit contributions
 and+3.804
,+3.707
.+3.583
 the+3.426
 a+3.362
Neg logit contributions
umm-5.647
lor-5.373
osity-5.305
vation-5.243
ocol-5.232
Token was
Token ID373
Feature activation-1.00e+02

Pos logit contributions
 and+3.277
,+3.184
.+3.062
 the+2.913
 a+2.840
Neg logit contributions
umm-6.024
lor-5.848
vation-5.744
osity-5.741
rol-5.730
Token a
Token ID257
Feature activation-1.00e+02

Pos logit contributions
 and+3.053
,+2.934
.+2.846
 the+2.591
 a+2.477
Neg logit contributions
umm-6.152
osity-5.940
ocol-5.923
lor-5.913
 former-5.904
Token Ben
Token ID3932
Feature activation-1.00e+02

Pos logit contributions
 and+4.399
,+4.337
.+4.238
 the+4.005
 a+3.895
Neg logit contributions
umm-4.792
lor-4.564
osity-4.556
vation-4.554
arians-4.502
Token,
Token ID11
Feature activation-1.00e+02

Pos logit contributions
 and+3.455
,+3.382
.+3.241
 the+3.233
 a+3.132
Neg logit contributions
umm-5.595
lor-5.533
vation-5.460
rol-5.396
arians-5.372
Token but
Token ID475
Feature activation-1.00e+02

Pos logit contributions
 and+3.299
,+3.196
.+3.104
 the+2.939
 a+2.878
Neg logit contributions
umm-5.826
osity-5.665
lor-5.655
vation-5.584
rol-5.573
Token you
Token ID345
Feature activation-1.00e+02

Pos logit contributions
 and+2.923
,+2.735
.+2.732
 a+2.455
 the+2.453
Neg logit contributions
umm-6.308
lor-6.125
osity-6.106
ubb-6.011
vation-6.002
Token are
Token ID389
Feature activation-1.00e+02

Pos logit contributions
 and+4.192
,+4.085
.+4.017
 the+3.860
 a+3.791
Neg logit contributions
umm-5.047
lor-4.941
vation-4.857
osity-4.823
arians-4.814
Token very
Token ID845
Feature activation-1.00e+02

Pos logit contributions
 and+3.728
,+3.634
.+3.504
 the+3.284
 a+3.182
Neg logit contributions
umm-5.524
lor-5.331
osity-5.304
rol-5.283
vation-5.283
Token funny
Token ID8258
Feature activation-1.00e+02

Pos logit contributions
 and+4.154
,+3.996
.+3.879
 the+3.736
 a+3.653
Neg logit contributions
umm-5.265
lor-5.008
vation-4.977
ubb-4.977
osity-4.954
Token.
Token ID13
Feature activation-1.00e+02

Pos logit contributions
 and+1.797
,+1.718
 the+1.613
.+1.556
 a+1.489
Neg logit contributions
umm-7.373
lor-7.168
osity-7.042
rol-7.026
arians-7.025
Token Can
Token ID1680
Feature activation-1.00e+02

Pos logit contributions
 and+5.128
,+5.013
.+4.913
 the+4.761
 a+4.685
Neg logit contributions
umm-4.285
lor-4.076
osity-3.983
vation-3.963
arians-3.914
Token you
Token ID345
Feature activation-1.00e+02

Pos logit contributions
 and+3.721
,+3.569
.+3.499
 the+3.315
 a+3.268
Neg logit contributions
umm-5.654
lor-5.476
osity-5.382
vation-5.374
arians-5.340
Token say
Token ID910
Feature activation-1.00e+02

Pos logit contributions
 and+4.085
,+3.941
.+3.871
 the+3.773
 a+3.671
Neg logit contributions
umm-5.152
lor-4.997
vation-4.947
osity-4.903
arians-4.882
Token tree
Token ID5509
Feature activation-1.00e+02

Pos logit contributions
 and+3.419
,+3.320
.+3.213
 the+3.097
 a+3.047
Neg logit contributions
umm-5.959
lor-5.690
vation-5.598
osity-5.566
plets-5.548
Token with
Token ID351
Feature activation-1.00e+02

Pos logit contributions
 and+2.149
,+2.070
.+1.936
 the+1.865
 a+1.801
Neg logit contributions
umm-7.101
lor-6.886
osity-6.801
vation-6.763
rol-6.742
Token lots
Token ID6041
Feature activation-1.00e+02

Pos logit contributions
 and+3.135
,+3.058
.+2.930
 the+2.674
 a+2.540
Neg logit contributions
umm-6.285
lor-5.981
rol-5.922
osity-5.898
ocol-5.883
Token of
Token ID286
Feature activation-1.00e+02

Pos logit contributions
 and+2.598
,+2.514
.+2.397
 the+2.297
 a+2.241
Neg logit contributions
umm-6.786
lor-6.484
osity-6.425
ocol-6.394
vation-6.382
Token cher
Token ID21382
Feature activation-1.00e+02

Pos logit contributions
 and+5.383
,+5.292
.+5.174
 the+5.002
 a+4.928
Neg logit contributions
umm-4.004
lor-3.647
ocol-3.622
rol-3.610
osity-3.565
Tokenries
Token ID1678
Feature activation-1.00e+02

Pos logit contributions
 and+7.186
,+7.062
.+6.969
 the+6.831
 a+6.802
Neg logit contributions
umm-2.266
lor-2.033
ocol-1.897
osity-1.887
rol-1.886
Token on
Token ID319
Feature activation-1.00e+02

Pos logit contributions
 and+1.240
,+1.163
 the+0.984
.+0.966
 a+0.927
Neg logit contributions
umm-7.988
lor-7.687
plets-7.644
osity-7.636
rol-7.634
Token it
Token ID340
Feature activation-1.00e+02

Pos logit contributions
 and+3.213
,+3.114
.+2.951
 the+2.776
 a+2.741
Neg logit contributions
umm-6.095
lor-5.891
vation-5.714
arians-5.698
osity-5.685
Token.
Token ID13
Feature activation-1.00e+02

Pos logit contributions
 and+0.434
,+0.332
 the+0.162
 a+0.093
.+0.032
Neg logit contributions
umm-8.677
lor-8.518
osity-8.386
vation-8.359
rol-8.359
Token She
Token ID1375
Feature activation-1.00e+02

Pos logit contributions
 and+3.224
,+3.115
.+3.019
 the+2.854
 a+2.803
Neg logit contributions
umm-6.178
lor-5.960
osity-5.860
vation-5.821
rol-5.781
Token wanted
Token ID2227
Feature activation-1.00e+02

Pos logit contributions
 and+3.261
,+3.176
.+3.104
 the+2.916
 a+2.845
Neg logit contributions
umm-6.035
lor-5.843
osity-5.757
arians-5.721
vation-5.715
Token share
Token ID2648
Feature activation-1.00e+02

Pos logit contributions
 and+3.416
,+3.303
.+3.203
 the+3.052
 a+2.984
Neg logit contributions
umm-5.983
lor-5.778
osity-5.688
vation-5.686
arians-5.629
Token and
Token ID290
Feature activation-1.00e+02

Pos logit contributions
 and+2.259
,+2.240
.+2.067
 the+1.892
 a+1.874
Neg logit contributions
umm-6.582
lor-6.570
arians-6.454
osity-6.439
vation-6.427
Token have
Token ID423
Feature activation-1.00e+02

Pos logit contributions
 and+4.315
,+4.200
.+4.098
 the+3.901
 a+3.872
Neg logit contributions
umm-4.931
vation-4.712
osity-4.672
lor-4.668
arians-4.630
Token fun
Token ID1257
Feature activation-1.00e+02

Pos logit contributions
 and+3.232
,+3.122
.+2.962
 the+2.769
 a+2.626
Neg logit contributions
umm-6.006
lor-5.816
 former-5.706
arians-5.686
vation-5.669
Token.
Token ID13
Feature activation-1.00e+02

Pos logit contributions
 and+0.528
,+0.409
 the+0.277
 a+0.199
.+0.180
Neg logit contributions
umm-8.620
lor-8.392
arians-8.371
vation-8.311
 former-8.267
Token<|endoftext|>
Token ID50256
Feature activation-1.00e+02

Pos logit contributions
 and+3.380
,+3.272
.+3.161
 the+3.014
 a+2.933
Neg logit contributions
umm-6.067
lor-5.850
osity-5.768
vation-5.737
arians-5.700
TokenAnna
Token ID31160
Feature activation-1.00e+02

Pos logit contributions
 and+4.344
,+4.229
.+4.091
 the+3.973
 a+3.903
Neg logit contributions
umm-5.001
lor-4.848
vation-4.710
osity-4.655
rol-4.639
Token and
Token ID290
Feature activation-1.00e+02

Pos logit contributions
 and+2.952
,+2.863
.+2.780
 the+2.647
 a+2.582
Neg logit contributions
umm-6.325
lor-6.136
vation-6.033
osity-6.018
arians-5.972
Token Ben
Token ID3932
Feature activation-1.00e+02

Pos logit contributions
 and+3.012
,+2.879
.+2.824
 the+2.544
 a+2.482
Neg logit contributions
umm-6.171
lor-5.996
osity-5.992
vation-5.912
ocol-5.855
Token are
Token ID389
Feature activation-1.00e+02

Pos logit contributions
 and+3.365
,+3.259
.+3.171
 the+3.022
 a+2.951
Neg logit contributions
umm-5.921
lor-5.714
vation-5.633
osity-5.633
arians-5.565
Token twins
Token ID20345
Feature activation-1.00e+02

Pos logit contributions
 and+3.834
,+3.718
.+3.633
 the+3.482
 a+3.362
Neg logit contributions
umm-5.455
lor-5.219
osity-5.175
rol-5.163
ocol-5.159
Token the
Token ID262
Feature activation-1.00e+02

Pos logit contributions
 and+1.691
,+1.592
.+1.514
 the+1.281
 a+1.271
Neg logit contributions
umm-7.571
osity-7.331
lor-7.326
ocol-7.272
vation-7.263
Token window
Token ID4324
Feature activation-1.00e+02

Pos logit contributions
 and+4.619
,+4.537
.+4.448
 the+4.279
 a+4.220
Neg logit contributions
umm-4.825
lor-4.589
osity-4.514
vation-4.488
rol-4.421
Token and
Token ID290
Feature activation-1.00e+02

Pos logit contributions
 and+0.688
,+0.616
.+0.587
 the+0.488
 a+0.423
Neg logit contributions
umm-8.472
osity-8.178
lor-8.159
vation-8.094
rol-8.057
Token saw
Token ID2497
Feature activation-1.00e+02

Pos logit contributions
 and+2.900
,+2.762
.+2.757
 a+2.433
 the+2.413
Neg logit contributions
umm-6.245
osity-6.132
vation-6.061
lor-6.041
rol-6.000
Token cherry
Token ID23612
Feature activation-1.00e+02

Pos logit contributions
 and+1.551
,+1.450
.+1.364
 the+0.984
 a+0.860
Neg logit contributions
umm-7.674
lor-7.519
osity-7.449
rol-7.445
ubb-7.326
Token trees
Token ID7150
Feature activation-1.00e+02

Pos logit contributions
 and+4.313
,+4.202
.+4.098
 the+3.988
 a+3.981
Neg logit contributions
umm-5.041
lor-4.734
vation-4.632
plets-4.631
arians-4.621
Token growing
Token ID3957
Feature activation-1.00e+02

Pos logit contributions
 and+1.131
,+1.023
.+0.927
 the+0.858
 a+0.840
Neg logit contributions
umm-8.050
lor-7.817
osity-7.767
rol-7.722
vation-7.721
Token all
Token ID477
Feature activation-1.00e+02

Pos logit contributions
 and+2.385
,+2.289
.+2.126
 the+2.043
 a+1.982
Neg logit contributions
umm-6.982
lor-6.677
rol-6.648
vation-6.552
osity-6.539
Token around
Token ID1088
Feature activation-1.00e+02

Pos logit contributions
 and+2.935
,+2.813
.+2.681
 the+2.520
 a+2.519
Neg logit contributions
umm-6.471
lor-6.186
osity-6.149
vation-6.144
rol-6.127
Token.
Token ID13
Feature activation-1.00e+02

Pos logit contributions
 and+1.585
,+1.457
.+1.243
 a+1.137
 the+1.071
Neg logit contributions
umm-7.648
ocol-7.365
lor-7.333
vation-7.242
ubb-7.233
Token She
Token ID1375
Feature activation-1.00e+02

Pos logit contributions
 and+3.459
,+3.345
.+3.256
 the+3.099
 a+3.040
Neg logit contributions
umm-5.926
lor-5.714
osity-5.626
vation-5.543
rol-5.522
Token the
Token ID262
Feature activation-1.00e+02

Pos logit contributions
 and+1.709
,+1.640
.+1.536
 a+1.228
 the+1.224
Neg logit contributions
umm-7.398
lor-7.130
 former-7.088
ocol-7.080
arians-7.073
Token tools
Token ID4899
Feature activation-1.00e+02

Pos logit contributions
 and+5.273
,+5.203
.+5.051
 the+4.927
 a+4.862
Neg logit contributions
umm-4.098
vation-3.810
lor-3.788
arians-3.747
osity-3.697
Token,
Token ID11
Feature activation-1.00e+02

Pos logit contributions
 and+2.447
,+2.297
.+2.228
 a+2.195
 the+2.188
Neg logit contributions
umm-6.578
lor-6.444
vation-6.417
osity-6.399
rol-6.391
Token on
Token ID319
Feature activation-1.00e+02

Pos logit contributions
,+1.830
 and+1.825
.+1.813
 the+1.526
 a+1.496
Neg logit contributions
umm-7.223
osity-7.035
lor-7.009
vation-6.980
plets-6.947
Token the
Token ID262
Feature activation-1.00e+02

Pos logit contributions
 and+2.119
,+2.047
.+1.942
 a+1.652
 the+1.636
Neg logit contributions
umm-6.986
lor-6.700
ocol-6.689
arians-6.677
 former-6.675
Token boxes
Token ID10559
Feature activation-1.00e+02

Pos logit contributions
 and+5.256
,+5.179
.+5.026
 the+4.910
 a+4.841
Neg logit contributions
umm-4.123
vation-3.821
lor-3.800
arians-3.778
osity-3.720
Token,
Token ID11
Feature activation-1.00e+02

Pos logit contributions
 and+2.243
,+2.090
.+2.005
 the+1.988
 a+1.980
Neg logit contributions
umm-6.851
lor-6.671
vation-6.627
osity-6.627
rol-6.615
Token on
Token ID319
Feature activation-1.00e+02

Pos logit contributions
,+1.737
 and+1.735
.+1.733
 the+1.443
 a+1.412
Neg logit contributions
umm-7.298
osity-7.120
lor-7.100
vation-7.052
arians-7.029
Token the
Token ID262
Feature activation-1.00e+02

Pos logit contributions
 and+2.469
,+2.392
.+2.287
 a+1.998
 the+1.988
Neg logit contributions
umm-6.622
lor-6.341
 former-6.337
arians-6.329
ocol-6.326
Token cans
Token ID23916
Feature activation-1.00e+02

Pos logit contributions
 and+5.240
,+5.162
.+5.006
 the+4.896
 a+4.820
Neg logit contributions
umm-4.143
vation-3.836
lor-3.816
arians-3.809
osity-3.736
Token,
Token ID11
Feature activation-1.00e+02

Pos logit contributions
 and+1.160
,+1.008
.+0.934
 the+0.932
 a+0.888
Neg logit contributions
umm-7.918
lor-7.723
rol-7.686
vation-7.673
osity-7.645
Token,
Token ID11
Feature activation-1.00e+02

Pos logit contributions
 and+0.432
,+0.171
.+0.169
 the+0.068
 a+0.009
Neg logit contributions
umm-8.844
lor-8.663
osity-8.545
vation-8.541
ocol-8.472
Token Ben
Token ID3932
Feature activation-1.00e+02

Pos logit contributions
 and+3.169
,+2.994
.+2.955
 the+2.718
 a+2.643
Neg logit contributions
umm-6.178
lor-6.005
osity-5.931
vation-5.874
ocol-5.871
Token,
Token ID11
Feature activation-1.00e+02

Pos logit contributions
 and+0.412
.+0.168
,+0.167
 the+0.136
 a+0.060
Neg logit contributions
umm-8.824
lor-8.581
osity-8.461
vation-8.451
reetings-8.412
Token it
Token ID340
Feature activation-1.00e+02

Pos logit contributions
 and+3.029
,+2.885
.+2.847
 the+2.584
 a+2.496
Neg logit contributions
umm-6.293
lor-6.128
osity-6.075
vation-5.985
ocol-5.977
Token is
Token ID318
Feature activation-1.00e+02

Pos logit contributions
 and+3.401
,+3.265
.+3.190
 the+3.054
 a+2.960
Neg logit contributions
umm-6.004
lor-5.815
vation-5.700
osity-5.661
arians-5.615
Token very
Token ID845
Feature activation-1.00e+02

Pos logit contributions
 and+2.485
,+2.356
.+2.273
 the+2.076
 a+1.956
Neg logit contributions
umm-6.860
lor-6.587
osity-6.477
vation-6.450
rol-6.433
Token pretty
Token ID2495
Feature activation-1.00e+02

Pos logit contributions
 and+3.209
,+3.070
.+2.971
 the+2.819
 a+2.738
Neg logit contributions
umm-6.262
lor-5.993
osity-5.915
vation-5.901
ubb-5.847
Token!"
Token ID2474
Feature activation-1.00e+02

Pos logit contributions
 and+3.173
,+3.054
 the+2.949
.+2.943
 a+2.819
Neg logit contributions
umm-6.141
lor-5.836
osity-5.692
vation-5.682
rol-5.675
Token Lily
Token ID20037
Feature activation-1.00e+02

Pos logit contributions
 and+2.461
,+2.338
.+2.245
 the+2.066
 a+1.984
Neg logit contributions
umm-6.917
lor-6.716
vation-6.615
osity-6.600
ubb-6.539
Token said
Token ID531
Feature activation-1.00e+02

Pos logit contributions
 and+2.826
,+2.732
.+2.667
 the+2.559
 a+2.466
Neg logit contributions
umm-6.371
lor-6.163
vation-6.108
osity-6.081
arians-6.059
Token,
Token ID11
Feature activation-1.00e+02

Pos logit contributions
 and+0.283
,+0.037
 the+0.005
.-0.017
 a-0.055
Neg logit contributions
umm-8.842
osity-8.645
lor-8.616
vation-8.529
ubb-8.524
Token And
Token ID843
Feature activation-1.00e+02

Pos logit contributions
 and+2.933
,+2.824
.+2.713
 the+2.566
 a+2.490
Neg logit contributions
umm-6.489
lor-6.296
osity-6.194
vation-6.172
arians-6.127
Token that
Token ID326
Feature activation-1.00e+02

Pos logit contributions
 and+2.927
,+2.802
.+2.765
 the+2.493
 a+2.451
Neg logit contributions
umm-6.354
lor-6.171
osity-6.139
vation-6.051
arians-6.051
Token was
Token ID373
Feature activation-1.00e+02

Pos logit contributions
 and+3.489
,+3.342
.+3.225
 the+3.052
 a+3.001
Neg logit contributions
umm-5.780
lor-5.679
vation-5.528
osity-5.500
rol-5.483
Token enough
Token ID1576
Feature activation-1.00e+02

Pos logit contributions
 and+2.875
,+2.749
.+2.604
 the+2.346
 a+2.271
Neg logit contributions
umm-6.401
lor-6.257
osity-6.183
vation-6.157
rol-6.143
Token.
Token ID13
Feature activation-1.00e+02

Pos logit contributions
 and+2.417
,+2.303
 the+2.160
.+2.145
 a+2.079
Neg logit contributions
umm-6.811
ocol-6.500
lor-6.477
arians-6.469
osity-6.459
Token<|endoftext|>
Token ID50256
Feature activation-1.00e+02

Pos logit contributions
 and+2.910
,+2.801
.+2.694
 the+2.541
 a+2.465
Neg logit contributions
umm-6.499
lor-6.312
osity-6.206
vation-6.189
arians-6.138
TokenTom
Token ID13787
Feature activation-1.00e+02

Pos logit contributions
 and+4.278
,+4.165
.+4.016
 the+3.907
 a+3.825
Neg logit contributions
umm-5.039
lor-4.935
vation-4.768
rol-4.717
osity-4.684
Token and
Token ID290
Feature activation-1.00e+02

Pos logit contributions
 and+3.149
,+3.077
.+3.005
 the+2.877
 a+2.835
Neg logit contributions
umm-5.925
lor-5.836
osity-5.718
vation-5.671
rol-5.667
Token Anna
Token ID11735
Feature activation-1.00e+02

Pos logit contributions
 and+2.698
,+2.584
.+2.507
 the+2.244
 a+2.167
Neg logit contributions
umm-6.575
lor-6.416
osity-6.332
vation-6.287
rol-6.230
Token are
Token ID389
Feature activation-1.00e+02

Pos logit contributions
 and+3.101
,+3.006
.+2.921
 the+2.797
 a+2.729
Neg logit contributions
umm-6.085
lor-5.931
vation-5.834
osity-5.816
rol-5.783
Token friends
Token ID2460
Feature activation-1.00e+02

Pos logit contributions
 and+3.739
,+3.617
.+3.549
 the+3.378
 a+3.273
Neg logit contributions
umm-5.525
lor-5.303
osity-5.264
rol-5.253
ocol-5.227
Token
Token ID198
Feature activation-1.00e+02

Pos logit contributions
 and+1.569
,+1.458
.+1.331
 the+1.193
 a+1.146
Neg logit contributions
umm-7.715
lor-7.602
vation-7.483
osity-7.415
arians-7.387
TokenAnna
Token ID31160
Feature activation-1.00e+02

Pos logit contributions
 and+5.745
,+5.626
.+5.524
 the+5.376
 a+5.311
Neg logit contributions
umm-3.710
lor-3.524
vation-3.431
osity-3.395
arians-3.360
Token and
Token ID290
Feature activation-1.00e+02

Pos logit contributions
 and+2.323
,+2.313
.+2.231
 the+2.143
 a+2.043
Neg logit contributions
umm-6.826
lor-6.669
vation-6.560
arians-6.541
osity-6.527
Token Ben
Token ID3932
Feature activation-1.00e+02

Pos logit contributions
 and+3.387
,+3.261
.+3.164
 the+2.916
 a+2.890
Neg logit contributions
umm-5.921
osity-5.622
ocol-5.599
vation-5.551
ubb-5.529
Token are
Token ID389
Feature activation-1.00e+02

Pos logit contributions
 and+4.207
,+4.115
.+4.007
 the+3.874
 a+3.787
Neg logit contributions
umm-5.063
lor-4.824
vation-4.818
osity-4.772
arians-4.740
Token very
Token ID845
Feature activation-1.00e+02

Pos logit contributions
 and+3.729
,+3.629
.+3.522
 the+3.348
 a+3.236
Neg logit contributions
umm-5.487
osity-5.261
arians-5.207
lor-5.200
rol-5.191
Token happy
Token ID3772
Feature activation-1.00e+02

Pos logit contributions
 and+3.642
,+3.498
.+3.414
 the+3.272
 a+3.210
Neg logit contributions
umm-5.714
osity-5.436
ubb-5.399
vation-5.398
lor-5.391
Token.
Token ID13
Feature activation-1.00e+02

Pos logit contributions
 and+0.326
,+0.277
 the+0.131
 a+0.048
.+0.021
Neg logit contributions
umm-8.776
lor-8.436
osity-8.416
vation-8.370
arians-8.360
Token They
Token ID1119
Feature activation-1.00e+02

Pos logit contributions
 and+3.471
,+3.372
.+3.261
 the+3.110
 a+3.043
Neg logit contributions
umm-5.970
lor-5.706
osity-5.653
vation-5.611
arians-5.580
Token run
Token ID1057
Feature activation-1.00e+02

Pos logit contributions
 and+3.963
,+3.860
.+3.761
 the+3.605
 a+3.534
Neg logit contributions
umm-5.384
lor-5.151
osity-5.098
vation-5.095
arians-5.043
Token to
Token ID284
Feature activation-1.00e+02

Pos logit contributions
 and+1.814
,+1.728
.+1.646
 the+1.477
 a+1.422
Neg logit contributions
umm-7.410
lor-7.212
osity-7.147
vation-7.142
rol-7.124
Token the
Token ID262
Feature activation-1.00e+02

Pos logit contributions
 and+0.960
,+0.925
.+0.762
 the+0.582
 a+0.547
Neg logit contributions
umm-7.933
lor-7.734
ocol-7.685
vation-7.676
rol-7.669
Token house
Token ID2156
Feature activation-1.00e+02

Pos logit contributions
 and+5.103
,+5.013
.+4.886
 the+4.777
 a+4.698
Neg logit contributions
umm-4.290
lor-4.034
vation-3.915
arians-3.913
 former-3.893
Token to
Token ID284
Feature activation-1.00e+02

Pos logit contributions
 and+0.594
,+0.559
 the+0.418
.+0.369
 a+0.336
Neg logit contributions
umm-8.450
lor-8.239
osity-8.163
vation-8.161
arians-8.125
Token make
Token ID787
Feature activation-1.00e+02

Pos logit contributions
 and+4.087
,+3.971
.+3.836
 the+3.555
 a+3.504
Neg logit contributions
umm-5.195
ocol-4.994
vation-4.984
 former-4.953
lor-4.948
Token it
Token ID340
Feature activation-1.00e+02

Pos logit contributions
 and+3.560
,+3.499
.+3.264
 the+3.029
 a+2.922
Neg logit contributions
umm-5.646
lor-5.459
ocol-5.403
rol-5.398
 former-5.360
Token cozy
Token ID37438
Feature activation-1.00e+02

Pos logit contributions
 and+3.843
,+3.752
.+3.595
 the+3.496
 a+3.351
Neg logit contributions
umm-5.382
lor-5.130
 former-5.101
vation-5.081
osity-5.068
Token.
Token ID13
Feature activation-1.00e+02

Pos logit contributions
 and+1.107
,+1.052
 the+0.924
.+0.868
 a+0.790
Neg logit contributions
umm-8.112
lor-7.804
 former-7.678
vation-7.608
arians-7.586
Token
Token ID198
Feature activation-1.00e+02

Pos logit contributions
 and+2.124
,+2.023
.+1.912
 the+1.750
 a+1.687
Neg logit contributions
umm-7.259
lor-7.091
vation-6.973
osity-6.962
arians-6.924
Token
Token ID198
Feature activation-1.00e+02

Pos logit contributions
 and+1.562
,+1.466
.+1.348
 the+1.201
 a+1.143
Neg logit contributions
umm-7.664
lor-7.584
vation-7.460
osity-7.401
rol-7.381
Token"
Token ID1
Feature activation-1.00e+02

Pos logit contributions
 and+4.690
,+4.578
.+4.479
 the+4.328
 a+4.256
Neg logit contributions
umm-4.746
lor-4.545
osity-4.444
vation-4.442
arians-4.382
TokenLook
Token ID8567
Feature activation-1.00e+02

Pos logit contributions
 and+5.868
,+5.731
.+5.652
 the+5.520
 a+5.457
Neg logit contributions
umm-3.541
lor-3.345
osity-3.267
vation-3.226
arians-3.183
Token green
Token ID4077
Feature activation-1.00e+02

Pos logit contributions
 and+3.989
,+3.883
.+3.782
 the+3.525
 a+3.471
Neg logit contributions
umm-5.279
osity-5.057
ocol-5.020
ubb-4.995
plets-4.965
Token.
Token ID13
Feature activation-1.00e+02

Pos logit contributions
 and+0.556
,+0.479
 the+0.388
.+0.317
 a+0.267
Neg logit contributions
umm-8.561
lor-8.240
osity-8.140
arians-8.127
plets-8.117
Token It
Token ID632
Feature activation-1.00e+02

Pos logit contributions
 and+3.584
,+3.475
.+3.379
 the+3.200
 a+3.137
Neg logit contributions
umm-5.827
lor-5.596
osity-5.496
vation-5.486
arians-5.433
Token has
Token ID468
Feature activation-1.00e+02

Pos logit contributions
 and+4.497
,+4.385
.+4.286
 the+4.135
 a+4.065
Neg logit contributions
umm-4.904
lor-4.699
osity-4.603
vation-4.595
arians-4.538
Token sharp
Token ID7786
Feature activation-1.00e+02

Pos logit contributions
 and+3.909
,+3.824
.+3.713
 the+3.448
 a+3.288
Neg logit contributions
umm-5.462
lor-5.251
osity-5.167
rol-5.124
ubb-5.105
Token teeth
Token ID9941
Feature activation-1.00e+02

Pos logit contributions
 and+5.433
,+5.316
.+5.273
 the+5.173
 a+5.065
Neg logit contributions
umm-3.771
ocol-3.520
plets-3.493
lor-3.470
osity-3.452
Token and
Token ID290
Feature activation-1.00e+02

Pos logit contributions
 and+0.315
,+0.269
 the+0.193
.+0.130
 a+0.052
Neg logit contributions
umm-8.639
lor-8.567
plets-8.437
osity-8.424
arians-8.363
Token a
Token ID257
Feature activation-1.00e+02

Pos logit contributions
 and+5.254
,+5.154
.+5.024
 the+4.831
 a+4.691
Neg logit contributions
umm-4.004
osity-3.731
lor-3.717
ocol-3.712
plets-3.692
Token for
Token ID329
Feature activation-1.00e+02

Pos logit contributions
 and+4.361
,+4.219
.+4.136
 the+4.052
 a+3.931
Neg logit contributions
umm-4.981
lor-4.742
osity-4.676
plets-4.648
ocol-4.629
Tokenked
Token ID9091
Feature activation-1.00e+02

Pos logit contributions
 and+3.725
,+3.532
.+3.429
 the+3.090
 a+3.061
Neg logit contributions
umm-5.573
ocol-5.453
ubb-5.418
arians-5.352
osity-5.346
Token tongue
Token ID11880
Feature activation-1.00e+02

Pos logit contributions
 and+4.979
,+4.869
.+4.674
 the+4.592
 a+4.540
Neg logit contributions
umm-4.291
lor-3.990
ocol-3.969
ubb-3.964
osity-3.957
Token"
Token ID1
Feature activation-1.00e+02

Pos logit contributions
 and+3.876
,+3.759
.+3.673
 the+3.524
 a+3.464
Neg logit contributions
umm-5.496
lor-5.366
osity-5.245
vation-5.231
arians-5.176
TokenLike
Token ID7594
Feature activation-1.00e+02

Pos logit contributions
 and+6.400
,+6.274
.+6.184
 the+6.037
 a+5.972
Neg logit contributions
umm-3.031
lor-2.825
osity-2.729
vation-2.717
rol-2.659
Token scissors
Token ID41786
Feature activation-1.00e+02

Pos logit contributions
 and+4.111
,+3.928
.+3.861
 the+3.665
 a+3.572
Neg logit contributions
ocol-5.015
umm-5.012
arians-4.939
lor-4.900
vation-4.853
Token or
Token ID393
Feature activation-1.00e+02

Pos logit contributions
 and+2.748
,+2.584
.+2.552
 the+2.490
 a+2.444
Neg logit contributions
umm-6.360
lor-6.263
ocol-6.224
arians-6.223
osity-6.173
Token knives
Token ID23110
Feature activation-1.00e+02

Pos logit contributions
 and+5.339
,+5.280
.+5.177
 the+4.943
 a+4.791
Neg logit contributions
umm-3.897
ocol-3.744
arians-3.715
lor-3.708
osity-3.707
Token or
Token ID393
Feature activation-1.00e+02

Pos logit contributions
 and+2.742
,+2.616
.+2.550
 the+2.504
 a+2.446
Neg logit contributions
umm-6.344
lor-6.217
arians-6.161
vation-6.152
ocol-6.080
Token guns
Token ID6541
Feature activation-1.00e+02

Pos logit contributions
 and+5.399
,+5.330
.+5.219
 the+5.003
 a+4.908
Neg logit contributions
umm-3.860
ocol-3.697
arians-3.632
osity-3.604
lor-3.597
Token,"
Token ID553
Feature activation-1.00e+02

Pos logit contributions
 and+2.733
,+2.622
.+2.524
 the+2.518
 a+2.433
Neg logit contributions
umm-6.328
lor-6.223
vation-6.174
arians-6.160
osity-6.074
Token Dad
Token ID17415
Feature activation-1.00e+02

Pos logit contributions
 and+3.592
,+3.429
.+3.361
 the+3.150
 a+3.110
Neg logit contributions
umm-5.733
osity-5.433
ocol-5.430
lor-5.413
vation-5.401
Token says
Token ID1139
Feature activation-1.00e+02

Pos logit contributions
 and+4.366
,+4.266
.+4.165
 the+4.046
 a+3.974
Neg logit contributions
umm-4.958
lor-4.765
vation-4.708
osity-4.680
arians-4.642
Token.
Token ID13
Feature activation-1.00e+02

Pos logit contributions
 and+0.495
,+0.379
 the+0.231
.+0.143
 a+0.133
Neg logit contributions
umm-8.634
osity-8.458
lor-8.450
vation-8.429
arians-8.361
Token in
Token ID287
Feature activation-1.00e+02

Pos logit contributions
 and+3.172
,+3.045
.+2.910
 the+2.906
 a+2.798
Neg logit contributions
umm-5.995
lor-5.813
arians-5.766
vation-5.752
osity-5.722
Token our
Token ID674
Feature activation-1.00e+02

Pos logit contributions
 and+3.700
,+3.566
.+3.455
 a+3.180
 the+3.169
Neg logit contributions
umm-5.525
lor-5.420
ocol-5.366
vation-5.320
arians-5.303
Token pockets
Token ID16511
Feature activation-1.00e+02

Pos logit contributions
 and+5.556
,+5.423
.+5.305
 a+5.155
 the+5.151
Neg logit contributions
umm-3.852
lor-3.668
vation-3.560
ocol-3.530
arians-3.528
Token or
Token ID393
Feature activation-1.00e+02

Pos logit contributions
 and+2.330
,+2.190
 the+2.047
.+2.039
 a+2.023
Neg logit contributions
umm-6.789
lor-6.685
vation-6.572
osity-6.508
arians-6.507
Token our
Token ID674
Feature activation-1.00e+02

Pos logit contributions
 and+5.083
,+5.003
.+4.855
 the+4.612
 a+4.540
Neg logit contributions
umm-4.064
osity-3.939
arians-3.939
vation-3.900
 former-3.893
Token bags
Token ID11668
Feature activation-1.00e+02

Pos logit contributions
 and+5.205
,+5.108
.+4.956
 a+4.848
 the+4.840
Neg logit contributions
umm-4.184
lor-3.931
arians-3.894
ocol-3.876
vation-3.874
Token."
Token ID526
Feature activation-1.00e+02

Pos logit contributions
 and+2.403
,+2.277
.+2.116
 the+2.087
 a+2.065
Neg logit contributions
umm-6.701
lor-6.563
vation-6.545
osity-6.449
arians-6.438
Token
Token ID198
Feature activation-1.00e+02

Pos logit contributions
 and+1.594
,+1.515
.+1.379
 the+1.263
 a+1.175
Neg logit contributions
umm-7.664
lor-7.524
vation-7.363
osity-7.361
ocol-7.291
Token
Token ID198
Feature activation-1.00e+02

Pos logit contributions
 and+1.644
,+1.571
.+1.399
 the+1.298
 a+1.234
Neg logit contributions
umm-7.576
lor-7.518
vation-7.336
osity-7.300
reetings-7.244
Token"
Token ID1
Feature activation-1.00e+02

Pos logit contributions
 and+5.188
,+5.097
.+4.980
 the+4.852
 a+4.783
Neg logit contributions
umm-4.214
lor-4.088
vation-3.930
osity-3.916
arians-3.857
TokenLike
Token ID7594
Feature activation-1.00e+02

Pos logit contributions
 and+5.914
,+5.772
.+5.691
 the+5.552
 a+5.491
Neg logit contributions
umm-3.513
lor-3.313
osity-3.220
vation-3.197
rol-3.154
Token family
Token ID1641
Feature activation-1.00e+02

Pos logit contributions
 and+4.349
,+4.225
.+4.180
 the+4.039
 a+3.972
Neg logit contributions
umm-4.999
lor-4.673
osity-4.632
arians-4.588
ocol-4.573
Token.
Token ID13
Feature activation-1.00e+02

Pos logit contributions
 and+2.802
,+2.691
.+2.567
 the+2.451
 a+2.400
Neg logit contributions
umm-6.528
lor-6.312
osity-6.228
vation-6.195
rol-6.154
Token The
Token ID383
Feature activation-1.00e+02

Pos logit contributions
 and+3.568
,+3.453
.+3.355
 the+3.204
 a+3.132
Neg logit contributions
umm-5.864
lor-5.656
osity-5.561
vation-5.551
arians-5.494
Token family
Token ID1641
Feature activation-1.00e+02

Pos logit contributions
 and+3.515
,+3.427
.+3.314
 the+3.149
 a+3.057
Neg logit contributions
umm-5.914
lor-5.618
osity-5.538
vation-5.526
arians-5.470
Token was
Token ID373
Feature activation-1.00e+02

Pos logit contributions
 and+3.319
,+3.206
.+3.104
 the+2.957
 a+2.881
Neg logit contributions
umm-6.098
lor-5.896
osity-5.795
vation-5.789
arians-5.733
Token very
Token ID845
Feature activation-1.00e+02

Pos logit contributions
 and+2.278
,+2.155
.+2.081
 the+1.882
 a+1.795
Neg logit contributions
umm-7.102
lor-6.846
osity-6.810
vation-6.764
rol-6.740
Token poor
Token ID3595
Feature activation-1.00e+02

Pos logit contributions
 and+3.588
,+3.464
.+3.370
 the+3.217
 a+3.148
Neg logit contributions
umm-5.837
lor-5.607
osity-5.538
vation-5.523
arians-5.461
Token but
Token ID475
Feature activation-1.00e+02

Pos logit contributions
 and+0.769
,+0.715
 the+0.568
.+0.559
 a+0.480
Neg logit contributions
umm-8.389
lor-8.067
arians-8.022
osity-8.013
 former-7.957
Token they
Token ID484
Feature activation-1.00e+02

Pos logit contributions
 and+3.174
,+3.067
.+2.984
 the+2.766
 a+2.701
Neg logit contributions
umm-6.132
lor-5.860
osity-5.858
vation-5.814
arians-5.753
Token prayed
Token ID32168
Feature activation-1.00e+02

Pos logit contributions
 and+3.627
,+3.514
.+3.416
 the+3.258
 a+3.195
Neg logit contributions
umm-5.720
lor-5.515
osity-5.456
vation-5.434
arians-5.383
Token hard
Token ID1327
Feature activation-1.00e+02

Pos logit contributions
 and+2.094
,+2.027
.+1.851
 the+1.791
 a+1.688
Neg logit contributions
umm-6.959
osity-6.836
lor-6.814
vation-6.737
 former-6.695
Token and
Token ID290
Feature activation-1.00e+02

Pos logit contributions
 and+3.761
,+3.697
.+3.640
 the+3.497
 a+3.487
Neg logit contributions
umm-5.430
lor-5.290
osity-5.236
arians-5.211
vation-5.204
Token Dad
Token ID17415
Feature activation-1.00e+02

Pos logit contributions
 and+3.915
,+3.787
.+3.729
 the+3.429
 a+3.388
Neg logit contributions
umm-5.478
osity-5.235
lor-5.205
arians-5.100
vation-5.086
Token check
Token ID2198
Feature activation-1.00e+02

Pos logit contributions
 and+3.735
,+3.631
.+3.568
 the+3.425
 a+3.400
Neg logit contributions
umm-5.462
vation-5.297
lor-5.276
osity-5.272
arians-5.227
Token in
Token ID287
Feature activation-1.00e+02

Pos logit contributions
 and+3.958
,+3.881
.+3.786
 a+3.577
 the+3.528
Neg logit contributions
umm-5.127
lor-5.074
ocol-5.041
vation-5.026
rol-5.020
Token their
Token ID511
Feature activation-1.00e+02

Pos logit contributions
 and+1.988
,+1.966
.+1.849
 the+1.621
 a+1.592
Neg logit contributions
umm-7.024
ocol-6.826
lor-6.810
arians-6.789
vation-6.766
Token bags
Token ID11668
Feature activation-1.00e+02

Pos logit contributions
 and+6.113
,+6.025
.+5.912
 the+5.703
 a+5.646
Neg logit contributions
umm-3.228
lor-3.093
vation-3.003
arians-2.973
 former-2.957
Token and
Token ID290
Feature activation-1.00e+02

Pos logit contributions
 and+1.085
,+1.004
.+0.914
 the+0.875
 a+0.857
Neg logit contributions
umm-7.893
lor-7.767
vation-7.755
osity-7.635
rol-7.630
Token get
Token ID651
Feature activation-1.00e+02

Pos logit contributions
 and+4.582
,+4.487
.+4.423
 the+4.153
 a+4.093
Neg logit contributions
umm-4.581
vation-4.484
arians-4.440
osity-4.435
lor-4.408
Token their
Token ID511
Feature activation-1.00e+02

Pos logit contributions
 and+4.074
,+3.980
.+3.811
 the+3.537
 a+3.464
Neg logit contributions
umm-5.187
rol-5.064
ocol-5.007
vation-4.986
lor-4.983
Token tickets
Token ID8587
Feature activation-1.00e+02

Pos logit contributions
 and+6.709
,+6.640
.+6.496
 the+6.345
 a+6.242
Neg logit contributions
umm-2.576
lor-2.424
vation-2.419
arians-2.357
osity-2.277
Token.
Token ID13
Feature activation-1.00e+02

Pos logit contributions
 and+1.479
,+1.409
 the+1.260
.+1.205
 a+1.183
Neg logit contributions
umm-7.557
lor-7.447
vation-7.444
rol-7.357
osity-7.355
Token woman
Token ID2415
Feature activation-1.00e+02

Pos logit contributions
 and+4.270
,+4.185
.+4.130
 the+3.878
 a+3.868
Neg logit contributions
umm-5.082
lor-4.872
osity-4.773
arians-4.726
vation-4.698
Token.
Token ID13
Feature activation-1.00e+02

Pos logit contributions
 and+0.815
,+0.632
 the+0.487
.+0.485
 a+0.451
Neg logit contributions
umm-8.388
lor-8.255
osity-8.199
arians-8.145
vation-8.128
Token I
Token ID314
Feature activation-1.00e+02

Pos logit contributions
 and+4.814
,+4.691
.+4.599
 the+4.441
 a+4.365
Neg logit contributions
umm-4.617
lor-4.407
osity-4.354
vation-4.284
arians-4.257
Token have
Token ID423
Feature activation-1.00e+02

Pos logit contributions
 and+4.542
,+4.434
.+4.337
 the+4.236
 a+4.136
Neg logit contributions
umm-4.795
lor-4.618
osity-4.533
vation-4.515
arians-4.465
Token a
Token ID257
Feature activation-1.00e+02

Pos logit contributions
 and+2.943
,+2.805
.+2.687
 the+2.425
 a+2.291
Neg logit contributions
umm-6.288
lor-6.189
 former-6.085
osity-6.057
ocol-6.039
Token gift
Token ID6979
Feature activation-1.00e+02

Pos logit contributions
 and+4.495
,+4.343
.+4.293
 the+4.158
 a+4.021
Neg logit contributions
umm-4.870
lor-4.654
osity-4.571
 former-4.535
vation-4.516
Token for
Token ID329
Feature activation-1.00e+02

Pos logit contributions
 and+1.926
,+1.818
.+1.672
 the+1.640
 a+1.532
Neg logit contributions
umm-7.298
lor-7.145
osity-7.027
vation-7.023
rol-6.998
Token you
Token ID345
Feature activation-1.00e+02

Pos logit contributions
 and+3.536
,+3.329
.+3.223
 the+2.891
 a+2.858
Neg logit contributions
umm-5.802
ocol-5.692
arians-5.652
vation-5.547
lor-5.544
Token.
Token ID13
Feature activation-1.00e+02

Pos logit contributions
 and+1.484
,+1.373
 the+1.309
.+1.168
 a+1.155
Neg logit contributions
umm-7.455
osity-7.360
lor-7.358
arians-7.296
vation-7.252
Token Look
Token ID6803
Feature activation-1.00e+02

Pos logit contributions
 and+5.421
,+5.315
.+5.216
 the+5.059
 a+4.974
Neg logit contributions
umm-4.003
lor-3.795
osity-3.719
vation-3.670
arians-3.641
Token."
Token ID526
Feature activation-1.00e+02

Pos logit contributions
 and+2.055
,+1.850
.+1.814
 the+1.705
 a+1.665
Neg logit contributions
umm-7.027
lor-6.880
osity-6.868
ocol-6.856
arians-6.781
Token't
Token ID470
Feature activation-1.00e+02

Pos logit contributions
 and+4.414
,+4.303
.+4.203
 the+4.058
 a+3.981
Neg logit contributions
umm-5.006
lor-4.793
osity-4.709
vation-4.686
arians-4.633
Token cry
Token ID3960
Feature activation-1.00e+02

Pos logit contributions
 and+5.020
,+4.821
.+4.723
 the+4.660
 a+4.613
Neg logit contributions
umm-4.354
osity-4.109
lor-3.975
vation-3.969
 former-3.950
Token,
Token ID11
Feature activation-1.00e+02

Pos logit contributions
 and+1.154
,+0.931
 the+0.890
.+0.858
 a+0.831
Neg logit contributions
umm-8.106
osity-7.841
lor-7.813
arians-7.757
ubb-7.707
Token woman
Token ID2415
Feature activation-1.00e+02

Pos logit contributions
 and+4.270
,+4.185
.+4.130
 the+3.878
 a+3.868
Neg logit contributions
umm-5.082
lor-4.872
osity-4.773
arians-4.726
vation-4.698
Token.
Token ID13
Feature activation-1.00e+02

Pos logit contributions
 and+0.815
,+0.632
 the+0.487
.+0.485
 a+0.451
Neg logit contributions
umm-8.388
lor-8.255
osity-8.199
arians-8.145
vation-8.128
Token I
Token ID314
Feature activation-1.00e+02

Pos logit contributions
 and+4.814
,+4.691
.+4.599
 the+4.441
 a+4.365
Neg logit contributions
umm-4.617
lor-4.407
osity-4.354
vation-4.284
arians-4.257
Token have
Token ID423
Feature activation-1.00e+02

Pos logit contributions
 and+4.542
,+4.434
.+4.337
 the+4.236
 a+4.136
Neg logit contributions
umm-4.795
lor-4.618
osity-4.533
vation-4.515
arians-4.465
Token a
Token ID257
Feature activation-1.00e+02

Pos logit contributions
 and+2.943
,+2.805
.+2.687
 the+2.425
 a+2.291
Neg logit contributions
umm-6.288
lor-6.189
 former-6.085
osity-6.057
ocol-6.039
Token gift
Token ID6979
Feature activation-1.00e+02

Pos logit contributions
 and+4.495
,+4.343
.+4.293
 the+4.158
 a+4.021
Neg logit contributions
umm-4.870
lor-4.654
osity-4.571
 former-4.535
vation-4.516
Token for
Token ID329
Feature activation-1.00e+02

Pos logit contributions
 and+1.926
,+1.818
.+1.672
 the+1.640
 a+1.532
Neg logit contributions
umm-7.298
lor-7.145
osity-7.027
vation-7.023
rol-6.998
Token you
Token ID345
Feature activation-1.00e+02

Pos logit contributions
 and+3.536
,+3.329
.+3.223
 the+2.891
 a+2.858
Neg logit contributions
umm-5.802
ocol-5.692
arians-5.652
vation-5.547
lor-5.544
Token I
Token ID314
Feature activation-1.00e+02

Pos logit contributions
 and+5.149
,+5.050
.+4.943
 the+4.792
 a+4.699
Neg logit contributions
umm-4.298
lor-4.067
osity-4.013
arians-3.936
vation-3.932
Token love
Token ID1842
Feature activation-1.00e+02

Pos logit contributions
 and+4.181
,+4.083
.+3.975
 the+3.865
 a+3.772
Neg logit contributions
umm-5.123
lor-4.947
osity-4.873
vation-4.854
rol-4.805
Token my
Token ID616
Feature activation-1.00e+02

Pos logit contributions
 and+3.089
,+2.999
.+2.852
 a+2.608
 the+2.585
Neg logit contributions
umm-6.210
ocol-5.954
osity-5.906
arians-5.895
lor-5.867
Token toy
Token ID13373
Feature activation-1.00e+02

Pos logit contributions
 and+3.529
,+3.432
.+3.283
 the+3.243
 a+3.107
Neg logit contributions
umm-5.763
 former-5.478
arians-5.451
lor-5.430
osity-5.417
Token."
Token ID526
Feature activation-1.00e+02

Pos logit contributions
 and+2.642
,+2.551
 the+2.410
.+2.392
 a+2.323
Neg logit contributions
umm-6.593
lor-6.381
arians-6.371
vation-6.269
osity-6.258
Token
Token ID198
Feature activation-1.00e+02

Pos logit contributions
 and+2.391
,+2.297
.+2.192
 the+2.028
 a+2.010
Neg logit contributions
umm-6.904
lor-6.708
osity-6.600
vation-6.598
arians-6.558
Token
Token ID198
Feature activation-1.00e+02

Pos logit contributions
 and+1.629
,+1.533
.+1.401
 the+1.238
 a+1.205
Neg logit contributions
umm-7.622
lor-7.552
vation-7.434
osity-7.350
arians-7.324
TokenThe
Token ID464
Feature activation-1.00e+02

Pos logit contributions
 and+4.058
,+3.954
.+3.853
 the+3.677
 a+3.622
Neg logit contributions
umm-5.384
lor-5.207
vation-5.112
osity-5.089
arians-5.053
Token man
Token ID582
Feature activation-1.00e+02

Pos logit contributions
 and+5.218
,+5.156
.+5.036
 the+4.893
 a+4.801
Neg logit contributions
umm-4.152
lor-3.929
vation-3.854
osity-3.833
arians-3.825
Token said
Token ID531
Feature activation-1.00e+02

Pos logit contributions
 and+3.366
,+3.313
.+3.242
 the+3.087
 a+3.012
Neg logit contributions
umm-5.772
lor-5.647
vation-5.564
osity-5.504
arians-5.494
Token,
Token ID11
Feature activation-1.00e+02

Pos logit contributions
 and+0.504
.+0.242
,+0.218
 the+0.079
 a+0.030
Neg logit contributions
umm-8.776
lor-8.617
osity-8.537
vation-8.466
ocol-8.453
Token love
Token ID1842
Feature activation-1.00e+02

Pos logit contributions
 and+4.181
,+4.083
.+3.975
 the+3.865
 a+3.772
Neg logit contributions
umm-5.123
lor-4.947
osity-4.873
vation-4.854
rol-4.805
Token my
Token ID616
Feature activation-1.00e+02

Pos logit contributions
 and+3.089
,+2.999
.+2.852
 a+2.608
 the+2.585
Neg logit contributions
umm-6.210
ocol-5.954
osity-5.906
arians-5.895
lor-5.867
Token toy
Token ID13373
Feature activation-1.00e+02

Pos logit contributions
 and+3.529
,+3.432
.+3.283
 the+3.243
 a+3.107
Neg logit contributions
umm-5.763
 former-5.478
arians-5.451
lor-5.430
osity-5.417
Token."
Token ID526
Feature activation-1.00e+02

Pos logit contributions
 and+2.642
,+2.551
 the+2.410
.+2.392
 a+2.323
Neg logit contributions
umm-6.593
lor-6.381
arians-6.371
vation-6.269
osity-6.258
Token
Token ID198
Feature activation-1.00e+02

Pos logit contributions
 and+2.391
,+2.297
.+2.192
 the+2.028
 a+2.010
Neg logit contributions
umm-6.904
lor-6.708
osity-6.600
vation-6.598
arians-6.558
Token
Token ID198
Feature activation-1.00e+02

Pos logit contributions
 and+1.629
,+1.533
.+1.401
 the+1.238
 a+1.205
Neg logit contributions
umm-7.622
lor-7.552
vation-7.434
osity-7.350
arians-7.324
TokenThe
Token ID464
Feature activation-1.00e+02

Pos logit contributions
 and+4.058
,+3.954
.+3.853
 the+3.677
 a+3.622
Neg logit contributions
umm-5.384
lor-5.207
vation-5.112
osity-5.089
arians-5.053
Token man
Token ID582
Feature activation-1.00e+02

Pos logit contributions
 and+5.218
,+5.156
.+5.036
 the+4.893
 a+4.801
Neg logit contributions
umm-4.152
lor-3.929
vation-3.854
osity-3.833
arians-3.825
Token said
Token ID531
Feature activation-1.00e+02

Pos logit contributions
 and+3.366
,+3.313
.+3.242
 the+3.087
 a+3.012
Neg logit contributions
umm-5.772
lor-5.647
vation-5.564
osity-5.504
arians-5.494
Token,
Token ID11
Feature activation-1.00e+02

Pos logit contributions
 and+0.504
.+0.242
,+0.218
 the+0.079
 a+0.030
Neg logit contributions
umm-8.776
lor-8.617
osity-8.537
vation-8.466
ocol-8.453
Token "
Token ID366
Feature activation-1.00e+02

Pos logit contributions
 and+2.546
,+2.439
.+2.367
 the+2.189
 a+2.141
Neg logit contributions
umm-6.667
lor-6.525
osity-6.484
vation-6.398
arians-6.396
Token in
Token ID287
Feature activation-1.00e+02

Pos logit contributions
 and+1.828
,+1.754
.+1.642
 the+1.556
 a+1.545
Neg logit contributions
umm-7.279
osity-6.988
lor-6.920
rol-6.912
ubb-6.900
Token his
Token ID465
Feature activation-1.00e+02

Pos logit contributions
 and+3.776
,+3.633
.+3.498
 the+3.214
 a+3.188
Neg logit contributions
umm-5.479
lor-5.292
ocol-5.283
ubb-5.264
 former-5.205
Token room
Token ID2119
Feature activation-1.00e+02

Pos logit contributions
 and+5.205
,+5.010
.+4.931
 the+4.835
 a+4.753
Neg logit contributions
umm-4.188
 former-3.922
lor-3.896
vation-3.805
ocol-3.776
Token,
Token ID11
Feature activation-1.00e+02

Pos logit contributions
 and+1.466
,+1.376
 the+1.265
.+1.226
 a+1.168
Neg logit contributions
umm-7.597
lor-7.479
arians-7.351
osity-7.332
 former-7.331
Token and
Token ID290
Feature activation-1.00e+02

Pos logit contributions
,+2.542
 and+2.511
.+2.461
 the+2.226
 a+2.191
Neg logit contributions
umm-6.531
lor-6.331
osity-6.320
arians-6.259
vation-6.241
Token he
Token ID339
Feature activation-1.00e+02

Pos logit contributions
 and+2.939
,+2.856
.+2.794
 the+2.488
 a+2.455
Neg logit contributions
umm-6.271
lor-6.091
osity-6.065
arians-5.963
vation-5.961
Token never
Token ID1239
Feature activation-1.00e+02

Pos logit contributions
 and+3.806
,+3.722
.+3.627
 the+3.475
 a+3.404
Neg logit contributions
umm-5.492
lor-5.356
osity-5.238
vation-5.226
arians-5.202
Token had
Token ID550
Feature activation-1.00e+02

Pos logit contributions
 and+4.292
,+4.136
.+4.042
 the+3.906
 a+3.835
Neg logit contributions
umm-4.988
osity-4.832
lor-4.786
vation-4.726
reetings-4.692
Token a
Token ID257
Feature activation-1.00e+02

Pos logit contributions
 and+2.811
,+2.661
.+2.488
 the+2.241
 a+2.109
Neg logit contributions
umm-6.392
lor-6.328
osity-6.213
rol-6.164
vation-6.152
Token good
Token ID922
Feature activation-1.00e+02

Pos logit contributions
 and+5.774
,+5.607
.+5.570
 the+5.459
 a+5.321
Neg logit contributions
umm-3.579
lor-3.402
osity-3.315
vation-3.295
 former-3.265
Token night
Token ID1755
Feature activation-1.00e+02

Pos logit contributions
 and+4.956
,+4.843
.+4.763
 the+4.717
 a+4.554
Neg logit contributions
umm-4.393
lor-4.134
osity-4.053
vation-4.041
 former-4.005
Token scared
Token ID12008
Feature activation-1.00e+02

Pos logit contributions
 and+3.776
,+3.617
.+3.541
 the+3.397
 a+3.322
Neg logit contributions
umm-5.513
lor-5.311
osity-5.289
ubb-5.287
vation-5.278
Token and
Token ID290
Feature activation-1.00e+02

Pos logit contributions
 and+0.291
,+0.222
 the+0.126
 a+0.120
.+0.078
Neg logit contributions
umm-8.675
lor-8.340
osity-8.330
ocol-8.294
ubb-8.284
Token cried
Token ID16896
Feature activation-1.00e+02

Pos logit contributions
 and+4.067
,+3.939
.+3.834
 the+3.625
 a+3.587
Neg logit contributions
umm-5.143
lor-5.013
osity-4.970
vation-4.953
rol-4.914
Token.
Token ID13
Feature activation-1.00e+02

Pos logit contributions
 and+0.660
,+0.507
 the+0.389
.+0.331
 a+0.228
Neg logit contributions
umm-8.411
osity-8.140
ubb-8.118
lor-8.070
rol-8.039
Token From
Token ID3574
Feature activation-1.00e+02

Pos logit contributions
 and+3.192
,+3.077
.+2.968
 the+2.806
 a+2.741
Neg logit contributions
umm-6.239
lor-6.030
osity-5.937
vation-5.913
arians-5.869
Token that
Token ID326
Feature activation-1.00e+02

Pos logit contributions
 and+1.967
,+1.869
.+1.767
 the+1.549
 a+1.485
Neg logit contributions
umm-7.336
lor-7.152
vation-7.057
osity-7.051
arians-7.014
Token day
Token ID1110
Feature activation-1.00e+02

Pos logit contributions
 and+3.146
,+3.024
.+2.937
 the+2.785
 a+2.716
Neg logit contributions
umm-6.293
lor-6.093
vation-5.996
osity-5.989
arians-5.911
Token on
Token ID319
Feature activation-1.00e+02

Pos logit contributions
 and+2.109
,+1.970
.+1.891
 the+1.746
 a+1.680
Neg logit contributions
umm-7.264
lor-7.084
vation-6.979
osity-6.976
arians-6.908
Token,
Token ID11
Feature activation-1.00e+02

Pos logit contributions
 and+0.345
,+0.143
.+0.139
 a-0.055
 the-0.056
Neg logit contributions
umm-8.969
lor-8.811
vation-8.669
arians-8.621
osity-8.599
Token Tom
Token ID4186
Feature activation-1.00e+02

Pos logit contributions
 and+2.905
,+2.781
.+2.712
 the+2.433
 a+2.405
Neg logit contributions
umm-6.407
osity-6.115
lor-6.099
vation-6.096
arians-6.058
Token was
Token ID373
Feature activation-1.00e+02

Pos logit contributions
 and+3.298
,+3.211
.+3.146
 the+2.990
 a+2.929
Neg logit contributions
umm-5.923
lor-5.816
osity-5.693
vation-5.674
arians-5.648
Token his
Token ID465
Feature activation-1.00e+02

Pos logit contributions
 and+3.776
,+3.633
.+3.498
 the+3.214
 a+3.188
Neg logit contributions
umm-5.479
lor-5.292
ocol-5.283
ubb-5.264
 former-5.205
Token room
Token ID2119
Feature activation-1.00e+02

Pos logit contributions
 and+5.205
,+5.010
.+4.931
 the+4.835
 a+4.753
Neg logit contributions
umm-4.188
 former-3.922
lor-3.896
vation-3.805
ocol-3.776
Token,
Token ID11
Feature activation-1.00e+02

Pos logit contributions
 and+1.466
,+1.376
 the+1.265
.+1.226
 a+1.168
Neg logit contributions
umm-7.597
lor-7.479
arians-7.351
osity-7.332
 former-7.331
Token and
Token ID290
Feature activation-1.00e+02

Pos logit contributions
,+2.542
 and+2.511
.+2.461
 the+2.226
 a+2.191
Neg logit contributions
umm-6.531
lor-6.331
osity-6.320
arians-6.259
vation-6.241
Token he
Token ID339
Feature activation-1.00e+02

Pos logit contributions
 and+2.939
,+2.856
.+2.794
 the+2.488
 a+2.455
Neg logit contributions
umm-6.271
lor-6.091
osity-6.065
arians-5.963
vation-5.961
Token never
Token ID1239
Feature activation-1.00e+02

Pos logit contributions
 and+3.806
,+3.722
.+3.627
 the+3.475
 a+3.404
Neg logit contributions
umm-5.492
lor-5.356
osity-5.238
vation-5.226
arians-5.202
Token had
Token ID550
Feature activation-1.00e+02

Pos logit contributions
 and+4.292
,+4.136
.+4.042
 the+3.906
 a+3.835
Neg logit contributions
umm-4.988
osity-4.832
lor-4.786
vation-4.726
reetings-4.692
Token a
Token ID257
Feature activation-1.00e+02

Pos logit contributions
 and+2.811
,+2.661
.+2.488
 the+2.241
 a+2.109
Neg logit contributions
umm-6.392
lor-6.328
osity-6.213
rol-6.164
vation-6.152
Token good
Token ID922
Feature activation-1.00e+02

Pos logit contributions
 and+5.774
,+5.607
.+5.570
 the+5.459
 a+5.321
Neg logit contributions
umm-3.579
lor-3.402
osity-3.315
vation-3.295
 former-3.265
Token night
Token ID1755
Feature activation-1.00e+02

Pos logit contributions
 and+4.956
,+4.843
.+4.763
 the+4.717
 a+4.554
Neg logit contributions
umm-4.393
lor-4.134
osity-4.053
vation-4.041
 former-4.005
Token's
Token ID338
Feature activation-1.00e+02

Pos logit contributions
 and+3.213
,+3.100
.+2.933
 the+2.926
 a+2.864
Neg logit contributions
umm-6.141
lor-5.889
vation-5.851
osity-5.724
plets-5.693
Token and
Token ID290
Feature activation-1.00e+02

Pos logit contributions
,+2.542
 and+2.511
.+2.461
 the+2.226
 a+2.191
Neg logit contributions
umm-6.531
lor-6.331
osity-6.320
arians-6.259
vation-6.241
Token he
Token ID339
Feature activation-1.00e+02

Pos logit contributions
 and+2.939
,+2.856
.+2.794
 the+2.488
 a+2.455
Neg logit contributions
umm-6.271
lor-6.091
osity-6.065
arians-5.963
vation-5.961
Token never
Token ID1239
Feature activation-1.00e+02

Pos logit contributions
 and+3.806
,+3.722
.+3.627
 the+3.475
 a+3.404
Neg logit contributions
umm-5.492
lor-5.356
osity-5.238
vation-5.226
arians-5.202
Token had
Token ID550
Feature activation-1.00e+02

Pos logit contributions
 and+4.292
,+4.136
.+4.042
 the+3.906
 a+3.835
Neg logit contributions
umm-4.988
osity-4.832
lor-4.786
vation-4.726
reetings-4.692
Token a
Token ID257
Feature activation-1.00e+02

Pos logit contributions
 and+2.811
,+2.661
.+2.488
 the+2.241
 a+2.109
Neg logit contributions
umm-6.392
lor-6.328
osity-6.213
rol-6.164
vation-6.152
Token good
Token ID922
Feature activation-1.00e+02

Pos logit contributions
 and+5.774
,+5.607
.+5.570
 the+5.459
 a+5.321
Neg logit contributions
umm-3.579
lor-3.402
osity-3.315
vation-3.295
 former-3.265
Token night
Token ID1755
Feature activation-1.00e+02

Pos logit contributions
 and+4.956
,+4.843
.+4.763
 the+4.717
 a+4.554
Neg logit contributions
umm-4.393
lor-4.134
osity-4.053
vation-4.041
 former-4.005
Token's
Token ID338
Feature activation-1.00e+02

Pos logit contributions
 and+3.213
,+3.100
.+2.933
 the+2.926
 a+2.864
Neg logit contributions
umm-6.141
lor-5.889
vation-5.851
osity-5.724
plets-5.693
Token sleep
Token ID3993
Feature activation-1.00e+02

Pos logit contributions
 and+5.503
,+5.396
.+5.228
 the+5.204
 a+5.065
Neg logit contributions
umm-3.825
lor-3.576
 former-3.567
osity-3.528
vation-3.506
Token again
Token ID757
Feature activation-1.00e+02

Pos logit contributions
 and+3.354
,+3.245
 the+3.064
.+3.041
 a+2.980
Neg logit contributions
umm-5.952
lor-5.843
vation-5.702
arians-5.614
osity-5.555
Token.
Token ID13
Feature activation-1.00e+02

Pos logit contributions
 and+0.516
,+0.404
 the+0.238
 a+0.212
.+0.086
Neg logit contributions
umm-8.552
lor-8.454
 former-8.319
vation-8.303
rim-8.219
TokenThank
Token ID10449
Feature activation-1.00e+02

Pos logit contributions
 and+5.772
,+5.641
.+5.552
 the+5.410
 a+5.337
Neg logit contributions
umm-3.660
lor-3.462
osity-3.375
vation-3.351
arians-3.297
Token you
Token ID345
Feature activation-1.00e+02

Pos logit contributions
 and+3.238
,+3.081
.+2.979
 the+2.807
 a+2.755
Neg logit contributions
umm-6.202
lor-6.026
vation-5.926
arians-5.862
ocol-5.849
Token,
Token ID11
Feature activation-1.00e+02

Pos logit contributions
 and+1.134
,+0.939
.+0.901
 the+0.810
 a+0.722
Neg logit contributions
umm-8.207
lor-7.977
osity-7.846
arians-7.835
vation-7.831
Token Tom
Token ID4186
Feature activation-1.00e+02

Pos logit contributions
 and+3.720
,+3.608
.+3.548
 the+3.319
 a+3.287
Neg logit contributions
umm-5.638
lor-5.368
osity-5.359
vation-5.334
arians-5.310
Token!"
Token ID2474
Feature activation-1.00e+02

Pos logit contributions
 and+2.134
,+1.985
.+1.840
 the+1.811
 a+1.756
Neg logit contributions
umm-7.078
lor-6.935
osity-6.767
arians-6.756
vation-6.752
Token
Token ID198
Feature activation-1.00e+02

Pos logit contributions
 and+3.210
,+3.133
.+3.023
 the+2.875
 a+2.808
Neg logit contributions
umm-6.098
lor-5.895
vation-5.829
osity-5.795
arians-5.764
Token
Token ID198
Feature activation-1.00e+02

Pos logit contributions
 and+1.545
,+1.450
.+1.331
 the+1.192
 a+1.127
Neg logit contributions
umm-7.725
lor-7.622
vation-7.491
osity-7.437
rol-7.413
TokenTogether
Token ID41631
Feature activation-1.00e+02

Pos logit contributions
 and+4.870
,+4.752
.+4.654
 the+4.506
 a+4.434
Neg logit contributions
umm-4.580
lor-4.370
osity-4.279
vation-4.262
arians-4.206
Token,
Token ID11
Feature activation-1.00e+02

Pos logit contributions
 and+0.237
,+0.097
.+0.029
 the-0.121
 a-0.181
Neg logit contributions
umm-9.147
lor-8.916
osity-8.834
vation-8.816
arians-8.766
Token Tom
Token ID4186
Feature activation-1.00e+02

Pos logit contributions
 and+2.255
,+2.135
.+2.052
 the+1.839
 a+1.791
Neg logit contributions
umm-7.097
lor-6.830
vation-6.802
osity-6.795
arians-6.737
Token and
Token ID290
Feature activation-1.00e+02

Pos logit contributions
 and+3.130
,+3.061
.+3.007
 the+2.883
 a+2.818
Neg logit contributions
umm-6.005
lor-5.871
osity-5.779
vation-5.756
arians-5.750
Token and
Token ID290
Feature activation-1.00e+02

Pos logit contributions
 and+0.918
,+0.847
.+0.718
 the+0.656
 a+0.607
Neg logit contributions
umm-8.201
lor-7.983
osity-7.939
arians-7.841
vation-7.825
Token said
Token ID531
Feature activation-1.00e+02

Pos logit contributions
 and+3.620
,+3.505
.+3.445
 the+3.205
 a+3.173
Neg logit contributions
umm-5.701
osity-5.455
lor-5.448
vation-5.412
arians-5.390
Token,
Token ID11
Feature activation-1.00e+02

Pos logit contributions
 and+1.399
,+1.150
.+1.140
 the+0.976
 a+0.911
Neg logit contributions
umm-7.946
lor-7.766
osity-7.695
vation-7.626
arians-7.587
Token "
Token ID366
Feature activation-1.00e+02

Pos logit contributions
 and+2.510
,+2.434
.+2.356
 the+2.180
 a+2.123
Neg logit contributions
umm-6.705
lor-6.527
osity-6.497
arians-6.435
vation-6.427
TokenThank
Token ID10449
Feature activation-1.00e+02

Pos logit contributions
 and+5.772
,+5.641
.+5.552
 the+5.410
 a+5.337
Neg logit contributions
umm-3.660
lor-3.462
osity-3.375
vation-3.351
arians-3.297
Token you
Token ID345
Feature activation-1.00e+02

Pos logit contributions
 and+3.238
,+3.081
.+2.979
 the+2.807
 a+2.755
Neg logit contributions
umm-6.202
lor-6.026
vation-5.926
arians-5.862
ocol-5.849
Token,
Token ID11
Feature activation-1.00e+02

Pos logit contributions
 and+1.134
,+0.939
.+0.901
 the+0.810
 a+0.722
Neg logit contributions
umm-8.207
lor-7.977
osity-7.846
arians-7.835
vation-7.831
Token Tom
Token ID4186
Feature activation-1.00e+02

Pos logit contributions
 and+3.720
,+3.608
.+3.548
 the+3.319
 a+3.287
Neg logit contributions
umm-5.638
lor-5.368
osity-5.359
vation-5.334
arians-5.310
Token!"
Token ID2474
Feature activation-1.00e+02

Pos logit contributions
 and+2.134
,+1.985
.+1.840
 the+1.811
 a+1.756
Neg logit contributions
umm-7.078
lor-6.935
osity-6.767
arians-6.756
vation-6.752
Token
Token ID198
Feature activation-1.00e+02

Pos logit contributions
 and+3.210
,+3.133
.+3.023
 the+2.875
 a+2.808
Neg logit contributions
umm-6.098
lor-5.895
vation-5.829
osity-5.795
arians-5.764
Token
Token ID198
Feature activation-1.00e+02

Pos logit contributions
 and+1.545
,+1.450
.+1.331
 the+1.192
 a+1.127
Neg logit contributions
umm-7.725
lor-7.622
vation-7.491
osity-7.437
rol-7.413
Token,
Token ID11
Feature activation-1.00e+02

Pos logit contributions
 and+1.798
,+1.551
.+1.548
 the+1.384
 a+1.310
Neg logit contributions
umm-7.516
lor-7.365
osity-7.302
vation-7.206
arians-7.183
Token "
Token ID366
Feature activation-1.00e+02

Pos logit contributions
 and+2.615
,+2.487
.+2.419
 the+2.286
 a+2.203
Neg logit contributions
umm-6.517
osity-6.404
lor-6.369
arians-6.298
plets-6.220
TokenI
Token ID40
Feature activation-1.00e+02

Pos logit contributions
 and+5.969
,+5.885
.+5.766
 the+5.600
 a+5.531
Neg logit contributions
umm-3.516
lor-3.316
vation-3.220
osity-3.181
arians-3.144
Token can
Token ID460
Feature activation-1.00e+02

Pos logit contributions
 and+4.364
,+4.240
.+4.158
 the+4.029
 a+3.930
Neg logit contributions
umm-5.019
lor-4.831
osity-4.734
vation-4.729
arians-4.659
Token help
Token ID1037
Feature activation-1.00e+02

Pos logit contributions
 and+4.348
,+4.216
.+4.129
 the+3.995
 a+3.914
Neg logit contributions
umm-4.985
lor-4.808
vation-4.746
osity-4.710
arians-4.702
Token you
Token ID345
Feature activation-1.00e+02

Pos logit contributions
 and+3.148
,+2.954
.+2.847
 a+2.663
 the+2.644
Neg logit contributions
umm-5.972
arians-5.900
vation-5.834
lor-5.823
ocol-5.738
Token fix
Token ID4259
Feature activation-1.00e+02

Pos logit contributions
 and+4.055
,+3.887
.+3.782
 the+3.764
 a+3.675
Neg logit contributions
umm-5.126
lor-5.038
osity-4.922
vation-4.922
arians-4.899
Token your
Token ID534
Feature activation-1.00e+02

Pos logit contributions
 and+2.770
,+2.589
.+2.498
 a+2.208
 the+2.148
Neg logit contributions
umm-6.339
lor-6.263
ocol-6.239
arians-6.234
vation-6.129
Token bike
Token ID7161
Feature activation-1.00e+02

Pos logit contributions
 and+5.288
,+5.161
.+5.006
 the+4.929
 a+4.879
Neg logit contributions
umm-4.022
 former-3.762
lor-3.714
osity-3.691
arians-3.680
Token."
Token ID526
Feature activation-1.00e+02

Pos logit contributions
 and+2.872
,+2.704
 the+2.609
.+2.581
 a+2.531
Neg logit contributions
umm-6.287
lor-6.146
arians-6.067
vation-6.035
osity-5.947
Token The
Token ID383
Feature activation-1.00e+02

Pos logit contributions
 and+3.217
,+3.110
.+3.005
 the+2.845
 a+2.781
Neg logit contributions
umm-6.173
lor-5.944
vation-5.871
osity-5.860
arians-5.812
Token your
Token ID534
Feature activation-1.00e+02

Pos logit contributions
 and+2.770
,+2.589
.+2.498
 a+2.208
 the+2.148
Neg logit contributions
umm-6.339
lor-6.263
ocol-6.239
arians-6.234
vation-6.129
Token bike
Token ID7161
Feature activation-1.00e+02

Pos logit contributions
 and+5.288
,+5.161
.+5.006
 the+4.929
 a+4.879
Neg logit contributions
umm-4.022
 former-3.762
lor-3.714
osity-3.691
arians-3.680
Token."
Token ID526
Feature activation-1.00e+02

Pos logit contributions
 and+2.872
,+2.704
 the+2.609
.+2.581
 a+2.531
Neg logit contributions
umm-6.287
lor-6.146
arians-6.067
vation-6.035
osity-5.947
Token The
Token ID383
Feature activation-1.00e+02

Pos logit contributions
 and+3.217
,+3.110
.+3.005
 the+2.845
 a+2.781
Neg logit contributions
umm-6.173
lor-5.944
vation-5.871
osity-5.860
arians-5.812
Token unknown
Token ID6439
Feature activation-1.00e+02

Pos logit contributions
 and+3.934
,+3.810
.+3.704
 the+3.536
 a+3.465
Neg logit contributions
umm-5.506
lor-5.296
osity-5.210
vation-5.204
arians-5.134
Token person
Token ID1048
Feature activation-1.00e+02

Pos logit contributions
 and+4.103
,+3.963
.+3.875
 the+3.744
 a+3.685
Neg logit contributions
umm-5.248
lor-5.030
osity-4.969
vation-4.924
arians-4.861
Token was
Token ID373
Feature activation-1.00e+02

Pos logit contributions
 and+3.406
,+3.297
.+3.215
 the+3.075
 a+3.012
Neg logit contributions
umm-5.874
lor-5.688
vation-5.613
osity-5.612
arians-5.574
Token happy
Token ID3772
Feature activation-1.00e+02

Pos logit contributions
 and+3.133
,+3.004
.+2.933
 the+2.688
 a+2.565
Neg logit contributions
umm-6.050
lor-5.911
osity-5.873
rol-5.814
vation-5.813
Token and
Token ID290
Feature activation-1.00e+02

Pos logit contributions
 and+0.918
,+0.847
.+0.718
 the+0.656
 a+0.607
Neg logit contributions
umm-8.201
lor-7.983
osity-7.939
arians-7.841
vation-7.825
Token said
Token ID531
Feature activation-1.00e+02

Pos logit contributions
 and+3.620
,+3.505
.+3.445
 the+3.205
 a+3.173
Neg logit contributions
umm-5.701
osity-5.455
lor-5.448
vation-5.412
arians-5.390
Token,
Token ID11
Feature activation-1.00e+02

Pos logit contributions
 and+1.399
,+1.150
.+1.140
 the+0.976
 a+0.911
Neg logit contributions
umm-7.946
lor-7.766
osity-7.695
vation-7.626
arians-7.587
Token,
Token ID11
Feature activation-1.00e+02

Pos logit contributions
 and+0.237
,+0.097
.+0.029
 the-0.121
 a-0.181
Neg logit contributions
umm-9.147
lor-8.916
osity-8.834
vation-8.816
arians-8.766
Token Tom
Token ID4186
Feature activation-1.00e+02

Pos logit contributions
 and+2.255
,+2.135
.+2.052
 the+1.839
 a+1.791
Neg logit contributions
umm-7.097
lor-6.830
vation-6.802
osity-6.795
arians-6.737
Token and
Token ID290
Feature activation-1.00e+02

Pos logit contributions
 and+3.130
,+3.061
.+3.007
 the+2.883
 a+2.818
Neg logit contributions
umm-6.005
lor-5.871
osity-5.779
vation-5.756
arians-5.750
Token the
Token ID262
Feature activation-1.00e+02

Pos logit contributions
 and+0.713
,+0.586
.+0.497
 a+0.198
 the+0.189
Neg logit contributions
umm-8.677
osity-8.383
vation-8.361
lor-8.343
arians-8.319
Token unknown
Token ID6439
Feature activation-1.00e+02

Pos logit contributions
 and+3.678
,+3.552
.+3.446
 the+3.258
 a+3.182
Neg logit contributions
umm-5.761
lor-5.525
osity-5.468
vation-5.451
ubb-5.385
Token person
Token ID1048
Feature activation-1.00e+02

Pos logit contributions
 and+4.161
,+4.024
.+3.925
 the+3.792
 a+3.738
Neg logit contributions
umm-5.234
lor-4.983
osity-4.952
vation-4.892
ubb-4.856
Token fixed
Token ID5969
Feature activation-1.00e+02

Pos logit contributions
 and+3.975
,+3.871
.+3.767
 the+3.640
 a+3.586
Neg logit contributions
umm-5.311
lor-5.108
osity-5.061
vation-5.040
arians-5.018
Token the
Token ID262
Feature activation-1.00e+02

Pos logit contributions
 and+2.148
,+2.067
.+1.970
 a+1.703
 the+1.611
Neg logit contributions
umm-6.933
arians-6.756
ocol-6.708
osity-6.641
ubb-6.622
Token bike
Token ID7161
Feature activation-1.00e+02

Pos logit contributions
 and+4.352
,+4.255
.+4.148
 the+3.984
 a+3.900
Neg logit contributions
umm-5.057
lor-4.767
osity-4.728
vation-4.688
 former-4.671
Token.
Token ID13
Feature activation-1.00e+02

Pos logit contributions
 and+2.051
,+1.990
 the+1.848
.+1.781
 a+1.767
Neg logit contributions
umm-7.025
lor-6.833
arians-6.781
rol-6.772
vation-6.766
Token The
Token ID383
Feature activation-1.00e+02

Pos logit contributions
 and+3.727
,+3.615
.+3.517
 the+3.363
 a+3.292
Neg logit contributions
umm-5.696
lor-5.482
osity-5.392
vation-5.381
arians-5.327
Token you
Token ID345
Feature activation-1.00e+02

Pos logit contributions
 and+3.238
,+3.081
.+2.979
 the+2.807
 a+2.755
Neg logit contributions
umm-6.202
lor-6.026
vation-5.926
arians-5.862
ocol-5.849
Token,
Token ID11
Feature activation-1.00e+02

Pos logit contributions
 and+1.134
,+0.939
.+0.901
 the+0.810
 a+0.722
Neg logit contributions
umm-8.207
lor-7.977
osity-7.846
arians-7.835
vation-7.831
Token Tom
Token ID4186
Feature activation-1.00e+02

Pos logit contributions
 and+3.720
,+3.608
.+3.548
 the+3.319
 a+3.287
Neg logit contributions
umm-5.638
lor-5.368
osity-5.359
vation-5.334
arians-5.310
Token!"
Token ID2474
Feature activation-1.00e+02

Pos logit contributions
 and+2.134
,+1.985
.+1.840
 the+1.811
 a+1.756
Neg logit contributions
umm-7.078
lor-6.935
osity-6.767
arians-6.756
vation-6.752
Token
Token ID198
Feature activation-1.00e+02

Pos logit contributions
 and+3.210
,+3.133
.+3.023
 the+2.875
 a+2.808
Neg logit contributions
umm-6.098
lor-5.895
vation-5.829
osity-5.795
arians-5.764
Token
Token ID198
Feature activation-1.00e+02

Pos logit contributions
 and+1.545
,+1.450
.+1.331
 the+1.192
 a+1.127
Neg logit contributions
umm-7.725
lor-7.622
vation-7.491
osity-7.437
rol-7.413
TokenTogether
Token ID41631
Feature activation-1.00e+02

Pos logit contributions
 and+4.870
,+4.752
.+4.654
 the+4.506
 a+4.434
Neg logit contributions
umm-4.580
lor-4.370
osity-4.279
vation-4.262
arians-4.206
Token,
Token ID11
Feature activation-1.00e+02

Pos logit contributions
 and+0.237
,+0.097
.+0.029
 the-0.121
 a-0.181
Neg logit contributions
umm-9.147
lor-8.916
osity-8.834
vation-8.816
arians-8.766
Token Tom
Token ID4186
Feature activation-1.00e+02

Pos logit contributions
 and+2.255
,+2.135
.+2.052
 the+1.839
 a+1.791
Neg logit contributions
umm-7.097
lor-6.830
vation-6.802
osity-6.795
arians-6.737
Token and
Token ID290
Feature activation-1.00e+02

Pos logit contributions
 and+3.130
,+3.061
.+3.007
 the+2.883
 a+2.818
Neg logit contributions
umm-6.005
lor-5.871
osity-5.779
vation-5.756
arians-5.750
Token the
Token ID262
Feature activation-1.00e+02

Pos logit contributions
 and+0.713
,+0.586
.+0.497
 a+0.198
 the+0.189
Neg logit contributions
umm-8.677
osity-8.383
vation-8.361
lor-8.343
arians-8.319
Token sounds
Token ID5238
Feature activation-1.00e+02

Pos logit contributions
 and+3.556
,+3.437
.+3.250
 the+3.043
 a+2.966
Neg logit contributions
umm-5.701
lor-5.511
rol-5.472
ocol-5.444
osity-5.439
Token and
Token ID290
Feature activation-1.00e+02

Pos logit contributions
 and+1.383
,+1.316
.+1.150
 the+1.098
 a+1.028
Neg logit contributions
umm-7.771
osity-7.513
lor-7.483
vation-7.442
rol-7.439
Token move
Token ID1445
Feature activation-1.00e+02

Pos logit contributions
 and+4.828
,+4.731
.+4.589
 the+4.380
 a+4.316
Neg logit contributions
umm-4.437
osity-4.259
 former-4.216
lor-4.157
vation-4.148
Token.
Token ID13
Feature activation-1.00e+02

Pos logit contributions
 and+2.263
,+2.200
.+1.978
 the+1.892
 a+1.859
Neg logit contributions
umm-6.785
rol-6.593
osity-6.575
lor-6.565
arians-6.534
Token But
Token ID887
Feature activation-1.00e+02

Pos logit contributions
 and+3.990
,+3.902
.+3.766
 the+3.612
 a+3.545
Neg logit contributions
umm-5.460
lor-5.187
osity-5.119
arians-5.067
rol-5.056
Token one
Token ID530
Feature activation-1.00e+02

Pos logit contributions
 and+2.557
.+2.409
,+2.362
 a+2.060
 the+2.055
Neg logit contributions
umm-6.682
osity-6.468
ocol-6.429
lor-6.418
 former-6.392
Token day
Token ID1110
Feature activation-1.00e+02

Pos logit contributions
 and+3.201
,+3.095
.+3.005
 the+2.851
 a+2.776
Neg logit contributions
umm-6.190
lor-5.936
osity-5.868
vation-5.849
ubb-5.788
Token,
Token ID11
Feature activation-1.00e+02

Pos logit contributions
 and+0.759
.+0.513
,+0.475
 the+0.268
 a+0.233
Neg logit contributions
umm-8.443
lor-8.293
osity-8.223
ocol-8.197
vation-8.174
Token the
Token ID262
Feature activation-1.00e+02

Pos logit contributions
 and+1.988
,+1.843
.+1.794
 the+1.445
 a+1.383
Neg logit contributions
umm-7.253
lor-7.065
osity-6.996
vation-6.969
arians-6.960
Token toy
Token ID13373
Feature activation-1.00e+02

Pos logit contributions
 and+4.496
,+4.469
.+4.329
 the+4.163
 a+4.047
Neg logit contributions
umm-4.868
lor-4.576
arians-4.525
vation-4.516
osity-4.508
Token fell
Token ID3214
Feature activation-1.00e+02

Pos logit contributions
 and+3.946
,+3.850
.+3.768
 the+3.636
 a+3.549
Neg logit contributions
umm-5.336
lor-5.208
vation-5.076
arians-5.066
osity-5.042
Token can
Token ID460
Feature activation-1.00e+02

Pos logit contributions
 and+4.230
,+4.132
.+4.026
 the+3.909
 a+3.816
Neg logit contributions
umm-5.078
lor-4.902
osity-4.833
vation-4.813
rol-4.757
Token't
Token ID470
Feature activation-1.00e+02

Pos logit contributions
 and+4.414
,+4.301
.+4.197
 the+4.059
 a+3.982
Neg logit contributions
umm-4.931
lor-4.752
vation-4.673
osity-4.663
arians-4.628
Token fix
Token ID4259
Feature activation-1.00e+02

Pos logit contributions
 and+4.234
,+4.138
.+3.982
 the+3.932
 a+3.851
Neg logit contributions
umm-5.036
lor-4.816
vation-4.811
osity-4.749
arians-4.732
Token it
Token ID340
Feature activation-1.00e+02

Pos logit contributions
 and+1.749
,+1.595
.+1.434
 a+1.200
 the+1.158
Neg logit contributions
umm-7.428
lor-7.291
ocol-7.250
arians-7.169
vation-7.141
Token.
Token ID13
Feature activation-1.00e+02

Pos logit contributions
 and+2.514
,+2.383
 the+2.255
.+2.183
 a+2.170
Neg logit contributions
umm-6.676
lor-6.446
vation-6.351
arians-6.348
osity-6.340
Token I
Token ID314
Feature activation-1.00e+02

Pos logit contributions
 and+5.149
,+5.050
.+4.943
 the+4.792
 a+4.699
Neg logit contributions
umm-4.298
lor-4.067
osity-4.013
arians-3.936
vation-3.932
Token love
Token ID1842
Feature activation-1.00e+02

Pos logit contributions
 and+4.181
,+4.083
.+3.975
 the+3.865
 a+3.772
Neg logit contributions
umm-5.123
lor-4.947
osity-4.873
vation-4.854
rol-4.805
Token my
Token ID616
Feature activation-1.00e+02

Pos logit contributions
 and+3.089
,+2.999
.+2.852
 a+2.608
 the+2.585
Neg logit contributions
umm-6.210
ocol-5.954
osity-5.906
arians-5.895
lor-5.867
Token toy
Token ID13373
Feature activation-1.00e+02

Pos logit contributions
 and+3.529
,+3.432
.+3.283
 the+3.243
 a+3.107
Neg logit contributions
umm-5.763
 former-5.478
arians-5.451
lor-5.430
osity-5.417
Token."
Token ID526
Feature activation-1.00e+02

Pos logit contributions
 and+2.642
,+2.551
 the+2.410
.+2.392
 a+2.323
Neg logit contributions
umm-6.593
lor-6.381
arians-6.371
vation-6.269
osity-6.258
Token
Token ID198
Feature activation-1.00e+02

Pos logit contributions
 and+2.391
,+2.297
.+2.192
 the+2.028
 a+2.010
Neg logit contributions
umm-6.904
lor-6.708
osity-6.600
vation-6.598
arians-6.558
Token He
Token ID679
Feature activation-1.00e+02

Pos logit contributions
 and+3.543
,+3.435
.+3.339
 the+3.176
 a+3.122
Neg logit contributions
umm-5.858
lor-5.553
osity-5.527
arians-5.479
vation-5.470
Token went
Token ID1816
Feature activation-1.00e+02

Pos logit contributions
 and+3.351
,+3.276
.+3.204
 the+3.016
 a+2.930
Neg logit contributions
umm-5.887
lor-5.765
arians-5.652
vation-5.627
osity-5.614
Token to
Token ID284
Feature activation-1.00e+02

Pos logit contributions
 and+1.751
,+1.662
.+1.587
 the+1.408
 a+1.329
Neg logit contributions
umm-7.467
lor-7.325
osity-7.259
vation-7.257
arians-7.195
Token the
Token ID262
Feature activation-1.00e+02

Pos logit contributions
 and+2.098
,+2.004
.+1.873
 the+1.525
 a+1.524
Neg logit contributions
umm-7.074
vation-6.942
lor-6.895
osity-6.872
arians-6.854
Token unknown
Token ID6439
Feature activation-1.00e+02

Pos logit contributions
 and+4.707
,+4.576
.+4.458
 the+4.245
 a+4.167
Neg logit contributions
umm-4.794
lor-4.552
osity-4.467
vation-4.442
arians-4.402
Token person
Token ID1048
Feature activation-1.00e+02

Pos logit contributions
 and+4.155
,+4.032
.+3.925
 the+3.844
 a+3.799
Neg logit contributions
umm-5.145
lor-4.904
osity-4.803
vation-4.773
arians-4.766
Token and
Token ID290
Feature activation-1.00e+02

Pos logit contributions
 and+1.918
,+1.835
.+1.755
 the+1.681
 a+1.661
Neg logit contributions
umm-6.944
lor-6.917
osity-6.856
arians-6.841
vation-6.806
Token said
Token ID531
Feature activation-1.00e+02

Pos logit contributions
 and+3.348
,+3.231
.+3.178
 a+2.842
 the+2.841
Neg logit contributions
umm-5.825
vation-5.665
lor-5.665
arians-5.646
osity-5.645
Token,
Token ID11
Feature activation-1.00e+02

Pos logit contributions
 and+1.798
,+1.551
.+1.548
 the+1.384
 a+1.310
Neg logit contributions
umm-7.516
lor-7.365
osity-7.302
vation-7.206
arians-7.183
Token "
Token ID366
Feature activation-1.00e+02

Pos logit contributions
 and+2.615
,+2.487
.+2.419
 the+2.286
 a+2.203
Neg logit contributions
umm-6.517
osity-6.404
lor-6.369
arians-6.298
plets-6.220
TokenI
Token ID40
Feature activation-1.00e+02

Pos logit contributions
 and+5.969
,+5.885
.+5.766
 the+5.600
 a+5.531
Neg logit contributions
umm-3.516
lor-3.316
vation-3.220
osity-3.181
arians-3.144
Token needed
Token ID2622
Feature activation-1.00e+02

Pos logit contributions
 and+3.674
,+3.578
.+3.474
 the+3.293
 a+3.248
Neg logit contributions
umm-5.573
lor-5.388
osity-5.354
vation-5.322
arians-5.298
Token help
Token ID1037
Feature activation-1.00e+02

Pos logit contributions
 and+2.985
,+2.818
.+2.684
 the+2.442
 a+2.294
Neg logit contributions
umm-6.196
lor-6.072
ocol-6.050
vation-5.960
osity-5.951
Token.
Token ID13
Feature activation-1.00e+02

Pos logit contributions
 and+1.249
,+1.089
.+0.857
 the+0.836
 a+0.777
Neg logit contributions
umm-7.622
ocol-7.506
arians-7.501
lor-7.441
osity-7.432
Token The
Token ID383
Feature activation-1.00e+02

Pos logit contributions
 and+3.341
,+3.236
.+3.126
 the+2.948
 a+2.899
Neg logit contributions
umm-6.036
lor-5.833
osity-5.734
vation-5.723
arians-5.668
Token unknown
Token ID6439
Feature activation-1.00e+02

Pos logit contributions
 and+4.270
,+4.136
.+4.017
 the+3.860
 a+3.786
Neg logit contributions
umm-5.176
lor-4.965
vation-4.889
osity-4.871
arians-4.802
Token person
Token ID1048
Feature activation-1.00e+02

Pos logit contributions
 and+4.193
,+4.052
.+3.952
 the+3.841
 a+3.774
Neg logit contributions
umm-5.131
lor-4.911
vation-4.854
osity-4.837
arians-4.759
Token had
Token ID550
Feature activation-1.00e+02

Pos logit contributions
 and+3.744
,+3.616
.+3.538
 the+3.397
 a+3.338
Neg logit contributions
umm-5.400
lor-5.260
osity-5.241
vation-5.194
arians-5.193
Token a
Token ID257
Feature activation-1.00e+02

Pos logit contributions
 and+3.468
,+3.399
.+3.233
 the+2.957
 a+2.803
Neg logit contributions
umm-5.676
lor-5.665
osity-5.526
 former-5.502
vation-5.451
Token flat
Token ID6228
Feature activation-1.00e+02

Pos logit contributions
 and+4.291
,+4.133
.+4.096
 the+3.938
 a+3.794
Neg logit contributions
umm-5.051
lor-4.846
osity-4.769
 former-4.734
vation-4.730
Token tire
Token ID15867
Feature activation-1.00e+02

Pos logit contributions
 and+5.136
,+4.993
.+4.978
 the+4.870
 a+4.771
Neg logit contributions
umm-4.164
lor-3.939
osity-3.807
ocol-3.787
 former-3.781
Token on
Token ID319
Feature activation-1.00e+02

Pos logit contributions
 and+1.790
,+1.699
 the+1.591
.+1.528
 a+1.472
Neg logit contributions
umm-7.248
lor-7.060
osity-7.003
arians-6.981
vation-6.956
Token,
Token ID11
Feature activation-1.00e+02

Pos logit contributions
 and+0.265
.+0.056
,+0.020
 the-0.088
 a-0.155
Neg logit contributions
umm-8.971
lor-8.833
osity-8.735
vation-8.669
rol-8.663
Token there
Token ID612
Feature activation-1.00e+02

Pos logit contributions
 and+1.940
,+1.790
.+1.731
 the+1.546
 a+1.459
Neg logit contributions
umm-7.329
lor-7.147
osity-7.099
vation-7.024
rol-7.018
Token was
Token ID373
Feature activation-1.00e+02

Pos logit contributions
 and+1.614
,+1.449
.+1.407
 the+1.221
 a+1.190
Neg logit contributions
umm-7.644
lor-7.480
osity-7.328
vation-7.322
rol-7.315
Token a
Token ID257
Feature activation-1.00e+02

Pos logit contributions
 and+0.626
,+0.539
.+0.442
 the+0.229
 a+0.079
Neg logit contributions
umm-8.702
lor-8.470
osity-8.361
vation-8.346
rol-8.336
Token kind
Token ID1611
Feature activation-1.00e+02

Pos logit contributions
 and+2.975
,+2.860
.+2.768
 the+2.616
 a+2.543
Neg logit contributions
umm-6.405
lor-6.182
osity-6.097
vation-6.085
arians-6.034
Token person
Token ID1048
Feature activation-1.00e+02

Pos logit contributions
 and+4.255
,+4.146
.+4.069
 the+3.933
 a+3.868
Neg logit contributions
umm-5.098
lor-4.856
osity-4.781
vation-4.727
arians-4.690
Token named
Token ID3706
Feature activation-1.00e+02

Pos logit contributions
 and+3.705
,+3.589
.+3.468
 the+3.367
 a+3.309
Neg logit contributions
umm-5.580
lor-5.420
osity-5.317
vation-5.285
arians-5.251
Token Tom
Token ID4186
Feature activation-1.00e+02

Pos logit contributions
 and+3.310
,+3.112
.+3.031
 the+2.786
 a+2.713
Neg logit contributions
umm-6.112
lor-5.837
osity-5.827
arians-5.777
vation-5.761
Token.
Token ID13
Feature activation-1.00e+02

Pos logit contributions
 and+0.460
,+0.377
.+0.185
 the+0.163
 a+0.095
Neg logit contributions
lor-8.502
umm-8.501
osity-8.352
vation-8.278
rim-8.269
Token Tom
Token ID4186
Feature activation-1.00e+02

Pos logit contributions
 and+2.886
,+2.775
.+2.673
 the+2.527
 a+2.454
Neg logit contributions
umm-6.545
lor-6.364
vation-6.250
osity-6.249
arians-6.190
Token liked
Token ID8288
Feature activation-1.00e+02

Pos logit contributions
 and+3.373
,+3.274
.+3.187
 the+3.045
 a+2.965
Neg logit contributions
umm-5.879
lor-5.709
osity-5.603
vation-5.594
arians-5.549
Token,
Token ID11
Feature activation-1.00e+02

Pos logit contributions
 and+0.190
,+0.010
.-0.034
 the-0.180
 a-0.251
Neg logit contributions
umm-9.149
lor-8.942
osity-8.868
vation-8.847
rol-8.776
Token Tom
Token ID4186
Feature activation-1.00e+02

Pos logit contributions
 and+2.534
,+2.393
.+2.318
 the+2.102
 a+2.026
Neg logit contributions
umm-6.809
lor-6.550
osity-6.519
vation-6.511
arians-6.441
Token saw
Token ID2497
Feature activation-1.00e+02

Pos logit contributions
 and+3.121
,+3.047
.+2.992
 the+2.840
 a+2.763
Neg logit contributions
umm-6.025
lor-5.892
osity-5.764
vation-5.763
arians-5.737
Token an
Token ID281
Feature activation-1.00e+02

Pos logit contributions
 and+1.001
,+0.875
.+0.785
 the+0.440
 a+0.310
Neg logit contributions
umm-8.286
lor-8.098
osity-8.059
vation-7.952
ocol-7.950
Token unknown
Token ID6439
Feature activation-1.00e+02

Pos logit contributions
 and+4.504
,+4.402
.+4.322
 the+4.165
 a+4.068
Neg logit contributions
umm-4.903
lor-4.577
osity-4.536
vation-4.515
ocol-4.502
Token person
Token ID1048
Feature activation-1.00e+02

Pos logit contributions
 and+4.877
,+4.762
.+4.662
 the+4.568
 a+4.483
Neg logit contributions
umm-4.478
lor-4.186
osity-4.132
vation-4.081
ocol-4.041
Token who
Token ID508
Feature activation-1.00e+02

Pos logit contributions
 and+2.310
,+2.219
.+2.051
 the+1.996
 a+1.959
Neg logit contributions
umm-6.758
lor-6.671
osity-6.600
vation-6.591
arians-6.552
Token needed
Token ID2622
Feature activation-1.00e+02

Pos logit contributions
 and+3.674
,+3.578
.+3.474
 the+3.293
 a+3.248
Neg logit contributions
umm-5.573
lor-5.388
osity-5.354
vation-5.322
arians-5.298
Token help
Token ID1037
Feature activation-1.00e+02

Pos logit contributions
 and+2.985
,+2.818
.+2.684
 the+2.442
 a+2.294
Neg logit contributions
umm-6.196
lor-6.072
ocol-6.050
vation-5.960
osity-5.951
Token.
Token ID13
Feature activation-1.00e+02

Pos logit contributions
 and+1.249
,+1.089
.+0.857
 the+0.836
 a+0.777
Neg logit contributions
umm-7.622
ocol-7.506
arians-7.501
lor-7.441
osity-7.432
Token The
Token ID383
Feature activation-1.00e+02

Pos logit contributions
 and+3.341
,+3.236
.+3.126
 the+2.948
 a+2.899
Neg logit contributions
umm-6.036
lor-5.833
osity-5.734
vation-5.723
arians-5.668