TOP ACTIVATIONS
MAX = 2.787

children . They stare at Sullivan , some with accusing looks
of downs ides to this card , though . Firstly ,
at King s College , London , is herself
To commemorate the 65 th birthday of Sultan Hassan al Bol
and brand to hotels and golf courses worldwide . Once again
t mark the end of basketball on Al cat raz .
lap , holding hands with Sullivan . Experts believe this could
for 67 seconds when the booster engines separated from the ascending
L HD who is advising Sullivan , says it has had
with plenty of signs for Termin us along the way .
Team by the United States Basketball Writers Association ( USB WA
, two members of the college s student assembly
to describe their feelings about college debt in three em oj
mans the border with North Korea , where reports have emerged
campus of Saint Mary 's College September 1 , 2010 ,
reality ," he says . Sullivan knew his life and career
Larry Updates (@ 1 DL arry News ) August 16
of speaking within a more general , city wide style :
last seen wearing a black baseball cap , black leather coat
owned by Hudson County Community College . The plan is for

MIN ACTIVATIONS
MIN = -2.846

Mill iy et , a smugg ler named H id ir
3 . 5 - liter Eco Boost ® engine that produces
Gujarat Shank ar Bh ai Veg ad . Union Minister P
sounding like cock ney Tony Sop rano .
is to sew a straight seam . To practice sewing a
Show how to sew a seam : One of the most
and his attorney , Josh Engel , said he currently is
y atta travelled to Mog adish u to discuss the issue
of the capital , Mog adish u . They
: This can also be inadvert ent : people might assume
to start developing in AWS Lamb da with server less framework
will be joined by Jim Hub ay and the late John
washed it with T amiya Olive Dr ab . I focused
But then that happened with Moz art . The idea of
win ners of a Tom Em anski - esque back -
cross ing highly traff icked thorough f ares
of the objectionable material being traff icked across peer to peer
No ar Ha O ved Ve Ha L omed , D
and climate scientists who used ecosystem observations , laboratory experiments and
new domain buying service Google Dom ains at 1 . 30

INTERVAL 1.378 - 2.787
CONTAINS 0.531%

... Did that Pokémon just talk ?
is best to wash the Tu pper ware container completely .
., a licensing division of AT & T / Western Electric
Damian Allow ay committed to UCLA on Friday via Ble acher
John McCain . Newsletter Sign Up Continue reading the

INTERVAL -0.030 - 1.378
CONTAINS 52.454%

admitted to the hospital . Duncan had become infected while in
many people may know them by other names such as red
s Prom ethe a , serial ized from 1999
set up , concrete foot ings were poured ( one is
campaign is over , and you return the backer survey ,

INTERVAL -1.438 - -0.030
CONTAINS 46.541%

and Co ., giving up two receptions on three targets for
endowed . It seemed like the proper next step . I
year - old left guard is quietly one of the best
in America . Everyone gave themselves the rhetorical authority to tweet
akes on infinite variety . BC :

INTERVAL -2.846 - -1.438
CONTAINS 0.474%

7 Rams lead . Fil ch ock remained in for B
company in Houston , as well as Qualcomm , the San
Russian - Uk rain ian border , says the underlying cause
4 ms G 2 G AU Opt ronics AH VA (
to the government 's Startup India programme is deb atable ,
Token children
Token ID1751
Feature activation+1.93e-01

Pos logit contributions
 mercy+0.271
theless+0.267
 incap+0.254
ilty+0.244
 autonomy+0.238
Neg logit contributions
virt-0.270
-0.245
-0.243
-0.241
Dem-0.236
Token.
Token ID13
Feature activation+5.86e-01

Pos logit contributions
 mercy+0.555
theless+0.549
 incap+0.518
ilty+0.495
 autonomy+0.481
Neg logit contributions
virt-0.686
-0.629
-0.623
-0.618
Dem-0.608
Token They
Token ID1119
Feature activation+4.21e-01

Pos logit contributions
 mercy+1.678
theless+1.634
 incap+1.548
ilty+1.481
 communion+1.436
Neg logit contributions
virt-2.082
-1.909
-1.888
-1.877
Dem-1.831
Token stare
Token ID24170
Feature activation+3.27e-01

Pos logit contributions
 mercy+1.300
theless+1.278
 incap+1.215
ilty+1.163
 communion+1.133
Neg logit contributions
virt-1.407
-1.286
-1.272
-1.266
Dem-1.239
Token at
Token ID379
Feature activation-1.16e-01

Pos logit contributions
 mercy+0.972
theless+0.950
 incap+0.907
ilty+0.864
 communion+0.843
Neg logit contributions
virt-1.131
-1.037
-1.027
-1.018
Dem-1.000
Token Sullivan
Token ID18501
Feature activation+2.79e+00

Pos logit contributions
virt+0.414
+0.381
+0.375
+0.373
Dem+0.368
Neg logit contributions
theless-0.326
 mercy-0.324
 incap-0.303
ilty-0.292
 autonomy-0.282
Token,
Token ID11
Feature activation+6.16e-01

Pos logit contributions
 mercy+7.065
theless+6.381
 incap+6.262
 communion+5.816
ilty+5.777
Neg logit contributions
virt-10.224
-9.493
-9.459
-9.420
Econom-9.178
Token some
Token ID617
Feature activation-8.66e-01

Pos logit contributions
 mercy+1.750
theless+1.681
 incap+1.617
ilty+1.526
 communion+1.497
Neg logit contributions
virt-2.226
-2.043
-2.026
-2.015
Dem-1.971
Token with
Token ID351
Feature activation+1.05e-01

Pos logit contributions
virt+3.093
+2.852
+2.783
+2.759
Dem+2.747
Neg logit contributions
theless-2.382
 mercy-2.343
 incap-2.208
ilty-2.160
etheless-2.060
Token accusing
Token ID19786
Feature activation-2.99e-02

Pos logit contributions
 mercy+0.307
theless+0.302
 incap+0.287
ilty+0.274
 autonomy+0.266
Neg logit contributions
virt-0.367
-0.336
-0.333
-0.330
Dem-0.325
Token looks
Token ID3073
Feature activation-5.40e-01

Pos logit contributions
virt+0.107
+0.098
+0.097
+0.096
Dem+0.095
Neg logit contributions
theless-0.084
 mercy-0.084
 incap-0.078
ilty-0.075
 autonomy-0.073
Token of
Token ID286
Feature activation-2.17e-01

Pos logit contributions
virt+3.807
+3.432
+3.372
+3.347
Dem+3.306
Neg logit contributions
theless-3.775
 mercy-3.722
 incap-3.541
ilty-3.472
 communion-3.343
Token downs
Token ID21838
Feature activation+1.43e+00

Pos logit contributions
virt+0.746
+0.682
+0.675
+0.670
Dem+0.659
Neg logit contributions
 mercy-0.647
theless-0.640
 incap-0.605
ilty-0.580
 autonomy-0.563
Tokenides
Token ID1460
Feature activation+1.10e+00

Pos logit contributions
 mercy+4.294
theless+4.171
 incap+3.997
ilty+3.926
ints+3.784
Neg logit contributions
virt-4.742
-4.341
-4.293
-4.254
Dem-4.184
Token to
Token ID284
Feature activation+1.02e+00

Pos logit contributions
 mercy+2.863
theless+2.765
 incap+2.617
ilty+2.527
 autonomy+2.434
Neg logit contributions
virt-4.164
-3.845
-3.814
-3.795
Dem-3.729
Token this
Token ID428
Feature activation+1.61e+00

Pos logit contributions
 mercy+2.832
theless+2.721
 incap+2.615
ilty+2.458
 communion+2.420
Neg logit contributions
virt-3.684
-3.423
-3.358
-3.346
Dem-3.296
Token card
Token ID2657
Feature activation+2.70e+00

Pos logit contributions
 mercy+4.294
theless+3.994
 incap+3.921
ilty+3.666
 autonomy+3.589
Neg logit contributions
virt-5.948
-5.470
-5.432
-5.412
Dem-5.303
Token,
Token ID11
Feature activation+6.77e-01

Pos logit contributions
 mercy+7.719
theless+7.345
 incap+7.065
ilty+6.969
 autonomy+6.664
Neg logit contributions
virt-8.676
-8.025
-7.923
-7.880
Dem-7.627
Token though
Token ID996
Feature activation+4.22e-01

Pos logit contributions
 mercy+1.719
theless+1.662
 incap+1.582
ilty+1.479
 autonomy+1.458
Neg logit contributions
virt-2.614
-2.427
-2.400
-2.400
Dem-2.346
Token.
Token ID13
Feature activation+2.27e-01

Pos logit contributions
 mercy+1.181
theless+1.152
 incap+1.092
ilty+1.039
 autonomy+1.015
Neg logit contributions
virt-1.527
-1.402
-1.390
-1.381
Dem-1.355
Token Firstly
Token ID41039
Feature activation-4.55e-01

Pos logit contributions
 mercy+0.744
theless+0.730
 incap+0.699
ilty+0.668
 autonomy+0.654
Neg logit contributions
virt-0.713
-0.647
-0.638
-0.634
Dem-0.621
Token,
Token ID11
Feature activation+2.46e-01

Pos logit contributions
virt+1.583
+1.447
+1.434
+1.423
Dem+1.400
Neg logit contributions
 mercy-1.326
theless-1.325
 incap-1.242
ilty-1.200
 autonomy-1.159
Token at
Token ID379
Feature activation+1.16e-01

Pos logit contributions
virt+1.100
+1.003
+0.991
+0.983
Dem+0.970
Neg logit contributions
 mercy-1.002
theless-0.997
 incap-0.940
ilty-0.901
 autonomy-0.872
Token King
Token ID2677
Feature activation-7.04e-02

Pos logit contributions
 mercy+0.356
theless+0.352
 incap+0.334
ilty+0.320
 autonomy+0.312
Neg logit contributions
virt-0.390
-0.356
-0.352
-0.349
Dem-0.343
Token
Token ID447
Feature activation+4.26e-02

Pos logit contributions
virt+0.282
+0.261
+0.259
+0.257
Dem+0.254
Neg logit contributions
 mercy-0.170
theless-0.168
 incap-0.158
ilty-0.148
 autonomy-0.144
Token
Token ID247
Feature activation+1.99e-01

Pos logit contributions
 mercy+0.204
theless+0.201
 incap+0.195
ilty+0.190
 autonomy+0.187
Neg logit contributions
virt-0.070
-0.058
-0.056
-0.054
Dem-0.053
Tokens
Token ID82
Feature activation+4.94e-01

Pos logit contributions
 mercy+0.676
theless+0.669
 incap+0.638
ilty+0.614
 autonomy+0.599
Neg logit contributions
virt-0.603
-0.545
-0.538
-0.533
Dem-0.523
Token College
Token ID5535
Feature activation+2.69e+00

Pos logit contributions
 mercy+1.416
theless+1.375
 incap+1.303
ilty+1.244
 communion+1.219
Neg logit contributions
virt-1.750
-1.608
-1.595
-1.589
Dem-1.543
Token,
Token ID11
Feature activation+4.26e-01

Pos logit contributions
 mercy+8.082
 incap+7.637
theless+7.637
 communion+7.269
 autonomy+7.211
Neg logit contributions
virt-8.076
-7.409
-7.407
-7.267
Dem-7.012
Token London
Token ID3576
Feature activation-1.28e-01

Pos logit contributions
 mercy+1.514
theless+1.488
 incap+1.435
ilty+1.380
 communion+1.351
Neg logit contributions
virt-1.219
-1.093
-1.079
-1.073
Dem-1.045
Token,
Token ID11
Feature activation+2.32e-02

Pos logit contributions
virt+0.445
+0.408
+0.403
+0.398
Dem+0.395
Neg logit contributions
 mercy-0.376
theless-0.372
 incap-0.352
ilty-0.336
 autonomy-0.325
Token is
Token ID318
Feature activation-2.28e-01

Pos logit contributions
 mercy+0.074
theless+0.074
 incap+0.070
ilty+0.067
 autonomy+0.065
Neg logit contributions
virt-0.075
-0.068
-0.067
-0.067
Dem-0.065
Token herself
Token ID5223
Feature activation-9.02e-01

Pos logit contributions
virt+0.789
+0.723
+0.715
+0.709
Dem+0.698
Neg logit contributions
 mercy-0.674
theless-0.666
 incap-0.630
ilty-0.603
 autonomy-0.585
TokenTo
Token ID2514
Feature activation+5.07e-02

Pos logit contributions
virt+0.832
+0.695
+0.679
+0.671
Dem+0.662
Neg logit contributions
 mercy-2.060
theless-2.048
 incap-1.976
ilty-1.910
 autonomy-1.887
Token commemorate
Token ID40243
Feature activation+9.28e-01

Pos logit contributions
 mercy+0.156
theless+0.155
 incap+0.146
ilty+0.140
 autonomy+0.137
Neg logit contributions
virt-0.169
-0.154
-0.153
-0.151
Dem-0.149
Token the
Token ID262
Feature activation+5.74e-01

Pos logit contributions
 mercy+2.670
theless+2.615
 incap+2.483
ilty+2.362
 communion+2.318
Neg logit contributions
virt-3.281
-3.020
-2.981
-2.964
Dem-2.906
Token 65
Token ID6135
Feature activation+3.04e-01

Pos logit contributions
 mercy+1.856
theless+1.814
 incap+1.739
ilty+1.666
 communion+1.634
Neg logit contributions
virt-1.839
-1.674
-1.651
-1.645
Dem-1.606
Tokenth
Token ID400
Feature activation+1.70e+00

Pos logit contributions
 mercy+0.695
theless+0.684
 incap+0.636
ilty+0.600
 autonomy+0.577
Neg logit contributions
virt-1.258
-1.170
-1.159
-1.152
Dem-1.136
Token birthday
Token ID10955
Feature activation+2.46e+00

Pos logit contributions
 mercy+3.062
theless+2.813
 incap+2.700
 communion+2.480
ilty+2.450
Neg logit contributions
virt-7.790
-7.191
-7.165
-7.071
Dem-7.011
Token of
Token ID286
Feature activation+9.87e-01

Pos logit contributions
 mercy+6.271
theless+5.928
 incap+5.684
ilty+5.543
 communion+5.367
Neg logit contributions
virt-8.965
-8.163
-8.146
-8.117
Dem-7.938
Token Sultan
Token ID37399
Feature activation-1.95e-01

Pos logit contributions
 mercy+3.103
theless+3.035
 incap+2.890
ilty+2.756
 autonomy+2.739
Neg logit contributions
virt-3.215
-2.931
-2.894
-2.881
Dem-2.818
Token Hassan
Token ID24485
Feature activation+3.35e-01

Pos logit contributions
virt+0.579
+0.520
+0.510
+0.508
Dem+0.499
Neg logit contributions
theless-0.673
 mercy-0.672
 incap-0.639
ilty-0.614
 communion-0.600
Tokenal
Token ID282
Feature activation+5.12e-01

Pos logit contributions
 mercy+1.402
theless+1.384
 incap+1.333
ilty+1.294
 autonomy+1.271
Neg logit contributions
virt-0.747
-0.654
-0.643
-0.631
Dem-0.614
Token Bol
Token ID10797
Feature activation+6.08e-01

Pos logit contributions
 mercy+1.535
theless+1.507
 incap+1.439
ilty+1.385
 autonomy+1.341
Neg logit contributions
virt-1.738
-1.597
-1.580
-1.559
Dem-1.538
Token and
Token ID290
Feature activation+4.24e-01

Pos logit contributions
 mercy+0.780
theless+0.768
 incap+0.726
ilty+0.694
 autonomy+0.674
Neg logit contributions
virt-0.991
-0.912
-0.902
-0.895
Dem-0.881
Token brand
Token ID4508
Feature activation+2.73e-01

Pos logit contributions
 mercy+1.215
theless+1.192
 incap+1.132
ilty+1.076
 autonomy+1.051
Neg logit contributions
virt-1.507
-1.387
-1.374
-1.363
Dem-1.339
Token to
Token ID284
Feature activation+5.00e-01

Pos logit contributions
 mercy+0.769
theless+0.755
 incap+0.716
ilty+0.684
 autonomy+0.667
Neg logit contributions
virt-0.980
-0.902
-0.893
-0.885
Dem-0.872
Token hotels
Token ID18505
Feature activation+6.13e-01

Pos logit contributions
 mercy+1.487
theless+1.453
 incap+1.388
ilty+1.317
 communion+1.294
Neg logit contributions
virt-1.717
-1.587
-1.565
-1.555
Dem-1.514
Token and
Token ID290
Feature activation-1.19e-01

Pos logit contributions
 mercy+1.728
theless+1.682
 incap+1.605
ilty+1.534
 autonomy+1.491
Neg logit contributions
virt-2.195
-2.021
-2.003
-1.992
Dem-1.953
Token golf
Token ID13126
Feature activation+2.42e+00

Pos logit contributions
virt+0.410
+0.373
+0.370
+0.366
Dem+0.363
Neg logit contributions
 mercy-0.355
theless-0.353
 incap-0.331
ilty-0.320
 autonomy-0.308
Token courses
Token ID10902
Feature activation+7.22e-01

Pos logit contributions
 mercy+5.412
 incap+5.024
theless+4.971
 autonomy+4.608
ces+4.562
Neg logit contributions
virt-9.126
-8.830
-8.712
-8.712
 ILCS-8.465
Token worldwide
Token ID8688
Feature activation+7.98e-02

Pos logit contributions
 mercy+2.066
theless+2.019
 incap+1.919
ilty+1.833
 autonomy+1.787
Neg logit contributions
virt-2.541
-2.339
-2.319
-2.305
Dem-2.261
Token.
Token ID13
Feature activation+2.30e-01

Pos logit contributions
 mercy+0.243
theless+0.241
 incap+0.228
ilty+0.218
 autonomy+0.212
Neg logit contributions
virt-0.270
-0.246
-0.244
-0.242
Dem-0.238
Token Once
Token ID4874
Feature activation+9.47e-01

Pos logit contributions
 mercy+0.678
theless+0.665
 incap+0.626
ilty+0.603
 autonomy+0.585
Neg logit contributions
virt-0.798
-0.733
-0.726
-0.718
Dem-0.702
Token again
Token ID757
Feature activation-9.15e-02

Pos logit contributions
 mercy+2.619
theless+2.561
 incap+2.416
ilty+2.279
 autonomy+2.264
Neg logit contributions
virt-3.450
-3.193
-3.155
-3.105
Dem-3.063
Tokent
Token ID83
Feature activation-2.33e-01

Pos logit contributions
virt+0.233
+0.204
+0.201
+0.199
Dem+0.193
Neg logit contributions
 mercy-0.401
theless-0.398
 incap-0.382
ilty-0.371
 autonomy-0.363
Token mark
Token ID1317
Feature activation-3.16e-01

Pos logit contributions
virt+0.733
+0.667
+0.658
+0.653
Dem+0.639
Neg logit contributions
 mercy-0.767
theless-0.758
 incap-0.721
ilty-0.692
 autonomy-0.676
Token the
Token ID262
Feature activation-7.10e-01

Pos logit contributions
virt+1.121
+1.028
+1.017
+1.014
Dem+0.999
Neg logit contributions
 mercy-0.903
theless-0.900
 incap-0.845
ilty-0.810
 autonomy-0.781
Token end
Token ID886
Feature activation-1.31e+00

Pos logit contributions
virt+2.261
+2.058
+2.039
+2.011
Dem+1.990
Neg logit contributions
theless-2.256
 mercy-2.241
 incap-2.117
ilty-2.050
imposed-1.983
Token of
Token ID286
Feature activation+2.19e-02

Pos logit contributions
virt+4.014
+3.690
+3.558
+3.543
Dem+3.513
Neg logit contributions
theless-4.317
 mercy-4.227
 incap-4.088
ilty-3.896
imposed-3.774
Token basketball
Token ID9669
Feature activation+2.41e+00

Pos logit contributions
 mercy+0.062
theless+0.062
 incap+0.058
ilty+0.056
 autonomy+0.054
Neg logit contributions
virt-0.078
-0.072
-0.071
-0.071
Dem-0.070
Token on
Token ID319
Feature activation+9.85e-02

Pos logit contributions
 mercy+6.500
theless+5.987
 incap+5.890
 communion+5.760
 autonomy+5.744
Neg logit contributions
virt-8.412
-7.987
-7.833
-7.823
Dem-7.450
Token Al
Token ID978
Feature activation+1.37e-01

Pos logit contributions
 mercy+0.317
theless+0.313
 incap+0.298
ilty+0.286
 autonomy+0.279
Neg logit contributions
virt-0.316
-0.287
-0.284
-0.281
Dem-0.275
Tokencat
Token ID9246
Feature activation-3.45e-01

Pos logit contributions
 mercy+0.376
theless+0.368
 incap+0.350
ilty+0.342
imposed+0.329
Neg logit contributions
virt-0.491
-0.461
-0.459
-0.450
Dem-0.438
Tokenraz
Token ID3247
Feature activation+3.90e-02

Pos logit contributions
virt+1.127
+1.027
+1.020
+1.009
Dem+0.990
Neg logit contributions
 mercy-1.082
theless-1.065
 incap-1.013
ilty-0.978
 autonomy-0.949
Token.
Token ID13
Feature activation-7.11e-02

Pos logit contributions
 mercy+0.119
theless+0.118
 incap+0.112
ilty+0.107
 autonomy+0.104
Neg logit contributions
virt-0.131
-0.120
-0.118
-0.118
Dem-0.116
Token lap
Token ID14779
Feature activation-1.11e+00

Pos logit contributions
virt+0.384
+0.349
+0.345
+0.342
Dem+0.336
Neg logit contributions
 mercy-0.400
theless-0.395
 incap-0.375
ilty-0.361
 autonomy-0.352
Token,
Token ID11
Feature activation+8.18e-02

Pos logit contributions
virt+3.790
+3.436
+3.353
Dem+3.300
+3.291
Neg logit contributions
theless-3.343
 mercy-3.283
 incap-3.171
ilty-2.960
 autonomy-2.923
Token holding
Token ID4769
Feature activation+6.79e-01

Pos logit contributions
 mercy+0.226
theless+0.223
 incap+0.210
ilty+0.201
 autonomy+0.194
Neg logit contributions
virt-0.299
-0.275
-0.272
-0.270
Dem-0.266
Token hands
Token ID2832
Feature activation+1.19e-01

Pos logit contributions
 mercy+1.965
theless+1.926
 incap+1.822
ilty+1.745
 communion+1.709
Neg logit contributions
virt-2.397
-2.207
-2.190
-2.168
Dem-2.127
Token with
Token ID351
Feature activation+2.94e-01

Pos logit contributions
 mercy+0.362
theless+0.360
 incap+0.341
ilty+0.326
 autonomy+0.317
Neg logit contributions
virt-0.399
-0.364
-0.359
-0.355
Dem-0.351
Token Sullivan
Token ID18501
Feature activation+2.41e+00

Pos logit contributions
 mercy+0.828
theless+0.819
 incap+0.773
ilty+0.738
 autonomy+0.715
Neg logit contributions
virt-1.057
-0.971
-0.960
-0.953
Dem-0.939
Token.
Token ID13
Feature activation+1.53e-01

Pos logit contributions
 mercy+7.005
theless+6.634
 incap+6.312
ilty+6.045
 communion+5.983
Neg logit contributions
virt-8.043
-7.425
-7.359
-7.285
Dem-7.140
Token Experts
Token ID36095
Feature activation+8.84e-02

Pos logit contributions
 mercy+0.465
theless+0.461
 incap+0.438
ilty+0.419
 autonomy+0.408
Neg logit contributions
virt-0.516
-0.472
-0.466
-0.463
Dem-0.456
Token believe
Token ID1975
Feature activation+2.28e-02

Pos logit contributions
 mercy+0.295
theless+0.292
 incap+0.278
ilty+0.268
 autonomy+0.261
Neg logit contributions
virt-0.273
-0.246
-0.244
-0.241
Dem-0.237
Token this
Token ID428
Feature activation+7.31e-01

Pos logit contributions
 mercy+0.065
theless+0.065
 incap+0.061
ilty+0.059
 autonomy+0.057
Neg logit contributions
virt-0.081
-0.074
-0.073
-0.073
Dem-0.072
Token could
Token ID714
Feature activation-4.30e-01

Pos logit contributions
 mercy+2.121
theless+2.070
 incap+1.968
ilty+1.860
 autonomy+1.839
Neg logit contributions
virt-2.580
-2.371
-2.353
-2.332
Dem-2.291
Token for
Token ID329
Feature activation+6.62e-01

Pos logit contributions
 mercy+0.065
theless+0.065
 incap+0.061
ilty+0.058
 autonomy+0.057
Neg logit contributions
virt-0.082
-0.075
-0.074
-0.074
Dem-0.073
Token 67
Token ID8275
Feature activation+1.91e-01

Pos logit contributions
 mercy+1.958
theless+1.914
 incap+1.829
ilty+1.727
 autonomy+1.705
Neg logit contributions
virt-2.299
-2.112
-2.095
-2.079
Dem-2.039
Token seconds
Token ID4201
Feature activation+6.89e-01

Pos logit contributions
 mercy+0.581
theless+0.573
 incap+0.544
ilty+0.521
 autonomy+0.507
Neg logit contributions
virt-0.646
-0.590
-0.584
-0.580
Dem-0.569
Token when
Token ID618
Feature activation+7.58e-03

Pos logit contributions
 mercy+1.955
theless+1.912
 incap+1.821
ilty+1.726
 communion+1.689
Neg logit contributions
virt-2.459
-2.257
-2.239
-2.233
Dem-2.187
Token the
Token ID262
Feature activation-1.27e-01

Pos logit contributions
 mercy+0.021
theless+0.021
 incap+0.020
ilty+0.019
 autonomy+0.018
Neg logit contributions
virt-0.028
-0.025
-0.025
-0.025
Dem-0.025
Token booster
Token ID30430
Feature activation+2.40e+00

Pos logit contributions
virt+0.450
+0.413
+0.408
+0.405
Dem+0.399
Neg logit contributions
 mercy-0.367
theless-0.363
 incap-0.343
ilty-0.328
 autonomy-0.318
Token engines
Token ID11874
Feature activation+3.12e-01

Pos logit contributions
 mercy+6.460
theless+6.112
 incap+5.997
 autonomy+5.623
 retrieval+5.602
Neg logit contributions
virt-8.540
-7.856
-7.810
-7.694
Dem-7.545
Token separated
Token ID11266
Feature activation-4.10e-01

Pos logit contributions
 mercy+0.845
theless+0.834
 incap+0.784
ilty+0.747
 autonomy+0.724
Neg logit contributions
virt-1.161
-1.070
-1.059
-1.051
Dem-1.035
Token from
Token ID422
Feature activation+4.48e-01

Pos logit contributions
virt+1.404
+1.282
+1.262
+1.259
Dem+1.233
Neg logit contributions
theless-1.218
 mercy-1.215
 incap-1.141
ilty-1.099
imposed-1.059
Token the
Token ID262
Feature activation+5.45e-01

Pos logit contributions
 mercy+1.232
theless+1.213
 incap+1.144
ilty+1.090
 autonomy+1.060
Neg logit contributions
virt-1.646
-1.514
-1.499
-1.487
Dem-1.464
Token ascending
Token ID41988
Feature activation+2.70e-01

Pos logit contributions
 mercy+1.539
theless+1.505
 incap+1.426
ilty+1.352
 autonomy+1.330
Neg logit contributions
virt-1.978
-1.817
-1.800
-1.781
Dem-1.748
Token L
Token ID406
Feature activation+1.36e-02

Pos logit contributions
virt+0.142
+0.128
+0.127
+0.126
Dem+0.124
Neg logit contributions
 mercy-0.151
theless-0.150
 incap-0.143
ilty-0.137
 autonomy-0.134
TokenHD
Token ID10227
Feature activation+6.93e-01

Pos logit contributions
 mercy+0.044
theless+0.044
 incap+0.042
ilty+0.040
 autonomy+0.039
Neg logit contributions
virt-0.043
-0.039
-0.039
-0.038
Dem-0.038
Token who
Token ID508
Feature activation-2.77e-01

Pos logit contributions
 mercy+2.254
theless+2.217
 incap+2.120
ilty+2.052
 autonomy+1.983
Neg logit contributions
virt-2.171
-1.991
-1.958
-1.948
Dem-1.891
Token is
Token ID318
Feature activation-2.24e-01

Pos logit contributions
virt+0.898
+0.815
+0.805
+0.796
Dem+0.788
Neg logit contributions
 mercy-0.879
theless-0.873
 incap-0.828
ilty-0.793
 autonomy-0.773
Token advising
Token ID30341
Feature activation+8.50e-03

Pos logit contributions
virt+0.768
+0.703
+0.695
+0.690
Dem+0.678
Neg logit contributions
 mercy-0.673
theless-0.664
 incap-0.630
ilty-0.603
 autonomy-0.586
Token Sullivan
Token ID18501
Feature activation+2.39e+00

Pos logit contributions
 mercy+0.024
theless+0.024
 incap+0.023
ilty+0.022
 autonomy+0.021
Neg logit contributions
virt-0.030
-0.028
-0.027
-0.027
Dem-0.027
Token,
Token ID11
Feature activation+2.37e-01

Pos logit contributions
 mercy+6.157
theless+5.737
 incap+5.498
ilty+5.382
 autonomy+5.203
Neg logit contributions
virt-8.773
-8.241
-8.122
-8.010
Dem-7.849
Token says
Token ID1139
Feature activation+2.11e-01

Pos logit contributions
 mercy+0.736
theless+0.726
 incap+0.689
ilty+0.660
 autonomy+0.643
Neg logit contributions
virt-0.788
-0.719
-0.711
-0.706
Dem-0.693
Token it
Token ID340
Feature activation+8.18e-01

Pos logit contributions
 mercy+0.602
theless+0.600
 incap+0.562
ilty+0.539
 communion+0.521
Neg logit contributions
virt-0.751
-0.688
-0.681
-0.676
Dem-0.666
Token has
Token ID468
Feature activation+6.64e-04

Pos logit contributions
 mercy+2.260
theless+2.210
 incap+2.092
ilty+1.977
 autonomy+1.929
Neg logit contributions
virt-2.994
-2.765
-2.738
-2.727
Dem-2.674
Token had
Token ID550
Feature activation+4.58e-01

Pos logit contributions
 mercy+0.002
theless+0.002
 incap+0.002
ilty+0.002
 communion+0.002
Neg logit contributions
virt-0.002
-0.002
-0.002
-0.002
Dem-0.002
Token with
Token ID351
Feature activation-5.28e-02

Pos logit contributions
virt+0.576
+0.527
+0.521
+0.517
Dem+0.509
Neg logit contributions
 mercy-0.497
theless-0.492
 incap-0.465
ilty-0.446
 autonomy-0.433
Token plenty
Token ID6088
Feature activation-4.77e-01

Pos logit contributions
virt+0.182
+0.166
+0.164
+0.163
Dem+0.160
Neg logit contributions
 mercy-0.157
theless-0.155
 incap-0.147
ilty-0.141
 autonomy-0.137
Token of
Token ID286
Feature activation-4.57e-01

Pos logit contributions
virt+1.523
+1.396
+1.358
+1.357
Dem+1.326
Neg logit contributions
 mercy-1.511
theless-1.510
 incap-1.445
ilty-1.382
 communion-1.343
Token signs
Token ID5895
Feature activation-3.47e-01

Pos logit contributions
virt+1.579
+1.447
+1.428
+1.422
Dem+1.399
Neg logit contributions
 mercy-1.324
theless-1.323
 incap-1.246
ilty-1.200
imposed-1.160
Token for
Token ID329
Feature activation+1.50e-01

Pos logit contributions
virt+1.200
+1.101
+1.086
+1.078
Dem+1.063
Neg logit contributions
 mercy-1.017
theless-1.015
 incap-0.956
ilty-0.917
 autonomy-0.888
Token Termin
Token ID15527
Feature activation+2.37e+00

Pos logit contributions
 mercy+0.426
theless+0.421
 incap+0.397
ilty+0.380
 autonomy+0.368
Neg logit contributions
virt-0.538
-0.494
-0.488
-0.485
Dem-0.477
Tokenus
Token ID385
Feature activation+5.57e-01

Pos logit contributions
 mercy+6.932
theless+6.559
 incap+6.411
ilty+6.245
ints+6.093
Neg logit contributions
virt-7.534
-7.167
-7.154
-6.749
-6.631
Token along
Token ID1863
Feature activation+1.72e-02

Pos logit contributions
 mercy+1.651
theless+1.627
 incap+1.542
ilty+1.475
 autonomy+1.435
Neg logit contributions
virt-1.927
-1.766
-1.746
-1.731
Dem-1.701
Token the
Token ID262
Feature activation+1.20e-01

Pos logit contributions
 mercy+0.049
theless+0.049
 incap+0.046
ilty+0.044
 autonomy+0.043
Neg logit contributions
virt-0.061
-0.056
-0.055
-0.055
Dem-0.054
Token way
Token ID835
Feature activation-8.98e-01

Pos logit contributions
 mercy+0.442
theless+0.435
 incap+0.417
ilty+0.403
 autonomy+0.394
Neg logit contributions
virt-0.329
-0.295
-0.291
-0.288
Dem-0.281
Token.
Token ID13
Feature activation+2.09e-02

Pos logit contributions
virt+2.918
+2.672
+2.620
+2.588
Dem+2.586
Neg logit contributions
theless-2.806
 mercy-2.772
 incap-2.640
ilty-2.527
 autonomy-2.462
Token Team
Token ID4816
Feature activation+2.54e-01

Pos logit contributions
virt+1.780
+1.637
+1.623
Dem+1.597
+1.591
Neg logit contributions
 mercy-1.585
theless-1.582
 incap-1.516
ilty-1.432
imposed-1.390
Token by
Token ID416
Feature activation-1.25e-01

Pos logit contributions
 mercy+0.724
theless+0.716
 incap+0.676
ilty+0.645
 autonomy+0.626
Neg logit contributions
virt-0.907
-0.833
-0.824
-0.818
Dem-0.805
Token the
Token ID262
Feature activation-2.92e-01

Pos logit contributions
virt+0.418
+0.379
+0.376
+0.371
Dem+0.367
Neg logit contributions
 mercy-0.385
theless-0.382
 incap-0.362
ilty-0.345
 communion-0.335
Token United
Token ID1578
Feature activation-4.67e-01

Pos logit contributions
virt+0.933
+0.844
+0.836
+0.825
Dem+0.816
Neg logit contributions
 mercy-0.940
theless-0.934
 incap-0.887
ilty-0.849
 autonomy-0.825
Token States
Token ID1829
Feature activation-9.01e-01

Pos logit contributions
virt+1.187
+1.031
+1.026
Dem+1.006
+1.004
Neg logit contributions
 mercy-1.806
theless-1.797
 incap-1.723
ilty-1.654
 communion-1.632
Token Basketball
Token ID25911
Feature activation+2.37e+00

Pos logit contributions
virt+2.962
+2.690
+2.677
Dem+2.613
+2.596
Neg logit contributions
theless-2.787
 mercy-2.755
 incap-2.626
ilty-2.488
 autonomy-2.429
Token Writers
Token ID37732
Feature activation+6.76e-01

Pos logit contributions
 mercy+4.873
theless+4.479
 incap+4.230
ilty+3.982
 autonomy+3.959
Neg logit contributions
virt-9.852
-9.523
-9.386
-9.132
Dem-8.821
Token Association
Token ID5396
Feature activation+4.24e-01

Pos logit contributions
 mercy+1.592
theless+1.542
 incap+1.447
ilty+1.371
 autonomy+1.332
Neg logit contributions
virt-2.747
-2.555
-2.530
-2.527
Dem-2.469
Token (
Token ID357
Feature activation+8.62e-01

Pos logit contributions
 mercy+1.196
theless+1.179
 incap+1.113
ilty+1.065
 autonomy+1.033
Neg logit contributions
virt-1.526
-1.403
-1.388
-1.379
Dem-1.355
TokenUSB
Token ID27155
Feature activation+5.96e-01

Pos logit contributions
 mercy+3.182
theless+3.168
 incap+3.043
ilty+2.973
imposed+2.900
Neg logit contributions
virt-2.229
-2.030
-1.977
-1.973
Dem-1.907
TokenWA
Token ID15543
Feature activation+3.91e-01

Pos logit contributions
 mercy+1.268
theless+1.242
 incap+1.151
ilty+1.085
 autonomy+1.035
Neg logit contributions
virt-2.546
-2.376
-2.352
-2.345
Dem-2.308
Token,
Token ID11
Feature activation-3.90e-02

Pos logit contributions
virt+3.333
+3.020
+2.968
+2.936
Dem+2.910
Neg logit contributions
theless-2.944
 mercy-2.912
 incap-2.764
ilty-2.650
etheless-2.561
Token two
Token ID734
Feature activation-7.07e-01

Pos logit contributions
virt+0.134
+0.122
+0.121
+0.120
Dem+0.118
Neg logit contributions
 mercy-0.117
theless-0.115
 incap-0.109
ilty-0.104
 autonomy-0.102
Token members
Token ID1866
Feature activation+1.21e-01

Pos logit contributions
virt+2.416
+2.193
+2.161
+2.155
Dem+2.129
Neg logit contributions
theless-2.102
 mercy-2.097
 incap-1.976
ilty-1.901
imposed-1.828
Token of
Token ID286
Feature activation-9.49e-04

Pos logit contributions
 mercy+0.377
theless+0.375
 incap+0.354
ilty+0.340
 autonomy+0.331
Neg logit contributions
virt-0.398
-0.362
-0.359
-0.355
Dem-0.349
Token the
Token ID262
Feature activation+5.27e-02

Pos logit contributions
virt+0.003
+0.003
+0.003
+0.003
Dem+0.003
Neg logit contributions
 mercy-0.003
theless-0.003
 incap-0.003
ilty-0.003
 autonomy-0.002
Token college
Token ID4152
Feature activation+2.36e+00

Pos logit contributions
 mercy+0.165
theless+0.163
 incap+0.154
ilty+0.148
 autonomy+0.144
Neg logit contributions
virt-0.174
-0.159
-0.157
-0.156
Dem-0.153
Token
Token ID447
Feature activation+4.33e-01

Pos logit contributions
 mercy+7.089
theless+6.652
 incap+6.459
 communion+6.228
 autonomy+6.215
Neg logit contributions
virt-7.570
-6.892
-6.881
-6.868
Dem-6.541
Token
Token ID247
Feature activation+2.39e-01

Pos logit contributions
 mercy+2.047
theless+2.015
 incap+1.957
ilty+1.910
 autonomy+1.879
Neg logit contributions
virt-0.729
-0.604
-0.584
-0.576
Dem-0.553
Tokens
Token ID82
Feature activation+6.67e-01

Pos logit contributions
 mercy+0.812
theless+0.804
 incap+0.766
ilty+0.738
 autonomy+0.720
Neg logit contributions
virt-0.720
-0.651
-0.642
-0.637
Dem-0.624
Token student
Token ID3710
Feature activation+7.20e-01

Pos logit contributions
 mercy+1.969
theless+1.925
 incap+1.833
ilty+1.750
 autonomy+1.714
Neg logit contributions
virt-2.317
-2.117
-2.100
-2.084
Dem-2.045
Token assembly
Token ID10474
Feature activation-3.74e-02

Pos logit contributions
 mercy+2.098
theless+2.055
 incap+1.950
ilty+1.863
 communion+1.836
Neg logit contributions
virt-2.520
-2.313
-2.289
-2.269
Dem-2.220
Token to
Token ID284
Feature activation-2.07e-01

Pos logit contributions
virt+4.037
+3.671
+3.642
Dem+3.554
+3.546
Neg logit contributions
theless-3.538
 mercy-3.437
 incap-3.281
ilty-3.191
etheless-3.122
Token describe
Token ID6901
Feature activation+3.76e-01

Pos logit contributions
virt+0.669
+0.608
+0.601
+0.596
Dem+0.585
Neg logit contributions
 mercy-0.663
theless-0.655
 incap-0.623
ilty-0.598
 autonomy-0.582
Token their
Token ID511
Feature activation-3.24e-01

Pos logit contributions
 mercy+0.977
theless+0.954
 incap+0.902
ilty+0.855
 autonomy+0.831
Neg logit contributions
virt-1.443
-1.334
-1.321
-1.313
Dem-1.290
Token feelings
Token ID7666
Feature activation+3.70e-01

Pos logit contributions
virt+1.103
+1.008
+0.998
+0.987
Dem+0.974
Neg logit contributions
 mercy-0.973
theless-0.971
 incap-0.909
ilty-0.876
imposed-0.847
Token about
Token ID546
Feature activation+4.36e-01

Pos logit contributions
 mercy+1.036
theless+1.017
 incap+0.961
ilty+0.917
 autonomy+0.892
Neg logit contributions
virt-1.341
-1.234
-1.222
-1.213
Dem-1.192
Token college
Token ID4152
Feature activation+2.34e+00

Pos logit contributions
 mercy+1.261
theless+1.235
 incap+1.177
ilty+1.120
 autonomy+1.094
Neg logit contributions
virt-1.545
-1.419
-1.403
-1.396
Dem-1.366
Token debt
Token ID5057
Feature activation+5.06e-01

Pos logit contributions
 mercy+6.226
 incap+5.766
theless+5.561
 autonomy+5.478
 communion+5.282
Neg logit contributions
virt-8.433
-7.878
-7.850
-7.814
Dem-7.479
Token in
Token ID287
Feature activation+2.79e-01

Pos logit contributions
 mercy+1.452
theless+1.424
 incap+1.351
ilty+1.291
 autonomy+1.255
Neg logit contributions
virt-1.796
-1.648
-1.633
-1.620
Dem-1.592
Token three
Token ID1115
Feature activation+3.81e-02

Pos logit contributions
 mercy+0.819
theless+0.791
 incap+0.759
ilty+0.721
 communion+0.708
Neg logit contributions
virt-0.979
-0.895
-0.890
-0.882
Dem-0.863
Token em
Token ID795
Feature activation-1.58e+00

Pos logit contributions
 mercy+0.110
theless+0.106
 incap+0.103
ilty+0.097
 autonomy+0.095
Neg logit contributions
virt-0.135
-0.124
-0.123
-0.122
Dem-0.120
Tokenoj
Token ID13210
Feature activation-7.38e-01

Pos logit contributions
virt+3.378
+2.862
+2.824
Dem+2.733
+2.625
Neg logit contributions
 mercy-6.646
theless-6.624
 incap-6.348
 autonomy-6.136
ilty-6.126
Token mans
Token ID25045
Feature activation-4.70e-01

Pos logit contributions
virt+0.634
+0.578
+0.572
+0.566
Dem+0.560
Neg logit contributions
theless-0.549
 mercy-0.546
 incap-0.515
ilty-0.500
 autonomy-0.479
Token the
Token ID262
Feature activation+9.60e-02

Pos logit contributions
virt+1.621
+1.479
+1.457
+1.456
Dem+1.441
Neg logit contributions
theless-1.405
 mercy-1.368
 incap-1.300
ilty-1.239
 autonomy-1.205
Token border
Token ID4865
Feature activation-1.36e+00

Pos logit contributions
 mercy+0.285
theless+0.282
 incap+0.266
ilty+0.255
 autonomy+0.248
Neg logit contributions
virt-0.332
-0.304
-0.300
-0.298
Dem-0.293
Token with
Token ID351
Feature activation+7.34e-01

Pos logit contributions
virt+4.064
+3.719
+3.652
+3.649
Dem+3.599
Neg logit contributions
theless-4.435
 mercy-4.365
 incap-4.164
ilty-4.028
etheless-3.917
Token North
Token ID2258
Feature activation+1.62e+00

Pos logit contributions
 mercy+2.437
theless+2.380
 incap+2.282
ilty+2.176
 autonomy+2.164
Neg logit contributions
virt-2.288
-2.079
-2.052
-2.036
Dem-1.973
Token Korea
Token ID4969
Feature activation+2.34e+00

Pos logit contributions
 mercy+5.749
theless+5.619
 incap+5.397
ilty+5.225
 autonomy+5.173
Neg logit contributions
virt-4.560
-4.113
-4.038
-3.968
Dem-3.803
Token,
Token ID11
Feature activation+4.15e-01

Pos logit contributions
 mercy+5.895
theless+5.571
 incap+5.229
ilty+5.084
 communion+4.982
Neg logit contributions
virt-8.574
-8.057
-8.017
-7.979
-7.714
Token where
Token ID810
Feature activation-1.71e-01

Pos logit contributions
 mercy+1.199
theless+1.162
 incap+1.107
ilty+1.053
 communion+1.031
Neg logit contributions
virt-1.478
-1.357
-1.346
-1.339
Dem-1.310
Token reports
Token ID3136
Feature activation+1.83e-01

Pos logit contributions
virt+0.571
+0.519
+0.513
+0.510
Dem+0.502
Neg logit contributions
 mercy-0.526
theless-0.524
 incap-0.495
ilty-0.476
 autonomy-0.459
Token have
Token ID423
Feature activation+1.21e-02

Pos logit contributions
 mercy+0.560
theless+0.553
 incap+0.524
ilty+0.502
 autonomy+0.489
Neg logit contributions
virt-0.617
-0.564
-0.557
-0.553
Dem-0.544
Token emerged
Token ID9349
Feature activation-4.84e-01

Pos logit contributions
 mercy+0.034
theless+0.033
 incap+0.031
ilty+0.030
 autonomy+0.029
Neg logit contributions
virt-0.044
-0.040
-0.040
-0.039
Dem-0.039
Token campus
Token ID7611
Feature activation+1.01e+00

Pos logit contributions
virt+0.323
+0.293
+0.290
+0.287
Dem+0.281
Neg logit contributions
 mercy-0.329
theless-0.325
 incap-0.310
ilty-0.298
 autonomy-0.290
Token of
Token ID286
Feature activation+3.96e-01

Pos logit contributions
 mercy+2.969
theless+2.861
 incap+2.753
ilty+2.647
 autonomy+2.583
Neg logit contributions
virt-3.528
-3.224
-3.202
-3.183
Dem-3.100
Token Saint
Token ID9281
Feature activation-3.85e-01

Pos logit contributions
 mercy+1.296
theless+1.273
 incap+1.216
ilty+1.173
 autonomy+1.142
Neg logit contributions
virt-1.247
-1.135
-1.123
-1.116
Dem-1.085
Token Mary
Token ID5335
Feature activation-4.15e-01

Pos logit contributions
virt+1.325
+1.204
+1.193
+1.179
Dem+1.171
Neg logit contributions
 mercy-1.143
theless-1.136
 incap-1.076
ilty-1.027
 autonomy-0.997
Token's
Token ID338
Feature activation-2.19e-01

Pos logit contributions
virt+1.624
+1.490
+1.477
Dem+1.466
+1.458
Neg logit contributions
theless-1.037
 mercy-1.024
 incap-0.966
ilty-0.914
etheless-0.882
Token College
Token ID5535
Feature activation+2.34e+00

Pos logit contributions
virt+0.747
+0.682
+0.675
+0.666
Dem+0.662
Neg logit contributions
 mercy-0.657
theless-0.657
 incap-0.619
ilty-0.591
 autonomy-0.574
Token September
Token ID2693
Feature activation+3.13e-02

Pos logit contributions
 mercy+5.962
 incap+5.486
theless+5.472
ilty+5.377
 communion+5.204
Neg logit contributions
virt-8.358
-7.878
-7.743
-7.674
-7.357
Token 1
Token ID352
Feature activation-4.10e-01

Pos logit contributions
 mercy+0.104
theless+0.102
 incap+0.097
ilty+0.094
 autonomy+0.091
Neg logit contributions
virt-0.097
-0.088
-0.087
-0.086
Dem-0.084
Token,
Token ID11
Feature activation+3.40e-01

Pos logit contributions
virt+1.492
+1.374
+1.360
+1.350
Dem+1.328
Neg logit contributions
 mercy-1.138
theless-1.122
 incap-1.059
ilty-1.011
 autonomy-0.979
Token 2010
Token ID3050
Feature activation+7.35e-01

Pos logit contributions
 mercy+1.265
theless+1.257
 incap+1.202
ilty+1.161
 autonomy+1.135
Neg logit contributions
virt-0.917
-0.818
-0.808
-0.794
Dem-0.782
Token,
Token ID11
Feature activation+1.42e-01

Pos logit contributions
 mercy+1.985
theless+1.937
 incap+1.834
ilty+1.763
 autonomy+1.698
Neg logit contributions
virt-2.714
-2.499
-2.484
-2.467
Dem-2.414
Token reality
Token ID3950
Feature activation-6.60e-01

Pos logit contributions
virt+0.940
+0.857
+0.849
+0.837
Dem+0.826
Neg logit contributions
 mercy-0.899
theless-0.896
 incap-0.849
ilty-0.817
 autonomy-0.793
Token,"
Token ID553
Feature activation-2.33e-01

Pos logit contributions
virt+2.210
+2.012
+1.979
+1.975
Dem+1.944
Neg logit contributions
 mercy-1.992
theless-1.991
 incap-1.875
ilty-1.814
 autonomy-1.748
Token he
Token ID339
Feature activation-4.60e-01

Pos logit contributions
virt+0.827
+0.756
+0.753
+0.742
Dem+0.735
Neg logit contributions
theless-0.664
 mercy-0.663
 incap-0.621
ilty-0.593
 communion-0.576
Token says
Token ID1139
Feature activation-4.10e-01

Pos logit contributions
virt+1.431
+1.280
+1.268
+1.248
Dem+1.245
Neg logit contributions
 mercy-1.521
theless-1.520
 incap-1.441
ilty-1.387
 autonomy-1.353
Token.
Token ID13
Feature activation+2.96e-01

Pos logit contributions
virt+1.315
+1.192
+1.162
Dem+1.152
+1.148
Neg logit contributions
theless-1.317
 mercy-1.309
 incap-1.241
ilty-1.189
 communion-1.161
Token Sullivan
Token ID18501
Feature activation+2.34e+00

Pos logit contributions
 mercy+0.796
theless+0.787
 incap+0.740
ilty+0.704
 autonomy+0.683
Neg logit contributions
virt-1.107
-1.020
-1.010
-1.003
Dem-0.989
Token knew
Token ID2993
Feature activation+2.13e-01

Pos logit contributions
 mercy+6.166
theless+5.851
 incap+5.574
ilty+5.252
 autonomy+5.187
Neg logit contributions
virt-8.388
-7.850
-7.831
-7.688
Dem-7.559
Token his
Token ID465
Feature activation-4.07e-01

Pos logit contributions
 mercy+0.605
theless+0.599
 incap+0.565
ilty+0.540
 autonomy+0.523
Neg logit contributions
virt-0.760
-0.698
-0.690
-0.685
Dem-0.674
Token life
Token ID1204
Feature activation+2.80e-01

Pos logit contributions
virt+1.418
+1.301
+1.289
+1.278
Dem+1.259
Neg logit contributions
 mercy-1.180
theless-1.178
 incap-1.101
ilty-1.061
imposed-1.022
Token and
Token ID290
Feature activation+4.49e-01

Pos logit contributions
 mercy+0.785
theless+0.769
 incap+0.732
ilty+0.693
 autonomy+0.679
Neg logit contributions
virt-1.018
-0.937
-0.927
-0.922
Dem-0.905
Token career
Token ID3451
Feature activation+8.24e-01

Pos logit contributions
 mercy+1.266
theless+1.238
 incap+1.181
ilty+1.112
 autonomy+1.101
Neg logit contributions
virt-1.624
-1.495
-1.481
-1.470
Dem-1.441
Token
Token ID851
Feature activation-7.20e-02

Pos logit contributions
 mercy+2.371
theless+2.333
 incap+2.224
ilty+2.159
 autonomy+2.090
Neg logit contributions
virt-2.222
-2.013
-1.997
-1.971
Dem-1.922
Token Larry
Token ID13633
Feature activation-5.96e-01

Pos logit contributions
virt+0.226
+0.205
+0.202
+0.201
Dem+0.197
Neg logit contributions
 mercy-0.236
theless-0.234
 incap-0.223
ilty-0.214
 autonomy-0.209
Token Updates
Token ID28090
Feature activation+4.23e-01

Pos logit contributions
virt+1.835
+1.662
+1.640
+1.625
Dem+1.607
Neg logit contributions
 mercy-1.988
theless-1.972
 incap-1.882
ilty-1.805
imposed-1.760
Token (@
Token ID4275
Feature activation+9.99e-01

Pos logit contributions
 mercy+1.709
theless+1.687
 incap+1.624
ilty+1.575
 autonomy+1.546
Neg logit contributions
virt-1.004
-0.882
-0.870
-0.859
Dem-0.831
Token1
Token ID16
Feature activation-2.12e-01

Pos logit contributions
 mercy+3.843
theless+3.819
 incap+3.651
ilty+3.595
ints+3.481
Neg logit contributions
virt-2.430
-2.171
-2.152
-2.114
Dem-2.024
TokenDL
Token ID19260
Feature activation+2.33e+00

Pos logit contributions
virt+0.621
+0.559
+0.551
+0.547
Dem+0.536
Neg logit contributions
 mercy-0.741
theless-0.734
 incap-0.700
ilty-0.674
 autonomy-0.659
Tokenarry
Token ID6532
Feature activation+1.33e-01

Pos logit contributions
 mercy+8.006
theless+7.897
ilty+7.574
 incap+7.521
ints+7.416
Neg logit contributions
virt-6.013
-5.618
-5.423
-5.400
Dem-5.067
TokenNews
Token ID9980
Feature activation+3.10e-01

Pos logit contributions
 mercy+0.458
theless+0.453
 incap+0.432
ilty+0.416
 autonomy+0.406
Neg logit contributions
virt-0.396
-0.358
-0.353
-0.350
Dem-0.343
Token)
Token ID8
Feature activation+4.06e-01

Pos logit contributions
 mercy+0.909
theless+0.898
 incap+0.849
ilty+0.812
 autonomy+0.789
Neg logit contributions
virt-1.079
-0.989
-0.979
-0.971
Dem-0.955
Token August
Token ID2932
Feature activation+8.65e-01

Pos logit contributions
 mercy+1.344
theless+1.306
 incap+1.251
ilty+1.194
 autonomy+1.179
Neg logit contributions
virt-1.266
-1.147
-1.140
-1.126
Dem-1.093
Token 16
Token ID1467
Feature activation+8.02e-01

Pos logit contributions
 mercy+2.668
theless+2.542
 incap+2.429
ilty+2.336
 communion+2.287
Neg logit contributions
virt-2.837
-2.607
-2.588
-2.559
-2.450
Token of
Token ID286
Feature activation+1.99e-01

Pos logit contributions
virt+1.387
+1.269
+1.249
+1.237
Dem+1.220
Neg logit contributions
 mercy-1.267
theless-1.260
 incap-1.194
ilty-1.144
 autonomy-1.113
Token speaking
Token ID5486
Feature activation-2.79e-01

Pos logit contributions
 mercy+0.546
theless+0.539
 incap+0.508
ilty+0.484
 autonomy+0.470
Neg logit contributions
virt-0.729
-0.672
-0.665
-0.660
Dem-0.650
Token within
Token ID1626
Feature activation+4.02e-01

Pos logit contributions
virt+0.942
+0.862
+0.847
+0.840
Dem+0.830
Neg logit contributions
 mercy-0.844
theless-0.842
 incap-0.791
ilty-0.760
 communion-0.737
Token a
Token ID257
Feature activation+5.91e-01

Pos logit contributions
 mercy+1.144
theless+1.126
 incap+1.066
ilty+1.016
 autonomy+0.989
Neg logit contributions
virt-1.442
-1.323
-1.312
-1.302
Dem-1.281
Token more
Token ID517
Feature activation+1.28e+00

Pos logit contributions
 mercy+1.796
theless+1.748
 incap+1.669
ilty+1.595
 autonomy+1.564
Neg logit contributions
virt-2.008
-1.826
-1.822
-1.819
Dem-1.777
Token general
Token ID2276
Feature activation+2.33e+00

Pos logit contributions
 mercy+3.586
theless+3.401
 incap+3.331
 autonomy+3.130
 communion+3.105
Neg logit contributions
virt-4.586
-4.223
-4.194
-4.173
Dem-4.071
Token,
Token ID11
Feature activation+1.68e-01

Pos logit contributions
 mercy+6.756
 incap+6.417
theless+6.328
 autonomy+6.083
 communion+6.012
Neg logit contributions
virt-7.782
-7.155
-7.058
-6.979
Dem-6.854
Token city
Token ID1748
Feature activation+1.54e-01

Pos logit contributions
 mercy+0.491
theless+0.485
 incap+0.458
ilty+0.438
 autonomy+0.426
Neg logit contributions
virt-0.587
-0.539
-0.533
-0.529
Dem-0.520
Tokenwide
Token ID4421
Feature activation+7.67e-01

Pos logit contributions
 mercy+0.460
theless+0.455
 incap+0.431
ilty+0.412
 autonomy+0.401
Neg logit contributions
virt-0.525
-0.481
-0.476
-0.472
Dem-0.464
Token style
Token ID3918
Feature activation+2.11e-01

Pos logit contributions
 mercy+2.109
theless+2.050
 incap+1.971
ilty+1.856
 autonomy+1.836
Neg logit contributions
virt-2.819
-2.583
-2.579
-2.569
Dem-2.510
Token:
Token ID25
Feature activation+3.68e-01

Pos logit contributions
 mercy+0.629
theless+0.622
 incap+0.588
ilty+0.563
 autonomy+0.547
Neg logit contributions
virt-0.723
-0.662
-0.654
-0.649
Dem-0.639
Token last
Token ID938
Feature activation+2.17e-01

Pos logit contributions
virt+0.892
+0.823
+0.813
+0.809
Dem+0.798
Neg logit contributions
 mercy-0.640
theless-0.637
 incap-0.595
ilty-0.566
 autonomy-0.549
Token seen
Token ID1775
Feature activation+4.60e-01

Pos logit contributions
 mercy+0.477
theless+0.465
 incap+0.432
ilty+0.406
 communion+0.392
Neg logit contributions
virt-0.924
-0.860
-0.853
-0.846
Dem-0.833
Token wearing
Token ID5762
Feature activation+1.38e-01

Pos logit contributions
 mercy+1.101
theless+1.072
 incap+1.007
ilty+0.950
 communion+0.922
Neg logit contributions
virt-1.860
-1.726
-1.715
-1.696
Dem-1.671
Token a
Token ID257
Feature activation+4.11e-02

Pos logit contributions
theless+0.407
 mercy+0.407
 incap+0.383
ilty+0.367
 autonomy+0.355
Neg logit contributions
virt-0.474
-0.435
-0.430
-0.426
Dem-0.421
Token black
Token ID2042
Feature activation+4.52e-01

Pos logit contributions
theless+0.113
 mercy+0.112
 incap+0.106
ilty+0.100
 autonomy+0.097
Neg logit contributions
virt-0.150
-0.138
-0.137
-0.135
Dem-0.134
Token baseball
Token ID9283
Feature activation+2.32e+00

Pos logit contributions
 mercy+1.334
theless+1.314
 incap+1.246
ilty+1.192
 communion+1.158
Neg logit contributions
virt-1.568
-1.436
-1.420
-1.411
Dem-1.386
Token cap
Token ID1451
Feature activation-2.30e-01

Pos logit contributions
 mercy+5.365
theless+4.955
 incap+4.697
 communion+4.466
imposed+4.458
Neg logit contributions
virt-9.422
-8.757
-8.692
-8.639
Dem-8.372
Token,
Token ID11
Feature activation+2.11e-01

Pos logit contributions
virt+0.768
+0.698
+0.691
+0.680
Dem+0.679
Neg logit contributions
theless-0.709
 mercy-0.702
 incap-0.669
ilty-0.631
 autonomy-0.625
Token black
Token ID2042
Feature activation+5.93e-01

Pos logit contributions
 mercy+0.588
theless+0.585
 incap+0.550
ilty+0.524
 autonomy+0.509
Neg logit contributions
virt-0.764
-0.703
-0.696
-0.690
Dem-0.682
Token leather
Token ID11620
Feature activation-7.00e-01

Pos logit contributions
 mercy+1.827
theless+1.799
 incap+1.709
ilty+1.640
 communion+1.598
Neg logit contributions
virt-1.985
-1.811
-1.789
-1.778
Dem-1.743
Token coat
Token ID13209
Feature activation+2.14e-02

Pos logit contributions
virt+2.422
+2.216
+2.203
+2.156
Dem+2.152
Neg logit contributions
theless-2.024
 mercy-2.003
 incap-1.886
ilty-1.813
 autonomy-1.775
Token owned
Token ID6898
Feature activation-3.41e-01

Pos logit contributions
virt+0.756
+0.687
+0.679
+0.671
Dem+0.662
Neg logit contributions
 mercy-0.769
theless-0.763
 incap-0.725
ilty-0.696
 autonomy-0.678
Token by
Token ID416
Feature activation+1.77e-02

Pos logit contributions
virt+1.306
+1.200
+1.185
+1.182
Dem+1.172
Neg logit contributions
 mercy-0.883
theless-0.881
 incap-0.821
ilty-0.780
 communion-0.750
Token Hudson
Token ID19995
Feature activation+5.20e-01

Pos logit contributions
 mercy+0.057
theless+0.056
 incap+0.053
ilty+0.051
 autonomy+0.050
Neg logit contributions
virt-0.057
-0.052
-0.051
-0.051
Dem-0.050
Token County
Token ID3418
Feature activation+1.14e+00

Pos logit contributions
 mercy+1.641
theless+1.623
 incap+1.541
ilty+1.479
 autonomy+1.440
Neg logit contributions
virt-1.694
-1.544
-1.525
-1.512
Dem-1.485
Token Community
Token ID8108
Feature activation-4.26e-01

Pos logit contributions
 mercy+3.378
theless+3.289
 incap+3.155
ilty+3.039
 autonomy+2.950
Neg logit contributions
virt-3.867
-3.597
-3.536
-3.512
Dem-3.376
Token College
Token ID5535
Feature activation+2.32e+00

Pos logit contributions
virt+1.411
+1.279
+1.264
+1.258
Dem+1.248
Neg logit contributions
 mercy-1.319
theless-1.313
 incap-1.244
ilty-1.187
 autonomy-1.156
Token.
Token ID13
Feature activation+1.92e-02

Pos logit contributions
 mercy+5.956
theless+5.636
 incap+5.487
ilty+5.280
 autonomy+5.194
Neg logit contributions
virt-8.250
-7.842
-7.755
-7.746
-7.346
Token The
Token ID383
Feature activation-7.15e-02

Pos logit contributions
 mercy+0.058
theless+0.057
 incap+0.054
ilty+0.052
 autonomy+0.050
Neg logit contributions
virt-0.065
-0.060
-0.059
-0.059
Dem-0.058
Token plan
Token ID1410
Feature activation-1.28e-03

Pos logit contributions
virt+0.253
+0.232
+0.229
+0.228
Dem+0.224
Neg logit contributions
 mercy-0.206
theless-0.204
 incap-0.192
ilty-0.184
 autonomy-0.178
Token is
Token ID318
Feature activation-2.04e-01

Pos logit contributions
virt+0.004
+0.004
+0.004
+0.004
Dem+0.004
Neg logit contributions
 mercy-0.004
theless-0.004
 incap-0.004
ilty-0.004
 autonomy-0.003
Token for
Token ID329
Feature activation+2.84e-01

Pos logit contributions
virt+0.734
+0.674
+0.667
+0.663
Dem+0.654
Neg logit contributions
 mercy-0.579
theless-0.572
 incap-0.540
ilty-0.514
 autonomy-0.498
Token Mill
Token ID9212
Feature activation+2.43e-01

Pos logit contributions
virt+3.556
+3.235
Dem+3.186
+3.171
+3.160
Neg logit contributions
theless-3.268
 mercy-3.267
 incap-3.098
ilty-2.965
imposed-2.903
Tokeniy
Token ID7745
Feature activation-4.17e-01

Pos logit contributions
 mercy+0.658
theless+0.650
 incap+0.612
ilty+0.582
 autonomy+0.564
Neg logit contributions
virt-0.901
-0.830
-0.821
-0.816
Dem-0.804
Tokenet
Token ID316
Feature activation-1.72e-01

Pos logit contributions
virt+1.496
+1.375
+1.362
+1.352
Dem+1.328
Neg logit contributions
 mercy-1.178
theless-1.162
 incap-1.098
ilty-1.048
 autonomy-1.017
Token,
Token ID11
Feature activation-2.41e-01

Pos logit contributions
virt+0.606
+0.556
+0.548
+0.545
Dem+0.537
Neg logit contributions
 mercy-0.497
theless-0.491
 incap-0.464
ilty-0.443
 autonomy-0.430
Token a
Token ID257
Feature activation-1.40e-01

Pos logit contributions
virt+0.831
+0.760
+0.751
+0.746
Dem+0.736
Neg logit contributions
 mercy-0.713
theless-0.707
 incap-0.668
ilty-0.641
 communion-0.618
Token smugg
Token ID19176
Feature activation-2.85e+00

Pos logit contributions
virt+0.463
+0.422
+0.417
+0.413
Dem+0.407
Neg logit contributions
 mercy-0.433
theless-0.429
 incap-0.407
ilty-0.391
 autonomy-0.379
Tokenler
Token ID1754
Feature activation-5.88e-01

Pos logit contributions
virt+4.236
Dem+3.322
+3.106
sky+3.054
entry+2.997
Neg logit contributions
theless-13.373
 mercy-13.152
 incap-12.931
etheless-12.451
 autonomy-12.259
Token named
Token ID3706
Feature activation+1.11e-01

Pos logit contributions
virt+2.063
+1.878
+1.861
+1.846
Dem+1.832
Neg logit contributions
theless-1.693
 mercy-1.680
 incap-1.585
ilty-1.533
ints-1.470
Token H
Token ID367
Feature activation+6.22e-01

Pos logit contributions
 mercy+0.368
theless+0.365
 incap+0.347
ilty+0.334
 autonomy+0.326
Neg logit contributions
virt-0.342
-0.310
-0.306
-0.303
Dem-0.298
Tokenid
Token ID312
Feature activation+4.76e-01

Pos logit contributions
 mercy+2.055
theless+2.021
 incap+1.942
ilty+1.898
ints+1.831
Neg logit contributions
virt-1.918
-1.751
-1.733
-1.715
Dem-1.652
Tokenir
Token ID343
Feature activation-1.55e-01

Pos logit contributions
 mercy+1.579
theless+1.562
 incap+1.485
ilty+1.433
 autonomy+1.393
Neg logit contributions
virt-1.472
-1.341
-1.326
-1.310
Dem-1.283
Token 3
Token ID513
Feature activation-3.28e-01

Pos logit contributions
 mercy+1.375
theless+1.349
 incap+1.279
ilty+1.221
 autonomy+1.214
Neg logit contributions
virt-1.470
-1.347
-1.329
-1.321
Dem-1.300
Token.
Token ID13
Feature activation-9.98e-02

Pos logit contributions
virt+1.077
+0.981
+0.970
+0.961
Dem+0.947
Neg logit contributions
 mercy-1.028
theless-1.028
 incap-0.969
ilty-0.931
 communion-0.902
Token5
Token ID20
Feature activation-3.24e-01

Pos logit contributions
virt+0.342
+0.311
+0.308
+0.307
Dem+0.301
Neg logit contributions
theless-0.301
 mercy-0.299
 incap-0.286
ilty-0.271
imposed-0.264
Token-
Token ID12
Feature activation-5.29e-01

Pos logit contributions
virt+1.044
+0.946
+0.936
+0.927
Dem+0.912
Neg logit contributions
 mercy-1.038
theless-1.037
 incap-0.980
ilty-0.943
 communion-0.914
Tokenliter
Token ID17201
Feature activation+4.00e-01

Pos logit contributions
virt+1.827
+1.662
+1.643
+1.628
Dem+1.615
Neg logit contributions
 mercy-1.578
theless-1.574
 incap-1.477
ilty-1.412
 communion-1.373
Token Eco
Token ID38719
Feature activation-2.69e+00

Pos logit contributions
 mercy+1.290
theless+1.269
 incap+1.206
ilty+1.160
 autonomy+1.143
Neg logit contributions
virt-1.269
-1.159
-1.147
-1.134
Dem-1.115
TokenBoost
Token ID45686
Feature activation-1.55e-01

Pos logit contributions
virt+4.813
+3.963
Dem+3.955
+3.883
+3.862
Neg logit contributions
theless-12.233
 mercy-11.901
 incap-11.389
etheless-11.160
ilty-10.990
Token®
Token ID7461
Feature activation-4.24e-01

Pos logit contributions
virt+0.497
+0.452
+0.447
+0.443
Dem+0.435
Neg logit contributions
 mercy-0.500
theless-0.496
 incap-0.471
ilty-0.452
 autonomy-0.440
Token engine
Token ID3113
Feature activation-2.04e-01

Pos logit contributions
virt+1.108
+0.977
+0.964
+0.958
Dem+0.936
Neg logit contributions
 mercy-1.617
theless-1.607
 incap-1.536
ilty-1.483
 communion-1.451
Token that
Token ID326
Feature activation-2.67e-01

Pos logit contributions
virt+0.690
+0.630
+0.621
+0.615
Dem+0.608
Neg logit contributions
 mercy-0.617
theless-0.617
 incap-0.581
ilty-0.557
 communion-0.538
Token produces
Token ID11073
Feature activation-3.94e-01

Pos logit contributions
virt+0.879
+0.799
+0.790
+0.783
Dem+0.771
Neg logit contributions
 mercy-0.834
theless-0.826
 incap-0.784
ilty-0.753
 communion-0.731
Token Gujarat
Token ID30311
Feature activation+2.78e-01

Pos logit contributions
 mercy+0.379
theless+0.375
 incap+0.357
ilty+0.343
 autonomy+0.335
Neg logit contributions
virt-0.347
-0.314
-0.310
-0.308
Dem-0.301
Token Shank
Token ID41595
Feature activation-1.08e+00

Pos logit contributions
 mercy+0.923
theless+0.911
 incap+0.867
ilty+0.835
 autonomy+0.815
Neg logit contributions
virt-0.863
-0.785
-0.776
-0.768
Dem-0.751
Tokenar
Token ID283
Feature activation-6.51e-01

Pos logit contributions
virt+3.199
+2.758
Dem+2.726
+2.723
+2.712
Neg logit contributions
 mercy-3.716
theless-3.715
 incap-3.525
 communion-3.321
ilty-3.291
Token Bh
Token ID16581
Feature activation-1.78e-01

Pos logit contributions
virt+2.005
+1.812
+1.783
+1.770
Dem+1.746
Neg logit contributions
 mercy-2.173
theless-2.161
 incap-2.047
ilty-1.964
 communion-1.920
Tokenai
Token ID1872
Feature activation+6.36e-01

Pos logit contributions
virt+0.543
+0.492
+0.486
+0.481
Dem+0.472
Neg logit contributions
 mercy-0.599
theless-0.593
 incap-0.565
ilty-0.545
 autonomy-0.530
Token Veg
Token ID8427
Feature activation-2.66e+00

Pos logit contributions
 mercy+1.976
theless+1.926
 incap+1.844
ilty+1.787
 autonomy+1.728
Neg logit contributions
virt-2.079
-1.918
-1.908
-1.894
Dem-1.836
Tokenad
Token ID324
Feature activation-2.64e-01

Pos logit contributions
virt+8.337
+7.461
Dem+7.369
+7.137
Econom+7.080
Neg logit contributions
theless-8.044
 mercy-7.683
 incap-7.337
etheless-7.175
 confidentiality-6.891
Token.
Token ID13
Feature activation+1.09e-02

Pos logit contributions
virt+0.848
+0.768
+0.759
+0.752
Dem+0.742
Neg logit contributions
 mercy-0.848
theless-0.841
 incap-0.799
ilty-0.764
 autonomy-0.746
Token Union
Token ID4479
Feature activation+6.24e-01

Pos logit contributions
 mercy+0.032
theless+0.032
 incap+0.030
ilty+0.029
 autonomy+0.028
Neg logit contributions
virt-0.038
-0.034
-0.034
-0.034
Dem-0.033
Token Minister
Token ID4139
Feature activation+1.42e-01

Pos logit contributions
 mercy+2.199
theless+2.169
 incap+2.062
ilty+1.988
 autonomy+1.960
Neg logit contributions
virt-1.817
-1.640
-1.609
-1.595
Dem-1.554
Token P
Token ID350
Feature activation-6.67e-02

Pos logit contributions
 mercy+0.462
theless+0.457
 incap+0.434
ilty+0.417
 autonomy+0.407
Neg logit contributions
virt-0.449
-0.407
-0.402
-0.399
Dem-0.392
Token sounding
Token ID22655
Feature activation+3.42e-01

Pos logit contributions
 mercy+1.038
theless+1.013
 incap+0.967
ilty+0.913
 autonomy+0.894
Neg logit contributions
virt-1.347
-1.231
-1.230
-1.214
Dem-1.194
Token like
Token ID588
Feature activation-5.94e-01

Pos logit contributions
 mercy+0.930
theless+0.910
 incap+0.864
ilty+0.819
 autonomy+0.796
Neg logit contributions
virt-1.268
-1.160
-1.158
-1.144
Dem-1.125
Token cock
Token ID7540
Feature activation+8.95e-02

Pos logit contributions
virt+1.920
+1.766
+1.725
+1.724
Dem+1.695
Neg logit contributions
 mercy-1.864
theless-1.855
 incap-1.756
ilty-1.691
 autonomy-1.642
Tokenney
Token ID1681
Feature activation-4.65e-01

Pos logit contributions
 mercy+0.335
theless+0.332
 incap+0.318
ilty+0.307
 autonomy+0.301
Neg logit contributions
virt-0.239
-0.213
-0.210
-0.208
Dem-0.204
Token Tony
Token ID8832
Feature activation-9.84e-01

Pos logit contributions
virt+1.514
+1.387
+1.356
+1.355
Dem+1.334
Neg logit contributions
 mercy-1.461
theless-1.453
 incap-1.379
ilty-1.322
 autonomy-1.285
Token Sop
Token ID35643
Feature activation-2.64e+00

Pos logit contributions
virt+3.025
+2.747
+2.701
+2.669
Dem+2.640
Neg logit contributions
theless-3.273
 mercy-3.270
 incap-3.089
ilty-2.973
 autonomy-2.909
Tokenrano
Token ID35823
Feature activation-5.73e-01

Pos logit contributions
virt+3.170
+2.450
sky+2.237
Dem+2.180
+2.130
Neg logit contributions
theless-13.779
 mercy-13.189
 incap-13.054
etheless-12.984
ciating-12.254
Token.
Token ID13
Feature activation+3.46e-01

Pos logit contributions
virt+1.827
+1.669
+1.636
+1.628
Dem+1.600
Neg logit contributions
 mercy-1.840
theless-1.832
 incap-1.740
ilty-1.668
 autonomy-1.627
Token
Token ID447
Feature activation-1.33e-01

Pos logit contributions
 mercy+1.419
theless+1.401
 incap+1.344
ilty+1.302
 communion+1.279
Neg logit contributions
virt-0.801
-0.691
-0.686
-0.673
Dem-0.651
Token
Token ID251
Feature activation-6.11e-01

Pos logit contributions
virt+0.223
+0.185
+0.180
+0.177
Dem+0.169
Neg logit contributions
 mercy-0.633
theless-0.631
 incap-0.609
ilty-0.592
 autonomy-0.583
Token
Token ID198
Feature activation+5.50e-01

Pos logit contributions
virt+2.018
+1.854
+1.818
+1.809
Dem+1.788
Neg logit contributions
 mercy-1.890
theless-1.870
 incap-1.783
ilty-1.710
 autonomy-1.661
Token is
Token ID318
Feature activation+2.63e-01

Pos logit contributions
virt+0.182
+0.166
+0.163
+0.162
Dem+0.160
Neg logit contributions
 mercy-0.173
theless-0.172
 incap-0.162
ilty-0.156
 autonomy-0.152
Token to
Token ID284
Feature activation-6.12e-01

Pos logit contributions
 mercy+0.702
theless+0.690
 incap+0.651
ilty+0.619
 autonomy+0.600
Neg logit contributions
virt-0.988
-0.912
-0.903
-0.897
Dem-0.883
Token sew
Token ID34249
Feature activation-1.20e+00

Pos logit contributions
virt+2.060
+1.875
+1.854
+1.841
Dem+1.813
Neg logit contributions
 mercy-1.861
theless-1.853
 incap-1.743
ilty-1.676
 autonomy-1.631
Token a
Token ID257
Feature activation-6.70e-01

Pos logit contributions
virt+3.868
+3.503
+3.422
+3.371
Dem+3.356
Neg logit contributions
theless-3.815
 mercy-3.734
 incap-3.549
ilty-3.437
etheless-3.341
Token straight
Token ID3892
Feature activation-5.05e-01

Pos logit contributions
virt+1.613
+1.414
+1.391
+1.363
Dem+1.337
Neg logit contributions
theless-2.671
 mercy-2.666
 incap-2.553
ilty-2.474
 autonomy-2.427
Token seam
Token ID15787
Feature activation-2.64e+00

Pos logit contributions
virt+1.311
+1.163
+1.144
+1.131
Dem+1.107
Neg logit contributions
 mercy-1.927
theless-1.917
 incap-1.834
ilty-1.771
 autonomy-1.737
Token.
Token ID13
Feature activation-3.11e-01

Pos logit contributions
virt+7.808
+7.023
+6.680
Dem+6.668
sect+6.540
Neg logit contributions
theless-8.491
 mercy-8.325
 incap-7.955
ilty-7.615
 autonomy-7.566
Token To
Token ID1675
Feature activation-5.15e-01

Pos logit contributions
virt+0.991
+0.900
+0.893
+0.884
Dem+0.865
Neg logit contributions
 mercy-1.008
theless-0.993
 incap-0.946
ilty-0.907
 autonomy-0.885
Token practice
Token ID3357
Feature activation+3.58e-01

Pos logit contributions
virt+1.654
+1.496
+1.475
+1.468
Dem+1.445
Neg logit contributions
 mercy-1.648
theless-1.644
 incap-1.550
ilty-1.493
 autonomy-1.455
Token sewing
Token ID35356
Feature activation-8.40e-01

Pos logit contributions
 mercy+1.028
theless+1.009
 incap+0.955
ilty+0.909
 autonomy+0.887
Neg logit contributions
virt-1.275
-1.174
-1.165
-1.156
Dem-1.135
Token a
Token ID257
Feature activation-5.45e-01

Pos logit contributions
virt+2.712
+2.457
+2.389
+2.357
Dem+2.343
Neg logit contributions
theless-2.633
 mercy-2.617
 incap-2.487
ilty-2.421
etheless-2.337
Token Show
Token ID5438
Feature activation+6.74e-01

Pos logit contributions
 mercy+0.028
theless+0.027
 incap+0.026
ilty+0.025
 autonomy+0.024
Neg logit contributions
virt-0.027
-0.025
-0.024
-0.024
Dem-0.024
Token how
Token ID703
Feature activation-1.87e-01

Pos logit contributions
 mercy+1.788
theless+1.745
 incap+1.647
ilty+1.564
 autonomy+1.519
Neg logit contributions
virt-2.547
-2.352
-2.327
-2.316
Dem-2.274
Token to
Token ID284
Feature activation-6.55e-01

Pos logit contributions
virt+0.672
+0.618
+0.613
+0.606
Dem+0.597
Neg logit contributions
 mercy-0.525
theless-0.523
 incap-0.492
ilty-0.471
 communion-0.455
Token sew
Token ID34249
Feature activation-1.22e+00

Pos logit contributions
virt+2.087
+1.890
+1.870
+1.854
Dem+1.826
Neg logit contributions
theless-2.097
 mercy-2.095
 incap-1.974
ilty-1.908
 autonomy-1.853
Token a
Token ID257
Feature activation-5.89e-01

Pos logit contributions
virt+4.013
+3.699
+3.642
+3.571
Dem+3.541
Neg logit contributions
theless-3.702
 mercy-3.654
 incap-3.461
ilty-3.354
etheless-3.245
Token seam
Token ID15787
Feature activation-2.61e+00

Pos logit contributions
virt+2.148
+1.977
+1.963
+1.934
Dem+1.912
Neg logit contributions
theless-1.608
 mercy-1.604
 incap-1.509
ilty-1.436
 autonomy-1.397
Token:
Token ID25
Feature activation+1.08e-02

Pos logit contributions
virt+8.565
+7.834
+7.533
Dem+7.495
+7.357
Neg logit contributions
theless-7.509
 mercy-7.356
 incap-6.903
ilty-6.664
etheless-6.598
Token One
Token ID1881
Feature activation-4.70e-01

Pos logit contributions
 mercy+0.035
theless+0.034
 incap+0.033
ilty+0.031
 autonomy+0.030
Neg logit contributions
virt-0.035
-0.031
-0.031
-0.031
Dem-0.030
Token of
Token ID286
Feature activation+1.19e-01

Pos logit contributions
virt+1.540
+1.400
+1.383
+1.370
Dem+1.347
Neg logit contributions
 mercy-1.474
theless-1.467
 incap-1.387
ilty-1.336
 autonomy-1.296
Token the
Token ID262
Feature activation-2.28e-01

Pos logit contributions
 mercy+0.333
theless+0.331
 incap+0.311
ilty+0.298
 autonomy+0.288
Neg logit contributions
virt-0.429
-0.394
-0.389
-0.385
Dem-0.380
Token most
Token ID749
Feature activation-5.11e-01

Pos logit contributions
virt+0.751
+0.684
+0.676
+0.669
Dem+0.658
Neg logit contributions
 mercy-0.714
theless-0.709
 incap-0.671
ilty-0.646
 autonomy-0.627
Token and
Token ID290
Feature activation-1.56e-01

Pos logit contributions
 mercy+1.485
theless+1.437
 incap+1.393
ilty+1.315
 autonomy+1.294
Neg logit contributions
virt-1.776
-1.636
-1.614
-1.601
Dem-1.564
Token his
Token ID465
Feature activation-5.51e-01

Pos logit contributions
virt+0.551
+0.504
+0.499
+0.496
Dem+0.490
Neg logit contributions
 mercy-0.446
theless-0.445
 incap-0.417
ilty-0.401
ints-0.386
Token attorney
Token ID6136
Feature activation-2.10e-01

Pos logit contributions
virt+1.911
+1.751
+1.732
+1.713
Dem+1.700
Neg logit contributions
theless-1.605
 mercy-1.594
 incap-1.483
ilty-1.434
etheless-1.380
Token,
Token ID11
Feature activation+3.51e-01

Pos logit contributions
virt+0.749
+0.683
+0.678
+0.671
Dem+0.664
Neg logit contributions
 mercy-0.599
theless-0.597
 incap-0.562
ilty-0.536
 communion-0.518
Token Josh
Token ID8518
Feature activation+1.60e-01

Pos logit contributions
 mercy+1.146
theless+1.129
 incap+1.075
ilty+1.034
 autonomy+1.010
Neg logit contributions
virt-1.111
-1.011
-0.998
-0.989
Dem-0.966
Token Engel
Token ID46073
Feature activation-2.59e+00

Pos logit contributions
 mercy+0.537
theless+0.531
 incap+0.506
ilty+0.488
 autonomy+0.475
Neg logit contributions
virt-0.489
-0.443
-0.438
-0.434
Dem-0.424
Token,
Token ID11
Feature activation-5.03e-02

Pos logit contributions
virt+7.785
sky+6.622
+6.539
+6.513
Dem+6.471
Neg logit contributions
theless-8.278
 mercy-8.270
 incap-8.050
ciating-7.476
 autonomy-7.445
Token said
Token ID531
Feature activation-4.49e-02

Pos logit contributions
virt+0.186
+0.171
+0.170
+0.168
Dem+0.166
Neg logit contributions
 mercy-0.137
theless-0.135
 incap-0.127
ilty-0.121
 autonomy-0.118
Token he
Token ID339
Feature activation+2.87e-01

Pos logit contributions
virt+0.157
+0.144
+0.142
+0.141
Dem+0.139
Neg logit contributions
 mercy-0.130
theless-0.130
 incap-0.122
ilty-0.117
 autonomy-0.113
Token currently
Token ID3058
Feature activation-9.64e-02

Pos logit contributions
 mercy+0.842
theless+0.831
 incap+0.785
ilty+0.751
 autonomy+0.730
Neg logit contributions
virt-0.998
-0.916
-0.905
-0.900
Dem-0.884
Token is
Token ID318
Feature activation+1.12e-01

Pos logit contributions
virt+0.315
+0.287
+0.283
+0.281
Dem+0.277
Neg logit contributions
 mercy-0.303
theless-0.301
 incap-0.284
ilty-0.273
 autonomy-0.266
Tokeny
Token ID88
Feature activation-3.38e-01

Pos logit contributions
 mercy+1.781
theless+1.719
 incap+1.639
ilty+1.614
ints+1.543
Neg logit contributions
virt-2.102
-1.966
-1.952
-1.925
Dem-1.864
Tokenatta
Token ID25014
Feature activation+8.82e-01

Pos logit contributions
virt+1.364
+1.259
+1.247
+1.239
Dem+1.226
Neg logit contributions
 mercy-0.810
theless-0.800
 incap-0.744
ilty-0.701
 autonomy-0.680
Token travelled
Token ID21650
Feature activation+5.95e-02

Pos logit contributions
 mercy+2.334
theless+2.293
 incap+2.136
ilty+2.020
 autonomy+1.994
Neg logit contributions
virt-3.311
-3.072
-3.047
-3.035
Dem-2.960
Token to
Token ID284
Feature activation-2.58e-01

Pos logit contributions
 mercy+0.162
theless+0.159
 incap+0.150
ilty+0.143
 autonomy+0.139
Neg logit contributions
virt-0.221
-0.203
-0.201
-0.200
Dem-0.197
Token Mog
Token ID30926
Feature activation-1.52e+00

Pos logit contributions
virt+0.761
+0.684
+0.675
+0.669
Dem+0.659
Neg logit contributions
 mercy-0.891
theless-0.886
 incap-0.844
ilty-0.816
 autonomy-0.791
Tokenadish
Token ID48563
Feature activation-2.57e+00

Pos logit contributions
virt+1.477
Dem+0.880
+0.759
+0.698
+0.670
Neg logit contributions
theless-8.413
 mercy-8.193
 incap-7.967
etheless-7.672
 confidentiality-7.642
Tokenu
Token ID84
Feature activation-5.80e-02

Pos logit contributions
virt+5.876
+4.976
+4.893
Dem+4.808
+4.712
Neg logit contributions
theless-10.524
 mercy-10.479
 incap-9.969
etheless-9.481
 autonomy-9.322
Token to
Token ID284
Feature activation-2.55e-01

Pos logit contributions
virt+0.204
+0.187
+0.185
+0.183
Dem+0.181
Neg logit contributions
 mercy-0.168
theless-0.167
 incap-0.158
ilty-0.150
 communion-0.145
Token discuss
Token ID2112
Feature activation-1.05e-02

Pos logit contributions
virt+0.821
+0.747
+0.738
+0.731
Dem+0.721
Neg logit contributions
 mercy-0.811
theless-0.806
 incap-0.764
ilty-0.735
 autonomy-0.714
Token the
Token ID262
Feature activation+2.80e-01

Pos logit contributions
virt+0.037
+0.034
+0.034
+0.033
Dem+0.033
Neg logit contributions
 mercy-0.030
theless-0.030
 incap-0.028
ilty-0.027
 communion-0.026
Token issue
Token ID2071
Feature activation+3.45e-01

Pos logit contributions
 mercy+0.791
theless+0.779
 incap+0.736
ilty+0.702
 autonomy+0.683
Neg logit contributions
virt-1.010
-0.928
-0.919
-0.912
Dem-0.897
Token of
Token ID286
Feature activation+2.79e-01

Pos logit contributions
 mercy+1.628
theless+1.591
 incap+1.510
ilty+1.443
 autonomy+1.413
Neg logit contributions
virt-1.918
-1.761
-1.752
-1.732
Dem-1.699
Token the
Token ID262
Feature activation+3.17e-01

Pos logit contributions
 mercy+0.829
theless+0.820
 incap+0.776
ilty+0.743
 autonomy+0.722
Neg logit contributions
virt-0.961
-0.880
-0.870
-0.863
Dem-0.849
Token capital
Token ID3139
Feature activation-4.47e-02

Pos logit contributions
 mercy+1.181
theless+1.146
 incap+1.110
ilty+1.067
 autonomy+1.065
Neg logit contributions
virt-0.861
-0.778
-0.768
-0.753
Dem-0.734
Token,
Token ID11
Feature activation+2.19e-01

Pos logit contributions
virt+0.101
+0.088
+0.086
+0.085
Dem+0.083
Neg logit contributions
 mercy-0.186
theless-0.185
 incap-0.178
ilty-0.172
 communion-0.169
Token Mog
Token ID30926
Feature activation-1.61e+00

Pos logit contributions
 mercy+0.941
theless+0.927
 incap+0.894
ilty+0.868
 autonomy+0.853
Neg logit contributions
virt-0.473
-0.409
-0.402
-0.395
Dem-0.382
Tokenadish
Token ID48563
Feature activation-2.57e+00

Pos logit contributions
virt+1.404
Dem+0.723
+0.546
sky+0.543
+0.513
Neg logit contributions
theless-9.230
 mercy-8.906
 incap-8.742
etheless-8.490
 confidentiality-8.402
Tokenu
Token ID84
Feature activation-3.17e-01

Pos logit contributions
virt+5.751
+4.879
Dem+4.729
+4.707
+4.583
Neg logit contributions
theless-10.768
 mercy-10.591
 incap-10.161
etheless-9.759
 autonomy-9.488
Token.
Token ID13
Feature activation-1.81e-02

Pos logit contributions
virt+1.080
+0.985
+0.970
+0.963
Dem+0.957
Neg logit contributions
theless-0.958
 mercy-0.953
 incap-0.900
ilty-0.856
 communion-0.833
Token
Token ID198
Feature activation+4.74e-01

Pos logit contributions
virt+0.063
+0.057
+0.057
+0.056
Dem+0.056
Neg logit contributions
 mercy-0.053
theless-0.053
 incap-0.050
ilty-0.048
 communion-0.046
Token
Token ID198
Feature activation+4.95e-02

Pos logit contributions
 mercy+1.201
theless+1.116
 incap+1.070
ilty+1.045
ints+0.998
Neg logit contributions
virt-1.815
-1.692
-1.688
-1.667
Dem-1.605
TokenThey
Token ID2990
Feature activation-8.17e-02

Pos logit contributions
 mercy+0.187
theless+0.185
 incap+0.177
ilty+0.172
 autonomy+0.168
Neg logit contributions
virt-0.129
-0.115
-0.114
-0.112
Dem-0.108
Token :
Token ID1058
Feature activation-7.59e-01

Pos logit contributions
virt+0.714
+0.651
+0.644
+0.638
Dem+0.626
Neg logit contributions
 mercy-0.673
theless-0.665
 incap-0.631
ilty-0.605
 autonomy-0.589
Token This
Token ID770
Feature activation+4.63e-01

Pos logit contributions
virt+2.419
+2.189
+2.159
+2.146
Dem+2.124
Neg logit contributions
 mercy-2.442
theless-2.418
 incap-2.301
ilty-2.223
 communion-2.160
Token can
Token ID460
Feature activation-2.79e-03

Pos logit contributions
 mercy+1.382
theless+1.354
 incap+1.288
ilty+1.226
 autonomy+1.202
Neg logit contributions
virt-1.592
-1.455
-1.449
-1.429
Dem-1.399
Token also
Token ID635
Feature activation+2.17e-01

Pos logit contributions
virt+0.010
+0.009
+0.009
+0.009
Dem+0.008
Neg logit contributions
 mercy-0.008
theless-0.008
 incap-0.008
ilty-0.007
 autonomy-0.007
Token be
Token ID307
Feature activation+5.91e-01

Pos logit contributions
 mercy+0.670
theless+0.656
 incap+0.625
ilty+0.596
 autonomy+0.582
Neg logit contributions
virt-0.725
-0.665
-0.658
-0.650
Dem-0.638
Token inadvert
Token ID25373
Feature activation-2.56e+00

Pos logit contributions
 mercy+1.563
theless+1.525
 incap+1.439
ilty+1.354
 autonomy+1.330
Neg logit contributions
virt-2.235
-2.054
-2.047
-2.019
Dem-1.991
Tokenent
Token ID298
Feature activation-5.81e-01

Pos logit contributions
virt+7.613
+6.621
+6.499
Dem+6.493
+6.438
Neg logit contributions
 mercy-8.334
theless-8.251
 incap-7.937
 autonomy-7.513
 confidentiality-7.478
Token:
Token ID25
Feature activation-1.97e-01

Pos logit contributions
virt+1.934
+1.784
+1.749
+1.740
Dem+1.732
Neg logit contributions
theless-1.754
 mercy-1.730
 incap-1.633
ilty-1.579
 communion-1.524
Token people
Token ID661
Feature activation-1.59e-01

Pos logit contributions
virt+0.687
+0.629
+0.622
+0.618
Dem+0.607
Neg logit contributions
 mercy-0.579
theless-0.572
 incap-0.541
ilty-0.517
 autonomy-0.503
Token might
Token ID1244
Feature activation-4.75e-01

Pos logit contributions
virt+0.554
+0.507
+0.502
+0.498
Dem+0.490
Neg logit contributions
 mercy-0.465
theless-0.460
 incap-0.435
ilty-0.416
 autonomy-0.404
Token assume
Token ID7048
Feature activation+3.66e-01

Pos logit contributions
virt+1.577
+1.435
+1.419
+1.408
Dem+1.386
Neg logit contributions
 mercy-1.467
theless-1.458
 incap-1.379
ilty-1.327
 autonomy-1.287
Token to
Token ID284
Feature activation-1.61e-01

Pos logit contributions
 mercy+1.864
theless+1.824
 incap+1.722
ilty+1.639
 autonomy+1.592
Neg logit contributions
virt-2.723
-2.512
-2.491
-2.471
Dem-2.420
Token start
Token ID923
Feature activation-1.55e-01

Pos logit contributions
virt+0.508
+0.463
+0.457
+0.452
Dem+0.439
Neg logit contributions
 mercy-0.525
theless-0.516
 incap-0.495
ilty-0.473
 autonomy-0.465
Token developing
Token ID5922
Feature activation+1.13e-01

Pos logit contributions
virt+0.558
+0.513
+0.507
+0.504
Dem+0.495
Neg logit contributions
 mercy-0.439
theless-0.434
 incap-0.409
ilty-0.391
 autonomy-0.379
Token in
Token ID287
Feature activation-5.03e-01

Pos logit contributions
 mercy+0.324
theless+0.319
 incap+0.302
ilty+0.288
 autonomy+0.281
Neg logit contributions
virt-0.399
-0.366
-0.363
-0.359
Dem-0.352
Token AWS
Token ID30865
Feature activation-3.61e-01

Pos logit contributions
virt+1.664
+1.519
+1.485
+1.482
Dem+1.477
Neg logit contributions
theless-1.555
 mercy-1.545
 incap-1.458
ilty-1.405
 communion-1.362
Token Lamb
Token ID21114
Feature activation-2.55e+00

Pos logit contributions
virt+1.198
+1.095
+1.078
+1.071
Dem+1.060
Neg logit contributions
 mercy-1.118
theless-1.114
 incap-1.050
ilty-1.006
 communion-0.977
Tokenda
Token ID6814
Feature activation-1.11e+00

Pos logit contributions
virt+5.706
+4.816
Dem+4.770
+4.503
+4.470
Neg logit contributions
theless-10.569
 mercy-10.147
 incap-9.952
etheless-9.506
ilty-9.401
Token with
Token ID351
Feature activation-1.81e-01

Pos logit contributions
virt+3.593
+3.289
Dem+3.237
+3.203
+3.179
Neg logit contributions
theless-3.532
 mercy-3.486
 incap-3.294
ilty-3.121
imposed-3.074
Token server
Token ID4382
Feature activation-3.88e-02

Pos logit contributions
virt+0.616
+0.563
+0.555
+0.552
Dem+0.545
Neg logit contributions
 mercy-0.544
theless-0.541
 incap-0.510
ilty-0.489
 communion-0.475
Tokenless
Token ID1203
Feature activation+6.97e-01

Pos logit contributions
virt+0.102
+0.092
+0.090
+0.089
Dem+0.087
Neg logit contributions
 mercy-0.146
theless-0.145
 incap-0.139
ilty-0.134
 autonomy-0.131
Token framework
Token ID9355
Feature activation-1.00e-01

Pos logit contributions
 mercy+2.121
theless+2.080
 incap+1.983
ilty+1.896
 autonomy+1.858
Neg logit contributions
virt-2.340
-2.134
-2.128
-2.108
Dem-2.034
Token will
Token ID481
Feature activation-9.34e-02

Pos logit contributions
 mercy+1.171
theless+1.146
 incap+1.088
ilty+1.038
 autonomy+1.010
Neg logit contributions
virt-1.465
-1.350
-1.332
-1.332
Dem-1.302
Token be
Token ID307
Feature activation+5.86e-02

Pos logit contributions
virt+0.314
+0.287
+0.284
+0.282
Dem+0.277
Neg logit contributions
 mercy-0.285
theless-0.282
 incap-0.267
ilty-0.256
 autonomy-0.249
Token joined
Token ID5399
Feature activation-4.96e-01

Pos logit contributions
 mercy+0.162
theless+0.159
 incap+0.151
ilty+0.144
 autonomy+0.140
Neg logit contributions
virt-0.215
-0.197
-0.196
-0.195
Dem-0.191
Token by
Token ID416
Feature activation+5.75e-02

Pos logit contributions
virt+1.794
+1.631
+1.610
+1.608
Dem+1.589
Neg logit contributions
theless-1.393
 mercy-1.386
 incap-1.306
ilty-1.229
 autonomy-1.204
Token Jim
Token ID5395
Feature activation-2.80e-02

Pos logit contributions
 mercy+0.185
theless+0.183
 incap+0.174
ilty+0.167
 autonomy+0.163
Neg logit contributions
virt-0.184
-0.167
-0.165
-0.164
Dem-0.161
Token Hub
Token ID14699
Feature activation-2.53e+00

Pos logit contributions
virt+0.090
+0.082
+0.081
+0.080
Dem+0.079
Neg logit contributions
 mercy-0.090
theless-0.089
 incap-0.085
ilty-0.081
 autonomy-0.079
Tokenay
Token ID323
Feature activation-5.23e-01

Pos logit contributions
virt+7.507
+6.398
Dem+6.385
sky+6.363
+6.135
Neg logit contributions
 mercy-8.283
theless-8.180
 incap-7.955
 communion-7.452
 autonomy-7.428
Token and
Token ID290
Feature activation-2.71e-01

Pos logit contributions
virt+1.805
+1.641
+1.616
+1.603
Dem+1.589
Neg logit contributions
 mercy-1.563
theless-1.549
 incap-1.467
ilty-1.393
 autonomy-1.360
Token the
Token ID262
Feature activation+2.71e-01

Pos logit contributions
virt+0.821
+0.742
+0.733
+0.727
Dem+0.716
Neg logit contributions
 mercy-0.914
theless-0.909
 incap-0.864
ilty-0.834
imposed-0.809
Token late
Token ID2739
Feature activation+7.56e-01

Pos logit contributions
 mercy+0.859
theless+0.846
 incap+0.805
ilty+0.773
 autonomy+0.755
Neg logit contributions
virt-0.881
-0.803
-0.793
-0.788
Dem-0.772
Token John
Token ID1757
Feature activation+6.06e-01

Pos logit contributions
 mercy+2.452
theless+2.410
 incap+2.294
ilty+2.200
 autonomy+2.173
Neg logit contributions
virt-2.399
-2.191
-2.160
-2.140
Dem-2.088
Token washed
Token ID18989
Feature activation+2.64e-02

Pos logit contributions
virt+0.027
+0.025
+0.025
+0.025
Dem+0.024
Neg logit contributions
 mercy-0.021
theless-0.020
 incap-0.019
ilty-0.018
 autonomy-0.018
Token it
Token ID340
Feature activation+2.24e-01

Pos logit contributions
 mercy+0.081
theless+0.080
 incap+0.076
ilty+0.073
 autonomy+0.071
Neg logit contributions
virt-0.088
-0.080
-0.080
-0.079
Dem-0.078
Token with
Token ID351
Feature activation+4.16e-01

Pos logit contributions
 mercy+0.675
theless+0.663
 incap+0.628
ilty+0.601
 autonomy+0.585
Neg logit contributions
virt-0.764
-0.701
-0.695
-0.689
Dem-0.674
Token T
Token ID309
Feature activation+1.11e+00

Pos logit contributions
 mercy+1.319
theless+1.296
 incap+1.230
ilty+1.179
 autonomy+1.152
Neg logit contributions
virt-1.356
-1.239
-1.225
-1.211
Dem-1.186
Tokenamiya
Token ID38241
Feature activation+1.78e-01

Pos logit contributions
 mercy+3.573
theless+3.488
 incap+3.346
ilty+3.293
ints+3.231
Neg logit contributions
virt-3.483
-3.184
-3.147
-3.102
Dem-3.026
Token Olive
Token ID30012
Feature activation-2.53e+00

Pos logit contributions
 mercy+0.544
theless+0.536
 incap+0.508
ilty+0.487
 autonomy+0.474
Neg logit contributions
virt-0.600
-0.548
-0.543
-0.538
Dem-0.528
Token Dr
Token ID1583
Feature activation-9.24e-01

Pos logit contributions
virt+7.723
+6.867
Dem+6.785
+6.591
 Tul+6.547
Neg logit contributions
theless-7.917
 mercy-7.712
 incap-7.398
ilty-6.996
imposed-6.851
Tokenab
Token ID397
Feature activation-5.17e-01

Pos logit contributions
virt+2.457
+2.162
+2.132
Dem+2.115
+2.075
Neg logit contributions
theless-3.429
 mercy-3.394
 incap-3.248
ilty-3.093
 communion-3.092
Token.
Token ID13
Feature activation-2.17e-01

Pos logit contributions
virt+1.723
+1.568
+1.537
+1.535
Dem+1.520
Neg logit contributions
 mercy-1.592
theless-1.581
 incap-1.502
ilty-1.417
 autonomy-1.402
Token I
Token ID314
Feature activation+8.34e-02

Pos logit contributions
virt+0.737
+0.674
+0.666
+0.661
Dem+0.651
Neg logit contributions
 mercy-0.658
theless-0.651
 incap-0.618
ilty-0.591
 autonomy-0.575
Token focused
Token ID5670
Feature activation-6.57e-01

Pos logit contributions
 mercy+0.229
theless+0.226
 incap+0.213
ilty+0.203
 autonomy+0.196
Neg logit contributions
virt-0.306
-0.282
-0.279
-0.277
Dem-0.272
Token But
Token ID887
Feature activation+2.61e-01

Pos logit contributions
virt+0.741
+0.676
+0.669
+0.663
Dem+0.652
Neg logit contributions
 mercy-0.685
theless-0.677
 incap-0.642
ilty-0.616
 autonomy-0.599
Token then
Token ID788
Feature activation-7.41e-02

Pos logit contributions
 mercy+0.785
theless+0.772
 incap+0.734
ilty+0.699
 autonomy+0.685
Neg logit contributions
virt-0.896
-0.823
-0.813
-0.807
Dem-0.791
Token that
Token ID326
Feature activation-3.95e-01

Pos logit contributions
virt+0.255
+0.233
+0.231
+0.229
Dem+0.226
Neg logit contributions
 mercy-0.220
theless-0.218
 incap-0.206
ilty-0.198
 autonomy-0.191
Token happened
Token ID3022
Feature activation-6.58e-01

Pos logit contributions
virt+1.447
+1.327
+1.309
+1.302
Dem+1.288
Neg logit contributions
theless-1.065
 mercy-1.063
 incap-0.984
ilty-0.968
etheless-0.919
Token with
Token ID351
Feature activation-4.07e-01

Pos logit contributions
virt+2.220
+2.023
+1.989
+1.978
Dem+1.957
Neg logit contributions
theless-1.971
 mercy-1.970
 incap-1.857
ilty-1.793
imposed-1.734
Token Moz
Token ID28036
Feature activation-2.52e+00

Pos logit contributions
virt+1.370
+1.242
+1.225
+1.219
Dem+1.204
Neg logit contributions
 mercy-1.234
theless-1.233
 incap-1.156
ilty-1.117
imposed-1.086
Tokenart
Token ID433
Feature activation-2.71e-01

Pos logit contributions
virt+6.323
Dem+5.410
+5.177
 Moz+5.084
+5.056
Neg logit contributions
theless-9.314
 mercy-9.158
 incap-8.758
ilty-8.715
etheless-8.637
Token.
Token ID13
Feature activation-5.43e-01

Pos logit contributions
virt+0.944
+0.863
+0.850
+0.845
Dem+0.834
Neg logit contributions
 mercy-0.796
theless-0.792
 incap-0.746
ilty-0.714
 autonomy-0.691
Token The
Token ID383
Feature activation-2.54e-01

Pos logit contributions
virt+1.772
+1.613
+1.592
+1.579
Dem+1.559
Neg logit contributions
 mercy-1.708
theless-1.698
 incap-1.610
ilty-1.547
 autonomy-1.504
Token idea
Token ID2126
Feature activation+6.87e-01

Pos logit contributions
virt+0.843
+0.768
+0.758
+0.751
Dem+0.740
Neg logit contributions
 mercy-0.783
theless-0.779
 incap-0.733
ilty-0.708
imposed-0.684
Token of
Token ID286
Feature activation+1.94e-01

Pos logit contributions
 mercy+1.981
theless+1.943
 incap+1.851
ilty+1.766
 autonomy+1.719
Neg logit contributions
virt-2.421
-2.227
-2.208
-2.185
Dem-2.149
Tokenwin
Token ID5404
Feature activation+2.86e-01

Pos logit contributions
 mercy+0.321
theless+0.318
 incap+0.302
ilty+0.291
 autonomy+0.284
Neg logit contributions
virt-0.300
-0.272
-0.269
-0.267
Dem-0.261
Tokenners
Token ID2741
Feature activation+9.36e-03

Pos logit contributions
 mercy+0.856
theless+0.846
 incap+0.801
ilty+0.767
 autonomy+0.746
Neg logit contributions
virt-0.980
-0.897
-0.887
-0.880
Dem-0.865
Token of
Token ID286
Feature activation-4.65e-01

Pos logit contributions
 mercy+0.028
theless+0.027
 incap+0.026
ilty+0.025
 autonomy+0.024
Neg logit contributions
virt-0.032
-0.030
-0.029
-0.029
Dem-0.029
Token a
Token ID257
Feature activation-2.68e-01

Pos logit contributions
virt+1.529
+1.384
+1.382
+1.357
Dem+1.343
Neg logit contributions
theless-1.445
 mercy-1.438
 incap-1.366
ilty-1.306
imposed-1.271
Token Tom
Token ID4186
Feature activation-8.87e-02

Pos logit contributions
virt+0.859
+0.781
+0.773
+0.764
Dem+0.752
Neg logit contributions
 mercy-0.857
theless-0.849
 incap-0.808
ilty-0.775
 autonomy-0.754
Token Em
Token ID2295
Feature activation-2.52e+00

Pos logit contributions
virt+0.281
+0.255
+0.252
+0.250
Dem+0.246
Neg logit contributions
 mercy-0.288
theless-0.285
 incap-0.271
ilty-0.260
 autonomy-0.254
Tokenanski
Token ID44978
Feature activation-5.45e-01

Pos logit contributions
virt+8.906
Dem+8.038
+7.863
+7.739
sky+7.607
Neg logit contributions
theless-6.675
 mercy-6.355
 incap-6.325
etheless-5.949
ciating-5.737
Token-
Token ID12
Feature activation-3.19e-01

Pos logit contributions
virt+1.565
+1.402
+1.395
+1.364
Dem+1.349
Neg logit contributions
theless-1.922
 mercy-1.916
 incap-1.826
ilty-1.757
 communion-1.709
Tokenesque
Token ID28939
Feature activation+4.74e-01

Pos logit contributions
virt+1.000
+0.903
+0.892
+0.883
Dem+0.873
Neg logit contributions
 mercy-1.054
theless-1.045
 incap-0.996
ilty-0.953
 communion-0.931
Token back
Token ID736
Feature activation-8.52e-02

Pos logit contributions
 mercy+1.527
theless+1.497
 incap+1.425
ilty+1.372
 autonomy+1.343
Neg logit contributions
virt-1.525
-1.393
-1.371
-1.370
Dem-1.337
Token-
Token ID12
Feature activation+2.53e-01

Pos logit contributions
virt+0.278
+0.253
+0.250
+0.248
Dem+0.244
Neg logit contributions
 mercy-0.268
theless-0.267
 incap-0.253
ilty-0.243
 autonomy-0.236
Token
Token ID251
Feature activation-6.66e-01

Pos logit contributions
 mercy+1.740
theless+1.722
 incap+1.666
ilty+1.626
 autonomy+1.599
Neg logit contributions
virt-0.590
-0.482
-0.466
-0.463
Dem-0.444
Token
Token ID960
Feature activation+2.07e-01

Pos logit contributions
virt+2.299
+2.098
+2.074
+2.041
Dem+2.025
Neg logit contributions
 mercy-1.973
theless-1.952
 incap-1.855
ilty-1.759
 communion-1.715
Tokencross
Token ID19692
Feature activation-1.00e+00

Pos logit contributions
 mercy+0.678
theless+0.672
 incap+0.639
ilty+0.616
 autonomy+0.600
Neg logit contributions
virt-0.640
-0.585
-0.578
-0.573
Dem-0.560
Tokening
Token ID278
Feature activation-7.99e-01

Pos logit contributions
virt+3.445
+3.109
+3.080
+3.075
Dem+3.039
Neg logit contributions
 mercy-2.997
theless-2.939
 incap-2.807
ilty-2.692
 autonomy-2.606
Token highly
Token ID4047
Feature activation-4.83e-03

Pos logit contributions
virt+2.676
+2.430
+2.401
+2.380
Dem+2.353
Neg logit contributions
 mercy-2.438
theless-2.418
 incap-2.287
ilty-2.200
 autonomy-2.129
Token traff
Token ID11878
Feature activation-2.52e+00

Pos logit contributions
virt+0.017
+0.016
+0.016
+0.015
Dem+0.015
Neg logit contributions
 mercy-0.014
theless-0.014
 incap-0.013
ilty-0.012
 autonomy-0.012
Tokenicked
Token ID9484
Feature activation-1.50e+00

Pos logit contributions
virt+6.228
+5.416
+5.240
Dem+5.216
sect+5.158
Neg logit contributions
 mercy-9.607
theless-9.526
 incap-9.248
etheless-8.724
 communion-8.710
Token thorough
Token ID9321
Feature activation-5.89e-01

Pos logit contributions
virt+4.828
+4.363
+4.276
+4.265
Dem+4.251
Neg logit contributions
 mercy-4.619
theless-4.614
 incap-4.372
ilty-4.218
etheless-4.145
Tokenf
Token ID69
Feature activation-1.75e-01

Pos logit contributions
virt+2.234
+2.062
+2.042
+2.028
Dem+1.998
Neg logit contributions
 mercy-1.548
theless-1.527
 incap-1.434
ilty-1.364
 autonomy-1.320
Tokenares
Token ID3565
Feature activation+4.11e-01

Pos logit contributions
virt+0.660
+0.611
+0.604
+0.600
Dem+0.590
Neg logit contributions
 mercy-0.465
theless-0.456
 incap-0.430
ilty-0.410
 communion-0.397
Token
Token ID960
Feature activation+2.80e-01

Pos logit contributions
 mercy+1.161
theless+1.129
 incap+1.070
ilty+1.024
 autonomy+0.995
Neg logit contributions
virt-1.475
-1.354
-1.350
-1.342
Dem-1.316
Token of
Token ID286
Feature activation-1.32e-01

Pos logit contributions
 mercy+0.567
theless+0.565
 incap+0.533
ilty+0.515
 communion+0.497
Neg logit contributions
virt-0.625
-0.569
-0.560
-0.558
Dem-0.549
Token the
Token ID262
Feature activation-1.15e-01

Pos logit contributions
virt+0.471
+0.430
+0.425
+0.423
Dem+0.416
Neg logit contributions
 mercy-0.377
theless-0.375
 incap-0.352
ilty-0.337
 communion-0.326
Token objectionable
Token ID48540
Feature activation-7.41e-01

Pos logit contributions
virt+0.412
+0.379
+0.375
+0.372
Dem+0.366
Neg logit contributions
 mercy-0.329
theless-0.324
 incap-0.307
ilty-0.293
 autonomy-0.285
Token material
Token ID2587
Feature activation+8.70e-01

Pos logit contributions
virt+2.742
+2.517
+2.472
+2.459
Dem+2.454
Neg logit contributions
theless-1.979
 mercy-1.968
 incap-1.832
ilty-1.759
ints-1.696
Token being
Token ID852
Feature activation+7.94e-01

Pos logit contributions
 mercy+2.165
theless+2.117
 incap+1.984
ilty+1.867
 autonomy+1.831
Neg logit contributions
virt-3.398
-3.163
-3.153
-3.108
Dem-3.064
Token traff
Token ID11878
Feature activation-2.51e+00

Pos logit contributions
 mercy+2.194
theless+2.129
 incap+2.012
ilty+1.896
 autonomy+1.873
Neg logit contributions
virt-2.896
-2.682
-2.678
-2.635
Dem-2.590
Tokenicked
Token ID9484
Feature activation-1.21e+00

Pos logit contributions
virt+5.996
Dem+4.937
+4.849
+4.819
sect+4.799
Neg logit contributions
theless-9.848
 mercy-9.839
 incap-9.360
etheless-8.970
 communion-8.939
Token across
Token ID1973
Feature activation-3.66e-01

Pos logit contributions
virt+4.063
+3.636
+3.557
Dem+3.531
+3.526
Neg logit contributions
theless-3.633
 mercy-3.592
 incap-3.416
ilty-3.286
ints-3.187
Token peer
Token ID12720
Feature activation-8.49e-01

Pos logit contributions
virt+1.280
+1.167
+1.148
+1.146
Dem+1.132
Neg logit contributions
 mercy-1.063
theless-1.058
 incap-0.995
ilty-0.960
 communion-0.921
Token to
Token ID284
Feature activation-5.37e-01

Pos logit contributions
virt+2.618
+2.361
+2.314
+2.314
Dem+2.300
Neg logit contributions
 mercy-2.786
theless-2.777
 incap-2.628
ilty-2.538
ints-2.459
Token peer
Token ID12720
Feature activation-4.19e-01

Pos logit contributions
virt+1.261
+1.104
+1.086
+1.073
Dem+1.045
Neg logit contributions
 mercy-2.187
theless-2.168
 incap-2.084
ilty-2.019
 autonomy-1.980
TokenNo
Token ID2949
Feature activation-4.84e-01

Pos logit contributions
 mercy+2.065
theless+2.023
 incap+1.937
ilty+1.888
 communion+1.834
Neg logit contributions
virt-1.725
-1.562
-1.534
-1.514
Dem-1.460
Tokenar
Token ID283
Feature activation+4.45e-02

Pos logit contributions
virt+1.435
+1.290
+1.276
+1.266
Dem+1.257
Neg logit contributions
 mercy-1.662
theless-1.661
 incap-1.576
ilty-1.514
 autonomy-1.478
Token Ha
Token ID9398
Feature activation-3.45e-01

Pos logit contributions
 mercy+0.135
theless+0.134
 incap+0.127
ilty+0.122
 autonomy+0.118
Neg logit contributions
virt-0.150
-0.138
-0.136
-0.135
Dem-0.132
TokenO
Token ID46
Feature activation+3.58e-01

Pos logit contributions
virt+0.984
+0.880
+0.872
+0.868
Dem+0.861
Neg logit contributions
 mercy-1.230
theless-1.229
 incap-1.167
ilty-1.124
 autonomy-1.099
Tokenved
Token ID1079
Feature activation-5.32e-01

Pos logit contributions
 mercy+1.123
theless+1.103
 incap+1.055
ilty+1.014
 communion+0.988
Neg logit contributions
virt-1.168
-1.071
-1.061
-1.049
Dem-1.021
Token Ve
Token ID8016
Feature activation-2.51e+00

Pos logit contributions
virt+1.638
+1.475
+1.458
+1.452
Dem+1.436
Neg logit contributions
 mercy-1.771
theless-1.754
 incap-1.669
ilty-1.598
 autonomy-1.564
TokenHa
Token ID23303
Feature activation+7.86e-02

Pos logit contributions
virt+7.929
Dem+7.446
+7.162
+7.062
Econom+7.017
Neg logit contributions
theless-7.276
 mercy-7.170
 incap-6.841
etheless-6.640
 autonomy-6.426
TokenL
Token ID43
Feature activation+2.98e-01

Pos logit contributions
 mercy+0.262
theless+0.258
 incap+0.247
ilty+0.238
 autonomy+0.232
Neg logit contributions
virt-0.242
-0.220
-0.217
-0.215
Dem-0.210
Tokenomed
Token ID12657
Feature activation-4.28e-01

Pos logit contributions
 mercy+0.940
theless+0.927
 incap+0.885
ilty+0.851
 autonomy+0.827
Neg logit contributions
virt-0.969
-0.885
-0.875
-0.864
Dem-0.847
Token,
Token ID11
Feature activation-5.78e-01

Pos logit contributions
virt+1.428
+1.299
+1.283
+1.277
Dem+1.261
Neg logit contributions
 mercy-1.317
theless-1.307
 incap-1.236
ilty-1.181
 autonomy-1.151
Token D
Token ID360
Feature activation+1.81e-01

Pos logit contributions
virt+1.991
+1.815
+1.800
+1.788
Dem+1.765
Neg logit contributions
 mercy-1.702
theless-1.697
 incap-1.603
ilty-1.539
 communion-1.480
Token and
Token ID290
Feature activation-4.26e-01

Pos logit contributions
virt+0.598
+0.549
+0.543
+0.539
Dem+0.530
Neg logit contributions
 mercy-0.477
theless-0.474
 incap-0.445
ilty-0.426
 communion-0.412
Token climate
Token ID4258
Feature activation-7.59e-02

Pos logit contributions
virt+1.408
+1.284
+1.269
+1.259
Dem+1.237
Neg logit contributions
 mercy-1.328
theless-1.313
 incap-1.246
ilty-1.195
 autonomy-1.164
Token scientists
Token ID5519
Feature activation+5.28e-01

Pos logit contributions
virt+0.276
+0.254
+0.251
+0.249
Dem+0.246
Neg logit contributions
 mercy-0.211
theless-0.209
 incap-0.197
ilty-0.188
 autonomy-0.182
Token who
Token ID508
Feature activation-1.69e-01

Pos logit contributions
 mercy+1.441
theless+1.408
 incap+1.338
ilty+1.275
 autonomy+1.241
Neg logit contributions
virt-1.931
-1.773
-1.760
-1.748
Dem-1.719
Token used
Token ID973
Feature activation-1.33e-01

Pos logit contributions
virt+0.571
+0.521
+0.515
+0.512
Dem+0.503
Neg logit contributions
 mercy-0.516
theless-0.510
 incap-0.483
ilty-0.463
 autonomy-0.451
Token ecosystem
Token ID13187
Feature activation-2.51e+00

Pos logit contributions
virt+0.456
+0.418
+0.413
+0.410
Dem+0.403
Neg logit contributions
 mercy-0.400
theless-0.396
 incap-0.374
ilty-0.359
 autonomy-0.348
Token observations
Token ID13050
Feature activation+1.20e-02

Pos logit contributions
virt+7.682
+7.033
Dem+6.871
+6.865
+6.816
Neg logit contributions
theless-7.536
 mercy-7.374
 incap-7.002
ilty-6.832
etheless-6.797
Token,
Token ID11
Feature activation+2.37e-04

Pos logit contributions
 mercy+0.034
theless+0.034
 incap+0.032
ilty+0.030
 autonomy+0.029
Neg logit contributions
virt-0.043
-0.040
-0.039
-0.039
Dem-0.039
Token laboratory
Token ID14010
Feature activation-3.45e-01

Pos logit contributions
 mercy+0.001
theless+0.001
 incap+0.001
ilty+0.001
 autonomy+0.001
Neg logit contributions
virt-0.001
-0.001
-0.001
-0.001
Dem-0.001
Token experiments
Token ID10256
Feature activation+1.94e-01

Pos logit contributions
virt+1.144
+1.043
+1.030
+1.021
Dem+1.007
Neg logit contributions
 mercy-1.067
theless-1.059
 incap-1.000
ilty-0.960
 communion-0.931
Token and
Token ID290
Feature activation-1.03e-01

Pos logit contributions
 mercy+0.556
theless+0.550
 incap+0.519
ilty+0.495
 autonomy+0.480
Neg logit contributions
virt-0.691
-0.634
-0.628
-0.622
Dem-0.613
Token new
Token ID649
Feature activation+5.46e-01

Pos logit contributions
 mercy+0.986
theless+0.968
 incap+0.917
ilty+0.875
 autonomy+0.858
Neg logit contributions
virt-1.132
-1.033
-1.029
-1.017
Dem-0.998
Token domain
Token ID7386
Feature activation-9.37e-01

Pos logit contributions
 mercy+1.377
theless+1.351
 incap+1.267
ilty+1.202
 autonomy+1.165
Neg logit contributions
virt-2.130
-1.970
-1.957
-1.938
Dem-1.911
Token buying
Token ID7067
Feature activation-1.80e-01

Pos logit contributions
virt+3.028
+2.755
+2.703
+2.663
Dem+2.640
Neg logit contributions
theless-2.955
 mercy-2.940
 incap-2.789
ilty-2.683
imposed-2.596
Token service
Token ID2139
Feature activation+4.89e-01

Pos logit contributions
virt+0.576
+0.524
+0.513
+0.513
Dem+0.505
Neg logit contributions
theless-0.578
 mercy-0.574
 incap-0.544
ilty-0.524
imposed-0.507
Token Google
Token ID3012
Feature activation+6.30e-01

Pos logit contributions
 mercy+1.349
theless+1.324
 incap+1.252
ilty+1.196
 autonomy+1.158
Neg logit contributions
virt-1.782
-1.642
-1.634
-1.616
Dem-1.586
Token Dom
Token ID9666
Feature activation-2.51e+00

Pos logit contributions
 mercy+2.328
theless+2.301
 incap+2.198
ilty+2.132
 autonomy+2.085
Neg logit contributions
virt-1.695
-1.510
-1.501
-1.476
Dem-1.438
Tokenains
Token ID1299
Feature activation-7.39e-01

Pos logit contributions
virt+5.763
+5.170
+4.979
Dem+4.850
+4.645
Neg logit contributions
theless-10.064
 mercy-9.648
 incap-9.415
etheless-8.886
 communion-8.811
Token at
Token ID379
Feature activation+5.02e-01

Pos logit contributions
virt+2.600
+2.397
+2.342
+2.333
Dem+2.309
Neg logit contributions
 mercy-2.139
theless-2.133
 incap-2.012
ilty-1.902
 communion-1.858
Token 1
Token ID352
Feature activation-6.77e-01

Pos logit contributions
 mercy+1.431
theless+1.387
 incap+1.335
ilty+1.260
 autonomy+1.233
Neg logit contributions
virt-1.792
-1.650
-1.634
-1.623
Dem-1.587
Token.
Token ID13
Feature activation+2.71e-01

Pos logit contributions
virt+2.333
+2.149
+2.121
+2.105
Dem+2.073
Neg logit contributions
theless-1.993
 mercy-1.976
 incap-1.843
ilty-1.793
 autonomy-1.724
Token30
Token ID1270
Feature activation+1.60e-01

Pos logit contributions
 mercy+0.838
theless+0.828
 incap+0.784
ilty+0.753
 autonomy+0.732
Neg logit contributions
virt-0.902
-0.823
-0.814
-0.807
Dem-0.793
Token...
Token ID986
Feature activation-5.27e-01

Pos logit contributions
 mercy+2.130
theless+2.065
 incap+1.963
ilty+1.872
 autonomy+1.816
Neg logit contributions
virt-2.671
-2.459
-2.444
-2.419
Dem-2.360
Token
Token ID198
Feature activation+6.17e-01

Pos logit contributions
virt+1.699
+1.550
+1.524
+1.518
Dem+1.497
Neg logit contributions
theless-1.665
 mercy-1.661
 incap-1.570
ilty-1.510
 autonomy-1.471
Token
Token ID198
Feature activation+2.08e-02

Pos logit contributions
 mercy+1.538
theless+1.416
 incap+1.374
ilty+1.338
ints+1.282
Neg logit contributions
virt-2.379
-2.228
-2.217
-2.185
-2.105
TokenDid
Token ID11633
Feature activation-6.47e-01

Pos logit contributions
 mercy+0.081
theless+0.080
 incap+0.077
ilty+0.075
 autonomy+0.073
Neg logit contributions
virt-0.052
-0.046
-0.046
-0.045
Dem-0.044
Token that
Token ID326
Feature activation-6.41e-01

Pos logit contributions
virt+2.052
+1.857
+1.837
+1.829
Dem+1.808
Neg logit contributions
theless-2.077
 mercy-2.061
 incap-1.964
ilty-1.879
 autonomy-1.835
Token Pokémon
Token ID9887
Feature activation+1.40e+00

Pos logit contributions
virt+2.018
+1.836
+1.818
+1.807
Dem+1.780
Neg logit contributions
theless-2.063
 mercy-2.046
 incap-1.936
ilty-1.859
etheless-1.820
Token just
Token ID655
Feature activation-3.26e-01

Pos logit contributions
 mercy+4.340
theless+4.114
 incap+4.018
ilty+3.858
 communion+3.765
Neg logit contributions
virt-4.633
-4.256
-4.201
-4.098
Dem-3.991
Token talk
Token ID1561
Feature activation-3.82e-01

Pos logit contributions
virt+1.025
+0.927
+0.916
+0.909
Dem+0.897
Neg logit contributions
 mercy-1.061
theless-1.055
 incap-1.001
ilty-0.963
 autonomy-0.939
Token?
Token ID30
Feature activation-2.22e-01

Pos logit contributions
virt+1.349
+1.237
+1.221
+1.219
Dem+1.199
Neg logit contributions
 mercy-1.096
theless-1.095
 incap-1.028
ilty-0.977
 autonomy-0.953
Token
Token ID198
Feature activation+7.35e-02

Pos logit contributions
virt+0.742
+0.678
+0.670
+0.665
Dem+0.654
Neg logit contributions
 mercy-0.680
theless-0.674
 incap-0.638
ilty-0.612
 autonomy-0.596
Token
Token ID198
Feature activation-3.48e-01

Pos logit contributions
 mercy+0.198
theless+0.190
 incap+0.181
ilty+0.174
 autonomy+0.167
Neg logit contributions
virt-0.273
-0.252
-0.251
-0.248
Dem-0.242
Token is
Token ID318
Feature activation-3.87e-02

Pos logit contributions
 mercy+1.398
theless+1.374
 incap+1.301
ilty+1.241
 communion+1.203
Neg logit contributions
virt-1.715
-1.578
-1.566
-1.558
Dem-1.526
Token best
Token ID1266
Feature activation-5.78e-02

Pos logit contributions
virt+0.131
+0.120
+0.119
+0.118
Dem+0.116
Neg logit contributions
 mercy-0.117
theless-0.116
 incap-0.110
ilty-0.105
 autonomy-0.102
Token to
Token ID284
Feature activation-1.42e-01

Pos logit contributions
virt+0.206
+0.190
+0.187
+0.186
Dem+0.183
Neg logit contributions
 mercy-0.163
theless-0.163
 incap-0.154
ilty-0.146
 autonomy-0.143
Token wash
Token ID13502
Feature activation-5.84e-02

Pos logit contributions
virt+0.470
+0.428
+0.423
+0.420
Dem+0.413
Neg logit contributions
 mercy-0.442
theless-0.438
 incap-0.415
ilty-0.398
 autonomy-0.387
Token the
Token ID262
Feature activation-2.20e-01

Pos logit contributions
virt+0.202
+0.185
+0.182
+0.181
Dem+0.178
Neg logit contributions
 mercy-0.172
theless-0.171
 incap-0.161
ilty-0.155
 autonomy-0.150
Token Tu
Token ID16749
Feature activation+1.71e+00

Pos logit contributions
virt+0.772
+0.708
+0.701
+0.696
Dem+0.686
Neg logit contributions
 mercy-0.635
theless-0.631
 incap-0.594
ilty-0.569
 autonomy-0.552
Tokenpper
Token ID2848
Feature activation+2.35e-01

Pos logit contributions
 mercy+4.643
theless+4.365
 incap+4.295
ilty+4.253
ints+4.042
Neg logit contributions
virt-6.074
-5.667
-5.649
-5.524
Dem-5.378
Tokenware
Token ID1574
Feature activation+1.01e+00

Pos logit contributions
 mercy+0.864
theless+0.857
 incap+0.818
ilty+0.788
 communion+0.773
Neg logit contributions
virt-0.641
-0.576
-0.568
-0.562
Dem-0.546
Token container
Token ID9290
Feature activation+6.79e-01

Pos logit contributions
 mercy+2.985
theless+2.903
 incap+2.766
ilty+2.660
 communion+2.603
Neg logit contributions
virt-3.424
-3.152
-3.123
-3.112
Dem-3.001
Token completely
Token ID3190
Feature activation+9.28e-01

Pos logit contributions
 mercy+1.982
theless+1.939
 incap+1.839
ilty+1.769
 autonomy+1.709
Neg logit contributions
virt-2.363
-2.179
-2.165
-2.152
Dem-2.093
Token.
Token ID13
Feature activation+1.32e-01

Pos logit contributions
 mercy+2.625
theless+2.549
 incap+2.434
ilty+2.324
 autonomy+2.254
Neg logit contributions
virt-3.314
-3.021
-3.017
-3.014
Dem-2.935
Token.,
Token ID1539
Feature activation+7.40e-01

Pos logit contributions
 mercy+1.505
theless+1.484
 incap+1.413
ilty+1.358
 autonomy+1.321
Neg logit contributions
virt-1.614
-1.474
-1.458
-1.447
Dem-1.418
Token a
Token ID257
Feature activation+2.50e-01

Pos logit contributions
 mercy+2.199
theless+2.158
 incap+2.060
ilty+1.977
 autonomy+1.937
Neg logit contributions
virt-2.508
-2.289
-2.272
-2.271
Dem-2.199
Token licensing
Token ID15665
Feature activation-2.70e-01

Pos logit contributions
 mercy+0.831
theless+0.818
 incap+0.778
ilty+0.750
 autonomy+0.734
Neg logit contributions
virt-0.774
-0.702
-0.697
-0.696
Dem-0.673
Token division
Token ID7297
Feature activation-1.07e-01

Pos logit contributions
virt+0.902
+0.823
+0.814
+0.807
Dem+0.794
Neg logit contributions
 mercy-0.832
theless-0.824
 incap-0.781
ilty-0.749
 autonomy-0.728
Token of
Token ID286
Feature activation+1.72e-01

Pos logit contributions
virt+0.353
+0.321
+0.317
+0.314
Dem+0.310
Neg logit contributions
 mercy-0.333
theless-0.332
 incap-0.313
ilty-0.299
 autonomy-0.291
Token AT
Token ID5161
Feature activation+1.63e+00

Pos logit contributions
 mercy+0.588
theless+0.581
 incap+0.555
ilty+0.535
 autonomy+0.523
Neg logit contributions
virt-0.515
-0.466
-0.460
-0.456
Dem-0.445
Token&
Token ID5
Feature activation-1.15e+00

Pos logit contributions
 mercy+5.108
theless+4.833
 incap+4.743
ilty+4.672
 autonomy+4.493
Neg logit contributions
virt-5.102
-4.692
-4.644
-4.553
Dem-4.401
TokenT
Token ID51
Feature activation+4.95e-01

Pos logit contributions
virt+2.632
+2.325
+2.310
+2.270
Dem+2.247
Neg logit contributions
theless-4.771
 mercy-4.716
 incap-4.564
etheless-4.397
ilty-4.374
Token/
Token ID14
Feature activation+3.13e-01

Pos logit contributions
 mercy+1.458
theless+1.424
 incap+1.364
ilty+1.309
 autonomy+1.272
Neg logit contributions
virt-1.711
-1.566
-1.554
-1.543
Dem-1.512
TokenWestern
Token ID24227
Feature activation+5.39e-01

Pos logit contributions
 mercy+1.167
theless+1.152
 incap+1.105
ilty+1.076
 autonomy+1.047
Neg logit contributions
virt-0.837
-0.748
-0.736
-0.728
Dem-0.704
Token Electric
Token ID13944
Feature activation+1.30e+00

Pos logit contributions
 mercy+1.601
theless+1.580
 incap+1.492
ilty+1.432
 autonomy+1.394
Neg logit contributions
virt-1.856
-1.704
-1.686
-1.675
Dem-1.637
Token Damian
Token ID42574
Feature activation-2.36e-01

Pos logit contributions
 mercy+0.290
theless+0.287
 incap+0.273
ilty+0.263
 autonomy+0.257
Neg logit contributions
virt-0.265
-0.240
-0.237
-0.235
Dem-0.230
Token Allow
Token ID22507
Feature activation-6.45e-01

Pos logit contributions
virt+0.727
+0.657
+0.647
+0.641
Dem+0.633
Neg logit contributions
 mercy-0.784
theless-0.778
 incap-0.742
ilty-0.712
 autonomy-0.695
Tokenay
Token ID323
Feature activation-6.99e-02

Pos logit contributions
virt+2.020
+1.832
+1.797
Dem+1.776
+1.774
Neg logit contributions
 mercy-2.108
theless-2.086
 incap-1.999
ilty-1.886
 communion-1.858
Token committed
Token ID5364
Feature activation-1.99e-01

Pos logit contributions
virt+0.242
+0.222
+0.219
+0.218
Dem+0.214
Neg logit contributions
 mercy-0.207
theless-0.204
 incap-0.193
ilty-0.185
 autonomy-0.180
Token to
Token ID284
Feature activation+1.75e-03

Pos logit contributions
virt+0.729
+0.670
+0.662
+0.658
Dem+0.650
Neg logit contributions
 mercy-0.548
theless-0.547
 incap-0.512
ilty-0.486
 autonomy-0.470
Token UCLA
Token ID21750
Feature activation+1.40e+00

Pos logit contributions
 mercy+0.007
theless+0.007
 incap+0.006
ilty+0.006
 autonomy+0.006
Neg logit contributions
virt-0.005
-0.004
-0.004
-0.004
Dem-0.004
Token on
Token ID319
Feature activation+2.07e-02

Pos logit contributions
 mercy+3.959
theless+3.763
 incap+3.552
ilty+3.472
 autonomy+3.426
Neg logit contributions
virt-4.927
-4.553
-4.531
-4.508
Dem-4.354
Token Friday
Token ID3217
Feature activation-3.23e-01

Pos logit contributions
 mercy+0.075
theless+0.074
 incap+0.071
ilty+0.068
 autonomy+0.067
Neg logit contributions
virt-0.058
-0.052
-0.051
-0.051
Dem-0.050
Token via
Token ID2884
Feature activation+1.70e-01

Pos logit contributions
virt+1.094
+0.999
+0.979
+0.973
Dem+0.968
Neg logit contributions
 mercy-0.978
theless-0.972
 incap-0.926
ilty-0.880
 communion-0.855
Token Ble
Token ID17175
Feature activation+1.41e+00

Pos logit contributions
 mercy+0.538
theless+0.532
 incap+0.505
ilty+0.485
 autonomy+0.472
Neg logit contributions
virt-0.555
-0.506
-0.500
-0.496
Dem-0.487
Tokenacher
Token ID3493
Feature activation-1.01e+00

Pos logit contributions
 mercy+4.568
theless+4.355
 incap+4.268
ilty+4.161
ints+4.156
Neg logit contributions
virt-4.282
-3.984
-3.867
-3.783
Dem-3.672
Token John
Token ID1757
Feature activation+2.80e-01

Pos logit contributions
virt+1.047
+0.943
+0.937
+0.925
Dem+0.915
Neg logit contributions
 mercy-1.123
theless-1.115
 incap-1.059
ilty-1.021
 communion-0.992
Token McCain
Token ID14264
Feature activation-2.69e-01

Pos logit contributions
 mercy+0.946
theless+0.930
 incap+0.892
ilty+0.853
 autonomy+0.839
Neg logit contributions
virt-0.846
-0.776
-0.762
-0.758
Dem-0.735
Token.
Token ID13
Feature activation-7.52e-02

Pos logit contributions
virt+0.907
+0.828
+0.819
+0.810
Dem+0.800
Neg logit contributions
theless-0.824
 mercy-0.822
 incap-0.775
ilty-0.742
 autonomy-0.722
Token
Token ID198
Feature activation-1.27e-02

Pos logit contributions
virt+0.249
+0.227
+0.224
+0.222
Dem+0.218
Neg logit contributions
 mercy-0.234
theless-0.231
 incap-0.219
ilty-0.210
 autonomy-0.205
Token
Token ID198
Feature activation-4.97e-01

Pos logit contributions
virt+0.047
+0.043
+0.043
+0.043
Dem+0.042
Neg logit contributions
 mercy-0.034
theless-0.033
 incap-0.032
ilty-0.030
 autonomy-0.029
TokenNewsletter
Token ID33031
Feature activation+1.56e+00

Pos logit contributions
virt+1.247
+1.101
+1.085
+1.072
Dem+1.057
Neg logit contributions
 mercy-1.950
theless-1.941
 incap-1.855
ilty-1.796
 autonomy-1.761
Token Sign
Token ID5865
Feature activation+3.42e-01

Pos logit contributions
 mercy+3.572
theless+3.466
 incap+3.352
ilty+3.170
 autonomy+3.084
Neg logit contributions
virt-6.052
-5.895
-5.833
-5.697
-5.548
Token Up
Token ID3205
Feature activation-2.43e-01

Pos logit contributions
 mercy+1.086
theless+1.047
 incap+1.009
ilty+0.981
 communion+0.947
Neg logit contributions
virt-1.101
-1.014
-0.998
-0.979
Dem-0.965
Token Continue
Token ID10054
Feature activation+5.37e-01

Pos logit contributions
virt+0.741
+0.672
+0.664
+0.657
Dem+0.643
Neg logit contributions
 mercy-0.817
theless-0.806
 incap-0.768
ilty-0.742
 autonomy-0.725
Token reading
Token ID3555
Feature activation+1.13e+00

Pos logit contributions
 mercy+0.788
theless+0.755
 incap+0.676
ilty+0.612
 autonomy+0.589
Neg logit contributions
virt-2.662
-2.516
-2.496
-2.476
Dem-2.446
Token the
Token ID262
Feature activation+1.69e-01

Pos logit contributions
 mercy+2.658
theless+2.458
 incap+2.401
ilty+2.259
 autonomy+2.246
Neg logit contributions
virt-4.573
-4.271
-4.238
-4.200
-4.105
Token admitted
Token ID6848
Feature activation-1.15e-01

Pos logit contributions
virt+0.057
+0.052
+0.052
+0.052
Dem+0.051
Neg logit contributions
 mercy-0.035
theless-0.034
 incap-0.032
ilty-0.030
 communion-0.029
Token to
Token ID284
Feature activation+6.29e-01

Pos logit contributions
virt+0.414
+0.379
+0.375
+0.373
Dem+0.369
Neg logit contributions
 mercy-0.321
theless-0.319
 incap-0.299
ilty-0.285
 autonomy-0.277
Token the
Token ID262
Feature activation+7.06e-01

Pos logit contributions
 mercy+1.896
theless+1.840
 incap+1.770
ilty+1.682
 communion+1.652
Neg logit contributions
virt-2.160
-1.977
-1.965
-1.939
Dem-1.880
Token hospital
Token ID4436
Feature activation+9.11e-01

Pos logit contributions
 mercy+2.157
theless+2.065
 incap+1.992
ilty+1.886
 communion+1.862
Neg logit contributions
virt-2.415
-2.233
-2.219
-2.187
Dem-2.108
Token.
Token ID13
Feature activation-1.52e-01

Pos logit contributions
 mercy+2.578
theless+2.476
 incap+2.393
ilty+2.272
 communion+2.220
Neg logit contributions
virt-3.256
-3.005
-2.998
-2.962
Dem-2.893
Token Duncan
Token ID18625
Feature activation+1.57e-01

Pos logit contributions
virt+0.519
+0.475
+0.470
+0.466
Dem+0.458
Neg logit contributions
 mercy-0.454
theless-0.448
 incap-0.424
ilty-0.407
 autonomy-0.395
Token had
Token ID550
Feature activation-3.78e-01

Pos logit contributions
 mercy+0.427
theless+0.421
 incap+0.396
ilty+0.378
 autonomy+0.366
Neg logit contributions
virt-0.579
-0.534
-0.528
-0.524
Dem-0.516
Token become
Token ID1716
Feature activation-7.10e-01

Pos logit contributions
virt+1.361
+1.249
+1.235
+1.229
Dem+1.210
Neg logit contributions
 mercy-1.054
theless-1.053
 incap-0.986
ilty-0.948
 autonomy-0.910
Token infected
Token ID14112
Feature activation-2.58e-01

Pos logit contributions
virt+2.333
+2.129
+2.094
+2.079
Dem+2.051
Neg logit contributions
 mercy-2.211
theless-2.203
 incap-2.065
ilty-1.994
 autonomy-1.931
Token while
Token ID981
Feature activation-9.28e-01

Pos logit contributions
virt+0.884
+0.805
+0.796
+0.791
Dem+0.779
Neg logit contributions
 mercy-0.768
theless-0.763
 incap-0.720
ilty-0.690
 autonomy-0.670
Token in
Token ID287
Feature activation-3.39e-01

Pos logit contributions
virt+3.404
+3.133
+3.095
+3.072
Dem+3.046
Neg logit contributions
theless-2.509
 mercy-2.476
 incap-2.312
ilty-2.235
ints-2.147
Token many
Token ID867
Feature activation-6.45e-01

Pos logit contributions
virt+0.075
+0.068
+0.068
+0.067
Dem+0.066
Neg logit contributions
 mercy-0.066
theless-0.065
 incap-0.062
ilty-0.059
 autonomy-0.058
Token people
Token ID661
Feature activation-1.52e-01

Pos logit contributions
virt+2.166
+1.975
+1.946
+1.934
Dem+1.906
Neg logit contributions
 mercy-1.960
theless-1.944
 incap-1.842
ilty-1.776
 autonomy-1.716
Token may
Token ID743
Feature activation+2.94e-02

Pos logit contributions
virt+0.509
+0.464
+0.458
+0.454
Dem+0.447
Neg logit contributions
 mercy-0.469
theless-0.464
 incap-0.439
ilty-0.422
 autonomy-0.410
Token know
Token ID760
Feature activation+1.25e-01

Pos logit contributions
 mercy+0.089
theless+0.088
 incap+0.084
ilty+0.080
 autonomy+0.078
Neg logit contributions
virt-0.100
-0.092
-0.091
-0.090
Dem-0.088
Token them
Token ID606
Feature activation+4.72e-01

Pos logit contributions
 mercy+0.351
theless+0.346
 incap+0.327
ilty+0.312
 autonomy+0.303
Neg logit contributions
virt-0.452
-0.416
-0.413
-0.409
Dem-0.403
Token by
Token ID416
Feature activation+5.03e-01

Pos logit contributions
 mercy+1.286
theless+1.262
 incap+1.194
ilty+1.134
 autonomy+1.099
Neg logit contributions
virt-1.737
-1.608
-1.600
-1.579
Dem-1.555
Token other
Token ID584
Feature activation-3.48e-01

Pos logit contributions
 mercy+1.488
theless+1.458
 incap+1.391
ilty+1.324
 autonomy+1.294
Neg logit contributions
virt-1.740
-1.596
-1.582
-1.567
Dem-1.541
Token names
Token ID3891
Feature activation-1.90e-01

Pos logit contributions
virt+1.269
+1.166
+1.153
+1.150
Dem+1.131
Neg logit contributions
 mercy-0.958
theless-0.950
 incap-0.890
ilty-0.852
 autonomy-0.822
Token such
Token ID884
Feature activation+7.20e-01

Pos logit contributions
virt+0.674
+0.619
+0.610
+0.608
Dem+0.597
Neg logit contributions
 mercy-0.545
theless-0.541
 incap-0.510
ilty-0.488
 autonomy-0.472
Token as
Token ID355
Feature activation+2.52e-01

Pos logit contributions
 mercy+1.458
theless+1.381
 incap+1.303
ilty+1.213
 autonomy+1.171
Neg logit contributions
virt-3.127
-2.931
-2.912
-2.866
Dem-2.839
Token red
Token ID2266
Feature activation-2.81e-01

Pos logit contributions
 mercy+0.776
theless+0.766
 incap+0.727
ilty+0.696
 autonomy+0.678
Neg logit contributions
virt-0.841
-0.768
-0.759
-0.753
Dem-0.739
Token
Token ID447
Feature activation+1.42e-01

Pos logit contributions
 mercy+3.748
theless+3.703
 incap+3.568
ilty+3.466
 autonomy+3.413
Neg logit contributions
virt-1.808
-1.556
-1.525
-1.501
Dem-1.454
Token
Token ID247
Feature activation+2.53e-01

Pos logit contributions
 mercy+0.674
theless+0.658
 incap+0.642
ilty+0.630
 autonomy+0.620
Neg logit contributions
virt-0.234
-0.195
-0.189
-0.184
Dem-0.179
Tokens
Token ID82
Feature activation+1.77e-01

Pos logit contributions
 mercy+0.873
theless+0.865
 incap+0.826
ilty+0.796
 communion+0.776
Neg logit contributions
virt-0.756
-0.680
-0.669
-0.667
Dem-0.652
Token Prom
Token ID10335
Feature activation-4.36e-01

Pos logit contributions
 mercy+0.538
theless+0.531
 incap+0.503
ilty+0.482
 autonomy+0.469
Neg logit contributions
virt-0.601
-0.549
-0.543
-0.538
Dem-0.529
Tokenethe
Token ID10567
Feature activation-6.40e-02

Pos logit contributions
virt+1.617
+1.492
+1.465
+1.461
Dem+1.451
Neg logit contributions
 mercy-1.177
theless-1.166
 incap-1.093
ilty-1.035
 communion-1.003
Tokena
Token ID64
Feature activation+8.42e-02

Pos logit contributions
virt+0.247
+0.227
+0.225
+0.224
Dem+0.222
Neg logit contributions
 mercy-0.164
theless-0.164
 incap-0.152
ilty-0.145
 autonomy-0.140
Token,
Token ID11
Feature activation+2.67e-01

Pos logit contributions
 mercy+0.269
theless+0.267
 incap+0.253
ilty+0.243
 autonomy+0.236
Neg logit contributions
virt-0.271
-0.247
-0.244
-0.242
Dem-0.238
Token serial
Token ID11389
Feature activation+4.85e-01

Pos logit contributions
 mercy+0.796
theless+0.785
 incap+0.744
ilty+0.711
 autonomy+0.692
Neg logit contributions
virt-0.919
-0.842
-0.832
-0.826
Dem-0.811
Tokenized
Token ID1143
Feature activation+3.04e-01

Pos logit contributions
 mercy+1.517
theless+1.501
 incap+1.422
ilty+1.359
 autonomy+1.332
Neg logit contributions
virt-1.597
-1.456
-1.440
-1.422
Dem-1.398
Token from
Token ID422
Feature activation+3.21e-02

Pos logit contributions
 mercy+0.875
theless+0.862
 incap+0.814
ilty+0.778
 autonomy+0.757
Neg logit contributions
virt-1.077
-0.990
-0.980
-0.971
Dem-0.954
Token 1999
Token ID7358
Feature activation-2.25e-03

Pos logit contributions
 mercy+0.095
theless+0.094
 incap+0.089
ilty+0.085
 autonomy+0.082
Neg logit contributions
virt-0.111
-0.102
-0.100
-0.100
Dem-0.098
Token set
Token ID900
Feature activation+6.61e-02

Pos logit contributions
virt+0.224
+0.206
+0.204
+0.203
Dem+0.200
Neg logit contributions
 mercy-0.162
theless-0.159
 incap-0.150
ilty-0.143
 autonomy-0.138
Token up
Token ID510
Feature activation-4.77e-01

Pos logit contributions
 mercy+0.211
theless+0.208
 incap+0.198
ilty+0.190
 autonomy+0.185
Neg logit contributions
virt-0.214
-0.195
-0.193
-0.191
Dem-0.187
Token,
Token ID11
Feature activation-1.25e-02

Pos logit contributions
virt+1.624
+1.483
+1.463
+1.448
Dem+1.432
Neg logit contributions
 mercy-1.435
theless-1.435
 incap-1.350
ilty-1.291
 autonomy-1.254
Token concrete
Token ID10017
Feature activation-6.02e-02

Pos logit contributions
virt+0.044
+0.041
+0.040
+0.040
Dem+0.039
Neg logit contributions
 mercy-0.036
theless-0.036
 incap-0.034
ilty-0.032
 autonomy-0.031
Token foot
Token ID2366
Feature activation-2.36e-01

Pos logit contributions
virt+0.208
+0.191
+0.188
+0.187
Dem+0.184
Neg logit contributions
 mercy-0.178
theless-0.177
 incap-0.167
ilty-0.160
 autonomy-0.155
Tokenings
Token ID654
Feature activation+6.22e-02

Pos logit contributions
virt+0.733
+0.662
+0.652
+0.646
Dem+0.637
Neg logit contributions
 mercy-0.785
theless-0.782
 incap-0.741
ilty-0.713
 autonomy-0.693
Token were
Token ID547
Feature activation-1.65e-01

Pos logit contributions
 mercy+0.172
theless+0.169
 incap+0.160
ilty+0.152
 autonomy+0.148
Neg logit contributions
virt-0.227
-0.209
-0.207
-0.206
Dem-0.202
Token poured
Token ID19036
Feature activation-4.62e-01

Pos logit contributions
virt+0.622
+0.574
+0.568
+0.564
Dem+0.556
Neg logit contributions
 mercy-0.436
theless-0.431
 incap-0.404
ilty-0.385
 autonomy-0.372
Token (
Token ID357
Feature activation+3.36e-01

Pos logit contributions
virt+1.557
+1.414
+1.392
+1.377
Dem+1.366
Neg logit contributions
theless-1.405
 mercy-1.403
 incap-1.324
ilty-1.267
 autonomy-1.233
Tokenone
Token ID505
Feature activation-1.06e-01

Pos logit contributions
 mercy+1.099
theless+1.088
 incap+1.037
ilty+1.001
imposed+0.977
Neg logit contributions
virt-1.038
-0.948
-0.939
-0.929
Dem-0.904
Token is
Token ID318
Feature activation+6.53e-02

Pos logit contributions
virt+0.366
+0.335
+0.332
+0.329
Dem+0.324
Neg logit contributions
 mercy-0.315
theless-0.314
 incap-0.296
ilty-0.283
 autonomy-0.275
Token campaign
Token ID1923
Feature activation+2.19e-02

Pos logit contributions
virt+0.004
+0.003
+0.003
+0.003
Dem+0.003
Neg logit contributions
 mercy-0.003
theless-0.003
 incap-0.003
ilty-0.003
 autonomy-0.003
Token is
Token ID318
Feature activation-1.35e-01

Pos logit contributions
 mercy+0.065
theless+0.064
 incap+0.061
ilty+0.058
 communion+0.056
Neg logit contributions
virt-0.076
-0.069
-0.068
-0.068
Dem-0.067
Token over
Token ID625
Feature activation-3.83e-01

Pos logit contributions
virt+0.460
+0.420
+0.416
+0.412
Dem+0.406
Neg logit contributions
 mercy-0.407
theless-0.403
 incap-0.381
ilty-0.365
 autonomy-0.355
Token,
Token ID11
Feature activation+4.85e-02

Pos logit contributions
virt+1.338
+1.220
+1.205
+1.197
Dem+1.183
Neg logit contributions
 mercy-1.115
theless-1.110
 incap-1.047
ilty-0.999
 autonomy-0.970
Token and
Token ID290
Feature activation-3.60e-01

Pos logit contributions
 mercy+0.152
theless+0.149
 incap+0.142
ilty+0.136
 autonomy+0.133
Neg logit contributions
virt-0.160
-0.146
-0.144
-0.143
Dem-0.140
Token you
Token ID345
Feature activation+6.43e-01

Pos logit contributions
virt+1.206
+1.097
+1.085
+1.076
Dem+1.065
Neg logit contributions
 mercy-1.095
theless-1.092
 incap-1.032
ilty-0.989
 autonomy-0.959
Token return
Token ID1441
Feature activation+7.63e-03

Pos logit contributions
 mercy+1.945
theless+1.890
 incap+1.800
ilty+1.733
 autonomy+1.682
Neg logit contributions
virt-2.182
-2.011
-1.982
-1.966
Dem-1.920
Token the
Token ID262
Feature activation+2.54e-01

Pos logit contributions
 mercy+0.023
theless+0.022
 incap+0.021
ilty+0.020
 autonomy+0.020
Neg logit contributions
virt-0.026
-0.024
-0.024
-0.024
Dem-0.023
Token backer
Token ID37617
Feature activation+7.25e-01

Pos logit contributions
 mercy+0.779
theless+0.762
 incap+0.726
ilty+0.698
 communion+0.678
Neg logit contributions
virt-0.857
-0.784
-0.774
-0.769
Dem-0.752
Token survey
Token ID5526
Feature activation+6.57e-01

Pos logit contributions
 mercy+1.981
theless+1.921
 incap+1.827
ilty+1.747
 communion+1.695
Neg logit contributions
virt-2.688
-2.479
-2.452
-2.434
Dem-2.381
Token,
Token ID11
Feature activation+3.55e-01

Pos logit contributions
 mercy+1.857
theless+1.815
 incap+1.725
ilty+1.660
 communion+1.598
Neg logit contributions
virt-2.358
-2.171
-2.146
-2.132
Dem-2.086
Token and
Token ID290
Feature activation-4.43e-01

Pos logit contributions
virt+3.785
+3.425
+3.351
+3.341
Dem+3.326
Neg logit contributions
theless-3.558
 mercy-3.490
 incap-3.350
ilty-3.220
 communion-3.108
Token Co
Token ID1766
Feature activation+5.03e-01

Pos logit contributions
virt+1.415
+1.283
+1.265
+1.258
Dem+1.242
Neg logit contributions
 mercy-1.418
theless-1.409
 incap-1.336
ilty-1.287
 communion-1.250
Token.,
Token ID1539
Feature activation+3.77e-01

Pos logit contributions
 mercy+1.550
theless+1.507
 incap+1.440
ilty+1.388
imposed+1.346
Neg logit contributions
virt-1.675
-1.535
-1.520
-1.503
Dem-1.471
Token giving
Token ID3501
Feature activation+1.77e-01

Pos logit contributions
 mercy+1.074
theless+1.049
 incap+0.996
ilty+0.939
 autonomy+0.925
Neg logit contributions
virt-1.349
-1.246
-1.236
-1.228
Dem-1.200
Token up
Token ID510
Feature activation-7.17e-01

Pos logit contributions
 mercy+0.553
theless+0.546
 incap+0.518
ilty+0.497
 autonomy+0.484
Neg logit contributions
virt-0.586
-0.535
-0.529
-0.524
Dem-0.514
Token two
Token ID734
Feature activation-7.72e-02

Pos logit contributions
virt+2.192
+1.960
+1.944
+1.921
Dem+1.902
Neg logit contributions
theless-2.391
 mercy-2.366
 incap-2.245
ilty-2.178
imposed-2.111
Token receptions
Token ID35401
Feature activation-5.31e-01

Pos logit contributions
virt+0.260
+0.237
+0.234
+0.233
Dem+0.229
Neg logit contributions
 mercy-0.236
theless-0.233
 incap-0.221
ilty-0.212
 autonomy-0.206
Token on
Token ID319
Feature activation-7.93e-01

Pos logit contributions
virt+1.778
+1.624
+1.595
+1.588
Dem+1.566
Neg logit contributions
 mercy-1.625
theless-1.613
 incap-1.526
ilty-1.465
 autonomy-1.422
Token three
Token ID1115
Feature activation-2.30e-01

Pos logit contributions
virt+2.420
+2.168
+2.134
+2.134
Dem+2.109
Neg logit contributions
theless-2.650
 mercy-2.624
 incap-2.483
ilty-2.419
imposed-2.331
Token targets
Token ID6670
Feature activation-9.39e-01

Pos logit contributions
virt+0.715
+0.648
+0.641
+0.634
Dem+0.622
Neg logit contributions
 mercy-0.764
theless-0.753
 incap-0.719
ilty-0.690
 autonomy-0.674
Token for
Token ID329
Feature activation-2.97e-01

Pos logit contributions
virt+3.058
+2.776
+2.704
+2.695
Dem+2.683
Neg logit contributions
theless-2.925
 mercy-2.899
 incap-2.748
ilty-2.650
 autonomy-2.558
Token endowed
Token ID44134
Feature activation+6.87e-01

Pos logit contributions
 mercy+0.635
theless+0.625
 incap+0.591
ilty+0.565
 autonomy+0.550
Neg logit contributions
virt-0.777
-0.714
-0.706
-0.701
Dem-0.689
Token.
Token ID13
Feature activation-1.16e-01

Pos logit contributions
 mercy+2.069
theless+2.031
 incap+1.925
ilty+1.839
 communion+1.806
Neg logit contributions
virt-2.345
-2.149
-2.127
-2.107
Dem-2.069
Token It
Token ID632
Feature activation+6.25e-01

Pos logit contributions
virt+0.361
+0.327
+0.324
+0.321
Dem+0.314
Neg logit contributions
 mercy-0.384
theless-0.379
 incap-0.360
ilty-0.347
 autonomy-0.338
Token seemed
Token ID3947
Feature activation+5.33e-02

Pos logit contributions
 mercy+1.706
theless+1.679
 incap+1.576
ilty+1.507
 autonomy+1.457
Neg logit contributions
virt-2.277
-2.104
-2.087
-2.072
Dem-2.046
Token like
Token ID588
Feature activation-1.15e-01

Pos logit contributions
 mercy+0.146
theless+0.145
 incap+0.136
ilty+0.130
 autonomy+0.125
Neg logit contributions
virt-0.196
-0.180
-0.178
-0.177
Dem-0.174
Token the
Token ID262
Feature activation-3.45e-01

Pos logit contributions
virt+0.385
+0.351
+0.347
+0.344
Dem+0.339
Neg logit contributions
 mercy-0.350
theless-0.347
 incap-0.328
ilty-0.315
 autonomy-0.306
Token proper
Token ID1774
Feature activation-6.63e-01

Pos logit contributions
virt+1.098
+0.997
+0.985
+0.975
Dem+0.959
Neg logit contributions
 mercy-1.112
theless-1.106
 incap-1.048
ilty-1.008
 autonomy-0.980
Token next
Token ID1306
Feature activation-8.94e-01

Pos logit contributions
virt+2.063
+1.870
+1.841
+1.825
Dem+1.795
Neg logit contributions
theless-2.185
 mercy-2.176
 incap-2.057
ilty-1.978
imposed-1.919
Token step
Token ID2239
Feature activation-6.40e-01

Pos logit contributions
virt+2.579
+2.317
+2.287
+2.264
Dem+2.223
Neg logit contributions
theless-3.147
 mercy-3.085
 incap-2.943
ilty-2.831
ints-2.766
Token.
Token ID13
Feature activation-1.96e-01

Pos logit contributions
virt+2.196
+2.016
+1.986
+1.968
Dem+1.947
Neg logit contributions
theless-1.888
 mercy-1.886
 incap-1.785
ilty-1.701
 communion-1.645
Token I
Token ID314
Feature activation-1.73e-01

Pos logit contributions
virt+0.617
+0.560
+0.553
+0.548
Dem+0.538
Neg logit contributions
 mercy-0.639
theless-0.632
 incap-0.601
ilty-0.577
 autonomy-0.563
Tokenyear
Token ID1941
Feature activation+6.62e-01

Pos logit contributions
 mercy+2.459
theless+2.375
 incap+2.227
ilty+2.213
imposed+2.166
Neg logit contributions
virt-3.626
-3.428
-3.426
-3.391
-3.270
Token-
Token ID12
Feature activation+1.98e-01

Pos logit contributions
 mercy+2.016
theless+1.970
 incap+1.880
ilty+1.801
 autonomy+1.758
Neg logit contributions
virt-2.235
-2.041
-2.017
-2.002
Dem-1.970
Tokenold
Token ID727
Feature activation-4.06e-02

Pos logit contributions
 mercy+0.651
theless+0.632
 incap+0.604
ilty+0.600
imposed+0.582
Neg logit contributions
virt-0.604
-0.550
-0.549
-0.539
Dem-0.520
Token left
Token ID1364
Feature activation-7.80e-01

Pos logit contributions
virt+0.134
+0.122
+0.121
+0.120
Dem+0.118
Neg logit contributions
 mercy-0.127
theless-0.125
 incap-0.119
ilty-0.114
 autonomy-0.111
Token guard
Token ID4860
Feature activation+1.72e-01

Pos logit contributions
virt+2.417
+2.199
+2.161
+2.153
Dem+2.120
Neg logit contributions
theless-2.562
 mercy-2.533
 incap-2.418
ilty-2.301
 autonomy-2.249
Token is
Token ID318
Feature activation-2.86e-01

Pos logit contributions
 mercy+0.514
theless+0.507
 incap+0.480
ilty+0.460
 autonomy+0.448
Neg logit contributions
virt-0.586
-0.537
-0.531
-0.528
Dem-0.518
Token quietly
Token ID12703
Feature activation-8.33e-01

Pos logit contributions
virt+0.989
+0.905
+0.895
+0.888
Dem+0.874
Neg logit contributions
 mercy-0.849
theless-0.840
 incap-0.794
ilty-0.760
 autonomy-0.739
Token one
Token ID530
Feature activation-6.93e-01

Pos logit contributions
virt+2.816
+2.572
+2.542
+2.507
Dem+2.483
Neg logit contributions
 mercy-2.506
theless-2.488
 incap-2.358
ilty-2.272
 autonomy-2.186
Token of
Token ID286
Feature activation-4.20e-02

Pos logit contributions
virt+2.158
+1.954
+1.944
+1.913
Dem+1.897
Neg logit contributions
theless-2.258
 mercy-2.236
 incap-2.130
ilty-2.041
ints-1.987
Token the
Token ID262
Feature activation-3.12e-01

Pos logit contributions
virt+0.150
+0.137
+0.136
+0.135
Dem+0.133
Neg logit contributions
 mercy-0.119
theless-0.119
 incap-0.111
ilty-0.106
 communion-0.103
Token best
Token ID1266
Feature activation-7.44e-01

Pos logit contributions
virt+1.027
+0.936
+0.925
+0.918
Dem+0.902
Neg logit contributions
 mercy-0.974
theless-0.963
 incap-0.914
ilty-0.877
 autonomy-0.854
Token in
Token ID287
Feature activation-3.02e-01

Pos logit contributions
virt+1.896
+1.737
+1.712
+1.695
Dem+1.676
Neg logit contributions
 mercy-1.680
theless-1.676
 incap-1.578
ilty-1.513
 autonomy-1.465
Token America
Token ID2253
Feature activation-7.47e-01

Pos logit contributions
virt+0.994
+0.903
+0.892
+0.882
Dem+0.873
Neg logit contributions
 mercy-0.940
theless-0.939
 incap-0.883
ilty-0.849
imposed-0.822
Token.
Token ID13
Feature activation+1.14e-01

Pos logit contributions
virt+2.464
+2.259
+2.216
+2.199
Dem+2.175
Neg logit contributions
theless-2.302
 mercy-2.302
 incap-2.176
ilty-2.082
 autonomy-2.023
Token Everyone
Token ID11075
Feature activation-1.98e-01

Pos logit contributions
 mercy+0.356
theless+0.346
 incap+0.328
ilty+0.316
 autonomy+0.308
Neg logit contributions
virt-0.374
-0.341
-0.340
-0.337
Dem-0.325
Token gave
Token ID2921
Feature activation+5.68e-01

Pos logit contributions
virt+0.719
+0.661
+0.654
+0.649
Dem+0.639
Neg logit contributions
 mercy-0.550
theless-0.544
 incap-0.513
ilty-0.489
 autonomy-0.474
Token themselves
Token ID2405
Feature activation-4.90e-01

Pos logit contributions
 mercy+1.685
theless+1.634
 incap+1.563
ilty+1.491
 communion+1.467
Neg logit contributions
virt-1.965
-1.806
-1.795
-1.779
Dem-1.739
Token the
Token ID262
Feature activation+6.83e-01

Pos logit contributions
virt+1.662
+1.519
+1.513
+1.493
Dem+1.464
Neg logit contributions
 mercy-1.452
theless-1.452
 incap-1.362
ilty-1.322
imposed-1.269
Token rhetorical
Token ID38505
Feature activation+3.25e-01

Pos logit contributions
 mercy+2.076
theless+1.989
 incap+1.923
ilty+1.840
 autonomy+1.816
Neg logit contributions
virt-2.303
-2.101
-2.091
-2.088
Dem-2.047
Token authority
Token ID4934
Feature activation+4.93e-01

Pos logit contributions
 mercy+0.950
theless+0.910
 incap+0.883
ilty+0.834
 autonomy+0.825
Neg logit contributions
virt-1.143
-1.051
-1.043
-1.034
Dem-1.019
Token to
Token ID284
Feature activation+4.21e-01

Pos logit contributions
 mercy+1.331
theless+1.310
 incap+1.235
ilty+1.177
 autonomy+1.141
Neg logit contributions
virt-1.832
-1.689
-1.671
-1.659
Dem-1.635
Token tweet
Token ID6126
Feature activation+3.95e-01

Pos logit contributions
 mercy+1.345
theless+1.311
 incap+1.260
ilty+1.191
 autonomy+1.184
Neg logit contributions
virt-1.363
-1.250
-1.235
-1.220
Dem-1.200
Tokenakes
Token ID1124
Feature activation+2.99e-01

Pos logit contributions
 mercy+3.030
theless+2.997
 incap+2.866
ilty+2.794
 autonomy+2.712
Neg logit contributions
virt-2.187
-1.948
-1.931
-1.913
Dem-1.860
Token on
Token ID319
Feature activation-1.86e-01

Pos logit contributions
 mercy+0.888
theless+0.867
 incap+0.827
ilty+0.787
 autonomy+0.771
Neg logit contributions
virt-1.038
-0.952
-0.944
-0.936
Dem-0.921
Token infinite
Token ID15541
Feature activation+1.49e-01

Pos logit contributions
virt+0.637
+0.583
+0.575
+0.570
Dem+0.559
Neg logit contributions
 mercy-0.553
theless-0.553
 incap-0.518
ilty-0.500
imposed-0.483
Token variety
Token ID4996
Feature activation-4.32e-01

Pos logit contributions
 mercy+0.411
theless+0.406
 incap+0.382
ilty+0.365
 autonomy+0.353
Neg logit contributions
virt-0.543
-0.500
-0.495
-0.491
Dem-0.484
Token
Token ID447
Feature activation+1.67e-01

Pos logit contributions
virt+1.427
+1.304
+1.280
+1.272
Dem+1.251
Neg logit contributions
 mercy-1.341
theless-1.336
 incap-1.267
ilty-1.213
 communion-1.182
Token
Token ID251
Feature activation-6.35e-01

Pos logit contributions
 mercy+0.770
theless+0.755
 incap+0.733
ilty+0.718
 communion+0.704
Neg logit contributions
virt-0.297
-0.248
-0.241
-0.239
Dem-0.235
Token.
Token ID13
Feature activation+1.70e-01

Pos logit contributions
virt+2.113
+1.931
+1.890
+1.872
Dem+1.849
Neg logit contributions
 mercy-1.948
theless-1.937
 incap-1.842
ilty-1.763
imposed-1.708
Token
Token ID198
Feature activation+3.52e-01

Pos logit contributions
 mercy+0.528
theless+0.520
 incap+0.492
ilty+0.472
 autonomy+0.461
Neg logit contributions
virt-0.562
-0.511
-0.508
-0.503
Dem-0.492
Token
Token ID198
Feature activation-1.64e-01

Pos logit contributions
 mercy+0.914
theless+0.866
 incap+0.824
ilty+0.800
ints+0.763
Neg logit contributions
virt-1.333
-1.237
-1.235
-1.218
Dem-1.178
TokenBC
Token ID2749
Feature activation+5.00e-03

Pos logit contributions
virt+0.419
+0.370
+0.365
+0.360
Dem+0.351
Neg logit contributions
 mercy-0.634
theless-0.628
 incap-0.602
ilty-0.584
 autonomy-0.571
Token:
Token ID25
Feature activation-2.07e-01

Pos logit contributions
 mercy+0.012
theless+0.011
 incap+0.011
ilty+0.010
 autonomy+0.010
Neg logit contributions
virt-0.020
-0.019
-0.019
-0.019
Dem-0.018
Token7
Token ID22
Feature activation-1.30e-01

Pos logit contributions
 mercy+2.532
theless+2.498
 incap+2.353
ilty+2.300
imposed+2.227
Neg logit contributions
virt-2.614
-2.409
-2.384
-2.368
Dem-2.284
Token Rams
Token ID16077
Feature activation-2.81e-01

Pos logit contributions
virt+0.399
+0.362
+0.357
+0.354
Dem+0.347
Neg logit contributions
 mercy-0.433
theless-0.429
 incap-0.409
ilty-0.393
 autonomy-0.383
Token lead
Token ID1085
Feature activation-1.78e+00

Pos logit contributions
virt+0.911
+0.829
+0.819
+0.813
Dem+0.802
Neg logit contributions
 mercy-0.887
theless-0.882
 incap-0.835
ilty-0.802
 autonomy-0.781
Token.
Token ID13
Feature activation-4.12e-01

Pos logit contributions
virt+5.710
+5.151
Dem+5.042
+5.035
+5.009
Neg logit contributions
 mercy-5.436
theless-5.399
 incap-5.129
ilty-4.885
 communion-4.777
Token Fil
Token ID7066
Feature activation-8.92e-02

Pos logit contributions
virt+1.362
+1.242
+1.227
+1.216
Dem+1.201
Neg logit contributions
 mercy-1.281
theless-1.270
 incap-1.207
ilty-1.158
 autonomy-1.126
Tokench
Token ID354
Feature activation-1.45e+00

Pos logit contributions
virt+0.302
+0.272
+0.270
+0.267
Dem+0.265
Neg logit contributions
 mercy-0.272
theless-0.269
 incap-0.256
ilty-0.243
 autonomy-0.239
Tokenock
Token ID735
Feature activation+3.58e-01

Pos logit contributions
virt+5.052
+4.559
+4.494
+4.403
Dem+4.376
Neg logit contributions
 mercy-4.108
theless-4.026
 incap-3.961
 communion-3.662
ilty-3.653
Token remained
Token ID6150
Feature activation+2.28e-02

Pos logit contributions
 mercy+1.015
theless+1.000
 incap+0.942
ilty+0.901
 autonomy+0.873
Neg logit contributions
virt-1.285
-1.183
-1.170
-1.164
Dem-1.143
Token in
Token ID287
Feature activation+2.82e-01

Pos logit contributions
 mercy+0.068
theless+0.067
 incap+0.064
ilty+0.061
 autonomy+0.059
Neg logit contributions
virt-0.078
-0.071
-0.071
-0.070
Dem-0.069
Token for
Token ID329
Feature activation+7.87e-01

Pos logit contributions
 mercy+0.814
theless+0.791
 incap+0.757
ilty+0.717
 communion+0.704
Neg logit contributions
virt-1.000
-0.920
-0.915
-0.908
Dem-0.888
Token B
Token ID347
Feature activation+5.73e-01

Pos logit contributions
 mercy+2.383
theless+2.294
 incap+2.194
ilty+2.071
 autonomy+2.060
Neg logit contributions
virt-2.685
-2.471
-2.451
-2.435
Dem-2.363
Token company
Token ID1664
Feature activation+5.14e-01

Pos logit contributions
virt+1.156
+1.049
+1.037
+1.028
Dem+1.015
Neg logit contributions
 mercy-1.103
theless-1.101
 incap-1.037
ilty-0.998
 autonomy-0.967
Token in
Token ID287
Feature activation-2.71e-01

Pos logit contributions
 mercy+1.484
theless+1.442
 incap+1.386
ilty+1.327
 autonomy+1.294
Neg logit contributions
virt-1.800
-1.659
-1.644
-1.627
Dem-1.593
Token Houston
Token ID6995
Feature activation+6.82e-01

Pos logit contributions
virt+0.751
+0.671
+0.661
+0.654
Dem+0.643
Neg logit contributions
 mercy-0.988
theless-0.981
 incap-0.936
ilty-0.903
 autonomy-0.882
Token,
Token ID11
Feature activation-4.46e-01

Pos logit contributions
 mercy+1.982
theless+1.933
 incap+1.850
ilty+1.774
 autonomy+1.728
Neg logit contributions
virt-2.377
-2.183
-2.159
-2.143
Dem-2.093
Token as
Token ID355
Feature activation-4.49e-01

Pos logit contributions
virt+1.482
+1.350
+1.333
+1.322
Dem+1.306
Neg logit contributions
 mercy-1.378
theless-1.374
 incap-1.294
ilty-1.240
 autonomy-1.205
Token well
Token ID880
Feature activation-1.62e+00

Pos logit contributions
virt+1.450
+1.313
+1.297
+1.286
Dem+1.273
Neg logit contributions
 mercy-1.430
theless-1.424
 incap-1.345
ilty-1.300
 communion-1.253
Token as
Token ID355
Feature activation-4.07e-01

Pos logit contributions
virt+5.898
+5.419
+5.392
Dem+5.381
+5.311
Neg logit contributions
theless-4.275
 mercy-4.188
 incap-3.914
ilty-3.732
etheless-3.629
Token Qualcomm
Token ID32903
Feature activation+2.32e-01

Pos logit contributions
virt+1.400
+1.279
+1.266
+1.255
Dem+1.238
Neg logit contributions
 mercy-1.209
theless-1.197
 incap-1.131
ilty-1.085
 autonomy-1.049
Token,
Token ID11
Feature activation-1.49e-01

Pos logit contributions
 mercy+0.657
theless+0.647
 incap+0.612
ilty+0.585
 autonomy+0.568
Neg logit contributions
virt-0.831
-0.765
-0.757
-0.752
Dem-0.738
Token the
Token ID262
Feature activation-1.59e-01

Pos logit contributions
virt+0.499
+0.455
+0.450
+0.446
Dem+0.439
Neg logit contributions
 mercy-0.460
theless-0.455
 incap-0.431
ilty-0.413
 autonomy-0.402
Token San
Token ID2986
Feature activation-2.41e-01

Pos logit contributions
virt+0.482
+0.437
+0.431
+0.427
Dem+0.419
Neg logit contributions
 mercy-0.541
theless-0.534
 incap-0.510
ilty-0.490
 autonomy-0.481
Token Russian
Token ID3394
Feature activation+7.73e-01

Pos logit contributions
virt+0.079
+0.072
+0.071
+0.070
Dem+0.069
Neg logit contributions
 mercy-0.079
theless-0.078
 incap-0.074
ilty-0.071
 autonomy-0.069
Token-
Token ID12
Feature activation-3.86e-03

Pos logit contributions
 mercy+2.328
theless+2.261
 incap+2.147
ilty+2.076
 autonomy+2.026
Neg logit contributions
virt-2.634
-2.435
-2.420
-2.362
Dem-2.332
TokenUk
Token ID28425
Feature activation+1.04e-01

Pos logit contributions
virt+0.011
+0.010
+0.010
+0.010
Dem+0.010
Neg logit contributions
 mercy-0.013
theless-0.013
 incap-0.013
ilty-0.012
 autonomy-0.012
Tokenrain
Token ID3201
Feature activation-1.56e-01

Pos logit contributions
 mercy+0.363
theless+0.359
 incap+0.342
ilty+0.331
 autonomy+0.323
Neg logit contributions
virt-0.302
-0.273
-0.270
-0.266
Dem-0.260
Tokenian
Token ID666
Feature activation+4.66e-01

Pos logit contributions
virt+0.579
+0.534
+0.526
+0.525
Dem+0.517
Neg logit contributions
 mercy-0.425
theless-0.420
 incap-0.395
ilty-0.376
 autonomy-0.363
Token border
Token ID4865
Feature activation-1.48e+00

Pos logit contributions
 mercy+1.421
theless+1.398
 incap+1.323
ilty+1.274
 autonomy+1.247
Neg logit contributions
virt-1.569
-1.441
-1.432
-1.408
Dem-1.381
Token,
Token ID11
Feature activation+1.65e-01

Pos logit contributions
virt+4.797
+4.435
+4.335
+4.293
Dem+4.286
Neg logit contributions
theless-4.517
 mercy-4.476
 incap-4.226
ilty-4.039
 autonomy-3.909
Token says
Token ID1139
Feature activation+3.42e-01

Pos logit contributions
 mercy+0.490
theless+0.483
 incap+0.457
ilty+0.438
 communion+0.426
Neg logit contributions
virt-0.570
-0.521
-0.516
-0.515
Dem-0.503
Token the
Token ID262
Feature activation+6.53e-02

Pos logit contributions
 mercy+1.019
theless+1.004
 incap+0.953
ilty+0.910
 autonomy+0.888
Neg logit contributions
virt-1.179
-1.081
-1.070
-1.060
Dem-1.041
Token underlying
Token ID10238
Feature activation-7.49e-01

Pos logit contributions
 mercy+0.190
theless+0.188
 incap+0.178
ilty+0.170
 autonomy+0.165
Neg logit contributions
virt-0.229
-0.210
-0.208
-0.206
Dem-0.203
Token cause
Token ID2728
Feature activation-3.72e-01

Pos logit contributions
virt+2.555
+2.335
+2.298
+2.281
Dem+2.267
Neg logit contributions
theless-2.219
 mercy-2.204
 incap-2.046
ilty-2.008
etheless-1.927
Token 4
Token ID604
Feature activation-5.72e-01

Pos logit contributions
virt+1.550
+1.399
+1.384
+1.367
Dem+1.360
Neg logit contributions
 mercy-1.666
theless-1.647
 incap-1.570
ilty-1.506
 communion-1.470
Tokenms
Token ID907
Feature activation-5.26e-01

Pos logit contributions
virt+1.928
+1.752
+1.736
+1.710
Dem+1.705
Neg logit contributions
 mercy-1.745
theless-1.739
 incap-1.644
ilty-1.560
 autonomy-1.528
Token G
Token ID402
Feature activation-4.41e-01

Pos logit contributions
virt+1.289
+1.129
+1.116
+1.097
Dem+1.089
Neg logit contributions
 mercy-2.072
theless-2.059
 incap-1.981
ilty-1.909
 communion-1.866
Token2
Token ID17
Feature activation-1.05e+00

Pos logit contributions
virt+1.559
+1.418
+1.398
+1.390
Dem+1.385
Neg logit contributions
theless-1.295
 mercy-1.281
 incap-1.208
ilty-1.141
etheless-1.107
TokenG
Token ID38
Feature activation-1.46e-01

Pos logit contributions
virt+2.931
+2.617
+2.590
+2.560
Dem+2.544
Neg logit contributions
theless-3.836
 mercy-3.833
 incap-3.656
ilty-3.507
etheless-3.443
Token AU
Token ID27548
Feature activation-2.10e+00

Pos logit contributions
virt+0.551
+0.508
+0.503
+0.500
Dem+0.493
Neg logit contributions
 mercy-0.383
theless-0.381
 incap-0.355
ilty-0.339
 communion-0.324
Token Opt
Token ID13123
Feature activation-1.12e+00

Pos logit contributions
virt+4.378
+3.790
+3.747
+3.647
Dem+3.622
Neg logit contributions
theless-8.984
 mercy-8.714
etheless-8.398
 incap-8.368
ciating-8.127
Tokenronics
Token ID20844
Feature activation-5.55e-01

Pos logit contributions
virt+2.979
+2.642
+2.597
+2.563
Dem+2.531
Neg logit contributions
 mercy-4.216
theless-4.183
 incap-3.995
ilty-3.830
 autonomy-3.772
Token AH
Token ID28159
Feature activation-7.48e-01

Pos logit contributions
virt+1.961
+1.789
+1.778
+1.769
Dem+1.743
Neg logit contributions
 mercy-1.591
theless-1.577
 incap-1.481
ilty-1.425
 communion-1.371
TokenVA
Token ID11731
Feature activation-2.79e-01

Pos logit contributions
virt+2.110
+1.896
+1.866
+1.847
Dem+1.818
Neg logit contributions
 mercy-2.696
theless-2.693
 incap-2.567
ilty-2.465
 autonomy-2.417
Token (
Token ID357
Feature activation-3.36e-01

Pos logit contributions
virt+0.955
+0.874
+0.865
+0.858
Dem+0.846
Neg logit contributions
 mercy-0.835
theless-0.828
 incap-0.781
ilty-0.748
 autonomy-0.727
Token to
Token ID284
Feature activation-1.20e-01

Pos logit contributions
virt+0.629
+0.578
+0.572
+0.568
Dem+0.560
Neg logit contributions
 mercy-0.476
theless-0.473
 incap-0.442
ilty-0.424
 communion-0.410
Token the
Token ID262
Feature activation+4.92e-02

Pos logit contributions
virt+0.412
+0.377
+0.373
+0.370
Dem+0.364
Neg logit contributions
 mercy-0.361
theless-0.357
 incap-0.337
ilty-0.324
 communion-0.314
Token government
Token ID1230
Feature activation-4.23e-01

Pos logit contributions
 mercy+0.143
theless+0.141
 incap+0.134
ilty+0.128
 autonomy+0.124
Neg logit contributions
virt-0.173
-0.159
-0.157
-0.156
Dem-0.153
Token's
Token ID338
Feature activation-4.41e-01

Pos logit contributions
virt+1.576
+1.451
+1.435
+1.422
Dem+1.404
Neg logit contributions
theless-1.120
 mercy-1.117
 incap-1.036
ilty-1.005
 communion-0.964
Token Startup
Token ID40472
Feature activation-2.91e-01

Pos logit contributions
virt+1.528
+1.400
+1.387
+1.373
Dem+1.350
Neg logit contributions
theless-1.286
 mercy-1.284
 incap-1.198
ilty-1.165
 communion-1.123
Token India
Token ID3794
Feature activation-1.65e+00

Pos logit contributions
virt+0.915
+0.831
+0.821
+0.814
Dem+0.799
Neg logit contributions
 mercy-0.951
theless-0.940
 incap-0.895
ilty-0.860
 autonomy-0.839
Token programme
Token ID11383
Feature activation-2.08e-01

Pos logit contributions
virt+5.130
+4.708
+4.592
+4.573
Dem+4.523
Neg logit contributions
theless-5.259
 mercy-5.081
 incap-4.802
ilty-4.747
etheless-4.644
Token is
Token ID318
Feature activation-4.44e-02

Pos logit contributions
virt+0.688
+0.627
+0.617
+0.614
Dem+0.605
Neg logit contributions
 mercy-0.647
theless-0.642
 incap-0.607
ilty-0.583
 communion-0.568
Token deb
Token ID1915
Feature activation-3.37e-01

Pos logit contributions
virt+0.144
+0.130
+0.128
+0.128
Dem+0.126
Neg logit contributions
 mercy-0.140
theless-0.139
 incap-0.132
ilty-0.127
 communion-0.123
Tokenatable
Token ID21156
Feature activation+3.61e-01

Pos logit contributions
virt+1.121
+1.018
+1.007
+0.999
Dem+0.982
Neg logit contributions
 mercy-1.048
theless-1.038
 incap-0.981
ilty-0.932
 autonomy-0.918
Token,
Token ID11
Feature activation+7.42e-02

Pos logit contributions
 mercy+1.041
theless+1.026
 incap+0.970
ilty+0.926
 autonomy+0.903
Neg logit contributions
virt-1.274
-1.171
-1.161
-1.151
Dem-1.128