TOP ACTIVATIONS
MAX = 1.368

Y ards : 304 R ushes : 46
Y ards : 307 R ushes : 33
Y ards : 316 R ushes : 31
Ol we us , D . ( 1986 ).
., Sal m iv alli , C ., Vo et en
е к о т о р ы е
M ., Sch ol te , R ., Sal m iv
nn [ Keys [ search Space ][ [ i ]] ],
P el leg r ini , A . D ., Long
intervention research . School Psychology Quarterly , 23 , 26 -
ushes : 31 Opp onent : New Mexico
el leg r ini , A . D ., Long ,
. Draw String ($ art ext [ $ i - 1
ISI My ers , D . ( 2007 ).
. D ., Bart ini , M . ( 2001 ).
а р а к т е р н о�
., G ue ld ner , B . A ., Ross
[ { Keys [ search Space ][ [ i ] ],
Crossref Pe eters , M ., C ill essen
о� т в е р д и т ь

MIN ACTIVATIONS
MIN = -1.327

ed . com and V odka O ' Clock Podcast .
Phone List Ctrl "> < ul > < li ng -
Font Style ] :: B old ) $ Br ush 2
heat sinks . The two form - in - one heat
of this year 's event . A few days
xx brand ? f ref = ts <|endoftext|> A month before
Program . At iku Ab ub ak ar was the first
the cl umps with Val le jo earth pigment and took
PF Bentley / Civil Beat Good win says
) Mem c py In Memory 4 66 284 4 64
prison MORE and Hillary Clinton Hillary Diane Rodham Clinton Sanders :
. The new vehicle E ct o - 1 was also
. Author of The Far rah W ether s Mysteries ,
a home for Sword Base . Editor 's Note
on Facebook Share this comment on Twitter NY T
- pre y - cry stal - emb ry o -
Which gives us A B OM B at 11 a
Democratic presidential nominee Hillary Clinton Hillary Diane Rodham Clinton Sanders :
everything , how can it be expanding ? Like what
name "> span > < p ng - bind =" phone

INTERVAL 0.695 - 1.368
CONTAINS 0.146%

Date : Nov . 24 , 1944 Tre
When we get the down payment we re
of Youth and Ad olesc ence , 39 , 10 41
$ 4 . 2 million divorce settlement from Musk in August
Mexico . Que ens land s tropical

INTERVAL 0.021 - 0.695
CONTAINS 45.037%

home so early . " I played soccer in
in an press release . Almost half of Hillary
). More refined calculations later indicated a miss of only 7
were wed to a member of the same - sex in
ing R FI source : " In the event

INTERVAL -0.653 - 0.021
CONTAINS 54.636%

Krist of God da ert ( Ag 2 r - La
, and it was the main thing they judged the C
Some people took issue with one , two , several or
John Gavin ), but Hitchcock has other plans . When she
the bitter end , all the way up the convention ,

INTERVAL -1.327 - -0.653
CONTAINS 0.180%

In some of them even minute details like nails , muscles
te legraph . co . uk / business / analysis /
a home for Sword Base . Editor 's Note
Dan Fry d man of web firm In igo . A
of pretending . R USE LL : Speaking of
TokenY
Token ID56
Feature activation+4.41e-01

Pos logit contributions
RP+1.583
leon+1.507
Salt+1.464
wark+1.455
Meat+1.431
Neg logit contributions
 opera-4.095
reads-3.768
 misleading-3.648
 glim-3.600
 wra-3.572
Tokenards
Token ID1371
Feature activation+4.00e-02

Pos logit contributions
RP+2.421
wark+2.347
leon+2.335
chev+2.265
Salt+2.260
Neg logit contributions
 opera-3.068
reads-2.713
 misleading-2.648
 glim-2.611
 wra-2.568
Token:
Token ID25
Feature activation-1.37e-01

Pos logit contributions
RP+0.205
leon+0.202
wark+0.199
chev+0.194
Salt+0.193
Neg logit contributions
 opera-0.292
reads-0.271
 misleading-0.255
 wra-0.245
=~-0.244
Token 304
Token ID31672
Feature activation+1.77e-01

Pos logit contributions
 opera+0.872
reads+0.801
 misleading+0.744
 wra+0.712
=~+0.706
Neg logit contributions
RP-0.830
leon-0.820
wark-0.811
chev-0.792
Salt-0.788
Token
Token ID198
Feature activation+7.87e-01

Pos logit contributions
RP+1.044
leon+1.028
wark+1.018
chev+0.993
Salt+0.989
Neg logit contributions
 opera-1.161
reads-1.064
 misleading-0.993
 glim-0.955
 wra-0.952
Token
Token ID198
Feature activation+1.37e+00

Pos logit contributions
RP+4.077
leon+4.004
wark+3.849
Salt+3.817
chev+3.759
Neg logit contributions
 opera-5.656
 glim-5.249
reads-5.108
 misleading-4.885
 wra-4.754
TokenR
Token ID49
Feature activation+2.33e-01

Pos logit contributions
RP+4.716
Salt+4.070
Meat+3.938
leon+3.914
Token+3.826
Neg logit contributions
 opera-12.823
 glim-11.870
reads-11.265
 misleading-11.257
 asbestos-11.248
Tokenushes
Token ID17237
Feature activation+4.62e-01

Pos logit contributions
RP+1.677
wark+1.615
leon+1.603
chev+1.578
Salt+1.564
Neg logit contributions
 opera-1.274
reads-1.076
 misleading-1.042
 glim-1.014
 asbestos-0.988
Token:
Token ID25
Feature activation+2.59e-01

Pos logit contributions
RP+2.164
leon+2.130
wark+2.084
chev+2.020
Salt+2.016
Neg logit contributions
 opera-3.580
reads-3.309
 glim-3.176
 misleading-3.128
 wra-3.036
Token 46
Token ID6337
Feature activation+5.16e-01

Pos logit contributions
RP+1.476
leon+1.460
wark+1.438
Salt+1.402
chev+1.397
Neg logit contributions
 opera-1.730
reads-1.599
 misleading-1.489
 glim-1.445
 wra-1.429
Token
Token ID198
Feature activation+5.50e-01

Pos logit contributions
RP+2.830
leon+2.750
wark+2.721
Salt+2.677
chev+2.655
Neg logit contributions
 opera-3.542
reads-3.239
 glim-3.088
 misleading-3.037
 wra-2.934
TokenY
Token ID56
Feature activation+5.17e-01

Pos logit contributions
RP+1.505
leon+1.425
Salt+1.389
wark+1.380
Meat+1.362
Neg logit contributions
 opera-3.842
reads-3.524
 misleading-3.416
 glim-3.364
 wra-3.342
Tokenards
Token ID1371
Feature activation+5.32e-02

Pos logit contributions
RP+2.786
wark+2.699
leon+2.679
chev+2.603
Salt+2.593
Neg logit contributions
 opera-3.631
reads-3.191
 misleading-3.135
 glim-3.109
 wra-3.041
Token:
Token ID25
Feature activation-3.20e-01

Pos logit contributions
RP+0.272
leon+0.268
wark+0.265
chev+0.257
Salt+0.256
Neg logit contributions
 opera-0.389
reads-0.362
 misleading-0.340
 wra-0.327
=~-0.325
Token 307
Token ID38369
Feature activation+2.70e-01

Pos logit contributions
 opera+1.993
reads+1.822
 misleading+1.695
 wra+1.618
=~+1.606
Neg logit contributions
RP-1.973
leon-1.953
wark-1.936
chev-1.895
Salt-1.877
Token
Token ID198
Feature activation+7.74e-01

Pos logit contributions
RP+1.558
leon+1.526
wark+1.515
chev+1.474
Salt+1.471
Neg logit contributions
 opera-1.799
reads-1.652
 misleading-1.542
 glim-1.510
 wra-1.483
Token
Token ID198
Feature activation+1.34e+00

Pos logit contributions
RP+4.038
leon+3.960
wark+3.819
Salt+3.782
chev+3.725
Neg logit contributions
 opera-5.538
 glim-5.108
reads-4.995
 misleading-4.780
 wra-4.647
TokenR
Token ID49
Feature activation+2.74e-01

Pos logit contributions
RP+4.661
Salt+4.042
Meat+3.938
leon+3.874
Token+3.809
Neg logit contributions
 opera-12.553
 glim-11.621
 asbestos-11.035
 misleading-11.010
reads-11.002
Tokenushes
Token ID17237
Feature activation+4.20e-01

Pos logit contributions
RP+1.952
wark+1.872
leon+1.855
chev+1.825
ptic+1.814
Neg logit contributions
 opera-1.517
reads-1.271
 misleading-1.245
 glim-1.221
 asbestos-1.188
Token:
Token ID25
Feature activation+9.99e-02

Pos logit contributions
RP+1.992
leon+1.961
wark+1.921
chev+1.862
Salt+1.856
Neg logit contributions
 opera-3.222
reads-2.982
 glim-2.833
 misleading-2.817
 wra-2.729
Token 33
Token ID4747
Feature activation+5.48e-01

Pos logit contributions
RP+0.579
leon+0.572
wark+0.565
chev+0.550
Salt+0.549
Neg logit contributions
 opera-0.661
reads-0.610
 misleading-0.568
 wra-0.544
 glim-0.541
Token
Token ID198
Feature activation+5.19e-01

Pos logit contributions
RP+3.004
leon+2.921
wark+2.884
Salt+2.824
chev+2.823
Neg logit contributions
 opera-3.766
reads-3.443
 glim-3.289
 misleading-3.243
 asbestos-3.121
TokenY
Token ID56
Feature activation+6.23e-01

Pos logit contributions
RP+1.363
leon+1.313
wark+1.287
Salt+1.258
chev+1.241
Neg logit contributions
 opera-3.204
reads-2.972
 misleading-2.848
 wra-2.765
 asbestos-2.743
Tokenards
Token ID1371
Feature activation+3.06e-01

Pos logit contributions
RP+3.294
wark+3.182
leon+3.143
Salt+3.055
chev+3.044
Neg logit contributions
 opera-4.438
reads-3.880
 misleading-3.823
 glim-3.810
 wra-3.702
Token:
Token ID25
Feature activation-1.64e-01

Pos logit contributions
RP+1.479
leon+1.450
wark+1.427
chev+1.385
Salt+1.377
Neg logit contributions
 opera-2.333
reads-2.157
 misleading-2.037
 glim-2.025
 wra-1.976
Token 316
Token ID34131
Feature activation+1.78e-01

Pos logit contributions
 opera+1.045
reads+0.960
 misleading+0.892
 wra+0.853
=~+0.846
Neg logit contributions
RP-0.994
leon-0.983
wark-0.972
chev-0.949
Salt-0.945
Token
Token ID198
Feature activation+8.70e-01

Pos logit contributions
RP+1.018
wark+0.990
leon+0.988
chev+0.960
Salt+0.958
Neg logit contributions
 opera-1.202
reads-1.102
 misleading-1.027
 glim-1.008
 wra-0.985
Token
Token ID198
Feature activation+1.33e+00

Pos logit contributions
RP+4.452
leon+4.357
wark+4.198
Salt+4.162
chev+4.074
Neg logit contributions
 opera-6.291
 glim-5.887
reads-5.675
 misleading-5.434
 wra-5.299
TokenR
Token ID49
Feature activation+3.60e-01

Pos logit contributions
RP+4.711
Salt+4.120
Meat+3.942
leon+3.910
Token+3.850
Neg logit contributions
 opera-12.367
 glim-11.471
reads-10.925
 asbestos-10.895
 misleading-10.873
Tokenushes
Token ID17237
Feature activation+5.71e-01

Pos logit contributions
RP+2.542
wark+2.417
leon+2.383
ptic+2.347
chev+2.344
Neg logit contributions
 opera-2.038
reads-1.693
 misleading-1.682
 glim-1.665
 asbestos-1.618
Token:
Token ID25
Feature activation+1.29e-01

Pos logit contributions
RP+2.611
leon+2.560
wark+2.500
Salt+2.420
chev+2.418
Neg logit contributions
 opera-4.463
reads-4.124
 glim-4.019
 misleading-3.907
 wra-3.795
Token 31
Token ID3261
Feature activation+5.16e-01

Pos logit contributions
RP+0.747
leon+0.737
wark+0.729
Salt+0.709
chev+0.708
Neg logit contributions
 opera-0.860
reads-0.794
 misleading-0.739
 glim-0.711
 wra-0.709
Token
Token ID198
Feature activation+5.34e-01

Pos logit contributions
RP+2.746
wark+2.632
leon+2.631
Salt+2.587
chev+2.574
Neg logit contributions
 opera-3.601
reads-3.298
 glim-3.170
 misleading-3.102
 asbestos-2.997
Token
Token ID198
Feature activation+4.28e-03

Pos logit contributions
RP+0.672
leon+0.663
wark+0.654
chev+0.639
Salt+0.636
Neg logit contributions
 opera-0.773
reads-0.712
 misleading-0.665
 wra-0.638
=~-0.631
Token
Token ID198
Feature activation+1.35e-01

Pos logit contributions
RP+0.026
leon+0.026
wark+0.026
chev+0.025
Salt+0.025
Neg logit contributions
 opera-0.027
reads-0.025
 misleading-0.023
 wra-0.022
=~-0.022
TokenOl
Token ID30098
Feature activation-1.32e-01

Pos logit contributions
RP+0.669
leon+0.653
wark+0.639
Salt+0.630
Meat+0.620
Neg logit contributions
 opera-1.017
reads-0.944
 misleading-0.899
 wra-0.868
 glim-0.864
Tokenwe
Token ID732
Feature activation+2.11e-01

Pos logit contributions
 opera+0.979
reads+0.906
 misleading+0.861
 wra+0.822
=~+0.818
Neg logit contributions
RP-0.667
leon-0.663
wark-0.661
chev-0.637
Salt-0.628
Tokenus
Token ID385
Feature activation+6.33e-01

Pos logit contributions
RP+0.742
leon+0.740
wark+0.725
chev+0.697
Salt+0.684
Neg logit contributions
 opera-1.882
reads-1.754
 misleading-1.691
 glim-1.657
 wra-1.638
Token,
Token ID11
Feature activation+1.31e+00

Pos logit contributions
RP+3.220
leon+3.192
wark+3.099
chev+3.044
Salt+2.977
Neg logit contributions
 opera-4.570
reads-4.214
 glim-4.122
 misleading-4.038
 wra-3.872
Token D
Token ID360
Feature activation+7.68e-01

Pos logit contributions
leon+5.633
RP+5.615
wark+5.396
Salt+5.262
chev+5.156
Neg logit contributions
 opera-9.507
 glim-9.154
reads-8.949
 misleading-8.455
=~-8.433
Token.
Token ID13
Feature activation+4.48e-01

Pos logit contributions
RP+3.193
wark+3.136
leon+3.118
Salt+2.975
chev+2.959
Neg logit contributions
 opera-6.224
reads-5.787
 misleading-5.508
 wra-5.334
=~-5.295
Token (
Token ID357
Feature activation+1.87e-01

Pos logit contributions
RP+2.241
wark+2.207
leon+2.178
chev+2.078
Salt+2.070
Neg logit contributions
 opera-3.295
reads-3.082
 misleading-2.870
 glim-2.805
=~-2.801
Token 1986
Token ID12113
Feature activation+5.96e-01

Pos logit contributions
RP+1.068
leon+1.049
wark+1.039
Salt+1.017
chev+1.003
Neg logit contributions
 opera-1.261
reads-1.170
 misleading-1.085
 glim-1.052
 wra-1.045
Token ).
Token ID6739
Feature activation+1.06e-04

Pos logit contributions
RP+2.034
wark+1.994
leon+1.982
chev+1.874
Salt+1.863
Neg logit contributions
 opera-5.216
reads-4.975
 misleading-4.716
=~-4.610
 glim-4.559
Token.,
Token ID1539
Feature activation-8.56e-04

Pos logit contributions
RP+2.993
leon+2.926
wark+2.899
Salt+2.779
chev+2.764
Neg logit contributions
 opera-5.362
reads-5.002
 misleading-4.729
 wra-4.579
=~-4.561
Token Sal
Token ID4849
Feature activation-8.51e-03

Pos logit contributions
 opera+0.006
reads+0.005
 misleading+0.005
 wra+0.005
 glim+0.005
Neg logit contributions
RP-0.005
leon-0.005
wark-0.005
chev-0.005
Salt-0.005
Tokenm
Token ID76
Feature activation+3.71e-01

Pos logit contributions
 opera+0.066
reads+0.061
 misleading+0.058
 wra+0.056
=~+0.056
Neg logit contributions
leon-0.040
RP-0.040
wark-0.039
chev-0.038
Salt-0.037
Tokeniv
Token ID452
Feature activation+3.05e-01

Pos logit contributions
wark+1.218
RP+1.208
leon+1.208
chev+1.164
Salt+1.104
Neg logit contributions
 opera-3.372
reads-3.140
 misleading-3.036
 glim-2.953
=~-2.941
Tokenalli
Token ID36546
Feature activation+7.36e-01

Pos logit contributions
leon+1.008
RP+1.005
wark+0.987
chev+0.944
Salt+0.918
Neg logit contributions
 opera-2.777
reads-2.596
 misleading-2.498
 glim-2.440
 wra-2.413
Token,
Token ID11
Feature activation+1.30e+00

Pos logit contributions
RP+3.639
leon+3.596
wark+3.500
chev+3.439
Salt+3.386
Neg logit contributions
 opera-5.333
reads-4.960
 glim-4.883
 misleading-4.692
=~-4.528
Token C
Token ID327
Feature activation+7.90e-01

Pos logit contributions
leon+3.852
RP+3.766
chev+3.710
wark+3.626
Salt+3.468
Neg logit contributions
 opera-11.200
 glim-10.836
reads-10.754
=~-10.197
 misleading-10.146
Token.,
Token ID1539
Feature activation+1.55e-01

Pos logit contributions
RP+3.117
leon+3.057
wark+3.048
Salt+2.881
chev+2.876
Neg logit contributions
 opera-6.556
reads-6.136
 misleading-5.845
 wra-5.620
=~-5.615
Token Vo
Token ID20687
Feature activation+7.05e-02

Pos logit contributions
RP+0.852
leon+0.846
wark+0.836
Salt+0.813
chev+0.808
Neg logit contributions
 opera-1.072
reads-0.990
 misleading-0.934
 glim-0.900
 wra-0.892
Tokenet
Token ID316
Feature activation+4.14e-01

Pos logit contributions
RP+0.406
leon+0.403
wark+0.398
chev+0.388
Salt+0.381
Neg logit contributions
 opera-0.473
reads-0.431
 misleading-0.411
=~-0.390
 glim-0.390
Tokenen
Token ID268
Feature activation+1.57e-01

Pos logit contributions
leon+2.109
RP+2.097
wark+2.086
chev+2.059
Salt+1.973
Neg logit contributions
 opera-2.980
reads-2.765
 misleading-2.642
=~-2.526
 glim-2.525
Token
Token ID121
Feature activation-1.92e-01

Pos logit contributions
 opera+0.320
reads+0.299
 misleading+0.281
 wra+0.272
=~+0.271
Neg logit contributions
RP-0.198
leon-0.195
wark-0.192
chev-0.187
Salt-0.185
Tokenе
Token ID16843
Feature activation+1.68e-01

Pos logit contributions
 opera+1.331
reads+1.232
 misleading+1.151
 wra+1.106
=~+1.098
Neg logit contributions
RP-1.055
leon-1.041
wark-1.029
chev-1.003
Salt-0.997
Tokenк
Token ID31583
Feature activation+1.29e-02

Pos logit contributions
RP+0.725
leon+0.714
wark+0.701
chev+0.681
Salt+0.669
Neg logit contributions
 opera-1.355
reads-1.280
 misleading-1.197
 wra-1.164
=~-1.161
Tokenо
Token ID15166
Feature activation+6.01e-01

Pos logit contributions
RP+0.059
leon+0.059
wark+0.058
chev+0.056
Salt+0.055
Neg logit contributions
 opera-0.101
reads-0.095
 misleading-0.089
 wra-0.086
=~-0.086
Tokenт
Token ID20375
Feature activation+2.43e-01

Pos logit contributions
RP+1.666
leon+1.585
wark+1.577
chev+1.483
Salt+1.448
Neg logit contributions
 opera-5.685
reads-5.451
 misleading-5.126
 wra-5.023
=~-4.994
Tokenо
Token ID15166
Feature activation+1.29e+00

Pos logit contributions
RP+1.022
leon+1.003
wark+0.983
chev+0.957
Salt+0.935
Neg logit contributions
 opera-1.988
reads-1.884
 misleading-1.763
 wra-1.710
=~-1.704
Tokenр
Token ID21169
Feature activation-3.09e-02

Pos logit contributions
RP+1.858
leon+1.407
wark+1.372
chev+1.271
ptic+1.245
Neg logit contributions
 opera-13.496
reads-12.957
 misleading-12.265
 wra-12.237
 galaxies-12.163
Tokenы
Token ID45035
Feature activation+7.90e-02

Pos logit contributions
 opera+0.239
reads+0.225
 misleading+0.211
 wra+0.204
=~+0.202
Neg logit contributions
RP-0.144
leon-0.141
wark-0.139
chev-0.136
Salt-0.134
Tokenе
Token ID16843
Feature activation+2.46e-01

Pos logit contributions
RP+0.326
leon+0.320
wark+0.316
chev+0.305
Salt+0.302
Neg logit contributions
 opera-0.654
reads-0.615
 misleading-0.581
 wra-0.563
=~-0.558
Token
Token ID12466
Feature activation-8.35e-02

Pos logit contributions
RP+1.115
leon+1.092
wark+1.075
chev+1.041
Salt+1.030
Neg logit contributions
 opera-1.920
reads-1.815
 misleading-1.695
 wra-1.646
 glim-1.636
Token
Token ID116
Feature activation-1.58e-01

Pos logit contributions
 opera+0.635
reads+0.592
 misleading+0.556
 wra+0.539
=~+0.534
Neg logit contributions
RP-0.403
leon-0.397
wark-0.392
chev-0.380
Salt-0.377
Token M
Token ID337
Feature activation+8.59e-01

Pos logit contributions
RP+4.274
leon+4.224
wark+4.074
Salt+3.933
chev+3.902
Neg logit contributions
 opera-6.972
reads-6.532
 glim-6.422
 misleading-6.153
 wra-6.000
Token.,
Token ID1539
Feature activation-4.06e-03

Pos logit contributions
RP+3.516
leon+3.445
wark+3.433
chev+3.254
Salt+3.238
Neg logit contributions
 opera-7.007
reads-6.519
 misleading-6.230
 wra-6.023
=~-5.987
Token Sch
Token ID3059
Feature activation-1.38e-01

Pos logit contributions
 opera+0.028
reads+0.026
 misleading+0.024
 wra+0.023
 glim+0.023
Neg logit contributions
RP-0.022
leon-0.022
wark-0.022
chev-0.021
Salt-0.021
Tokenol
Token ID349
Feature activation-7.11e-03

Pos logit contributions
 opera+1.000
reads+0.927
 misleading+0.873
 wra+0.839
=~+0.833
Neg logit contributions
RP-0.713
leon-0.705
wark-0.697
chev-0.677
Salt-0.671
Tokente
Token ID660
Feature activation+7.37e-01

Pos logit contributions
 opera+0.044
reads+0.040
 misleading+0.037
 wra+0.035
=~+0.035
Neg logit contributions
RP-0.045
leon-0.045
wark-0.044
chev-0.043
Salt-0.043
Token,
Token ID11
Feature activation+1.29e+00

Pos logit contributions
leon+3.735
RP+3.714
wark+3.582
chev+3.520
Salt+3.447
Neg logit contributions
 opera-5.323
reads-4.895
 glim-4.822
 misleading-4.731
 wra-4.510
Token R
Token ID371
Feature activation+6.78e-01

Pos logit contributions
RP+5.108
leon+4.958
wark+4.744
Salt+4.399
chev+4.271
Neg logit contributions
 opera-10.095
reads-9.454
 glim-9.384
 misleading-8.970
=~-8.904
Token.,
Token ID1539
Feature activation-8.56e-04

Pos logit contributions
RP+2.993
leon+2.926
wark+2.899
Salt+2.779
chev+2.764
Neg logit contributions
 opera-5.362
reads-5.002
 misleading-4.729
 wra-4.579
=~-4.561
Token Sal
Token ID4849
Feature activation-8.51e-03

Pos logit contributions
 opera+0.006
reads+0.005
 misleading+0.005
 wra+0.005
 glim+0.005
Neg logit contributions
RP-0.005
leon-0.005
wark-0.005
chev-0.005
Salt-0.005
Tokenm
Token ID76
Feature activation+3.71e-01

Pos logit contributions
 opera+0.066
reads+0.061
 misleading+0.058
 wra+0.056
=~+0.056
Neg logit contributions
leon-0.040
RP-0.040
wark-0.039
chev-0.038
Salt-0.037
Tokeniv
Token ID452
Feature activation+3.05e-01

Pos logit contributions
wark+1.218
RP+1.208
leon+1.208
chev+1.164
Salt+1.104
Neg logit contributions
 opera-3.372
reads-3.140
 misleading-3.036
 glim-2.953
=~-2.941
Tokennn
Token ID20471
Feature activation+3.07e-01

Pos logit contributions
RP+1.498
leon+1.468
wark+1.440
Salt+1.425
chev+1.409
Neg logit contributions
 opera-2.082
reads-1.884
 misleading-1.808
 glim-1.754
 asbestos-1.736
Token[
Token ID58
Feature activation+1.83e-01

Pos logit contributions
RP+1.376
leon+1.343
wark+1.319
Salt+1.291
chev+1.284
Neg logit contributions
 opera-2.453
reads-2.260
 misleading-2.160
 glim-2.082
 wra-2.080
TokenKeys
Token ID40729
Feature activation+1.12e+00

Pos logit contributions
RP+0.961
leon+0.943
wark+0.924
Salt+0.911
chev+0.908
Neg logit contributions
 opera-1.319
reads-1.204
 misleading-1.147
 glim-1.106
 asbestos-1.100
Token[
Token ID58
Feature activation+4.79e-01

Pos logit contributions
RP+4.228
leon+4.048
Salt+3.926
Token+3.778
wark+3.758
Neg logit contributions
 opera-9.459
 glim-8.700
reads-8.476
 misleading-8.309
 asbestos-8.149
Tokensearch
Token ID12947
Feature activation+9.52e-01

Pos logit contributions
RP+2.030
leon+2.011
wark+1.957
chev+1.913
Salt+1.906
Neg logit contributions
 opera-3.902
reads-3.592
 misleading-3.465
 glim-3.371
 wra-3.364
TokenSpace
Token ID14106
Feature activation+1.28e+00

Pos logit contributions
RP+3.130
leon+2.911
Salt+2.899
Meat+2.820
Token+2.763
Neg logit contributions
 opera-8.786
reads-8.186
 glim-8.049
 misleading-7.943
 wra-7.854
Token][
Token ID7131
Feature activation+8.58e-01

Pos logit contributions
RP+4.271
leon+4.119
wark+3.851
Salt+3.810
Token+3.657
Neg logit contributions
 opera-10.955
reads-10.166
 misleading-9.825
 glim-9.734
 galaxies-9.557
Token[
Token ID58
Feature activation+4.76e-01

Pos logit contributions
RP+3.646
leon+3.592
Salt+3.485
chev+3.449
wark+3.446
Neg logit contributions
 opera-6.798
reads-6.147
 misleading-5.969
 glim-5.913
 asbestos-5.806
Tokeni
Token ID72
Feature activation+4.12e-01

Pos logit contributions
RP+2.480
leon+2.435
wark+2.385
Salt+2.347
chev+2.335
Neg logit contributions
 opera-3.416
reads-3.088
 misleading-2.965
 glim-2.892
 asbestos-2.865
Token]]
Token ID11907
Feature activation-2.06e-01

Pos logit contributions
RP+1.985
leon+1.948
wark+1.910
Salt+1.863
chev+1.850
Neg logit contributions
 opera-3.109
reads-2.871
 misleading-2.720
 glim-2.628
 wra-2.615
Token],
Token ID4357
Feature activation+8.50e-02

Pos logit contributions
 opera+1.670
reads+1.563
 misleading+1.478
 wra+1.429
=~+1.421
Neg logit contributions
RP-0.892
leon-0.879
wark-0.866
chev-0.837
Salt-0.830
TokenP
Token ID47
Feature activation+2.56e-01

Pos logit contributions
RP+1.298
leon+1.225
Salt+1.178
wark+1.171
Meat+1.143
Neg logit contributions
 opera-3.815
reads-3.567
 misleading-3.456
 wra-3.353
 glim-3.350
Tokenel
Token ID417
Feature activation-2.00e-01

Pos logit contributions
leon+1.456
RP+1.454
wark+1.425
chev+1.386
Salt+1.353
Neg logit contributions
 opera-1.740
reads-1.573
 misleading-1.529
 wra-1.446
 glim-1.438
Tokenleg
Token ID1455
Feature activation+9.80e-01

Pos logit contributions
 opera+1.877
reads+1.773
 misleading+1.691
 wra+1.642
=~+1.635
Neg logit contributions
RP-0.612
leon-0.600
wark-0.588
chev-0.560
Salt-0.552
Tokenr
Token ID81
Feature activation+2.82e-01

Pos logit contributions
RP+4.621
leon+4.491
wark+4.411
chev+4.375
Salt+4.114
Neg logit contributions
 opera-7.450
reads-6.746
 misleading-6.704
 glim-6.672
 asbestos-6.350
Tokenini
Token ID5362
Feature activation+7.76e-01

Pos logit contributions
leon+1.401
RP+1.399
wark+1.357
chev+1.331
Salt+1.308
Neg logit contributions
 opera-2.094
reads-1.936
 misleading-1.841
 glim-1.789
 wra-1.777
Token,
Token ID11
Feature activation+1.25e+00

Pos logit contributions
RP+3.827
leon+3.780
wark+3.616
chev+3.531
Salt+3.529
Neg logit contributions
 opera-5.668
 glim-5.302
reads-5.242
 misleading-5.010
 wra-4.856
Token A
Token ID317
Feature activation+1.23e+00

Pos logit contributions
RP+5.495
leon+5.337
wark+5.105
Salt+5.046
Token+4.916
Neg logit contributions
 opera-9.214
 glim-9.033
reads-8.734
 misleading-8.322
 wra-8.089
Token.
Token ID13
Feature activation+8.49e-01

Pos logit contributions
RP+4.975
leon+4.944
wark+4.908
Salt+4.666
chev+4.634
Neg logit contributions
 opera-9.717
reads-9.077
 misleading-8.593
=~-8.353
 wra-8.315
Token D
Token ID360
Feature activation+5.29e-01

Pos logit contributions
RP+4.442
wark+4.389
leon+4.341
chev+4.169
Salt+4.159
Neg logit contributions
 opera-5.884
reads-5.457
 misleading-5.051
=~-4.954
 wra-4.889
Token.,
Token ID1539
Feature activation+4.08e-02

Pos logit contributions
RP+1.631
leon+1.591
wark+1.582
chev+1.490
Salt+1.488
Neg logit contributions
 opera-4.910
reads-4.610
 misleading-4.414
 wra-4.295
=~-4.265
Token Long
Token ID5882
Feature activation-2.27e-01

Pos logit contributions
RP+0.207
leon+0.205
wark+0.202
chev+0.195
Salt+0.195
Neg logit contributions
 opera-0.299
reads-0.279
 misleading-0.262
 wra-0.252
 glim-0.251
Token intervention
Token ID9572
Feature activation+1.65e-01

Pos logit contributions
RP+0.611
leon+0.604
wark+0.599
chev+0.581
Salt+0.581
Neg logit contributions
 opera-0.618
reads-0.577
 misleading-0.523
=~-0.509
 wra-0.503
Token research
Token ID2267
Feature activation+1.36e-01

Pos logit contributions
RP+1.173
leon+1.146
wark+1.141
chev+1.120
Salt+1.112
Neg logit contributions
 opera-0.855
reads-0.792
 misleading-0.705
=~-0.682
 glim-0.673
Token.
Token ID764
Feature activation+6.47e-01

Pos logit contributions
RP+0.685
wark+0.662
leon+0.662
chev+0.642
Salt+0.635
Neg logit contributions
 opera-0.996
reads-0.936
 misleading-0.871
=~-0.846
 glim-0.844
Token School
Token ID3961
Feature activation+2.36e-01

Pos logit contributions
RP+3.025
leon+2.943
wark+2.893
Salt+2.797
Meat+2.774
Neg logit contributions
 opera-4.867
reads-4.555
 glim-4.311
 misleading-4.281
 wra-4.201
Token Psychology
Token ID18430
Feature activation+1.53e-01

Pos logit contributions
RP+1.259
leon+1.251
wark+1.240
Salt+1.201
chev+1.197
Neg logit contributions
 opera-1.645
reads-1.526
 misleading-1.424
 glim-1.392
=~-1.386
Token Quarterly
Token ID36082
Feature activation+1.25e+00

Pos logit contributions
RP+0.777
leon+0.755
wark+0.752
chev+0.725
Salt+0.718
Neg logit contributions
 opera-1.128
reads-1.044
 misleading-0.986
=~-0.952
 glim-0.951
Token,
Token ID11
Feature activation+8.54e-01

Pos logit contributions
RP+5.749
leon+5.392
wark+5.369
Salt+5.276
ptic+5.115
Neg logit contributions
 opera-9.108
reads-8.456
 glim-8.206
 misleading-8.070
 galaxies-7.706
Token 23
Token ID2242
Feature activation+8.03e-01

Pos logit contributions
RP+4.573
leon+4.417
wark+4.355
Salt+4.311
Meat+4.184
Neg logit contributions
 opera-5.756
reads-5.400
 glim-5.214
 misleading-4.992
=~-4.840
Token,
Token ID11
Feature activation+4.73e-01

Pos logit contributions
RP+3.779
leon+3.629
wark+3.586
Salt+3.518
Meat+3.432
Neg logit contributions
 opera-6.085
reads-5.654
 misleading-5.356
 glim-5.192
 galaxies-5.126
Token 26
Token ID2608
Feature activation+3.55e-01

Pos logit contributions
RP+2.734
leon+2.668
wark+2.631
Salt+2.579
chev+2.543
Neg logit contributions
 opera-3.083
reads-2.887
 misleading-2.659
 glim-2.586
=~-2.547
Token -
Token ID532
Feature activation+3.26e-01

Pos logit contributions
RP+1.582
leon+1.541
wark+1.508
Salt+1.464
chev+1.443
Neg logit contributions
 opera-2.821
reads-2.661
 misleading-2.491
 glim-2.464
=~-2.418
Tokenushes
Token ID17237
Feature activation+5.71e-01

Pos logit contributions
RP+2.542
wark+2.417
leon+2.383
ptic+2.347
chev+2.344
Neg logit contributions
 opera-2.038
reads-1.693
 misleading-1.682
 glim-1.665
 asbestos-1.618
Token:
Token ID25
Feature activation+1.29e-01

Pos logit contributions
RP+2.611
leon+2.560
wark+2.500
Salt+2.420
chev+2.418
Neg logit contributions
 opera-4.463
reads-4.124
 glim-4.019
 misleading-3.907
 wra-3.795
Token 31
Token ID3261
Feature activation+5.16e-01

Pos logit contributions
RP+0.747
leon+0.737
wark+0.729
Salt+0.709
chev+0.708
Neg logit contributions
 opera-0.860
reads-0.794
 misleading-0.739
 glim-0.711
 wra-0.709
Token
Token ID198
Feature activation+5.34e-01

Pos logit contributions
RP+2.746
wark+2.632
leon+2.631
Salt+2.587
chev+2.574
Neg logit contributions
 opera-3.601
reads-3.298
 glim-3.170
 misleading-3.102
 asbestos-2.997
Token
Token ID198
Feature activation+8.06e-01

Pos logit contributions
RP+2.951
leon+2.898
wark+2.829
Salt+2.790
chev+2.765
Neg logit contributions
 opera-3.688
reads-3.361
 glim-3.250
 misleading-3.169
 wra-3.071
TokenOpp
Token ID27524
Feature activation+1.24e+00

Pos logit contributions
RP+3.949
Salt+3.657
leon+3.639
Token+3.579
Meat+3.554
Neg logit contributions
 opera-6.261
reads-5.696
 glim-5.531
 misleading-5.473
 asbestos-5.358
Tokenonent
Token ID3471
Feature activation+5.44e-01

Pos logit contributions
RP+3.974
leon+3.753
wark+3.617
plet+3.545
chev+3.497
Neg logit contributions
 opera-11.089
reads-10.375
 glim-10.256
 misleading-9.971
 asbestos-9.718
Token:
Token ID25
Feature activation-5.83e-02

Pos logit contributions
RP+2.492
leon+2.444
wark+2.397
Salt+2.328
chev+2.303
Neg logit contributions
 opera-4.256
reads-3.936
 glim-3.820
 misleading-3.717
 wra-3.617
Token New
Token ID968
Feature activation+1.69e-02

Pos logit contributions
 opera+0.419
reads+0.390
 misleading+0.365
 wra+0.351
=~+0.350
Neg logit contributions
RP-0.305
leon-0.301
wark-0.297
chev-0.288
Salt-0.288
Token Mexico
Token ID5828
Feature activation-6.24e-02

Pos logit contributions
RP+0.069
leon+0.068
wark+0.067
chev+0.065
Salt+0.064
Neg logit contributions
 opera-0.140
reads-0.131
 misleading-0.125
 glim-0.121
=~-0.121
Token
Token ID198
Feature activation+2.70e-01

Pos logit contributions
 opera+0.451
reads+0.423
 misleading+0.391
 glim+0.380
=~+0.377
Neg logit contributions
RP-0.324
leon-0.318
wark-0.314
Salt-0.308
chev-0.303
Tokenel
Token ID417
Feature activation-2.00e-01

Pos logit contributions
leon+1.456
RP+1.454
wark+1.425
chev+1.386
Salt+1.353
Neg logit contributions
 opera-1.740
reads-1.573
 misleading-1.529
 wra-1.446
 glim-1.438
Tokenleg
Token ID1455
Feature activation+9.80e-01

Pos logit contributions
 opera+1.877
reads+1.773
 misleading+1.691
 wra+1.642
=~+1.635
Neg logit contributions
RP-0.612
leon-0.600
wark-0.588
chev-0.560
Salt-0.552
Tokenr
Token ID81
Feature activation+2.82e-01

Pos logit contributions
RP+4.621
leon+4.491
wark+4.411
chev+4.375
Salt+4.114
Neg logit contributions
 opera-7.450
reads-6.746
 misleading-6.704
 glim-6.672
 asbestos-6.350
Tokenini
Token ID5362
Feature activation+7.76e-01

Pos logit contributions
leon+1.401
RP+1.399
wark+1.357
chev+1.331
Salt+1.308
Neg logit contributions
 opera-2.094
reads-1.936
 misleading-1.841
 glim-1.789
 wra-1.777
Token,
Token ID11
Feature activation+1.25e+00

Pos logit contributions
RP+3.827
leon+3.780
wark+3.616
chev+3.531
Salt+3.529
Neg logit contributions
 opera-5.668
 glim-5.302
reads-5.242
 misleading-5.010
 wra-4.856
Token A
Token ID317
Feature activation+1.23e+00

Pos logit contributions
RP+5.495
leon+5.337
wark+5.105
Salt+5.046
Token+4.916
Neg logit contributions
 opera-9.214
 glim-9.033
reads-8.734
 misleading-8.322
 wra-8.089
Token.
Token ID13
Feature activation+8.49e-01

Pos logit contributions
RP+4.975
leon+4.944
wark+4.908
Salt+4.666
chev+4.634
Neg logit contributions
 opera-9.717
reads-9.077
 misleading-8.593
=~-8.353
 wra-8.315
Token D
Token ID360
Feature activation+5.29e-01

Pos logit contributions
RP+4.442
wark+4.389
leon+4.341
chev+4.169
Salt+4.159
Neg logit contributions
 opera-5.884
reads-5.457
 misleading-5.051
=~-4.954
 wra-4.889
Token.,
Token ID1539
Feature activation+4.08e-02

Pos logit contributions
RP+1.631
leon+1.591
wark+1.582
chev+1.490
Salt+1.488
Neg logit contributions
 opera-4.910
reads-4.610
 misleading-4.414
 wra-4.295
=~-4.265
Token Long
Token ID5882
Feature activation-2.27e-01

Pos logit contributions
RP+0.207
leon+0.205
wark+0.202
chev+0.195
Salt+0.195
Neg logit contributions
 opera-0.299
reads-0.279
 misleading-0.262
 wra-0.252
 glim-0.251
Token,
Token ID11
Feature activation+7.11e-01

Pos logit contributions
 opera+1.553
reads+1.442
 misleading+1.336
 wra+1.286
=~+1.274
Neg logit contributions
RP-1.265
leon-1.248
wark-1.232
chev-1.199
Salt-1.199
Token.
Token ID13
Feature activation-3.75e-01

Pos logit contributions
RP+0.012
leon+0.012
wark+0.012
chev+0.011
Salt+0.011
Neg logit contributions
 opera-0.017
reads-0.016
 misleading-0.015
 wra-0.014
=~-0.014
TokenDraw
Token ID25302
Feature activation+3.72e-01

Pos logit contributions
 opera+2.453
reads+2.290
 misleading+2.108
 wra+2.025
=~+2.015
Neg logit contributions
RP-2.155
leon-2.144
wark-2.134
chev-2.066
Salt-2.048
TokenString
Token ID10100
Feature activation+1.54e-01

Pos logit contributions
RP+1.376
leon+1.311
wark+1.279
Salt+1.267
chev+1.232
Neg logit contributions
 opera-3.274
reads-3.046
 misleading-2.921
 wra-2.840
 galaxies-2.820
Token($
Token ID16763
Feature activation+6.37e-02

Pos logit contributions
RP+0.845
leon+0.827
wark+0.813
Salt+0.792
chev+0.786
Neg logit contributions
 opera-1.089
reads-1.000
 misleading-0.939
 glim-0.909
 wra-0.904
Tokenart
Token ID433
Feature activation+2.16e-01

Pos logit contributions
RP+0.156
leon+0.153
wark+0.149
chev+0.140
Salt+0.139
Neg logit contributions
 opera-0.633
reads-0.603
 misleading-0.573
 wra-0.557
=~-0.556
Tokenext
Token ID2302
Feature activation+1.21e+00

Pos logit contributions
RP+1.223
leon+1.210
wark+1.192
chev+1.170
Salt+1.156
Neg logit contributions
 opera-1.456
reads-1.332
 misleading-1.261
 glim-1.208
 wra-1.207
Token[
Token ID58
Feature activation+3.83e-01

Pos logit contributions
RP+4.462
leon+4.274
Salt+4.112
wark+4.006
chev+3.981
Neg logit contributions
 opera-10.445
 glim-10.067
 misleading-9.199
reads-9.193
 wra-9.113
Token$
Token ID3
Feature activation+2.91e-01

Pos logit contributions
RP+2.023
leon+1.984
wark+1.948
Salt+1.905
chev+1.904
Neg logit contributions
 opera-2.764
reads-2.529
 misleading-2.396
 glim-2.339
 wra-2.315
Tokeni
Token ID72
Feature activation+2.25e-01

Pos logit contributions
RP+1.619
leon+1.590
wark+1.568
chev+1.535
Salt+1.521
Neg logit contributions
 opera-2.008
reads-1.834
 misleading-1.734
 glim-1.683
 wra-1.669
Token-
Token ID12
Feature activation+4.49e-01

Pos logit contributions
RP+1.318
leon+1.297
wark+1.283
chev+1.250
Salt+1.247
Neg logit contributions
 opera-1.482
reads-1.353
 misleading-1.270
 glim-1.235
 wra-1.222
Token1
Token ID16
Feature activation+2.54e-01

Pos logit contributions
RP+2.457
leon+2.399
wark+2.361
Salt+2.313
chev+2.308
Neg logit contributions
 opera-3.159
reads-2.858
 misleading-2.731
 glim-2.730
 wra-2.638
Token ISI
Token ID22048
Feature activation-4.34e-01

Pos logit contributions
RP+1.163
leon+1.144
wark+1.120
Salt+1.104
Meat+1.076
Neg logit contributions
 opera-1.649
reads-1.534
 misleading-1.443
 glim-1.387
 wra-1.385
Token
Token ID198
Feature activation+4.66e-02

Pos logit contributions
 opera+2.625
reads+2.385
 misleading+2.221
 wra+2.113
=~+2.098
Neg logit contributions
RP-2.732
leon-2.725
wark-2.687
chev-2.633
Salt-2.600
Token
Token ID198
Feature activation+2.89e-01

Pos logit contributions
RP+0.281
leon+0.278
wark+0.275
chev+0.268
Salt+0.267
Neg logit contributions
 opera-0.298
reads-0.273
 misleading-0.254
 wra-0.243
=~-0.241
TokenMy
Token ID3666
Feature activation-3.31e-02

Pos logit contributions
RP+1.258
leon+1.224
Salt+1.186
wark+1.180
Meat+1.177
Neg logit contributions
 opera-2.369
reads-2.194
 misleading-2.108
 glim-2.043
 wra-2.037
Tokeners
Token ID364
Feature activation+5.66e-01

Pos logit contributions
 opera+0.248
reads+0.230
 misleading+0.218
 wra+0.209
=~+0.208
Neg logit contributions
RP-0.164
leon-0.161
wark-0.159
chev-0.155
Salt-0.154
Token,
Token ID11
Feature activation+1.18e+00

Pos logit contributions
RP+3.054
leon+3.028
wark+2.983
chev+2.926
Salt+2.849
Neg logit contributions
 opera-3.927
reads-3.621
 misleading-3.420
 wra-3.261
=~-3.257
Token D
Token ID360
Feature activation+8.97e-01

Pos logit contributions
RP+5.120
leon+4.999
wark+4.786
Salt+4.622
chev+4.533
Neg logit contributions
 opera-8.897
 glim-8.396
reads-8.291
 misleading-7.856
abwe-7.737
Token.
Token ID13
Feature activation+7.24e-01

Pos logit contributions
RP+3.574
wark+3.510
leon+3.489
chev+3.315
Salt+3.287
Neg logit contributions
 opera-7.378
reads-6.819
 misleading-6.550
 wra-6.372
=~-6.330
Token (
Token ID357
Feature activation+4.49e-01

Pos logit contributions
RP+3.552
wark+3.437
leon+3.428
Salt+3.257
chev+3.200
Neg logit contributions
 opera-5.365
reads-4.982
 glim-4.789
 misleading-4.665
 wra-4.593
Token 2007
Token ID4343
Feature activation+6.18e-01

Pos logit contributions
RP+2.572
leon+2.540
wark+2.493
Salt+2.462
chev+2.414
Neg logit contributions
 opera-2.967
reads-2.735
 misleading-2.539
 glim-2.525
 wra-2.463
Token ).
Token ID6739
Feature activation-1.68e-01

Pos logit contributions
RP+2.129
wark+2.081
leon+2.064
chev+1.933
Salt+1.924
Neg logit contributions
 opera-5.399
reads-5.123
 misleading-4.901
=~-4.786
 glim-4.725
Token.
Token ID13
Feature activation+5.50e-01

Pos logit contributions
RP+3.358
leon+3.357
wark+3.288
chev+3.154
Salt+3.101
Neg logit contributions
 opera-7.352
reads-6.925
 misleading-6.598
=~-6.394
 wra-6.370
Token D
Token ID360
Feature activation+4.17e-01

Pos logit contributions
RP+2.725
leon+2.661
wark+2.653
Salt+2.527
chev+2.499
Neg logit contributions
 opera-4.032
reads-3.799
 glim-3.560
 misleading-3.532
=~-3.456
Token.,
Token ID1539
Feature activation-1.61e-02

Pos logit contributions
RP+1.408
leon+1.376
wark+1.360
chev+1.298
Salt+1.285
Neg logit contributions
 opera-3.763
reads-3.537
 misleading-3.376
 wra-3.281
=~-3.266
Token Bart
Token ID13167
Feature activation+2.63e-02

Pos logit contributions
 opera+0.111
reads+0.103
 misleading+0.097
 wra+0.092
 glim+0.092
Neg logit contributions
RP-0.089
leon-0.088
wark-0.087
chev-0.084
Salt-0.084
Tokenini
Token ID5362
Feature activation+3.31e-01

Pos logit contributions
RP+0.140
leon+0.139
wark+0.137
chev+0.134
Salt+0.131
Neg logit contributions
 opera-0.187
reads-0.172
 misleading-0.164
 wra-0.156
=~-0.156
Token,
Token ID11
Feature activation+1.18e+00

Pos logit contributions
RP+1.822
leon+1.811
wark+1.789
chev+1.744
Salt+1.719
Neg logit contributions
 opera-2.257
reads-2.097
 misleading-1.972
=~-1.880
 wra-1.876
Token M
Token ID337
Feature activation+9.15e-01

Pos logit contributions
leon+5.019
RP+4.974
wark+4.730
chev+4.549
Salt+4.538
Neg logit contributions
 opera-8.659
 glim-8.348
reads-8.323
 misleading-7.801
=~-7.700
Token.
Token ID13
Feature activation+5.75e-01

Pos logit contributions
RP+3.539
leon+3.518
wark+3.455
chev+3.295
Salt+3.241
Neg logit contributions
 opera-7.548
reads-7.110
 misleading-6.781
=~-6.565
 wra-6.550
Token (
Token ID357
Feature activation+4.62e-01

Pos logit contributions
RP+2.829
leon+2.770
wark+2.745
Salt+2.622
chev+2.588
Neg logit contributions
 opera-4.200
reads-3.980
 glim-3.766
 misleading-3.700
=~-3.628
Token 2001
Token ID5878
Feature activation+5.82e-01

Pos logit contributions
RP+2.656
leon+2.620
wark+2.578
Salt+2.534
chev+2.495
Neg logit contributions
 opera-3.040
reads-2.807
 misleading-2.598
 glim-2.593
 wra-2.523
Token ).
Token ID6739
Feature activation-8.36e-02

Pos logit contributions
RP+2.006
wark+1.956
leon+1.938
Salt+1.826
chev+1.813
Neg logit contributions
 opera-5.097
reads-4.856
 misleading-4.620
=~-4.511
 glim-4.446
Tokenа
Token ID16142
Feature activation+2.56e-01

Pos logit contributions
RP+0.234
leon+0.229
wark+0.225
chev+0.219
Salt+0.216
Neg logit contributions
 opera-0.421
reads-0.394
 misleading-0.371
 wra-0.359
=~-0.357
Tokenр
Token ID21169
Feature activation+4.85e-01

Pos logit contributions
RP+0.769
leon+0.745
wark+0.732
chev+0.696
Salt+0.683
Neg logit contributions
 opera-2.406
reads-2.293
 misleading-2.180
=~-2.115
 wra-2.113
Tokenа
Token ID16142
Feature activation+2.89e-01

Pos logit contributions
RP+1.903
leon+1.869
wark+1.802
chev+1.775
Salt+1.741
Neg logit contributions
 opera-4.020
reads-3.837
 misleading-3.591
 wra-3.513
 glim-3.504
Tokenк
Token ID31583
Feature activation+1.47e-02

Pos logit contributions
RP+0.887
leon+0.868
wark+0.845
chev+0.815
Salt+0.791
Neg logit contributions
 opera-2.680
reads-2.551
 misleading-2.417
 wra-2.359
 glim-2.355
Tokenт
Token ID20375
Feature activation+2.45e-01

Pos logit contributions
RP+0.066
leon+0.065
wark+0.064
chev+0.062
Salt+0.061
Neg logit contributions
 opera-0.116
reads-0.110
 misleading-0.103
 wra-0.100
=~-0.099
Tokenе
Token ID16843
Feature activation+1.17e+00

Pos logit contributions
RP+1.191
leon+1.164
wark+1.154
chev+1.125
Salt+1.104
Neg logit contributions
 opera-1.840
reads-1.732
 misleading-1.610
 wra-1.566
 glim-1.550
Tokenр
Token ID21169
Feature activation+2.47e-01

Pos logit contributions
RP+3.377
leon+3.139
wark+2.895
chev+2.876
ptic+2.782
Neg logit contributions
 opera-10.587
reads-10.112
 misleading-9.521
 wra-9.401
 glim-9.371
Tokenн
Token ID22177
Feature activation+6.93e-02

Pos logit contributions
RP+1.131
leon+1.103
wark+1.086
chev+1.062
Salt+1.042
Neg logit contributions
 opera-1.911
reads-1.819
 misleading-1.690
 wra-1.645
 glim-1.625
Tokenо�
Token ID25443
Feature activation+6.57e-02

Pos logit contributions
RP+0.327
leon+0.317
wark+0.311
chev+0.304
Salt+0.300
Neg logit contributions
 opera-0.532
reads-0.499
 misleading-0.470
 wra-0.455
 glim-0.451
Token
Token ID117
Feature activation+2.34e-01

Pos logit contributions
RP+0.330
leon+0.325
wark+0.321
chev+0.311
Salt+0.306
Neg logit contributions
 opera-0.488
reads-0.455
 misleading-0.426
 wra-0.411
 glim-0.409
Token
Token ID12466
Feature activation+4.14e-02

Pos logit contributions
RP+1.062
leon+1.040
wark+1.023
chev+0.995
Salt+0.978
Neg logit contributions
 opera-1.822
reads-1.730
 misleading-1.612
 wra-1.564
=~-1.553
Token.,
Token ID1539
Feature activation+9.41e-02

Pos logit contributions
RP+2.334
wark+2.258
leon+2.246
chev+2.133
Salt+2.128
Neg logit contributions
 opera-5.371
reads-5.019
 misleading-4.797
 wra-4.653
=~-4.633
Token G
Token ID402
Feature activation+2.87e-01

Pos logit contributions
RP+0.519
leon+0.515
wark+0.509
Salt+0.493
chev+0.492
Neg logit contributions
 opera-0.649
reads-0.602
 misleading-0.564
 glim-0.540
 wra-0.539
Tokenue
Token ID518
Feature activation+6.96e-02

Pos logit contributions
RP+1.584
leon+1.574
wark+1.564
chev+1.509
Salt+1.482
Neg logit contributions
 opera-1.979
reads-1.798
 misleading-1.729
=~-1.653
 glim-1.644
Tokenld
Token ID335
Feature activation+1.37e-01

Pos logit contributions
leon+0.369
RP+0.365
wark+0.361
chev+0.352
Salt+0.344
Neg logit contributions
 opera-0.499
reads-0.462
 misleading-0.438
 wra-0.417
 glim-0.417
Tokenner
Token ID1008
Feature activation+4.54e-01

Pos logit contributions
RP+0.729
wark+0.719
leon+0.719
chev+0.700
Salt+0.682
Neg logit contributions
 opera-0.972
reads-0.895
 misleading-0.855
 glim-0.817
 wra-0.811
Token,
Token ID11
Feature activation+1.17e+00

Pos logit contributions
RP+2.341
leon+2.338
wark+2.283
chev+2.251
Salt+2.163
Neg logit contributions
 opera-3.255
reads-2.990
 misleading-2.884
 glim-2.859
=~-2.737
Token B
Token ID347
Feature activation+8.41e-01

Pos logit contributions
RP+5.209
leon+5.151
wark+4.914
Salt+4.769
chev+4.691
Neg logit contributions
 opera-8.693
 glim-8.243
reads-8.116
 misleading-7.658
abwe-7.530
Token.
Token ID13
Feature activation+5.97e-01

Pos logit contributions
RP+3.415
leon+3.398
wark+3.387
chev+3.187
Salt+3.141
Neg logit contributions
 opera-6.863
reads-6.369
 misleading-6.113
 wra-5.904
=~-5.865
Token A
Token ID317
Feature activation+4.98e-01

Pos logit contributions
RP+2.960
leon+2.880
wark+2.873
Salt+2.733
chev+2.685
Neg logit contributions
 opera-4.403
reads-4.106
 glim-3.927
 misleading-3.850
 wra-3.769
Token.,
Token ID1539
Feature activation-4.55e-02

Pos logit contributions
RP+1.870
leon+1.836
wark+1.817
chev+1.745
Salt+1.723
Neg logit contributions
 opera-4.287
reads-4.024
 misleading-3.829
 wra-3.707
=~-3.692
Token Ross
Token ID9847
Feature activation+3.62e-01

Pos logit contributions
 opera+0.314
reads+0.291
 misleading+0.273
 wra+0.261
=~+0.260
Neg logit contributions
RP-0.251
leon-0.249
wark-0.246
chev-0.238
Salt-0.238
Token[
Token ID58
Feature activation+5.74e-01

Pos logit contributions
RP+1.086
leon+1.068
wark+1.042
Salt+1.024
chev+1.009
Neg logit contributions
 opera-2.026
reads-1.869
 misleading-1.785
 glim-1.717
 wra-1.707
Token {
Token ID1391
Feature activation+1.35e-01

Pos logit contributions
RP+3.263
leon+3.208
wark+3.153
Salt+3.125
chev+3.106
Neg logit contributions
 opera-3.699
reads-3.363
 misleading-3.212
 glim-3.089
 wra-3.062
Token Keys
Token ID26363
Feature activation+5.40e-01

Pos logit contributions
RP+0.727
leon+0.716
wark+0.702
Salt+0.692
chev+0.686
Neg logit contributions
 opera-0.954
reads-0.876
 misleading-0.831
 wra-0.796
 glim-0.795
Token[
Token ID58
Feature activation+5.21e-01

Pos logit contributions
RP+2.290
leon+2.204
Salt+2.169
wark+2.141
Meat+2.098
Neg logit contributions
 opera-4.396
reads-4.033
 misleading-3.869
 glim-3.772
 wra-3.717
Tokensearch
Token ID12947
Feature activation+8.63e-01

Pos logit contributions
RP+2.420
leon+2.399
wark+2.347
chev+2.292
Salt+2.291
Neg logit contributions
 opera-3.982
reads-3.667
 misleading-3.521
 glim-3.426
 wra-3.401
TokenSpace
Token ID14106
Feature activation+1.17e+00

Pos logit contributions
RP+2.984
Salt+2.799
leon+2.772
Meat+2.715
wark+2.689
Neg logit contributions
 opera-7.638
reads-7.156
 misleading-6.939
 wra-6.757
 asbestos-6.660
Token][
Token ID7131
Feature activation+6.58e-01

Pos logit contributions
RP+5.575
leon+5.316
Salt+5.184
wark+5.127
Token+5.102
Neg logit contributions
 opera-8.422
reads-7.641
 misleading-7.446
 glim-7.381
 asbestos-7.157
Token[
Token ID58
Feature activation+4.57e-01

Pos logit contributions
RP+3.315
leon+3.266
Salt+3.193
wark+3.180
chev+3.151
Neg logit contributions
 opera-4.728
reads-4.274
 misleading-4.132
 glim-4.038
 asbestos-3.983
Tokeni
Token ID72
Feature activation+3.63e-01

Pos logit contributions
RP+2.395
leon+2.348
wark+2.307
Salt+2.291
chev+2.255
Neg logit contributions
 opera-3.254
reads-2.952
 misleading-2.834
 glim-2.759
 asbestos-2.721
Token]
Token ID60
Feature activation-7.58e-02

Pos logit contributions
RP+1.724
leon+1.687
wark+1.658
Salt+1.621
chev+1.608
Neg logit contributions
 opera-2.775
reads-2.565
 misleading-2.428
 glim-2.341
 wra-2.339
Token],
Token ID4357
Feature activation+4.53e-01

Pos logit contributions
 opera+0.574
reads+0.535
 misleading+0.504
 wra+0.485
 glim+0.481
Neg logit contributions
RP-0.368
leon-0.363
wark-0.357
chev-0.347
Salt-0.346
Token Crossref
Token ID23613
Feature activation+2.99e-01

Pos logit contributions
RP+0.474
leon+0.467
wark+0.460
chev+0.449
Salt+0.447
Neg logit contributions
 opera-0.587
reads-0.543
 misleading-0.507
 wra-0.487
=~-0.482
Token
Token ID198
Feature activation-2.28e-02

Pos logit contributions
RP+1.703
leon+1.686
wark+1.666
chev+1.621
Salt+1.618
Neg logit contributions
 opera-1.987
reads-1.833
 misleading-1.713
 wra-1.634
=~-1.620
Token
Token ID198
Feature activation+3.16e-01

Pos logit contributions
 opera+0.142
reads+0.131
 misleading+0.121
 wra+0.116
=~+0.115
Neg logit contributions
RP-0.140
leon-0.138
wark-0.137
chev-0.134
Salt-0.133
TokenPe
Token ID6435
Feature activation+3.04e-01

Pos logit contributions
RP+1.036
leon+0.983
Salt+0.946
wark+0.944
Meat+0.914
Neg logit contributions
 opera-2.935
reads-2.755
 misleading-2.660
 glim-2.580
 wra-2.577
Tokeneters
Token ID7307
Feature activation+2.44e-01

Pos logit contributions
leon+1.551
RP+1.540
wark+1.506
chev+1.478
Salt+1.449
Neg logit contributions
 opera-2.222
reads-2.049
 misleading-1.986
 glim-1.883
=~-1.874
Token,
Token ID11
Feature activation+1.17e+00

Pos logit contributions
RP+1.353
leon+1.339
wark+1.323
chev+1.290
Salt+1.274
Neg logit contributions
 opera-1.668
reads-1.543
 misleading-1.452
 wra-1.385
=~-1.379
Token M
Token ID337
Feature activation+1.06e+00

Pos logit contributions
RP+5.109
leon+5.018
wark+4.808
Salt+4.650
chev+4.538
Neg logit contributions
 opera-8.757
 glim-8.194
reads-8.190
 misleading-7.744
 wra-7.586
Token.,
Token ID1539
Feature activation+1.70e-01

Pos logit contributions
RP+4.289
wark+4.229
leon+4.220
chev+3.973
Salt+3.952
Neg logit contributions
 opera-8.593
reads-7.977
 misleading-7.635
 wra-7.391
=~-7.340
Token C
Token ID327
Feature activation+3.87e-01

Pos logit contributions
RP+0.939
leon+0.932
wark+0.921
Salt+0.891
chev+0.888
Neg logit contributions
 opera-1.166
reads-1.081
 misleading-1.015
 glim-0.976
 wra-0.969
Tokenill
Token ID359
Feature activation+4.61e-01

Pos logit contributions
RP+2.197
leon+2.191
wark+2.183
chev+2.106
Salt+2.058
Neg logit contributions
 opera-2.596
reads-2.344
 misleading-2.276
=~-2.164
 glim-2.155
Tokenessen
Token ID44483
Feature activation+2.88e-01

Pos logit contributions
RP+2.512
leon+2.497
wark+2.490
chev+2.420
Salt+2.369
Neg logit contributions
 opera-3.163
reads-2.895
 misleading-2.785
 glim-2.693
=~-2.630
Token
Token ID123
Feature activation-2.68e-01

Pos logit contributions
 opera+1.223
reads+1.139
 misleading+1.071
 wra+1.035
=~+1.027
Neg logit contributions
RP-0.792
leon-0.781
wark-0.770
chev-0.748
Salt-0.743
Tokenо�
Token ID25443
Feature activation-3.03e-01

Pos logit contributions
 opera+2.404
reads+2.265
 misleading+2.155
 wra+2.090
=~+2.079
Neg logit contributions
RP-0.927
leon-0.912
wark-0.894
chev-0.856
Salt-0.848
Token
Token ID112
Feature activation-7.54e-02

Pos logit contributions
 opera+2.436
reads+2.279
 misleading+2.153
 wra+2.081
=~+2.068
Neg logit contributions
RP-1.327
leon-1.307
wark-1.288
chev-1.245
Salt-1.237
Tokenт
Token ID20375
Feature activation+3.74e-01

Pos logit contributions
 opera+0.648
reads+0.612
 misleading+0.578
 wra+0.562
=~+0.557
Neg logit contributions
RP-0.289
leon-0.283
wark-0.279
chev-0.268
Salt-0.265
Tokenв
Token ID38857
Feature activation-7.12e-03

Pos logit contributions
RP+1.381
leon+1.340
wark+1.327
chev+1.276
Salt+1.251
Neg logit contributions
 opera-3.250
reads-3.083
 misleading-2.889
 wra-2.826
=~-2.797
Tokenе
Token ID16843
Feature activation+1.16e+00

Pos logit contributions
 opera+0.051
reads+0.047
 misleading+0.044
 wra+0.042
=~+0.042
Neg logit contributions
RP-0.038
leon-0.037
wark-0.037
chev-0.036
Salt-0.036
Tokenр
Token ID21169
Feature activation+2.99e-01

Pos logit contributions
RP+2.077
leon+1.873
chev+1.737
wark+1.634
ptic+1.528
Neg logit contributions
 opera-11.879
reads-11.472
 misleading-10.739
 wra-10.695
 glim-10.626
Tokenд
Token ID43666
Feature activation-3.35e-01

Pos logit contributions
RP+0.739
leon+0.714
wark+0.679
chev+0.660
Salt+0.629
Neg logit contributions
 opera-2.948
reads-2.835
 misleading-2.675
 wra-2.623
 glim-2.602
Tokenи
Token ID18849
Feature activation+1.38e-01

Pos logit contributions
 opera+2.073
reads+1.896
 misleading+1.760
 wra+1.679
=~+1.666
Neg logit contributions
RP-2.089
leon-2.068
wark-2.046
chev-1.996
Salt-1.990
Tokenт
Token ID20375
Feature activation+1.65e-01

Pos logit contributions
RP+0.570
leon+0.558
wark+0.549
chev+0.532
Salt+0.522
Neg logit contributions
 opera-1.135
reads-1.076
 misleading-1.011
 wra-0.984
=~-0.973
Tokenь
Token ID45367
Feature activation+1.44e-01

Pos logit contributions
RP+0.736
leon+0.717
wark+0.707
chev+0.692
Salt+0.677
Neg logit contributions
 opera-1.302
reads-1.235
 misleading-1.155
 wra-1.117
=~-1.108
Tokened
Token ID276
Feature activation-4.66e-01

Pos logit contributions
 opera+1.675
reads+1.551
 misleading+1.445
 wra+1.385
=~+1.377
Neg logit contributions
RP-1.392
leon-1.375
wark-1.358
chev-1.325
Salt-1.315
Token.
Token ID13
Feature activation-3.75e-01

Pos logit contributions
 opera+3.222
reads+3.009
 misleading+2.784
 wra+2.674
=~+2.641
Neg logit contributions
RP-2.515
leon-2.504
wark-2.463
chev-2.429
Salt-2.386
Tokencom
Token ID785
Feature activation-8.10e-01

Pos logit contributions
 opera+2.036
reads+1.855
 misleading+1.677
 wra+1.580
=~+1.571
Neg logit contributions
RP-2.590
leon-2.582
wark-2.556
chev-2.506
Salt-2.494
Token and
Token ID290
Feature activation-1.38e-01

Pos logit contributions
 opera+5.210
reads+4.793
 misleading+4.423
 wra+4.244
=~+4.178
Neg logit contributions
RP-4.710
leon-4.685
wark-4.582
chev-4.552
Salt-4.475
Token V
Token ID569
Feature activation-1.65e-01

Pos logit contributions
 opera+0.883
reads+0.814
 misleading+0.757
 wra+0.724
=~+0.724
Neg logit contributions
RP-0.823
leon-0.812
wark-0.805
Salt-0.781
chev-0.778
Tokenodka
Token ID28601
Feature activation-1.33e+00

Pos logit contributions
 opera+1.194
reads+1.109
 misleading+1.040
 wra+1.000
=~+0.994
Neg logit contributions
RP-0.858
leon-0.848
wark-0.837
chev-0.814
Salt-0.808
Token O
Token ID440
Feature activation-3.39e-01

Pos logit contributions
 opera+6.979
reads+6.269
 misleading+5.658
 wra+5.364
 galaxies+5.153
Neg logit contributions
leon-9.168
RP-9.129
wark-9.060
chev-8.988
Meat-8.758
Token'
Token ID6
Feature activation+5.27e-01

Pos logit contributions
 opera+2.470
reads+2.310
 misleading+2.162
 wra+2.087
=~+2.054
Neg logit contributions
RP-1.703
leon-1.688
wark-1.665
chev-1.636
Salt-1.605
TokenClock
Token ID44758
Feature activation-7.22e-02

Pos logit contributions
RP+1.093
leon+1.046
wark+0.997
Salt+0.951
Meat+0.920
Neg logit contributions
 opera-5.465
reads-5.153
 misleading-4.978
 glim-4.926
 wra-4.869
Token Podcast
Token ID16036
Feature activation-8.43e-02

Pos logit contributions
 opera+0.466
reads+0.429
 misleading+0.399
 wra+0.382
=~+0.379
Neg logit contributions
RP-0.431
leon-0.424
wark-0.420
chev-0.409
Salt-0.407
Token.
Token ID13
Feature activation-3.02e-01

Pos logit contributions
 opera+0.613
reads+0.569
 misleading+0.535
 wra+0.514
=~+0.511
Neg logit contributions
RP-0.435
leon-0.429
wark-0.424
chev-0.412
Salt-0.409
TokenPhone
Token ID6132
Feature activation+6.45e-01

Pos logit contributions
RP+2.572
leon+2.498
wark+2.390
Salt+2.387
Token+2.358
Neg logit contributions
 opera-5.716
reads-5.282
 misleading-5.116
 glim-5.006
 asbestos-4.995
TokenList
Token ID8053
Feature activation+6.60e-01

Pos logit contributions
RP+1.952
leon+1.854
wark+1.775
Salt+1.752
Token+1.745
Neg logit contributions
 opera-5.984
reads-5.617
 misleading-5.362
 glim-5.267
 asbestos-5.252
TokenCtrl
Token ID40069
Feature activation-1.47e-01

Pos logit contributions
RP+2.375
leon+2.309
wark+2.207
Salt+2.186
chev+2.169
Neg logit contributions
 opera-5.766
reads-5.432
 misleading-5.149
 glim-5.009
 asbestos-5.002
Token">
Token ID5320
Feature activation+3.41e-01

Pos logit contributions
 opera+0.911
reads+0.838
 misleading+0.774
 wra+0.738
=~+0.733
Neg logit contributions
RP-0.915
leon-0.905
wark-0.895
chev-0.875
Salt-0.870
Token <
Token ID1279
Feature activation-4.10e-01

Pos logit contributions
RP+1.983
leon+1.973
wark+1.934
chev+1.895
Salt+1.881
Neg logit contributions
 opera-2.199
reads-2.035
 misleading-1.885
 wra-1.816
=~-1.814
Tokenul
Token ID377
Feature activation-1.30e+00

Pos logit contributions
 opera+2.090
reads+1.891
 misleading+1.706
 wra+1.625
=~+1.595
Neg logit contributions
RP-2.972
leon-2.940
wark-2.930
Salt-2.863
chev-2.862
Token>
Token ID29
Feature activation+8.04e-02

Pos logit contributions
 opera+7.534
reads+6.683
 misleading+6.377
 wra+6.229
=~+6.070
Neg logit contributions
RP-7.956
leon-7.866
wark-7.818
Meat-7.767
Salt-7.745
Token <
Token ID1279
Feature activation-2.20e-01

Pos logit contributions
RP+0.488
leon+0.484
wark+0.477
chev+0.467
Salt+0.465
Neg logit contributions
 opera-0.506
reads-0.467
 misleading-0.431
 wra-0.413
=~-0.412
Tokenli
Token ID4528
Feature activation-9.56e-01

Pos logit contributions
 opera+1.487
reads+1.374
 misleading+1.282
 wra+1.231
=~+1.221
Neg logit contributions
RP-1.242
leon-1.228
wark-1.216
chev-1.183
Salt-1.178
Token ng
Token ID23370
Feature activation-1.83e-01

Pos logit contributions
 opera+3.669
reads+3.078
 misleading+2.809
 wra+2.686
=~+2.503
Neg logit contributions
RP-7.954
leon-7.908
wark-7.886
Meat-7.816
Salt-7.786
Token-
Token ID12
Feature activation+1.69e-01

Pos logit contributions
 opera+1.122
reads+1.038
 misleading+0.952
 wra+0.909
=~+0.905
Neg logit contributions
RP-1.135
leon-1.131
wark-1.123
chev-1.089
Salt-1.087
TokenFont
Token ID23252
Feature activation+3.43e-01

Pos logit contributions
RP+2.205
leon+2.093
wark+2.031
Salt+2.019
Meat+1.990
Neg logit contributions
 opera-4.068
reads-3.745
 misleading-3.607
 wra-3.489
 asbestos-3.480
TokenStyle
Token ID21466
Feature activation-3.40e-01

Pos logit contributions
RP+1.438
leon+1.387
wark+1.347
Salt+1.316
chev+1.291
Neg logit contributions
 opera-2.854
reads-2.656
 misleading-2.527
 glim-2.461
 wra-2.450
Token]
Token ID60
Feature activation-3.90e-01

Pos logit contributions
 opera+2.658
reads+2.509
 misleading+2.349
=~+2.268
 wra+2.252
Neg logit contributions
leon-1.504
RP-1.495
wark-1.491
chev-1.450
Salt-1.415
Token::
Token ID3712
Feature activation-5.26e-01

Pos logit contributions
 opera+3.182
reads+3.032
 misleading+2.835
=~+2.761
 wra+2.719
Neg logit contributions
wark-1.530
leon-1.522
RP-1.511
chev-1.489
Salt-1.434
TokenB
Token ID33
Feature activation-9.64e-01

Pos logit contributions
 opera+4.673
reads+4.452
 misleading+4.202
=~+4.074
 wra+4.069
Neg logit contributions
leon-1.723
wark-1.716
RP-1.711
chev-1.629
ptic-1.546
Tokenold
Token ID727
Feature activation-1.20e+00

Pos logit contributions
 opera+5.970
reads+5.872
 misleading+5.088
 wra+4.907
=~+4.817
Neg logit contributions
RP-5.422
leon-5.394
wark-5.286
Meat-5.266
chev-5.210
Token)
Token ID8
Feature activation-3.62e-01

Pos logit contributions
 opera+6.457
reads+5.882
 misleading+5.310
 wra+5.149
=~+5.030
Neg logit contributions
RP-7.783
wark-7.758
leon-7.661
chev-7.578
soDeliveryDate-7.489
Token $
Token ID720
Feature activation-3.22e-01

Pos logit contributions
 opera+2.434
reads+2.242
 misleading+2.104
 wra+2.019
=~+1.987
Neg logit contributions
RP-2.052
leon-2.033
wark-2.015
chev-1.966
Salt-1.953
TokenBr
Token ID9414
Feature activation-3.06e-02

Pos logit contributions
 opera+2.752
reads+2.609
 misleading+2.451
 wra+2.376
=~+2.373
Neg logit contributions
RP-1.205
leon-1.201
wark-1.181
chev-1.131
Salt-1.111
Tokenush
Token ID1530
Feature activation+1.78e-01

Pos logit contributions
 opera+0.199
reads+0.185
 misleading+0.171
 wra+0.163
=~+0.163
Neg logit contributions
RP-0.179
leon-0.177
wark-0.175
chev-0.171
Salt-0.170
Token2
Token ID17
Feature activation+1.94e-02

Pos logit contributions
RP+1.064
leon+1.043
wark+1.029
Salt+1.008
chev+1.006
Neg logit contributions
 opera-1.153
reads-1.049
 misleading-0.987
 wra-0.938
 glim-0.935
Token heat
Token ID4894
Feature activation+5.25e-01

Pos logit contributions
RP+0.216
leon+0.213
wark+0.210
chev+0.205
Salt+0.203
Neg logit contributions
 opera-0.269
reads-0.249
 misleading-0.232
 wra-0.222
=~-0.222
Token sinks
Token ID38614
Feature activation-5.35e-02

Pos logit contributions
RP+3.308
leon+3.261
wark+3.194
Salt+3.149
chev+3.141
Neg logit contributions
 opera-3.068
reads-2.794
 misleading-2.615
=~-2.483
 glim-2.474
Token.
Token ID13
Feature activation-3.36e-02

Pos logit contributions
 opera+0.367
reads+0.340
 misleading+0.317
 wra+0.304
=~+0.302
Neg logit contributions
RP-0.298
leon-0.294
wark-0.291
chev-0.283
Salt-0.282
Token The
Token ID383
Feature activation-2.14e-01

Pos logit contributions
 opera+0.220
reads+0.202
 misleading+0.189
 wra+0.181
 glim+0.180
Neg logit contributions
RP-0.200
leon-0.196
wark-0.194
Salt-0.189
chev-0.188
Token two
Token ID734
Feature activation-7.64e-01

Pos logit contributions
 opera+1.314
reads+1.206
 misleading+1.111
=~+1.059
 wra+1.057
Neg logit contributions
RP-1.345
leon-1.326
wark-1.314
chev-1.282
Salt-1.277
Token form
Token ID1296
Feature activation-1.16e+00

Pos logit contributions
 opera+4.751
reads+4.318
 misleading+4.058
 wra+3.919
 galaxies+3.806
Neg logit contributions
leon-4.668
RP-4.639
wark-4.594
chev-4.475
Salt-4.461
Token-
Token ID12
Feature activation-4.12e-01

Pos logit contributions
 opera+6.383
reads+5.786
 misleading+5.348
 wra+5.315
 galaxies+5.058
Neg logit contributions
RP-7.481
wark-7.439
leon-7.394
Salt-7.299
soDeliveryDate-7.181
Tokenin
Token ID259
Feature activation-5.40e-01

Pos logit contributions
 opera+3.116
reads+2.955
 misleading+2.739
 wra+2.660
=~+2.637
Neg logit contributions
RP-1.962
leon-1.917
wark-1.916
Salt-1.850
chev-1.829
Token-
Token ID12
Feature activation-5.42e-01

Pos logit contributions
 opera+3.305
reads+3.027
 misleading+2.811
 wra+2.709
 galaxies+2.639
Neg logit contributions
RP-3.359
leon-3.344
wark-3.313
chev-3.235
Salt-3.235
Tokenone
Token ID505
Feature activation-2.97e-01

Pos logit contributions
 opera+4.209
reads+3.999
 misleading+3.710
 wra+3.624
=~+3.565
Neg logit contributions
RP-2.427
leon-2.368
wark-2.362
Salt-2.306
chev-2.281
Token heat
Token ID4894
Feature activation+1.55e-01

Pos logit contributions
 opera+1.554
reads+1.399
 misleading+1.277
 wra+1.207
=~+1.192
Neg logit contributions
RP-2.135
leon-2.117
wark-2.098
chev-2.056
Salt-2.047
Token of
Token ID286
Feature activation-2.81e-01

Pos logit contributions
 opera+3.672
reads+3.358
 misleading+3.089
 wra+2.852
 galaxies+2.794
Neg logit contributions
wark-4.800
RP-4.776
leon-4.767
chev-4.701
Salt-4.628
Token this
Token ID428
Feature activation-1.47e-01

Pos logit contributions
 opera+1.924
reads+1.767
 misleading+1.674
 wra+1.574
 galaxies+1.561
Neg logit contributions
RP-1.564
leon-1.557
wark-1.550
chev-1.499
Salt-1.494
Token year
Token ID614
Feature activation+1.11e-01

Pos logit contributions
 opera+0.779
reads+0.689
 misleading+0.648
 wra+0.609
 galaxies+0.590
Neg logit contributions
RP-1.051
leon-1.045
wark-1.039
Salt-1.013
chev-1.011
Token's
Token ID338
Feature activation+9.11e-02

Pos logit contributions
RP+0.588
leon+0.581
wark+0.573
chev+0.558
Salt+0.555
Neg logit contributions
 opera-0.789
reads-0.732
 misleading-0.686
 wra-0.660
=~-0.655
Token event
Token ID1785
Feature activation-5.47e-02

Pos logit contributions
RP+0.524
leon+0.518
wark+0.513
chev+0.499
Salt+0.497
Neg logit contributions
 opera-0.608
reads-0.559
 misleading-0.523
 wra-0.501
=~-0.496
Token.
Token ID13
Feature activation-1.12e+00

Pos logit contributions
 opera+0.381
reads+0.354
 misleading+0.331
=~+0.316
 wra+0.315
Neg logit contributions
RP-0.293
wark-0.291
leon-0.291
chev-0.283
Salt-0.278
Token
Token ID198
Feature activation-1.39e-01

Pos logit contributions
 opera+5.287
reads+4.958
 misleading+4.308
 wra+4.018
 galaxies+3.999
Neg logit contributions
RP-7.764
soDeliveryDate-7.739
plet-7.732
wark-7.645
chev-7.594
Token
Token ID198
Feature activation+2.82e-02

Pos logit contributions
 opera+0.861
reads+0.791
 misleading+0.732
 wra+0.696
=~+0.691
Neg logit contributions
RP-0.863
leon-0.855
wark-0.850
chev-0.831
Salt-0.822
TokenA
Token ID32
Feature activation+7.14e-02

Pos logit contributions
RP+0.137
leon+0.134
wark+0.131
Salt+0.129
Meat+0.128
Neg logit contributions
 opera-0.216
reads-0.199
 misleading-0.189
 wra-0.183
 glim-0.182
Token few
Token ID1178
Feature activation+7.48e-02

Pos logit contributions
RP+0.461
leon+0.461
wark+0.459
chev+0.443
Salt+0.443
Neg logit contributions
 opera-0.419
reads-0.386
 misleading-0.347
=~-0.339
 galaxies-0.333
Token days
Token ID1528
Feature activation+3.13e-01

Pos logit contributions
RP+0.534
leon+0.531
wark+0.522
Salt+0.510
chev+0.510
Neg logit contributions
 opera-0.387
reads-0.352
 misleading-0.309
=~-0.308
 glim-0.297
Tokenxx
Token ID5324
Feature activation-1.15e-01

Pos logit contributions
leon+0.865
RP+0.857
wark+0.849
chev+0.833
Salt+0.798
Neg logit contributions
 opera-1.554
reads-1.432
 misleading-1.384
 wra-1.333
 glim-1.320
Tokenbrand
Token ID17938
Feature activation-9.55e-02

Pos logit contributions
 opera+0.809
reads+0.747
 misleading+0.705
 wra+0.676
=~+0.672
Neg logit contributions
RP-0.620
leon-0.611
wark-0.604
chev-0.588
Salt-0.584
Token?
Token ID30
Feature activation-2.96e-01

Pos logit contributions
 opera+0.668
reads+0.617
 misleading+0.579
 wra+0.557
=~+0.553
Neg logit contributions
RP-0.519
leon-0.514
wark-0.507
chev-0.496
Salt-0.489
Tokenf
Token ID69
Feature activation-4.69e-01

Pos logit contributions
 opera+2.498
reads+2.346
 misleading+2.222
 wra+2.151
=~+2.140
Neg logit contributions
RP-1.173
leon-1.153
wark-1.138
chev-1.095
Salt-1.086
Tokenref
Token ID5420
Feature activation-3.56e-01

Pos logit contributions
 opera+4.042
reads+3.884
 misleading+3.602
=~+3.532
 wra+3.489
Neg logit contributions
RP-1.712
wark-1.626
leon-1.589
chev-1.549
Salt-1.535
Token=
Token ID28
Feature activation-1.12e+00

Pos logit contributions
 opera+1.402
reads+1.212
 misleading+1.069
 wra+0.981
 glim+0.961
Neg logit contributions
RP-3.025
leon-2.997
wark-2.973
chev-2.926
Salt-2.920
Tokents
Token ID912
Feature activation-4.00e-01

Pos logit contributions
 opera+6.486
reads+6.075
 misleading+5.420
=~+5.273
 galaxies+5.065
Neg logit contributions
RP-6.879
leon-6.802
wark-6.745
Salt-6.703
Meat-6.605
Token<|endoftext|>
Token ID50256
Feature activation-3.78e-02

Pos logit contributions
 opera+2.720
reads+2.515
 misleading+2.347
 wra+2.250
=~+2.234
Neg logit contributions
RP-2.243
leon-2.220
wark-2.193
chev-2.136
Salt-2.127
TokenA
Token ID32
Feature activation-3.68e-02

Pos logit contributions
 opera+0.291
reads+0.268
 misleading+0.255
 wra+0.246
 glim+0.244
Neg logit contributions
RP-0.182
leon-0.179
wark-0.176
Salt-0.172
chev-0.171
Token month
Token ID1227
Feature activation-2.26e-02

Pos logit contributions
 opera+0.229
reads+0.211
 misleading+0.193
=~+0.187
 wra+0.185
Neg logit contributions
RP-0.226
leon-0.225
wark-0.223
chev-0.217
Salt-0.217
Token before
Token ID878
Feature activation-3.18e-01

Pos logit contributions
 opera+0.154
reads+0.143
 misleading+0.133
=~+0.129
 glim+0.129
Neg logit contributions
RP-0.125
leon-0.124
wark-0.122
chev-0.119
Salt-0.118
Token Program
Token ID6118
Feature activation-3.18e-01

Pos logit contributions
RP+0.487
leon+0.475
wark+0.474
chev+0.460
Salt+0.457
Neg logit contributions
 opera-0.551
reads-0.513
 misleading-0.475
 wra-0.458
=~-0.457
Token.
Token ID13
Feature activation-3.04e-01

Pos logit contributions
 opera+2.196
reads+2.031
 misleading+1.900
 wra+1.822
=~+1.810
Neg logit contributions
RP-1.742
leon-1.724
wark-1.702
chev-1.660
Salt-1.651
Token At
Token ID1629
Feature activation-6.65e-02

Pos logit contributions
 opera+1.981
reads+1.823
 misleading+1.698
 wra+1.627
=~+1.613
Neg logit contributions
RP-1.792
leon-1.771
wark-1.752
chev-1.708
Salt-1.702
Tokeniku
Token ID28643
Feature activation-1.31e-01

Pos logit contributions
 opera+0.453
reads+0.420
 misleading+0.393
 wra+0.378
=~+0.377
Neg logit contributions
RP-0.370
leon-0.365
wark-0.362
Salt-0.352
chev-0.351
Token Ab
Token ID2275
Feature activation-8.30e-03

Pos logit contributions
 opera+0.866
reads+0.800
 misleading+0.747
 wra+0.712
=~+0.710
Neg logit contributions
RP-0.764
leon-0.755
wark-0.747
chev-0.729
Salt-0.724
Tokenub
Token ID549
Feature activation-1.12e+00

Pos logit contributions
 opera+0.056
reads+0.051
 misleading+0.049
=~+0.046
 wra+0.046
Neg logit contributions
leon-0.047
RP-0.047
wark-0.047
chev-0.046
Salt-0.045
Tokenak
Token ID461
Feature activation-6.82e-01

Pos logit contributions
 opera+5.207
reads+4.802
 misleading+4.150
 wra+3.958
=~+3.932
Neg logit contributions
RP-8.293
leon-8.178
wark-8.090
Salt-8.016
Meat-7.972
Tokenar
Token ID283
Feature activation-5.73e-02

Pos logit contributions
 opera+4.274
reads+3.961
 misleading+3.601
=~+3.453
 wra+3.437
Neg logit contributions
leon-4.120
RP-4.117
wark-4.057
chev-3.959
Salt-3.953
Token was
Token ID373
Feature activation+3.08e-01

Pos logit contributions
 opera+0.386
reads+0.353
 misleading+0.334
 wra+0.319
 glim+0.318
Neg logit contributions
RP-0.326
leon-0.321
wark-0.320
chev-0.312
Salt-0.306
Token the
Token ID262
Feature activation+4.56e-01

Pos logit contributions
RP+1.727
leon+1.693
wark+1.672
chev+1.637
Salt+1.620
Neg logit contributions
 opera-2.042
reads-1.924
 misleading-1.754
=~-1.723
 wra-1.709
Token first
Token ID717
Feature activation+1.12e-01

Pos logit contributions
RP+2.741
wark+2.671
leon+2.656
chev+2.586
Salt+2.584
Neg logit contributions
 opera-2.748
reads-2.640
 misleading-2.357
=~-2.339
 glim-2.278
Token the
Token ID262
Feature activation-3.51e-01

Pos logit contributions
RP+0.201
leon+0.199
wark+0.198
chev+0.193
Salt+0.191
Neg logit contributions
 opera-0.247
reads-0.230
 misleading-0.213
=~-0.206
 wra-0.203
Token cl
Token ID537
Feature activation-4.19e-01

Pos logit contributions
 opera+2.161
reads+1.980
 misleading+1.833
 wra+1.749
=~+1.738
Neg logit contributions
RP-2.196
leon-2.173
wark-2.150
chev-2.101
Salt-2.091
Tokenumps
Token ID8142
Feature activation-3.75e-01

Pos logit contributions
 opera+3.036
reads+2.837
 misleading+2.645
 wra+2.552
=~+2.534
Neg logit contributions
RP-2.153
leon-2.129
wark-2.100
Salt-2.033
chev-2.032
Token with
Token ID351
Feature activation-3.30e-01

Pos logit contributions
 opera+2.411
reads+2.211
 misleading+2.066
 wra+1.978
=~+1.952
Neg logit contributions
RP-2.244
leon-2.217
wark-2.193
chev-2.140
Salt-2.131
Token Val
Token ID3254
Feature activation-4.61e-01

Pos logit contributions
 opera+2.156
reads+1.993
 misleading+1.850
 wra+1.768
=~+1.765
Neg logit contributions
RP-1.938
leon-1.917
wark-1.894
chev-1.847
Salt-1.843
Tokenle
Token ID293
Feature activation-1.11e+00

Pos logit contributions
 opera+3.877
reads+3.653
 misleading+3.437
 wra+3.341
=~+3.311
Neg logit contributions
RP-1.824
leon-1.762
wark-1.754
chev-1.699
Salt-1.684
Tokenjo
Token ID7639
Feature activation-4.05e-01

Pos logit contributions
 opera+4.923
reads+4.410
 misleading+3.840
 wra+3.615
=~+3.556
Neg logit contributions
RP-8.358
wark-8.169
Salt-8.152
leon-8.101
Meat-8.099
Token earth
Token ID4534
Feature activation-1.34e-01

Pos logit contributions
 opera+2.579
reads+2.372
 misleading+2.203
 wra+2.104
=~+2.091
Neg logit contributions
RP-2.449
leon-2.422
wark-2.396
chev-2.339
Salt-2.329
Token pigment
Token ID43385
Feature activation-6.33e-01

Pos logit contributions
 opera+0.832
reads+0.767
 misleading+0.715
=~+0.682
 wra+0.675
Neg logit contributions
RP-0.823
leon-0.816
wark-0.807
chev-0.787
Salt-0.787
Token and
Token ID290
Feature activation-7.85e-02

Pos logit contributions
 opera+4.255
reads+3.909
 misleading+3.649
 wra+3.529
=~+3.463
Neg logit contributions
RP-3.597
leon-3.566
wark-3.508
chev-3.434
Salt-3.399
Token took
Token ID1718
Feature activation+3.20e-01

Pos logit contributions
 opera+0.500
reads+0.465
 misleading+0.428
=~+0.410
 wra+0.407
Neg logit contributions
RP-0.473
leon-0.466
wark-0.461
chev-0.451
Salt-0.450
Token
Token ID198
Feature activation+1.59e-01

Pos logit contributions
RP+0.881
leon+0.871
wark+0.858
chev+0.838
Salt+0.835
Neg logit contributions
 opera-0.977
reads-0.895
 misleading-0.837
 glim-0.808
 wra-0.802
TokenPF
Token ID42668
Feature activation-6.54e-01

Pos logit contributions
RP+0.780
leon+0.761
wark+0.742
Salt+0.740
Meat+0.731
Neg logit contributions
 opera-1.213
reads-1.111
 misleading-1.062
 glim-1.026
 wra-1.025
Token Bentley
Token ID38881
Feature activation-5.43e-01

Pos logit contributions
 opera+3.893
reads+3.513
 misleading+3.238
 wra+3.063
=~+3.062
Neg logit contributions
RP-4.210
leon-4.143
wark-4.097
chev-4.039
Meat-4.006
Token/
Token ID14
Feature activation-3.78e-01

Pos logit contributions
 opera+3.568
reads+3.325
 misleading+3.084
 wra+2.954
=~+2.948
Neg logit contributions
leon-3.062
RP-3.061
wark-3.059
chev-2.956
Salt-2.931
TokenCivil
Token ID32610
Feature activation-7.29e-01

Pos logit contributions
 opera+2.853
reads+2.703
 misleading+2.509
=~+2.420
 wra+2.420
Neg logit contributions
leon-1.772
RP-1.761
wark-1.748
chev-1.698
Meat-1.635
Token Beat
Token ID12568
Feature activation-1.09e+00

Pos logit contributions
 opera+6.162
reads+5.797
 misleading+5.350
=~+5.199
 asbestos+5.166
Neg logit contributions
chev-2.689
RP-2.656
leon-2.639
wark-2.604
soDeliveryDate-2.515
Token
Token ID198
Feature activation-5.06e-01

Pos logit contributions
 opera+6.066
reads+5.568
 misleading+5.016
 wra+4.845
 asbestos+4.728
Neg logit contributions
RP-7.065
leon-6.997
wark-6.872
chev-6.749
Salt-6.729
Token
Token ID198
Feature activation-2.67e-02

Pos logit contributions
 opera+2.827
reads+2.630
 misleading+2.382
 wra+2.241
=~+2.231
Neg logit contributions
RP-3.324
wark-3.305
leon-3.285
chev-3.229
Salt-3.175
TokenGood
Token ID10248
Feature activation-8.62e-02

Pos logit contributions
 opera+0.206
reads+0.190
 misleading+0.180
 wra+0.174
 glim+0.173
Neg logit contributions
RP-0.128
leon-0.126
wark-0.123
Salt-0.121
chev-0.120
Tokenwin
Token ID5404
Feature activation-8.11e-02

Pos logit contributions
 opera+0.675
reads+0.632
 misleading+0.595
 wra+0.574
=~+0.570
Neg logit contributions
RP-0.396
leon-0.390
wark-0.385
chev-0.372
Salt-0.370
Token says
Token ID1139
Feature activation-1.24e-01

Pos logit contributions
 opera+0.532
reads+0.490
 misleading+0.456
 wra+0.436
=~+0.435
Neg logit contributions
RP-0.474
leon-0.469
wark-0.464
chev-0.453
Salt-0.450
Token)
Token ID8
Feature activation-1.03e-01

Pos logit contributions
 opera+1.721
reads+1.585
 misleading+1.460
 wra+1.392
=~+1.386
Neg logit contributions
RP-1.698
leon-1.683
wark-1.671
chev-1.636
Salt-1.619
Token Mem
Token ID4942
Feature activation-3.74e-01

Pos logit contributions
 opera+0.802
reads+0.746
 misleading+0.706
 wra+0.681
=~+0.676
Neg logit contributions
RP-0.481
leon-0.472
wark-0.466
chev-0.450
Salt-0.449
Tokenc
Token ID66
Feature activation+1.06e-03

Pos logit contributions
 opera+2.668
reads+2.526
 misleading+2.310
=~+2.233
 wra+2.228
Neg logit contributions
RP-1.934
leon-1.900
wark-1.886
Salt-1.818
Meat-1.812
Tokenpy
Token ID9078
Feature activation-1.13e-01

Pos logit contributions
RP+0.006
leon+0.006
wark+0.006
chev+0.006
Salt+0.006
Neg logit contributions
 opera-0.007
reads-0.007
 misleading-0.006
=~-0.006
 wra-0.006
TokenIn
Token ID818
Feature activation+4.74e-01

Pos logit contributions
 opera+0.843
reads+0.786
 misleading+0.738
 wra+0.711
=~+0.708
Neg logit contributions
RP-0.553
leon-0.547
wark-0.540
chev-0.524
Salt-0.520
TokenMemory
Token ID30871
Feature activation-1.08e+00

Pos logit contributions
RP+1.366
leon+1.296
wark+1.258
Salt+1.222
Meat+1.197
Neg logit contributions
 opera-4.568
reads-4.274
 misleading-4.115
 glim-4.028
 wra-4.027
Token 4
Token ID604
Feature activation-5.43e-01

Pos logit contributions
 opera+6.383
reads+5.875
 misleading+5.410
 wra+5.168
=~+5.135
Neg logit contributions
leon-6.628
RP-6.583
wark-6.477
chev-6.440
soDeliveryDate-6.353
Token66
Token ID2791
Feature activation-3.15e-01

Pos logit contributions
 opera+2.589
reads+2.292
 misleading+2.074
 wra+1.950
=~+1.940
Neg logit contributions
RP-4.116
leon-4.082
wark-4.048
chev-3.989
Salt-3.963
Token284
Token ID30336
Feature activation-7.58e-01

Pos logit contributions
 opera+1.871
reads+1.708
 misleading+1.576
 wra+1.500
=~+1.487
Neg logit contributions
RP-2.034
leon-2.023
wark-2.001
chev-1.958
Salt-1.948
Token 4
Token ID604
Feature activation-5.86e-01

Pos logit contributions
 opera+4.329
reads+3.951
 misleading+3.638
 wra+3.460
 galaxies+3.374
Neg logit contributions
leon-4.975
RP-4.919
wark-4.892
chev-4.788
Salt-4.735
Token64
Token ID2414
Feature activation+3.16e-02

Pos logit contributions
 opera+2.448
reads+2.147
 misleading+1.870
 wra+1.743
=~+1.723
Neg logit contributions
RP-4.772
leon-4.730
wark-4.720
chev-4.632
Salt-4.583
Token prison
Token ID3770
Feature activation-2.23e-02

Pos logit contributions
RP+0.503
wark+0.500
leon+0.499
Salt+0.483
chev+0.481
Neg logit contributions
 opera-0.500
reads-0.471
 misleading-0.424
 glim-0.411
=~-0.411
Token MORE
Token ID12011
Feature activation-6.73e-02

Pos logit contributions
 opera+0.152
reads+0.141
 misleading+0.130
 glim+0.126
 wra+0.125
Neg logit contributions
RP-0.125
leon-0.124
wark-0.122
chev-0.119
Salt-0.119
Token and
Token ID290
Feature activation+8.45e-02

Pos logit contributions
 opera+0.440
reads+0.407
 misleading+0.377
 wra+0.362
=~+0.361
Neg logit contributions
RP-0.395
leon-0.389
wark-0.386
chev-0.376
Salt-0.372
Token Hillary
Token ID5073
Feature activation-4.25e-01

Pos logit contributions
leon+0.502
RP+0.502
wark+0.492
chev+0.480
Salt+0.478
Neg logit contributions
 opera-0.530
reads-0.497
 misleading-0.452
=~-0.440
 glim-0.439
Token Clinton
Token ID2605
Feature activation-4.82e-02

Pos logit contributions
 opera+2.750
reads+2.497
 misleading+2.367
 wra+2.258
 asbestos+2.211
Neg logit contributions
RP-2.529
leon-2.493
wark-2.438
chev-2.418
Salt-2.409
Token Hillary
Token ID5073
Feature activation-1.08e+00

Pos logit contributions
 opera+0.296
reads+0.271
 misleading+0.251
 wra+0.239
=~+0.237
Neg logit contributions
RP-0.303
leon-0.300
wark-0.297
chev-0.290
Salt-0.289
Token Diane
Token ID26542
Feature activation-6.89e-01

Pos logit contributions
 opera+3.860
reads+3.081
 misleading+2.880
 wra+2.510
 asbestos+2.394
Neg logit contributions
RP-9.422
leon-9.228
chev-9.153
plet-9.086
wark-9.076
Token Rodham
Token ID41848
Feature activation-6.54e-01

Pos logit contributions
 opera+3.694
reads+3.321
 misleading+3.130
 wra+2.932
 asbestos+2.820
Neg logit contributions
RP-4.940
leon-4.657
wark-4.577
Salt-4.572
Meat-4.561
Token Clinton
Token ID2605
Feature activation-7.65e-01

Pos logit contributions
 opera+4.202
reads+3.766
 misleading+3.597
 wra+3.433
=~+3.423
Neg logit contributions
RP-3.932
leon-3.749
wark-3.738
chev-3.714
Salt-3.669
TokenSanders
Token ID26747
Feature activation-5.53e-01

Pos logit contributions
 opera+5.411
reads+5.039
 misleading+4.702
 wra+4.529
=~+4.492
Neg logit contributions
RP-4.012
wark-3.905
leon-3.879
chev-3.799
Salt-3.728
Token:
Token ID25
Feature activation-3.48e-01

Pos logit contributions
 opera+4.281
reads+4.015
 misleading+3.758
=~+3.662
 glim+3.649
Neg logit contributions
RP-2.556
leon-2.536
wark-2.493
chev-2.402
Salt-2.380
Token.
Token ID13
Feature activation+1.61e-01

Pos logit contributions
 opera+0.964
reads+0.894
 misleading+0.839
 wra+0.804
=~+0.800
Neg logit contributions
RP-0.718
leon-0.709
wark-0.703
chev-0.685
Salt-0.679
Token The
Token ID383
Feature activation-1.44e-01

Pos logit contributions
RP+0.868
leon+0.857
wark+0.844
Salt+0.820
chev+0.820
Neg logit contributions
 opera-1.133
reads-1.048
 misleading-0.982
 wra-0.947
 glim-0.946
Token new
Token ID649
Feature activation-2.16e-01

Pos logit contributions
 opera+0.604
reads+0.533
 misleading+0.471
=~+0.436
 wra+0.433
Neg logit contributions
RP-1.181
leon-1.171
wark-1.162
chev-1.143
Salt-1.137
Token vehicle
Token ID4038
Feature activation-5.86e-01

Pos logit contributions
 opera+1.218
reads+1.104
 misleading+1.017
 wra+0.965
=~+0.955
Neg logit contributions
RP-1.463
leon-1.449
wark-1.436
chev-1.404
Salt-1.399
Token E
Token ID412
Feature activation-4.97e-01

Pos logit contributions
 opera+3.378
reads+3.083
 misleading+2.812
 wra+2.687
=~+2.678
Neg logit contributions
RP-3.858
wark-3.785
leon-3.782
chev-3.679
Salt-3.672
Tokenct
Token ID310
Feature activation-1.08e+00

Pos logit contributions
 opera+3.591
reads+3.369
 misleading+3.112
=~+2.984
 wra+2.979
Neg logit contributions
RP-2.532
leon-2.443
wark-2.424
Salt-2.355
Meat-2.341
Tokeno
Token ID78
Feature activation-7.51e-01

Pos logit contributions
 opera+4.629
reads+4.127
 misleading+3.415
 wra+3.279
=~+3.255
Neg logit contributions
RP-8.284
wark-8.202
Salt-8.099
Meat-8.070
leon-8.063
Token-
Token ID12
Feature activation-5.16e-01

Pos logit contributions
 opera+4.107
reads+3.772
 misleading+3.428
=~+3.237
 wra+3.224
Neg logit contributions
wark-4.950
RP-4.921
leon-4.920
chev-4.796
Salt-4.728
Token1
Token ID16
Feature activation-6.66e-01

Pos logit contributions
 opera+3.048
reads+2.859
 misleading+2.582
=~+2.470
 wra+2.436
Neg logit contributions
wark-3.209
leon-3.205
RP-3.171
chev-3.109
plet-3.049
Token was
Token ID373
Feature activation-3.30e-01

Pos logit contributions
 opera+3.292
reads+2.910
 misleading+2.659
 wra+2.506
 galaxies+2.471
Neg logit contributions
RP-4.812
leon-4.808
wark-4.799
Salt-4.680
Meat-4.679
Token also
Token ID635
Feature activation-6.17e-02

Pos logit contributions
 opera+2.752
reads+2.553
 misleading+2.448
 wra+2.371
=~+2.333
Neg logit contributions
RP-1.343
leon-1.324
wark-1.311
Salt-1.250
chev-1.248
Token.
Token ID13
Feature activation-3.02e-01

Pos logit contributions
 opera+0.613
reads+0.569
 misleading+0.535
 wra+0.514
=~+0.511
Neg logit contributions
RP-0.435
leon-0.429
wark-0.424
chev-0.412
Salt-0.409
Token Author
Token ID6434
Feature activation-4.89e-01

Pos logit contributions
 opera+1.827
reads+1.670
 misleading+1.547
 wra+1.474
=~+1.462
Neg logit contributions
RP-1.926
leon-1.906
wark-1.887
chev-1.843
Salt-1.836
Token of
Token ID286
Feature activation-3.12e-01

Pos logit contributions
 opera+2.802
reads+2.551
 misleading+2.366
 wra+2.222
=~+2.188
Neg logit contributions
RP-3.191
leon-3.181
wark-3.180
chev-3.110
Salt-3.065
Token The
Token ID383
Feature activation-4.01e-01

Pos logit contributions
 opera+2.145
reads+1.984
 misleading+1.853
 wra+1.779
=~+1.764
Neg logit contributions
RP-1.725
leon-1.706
wark-1.684
chev-1.643
Salt-1.632
Token Far
Token ID6755
Feature activation-2.81e-01

Pos logit contributions
 opera+2.943
reads+2.739
 misleading+2.568
 wra+2.460
 galaxies+2.437
Neg logit contributions
RP-2.013
leon-1.980
wark-1.951
chev-1.923
Salt-1.870
Tokenrah
Token ID11392
Feature activation-1.08e+00

Pos logit contributions
 opera+1.624
reads+1.529
 misleading+1.383
 wra+1.305
=~+1.305
Neg logit contributions
RP-1.823
leon-1.794
wark-1.791
chev-1.745
Meat-1.737
Token W
Token ID370
Feature activation-4.52e-01

Pos logit contributions
 opera+6.718
reads+6.370
 misleading+5.836
 wra+5.511
 galaxies+5.339
Neg logit contributions
RP-6.169
leon-6.152
chev-5.990
wark-5.904
Salt-5.885
Tokenether
Token ID6750
Feature activation-5.09e-01

Pos logit contributions
 opera+3.554
reads+3.395
 misleading+3.152
 wra+3.046
 galaxies+3.016
Neg logit contributions
RP-2.012
leon-1.953
wark-1.929
Salt-1.866
chev-1.863
Tokens
Token ID82
Feature activation-2.14e-01

Pos logit contributions
 opera+3.961
reads+3.796
 misleading+3.491
 wra+3.398
 galaxies+3.356
Neg logit contributions
leon-2.243
RP-2.241
wark-2.226
chev-2.110
Meat-2.081
Token Mysteries
Token ID28298
Feature activation-4.10e-01

Pos logit contributions
 opera+1.378
reads+1.265
 misleading+1.179
 wra+1.129
=~+1.122
Neg logit contributions
RP-1.280
leon-1.265
wark-1.248
chev-1.220
Salt-1.217
Token,
Token ID11
Feature activation-1.02e-03

Pos logit contributions
 opera+2.544
reads+2.342
 misleading+2.159
 wra+2.052
=~+2.038
Neg logit contributions
RP-2.498
leon-2.481
wark-2.472
chev-2.424
Salt-2.391
Token a
Token ID257
Feature activation+5.59e-03

Pos logit contributions
RP+0.280
leon+0.277
wark+0.274
chev+0.266
Salt+0.266
Neg logit contributions
 opera-0.329
reads-0.304
 misleading-0.283
 wra-0.272
 glim-0.271
Token home
Token ID1363
Feature activation+5.66e-02

Pos logit contributions
RP+0.036
leon+0.035
wark+0.035
chev+0.034
Salt+0.034
Neg logit contributions
 opera-0.034
reads-0.031
 misleading-0.028
 wra-0.027
=~-0.027
Token for
Token ID329
Feature activation-5.32e-01

Pos logit contributions
RP+0.291
leon+0.288
wark+0.284
chev+0.278
Salt+0.275
Neg logit contributions
 opera-0.412
reads-0.382
 misleading-0.361
 wra-0.346
=~-0.342
Token Sword
Token ID11535
Feature activation-4.75e-01

Pos logit contributions
 opera+3.044
reads+2.769
 misleading+2.561
 wra+2.433
=~+2.391
Neg logit contributions
RP-3.495
leon-3.453
chev-3.411
wark-3.386
Salt-3.344
Token Base
Token ID7308
Feature activation-6.61e-01

Pos logit contributions
 opera+2.562
reads+2.318
 misleading+2.096
 wra+1.980
=~+1.956
Neg logit contributions
RP-3.258
leon-3.240
wark-3.221
chev-3.196
Salt-3.145
Token.
Token ID13
Feature activation-1.08e+00

Pos logit contributions
 opera+4.103
reads+3.873
 misleading+3.541
=~+3.369
 wra+3.333
Neg logit contributions
wark-3.806
RP-3.790
leon-3.762
chev-3.748
Meat-3.679
Token
Token ID198
Feature activation-2.40e-01

Pos logit contributions
 opera+5.298
reads+4.973
 misleading+4.358
 galaxies+4.045
 wra+4.044
Neg logit contributions
RP-7.321
soDeliveryDate-7.252
leon-7.231
plet-7.218
wark-7.216
Token
Token ID198
Feature activation-1.77e-03

Pos logit contributions
 opera+1.448
reads+1.333
 misleading+1.229
 wra+1.165
=~+1.156
Neg logit contributions
RP-1.507
leon-1.497
wark-1.491
chev-1.458
Salt-1.437
TokenEditor
Token ID17171
Feature activation+1.85e-01

Pos logit contributions
 opera+0.014
reads+0.013
 misleading+0.012
 wra+0.012
 glim+0.011
Neg logit contributions
RP-0.009
leon-0.008
wark-0.008
Salt-0.008
Meat-0.008
Token's
Token ID338
Feature activation+5.60e-01

Pos logit contributions
RP+0.940
leon+0.915
wark+0.907
Salt+0.880
chev+0.874
Neg logit contributions
 opera-1.367
reads-1.268
 misleading-1.186
=~-1.145
 wra-1.144
Token Note
Token ID5740
Feature activation+4.25e-01

Pos logit contributions
RP+3.074
wark+2.965
leon+2.938
Salt+2.862
Token+2.835
Neg logit contributions
 opera-3.800
reads-3.481
 misleading-3.221
=~-3.186
 asbestos-3.129
Token on
Token ID319
Feature activation-1.88e-01

Pos logit contributions
RP+0.338
leon+0.334
wark+0.331
chev+0.323
Salt+0.321
Neg logit contributions
 opera-0.454
reads-0.421
 misleading-0.395
 wra-0.379
=~-0.375
Token Facebook
Token ID3203
Feature activation+1.86e-01

Pos logit contributions
 opera+0.968
reads+0.879
 misleading+0.800
 wra+0.745
 galaxies+0.731
Neg logit contributions
RP-1.358
leon-1.351
wark-1.337
chev-1.321
Salt-1.307
Token Share
Token ID8734
Feature activation+5.29e-01

Pos logit contributions
RP+1.107
leon+1.094
wark+1.078
Salt+1.051
chev+1.051
Neg logit contributions
 opera-1.204
reads-1.105
 misleading-1.029
 glim-0.990
 wra-0.990
Token this
Token ID428
Feature activation+1.25e-01

Pos logit contributions
RP+2.471
leon+2.452
wark+2.374
chev+2.327
Salt+2.293
Neg logit contributions
 opera-4.031
reads-3.764
 glim-3.534
 misleading-3.524
=~-3.453
Token comment
Token ID2912
Feature activation+3.32e-01

Pos logit contributions
RP+0.822
leon+0.815
wark+0.804
chev+0.788
Salt+0.783
Neg logit contributions
 opera-0.724
reads-0.663
 misleading-0.607
=~-0.578
 wra-0.577
Token on
Token ID319
Feature activation-1.07e+00

Pos logit contributions
RP+1.646
leon+1.625
wark+1.599
chev+1.550
Salt+1.538
Neg logit contributions
 opera-2.474
reads-2.303
 misleading-2.161
 glim-2.114
 wra-2.089
Token Twitter
Token ID3009
Feature activation-2.98e-02

Pos logit contributions
 opera+3.094
reads+2.522
 misleading+2.171
 wra+1.821
 galaxies+1.788
Neg logit contributions
leon-9.807
RP-9.785
chev-9.711
wark-9.646
Meat-9.630
Token
Token ID198
Feature activation+3.01e-01

Pos logit contributions
 opera+0.193
reads+0.178
 misleading+0.166
 wra+0.159
=~+0.158
Neg logit contributions
RP-0.177
leon-0.175
wark-0.173
chev-0.169
Salt-0.168
Token
Token ID198
Feature activation+1.82e-01

Pos logit contributions
RP+1.724
leon+1.704
wark+1.671
Salt+1.634
chev+1.627
Neg logit contributions
 opera-2.024
reads-1.852
 misleading-1.737
 glim-1.727
 wra-1.675
TokenNY
Token ID12805
Feature activation-1.46e-01

Pos logit contributions
RP+0.632
leon+0.614
wark+0.589
Salt+0.580
chev+0.561
Neg logit contributions
 opera-1.647
reads-1.536
 misleading-1.474
 wra-1.439
 asbestos-1.427
TokenT
Token ID51
Feature activation+4.05e-01

Pos logit contributions
 opera+1.065
reads+0.995
 misleading+0.928
 wra+0.887
=~+0.887
Neg logit contributions
leon-0.737
RP-0.729
wark-0.729
chev-0.712
Salt-0.689
Token-
Token ID12
Feature activation-3.15e-01

Pos logit contributions
 opera+0.968
reads+0.890
 misleading+0.819
 wra+0.783
=~+0.775
Neg logit contributions
RP-0.987
leon-0.971
wark-0.967
Salt-0.945
chev-0.941
Tokenpre
Token ID3866
Feature activation+1.22e-01

Pos logit contributions
 opera+2.611
reads+2.487
 misleading+2.296
=~+2.233
 wra+2.233
Neg logit contributions
RP-1.277
wark-1.223
leon-1.208
Salt-1.201
Meat-1.192
Tokeny
Token ID88
Feature activation+8.60e-03

Pos logit contributions
RP+0.401
leon+0.389
wark+0.382
Salt+0.367
chev+0.364
Neg logit contributions
 opera-1.112
reads-1.051
 misleading-0.995
 wra-0.966
=~-0.963
Token-
Token ID12
Feature activation-3.84e-02

Pos logit contributions
RP+0.050
leon+0.049
wark+0.048
chev+0.047
Salt+0.047
Neg logit contributions
 opera-0.057
reads-0.053
 misleading-0.049
 wra-0.047
=~-0.047
Tokencry
Token ID20470
Feature activation-6.12e-02

Pos logit contributions
 opera+0.279
reads+0.257
 misleading+0.244
 wra+0.235
 glim+0.233
Neg logit contributions
RP-0.198
leon-0.198
wark-0.194
chev-0.190
Salt-0.187
Tokenstal
Token ID7757
Feature activation-1.07e+00

Pos logit contributions
 opera+0.422
reads+0.393
 misleading+0.367
 wra+0.350
=~+0.349
Neg logit contributions
leon-0.335
RP-0.335
wark-0.329
chev-0.323
Salt-0.319
Token-
Token ID12
Feature activation-8.31e-01

Pos logit contributions
 opera+6.068
reads+5.458
 misleading+4.889
 wra+4.798
=~+4.696
Neg logit contributions
RP-7.008
leon-6.701
Salt-6.694
wark-6.652
Meat-6.618
Tokenemb
Token ID24419
Feature activation-4.79e-02

Pos logit contributions
 opera+4.227
reads+3.855
 misleading+3.360
 wra+3.255
=~+3.227
Neg logit contributions
RP-5.984
leon-5.765
Salt-5.764
wark-5.745
Meat-5.722
Tokenry
Token ID563
Feature activation-4.41e-01

Pos logit contributions
 opera+0.358
reads+0.332
 misleading+0.313
 wra+0.301
=~+0.299
Neg logit contributions
RP-0.238
leon-0.235
wark-0.232
chev-0.225
Salt-0.224
Tokeno
Token ID78
Feature activation-1.66e-01

Pos logit contributions
 opera+2.314
reads+2.078
 misleading+1.885
 wra+1.793
=~+1.776
Neg logit contributions
RP-3.159
leon-3.101
wark-3.084
Salt-3.027
chev-3.022
Token-
Token ID12
Feature activation-7.83e-02

Pos logit contributions
 opera+1.136
reads+1.050
 misleading+0.980
 wra+0.941
=~+0.934
Neg logit contributions
RP-0.925
leon-0.913
wark-0.903
chev-0.878
Salt-0.875
Token Which
Token ID9022
Feature activation+9.81e-03

Pos logit contributions
 opera+1.539
reads+1.420
 misleading+1.323
 wra+1.267
=~+1.261
Neg logit contributions
RP-1.325
leon-1.307
wark-1.290
chev-1.261
Salt-1.258
Token gives
Token ID3607
Feature activation+5.68e-02

Pos logit contributions
RP+0.058
leon+0.058
wark+0.057
chev+0.056
Salt+0.056
Neg logit contributions
 opera-0.063
reads-0.058
 misleading-0.054
 wra-0.051
=~-0.051
Token us
Token ID514
Feature activation+1.42e-01

Pos logit contributions
RP+0.324
leon+0.319
wark+0.315
chev+0.307
Salt+0.307
Neg logit contributions
 opera-0.378
reads-0.350
 misleading-0.323
=~-0.311
 wra-0.311
Token A
Token ID317
Feature activation-2.06e-01

Pos logit contributions
RP+0.792
leon+0.778
wark+0.775
chev+0.757
Salt+0.752
Neg logit contributions
 opera-0.957
reads-0.889
 misleading-0.821
=~-0.790
 wra-0.789
Token
Token ID1906
Feature activation-8.27e-02

Pos logit contributions
 opera+1.128
reads+1.024
 misleading+0.937
 wra+0.885
 galaxies+0.871
Neg logit contributions
RP-1.428
leon-1.412
wark-1.408
chev-1.379
Salt-1.364
TokenB
Token ID33
Feature activation-1.06e+00

Pos logit contributions
 opera+0.754
reads+0.710
 misleading+0.677
 wra+0.658
=~+0.652
Neg logit contributions
RP-0.277
leon-0.269
wark-0.264
chev-0.253
Salt-0.252
TokenOM
Token ID2662
Feature activation-1.47e-01

Pos logit contributions
 opera+6.972
reads+6.371
 misleading+5.938
=~+5.712
 wra+5.710
Neg logit contributions
leon-6.027
RP-5.975
wark-5.837
Salt-5.753
chev-5.652
TokenB
Token ID33
Feature activation-4.21e-01

Pos logit contributions
 opera+1.374
reads+1.297
 misleading+1.236
 wra+1.201
=~+1.196
Neg logit contributions
RP-0.454
leon-0.446
wark-0.435
chev-0.414
Salt-0.411
Token at
Token ID379
Feature activation-2.21e-01

Pos logit contributions
 opera+2.813
reads+2.595
 misleading+2.420
 wra+2.319
=~+2.301
Neg logit contributions
RP-2.416
leon-2.389
wark-2.362
chev-2.303
Salt-2.290
Token 11
Token ID1367
Feature activation+2.51e-02

Pos logit contributions
 opera+1.463
reads+1.352
 misleading+1.260
 wra+1.204
=~+1.198
Neg logit contributions
RP-1.270
leon-1.252
wark-1.242
chev-1.214
Salt-1.206
Tokena
Token ID64
Feature activation+1.11e-01

Pos logit contributions
RP+0.148
leon+0.144
wark+0.144
chev+0.140
Salt+0.139
Neg logit contributions
 opera-0.164
reads-0.152
 misleading-0.141
 wra-0.134
=~-0.133
TokenDemocratic
Token ID33939
Feature activation+6.78e-02

Pos logit contributions
 opera+0.027
reads+0.025
 misleading+0.023
 wra+0.023
 glim+0.022
Neg logit contributions
RP-0.017
leon-0.016
wark-0.016
Salt-0.016
chev-0.016
Token presidential
Token ID4787
Feature activation-2.85e-02

Pos logit contributions
RP+0.436
leon+0.429
wark+0.425
Salt+0.416
chev+0.415
Neg logit contributions
 opera-0.397
reads-0.366
 misleading-0.336
=~-0.325
 wra-0.320
Token nominee
Token ID10429
Feature activation-3.21e-01

Pos logit contributions
 opera+0.194
reads+0.181
 misleading+0.168
=~+0.163
 glim+0.161
Neg logit contributions
leon-0.156
RP-0.156
wark-0.152
chev-0.149
Salt-0.148
Token Hillary
Token ID5073
Feature activation-3.23e-01

Pos logit contributions
 opera+1.981
reads+1.816
 misleading+1.683
 wra+1.606
=~+1.594
Neg logit contributions
RP-2.002
leon-1.983
wark-1.962
chev-1.916
Salt-1.907
Token Clinton
Token ID2605
Feature activation+5.72e-02

Pos logit contributions
 opera+2.177
reads+1.996
 misleading+1.879
 wra+1.798
=~+1.768
Neg logit contributions
RP-1.843
leon-1.820
wark-1.789
chev-1.756
Salt-1.749
Token Hillary
Token ID5073
Feature activation-1.06e+00

Pos logit contributions
RP+0.333
leon+0.331
wark+0.330
chev+0.322
Salt+0.318
Neg logit contributions
 opera-0.373
reads-0.345
 misleading-0.318
=~-0.308
 wra-0.306
Token Diane
Token ID26542
Feature activation-8.23e-01

Pos logit contributions
 opera+4.018
reads+3.328
 misleading+3.073
 wra+2.759
 asbestos+2.636
Neg logit contributions
RP-8.974
leon-8.827
chev-8.787
wark-8.749
Salt-8.549
Token Rodham
Token ID41848
Feature activation-7.69e-01

Pos logit contributions
 opera+4.401
reads+4.007
 misleading+3.721
 wra+3.528
 asbestos+3.406
Neg logit contributions
RP-5.839
leon-5.517
wark-5.433
Salt-5.400
chev-5.396
Token Clinton
Token ID2605
Feature activation-8.45e-01

Pos logit contributions
 opera+4.963
reads+4.468
 misleading+4.242
 wra+4.080
=~+4.045
Neg logit contributions
RP-4.560
leon-4.381
wark-4.367
chev-4.329
Salt-4.285
TokenSanders
Token ID26747
Feature activation-6.50e-01

Pos logit contributions
 opera+5.970
reads+5.575
 misleading+5.194
 wra+5.005
=~+4.972
Neg logit contributions
RP-4.379
wark-4.294
leon-4.255
chev-4.179
Salt-4.076
Token:
Token ID25
Feature activation-4.08e-01

Pos logit contributions
 opera+4.967
reads+4.633
 misleading+4.356
 wra+4.206
=~+4.193
Neg logit contributions
RP-3.108
leon-3.072
wark-3.028
chev-2.933
Salt-2.912
Token everything
Token ID2279
Feature activation-4.51e-02

Pos logit contributions
 opera+0.872
reads+0.794
 misleading+0.718
 wra+0.680
 glim+0.677
Neg logit contributions
RP-1.135
leon-1.122
wark-1.113
chev-1.091
Salt-1.084
Token,
Token ID11
Feature activation+1.64e-02

Pos logit contributions
 opera+0.304
reads+0.280
 misleading+0.261
 wra+0.251
=~+0.249
Neg logit contributions
RP-0.256
leon-0.253
wark-0.250
chev-0.244
Salt-0.243
Token how
Token ID703
Feature activation-3.33e-01

Pos logit contributions
RP+0.102
leon+0.101
wark+0.099
chev+0.098
Salt+0.096
Neg logit contributions
 opera-0.101
reads-0.093
 misleading-0.085
 glim-0.083
=~-0.082
Token can
Token ID460
Feature activation-1.64e-01

Pos logit contributions
 opera+2.082
reads+1.893
 misleading+1.769
 wra+1.692
=~+1.670
Neg logit contributions
RP-2.063
leon-2.036
wark-2.021
Salt-1.967
chev-1.966
Token it
Token ID340
Feature activation-6.16e-01

Pos logit contributions
 opera+1.112
reads+1.022
 misleading+0.961
 wra+0.921
=~+0.913
Neg logit contributions
RP-0.922
leon-0.912
wark-0.904
chev-0.877
Salt-0.874
Token be
Token ID307
Feature activation-1.05e+00

Pos logit contributions
 opera+3.157
reads+2.777
 misleading+2.557
 wra+2.424
 galaxies+2.376
Neg logit contributions
RP-4.466
leon-4.456
wark-4.417
Salt-4.326
chev-4.307
Token expanding
Token ID11581
Feature activation-3.34e-01

Pos logit contributions
 opera+4.049
reads+3.380
 misleading+3.220
 wra+2.832
 galaxies+2.828
Neg logit contributions
leon-8.875
RP-8.858
wark-8.718
chev-8.597
Salt-8.562
Token?
Token ID30
Feature activation-2.68e-01

Pos logit contributions
 opera+2.103
reads+1.929
 misleading+1.794
 wra+1.711
=~+1.701
Neg logit contributions
RP-2.039
leon-2.016
wark-1.999
chev-1.948
Salt-1.942
Token Like
Token ID4525
Feature activation+8.10e-02

Pos logit contributions
 opera+1.546
reads+1.410
 misleading+1.295
 wra+1.228
=~+1.223
Neg logit contributions
RP-1.782
leon-1.771
wark-1.752
chev-1.718
Salt-1.704
Token
Token ID1399
Feature activation+2.02e-01

Pos logit contributions
RP+0.410
leon+0.405
wark+0.398
chev+0.390
Salt+0.382
Neg logit contributions
 opera-0.594
reads-0.556
 misleading-0.517
 wra-0.498
=~-0.497
Token what
Token ID644
Feature activation-2.89e-01

Pos logit contributions
RP+1.114
leon+1.107
wark+1.073
chev+1.061
Meat+1.042
Neg logit contributions
 opera-1.366
reads-1.288
 misleading-1.175
=~-1.139
 wra-1.130
Tokenname
Token ID3672
Feature activation+1.65e-01

Pos logit contributions
RP+0.741
leon+0.731
wark+0.701
chev+0.700
Salt+0.686
Neg logit contributions
 opera-1.516
reads-1.400
 misleading-1.347
 glim-1.304
 asbestos-1.301
Token">
Token ID23984
Feature activation-5.28e-02

Pos logit contributions
leon+0.899
RP+0.894
wark+0.866
chev+0.863
Salt+0.855
Neg logit contributions
 opera-1.137
reads-1.059
 misleading-0.983
 glim-0.946
 wra-0.945
Tokenspan
Token ID12626
Feature activation+3.54e-01

Pos logit contributions
 opera+0.312
reads+0.285
 misleading+0.263
 wra+0.250
=~+0.248
Neg logit contributions
RP-0.343
leon-0.340
wark-0.337
chev-0.329
Salt-0.329
Token>
Token ID29
Feature activation+1.04e-01

Pos logit contributions
RP+1.770
leon+1.739
wark+1.675
chev+1.655
Salt+1.603
Neg logit contributions
 opera-2.593
reads-2.409
 misleading-2.293
 glim-2.265
 asbestos-2.193
Token <
Token ID1279
Feature activation-3.09e-01

Pos logit contributions
RP+0.699
leon+0.697
wark+0.679
chev+0.670
Salt+0.669
Neg logit contributions
 opera-0.586
reads-0.535
 misleading-0.489
 wra-0.468
 glim-0.466
Tokenp
Token ID79
Feature activation-1.05e+00

Pos logit contributions
 opera+2.899
reads+2.738
 misleading+2.612
 wra+2.536
=~+2.524
Neg logit contributions
RP-0.946
leon-0.926
wark-0.904
chev-0.863
Salt-0.853
Token ng
Token ID23370
Feature activation-3.12e-01

Pos logit contributions
 opera+6.447
reads+5.887
 misleading+5.489
 wra+5.310
=~+5.210
Neg logit contributions
RP-6.259
leon-6.148
wark-6.148
chev-6.067
Meat-6.058
Token-
Token ID12
Feature activation-1.48e-01

Pos logit contributions
 opera+1.921
reads+1.768
 misleading+1.626
 wra+1.557
=~+1.545
Neg logit contributions
RP-1.933
leon-1.921
wark-1.913
chev-1.852
Salt-1.849
Tokenbind
Token ID21653
Feature activation+1.03e-02

Pos logit contributions
 opera+1.066
reads+0.985
 misleading+0.929
 wra+0.894
 glim+0.885
Neg logit contributions
RP-0.774
leon-0.765
wark-0.754
chev-0.736
Salt-0.729
Token="
Token ID2625
Feature activation+2.07e-01

Pos logit contributions
RP+0.086
leon+0.085
wark+0.084
Salt+0.083
chev+0.083
Neg logit contributions
 opera-0.042
reads-0.036
 misleading-0.033
 wra-0.030
 galaxies-0.029
Tokenphone
Token ID4862
Feature activation+4.86e-01

Pos logit contributions
RP+0.718
leon+0.713
wark+0.680
chev+0.672
Salt+0.650
Neg logit contributions
 opera-1.860
reads-1.739
 misleading-1.669
 glim-1.619
 wra-1.618
Token
Token ID198
Feature activation+6.20e-01

Pos logit contributions
RP+1.646
leon+1.629
wark+1.600
chev+1.564
Salt+1.561
Neg logit contributions
 opera-1.892
reads-1.730
 misleading-1.623
 glim-1.598
 wra-1.561
TokenDate
Token ID10430
Feature activation-1.90e-01

Pos logit contributions
RP+1.697
leon+1.603
Salt+1.555
wark+1.539
Meat+1.513
Neg logit contributions
 opera-6.037
reads-5.661
 misleading-5.460
 glim-5.399
 wra-5.349
Token:
Token ID25
Feature activation-5.26e-02

Pos logit contributions
 opera+1.325
reads+1.233
 misleading+1.152
 wra+1.100
=~+1.098
Neg logit contributions
RP-1.021
leon-1.011
wark-1.004
chev-0.975
Salt-0.962
Token Nov
Token ID5267
Feature activation+1.32e-01

Pos logit contributions
 opera+0.389
reads+0.362
 misleading+0.340
 wra+0.327
=~+0.326
Neg logit contributions
RP-0.264
leon-0.261
wark-0.257
chev-0.250
Salt-0.249
Token.
Token ID13
Feature activation+1.91e-01

Pos logit contributions
RP+0.679
leon+0.670
wark+0.661
chev+0.642
Salt+0.639
Neg logit contributions
 opera-0.966
reads-0.897
 misleading-0.842
 wra-0.811
=~-0.805
Token 24
Token ID1987
Feature activation+6.99e-01

Pos logit contributions
RP+1.124
leon+1.110
wark+1.093
chev+1.068
Salt+1.066
Neg logit contributions
 opera-1.249
reads-1.149
 misleading-1.073
 glim-1.043
 wra-1.028
Token,
Token ID11
Feature activation+2.27e-01

Pos logit contributions
RP+3.546
leon+3.469
wark+3.384
Salt+3.323
chev+3.301
Neg logit contributions
 opera-5.044
reads-4.666
 glim-4.581
 misleading-4.402
=~-4.251
Token 1944
Token ID16994
Feature activation+1.15e-01

Pos logit contributions
RP+1.225
leon+1.202
wark+1.184
Salt+1.157
chev+1.151
Neg logit contributions
 opera-1.580
reads-1.482
 misleading-1.370
 glim-1.353
=~-1.325
Token
Token ID198
Feature activation-1.34e-01

Pos logit contributions
RP+0.678
leon+0.669
wark+0.664
chev+0.649
Salt+0.643
Neg logit contributions
 opera-0.743
reads-0.686
 misleading-0.638
 wra-0.610
=~-0.607
Token
Token ID198
Feature activation+1.05e-02

Pos logit contributions
 opera+0.817
reads+0.754
 misleading+0.695
 wra+0.660
=~+0.656
Neg logit contributions
RP-0.835
leon-0.826
wark-0.823
chev-0.802
Salt-0.795
TokenTre
Token ID31055
Feature activation+1.31e-01

Pos logit contributions
RP+0.048
leon+0.047
wark+0.046
Salt+0.046
chev+0.045
Neg logit contributions
 opera-0.083
reads-0.077
 misleading-0.073
 wra-0.070
 glim-0.070
Token
Token ID246
Feature activation-2.30e-01

Pos logit contributions
 opera+1.856
reads+1.725
 misleading+1.619
 wra+1.559
=~+1.546
Neg logit contributions
RP-1.322
leon-1.304
wark-1.290
chev-1.250
Salt-1.246
TokenWhen
Token ID2215
Feature activation-6.43e-02

Pos logit contributions
 opera+1.681
reads+1.569
 misleading+1.470
 wra+1.413
=~+1.408
Neg logit contributions
RP-1.158
leon-1.148
wark-1.135
chev-1.104
Salt-1.089
Token we
Token ID356
Feature activation-2.59e-01

Pos logit contributions
 opera+0.450
reads+0.420
 misleading+0.389
 wra+0.375
=~+0.374
Neg logit contributions
RP-0.347
leon-0.339
wark-0.338
chev-0.328
Salt-0.327
Token get
Token ID651
Feature activation-9.09e-02

Pos logit contributions
 opera+1.601
reads+1.471
 misleading+1.362
 wra+1.296
=~+1.292
Neg logit contributions
RP-1.614
leon-1.597
wark-1.583
chev-1.545
Salt-1.536
Token the
Token ID262
Feature activation+2.77e-01

Pos logit contributions
 opera+0.579
reads+0.537
 misleading+0.492
=~+0.473
 wra+0.466
Neg logit contributions
RP-0.548
leon-0.537
wark-0.535
chev-0.521
Salt-0.520
Token down
Token ID866
Feature activation+8.12e-01

Pos logit contributions
RP+1.657
wark+1.611
leon+1.606
Salt+1.573
chev+1.569
Neg logit contributions
 opera-1.738
reads-1.628
 misleading-1.471
=~-1.449
 galaxies-1.402
Token payment
Token ID6074
Feature activation+1.12e-01

Pos logit contributions
RP+2.228
wark+2.017
leon+1.943
chev+1.881
Salt+1.839
Neg logit contributions
 opera-7.558
reads-7.319
 misleading-6.901
=~-6.730
 galaxies-6.723
Token we
Token ID356
Feature activation-4.95e-01

Pos logit contributions
RP+0.607
leon+0.591
wark+0.588
chev+0.576
Salt+0.572
Neg logit contributions
 opera-0.770
reads-0.721
 misleading-0.666
=~-0.641
 glim-0.636
Token
Token ID447
Feature activation-1.29e-01

Pos logit contributions
 opera+3.175
reads+2.882
 misleading+2.706
 wra+2.608
=~+2.548
Neg logit contributions
RP-2.964
leon-2.937
wark-2.892
chev-2.823
Salt-2.812
Token
Token ID247
Feature activation+1.66e-01

Pos logit contributions
 opera+0.931
reads+0.864
 misleading+0.811
 wra+0.779
=~+0.774
Neg logit contributions
RP-0.678
leon-0.669
wark-0.661
chev-0.642
Salt-0.639
Tokenre
Token ID260
Feature activation-3.46e-01

Pos logit contributions
RP+0.744
leon+0.736
wark+0.722
chev+0.702
Salt+0.694
Neg logit contributions
 opera-1.312
reads-1.234
 misleading-1.160
 wra-1.113
=~-1.113
Token of
Token ID286
Feature activation+9.27e-01

Pos logit contributions
RP+3.455
leon+3.382
wark+3.335
chev+3.257
Salt+3.237
Neg logit contributions
 opera-3.987
reads-3.679
 misleading-3.410
 glim-3.401
 wra-3.288
Token Youth
Token ID17582
Feature activation+9.01e-01

Pos logit contributions
RP+4.608
leon+4.495
wark+4.356
chev+4.299
Salt+4.171
Neg logit contributions
 opera-6.440
reads-6.086
 glim-5.668
 misleading-5.580
=~-5.517
Token and
Token ID290
Feature activation+5.21e-01

Pos logit contributions
RP+4.571
leon+4.488
wark+4.318
chev+4.304
ptic+4.239
Neg logit contributions
 opera-6.192
reads-5.834
 glim-5.588
 misleading-5.389
 galaxies-5.317
Token Ad
Token ID1215
Feature activation+4.70e-01

Pos logit contributions
leon+2.934
RP+2.909
wark+2.816
chev+2.737
Salt+2.715
Neg logit contributions
 opera-3.493
reads-3.243
 misleading-3.032
 glim-3.018
 wra-2.919
Tokenolesc
Token ID16850
Feature activation+5.86e-01

Pos logit contributions
RP+2.592
leon+2.589
Salt+2.445
wark+2.426
chev+2.423
Neg logit contributions
 opera-3.236
reads-2.892
 glim-2.869
 misleading-2.848
 asbestos-2.706
Tokenence
Token ID594
Feature activation+8.22e-01

Pos logit contributions
RP+1.310
wark+1.238
leon+1.210
Salt+1.095
chev+1.088
Neg logit contributions
 opera-5.952
reads-5.623
 glim-5.433
 misleading-5.426
 wra-5.304
Token,
Token ID11
Feature activation+8.68e-01

Pos logit contributions
RP+4.284
leon+4.170
wark+4.152
chev+4.008
Salt+3.956
Neg logit contributions
 opera-5.793
reads-5.365
 misleading-5.066
 glim-4.909
=~-4.877
Token 39
Token ID5014
Feature activation+1.04e+00

Pos logit contributions
RP+4.623
leon+4.481
wark+4.386
Salt+4.339
Meat+4.249
Neg logit contributions
 opera-5.920
reads-5.484
 glim-5.364
 misleading-5.138
 asbestos-4.957
Token,
Token ID11
Feature activation+3.35e-01

Pos logit contributions
RP+5.109
leon+4.912
wark+4.881
Salt+4.811
chev+4.728
Neg logit contributions
 opera-7.461
reads-6.919
 misleading-6.565
 glim-6.264
 wra-6.227
Token 10
Token ID838
Feature activation+2.61e-01

Pos logit contributions
RP+1.961
leon+1.919
wark+1.895
Salt+1.853
chev+1.836
Neg logit contributions
 opera-2.173
reads-2.025
 misleading-1.870
 glim-1.811
 wra-1.786
Token41
Token ID3901
Feature activation+2.59e-01

Pos logit contributions
RP+1.464
leon+1.442
wark+1.420
Salt+1.383
chev+1.368
Neg logit contributions
 opera-1.772
reads-1.644
 misleading-1.537
 glim-1.502
 wra-1.473
Token $
Token ID720
Feature activation+1.74e-01

Pos logit contributions
RP+0.600
leon+0.584
wark+0.582
chev+0.570
Salt+0.569
Neg logit contributions
 opera-0.562
reads-0.526
 misleading-0.474
=~-0.465
 glim-0.453
Token4
Token ID19
Feature activation-2.05e-01

Pos logit contributions
RP+0.991
leon+0.969
wark+0.954
Salt+0.934
chev+0.928
Neg logit contributions
 opera-1.189
reads-1.086
 misleading-1.026
 glim-1.000
 wra-0.986
Token.
Token ID13
Feature activation+3.88e-02

Pos logit contributions
 opera+1.282
reads+1.182
 misleading+1.095
 wra+1.044
=~+1.039
Neg logit contributions
RP-1.270
leon-1.251
wark-1.238
chev-1.211
Salt-1.207
Token2
Token ID17
Feature activation-5.19e-03

Pos logit contributions
RP+0.191
leon+0.187
wark+0.184
Salt+0.179
chev+0.178
Neg logit contributions
 opera-0.293
reads-0.273
 misleading-0.258
 glim-0.250
 wra-0.248
Token million
Token ID1510
Feature activation+1.66e-01

Pos logit contributions
 opera+0.026
reads+0.024
 misleading+0.022
=~+0.020
 wra+0.020
Neg logit contributions
RP-0.038
leon-0.037
wark-0.037
chev-0.037
Salt-0.037
Token divorce
Token ID13609
Feature activation+8.21e-01

Pos logit contributions
RP+1.043
leon+1.014
wark+1.011
chev+0.986
Salt+0.984
Neg logit contributions
 opera-0.984
reads-0.933
 misleading-0.847
=~-0.823
 glim-0.802
Token settlement
Token ID9443
Feature activation+3.46e-01

Pos logit contributions
RP+4.639
leon+4.325
wark+4.273
Token+4.183
chev+4.182
Neg logit contributions
reads-5.107
 opera-5.106
 misleading-4.458
 galaxies-4.451
=~-4.436
Token from
Token ID422
Feature activation+2.04e-01

Pos logit contributions
RP+1.689
leon+1.619
wark+1.611
chev+1.573
Salt+1.547
Neg logit contributions
 opera-2.568
reads-2.436
 misleading-2.241
=~-2.201
 glim-2.190
Token Musk
Token ID20119
Feature activation+2.25e-01

Pos logit contributions
RP+1.240
leon+1.214
wark+1.204
chev+1.176
Salt+1.174
Neg logit contributions
 opera-1.265
reads-1.194
 misleading-1.084
=~-1.050
 glim-1.049
Token in
Token ID287
Feature activation+1.33e-01

Pos logit contributions
RP+1.185
leon+1.160
wark+1.150
chev+1.126
Salt+1.101
Neg logit contributions
 opera-1.595
reads-1.499
 misleading-1.390
 wra-1.345
 glim-1.344
Token August
Token ID2932
Feature activation-1.57e-01

Pos logit contributions
RP+0.696
leon+0.683
wark+0.681
Salt+0.661
chev+0.659
Neg logit contributions
 opera-0.933
reads-0.879
 misleading-0.805
 glim-0.792
=~-0.788
Token Mexico
Token ID5828
Feature activation-3.07e-01

Pos logit contributions
 opera+0.389
reads+0.364
 misleading+0.338
 glim+0.325
=~+0.324
Neg logit contributions
RP-0.295
leon-0.292
wark-0.286
chev-0.281
Salt-0.279
Token.
Token ID13
Feature activation-4.38e-02

Pos logit contributions
 opera+1.977
reads+1.817
 misleading+1.690
 wra+1.617
=~+1.603
Neg logit contributions
RP-1.842
leon-1.822
wark-1.802
chev-1.759
Salt-1.750
Token
Token ID198
Feature activation+1.47e-02

Pos logit contributions
 opera+0.290
reads+0.266
 misleading+0.249
 glim+0.240
 wra+0.239
Neg logit contributions
RP-0.256
leon-0.252
wark-0.248
Salt-0.244
chev-0.242
Token
Token ID198
Feature activation+5.74e-02

Pos logit contributions
RP+0.088
leon+0.087
wark+0.086
chev+0.084
Salt+0.084
Neg logit contributions
 opera-0.094
reads-0.086
 misleading-0.080
 wra-0.076
=~-0.076
TokenQue
Token ID15681
Feature activation+1.56e-01

Pos logit contributions
RP+0.276
leon+0.271
wark+0.264
Salt+0.261
Meat+0.258
Neg logit contributions
 opera-0.443
reads-0.408
 misleading-0.389
 wra-0.376
 glim-0.374
Tokenens
Token ID641
Feature activation+7.22e-01

Pos logit contributions
leon+0.781
RP+0.774
wark+0.757
chev+0.734
Salt+0.730
Neg logit contributions
 opera-1.164
reads-1.077
 misleading-1.027
 glim-0.988
=~-0.979
Tokenland
Token ID1044
Feature activation+1.23e-02

Pos logit contributions
RP+2.651
wark+2.562
leon+2.466
Salt+2.400
Meat+2.377
Neg logit contributions
 opera-6.078
reads-5.796
 misleading-5.482
 glim-5.369
=~-5.328
Token
Token ID447
Feature activation+2.31e-01

Pos logit contributions
RP+0.074
leon+0.072
wark+0.071
chev+0.070
Salt+0.070
Neg logit contributions
 opera-0.079
reads-0.074
 misleading-0.068
=~-0.065
 wra-0.065
Token
Token ID247
Feature activation-9.42e-02

Pos logit contributions
RP+1.204
leon+1.190
wark+1.154
Salt+1.132
chev+1.128
Neg logit contributions
 opera-1.677
reads-1.561
 misleading-1.444
 wra-1.393
 glim-1.385
Tokens
Token ID82
Feature activation+7.24e-02

Pos logit contributions
 opera+0.641
reads+0.593
 misleading+0.554
 wra+0.531
=~+0.527
Neg logit contributions
RP-0.531
leon-0.524
wark-0.516
chev-0.504
Salt-0.503
Token tropical
Token ID19690
Feature activation+4.11e-01

Pos logit contributions
RP+0.453
leon+0.446
wark+0.441
Salt+0.433
Meat+0.430
Neg logit contributions
 opera-0.439
reads-0.409
 misleading-0.373
=~-0.358
 wra-0.354
Token home
Token ID1363
Feature activation+4.27e-01

Pos logit contributions
leon+1.611
RP+1.588
wark+1.572
chev+1.542
Salt+1.542
Neg logit contributions
 opera-2.002
reads-1.895
 misleading-1.725
=~-1.684
 glim-1.678
Token so
Token ID523
Feature activation-8.71e-03

Pos logit contributions
leon+2.365
RP+2.299
chev+2.280
wark+2.263
Salt+2.233
Neg logit contributions
 opera-2.820
reads-2.651
 misleading-2.402
 glim-2.370
=~-2.366
Token early
Token ID1903
Feature activation-2.01e-01

Pos logit contributions
 opera+0.062
reads+0.058
 misleading+0.054
=~+0.052
 wra+0.052
Neg logit contributions
RP-0.046
leon-0.046
wark-0.044
chev-0.043
Salt-0.043
Token.
Token ID13
Feature activation-7.64e-02

Pos logit contributions
 opera+1.405
reads+1.302
 misleading+1.218
 wra+1.169
=~+1.162
Neg logit contributions
RP-1.089
leon-1.078
wark-1.064
chev-1.037
Salt-1.030
Token
Token ID198
Feature activation+4.65e-02

Pos logit contributions
 opera+0.498
reads+0.459
 misleading+0.428
 wra+0.410
 glim+0.409
Neg logit contributions
RP-0.451
leon-0.448
wark-0.443
Salt-0.430
chev-0.430
Token
Token ID198
Feature activation+5.51e-02

Pos logit contributions
RP+0.283
leon+0.280
wark+0.277
chev+0.271
Salt+0.269
Neg logit contributions
 opera-0.294
reads-0.270
 misleading-0.251
 wra-0.239
=~-0.237
Token"
Token ID1
Feature activation+1.68e-01

Pos logit contributions
RP+0.254
leon+0.250
wark+0.245
Salt+0.240
chev+0.237
Neg logit contributions
 opera-0.435
reads-0.403
 misleading-0.383
 wra-0.370
 glim-0.368
TokenI
Token ID40
Feature activation+2.83e-01

Pos logit contributions
RP+0.827
leon+0.811
wark+0.793
Salt+0.786
Meat+0.775
Neg logit contributions
 opera-1.276
reads-1.179
 misleading-1.115
 wra-1.080
 glim-1.079
Token played
Token ID2826
Feature activation+3.21e-01

Pos logit contributions
RP+1.623
leon+1.610
wark+1.597
chev+1.569
Salt+1.549
Neg logit contributions
 opera-1.851
reads-1.728
 misleading-1.588
=~-1.547
 glim-1.532
Token soccer
Token ID11783
Feature activation-1.08e-01

Pos logit contributions
leon+1.917
RP+1.904
wark+1.883
chev+1.841
Salt+1.837
Neg logit contributions
 opera-1.972
reads-1.881
 misleading-1.693
=~-1.652
 glim-1.621
Token in
Token ID287
Feature activation+4.79e-02

Pos logit contributions
 opera+0.733
reads+0.679
 misleading+0.633
 wra+0.606
=~+0.605
Neg logit contributions
RP-0.600
leon-0.595
wark-0.587
chev-0.572
Salt-0.569
Token in
Token ID287
Feature activation+4.34e-02

Pos logit contributions
RP+0.051
leon+0.050
wark+0.050
chev+0.049
Salt+0.048
Neg logit contributions
 opera-0.073
reads-0.068
 misleading-0.063
 wra-0.061
=~-0.061
Token an
Token ID281
Feature activation-1.69e-02

Pos logit contributions
RP+0.239
leon+0.236
wark+0.235
chev+0.228
Salt+0.227
Neg logit contributions
 opera-0.298
reads-0.276
 misleading-0.254
=~-0.248
 wra-0.247
Token press
Token ID1803
Feature activation-4.96e-01

Pos logit contributions
 opera+0.100
reads+0.092
 misleading+0.084
 glim+0.081
 wra+0.081
Neg logit contributions
RP-0.109
leon-0.107
wark-0.107
chev-0.105
Salt-0.104
Token release
Token ID2650
Feature activation-1.52e-01

Pos logit contributions
 opera+2.564
reads+2.293
 misleading+2.104
 wra+1.983
 glim+1.902
Neg logit contributions
RP-3.571
leon-3.558
wark-3.525
chev-3.451
Meat-3.442
Token.
Token ID13
Feature activation+1.11e-01

Pos logit contributions
 opera+1.085
reads+1.007
 misleading+0.945
 wra+0.907
=~+0.900
Neg logit contributions
RP-0.800
leon-0.790
wark-0.780
chev-0.759
Salt-0.756
Token
Token ID564
Feature activation+1.12e-01

Pos logit contributions
RP+0.626
leon+0.617
wark+0.612
Salt+0.598
chev+0.593
Neg logit contributions
 opera-0.750
reads-0.689
 misleading-0.644
 glim-0.635
 wra-0.624
Token
Token ID250
Feature activation-1.19e-01

Pos logit contributions
RP+0.547
leon+0.531
wark+0.524
chev+0.510
Salt+0.508
Neg logit contributions
 opera-0.852
reads-0.790
 misleading-0.748
 wra-0.727
=~-0.717
TokenAlmost
Token ID23379
Feature activation-6.69e-02

Pos logit contributions
 opera+0.828
reads+0.761
 misleading+0.713
 wra+0.689
 glim+0.686
Neg logit contributions
RP-0.662
leon-0.652
wark-0.642
Salt-0.628
chev-0.627
Token half
Token ID2063
Feature activation-4.44e-01

Pos logit contributions
 opera+0.400
reads+0.368
 misleading+0.337
 glim+0.321
=~+0.320
Neg logit contributions
RP-0.428
leon-0.422
wark-0.419
chev-0.411
Salt-0.409
Token of
Token ID286
Feature activation-4.97e-01

Pos logit contributions
 opera+2.778
reads+2.537
 misleading+2.368
 wra+2.268
=~+2.240
Neg logit contributions
RP-2.723
leon-2.712
wark-2.664
chev-2.600
Salt-2.595
Token Hillary
Token ID5073
Feature activation-2.25e-01

Pos logit contributions
 opera+3.282
reads+3.011
 misleading+2.827
 wra+2.709
=~+2.664
Neg logit contributions
RP-2.877
leon-2.861
wark-2.822
chev-2.741
Salt-2.737
Token).
Token ID737
Feature activation+1.22e-01

Pos logit contributions
RP+1.020
leon+1.003
wark+0.990
chev+0.964
Salt+0.953
Neg logit contributions
 opera-1.310
reads-1.230
 misleading-1.139
 wra-1.095
 glim-1.090
Token More
Token ID3125
Feature activation-1.06e-01

Pos logit contributions
RP+0.706
leon+0.693
wark+0.680
Salt+0.668
chev+0.662
Neg logit contributions
 opera-0.815
reads-0.751
 misleading-0.698
 glim-0.677
 wra-0.675
Token refined
Token ID20449
Feature activation-2.47e-01

Pos logit contributions
 opera+0.642
reads+0.595
 misleading+0.539
=~+0.524
 wra+0.520
Neg logit contributions
RP-0.664
leon-0.657
wark-0.644
chev-0.637
Salt-0.634
Token calculations
Token ID16765
Feature activation+2.12e-01

Pos logit contributions
 opera+1.492
reads+1.367
 misleading+1.262
 wra+1.202
=~+1.197
Neg logit contributions
RP-1.574
leon-1.556
wark-1.539
chev-1.507
Salt-1.499
Token later
Token ID1568
Feature activation+1.28e-01

Pos logit contributions
RP+1.171
leon+1.158
wark+1.144
chev+1.113
Salt+1.101
Neg logit contributions
 opera-1.444
reads-1.358
 misleading-1.241
=~-1.205
 wra-1.196
Token indicated
Token ID8203
Feature activation+1.51e-01

Pos logit contributions
RP+0.716
leon+0.708
wark+0.700
chev+0.683
Salt+0.674
Neg logit contributions
 opera-0.875
reads-0.815
 misleading-0.750
=~-0.724
 wra-0.724
Token a
Token ID257
Feature activation-1.36e-01

Pos logit contributions
RP+0.801
leon+0.787
wark+0.777
chev+0.756
Salt+0.753
Neg logit contributions
 opera-1.074
reads-1.001
 misleading-0.929
 wra-0.897
=~-0.892
Token miss
Token ID2051
Feature activation-9.99e-02

Pos logit contributions
 opera+0.843
reads+0.776
 misleading+0.708
 wra+0.678
=~+0.678
Neg logit contributions
RP-0.850
leon-0.840
wark-0.829
chev-0.810
Salt-0.807
Token of
Token ID286
Feature activation-2.49e-01

Pos logit contributions
 opera+0.669
reads+0.619
 misleading+0.571
 wra+0.552
 glim+0.544
Neg logit contributions
RP-0.576
leon-0.567
wark-0.561
chev-0.549
Salt-0.543
Token only
Token ID691
Feature activation+2.33e-01

Pos logit contributions
 opera+1.563
reads+1.434
 misleading+1.332
 wra+1.272
=~+1.261
Neg logit contributions
RP-1.525
leon-1.509
wark-1.493
chev-1.458
Salt-1.451
Token 7
Token ID767
Feature activation-1.01e-01

Pos logit contributions
RP+1.376
leon+1.360
wark+1.343
chev+1.320
Salt+1.316
Neg logit contributions
 opera-1.485
reads-1.382
 misleading-1.256
 wra-1.215
 glim-1.210
Token were
Token ID547
Feature activation+2.37e-01

Pos logit contributions
 opera+1.929
reads+1.763
 misleading+1.642
 wra+1.570
=~+1.549
Neg logit contributions
RP-1.878
leon-1.859
wark-1.838
chev-1.795
Salt-1.787
Token wed
Token ID29445
Feature activation+8.08e-02

Pos logit contributions
RP+1.409
leon+1.372
wark+1.350
Salt+1.329
chev+1.327
Neg logit contributions
 opera-1.486
reads-1.409
 misleading-1.267
=~-1.260
 glim-1.234
Token to
Token ID284
Feature activation-3.64e-01

Pos logit contributions
RP+0.449
leon+0.444
wark+0.438
chev+0.429
Salt+0.425
Neg logit contributions
 opera-0.548
reads-0.510
 misleading-0.470
 glim-0.455
=~-0.454
Token a
Token ID257
Feature activation-1.26e-01

Pos logit contributions
 opera+2.262
reads+2.068
 misleading+1.922
 wra+1.834
=~+1.815
Neg logit contributions
RP-2.258
leon-2.236
wark-2.214
chev-2.161
Salt-2.151
Token member
Token ID2888
Feature activation+2.13e-01

Pos logit contributions
 opera+0.773
reads+0.713
 misleading+0.655
 wra+0.626
=~+0.625
Neg logit contributions
RP-0.793
leon-0.784
wark-0.773
chev-0.756
Salt-0.754
Token of
Token ID286
Feature activation+3.92e-02

Pos logit contributions
RP+1.262
leon+1.250
wark+1.227
chev+1.199
Salt+1.196
Neg logit contributions
 opera-1.371
reads-1.273
 misleading-1.173
=~-1.128
 wra-1.123
Token the
Token ID262
Feature activation+2.26e-01

Pos logit contributions
RP+0.207
leon+0.205
wark+0.202
chev+0.196
Salt+0.195
Neg logit contributions
 opera-0.278
reads-0.259
 misleading-0.242
 wra-0.233
=~-0.233
Token same
Token ID976
Feature activation+2.06e-01

Pos logit contributions
RP+1.181
leon+1.174
wark+1.149
Salt+1.120
chev+1.115
Neg logit contributions
 opera-1.593
reads-1.509
 misleading-1.382
=~-1.348
 glim-1.342
Token-
Token ID12
Feature activation+2.08e-02

Pos logit contributions
RP+1.187
leon+1.171
wark+1.158
Salt+1.125
chev+1.125
Neg logit contributions
 opera-1.350
reads-1.266
 misleading-1.162
=~-1.127
 wra-1.120
Tokensex
Token ID8044
Feature activation+2.62e-01

Pos logit contributions
RP+0.117
leon+0.116
wark+0.115
chev+0.112
Salt+0.111
Neg logit contributions
 opera-0.141
reads-0.130
 misleading-0.122
 wra-0.117
 glim-0.116
Token in
Token ID287
Feature activation+1.82e-02

Pos logit contributions
RP+1.596
leon+1.581
wark+1.563
Salt+1.501
chev+1.500
Neg logit contributions
 opera-1.580
reads-1.529
 misleading-1.367
 glim-1.315
=~-1.314
Tokening
Token ID278
Feature activation-5.60e-02

Pos logit contributions
RP+0.163
leon+0.159
wark+0.157
chev+0.152
Salt+0.152
Neg logit contributions
 opera-0.178
reads-0.166
 misleading-0.153
 wra-0.147
=~-0.146
Token R
Token ID371
Feature activation+6.48e-02

Pos logit contributions
 opera+0.376
reads+0.351
 misleading+0.324
=~+0.312
 wra+0.312
Neg logit contributions
RP-0.319
leon-0.312
wark-0.306
chev-0.299
Salt-0.297
TokenFI
Token ID11674
Feature activation-9.67e-02

Pos logit contributions
RP+0.260
leon+0.248
wark+0.243
chev+0.235
Salt+0.234
Neg logit contributions
 opera-0.552
reads-0.513
 misleading-0.491
 glim-0.477
 wra-0.477
Token source
Token ID2723
Feature activation-1.52e-01

Pos logit contributions
 opera+0.641
reads+0.593
 misleading+0.549
 wra+0.526
=~+0.525
Neg logit contributions
RP-0.560
leon-0.546
wark-0.538
chev-0.528
Salt-0.522
Token:
Token ID25
Feature activation+1.90e-01

Pos logit contributions
 opera+0.973
reads+0.901
 misleading+0.832
 wra+0.797
=~+0.797
Neg logit contributions
RP-0.908
leon-0.893
wark-0.879
chev-0.863
Salt-0.853
Token
Token ID198
Feature activation+2.56e-01

Pos logit contributions
RP+1.068
leon+1.044
wark+1.021
chev+1.009
Salt+0.999
Neg logit contributions
 opera-1.280
reads-1.202
 misleading-1.109
 wra-1.065
=~-1.062
Token
Token ID198
Feature activation+2.24e-01

Pos logit contributions
RP+1.479
leon+1.457
wark+1.432
Salt+1.400
chev+1.398
Neg logit contributions
 opera-1.707
reads-1.564
 misleading-1.467
 glim-1.448
 wra-1.412
Token"
Token ID1
Feature activation+1.40e-01

Pos logit contributions
RP+1.123
leon+1.090
wark+1.050
Salt+1.047
chev+1.037
Neg logit contributions
 opera-1.683
reads-1.545
 misleading-1.479
 wra-1.427
 asbestos-1.420
Token In
Token ID554
Feature activation-1.16e-01

Pos logit contributions
RP+0.702
leon+0.685
wark+0.666
Salt+0.654
chev+0.653
Neg logit contributions
 opera-1.044
reads-0.967
 misleading-0.916
 wra-0.884
 asbestos-0.876
Token the
Token ID262
Feature activation-2.24e-01

Pos logit contributions
 opera+0.762
reads+0.705
 misleading+0.656
 wra+0.628
=~+0.624
Neg logit contributions
RP-0.683
leon-0.674
wark-0.666
chev-0.650
Salt-0.645
Token event
Token ID1785
Feature activation-8.84e-02

Pos logit contributions
 opera+1.426
reads+1.318
 misleading+1.221
 wra+1.166
=~+1.162
Neg logit contributions
RP-1.358
leon-1.340
wark-1.321
chev-1.293
Salt-1.285
Token Krist
Token ID14912
Feature activation-4.17e-01

Pos logit contributions
 opera+0.740
reads+0.687
 misleading+0.642
 wra+0.616
=~+0.613
Neg logit contributions
RP-0.581
leon-0.576
wark-0.567
chev-0.551
Salt-0.549
Tokenof
Token ID1659
Feature activation+1.49e-02

Pos logit contributions
 opera+2.624
reads+2.420
 misleading+2.221
 wra+2.137
=~+2.104
Neg logit contributions
RP-2.536
leon-2.501
wark-2.476
Salt-2.425
chev-2.415
Token God
Token ID1793
Feature activation-2.96e-01

Pos logit contributions
RP+0.082
leon+0.081
wark+0.080
chev+0.078
Salt+0.077
Neg logit contributions
 opera-0.103
reads-0.094
 misleading-0.089
 wra-0.085
 glim-0.085
Tokenda
Token ID6814
Feature activation-7.73e-02

Pos logit contributions
 opera+2.304
reads+2.154
 misleading+2.024
 wra+1.959
=~+1.942
Neg logit contributions
RP-1.369
leon-1.350
wark-1.332
chev-1.287
Salt-1.286
Tokenert
Token ID861
Feature activation+2.36e-01

Pos logit contributions
 opera+0.608
reads+0.569
 misleading+0.537
=~+0.521
 wra+0.517
Neg logit contributions
RP-0.352
leon-0.346
wark-0.343
chev-0.331
Salt-0.328
Token (
Token ID357
Feature activation-1.24e-01

Pos logit contributions
RP+1.177
leon+1.152
wark+1.141
chev+1.138
Salt+1.097
Neg logit contributions
 opera-1.755
reads-1.640
 misleading-1.540
=~-1.501
 wra-1.471
TokenAg
Token ID10262
Feature activation-4.26e-01

Pos logit contributions
 opera+0.997
reads+0.931
 misleading+0.881
 wra+0.851
=~+0.845
Neg logit contributions
RP-0.543
leon-0.534
wark-0.525
chev-0.508
Salt-0.506
Token2
Token ID17
Feature activation+2.53e-01

Pos logit contributions
 opera+2.672
reads+2.443
 misleading+2.271
 wra+2.170
=~+2.157
Neg logit contributions
RP-2.634
leon-2.602
wark-2.569
chev-2.514
Salt-2.502
Tokenr
Token ID81
Feature activation-2.77e-01

Pos logit contributions
RP+1.387
leon+1.367
wark+1.348
chev+1.316
Salt+1.306
Neg logit contributions
 opera-1.765
reads-1.629
 misleading-1.530
 wra-1.467
=~-1.457
Token-
Token ID12
Feature activation-2.47e-01

Pos logit contributions
 opera+1.738
reads+1.590
 misleading+1.479
 wra+1.409
=~+1.393
Neg logit contributions
RP-1.701
leon-1.683
wark-1.671
chev-1.631
Salt-1.617
TokenLa
Token ID14772
Feature activation+2.62e-01

Pos logit contributions
 opera+1.699
reads+1.590
 misleading+1.478
=~+1.415
 wra+1.408
Neg logit contributions
RP-1.329
wark-1.321
leon-1.307
chev-1.281
Salt-1.241
Token,
Token ID11
Feature activation-1.00e-01

Pos logit contributions
 opera+0.562
reads+0.520
 misleading+0.484
 wra+0.466
=~+0.465
Neg logit contributions
RP-0.445
leon-0.439
wark-0.436
chev-0.423
Salt-0.420
Token and
Token ID290
Feature activation-2.47e-02

Pos logit contributions
 opera+0.653
reads+0.603
 misleading+0.555
=~+0.536
 wra+0.534
Neg logit contributions
RP-0.587
leon-0.578
wark-0.575
chev-0.560
Salt-0.557
Token it
Token ID340
Feature activation-6.83e-02

Pos logit contributions
 opera+0.161
reads+0.150
 misleading+0.138
=~+0.133
 wra+0.132
Neg logit contributions
RP-0.144
leon-0.142
wark-0.141
chev-0.138
Salt-0.137
Token was
Token ID373
Feature activation-1.57e-01

Pos logit contributions
 opera+0.420
reads+0.382
 misleading+0.354
 wra+0.339
 glim+0.339
Neg logit contributions
RP-0.427
leon-0.424
wark-0.420
chev-0.410
Salt-0.406
Token the
Token ID262
Feature activation-1.23e-01

Pos logit contributions
 opera+0.912
reads+0.835
 misleading+0.757
=~+0.732
 wra+0.724
Neg logit contributions
RP-1.028
leon-1.012
wark-1.009
chev-0.983
Salt-0.981
Token main
Token ID1388
Feature activation-7.00e-02

Pos logit contributions
 opera+0.754
reads+0.691
 misleading+0.635
=~+0.609
 wra+0.607
Neg logit contributions
RP-0.771
leon-0.758
wark-0.752
chev-0.736
Salt-0.731
Token thing
Token ID1517
Feature activation+4.30e-02

Pos logit contributions
 opera+0.388
reads+0.352
 misleading+0.321
=~+0.308
 wra+0.306
Neg logit contributions
RP-0.478
leon-0.471
wark-0.470
chev-0.458
Salt-0.457
Token they
Token ID484
Feature activation-6.64e-02

Pos logit contributions
RP+0.239
leon+0.236
wark+0.235
chev+0.228
Salt+0.226
Neg logit contributions
 opera-0.292
reads-0.270
 misleading-0.249
=~-0.243
 wra-0.242
Token judged
Token ID19589
Feature activation-3.65e-01

Pos logit contributions
 opera+0.411
reads+0.376
 misleading+0.346
=~+0.331
 wra+0.330
Neg logit contributions
RP-0.416
leon-0.410
wark-0.406
chev-0.398
Salt-0.395
Token the
Token ID262
Feature activation-2.52e-01

Pos logit contributions
 opera+2.446
reads+2.259
 misleading+2.100
 wra+2.019
=~+2.008
Neg logit contributions
RP-2.083
leon-2.060
wark-2.037
chev-1.987
Salt-1.973
Token C
Token ID327
Feature activation+1.14e-01

Pos logit contributions
 opera+1.723
reads+1.592
 misleading+1.489
 wra+1.428
=~+1.416
Neg logit contributions
RP-1.410
leon-1.394
wark-1.378
chev-1.341
Salt-1.335
Token Some
Token ID2773
Feature activation+2.60e-01

Pos logit contributions
RP+0.307
leon+0.301
wark+0.300
Salt+0.292
chev+0.289
Neg logit contributions
 opera-0.346
reads-0.317
 misleading-0.295
 glim-0.290
 wra-0.287
Token people
Token ID661
Feature activation+9.48e-03

Pos logit contributions
RP+1.519
wark+1.490
leon+1.473
chev+1.436
Salt+1.430
Neg logit contributions
 opera-1.680
reads-1.571
 misleading-1.429
=~-1.408
 wra-1.378
Token took
Token ID1718
Feature activation-1.24e-01

Pos logit contributions
RP+0.055
leon+0.054
wark+0.054
chev+0.052
Salt+0.052
Neg logit contributions
 opera-0.063
reads-0.058
 misleading-0.054
=~-0.052
 wra-0.052
Token issue
Token ID2071
Feature activation-1.52e-01

Pos logit contributions
 opera+0.816
reads+0.752
 misleading+0.695
=~+0.676
 wra+0.670
Neg logit contributions
RP-0.721
leon-0.713
wark-0.707
chev-0.685
Salt-0.682
Token with
Token ID351
Feature activation-1.09e-01

Pos logit contributions
 opera+0.973
reads+0.893
 misleading+0.833
 wra+0.794
=~+0.783
Neg logit contributions
RP-0.911
leon-0.901
wark-0.890
chev-0.869
Salt-0.866
Token one
Token ID530
Feature activation-2.62e-02

Pos logit contributions
 opera+0.730
reads+0.677
 misleading+0.626
 wra+0.603
=~+0.603
Neg logit contributions
RP-0.620
leon-0.610
wark-0.606
chev-0.589
Salt-0.586
Token,
Token ID11
Feature activation+3.36e-01

Pos logit contributions
 opera+0.166
reads+0.153
 misleading+0.141
=~+0.136
 wra+0.135
Neg logit contributions
RP-0.159
leon-0.156
wark-0.156
chev-0.152
Salt-0.151
Token two
Token ID734
Feature activation-8.21e-02

Pos logit contributions
RP+1.935
wark+1.919
leon+1.909
chev+1.866
Salt+1.847
Neg logit contributions
 opera-2.168
reads-2.014
=~-1.823
 misleading-1.807
 glim-1.791
Token,
Token ID11
Feature activation+3.37e-01

Pos logit contributions
 opera+0.521
reads+0.480
 misleading+0.443
=~+0.425
 wra+0.424
Neg logit contributions
RP-0.497
leon-0.490
wark-0.487
chev-0.475
Salt-0.472
Token several
Token ID1811
Feature activation+1.60e-01

Pos logit contributions
RP+2.096
wark+2.095
leon+2.089
chev+2.034
Salt+2.015
Neg logit contributions
 opera-2.028
reads-1.886
 misleading-1.696
=~-1.654
 wra-1.640
Token or
Token ID393
Feature activation+3.59e-01

Pos logit contributions
RP+1.024
wark+1.003
leon+1.002
chev+0.976
Salt+0.974
Neg logit contributions
 opera-0.946
reads-0.874
 misleading-0.785
=~-0.777
 wra-0.758
TokenJohn
Token ID7554
Feature activation-3.51e-02

Pos logit contributions
 opera+1.558
reads+1.463
 misleading+1.374
 wra+1.326
=~+1.320
Neg logit contributions
RP-0.879
leon-0.867
wark-0.862
chev-0.830
Salt-0.819
Token Gavin
Token ID30857
Feature activation+1.03e-01

Pos logit contributions
 opera+0.246
reads+0.228
 misleading+0.214
 wra+0.205
=~+0.204
Neg logit contributions
RP-0.190
leon-0.188
wark-0.186
chev-0.181
Salt-0.179
Token),
Token ID828
Feature activation-1.58e-01

Pos logit contributions
leon+0.544
RP+0.543
wark+0.527
chev+0.518
Salt+0.512
Neg logit contributions
 opera-0.731
reads-0.682
 misleading-0.641
 glim-0.620
=~-0.615
Token but
Token ID475
Feature activation+2.35e-02

Pos logit contributions
 opera+1.029
reads+0.950
 misleading+0.880
 wra+0.846
=~+0.842
Neg logit contributions
RP-0.930
leon-0.922
wark-0.908
chev-0.888
Salt-0.882
Token Hitchcock
Token ID49817
Feature activation-1.42e-01

Pos logit contributions
RP+0.134
leon+0.133
wark+0.130
chev+0.127
Salt+0.127
Neg logit contributions
 opera-0.157
reads-0.145
 misleading-0.135
 wra-0.130
=~-0.129
Token has
Token ID468
Feature activation-1.96e-01

Pos logit contributions
 opera+0.925
reads+0.852
 misleading+0.791
 wra+0.758
=~+0.754
Neg logit contributions
RP-0.834
leon-0.825
wark-0.817
chev-0.797
Salt-0.790
Token other
Token ID584
Feature activation-3.30e-01

Pos logit contributions
 opera+1.291
reads+1.191
 misleading+1.104
 wra+1.062
=~+1.054
Neg logit contributions
RP-1.143
leon-1.131
wark-1.116
chev-1.090
Salt-1.083
Token plans
Token ID3352
Feature activation-4.21e-02

Pos logit contributions
 opera+1.706
reads+1.539
 misleading+1.398
 wra+1.320
=~+1.316
Neg logit contributions
RP-2.385
leon-2.358
wark-2.333
chev-2.297
Salt-2.285
Token.
Token ID13
Feature activation-2.04e-01

Pos logit contributions
 opera+0.306
reads+0.285
 misleading+0.266
 wra+0.256
=~+0.255
Neg logit contributions
RP-0.216
leon-0.214
wark-0.211
chev-0.207
Salt-0.203
Token When
Token ID1649
Feature activation-8.11e-02

Pos logit contributions
 opera+1.332
reads+1.224
 misleading+1.141
 wra+1.094
=~+1.084
Neg logit contributions
RP-1.209
leon-1.195
wark-1.180
chev-1.150
Salt-1.148
Token she
Token ID673
Feature activation-1.65e-01

Pos logit contributions
 opera+0.526
reads+0.486
 misleading+0.450
 wra+0.431
=~+0.430
Neg logit contributions
RP-0.481
leon-0.476
wark-0.469
chev-0.458
Salt-0.457
Token the
Token ID262
Feature activation-5.94e-01

Pos logit contributions
 opera+1.705
reads+1.577
 misleading+1.471
 wra+1.412
=~+1.403
Neg logit contributions
RP-1.415
leon-1.399
wark-1.384
chev-1.348
Salt-1.340
Token bitter
Token ID12922
Feature activation-5.36e-02

Pos logit contributions
 opera+3.178
reads+2.841
 misleading+2.613
 wra+2.474
=~+2.418
Neg logit contributions
RP-4.213
leon-4.178
wark-4.141
chev-4.055
Salt-4.035
Token end
Token ID886
Feature activation-1.51e-01

Pos logit contributions
 opera+0.286
reads+0.262
 misleading+0.238
=~+0.227
 glim+0.225
Neg logit contributions
RP-0.375
leon-0.371
wark-0.368
chev-0.362
Salt-0.360
Token,
Token ID11
Feature activation+9.73e-03

Pos logit contributions
 opera+1.046
reads+0.970
 misleading+0.904
 wra+0.867
=~+0.864
Neg logit contributions
RP-0.829
leon-0.821
wark-0.812
chev-0.789
Salt-0.782
Token all
Token ID477
Feature activation-1.37e-01

Pos logit contributions
RP+0.057
leon+0.057
wark+0.056
chev+0.055
Salt+0.054
Neg logit contributions
 opera-0.063
reads-0.058
 misleading-0.053
=~-0.052
 glim-0.052
Token the
Token ID262
Feature activation-1.07e-01

Pos logit contributions
 opera+0.909
reads+0.842
 misleading+0.775
=~+0.751
 wra+0.746
Neg logit contributions
RP-0.784
leon-0.771
wark-0.766
chev-0.746
Salt-0.742
Token way
Token ID835
Feature activation-2.46e-01

Pos logit contributions
 opera+0.624
reads+0.570
 misleading+0.520
=~+0.501
 glim+0.495
Neg logit contributions
RP-0.695
leon-0.689
wark-0.683
chev-0.671
Salt-0.667
Token up
Token ID510
Feature activation-2.09e-01

Pos logit contributions
 opera+1.563
reads+1.439
 misleading+1.334
 wra+1.274
=~+1.269
Neg logit contributions
RP-1.492
leon-1.474
wark-1.462
chev-1.426
Salt-1.417
Token the
Token ID262
Feature activation-2.52e-01

Pos logit contributions
 opera+1.215
reads+1.113
 misleading+1.022
 wra+0.974
=~+0.968
Neg logit contributions
RP-1.383
leon-1.368
wark-1.360
chev-1.332
Salt-1.320
Token convention
Token ID9831
Feature activation-9.77e-02

Pos logit contributions
 opera+1.563
reads+1.443
 misleading+1.336
 wra+1.276
=~+1.275
Neg logit contributions
RP-1.555
leon-1.540
wark-1.519
chev-1.482
Salt-1.477
Token,
Token ID11
Feature activation-2.18e-01

Pos logit contributions
 opera+0.660
reads+0.616
 misleading+0.572
=~+0.547
 glim+0.547
Neg logit contributions
RP-0.548
leon-0.543
wark-0.532
chev-0.522
Salt-0.519
Token In
Token ID554
Feature activation+2.47e-02

Pos logit contributions
RP+1.769
leon+1.743
wark+1.702
Salt+1.687
chev+1.659
Neg logit contributions
 opera-2.063
reads-1.920
 misleading-1.787
 glim-1.761
 wra-1.725
Token some
Token ID617
Feature activation-1.35e-01

Pos logit contributions
RP+0.147
leon+0.146
wark+0.144
chev+0.141
Salt+0.140
Neg logit contributions
 opera-0.158
reads-0.147
 misleading-0.136
=~-0.131
 wra-0.130
Token of
Token ID286
Feature activation+3.71e-04

Pos logit contributions
 opera+0.740
reads+0.682
 misleading+0.616
=~+0.590
 wra+0.583
Neg logit contributions
RP-0.931
leon-0.923
wark-0.910
chev-0.894
Salt-0.888
Token them
Token ID606
Feature activation-1.28e-01

Pos logit contributions
RP+0.002
leon+0.002
wark+0.002
chev+0.002
Salt+0.002
Neg logit contributions
 opera-0.003
reads-0.002
 misleading-0.002
=~-0.002
 wra-0.002
Token even
Token ID772
Feature activation+1.15e-01

Pos logit contributions
 opera+0.868
reads+0.809
 misleading+0.751
=~+0.722
 wra+0.719
Neg logit contributions
RP-0.714
leon-0.707
wark-0.695
chev-0.681
Salt-0.676
Token minute
Token ID5664
Feature activation-7.64e-01

Pos logit contributions
RP+0.704
leon+0.699
wark+0.683
chev+0.680
Salt+0.671
Neg logit contributions
 opera-0.698
reads-0.663
 misleading-0.597
=~-0.587
 wra-0.571
Token details
Token ID3307
Feature activation-3.53e-01

Pos logit contributions
 opera+4.455
reads+3.982
 misleading+3.699
 wra+3.554
 galaxies+3.479
Neg logit contributions
RP-5.056
wark-4.982
leon-4.974
Salt-4.820
chev-4.816
Token like
Token ID588
Feature activation+9.53e-02

Pos logit contributions
 opera+2.257
reads+2.073
 misleading+1.928
 wra+1.843
=~+1.827
Neg logit contributions
RP-2.127
leon-2.104
wark-2.082
chev-2.032
Salt-2.022
Token nails
Token ID23361
Feature activation-2.50e-01

Pos logit contributions
RP+0.595
leon+0.592
wark+0.583
chev+0.578
Salt+0.567
Neg logit contributions
 opera-0.573
reads-0.536
 misleading-0.489
=~-0.477
 wra-0.461
Token,
Token ID11
Feature activation-2.30e-01

Pos logit contributions
 opera+1.630
reads+1.507
 misleading+1.398
 wra+1.334
=~+1.333
Neg logit contributions
RP-1.469
leon-1.453
wark-1.437
chev-1.403
Salt-1.395
Token muscles
Token ID12749
Feature activation-3.03e-02

Pos logit contributions
 opera+1.358
reads+1.242
 misleading+1.149
=~+1.094
 wra+1.084
Neg logit contributions
RP-1.490
leon-1.478
wark-1.465
chev-1.438
Salt-1.426
Tokente
Token ID660
Feature activation-6.87e-02

Pos logit contributions
 opera+0.288
reads+0.263
 misleading+0.243
 wra+0.231
=~+0.229
Neg logit contributions
RP-0.315
leon-0.312
wark-0.308
chev-0.301
Salt-0.300
Tokenlegraph
Token ID16606
Feature activation-2.54e-01

Pos logit contributions
 opera+0.440
reads+0.405
 misleading+0.376
 wra+0.360
=~+0.357
Neg logit contributions
RP-0.413
leon-0.408
wark-0.404
chev-0.395
Salt-0.393
Token.
Token ID13
Feature activation-4.38e-01

Pos logit contributions
 opera+1.720
reads+1.608
 misleading+1.477
 wra+1.420
=~+1.416
Neg logit contributions
RP-1.398
leon-1.377
wark-1.373
chev-1.348
Salt-1.339
Tokenco
Token ID1073
Feature activation-3.60e-01

Pos logit contributions
 opera+2.119
reads+1.917
 misleading+1.701
 wra+1.615
=~+1.593
Neg logit contributions
RP-3.297
leon-3.247
wark-3.237
Salt-3.186
chev-3.174
Token.
Token ID13
Feature activation-2.57e-01

Pos logit contributions
 opera+2.423
reads+2.271
 misleading+2.089
 wra+2.004
=~+1.982
Neg logit contributions
RP-1.990
wark-1.959
leon-1.952
Salt-1.913
chev-1.909
Tokenuk
Token ID2724
Feature activation-6.68e-01

Pos logit contributions
 opera+1.211
reads+1.088
 misleading+0.975
 wra+0.920
=~+0.913
Neg logit contributions
RP-1.941
wark-1.919
leon-1.919
chev-1.894
Salt-1.887
Token/
Token ID14
Feature activation-9.79e-04

Pos logit contributions
 opera+4.231
reads+3.984
 misleading+3.615
=~+3.439
 wra+3.435
Neg logit contributions
RP-3.837
wark-3.804
leon-3.802
chev-3.786
Salt-3.690
Tokenbusiness
Token ID22680
Feature activation-1.06e-01

Pos logit contributions
 opera+0.007
reads+0.006
 misleading+0.006
 wra+0.006
=~+0.006
Neg logit contributions
RP-0.005
leon-0.005
wark-0.005
chev-0.005
Salt-0.005
Token/
Token ID14
Feature activation-2.04e-01

Pos logit contributions
 opera+0.764
reads+0.708
 misleading+0.665
 wra+0.639
=~+0.635
Neg logit contributions
RP-0.557
leon-0.550
wark-0.543
chev-0.528
Salt-0.525
Tokenanalysis
Token ID20930
Feature activation-4.32e-01

Pos logit contributions
 opera+1.447
reads+1.328
 misleading+1.259
 wra+1.212
 glim+1.200
Neg logit contributions
RP-1.089
leon-1.087
wark-1.061
chev-1.038
Salt-1.019
Token/
Token ID14
Feature activation-2.04e-01

Pos logit contributions
 opera+2.915
reads+2.707
 misleading+2.516
 wra+2.409
=~+2.390
Neg logit contributions
RP-2.420
leon-2.384
wark-2.363
chev-2.308
Salt-2.306
Token a
Token ID257
Feature activation+5.59e-03

Pos logit contributions
RP+0.280
leon+0.277
wark+0.274
chev+0.266
Salt+0.266
Neg logit contributions
 opera-0.329
reads-0.304
 misleading-0.283
 wra-0.272
 glim-0.271
Token home
Token ID1363
Feature activation+5.66e-02

Pos logit contributions
RP+0.036
leon+0.035
wark+0.035
chev+0.034
Salt+0.034
Neg logit contributions
 opera-0.034
reads-0.031
 misleading-0.028
 wra-0.027
=~-0.027
Token for
Token ID329
Feature activation-5.32e-01

Pos logit contributions
RP+0.291
leon+0.288
wark+0.284
chev+0.278
Salt+0.275
Neg logit contributions
 opera-0.412
reads-0.382
 misleading-0.361
 wra-0.346
=~-0.342
Token Sword
Token ID11535
Feature activation-4.75e-01

Pos logit contributions
 opera+3.044
reads+2.769
 misleading+2.561
 wra+2.433
=~+2.391
Neg logit contributions
RP-3.495
leon-3.453
chev-3.411
wark-3.386
Salt-3.344
Token Base
Token ID7308
Feature activation-6.61e-01

Pos logit contributions
 opera+2.562
reads+2.318
 misleading+2.096
 wra+1.980
=~+1.956
Neg logit contributions
RP-3.258
leon-3.240
wark-3.221
chev-3.196
Salt-3.145
Token.
Token ID13
Feature activation-1.08e+00

Pos logit contributions
 opera+4.103
reads+3.873
 misleading+3.541
=~+3.369
 wra+3.333
Neg logit contributions
wark-3.806
RP-3.790
leon-3.762
chev-3.748
Meat-3.679
Token
Token ID198
Feature activation-2.40e-01

Pos logit contributions
 opera+5.298
reads+4.973
 misleading+4.358
 galaxies+4.045
 wra+4.044
Neg logit contributions
RP-7.321
soDeliveryDate-7.252
leon-7.231
plet-7.218
wark-7.216
Token
Token ID198
Feature activation-1.77e-03

Pos logit contributions
 opera+1.448
reads+1.333
 misleading+1.229
 wra+1.165
=~+1.156
Neg logit contributions
RP-1.507
leon-1.497
wark-1.491
chev-1.458
Salt-1.437
TokenEditor
Token ID17171
Feature activation+1.85e-01

Pos logit contributions
 opera+0.014
reads+0.013
 misleading+0.012
 wra+0.012
 glim+0.011
Neg logit contributions
RP-0.009
leon-0.008
wark-0.008
Salt-0.008
Meat-0.008
Token's
Token ID338
Feature activation+5.60e-01

Pos logit contributions
RP+0.940
leon+0.915
wark+0.907
Salt+0.880
chev+0.874
Neg logit contributions
 opera-1.367
reads-1.268
 misleading-1.186
=~-1.145
 wra-1.144
Token Note
Token ID5740
Feature activation+4.25e-01

Pos logit contributions
RP+3.074
wark+2.965
leon+2.938
Salt+2.862
Token+2.835
Neg logit contributions
 opera-3.800
reads-3.481
 misleading-3.221
=~-3.186
 asbestos-3.129
Token Dan
Token ID6035
Feature activation-3.45e-01

Pos logit contributions
 opera+2.041
reads+1.882
 misleading+1.747
 wra+1.676
=~+1.664
Neg logit contributions
RP-1.850
leon-1.830
wark-1.813
chev-1.763
Salt-1.755
Token Fry
Token ID25305
Feature activation-9.74e-02

Pos logit contributions
 opera+2.436
reads+2.262
 misleading+2.114
 wra+2.034
=~+2.016
Neg logit contributions
RP-1.839
leon-1.810
wark-1.787
chev-1.745
Salt-1.735
Tokend
Token ID67
Feature activation-2.82e-01

Pos logit contributions
 opera+0.665
reads+0.614
 misleading+0.575
 wra+0.549
=~+0.548
Neg logit contributions
RP-0.547
leon-0.540
wark-0.535
chev-0.521
Salt-0.516
Tokenman
Token ID805
Feature activation-2.90e-01

Pos logit contributions
 opera+2.208
reads+2.062
 misleading+1.947
 wra+1.877
=~+1.867
Neg logit contributions
RP-1.301
leon-1.283
wark-1.268
chev-1.227
Salt-1.214
Token of
Token ID286
Feature activation-8.97e-02

Pos logit contributions
 opera+1.944
reads+1.794
 misleading+1.673
 wra+1.604
=~+1.592
Neg logit contributions
RP-1.658
leon-1.639
wark-1.621
chev-1.579
Salt-1.571
Token web
Token ID3992
Feature activation-7.27e-01

Pos logit contributions
 opera+0.596
reads+0.554
 misleading+0.513
 wra+0.492
=~+0.491
Neg logit contributions
RP-0.515
wark-0.509
leon-0.508
chev-0.488
Salt-0.488
Token firm
Token ID4081
Feature activation-5.41e-01

Pos logit contributions
 opera+4.634
reads+4.247
 misleading+3.925
 wra+3.802
=~+3.729
Neg logit contributions
RP-4.301
leon-4.225
wark-4.161
chev-4.156
Salt-4.109
Token In
Token ID554
Feature activation-8.04e-02

Pos logit contributions
 opera+3.690
reads+3.386
 misleading+3.176
 wra+3.055
=~+3.014
Neg logit contributions
RP-3.019
leon-2.980
wark-2.925
chev-2.879
Salt-2.856
Tokenigo
Token ID14031
Feature activation-3.16e-01

Pos logit contributions
 opera+0.589
reads+0.545
 misleading+0.516
 wra+0.496
=~+0.493
Neg logit contributions
RP-0.412
leon-0.409
wark-0.405
Salt-0.391
chev-0.391
Token.
Token ID13
Feature activation+9.73e-02

Pos logit contributions
 opera+2.134
reads+1.969
 misleading+1.839
 wra+1.764
=~+1.749
Neg logit contributions
RP-1.784
leon-1.765
wark-1.740
chev-1.703
Salt-1.691
Token A
Token ID317
Feature activation-2.75e-01

Pos logit contributions
RP+0.562
leon+0.558
wark+0.556
chev+0.535
Salt+0.531
Neg logit contributions
 opera-0.646
reads-0.594
 misleading-0.553
 wra-0.539
 glim-0.534
Token of
Token ID286
Feature activation+9.91e-04

Pos logit contributions
 opera+1.424
reads+1.323
 misleading+1.233
 wra+1.184
=~+1.177
Neg logit contributions
RP-1.137
leon-1.127
wark-1.111
chev-1.083
Salt-1.076
Token pretending
Token ID23662
Feature activation-1.30e-01

Pos logit contributions
RP+0.006
leon+0.006
wark+0.006
chev+0.006
Salt+0.006
Neg logit contributions
 opera-0.006
reads-0.006
 misleading-0.005
=~-0.005
 wra-0.005
Token.
Token ID13
Feature activation-1.72e-01

Pos logit contributions
 opera+0.785
reads+0.731
 misleading+0.672
=~+0.638
 glim+0.635
Neg logit contributions
RP-0.807
leon-0.800
wark-0.786
chev-0.777
Salt-0.768
Token
Token ID198
Feature activation-2.68e-01

Pos logit contributions
 opera+1.123
reads+1.035
 misleading+0.966
 wra+0.928
 glim+0.922
Neg logit contributions
RP-1.013
leon-1.000
wark-0.987
Salt-0.962
chev-0.960
Token
Token ID198
Feature activation-5.04e-02

Pos logit contributions
 opera+1.586
reads+1.470
 misleading+1.342
 wra+1.269
=~+1.262
Neg logit contributions
RP-1.708
wark-1.695
leon-1.695
chev-1.654
Salt-1.633
TokenR
Token ID49
Feature activation-6.69e-01

Pos logit contributions
 opera+0.413
reads+0.387
 misleading+0.366
 wra+0.354
=~+0.352
Neg logit contributions
RP-0.212
leon-0.210
wark-0.207
chev-0.199
Salt-0.197
TokenUSE
Token ID19108
Feature activation-1.64e-01

Pos logit contributions
 opera+3.608
reads+3.286
 misleading+2.964
=~+2.842
 galaxies+2.746
Neg logit contributions
leon-4.498
wark-4.454
RP-4.411
chev-4.382
Meat-4.315
TokenLL
Token ID3069
Feature activation-3.08e-01

Pos logit contributions
 opera+1.237
reads+1.160
 misleading+1.097
=~+1.063
 wra+1.036
Neg logit contributions
wark-0.767
RP-0.765
leon-0.762
chev-0.749
Salt-0.728
Token:
Token ID25
Feature activation-2.36e-01

Pos logit contributions
 opera+2.086
reads+1.952
 misleading+1.817
=~+1.730
 wra+1.720
Neg logit contributions
RP-1.684
leon-1.680
wark-1.674
chev-1.628
Salt-1.600
Token Speaking
Token ID21393
Feature activation+1.93e-01

Pos logit contributions
 opera+1.561
reads+1.439
 misleading+1.341
 wra+1.286
=~+1.276
Neg logit contributions
RP-1.367
leon-1.351
wark-1.336
chev-1.302
Salt-1.297
Token of
Token ID286
Feature activation-4.08e-02

Pos logit contributions
RP+1.134
leon+1.124
wark+1.105
chev+1.072
Salt+1.071
Neg logit contributions
 opera-1.239
reads-1.157
 misleading-1.065
=~-1.035
 wra-1.023