TOP ACTIVATIONS
MAX = 1.668

in front of buildings like Sur rog ate 's Court .
The post Why Hillary Clinton Is Richard Nixon in a Pants
and sol es ") is an autos omal dominant ker at
what ' makes us who we are .' The
и е а м н и к
two - thirds of where they were at the start ,
4 years old and once I turn AA on things slow
а м н и к
t be where we are if I was the
person loaded , initially failing , sec ures mark ( 1
Medicare to stay the way it is while 28 percent think
proved its met tle off - road racing in the
b ] Mega Pony [/ b ][/ color ] - 20
: Because of a failed will save , your character now
b ong i and Sh us aku U ch iy ama
Hillary Clinton 's campaign manager Rob by M ook said
placeholder p > li > ul >
know yet how bad that number is . "
) ( 8 ) 1 . Gordon Reid (
Catalan pro - independence flag - the est el ada -

MIN ACTIVATIONS
MIN = -1.680

G ED NET CALL UPDATE : 05 41 z BLACK FO
ON ALL PERSON EL UPDATE : 05 45 z K USH
G ED NET CALL UPDATE : 05 43 z ST OW
G ED NET CALL UPDATE : 05 42 z SH AM
G ED NET CALL UPDATE : 05 50 z VOL UN
G ED NET CALL UPDATE : 0 551 z VOL UN
G ED NET CALL UPDATE : 05 38 z SEM INAL
. list _ images ( args [" images "] ): #
. list _ images ( args [ " images " ]
97 ] Ble acher Report ,[ 98 ] Sporting News ,[
2007 Tony Shal h oub 2007 Charlie Sheen
the daughter of Ted Kot che ff , director of ,
2003 Ray Rom ano 2004 Kel sey Gram mer
" -- images ", required = True , help =" path
for public monitoring and accountability ; and iv ) mandate that
x . x . x . 151 103 30 29 x
2008 Steve Care ll 2008 Lee Pace
Sheen 2008 Alec Baldwin 2008 Steve Care ll
2007 Steve Care ll 2007 Tony Shal h oub
Tony Shal h oub 2007 Charlie Sheen 2008 Alec

INTERVAL 0.831 - 1.668
CONTAINS 0.080%

year , but if the Jets want to see playoff hockey
of " off - the - record " information , noting
are a bit safer than they otherwise might be . So
a . dot . gov / people / in jury /
, El j ig ide y was not from Il j

INTERVAL -0.006 - 0.831
CONTAINS 50.877%

Who would suffer from the end of this perverse subsidy of
the city 's famous Cock ers Roses business .
the upload page . In later desperation , I
hefty new driving charges without first creating viable alternatives to car
normally used in overseas wars for states ide law enforcement purposes

INTERVAL -0.843 - -0.006
CONTAINS 48.966%

AP X S Principal Investigator R alf G ell ert of
call you a liar . 13 . Resident Evil
However , maybe a niche for a simple jet engine can
M ATE 16 . 04 + at all , I
may be saved to the government ?" Besides , he argued

INTERVAL -1.680 - -0.843
CONTAINS 0.077%

x . x . 175 99 33 87 x . x
2005 Eric McCorm ack 2005 Ray Rom
,[ 98 ] Sporting News ,[ 99 ] SB Nation ,[
a thing as a ' Bo hem ian sp arrow slaying
Unt rust worthy Why 18 81 ? Robert Louis
Token in
Token ID287
Feature activation+1.83e-01

Pos logit contributions
+0.044
 POS+0.040
+0.036
 LW+0.036
idis+0.035
Neg logit contributions
wagen-0.042
amaz-0.039
 municip-0.039
pu-0.038
ells-0.038
Token front
Token ID2166
Feature activation-1.14e-01

Pos logit contributions
wagen+1.194
amaz+1.108
pu+1.084
 municip+1.068
ells+1.064
Neg logit contributions
-1.645
 POS-1.458
-1.374
-1.334
idious-1.329
Token of
Token ID286
Feature activation+1.94e-01

Pos logit contributions
+0.816
 POS+0.738
+0.659
 LW+0.656
idis+0.639
Neg logit contributions
wagen-0.942
amaz-0.887
 municip-0.881
pu-0.863
ells-0.860
Token buildings
Token ID6832
Feature activation-5.11e-02

Pos logit contributions
wagen+1.300
amaz+1.213
pu+1.187
 municip+1.176
ells+1.173
Neg logit contributions
-1.692
 POS-1.516
-1.414
 LW-1.379
idious-1.373
Token like
Token ID588
Feature activation-3.45e-01

Pos logit contributions
+0.447
 POS+0.407
+0.374
 LW+0.369
idious+0.365
Neg logit contributions
wagen-0.340
amaz-0.317
pu-0.309
 municip-0.308
ells-0.305
Token Sur
Token ID4198
Feature activation+1.67e+00

Pos logit contributions
+2.582
 POS+2.325
+2.090
idis+2.067
 LW+2.046
Neg logit contributions
wagen-2.673
amaz-2.515
 municip-2.483
pu-2.470
ells-2.431
Tokenrog
Token ID3828
Feature activation-5.19e-02

Pos logit contributions
rac+4.207
pu+3.693
wagen+3.592
ells+3.101
amaz+3.091
Neg logit contributions
-20.217
 awa-19.158
-18.573
 POS-17.804
 Lauder-17.350
Tokenate
Token ID378
Feature activation+9.75e-02

Pos logit contributions
+0.412
 POS+0.372
+0.338
 LW+0.334
idious+0.333
Neg logit contributions
wagen-0.383
amaz-0.358
 municip-0.354
pu-0.348
ells-0.346
Token's
Token ID338
Feature activation+5.52e-01

Pos logit contributions
wagen+0.655
amaz+0.610
 municip+0.595
pu+0.594
ells+0.586
Neg logit contributions
-0.850
 POS-0.768
-0.711
 LW-0.698
idis-0.692
Token Court
Token ID3078
Feature activation+1.15e-01

Pos logit contributions
wagen+3.152
pu+2.971
amaz+2.922
ells+2.890
rac+2.853
Neg logit contributions
-5.290
 POS-4.620
 awa-4.444
-4.419
idious-4.362
Token.
Token ID13
Feature activation-3.92e-02

Pos logit contributions
wagen+0.871
amaz+0.820
pu+0.807
ells+0.795
rac+0.791
Neg logit contributions
-0.898
 POS-0.789
-0.729
idious-0.711
-0.707
TokenThe
Token ID464
Feature activation-2.79e-01

Pos logit contributions
+0.174
 POS+0.155
+0.142
 LW+0.139
+0.138
Neg logit contributions
wagen-0.164
amaz-0.154
pu-0.151
 municip-0.150
ells-0.149
Token post
Token ID1281
Feature activation-2.15e-01

Pos logit contributions
+2.080
 POS+1.878
+1.685
 LW+1.685
idis+1.644
Neg logit contributions
wagen-2.206
amaz-2.081
pu-2.022
 municip-2.009
ells-2.005
Token Why
Token ID4162
Feature activation+1.48e-01

Pos logit contributions
+1.711
 POS+1.553
+1.401
 LW+1.401
idis+1.365
Neg logit contributions
wagen-1.594
amaz-1.480
 municip-1.457
pu-1.449
ells-1.435
Token Hillary
Token ID5073
Feature activation+1.76e-01

Pos logit contributions
wagen+0.973
amaz+0.907
pu+0.891
ells+0.878
 municip+0.877
Neg logit contributions
-1.303
 POS-1.166
-1.089
-1.066
idious-1.061
Token Clinton
Token ID2605
Feature activation+3.34e-01

Pos logit contributions
wagen+0.956
amaz+0.883
pu+0.867
ells+0.855
rac+0.844
Neg logit contributions
-1.755
 POS-1.580
-1.498
idious-1.468
-1.462
Token Is
Token ID1148
Feature activation+1.39e+00

Pos logit contributions
wagen+2.054
amaz+1.895
pu+1.856
ells+1.843
rac+1.823
Neg logit contributions
-3.084
 POS-2.750
-2.604
-2.533
idious-2.532
Token Richard
Token ID6219
Feature activation+4.88e-01

Pos logit contributions
wagen+6.692
rac+6.374
pu+6.266
 Lib+6.169
amaz+6.101
Neg logit contributions
-12.715
 POS-10.959
-10.951
-10.938
verett-10.495
Token Nixon
Token ID16642
Feature activation+1.07e-01

Pos logit contributions
wagen+2.114
amaz+1.931
pu+1.926
rac+1.828
ells+1.788
Neg logit contributions
-5.372
 POS-4.753
-4.620
 awa-4.569
-4.540
Token in
Token ID287
Feature activation-2.19e-01

Pos logit contributions
wagen+0.759
amaz+0.710
pu+0.697
 municip+0.690
ells+0.686
Neg logit contributions
-0.887
 POS-0.791
-0.735
 LW-0.715
idious-0.713
Token a
Token ID257
Feature activation-3.53e-01

Pos logit contributions
+1.686
 POS+1.504
+1.367
 LW+1.342
idious+1.326
Neg logit contributions
wagen-1.677
amaz-1.580
pu-1.547
 municip-1.544
ells-1.534
Token Pants
Token ID41689
Feature activation+3.37e-02

Pos logit contributions
+2.781
 POS+2.517
+2.270
 LW+2.245
idious+2.199
Neg logit contributions
wagen-2.624
amaz-2.482
 municip-2.414
ells-2.400
pu-2.395
Token and
Token ID290
Feature activation+7.27e-01

Pos logit contributions
wagen+3.413
pu+3.276
ells+3.189
rac+3.138
amaz+3.135
Neg logit contributions
-5.476
 POS-4.663
 awa-4.656
-4.487
-4.400
Token sol
Token ID1540
Feature activation+2.64e-01

Pos logit contributions
wagen+2.678
pu+2.460
 rotor+2.387
ells+2.327
 municip+2.300
Neg logit contributions
-8.387
-7.379
 POS-7.320
 awa-7.112
idis-7.058
Tokenes
Token ID274
Feature activation+5.14e-01

Pos logit contributions
wagen+1.701
amaz+1.585
pu+1.560
ells+1.552
rac+1.523
Neg logit contributions
-2.376
 POS-2.123
-1.986
 LW-1.927
-1.924
Token")
Token ID4943
Feature activation+5.25e-01

Pos logit contributions
wagen+3.161
pu+2.919
ells+2.870
amaz+2.830
rac+2.799
Neg logit contributions
-4.849
 POS-4.180
 awa-3.996
-3.871
-3.868
Token is
Token ID318
Feature activation+3.99e-01

Pos logit contributions
wagen+3.003
pu+2.829
amaz+2.829
ells+2.761
rac+2.722
Neg logit contributions
-5.039
 POS-4.347
 awa-4.202
-4.187
-4.072
Token an
Token ID281
Feature activation+1.39e+00

Pos logit contributions
wagen+2.643
amaz+2.468
pu+2.457
ells+2.371
rac+2.362
Neg logit contributions
-3.544
 POS-3.094
-2.909
 awa-2.833
-2.825
Token autos
Token ID44619
Feature activation-2.73e-01

Pos logit contributions
wagen+3.090
rac+2.902
pu+2.842
amaz+2.614
 rotor+2.588
Neg logit contributions
-16.856
-15.007
 POS-14.377
 awa-14.152
 Lauder-14.147
Tokenomal
Token ID18048
Feature activation+4.46e-01

Pos logit contributions
+2.231
 POS+2.040
idious+1.878
idis+1.860
 LW+1.841
Neg logit contributions
wagen-1.921
amaz-1.800
 municip-1.793
pu-1.731
 rotor-1.730
Token dominant
Token ID11410
Feature activation-5.87e-01

Pos logit contributions
wagen+2.654
pu+2.495
amaz+2.411
 municip+2.400
rac+2.386
Neg logit contributions
-4.246
 POS-3.607
-3.522
 awa-3.432
 LW-3.399
Token ker
Token ID41927
Feature activation+2.38e-01

Pos logit contributions
+1.897
 POS+1.718
 LW+1.309
idis+1.175
idious+1.167
Neg logit contributions
wagen-6.806
amaz-6.709
 municip-6.492
ells-6.471
rices-6.429
Tokenat
Token ID265
Feature activation+6.64e-02

Pos logit contributions
wagen+1.669
amaz+1.583
pu+1.565
rac+1.534
ells+1.522
Neg logit contributions
-2.027
 POS-1.786
-1.672
 awa-1.636
 LW-1.617
Token what
Token ID644
Feature activation+3.47e-01

Pos logit contributions
wagen+1.004
amaz+0.938
pu+0.920
 municip+0.912
ells+0.907
Neg logit contributions
-1.237
 POS-1.106
-1.026
 LW-1.002
-0.997
Token '
Token ID705
Feature activation+8.96e-02

Pos logit contributions
wagen+2.403
amaz+2.248
pu+2.225
 municip+2.201
ells+2.198
Neg logit contributions
-2.924
 POS-2.592
-2.413
-2.354
 LW-2.348
Tokenmakes
Token ID49123
Feature activation+1.39e-01

Pos logit contributions
wagen+0.649
amaz+0.608
pu+0.596
 municip+0.591
ells+0.588
Neg logit contributions
-0.734
 POS-0.657
-0.605
 LW-0.592
idious-0.587
Token us
Token ID514
Feature activation+6.70e-01

Pos logit contributions
wagen+0.849
amaz+0.787
pu+0.768
 municip+0.761
ells+0.756
Neg logit contributions
-1.294
 POS-1.170
-1.093
 LW-1.069
-1.066
Token who
Token ID508
Feature activation+5.03e-01

Pos logit contributions
wagen+4.183
amaz+4.051
 municip+3.983
pu+3.953
rac+3.952
Neg logit contributions
-5.831
 POS-4.980
-4.829
-4.791
idis-4.665
Token we
Token ID356
Feature activation+1.36e+00

Pos logit contributions
wagen+2.890
amaz+2.676
pu+2.637
ells+2.636
rac+2.603
Neg logit contributions
-4.777
 POS-4.269
-4.036
-3.977
idious-3.952
Token are
Token ID389
Feature activation+7.57e-01

Pos logit contributions
wagen+5.799
pu+5.706
ells+5.694
 medic+5.665
 Lib+5.461
Neg logit contributions
-13.533
-10.955
-10.945
 POS-10.932
 Lauder-10.723
Token.'
Token ID2637
Feature activation+1.20e-01

Pos logit contributions
wagen+4.020
amaz+3.669
pu+3.645
 municip+3.631
ells+3.591
Neg logit contributions
-7.501
 POS-6.586
-6.296
-6.200
 LW-6.190
Token
Token ID198
Feature activation-3.46e-02

Pos logit contributions
wagen+0.799
amaz+0.746
pu+0.733
ells+0.725
 municip+0.717
Neg logit contributions
-1.052
 POS-0.946
-0.878
 LW-0.861
idious-0.858
Token
Token ID198
Feature activation+2.74e-01

Pos logit contributions
+0.281
 POS+0.257
+0.235
 LW+0.231
idis+0.228
Neg logit contributions
wagen-0.252
amaz-0.234
 municip-0.233
pu-0.227
ells-0.225
TokenThe
Token ID464
Feature activation+9.82e-02

Pos logit contributions
wagen+1.818
amaz+1.707
pu+1.688
rac+1.651
ells+1.648
Neg logit contributions
-2.380
 POS-2.154
-1.979
 LW-1.960
idious-1.947
Tokenи
Token ID18849
Feature activation+2.09e-02

Pos logit contributions
wagen+3.631
amaz+3.426
pu+3.346
ells+3.338
rac+3.318
Neg logit contributions
-3.436
 POS-3.027
-2.783
-2.759
 LW-2.721
Tokenе
Token ID16843
Feature activation+2.72e-01

Pos logit contributions
wagen+0.167
amaz+0.157
pu+0.155
 municip+0.154
ells+0.153
Neg logit contributions
-0.155
 POS-0.137
-0.125
 LW-0.122
idious-0.121
Token
Token ID12466
Feature activation+3.05e-01

Pos logit contributions
wagen+1.911
amaz+1.785
pu+1.773
 municip+1.752
ells+1.743
Neg logit contributions
-2.241
 POS-2.001
-1.868
idious-1.832
-1.821
Token
Token ID121
Feature activation+1.91e-01

Pos logit contributions
wagen+2.322
amaz+2.170
pu+2.162
ells+2.121
 municip+2.120
Neg logit contributions
-2.347
 POS-2.095
-1.901
-1.875
idious-1.867
Tokenа
Token ID16142
Feature activation-4.96e-02

Pos logit contributions
wagen+1.463
amaz+1.379
pu+1.359
ells+1.349
 municip+1.344
Neg logit contributions
-1.461
 POS-1.299
-1.184
 LW-1.154
idious-1.150
Token
Token ID141
Feature activation+1.32e+00

Pos logit contributions
+0.392
 POS+0.349
+0.320
 LW+0.313
+0.309
Neg logit contributions
wagen-0.376
amaz-0.352
pu-0.345
 municip-0.343
ells-0.341
Token
Token ID239
Feature activation-4.31e-01

Pos logit contributions
wagen+4.842
amaz+4.638
rac+4.457
 municip+4.456
pu+4.282
Neg logit contributions
-13.185
 POS-12.230
 Lauder-11.392
 berth-11.310
-11.288
Tokenм
Token ID43108
Feature activation+3.02e-01

Pos logit contributions
+2.707
 POS+2.333
+2.100
 LW+1.994
idious+1.953
Neg logit contributions
wagen-3.956
amaz-3.730
pu-3.651
 municip-3.639
ells-3.627
Tokenн
Token ID22177
Feature activation+5.90e-01

Pos logit contributions
wagen+2.299
amaz+2.180
rac+2.134
pu+2.133
ells+2.120
Neg logit contributions
-2.270
 POS-2.043
-1.876
idis-1.841
-1.837
Tokenи
Token ID18849
Feature activation+1.48e-01

Pos logit contributions
wagen+3.123
amaz+2.891
pu+2.854
rac+2.817
ells+2.813
Neg logit contributions
-5.662
 POS-5.191
-4.910
idis-4.848
-4.825
Tokenк
Token ID31583
Feature activation+1.07e-01

Pos logit contributions
wagen+1.212
amaz+1.139
pu+1.131
 municip+1.122
rac+1.117
Neg logit contributions
-1.042
 POS-0.921
-0.842
 LW-0.823
-0.820
Token two
Token ID734
Feature activation+5.84e-02

Pos logit contributions
wagen+2.144
amaz+2.025
pu+1.971
 municip+1.968
ells+1.937
Neg logit contributions
-2.729
 POS-2.410
-2.250
-2.187
idious-2.181
Token-
Token ID12
Feature activation+2.20e-01

Pos logit contributions
wagen+0.443
amaz+0.416
pu+0.411
 municip+0.407
ells+0.403
Neg logit contributions
-0.459
 POS-0.406
-0.373
 LW-0.363
-0.362
Tokenthirds
Token ID17936
Feature activation+2.30e-01

Pos logit contributions
wagen+1.638
amaz+1.550
pu+1.530
rac+1.487
ells+1.479
Neg logit contributions
-1.756
 POS-1.564
-1.436
 LW-1.395
-1.388
Token of
Token ID286
Feature activation+2.81e-01

Pos logit contributions
wagen+1.693
amaz+1.610
pu+1.577
 municip+1.571
rac+1.544
Neg logit contributions
-1.837
 POS-1.625
-1.492
 LW-1.458
-1.457
Token where
Token ID810
Feature activation+6.05e-01

Pos logit contributions
wagen+2.139
amaz+2.034
pu+1.984
 municip+1.975
ells+1.954
Neg logit contributions
-2.194
 POS-1.926
-1.768
-1.729
 LW-1.720
Token they
Token ID484
Feature activation+1.30e+00

Pos logit contributions
wagen+4.173
amaz+3.996
 municip+3.889
pu+3.861
ells+3.843
Neg logit contributions
-4.965
 POS-4.367
-3.985
-3.982
idious-3.955
Token were
Token ID547
Feature activation+1.07e+00

Pos logit contributions
pu+6.868
wagen+6.827
amaz+6.700
 medic+6.579
ells+6.526
Neg logit contributions
-11.080
 POS-9.408
idious-9.196
 awa-9.087
-8.991
Token at
Token ID379
Feature activation+6.57e-01

Pos logit contributions
amaz+6.415
wagen+6.348
 municip+6.160
 rotor+5.998
pu+5.955
Neg logit contributions
-8.996
 POS-7.919
wine-7.439
idious-7.317
-7.265
Token the
Token ID262
Feature activation+3.30e-01

Pos logit contributions
wagen+4.267
amaz+4.146
 municip+4.038
 rotor+3.968
pu+3.944
Neg logit contributions
-5.584
 POS-4.872
-4.525
-4.493
wine-4.488
Token start
Token ID923
Feature activation+4.24e-01

Pos logit contributions
wagen+2.269
amaz+2.136
 municip+2.105
pu+2.094
ells+2.069
Neg logit contributions
-2.813
 POS-2.465
-2.310
-2.260
idis-2.247
Token,
Token ID11
Feature activation+6.16e-01

Pos logit contributions
wagen+2.954
amaz+2.763
pu+2.717
 municip+2.699
ells+2.668
Neg logit contributions
-3.485
 POS-3.137
-2.905
 LW-2.822
-2.819
Token 4
Token ID604
Feature activation+6.64e-01

Pos logit contributions
wagen+1.840
amaz+1.649
pu+1.632
rac+1.560
ells+1.558
Neg logit contributions
-4.293
 POS-3.863
-3.710
-3.613
 LW-3.597
Token years
Token ID812
Feature activation+1.02e-01

Pos logit contributions
wagen+3.761
amaz+3.597
pu+3.503
ells+3.441
 rotor+3.378
Neg logit contributions
-6.382
 POS-5.464
idious-5.271
-5.212
 awa-5.192
Token old
Token ID1468
Feature activation+1.75e-01

Pos logit contributions
wagen+0.569
amaz+0.522
pu+0.510
 municip+0.504
ells+0.501
Neg logit contributions
-1.000
 POS-0.908
-0.852
 LW-0.835
idis-0.831
Token and
Token ID290
Feature activation+3.33e-01

Pos logit contributions
wagen+1.241
amaz+1.158
pu+1.146
ells+1.129
 municip+1.121
Neg logit contributions
-1.453
 POS-1.286
-1.195
idious-1.162
-1.162
Token once
Token ID1752
Feature activation-3.24e-01

Pos logit contributions
wagen+1.039
amaz+0.854
pu+0.838
ells+0.817
rac+0.798
Neg logit contributions
-4.131
 POS-3.723
-3.595
-3.564
idious-3.553
Token I
Token ID314
Feature activation+1.28e+00

Pos logit contributions
+1.854
 POS+1.742
 LW+1.512
+1.486
idis+1.423
Neg logit contributions
wagen-3.061
amaz-2.915
 municip-2.903
pu-2.823
rices-2.813
Token turn
Token ID1210
Feature activation+5.87e-01

Pos logit contributions
wagen+5.478
 rotor+5.278
pu+5.070
rac+5.069
 medic+4.836
Neg logit contributions
-13.570
-11.489
-11.330
 POS-11.296
 awa-11.173
Token AA
Token ID15923
Feature activation+1.09e+00

Pos logit contributions
wagen+4.723
amaz+4.535
pu+4.481
rac+4.435
ells+4.420
Neg logit contributions
-4.171
 POS-3.478
-3.331
idious-3.244
lihood-3.197
Token on
Token ID319
Feature activation-3.34e-02

Pos logit contributions
pu+5.658
rac+5.436
wagen+5.382
ells+5.245
amaz+5.241
Neg logit contributions
-10.438
 awa-9.074
idious-8.742
 POS-8.601
-8.571
Token things
Token ID1243
Feature activation+3.59e-01

Pos logit contributions
+0.260
 POS+0.235
+0.212
 LW+0.210
idious+0.205
Neg logit contributions
wagen-0.256
amaz-0.239
pu-0.235
 municip-0.234
rac-0.229
Token slow
Token ID3105
Feature activation+3.45e-01

Pos logit contributions
wagen+2.195
pu+2.066
amaz+2.060
rac+2.019
 rotor+2.007
Neg logit contributions
-3.306
 POS-2.908
-2.799
-2.727
idis-2.697
Token
Token ID141
Feature activation+7.16e-01

Pos logit contributions
+1.280
 POS+1.120
+1.017
 LW+0.991
idis+0.978
Neg logit contributions
wagen-1.494
amaz-1.406
pu-1.381
 municip-1.377
ells-1.362
Token
Token ID227
Feature activation+7.21e-02

Pos logit contributions
wagen+3.258
amaz+2.848
pu+2.794
rices+2.792
 municip+2.754
Neg logit contributions
-7.479
 POS-6.837
-6.373
-6.366
lihood-6.325
Token
Token ID12466
Feature activation+1.40e-01

Pos logit contributions
wagen+0.488
amaz+0.455
pu+0.446
 municip+0.443
ells+0.439
Neg logit contributions
-0.622
 POS-0.558
-0.520
 LW-0.507
idious-0.505
Token
Token ID121
Feature activation+1.87e-01

Pos logit contributions
wagen+1.085
amaz+1.018
pu+1.007
 municip+0.993
ells+0.991
Neg logit contributions
-1.067
 POS-0.949
-0.865
 LW-0.845
idious-0.842
Tokenа
Token ID16142
Feature activation-3.10e-02

Pos logit contributions
wagen+1.437
amaz+1.357
pu+1.335
ells+1.324
 municip+1.319
Neg logit contributions
-1.422
 POS-1.268
-1.154
 LW-1.125
idious-1.118
Token
Token ID141
Feature activation+1.25e+00

Pos logit contributions
+0.245
 POS+0.218
+0.200
 LW+0.195
+0.193
Neg logit contributions
wagen-0.235
amaz-0.220
pu-0.216
 municip-0.214
ells-0.213
Token
Token ID239
Feature activation-3.73e-01

Pos logit contributions
wagen+5.343
amaz+5.113
 municip+4.934
rac+4.906
pu+4.791
Neg logit contributions
-12.053
 POS-11.104
-10.280
 Lauder-10.269
-10.226
Tokenм
Token ID43108
Feature activation+3.22e-01

Pos logit contributions
+2.372
 POS+2.047
+1.841
 LW+1.760
idious+1.721
Neg logit contributions
wagen-3.403
amaz-3.207
pu-3.143
 municip-3.131
ells-3.119
Tokenн
Token ID22177
Feature activation+4.39e-01

Pos logit contributions
wagen+2.438
amaz+2.316
pu+2.265
rac+2.262
ells+2.250
Neg logit contributions
-2.416
 POS-2.173
-2.004
idis-1.964
-1.963
Tokenи
Token ID18849
Feature activation+1.96e-02

Pos logit contributions
wagen+2.465
amaz+2.283
pu+2.246
ells+2.215
rac+2.204
Neg logit contributions
-4.168
 POS-3.806
-3.591
idis-3.537
-3.517
Tokenк
Token ID31583
Feature activation+6.07e-02

Pos logit contributions
wagen+0.165
amaz+0.156
pu+0.154
 municip+0.153
ells+0.152
Neg logit contributions
-0.136
 POS-0.120
-0.109
 LW-0.106
-0.105
Token
Token ID447
Feature activation-2.16e-01

Pos logit contributions
+1.182
 POS+1.073
+1.007
 LW+0.983
+0.973
Neg logit contributions
wagen-0.780
amaz-0.715
 municip-0.710
pu-0.690
ells-0.684
Token
Token ID247
Feature activation-5.89e-02

Pos logit contributions
+1.589
 POS+1.412
+1.295
 LW+1.261
+1.245
Neg logit contributions
wagen-1.745
amaz-1.644
 municip-1.607
pu-1.605
ells-1.586
Tokent
Token ID83
Feature activation-1.15e-01

Pos logit contributions
+0.328
 POS+0.276
+0.241
 LW+0.234
idious+0.230
Neg logit contributions
wagen-0.582
amaz-0.556
pu-0.550
 municip-0.543
ells-0.543
Token be
Token ID307
Feature activation+1.07e-01

Pos logit contributions
+0.941
 POS+0.867
+0.788
 LW+0.772
idis+0.766
Neg logit contributions
wagen-0.826
amaz-0.771
 municip-0.752
ells-0.752
pu-0.750
Token where
Token ID810
Feature activation+4.53e-01

Pos logit contributions
wagen+0.787
amaz+0.738
pu+0.724
 municip+0.718
ells+0.714
Neg logit contributions
-0.869
 POS-0.772
-0.713
 LW-0.695
idious-0.691
Token we
Token ID356
Feature activation+1.24e+00

Pos logit contributions
wagen+2.724
amaz+2.514
pu+2.455
 municip+2.439
ells+2.420
Neg logit contributions
-4.230
 POS-3.736
-3.559
idious-3.481
idis-3.459
Token are
Token ID389
Feature activation+6.73e-01

Pos logit contributions
wagen+6.131
pu+5.913
amaz+5.893
ells+5.829
 rotor+5.802
Neg logit contributions
-11.041
 POS-9.417
idious-9.406
-9.205
lihood-9.148
Token if
Token ID611
Feature activation-1.09e-01

Pos logit contributions
wagen+4.700
amaz+4.435
 municip+4.375
pu+4.314
 rotor+4.273
Neg logit contributions
-5.525
 POS-4.780
-4.497
-4.364
idious-4.353
Token I
Token ID314
Feature activation+8.18e-02

Pos logit contributions
+0.907
 POS+0.817
+0.752
 LW+0.738
idis+0.730
Neg logit contributions
wagen-0.768
amaz-0.721
pu-0.705
 municip-0.698
ells-0.696
Token was
Token ID373
Feature activation+2.03e-01

Pos logit contributions
wagen+0.582
amaz+0.544
pu+0.535
 municip+0.529
ells+0.526
Neg logit contributions
-0.681
 POS-0.607
-0.561
 LW-0.549
idious-0.546
Token the
Token ID262
Feature activation+3.43e-01

Pos logit contributions
wagen+1.516
amaz+1.420
pu+1.394
 municip+1.389
ells+1.369
Neg logit contributions
-1.624
 POS-1.423
-1.327
 LW-1.280
idious-1.276
Token person
Token ID1048
Feature activation-1.32e-01

Pos logit contributions
+1.534
 POS+1.408
+1.307
 LW+1.286
idious+1.277
Neg logit contributions
wagen-0.938
amaz-0.868
pu-0.846
 municip-0.834
ells-0.830
Token loaded
Token ID9639
Feature activation+1.13e-02

Pos logit contributions
+0.949
 POS+0.840
+0.760
 LW+0.746
idious+0.733
Neg logit contributions
wagen-1.089
amaz-1.030
pu-1.013
 municip-1.003
ells-0.999
Token,
Token ID11
Feature activation+1.67e-01

Pos logit contributions
wagen+0.083
amaz+0.078
pu+0.077
 municip+0.076
ells+0.076
Neg logit contributions
-0.091
 POS-0.081
-0.075
 LW-0.073
idis-0.072
Token initially
Token ID7317
Feature activation+5.27e-01

Pos logit contributions
wagen+0.820
amaz+0.742
pu+0.725
 municip+0.723
ells+0.714
Neg logit contributions
-1.757
 POS-1.597
-1.509
 LW-1.480
idious-1.476
Token failing
Token ID9894
Feature activation+3.46e-02

Pos logit contributions
wagen+3.869
pu+3.659
 municip+3.640
amaz+3.628
 rotor+3.588
Neg logit contributions
-4.081
 POS-3.521
-3.245
idis-3.242
-3.223
Token,
Token ID11
Feature activation+1.23e+00

Pos logit contributions
wagen+0.254
amaz+0.237
pu+0.233
 municip+0.232
ells+0.231
Neg logit contributions
-0.280
 POS-0.249
-0.230
 LW-0.224
-0.224
Token sec
Token ID792
Feature activation+8.44e-01

Pos logit contributions
wagen+5.371
 municip+5.095
 rotor+5.021
 medic+4.866
rac+4.839
Neg logit contributions
-12.197
 POS-10.660
-10.560
-10.316
wine-10.296
Tokenures
Token ID942
Feature activation+4.95e-01

Pos logit contributions
ells+3.596
wagen+3.580
pu+3.493
rices+3.452
rac+3.381
Neg logit contributions
-9.170
 POS-8.071
-7.784
-7.749
 LW-7.653
Token mark
Token ID1317
Feature activation+8.96e-02

Pos logit contributions
wagen+2.851
amaz+2.600
 municip+2.552
 rotor+2.531
pu+2.525
Neg logit contributions
-4.640
 POS-4.108
-3.980
-3.944
idis-3.932
Token (
Token ID357
Feature activation-2.45e-01

Pos logit contributions
wagen+0.579
amaz+0.533
pu+0.526
 municip+0.523
ells+0.516
Neg logit contributions
-0.800
 POS-0.722
-0.674
-0.658
idis-0.657
Token1
Token ID16
Feature activation-1.25e-01

Pos logit contributions
+1.902
 POS+1.687
+1.553
 LW+1.519
idis+1.513
Neg logit contributions
wagen-1.860
amaz-1.748
 municip-1.719
pu-1.703
ells-1.685
Token Medicare
Token ID13771
Feature activation+1.27e-01

Pos logit contributions
wagen+1.394
amaz+1.308
pu+1.288
 municip+1.279
ells+1.258
Neg logit contributions
-1.729
 POS-1.537
-1.427
-1.396
 LW-1.390
Token to
Token ID284
Feature activation+3.00e-01

Pos logit contributions
wagen+0.954
amaz+0.898
pu+0.881
 municip+0.877
ells+0.871
Neg logit contributions
-1.000
 POS-0.886
-0.815
 LW-0.796
idious-0.791
Token stay
Token ID2652
Feature activation+2.42e-01

Pos logit contributions
wagen+2.028
amaz+1.890
pu+1.853
 municip+1.841
ells+1.829
Neg logit contributions
-2.599
 POS-2.326
-2.161
 LW-2.111
-2.107
Token the
Token ID262
Feature activation+7.53e-01

Pos logit contributions
wagen+1.642
amaz+1.550
 municip+1.531
pu+1.527
ells+1.503
Neg logit contributions
-2.076
 POS-1.830
-1.712
-1.671
 LW-1.671
Token way
Token ID835
Feature activation+9.17e-01

Pos logit contributions
wagen+5.176
 municip+4.845
pu+4.815
 medic+4.785
amaz+4.780
Neg logit contributions
-6.250
 POS-5.245
-4.955
lihood-4.941
-4.905
Token it
Token ID340
Feature activation+1.22e+00

Pos logit contributions
wagen+4.748
amaz+4.706
 municip+4.664
 medic+4.558
 Lib+4.500
Neg logit contributions
-8.825
 POS-7.450
-7.152
idis-7.042
lihood-7.037
Token is
Token ID318
Feature activation+8.78e-01

Pos logit contributions
ells+4.835
pu+4.805
amaz+4.788
 Lib+4.608
 medic+4.573
Neg logit contributions
-12.705
lihood-10.931
-10.673
idious-10.640
 POS-10.587
Token while
Token ID981
Feature activation-1.68e-01

Pos logit contributions
 municip+5.133
wagen+5.119
amaz+4.978
pu+4.856
 medic+4.796
Neg logit contributions
-7.910
 POS-6.766
-6.460
-6.291
 awa-6.240
Token 28
Token ID2579
Feature activation-5.13e-01

Pos logit contributions
+1.237
 POS+1.140
 LW+1.034
+1.029
idious+1.003
Neg logit contributions
wagen-1.325
amaz-1.231
pu-1.220
ells-1.214
rices-1.199
Token percent
Token ID1411
Feature activation+6.14e-01

Pos logit contributions
+2.226
 POS+2.093
 LW+1.689
+1.676
idis+1.571
Neg logit contributions
wagen-5.402
amaz-5.202
rac-5.083
ells-5.076
 municip-5.069
Token think
Token ID892
Feature activation+2.96e-01

Pos logit contributions
wagen+3.778
amaz+3.479
 municip+3.439
pu+3.412
 rotor+3.375
Neg logit contributions
-5.693
 POS-4.935
-4.648
-4.590
idious-4.527
Token proved
Token ID8302
Feature activation-7.48e-02

Pos logit contributions
wagen+3.965
 rotor+3.558
pu+3.535
amaz+3.478
 municip+3.449
Neg logit contributions
-6.113
 POS-5.340
-5.106
-5.094
lihood-4.968
Token its
Token ID663
Feature activation+3.59e-01

Pos logit contributions
+0.559
 POS+0.501
+0.455
 LW+0.448
idious+0.442
Neg logit contributions
wagen-0.590
amaz-0.556
pu-0.542
ells-0.541
 municip-0.538
Token met
Token ID1138
Feature activation+1.08e-01

Pos logit contributions
wagen+2.553
amaz+2.365
pu+2.333
 municip+2.327
ells+2.286
Neg logit contributions
-3.004
 POS-2.655
-2.487
-2.404
idis-2.401
Tokentle
Token ID7100
Feature activation+5.23e-01

Pos logit contributions
wagen+1.094
amaz+1.044
pu+1.030
 municip+1.022
ells+1.020
Neg logit contributions
-0.571
 POS-0.475
-0.414
 LW-0.396
-0.390
Token off
Token ID572
Feature activation+8.67e-01

Pos logit contributions
wagen+3.521
amaz+3.276
 municip+3.257
pu+3.249
 rotor+3.211
Neg logit contributions
-4.564
 POS-3.947
-3.718
-3.615
idis-3.569
Token-
Token ID12
Feature activation+1.20e+00

Pos logit contributions
wagen+5.797
rac+5.499
 rotor+5.482
amaz+5.436
pu+5.427
Neg logit contributions
-7.149
 POS-6.039
-5.845
-5.752
verett-5.599
Tokenroad
Token ID6344
Feature activation+4.48e-01

Pos logit contributions
rac+3.961
wagen+3.943
pu+3.800
amaz+3.597
ells+3.391
Neg logit contributions
-13.736
 POS-12.292
-11.815
 Lauder-11.775
 awa-11.718
Token
Token ID784
Feature activation+5.91e-01

Pos logit contributions
wagen+3.111
amaz+2.863
pu+2.818
 municip+2.811
rac+2.775
Neg logit contributions
-3.830
 POS-3.371
-3.156
-3.081
idis-3.059
Token racing
Token ID11717
Feature activation+5.71e-01

Pos logit contributions
wagen+4.210
amaz+3.902
 municip+3.850
pu+3.847
 rotor+3.810
Neg logit contributions
-4.843
 POS-4.264
-3.991
idious-3.877
idis-3.869
Token in
Token ID287
Feature activation+3.25e-01

Pos logit contributions
wagen+4.046
amaz+3.709
 municip+3.675
 rotor+3.628
pu+3.617
Neg logit contributions
-4.803
 POS-4.177
-3.974
-3.870
idious-3.833
Token the
Token ID262
Feature activation-6.87e-02

Pos logit contributions
wagen+2.375
amaz+2.232
 municip+2.184
pu+2.184
ells+2.162
Neg logit contributions
-2.637
 POS-2.322
-2.160
-2.109
idious-2.091
Tokenb
Token ID65
Feature activation+3.92e-01

Pos logit contributions
+0.687
 POS+0.612
+0.556
 LW+0.547
idis+0.545
Neg logit contributions
wagen-0.750
amaz-0.703
 municip-0.698
pu-0.687
ells-0.683
Token]
Token ID60
Feature activation+4.42e-02

Pos logit contributions
wagen+2.450
amaz+2.323
pu+2.302
ells+2.251
rac+2.244
Neg logit contributions
-3.558
 POS-3.154
 awa-2.954
-2.949
idious-2.898
TokenMega
Token ID43471
Feature activation+1.16e-01

Pos logit contributions
wagen+0.314
amaz+0.294
pu+0.288
 municip+0.285
ells+0.284
Neg logit contributions
-0.368
 POS-0.329
-0.304
 LW-0.297
idious-0.295
Token Pony
Token ID30377
Feature activation-4.53e-02

Pos logit contributions
wagen+0.725
amaz+0.677
pu+0.657
 municip+0.649
ells+0.646
Neg logit contributions
-1.063
 POS-0.958
-0.893
 LW-0.874
-0.873
Token[/
Token ID13412
Feature activation+1.05e-01

Pos logit contributions
+0.375
 POS+0.335
+0.309
 LW+0.303
idis+0.300
Neg logit contributions
wagen-0.325
amaz-0.304
pu-0.298
 municip-0.296
ells-0.294
Tokenb
Token ID65
Feature activation+1.20e+00

Pos logit contributions
wagen+0.804
amaz+0.757
pu+0.744
ells+0.734
 municip+0.733
Neg logit contributions
-0.819
 POS-0.724
-0.665
 LW-0.648
idious-0.643
Token][/
Token ID44926
Feature activation-1.58e-01

Pos logit contributions
pu+4.615
wagen+4.550
rac+4.545
amaz+4.397
ells+4.356
Neg logit contributions
-12.449
 awa-11.143
 POS-11.123
-10.790
idious-10.630
Tokencolor
Token ID8043
Feature activation+8.92e-01

Pos logit contributions
+1.398
 POS+1.283
+1.186
idis+1.169
 LW+1.168
Neg logit contributions
wagen-1.013
 municip-0.933
amaz-0.932
pu-0.902
ells-0.894
Token]
Token ID60
Feature activation+1.94e-01

Pos logit contributions
wagen+4.551
amaz+4.385
pu+4.379
rac+4.370
ells+4.205
Neg logit contributions
-8.678
 awa-7.639
 POS-7.517
-7.205
idious-7.201
Token -
Token ID532
Feature activation+6.35e-02

Pos logit contributions
wagen+1.395
amaz+1.319
pu+1.284
ells+1.274
 municip+1.265
Neg logit contributions
-1.593
 POS-1.405
-1.298
idious-1.284
-1.279
Token 20
Token ID1160
Feature activation+7.18e-02

Pos logit contributions
wagen+0.472
amaz+0.444
pu+0.435
 municip+0.432
ells+0.429
Neg logit contributions
-0.508
 POS-0.451
-0.416
 LW-0.405
idious-0.404
Token:
Token ID25
Feature activation+3.63e-01

Pos logit contributions
wagen+2.205
amaz+2.099
pu+2.071
ells+2.036
rac+2.019
Neg logit contributions
-2.594
 POS-2.275
-2.107
 awa-2.099
idious-2.068
Token Because
Token ID4362
Feature activation+2.84e-01

Pos logit contributions
wagen+2.130
amaz+1.988
pu+1.957
ells+1.933
 municip+1.883
Neg logit contributions
-3.437
 POS-3.065
-2.882
idious-2.850
-2.827
Token of
Token ID286
Feature activation-3.48e-01

Pos logit contributions
wagen+2.067
amaz+1.952
pu+1.939
ells+1.896
rac+1.896
Neg logit contributions
-2.313
 POS-2.009
-1.874
 awa-1.845
idious-1.827
Token a
Token ID257
Feature activation+4.98e-02

Pos logit contributions
+2.328
 POS+2.170
 LW+1.922
+1.910
idis+1.842
Neg logit contributions
wagen-2.966
 municip-2.777
amaz-2.745
rices-2.665
pu-2.661
Token failed
Token ID4054
Feature activation-9.49e-02

Pos logit contributions
wagen+0.386
amaz+0.363
pu+0.357
 municip+0.354
ells+0.352
Neg logit contributions
-0.385
 POS-0.339
-0.311
 LW-0.303
-0.301
Token will
Token ID481
Feature activation+1.20e+00

Pos logit contributions
+0.625
 POS+0.563
+0.497
 LW+0.496
idious+0.480
Neg logit contributions
wagen-0.830
amaz-0.791
 municip-0.772
pu-0.767
ells-0.760
Token save
Token ID3613
Feature activation+8.67e-02

Pos logit contributions
wagen+1.685
pu+1.563
 rotor+1.515
 medic+1.495
rac+1.359
Neg logit contributions
-15.803
-14.097
-13.885
 awa-13.835
 POS-13.795
Token,
Token ID11
Feature activation+4.81e-01

Pos logit contributions
wagen+0.644
amaz+0.605
pu+0.595
ells+0.587
 municip+0.586
Neg logit contributions
-0.695
 POS-0.616
-0.568
 LW-0.554
idious-0.551
Token your
Token ID534
Feature activation+5.83e-01

Pos logit contributions
wagen+3.424
pu+3.268
amaz+3.214
ells+3.158
rac+3.144
Neg logit contributions
-3.955
 POS-3.356
-3.246
 awa-3.223
-3.111
Token character
Token ID2095
Feature activation+7.61e-01

Pos logit contributions
wagen+2.485
amaz+2.182
pu+2.174
 rotor+2.105
rac+2.105
Neg logit contributions
-6.538
 POS-5.660
-5.539
 awa-5.520
-5.443
Token now
Token ID783
Feature activation-1.96e-02

Pos logit contributions
wagen+3.845
pu+3.378
ells+3.266
amaz+3.216
rac+3.201
Neg logit contributions
-7.967
-6.814
 POS-6.792
 awa-6.762
-6.617
Tokenb
Token ID65
Feature activation+6.12e-01

Pos logit contributions
wagen+2.092
amaz+1.974
pu+1.951
ells+1.915
rac+1.905
Neg logit contributions
-2.257
 POS-1.998
-1.842
 LW-1.792
idious-1.778
Tokenong
Token ID506
Feature activation+4.00e-02

Pos logit contributions
wagen+5.804
amaz+5.588
ells+5.515
pu+5.508
rac+5.463
Neg logit contributions
-3.724
 POS-3.048
 awa-2.709
-2.708
-2.606
Tokeni
Token ID72
Feature activation+4.25e-01

Pos logit contributions
wagen+0.239
amaz+0.221
 municip+0.214
pu+0.214
ells+0.210
Neg logit contributions
-0.377
 POS-0.343
-0.321
 LW-0.315
idis-0.314
Token and
Token ID290
Feature activation+2.26e-01

Pos logit contributions
wagen+2.722
amaz+2.558
pu+2.525
ells+2.489
 municip+2.457
Neg logit contributions
-3.713
 POS-3.379
-3.115
idious-3.067
 LW-3.064
Token Sh
Token ID911
Feature activation-1.70e-02

Pos logit contributions
wagen+1.341
amaz+1.242
pu+1.223
 municip+1.200
ells+1.189
Neg logit contributions
-2.139
 POS-1.927
-1.811
 LW-1.770
-1.764
Tokenus
Token ID385
Feature activation+1.19e+00

Pos logit contributions
+0.156
 POS+0.143
+0.133
idis+0.131
 LW+0.130
Neg logit contributions
wagen-0.105
amaz-0.096
 municip-0.095
pu-0.093
ells-0.092
Tokenaku
Token ID8719
Feature activation-5.61e-01

Pos logit contributions
amaz+3.760
wagen+3.640
pu+3.417
ells+3.069
rac+3.039
Neg logit contributions
-14.012
 POS-12.873
-12.178
 Lauder-12.154
 awa-12.092
Token U
Token ID471
Feature activation+3.15e-01

Pos logit contributions
+3.661
 POS+3.230
+2.880
 LW+2.770
idis+2.725
Neg logit contributions
wagen-4.855
amaz-4.695
rac-4.533
ells-4.521
 municip-4.508
Tokench
Token ID354
Feature activation-9.72e-02

Pos logit contributions
wagen+1.187
amaz+1.062
pu+1.026
ells+0.999
rac+0.982
Neg logit contributions
-3.668
 POS-3.386
-3.216
 LW-3.159
idious-3.155
Tokeniy
Token ID7745
Feature activation-2.75e-02

Pos logit contributions
+0.731
 POS+0.651
+0.597
idis+0.589
 LW+0.586
Neg logit contributions
wagen-0.755
amaz-0.708
 municip-0.706
pu-0.694
ells-0.686
Tokenama
Token ID1689
Feature activation+3.34e-01

Pos logit contributions
+0.231
 POS+0.210
+0.195
 LW+0.190
idis+0.190
Neg logit contributions
wagen-0.190
amaz-0.177
 municip-0.175
pu-0.173
ells-0.172
Token
Token ID198
Feature activation+1.33e-01

Pos logit contributions
+0.019
 POS+0.017
+0.016
 LW+0.016
idis+0.015
Neg logit contributions
wagen-0.017
amaz-0.016
 municip-0.016
pu-0.015
ells-0.015
TokenHillary
Token ID20397
Feature activation+7.29e-02

Pos logit contributions
wagen+0.886
amaz+0.826
pu+0.809
 municip+0.797
ells+0.797
Neg logit contributions
-1.171
 POS-1.056
-0.978
 LW-0.960
idious-0.955
Token Clinton
Token ID2605
Feature activation+2.61e-02

Pos logit contributions
wagen+0.397
amaz+0.364
pu+0.357
ells+0.355
 municip+0.352
Neg logit contributions
-0.726
 POS-0.657
-0.618
idious-0.607
-0.605
Token's
Token ID338
Feature activation+1.84e-01

Pos logit contributions
wagen+0.173
amaz+0.161
pu+0.158
 municip+0.156
ells+0.155
Neg logit contributions
-0.230
 POS-0.207
-0.193
 LW-0.189
idious-0.187
Token campaign
Token ID1923
Feature activation+1.41e-01

Pos logit contributions
wagen+0.872
amaz+0.790
pu+0.761
 municip+0.759
ells+0.753
Neg logit contributions
-1.976
 POS-1.796
-1.701
-1.671
 LW-1.667
Token manager
Token ID4706
Feature activation+1.18e+00

Pos logit contributions
wagen+1.033
amaz+0.963
pu+0.940
 municip+0.940
ells+0.934
Neg logit contributions
-1.156
 POS-1.016
-0.941
idious-0.919
-0.917
Token Rob
Token ID3851
Feature activation+4.63e-01

Pos logit contributions
wagen+4.803
 Lib+4.298
rac+4.283
amaz+4.278
ells+4.177
Neg logit contributions
-12.147
 POS-10.651
lihood-10.413
-10.295
-10.217
Tokenby
Token ID1525
Feature activation-4.83e-02

Pos logit contributions
wagen+2.960
amaz+2.747
pu+2.700
rac+2.669
ells+2.663
Neg logit contributions
-4.215
 POS-3.743
-3.552
-3.422
 awa-3.416
Token M
Token ID337
Feature activation+2.29e-01

Pos logit contributions
+0.395
 POS+0.359
+0.328
 LW+0.325
idis+0.319
Neg logit contributions
wagen-0.349
amaz-0.328
 municip-0.321
pu-0.318
ells-0.315
Tokenook
Token ID566
Feature activation-3.51e-01

Pos logit contributions
wagen+1.291
amaz+1.184
pu+1.156
ells+1.143
 municip+1.131
Neg logit contributions
-2.245
 POS-2.041
-1.909
 LW-1.872
-1.865
Token said
Token ID531
Feature activation-3.94e-02

Pos logit contributions
+2.621
 POS+2.425
+2.171
 LW+2.157
idis+2.138
Neg logit contributions
wagen-2.681
amaz-2.535
 municip-2.481
pu-2.479
ells-2.441
Token placeholder
Token ID46076
Feature activation-1.84e-01

Pos logit contributions
wagen+0.956
amaz+0.900
pu+0.885
 municip+0.874
ells+0.872
Neg logit contributions
-1.114
 POS-0.986
-0.920
 LW-0.894
idis-0.892
Token
Token ID3556
Feature activation+1.32e-01

Pos logit contributions
+1.504
 POS+1.368
+1.243
 LW+1.225
idious+1.212
Neg logit contributions
wagen-1.331
amaz-1.233
 municip-1.206
pu-1.205
ells-1.187
Tokenp
Token ID79
Feature activation+1.09e+00

Pos logit contributions
wagen+0.913
amaz+0.852
pu+0.839
 municip+0.825
ells+0.823
Neg logit contributions
-1.134
 POS-1.015
-0.942
 LW-0.920
idious-0.914
Token>
Token ID29
Feature activation+3.99e-01

Pos logit contributions
wagen+2.132
pu+2.000
amaz+1.932
>+1.929
rac+1.910
Neg logit contributions
-13.200
 POS-12.349
-12.017
lihood-11.757
 Lauder-11.718
Token
Token ID7359
Feature activation+4.25e-01

Pos logit contributions
wagen+2.029
amaz+1.879
pu+1.857
ells+1.856
rac+1.789
Neg logit contributions
-4.008
 POS-3.649
-3.459
 LW-3.404
idious-3.379
Tokenli
Token ID4528
Feature activation+1.17e+00

Pos logit contributions
wagen+3.390
pu+3.290
amaz+3.213
rac+3.159
ells+3.158
Neg logit contributions
-3.185
 POS-2.740
-2.529
 LW-2.438
-2.433
Token>
Token ID29
Feature activation+6.34e-01

Pos logit contributions
pu+2.386
wagen+2.282
>+2.165
rac+2.163
amaz+2.086
Neg logit contributions
-14.131
 POS-13.018
 awa-12.805
-12.782
idious-12.488
Token
Token ID7359
Feature activation+4.34e-01

Pos logit contributions
wagen+2.863
ells+2.676
pu+2.667
amaz+2.632
rac+2.543
Neg logit contributions
-6.470
 POS-5.946
-5.653
 LW-5.586
idious-5.541
Tokenul
Token ID377
Feature activation+1.08e+00

Pos logit contributions
wagen+2.786
pu+2.677
amaz+2.591
ells+2.584
rac+2.561
Neg logit contributions
-3.943
 POS-3.470
-3.296
-3.191
 LW-3.167
Token>
Token ID29
Feature activation+4.60e-01

Pos logit contributions
pu+2.192
wagen+1.992
rac+1.970
>+1.851
ells+1.838
Neg logit contributions
-13.524
 awa-12.445
 POS-12.258
-11.979
idious-11.813
Token
Token ID7359
Feature activation+3.56e-01

Pos logit contributions
wagen+2.333
pu+2.148
ells+2.131
amaz+2.115
 rotor+2.055
Neg logit contributions
-4.584
 POS-4.189
-3.943
 LW-3.912
idious-3.879
Token know
Token ID760
Feature activation+1.42e-01

Pos logit contributions
+0.757
 POS+0.717
+0.647
 LW+0.638
idious+0.628
Neg logit contributions
wagen-0.667
amaz-0.622
pu-0.603
 municip-0.601
ells-0.600
Token yet
Token ID1865
Feature activation-2.50e-01

Pos logit contributions
wagen+1.033
amaz+0.978
pu+0.957
 municip+0.957
ells+0.944
Neg logit contributions
-1.158
 POS-1.004
-0.933
-0.911
idious-0.905
Token how
Token ID703
Feature activation+8.83e-03

Pos logit contributions
+1.882
 POS+1.741
+1.581
 LW+1.556
idis+1.531
Neg logit contributions
wagen-1.939
amaz-1.805
pu-1.771
 municip-1.747
ells-1.747
Token bad
Token ID2089
Feature activation-1.41e-01

Pos logit contributions
wagen+0.072
amaz+0.068
pu+0.067
 municip+0.066
ells+0.066
Neg logit contributions
-0.064
 POS-0.057
-0.051
 LW-0.050
idious-0.050
Token that
Token ID326
Feature activation+6.04e-01

Pos logit contributions
+1.044
 POS+0.948
+0.858
 LW+0.844
idis+0.838
Neg logit contributions
wagen-1.116
amaz-1.044
pu-1.028
 municip-1.015
ells-1.012
Token number
Token ID1271
Feature activation+1.17e+00

Pos logit contributions
wagen+3.850
amaz+3.594
 municip+3.558
 rotor+3.551
ells+3.550
Neg logit contributions
-5.435
 POS-4.574
-4.418
-4.291
idis-4.192
Token is
Token ID318
Feature activation+6.37e-01

Pos logit contributions
ells+5.852
wagen+5.846
amaz+5.771
 rotor+5.647
pu+5.509
Neg logit contributions
-11.273
 POS-9.161
-9.130
-9.058
 awa-8.780
Token.
Token ID13
Feature activation-3.18e-02

Pos logit contributions
wagen+4.119
amaz+3.958
 municip+3.871
pu+3.820
ells+3.785
Neg logit contributions
-5.630
 POS-4.739
-4.443
-4.437
 LW-4.329
Token
Token ID198
Feature activation-2.77e-01

Pos logit contributions
+0.264
 POS+0.239
+0.220
 LW+0.216
idis+0.214
Neg logit contributions
wagen-0.226
amaz-0.211
pu-0.207
 municip-0.207
ells-0.203
Token
Token ID198
Feature activation-1.20e-01

Pos logit contributions
+2.034
 POS+1.913
 LW+1.713
+1.702
idis+1.674
Neg logit contributions
wagen-2.191
 municip-2.091
amaz-2.026
rices-1.968
pu-1.949
Token"
Token ID1
Feature activation-1.82e-01

Pos logit contributions
+0.962
 POS+0.857
+0.789
 LW+0.771
idious+0.765
Neg logit contributions
wagen-0.886
amaz-0.831
pu-0.815
 municip-0.809
ells-0.805
Token)
Token ID8
Feature activation+3.55e-01

Pos logit contributions
wagen+1.925
amaz+1.787
pu+1.785
ells+1.730
rac+1.726
Neg logit contributions
-3.390
 POS-3.002
 awa-2.872
-2.820
idious-2.804
Token (
Token ID357
Feature activation+2.31e-02

Pos logit contributions
wagen+2.071
amaz+1.918
pu+1.915
ells+1.870
rac+1.859
Neg logit contributions
-3.396
 POS-2.950
 awa-2.809
-2.806
idious-2.798
Token8
Token ID23
Feature activation+2.34e-01

Pos logit contributions
wagen+0.151
amaz+0.138
 municip+0.138
pu+0.135
ells+0.133
Neg logit contributions
-0.206
 POS-0.187
-0.174
 LW-0.171
idis-0.169
Token)
Token ID8
Feature activation+1.91e-01

Pos logit contributions
wagen+1.366
amaz+1.275
pu+1.259
ells+1.229
rac+1.221
Neg logit contributions
-2.239
 POS-1.993
-1.872
idious-1.846
 awa-1.845
Token
Token ID198
Feature activation+1.56e-01

Pos logit contributions
wagen+1.249
amaz+1.175
pu+1.153
 municip+1.134
ells+1.134
Neg logit contributions
-1.677
 POS-1.477
-1.398
idious-1.362
-1.358
Token
Token ID198
Feature activation+1.17e+00

Pos logit contributions
wagen+1.051
amaz+0.982
pu+0.966
ells+0.950
 municip+0.943
Neg logit contributions
-1.361
 POS-1.215
-1.131
idious-1.102
 LW-1.102
Token1
Token ID16
Feature activation-4.60e-02

Pos logit contributions
ells+5.181
wagen+5.057
amaz+4.897
Lib+4.872
pu+4.846
Neg logit contributions
-10.788
 POS-10.383
 awa-9.755
lihood-9.565
-9.519
Token.
Token ID13
Feature activation+2.04e-01

Pos logit contributions
+0.358
 POS+0.316
+0.292
idious+0.283
+0.283
Neg logit contributions
wagen-0.351
amaz-0.332
pu-0.326
 municip-0.323
ells-0.322
Token Gordon
Token ID11646
Feature activation-3.29e-01

Pos logit contributions
wagen+1.292
amaz+1.210
pu+1.194
ells+1.170
 municip+1.168
Neg logit contributions
-1.850
 POS-1.642
-1.537
idious-1.501
-1.499
Token Reid
Token ID16095
Feature activation+3.89e-02

Pos logit contributions
+1.937
 POS+1.787
 LW+1.555
+1.554
idis+1.498
Neg logit contributions
wagen-3.082
amaz-2.943
 municip-2.881
pu-2.817
ells-2.800
Token (
Token ID357
Feature activation+9.18e-02

Pos logit contributions
wagen+0.250
amaz+0.233
pu+0.228
 municip+0.226
ells+0.225
Neg logit contributions
-0.349
 POS-0.314
-0.295
-0.286
 LW-0.286
Token Catalan
Token ID31066
Feature activation+6.83e-01

Pos logit contributions
wagen+0.651
amaz+0.602
pu+0.594
 municip+0.592
ells+0.583
Neg logit contributions
-0.959
 POS-0.866
-0.812
idious-0.794
 LW-0.794
Token pro
Token ID386
Feature activation+2.90e-01

Pos logit contributions
wagen+3.000
pu+2.867
 municip+2.816
amaz+2.799
 rotor+2.740
Neg logit contributions
-7.238
 POS-6.377
-6.232
idious-6.184
-6.154
Token-
Token ID12
Feature activation+6.48e-01

Pos logit contributions
wagen+2.317
amaz+2.198
pu+2.181
rac+2.142
ells+2.141
Neg logit contributions
-2.148
 POS-1.867
-1.725
 awa-1.669
idious-1.669
Tokenindependence
Token ID39894
Feature activation-3.19e-01

Pos logit contributions
wagen+2.959
rac+2.811
pu+2.810
amaz+2.807
ells+2.692
Neg logit contributions
-6.863
 POS-6.302
-5.982
 LW-5.913
 awa-5.888
Token flag
Token ID6056
Feature activation-6.66e-02

Pos logit contributions
+2.253
 POS+2.094
+1.859
 LW+1.849
idis+1.793
Neg logit contributions
wagen-2.573
amaz-2.430
ells-2.377
rices-2.362
pu-2.356
Token -
Token ID532
Feature activation+1.16e+00

Pos logit contributions
+0.532
 POS+0.474
+0.436
 LW+0.426
idious+0.423
Neg logit contributions
wagen-0.497
amaz-0.466
pu-0.458
 municip-0.454
ells-0.452
Tokenthe
Token ID1169
Feature activation+1.57e-01

Pos logit contributions
wagen+4.532
 Lib+4.053
pu+4.018
amaz+3.993
rac+3.958
Neg logit contributions
-12.050
 POS-10.815
idious-10.557
verett-10.350
idis-10.345
Token est
Token ID1556
Feature activation+2.94e-01

Pos logit contributions
wagen+0.902
amaz+0.828
pu+0.813
 municip+0.812
ells+0.801
Neg logit contributions
-1.512
 POS-1.374
-1.286
-1.270
idious-1.262
Tokenel
Token ID417
Feature activation-1.20e+00

Pos logit contributions
wagen+1.939
amaz+1.813
ells+1.802
pu+1.796
rac+1.763
Neg logit contributions
-2.633
 POS-2.332
-2.163
-2.128
 awa-2.127
Tokenada
Token ID4763
Feature activation+2.41e-01

Pos logit contributions
+3.829
 POS+3.554
idis+3.203
+3.066
ardi+3.038
Neg logit contributions
wagen-12.363
 municip-11.999
 Pistons-11.874
 rotor-11.825
amaz-11.764
Token -
Token ID532
Feature activation-1.73e-01

Pos logit contributions
wagen+1.715
amaz+1.590
pu+1.587
 municip+1.566
ells+1.549
Neg logit contributions
-2.005
 POS-1.766
-1.631
-1.622
idious-1.612
TokenG
Token ID38
Feature activation+1.15e-01

Pos logit contributions
+0.398
 POS+0.356
+0.325
 LW+0.319
+0.314
Neg logit contributions
wagen-0.393
amaz-0.368
 municip-0.363
pu-0.360
ells-0.356
TokenED
Token ID1961
Feature activation+1.98e-01

Pos logit contributions
wagen+0.565
amaz+0.519
pu+0.505
ells+0.501
 municip+0.488
Neg logit contributions
-1.203
 POS-1.092
-1.029
idious-1.014
idis-1.011
Token NET
Token ID30502
Feature activation+4.31e-01

Pos logit contributions
wagen+1.475
amaz+1.396
pu+1.370
ells+1.357
 municip+1.356
Neg logit contributions
-1.572
 POS-1.359
-1.277
idious-1.249
-1.245
Token CALL
Token ID42815
Feature activation-1.82e-01

Pos logit contributions
wagen+1.958
amaz+1.858
ells+1.853
pu+1.829
rac+1.767
Neg logit contributions
-4.488
 POS-3.894
idious-3.865
-3.848
idis-3.796
Token UPDATE
Token ID35717
Feature activation-3.38e-01

Pos logit contributions
+1.454
 POS+1.314
+1.196
 LW+1.180
idis+1.155
Neg logit contributions
wagen-1.353
amaz-1.262
pu-1.239
 municip-1.232
ells-1.216
Token:
Token ID25
Feature activation-1.68e+00

Pos logit contributions
+2.538
 POS+2.276
+2.048
 LW+2.034
idis+1.978
Neg logit contributions
wagen-2.683
amaz-2.489
pu-2.468
 municip-2.442
ells-2.422
Token 05
Token ID8870
Feature activation-1.02e+00

Pos logit contributions
 POS+8.848
+8.791
 LW+8.123
 FL+8.068
 WA+7.743
Neg logit contributions
wagen-13.980
amaz-13.142
 municip-12.617
rac-12.436
rices-12.410
Token41
Token ID3901
Feature activation-2.77e-01

Pos logit contributions
+4.850
 POS+4.833
+4.483
 LW+4.242
+3.984
Neg logit contributions
wagen-9.931
 municip-9.637
rices-9.085
 rotor-8.964
amaz-8.921
Tokenz
Token ID89
Feature activation-7.51e-02

Pos logit contributions
+1.766
 POS+1.564
+1.390
 LW+1.378
idis+1.351
Neg logit contributions
wagen-2.463
 municip-2.360
amaz-2.307
pu-2.275
ells-2.269
Token BLACK
Token ID31963
Feature activation+1.49e-01

Pos logit contributions
+0.593
 POS+0.536
+0.488
 LW+0.481
idis+0.472
Neg logit contributions
wagen-0.565
amaz-0.528
pu-0.519
 municip-0.516
ells-0.513
TokenFO
Token ID6080
Feature activation-1.02e-01

Pos logit contributions
wagen+0.920
amaz+0.851
pu+0.840
ells+0.830
rac+0.811
Neg logit contributions
-1.378
 POS-1.231
-1.157
idious-1.137
idis-1.136
Token ON
Token ID6177
Feature activation-3.59e-01

Pos logit contributions
+1.185
 POS+1.074
+0.969
 LW+0.952
idis+0.931
Neg logit contributions
wagen-1.232
amaz-1.152
pu-1.130
 municip-1.128
ells-1.116
Token ALL
Token ID11096
Feature activation-4.29e-01

Pos logit contributions
+2.687
 POS+2.505
+2.235
 LW+2.217
idis+2.118
Neg logit contributions
wagen-2.835
amaz-2.626
 municip-2.579
pu-2.559
rices-2.545
Token PERSON
Token ID46740
Feature activation-2.65e-01

Pos logit contributions
+3.015
 POS+2.880
 LW+2.502
+2.501
idis+2.355
Neg logit contributions
wagen-3.563
amaz-3.321
 municip-3.239
pu-3.227
rices-3.203
TokenEL
Token ID3698
Feature activation-5.44e-01

Pos logit contributions
+1.573
 POS+1.373
+1.212
 LW+1.178
+1.145
Neg logit contributions
wagen-2.522
amaz-2.395
pu-2.354
 municip-2.347
rac-2.305
Token UPDATE
Token ID35717
Feature activation-2.31e-01

Pos logit contributions
+4.117
 POS+3.937
 LW+3.463
+3.429
 FL+3.304
Neg logit contributions
wagen-4.197
amaz-3.876
 municip-3.785
pu-3.783
rac-3.702
Token:
Token ID25
Feature activation-1.57e+00

Pos logit contributions
+1.768
 POS+1.586
+1.436
 LW+1.413
idis+1.388
Neg logit contributions
wagen-1.803
amaz-1.682
pu-1.657
 municip-1.641
ells-1.631
Token 05
Token ID8870
Feature activation-9.74e-01

Pos logit contributions
+8.497
 POS+8.320
 LW+7.613
 FL+7.486
 WA+7.133
Neg logit contributions
wagen-13.244
amaz-12.439
 municip-11.908
rac-11.836
rices-11.796
Token45
Token ID2231
Feature activation-1.38e-01

Pos logit contributions
+5.223
 POS+4.983
+4.535
 LW+4.162
WAY+4.142
Neg logit contributions
wagen-8.682
 municip-8.626
rices-8.207
amaz-8.141
 rotor-7.989
Tokenz
Token ID89
Feature activation-3.28e-03

Pos logit contributions
+0.914
 POS+0.807
+0.726
 LW+0.712
idis+0.703
Neg logit contributions
wagen-1.200
 municip-1.135
amaz-1.128
pu-1.112
ells-1.104
Token K
Token ID509
Feature activation-3.39e-02

Pos logit contributions
+0.026
 POS+0.024
+0.022
 LW+0.021
idis+0.021
Neg logit contributions
wagen-0.024
amaz-0.023
pu-0.022
 municip-0.022
ells-0.022
TokenUSH
Token ID27143
Feature activation-2.71e-01

Pos logit contributions
+0.262
 POS+0.233
+0.214
 LW+0.208
idis+0.207
Neg logit contributions
wagen-0.261
amaz-0.245
pu-0.241
 municip-0.239
ells-0.238
TokenG
Token ID38
Feature activation+1.82e-01

Pos logit contributions
wagen+0.412
amaz+0.386
pu+0.379
 municip+0.377
ells+0.374
Neg logit contributions
-0.438
 POS-0.390
-0.359
 LW-0.351
-0.347
TokenED
Token ID1961
Feature activation+3.02e-01

Pos logit contributions
wagen+0.862
amaz+0.794
ells+0.773
pu+0.771
rac+0.739
Neg logit contributions
-1.922
 POS-1.736
-1.640
idious-1.625
 awa-1.618
Token NET
Token ID30502
Feature activation+5.69e-01

Pos logit contributions
wagen+2.064
amaz+1.955
pu+1.919
ells+1.895
 municip+1.891
Neg logit contributions
-2.540
 POS-2.187
-2.087
idious-2.059
idis-2.042
Token CALL
Token ID42815
Feature activation-1.26e-01

Pos logit contributions
wagen+2.294
pu+2.210
ells+2.201
amaz+2.172
 Lib+2.160
Neg logit contributions
-6.230
idious-5.373
 awa-5.272
verett-5.248
-5.236
Token UPDATE
Token ID35717
Feature activation-2.80e-01

Pos logit contributions
+1.077
 POS+0.966
+0.896
 LW+0.876
idious+0.871
Neg logit contributions
wagen-0.860
amaz-0.803
pu-0.786
 municip-0.779
ells-0.776
Token:
Token ID25
Feature activation-1.57e+00

Pos logit contributions
+1.984
 POS+1.837
 LW+1.638
+1.621
idis+1.558
Neg logit contributions
wagen-2.339
 municip-2.168
amaz-2.158
pu-2.120
rices-2.102
Token 05
Token ID8870
Feature activation-6.77e-01

Pos logit contributions
+7.886
 POS+7.499
 LW+6.904
 FL+6.726
 WA+6.388
Neg logit contributions
wagen-13.859
amaz-13.120
 municip-12.647
rac-12.549
rices-12.493
Token43
Token ID3559
Feature activation-2.41e-01

Pos logit contributions
+3.811
 POS+3.500
+3.307
+2.991
 LW+2.963
Neg logit contributions
wagen-6.256
 municip-6.160
amaz-5.842
rices-5.838
 rotor-5.714
Tokenz
Token ID89
Feature activation-3.09e-02

Pos logit contributions
+1.538
 POS+1.363
+1.216
 LW+1.203
idis+1.187
Neg logit contributions
wagen-2.142
 municip-2.048
amaz-2.007
pu-1.976
 rotor-1.968
Token ST
Token ID3563
Feature activation+1.43e-01

Pos logit contributions
+0.247
 POS+0.222
+0.203
 LW+0.199
idis+0.197
Neg logit contributions
wagen-0.230
amaz-0.215
pu-0.211
 municip-0.210
ells-0.208
TokenOW
Token ID3913
Feature activation-3.38e-01

Pos logit contributions
wagen+1.279
amaz+1.216
pu+1.202
ells+1.199
 municip+1.188
Neg logit contributions
-0.916
 POS-0.777
-0.708
idis-0.689
-0.687
TokenG
Token ID38
Feature activation+1.65e-01

Pos logit contributions
wagen+0.145
amaz+0.136
 municip+0.133
pu+0.133
ells+0.131
Neg logit contributions
-0.151
 POS-0.134
-0.123
 LW-0.121
-0.119
TokenED
Token ID1961
Feature activation+2.57e-01

Pos logit contributions
wagen+0.799
amaz+0.734
pu+0.713
ells+0.713
 municip+0.688
Neg logit contributions
-1.732
 POS-1.568
-1.481
idious-1.462
idis-1.457
Token NET
Token ID30502
Feature activation+5.61e-01

Pos logit contributions
wagen+1.901
amaz+1.800
pu+1.763
ells+1.749
 municip+1.748
Neg logit contributions
-2.053
 POS-1.762
-1.664
idious-1.630
-1.625
Token CALL
Token ID42815
Feature activation-1.21e-01

Pos logit contributions
wagen+2.250
ells+2.162
amaz+2.134
pu+2.108
 rotor+2.041
Neg logit contributions
-6.058
idious-5.271
-5.211
 POS-5.209
idis-5.167
Token UPDATE
Token ID35717
Feature activation-3.26e-01

Pos logit contributions
+1.013
 POS+0.912
+0.839
 LW+0.823
idis+0.814
Neg logit contributions
wagen-0.860
amaz-0.803
pu-0.787
 municip-0.781
ells-0.775
Token:
Token ID25
Feature activation-1.55e+00

Pos logit contributions
+2.449
 POS+2.195
+1.972
 LW+1.960
+1.909
Neg logit contributions
wagen-2.595
amaz-2.407
pu-2.386
 municip-2.360
ells-2.343
Token 05
Token ID8870
Feature activation-5.90e-01

Pos logit contributions
+8.011
 POS+7.747
 LW+7.181
 FL+6.979
 WA+6.671
Neg logit contributions
wagen-13.554
amaz-12.812
 municip-12.352
rices-12.178
rac-12.148
Token42
Token ID3682
Feature activation-1.59e-01

Pos logit contributions
+3.308
 POS+3.022
+2.798
+2.575
 LW+2.566
Neg logit contributions
wagen-5.577
 municip-5.438
amaz-5.199
rices-5.158
 rotor-5.070
Tokenz
Token ID89
Feature activation-3.71e-02

Pos logit contributions
+1.046
 POS+0.924
+0.829
 LW+0.817
idis+0.805
Neg logit contributions
wagen-1.399
 municip-1.326
amaz-1.315
pu-1.294
ells-1.283
Token SH
Token ID6006
Feature activation-8.74e-02

Pos logit contributions
+0.296
 POS+0.267
+0.244
 LW+0.240
idis+0.237
Neg logit contributions
wagen-0.276
amaz-0.258
pu-0.253
 municip-0.251
ells-0.250
TokenAM
Token ID2390
Feature activation-3.23e-01

Pos logit contributions
+0.734
 POS+0.662
+0.610
 LW+0.598
+0.590
Neg logit contributions
wagen-0.619
amaz-0.575
pu-0.563
 municip-0.561
ells-0.554
TokenG
Token ID38
Feature activation+2.22e-01

Pos logit contributions
wagen+0.811
amaz+0.763
pu+0.750
ells+0.740
 municip+0.737
Neg logit contributions
-0.898
 POS-0.799
-0.737
idious-0.720
 LW-0.718
TokenED
Token ID1961
Feature activation+2.82e-01

Pos logit contributions
wagen+1.027
amaz+0.947
ells+0.927
pu+0.921
rac+0.881
Neg logit contributions
-2.372
 POS-2.136
-2.021
 awa-2.012
idious-2.009
Token NET
Token ID30502
Feature activation+6.33e-01

Pos logit contributions
wagen+1.866
amaz+1.765
pu+1.731
ells+1.709
 municip+1.707
Neg logit contributions
-2.441
 POS-2.112
-2.016
idious-1.995
idis-1.976
Token CALL
Token ID42815
Feature activation-3.92e-02

Pos logit contributions
wagen+2.448
pu+2.396
ells+2.393
 Lib+2.371
amaz+2.335
Neg logit contributions
-6.972
idious-6.026
 awa-5.931
verett-5.890
-5.843
Token UPDATE
Token ID35717
Feature activation-1.88e-01

Pos logit contributions
+0.343
 POS+0.305
+0.286
idious+0.280
idis+0.279
Neg logit contributions
wagen-0.260
amaz-0.244
pu-0.238
ells-0.236
 municip-0.236
Token:
Token ID25
Feature activation-1.48e+00

Pos logit contributions
+1.368
 POS+1.248
+1.118
 LW+1.114
+1.073
Neg logit contributions
wagen-1.533
amaz-1.422
 municip-1.419
pu-1.395
ells-1.379
Token 05
Token ID8870
Feature activation-9.37e-01

Pos logit contributions
+7.504
 POS+7.028
 LW+6.304
 FL+6.271
 WA+5.896
Neg logit contributions
wagen-13.297
amaz-12.724
 municip-12.240
rac-12.191
rices-12.060
Token50
Token ID1120
Feature activation-1.08e-01

Pos logit contributions
+4.077
 POS+4.043
+3.440
 LW+3.200
idis+3.118
Neg logit contributions
wagen-9.373
 municip-9.230
amaz-9.066
rices-8.937
 rotor-8.732
Tokenz
Token ID89
Feature activation+7.07e-03

Pos logit contributions
+0.723
 POS+0.639
+0.576
 LW+0.564
idis+0.557
Neg logit contributions
wagen-0.935
 municip-0.883
amaz-0.881
pu-0.867
ells-0.859
Token VOL
Token ID38570
Feature activation+1.66e-01

Pos logit contributions
wagen+0.053
amaz+0.050
pu+0.049
 municip+0.048
ells+0.048
Neg logit contributions
-0.056
 POS-0.050
-0.046
 LW-0.045
idis-0.045
TokenUN
Token ID4944
Feature activation+2.89e-01

Pos logit contributions
wagen+1.234
amaz+1.157
pu+1.139
ells+1.119
 municip+1.117
Neg logit contributions
-1.316
 POS-1.153
-1.073
idious-1.046
-1.045
TokenG
Token ID38
Feature activation+1.75e-01

Pos logit contributions
wagen+0.295
amaz+0.276
pu+0.271
 municip+0.270
ells+0.267
Neg logit contributions
-0.313
 POS-0.279
-0.256
 LW-0.250
-0.248
TokenED
Token ID1961
Feature activation+3.33e-01

Pos logit contributions
wagen+0.833
amaz+0.766
ells+0.747
pu+0.746
rac+0.714
Neg logit contributions
-1.857
 POS-1.677
-1.584
idious-1.569
 awa-1.565
Token NET
Token ID30502
Feature activation+6.07e-01

Pos logit contributions
wagen+2.209
amaz+2.088
pu+2.053
ells+2.031
 municip+2.021
Neg logit contributions
-2.860
 POS-2.458
-2.357
idious-2.342
idis-2.318
Token CALL
Token ID42815
Feature activation-1.45e-01

Pos logit contributions
wagen+2.344
ells+2.288
pu+2.285
 Lib+2.253
amaz+2.216
Neg logit contributions
-6.718
idious-5.802
 awa-5.728
verett-5.670
-5.636
Token UPDATE
Token ID35717
Feature activation-2.75e-01

Pos logit contributions
+1.242
 POS+1.115
+1.033
 LW+1.010
idious+1.004
Neg logit contributions
wagen-0.990
amaz-0.925
pu-0.905
 municip-0.897
ells-0.893
Token:
Token ID25
Feature activation-1.44e+00

Pos logit contributions
+1.942
 POS+1.795
 LW+1.601
+1.590
idis+1.520
Neg logit contributions
wagen-2.300
 municip-2.138
amaz-2.124
pu-2.084
rices-2.066
Token 0
Token ID657
Feature activation-5.70e-01

Pos logit contributions
+7.442
 POS+7.139
 LW+6.440
 FL+6.313
 WA+5.981
Neg logit contributions
wagen-12.756
amaz-12.202
 municip-11.819
rices-11.605
rac-11.530
Token551
Token ID43697
Feature activation-3.20e-01

Pos logit contributions
+4.280
 POS+3.919
+3.580
 LW+3.537
idis+3.395
Neg logit contributions
wagen-4.368
amaz-4.174
 municip-4.093
rices-3.929
pu-3.922
Tokenz
Token ID89
Feature activation+2.41e-01

Pos logit contributions
+2.115
 POS+1.871
+1.664
 LW+1.643
idis+1.604
Neg logit contributions
wagen-2.782
amaz-2.620
 municip-2.618
pu-2.585
ells-2.570
Token VOL
Token ID38570
Feature activation-7.03e-03

Pos logit contributions
wagen+1.760
amaz+1.664
pu+1.633
ells+1.608
 municip+1.607
Neg logit contributions
-1.943
 POS-1.676
-1.572
idious-1.549
-1.541
TokenUN
Token ID4944
Feature activation+5.86e-01

Pos logit contributions
+0.059
 POS+0.052
+0.048
 LW+0.047
idious+0.047
Neg logit contributions
wagen-0.050
amaz-0.047
pu-0.046
 municip-0.045
ells-0.045
TokenG
Token ID38
Feature activation+3.70e-01

Pos logit contributions
+2.104
 POS+1.901
+1.728
 LW+1.697
idis+1.669
Neg logit contributions
wagen-1.998
amaz-1.878
pu-1.833
 municip-1.829
ells-1.804
TokenED
Token ID1961
Feature activation-2.24e-01

Pos logit contributions
wagen+2.513
amaz+2.348
pu+2.318
ells+2.305
 municip+2.269
Neg logit contributions
-3.124
 POS-2.665
-2.551
idious-2.532
idis-2.521
Token NET
Token ID30502
Feature activation+2.49e-01

Pos logit contributions
+1.724
 POS+1.580
+1.417
 LW+1.412
idis+1.361
Neg logit contributions
wagen-1.728
amaz-1.623
pu-1.589
 municip-1.584
ells-1.566
Token CALL
Token ID42815
Feature activation-2.35e-02

Pos logit contributions
wagen+1.637
amaz+1.539
pu+1.518
ells+1.514
rac+1.487
Neg logit contributions
-2.174
 POS-1.873
-1.793
idious-1.781
idis-1.755
Token UPDATE
Token ID35717
Feature activation-2.87e-01

Pos logit contributions
+0.178
 POS+0.157
+0.144
 LW+0.140
idious+0.140
Neg logit contributions
wagen-0.184
amaz-0.173
pu-0.170
 municip-0.169
ells-0.168
Token:
Token ID25
Feature activation-1.38e+00

Pos logit contributions
+2.189
 POS+1.990
+1.786
 LW+1.783
idis+1.727
Neg logit contributions
wagen-2.245
amaz-2.088
pu-2.055
 municip-2.044
ells-2.019
Token 05
Token ID8870
Feature activation-4.78e-01

Pos logit contributions
 POS+8.243
+8.150
 LW+7.520
 FL+7.376
 WA+7.067
Neg logit contributions
wagen-11.222
amaz-10.510
 municip-10.174
rices-10.031
rac-9.948
Token38
Token ID2548
Feature activation-3.14e-02

Pos logit contributions
+2.891
 POS+2.725
+2.487
 LW+2.402
idis+2.307
Neg logit contributions
wagen-4.385
 municip-4.186
amaz-4.017
rices-4.006
 rotor-3.907
Tokenz
Token ID89
Feature activation+2.65e-02

Pos logit contributions
+0.215
 POS+0.190
+0.171
 LW+0.167
idis+0.165
Neg logit contributions
wagen-0.268
amaz-0.253
 municip-0.251
pu-0.249
ells-0.247
Token SEM
Token ID48603
Feature activation-1.52e-01

Pos logit contributions
wagen+0.197
amaz+0.185
pu+0.181
 municip+0.180
ells+0.179
Neg logit contributions
-0.212
 POS-0.190
-0.174
 LW-0.170
idis-0.169
TokenINAL
Token ID17961
Feature activation+5.01e-03

Pos logit contributions
+1.109
 POS+0.978
+0.890
 LW+0.867
idis+0.859
Neg logit contributions
wagen-1.244
amaz-1.173
pu-1.152
 municip-1.145
ells-1.139
Token.
Token ID13
Feature activation-1.54e-01

Pos logit contributions
wagen+3.372
pu+3.186
amaz+3.132
ells+3.129
rac+3.128
Neg logit contributions
-4.518
 POS-3.841
 awa-3.738
-3.664
-3.648
Tokenlist
Token ID4868
Feature activation+2.50e-01

Pos logit contributions
+0.889
 POS+0.767
+0.676
 LW+0.660
idis+0.651
Neg logit contributions
wagen-1.476
amaz-1.405
 municip-1.394
pu-1.383
ells-1.367
Token_
Token ID62
Feature activation+6.24e-01

Pos logit contributions
wagen+2.035
amaz+1.928
pu+1.922
ells+1.898
rac+1.880
Neg logit contributions
-1.824
 POS-1.570
-1.443
 awa-1.430
idious-1.406
Tokenimages
Token ID17566
Feature activation+4.48e-01

Pos logit contributions
wagen+2.480
amaz+2.359
pu+2.299
ells+2.239
rac+2.235
Neg logit contributions
-7.163
 POS-6.506
 awa-6.304
-6.185
 berth-6.054
Token(
Token ID7
Feature activation-3.16e-02

Pos logit contributions
wagen+2.420
pu+2.263
amaz+2.229
ells+2.203
rac+2.195
Neg logit contributions
-4.440
 POS-3.915
 awa-3.810
-3.694
idious-3.676
Tokenargs
Token ID22046
Feature activation-1.36e+00

Pos logit contributions
+0.260
 POS+0.232
+0.214
 LW+0.209
idious+0.208
Neg logit contributions
wagen-0.229
amaz-0.214
pu-0.210
 municip-0.208
ells-0.207
Token["
Token ID14692
Feature activation-5.07e-03

Pos logit contributions
+2.780
 POS+2.447
+1.926
 LW+1.720
"]=>+1.699
Neg logit contributions
wagen-15.873
amaz-15.570
 municip-15.538
 rotor-14.617
 Pistons-14.572
Tokenimages
Token ID17566
Feature activation-2.41e-01

Pos logit contributions
+0.051
 POS+0.046
+0.043
idis+0.043
 LW+0.043
Neg logit contributions
wagen-0.027
amaz-0.025
 municip-0.025
pu-0.024
 rotor-0.024
Token"]
Token ID8973
Feature activation+6.73e-02

Pos logit contributions
+0.744
 POS+0.570
+0.420
 LW+0.397
+0.379
Neg logit contributions
wagen-2.941
amaz-2.838
 municip-2.830
ells-2.777
pu-2.772
Token):
Token ID2599
Feature activation-9.97e-02

Pos logit contributions
wagen+0.445
amaz+0.414
pu+0.406
 municip+0.401
ells+0.400
Neg logit contributions
-0.593
 POS-0.534
-0.496
 LW-0.486
idious-0.483
Token #
Token ID1303
Feature activation+1.57e-01

Pos logit contributions
+0.863
 POS+0.799
+0.729
 LW+0.729
idious+0.708
Neg logit contributions
wagen-0.664
amaz-0.616
pu-0.602
 municip-0.602
ells-0.598
Token.
Token ID764
Feature activation-1.01e-01

Pos logit contributions
wagen+2.263
pu+2.063
ells+2.023
rac+2.008
amaz+1.999
Neg logit contributions
-5.312
 POS-4.640
-4.509
 awa-4.504
-4.500
Token list
Token ID1351
Feature activation+3.53e-01

Pos logit contributions
+0.824
 POS+0.735
+0.678
 LW+0.662
idious+0.657
Neg logit contributions
wagen-0.740
amaz-0.693
pu-0.680
 municip-0.675
ells-0.671
Token_
Token ID62
Feature activation+7.25e-01

Pos logit contributions
wagen+2.748
pu+2.613
amaz+2.589
ells+2.585
rac+2.545
Neg logit contributions
-2.672
 POS-2.260
 awa-2.137
-2.126
idious-2.086
Tokenimages
Token ID17566
Feature activation+6.25e-01

Pos logit contributions
wagen+2.748
amaz+2.591
pu+2.549
ells+2.516
rac+2.464
Neg logit contributions
-8.431
 POS-7.611
 awa-7.400
-7.301
 berth-7.108
Token (
Token ID357
Feature activation+6.90e-02

Pos logit contributions
wagen+3.219
pu+3.069
ells+2.986
rac+2.948
amaz+2.928
Neg logit contributions
-6.309
 POS-5.383
 awa-5.370
-5.243
-5.205
Token args
Token ID26498
Feature activation-1.33e+00

Pos logit contributions
wagen+0.566
amaz+0.534
pu+0.527
ells+0.523
 municip+0.521
Neg logit contributions
-0.505
 POS-0.433
-0.401
-0.388
idious-0.386
Token [
Token ID685
Feature activation-5.69e-01

Pos logit contributions
 POS+7.063
+6.606
 LW+6.154
+5.919
 FL+5.762
Neg logit contributions
wagen-11.403
amaz-11.238
 municip-11.010
rac-10.345
ocese-10.330
Token "
Token ID366
Feature activation-2.90e-01

Pos logit contributions
+4.204
 POS+3.949
 LW+3.509
+3.459
idis+3.352
Neg logit contributions
wagen-4.404
amaz-4.159
 municip-4.005
pu-3.999
ells-3.949
Tokenimages
Token ID17566
Feature activation-2.70e-01

Pos logit contributions
+2.709
 POS+2.515
idis+2.324
 LW+2.314
+2.307
Neg logit contributions
wagen-1.694
 municip-1.560
amaz-1.510
 rotor-1.488
pu-1.450
Token"
Token ID1
Feature activation-8.43e-03

Pos logit contributions
+1.930
 POS+1.765
+1.586
 LW+1.565
idious+1.545
Neg logit contributions
wagen-2.172
 municip-2.057
amaz-2.056
 rotor-1.976
ells-1.969
Token ]
Token ID2361
Feature activation+1.81e-01

Pos logit contributions
+0.070
 POS+0.062
+0.058
 LW+0.056
+0.056
Neg logit contributions
wagen-0.060
amaz-0.056
pu-0.055
ells-0.055
 municip-0.055
Token97
Token ID5607
Feature activation-3.43e-01

Pos logit contributions
+3.016
 POS+2.631
+2.434
 LW+2.279
+2.220
Neg logit contributions
wagen-5.158
amaz-4.923
 municip-4.895
ells-4.805
rices-4.760
Token]
Token ID60
Feature activation+2.19e-01

Pos logit contributions
+2.792
 POS+2.494
+2.299
 LW+2.266
+2.233
Neg logit contributions
wagen-2.501
amaz-2.343
pu-2.283
 municip-2.269
ells-2.258
Token Ble
Token ID17175
Feature activation+1.11e-01

Pos logit contributions
wagen+1.456
amaz+1.361
pu+1.352
 municip+1.332
rac+1.319
Neg logit contributions
-1.907
 POS-1.678
-1.581
idious-1.548
-1.545
Tokenacher
Token ID3493
Feature activation+9.24e-02

Pos logit contributions
wagen+0.813
amaz+0.766
pu+0.752
ells+0.743
 municip+0.740
Neg logit contributions
-0.906
 POS-0.808
-0.745
 LW-0.726
-0.723
Token Report
Token ID6358
Feature activation-9.12e-02

Pos logit contributions
wagen+0.736
amaz+0.696
pu+0.679
 municip+0.678
ells+0.674
Neg logit contributions
-0.688
 POS-0.612
-0.556
 LW-0.546
idis-0.540
Token,[
Token ID17414
Feature activation-1.31e+00

Pos logit contributions
+0.627
 POS+0.548
+0.496
 LW+0.483
idis+0.477
Neg logit contributions
wagen-0.782
amaz-0.740
pu-0.727
 municip-0.722
ells-0.719
Token98
Token ID4089
Feature activation-2.19e-01

Pos logit contributions
+4.313
 POS+3.628
+3.431
+3.048
 LW+2.916
Neg logit contributions
wagen-13.751
amaz-13.495
 municip-13.232
rahim-13.070
ells-12.931
Token]
Token ID60
Feature activation+1.86e-01

Pos logit contributions
+1.830
 POS+1.639
+1.513
 LW+1.484
+1.470
Neg logit contributions
wagen-1.557
amaz-1.458
pu-1.424
 municip-1.413
ells-1.406
Token Sporting
Token ID31790
Feature activation+1.61e-01

Pos logit contributions
wagen+1.225
amaz+1.143
pu+1.135
 municip+1.118
ells+1.107
Neg logit contributions
-1.630
 POS-1.438
-1.358
idious-1.326
-1.325
Token News
Token ID3000
Feature activation-4.73e-02

Pos logit contributions
wagen+1.191
amaz+1.116
pu+1.096
 municip+1.089
ells+1.083
Neg logit contributions
-1.303
 POS-1.150
-1.064
idious-1.034
idis-1.031
Token,[
Token ID17414
Feature activation-1.18e+00

Pos logit contributions
+0.321
 POS+0.280
+0.252
 LW+0.246
idis+0.243
Neg logit contributions
wagen-0.410
amaz-0.388
pu-0.382
 municip-0.379
ells-0.378
Token
Token ID198
Feature activation-1.22e+00

Pos logit contributions
+2.161
 POS+2.041
 LW+1.836
+1.811
idis+1.786
Neg logit contributions
wagen-2.365
 municip-2.268
amaz-2.192
rices-2.132
 rotor-2.104
Token2007
Token ID12726
Feature activation-4.16e-02

Pos logit contributions
+3.567
 POS+2.735
wine+2.535
+2.394
+2.314
Neg logit contributions
 municip-14.374
wagen-14.125
 rotor-13.766
rices-13.437
amaz-13.340
Token Tony
Token ID8832
Feature activation+6.50e-02

Pos logit contributions
+0.349
 POS+0.314
+0.289
 LW+0.284
idis+0.281
Neg logit contributions
wagen-0.294
amaz-0.274
pu-0.269
 municip-0.267
ells-0.265
Token Shal
Token ID47974
Feature activation-2.55e-01

Pos logit contributions
wagen+0.561
amaz+0.532
pu+0.524
 municip+0.519
ells+0.518
Neg logit contributions
-0.442
 POS-0.384
-0.348
 LW-0.336
-0.335
Tokenh
Token ID71
Feature activation+4.82e-03

Pos logit contributions
+1.782
 POS+1.604
idis+1.475
 LW+1.461
+1.429
Neg logit contributions
wagen-2.078
 municip-1.967
amaz-1.909
pu-1.881
 rotor-1.871
Tokenoub
Token ID12944
Feature activation-1.29e+00

Pos logit contributions
wagen+0.036
amaz+0.033
pu+0.033
 municip+0.033
ells+0.032
Neg logit contributions
-0.039
 POS-0.035
-0.032
 LW-0.031
idis-0.031
Token
Token ID198
Feature activation-3.84e-01

Pos logit contributions
+7.646
 POS+7.166
 LW+6.419
idis+6.350
+6.089
Neg logit contributions
wagen-10.304
amaz-9.890
 municip-9.637
ells-9.584
pu-9.473
Token
Token ID198
Feature activation-1.21e+00

Pos logit contributions
+2.678
 POS+2.558
 LW+2.304
+2.253
idis+2.220
Neg logit contributions
wagen-3.127
 municip-3.019
amaz-2.891
rices-2.815
 rotor-2.767
Token2007
Token ID12726
Feature activation-1.28e-01

Pos logit contributions
+4.249
 POS+3.173
wine+2.915
+2.829
+2.806
Neg logit contributions
wagen-13.285
 municip-13.069
 rotor-12.776
amaz-12.698
ells-12.661
Token Charlie
Token ID11526
Feature activation+1.30e-01

Pos logit contributions
+1.064
 POS+0.966
+0.884
 LW+0.876
idis+0.865
Neg logit contributions
wagen-0.906
amaz-0.841
pu-0.822
 municip-0.821
ells-0.813
Token Sheen
Token ID50189
Feature activation-7.89e-01

Pos logit contributions
wagen+0.871
amaz+0.816
pu+0.801
ells+0.790
 municip+0.788
Neg logit contributions
-1.136
 POS-1.016
-0.945
 LW-0.921
idious-0.919
Tokenthe
Token ID1169
Feature activation-7.84e-02

Pos logit contributions
wagen+1.254
amaz+1.188
pu+1.164
ells+1.142
rac+1.139
Neg logit contributions
-1.518
 POS-1.368
-1.252
 LW-1.229
idious-1.220
Token daughter
Token ID4957
Feature activation-6.93e-02

Pos logit contributions
+0.643
 POS+0.578
+0.531
 LW+0.524
idious+0.517
Neg logit contributions
wagen-0.567
amaz-0.528
pu-0.518
 municip-0.514
ells-0.511
Token of
Token ID286
Feature activation+6.88e-02

Pos logit contributions
+0.534
 POS+0.474
+0.434
 LW+0.423
idis+0.420
Neg logit contributions
wagen-0.535
amaz-0.504
pu-0.495
 municip-0.491
ells-0.489
Token Ted
Token ID11396
Feature activation-1.26e-01

Pos logit contributions
wagen+0.493
amaz+0.463
pu+0.454
 municip+0.450
ells+0.447
Neg logit contributions
-0.569
 POS-0.508
-0.469
 LW-0.458
-0.455
Token Kot
Token ID21702
Feature activation-3.76e-01

Pos logit contributions
+0.947
 POS+0.841
+0.768
 LW+0.756
idis+0.753
Neg logit contributions
wagen-1.003
amaz-0.934
 municip-0.918
pu-0.913
ells-0.905
Tokenche
Token ID2395
Feature activation-1.28e+00

Pos logit contributions
+3.629
 POS+3.358
idis+3.140
+3.090
 LW+3.081
Neg logit contributions
wagen-2.014
 municip-1.930
amaz-1.829
pu-1.790
 rotor-1.775
Tokenff
Token ID487
Feature activation+1.26e-01

Pos logit contributions
+3.301
 POS+3.039
idis+2.589
wine+2.553
verett+2.372
Neg logit contributions
 municip-14.089
wagen-13.777
 rotor-13.412
amaz-13.337
rices-13.085
Token,
Token ID11
Feature activation+2.29e-01

Pos logit contributions
wagen+0.873
amaz+0.815
pu+0.800
ells+0.791
 municip+0.790
Neg logit contributions
-1.066
 POS-0.954
-0.883
 LW-0.863
-0.859
Token director
Token ID3437
Feature activation-1.49e-01

Pos logit contributions
wagen+1.638
amaz+1.552
pu+1.518
 municip+1.498
ells+1.493
Neg logit contributions
-1.874
 POS-1.663
-1.535
-1.493
idious-1.489
Token of
Token ID286
Feature activation+2.88e-02

Pos logit contributions
+1.132
 POS+1.004
+0.917
 LW+0.897
idious+0.888
Neg logit contributions
wagen-1.164
amaz-1.092
pu-1.074
 municip-1.066
ells-1.061
Token,
Token ID11
Feature activation+1.94e-01

Pos logit contributions
wagen+0.206
amaz+0.193
pu+0.189
 municip+0.187
ells+0.186
Neg logit contributions
-0.239
 POS-0.213
-0.197
 LW-0.192
-0.191
Token
Token ID198
Feature activation-2.29e-01

Pos logit contributions
+5.958
 POS+5.588
idious+4.863
idis+4.852
 LW+4.846
Neg logit contributions
wagen-7.464
amaz-7.229
 municip-7.066
pu-7.006
ells-7.001
Token
Token ID198
Feature activation-5.71e-01

Pos logit contributions
+1.704
 POS+1.598
 LW+1.440
+1.429
idis+1.400
Neg logit contributions
wagen-1.789
 municip-1.699
amaz-1.660
rices-1.604
pu-1.589
Token2003
Token ID16088
Feature activation-2.34e-01

Pos logit contributions
+2.773
 POS+2.258
+2.083
+2.074
wine+1.972
Neg logit contributions
wagen-5.790
 municip-5.667
amaz-5.527
 rotor-5.489
pu-5.429
Token Ray
Token ID7760
Feature activation+7.27e-01

Pos logit contributions
+1.961
 POS+1.804
+1.638
 LW+1.636
idis+1.604
Neg logit contributions
wagen-1.639
amaz-1.511
 municip-1.478
pu-1.477
ells-1.460
Token Rom
Token ID3570
Feature activation-4.89e-02

Pos logit contributions
wagen+0.644
pu+0.572
 Lib+0.498
rac+0.493
ells+0.473
Neg logit contributions
-10.232
 POS-9.320
-9.206
 awa-9.187
-9.046
Tokenano
Token ID5733
Feature activation-1.28e+00

Pos logit contributions
+0.321
 POS+0.281
+0.252
 LW+0.247
idis+0.245
Neg logit contributions
wagen-0.431
amaz-0.406
 municip-0.402
pu-0.399
ells-0.397
Token 2004
Token ID5472
Feature activation-1.77e-01

Pos logit contributions
+6.664
 POS+6.289
 LW+5.267
+5.254
 FL+5.224
Neg logit contributions
wagen-10.780
amaz-10.626
 municip-10.412
pu-10.340
ells-10.228
Token Kel
Token ID15150
Feature activation-6.74e-02

Pos logit contributions
+1.517
 POS+1.386
+1.271
 LW+1.256
idis+1.241
Neg logit contributions
wagen-1.212
amaz-1.119
 municip-1.094
pu-1.092
ells-1.083
Tokensey
Token ID4397
Feature activation+3.71e-01

Pos logit contributions
+0.368
 POS+0.313
+0.273
 LW+0.265
idis+0.264
Neg logit contributions
wagen-0.668
amaz-0.636
 municip-0.628
pu-0.625
ells-0.621
Token Gram
Token ID20159
Feature activation-1.50e-01

Pos logit contributions
wagen+3.004
pu+2.828
amaz+2.820
ells+2.790
rac+2.762
Neg logit contributions
-2.723
 POS-2.311
-2.182
 awa-2.090
idious-2.086
Tokenmer
Token ID647
Feature activation-2.02e-01

Pos logit contributions
+0.780
 POS+0.664
+0.574
 LW+0.560
idis+0.549
Neg logit contributions
wagen-1.515
amaz-1.447
 municip-1.435
pu-1.421
ells-1.414
Token "
Token ID366
Feature activation-5.77e-01

Pos logit contributions
+2.586
 POS+2.451
 LW+2.159
+2.092
idis+2.009
Neg logit contributions
wagen-3.671
amaz-3.457
 municip-3.423
pu-3.356
ells-3.329
Token--
Token ID438
Feature activation-1.85e-01

Pos logit contributions
+3.944
 POS+3.618
+3.338
wine+3.245
 LW+3.201
Neg logit contributions
wagen-4.653
 municip-4.410
amaz-4.256
 rotor-4.208
rahim-4.196
Tokenimages
Token ID17566
Feature activation-2.47e-01

Pos logit contributions
+1.783
 POS+1.642
+1.535
idis+1.524
 LW+1.515
Neg logit contributions
wagen-1.035
 municip-0.947
amaz-0.927
 rotor-0.915
pu-0.907
Token",
Token ID1600
Feature activation-2.06e-01

Pos logit contributions
+1.618
 POS+1.440
+1.275
idis+1.249
 LW+1.246
Neg logit contributions
wagen-2.167
amaz-2.065
 municip-2.059
 rotor-1.990
pu-1.989
Token required
Token ID2672
Feature activation+2.00e-01

Pos logit contributions
+1.769
 POS+1.604
+1.480
 LW+1.456
idious+1.437
Neg logit contributions
wagen-1.406
amaz-1.310
pu-1.276
 municip-1.269
ells-1.263
Token=
Token ID28
Feature activation-1.27e+00

Pos logit contributions
wagen+1.287
amaz+1.198
pu+1.177
ells+1.156
 municip+1.151
Neg logit contributions
-1.803
 POS-1.624
-1.511
 LW-1.479
idious-1.475
TokenTrue
Token ID17821
Feature activation-5.11e-03

Pos logit contributions
+6.373
 POS+5.796
+5.514
+5.090
WAY+5.036
Neg logit contributions
 municip-12.393
wagen-12.168
 rotor-11.236
ozyg-11.052
rices-10.997
Token,
Token ID11
Feature activation-3.92e-01

Pos logit contributions
+0.040
 POS+0.035
+0.032
 LW+0.032
idis+0.031
Neg logit contributions
wagen-0.039
amaz-0.037
 municip-0.036
pu-0.036
ells-0.036
Token help
Token ID1037
Feature activation-7.74e-02

Pos logit contributions
+2.848
 POS+2.677
 LW+2.399
+2.358
idis+2.301
Neg logit contributions
wagen-3.055
amaz-2.876
 municip-2.847
pu-2.828
ells-2.801
Token="
Token ID2625
Feature activation-3.95e-01

Pos logit contributions
+0.445
 POS+0.381
+0.340
 LW+0.325
idis+0.318
Neg logit contributions
wagen-0.753
amaz-0.714
 municip-0.714
pu-0.696
ells-0.691
Tokenpath
Token ID6978
Feature activation-3.62e-01

Pos logit contributions
+2.634
 POS+2.400
+2.166
idis+2.142
wine+2.137
Neg logit contributions
wagen-3.272
 municip-3.160
amaz-3.052
 rotor-3.017
pu-2.967
Token for
Token ID329
Feature activation+4.22e-02

Pos logit contributions
wagen+1.362
amaz+1.278
pu+1.253
 municip+1.245
ells+1.236
Neg logit contributions
-1.518
 POS-1.352
-1.248
 LW-1.217
-1.211
Token public
Token ID1171
Feature activation+9.50e-02

Pos logit contributions
wagen+0.315
amaz+0.295
pu+0.289
 municip+0.287
ells+0.286
Neg logit contributions
-0.338
 POS-0.301
-0.277
 LW-0.271
idious-0.268
Token monitoring
Token ID9904
Feature activation-1.69e-02

Pos logit contributions
wagen+0.724
amaz+0.678
pu+0.669
 municip+0.665
ells+0.659
Neg logit contributions
-0.743
 POS-0.658
-0.608
 LW-0.589
-0.586
Token and
Token ID290
Feature activation-7.36e-02

Pos logit contributions
+0.129
 POS+0.115
+0.104
 LW+0.102
idious+0.101
Neg logit contributions
wagen-0.133
amaz-0.125
pu-0.122
ells-0.121
 municip-0.121
Token accountability
Token ID18241
Feature activation-6.09e-03

Pos logit contributions
+0.609
 POS+0.549
+0.501
 LW+0.494
idious+0.491
Neg logit contributions
wagen-0.527
amaz-0.493
pu-0.483
ells-0.478
 municip-0.476
Token;
Token ID26
Feature activation-1.26e+00

Pos logit contributions
+0.048
 POS+0.043
+0.039
 LW+0.038
idis+0.038
Neg logit contributions
wagen-0.046
amaz-0.043
pu-0.043
 municip-0.042
ells-0.042
Token and
Token ID290
Feature activation-9.46e-01

Pos logit contributions
+8.470
 POS+7.269
 LW+6.462
idis+6.360
idious+6.319
Neg logit contributions
wagen-9.746
amaz-9.262
ells-9.164
rahim-9.015
pu-8.864
Token iv
Token ID21628
Feature activation-1.53e-01

Pos logit contributions
+7.168
 POS+6.248
 LW+5.658
+5.596
idious+5.570
Neg logit contributions
wagen-7.040
amaz-6.693
ells-6.475
pu-6.403
rahim-6.368
Token)
Token ID8
Feature activation+1.99e-02

Pos logit contributions
+1.314
 POS+1.198
+1.118
 LW+1.095
idis+1.084
Neg logit contributions
wagen-1.036
amaz-0.961
 municip-0.948
pu-0.928
ells-0.922
Token mandate
Token ID14598
Feature activation-1.53e-01

Pos logit contributions
wagen+0.160
amaz+0.150
pu+0.147
ells+0.146
 municip+0.146
Neg logit contributions
-0.147
 POS-0.131
-0.119
 LW-0.117
idis-0.115
Token that
Token ID326
Feature activation+1.08e-01

Pos logit contributions
+1.140
 POS+1.034
+0.925
 LW+0.919
idious+0.908
Neg logit contributions
wagen-1.207
amaz-1.130
pu-1.107
ells-1.102
rac-1.086
Token x
Token ID2124
Feature activation-4.33e-02

Pos logit contributions
+1.915
 POS+1.810
 LW+1.436
+1.394
idis+1.309
Neg logit contributions
wagen-5.390
amaz-5.159
 municip-5.090
rices-4.995
ells-4.975
Token.
Token ID13
Feature activation-3.60e-01

Pos logit contributions
+0.321
 POS+0.286
+0.260
 LW+0.255
idis+0.252
Neg logit contributions
wagen-0.347
amaz-0.326
 municip-0.321
pu-0.319
ells-0.316
Tokenx
Token ID87
Feature activation-4.06e-01

Pos logit contributions
+2.813
 POS+2.607
idis+2.359
+2.358
 LW+2.348
Neg logit contributions
wagen-2.646
 municip-2.539
amaz-2.432
 rotor-2.386
rices-2.347
Token.
Token ID13
Feature activation-7.27e-01

Pos logit contributions
+2.566
 POS+2.389
 LW+2.119
+2.085
idis+2.053
Neg logit contributions
wagen-3.584
 municip-3.456
amaz-3.351
rices-3.246
 rotor-3.237
Tokenx
Token ID87
Feature activation-6.28e-01

Pos logit contributions
+4.956
 POS+4.662
idis+4.283
wine+4.216
 LW+4.147
Neg logit contributions
 municip-5.726
wagen-5.694
 rotor-5.316
amaz-5.249
rices-5.138
Token.
Token ID13
Feature activation-1.26e+00

Pos logit contributions
+3.860
 POS+3.550
+3.127
idis+3.118
 LW+3.111
Neg logit contributions
wagen-5.576
 municip-5.394
amaz-5.248
 rotor-5.108
rices-5.099
Token151
Token ID24309
Feature activation-3.02e-01

Pos logit contributions
+4.607
 POS+4.584
idis+4.398
wine+3.977
+3.634
Neg logit contributions
wagen-12.662
amaz-12.441
rahim-12.269
rices-12.153
 rotor-12.109
Token 103
Token ID15349
Feature activation-6.22e-01

Pos logit contributions
+2.303
 POS+2.143
 LW+1.961
+1.916
idis+1.877
Neg logit contributions
wagen-2.305
amaz-2.145
 municip-2.130
pu-2.098
rices-2.075
Token30
Token ID1270
Feature activation-1.64e-01

Pos logit contributions
+4.588
 POS+4.387
+4.020
 LW+3.909
idis+3.792
Neg logit contributions
wagen-4.860
 municip-4.606
amaz-4.341
rices-4.322
 rotor-4.227
Token29
Token ID1959
Feature activation-5.37e-01

Pos logit contributions
+1.202
 POS+1.089
+1.000
 LW+0.971
idis+0.957
Neg logit contributions
wagen-1.314
amaz-1.230
 municip-1.220
pu-1.187
rices-1.183
Token x
Token ID2124
Feature activation+7.13e-02

Pos logit contributions
+2.094
 POS+1.985
 LW+1.580
+1.511
idis+1.396
Neg logit contributions
wagen-6.002
amaz-5.751
 municip-5.702
rices-5.599
ells-5.561
Token2008
Token ID11528
Feature activation+2.22e-01

Pos logit contributions
+4.494
+3.142
 POS+3.138
+3.044
wine+2.878
Neg logit contributions
 municip-13.652
wagen-13.602
 rotor-13.010
amaz-12.885
 Pistons-12.673
Token Steve
Token ID6542
Feature activation+4.88e-02

Pos logit contributions
wagen+1.532
amaz+1.449
pu+1.426
ells+1.403
rac+1.396
Neg logit contributions
-1.878
 POS-1.647
-1.549
-1.491
idious-1.490
Token Care
Token ID7276
Feature activation+3.88e-01

Pos logit contributions
wagen+0.392
amaz+0.370
pu+0.363
 municip+0.361
ells+0.359
Neg logit contributions
-0.361
 POS-0.319
-0.291
 LW-0.284
idis-0.281
Tokenll
Token ID297
Feature activation-5.39e-01

Pos logit contributions
wagen+2.777
ells+2.637
pu+2.617
amaz+2.601
rac+2.573
Neg logit contributions
-3.220
 POS-2.805
 awa-2.627
-2.605
-2.534
Token
Token ID198
Feature activation-2.90e-01

Pos logit contributions
+3.859
 POS+3.683
 LW+3.343
+3.269
idis+3.227
Neg logit contributions
wagen-4.216
 municip-4.014
amaz-3.938
rices-3.833
ells-3.813
Token
Token ID198
Feature activation-1.24e+00

Pos logit contributions
+2.112
 POS+1.990
 LW+1.791
+1.770
idis+1.743
Neg logit contributions
wagen-2.302
 municip-2.207
amaz-2.132
rices-2.074
 rotor-2.050
Token2008
Token ID11528
Feature activation+1.51e-01

Pos logit contributions
+4.273
+3.039
 POS+2.964
+2.943
wine+2.776
Neg logit contributions
 municip-13.959
wagen-13.793
 rotor-13.168
amaz-13.017
 Pistons-12.814
Token Lee
Token ID5741
Feature activation+3.41e-01

Pos logit contributions
wagen+1.041
amaz+0.980
pu+0.962
ells+0.949
 municip+0.944
Neg logit contributions
-1.279
 POS-1.130
-1.058
idious-1.021
-1.020
Token Pace
Token ID44111
Feature activation-1.13e-01

Pos logit contributions
wagen+2.041
amaz+1.904
pu+1.888
ells+1.879
rac+1.841
Neg logit contributions
-3.238
 POS-2.874
-2.720
-2.645
 LW-2.610
Token
Token ID198
Feature activation-1.31e-01

Pos logit contributions
+0.942
 POS+0.843
+0.779
 LW+0.761
idious+0.756
Neg logit contributions
wagen-0.801
amaz-0.749
pu-0.734
 municip-0.728
ells-0.724
Token
Token ID198
Feature activation-1.02e+00

Pos logit contributions
+1.026
 POS+0.945
+0.857
 LW+0.853
idis+0.838
Neg logit contributions
wagen-0.981
 municip-0.920
amaz-0.912
pu-0.879
rices-0.877
Token Sheen
Token ID50189
Feature activation-7.89e-01

Pos logit contributions
wagen+0.871
amaz+0.816
pu+0.801
ells+0.790
 municip+0.788
Neg logit contributions
-1.136
 POS-1.016
-0.945
 LW-0.921
idious-0.919
Token 2008
Token ID3648
Feature activation+1.68e-01

Pos logit contributions
+5.618
 POS+5.156
 LW+4.682
idis+4.586
+4.573
Neg logit contributions
wagen-6.057
amaz-5.756
 municip-5.638
ells-5.586
pu-5.560
Token Alec
Token ID38285
Feature activation-1.91e-01

Pos logit contributions
wagen+1.148
amaz+1.082
pu+1.064
ells+1.045
rac+1.041
Neg logit contributions
-1.439
 POS-1.268
-1.189
-1.149
idious-1.148
Token Baldwin
Token ID26402
Feature activation-9.92e-02

Pos logit contributions
+1.345
 POS+1.212
+1.084
 LW+1.080
idis+1.060
Neg logit contributions
wagen-1.595
amaz-1.496
 municip-1.480
pu-1.468
ells-1.448
Token
Token ID198
Feature activation-5.59e-02

Pos logit contributions
+0.817
 POS+0.743
+0.682
 LW+0.672
idis+0.666
Neg logit contributions
wagen-0.708
amaz-0.660
 municip-0.651
pu-0.644
ells-0.636
Token
Token ID198
Feature activation-1.23e+00

Pos logit contributions
+0.450
 POS+0.412
+0.375
 LW+0.372
idis+0.366
Neg logit contributions
wagen-0.409
amaz-0.381
 municip-0.380
pu-0.369
ells-0.366
Token2008
Token ID11528
Feature activation+2.22e-01

Pos logit contributions
+4.494
+3.142
 POS+3.138
+3.044
wine+2.878
Neg logit contributions
 municip-13.652
wagen-13.602
 rotor-13.010
amaz-12.885
 Pistons-12.673
Token Steve
Token ID6542
Feature activation+4.88e-02

Pos logit contributions
wagen+1.532
amaz+1.449
pu+1.426
ells+1.403
rac+1.396
Neg logit contributions
-1.878
 POS-1.647
-1.549
-1.491
idious-1.490
Token Care
Token ID7276
Feature activation+3.88e-01

Pos logit contributions
wagen+0.392
amaz+0.370
pu+0.363
 municip+0.361
ells+0.359
Neg logit contributions
-0.361
 POS-0.319
-0.291
 LW-0.284
idis-0.281
Tokenll
Token ID297
Feature activation-5.39e-01

Pos logit contributions
wagen+2.777
ells+2.637
pu+2.617
amaz+2.601
rac+2.573
Neg logit contributions
-3.220
 POS-2.805
 awa-2.627
-2.605
-2.534
Token
Token ID198
Feature activation-2.90e-01

Pos logit contributions
+3.859
 POS+3.683
 LW+3.343
+3.269
idis+3.227
Neg logit contributions
wagen-4.216
 municip-4.014
amaz-3.938
rices-3.833
ells-3.813
Token2007
Token ID12726
Feature activation-1.01e-01

Pos logit contributions
+4.092
 POS+3.120
wine+2.847
+2.794
+2.715
Neg logit contributions
wagen-11.552
 municip-11.301
 rotor-11.161
amaz-10.991
ells-10.944
Token Steve
Token ID6542
Feature activation+1.48e-01

Pos logit contributions
+0.842
 POS+0.763
+0.699
 LW+0.691
idis+0.682
Neg logit contributions
wagen-0.719
amaz-0.669
pu-0.655
 municip-0.653
ells-0.646
Token Care
Token ID7276
Feature activation+4.01e-01

Pos logit contributions
wagen+1.112
amaz+1.056
pu+1.039
ells+1.034
 municip+1.027
Neg logit contributions
-1.164
 POS-1.015
-0.945
 LW-0.912
-0.909
Tokenll
Token ID297
Feature activation-7.17e-01

Pos logit contributions
wagen+2.914
pu+2.731
ells+2.727
amaz+2.720
rac+2.695
Neg logit contributions
-3.318
 POS-2.895
-2.693
 awa-2.664
-2.619
Token
Token ID198
Feature activation-2.98e-01

Pos logit contributions
+5.319
 POS+4.864
 LW+4.396
idis+4.356
+4.349
Neg logit contributions
wagen-5.357
amaz-5.090
pu-4.966
 municip-4.959
ells-4.928
Token
Token ID198
Feature activation-1.22e+00

Pos logit contributions
+2.161
 POS+2.041
 LW+1.836
+1.811
idis+1.786
Neg logit contributions
wagen-2.365
 municip-2.268
amaz-2.192
rices-2.132
 rotor-2.104
Token2007
Token ID12726
Feature activation-4.16e-02

Pos logit contributions
+3.567
 POS+2.735
wine+2.535
+2.394
+2.314
Neg logit contributions
 municip-14.374
wagen-14.125
 rotor-13.766
rices-13.437
amaz-13.340
Token Tony
Token ID8832
Feature activation+6.50e-02

Pos logit contributions
+0.349
 POS+0.314
+0.289
 LW+0.284
idis+0.281
Neg logit contributions
wagen-0.294
amaz-0.274
pu-0.269
 municip-0.267
ells-0.265
Token Shal
Token ID47974
Feature activation-2.55e-01

Pos logit contributions
wagen+0.561
amaz+0.532
pu+0.524
 municip+0.519
ells+0.518
Neg logit contributions
-0.442
 POS-0.384
-0.348
 LW-0.336
-0.335
Tokenh
Token ID71
Feature activation+4.82e-03

Pos logit contributions
+1.782
 POS+1.604
idis+1.475
 LW+1.461
+1.429
Neg logit contributions
wagen-2.078
 municip-1.967
amaz-1.909
pu-1.881
 rotor-1.871
Tokenoub
Token ID12944
Feature activation-1.29e+00

Pos logit contributions
wagen+0.036
amaz+0.033
pu+0.033
 municip+0.033
ells+0.032
Neg logit contributions
-0.039
 POS-0.035
-0.032
 LW-0.031
idis-0.031
Token Tony
Token ID8832
Feature activation+6.50e-02

Pos logit contributions
+0.349
 POS+0.314
+0.289
 LW+0.284
idis+0.281
Neg logit contributions
wagen-0.294
amaz-0.274
pu-0.269
 municip-0.267
ells-0.265
Token Shal
Token ID47974
Feature activation-2.55e-01

Pos logit contributions
wagen+0.561
amaz+0.532
pu+0.524
 municip+0.519
ells+0.518
Neg logit contributions
-0.442
 POS-0.384
-0.348
 LW-0.336
-0.335
Tokenh
Token ID71
Feature activation+4.82e-03

Pos logit contributions
+1.782
 POS+1.604
idis+1.475
 LW+1.461
+1.429
Neg logit contributions
wagen-2.078
 municip-1.967
amaz-1.909
pu-1.881
 rotor-1.871
Tokenoub
Token ID12944
Feature activation-1.29e+00

Pos logit contributions
wagen+0.036
amaz+0.033
pu+0.033
 municip+0.033
ells+0.032
Neg logit contributions
-0.039
 POS-0.035
-0.032
 LW-0.031
idis-0.031
Token
Token ID198
Feature activation-3.84e-01

Pos logit contributions
+7.646
 POS+7.166
 LW+6.419
idis+6.350
+6.089
Neg logit contributions
wagen-10.304
amaz-9.890
 municip-9.637
ells-9.584
pu-9.473
Token
Token ID198
Feature activation-1.21e+00

Pos logit contributions
+2.678
 POS+2.558
 LW+2.304
+2.253
idis+2.220
Neg logit contributions
wagen-3.127
 municip-3.019
amaz-2.891
rices-2.815
 rotor-2.767
Token2007
Token ID12726
Feature activation-1.28e-01

Pos logit contributions
+4.249
 POS+3.173
wine+2.915
+2.829
+2.806
Neg logit contributions
wagen-13.285
 municip-13.069
 rotor-12.776
amaz-12.698
ells-12.661
Token Charlie
Token ID11526
Feature activation+1.30e-01

Pos logit contributions
+1.064
 POS+0.966
+0.884
 LW+0.876
idis+0.865
Neg logit contributions
wagen-0.906
amaz-0.841
pu-0.822
 municip-0.821
ells-0.813
Token Sheen
Token ID50189
Feature activation-7.89e-01

Pos logit contributions
wagen+0.871
amaz+0.816
pu+0.801
ells+0.790
 municip+0.788
Neg logit contributions
-1.136
 POS-1.016
-0.945
 LW-0.921
idious-0.919
Token 2008
Token ID3648
Feature activation+1.68e-01

Pos logit contributions
+5.618
 POS+5.156
 LW+4.682
idis+4.586
+4.573
Neg logit contributions
wagen-6.057
amaz-5.756
 municip-5.638
ells-5.586
pu-5.560
Token Alec
Token ID38285
Feature activation-1.91e-01

Pos logit contributions
wagen+1.148
amaz+1.082
pu+1.064
ells+1.045
rac+1.041
Neg logit contributions
-1.439
 POS-1.268
-1.189
-1.149
idious-1.148
Token year
Token ID614
Feature activation+1.50e-02

Pos logit contributions
+1.003
 POS+0.919
 LW+0.831
+0.823
idious+0.815
Neg logit contributions
wagen-0.884
amaz-0.819
pu-0.806
ells-0.798
 municip-0.796
Token,
Token ID11
Feature activation+7.33e-02

Pos logit contributions
wagen+0.111
amaz+0.104
pu+0.102
 municip+0.101
ells+0.101
Neg logit contributions
-0.121
 POS-0.108
-0.099
 LW-0.097
-0.097
Token but
Token ID475
Feature activation+9.70e-02

Pos logit contributions
wagen+0.531
amaz+0.497
pu+0.488
 municip+0.485
ells+0.481
Neg logit contributions
-0.600
 POS-0.534
-0.494
 LW-0.480
-0.479
Token if
Token ID611
Feature activation-3.37e-01

Pos logit contributions
wagen+0.726
amaz+0.682
pu+0.668
 municip+0.665
ells+0.658
Neg logit contributions
-0.772
 POS-0.682
-0.631
-0.612
idious-0.611
Token the
Token ID262
Feature activation-2.80e-02

Pos logit contributions
+2.445
 POS+2.196
 LW+1.983
+1.967
idis+1.917
Neg logit contributions
wagen-2.710
amaz-2.554
pu-2.518
ells-2.499
 municip-2.479
Token Jets
Token ID14728
Feature activation+9.38e-01

Pos logit contributions
+0.223
 POS+0.201
+0.184
 LW+0.182
idious+0.179
Neg logit contributions
wagen-0.208
amaz-0.194
pu-0.191
 municip-0.189
ells-0.189
Token want
Token ID765
Feature activation+3.14e-01

Pos logit contributions
wagen+5.739
 rotor+5.353
pu+5.313
amaz+5.280
 municip+5.251
Neg logit contributions
-8.334
 POS-6.975
-6.745
idious-6.623
-6.618
Token to
Token ID284
Feature activation+6.33e-02

Pos logit contributions
wagen+2.253
amaz+2.128
pu+2.060
 municip+2.058
ells+2.046
Neg logit contributions
-2.491
 POS-2.242
-2.096
-2.068
idious-2.055
Token see
Token ID766
Feature activation-5.07e-02

Pos logit contributions
wagen+0.490
amaz+0.462
pu+0.452
 municip+0.449
ells+0.447
Neg logit contributions
-0.486
 POS-0.436
-0.396
 LW-0.390
idis-0.383
Token playoff
Token ID13119
Feature activation-2.08e-01

Pos logit contributions
+0.392
 POS+0.351
+0.320
 LW+0.315
idis+0.311
Neg logit contributions
wagen-0.390
amaz-0.365
pu-0.359
 municip-0.356
ells-0.355
Token hockey
Token ID12217
Feature activation-1.98e-01

Pos logit contributions
+1.672
 POS+1.522
+1.368
 LW+1.358
idious+1.336
Neg logit contributions
wagen-1.534
amaz-1.424
pu-1.412
 municip-1.394
ells-1.389
Token of
Token ID286
Feature activation-8.41e-02

Pos logit contributions
wagen+0.456
amaz+0.430
pu+0.424
 municip+0.421
ells+0.418
Neg logit contributions
-0.485
 POS-0.431
-0.400
 LW-0.387
idis-0.385
Token "
Token ID366
Feature activation+1.63e-01

Pos logit contributions
+0.656
 POS+0.593
+0.537
 LW+0.531
idis+0.521
Neg logit contributions
wagen-0.643
amaz-0.601
pu-0.586
ells-0.580
 municip-0.579
Tokenoff
Token ID2364
Feature activation+1.60e-01

Pos logit contributions
wagen+1.219
amaz+1.153
pu+1.138
rac+1.119
ells+1.117
Neg logit contributions
-1.289
 POS-1.149
-1.053
 LW-1.032
idious-1.021
Token-
Token ID12
Feature activation+4.14e-01

Pos logit contributions
wagen+1.253
amaz+1.198
pu+1.180
rac+1.173
ells+1.169
Neg logit contributions
-1.201
 POS-1.048
-0.974
 LW-0.937
idis-0.925
Tokenthe
Token ID1169
Feature activation+4.65e-01

Pos logit contributions
wagen+2.565
pu+2.455
amaz+2.445
rac+2.442
ells+2.378
Neg logit contributions
-3.704
 POS-3.344
-3.139
 LW-3.058
idious-3.006
Token-
Token ID12
Feature activation+9.10e-01

Pos logit contributions
wagen+3.528
pu+3.344
amaz+3.329
ells+3.285
rac+3.276
Neg logit contributions
-3.622
 POS-3.078
-2.858
 awa-2.857
idious-2.775
Tokenrecord
Token ID22105
Feature activation-3.60e-01

Pos logit contributions
pu+4.387
rac+4.325
wagen+4.299
ells+4.166
amaz+4.145
Neg logit contributions
-9.483
 POS-8.384
 awa-8.078
 LW-7.766
 Lauder-7.708
Token"
Token ID1
Feature activation-4.26e-01

Pos logit contributions
+2.686
 POS+2.430
+2.185
 LW+2.147
+2.131
Neg logit contributions
wagen-2.834
amaz-2.675
pu-2.573
ells-2.553
 municip-2.543
Token information
Token ID1321
Feature activation-3.74e-01

Pos logit contributions
+3.090
 POS+2.842
 LW+2.512
+2.507
+2.468
Neg logit contributions
wagen-3.416
amaz-3.217
pu-3.101
ells-3.086
rices-3.041
Token,
Token ID11
Feature activation-3.66e-01

Pos logit contributions
+2.732
 POS+2.467
+2.226
 LW+2.214
idious+2.177
Neg logit contributions
wagen-2.988
amaz-2.821
pu-2.744
ells-2.713
rices-2.670
Token noting
Token ID10820
Feature activation-1.15e-01

Pos logit contributions
+2.640
 POS+2.420
+2.146
 LW+2.141
+2.104
Neg logit contributions
wagen-2.934
amaz-2.781
ells-2.690
pu-2.673
rices-2.669
Token are
Token ID389
Feature activation-4.25e-02

Pos logit contributions
wagen+1.318
amaz+1.228
pu+1.208
 municip+1.200
ells+1.198
Neg logit contributions
-1.540
 POS-1.363
-1.272
-1.237
 LW-1.233
Token a
Token ID257
Feature activation+3.57e-01

Pos logit contributions
+0.330
 POS+0.298
+0.270
 LW+0.265
idis+0.262
Neg logit contributions
wagen-0.325
amaz-0.304
pu-0.299
ells-0.295
 municip-0.294
Token bit
Token ID1643
Feature activation-1.24e-01

Pos logit contributions
wagen+2.259
amaz+2.092
pu+2.061
 municip+2.047
ells+2.017
Neg logit contributions
-3.229
 POS-2.871
-2.721
 LW-2.650
idis-2.643
Token safer
Token ID14178
Feature activation-3.78e-02

Pos logit contributions
+0.962
 POS+0.866
+0.787
 LW+0.770
idious+0.763
Neg logit contributions
wagen-0.955
amaz-0.896
pu-0.877
ells-0.870
 municip-0.867
Token than
Token ID621
Feature activation+1.37e-01

Pos logit contributions
+0.296
 POS+0.264
+0.242
 LW+0.237
idious+0.235
Neg logit contributions
wagen-0.287
amaz-0.269
pu-0.264
 municip-0.261
ells-0.261
Token they
Token ID484
Feature activation+9.88e-01

Pos logit contributions
wagen+1.035
amaz+0.972
pu+0.954
 municip+0.947
ells+0.942
Neg logit contributions
-1.073
 POS-0.953
-0.876
 LW-0.854
idis-0.848
Token otherwise
Token ID4306
Feature activation+6.40e-01

Pos logit contributions
wagen+5.587
ells+5.275
pu+5.259
amaz+5.194
 medic+5.157
Neg logit contributions
-9.247
-7.388
 POS-7.365
-7.307
lihood-7.130
Token might
Token ID1244
Feature activation+6.20e-01

Pos logit contributions
wagen+4.062
pu+3.770
amaz+3.765
ells+3.764
 municip+3.654
Neg logit contributions
-5.692
 POS-4.875
-4.679
-4.649
idis-4.510
Token be
Token ID307
Feature activation+3.04e-01

Pos logit contributions
wagen+3.545
amaz+3.277
ells+3.240
pu+3.207
 municip+3.107
Neg logit contributions
-5.769
 POS-5.086
-4.902
-4.834
idis-4.804
Token.
Token ID13
Feature activation-1.18e-01

Pos logit contributions
wagen+2.204
amaz+2.068
pu+2.032
 municip+2.030
ells+2.011
Neg logit contributions
-2.460
 POS-2.188
-2.004
 LW-1.970
-1.963
Token So
Token ID1406
Feature activation+1.44e-01

Pos logit contributions
+0.944
 POS+0.873
+0.789
 LW+0.789
idis+0.763
Neg logit contributions
wagen-0.875
amaz-0.809
 municip-0.802
pu-0.789
ells-0.777
Tokena
Token ID64
Feature activation+9.76e-02

Pos logit contributions
wagen+0.253
amaz+0.238
 municip+0.235
pu+0.234
ells+0.231
Neg logit contributions
-0.209
 POS-0.185
-0.167
 LW-0.163
idis-0.163
Token.
Token ID13
Feature activation+1.32e-01

Pos logit contributions
wagen+0.760
amaz+0.716
pu+0.704
ells+0.695
 municip+0.695
Neg logit contributions
-0.747
 POS-0.660
-0.605
 LW-0.589
idious-0.585
Tokendot
Token ID26518
Feature activation+1.16e-01

Pos logit contributions
wagen+0.835
amaz+0.775
pu+0.759
 municip+0.750
ells+0.746
Neg logit contributions
-1.210
 POS-1.092
-1.019
 LW-0.997
-0.991
Token.
Token ID13
Feature activation+3.41e-01

Pos logit contributions
wagen+0.894
amaz+0.843
pu+0.830
ells+0.819
 municip+0.815
Neg logit contributions
-0.901
 POS-0.791
-0.729
idious-0.707
 LW-0.707
Tokengov
Token ID9567
Feature activation+4.65e-01

Pos logit contributions
wagen+1.731
amaz+1.580
pu+1.572
ells+1.530
rac+1.508
Neg logit contributions
-3.559
 POS-3.207
-3.016
-2.980
idious-2.976
Token/
Token ID14
Feature activation+1.02e+00

Pos logit contributions
wagen+2.812
pu+2.625
amaz+2.597
rac+2.544
ells+2.543
Neg logit contributions
-4.338
 POS-3.786
-3.593
 awa-3.589
idious-3.562
Tokenpeople
Token ID15332
Feature activation-1.32e-01

Pos logit contributions
wagen+5.905
pu+5.856
rac+5.589
amaz+5.568
ells+5.441
Neg logit contributions
-8.991
 POS-7.977
-7.559
-7.390
 Lauder-7.338
Token/
Token ID14
Feature activation+2.42e-01

Pos logit contributions
+1.099
 POS+0.985
+0.911
 LW+0.892
idis+0.889
Neg logit contributions
wagen-0.927
amaz-0.870
pu-0.847
 municip-0.846
ells-0.837
Tokenin
Token ID259
Feature activation-1.35e-01

Pos logit contributions
wagen+1.727
amaz+1.628
pu+1.608
ells+1.571
rac+1.569
Neg logit contributions
-2.000
 POS-1.787
-1.641
 LW-1.608
-1.597
Tokenjury
Token ID21871
Feature activation-2.05e-01

Pos logit contributions
+0.954
 POS+0.840
+0.763
 LW+0.741
idious+0.739
Neg logit contributions
wagen-1.117
amaz-1.057
pu-1.035
 municip-1.034
ells-1.025
Token/
Token ID14
Feature activation+1.86e-01

Pos logit contributions
+1.696
 POS+1.520
+1.402
 LW+1.372
idis+1.370
Neg logit contributions
wagen-1.464
amaz-1.370
 municip-1.336
pu-1.332
ells-1.323
Token,
Token ID11
Feature activation+4.43e-01

Pos logit contributions
+0.048
 POS+0.043
+0.040
 LW+0.039
+0.039
Neg logit contributions
wagen-0.045
amaz-0.042
pu-0.041
 municip-0.041
ells-0.041
Token El
Token ID2574
Feature activation+1.27e-01

Pos logit contributions
wagen+3.080
amaz+2.913
pu+2.866
 municip+2.856
ells+2.790
Neg logit contributions
-3.733
 POS-3.252
-3.017
-2.991
idious-2.950
Tokenj
Token ID73
Feature activation-1.43e-01

Pos logit contributions
wagen+0.935
amaz+0.878
pu+0.863
 municip+0.851
ells+0.849
Neg logit contributions
-1.033
 POS-0.921
-0.849
 LW-0.828
-0.823
Tokenig
Token ID328
Feature activation+1.86e-01

Pos logit contributions
+1.168
 POS+1.044
+0.963
idis+0.945
 LW+0.943
Neg logit contributions
wagen-1.035
amaz-0.965
 municip-0.951
pu-0.945
ells-0.935
Tokenide
Token ID485
Feature activation+2.50e-01

Pos logit contributions
wagen+1.341
amaz+1.261
pu+1.239
ells+1.215
 municip+1.213
Neg logit contributions
-1.540
 POS-1.372
-1.267
 LW-1.236
-1.231
Tokeny
Token ID88
Feature activation+9.01e-01

Pos logit contributions
wagen+1.725
amaz+1.628
pu+1.611
 municip+1.577
ells+1.575
Neg logit contributions
-2.116
 POS-1.890
-1.753
idious-1.726
-1.708
Token was
Token ID373
Feature activation+3.26e-01

Pos logit contributions
wagen+6.237
pu+6.000
amaz+5.976
ells+5.809
rac+5.784
Neg logit contributions
-7.138
 POS-6.259
idious-5.806
-5.768
-5.710
Token not
Token ID407
Feature activation-9.24e-03

Pos logit contributions
wagen+2.368
amaz+2.233
pu+2.207
 municip+2.200
ells+2.153
Neg logit contributions
-2.650
 POS-2.310
-2.140
-2.103
idis-2.093
Token from
Token ID422
Feature activation-6.89e-02

Pos logit contributions
+0.071
 POS+0.064
+0.058
 LW+0.057
idious+0.056
Neg logit contributions
wagen-0.071
amaz-0.067
pu-0.065
 municip-0.065
ells-0.065
Token Il
Token ID13778
Feature activation+3.79e-01

Pos logit contributions
+0.564
 POS+0.506
+0.466
 LW+0.458
idis+0.452
Neg logit contributions
wagen-0.500
amaz-0.466
pu-0.457
 municip-0.452
ells-0.450
Tokenj
Token ID73
Feature activation+2.16e-01

Pos logit contributions
wagen+2.697
amaz+2.548
pu+2.501
rac+2.456
ells+2.449
Neg logit contributions
-3.132
 POS-2.814
-2.614
 LW-2.534
-2.520
Token Who
Token ID5338
Feature activation+1.11e-01

Pos logit contributions
+2.246
 POS+2.146
 LW+1.912
+1.880
idis+1.824
Neg logit contributions
wagen-2.359
amaz-2.186
 municip-2.175
pu-2.120
rices-2.114
Token would
Token ID561
Feature activation+7.54e-02

Pos logit contributions
wagen+0.830
amaz+0.778
pu+0.764
 municip+0.758
ells+0.755
Neg logit contributions
-0.883
 POS-0.785
-0.723
 LW-0.705
idious-0.700
Token suffer
Token ID8659
Feature activation-1.47e-01

Pos logit contributions
wagen+0.560
amaz+0.526
pu+0.515
 municip+0.511
ells+0.509
Neg logit contributions
-0.603
 POS-0.540
-0.495
 LW-0.484
idis-0.480
Token from
Token ID422
Feature activation+8.87e-02

Pos logit contributions
+1.120
 POS+1.025
+0.921
 LW+0.907
idious+0.895
Neg logit contributions
wagen-1.132
amaz-1.053
pu-1.040
ells-1.035
 municip-1.018
Token the
Token ID262
Feature activation+2.96e-02

Pos logit contributions
wagen+0.674
amaz+0.634
pu+0.622
 municip+0.619
ells+0.614
Neg logit contributions
-0.696
 POS-0.616
-0.567
 LW-0.552
idis-0.549
Token end
Token ID886
Feature activation+8.68e-02

Pos logit contributions
wagen+0.216
amaz+0.202
pu+0.198
 municip+0.196
ells+0.195
Neg logit contributions
-0.241
 POS-0.216
-0.198
 LW-0.194
idious-0.193
Token of
Token ID286
Feature activation-1.24e-01

Pos logit contributions
wagen+0.656
amaz+0.619
pu+0.605
 municip+0.601
ells+0.598
Neg logit contributions
-0.684
 POS-0.604
-0.561
 LW-0.545
idis-0.542
Token this
Token ID428
Feature activation+1.53e-01

Pos logit contributions
+0.945
 POS+0.855
+0.768
 LW+0.759
idious+0.748
Neg logit contributions
wagen-0.964
amaz-0.902
pu-0.887
ells-0.877
 municip-0.868
Token perverse
Token ID43060
Feature activation+1.59e-01

Pos logit contributions
wagen+1.115
amaz+1.045
pu+1.026
 municip+1.022
ells+1.010
Neg logit contributions
-1.256
 POS-1.112
-1.034
 LW-1.005
idis-1.000
Token subsidy
Token ID28646
Feature activation+3.64e-01

Pos logit contributions
wagen+1.162
amaz+1.087
pu+1.067
 municip+1.062
ells+1.051
Neg logit contributions
-1.289
 POS-1.140
-1.059
 LW-1.029
idis-1.026
Token of
Token ID286
Feature activation-2.25e-01

Pos logit contributions
wagen+2.595
amaz+2.419
 municip+2.389
pu+2.385
 rotor+2.343
Neg logit contributions
-3.055
 POS-2.632
-2.492
idis-2.406
 LW-2.397
Token the
Token ID262
Feature activation+2.21e-01

Pos logit contributions
wagen+2.385
amaz+2.253
ells+2.215
 municip+2.208
pu+2.206
Neg logit contributions
-2.858
 POS-2.502
-2.354
-2.290
idious-2.278
Token city
Token ID1748
Feature activation+3.89e-01

Pos logit contributions
wagen+1.507
amaz+1.407
 municip+1.383
pu+1.380
ells+1.371
Neg logit contributions
-1.904
 POS-1.687
-1.582
-1.535
idious-1.532
Token's
Token ID338
Feature activation+2.66e-01

Pos logit contributions
wagen+2.331
amaz+2.161
 municip+2.153
pu+2.144
ells+2.131
Neg logit contributions
-3.661
 POS-3.217
-3.024
idious-2.998
 LW-2.970
Token famous
Token ID5863
Feature activation-8.57e-02

Pos logit contributions
wagen+1.846
amaz+1.724
 municip+1.702
pu+1.693
ells+1.682
Neg logit contributions
-2.262
 POS-2.003
-1.871
-1.820
idious-1.820
Token Cock
Token ID23769
Feature activation+2.27e-01

Pos logit contributions
+0.719
 POS+0.655
+0.598
 LW+0.593
idis+0.584
Neg logit contributions
wagen-0.597
amaz-0.559
pu-0.545
 municip-0.536
ells-0.535
Tokeners
Token ID364
Feature activation+5.31e-01

Pos logit contributions
wagen+1.732
amaz+1.620
pu+1.597
ells+1.584
rac+1.572
Neg logit contributions
-1.811
 POS-1.582
-1.479
 LW-1.425
-1.423
Token Roses
Token ID41244
Feature activation+4.27e-01

Pos logit contributions
wagen+3.819
ells+3.539
pu+3.527
amaz+3.453
 municip+3.381
Neg logit contributions
-4.435
 POS-3.793
-3.657
-3.512
 LW-3.468
Token business
Token ID1597
Feature activation+5.32e-01

Pos logit contributions
wagen+2.969
pu+2.745
ells+2.739
 municip+2.705
amaz+2.681
Neg logit contributions
-3.677
 POS-3.145
-3.015
-2.945
 LW-2.904
Token.
Token ID13
Feature activation-2.20e-02

Pos logit contributions
wagen+3.506
pu+3.184
 municip+3.171
ells+3.160
 rotor+3.140
Neg logit contributions
-4.743
 POS-4.005
-3.893
-3.823
idis-3.758
Token
Token ID198
Feature activation+2.23e-01

Pos logit contributions
+0.189
 POS+0.168
+0.156
 LW+0.152
idious+0.152
Neg logit contributions
wagen-0.152
amaz-0.142
pu-0.139
ells-0.137
 municip-0.137
Token
Token ID198
Feature activation+3.46e-02

Pos logit contributions
wagen+1.471
amaz+1.376
pu+1.356
ells+1.334
rac+1.323
Neg logit contributions
-1.976
 POS-1.754
-1.639
idious-1.602
 LW-1.593
Token the
Token ID262
Feature activation-1.88e-01

Pos logit contributions
+1.069
 POS+0.962
+0.877
 LW+0.863
idis+0.846
Neg logit contributions
wagen-1.039
amaz-0.975
pu-0.956
 municip-0.949
ells-0.942
Token upload
Token ID9516
Feature activation-6.09e-02

Pos logit contributions
+1.494
 POS+1.392
 LW+1.255
+1.248
idis+1.210
Neg logit contributions
wagen-1.372
amaz-1.292
pu-1.260
ells-1.237
 municip-1.231
Token page
Token ID2443
Feature activation+2.74e-02

Pos logit contributions
+0.492
 POS+0.445
+0.408
 LW+0.402
idious+0.395
Neg logit contributions
wagen-0.444
amaz-0.416
pu-0.407
 municip-0.404
ells-0.401
Token.
Token ID13
Feature activation+2.41e-01

Pos logit contributions
wagen+0.206
amaz+0.194
pu+0.190
 municip+0.189
ells+0.188
Neg logit contributions
-0.216
 POS-0.192
-0.177
 LW-0.173
idious-0.171
Token
Token ID198
Feature activation+1.02e-01

Pos logit contributions
wagen+1.641
amaz+1.541
pu+1.505
ells+1.486
 municip+1.485
Neg logit contributions
-2.075
 POS-1.851
-1.724
idious-1.681
 LW-1.676
Token
Token ID198
Feature activation+2.09e-01

Pos logit contributions
wagen+0.715
amaz+0.667
 municip+0.652
pu+0.652
ells+0.643
Neg logit contributions
-0.860
 POS-0.776
-0.715
 LW-0.702
idis-0.696
TokenIn
Token ID818
Feature activation-3.99e-01

Pos logit contributions
wagen+1.544
amaz+1.456
pu+1.429
ells+1.409
 municip+1.403
Neg logit contributions
-1.677
 POS-1.498
-1.379
 LW-1.348
idious-1.338
Token later
Token ID1568
Feature activation-3.08e-01

Pos logit contributions
+2.935
 POS+2.676
+2.409
 LW+2.384
idious+2.347
Neg logit contributions
wagen-3.121
amaz-2.962
 municip-2.887
pu-2.871
ells-2.830
Token desperation
Token ID31265
Feature activation-2.46e-02

Pos logit contributions
+2.525
 POS+2.368
 LW+2.135
+2.118
idious+2.087
Neg logit contributions
wagen-2.163
amaz-2.012
pu-1.954
ells-1.925
 municip-1.920
Token,
Token ID11
Feature activation+1.79e-01

Pos logit contributions
+0.200
 POS+0.179
+0.164
 LW+0.161
idious+0.160
Neg logit contributions
wagen-0.180
amaz-0.169
pu-0.166
 municip-0.164
ells-0.163
Token I
Token ID314
Feature activation+3.05e-02

Pos logit contributions
wagen+1.298
amaz+1.219
pu+1.192
 municip+1.189
ells+1.177
Neg logit contributions
-1.468
 POS-1.292
-1.203
idious-1.168
-1.168
Token hefty
Token ID29784
Feature activation-8.79e-02

Pos logit contributions
+0.548
 POS+0.492
+0.448
 LW+0.441
idis+0.436
Neg logit contributions
wagen-0.517
amaz-0.486
pu-0.478
ells-0.472
 municip-0.471
Token new
Token ID649
Feature activation+2.46e-01

Pos logit contributions
+0.706
 POS+0.642
+0.582
 LW+0.575
idious+0.568
Neg logit contributions
wagen-0.643
amaz-0.602
pu-0.595
ells-0.587
 municip-0.581
Token driving
Token ID5059
Feature activation+2.52e-01

Pos logit contributions
wagen+1.858
amaz+1.739
 municip+1.713
pu+1.699
ells+1.679
Neg logit contributions
-1.961
 POS-1.699
-1.598
-1.540
idis-1.540
Token charges
Token ID4530
Feature activation+8.94e-04

Pos logit contributions
wagen+1.754
amaz+1.646
 municip+1.622
pu+1.595
 rotor+1.578
Neg logit contributions
-2.136
 POS-1.866
-1.756
idis-1.708
 LW-1.705
Token without
Token ID1231
Feature activation+8.47e-02

Pos logit contributions
wagen+0.007
amaz+0.006
pu+0.006
 municip+0.006
ells+0.006
Neg logit contributions
-0.007
 POS-0.006
-0.006
 LW-0.006
idious-0.006
Token first
Token ID717
Feature activation+4.46e-01

Pos logit contributions
wagen+0.640
amaz+0.601
pu+0.591
 municip+0.584
ells+0.583
Neg logit contributions
-0.665
 POS-0.594
-0.544
 LW-0.533
idis-0.527
Token creating
Token ID4441
Feature activation+1.77e-01

Pos logit contributions
wagen+3.298
amaz+3.117
 municip+3.084
pu+3.031
 rotor+2.995
Neg logit contributions
-3.561
 POS-3.050
-2.869
-2.787
idis-2.786
Token viable
Token ID13971
Feature activation+2.40e-01

Pos logit contributions
wagen+1.283
amaz+1.203
 municip+1.178
pu+1.173
ells+1.161
Neg logit contributions
-1.451
 POS-1.278
-1.193
-1.160
 LW-1.157
Token alternatives
Token ID14693
Feature activation+2.59e-01

Pos logit contributions
wagen+1.661
amaz+1.542
 municip+1.523
pu+1.510
ells+1.494
Neg logit contributions
-2.049
 POS-1.810
-1.702
 LW-1.650
idious-1.648
Token to
Token ID284
Feature activation-2.65e-02

Pos logit contributions
wagen+1.924
amaz+1.794
 municip+1.771
pu+1.769
ells+1.744
Neg logit contributions
-2.076
 POS-1.829
-1.692
 LW-1.650
idious-1.641
Token car
Token ID1097
Feature activation+4.36e-01

Pos logit contributions
+0.209
 POS+0.186
+0.170
 LW+0.167
idious+0.165
Neg logit contributions
wagen-0.200
amaz-0.188
pu-0.184
 municip-0.183
ells-0.182
Token normally
Token ID7685
Feature activation-4.04e-01

Pos logit contributions
+2.444
 POS+2.224
+1.999
 LW+1.987
idious+1.978
Neg logit contributions
wagen-2.467
amaz-2.352
pu-2.277
 municip-2.272
ells-2.262
Token used
Token ID973
Feature activation-1.88e-01

Pos logit contributions
+2.923
 POS+2.625
+2.342
 LW+2.339
idious+2.303
Neg logit contributions
wagen-3.261
amaz-3.080
pu-2.986
ells-2.969
rac-2.943
Token in
Token ID287
Feature activation-2.37e-02

Pos logit contributions
+1.462
 POS+1.314
+1.198
 LW+1.174
idious+1.159
Neg logit contributions
wagen-1.425
amaz-1.339
pu-1.319
ells-1.304
rac-1.286
Token overseas
Token ID11292
Feature activation-1.53e-01

Pos logit contributions
+0.200
 POS+0.181
+0.166
 LW+0.164
idious+0.162
Neg logit contributions
wagen-0.165
amaz-0.154
pu-0.150
ells-0.148
 municip-0.148
Token wars
Token ID9976
Feature activation+3.51e-02

Pos logit contributions
+1.289
 POS+1.196
+1.080
 LW+1.079
idious+1.066
Neg logit contributions
wagen-1.053
amaz-0.990
pu-0.954
ells-0.942
 municip-0.931
Token for
Token ID329
Feature activation+1.15e-01

Pos logit contributions
wagen+0.260
amaz+0.244
pu+0.239
 municip+0.237
ells+0.236
Neg logit contributions
-0.282
 POS-0.252
-0.232
 LW-0.226
-0.224
Token states
Token ID2585
Feature activation-1.33e-01

Pos logit contributions
wagen+0.813
amaz+0.761
pu+0.746
 municip+0.746
ells+0.738
Neg logit contributions
-0.966
 POS-0.861
-0.803
 LW-0.777
-0.776
Tokenide
Token ID485
Feature activation-5.75e-01

Pos logit contributions
+1.033
 POS+0.918
+0.840
idious+0.823
 LW+0.821
Neg logit contributions
wagen-1.016
amaz-0.960
pu-0.935
 municip-0.929
ells-0.928
Token law
Token ID1099
Feature activation-1.54e-01

Pos logit contributions
+4.253
 POS+3.959
 LW+3.570
idious+3.491
 berth+3.453
Neg logit contributions
wagen-4.445
amaz-4.162
pu-4.033
ells-3.997
rices-3.973
Token enforcement
Token ID5394
Feature activation-1.27e-01

Pos logit contributions
+1.049
 POS+0.914
+0.828
 LW+0.803
idious+0.795
Neg logit contributions
wagen-1.332
amaz-1.262
pu-1.241
 municip-1.233
ells-1.228
Token purposes
Token ID4959
Feature activation-2.59e-01

Pos logit contributions
+1.003
 POS+0.904
+0.820
 LW+0.813
idious+0.804
Neg logit contributions
wagen-0.961
amaz-0.902
pu-0.880
 municip-0.870
ells-0.870
Token AP
Token ID3486
Feature activation+1.71e-01

Pos logit contributions
wagen+1.494
amaz+1.400
pu+1.376
 municip+1.366
ells+1.354
Neg logit contributions
-1.821
 POS-1.614
-1.506
-1.460
 LW-1.458
TokenX
Token ID55
Feature activation-4.83e-02

Pos logit contributions
wagen+1.011
amaz+0.942
pu+0.922
ells+0.905
rac+0.897
Neg logit contributions
-1.627
 POS-1.468
-1.376
 LW-1.345
idious-1.345
TokenS
Token ID50
Feature activation+1.77e-01

Pos logit contributions
+0.403
 POS+0.365
+0.337
 LW+0.329
idis+0.327
Neg logit contributions
wagen-0.345
amaz-0.321
 municip-0.316
pu-0.311
ells-0.308
Token Principal
Token ID32641
Feature activation-4.73e-01

Pos logit contributions
wagen+1.218
amaz+1.137
pu+1.114
 municip+1.105
ells+1.098
Neg logit contributions
-1.513
 POS-1.356
-1.257
 LW-1.229
idious-1.221
Token Investigator
Token ID49499
Feature activation+1.35e-02

Pos logit contributions
+3.811
 POS+3.701
 LW+3.460
idis+3.307
wine+3.241
Neg logit contributions
wagen-3.289
amaz-3.049
 municip-3.046
pu-2.910
rices-2.860
Token R
Token ID371
Feature activation-8.85e-02

Pos logit contributions
wagen+0.094
amaz+0.088
pu+0.086
 municip+0.085
ells+0.085
Neg logit contributions
-0.115
 POS-0.103
-0.095
 LW-0.093
idious-0.093
Tokenalf
Token ID1604
Feature activation-1.36e-01

Pos logit contributions
+0.676
 POS+0.599
+0.549
 LW+0.537
idis+0.535
Neg logit contributions
wagen-0.689
amaz-0.648
 municip-0.633
pu-0.633
ells-0.626
Token G
Token ID402
Feature activation-8.42e-02

Pos logit contributions
+1.108
 POS+1.001
+0.917
 LW+0.906
idis+0.897
Neg logit contributions
wagen-0.974
amaz-0.911
 municip-0.891
pu-0.890
ells-0.876
Tokenell
Token ID695
Feature activation-3.53e-02

Pos logit contributions
+0.684
 POS+0.613
+0.565
 LW+0.553
idis+0.551
Neg logit contributions
wagen-0.610
amaz-0.572
pu-0.560
 municip-0.559
ells-0.551
Tokenert
Token ID861
Feature activation+1.01e-01

Pos logit contributions
+0.278
 POS+0.246
+0.227
 LW+0.221
+0.220
Neg logit contributions
wagen-0.268
amaz-0.251
pu-0.247
 municip-0.244
ells-0.244
Token of
Token ID286
Feature activation+2.22e-01

Pos logit contributions
wagen+0.757
amaz+0.706
pu+0.695
ells+0.687
rac+0.682
Neg logit contributions
-0.813
 POS-0.717
-0.662
-0.645
 LW-0.641
Token call
Token ID869
Feature activation+1.19e-01

Pos logit contributions
wagen+0.710
amaz+0.664
pu+0.651
 municip+0.645
ells+0.642
Neg logit contributions
-0.829
 POS-0.741
-0.684
 LW-0.669
idious-0.665
Token you
Token ID345
Feature activation+6.84e-02

Pos logit contributions
wagen+0.855
amaz+0.808
pu+0.790
 municip+0.780
ells+0.777
Neg logit contributions
-0.981
 POS-0.873
-0.806
 LW-0.787
idious-0.784
Token a
Token ID257
Feature activation+4.62e-03

Pos logit contributions
wagen+0.500
amaz+0.469
pu+0.460
 municip+0.456
ells+0.453
Neg logit contributions
-0.555
 POS-0.495
-0.456
 LW-0.446
idis-0.443
Token liar
Token ID31866
Feature activation-3.47e-02

Pos logit contributions
wagen+0.031
amaz+0.029
pu+0.028
 municip+0.028
ells+0.028
Neg logit contributions
-0.040
 POS-0.036
-0.034
 LW-0.033
idious-0.033
Token.
Token ID13
Feature activation+3.85e-01

Pos logit contributions
+0.278
 POS+0.248
+0.228
 LW+0.223
idious+0.221
Neg logit contributions
wagen-0.258
amaz-0.242
pu-0.238
 municip-0.236
ells-0.235
Token
Token ID198
Feature activation-2.37e-01

Pos logit contributions
wagen+2.545
amaz+2.427
pu+2.385
ells+2.333
rac+2.306
Neg logit contributions
-3.313
 POS-2.936
-2.737
idious-2.719
idis-2.662
Token
Token ID198
Feature activation+2.70e-01

Pos logit contributions
+1.778
 POS+1.657
 LW+1.492
+1.484
idis+1.457
Neg logit contributions
wagen-1.851
 municip-1.750
amaz-1.709
rices-1.653
pu-1.640
Token13
Token ID1485
Feature activation-9.45e-02

Pos logit contributions
wagen+1.795
amaz+1.715
pu+1.675
ells+1.643
rac+1.642
Neg logit contributions
-2.313
 POS-2.101
-1.930
 LW-1.903
idious-1.902
Token.
Token ID13
Feature activation+9.94e-02

Pos logit contributions
+0.695
 POS+0.621
+0.562
 LW+0.554
idis+0.544
Neg logit contributions
wagen-0.765
amaz-0.717
 municip-0.709
pu-0.702
ells-0.694
Token Resident
Token ID22373
Feature activation-1.36e-02

Pos logit contributions
wagen+0.677
amaz+0.633
pu+0.621
 municip+0.612
ells+0.611
Neg logit contributions
-0.855
 POS-0.766
-0.710
 LW-0.693
idious-0.693
Token Evil
Token ID10461
Feature activation+2.41e-01

Pos logit contributions
+0.166
 POS+0.155
+0.146
 LW+0.145
idious+0.144
Neg logit contributions
wagen-0.044
amaz-0.038
pu-0.035
 municip-0.035
ells-0.035
Token However
Token ID2102
Feature activation-6.56e-02

Pos logit contributions
+0.027
 POS+0.024
+0.022
 LW+0.022
idis+0.022
Neg logit contributions
wagen-0.024
amaz-0.023
pu-0.022
 municip-0.022
ells-0.022
Token,
Token ID11
Feature activation+2.84e-01

Pos logit contributions
+0.513
 POS+0.458
+0.419
 LW+0.411
idious+0.407
Neg logit contributions
wagen-0.497
amaz-0.469
pu-0.459
 municip-0.455
ells-0.453
Token maybe
Token ID3863
Feature activation+4.78e-01

Pos logit contributions
wagen+2.108
amaz+1.962
pu+1.932
 municip+1.926
 rotor+1.905
Neg logit contributions
-2.294
 POS-2.009
-1.874
-1.817
idious-1.808
Token a
Token ID257
Feature activation+1.28e-01

Pos logit contributions
wagen+3.488
amaz+3.218
 rotor+3.172
pu+3.167
 municip+3.156
Neg logit contributions
-3.912
 POS-3.348
-3.173
idious-3.078
-3.073
Token niche
Token ID21404
Feature activation+1.14e-01

Pos logit contributions
wagen+0.929
amaz+0.866
pu+0.853
 municip+0.848
ells+0.838
Neg logit contributions
-1.047
 POS-0.928
-0.862
 LW-0.835
idious-0.834
Token for
Token ID329
Feature activation-5.55e-02

Pos logit contributions
wagen+0.853
amaz+0.796
pu+0.786
 municip+0.778
ells+0.771
Neg logit contributions
-0.911
 POS-0.799
-0.747
 LW-0.718
idious-0.717
Token a
Token ID257
Feature activation-2.18e-01

Pos logit contributions
+0.434
 POS+0.389
+0.354
 LW+0.348
idis+0.344
Neg logit contributions
wagen-0.422
amaz-0.397
pu-0.389
 municip-0.386
ells-0.384
Token simple
Token ID2829
Feature activation-1.22e-01

Pos logit contributions
+1.727
 POS+1.585
 LW+1.422
+1.416
idis+1.387
Neg logit contributions
wagen-1.605
amaz-1.524
pu-1.477
ells-1.472
 municip-1.460
Token jet
Token ID12644
Feature activation-4.17e-02

Pos logit contributions
+0.981
 POS+0.890
+0.808
 LW+0.802
idious+0.789
Neg logit contributions
wagen-0.892
amaz-0.840
pu-0.819
ells-0.812
 municip-0.812
Token engine
Token ID3113
Feature activation-4.78e-02

Pos logit contributions
+0.343
 POS+0.310
+0.283
 LW+0.280
idious+0.277
Neg logit contributions
wagen-0.298
amaz-0.280
 municip-0.273
pu-0.273
ells-0.271
Token can
Token ID460
Feature activation+3.57e-01

Pos logit contributions
+0.372
 POS+0.338
+0.305
 LW+0.304
idious+0.300
Neg logit contributions
wagen-0.360
amaz-0.341
pu-0.331
 municip-0.330
ells-0.329
Token M
Token ID337
Feature activation-1.88e-01

Pos logit contributions
wagen+0.831
amaz+0.775
pu+0.762
 municip+0.757
ells+0.751
Neg logit contributions
-0.984
 POS-0.877
-0.814
 LW-0.792
-0.791
TokenATE
Token ID6158
Feature activation-4.12e-01

Pos logit contributions
+1.249
 POS+1.093
+0.981
 LW+0.957
idious+0.946
Neg logit contributions
wagen-1.658
amaz-1.570
pu-1.538
 municip-1.531
ells-1.522
Token 16
Token ID1467
Feature activation-1.75e-01

Pos logit contributions
+3.226
 POS+3.010
 LW+2.737
+2.670
idis+2.607
Neg logit contributions
wagen-3.011
amaz-2.865
pu-2.743
rahim-2.733
 municip-2.706
Token.
Token ID13
Feature activation+5.88e-02

Pos logit contributions
+1.285
 POS+1.151
+1.044
 LW+1.026
idis+1.005
Neg logit contributions
wagen-1.396
amaz-1.324
pu-1.291
 municip-1.284
ells-1.274
Token04
Token ID3023
Feature activation+1.10e-02

Pos logit contributions
wagen+0.517
amaz+0.491
pu+0.480
 municip+0.479
ells+0.477
Neg logit contributions
-0.389
 POS-0.343
-0.305
 LW-0.301
idis-0.295
Token+
Token ID10
Feature activation-9.55e-02

Pos logit contributions
wagen+0.083
amaz+0.078
pu+0.076
 municip+0.076
ells+0.075
Neg logit contributions
-0.088
 POS-0.078
-0.072
 LW-0.070
idious-0.070
Token at
Token ID379
Feature activation-8.24e-03

Pos logit contributions
+0.751
 POS+0.677
+0.615
 LW+0.608
idis+0.598
Neg logit contributions
wagen-0.717
amaz-0.677
pu-0.661
 municip-0.654
ells-0.652
Token all
Token ID477
Feature activation-4.36e-01

Pos logit contributions
+0.063
 POS+0.056
+0.052
 LW+0.050
idis+0.050
Neg logit contributions
wagen-0.064
amaz-0.060
pu-0.059
 municip-0.058
ells-0.058
Token,
Token ID11
Feature activation+2.17e-01

Pos logit contributions
+3.290
 POS+2.980
+2.661
 LW+2.625
idious+2.591
Neg logit contributions
wagen-3.369
amaz-3.227
pu-3.155
ells-3.119
 municip-3.068
Token I
Token ID314
Feature activation+2.97e-01

Pos logit contributions
wagen+1.650
amaz+1.542
pu+1.522
 municip+1.516
ells+1.496
Neg logit contributions
-1.700
 POS-1.481
-1.381
-1.343
idious-1.338
Token
Token ID447
Feature activation-2.81e-01

Pos logit contributions
wagen+2.021
amaz+1.883
pu+1.861
 municip+1.841
ells+1.833
Neg logit contributions
-2.540
 POS-2.254
-2.092
 LW-2.052
idious-2.052
Token may
Token ID743
Feature activation+1.75e-01

Pos logit contributions
+0.044
 POS+0.039
+0.036
 LW+0.035
idious+0.035
Neg logit contributions
wagen-0.043
amaz-0.040
pu-0.040
 municip-0.039
ells-0.039
Token be
Token ID307
Feature activation-9.75e-02

Pos logit contributions
wagen+1.145
amaz+1.062
pu+1.044
 municip+1.032
ells+1.029
Neg logit contributions
-1.562
 POS-1.395
-1.309
 LW-1.276
-1.272
Token saved
Token ID7448
Feature activation-3.35e-01

Pos logit contributions
+0.758
 POS+0.690
+0.626
 LW+0.615
idious+0.609
Neg logit contributions
wagen-0.740
amaz-0.694
pu-0.674
 municip-0.671
ells-0.668
Token to
Token ID284
Feature activation-3.68e-01

Pos logit contributions
+2.391
 POS+2.166
+1.937
idious+1.919
 LW+1.907
Neg logit contributions
wagen-2.727
amaz-2.545
pu-2.494
ells-2.468
 municip-2.462
Token the
Token ID262
Feature activation-5.33e-01

Pos logit contributions
+2.785
 POS+2.509
 LW+2.243
+2.235
idis+2.181
Neg logit contributions
wagen-2.854
amaz-2.639
 municip-2.592
pu-2.590
rac-2.542
Token government
Token ID1230
Feature activation-6.83e-02

Pos logit contributions
+4.217
 POS+3.993
 LW+3.585
+3.512
 berth+3.469
Neg logit contributions
wagen-3.824
amaz-3.496
pu-3.399
rac-3.359
ells-3.356
Token?"
Token ID1701
Feature activation+2.77e-02

Pos logit contributions
+0.515
 POS+0.461
+0.420
 LW+0.411
idious+0.407
Neg logit contributions
wagen-0.536
amaz-0.504
pu-0.494
 municip-0.490
ells-0.488
Token Besides
Token ID16238
Feature activation-8.54e-02

Pos logit contributions
wagen+0.204
amaz+0.191
pu+0.187
 municip+0.186
ells+0.185
Neg logit contributions
-0.223
 POS-0.199
-0.183
 LW-0.179
idious-0.178
Token,
Token ID11
Feature activation-6.99e-02

Pos logit contributions
+0.670
 POS+0.601
+0.550
 LW+0.538
idious+0.534
Neg logit contributions
wagen-0.645
amaz-0.605
pu-0.593
 municip-0.587
ells-0.587
Token he
Token ID339
Feature activation-7.40e-02

Pos logit contributions
+0.548
 POS+0.494
+0.453
 LW+0.444
idis+0.439
Neg logit contributions
wagen-0.526
amaz-0.496
pu-0.485
 municip-0.479
ells-0.478
Token argued
Token ID7189
Feature activation-2.28e-01

Pos logit contributions
+0.615
 POS+0.565
+0.519
 LW+0.512
idis+0.506
Neg logit contributions
wagen-0.516
amaz-0.482
pu-0.472
 municip-0.465
ells-0.463
Tokenx
Token ID87
Feature activation-2.15e-01

Pos logit contributions
+1.916
 POS+1.759
+1.610
idis+1.594
 LW+1.590
Neg logit contributions
wagen-1.666
 municip-1.571
amaz-1.535
 rotor-1.490
rices-1.473
Token.
Token ID13
Feature activation-4.12e-01

Pos logit contributions
+1.479
 POS+1.346
+1.203
 LW+1.198
idis+1.174
Neg logit contributions
wagen-1.819
 municip-1.720
amaz-1.704
pu-1.653
rices-1.642
Tokenx
Token ID87
Feature activation-2.34e-01

Pos logit contributions
+3.154
 POS+2.912
idis+2.675
+2.650
 LW+2.620
Neg logit contributions
wagen-3.048
 municip-2.963
amaz-2.808
 rotor-2.783
rices-2.722
Token.
Token ID13
Feature activation-4.57e-01

Pos logit contributions
+1.682
 POS+1.496
+1.365
 LW+1.336
idis+1.318
Neg logit contributions
wagen-1.918
amaz-1.799
 municip-1.793
pu-1.755
ells-1.744
Token175
Token ID17430
Feature activation-6.93e-01

Pos logit contributions
+3.259
 POS+2.863
+2.642
 LW+2.575
idis+2.552
Neg logit contributions
wagen-3.728
 municip-3.481
amaz-3.467
ells-3.377
pu-3.349
Token 99
Token ID7388
Feature activation-8.65e-01

Pos logit contributions
+4.766
 POS+4.737
 LW+4.366
+4.034
idis+4.009
Neg logit contributions
wagen-5.417
 municip-5.054
amaz-5.021
pu-4.882
rices-4.869
Token33
Token ID2091
Feature activation-2.96e-01

Pos logit contributions
+5.731
 POS+5.288
+4.931
 LW+4.776
idis+4.602
Neg logit contributions
wagen-7.057
 municip-6.953
amaz-6.471
rices-6.434
 rotor-6.398
Token87
Token ID5774
Feature activation-5.48e-01

Pos logit contributions
+2.170
 POS+1.989
+1.838
 LW+1.781
idis+1.760
Neg logit contributions
wagen-2.359
 municip-2.216
amaz-2.196
rices-2.127
 rotor-2.101
Token x
Token ID2124
Feature activation-4.03e-02

Pos logit contributions
+2.120
 POS+1.996
 LW+1.596
+1.553
idis+1.481
Neg logit contributions
wagen-6.122
amaz-5.849
 municip-5.734
ells-5.669
rices-5.649
Token.
Token ID13
Feature activation-1.15e-01

Pos logit contributions
+0.299
 POS+0.266
+0.243
 LW+0.237
idis+0.235
Neg logit contributions
wagen-0.323
amaz-0.304
 municip-0.298
pu-0.297
ells-0.294
Tokenx
Token ID87
Feature activation-2.02e-01

Pos logit contributions
+0.972
 POS+0.886
+0.816
 LW+0.802
idis+0.802
Neg logit contributions
wagen-0.788
 municip-0.729
amaz-0.728
pu-0.704
 rotor-0.699
Token
Token ID198
Feature activation-1.17e-01

Pos logit contributions
+2.153
 POS+1.953
+1.771
 LW+1.761
idis+1.757
Neg logit contributions
wagen-2.006
amaz-1.903
 municip-1.868
pu-1.858
ells-1.825
Token
Token ID198
Feature activation-8.60e-01

Pos logit contributions
+0.922
 POS+0.850
+0.770
 LW+0.767
idis+0.753
Neg logit contributions
wagen-0.879
 municip-0.823
amaz-0.817
pu-0.788
rices-0.785
Token2005
Token ID14315
Feature activation-1.68e-01

Pos logit contributions
+2.955
 POS+2.163
+1.914
+1.912
wine+1.745
Neg logit contributions
wagen-9.706
 municip-9.690
amaz-9.412
 rotor-9.261
ells-9.116
Token Eric
Token ID7651
Feature activation+7.20e-02

Pos logit contributions
+1.389
 POS+1.270
+1.157
 LW+1.151
idis+1.130
Neg logit contributions
wagen-1.192
amaz-1.106
pu-1.081
 municip-1.080
ells-1.070
Token McCorm
Token ID42037
Feature activation+9.37e-02

Pos logit contributions
wagen+0.603
amaz+0.571
pu+0.564
 municip+0.557
ells+0.556
Neg logit contributions
-0.510
 POS-0.442
-0.406
-0.390
 LW-0.389
Tokenack
Token ID441
Feature activation-9.56e-01

Pos logit contributions
wagen+0.630
amaz+0.591
pu+0.576
ells+0.571
rac+0.566
Neg logit contributions
-0.823
 POS-0.735
-0.682
-0.665
 LW-0.664
Token
Token ID198
Feature activation-2.30e-01

Pos logit contributions
+6.504
 POS+6.116
 LW+5.480
idis+5.453
+5.324
Neg logit contributions
wagen-7.301
amaz-7.035
 municip-6.822
ells-6.715
pu-6.685
Token
Token ID198
Feature activation-9.44e-01

Pos logit contributions
+1.718
 POS+1.610
 LW+1.450
+1.439
idis+1.411
Neg logit contributions
wagen-1.796
 municip-1.707
amaz-1.666
rices-1.613
pu-1.596
Token2005
Token ID14315
Feature activation-3.22e-01

Pos logit contributions
+3.291
 POS+2.397
+2.153
+2.139
wine+1.965
Neg logit contributions
wagen-10.543
 municip-10.420
amaz-10.210
 rotor-10.018
ells-9.907
Token Ray
Token ID7760
Feature activation+8.19e-01

Pos logit contributions
+2.652
 POS+2.455
 LW+2.231
+2.212
idis+2.178
Neg logit contributions
wagen-2.275
amaz-2.096
 municip-2.063
pu-2.051
ells-2.031
Token Rom
Token ID3570
Feature activation-1.57e-02

Pos logit contributions
wagen+0.812
pu+0.779
 Lib+0.732
rac+0.723
ells+0.668
Neg logit contributions
-11.344
 POS-10.302
 awa-10.260
-10.200
-10.014
Token,[
Token ID17414
Feature activation-1.31e+00

Pos logit contributions
+0.627
 POS+0.548
+0.496
 LW+0.483
idis+0.477
Neg logit contributions
wagen-0.782
amaz-0.740
pu-0.727
 municip-0.722
ells-0.719
Token98
Token ID4089
Feature activation-2.19e-01

Pos logit contributions
+4.313
 POS+3.628
+3.431
+3.048
 LW+2.916
Neg logit contributions
wagen-13.751
amaz-13.495
 municip-13.232
rahim-13.070
ells-12.931
Token]
Token ID60
Feature activation+1.86e-01

Pos logit contributions
+1.830
 POS+1.639
+1.513
 LW+1.484
+1.470
Neg logit contributions
wagen-1.557
amaz-1.458
pu-1.424
 municip-1.413
ells-1.406
Token Sporting
Token ID31790
Feature activation+1.61e-01

Pos logit contributions
wagen+1.225
amaz+1.143
pu+1.135
 municip+1.118
ells+1.107
Neg logit contributions
-1.630
 POS-1.438
-1.358
idious-1.326
-1.325
Token News
Token ID3000
Feature activation-4.73e-02

Pos logit contributions
wagen+1.191
amaz+1.116
pu+1.096
 municip+1.089
ells+1.083
Neg logit contributions
-1.303
 POS-1.150
-1.064
idious-1.034
idis-1.031
Token,[
Token ID17414
Feature activation-1.18e+00

Pos logit contributions
+0.321
 POS+0.280
+0.252
 LW+0.246
idis+0.243
Neg logit contributions
wagen-0.410
amaz-0.388
pu-0.382
 municip-0.379
ells-0.378
Token99
Token ID2079
Feature activation-5.48e-01

Pos logit contributions
+5.137
 POS+4.685
+4.431
+4.119
wine+4.116
Neg logit contributions
wagen-11.184
amaz-10.772
rahim-10.724
ells-10.724
 municip-10.648
Token]
Token ID60
Feature activation+2.94e-01

Pos logit contributions
+4.230
 POS+3.777
+3.456
 LW+3.419
+3.339
Neg logit contributions
wagen-4.140
amaz-3.907
pu-3.767
 municip-3.749
ells-3.745
Token SB
Token ID18056
Feature activation+3.81e-01

Pos logit contributions
wagen+1.923
amaz+1.795
pu+1.794
 municip+1.762
rac+1.748
Neg logit contributions
-2.582
 POS-2.254
-2.151
idious-2.103
-2.100
Token Nation
Token ID8741
Feature activation-5.37e-01

Pos logit contributions
wagen+1.660
amaz+1.493
ells+1.468
pu+1.467
rac+1.464
Neg logit contributions
-4.183
 POS-3.742
-3.586
-3.552
idious-3.526
Token,[
Token ID17414
Feature activation-1.03e+00

Pos logit contributions
+3.136
 POS+2.694
+2.369
 LW+2.362
idis+2.310
Neg logit contributions
wagen-5.056
amaz-4.802
pu-4.690
 municip-4.669
ells-4.647
Token a
Token ID257
Feature activation-8.83e-02

Pos logit contributions
+0.705
 POS+0.639
+0.577
idious+0.570
 LW+0.567
Neg logit contributions
wagen-0.727
amaz-0.680
pu-0.667
ells-0.662
 municip-0.660
Token thing
Token ID1517
Feature activation+3.35e-01

Pos logit contributions
+0.657
 POS+0.593
+0.534
 LW+0.527
idious+0.524
Neg logit contributions
wagen-0.699
amaz-0.657
pu-0.645
ells-0.641
 municip-0.638
Token as
Token ID355
Feature activation+3.09e-02

Pos logit contributions
wagen+2.268
amaz+2.143
pu+2.088
 municip+2.078
ells+2.064
Neg logit contributions
-2.860
 POS-2.551
-2.367
 LW-2.328
idis-2.311
Token a
Token ID257
Feature activation+5.94e-02

Pos logit contributions
wagen+0.226
amaz+0.211
pu+0.208
 municip+0.206
ells+0.206
Neg logit contributions
-0.250
 POS-0.224
-0.206
 LW-0.202
idious-0.201
Token '
Token ID705
Feature activation-1.92e-01

Pos logit contributions
wagen+0.414
amaz+0.386
pu+0.378
 municip+0.375
ells+0.374
Neg logit contributions
-0.503
 POS-0.453
-0.417
 LW-0.410
idious-0.407
TokenBo
Token ID16635
Feature activation-8.57e-01

Pos logit contributions
+1.582
 POS+1.415
+1.317
idious+1.287
 LW+1.281
Neg logit contributions
wagen-1.355
amaz-1.254
 municip-1.248
ells-1.229
pu-1.222
Tokenhem
Token ID4411
Feature activation-5.95e-01

Pos logit contributions
+4.236
 POS+3.552
idious+3.168
idis+3.153
 LW+3.098
Neg logit contributions
wagen-8.402
 municip-7.995
pu-7.904
 rotor-7.685
 Pistons-7.674
Tokenian
Token ID666
Feature activation-1.41e-02

Pos logit contributions
+2.106
 POS+1.621
idis+1.310
+1.303
idious+1.272
Neg logit contributions
wagen-6.847
amaz-6.544
pu-6.498
 municip-6.496
ells-6.471
Token sp
Token ID599
Feature activation+2.47e-01

Pos logit contributions
+0.104
 POS+0.092
+0.084
 LW+0.082
idious+0.081
Neg logit contributions
wagen-0.113
amaz-0.107
pu-0.105
 municip-0.104
ells-0.104
Tokenarrow
Token ID6018
Feature activation-7.32e-02

Pos logit contributions
wagen+2.112
amaz+2.016
pu+1.973
ells+1.968
rac+1.960
Neg logit contributions
-1.724
 POS-1.483
-1.336
-1.286
 LW-1.283
Token slaying
Token ID45183
Feature activation+4.68e-01

Pos logit contributions
+0.568
 POS+0.503
+0.462
 LW+0.451
idis+0.447
Neg logit contributions
wagen-0.563
amaz-0.529
pu-0.520
 municip-0.516
ells-0.513
Token Unt
Token ID26970
Feature activation-8.92e-02

Pos logit contributions
wagen+0.754
amaz+0.706
pu+0.692
 municip+0.684
ells+0.681
Neg logit contributions
-0.925
 POS-0.820
-0.765
 LW-0.743
-0.742
Tokenrust
Token ID11469
Feature activation+1.25e-01

Pos logit contributions
+0.771
 POS+0.698
+0.641
 LW+0.635
idious+0.634
Neg logit contributions
wagen-0.602
amaz-0.557
 municip-0.546
pu-0.544
ells-0.536
Tokenworthy
Token ID18275
Feature activation+1.21e-01

Pos logit contributions
wagen+0.769
amaz+0.716
pu+0.700
ells+0.688
 municip+0.687
Neg logit contributions
-1.169
 POS-1.049
-0.987
 LW-0.961
-0.959
Token
Token ID198
Feature activation-1.23e-01

Pos logit contributions
wagen+0.852
amaz+0.794
pu+0.780
 municip+0.774
ells+0.765
Neg logit contributions
-1.016
 POS-0.898
-0.838
-0.813
 LW-0.812
Token
Token ID198
Feature activation-2.53e-01

Pos logit contributions
+0.967
 POS+0.891
+0.806
 LW+0.804
idis+0.787
Neg logit contributions
wagen-0.922
 municip-0.859
amaz-0.853
pu-0.826
rices-0.820
TokenWhy
Token ID5195
Feature activation-1.00e+00

Pos logit contributions
+2.015
 POS+1.793
+1.654
 LW+1.611
+1.604
Neg logit contributions
wagen-1.895
amaz-1.766
pu-1.734
 municip-1.732
ells-1.713
Token 18
Token ID1248
Feature activation-1.11e-01

Pos logit contributions
+5.534
 POS+5.383
 LW+4.845
+4.574
idis+4.451
Neg logit contributions
wagen-8.811
 municip-8.132
amaz-8.068
rices-8.048
pu-7.943
Token81
Token ID6659
Feature activation+1.97e-01

Pos logit contributions
+0.791
 POS+0.708
+0.641
 LW+0.629
idis+0.615
Neg logit contributions
wagen-0.922
amaz-0.869
 municip-0.853
pu-0.846
ells-0.842
Token?
Token ID30
Feature activation+2.34e-01

Pos logit contributions
wagen+1.486
amaz+1.420
pu+1.395
ells+1.379
rac+1.364
Neg logit contributions
-1.526
 POS-1.332
-1.239
idious-1.216
-1.203
Token Robert
Token ID5199
Feature activation+3.37e-01

Pos logit contributions
wagen+1.600
amaz+1.509
pu+1.486
ells+1.457
rac+1.440
Neg logit contributions
-2.005
 POS-1.760
-1.646
idious-1.614
-1.611
Token Louis
Token ID5593
Feature activation+1.04e-01

Pos logit contributions
wagen+2.323
amaz+2.200
pu+2.185
ells+2.146
rac+2.132
Neg logit contributions
-2.853
 POS-2.494
-2.378
-2.285
idious-2.261