TOP ACTIVATIONS
MAX = 6.184

allies in Ukraine exposes not only their current wars , but
sin (_ Time . y / _ P ulse Per iod
his s remind us not only of lost fidelity , but
made us want to not only dream but actually pursue this
's rant was not so much the content , but rather
Card that will help not only your Vend read Deck ,
new liberty applies to not only sex but also race .
How Vera Rubin changed science Dear Science : Could
, it 's not so much that they 're bad players
Chall engers "] [ Site " W ijk a an Z
used this golden opportunity not only to continue its Juda isation
immediately with mechanisms to not only derive the necessary information ,
Joe V itt Aff idav it Scott ie
abduct ing children - not only from their homes , but
in history is alarming not only in and of itself ,
biotech alliance , wasted no time in expressing its delight after
GDP have already changed not only their legal headquarters but ,
12 , 000 App rox imate number of adult Buddh
Vera Rubin changed science Dear Science : Could my
the architecture and nature me ld into one ($ 80 ,

MIN ACTIVATIONS
MIN = -6.104

a start . <|endoftext|> Abstract Background Depression presents a
[ H / t The Hill ] <|endoftext|> L
start . <|endoftext|> Abstract Background Depression presents a significant
a head ) and fl ot illa ( four people on
Recommend Share this comment on Facebook Share this comment on Twitter
+ 1 2 18 K Shares Act ress Amy
avier Dum us que et al . 2014 . The Kepler
Recommend Share this comment on Facebook Share this comment on Twitter
Recommend Share this comment on Facebook Share this comment on Twitter
Rated 5 out of 5 by Cory B from The best
10 / 6 / 08 Rating : 5 Crit
pur ak ushi Profile Joined August 2012 United States
in connection with the fl ot illa incident and will prevent
: 47 ] < sh o ja > Frank er Z
a Gaza - bound fl ot illa that killed eight Turkish
a , T > D ere f for Shared < '
. Although he 's only briefly seen , Row lf had
Rated 5 out of 5 by Quake D ude from Explosive
org / document / t att oo - compl aint
the amount of health the monsters you 'll face have .

INTERVAL 3.112 - 6.184
CONTAINS 0.224%

proposed sales includes 19 F - 16 V jets made by
, Lv iv and Iv ano - Frank iv sk .
from their carry - on baggage when going through airline security
Forum for the 2013 / 14 season New castle
2 "] [ EC O " A 46 "] [ White

INTERVAL 0.040 - 3.112
CONTAINS 47.447%

. Obama said with damp eyes Mrs . Obama
� s dangerous . In a recent article for
noticed that everyone picked up on that very clear theme .
, and that more people tend to do more things .
is why the trade school has such a high job placement

INTERVAL -3.032 - 0.040
CONTAINS 52.096%

were playing the Jones - co ached Celtics .
The Donald , bringing people together * Clinton pledges
� light � � ). Good SET ON FIRE
can prepare for their client � � s upcoming trial which
actions merely served to hard en the young pun ks '

INTERVAL -6.104 - -3.032
CONTAINS 0.232%

When I joined Netflix in 2014 , we had
again to all of you for your interest and assistance .
avier Dum us que et al . 2014 . The Kepler
, then apply the black hat # morph ological operator to
Stalin ism ; an ers atz " social ism "
Token allies
Token ID7681
Feature activation+6.18e-01

Pos logit contributions
istered+0.331
 incumbent+0.308
uscript+0.297
 Imran+0.291
 Warden+0.290
Neg logit contributions
\">-0.372
 <+-0.367
agine-0.357
cin-0.356
andre-0.346
Token in
Token ID287
Feature activation+4.32e-01

Pos logit contributions
\">+0.490
 <++0.485
agine+0.469
cin+0.464
andre+0.456
Neg logit contributions
istered-0.436
 incumbent-0.404
uscript-0.393
 Imran-0.387
sembly-0.381
Token Ukraine
Token ID7049
Feature activation+1.75e-01

Pos logit contributions
\">+0.324
 <++0.322
agine+0.309
cin+0.307
andre+0.302
Neg logit contributions
istered-0.324
 incumbent-0.298
uscript-0.295
 Imran-0.287
sembly-0.287
Token exposes
Token ID32142
Feature activation+6.05e-01

Pos logit contributions
\">+0.139
 <++0.138
agine+0.133
cin+0.132
andre+0.130
Neg logit contributions
istered-0.122
 incumbent-0.112
uscript-0.111
 Imran-0.108
sembly-0.107
Token not
Token ID407
Feature activation+4.70e-01

Pos logit contributions
\">+0.504
 <++0.496
agine+0.480
cin+0.478
andre+0.470
Neg logit contributions
istered-0.406
 incumbent-0.376
uscript-0.365
 Imran-0.358
sembly-0.352
Token only
Token ID691
Feature activation+6.18e+00

Pos logit contributions
\">+0.384
 <++0.380
agine+0.366
cin+0.366
andre+0.359
Neg logit contributions
istered-0.321
 incumbent-0.296
uscript-0.290
 Imran-0.283
sembly-0.281
Token their
Token ID511
Feature activation+1.51e+00

Pos logit contributions
 <++4.666
\">+4.603
cin+4.477
agine+4.437
andre+4.381
Neg logit contributions
istered-4.428
uscript-4.061
sembly-4.031
 incumbent-3.867
 Imran-3.789
Token current
Token ID1459
Feature activation+2.11e+00

Pos logit contributions
\">+1.134
 <++1.125
agine+1.082
cin+1.076
andre+1.058
Neg logit contributions
istered-1.130
 incumbent-1.041
uscript-1.032
sembly-1.004
 Imran-1.002
Token wars
Token ID9976
Feature activation+3.64e-01

Pos logit contributions
\">+1.582
 <++1.572
agine+1.511
cin+1.506
andre+1.480
Neg logit contributions
istered-1.570
 incumbent-1.437
uscript-1.436
sembly-1.403
 Imran-1.387
Token,
Token ID11
Feature activation-3.53e-01

Pos logit contributions
\">+0.279
 <++0.277
agine+0.266
cin+0.265
andre+0.260
Neg logit contributions
istered-0.266
 incumbent-0.244
uscript-0.242
sembly-0.236
 Imran-0.235
Token but
Token ID475
Feature activation+3.76e-01

Pos logit contributions
istered+0.255
 incumbent+0.237
uscript+0.230
 Imran+0.227
sembly+0.223
Neg logit contributions
\">-0.274
 <+-0.270
agine-0.261
cin-0.259
andre-0.256
Tokensin
Token ID31369
Feature activation+2.74e-01

Pos logit contributions
istered+1.994
 incumbent+1.836
uscript+1.812
sembly+1.767
 Imran+1.750
Neg logit contributions
\">-1.868
 <+-1.859
agine-1.782
andre-1.734
cin-1.733
Token (_
Token ID44104
Feature activation-2.70e-01

Pos logit contributions
\">+0.219
 <++0.217
agine+0.210
cin+0.208
andre+0.206
Neg logit contributions
istered-0.191
 incumbent-0.174
uscript-0.173
sembly-0.167
 Imran-0.166
TokenTime
Token ID7575
Feature activation+2.30e-01

Pos logit contributions
istered+0.198
 incumbent+0.182
uscript+0.181
sembly+0.176
 Imran+0.175
Neg logit contributions
\">-0.205
 <+-0.204
agine-0.196
cin-0.195
andre-0.192
Token.
Token ID13
Feature activation+1.09e+00

Pos logit contributions
\">+0.174
 <++0.172
agine+0.167
cin+0.165
andre+0.163
Neg logit contributions
istered-0.170
 incumbent-0.156
uscript-0.155
sembly-0.151
 Imran-0.150
Tokeny
Token ID88
Feature activation-6.94e-01

Pos logit contributions
\">+0.910
 <++0.904
agine+0.873
cin+0.868
andre+0.856
Neg logit contributions
istered-0.720
 incumbent-0.654
uscript-0.650
sembly-0.630
 Imran-0.625
Token /
Token ID1220
Feature activation+6.16e+00

Pos logit contributions
istered+0.544
 incumbent+0.503
uscript+0.500
sembly+0.485
 Imran+0.484
Neg logit contributions
\">-0.490
 <+-0.480
agine-0.469
cin-0.461
andre-0.454
Token _
Token ID4808
Feature activation-3.50e-01

Pos logit contributions
 <++4.402
\">+4.376
cin+4.257
agine+4.199
andre+4.062
Neg logit contributions
istered-4.615
uscript-4.292
 incumbent-4.232
sembly-4.191
 Imran-4.117
TokenP
Token ID47
Feature activation+3.57e-01

Pos logit contributions
istered+0.280
 incumbent+0.259
uscript+0.258
sembly+0.252
 Imran+0.250
Neg logit contributions
\">-0.243
 <+-0.241
agine-0.231
cin-0.230
andre-0.226
Tokenulse
Token ID9615
Feature activation+1.91e-01

Pos logit contributions
\">+0.176
 <++0.175
agine+0.163
cin+0.162
andre+0.158
Neg logit contributions
istered-0.356
 incumbent-0.335
uscript-0.334
 Imran-0.327
sembly-0.324
TokenPer
Token ID5990
Feature activation-7.38e-01

Pos logit contributions
\">+0.137
 <++0.136
agine+0.133
cin+0.132
andre+0.129
Neg logit contributions
istered-0.148
 incumbent-0.135
uscript-0.134
 Imran-0.132
 Warden-0.129
Tokeniod
Token ID2101
Feature activation-5.89e-01

Pos logit contributions
istered+0.697
uscript+0.651
 incumbent+0.649
 Imran+0.634
sembly+0.628
Neg logit contributions
\">-0.408
 <+-0.406
agine-0.383
cin-0.370
andre-0.367
Token his
Token ID465
Feature activation-1.02e+00

Pos logit contributions
\">+0.089
 <++0.088
agine+0.085
cin+0.084
andre+0.083
Neg logit contributions
istered-0.078
 incumbent-0.071
uscript-0.071
sembly-0.068
 Imran-0.068
Tokens
Token ID82
Feature activation+3.47e-01

Pos logit contributions
istered+0.999
 incumbent+0.934
uscript+0.933
sembly+0.917
 Imran+0.911
Neg logit contributions
\">-0.523
 <+-0.519
agine-0.489
cin-0.478
andre-0.470
Token remind
Token ID7101
Feature activation+1.03e+00

Pos logit contributions
\">+0.271
 <++0.269
agine+0.259
cin+0.258
andre+0.252
Neg logit contributions
istered-0.249
 incumbent-0.227
uscript-0.226
sembly-0.220
 Imran-0.218
Token us
Token ID514
Feature activation+8.61e-01

Pos logit contributions
\">+0.697
 <++0.691
agine+0.662
cin+0.659
andre+0.646
Neg logit contributions
istered-0.843
 incumbent-0.780
uscript-0.777
sembly-0.760
 Imran-0.755
Token not
Token ID407
Feature activation+6.12e-01

Pos logit contributions
\">+0.660
 <++0.656
agine+0.631
cin+0.628
andre+0.618
Neg logit contributions
istered-0.627
 incumbent-0.574
uscript-0.572
sembly-0.558
 Imran-0.553
Token only
Token ID691
Feature activation+5.97e+00

Pos logit contributions
\">+0.481
 <++0.478
agine+0.461
cin+0.459
andre+0.451
Neg logit contributions
istered-0.434
 incumbent-0.397
uscript-0.395
sembly-0.384
 Imran-0.382
Token of
Token ID286
Feature activation+2.83e+00

Pos logit contributions
\">+4.148
 <++4.128
agine+3.886
cin+3.885
andre+3.828
Neg logit contributions
istered-4.627
uscript-4.229
 incumbent-4.218
sembly-4.212
 Imran-4.111
Token lost
Token ID2626
Feature activation+1.53e+00

Pos logit contributions
\">+2.258
 <++2.240
agine+2.160
cin+2.153
andre+2.117
Neg logit contributions
istered-1.961
 incumbent-1.774
uscript-1.771
sembly-1.733
 Imran-1.710
Token fidelity
Token ID37744
Feature activation+1.30e+00

Pos logit contributions
\">+1.223
 <++1.216
agine+1.171
cin+1.167
andre+1.148
Neg logit contributions
istered-1.062
 incumbent-0.971
uscript-0.965
sembly-0.937
 Imran-0.932
Token,
Token ID11
Feature activation-9.49e-01

Pos logit contributions
\">+0.975
 <++0.968
agine+0.931
cin+0.927
andre+0.910
Neg logit contributions
istered-0.966
 incumbent-0.886
uscript-0.883
sembly-0.861
 Imran-0.854
Token but
Token ID475
Feature activation-4.38e-01

Pos logit contributions
istered+0.686
 incumbent+0.636
uscript+0.629
 Imran+0.610
sembly+0.607
Neg logit contributions
\">-0.728
 <+-0.723
agine-0.700
cin-0.694
andre-0.684
Token made
Token ID925
Feature activation+1.09e+00

Pos logit contributions
\">+1.492
 <++1.485
agine+1.428
cin+1.423
andre+1.402
Neg logit contributions
istered-1.351
 incumbent-1.235
uscript-1.231
sembly-1.198
 Imran-1.187
Token us
Token ID514
Feature activation+3.88e-01

Pos logit contributions
\">+0.822
 <++0.815
agine+0.785
cin+0.783
andre+0.768
Neg logit contributions
istered-0.808
 incumbent-0.744
uscript-0.740
sembly-0.720
 Imran-0.717
Token want
Token ID765
Feature activation+7.00e-01

Pos logit contributions
\">+0.307
 <++0.303
cin+0.292
agine+0.291
andre+0.287
Neg logit contributions
istered-0.273
 incumbent-0.251
uscript-0.249
 Imran-0.242
sembly-0.241
Token to
Token ID284
Feature activation+5.06e-01

Pos logit contributions
\">+0.549
 <++0.545
agine+0.525
cin+0.523
andre+0.514
Neg logit contributions
istered-0.498
 incumbent-0.456
uscript-0.453
sembly-0.441
 Imran-0.438
Token not
Token ID407
Feature activation+7.62e-02

Pos logit contributions
\">+0.389
 <++0.386
cin+0.372
agine+0.372
andre+0.366
Neg logit contributions
istered-0.365
 incumbent-0.336
uscript-0.333
 Imran-0.323
sembly-0.323
Token only
Token ID691
Feature activation+5.78e+00

Pos logit contributions
\">+0.060
 <++0.060
agine+0.058
cin+0.057
andre+0.057
Neg logit contributions
istered-0.053
 incumbent-0.049
uscript-0.049
sembly-0.047
 Imran-0.047
Token dream
Token ID4320
Feature activation+6.84e-01

Pos logit contributions
\">+4.349
 <++4.348
agine+4.144
cin+4.063
andre+3.947
Neg logit contributions
istered-4.258
 incumbent-3.868
sembly-3.864
uscript-3.837
 Imran-3.698
Token but
Token ID475
Feature activation-1.61e-01

Pos logit contributions
\">+0.540
 <++0.538
agine+0.517
cin+0.515
andre+0.507
Neg logit contributions
istered-0.483
 incumbent-0.440
uscript-0.438
sembly-0.427
 Imran-0.424
Token actually
Token ID1682
Feature activation+4.03e-01

Pos logit contributions
istered+0.113
 incumbent+0.103
uscript+0.102
 Imran+0.099
sembly+0.099
Neg logit contributions
\">-0.127
 <+-0.125
cin-0.122
agine-0.121
andre-0.119
Token pursue
Token ID10660
Feature activation-4.14e-01

Pos logit contributions
\">+0.315
 <++0.312
cin+0.301
agine+0.300
andre+0.297
Neg logit contributions
istered-0.287
 incumbent-0.264
uscript-0.261
sembly-0.254
 Imran-0.253
Token this
Token ID428
Feature activation-2.57e-01

Pos logit contributions
istered+0.289
 incumbent+0.267
uscript+0.263
 Imran+0.256
sembly+0.254
Neg logit contributions
\">-0.329
 <+-0.323
cin-0.315
agine-0.314
andre-0.310
Token's
Token ID338
Feature activation-3.53e-01

Pos logit contributions
istered+0.492
 incumbent+0.454
uscript+0.450
 Imran+0.437
sembly+0.434
Neg logit contributions
\">-0.535
 <+-0.530
agine-0.511
cin-0.507
andre-0.498
Token rant
Token ID30993
Feature activation-4.09e-02

Pos logit contributions
istered+0.258
 incumbent+0.239
uscript+0.235
 Imran+0.230
sembly+0.229
Neg logit contributions
\">-0.270
 <+-0.267
agine-0.258
cin-0.256
andre-0.251
Token was
Token ID373
Feature activation+2.54e-01

Pos logit contributions
istered+0.030
 incumbent+0.027
uscript+0.027
sembly+0.026
 Imran+0.026
Neg logit contributions
\">-0.032
 <+-0.031
agine-0.030
cin-0.030
andre-0.029
Token not
Token ID407
Feature activation+1.38e-01

Pos logit contributions
\">+0.200
 <++0.199
agine+0.191
cin+0.191
andre+0.187
Neg logit contributions
istered-0.180
 incumbent-0.164
uscript-0.164
sembly-0.159
 Imran-0.158
Token so
Token ID523
Feature activation+2.29e+00

Pos logit contributions
\">+0.109
 <++0.108
agine+0.104
cin+0.104
andre+0.102
Neg logit contributions
istered-0.098
 incumbent-0.090
uscript-0.089
sembly-0.087
 Imran-0.086
Token much
Token ID881
Feature activation+5.59e+00

Pos logit contributions
\">+1.708
 <++1.700
agine+1.632
cin+1.629
andre+1.604
Neg logit contributions
istered-1.700
uscript-1.563
 incumbent-1.556
sembly-1.527
 Imran-1.500
Token the
Token ID262
Feature activation+1.89e+00

Pos logit contributions
\">+4.083
 <++4.058
agine+3.896
cin+3.871
andre+3.810
Neg logit contributions
istered-4.165
uscript-3.845
sembly-3.738
 incumbent-3.702
 Imran-3.548
Token content
Token ID2695
Feature activation+5.01e-01

Pos logit contributions
\">+1.396
 <++1.388
agine+1.332
cin+1.327
andre+1.304
Neg logit contributions
istered-1.423
 incumbent-1.304
uscript-1.303
sembly-1.271
 Imran-1.259
Token,
Token ID11
Feature activation-3.30e-01

Pos logit contributions
\">+0.387
 <++0.384
agine+0.371
cin+0.368
andre+0.362
Neg logit contributions
istered-0.364
 incumbent-0.334
uscript-0.331
sembly-0.322
 Imran-0.321
Token but
Token ID475
Feature activation+1.29e-01

Pos logit contributions
istered+0.241
 incumbent+0.223
uscript+0.220
 Imran+0.215
sembly+0.214
Neg logit contributions
\">-0.251
 <+-0.248
agine-0.241
cin-0.239
andre-0.235
Token rather
Token ID2138
Feature activation+4.97e-01

Pos logit contributions
\">+0.104
 <++0.103
agine+0.099
cin+0.099
andre+0.097
Neg logit contributions
istered-0.090
 incumbent-0.082
uscript-0.082
sembly-0.080
 Imran-0.079
Token Card
Token ID5172
Feature activation-1.15e+00

Pos logit contributions
istered+0.625
 incumbent+0.574
uscript+0.574
sembly+0.555
 Imran+0.551
Neg logit contributions
\">-0.668
 <+-0.663
agine-0.642
cin-0.637
andre-0.630
Token that
Token ID326
Feature activation+1.61e+00

Pos logit contributions
istered+0.822
uscript+0.761
 incumbent+0.759
 Imran+0.730
sembly+0.725
Neg logit contributions
\">-0.879
 <+-0.871
agine-0.847
cin-0.841
andre-0.828
Token will
Token ID481
Feature activation+2.23e-01

Pos logit contributions
\">+1.277
 <++1.268
agine+1.220
cin+1.216
andre+1.197
Neg logit contributions
istered-1.127
 incumbent-1.028
uscript-1.024
sembly-0.998
 Imran-0.987
Token help
Token ID1037
Feature activation+4.25e-01

Pos logit contributions
\">+0.172
 <++0.171
agine+0.165
cin+0.164
andre+0.162
Neg logit contributions
istered-0.160
 incumbent-0.148
uscript-0.146
 Imran-0.141
sembly-0.141
Token not
Token ID407
Feature activation-1.47e+00

Pos logit contributions
\">+0.318
 <++0.315
agine+0.305
cin+0.304
andre+0.299
Neg logit contributions
istered-0.316
 incumbent-0.293
uscript-0.289
 Imran-0.281
sembly-0.280
Token only
Token ID691
Feature activation+5.55e+00

Pos logit contributions
istered+1.000
 incumbent+0.919
uscript+0.907
 Imran+0.881
sembly+0.876
Neg logit contributions
\">-1.199
 <+-1.186
agine-1.145
cin-1.137
andre-1.119
Token your
Token ID534
Feature activation+2.49e+00

Pos logit contributions
\">+4.249
 <++4.225
cin+3.989
agine+3.973
andre+3.929
Neg logit contributions
istered-3.982
uscript-3.668
sembly-3.607
 incumbent-3.599
 Imran-3.466
Token Vend
Token ID44220
Feature activation+1.30e+00

Pos logit contributions
\">+1.998
 <++1.990
agine+1.911
cin+1.906
andre+1.876
Neg logit contributions
istered-1.720
 incumbent-1.565
uscript-1.562
sembly-1.520
 Imran-1.505
Tokenread
Token ID961
Feature activation+7.64e-01

Pos logit contributions
\">+0.844
 <++0.836
agine+0.805
cin+0.800
andre+0.781
Neg logit contributions
istered-1.086
 incumbent-1.010
uscript-0.998
sembly-0.986
 Imran-0.985
Token Deck
Token ID20961
Feature activation+5.20e-01

Pos logit contributions
\">+0.620
 <++0.616
agine+0.594
cin+0.591
andre+0.582
Neg logit contributions
istered-0.523
 incumbent-0.477
uscript-0.474
sembly-0.461
 Imran-0.458
Token,
Token ID11
Feature activation-1.98e-01

Pos logit contributions
\">+0.397
 <++0.394
agine+0.380
cin+0.378
andre+0.372
Neg logit contributions
istered-0.379
 incumbent-0.349
uscript-0.347
sembly-0.337
 Imran-0.335
Token new
Token ID649
Feature activation+5.54e-01

Pos logit contributions
\">+0.456
 <++0.453
agine+0.436
cin+0.434
andre+0.426
Neg logit contributions
istered-0.441
 incumbent-0.405
uscript-0.402
sembly-0.391
 Imran-0.390
Token liberty
Token ID12354
Feature activation+9.91e-01

Pos logit contributions
\">+0.430
 <++0.427
agine+0.412
cin+0.409
andre+0.402
Neg logit contributions
istered-0.398
 incumbent-0.366
uscript-0.363
 Imran-0.351
sembly-0.351
Token applies
Token ID8991
Feature activation+1.26e+00

Pos logit contributions
\">+0.767
 <++0.763
agine+0.735
cin+0.730
andre+0.719
Neg logit contributions
istered-0.716
 incumbent-0.661
uscript-0.650
 Imran-0.631
sembly-0.627
Token to
Token ID284
Feature activation-5.40e-02

Pos logit contributions
\">+0.994
 <++0.985
agine+0.951
cin+0.946
andre+0.929
Neg logit contributions
istered-0.890
 incumbent-0.819
uscript-0.808
 Imran-0.783
sembly-0.782
Token not
Token ID407
Feature activation-1.47e+00

Pos logit contributions
istered+0.036
 incumbent+0.033
uscript+0.033
 Imran+0.032
sembly+0.031
Neg logit contributions
\">-0.045
 <+-0.044
agine-0.043
cin-0.043
andre-0.042
Token only
Token ID691
Feature activation+5.39e+00

Pos logit contributions
istered+0.996
 incumbent+0.916
uscript+0.903
 Imran+0.872
sembly+0.865
Neg logit contributions
\">-1.204
 <+-1.188
agine-1.151
cin-1.144
andre-1.128
Token sex
Token ID1714
Feature activation+8.00e-01

Pos logit contributions
\">+4.207
 <++4.194
cin+3.964
agine+3.956
andre+3.918
Neg logit contributions
istered-3.790
sembly-3.441
uscript-3.423
 incumbent-3.346
 Imran-3.266
Token but
Token ID475
Feature activation-3.20e-01

Pos logit contributions
\">+0.603
 <++0.599
agine+0.575
cin+0.573
andre+0.564
Neg logit contributions
istered-0.593
 incumbent-0.545
uscript-0.542
sembly-0.529
 Imran-0.525
Token also
Token ID635
Feature activation+3.01e-01

Pos logit contributions
istered+0.224
 incumbent+0.208
uscript+0.204
 Imran+0.199
sembly+0.196
Neg logit contributions
\">-0.253
 <+-0.249
cin-0.243
agine-0.242
andre-0.238
Token race
Token ID3234
Feature activation+1.40e-01

Pos logit contributions
\">+0.234
 <++0.232
cin+0.224
agine+0.224
andre+0.220
Neg logit contributions
istered-0.215
 incumbent-0.200
uscript-0.196
 Imran-0.191
sembly-0.190
Token.
Token ID13
Feature activation-5.56e-01

Pos logit contributions
\">+0.108
 <++0.107
agine+0.104
cin+0.103
andre+0.101
Neg logit contributions
istered-0.100
 incumbent-0.093
uscript-0.091
 Imran-0.089
sembly-0.088
TokenHow
Token ID2437
Feature activation+2.88e+00

Pos logit contributions
\">+3.393
 <++3.313
cin+3.194
agine+3.168
andre+3.142
Neg logit contributions
istered-3.701
sembly-3.456
 incumbent-3.440
uscript-3.419
 Imran-3.296
Token Vera
Token ID44413
Feature activation-1.16e-01

Pos logit contributions
\">+2.327
 <++2.304
agine+2.234
cin+2.222
andre+2.182
Neg logit contributions
istered-1.972
uscript-1.787
 incumbent-1.766
sembly-1.751
 Imran-1.706
Token Rubin
Token ID34599
Feature activation-1.36e+00

Pos logit contributions
istered+0.080
 incumbent+0.073
uscript+0.072
 Imran+0.070
sembly+0.070
Neg logit contributions
\">-0.094
 <+-0.093
cin-0.090
agine-0.090
andre-0.088
Token changed
Token ID3421
Feature activation+2.01e+00

Pos logit contributions
istered+0.949
uscript+0.867
 incumbent+0.862
 Imran+0.828
sembly+0.825
Neg logit contributions
\">-1.085
 <+-1.081
agine-1.039
cin-1.029
andre-1.015
Token science
Token ID3783
Feature activation+9.14e-02

Pos logit contributions
\">+1.611
 <++1.598
agine+1.545
cin+1.532
andre+1.508
Neg logit contributions
istered-1.392
uscript-1.265
 incumbent-1.260
sembly-1.238
 Imran-1.211
Token
Token ID198
Feature activation+5.36e+00

Pos logit contributions
\">+0.074
 <++0.074
agine+0.071
cin+0.071
andre+0.070
Neg logit contributions
istered-0.062
 incumbent-0.057
uscript-0.057
sembly-0.055
 Imran-0.055
Token
Token ID198
Feature activation+4.98e+00

Pos logit contributions
\">+4.474
 <++4.441
cin+4.213
agine+4.173
andre+4.151
Neg logit contributions
istered-3.571
sembly-3.495
uscript-3.269
 incumbent-3.199
 incorpor-3.158
TokenDear
Token ID20266
Feature activation+2.48e+00

Pos logit contributions
\">+3.510
 <++3.430
cin+3.305
agine+3.275
andre+3.255
Neg logit contributions
istered-3.913
sembly-3.673
 incumbent-3.642
uscript-3.614
 Imran-3.487
Token Science
Token ID5800
Feature activation+1.59e-01

Pos logit contributions
\">+2.029
 <++2.028
agine+1.970
cin+1.957
andre+1.929
Neg logit contributions
istered-1.662
uscript-1.494
 incumbent-1.492
sembly-1.470
 Imran-1.436
Token:
Token ID25
Feature activation-6.63e-02

Pos logit contributions
\">+0.124
 <++0.123
agine+0.119
cin+0.118
andre+0.116
Neg logit contributions
istered-0.114
 incumbent-0.105
uscript-0.104
sembly-0.101
 Imran-0.101
Token Could
Token ID10347
Feature activation+1.08e+00

Pos logit contributions
istered+0.047
 incumbent+0.043
uscript+0.043
 Imran+0.042
sembly+0.041
Neg logit contributions
\">-0.052
 <+-0.052
agine-0.050
cin-0.050
andre-0.049
Token,
Token ID11
Feature activation-7.01e-02

Pos logit contributions
istered+0.138
 incumbent+0.128
uscript+0.125
 Imran+0.122
sembly+0.120
Neg logit contributions
\">-0.153
 <+-0.151
agine-0.145
cin-0.144
andre-0.143
Token it
Token ID340
Feature activation+2.95e-01

Pos logit contributions
istered+0.049
 incumbent+0.046
uscript+0.044
 Imran+0.043
sembly+0.043
Neg logit contributions
\">-0.056
 <+-0.056
agine-0.054
cin-0.053
andre-0.052
Token's
Token ID338
Feature activation+1.42e+00

Pos logit contributions
\">+0.234
 <++0.232
agine+0.224
cin+0.223
andre+0.219
Neg logit contributions
istered-0.208
 incumbent-0.190
uscript-0.189
sembly-0.183
 Imran-0.182
Token not
Token ID407
Feature activation+5.16e-02

Pos logit contributions
\">+1.169
 <++1.162
agine+1.121
cin+1.117
andre+1.099
Neg logit contributions
istered-0.958
 incumbent-0.870
uscript-0.867
sembly-0.842
 Imran-0.835
Token so
Token ID523
Feature activation+1.87e+00

Pos logit contributions
\">+0.043
 <++0.043
cin+0.041
agine+0.041
andre+0.040
Neg logit contributions
istered-0.034
 incumbent-0.032
uscript-0.031
 Imran-0.030
sembly-0.030
Token much
Token ID881
Feature activation+5.36e+00

Pos logit contributions
\">+1.436
 <++1.428
agine+1.375
cin+1.369
andre+1.349
Neg logit contributions
istered-1.359
uscript-1.243
 incumbent-1.239
sembly-1.212
 Imran-1.195
Token that
Token ID326
Feature activation+1.72e+00

Pos logit contributions
\">+4.001
 <++3.986
agine+3.885
cin+3.845
andre+3.824
Neg logit contributions
istered-3.889
uscript-3.559
sembly-3.493
 incumbent-3.382
 Imran-3.306
Token they
Token ID484
Feature activation+4.90e-01

Pos logit contributions
\">+1.338
 <++1.330
agine+1.281
cin+1.276
andre+1.255
Neg logit contributions
istered-1.230
 incumbent-1.122
uscript-1.121
sembly-1.092
 Imran-1.080
Token're
Token ID821
Feature activation+6.19e-01

Pos logit contributions
\">+0.348
 <++0.347
agine+0.331
cin+0.329
andre+0.325
Neg logit contributions
istered-0.385
 incumbent-0.359
uscript-0.355
 Imran-0.343
sembly-0.342
Token bad
Token ID2089
Feature activation-3.14e-01

Pos logit contributions
\">+0.507
 <++0.502
agine+0.485
cin+0.483
andre+0.476
Neg logit contributions
istered-0.420
 incumbent-0.385
uscript-0.381
 Imran-0.369
sembly-0.369
Token players
Token ID1938
Feature activation+3.56e-01

Pos logit contributions
istered+0.226
 incumbent+0.210
uscript+0.206
 Imran+0.201
sembly+0.198
Neg logit contributions
\">-0.244
 <+-0.242
agine-0.233
cin-0.232
andre-0.228
Token Chall
Token ID10772
Feature activation+2.44e+00

Pos logit contributions
\">+0.506
 <++0.502
agine+0.485
cin+0.482
andre+0.475
Neg logit contributions
istered-0.464
 incumbent-0.426
uscript-0.422
sembly-0.411
 Imran-0.409
Tokenengers
Token ID9302
Feature activation+1.61e+00

Pos logit contributions
\">+1.468
 <++1.460
agine+1.397
cin+1.393
andre+1.352
Neg logit contributions
istered-2.174
uscript-2.024
 incumbent-2.024
sembly-1.984
 Imran-1.956
Token"]
Token ID8973
Feature activation+1.86e+00

Pos logit contributions
\">+1.075
 <++1.069
agine+1.024
cin+1.016
andre+1.001
Neg logit contributions
istered-1.327
 incumbent-1.232
uscript-1.223
sembly-1.194
 Imran-1.192
Token [
Token ID685
Feature activation+2.35e+00

Pos logit contributions
\">+1.443
 <++1.436
cin+1.373
agine+1.363
andre+1.340
Neg logit contributions
istered-1.342
sembly-1.213
uscript-1.213
 incumbent-1.209
 Imran-1.176
TokenSite
Token ID29123
Feature activation+1.01e+00

Pos logit contributions
\">+1.299
 <++1.294
agine+1.238
cin+1.219
andre+1.201
Neg logit contributions
istered-2.212
 incumbent-2.080
uscript-2.070
 Imran-1.996
sembly-1.988
Token "
Token ID366
Feature activation+5.28e+00

Pos logit contributions
\">+0.810
 <++0.805
agine+0.776
cin+0.773
andre+0.761
Neg logit contributions
istered-0.699
 incumbent-0.638
uscript-0.634
sembly-0.616
 Imran-0.612
TokenW
Token ID54
Feature activation+9.51e-01

Pos logit contributions
\">+4.040
 <++3.997
cin+3.864
agine+3.816
andre+3.758
Neg logit contributions
istered-3.832
 incumbent-3.534
uscript-3.509
sembly-3.447
 Imran-3.378
Tokenijk
Token ID45961
Feature activation+1.76e+00

Pos logit contributions
\">+0.681
 <++0.675
agine+0.648
cin+0.638
andre+0.632
Neg logit contributions
istered-0.742
uscript-0.687
 incumbent-0.682
sembly-0.658
 Imran-0.657
Token a
Token ID257
Feature activation+1.61e+00

Pos logit contributions
\">+1.570
 <++1.554
agine+1.502
cin+1.495
andre+1.485
Neg logit contributions
istered-1.072
 incumbent-0.965
uscript-0.962
 Imran-0.926
sembly-0.925
Tokenan
Token ID272
Feature activation+1.67e+00

Pos logit contributions
 <++1.244
\">+1.242
agine+1.196
andre+1.171
cin+1.168
Neg logit contributions
istered-1.165
 incumbent-1.062
uscript-1.058
 Imran-1.030
sembly-1.013
Token Z
Token ID1168
Feature activation+1.65e+00

Pos logit contributions
\">+1.518
 <++1.512
agine+1.463
cin+1.450
andre+1.448
Neg logit contributions
istered-0.967
 incumbent-0.879
uscript-0.866
 Imran-0.842
sembly-0.819
Token used
Token ID973
Feature activation+2.23e-01

Pos logit contributions
\">+0.156
 <++0.155
agine+0.150
cin+0.149
andre+0.146
Neg logit contributions
istered-0.134
 incumbent-0.123
uscript-0.122
 Imran-0.118
sembly-0.117
Token this
Token ID428
Feature activation+8.08e-01

Pos logit contributions
\">+0.182
 <++0.180
agine+0.174
cin+0.173
andre+0.170
Neg logit contributions
istered-0.152
 incumbent-0.139
uscript-0.138
 Imran-0.134
sembly-0.134
Token golden
Token ID10861
Feature activation+6.70e-01

Pos logit contributions
\">+0.565
 <++0.554
agine+0.537
cin+0.531
andre+0.521
Neg logit contributions
istered-0.648
 incumbent-0.607
uscript-0.595
 Imran-0.584
sembly-0.579
Token opportunity
Token ID3663
Feature activation+6.18e-01

Pos logit contributions
\">+0.461
 <++0.456
agine+0.437
cin+0.432
andre+0.427
Neg logit contributions
istered-0.545
 incumbent-0.506
uscript-0.501
 Imran-0.488
sembly-0.483
Token not
Token ID407
Feature activation-1.00e-03

Pos logit contributions
\">+0.493
 <++0.487
agine+0.472
cin+0.468
andre+0.460
Neg logit contributions
istered-0.433
 incumbent-0.398
uscript-0.394
 Imran-0.382
sembly-0.381
Token only
Token ID691
Feature activation+5.21e+00

Pos logit contributions
istered+0.001
 incumbent+0.001
uscript+0.001
 Imran+0.001
sembly+0.001
Neg logit contributions
\">-0.001
 <+-0.001
agine-0.001
cin-0.001
andre-0.001
Token to
Token ID284
Feature activation+2.84e+00

Pos logit contributions
\">+3.780
 <++3.778
cin+3.604
agine+3.578
andre+3.545
Neg logit contributions
istered-3.934
uscript-3.617
sembly-3.563
 incumbent-3.548
 Imran-3.443
Token continue
Token ID2555
Feature activation+1.45e+00

Pos logit contributions
\">+2.103
 <++2.091
agine+2.006
cin+1.998
andre+1.962
Neg logit contributions
istered-2.139
 incumbent-1.959
uscript-1.957
sembly-1.912
 Imran-1.890
Token its
Token ID663
Feature activation-3.53e-01

Pos logit contributions
\">+1.168
 <++1.161
agine+1.119
cin+1.115
andre+1.097
Neg logit contributions
istered-0.996
 incumbent-0.907
uscript-0.903
sembly-0.878
 Imran-0.871
Token Juda
Token ID24771
Feature activation+1.70e-02

Pos logit contributions
istered+0.243
 incumbent+0.226
uscript+0.220
 Imran+0.216
sembly+0.213
Neg logit contributions
\">-0.285
 <+-0.282
agine-0.275
cin-0.271
andre-0.267
Tokenisation
Token ID5612
Feature activation+9.74e-01

Pos logit contributions
\">+0.014
 <++0.014
agine+0.013
cin+0.013
andre+0.013
Neg logit contributions
istered-0.011
 incumbent-0.010
uscript-0.010
sembly-0.010
 Imran-0.010
Token immediately
Token ID3393
Feature activation+4.47e-01

Pos logit contributions
istered+1.273
 incumbent+1.175
uscript+1.150
 Imran+1.119
sembly+1.107
Neg logit contributions
\">-1.503
 <+-1.485
agine-1.446
cin-1.437
andre-1.413
Token with
Token ID351
Feature activation+3.09e-01

Pos logit contributions
\">+0.348
 <++0.346
agine+0.333
cin+0.332
andre+0.326
Neg logit contributions
istered-0.321
 incumbent-0.294
uscript-0.292
sembly-0.284
 Imran-0.282
Token mechanisms
Token ID11701
Feature activation+9.23e-01

Pos logit contributions
\">+0.258
 <++0.256
agine+0.248
cin+0.247
andre+0.243
Neg logit contributions
istered-0.204
 incumbent-0.187
uscript-0.185
sembly-0.179
 Imran-0.178
Token to
Token ID284
Feature activation-2.49e-01

Pos logit contributions
\">+0.714
 <++0.709
agine+0.684
cin+0.680
andre+0.669
Neg logit contributions
istered-0.666
 incumbent-0.612
uscript-0.607
sembly-0.591
 Imran-0.588
Token not
Token ID407
Feature activation-1.49e+00

Pos logit contributions
istered+0.194
 incumbent+0.181
uscript+0.178
 Imran+0.173
sembly+0.172
Neg logit contributions
\">-0.177
 <+-0.177
agine-0.170
cin-0.169
andre-0.167
Token only
Token ID691
Feature activation+5.19e+00

Pos logit contributions
istered+1.034
 incumbent+0.951
uscript+0.944
sembly+0.905
 Imran+0.904
Neg logit contributions
\">-1.184
 <+-1.174
agine-1.129
cin-1.119
andre-1.106
Token derive
Token ID27099
Feature activation+1.55e+00

Pos logit contributions
\">+3.784
 <++3.723
cin+3.541
agine+3.537
andre+3.454
Neg logit contributions
istered-3.971
uscript-3.634
sembly-3.613
 incumbent-3.592
 Imran-3.518
Token the
Token ID262
Feature activation+6.64e-01

Pos logit contributions
\">+1.258
 <++1.250
agine+1.204
cin+1.200
andre+1.182
Neg logit contributions
istered-1.062
 incumbent-0.965
uscript-0.962
sembly-0.937
 Imran-0.928
Token necessary
Token ID3306
Feature activation+2.04e+00

Pos logit contributions
\">+0.529
 <++0.526
agine+0.507
cin+0.504
andre+0.496
Neg logit contributions
istered-0.464
 incumbent-0.427
uscript-0.421
sembly-0.409
 Imran-0.408
Token information
Token ID1321
Feature activation-6.09e-01

Pos logit contributions
\">+1.629
 <++1.619
agine+1.561
cin+1.554
andre+1.529
Neg logit contributions
istered-1.423
 incumbent-1.302
uscript-1.293
sembly-1.256
 Imran-1.249
Token,
Token ID11
Feature activation-1.47e-01

Pos logit contributions
istered+0.437
 incumbent+0.406
uscript+0.400
 Imran+0.389
sembly+0.385
Neg logit contributions
\">-0.471
 <+-0.469
agine-0.453
cin-0.450
andre-0.441
Token
Token ID198
Feature activation+6.76e-01

Pos logit contributions
\">+0.703
 <++0.699
agine+0.678
cin+0.675
andre+0.666
Neg logit contributions
istered-0.437
 incumbent-0.391
uscript-0.388
sembly-0.373
 Imran-0.372
TokenJoe
Token ID19585
Feature activation+8.56e-01

Pos logit contributions
\">+0.551
 <++0.548
agine+0.532
cin+0.526
andre+0.517
Neg logit contributions
istered-0.459
 incumbent-0.418
uscript-0.417
 Imran-0.403
sembly-0.398
Token V
Token ID569
Feature activation+1.44e+00

Pos logit contributions
\">+0.686
 <++0.680
agine+0.656
cin+0.653
andre+0.643
Neg logit contributions
istered-0.596
 incumbent-0.544
uscript-0.542
sembly-0.524
 Imran-0.524
Tokenitt
Token ID715
Feature activation+9.49e-01

Pos logit contributions
\">+1.075
 <++1.070
agine+1.031
cin+1.026
andre+1.010
Neg logit contributions
istered-1.074
 incumbent-0.988
uscript-0.981
sembly-0.961
 Imran-0.951
Token Aff
Token ID6708
Feature activation+1.72e+00

Pos logit contributions
\">+0.695
 <++0.691
agine+0.664
cin+0.661
andre+0.649
Neg logit contributions
istered-0.724
 incumbent-0.667
uscript-0.663
sembly-0.648
 Imran-0.643
Tokenidav
Token ID20331
Feature activation+5.19e+00

Pos logit contributions
\">+1.530
 <++1.522
agine+1.479
cin+1.475
andre+1.453
Neg logit contributions
istered-1.031
 incumbent-0.926
uscript-0.924
sembly-0.894
 Imran-0.885
Tokenit
Token ID270
Feature activation+1.25e+00

Pos logit contributions
\">+3.761
 <++3.701
agine+3.638
cin+3.623
andre+3.581
Neg logit contributions
istered-3.905
 incumbent-3.628
uscript-3.506
sembly-3.502
 Imran-3.492
Token
Token ID198
Feature activation+9.30e-01

Pos logit contributions
\">+1.004
 <++0.998
agine+0.962
cin+0.958
andre+0.942
Neg logit contributions
istered-0.870
 incumbent-0.795
uscript-0.791
sembly-0.768
 Imran-0.763
Token
Token ID198
Feature activation+7.23e-01

Pos logit contributions
\">+0.856
 <++0.851
agine+0.825
cin+0.822
andre+0.810
Neg logit contributions
istered-0.536
 incumbent-0.479
uscript-0.476
sembly-0.459
 Imran-0.456
TokenScott
Token ID19040
Feature activation+1.43e+00

Pos logit contributions
\">+0.554
 <++0.551
agine+0.533
cin+0.527
andre+0.517
Neg logit contributions
istered-0.526
 incumbent-0.482
uscript-0.480
 Imran-0.466
sembly-0.460
Tokenie
Token ID494
Feature activation+3.64e-01

Pos logit contributions
\">+1.167
 <++1.161
agine+1.120
cin+1.116
andre+1.100
Neg logit contributions
istered-0.974
 incumbent-0.889
uscript-0.882
sembly-0.864
 Imran-0.851
Token abduct
Token ID24827
Feature activation+7.95e-01

Pos logit contributions
\">+1.014
 <++1.009
agine+0.977
cin+0.974
andre+0.961
Neg logit contributions
istered-0.595
 incumbent-0.529
uscript-0.526
sembly-0.508
 Imran-0.502
Tokening
Token ID278
Feature activation+9.37e-01

Pos logit contributions
\">+0.548
 <++0.543
agine+0.522
cin+0.519
andre+0.508
Neg logit contributions
istered-0.641
 incumbent-0.594
uscript-0.589
sembly-0.574
 Imran-0.573
Token children
Token ID1751
Feature activation+1.65e+00

Pos logit contributions
\">+0.703
 <++0.698
agine+0.671
cin+0.669
andre+0.656
Neg logit contributions
istered-0.701
 incumbent-0.646
uscript-0.639
 Imran-0.622
sembly-0.621
Token -
Token ID532
Feature activation+1.12e+00

Pos logit contributions
\">+1.239
 <++1.236
agine+1.187
cin+1.181
andre+1.161
Neg logit contributions
istered-1.229
uscript-1.125
 incumbent-1.125
sembly-1.101
 Imran-1.085
Token not
Token ID407
Feature activation+3.14e-01

Pos logit contributions
\">+0.899
 <++0.892
agine+0.859
cin+0.857
andre+0.843
Neg logit contributions
istered-0.774
 incumbent-0.708
uscript-0.701
sembly-0.679
 Imran-0.679
Token only
Token ID691
Feature activation+5.16e+00

Pos logit contributions
\">+0.251
 <++0.249
agine+0.240
cin+0.239
andre+0.236
Neg logit contributions
istered-0.220
 incumbent-0.202
uscript-0.199
 Imran-0.193
sembly-0.192
Token from
Token ID422
Feature activation+2.64e+00

Pos logit contributions
 <++3.736
\">+3.725
agine+3.551
cin+3.542
andre+3.472
Neg logit contributions
istered-3.892
uscript-3.589
sembly-3.565
 incumbent-3.529
 Imran-3.410
Token their
Token ID511
Feature activation+6.08e-01

Pos logit contributions
\">+2.050
 <++2.037
agine+1.961
cin+1.948
andre+1.924
Neg logit contributions
istered-1.878
uscript-1.718
 incumbent-1.709
sembly-1.682
 Imran-1.646
Token homes
Token ID5682
Feature activation-5.39e-01

Pos logit contributions
\">+0.497
 <++0.493
agine+0.476
cin+0.474
andre+0.466
Neg logit contributions
istered-0.414
 incumbent-0.377
uscript-0.375
sembly-0.364
 Imran-0.362
Token,
Token ID11
Feature activation-5.46e-01

Pos logit contributions
istered+0.391
 incumbent+0.359
uscript+0.357
 Imran+0.346
sembly+0.346
Neg logit contributions
\">-0.414
 <+-0.411
agine-0.396
cin-0.394
andre-0.387
Token but
Token ID475
Feature activation+5.27e-01

Pos logit contributions
istered+0.396
 incumbent+0.364
uscript+0.360
 Imran+0.352
sembly+0.347
Neg logit contributions
\">-0.422
 <+-0.416
agine-0.403
cin-0.402
andre-0.395
Token in
Token ID287
Feature activation+3.46e-01

Pos logit contributions
\">+0.380
 <++0.377
agine+0.364
cin+0.361
andre+0.355
Neg logit contributions
istered-0.341
 incumbent-0.314
uscript-0.308
 Imran-0.301
sembly-0.298
Token history
Token ID2106
Feature activation+6.86e-01

Pos logit contributions
\">+0.276
 <++0.274
agine+0.264
cin+0.263
andre+0.258
Neg logit contributions
istered-0.242
 incumbent-0.222
uscript-0.220
 Imran-0.213
sembly-0.212
Token is
Token ID318
Feature activation-4.90e-01

Pos logit contributions
\">+0.530
 <++0.524
agine+0.506
cin+0.504
andre+0.495
Neg logit contributions
istered-0.499
 incumbent-0.458
uscript-0.453
 Imran-0.441
sembly-0.440
Token alarming
Token ID24128
Feature activation-1.01e+00

Pos logit contributions
istered+0.340
 incumbent+0.321
uscript+0.308
 Imran+0.303
 Warden+0.299
Neg logit contributions
\">-0.395
 <+-0.386
cin-0.372
agine-0.372
andre-0.367
Token not
Token ID407
Feature activation-1.08e-01

Pos logit contributions
istered+0.708
 incumbent+0.658
uscript+0.641
 Imran+0.630
sembly+0.618
Neg logit contributions
\">-0.808
 <+-0.796
agine-0.772
cin-0.767
andre-0.751
Token only
Token ID691
Feature activation+5.08e+00

Pos logit contributions
istered+0.073
 incumbent+0.068
uscript+0.066
 Imran+0.066
sembly+0.064
Neg logit contributions
\">-0.089
 <+-0.088
agine-0.085
cin-0.085
andre-0.083
Token in
Token ID287
Feature activation+2.11e+00

Pos logit contributions
\">+3.477
 <++3.465
agine+3.300
cin+3.297
andre+3.259
Neg logit contributions
istered-4.045
uscript-3.746
sembly-3.687
 incumbent-3.649
 Imran-3.539
Token and
Token ID290
Feature activation-8.11e-01

Pos logit contributions
\">+1.617
 <++1.606
agine+1.546
cin+1.539
andre+1.516
Neg logit contributions
istered-1.535
 incumbent-1.404
uscript-1.400
sembly-1.365
 Imran-1.351
Token of
Token ID286
Feature activation-1.35e+00

Pos logit contributions
istered+0.581
 incumbent+0.533
uscript+0.527
sembly+0.513
 Imran+0.510
Neg logit contributions
\">-0.634
 <+-0.629
agine-0.607
cin-0.604
andre-0.593
Token itself
Token ID2346
Feature activation+3.10e-01

Pos logit contributions
istered+0.974
 incumbent+0.933
 Imran+0.895
uscript+0.889
 tether+0.867
Neg logit contributions
\">-1.031
 <+-1.023
agine-0.991
cin-0.978
andre-0.950
Token,
Token ID11
Feature activation-6.55e-01

Pos logit contributions
\">+0.236
 <++0.234
agine+0.226
cin+0.225
andre+0.221
Neg logit contributions
istered-0.228
 incumbent-0.209
uscript-0.208
sembly-0.203
 Imran-0.202
Token biotech
Token ID47693
Feature activation+2.44e-02

Pos logit contributions
istered+0.276
 incumbent+0.258
uscript+0.252
sembly+0.244
 Imran+0.243
Neg logit contributions
\">-0.276
 <+-0.273
agine-0.264
cin-0.262
andre-0.259
Token alliance
Token ID12801
Feature activation-8.57e-01

Pos logit contributions
\">+0.021
 <++0.021
agine+0.020
cin+0.020
andre+0.020
Neg logit contributions
istered-0.016
 incumbent-0.014
uscript-0.014
 Imran-0.013
sembly-0.013
Token,
Token ID11
Feature activation+8.11e-01

Pos logit contributions
istered+0.604
 incumbent+0.567
uscript+0.547
 Imran+0.534
sembly+0.532
Neg logit contributions
\">-0.673
 <+-0.663
agine-0.640
cin-0.638
andre-0.633
Token wasted
Token ID18359
Feature activation+2.22e-01

Pos logit contributions
\">+0.667
 <++0.663
agine+0.640
cin+0.637
andre+0.628
Neg logit contributions
istered-0.546
 incumbent-0.497
uscript-0.494
sembly-0.480
 Imran-0.477
Token no
Token ID645
Feature activation+1.01e+00

Pos logit contributions
\">+0.189
 <++0.188
agine+0.182
cin+0.181
andre+0.178
Neg logit contributions
istered-0.143
 incumbent-0.129
uscript-0.128
sembly-0.125
 Imran-0.123
Token time
Token ID640
Feature activation+5.07e+00

Pos logit contributions
\">+0.747
 <++0.740
cin+0.710
agine+0.710
andre+0.699
Neg logit contributions
istered-0.764
 incumbent-0.708
uscript-0.698
 Imran-0.679
sembly-0.678
Token in
Token ID287
Feature activation+6.38e-01

Pos logit contributions
 <++3.591
\">+3.561
cin+3.412
agine+3.359
 Getty+3.273
Neg logit contributions
istered-3.951
uscript-3.615
sembly-3.606
 incumbent-3.502
 Imran-3.428
Token expressing
Token ID16621
Feature activation+1.06e+00

Pos logit contributions
\">+0.505
 <++0.501
agine+0.484
cin+0.482
andre+0.475
Neg logit contributions
istered-0.449
 incumbent-0.413
uscript-0.409
sembly-0.397
 Imran-0.396
Token its
Token ID663
Feature activation-1.06e+00

Pos logit contributions
\">+0.861
 <++0.855
agine+0.826
cin+0.822
andre+0.810
Neg logit contributions
istered-0.720
 incumbent-0.661
uscript-0.652
sembly-0.632
 Imran-0.632
Token delight
Token ID10974
Feature activation+5.24e-01

Pos logit contributions
istered+0.681
 incumbent+0.632
uscript+0.603
 Imran+0.596
 Warden+0.584
Neg logit contributions
\">-0.900
 <+-0.892
agine-0.869
cin-0.863
andre-0.850
Tokenafter
Token ID8499
Feature activation+2.18e+00

Pos logit contributions
\">+0.405
 <++0.401
agine+0.388
cin+0.385
andre+0.379
Neg logit contributions
istered-0.379
 incumbent-0.350
uscript-0.345
 Imran-0.336
sembly-0.334
Token GDP
Token ID12396
Feature activation+7.88e-02

Pos logit contributions
istered+0.131
 incumbent+0.119
uscript+0.118
sembly+0.114
 Imran+0.113
Neg logit contributions
\">-0.190
 <+-0.189
agine-0.183
cin-0.182
andre-0.180
Token have
Token ID423
Feature activation+3.02e-02

Pos logit contributions
\">+0.062
 <++0.061
agine+0.059
cin+0.059
andre+0.058
Neg logit contributions
istered-0.056
 incumbent-0.051
uscript-0.051
sembly-0.049
 Imran-0.049
Token already
Token ID1541
Feature activation-3.93e-01

Pos logit contributions
\">+0.026
 <++0.025
agine+0.025
cin+0.025
andre+0.024
Neg logit contributions
istered-0.020
 incumbent-0.018
uscript-0.018
 Imran-0.017
sembly-0.017
Token changed
Token ID3421
Feature activation+4.22e-01

Pos logit contributions
istered+0.251
 incumbent+0.229
uscript+0.225
 Imran+0.217
sembly+0.217
Neg logit contributions
\">-0.338
 <+-0.335
agine-0.325
cin-0.323
andre-0.318
Token not
Token ID407
Feature activation-4.74e-01

Pos logit contributions
\">+0.299
 <++0.295
agine+0.284
cin+0.282
andre+0.277
Neg logit contributions
istered-0.332
 incumbent-0.309
uscript-0.305
sembly-0.298
 Imran-0.297
Token only
Token ID691
Feature activation+5.04e+00

Pos logit contributions
istered+0.330
 incumbent+0.304
uscript+0.301
sembly+0.291
 Imran+0.290
Neg logit contributions
\">-0.380
 <+-0.378
agine-0.362
cin-0.361
andre-0.355
Token their
Token ID511
Feature activation+2.43e-01

Pos logit contributions
 <++3.493
\">+3.492
agine+3.334
cin+3.331
andre+3.255
Neg logit contributions
istered-3.994
uscript-3.653
 incumbent-3.587
sembly-3.563
 Imran-3.508
Token legal
Token ID2742
Feature activation+1.44e-01

Pos logit contributions
\">+0.191
 <++0.189
agine+0.182
cin+0.181
andre+0.179
Neg logit contributions
istered-0.173
 incumbent-0.160
uscript-0.157
sembly-0.153
 Imran-0.153
Token headquarters
Token ID10043
Feature activation+1.85e-01

Pos logit contributions
\">+0.112
 <++0.111
agine+0.107
cin+0.106
andre+0.105
Neg logit contributions
istered-0.103
 incumbent-0.095
uscript-0.094
sembly-0.091
 Imran-0.091
Token but
Token ID475
Feature activation-1.07e+00

Pos logit contributions
\">+0.140
 <++0.138
agine+0.133
cin+0.132
andre+0.130
Neg logit contributions
istered-0.137
 incumbent-0.127
uscript-0.126
sembly-0.123
 Imran-0.122
Token,
Token ID11
Feature activation-8.44e-01

Pos logit contributions
istered+0.772
 incumbent+0.722
uscript+0.702
 Imran+0.685
sembly+0.684
Neg logit contributions
\">-0.827
 <+-0.816
agine-0.790
cin-0.789
andre-0.772
Token
Token ID198
Feature activation-6.02e-02

Pos logit contributions
\">+0.017
 <++0.017
agine+0.016
cin+0.016
andre+0.016
Neg logit contributions
istered-0.010
 incumbent-0.009
uscript-0.009
 Imran-0.008
 Warden-0.008
Token12
Token ID1065
Feature activation+3.69e-01

Pos logit contributions
istered+0.049
 incumbent+0.045
uscript+0.044
 Imran+0.043
sembly+0.043
Neg logit contributions
\">-0.042
 <+-0.041
agine-0.040
cin-0.039
andre-0.038
Token,
Token ID11
Feature activation-7.18e-01

Pos logit contributions
\">+0.281
 <++0.278
agine+0.268
cin+0.267
andre+0.262
Neg logit contributions
istered-0.271
 incumbent-0.250
uscript-0.248
sembly-0.241
 Imran-0.240
Token000
Token ID830
Feature activation+4.58e-01

Pos logit contributions
istered+0.639
 incumbent+0.595
uscript+0.591
sembly+0.578
 Imran+0.578
Neg logit contributions
\">-0.434
 <+-0.430
agine-0.411
cin-0.409
andre-0.400
Token App
Token ID2034
Feature activation+1.38e-02

Pos logit contributions
\">+0.375
 <++0.373
agine+0.360
cin+0.358
andre+0.353
Neg logit contributions
istered-0.310
 incumbent-0.282
uscript-0.281
sembly-0.273
 Imran-0.271
Tokenrox
Token ID13907
Feature activation+4.99e+00

Pos logit contributions
\">+0.010
 <++0.010
agine+0.009
cin+0.009
andre+0.009
Neg logit contributions
istered-0.011
 incumbent-0.010
uscript-0.010
sembly-0.010
 Imran-0.010
Tokenimate
Token ID1920
Feature activation+1.96e+00

Pos logit contributions
\">+3.256
 <++3.250
cin+3.148
agine+3.141
andre+3.073
Neg logit contributions
istered-4.035
uscript-3.788
 incumbent-3.781
sembly-3.687
 Imran-3.649
Token number
Token ID1271
Feature activation-1.38e+00

Pos logit contributions
\">+1.507
 <++1.498
agine+1.442
cin+1.435
andre+1.410
Neg logit contributions
istered-1.417
 incumbent-1.295
uscript-1.292
sembly-1.260
 Imran-1.249
Token of
Token ID286
Feature activation+7.42e-01

Pos logit contributions
istered+0.995
 incumbent+0.913
uscript+0.906
sembly+0.882
 Imran+0.877
Neg logit contributions
\">-1.065
 <+-1.055
agine-1.016
cin-1.012
andre-0.997
Token adult
Token ID4044
Feature activation+7.96e-02

Pos logit contributions
\">+0.605
 <++0.601
agine+0.580
cin+0.578
andre+0.568
Neg logit contributions
istered-0.505
 incumbent-0.460
uscript-0.457
sembly-0.444
 Imran-0.441
Token Buddh
Token ID20921
Feature activation+4.14e-01

Pos logit contributions
\">+0.063
 <++0.062
agine+0.060
cin+0.060
andre+0.059
Neg logit contributions
istered-0.057
 incumbent-0.052
uscript-0.051
 Imran-0.050
sembly-0.050
Token Vera
Token ID44413
Feature activation-1.16e-01

Pos logit contributions
\">+2.327
 <++2.304
agine+2.234
cin+2.222
andre+2.182
Neg logit contributions
istered-1.972
uscript-1.787
 incumbent-1.766
sembly-1.751
 Imran-1.706
Token Rubin
Token ID34599
Feature activation-1.36e+00

Pos logit contributions
istered+0.080
 incumbent+0.073
uscript+0.072
 Imran+0.070
sembly+0.070
Neg logit contributions
\">-0.094
 <+-0.093
cin-0.090
agine-0.090
andre-0.088
Token changed
Token ID3421
Feature activation+2.01e+00

Pos logit contributions
istered+0.949
uscript+0.867
 incumbent+0.862
 Imran+0.828
sembly+0.825
Neg logit contributions
\">-1.085
 <+-1.081
agine-1.039
cin-1.029
andre-1.015
Token science
Token ID3783
Feature activation+9.14e-02

Pos logit contributions
\">+1.611
 <++1.598
agine+1.545
cin+1.532
andre+1.508
Neg logit contributions
istered-1.392
uscript-1.265
 incumbent-1.260
sembly-1.238
 Imran-1.211
Token
Token ID198
Feature activation+5.36e+00

Pos logit contributions
\">+0.074
 <++0.074
agine+0.071
cin+0.071
andre+0.070
Neg logit contributions
istered-0.062
 incumbent-0.057
uscript-0.057
sembly-0.055
 Imran-0.055
Token
Token ID198
Feature activation+4.98e+00

Pos logit contributions
\">+4.474
 <++4.441
cin+4.213
agine+4.173
andre+4.151
Neg logit contributions
istered-3.571
sembly-3.495
uscript-3.269
 incumbent-3.199
 incorpor-3.158
TokenDear
Token ID20266
Feature activation+2.48e+00

Pos logit contributions
\">+3.510
 <++3.430
cin+3.305
agine+3.275
andre+3.255
Neg logit contributions
istered-3.913
sembly-3.673
 incumbent-3.642
uscript-3.614
 Imran-3.487
Token Science
Token ID5800
Feature activation+1.59e-01

Pos logit contributions
\">+2.029
 <++2.028
agine+1.970
cin+1.957
andre+1.929
Neg logit contributions
istered-1.662
uscript-1.494
 incumbent-1.492
sembly-1.470
 Imran-1.436
Token:
Token ID25
Feature activation-6.63e-02

Pos logit contributions
\">+0.124
 <++0.123
agine+0.119
cin+0.118
andre+0.116
Neg logit contributions
istered-0.114
 incumbent-0.105
uscript-0.104
sembly-0.101
 Imran-0.101
Token Could
Token ID10347
Feature activation+1.08e+00

Pos logit contributions
istered+0.047
 incumbent+0.043
uscript+0.043
 Imran+0.042
sembly+0.041
Neg logit contributions
\">-0.052
 <+-0.052
agine-0.050
cin-0.050
andre-0.049
Token my
Token ID616
Feature activation+9.22e-01

Pos logit contributions
\">+0.879
 <++0.872
agine+0.842
cin+0.838
andre+0.825
Neg logit contributions
istered-0.741
 incumbent-0.675
uscript-0.672
sembly-0.653
 Imran-0.647
Token the
Token ID262
Feature activation+1.41e-01

Pos logit contributions
istered+0.324
 incumbent+0.295
uscript+0.292
 Imran+0.283
sembly+0.283
Neg logit contributions
\">-0.404
 <+-0.401
agine-0.387
cin-0.386
andre-0.380
Token architecture
Token ID10959
Feature activation-9.25e-01

Pos logit contributions
\">+0.119
 <++0.118
agine+0.114
cin+0.113
andre+0.112
Neg logit contributions
istered-0.093
 incumbent-0.084
uscript-0.083
sembly-0.081
 Imran-0.081
Token and
Token ID290
Feature activation+6.94e-01

Pos logit contributions
istered+0.642
 incumbent+0.591
uscript+0.580
 Imran+0.564
sembly+0.556
Neg logit contributions
\">-0.746
 <+-0.734
agine-0.710
cin-0.706
andre-0.695
Token nature
Token ID3450
Feature activation+1.91e-02

Pos logit contributions
\">+0.570
 <++0.566
agine+0.546
cin+0.545
andre+0.536
Neg logit contributions
istered-0.469
 incumbent-0.427
uscript-0.424
sembly-0.411
 Imran-0.409
Token me
Token ID502
Feature activation+1.59e-01

Pos logit contributions
\">+0.015
 <++0.015
agine+0.015
cin+0.015
andre+0.014
Neg logit contributions
istered-0.013
 incumbent-0.012
uscript-0.012
 Imran-0.012
sembly-0.012
Tokenld
Token ID335
Feature activation+4.96e+00

Pos logit contributions
\">+0.104
 <++0.103
cin+0.098
agine+0.098
andre+0.095
Neg logit contributions
istered-0.134
uscript-0.124
 incumbent-0.123
sembly-0.121
 Imran-0.120
Token into
Token ID656
Feature activation+1.79e+00

Pos logit contributions
\">+3.808
 <++3.800
agine+3.671
cin+3.636
andre+3.616
Neg logit contributions
istered-3.525
uscript-3.249
 incumbent-3.244
sembly-3.165
 Imran-3.130
Token one
Token ID530
Feature activation+6.47e-01

Pos logit contributions
\">+1.439
 <++1.431
agine+1.380
cin+1.370
andre+1.349
Neg logit contributions
istered-1.239
 incumbent-1.130
uscript-1.128
sembly-1.096
 Imran-1.086
Token ($
Token ID7198
Feature activation+6.37e-01

Pos logit contributions
\">+0.532
 <++0.528
agine+0.510
cin+0.508
andre+0.500
Neg logit contributions
istered-0.436
 incumbent-0.397
uscript-0.394
sembly-0.383
 Imran-0.380
Token80
Token ID1795
Feature activation-5.47e-01

Pos logit contributions
 <++0.373
\">+0.372
agine+0.355
cin+0.353
andre+0.345
Neg logit contributions
istered-0.575
 incumbent-0.535
uscript-0.532
 Imran-0.521
sembly-0.515
Token,
Token ID11
Feature activation+1.94e-01

Pos logit contributions
istered+0.403
 incumbent+0.374
uscript+0.369
 Imran+0.358
sembly+0.355
Neg logit contributions
\">-0.413
 <+-0.408
agine-0.393
cin-0.392
andre-0.387
Token a
Token ID257
Feature activation-1.71e-01

Pos logit contributions
\">+0.527
 <++0.523
agine+0.503
cin+0.502
andre+0.493
Neg logit contributions
istered-0.454
 incumbent-0.416
uscript-0.411
 Imran-0.398
sembly-0.398
Token start
Token ID923
Feature activation-7.70e-02

Pos logit contributions
istered+0.118
 incumbent+0.108
uscript+0.107
sembly+0.104
 Imran+0.103
Neg logit contributions
\">-0.138
 <+-0.136
agine-0.132
cin-0.131
andre-0.129
Token.
Token ID13
Feature activation+7.21e-01

Pos logit contributions
istered+0.055
 incumbent+0.050
uscript+0.050
sembly+0.049
 Imran+0.048
Neg logit contributions
\">-0.060
 <+-0.060
agine-0.058
cin-0.057
andre-0.056
Token<|endoftext|>
Token ID50256
Feature activation-1.27e-01

Pos logit contributions
\">+0.625
 <++0.621
agine+0.601
cin+0.599
andre+0.589
Neg logit contributions
istered-0.452
 incumbent-0.409
uscript-0.406
sembly-0.393
 Imran-0.392
TokenAbstract
Token ID23839
Feature activation-4.49e+00

Pos logit contributions
istered+0.099
 incumbent+0.091
uscript+0.090
 Imran+0.088
sembly+0.087
Neg logit contributions
\">-0.091
 <+-0.091
agine-0.087
cin-0.086
andre-0.085
Token
Token ID198
Feature activation-6.10e+00

Pos logit contributions
istered+2.616
 incumbent+2.423
uscript+2.388
 Imran+2.330
 Warden+2.295
Neg logit contributions
\">-3.979
 <+-3.968
agine-3.851
cin-3.829
andre-3.794
Token
Token ID198
Feature activation-5.74e+00

Pos logit contributions
istered+2.530
 incumbent+2.291
 Imran+2.267
 Warden+2.093
uscript+2.092
Neg logit contributions
agine-6.158
 <+-6.137
\">-6.117
cin-6.017
andre-5.970
TokenBackground
Token ID21756
Feature activation-4.27e+00

Pos logit contributions
istered+4.047
 incumbent+3.692
uscript+3.689
 Imran+3.561
iman+3.506
Neg logit contributions
 <+-4.368
\">-4.249
agine-4.235
cin-4.077
andre-4.029
Token Depression
Token ID22483
Feature activation-1.51e+00

Pos logit contributions
istered+2.813
 incumbent+2.634
uscript+2.567
 Imran+2.550
 Warden+2.512
Neg logit contributions
\">-3.436
 <+-3.435
agine-3.333
andre-3.293
cin-3.285
Token presents
Token ID10969
Feature activation-7.63e-01

Pos logit contributions
istered+1.057
 incumbent+0.977
uscript+0.958
 Imran+0.936
 Warden+0.918
Neg logit contributions
 <+-1.190
\">-1.185
agine-1.148
cin-1.135
andre-1.127
Token a
Token ID257
Feature activation-1.76e-01

Pos logit contributions
istered+0.500
 incumbent+0.464
uscript+0.450
 Imran+0.439
sembly+0.431
Neg logit contributions
\">-0.638
 <+-0.633
agine-0.615
cin-0.611
andre-0.604
Token
Token ID198
Feature activation+7.25e-01

Pos logit contributions
\">+0.190
 <++0.189
agine+0.183
cin+0.182
andre+0.180
Neg logit contributions
istered-0.129
 incumbent-0.116
uscript-0.115
 Imran-0.111
sembly-0.111
Token
Token ID198
Feature activation+4.27e-01

Pos logit contributions
\">+0.671
 <++0.667
agine+0.648
cin+0.644
andre+0.636
Neg logit contributions
istered-0.412
 incumbent-0.369
uscript-0.365
 Imran-0.351
sembly-0.349
Token[
Token ID58
Feature activation+7.49e-01

Pos logit contributions
\">+0.285
 <++0.283
agine+0.273
cin+0.269
andre+0.263
Neg logit contributions
istered-0.354
 incumbent-0.328
uscript-0.326
 Imran-0.317
sembly-0.316
TokenH
Token ID39
Feature activation-1.25e+00

Pos logit contributions
\">+0.558
 <++0.554
agine+0.533
cin+0.527
andre+0.519
Neg logit contributions
istered-0.563
 incumbent-0.518
uscript-0.515
sembly-0.500
 Imran-0.499
Token/
Token ID14
Feature activation-4.81e+00

Pos logit contributions
istered+0.885
 incumbent+0.817
uscript+0.811
 Imran+0.776
sembly+0.774
Neg logit contributions
\">-0.974
 <+-0.973
agine-0.923
cin-0.917
andre-0.911
Tokent
Token ID83
Feature activation-5.87e+00

Pos logit contributions
istered+2.449
 incumbent+2.132
uscript+2.053
 Imran+2.047
 Warden+1.982
Neg logit contributions
\">-4.672
 <+-4.632
agine-4.555
cin-4.463
bleacher-4.409
Token The
Token ID383
Feature activation-9.94e-01

Pos logit contributions
istered+4.120
 incumbent+3.865
uscript+3.769
 Imran+3.725
 Warden+3.700
Neg logit contributions
\">-4.455
 <+-4.392
agine-4.293
cin-4.270
andre-4.155
Token Hill
Token ID3327
Feature activation+1.22e+00

Pos logit contributions
istered+0.745
 incumbent+0.690
uscript+0.680
 Imran+0.665
sembly+0.659
Neg logit contributions
\">-0.738
 <+-0.733
agine-0.705
cin-0.701
andre-0.687
Token]
Token ID60
Feature activation-5.45e-01

Pos logit contributions
\">+0.661
 <++0.656
agine+0.619
cin+0.616
andre+0.600
Neg logit contributions
istered-1.166
 incumbent-1.089
uscript-1.086
sembly-1.066
 Imran-1.059
Token<|endoftext|>
Token ID50256
Feature activation-2.24e-01

Pos logit contributions
istered+0.415
 incumbent+0.382
uscript+0.381
sembly+0.371
 Imran+0.368
Neg logit contributions
\">-0.399
 <+-0.397
agine-0.381
cin-0.379
andre-0.373
TokenL
Token ID43
Feature activation-1.33e+00

Pos logit contributions
istered+0.174
 incumbent+0.160
uscript+0.159
 Imran+0.154
sembly+0.154
Neg logit contributions
\">-0.160
 <+-0.160
agine-0.154
cin-0.152
andre-0.149
Token start
Token ID923
Feature activation-7.70e-02

Pos logit contributions
istered+0.118
 incumbent+0.108
uscript+0.107
sembly+0.104
 Imran+0.103
Neg logit contributions
\">-0.138
 <+-0.136
agine-0.132
cin-0.131
andre-0.129
Token.
Token ID13
Feature activation+7.21e-01

Pos logit contributions
istered+0.055
 incumbent+0.050
uscript+0.050
sembly+0.049
 Imran+0.048
Neg logit contributions
\">-0.060
 <+-0.060
agine-0.058
cin-0.057
andre-0.056
Token<|endoftext|>
Token ID50256
Feature activation-1.27e-01

Pos logit contributions
\">+0.625
 <++0.621
agine+0.601
cin+0.599
andre+0.589
Neg logit contributions
istered-0.452
 incumbent-0.409
uscript-0.406
sembly-0.393
 Imran-0.392
TokenAbstract
Token ID23839
Feature activation-4.49e+00

Pos logit contributions
istered+0.099
 incumbent+0.091
uscript+0.090
 Imran+0.088
sembly+0.087
Neg logit contributions
\">-0.091
 <+-0.091
agine-0.087
cin-0.086
andre-0.085
Token
Token ID198
Feature activation-6.10e+00

Pos logit contributions
istered+2.616
 incumbent+2.423
uscript+2.388
 Imran+2.330
 Warden+2.295
Neg logit contributions
\">-3.979
 <+-3.968
agine-3.851
cin-3.829
andre-3.794
Token
Token ID198
Feature activation-5.74e+00

Pos logit contributions
istered+2.530
 incumbent+2.291
 Imran+2.267
 Warden+2.093
uscript+2.092
Neg logit contributions
agine-6.158
 <+-6.137
\">-6.117
cin-6.017
andre-5.970
TokenBackground
Token ID21756
Feature activation-4.27e+00

Pos logit contributions
istered+4.047
 incumbent+3.692
uscript+3.689
 Imran+3.561
iman+3.506
Neg logit contributions
 <+-4.368
\">-4.249
agine-4.235
cin-4.077
andre-4.029
Token Depression
Token ID22483
Feature activation-1.51e+00

Pos logit contributions
istered+2.813
 incumbent+2.634
uscript+2.567
 Imran+2.550
 Warden+2.512
Neg logit contributions
\">-3.436
 <+-3.435
agine-3.333
andre-3.293
cin-3.285
Token presents
Token ID10969
Feature activation-7.63e-01

Pos logit contributions
istered+1.057
 incumbent+0.977
uscript+0.958
 Imran+0.936
 Warden+0.918
Neg logit contributions
 <+-1.190
\">-1.185
agine-1.148
cin-1.135
andre-1.127
Token a
Token ID257
Feature activation-1.76e-01

Pos logit contributions
istered+0.500
 incumbent+0.464
uscript+0.450
 Imran+0.439
sembly+0.431
Neg logit contributions
\">-0.638
 <+-0.633
agine-0.615
cin-0.611
andre-0.604
Token significant
Token ID2383
Feature activation+4.14e-01

Pos logit contributions
istered+0.117
 incumbent+0.108
uscript+0.106
 Imran+0.102
sembly+0.101
Neg logit contributions
\">-0.145
 <+-0.144
agine-0.140
cin-0.139
andre-0.137
Token a
Token ID257
Feature activation+5.82e-01

Pos logit contributions
\">+0.222
 <++0.220
agine+0.212
cin+0.211
andre+0.208
Neg logit contributions
istered-0.204
 incumbent-0.187
uscript-0.186
sembly-0.181
 Imran-0.180
Token head
Token ID1182
Feature activation+1.73e+00

Pos logit contributions
\">+0.454
 <++0.451
agine+0.434
cin+0.432
andre+0.425
Neg logit contributions
istered-0.417
 incumbent-0.382
uscript-0.380
sembly-0.370
 Imran-0.367
Token)
Token ID8
Feature activation+6.20e-01

Pos logit contributions
\">+1.280
 <++1.279
agine+1.232
cin+1.228
andre+1.205
Neg logit contributions
istered-1.292
uscript-1.179
 incumbent-1.177
sembly-1.155
 Imran-1.146
Token and
Token ID290
Feature activation-2.55e-01

Pos logit contributions
\">+0.477
 <++0.473
agine+0.455
cin+0.453
andre+0.446
Neg logit contributions
istered-0.451
 incumbent-0.413
uscript-0.412
sembly-0.401
 Imran-0.398
Token fl
Token ID781
Feature activation-6.85e-01

Pos logit contributions
istered+0.176
 incumbent+0.161
uscript+0.160
sembly+0.156
 Imran+0.154
Neg logit contributions
\">-0.205
 <+-0.204
agine-0.197
cin-0.196
andre-0.193
Tokenot
Token ID313
Feature activation-5.53e+00

Pos logit contributions
istered+0.468
uscript+0.420
 incumbent+0.420
sembly+0.404
 Imran+0.403
Neg logit contributions
\">-0.562
 <+-0.558
agine-0.536
cin-0.534
andre-0.525
Tokenilla
Token ID5049
Feature activation+6.86e-01

Pos logit contributions
istered+5.293
uscript+4.803
 incumbent+4.791
iman+4.732
sembly+4.686
Neg logit contributions
 <+-2.961
\">-2.904
agine-2.719
andre-2.636
cin-2.630
Token (
Token ID357
Feature activation+8.03e-02

Pos logit contributions
\">+0.578
 <++0.575
agine+0.555
cin+0.553
andre+0.544
Neg logit contributions
istered-0.447
 incumbent-0.405
uscript-0.404
sembly-0.393
 Imran-0.389
Tokenfour
Token ID14337
Feature activation-4.62e-02

Pos logit contributions
\">+0.053
 <++0.053
agine+0.050
cin+0.050
andre+0.049
Neg logit contributions
istered-0.067
 incumbent-0.062
uscript-0.062
sembly-0.060
 Imran-0.060
Token people
Token ID661
Feature activation+6.82e-01

Pos logit contributions
istered+0.028
 incumbent+0.026
uscript+0.025
sembly+0.025
 Imran+0.024
Neg logit contributions
\">-0.041
 <+-0.041
agine-0.039
cin-0.039
andre-0.039
Token on
Token ID319
Feature activation+4.35e-01

Pos logit contributions
\">+0.545
 <++0.542
agine+0.523
cin+0.520
andre+0.511
Neg logit contributions
istered-0.474
 incumbent-0.433
uscript-0.431
sembly-0.421
 Imran-0.416
TokenRecommend
Token ID41248
Feature activation-2.15e+00

Pos logit contributions
istered+1.237
 incumbent+1.142
uscript+1.116
 Imran+1.096
 Warden+1.073
Neg logit contributions
\">-1.503
 <+-1.501
agine-1.461
cin-1.453
andre-1.417
Token Share
Token ID8734
Feature activation-3.65e+00

Pos logit contributions
istered+1.165
 incumbent+1.071
uscript+1.020
 Imran+1.013
 Warden+1.001
Neg logit contributions
\">-2.008
 <+-1.990
cin-1.933
agine-1.926
bleacher-1.908
Token this
Token ID428
Feature activation-2.12e+00

Pos logit contributions
istered+2.221
 incumbent+2.085
 Imran+1.985
uscript+1.938
 Warden+1.924
Neg logit contributions
\">-3.138
 <+-3.121
agine-3.050
cin-3.025
andre-3.019
Token comment
Token ID2912
Feature activation-4.28e+00

Pos logit contributions
istered+1.915
 incumbent+1.830
uscript+1.779
 Imran+1.758
 Warden+1.737
Neg logit contributions
\">-1.225
 <+-1.204
agine-1.162
cin-1.135
andre-1.120
Token on
Token ID319
Feature activation-2.85e+00

Pos logit contributions
istered+2.385
 incumbent+2.173
 Imran+2.099
uscript+2.084
 Warden+2.055
Neg logit contributions
\">-3.849
 <+-3.821
agine-3.803
cin-3.762
andre-3.695
Token Facebook
Token ID3203
Feature activation-5.52e+00

Pos logit contributions
istered+1.644
 incumbent+1.519
 Warden+1.470
uscript+1.456
 Imran+1.455
Neg logit contributions
\">-2.543
 <+-2.494
agine-2.483
cin-2.451
andre-2.432
Token Share
Token ID8734
Feature activation-2.97e+00

Pos logit contributions
istered+2.327
 incumbent+2.226
 Warden+2.205
 Needs+2.151
 Imran+2.138
Neg logit contributions
\">-5.389
bleacher-5.362
agine-5.305
 <+-5.302
cin-5.301
Token this
Token ID428
Feature activation-1.83e+00

Pos logit contributions
istered+1.874
 incumbent+1.728
uscript+1.654
 Imran+1.649
 Warden+1.617
Neg logit contributions
\">-2.510
 <+-2.498
agine-2.430
cin-2.408
andre-2.391
Token comment
Token ID2912
Feature activation-2.73e+00

Pos logit contributions
istered+1.663
 incumbent+1.583
uscript+1.550
 Imran+1.525
 Warden+1.506
Neg logit contributions
\">-1.043
 <+-1.027
agine-0.986
cin-0.964
andre-0.953
Token on
Token ID319
Feature activation-2.05e+00

Pos logit contributions
istered+1.616
 incumbent+1.476
uscript+1.436
 Imran+1.413
 Warden+1.384
Neg logit contributions
\">-2.407
 <+-2.391
agine-2.348
cin-2.332
andre-2.292
Token Twitter
Token ID3009
Feature activation-2.58e+00

Pos logit contributions
istered+1.563
 incumbent+1.462
uscript+1.435
 Imran+1.427
 Warden+1.402
Neg logit contributions
\">-1.476
 <+-1.454
agine-1.428
cin-1.413
andre-1.382
Token+
Token ID10
Feature activation+7.08e-01

Pos logit contributions
\">+0.067
 <++0.067
agine+0.064
cin+0.064
andre+0.063
Neg logit contributions
istered-0.073
 incumbent-0.067
uscript-0.067
 Imran-0.065
sembly-0.064
Token1
Token ID16
Feature activation-1.93e+00

Pos logit contributions
\">+0.504
 <++0.501
agine+0.481
cin+0.477
andre+0.470
Neg logit contributions
istered-0.554
 incumbent-0.512
uscript-0.510
 Imran-0.496
sembly-0.496
Token 2
Token ID362
Feature activation-3.19e+00

Pos logit contributions
istered+1.378
 incumbent+1.263
uscript+1.254
 Imran+1.221
sembly+1.215
Neg logit contributions
\">-1.491
 <+-1.484
agine-1.433
cin-1.423
andre-1.410
Token 18
Token ID1248
Feature activation-2.93e+00

Pos logit contributions
istered+2.112
 incumbent+1.927
uscript+1.921
 Imran+1.866
sembly+1.836
Neg logit contributions
\">-2.595
 <+-2.574
agine-2.520
andre-2.486
cin-2.479
TokenK
Token ID42
Feature activation-2.53e-01

Pos logit contributions
istered+2.413
 incumbent+2.261
uscript+2.259
 Imran+2.204
 Warden+2.179
Neg logit contributions
\">-1.892
 <+-1.869
agine-1.804
cin-1.781
andre-1.751
Token Shares
Token ID23997
Feature activation-5.51e+00

Pos logit contributions
istered+0.097
 incumbent+0.082
uscript+0.081
sembly+0.077
 Imran+0.076
Neg logit contributions
\">-0.280
 <+-0.279
agine-0.272
cin-0.271
andre-0.268
Token
Token ID198
Feature activation-3.52e+00

Pos logit contributions
istered+2.748
 incumbent+2.528
uscript+2.475
 Imran+2.427
 Warden+2.381
Neg logit contributions
\">-5.299
 <+-5.203
cin-5.105
agine-5.087
andre-5.037
Token
Token ID198
Feature activation-3.46e+00

Pos logit contributions
istered+1.684
 incumbent+1.515
 Imran+1.483
uscript+1.437
 Warden+1.396
Neg logit contributions
\">-3.464
 <+-3.452
agine-3.422
cin-3.363
andre-3.317
TokenAct
Token ID6398
Feature activation-1.98e-01

Pos logit contributions
istered+2.663
 incumbent+2.423
uscript+2.412
 Imran+2.349
iman+2.288
Neg logit contributions
\">-2.459
 <+-2.458
agine-2.387
cin-2.307
andre-2.253
Tokenress
Token ID601
Feature activation+1.16e-01

Pos logit contributions
istered+0.142
 incumbent+0.130
uscript+0.129
sembly+0.125
 Imran+0.125
Neg logit contributions
\">-0.155
 <+-0.154
agine-0.148
cin-0.147
andre-0.145
Token Amy
Token ID14235
Feature activation-1.22e-01

Pos logit contributions
\">+0.100
 <++0.099
agine+0.096
cin+0.095
andre+0.094
Neg logit contributions
istered-0.074
 incumbent-0.067
uscript-0.067
sembly-0.065
 Imran-0.064
Tokenavier
Token ID19492
Feature activation-3.48e-01

Pos logit contributions
istered+1.033
uscript+0.955
 incumbent+0.949
 Imran+0.915
sembly+0.914
Neg logit contributions
\">-1.040
 <+-1.026
agine-0.984
cin-0.971
bleacher-0.949
Token Dum
Token ID30933
Feature activation+7.35e-01

Pos logit contributions
istered+0.216
 incumbent+0.195
uscript+0.194
sembly+0.188
 Imran+0.186
Neg logit contributions
\">-0.304
 <+-0.303
agine-0.293
cin-0.292
andre-0.288
Tokenus
Token ID385
Feature activation-4.73e-01

Pos logit contributions
\">+0.490
 <++0.488
agine+0.463
cin+0.461
andre+0.452
Neg logit contributions
istered-0.609
uscript-0.563
 incumbent-0.562
 Imran-0.545
sembly-0.543
Tokenque
Token ID4188
Feature activation-3.07e-01

Pos logit contributions
istered+0.169
 incumbent+0.140
uscript+0.138
sembly+0.130
 Imran+0.128
Neg logit contributions
\">-0.538
 <+-0.536
agine-0.522
cin-0.520
andre-0.515
Token et
Token ID2123
Feature activation-2.92e+00

Pos logit contributions
istered+0.219
 incumbent+0.200
uscript+0.199
sembly+0.195
 Imran+0.192
Neg logit contributions
\">-0.240
 <+-0.239
agine-0.229
cin-0.229
andre-0.226
Token al
Token ID435
Feature activation-5.43e+00

Pos logit contributions
istered+1.706
 incumbent+1.540
uscript+1.518
 Imran+1.484
 Warden+1.421
Neg logit contributions
\">-2.657
 <+-2.628
agine-2.552
cin-2.536
bleacher-2.482
Token.
Token ID13
Feature activation-2.66e+00

Pos logit contributions
istered+3.631
uscript+3.343
 incumbent+3.323
 Imran+3.171
sembly+3.116
Neg logit contributions
\">-4.401
 <+-4.326
agine-4.118
cin-4.117
bleacher-4.071
Token 2014
Token ID1946
Feature activation-8.43e-01

Pos logit contributions
istered+1.695
 incumbent+1.558
uscript+1.535
 Imran+1.504
 Warden+1.485
Neg logit contributions
\">-2.243
 <+-2.232
agine-2.163
cin-2.150
andre-2.122
Token.
Token ID13
Feature activation-1.62e+00

Pos logit contributions
istered+0.603
 incumbent+0.552
uscript+0.549
sembly+0.534
 Imran+0.531
Neg logit contributions
\">-0.658
 <+-0.654
agine-0.630
cin-0.627
andre-0.617
Token The
Token ID383
Feature activation-1.45e+00

Pos logit contributions
istered+0.944
 incumbent+0.855
uscript+0.842
 Imran+0.823
 Warden+0.813
Neg logit contributions
\">-1.451
 <+-1.446
agine-1.407
cin-1.400
andre-1.381
Token Kepler
Token ID40294
Feature activation+1.43e+00

Pos logit contributions
istered+0.957
 incumbent+0.878
uscript+0.861
 Imran+0.835
sembly+0.833
Neg logit contributions
 <+-1.202
\">-1.202
agine-1.157
cin-1.150
andre-1.135
TokenRecommend
Token ID41248
Feature activation-2.43e+00

Pos logit contributions
istered+1.447
 incumbent+1.343
uscript+1.307
 Imran+1.278
 Warden+1.256
Neg logit contributions
 <+-1.844
\">-1.841
agine-1.798
cin-1.786
andre-1.736
Token Share
Token ID8734
Feature activation-3.71e+00

Pos logit contributions
istered+1.306
 incumbent+1.197
 Imran+1.136
uscript+1.134
 Warden+1.120
Neg logit contributions
\">-2.307
 <+-2.279
cin-2.215
agine-2.195
bleacher-2.189
Token this
Token ID428
Feature activation-2.01e+00

Pos logit contributions
istered+2.259
 incumbent+2.146
 Imran+2.027
uscript+1.978
 Warden+1.953
Neg logit contributions
\">-3.189
 <+-3.162
agine-3.069
cin-3.063
andre-3.060
Token comment
Token ID2912
Feature activation-3.91e+00

Pos logit contributions
istered+1.812
 incumbent+1.728
uscript+1.674
 Imran+1.662
 Warden+1.636
Neg logit contributions
\">-1.176
 <+-1.153
agine-1.109
cin-1.085
andre-1.075
Token on
Token ID319
Feature activation-2.55e+00

Pos logit contributions
istered+2.217
 incumbent+2.017
 Imran+1.946
uscript+1.936
 Warden+1.904
Neg logit contributions
\">-3.509
 <+-3.485
agine-3.460
cin-3.424
andre-3.361
Token Facebook
Token ID3203
Feature activation-5.39e+00

Pos logit contributions
istered+1.492
 incumbent+1.378
 Warden+1.324
 Imran+1.318
uscript+1.317
Neg logit contributions
\">-2.277
 <+-2.230
agine-2.209
cin-2.188
andre-2.165
Token Share
Token ID8734
Feature activation-2.64e+00

Pos logit contributions
istered+2.244
 incumbent+2.153
 Warden+2.075
 Imran+2.033
 Needs+2.029
Neg logit contributions
\">-5.325
bleacher-5.294
cin-5.261
 <+-5.254
agine-5.219
Token this
Token ID428
Feature activation-1.53e+00

Pos logit contributions
istered+1.694
 incumbent+1.559
uscript+1.494
 Imran+1.488
 Warden+1.457
Neg logit contributions
\">-2.220
 <+-2.209
agine-2.145
cin-2.126
andre-2.112
Token comment
Token ID2912
Feature activation-2.32e+00

Pos logit contributions
istered+1.398
 incumbent+1.329
uscript+1.300
 Imran+1.280
 Warden+1.265
Neg logit contributions
\">-0.872
 <+-0.859
agine-0.822
cin-0.805
andre-0.796
Token on
Token ID319
Feature activation-1.77e+00

Pos logit contributions
istered+1.392
 incumbent+1.270
uscript+1.237
 Imran+1.218
 Warden+1.193
Neg logit contributions
\">-2.039
 <+-2.027
agine-1.986
cin-1.975
andre-1.940
Token Twitter
Token ID3009
Feature activation-2.26e+00

Pos logit contributions
istered+1.353
 incumbent+1.264
uscript+1.242
 Imran+1.236
 Warden+1.216
Neg logit contributions
\">-1.273
 <+-1.256
agine-1.229
cin-1.218
andre-1.192
TokenRecommend
Token ID41248
Feature activation-2.19e+00

Pos logit contributions
istered+1.315
 incumbent+1.214
uscript+1.188
 Imran+1.164
 Warden+1.139
Neg logit contributions
 <+-1.579
\">-1.575
agine-1.535
cin-1.524
andre-1.486
Token Share
Token ID8734
Feature activation-3.28e+00

Pos logit contributions
istered+1.184
 incumbent+1.089
uscript+1.032
 Imran+1.027
 Warden+1.016
Neg logit contributions
\">-2.049
 <+-2.033
cin-1.970
agine-1.966
bleacher-1.947
Token this
Token ID428
Feature activation-1.93e+00

Pos logit contributions
istered+2.025
 incumbent+1.894
 Imran+1.802
uscript+1.766
 Warden+1.747
Neg logit contributions
\">-2.803
 <+-2.791
agine-2.722
cin-2.698
andre-2.693
Token comment
Token ID2912
Feature activation-4.02e+00

Pos logit contributions
istered+1.747
 incumbent+1.669
uscript+1.618
 Imran+1.603
 Warden+1.581
Neg logit contributions
\">-1.119
 <+-1.099
agine-1.058
cin-1.035
andre-1.021
Token on
Token ID319
Feature activation-2.82e+00

Pos logit contributions
istered+2.258
 incumbent+2.058
 Imran+1.984
uscript+1.968
 Warden+1.940
Neg logit contributions
\">-3.616
 <+-3.593
agine-3.573
cin-3.537
andre-3.470
Token Facebook
Token ID3203
Feature activation-5.38e+00

Pos logit contributions
istered+1.623
 incumbent+1.501
 Warden+1.451
 Imran+1.437
uscript+1.434
Neg logit contributions
\">-2.521
 <+-2.472
agine-2.459
cin-2.432
andre-2.407
Token Share
Token ID8734
Feature activation-2.72e+00

Pos logit contributions
istered+2.278
 incumbent+2.174
 Warden+2.138
 Needs+2.086
 Imran+2.075
Neg logit contributions
\">-5.246
bleacher-5.239
 <+-5.187
agine-5.177
cin-5.175
Token this
Token ID428
Feature activation-1.74e+00

Pos logit contributions
istered+1.730
 incumbent+1.593
uscript+1.526
 Imran+1.522
 Warden+1.490
Neg logit contributions
\">-2.282
 <+-2.273
agine-2.211
cin-2.189
andre-2.174
Token comment
Token ID2912
Feature activation-2.65e+00

Pos logit contributions
istered+1.585
 incumbent+1.509
uscript+1.475
 Imran+1.453
 Warden+1.435
Neg logit contributions
\">-0.996
 <+-0.982
agine-0.940
cin-0.920
andre-0.909
Token on
Token ID319
Feature activation-2.02e+00

Pos logit contributions
istered+1.567
 incumbent+1.431
uscript+1.389
 Imran+1.371
 Warden+1.342
Neg logit contributions
\">-2.341
 <+-2.328
agine-2.286
cin-2.271
andre-2.230
Token Twitter
Token ID3009
Feature activation-2.55e+00

Pos logit contributions
istered+1.541
 incumbent+1.440
uscript+1.414
 Imran+1.406
 Warden+1.382
Neg logit contributions
\">-1.454
 <+-1.435
agine-1.408
cin-1.394
andre-1.362
TokenRated
Token ID15322
Feature activation-4.96e-01

Pos logit contributions
istered+1.297
 incumbent+1.178
uscript+1.176
 Imran+1.141
sembly+1.140
Neg logit contributions
\">-1.247
 <+-1.246
agine-1.204
cin-1.182
andre-1.155
Token 5
Token ID642
Feature activation-1.46e+00

Pos logit contributions
istered+0.390
 incumbent+0.362
uscript+0.358
 Imran+0.349
sembly+0.348
Neg logit contributions
\">-0.351
 <+-0.348
agine-0.335
cin-0.334
andre-0.328
Token out
Token ID503
Feature activation-2.80e+00

Pos logit contributions
istered+0.926
 incumbent+0.853
uscript+0.831
 Imran+0.819
 Warden+0.789
Neg logit contributions
\">-1.243
 <+-1.235
agine-1.201
cin-1.190
andre-1.170
Token of
Token ID286
Feature activation-3.62e+00

Pos logit contributions
istered+1.785
 incumbent+1.658
 Imran+1.601
uscript+1.595
 Warden+1.549
Neg logit contributions
\">-2.298
agine-2.284
 <+-2.265
cin-2.216
andre-2.207
Token 5
Token ID642
Feature activation-1.95e+00

Pos logit contributions
istered+2.769
 incumbent+2.563
uscript+2.486
 Imran+2.447
 Warden+2.417
Neg logit contributions
\">-2.623
 <+-2.580
agine-2.519
cin-2.499
andre-2.450
Token by
Token ID416
Feature activation-5.33e+00

Pos logit contributions
istered+1.216
 incumbent+1.107
uscript+1.086
 Imran+1.066
 Warden+1.037
Neg logit contributions
\">-1.687
 <+-1.672
agine-1.634
cin-1.618
andre-1.592
Token Cory
Token ID28785
Feature activation-1.43e+00

Pos logit contributions
istered+3.644
 incumbent+3.381
 Imran+3.281
uscript+3.250
 Warden+3.229
Neg logit contributions
\">-4.245
 <+-4.128
agine-4.070
cin-4.023
bleacher-3.928
TokenB
Token ID33
Feature activation-1.81e+00

Pos logit contributions
istered+1.062
 incumbent+0.978
uscript+0.973
sembly+0.946
 Imran+0.942
Neg logit contributions
\">-1.065
 <+-1.057
agine-1.018
cin-1.008
andre-0.987
Token from
Token ID422
Feature activation-3.62e+00

Pos logit contributions
istered+1.334
 incumbent+1.220
uscript+1.217
sembly+1.182
 Imran+1.173
Neg logit contributions
\">-1.373
 <+-1.362
agine-1.311
cin-1.304
andre-1.278
Token The
Token ID383
Feature activation-2.09e+00

Pos logit contributions
istered+2.311
 incumbent+2.109
uscript+2.062
 Imran+2.049
sembly+2.029
Neg logit contributions
\">-3.033
 <+-3.026
agine-2.931
cin-2.923
andre-2.878
Token best
Token ID1266
Feature activation+8.68e-01

Pos logit contributions
istered+1.457
 incumbent+1.343
uscript+1.306
 Imran+1.294
sembly+1.280
Neg logit contributions
 <+-1.647
\">-1.645
agine-1.593
cin-1.582
andre-1.553
Token 10
Token ID838
Feature activation-8.42e-01

Pos logit contributions
istered+1.075
 incumbent+1.003
uscript+0.998
 Imran+0.972
sembly+0.969
Neg logit contributions
\">-0.735
 <+-0.720
agine-0.697
cin-0.687
andre-0.674
Token/
Token ID14
Feature activation-1.14e+00

Pos logit contributions
istered+0.602
 incumbent+0.551
uscript+0.548
sembly+0.531
 Imran+0.530
Neg logit contributions
\">-0.657
 <+-0.651
agine-0.630
cin-0.624
andre-0.615
Token6
Token ID21
Feature activation-1.75e+00

Pos logit contributions
istered+0.999
 incumbent+0.930
uscript+0.922
 Imran+0.908
 Warden+0.896
Neg logit contributions
\">-0.689
 <+-0.685
agine-0.670
cin-0.659
andre-0.640
Token/
Token ID14
Feature activation-1.78e+00

Pos logit contributions
istered+1.113
 incumbent+1.015
uscript+1.001
 Imran+0.984
 Warden+0.966
Neg logit contributions
\">-1.475
 <+-1.461
agine-1.445
cin-1.424
andre-1.405
Token08
Token ID2919
Feature activation-1.81e+00

Pos logit contributions
istered+1.826
 incumbent+1.730
uscript+1.710
 Imran+1.685
 Warden+1.675
Neg logit contributions
 <+-0.802
\">-0.801
agine-0.785
cin-0.747
andre-0.725
Token Rating
Token ID12028
Feature activation-5.32e+00

Pos logit contributions
istered+1.545
 incumbent+1.438
uscript+1.429
 Imran+1.392
sembly+1.387
Neg logit contributions
\">-1.156
 <+-1.132
agine-1.100
cin-1.081
andre-1.063
Token:
Token ID25
Feature activation-3.26e+00

Pos logit contributions
istered+3.209
 incumbent+2.977
 Imran+2.873
uscript+2.864
 Warden+2.824
Neg logit contributions
agine-4.491
\">-4.468
 <+-4.453
cin-4.357
andre-4.319
Token 5
Token ID642
Feature activation-1.49e+00

Pos logit contributions
istered+2.362
 incumbent+2.202
uscript+2.154
 Imran+2.121
 Warden+2.102
Neg logit contributions
\">-2.435
 <+-2.403
agine-2.368
cin-2.334
andre-2.311
Token
Token ID198
Feature activation-2.62e+00

Pos logit contributions
istered+0.898
 incumbent+0.806
uscript+0.802
sembly+0.776
 Imran+0.769
Neg logit contributions
\">-1.339
 <+-1.332
agine-1.286
cin-1.284
andre-1.264
Token
Token ID198
Feature activation-2.43e+00

Pos logit contributions
istered+1.312
 incumbent+1.175
 Imran+1.149
uscript+1.129
 Warden+1.104
Neg logit contributions
\">-2.543
 <+-2.525
agine-2.509
cin-2.453
andre-2.422
TokenCrit
Token ID18559
Feature activation-3.30e+00

Pos logit contributions
istered+1.659
 incumbent+1.498
uscript+1.490
 Imran+1.440
sembly+1.427
Neg logit contributions
\">-1.960
 <+-1.950
agine-1.903
cin-1.857
andre-1.827
Token
Token ID198
Feature activation-4.61e-01

Pos logit contributions
\">+1.226
 <++1.224
agine+1.172
cin+1.169
andre+1.149
Neg logit contributions
istered-1.094
 incumbent-1.000
uscript-0.997
sembly-0.973
 Imran-0.959
Token
Token ID198
Feature activation+2.02e-01

Pos logit contributions
istered+0.253
 incumbent+0.225
uscript+0.222
 Imran+0.216
 Warden+0.209
Neg logit contributions
\">-0.434
 <+-0.431
agine-0.422
cin-0.418
andre-0.412
Tokenpur
Token ID14225
Feature activation+9.04e-01

Pos logit contributions
\">+0.140
 <++0.139
agine+0.133
cin+0.132
andre+0.129
Neg logit contributions
istered-0.163
 incumbent-0.150
uscript-0.150
 Imran-0.145
sembly-0.145
Tokenak
Token ID461
Feature activation-1.19e+00

Pos logit contributions
\">+0.611
 <++0.605
agine+0.578
cin+0.575
andre+0.565
Neg logit contributions
istered-0.743
 incumbent-0.687
uscript-0.686
sembly-0.669
 Imran-0.664
Tokenushi
Token ID17731
Feature activation-2.60e+00

Pos logit contributions
istered+0.915
uscript+0.842
 incumbent+0.840
sembly+0.815
 Imran+0.810
Neg logit contributions
\">-0.868
 <+-0.855
agine-0.820
cin-0.812
andre-0.800
Token Profile
Token ID13118
Feature activation-5.23e+00

Pos logit contributions
istered+1.943
 incumbent+1.796
uscript+1.782
 Imran+1.719
sembly+1.705
Neg logit contributions
\">-1.941
 <+-1.899
agine-1.854
cin-1.828
andre-1.803
Token Joined
Token ID16709
Feature activation-9.03e-01

Pos logit contributions
istered+3.610
 incumbent+3.351
uscript+3.250
 Warden+3.208
 Imran+3.206
Neg logit contributions
\">-4.128
agine-3.987
 <+-3.985
cin-3.918
andre-3.845
Token August
Token ID2932
Feature activation-1.92e-01

Pos logit contributions
istered+0.721
 incumbent+0.675
uscript+0.660
 Imran+0.655
 Warden+0.641
Neg logit contributions
\">-0.623
 <+-0.610
agine-0.602
cin-0.592
andre-0.583
Token 2012
Token ID2321
Feature activation-1.78e+00

Pos logit contributions
istered+0.137
 incumbent+0.126
uscript+0.123
 Imran+0.122
 Warden+0.120
Neg logit contributions
\">-0.150
 <+-0.148
agine-0.144
cin-0.143
andre-0.140
Token United
Token ID1578
Feature activation-9.65e-01

Pos logit contributions
istered+1.482
 incumbent+1.384
 Imran+1.353
uscript+1.353
 Warden+1.340
Neg logit contributions
\">-1.170
 <+-1.153
agine-1.121
cin-1.115
andre-1.080
Token States
Token ID1829
Feature activation-1.20e+00

Pos logit contributions
istered+0.676
 incumbent+0.618
uscript+0.612
sembly+0.596
 Imran+0.594
Neg logit contributions
\">-0.767
 <+-0.762
agine-0.735
cin-0.732
andre-0.720
Token in
Token ID287
Feature activation-2.86e-01

Pos logit contributions
\">+0.340
 <++0.337
agine+0.325
cin+0.322
andre+0.317
Neg logit contributions
istered-0.351
 incumbent-0.325
uscript-0.321
 Imran-0.313
sembly-0.313
Token connection
Token ID4637
Feature activation-1.41e+00

Pos logit contributions
istered+0.200
 incumbent+0.184
uscript+0.182
 Imran+0.177
sembly+0.177
Neg logit contributions
\">-0.227
 <+-0.226
agine-0.218
cin-0.216
andre-0.212
Token with
Token ID351
Feature activation-6.78e-01

Pos logit contributions
istered+0.995
 incumbent+0.912
uscript+0.904
sembly+0.879
 Imran+0.875
Neg logit contributions
\">-1.114
 <+-1.105
agine-1.066
cin-1.062
andre-1.044
Token the
Token ID262
Feature activation-2.14e-01

Pos logit contributions
istered+0.432
 incumbent+0.393
uscript+0.385
 Imran+0.376
sembly+0.374
Neg logit contributions
\">-0.583
 <+-0.579
agine-0.563
cin-0.557
andre-0.548
Token fl
Token ID781
Feature activation-1.50e-01

Pos logit contributions
istered+0.165
 incumbent+0.153
uscript+0.150
 Imran+0.148
sembly+0.146
Neg logit contributions
\">-0.157
 <+-0.155
agine-0.150
cin-0.148
andre-0.146
Tokenot
Token ID313
Feature activation-5.18e+00

Pos logit contributions
istered+0.081
uscript+0.072
 incumbent+0.070
 Imran+0.067
sembly+0.065
Neg logit contributions
\">-0.144
 <+-0.143
agine-0.138
cin-0.137
andre-0.135
Tokenilla
Token ID5049
Feature activation-5.06e-01

Pos logit contributions
istered+4.784
uscript+4.408
iman+4.300
 incumbent+4.285
 Imran+4.260
Neg logit contributions
 <+-2.920
\">-2.806
agine-2.613
bleacher-2.573
cin-2.522
Token incident
Token ID4519
Feature activation-7.16e-02

Pos logit contributions
istered+0.409
 incumbent+0.382
uscript+0.376
 Imran+0.370
sembly+0.366
Neg logit contributions
\">-0.347
 <+-0.342
agine-0.330
cin-0.325
andre-0.322
Token and
Token ID290
Feature activation-4.88e-01

Pos logit contributions
istered+0.051
 incumbent+0.047
uscript+0.047
sembly+0.045
 Imran+0.045
Neg logit contributions
\">-0.056
 <+-0.055
agine-0.053
cin-0.053
andre-0.052
Token will
Token ID481
Feature activation+4.01e-01

Pos logit contributions
istered+0.349
 incumbent+0.326
uscript+0.317
 Imran+0.312
sembly+0.307
Neg logit contributions
\">-0.379
 <+-0.375
agine-0.365
cin-0.361
andre-0.355
Token prevent
Token ID2948
Feature activation+4.38e-01

Pos logit contributions
\">+0.304
 <++0.302
agine+0.292
cin+0.290
andre+0.286
Neg logit contributions
istered-0.294
 incumbent-0.273
uscript-0.268
 Imran-0.261
sembly-0.259
Token:
Token ID25
Feature activation+1.88e+00

Pos logit contributions
\">+0.442
 <++0.439
agine+0.425
cin+0.422
andre+0.415
Neg logit contributions
istered-0.402
 incumbent-0.368
uscript-0.365
 Imran-0.355
sembly-0.349
Token47
Token ID2857
Feature activation+2.05e+00

Pos logit contributions
\">+0.938
 <++0.930
cin+0.855
agine+0.843
andre+0.821
Neg logit contributions
istered-1.882
sembly-1.778
 incumbent-1.772
uscript-1.761
 Imran-1.712
Token]
Token ID60
Feature activation+2.09e+00

Pos logit contributions
\">+1.060
 <++1.050
agine+0.990
cin+0.984
andre+0.959
Neg logit contributions
istered-2.007
 incumbent-1.883
uscript-1.876
sembly-1.842
 Imran-1.831
Token <
Token ID1279
Feature activation+1.58e+00

Pos logit contributions
 <++1.208
\">+1.175
agine+1.092
cin+1.088
andre+1.065
Neg logit contributions
istered-1.953
 incumbent-1.818
uscript-1.812
sembly-1.789
 Imran-1.772
Tokensh
Token ID1477
Feature activation+8.58e-01

Pos logit contributions
\">+1.174
 <++1.166
agine+1.121
cin+1.117
andre+1.098
Neg logit contributions
istered-1.191
 incumbent-1.096
uscript-1.090
sembly-1.064
 Imran-1.057
Tokeno
Token ID78
Feature activation-5.00e+00

Pos logit contributions
\">+0.640
 <++0.636
agine+0.608
cin+0.606
andre+0.593
Neg logit contributions
istered-0.646
uscript-0.591
 incumbent-0.589
sembly-0.573
 Imran-0.568
Tokenja
Token ID6592
Feature activation-1.19e+00

Pos logit contributions
istered+3.578
uscript+3.198
 incumbent+3.099
iman+3.095
sembly+3.009
Neg logit contributions
\">-3.836
 <+-3.836
cin-3.611
agine-3.604
bleacher-3.526
Token>
Token ID29
Feature activation-1.38e+00

Pos logit contributions
istered+1.009
uscript+0.936
 incumbent+0.926
sembly+0.905
 Imran+0.895
Neg logit contributions
\">-0.765
 <+-0.762
agine-0.728
cin-0.719
andre-0.700
Token Frank
Token ID5278
Feature activation+4.45e-01

Pos logit contributions
istered+1.017
 incumbent+0.948
uscript+0.932
 Imran+0.913
 Warden+0.909
Neg logit contributions
\">-1.034
 <+-1.001
agine-0.985
cin-0.978
andre-0.965
Tokener
Token ID263
Feature activation+3.40e-01

Pos logit contributions
\">+0.346
 <++0.343
agine+0.331
cin+0.329
andre+0.324
Neg logit contributions
istered-0.319
 incumbent-0.293
uscript-0.291
sembly-0.282
 Imran-0.281
TokenZ
Token ID57
Feature activation-2.13e-01

Pos logit contributions
\">+0.262
 <++0.258
agine+0.249
cin+0.248
andre+0.244
Neg logit contributions
istered-0.247
 incumbent-0.227
uscript-0.227
sembly-0.219
 Imran-0.218
Token a
Token ID257
Feature activation+2.65e-01

Pos logit contributions
istered+0.119
 incumbent+0.110
uscript+0.108
 Imran+0.105
sembly+0.105
Neg logit contributions
\">-0.143
 <+-0.142
agine-0.137
cin-0.136
andre-0.133
Token Gaza
Token ID11010
Feature activation-8.97e-01

Pos logit contributions
\">+0.200
 <++0.200
agine+0.192
cin+0.191
andre+0.187
Neg logit contributions
istered-0.196
 incumbent-0.181
uscript-0.179
sembly-0.174
 Imran-0.174
Token-
Token ID12
Feature activation+1.48e+00

Pos logit contributions
istered+0.688
 incumbent+0.635
uscript+0.622
 Imran+0.606
sembly+0.604
Neg logit contributions
 <+-0.656
\">-0.650
agine-0.630
cin-0.627
andre-0.613
Tokenbound
Token ID7784
Feature activation+1.35e+00

Pos logit contributions
\">+1.210
 <++1.203
agine+1.160
cin+1.155
andre+1.137
Neg logit contributions
istered-1.001
 incumbent-0.911
uscript-0.906
sembly-0.881
 Imran-0.874
Token fl
Token ID781
Feature activation+5.67e-01

Pos logit contributions
\">+0.992
 <++0.985
agine+0.946
cin+0.941
andre+0.925
Neg logit contributions
istered-1.032
 incumbent-0.951
uscript-0.946
sembly-0.922
 Imran-0.917
Tokenot
Token ID313
Feature activation-5.00e+00

Pos logit contributions
\">+0.525
 <++0.523
agine+0.506
cin+0.504
andre+0.497
Neg logit contributions
istered-0.324
 incumbent-0.290
uscript-0.287
sembly-0.277
 Imran-0.275
Tokenilla
Token ID5049
Feature activation+1.09e+00

Pos logit contributions
istered+4.657
 incumbent+4.273
uscript+4.238
iman+4.198
 Warden+4.190
Neg logit contributions
 <+-2.796
\">-2.710
agine-2.575
andre-2.540
cin-2.528
Token that
Token ID326
Feature activation+2.52e-01

Pos logit contributions
\">+0.842
 <++0.833
agine+0.805
cin+0.798
andre+0.786
Neg logit contributions
istered-0.789
 incumbent-0.727
uscript-0.720
 Imran-0.699
sembly-0.697
Token killed
Token ID2923
Feature activation-7.30e-01

Pos logit contributions
\">+0.195
 <++0.193
agine+0.187
cin+0.184
andre+0.181
Neg logit contributions
istered-0.182
 incumbent-0.169
uscript-0.166
 Imran-0.162
sembly-0.160
Token eight
Token ID3624
Feature activation+6.03e-02

Pos logit contributions
istered+0.575
 incumbent+0.541
uscript+0.527
 Imran+0.525
 Warden+0.512
Neg logit contributions
\">-0.513
 <+-0.513
agine-0.495
cin-0.493
andre-0.480
Token Turkish
Token ID9663
Feature activation+1.76e+00

Pos logit contributions
\">+0.045
 <++0.044
agine+0.043
cin+0.043
andre+0.042
Neg logit contributions
istered-0.045
 incumbent-0.042
uscript-0.041
 Imran-0.041
sembly-0.040
Tokena
Token ID64
Feature activation+5.59e-01

Pos logit contributions
\">+0.343
 <++0.341
agine+0.330
cin+0.327
andre+0.323
Neg logit contributions
istered-0.270
 incumbent-0.246
uscript-0.245
 Imran-0.236
sembly-0.235
Token,
Token ID837
Feature activation-1.02e+00

Pos logit contributions
\">+0.468
 <++0.462
agine+0.451
cin+0.448
andre+0.441
Neg logit contributions
istered-0.367
 incumbent-0.335
uscript-0.331
 Imran-0.321
sembly-0.317
Token T
Token ID309
Feature activation-1.02e-01

Pos logit contributions
istered+0.475
 incumbent+0.413
uscript+0.410
sembly+0.392
 Imran+0.388
Neg logit contributions
\">-1.047
 <+-1.042
agine-1.013
cin-1.010
andre-0.997
Token >
Token ID1875
Feature activation+5.45e-01

Pos logit contributions
istered+0.080
 incumbent+0.074
uscript+0.074
 Imran+0.072
sembly+0.071
Neg logit contributions
\">-0.072
 <+-0.071
agine-0.069
cin-0.069
andre-0.068
Token D
Token ID360
Feature activation-1.08e+00

Pos logit contributions
\">+0.462
 <++0.459
agine+0.444
cin+0.442
andre+0.436
Neg logit contributions
istered-0.353
 incumbent-0.321
uscript-0.318
sembly-0.308
 Imran-0.307
Tokenere
Token ID567
Feature activation-4.99e+00

Pos logit contributions
istered+0.894
 incumbent+0.825
uscript+0.825
sembly+0.800
 Imran+0.792
Neg logit contributions
\">-0.722
 <+-0.716
agine-0.688
cin-0.677
andre-0.667
Tokenf
Token ID69
Feature activation-1.81e+00

Pos logit contributions
istered+3.125
uscript+2.802
 incumbent+2.792
 Imran+2.741
iman+2.666
Neg logit contributions
\">-4.166
 <+-4.029
agine-3.949
cin-3.914
bleacher-3.883
Token for
Token ID329
Feature activation-8.68e-01

Pos logit contributions
istered+1.425
 incumbent+1.315
uscript+1.292
 Imran+1.268
sembly+1.258
Neg logit contributions
\">-1.272
agine-1.233
 <+-1.232
cin-1.218
andre-1.197
Token Shared
Token ID39403
Feature activation+4.65e-01

Pos logit contributions
istered+0.424
 incumbent+0.373
uscript+0.366
 Imran+0.357
 Warden+0.347
Neg logit contributions
\">-0.870
 <+-0.860
agine-0.849
cin-0.843
andre-0.835
Token <
Token ID1279
Feature activation-6.93e-01

Pos logit contributions
\">+0.285
 <++0.280
agine+0.271
cin+0.268
andre+0.263
Neg logit contributions
istered-0.410
 incumbent-0.382
uscript-0.379
 Imran-0.371
sembly-0.368
Token '
Token ID705
Feature activation-1.57e-02

Pos logit contributions
istered+0.526
 incumbent+0.492
uscript+0.482
 Imran+0.477
 Warden+0.467
Neg logit contributions
\">-0.501
 <+-0.495
agine-0.484
cin-0.479
andre-0.470
Token.
Token ID13
Feature activation-1.33e-01

Pos logit contributions
\">+0.353
 <++0.351
agine+0.338
cin+0.336
andre+0.331
Neg logit contributions
istered-0.334
 incumbent-0.306
uscript-0.304
sembly-0.296
 Imran-0.295
Token Although
Token ID4900
Feature activation-2.23e-01

Pos logit contributions
istered+0.084
 incumbent+0.076
uscript+0.076
sembly+0.074
 Imran+0.073
Neg logit contributions
\">-0.115
 <+-0.114
agine-0.110
cin-0.110
andre-0.108
Token he
Token ID339
Feature activation-2.56e-02

Pos logit contributions
istered+0.149
 incumbent+0.135
uscript+0.134
sembly+0.131
 Imran+0.129
Neg logit contributions
\">-0.185
 <+-0.184
agine-0.177
cin-0.177
andre-0.174
Token's
Token ID338
Feature activation-3.00e-01

Pos logit contributions
istered+0.018
 incumbent+0.016
uscript+0.016
sembly+0.016
 Imran+0.015
Neg logit contributions
\">-0.021
 <+-0.021
agine-0.020
cin-0.020
andre-0.019
Token only
Token ID691
Feature activation-4.09e-01

Pos logit contributions
istered+0.211
 incumbent+0.193
uscript+0.192
sembly+0.186
 Imran+0.185
Neg logit contributions
\">-0.239
 <+-0.237
agine-0.228
cin-0.227
andre-0.224
Token briefly
Token ID11589
Feature activation-4.95e+00

Pos logit contributions
istered+0.287
 incumbent+0.262
uscript+0.261
sembly+0.254
 Imran+0.252
Neg logit contributions
\">-0.325
 <+-0.323
agine-0.311
cin-0.310
andre-0.305
Token seen
Token ID1775
Feature activation-8.22e-01

Pos logit contributions
istered+3.301
 incumbent+3.015
uscript+2.936
 Imran+2.871
 tether+2.820
Neg logit contributions
\">-4.062
 <+-4.039
agine-3.912
cin-3.855
andre-3.824
Token,
Token ID11
Feature activation-1.96e+00

Pos logit contributions
istered+0.597
 incumbent+0.547
uscript+0.543
sembly+0.527
 Imran+0.526
Neg logit contributions
\">-0.634
 <+-0.629
agine-0.607
cin-0.603
andre-0.593
Token Row
Token ID11314
Feature activation+7.73e-01

Pos logit contributions
istered+1.260
 incumbent+1.146
uscript+1.128
 Imran+1.097
sembly+1.091
Neg logit contributions
\">-1.673
 <+-1.661
agine-1.606
cin-1.599
andre-1.577
Tokenlf
Token ID1652
Feature activation-1.30e-01

Pos logit contributions
\">+0.375
 <++0.372
agine+0.349
cin+0.346
andre+0.337
Neg logit contributions
istered-0.781
 incumbent-0.733
uscript-0.732
sembly-0.715
 Imran-0.714
Token had
Token ID550
Feature activation-1.50e+00

Pos logit contributions
istered+0.089
 incumbent+0.082
uscript+0.081
sembly+0.079
 Imran+0.078
Neg logit contributions
\">-0.105
 <+-0.104
agine-0.100
cin-0.100
andre-0.098
TokenRated
Token ID15322
Feature activation-4.12e-01

Pos logit contributions
istered+0.929
 incumbent+0.847
uscript+0.842
 Imran+0.820
 Warden+0.797
Neg logit contributions
\">-0.897
 <+-0.889
agine-0.869
cin-0.854
andre-0.832
Token 5
Token ID642
Feature activation-8.83e-01

Pos logit contributions
istered+0.326
 incumbent+0.302
uscript+0.299
sembly+0.291
 Imran+0.291
Neg logit contributions
\">-0.290
 <+-0.288
agine-0.277
cin-0.275
andre-0.270
Token out
Token ID503
Feature activation-2.38e+00

Pos logit contributions
istered+0.575
 incumbent+0.528
uscript+0.516
 Imran+0.503
 Warden+0.492
Neg logit contributions
\">-0.740
 <+-0.737
agine-0.718
cin-0.711
andre-0.700
Token of
Token ID286
Feature activation-3.01e+00

Pos logit contributions
istered+1.532
 incumbent+1.418
uscript+1.370
 Imran+1.369
 Warden+1.332
Neg logit contributions
\">-1.959
 <+-1.936
agine-1.931
cin-1.882
andre-1.868
Token 5
Token ID642
Feature activation-1.50e+00

Pos logit contributions
istered+2.285
 incumbent+2.122
uscript+2.059
 Imran+2.032
 Warden+2.010
Neg logit contributions
\">-2.178
 <+-2.146
agine-2.106
cin-2.078
andre-2.041
Token by
Token ID416
Feature activation-4.94e+00

Pos logit contributions
istered+0.942
 incumbent+0.859
uscript+0.844
 Imran+0.824
 Warden+0.805
Neg logit contributions
\">-1.292
 <+-1.283
agine-1.250
cin-1.241
andre-1.221
Token Quake
Token ID42901
Feature activation-2.21e+00

Pos logit contributions
istered+3.357
 incumbent+3.111
 Imran+3.018
uscript+3.002
 Warden+2.972
Neg logit contributions
\">-3.960
 <+-3.856
agine-3.794
cin-3.756
bleacher-3.664
TokenD
Token ID35
Feature activation-2.01e+00

Pos logit contributions
istered+1.695
 incumbent+1.562
uscript+1.550
sembly+1.503
 Imran+1.502
Neg logit contributions
\">-1.595
 <+-1.575
agine-1.515
cin-1.500
andre-1.463
Tokenude
Token ID2507
Feature activation-2.15e+00

Pos logit contributions
istered+1.501
 incumbent+1.380
uscript+1.373
 Imran+1.329
sembly+1.327
Neg logit contributions
\">-1.496
 <+-1.478
agine-1.421
cin-1.414
andre-1.384
Token from
Token ID422
Feature activation-2.83e+00

Pos logit contributions
istered+1.536
 incumbent+1.406
uscript+1.400
sembly+1.353
 Imran+1.351
Neg logit contributions
\">-1.666
 <+-1.648
agine-1.608
cin-1.587
andre-1.551
Token Explosive
Token ID48178
Feature activation-6.94e-01

Pos logit contributions
istered+1.847
 incumbent+1.686
uscript+1.657
 Imran+1.638
sembly+1.625
Neg logit contributions
\">-2.353
 <+-2.337
agine-2.266
cin-2.261
andre-2.222
Tokenorg
Token ID2398
Feature activation-1.39e+00

Pos logit contributions
istered+0.516
 incumbent+0.462
uscript+0.459
 Imran+0.444
sembly+0.440
Neg logit contributions
\">-0.653
 <+-0.653
agine-0.629
cin-0.620
andre-0.609
Token/
Token ID14
Feature activation-7.91e-02

Pos logit contributions
istered+0.870
 incumbent+0.799
uscript+0.792
 Imran+0.772
 Warden+0.747
Neg logit contributions
 <+-1.175
\">-1.174
agine-1.143
cin-1.135
andre-1.117
Tokendocument
Token ID22897
Feature activation+8.88e-01

Pos logit contributions
istered+0.059
uscript+0.054
 incumbent+0.054
sembly+0.052
 Imran+0.052
Neg logit contributions
\">-0.059
 <+-0.059
agine-0.057
cin-0.056
andre-0.055
Token/
Token ID14
Feature activation+1.91e-01

Pos logit contributions
\">+0.723
 <++0.720
agine+0.694
cin+0.692
andre+0.680
Neg logit contributions
istered-0.604
 incumbent-0.550
uscript-0.548
sembly-0.532
 Imran-0.527
Tokent
Token ID83
Feature activation-2.29e+00

Pos logit contributions
\">+0.127
 <++0.127
agine+0.121
cin+0.120
andre+0.118
Neg logit contributions
istered-0.158
 incumbent-0.146
uscript-0.146
sembly-0.143
 Imran-0.142
Tokenatt
Token ID1078
Feature activation-4.92e+00

Pos logit contributions
istered+1.154
uscript+1.004
 incumbent+0.990
sembly+0.946
 Imran+0.932
Neg logit contributions
 <+-2.258
\">-2.256
agine-2.176
cin-2.171
andre-2.137
Tokenoo
Token ID2238
Feature activation-7.32e-01

Pos logit contributions
istered+3.110
uscript+2.824
 incumbent+2.711
iman+2.652
 Warden+2.581
Neg logit contributions
\">-4.213
 <+-4.177
agine-3.948
bleacher-3.941
cin-3.909
Token-
Token ID12
Feature activation+4.08e-01

Pos logit contributions
istered+0.531
 incumbent+0.484
uscript+0.483
sembly+0.469
 Imran+0.467
Neg logit contributions
\">-0.564
 <+-0.563
agine-0.541
cin-0.536
andre-0.526
Tokencompl
Token ID23855
Feature activation+7.13e-01

Pos logit contributions
\">+0.304
 <++0.302
agine+0.289
cin+0.286
andre+0.282
Neg logit contributions
istered-0.308
 incumbent-0.282
uscript-0.282
sembly-0.274
 Imran-0.272
Tokenaint
Token ID2913
Feature activation+3.11e+00

Pos logit contributions
\">+0.462
 <++0.459
agine+0.437
cin+0.434
andre+0.426
Neg logit contributions
istered-0.606
 incumbent-0.562
uscript-0.560
sembly-0.547
 Imran-0.544
Token
Token ID198
Feature activation+7.67e-01

Pos logit contributions
\">+2.503
 <++2.489
agine+2.402
cin+2.394
andre+2.376
Neg logit contributions
istered-2.137
 incumbent-1.929
uscript-1.908
sembly-1.870
 Imran-1.841
Token the
Token ID262
Feature activation-7.50e-01

Pos logit contributions
istered+0.755
 incumbent+0.695
uscript+0.685
 Imran+0.661
sembly+0.661
Neg logit contributions
\">-0.978
 <+-0.963
agine-0.935
cin-0.932
andre-0.919
Token amount
Token ID2033
Feature activation-1.66e-01

Pos logit contributions
istered+0.542
 incumbent+0.501
uscript+0.495
sembly+0.480
 Imran+0.478
Neg logit contributions
\">-0.579
 <+-0.571
agine-0.554
cin-0.551
andre-0.542
Token of
Token ID286
Feature activation-1.29e+00

Pos logit contributions
istered+0.118
 incumbent+0.109
uscript+0.108
 Imran+0.105
sembly+0.104
Neg logit contributions
\">-0.129
 <+-0.128
agine-0.124
cin-0.123
andre-0.121
Token health
Token ID1535
Feature activation-1.30e+00

Pos logit contributions
istered+0.913
 incumbent+0.841
uscript+0.832
sembly+0.805
 Imran+0.803
Neg logit contributions
\">-1.024
 <+-1.008
agine-0.979
cin-0.973
andre-0.959
Token the
Token ID262
Feature activation-1.25e+00

Pos logit contributions
istered+0.911
 incumbent+0.836
uscript+0.826
 Imran+0.803
sembly+0.799
Neg logit contributions
\">-1.033
 <+-1.022
agine-0.989
cin-0.980
andre-0.964
Token monsters
Token ID10963
Feature activation-4.88e+00

Pos logit contributions
istered+0.970
 incumbent+0.906
uscript+0.894
 Warden+0.869
 Imran+0.866
Neg logit contributions
\">-0.896
 <+-0.881
agine-0.855
cin-0.848
andre-0.833
Token you
Token ID345
Feature activation-5.43e-01

Pos logit contributions
istered+3.411
 incumbent+3.233
 Warden+3.060
uscript+3.059
 tether+3.036
Neg logit contributions
 <+-3.728
\">-3.715
cin-3.557
agine-3.556
andre-3.533
Token'll
Token ID1183
Feature activation-1.53e+00

Pos logit contributions
istered+0.427
 incumbent+0.393
uscript+0.391
sembly+0.380
 Imran+0.377
Neg logit contributions
\">-0.385
 <+-0.383
agine-0.367
cin-0.365
andre-0.360
Token face
Token ID1986
Feature activation+2.16e+00

Pos logit contributions
istered+1.155
 incumbent+1.065
uscript+1.052
sembly+1.017
 Imran+1.015
Neg logit contributions
\">-1.121
 <+-1.113
agine-1.068
cin-1.064
andre-1.057
Token have
Token ID423
Feature activation+4.37e-01

Pos logit contributions
\">+1.657
 <++1.648
agine+1.585
cin+1.578
andre+1.552
Neg logit contributions
istered-1.579
 incumbent-1.444
uscript-1.441
sembly-1.405
 Imran-1.393
Token.
Token ID13
Feature activation-2.55e-01

Pos logit contributions
\">+0.346
 <++0.343
agine+0.330
cin+0.329
andre+0.323
Neg logit contributions
istered-0.309
 incumbent-0.283
uscript-0.281
sembly-0.273
 Imran-0.272
Token proposed
Token ID5150
Feature activation+1.18e+00

Pos logit contributions
\">+0.240
 <++0.238
cin+0.228
agine+0.228
andre+0.224
Neg logit contributions
istered-0.211
 incumbent-0.196
uscript-0.191
 Imran-0.188
sembly-0.185
Token sales
Token ID4200
Feature activation-1.23e+00

Pos logit contributions
\">+0.830
 <++0.824
agine+0.789
cin+0.788
andre+0.773
Neg logit contributions
istered-0.930
 incumbent-0.870
uscript-0.854
 Imran-0.839
sembly-0.831
Token includes
Token ID3407
Feature activation-1.17e+00

Pos logit contributions
istered+0.847
 incumbent+0.806
uscript+0.773
 Imran+0.767
sembly+0.750
Neg logit contributions
\">-0.970
 <+-0.965
agine-0.923
cin-0.921
andre-0.903
Token 19
Token ID678
Feature activation-7.43e-01

Pos logit contributions
istered+0.836
 incumbent+0.777
uscript+0.758
 Imran+0.750
 Warden+0.733
Neg logit contributions
\">-0.915
 <+-0.913
cin-0.875
agine-0.874
andre-0.862
Token F
Token ID376
Feature activation+9.67e-01

Pos logit contributions
istered+0.497
 incumbent+0.458
uscript+0.449
 Imran+0.437
sembly+0.432
Neg logit contributions
\">-0.609
 <+-0.609
agine-0.587
cin-0.585
andre-0.578
Token-
Token ID12
Feature activation+3.17e+00

Pos logit contributions
\">+0.761
 <++0.757
agine+0.726
cin+0.724
andre+0.711
Neg logit contributions
istered-0.689
 incumbent-0.630
uscript-0.623
sembly-0.606
 Imran-0.602
Token16
Token ID1433
Feature activation+3.02e+00

Pos logit contributions
\">+1.905
 <++1.893
agine+1.801
cin+1.793
andre+1.752
Neg logit contributions
istered-2.830
 incumbent-2.638
uscript-2.625
sembly-2.570
 Imran-2.557
TokenV
Token ID53
Feature activation+1.17e+00

Pos logit contributions
\">+2.220
 <++2.192
agine+2.107
cin+2.102
andre+2.066
Neg logit contributions
istered-2.299
uscript-2.115
 incumbent-2.104
sembly-2.059
 Imran-2.033
Token jets
Token ID20792
Feature activation+1.62e-01

Pos logit contributions
\">+0.883
 <++0.878
agine+0.844
cin+0.840
andre+0.826
Neg logit contributions
istered-0.873
 incumbent-0.803
uscript-0.797
sembly-0.776
 Imran-0.774
Token made
Token ID925
Feature activation+5.79e-01

Pos logit contributions
\">+0.126
 <++0.125
agine+0.121
cin+0.120
andre+0.118
Neg logit contributions
istered-0.116
 incumbent-0.107
uscript-0.106
 Imran-0.103
sembly-0.102
Token by
Token ID416
Feature activation+1.45e-01

Pos logit contributions
\">+0.481
 <++0.478
agine+0.462
cin+0.460
andre+0.453
Neg logit contributions
istered-0.384
 incumbent-0.353
uscript-0.346
 Imran-0.338
sembly-0.337
Token,
Token ID11
Feature activation-5.93e-02

Pos logit contributions
istered+1.009
 incumbent+0.928
uscript+0.923
sembly+0.891
 Imran+0.888
Neg logit contributions
\">-1.114
 <+-1.105
agine-1.069
cin-1.055
andre-1.040
Token Lv
Token ID24388
Feature activation+7.18e-01

Pos logit contributions
istered+0.043
 incumbent+0.040
uscript+0.039
sembly+0.038
 Imran+0.038
Neg logit contributions
\">-0.045
 <+-0.045
agine-0.044
cin-0.043
andre-0.043
Tokeniv
Token ID452
Feature activation+9.65e-01

Pos logit contributions
\">+0.510
 <++0.506
agine+0.484
cin+0.482
andre+0.473
Neg logit contributions
istered-0.565
 incumbent-0.521
uscript-0.519
sembly-0.505
 Imran-0.502
Token and
Token ID290
Feature activation+5.24e-01

Pos logit contributions
\">+0.737
 <++0.732
agine+0.704
cin+0.702
andre+0.690
Neg logit contributions
istered-0.706
 incumbent-0.646
uscript-0.644
sembly-0.627
 Imran-0.623
Token Iv
Token ID16975
Feature activation+1.70e+00

Pos logit contributions
\">+0.434
 <++0.431
agine+0.416
cin+0.414
andre+0.407
Neg logit contributions
istered-0.351
 incumbent-0.320
uscript-0.317
sembly-0.308
 Imran-0.306
Tokenano
Token ID5733
Feature activation+4.01e+00

Pos logit contributions
\">+0.474
 <++0.464
agine+0.410
cin+0.407
andre+0.385
Neg logit contributions
istered-2.070
 incumbent-1.964
uscript-1.959
sembly-1.930
 Imran-1.919
Token-
Token ID12
Feature activation+2.27e+00

Pos logit contributions
\">+2.849
 <++2.818
agine+2.732
cin+2.706
andre+2.645
Neg logit contributions
istered-3.090
uscript-2.854
 incumbent-2.844
sembly-2.822
 Imran-2.729
TokenFrank
Token ID17439
Feature activation+9.42e-01

Pos logit contributions
\">+1.301
 <++1.296
cin+1.219
agine+1.219
andre+1.194
Neg logit contributions
istered-2.091
 incumbent-1.953
uscript-1.952
sembly-1.912
 Imran-1.899
Tokeniv
Token ID452
Feature activation+1.12e+00

Pos logit contributions
\">+0.673
 <++0.666
agine+0.642
cin+0.634
andre+0.627
Neg logit contributions
istered-0.736
 incumbent-0.681
uscript-0.675
sembly-0.658
 Imran-0.652
Tokensk
Token ID8135
Feature activation+8.40e-01

Pos logit contributions
\">+0.854
 <++0.848
agine+0.816
cin+0.812
andre+0.798
Neg logit contributions
istered-0.822
 incumbent-0.754
uscript-0.751
sembly-0.731
 Imran-0.726
Token.
Token ID13
Feature activation+1.23e+00

Pos logit contributions
\">+0.631
 <++0.628
agine+0.603
cin+0.601
andre+0.591
Neg logit contributions
istered-0.624
 incumbent-0.572
uscript-0.571
sembly-0.556
 Imran-0.553
Token from
Token ID422
Feature activation-1.02e+00

Pos logit contributions
\">+0.374
 <++0.370
agine+0.356
cin+0.355
andre+0.349
Neg logit contributions
istered-0.349
 incumbent-0.322
uscript-0.318
 Imran-0.310
sembly-0.308
Token their
Token ID511
Feature activation+9.26e-01

Pos logit contributions
istered+0.774
 incumbent+0.720
uscript+0.706
 Imran+0.692
 Warden+0.683
Neg logit contributions
\">-0.759
 <+-0.747
agine-0.723
cin-0.719
andre-0.706
Token carry
Token ID3283
Feature activation+2.83e+00

Pos logit contributions
\">+0.650
 <++0.642
agine+0.619
cin+0.615
andre+0.605
Neg logit contributions
istered-0.738
 incumbent-0.684
uscript-0.676
 Imran-0.660
sembly-0.656
Token-
Token ID12
Feature activation+5.87e-01

Pos logit contributions
\">+2.116
 <++2.091
cin+2.005
agine+2.000
andre+1.959
Neg logit contributions
istered-2.117
 incumbent-1.941
uscript-1.940
sembly-1.917
 Imran-1.881
Tokenon
Token ID261
Feature activation+1.90e-01

Pos logit contributions
\">+0.480
 <++0.479
agine+0.458
cin+0.454
andre+0.449
Neg logit contributions
istered-0.400
uscript-0.364
 incumbent-0.362
sembly-0.347
 Imran-0.347
Token baggage
Token ID30466
Feature activation+4.04e+00

Pos logit contributions
\">+0.121
 <++0.119
agine+0.114
cin+0.114
andre+0.112
Neg logit contributions
istered-0.165
 incumbent-0.154
uscript-0.152
 Imran-0.149
sembly-0.147
Token when
Token ID618
Feature activation+9.32e-02

Pos logit contributions
\">+2.977
 <++2.955
agine+2.830
cin+2.814
andre+2.755
Neg logit contributions
istered-3.045
uscript-2.795
 incumbent-2.789
sembly-2.767
 Imran-2.703
Token going
Token ID1016
Feature activation+1.38e+00

Pos logit contributions
\">+0.072
 <++0.071
agine+0.068
cin+0.068
andre+0.067
Neg logit contributions
istered-0.068
 incumbent-0.063
uscript-0.062
 Imran-0.060
sembly-0.060
Token through
Token ID832
Feature activation-4.53e-01

Pos logit contributions
\">+1.174
 <++1.166
agine+1.127
cin+1.123
andre+1.105
Neg logit contributions
istered-0.891
 incumbent-0.809
uscript-0.803
sembly-0.777
 Imran-0.773
Token airline
Token ID18091
Feature activation+2.88e+00

Pos logit contributions
istered+0.318
 incumbent+0.294
uscript+0.289
 Imran+0.282
sembly+0.280
Neg logit contributions
\">-0.360
 <+-0.357
agine-0.344
cin-0.343
andre-0.337
Token security
Token ID2324
Feature activation+3.46e-01

Pos logit contributions
\">+2.125
 <++2.115
agine+2.029
cin+2.022
andre+1.980
Neg logit contributions
istered-2.179
uscript-1.998
 incumbent-1.992
sembly-1.962
 Imran-1.928
Token Forum
Token ID14867
Feature activation+4.49e-01

Pos logit contributions
istered+0.578
 incumbent+0.533
uscript+0.521
 Imran+0.512
sembly+0.508
Neg logit contributions
\">-0.677
 <+-0.673
agine-0.652
cin-0.649
andre-0.637
Token for
Token ID329
Feature activation+1.62e-01

Pos logit contributions
\">+0.346
 <++0.343
agine+0.331
cin+0.329
andre+0.324
Neg logit contributions
istered-0.324
 incumbent-0.300
uscript-0.297
sembly-0.288
 Imran-0.288
Token the
Token ID262
Feature activation-2.46e-01

Pos logit contributions
\">+0.129
 <++0.128
agine+0.124
cin+0.123
andre+0.121
Neg logit contributions
istered-0.113
 incumbent-0.105
uscript-0.103
 Imran-0.101
sembly-0.100
Token 2013
Token ID2211
Feature activation+6.32e-01

Pos logit contributions
istered+0.172
 incumbent+0.161
uscript+0.156
 Imran+0.153
sembly+0.152
Neg logit contributions
\">-0.195
 <+-0.194
agine-0.188
cin-0.185
andre-0.183
Token/
Token ID14
Feature activation-6.71e-01

Pos logit contributions
\">+0.504
 <++0.501
agine+0.483
cin+0.481
andre+0.473
Neg logit contributions
istered-0.442
 incumbent-0.403
uscript-0.401
sembly-0.390
 Imran-0.387
Token14
Token ID1415
Feature activation+3.47e+00

Pos logit contributions
istered+0.616
 incumbent+0.580
uscript+0.573
sembly+0.562
 Imran+0.559
Neg logit contributions
\">-0.383
 <+-0.383
agine-0.364
cin-0.361
andre-0.351
Token season
Token ID1622
Feature activation-8.24e-01

Pos logit contributions
\">+2.498
 <++2.484
cin+2.389
agine+2.354
andre+2.348
Neg logit contributions
istered-2.664
uscript-2.451
 incumbent-2.400
sembly-2.393
 Imran-2.347
Token
Token ID198
Feature activation-1.51e-01

Pos logit contributions
istered+0.578
 incumbent+0.534
uscript+0.527
 Imran+0.511
sembly+0.510
Neg logit contributions
\">-0.653
 <+-0.646
agine-0.625
cin-0.620
andre-0.610
Token
Token ID198
Feature activation-1.26e-01

Pos logit contributions
istered+0.083
 incumbent+0.075
uscript+0.073
 Imran+0.072
 Warden+0.069
Neg logit contributions
\">-0.142
 <+-0.141
agine-0.138
cin-0.137
andre-0.135
TokenNew
Token ID3791
Feature activation+1.77e+00

Pos logit contributions
istered+0.095
 incumbent+0.087
uscript+0.086
 Imran+0.084
sembly+0.083
Neg logit contributions
\">-0.093
 <+-0.093
agine-0.090
cin-0.088
andre-0.086
Tokencastle
Token ID18676
Feature activation+1.46e+00

Pos logit contributions
\">+1.315
 <++1.306
agine+1.255
cin+1.250
andre+1.229
Neg logit contributions
istered-1.329
 incumbent-1.220
uscript-1.216
sembly-1.185
 Imran-1.177
Token2
Token ID17
Feature activation+1.08e+00

Pos logit contributions
\">+0.664
 <++0.660
agine+0.633
cin+0.627
andre+0.616
Neg logit contributions
istered-0.853
 incumbent-0.789
uscript-0.787
sembly-0.768
 Imran-0.766
Token"]
Token ID8973
Feature activation+2.57e+00

Pos logit contributions
\">+0.509
 <++0.508
agine+0.479
cin+0.471
andre+0.463
Neg logit contributions
istered-1.109
 incumbent-1.046
uscript-1.037
 Imran-1.019
sembly-1.015
Token [
Token ID685
Feature activation+2.30e+00

Pos logit contributions
\">+2.051
 <++2.038
agine+1.962
cin+1.955
andre+1.923
Neg logit contributions
istered-1.789
 incumbent-1.631
uscript-1.624
sembly-1.584
 Imran-1.566
TokenEC
Token ID2943
Feature activation+2.21e+00

Pos logit contributions
 <++1.957
\">+1.954
agine+1.908
cin+1.873
andre+1.847
Neg logit contributions
istered-1.486
 incumbent-1.342
uscript-1.325
 Imran-1.276
 Warden-1.256
TokenO
Token ID46
Feature activation+1.40e+00

Pos logit contributions
\">+1.454
 <++1.446
agine+1.376
cin+1.363
andre+1.350
Neg logit contributions
istered-1.862
 incumbent-1.732
uscript-1.718
sembly-1.678
 Imran-1.662
Token "
Token ID366
Feature activation+4.79e+00

Pos logit contributions
\">+1.201
 <++1.182
agine+1.160
andre+1.153
bleacher+1.147
Neg logit contributions
istered-0.887
 incumbent-0.817
 Warden-0.777
uscript-0.774
 Imran-0.773
TokenA
Token ID32
Feature activation+1.14e+00

Pos logit contributions
 <++3.763
\">+3.761
cin+3.665
agine+3.597
andre+3.560
Neg logit contributions
istered-3.333
uscript-3.044
 incumbent-3.037
sembly-2.990
 Imran-2.931
Token46
Token ID3510
Feature activation+3.51e+00

Pos logit contributions
\">+0.669
 <++0.668
agine+0.630
cin+0.621
andre+0.612
Neg logit contributions
istered-1.033
 incumbent-0.962
uscript-0.957
 Imran-0.937
sembly-0.935
Token"]
Token ID8973
Feature activation+4.62e+00

Pos logit contributions
\">+1.327
 <++1.307
cin+1.195
agine+1.161
andre+1.138
Neg logit contributions
istered-3.931
uscript-3.705
 incumbent-3.688
sembly-3.652
 Imran-3.622
Token [
Token ID685
Feature activation+4.19e+00

Pos logit contributions
 <++3.491
\">+3.476
cin+3.339
agine+3.295
andre+3.254
Neg logit contributions
istered-3.420
sembly-3.146
uscript-3.111
 incumbent-3.084
 Imran-2.954
TokenWhite
Token ID12256
Feature activation+3.44e+00

Pos logit contributions
\">+2.483
 <++2.465
agine+2.366
cin+2.284
andre+2.263
Neg logit contributions
istered-3.821
 incumbent-3.562
uscript-3.518
 Imran-3.436
sembly-3.405
Token.
Token ID13
Feature activation-4.93e-01

Pos logit contributions
istered+0.398
 incumbent+0.366
uscript+0.360
 Imran+0.354
 Warden+0.347
Neg logit contributions
\">-0.462
 <+-0.459
agine-0.445
cin-0.442
andre-0.434
Token Obama
Token ID2486
Feature activation+2.96e-01

Pos logit contributions
istered+0.390
 incumbent+0.364
uscript+0.357
 Imran+0.352
 Warden+0.346
Neg logit contributions
\">-0.345
 <+-0.343
agine-0.330
cin-0.328
andre-0.322
Token said
Token ID531
Feature activation-1.88e-01

Pos logit contributions
\">+0.234
 <++0.233
agine+0.224
cin+0.223
andre+0.220
Neg logit contributions
istered-0.209
 incumbent-0.191
uscript-0.190
sembly-0.186
 Imran-0.184
Token with
Token ID351
Feature activation-2.94e-01

Pos logit contributions
istered+0.133
 incumbent+0.121
uscript+0.121
sembly+0.117
 Imran+0.117
Neg logit contributions
\">-0.148
 <+-0.147
agine-0.142
cin-0.141
andre-0.139
Token damp
Token ID21151
Feature activation-6.14e-02

Pos logit contributions
istered+0.186
 incumbent+0.170
uscript+0.166
 Imran+0.161
sembly+0.160
Neg logit contributions
\">-0.255
 <+-0.252
agine-0.245
cin-0.244
andre-0.240
Token eyes
Token ID2951
Feature activation+1.31e+00

Pos logit contributions
istered+0.046
 incumbent+0.042
uscript+0.041
 Imran+0.040
sembly+0.040
Neg logit contributions
\">-0.046
 <+-0.046
agine-0.044
cin-0.044
andre-0.043
Token
Token ID198
Feature activation+7.50e-01

Pos logit contributions
\">+1.018
 <++1.013
agine+0.973
cin+0.972
andre+0.957
Neg logit contributions
istered-0.941
 incumbent-0.857
uscript-0.855
sembly-0.832
 Imran-0.824
Token
Token ID198
Feature activation+5.43e-01

Pos logit contributions
\">+0.691
 <++0.687
agine+0.665
cin+0.663
andre+0.654
Neg logit contributions
istered-0.432
 incumbent-0.386
uscript-0.384
sembly-0.370
 Imran-0.368
TokenMrs
Token ID27034
Feature activation+1.34e-01

Pos logit contributions
\">+0.396
 <++0.394
agine+0.380
cin+0.377
andre+0.369
Neg logit contributions
istered-0.416
 incumbent-0.382
uscript-0.380
 Imran-0.370
sembly-0.369
Token.
Token ID13
Feature activation-7.46e-01

Pos logit contributions
\">+0.105
 <++0.105
agine+0.101
cin+0.100
andre+0.099
Neg logit contributions
istered-0.095
 incumbent-0.087
uscript-0.086
 Imran-0.084
sembly-0.083
Token Obama
Token ID2486
Feature activation+7.64e-01

Pos logit contributions
istered+0.596
 incumbent+0.552
uscript+0.547
sembly+0.534
 Imran+0.533
Neg logit contributions
\">-0.521
 <+-0.516
agine-0.495
cin-0.493
andre-0.483
Token�
Token ID247
Feature activation-1.68e-01

Pos logit contributions
\">+0.494
 <++0.489
agine+0.476
cin+0.474
andre+0.467
Neg logit contributions
istered-0.275
 incumbent-0.245
uscript-0.242
 Imran-0.234
sembly-0.234
Tokens
Token ID82
Feature activation+3.54e-01

Pos logit contributions
istered+0.128
 incumbent+0.118
uscript+0.117
 Imran+0.114
sembly+0.114
Neg logit contributions
\">-0.123
 <+-0.123
agine-0.117
cin-0.116
andre-0.114
Token dangerous
Token ID4923
Feature activation-7.49e-01

Pos logit contributions
\">+0.281
 <++0.277
agine+0.267
cin+0.266
andre+0.262
Neg logit contributions
istered-0.249
 incumbent-0.233
uscript-0.226
 Imran-0.221
sembly-0.220
Token.
Token ID13
Feature activation-9.09e-02

Pos logit contributions
istered+0.523
 incumbent+0.483
uscript+0.476
 Imran+0.460
sembly+0.460
Neg logit contributions
\">-0.599
 <+-0.590
agine-0.572
cin-0.567
andre-0.557
Token
Token ID198
Feature activation-6.58e-02

Pos logit contributions
istered+0.059
 incumbent+0.054
uscript+0.053
 Imran+0.052
 Warden+0.051
Neg logit contributions
\">-0.076
 <+-0.076
agine-0.074
cin-0.073
andre-0.072
Token
Token ID198
Feature activation+5.53e-02

Pos logit contributions
istered+0.036
 incumbent+0.032
uscript+0.032
 Imran+0.031
 Warden+0.030
Neg logit contributions
\">-0.062
 <+-0.062
agine-0.060
cin-0.060
andre-0.059
TokenIn
Token ID818
Feature activation+3.54e-01

Pos logit contributions
\">+0.038
 <++0.038
agine+0.036
cin+0.036
andre+0.035
Neg logit contributions
istered-0.045
 incumbent-0.041
uscript-0.041
 Imran-0.040
sembly-0.040
Token a
Token ID257
Feature activation+2.27e-01

Pos logit contributions
\">+0.282
 <++0.280
agine+0.270
cin+0.268
andre+0.264
Neg logit contributions
istered-0.247
 incumbent-0.226
uscript-0.225
sembly-0.219
 Imran-0.218
Token recent
Token ID2274
Feature activation-9.16e-01

Pos logit contributions
\">+0.178
 <++0.177
agine+0.171
cin+0.170
andre+0.167
Neg logit contributions
istered-0.162
 incumbent-0.148
uscript-0.147
sembly-0.143
 Imran-0.142
Token article
Token ID2708
Feature activation+5.78e-01

Pos logit contributions
istered+0.672
 incumbent+0.621
uscript+0.613
 Imran+0.597
sembly+0.592
Neg logit contributions
\">-0.698
 <+-0.692
agine-0.666
cin-0.665
andre-0.653
Token for
Token ID329
Feature activation-4.84e-01

Pos logit contributions
\">+0.434
 <++0.431
agine+0.415
cin+0.413
andre+0.406
Neg logit contributions
istered-0.430
 incumbent-0.396
uscript-0.394
sembly-0.383
 Imran-0.381
Token noticed
Token ID6810
Feature activation-2.74e-01

Pos logit contributions
istered+0.571
 incumbent+0.527
uscript+0.526
 Imran+0.510
sembly+0.500
Neg logit contributions
\">-0.583
 <+-0.576
agine-0.554
cin-0.553
andre-0.548
Token that
Token ID326
Feature activation-1.11e-01

Pos logit contributions
istered+0.186
 incumbent+0.172
uscript+0.169
 Imran+0.166
 Warden+0.163
Neg logit contributions
\">-0.222
 <+-0.219
agine-0.213
cin-0.212
andre-0.208
Token everyone
Token ID2506
Feature activation-2.44e-01

Pos logit contributions
istered+0.076
 incumbent+0.071
uscript+0.069
 Imran+0.068
 Warden+0.067
Neg logit contributions
\">-0.089
 <+-0.088
agine-0.085
cin-0.085
andre-0.083
Token picked
Token ID6497
Feature activation-9.59e-01

Pos logit contributions
istered+0.171
 incumbent+0.158
uscript+0.156
 Imran+0.152
sembly+0.150
Neg logit contributions
\">-0.194
 <+-0.192
agine-0.185
cin-0.185
andre-0.181
Token up
Token ID510
Feature activation-1.45e+00

Pos logit contributions
istered+0.622
 incumbent+0.571
uscript+0.561
 Imran+0.548
sembly+0.542
Neg logit contributions
\">-0.808
 <+-0.805
agine-0.778
cin-0.776
andre-0.759
Token on
Token ID319
Feature activation+3.36e-01

Pos logit contributions
istered+0.955
 incumbent+0.876
uscript+0.861
 Imran+0.846
sembly+0.831
Neg logit contributions
\">-1.201
 <+-1.195
cin-1.157
agine-1.156
andre-1.128
Token that
Token ID326
Feature activation-4.66e-01

Pos logit contributions
\">+0.280
 <++0.278
agine+0.268
cin+0.268
andre+0.262
Neg logit contributions
istered-0.221
 incumbent-0.204
uscript-0.200
 Imran-0.196
sembly-0.194
Token very
Token ID845
Feature activation-2.23e-01

Pos logit contributions
istered+0.322
 incumbent+0.303
uscript+0.297
 Imran+0.291
 Warden+0.288
Neg logit contributions
\">-0.370
 <+-0.362
agine-0.355
cin-0.352
andre-0.345
Token clear
Token ID1598
Feature activation+9.11e-01

Pos logit contributions
istered+0.156
 incumbent+0.147
uscript+0.143
 Imran+0.140
 Warden+0.139
Neg logit contributions
\">-0.176
 <+-0.173
agine-0.169
cin-0.167
andre-0.164
Token theme
Token ID7505
Feature activation+9.45e-01

Pos logit contributions
\">+0.681
 <++0.673
agine+0.653
cin+0.647
andre+0.635
Neg logit contributions
istered-0.678
 incumbent-0.631
uscript-0.622
 Imran-0.607
sembly-0.601
Token.
Token ID13
Feature activation+4.53e-01

Pos logit contributions
\">+0.727
 <++0.722
agine+0.695
cin+0.692
andre+0.680
Neg logit contributions
istered-0.686
 incumbent-0.629
uscript-0.626
sembly-0.609
 Imran-0.605
Token,
Token ID11
Feature activation-3.12e-01

Pos logit contributions
istered+0.380
 incumbent+0.351
uscript+0.347
 Imran+0.336
sembly+0.335
Neg logit contributions
\">-0.414
 <+-0.409
agine-0.395
cin-0.394
andre-0.387
Token and
Token ID290
Feature activation+2.51e-01

Pos logit contributions
istered+0.225
 incumbent+0.208
uscript+0.205
 Imran+0.199
sembly+0.199
Neg logit contributions
\">-0.242
 <+-0.240
agine-0.231
cin-0.231
andre-0.227
Token that
Token ID326
Feature activation+1.38e+00

Pos logit contributions
\">+0.200
 <++0.198
agine+0.191
cin+0.191
andre+0.188
Neg logit contributions
istered-0.175
 incumbent-0.162
uscript-0.158
 Imran-0.154
sembly-0.154
Token more
Token ID517
Feature activation-6.08e-01

Pos logit contributions
\">+1.102
 <++1.094
agine+1.055
cin+1.050
andre+1.035
Neg logit contributions
istered-0.965
 incumbent-0.884
uscript-0.875
sembly-0.850
 Imran-0.848
Token people
Token ID661
Feature activation+1.03e+00

Pos logit contributions
istered+0.419
 incumbent+0.385
uscript+0.381
sembly+0.369
 Imran+0.369
Neg logit contributions
\">-0.488
 <+-0.485
agine-0.469
cin-0.467
andre-0.459
Token tend
Token ID4327
Feature activation+8.25e-01

Pos logit contributions
\">+0.766
 <++0.761
agine+0.732
cin+0.729
andre+0.717
Neg logit contributions
istered-0.777
 incumbent-0.716
uscript-0.714
sembly-0.690
 Imran-0.689
Token to
Token ID284
Feature activation+6.41e-01

Pos logit contributions
\">+0.641
 <++0.637
agine+0.611
cin+0.611
andre+0.600
Neg logit contributions
istered-0.594
 incumbent-0.542
uscript-0.540
sembly-0.528
 Imran-0.524
Token do
Token ID466
Feature activation+7.10e-01

Pos logit contributions
\">+0.457
 <++0.453
agine+0.435
cin+0.434
andre+0.427
Neg logit contributions
istered-0.501
 incumbent-0.464
uscript-0.461
 Imran-0.447
sembly-0.446
Token more
Token ID517
Feature activation+1.62e-01

Pos logit contributions
\">+0.573
 <++0.569
agine+0.550
cin+0.547
andre+0.538
Neg logit contributions
istered-0.488
 incumbent-0.447
uscript-0.444
sembly-0.429
 Imran-0.429
Token things
Token ID1243
Feature activation+1.28e+00

Pos logit contributions
\">+0.134
 <++0.133
agine+0.129
cin+0.128
andre+0.126
Neg logit contributions
istered-0.109
 incumbent-0.099
uscript-0.098
sembly-0.095
 Imran-0.095
Token.
Token ID13
Feature activation-2.73e-01

Pos logit contributions
\">+0.964
 <++0.957
agine+0.920
cin+0.916
andre+0.900
Neg logit contributions
istered-0.949
 incumbent-0.870
uscript-0.867
sembly-0.846
 Imran-0.839
Token is
Token ID318
Feature activation+5.26e-01

Pos logit contributions
\">+0.388
 <++0.384
agine+0.370
cin+0.369
andre+0.363
Neg logit contributions
istered-0.362
 incumbent-0.334
uscript-0.329
 Imran-0.320
sembly-0.318
Token why
Token ID1521
Feature activation+1.50e+00

Pos logit contributions
\">+0.409
 <++0.406
agine+0.390
cin+0.390
andre+0.383
Neg logit contributions
istered-0.377
 incumbent-0.349
uscript-0.343
 Imran-0.335
sembly-0.333
Token the
Token ID262
Feature activation+1.02e+00

Pos logit contributions
\">+1.222
 <++1.215
agine+1.172
cin+1.166
andre+1.148
Neg logit contributions
istered-1.021
uscript-0.926
 incumbent-0.925
sembly-0.900
 Imran-0.889
Token trade
Token ID3292
Feature activation+8.05e-01

Pos logit contributions
\">+0.853
 <++0.847
agine+0.819
cin+0.816
andre+0.803
Neg logit contributions
istered-0.667
 incumbent-0.610
uscript-0.602
sembly-0.583
 Imran-0.582
Token school
Token ID1524
Feature activation+1.10e+00

Pos logit contributions
\">+0.691
 <++0.687
agine+0.663
cin+0.661
andre+0.649
Neg logit contributions
istered-0.512
uscript-0.462
 incumbent-0.461
sembly-0.451
 Imran-0.444
Token has
Token ID468
Feature activation+1.11e+00

Pos logit contributions
\">+0.879
 <++0.872
agine+0.842
cin+0.839
andre+0.825
Neg logit contributions
istered-0.761
 incumbent-0.697
uscript-0.691
sembly-0.670
 Imran-0.669
Token such
Token ID884
Feature activation+3.14e+00

Pos logit contributions
\">+0.899
 <++0.893
agine+0.862
cin+0.858
andre+0.845
Neg logit contributions
istered-0.758
 incumbent-0.691
uscript-0.687
sembly-0.667
 Imran-0.663
Token a
Token ID257
Feature activation+1.18e+00

Pos logit contributions
\">+2.578
 <++2.546
agine+2.452
cin+2.447
andre+2.401
Neg logit contributions
istered-2.091
uscript-1.919
sembly-1.877
 incumbent-1.870
 Imran-1.819
Token high
Token ID1029
Feature activation-9.18e-02

Pos logit contributions
\">+0.993
 <++0.987
agine+0.952
cin+0.948
andre+0.934
Neg logit contributions
istered-0.769
 incumbent-0.696
uscript-0.695
sembly-0.674
 Imran-0.668
Token job
Token ID1693
Feature activation-1.15e-01

Pos logit contributions
istered+0.065
 incumbent+0.060
uscript+0.059
sembly+0.057
 Imran+0.057
Neg logit contributions
\">-0.072
 <+-0.072
agine-0.069
cin-0.069
andre-0.068
Token placement
Token ID13127
Feature activation+1.22e-01

Pos logit contributions
istered+0.086
 incumbent+0.080
uscript+0.079
sembly+0.077
 Imran+0.077
Neg logit contributions
\">-0.086
 <+-0.085
agine-0.082
cin-0.082
andre-0.081
Token were
Token ID547
Feature activation+7.25e-01

Pos logit contributions
istered+0.697
 incumbent+0.647
uscript+0.635
 Imran+0.615
sembly+0.610
Neg logit contributions
\">-0.759
 <+-0.751
agine-0.728
cin-0.724
andre-0.709
Token playing
Token ID2712
Feature activation+6.52e-01

Pos logit contributions
\">+0.585
 <++0.580
agine+0.560
cin+0.557
andre+0.550
Neg logit contributions
istered-0.500
 incumbent-0.459
uscript-0.454
 Imran-0.439
sembly-0.439
Token the
Token ID262
Feature activation+3.74e-01

Pos logit contributions
\">+0.505
 <++0.502
agine+0.485
cin+0.482
andre+0.473
Neg logit contributions
istered-0.470
 incumbent-0.436
uscript-0.426
 Imran-0.418
sembly-0.416
Token Jones
Token ID5437
Feature activation+5.89e-01

Pos logit contributions
\">+0.236
 <++0.234
agine+0.224
cin+0.223
andre+0.218
Neg logit contributions
istered-0.323
 incumbent-0.303
uscript-0.298
 Imran-0.292
sembly-0.291
Token-
Token ID12
Feature activation-5.69e-01

Pos logit contributions
\">+0.469
 <++0.467
agine+0.450
cin+0.446
andre+0.438
Neg logit contributions
istered-0.410
 incumbent-0.378
uscript-0.374
sembly-0.363
 Imran-0.362
Tokenco
Token ID1073
Feature activation-8.59e-01

Pos logit contributions
istered+0.436
 incumbent+0.404
uscript+0.398
sembly+0.387
 Imran+0.387
Neg logit contributions
\">-0.412
 <+-0.410
agine-0.395
cin-0.389
andre-0.381
Tokenached
Token ID2317
Feature activation+1.95e+00

Pos logit contributions
istered+0.532
 incumbent+0.479
uscript+0.474
sembly+0.459
 Imran+0.457
Neg logit contributions
\">-0.754
 <+-0.750
agine-0.726
cin-0.722
andre-0.710
Token Celtics
Token ID24116
Feature activation+5.86e-01

Pos logit contributions
\">+1.285
 <++1.275
agine+1.218
cin+1.212
andre+1.189
Neg logit contributions
istered-1.629
 incumbent-1.509
uscript-1.505
sembly-1.472
 Imran-1.461
Token.
Token ID13
Feature activation-3.16e-01

Pos logit contributions
\">+0.458
 <++0.455
agine+0.440
cin+0.435
andre+0.429
Neg logit contributions
istered-0.417
 incumbent-0.386
uscript-0.380
sembly-0.368
 Imran-0.368
Token
Token ID198
Feature activation+3.03e-01

Pos logit contributions
istered+0.196
 incumbent+0.177
uscript+0.175
 Imran+0.170
sembly+0.168
Neg logit contributions
\">-0.277
 <+-0.275
agine-0.268
cin-0.266
andre-0.262
Token
Token ID198
Feature activation+2.80e-01

Pos logit contributions
\">+0.282
 <++0.281
agine+0.273
cin+0.272
andre+0.268
Neg logit contributions
istered-0.169
 incumbent-0.151
uscript-0.149
 Imran-0.145
sembly-0.141
Token The
Token ID383
Feature activation-1.02e+00

Pos logit contributions
\">+0.310
 <++0.308
agine+0.297
cin+0.296
andre+0.291
Neg logit contributions
istered-0.284
 incumbent-0.261
uscript-0.258
 Imran-0.253
sembly-0.250
Token Donald
Token ID3759
Feature activation+5.96e-01

Pos logit contributions
istered+0.731
 incumbent+0.681
uscript+0.664
 Imran+0.652
sembly+0.640
Neg logit contributions
\">-0.797
 <+-0.793
agine-0.763
cin-0.760
andre-0.745
Token,
Token ID11
Feature activation+4.27e-01

Pos logit contributions
\">+0.459
 <++0.456
agine+0.439
cin+0.437
andre+0.430
Neg logit contributions
istered-0.432
 incumbent-0.397
uscript-0.395
sembly-0.384
 Imran-0.382
Token bringing
Token ID6079
Feature activation+2.75e-01

Pos logit contributions
\">+0.333
 <++0.330
agine+0.318
cin+0.317
andre+0.312
Neg logit contributions
istered-0.306
 incumbent-0.283
uscript-0.277
 Imran-0.273
sembly-0.269
Token people
Token ID661
Feature activation-1.43e+00

Pos logit contributions
\">+0.223
 <++0.222
agine+0.214
cin+0.213
andre+0.210
Neg logit contributions
istered-0.189
 incumbent-0.172
uscript-0.171
sembly-0.166
 Imran-0.165
Token together
Token ID1978
Feature activation-2.60e-01

Pos logit contributions
istered+0.981
 incumbent+0.906
uscript+0.887
 Imran+0.871
sembly+0.853
Neg logit contributions
\">-1.153
 <+-1.146
cin-1.107
agine-1.099
andre-1.088
Token
Token ID198
Feature activation+2.97e-01

Pos logit contributions
istered+0.170
 incumbent+0.155
uscript+0.154
 Imran+0.149
sembly+0.148
Neg logit contributions
\">-0.219
 <+-0.218
agine-0.210
cin-0.210
andre-0.206
Token
Token ID198
Feature activation+2.45e-01

Pos logit contributions
\">+0.277
 <++0.275
agine+0.267
cin+0.266
andre+0.262
Neg logit contributions
istered-0.167
 incumbent-0.150
uscript-0.148
 Imran-0.143
sembly-0.140
Token*
Token ID9
Feature activation+6.40e-01

Pos logit contributions
\">+0.144
 <++0.143
agine+0.136
cin+0.135
andre+0.132
Neg logit contributions
istered-0.222
 incumbent-0.207
uscript-0.206
sembly-0.202
 Imran-0.201
Token Clinton
Token ID2605
Feature activation-1.07e+00

Pos logit contributions
\">+0.500
 <++0.496
agine+0.479
cin+0.476
andre+0.468
Neg logit contributions
istered-0.457
 incumbent-0.420
uscript-0.416
 Imran-0.406
sembly-0.403
Token pledges
Token ID31241
Feature activation-1.16e+00

Pos logit contributions
istered+0.756
 incumbent+0.706
uscript+0.693
 Imran+0.675
 Warden+0.661
Neg logit contributions
\">-0.836
 <+-0.829
agine-0.794
cin-0.791
andre-0.780
Token�
Token ID246
Feature activation-7.72e-01

Pos logit contributions
istered+0.158
 incumbent+0.143
uscript+0.142
sembly+0.137
 Imran+0.136
Neg logit contributions
\">-0.220
 <+-0.219
agine-0.212
cin-0.211
andre-0.208
Tokenlight
Token ID2971
Feature activation-3.84e-01

Pos logit contributions
istered+0.589
 incumbent+0.542
uscript+0.540
sembly+0.524
 Imran+0.520
Neg logit contributions
\">-0.565
 <+-0.561
agine-0.538
cin-0.533
andre-0.523
Token�
Token ID447
Feature activation-1.15e+00

Pos logit contributions
istered+0.321
 incumbent+0.298
uscript+0.297
sembly+0.290
 Imran+0.288
Neg logit contributions
\">-0.253
 <+-0.251
agine-0.241
cin-0.238
andre-0.234
Token�
Token ID247
Feature activation-1.05e+00

Pos logit contributions
istered+0.607
 incumbent+0.542
uscript+0.531
sembly+0.521
 Imran+0.509
Neg logit contributions
\">-1.125
 <+-1.105
cin-1.080
agine-1.077
andre-1.059
Token).
Token ID737
Feature activation+4.25e-01

Pos logit contributions
istered+0.780
 incumbent+0.722
uscript+0.716
sembly+0.696
 Imran+0.695
Neg logit contributions
\">-0.785
 <+-0.776
agine-0.747
cin-0.742
andre-0.731
Token Good
Token ID4599
Feature activation-1.22e+00

Pos logit contributions
\">+0.345
 <++0.341
agine+0.330
cin+0.329
andre+0.324
Neg logit contributions
istered-0.291
 incumbent-0.267
uscript-0.264
 Imran-0.256
sembly-0.256
Token SET
Token ID25823
Feature activation+1.73e+00

Pos logit contributions
istered+0.842
 incumbent+0.777
uscript+0.771
 Imran+0.745
sembly+0.742
Neg logit contributions
\">-0.978
 <+-0.965
agine-0.937
cin-0.928
andre-0.914
Token ON
Token ID6177
Feature activation+8.00e-01

Pos logit contributions
\">+1.278
 <++1.269
agine+1.219
cin+1.214
andre+1.192
Neg logit contributions
istered-1.307
 incumbent-1.201
uscript-1.196
sembly-1.167
 Imran-1.158
Token FIRE
Token ID39110
Feature activation-8.71e-01

Pos logit contributions
\">+0.644
 <++0.639
agine+0.616
cin+0.614
andre+0.604
Neg logit contributions
istered-0.552
 incumbent-0.505
uscript-0.501
sembly-0.487
 Imran-0.484
Token
Token ID198
Feature activation-1.19e-01

Pos logit contributions
istered+0.643
 incumbent+0.596
uscript+0.593
 Imran+0.574
sembly+0.574
Neg logit contributions
\">-0.653
 <+-0.646
agine-0.625
cin-0.621
andre-0.613
Token
Token ID198
Feature activation+3.16e-01

Pos logit contributions
istered+0.065
 incumbent+0.058
uscript+0.057
 Imran+0.056
 Warden+0.054
Neg logit contributions
\">-0.112
 <+-0.112
agine-0.109
cin-0.108
andre-0.107
Token can
Token ID460
Feature activation-3.36e-01

Pos logit contributions
istered+0.717
 incumbent+0.649
uscript+0.638
 Imran+0.622
 Warden+0.600
Neg logit contributions
\">-1.118
 <+-1.116
cin-1.075
agine-1.073
andre-1.057
Token prepare
Token ID8335
Feature activation-7.23e-01

Pos logit contributions
istered+0.251
 incumbent+0.232
uscript+0.229
 Imran+0.223
sembly+0.221
Neg logit contributions
\">-0.251
 <+-0.248
cin-0.239
agine-0.239
andre-0.235
Token for
Token ID329
Feature activation+1.48e+00

Pos logit contributions
istered+0.503
 incumbent+0.471
uscript+0.462
 Imran+0.454
sembly+0.446
Neg logit contributions
\">-0.571
 <+-0.565
cin-0.547
agine-0.546
andre-0.531
Token their
Token ID511
Feature activation-2.04e-01

Pos logit contributions
\">+1.248
 <++1.239
agine+1.196
cin+1.193
andre+1.173
Neg logit contributions
istered-0.968
 incumbent-0.883
uscript-0.873
sembly-0.847
 Imran-0.847
Token client
Token ID5456
Feature activation+1.37e+00

Pos logit contributions
istered+0.149
 incumbent+0.139
uscript+0.136
 Imran+0.134
sembly+0.132
Neg logit contributions
\">-0.155
 <+-0.154
agine-0.149
cin-0.148
andre-0.145
Token�
Token ID447
Feature activation-1.53e-02

Pos logit contributions
\">+1.037
 <++1.030
agine+0.992
cin+0.988
andre+0.971
Neg logit contributions
istered-1.004
 incumbent-0.920
uscript-0.917
sembly-0.895
 Imran-0.885
Token�
Token ID247
Feature activation-2.28e-01

Pos logit contributions
istered+0.008
 incumbent+0.007
uscript+0.007
 Imran+0.007
sembly+0.007
Neg logit contributions
\">-0.015
 <+-0.015
agine-0.014
cin-0.014
andre-0.014
Tokens
Token ID82
Feature activation-7.37e-01

Pos logit contributions
istered+0.175
 incumbent+0.162
uscript+0.161
sembly+0.156
 Imran+0.156
Neg logit contributions
\">-0.167
 <+-0.166
agine-0.159
cin-0.157
andre-0.155
Token upcoming
Token ID7865
Feature activation-6.75e-02

Pos logit contributions
istered+0.514
 incumbent+0.480
uscript+0.466
 Imran+0.461
 Warden+0.457
Neg logit contributions
\">-0.584
 <+-0.575
agine-0.558
cin-0.555
andre-0.543
Token trial
Token ID4473
Feature activation+1.09e+00

Pos logit contributions
istered+0.042
 incumbent+0.039
uscript+0.038
 Imran+0.037
 Warden+0.037
Neg logit contributions
\">-0.058
 <+-0.058
agine-0.056
cin-0.056
andre-0.055
Token which
Token ID543
Feature activation+1.41e-01

Pos logit contributions
\">+0.844
 <++0.837
agine+0.806
cin+0.802
andre+0.788
Neg logit contributions
istered-0.793
 incumbent-0.730
uscript-0.724
sembly-0.705
 Imran-0.701
Token actions
Token ID4028
Feature activation+6.43e-01

Pos logit contributions
istered+0.394
 incumbent+0.365
uscript+0.357
 Imran+0.351
 Warden+0.344
Neg logit contributions
\">-0.417
 <+-0.411
agine-0.400
cin-0.395
andre-0.391
Token merely
Token ID6974
Feature activation+4.60e-01

Pos logit contributions
\">+0.498
 <++0.493
agine+0.476
cin+0.472
andre+0.464
Neg logit contributions
istered-0.466
 incumbent-0.429
uscript-0.424
sembly-0.411
 Imran-0.411
Token served
Token ID4983
Feature activation-1.36e-01

Pos logit contributions
\">+0.357
 <++0.354
agine+0.342
cin+0.339
andre+0.333
Neg logit contributions
istered-0.334
 incumbent-0.307
uscript-0.302
 Imran-0.294
sembly-0.293
Token to
Token ID284
Feature activation+2.07e-01

Pos logit contributions
istered+0.097
 incumbent+0.089
uscript+0.088
sembly+0.086
 Imran+0.086
Neg logit contributions
\">-0.107
 <+-0.106
agine-0.102
cin-0.102
andre-0.100
Token hard
Token ID1327
Feature activation-2.27e-01

Pos logit contributions
\">+0.153
 <++0.151
agine+0.146
cin+0.145
andre+0.143
Neg logit contributions
istered-0.158
 incumbent-0.146
uscript-0.144
 Imran-0.140
sembly-0.140
Tokenen
Token ID268
Feature activation-1.41e-01

Pos logit contributions
istered+0.126
 incumbent+0.113
uscript+0.111
sembly+0.107
 Imran+0.107
Neg logit contributions
\">-0.213
 <+-0.212
agine-0.205
cin-0.204
andre-0.202
Token the
Token ID262
Feature activation-2.98e-01

Pos logit contributions
istered+0.095
 incumbent+0.086
uscript+0.086
sembly+0.084
 Imran+0.083
Neg logit contributions
\">-0.117
 <+-0.116
agine-0.112
cin-0.111
andre-0.110
Token young
Token ID1862
Feature activation+4.15e-01

Pos logit contributions
istered+0.203
 incumbent+0.186
uscript+0.184
sembly+0.179
 Imran+0.178
Neg logit contributions
\">-0.243
 <+-0.241
agine-0.233
cin-0.232
andre-0.228
Token pun
Token ID4000
Feature activation-1.46e+00

Pos logit contributions
\">+0.338
 <++0.336
agine+0.324
cin+0.322
andre+0.318
Neg logit contributions
istered-0.283
 incumbent-0.257
uscript-0.257
sembly-0.251
 Imran-0.247
Tokenks
Token ID591
Feature activation+1.72e-01

Pos logit contributions
istered+0.432
 incumbent+0.334
uscript+0.334
sembly+0.302
 Imran+0.296
Neg logit contributions
\">-1.760
 <+-1.750
agine-1.707
cin-1.691
andre-1.680
Token'
Token ID6
Feature activation-1.02e+00

Pos logit contributions
\">+0.131
 <++0.130
agine+0.125
cin+0.124
andre+0.122
Neg logit contributions
istered-0.127
 incumbent-0.117
uscript-0.116
sembly-0.113
 Imran-0.112
Token
Token ID198
Feature activation-4.94e-01

Pos logit contributions
istered+0.238
 incumbent+0.215
uscript+0.213
sembly+0.207
 Imran+0.206
Neg logit contributions
\">-0.325
 <+-0.323
agine-0.312
cin-0.311
andre-0.307
Token
Token ID198
Feature activation-1.89e-01

Pos logit contributions
istered+0.268
 incumbent+0.240
uscript+0.235
 Imran+0.230
 Warden+0.223
Neg logit contributions
\">-0.466
 <+-0.463
agine-0.453
cin-0.449
andre-0.443
TokenWhen
Token ID2215
Feature activation-1.15e+00

Pos logit contributions
istered+0.159
 incumbent+0.147
uscript+0.146
 Imran+0.143
sembly+0.142
Neg logit contributions
\">-0.123
 <+-0.122
agine-0.117
cin-0.116
andre-0.113
Token I
Token ID314
Feature activation+2.11e+00

Pos logit contributions
istered+0.794
 incumbent+0.726
uscript+0.720
sembly+0.701
 Imran+0.697
Neg logit contributions
\">-0.928
 <+-0.922
agine-0.889
cin-0.886
andre-0.871
Token joined
Token ID5399
Feature activation+1.04e+00

Pos logit contributions
\">+1.665
 <++1.657
agine+1.591
cin+1.584
andre+1.558
Neg logit contributions
istered-1.491
 incumbent-1.362
uscript-1.361
sembly-1.329
 Imran-1.305
Token Netflix
Token ID12074
Feature activation-3.21e+00

Pos logit contributions
\">+0.904
 <++0.902
agine+0.869
cin+0.866
andre+0.855
Neg logit contributions
istered-0.647
uscript-0.587
 incumbent-0.581
sembly-0.566
 Imran-0.557
Token in
Token ID287
Feature activation-5.54e-02

Pos logit contributions
istered+2.235
 incumbent+2.051
uscript+2.006
 Imran+1.986
 Warden+1.945
Neg logit contributions
\">-2.531
 <+-2.510
agine-2.414
cin-2.405
andre-2.364
Token 2014
Token ID1946
Feature activation+6.32e-01

Pos logit contributions
istered+0.041
 incumbent+0.037
uscript+0.037
sembly+0.036
 Imran+0.036
Neg logit contributions
\">-0.042
 <+-0.042
agine-0.040
cin-0.040
andre-0.040
Token,
Token ID11
Feature activation-9.01e-01

Pos logit contributions
\">+0.479
 <++0.477
agine+0.457
cin+0.456
andre+0.449
Neg logit contributions
istered-0.465
 incumbent-0.426
uscript-0.426
sembly-0.415
 Imran-0.410
Token we
Token ID356
Feature activation-8.49e-01

Pos logit contributions
istered+0.628
 incumbent+0.574
uscript+0.570
sembly+0.554
 Imran+0.551
Neg logit contributions
\">-0.720
 <+-0.715
agine-0.689
cin-0.687
andre-0.675
Token had
Token ID550
Feature activation-1.97e+00

Pos logit contributions
istered+0.551
 incumbent+0.501
uscript+0.497
sembly+0.481
 Imran+0.478
Neg logit contributions
\">-0.718
 <+-0.713
agine-0.689
cin-0.687
andre-0.677
Token again
Token ID757
Feature activation-2.01e+00

Pos logit contributions
istered+0.773
 incumbent+0.714
uscript+0.706
sembly+0.686
 Imran+0.684
Neg logit contributions
\">-0.848
 <+-0.839
agine-0.815
cin-0.808
andre-0.789
Token to
Token ID284
Feature activation-8.82e-01

Pos logit contributions
istered+1.396
 incumbent+1.298
uscript+1.282
 Imran+1.244
sembly+1.236
Neg logit contributions
\">-1.598
 <+-1.576
agine-1.544
cin-1.519
andre-1.474
Token all
Token ID477
Feature activation-7.56e-01

Pos logit contributions
istered+0.575
 incumbent+0.525
uscript+0.519
 Imran+0.503
sembly+0.503
Neg logit contributions
\">-0.743
 <+-0.738
agine-0.715
cin-0.711
andre-0.698
Token of
Token ID286
Feature activation-1.69e+00

Pos logit contributions
istered+0.562
 incumbent+0.520
uscript+0.515
 Imran+0.499
sembly+0.498
Neg logit contributions
\">-0.569
 <+-0.564
agine-0.544
cin-0.540
andre-0.530
Token you
Token ID345
Feature activation-2.44e+00

Pos logit contributions
istered+1.203
 incumbent+1.105
uscript+1.097
 Imran+1.065
sembly+1.059
Neg logit contributions
\">-1.332
 <+-1.322
agine-1.280
cin-1.270
andre-1.238
Token for
Token ID329
Feature activation-4.06e+00

Pos logit contributions
istered+1.918
 incumbent+1.813
uscript+1.791
 Imran+1.742
 Warden+1.714
Neg logit contributions
\">-1.713
 <+-1.675
agine-1.625
cin-1.620
andre-1.570
Token your
Token ID534
Feature activation-2.19e+00

Pos logit contributions
istered+3.003
 incumbent+2.800
uscript+2.760
 Imran+2.689
 Warden+2.663
Neg logit contributions
\">-3.040
 <+-2.981
cin-2.893
agine-2.872
bleacher-2.817
Token interest
Token ID1393
Feature activation-2.00e+00

Pos logit contributions
istered+1.375
 incumbent+1.252
uscript+1.229
 Imran+1.188
sembly+1.184
Neg logit contributions
\">-1.901
 <+-1.880
agine-1.819
cin-1.813
andre-1.783
Token and
Token ID290
Feature activation-3.86e-01

Pos logit contributions
istered+1.474
 incumbent+1.357
uscript+1.337
 Imran+1.302
sembly+1.297
Neg logit contributions
\">-1.517
 <+-1.503
agine-1.459
cin-1.442
andre-1.410
Token assistance
Token ID6829
Feature activation-3.28e-01

Pos logit contributions
istered+0.251
 incumbent+0.228
uscript+0.225
sembly+0.218
 Imran+0.218
Neg logit contributions
\">-0.328
 <+-0.325
agine-0.314
cin-0.313
andre-0.308
Token.
Token ID13
Feature activation-1.21e+00

Pos logit contributions
istered+0.244
 incumbent+0.225
uscript+0.223
sembly+0.217
 Imran+0.216
Neg logit contributions
\">-0.247
 <+-0.245
agine-0.236
cin-0.235
andre-0.230
Tokenavier
Token ID19492
Feature activation-3.48e-01

Pos logit contributions
istered+1.033
uscript+0.955
 incumbent+0.949
 Imran+0.915
sembly+0.914
Neg logit contributions
\">-1.040
 <+-1.026
agine-0.984
cin-0.971
bleacher-0.949
Token Dum
Token ID30933
Feature activation+7.35e-01

Pos logit contributions
istered+0.216
 incumbent+0.195
uscript+0.194
sembly+0.188
 Imran+0.186
Neg logit contributions
\">-0.304
 <+-0.303
agine-0.293
cin-0.292
andre-0.288
Tokenus
Token ID385
Feature activation-4.73e-01

Pos logit contributions
\">+0.490
 <++0.488
agine+0.463
cin+0.461
andre+0.452
Neg logit contributions
istered-0.609
uscript-0.563
 incumbent-0.562
 Imran-0.545
sembly-0.543
Tokenque
Token ID4188
Feature activation-3.07e-01

Pos logit contributions
istered+0.169
 incumbent+0.140
uscript+0.138
sembly+0.130
 Imran+0.128
Neg logit contributions
\">-0.538
 <+-0.536
agine-0.522
cin-0.520
andre-0.515
Token et
Token ID2123
Feature activation-2.92e+00

Pos logit contributions
istered+0.219
 incumbent+0.200
uscript+0.199
sembly+0.195
 Imran+0.192
Neg logit contributions
\">-0.240
 <+-0.239
agine-0.229
cin-0.229
andre-0.226
Token al
Token ID435
Feature activation-5.43e+00

Pos logit contributions
istered+1.706
 incumbent+1.540
uscript+1.518
 Imran+1.484
 Warden+1.421
Neg logit contributions
\">-2.657
 <+-2.628
agine-2.552
cin-2.536
bleacher-2.482
Token.
Token ID13
Feature activation-2.66e+00

Pos logit contributions
istered+3.631
uscript+3.343
 incumbent+3.323
 Imran+3.171
sembly+3.116
Neg logit contributions
\">-4.401
 <+-4.326
agine-4.118
cin-4.117
bleacher-4.071
Token 2014
Token ID1946
Feature activation-8.43e-01

Pos logit contributions
istered+1.695
 incumbent+1.558
uscript+1.535
 Imran+1.504
 Warden+1.485
Neg logit contributions
\">-2.243
 <+-2.232
agine-2.163
cin-2.150
andre-2.122
Token.
Token ID13
Feature activation-1.62e+00

Pos logit contributions
istered+0.603
 incumbent+0.552
uscript+0.549
sembly+0.534
 Imran+0.531
Neg logit contributions
\">-0.658
 <+-0.654
agine-0.630
cin-0.627
andre-0.617
Token The
Token ID383
Feature activation-1.45e+00

Pos logit contributions
istered+0.944
 incumbent+0.855
uscript+0.842
 Imran+0.823
 Warden+0.813
Neg logit contributions
\">-1.451
 <+-1.446
agine-1.407
cin-1.400
andre-1.381
Token Kepler
Token ID40294
Feature activation+1.43e+00

Pos logit contributions
istered+0.957
 incumbent+0.878
uscript+0.861
 Imran+0.835
sembly+0.833
Neg logit contributions
 <+-1.202
\">-1.202
agine-1.157
cin-1.150
andre-1.135
Token,
Token ID11
Feature activation-1.43e+00

Pos logit contributions
\">+0.192
 <++0.189
agine+0.184
cin+0.183
andre+0.180
Neg logit contributions
istered-0.173
 incumbent-0.159
uscript-0.157
 Imran-0.153
sembly-0.151
Token then
Token ID788
Feature activation+5.00e-01

Pos logit contributions
istered+0.816
 incumbent+0.738
uscript+0.720
 Imran+0.699
 Warden+0.682
Neg logit contributions
\">-1.319
 <+-1.299
agine-1.277
cin-1.268
andre-1.253
Token apply
Token ID4174
Feature activation-2.49e-01

Pos logit contributions
\">+0.313
 <++0.310
agine+0.297
cin+0.295
andre+0.289
Neg logit contributions
istered-0.435
 incumbent-0.406
uscript-0.402
 Imran-0.392
sembly-0.391
Token the
Token ID262
Feature activation-8.28e-01

Pos logit contributions
istered+0.156
 incumbent+0.141
uscript+0.140
 Imran+0.135
sembly+0.134
Neg logit contributions
\">-0.216
 <+-0.215
agine-0.208
cin-0.207
andre-0.204
Token black
Token ID2042
Feature activation-1.35e+00

Pos logit contributions
istered+0.579
 incumbent+0.540
uscript+0.521
 Imran+0.512
 Warden+0.504
Neg logit contributions
\">-0.656
 <+-0.643
agine-0.631
cin-0.628
andre-0.613
Tokenhat
Token ID5183
Feature activation-3.05e+00

Pos logit contributions
istered+1.180
 incumbent+1.106
uscript+1.104
 Imran+1.071
sembly+1.055
Neg logit contributions
\">-0.813
 <+-0.783
agine-0.761
cin-0.752
andre-0.747
Token #
Token ID1303
Feature activation-4.34e-01

Pos logit contributions
istered+1.996
 incumbent+1.838
uscript+1.805
sembly+1.747
 Imran+1.740
Neg logit contributions
\">-2.562
 <+-2.505
agine-2.448
cin-2.434
andre-2.399
Token morph
Token ID17488
Feature activation+1.76e-01

Pos logit contributions
istered+0.191
 incumbent+0.167
uscript+0.164
 Imran+0.155
sembly+0.154
Neg logit contributions
\">-0.456
 <+-0.453
agine-0.443
cin-0.441
andre-0.436
Tokenological
Token ID2770
Feature activation-1.33e-01

Pos logit contributions
\">+0.151
 <++0.150
agine+0.145
cin+0.144
andre+0.142
Neg logit contributions
istered-0.113
 incumbent-0.102
uscript-0.102
sembly-0.097
 Imran-0.097
Token operator
Token ID10088
Feature activation-4.52e-01

Pos logit contributions
istered+0.112
 incumbent+0.106
uscript+0.103
 Imran+0.101
 Warden+0.100
Neg logit contributions
\">-0.086
 <+-0.085
agine-0.082
cin-0.081
andre-0.080
Token to
Token ID284
Feature activation+7.15e-01

Pos logit contributions
istered+0.305
 incumbent+0.279
uscript+0.275
 Imran+0.267
 Warden+0.262
Neg logit contributions
\">-0.369
 <+-0.365
agine-0.356
cin-0.353
andre-0.348
Token Stalin
Token ID17482
Feature activation-1.51e+00

Pos logit contributions
\">+0.199
 <++0.199
agine+0.191
cin+0.190
andre+0.187
Neg logit contributions
istered-0.167
uscript-0.151
 incumbent-0.151
sembly-0.147
 Imran-0.146
Tokenism
Token ID1042
Feature activation-1.68e+00

Pos logit contributions
istered+1.148
 incumbent+1.055
uscript+1.053
 Imran+1.012
sembly+1.011
Neg logit contributions
\">-1.108
 <+-1.101
agine-1.059
cin-1.043
andre-1.034
Token;
Token ID26
Feature activation-2.58e-01

Pos logit contributions
istered+1.173
 incumbent+1.083
uscript+1.066
sembly+1.034
 Imran+1.031
Neg logit contributions
\">-1.336
 <+-1.321
agine-1.278
cin-1.266
andre-1.247
Token an
Token ID281
Feature activation+2.68e-01

Pos logit contributions
istered+0.181
 incumbent+0.166
uscript+0.165
sembly+0.160
 Imran+0.159
Neg logit contributions
\">-0.205
 <+-0.203
agine-0.196
cin-0.195
andre-0.192
Token
Token ID220
Feature activation+1.92e-01

Pos logit contributions
\">+0.225
 <++0.223
agine+0.215
cin+0.215
andre+0.211
Neg logit contributions
istered-0.176
 incumbent-0.160
uscript-0.159
sembly-0.154
 Imran-0.153
Tokeners
Token ID364
Feature activation-3.15e+00

Pos logit contributions
\">+0.161
 <++0.160
agine+0.155
cin+0.154
andre+0.152
Neg logit contributions
istered-0.126
 incumbent-0.114
uscript-0.113
sembly-0.110
 Imran-0.109
Tokenatz
Token ID27906
Feature activation+7.66e-01

Pos logit contributions
istered+2.119
uscript+1.890
 incumbent+1.863
sembly+1.821
 Warden+1.769
Neg logit contributions
\">-2.635
 <+-2.609
cin-2.495
agine-2.485
bleacher-2.453
Token "
Token ID366
Feature activation-1.10e-01

Pos logit contributions
\">+0.609
 <++0.605
agine+0.583
cin+0.580
andre+0.571
Neg logit contributions
istered-0.537
 incumbent-0.490
uscript-0.488
sembly-0.475
 Imran-0.471
Tokensocial
Token ID14557
Feature activation+3.13e-02

Pos logit contributions
istered+0.083
 incumbent+0.076
uscript+0.076
sembly+0.074
 Imran+0.073
Neg logit contributions
\">-0.082
 <+-0.081
agine-0.078
cin-0.077
andre-0.076
Tokenism
Token ID1042
Feature activation-7.62e-01

Pos logit contributions
\">+0.025
 <++0.024
agine+0.023
cin+0.023
andre+0.023
Neg logit contributions
istered-0.022
 incumbent-0.020
uscript-0.020
sembly-0.020
 Imran-0.020
Token"
Token ID1
Feature activation+5.14e-01

Pos logit contributions
istered+0.586
 incumbent+0.544
uscript+0.539
sembly+0.520
 Imran+0.519
Neg logit contributions
\">-0.552
 <+-0.548
agine-0.529
cin-0.524
andre-0.517