TOP ACTIVATIONS
MAX = 1.472

s constitution , and Kim Jong - il elected as chairman
UNESCO ( the United Nations Educational , Scientific and Cultural Organisation
or dis comb ob ulated ;
they ve been so successful at Bro d ard
( 0 s )} c xd 4 { ( 341 s
down as it is dem e aning for women , but
IES M OST READ <|endoftext|> K illed for watching
# 2 , but trust me , your plans are going
over rated hacks like Van Go gh , Ing mar Berg
cancer and one of the hardest things I have dealt with
ceremony my life took a turn for the worse . My
other possibility that comes to mind is that having a video
an advocacy group ," Mc Qu ill an told Polit i
EP uses the stream cipher RC 4 for confidentiality ,[ 9
volatility is a result of supply - and - demand fundamentals
D IV IDE AND CON QU ER was about introducing the
Bird sey e view of Brigham City , Utah Date :
It would be dis pir iting if the book came
but unless you 've been there and you 've worked those
obtained nude photos of Jennifer Lawrence and other celebrities and posted

MIN ACTIVATIONS
MIN = -1.768

activist had received th rea ... From : A
ain told a Phoenix ra ­ d io st a ­
. REUTERS / Cl od agh Kil c oy ne
10 . 13 71 / journal . p one . 00
. All of those el ... It looks increasingly
in the current bill that enshr ined that protection .
takeover of V enez ue ... From Vox :
ale br achy ot is vict oria e .
There are knit - a ... <|endoftext|> Buy Photo F SU
This screen also includes a native Bl ur Reduction mode to
to make other smartphone owners jealous . What s
s anti - Pet ro car ibe position and said
Off er : No Take aways
card companies ; now i ... Although this one
name is still a polit ­ ic ­ al in ­
Porn Pt 4 - Ott os Old e Film 2008 01
­ on to this pres ­ id ­ ent ,
well young man , every once in a while one of
Obama is the worst ne ­ go ­ ti ­ at
ian with a r ak ish edge , full of word

INTERVAL 0.662 - 1.472
CONTAINS 0.879%

she would put them to sleep each night as children ,
If it comes to that point , we will no longer
demand for Argentine debt ; OPEC failed to cut production ,
path ," says Boy ega the day after the final Force
, Y Ba 2 Cu 3 O 6 + x ,

INTERVAL -0.148 - 0.662
CONTAINS 71.802%

team ," he said . " I can 't
but it was always enjoyable ," he said . " I
They were very healthy this year . There were teams I
the sake of putting something in the grid 1
had reacted to unpleasant images by vomiting or crying .

INTERVAL -0.958 - -0.148
CONTAINS 27.269%

100 percent renewables energy mix before mid - century .
the river . It 's this balancing act that you have
their kids that do not conflict with their own values .
our research with a bill to prohibit bear - feeding ,
arrest , Ll anes said today in a statement

INTERVAL -1.768 - -0.958
CONTAINS 0.050%

very close with J eth ro P ugh [ who died
Ford was ind ignant . He denied the allegation
2010 United States 21 50 Posts # 16 Glad to hear
Ac ro Ac ro Ac ro
on local governments and their ability to solve complex problems such
Tokens
Token ID82
Feature activation+9.63e-02

Pos logit contributions
+0.266
+0.246
knit+0.243
+0.232
 Citiz+0.226
Neg logit contributions
ample-0.312
pport-0.292
alone-0.291
 proto-0.274
 amp-0.269
Token constitution
Token ID11784
Feature activation-1.18e-02

Pos logit contributions
+0.427
+0.395
knit+0.393
+0.378
 Citiz+0.370
Neg logit contributions
ample-0.366
alone-0.340
pport-0.338
 proto-0.309
abama-0.307
Token,
Token ID11
Feature activation-8.30e-02

Pos logit contributions
ample+0.050
pport+0.047
alone+0.047
 proto+0.044
 amp+0.043
Neg logit contributions
-0.047
-0.044
knit-0.043
-0.041
 Citiz-0.040
Token and
Token ID290
Feature activation-1.03e-02

Pos logit contributions
ample+0.338
pport+0.314
alone+0.314
 proto+0.292
abama+0.286
Neg logit contributions
-0.347
-0.322
knit-0.318
-0.305
 Citiz-0.300
Token Kim
Token ID6502
Feature activation+6.26e-01

Pos logit contributions
ample+0.041
pport+0.039
alone+0.038
 proto+0.035
abama+0.035
Neg logit contributions
-0.043
-0.040
knit-0.040
-0.038
 Citiz-0.037
Token Jong
Token ID17960
Feature activation+1.47e+00

Pos logit contributions
+2.509
+2.344
knit+2.285
+2.194
 Citiz+2.149
Neg logit contributions
ample-2.631
pport-2.453
alone-2.438
 proto-2.272
abama-2.242
Token-
Token ID12
Feature activation+6.75e-01

Pos logit contributions
+5.078
+4.681
knit+4.534
+4.326
 Citiz+4.243
Neg logit contributions
ample-6.745
pport-6.390
alone-6.336
rawdownloadcloneembedreportprint-5.908
 proto-5.875
Tokenil
Token ID346
Feature activation+1.47e-01

Pos logit contributions
+2.315
+2.132
knit+2.081
+1.968
 Citiz+1.889
Neg logit contributions
ample-3.213
pport-3.053
alone-3.001
 proto-2.859
abama-2.804
Token elected
Token ID7018
Feature activation-4.66e-03

Pos logit contributions
+0.618
+0.577
knit+0.568
+0.545
 Citiz+0.533
Neg logit contributions
ample-0.596
pport-0.552
alone-0.551
 proto-0.516
 amp-0.502
Token as
Token ID355
Feature activation-2.47e-01

Pos logit contributions
ample+0.019
pport+0.018
alone+0.018
 proto+0.016
abama+0.016
Neg logit contributions
-0.019
-0.018
knit-0.018
-0.017
 Citiz-0.017
Token chairman
Token ID8900
Feature activation-1.84e-01

Pos logit contributions
ample+1.032
pport+0.958
alone+0.956
 proto+0.899
 amp+0.876
Neg logit contributions
-1.002
-0.933
knit-0.922
-0.881
 Citiz-0.864
Token UNESCO
Token ID49372
Feature activation+1.79e-01

Pos logit contributions
ample+0.207
alone+0.193
pport+0.193
 proto+0.179
abama+0.175
Neg logit contributions
-0.195
-0.179
knit-0.178
-0.171
 Citiz-0.168
Token (
Token ID357
Feature activation+7.46e-02

Pos logit contributions
+0.734
+0.679
knit+0.673
+0.644
 Citiz+0.634
Neg logit contributions
ample-0.746
pport-0.692
alone-0.691
 proto-0.644
abama-0.631
Tokenthe
Token ID1169
Feature activation+5.15e-01

Pos logit contributions
+0.282
+0.261
knit+0.258
+0.245
 Citiz+0.240
Neg logit contributions
ample-0.333
pport-0.310
alone-0.310
 proto-0.292
 amp-0.285
Token United
Token ID1578
Feature activation+2.93e-01

Pos logit contributions
+1.849
+1.713
knit+1.708
+1.615
 Citiz+1.614
Neg logit contributions
ample-2.353
pport-2.202
alone-2.194
abama-2.022
 proto-2.008
Token Nations
Token ID7973
Feature activation+3.47e-01

Pos logit contributions
+1.110
+1.020
knit+1.013
+0.961
 Citiz+0.947
Neg logit contributions
ample-1.303
alone-1.217
pport-1.216
 proto-1.141
 amp-1.119
Token Educational
Token ID30038
Feature activation+1.44e+00

Pos logit contributions
+1.369
+1.273
knit+1.258
+1.202
 Citiz+1.188
Neg logit contributions
ample-1.476
pport-1.375
alone-1.373
 proto-1.282
 amp-1.257
Token,
Token ID11
Feature activation-3.86e-01

Pos logit contributions
+4.832
+4.354
knit+4.302
+4.108
 Citiz+3.940
Neg logit contributions
ample-6.929
alone-6.538
pport-6.427
 proto-6.029
abama-5.973
Token Scientific
Token ID22060
Feature activation-1.41e-01

Pos logit contributions
ample+1.587
pport+1.480
alone+1.444
 proto+1.424
 amp+1.349
Neg logit contributions
-1.566
-1.479
knit-1.451
-1.425
 Citiz-1.339
Token and
Token ID290
Feature activation-5.56e-01

Pos logit contributions
ample+0.526
pport+0.484
alone+0.483
 proto+0.450
 amp+0.438
Neg logit contributions
-0.632
-0.592
knit-0.587
-0.563
 Citiz-0.555
Token Cultural
Token ID23897
Feature activation+4.19e-01

Pos logit contributions
ample+2.044
pport+1.884
alone+1.838
 proto+1.781
 amp+1.682
Neg logit contributions
-2.484
-2.374
knit-2.331
-2.278
 Citiz-2.152
Token Organisation
Token ID30801
Feature activation-3.55e-02

Pos logit contributions
+1.458
+1.339
knit+1.320
+1.252
 Citiz+1.223
Neg logit contributions
ample-1.995
pport-1.870
alone-1.868
 proto-1.768
 amp-1.728
Token
Token ID251
Feature activation-1.28e-02

Pos logit contributions
+0.606
+0.557
knit+0.546
+0.521
 Citiz+0.504
Neg logit contributions
ample-0.838
pport-0.785
alone-0.784
 proto-0.744
 amp-0.727
Token or
Token ID393
Feature activation-4.16e-02

Pos logit contributions
ample+0.051
pport+0.047
alone+0.047
 proto+0.044
abama+0.043
Neg logit contributions
-0.055
-0.051
knit-0.051
-0.049
 Citiz-0.048
Token
Token ID564
Feature activation+2.61e-01

Pos logit contributions
ample+0.167
pport+0.155
alone+0.155
 proto+0.145
 amp+0.141
Neg logit contributions
-0.175
-0.163
knit-0.161
-0.154
 Citiz-0.152
Token
Token ID250
Feature activation+5.11e-01

Pos logit contributions
+0.730
+0.657
knit+0.637
+0.602
CHA+0.554
Neg logit contributions
ample-1.420
pport-1.346
alone-1.335
 proto-1.288
 amp-1.262
Tokendis
Token ID6381
Feature activation+2.62e-01

Pos logit contributions
+2.030
knit+1.882
+1.882
+1.780
 Citiz+1.725
Neg logit contributions
ample-2.149
pport-2.006
alone-1.987
 proto-1.897
 amp-1.848
Tokencomb
Token ID24011
Feature activation+1.36e+00

Pos logit contributions
+1.150
+1.076
knit+1.061
+1.019
 Citiz+1.005
Neg logit contributions
ample-1.015
pport-0.935
alone-0.934
 proto-0.869
 amp-0.844
Tokenob
Token ID672
Feature activation-5.27e-01

Pos logit contributions
+5.802
knit+5.542
+5.427
+5.154
CHA+5.094
Neg logit contributions
ample-4.835
alone-4.536
pport-4.506
 proto-4.439
abama-4.235
Tokenulated
Token ID4817
Feature activation+2.74e-01

Pos logit contributions
ample+2.211
pport+2.058
alone+2.050
 proto+1.909
 amp+1.885
Neg logit contributions
-2.119
-1.942
knit-1.886
-1.848
 Citiz-1.826
Token
Token ID447
Feature activation+5.24e-01

Pos logit contributions
+1.161
+1.083
knit+1.070
+1.029
 Citiz+1.004
Neg logit contributions
ample-1.098
pport-1.017
alone-1.016
 proto-0.950
 amp-0.925
Token
Token ID251
Feature activation+1.01e-01

Pos logit contributions
+1.786
+1.640
knit+1.590
+1.542
 Citiz+1.462
Neg logit contributions
ample-2.527
alone-2.370
pport-2.368
 proto-2.248
 amp-2.202
Token;
Token ID26
Feature activation+5.89e-01

Pos logit contributions
+0.428
+0.399
knit+0.394
+0.379
 Citiz+0.370
Neg logit contributions
ample-0.402
pport-0.373
alone-0.371
 proto-0.347
abama-0.339
Token they
Token ID484
Feature activation+3.36e-01

Pos logit contributions
+2.125
+1.937
knit+1.920
+1.867
 Citiz+1.830
Neg logit contributions
ample-2.194
pport-2.042
alone-2.034
abama-1.897
 proto-1.897
Token
Token ID447
Feature activation+9.17e-02

Pos logit contributions
+1.381
+1.280
knit+1.272
+1.218
 Citiz+1.180
Neg logit contributions
ample-1.390
pport-1.290
alone-1.286
 proto-1.208
abama-1.178
Token
Token ID247
Feature activation+2.65e-01

Pos logit contributions
+0.249
+0.223
knit+0.217
+0.204
 Citiz+0.193
Neg logit contributions
ample-0.505
pport-0.479
alone-0.478
 proto-0.458
 amp-0.449
Tokenve
Token ID303
Feature activation+1.37e-01

Pos logit contributions
+1.052
+0.979
knit+0.969
+0.923
 Citiz+0.897
Neg logit contributions
ample-1.126
pport-1.048
alone-1.048
 proto-0.988
 amp-0.958
Token been
Token ID587
Feature activation+4.52e-01

Pos logit contributions
+0.552
+0.512
knit+0.507
+0.484
 Citiz+0.474
Neg logit contributions
ample-0.579
pport-0.540
alone-0.538
 proto-0.504
abama-0.493
Token so
Token ID523
Feature activation+1.35e+00

Pos logit contributions
+1.911
+1.777
knit+1.770
+1.689
 Citiz+1.662
Neg logit contributions
ample-1.803
pport-1.679
alone-1.668
 proto-1.542
abama-1.533
Token successful
Token ID4388
Feature activation+3.80e-01

Pos logit contributions
+4.472
knit+4.067
+4.004
+3.743
 Citiz+3.704
Neg logit contributions
ample-6.304
pport-5.972
alone-5.968
abama-5.707
riter-5.438
Token at
Token ID379
Feature activation+1.60e-01

Pos logit contributions
+1.585
+1.476
knit+1.466
+1.400
 Citiz+1.370
Neg logit contributions
ample-1.540
pport-1.431
alone-1.422
 proto-1.335
abama-1.303
Token Bro
Token ID2806
Feature activation+1.12e-01

Pos logit contributions
+0.671
+0.622
knit+0.614
+0.592
 Citiz+0.583
Neg logit contributions
ample-0.651
pport-0.606
alone-0.601
 proto-0.561
abama-0.551
Tokend
Token ID67
Feature activation-7.42e-02

Pos logit contributions
+0.453
+0.421
knit+0.416
+0.398
 Citiz+0.390
Neg logit contributions
ample-0.465
pport-0.432
alone-0.431
 proto-0.406
 amp-0.395
Tokenard
Token ID446
Feature activation-1.03e+00

Pos logit contributions
ample+0.324
pport+0.302
alone+0.302
 proto+0.284
 amp+0.277
Neg logit contributions
-0.287
-0.266
knit-0.262
-0.250
 Citiz-0.245
Token(
Token ID7
Feature activation+1.62e-02

Pos logit contributions
+0.153
+0.140
knit+0.138
+0.132
CHA+0.125
Neg logit contributions
ample-0.210
pport-0.198
alone-0.196
 proto-0.188
 amp-0.183
Token0
Token ID15
Feature activation+6.64e-02

Pos logit contributions
+0.062
+0.058
knit+0.057
+0.055
 Citiz+0.053
Neg logit contributions
ample-0.071
pport-0.066
alone-0.066
 proto-0.062
 amp-0.061
Tokens
Token ID82
Feature activation+8.88e-03

Pos logit contributions
+0.230
+0.211
knit+0.209
+0.198
CHA+0.187
Neg logit contributions
ample-0.316
pport-0.298
alone-0.295
 proto-0.282
 amp-0.274
Token)}
Token ID38165
Feature activation-5.41e-03

Pos logit contributions
+0.032
+0.030
knit+0.030
+0.028
 Citiz+0.027
Neg logit contributions
ample-0.041
pport-0.038
alone-0.038
 proto-0.036
 amp-0.035
Token c
Token ID269
Feature activation+1.47e-01

Pos logit contributions
ample+0.021
pport+0.019
alone+0.019
 proto+0.018
abama+0.017
Neg logit contributions
-0.024
-0.022
knit-0.022
-0.021
 Citiz-0.021
Tokenxd
Token ID24954
Feature activation+1.33e+00

Pos logit contributions
+0.555
+0.514
knit+0.510
+0.485
 Citiz+0.471
Neg logit contributions
ample-0.648
alone-0.607
pport-0.606
 proto-0.571
 amp-0.556
Token4
Token ID19
Feature activation+7.12e-01

Pos logit contributions
+4.082
+3.757
knit+3.636
+3.469
CHA+3.296
Neg logit contributions
ample-6.541
pport-6.358
alone-6.230
 proto-5.940
rawdownloadcloneembedreportprint-5.759
Token {
Token ID1391
Feature activation+1.50e-01

Pos logit contributions
+1.925
+1.741
knit+1.709
+1.575
 Citiz+1.499
Neg logit contributions
ample-3.867
pport-3.703
alone-3.689
 proto-3.476
abama-3.448
Token(
Token ID7
Feature activation+9.10e-02

Pos logit contributions
+0.521
+0.477
knit+0.471
+0.448
CHA+0.425
Neg logit contributions
ample-0.715
pport-0.673
alone-0.667
 proto-0.639
 amp-0.623
Token341
Token ID33660
Feature activation+3.77e-01

Pos logit contributions
+0.348
+0.323
knit+0.319
+0.305
 Citiz+0.295
Neg logit contributions
ample-0.401
pport-0.374
alone-0.373
 proto-0.352
 amp-0.343
Tokens
Token ID82
Feature activation+3.55e-01

Pos logit contributions
+1.276
+1.170
knit+1.160
+1.095
CHA+1.036
Neg logit contributions
ample-1.807
pport-1.712
alone-1.693
 proto-1.614
abama-1.576
Token down
Token ID866
Feature activation-1.83e-01

Pos logit contributions
+0.884
+0.819
knit+0.805
+0.772
 Citiz+0.753
Neg logit contributions
ample-1.016
pport-0.955
alone-0.937
 proto-0.878
abama-0.870
Token as
Token ID355
Feature activation-2.60e-01

Pos logit contributions
ample+0.748
pport+0.691
alone+0.691
 proto+0.649
 amp+0.632
Neg logit contributions
-0.762
-0.713
knit-0.706
-0.673
 Citiz-0.664
Token it
Token ID340
Feature activation-1.54e-01

Pos logit contributions
ample+1.008
pport+0.930
alone+0.929
 proto+0.872
 amp+0.846
Neg logit contributions
-1.129
-1.060
knit-1.048
-1.002
 Citiz-0.985
Token is
Token ID318
Feature activation-3.88e-03

Pos logit contributions
ample+0.583
pport+0.536
alone+0.535
 proto+0.496
abama+0.485
Neg logit contributions
-0.682
-0.637
knit-0.630
-0.606
 Citiz-0.595
Token dem
Token ID1357
Feature activation+2.39e-01

Pos logit contributions
ample+0.015
pport+0.014
alone+0.014
 proto+0.013
abama+0.013
Neg logit contributions
-0.017
-0.015
knit-0.015
-0.015
 Citiz-0.014
Tokene
Token ID68
Feature activation+1.29e+00

Pos logit contributions
+0.865
+0.795
knit+0.787
+0.744
 Citiz+0.724
Neg logit contributions
ample-1.099
alone-1.031
pport-1.026
 proto-0.974
 amp-0.952
Tokenaning
Token ID7574
Feature activation+4.04e-01

Pos logit contributions
+4.138
+3.708
knit+3.582
+3.481
Pont+3.179
Neg logit contributions
ample-6.344
pport-5.959
alone-5.877
 proto-5.659
 amp-5.542
Token for
Token ID329
Feature activation+2.92e-01

Pos logit contributions
+1.747
+1.620
knit+1.609
+1.557
 Citiz+1.508
Neg logit contributions
ample-1.551
pport-1.434
alone-1.433
 proto-1.320
abama-1.298
Token women
Token ID1466
Feature activation+1.31e-01

Pos logit contributions
+1.131
+1.042
knit+1.037
+0.989
 Citiz+0.967
Neg logit contributions
ample-1.264
pport-1.183
alone-1.179
 proto-1.098
abama-1.083
Token,
Token ID11
Feature activation+1.81e-01

Pos logit contributions
+0.564
+0.526
knit+0.521
+0.500
 Citiz+0.490
Neg logit contributions
ample-0.518
pport-0.480
alone-0.478
 proto-0.445
 amp-0.433
Token but
Token ID475
Feature activation+9.36e-02

Pos logit contributions
+0.759
+0.700
knit+0.695
+0.668
 Citiz+0.657
Neg logit contributions
ample-0.726
alone-0.673
pport-0.671
 proto-0.623
abama-0.615
TokenIES
Token ID11015
Feature activation+4.42e-02

Pos logit contributions
ample+0.353
pport+0.333
alone+0.332
 proto+0.313
 amp+0.309
Neg logit contributions
-0.252
-0.231
knit-0.228
-0.215
 Citiz-0.214
Token
Token ID198
Feature activation+5.42e-01

Pos logit contributions
+0.181
+0.168
knit+0.166
+0.159
 Citiz+0.155
Neg logit contributions
ample-0.184
pport-0.171
alone-0.170
 proto-0.159
abama-0.156
Token
Token ID198
Feature activation+5.21e-01

Pos logit contributions
+2.307
knit+2.132
+2.120
+2.036
CHA+1.989
Neg logit contributions
ample-2.119
pport-1.990
alone-1.908
 proto-1.858
emort-1.817
TokenM
Token ID44
Feature activation-1.68e-01

Pos logit contributions
+2.035
+1.856
knit+1.836
+1.778
CHA+1.706
Neg logit contributions
ample-2.266
pport-2.117
alone-2.091
 proto-2.015
 amp-1.944
TokenOST
Token ID10892
Feature activation+7.61e-01

Pos logit contributions
ample+0.672
pport+0.623
alone+0.620
 proto+0.584
 amp+0.566
Neg logit contributions
-0.711
-0.663
knit-0.654
-0.628
 Citiz-0.614
Token READ
Token ID20832
Feature activation+1.27e+00

Pos logit contributions
+3.089
+2.821
knit+2.791
+2.698
CHA+2.638
Neg logit contributions
ample-3.190
alone-2.934
pport-2.899
 proto-2.729
abama-2.695
Token<|endoftext|>
Token ID50256
Feature activation+6.65e-02

Pos logit contributions
+4.803
+4.201
knit+4.154
+4.100
CHA+3.998
Neg logit contributions
ample-5.594
pport-5.212
alone-5.207
 proto-4.836
abama-4.816
TokenK
Token ID42
Feature activation-3.12e-01

Pos logit contributions
+0.256
+0.237
knit+0.234
+0.223
 Citiz+0.217
Neg logit contributions
ample-0.292
pport-0.273
alone-0.272
 proto-0.257
 amp-0.250
Tokenilled
Token ID2967
Feature activation+3.21e-01

Pos logit contributions
ample+1.308
pport+1.215
alone+1.213
 proto+1.139
 amp+1.109
Neg logit contributions
-1.259
-1.172
knit-1.157
-1.107
 Citiz-1.086
Token for
Token ID329
Feature activation+2.75e-01

Pos logit contributions
+1.386
+1.290
knit+1.286
+1.222
 Citiz+1.214
Neg logit contributions
ample-1.243
pport-1.155
alone-1.143
 proto-1.060
abama-1.037
Token watching
Token ID4964
Feature activation-2.92e-01

Pos logit contributions
+1.151
+1.075
knit+1.053
+1.014
 Citiz+1.009
Neg logit contributions
ample-1.111
alone-1.025
pport-1.024
 proto-0.951
abama-0.934
Token #
Token ID1303
Feature activation+2.94e-01

Pos logit contributions
ample+0.068
pport+0.063
alone+0.063
 proto+0.059
abama+0.057
Neg logit contributions
-0.065
-0.060
knit-0.059
-0.057
 Citiz-0.056
Token2
Token ID17
Feature activation+1.14e-01

Pos logit contributions
+1.057
+0.970
knit+0.963
+0.914
 Citiz+0.886
Neg logit contributions
ample-1.362
pport-1.275
alone-1.271
 proto-1.207
 amp-1.179
Token,
Token ID11
Feature activation+1.94e-02

Pos logit contributions
+0.436
+0.404
knit+0.399
+0.380
 Citiz+0.372
Neg logit contributions
ample-0.503
pport-0.469
alone-0.468
 proto-0.441
 amp-0.430
Token but
Token ID475
Feature activation+5.43e-02

Pos logit contributions
+0.077
+0.072
knit+0.071
+0.068
 Citiz+0.067
Neg logit contributions
ample-0.082
pport-0.076
alone-0.076
 proto-0.072
 amp-0.070
Token trust
Token ID3774
Feature activation+1.01e-01

Pos logit contributions
+0.214
+0.198
knit+0.196
+0.187
 Citiz+0.183
Neg logit contributions
ample-0.233
pport-0.217
alone-0.216
 proto-0.203
abama-0.199
Token me
Token ID502
Feature activation+1.25e+00

Pos logit contributions
+0.450
+0.422
knit+0.418
+0.401
 Citiz+0.394
Neg logit contributions
ample-0.377
pport-0.347
alone-0.347
 proto-0.323
 amp-0.313
Token,
Token ID11
Feature activation+5.40e-01

Pos logit contributions
+4.732
+4.446
knit+4.315
+4.157
CHA+4.045
Neg logit contributions
ample-5.320
pport-5.041
alone-4.998
abama-4.749
 proto-4.704
Token your
Token ID534
Feature activation+1.32e-01

Pos logit contributions
+2.014
+1.852
knit+1.841
+1.760
 Citiz+1.706
Neg logit contributions
ample-2.398
pport-2.244
alone-2.239
abama-2.119
 proto-2.106
Token plans
Token ID3352
Feature activation-1.73e-01

Pos logit contributions
+0.560
+0.519
knit+0.513
+0.497
 Citiz+0.484
Neg logit contributions
ample-0.526
pport-0.487
alone-0.485
 proto-0.452
abama-0.444
Token are
Token ID389
Feature activation+8.89e-02

Pos logit contributions
ample+0.685
pport+0.633
alone+0.632
 proto+0.591
abama+0.575
Neg logit contributions
-0.741
-0.691
knit-0.685
-0.655
 Citiz-0.640
Token going
Token ID1016
Feature activation-4.79e-01

Pos logit contributions
+0.356
+0.331
knit+0.327
+0.312
 Citiz+0.305
Neg logit contributions
ample-0.375
pport-0.349
alone-0.349
 proto-0.327
abama-0.320
Token over
Token ID625
Feature activation+1.94e-01

Pos logit contributions
ample+0.436
pport+0.404
alone+0.403
 proto+0.377
abama+0.367
Neg logit contributions
-0.432
-0.402
knit-0.397
-0.381
 Citiz-0.374
Tokenrated
Token ID4111
Feature activation+5.32e-02

Pos logit contributions
+0.735
+0.680
knit+0.675
+0.641
 Citiz+0.629
Neg logit contributions
ample-0.856
pport-0.801
alone-0.799
 proto-0.753
abama-0.734
Token hacks
Token ID30445
Feature activation+3.50e-01

Pos logit contributions
+0.218
+0.200
knit+0.199
+0.191
 Citiz+0.187
Neg logit contributions
ample-0.221
pport-0.206
alone-0.205
 proto-0.188
abama-0.188
Token like
Token ID588
Feature activation-4.99e-01

Pos logit contributions
+1.430
+1.329
knit+1.313
+1.259
 Citiz+1.234
Neg logit contributions
ample-1.449
pport-1.346
alone-1.341
 proto-1.258
 amp-1.226
Token Van
Token ID6656
Feature activation+5.39e-01

Pos logit contributions
ample+2.178
alone+2.028
pport+2.024
 proto+1.927
 amp+1.883
Neg logit contributions
-1.908
-1.779
knit-1.753
-1.669
 Citiz-1.628
Token Go
Token ID1514
Feature activation+1.25e+00

Pos logit contributions
+2.310
+2.158
knit+2.134
+2.046
 Citiz+2.009
Neg logit contributions
ample-2.130
pport-1.969
alone-1.965
 proto-1.837
 amp-1.786
Tokengh
Token ID456
Feature activation-1.47e-01

Pos logit contributions
+5.635
+5.364
knit+5.297
+4.909
CHA+4.876
Neg logit contributions
ample-4.572
pport-4.173
alone-3.945
 proto-3.891
abama-3.763
Token,
Token ID11
Feature activation+1.17e-01

Pos logit contributions
ample+0.609
alone+0.565
pport+0.564
 proto+0.530
 amp+0.516
Neg logit contributions
-0.597
-0.556
knit-0.550
-0.525
 Citiz-0.517
Token Ing
Token ID17589
Feature activation+1.70e-01

Pos logit contributions
+0.462
+0.426
knit+0.421
+0.404
 Citiz+0.398
Neg logit contributions
ample-0.500
pport-0.466
alone-0.465
 proto-0.432
abama-0.426
Tokenmar
Token ID3876
Feature activation-2.05e-01

Pos logit contributions
+0.741
+0.694
knit+0.685
+0.657
 Citiz+0.648
Neg logit contributions
ample-0.659
alone-0.609
pport-0.609
 proto-0.568
 amp-0.553
Token Berg
Token ID24626
Feature activation+7.86e-01

Pos logit contributions
ample+0.869
alone+0.807
pport+0.796
 proto+0.792
 amp+0.771
Neg logit contributions
-0.773
knit-0.761
-0.719
-0.704
 Citiz-0.685
Token cancer
Token ID4890
Feature activation-3.19e-01

Pos logit contributions
+1.015
+0.948
knit+0.938
+0.898
 Citiz+0.888
Neg logit contributions
ample-0.912
pport-0.838
alone-0.838
 proto-0.774
abama-0.767
Token and
Token ID290
Feature activation+1.32e-01

Pos logit contributions
ample+1.366
alone+1.272
pport+1.271
 proto+1.195
 amp+1.169
Neg logit contributions
-1.254
-1.163
knit-1.151
-1.098
 Citiz-1.080
Token one
Token ID530
Feature activation-1.90e-01

Pos logit contributions
+0.538
+0.502
knit+0.499
+0.473
 Citiz+0.467
Neg logit contributions
ample-0.551
pport-0.508
alone-0.507
 proto-0.472
abama-0.465
Token of
Token ID286
Feature activation-3.55e-01

Pos logit contributions
ample+0.722
pport+0.665
alone+0.664
 proto+0.619
 amp+0.602
Neg logit contributions
-0.845
-0.792
knit-0.783
-0.752
 Citiz-0.740
Token the
Token ID262
Feature activation-1.30e-01

Pos logit contributions
ample+1.456
pport+1.360
alone+1.348
 proto+1.273
 amp+1.248
Neg logit contributions
-1.458
-1.365
knit-1.345
-1.295
 Citiz-1.269
Token hardest
Token ID17612
Feature activation+1.25e+00

Pos logit contributions
ample+0.544
pport+0.504
alone+0.504
 proto+0.472
 amp+0.460
Neg logit contributions
-0.529
-0.492
knit-0.487
-0.465
 Citiz-0.457
Token things
Token ID1243
Feature activation+7.02e-01

Pos logit contributions
+4.702
knit+4.466
+4.413
+4.121
CHA+3.984
Neg logit contributions
ample-5.319
alone-4.844
pport-4.813
abama-4.521
 proto-4.430
Token I
Token ID314
Feature activation+4.10e-01

Pos logit contributions
+2.830
+2.679
knit+2.637
+2.499
 Citiz+2.456
Neg logit contributions
ample-2.885
pport-2.662
alone-2.644
 proto-2.483
abama-2.449
Token have
Token ID423
Feature activation+3.31e-01

Pos logit contributions
+1.617
knit+1.508
+1.506
+1.413
 Citiz+1.365
Neg logit contributions
ample-1.754
pport-1.617
alone-1.599
 proto-1.510
abama-1.478
Token dealt
Token ID11829
Feature activation-3.67e-03

Pos logit contributions
+1.194
+1.115
knit+1.102
+1.032
 Citiz+1.004
Neg logit contributions
ample-1.523
pport-1.420
alone-1.406
 proto-1.327
abama-1.303
Token with
Token ID351
Feature activation+2.22e-01

Pos logit contributions
ample+0.014
alone+0.013
pport+0.013
 proto+0.012
 amp+0.012
Neg logit contributions
-0.016
-0.015
knit-0.015
-0.015
 Citiz-0.014
Token ceremony
Token ID10542
Feature activation-2.58e-01

Pos logit contributions
+0.941
+0.868
knit+0.857
+0.819
 Citiz+0.799
Neg logit contributions
ample-1.111
pport-1.039
alone-1.031
 proto-0.963
abama-0.945
Token my
Token ID616
Feature activation-1.70e-01

Pos logit contributions
ample+1.083
alone+1.006
pport+1.004
 proto+0.948
 amp+0.927
Neg logit contributions
-1.037
-0.967
knit-0.952
-0.912
 Citiz-0.898
Token life
Token ID1204
Feature activation-8.14e-02

Pos logit contributions
ample+0.736
pport+0.685
alone+0.684
 proto+0.642
abama+0.625
Neg logit contributions
-0.665
-0.615
knit-0.609
-0.581
 Citiz-0.568
Token took
Token ID1718
Feature activation+2.60e-01

Pos logit contributions
ample+0.347
pport+0.323
alone+0.322
 proto+0.302
 amp+0.294
Neg logit contributions
-0.324
-0.301
knit-0.298
-0.284
 Citiz-0.278
Token a
Token ID257
Feature activation+2.80e-01

Pos logit contributions
+1.122
+1.047
knit+1.034
+0.995
 Citiz+0.971
Neg logit contributions
ample-1.008
pport-0.939
alone-0.934
 proto-0.867
abama-0.848
Token turn
Token ID1210
Feature activation+1.24e+00

Pos logit contributions
+1.149
+1.068
knit+1.056
+1.012
 Citiz+0.985
Neg logit contributions
ample-1.155
pport-1.073
alone-1.073
 proto-0.996
abama-0.977
Token for
Token ID329
Feature activation+4.49e-01

Pos logit contributions
+5.113
knit+4.745
+4.728
+4.547
CHA+4.324
Neg logit contributions
ample-4.765
pport-4.472
alone-4.441
abama-4.089
 proto-4.069
Token the
Token ID262
Feature activation-1.79e-01

Pos logit contributions
+1.678
+1.547
knit+1.524
+1.456
 Citiz+1.412
Neg logit contributions
ample-2.022
alone-1.886
pport-1.883
 proto-1.772
abama-1.728
Token worse
Token ID4785
Feature activation+2.33e-01

Pos logit contributions
ample+0.680
pport+0.627
alone+0.625
 proto+0.587
 amp+0.570
Neg logit contributions
-0.791
-0.745
knit-0.736
-0.705
 Citiz-0.697
Token.
Token ID13
Feature activation+1.17e-01

Pos logit contributions
+0.910
+0.842
knit+0.833
+0.793
 Citiz+0.770
Neg logit contributions
ample-1.003
pport-0.936
alone-0.933
 proto-0.877
abama-0.854
Token My
Token ID2011
Feature activation-3.92e-02

Pos logit contributions
+0.477
+0.444
knit+0.439
+0.417
 Citiz+0.409
Neg logit contributions
ample-0.484
pport-0.453
alone-0.445
 proto-0.419
abama-0.410
Token other
Token ID584
Feature activation+2.45e-01

Pos logit contributions
ample+0.097
pport+0.090
alone+0.090
 proto+0.084
abama+0.082
Neg logit contributions
-0.103
-0.096
knit-0.095
-0.091
 Citiz-0.090
Token possibility
Token ID5885
Feature activation+1.88e-01

Pos logit contributions
+1.085
+1.014
knit+1.007
+0.968
 Citiz+0.949
Neg logit contributions
ample-0.915
alone-0.849
pport-0.846
abama-0.771
 proto-0.771
Token that
Token ID326
Feature activation+4.35e-01

Pos logit contributions
+0.764
+0.713
knit+0.704
+0.671
 Citiz+0.659
Neg logit contributions
ample-0.783
pport-0.728
alone-0.724
 proto-0.677
abama-0.665
Token comes
Token ID2058
Feature activation+5.94e-02

Pos logit contributions
+1.786
+1.667
knit+1.662
+1.568
 Citiz+1.564
Neg logit contributions
ample-1.757
pport-1.628
alone-1.625
abama-1.520
 proto-1.493
Token to
Token ID284
Feature activation+6.58e-02

Pos logit contributions
+0.245
+0.229
knit+0.225
+0.215
 Citiz+0.211
Neg logit contributions
ample-0.244
pport-0.227
alone-0.225
 proto-0.210
abama-0.206
Token mind
Token ID2000
Feature activation+1.24e+00

Pos logit contributions
+0.213
+0.194
knit+0.191
+0.180
 Citiz+0.176
Neg logit contributions
ample-0.327
alone-0.308
pport-0.308
 proto-0.291
abama-0.286
Token is
Token ID318
Feature activation+3.63e-02

Pos logit contributions
+4.457
+4.127
knit+4.108
+3.904
CHA+3.685
Neg logit contributions
ample-5.384
pport-5.025
alone-4.980
abama-4.648
 proto-4.626
Token that
Token ID326
Feature activation-2.47e-01

Pos logit contributions
+0.142
+0.132
knit+0.130
+0.124
 Citiz+0.122
Neg logit contributions
ample-0.156
pport-0.146
alone-0.145
 proto-0.136
abama-0.133
Token having
Token ID1719
Feature activation-6.69e-02

Pos logit contributions
ample+1.019
pport+0.945
alone+0.944
 proto+0.885
 amp+0.862
Neg logit contributions
-1.015
-0.945
knit-0.934
-0.894
 Citiz-0.877
Token a
Token ID257
Feature activation+4.46e-01

Pos logit contributions
ample+0.262
pport+0.244
alone+0.243
 proto+0.226
abama+0.221
Neg logit contributions
-0.286
-0.267
knit-0.265
-0.254
 Citiz-0.249
Token video
Token ID2008
Feature activation+2.24e-01

Pos logit contributions
+1.918
+1.785
knit+1.774
+1.699
 Citiz+1.672
Neg logit contributions
ample-1.732
pport-1.604
alone-1.597
 proto-1.462
abama-1.459
Token an
Token ID281
Feature activation+3.84e-01

Pos logit contributions
+1.125
+1.041
knit+1.035
+0.991
 Citiz+0.976
Neg logit contributions
ample-1.054
pport-0.973
alone-0.969
 proto-0.902
abama-0.888
Token advocacy
Token ID15364
Feature activation-5.44e-02

Pos logit contributions
+1.625
knit+1.487
+1.486
+1.426
 Citiz+1.404
Neg logit contributions
ample-1.529
pport-1.417
alone-1.409
 proto-1.312
abama-1.277
Token group
Token ID1448
Feature activation+1.34e-01

Pos logit contributions
ample+0.234
pport+0.218
alone+0.218
 proto+0.205
 amp+0.200
Neg logit contributions
-0.214
-0.199
knit-0.196
-0.187
 Citiz-0.184
Token,"
Token ID553
Feature activation+1.57e-02

Pos logit contributions
+0.546
+0.508
knit+0.502
+0.481
 Citiz+0.471
Neg logit contributions
ample-0.553
pport-0.513
alone-0.512
 proto-0.481
 amp-0.468
Token Mc
Token ID1982
Feature activation+3.49e-01

Pos logit contributions
+0.068
+0.064
knit+0.063
+0.060
 Citiz+0.059
Neg logit contributions
ample-0.062
pport-0.057
alone-0.057
 proto-0.053
 amp-0.052
TokenQu
Token ID4507
Feature activation+1.24e+00

Pos logit contributions
+0.867
+0.766
knit+0.749
+0.697
 Citiz+0.657
Neg logit contributions
ample-2.008
pport-1.904
alone-1.895
 proto-1.825
 amp-1.789
Tokenill
Token ID359
Feature activation+1.94e-01

Pos logit contributions
+3.011
knit+2.629
+2.619
+2.335
Pont+2.250
Neg logit contributions
ample-6.717
pport-6.503
 proto-6.468
alone-6.253
 amp-6.107
Tokenan
Token ID272
Feature activation+1.79e-01

Pos logit contributions
+0.819
+0.759
knit+0.757
+0.712
CHA+0.693
Neg logit contributions
ample-0.767
pport-0.717
alone-0.699
 proto-0.679
 amp-0.651
Token told
Token ID1297
Feature activation+3.87e-01

Pos logit contributions
+0.773
+0.722
knit+0.713
+0.685
 Citiz+0.669
Neg logit contributions
ample-0.698
pport-0.645
alone-0.640
 proto-0.602
 amp-0.583
Token Polit
Token ID7793
Feature activation+2.93e-01

Pos logit contributions
+1.548
+1.428
knit+1.422
+1.357
 Citiz+1.341
Neg logit contributions
ample-1.635
pport-1.510
alone-1.500
 proto-1.410
abama-1.381
Tokeni
Token ID72
Feature activation-1.38e-01

Pos logit contributions
+1.375
+1.286
knit+1.275
+1.226
 Citiz+1.200
Neg logit contributions
ample-1.031
pport-0.956
alone-0.950
 proto-0.891
 amp-0.855
TokenEP
Token ID8905
Feature activation-2.91e-01

Pos logit contributions
ample+2.559
pport+2.425
alone+2.419
 proto+2.311
 amp+2.268
Neg logit contributions
-1.151
-1.028
knit-1.003
-0.924
 Citiz-0.924
Token uses
Token ID3544
Feature activation-6.32e-02

Pos logit contributions
ample+1.133
pport+1.047
alone+1.044
 proto+0.975
 amp+0.948
Neg logit contributions
-1.265
-1.182
knit-1.169
-1.122
 Citiz-1.101
Token the
Token ID262
Feature activation+3.81e-02

Pos logit contributions
ample+0.252
pport+0.234
alone+0.233
 proto+0.216
abama+0.212
Neg logit contributions
-0.268
-0.251
knit-0.248
-0.237
 Citiz-0.232
Token stream
Token ID4269
Feature activation-2.12e-01

Pos logit contributions
+0.163
+0.153
knit+0.152
+0.145
 Citiz+0.142
Neg logit contributions
ample-0.150
pport-0.139
alone-0.139
 proto-0.128
abama-0.126
Token cipher
Token ID38012
Feature activation+5.03e-01

Pos logit contributions
ample+0.758
pport+0.694
alone+0.693
 proto+0.643
 amp+0.623
Neg logit contributions
-0.991
-0.931
knit-0.922
-0.887
 Citiz-0.873
Token RC
Token ID13987
Feature activation+1.23e+00

Pos logit contributions
+2.027
+1.907
knit+1.885
+1.798
 Citiz+1.744
Neg logit contributions
ample-2.048
pport-1.936
alone-1.903
abama-1.787
 proto-1.763
Token4
Token ID19
Feature activation+7.01e-01

Pos logit contributions
+3.901
knit+3.648
+3.622
+3.503
CHA+3.300
Neg logit contributions
ample-5.931
alone-5.660
pport-5.647
abama-5.279
 proto-5.121
Token for
Token ID329
Feature activation-1.52e-01

Pos logit contributions
+2.936
+2.770
knit+2.730
+2.625
 Citiz+2.528
Neg logit contributions
ample-2.763
pport-2.580
alone-2.573
abama-2.364
 proto-2.342
Token confidentiality
Token ID32554
Feature activation+2.33e-01

Pos logit contributions
ample+0.580
pport+0.534
alone+0.533
 proto+0.494
abama+0.482
Neg logit contributions
-0.671
-0.628
knit-0.622
-0.597
 Citiz-0.586
Token,[
Token ID17414
Feature activation+8.30e-02

Pos logit contributions
+0.917
+0.852
knit+0.838
+0.809
 Citiz+0.785
Neg logit contributions
ample-0.995
pport-0.928
alone-0.924
 proto-0.862
abama-0.854
Token9
Token ID24
Feature activation+3.44e-01

Pos logit contributions
+0.356
+0.332
knit+0.329
+0.315
 Citiz+0.309
Neg logit contributions
ample-0.328
pport-0.303
alone-0.303
 proto-0.283
 amp-0.275
Token volatility
Token ID30772
Feature activation+9.76e-05

Pos logit contributions
+1.033
+0.960
knit+0.949
+0.917
 Citiz+0.897
Neg logit contributions
ample-0.938
pport-0.871
alone-0.869
 proto-0.798
abama-0.783
Token is
Token ID318
Feature activation+3.52e-01

Pos logit contributions
+0.000
+0.000
knit+0.000
+0.000
 Citiz+0.000
Neg logit contributions
ample-0.000
pport-0.000
alone-0.000
 proto-0.000
abama-0.000
Token a
Token ID257
Feature activation+1.20e-01

Pos logit contributions
+1.466
+1.360
knit+1.348
+1.289
 Citiz+1.266
Neg logit contributions
ample-1.420
alone-1.315
pport-1.314
 proto-1.206
abama-1.201
Token result
Token ID1255
Feature activation+1.09e-01

Pos logit contributions
+0.544
+0.512
knit+0.504
+0.485
 Citiz+0.476
Neg logit contributions
ample-0.443
alone-0.408
pport-0.407
 proto-0.368
abama-0.367
Token of
Token ID286
Feature activation+8.22e-02

Pos logit contributions
+0.492
+0.461
knit+0.455
+0.437
 Citiz+0.426
Neg logit contributions
ample-0.406
pport-0.375
alone-0.373
 proto-0.347
abama-0.336
Token supply
Token ID5127
Feature activation+1.23e+00

Pos logit contributions
+0.348
+0.324
knit+0.320
+0.307
 Citiz+0.301
Neg logit contributions
ample-0.330
pport-0.305
alone-0.305
 proto-0.284
abama-0.276
Token-
Token ID12
Feature activation-3.46e-03

Pos logit contributions
+5.373
+4.974
knit+4.826
+4.693
 Citiz+4.532
Neg logit contributions
ample-4.664
alone-4.279
pport-4.271
 proto-3.925
abama-3.916
Tokenand
Token ID392
Feature activation-3.94e-01

Pos logit contributions
ample+0.012
pport+0.011
alone+0.011
 proto+0.010
 amp+0.010
Neg logit contributions
-0.016
-0.015
knit-0.015
-0.015
 Citiz-0.014
Token-
Token ID12
Feature activation-1.02e-01

Pos logit contributions
ample+1.891
pport+1.775
alone+1.772
 proto+1.679
 amp+1.646
Neg logit contributions
-1.354
-1.244
knit-1.223
-1.162
 Citiz-1.146
Tokendemand
Token ID28550
Feature activation+6.36e-01

Pos logit contributions
ample+0.422
pport+0.392
alone+0.392
 proto+0.370
 amp+0.359
Neg logit contributions
-0.417
-0.386
knit-0.384
-0.366
 Citiz-0.357
Token fundamentals
Token ID33099
Feature activation+4.74e-01

Pos logit contributions
+2.918
+2.741
knit+2.660
+2.582
 Citiz+2.510
Neg logit contributions
ample-2.326
alone-2.131
pport-2.118
 proto-1.943
abama-1.921
Token D
Token ID360
Feature activation-2.40e-01

Pos logit contributions
+1.437
+1.339
knit+1.325
+1.252
 Citiz+1.222
Neg logit contributions
ample-1.540
pport-1.446
alone-1.407
 proto-1.331
abama-1.307
TokenIV
Token ID3824
Feature activation+7.00e-01

Pos logit contributions
ample+0.973
alone+0.902
pport+0.900
 proto+0.841
 amp+0.821
Neg logit contributions
-1.002
-0.937
knit-0.924
-0.887
 Citiz-0.871
TokenIDE
Token ID14114
Feature activation+6.23e-01

Pos logit contributions
+1.830
+1.587
knit+1.583
+1.467
CHA+1.379
Neg logit contributions
ample-3.907
pport-3.746
alone-3.691
 proto-3.566
abama-3.490
Token AND
Token ID5357
Feature activation-1.30e-03

Pos logit contributions
+2.713
knit+2.511
+2.507
+2.392
 Citiz+2.355
Neg logit contributions
ample-2.383
pport-2.219
alone-2.165
 proto-2.027
abama-2.003
Token CON
Token ID7102
Feature activation+1.22e-01

Pos logit contributions
ample+0.005
pport+0.004
alone+0.004
 proto+0.004
 amp+0.004
Neg logit contributions
-0.006
-0.006
knit-0.006
-0.005
 Citiz-0.005
TokenQU
Token ID10917
Feature activation+1.22e+00

Pos logit contributions
+0.408
+0.372
knit+0.368
+0.348
 Citiz+0.336
Neg logit contributions
ample-0.597
pport-0.561
alone-0.560
 proto-0.532
 amp-0.521
TokenER
Token ID1137
Feature activation+3.25e-01

Pos logit contributions
+2.830
knit+2.446
+2.399
CHA+2.223
+2.184
Neg logit contributions
ample-7.059
pport-6.870
alone-6.705
 proto-6.478
abama-6.438
Token was
Token ID373
Feature activation+1.20e-01

Pos logit contributions
+1.433
+1.340
knit+1.328
+1.273
 Citiz+1.247
Neg logit contributions
ample-1.238
pport-1.145
alone-1.139
 proto-1.063
abama-1.036
Token about
Token ID546
Feature activation+8.24e-02

Pos logit contributions
+0.492
+0.458
knit+0.453
+0.433
 Citiz+0.425
Neg logit contributions
ample-0.494
pport-0.459
alone-0.456
 proto-0.428
 amp-0.417
Token introducing
Token ID16118
Feature activation+4.76e-02

Pos logit contributions
+0.333
+0.311
knit+0.308
+0.294
 Citiz+0.288
Neg logit contributions
ample-0.343
pport-0.319
alone-0.317
 proto-0.295
abama-0.289
Token the
Token ID262
Feature activation-1.74e-01

Pos logit contributions
+0.190
+0.177
knit+0.175
+0.167
 Citiz+0.163
Neg logit contributions
ample-0.201
pport-0.187
alone-0.187
 proto-0.174
abama-0.171
TokenBird
Token ID42562
Feature activation+2.43e-01

Pos logit contributions
ample+0.846
alone+0.790
pport+0.784
 proto+0.739
 amp+0.721
Neg logit contributions
-0.720
-0.671
knit-0.658
-0.628
 Citiz-0.624
Tokensey
Token ID4397
Feature activation-6.14e-02

Pos logit contributions
+0.929
+0.863
knit+0.855
+0.810
 Citiz+0.795
Neg logit contributions
ample-1.067
pport-0.995
alone-0.991
 proto-0.934
 amp-0.913
Tokene
Token ID68
Feature activation+1.25e-01

Pos logit contributions
ample+0.276
pport+0.258
alone+0.257
 proto+0.243
 amp+0.237
Neg logit contributions
-0.230
-0.213
knit-0.210
-0.200
 Citiz-0.195
Token view
Token ID1570
Feature activation+9.09e-02

Pos logit contributions
+0.530
+0.495
knit+0.490
+0.466
 Citiz+0.460
Neg logit contributions
ample-0.500
pport-0.465
alone-0.461
 proto-0.429
abama-0.420
Token of
Token ID286
Feature activation+2.11e-01

Pos logit contributions
+0.400
+0.375
knit+0.371
+0.355
 Citiz+0.347
Neg logit contributions
ample-0.348
pport-0.322
alone-0.319
 proto-0.298
 amp-0.290
Token Brigham
Token ID37434
Feature activation+1.22e+00

Pos logit contributions
+0.720
+0.656
knit+0.650
+0.613
 Citiz+0.601
Neg logit contributions
ample-1.020
pport-0.961
alone-0.955
 proto-0.900
abama-0.883
Token City
Token ID2254
Feature activation-3.30e-02

Pos logit contributions
+4.632
+4.227
knit+4.171
+4.007
 Citiz+3.879
Neg logit contributions
ample-5.193
pport-4.965
alone-4.795
 proto-4.507
abama-4.470
Token,
Token ID11
Feature activation+4.55e-01

Pos logit contributions
ample+0.145
pport+0.135
alone+0.135
 proto+0.127
 amp+0.124
Neg logit contributions
-0.126
-0.117
knit-0.115
-0.110
 Citiz-0.108
Token Utah
Token ID10202
Feature activation+1.73e-02

Pos logit contributions
+1.476
+1.327
knit+1.314
+1.244
 Citiz+1.215
Neg logit contributions
ample-2.274
pport-2.151
alone-2.124
 proto-2.001
abama-1.976
Token Date
Token ID7536
Feature activation-1.13e-01

Pos logit contributions
+0.070
+0.065
knit+0.065
+0.062
 Citiz+0.061
Neg logit contributions
ample-0.072
pport-0.067
alone-0.067
 proto-0.063
 amp-0.061
Token:
Token ID25
Feature activation-3.24e-01

Pos logit contributions
ample+0.468
pport+0.435
alone+0.434
 proto+0.407
 amp+0.396
Neg logit contributions
-0.466
-0.434
knit-0.429
-0.410
 Citiz-0.402
Token
Token ID198
Feature activation+7.71e-02

Pos logit contributions
+0.897
+0.836
knit+0.836
+0.797
CHA+0.774
Neg logit contributions
ample-0.745
pport-0.696
alone-0.677
 proto-0.644
abama-0.623
TokenIt
Token ID1026
Feature activation-4.83e-02

Pos logit contributions
+0.304
+0.281
knit+0.279
+0.266
 Citiz+0.257
Neg logit contributions
ample-0.332
pport-0.310
alone-0.308
 proto-0.293
 amp-0.284
Token would
Token ID561
Feature activation+1.67e-01

Pos logit contributions
ample+0.187
pport+0.172
alone+0.172
 proto+0.160
 amp+0.156
Neg logit contributions
-0.211
-0.197
knit-0.195
-0.187
 Citiz-0.184
Token be
Token ID307
Feature activation+3.78e-02

Pos logit contributions
+0.688
+0.640
knit+0.633
+0.606
 Citiz+0.593
Neg logit contributions
ample-0.687
pport-0.637
alone-0.636
 proto-0.597
 amp-0.579
Token dis
Token ID595
Feature activation-5.56e-01

Pos logit contributions
+0.159
+0.148
knit+0.146
+0.141
 Citiz+0.137
Neg logit contributions
ample-0.152
alone-0.141
pport-0.140
 proto-0.131
abama-0.128
Tokenpir
Token ID4063
Feature activation+1.22e+00

Pos logit contributions
ample+2.869
pport+2.729
alone+2.694
 proto+2.561
 amp+2.504
Neg logit contributions
-1.702
-1.548
knit-1.516
-1.419
 Citiz-1.390
Tokeniting
Token ID1780
Feature activation+4.89e-02

Pos logit contributions
+4.487
+4.194
knit+4.099
+3.965
 Citiz+3.769
Neg logit contributions
ample-5.405
alone-5.121
pport-5.007
 proto-4.831
 amp-4.669
Token if
Token ID611
Feature activation+1.32e-01

Pos logit contributions
+0.203
+0.190
knit+0.188
+0.180
 Citiz+0.176
Neg logit contributions
ample-0.199
pport-0.185
alone-0.184
 proto-0.172
abama-0.168
Token the
Token ID262
Feature activation+3.18e-02

Pos logit contributions
+0.508
+0.468
knit+0.464
+0.444
 Citiz+0.433
Neg logit contributions
ample-0.580
pport-0.542
alone-0.542
 proto-0.508
abama-0.497
Token book
Token ID1492
Feature activation-2.84e-01

Pos logit contributions
+0.144
+0.134
knit+0.134
+0.129
 Citiz+0.126
Neg logit contributions
ample-0.118
pport-0.109
alone-0.108
 proto-0.100
abama-0.098
Token came
Token ID1625
Feature activation-1.78e-01

Pos logit contributions
ample+1.095
pport+1.010
alone+1.008
 proto+0.943
 amp+0.916
Neg logit contributions
-1.239
-1.158
knit-1.147
-1.098
 Citiz-1.081
Token but
Token ID475
Feature activation+1.77e-01

Pos logit contributions
+0.462
+0.429
knit+0.423
+0.404
 Citiz+0.394
Neg logit contributions
ample-0.514
pport-0.478
alone-0.477
 proto-0.449
abama-0.438
Token unless
Token ID4556
Feature activation-4.16e-02

Pos logit contributions
+0.691
+0.642
knit+0.637
+0.604
 Citiz+0.592
Neg logit contributions
ample-0.761
alone-0.708
pport-0.706
 proto-0.660
abama-0.653
Token you
Token ID345
Feature activation+2.32e-02

Pos logit contributions
ample+0.192
alone+0.180
pport+0.179
 proto+0.169
abama+0.166
Neg logit contributions
-0.150
-0.138
knit-0.136
-0.130
 Citiz-0.126
Token've
Token ID1053
Feature activation-1.92e-01

Pos logit contributions
+0.088
+0.081
knit+0.081
+0.077
 Citiz+0.074
Neg logit contributions
ample-0.103
pport-0.096
alone-0.096
 proto-0.090
abama-0.088
Token been
Token ID587
Feature activation+6.11e-02

Pos logit contributions
ample+0.715
pport+0.658
alone+0.655
 proto+0.609
abama+0.590
Neg logit contributions
-0.869
-0.814
knit-0.806
-0.774
 Citiz-0.758
Token there
Token ID612
Feature activation+1.21e+00

Pos logit contributions
+0.257
+0.241
knit+0.238
+0.226
 Citiz+0.223
Neg logit contributions
ample-0.245
pport-0.227
alone-0.226
 proto-0.211
abama-0.206
Token and
Token ID290
Feature activation+2.48e-01

Pos logit contributions
+5.066
+4.755
knit+4.754
+4.460
CHA+4.417
Neg logit contributions
ample-4.606
pport-4.245
alone-4.205
abama-3.909
 proto-3.905
Token you
Token ID345
Feature activation+1.18e-01

Pos logit contributions
+0.945
+0.873
knit+0.870
+0.821
 Citiz+0.803
Neg logit contributions
ample-1.095
pport-1.018
alone-1.017
 proto-0.956
abama-0.934
Token've
Token ID1053
Feature activation+4.71e-02

Pos logit contributions
+0.453
knit+0.419
+0.417
+0.395
 Citiz+0.381
Neg logit contributions
ample-0.516
pport-0.478
alone-0.476
 proto-0.449
abama-0.438
Token worked
Token ID3111
Feature activation+4.87e-01

Pos logit contributions
+0.188
+0.174
knit+0.173
+0.165
 Citiz+0.160
Neg logit contributions
ample-0.200
pport-0.186
alone-0.184
 proto-0.173
abama-0.169
Token those
Token ID883
Feature activation-1.27e-01

Pos logit contributions
+1.952
+1.806
knit+1.793
+1.695
 Citiz+1.661
Neg logit contributions
ample-2.029
pport-1.887
alone-1.882
 proto-1.756
abama-1.736
Token obtained
Token ID6492
Feature activation-5.28e-02

Pos logit contributions
+1.637
+1.524
knit+1.507
+1.433
 Citiz+1.410
Neg logit contributions
ample-1.664
pport-1.537
alone-1.529
 proto-1.424
abama-1.409
Token nude
Token ID26349
Feature activation+3.02e-01

Pos logit contributions
ample+0.207
pport+0.192
alone+0.191
 proto+0.177
abama+0.173
Neg logit contributions
-0.227
-0.212
knit-0.211
-0.201
 Citiz-0.198
Token photos
Token ID5205
Feature activation+3.50e-01

Pos logit contributions
+1.292
+1.201
knit+1.191
+1.140
 Citiz+1.120
Neg logit contributions
ample-1.191
pport-1.104
alone-1.101
 proto-1.022
abama-0.996
Token of
Token ID286
Feature activation+3.79e-01

Pos logit contributions
+1.537
+1.435
knit+1.417
+1.362
 Citiz+1.332
Neg logit contributions
ample-1.339
pport-1.243
alone-1.228
 proto-1.147
abama-1.124
Token Jennifer
Token ID16348
Feature activation+3.34e-01

Pos logit contributions
+1.489
+1.377
knit+1.371
+1.299
 Citiz+1.286
Neg logit contributions
ample-1.619
pport-1.518
alone-1.507
 proto-1.401
abama-1.379
Token Lawrence
Token ID13914
Feature activation+1.21e+00

Pos logit contributions
+0.971
+0.875
knit+0.860
+0.807
 Citiz+0.784
Neg logit contributions
ample-1.779
pport-1.683
alone-1.677
 proto-1.598
 amp-1.567
Token and
Token ID290
Feature activation+2.75e-01

Pos logit contributions
+4.753
+4.414
knit+4.367
+4.239
 Citiz+4.040
Neg logit contributions
ample-5.005
pport-4.715
alone-4.543
 proto-4.340
abama-4.309
Token other
Token ID584
Feature activation+8.75e-02

Pos logit contributions
+1.043
+0.961
knit+0.952
+0.906
 Citiz+0.887
Neg logit contributions
ample-1.220
pport-1.143
alone-1.137
 proto-1.068
abama-1.044
Token celebrities
Token ID21305
Feature activation+4.71e-01

Pos logit contributions
+0.321
+0.296
knit+0.293
+0.278
 Citiz+0.272
Neg logit contributions
ample-0.399
pport-0.374
alone-0.373
 proto-0.351
abama-0.343
Token and
Token ID290
Feature activation-1.50e-01

Pos logit contributions
+1.917
+1.783
knit+1.764
+1.693
 Citiz+1.656
Neg logit contributions
ample-1.950
pport-1.818
alone-1.796
 proto-1.686
abama-1.656
Token posted
Token ID4481
Feature activation-3.44e-01

Pos logit contributions
ample+0.653
pport+0.609
alone+0.608
 proto+0.573
 amp+0.558
Neg logit contributions
-0.581
-0.539
knit-0.532
-0.508
 Citiz-0.497
Token activist
Token ID11276
Feature activation-2.58e-01

Pos logit contributions
+0.372
+0.341
knit+0.340
+0.323
 Citiz+0.315
Neg logit contributions
ample-0.462
pport-0.433
alone-0.432
 proto-0.403
abama-0.397
Token had
Token ID550
Feature activation-2.66e-01

Pos logit contributions
ample+1.110
alone+1.038
pport+1.033
 proto+0.983
 amp+0.956
Neg logit contributions
-1.003
-0.940
knit-0.921
-0.879
 Citiz-0.857
Token received
Token ID2722
Feature activation-2.37e-01

Pos logit contributions
ample+1.088
pport+1.010
alone+1.007
 proto+0.942
 amp+0.917
Neg logit contributions
-1.104
-1.028
knit-1.018
-0.974
 Citiz-0.955
Token th
Token ID294
Feature activation-2.87e-01

Pos logit contributions
ample+0.904
alone+0.834
pport+0.833
 proto+0.774
abama+0.752
Neg logit contributions
-1.047
-0.980
knit-0.971
-0.931
 Citiz-0.916
Tokenrea
Token ID21468
Feature activation-5.33e-01

Pos logit contributions
ample+1.047
pport+0.962
alone+0.960
 proto+0.892
 amp+0.865
Neg logit contributions
-1.313
-1.232
knit-1.219
-1.173
 Citiz-1.153
Token...
Token ID986
Feature activation-1.77e+00

Pos logit contributions
ample+1.932
pport+1.771
alone+1.765
 proto+1.642
 amp+1.592
Neg logit contributions
-2.458
-2.306
knit-2.287
-2.196
 Citiz-2.161
Token
Token ID198
Feature activation-4.09e-02

Pos logit contributions
ample+5.346
alone+5.034
pport+4.908
 proto+4.704
 amp+4.415
Neg logit contributions
-8.402
-8.156
knit-7.904
-7.692
 Citiz-7.652
Token
Token ID198
Feature activation-3.97e-02

Pos logit contributions
ample+0.147
pport+0.136
alone+0.134
 proto+0.125
abama+0.121
Neg logit contributions
-0.190
-0.178
knit-0.177
-0.170
CHA-0.164
TokenFrom
Token ID4863
Feature activation+2.28e-01

Pos logit contributions
ample+0.174
alone+0.162
pport+0.162
 proto+0.152
 amp+0.148
Neg logit contributions
-0.153
-0.142
knit-0.140
-0.134
 Citiz-0.132
Token:
Token ID25
Feature activation+6.56e-02

Pos logit contributions
+0.875
+0.809
knit+0.799
+0.763
 Citiz+0.749
Neg logit contributions
ample-1.003
pport-0.936
alone-0.933
 proto-0.877
 amp-0.855
Token A
Token ID317
Feature activation-1.17e-02

Pos logit contributions
+0.264
+0.246
knit+0.242
+0.232
 Citiz+0.227
Neg logit contributions
ample-0.277
pport-0.257
alone-0.257
 proto-0.241
 amp-0.235
Tokenain
Token ID391
Feature activation+3.80e-02

Pos logit contributions
+1.653
knit+1.538
+1.537
+1.451
 Citiz+1.418
Neg logit contributions
ample-1.559
pport-1.478
alone-1.463
 proto-1.401
 amp-1.351
Token told
Token ID1297
Feature activation+1.89e-01

Pos logit contributions
+0.143
+0.132
knit+0.130
+0.124
 Citiz+0.121
Neg logit contributions
ample-0.170
pport-0.159
alone-0.158
 proto-0.149
 amp-0.146
Token a
Token ID257
Feature activation+1.70e-01

Pos logit contributions
+0.769
+0.712
knit+0.707
+0.675
 Citiz+0.666
Neg logit contributions
ample-0.785
pport-0.726
alone-0.723
 proto-0.676
abama-0.661
Token Phoenix
Token ID9643
Feature activation-1.27e-02

Pos logit contributions
+0.754
+0.707
knit+0.700
+0.671
 Citiz+0.665
Neg logit contributions
ample-0.643
pport-0.590
alone-0.585
 proto-0.540
abama-0.532
Token ra
Token ID2179
Feature activation-4.01e-01

Pos logit contributions
ample+0.047
pport+0.043
alone+0.042
 proto+0.039
abama+0.038
Neg logit contributions
-0.058
-0.055
knit-0.054
-0.052
 Citiz-0.051
Token­
Token ID3907
Feature activation-1.57e+00

Pos logit contributions
ample+1.384
alone+1.261
pport+1.257
 proto+1.167
 amp+1.127
Neg logit contributions
-1.919
-1.808
knit-1.789
-1.722
 Citiz-1.693
Tokend
Token ID67
Feature activation-6.87e-01

Pos logit contributions
ample+6.161
pport+5.758
alone+5.742
abama+5.122
 proto+5.065
Neg logit contributions
-6.560
-6.077
knit-6.021
 Citiz-5.863
-5.851
Tokenio
Token ID952
Feature activation-1.49e-01

Pos logit contributions
ample+3.084
alone+2.876
pport+2.874
 proto+2.693
 amp+2.638
Neg logit contributions
-2.571
-2.388
knit-2.355
-2.239
 Citiz-2.206
Token st
Token ID336
Feature activation-1.44e-01

Pos logit contributions
ample+0.581
pport+0.536
alone+0.536
 proto+0.498
abama+0.485
Neg logit contributions
-0.648
-0.606
knit-0.600
-0.575
 Citiz-0.566
Tokena
Token ID64
Feature activation+2.73e-01

Pos logit contributions
ample+0.593
pport+0.553
alone+0.549
 proto+0.518
 amp+0.503
Neg logit contributions
-0.587
-0.546
knit-0.537
-0.514
 Citiz-0.505
Token­
Token ID3907
Feature activation-8.91e-01

Pos logit contributions
+1.281
+1.205
knit+1.188
+1.144
 Citiz+1.125
Neg logit contributions
ample-0.961
alone-0.874
pport-0.872
 proto-0.811
abama-0.783
Token.
Token ID13
Feature activation-6.62e-02

Pos logit contributions
ample+4.050
alone+3.736
pport+3.692
 proto+3.529
 amp+3.468
Neg logit contributions
-3.707
-3.527
knit-3.450
-3.307
 Citiz-3.206
Token REUTERS
Token ID15862
Feature activation-2.48e-01

Pos logit contributions
ample+0.293
pport+0.273
alone+0.272
 proto+0.257
 amp+0.250
Neg logit contributions
-0.254
-0.234
knit-0.231
-0.221
 Citiz-0.216
Token/
Token ID14
Feature activation+4.92e-01

Pos logit contributions
ample+1.011
alone+0.938
pport+0.937
 proto+0.877
 amp+0.857
Neg logit contributions
-1.029
-0.964
knit-0.952
-0.910
 Citiz-0.900
TokenCl
Token ID2601
Feature activation-4.29e-01

Pos logit contributions
+1.718
+1.553
knit+1.534
+1.457
 Citiz+1.419
Neg logit contributions
ample-2.360
pport-2.211
alone-2.186
 proto-2.098
abama-2.049
Tokenod
Token ID375
Feature activation-1.63e-01

Pos logit contributions
ample+1.490
pport+1.356
alone+1.355
 proto+1.251
 amp+1.211
Neg logit contributions
-2.052
-1.932
knit-1.912
-1.843
 Citiz-1.813
Tokenagh
Token ID10471
Feature activation-1.50e+00

Pos logit contributions
ample+0.642
pport+0.595
alone+0.593
 proto+0.556
 amp+0.540
Neg logit contributions
-0.699
-0.654
knit-0.647
-0.616
 Citiz-0.606
Token Kil
Token ID19902
Feature activation-6.39e-01

Pos logit contributions
ample+6.196
alone+5.874
pport+5.693
 proto+5.685
 amp+5.514
Neg logit contributions
-5.795
-5.468
knit-5.269
-5.075
 Citiz-5.044
Tokenc
Token ID66
Feature activation+4.10e-02

Pos logit contributions
ample+2.629
alone+2.447
pport+2.435
 proto+2.274
 amp+2.224
Neg logit contributions
-2.627
-2.457
knit-2.401
-2.346
 Citiz-2.276
Tokenoy
Token ID726
Feature activation-2.05e-02

Pos logit contributions
+0.149
+0.137
knit+0.135
+0.129
 Citiz+0.126
Neg logit contributions
ample-0.188
pport-0.176
alone-0.174
 proto-0.166
 amp-0.162
Tokenne
Token ID710
Feature activation+2.32e-01

Pos logit contributions
ample+0.063
pport+0.056
alone+0.056
 proto+0.051
 amp+0.049
Neg logit contributions
-0.107
-0.101
knit-0.100
-0.096
 Citiz-0.095
Token
Token ID198
Feature activation-7.51e-02

Pos logit contributions
+1.050
+0.976
knit+0.971
+0.933
 Citiz+0.917
Neg logit contributions
ample-0.863
pport-0.793
alone-0.788
 proto-0.731
abama-0.709
Token10
Token ID940
Feature activation-1.67e-02

Pos logit contributions
ample+0.261
pport+0.245
alone+0.245
 proto+0.232
abama+0.228
Neg logit contributions
-0.194
-0.179
knit-0.177
-0.168
 Citiz-0.159
Token.
Token ID13
Feature activation-4.01e-01

Pos logit contributions
ample+0.081
pport+0.076
alone+0.076
 proto+0.072
abama+0.070
Neg logit contributions
-0.057
-0.052
knit-0.051
-0.048
 Citiz-0.046
Token13
Token ID1485
Feature activation-6.97e-01

Pos logit contributions
ample+1.672
pport+1.567
alone+1.543
 proto+1.450
 amp+1.419
Neg logit contributions
-1.620
-1.516
knit-1.491
-1.416
 Citiz-1.412
Token71
Token ID4869
Feature activation-6.45e-01

Pos logit contributions
ample+2.588
pport+2.456
alone+2.428
 amp+2.241
 proto+2.238
Neg logit contributions
-3.094
-2.858
knit-2.852
-2.757
 Citiz-2.702
Token/
Token ID14
Feature activation-6.89e-01

Pos logit contributions
ample+2.659
pport+2.466
alone+2.464
 proto+2.307
 amp+2.250
Neg logit contributions
-2.651
-2.472
knit-2.443
-2.336
 Citiz-2.308
Tokenjournal
Token ID24891
Feature activation-1.43e+00

Pos logit contributions
ample+2.714
pport+2.530
alone+2.514
 proto+2.358
 amp+2.310
Neg logit contributions
-2.891
-2.720
knit-2.714
-2.567
 Citiz-2.545
Token.
Token ID13
Feature activation-1.57e-01

Pos logit contributions
ample+5.983
alone+5.620
pport+5.491
 amp+4.977
 proto+4.970
Neg logit contributions
-5.593
 Citiz-5.528
-5.315
knit-5.180
-5.024
Tokenp
Token ID79
Feature activation-1.01e+00

Pos logit contributions
ample+0.718
pport+0.672
alone+0.670
 proto+0.633
 amp+0.618
Neg logit contributions
-0.573
-0.528
knit-0.522
-0.496
 Citiz-0.487
Tokenone
Token ID505
Feature activation-3.35e-01

Pos logit contributions
ample+3.518
pport+3.204
alone+3.192
 amp+2.836
 proto+2.829
Neg logit contributions
-4.911
-4.570
knit-4.449
 Citiz-4.445
-4.395
Token.
Token ID13
Feature activation-5.62e-01

Pos logit contributions
ample+1.569
pport+1.468
alone+1.468
 proto+1.383
 amp+1.356
Neg logit contributions
-1.192
-1.098
knit-1.082
-1.027
 Citiz-1.019
Token00
Token ID405
Feature activation-4.37e-01

Pos logit contributions
ample+2.663
pport+2.494
alone+2.484
 proto+2.330
 amp+2.298
Neg logit contributions
-1.959
-1.800
knit-1.752
 Citiz-1.692
-1.688
Token.
Token ID13
Feature activation-1.17e-01

Pos logit contributions
ample+0.855
pport+0.797
alone+0.796
 proto+0.750
 amp+0.732
Neg logit contributions
-0.742
-0.687
knit-0.679
-0.647
 Citiz-0.634
Token All
Token ID1439
Feature activation-9.07e-02

Pos logit contributions
ample+0.473
pport+0.440
alone+0.437
 proto+0.409
abama+0.398
Neg logit contributions
-0.491
-0.458
knit-0.452
-0.433
 Citiz-0.425
Token of
Token ID286
Feature activation-8.04e-03

Pos logit contributions
ample+0.349
pport+0.323
alone+0.323
 proto+0.300
abama+0.291
Neg logit contributions
-0.397
-0.371
knit-0.368
-0.352
 Citiz-0.346
Token those
Token ID883
Feature activation-2.31e-01

Pos logit contributions
ample+0.033
alone+0.030
pport+0.030
 proto+0.028
abama+0.028
Neg logit contributions
-0.033
-0.031
knit-0.031
-0.029
 Citiz-0.029
Token el
Token ID1288
Feature activation+8.75e-02

Pos logit contributions
ample+0.894
alone+0.825
pport+0.824
 proto+0.765
abama+0.744
Neg logit contributions
-1.007
-0.941
knit-0.931
-0.892
 Citiz-0.877
Token...
Token ID986
Feature activation-1.43e+00

Pos logit contributions
+0.455
+0.431
knit+0.426
+0.412
 Citiz+0.406
Neg logit contributions
ample-0.264
alone-0.240
pport-0.238
 proto-0.216
 amp-0.207
Token
Token ID198
Feature activation-1.94e-02

Pos logit contributions
ample+4.325
alone+4.025
pport+3.946
 proto+3.729
 amp+3.504
Neg logit contributions
-7.047
-6.773
knit-6.598
-6.457
 Citiz-6.377
Token
Token ID198
Feature activation-3.56e-02

Pos logit contributions
ample+0.070
pport+0.065
alone+0.064
 proto+0.060
abama+0.058
Neg logit contributions
-0.090
-0.084
knit-0.084
-0.080
CHA-0.078
TokenIt
Token ID1026
Feature activation-4.92e-02

Pos logit contributions
ample+0.155
alone+0.144
pport+0.144
 proto+0.135
 amp+0.132
Neg logit contributions
-0.138
-0.129
knit-0.127
-0.121
 Citiz-0.119
Token looks
Token ID3073
Feature activation-4.94e-01

Pos logit contributions
ample+0.200
pport+0.185
alone+0.185
 proto+0.173
 amp+0.169
Neg logit contributions
-0.205
-0.192
knit-0.189
-0.181
 Citiz-0.178
Token increasingly
Token ID6481
Feature activation-2.64e-01

Pos logit contributions
ample+1.862
pport+1.715
alone+1.711
 proto+1.607
 amp+1.554
Neg logit contributions
-2.203
-2.067
knit-2.039
-1.961
 Citiz-1.930
Token in
Token ID287
Feature activation-3.09e-01

Pos logit contributions
ample+3.193
alone+2.935
pport+2.914
 proto+2.744
 amp+2.669
Neg logit contributions
-3.872
-3.634
knit-3.623
 Citiz-3.456
-3.431
Token the
Token ID262
Feature activation-5.04e-01

Pos logit contributions
ample+1.306
pport+1.216
alone+1.214
 proto+1.139
 amp+1.111
Neg logit contributions
-1.235
-1.152
knit-1.148
-1.085
 Citiz-1.075
Token current
Token ID1459
Feature activation+2.33e-01

Pos logit contributions
ample+1.800
alone+1.654
pport+1.645
 proto+1.539
 amp+1.485
Neg logit contributions
-2.344
-2.208
knit-2.192
-2.097
 Citiz-2.069
Token bill
Token ID2855
Feature activation-2.12e-01

Pos logit contributions
+1.065
+0.997
knit+0.984
+0.953
 Citiz+0.932
Neg logit contributions
ample-0.849
pport-0.782
alone-0.778
 proto-0.720
abama-0.703
Token that
Token ID326
Feature activation-2.85e-01

Pos logit contributions
ample+0.907
pport+0.843
alone+0.842
 proto+0.793
 amp+0.773
Neg logit contributions
-0.836
-0.777
knit-0.769
-0.732
 Citiz-0.720
Token enshr
Token ID46919
Feature activation-1.40e+00

Pos logit contributions
ample+1.154
pport+1.068
alone+1.067
 proto+1.000
 amp+0.973
Neg logit contributions
-1.192
-1.111
knit-1.099
-1.052
 Citiz-1.033
Tokenined
Token ID1389
Feature activation-8.90e-01

Pos logit contributions
ample+4.588
alone+4.178
pport+4.140
 proto+3.598
 amp+3.595
Neg logit contributions
-6.756
-6.446
 Citiz-6.350
knit-6.223
-6.084
Token that
Token ID326
Feature activation-6.30e-01

Pos logit contributions
ample+3.682
pport+3.403
alone+3.378
 proto+3.233
 amp+3.147
Neg logit contributions
-3.617
-3.387
knit-3.369
-3.186
 Citiz-3.144
Token protection
Token ID4800
Feature activation-5.84e-02

Pos logit contributions
ample+2.494
alone+2.291
pport+2.291
 proto+2.159
 amp+2.092
Neg logit contributions
-2.699
-2.529
knit-2.502
-2.387
 Citiz-2.355
Token.
Token ID13
Feature activation+4.22e-02

Pos logit contributions
ample+0.254
pport+0.237
alone+0.237
 proto+0.223
 amp+0.217
Neg logit contributions
-0.227
-0.210
knit-0.208
-0.198
 Citiz-0.194
Token
Token ID198
Feature activation+2.24e-02

Pos logit contributions
+0.189
+0.177
knit+0.174
+0.168
 Citiz+0.164
Neg logit contributions
ample-0.160
pport-0.148
alone-0.146
 proto-0.137
abama-0.132
Token takeover
Token ID27011
Feature activation-1.08e-02

Pos logit contributions
+1.535
+1.431
knit+1.407
+1.356
 Citiz+1.334
Neg logit contributions
ample-1.464
alone-1.360
pport-1.359
 proto-1.247
abama-1.226
Token of
Token ID286
Feature activation+2.24e-01

Pos logit contributions
ample+0.039
pport+0.036
alone+0.036
 proto+0.033
 amp+0.032
Neg logit contributions
-0.050
-0.047
knit-0.046
-0.044
 Citiz-0.044
Token V
Token ID569
Feature activation+2.57e-01

Pos logit contributions
+0.918
+0.854
knit+0.844
+0.809
 Citiz+0.796
Neg logit contributions
ample-0.921
alone-0.861
pport-0.860
 proto-0.793
abama-0.779
Tokenenez
Token ID11437
Feature activation+3.85e-01

Pos logit contributions
+0.958
+0.884
knit+0.871
+0.831
 Citiz+0.809
Neg logit contributions
ample-1.156
pport-1.087
alone-1.083
 proto-1.022
 amp-0.995
Tokenue
Token ID518
Feature activation-3.51e-01

Pos logit contributions
+1.727
+1.615
knit+1.593
+1.536
 Citiz+1.506
Neg logit contributions
ample-1.432
pport-1.319
alone-1.314
 proto-1.228
 amp-1.196
Token...
Token ID986
Feature activation-1.36e+00

Pos logit contributions
ample+1.337
pport+1.232
alone+1.229
 proto+1.166
 amp+1.122
Neg logit contributions
-1.533
-1.446
knit-1.432
-1.373
 Citiz-1.336
Token
Token ID198
Feature activation-4.40e-02

Pos logit contributions
ample+4.099
alone+3.798
pport+3.728
 proto+3.516
abama+3.286
Neg logit contributions
-6.734
-6.472
knit-6.326
-6.159
 Citiz-6.096
Token
Token ID198
Feature activation-5.42e-02

Pos logit contributions
ample+0.159
pport+0.146
alone+0.144
 proto+0.136
abama+0.131
Neg logit contributions
-0.204
-0.190
knit-0.190
-0.182
CHA-0.176
TokenFrom
Token ID4863
Feature activation+2.50e-01

Pos logit contributions
ample+0.236
alone+0.219
pport+0.219
 proto+0.206
 amp+0.201
Neg logit contributions
-0.210
-0.195
knit-0.193
-0.184
 Citiz-0.181
Token Vox
Token ID28035
Feature activation+2.60e-01

Pos logit contributions
+0.899
+0.827
knit+0.816
+0.776
 Citiz+0.759
Neg logit contributions
ample-1.162
pport-1.089
alone-1.087
 proto-1.026
 amp-1.003
Token:
Token ID25
Feature activation-1.02e-01

Pos logit contributions
+0.971
+0.895
knit+0.883
+0.841
 Citiz+0.824
Neg logit contributions
ample-1.165
pport-1.092
alone-1.087
 proto-1.027
 amp-1.003
Tokenale
Token ID1000
Feature activation+1.01e-01

Pos logit contributions
ample+0.154
pport+0.145
alone+0.141
 proto+0.136
 amp+0.129
Neg logit contributions
-0.190
knit-0.179
-0.178
-0.169
CHA-0.164
Token br
Token ID865
Feature activation+3.95e-02

Pos logit contributions
+0.476
+0.445
knit+0.441
+0.424
 Citiz+0.417
Neg logit contributions
ample-0.359
alone-0.327
pport-0.327
 proto-0.299
abama-0.291
Tokenachy
Token ID35586
Feature activation-3.58e-01

Pos logit contributions
+0.176
knit+0.165
+0.165
+0.157
CHA+0.152
Neg logit contributions
ample-0.144
pport-0.135
alone-0.133
 proto-0.126
 amp-0.120
Tokenot
Token ID313
Feature activation-1.14e-01

Pos logit contributions
ample+1.763
pport+1.660
alone+1.652
 proto+1.577
 amp+1.541
Neg logit contributions
-1.174
-1.075
knit-1.066
-0.998
 Citiz-0.965
Tokenis
Token ID271
Feature activation-4.67e-01

Pos logit contributions
ample+0.448
pport+0.416
alone+0.412
 proto+0.390
 amp+0.377
Neg logit contributions
-0.485
-0.455
knit-0.451
-0.429
 Citiz-0.417
Token vict
Token ID2210
Feature activation-1.36e+00

Pos logit contributions
ample+2.087
pport+1.944
alone+1.943
 proto+1.837
 amp+1.795
Neg logit contributions
-1.758
-1.629
knit-1.610
-1.531
 Citiz-1.501
Tokenoria
Token ID7661
Feature activation-2.58e-01

Pos logit contributions
ample+5.044
alone+4.644
pport+4.603
 proto+4.169
 amp+4.113
Neg logit contributions
-6.054
-5.675
knit-5.542
-5.409
 Citiz-5.387
Tokene
Token ID68
Feature activation-1.37e-01

Pos logit contributions
ample+1.148
pport+1.071
alone+1.069
 proto+1.008
 amp+0.984
Neg logit contributions
-0.979
-0.906
knit-0.895
-0.853
 Citiz-0.835
Token.
Token ID13
Feature activation-3.08e-01

Pos logit contributions
ample+0.612
pport+0.571
alone+0.570
 proto+0.538
 amp+0.524
Neg logit contributions
-0.518
-0.479
knit-0.472
-0.450
 Citiz-0.440
Token
Token ID198
Feature activation+2.82e-02

Pos logit contributions
ample+1.150
pport+1.058
alone+1.056
 proto+0.983
 amp+0.954
Neg logit contributions
-1.386
-1.299
knit-1.285
-1.235
 Citiz-1.213
Token
Token ID198
Feature activation-7.21e-02

Pos logit contributions
+0.128
knit+0.119
+0.119
+0.114
CHA+0.110
Neg logit contributions
ample-0.104
pport-0.097
alone-0.095
 proto-0.090
abama-0.087
Token There
Token ID1318
Feature activation+5.73e-01

Pos logit contributions
+0.645
+0.599
knit+0.598
+0.568
 Citiz+0.557
Neg logit contributions
ample-0.626
pport-0.583
alone-0.579
 proto-0.541
abama-0.530
Token are
Token ID389
Feature activation+3.60e-01

Pos logit contributions
+2.151
+1.983
knit+1.966
+1.869
 Citiz+1.833
Neg logit contributions
ample-2.560
pport-2.380
alone-2.373
 proto-2.242
abama-2.201
Token knit
Token ID30495
Feature activation+5.05e-01

Pos logit contributions
+1.524
knit+1.422
+1.415
+1.348
 Citiz+1.326
Neg logit contributions
ample-1.428
alone-1.327
pport-1.322
 proto-1.219
abama-1.212
Token-
Token ID12
Feature activation+1.23e-01

Pos logit contributions
+2.222
+2.071
knit+2.065
+1.969
 Citiz+1.933
Neg logit contributions
ample-1.941
pport-1.789
alone-1.782
 proto-1.664
abama-1.619
Tokena
Token ID64
Feature activation+3.79e-02

Pos logit contributions
+0.479
+0.444
knit+0.437
+0.418
 Citiz+0.410
Neg logit contributions
ample-0.536
alone-0.499
pport-0.499
 proto-0.469
 amp-0.457
Token...
Token ID986
Feature activation-1.35e+00

Pos logit contributions
+0.145
+0.134
knit+0.132
+0.126
 Citiz+0.124
Neg logit contributions
ample-0.167
pport-0.156
alone-0.156
 proto-0.147
 amp-0.143
Token<|endoftext|>
Token ID50256
Feature activation+3.92e-02

Pos logit contributions
ample+4.177
alone+3.880
pport+3.839
 proto+3.610
 amp+3.401
Neg logit contributions
-6.607
-6.341
knit-6.126
-6.021
 Citiz-5.965
TokenBuy
Token ID14518
Feature activation-2.35e-01

Pos logit contributions
+0.151
+0.139
knit+0.138
+0.131
 Citiz+0.128
Neg logit contributions
ample-0.172
pport-0.160
alone-0.160
 proto-0.151
 amp-0.147
Token Photo
Token ID5555
Feature activation-9.57e-02

Pos logit contributions
ample+0.895
alone+0.826
pport+0.826
 proto+0.771
 amp+0.750
Neg logit contributions
-1.038
-0.975
knit-0.960
-0.925
 Citiz-0.908
Token F
Token ID376
Feature activation-3.06e-01

Pos logit contributions
ample+0.411
pport+0.383
alone+0.382
 proto+0.360
 amp+0.351
Neg logit contributions
-0.376
-0.349
knit-0.345
-0.330
 Citiz-0.323
TokenSU
Token ID12564
Feature activation-5.62e-02

Pos logit contributions
ample+1.266
pport+1.174
alone+1.171
 proto+1.097
 amp+1.070
Neg logit contributions
-1.259
-1.174
knit-1.159
-1.110
 Citiz-1.091
Token This
Token ID770
Feature activation-2.99e-01

Pos logit contributions
ample+0.376
pport+0.352
alone+0.349
 proto+0.328
abama+0.319
Neg logit contributions
-0.372
-0.347
knit-0.342
-0.327
 Citiz-0.318
Token screen
Token ID3159
Feature activation+9.55e-03

Pos logit contributions
ample+1.166
pport+1.079
alone+1.075
 proto+1.003
abama+0.973
Neg logit contributions
-1.298
-1.214
knit-1.199
-1.150
 Citiz-1.128
Token also
Token ID635
Feature activation-7.36e-01

Pos logit contributions
+0.042
+0.039
knit+0.039
+0.037
 Citiz+0.036
Neg logit contributions
ample-0.037
pport-0.034
alone-0.034
 proto-0.031
abama-0.031
Token includes
Token ID3407
Feature activation-4.05e-01

Pos logit contributions
ample+2.257
pport+2.036
alone+2.032
 proto+1.861
 amp+1.797
Neg logit contributions
-3.808
-3.600
knit-3.566
-3.449
 Citiz-3.402
Token a
Token ID257
Feature activation-3.33e-01

Pos logit contributions
ample+1.590
pport+1.468
alone+1.465
 proto+1.371
 amp+1.334
Neg logit contributions
-1.747
-1.631
knit-1.615
-1.548
 Citiz-1.521
Token native
Token ID6868
Feature activation-1.34e+00

Pos logit contributions
ample+1.311
pport+1.211
alone+1.209
 proto+1.131
 amp+1.100
Neg logit contributions
-1.429
-1.335
knit-1.320
-1.266
 Citiz-1.243
Token Bl
Token ID1086
Feature activation+1.25e-01

Pos logit contributions
ample+5.019
alone+4.556
pport+4.527
 proto+4.328
 amp+4.308
Neg logit contributions
-5.874
knit-5.423
-5.423
-5.216
 Citiz-5.190
Tokenur
Token ID333
Feature activation+2.18e-01

Pos logit contributions
+0.532
+0.498
knit+0.491
+0.471
 Citiz+0.456
Neg logit contributions
ample-0.487
pport-0.458
alone-0.450
 proto-0.427
abama-0.411
Token Reduction
Token ID33396
Feature activation-2.44e-01

Pos logit contributions
+0.845
+0.786
knit+0.773
+0.739
 Citiz+0.721
Neg logit contributions
ample-0.945
pport-0.885
alone-0.876
 proto-0.828
abama-0.810
Token mode
Token ID4235
Feature activation-6.42e-01

Pos logit contributions
ample+0.928
pport+0.853
alone+0.853
 proto+0.797
 amp+0.777
Neg logit contributions
-1.080
-1.010
knit-1.002
-0.961
 Citiz-0.944
Token to
Token ID284
Feature activation-1.84e-01

Pos logit contributions
ample+2.774
alone+2.573
pport+2.571
 proto+2.437
 amp+2.391
Neg logit contributions
-2.513
-2.319
knit-2.303
-2.196
 Citiz-2.160
Token to
Token ID284
Feature activation-3.06e-01

Pos logit contributions
ample+3.616
alone+3.421
pport+3.321
 proto+3.074
 amp+2.994
Neg logit contributions
-5.352
-5.076
knit-4.983
-4.844
 Citiz-4.821
Token make
Token ID787
Feature activation-4.23e-01

Pos logit contributions
ample+1.220
pport+1.124
alone+1.123
 proto+1.056
 amp+1.032
Neg logit contributions
-1.302
-1.217
knit-1.203
-1.154
 Citiz-1.132
Token other
Token ID584
Feature activation-5.54e-02

Pos logit contributions
ample+1.797
pport+1.671
alone+1.668
 proto+1.569
 amp+1.529
Neg logit contributions
-1.689
-1.570
knit-1.551
-1.482
 Citiz-1.453
Token smartphone
Token ID11745
Feature activation+2.88e-01

Pos logit contributions
ample+0.217
pport+0.200
alone+0.200
 proto+0.185
abama+0.182
Neg logit contributions
-0.240
-0.224
knit-0.222
-0.212
 Citiz-0.209
Token owners
Token ID4393
Feature activation+2.38e-02

Pos logit contributions
+1.144
+1.061
knit+1.052
+1.002
 Citiz+0.979
Neg logit contributions
ample-1.230
pport-1.138
alone-1.134
abama-1.057
 proto-1.055
Token jealous
Token ID19354
Feature activation-1.32e+00

Pos logit contributions
+0.095
+0.088
knit+0.087
+0.083
 Citiz+0.081
Neg logit contributions
ample-0.101
pport-0.094
alone-0.094
 proto-0.088
abama-0.086
Token.
Token ID13
Feature activation-2.32e-02

Pos logit contributions
ample+5.343
alone+4.920
pport+4.879
 proto+4.599
 amp+4.508
Neg logit contributions
-5.346
-4.956
knit-4.920
 Citiz-4.764
-4.668
Token What
Token ID1867
Feature activation+3.58e-01

Pos logit contributions
ample+0.096
pport+0.089
alone+0.088
 proto+0.083
abama+0.081
Neg logit contributions
-0.096
-0.089
knit-0.088
-0.084
 Citiz-0.082
Token
Token ID447
Feature activation+2.35e-01

Pos logit contributions
+1.431
+1.325
knit+1.312
+1.249
 Citiz+1.235
Neg logit contributions
ample-1.498
pport-1.401
alone-1.392
abama-1.296
 proto-1.291
Token
Token ID247
Feature activation+1.69e-01

Pos logit contributions
+0.734
+0.673
knit+0.653
+0.621
 Citiz+0.596
Neg logit contributions
ample-1.197
pport-1.129
alone-1.126
 proto-1.076
 amp-1.050
Tokens
Token ID82
Feature activation+2.81e-01

Pos logit contributions
+0.712
+0.663
knit+0.655
+0.629
 Citiz+0.613
Neg logit contributions
ample-0.677
pport-0.628
alone-0.625
 proto-0.589
abama-0.573
Token
Token ID247
Feature activation-9.70e-02

Pos logit contributions
+0.555
+0.503
knit+0.482
+0.462
 Citiz+0.425
Neg logit contributions
ample-1.039
pport-0.982
alone-0.979
 proto-0.936
 amp-0.917
Tokens
Token ID82
Feature activation+1.82e-01

Pos logit contributions
ample+0.435
pport+0.405
alone+0.405
 proto+0.382
 amp+0.373
Neg logit contributions
-0.365
-0.337
knit-0.333
-0.317
 Citiz-0.311
Token anti
Token ID3098
Feature activation-1.77e-01

Pos logit contributions
+0.854
+0.794
knit+0.785
+0.763
 Citiz+0.745
Neg logit contributions
ample-0.647
alone-0.593
pport-0.592
 proto-0.542
abama-0.530
Token-
Token ID12
Feature activation+2.86e-01

Pos logit contributions
ample+0.855
pport+0.802
alone+0.801
 proto+0.758
 amp+0.742
Neg logit contributions
-0.601
-0.552
knit-0.543
-0.514
 Citiz-0.505
TokenPet
Token ID25803
Feature activation-5.72e-01

Pos logit contributions
+1.156
+1.068
knit+1.057
+1.016
 Citiz+0.983
Neg logit contributions
ample-1.196
pport-1.118
alone-1.112
 proto-1.050
abama-1.027
Tokenro
Token ID305
Feature activation-1.31e+00

Pos logit contributions
ample+2.459
pport+2.305
alone+2.283
 proto+2.184
 amp+2.090
Neg logit contributions
-2.229
-2.094
knit-2.043
-1.950
 Citiz-1.924
Tokencar
Token ID7718
Feature activation-5.65e-01

Pos logit contributions
ample+3.829
alone+3.500
pport+3.386
 proto+3.059
 amp+3.037
Neg logit contributions
-6.785
-6.450
 Citiz-6.307
knit-6.292
-6.154
Tokenibe
Token ID32438
Feature activation-1.62e-01

Pos logit contributions
ample+1.921
alone+1.751
pport+1.745
 proto+1.609
 amp+1.562
Neg logit contributions
-2.730
-2.564
knit-2.536
-2.453
 Citiz-2.450
Token position
Token ID2292
Feature activation-1.35e-01

Pos logit contributions
ample+0.612
pport+0.563
alone+0.563
 proto+0.523
 amp+0.508
Neg logit contributions
-0.725
-0.678
knit-0.671
-0.646
 Citiz-0.634
Token and
Token ID290
Feature activation+9.54e-02

Pos logit contributions
ample+0.533
pport+0.494
alone+0.492
 proto+0.460
 amp+0.448
Neg logit contributions
-0.579
-0.540
knit-0.534
-0.513
 Citiz-0.502
Token said
Token ID531
Feature activation+3.32e-01

Pos logit contributions
+0.403
+0.374
knit+0.370
+0.356
 Citiz+0.351
Neg logit contributions
ample-0.382
pport-0.355
alone-0.353
 proto-0.329
abama-0.322
Token
Token ID198
Feature activation+5.50e-02

Pos logit contributions
+0.548
+0.514
knit+0.508
+0.486
 Citiz+0.478
Neg logit contributions
ample-0.463
pport-0.430
alone-0.424
 proto-0.395
abama-0.385
Token
Token ID198
Feature activation-1.84e-02

Pos logit contributions
+0.248
+0.232
knit+0.231
+0.221
CHA+0.215
Neg logit contributions
ample-0.203
pport-0.189
alone-0.185
 proto-0.175
abama-0.170
TokenOff
Token ID9362
Feature activation-3.94e-01

Pos logit contributions
ample+0.078
pport+0.072
alone+0.072
 proto+0.068
 amp+0.066
Neg logit contributions
-0.074
-0.069
knit-0.068
-0.065
 Citiz-0.064
Tokener
Token ID263
Feature activation-5.49e-01

Pos logit contributions
ample+1.672
alone+1.549
pport+1.541
 proto+1.450
 amp+1.416
Neg logit contributions
-1.590
-1.483
knit-1.456
-1.400
 Citiz-1.387
Token:
Token ID25
Feature activation-5.13e-01

Pos logit contributions
ample+2.424
pport+2.264
alone+2.250
 proto+2.138
 amp+2.075
Neg logit contributions
-2.085
-1.927
knit-1.904
-1.809
 Citiz-1.805
Token No
Token ID1400
Feature activation-1.28e+00

Pos logit contributions
ample+1.864
pport+1.708
alone+1.707
 proto+1.591
 amp+1.540
Neg logit contributions
-2.360
-2.215
knit-2.192
-2.109
 Citiz-2.074
Token
Token ID198
Feature activation-8.17e-01

Pos logit contributions
ample+4.177
pport+3.825
alone+3.761
 proto+3.538
 amp+3.407
Neg logit contributions
-6.192
knit-5.850
-5.846
 Citiz-5.579
-5.551
Token
Token ID198
Feature activation-8.32e-01

Pos logit contributions
ample+2.644
alone+2.414
pport+2.378
 proto+2.188
 amp+2.133
Neg logit contributions
-4.065
-3.845
knit-3.791
 Citiz-3.736
-3.671
TokenTake
Token ID12322
Feature activation-8.48e-02

Pos logit contributions
ample+3.270
alone+2.992
pport+2.942
 proto+2.700
abama+2.663
Neg logit contributions
-3.574
-3.361
knit-3.288
 Citiz-3.254
-3.143
Tokenaways
Token ID23949
Feature activation-1.20e-01

Pos logit contributions
ample+0.381
alone+0.355
pport+0.354
 proto+0.334
 amp+0.326
Neg logit contributions
-0.318
-0.295
knit-0.290
-0.277
 Citiz-0.272
Token
Token ID198
Feature activation+1.52e-01

Pos logit contributions
ample+0.519
pport+0.483
alone+0.483
 proto+0.454
 amp+0.443
Neg logit contributions
-0.466
-0.432
knit-0.427
-0.408
 Citiz-0.399
Token card
Token ID2657
Feature activation-3.88e-01

Pos logit contributions
+1.231
+1.136
knit+1.121
+1.072
 Citiz+1.048
Neg logit contributions
ample-1.477
pport-1.378
alone-1.375
 proto-1.293
abama-1.264
Token companies
Token ID2706
Feature activation-3.62e-01

Pos logit contributions
ample+1.907
pport+1.791
alone+1.791
 proto+1.702
 amp+1.665
Neg logit contributions
-1.289
-1.182
knit-1.163
-1.096
 Citiz-1.072
Token;
Token ID26
Feature activation+2.88e-01

Pos logit contributions
ample+1.534
alone+1.425
pport+1.424
 proto+1.342
 amp+1.309
Neg logit contributions
-1.440
-1.344
knit-1.331
-1.263
 Citiz-1.239
Token now
Token ID783
Feature activation-6.26e-02

Pos logit contributions
+1.223
+1.137
knit+1.134
+1.077
 Citiz+1.064
Neg logit contributions
ample-1.138
pport-1.054
alone-1.050
 proto-0.973
abama-0.965
Token i
Token ID1312
Feature activation+1.27e-01

Pos logit contributions
ample+0.273
pport+0.255
alone+0.254
 proto+0.239
 amp+0.233
Neg logit contributions
-0.242
-0.225
knit-0.222
-0.212
 Citiz-0.207
Token...
Token ID986
Feature activation-1.28e+00

Pos logit contributions
+0.537
+0.500
knit+0.494
+0.473
 Citiz+0.464
Neg logit contributions
ample-0.512
pport-0.472
alone-0.472
 proto-0.443
abama-0.430
Token
Token ID198
Feature activation-2.51e-02

Pos logit contributions
ample+4.035
alone+3.761
pport+3.674
 proto+3.456
 amp+3.268
Neg logit contributions
-6.191
-5.912
knit-5.802
-5.633
 Citiz-5.563
Token
Token ID198
Feature activation-2.04e-03

Pos logit contributions
ample+0.090
pport+0.083
alone+0.082
 proto+0.077
abama+0.075
Neg logit contributions
-0.116
-0.109
knit-0.108
-0.104
CHA-0.100
TokenAlthough
Token ID7003
Feature activation-2.47e-01

Pos logit contributions
ample+0.009
pport+0.008
alone+0.008
 proto+0.008
 amp+0.008
Neg logit contributions
-0.008
-0.007
knit-0.007
-0.007
 Citiz-0.007
Token this
Token ID428
Feature activation-4.40e-01

Pos logit contributions
ample+1.036
pport+0.962
alone+0.960
 proto+0.903
 amp+0.879
Neg logit contributions
-1.001
-0.932
knit-0.921
-0.880
 Citiz-0.863
Token one
Token ID530
Feature activation-3.77e-03

Pos logit contributions
ample+1.732
pport+1.605
alone+1.603
 proto+1.510
 amp+1.466
Neg logit contributions
-1.882
-1.762
knit-1.740
-1.669
 Citiz-1.636
Token name
Token ID1438
Feature activation+2.48e-01

Pos logit contributions
ample+0.567
pport+0.524
alone+0.522
 proto+0.487
 amp+0.473
Neg logit contributions
-0.623
-0.582
knit-0.575
-0.551
 Citiz-0.542
Token is
Token ID318
Feature activation-9.40e-02

Pos logit contributions
+1.070
+1.002
knit+0.994
+0.946
 Citiz+0.921
Neg logit contributions
ample-0.974
pport-0.904
alone-0.891
 proto-0.832
abama-0.817
Token still
Token ID991
Feature activation-1.84e-01

Pos logit contributions
ample+0.369
pport+0.341
alone+0.340
 proto+0.317
 amp+0.308
Neg logit contributions
-0.406
-0.379
knit-0.375
-0.359
 Citiz-0.353
Token a
Token ID257
Feature activation+2.83e-02

Pos logit contributions
ample+0.711
pport+0.656
alone+0.655
 proto+0.611
 amp+0.594
Neg logit contributions
-0.808
-0.755
knit-0.747
-0.717
 Citiz-0.705
Token polit
Token ID1408
Feature activation+1.93e-01

Pos logit contributions
+0.126
+0.118
knit+0.117
+0.112
 Citiz+0.110
Neg logit contributions
ample-0.108
alone-0.099
pport-0.099
 proto-0.091
abama-0.090
Token­
Token ID3907
Feature activation-1.27e+00

Pos logit contributions
+0.915
+0.859
knit+0.850
+0.814
 Citiz+0.800
Neg logit contributions
ample-0.672
alone-0.616
pport-0.615
 proto-0.569
 amp-0.546
Tokenic
Token ID291
Feature activation-1.17e-01

Pos logit contributions
ample+3.431
alone+3.024
pport+3.012
 amp+2.603
 proto+2.578
Neg logit contributions
-6.993
-6.676
knit-6.607
-6.444
 Citiz-6.420
Token­
Token ID3907
Feature activation-9.82e-01

Pos logit contributions
ample+0.408
pport+0.374
alone+0.374
 proto+0.344
abama+0.332
Neg logit contributions
-0.559
-0.527
knit-0.520
-0.500
 Citiz-0.493
Tokenal
Token ID282
Feature activation-2.36e-02

Pos logit contributions
ample+3.706
alone+3.425
pport+3.389
 proto+3.125
 amp+3.083
Neg logit contributions
-4.352
-4.093
knit-4.050
-3.913
 Citiz-3.885
Token in
Token ID287
Feature activation-1.13e-01

Pos logit contributions
ample+0.088
alone+0.082
pport+0.082
 proto+0.075
abama+0.073
Neg logit contributions
-0.106
-0.099
knit-0.098
-0.094
 Citiz-0.093
Token­
Token ID3907
Feature activation-6.49e-01

Pos logit contributions
ample+0.401
pport+0.367
alone+0.367
 proto+0.339
 amp+0.328
Neg logit contributions
-0.534
-0.502
knit-0.496
-0.477
 Citiz-0.469
Token Porn
Token ID29629
Feature activation+2.74e-02

Pos logit contributions
ample+1.338
alone+1.245
pport+1.239
 proto+1.164
 amp+1.138
Neg logit contributions
-1.310
-1.214
knit-1.199
-1.152
 Citiz-1.128
Token Pt
Token ID34352
Feature activation-9.96e-02

Pos logit contributions
+0.107
+0.100
knit+0.098
+0.094
 Citiz+0.092
Neg logit contributions
ample-0.118
pport-0.110
alone-0.110
 proto-0.103
 amp-0.101
Token 4
Token ID604
Feature activation-1.07e-01

Pos logit contributions
ample+0.417
pport+0.387
alone+0.386
 proto+0.362
 amp+0.353
Neg logit contributions
-0.404
-0.376
knit-0.372
-0.355
 Citiz-0.349
Token-
Token ID12
Feature activation-4.00e-02

Pos logit contributions
ample+0.491
pport+0.459
alone+0.458
 proto+0.433
 amp+0.423
Neg logit contributions
-0.393
-0.363
knit-0.358
-0.341
 Citiz-0.333
TokenOtt
Token ID49092
Feature activation-6.57e-01

Pos logit contributions
ample+0.182
pport+0.170
alone+0.170
 proto+0.160
 amp+0.157
Neg logit contributions
-0.147
-0.136
knit-0.134
-0.127
 Citiz-0.125
Tokenos
Token ID418
Feature activation-1.27e+00

Pos logit contributions
ample+2.599
alone+2.414
pport+2.399
 proto+2.237
 amp+2.183
Neg logit contributions
-2.792
-2.606
knit-2.570
-2.492
 Citiz-2.411
Token Old
Token ID5706
Feature activation-4.03e-01

Pos logit contributions
ample+5.336
alone+4.974
pport+4.942
 proto+4.624
 amp+4.551
Neg logit contributions
-5.027
-4.602
knit-4.550
-4.408
 Citiz-4.235
Tokene
Token ID68
Feature activation+3.70e-01

Pos logit contributions
ample+1.720
alone+1.609
pport+1.596
 proto+1.508
 amp+1.474
Neg logit contributions
-1.585
-1.479
knit-1.450
-1.395
 Citiz-1.351
Token Film
Token ID13741
Feature activation-1.95e-02

Pos logit contributions
+1.471
+1.368
knit+1.351
+1.287
 Citiz+1.273
Neg logit contributions
ample-1.568
pport-1.467
alone-1.451
 proto-1.368
abama-1.329
Token 2008
Token ID3648
Feature activation-1.90e-01

Pos logit contributions
ample+0.086
pport+0.080
alone+0.080
 proto+0.075
 amp+0.073
Neg logit contributions
-0.075
-0.069
knit-0.068
-0.065
 Citiz-0.064
Token01
Token ID486
Feature activation+5.22e-01

Pos logit contributions
ample+0.946
pport+0.889
alone+0.888
 proto+0.843
 amp+0.825
Neg logit contributions
-0.619
-0.565
knit-0.557
-0.526
 Citiz-0.513
Token­
Token ID3907
Feature activation-9.22e-01

Pos logit contributions
ample+1.653
pport+1.505
alone+1.502
 proto+1.387
 amp+1.341
Neg logit contributions
-2.385
-2.245
knit-2.223
-2.144
 Citiz-2.112
Tokenon
Token ID261
Feature activation+9.16e-02

Pos logit contributions
ample+3.952
alone+3.671
pport+3.633
 proto+3.363
 amp+3.323
Neg logit contributions
-3.648
-3.349
knit-3.304
-3.213
 Citiz-3.198
Token to
Token ID284
Feature activation+2.25e-01

Pos logit contributions
+0.404
+0.377
knit+0.371
+0.360
 Citiz+0.351
Neg logit contributions
ample-0.346
alone-0.323
pport-0.321
 proto-0.296
abama-0.289
Token this
Token ID428
Feature activation+2.34e-01

Pos logit contributions
+0.891
+0.825
knit+0.810
+0.781
 Citiz+0.767
Neg logit contributions
ample-0.961
pport-0.895
alone-0.894
 proto-0.829
abama-0.816
Token pres
Token ID906
Feature activation+3.53e-01

Pos logit contributions
+0.992
+0.928
knit+0.913
+0.880
 Citiz+0.866
Neg logit contributions
ample-0.924
pport-0.857
alone-0.855
 proto-0.786
abama-0.776
Token­
Token ID3907
Feature activation-1.26e+00

Pos logit contributions
+1.629
+1.533
knit+1.521
+1.456
 Citiz+1.439
Neg logit contributions
ample-1.253
pport-1.152
alone-1.136
 proto-1.081
abama-1.037
Tokenid
Token ID312
Feature activation+2.93e-01

Pos logit contributions
ample+5.672
alone+5.240
pport+5.162
 proto+4.680
 amp+4.646
Neg logit contributions
-4.856
-4.474
knit-4.392
 Citiz-4.306
-4.304
Token­
Token ID3907
Feature activation-1.23e+00

Pos logit contributions
+1.357
+1.275
knit+1.259
+1.203
 Citiz+1.178
Neg logit contributions
ample-1.042
pport-0.957
alone-0.956
 proto-0.884
abama-0.862
Tokenent
Token ID298
Feature activation+1.03e-01

Pos logit contributions
ample+6.074
alone+5.646
pport+5.619
 proto+5.207
 amp+5.149
Neg logit contributions
-4.131
-3.802
knit-3.722
 Citiz-3.561
-3.554
Token,
Token ID11
Feature activation-2.30e-01

Pos logit contributions
+0.389
+0.360
knit+0.354
+0.339
 Citiz+0.330
Neg logit contributions
ample-0.456
pport-0.426
alone-0.424
 proto-0.399
abama-0.389
Token
Token ID447
Feature activation+1.96e-01

Pos logit contributions
ample+0.993
pport+0.924
alone+0.922
 proto+0.864
abama+0.843
Neg logit contributions
-0.905
-0.838
knit-0.827
-0.790
 Citiz-0.776
Tokenwell
Token ID4053
Feature activation-3.13e-01

Pos logit contributions
ample+0.863
alone+0.804
pport+0.802
 proto+0.753
 amp+0.735
Neg logit contributions
-0.781
-0.726
knit-0.716
-0.685
 Citiz-0.673
Token young
Token ID1862
Feature activation-3.33e-01

Pos logit contributions
ample+1.322
alone+1.232
pport+1.229
 proto+1.160
 amp+1.135
Neg logit contributions
-1.248
-1.160
knit-1.144
-1.096
 Citiz-1.076
Token man
Token ID582
Feature activation+3.61e-02

Pos logit contributions
ample+1.395
pport+1.293
alone+1.293
 proto+1.219
 amp+1.190
Neg logit contributions
-1.345
-1.253
knit-1.237
-1.183
 Citiz-1.158
Token,
Token ID11
Feature activation-4.94e-01

Pos logit contributions
+0.141
+0.131
knit+0.129
+0.123
 Citiz+0.121
Neg logit contributions
ample-0.157
pport-0.146
alone-0.146
 proto-0.137
 amp-0.134
Token every
Token ID790
Feature activation-3.70e-01

Pos logit contributions
ample+2.047
alone+1.901
pport+1.899
 proto+1.783
 amp+1.745
Neg logit contributions
-2.005
-1.872
knit-1.846
-1.767
 Citiz-1.741
Token once
Token ID1752
Feature activation-1.26e+00

Pos logit contributions
ample+1.615
alone+1.509
pport+1.505
 proto+1.424
 amp+1.397
Neg logit contributions
-1.423
-1.313
knit-1.297
-1.241
 Citiz-1.214
Token in
Token ID287
Feature activation-6.05e-01

Pos logit contributions
ample+3.801
alone+3.393
pport+3.304
 proto+3.095
 amp+3.079
Neg logit contributions
-6.316
-6.107
knit-6.012
-5.736
 Citiz-5.735
Token a
Token ID257
Feature activation-1.97e-01

Pos logit contributions
ample+2.194
pport+2.022
alone+1.996
 proto+1.875
 amp+1.824
Neg logit contributions
-2.771
-2.603
knit-2.585
-2.469
 Citiz-2.448
Token while
Token ID981
Feature activation-1.70e-03

Pos logit contributions
ample+0.815
pport+0.760
alone+0.756
 proto+0.714
 amp+0.696
Neg logit contributions
-0.800
-0.745
knit-0.739
-0.703
 Citiz-0.691
Token one
Token ID530
Feature activation-4.49e-01

Pos logit contributions
ample+0.008
pport+0.007
alone+0.007
 proto+0.007
abama+0.007
Neg logit contributions
-0.006
-0.006
knit-0.006
-0.005
 Citiz-0.005
Token of
Token ID286
Feature activation-5.22e-01

Pos logit contributions
ample+1.675
pport+1.551
alone+1.546
 proto+1.445
 amp+1.409
Neg logit contributions
-2.014
-1.886
knit-1.870
-1.792
 Citiz-1.772
Token Obama
Token ID2486
Feature activation+7.32e-02

Pos logit contributions
ample+2.065
alone+1.933
pport+1.920
 proto+1.815
 amp+1.774
Neg logit contributions
-1.934
-1.805
knit-1.782
-1.701
 Citiz-1.668
Token is
Token ID318
Feature activation+5.37e-02

Pos logit contributions
+0.310
+0.290
knit+0.286
+0.274
 Citiz+0.269
Neg logit contributions
ample-0.292
pport-0.271
alone-0.269
 proto-0.250
abama-0.245
Token the
Token ID262
Feature activation+2.17e-01

Pos logit contributions
+0.229
+0.213
knit+0.211
+0.203
 Citiz+0.199
Neg logit contributions
ample-0.213
pport-0.197
alone-0.197
 proto-0.181
abama-0.179
Token worst
Token ID5290
Feature activation+4.21e-01

Pos logit contributions
+0.917
+0.853
knit+0.840
+0.807
 Citiz+0.800
Neg logit contributions
ample-0.865
alone-0.801
pport-0.801
 proto-0.731
abama-0.726
Token ne
Token ID497
Feature activation+6.99e-02

Pos logit contributions
+1.798
+1.684
knit+1.662
+1.585
 Citiz+1.575
Neg logit contributions
ample-1.656
alone-1.530
pport-1.529
abama-1.380
 proto-1.380
Token­
Token ID3907
Feature activation-1.25e+00

Pos logit contributions
+0.329
+0.310
knit+0.305
+0.293
 Citiz+0.289
Neg logit contributions
ample-0.247
alone-0.226
pport-0.225
 proto-0.208
abama-0.202
Tokengo
Token ID2188
Feature activation-6.04e-01

Pos logit contributions
ample+4.571
alone+4.215
pport+4.196
 proto+3.741
abama+3.716
Neg logit contributions
-5.742
-5.281
knit-5.251
-5.186
 Citiz-5.099
Token­
Token ID3907
Feature activation-1.16e+00

Pos logit contributions
ample+2.066
pport+1.885
alone+1.882
 proto+1.739
 amp+1.682
Neg logit contributions
-2.906
-2.736
knit-2.708
-2.611
 Citiz-2.571
Tokenti
Token ID20259
Feature activation-6.44e-02

Pos logit contributions
ample+4.017
alone+3.709
pport+3.647
 proto+3.357
 amp+3.249
Neg logit contributions
-5.531
-5.171
knit-5.125
-4.999
 Citiz-4.967
Token­
Token ID3907
Feature activation-8.37e-01

Pos logit contributions
ample+0.222
alone+0.203
pport+0.203
 proto+0.188
 amp+0.181
Neg logit contributions
-0.307
-0.289
knit-0.286
-0.275
 Citiz-0.271
Tokenat
Token ID265
Feature activation-1.78e-01

Pos logit contributions
ample+3.303
alone+3.065
pport+3.042
 proto+2.830
abama+2.760
Neg logit contributions
-3.568
-3.331
knit-3.284
-3.186
 Citiz-3.140
Tokenian
Token ID666
Feature activation-2.42e-01

Pos logit contributions
ample+1.275
alone+1.181
pport+1.179
 proto+1.107
 amp+1.077
Neg logit contributions
-1.311
-1.223
knit-1.207
-1.158
 Citiz-1.138
Token with
Token ID351
Feature activation+2.66e-01

Pos logit contributions
ample+0.973
pport+0.901
alone+0.899
 proto+0.841
 amp+0.819
Neg logit contributions
-1.023
-0.955
knit-0.944
-0.904
 Citiz-0.888
Token a
Token ID257
Feature activation-3.49e-02

Pos logit contributions
+1.148
+1.074
knit+1.059
+1.017
 Citiz+1.003
Neg logit contributions
ample-1.031
pport-0.957
alone-0.953
 proto-0.873
abama-0.866
Token r
Token ID374
Feature activation-3.44e-01

Pos logit contributions
ample+0.134
pport+0.124
alone+0.123
 proto+0.114
abama+0.111
Neg logit contributions
-0.153
-0.144
knit-0.142
-0.136
 Citiz-0.134
Tokenak
Token ID461
Feature activation-5.31e-01

Pos logit contributions
ample+1.471
pport+1.369
alone+1.367
 proto+1.285
 amp+1.253
Neg logit contributions
-1.365
-1.268
knit-1.252
-1.196
 Citiz-1.173
Tokenish
Token ID680
Feature activation-1.25e+00

Pos logit contributions
ample+1.795
pport+1.630
alone+1.615
 proto+1.485
 amp+1.445
Neg logit contributions
-2.585
-2.421
knit-2.374
-2.321
 Citiz-2.319
Token edge
Token ID5743
Feature activation-1.75e-01

Pos logit contributions
ample+4.946
alone+4.556
pport+4.534
 proto+4.394
 amp+4.244
Neg logit contributions
-5.162
-4.837
knit-4.785
-4.587
 Citiz-4.492
Token,
Token ID11
Feature activation-2.32e-01

Pos logit contributions
ample+0.740
pport+0.688
alone+0.686
 proto+0.644
 amp+0.628
Neg logit contributions
-0.704
-0.654
knit-0.647
-0.618
 Citiz-0.605
Token full
Token ID1336
Feature activation-2.86e-01

Pos logit contributions
ample+0.931
pport+0.862
alone+0.859
 proto+0.801
abama+0.781
Neg logit contributions
-0.981
-0.914
knit-0.905
-0.867
 Citiz-0.853
Token of
Token ID286
Feature activation-2.23e-02

Pos logit contributions
ample+1.039
pport+0.954
alone+0.952
 proto+0.884
 amp+0.857
Neg logit contributions
-1.318
-1.237
knit-1.224
-1.177
 Citiz-1.158
Token word
Token ID1573
Feature activation-3.76e-03

Pos logit contributions
ample+0.086
pport+0.079
alone+0.079
 proto+0.072
abama+0.071
Neg logit contributions
-0.098
-0.092
knit-0.091
-0.087
 Citiz-0.086
Token she
Token ID673
Feature activation+1.05e-01

Pos logit contributions
+0.297
+0.278
knit+0.274
+0.263
 Citiz+0.259
Neg logit contributions
ample-0.263
pport-0.244
alone-0.243
 proto-0.225
abama-0.220
Token would
Token ID561
Feature activation+3.04e-01

Pos logit contributions
+0.459
+0.428
knit+0.424
+0.406
 Citiz+0.398
Neg logit contributions
ample-0.403
pport-0.372
alone-0.369
 proto-0.343
abama-0.335
Token put
Token ID1234
Feature activation-6.93e-02

Pos logit contributions
+1.259
+1.168
knit+1.167
+1.110
 Citiz+1.078
Neg logit contributions
ample-1.232
pport-1.144
alone-1.143
 proto-1.052
abama-1.042
Token them
Token ID606
Feature activation+1.46e-01

Pos logit contributions
ample+0.274
pport+0.254
alone+0.253
 proto+0.236
abama+0.230
Neg logit contributions
-0.296
-0.277
knit-0.273
-0.262
 Citiz-0.257
Token to
Token ID284
Feature activation+3.16e-01

Pos logit contributions
+0.634
+0.595
knit+0.589
+0.563
 Citiz+0.550
Neg logit contributions
ample-0.565
pport-0.526
alone-0.515
 proto-0.482
abama-0.474
Token sleep
Token ID3993
Feature activation+6.72e-01

Pos logit contributions
+1.351
+1.261
knit+1.245
+1.192
 Citiz+1.168
Neg logit contributions
ample-1.248
pport-1.155
alone-1.150
 proto-1.072
abama-1.046
Token each
Token ID1123
Feature activation+3.24e-02

Pos logit contributions
+2.697
+2.503
knit+2.446
+2.347
 Citiz+2.273
Neg logit contributions
ample-2.820
pport-2.644
alone-2.585
 proto-2.445
abama-2.419
Token night
Token ID1755
Feature activation-1.26e-01

Pos logit contributions
+0.121
+0.112
knit+0.111
+0.105
 Citiz+0.103
Neg logit contributions
ample-0.146
pport-0.136
alone-0.136
 proto-0.128
abama-0.125
Token as
Token ID355
Feature activation-1.71e-01

Pos logit contributions
ample+0.533
pport+0.496
alone+0.495
 proto+0.465
 amp+0.453
Neg logit contributions
-0.503
-0.468
knit-0.462
-0.441
 Citiz-0.433
Token children
Token ID1751
Feature activation+1.46e-01

Pos logit contributions
ample+0.666
pport+0.615
alone+0.614
 proto+0.572
 amp+0.556
Neg logit contributions
-0.743
-0.695
knit-0.687
-0.659
 Citiz-0.647
Token,
Token ID11
Feature activation+1.02e-03

Pos logit contributions
+0.570
+0.530
knit+0.521
+0.499
 Citiz+0.486
Neg logit contributions
ample-0.631
pport-0.589
alone-0.586
 proto-0.552
 amp-0.538
Token If
Token ID1002
Feature activation+4.17e-01

Pos logit contributions
+0.987
+0.924
knit+0.910
+0.875
 Citiz+0.854
Neg logit contributions
ample-0.883
pport-0.819
alone-0.806
 proto-0.758
abama-0.737
Token it
Token ID340
Feature activation+3.29e-01

Pos logit contributions
+1.652
+1.527
knit+1.506
+1.452
 Citiz+1.407
Neg logit contributions
ample-1.779
pport-1.651
alone-1.648
 proto-1.545
abama-1.519
Token comes
Token ID2058
Feature activation+3.12e-01

Pos logit contributions
+1.382
+1.290
knit+1.269
+1.223
 Citiz+1.191
Neg logit contributions
ample-1.324
pport-1.223
alone-1.219
 proto-1.145
 amp-1.113
Token to
Token ID284
Feature activation+1.72e-01

Pos logit contributions
+1.334
+1.253
knit+1.232
+1.182
 Citiz+1.160
Neg logit contributions
ample-1.223
pport-1.134
alone-1.119
 proto-1.050
abama-1.026
Token that
Token ID326
Feature activation+5.16e-02

Pos logit contributions
+0.727
+0.678
knit+0.668
+0.644
 Citiz+0.630
Neg logit contributions
ample-0.690
pport-0.637
alone-0.635
 proto-0.594
abama-0.578
Token point
Token ID966
Feature activation+9.27e-01

Pos logit contributions
+0.197
+0.182
knit+0.179
+0.172
 Citiz+0.167
Neg logit contributions
ample-0.228
pport-0.212
alone-0.212
 proto-0.200
 amp-0.195
Token,
Token ID11
Feature activation+1.96e-01

Pos logit contributions
+3.420
+3.159
knit+3.092
+2.997
 Citiz+2.844
Neg logit contributions
ample-4.133
pport-3.835
alone-3.802
 proto-3.622
abama-3.565
Token we
Token ID356
Feature activation+1.04e-02

Pos logit contributions
+0.789
+0.730
knit+0.723
+0.693
 Citiz+0.678
Neg logit contributions
ample-0.830
pport-0.768
alone-0.767
 proto-0.718
abama-0.705
Token will
Token ID481
Feature activation+3.42e-01

Pos logit contributions
+0.045
+0.042
knit+0.041
+0.040
 Citiz+0.039
Neg logit contributions
ample-0.041
pport-0.038
alone-0.038
 proto-0.035
 amp-0.034
Token no
Token ID645
Feature activation+1.72e-01

Pos logit contributions
+1.372
+1.274
knit+1.260
+1.202
 Citiz+1.158
Neg logit contributions
ample-1.435
alone-1.327
pport-1.323
 proto-1.235
abama-1.221
Token longer
Token ID2392
Feature activation+2.89e-01

Pos logit contributions
+0.572
+0.524
knit+0.515
+0.488
 Citiz+0.475
Neg logit contributions
ample-0.847
pport-0.795
alone-0.794
 proto-0.753
 amp-0.737
Token demand
Token ID3512
Feature activation-1.92e-02

Pos logit contributions
+1.171
+1.084
knit+1.072
+1.031
 Citiz+1.008
Neg logit contributions
ample-1.110
alone-1.038
pport-1.035
 proto-0.948
abama-0.946
Token for
Token ID329
Feature activation-2.74e-01

Pos logit contributions
ample+0.075
pport+0.069
alone+0.069
 proto+0.064
 amp+0.062
Neg logit contributions
-0.083
-0.078
knit-0.077
-0.074
 Citiz-0.072
Token Argentine
Token ID36399
Feature activation-2.25e-01

Pos logit contributions
ample+1.076
pport+0.994
alone+0.992
 proto+0.929
 amp+0.904
Neg logit contributions
-1.177
-1.100
knit-1.089
-1.043
 Citiz-1.025
Token debt
Token ID5057
Feature activation-1.75e-01

Pos logit contributions
ample+0.835
pport+0.768
alone+0.767
 proto+0.714
 amp+0.692
Neg logit contributions
-1.016
-0.953
knit-0.942
-0.906
 Citiz-0.891
Token;
Token ID26
Feature activation+2.13e-01

Pos logit contributions
ample+0.726
pport+0.674
alone+0.673
 proto+0.631
 amp+0.615
Neg logit contributions
-0.714
-0.665
knit-0.656
-0.628
 Citiz-0.615
Token OPEC
Token ID49280
Feature activation+6.82e-01

Pos logit contributions
+0.861
+0.795
knit+0.784
+0.754
 Citiz+0.740
Neg logit contributions
ample-0.888
pport-0.826
alone-0.826
 proto-0.769
abama-0.757
Token failed
Token ID4054
Feature activation+1.22e-01

Pos logit contributions
+2.835
+2.612
knit+2.561
+2.501
 Citiz+2.415
Neg logit contributions
ample-2.765
pport-2.564
alone-2.540
 proto-2.361
abama-2.337
Token to
Token ID284
Feature activation-1.73e-01

Pos logit contributions
+0.560
+0.526
knit+0.519
+0.500
 Citiz+0.491
Neg logit contributions
ample-0.442
pport-0.406
alone-0.406
 proto-0.375
abama-0.364
Token cut
Token ID2005
Feature activation-1.54e-01

Pos logit contributions
ample+0.697
alone+0.646
pport+0.645
 proto+0.601
abama+0.586
Neg logit contributions
-0.728
-0.677
knit-0.672
-0.645
 Citiz-0.628
Token production
Token ID3227
Feature activation+4.11e-01

Pos logit contributions
ample+0.603
pport+0.556
alone+0.555
 proto+0.516
abama+0.502
Neg logit contributions
-0.666
-0.622
knit-0.614
-0.590
 Citiz-0.579
Token,
Token ID11
Feature activation+2.33e-01

Pos logit contributions
+1.680
+1.535
knit+1.522
+1.462
 Citiz+1.404
Neg logit contributions
ample-1.699
pport-1.584
alone-1.574
 proto-1.462
abama-1.445
Token path
Token ID3108
Feature activation+5.41e-01

Pos logit contributions
ample+0.229
pport+0.212
alone+0.212
 proto+0.197
abama+0.193
Neg logit contributions
-0.233
-0.216
knit-0.215
-0.206
 Citiz-0.201
Token,"
Token ID553
Feature activation-1.47e-01

Pos logit contributions
+2.331
+2.173
knit+2.164
+2.066
 Citiz+2.006
Neg logit contributions
ample-2.096
pport-1.937
alone-1.918
 proto-1.797
abama-1.768
Token says
Token ID1139
Feature activation-2.06e-01

Pos logit contributions
ample+0.574
pport+0.531
alone+0.528
 proto+0.493
abama+0.480
Neg logit contributions
-0.639
-0.595
knit-0.590
-0.566
 Citiz-0.557
Token Boy
Token ID6387
Feature activation+1.25e-01

Pos logit contributions
ample+0.914
pport+0.852
alone+0.852
 proto+0.804
 amp+0.784
Neg logit contributions
-0.781
-0.724
knit-0.714
-0.681
 Citiz-0.666
Tokenega
Token ID26470
Feature activation+4.48e-01

Pos logit contributions
+0.509
+0.474
knit+0.471
+0.448
 Citiz+0.438
Neg logit contributions
ample-0.522
pport-0.484
alone-0.483
 proto-0.455
 amp-0.443
Token the
Token ID262
Feature activation+7.18e-01

Pos logit contributions
+1.573
+1.444
knit+1.429
+1.358
 Citiz+1.313
Neg logit contributions
ample-2.110
pport-1.973
alone-1.957
 proto-1.855
abama-1.818
Token day
Token ID1110
Feature activation-4.92e-01

Pos logit contributions
+2.847
+2.649
knit+2.598
+2.500
 Citiz+2.448
Neg logit contributions
ample-3.015
pport-2.802
alone-2.791
 proto-2.572
abama-2.556
Token after
Token ID706
Feature activation+2.52e-01

Pos logit contributions
ample+1.569
alone+1.426
pport+1.413
 proto+1.316
 amp+1.270
Neg logit contributions
-2.471
-2.335
knit-2.311
-2.227
 Citiz-2.200
Token the
Token ID262
Feature activation+3.06e-01

Pos logit contributions
+0.995
+0.920
knit+0.912
+0.873
 Citiz+0.855
Neg logit contributions
ample-1.075
pport-1.000
alone-0.993
 proto-0.930
abama-0.918
Token final
Token ID2457
Feature activation+1.86e-01

Pos logit contributions
+1.213
+1.125
knit+1.110
+1.067
 Citiz+1.038
Neg logit contributions
ample-1.289
pport-1.201
alone-1.189
abama-1.105
 proto-1.099
Token Force
Token ID5221
Feature activation-3.75e-01

Pos logit contributions
+0.733
+0.679
knit+0.670
+0.645
 Citiz+0.626
Neg logit contributions
ample-0.791
pport-0.738
alone-0.733
 proto-0.685
abama-0.679
Token,
Token ID11
Feature activation-1.30e-01

Pos logit contributions
ample+0.194
pport+0.180
alone+0.180
 proto+0.166
abama+0.163
Neg logit contributions
-0.219
-0.205
knit-0.202
-0.194
 Citiz-0.190
Token Y
Token ID575
Feature activation+3.54e-01

Pos logit contributions
ample+0.549
pport+0.510
alone+0.509
 proto+0.476
abama+0.466
Neg logit contributions
-0.523
-0.486
knit-0.480
-0.460
 Citiz-0.448
TokenBa
Token ID34458
Feature activation+1.96e-01

Pos logit contributions
+1.362
+1.269
knit+1.246
+1.200
 Citiz+1.156
Neg logit contributions
ample-1.537
pport-1.441
alone-1.430
 proto-1.346
abama-1.319
Token2
Token ID17
Feature activation+3.13e-01

Pos logit contributions
+0.739
+0.685
knit+0.671
+0.645
 Citiz+0.626
Neg logit contributions
ample-0.874
pport-0.818
alone-0.815
 proto-0.766
abama-0.750
TokenCu
Token ID46141
Feature activation+3.96e-01

Pos logit contributions
+1.227
+1.137
knit+1.119
+1.078
 Citiz+1.046
Neg logit contributions
ample-1.352
pport-1.265
alone-1.257
 proto-1.177
abama-1.154
Token3
Token ID18
Feature activation+8.49e-01

Pos logit contributions
+1.353
+1.237
knit+1.218
+1.162
 Citiz+1.122
Neg logit contributions
ample-1.910
pport-1.793
alone-1.786
 proto-1.692
abama-1.657
TokenO
Token ID46
Feature activation+1.88e-01

Pos logit contributions
+3.268
+2.998
knit+2.951
+2.850
CHA+2.701
Neg logit contributions
ample-3.693
pport-3.470
alone-3.408
 proto-3.194
abama-3.169
Token6
Token ID21
Feature activation+3.50e-02

Pos logit contributions
+0.634
+0.582
knit+0.573
+0.542
 Citiz+0.528
Neg logit contributions
ample-0.918
pport-0.862
alone-0.860
 proto-0.816
 amp-0.797
Token+
Token ID10
Feature activation-4.60e-02

Pos logit contributions
+0.122
+0.112
knit+0.110
+0.105
 Citiz+0.102
Neg logit contributions
ample-0.166
pport-0.156
alone-0.156
 proto-0.147
abama-0.144
Tokenx
Token ID87
Feature activation-1.99e-01

Pos logit contributions
ample+0.157
pport+0.143
alone+0.143
 proto+0.132
 amp+0.128
Neg logit contributions
-0.222
-0.209
knit-0.207
-0.200
 Citiz-0.196
Token,
Token ID11
Feature activation+9.46e-02

Pos logit contributions
ample+0.854
pport+0.793
alone+0.793
 proto+0.748
 amp+0.729
Neg logit contributions
-0.786
-0.729
knit-0.722
-0.687
 Citiz-0.677
Token team
Token ID1074
Feature activation+4.86e-01

Pos logit contributions
+1.533
+1.428
knit+1.421
+1.358
 Citiz+1.334
Neg logit contributions
ample-1.487
pport-1.377
alone-1.376
 proto-1.273
abama-1.257
Token,"
Token ID553
Feature activation+1.40e-02

Pos logit contributions
+1.863
+1.733
knit+1.726
+1.629
 Citiz+1.600
Neg logit contributions
ample-2.118
pport-1.971
alone-1.959
 proto-1.833
abama-1.804
Token he
Token ID339
Feature activation+1.99e-01

Pos logit contributions
+0.078
+0.074
knit+0.074
+0.071
 Citiz+0.070
Neg logit contributions
ample-0.037
pport-0.033
alone-0.033
 proto-0.030
abama-0.028
Token said
Token ID531
Feature activation-7.79e-03

Pos logit contributions
+0.860
+0.799
knit+0.795
+0.763
 Citiz+0.744
Neg logit contributions
ample-0.777
pport-0.713
alone-0.707
 proto-0.666
abama-0.649
Token.
Token ID13
Feature activation-3.89e-02

Pos logit contributions
ample+0.037
pport+0.035
alone+0.034
 proto+0.033
 amp+0.032
Neg logit contributions
-0.027
-0.025
knit-0.025
-0.023
 Citiz-0.023
Token
Token ID198
Feature activation+3.32e-02

Pos logit contributions
ample+0.138
pport+0.126
alone+0.125
 proto+0.116
 amp+0.113
Neg logit contributions
-0.183
-0.172
knit-0.170
-0.164
 Citiz-0.161
Token
Token ID198
Feature activation-2.51e-02

Pos logit contributions
+0.150
+0.140
knit+0.140
+0.134
CHA+0.129
Neg logit contributions
ample-0.122
pport-0.114
alone-0.111
 proto-0.105
abama-0.102
Token"
Token ID1
Feature activation-2.09e-01

Pos logit contributions
ample+0.110
pport+0.103
alone+0.102
 proto+0.097
 amp+0.094
Neg logit contributions
-0.096
-0.089
knit-0.088
-0.084
 Citiz-0.082
TokenI
Token ID40
Feature activation+1.58e-01

Pos logit contributions
ample+0.877
alone+0.815
pport+0.812
 proto+0.762
 amp+0.743
Neg logit contributions
-0.845
-0.786
knit-0.775
-0.743
 Citiz-0.732
Token can
Token ID460
Feature activation-3.39e-02

Pos logit contributions
+0.640
+0.594
knit+0.592
+0.561
 Citiz+0.548
Neg logit contributions
ample-0.664
pport-0.615
alone-0.610
 proto-0.576
abama-0.560
Token't
Token ID470
Feature activation-6.40e-02

Pos logit contributions
ample+0.147
alone+0.136
pport+0.136
 proto+0.128
abama+0.125
Neg logit contributions
-0.132
knit-0.122
-0.122
-0.115
 Citiz-0.112
Token but
Token ID475
Feature activation+7.34e-02

Pos logit contributions
+0.894
+0.831
knit+0.823
+0.784
 Citiz+0.764
Neg logit contributions
ample-0.975
pport-0.905
alone-0.903
 proto-0.849
abama-0.829
Token it
Token ID340
Feature activation+2.26e-01

Pos logit contributions
+0.296
+0.275
knit+0.272
+0.260
 Citiz+0.255
Neg logit contributions
ample-0.309
pport-0.287
alone-0.286
 proto-0.268
abama-0.261
Token was
Token ID373
Feature activation+3.51e-01

Pos logit contributions
+0.961
+0.899
knit+0.895
+0.849
 Citiz+0.831
Neg logit contributions
ample-0.898
pport-0.829
alone-0.820
 proto-0.769
abama-0.750
Token always
Token ID1464
Feature activation-8.60e-02

Pos logit contributions
+1.375
+1.277
knit+1.271
+1.207
 Citiz+1.175
Neg logit contributions
ample-1.496
pport-1.388
alone-1.386
 proto-1.297
abama-1.271
Token enjoyable
Token ID20050
Feature activation-1.35e-01

Pos logit contributions
ample+0.356
pport+0.330
alone+0.330
 proto+0.310
 amp+0.302
Neg logit contributions
-0.353
-0.328
knit-0.324
-0.310
 Citiz-0.305
Token,"
Token ID553
Feature activation+1.06e-01

Pos logit contributions
ample+0.553
pport+0.513
alone+0.512
 proto+0.481
 amp+0.468
Neg logit contributions
-0.561
-0.523
knit-0.516
-0.494
 Citiz-0.486
Token he
Token ID339
Feature activation+2.13e-01

Pos logit contributions
+0.467
+0.436
knit+0.433
+0.415
 Citiz+0.407
Neg logit contributions
ample-0.407
pport-0.376
alone-0.373
 proto-0.346
abama-0.338
Token said
Token ID531
Feature activation+1.21e-01

Pos logit contributions
+0.905
+0.845
knit+0.839
+0.803
 Citiz+0.775
Neg logit contributions
ample-0.848
pport-0.783
alone-0.776
 proto-0.727
abama-0.708
Token.
Token ID13
Feature activation-7.18e-02

Pos logit contributions
+0.416
+0.381
knit+0.378
+0.357
 Citiz+0.344
Neg logit contributions
ample-0.579
pport-0.544
alone-0.539
 proto-0.512
 amp-0.499
Token "
Token ID366
Feature activation+4.18e-02

Pos logit contributions
ample+0.257
pport+0.235
alone+0.234
 proto+0.217
 amp+0.211
Neg logit contributions
-0.335
-0.315
knit-0.312
-0.300
 Citiz-0.294
TokenI
Token ID40
Feature activation+2.77e-01

Pos logit contributions
+0.162
+0.151
knit+0.149
+0.142
 Citiz+0.139
Neg logit contributions
ample-0.182
pport-0.170
alone-0.169
 proto-0.160
 amp-0.156
Token They
Token ID1119
Feature activation-1.69e-01

Pos logit contributions
ample+0.350
pport+0.324
alone+0.322
 proto+0.303
 amp+0.293
Neg logit contributions
-0.362
-0.338
knit-0.335
-0.320
 Citiz-0.312
Token were
Token ID547
Feature activation+8.95e-02

Pos logit contributions
ample+0.662
pport+0.612
alone+0.611
 proto+0.571
 amp+0.555
Neg logit contributions
-0.733
-0.685
knit-0.677
-0.650
 Citiz-0.639
Token very
Token ID845
Feature activation+2.56e-01

Pos logit contributions
+0.385
+0.360
knit+0.356
+0.342
 Citiz+0.333
Neg logit contributions
ample-0.350
pport-0.325
alone-0.323
 proto-0.301
abama-0.293
Token healthy
Token ID5448
Feature activation-5.16e-01

Pos logit contributions
+1.030
+0.959
knit+0.951
+0.903
 Citiz+0.885
Neg logit contributions
ample-1.070
pport-0.994
alone-0.994
 proto-0.923
abama-0.911
Token this
Token ID428
Feature activation-9.98e-02

Pos logit contributions
ample+1.989
alone+1.835
pport+1.830
 proto+1.715
 amp+1.668
Neg logit contributions
-2.262
-2.113
knit-2.086
-2.006
 Citiz-1.987
Token year
Token ID614
Feature activation+9.26e-03

Pos logit contributions
ample+0.381
alone+0.352
pport+0.352
 proto+0.328
 amp+0.319
Neg logit contributions
-0.440
-0.412
knit-0.407
-0.391
 Citiz-0.385
Token.
Token ID13
Feature activation-4.67e-02

Pos logit contributions
+0.037
+0.034
knit+0.034
+0.032
 Citiz+0.032
Neg logit contributions
ample-0.039
pport-0.037
alone-0.036
 proto-0.034
 amp-0.033
Token There
Token ID1318
Feature activation+4.76e-01

Pos logit contributions
ample+0.195
pport+0.181
alone+0.180
 proto+0.169
 amp+0.165
Neg logit contributions
-0.190
-0.177
knit-0.175
-0.167
 Citiz-0.163
Token were
Token ID547
Feature activation-8.56e-02

Pos logit contributions
+1.993
+1.859
knit+1.836
+1.762
 Citiz+1.715
Neg logit contributions
ample-1.919
pport-1.771
alone-1.762
 proto-1.669
 amp-1.620
Token teams
Token ID3466
Feature activation-7.66e-02

Pos logit contributions
ample+0.337
alone+0.313
pport+0.312
 proto+0.290
abama+0.283
Neg logit contributions
-0.366
-0.342
knit-0.338
-0.325
 Citiz-0.316
Token I
Token ID314
Feature activation+1.08e-01

Pos logit contributions
ample+0.331
pport+0.308
alone+0.307
 proto+0.289
 amp+0.282
Neg logit contributions
-0.300
-0.279
knit-0.275
-0.263
 Citiz-0.257
Token the
Token ID262
Feature activation-2.59e-02

Pos logit contributions
ample+0.104
pport+0.096
alone+0.096
 proto+0.089
abama+0.087
Neg logit contributions
-0.117
-0.109
knit-0.108
-0.104
 Citiz-0.102
Token sake
Token ID11060
Feature activation+3.73e-01

Pos logit contributions
ample+0.104
pport+0.097
alone+0.096
 proto+0.090
abama+0.088
Neg logit contributions
-0.109
-0.102
knit-0.101
-0.097
 Citiz-0.095
Token of
Token ID286
Feature activation-6.63e-02

Pos logit contributions
+1.621
+1.520
knit+1.497
+1.437
CHA+1.390
Neg logit contributions
ample-1.419
pport-1.325
alone-1.325
 proto-1.226
abama-1.198
Token putting
Token ID5137
Feature activation-5.86e-02

Pos logit contributions
ample+0.244
pport+0.226
alone+0.225
 proto+0.207
abama+0.202
Neg logit contributions
-0.302
-0.283
knit-0.279
-0.269
 Citiz-0.265
Token something
Token ID1223
Feature activation-1.31e-01

Pos logit contributions
ample+0.239
pport+0.222
alone+0.221
 proto+0.207
 amp+0.201
Neg logit contributions
-0.244
-0.227
knit-0.224
-0.215
 Citiz-0.211
Token in
Token ID287
Feature activation+1.87e-01

Pos logit contributions
ample+0.518
pport+0.480
alone+0.478
 proto+0.446
 amp+0.434
Neg logit contributions
-0.558
-0.521
knit-0.515
-0.494
 Citiz-0.485
Token the
Token ID262
Feature activation-1.56e-01

Pos logit contributions
+0.786
+0.728
knit+0.723
+0.688
 Citiz+0.679
Neg logit contributions
ample-0.753
pport-0.701
alone-0.699
 proto-0.650
abama-0.640
Token grid
Token ID10706
Feature activation+1.97e-01

Pos logit contributions
ample+0.606
pport+0.563
alone+0.560
 proto+0.520
abama+0.507
Neg logit contributions
-0.677
-0.631
knit-0.625
-0.599
 Citiz-0.590
Token
Token ID198
Feature activation-1.64e-01

Pos logit contributions
+0.823
+0.771
knit+0.761
+0.727
 Citiz+0.710
Neg logit contributions
ample-0.789
pport-0.741
alone-0.736
 proto-0.686
abama-0.667
Token
Token ID198
Feature activation-1.52e-01

Pos logit contributions
ample+0.585
pport+0.540
alone+0.532
 proto+0.499
abama+0.482
Neg logit contributions
-0.766
-0.717
knit-0.714
-0.685
CHA-0.662
Token1
Token ID16
Feature activation+1.55e-01

Pos logit contributions
ample+0.666
alone+0.619
pport+0.619
 proto+0.581
 amp+0.567
Neg logit contributions
-0.584
-0.542
knit-0.534
-0.509
 Citiz-0.502
Token had
Token ID550
Feature activation+2.79e-02

Pos logit contributions
ample+0.796
pport+0.733
alone+0.732
 proto+0.683
 amp+0.662
Neg logit contributions
-0.897
-0.838
knit-0.830
-0.796
 Citiz-0.780
Token reacted
Token ID22798
Feature activation+7.06e-02

Pos logit contributions
+0.115
+0.107
knit+0.105
+0.101
 Citiz+0.099
Neg logit contributions
ample-0.115
pport-0.107
alone-0.106
 proto-0.098
abama-0.097
Token to
Token ID284
Feature activation-8.58e-02

Pos logit contributions
+0.309
+0.288
knit+0.284
+0.274
 Citiz+0.267
Neg logit contributions
ample-0.273
pport-0.251
alone-0.249
 proto-0.233
abama-0.228
Token unpleasant
Token ID22029
Feature activation+2.32e-02

Pos logit contributions
ample+0.346
pport+0.320
alone+0.319
 proto+0.299
abama+0.291
Neg logit contributions
-0.360
-0.336
knit-0.332
-0.318
 Citiz-0.312
Token images
Token ID4263
Feature activation+1.32e-01

Pos logit contributions
+0.100
+0.094
knit+0.092
+0.089
 Citiz+0.087
Neg logit contributions
ample-0.091
pport-0.084
alone-0.084
 proto-0.078
abama-0.076
Token by
Token ID416
Feature activation-1.00e-01

Pos logit contributions
+0.548
+0.511
knit+0.504
+0.483
 Citiz+0.471
Neg logit contributions
ample-0.541
pport-0.501
alone-0.497
 proto-0.469
abama-0.459
Token vomiting
Token ID30874
Feature activation-1.65e-01

Pos logit contributions
ample+0.418
pport+0.388
alone+0.387
 proto+0.363
 amp+0.353
Neg logit contributions
-0.408
-0.379
knit-0.375
-0.358
 Citiz-0.351
Token or
Token ID393
Feature activation+1.38e-01

Pos logit contributions
ample+0.683
alone+0.634
pport+0.633
 proto+0.595
 amp+0.580
Neg logit contributions
-0.673
-0.626
knit-0.619
-0.592
 Citiz-0.581
Token crying
Token ID13774
Feature activation+2.32e-01

Pos logit contributions
+0.591
+0.553
knit+0.544
+0.522
 Citiz+0.510
Neg logit contributions
ample-0.550
pport-0.508
alone-0.503
 proto-0.468
abama-0.460
Token.
Token ID13
Feature activation-3.90e-02

Pos logit contributions
+0.871
+0.803
knit+0.791
+0.756
 Citiz+0.732
Neg logit contributions
ample-1.042
pport-0.973
alone-0.965
 proto-0.915
abama-0.893
Token
Token ID198
Feature activation+1.99e-01

Pos logit contributions
ample+0.148
pport+0.137
alone+0.135
 proto+0.127
abama+0.123
Neg logit contributions
-0.175
-0.163
knit-0.161
-0.155
 Citiz-0.151
Token 100
Token ID1802
Feature activation+8.00e-02

Pos logit contributions
+1.020
+0.941
knit+0.931
+0.894
 Citiz+0.868
Neg logit contributions
ample-1.149
pport-1.064
alone-1.055
abama-0.976
 proto-0.976
Token percent
Token ID1411
Feature activation+1.37e-01

Pos logit contributions
+0.318
+0.294
knit+0.291
+0.279
 Citiz+0.271
Neg logit contributions
ample-0.341
pport-0.317
alone-0.316
 proto-0.297
abama-0.289
Token renewables
Token ID36351
Feature activation+7.21e-02

Pos logit contributions
+0.486
+0.442
knit+0.439
+0.417
 Citiz+0.403
Neg logit contributions
ample-0.644
pport-0.601
alone-0.595
 proto-0.561
abama-0.556
Token energy
Token ID2568
Feature activation+3.22e-01

Pos logit contributions
+0.354
+0.330
knit+0.327
+0.316
 Citiz+0.309
Neg logit contributions
ample-0.240
pport-0.217
alone-0.215
 proto-0.198
abama-0.195
Token mix
Token ID5022
Feature activation+7.40e-02

Pos logit contributions
+1.539
+1.422
knit+1.414
+1.365
CHA+1.324
Neg logit contributions
ample-1.113
pport-1.011
alone-0.994
 proto-0.918
abama-0.910
Token before
Token ID878
Feature activation-1.95e-01

Pos logit contributions
+0.333
+0.309
knit+0.305
+0.294
 Citiz+0.287
Neg logit contributions
ample-0.275
pport-0.255
alone-0.250
 proto-0.235
abama-0.231
Token mid
Token ID3095
Feature activation-3.21e-01

Pos logit contributions
ample+0.735
pport+0.675
alone+0.675
 proto+0.628
 amp+0.609
Neg logit contributions
-0.870
-0.815
knit-0.806
-0.775
 Citiz-0.761
Token-
Token ID12
Feature activation+3.19e-01

Pos logit contributions
ample+1.523
alone+1.431
pport+1.427
 proto+1.350
 amp+1.320
Neg logit contributions
-1.114
-1.024
knit-1.011
-0.960
 Citiz-0.944
Tokencentury
Token ID14792
Feature activation+2.11e-01

Pos logit contributions
+1.290
+1.191
knit+1.177
+1.128
CHA+1.078
Neg logit contributions
ample-1.337
pport-1.246
alone-1.226
 proto-1.171
 amp-1.146
Token.
Token ID13
Feature activation-5.43e-02

Pos logit contributions
+0.776
+0.711
knit+0.697
+0.671
 Citiz+0.648
Neg logit contributions
ample-0.963
pport-0.902
alone-0.890
 proto-0.847
abama-0.832
Token
Token ID198
Feature activation-1.40e-01

Pos logit contributions
ample+0.210
pport+0.195
alone+0.192
 proto+0.179
abama+0.174
Neg logit contributions
-0.239
-0.223
knit-0.220
-0.212
 Citiz-0.206
Token the
Token ID262
Feature activation-3.94e-01

Pos logit contributions
ample+0.021
pport+0.020
alone+0.020
 proto+0.018
 amp+0.018
Neg logit contributions
-0.020
-0.018
knit-0.018
-0.017
 Citiz-0.017
Token river
Token ID7850
Feature activation-1.61e-01

Pos logit contributions
ample+1.377
alone+1.262
pport+1.258
 proto+1.172
 amp+1.136
Neg logit contributions
-1.861
-1.749
knit-1.729
-1.666
 Citiz-1.641
Token.
Token ID13
Feature activation+3.41e-03

Pos logit contributions
ample+0.691
alone+0.643
pport+0.643
 proto+0.605
 amp+0.590
Neg logit contributions
-0.631
-0.586
knit-0.579
-0.553
 Citiz-0.543
Token It
Token ID632
Feature activation-8.40e-02

Pos logit contributions
+0.013
+0.012
knit+0.012
+0.011
 Citiz+0.011
Neg logit contributions
ample-0.015
pport-0.014
alone-0.014
 proto-0.013
abama-0.013
Token's
Token ID338
Feature activation+1.10e-01

Pos logit contributions
ample+0.334
alone+0.310
pport+0.309
 proto+0.288
 amp+0.282
Neg logit contributions
-0.356
-0.333
knit-0.329
-0.315
 Citiz-0.311
Token this
Token ID428
Feature activation-1.74e-01

Pos logit contributions
+0.449
+0.416
knit+0.413
+0.395
 Citiz+0.385
Neg logit contributions
ample-0.454
pport-0.421
alone-0.418
 proto-0.393
abama-0.386
Token balancing
Token ID22486
Feature activation+1.69e-01

Pos logit contributions
ample+0.669
pport+0.616
alone+0.615
 proto+0.573
 amp+0.556
Neg logit contributions
-0.768
-0.718
knit-0.711
-0.683
 Citiz-0.670
Token act
Token ID719
Feature activation+5.89e-01

Pos logit contributions
+0.704
+0.656
knit+0.648
+0.622
 Citiz+0.607
Neg logit contributions
ample-0.691
pport-0.642
alone-0.640
 proto-0.597
abama-0.582
Token that
Token ID326
Feature activation+3.03e-01

Pos logit contributions
+2.241
+2.077
knit+2.055
+1.963
 Citiz+1.904
Neg logit contributions
ample-2.585
pport-2.424
alone-2.403
 proto-2.272
abama-2.234
Token you
Token ID345
Feature activation+4.40e-03

Pos logit contributions
+1.308
+1.221
knit+1.210
+1.158
 Citiz+1.138
Neg logit contributions
ample-1.189
pport-1.099
alone-1.090
 proto-1.023
abama-0.999
Token have
Token ID423
Feature activation+1.30e-01

Pos logit contributions
+0.019
+0.017
knit+0.017
+0.016
 Citiz+0.016
Neg logit contributions
ample-0.018
pport-0.016
alone-0.016
 proto-0.015
 amp-0.015
Token their
Token ID511
Feature activation-6.91e-02

Pos logit contributions
+0.273
+0.252
knit+0.250
+0.238
 Citiz+0.232
Neg logit contributions
ample-0.318
alone-0.296
pport-0.296
 proto-0.278
abama-0.271
Token kids
Token ID3988
Feature activation+1.61e-01

Pos logit contributions
ample+0.294
pport+0.273
alone+0.273
 proto+0.257
 amp+0.249
Neg logit contributions
-0.276
-0.256
knit-0.253
-0.242
 Citiz-0.236
Token that
Token ID326
Feature activation+6.07e-02

Pos logit contributions
+0.647
+0.601
knit+0.594
+0.569
 Citiz+0.556
Neg logit contributions
ample-0.682
pport-0.634
alone-0.632
 proto-0.595
 amp-0.579
Token do
Token ID466
Feature activation-1.74e-01

Pos logit contributions
+0.246
+0.228
knit+0.227
+0.216
 Citiz+0.211
Neg logit contributions
ample-0.254
alone-0.236
pport-0.235
 proto-0.221
abama-0.214
Token not
Token ID407
Feature activation-3.72e-01

Pos logit contributions
ample+0.708
pport+0.657
alone+0.655
 proto+0.614
 amp+0.599
Neg logit contributions
-0.728
-0.679
knit-0.671
-0.642
 Citiz-0.633
Token conflict
Token ID5358
Feature activation-3.80e-01

Pos logit contributions
ample+1.474
pport+1.363
alone+1.361
 proto+1.272
 amp+1.234
Neg logit contributions
-1.587
-1.482
knit-1.466
-1.405
 Citiz-1.377
Token with
Token ID351
Feature activation-5.40e-02

Pos logit contributions
ample+1.443
pport+1.332
alone+1.322
 proto+1.234
 amp+1.203
Neg logit contributions
-1.681
-1.579
knit-1.567
-1.500
 Citiz-1.481
Token their
Token ID511
Feature activation-2.53e-01

Pos logit contributions
ample+0.233
alone+0.217
pport+0.217
 proto+0.203
abama+0.199
Neg logit contributions
-0.212
-0.196
knit-0.193
-0.185
 Citiz-0.180
Token own
Token ID898
Feature activation-4.11e-02

Pos logit contributions
ample+0.995
pport+0.919
alone+0.917
 proto+0.856
 amp+0.832
Neg logit contributions
-1.090
-1.017
knit-1.007
-0.966
 Citiz-0.947
Token values
Token ID3815
Feature activation-1.35e-01

Pos logit contributions
ample+0.162
pport+0.150
alone+0.150
 proto+0.140
abama+0.136
Neg logit contributions
-0.176
-0.164
knit-0.163
-0.156
 Citiz-0.153
Token.
Token ID13
Feature activation+1.69e-01

Pos logit contributions
ample+0.592
pport+0.551
alone+0.551
 proto+0.518
 amp+0.506
Neg logit contributions
-0.521
-0.483
knit-0.477
-0.454
 Citiz-0.447
Token our
Token ID674
Feature activation-1.67e-01

Pos logit contributions
+0.822
+0.763
knit+0.756
+0.722
 Citiz+0.712
Neg logit contributions
ample-0.843
alone-0.785
pport-0.785
 proto-0.735
abama-0.716
Token research
Token ID2267
Feature activation-2.23e-01

Pos logit contributions
ample+0.637
pport+0.587
alone+0.586
 proto+0.547
 amp+0.531
Neg logit contributions
-0.736
-0.689
knit-0.681
-0.654
 Citiz-0.643
Token with
Token ID351
Feature activation-7.78e-03

Pos logit contributions
ample+0.897
pport+0.830
alone+0.829
 proto+0.776
 amp+0.755
Neg logit contributions
-0.939
-0.876
knit-0.866
-0.829
 Citiz-0.814
Token a
Token ID257
Feature activation-4.96e-02

Pos logit contributions
ample+0.031
pport+0.029
alone+0.029
 proto+0.026
abama+0.026
Neg logit contributions
-0.033
-0.031
knit-0.031
-0.029
 Citiz-0.029
Token bill
Token ID2855
Feature activation-3.53e-02

Pos logit contributions
ample+0.193
pport+0.178
alone+0.178
 proto+0.166
 amp+0.161
Neg logit contributions
-0.215
-0.201
knit-0.199
-0.191
 Citiz-0.188
Token to
Token ID284
Feature activation-2.02e-01

Pos logit contributions
ample+0.143
pport+0.132
alone+0.132
 proto+0.123
 amp+0.120
Neg logit contributions
-0.148
-0.138
knit-0.136
-0.131
 Citiz-0.128
Token prohibit
Token ID20767
Feature activation+1.05e-01

Pos logit contributions
ample+0.813
alone+0.752
pport+0.752
 proto+0.702
abama+0.683
Neg logit contributions
-0.853
-0.795
knit-0.785
-0.753
 Citiz-0.739
Token bear
Token ID6842
Feature activation-1.62e-02

Pos logit contributions
+0.437
+0.407
knit+0.402
+0.386
 Citiz+0.380
Neg logit contributions
ample-0.427
pport-0.396
alone-0.395
 proto-0.368
abama-0.359
Token-
Token ID12
Feature activation-3.48e-01

Pos logit contributions
ample+0.060
pport+0.055
alone+0.055
 proto+0.051
abama+0.049
Neg logit contributions
-0.073
-0.069
knit-0.068
-0.066
 Citiz-0.064
Tokenfeeding
Token ID22824
Feature activation-1.12e-01

Pos logit contributions
ample+1.545
alone+1.445
pport+1.432
 proto+1.349
 amp+1.315
Neg logit contributions
-1.331
-1.233
knit-1.208
-1.159
 Citiz-1.144
Token,
Token ID11
Feature activation-2.34e-01

Pos logit contributions
ample+0.435
pport+0.402
alone+0.398
 proto+0.373
abama+0.363
Neg logit contributions
-0.486
-0.455
knit-0.450
-0.432
 Citiz-0.424
Token arrest
Token ID3251
Feature activation+1.49e-02

Pos logit contributions
+0.738
knit+0.679
+0.676
+0.657
 Citiz+0.649
Neg logit contributions
ample-0.653
pport-0.604
alone-0.598
 proto-0.557
abama-0.540
Token,
Token ID11
Feature activation+1.18e-01

Pos logit contributions
+0.062
+0.057
knit+0.057
+0.054
 Citiz+0.053
Neg logit contributions
ample-0.061
pport-0.056
alone-0.056
 proto-0.053
 amp-0.051
Token
Token ID447
Feature activation+9.41e-02

Pos logit contributions
+0.510
+0.474
knit+0.471
+0.452
 Citiz+0.445
Neg logit contributions
ample-0.462
pport-0.427
alone-0.422
 proto-0.395
abama-0.388
Token
Token ID251
Feature activation-1.66e-01

Pos logit contributions
+0.290
+0.263
knit+0.260
+0.244
 Citiz+0.238
Neg logit contributions
ample-0.485
pport-0.456
alone-0.456
 proto-0.434
 amp-0.425
Token Ll
Token ID18315
Feature activation-1.36e-01

Pos logit contributions
ample+0.660
alone+0.610
pport+0.609
 proto+0.569
 amp+0.554
Neg logit contributions
-0.709
-0.663
knit-0.656
-0.628
 Citiz-0.617
Tokenanes
Token ID7305
Feature activation-2.43e-01

Pos logit contributions
ample+0.566
pport+0.525
alone+0.525
 proto+0.492
 amp+0.479
Neg logit contributions
-0.552
-0.515
knit-0.507
-0.486
 Citiz-0.476
Token said
Token ID531
Feature activation-1.47e-01

Pos logit contributions
ample+0.933
alone+0.859
pport+0.857
 proto+0.799
 amp+0.776
Neg logit contributions
-1.071
-1.003
knit-0.992
-0.951
 Citiz-0.936
Token today
Token ID1909
Feature activation+1.77e-01

Pos logit contributions
ample+0.605
pport+0.562
alone+0.560
 proto+0.526
 amp+0.511
Neg logit contributions
-0.604
-0.562
knit-0.556
-0.533
 Citiz-0.522
Token in
Token ID287
Feature activation+3.94e-01

Pos logit contributions
+0.679
+0.628
knit+0.619
+0.592
 Citiz+0.578
Neg logit contributions
ample-0.781
pport-0.728
alone-0.725
 proto-0.685
 amp-0.667
Token a
Token ID257
Feature activation+3.49e-01

Pos logit contributions
+1.689
+1.570
knit+1.546
+1.491
 Citiz+1.459
Neg logit contributions
ample-1.545
pport-1.430
alone-1.412
 proto-1.320
abama-1.302
Token statement
Token ID2643
Feature activation+1.98e-01

Pos logit contributions
+1.626
+1.523
knit+1.505
+1.447
 Citiz+1.412
Neg logit contributions
ample-1.250
pport-1.145
alone-1.129
 proto-1.046
abama-1.029
Token very
Token ID845
Feature activation+2.64e-01

Pos logit contributions
+0.181
+0.168
knit+0.166
+0.159
 Citiz+0.156
Neg logit contributions
ample-0.181
pport-0.168
alone-0.168
 proto-0.158
 amp-0.153
Token close
Token ID1969
Feature activation+5.50e-02

Pos logit contributions
+1.069
+0.994
knit+0.986
+0.937
 Citiz+0.921
Neg logit contributions
ample-1.099
pport-1.016
alone-1.016
 proto-0.949
abama-0.929
Token with
Token ID351
Feature activation+2.52e-01

Pos logit contributions
+0.249
+0.234
knit+0.232
+0.222
 Citiz+0.218
Neg logit contributions
ample-0.204
pport-0.187
alone-0.187
 proto-0.174
 amp-0.168
Token J
Token ID449
Feature activation+1.33e-01

Pos logit contributions
+1.053
+0.984
knit+0.974
+0.929
 Citiz+0.909
Neg logit contributions
ample-1.015
alone-0.936
pport-0.932
 proto-0.870
abama-0.853
Tokeneth
Token ID2788
Feature activation-1.97e-01

Pos logit contributions
+0.533
+0.494
knit+0.488
+0.466
 Citiz+0.456
Neg logit contributions
ample-0.556
pport-0.519
alone-0.517
 proto-0.487
abama-0.474
Tokenro
Token ID305
Feature activation-1.11e+00

Pos logit contributions
ample+0.902
pport+0.843
alone+0.842
 proto+0.795
 amp+0.776
Neg logit contributions
-0.723
-0.668
knit-0.659
-0.627
 Citiz-0.613
Token P
Token ID350
Feature activation-4.90e-01

Pos logit contributions
ample+4.479
alone+4.158
pport+4.120
 proto+3.895
 amp+3.825
Neg logit contributions
-4.557
knit-4.207
-4.184
-3.987
 Citiz-3.955
Tokenugh
Token ID6724
Feature activation-8.43e-01

Pos logit contributions
ample+1.804
pport+1.679
alone+1.670
 proto+1.566
 amp+1.512
Neg logit contributions
-2.205
-2.079
knit-2.042
-1.956
 Citiz-1.905
Token [
Token ID685
Feature activation-2.39e-01

Pos logit contributions
ample+3.259
alone+3.042
pport+3.005
 proto+2.804
 amp+2.746
Neg logit contributions
-3.608
-3.326
knit-3.292
 Citiz-3.199
-3.174
Tokenwho
Token ID8727
Feature activation-1.45e-01

Pos logit contributions
ample+1.014
alone+0.946
pport+0.938
 proto+0.878
 amp+0.859
Neg logit contributions
-0.952
-0.890
knit-0.873
-0.839
 Citiz-0.832
Token died
Token ID3724
Feature activation-4.38e-01

Pos logit contributions
ample+0.558
alone+0.516
pport+0.512
 proto+0.478
 amp+0.467
Neg logit contributions
-0.637
-0.597
knit-0.589
-0.567
 Citiz-0.562
Token
Token ID198
Feature activation+3.56e-02

Pos logit contributions
ample+0.443
pport+0.410
alone+0.407
 proto+0.381
 amp+0.370
Neg logit contributions
-0.494
-0.461
knit-0.455
-0.439
 Citiz-0.429
Token
Token ID198
Feature activation-5.73e-02

Pos logit contributions
+0.163
+0.152
knit+0.151
+0.145
CHA+0.140
Neg logit contributions
ample-0.130
pport-0.121
alone-0.118
 proto-0.111
abama-0.108
TokenFord
Token ID37308
Feature activation+1.71e-01

Pos logit contributions
ample+0.247
pport+0.230
alone+0.230
 proto+0.217
 amp+0.211
Neg logit contributions
-0.225
-0.208
knit-0.206
-0.197
 Citiz-0.192
Token was
Token ID373
Feature activation+1.26e-01

Pos logit contributions
+0.736
+0.686
knit+0.678
+0.653
 Citiz+0.640
Neg logit contributions
ample-0.675
pport-0.626
alone-0.622
 proto-0.582
abama-0.565
Token ind
Token ID773
Feature activation-1.60e-01

Pos logit contributions
+0.545
+0.508
knit+0.503
+0.483
 Citiz+0.475
Neg logit contributions
ample-0.492
pport-0.455
alone-0.452
 proto-0.421
abama-0.412
Tokenignant
Token ID25114
Feature activation-1.06e+00

Pos logit contributions
ample+0.619
alone+0.570
pport+0.570
 proto+0.530
 amp+0.516
Neg logit contributions
-0.703
-0.658
knit-0.650
-0.625
 Citiz-0.615
Token.
Token ID13
Feature activation+1.19e-01

Pos logit contributions
ample+4.158
alone+3.842
pport+3.814
 proto+3.627
 amp+3.518
Neg logit contributions
-4.487
-4.203
knit-4.168
 Citiz-3.934
-3.926
Token He
Token ID679
Feature activation+1.59e-01

Pos logit contributions
+0.527
+0.491
knit+0.481
+0.468
 Citiz+0.455
Neg logit contributions
ample-0.463
pport-0.430
alone-0.421
 proto-0.396
abama-0.386
Token denied
Token ID6699
Feature activation-1.01e-01

Pos logit contributions
+0.699
+0.653
knit+0.645
+0.622
 Citiz+0.605
Neg logit contributions
ample-0.614
pport-0.568
alone-0.563
 proto-0.527
abama-0.512
Token the
Token ID262
Feature activation+1.08e-01

Pos logit contributions
ample+0.416
pport+0.386
alone+0.386
 proto+0.362
 amp+0.353
Neg logit contributions
-0.413
-0.384
knit-0.380
-0.363
 Citiz-0.357
Token allegation
Token ID24389
Feature activation+9.78e-02

Pos logit contributions
+0.517
+0.486
knit+0.480
+0.464
 Citiz+0.455
Neg logit contributions
ample-0.375
pport-0.342
alone-0.340
 proto-0.311
abama-0.305
Token 2010
Token ID3050
Feature activation-9.78e-02

Pos logit contributions
+1.348
+1.252
knit+1.232
+1.172
 Citiz+1.146
Neg logit contributions
ample-1.604
pport-1.508
alone-1.481
 proto-1.399
abama-1.371
Token United
Token ID1578
Feature activation-3.61e-01

Pos logit contributions
ample+0.433
pport+0.404
alone+0.403
 proto+0.380
 amp+0.371
Neg logit contributions
-0.373
-0.345
knit-0.341
-0.325
 Citiz-0.318
Token States
Token ID1829
Feature activation+4.45e-05

Pos logit contributions
ample+1.073
pport+0.969
alone+0.966
 proto+0.884
 amp+0.850
Neg logit contributions
-1.888
-1.794
knit-1.778
-1.716
 Citiz-1.686
Token 21
Token ID2310
Feature activation-3.57e-01

Pos logit contributions
+0.000
+0.000
knit+0.000
+0.000
 Citiz+0.000
Neg logit contributions
ample-0.000
pport-0.000
alone-0.000
 proto-0.000
abama-0.000
Token50
Token ID1120
Feature activation-3.39e-02

Pos logit contributions
ample+1.488
pport+1.384
alone+1.384
 proto+1.299
 amp+1.271
Neg logit contributions
-1.438
-1.344
knit-1.328
-1.259
 Citiz-1.244
Token Posts
Token ID12043
Feature activation-1.12e+00

Pos logit contributions
ample+0.115
pport+0.105
alone+0.104
 proto+0.097
abama+0.093
Neg logit contributions
-0.165
-0.156
knit-0.154
-0.148
 Citiz-0.145
Token #
Token ID1303
Feature activation-8.79e-01

Pos logit contributions
ample+3.708
pport+3.389
alone+3.348
 proto+3.215
 amp+3.037
Neg logit contributions
-5.396
-5.103
knit-5.037
-4.876
 Citiz-4.800
Token16
Token ID1433
Feature activation+2.13e-01

Pos logit contributions
ample+2.875
alone+2.679
pport+2.637
 proto+2.362
rawdownloadcloneembedreportprint+2.295
Neg logit contributions
-4.332
-4.106
knit-3.994
 Citiz-3.989
-3.879
Token Glad
Token ID28442
Feature activation-6.12e-01

Pos logit contributions
+0.891
+0.830
knit+0.823
+0.785
 Citiz+0.777
Neg logit contributions
ample-0.864
pport-0.798
alone-0.795
 proto-0.735
abama-0.727
Token to
Token ID284
Feature activation+2.17e-01

Pos logit contributions
ample+2.462
pport+2.284
alone+2.280
 proto+2.148
 amp+2.088
Neg logit contributions
-2.563
-2.389
knit-2.349
-2.259
 Citiz-2.211
Token hear
Token ID3285
Feature activation+2.51e-01

Pos logit contributions
+0.840
+0.778
knit+0.776
+0.738
 Citiz+0.722
Neg logit contributions
ample-0.942
alone-0.873
pport-0.871
 proto-0.813
abama-0.805
Token Ac
Token ID4013
Feature activation+8.04e-02

Pos logit contributions
ample+0.269
pport+0.250
alone+0.249
 proto+0.232
 amp+0.226
Neg logit contributions
-0.273
-0.253
knit-0.250
-0.240
 Citiz-0.236
Tokenro
Token ID305
Feature activation-6.91e-01

Pos logit contributions
+0.326
+0.303
knit+0.300
+0.286
 Citiz+0.280
Neg logit contributions
ample-0.335
pport-0.312
alone-0.310
 proto-0.292
 amp-0.284
Token
Token ID198
Feature activation-2.16e-01

Pos logit contributions
ample+2.959
alone+2.764
pport+2.759
 proto+2.587
 amp+2.528
Neg logit contributions
-2.730
-2.518
knit-2.484
-2.373
 Citiz-2.353
Token
Token ID198
Feature activation-3.15e-01

Pos logit contributions
ample+0.758
pport+0.696
alone+0.690
 proto+0.642
 amp+0.619
Neg logit contributions
-1.021
-0.959
knit-0.951
-0.914
 Citiz-0.887
TokenAc
Token ID12832
Feature activation-2.38e-01

Pos logit contributions
ample+1.400
alone+1.305
pport+1.305
 proto+1.229
 amp+1.199
Neg logit contributions
-1.190
-1.102
knit-1.088
-1.037
 Citiz-1.017
Tokenro
Token ID305
Feature activation-9.68e-01

Pos logit contributions
ample+1.082
pport+1.010
alone+1.010
 proto+0.953
 amp+0.930
Neg logit contributions
-0.883
-0.815
knit-0.804
-0.765
 Citiz-0.750
Token
Token ID198
Feature activation-2.88e-01

Pos logit contributions
ample+3.807
alone+3.540
pport+3.506
 proto+3.316
 amp+3.246
Neg logit contributions
-4.106
-3.835
knit-3.788
-3.634
 Citiz-3.574
Token
Token ID198
Feature activation-3.17e-01

Pos logit contributions
ample+1.008
pport+0.925
alone+0.918
 proto+0.853
 amp+0.823
Neg logit contributions
-1.367
-1.284
knit-1.274
-1.225
 Citiz-1.190
TokenAc
Token ID12832
Feature activation-1.28e-01

Pos logit contributions
ample+1.399
alone+1.304
pport+1.303
 proto+1.227
 amp+1.197
Neg logit contributions
-1.209
-1.121
knit-1.107
-1.054
 Citiz-1.036
Tokenro
Token ID305
Feature activation-8.75e-01

Pos logit contributions
ample+0.585
pport+0.547
alone+0.546
 proto+0.516
 amp+0.504
Neg logit contributions
-0.469
-0.433
knit-0.427
-0.406
 Citiz-0.397
Token
Token ID198
Feature activation-3.63e-01

Pos logit contributions
ample+3.176
alone+2.925
pport+2.906
 proto+2.723
 amp+2.649
Neg logit contributions
-3.994
-3.751
knit-3.707
-3.567
 Citiz-3.529
Token on
Token ID319
Feature activation-1.40e-01

Pos logit contributions
ample+1.072
pport+0.974
alone+0.974
 proto+0.895
 amp+0.864
Neg logit contributions
-1.656
-1.560
knit-1.550
-1.496
 Citiz-1.476
Token local
Token ID1957
Feature activation-6.97e-01

Pos logit contributions
ample+0.537
alone+0.495
pport+0.495
 proto+0.457
abama+0.446
Neg logit contributions
-0.621
-0.578
knit-0.572
-0.549
 Citiz-0.541
Token governments
Token ID6905
Feature activation-1.92e-01

Pos logit contributions
ample+2.578
pport+2.368
alone+2.365
 proto+2.230
 amp+2.154
Neg logit contributions
-3.130
-2.951
knit-2.917
-2.808
 Citiz-2.756
Token and
Token ID290
Feature activation-7.16e-02

Pos logit contributions
ample+0.766
pport+0.709
alone+0.708
 proto+0.663
 amp+0.645
Neg logit contributions
-0.813
-0.759
knit-0.751
-0.719
 Citiz-0.706
Token their
Token ID511
Feature activation-5.49e-01

Pos logit contributions
ample+0.280
pport+0.259
alone+0.259
 proto+0.237
abama+0.235
Neg logit contributions
-0.310
-0.288
knit-0.286
-0.273
 Citiz-0.270
Token ability
Token ID2694
Feature activation-9.82e-01

Pos logit contributions
ample+2.024
pport+1.858
alone+1.855
 proto+1.734
 amp+1.680
Neg logit contributions
-2.494
-2.344
knit-2.318
-2.229
 Citiz-2.191
Token to
Token ID284
Feature activation-2.17e-01

Pos logit contributions
ample+3.054
pport+2.785
alone+2.767
 proto+2.546
 amp+2.452
Neg logit contributions
-4.928
-4.654
knit-4.625
-4.467
 Citiz-4.444
Token solve
Token ID8494
Feature activation-8.93e-02

Pos logit contributions
ample+0.849
alone+0.784
pport+0.783
 proto+0.727
abama+0.711
Neg logit contributions
-0.934
-0.870
knit-0.864
-0.826
 Citiz-0.809
Token complex
Token ID3716
Feature activation-1.12e-01

Pos logit contributions
ample+0.336
alone+0.309
pport+0.308
 proto+0.286
abama+0.280
Neg logit contributions
-0.400
-0.373
knit-0.370
-0.356
 Citiz-0.349
Token problems
Token ID2761
Feature activation+8.84e-02

Pos logit contributions
ample+0.405
pport+0.371
alone+0.371
 proto+0.341
abama+0.334
Neg logit contributions
-0.516
-0.483
knit-0.480
-0.460
 Citiz-0.453
Token such
Token ID884
Feature activation+1.13e-01

Pos logit contributions
+0.355
+0.329
knit+0.326
+0.312
 Citiz+0.305
Neg logit contributions
ample-0.373
pport-0.346
alone-0.345
 proto-0.325
 amp-0.317