TOP ACTIVATIONS
MAX = 11.792

In January 2010 , CNN / OR G released a poll
places , Major Rob Wolf end en of the V icks
Empire Cors airs Leadership . END OF TRAN SM ISSION ///
pic . twitter . com / Eh U g I x
his fight against Mark Hunt back at UFC 200 .
pic . twitter . com / CS 3 IV IP 9
pic . twitter . com / E q 4 R IL
py _ speed _ com par ison , 0 ) RE
youtube . com / channel / UC _ K k BY
myself for the rocky transition back into the work week .
trigger panic measures such as border closures and travel bans and
pic . twitter . com / 74 E g V V
pic . twitter . com / u I ba H Ex
South Korean soldier mans the border with North Korea , where
not originally imagined as a border . The earliest ancient texts
pic . twitter . com / j Ax 6 c U
pic . twitter . com / xx A II 1 k
pic . twitter . com / z J X w 2
pic . twitter . com / O JV L F Re
NR restrictions on my research back in the spring of 2000

MIN ACTIVATIONS
MIN = -10.313

when his effort hit an upright and the rebound was well
exp orter quite an achievement for a country of just
of the State : The Life of Murray N . Roth
higher energy prices were an insol uble problem ; to Obama
opt - out at any time . You agree to receive
real ! Everyday . Is Breat htaking . Every day ,
opt - out at any time . You agree to receive
C CT s , significantly allev iating poverty . In addition
subway gr ates or on sto ops or in encamp ments
opt - out at any time . You agree to receive
U . S . Supreme Court . The Court will decide
In the months before the IPO , Black fish
early June and jailed in Kuwait . On July
son Ash ish developed a breathing problem . Man j ula
hope , striking at the heart of evil . Then the
books , such as in Horton H ears a Who .
country that claims no towering invent ors or explorers , C
Oklahoma , a seism ically inactive area before gas fracking operations
deliver daily investing tips and stock picks , provide analysis with
his r ags - to - ric hes story " <|endoftext|>

INTERVAL 6.266 - 11.792
CONTAINS 0.230%

don 't even play games with AA on ( Anti Ali
fan poses for a photograph next to Walker 's statue
Vladimir Putin chairs a meeting with government members at the Kremlin
century the old religious tradition with its am oral and anthrop
Her relative was also hit with the item and suffered

INTERVAL 0.739 - 6.266
CONTAINS 33.568%

powerful combination to develop mental and physical ability in putting accuracy
t play that game , the price impact would
a powerful anti - war , anti - nuclear , peace
oint ly organized by Zionist youth movement Ha No ar Ha
Rebecca H ower ton , Sam P anche vre ( Sam

INTERVAL -4.787 - 0.739
CONTAINS 65.254%

he had called the tower during my takeoff roll and told
s Ship of the Year title at their feet . Twice
Sign ing Day is just four days away , and
Big Spot store , where you ll see the
family owned Se gu in s dist

INTERVAL -10.313 - -4.787
CONTAINS 0.947%

was soft . Them is straight facts . He just couldn
man eu ver quicker , twitch ier defensive backs .
that ends with Sed aris covered in blood , missing a
have $ 16 million in space , plus a large chunk
, 000 new leases for homeless families , but as many
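
The four INTERVAL ranges above appear to be equal-width bins spanning MIN to MAX. A minimal sketch, assuming that reading (the function name `equal_width_bins` is hypothetical and not part of the dashboard), reproduces the listed boundaries to within the rounding of the displayed values and checks that the CONTAINS shares add up to roughly 100%:

```python
# Sketch only: assumes the dashboard splits [MIN, MAX] into equal-width bins.
# MIN, MAX and the CONTAINS percentages are copied from the listing above.

def equal_width_bins(lo, hi, n):
    """Return n (low, high) pairs of equal width covering [lo, hi]."""
    width = (hi - lo) / n
    return [(lo + i * width, lo + (i + 1) * width) for i in range(n)]

MIN_ACT = -10.313
MAX_ACT = 11.792

bins = equal_width_bins(MIN_ACT, MAX_ACT, 4)

# Shares listed from the lowest interval to the highest.
shares = [0.947, 65.254, 33.568, 0.230]

for (low, high), share in zip(bins, shares):
    print(f"INTERVAL {low:.3f} - {high:.3f}  CONTAINS {share:.3f}%")

print(f"total share: {sum(shares):.3f}%")
```

Because MIN and MAX are themselves rounded to three decimals, a recomputed boundary can differ from the listed one in the last digit.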
TokenIn
Token ID818
Feature activation-1.56e+00

Pos logit contributions
vernment+1.116
 Aval+1.110
Desk+1.077
herent+1.064
 emancipation+1.040
Neg logit contributions
owship-1.237
 precaution-1.191
WARNING-1.189
UC-1.160
 suspic-1.150
Token January
Token ID3269
Feature activation+8.44e-01

Pos logit contributions
 Aval+0.906
vernment+0.906
herent+0.853
 emancipation+0.840
Desk+0.824
Neg logit contributions
WARNING-1.279
owship-1.271
UC-1.247
±-1.213
 precaution-1.209
Token 2010
Token ID3050
Feature activation-8.92e-01

Pos logit contributions
WARNING+0.607
owship+0.601
UC+0.596
 precaution+0.576
±+0.574
Neg logit contributions
vernment-0.576
 Aval-0.570
 emancipation-0.540
herent-0.539
Desk-0.531
Token,
Token ID11
Feature activation-3.89e-01

Pos logit contributions
vernment+0.585
 Aval+0.578
 emancipation+0.547
herent+0.545
Desk+0.538
Neg logit contributions
WARNING-0.666
owship-0.659
UC-0.655
 precaution-0.634
±-0.631
Token CNN
Token ID8100
Feature activation+3.92e+00

Pos logit contributions
vernment+0.242
 Aval+0.241
 emancipation+0.225
herent+0.224
Desk+0.220
Neg logit contributions
WARNING-0.305
owship-0.302
UC-0.298
 precaution-0.290
±-0.289
Token/
Token ID14
Feature activation+1.18e+01

Pos logit contributions
WARNING+2.794
UC+2.761
owship+2.755
±+2.659
 precaution+2.639
Neg logit contributions
vernment-2.663
 Aval-2.624
 emancipation-2.509
herent-2.505
ivery-2.464
TokenOR
Token ID1581
Feature activation+4.81e+00

Pos logit contributions
UC+6.242
WARNING+5.891
±+5.429
owship+5.240
CW+5.173
Neg logit contributions
vernment-10.212
 Aval-10.067
 emancipation-9.847
ivery-9.761
herent-9.639
TokenG
Token ID38
Feature activation+2.17e+00

Pos logit contributions
WARNING+2.621
UC+2.613
owship+2.574
 precaution+2.450
±+2.441
Neg logit contributions
vernment-4.130
 Aval-4.064
herent-3.902
 emancipation-3.877
ivery-3.814
Token released
Token ID2716
Feature activation-1.88e+00

Pos logit contributions
WARNING+1.575
owship+1.557
UC+1.548
 precaution+1.495
±+1.490
Neg logit contributions
vernment-1.462
 Aval-1.445
 emancipation-1.371
herent-1.368
Desk-1.347
Token a
Token ID257
Feature activation+2.86e-01

Pos logit contributions
vernment+1.141
 Aval+1.130
 emancipation+1.053
herent+1.051
Desk+1.030
Neg logit contributions
WARNING-1.500
owship-1.483
UC-1.468
 precaution-1.426
±-1.423
Token poll
Token ID3278
Feature activation-2.73e+00

Pos logit contributions
WARNING+0.222
owship+0.220
UC+0.218
 precaution+0.211
±+0.210
Neg logit contributions
vernment-0.180
 Aval-0.178
 emancipation-0.168
herent-0.167
Desk-0.164
Token places
Token ID4113
Feature activation-4.25e+00

Pos logit contributions
 Aval+1.729
vernment+1.724
 emancipation+1.617
herent+1.574
Desk+1.542
Neg logit contributions
owship-2.278
WARNING-2.239
UC-2.200
±-2.125
��-2.124
Token,
Token ID11
Feature activation-2.05e+00

Pos logit contributions
vernment+2.556
 Aval+2.556
 emancipation+2.353
herent+2.342
Desk+2.270
Neg logit contributions
owship-3.398
WARNING-3.360
UC-3.271
 precaution-3.172
 UCH-3.168
Token Major
Token ID8386
Feature activation-4.42e+00

Pos logit contributions
 Aval+1.275
vernment+1.274
 emancipation+1.198
herent+1.176
Desk+1.148
Neg logit contributions
owship-1.595
WARNING-1.593
UC-1.561
±-1.513
 precaution-1.495
Token Rob
Token ID3851
Feature activation-3.58e+00

Pos logit contributions
 Aval+2.376
vernment+2.350
 emancipation+2.201
herent+2.161
 Contributions+2.146
Neg logit contributions
WARNING-3.816
owship-3.767
UC-3.691
 precaution-3.592
±-3.559
Token Wolf
Token ID8662
Feature activation-1.41e+00

Pos logit contributions
vernment+2.281
 Aval+2.252
herent+2.147
 emancipation+2.091
ivery+2.066
Neg logit contributions
WARNING-2.761
owship-2.693
UC-2.675
±-2.604
 precaution-2.598
Tokenend
Token ID437
Feature activation+1.18e+01

Pos logit contributions
vernment+0.906
 Aval+0.899
herent+0.843
 emancipation+0.842
Desk+0.829
Neg logit contributions
WARNING-1.065
owship-1.056
UC-1.043
 precaution-1.009
±-0.999
Tokenen
Token ID268
Feature activation+2.69e+00

Pos logit contributions
UC+6.135
owship+6.077
±+6.013
WARNING+5.947
uca+5.652
Neg logit contributions
 Aval-9.665
vernment-9.353
 emancipation-9.237
Desk-8.813
herent-8.713
Token of
Token ID286
Feature activation-8.35e-01

Pos logit contributions
WARNING+1.967
owship+1.943
UC+1.935
 precaution+1.873
±+1.867
Neg logit contributions
vernment-1.796
 Aval-1.773
 emancipation-1.684
herent-1.677
Desk-1.656
Token the
Token ID262
Feature activation-3.81e-01

Pos logit contributions
 Aval+0.522
vernment+0.522
 emancipation+0.486
herent+0.481
Desk+0.471
Neg logit contributions
WARNING-0.654
owship-0.647
UC-0.636
±-0.621
 precaution-0.616
Token V
Token ID569
Feature activation-9.82e-01

Pos logit contributions
vernment+0.259
 Aval+0.259
 emancipation+0.245
herent+0.241
Desk+0.236
Neg logit contributions
WARNING-0.276
owship-0.274
UC-0.269
±-0.262
 precaution-0.259
Tokenicks
Token ID3378
Feature activation+1.21e+00

Pos logit contributions
vernment+0.789
 Aval+0.778
herent+0.745
 emancipation+0.744
ivery+0.742
Neg logit contributions
WARNING-0.585
owship-0.580
UC-0.561
 precaution-0.552
±-0.550
Token Empire
Token ID8065
Feature activation+4.63e+00

Pos logit contributions
WARNING+1.067
owship+1.056
UC+1.049
 precaution+1.015
±+1.012
Neg logit contributions
vernment-0.892
 Aval-0.883
 emancipation-0.834
herent-0.831
Desk-0.818
Token Cors
Token ID26978
Feature activation-2.34e+00

Pos logit contributions
WARNING+3.691
owship+3.654
UC+3.646
 precaution+3.547
±+3.538
Neg logit contributions
vernment-2.772
 Aval-2.722
herent-2.551
 emancipation-2.541
Desk-2.509
Tokenairs
Token ID3468
Feature activation+2.92e-01

Pos logit contributions
vernment+1.237
 Aval+1.213
herent+1.145
 emancipation+1.134
ivery+1.120
Neg logit contributions
WARNING-2.036
owship-2.015
UC-2.006
 precaution-1.948
±-1.935
Token Leadership
Token ID26935
Feature activation-3.02e+00

Pos logit contributions
WARNING+0.220
owship+0.218
UC+0.215
 precaution+0.209
±+0.208
Neg logit contributions
vernment-0.188
 Aval-0.186
 emancipation-0.178
herent-0.176
Desk-0.173
Token.
Token ID13
Feature activation+2.19e+00

Pos logit contributions
 Aval+1.918
vernment+1.898
 emancipation+1.808
Desk+1.795
herent+1.793
Neg logit contributions
owship-2.253
WARNING-2.244
UC-2.197
 precaution-2.150
±-2.126
TokenEND
Token ID10619
Feature activation+1.15e+01

Pos logit contributions
WARNING+1.556
owship+1.540
UC+1.527
 precaution+1.477
±+1.470
Neg logit contributions
vernment-1.508
 Aval-1.493
 emancipation-1.415
herent-1.411
Desk-1.393
Token OF
Token ID3963
Feature activation+2.56e-01

Pos logit contributions
WARNING+7.417
UC+7.206
owship+6.791
GAME+6.732
 UCH+6.642
Neg logit contributions
vernment-8.641
 Aval-8.539
 emancipation-7.983
Desk-7.866
ivery-7.794
Token TRAN
Token ID48213
Feature activation+5.91e-01

Pos logit contributions
owship+0.170
WARNING+0.169
UC+0.166
 precaution+0.161
±+0.160
Neg logit contributions
vernment-0.188
 Aval-0.187
herent-0.178
 emancipation-0.177
Desk-0.175
TokenSM
Token ID12310
Feature activation+5.15e+00

Pos logit contributions
owship+0.239
WARNING+0.225
 precaution+0.216
UC+0.213
±+0.207
Neg logit contributions
vernment-0.593
 Aval-0.587
herent-0.576
Desk-0.571
 emancipation-0.561
TokenISSION
Token ID40373
Feature activation+3.47e+00

Pos logit contributions
WARNING+3.473
UC+3.438
owship+3.370
±+3.234
 precaution+3.229
Neg logit contributions
vernment-3.756
 Aval-3.722
 emancipation-3.551
herent-3.538
Desk-3.491
Token ///
Token ID34013
Feature activation+2.16e+00

Pos logit contributions
WARNING+2.464
UC+2.412
owship+2.410
 precaution+2.321
±+2.319
Neg logit contributions
vernment-2.408
 Aval-2.367
 emancipation-2.254
herent-2.250
Desk-2.211
Token pic
Token ID8301
Feature activation+4.94e+00

Pos logit contributions
WARNING+3.082
UC+3.038
owship+2.991
 precaution+2.892
±+2.882
Neg logit contributions
vernment-3.426
 Aval-3.345
 emancipation-3.216
herent-3.214
Desk-3.152
Token.
Token ID13
Feature activation+4.65e+00

Pos logit contributions
WARNING+3.251
UC+3.241
±+3.137
CW+2.944
owship+2.923
Neg logit contributions
vernment-3.812
 Aval-3.628
 emancipation-3.537
ivery-3.423
herent-3.404
Tokentwitter
Token ID6956
Feature activation+7.65e+00

Pos logit contributions
WARNING+2.542
UC+2.503
±+2.410
owship+2.280
CW+2.214
Neg logit contributions
vernment-4.040
 Aval-3.973
 emancipation-3.880
herent-3.740
ivery-3.730
Token.
Token ID13
Feature activation+5.02e+00

Pos logit contributions
WARNING+4.573
UC+4.546
±+4.443
CW+4.150
RW+4.105
Neg logit contributions
vernment-6.358
 Aval-6.014
 emancipation-5.917
ivery-5.618
 Contributions-5.616
Tokencom
Token ID785
Feature activation+6.07e+00

Pos logit contributions
WARNING+4.074
UC+4.038
±+3.955
CW+3.799
owship+3.797
Neg logit contributions
vernment-3.072
 Aval-2.949
 emancipation-2.854
herent-2.640
ivery-2.624
Token/
Token ID14
Feature activation+1.09e+01

Pos logit contributions
WARNING+3.249
UC+3.238
±+3.158
CW+2.918
RW+2.914
Neg logit contributions
vernment-5.405
 Aval-5.157
 emancipation-5.040
herent-4.884
ivery-4.857
TokenEh
Token ID43894
Feature activation+9.19e-01

Pos logit contributions
UC+7.320
WARNING+7.100
CW+6.780
owship+6.777
±+6.745
Neg logit contributions
vernment-7.774
 Aval-7.714
 emancipation-7.249
herent-7.090
 Contributions-6.929
TokenU
Token ID52
Feature activation+3.09e-01

Pos logit contributions
WARNING+0.624
owship+0.621
UC+0.606
 precaution+0.597
±+0.587
Neg logit contributions
vernment-0.659
 Aval-0.652
herent-0.621
 emancipation-0.621
Desk-0.617
Tokeng
Token ID70
Feature activation-7.82e-01

Pos logit contributions
WARNING+0.214
owship+0.213
UC+0.207
 precaution+0.206
±+0.201
Neg logit contributions
vernment-0.216
 Aval-0.214
herent-0.204
 emancipation-0.204
Desk-0.203
TokenI
Token ID40
Feature activation-9.78e-01

Pos logit contributions
vernment+0.561
 Aval+0.553
herent+0.531
Desk+0.531
 emancipation+0.528
Neg logit contributions
owship-0.525
WARNING-0.521
 precaution-0.506
UC-0.502
±-0.492
Tokenx
Token ID87
Feature activation+1.90e+00

Pos logit contributions
vernment+0.682
 Aval+0.676
herent+0.650
Desk+0.647
 emancipation+0.644
Neg logit contributions
owship-0.671
WARNING-0.671
 precaution-0.650
UC-0.642
±-0.632
Token his
Token ID465
Feature activation+1.53e+00

Pos logit contributions
 Aval+0.696
vernment+0.692
 emancipation+0.656
herent+0.646
ivery+0.624
Neg logit contributions
WARNING-1.030
owship-1.026
UC-1.014
±-0.981
 precaution-0.973
Token fight
Token ID1907
Feature activation+5.29e+00

Pos logit contributions
WARNING+1.179
owship+1.169
UC+1.159
 precaution+1.121
±+1.119
Neg logit contributions
vernment-0.966
 Aval-0.958
 emancipation-0.905
herent-0.899
Desk-0.883
Token against
Token ID1028
Feature activation+4.43e+00

Pos logit contributions
WARNING+3.346
owship+3.284
UC+3.282
 precaution+3.162
±+3.132
Neg logit contributions
vernment-4.083
 Aval-4.018
herent-3.820
 emancipation-3.806
Desk-3.784
Token Mark
Token ID2940
Feature activation+8.09e+00

Pos logit contributions
WARNING+3.216
UC+3.183
owship+3.158
 precaution+3.116
±+3.049
Neg logit contributions
vernment-2.980
 Aval-2.794
Desk-2.738
herent-2.731
 emancipation-2.700
Token Hunt
Token ID12937
Feature activation+3.42e+00

Pos logit contributions
owship+5.473
WARNING+5.438
UC+5.425
 RW+5.365
 precaution+5.312
Neg logit contributions
vernment-5.540
 emancipation-5.200
herent-5.089
Desk-5.017
ivery-5.006
Token back
Token ID736
Feature activation+1.08e+01

Pos logit contributions
WARNING+2.568
owship+2.528
UC+2.524
 precaution+2.445
±+2.435
Neg logit contributions
vernment-2.233
 Aval-2.194
 emancipation-2.078
herent-2.065
Desk-2.049
Token at
Token ID379
Feature activation+2.44e+00

Pos logit contributions
WARNING+7.079
UC+6.932
owship+6.808
±+6.694
 precaution+6.568
Neg logit contributions
vernment-7.459
 Aval-7.099
herent-7.008
 emancipation-6.974
Desk-6.867
Token UFC
Token ID11448
Feature activation+2.92e+00

Pos logit contributions
WARNING+1.957
owship+1.935
UC+1.932
 precaution+1.872
±+1.863
Neg logit contributions
vernment-1.466
 Aval-1.431
 emancipation-1.351
herent-1.347
Desk-1.336
Token 200
Token ID939
Feature activation+2.19e+00

Pos logit contributions
WARNING+2.084
owship+2.067
UC+2.064
 precaution+1.990
±+1.989
Neg logit contributions
vernment-2.006
 Aval-1.966
 emancipation-1.861
herent-1.854
Desk-1.842
Token.
Token ID13
Feature activation-1.51e+00

Pos logit contributions
WARNING+1.680
owship+1.661
UC+1.651
 precaution+1.601
±+1.593
Neg logit contributions
vernment-1.393
 Aval-1.374
 emancipation-1.297
herent-1.293
Desk-1.275
Token
Token ID198
Feature activation-2.40e+00

Pos logit contributions
 Aval+1.035
vernment+1.017
herent+0.966
Desk+0.958
 emancipation+0.951
Neg logit contributions
owship-1.075
WARNING-1.056
UC-1.037
 precaution-1.033
±-1.004
Token pic
Token ID8301
Feature activation+4.68e+00

Pos logit contributions
owship+0.542
WARNING+0.540
UC+0.530
 precaution+0.519
±+0.512
Neg logit contributions
 Aval-0.497
vernment-0.495
 emancipation-0.465
herent-0.464
Desk-0.461
Token.
Token ID13
Feature activation+4.09e+00

Pos logit contributions
WARNING+3.119
UC+3.106
±+3.007
owship+2.875
CW+2.839
Neg logit contributions
vernment-3.555
 Aval-3.370
 emancipation-3.269
ivery-3.195
herent-3.174
Tokentwitter
Token ID6956
Feature activation+6.61e+00

Pos logit contributions
WARNING+2.295
UC+2.270
±+2.167
owship+2.116
 precaution+2.015
Neg logit contributions
vernment-3.499
 Aval-3.435
 emancipation-3.332
herent-3.232
ivery-3.217
Token.
Token ID13
Feature activation+4.49e+00

Pos logit contributions
WARNING+4.087
UC+4.045
±+3.958
CW+3.677
RW+3.658
Neg logit contributions
vernment-5.384
 Aval-5.064
 emancipation-4.991
ivery-4.792
 Contributions-4.746
Tokencom
Token ID785
Feature activation+5.36e+00

Pos logit contributions
WARNING+3.708
UC+3.669
±+3.578
owship+3.462
CW+3.446
Neg logit contributions
vernment-2.692
 Aval-2.595
 emancipation-2.492
herent-2.333
ivery-2.312
Token/
Token ID14
Feature activation+1.06e+01

Pos logit contributions
WARNING+2.941
UC+2.918
±+2.846
CW+2.628
RW+2.628
Neg logit contributions
vernment-4.717
 Aval-4.500
 emancipation-4.388
herent-4.281
ivery-4.252
TokenCS
Token ID7902
Feature activation+2.75e+00

Pos logit contributions
UC+7.170
WARNING+6.970
owship+6.650
CW+6.635
±+6.616
Neg logit contributions
vernment-7.618
 Aval-7.559
 emancipation-7.107
herent-6.949
 Contributions-6.795
Token3
Token ID18
Feature activation-1.88e+00

Pos logit contributions
WARNING+1.891
owship+1.866
UC+1.858
 precaution+1.786
±+1.783
Neg logit contributions
vernment-1.969
 Aval-1.947
 emancipation-1.849
herent-1.843
Desk-1.817
TokenIV
Token ID3824
Feature activation-9.23e-01

Pos logit contributions
vernment+1.323
 Aval+1.308
herent+1.262
Desk+1.260
 emancipation+1.248
Neg logit contributions
owship-1.268
WARNING-1.261
 precaution-1.228
UC-1.203
±-1.186
TokenIP
Token ID4061
Feature activation+1.29e+00

Pos logit contributions
vernment+0.649
 Aval+0.641
herent+0.616
Desk+0.614
 emancipation+0.612
Neg logit contributions
owship-0.629
WARNING-0.628
 precaution-0.609
UC-0.603
±-0.590
Token9
Token ID24
Feature activation-3.86e-01

Pos logit contributions
WARNING+0.883
owship+0.878
UC+0.860
 precaution+0.844
±+0.832
Neg logit contributions
vernment-0.924
 Aval-0.913
herent-0.870
 emancipation-0.869
Desk-0.862
Token pic
Token ID8301
Feature activation+3.97e+00

Pos logit contributions
owship+0.055
WARNING+0.055
UC+0.053
 precaution+0.052
±+0.052
Neg logit contributions
 Aval-0.055
vernment-0.055
herent-0.052
 emancipation-0.052
Desk-0.051
Token.
Token ID13
Feature activation+4.51e+00

Pos logit contributions
WARNING+2.722
UC+2.715
±+2.618
owship+2.534
 precaution+2.487
Neg logit contributions
vernment-2.918
 Aval-2.801
 emancipation-2.716
herent-2.637
ivery-2.625
Tokentwitter
Token ID6956
Feature activation+6.36e+00

Pos logit contributions
WARNING+2.573
UC+2.537
±+2.432
owship+2.380
 precaution+2.302
Neg logit contributions
vernment-3.801
 Aval-3.738
 emancipation-3.626
herent-3.521
Desk-3.513
Token.
Token ID13
Feature activation+4.99e+00

Pos logit contributions
WARNING+4.024
UC+4.001
±+3.921
RW+3.645
CW+3.635
Neg logit contributions
vernment-5.042
 Aval-4.778
 emancipation-4.703
ivery-4.487
 Contributions-4.466
Tokencom
Token ID785
Feature activation+5.35e+00

Pos logit contributions
WARNING+4.100
UC+4.066
±+3.990
owship+3.836
CW+3.816
Neg logit contributions
vernment-2.985
 Aval-2.866
 emancipation-2.769
herent-2.575
Desk-2.569
Token/
Token ID14
Feature activation+1.06e+01

Pos logit contributions
UC+2.974
WARNING+2.973
±+2.894
owship+2.700
RW+2.684
Neg logit contributions
vernment-4.619
 Aval-4.444
 emancipation-4.336
herent-4.208
 Contributions-4.165
TokenE
Token ID36
Feature activation+1.60e+00

Pos logit contributions
UC+7.161
WARNING+6.959
owship+6.642
CW+6.631
±+6.625
Neg logit contributions
vernment-7.579
 Aval-7.534
 emancipation-7.094
herent-6.929
 Contributions-6.769
Tokenq
Token ID80
Feature activation+1.62e+00

Pos logit contributions
WARNING+1.101
owship+1.092
UC+1.074
 precaution+1.048
±+1.037
Neg logit contributions
vernment-1.141
 Aval-1.129
 emancipation-1.073
herent-1.073
Desk-1.061
Token4
Token ID19
Feature activation-1.86e+00

Pos logit contributions
WARNING+1.106
owship+1.098
UC+1.079
 precaution+1.053
±+1.041
Neg logit contributions
vernment-1.167
 Aval-1.154
 emancipation-1.097
herent-1.097
Desk-1.087
TokenR
Token ID49
Feature activation+5.88e-01

Pos logit contributions
vernment+1.310
 Aval+1.295
herent+1.249
Desk+1.247
 emancipation+1.237
Neg logit contributions
owship-1.255
WARNING-1.247
 precaution-1.213
UC-1.187
±-1.169
TokenIL
Token ID4146
Feature activation+3.24e+00

Pos logit contributions
WARNING+0.399
owship+0.397
UC+0.387
 precaution+0.382
±+0.375
Neg logit contributions
vernment-0.421
 Aval-0.417
herent-0.398
 emancipation-0.397
Desk-0.394
Tokenpy
Token ID9078
Feature activation+2.01e+00

Pos logit contributions
WARNING+1.429
owship+1.414
UC+1.408
±+1.365
 precaution+1.343
Neg logit contributions
vernment-1.471
 Aval-1.452
 emancipation-1.391
Desk-1.356
herent-1.342
Token_
Token ID62
Feature activation+6.31e-01

Pos logit contributions
WARNING+1.473
owship+1.447
UC+1.446
 precaution+1.391
±+1.390
Neg logit contributions
vernment-1.347
 Aval-1.331
 emancipation-1.261
herent-1.251
Desk-1.231
Tokenspeed
Token ID12287
Feature activation-3.20e+00

Pos logit contributions
WARNING+0.465
owship+0.461
UC+0.457
 precaution+0.443
±+0.441
Neg logit contributions
vernment-0.419
 Aval-0.414
 emancipation-0.392
herent-0.392
Desk-0.386
Token_
Token ID62
Feature activation-1.11e+00

Pos logit contributions
vernment+1.954
 Aval+1.924
Desk+1.833
herent+1.833
 emancipation+1.826
Neg logit contributions
owship-2.472
WARNING-2.466
UC-2.401
 precaution-2.367
±-2.326
Tokencom
Token ID785
Feature activation-4.97e+00

Pos logit contributions
vernment+0.747
 Aval+0.736
herent+0.709
 emancipation+0.696
Desk+0.693
Neg logit contributions
owship-0.798
WARNING-0.795
UC-0.780
 precaution-0.766
±-0.752
Tokenpar
Token ID1845
Feature activation+1.05e+01

Pos logit contributions
vernment+2.477
 Aval+2.340
herent+2.335
ivery+2.239
Desk+2.181
Neg logit contributions
owship-4.430
WARNING-4.408
UC-4.340
 precaution-4.291
±-4.198
Tokenison
Token ID1653
Feature activation+2.70e+00

Pos logit contributions
UC+6.163
owship+6.113
±+6.079
WARNING+6.060
 precaution+5.744
Neg logit contributions
 Aval-8.047
vernment-7.866
 emancipation-7.665
 Contributions-7.368
herent-7.257
Token,
Token ID11
Feature activation-1.28e+00

Pos logit contributions
WARNING+1.994
UC+1.962
owship+1.948
±+1.891
 precaution+1.887
Neg logit contributions
vernment-1.804
 Aval-1.770
 emancipation-1.688
herent-1.668
Desk-1.641
Token 0
Token ID657
Feature activation-1.22e+00

Pos logit contributions
vernment+0.853
 Aval+0.844
herent+0.808
 emancipation+0.797
Desk+0.791
Neg logit contributions
owship-0.924
WARNING-0.921
UC-0.902
 precaution-0.880
±-0.878
Token)
Token ID8
Feature activation+1.46e+00

Pos logit contributions
vernment+0.730
 Aval+0.725
herent+0.681
 emancipation+0.680
ivery+0.670
Neg logit contributions
owship-0.967
WARNING-0.964
UC-0.940
 precaution-0.932
±-0.915
Token RE
Token ID4526
Feature activation+1.17e+00

Pos logit contributions
WARNING+0.991
owship+0.986
UC+0.972
 precaution+0.939
±+0.935
Neg logit contributions
vernment-1.055
 Aval-1.047
 emancipation-0.994
herent-0.993
Desk-0.978
Tokenyoutube
Token ID11604
Feature activation+2.28e+00

Pos logit contributions
vernment+0.085
 Aval+0.083
herent+0.078
 emancipation+0.077
Desk+0.076
Neg logit contributions
WARNING-0.144
owship-0.143
UC-0.142
 precaution-0.138
±-0.137
Token.
Token ID13
Feature activation+4.09e+00

Pos logit contributions
WARNING+1.636
UC+1.604
owship+1.571
±+1.555
 precaution+1.510
Neg logit contributions
vernment-1.595
 Aval-1.558
 emancipation-1.493
herent-1.461
ivery-1.440
Tokencom
Token ID785
Feature activation+1.49e+00

Pos logit contributions
WARNING+3.473
UC+3.424
±+3.334
owship+3.307
 precaution+3.197
Neg logit contributions
vernment-2.325
 Aval-2.273
 emancipation-2.178
herent-2.039
ivery-2.030
Token/
Token ID14
Feature activation+8.67e+00

Pos logit contributions
WARNING+0.960
owship+0.947
UC+0.940
 precaution+0.905
±+0.901
Neg logit contributions
vernment-1.132
 Aval-1.121
 emancipation-1.068
herent-1.064
Desk-1.052
Tokenchannel
Token ID17620
Feature activation+2.20e+00

Pos logit contributions
WARNING+4.470
UC+4.447
owship+4.202
±+4.102
 precaution+3.983
Neg logit contributions
 Aval-7.593
vernment-7.540
 emancipation-7.239
ivery-7.042
Desk-6.990
Token/
Token ID14
Feature activation+1.05e+01

Pos logit contributions
WARNING+1.358
UC+1.328
owship+1.304
±+1.279
 precaution+1.247
Neg logit contributions
vernment-1.753
 Aval-1.714
 emancipation-1.647
herent-1.626
ivery-1.604
TokenUC
Token ID9598
Feature activation+5.28e+00

Pos logit contributions
UC+4.494
WARNING+3.938
owship+3.642
CW+3.444
±+3.430
Neg logit contributions
vernment-10.683
 Aval-10.545
 emancipation-10.172
herent-9.949
Desk-9.936
Token_
Token ID62
Feature activation+5.52e+00

Pos logit contributions
WARNING+3.606
UC+3.602
owship+3.526
±+3.414
 precaution+3.350
Neg logit contributions
vernment-3.798
 Aval-3.749
 emancipation-3.559
herent-3.523
Desk-3.446
TokenK
Token ID42
Feature activation-2.52e+00

Pos logit contributions
WARNING+3.754
UC+3.754
owship+3.661
±+3.542
 precaution+3.469
Neg logit contributions
vernment-3.997
 Aval-3.954
 emancipation-3.739
herent-3.709
Desk-3.619
Tokenk
Token ID74
Feature activation-2.59e+00

Pos logit contributions
vernment+1.776
 Aval+1.759
Desk+1.700
herent+1.694
 emancipation+1.682
Neg logit contributions
owship-1.679
WARNING-1.665
 precaution-1.630
UC-1.583
±-1.568
TokenBY
Token ID17513
Feature activation+2.96e-01

Pos logit contributions
vernment+1.822
 Aval+1.799
Desk+1.741
herent+1.730
 emancipation+1.719
Neg logit contributions
owship-1.744
WARNING-1.730
 precaution-1.691
UC-1.645
±-1.627
Token myself
Token ID3589
Feature activation+4.90e+00

Pos logit contributions
WARNING+0.145
owship+0.144
UC+0.142
 precaution+0.138
±+0.137
Neg logit contributions
vernment-0.112
 Aval-0.112
 emancipation-0.105
herent-0.104
Desk-0.103
Token for
Token ID329
Feature activation+1.29e+00

Pos logit contributions
WARNING+3.622
UC+3.584
owship+3.568
 precaution+3.494
±+3.481
Neg logit contributions
vernment-3.168
 Aval-3.099
herent-2.972
 emancipation-2.948
Desk-2.896
Token the
Token ID262
Feature activation+1.40e+00

Pos logit contributions
WARNING+0.988
owship+0.981
UC+0.969
 precaution+0.935
±+0.932
Neg logit contributions
vernment-0.825
 Aval-0.822
 emancipation-0.774
herent-0.767
Desk-0.757
Token rocky
Token ID26898
Feature activation+3.16e+00

Pos logit contributions
WARNING+1.012
owship+1.007
UC+0.991
 precaution+0.956
±+0.953
Neg logit contributions
 Aval-0.949
vernment-0.949
 emancipation-0.896
herent-0.887
Desk-0.874
Token transition
Token ID6801
Feature activation+4.32e+00

Pos logit contributions
WARNING+2.352
owship+2.325
UC+2.310
 precaution+2.238
±+2.230
Neg logit contributions
vernment-2.074
 Aval-2.042
herent-1.933
 emancipation-1.932
Desk-1.903
Token back
Token ID736
Feature activation+1.03e+01

Pos logit contributions
WARNING+3.129
owship+3.099
UC+3.080
 precaution+2.981
±+2.974
Neg logit contributions
vernment-2.903
 Aval-2.843
herent-2.714
 emancipation-2.701
Desk-2.677
Token into
Token ID656
Feature activation+5.18e+00

Pos logit contributions
UC+6.633
owship+6.605
WARNING+6.538
±+6.481
 precaution+6.209
Neg logit contributions
vernment-7.201
 Aval-6.888
herent-6.872
 emancipation-6.760
Desk-6.721
Token the
Token ID262
Feature activation+3.27e+00

Pos logit contributions
WARNING+3.752
owship+3.725
UC+3.697
 precaution+3.606
±+3.589
Neg logit contributions
vernment-3.454
 Aval-3.371
herent-3.214
 emancipation-3.210
Desk-3.169
Token work
Token ID670
Feature activation-1.83e-01

Pos logit contributions
WARNING+2.504
owship+2.481
UC+2.463
 precaution+2.391
±+2.382
Neg logit contributions
vernment-2.084
 Aval-2.047
 emancipation-1.935
herent-1.934
Desk-1.906
Token week
Token ID1285
Feature activation+5.26e+00

Pos logit contributions
 Aval+0.110
vernment+0.110
 emancipation+0.103
herent+0.103
Desk+0.102
Neg logit contributions
WARNING-0.146
owship-0.145
UC-0.143
 precaution-0.138
±-0.137
Token.
Token ID13
Feature activation+2.71e+00

Pos logit contributions
WARNING+3.789
owship+3.753
UC+3.743
 precaution+3.637
±+3.630
Neg logit contributions
vernment-3.507
 Aval-3.429
 emancipation-3.280
herent-3.268
Desk-3.246
Token trigger
Token ID7616
Feature activation+2.47e+00

Pos logit contributions
vernment+0.027
 Aval+0.027
 emancipation+0.025
herent+0.025
Desk+0.025
Neg logit contributions
WARNING-0.029
owship-0.028
UC-0.028
±-0.027
 precaution-0.027
Token panic
Token ID13619
Feature activation+8.57e-01

Pos logit contributions
WARNING+1.841
owship+1.816
UC+1.808
 precaution+1.764
±+1.743
Neg logit contributions
vernment-1.624
 Aval-1.601
herent-1.516
 emancipation-1.513
Desk-1.496
Token measures
Token ID5260
Feature activation+2.41e+00

Pos logit contributions
WARNING+0.650
owship+0.644
UC+0.638
 precaution+0.618
±+0.617
Neg logit contributions
vernment-0.552
 Aval-0.545
 emancipation-0.514
herent-0.513
Desk-0.506
Token such
Token ID884
Feature activation-3.89e-01

Pos logit contributions
WARNING+1.839
owship+1.813
UC+1.812
 precaution+1.762
±+1.749
Neg logit contributions
vernment-1.532
 Aval-1.507
 emancipation-1.428
herent-1.427
Desk-1.406
Token as
Token ID355
Feature activation-7.07e-02

Pos logit contributions
vernment+0.221
 Aval+0.218
 emancipation+0.205
herent+0.203
Desk+0.200
Neg logit contributions
WARNING-0.324
owship-0.323
UC-0.318
±-0.308
 precaution-0.306
Token border
Token ID4865
Feature activation+1.03e+01

Pos logit contributions
vernment+0.047
 Aval+0.047
 emancipation+0.045
herent+0.044
Desk+0.043
Neg logit contributions
WARNING-0.052
owship-0.052
UC-0.051
±-0.049
 precaution-0.048
Token closures
Token ID32149
Feature activation+2.82e+00

Pos logit contributions
 precaution+5.767
WARNING+5.630
UC+5.606
owship+5.410
±+5.289
Neg logit contributions
vernment-8.296
 Aval-8.188
herent-7.893
ivery-7.717
 emancipation-7.641
Token and
Token ID290
Feature activation+6.80e-01

Pos logit contributions
WARNING+2.061
UC+2.031
owship+2.029
 precaution+1.970
±+1.956
Neg logit contributions
vernment-1.894
 Aval-1.863
 emancipation-1.771
herent-1.770
Desk-1.742
Token travel
Token ID3067
Feature activation+2.08e+00

Pos logit contributions
WARNING+0.496
owship+0.492
UC+0.487
±+0.468
 precaution+0.464
Neg logit contributions
vernment-0.458
 Aval-0.454
 emancipation-0.431
herent-0.426
Desk-0.420
Token bans
Token ID20075
Feature activation+5.48e+00

Pos logit contributions
WARNING+1.608
owship+1.593
UC+1.580
 precaution+1.530
±+1.526
Neg logit contributions
vernment-1.302
 Aval-1.286
 emancipation-1.214
herent-1.210
Desk-1.192
Token and
Token ID290
Feature activation-1.04e+00

Pos logit contributions
WARNING+3.902
UC+3.859
owship+3.797
 precaution+3.772
±+3.717
Neg logit contributions
vernment-3.680
 Aval-3.600
herent-3.447
 emancipation-3.427
Desk-3.390
Token pic
Token ID8301
Feature activation+5.36e+00

Pos logit contributions
WARNING+4.000
UC+3.928
owship+3.906
 precaution+3.796
±+3.784
Neg logit contributions
vernment-3.777
 Aval-3.703
herent-3.538
 emancipation-3.527
Desk-3.474
Token.
Token ID13
Feature activation+5.14e+00

Pos logit contributions
WARNING+3.505
UC+3.498
±+3.404
owship+3.214
 RW+3.193
Neg logit contributions
vernment-4.099
 Aval-3.863
 emancipation-3.784
ivery-3.695
herent-3.665
Tokentwitter
Token ID6956
Feature activation+6.56e+00

Pos logit contributions
WARNING+2.866
UC+2.839
±+2.716
owship+2.615
 precaution+2.559
Neg logit contributions
vernment-4.364
 Aval-4.274
 emancipation-4.166
Desk-4.053
herent-4.052
Token.
Token ID13
Feature activation+4.90e+00

Pos logit contributions
WARNING+4.094
UC+4.083
±+3.998
RW+3.698
CW+3.693
Neg logit contributions
vernment-5.245
 Aval-4.959
 emancipation-4.879
ivery-4.666
 Contributions-4.636
Tokencom
Token ID785
Feature activation+5.46e+00

Pos logit contributions
WARNING+4.046
UC+4.038
±+3.943
owship+3.814
CW+3.770
Neg logit contributions
vernment-2.882
 Aval-2.772
 emancipation-2.672
herent-2.512
Desk-2.498
Token/
Token ID14
Feature activation+1.02e+01

Pos logit contributions
UC+3.019
WARNING+3.017
±+2.947
owship+2.754
RW+2.722
Neg logit contributions
vernment-4.702
 Aval-4.509
 emancipation-4.410
herent-4.294
ivery-4.249
Token74
Token ID4524
Feature activation+1.17e+00

Pos logit contributions
UC+6.908
WARNING+6.712
owship+6.430
±+6.390
CW+6.384
Neg logit contributions
vernment-7.299
 Aval-7.247
 emancipation-6.829
herent-6.686
 Contributions-6.527
TokenE
Token ID36
Feature activation+2.47e+00

Pos logit contributions
WARNING+0.797
owship+0.792
UC+0.775
 precaution+0.761
±+0.750
Neg logit contributions
vernment-0.837
 Aval-0.828
herent-0.789
 emancipation-0.788
Desk-0.782
Tokeng
Token ID70
Feature activation+1.35e+00

Pos logit contributions
WARNING+1.709
owship+1.689
UC+1.678
 precaution+1.618
±+1.612
Neg logit contributions
vernment-1.754
 Aval-1.735
 emancipation-1.647
herent-1.643
Desk-1.621
TokenV
Token ID53
Feature activation+1.51e+00

Pos logit contributions
WARNING+0.907
owship+0.901
UC+0.884
 precaution+0.864
±+0.854
Neg logit contributions
vernment-0.980
 Aval-0.968
 emancipation-0.921
herent-0.921
Desk-0.912
TokenV
Token ID53
Feature activation+1.62e+00

Pos logit contributions
WARNING+1.030
owship+1.021
UC+1.004
 precaution+0.980
±+0.970
Neg logit contributions
vernment-1.078
 Aval-1.066
 emancipation-1.013
herent-1.013
Desk-1.002
Token pic
Token ID8301
Feature activation+4.96e+00

Pos logit contributions
WARNING+1.216
owship+1.202
UC+1.192
 precaution+1.149
±+1.143
Neg logit contributions
vernment-1.383
 Aval-1.368
 emancipation-1.303
herent-1.300
Desk-1.284
Token.
Token ID13
Feature activation+4.89e+00

Pos logit contributions
WARNING+3.291
UC+3.269
±+3.187
owship+3.073
 precaution+3.020
Neg logit contributions
vernment-3.700
 Aval-3.543
 emancipation-3.409
ivery-3.363
herent-3.349
Tokentwitter
Token ID6956
Feature activation+7.05e+00

Pos logit contributions
WARNING+2.681
UC+2.651
±+2.557
owship+2.437
CW+2.348
Neg logit contributions
vernment-4.213
 Aval-4.160
 emancipation-4.046
herent-3.899
ivery-3.885
Token.
Token ID13
Feature activation+4.74e+00

Pos logit contributions
WARNING+4.300
UC+4.252
±+4.165
CW+3.859
RW+3.812
Neg logit contributions
vernment-5.804
 Aval-5.463
 emancipation-5.360
ivery-5.163
 Contributions-5.122
Tokencom
Token ID785
Feature activation+4.90e+00

Pos logit contributions
WARNING+3.921
UC+3.886
±+3.796
owship+3.687
CW+3.646
Neg logit contributions
vernment-2.799
 Aval-2.725
 emancipation-2.605
herent-2.448
ivery-2.415
Token/
Token ID14
Feature activation+1.01e+01

Pos logit contributions
WARNING+2.745
UC+2.724
±+2.649
owship+2.493
CW+2.461
Neg logit contributions
vernment-4.231
 Aval-4.060
 emancipation-3.947
herent-3.858
ivery-3.820
Tokenu
Token ID84
Feature activation+1.20e+00

Pos logit contributions
UC+6.852
WARNING+6.677
owship+6.394
±+6.349
CW+6.336
Neg logit contributions
vernment-7.270
 Aval-7.209
 emancipation-6.782
herent-6.651
 Contributions-6.479
TokenI
Token ID40
Feature activation-1.54e+00

Pos logit contributions
WARNING+0.808
owship+0.803
UC+0.787
 precaution+0.771
±+0.760
Neg logit contributions
vernment-0.872
 Aval-0.862
herent-0.821
 emancipation-0.821
Desk-0.815
Tokenba
Token ID7012
Feature activation+8.10e-01

Pos logit contributions
vernment+1.066
 Aval+1.060
herent+1.017
Desk+1.017
 emancipation+1.008
Neg logit contributions
owship-1.060
WARNING-1.058
 precaution-1.029
UC-1.006
±-0.995
TokenH
Token ID39
Feature activation-1.11e+00

Pos logit contributions
WARNING+0.547
owship+0.546
UC+0.531
 precaution+0.525
±+0.514
Neg logit contributions
vernment-0.582
 Aval-0.575
herent-0.548
 emancipation-0.547
Desk-0.546
TokenEx
Token ID3109
Feature activation-2.14e+00

Pos logit contributions
vernment+0.782
 Aval+0.775
herent+0.744
Desk+0.743
 emancipation+0.740
Neg logit contributions
owship-0.751
WARNING-0.750
 precaution-0.729
UC-0.717
±-0.704
Token South
Token ID2520
Feature activation+2.02e+00

Pos logit contributions
vernment+0.173
 Aval+0.170
herent+0.161
 emancipation+0.161
Desk+0.158
Neg logit contributions
owship-0.180
WARNING-0.179
UC-0.175
±-0.170
 precaution-0.169
Token Korean
Token ID6983
Feature activation+7.98e-01

Pos logit contributions
WARNING+1.380
UC+1.360
owship+1.354
 precaution+1.313
±+1.303
Neg logit contributions
vernment-1.441
 Aval-1.429
 emancipation-1.356
herent-1.353
Desk-1.338
Token soldier
Token ID10686
Feature activation+2.57e+00

Pos logit contributions
WARNING+0.580
owship+0.576
UC+0.569
 precaution+0.550
±+0.549
Neg logit contributions
vernment-0.539
 Aval-0.533
 emancipation-0.505
herent-0.502
Desk-0.495
Token mans
Token ID25045
Feature activation+8.97e-01

Pos logit contributions
WARNING+1.716
owship+1.696
UC+1.683
 precaution+1.631
±+1.617
Neg logit contributions
vernment-1.884
 Aval-1.859
 emancipation-1.768
herent-1.768
Desk-1.747
Token the
Token ID262
Feature activation-1.73e-01

Pos logit contributions
WARNING+0.705
owship+0.698
UC+0.694
 precaution+0.671
±+0.670
Neg logit contributions
vernment-0.554
 Aval-0.549
 emancipation-0.516
herent-0.515
Desk-0.507
Token border
Token ID4865
Feature activation+9.98e+00

Pos logit contributions
vernment+0.116
 Aval+0.115
 emancipation+0.110
herent+0.108
Desk+0.107
Neg logit contributions
WARNING-0.126
owship-0.126
UC-0.124
±-0.119
 precaution-0.119
Token with
Token ID351
Feature activation+4.82e+00

Pos logit contributions
WARNING+6.358
 precaution+6.256
UC+6.252
owship+6.145
±+5.986
Neg logit contributions
 Aval-7.052
vernment-7.044
ivery-6.788
herent-6.787
 emancipation-6.655
Token North
Token ID2258
Feature activation+3.98e+00

Pos logit contributions
WARNING+3.142
UC+3.051
owship+3.051
 precaution+3.020
±+2.969
Neg logit contributions
vernment-3.554
 Aval-3.487
Desk-3.340
herent-3.336
 emancipation-3.323
Token Korea
Token ID4969
Feature activation+1.04e+00

Pos logit contributions
WARNING+2.294
owship+2.245
UC+2.243
±+2.164
 precaution+2.159
Neg logit contributions
vernment-3.254
 Aval-3.206
herent-3.068
Desk-3.061
 emancipation-3.060
Token,
Token ID11
Feature activation+2.24e+00

Pos logit contributions
WARNING+0.802
owship+0.799
UC+0.790
 precaution+0.762
±+0.759
Neg logit contributions
vernment-0.650
 Aval-0.644
 emancipation-0.607
herent-0.603
Desk-0.594
Token where
Token ID810
Feature activation-1.22e+00

Pos logit contributions
WARNING+1.659
owship+1.643
UC+1.630
 precaution+1.575
±+1.569
Neg logit contributions
vernment-1.476
 Aval-1.461
 emancipation-1.382
herent-1.377
Desk-1.357
Token not
Token ID407
Feature activation+6.09e-02

Pos logit contributions
vernment+0.342
 Aval+0.341
 emancipation+0.322
herent+0.320
Desk+0.313
Neg logit contributions
WARNING-0.407
owship-0.404
UC-0.398
 precaution-0.384
±-0.383
Token originally
Token ID6198
Feature activation-6.78e-01

Pos logit contributions
WARNING+0.046
owship+0.046
UC+0.045
 precaution+0.044
±+0.044
Neg logit contributions
 Aval-0.039
vernment-0.039
 emancipation-0.037
herent-0.037
Desk-0.036
Token imagined
Token ID15758
Feature activation+1.33e+00

Pos logit contributions
 Aval+0.443
vernment+0.439
 emancipation+0.417
herent+0.416
Desk+0.404
Neg logit contributions
WARNING-0.509
owship-0.505
UC-0.499
 precaution-0.479
±-0.477
Token as
Token ID355
Feature activation+2.30e+00

Pos logit contributions
WARNING+1.050
owship+1.042
UC+1.030
 precaution+0.998
±+0.993
Neg logit contributions
vernment-0.817
 Aval-0.811
 emancipation-0.763
herent-0.759
Desk-0.746
Token a
Token ID257
Feature activation+2.17e+00

Pos logit contributions
WARNING+1.779
owship+1.759
UC+1.751
 precaution+1.708
±+1.698
Neg logit contributions
vernment-1.450
 Aval-1.412
 emancipation-1.334
herent-1.330
Desk-1.324
Token border
Token ID4865
Feature activation+9.91e+00

Pos logit contributions
WARNING+1.640
owship+1.618
UC+1.613
 precaution+1.575
±+1.560
Neg logit contributions
vernment-1.400
 Aval-1.374
 emancipation-1.296
herent-1.294
Desk-1.283
Token.
Token ID13
Feature activation-4.61e-01

Pos logit contributions
UC+8.273
WARNING+8.253
owship+8.139
 precaution+8.090
±+7.967
Neg logit contributions
vernment-4.997
 Aval-4.748
 emancipation-4.505
ivery-4.494
 Contributions-4.419
Token The
Token ID383
Feature activation+2.94e-01

Pos logit contributions
 Aval+0.287
vernment+0.282
herent+0.265
 emancipation+0.265
Desk+0.260
Neg logit contributions
WARNING-0.361
owship-0.361
UC-0.353
 precaution-0.346
±-0.340
Token earliest
Token ID14555
Feature activation-1.38e+00

Pos logit contributions
WARNING+0.222
owship+0.221
UC+0.218
 precaution+0.211
±+0.210
Neg logit contributions
vernment-0.190
 Aval-0.189
 emancipation-0.179
herent-0.177
Desk-0.174
Token ancient
Token ID6156
Feature activation+2.02e+00

Pos logit contributions
 Aval+0.913
vernment+0.905
 emancipation+0.867
herent+0.850
Desk+0.833
Neg logit contributions
owship-1.026
WARNING-1.024
UC-1.000
±-0.959
 precaution-0.951
Token texts
Token ID13399
Feature activation-6.66e-01

Pos logit contributions
WARNING+1.568
owship+1.547
UC+1.543
 precaution+1.503
±+1.493
Neg logit contributions
vernment-1.258
 Aval-1.235
herent-1.165
 emancipation-1.163
Desk-1.149
Token pic
Token ID8301
Feature activation+5.64e+00

Pos logit contributions
WARNING+1.042
owship+1.033
UC+1.019
 precaution+0.987
±+0.983
Neg logit contributions
vernment-1.060
 Aval-1.048
 emancipation-0.995
herent-0.992
Desk-0.981
Token.
Token ID13
Feature activation+7.55e+00

Pos logit contributions
WARNING+3.587
UC+3.581
±+3.475
CW+3.254
RW+3.238
Neg logit contributions
vernment-4.450
 Aval-4.213
 emancipation-4.112
ivery-4.022
herent-3.976
Tokentwitter
Token ID6956
Feature activation+8.30e+00

Pos logit contributions
WARNING+3.756
UC+3.680
±+3.553
CW+3.207
RW+3.186
Neg logit contributions
 Aval-6.826
vernment-6.815
 emancipation-6.630
ivery-6.341
 Contributions-6.306
Token.
Token ID13
Feature activation+5.15e+00

Pos logit contributions
WARNING+4.846
UC+4.835
±+4.740
CW+4.379
RW+4.341
Neg logit contributions
vernment-6.960
 Aval-6.499
 emancipation-6.454
ivery-6.188
 Contributions-6.110
Tokencom
Token ID785
Feature activation+5.79e+00

Pos logit contributions
WARNING+4.222
UC+4.200
±+4.101
owship+3.959
CW+3.941
Neg logit contributions
vernment-3.085
 Aval-2.977
 emancipation-2.863
ivery-2.669
herent-2.652
Token/
Token ID14
Feature activation+9.85e+00

Pos logit contributions
WARNING+3.166
UC+3.156
±+3.073
RW+2.840
owship+2.838
Neg logit contributions
vernment-5.083
 Aval-4.837
 emancipation-4.733
herent-4.604
ivery-4.573
Tokenj
Token ID73
Feature activation+1.58e+00

Pos logit contributions
UC+6.664
WARNING+6.503
owship+6.237
±+6.170
CW+6.158
Neg logit contributions
vernment-7.078
 Aval-7.031
 emancipation-6.611
herent-6.480
 Contributions-6.319
TokenAx
Token ID31554
Feature activation+1.17e+00

Pos logit contributions
WARNING+1.075
owship+1.066
UC+1.049
 precaution+1.023
±+1.013
Neg logit contributions
vernment-1.136
 Aval-1.122
 emancipation-1.068
herent-1.067
Desk-1.057
Token6
Token ID21
Feature activation-4.59e-02

Pos logit contributions
WARNING+0.793
owship+0.789
UC+0.772
 precaution+0.758
±+0.746
Neg logit contributions
vernment-0.840
 Aval-0.830
herent-0.791
 emancipation-0.791
Desk-0.785
Tokenc
Token ID66
Feature activation-1.08e-01

Pos logit contributions
vernment+0.033
 Aval+0.032
herent+0.031
Desk+0.031
 emancipation+0.031
Neg logit contributions
WARNING-0.031
owship-0.031
UC-0.030
 precaution-0.030
±-0.029
TokenU
Token ID52
Feature activation-1.98e-01

Pos logit contributions
vernment+0.078
 Aval+0.077
herent+0.074
Desk+0.073
 emancipation+0.073
Neg logit contributions
owship-0.072
WARNING-0.072
UC-0.070
 precaution-0.070
±-0.068
Token pic
Token ID8301
Feature activation+2.74e+00

Pos logit contributions
WARNING+0.169
owship+0.168
UC+0.164
 precaution+0.159
±+0.158
Neg logit contributions
vernment-0.235
 Aval-0.234
 emancipation-0.222
herent-0.222
Desk-0.220
Token.
Token ID13
Feature activation+3.77e+00

Pos logit contributions
WARNING+1.897
UC+1.889
±+1.828
owship+1.796
 precaution+1.754
Neg logit contributions
vernment-1.984
 Aval-1.910
 emancipation-1.842
herent-1.801
ivery-1.795
Tokentwitter
Token ID6956
Feature activation+5.67e+00

Pos logit contributions
WARNING+2.138
UC+2.122
±+2.029
owship+1.978
 precaution+1.912
Neg logit contributions
vernment-3.193
 Aval-3.110
 emancipation-3.035
herent-2.953
Desk-2.938
Token.
Token ID13
Feature activation+4.33e+00

Pos logit contributions
WARNING+3.651
UC+3.641
±+3.558
CW+3.314
RW+3.308
Neg logit contributions
vernment-4.430
 Aval-4.218
 emancipation-4.138
ivery-3.956
 Contributions-3.937
Tokencom
Token ID785
Feature activation+4.87e+00

Pos logit contributions
WARNING+3.630
UC+3.630
±+3.543
owship+3.457
CW+3.394
Neg logit contributions
vernment-2.497
 Aval-2.399
 emancipation-2.304
herent-2.176
Desk-2.166
Token/
Token ID14
Feature activation+9.83e+00

Pos logit contributions
UC+2.769
WARNING+2.768
±+2.682
owship+2.558
RW+2.501
Neg logit contributions
vernment-4.129
 Aval-3.989
 emancipation-3.887
herent-3.784
ivery-3.739
Tokenxx
Token ID5324
Feature activation+2.17e+00

Pos logit contributions
UC+6.653
WARNING+6.476
owship+6.218
±+6.163
CW+6.146
Neg logit contributions
vernment-7.044
 Aval-6.990
 emancipation-6.594
herent-6.456
 Contributions-6.301
TokenA
Token ID32
Feature activation-1.66e+00

Pos logit contributions
WARNING+1.486
owship+1.470
UC+1.454
 precaution+1.409
±+1.400
Neg logit contributions
vernment-1.558
 Aval-1.541
 emancipation-1.465
herent-1.462
Desk-1.445
TokenII
Token ID3978
Feature activation-1.58e+00

Pos logit contributions
vernment+1.160
 Aval+1.152
herent+1.103
Desk+1.100
 emancipation+1.095
Neg logit contributions
owship-1.141
WARNING-1.138
 precaution-1.102
UC-1.085
±-1.069
Token1
Token ID16
Feature activation-1.15e+00

Pos logit contributions
vernment+1.104
 Aval+1.090
herent+1.049
Desk+1.048
 emancipation+1.039
Neg logit contributions
owship-1.085
WARNING-1.082
 precaution-1.051
UC-1.034
±-1.015
Tokenk
Token ID74
Feature activation-2.13e+00

Pos logit contributions
vernment+0.813
 Aval+0.804
herent+0.773
Desk+0.770
 emancipation+0.766
Neg logit contributions
owship-0.777
WARNING-0.773
 precaution-0.749
UC-0.740
±-0.725
Token pic
Token ID8301
Feature activation+4.84e+00

Pos logit contributions
WARNING+1.308
owship+1.296
UC+1.282
 precaution+1.236
±+1.230
Neg logit contributions
vernment-1.484
 Aval-1.470
 emancipation-1.398
herent-1.394
Desk-1.378
Token.
Token ID13
Feature activation+4.07e+00

Pos logit contributions
WARNING+3.182
UC+3.178
±+3.069
CW+2.903
owship+2.894
Neg logit contributions
vernment-3.728
 Aval-3.537
 emancipation-3.448
ivery-3.358
herent-3.342
Tokentwitter
Token ID6956
Feature activation+6.79e+00

Pos logit contributions
WARNING+2.252
UC+2.240
±+2.139
owship+2.047
 precaution+1.984
Neg logit contributions
vernment-3.517
 Aval-3.437
 emancipation-3.357
herent-3.259
ivery-3.237
Token.
Token ID13
Feature activation+4.32e+00

Pos logit contributions
WARNING+4.147
UC+4.121
±+4.024
CW+3.748
RW+3.704
Neg logit contributions
vernment-5.586
 Aval-5.264
 emancipation-5.183
ivery-4.951
 Contributions-4.922
Tokencom
Token ID785
Feature activation+5.37e+00

Pos logit contributions
WARNING+3.595
UC+3.568
±+3.483
owship+3.373
CW+3.356
Neg logit contributions
vernment-2.569
 Aval-2.469
 emancipation-2.372
herent-2.231
ivery-2.203
Token/
Token ID14
Feature activation+9.74e+00

Pos logit contributions
WARNING+2.944
UC+2.935
±+2.861
owship+2.654
CW+2.649
Neg logit contributions
vernment-4.703
 Aval-4.496
 emancipation-4.385
herent-4.269
ivery-4.235
Tokenz
Token ID89
Feature activation+2.45e+00

Pos logit contributions
UC+6.596
WARNING+6.441
owship+6.179
±+6.114
CW+6.100
Neg logit contributions
vernment-6.991
 Aval-6.935
 emancipation-6.528
herent-6.398
 Contributions-6.242
TokenJ
Token ID41
Feature activation+1.19e+00

Pos logit contributions
WARNING+1.668
owship+1.649
UC+1.635
 precaution+1.578
±+1.571
Neg logit contributions
vernment-1.767
 Aval-1.748
 emancipation-1.661
herent-1.657
Desk-1.636
TokenX
Token ID55
Feature activation+1.20e+00

Pos logit contributions
WARNING+0.810
owship+0.805
UC+0.788
 precaution+0.774
±+0.763
Neg logit contributions
vernment-0.853
 Aval-0.844
herent-0.804
 emancipation-0.803
Desk-0.797
Tokenw
Token ID86
Feature activation-3.33e-01

Pos logit contributions
WARNING+0.816
owship+0.812
UC+0.794
 precaution+0.780
±+0.768
Neg logit contributions
vernment-0.856
 Aval-0.846
herent-0.805
 emancipation-0.805
Desk-0.799
Token2
Token ID17
Feature activation-1.99e+00

Pos logit contributions
vernment+0.238
 Aval+0.235
herent+0.224
Desk+0.224
 emancipation+0.224
Neg logit contributions
owship-0.225
WARNING-0.225
UC-0.217
 precaution-0.217
±-0.212
Token pic
Token ID8301
Feature activation+4.87e+00

Pos logit contributions
WARNING+2.535
UC+2.504
owship+2.486
 precaution+2.387
±+2.371
Neg logit contributions
vernment-3.209
 Aval-3.168
 emancipation-3.033
herent-3.025
Desk-2.981
Token.
Token ID13
Feature activation+6.50e+00

Pos logit contributions
WARNING+3.154
UC+3.152
±+3.046
owship+2.879
CW+2.877
Neg logit contributions
vernment-3.782
 Aval-3.580
 emancipation-3.474
ivery-3.416
herent-3.381
Tokentwitter
Token ID6956
Feature activation+7.75e+00

Pos logit contributions
WARNING+3.325
UC+3.282
±+3.134
CW+2.884
RW+2.857
Neg logit contributions
vernment-5.841
 Aval-5.799
 emancipation-5.646
ivery-5.406
herent-5.371
Token.
Token ID13
Feature activation+4.62e+00

Pos logit contributions
WARNING+4.601
UC+4.596
±+4.491
CW+4.179
RW+4.140
Neg logit contributions
vernment-6.451
 Aval-6.049
 emancipation-5.975
ivery-5.728
 Contributions-5.663
Tokencom
Token ID785
Feature activation+5.38e+00

Pos logit contributions
WARNING+3.826
UC+3.807
±+3.700
owship+3.597
CW+3.570
Neg logit contributions
vernment-2.731
 Aval-2.640
 emancipation-2.531
ivery-2.361
herent-2.360
Token/
Token ID14
Feature activation+9.72e+00

Pos logit contributions
WARNING+2.975
UC+2.961
±+2.878
owship+2.687
RW+2.673
Neg logit contributions
vernment-4.689
 Aval-4.483
 emancipation-4.372
herent-4.262
ivery-4.218
TokenO
Token ID46
Feature activation-2.30e+00

Pos logit contributions
UC+6.581
WARNING+6.423
owship+6.163
±+6.093
CW+6.080
Neg logit contributions
vernment-6.985
 Aval-6.938
 emancipation-6.530
herent-6.399
 Contributions-6.240
TokenJV
Token ID41697
Feature activation+1.56e+00

Pos logit contributions
vernment+1.581
 Aval+1.566
Desk+1.517
herent+1.511
 emancipation+1.494
Neg logit contributions
owship-1.580
WARNING-1.569
 precaution-1.532
UC-1.496
±-1.478
TokenL
Token ID43
Feature activation+1.41e+00

Pos logit contributions
WARNING+1.066
owship+1.058
UC+1.040
 precaution+1.016
±+1.004
Neg logit contributions
vernment-1.115
 Aval-1.103
 emancipation-1.049
herent-1.049
Desk-1.039
TokenF
Token ID37
Feature activation+6.17e-01

Pos logit contributions
WARNING+0.967
owship+0.959
UC+0.941
 precaution+0.923
±+0.911
Neg logit contributions
vernment-1.006
 Aval-0.995
herent-0.948
 emancipation-0.947
Desk-0.938
TokenRe
Token ID3041
Feature activation-1.37e+00

Pos logit contributions
WARNING+0.425
owship+0.422
UC+0.412
 precaution+0.407
±+0.400
Neg logit contributions
vernment-0.437
 Aval-0.432
herent-0.412
 emancipation-0.411
Desk-0.409
TokenNR
Token ID24723
Feature activation+2.10e+00

Pos logit contributions
WARNING+0.224
owship+0.221
UC+0.219
 precaution+0.210
±+0.209
Neg logit contributions
vernment-0.324
 Aval-0.321
 emancipation-0.307
herent-0.307
Desk-0.303
Token restrictions
Token ID8733
Feature activation+3.26e+00

Pos logit contributions
WARNING+1.522
owship+1.509
UC+1.488
 precaution+1.440
±+1.436
Neg logit contributions
vernment-1.420
 Aval-1.408
 emancipation-1.334
herent-1.325
Desk-1.305
Token on
Token ID319
Feature activation+2.38e+00

Pos logit contributions
WARNING+2.437
owship+2.404
UC+2.401
 precaution+2.330
±+2.322
Neg logit contributions
vernment-2.120
 Aval-2.085
herent-1.974
 emancipation-1.970
Desk-1.951
Token my
Token ID616
Feature activation+1.71e+00

Pos logit contributions
WARNING+1.684
owship+1.666
UC+1.655
 precaution+1.602
±+1.593
Neg logit contributions
vernment-1.645
 Aval-1.623
 emancipation-1.539
herent-1.537
Desk-1.520
Token research
Token ID2267
Feature activation+2.30e+00

Pos logit contributions
WARNING+1.257
owship+1.242
UC+1.226
 precaution+1.185
±+1.184
Neg logit contributions
vernment-1.141
 Aval-1.131
 emancipation-1.073
herent-1.064
Desk-1.048
Token back
Token ID736
Feature activation+9.72e+00

Pos logit contributions
WARNING+1.735
owship+1.715
UC+1.700
 precaution+1.648
±+1.641
Neg logit contributions
vernment-1.487
 Aval-1.472
 emancipation-1.391
herent-1.386
Desk-1.367
Token in
Token ID287
Feature activation+3.38e+00

Pos logit contributions
UC+6.566
WARNING+6.544
owship+6.421
±+6.334
 precaution+6.199
Neg logit contributions
vernment-6.541
 Aval-6.323
herent-6.272
 emancipation-6.071
ivery-6.040
Token the
Token ID262
Feature activation+4.36e+00

Pos logit contributions
WARNING+2.502
UC+2.472
owship+2.464
 precaution+2.396
±+2.383
Neg logit contributions
vernment-2.232
 Aval-2.172
 emancipation-2.066
herent-2.064
Desk-2.051
Token spring
Token ID6076
Feature activation+5.12e+00

Pos logit contributions
WARNING+3.336
UC+3.305
owship+3.287
 precaution+3.220
±+3.188
Neg logit contributions
vernment-2.755
 Aval-2.667
herent-2.546
 emancipation-2.522
Desk-2.507
Token of
Token ID286
Feature activation+3.20e-02

Pos logit contributions
WARNING+3.617
owship+3.580
UC+3.568
 precaution+3.439
±+3.432
Neg logit contributions
vernment-3.508
 Aval-3.461
 emancipation-3.296
herent-3.291
Desk-3.259
Token 2000
Token ID4751
Feature activation+6.56e-02

Pos logit contributions
WARNING+0.024
owship+0.024
UC+0.024
 precaution+0.023
±+0.023
Neg logit contributions
 Aval-0.021
vernment-0.021
 emancipation-0.020
herent-0.019
Desk-0.019
Token when
Token ID618
Feature activation-2.43e+00

Pos logit contributions
 Aval+1.692
vernment+1.680
 emancipation+1.588
herent+1.571
Desk+1.539
Neg logit contributions
WARNING-2.099
owship-2.087
UC-2.042
 precaution-1.969
±-1.956
Token his
Token ID465
Feature activation-1.73e+00

Pos logit contributions
 Aval+1.511
vernment+1.471
 emancipation+1.412
herent+1.392
Desk+1.349
Neg logit contributions
owship-1.910
WARNING-1.908
UC-1.871
 precaution-1.781
±-1.780
Token effort
Token ID3626
Feature activation-3.18e+00

Pos logit contributions
 Aval+1.023
vernment+1.019
 emancipation+0.963
herent+0.952
Desk+0.926
Neg logit contributions
owship-1.400
WARNING-1.397
UC-1.376
±-1.318
 precaution-1.315
Token hit
Token ID2277
Feature activation-2.64e+00

Pos logit contributions
 Aval+2.166
vernment+2.113
 emancipation+2.031
herent+2.011
ivery+1.974
Neg logit contributions
owship-2.308
WARNING-2.302
UC-2.233
 precaution-2.136
±-2.115
Token an
Token ID281
Feature activation-5.63e+00

Pos logit contributions
 Aval+1.564
vernment+1.540
 emancipation+1.457
herent+1.444
Desk+1.402
Neg logit contributions
WARNING-2.144
owship-2.127
UC-2.102
±-2.022
 precaution-2.019
Token upright
Token ID24826
Feature activation-1.03e+01

Pos logit contributions
 Aval+3.631
vernment+3.475
 emancipation+3.437
herent+3.351
abol+3.309
Neg logit contributions
WARNING-4.261
owship-4.201
UC-4.151
±-3.976
��-3.945
Token and
Token ID290
Feature activation-4.61e+00

Pos logit contributions
 Aval+6.396
vernment+5.929
 emancipation+5.819
herent+5.731
Desk+5.597
Neg logit contributions
owship-7.900
WARNING-7.778
UC-7.619
��-7.191
CW-7.181
Token the
Token ID262
Feature activation-2.91e+00

Pos logit contributions
 Aval+3.008
vernment+2.836
 emancipation+2.772
herent+2.724
abol+2.642
Neg logit contributions
owship-3.520
WARNING-3.464
UC-3.401
±-3.223
 precaution-3.194
Token rebound
Token ID23623
Feature activation-6.05e+00

Pos logit contributions
 Aval+1.871
vernment+1.861
 emancipation+1.769
herent+1.747
Desk+1.702
Neg logit contributions
owship-2.210
WARNING-2.195
UC-2.150
±-2.059
 precaution-2.048
Token was
Token ID373
Feature activation-2.26e+00

Pos logit contributions
 Aval+3.961
vernment+3.743
 emancipation+3.640
herent+3.615
ivery+3.549
Neg logit contributions
owship-4.512
WARNING-4.471
UC-4.346
��-4.160
±-4.122
Token well
Token ID880
Feature activation+2.57e+00

Pos logit contributions
 Aval+1.505
vernment+1.485
 emancipation+1.408
herent+1.407
Desk+1.370
Neg logit contributions
WARNING-1.670
owship-1.663
UC-1.631
±-1.568
 precaution-1.562
Token exp
Token ID1033
Feature activation+1.74e+00

Pos logit contributions
vernment+0.284
 Aval+0.279
 emancipation+0.261
herent+0.259
Desk+0.254
Neg logit contributions
WARNING-0.485
owship-0.479
UC-0.476
 precaution-0.462
±-0.460
Tokenorter
Token ID4337
Feature activation+5.27e-02

Pos logit contributions
WARNING+1.301
owship+1.290
UC+1.283
 precaution+1.239
±+1.237
Neg logit contributions
vernment-1.132
 Aval-1.121
 emancipation-1.059
herent-1.052
Desk-1.041
Token
Token ID851
Feature activation-1.84e+00

Pos logit contributions
WARNING+0.040
owship+0.040
UC+0.039
 precaution+0.038
±+0.038
Neg logit contributions
vernment-0.034
 Aval-0.034
herent-0.032
 emancipation-0.032
Desk-0.031
Token quite
Token ID2407
Feature activation-2.24e+00

Pos logit contributions
vernment+1.174
 Aval+1.154
 emancipation+1.091
herent+1.089
Desk+1.066
Neg logit contributions
WARNING-1.409
owship-1.384
UC-1.379
 precaution-1.324
 UCH-1.304
Token an
Token ID281
Feature activation-5.65e+00

Pos logit contributions
 Aval+1.326
vernment+1.325
 emancipation+1.249
herent+1.237
Desk+1.217
Neg logit contributions
WARNING-1.803
owship-1.784
UC-1.771
±-1.688
 precaution-1.687
Token achievement
Token ID13293
Feature activation-1.01e+01

Pos logit contributions
 Aval+2.970
 emancipation+2.932
vernment+2.919
herent+2.710
Desk+2.659
Neg logit contributions
WARNING-4.893
UC-4.763
owship-4.732
��-4.700
±-4.570
Token for
Token ID329
Feature activation-1.74e+00

Pos logit contributions
vernment+5.239
 Aval+5.121
 emancipation+4.840
Desk+4.639
herent+4.626
Neg logit contributions
WARNING-8.293
owship-8.244
��-7.884
UC-7.843
 precaution-7.789
Token a
Token ID257
Feature activation-2.26e+00

Pos logit contributions
vernment+1.042
 Aval+1.034
 emancipation+0.978
herent+0.964
Desk+0.941
Neg logit contributions
WARNING-1.406
owship-1.390
UC-1.381
±-1.324
 precaution-1.317
Token country
Token ID1499
Feature activation-3.92e+00

Pos logit contributions
vernment+1.152
 Aval+1.138
 emancipation+1.062
herent+1.044
Desk+1.012
Neg logit contributions
WARNING-2.031
owship-2.027
UC-2.001
±-1.928
 precaution-1.906
Token of
Token ID286
Feature activation+9.61e-01

Pos logit contributions
 Aval+2.522
vernment+2.508
 emancipation+2.406
herent+2.378
Desk+2.309
Neg logit contributions
WARNING-2.920
owship-2.907
UC-2.812
 precaution-2.726
±-2.708
Token just
Token ID655
Feature activation-1.75e-01

Pos logit contributions
WARNING+0.729
owship+0.721
UC+0.714
 precaution+0.688
±+0.685
Neg logit contributions
vernment-0.619
 Aval-0.617
 emancipation-0.584
herent-0.581
Desk-0.567
Token of
Token ID286
Feature activation-3.53e+00

Pos logit contributions
 Aval+1.780
vernment+1.669
herent+1.619
 emancipation+1.595
 Contributions+1.584
Neg logit contributions
owship-2.497
WARNING-2.429
UC-2.420
 precaution-2.370
 suspic-2.347
Token the
Token ID262
Feature activation-3.79e+00

Pos logit contributions
 Aval+1.854
vernment+1.738
 emancipation+1.706
herent+1.692
abol+1.627
Neg logit contributions
owship-3.103
WARNING-3.100
UC-3.037
RW-2.959
 precaution-2.941
Token State
Token ID1812
Feature activation-4.19e+00

Pos logit contributions
 Aval+1.931
vernment+1.845
herent+1.770
 emancipation+1.748
 Contributions+1.671
Neg logit contributions
owship-3.397
WARNING-3.367
UC-3.274
 precaution-3.239
±-3.192
Token:
Token ID25
Feature activation-5.01e+00

Pos logit contributions
 Aval+1.857
vernment+1.692
 Contributions+1.681
herent+1.639
 emancipation+1.637
Neg logit contributions
owship-4.109
��-3.962
 precaution-3.907
UC-3.900
 suspic-3.894
Token The
Token ID383
Feature activation-3.77e+00

Pos logit contributions
 Aval+2.139
 Contributions+2.033
herent+1.995
 Birth+1.980
Desk+1.975
Neg logit contributions
owship-5.014
WARNING-4.724
UC-4.672
 precaution-4.670
ENE-4.657
Token Life
Token ID5155
Feature activation-1.00e+01

Pos logit contributions
 Aval+1.837
vernment+1.800
 emancipation+1.701
herent+1.695
Desk+1.642
Neg logit contributions
owship-3.450
WARNING-3.431
UC-3.362
±-3.279
 precaution-3.271
Token of
Token ID286
Feature activation-3.59e+00

Pos logit contributions
vernment+4.816
 Aval+4.631
 Contributions+4.407
 emancipation+4.366
herent+4.344
Neg logit contributions
owship-9.037
WARNING-8.747
 precaution-8.616
 suspic-8.562
UC-8.554
Token Murray
Token ID12164
Feature activation+5.18e-01

Pos logit contributions
 Aval+2.179
vernment+2.120
 emancipation+2.043
herent+2.024
abol+1.985
Neg logit contributions
owship-2.845
WARNING-2.824
UC-2.758
 precaution-2.724
±-2.669
Token N
Token ID399
Feature activation+1.62e-02

Pos logit contributions
WARNING+0.394
UC+0.393
owship+0.383
±+0.377
 precaution+0.369
Neg logit contributions
vernment-0.326
 Aval-0.320
Desk-0.312
herent-0.311
 emancipation-0.309
Token.
Token ID13
Feature activation-4.03e+00

Pos logit contributions
WARNING+0.012
owship+0.012
UC+0.012
 precaution+0.012
±+0.012
Neg logit contributions
vernment-0.010
 Aval-0.010
 emancipation-0.010
herent-0.010
Desk-0.010
Token Roth
Token ID16131
Feature activation-2.43e-01

Pos logit contributions
vernment+2.171
 Aval+2.171
 emancipation+2.004
herent+2.003
Desk+1.944
Neg logit contributions
WARNING-3.463
owship-3.450
 precaution-3.365
UC-3.332
±-3.306
Token higher
Token ID2440
Feature activation-7.86e-01

Pos logit contributions
vernment+1.217
 Aval+1.215
 emancipation+1.164
herent+1.132
Desk+1.100
Neg logit contributions
WARNING-1.615
owship-1.601
UC-1.573
±-1.520
 precaution-1.509
Token energy
Token ID2568
Feature activation-2.67e+00

Pos logit contributions
vernment+0.433
 Aval+0.430
 emancipation+0.407
herent+0.395
Desk+0.389
Neg logit contributions
WARNING-0.670
owship-0.664
UC-0.654
±-0.634
 precaution-0.632
Token prices
Token ID4536
Feature activation-2.24e-02

Pos logit contributions
 Aval+1.031
vernment+1.030
 emancipation+0.976
Desk+0.910
herent+0.907
Neg logit contributions
WARNING-2.681
owship-2.676
UC-2.628
±-2.565
��-2.555
Token were
Token ID547
Feature activation-1.43e+00

Pos logit contributions
vernment+0.015
 Aval+0.015
 emancipation+0.014
herent+0.014
Desk+0.014
Neg logit contributions
WARNING-0.017
owship-0.016
UC-0.016
 precaution-0.016
±-0.016
Token an
Token ID281
Feature activation-2.55e+00

Pos logit contributions
vernment+0.893
 Aval+0.888
 emancipation+0.841
herent+0.833
Desk+0.815
Neg logit contributions
WARNING-1.111
owship-1.103
UC-1.092
±-1.052
 precaution-1.046
Token insol
Token ID35831
Feature activation-9.95e+00

Pos logit contributions
 Aval+1.563
vernment+1.561
 emancipation+1.501
herent+1.475
Desk+1.435
Neg logit contributions
WARNING-1.997
owship-1.971
UC-1.956
±-1.900
 precaution-1.862
Tokenuble
Token ID26664
Feature activation-3.66e+00

Pos logit contributions
herent+5.430
vernment+5.410
 Aval+5.352
ivery+5.163
abol+5.072
Neg logit contributions
WARNING-7.996
��-7.643
 precaution-7.549
±-7.539
owship-7.509
Token problem
Token ID1917
Feature activation-2.87e+00

Pos logit contributions
 Aval+2.252
vernment+2.228
 emancipation+2.183
herent+2.097
abol+2.040
Neg logit contributions
owship-2.859
WARNING-2.857
UC-2.795
±-2.683
��-2.672
Token;
Token ID26
Feature activation-1.77e+00

Pos logit contributions
vernment+1.798
 Aval+1.795
 emancipation+1.701
herent+1.680
Desk+1.640
Neg logit contributions
WARNING-2.209
owship-2.195
UC-2.153
±-2.077
 precaution-2.077
Token to
Token ID284
Feature activation+1.91e+00

Pos logit contributions
vernment+1.081
 Aval+1.069
 emancipation+1.014
herent+0.997
Desk+0.976
Neg logit contributions
WARNING-1.407
owship-1.394
UC-1.375
±-1.329
 precaution-1.329
Token Obama
Token ID2486
Feature activation-1.27e+00

Pos logit contributions
WARNING+1.555
owship+1.540
UC+1.536
 precaution+1.498
±+1.485
Neg logit contributions
vernment-1.114
 Aval-1.094
herent-1.029
 emancipation-1.025
Desk-1.020
Token opt
Token ID2172
Feature activation-5.20e+00

Pos logit contributions
 Aval+1.265
vernment+1.253
herent+1.144
 emancipation+1.141
Desk+1.036
Neg logit contributions
WARNING-5.840
owship-5.796
UC-5.789
 suspic-5.740
 precaution-5.637
Token-
Token ID12
Feature activation-2.22e+00

Pos logit contributions
vernment+2.855
 Aval+2.853
herent+2.660
Desk+2.645
 emancipation+2.644
Neg logit contributions
owship-4.341
WARNING-4.300
 suspic-4.290
UC-4.253
 precaution-4.162
Tokenout
Token ID448
Feature activation-5.26e+00

Pos logit contributions
vernment+1.314
 Aval+1.294
herent+1.220
 emancipation+1.218
Desk+1.201
Neg logit contributions
WARNING-1.789
owship-1.780
UC-1.760
 precaution-1.718
±-1.700
Token at
Token ID379
Feature activation-5.53e+00

Pos logit contributions
 Aval+2.834
vernment+2.696
herent+2.608
 emancipation+2.591
Desk+2.570
Neg logit contributions
owship-4.510
 suspic-4.489
WARNING-4.353
UC-4.325
 precaution-4.245
Token any
Token ID597
Feature activation-6.90e+00

Pos logit contributions
 Aval+3.697
vernment+3.630
 emancipation+3.506
herent+3.494
Desk+3.435
Neg logit contributions
owship-3.983
 suspic-3.971
WARNING-3.948
UC-3.784
 precaution-3.743
Token time
Token ID640
Feature activation-9.82e+00

Pos logit contributions
 Aval+3.101
vernment+3.005
 emancipation+2.880
Desk+2.839
herent+2.813
Neg logit contributions
owship-6.389
 suspic-6.352
WARNING-6.328
UC-6.121
 precaution-6.083
Token.
Token ID13
Feature activation-2.75e+00

Pos logit contributions
vernment+4.551
 Aval+4.544
Desk+4.309
herent+4.231
ivery+4.081
Neg logit contributions
 suspic-9.345
owship-8.684
 reluct-8.582
 precaution-8.450
 neighb-8.360
Token You
Token ID921
Feature activation-4.10e+00

Pos logit contributions
vernment+1.731
 Aval+1.722
herent+1.614
 emancipation+1.608
Desk+1.599
Neg logit contributions
owship-2.111
WARNING-2.102
UC-2.070
 precaution-2.026
±-2.001
Token agree
Token ID4236
Feature activation+3.18e+00

Pos logit contributions
vernment+2.132
 Aval+2.125
 emancipation+1.987
herent+1.945
Desk+1.916
Neg logit contributions
owship-3.535
WARNING-3.523
UC-3.489
 suspic-3.440
��-3.379
Token to
Token ID284
Feature activation-2.01e+00

Pos logit contributions
WARNING+2.318
UC+2.261
owship+2.237
±+2.232
 precaution+2.172
Neg logit contributions
vernment-2.136
 Aval-2.090
 emancipation-2.006
herent-1.967
ivery-1.956
Token receive
Token ID3328
Feature activation-2.17e+00

Pos logit contributions
 Aval+1.345
vernment+1.339
 emancipation+1.267
herent+1.266
Desk+1.254
Neg logit contributions
WARNING-1.464
owship-1.453
UC-1.437
 precaution-1.397
±-1.372
Tokenreal
Token ID5305
Feature activation-1.61e+00

Pos logit contributions
WARNING+1.195
owship+1.182
UC+1.173
 precaution+1.136
±+1.131
Neg logit contributions
vernment-1.069
 Aval-1.056
 emancipation-0.999
herent-0.996
Desk-0.982
Token!
Token ID0
Feature activation-8.92e-02

Pos logit contributions
vernment+0.936
 Aval+0.929
 emancipation+0.870
herent+0.870
Desk+0.846
Neg logit contributions
WARNING-1.309
owship-1.305
UC-1.285
 precaution-1.259
±-1.243
Token Everyday
Token ID48154
Feature activation+2.00e+00

Pos logit contributions
vernment+0.058
 Aval+0.058
herent+0.054
 emancipation+0.054
Desk+0.053
Neg logit contributions
owship-0.066
WARNING-0.066
UC-0.065
 precaution-0.064
±-0.063
Token.
Token ID13
Feature activation+2.36e+00

Pos logit contributions
WARNING+1.517
owship+1.503
UC+1.491
 precaution+1.444
±+1.439
Neg logit contributions
vernment-1.282
 Aval-1.268
 emancipation-1.197
herent-1.194
Desk-1.175
Token Is
Token ID1148
Feature activation+2.23e+00

Pos logit contributions
WARNING+1.730
owship+1.706
UC+1.697
 precaution+1.640
±+1.636
Neg logit contributions
vernment-1.584
 Aval-1.561
 emancipation-1.482
herent-1.477
Desk-1.458
Token Breat
Token ID40285
Feature activation-9.82e+00

Pos logit contributions
WARNING+1.711
owship+1.690
UC+1.682
 precaution+1.628
±+1.626
Neg logit contributions
vernment-1.411
 Aval-1.392
 emancipation-1.313
herent-1.308
Desk-1.296
Tokenhtaking
Token ID34148
Feature activation-1.84e+00

Pos logit contributions
vernment+4.345
herent+4.005
 Aval+4.000
ivery+3.674
 emancipation+3.627
Neg logit contributions
WARNING-9.040
owship-9.032
UC-8.961
 precaution-8.751
��-8.651
Token.
Token ID13
Feature activation+1.64e+00

Pos logit contributions
vernment+1.132
 Aval+1.131
herent+1.056
 emancipation+1.055
Desk+1.031
Neg logit contributions
WARNING-1.442
owship-1.440
UC-1.411
 precaution-1.379
±-1.363
Token Every
Token ID3887
Feature activation+5.67e-01

Pos logit contributions
WARNING+1.211
owship+1.199
UC+1.189
 precaution+1.152
±+1.146
Neg logit contributions
vernment-1.094
 Aval-1.083
 emancipation-1.024
herent-1.021
Desk-1.007
Token day
Token ID1110
Feature activation-4.66e-01

Pos logit contributions
WARNING+0.408
owship+0.405
UC+0.401
 precaution+0.386
±+0.386
Neg logit contributions
vernment-0.387
 Aval-0.383
 emancipation-0.363
herent-0.362
Desk-0.357
Token,
Token ID11
Feature activation-1.44e-01

Pos logit contributions
vernment+0.298
 Aval+0.297
 emancipation+0.279
herent+0.278
Desk+0.272
Neg logit contributions
WARNING-0.354
owship-0.352
UC-0.348
 precaution-0.337
±-0.335
Token opt
Token ID2172
Feature activation-5.18e+00

Pos logit contributions
 Aval+1.275
vernment+1.260
herent+1.154
 emancipation+1.148
Desk+1.040
Neg logit contributions
WARNING-5.858
owship-5.814
UC-5.809
 suspic-5.760
 precaution-5.654
Token-
Token ID12
Feature activation-2.12e+00

Pos logit contributions
vernment+2.853
 Aval+2.849
herent+2.655
 emancipation+2.643
Desk+2.639
Neg logit contributions
owship-4.307
WARNING-4.272
 suspic-4.245
UC-4.229
 precaution-4.130
Tokenout
Token ID448
Feature activation-5.15e+00

Pos logit contributions
vernment+1.261
 Aval+1.243
herent+1.171
 emancipation+1.169
Desk+1.152
Neg logit contributions
WARNING-1.710
owship-1.700
UC-1.682
 precaution-1.641
±-1.625
Token at
Token ID379
Feature activation-5.45e+00

Pos logit contributions
 Aval+2.784
vernment+2.654
herent+2.557
 emancipation+2.541
Desk+2.513
Neg logit contributions
owship-4.412
 suspic-4.395
WARNING-4.262
UC-4.238
 precaution-4.159
Token any
Token ID597
Feature activation-6.85e+00

Pos logit contributions
 Aval+3.642
vernment+3.569
 emancipation+3.453
herent+3.428
Desk+3.373
Neg logit contributions
owship-3.926
 suspic-3.913
WARNING-3.896
UC-3.730
 precaution-3.697
Token time
Token ID640
Feature activation-9.80e+00

Pos logit contributions
 Aval+3.076
vernment+2.984
 emancipation+2.861
Desk+2.807
herent+2.796
Neg logit contributions
owship-6.336
 suspic-6.306
WARNING-6.287
UC-6.074
 precaution-6.038
Token.
Token ID13
Feature activation-2.55e+00

Pos logit contributions
vernment+4.553
 Aval+4.547
Desk+4.293
herent+4.230
ivery+4.084
Neg logit contributions
 suspic-9.311
owship-8.648
 reluct-8.539
 precaution-8.413
WARNING-8.327
Token You
Token ID921
Feature activation-4.21e+00

Pos logit contributions
vernment+1.606
 Aval+1.597
herent+1.499
 emancipation+1.493
Desk+1.483
Neg logit contributions
owship-1.944
WARNING-1.939
UC-1.909
 precaution-1.866
±-1.845
Token agree
Token ID4236
Feature activation+3.14e+00

Pos logit contributions
vernment+2.164
 Aval+2.164
 emancipation+2.012
herent+1.972
Desk+1.943
Neg logit contributions
owship-3.644
WARNING-3.630
UC-3.600
 suspic-3.560
��-3.484
Token to
Token ID284
Feature activation-2.01e+00

Pos logit contributions
WARNING+2.291
UC+2.235
owship+2.213
±+2.204
 precaution+2.150
Neg logit contributions
vernment-2.106
 Aval-2.061
 emancipation-1.977
herent-1.940
ivery-1.929
Token receive
Token ID3328
Feature activation-2.26e+00

Pos logit contributions
 Aval+1.341
vernment+1.334
 emancipation+1.263
herent+1.261
Desk+1.249
Neg logit contributions
WARNING-1.462
owship-1.451
UC-1.435
 precaution-1.394
±-1.370
Token C
Token ID327
Feature activation-6.97e-01

Pos logit contributions
WARNING+2.255
owship+2.236
UC+2.231
 precaution+2.188
±+2.159
Neg logit contributions
vernment-1.755
 Aval-1.744
herent-1.637
Desk-1.627
 emancipation-1.624
TokenCT
Token ID4177
Feature activation-2.35e+00

Pos logit contributions
vernment+0.569
 Aval+0.556
 emancipation+0.537
herent+0.536
Desk+0.529
Neg logit contributions
WARNING-0.412
owship-0.408
 precaution-0.382
UC-0.381
 UCH-0.374
Tokens
Token ID82
Feature activation-5.78e-01

Pos logit contributions
vernment+1.476
 Aval+1.425
 emancipation+1.360
herent+1.347
 Contributions+1.329
Neg logit contributions
WARNING-1.844
owship-1.808
UC-1.744
 precaution-1.718
±-1.709
Token,
Token ID11
Feature activation-1.83e+00

Pos logit contributions
vernment+0.375
 Aval+0.370
 emancipation+0.350
herent+0.349
Desk+0.343
Neg logit contributions
WARNING-0.437
owship-0.432
UC-0.427
 precaution-0.414
±-0.411
Token significantly
Token ID5566
Feature activation-1.31e+00

Pos logit contributions
vernment+1.155
 Aval+1.135
 emancipation+1.082
herent+1.062
Desk+1.043
Neg logit contributions
WARNING-1.422
owship-1.408
UC-1.388
±-1.332
 precaution-1.330
Token allev
Token ID23863
Feature activation-9.75e+00

Pos logit contributions
vernment+0.771
 Aval+0.757
 emancipation+0.727
herent+0.707
Desk+0.694
Neg logit contributions
WARNING-1.071
owship-1.062
UC-1.047
 precaution-1.007
±-1.007
Tokeniating
Token ID26336
Feature activation-3.53e+00

Pos logit contributions
herent+3.517
vernment+3.504
ivery+3.377
 Aval+3.347
abol+3.285
Neg logit contributions
 suspic-9.529
WARNING-9.382
 precaution-9.226
owship-9.139
 reluct-9.137
Token poverty
Token ID8098
Feature activation-9.84e-01

Pos logit contributions
vernment+2.013
 Aval+2.007
 emancipation+1.918
herent+1.852
Desk+1.796
Neg logit contributions
WARNING-2.947
owship-2.912
UC-2.867
±-2.760
 UCH-2.723
Token.
Token ID13
Feature activation-1.23e+00

Pos logit contributions
vernment+0.624
 Aval+0.621
 emancipation+0.588
herent+0.580
Desk+0.569
Neg logit contributions
WARNING-0.752
owship-0.749
UC-0.736
 precaution-0.713
±-0.710
Token In
Token ID554
Feature activation-8.72e-01

Pos logit contributions
 Aval+0.733
vernment+0.733
 emancipation+0.680
herent+0.680
Desk+0.672
Neg logit contributions
WARNING-0.994
owship-0.990
UC-0.973
 precaution-0.952
±-0.939
Token addition
Token ID3090
Feature activation+4.25e+00

Pos logit contributions
vernment+0.528
 Aval+0.523
herent+0.492
 emancipation+0.491
Desk+0.482
Neg logit contributions
WARNING-0.695
owship-0.690
UC-0.681
 precaution-0.660
±-0.659
Token subway
Token ID19612
Feature activation-8.01e-03

Pos logit contributions
vernment+0.152
 Aval+0.151
 emancipation+0.141
herent+0.141
Desk+0.139
Neg logit contributions
WARNING-0.192
owship-0.191
UC-0.189
 precaution-0.183
±-0.182
Token gr
Token ID1036
Feature activation+1.28e+00

Pos logit contributions
vernment+0.005
 Aval+0.005
 emancipation+0.005
Desk+0.005
herent+0.005
Neg logit contributions
WARNING-0.006
owship-0.006
UC-0.006
 precaution-0.006
±-0.006
Tokenates
Token ID689
Feature activation+8.94e-01

Pos logit contributions
WARNING+1.220
owship+1.211
UC+1.207
±+1.174
 precaution+1.173
Neg logit contributions
vernment-0.569
 Aval-0.557
 emancipation-0.516
herent-0.512
Desk-0.501
Token or
Token ID393
Feature activation-5.37e-01

Pos logit contributions
WARNING+0.669
owship+0.661
UC+0.657
 precaution+0.636
±+0.634
Neg logit contributions
vernment-0.584
 Aval-0.577
 emancipation-0.546
herent-0.544
Desk-0.537
Token on
Token ID319
Feature activation+2.79e-01

Pos logit contributions
vernment+0.342
 Aval+0.338
 emancipation+0.318
herent+0.316
Desk+0.313
Neg logit contributions
WARNING-0.412
owship-0.408
UC-0.405
 precaution-0.392
±-0.391
Token sto
Token ID3995
Feature activation-9.65e+00

Pos logit contributions
WARNING+0.212
owship+0.210
UC+0.208
 precaution+0.202
±+0.201
Neg logit contributions
vernment-0.180
 Aval-0.177
 emancipation-0.168
herent-0.167
Desk-0.165
Tokenops
Token ID2840
Feature activation-1.75e+00

Pos logit contributions
vernment+4.774
 Aval+4.584
herent+4.464
ivery+4.369
abol+4.315
Neg logit contributions
owship-8.291
WARNING-8.112
UC-8.102
 precaution-8.049
GAME-7.774
Token or
Token ID393
Feature activation-2.39e-01

Pos logit contributions
vernment+1.115
 Aval+1.106
herent+1.034
 emancipation+1.033
Desk+1.021
Neg logit contributions
WARNING-1.346
owship-1.341
UC-1.316
 precaution-1.280
±-1.268
Token in
Token ID287
Feature activation-1.19e+00

Pos logit contributions
vernment+0.148
 Aval+0.146
 emancipation+0.137
herent+0.137
Desk+0.135
Neg logit contributions
WARNING-0.189
owship-0.187
UC-0.185
 precaution-0.179
±-0.179
Token encamp
Token ID48020
Feature activation+1.12e+00

Pos logit contributions
vernment+0.754
 Aval+0.745
 emancipation+0.698
herent+0.698
Desk+0.686
Neg logit contributions
WARNING-0.914
owship-0.907
UC-0.897
 precaution-0.868
±-0.866
Tokenments
Token ID902
Feature activation+2.47e+00

Pos logit contributions
WARNING+0.635
owship+0.622
UC+0.617
 precaution+0.592
±+0.586
Neg logit contributions
vernment-0.936
 Aval-0.927
herent-0.890
 emancipation-0.886
Desk-0.876
Token opt
Token ID2172
Feature activation-5.03e+00

Pos logit contributions
 Aval+1.241
vernment+1.239
herent+1.131
 emancipation+1.114
Desk+1.025
Neg logit contributions
WARNING-5.728
UC-5.681
owship-5.680
 suspic-5.622
 precaution-5.529
Token-
Token ID12
Feature activation-2.16e+00

Pos logit contributions
vernment+2.768
 Aval+2.759
herent+2.577
Desk+2.563
 emancipation+2.561
Neg logit contributions
owship-4.183
WARNING-4.146
 suspic-4.136
UC-4.103
 precaution-4.014
Tokenout
Token ID448
Feature activation-5.16e+00

Pos logit contributions
vernment+1.279
 Aval+1.259
herent+1.186
 emancipation+1.185
Desk+1.168
Neg logit contributions
WARNING-1.739
owship-1.729
UC-1.711
 precaution-1.670
±-1.653
Token at
Token ID379
Feature activation-5.15e+00

Pos logit contributions
 Aval+2.762
vernment+2.650
herent+2.558
 emancipation+2.528
Desk+2.520
Neg logit contributions
 suspic-4.437
owship-4.423
WARNING-4.275
UC-4.258
 precaution-4.182
Token any
Token ID597
Feature activation-6.70e+00

Pos logit contributions
 Aval+3.462
vernment+3.418
herent+3.293
 emancipation+3.289
Desk+3.231
Neg logit contributions
owship-3.683
 suspic-3.670
WARNING-3.654
UC-3.510
 precaution-3.473
Token time
Token ID640
Feature activation-9.64e+00

Pos logit contributions
 Aval+3.019
vernment+2.952
 emancipation+2.819
Desk+2.766
herent+2.759
Neg logit contributions
owship-6.190
WARNING-6.144
 suspic-6.143
UC-5.947
 precaution-5.895
Token.
Token ID13
Feature activation-2.48e+00

Pos logit contributions
vernment+4.494
 Aval+4.473
Desk+4.267
herent+4.183
ivery+4.032
Neg logit contributions
 suspic-9.137
owship-8.539
 reluct-8.376
 precaution-8.295
WARNING-8.183
Token You
Token ID921
Feature activation-3.92e+00

Pos logit contributions
vernment+1.567
 Aval+1.558
herent+1.462
 emancipation+1.456
Desk+1.446
Neg logit contributions
owship-1.893
WARNING-1.887
UC-1.859
 precaution-1.817
±-1.797
Token agree
Token ID4236
Feature activation+3.32e+00

Pos logit contributions
 Aval+2.025
vernment+2.011
 emancipation+1.871
herent+1.832
Desk+1.808
Neg logit contributions
owship-3.402
WARNING-3.385
UC-3.361
 suspic-3.317
��-3.260
Token to
Token ID284
Feature activation-1.70e+00

Pos logit contributions
WARNING+2.432
UC+2.371
owship+2.347
±+2.334
 precaution+2.280
Neg logit contributions
vernment-2.223
 Aval-2.177
 emancipation-2.088
herent-2.050
ivery-2.038
Token receive
Token ID3328
Feature activation-1.91e+00

Pos logit contributions
 Aval+1.133
vernment+1.126
 emancipation+1.066
herent+1.064
Desk+1.054
Neg logit contributions
WARNING-1.235
owship-1.223
UC-1.211
 precaution-1.177
±-1.157
Token U
Token ID471
Feature activation-3.25e-01

Pos logit contributions
 Aval+1.846
vernment+1.786
 emancipation+1.736
herent+1.713
Desk+1.647
Neg logit contributions
WARNING-2.567
owship-2.537
UC-2.486
±-2.445
 precaution-2.378
Token.
Token ID13
Feature activation-3.88e+00

Pos logit contributions
vernment+0.208
 Aval+0.206
 emancipation+0.194
herent+0.194
Desk+0.191
Neg logit contributions
WARNING-0.248
owship-0.246
UC-0.243
 precaution-0.237
±-0.235
TokenS
Token ID50
Feature activation-3.53e+00

Pos logit contributions
 Aval+2.791
vernment+2.712
Desk+2.649
herent+2.603
 emancipation+2.595
Neg logit contributions
owship-2.649
WARNING-2.554
 suspic-2.525
 precaution-2.512
UC-2.474
Token.
Token ID13
Feature activation-3.86e+00

Pos logit contributions
 Aval+2.099
vernment+2.096
herent+1.967
 emancipation+1.963
Desk+1.944
Neg logit contributions
owship-2.820
WARNING-2.811
UC-2.733
 precaution-2.718
 suspic-2.658
Token Supreme
Token ID5617
Feature activation-7.28e+00

Pos logit contributions
 Aval+1.383
vernment+1.381
 emancipation+1.263
herent+1.252
Desk+1.195
Neg logit contributions
owship-3.987
WARNING-3.971
UC-3.903
±-3.854
 precaution-3.829
Token Court
Token ID3078
Feature activation-9.45e+00

Pos logit contributions
 Aval+1.543
vernment+1.320
herent+1.221
 emancipation+1.139
Desk+1.131
Neg logit contributions
owship-8.660
WARNING-8.528
 suspic-8.425
UC-8.388
 precaution-8.236
Token.
Token ID13
Feature activation-4.27e+00

Pos logit contributions
vernment+4.952
 Aval+4.950
 emancipation+4.629
herent+4.610
Desk+4.515
Neg logit contributions
owship-7.784
WARNING-7.681
UC-7.254
 precaution-7.166
 UCH-7.147
Token The
Token ID383
Feature activation-6.44e-01

Pos logit contributions
 Aval+2.838
vernment+2.734
 Contributions+2.592
Desk+2.588
herent+2.581
Neg logit contributions
owship-3.157
WARNING-3.112
 precaution-3.058
UC-3.008
 suspic-2.984
Token Court
Token ID3078
Feature activation-8.26e+00

Pos logit contributions
 Aval+0.387
vernment+0.386
 emancipation+0.364
herent+0.360
Desk+0.352
Neg logit contributions
WARNING-0.516
owship-0.511
UC-0.503
±-0.492
 precaution-0.487
Token will
Token ID481
Feature activation-1.47e+00

Pos logit contributions
 Aval+4.766
vernment+4.675
 emancipation+4.480
herent+4.387
Desk+4.291
Neg logit contributions
owship-6.522
WARNING-6.449
UC-6.159
±-6.126
��-6.009
Token decide
Token ID5409
Feature activation-6.48e+00

Pos logit contributions
vernment+0.923
 Aval+0.919
 emancipation+0.867
herent+0.867
ivery+0.840
Neg logit contributions
WARNING-1.134
owship-1.132
UC-1.118
±-1.070
 precaution-1.063
TokenIn
Token ID818
Feature activation-2.33e+00

Pos logit contributions
vernment+0.509
 Aval+0.504
Desk+0.485
herent+0.484
 emancipation+0.480
Neg logit contributions
owship-0.528
WARNING-0.515
 precaution-0.509
UC-0.506
±-0.490
Token the
Token ID262
Feature activation-7.65e-02

Pos logit contributions
 Aval+1.370
vernment+1.364
herent+1.303
 emancipation+1.278
Desk+1.246
Neg logit contributions
owship-1.880
WARNING-1.876
UC-1.838
±-1.792
 precaution-1.780
Token months
Token ID1933
Feature activation+1.74e-01

Pos logit contributions
vernment+0.050
 Aval+0.050
 emancipation+0.047
herent+0.047
Desk+0.046
Neg logit contributions
WARNING-0.057
owship-0.057
UC-0.056
±-0.054
 precaution-0.054
Token before
Token ID878
Feature activation-5.45e+00

Pos logit contributions
WARNING+0.127
owship+0.126
UC+0.124
 precaution+0.120
±+0.119
Neg logit contributions
vernment-0.118
 Aval-0.117
 emancipation-0.111
herent-0.110
Desk-0.108
Token the
Token ID262
Feature activation-1.76e+00

Pos logit contributions
 Aval+3.183
vernment+3.130
 emancipation+3.061
herent+2.887
 Birth+2.847
Neg logit contributions
owship-4.407
WARNING-4.350
UC-4.224
±-4.137
 precaution-4.097
Token IPO
Token ID41805
Feature activation-9.42e+00

Pos logit contributions
 Aval+1.212
vernment+1.206
 emancipation+1.158
herent+1.126
ivery+1.104
Neg logit contributions
owship-1.254
WARNING-1.251
UC-1.227
±-1.186
 precaution-1.178
Token,
Token ID11
Feature activation+5.34e-01

Pos logit contributions
 Aval+5.024
vernment+4.945
Desk+4.879
 emancipation+4.813
ivery+4.640
Neg logit contributions
owship-7.710
WARNING-7.451
UC-7.186
 precaution-7.109
±-7.087
Token
Token ID564
Feature activation-9.56e-01

Pos logit contributions
WARNING+0.421
owship+0.418
UC+0.413
 precaution+0.401
±+0.399
Neg logit contributions
vernment-0.327
 Aval-0.325
 emancipation-0.306
herent-0.304
Desk-0.300
Token
Token ID250
Feature activation+1.32e+00

Pos logit contributions
vernment+0.831
 Aval+0.822
herent+0.790
 emancipation+0.786
Desk+0.777
Neg logit contributions
WARNING-0.509
owship-0.506
UC-0.494
 precaution-0.478
±-0.470
TokenBlack
Token ID9915
Feature activation-2.55e+00

Pos logit contributions
WARNING+0.958
owship+0.952
UC+0.938
 precaution+0.914
±+0.903
Neg logit contributions
vernment-0.893
 Aval-0.878
herent-0.831
 emancipation-0.830
Desk-0.820
Tokenfish
Token ID11084
Feature activation+1.10e+00

Pos logit contributions
vernment+1.203
 Aval+1.187
herent+1.088
Desk+1.083
 emancipation+1.082
Neg logit contributions
owship-2.363
WARNING-2.349
UC-2.333
 precaution-2.273
��-2.237
Token early
Token ID1903
Feature activation-8.28e-01

Pos logit contributions
WARNING+1.167
owship+1.149
UC+1.145
 precaution+1.113
±+1.105
Neg logit contributions
vernment-1.038
 Aval-1.022
 emancipation-0.966
herent-0.964
Desk-0.953
Token June
Token ID2795
Feature activation+1.50e+00

Pos logit contributions
 Aval+0.512
vernment+0.509
 emancipation+0.482
herent+0.479
Desk+0.468
Neg logit contributions
owship-0.651
WARNING-0.646
UC-0.638
 precaution-0.611
±-0.610
Token and
Token ID290
Feature activation-4.40e-01

Pos logit contributions
WARNING+1.124
owship+1.116
UC+1.106
 precaution+1.069
±+1.066
Neg logit contributions
vernment-0.974
 Aval-0.959
 emancipation-0.910
herent-0.909
Desk-0.894
Token jailed
Token ID22159
Feature activation-3.06e-01

Pos logit contributions
vernment+0.293
 Aval+0.290
 emancipation+0.276
herent+0.273
Desk+0.270
Neg logit contributions
owship-0.323
WARNING-0.323
UC-0.318
±-0.305
 precaution-0.305
Token in
Token ID287
Feature activation-1.11e+00

Pos logit contributions
 Aval+0.179
vernment+0.179
 emancipation+0.168
herent+0.167
Desk+0.165
Neg logit contributions
WARNING-0.247
owship-0.245
UC-0.245
 precaution-0.236
±-0.234
Token Kuwait
Token ID27028
Feature activation-9.41e+00

Pos logit contributions
vernment+0.688
 Aval+0.685
 emancipation+0.645
herent+0.645
Desk+0.630
Neg logit contributions
owship-0.869
WARNING-0.862
UC-0.854
±-0.818
 precaution-0.814
Token.
Token ID13
Feature activation-1.15e+00

Pos logit contributions
vernment+4.816
 Aval+4.676
herent+4.356
Desk+4.343
ivery+4.334
Neg logit contributions
WARNING-7.625
UC-7.573
owship-7.550
 precaution-7.397
GAME-7.385
Token
Token ID198
Feature activation-5.98e-01

Pos logit contributions
vernment+0.827
 Aval+0.820
 emancipation+0.773
herent+0.773
Desk+0.768
Neg logit contributions
WARNING-0.777
owship-0.775
UC-0.763
 precaution-0.741
±-0.733
Token
Token ID198
Feature activation-1.66e+00

Pos logit contributions
vernment+0.431
 Aval+0.430
herent+0.408
 emancipation+0.407
Desk+0.404
Neg logit contributions
owship-0.403
WARNING-0.402
UC-0.394
 precaution-0.384
±-0.377
TokenOn
Token ID2202
Feature activation+3.79e-01

Pos logit contributions
vernment+1.111
 Aval+1.084
Desk+1.084
herent+1.051
 emancipation+1.033
Neg logit contributions
owship-1.206
 precaution-1.153
WARNING-1.142
UC-1.124
 suspic-1.118
Token July
Token ID2901
Feature activation+1.17e-01

Pos logit contributions
WARNING+0.263
owship+0.262
UC+0.258
±+0.249
 precaution+0.248
Neg logit contributions
vernment-0.268
 Aval-0.266
 emancipation-0.253
herent-0.252
Desk-0.249
Token son
Token ID3367
Feature activation+2.93e+00

Pos logit contributions
WARNING+1.874
owship+1.853
UC+1.839
 precaution+1.789
±+1.774
Neg logit contributions
vernment-1.787
 Aval-1.756
herent-1.664
 emancipation-1.662
Desk-1.643
Token Ash
Token ID7844
Feature activation+2.30e-01

Pos logit contributions
WARNING+2.136
UC+2.113
owship+2.109
 precaution+2.045
±+2.033
Neg logit contributions
vernment-1.960
 Aval-1.910
herent-1.820
 emancipation-1.819
Desk-1.796
Tokenish
Token ID680
Feature activation+7.42e-01

Pos logit contributions
WARNING+0.194
owship+0.192
UC+0.190
 precaution+0.185
±+0.184
Neg logit contributions
vernment-0.129
 Aval-0.129
 emancipation-0.120
herent-0.120
Desk-0.118
Token developed
Token ID4166
Feature activation+6.49e-01

Pos logit contributions
WARNING+0.554
owship+0.550
UC+0.541
 precaution+0.523
±+0.522
Neg logit contributions
vernment-0.486
 Aval-0.484
 emancipation-0.457
herent-0.454
Desk-0.449
Token a
Token ID257
Feature activation+9.99e-01

Pos logit contributions
WARNING+0.476
owship+0.474
UC+0.468
±+0.452
 precaution+0.449
Neg logit contributions
vernment-0.431
 Aval-0.429
 emancipation-0.408
herent-0.405
Desk-0.396
Token breathing
Token ID12704
Feature activation-9.36e+00

Pos logit contributions
owship+0.706
WARNING+0.703
UC+0.692
±+0.669
 precaution+0.660
Neg logit contributions
vernment-0.690
 Aval-0.688
 emancipation-0.652
herent-0.650
Desk-0.638
Token problem
Token ID1917
Feature activation+1.39e-01

Pos logit contributions
vernment+4.892
 Aval+4.806
 emancipation+4.646
herent+4.583
 Miracle+4.397
Neg logit contributions
owship-7.793
UC-7.657
WARNING-7.601
��-7.355
龍契士-7.282
Token.
Token ID13
Feature activation-6.47e-01

Pos logit contributions
WARNING+0.106
owship+0.105
UC+0.104
 precaution+0.100
±+0.100
Neg logit contributions
vernment-0.090
 Aval-0.089
 emancipation-0.084
herent-0.083
Desk-0.082
Token Man
Token ID1869
Feature activation+1.16e+00

Pos logit contributions
vernment+0.424
 Aval+0.423
 emancipation+0.396
herent+0.396
Desk+0.393
Neg logit contributions
WARNING-0.479
owship-0.478
UC-0.471
 precaution-0.459
±-0.455
Tokenj
Token ID73
Feature activation+3.36e+00

Pos logit contributions
WARNING+0.949
owship+0.940
UC+0.937
 precaution+0.909
±+0.906
Neg logit contributions
vernment-0.668
 Aval-0.661
 emancipation-0.619
herent-0.614
Desk-0.607
Tokenula
Token ID4712
Feature activation+4.33e-01

Pos logit contributions
WARNING+1.361
UC+1.328
owship+1.314
±+1.262
 precaution+1.191
Neg logit contributions
 Aval-3.346
vernment-3.337
 emancipation-3.203
herent-3.152
Desk-3.138
Token hope
Token ID2911
Feature activation+9.08e-01

Pos logit contributions
 Aval+0.524
vernment+0.522
 emancipation+0.507
herent+0.494
ivery+0.484
Neg logit contributions
WARNING-0.521
owship-0.513
UC-0.509
±-0.489
 precaution-0.481
Token,
Token ID11
Feature activation+6.62e-01

Pos logit contributions
WARNING+0.690
owship+0.682
UC+0.677
 precaution+0.656
±+0.653
Neg logit contributions
vernment-0.584
 Aval-0.577
 emancipation-0.545
herent-0.543
Desk-0.535
Token striking
Token ID8871
Feature activation-2.90e-01

Pos logit contributions
WARNING+0.512
owship+0.507
UC+0.502
 precaution+0.486
±+0.485
Neg logit contributions
vernment-0.417
 Aval-0.414
 emancipation-0.391
herent-0.389
Desk-0.381
Token at
Token ID379
Feature activation-3.07e+00

Pos logit contributions
 Aval+0.192
vernment+0.190
 emancipation+0.182
herent+0.180
Desk+0.175
Neg logit contributions
WARNING-0.215
owship-0.213
UC-0.211
±-0.203
 precaution-0.202
Token the
Token ID262
Feature activation-3.34e+00

Pos logit contributions
 Aval+1.843
vernment+1.825
 emancipation+1.754
herent+1.714
abol+1.642
Neg logit contributions
WARNING-2.466
owship-2.443
UC-2.414
±-2.322
 precaution-2.301
Token heart
Token ID2612
Feature activation-9.13e+00

Pos logit contributions
 Aval+2.227
vernment+2.183
 emancipation+2.139
herent+2.068
ivery+2.005
Neg logit contributions
owship-2.457
WARNING-2.451
UC-2.399
±-2.310
 UCH-2.261
Token of
Token ID286
Feature activation-1.30e+00

Pos logit contributions
vernment+4.919
 Aval+4.907
herent+4.777
 emancipation+4.670
strings+4.560
Neg logit contributions
WARNING-7.350
owship-7.241
��-7.063
UC-7.005
 precaution-6.993
Token evil
Token ID6181
Feature activation+2.97e+00

Pos logit contributions
 Aval+0.831
vernment+0.823
 emancipation+0.788
herent+0.775
Desk+0.745
Neg logit contributions
WARNING-1.008
owship-1.001
UC-0.989
±-0.947
 precaution-0.944
Token.
Token ID13
Feature activation-9.16e-01

Pos logit contributions
WARNING+2.191
owship+2.169
UC+2.156
 precaution+2.089
±+2.078
Neg logit contributions
vernment-1.963
 Aval-1.931
herent-1.829
 emancipation-1.826
Desk-1.810
Token Then
Token ID3244
Feature activation+1.55e+00

Pos logit contributions
 Aval+0.615
vernment+0.603
herent+0.569
 emancipation+0.568
Desk+0.560
Neg logit contributions
owship-0.673
WARNING-0.670
UC-0.656
 precaution-0.647
±-0.630
Token the
Token ID262
Feature activation-5.31e-01

Pos logit contributions
WARNING+1.192
owship+1.181
UC+1.171
 precaution+1.134
±+1.130
Neg logit contributions
vernment-0.976
 Aval-0.966
 emancipation-0.911
herent-0.907
Desk-0.893
Token books
Token ID3835
Feature activation-1.29e+00

Pos logit contributions
WARNING+0.559
owship+0.554
UC+0.548
 precaution+0.535
±+0.529
Neg logit contributions
vernment-0.439
 Aval-0.432
 emancipation-0.411
herent-0.408
Desk-0.399
Token,
Token ID11
Feature activation+1.92e-01

Pos logit contributions
vernment+0.812
 Aval+0.805
 emancipation+0.759
herent+0.757
Desk+0.740
Neg logit contributions
WARNING-0.987
owship-0.984
UC-0.965
 precaution-0.942
±-0.929
Token such
Token ID884
Feature activation-2.30e+00

Pos logit contributions
WARNING+0.152
owship+0.150
UC+0.149
 precaution+0.145
±+0.144
Neg logit contributions
vernment-0.118
 Aval-0.116
 emancipation-0.110
herent-0.109
Desk-0.107
Token as
Token ID355
Feature activation-8.22e-01

Pos logit contributions
vernment+1.269
 Aval+1.261
 emancipation+1.194
herent+1.180
Desk+1.153
Neg logit contributions
WARNING-1.950
owship-1.950
UC-1.907
 precaution-1.855
±-1.845
Token in
Token ID287
Feature activation-1.71e-01

Pos logit contributions
vernment+0.509
 Aval+0.508
 emancipation+0.480
herent+0.475
Desk+0.465
Neg logit contributions
WARNING-0.641
owship-0.636
UC-0.627
 precaution-0.611
±-0.605
Token Horton
Token ID46051
Feature activation-9.09e+00

Pos logit contributions
vernment+0.105
 Aval+0.104
 emancipation+0.098
herent+0.098
Desk+0.095
Neg logit contributions
WARNING-0.134
owship-0.134
UC-0.132
 precaution-0.128
±-0.128
Token H
Token ID367
Feature activation-4.65e+00

Pos logit contributions
 Aval+5.098
vernment+4.800
herent+4.700
 emancipation+4.624
 Birth+4.572
Neg logit contributions
owship-7.397
WARNING-7.346
 precaution-7.177
UC-7.031
 UCH-6.867
Tokenears
Token ID4127
Feature activation+7.65e-02

Pos logit contributions
 Aval+3.167
vernment+3.150
ivery+3.039
herent+3.032
 emancipation+3.011
Neg logit contributions
WARNING-3.312
owship-3.183
 precaution-3.148
UC-3.092
±-3.057
Token a
Token ID257
Feature activation-1.67e+00

Pos logit contributions
WARNING+0.059
owship+0.059
UC+0.058
 precaution+0.056
±+0.056
Neg logit contributions
vernment-0.048
 Aval-0.047
 emancipation-0.045
herent-0.045
Desk-0.044
Token Who
Token ID5338
Feature activation-4.38e+00

Pos logit contributions
vernment+1.040
 Aval+1.021
 emancipation+0.967
herent+0.962
Desk+0.948
Neg logit contributions
WARNING-1.308
owship-1.286
UC-1.279
±-1.248
 precaution-1.243
Token.
Token ID13
Feature activation-1.02e+00

Pos logit contributions
vernment+2.659
 Aval+2.650
herent+2.457
 emancipation+2.433
Desk+2.364
Neg logit contributions
owship-3.460
WARNING-3.459
UC-3.342
 precaution-3.310
 UCH-3.249
Token country
Token ID1499
Feature activation-3.19e+00

Pos logit contributions
vernment+1.682
 Aval+1.677
 emancipation+1.595
herent+1.560
Desk+1.527
Neg logit contributions
owship-2.051
WARNING-2.032
UC-1.985
±-1.928
 precaution-1.901
Token that
Token ID326
Feature activation-8.91e-01

Pos logit contributions
 Aval+1.902
vernment+1.887
 emancipation+1.814
herent+1.787
Desk+1.716
Neg logit contributions
WARNING-2.539
owship-2.521
UC-2.448
 precaution-2.385
±-2.375
Token claims
Token ID3667
Feature activation+3.40e-01

Pos logit contributions
vernment+0.613
 Aval+0.609
 emancipation+0.583
herent+0.573
Desk+0.560
Neg logit contributions
WARNING-0.639
owship-0.631
UC-0.620
 precaution-0.597
±-0.596
Token no
Token ID645
Feature activation-4.09e+00

Pos logit contributions
WARNING+0.268
owship+0.265
UC+0.263
 precaution+0.254
±+0.253
Neg logit contributions
vernment-0.209
 Aval-0.207
 emancipation-0.196
herent-0.195
Desk-0.190
Token towering
Token ID38879
Feature activation-1.48e+00

Pos logit contributions
vernment+2.696
 Aval+2.677
 emancipation+2.653
herent+2.533
 Birth+2.423
Neg logit contributions
WARNING-3.012
owship-2.963
UC-2.926
±-2.770
��-2.769
Token invent
Token ID8067
Feature activation-9.03e+00

Pos logit contributions
 Aval+0.957
vernment+0.952
 emancipation+0.914
herent+0.891
Desk+0.873
Neg logit contributions
WARNING-1.128
owship-1.114
UC-1.101
±-1.057
 precaution-1.050
Tokenors
Token ID669
Feature activation-3.94e+00

Pos logit contributions
vernment+3.226
 Aval+3.126
herent+2.999
ivery+2.949
 emancipation+2.885
Neg logit contributions
WARNING-9.113
owship-8.717
UC-8.665
 precaution-8.661
 UCH-8.594
Token or
Token ID393
Feature activation-6.25e-02

Pos logit contributions
vernment+2.440
 Aval+2.414
 emancipation+2.283
herent+2.266
Desk+2.210
Neg logit contributions
WARNING-3.063
owship-3.045
UC-2.983
 precaution-2.901
±-2.866
Token explorers
Token ID45531
Feature activation-9.13e-01

Pos logit contributions
vernment+0.036
 Aval+0.035
 emancipation+0.033
herent+0.033
Desk+0.032
Neg logit contributions
WARNING-0.052
owship-0.052
UC-0.051
 precaution-0.050
±-0.049
Token,
Token ID11
Feature activation-4.18e-01

Pos logit contributions
vernment+0.588
 Aval+0.581
 emancipation+0.550
herent+0.547
Desk+0.537
Neg logit contributions
WARNING-0.693
owship-0.686
UC-0.680
 precaution-0.658
±-0.655
Token C
Token ID327
Feature activation-1.53e+00

Pos logit contributions
vernment+0.260
 Aval+0.260
 emancipation+0.246
herent+0.242
Desk+0.237
Neg logit contributions
WARNING-0.326
owship-0.324
UC-0.319
 precaution-0.309
±-0.308
Token Oklahoma
Token ID10433
Feature activation-4.96e+00

Pos logit contributions
 Aval+0.733
vernment+0.719
herent+0.673
 emancipation+0.670
Desk+0.667
Neg logit contributions
owship-0.888
WARNING-0.883
UC-0.859
 precaution-0.851
±-0.832
Token,
Token ID11
Feature activation-3.64e+00

Pos logit contributions
 Aval+3.050
vernment+3.025
 emancipation+2.865
herent+2.762
Desk+2.729
Neg logit contributions
WARNING-3.791
owship-3.783
UC-3.686
±-3.580
��-3.548
Token a
Token ID257
Feature activation-2.26e+00

Pos logit contributions
 Aval+2.325
vernment+2.312
 emancipation+2.194
herent+2.160
Desk+2.087
Neg logit contributions
WARNING-2.738
owship-2.716
UC-2.657
±-2.567
 precaution-2.549
Token seism
Token ID30518
Feature activation-4.14e+00

Pos logit contributions
vernment+1.253
 Aval+1.252
herent+1.176
 emancipation+1.174
Desk+1.126
Neg logit contributions
owship-1.906
WARNING-1.892
UC-1.846
±-1.796
��-1.767
Tokenically
Token ID1146
Feature activation-2.15e+00

Pos logit contributions
vernment+2.416
 Aval+2.313
herent+2.267
Desk+2.176
ivery+2.167
Neg logit contributions
WARNING-3.334
owship-3.319
UC-3.253
��-3.210
 precaution-3.179
Token inactive
Token ID28621
Feature activation-9.00e+00

Pos logit contributions
vernment+1.541
 Aval+1.530
herent+1.484
 emancipation+1.471
Desk+1.435
Neg logit contributions
owship-1.460
WARNING-1.437
UC-1.403
±-1.341
��-1.339
Token area
Token ID1989
Feature activation-1.40e+00

Pos logit contributions
 Aval+3.890
 emancipation+3.664
herent+3.591
vernment+3.584
 admission+3.326
Neg logit contributions
owship-8.379
WARNING-8.325
��-8.092
CW-7.943
UC-7.919
Token before
Token ID878
Feature activation-3.64e+00

Pos logit contributions
vernment+0.880
 Aval+0.876
 emancipation+0.822
herent+0.819
Desk+0.794
Neg logit contributions
WARNING-1.084
owship-1.074
UC-1.052
 precaution-1.016
±-1.006
Token gas
Token ID3623
Feature activation-2.87e+00

Pos logit contributions
vernment+2.111
 Aval+2.106
 emancipation+2.044
herent+1.944
Desk+1.888
Neg logit contributions
owship-2.953
WARNING-2.927
UC-2.837
±-2.769
��-2.734
Token fracking
Token ID25441
Feature activation+2.48e+00

Pos logit contributions
vernment+1.847
 Aval+1.812
 emancipation+1.797
Desk+1.743
herent+1.721
Neg logit contributions
owship-2.137
WARNING-2.110
UC-2.080
±-2.017
 precaution-1.997
Token operations
Token ID4560
Feature activation-8.82e-01

Pos logit contributions
WARNING+1.731
owship+1.707
UC+1.698
 precaution+1.645
±+1.634
Neg logit contributions
vernment-1.742
 Aval-1.719
herent-1.629
 emancipation-1.627
Desk-1.607
Token deliver
Token ID5203
Feature activation-1.56e+00

Pos logit contributions
 Aval+0.815
vernment+0.815
 emancipation+0.762
herent+0.762
Desk+0.755
Neg logit contributions
WARNING-0.905
owship-0.895
UC-0.882
 precaution-0.855
±-0.851
Token daily
Token ID4445
Feature activation-1.37e+00

Pos logit contributions
 Aval+1.004
vernment+0.994
herent+0.944
 emancipation+0.939
Desk+0.927
Neg logit contributions
WARNING-1.181
owship-1.169
UC-1.155
 precaution-1.112
±-1.110
Token investing
Token ID14771
Feature activation-3.15e+00

Pos logit contributions
 Aval+0.854
vernment+0.850
herent+0.799
 emancipation+0.799
Desk+0.787
Neg logit contributions
WARNING-1.062
owship-1.052
UC-1.043
±-1.004
 precaution-1.003
Token tips
Token ID9040
Feature activation-2.59e+00

Pos logit contributions
 Aval+1.937
vernment+1.870
 emancipation+1.828
herent+1.796
Desk+1.773
Neg logit contributions
owship-2.466
WARNING-2.451
UC-2.434
��-2.353
±-2.311
Token and
Token ID290
Feature activation-3.58e+00

Pos logit contributions
 Aval+1.596
vernment+1.584
herent+1.485
 emancipation+1.475
Desk+1.464
Neg logit contributions
owship-2.027
WARNING-2.025
UC-1.966
 precaution-1.932
 UCH-1.886
Token stock
Token ID4283
Feature activation-8.98e+00

Pos logit contributions
 Aval+2.246
vernment+2.217
 emancipation+2.115
herent+2.109
Desk+2.082
Neg logit contributions
WARNING-2.731
owship-2.694
UC-2.670
±-2.567
��-2.551
Token picks
Token ID11103
Feature activation-2.32e+00

Pos logit contributions
 Aval+5.072
 emancipation+4.987
vernment+4.876
Desk+4.871
herent+4.816
Neg logit contributions
WARNING-6.874
owship-6.857
UC-6.601
��-6.547
ENE-6.463
Token,
Token ID11
Feature activation-2.08e+00

Pos logit contributions
 Aval+1.424
vernment+1.416
 emancipation+1.333
herent+1.332
Desk+1.305
Neg logit contributions
owship-1.821
WARNING-1.817
UC-1.779
 precaution-1.723
±-1.699
Token provide
Token ID2148
Feature activation-2.46e+00

Pos logit contributions
vernment+1.346
 Aval+1.344
 emancipation+1.261
herent+1.259
Desk+1.241
Neg logit contributions
WARNING-1.571
owship-1.557
UC-1.537
±-1.472
 precaution-1.470
Token analysis
Token ID3781
Feature activation-1.68e+00

Pos logit contributions
 Aval+1.554
vernment+1.541
 emancipation+1.461
herent+1.454
Desk+1.435
Neg logit contributions
WARNING-1.896
owship-1.875
UC-1.850
±-1.779
 precaution-1.764
Token with
Token ID351
Feature activation+2.90e+00

Pos logit contributions
 Aval+1.040
vernment+1.037
herent+0.972
 emancipation+0.963
Desk+0.957
Neg logit contributions
WARNING-1.325
owship-1.316
UC-1.288
 precaution-1.252
±-1.240
Token his
Token ID465
Feature activation+6.16e-01

Pos logit contributions
vernment+0.282
 Aval+0.279
 emancipation+0.266
herent+0.264
Desk+0.259
Neg logit contributions
WARNING-0.348
owship-0.343
UC-0.341
 precaution-0.328
±-0.328
Token r
Token ID374
Feature activation+1.40e+00

Pos logit contributions
WARNING+0.481
owship+0.476
UC+0.472
 precaution+0.457
±+0.456
Neg logit contributions
vernment-0.384
 Aval-0.379
 emancipation-0.360
herent-0.358
Desk-0.351
Tokenags
Token ID3775
Feature activation+4.70e-01

Pos logit contributions
WARNING+1.156
owship+1.147
UC+1.140
 precaution+1.104
±+1.103
Neg logit contributions
vernment-0.799
 Aval-0.796
 emancipation-0.739
herent-0.735
Desk-0.728
Token-
Token ID12
Feature activation-2.15e+00

Pos logit contributions
WARNING+0.368
owship+0.365
UC+0.362
 precaution+0.350
±+0.349
Neg logit contributions
vernment-0.293
 Aval-0.288
 emancipation-0.271
herent-0.271
Desk-0.266
Tokento
Token ID1462
Feature activation-4.99e+00

Pos logit contributions
vernment+1.147
 Aval+1.093
herent+1.061
abol+1.059
ivery+1.015
Neg logit contributions
WARNING-1.867
owship-1.864
UC-1.844
 precaution-1.817
 UCH-1.786
Token-
Token ID12
Feature activation-8.95e+00

Pos logit contributions
vernment+2.872
 Aval+2.821
herent+2.675
 emancipation+2.621
ivery+2.611
Neg logit contributions
owship-4.092
WARNING-4.071
UC-4.015
 suspic-3.896
 precaution-3.886
Tokenric
Token ID1173
Feature activation-3.51e+00

Pos logit contributions
 Aval+2.950
vernment+2.828
abol+2.720
herent+2.673
ivery+2.630
Neg logit contributions
��-9.128
owship-9.102
 precaution-9.098
UC-9.078
WARNING-8.948
Tokenhes
Token ID956
Feature activation-7.72e-01

Pos logit contributions
vernment+1.433
 Aval+1.385
herent+1.288
 emancipation+1.272
Desk+1.237
Neg logit contributions
WARNING-3.492
owship-3.459
UC-3.432
 precaution-3.349
±-3.343
Token story
Token ID1621
Feature activation+6.25e-02

Pos logit contributions
vernment+0.424
 Aval+0.417
 emancipation+0.397
herent+0.391
ivery+0.380
Neg logit contributions
WARNING-0.660
owship-0.658
UC-0.650
 precaution-0.627
±-0.627
Token"
Token ID1
Feature activation+2.29e+00

Pos logit contributions
WARNING+0.053
owship+0.052
UC+0.052
 precaution+0.050
±+0.050
Neg logit contributions
vernment-0.035
 Aval-0.035
 emancipation-0.032
herent-0.032
Desk-0.032
Token<|endoftext|>
Token ID50256
Feature activation+1.87e+00

Pos logit contributions
WARNING+1.674
owship+1.644
UC+1.641
 precaution+1.583
±+1.580
Neg logit contributions
vernment-1.548
 Aval-1.519
 emancipation-1.451
herent-1.442
Desk-1.420
Token don
Token ID836
Feature activation-5.24e-01

Pos logit contributions
WARNING+0.707
owship+0.699
UC+0.692
 precaution+0.668
±+0.664
Neg logit contributions
vernment-0.789
 Aval-0.781
 emancipation-0.743
herent-0.742
Desk-0.732
Token't
Token ID470
Feature activation+1.13e+00

Pos logit contributions
vernment+0.374
 Aval+0.370
 emancipation+0.351
herent+0.351
Desk+0.346
Neg logit contributions
WARNING-0.360
owship-0.356
UC-0.353
 precaution-0.341
±-0.339
Token even
Token ID772
Feature activation+1.31e+00

Pos logit contributions
WARNING+0.851
owship+0.841
UC+0.837
 precaution+0.809
±+0.809
Neg logit contributions
vernment-0.740
 Aval-0.727
 emancipation-0.686
herent-0.683
Desk-0.675
Token play
Token ID711
Feature activation+5.19e+00

Pos logit contributions
WARNING+0.948
owship+0.938
UC+0.930
 precaution+0.900
±+0.896
Neg logit contributions
vernment-0.895
 Aval-0.885
 emancipation-0.838
herent-0.836
Desk-0.825
Token games
Token ID1830
Feature activation+5.68e+00

Pos logit contributions
WARNING+2.256
owship+2.184
UC+2.170
±+2.071
 precaution+2.044
Neg logit contributions
vernment-4.989
 Aval-4.955
 emancipation-4.779
herent-4.756
Desk-4.732
Token with
Token ID351
Feature activation+6.91e+00

Pos logit contributions
WARNING+2.864
UC+2.793
±+2.713
owship+2.710
 precaution+2.600
Neg logit contributions
vernment-5.086
 Aval-5.028
 emancipation-4.854
ivery-4.770
Desk-4.732
Token AA
Token ID15923
Feature activation-8.03e-03

Pos logit contributions
WARNING+3.375
±+3.238
UC+3.231
owship+3.168
 RW+3.089
Neg logit contributions
vernment-6.257
 Aval-5.893
Desk-5.850
 emancipation-5.803
herent-5.797
Token on
Token ID319
Feature activation+1.09e+00

Pos logit contributions
vernment+0.005
 Aval+0.005
 emancipation+0.004
herent+0.004
Desk+0.004
Neg logit contributions
WARNING-0.006
owship-0.006
UC-0.006
 precaution-0.006
±-0.006
Token (
Token ID357
Feature activation+5.50e-01

Pos logit contributions
WARNING+0.845
UC+0.830
owship+0.820
±+0.805
 precaution+0.798
Neg logit contributions
vernment-0.704
 Aval-0.683
 emancipation-0.647
herent-0.642
ivery-0.635
TokenAnti
Token ID28795
Feature activation-2.04e+00

Pos logit contributions
WARNING+0.360
owship+0.359
UC+0.354
 precaution+0.343
±+0.339
Neg logit contributions
vernment-0.407
 Aval-0.405
 emancipation-0.385
herent-0.384
Desk-0.380
TokenAli
Token ID37893
Feature activation+5.78e-01

Pos logit contributions
 Aval+1.409
vernment+1.405
herent+1.342
Desk+1.333
 emancipation+1.330
Neg logit contributions
owship-1.444
WARNING-1.420
UC-1.405
 precaution-1.371
±-1.338
Token fan
Token ID4336
Feature activation-7.31e-01

Pos logit contributions
WARNING+1.029
owship+1.019
UC+1.007
 precaution+0.968
±+0.965
Neg logit contributions
vernment-1.219
 Aval-1.208
 emancipation-1.153
herent-1.149
Desk-1.135
Token poses
Token ID17313
Feature activation+2.07e+00

Pos logit contributions
vernment+0.496
 Aval+0.495
 emancipation+0.470
herent+0.468
Desk+0.460
Neg logit contributions
WARNING-0.528
owship-0.522
UC-0.512
 precaution-0.497
±-0.495
Token for
Token ID329
Feature activation+1.90e+00

Pos logit contributions
WARNING+1.504
owship+1.489
UC+1.474
 precaution+1.425
±+1.423
Neg logit contributions
vernment-1.405
 Aval-1.392
 emancipation-1.319
herent-1.314
Desk-1.296
Token a
Token ID257
Feature activation+1.68e+00

Pos logit contributions
WARNING+1.444
owship+1.430
UC+1.420
 precaution+1.371
±+1.371
Neg logit contributions
vernment-1.218
 Aval-1.202
 emancipation-1.139
herent-1.134
Desk-1.116
Token photograph
Token ID8408
Feature activation+3.84e+00

Pos logit contributions
WARNING+1.145
owship+1.133
UC+1.122
 precaution+1.083
±+1.079
Neg logit contributions
vernment-1.204
 Aval-1.191
 emancipation-1.133
herent-1.129
Desk-1.115
Token next
Token ID1306
Feature activation+6.44e+00

Pos logit contributions
WARNING+2.631
owship+2.590
UC+2.587
 precaution+2.517
±+2.503
Neg logit contributions
vernment-2.746
 Aval-2.686
herent-2.543
 emancipation-2.541
Desk-2.519
Token to
Token ID284
Feature activation+2.04e+00

Pos logit contributions
WARNING+4.225
UC+4.155
 precaution+4.146
±+4.107
owship+3.968
Neg logit contributions
vernment-4.621
 Aval-4.444
ivery-4.256
herent-4.232
Desk-4.212
Token Walker
Token ID10120
Feature activation-2.06e+00

Pos logit contributions
WARNING+1.548
owship+1.527
UC+1.525
 precaution+1.492
±+1.474
Neg logit contributions
vernment-1.310
 Aval-1.278
herent-1.198
 emancipation-1.197
Desk-1.192
Token's
Token ID338
Feature activation-7.58e-01

Pos logit contributions
 Aval+1.262
vernment+1.259
herent+1.183
 emancipation+1.176
Desk+1.161
Neg logit contributions
WARNING-1.611
owship-1.607
UC-1.565
 precaution-1.530
±-1.517
Token statue
Token ID15207
Feature activation-4.16e-01

Pos logit contributions
vernment+0.479
 Aval+0.479
 emancipation+0.456
herent+0.450
Desk+0.445
Neg logit contributions
WARNING-0.579
owship-0.575
UC-0.568
±-0.547
 precaution-0.546
Token\n
Token ID198
Feature activation-3.18e-01

Pos logit contributions
vernment+0.256
 Aval+0.255
herent+0.240
 emancipation+0.240
Desk+0.237
Neg logit contributions
WARNING-0.325
owship-0.324
UC-0.318
 precaution-0.309
±-0.307
Token Vladimir
Token ID14840
Feature activation-9.34e-01

Pos logit contributions
vernment+0.315
 Aval+0.314
herent+0.290
 emancipation+0.289
Desk+0.286
Neg logit contributions
owship-0.439
WARNING-0.438
UC-0.431
 precaution-0.418
±-0.415
Token Putin
Token ID8144
Feature activation+1.62e+00

Pos logit contributions
 Aval+0.587
vernment+0.583
 emancipation+0.543
herent+0.541
Desk+0.537
Neg logit contributions
owship-0.721
WARNING-0.721
UC-0.706
 precaution-0.686
±-0.679
Token chairs
Token ID18791
Feature activation+4.54e-01

Pos logit contributions
WARNING+1.101
owship+1.090
UC+1.078
 precaution+1.038
±+1.034
Neg logit contributions
vernment-1.174
 Aval-1.163
 emancipation-1.103
herent-1.101
Desk-1.088
Token a
Token ID257
Feature activation+3.32e+00

Pos logit contributions
WARNING+0.361
owship+0.359
UC+0.354
±+0.342
 precaution+0.341
Neg logit contributions
vernment-0.277
 Aval-0.273
 emancipation-0.256
herent-0.255
Desk-0.252
Token meeting
Token ID3249
Feature activation+4.30e+00

Pos logit contributions
WARNING+2.592
owship+2.558
UC+2.549
 precaution+2.485
±+2.466
Neg logit contributions
vernment-2.060
 Aval-2.031
herent-1.920
 emancipation-1.916
Desk-1.887
Token with
Token ID351
Feature activation+7.17e+00

Pos logit contributions
WARNING+2.874
owship+2.852
UC+2.829
 precaution+2.745
±+2.728
Neg logit contributions
vernment-3.127
 Aval-3.075
 emancipation-2.942
herent-2.928
Desk-2.890
Token government
Token ID1230
Feature activation+5.18e+00

Pos logit contributions
WARNING+5.063
UC+5.063
 precaution+5.004
owship+4.967
±+4.919
Neg logit contributions
vernment-4.719
 Aval-4.512
ivery-4.392
 emancipation-4.371
Desk-4.371
Token members
Token ID1866
Feature activation+2.27e+00

Pos logit contributions
WARNING+4.855
owship+4.795
UC+4.784
 precaution+4.708
±+4.689
Neg logit contributions
vernment-2.361
 Aval-2.303
herent-2.135
 emancipation-2.126
ivery-2.094
Token at
Token ID379
Feature activation+3.41e+00

Pos logit contributions
WARNING+1.751
owship+1.735
UC+1.716
 precaution+1.661
±+1.652
Neg logit contributions
vernment-1.440
 Aval-1.424
 emancipation-1.338
herent-1.335
Desk-1.319
Token the
Token ID262
Feature activation+2.03e+00

Pos logit contributions
WARNING+2.558
owship+2.527
UC+2.525
 precaution+2.471
±+2.447
Neg logit contributions
vernment-2.174
 Aval-2.130
herent-2.031
 emancipation-2.028
Desk-2.015
Token Kremlin
Token ID19077
Feature activation+9.30e-01

Pos logit contributions
WARNING+1.538
owship+1.520
UC+1.513
 precaution+1.477
±+1.463
Neg logit contributions
vernment-1.297
 Aval-1.272
herent-1.206
 emancipation-1.204
Desk-1.191
Token century
Token ID4289
Feature activation-2.64e-01

Pos logit contributions
WARNING+0.343
owship+0.341
UC+0.337
±+0.327
 precaution+0.326
Neg logit contributions
vernment-0.171
 Aval-0.170
 emancipation-0.158
herent-0.155
Desk-0.152
Token the
Token ID262
Feature activation-3.12e-01

Pos logit contributions
 Aval+0.168
vernment+0.166
 emancipation+0.158
herent+0.156
Desk+0.152
Neg logit contributions
WARNING-0.203
owship-0.202
UC-0.199
 precaution-0.192
±-0.191
Token old
Token ID1468
Feature activation+5.86e+00

Pos logit contributions
 Aval+0.190
vernment+0.189
 emancipation+0.180
herent+0.177
Desk+0.174
Neg logit contributions
WARNING-0.247
owship-0.245
UC-0.242
±-0.234
 precaution-0.233
Token religious
Token ID4158
Feature activation+1.19e+00

Pos logit contributions
WARNING+4.507
UC+4.484
owship+4.455
 precaution+4.368
±+4.319
Neg logit contributions
vernment-3.648
 Aval-3.469
herent-3.308
ivery-3.285
Desk-3.282
Token tradition
Token ID6761
Feature activation+4.31e+00

Pos logit contributions
WARNING+0.892
owship+0.883
UC+0.875
 precaution+0.845
±+0.844
Neg logit contributions
vernment-0.781
 Aval-0.777
 emancipation-0.735
herent-0.732
Desk-0.720
Token with
Token ID351
Feature activation+6.57e+00

Pos logit contributions
WARNING+3.041
UC+3.006
owship+3.004
 precaution+2.913
±+2.903
Neg logit contributions
vernment-2.968
 Aval-2.904
herent-2.765
Desk-2.749
 emancipation-2.747
Token its
Token ID663
Feature activation+2.12e+00

Pos logit contributions
WARNING+4.751
UC+4.711
owship+4.682
 precaution+4.588
±+4.547
Neg logit contributions
vernment-4.320
 Aval-4.177
Desk-3.993
ivery-3.979
herent-3.938
Token am
Token ID716
Feature activation-2.09e+00

Pos logit contributions
WARNING+1.557
owship+1.540
UC+1.529
 precaution+1.483
±+1.474
Neg logit contributions
vernment-1.420
 Aval-1.399
 emancipation-1.322
herent-1.321
Desk-1.306
Tokenoral
Token ID6864
Feature activation-1.40e+00

Pos logit contributions
vernment+1.400
 Aval+1.384
herent+1.329
 emancipation+1.310
ivery+1.294
Neg logit contributions
WARNING-1.533
owship-1.510
UC-1.504
 precaution-1.458
±-1.442
Token and
Token ID290
Feature activation+3.15e+00

Pos logit contributions
 Aval+0.886
vernment+0.875
 emancipation+0.847
herent+0.831
abol+0.803
Neg logit contributions
WARNING-1.075
owship-1.068
UC-1.055
±-1.010
 precaution-1.007
Token anthrop
Token ID17911
Feature activation-1.07e+00

Pos logit contributions
WARNING+2.268
owship+2.239
UC+2.226
 precaution+2.167
±+2.151
Neg logit contributions
vernment-2.146
 Aval-2.112
herent-1.997
 emancipation-1.994
Desk-1.981
Token\n
Token ID198
Feature activation+1.48e+00

Pos logit contributions
WARNING+0.898
UC+0.880
owship+0.865
±+0.844
 precaution+0.828
Neg logit contributions
vernment-1.126
 Aval-1.097
 emancipation-1.056
herent-1.043
ivery-1.031
TokenHer
Token ID9360
Feature activation+1.92e+00

Pos logit contributions
WARNING+1.016
owship+1.011
UC+0.997
 precaution+0.969
±+0.960
Neg logit contributions
vernment-1.048
 Aval-1.037
 emancipation-0.984
herent-0.984
Desk-0.974
Token relative
Token ID3585
Feature activation-1.35e+00

Pos logit contributions
WARNING+1.415
owship+1.399
UC+1.389
 precaution+1.346
±+1.339
Neg logit contributions
vernment-1.285
 Aval-1.268
 emancipation-1.200
herent-1.198
Desk-1.181
Token was
Token ID373
Feature activation+3.08e+00

Pos logit contributions
 Aval+0.911
vernment+0.883
 emancipation+0.844
herent+0.829
Desk+0.823
Neg logit contributions
owship-1.005
WARNING-0.993
UC-0.971
±-0.937
 precaution-0.930
Token also
Token ID635
Feature activation+3.97e+00

Pos logit contributions
WARNING+2.152
owship+2.118
UC+2.113
 precaution+2.050
±+2.028
Neg logit contributions
vernment-2.170
 Aval-2.127
 emancipation-2.029
herent-2.020
Desk-1.994
Token hit
Token ID2277
Feature activation+6.59e+00

Pos logit contributions
WARNING+2.533
UC+2.483
owship+2.480
 precaution+2.406
±+2.368
Neg logit contributions
vernment-3.047
 Aval-2.980
 emancipation-2.860
herent-2.843
Desk-2.811
Token with
Token ID351
Feature activation+3.25e+00

Pos logit contributions
WARNING+4.527
UC+4.518
owship+4.418
 precaution+4.363
±+4.268
Neg logit contributions
vernment-4.592
 Aval-4.413
 emancipation-4.287
herent-4.246
Desk-4.154
Token the
Token ID262
Feature activation+1.37e+00

Pos logit contributions
WARNING+2.454
owship+2.416
UC+2.406
 precaution+2.348
±+2.320
Neg logit contributions
vernment-2.107
 Aval-2.062
 emancipation-1.961
herent-1.960
Desk-1.928
Token item
Token ID2378
Feature activation+8.84e-01

Pos logit contributions
WARNING+1.144
owship+1.133
UC+1.125
 precaution+1.094
±+1.090
Neg logit contributions
vernment-0.773
 Aval-0.762
 emancipation-0.714
herent-0.712
Desk-0.700
Token and
Token ID290
Feature activation-5.82e-01

Pos logit contributions
WARNING+0.665
owship+0.659
UC+0.653
 precaution+0.632
±+0.630
Neg logit contributions
vernment-0.574
 Aval-0.568
 emancipation-0.536
herent-0.535
Desk-0.527
Token suffered
Token ID6989
Feature activation+1.00e+00

Pos logit contributions
 Aval+0.403
vernment+0.395
 emancipation+0.376
herent+0.372
Desk+0.371
Neg logit contributions
owship-0.419
WARNING-0.412
UC-0.406
±-0.392
��-0.387
Token powerful
Token ID3665
Feature activation+1.38e+00

Pos logit contributions
owship+0.378
WARNING+0.376
UC+0.371
±+0.357
 precaution+0.354
Neg logit contributions
vernment-0.319
 Aval-0.318
 emancipation-0.302
herent-0.300
Desk-0.294
Token combination
Token ID6087
Feature activation+1.02e+00

Pos logit contributions
WARNING+1.041
owship+1.032
UC+1.023
±+0.986
 precaution+0.986
Neg logit contributions
vernment-0.895
 Aval-0.887
 emancipation-0.841
herent-0.837
Desk-0.824
Token to
Token ID284
Feature activation-6.99e-01

Pos logit contributions
WARNING+0.761
owship+0.753
UC+0.748
 precaution+0.724
±+0.721
Neg logit contributions
vernment-0.665
 Aval-0.657
 emancipation-0.621
herent-0.619
Desk-0.611
Token develop
Token ID1205
Feature activation-1.24e+00

Pos logit contributions
 Aval+0.448
vernment+0.447
 emancipation+0.427
herent+0.422
Desk+0.413
Neg logit contributions
WARNING-0.526
owship-0.523
UC-0.518
±-0.500
 precaution-0.497
Token mental
Token ID5110
Feature activation+2.74e+00

Pos logit contributions
 Aval+0.766
vernment+0.762
 emancipation+0.733
herent+0.721
Desk+0.703
Neg logit contributions
WARNING-0.967
owship-0.955
UC-0.943
��-0.912
±-0.908
Token and
Token ID290
Feature activation+3.12e+00

Pos logit contributions
WARNING+1.792
owship+1.768
UC+1.750
±+1.679
 precaution+1.671
Neg logit contributions
vernment-2.044
 Aval-2.029
 emancipation-1.951
herent-1.928
Desk-1.908
Token physical
Token ID3518
Feature activation+1.76e+00

Pos logit contributions
WARNING+1.889
owship+1.866
UC+1.848
 precaution+1.782
±+1.768
Neg logit contributions
vernment-2.495
 Aval-2.465
 emancipation-2.351
herent-2.347
Desk-2.324
Token ability
Token ID2694
Feature activation-5.01e-02

Pos logit contributions
WARNING+1.121
owship+1.101
UC+1.092
±+1.047
 precaution+1.037
Neg logit contributions
vernment-1.346
 Aval-1.342
 emancipation-1.300
herent-1.281
Desk-1.262
Token in
Token ID287
Feature activation+1.77e+00

Pos logit contributions
vernment+0.031
 Aval+0.031
 emancipation+0.029
herent+0.029
Desk+0.028
Neg logit contributions
WARNING-0.039
owship-0.039
UC-0.038
 precaution-0.037
±-0.037
Token putting
Token ID5137
Feature activation+6.54e-01

Pos logit contributions
WARNING+1.335
owship+1.321
UC+1.311
 precaution+1.269
±+1.264
Neg logit contributions
vernment-1.148
 Aval-1.136
 emancipation-1.073
herent-1.071
Desk-1.054
Token accuracy
Token ID9922
Feature activation+1.60e+00

Pos logit contributions
WARNING+0.493
owship+0.488
UC+0.484
 precaution+0.467
±+0.466
Neg logit contributions
vernment-0.423
 Aval-0.419
 emancipation-0.396
herent-0.395
Desk-0.389
Token
Token ID447
Feature activation+6.50e-02

Pos logit contributions
vernment+0.850
 Aval+0.846
herent+0.803
 emancipation+0.797
Desk+0.792
Neg logit contributions
owship-0.868
WARNING-0.866
UC-0.847
 precaution-0.830
±-0.812
Token
Token ID247
Feature activation+7.53e-01

Pos logit contributions
WARNING+0.041
owship+0.040
UC+0.040
 precaution+0.038
±+0.037
Neg logit contributions
vernment-0.051
 Aval-0.050
herent-0.048
 emancipation-0.048
Desk-0.047
Tokent
Token ID83
Feature activation+3.42e-01

Pos logit contributions
WARNING+0.525
owship+0.516
UC+0.511
 precaution+0.496
±+0.490
Neg logit contributions
vernment-0.536
 Aval-0.527
herent-0.502
 emancipation-0.500
Desk-0.491
Token play
Token ID711
Feature activation+3.11e+00

Pos logit contributions
WARNING+0.249
owship+0.247
UC+0.245
 precaution+0.236
±+0.235
Neg logit contributions
vernment-0.230
 Aval-0.228
 emancipation-0.216
herent-0.215
Desk-0.212
Token that
Token ID326
Feature activation+5.82e+00

Pos logit contributions
WARNING+2.357
owship+2.326
UC+2.312
 precaution+2.263
±+2.239
Neg logit contributions
vernment-1.999
 Aval-1.966
 emancipation-1.861
herent-1.857
Desk-1.828
Token game
Token ID983
Feature activation+5.37e+00

Pos logit contributions
WARNING+3.436
 precaution+3.367
UC+3.335
owship+3.301
±+3.222
Neg logit contributions
vernment-4.651
 Aval-4.537
herent-4.365
 emancipation-4.348
Desk-4.309
Token,
Token ID11
Feature activation+1.02e+00

Pos logit contributions
WARNING+3.794
UC+3.758
owship+3.705
 precaution+3.668
±+3.627
Neg logit contributions
vernment-3.627
 Aval-3.562
 emancipation-3.410
herent-3.399
Desk-3.347
Token the
Token ID262
Feature activation+5.05e-01

Pos logit contributions
WARNING+0.797
owship+0.790
UC+0.783
 precaution+0.759
±+0.756
Neg logit contributions
vernment-0.633
 Aval-0.626
 emancipation-0.590
herent-0.588
Desk-0.579
Token price
Token ID2756
Feature activation+3.93e+00

Pos logit contributions
WARNING+0.402
owship+0.399
UC+0.393
±+0.380
 precaution+0.378
Neg logit contributions
vernment-0.308
 Aval-0.305
 emancipation-0.288
herent-0.285
Desk-0.280
Token impact
Token ID2928
Feature activation+3.51e+00

Pos logit contributions
WARNING+2.790
owship+2.757
UC+2.756
 precaution+2.669
±+2.647
Neg logit contributions
vernment-2.712
 Aval-2.662
herent-2.532
 emancipation-2.522
Desk-2.486
Token would
Token ID561
Feature activation-1.11e+00

Pos logit contributions
WARNING+2.596
UC+2.567
owship+2.559
 precaution+2.491
±+2.473
Neg logit contributions
vernment-2.310
 Aval-2.264
herent-2.149
 emancipation-2.148
Desk-2.119
Token a
Token ID257
Feature activation+4.82e-01

Pos logit contributions
vernment+1.586
 Aval+1.574
 emancipation+1.529
herent+1.496
Desk+1.428
Neg logit contributions
WARNING-1.899
owship-1.876
UC-1.841
��-1.774
±-1.761
Token powerful
Token ID3665
Feature activation+1.15e+00

Pos logit contributions
WARNING+0.353
owship+0.349
UC+0.345
±+0.333
 precaution+0.332
Neg logit contributions
vernment-0.324
 Aval-0.321
 emancipation-0.305
herent-0.303
Desk-0.298
Token anti
Token ID3098
Feature activation-3.09e+00

Pos logit contributions
WARNING+0.817
owship+0.808
UC+0.801
 precaution+0.772
±+0.771
Neg logit contributions
vernment-0.791
 Aval-0.783
 emancipation-0.744
herent-0.740
Desk-0.729
Token-
Token ID12
Feature activation-5.81e-01

Pos logit contributions
vernment+1.824
 Aval+1.794
herent+1.700
 emancipation+1.696
Desk+1.642
Neg logit contributions
owship-2.508
WARNING-2.492
UC-2.430
 precaution-2.376
±-2.356
Tokenwar
Token ID5767
Feature activation-2.10e+00

Pos logit contributions
vernment+0.424
 Aval+0.417
herent+0.400
 emancipation+0.398
Desk+0.394
Neg logit contributions
owship-0.391
WARNING-0.389
UC-0.378
 precaution-0.370
±-0.366
Token,
Token ID11
Feature activation+1.93e+00

Pos logit contributions
vernment+1.290
 Aval+1.256
 emancipation+1.223
herent+1.191
Desk+1.157
Neg logit contributions
WARNING-1.662
owship-1.645
UC-1.622
±-1.575
 precaution-1.552
Token anti
Token ID3098
Feature activation-5.29e-01

Pos logit contributions
WARNING+1.198
owship+1.182
UC+1.173
 precaution+1.130
±+1.123
Neg logit contributions
vernment-1.505
 Aval-1.489
 emancipation-1.419
herent-1.418
Desk-1.403
Token-
Token ID12
Feature activation-1.09e+00

Pos logit contributions
vernment+0.332
 Aval+0.327
 emancipation+0.309
herent+0.309
Desk+0.303
Neg logit contributions
WARNING-0.410
owship-0.408
UC-0.402
 precaution-0.391
±-0.389
Tokennuclear
Token ID43131
Feature activation+1.92e-01

Pos logit contributions
vernment+0.724
 Aval+0.709
herent+0.681
 emancipation+0.677
Desk+0.669
Neg logit contributions
owship-0.797
WARNING-0.792
UC-0.769
 precaution-0.756
±-0.749
Token,
Token ID11
Feature activation+2.13e+00

Pos logit contributions
WARNING+0.146
owship+0.144
UC+0.142
±+0.138
 precaution+0.137
Neg logit contributions
vernment-0.125
 Aval-0.123
 emancipation-0.117
herent-0.116
Desk-0.114
Token peace
Token ID4167
Feature activation+1.58e+00

Pos logit contributions
WARNING+1.324
owship+1.307
UC+1.296
 precaution+1.250
±+1.241
Neg logit contributions
vernment-1.655
 Aval-1.639
 emancipation-1.561
herent-1.560
Desk-1.544
Tokenoint
Token ID1563
Feature activation+6.16e+00

Pos logit contributions
WARNING+2.535
UC+2.520
owship+2.507
±+2.403
 precaution+2.401
Neg logit contributions
vernment-2.004
 Aval-1.993
 emancipation-1.870
herent-1.856
Desk-1.823
Tokenly
Token ID306
Feature activation+4.03e+00

Pos logit contributions
WARNING+4.489
owship+4.439
UC+4.411
 precaution+4.300
±+4.261
Neg logit contributions
vernment-4.043
 Aval-4.006
herent-3.773
 emancipation-3.766
Desk-3.727
Token organized
Token ID8389
Feature activation-3.47e-01

Pos logit contributions
WARNING+2.821
owship+2.787
UC+2.767
 precaution+2.679
±+2.665
Neg logit contributions
vernment-2.820
 Aval-2.793
 emancipation-2.649
herent-2.647
Desk-2.613
Token by
Token ID416
Feature activation+1.51e+00

Pos logit contributions
vernment+0.217
 Aval+0.213
 emancipation+0.201
herent+0.196
Desk+0.195
Neg logit contributions
WARNING-0.272
owship-0.269
UC-0.266
±-0.258
 precaution-0.257
Token Zionist
Token ID29390
Feature activation+2.63e+00

Pos logit contributions
WARNING+1.165
owship+1.154
UC+1.143
 precaution+1.108
±+1.104
Neg logit contributions
vernment-0.955
 Aval-0.943
 emancipation-0.889
herent-0.884
Desk-0.871
Token youth
Token ID6205
Feature activation+1.34e+00

Pos logit contributions
WARNING+2.007
owship+1.986
UC+1.971
 precaution+1.909
±+1.903
Neg logit contributions
vernment-1.685
 Aval-1.664
 emancipation-1.573
herent-1.567
Desk-1.545
Token movement
Token ID3356
Feature activation+1.65e+00

Pos logit contributions
WARNING+0.928
owship+0.917
UC+0.905
 precaution+0.878
±+0.877
Neg logit contributions
vernment-0.949
 Aval-0.938
 emancipation-0.896
herent-0.888
Desk-0.877
Token Ha
Token ID9398
Feature activation+4.03e+00

Pos logit contributions
WARNING+1.247
owship+1.235
UC+1.222
 precaution+1.186
±+1.181
Neg logit contributions
vernment-1.075
 Aval-1.066
 emancipation-1.006
herent-0.999
Desk-0.985
TokenNo
Token ID2949
Feature activation-5.06e+00

Pos logit contributions
WARNING+2.812
UC+2.805
owship+2.712
±+2.649
 precaution+2.618
Neg logit contributions
vernment-2.814
 Aval-2.793
 emancipation-2.660
herent-2.621
ivery-2.586
Tokenar
Token ID283
Feature activation+7.86e-01

Pos logit contributions
vernment+3.579
 Aval+3.552
 emancipation+3.403
herent+3.384
Desk+3.367
Neg logit contributions
owship-3.424
WARNING-3.377
UC-3.246
 precaution-3.213
 UCH-3.182
Token Ha
Token ID9398
Feature activation+1.28e+00

Pos logit contributions
WARNING+0.597
owship+0.590
UC+0.585
 precaution+0.567
±+0.565
Neg logit contributions
vernment-0.508
 Aval-0.502
 emancipation-0.474
herent-0.472
Desk-0.466
Token Rebecca
Token ID23489
Feature activation+1.53e+00

Pos logit contributions
WARNING+0.238
owship+0.236
UC+0.233
 precaution+0.226
±+0.225
Neg logit contributions
vernment-0.204
 Aval-0.202
 emancipation-0.190
herent-0.190
Desk-0.187
Token H
Token ID367
Feature activation-1.21e+00

Pos logit contributions
WARNING+1.142
owship+1.133
UC+1.126
 precaution+1.089
±+1.089
Neg logit contributions
vernment-0.994
 Aval-0.974
 emancipation-0.929
herent-0.925
Desk-0.913
Tokenower
Token ID789
Feature activation+6.71e-01

Pos logit contributions
vernment+0.875
 Aval+0.864
herent+0.821
 emancipation+0.817
Desk+0.809
Neg logit contributions
WARNING-0.821
owship-0.806
UC-0.796
 precaution-0.779
±-0.768
Tokenton
Token ID1122
Feature activation+1.40e+00

Pos logit contributions
WARNING+0.487
owship+0.481
UC+0.478
 precaution+0.462
±+0.462
Neg logit contributions
vernment-0.454
 Aval-0.449
 emancipation-0.426
herent-0.423
Desk-0.419
Token,
Token ID11
Feature activation+3.81e-01

Pos logit contributions
WARNING+1.029
owship+1.018
UC+1.012
 precaution+0.979
±+0.975
Neg logit contributions
vernment-0.935
 Aval-0.923
 emancipation-0.875
herent-0.873
Desk-0.860
Token Sam
Token ID3409
Feature activation+1.72e+00

Pos logit contributions
WARNING+0.290
owship+0.287
UC+0.284
 precaution+0.275
±+0.274
Neg logit contributions
vernment-0.245
 Aval-0.244
 emancipation-0.229
herent-0.229
Desk-0.225
Token P
Token ID350
Feature activation-3.19e+00

Pos logit contributions
WARNING+1.294
owship+1.281
UC+1.275
 precaution+1.232
±+1.229
Neg logit contributions
vernment-1.115
 Aval-1.100
 emancipation-1.042
herent-1.037
Desk-1.023
Tokenanche
Token ID6362
Feature activation-1.96e-01

Pos logit contributions
vernment+2.125
 Aval+2.054
herent+1.981
ivery+1.951
 emancipation+1.937
Neg logit contributions
WARNING-2.373
owship-2.307
UC-2.252
 precaution-2.244
±-2.201
Tokenvre
Token ID43933
Feature activation+3.21e+00

Pos logit contributions
vernment+0.143
 Aval+0.143
 emancipation+0.135
herent+0.135
Desk+0.132
Neg logit contributions
owship-0.132
WARNING-0.132
UC-0.129
 precaution-0.125
±-0.123
Token (
Token ID357
Feature activation+1.65e+00

Pos logit contributions
WARNING+2.278
UC+2.259
owship+2.248
±+2.176
 precaution+2.173
Neg logit contributions
vernment-2.184
 Aval-2.149
 emancipation-2.064
herent-2.048
Desk-2.018
TokenSam
Token ID16305
Feature activation+5.82e-01

Pos logit contributions
WARNING+1.241
owship+1.226
UC+1.220
 precaution+1.179
±+1.176
Neg logit contributions
vernment-1.070
 Aval-1.057
 emancipation-0.999
herent-0.995
Desk-0.980
Token he
Token ID339
Feature activation+1.33e+00

Pos logit contributions
vernment+0.156
 Aval+0.155
 emancipation+0.147
herent+0.146
Desk+0.142
Neg logit contributions
WARNING-0.189
owship-0.188
UC-0.185
 precaution-0.178
±-0.178
Token had
Token ID550
Feature activation+1.67e+00

Pos logit contributions
WARNING+1.041
owship+1.031
UC+1.021
 precaution+0.989
±+0.985
Neg logit contributions
vernment-0.820
 Aval-0.811
 emancipation-0.764
herent-0.761
Desk-0.748
Token called
Token ID1444
Feature activation+1.05e+00

Pos logit contributions
WARNING+1.187
owship+1.173
UC+1.165
 precaution+1.129
±+1.123
Neg logit contributions
vernment-1.152
 Aval-1.137
 emancipation-1.078
herent-1.076
Desk-1.063
Token the
Token ID262
Feature activation-1.95e+00

Pos logit contributions
WARNING+0.805
owship+0.797
UC+0.790
 precaution+0.764
±+0.762
Neg logit contributions
vernment-0.666
 Aval-0.660
 emancipation-0.623
herent-0.620
Desk-0.611
Token tower
Token ID10580
Feature activation-2.03e-01

Pos logit contributions
 Aval+1.292
vernment+1.280
 emancipation+1.216
herent+1.205
Desk+1.180
Neg logit contributions
WARNING-1.432
owship-1.428
UC-1.393
±-1.346
 precaution-1.331
Token during
Token ID1141
Feature activation-1.05e+00

Pos logit contributions
vernment+0.130
 Aval+0.130
 emancipation+0.122
herent+0.121
Desk+0.120
Neg logit contributions
WARNING-0.154
owship-0.152
UC-0.151
±-0.145
 precaution-0.144
Token my
Token ID616
Feature activation+1.76e+00

Pos logit contributions
 Aval+0.646
vernment+0.645
 emancipation+0.610
herent+0.608
Desk+0.590
Neg logit contributions
WARNING-0.827
owship-0.822
UC-0.811
±-0.775
 precaution-0.768
Token takeoff
Token ID45306
Feature activation+2.49e+00

Pos logit contributions
WARNING+1.252
owship+1.239
UC+1.228
 precaution+1.184
±+1.181
Neg logit contributions
vernment-1.221
 Aval-1.210
 emancipation-1.149
herent-1.143
Desk-1.128
Token roll
Token ID4836
Feature activation-3.02e-01

Pos logit contributions
WARNING+1.814
owship+1.790
UC+1.781
 precaution+1.738
±+1.722
Neg logit contributions
vernment-1.673
 Aval-1.647
herent-1.555
 emancipation-1.552
Desk-1.534
Token and
Token ID290
Feature activation-1.45e+00

Pos logit contributions
vernment+0.194
 Aval+0.192
 emancipation+0.183
herent+0.182
Desk+0.180
Neg logit contributions
WARNING-0.228
owship-0.226
UC-0.224
±-0.215
 precaution-0.214
Token told
Token ID1297
Feature activation-1.54e+00

Pos logit contributions
 Aval+0.966
vernment+0.955
 emancipation+0.915
herent+0.899
Desk+0.884
Neg logit contributions
owship-1.066
WARNING-1.063
UC-1.047
±-0.994
��-0.986
Tokens
Token ID82
Feature activation+2.02e-02

Pos logit contributions
vernment+0.681
 Aval+0.673
herent+0.642
 emancipation+0.628
Desk+0.619
Neg logit contributions
WARNING-0.866
owship-0.850
UC-0.840
 precaution-0.820
±-0.810
Token Ship
Token ID16656
Feature activation+9.84e-01

Pos logit contributions
WARNING+0.015
owship+0.015
UC+0.015
 precaution+0.014
±+0.014
Neg logit contributions
 Aval-0.013
vernment-0.013
 emancipation-0.013
herent-0.013
Desk-0.012
Token of
Token ID286
Feature activation+4.24e-01

Pos logit contributions
WARNING+0.728
owship+0.720
UC+0.713
 precaution+0.691
±+0.687
Neg logit contributions
vernment-0.652
 Aval-0.646
 emancipation-0.612
herent-0.612
Desk-0.602
Token the
Token ID262
Feature activation+2.28e-01

Pos logit contributions
WARNING+0.329
owship+0.325
UC+0.324
 precaution+0.313
±+0.311
Neg logit contributions
 Aval-0.267
vernment-0.264
 emancipation-0.248
herent-0.247
ivery-0.241
Token Year
Token ID6280
Feature activation-2.02e+00

Pos logit contributions
WARNING+0.174
owship+0.173
UC+0.171
 precaution+0.166
±+0.165
Neg logit contributions
 Aval-0.146
vernment-0.146
 emancipation-0.136
herent-0.136
Desk-0.134
Token title
Token ID3670
Feature activation-1.43e+00

Pos logit contributions
vernment+1.247
 Aval+1.243
herent+1.169
 emancipation+1.164
Desk+1.136
Neg logit contributions
WARNING-1.578
owship-1.570
UC-1.533
 precaution-1.479
±-1.478
Token at
Token ID379
Feature activation-2.39e+00

Pos logit contributions
 Aval+0.866
vernment+0.864
herent+0.812
 emancipation+0.808
Desk+0.794
Neg logit contributions
WARNING-1.136
owship-1.124
UC-1.103
±-1.069
 precaution-1.068
Token their
Token ID511
Feature activation-3.11e+00

Pos logit contributions
vernment+1.440
 Aval+1.433
 emancipation+1.372
herent+1.355
ivery+1.308
Neg logit contributions
WARNING-1.879
owship-1.856
UC-1.833
±-1.773
 precaution-1.768
Token feet
Token ID3625
Feature activation-5.25e+00

Pos logit contributions
 Aval+1.969
vernment+1.964
 emancipation+1.916
herent+1.843
ivery+1.821
Neg logit contributions
WARNING-2.340
owship-2.331
UC-2.272
 precaution-2.193
±-2.182
Token.
Token ID13
Feature activation-6.89e-02

Pos logit contributions
vernment+3.007
 Aval+2.970
herent+2.773
 emancipation+2.768
ivery+2.725
Neg logit contributions
owship-4.254
WARNING-4.247
UC-4.123
 precaution-4.012
 UCH-3.997
Token Twice
Token ID41217
Feature activation-3.32e-01

Pos logit contributions
 Aval+0.046
vernment+0.046
herent+0.043
 emancipation+0.043
Desk+0.043
Neg logit contributions
owship-0.050
WARNING-0.049
UC-0.049
 precaution-0.048
±-0.047
Token\n
Token ID198
Feature activation-1.23e+00

Pos logit contributions
 Aval+1.565
vernment+1.531
herent+1.478
Desk+1.472
 emancipation+1.457
Neg logit contributions
owship-1.692
 suspic-1.630
WARNING-1.623
 precaution-1.610
UC-1.588
TokenSign
Token ID11712
Feature activation+2.26e-01

Pos logit contributions
vernment+0.875
 Aval+0.872
Desk+0.839
herent+0.838
 emancipation+0.833
Neg logit contributions
owship-0.833
 precaution-0.796
WARNING-0.795
UC-0.776
��-0.760
Tokening
Token ID278
Feature activation+6.21e-01

Pos logit contributions
WARNING+0.175
owship+0.173
UC+0.171
±+0.166
 precaution+0.166
Neg logit contributions
 Aval-0.141
vernment-0.141
 emancipation-0.133
herent-0.133
Desk-0.131
Token Day
Token ID3596
Feature activation-2.79e+00

Pos logit contributions
WARNING+0.478
owship+0.472
UC+0.465
±+0.450
 precaution+0.450
Neg logit contributions
 Aval-0.392
vernment-0.390
 emancipation-0.373
herent-0.365
ivery-0.359
Token is
Token ID318
Feature activation-3.73e+00

Pos logit contributions
 Aval+1.880
vernment+1.834
 emancipation+1.786
herent+1.758
Desk+1.707
Neg logit contributions
WARNING-2.018
owship-2.010
UC-1.951
±-1.880
 precaution-1.878
Token just
Token ID655
Feature activation-4.27e+00

Pos logit contributions
 Aval+2.208
vernment+2.170
 emancipation+2.089
herent+2.046
ivery+1.966
Neg logit contributions
owship-3.003
WARNING-2.998
UC-2.928
 precaution-2.818
±-2.802
Token four
Token ID1440
Feature activation-1.96e+00

Pos logit contributions
 Aval+2.709
vernment+2.648
 emancipation+2.570
herent+2.534
Desk+2.446
Neg logit contributions
owship-3.242
WARNING-3.206
UC-3.161
 precaution-2.982
��-2.971
Token days
Token ID1528
Feature activation-1.13e+00

Pos logit contributions
 Aval+1.569
vernment+1.565
 emancipation+1.514
herent+1.482
ivery+1.458
Neg logit contributions
WARNING-1.165
owship-1.142
UC-1.118
±-1.060
��-1.049
Token away
Token ID1497
Feature activation-8.55e-01

Pos logit contributions
vernment+0.774
 Aval+0.772
 emancipation+0.735
herent+0.721
ivery+0.708
Neg logit contributions
WARNING-0.806
owship-0.802
UC-0.788
 precaution-0.755
±-0.755
Token,
Token ID11
Feature activation-8.66e-01

Pos logit contributions
vernment+0.544
 Aval+0.540
 emancipation+0.508
herent+0.504
Desk+0.496
Neg logit contributions
WARNING-0.653
owship-0.650
UC-0.639
 precaution-0.622
±-0.617
Token and
Token ID290
Feature activation+7.63e-03

Pos logit contributions
 Aval+0.546
vernment+0.543
 emancipation+0.517
herent+0.507
Desk+0.495
Neg logit contributions
WARNING-0.665
owship-0.663
UC-0.647
 precaution-0.632
±-0.628
Token Big
Token ID4403
Feature activation+6.14e-01

Pos logit contributions
WARNING+0.660
owship+0.655
UC+0.647
 precaution+0.626
±+0.625
Neg logit contributions
vernment-0.585
 Aval-0.580
 emancipation-0.549
herent-0.545
Desk-0.538
Token Spot
Token ID15899
Feature activation-4.29e+00

Pos logit contributions
WARNING+0.480
owship+0.475
UC+0.471
 precaution+0.457
±+0.455
Neg logit contributions
vernment-0.382
 Aval-0.378
 emancipation-0.355
herent-0.355
Desk-0.350
Token store
Token ID3650
Feature activation+2.72e-01

Pos logit contributions
 Aval+2.616
vernment+2.588
 emancipation+2.424
herent+2.424
Desk+2.407
Neg logit contributions
WARNING-3.326
owship-3.320
UC-3.200
±-3.126
 precaution-3.119
Token,
Token ID11
Feature activation+2.44e+00

Pos logit contributions
WARNING+0.209
owship+0.208
UC+0.204
 precaution+0.198
±+0.197
Neg logit contributions
vernment-0.172
 Aval-0.171
 emancipation-0.161
herent-0.161
Desk-0.159
Token where
Token ID810
Feature activation+1.16e-01

Pos logit contributions
WARNING+1.843
owship+1.819
UC+1.814
 precaution+1.758
±+1.748
Neg logit contributions
vernment-1.575
 Aval-1.549
 emancipation-1.467
herent-1.466
Desk-1.444
Token you
Token ID345
Feature activation-6.42e-01

Pos logit contributions
WARNING+0.087
owship+0.087
UC+0.085
±+0.083
 precaution+0.083
Neg logit contributions
vernment-0.075
 Aval-0.075
 emancipation-0.070
herent-0.070
Desk-0.069
Token
Token ID447
Feature activation-1.08e+00

Pos logit contributions
vernment+0.374
 Aval+0.370
herent+0.344
 emancipation+0.343
Desk+0.337
Neg logit contributions
owship-0.526
WARNING-0.526
UC-0.514
 precaution-0.503
±-0.498
Token
Token ID247
Feature activation+1.03e+00

Pos logit contributions
vernment+0.966
 Aval+0.958
herent+0.930
 emancipation+0.918
Desk+0.906
Neg logit contributions
WARNING-0.553
owship-0.545
UC-0.527
 precaution-0.517
±-0.490
Tokenll
Token ID297
Feature activation-1.81e+00

Pos logit contributions
WARNING+0.715
owship+0.707
UC+0.699
 precaution+0.681
±+0.671
Neg logit contributions
vernment-0.731
 Aval-0.723
herent-0.686
 emancipation-0.685
Desk-0.674
Token see
Token ID766
Feature activation+2.48e+00

Pos logit contributions
 Aval+1.056
vernment+1.046
herent+0.980
 emancipation+0.971
ivery+0.955
Neg logit contributions
WARNING-1.467
owship-1.465
UC-1.439
 precaution-1.394
±-1.385
Token the
Token ID262
Feature activation-8.44e-01

Pos logit contributions
WARNING+1.873
owship+1.848
UC+1.843
 precaution+1.788
±+1.777
Neg logit contributions
vernment-1.596
 Aval-1.568
herent-1.486
 emancipation-1.484
Desk-1.464
Token family
Token ID1641
Feature activation+2.83e+00

Pos logit contributions
vernment+1.893
 Aval+1.845
 emancipation+1.764
herent+1.745
ivery+1.727
Neg logit contributions
WARNING-2.259
owship-2.244
UC-2.199
 precaution-2.139
±-2.128
Token owned
Token ID6898
Feature activation-1.74e+00

Pos logit contributions
WARNING+2.076
owship+2.054
UC+2.039
 precaution+1.974
±+1.965
Neg logit contributions
vernment-1.892
 Aval-1.870
 emancipation-1.768
herent-1.765
Desk-1.741
Token Se
Token ID1001
Feature activation-1.07e+00

Pos logit contributions
vernment+1.129
 Aval+1.125
 emancipation+1.071
herent+1.061
Desk+1.038
Neg logit contributions
WARNING-1.308
owship-1.302
UC-1.279
 precaution-1.225
±-1.217
Tokengu
Token ID5162
Feature activation-1.35e+00

Pos logit contributions
vernment+0.778
 Aval+0.760
herent+0.731
 emancipation+0.724
Desk+0.716
Neg logit contributions
WARNING-0.725
owship-0.714
UC-0.709
 precaution-0.687
±-0.681
Tokenin
Token ID259
Feature activation+6.91e-01

Pos logit contributions
vernment+0.855
 Aval+0.841
herent+0.797
 emancipation+0.793
Desk+0.782
Neg logit contributions
WARNING-1.032
owship-1.023
UC-1.012
 precaution-0.985
±-0.977
Token
Token ID447
Feature activation-9.85e-03

Pos logit contributions
WARNING+0.546
owship+0.541
UC+0.535
 precaution+0.520
±+0.517
Neg logit contributions
vernment-0.425
 Aval-0.420
 emancipation-0.394
herent-0.394
Desk-0.387
Token
Token ID247
Feature activation-1.95e+00

Pos logit contributions
vernment+0.009
 Aval+0.009
herent+0.009
 emancipation+0.009
Desk+0.009
Neg logit contributions
WARNING-0.005
owship-0.005
UC-0.005
 precaution-0.004
±-0.004
Tokens
Token ID82
Feature activation-2.02e-01

Pos logit contributions
vernment+1.236
 Aval+1.216
herent+1.155
 emancipation+1.149
Desk+1.129
Neg logit contributions
WARNING-1.504
owship-1.498
UC-1.469
 precaution-1.428
±-1.406
Token
Token ID564
Feature activation-8.30e-01

Pos logit contributions
vernment+0.134
 Aval+0.132
 emancipation+0.126
herent+0.125
Desk+0.123
Neg logit contributions
WARNING-0.150
owship-0.149
UC-0.147
 precaution-0.141
±-0.141
Token
Token ID250
Feature activation-2.07e+00

Pos logit contributions
vernment+0.638
 Aval+0.633
herent+0.607
 emancipation+0.601
Desk+0.594
Neg logit contributions
WARNING-0.525
owship-0.520
UC-0.508
 precaution-0.496
±-0.483
Tokendist
Token ID17080
Feature activation-3.98e-01

Pos logit contributions
vernment+1.436
 Aval+1.391
herent+1.357
Desk+1.344
 emancipation+1.335
Neg logit contributions
owship-1.472
WARNING-1.449
UC-1.414
 precaution-1.404
 UCH-1.364
Token was
Token ID373
Feature activation+2.37e+00

Pos logit contributions
WARNING+0.181
owship+0.179
UC+0.177
 precaution+0.171
±+0.171
Neg logit contributions
vernment-0.161
 Aval-0.160
 emancipation-0.151
herent-0.150
Desk-0.148
Token soft
Token ID2705
Feature activation-8.86e-01

Pos logit contributions
WARNING+1.762
owship+1.741
UC+1.731
 precaution+1.682
±+1.672
Neg logit contributions
vernment-1.566
 Aval-1.543
 emancipation-1.459
herent-1.458
Desk-1.439
Token.
Token ID13
Feature activation+7.75e-01

Pos logit contributions
vernment+0.559
 Aval+0.554
herent+0.522
 emancipation+0.522
Desk+0.514
Neg logit contributions
WARNING-0.681
owship-0.676
UC-0.667
 precaution-0.647
±-0.645
Token Them
Token ID26460
Feature activation+2.44e+00

Pos logit contributions
WARNING+0.585
owship+0.583
UC+0.575
 precaution+0.560
±+0.554
Neg logit contributions
 Aval-0.497
vernment-0.497
herent-0.465
 emancipation-0.464
Desk-0.460
Token is
Token ID318
Feature activation+7.66e-01

Pos logit contributions
WARNING+1.869
owship+1.849
UC+1.838
 precaution+1.784
±+1.775
Neg logit contributions
vernment-1.547
 Aval-1.525
 emancipation-1.439
herent-1.436
Desk-1.417
Token straight
Token ID3892
Feature activation-6.07e+00

Pos logit contributions
WARNING+0.594
owship+0.587
UC+0.583
±+0.560
 precaution+0.560
Neg logit contributions
vernment-0.480
 Aval-0.478
 emancipation-0.451
herent-0.450
Desk-0.439
Token facts
Token ID6419
Feature activation-8.40e-01

Pos logit contributions
 Aval+3.666
vernment+3.490
 emancipation+3.441
herent+3.257
Desk+3.239
Neg logit contributions
WARNING-4.738
owship-4.689
UC-4.685
 UCH-4.456
��-4.448
Token.
Token ID13
Feature activation+1.47e+00

Pos logit contributions
vernment+0.516
 Aval+0.515
 emancipation+0.484
herent+0.480
Desk+0.470
Neg logit contributions
WARNING-0.661
owship-0.657
UC-0.644
 precaution-0.625
±-0.623
Token He
Token ID679
Feature activation-4.39e-01

Pos logit contributions
WARNING+1.111
owship+1.102
UC+1.091
 precaution+1.059
±+1.052
Neg logit contributions
vernment-0.955
 Aval-0.947
 emancipation-0.892
herent-0.890
Desk-0.878
Token just
Token ID655
Feature activation-9.17e-01

Pos logit contributions
vernment+0.288
 Aval+0.286
 emancipation+0.271
herent+0.269
Desk+0.264
Neg logit contributions
WARNING-0.328
owship-0.324
UC-0.320
 precaution-0.308
±-0.308
Token couldn
Token ID3521
Feature activation-6.38e-01

Pos logit contributions
vernment+0.601
 Aval+0.598
 emancipation+0.567
herent+0.564
Desk+0.550
Neg logit contributions
WARNING-0.685
owship-0.681
UC-0.671
±-0.640
 precaution-0.637
Tokenman
Token ID805
Feature activation-1.32e+00

Pos logit contributions
WARNING+0.873
owship+0.866
UC+0.857
 precaution+0.831
±+0.827
Neg logit contributions
vernment-0.768
 Aval-0.761
 emancipation-0.718
herent-0.716
Desk-0.706
Tokeneu
Token ID12496
Feature activation+3.37e+00

Pos logit contributions
vernment+0.581
 Aval+0.566
herent+0.517
 emancipation+0.516
ivery+0.508
Neg logit contributions
WARNING-1.285
owship-1.257
UC-1.236
 precaution-1.217
±-1.214
Tokenver
Token ID332
Feature activation+1.17e-01

Pos logit contributions
WARNING+2.392
owship+2.366
UC+2.348
 precaution+2.270
±+2.260
Neg logit contributions
vernment-2.333
 Aval-2.309
 emancipation-2.189
herent-2.184
Desk-2.155
Token quicker
Token ID20061
Feature activation-4.31e+00

Pos logit contributions
WARNING+0.094
owship+0.094
UC+0.092
 precaution+0.089
±+0.089
Neg logit contributions
vernment-0.070
 Aval-0.069
 emancipation-0.065
herent-0.065
Desk-0.063
Token,
Token ID11
Feature activation-1.24e+00

Pos logit contributions
 Aval+2.141
vernment+2.085
 emancipation+1.999
herent+1.925
 Contributions+1.858
Neg logit contributions
owship-3.916
WARNING-3.835
UC-3.739
±-3.654
��-3.641
Token twitch
Token ID37366
Feature activation-5.04e+00

Pos logit contributions
 Aval+0.837
vernment+0.832
 emancipation+0.792
herent+0.777
Desk+0.757
Neg logit contributions
owship-0.910
WARNING-0.902
UC-0.879
±-0.850
��-0.848
Tokenier
Token ID959
Feature activation+2.42e-01

Pos logit contributions
vernment+2.839
 Aval+2.835
 emancipation+2.731
herent+2.727
ivery+2.679
Neg logit contributions
owship-4.037
WARNING-4.033
UC-3.874
 precaution-3.870
±-3.777
Token defensive
Token ID6110
Feature activation+1.62e+00

Pos logit contributions
WARNING+0.214
owship+0.214
UC+0.211
±+0.204
 precaution+0.204
Neg logit contributions
vernment-0.125
 Aval-0.124
 emancipation-0.116
herent-0.114
Desk-0.112
Token backs
Token ID12983
Feature activation+2.01e+00

Pos logit contributions
WARNING+1.248
owship+1.236
UC+1.226
 precaution+1.188
±+1.185
Neg logit contributions
vernment-1.017
 Aval-1.006
 emancipation-0.948
herent-0.945
Desk-0.931
Token.
Token ID13
Feature activation-1.27e+00

Pos logit contributions
WARNING+1.532
owship+1.514
UC+1.510
 precaution+1.462
±+1.456
Neg logit contributions
vernment-1.279
 Aval-1.258
 emancipation-1.191
herent-1.188
Desk-1.175
Token\n
Token ID198
Feature activation-8.26e-01

Pos logit contributions
 Aval+0.867
vernment+0.844
herent+0.799
Desk+0.796
 emancipation+0.792
Neg logit contributions
owship-0.931
WARNING-0.916
UC-0.894
 precaution-0.890
±-0.862
Token that
Token ID326
Feature activation+9.51e-01

Pos logit contributions
WARNING+0.227
owship+0.226
UC+0.222
 precaution+0.216
±+0.215
Neg logit contributions
 Aval-0.187
vernment-0.187
 emancipation-0.176
herent-0.176
Desk-0.171
Token ends
Token ID5645
Feature activation+4.56e+00

Pos logit contributions
WARNING+0.686
owship+0.681
UC+0.672
 precaution+0.649
±+0.646
Neg logit contributions
vernment-0.647
 Aval-0.644
 emancipation-0.609
herent-0.607
Desk-0.597
Token with
Token ID351
Feature activation+1.38e+00

Pos logit contributions
WARNING+3.028
owship+2.969
UC+2.962
 precaution+2.842
±+2.837
Neg logit contributions
vernment-3.392
 Aval-3.325
 emancipation-3.162
herent-3.152
Desk-3.131
Token Sed
Token ID22710
Feature activation+1.81e+00

Pos logit contributions
WARNING+1.056
owship+1.047
UC+1.037
 precaution+1.004
±+1.001
Neg logit contributions
vernment-0.879
 Aval-0.871
 emancipation-0.822
herent-0.820
Desk-0.806
Tokenaris
Token ID20066
Feature activation-1.92e+00

Pos logit contributions
WARNING+1.480
UC+1.465
owship+1.461
±+1.422
 precaution+1.402
Neg logit contributions
vernment-1.061
 Aval-1.041
 emancipation-0.981
herent-0.970
Desk-0.954
Token covered
Token ID5017
Feature activation-5.91e+00

Pos logit contributions
 Aval+1.231
vernment+1.223
 emancipation+1.157
herent+1.146
Desk+1.128
Neg logit contributions
owship-1.463
WARNING-1.457
UC-1.424
 precaution-1.381
±-1.378
Token in
Token ID287
Feature activation-1.89e+00

Pos logit contributions
 Aval+3.134
vernment+2.979
 emancipation+2.909
herent+2.878
Desk+2.792
Neg logit contributions
WARNING-4.994
owship-4.931
UC-4.883
±-4.754
 precaution-4.742
Token blood
Token ID2910
Feature activation-2.79e+00

Pos logit contributions
 Aval+1.259
vernment+1.225
 emancipation+1.171
herent+1.161
Desk+1.131
Neg logit contributions
owship-1.404
WARNING-1.398
UC-1.370
±-1.324
 precaution-1.317
Token,
Token ID11
Feature activation-1.23e+00

Pos logit contributions
 Aval+1.566
vernment+1.554
herent+1.463
 emancipation+1.447
ivery+1.426
Neg logit contributions
WARNING-2.312
owship-2.288
UC-2.259
±-2.201
 precaution-2.196
Token missing
Token ID4814
Feature activation-3.66e+00

Pos logit contributions
 Aval+0.810
vernment+0.803
 emancipation+0.763
herent+0.757
Desk+0.748
Neg logit contributions
owship-0.903
WARNING-0.901
UC-0.886
±-0.856
 precaution-0.851
Token a
Token ID257
Feature activation-4.21e+00

Pos logit contributions
 Aval+2.176
vernment+2.096
 emancipation+2.022
herent+1.999
Desk+1.927
Neg logit contributions
WARNING-2.930
owship-2.897
UC-2.870
±-2.776
 precaution-2.726
Token have
Token ID423
Feature activation-5.71e+00

Pos logit contributions
vernment+3.055
 Aval+3.044
herent+2.886
 emancipation+2.855
abol+2.775
Neg logit contributions
WARNING-3.663
owship-3.619
UC-3.570
 precaution-3.422
 UCH-3.394
Token $
Token ID720
Feature activation+2.67e-01

Pos logit contributions
 Aval+3.338
vernment+3.331
 emancipation+3.182
herent+3.113
Desk+2.991
Neg logit contributions
WARNING-4.616
owship-4.566
UC-4.459
��-4.348
 precaution-4.207
Token16
Token ID1433
Feature activation-1.80e+00

Pos logit contributions
WARNING+0.194
owship+0.194
UC+0.190
 precaution+0.187
±+0.183
Neg logit contributions
vernment-0.178
 Aval-0.177
herent-0.168
 emancipation-0.167
Desk-0.166
Token million
Token ID1510
Feature activation+3.29e+00

Pos logit contributions
vernment+1.105
 Aval+1.082
 emancipation+1.024
herent+1.017
Desk+1.001
Neg logit contributions
WARNING-1.426
owship-1.411
UC-1.402
 precaution-1.355
±-1.344
Token in
Token ID287
Feature activation+9.76e-01

Pos logit contributions
WARNING+2.467
owship+2.445
UC+2.441
 precaution+2.372
±+2.355
Neg logit contributions
vernment-2.108
 Aval-2.084
herent-1.974
 emancipation-1.956
Desk-1.955
Token space
Token ID2272
Feature activation-5.33e+00

Pos logit contributions
WARNING+0.685
owship+0.677
UC+0.672
 precaution+0.651
±+0.647
Neg logit contributions
vernment-0.683
 Aval-0.675
 emancipation-0.640
herent-0.639
Desk-0.632
Token,
Token ID11
Feature activation-1.94e+00

Pos logit contributions
 Aval+3.037
vernment+3.023
 emancipation+2.846
herent+2.736
abol+2.672
Neg logit contributions
WARNING-4.421
owship-4.375
UC-4.257
��-4.128
±-4.101
Token plus
Token ID5556
Feature activation-1.91e+00

Pos logit contributions
 Aval+1.283
vernment+1.280
 emancipation+1.203
herent+1.185
Desk+1.169
Neg logit contributions
WARNING-1.432
owship-1.418
UC-1.386
 precaution-1.328
��-1.327
Token a
Token ID257
Feature activation-7.23e-01

Pos logit contributions
vernment+1.195
 Aval+1.187
 emancipation+1.127
herent+1.105
Desk+1.087
Neg logit contributions
WARNING-1.494
owship-1.479
UC-1.450
±-1.387
��-1.386
Token large
Token ID1588
Feature activation+1.04e+00

Pos logit contributions
vernment+0.483
 Aval+0.481
 emancipation+0.456
herent+0.452
ivery+0.443
Neg logit contributions
WARNING-0.530
owship-0.530
UC-0.518
±-0.496
��-0.492
Token chunk
Token ID16058
Feature activation+1.18e+00

Pos logit contributions
WARNING+0.757
owship+0.748
UC+0.742
 precaution+0.714
±+0.713
Neg logit contributions
vernment-0.705
 Aval-0.698
 emancipation-0.662
herent-0.658
Desk-0.649
Token,
Token ID11
Feature activation+5.44e-01

Pos logit contributions
vernment+1.432
 Aval+1.405
 emancipation+1.327
herent+1.321
Desk+1.306
Neg logit contributions
WARNING-1.578
owship-1.560
UC-1.535
 precaution-1.487
±-1.476
Token000
Token ID830
Feature activation-2.70e-01

Pos logit contributions
WARNING+0.249
owship+0.245
UC+0.241
 precaution+0.230
±+0.227
Neg logit contributions
vernment-0.514
 Aval-0.509
 emancipation-0.490
herent-0.489
Desk-0.484
Token new
Token ID649
Feature activation-5.89e-01

Pos logit contributions
vernment+0.168
 Aval+0.166
 emancipation+0.157
herent+0.156
Desk+0.154
Neg logit contributions
WARNING-0.211
owship-0.209
UC-0.207
 precaution-0.201
±-0.200
Token leases
Token ID38522
Feature activation-2.46e+00

Pos logit contributions
vernment+0.365
 Aval+0.361
 emancipation+0.340
herent+0.338
Desk+0.333
Neg logit contributions
WARNING-0.462
owship-0.457
UC-0.453
 precaution-0.439
±-0.438
Token for
Token ID329
Feature activation-1.24e+00

Pos logit contributions
vernment+1.535
 Aval+1.524
 emancipation+1.425
herent+1.418
Desk+1.385
Neg logit contributions
WARNING-1.927
owship-1.908
UC-1.865
 precaution-1.815
±-1.794
Token homeless
Token ID10463
Feature activation-6.00e+00

Pos logit contributions
vernment+0.786
 Aval+0.782
 emancipation+0.734
herent+0.730
Desk+0.716
Neg logit contributions
WARNING-0.950
owship-0.940
UC-0.932
±-0.898
 precaution-0.897
Token families
Token ID4172
Feature activation+5.62e-01

Pos logit contributions
 Aval+3.102
vernment+3.032
 emancipation+2.835
herent+2.749
ivery+2.688
Neg logit contributions
WARNING-5.306
owship-5.226
UC-5.126
±-4.968
 precaution-4.939
Token,
Token ID11
Feature activation-2.06e+00

Pos logit contributions
WARNING+0.423
owship+0.418
UC+0.416
 precaution+0.402
±+0.401
Neg logit contributions
vernment-0.365
 Aval-0.361
 emancipation-0.341
herent-0.340
Desk-0.335
Token but
Token ID475
Feature activation-2.46e+00

Pos logit contributions
vernment+1.316
 Aval+1.308
 emancipation+1.236
herent+1.213
Desk+1.194
Neg logit contributions
WARNING-1.577
owship-1.564
UC-1.541
 precaution-1.484
±-1.469
Token as
Token ID355
Feature activation+7.16e-01

Pos logit contributions
vernment+1.538
 Aval+1.537
 emancipation+1.440
herent+1.415
Desk+1.390
Neg logit contributions
WARNING-1.918
owship-1.908
UC-1.885
 precaution-1.809
±-1.804
Token many
Token ID867
Feature activation-2.17e+00

Pos logit contributions
WARNING+0.555
owship+0.550
UC+0.546
 precaution+0.530
±+0.527
Neg logit contributions
vernment-0.449
 Aval-0.443
 emancipation-0.418
herent-0.417
Desk-0.411