TOP ACTIVATIONS
MAX = 2.955

she believed his papers were fake and that he was an
contest , astronomers are using fake data to test methods to
players . <|endoftext|> A D DoS Gap occurs whenever D DoS
a very discrete manner . Esc orting somebody out of an
form Entertainment | Streaming Every Es port Under the Sun
nominee Hillary Clinton Hillary Diane Rodham Clinton Sanders : ' I
that the Moon landing was fake . Flynn s
DoS Gap occurs whenever D DoS traffic bypass es a company
university and law degrees are fake . The arrest
it is novel for the fake data to be created independently
to stop the spread of fake news . In
t Offer to Esc ort a Lady Home .
their home advantage will be null ified . Surprisingly , Iraq
I would give him . Dos age could be anywhere from
Local social times are also subs umed by urban times online
ker at otic plugs , mim icking the sp ines on
the men of Ban u Qur ay za , the ensl
fried - rice dish with esc olar and Chinese sausage that
petty - bour ge ois intellig ents ia resisted en masse
than three weeks ," says Es a mission manager Dr Rune

MIN ACTIVATIONS
MIN = -3.163

ju illet . pic . twitter . com / u I
fin isher " pic . twitter . com / Eh U
The Deb ates pic . twitter . com / O JV
never called . pic . twitter . com / CS 3
8 b ET pic . twitter . com / W 8
Deb ate 2016 pic . twitter . com / F c
Deb ates 2016 pic . twitter . com / j Ax
GO AL ! pic . twitter . com / E q
. youtube . com / watch ? v = 2 g
media bus ) pic . twitter . com / z J
: The Saga [/ b ][/ color ] - 100 %
ed Out Atlanta pic . twitter . com / xx A
Roman F lynn pic . twitter . com / gy d
Commando Rear med [/ b ][/ color ] - Normal ,
] Mega Pony [/ b ][/ color ] - 20 %
my kitchen essentials ! <|endoftext|> Posted on by | in
gy for herself . <|endoftext|> Posted on // According
the entrance to the Bangladesh Sund arb ans , southeast of
the waters of Bangladesh 's Sund arb ans , the largest
Be safe pic . twitter . com / 74 E

INTERVAL 1.426 - 2.955
CONTAINS 0.503%

. I remembered thinking " God , how could a guy
is this genius Patrick O ' Brian and you 're not
services . D DoS Attack Complex ity
s have historically been much shorter . The latest dispute is
not only on her ce int ure , but on her

INTERVAL -0.104 - 1.426
CONTAINS 60.120%

; but Diana 's reck lessness is a perfect match for
3 D MAX was doomed first when all u received
game than his daily hot dog snack . Now
used in deep Arctic waters . The Ark t ika was
ejac ulate soon . So you can watch porn and masturb

INTERVAL -1.633 - -0.104
CONTAINS 39.215%

two strings attached . The president told Fox News
security sector reform efforts in the D RC , but we
. The exceptionally adapt able , decentral ised tool that is
a joy ous Easter Sunday celebration , but the brunch turned
the players and the events of the first two acts ,

INTERVAL -3.163 - -1.633
CONTAINS 0.162%

or rogue governments , can use their knowledge to exploit an
busy street on repeat . Posted on August 1 on Facebook
and U . S . installations in Turkey ."[ 44 ]
farmland contaminated by heavy metals used to make batteries , computer
liberals are at war with Christmas , and that once again
Token she
Token ID673
Feature activation+5.37e-01

Pos logit contributions
ctic+0.295
20439+0.285
+0.281
asia+0.280
iron+0.274
Neg logit contributions
emouth-0.309
 signific-0.286
 independ-0.274
apist-0.269
itable-0.250
Token believed
Token ID4762
Feature activation+7.51e-01

Pos logit contributions
emouth+1.021
 signific+0.950
 independ+0.908
apist+0.885
itable+0.825
Neg logit contributions
ctic-1.059
20439-1.022
-0.997
asia-0.994
iron-0.977
Token his
Token ID465
Feature activation-1.19e-01

Pos logit contributions
emouth+1.392
 signific+1.297
 independ+1.238
apist+1.200
itable+1.121
Neg logit contributions
ctic-1.509
20439-1.473
-1.431
asia-1.424
iron-1.398
Token papers
Token ID9473
Feature activation-3.65e-01

Pos logit contributions
ctic+0.219
20439+0.211
+0.207
asia+0.206
iron+0.201
Neg logit contributions
emouth-0.240
 signific-0.223
 independ-0.214
apist-0.211
imental-0.196
Token were
Token ID547
Feature activation+4.14e-01

Pos logit contributions
ctic+0.785
20439+0.759
+0.747
asia+0.741
iron+0.734
Neg logit contributions
emouth-0.625
 signific-0.572
 independ-0.546
apist-0.534
itable-0.491
Token fake
Token ID8390
Feature activation+2.96e+00

Pos logit contributions
emouth+0.899
 signific+0.847
 independ+0.815
apist+0.794
itable+0.750
Neg logit contributions
ctic-0.698
20439-0.678
-0.655
asia-0.652
iron-0.637
Token and
Token ID290
Feature activation+2.79e-01

Pos logit contributions
emouth+5.009
 signific+4.819
 independ+4.485
apist+4.324
itable+4.105
Neg logit contributions
ctic-6.039
20439-5.932
-5.716
asia-5.700
orget-5.672
Token that
Token ID326
Feature activation+1.86e-01

Pos logit contributions
emouth+0.535
 signific+0.497
 independ+0.476
apist+0.463
itable+0.433
Neg logit contributions
ctic-0.545
20439-0.530
-0.517
asia-0.513
iron-0.504
Token he
Token ID339
Feature activation+1.22e-01

Pos logit contributions
emouth+0.360
 signific+0.334
 independ+0.320
apist+0.312
itable+0.292
Neg logit contributions
ctic-0.357
20439-0.347
-0.339
asia-0.337
iron-0.331
Token was
Token ID373
Feature activation-1.62e-01

Pos logit contributions
emouth+0.234
 signific+0.218
 independ+0.209
apist+0.204
imental+0.190
Neg logit contributions
ctic-0.239
20439-0.228
-0.224
asia-0.223
iron-0.220
Token an
Token ID281
Feature activation-1.86e-01

Pos logit contributions
ctic+0.295
20439+0.285
+0.277
asia+0.276
iron+0.271
Neg logit contributions
emouth-0.332
 signific-0.309
 independ-0.295
apist-0.291
itable-0.272
Token contest
Token ID8414
Feature activation-1.56e-01

Pos logit contributions
emouth+0.089
 signific+0.083
 independ+0.079
apist+0.078
itable+0.073
Neg logit contributions
ctic-0.079
20439-0.077
-0.074
asia-0.074
iron-0.073
Token,
Token ID11
Feature activation-9.35e-02

Pos logit contributions
ctic+0.323
20439+0.314
+0.307
asia+0.305
iron+0.301
Neg logit contributions
emouth-0.280
 signific-0.259
 independ-0.246
apist-0.240
itable-0.221
Token astronomers
Token ID33162
Feature activation-2.52e-01

Pos logit contributions
ctic+0.170
20439+0.164
+0.160
asia+0.159
iron+0.156
Neg logit contributions
emouth-0.191
 signific-0.178
 independ-0.170
apist-0.168
itable-0.157
Token are
Token ID389
Feature activation+1.08e-01

Pos logit contributions
ctic+0.477
20439+0.457
+0.449
asia+0.445
iron+0.440
Neg logit contributions
emouth-0.497
 signific-0.463
 independ-0.440
apist-0.436
imental-0.404
Token using
Token ID1262
Feature activation-8.91e-01

Pos logit contributions
emouth+0.214
 signific+0.200
 independ+0.190
apist+0.187
itable+0.174
Neg logit contributions
ctic-0.203
20439-0.197
-0.191
asia-0.191
iron-0.187
Token fake
Token ID8390
Feature activation+2.95e+00

Pos logit contributions
ctic+1.599
20439+1.543
+1.506
asia+1.493
vel+1.474
Neg logit contributions
emouth-1.834
 signific-1.696
 independ-1.624
apist-1.611
itable-1.499
Token data
Token ID1366
Feature activation+1.24e-01

Pos logit contributions
emouth+5.895
 signific+5.734
 independ+5.516
apist+5.130
itable+5.004
Neg logit contributions
ctic-5.181
20439-5.115
asia-4.883
-4.852
iron-4.738
Token to
Token ID284
Feature activation+3.99e-01

Pos logit contributions
emouth+0.265
 signific+0.248
 independ+0.239
apist+0.234
itable+0.220
Neg logit contributions
ctic-0.215
20439-0.209
-0.203
asia-0.202
iron-0.197
Token test
Token ID1332
Feature activation+2.10e-01

Pos logit contributions
emouth+0.813
 signific+0.755
 independ+0.724
apist+0.708
itable+0.663
Neg logit contributions
ctic-0.730
20439-0.707
-0.691
asia-0.686
iron-0.674
Token methods
Token ID5050
Feature activation-1.27e-01

Pos logit contributions
emouth+0.424
 signific+0.394
 independ+0.377
apist+0.371
itable+0.346
Neg logit contributions
ctic-0.387
20439-0.375
-0.366
asia-0.362
iron-0.356
Token to
Token ID284
Feature activation+4.79e-01

Pos logit contributions
ctic+0.233
20439+0.226
+0.220
asia+0.218
iron+0.214
Neg logit contributions
emouth-0.256
 signific-0.239
 independ-0.227
apist-0.224
itable-0.209
Token players
Token ID1938
Feature activation-3.62e-01

Pos logit contributions
ctic+0.463
20439+0.449
+0.441
asia+0.437
iron+0.427
Neg logit contributions
emouth-0.525
 signific-0.487
 independ-0.464
apist-0.458
imental-0.430
Token.
Token ID13
Feature activation+3.05e-01

Pos logit contributions
ctic+0.635
20439+0.616
+0.599
asia+0.594
iron+0.581
Neg logit contributions
emouth-0.761
 signific-0.716
 independ-0.684
apist-0.669
itable-0.631
Token<|endoftext|>
Token ID50256
Feature activation-5.16e-01

Pos logit contributions
emouth+0.669
 signific+0.630
 independ+0.606
apist+0.592
itable+0.559
Neg logit contributions
ctic-0.507
20439-0.492
-0.474
asia-0.472
iron-0.462
TokenA
Token ID32
Feature activation+2.43e-01

Pos logit contributions
ctic+0.822
20439+0.799
+0.779
asia+0.767
iron+0.754
Neg logit contributions
emouth-1.176
 signific-1.109
 independ-1.067
apist-1.038
itable-0.982
Token D
Token ID360
Feature activation+3.55e-01

Pos logit contributions
emouth+0.501
 signific+0.469
 independ+0.451
apist+0.439
itable+0.412
Neg logit contributions
ctic-0.440
20439-0.427
-0.415
asia-0.413
iron-0.404
TokenDoS
Token ID46498
Feature activation+2.88e+00

Pos logit contributions
emouth+0.792
 signific+0.747
 independ+0.720
apist+0.701
itable+0.663
Neg logit contributions
ctic-0.580
20439-0.561
-0.543
asia-0.540
iron-0.529
Token Gap
Token ID33899
Feature activation-2.59e-01

Pos logit contributions
emouth+5.338
 signific+4.963
 independ+4.756
apist+4.640
itable+4.326
Neg logit contributions
ctic-5.550
20439-5.504
-5.275
asia-5.261
orget-5.119
Token occurs
Token ID8833
Feature activation-1.46e-01

Pos logit contributions
ctic+0.461
20439+0.445
+0.435
asia+0.431
iron+0.424
Neg logit contributions
emouth-0.539
 signific-0.506
 independ-0.487
apist-0.472
itable-0.445
Token whenever
Token ID8797
Feature activation+2.26e-01

Pos logit contributions
ctic+0.216
20439+0.207
+0.200
asia+0.199
iron+0.194
Neg logit contributions
emouth-0.349
 signific-0.330
 independ-0.317
apist-0.312
imental-0.296
Token D
Token ID360
Feature activation+1.85e-01

Pos logit contributions
emouth+0.443
 signific+0.410
 independ+0.391
apist+0.384
imental+0.360
Neg logit contributions
ctic-0.433
20439-0.420
-0.411
asia-0.408
iron-0.399
TokenDoS
Token ID46498
Feature activation+2.76e+00

Pos logit contributions
emouth+0.319
 signific+0.295
 independ+0.282
apist+0.270
itable+0.250
Neg logit contributions
ctic-0.397
20439-0.384
asia-0.377
-0.377
iron-0.372
Token a
Token ID257
Feature activation+2.29e-01

Pos logit contributions
emouth+0.114
 signific+0.106
 independ+0.101
apist+0.099
itable+0.092
Neg logit contributions
ctic-0.108
20439-0.104
-0.102
asia-0.101
iron-0.099
Token very
Token ID845
Feature activation-1.18e+00

Pos logit contributions
emouth+0.472
 signific+0.440
 independ+0.423
apist+0.414
itable+0.387
Neg logit contributions
ctic-0.413
20439-0.398
-0.388
asia-0.386
iron-0.377
Token discrete
Token ID28810
Feature activation+8.89e-01

Pos logit contributions
ctic+2.103
+1.907
20439+1.901
asia+1.899
vel+1.891
Neg logit contributions
emouth-2.508
 signific-2.266
apist-2.204
 independ-2.183
imental-2.046
Token manner
Token ID5642
Feature activation+1.23e+00

Pos logit contributions
emouth+1.840
 signific+1.733
 independ+1.660
apist+1.611
itable+1.522
Neg logit contributions
ctic-1.588
20439-1.551
-1.498
asia-1.489
iron-1.461
Token.
Token ID13
Feature activation+9.73e-02

Pos logit contributions
emouth+2.331
 signific+2.186
 independ+2.091
apist+2.026
itable+1.893
Neg logit contributions
ctic-2.401
20439-2.368
-2.301
asia-2.279
iron-2.233
Token Esc
Token ID16319
Feature activation+2.80e+00

Pos logit contributions
emouth+0.222
 signific+0.210
 independ+0.202
apist+0.198
itable+0.187
Neg logit contributions
ctic-0.154
20439-0.149
-0.145
asia-0.143
iron-0.139
Tokenorting
Token ID24707
Feature activation+1.22e+00

Pos logit contributions
emouth+5.336
 signific+4.978
 independ+4.770
apist+4.769
itable+4.423
Neg logit contributions
ctic-5.340
20439-5.248
-5.072
asia-5.024
iron-4.910
Token somebody
Token ID8276
Feature activation-7.03e-01

Pos logit contributions
emouth+2.359
 signific+2.207
 independ+2.113
apist+2.043
itable+1.919
Neg logit contributions
ctic-2.346
20439-2.299
-2.224
asia-2.213
iron-2.173
Token out
Token ID503
Feature activation+4.96e-02

Pos logit contributions
ctic+1.247
20439+1.151
+1.144
asia+1.133
iron+1.117
Neg logit contributions
emouth-1.490
 signific-1.381
 independ-1.323
apist-1.314
itable-1.243
Token of
Token ID286
Feature activation-7.59e-01

Pos logit contributions
emouth+0.100
 signific+0.094
 independ+0.089
apist+0.087
itable+0.082
Neg logit contributions
ctic-0.092
20439-0.089
-0.086
asia-0.086
iron-0.084
Token an
Token ID281
Feature activation+4.53e-02

Pos logit contributions
ctic+1.544
20439+1.454
+1.453
asia+1.446
vel+1.417
Neg logit contributions
emouth-1.385
 signific-1.275
 independ-1.218
apist-1.197
imental-1.107
Tokenform
Token ID687
Feature activation+5.33e-03

Pos logit contributions
emouth+0.693
 signific+0.651
 independ+0.625
apist+0.609
itable+0.576
Neg logit contributions
ctic-0.536
20439-0.517
-0.505
asia-0.503
iron-0.495
Token Entertainment
Token ID11058
Feature activation+9.60e-01

Pos logit contributions
emouth+0.011
 signific+0.010
 independ+0.010
apist+0.010
itable+0.009
Neg logit contributions
ctic-0.010
20439-0.009
asia-0.009
-0.009
iron-0.009
Token |
Token ID930
Feature activation+6.98e-01

Pos logit contributions
emouth+1.981
 signific+1.863
 independ+1.789
apist+1.739
itable+1.638
Neg logit contributions
ctic-1.719
20439-1.683
-1.620
asia-1.607
iron-1.577
Token Streaming
Token ID43124
Feature activation+1.05e+00

Pos logit contributions
emouth+1.408
 signific+1.321
 independ+1.266
apist+1.230
itable+1.157
Neg logit contributions
ctic-1.287
20439-1.255
-1.214
asia-1.205
iron-1.182
Token Every
Token ID3887
Feature activation-1.81e-02

Pos logit contributions
emouth+2.160
 signific+2.025
 independ+1.945
apist+1.893
itable+1.777
Neg logit contributions
ctic-1.902
20439-1.866
-1.792
asia-1.781
iron-1.747
Token Es
Token ID8678
Feature activation+2.80e+00

Pos logit contributions
ctic+0.032
20439+0.031
asia+0.030
+0.030
iron+0.030
Neg logit contributions
emouth-0.038
 signific-0.035
 independ-0.034
apist-0.033
itable-0.031
Tokenport
Token ID634
Feature activation+1.06e+00

Pos logit contributions
emouth+5.207
 signific+4.926
 independ+4.736
apist+4.624
itable+4.329
Neg logit contributions
ctic-5.426
20439-5.358
-5.195
asia-5.107
orget-5.054
Token Under
Token ID4698
Feature activation+7.95e-01

Pos logit contributions
emouth+2.209
 signific+2.073
 independ+1.992
apist+1.941
itable+1.826
Neg logit contributions
ctic-1.883
20439-1.834
-1.775
asia-1.763
iron-1.727
Token the
Token ID262
Feature activation+7.30e-02

Pos logit contributions
emouth+1.686
 signific+1.584
 independ+1.522
apist+1.488
itable+1.403
Neg logit contributions
ctic-1.374
20439-1.349
-1.292
asia-1.286
iron-1.256
Token Sun
Token ID3825
Feature activation-1.14e+00

Pos logit contributions
emouth+0.161
 signific+0.152
 independ+0.146
apist+0.144
imental+0.135
Neg logit contributions
ctic-0.122
20439-0.117
-0.112
asia-0.112
iron-0.110
Token
Token ID198
Feature activation+4.96e-01

Pos logit contributions
ctic+1.954
20439+1.855
+1.843
asia+1.831
iron+1.812
Neg logit contributions
emouth-2.450
 signific-2.275
 independ-2.175
apist-2.125
itable-2.005
Token nominee
Token ID10429
Feature activation+5.63e-02

Pos logit contributions
emouth+1.290
 signific+1.215
 independ+1.167
apist+1.127
itable+1.061
Neg logit contributions
ctic-1.179
20439-1.156
-1.118
asia-1.110
iron-1.089
Token Hillary
Token ID5073
Feature activation+1.39e+00

Pos logit contributions
emouth+0.104
 signific+0.096
 independ+0.091
apist+0.089
itable+0.083
Neg logit contributions
ctic-0.114
20439-0.111
-0.108
asia-0.108
iron-0.106
Token Clinton
Token ID2605
Feature activation+1.34e+00

Pos logit contributions
emouth+1.970
 signific+1.794
 independ+1.684
apist+1.618
itable+1.471
Neg logit contributions
ctic-3.381
20439-3.313
-3.240
asia-3.225
iron-3.176
Token Hillary
Token ID5073
Feature activation+1.47e+00

Pos logit contributions
emouth+2.420
 signific+2.254
 independ+2.146
apist+2.081
itable+1.939
Neg logit contributions
ctic-2.740
20439-2.685
-2.615
asia-2.597
iron-2.545
Token Diane
Token ID26542
Feature activation+1.15e+00

Pos logit contributions
emouth+3.053
 signific+2.871
 independ+2.752
apist+2.686
itable+2.532
Neg logit contributions
ctic-2.615
20439-2.542
-2.463
asia-2.444
iron-2.394
Token Rodham
Token ID41848
Feature activation+2.78e+00

Pos logit contributions
emouth+1.487
 signific+1.331
 independ+1.250
apist+1.190
imental+1.058
Neg logit contributions
ctic-2.976
20439-2.919
asia-2.866
-2.858
iron-2.821
Token Clinton
Token ID2605
Feature activation+1.67e+00

Pos logit contributions
emouth+3.897
 signific+3.552
 independ+3.320
apist+3.194
itable+2.915
Neg logit contributions
ctic-6.830
20439-6.675
-6.530
asia-6.498
iron-6.395
TokenSanders
Token ID26747
Feature activation+1.54e+00

Pos logit contributions
emouth+3.432
 signific+3.232
 independ+3.103
apist+3.035
itable+2.850
Neg logit contributions
ctic-2.984
20439-2.886
-2.793
asia-2.783
iron-2.714
Token:
Token ID25
Feature activation+5.66e-01

Pos logit contributions
emouth+3.267
 signific+3.080
 independ+2.961
apist+2.887
itable+2.724
Neg logit contributions
ctic-2.666
20439-2.566
-2.500
asia-2.478
iron-2.420
Token '
Token ID705
Feature activation-4.65e-03

Pos logit contributions
emouth+1.073
 signific+0.995
 independ+0.950
apist+0.923
imental+0.868
Neg logit contributions
ctic-1.124
20439-1.098
-1.065
asia-1.049
orget-1.028
TokenI
Token ID40
Feature activation+3.71e-02

Pos logit contributions
ctic+0.007
20439+0.007
+0.007
asia+0.007
orget+0.007
Neg logit contributions
emouth-0.011
 signific-0.010
apist-0.010
 independ-0.010
itable-0.009
Token that
Token ID326
Feature activation-3.32e-02

Pos logit contributions
emouth+0.034
 signific+0.032
 independ+0.031
apist+0.030
itable+0.028
Neg logit contributions
ctic-0.034
20439-0.033
-0.032
asia-0.032
iron-0.031
Token the
Token ID262
Feature activation-2.55e-01

Pos logit contributions
ctic+0.063
20439+0.061
+0.059
asia+0.059
iron+0.058
Neg logit contributions
emouth-0.066
 signific-0.061
 independ-0.058
apist-0.057
itable-0.054
Token Moon
Token ID6869
Feature activation-1.67e+00

Pos logit contributions
ctic+0.455
20439+0.437
+0.426
asia+0.425
iron+0.416
Neg logit contributions
emouth-0.533
 signific-0.497
 independ-0.476
apist-0.467
imental-0.440
Token landing
Token ID9581
Feature activation-1.12e+00

Pos logit contributions
ctic+2.336
20439+2.233
+2.194
iron+2.184
vel+2.152
Neg logit contributions
emouth-4.062
 signific-3.816
 independ-3.694
apist-3.646
imental-3.456
Token was
Token ID373
Feature activation+2.20e-02

Pos logit contributions
ctic+2.087
20439+1.987
+1.981
asia+1.957
iron+1.934
Neg logit contributions
emouth-2.249
 signific-2.088
 independ-1.998
apist-1.962
imental-1.830
Token fake
Token ID8390
Feature activation+2.78e+00

Pos logit contributions
emouth+0.050
 signific+0.047
 independ+0.045
apist+0.044
itable+0.042
Neg logit contributions
ctic-0.036
20439-0.035
-0.033
asia-0.033
iron-0.032
Token.
Token ID13
Feature activation-5.05e-02

Pos logit contributions
emouth+4.664
 signific+4.526
 independ+4.321
apist+4.054
itable+3.884
Neg logit contributions
ctic-5.672
20439-5.668
orget-5.335
-5.309
asia-5.307
Token Flynn
Token ID14654
Feature activation-6.63e-01

Pos logit contributions
ctic+0.084
20439+0.082
+0.079
asia+0.079
iron+0.077
Neg logit contributions
emouth-0.111
 signific-0.105
 independ-0.101
apist-0.098
itable-0.093
Token
Token ID447
Feature activation-5.27e-01

Pos logit contributions
ctic+1.273
+1.203
20439+1.202
asia+1.182
iron+1.176
Neg logit contributions
emouth-1.305
 signific-1.195
 independ-1.149
apist-1.128
imental-1.065
Token
Token ID247
Feature activation-1.80e-01

Pos logit contributions
ctic+0.663
20439+0.630
+0.618
asia+0.604
+0.597
Neg logit contributions
emouth-1.375
 signific-1.305
 independ-1.267
apist-1.240
itable-1.182
Tokens
Token ID82
Feature activation+1.90e-01

Pos logit contributions
ctic+0.437
20439+0.422
+0.419
asia+0.414
iron+0.411
Neg logit contributions
emouth-0.264
 signific-0.237
 independ-0.224
apist-0.216
itable-0.195
TokenDoS
Token ID46498
Feature activation+2.88e+00

Pos logit contributions
emouth+0.792
 signific+0.747
 independ+0.720
apist+0.701
itable+0.663
Neg logit contributions
ctic-0.580
20439-0.561
-0.543
asia-0.540
iron-0.529
Token Gap
Token ID33899
Feature activation-2.59e-01

Pos logit contributions
emouth+5.338
 signific+4.963
 independ+4.756
apist+4.640
itable+4.326
Neg logit contributions
ctic-5.550
20439-5.504
-5.275
asia-5.261
orget-5.119
Token occurs
Token ID8833
Feature activation-1.46e-01

Pos logit contributions
ctic+0.461
20439+0.445
+0.435
asia+0.431
iron+0.424
Neg logit contributions
emouth-0.539
 signific-0.506
 independ-0.487
apist-0.472
itable-0.445
Token whenever
Token ID8797
Feature activation+2.26e-01

Pos logit contributions
ctic+0.216
20439+0.207
+0.200
asia+0.199
iron+0.194
Neg logit contributions
emouth-0.349
 signific-0.330
 independ-0.317
apist-0.312
imental-0.296
Token D
Token ID360
Feature activation+1.85e-01

Pos logit contributions
emouth+0.443
 signific+0.410
 independ+0.391
apist+0.384
imental+0.360
Neg logit contributions
ctic-0.433
20439-0.420
-0.411
asia-0.408
iron-0.399
TokenDoS
Token ID46498
Feature activation+2.76e+00

Pos logit contributions
emouth+0.319
 signific+0.295
 independ+0.282
apist+0.270
itable+0.250
Neg logit contributions
ctic-0.397
20439-0.384
asia-0.377
-0.377
iron-0.372
Token traffic
Token ID4979
Feature activation+8.89e-01

Pos logit contributions
emouth+4.981
 signific+4.670
 independ+4.448
apist+4.270
itable+3.989
Neg logit contributions
ctic-5.480
20439-5.409
asia-5.173
-5.156
iron-5.085
Token bypass
Token ID17286
Feature activation+2.32e-01

Pos logit contributions
emouth+1.662
 signific+1.543
 independ+1.475
apist+1.437
itable+1.340
Neg logit contributions
ctic-1.778
20439-1.727
-1.688
asia-1.676
iron-1.642
Tokenes
Token ID274
Feature activation+7.49e-01

Pos logit contributions
emouth+0.342
 signific+0.313
 independ+0.294
apist+0.286
imental+0.260
Neg logit contributions
ctic-0.554
20439-0.541
-0.531
asia-0.525
iron-0.518
Token a
Token ID257
Feature activation+3.00e-01

Pos logit contributions
emouth+1.470
 signific+1.369
 independ+1.311
apist+1.279
itable+1.196
Neg logit contributions
ctic-1.424
20439-1.382
-1.349
asia-1.337
iron-1.310
Token company
Token ID1664
Feature activation-2.65e-01

Pos logit contributions
emouth+0.610
 signific+0.568
 independ+0.545
apist+0.534
imental+0.499
Neg logit contributions
ctic-0.549
20439-0.531
-0.520
asia-0.516
orget-0.503
Token university
Token ID6403
Feature activation-8.34e-02

Pos logit contributions
emouth+0.697
 signific+0.649
 independ+0.622
apist+0.609
itable+0.571
Neg logit contributions
ctic-0.607
20439-0.587
-0.572
asia-0.570
iron-0.556
Token and
Token ID290
Feature activation+6.69e-02

Pos logit contributions
ctic+0.158
20439+0.152
+0.149
asia+0.148
iron+0.145
Neg logit contributions
emouth-0.164
 signific-0.153
 independ-0.147
apist-0.143
itable-0.134
Token law
Token ID1099
Feature activation+9.72e-01

Pos logit contributions
emouth+0.134
 signific+0.125
 independ+0.119
apist+0.117
itable+0.109
Neg logit contributions
ctic-0.125
20439-0.121
-0.118
asia-0.118
iron-0.115
Token degrees
Token ID7370
Feature activation+9.43e-02

Pos logit contributions
emouth+1.940
 signific+1.814
 independ+1.738
apist+1.691
itable+1.587
Neg logit contributions
ctic-1.818
20439-1.766
-1.716
asia-1.707
iron-1.674
Token are
Token ID389
Feature activation-8.36e-02

Pos logit contributions
emouth+0.161
 signific+0.148
 independ+0.141
apist+0.137
itable+0.127
Neg logit contributions
ctic-0.203
20439-0.198
-0.194
asia-0.192
iron-0.190
Token fake
Token ID8390
Feature activation+2.71e+00

Pos logit contributions
ctic+0.144
20439+0.140
+0.135
asia+0.135
iron+0.132
Neg logit contributions
emouth-0.179
 signific-0.168
 independ-0.161
apist-0.158
itable-0.149
Token.
Token ID13
Feature activation+2.28e-01

Pos logit contributions
emouth+4.842
 signific+4.629
 independ+4.390
apist+4.198
itable+4.002
Neg logit contributions
20439-5.297
ctic-5.287
-5.033
asia-5.012
orget-5.011
Token
Token ID198
Feature activation+4.30e-01

Pos logit contributions
emouth+0.496
 signific+0.467
 independ+0.449
apist+0.438
itable+0.414
Neg logit contributions
ctic-0.383
20439-0.372
-0.360
asia-0.358
iron-0.350
Token
Token ID198
Feature activation+4.10e-02

Pos logit contributions
emouth+0.958
 signific+0.901
 independ+0.868
apist+0.844
itable+0.796
Neg logit contributions
ctic-0.710
20439-0.684
-0.669
asia-0.663
iron-0.647
TokenThe
Token ID464
Feature activation+4.49e-01

Pos logit contributions
emouth+0.093
 signific+0.088
 independ+0.085
apist+0.083
itable+0.078
Neg logit contributions
ctic-0.065
20439-0.063
-0.061
asia-0.061
iron-0.060
Token arrest
Token ID3251
Feature activation-5.87e-01

Pos logit contributions
emouth+0.917
 signific+0.856
 independ+0.820
apist+0.800
itable+0.753
Neg logit contributions
ctic-0.821
20439-0.797
-0.774
asia-0.771
iron-0.753
Token it
Token ID340
Feature activation+3.16e-01

Pos logit contributions
emouth+1.022
 signific+0.955
 independ+0.915
apist+0.893
itable+0.838
Neg logit contributions
ctic-0.933
20439-0.907
-0.881
asia-0.875
iron-0.859
Token is
Token ID318
Feature activation-1.37e-01

Pos logit contributions
emouth+0.600
 signific+0.559
 independ+0.534
apist+0.519
itable+0.486
Neg logit contributions
ctic-0.620
20439-0.602
-0.586
asia-0.583
iron-0.573
Token novel
Token ID5337
Feature activation+1.08e+00

Pos logit contributions
ctic+0.229
20439+0.221
+0.215
asia+0.213
iron+0.209
Neg logit contributions
emouth-0.299
 signific-0.280
 independ-0.269
apist-0.265
itable-0.249
Token for
Token ID329
Feature activation+3.16e-01

Pos logit contributions
emouth+2.157
 signific+2.036
 independ+1.941
apist+1.885
imental+1.790
Neg logit contributions
ctic-1.998
20439-1.964
-1.892
asia-1.869
iron-1.840
Token the
Token ID262
Feature activation+2.60e-01

Pos logit contributions
emouth+0.647
 signific+0.606
 independ+0.582
apist+0.567
itable+0.533
Neg logit contributions
ctic-0.572
20439-0.556
-0.539
asia-0.536
iron-0.525
Token fake
Token ID8390
Feature activation+2.69e+00

Pos logit contributions
emouth+0.534
 signific+0.501
 independ+0.481
apist+0.467
itable+0.440
Neg logit contributions
ctic-0.472
20439-0.460
-0.446
asia-0.443
iron-0.434
Token data
Token ID1366
Feature activation-1.27e-01

Pos logit contributions
emouth+5.342
 signific+5.157
 independ+4.912
apist+4.668
itable+4.486
Neg logit contributions
ctic-4.741
20439-4.724
-4.548
asia-4.491
orget-4.394
Token to
Token ID284
Feature activation+3.39e-01

Pos logit contributions
ctic+0.233
20439+0.225
+0.220
asia+0.219
iron+0.213
Neg logit contributions
emouth-0.258
 signific-0.239
 independ-0.231
apist-0.226
itable-0.211
Token be
Token ID307
Feature activation+8.75e-02

Pos logit contributions
emouth+0.715
 signific+0.670
 independ+0.645
apist+0.629
itable+0.592
Neg logit contributions
ctic-0.593
20439-0.575
-0.559
asia-0.556
iron-0.544
Token created
Token ID2727
Feature activation+4.28e-01

Pos logit contributions
emouth+0.180
 signific+0.167
 independ+0.160
apist+0.158
itable+0.148
Neg logit contributions
ctic-0.159
20439-0.153
-0.149
asia-0.149
iron-0.146
Token independently
Token ID14799
Feature activation+1.05e-01

Pos logit contributions
emouth+0.862
 signific+0.804
 independ+0.768
apist+0.752
itable+0.705
Neg logit contributions
ctic-0.794
20439-0.767
-0.747
asia-0.744
iron-0.731
Token to
Token ID284
Feature activation+3.34e-01

Pos logit contributions
emouth+1.344
 signific+1.266
 independ+1.219
apist+1.187
itable+1.121
Neg logit contributions
ctic-1.033
20439-1.004
-0.972
asia-0.964
iron-0.942
Token stop
Token ID2245
Feature activation+3.65e-01

Pos logit contributions
emouth+0.657
 signific+0.610
 independ+0.583
apist+0.570
itable+0.533
Neg logit contributions
ctic-0.636
20439-0.613
-0.602
asia-0.596
iron-0.587
Token the
Token ID262
Feature activation-2.19e-01

Pos logit contributions
emouth+0.768
 signific+0.724
 independ+0.695
apist+0.675
itable+0.638
Neg logit contributions
ctic-0.638
20439-0.622
-0.601
asia-0.598
iron-0.585
Token spread
Token ID4104
Feature activation+6.35e-01

Pos logit contributions
ctic+0.404
20439+0.389
+0.380
asia+0.378
iron+0.370
Neg logit contributions
emouth-0.444
 signific-0.413
 independ-0.396
apist-0.388
imental-0.362
Token of
Token ID286
Feature activation+1.99e-01

Pos logit contributions
emouth+1.184
 signific+1.107
 independ+1.058
apist+1.031
itable+0.968
Neg logit contributions
ctic-1.249
20439-1.223
-1.195
asia-1.183
orget-1.158
Token fake
Token ID8390
Feature activation+2.64e+00

Pos logit contributions
emouth+0.418
 signific+0.393
 independ+0.378
apist+0.368
itable+0.347
Neg logit contributions
ctic-0.350
20439-0.341
-0.330
asia-0.328
iron-0.321
Token news
Token ID1705
Feature activation+7.32e-01

Pos logit contributions
emouth+3.434
 signific+3.225
 independ+3.007
apist+2.780
itable+2.676
Neg logit contributions
ctic-6.536
20439-6.460
asia-6.211
-6.206
orget-6.132
Token.
Token ID13
Feature activation-2.27e-01

Pos logit contributions
emouth+1.401
 signific+1.309
 independ+1.250
apist+1.213
itable+1.138
Neg logit contributions
ctic-1.424
20439-1.390
-1.348
asia-1.340
iron-1.315
Token
Token ID198
Feature activation-1.92e-01

Pos logit contributions
ctic+0.375
20439+0.363
+0.353
asia+0.350
iron+0.342
Neg logit contributions
emouth-0.504
 signific-0.475
 independ-0.458
apist-0.444
itable-0.420
Token
Token ID198
Feature activation-4.20e-01

Pos logit contributions
ctic+0.308
20439+0.294
+0.291
asia+0.287
iron+0.279
Neg logit contributions
emouth-0.439
 signific-0.414
 independ-0.399
apist-0.386
 surpr-0.371
TokenIn
Token ID818
Feature activation-1.81e-01

Pos logit contributions
ctic+0.640
20439+0.622
+0.609
asia+0.596
iron+0.586
Neg logit contributions
emouth-0.990
 signific-0.936
 independ-0.900
apist-0.872
itable-0.827
Token
Token ID447
Feature activation-1.81e-01

Pos logit contributions
emouth+0.026
 signific+0.024
 independ+0.024
apist+0.023
itable+0.022
Neg logit contributions
ctic-0.012
20439-0.011
-0.011
asia-0.011
iron-0.011
Token
Token ID247
Feature activation+8.79e-02

Pos logit contributions
ctic+0.241
20439+0.232
+0.225
asia+0.220
iron+0.216
Neg logit contributions
emouth-0.459
 signific-0.436
 independ-0.422
apist-0.413
itable-0.392
Tokent
Token ID83
Feature activation+6.24e-02

Pos logit contributions
emouth+0.181
 signific+0.170
 independ+0.163
apist+0.158
itable+0.149
Neg logit contributions
ctic-0.160
20439-0.154
-0.151
asia-0.149
iron-0.147
Token Offer
Token ID33085
Feature activation+4.43e-01

Pos logit contributions
emouth+0.139
 signific+0.131
 independ+0.126
apist+0.123
itable+0.116
Neg logit contributions
ctic-0.102
20439-0.099
-0.096
asia-0.095
iron-0.093
Token to
Token ID284
Feature activation+2.45e-01

Pos logit contributions
emouth+0.948
 signific+0.891
 independ+0.856
apist+0.835
itable+0.787
Neg logit contributions
ctic-0.761
20439-0.738
-0.717
asia-0.712
iron-0.697
Token Esc
Token ID16319
Feature activation+2.61e+00

Pos logit contributions
emouth+0.541
 signific+0.509
 independ+0.490
apist+0.478
itable+0.452
Neg logit contributions
ctic-0.403
20439-0.390
-0.380
asia-0.376
iron-0.369
Tokenort
Token ID419
Feature activation+1.40e+00

Pos logit contributions
emouth+4.531
 signific+4.159
apist+4.040
 independ+3.960
itable+3.694
Neg logit contributions
ctic-5.429
20439-5.344
-5.134
asia-5.102
-4.951
Token a
Token ID257
Feature activation+2.48e-01

Pos logit contributions
emouth+2.783
 signific+2.589
 independ+2.486
apist+2.417
itable+2.275
Neg logit contributions
ctic-2.604
20439-2.558
-2.457
asia-2.445
iron-2.398
Token Lady
Token ID11182
Feature activation+8.88e-01

Pos logit contributions
emouth+0.505
 signific+0.473
 independ+0.454
apist+0.441
itable+0.415
Neg logit contributions
ctic-0.452
20439-0.440
-0.427
asia-0.424
iron-0.416
Token Home
Token ID5995
Feature activation+1.28e-01

Pos logit contributions
emouth+1.821
 signific+1.703
 independ+1.635
apist+1.599
itable+1.498
Neg logit contributions
ctic-1.597
20439-1.566
-1.510
asia-1.503
iron-1.468
Token.
Token ID13
Feature activation-2.94e-01

Pos logit contributions
emouth+0.262
 signific+0.246
 independ+0.235
apist+0.228
itable+0.215
Neg logit contributions
ctic-0.233
20439-0.225
-0.219
asia-0.219
iron-0.214
Token their
Token ID511
Feature activation-1.77e-01

Pos logit contributions
ctic+0.489
20439+0.469
asia+0.460
+0.458
iron+0.450
Neg logit contributions
emouth-0.527
 signific-0.489
 independ-0.468
apist-0.460
itable-0.431
Token home
Token ID1363
Feature activation+1.57e-01

Pos logit contributions
ctic+0.364
20439+0.350
+0.344
asia+0.343
iron+0.336
Neg logit contributions
emouth-0.321
 signific-0.294
 independ-0.280
apist-0.277
itable-0.255
Token advantage
Token ID4621
Feature activation-2.87e-01

Pos logit contributions
emouth+0.315
 signific+0.294
 independ+0.281
apist+0.274
itable+0.257
Neg logit contributions
ctic-0.292
20439-0.282
-0.275
asia-0.274
iron-0.268
Token will
Token ID481
Feature activation-2.01e-01

Pos logit contributions
ctic+0.541
20439+0.513
+0.508
asia+0.507
iron+0.495
Neg logit contributions
emouth-0.571
 signific-0.528
 independ-0.507
apist-0.500
itable-0.465
Token be
Token ID307
Feature activation-3.41e-01

Pos logit contributions
ctic+0.330
20439+0.315
+0.308
asia+0.306
iron+0.300
Neg logit contributions
emouth-0.449
 signific-0.420
 independ-0.405
apist-0.398
itable-0.374
Token null
Token ID9242
Feature activation+2.59e+00

Pos logit contributions
ctic+0.629
20439+0.593
asia+0.587
+0.585
iron+0.578
Neg logit contributions
emouth-0.691
 signific-0.635
 independ-0.611
apist-0.609
imental-0.568
Tokenified
Token ID1431
Feature activation+8.13e-01

Pos logit contributions
emouth+4.206
 signific+3.914
 independ+3.696
apist+3.574
itable+3.388
Neg logit contributions
ctic-5.696
20439-5.687
-5.443
asia-5.394
orget-5.327
Token.
Token ID13
Feature activation+5.51e-02

Pos logit contributions
emouth+1.640
 signific+1.539
 independ+1.477
apist+1.433
itable+1.349
Neg logit contributions
ctic-1.490
20439-1.461
-1.412
asia-1.401
iron-1.370
Token Surprisingly
Token ID47183
Feature activation-5.01e-01

Pos logit contributions
emouth+0.122
 signific+0.115
 independ+0.111
apist+0.108
itable+0.102
Neg logit contributions
ctic-0.091
20439-0.088
-0.086
asia-0.085
iron-0.083
Token,
Token ID11
Feature activation-2.15e-01

Pos logit contributions
ctic+1.038
20439+1.002
asia+0.989
+0.979
iron+0.968
Neg logit contributions
emouth-0.902
 signific-0.829
 independ-0.787
apist-0.773
imental-0.715
Token Iraq
Token ID3908
Feature activation+4.14e-01

Pos logit contributions
ctic+0.387
20439+0.372
asia+0.364
+0.363
iron+0.356
Neg logit contributions
emouth-0.444
 signific-0.413
 independ-0.395
apist-0.389
itable-0.366
Token I
Token ID314
Feature activation-2.21e-03

Pos logit contributions
emouth+0.281
 signific+0.262
 independ+0.251
apist+0.244
itable+0.228
Neg logit contributions
ctic-0.273
20439-0.266
-0.258
asia-0.257
iron-0.252
Token would
Token ID561
Feature activation+3.93e-01

Pos logit contributions
ctic+0.004
20439+0.004
+0.004
asia+0.004
iron+0.004
Neg logit contributions
emouth-0.004
 signific-0.004
 independ-0.004
apist-0.004
itable-0.003
Token give
Token ID1577
Feature activation+3.98e-01

Pos logit contributions
emouth+0.737
 signific+0.688
 independ+0.658
apist+0.638
itable+0.596
Neg logit contributions
ctic-0.780
20439-0.762
-0.740
asia-0.736
iron-0.722
Token him
Token ID683
Feature activation-4.45e-01

Pos logit contributions
emouth+0.787
 signific+0.736
 independ+0.705
apist+0.686
itable+0.644
Neg logit contributions
ctic-0.749
20439-0.729
-0.708
asia-0.704
iron-0.690
Token.
Token ID13
Feature activation+1.85e-01

Pos logit contributions
ctic+0.795
20439+0.763
+0.743
asia+0.738
iron+0.728
Neg logit contributions
emouth-0.929
 signific-0.875
 independ-0.835
apist-0.816
itable-0.768
Token Dos
Token ID43976
Feature activation+2.56e+00

Pos logit contributions
emouth+0.425
 signific+0.401
 independ+0.386
apist+0.377
itable+0.357
Neg logit contributions
ctic-0.291
20439-0.282
-0.273
asia-0.270
iron-0.264
Tokenage
Token ID496
Feature activation+7.78e-01

Pos logit contributions
emouth+4.664
 signific+4.378
 independ+4.167
apist+4.082
itable+3.798
Neg logit contributions
ctic-5.077
20439-5.019
-4.877
asia-4.838
orget-4.731
Token could
Token ID714
Feature activation+3.12e-01

Pos logit contributions
emouth+1.519
 signific+1.411
 independ+1.350
apist+1.318
itable+1.233
Neg logit contributions
ctic-1.498
20439-1.447
-1.420
asia-1.406
iron-1.380
Token be
Token ID307
Feature activation+2.61e-01

Pos logit contributions
emouth+0.667
 signific+0.622
 independ+0.599
apist+0.586
itable+0.550
Neg logit contributions
ctic-0.542
20439-0.519
-0.510
asia-0.505
iron-0.496
Token anywhere
Token ID6609
Feature activation-2.07e-01

Pos logit contributions
emouth+0.523
 signific+0.480
 independ+0.462
apist+0.454
imental+0.426
Neg logit contributions
ctic-0.492
20439-0.469
-0.462
asia-0.458
iron-0.451
Token from
Token ID422
Feature activation+4.98e-01

Pos logit contributions
ctic+0.364
20439+0.347
+0.339
asia+0.338
iron+0.331
Neg logit contributions
emouth-0.441
 signific-0.409
 independ-0.393
apist-0.387
itable-0.364
Token Local
Token ID10714
Feature activation+5.31e-01

Pos logit contributions
emouth+0.275
 signific+0.258
 independ+0.249
apist+0.243
itable+0.230
Neg logit contributions
ctic-0.201
20439-0.195
-0.189
asia-0.187
iron-0.183
Token social
Token ID1919
Feature activation+3.62e-01

Pos logit contributions
emouth+1.076
 signific+1.013
 independ+0.971
apist+0.940
itable+0.888
Neg logit contributions
ctic-0.972
20439-0.950
-0.920
asia-0.911
iron-0.895
Token times
Token ID1661
Feature activation+3.76e-01

Pos logit contributions
emouth+0.827
 signific+0.781
 independ+0.752
apist+0.734
itable+0.696
Neg logit contributions
ctic-0.572
20439-0.555
-0.535
asia-0.531
iron-0.519
Token are
Token ID389
Feature activation+1.67e-01

Pos logit contributions
emouth+0.705
 signific+0.654
 independ+0.624
apist+0.608
itable+0.568
Neg logit contributions
ctic-0.749
20439-0.727
-0.711
asia-0.707
iron-0.693
Token also
Token ID635
Feature activation+7.22e-01

Pos logit contributions
emouth+0.348
 signific+0.324
 independ+0.311
apist+0.305
itable+0.287
Neg logit contributions
ctic-0.300
20439-0.290
-0.282
asia-0.280
iron-0.274
Token subs
Token ID6352
Feature activation+2.55e+00

Pos logit contributions
emouth+1.463
 signific+1.375
 independ+1.319
apist+1.279
itable+1.204
Neg logit contributions
ctic-1.322
20439-1.289
-1.249
asia-1.242
iron-1.217
Tokenumed
Token ID18940
Feature activation-1.21e-01

Pos logit contributions
emouth+4.029
 signific+3.732
 independ+3.530
apist+3.485
itable+3.260
Neg logit contributions
20439-5.662
ctic-5.629
-5.513
asia-5.336
iron-5.260
Token by
Token ID416
Feature activation+3.95e-01

Pos logit contributions
ctic+0.241
20439+0.231
asia+0.227
+0.227
iron+0.224
Neg logit contributions
emouth-0.229
 signific-0.209
 independ-0.200
apist-0.198
itable-0.184
Token urban
Token ID7876
Feature activation-2.09e-01

Pos logit contributions
emouth+0.780
 signific+0.731
 independ+0.700
apist+0.679
itable+0.638
Neg logit contributions
ctic-0.744
20439-0.725
-0.704
asia-0.699
iron-0.687
Token times
Token ID1661
Feature activation+1.54e-01

Pos logit contributions
ctic+0.360
20439+0.349
+0.338
asia+0.336
iron+0.329
Neg logit contributions
emouth-0.449
 signific-0.421
 independ-0.405
apist-0.395
itable-0.372
Token online
Token ID2691
Feature activation+1.02e-01

Pos logit contributions
emouth+0.293
 signific+0.272
 independ+0.260
apist+0.253
itable+0.237
Neg logit contributions
ctic-0.304
20439-0.295
-0.287
asia-0.287
iron-0.281
Token ker
Token ID41927
Feature activation-9.55e-02

Pos logit contributions
emouth+1.179
 signific+1.114
 independ+1.086
apist+1.062
itable+1.012
Neg logit contributions
ctic-0.566
20439-0.537
-0.517
asia-0.513
iron-0.494
Tokenat
Token ID265
Feature activation-4.71e-01

Pos logit contributions
ctic+0.175
20439+0.165
asia+0.164
+0.163
iron+0.160
Neg logit contributions
emouth-0.198
 signific-0.187
 independ-0.180
apist-0.169
 surpr-0.166
Tokenotic
Token ID6210
Feature activation-3.87e-01

Pos logit contributions
ctic+0.759
20439+0.696
asia+0.693
iron+0.676
+0.675
Neg logit contributions
emouth-1.090
 signific-1.032
 independ-1.003
apist-0.961
 surpr-0.934
Token plugs
Token ID37008
Feature activation-4.89e-01

Pos logit contributions
ctic+0.551
20439+0.531
+0.519
asia+0.507
vel+0.491
Neg logit contributions
emouth-0.944
 signific-0.883
 independ-0.858
apist-0.843
imental-0.802
Token,
Token ID11
Feature activation+4.91e-01

Pos logit contributions
ctic+0.966
+0.927
20439+0.921
asia+0.917
[/+0.902
Neg logit contributions
emouth-0.939
 signific-0.870
 independ-0.840
apist-0.803
 surpr-0.783
Token mim
Token ID17007
Feature activation+2.50e+00

Pos logit contributions
emouth+0.652
 signific+0.577
 independ+0.538
apist+0.521
itable+0.464
Neg logit contributions
ctic-1.257
20439-1.222
-1.206
asia-1.198
iron-1.178
Tokenicking
Token ID7958
Feature activation+1.12e+00

Pos logit contributions
emouth+5.617
 signific+5.225
 independ+5.074
apist+4.957
itable+4.838
Neg logit contributions
20439-3.881
ctic-3.822
-3.642
asia-3.557
orget-3.522
Token the
Token ID262
Feature activation+4.04e-01

Pos logit contributions
emouth+2.063
 signific+1.926
 independ+1.832
apist+1.786
itable+1.678
Neg logit contributions
ctic-2.220
20439-2.186
-2.102
asia-2.098
iron-2.062
Token sp
Token ID599
Feature activation+4.78e-01

Pos logit contributions
emouth+1.037
 signific+0.984
 independ+0.953
apist+0.934
itable+0.891
Neg logit contributions
ctic-0.525
20439-0.504
-0.483
asia-0.479
iron-0.465
Tokenines
Token ID1127
Feature activation+5.05e-01

Pos logit contributions
emouth+0.971
 signific+0.911
 independ+0.873
apist+0.849
itable+0.797
Neg logit contributions
ctic-0.874
20439-0.849
-0.826
asia-0.821
iron-0.805
Token on
Token ID319
Feature activation+3.46e-01

Pos logit contributions
emouth+1.117
 signific+1.050
 independ+1.011
apist+0.986
itable+0.931
Neg logit contributions
ctic-0.839
20439-0.812
-0.790
asia-0.783
iron-0.765
Token the
Token ID262
Feature activation+1.21e-01

Pos logit contributions
ctic+0.109
20439+0.106
+0.104
asia+0.103
iron+0.101
Neg logit contributions
emouth-0.119
 signific-0.110
 independ-0.106
apist-0.104
imental-0.097
Token men
Token ID1450
Feature activation+1.59e-01

Pos logit contributions
emouth+0.247
 signific+0.228
 independ+0.219
apist+0.216
imental+0.201
Neg logit contributions
ctic-0.220
20439-0.213
-0.208
asia-0.207
iron-0.203
Token of
Token ID286
Feature activation-3.42e-01

Pos logit contributions
emouth+0.322
 signific+0.301
 independ+0.288
apist+0.281
itable+0.264
Neg logit contributions
ctic-0.295
20439-0.286
-0.278
asia-0.277
iron-0.272
Token Ban
Token ID10274
Feature activation-5.30e-02

Pos logit contributions
ctic+0.618
20439+0.596
asia+0.584
+0.584
iron+0.572
Neg logit contributions
emouth-0.702
 signific-0.653
 independ-0.627
apist-0.615
itable-0.575
Tokenu
Token ID84
Feature activation-5.15e-01

Pos logit contributions
ctic+0.096
20439+0.093
asia+0.091
+0.091
iron+0.089
Neg logit contributions
emouth-0.108
 signific-0.101
 independ-0.098
apist-0.095
imental-0.089
Token Qur
Token ID17599
Feature activation+2.49e+00

Pos logit contributions
ctic+0.965
20439+0.925
+0.911
asia+0.908
vel+0.891
Neg logit contributions
emouth-1.029
 signific-0.952
 independ-0.917
apist-0.900
imental-0.842
Tokenay
Token ID323
Feature activation+2.38e-01

Pos logit contributions
emouth+4.971
 signific+4.742
 independ+4.516
apist+4.384
itable+4.207
Neg logit contributions
20439-4.450
ctic-4.409
asia-4.162
-4.154
orget-4.061
Tokenza
Token ID4496
Feature activation+1.68e-01

Pos logit contributions
emouth+0.494
 signific+0.462
 independ+0.445
apist+0.430
itable+0.404
Neg logit contributions
ctic-0.429
20439-0.411
-0.404
asia-0.400
iron-0.390
Token,
Token ID11
Feature activation+2.36e-01

Pos logit contributions
emouth+0.323
 signific+0.301
 independ+0.288
apist+0.280
itable+0.262
Neg logit contributions
ctic-0.325
20439-0.316
-0.308
asia-0.306
iron-0.301
Token the
Token ID262
Feature activation-1.67e-01

Pos logit contributions
emouth+0.471
 signific+0.440
 independ+0.421
apist+0.410
itable+0.385
Neg logit contributions
ctic-0.441
20439-0.430
-0.417
asia-0.415
iron-0.407
Token ensl
Token ID25563
Feature activation-3.59e-02

Pos logit contributions
ctic+0.303
20439+0.292
+0.288
asia+0.285
iron+0.280
Neg logit contributions
emouth-0.344
 signific-0.317
 independ-0.305
apist-0.300
imental-0.280
Tokenfried
Token ID25520
Feature activation-2.97e-01

Pos logit contributions
emouth+0.124
 signific+0.116
 independ+0.111
apist+0.107
itable+0.100
Neg logit contributions
ctic-0.120
20439-0.117
-0.115
asia-0.114
iron-0.114
Token-
Token ID12
Feature activation+1.44e-01

Pos logit contributions
ctic+0.545
20439+0.524
+0.516
asia+0.514
iron+0.507
Neg logit contributions
emouth-0.601
 signific-0.559
 independ-0.536
apist-0.528
itable-0.492
Tokenrice
Token ID20970
Feature activation-6.08e-01

Pos logit contributions
emouth+0.302
 signific+0.284
 independ+0.272
apist+0.265
itable+0.250
Neg logit contributions
ctic-0.254
20439-0.247
-0.239
asia-0.238
iron-0.233
Token dish
Token ID9433
Feature activation+1.37e-01

Pos logit contributions
ctic+1.147
20439+1.097
asia+1.085
+1.084
iron+1.064
Neg logit contributions
emouth-1.208
 signific-1.119
 independ-1.072
apist-1.050
itable-0.983
Token with
Token ID351
Feature activation-1.11e-02

Pos logit contributions
emouth+0.270
 signific+0.252
 independ+0.241
apist+0.235
itable+0.219
Neg logit contributions
ctic-0.262
20439-0.253
asia-0.247
-0.246
iron-0.242
Token esc
Token ID3671
Feature activation+2.49e+00

Pos logit contributions
ctic+0.021
20439+0.020
+0.020
asia+0.020
iron+0.020
Neg logit contributions
emouth-0.022
 signific-0.020
 independ-0.019
apist-0.019
itable-0.018
Tokenolar
Token ID6192
Feature activation-6.12e-01

Pos logit contributions
emouth+4.550
 signific+4.206
apist+4.031
 independ+4.020
itable+3.694
Neg logit contributions
ctic-4.947
20439-4.919
-4.784
asia-4.642
iron-4.566
Token and
Token ID290
Feature activation+7.88e-03

Pos logit contributions
ctic+1.193
20439+1.145
asia+1.131
+1.124
iron+1.103
Neg logit contributions
emouth-1.177
 signific-1.088
 independ-1.050
apist-1.018
itable-0.958
Token Chinese
Token ID3999
Feature activation-7.70e-02

Pos logit contributions
emouth+0.016
 signific+0.014
 independ+0.014
apist+0.013
itable+0.013
Neg logit contributions
ctic-0.015
20439-0.014
-0.014
asia-0.014
iron-0.014
Token sausage
Token ID31889
Feature activation+7.70e-01

Pos logit contributions
ctic+0.148
20439+0.144
+0.142
asia+0.141
iron+0.138
Neg logit contributions
emouth-0.148
 signific-0.137
 independ-0.132
apist-0.129
itable-0.121
Token that
Token ID326
Feature activation+3.84e-01

Pos logit contributions
emouth+1.458
 signific+1.359
 independ+1.299
apist+1.262
itable+1.180
Neg logit contributions
ctic-1.514
20439-1.476
-1.435
asia-1.427
iron-1.401
Token petty
Token ID25229
Feature activation+5.41e-01

Pos logit contributions
ctic+0.435
20439+0.422
asia+0.410
+0.409
iron+0.401
Neg logit contributions
emouth-0.481
 signific-0.447
 independ-0.429
apist-0.419
itable-0.393
Token-
Token ID12
Feature activation+3.24e-01

Pos logit contributions
emouth+0.883
 signific+0.817
 independ+0.773
apist+0.745
itable+0.687
Neg logit contributions
ctic-1.205
20439-1.183
-1.152
asia-1.144
iron-1.124
Tokenbour
Token ID6084
Feature activation+8.13e-01

Pos logit contributions
emouth+0.532
 signific+0.490
 independ+0.465
apist+0.449
itable+0.414
Neg logit contributions
ctic-0.721
20439-0.705
-0.688
asia-0.685
iron-0.674
Tokenge
Token ID469
Feature activation-8.57e-01

Pos logit contributions
emouth+1.729
 signific+1.625
 independ+1.561
apist+1.521
itable+1.434
Neg logit contributions
ctic-1.412
20439-1.371
-1.328
asia-1.320
iron-1.293
Tokenois
Token ID10924
Feature activation+1.14e+00

Pos logit contributions
ctic+1.374
20439+1.292
asia+1.282
+1.271
iron+1.262
Neg logit contributions
emouth-1.919
 signific-1.827
 independ-1.755
apist-1.702
 surpr-1.614
Token intellig
Token ID10878
Feature activation+2.46e+00

Pos logit contributions
emouth+1.856
 signific+1.723
 independ+1.634
apist+1.564
itable+1.450
Neg logit contributions
ctic-2.548
20439-2.496
-2.438
asia-2.415
iron-2.373
Tokenents
Token ID658
Feature activation+3.56e-01

Pos logit contributions
emouth+3.792
 signific+3.509
 independ+3.322
apist+3.219
imental+3.020
Neg logit contributions
ctic-5.569
20439-5.469
-5.341
asia-5.281
orget-5.163
Tokenia
Token ID544
Feature activation-3.30e-01

Pos logit contributions
emouth+0.791
 signific+0.747
 independ+0.719
apist+0.698
itable+0.658
Neg logit contributions
ctic-0.589
20439-0.564
asia-0.552
-0.548
iron-0.537
Token resisted
Token ID26643
Feature activation+7.15e-01

Pos logit contributions
ctic+0.655
20439+0.632
asia+0.615
+0.614
iron+0.604
Neg logit contributions
emouth-0.624
 signific-0.576
 independ-0.551
apist-0.537
imental-0.501
Token en
Token ID551
Feature activation+1.90e-01

Pos logit contributions
emouth+1.395
 signific+1.306
 independ+1.251
apist+1.214
itable+1.138
Neg logit contributions
ctic-1.362
20439-1.328
-1.288
asia-1.280
iron-1.254
Token masse
Token ID41869
Feature activation+3.77e-02

Pos logit contributions
emouth+0.380
 signific+0.356
 independ+0.341
apist+0.332
itable+0.311
Neg logit contributions
ctic-0.355
20439-0.345
-0.335
asia-0.333
iron-0.327
Token than
Token ID621
Feature activation-6.24e-01

Pos logit contributions
ctic+0.576
20439+0.552
+0.534
asia+0.533
iron+0.520
Neg logit contributions
emouth-0.874
 signific-0.816
 independ-0.792
apist-0.776
itable-0.730
Token three
Token ID1115
Feature activation+1.09e+00

Pos logit contributions
ctic+1.180
20439+1.123
asia+1.100
+1.097
[/+1.093
Neg logit contributions
emouth-1.248
 signific-1.114
 independ-1.075
apist-1.066
imental-0.996
Token weeks
Token ID2745
Feature activation-7.16e-01

Pos logit contributions
emouth+2.082
 signific+1.980
 independ+1.890
apist+1.815
itable+1.724
Neg logit contributions
ctic-2.069
20439-2.052
-1.982
asia-1.954
orget-1.921
Token,"
Token ID553
Feature activation+1.26e+00

Pos logit contributions
ctic+1.313
20439+1.248
asia+1.222
+1.214
iron+1.198
Neg logit contributions
emouth-1.464
 signific-1.351
 independ-1.307
apist-1.276
itable-1.205
Token says
Token ID1139
Feature activation-5.90e-01

Pos logit contributions
emouth+2.416
 signific+2.269
 independ+2.170
apist+2.101
itable+1.974
Neg logit contributions
ctic-2.438
20439-2.400
-2.332
asia-2.303
iron-2.259
Token Es
Token ID8678
Feature activation+2.45e+00

Pos logit contributions
ctic+1.037
20439+0.989
asia+0.970
+0.959
iron+0.947
Neg logit contributions
emouth-1.241
 signific-1.152
 independ-1.103
apist-1.088
imental-1.018
Tokena
Token ID64
Feature activation+1.85e-01

Pos logit contributions
emouth+4.003
 signific+3.733
 independ+3.583
apist+3.462
itable+3.208
Neg logit contributions
20439-5.268
ctic-5.259
-5.111
asia-5.026
orget-4.960
Token mission
Token ID4365
Feature activation+8.42e-01

Pos logit contributions
emouth+0.362
 signific+0.338
 independ+0.323
apist+0.315
itable+0.295
Neg logit contributions
ctic-0.354
20439-0.344
-0.335
asia-0.333
iron-0.327
Token manager
Token ID4706
Feature activation+1.47e-01

Pos logit contributions
emouth+1.630
 signific+1.523
 independ+1.458
apist+1.415
itable+1.326
Neg logit contributions
ctic-1.619
20439-1.579
-1.534
asia-1.524
iron-1.496
Token Dr
Token ID1583
Feature activation-1.01e-01

Pos logit contributions
emouth+0.316
 signific+0.297
 independ+0.285
apist+0.279
itable+0.263
Neg logit contributions
ctic-0.250
20439-0.242
-0.235
asia-0.234
iron-0.229
Token Rune
Token ID28228
Feature activation-2.99e-01

Pos logit contributions
ctic+0.173
20439+0.167
asia+0.162
+0.162
iron+0.158
Neg logit contributions
emouth-0.216
 signific-0.203
 independ-0.195
apist-0.190
itable-0.179
Token ju
Token ID7544
Feature activation-1.12e+00

Pos logit contributions
ctic+0.770
20439+0.745
+0.724
asia+0.723
iron+0.711
Neg logit contributions
emouth-0.839
 signific-0.778
 independ-0.744
apist-0.733
itable-0.684
Tokenillet
Token ID32512
Feature activation-4.89e-01

Pos logit contributions
ctic+1.551
20439+1.448
+1.420
asia+1.418
iron+1.414
Neg logit contributions
emouth-2.756
 signific-2.599
 independ-2.512
apist-2.484
imental-2.345
Token.
Token ID13
Feature activation-3.09e-01

Pos logit contributions
ctic+0.926
20439+0.884
asia+0.869
+0.868
iron+0.854
Neg logit contributions
emouth-0.970
 signific-0.894
 independ-0.855
apist-0.846
imental-0.787
Token pic
Token ID8301
Feature activation-1.61e+00

Pos logit contributions
ctic+0.553
20439+0.533
+0.520
asia+0.517
iron+0.508
Neg logit contributions
emouth-0.642
 signific-0.597
 independ-0.574
apist-0.564
itable-0.528
Token.
Token ID13
Feature activation-8.13e-01

Pos logit contributions
ctic+2.504
+2.360
asia+2.338
[/+2.315
20439+2.310
Neg logit contributions
emouth-3.743
 signific-3.472
 independ-3.350
apist-3.309
 distingu-3.248
Tokentwitter
Token ID6956
Feature activation-3.16e+00

Pos logit contributions
ctic+0.884
20439+0.821
asia+0.803
+0.793
iron+0.768
Neg logit contributions
emouth-2.285
 signific-2.182
 independ-2.124
apist-2.065
 surpr-2.006
Token.
Token ID13
Feature activation-1.31e+00

Pos logit contributions
ctic+4.191
[/+4.006
+3.960
asia+3.914
20439+3.762
Neg logit contributions
emouth-8.002
 distingu-7.627
 signific-7.534
 surpr-7.452
 independ-7.388
Tokencom
Token ID785
Feature activation-1.97e+00

Pos logit contributions
ctic+2.230
20439+2.163
asia+2.128
+2.099
iron+2.071
Neg logit contributions
emouth-2.848
 signific-2.700
 independ-2.613
apist-2.473
 surpr-2.465
Token/
Token ID14
Feature activation-1.73e+00

Pos logit contributions
ctic+3.002
+2.851
[/+2.804
20439+2.771
asia+2.728
Neg logit contributions
emouth-4.688
 signific-4.395
 distingu-4.307
 independ-4.270
 surpr-4.208
Tokenu
Token ID84
Feature activation-8.51e-01

Pos logit contributions
ctic+2.880
20439+2.760
+2.715
asia+2.681
FR+2.673
Neg logit contributions
emouth-3.783
 signific-3.570
 independ-3.456
apist-3.312
 surpr-3.147
TokenI
Token ID40
Feature activation-6.60e-01

Pos logit contributions
ctic+1.319
20439+1.268
+1.239
asia+1.227
iron+1.208
Neg logit contributions
emouth-1.972
 signific-1.863
 independ-1.801
apist-1.745
imental-1.650
Token fin
Token ID957
Feature activation-4.26e-01

Pos logit contributions
ctic+0.434
20439+0.412
+0.391
asia+0.384
iron+0.373
Neg logit contributions
emouth-1.182
 signific-1.125
 independ-1.090
apist-1.073
itable-1.028
Tokenisher
Token ID4828
Feature activation-6.35e-01

Pos logit contributions
ctic+0.483
20439+0.443
asia+0.435
+0.426
iron+0.422
Neg logit contributions
emouth-1.155
 signific-1.120
 independ-1.080
apist-1.039
 distingu-1.018
Token"
Token ID1
Feature activation+1.81e+00

Pos logit contributions
ctic+1.119
20439+1.082
+1.056
asia+1.046
iron+1.032
Neg logit contributions
emouth-1.331
 signific-1.245
 independ-1.189
apist-1.164
itable-1.098
Token pic
Token ID8301
Feature activation-8.68e-01

Pos logit contributions
emouth+3.520
 signific+3.319
 independ+3.185
apist+3.090
itable+2.879
Neg logit contributions
ctic-3.406
20439-3.336
-3.195
asia-3.191
orget-3.100
Token.
Token ID13
Feature activation-5.01e-01

Pos logit contributions
ctic+1.422
+1.351
20439+1.349
asia+1.333
[/+1.312
Neg logit contributions
emouth-1.980
 signific-1.847
 independ-1.784
apist-1.726
 surpr-1.690
Tokentwitter
Token ID6956
Feature activation-3.02e+00

Pos logit contributions
ctic+0.578
20439+0.549
+0.526
asia+0.523
iron+0.504
Neg logit contributions
emouth-1.368
 signific-1.303
 independ-1.266
apist-1.234
itable-1.181
Token.
Token ID13
Feature activation-1.26e+00

Pos logit contributions
ctic+4.069
[/+3.874
+3.848
asia+3.758
20439+3.669
Neg logit contributions
emouth-7.627
 distingu-7.280
 signific-7.202
 surpr-7.127
 independ-7.053
Tokencom
Token ID785
Feature activation-1.95e+00

Pos logit contributions
ctic+2.138
20439+2.083
asia+2.037
+2.019
iron+1.980
Neg logit contributions
emouth-2.763
 signific-2.613
 independ-2.535
 surpr-2.407
 distingu-2.392
Token/
Token ID14
Feature activation-1.73e+00

Pos logit contributions
ctic+2.950
+2.801
[/+2.769
20439+2.734
asia+2.673
Neg logit contributions
emouth-4.670
 signific-4.384
 distingu-4.313
 independ-4.260
 surpr-4.221
TokenEh
Token ID43894
Feature activation+5.94e-01

Pos logit contributions
ctic+2.866
20439+2.749
+2.699
asia+2.667
FR+2.660
Neg logit contributions
emouth-3.766
 signific-3.555
 independ-3.443
apist-3.297
 surpr-3.138
TokenU
Token ID52
Feature activation-8.13e-01

Pos logit contributions
emouth+1.353
 signific+1.277
 independ+1.230
apist+1.204
itable+1.142
Neg logit contributions
ctic-0.935
20439-0.907
-0.874
asia-0.868
iron-0.846
TokenThe
Token ID464
Feature activation+1.11e-01

Pos logit contributions
emouth+0.292
 signific+0.277
 independ+0.266
apist+0.260
itable+0.247
Neg logit contributions
ctic-0.190
20439-0.183
-0.178
asia-0.177
iron-0.172
TokenDeb
Token ID16587
Feature activation-6.04e-02

Pos logit contributions
emouth+0.297
 signific+0.283
 independ+0.274
apist+0.268
itable+0.256
Neg logit contributions
ctic-0.132
20439-0.126
-0.121
asia-0.120
iron-0.116
Tokenates
Token ID689
Feature activation+1.48e-01

Pos logit contributions
ctic+0.087
20439+0.082
+0.080
asia+0.080
iron+0.078
Neg logit contributions
emouth-0.147
 signific-0.140
 independ-0.134
apist-0.131
imental-0.124
Token pic
Token ID8301
Feature activation-1.20e+00

Pos logit contributions
emouth+0.306
 signific+0.286
 independ+0.275
apist+0.268
itable+0.252
Neg logit contributions
ctic-0.266
20439-0.258
-0.251
asia-0.250
iron-0.245
Token.
Token ID13
Feature activation-6.22e-01

Pos logit contributions
ctic+1.880
+1.786
asia+1.772
[/+1.750
20439+1.743
Neg logit contributions
emouth-2.808
 signific-2.611
 independ-2.536
apist-2.460
 distingu-2.457
Tokentwitter
Token ID6956
Feature activation-3.00e+00

Pos logit contributions
ctic+0.684
20439+0.641
+0.621
asia+0.620
iron+0.595
Neg logit contributions
emouth-1.743
 signific-1.664
 independ-1.619
apist-1.567
 surpr-1.526
Token.
Token ID13
Feature activation-1.37e+00

Pos logit contributions
ctic+4.041
[/+3.858
+3.806
asia+3.770
vel+3.631
Neg logit contributions
emouth-7.588
 distingu-7.198
 signific-7.139
 surpr-7.043
 independ-6.984
Tokencom
Token ID785
Feature activation-1.97e+00

Pos logit contributions
ctic+2.334
20439+2.268
asia+2.239
+2.221
iron+2.182
Neg logit contributions
emouth-2.986
 signific-2.832
 independ-2.730
apist-2.594
 surpr-2.587
Token/
Token ID14
Feature activation-1.72e+00

Pos logit contributions
ctic+2.993
+2.861
[/+2.808
20439+2.774
vel+2.720
Neg logit contributions
emouth-4.684
 signific-4.397
 distingu-4.280
 independ-4.256
 surpr-4.192
TokenO
Token ID46
Feature activation-6.51e-01

Pos logit contributions
ctic+2.871
20439+2.748
+2.707
asia+2.675
FR+2.661
Neg logit contributions
emouth-3.759
 signific-3.547
 independ-3.434
apist-3.289
 surpr-3.129
TokenJV
Token ID41697
Feature activation+1.59e+00

Pos logit contributions
ctic+1.083
20439+1.039
+1.018
asia+1.008
iron+0.990
Neg logit contributions
emouth-1.440
 signific-1.359
 independ-1.309
apist-1.265
 Somers-1.195
Token never
Token ID1239
Feature activation+1.64e-02

Pos logit contributions
emouth+0.774
 signific+0.724
 independ+0.695
apist+0.679
itable+0.636
Neg logit contributions
ctic-0.688
20439-0.663
-0.648
asia-0.643
iron-0.633
Token called
Token ID1444
Feature activation-7.47e-02

Pos logit contributions
emouth+0.032
 signific+0.029
 independ+0.028
apist+0.028
imental+0.026
Neg logit contributions
ctic-0.032
20439-0.030
-0.030
asia-0.030
iron-0.030
Token.
Token ID13
Feature activation-1.56e-01

Pos logit contributions
ctic+0.138
+0.131
20439+0.131
asia+0.129
iron+0.129
Neg logit contributions
emouth-0.151
 signific-0.140
 independ-0.133
apist-0.132
imental-0.123
Token pic
Token ID8301
Feature activation-1.36e+00

Pos logit contributions
ctic+0.257
20439+0.248
+0.242
asia+0.240
iron+0.234
Neg logit contributions
emouth-0.347
 signific-0.327
 independ-0.315
apist-0.307
itable-0.290
Token.
Token ID13
Feature activation-5.92e-01

Pos logit contributions
ctic+2.157
+2.040
asia+2.011
20439+2.009
[/+2.001
Neg logit contributions
emouth-3.171
 signific-2.936
 independ-2.848
apist-2.774
 distingu-2.738
Tokentwitter
Token ID6956
Feature activation-2.98e+00

Pos logit contributions
ctic+0.675
20439+0.637
+0.613
asia+0.610
iron+0.588
Neg logit contributions
emouth-1.627
 signific-1.550
 independ-1.508
apist-1.467
 surpr-1.409
Token.
Token ID13
Feature activation-1.07e+00

Pos logit contributions
ctic+4.055
[/+3.891
+3.818
asia+3.726
20439+3.635
Neg logit contributions
emouth-7.496
 distingu-7.088
 signific-7.073
 surpr-6.944
 independ-6.919
Tokencom
Token ID785
Feature activation-1.87e+00

Pos logit contributions
ctic+1.834
20439+1.785
asia+1.748
+1.742
iron+1.702
Neg logit contributions
emouth-2.323
 signific-2.191
 independ-2.129
 surpr-2.007
apist-2.004
Token/
Token ID14
Feature activation-1.59e+00

Pos logit contributions
ctic+2.865
+2.722
[/+2.689
20439+2.646
asia+2.590
Neg logit contributions
emouth-4.433
 signific-4.150
 distingu-4.047
 independ-4.032
 surpr-3.966
TokenCS
Token ID7902
Feature activation+3.14e-01

Pos logit contributions
ctic+2.641
20439+2.531
+2.488
asia+2.457
FR+2.442
Neg logit contributions
emouth-3.467
 signific-3.272
 independ-3.169
apist-3.038
 surpr-2.884
Token3
Token ID18
Feature activation-8.85e-02

Pos logit contributions
emouth+0.705
 signific+0.665
 independ+0.640
apist+0.624
itable+0.591
Neg logit contributions
ctic-0.509
20439-0.493
-0.476
asia-0.473
iron-0.463
Token8
Token ID23
Feature activation-1.71e-01

Pos logit contributions
ctic+0.350
20439+0.338
+0.329
asia+0.326
iron+0.320
Neg logit contributions
emouth-0.484
 signific-0.457
 independ-0.441
apist-0.428
itable-0.404
Tokenb
Token ID65
Feature activation+4.79e-01

Pos logit contributions
ctic+0.289
20439+0.280
+0.272
asia+0.270
iron+0.265
Neg logit contributions
emouth-0.372
 signific-0.350
 independ-0.337
apist-0.327
itable-0.308
TokenET
Token ID2767
Feature activation-1.07e-01

Pos logit contributions
emouth+1.097
 signific+1.036
 independ+0.998
apist+0.975
itable+0.924
Neg logit contributions
ctic-0.751
20439-0.727
-0.702
asia-0.697
iron-0.680
Token pic
Token ID8301
Feature activation-1.22e+00

Pos logit contributions
ctic+0.188
20439+0.181
+0.177
asia+0.175
iron+0.172
Neg logit contributions
emouth-0.228
 signific-0.214
 independ-0.206
apist-0.200
imental-0.188
Token.
Token ID13
Feature activation-9.26e-01

Pos logit contributions
ctic+1.923
+1.836
asia+1.803
20439+1.803
[/+1.784
Neg logit contributions
emouth-2.854
 signific-2.659
 independ-2.577
apist-2.491
 distingu-2.480
Tokentwitter
Token ID6956
Feature activation-2.96e+00

Pos logit contributions
ctic+0.991
20439+0.920
+0.889
asia+0.885
iron+0.854
Neg logit contributions
emouth-2.631
 signific-2.509
 independ-2.444
apist-2.363
 surpr-2.312
Token.
Token ID13
Feature activation-1.30e+00

Pos logit contributions
ctic+4.000
[/+3.829
+3.780
asia+3.697
20439+3.627
Neg logit contributions
emouth-7.458
 distingu-7.080
 signific-7.041
 surpr-6.926
 independ-6.892
Tokencom
Token ID785
Feature activation-1.86e+00

Pos logit contributions
ctic+2.193
20439+2.139
asia+2.082
+2.080
iron+2.028
Neg logit contributions
emouth-2.878
 signific-2.727
 independ-2.646
 surpr-2.510
 distingu-2.498
Token/
Token ID14
Feature activation-1.72e+00

Pos logit contributions
ctic+2.832
+2.704
[/+2.668
20439+2.657
asia+2.567
Neg logit contributions
emouth-4.415
 signific-4.163
 distingu-4.072
 independ-4.039
 surpr-3.983
TokenW
Token ID54
Feature activation-5.19e-01

Pos logit contributions
ctic+2.857
20439+2.738
+2.692
asia+2.658
FR+2.649
Neg logit contributions
emouth-3.739
 signific-3.531
 independ-3.419
apist-3.273
 surpr-3.116
Token8
Token ID23
Feature activation-2.22e-01

Pos logit contributions
ctic+0.851
20439+0.817
+0.799
asia+0.791
iron+0.778
Neg logit contributions
emouth-1.161
 signific-1.097
 independ-1.057
apist-1.023
itable-0.964
TokenDeb
Token ID16587
Feature activation-1.65e-02

Pos logit contributions
ctic+0.698
20439+0.662
+0.643
asia+0.637
iron+0.623
Neg logit contributions
emouth-1.413
 signific-1.343
 independ-1.296
apist-1.271
itable-1.216
Tokenate
Token ID378
Feature activation+2.26e-01

Pos logit contributions
ctic+0.026
20439+0.025
asia+0.024
+0.024
iron+0.023
Neg logit contributions
emouth-0.038
 signific-0.036
 independ-0.035
apist-0.034
 surpr-0.032
Token2016
Token ID5304
Feature activation+7.53e-02

Pos logit contributions
emouth+0.503
 signific+0.475
 independ+0.456
apist+0.445
itable+0.421
Neg logit contributions
ctic-0.371
20439-0.358
-0.348
asia-0.346
iron-0.339
Token pic
Token ID8301
Feature activation-1.20e+00

Pos logit contributions
emouth+0.155
 signific+0.145
 independ+0.139
apist+0.136
itable+0.128
Neg logit contributions
ctic-0.136
20439-0.132
-0.129
asia-0.128
iron-0.125
Token.
Token ID13
Feature activation-7.22e-01

Pos logit contributions
ctic+1.916
+1.824
asia+1.796
20439+1.791
[/+1.783
Neg logit contributions
emouth-2.794
 signific-2.598
 independ-2.516
apist-2.442
 distingu-2.407
Tokentwitter
Token ID6956
Feature activation-2.96e+00

Pos logit contributions
ctic+0.793
20439+0.742
+0.726
asia+0.723
iron+0.693
Neg logit contributions
emouth-2.024
 signific-1.931
 independ-1.877
apist-1.818
 surpr-1.769
Token.
Token ID13
Feature activation-1.24e+00

Pos logit contributions
ctic+4.015
[/+3.831
+3.795
asia+3.738
20439+3.597
Neg logit contributions
emouth-7.420
 distingu-6.991
 signific-6.988
 surpr-6.856
 independ-6.831
Tokencom
Token ID785
Feature activation-1.89e+00

Pos logit contributions
ctic+2.138
20439+2.075
asia+2.046
+2.034
iron+1.997
Neg logit contributions
emouth-2.678
 signific-2.531
 independ-2.438
apist-2.322
 surpr-2.295
Token/
Token ID14
Feature activation-1.63e+00

Pos logit contributions
ctic+2.914
+2.793
[/+2.738
20439+2.711
asia+2.646
Neg logit contributions
emouth-4.453
 signific-4.177
 independ-4.040
 distingu-4.028
 surpr-3.960
TokenF
Token ID37
Feature activation-5.82e-01

Pos logit contributions
ctic+2.712
20439+2.600
+2.558
asia+2.526
FR+2.509
Neg logit contributions
emouth-3.553
 signific-3.352
 independ-3.242
apist-3.110
 surpr-2.952
Tokenc
Token ID66
Feature activation+3.31e-02

Pos logit contributions
ctic+0.962
20439+0.926
+0.905
asia+0.896
iron+0.879
Neg logit contributions
emouth-1.290
 signific-1.217
 independ-1.171
apist-1.136
itable-1.070
TokenDeb
Token ID16587
Feature activation-2.28e-01

Pos logit contributions
ctic+0.314
20439+0.300
+0.290
asia+0.289
iron+0.282
Neg logit contributions
emouth-0.594
 signific-0.565
 independ-0.544
apist-0.532
itable-0.509
Tokenates
Token ID689
Feature activation+6.22e-02

Pos logit contributions
ctic+0.330
20439+0.313
asia+0.305
+0.303
iron+0.297
Neg logit contributions
emouth-0.554
 signific-0.529
 independ-0.506
apist-0.493
 surpr-0.473
Token2016
Token ID5304
Feature activation-1.72e-01

Pos logit contributions
emouth+0.130
 signific+0.122
 independ+0.117
apist+0.114
itable+0.107
Neg logit contributions
ctic-0.110
20439-0.106
-0.104
asia-0.103
iron-0.102
Token pic
Token ID8301
Feature activation-1.12e+00

Pos logit contributions
ctic+0.313
20439+0.302
+0.294
asia+0.294
iron+0.288
Neg logit contributions
emouth-0.351
 signific-0.329
 independ-0.315
apist-0.308
itable-0.290
Token.
Token ID13
Feature activation-6.81e-01

Pos logit contributions
ctic+1.784
+1.687
asia+1.678
20439+1.671
[/+1.653
Neg logit contributions
emouth-2.586
 signific-2.411
 independ-2.341
apist-2.265
 distingu-2.240
Tokentwitter
Token ID6956
Feature activation-2.94e+00

Pos logit contributions
ctic+0.750
20439+0.702
asia+0.681
+0.681
iron+0.654
Neg logit contributions
emouth-1.905
 signific-1.819
 independ-1.770
apist-1.713
 surpr-1.666
Token.
Token ID13
Feature activation-1.32e+00

Pos logit contributions
ctic+3.972
[/+3.785
+3.738
asia+3.710
vel+3.572
Neg logit contributions
emouth-7.423
 distingu-7.011
 signific-6.976
 surpr-6.872
 independ-6.834
Tokencom
Token ID785
Feature activation-1.95e+00

Pos logit contributions
ctic+2.262
20439+2.203
asia+2.171
+2.151
iron+2.115
Neg logit contributions
emouth-2.875
 signific-2.723
 independ-2.624
apist-2.494
 surpr-2.482
Token/
Token ID14
Feature activation-1.65e+00

Pos logit contributions
ctic+2.973
+2.848
[/+2.796
20439+2.772
asia+2.709
Neg logit contributions
emouth-4.628
 signific-4.323
 distingu-4.191
 independ-4.187
 surpr-4.114
Tokenj
Token ID73
Feature activation-7.68e-01

Pos logit contributions
ctic+2.746
20439+2.631
+2.590
asia+2.559
FR+2.542
Neg logit contributions
emouth-3.606
 signific-3.400
 independ-3.290
apist-3.154
 surpr-2.999
TokenAx
Token ID31554
Feature activation+5.16e-01

Pos logit contributions
ctic+1.229
20439+1.182
+1.156
asia+1.142
iron+1.125
Neg logit contributions
emouth-1.742
 signific-1.645
 independ-1.588
apist-1.536
imental-1.453
Token GO
Token ID10351
Feature activation-9.20e-01

Pos logit contributions
ctic+0.814
20439+0.788
+0.766
asia+0.756
iron+0.739
Neg logit contributions
emouth-1.127
 signific-1.063
 independ-1.025
apist-0.999
itable-0.943
TokenAL
Token ID1847
Feature activation-2.20e-01

Pos logit contributions
ctic+1.419
20439+1.374
+1.347
asia+1.306
iron+1.281
Neg logit contributions
emouth-2.126
 signific-2.025
 independ-1.951
apist-1.905
imental-1.790
Token!
Token ID0
Feature activation-5.75e-01

Pos logit contributions
ctic+0.331
20439+0.320
+0.309
asia+0.306
iron+0.298
Neg logit contributions
emouth-0.520
 signific-0.492
 independ-0.474
apist-0.464
itable-0.440
Token pic
Token ID8301
Feature activation-1.16e+00

Pos logit contributions
ctic+0.948
20439+0.918
+0.893
asia+0.883
iron+0.862
Neg logit contributions
emouth-1.274
 signific-1.202
 independ-1.157
apist-1.128
itable-1.065
Token.
Token ID13
Feature activation-8.25e-01

Pos logit contributions
ctic+1.867
+1.774
20439+1.768
[/+1.735
asia+1.734
Neg logit contributions
emouth-2.672
 signific-2.483
 independ-2.396
apist-2.331
 distingu-2.273
Tokentwitter
Token ID6956
Feature activation-2.90e+00

Pos logit contributions
ctic+0.919
20439+0.857
+0.826
asia+0.823
iron+0.790
Neg logit contributions
emouth-2.305
 signific-2.194
 independ-2.137
apist-2.077
 surpr-2.005
Token.
Token ID13
Feature activation-1.28e+00

Pos logit contributions
ctic+4.006
[/+3.821
+3.782
asia+3.662
20439+3.643
Neg logit contributions
emouth-7.248
 signific-6.864
 distingu-6.795
 independ-6.718
 surpr-6.661
Tokencom
Token ID785
Feature activation-2.08e+00

Pos logit contributions
ctic+2.169
20439+2.117
asia+2.056
+2.051
iron+1.992
Neg logit contributions
emouth-2.828
 signific-2.677
 independ-2.595
apist-2.443
 surpr-2.440
Token/
Token ID14
Feature activation-1.89e+00

Pos logit contributions
ctic+3.182
+3.025
[/+2.979
20439+2.973
FR+2.860
Neg logit contributions
emouth-4.933
 signific-4.681
 independ-4.537
 distingu-4.533
 surpr-4.437
TokenE
Token ID36
Feature activation-4.41e-01

Pos logit contributions
ctic+3.139
20439+3.005
+2.957
asia+2.919
FR+2.917
Neg logit contributions
emouth-4.100
 signific-3.872
 independ-3.752
apist-3.587
 surpr-3.415
Tokenq
Token ID80
Feature activation+4.15e-01

Pos logit contributions
ctic+0.724
20439+0.697
+0.680
asia+0.674
iron+0.662
Neg logit contributions
emouth-0.986
 signific-0.929
 independ-0.895
apist-0.869
itable-0.819
Token.
Token ID13
Feature activation-1.29e+00

Pos logit contributions
ctic+1.850
20439+1.749
+1.736
asia+1.725
[/+1.701
Neg logit contributions
emouth-2.527
 signific-2.388
 independ-2.303
apist-2.211
 surpr-2.172
Tokenyoutube
Token ID11604
Feature activation-2.44e+00

Pos logit contributions
ctic+1.603
20439+1.467
asia+1.453
+1.419
iron+1.406
Neg logit contributions
emouth-3.420
 signific-3.270
 independ-3.169
apist-3.058
 surpr-2.950
Token.
Token ID13
Feature activation-1.25e+00

Pos logit contributions
ctic+3.587
[/+3.370
+3.365
asia+3.324
iron+3.219
Neg logit contributions
emouth-5.861
 signific-5.582
 independ-5.433
 distingu-5.365
 surpr-5.338
Tokencom
Token ID785
Feature activation-1.23e+00

Pos logit contributions
ctic+2.180
asia+2.073
20439+2.061
+2.037
iron+2.001
Neg logit contributions
emouth-2.698
 signific-2.576
 independ-2.466
apist-2.336
 surpr-2.334
Token/
Token ID14
Feature activation-1.67e+00

Pos logit contributions
ctic+2.080
20439+1.962
+1.953
[/+1.941
asia+1.925
Neg logit contributions
emouth-2.676
 signific-2.524
 independ-2.417
apist-2.351
 surpr-2.253
Tokenwatch
Token ID8340
Feature activation-2.89e+00

Pos logit contributions
ctic+2.731
20439+2.606
asia+2.567
+2.553
iron+2.510
Neg logit contributions
emouth-3.696
 signific-3.504
 independ-3.369
apist-3.255
itable-3.070
Token?
Token ID30
Feature activation-1.90e+00

Pos logit contributions
ctic+2.585
[/+2.516
+2.447
20439+2.310
asia+2.304
Neg logit contributions
emouth-8.487
 signific-8.141
 independ-7.977
 distingu-7.908
 surpr-7.813
Tokenv
Token ID85
Feature activation-1.76e-01

Pos logit contributions
ctic+2.125
20439+2.000
asia+1.935
+1.933
iron+1.899
Neg logit contributions
emouth-5.199
 signific-4.968
 independ-4.806
apist-4.690
 surpr-4.481
Token=
Token ID28
Feature activation-1.73e+00

Pos logit contributions
ctic+0.232
20439+0.223
+0.214
asia+0.212
iron+0.207
Neg logit contributions
emouth-0.446
 signific-0.424
 independ-0.410
apist-0.401
itable-0.382
Token2
Token ID17
Feature activation-7.96e-01

Pos logit contributions
ctic+2.870
20439+2.754
+2.698
asia+2.675
FR+2.655
Neg logit contributions
emouth-3.772
 signific-3.570
 independ-3.448
apist-3.304
 surpr-3.144
Tokeng
Token ID70
Feature activation+1.99e-01

Pos logit contributions
ctic+1.368
20439+1.315
+1.289
asia+1.276
iron+1.257
Neg logit contributions
emouth-1.711
 signific-1.616
 independ-1.553
apist-1.501
itable-1.410
Token media
Token ID2056
Feature activation+3.98e-01

Pos logit contributions
emouth+0.658
 signific+0.615
 independ+0.590
apist+0.577
itable+0.541
Neg logit contributions
ctic-0.581
20439-0.562
-0.547
asia-0.545
iron-0.533
Token bus
Token ID1323
Feature activation+9.24e-01

Pos logit contributions
emouth+0.813
 signific+0.760
 independ+0.728
apist+0.714
itable+0.667
Neg logit contributions
ctic-0.729
20439-0.700
-0.684
asia-0.682
iron-0.668
Token)
Token ID8
Feature activation+3.06e-01

Pos logit contributions
emouth+1.806
 signific+1.689
 independ+1.618
apist+1.571
itable+1.472
Neg logit contributions
ctic-1.754
20439-1.721
-1.663
asia-1.655
iron-1.619
Token pic
Token ID8301
Feature activation-1.06e+00

Pos logit contributions
emouth+0.623
 signific+0.584
 independ+0.560
apist+0.546
itable+0.513
Neg logit contributions
ctic-0.561
20439-0.543
-0.529
asia-0.525
iron-0.516
Token.
Token ID13
Feature activation-3.71e-01

Pos logit contributions
ctic+1.723
+1.630
20439+1.623
asia+1.614
[/+1.584
Neg logit contributions
emouth-2.427
 signific-2.260
 independ-2.186
apist-2.124
 distingu-2.077
Tokentwitter
Token ID6956
Feature activation-2.88e+00

Pos logit contributions
ctic+0.431
20439+0.408
+0.391
asia+0.390
iron+0.375
Neg logit contributions
emouth-1.010
 signific-0.961
 independ-0.933
apist-0.911
itable-0.871
Token.
Token ID13
Feature activation-1.14e+00

Pos logit contributions
ctic+3.916
[/+3.719
+3.690
asia+3.636
20439+3.528
Neg logit contributions
emouth-7.283
 distingu-6.922
 signific-6.870
 surpr-6.763
 independ-6.736
Tokencom
Token ID785
Feature activation-1.83e+00

Pos logit contributions
ctic+1.968
20439+1.915
asia+1.876
+1.863
iron+1.825
Neg logit contributions
emouth-2.481
 signific-2.346
 independ-2.275
apist-2.148
 surpr-2.145
Token/
Token ID14
Feature activation-1.48e+00

Pos logit contributions
ctic+2.816
+2.671
[/+2.629
20439+2.611
asia+2.559
Neg logit contributions
emouth-4.328
 signific-4.058
 distingu-3.952
 independ-3.944
 surpr-3.878
Tokenz
Token ID89
Feature activation+6.07e-02

Pos logit contributions
ctic+2.467
20439+2.368
+2.324
asia+2.297
FR+2.276
Neg logit contributions
emouth-3.251
 signific-3.067
 independ-2.969
apist-2.846
 surpr-2.703
TokenJ
Token ID41
Feature activation-5.75e-01

Pos logit contributions
emouth+0.137
 signific+0.129
 independ+0.125
apist+0.121
itable+0.115
Neg logit contributions
ctic-0.098
20439-0.094
-0.091
asia-0.091
iron-0.089
Token:
Token ID25
Feature activation+1.56e-01

Pos logit contributions
ctic+1.588
20439+1.534
+1.505
asia+1.486
[/+1.459
Neg logit contributions
emouth-1.954
 signific-1.833
 independ-1.755
apist-1.720
itable-1.620
Token The
Token ID383
Feature activation-1.82e-01

Pos logit contributions
emouth+0.526
 signific+0.506
 independ+0.494
apist+0.486
itable+0.470
Neg logit contributions
ctic-0.077
20439-0.069
-0.060
asia-0.059
iron-0.054
Token Saga
Token ID19743
Feature activation-1.07e+00

Pos logit contributions
ctic+0.305
20439+0.295
+0.287
asia+0.285
iron+0.278
Neg logit contributions
emouth-0.397
 signific-0.372
 independ-0.359
apist-0.350
itable-0.330
Token[/
Token ID13412
Feature activation-1.65e+00

Pos logit contributions
ctic+1.726
20439+1.664
+1.635
asia+1.611
[/+1.587
Neg logit contributions
emouth-2.420
 signific-2.276
 independ-2.187
apist-2.131
itable-2.020
Tokenb
Token ID65
Feature activation+3.28e-01

Pos logit contributions
ctic+2.051
20439+1.915
asia+1.906
+1.888
iron+1.856
Neg logit contributions
emouth-4.326
 signific-4.161
 independ-4.030
apist-3.878
 surpr-3.843
Token][/
Token ID44926
Feature activation-2.86e+00

Pos logit contributions
emouth+0.797
 signific+0.757
 independ+0.729
apist+0.716
itable+0.681
Neg logit contributions
ctic-0.465
20439-0.451
-0.433
asia-0.427
iron-0.417
Tokencolor
Token ID8043
Feature activation-6.40e-01

Pos logit contributions
ctic+3.770
asia+3.554
vel+3.531
iron+3.454
+3.428
Neg logit contributions
emouth-7.162
 signific-6.963
 independ-6.734
 surpr-6.522
 distingu-6.486
Token]
Token ID60
Feature activation-6.18e-01

Pos logit contributions
ctic+0.938
20439+0.905
+0.872
asia+0.866
iron+0.844
Neg logit contributions
emouth-1.533
 signific-1.451
 independ-1.402
apist-1.370
itable-1.302
Token -
Token ID532
Feature activation-6.76e-01

Pos logit contributions
ctic+1.102
20439+1.063
+1.044
asia+1.035
iron+1.019
Neg logit contributions
emouth-1.281
 signific-1.204
 independ-1.153
apist-1.125
itable-1.059
Token 100
Token ID1802
Feature activation-8.58e-01

Pos logit contributions
ctic+1.160
20439+1.116
+1.096
asia+1.082
iron+1.061
Neg logit contributions
emouth-1.442
 signific-1.365
 independ-1.299
apist-1.275
itable-1.206
Token%
Token ID4
Feature activation-5.87e-01

Pos logit contributions
ctic+0.975
20439+0.915
+0.883
asia+0.869
iron+0.842
Neg logit contributions
emouth-2.345
 signific-2.234
 independ-2.164
apist-2.123
itable-2.035
Tokened
Token ID276
Feature activation+1.87e-01

Pos logit contributions
ctic+1.612
20439+1.529
+1.521
asia+1.493
iron+1.469
Neg logit contributions
emouth-2.063
 signific-1.953
 independ-1.871
apist-1.820
imental-1.725
TokenOut
Token ID7975
Feature activation+1.77e-01

Pos logit contributions
emouth+0.413
 signific+0.389
 independ+0.375
apist+0.365
itable+0.345
Neg logit contributions
ctic-0.309
20439-0.299
-0.290
asia-0.288
iron-0.281
TokenAtlanta
Token ID43482
Feature activation+1.15e+00

Pos logit contributions
emouth+0.372
 signific+0.350
 independ+0.336
apist+0.327
itable+0.308
Neg logit contributions
ctic-0.310
20439-0.301
-0.292
asia-0.290
iron-0.284
Token pic
Token ID8301
Feature activation-1.21e+00

Pos logit contributions
emouth+2.316
 signific+2.175
 independ+2.086
apist+2.030
itable+1.915
Neg logit contributions
ctic-2.086
20439-2.066
-1.976
asia-1.970
iron-1.932
Token.
Token ID13
Feature activation-6.37e-01

Pos logit contributions
ctic+1.924
+1.828
20439+1.802
asia+1.789
[/+1.782
Neg logit contributions
emouth-2.795
 signific-2.593
 independ-2.518
apist-2.440
 distingu-2.396
Tokentwitter
Token ID6956
Feature activation-2.86e+00

Pos logit contributions
ctic+0.701
20439+0.660
+0.638
asia+0.628
iron+0.602
Neg logit contributions
emouth-1.783
 signific-1.695
 independ-1.651
apist-1.603
 surpr-1.551
Token.
Token ID13
Feature activation-1.13e+00

Pos logit contributions
ctic+3.975
[/+3.811
+3.768
20439+3.642
asia+3.626
Neg logit contributions
emouth-7.101
 signific-6.728
 distingu-6.643
 independ-6.579
 surpr-6.512
Tokencom
Token ID785
Feature activation-1.85e+00

Pos logit contributions
ctic+1.943
20439+1.903
+1.847
asia+1.832
FR+1.782
Neg logit contributions
emouth-2.463
 signific-2.330
 independ-2.255
apist-2.139
 surpr-2.106
Token/
Token ID14
Feature activation-1.63e+00

Pos logit contributions
ctic+2.866
+2.738
20439+2.717
[/+2.687
FR+2.576
Neg logit contributions
emouth-4.339
 signific-4.115
 independ-3.986
 distingu-3.946
 surpr-3.871
Tokenxx
Token ID5324
Feature activation+1.84e-02

Pos logit contributions
ctic+2.702
20439+2.590
+2.547
asia+2.514
FR+2.499
Neg logit contributions
emouth-3.553
 signific-3.357
 independ-3.249
apist-3.115
 surpr-2.955
TokenA
Token ID32
Feature activation+1.53e-01

Pos logit contributions
emouth+0.041
 signific+0.039
 independ+0.037
apist+0.036
itable+0.034
Neg logit contributions
ctic-0.030
20439-0.029
-0.029
asia-0.028
iron-0.028
TokenRoman
Token ID32454
Feature activation-1.38e+00

Pos logit contributions
emouth+0.997
 signific+0.940
 independ+0.904
apist+0.881
itable+0.837
Neg logit contributions
ctic-0.709
20439-0.686
-0.664
asia-0.659
iron-0.644
TokenF
Token ID37
Feature activation-3.79e-01

Pos logit contributions
ctic+2.408
20439+2.307
asia+2.283
+2.278
iron+2.232
Neg logit contributions
emouth-2.894
 signific-2.720
 independ-2.597
apist-2.506
imental-2.370
Tokenlynn
Token ID12935
Feature activation+3.40e-01

Pos logit contributions
ctic+0.640
20439+0.613
+0.599
asia+0.597
iron+0.588
Neg logit contributions
emouth-0.827
 signific-0.780
 independ-0.745
apist-0.725
imental-0.686
Token pic
Token ID8301
Feature activation-1.08e+00

Pos logit contributions
emouth+0.720
 signific+0.676
 independ+0.650
apist+0.633
itable+0.597
Neg logit contributions
ctic-0.593
20439-0.576
-0.558
asia-0.554
iron-0.543
Token.
Token ID13
Feature activation-1.11e+00

Pos logit contributions
ctic+1.753
+1.654
20439+1.647
asia+1.635
[/+1.611
Neg logit contributions
emouth-2.470
 signific-2.297
 independ-2.225
apist-2.152
 distingu-2.106
Tokentwitter
Token ID6956
Feature activation-2.81e+00

Pos logit contributions
ctic+1.178
20439+1.091
+1.058
asia+1.056
iron+1.007
Neg logit contributions
emouth-3.164
 signific-3.018
 independ-2.944
apist-2.847
 surpr-2.790
Token.
Token ID13
Feature activation-1.46e+00

Pos logit contributions
ctic+3.874
[/+3.683
+3.638
asia+3.568
20439+3.475
Neg logit contributions
emouth-7.071
 distingu-6.666
 signific-6.666
 independ-6.514
 surpr-6.506
Tokencom
Token ID785
Feature activation-2.00e+00

Pos logit contributions
ctic+2.451
20439+2.372
asia+2.341
+2.319
iron+2.265
Neg logit contributions
emouth-3.248
 signific-3.080
 independ-2.995
 surpr-2.851
 distingu-2.825
Token/
Token ID14
Feature activation-1.95e+00

Pos logit contributions
ctic+3.032
+2.882
[/+2.851
20439+2.802
asia+2.750
Neg logit contributions
emouth-4.783
 signific-4.498
 distingu-4.401
 independ-4.365
 surpr-4.310
Tokengy
Token ID1360
Feature activation-1.31e+00

Pos logit contributions
ctic+3.238
20439+3.095
+3.051
FR+3.015
asia+3.014
Neg logit contributions
emouth-4.239
 signific-4.004
 independ-3.882
apist-3.703
 surpr-3.541
Tokend
Token ID67
Feature activation+3.59e-02

Pos logit contributions
ctic+2.118
20439+2.031
+1.988
asia+1.966
iron+1.946
Neg logit contributions
emouth-2.938
 signific-2.777
 independ-2.687
apist-2.579
 surpr-2.452
Token Commando
Token ID47150
Feature activation-1.32e+00

Pos logit contributions
ctic+0.179
20439+0.173
+0.169
asia+0.168
iron+0.164
Neg logit contributions
emouth-0.204
 signific-0.192
 independ-0.183
apist-0.179
itable-0.169
Token Rear
Token ID30144
Feature activation-6.42e-01

Pos logit contributions
ctic+1.356
[/+1.266
20439+1.261
+1.254
asia+1.221
Neg logit contributions
emouth-3.690
 signific-3.530
 independ-3.410
apist-3.340
itable-3.229
Tokenmed
Token ID1150
Feature activation+1.50e-01

Pos logit contributions
ctic+1.289
+1.211
20439+1.198
asia+1.195
iron+1.186
Neg logit contributions
emouth-1.219
 signific-1.132
 independ-1.065
apist-1.040
itable-0.972
Token[/
Token ID13412
Feature activation-1.76e+00

Pos logit contributions
emouth+0.484
 signific+0.465
 independ+0.453
apist+0.445
itable+0.429
Neg logit contributions
ctic-0.095
20439-0.087
-0.080
asia-0.078
iron-0.073
Tokenb
Token ID65
Feature activation+3.75e-01

Pos logit contributions
ctic+2.353
20439+2.226
+2.183
asia+2.172
[/+2.130
Neg logit contributions
emouth-4.411
 signific-4.224
 independ-4.089
apist-3.952
 Somers-3.811
Token][/
Token ID44926
Feature activation-2.80e+00

Pos logit contributions
emouth+0.905
 signific+0.857
 independ+0.827
apist+0.812
itable+0.773
Neg logit contributions
ctic-0.539
20439-0.523
-0.502
asia-0.496
iron-0.483
Tokencolor
Token ID8043
Feature activation-5.79e-01

Pos logit contributions
ctic+3.983
20439+3.706
asia+3.689
+3.669
iron+3.660
Neg logit contributions
emouth-6.657
 signific-6.364
 independ-6.150
apist-5.889
 Somers-5.759
Token]
Token ID60
Feature activation-7.30e-01

Pos logit contributions
ctic+0.823
20439+0.790
+0.763
asia+0.757
iron+0.740
Neg logit contributions
emouth-1.414
 signific-1.338
 independ-1.298
apist-1.265
itable-1.200
Token -
Token ID532
Feature activation-7.16e-01

Pos logit contributions
ctic+1.235
20439+1.191
+1.175
asia+1.156
iron+1.140
Neg logit contributions
emouth-1.574
 signific-1.487
 independ-1.420
apist-1.384
itable-1.310
Token Normal
Token ID14435
Feature activation-3.09e-01

Pos logit contributions
ctic+1.255
20439+1.215
+1.188
asia+1.176
iron+1.161
Neg logit contributions
emouth-1.503
 signific-1.412
 independ-1.349
apist-1.319
itable-1.244
Token,
Token ID11
Feature activation-2.26e-01

Pos logit contributions
ctic+0.554
20439+0.535
+0.524
asia+0.519
iron+0.510
Neg logit contributions
emouth-0.643
 signific-0.602
 independ-0.577
apist-0.563
itable-0.530
Token]
Token ID60
Feature activation-3.38e-01

Pos logit contributions
emouth+0.930
 signific+0.881
 independ+0.852
apist+0.830
itable+0.789
Neg logit contributions
ctic-0.553
20439-0.532
-0.516
asia-0.512
iron-0.498
TokenMega
Token ID43471
Feature activation+1.52e-01

Pos logit contributions
ctic+0.543
20439+0.527
+0.511
asia+0.506
iron+0.496
Neg logit contributions
emouth-0.765
 signific-0.723
 independ-0.695
apist-0.677
itable-0.642
Token Pony
Token ID30377
Feature activation-1.48e-01

Pos logit contributions
emouth+0.336
 signific+0.317
 independ+0.305
apist+0.298
itable+0.282
Neg logit contributions
ctic-0.253
20439-0.244
-0.237
asia-0.236
iron-0.231
Token[/
Token ID13412
Feature activation-1.66e+00

Pos logit contributions
ctic+0.175
20439+0.167
+0.160
asia+0.159
iron+0.153
Neg logit contributions
emouth-0.396
 signific-0.378
 independ-0.365
apist-0.358
itable-0.343
Tokenb
Token ID65
Feature activation+5.64e-01

Pos logit contributions
ctic+2.068
20439+1.918
asia+1.912
+1.902
iron+1.867
Neg logit contributions
emouth-4.359
 signific-4.204
 independ-4.072
apist-3.913
 surpr-3.878
Token][/
Token ID44926
Feature activation-2.79e+00

Pos logit contributions
emouth+1.433
 signific+1.359
 independ+1.316
apist+1.286
itable+1.226
Neg logit contributions
ctic-0.751
20439-0.721
-0.693
asia-0.688
iron-0.667
Tokencolor
Token ID8043
Feature activation-5.07e-01

Pos logit contributions
ctic+3.716
asia+3.495
vel+3.486
iron+3.390
+3.376
Neg logit contributions
emouth-6.957
 signific-6.742
 independ-6.538
 surpr-6.318
 distingu-6.288
Token]
Token ID60
Feature activation-6.03e-01

Pos logit contributions
ctic+0.691
20439+0.652
+0.643
asia+0.636
[/+0.627
Neg logit contributions
emouth-1.281
 signific-1.209
 independ-1.178
apist-1.144
 surpr-1.101
Token -
Token ID532
Feature activation-5.85e-01

Pos logit contributions
ctic+1.109
20439+1.076
+1.049
asia+1.041
iron+1.022
Neg logit contributions
emouth-1.218
 signific-1.143
 independ-1.095
apist-1.066
itable-1.002
Token 20
Token ID1160
Feature activation-2.96e-01

Pos logit contributions
ctic+0.989
20439+0.949
+0.930
asia+0.918
iron+0.901
Neg logit contributions
emouth-1.271
 signific-1.197
 independ-1.145
apist-1.123
itable-1.063
Token%
Token ID4
Feature activation-6.86e-01

Pos logit contributions
ctic+0.361
20439+0.342
+0.329
asia+0.325
iron+0.316
Neg logit contributions
emouth-0.783
 signific-0.744
 independ-0.721
apist-0.707
itable-0.676
Token my
Token ID616
Feature activation-2.69e-01

Pos logit contributions
ctic+1.046
20439+1.008
+0.992
asia+0.989
iron+0.963
Neg logit contributions
emouth-1.136
 signific-1.069
 independ-1.020
apist-1.001
imental-0.935
Token kitchen
Token ID9592
Feature activation+4.59e-01

Pos logit contributions
ctic+0.498
20439+0.482
+0.472
asia+0.470
iron+0.458
Neg logit contributions
emouth-0.542
 signific-0.507
 independ-0.484
apist-0.474
itable-0.443
Token essentials
Token ID41954
Feature activation+1.92e-02

Pos logit contributions
emouth+0.904
 signific+0.845
 independ+0.808
apist+0.786
itable+0.737
Neg logit contributions
ctic-0.869
20439-0.845
-0.823
asia-0.817
iron-0.802
Token!
Token ID0
Feature activation-1.80e-01

Pos logit contributions
emouth+0.038
 signific+0.036
 independ+0.034
apist+0.033
itable+0.031
Neg logit contributions
ctic-0.036
20439-0.035
-0.034
asia-0.034
iron-0.033
Token<|endoftext|>
Token ID50256
Feature activation-5.05e-01

Pos logit contributions
ctic+0.304
20439+0.295
+0.288
asia+0.285
iron+0.278
Neg logit contributions
emouth-0.390
 signific-0.367
 independ-0.353
apist-0.344
itable-0.324
TokenPosted
Token ID14231
Feature activation-2.77e+00

Pos logit contributions
ctic+0.806
20439+0.783
+0.763
asia+0.752
iron+0.739
Neg logit contributions
emouth-1.150
 signific-1.084
 independ-1.044
apist-1.015
itable-0.961
Token on
Token ID319
Feature activation-3.26e-01

Pos logit contributions
ctic+4.330
+4.106
20439+4.102
asia+4.061
[/+4.060
Neg logit contributions
emouth-6.059
 signific-5.699
 independ-5.434
apist-5.421
aminer-5.039
Token by
Token ID416
Feature activation-5.31e-01

Pos logit contributions
ctic+0.576
20439+0.556
+0.546
asia+0.541
iron+0.530
Neg logit contributions
emouth-0.682
 signific-0.638
 independ-0.608
apist-0.598
imental-0.563
Token |
Token ID930
Feature activation-5.91e-01

Pos logit contributions
ctic+0.952
20439+0.919
+0.896
asia+0.892
iron+0.875
Neg logit contributions
emouth-1.101
 signific-1.026
 independ-0.986
apist-0.963
itable-0.908
Token in
Token ID287
Feature activation-6.55e-01

Pos logit contributions
ctic+0.968
20439+0.940
+0.919
asia+0.909
iron+0.885
Neg logit contributions
emouth-1.318
 signific-1.235
 independ-1.190
apist-1.159
imental-1.095
Token
Token ID198
Feature activation-9.18e-01

Pos logit contributions
ctic+1.181
20439+1.146
+1.115
asia+1.113
iron+1.089
Neg logit contributions
emouth-1.346
 signific-1.258
 independ-1.204
apist-1.179
imental-1.104
Tokengy
Token ID1360
Feature activation-5.10e-01

Pos logit contributions
ctic+1.644
20439+1.570
asia+1.555
+1.543
iron+1.541
Neg logit contributions
emouth-1.147
 signific-1.056
 independ-1.023
apist-0.951
 surpr-0.881
Token for
Token ID329
Feature activation+4.09e-02

Pos logit contributions
ctic+0.945
20439+0.910
+0.886
asia+0.883
iron+0.868
Neg logit contributions
emouth-1.028
 signific-0.961
 independ-0.925
apist-0.897
itable-0.843
Token herself
Token ID5223
Feature activation+3.43e-01

Pos logit contributions
emouth+0.082
 signific+0.076
 independ+0.073
apist+0.071
itable+0.067
Neg logit contributions
ctic-0.076
20439-0.074
-0.072
asia-0.071
iron-0.070
Token.
Token ID13
Feature activation+1.60e-01

Pos logit contributions
emouth+0.682
 signific+0.638
 independ+0.611
apist+0.595
itable+0.558
Neg logit contributions
ctic-0.643
20439-0.626
-0.608
asia-0.604
iron-0.593
Token<|endoftext|>
Token ID50256
Feature activation-6.46e-01

Pos logit contributions
emouth+0.351
 signific+0.330
 independ+0.318
apist+0.310
itable+0.293
Neg logit contributions
ctic-0.267
20439-0.259
-0.251
asia-0.249
iron-0.243
TokenPosted
Token ID14231
Feature activation-2.77e+00

Pos logit contributions
ctic+1.029
20439+1.000
+0.976
asia+0.961
iron+0.945
Neg logit contributions
emouth-1.471
 signific-1.387
 independ-1.335
apist-1.296
itable-1.227
Token on
Token ID319
Feature activation-2.72e-01

Pos logit contributions
ctic+4.329
+4.112
20439+4.109
asia+4.056
[/+4.056
Neg logit contributions
emouth-6.049
 signific-5.688
 independ-5.414
apist-5.410
imental-5.036
Token //
Token ID3373
Feature activation-2.26e-01

Pos logit contributions
ctic+0.481
20439+0.464
+0.456
asia+0.452
iron+0.443
Neg logit contributions
emouth-0.568
 signific-0.532
 independ-0.507
apist-0.498
imental-0.469
Token
Token ID198
Feature activation-8.09e-01

Pos logit contributions
ctic+0.394
20439+0.382
+0.372
asia+0.370
iron+0.362
Neg logit contributions
emouth-0.481
 signific-0.449
 independ-0.431
apist-0.420
itable-0.397
Token
Token ID198
Feature activation-9.65e-01

Pos logit contributions
ctic+1.248
20439+1.199
+1.192
asia+1.165
[/+1.146
Neg logit contributions
emouth-1.904
 signific-1.800
 independ-1.737
apist-1.674
 surpr-1.630
TokenAccording
Token ID4821
Feature activation-1.06e+00

Pos logit contributions
ctic+1.533
20439+1.489
+1.459
asia+1.434
iron+1.410
Neg logit contributions
emouth-2.206
 signific-2.082
 independ-2.000
apist-1.939
itable-1.833
Token the
Token ID262
Feature activation+1.09e-01

Pos logit contributions
emouth+0.084
 signific+0.078
 independ+0.075
apist+0.073
itable+0.068
Neg logit contributions
ctic-0.081
20439-0.078
-0.076
asia-0.076
iron-0.075
Token entrance
Token ID10384
Feature activation+9.30e-01

Pos logit contributions
emouth+0.217
 signific+0.202
 independ+0.193
apist+0.189
itable+0.177
Neg logit contributions
ctic-0.207
20439-0.200
-0.195
asia-0.194
iron-0.190
Token to
Token ID284
Feature activation-9.57e-03

Pos logit contributions
emouth+1.879
 signific+1.771
 independ+1.676
apist+1.633
imental+1.549
Neg logit contributions
20439-1.693
ctic-1.688
-1.627
asia-1.588
iron-1.555
Token the
Token ID262
Feature activation-7.48e-02

Pos logit contributions
ctic+0.019
20439+0.018
+0.018
asia+0.017
iron+0.017
Neg logit contributions
emouth-0.018
 signific-0.017
 independ-0.016
apist-0.016
itable-0.015
Token Bangladesh
Token ID19483
Feature activation-8.57e-02

Pos logit contributions
ctic+0.147
20439+0.141
+0.138
asia+0.138
iron+0.135
Neg logit contributions
emouth-0.142
 signific-0.132
 independ-0.126
apist-0.124
itable-0.115
Token Sund
Token ID3309
Feature activation-2.77e+00

Pos logit contributions
ctic+0.171
20439+0.166
+0.162
asia+0.161
iron+0.158
Neg logit contributions
emouth-0.160
 signific-0.149
 independ-0.142
apist-0.139
itable-0.129
Tokenarb
Token ID38039
Feature activation-3.57e-02

Pos logit contributions
ctic+2.911
asia+2.811
iron+2.531
vel+2.505
+2.441
Neg logit contributions
emouth-7.574
 signific-7.447
 independ-7.175
 distingu-6.980
 surpr-6.964
Tokenans
Token ID504
Feature activation-7.25e-01

Pos logit contributions
ctic+0.047
asia+0.044
20439+0.043
+0.042
iron+0.041
Neg logit contributions
emouth-0.091
 signific-0.088
 independ-0.085
apist-0.082
 surpr-0.080
Token,
Token ID11
Feature activation+2.21e-02

Pos logit contributions
ctic+1.321
20439+1.247
+1.239
asia+1.235
iron+1.202
Neg logit contributions
emouth-1.481
 signific-1.382
 independ-1.316
apist-1.309
imental-1.219
Token southeast
Token ID26015
Feature activation-9.77e-01

Pos logit contributions
emouth+0.044
 signific+0.041
 independ+0.039
apist+0.038
itable+0.036
Neg logit contributions
ctic-0.041
20439-0.040
-0.039
asia-0.039
iron-0.038
Token of
Token ID286
Feature activation-3.75e-01

Pos logit contributions
ctic+1.814
20439+1.750
asia+1.705
+1.703
iron+1.665
Neg logit contributions
emouth-1.959
 signific-1.833
 independ-1.756
apist-1.712
itable-1.607
Token the
Token ID262
Feature activation-6.33e-02

Pos logit contributions
ctic+0.026
20439+0.025
+0.025
asia+0.025
iron+0.024
Neg logit contributions
emouth-0.028
 signific-0.026
 independ-0.025
apist-0.024
itable-0.023
Token waters
Token ID10150
Feature activation+8.51e-01

Pos logit contributions
ctic+0.125
20439+0.119
asia+0.117
+0.117
iron+0.114
Neg logit contributions
emouth-0.121
 signific-0.112
 independ-0.107
apist-0.105
imental-0.098
Token of
Token ID286
Feature activation+3.68e-02

Pos logit contributions
emouth+1.806
 signific+1.696
 independ+1.633
apist+1.590
itable+1.498
Neg logit contributions
ctic-1.460
20439-1.439
-1.397
asia-1.378
iron-1.346
Token Bangladesh
Token ID19483
Feature activation+2.49e-01

Pos logit contributions
emouth+0.073
 signific+0.068
 independ+0.065
apist+0.064
itable+0.060
Neg logit contributions
ctic-0.069
20439-0.067
-0.065
asia-0.065
iron-0.064
Token's
Token ID338
Feature activation+1.13e-01

Pos logit contributions
emouth+0.478
 signific+0.446
 independ+0.426
apist+0.414
itable+0.388
Neg logit contributions
ctic-0.482
20439-0.470
-0.457
asia-0.454
iron-0.446
Token Sund
Token ID3309
Feature activation-2.66e+00

Pos logit contributions
emouth+0.215
 signific+0.200
 independ+0.190
apist+0.187
itable+0.174
Neg logit contributions
ctic-0.221
20439-0.213
asia-0.208
-0.208
iron-0.203
Tokenarb
Token ID38039
Feature activation+4.94e-01

Pos logit contributions
ctic+3.462
asia+3.277
iron+3.064
vel+3.052
+2.978
Neg logit contributions
emouth-6.623
 signific-6.580
 independ-6.352
apist-6.022
 surpr-5.835
Tokenans
Token ID504
Feature activation-4.97e-01

Pos logit contributions
emouth+1.263
 signific+1.206
 independ+1.177
apist+1.143
 surpr+1.076
Neg logit contributions
ctic-0.649
asia-0.608
iron-0.595
20439-0.595
-0.585
Token,
Token ID11
Feature activation+1.74e-02

Pos logit contributions
ctic+0.937
20439+0.895
asia+0.884
+0.874
iron+0.862
Neg logit contributions
emouth-0.982
 signific-0.929
 independ-0.884
apist-0.861
imental-0.805
Token the
Token ID262
Feature activation-8.99e-02

Pos logit contributions
emouth+0.035
 signific+0.032
 independ+0.031
apist+0.030
itable+0.028
Neg logit contributions
ctic-0.033
20439-0.032
-0.031
asia-0.031
iron-0.030
Token largest
Token ID4387
Feature activation+1.29e+00

Pos logit contributions
ctic+0.155
20439+0.148
asia+0.145
+0.144
iron+0.141
Neg logit contributions
emouth-0.192
 signific-0.180
 independ-0.172
apist-0.170
itable-0.160
Token
Token ID37929
Feature activation-3.87e-01

Pos logit contributions
emouth+1.566
 signific+1.474
 independ+1.415
apist+1.379
itable+1.303
Neg logit contributions
ctic-1.262
20439-1.229
-1.182
asia-1.181
iron-1.155
Token Be
Token ID1355
Feature activation+5.04e-01

Pos logit contributions
ctic+0.645
20439+0.621
+0.611
asia+0.601
iron+0.589
Neg logit contributions
emouth-0.851
 signific-0.798
 independ-0.769
apist-0.751
itable-0.709
Token safe
Token ID3338
Feature activation+1.51e+00

Pos logit contributions
emouth+0.989
 signific+0.924
 independ+0.885
apist+0.859
itable+0.808
Neg logit contributions
ctic-0.955
20439-0.931
-0.904
asia-0.899
iron-0.880
Token pic
Token ID8301
Feature activation-4.29e-01

Pos logit contributions
emouth+3.012
 signific+2.841
 independ+2.721
apist+2.628
itable+2.491
Neg logit contributions
ctic-2.772
20439-2.741
asia-2.610
-2.587
iron-2.565
Token.
Token ID13
Feature activation-2.95e-01

Pos logit contributions
ctic+0.723
20439+0.687
+0.685
asia+0.674
[/+0.660
Neg logit contributions
emouth-0.951
 signific-0.885
 independ-0.854
apist-0.833
 surpr-0.792
Tokentwitter
Token ID6956
Feature activation-2.59e+00

Pos logit contributions
ctic+0.344
20439+0.326
+0.313
asia+0.310
iron+0.300
Neg logit contributions
emouth-0.799
 signific-0.761
 independ-0.739
apist-0.723
itable-0.691
Token.
Token ID13
Feature activation-1.05e+00

Pos logit contributions
ctic+3.672
+3.481
[/+3.466
asia+3.364
20439+3.336
Neg logit contributions
emouth-6.367
 signific-6.034
 distingu-5.907
 independ-5.878
 surpr-5.806
Tokencom
Token ID785
Feature activation-1.69e+00

Pos logit contributions
ctic+1.817
20439+1.772
+1.730
asia+1.718
iron+1.670
Neg logit contributions
emouth-2.259
 signific-2.144
 independ-2.072
apist-1.963
 surpr-1.933
Token/
Token ID14
Feature activation-1.65e+00

Pos logit contributions
ctic+2.654
+2.515
20439+2.487
[/+2.470
asia+2.380
Neg logit contributions
emouth-3.918
 signific-3.702
 independ-3.581
 distingu-3.543
 surpr-3.478
Token74
Token ID4524
Feature activation-4.77e-01

Pos logit contributions
ctic+2.746
20439+2.635
+2.589
asia+2.556
FR+2.541
Neg logit contributions
emouth-3.592
 signific-3.393
 independ-3.285
apist-3.148
 surpr-2.987
TokenE
Token ID36
Feature activation-1.77e-01

Pos logit contributions
ctic+0.807
20439+0.779
+0.759
asia+0.753
iron+0.739
Neg logit contributions
emouth-1.037
 signific-0.977
 independ-0.941
apist-0.912
itable-0.859
Token.
Token ID13
Feature activation+2.42e-01

Pos logit contributions
ctic+0.095
20439+0.092
+0.089
asia+0.089
iron+0.087
Neg logit contributions
emouth-0.106
 signific-0.098
 independ-0.094
apist-0.092
itable-0.087
Token I
Token ID314
Feature activation-1.43e-01

Pos logit contributions
emouth+0.533
 signific+0.502
 independ+0.483
apist+0.471
itable+0.445
Neg logit contributions
ctic-0.401
20439-0.389
-0.377
asia-0.374
iron-0.366
Token remembered
Token ID12086
Feature activation-9.24e-02

Pos logit contributions
ctic+0.281
20439+0.270
+0.264
asia+0.263
iron+0.259
Neg logit contributions
emouth-0.273
 signific-0.252
 independ-0.241
apist-0.235
itable-0.220
Token thinking
Token ID3612
Feature activation+7.26e-01

Pos logit contributions
ctic+0.170
20439+0.164
+0.160
asia+0.160
iron+0.156
Neg logit contributions
emouth-0.188
 signific-0.174
 independ-0.167
apist-0.164
itable-0.154
Token "
Token ID366
Feature activation+1.47e-01

Pos logit contributions
emouth+1.435
 signific+1.342
 independ+1.286
apist+1.250
itable+1.172
Neg logit contributions
ctic-1.367
20439-1.332
-1.293
asia-1.285
iron-1.260
TokenGod
Token ID13482
Feature activation+1.48e+00

Pos logit contributions
emouth+0.310
 signific+0.292
 independ+0.280
apist+0.273
itable+0.257
Neg logit contributions
ctic-0.256
20439-0.248
-0.241
asia-0.239
iron-0.234
Token,
Token ID11
Feature activation-2.51e-01

Pos logit contributions
emouth+2.714
 signific+2.527
 independ+2.410
apist+2.338
itable+2.192
Neg logit contributions
ctic-2.971
20439-2.915
-2.829
asia-2.806
iron-2.756
Token how
Token ID703
Feature activation-3.31e-01

Pos logit contributions
ctic+0.460
20439+0.438
asia+0.432
+0.431
iron+0.424
Neg logit contributions
emouth-0.513
 signific-0.478
 independ-0.457
apist-0.447
itable-0.421
Token could
Token ID714
Feature activation+6.95e-02

Pos logit contributions
ctic+0.604
20439+0.576
+0.565
asia+0.562
iron+0.555
Neg logit contributions
emouth-0.678
 signific-0.630
 independ-0.603
apist-0.593
itable-0.555
Token a
Token ID257
Feature activation+1.76e-01

Pos logit contributions
emouth+0.139
 signific+0.130
 independ+0.123
apist+0.121
itable+0.113
Neg logit contributions
ctic-0.130
20439-0.125
-0.123
asia-0.122
iron-0.120
Token guy
Token ID3516
Feature activation-8.25e-01

Pos logit contributions
emouth+0.363
 signific+0.340
 independ+0.326
apist+0.318
itable+0.299
Neg logit contributions
ctic-0.316
20439-0.307
-0.298
asia-0.297
iron-0.290
Token is
Token ID318
Feature activation+2.50e-01

Pos logit contributions
emouth+0.863
 signific+0.805
 independ+0.770
apist+0.746
itable+0.698
Neg logit contributions
ctic-0.900
20439-0.873
-0.851
asia-0.849
iron-0.834
Token this
Token ID428
Feature activation+2.35e-01

Pos logit contributions
emouth+0.493
 signific+0.460
 independ+0.440
apist+0.429
itable+0.402
Neg logit contributions
ctic-0.473
20439-0.459
-0.448
asia-0.446
iron-0.437
Token genius
Token ID15632
Feature activation+2.07e-01

Pos logit contributions
emouth+0.487
 signific+0.455
 independ+0.437
apist+0.427
itable+0.401
Neg logit contributions
ctic-0.420
20439-0.407
-0.396
asia-0.395
iron-0.386
Token Patrick
Token ID9925
Feature activation+4.26e-01

Pos logit contributions
emouth+0.418
 signific+0.391
 independ+0.375
apist+0.365
itable+0.342
Neg logit contributions
ctic-0.381
20439-0.369
-0.360
asia-0.358
iron-0.351
Token O
Token ID440
Feature activation+4.51e-01

Pos logit contributions
emouth+0.856
 signific+0.798
 independ+0.765
apist+0.744
itable+0.700
Neg logit contributions
ctic-0.791
20439-0.766
-0.747
asia-0.745
iron-0.730
Token'
Token ID6
Feature activation+1.90e+00

Pos logit contributions
emouth+0.926
 signific+0.868
 independ+0.833
apist+0.811
itable+0.763
Neg logit contributions
ctic-0.818
20439-0.793
-0.771
asia-0.767
iron-0.753
TokenBrian
Token ID24761
Feature activation+1.22e-01

Pos logit contributions
emouth+3.541
 signific+3.290
 independ+3.131
apist+3.045
itable+2.865
Neg logit contributions
ctic-3.758
20439-3.673
asia-3.562
-3.552
orget-3.477
Token and
Token ID290
Feature activation+2.42e-01

Pos logit contributions
emouth+0.243
 signific+0.227
 independ+0.218
apist+0.212
itable+0.199
Neg logit contributions
ctic-0.227
20439-0.220
-0.214
asia-0.213
iron-0.209
Token you
Token ID345
Feature activation+4.55e-01

Pos logit contributions
emouth+0.492
 signific+0.460
 independ+0.441
apist+0.430
itable+0.404
Neg logit contributions
ctic-0.443
20439-0.430
-0.418
asia-0.416
iron-0.408
Token're
Token ID821
Feature activation-4.63e-01

Pos logit contributions
emouth+0.908
 signific+0.847
 independ+0.811
apist+0.793
itable+0.741
Neg logit contributions
ctic-0.851
20439-0.819
-0.802
asia-0.798
iron-0.784
Token not
Token ID407
Feature activation-2.14e-01

Pos logit contributions
ctic+0.815
20439+0.780
+0.765
asia+0.765
iron+0.749
Neg logit contributions
emouth-0.973
 signific-0.902
 independ-0.864
apist-0.857
imental-0.802
Token services
Token ID2594
Feature activation+1.18e-01

Pos logit contributions
emouth+0.314
 signific+0.294
 independ+0.282
apist+0.276
itable+0.260
Neg logit contributions
ctic-0.266
20439-0.258
-0.251
asia-0.249
iron-0.244
Token.
Token ID13
Feature activation-1.48e-02

Pos logit contributions
emouth+0.218
 signific+0.202
 independ+0.192
apist+0.188
itable+0.175
Neg logit contributions
ctic-0.239
20439-0.232
-0.228
asia-0.225
iron-0.220
Token
Token ID198
Feature activation+2.16e-01

Pos logit contributions
ctic+0.025
20439+0.024
+0.023
asia+0.023
iron+0.023
Neg logit contributions
emouth-0.033
 signific-0.031
 independ-0.030
apist-0.029
itable-0.027
Token
Token ID198
Feature activation-4.13e-02

Pos logit contributions
emouth+0.481
 signific+0.453
 independ+0.436
apist+0.424
itable+0.400
Neg logit contributions
ctic-0.358
20439-0.345
-0.338
asia-0.335
iron-0.327
TokenD
Token ID35
Feature activation+1.98e-01

Pos logit contributions
ctic+0.067
20439+0.064
+0.063
asia+0.062
iron+0.061
Neg logit contributions
emouth-0.093
 signific-0.088
 independ-0.084
apist-0.082
itable-0.078
TokenDoS
Token ID46498
Feature activation+1.90e+00

Pos logit contributions
emouth+0.335
 signific+0.310
 independ+0.295
apist+0.285
itable+0.263
Neg logit contributions
ctic-0.429
20439-0.418
-0.408
asia-0.407
iron-0.400
Token Attack
Token ID8307
Feature activation+4.69e-01

Pos logit contributions
emouth+3.807
 signific+3.580
 independ+3.430
apist+3.287
itable+3.111
Neg logit contributions
ctic-3.459
20439-3.441
asia-3.265
-3.234
iron-3.211
Token Complex
Token ID19157
Feature activation-2.88e-01

Pos logit contributions
emouth+0.996
 signific+0.936
 independ+0.900
apist+0.877
itable+0.827
Neg logit contributions
ctic-0.813
20439-0.790
-0.765
asia-0.760
iron-0.744
Tokenity
Token ID414
Feature activation+8.70e-01

Pos logit contributions
ctic+0.412
20439+0.392
+0.385
asia+0.380
iron+0.370
Neg logit contributions
emouth-0.700
 signific-0.664
 independ-0.641
apist-0.628
imental-0.595
Token
Token ID198
Feature activation+5.50e-02

Pos logit contributions
emouth+1.823
 signific+1.713
 independ+1.645
apist+1.600
itable+1.509
Neg logit contributions
ctic-1.536
20439-1.496
-1.443
asia-1.439
iron-1.409
Token
Token ID198
Feature activation-2.00e-01

Pos logit contributions
emouth+0.124
 signific+0.117
 independ+0.113
apist+0.109
 surpr+0.104
Neg logit contributions
ctic-0.090
20439-0.086
-0.085
asia-0.084
iron-0.082
Tokens
Token ID82
Feature activation+6.28e-01

Pos logit contributions
emouth+0.276
 signific+0.249
 independ+0.233
apist+0.225
itable+0.203
Neg logit contributions
ctic-0.493
20439-0.481
-0.472
asia-0.469
iron-0.463
Token have
Token ID423
Feature activation+8.42e-02

Pos logit contributions
emouth+1.202
 signific+1.121
 independ+1.072
apist+1.042
itable+0.975
Neg logit contributions
ctic-1.225
20439-1.192
-1.160
asia-1.153
iron-1.132
Token historically
Token ID15074
Feature activation-5.94e-01

Pos logit contributions
emouth+0.178
 signific+0.165
 independ+0.159
apist+0.156
itable+0.147
Neg logit contributions
ctic-0.149
20439-0.142
-0.139
asia-0.138
iron-0.136
Token been
Token ID587
Feature activation-2.10e-01

Pos logit contributions
ctic+1.048
20439+0.979
+0.964
asia+0.949
iron+0.939
Neg logit contributions
emouth-1.268
 signific-1.166
 independ-1.125
apist-1.109
imental-1.050
Token much
Token ID881
Feature activation+6.07e-01

Pos logit contributions
ctic+0.362
20439+0.346
+0.335
asia+0.332
iron+0.327
Neg logit contributions
emouth-0.453
 signific-0.422
 independ-0.406
apist-0.400
imental-0.375
Token shorter
Token ID12238
Feature activation+1.53e+00

Pos logit contributions
emouth+1.400
 signific+1.322
 independ+1.275
apist+1.245
itable+1.180
Neg logit contributions
ctic-0.943
20439-0.912
-0.881
asia-0.874
iron-0.854
Token.
Token ID13
Feature activation+2.71e-01

Pos logit contributions
emouth+3.049
 signific+2.868
 independ+2.741
apist+2.662
itable+2.502
Neg logit contributions
ctic-2.826
20439-2.778
-2.682
asia-2.678
iron-2.613
Token The
Token ID383
Feature activation+2.50e-01

Pos logit contributions
emouth+0.598
 signific+0.563
 independ+0.542
apist+0.528
itable+0.499
Neg logit contributions
ctic-0.450
20439-0.437
-0.423
asia-0.420
iron-0.410
Token latest
Token ID3452
Feature activation-1.88e-01

Pos logit contributions
emouth+0.528
 signific+0.495
 independ+0.476
apist+0.464
itable+0.437
Neg logit contributions
ctic-0.439
20439-0.427
-0.414
asia-0.411
iron-0.402
Token dispute
Token ID11047
Feature activation+4.71e-01

Pos logit contributions
ctic+0.343
20439+0.329
+0.323
asia+0.318
iron+0.311
Neg logit contributions
emouth-0.388
 signific-0.359
 independ-0.344
apist-0.340
itable-0.318
Token is
Token ID318
Feature activation+9.12e-02

Pos logit contributions
emouth+0.898
 signific+0.837
 independ+0.800
apist+0.778
itable+0.727
Neg logit contributions
ctic-0.922
20439-0.898
-0.874
asia-0.869
iron-0.853
Token not
Token ID407
Feature activation+2.21e-01

Pos logit contributions
ctic+0.605
20439+0.584
+0.568
asia+0.565
iron+0.556
Neg logit contributions
emouth-0.692
 signific-0.646
 independ-0.619
apist-0.605
itable-0.569
Token only
Token ID691
Feature activation+8.55e-01

Pos logit contributions
emouth+0.460
 signific+0.431
 independ+0.414
apist+0.403
itable+0.380
Neg logit contributions
ctic-0.395
20439-0.384
-0.372
asia-0.370
iron-0.362
Token on
Token ID319
Feature activation-7.82e-02

Pos logit contributions
emouth+1.719
 signific+1.615
 independ+1.550
apist+1.504
itable+1.414
Neg logit contributions
ctic-1.571
20439-1.537
-1.488
asia-1.478
iron-1.444
Token her
Token ID607
Feature activation-8.00e-02

Pos logit contributions
ctic+0.155
20439+0.150
+0.147
asia+0.146
iron+0.143
Neg logit contributions
emouth-0.147
 signific-0.136
 independ-0.130
apist-0.127
itable-0.118
Token ce
Token ID2906
Feature activation+8.84e-01

Pos logit contributions
ctic+0.154
20439+0.149
+0.146
asia+0.145
iron+0.142
Neg logit contributions
emouth-0.156
 signific-0.144
 independ-0.138
apist-0.135
itable-0.125
Tokenint
Token ID600
Feature activation+1.68e+00

Pos logit contributions
emouth+1.411
 signific+1.298
 independ+1.227
apist+1.187
itable+1.095
Neg logit contributions
ctic-1.999
20439-1.959
-1.911
asia-1.899
iron-1.867
Tokenure
Token ID495
Feature activation+1.21e-01

Pos logit contributions
emouth+3.513
 signific+3.315
 independ+3.170
apist+3.113
imental+2.939
Neg logit contributions
ctic-2.914
20439-2.840
-2.740
asia-2.699
iron-2.670
Token,
Token ID11
Feature activation+3.03e-01

Pos logit contributions
emouth+0.218
 signific+0.202
 independ+0.193
apist+0.187
itable+0.174
Neg logit contributions
ctic-0.252
20439-0.245
-0.239
asia-0.238
iron-0.233
Token but
Token ID475
Feature activation+3.62e-01

Pos logit contributions
emouth+0.565
 signific+0.528
 independ+0.504
apist+0.489
itable+0.456
Neg logit contributions
ctic-0.604
20439-0.591
-0.572
asia-0.570
iron-0.559
Token on
Token ID319
Feature activation-9.67e-02

Pos logit contributions
emouth+0.761
 signific+0.715
 independ+0.686
apist+0.668
itable+0.630
Neg logit contributions
ctic-0.638
20439-0.620
-0.601
asia-0.597
iron-0.585
Token her
Token ID607
Feature activation-9.71e-02

Pos logit contributions
ctic+0.188
20439+0.182
+0.178
asia+0.177
iron+0.174
Neg logit contributions
emouth-0.185
 signific-0.172
 independ-0.165
apist-0.160
itable-0.150
Token;
Token ID26
Feature activation+8.43e-01

Pos logit contributions
emouth+1.717
 signific+1.596
 independ+1.524
apist+1.483
itable+1.384
Neg logit contributions
ctic-1.813
20439-1.779
-1.727
asia-1.706
orget-1.674
Token but
Token ID475
Feature activation+4.74e-01

Pos logit contributions
emouth+1.654
 signific+1.545
 independ+1.479
apist+1.439
itable+1.349
Neg logit contributions
ctic-1.603
20439-1.559
-1.516
asia-1.508
iron-1.479
Token Diana
Token ID25752
Feature activation-8.91e-01

Pos logit contributions
emouth+0.951
 signific+0.886
 independ+0.849
apist+0.832
itable+0.780
Neg logit contributions
ctic-0.884
20439-0.852
-0.833
asia-0.830
iron-0.813
Token's
Token ID338
Feature activation+5.98e-01

Pos logit contributions
ctic+1.753
20439+1.648
+1.635
asia+1.635
vel+1.609
Neg logit contributions
emouth-1.699
 signific-1.557
 independ-1.502
apist-1.481
imental-1.376
Token reck
Token ID12482
Feature activation+1.62e+00

Pos logit contributions
emouth+1.263
 signific+1.194
 independ+1.149
apist+1.112
itable+1.053
Neg logit contributions
ctic-1.040
20439-1.018
-0.983
asia-0.973
iron-0.954
Tokenlessness
Token ID17587
Feature activation+4.88e-01

Pos logit contributions
emouth+2.135
 signific+1.908
 independ+1.797
apist+1.745
itable+1.562
Neg logit contributions
ctic-4.075
20439-4.037
-3.935
asia-3.904
iron-3.859
Token is
Token ID318
Feature activation+2.20e-01

Pos logit contributions
emouth+0.901
 signific+0.834
 independ+0.798
apist+0.776
itable+0.723
Neg logit contributions
ctic-0.992
20439-0.957
-0.935
asia-0.932
iron-0.918
Token a
Token ID257
Feature activation+2.87e-01

Pos logit contributions
emouth+0.465
 signific+0.435
 independ+0.418
apist+0.409
itable+0.384
Neg logit contributions
ctic-0.387
20439-0.373
-0.362
asia-0.362
iron-0.356
Token perfect
Token ID2818
Feature activation-6.21e-01

Pos logit contributions
emouth+0.578
 signific+0.538
 independ+0.517
apist+0.504
itable+0.470
Neg logit contributions
ctic-0.533
20439-0.516
-0.503
asia-0.502
iron-0.491
Token match
Token ID2872
Feature activation+1.24e-02

Pos logit contributions
ctic+0.938
20439+0.888
+0.882
asia+0.874
iron+0.856
Neg logit contributions
emouth-1.449
 signific-1.353
 independ-1.312
apist-1.293
imental-1.223
Token for
Token ID329
Feature activation+2.73e-01

Pos logit contributions
emouth+0.025
 signific+0.024
 independ+0.023
apist+0.022
itable+0.021
Neg logit contributions
ctic-0.023
20439-0.022
-0.022
asia-0.021
iron-0.021
Token
Token ID198
Feature activation-6.41e-01

Pos logit contributions
ctic+0.804
+0.765
20439+0.755
asia+0.749
[/+0.737
Neg logit contributions
emouth-1.222
 signific-1.154
 independ-1.113
apist-1.078
 surpr-1.049
Token3
Token ID18
Feature activation-2.07e-01

Pos logit contributions
ctic+1.002
20439+0.961
+0.952
asia+0.935
iron+0.919
Neg logit contributions
emouth-1.477
 signific-1.399
 independ-1.341
apist-1.305
itable-1.239
TokenD
Token ID35
Feature activation+5.63e-01

Pos logit contributions
ctic+0.381
20439+0.371
+0.360
asia+0.357
iron+0.350
Neg logit contributions
emouth-0.418
 signific-0.392
 independ-0.376
apist-0.366
itable-0.344
TokenMAX
Token ID22921
Feature activation+1.43e-01

Pos logit contributions
emouth+1.105
 signific+1.032
 independ+0.985
apist+0.961
itable+0.900
Neg logit contributions
ctic-1.065
20439-1.048
-1.011
asia-1.006
iron-0.984
Token was
Token ID373
Feature activation+5.45e-02

Pos logit contributions
emouth+0.273
 signific+0.255
 independ+0.243
apist+0.237
itable+0.221
Neg logit contributions
ctic-0.281
20439-0.272
-0.266
asia-0.264
iron-0.260
Token doomed
Token ID24312
Feature activation+2.47e-01

Pos logit contributions
emouth+0.109
 signific+0.102
 independ+0.097
apist+0.095
itable+0.089
Neg logit contributions
ctic-0.102
20439-0.099
-0.096
asia-0.096
iron-0.094
Token first
Token ID717
Feature activation+4.49e-01

Pos logit contributions
emouth+0.521
 signific+0.489
 independ+0.470
apist+0.458
itable+0.431
Neg logit contributions
ctic-0.434
20439-0.421
-0.408
asia-0.406
iron-0.398
Token when
Token ID618
Feature activation+1.47e-02

Pos logit contributions
emouth+0.960
 signific+0.902
 independ+0.867
apist+0.845
itable+0.797
Neg logit contributions
ctic-0.776
20439-0.754
-0.730
asia-0.725
iron-0.710
Token all
Token ID477
Feature activation+9.25e-01

Pos logit contributions
emouth+0.028
 signific+0.026
 independ+0.025
apist+0.024
itable+0.023
Neg logit contributions
ctic-0.029
20439-0.028
-0.027
asia-0.027
iron-0.027
Tokenu
Token ID84
Feature activation-3.17e-01

Pos logit contributions
emouth+1.816
 signific+1.702
 independ+1.629
apist+1.579
itable+1.481
Neg logit contributions
ctic-1.749
20439-1.711
-1.655
asia-1.644
iron-1.610
Token received
Token ID2722
Feature activation+8.06e-01

Pos logit contributions
ctic+0.594
20439+0.572
+0.559
asia+0.557
iron+0.547
Neg logit contributions
emouth-0.632
 signific-0.590
 independ-0.565
apist-0.551
itable-0.515
Token game
Token ID983
Feature activation+4.04e-01

Pos logit contributions
ctic+0.091
20439+0.088
+0.085
asia+0.085
iron+0.083
Neg logit contributions
emouth-0.104
 signific-0.097
 independ-0.093
apist-0.091
itable-0.086
Token than
Token ID621
Feature activation-8.25e-01

Pos logit contributions
emouth+0.807
 signific+0.755
 independ+0.723
apist+0.703
itable+0.661
Neg logit contributions
ctic-0.752
20439-0.733
-0.711
asia-0.707
iron-0.693
Token his
Token ID465
Feature activation-4.33e-03

Pos logit contributions
ctic+1.497
20439+1.446
+1.416
asia+1.403
iron+1.376
Neg logit contributions
emouth-1.676
 signific-1.558
 independ-1.512
apist-1.477
itable-1.380
Token daily
Token ID4445
Feature activation+1.70e-01

Pos logit contributions
ctic+0.008
20439+0.008
+0.007
asia+0.007
iron+0.007
Neg logit contributions
emouth-0.009
 signific-0.008
 independ-0.008
apist-0.008
itable-0.007
Token hot
Token ID3024
Feature activation-4.88e-02

Pos logit contributions
emouth+0.352
 signific+0.328
 independ+0.316
apist+0.309
itable+0.290
Neg logit contributions
ctic-0.305
20439-0.295
-0.286
asia-0.285
iron-0.279
Token dog
Token ID3290
Feature activation+1.83e-01

Pos logit contributions
ctic+0.085
20439+0.083
+0.080
asia+0.079
iron+0.078
Neg logit contributions
emouth-0.103
 signific-0.097
 independ-0.093
apist-0.091
itable-0.086
Token snack
Token ID26906
Feature activation-4.44e-02

Pos logit contributions
emouth+0.359
 signific+0.333
 independ+0.320
apist+0.314
itable+0.293
Neg logit contributions
ctic-0.350
20439-0.335
-0.330
asia-0.329
iron-0.321
Token.
Token ID13
Feature activation+3.51e-01

Pos logit contributions
ctic+0.082
20439+0.079
+0.077
asia+0.077
iron+0.075
Neg logit contributions
emouth-0.090
 signific-0.084
 independ-0.081
apist-0.079
itable-0.074
Token
Token ID198
Feature activation+5.46e-01

Pos logit contributions
emouth+0.774
 signific+0.729
 independ+0.701
apist+0.684
itable+0.647
Neg logit contributions
ctic-0.582
20439-0.565
-0.546
asia-0.543
iron-0.531
Token
Token ID198
Feature activation+1.47e-01

Pos logit contributions
emouth+1.184
 signific+1.114
 independ+1.072
apist+1.045
itable+0.986
Neg logit contributions
ctic-0.925
20439-0.898
-0.870
asia-0.864
iron-0.845
TokenNow
Token ID3844
Feature activation-4.67e-01

Pos logit contributions
emouth+0.333
 signific+0.314
 independ+0.302
apist+0.295
itable+0.279
Neg logit contributions
ctic-0.235
20439-0.228
-0.221
asia-0.219
iron-0.214
Token used
Token ID973
Feature activation-1.30e+00

Pos logit contributions
emouth+0.138
 signific+0.129
 independ+0.123
apist+0.121
itable+0.113
Neg logit contributions
ctic-0.127
20439-0.122
-0.119
asia-0.119
iron-0.117
Token in
Token ID287
Feature activation+8.82e-02

Pos logit contributions
ctic+2.277
20439+2.162
asia+2.103
+2.100
iron+2.069
Neg logit contributions
emouth-2.738
 signific-2.579
 independ-2.463
apist-2.445
itable-2.303
Token deep
Token ID2769
Feature activation+1.01e+00

Pos logit contributions
emouth+0.180
 signific+0.169
 independ+0.162
apist+0.157
itable+0.148
Neg logit contributions
ctic-0.160
20439-0.157
-0.152
asia-0.151
iron-0.148
Token Arctic
Token ID15723
Feature activation-5.84e-01

Pos logit contributions
emouth+1.798
 signific+1.670
 independ+1.590
apist+1.523
itable+1.423
Neg logit contributions
ctic-2.065
20439-2.047
-1.976
asia-1.969
iron-1.930
Token waters
Token ID10150
Feature activation+3.26e-01

Pos logit contributions
ctic+1.062
20439+1.014
+0.996
asia+0.987
iron+0.966
Neg logit contributions
emouth-1.198
 signific-1.120
 independ-1.073
apist-1.057
itable-0.991
Token.
Token ID13
Feature activation+2.12e-01

Pos logit contributions
emouth+0.652
 signific+0.611
 independ+0.586
apist+0.568
itable+0.534
Neg logit contributions
ctic-0.604
20439-0.590
-0.572
asia-0.569
iron-0.558
Token The
Token ID383
Feature activation+2.89e-01

Pos logit contributions
emouth+0.473
 signific+0.445
 independ+0.429
apist+0.419
itable+0.396
Neg logit contributions
ctic-0.346
20439-0.335
-0.325
asia-0.322
iron-0.315
Token Ark
Token ID9128
Feature activation+8.05e-01

Pos logit contributions
emouth+0.597
 signific+0.558
 independ+0.536
apist+0.525
itable+0.493
Neg logit contributions
ctic-0.522
20439-0.505
-0.492
asia-0.489
iron-0.479
Tokent
Token ID83
Feature activation-1.96e-01

Pos logit contributions
emouth+1.552
 signific+1.441
 independ+1.368
apist+1.345
itable+1.263
Neg logit contributions
ctic-1.529
20439-1.523
-1.472
asia-1.454
orget-1.422
Tokenika
Token ID9232
Feature activation-8.72e-01

Pos logit contributions
ctic+0.333
asia+0.311
20439+0.310
+0.303
iron+0.298
Neg logit contributions
emouth-0.427
 signific-0.410
 independ-0.393
apist-0.379
 surpr-0.360
Token was
Token ID373
Feature activation+2.73e-02

Pos logit contributions
ctic+1.619
20439+1.509
+1.508
asia+1.496
iron+1.473
Neg logit contributions
emouth-1.758
 signific-1.609
apist-1.562
 independ-1.556
imental-1.443
Token ejac
Token ID39426
Feature activation+2.70e-01

Pos logit contributions
emouth+0.163
 signific+0.152
 independ+0.145
apist+0.142
itable+0.134
Neg logit contributions
ctic-0.137
20439-0.131
-0.129
asia-0.127
iron-0.125
Tokenulate
Token ID5039
Feature activation-2.31e-02

Pos logit contributions
emouth+0.467
 signific+0.435
 independ+0.412
apist+0.397
itable+0.368
Neg logit contributions
ctic-0.574
20439-0.557
-0.546
asia-0.544
iron-0.536
Token soon
Token ID2582
Feature activation+1.05e-01

Pos logit contributions
ctic+0.042
20439+0.040
+0.039
asia+0.039
iron+0.039
Neg logit contributions
emouth-0.048
 signific-0.044
 independ-0.042
apist-0.042
itable-0.039
Token.
Token ID13
Feature activation+1.59e-01

Pos logit contributions
emouth+0.215
 signific+0.202
 independ+0.193
apist+0.188
itable+0.177
Neg logit contributions
ctic-0.189
20439-0.182
-0.178
asia-0.177
iron-0.174
Token So
Token ID1406
Feature activation+5.50e-02

Pos logit contributions
emouth+0.355
 signific+0.334
 independ+0.322
apist+0.313
itable+0.297
Neg logit contributions
ctic-0.261
20439-0.253
-0.246
asia-0.244
iron-0.238
Token you
Token ID345
Feature activation+7.46e-01

Pos logit contributions
emouth+0.109
 signific+0.101
 independ+0.097
apist+0.094
itable+0.089
Neg logit contributions
ctic-0.104
20439-0.100
-0.098
asia-0.097
iron-0.096
Token can
Token ID460
Feature activation-1.37e-01

Pos logit contributions
emouth+1.475
 signific+1.378
 independ+1.319
apist+1.284
itable+1.204
Neg logit contributions
ctic-1.409
20439-1.368
-1.331
asia-1.323
iron-1.298
Token watch
Token ID2342
Feature activation-1.08e-01

Pos logit contributions
ctic+0.229
20439+0.217
+0.215
asia+0.212
iron+0.208
Neg logit contributions
emouth-0.303
 signific-0.282
 independ-0.271
apist-0.266
itable-0.251
Token porn
Token ID8483
Feature activation+6.47e-01

Pos logit contributions
ctic+0.200
20439+0.190
+0.190
asia+0.187
iron+0.184
Neg logit contributions
emouth-0.218
 signific-0.202
 independ-0.193
apist-0.189
itable-0.179
Token and
Token ID290
Feature activation+2.92e-01

Pos logit contributions
emouth+1.261
 signific+1.178
 independ+1.125
apist+1.092
itable+1.027
Neg logit contributions
ctic-1.241
20439-1.203
-1.173
asia-1.168
iron-1.146
Token masturb
Token ID22938
Feature activation+3.15e-02

Pos logit contributions
emouth+0.576
 signific+0.538
 independ+0.515
apist+0.501
itable+0.470
Neg logit contributions
ctic-0.553
20439-0.537
-0.523
asia-0.520
iron-0.510
Token two
Token ID734
Feature activation+6.16e-01

Pos logit contributions
emouth+0.696
 signific+0.643
 independ+0.617
apist+0.603
itable+0.562
Neg logit contributions
ctic-0.674
20439-0.652
-0.637
asia-0.632
iron-0.622
Token strings
Token ID13042
Feature activation-3.73e-01

Pos logit contributions
emouth+1.311
 signific+1.225
 independ+1.176
apist+1.152
itable+1.084
Neg logit contributions
ctic-1.071
20439-1.038
-1.009
asia-1.002
iron-0.981
Token attached
Token ID7223
Feature activation-8.39e-01

Pos logit contributions
ctic+0.524
20439+0.497
iron+0.479
+0.479
asia+0.477
Neg logit contributions
emouth-0.916
 signific-0.862
apist-0.832
 independ-0.831
itable-0.780
Token.
Token ID13
Feature activation-6.60e-02

Pos logit contributions
ctic+1.436
20439+1.380
+1.345
asia+1.335
iron+1.316
Neg logit contributions
emouth-1.804
 signific-1.687
 independ-1.618
apist-1.592
itable-1.491
Token
Token ID198
Feature activation+6.43e-02

Pos logit contributions
ctic+0.110
20439+0.107
+0.104
asia+0.103
iron+0.101
Neg logit contributions
emouth-0.145
 signific-0.136
 independ-0.131
apist-0.128
itable-0.121
Token
Token ID198
Feature activation-2.63e-01

Pos logit contributions
emouth+0.145
 signific+0.136
 independ+0.131
apist+0.127
 surpr+0.121
Neg logit contributions
ctic-0.105
20439-0.101
-0.099
asia-0.098
iron-0.096
TokenThe
Token ID464
Feature activation+2.91e-01

Pos logit contributions
ctic+0.419
20439+0.408
+0.401
asia+0.391
iron+0.385
Neg logit contributions
emouth-0.603
 signific-0.569
 independ-0.549
apist-0.530
itable-0.503
Token president
Token ID1893
Feature activation+3.91e-01

Pos logit contributions
emouth+0.566
 signific+0.524
 independ+0.503
apist+0.491
imental+0.459
Neg logit contributions
ctic-0.559
20439-0.544
-0.529
asia-0.524
iron-0.517
Token told
Token ID1297
Feature activation-1.43e+00

Pos logit contributions
emouth+0.732
 signific+0.678
 independ+0.652
apist+0.632
imental+0.590
Neg logit contributions
ctic-0.784
20439-0.754
-0.734
asia-0.731
iron-0.722
Token Fox
Token ID5426
Feature activation-4.14e-01

Pos logit contributions
ctic+2.508
20439+2.410
asia+2.371
[/+2.347
+2.328
Neg logit contributions
emouth-2.914
 signific-2.719
 independ-2.598
apist-2.562
imental-2.410
Token News
Token ID3000
Feature activation+6.57e-01

Pos logit contributions
ctic+0.838
20439+0.811
+0.793
asia+0.791
iron+0.776
Neg logit contributions
emouth-0.762
 signific-0.710
 independ-0.678
apist-0.652
imental-0.616
Token security
Token ID2324
Feature activation+7.17e-01

Pos logit contributions
ctic+0.383
20439+0.372
+0.362
asia+0.360
iron+0.351
Neg logit contributions
emouth-0.422
 signific-0.393
 independ-0.377
apist-0.369
itable-0.345
Token sector
Token ID6567
Feature activation+8.04e-01

Pos logit contributions
emouth+1.357
 signific+1.263
 independ+1.207
apist+1.175
itable+1.097
Neg logit contributions
ctic-1.415
20439-1.378
-1.342
asia-1.334
iron-1.309
Token reform
Token ID4975
Feature activation-3.26e-01

Pos logit contributions
emouth+1.656
 signific+1.557
 independ+1.497
apist+1.450
itable+1.366
Neg logit contributions
ctic-1.442
20439-1.402
-1.358
asia-1.349
iron-1.326
Token efforts
Token ID4040
Feature activation-4.93e-01

Pos logit contributions
ctic+0.622
20439+0.609
+0.590
asia+0.588
iron+0.572
Neg logit contributions
emouth-0.637
 signific-0.595
 independ-0.566
apist-0.553
itable-0.518
Token in
Token ID287
Feature activation-1.42e-01

Pos logit contributions
ctic+0.950
20439+0.927
+0.901
asia+0.895
iron+0.873
Neg logit contributions
emouth-0.953
 signific-0.893
 independ-0.846
apist-0.832
itable-0.781
Token the
Token ID262
Feature activation-1.98e-01

Pos logit contributions
ctic+0.255
20439+0.248
+0.242
asia+0.241
iron+0.234
Neg logit contributions
emouth-0.292
 signific-0.272
 independ-0.261
apist-0.257
itable-0.240
Token D
Token ID360
Feature activation-4.46e-01

Pos logit contributions
ctic+0.376
20439+0.364
asia+0.355
+0.355
iron+0.345
Neg logit contributions
emouth-0.392
 signific-0.361
 independ-0.346
apist-0.340
itable-0.317
TokenRC
Token ID7397
Feature activation-1.24e+00

Pos logit contributions
ctic+0.545
20439+0.503
+0.498
asia+0.494
iron+0.483
Neg logit contributions
emouth-1.194
 signific-1.131
 independ-1.105
apist-1.068
 surpr-1.030
Token,
Token ID11
Feature activation-6.58e-02

Pos logit contributions
ctic+2.319
20439+2.262
+2.201
asia+2.187
iron+2.126
Neg logit contributions
emouth-2.438
 signific-2.280
 independ-2.155
apist-2.112
imental-2.001
Token but
Token ID475
Feature activation+3.50e-01

Pos logit contributions
ctic+0.110
20439+0.107
+0.104
asia+0.103
iron+0.101
Neg logit contributions
emouth-0.144
 signific-0.135
 independ-0.130
apist-0.127
itable-0.120
Token we
Token ID356
Feature activation+2.74e-01

Pos logit contributions
emouth+0.684
 signific+0.637
 independ+0.609
apist+0.595
itable+0.556
Neg logit contributions
ctic-0.672
20439-0.653
-0.636
asia-0.632
iron-0.619
Token.
Token ID13
Feature activation+1.74e-03

Pos logit contributions
emouth+3.279
 signific+3.105
 independ+2.973
apist+2.853
itable+2.688
Neg logit contributions
ctic-3.295
20439-3.264
-3.108
asia-3.095
iron-3.070
Token The
Token ID383
Feature activation+1.27e-01

Pos logit contributions
emouth+0.004
 signific+0.004
 independ+0.004
apist+0.004
itable+0.003
Neg logit contributions
ctic-0.003
20439-0.003
-0.003
asia-0.003
iron-0.002
Token exceptionally
Token ID24822
Feature activation-4.63e-03

Pos logit contributions
emouth+0.269
 signific+0.252
 independ+0.242
apist+0.236
itable+0.222
Neg logit contributions
ctic-0.224
20439-0.217
-0.211
asia-0.209
iron-0.205
Token adapt
Token ID6068
Feature activation+1.02e+00

Pos logit contributions
ctic+0.008
20439+0.008
+0.008
asia+0.008
iron+0.008
Neg logit contributions
emouth-0.010
 signific-0.009
 independ-0.009
apist-0.008
imental-0.008
Tokenable
Token ID540
Feature activation+6.67e-01

Pos logit contributions
emouth+1.629
 signific+1.500
 independ+1.421
apist+1.375
itable+1.269
Neg logit contributions
ctic-2.302
20439-2.264
-2.204
asia-2.193
iron-2.157
Token,
Token ID11
Feature activation-3.10e-01

Pos logit contributions
emouth+1.459
 signific+1.374
 independ+1.322
apist+1.289
itable+1.218
Neg logit contributions
ctic-1.118
20439-1.086
-1.049
asia-1.043
iron-1.020
Token decentral
Token ID20649
Feature activation+6.11e-01

Pos logit contributions
ctic+0.556
20439+0.536
+0.522
asia+0.517
iron+0.511
Neg logit contributions
emouth-0.643
 signific-0.599
 independ-0.569
apist-0.562
itable-0.527
Tokenised
Token ID1417
Feature activation-3.18e-01

Pos logit contributions
emouth+1.618
 signific+1.539
 independ+1.494
apist+1.465
itable+1.403
Neg logit contributions
ctic-0.734
20439-0.711
-0.679
asia-0.666
iron-0.645
Token tool
Token ID2891
Feature activation-2.18e-01

Pos logit contributions
ctic+0.568
20439+0.543
+0.535
asia+0.533
orget+0.518
Neg logit contributions
emouth-0.660
 signific-0.608
 independ-0.585
apist-0.579
itable-0.543
Token that
Token ID326
Feature activation+3.38e-01

Pos logit contributions
ctic+0.414
20439+0.402
+0.391
asia+0.390
iron+0.384
Neg logit contributions
emouth-0.428
 signific-0.401
 independ-0.384
apist-0.373
itable-0.350
Token is
Token ID318
Feature activation+1.16e-01

Pos logit contributions
emouth+0.664
 signific+0.620
 independ+0.594
apist+0.578
itable+0.542
Neg logit contributions
ctic-0.640
20439-0.623
-0.606
asia-0.602
iron-0.590
Token a
Token ID257
Feature activation+4.04e-01

Pos logit contributions
ctic+0.478
20439+0.456
+0.448
asia+0.446
iron+0.438
Neg logit contributions
emouth-0.493
 signific-0.459
 independ-0.436
apist-0.430
itable-0.401
Token joy
Token ID8716
Feature activation+5.06e-02

Pos logit contributions
emouth+0.836
 signific+0.783
 independ+0.752
apist+0.733
itable+0.690
Neg logit contributions
ctic-0.724
20439-0.702
-0.682
asia-0.678
iron-0.664
Tokenous
Token ID516
Feature activation-2.45e-02

Pos logit contributions
emouth+0.116
 signific+0.110
 independ+0.106
apist+0.104
itable+0.098
Neg logit contributions
ctic-0.079
20439-0.077
-0.074
asia-0.073
iron-0.072
Token Easter
Token ID20111
Feature activation-1.55e+00

Pos logit contributions
ctic+0.044
20439+0.042
+0.041
asia+0.041
iron+0.040
Neg logit contributions
emouth-0.051
 signific-0.047
 independ-0.045
apist-0.044
itable-0.042
Token Sunday
Token ID3502
Feature activation-3.28e-01

Pos logit contributions
ctic+2.741
20439+2.617
+2.579
asia+2.569
iron+2.521
Neg logit contributions
emouth-3.215
 signific-2.975
 independ-2.878
apist-2.817
itable-2.647
Token celebration
Token ID14395
Feature activation-7.35e-01

Pos logit contributions
ctic+0.619
20439+0.595
+0.583
asia+0.581
iron+0.570
Neg logit contributions
emouth-0.649
 signific-0.604
 independ-0.578
apist-0.564
itable-0.528
Token,
Token ID11
Feature activation+1.44e-01

Pos logit contributions
ctic+1.376
20439+1.306
+1.286
asia+1.284
iron+1.261
Neg logit contributions
emouth-1.464
 signific-1.368
 independ-1.309
apist-1.281
itable-1.198
Token but
Token ID475
Feature activation+6.22e-01

Pos logit contributions
emouth+0.277
 signific+0.259
 independ+0.247
apist+0.240
itable+0.225
Neg logit contributions
ctic-0.279
20439-0.271
-0.264
asia-0.262
iron-0.257
Token the
Token ID262
Feature activation+1.57e-01

Pos logit contributions
emouth+1.223
 signific+1.142
 independ+1.093
apist+1.064
itable+0.998
Neg logit contributions
ctic-1.181
20439-1.148
-1.116
asia-1.111
iron-1.089
Token brunch
Token ID50219
Feature activation+9.57e-01

Pos logit contributions
emouth+0.325
 signific+0.305
 independ+0.292
apist+0.285
itable+0.268
Neg logit contributions
ctic-0.280
20439-0.272
-0.264
asia-0.262
iron-0.257
Token turned
Token ID2900
Feature activation-1.75e-01

Pos logit contributions
emouth+1.856
 signific+1.736
 independ+1.661
apist+1.612
itable+1.511
Neg logit contributions
ctic-1.833
20439-1.798
-1.737
asia-1.726
iron-1.691
Token the
Token ID262
Feature activation+3.96e-02

Pos logit contributions
ctic+0.794
20439+0.765
+0.749
asia+0.744
iron+0.729
Neg logit contributions
emouth-0.916
 signific-0.853
 independ-0.820
apist-0.801
itable-0.753
Token players
Token ID1938
Feature activation+2.75e-01

Pos logit contributions
emouth+0.080
 signific+0.075
 independ+0.072
apist+0.070
itable+0.066
Neg logit contributions
ctic-0.073
20439-0.071
-0.069
asia-0.068
iron-0.067
Token and
Token ID290
Feature activation+7.77e-02

Pos logit contributions
emouth+0.534
 signific+0.496
 independ+0.475
apist+0.464
itable+0.434
Neg logit contributions
ctic-0.529
20439-0.514
-0.500
asia-0.498
iron-0.490
Token the
Token ID262
Feature activation+1.91e-02

Pos logit contributions
emouth+0.157
 signific+0.146
 independ+0.140
apist+0.137
itable+0.129
Neg logit contributions
ctic-0.143
20439-0.139
-0.135
asia-0.134
iron-0.132
Token events
Token ID2995
Feature activation-3.07e-01

Pos logit contributions
emouth+0.038
 signific+0.036
 independ+0.034
apist+0.033
itable+0.031
Neg logit contributions
ctic-0.036
20439-0.035
-0.034
asia-0.034
iron-0.033
Token of
Token ID286
Feature activation-2.05e-01

Pos logit contributions
ctic+0.577
20439+0.561
+0.544
asia+0.543
iron+0.535
Neg logit contributions
emouth-0.610
 signific-0.564
 independ-0.540
apist-0.530
itable-0.495
Token the
Token ID262
Feature activation+1.97e-01

Pos logit contributions
ctic+0.388
20439+0.376
+0.367
asia+0.367
iron+0.359
Neg logit contributions
emouth-0.402
 signific-0.374
 independ-0.358
apist-0.349
itable-0.327
Token first
Token ID717
Feature activation+1.53e-01

Pos logit contributions
emouth+0.389
 signific+0.362
 independ+0.347
apist+0.339
itable+0.318
Neg logit contributions
ctic-0.372
20439-0.362
-0.352
asia-0.350
iron-0.344
Token two
Token ID734
Feature activation+9.71e-01

Pos logit contributions
emouth+0.361
 signific+0.336
 independ+0.325
apist+0.321
itable+0.304
Neg logit contributions
ctic-0.235
20439-0.224
-0.217
asia-0.217
iron-0.211
Token acts
Token ID6529
Feature activation-7.19e-02

Pos logit contributions
emouth+1.758
 signific+1.633
 independ+1.557
apist+1.511
itable+1.407
Neg logit contributions
ctic-1.993
20439-1.944
-1.893
asia-1.883
iron-1.850
Token,
Token ID11
Feature activation+1.45e-01

Pos logit contributions
ctic+0.150
20439+0.146
+0.143
asia+0.142
iron+0.139
Neg logit contributions
emouth-0.128
 signific-0.118
 independ-0.113
apist-0.110
itable-0.102
Token or
Token ID393
Feature activation-5.92e-01

Pos logit contributions
emouth+0.088
 signific+0.082
 independ+0.078
apist+0.077
itable+0.072
Neg logit contributions
ctic-0.082
20439-0.079
-0.077
asia-0.077
iron-0.075
Token rogue
Token ID23586
Feature activation+7.12e-02

Pos logit contributions
ctic+1.061
20439+1.020
+0.998
asia+0.996
vel+0.968
Neg logit contributions
emouth-1.225
 signific-1.136
 independ-1.082
apist-1.071
itable-1.008
Token governments
Token ID6905
Feature activation-5.43e-01

Pos logit contributions
emouth+0.140
 signific+0.130
 independ+0.124
apist+0.123
imental+0.114
Neg logit contributions
ctic-0.135
20439-0.131
-0.128
asia-0.127
iron-0.124
Token,
Token ID11
Feature activation-1.99e-01

Pos logit contributions
ctic+1.130
20439+1.096
+1.065
asia+1.063
iron+1.043
Neg logit contributions
emouth-0.971
 signific-0.899
 independ-0.858
apist-0.838
imental-0.776
Token can
Token ID460
Feature activation-4.42e-01

Pos logit contributions
ctic+0.380
20439+0.367
+0.358
asia+0.357
iron+0.349
Neg logit contributions
emouth-0.389
 signific-0.361
 independ-0.347
apist-0.339
imental-0.316
Token use
Token ID779
Feature activation-1.81e+00

Pos logit contributions
ctic+0.791
20439+0.747
+0.743
asia+0.740
iron+0.724
Neg logit contributions
emouth-0.922
 signific-0.851
 independ-0.815
apist-0.800
imental-0.758
Token their
Token ID511
Feature activation-4.08e-01

Pos logit contributions
ctic+3.088
20439+2.888
+2.887
asia+2.859
vel+2.806
Neg logit contributions
emouth-3.900
 signific-3.528
apist-3.411
 independ-3.397
imental-3.185
Token knowledge
Token ID3725
Feature activation-5.81e-01

Pos logit contributions
ctic+0.724
20439+0.700
+0.681
asia+0.676
iron+0.662
Neg logit contributions
emouth-0.854
 signific-0.797
 independ-0.765
apist-0.750
itable-0.703
Token to
Token ID284
Feature activation-6.44e-03

Pos logit contributions
ctic+1.055
20439+1.016
+0.989
asia+0.987
iron+0.970
Neg logit contributions
emouth-1.191
 signific-1.114
 independ-1.067
apist-1.046
itable-0.984
Token exploit
Token ID14561
Feature activation-4.49e-01

Pos logit contributions
ctic+0.012
20439+0.011
+0.011
asia+0.011
iron+0.011
Neg logit contributions
emouth-0.013
 signific-0.012
 independ-0.012
apist-0.012
imental-0.011
Token an
Token ID281
Feature activation-3.35e-01

Pos logit contributions
ctic+0.821
20439+0.786
+0.772
asia+0.768
orget+0.749
Neg logit contributions
emouth-0.920
 signific-0.848
 independ-0.816
apist-0.806
imental-0.751
Token busy
Token ID8179
Feature activation-8.66e-01

Pos logit contributions
ctic+0.348
20439+0.330
+0.323
asia+0.322
orget+0.315
Neg logit contributions
emouth-0.454
 signific-0.425
 independ-0.413
apist-0.406
itable-0.379
Token street
Token ID4675
Feature activation-1.82e-02

Pos logit contributions
ctic+1.595
20439+1.494
asia+1.488
+1.483
orget+1.461
Neg logit contributions
emouth-1.741
 signific-1.623
 independ-1.565
apist-1.555
itable-1.436
Token on
Token ID319
Feature activation+1.33e-01

Pos logit contributions
ctic+0.034
20439+0.033
+0.032
asia+0.032
iron+0.032
Neg logit contributions
emouth-0.036
 signific-0.034
 independ-0.033
apist-0.032
itable-0.030
Token repeat
Token ID9585
Feature activation+1.52e-01

Pos logit contributions
emouth+0.262
 signific+0.245
 independ+0.235
apist+0.229
itable+0.215
Neg logit contributions
ctic-0.250
20439-0.243
-0.236
asia-0.235
iron-0.231
Token.
Token ID13
Feature activation-9.32e-02

Pos logit contributions
emouth+0.308
 signific+0.287
 independ+0.277
apist+0.270
itable+0.252
Neg logit contributions
ctic-0.279
20439-0.268
-0.263
asia-0.261
iron-0.256
Token Posted
Token ID12918
Feature activation-2.00e+00

Pos logit contributions
ctic+0.154
20439+0.149
+0.145
asia+0.144
iron+0.140
Neg logit contributions
emouth-0.207
 signific-0.195
 independ-0.188
apist-0.183
itable-0.173
Token on
Token ID319
Feature activation+5.09e-02

Pos logit contributions
ctic+3.273
20439+3.107
+3.063
asia+3.047
iron+2.989
Neg logit contributions
emouth-4.369
 signific-4.133
 independ-3.949
apist-3.914
itable-3.655
Token August
Token ID2932
Feature activation+7.02e-01

Pos logit contributions
emouth+0.109
 signific+0.103
 independ+0.099
apist+0.097
imental+0.092
Neg logit contributions
ctic-0.087
20439-0.083
-0.082
asia-0.081
iron-0.080
Token 1
Token ID352
Feature activation+1.39e-01

Pos logit contributions
emouth+1.406
 signific+1.313
 independ+1.258
apist+1.223
itable+1.147
Neg logit contributions
ctic-1.314
20439-1.276
-1.243
asia-1.235
iron-1.210
Token on
Token ID319
Feature activation+5.10e-02

Pos logit contributions
emouth+0.277
 signific+0.259
 independ+0.248
apist+0.241
itable+0.226
Neg logit contributions
ctic-0.262
20439-0.255
-0.248
asia-0.246
iron-0.242
Token Facebook
Token ID3203
Feature activation+4.62e-01

Pos logit contributions
emouth+0.106
 signific+0.099
 independ+0.095
apist+0.093
imental+0.087
Neg logit contributions
ctic-0.091
20439-0.088
-0.086
asia-0.085
iron-0.084
Token and
Token ID290
Feature activation+3.84e-02

Pos logit contributions
emouth+0.628
 signific+0.586
 independ+0.561
apist+0.544
itable+0.509
Neg logit contributions
ctic-0.641
20439-0.625
-0.608
asia-0.604
iron-0.593
Token U
Token ID471
Feature activation-7.95e-01

Pos logit contributions
emouth+0.080
 signific+0.074
 independ+0.071
apist+0.070
itable+0.066
Neg logit contributions
ctic-0.069
20439-0.067
-0.065
asia-0.065
iron-0.063
Token.
Token ID13
Feature activation+3.90e-01

Pos logit contributions
ctic+1.360
+1.271
asia+1.268
20439+1.260
FR+1.241
Neg logit contributions
emouth-1.729
 signific-1.611
 independ-1.576
apist-1.522
itable-1.424
TokenS
Token ID50
Feature activation-1.09e-01

Pos logit contributions
emouth+0.827
 signific+0.785
 independ+0.746
apist+0.732
itable+0.689
Neg logit contributions
ctic-0.662
20439-0.652
-0.630
asia-0.621
orget-0.614
Token.
Token ID13
Feature activation+2.41e-01

Pos logit contributions
ctic+0.205
20439+0.202
+0.195
asia+0.193
orget+0.190
Neg logit contributions
emouth-0.213
 signific-0.200
 independ-0.191
apist-0.185
itable-0.174
Token installations
Token ID26162
Feature activation-1.83e+00

Pos logit contributions
emouth+0.514
 signific+0.483
 independ+0.465
apist+0.452
itable+0.427
Neg logit contributions
ctic-0.415
20439-0.404
-0.391
asia-0.388
iron-0.380
Token in
Token ID287
Feature activation-1.31e-01

Pos logit contributions
ctic+3.170
+2.966
20439+2.949
asia+2.929
[/+2.911
Neg logit contributions
emouth-3.896
 signific-3.592
apist-3.466
 independ-3.466
itable-3.229
Token Turkey
Token ID7137
Feature activation-3.87e-01

Pos logit contributions
ctic+0.227
20439+0.218
+0.213
asia+0.213
iron+0.207
Neg logit contributions
emouth-0.281
 signific-0.262
 independ-0.252
apist-0.248
itable-0.233
Token."[
Token ID18161
Feature activation+1.99e-01

Pos logit contributions
ctic+0.646
20439+0.621
+0.605
asia+0.603
iron+0.588
Neg logit contributions
emouth-0.851
 signific-0.799
 independ-0.769
apist-0.752
itable-0.709
Token44
Token ID2598
Feature activation-4.75e-01

Pos logit contributions
emouth+0.441
 signific+0.414
 independ+0.400
apist+0.389
itable+0.368
Neg logit contributions
ctic-0.329
20439-0.318
-0.310
asia-0.307
iron-0.300
Token]
Token ID60
Feature activation-5.91e-01

Pos logit contributions
ctic+0.639
20439+0.613
+0.603
asia+0.594
[/+0.580
Neg logit contributions
emouth-1.211
 signific-1.142
 independ-1.113
apist-1.076
 surpr-1.037
Token farmland
Token ID45723
Feature activation-2.30e-01

Pos logit contributions
ctic+1.249
20439+1.183
asia+1.178
+1.172
orget+1.141
Neg logit contributions
emouth-1.362
 signific-1.262
 independ-1.211
apist-1.197
itable-1.109
Token contaminated
Token ID23543
Feature activation-4.39e-01

Pos logit contributions
ctic+0.452
20439+0.435
+0.427
asia+0.426
iron+0.418
Neg logit contributions
emouth-0.438
 signific-0.407
 independ-0.391
apist-0.382
itable-0.354
Token by
Token ID416
Feature activation-3.49e-02

Pos logit contributions
ctic+0.810
20439+0.781
+0.763
asia+0.759
iron+0.746
Neg logit contributions
emouth-0.888
 signific-0.827
 independ-0.795
apist-0.775
itable-0.726
Token heavy
Token ID4334
Feature activation+7.22e-02

Pos logit contributions
ctic+0.064
20439+0.062
+0.061
asia+0.061
iron+0.059
Neg logit contributions
emouth-0.070
 signific-0.066
 independ-0.063
apist-0.062
itable-0.058
Token metals
Token ID21782
Feature activation-4.65e-01

Pos logit contributions
emouth+0.172
 signific+0.163
 independ+0.157
apist+0.155
itable+0.146
Neg logit contributions
ctic-0.107
20439-0.102
-0.099
asia-0.099
iron-0.096
Token used
Token ID973
Feature activation-1.79e+00

Pos logit contributions
ctic+0.871
20439+0.834
+0.820
asia+0.817
iron+0.801
Neg logit contributions
emouth-0.922
 signific-0.859
 independ-0.827
apist-0.818
itable-0.758
Token to
Token ID284
Feature activation+2.77e-01

Pos logit contributions
ctic+3.241
20439+3.035
asia+3.026
iron+2.998
+2.989
Neg logit contributions
emouth-3.619
 signific-3.391
 independ-3.268
apist-3.264
itable-3.032
Token make
Token ID787
Feature activation+8.22e-01

Pos logit contributions
emouth+0.550
 signific+0.511
 independ+0.491
apist+0.478
itable+0.446
Neg logit contributions
ctic-0.521
20439-0.504
-0.492
asia-0.489
iron-0.483
Token batteries
Token ID13591
Feature activation-1.28e+00

Pos logit contributions
emouth+1.739
 signific+1.645
 independ+1.581
apist+1.519
itable+1.451
Neg logit contributions
ctic-1.428
20439-1.397
-1.345
asia-1.330
orget-1.300
Token,
Token ID11
Feature activation+2.34e-02

Pos logit contributions
ctic+2.486
20439+2.385
+2.327
asia+2.321
iron+2.313
Neg logit contributions
emouth-2.430
 signific-2.273
 independ-2.170
apist-2.143
itable-1.980
Token computer
Token ID3644
Feature activation+3.46e-01

Pos logit contributions
emouth+0.050
 signific+0.047
 independ+0.045
apist+0.044
itable+0.041
Neg logit contributions
ctic-0.041
20439-0.040
-0.039
asia-0.038
iron-0.038
Token liberals
Token ID17194
Feature activation-5.76e-02

Pos logit contributions
emouth+0.018
 signific+0.017
 independ+0.017
apist+0.016
itable+0.015
Neg logit contributions
ctic-0.016
20439-0.015
-0.015
asia-0.015
iron-0.014
Token are
Token ID389
Feature activation-8.50e-02

Pos logit contributions
ctic+0.112
20439+0.107
+0.105
asia+0.105
iron+0.104
Neg logit contributions
emouth-0.111
 signific-0.104
 independ-0.099
apist-0.097
itable-0.090
Token at
Token ID379
Feature activation+3.64e-01

Pos logit contributions
ctic+0.152
20439+0.146
+0.142
asia+0.142
iron+0.139
Neg logit contributions
emouth-0.177
 signific-0.165
 independ-0.158
apist-0.156
itable-0.146
Token war
Token ID1175
Feature activation+1.32e-01

Pos logit contributions
emouth+0.712
 signific+0.665
 independ+0.637
apist+0.619
itable+0.580
Neg logit contributions
ctic-0.695
20439-0.676
-0.657
asia-0.654
iron-0.641
Token with
Token ID351
Feature activation-2.71e-01

Pos logit contributions
emouth+0.254
 signific+0.238
 independ+0.228
apist+0.221
itable+0.207
Neg logit contributions
ctic-0.255
20439-0.250
-0.242
asia-0.241
iron-0.236
Token Christmas
Token ID6786
Feature activation-1.78e+00

Pos logit contributions
ctic+0.450
20439+0.428
+0.421
asia+0.418
iron+0.409
Neg logit contributions
emouth-0.598
 signific-0.557
 independ-0.533
apist-0.528
imental-0.496
Token,
Token ID11
Feature activation+3.19e-02

Pos logit contributions
ctic+3.132
asia+2.969
+2.951
20439+2.919
iron+2.871
Neg logit contributions
emouth-3.710
 signific-3.358
apist-3.291
 independ-3.275
itable-3.062
Token and
Token ID290
Feature activation-6.34e-03

Pos logit contributions
emouth+0.063
 signific+0.059
 independ+0.056
apist+0.055
itable+0.051
Neg logit contributions
ctic-0.060
20439-0.059
-0.057
asia-0.057
iron-0.056
Token that
Token ID326
Feature activation-1.21e-01

Pos logit contributions
ctic+0.012
20439+0.012
+0.011
asia+0.011
iron+0.011
Neg logit contributions
emouth-0.013
 signific-0.012
 independ-0.011
apist-0.011
itable-0.010
Token once
Token ID1752
Feature activation-8.19e-01

Pos logit contributions
ctic+0.225
20439+0.216
+0.211
asia+0.210
iron+0.207
Neg logit contributions
emouth-0.246
 signific-0.228
 independ-0.219
apist-0.214
itable-0.201
Token again
Token ID757
Feature activation-6.03e-02

Pos logit contributions
ctic+1.496
20439+1.410
+1.396
asia+1.393
iron+1.375
Neg logit contributions
emouth-1.678
 signific-1.546
 independ-1.473
apist-1.459
imental-1.370