TOP ACTIVATIONS
MAX = 1.234

You 're pretty much f *** d up , aren 't
existence is confusing f right ening , even for
; return normal ; # endif } If you are curious
sound system . Rated 5 out of 5 by
s what probably drives Michael a back to Caleb
000 . Th ur rock Council said workers had
me later . Rated 5 out of 5 by
use of unique identification ( UID ) number or Aadhaar to
and night ! Rated 5 out of 5 by
excluding studies only reported as abstract s . <|endoftext|> A group
, which would have qu elled demand for Argentine debt ;
crucial , because it both drives home the pain of heart
of teeth from a tr icer at ops . ( FS
central casting to play the dis arming , grandfather ly right
sky s Batman drives a sou ped up Lincoln
at the major Buzz ard offshore field in the North Sea
ill ons Image : Wild fires also raged on the French
sign would be if pl umes of c es ium are
not the only thing that drives them toward developing new Metroid
ivals information F requently updated list of WA Beer

MIN ACTIVATIONS
MIN = -1.367

ible - limit - tr ading - sub s idy -
the basis of ' tr ades '. The real difference between
a price as deep fact ional divisions developed between the youthful
hotel room at the Hy att Reg ency Irvine , November
An ob it uary for a theater employee reported
Anders , " let me count quickly ." P ause .
, Liam , Stella , Hatt ie and Finn , and
YOUR bed ?" she te ases . I tried
The ob it uary reported that Danny Daniels ,
chain . M ounds of tail ings from mineral
. D ry Hop 's beer names give me
chants of Res ign ! Res ign !
, and they are not accurately rendered as Bra ille numbers
Customer : * men acing * I
P ilar among the az ure channels and white - sand
front and center . Dec ades ago , scholars demolished claims
. youtube . com / channel / UC _ K k
= NO BLE ( s ounds like definition by example ,
the polls , will cons ign their votes into computers that
4 . Make 6 p att ies . 5

INTERVAL 0.584 - 1.234
CONTAINS 0.299%

sound system . Rated 5 out of 5 by
way I 'm building this ROM , moves / s bin
grappling hook aw es omen ess ! Having done any %
() ' s frame | void * arg | + --------------------
| + -------------------- --+ | void * buf | + --------------------

INTERVAL -0.066 - 0.584
CONTAINS 69.067%

when many politicians shy away from the long suffering of our
believe about the NDP , we re not enemies
Address : Sh ri Pre mb hi ks hu ji
< met ros xual th ug > le l m 2
see what booths might be worth waiting on for your particular

INTERVAL -0.716 - -0.066
CONTAINS 30.568%

It 's making a very powerful statement about the importance of
was be headed by spectators after he had stabbed a player
wondered why she was on the sidelines . M
L ever , like Gibbs , was keen to return home
up becoming my focal point for his outlandish offense and hard

INTERVAL -1.367 - -0.716
CONTAINS 0.065%

Who are the Anti - Author itarian Movement ( AK )
gran ularity Con currency , Peters ons solution
chain . M ounds of tail ings from mineral
also that it is optim ised for programs where
Part 4 . S ucker Waves The normal
Token You
Token ID921
Feature activation+6.35e-02

Pos logit contributions
hower+0.119
 Synd+0.109
 conduc+0.107
 NIC+0.105
 Sting+0.104
Neg logit contributions
umenthal-0.123
utra-0.119
redit-0.115
��-0.110
ounty-0.110
Token're
Token ID821
Feature activation+3.78e-01

Pos logit contributions
hower+0.116
 Synd+0.107
 conduc+0.103
 Sting+0.103
 NIC+0.103
Neg logit contributions
umenthal-0.118
utra-0.114
redit-0.112
ounty-0.105
ople-0.105
Token pretty
Token ID2495
Feature activation-2.21e-01

Pos logit contributions
hower+0.664
 Synd+0.604
 conduc+0.586
 Sting+0.584
 NIC+0.583
Neg logit contributions
umenthal-0.729
utra-0.700
redit-0.688
ople-0.651
ounty-0.648
Token much
Token ID881
Feature activation+4.14e-02

Pos logit contributions
umenthal+0.399
utra+0.384
redit+0.370
ople+0.356
��+0.354
Neg logit contributions
hower-0.415
 Synd-0.378
 conduc-0.369
 NIC-0.366
 Spac-0.366
Token f
Token ID277
Feature activation+1.98e-01

Pos logit contributions
hower+0.075
 Synd+0.068
 conduc+0.066
 NIC+0.066
 Sting+0.065
Neg logit contributions
umenthal-0.077
utra-0.074
redit-0.074
ople-0.069
��-0.069
Token***
Token ID8162
Feature activation+1.23e+00

Pos logit contributions
hower+0.395
 Synd+0.375
 NIC+0.360
 Sting+0.360
 conduc+0.358
Neg logit contributions
umenthal-0.339
utra-0.322
redit-0.313
ounty-0.297
��-0.297
Tokend
Token ID67
Feature activation-1.08e-01

Pos logit contributions
hower+2.100
 Synd+1.975
 NIC+1.911
 Sting+1.899
 Spac+1.874
Neg logit contributions
umenthal-2.461
utra-2.354
redit-2.269
��-2.194
ounty-2.166
Token up
Token ID510
Feature activation-1.21e-02

Pos logit contributions
umenthal+0.225
utra+0.216
redit+0.210
ople+0.203
��+0.201
Neg logit contributions
hower-0.173
 Synd-0.159
 conduc-0.153
 NIC-0.153
 Sting-0.151
Token,
Token ID11
Feature activation+1.25e-01

Pos logit contributions
umenthal+0.026
utra+0.025
redit+0.024
��+0.023
ople+0.023
Neg logit contributions
hower-0.019
 Synd-0.017
 NIC-0.017
 conduc-0.017
 Sting-0.017
Token aren
Token ID3588
Feature activation+1.89e-01

Pos logit contributions
hower+0.206
 Synd+0.188
 NIC+0.181
 conduc+0.180
 Sting+0.179
Neg logit contributions
umenthal-0.256
utra-0.246
redit-0.239
��-0.229
ounty-0.228
Token't
Token ID470
Feature activation+3.91e-02

Pos logit contributions
hower+0.357
 conduc+0.336
 Synd+0.334
 NIC+0.324
 Sting+0.322
Neg logit contributions
umenthal-0.343
utra-0.331
redit-0.315
ople-0.303
ounty-0.302
Token existence
Token ID6224
Feature activation-6.40e-01

Pos logit contributions
hower+0.036
 Synd+0.033
 Sting+0.032
 NIC+0.032
wal+0.032
Neg logit contributions
umenthal-0.041
utra-0.039
redit-0.037
��-0.036
ople-0.036
Token is
Token ID318
Feature activation+4.53e-02

Pos logit contributions
umenthal+1.330
utra+1.300
redit+1.244
ounty+1.195
��+1.192
Neg logit contributions
hower-1.020
 Synd-0.919
 NIC-0.889
 Sting-0.886
 Spac-0.880
Token confusing
Token ID15337
Feature activation-2.31e-01

Pos logit contributions
hower+0.079
 Synd+0.071
 NIC+0.069
 Sting+0.069
 conduc+0.068
Neg logit contributions
umenthal-0.089
utra-0.087
redit-0.084
��-0.081
ounty-0.080
Token
Token ID1906
Feature activation-2.46e-01

Pos logit contributions
umenthal+0.461
utra+0.445
redit+0.428
��+0.416
ople+0.413
Neg logit contributions
hower-0.395
 Synd-0.354
 conduc-0.347
 Sting-0.345
 NIC-0.344
Tokenf
Token ID69
Feature activation+1.98e-01

Pos logit contributions
umenthal+0.459
utra+0.442
redit+0.424
��+0.410
ople+0.405
Neg logit contributions
hower-0.446
 Synd-0.414
 NIC-0.402
 Sting-0.402
 conduc-0.402
Tokenright
Token ID3506
Feature activation+1.23e+00

Pos logit contributions
hower+0.402
 Synd+0.383
 Sting+0.370
 NIC+0.367
 conduc+0.366
Neg logit contributions
umenthal-0.333
utra-0.316
redit-0.304
��-0.291
ounty-0.289
Tokenening
Token ID3101
Feature activation+2.26e-01

Pos logit contributions
hower+2.452
 Synd+2.330
 NIC+2.264
 Sting+2.252
 Spac+2.227
Neg logit contributions
umenthal-2.100
utra-1.994
redit-1.909
��-1.836
ounty-1.805
Token,
Token ID11
Feature activation+1.65e-01

Pos logit contributions
hower+0.354
 Synd+0.320
 Sting+0.311
 NIC+0.310
 conduc+0.308
Neg logit contributions
umenthal-0.483
utra-0.464
redit-0.448
��-0.437
ople-0.432
Token even
Token ID772
Feature activation+7.14e-02

Pos logit contributions
hower+0.288
 Synd+0.262
 NIC+0.253
 Sting+0.253
 conduc+0.251
Neg logit contributions
umenthal-0.323
utra-0.309
redit-0.300
��-0.290
ople-0.287
Token
Token ID1906
Feature activation-3.72e-01

Pos logit contributions
hower+0.105
 Synd+0.093
 NIC+0.090
 Sting+0.089
 conduc+0.089
Neg logit contributions
umenthal-0.159
utra-0.153
redit-0.149
��-0.146
ounty-0.144
Tokenfor
Token ID1640
Feature activation+2.11e-01

Pos logit contributions
umenthal+0.716
utra+0.690
redit+0.664
ople+0.641
��+0.639
Neg logit contributions
hower-0.650
 Synd-0.594
 conduc-0.587
 Sting-0.579
 NIC-0.579
Token ;
Token ID2162
Feature activation+6.59e-02

Pos logit contributions
hower+0.057
 Synd+0.054
 NIC+0.052
 Sting+0.052
 Huck+0.051
Neg logit contributions
umenthal-0.059
utra-0.057
redit-0.055
��-0.053
ople-0.052
Token return
Token ID1441
Feature activation+8.31e-01

Pos logit contributions
hower+0.120
 Synd+0.112
 NIC+0.109
 Sting+0.108
 Huck+0.107
Neg logit contributions
umenthal-0.123
utra-0.117
redit-0.113
��-0.109
ople-0.108
Token normal
Token ID3487
Feature activation+1.42e-01

Pos logit contributions
hower+1.566
 Synd+1.480
 NIC+1.437
 Sting+1.429
 Spac+1.411
Neg logit contributions
umenthal-1.506
utra-1.435
redit-1.378
��-1.327
ounty-1.308
Token ;
Token ID2162
Feature activation+4.20e-01

Pos logit contributions
hower+0.263
 Synd+0.248
 NIC+0.241
 Sting+0.239
 Spac+0.236
Neg logit contributions
umenthal-0.263
utra-0.251
redit-0.241
��-0.232
ounty-0.229
Token #
Token ID1303
Feature activation+6.09e-01

Pos logit contributions
hower+0.920
 Synd+0.877
 NIC+0.855
 Sting+0.851
 Spac+0.842
Neg logit contributions
umenthal-0.633
utra-0.597
redit-0.568
��-0.543
ounty-0.533
Tokenendif
Token ID32088
Feature activation+1.11e+00

Pos logit contributions
hower+1.179
 Synd+1.118
 NIC+1.085
 Sting+1.078
 conduc+1.068
Neg logit contributions
umenthal-1.065
utra-1.014
redit-0.980
��-0.937
ople-0.923
Token }
Token ID1782
Feature activation+2.60e-01

Pos logit contributions
hower+2.131
 Synd+2.020
 NIC+1.963
 Sting+1.952
 Spac+1.929
Neg logit contributions
umenthal-1.975
utra-1.877
redit-1.801
��-1.733
ounty-1.710
Token If
Token ID1002
Feature activation+2.79e-01

Pos logit contributions
hower+0.500
 Synd+0.473
 NIC+0.460
 Sting+0.459
 Huck+0.453
Neg logit contributions
umenthal-0.457
utra-0.438
redit-0.418
��-0.404
ople-0.399
Token you
Token ID345
Feature activation-1.70e-02

Pos logit contributions
hower+0.459
 Synd+0.428
 NIC+0.413
 Sting+0.411
 Spac+0.405
Neg logit contributions
umenthal-0.571
utra-0.550
redit-0.529
��-0.514
ople-0.508
Token are
Token ID389
Feature activation+3.29e-02

Pos logit contributions
umenthal+0.033
utra+0.032
redit+0.031
ople+0.029
ounty+0.029
Neg logit contributions
hower-0.030
 Synd-0.028
 Sting-0.026
wal-0.026
 conduc-0.026
Token curious
Token ID11040
Feature activation+2.42e-01

Pos logit contributions
hower+0.056
 Synd+0.051
 Sting+0.049
 conduc+0.049
 NIC+0.049
Neg logit contributions
umenthal-0.065
utra-0.063
redit-0.060
ople-0.058
��-0.058
Token sound
Token ID2128
Feature activation-3.06e-01

Pos logit contributions
hower+0.134
 Synd+0.124
 NIC+0.119
 Sting+0.119
wal+0.119
Neg logit contributions
umenthal-0.149
utra-0.144
redit-0.140
��-0.134
ople-0.133
Token system
Token ID1080
Feature activation-2.55e-02

Pos logit contributions
umenthal+0.523
utra+0.499
redit+0.483
��+0.459
ople+0.455
Neg logit contributions
hower-0.596
 Synd-0.570
 Sting-0.552
 Spac-0.550
 NIC-0.547
Token.
Token ID13
Feature activation+3.35e-02

Pos logit contributions
umenthal+0.054
utra+0.052
redit+0.051
��+0.049
ople+0.049
Neg logit contributions
hower-0.040
 Synd-0.037
 NIC-0.035
 Sting-0.035
 Spac-0.035
Token
Token ID198
Feature activation-1.57e-01

Pos logit contributions
hower+0.059
 Synd+0.055
 NIC+0.053
 conduc+0.052
 Sting+0.052
Neg logit contributions
umenthal-0.065
utra-0.062
redit-0.060
��-0.058
ople-0.057
Token
Token ID198
Feature activation-1.19e-01

Pos logit contributions
umenthal+0.304
utra+0.299
redit+0.283
ople+0.276
ounty+0.275
Neg logit contributions
hower-0.275
 conduc-0.269
 Synd-0.247
 Huck-0.239
 Sting-0.239
TokenRated
Token ID15322
Feature activation+1.09e+00

Pos logit contributions
umenthal+0.190
utra+0.185
redit+0.176
ounty+0.167
ople+0.166
Neg logit contributions
hower-0.249
 conduc-0.239
 Synd-0.233
 Sting-0.228
 NIC-0.227
Token 5
Token ID642
Feature activation-2.10e-01

Pos logit contributions
hower+2.092
 Synd+1.983
 NIC+1.926
 Sting+1.916
 Spac+1.893
Neg logit contributions
umenthal-1.934
utra-1.839
redit-1.764
��-1.698
ounty-1.674
Token out
Token ID503
Feature activation-2.03e-01

Pos logit contributions
umenthal+0.427
utra+0.422
redit+0.407
yx+0.392
ople+0.390
Neg logit contributions
hower-0.343
 conduc-0.331
 Synd-0.303
 Huck-0.297
 Sting-0.293
Token of
Token ID286
Feature activation-9.50e-02

Pos logit contributions
umenthal+0.377
utra+0.369
redit+0.349
ople+0.337
ounty+0.333
Neg logit contributions
hower-0.371
 conduc-0.357
 Synd-0.341
 NIC-0.329
 Sting-0.328
Token 5
Token ID642
Feature activation-1.95e-01

Pos logit contributions
umenthal+0.145
utra+0.140
redit+0.133
ople+0.128
ounty+0.126
Neg logit contributions
hower-0.204
 conduc-0.195
 Synd-0.191
 Sting-0.185
 Huck-0.185
Token by
Token ID416
Feature activation-1.77e-01

Pos logit contributions
umenthal+0.413
utra+0.410
redit+0.394
ople+0.383
yx+0.379
Neg logit contributions
hower-0.305
 conduc-0.297
 Synd-0.268
 NIC-0.257
 Sting-0.257
Token
Token ID447
Feature activation-3.68e-02

Pos logit contributions
hower+0.058
 Synd+0.054
 NIC+0.052
 Sting+0.052
 Spac+0.051
Neg logit contributions
umenthal-0.074
utra-0.071
redit-0.068
��-0.066
ounty-0.066
Token
Token ID247
Feature activation-6.35e-02

Pos logit contributions
umenthal+0.059
utra+0.056
redit+0.054
��+0.052
ounty+0.051
Neg logit contributions
hower-0.077
 Synd-0.074
 NIC-0.072
 conduc-0.072
 Sting-0.071
Tokens
Token ID82
Feature activation+5.18e-02

Pos logit contributions
umenthal+0.116
utra+0.112
redit+0.107
ople+0.102
ounty+0.102
Neg logit contributions
hower-0.118
 Synd-0.111
 NIC-0.108
 Sting-0.107
 conduc-0.106
Token what
Token ID644
Feature activation+1.11e-02

Pos logit contributions
hower+0.090
 Synd+0.081
 conduc+0.079
wal+0.078
 NIC+0.078
Neg logit contributions
umenthal-0.101
utra-0.100
redit-0.094
��-0.090
ounty-0.090
Token probably
Token ID2192
Feature activation+3.19e-01

Pos logit contributions
hower+0.018
 Synd+0.016
 conduc+0.015
 Sting+0.015
wal+0.015
Neg logit contributions
umenthal-0.024
utra-0.023
redit-0.022
ople-0.021
��-0.021
Token drives
Token ID10182
Feature activation+1.09e+00

Pos logit contributions
hower+0.482
 Synd+0.435
 NIC+0.417
 Sting+0.410
 Spac+0.408
Neg logit contributions
umenthal-0.704
utra-0.679
redit-0.656
ople-0.633
ounty-0.629
Token Michael
Token ID3899
Feature activation-3.74e-02

Pos logit contributions
hower+1.749
 Synd+1.583
 NIC+1.521
 conduc+1.520
 Sting+1.517
Neg logit contributions
umenthal-2.289
utra-2.206
redit-2.125
��-2.046
ounty-2.043
Tokena
Token ID64
Feature activation+1.83e-01

Pos logit contributions
umenthal+0.075
utra+0.072
redit+0.069
��+0.067
ople+0.067
Neg logit contributions
hower-0.063
 Synd-0.058
 conduc-0.056
 NIC-0.056
 Sting-0.055
Token back
Token ID736
Feature activation+2.03e-01

Pos logit contributions
hower+0.323
 Synd+0.299
 NIC+0.289
 conduc+0.288
 Sting+0.286
Neg logit contributions
umenthal-0.355
utra-0.341
redit-0.325
��-0.316
ople-0.315
Token to
Token ID284
Feature activation-8.13e-02

Pos logit contributions
hower+0.332
 Synd+0.310
 NIC+0.300
 Sting+0.297
 conduc+0.297
Neg logit contributions
umenthal-0.419
utra-0.405
redit-0.389
��-0.374
ople-0.374
Token Caleb
Token ID40777
Feature activation-2.83e-01

Pos logit contributions
umenthal+0.163
utra+0.157
redit+0.152
��+0.146
ounty+0.146
Neg logit contributions
hower-0.139
 Synd-0.123
 conduc-0.121
 Sting-0.119
wal-0.119
Token000
Token ID830
Feature activation-2.88e-01

Pos logit contributions
umenthal+0.278
utra+0.265
redit+0.256
��+0.247
ounty+0.244
Neg logit contributions
hower-0.248
 Synd-0.233
 NIC-0.226
 Sting-0.224
 Spac-0.222
Token.
Token ID13
Feature activation-1.21e-01

Pos logit contributions
umenthal+0.597
redit+0.569
utra+0.568
ounty+0.537
ople+0.534
Neg logit contributions
hower-0.462
 Synd-0.426
 conduc-0.413
 NIC-0.405
 Sting-0.404
Token
Token ID198
Feature activation+5.19e-02

Pos logit contributions
umenthal+0.247
utra+0.238
redit+0.230
ople+0.223
ounty+0.222
Neg logit contributions
hower-0.197
 conduc-0.183
 Synd-0.180
 NIC-0.173
wal-0.171
Token
Token ID198
Feature activation+3.09e-02

Pos logit contributions
hower+0.089
 conduc+0.086
 Synd+0.080
 Huck+0.077
 NIC+0.077
Neg logit contributions
umenthal-0.103
utra-0.101
redit-0.096
ople-0.094
ounty-0.093
TokenTh
Token ID817
Feature activation-4.87e-02

Pos logit contributions
hower+0.061
 Synd+0.057
 NIC+0.056
 Sting+0.055
 conduc+0.055
Neg logit contributions
umenthal-0.054
utra-0.051
redit-0.049
��-0.047
ounty-0.047
Tokenur
Token ID333
Feature activation+1.06e+00

Pos logit contributions
umenthal+0.075
utra+0.071
redit+0.068
ople+0.064
ounty+0.064
Neg logit contributions
hower-0.107
 Synd-0.101
 NIC-0.098
 Sting-0.098
 conduc-0.097
Tokenrock
Token ID10823
Feature activation+3.26e-01

Pos logit contributions
hower+2.165
 Synd+2.057
 NIC+2.001
 Sting+1.991
 Spac+1.969
Neg logit contributions
umenthal-1.771
utra-1.679
redit-1.606
��-1.541
ounty-1.517
Token Council
Token ID4281
Feature activation+4.60e-01

Pos logit contributions
hower+0.524
 Synd+0.490
 NIC+0.473
 Sting+0.468
 conduc+0.462
Neg logit contributions
umenthal-0.680
utra-0.651
redit-0.634
��-0.609
ounty-0.605
Token said
Token ID531
Feature activation+2.68e-01

Pos logit contributions
hower+0.741
 Synd+0.692
 NIC+0.667
 Sting+0.661
 Spac+0.651
Neg logit contributions
umenthal-0.957
utra-0.918
redit-0.896
��-0.862
ounty-0.854
Token workers
Token ID3259
Feature activation+1.82e-01

Pos logit contributions
hower+0.448
 Synd+0.417
 NIC+0.401
 Sting+0.399
 Spac+0.394
Neg logit contributions
umenthal-0.542
utra-0.520
redit-0.503
��-0.484
ounty-0.481
Token had
Token ID550
Feature activation+1.31e-01

Pos logit contributions
hower+0.328
 Synd+0.309
 NIC+0.298
 Sting+0.296
 Spac+0.292
Neg logit contributions
umenthal-0.342
utra-0.329
redit-0.319
��-0.303
ounty-0.302
Token me
Token ID502
Feature activation-1.15e-01

Pos logit contributions
hower+0.305
 Synd+0.282
 NIC+0.271
 conduc+0.270
 Sting+0.269
Neg logit contributions
umenthal-0.385
utra-0.366
redit-0.356
��-0.340
ople-0.338
Token later
Token ID1568
Feature activation-1.89e-01

Pos logit contributions
umenthal+0.232
utra+0.225
redit+0.215
ople+0.206
��+0.205
Neg logit contributions
hower-0.192
 Synd-0.177
 NIC-0.172
 conduc-0.171
 Sting-0.170
Token.
Token ID13
Feature activation+4.33e-02

Pos logit contributions
umenthal+0.415
utra+0.399
redit+0.385
��+0.371
ople+0.370
Neg logit contributions
hower-0.281
 Synd-0.259
 NIC-0.251
 conduc-0.248
 Sting-0.247
Token
Token ID198
Feature activation-9.43e-02

Pos logit contributions
hower+0.074
 Synd+0.069
 NIC+0.066
 conduc+0.066
 Sting+0.066
Neg logit contributions
umenthal-0.086
utra-0.082
redit-0.079
��-0.077
ople-0.076
Token
Token ID198
Feature activation-8.80e-02

Pos logit contributions
umenthal+0.200
utra+0.192
redit+0.186
��+0.180
ounty+0.178
Neg logit contributions
hower-0.148
 Synd-0.138
 conduc-0.133
 NIC-0.132
 Sting-0.132
TokenRated
Token ID15322
Feature activation+1.06e+00

Pos logit contributions
umenthal+0.149
utra+0.142
redit+0.136
��+0.130
ounty+0.129
Neg logit contributions
hower-0.177
 Synd-0.167
 NIC-0.162
 Sting-0.161
 conduc-0.161
Token 5
Token ID642
Feature activation-2.24e-01

Pos logit contributions
hower+2.050
 Synd+1.938
 NIC+1.883
 Sting+1.873
 Spac+1.851
Neg logit contributions
umenthal-1.872
utra-1.781
redit-1.709
��-1.645
ounty-1.619
Token out
Token ID503
Feature activation-2.25e-01

Pos logit contributions
umenthal+0.466
utra+0.456
redit+0.443
��+0.425
ople+0.422
Neg logit contributions
hower-0.355
 conduc-0.331
 Synd-0.322
 Huck-0.310
 Sting-0.307
Token of
Token ID286
Feature activation-1.72e-01

Pos logit contributions
umenthal+0.425
utra+0.415
redit+0.392
ople+0.379
ounty+0.375
Neg logit contributions
hower-0.406
 conduc-0.385
 Synd-0.374
 NIC-0.360
 Sting-0.359
Token 5
Token ID642
Feature activation-2.79e-01

Pos logit contributions
umenthal+0.262
utra+0.254
redit+0.243
ople+0.232
ounty+0.228
Neg logit contributions
hower-0.371
 conduc-0.348
 Synd-0.346
 Sting-0.334
 NIC-0.332
Token by
Token ID416
Feature activation-2.96e-01

Pos logit contributions
umenthal+0.573
utra+0.570
redit+0.553
ople+0.534
yx+0.523
Neg logit contributions
hower-0.452
 conduc-0.436
 Synd-0.397
 Huck-0.377
 Sting-0.377
Token use
Token ID779
Feature activation-1.74e-01

Pos logit contributions
hower+0.484
 Synd+0.441
 Sting+0.424
 Huck+0.423
 NIC+0.422
Neg logit contributions
umenthal-0.648
utra-0.629
redit-0.609
ounty-0.582
��-0.582
Token of
Token ID286
Feature activation+5.21e-02

Pos logit contributions
umenthal+0.366
utra+0.349
redit+0.338
��+0.327
ounty+0.323
Neg logit contributions
hower-0.278
 Synd-0.262
 NIC-0.254
 Sting-0.251
 Spac-0.247
Token unique
Token ID3748
Feature activation+3.59e-01

Pos logit contributions
hower+0.084
 Synd+0.076
 Huck+0.073
 Spac+0.073
 NIC+0.072
Neg logit contributions
umenthal-0.109
utra-0.105
redit-0.102
��-0.097
ounty-0.097
Token identification
Token ID11795
Feature activation+5.74e-01

Pos logit contributions
hower+0.651
 Synd+0.597
 Sting+0.575
 NIC+0.575
 Huck+0.574
Neg logit contributions
umenthal-0.682
utra-0.654
redit-0.635
��-0.607
ounty-0.603
Token (
Token ID357
Feature activation+2.12e-02

Pos logit contributions
hower+1.069
 Synd+0.991
 NIC+0.951
 Sting+0.949
 Spac+0.944
Neg logit contributions
umenthal-1.055
utra-1.019
redit-0.985
��-0.939
ounty-0.931
TokenUID
Token ID27586
Feature activation+1.03e+00

Pos logit contributions
hower+0.044
 Synd+0.042
 Spac+0.041
 NIC+0.041
 Sting+0.041
Neg logit contributions
umenthal-0.034
utra-0.033
redit-0.031
��-0.030
ounty-0.029
Token)
Token ID8
Feature activation+1.20e-01

Pos logit contributions
hower+1.150
 Synd+1.086
 NIC+1.022
 Sting+1.019
wal+0.992
Neg logit contributions
umenthal-2.649
utra-2.557
redit-2.460
��-2.419
ople-2.386
Token number
Token ID1271
Feature activation+2.42e-01

Pos logit contributions
hower+0.205
 Synd+0.188
 NIC+0.180
 Spac+0.180
 Huck+0.179
Neg logit contributions
umenthal-0.239
utra-0.231
redit-0.226
ounty-0.214
��-0.214
Token or
Token ID393
Feature activation-2.98e-02

Pos logit contributions
hower+0.410
 Synd+0.375
 NIC+0.361
 Huck+0.358
 Sting+0.358
Neg logit contributions
umenthal-0.482
utra-0.469
redit-0.454
��-0.434
ounty-0.433
Token Aadhaar
Token ID47727
Feature activation+5.83e-01

Pos logit contributions
umenthal+0.059
utra+0.058
redit+0.056
��+0.053
ounty+0.053
Neg logit contributions
hower-0.051
 Synd-0.047
 Spac-0.044
 Huck-0.044
 Sting-0.044
Token to
Token ID284
Feature activation+1.01e-01

Pos logit contributions
hower+1.099
 Synd+1.024
 NIC+0.990
 Sting+0.985
 Spac+0.980
Neg logit contributions
umenthal-1.059
utra-1.020
redit-0.982
ounty-0.935
��-0.933
Token and
Token ID290
Feature activation-1.01e-02

Pos logit contributions
umenthal+0.311
utra+0.298
redit+0.285
ople+0.273
��+0.273
Neg logit contributions
hower-0.264
 Synd-0.248
 Sting-0.237
 NIC-0.237
 Spac-0.235
Token night
Token ID1755
Feature activation-1.29e-01

Pos logit contributions
umenthal+0.020
utra+0.019
redit+0.018
ople+0.017
ounty+0.017
Neg logit contributions
hower-0.018
 Synd-0.016
 conduc-0.016
 Sting-0.016
 NIC-0.016
Token!
Token ID0
Feature activation-1.61e-01

Pos logit contributions
umenthal+0.273
utra+0.262
redit+0.252
ople+0.243
��+0.243
Neg logit contributions
hower-0.200
 Synd-0.189
 NIC-0.180
 Sting-0.179
 conduc-0.178
Token
Token ID198
Feature activation-3.33e-01

Pos logit contributions
umenthal+0.320
utra+0.307
redit+0.296
��+0.285
ople+0.282
Neg logit contributions
hower-0.272
 Synd-0.254
 NIC-0.244
 Sting-0.242
 conduc-0.242
Token
Token ID198
Feature activation-2.86e-01

Pos logit contributions
umenthal+0.635
utra+0.628
redit+0.594
ople+0.578
ounty+0.576
Neg logit contributions
hower-0.591
 conduc-0.586
 Synd-0.530
 Huck-0.514
 Sting-0.512
TokenRated
Token ID15322
Feature activation+9.99e-01

Pos logit contributions
umenthal+0.450
utra+0.440
redit+0.415
ounty+0.395
ople+0.393
Neg logit contributions
hower-0.608
 conduc-0.589
 Synd-0.567
 Sting-0.557
 NIC-0.551
Token 5
Token ID642
Feature activation-2.37e-01

Pos logit contributions
hower+1.930
 Synd+1.829
 NIC+1.777
 Sting+1.768
 Spac+1.747
Neg logit contributions
umenthal-1.763
utra-1.677
redit-1.608
��-1.547
ounty-1.525
Token out
Token ID503
Feature activation-1.98e-01

Pos logit contributions
umenthal+0.482
utra+0.477
redit+0.460
yx+0.444
ople+0.440
Neg logit contributions
hower-0.385
 conduc-0.372
 Synd-0.342
 Huck-0.334
 Sting-0.330
Token of
Token ID286
Feature activation-1.62e-01

Pos logit contributions
umenthal+0.367
utra+0.360
redit+0.340
ople+0.328
ounty+0.325
Neg logit contributions
hower-0.363
 conduc-0.350
 Synd-0.333
 NIC-0.322
 Sting-0.321
Token 5
Token ID642
Feature activation-2.25e-01

Pos logit contributions
umenthal+0.244
utra+0.236
redit+0.224
ople+0.215
ounty+0.212
Neg logit contributions
hower-0.348
 conduc-0.334
 Synd-0.326
 Sting-0.316
 Huck-0.315
Token by
Token ID416
Feature activation-1.93e-01

Pos logit contributions
umenthal+0.478
utra+0.473
redit+0.454
ople+0.441
yx+0.438
Neg logit contributions
hower-0.350
 conduc-0.340
 Synd-0.307
 NIC-0.296
 Sting-0.295
Token excluding
Token ID23494
Feature activation+1.49e-01

Pos logit contributions
umenthal+0.339
utra+0.325
redit+0.314
ople+0.304
ounty+0.302
Neg logit contributions
hower-0.290
 Synd-0.268
 Huck-0.259
 NIC-0.258
 Sting-0.256
Token studies
Token ID3640
Feature activation+5.68e-01

Pos logit contributions
hower+0.259
 Synd+0.235
 Sting+0.228
wal+0.227
 conduc+0.226
Neg logit contributions
umenthal-0.295
utra-0.281
redit-0.273
��-0.261
ople-0.260
Token only
Token ID691
Feature activation+2.86e-01

Pos logit contributions
hower+0.938
 Synd+0.873
 Sting+0.845
 NIC+0.841
 Huck+0.832
Neg logit contributions
umenthal-1.158
utra-1.115
redit-1.082
��-1.035
ounty-1.030
Token reported
Token ID2098
Feature activation+3.94e-01

Pos logit contributions
hower+0.480
 Synd+0.444
 Sting+0.430
 NIC+0.428
 Huck+0.425
Neg logit contributions
umenthal-0.579
utra-0.557
redit-0.537
ounty-0.516
��-0.516
Token as
Token ID355
Feature activation+1.28e-01

Pos logit contributions
hower+0.648
 Synd+0.592
 Sting+0.579
 Spac+0.572
 Huck+0.572
Neg logit contributions
umenthal-0.812
utra-0.785
redit-0.749
ounty-0.723
��-0.721
Token abstract
Token ID12531
Feature activation+9.76e-01

Pos logit contributions
hower+0.216
 Synd+0.196
 Sting+0.191
wal+0.189
 Spac+0.189
Neg logit contributions
umenthal-0.259
utra-0.251
redit-0.238
��-0.231
ounty-0.228
Tokens
Token ID82
Feature activation+3.56e-01

Pos logit contributions
hower+1.654
 Synd+1.543
 NIC+1.490
 Sting+1.487
 Spac+1.468
Neg logit contributions
umenthal-1.963
utra-1.879
redit-1.813
��-1.757
ounty-1.736
Token.
Token ID13
Feature activation+1.89e-01

Pos logit contributions
hower+0.578
 Synd+0.534
 Sting+0.517
 NIC+0.515
 conduc+0.511
Neg logit contributions
umenthal-0.739
utra-0.713
redit-0.685
��-0.663
ounty-0.658
Token<|endoftext|>
Token ID50256
Feature activation+8.27e-03

Pos logit contributions
hower+0.349
 Synd+0.317
 conduc+0.315
 NIC+0.308
wal+0.307
Neg logit contributions
umenthal-0.351
utra-0.336
redit-0.324
ople-0.310
ounty-0.309
TokenA
Token ID32
Feature activation+1.98e-01

Pos logit contributions
hower+0.016
 Synd+0.015
 NIC+0.015
 Sting+0.015
 conduc+0.015
Neg logit contributions
umenthal-0.014
utra-0.014
redit-0.013
��-0.013
ounty-0.012
Token group
Token ID1448
Feature activation-6.33e-03

Pos logit contributions
hower+0.344
 Synd+0.322
 NIC+0.310
 Sting+0.308
 conduc+0.305
Neg logit contributions
umenthal-0.389
utra-0.373
redit-0.357
��-0.345
ounty-0.343
Token,
Token ID11
Feature activation-1.78e-02

Pos logit contributions
hower+0.235
 Synd+0.218
 NIC+0.210
 Sting+0.210
 conduc+0.208
Neg logit contributions
umenthal-0.278
utra-0.270
redit-0.258
ounty-0.246
��-0.246
Token which
Token ID543
Feature activation-1.02e-01

Pos logit contributions
umenthal+0.033
utra+0.031
redit+0.030
��+0.029
ounty+0.029
Neg logit contributions
hower-0.033
 Synd-0.031
 Sting-0.030
 NIC-0.030
 conduc-0.030
Token would
Token ID561
Feature activation+1.45e-02

Pos logit contributions
umenthal+0.202
utra+0.195
redit+0.189
��+0.179
ople+0.178
Neg logit contributions
hower-0.175
 Synd-0.162
 Sting-0.155
 NIC-0.154
wal-0.154
Token have
Token ID423
Feature activation-6.46e-02

Pos logit contributions
hower+0.026
 Synd+0.024
 NIC+0.023
 Sting+0.023
wal+0.023
Neg logit contributions
umenthal-0.027
utra-0.027
redit-0.026
ople-0.025
ounty-0.025
Token qu
Token ID627
Feature activation+2.04e-01

Pos logit contributions
umenthal+0.120
utra+0.118
redit+0.114
��+0.107
ople+0.107
Neg logit contributions
hower-0.117
 Synd-0.108
 NIC-0.105
 Sting-0.104
wal-0.104
Tokenelled
Token ID11978
Feature activation+9.53e-01

Pos logit contributions
hower+0.359
 Synd+0.335
 Sting+0.325
 conduc+0.325
 NIC+0.324
Neg logit contributions
umenthal-0.400
utra-0.385
redit-0.368
ounty-0.353
ople-0.351
Token demand
Token ID3512
Feature activation+2.20e-01

Pos logit contributions
hower+1.487
 Synd+1.371
 NIC+1.319
 Sting+1.317
 Spac+1.300
Neg logit contributions
umenthal-2.041
utra-1.971
redit-1.903
��-1.840
ople-1.818
Token for
Token ID329
Feature activation-1.03e-01

Pos logit contributions
hower+0.360
 Synd+0.333
 NIC+0.324
 Sting+0.320
 Spac+0.316
Neg logit contributions
umenthal-0.451
utra-0.437
redit-0.421
ople-0.404
ounty-0.400
Token Argentine
Token ID36399
Feature activation+3.86e-03

Pos logit contributions
umenthal+0.202
utra+0.195
redit+0.191
��+0.180
ounty+0.178
Neg logit contributions
hower-0.180
 Synd-0.161
 Sting-0.157
 conduc-0.156
 Spac-0.155
Token debt
Token ID5057
Feature activation-2.04e-01

Pos logit contributions
hower+0.007
 Synd+0.006
 Sting+0.006
 NIC+0.006
 conduc+0.006
Neg logit contributions
umenthal-0.007
utra-0.007
redit-0.007
��-0.006
ounty-0.006
Token;
Token ID26
Feature activation+8.68e-02

Pos logit contributions
umenthal+0.421
utra+0.412
redit+0.394
��+0.376
ople+0.376
Neg logit contributions
hower-0.331
 Synd-0.300
 NIC-0.296
 Sting-0.294
 conduc-0.286
Token crucial
Token ID8780
Feature activation-1.83e-01

Pos logit contributions
hower+0.243
 Synd+0.225
 NIC+0.215
 conduc+0.214
wal+0.214
Neg logit contributions
umenthal-0.294
utra-0.289
redit-0.275
��-0.263
ople-0.261
Token,
Token ID11
Feature activation+1.30e-01

Pos logit contributions
umenthal+0.379
utra+0.368
redit+0.352
��+0.340
ounty+0.339
Neg logit contributions
hower-0.297
 Synd-0.272
 conduc-0.262
 NIC-0.260
 Sting-0.258
Token because
Token ID780
Feature activation+4.03e-02

Pos logit contributions
hower+0.208
 Synd+0.191
 conduc+0.183
 NIC+0.183
 Sting+0.182
Neg logit contributions
umenthal-0.275
utra-0.264
redit-0.256
��-0.246
ounty-0.245
Token it
Token ID340
Feature activation+1.67e-01

Pos logit contributions
hower+0.068
 Synd+0.062
 conduc+0.061
wal+0.060
 NIC+0.060
Neg logit contributions
umenthal-0.080
utra-0.077
redit-0.075
ople-0.072
��-0.071
Token both
Token ID1111
Feature activation+6.77e-02

Pos logit contributions
hower+0.273
 Synd+0.255
 NIC+0.245
 Sting+0.243
 Spac+0.240
Neg logit contributions
umenthal-0.346
utra-0.331
redit-0.321
��-0.307
ople-0.305
Token drives
Token ID10182
Feature activation+9.49e-01

Pos logit contributions
hower+0.106
 Synd+0.098
 NIC+0.094
 Sting+0.094
 Spac+0.092
Neg logit contributions
umenthal-0.145
utra-0.139
redit-0.135
��-0.130
ounty-0.129
Token home
Token ID1363
Feature activation+1.06e-01

Pos logit contributions
hower+1.465
 Synd+1.363
 NIC+1.308
 Sting+1.298
 conduc+1.286
Neg logit contributions
umenthal-2.048
utra-1.972
redit-1.904
��-1.843
ounty-1.823
Token the
Token ID262
Feature activation+9.16e-02

Pos logit contributions
hower+0.168
 Synd+0.156
 NIC+0.149
 Sting+0.149
 conduc+0.148
Neg logit contributions
umenthal-0.222
utra-0.215
redit-0.207
��-0.200
ounty-0.198
Token pain
Token ID2356
Feature activation-2.48e-02

Pos logit contributions
hower+0.158
 Synd+0.147
 NIC+0.140
 Sting+0.139
wal+0.139
Neg logit contributions
umenthal-0.180
utra-0.174
redit-0.167
��-0.162
ounty-0.160
Token of
Token ID286
Feature activation+2.63e-02

Pos logit contributions
umenthal+0.049
utra+0.047
redit+0.045
��+0.044
ounty+0.043
Neg logit contributions
hower-0.042
 Synd-0.039
 NIC-0.038
 conduc-0.038
 Sting-0.038
Token heart
Token ID2612
Feature activation+4.58e-01

Pos logit contributions
hower+0.044
 Synd+0.040
wal+0.039
 conduc+0.038
 Sting+0.038
Neg logit contributions
umenthal-0.053
utra-0.051
redit-0.049
��-0.048
ounty-0.047
Token of
Token ID286
Feature activation+1.34e-02

Pos logit contributions
hower+1.041
 Synd+0.982
 NIC+0.951
 Sting+0.944
 Spac+0.933
Neg logit contributions
umenthal-1.146
utra-1.095
redit-1.055
��-1.017
ounty-1.005
Token teeth
Token ID9941
Feature activation+1.59e-01

Pos logit contributions
hower+0.020
 Synd+0.019
 NIC+0.018
 Sting+0.018
 conduc+0.018
Neg logit contributions
umenthal-0.029
utra-0.028
redit-0.027
��-0.026
ounty-0.026
Token from
Token ID422
Feature activation+3.82e-02

Pos logit contributions
hower+0.267
 Synd+0.251
 NIC+0.243
 Sting+0.239
 conduc+0.238
Neg logit contributions
umenthal-0.318
utra-0.306
redit-0.295
��-0.283
ounty-0.281
Token a
Token ID257
Feature activation-1.03e-01

Pos logit contributions
hower+0.058
 Synd+0.053
 NIC+0.051
 conduc+0.051
 Sting+0.051
Neg logit contributions
umenthal-0.083
utra-0.080
redit-0.078
��-0.075
ounty-0.074
Token tr
Token ID491
Feature activation+3.46e-02

Pos logit contributions
umenthal+0.180
utra+0.172
redit+0.165
��+0.158
ounty+0.156
Neg logit contributions
hower-0.200
 Synd-0.189
 NIC-0.184
 conduc-0.181
 Sting-0.181
Tokenicer
Token ID16647
Feature activation+9.49e-01

Pos logit contributions
hower+0.065
 Synd+0.061
 NIC+0.059
 conduc+0.059
 Sting+0.058
Neg logit contributions
umenthal-0.064
utra-0.061
redit-0.059
ople-0.056
ounty-0.056
Tokenat
Token ID265
Feature activation+5.05e-02

Pos logit contributions
hower+1.544
 Synd+1.452
 NIC+1.408
 Sting+1.386
 conduc+1.381
Neg logit contributions
umenthal-1.967
utra-1.896
redit-1.822
ounty-1.758
��-1.752
Tokenops
Token ID2840
Feature activation-2.35e-01

Pos logit contributions
hower+0.121
 Synd+0.115
 NIC+0.113
 Sting+0.112
 Huck+0.112
Neg logit contributions
umenthal-0.066
utra-0.062
redit-0.059
ople-0.055
ounty-0.055
Token.
Token ID13
Feature activation-5.95e-02

Pos logit contributions
umenthal+0.452
utra+0.439
redit+0.419
ounty+0.399
��+0.399
Neg logit contributions
hower-0.415
 Synd-0.389
 NIC-0.378
 conduc-0.371
wal-0.370
Token (
Token ID357
Feature activation-1.55e-02

Pos logit contributions
umenthal+0.118
utra+0.114
redit+0.109
��+0.105
ople+0.105
Neg logit contributions
hower-0.102
 Synd-0.094
 conduc-0.092
 NIC-0.090
 Sting-0.090
TokenFS
Token ID10652
Feature activation+2.50e-01

Pos logit contributions
umenthal+0.028
utra+0.027
redit+0.026
��+0.025
ounty+0.024
Neg logit contributions
hower-0.029
 Synd-0.027
 conduc-0.026
 Sting-0.026
 NIC-0.026
Token central
Token ID4318
Feature activation-1.00e-01

Pos logit contributions
hower+0.090
 Synd+0.084
 NIC+0.080
 conduc+0.080
 Sting+0.079
Neg logit contributions
umenthal-0.115
utra-0.110
redit-0.107
��-0.103
ople-0.102
Token casting
Token ID13092
Feature activation-4.27e-02

Pos logit contributions
umenthal+0.199
utra+0.188
redit+0.182
��+0.178
ople+0.175
Neg logit contributions
hower-0.169
 Synd-0.160
 NIC-0.154
 Sting-0.152
wal-0.152
Token to
Token ID284
Feature activation+6.27e-02

Pos logit contributions
umenthal+0.085
utra+0.081
redit+0.079
ople+0.075
��+0.075
Neg logit contributions
hower-0.073
 Synd-0.068
 NIC-0.065
 conduc-0.065
 Sting-0.065
Token play
Token ID711
Feature activation+2.25e-01

Pos logit contributions
hower+0.105
 Synd+0.098
 NIC+0.094
 Sting+0.093
 conduc+0.093
Neg logit contributions
umenthal-0.127
utra-0.121
redit-0.118
��-0.113
ople-0.112
Token the
Token ID262
Feature activation+6.73e-02

Pos logit contributions
hower+0.358
 Synd+0.335
 NIC+0.321
 Sting+0.318
 conduc+0.315
Neg logit contributions
umenthal-0.473
utra-0.451
redit-0.438
��-0.422
ople-0.417
Token dis
Token ID595
Feature activation+9.47e-01

Pos logit contributions
hower+0.115
 Synd+0.107
 Sting+0.102
 NIC+0.102
wal+0.101
Neg logit contributions
umenthal-0.134
utra-0.128
redit-0.123
��-0.120
ople-0.118
Tokenarming
Token ID18052
Feature activation+7.41e-01

Pos logit contributions
hower+1.772
 Synd+1.677
 NIC+1.628
 Sting+1.620
 Spac+1.598
Neg logit contributions
umenthal-1.732
utra-1.650
redit-1.585
��-1.524
ounty-1.505
Token,
Token ID11
Feature activation+7.67e-02

Pos logit contributions
hower+1.260
 Synd+1.168
 NIC+1.125
 Sting+1.116
wal+1.108
Neg logit contributions
umenthal-1.488
utra-1.423
redit-1.373
��-1.324
ople-1.309
Token grandfather
Token ID17695
Feature activation+1.33e-01

Pos logit contributions
hower+0.138
 Synd+0.129
 NIC+0.124
 Sting+0.123
 Spac+0.122
Neg logit contributions
umenthal-0.146
utra-0.139
redit-0.134
��-0.129
ople-0.128
Tokenly
Token ID306
Feature activation-4.57e-02

Pos logit contributions
hower+0.274
 Synd+0.260
 NIC+0.253
 Sting+0.252
 conduc+0.250
Neg logit contributions
umenthal-0.215
utra-0.202
redit-0.194
��-0.185
ounty-0.184
Token right
Token ID826
Feature activation+3.54e-01

Pos logit contributions
umenthal+0.092
utra+0.087
redit+0.084
��+0.081
ople+0.080
Neg logit contributions
hower-0.077
 Synd-0.071
 NIC-0.068
wal-0.068
 Sting-0.068
Tokensky
Token ID15688
Feature activation+2.50e-01

Pos logit contributions
hower+0.038
 conduc+0.036
 Synd+0.036
 NIC+0.034
 Sting+0.034
Neg logit contributions
umenthal-0.060
utra-0.058
redit-0.056
ople-0.054
ounty-0.054
Token
Token ID447
Feature activation-3.54e-02

Pos logit contributions
hower+0.442
 Synd+0.417
 NIC+0.404
 Sting+0.401
 conduc+0.397
Neg logit contributions
umenthal-0.478
utra-0.458
redit-0.443
��-0.424
ounty-0.421
Token
Token ID247
Feature activation+9.10e-03

Pos logit contributions
umenthal+0.056
utra+0.053
redit+0.051
��+0.049
ounty+0.048
Neg logit contributions
hower-0.075
 Synd-0.073
 conduc-0.071
 NIC-0.070
 Sting-0.070
Tokens
Token ID82
Feature activation+1.16e-01

Pos logit contributions
hower+0.018
 conduc+0.017
 Synd+0.017
 NIC+0.016
 Sting+0.016
Neg logit contributions
umenthal-0.016
utra-0.015
redit-0.015
ounty-0.014
ople-0.014
Token Batman
Token ID9827
Feature activation+2.70e-01

Pos logit contributions
hower+0.155
 Synd+0.138
 NIC+0.133
 Spac+0.130
 Sting+0.129
Neg logit contributions
umenthal-0.272
utra-0.264
redit-0.258
��-0.249
ounty-0.248
Token drives
Token ID10182
Feature activation+9.45e-01

Pos logit contributions
hower+0.430
 Synd+0.397
 NIC+0.381
 Sting+0.378
 Huck+0.375
Neg logit contributions
umenthal-0.566
utra-0.547
redit-0.530
��-0.511
ounty-0.506
Token a
Token ID257
Feature activation+5.88e-02

Pos logit contributions
hower+1.510
 Synd+1.388
 NIC+1.334
 Sting+1.314
 Spac+1.308
Neg logit contributions
umenthal-1.994
utra-1.930
redit-1.863
��-1.800
ounty-1.782
Token sou
Token ID24049
Feature activation+8.39e-01

Pos logit contributions
hower+0.100
 Synd+0.091
 NIC+0.088
 Spac+0.087
 conduc+0.087
Neg logit contributions
umenthal-0.118
utra-0.113
redit-0.110
��-0.105
ounty-0.104
Tokenped
Token ID9124
Feature activation-3.91e-01

Pos logit contributions
hower+1.530
 Synd+1.444
 NIC+1.400
 Sting+1.393
 Spac+1.374
Neg logit contributions
umenthal-1.571
utra-1.496
redit-1.440
��-1.391
ounty-1.370
Token up
Token ID510
Feature activation-4.84e-02

Pos logit contributions
umenthal+0.733
utra+0.701
redit+0.675
��+0.646
ounty+0.640
Neg logit contributions
hower-0.712
 Synd-0.670
 NIC-0.650
 Sting-0.643
 Spac-0.639
Token Lincoln
Token ID12406
Feature activation+3.50e-01

Pos logit contributions
umenthal+0.099
utra+0.096
redit+0.093
��+0.089
ounty+0.088
Neg logit contributions
hower-0.079
 Synd-0.072
 NIC-0.070
 conduc-0.069
 Spac-0.069
Token at
Token ID379
Feature activation+5.65e-02

Pos logit contributions
hower+0.199
 Synd+0.183
 NIC+0.176
 Sting+0.175
 Spac+0.174
Neg logit contributions
umenthal-0.203
utra-0.200
redit-0.194
ounty-0.179
ople-0.178
Token the
Token ID262
Feature activation+6.70e-02

Pos logit contributions
hower+0.091
 Synd+0.082
 conduc+0.079
 NIC+0.079
 Sting+0.078
Neg logit contributions
umenthal-0.118
utra-0.113
redit-0.112
��-0.105
ounty-0.104
Token major
Token ID1688
Feature activation+5.44e-01

Pos logit contributions
hower+0.114
 Synd+0.102
 conduc+0.098
 Spac+0.097
 NIC+0.097
Neg logit contributions
umenthal-0.134
utra-0.129
redit-0.128
ople-0.120
��-0.120
Token Buzz
Token ID15753
Feature activation+1.21e-01

Pos logit contributions
hower+0.914
 Synd+0.833
 Spac+0.795
 NIC+0.794
 Sting+0.793
Neg logit contributions
umenthal-1.102
utra-1.063
redit-1.038
��-0.987
ounty-0.985
Tokenard
Token ID446
Feature activation-1.51e-01

Pos logit contributions
hower+0.270
 Synd+0.258
 NIC+0.254
 Sting+0.252
 conduc+0.250
Neg logit contributions
umenthal-0.174
utra-0.167
redit-0.160
ople-0.149
��-0.148
Token offshore
Token ID18479
Feature activation+9.37e-01

Pos logit contributions
umenthal+0.297
utra+0.290
redit+0.284
ople+0.271
��+0.266
Neg logit contributions
hower-0.257
 Synd-0.237
 NIC-0.232
 conduc-0.229
 Sting-0.227
Token field
Token ID2214
Feature activation+7.90e-02

Pos logit contributions
hower+1.627
 Synd+1.510
 NIC+1.454
 Sting+1.439
 Spac+1.429
Neg logit contributions
umenthal-1.841
utra-1.772
redit-1.724
��-1.641
ounty-1.639
Token in
Token ID287
Feature activation+8.43e-02

Pos logit contributions
hower+0.132
 Synd+0.122
 NIC+0.116
 Sting+0.115
 conduc+0.114
Neg logit contributions
umenthal-0.160
utra-0.155
redit-0.152
ounty-0.143
ople-0.142
Token the
Token ID262
Feature activation+9.50e-02

Pos logit contributions
hower+0.137
 Synd+0.125
 conduc+0.121
wal+0.120
 NIC+0.120
Neg logit contributions
umenthal-0.175
utra-0.168
redit-0.163
��-0.157
ounty-0.155
Token North
Token ID2258
Feature activation-1.19e-01

Pos logit contributions
hower+0.167
 Synd+0.151
 conduc+0.146
wal+0.146
 NIC+0.145
Neg logit contributions
umenthal-0.186
utra-0.177
redit-0.173
��-0.166
ople-0.165
Token Sea
Token ID6896
Feature activation+3.53e-01

Pos logit contributions
umenthal+0.201
utra+0.191
redit+0.184
��+0.175
ounty+0.173
Neg logit contributions
hower-0.240
 Synd-0.225
 NIC-0.219
 conduc-0.218
 Sting-0.215
Tokenill
Token ID359
Feature activation-2.50e-01

Pos logit contributions
umenthal+0.113
utra+0.109
redit+0.104
ople+0.098
ounty+0.097
Neg logit contributions
hower-0.145
 Synd-0.139
 NIC-0.135
 Sting-0.134
 conduc-0.134
Tokenons
Token ID684
Feature activation-1.52e-01

Pos logit contributions
umenthal+0.428
utra+0.406
redit+0.392
��+0.376
ople+0.369
Neg logit contributions
hower-0.495
 Synd-0.467
 NIC-0.456
 Sting-0.452
 conduc-0.448
Token Image
Token ID7412
Feature activation+7.31e-03

Pos logit contributions
umenthal+0.312
utra+0.303
redit+0.290
ople+0.278
ounty+0.278
Neg logit contributions
hower-0.247
 Synd-0.228
 NIC-0.222
 Sting-0.220
 conduc-0.219
Token:
Token ID25
Feature activation-5.31e-02

Pos logit contributions
hower+0.014
 Synd+0.013
 NIC+0.012
 Sting+0.012
 conduc+0.012
Neg logit contributions
umenthal-0.013
utra-0.013
redit-0.012
��-0.012
ounty-0.012
Token Wild
Token ID6183
Feature activation+2.37e-01

Pos logit contributions
umenthal+0.108
utra+0.104
redit+0.100
��+0.096
ounty+0.095
Neg logit contributions
hower-0.089
 Synd-0.081
 NIC-0.078
 conduc-0.078
 Sting-0.078
Tokenfires
Token ID27312
Feature activation+9.36e-01

Pos logit contributions
hower+0.449
 Synd+0.426
 NIC+0.414
 Sting+0.410
 conduc+0.409
Neg logit contributions
umenthal-0.426
utra-0.406
redit-0.393
��-0.376
ople-0.370
Token also
Token ID635
Feature activation+4.81e-02

Pos logit contributions
hower+1.660
 Synd+1.565
 NIC+1.516
 Sting+1.508
 Spac+1.488
Neg logit contributions
umenthal-1.798
utra-1.717
redit-1.653
��-1.596
ounty-1.575
Token raged
Token ID49000
Feature activation+3.67e-01

Pos logit contributions
hower+0.082
 Synd+0.076
 NIC+0.074
 Sting+0.073
 Spac+0.073
Neg logit contributions
umenthal-0.096
utra-0.092
redit-0.089
ounty-0.085
��-0.084
Token on
Token ID319
Feature activation+1.39e-01

Pos logit contributions
hower+0.618
 Synd+0.567
 NIC+0.554
 conduc+0.547
 Sting+0.547
Neg logit contributions
umenthal-0.740
utra-0.710
redit-0.685
ounty-0.657
ople-0.651
Token the
Token ID262
Feature activation-2.79e-02

Pos logit contributions
hower+0.252
 Synd+0.231
 NIC+0.224
 conduc+0.223
 Sting+0.223
Neg logit contributions
umenthal-0.262
utra-0.250
redit-0.243
��-0.231
ounty-0.230
Token French
Token ID4141
Feature activation+3.57e-01

Pos logit contributions
umenthal+0.055
utra+0.052
redit+0.051
��+0.048
ounty+0.048
Neg logit contributions
hower-0.048
 Synd-0.044
 NIC-0.042
 conduc-0.042
 Sting-0.042
Token sign
Token ID1051
Feature activation+6.21e-02

Pos logit contributions
hower+0.074
 Synd+0.069
 NIC+0.066
 Sting+0.066
 Huck+0.065
Neg logit contributions
umenthal-0.088
utra-0.084
redit-0.081
��-0.078
ounty-0.077
Token would
Token ID561
Feature activation+5.59e-02

Pos logit contributions
hower+0.103
 Synd+0.097
 NIC+0.093
 Sting+0.093
 conduc+0.092
Neg logit contributions
umenthal-0.126
utra-0.121
redit-0.117
��-0.113
ounty-0.112
Token be
Token ID307
Feature activation-9.66e-02

Pos logit contributions
hower+0.099
 Synd+0.094
 NIC+0.091
 Sting+0.090
 Spac+0.089
Neg logit contributions
umenthal-0.106
utra-0.102
redit-0.098
��-0.095
ounty-0.094
Token if
Token ID611
Feature activation-1.75e-01

Pos logit contributions
umenthal+0.202
utra+0.195
redit+0.188
��+0.181
ople+0.180
Neg logit contributions
hower-0.153
 Synd-0.143
 NIC-0.137
 Sting-0.136
 Spac-0.135
Token pl
Token ID458
Feature activation-1.87e-01

Pos logit contributions
umenthal+0.363
utra+0.350
redit+0.337
��+0.327
ople+0.325
Neg logit contributions
hower-0.278
 Synd-0.260
 NIC-0.247
 Sting-0.247
wal-0.245
Tokenumes
Token ID8139
Feature activation+9.34e-01

Pos logit contributions
umenthal+0.303
utra+0.284
redit+0.274
��+0.259
ople+0.258
Neg logit contributions
hower-0.393
 Synd-0.375
 NIC-0.364
 Sting-0.363
 Spac-0.359
Token of
Token ID286
Feature activation-1.61e-02

Pos logit contributions
hower+1.506
 Synd+1.397
 NIC+1.356
 Sting+1.350
 Spac+1.324
Neg logit contributions
umenthal-1.962
utra-1.865
redit-1.808
��-1.754
ounty-1.728
Token c
Token ID269
Feature activation+1.39e-01

Pos logit contributions
umenthal+0.029
utra+0.027
redit+0.026
��+0.025
ople+0.025
Neg logit contributions
hower-0.030
 Synd-0.029
 NIC-0.027
 Huck-0.027
wal-0.027
Tokenes
Token ID274
Feature activation+2.09e-01

Pos logit contributions
hower+0.267
 Synd+0.254
 NIC+0.246
 Spac+0.244
 conduc+0.243
Neg logit contributions
umenthal-0.250
utra-0.236
redit-0.228
ople-0.219
��-0.217
Tokenium
Token ID1505
Feature activation+3.43e-01

Pos logit contributions
hower+0.360
 Synd+0.339
 NIC+0.328
 Sting+0.327
 Huck+0.323
Neg logit contributions
umenthal-0.413
utra-0.396
redit-0.379
ople-0.366
��-0.365
Token are
Token ID389
Feature activation-1.50e-01

Pos logit contributions
hower+0.618
 Synd+0.584
 NIC+0.566
 Sting+0.560
 Huck+0.556
Neg logit contributions
umenthal-0.647
utra-0.620
redit-0.596
��-0.573
ounty-0.568
Token not
Token ID407
Feature activation-3.62e-02

Pos logit contributions
hower+0.080
 Synd+0.072
 conduc+0.069
wal+0.069
 NIC+0.069
Neg logit contributions
umenthal-0.106
utra-0.105
redit-0.100
��-0.096
ople-0.095
Token the
Token ID262
Feature activation+1.31e-03

Pos logit contributions
umenthal+0.073
utra+0.072
redit+0.068
ople+0.066
��+0.066
Neg logit contributions
hower-0.061
 Synd-0.055
 NIC-0.053
wal-0.053
 conduc-0.053
Token only
Token ID691
Feature activation+5.44e-03

Pos logit contributions
hower+0.002
 Synd+0.002
wal+0.002
 NIC+0.002
 conduc+0.002
Neg logit contributions
umenthal-0.003
utra-0.003
redit-0.002
��-0.002
ounty-0.002
Token thing
Token ID1517
Feature activation-3.34e-01

Pos logit contributions
hower+0.009
 Synd+0.008
 NIC+0.008
wal+0.008
 Huck+0.008
Neg logit contributions
umenthal-0.011
utra-0.011
redit-0.010
��-0.010
ounty-0.010
Token that
Token ID326
Feature activation+8.69e-03

Pos logit contributions
umenthal+0.641
utra+0.622
redit+0.603
ounty+0.581
ople+0.572
Neg logit contributions
hower-0.588
 Synd-0.535
 NIC-0.524
wal-0.518
 conduc-0.510
Token drives
Token ID10182
Feature activation+9.28e-01

Pos logit contributions
hower+0.012
 Synd+0.011
 NIC+0.010
wal+0.010
 conduc+0.010
Neg logit contributions
umenthal-0.020
utra-0.019
redit-0.019
ounty-0.018
ople-0.018
Token them
Token ID606
Feature activation-5.37e-03

Pos logit contributions
hower+1.473
 Synd+1.355
 NIC+1.315
 Sting+1.290
wal+1.283
Neg logit contributions
umenthal-1.972
utra-1.898
redit-1.824
��-1.771
ounty-1.757
Token toward
Token ID3812
Feature activation+1.47e-01

Pos logit contributions
umenthal+0.010
utra+0.010
redit+0.010
ounty+0.009
ople+0.009
Neg logit contributions
hower-0.009
 Synd-0.009
 NIC-0.008
 Huck-0.008
 conduc-0.008
Token developing
Token ID5922
Feature activation-8.83e-02

Pos logit contributions
hower+0.265
 Synd+0.245
 NIC+0.239
 Sting+0.237
wal+0.235
Neg logit contributions
umenthal-0.281
utra-0.269
redit-0.258
��-0.249
ounty-0.248
Token new
Token ID649
Feature activation-4.87e-02

Pos logit contributions
umenthal+0.185
utra+0.178
redit+0.172
��+0.167
ounty+0.166
Neg logit contributions
hower-0.142
 Synd-0.128
 NIC-0.126
 Sting-0.123
wal-0.123
Token Metroid
Token ID45430
Feature activation+1.59e-01

Pos logit contributions
umenthal+0.106
utra+0.103
redit+0.100
ounty+0.097
��+0.097
Neg logit contributions
hower-0.074
 Synd-0.066
 NIC-0.064
 conduc-0.063
 Sting-0.063
Tokenivals
Token ID10336
Feature activation-6.78e-02

Pos logit contributions
umenthal+0.046
utra+0.044
redit+0.043
��+0.041
ounty+0.041
Neg logit contributions
hower-0.046
 Synd-0.043
 NIC-0.042
 Sting-0.042
 conduc-0.042
Token information
Token ID1321
Feature activation-7.41e-02

Pos logit contributions
umenthal+0.138
utra+0.134
redit+0.128
ounty+0.125
ople+0.123
Neg logit contributions
hower-0.111
 Synd-0.103
 NIC-0.099
 Sting-0.098
 Huck-0.098
Token
Token ID198
Feature activation-7.80e-02

Pos logit contributions
umenthal+0.150
utra+0.145
redit+0.139
ople+0.135
ounty+0.134
Neg logit contributions
hower-0.122
 Synd-0.112
 NIC-0.109
 conduc-0.108
 Sting-0.107
Token
Token ID198
Feature activation-4.51e-02

Pos logit contributions
umenthal+0.151
utra+0.149
redit+0.142
ople+0.137
ounty+0.137
Neg logit contributions
hower-0.136
 conduc-0.133
 Synd-0.122
 NIC-0.118
 Huck-0.118
TokenF
Token ID37
Feature activation+6.06e-02

Pos logit contributions
umenthal+0.075
utra+0.072
redit+0.069
ounty+0.066
��+0.066
Neg logit contributions
hower-0.092
 Synd-0.086
 NIC-0.084
 Sting-0.084
 conduc-0.084
Tokenrequently
Token ID37971
Feature activation+9.21e-01

Pos logit contributions
hower+0.126
 Synd+0.121
 NIC+0.117
 Sting+0.117
 Spac+0.116
Neg logit contributions
umenthal-0.098
utra-0.094
redit-0.089
ounty-0.085
��-0.084
Token updated
Token ID6153
Feature activation+1.85e-01

Pos logit contributions
hower+1.639
 Synd+1.521
 NIC+1.491
 Sting+1.475
wal+1.458
Neg logit contributions
umenthal-1.772
utra-1.683
redit-1.631
��-1.568
ople-1.559
Token list
Token ID1351
Feature activation+3.07e-01

Pos logit contributions
hower+0.317
 Synd+0.292
 NIC+0.284
 Sting+0.281
 conduc+0.278
Neg logit contributions
umenthal-0.367
utra-0.353
redit-0.340
��-0.329
ounty-0.327
Token of
Token ID286
Feature activation-1.05e-02

Pos logit contributions
hower+0.482
 Synd+0.452
 NIC+0.440
 Sting+0.436
 conduc+0.428
Neg logit contributions
umenthal-0.650
utra-0.621
redit-0.604
��-0.584
ounty-0.576
Token WA
Token ID16400
Feature activation+1.36e-02

Pos logit contributions
umenthal+0.022
utra+0.021
redit+0.021
ounty+0.020
��+0.020
Neg logit contributions
hower-0.017
 Synd-0.015
 NIC-0.015
 Sting-0.015
 Spac-0.014
Token Beer
Token ID16971
Feature activation-1.65e-01

Pos logit contributions
hower+0.027
 Synd+0.025
 NIC+0.025
 Sting+0.024
 conduc+0.024
Neg logit contributions
umenthal-0.023
utra-0.022
redit-0.021
��-0.021
ounty-0.020
Tokenible
Token ID856
Feature activation-1.39e-01

Pos logit contributions
hower+0.931
 Synd+0.878
 NIC+0.853
 Sting+0.846
 Spac+0.838
Neg logit contributions
umenthal-0.963
utra-0.918
redit-0.881
��-0.853
ounty-0.836
Token-
Token ID12
Feature activation+2.75e-03

Pos logit contributions
umenthal+0.255
utra+0.243
redit+0.234
��+0.225
ounty+0.223
Neg logit contributions
hower-0.257
 Synd-0.243
 NIC-0.235
 Sting-0.235
 Spac-0.231
Tokenlimit
Token ID32374
Feature activation-3.02e-01

Pos logit contributions
hower+0.006
 Synd+0.005
 NIC+0.005
 Sting+0.005
 Spac+0.005
Neg logit contributions
umenthal-0.005
utra-0.004
redit-0.004
ounty-0.004
��-0.004
Token-
Token ID12
Feature activation-4.44e-02

Pos logit contributions
umenthal+0.547
utra+0.525
redit+0.505
ounty+0.481
��+0.481
Neg logit contributions
hower-0.566
 Synd-0.532
 NIC-0.517
 Sting-0.516
 Huck-0.510
Tokentr
Token ID2213
Feature activation+2.98e-02

Pos logit contributions
umenthal+0.076
utra+0.072
redit+0.069
ounty+0.066
��+0.065
Neg logit contributions
hower-0.088
 Synd-0.084
 NIC-0.082
 Sting-0.082
 Huck-0.081
Tokenading
Token ID4980
Feature activation-1.37e+00

Pos logit contributions
hower+0.064
 Synd+0.060
 NIC+0.059
 Spac+0.058
 conduc+0.058
Neg logit contributions
umenthal-0.047
utra-0.045
redit-0.043
ounty-0.040
ople-0.040
Token-
Token ID12
Feature activation-1.22e-02

Pos logit contributions
umenthal+2.421
utra+2.341
redit+2.234
ounty+2.125
ople+2.116
Neg logit contributions
hower-2.589
 Synd-2.403
 Sting-2.355
 NIC-2.346
 conduc-2.328
Tokensub
Token ID7266
Feature activation-4.07e-01

Pos logit contributions
umenthal+0.021
utra+0.020
redit+0.019
��+0.018
ounty+0.018
Neg logit contributions
hower-0.024
 Synd-0.023
 NIC-0.023
 Sting-0.023
 Huck-0.022
Tokens
Token ID82
Feature activation-5.27e-02

Pos logit contributions
umenthal+0.702
utra+0.677
redit+0.655
ounty+0.623
��+0.620
Neg logit contributions
hower-0.800
 Synd-0.751
 Sting-0.733
 NIC-0.729
 conduc-0.726
Tokenidy
Token ID19325
Feature activation+2.27e-01

Pos logit contributions
umenthal+0.072
utra+0.069
redit+0.066
ounty+0.061
ople+0.060
Neg logit contributions
hower-0.123
 Synd-0.116
 NIC-0.114
 Sting-0.114
 conduc-0.113
Token-
Token ID12
Feature activation+1.73e-02

Pos logit contributions
hower+0.391
 Synd+0.365
 NIC+0.353
 Sting+0.353
 conduc+0.348
Neg logit contributions
umenthal-0.446
utra-0.429
redit-0.412
��-0.397
ounty-0.395
Token the
Token ID262
Feature activation-5.34e-02

Pos logit contributions
hower+0.067
 Synd+0.061
 NIC+0.060
 Spac+0.059
 Sting+0.059
Neg logit contributions
umenthal-0.086
utra-0.084
redit-0.080
��-0.078
ople-0.077
Token basis
Token ID4308
Feature activation+4.12e-01

Pos logit contributions
umenthal+0.104
utra+0.101
redit+0.096
��+0.092
ople+0.092
Neg logit contributions
hower-0.094
 Synd-0.085
 NIC-0.083
 Spac-0.082
 Sting-0.082
Token of
Token ID286
Feature activation+1.01e-02

Pos logit contributions
hower+0.727
 Synd+0.682
 NIC+0.660
 Sting+0.658
 Spac+0.656
Neg logit contributions
umenthal-0.787
utra-0.768
redit-0.730
ounty-0.697
��-0.697
Token '
Token ID705
Feature activation-1.87e-01

Pos logit contributions
hower+0.017
 Synd+0.015
 NIC+0.015
 Spac+0.015
 Sting+0.015
Neg logit contributions
umenthal-0.020
utra-0.020
redit-0.019
��-0.018
ople-0.018
Tokentr
Token ID2213
Feature activation+7.81e-02

Pos logit contributions
umenthal+0.329
utra+0.318
redit+0.305
��+0.289
ople+0.287
Neg logit contributions
hower-0.361
 Synd-0.341
 NIC-0.333
 Sting-0.332
 Spac-0.329
Tokenades
Token ID2367
Feature activation-1.08e+00

Pos logit contributions
hower+0.142
 Synd+0.133
 NIC+0.129
 Sting+0.128
 Spac+0.128
Neg logit contributions
umenthal-0.149
utra-0.143
redit-0.138
ounty-0.130
ople-0.130
Token'.
Token ID4458
Feature activation-3.06e-01

Pos logit contributions
umenthal+1.844
utra+1.808
redit+1.683
ople+1.599
ounty+1.578
Neg logit contributions
hower-2.062
 Synd-1.969
 NIC-1.936
 Sting-1.927
 Spac-1.903
Token The
Token ID383
Feature activation-4.04e-02

Pos logit contributions
umenthal+0.550
utra+0.538
redit+0.515
ople+0.495
ounty+0.490
Neg logit contributions
hower-0.570
 conduc-0.525
 Synd-0.499
 NIC-0.495
wal-0.495
Token real
Token ID1103
Feature activation-3.79e-01

Pos logit contributions
umenthal+0.081
utra+0.078
redit+0.076
��+0.072
ounty+0.072
Neg logit contributions
hower-0.069
 Synd-0.061
wal-0.060
 NIC-0.060
 Sting-0.060
Token difference
Token ID3580
Feature activation+4.12e-02

Pos logit contributions
umenthal+0.765
utra+0.741
redit+0.712
��+0.688
ounty+0.680
Neg logit contributions
hower-0.642
 Synd-0.569
 Spac-0.559
wal-0.558
 NIC-0.556
Token between
Token ID1022
Feature activation-4.70e-01

Pos logit contributions
hower+0.072
 Synd+0.067
 NIC+0.065
 Sting+0.064
 conduc+0.064
Neg logit contributions
umenthal-0.080
utra-0.077
redit-0.075
��-0.071
ople-0.071
Token a
Token ID257
Feature activation-9.12e-02

Pos logit contributions
hower+0.238
 Synd+0.219
 NIC+0.213
 Sting+0.209
 Spac+0.208
Neg logit contributions
umenthal-0.353
utra-0.342
redit-0.327
��-0.317
ounty-0.313
Token price
Token ID2756
Feature activation+2.76e-01

Pos logit contributions
umenthal+0.175
utra+0.171
redit+0.163
ople+0.157
��+0.157
Neg logit contributions
hower-0.158
 Synd-0.146
wal-0.141
 NIC-0.140
 Spac-0.137
Token as
Token ID355
Feature activation-5.81e-02

Pos logit contributions
hower+0.429
 Synd+0.397
 NIC+0.383
 Sting+0.379
 Spac+0.376
Neg logit contributions
umenthal-0.589
utra-0.571
redit-0.550
��-0.527
ounty-0.526
Token deep
Token ID2769
Feature activation+7.89e-02

Pos logit contributions
umenthal+0.120
utra+0.117
redit+0.112
ounty+0.107
��+0.107
Neg logit contributions
hower-0.094
 Synd-0.084
 NIC-0.082
 Spac-0.082
 Sting-0.082
Token fact
Token ID1109
Feature activation+1.43e-03

Pos logit contributions
hower+0.130
 Synd+0.120
 NIC+0.118
 Spac+0.117
 Sting+0.117
Neg logit contributions
umenthal-0.161
utra-0.156
redit-0.149
ounty-0.144
��-0.142
Tokenional
Token ID1538
Feature activation-1.04e+00

Pos logit contributions
hower+0.002
 Synd+0.002
 NIC+0.002
 Sting+0.002
 Spac+0.002
Neg logit contributions
umenthal-0.003
utra-0.003
redit-0.003
ounty-0.003
��-0.003
Token divisions
Token ID17397
Feature activation-2.84e-02

Pos logit contributions
umenthal+1.979
utra+1.901
redit+1.819
ople+1.768
ounty+1.742
Neg logit contributions
hower-1.828
 Synd-1.639
 NIC-1.633
 conduc-1.625
 Spac-1.623
Token developed
Token ID4166
Feature activation-8.97e-02

Pos logit contributions
umenthal+0.054
utra+0.052
redit+0.050
ounty+0.048
��+0.047
Neg logit contributions
hower-0.051
 Synd-0.048
 NIC-0.046
 Sting-0.046
 conduc-0.046
Token between
Token ID1022
Feature activation-2.39e-01

Pos logit contributions
umenthal+0.175
utra+0.170
redit+0.163
ounty+0.155
ople+0.154
Neg logit contributions
hower-0.155
 Synd-0.143
 NIC-0.140
 Sting-0.138
 conduc-0.138
Token the
Token ID262
Feature activation-7.95e-02

Pos logit contributions
umenthal+0.490
utra+0.465
redit+0.455
ople+0.434
ounty+0.433
Neg logit contributions
hower-0.394
 Synd-0.349
 conduc-0.342
 Spac-0.340
 NIC-0.340
Token youthful
Token ID35444
Feature activation-6.96e-02

Pos logit contributions
umenthal+0.163
utra+0.155
redit+0.152
ople+0.146
��+0.145
Neg logit contributions
hower-0.130
 Synd-0.117
 conduc-0.114
wal-0.113
 NIC-0.113
Token hotel
Token ID7541
Feature activation-1.40e-01

Pos logit contributions
hower+0.063
 Synd+0.058
 Sting+0.055
 conduc+0.055
 NIC+0.054
Neg logit contributions
umenthal-0.087
utra-0.084
redit-0.081
ople-0.078
ounty-0.078
Token room
Token ID2119
Feature activation-1.65e-01

Pos logit contributions
umenthal+0.265
utra+0.260
redit+0.251
ople+0.242
ounty+0.241
Neg logit contributions
hower-0.241
 Synd-0.227
 Sting-0.217
 Spac-0.215
 Huck-0.214
Token at
Token ID379
Feature activation-2.95e-02

Pos logit contributions
umenthal+0.330
utra+0.319
redit+0.307
ople+0.296
ounty+0.295
Neg logit contributions
hower-0.272
 Synd-0.255
 Sting-0.243
 conduc-0.242
 NIC-0.242
Token the
Token ID262
Feature activation-9.06e-03

Pos logit contributions
umenthal+0.061
utra+0.059
redit+0.057
��+0.055
ounty+0.054
Neg logit contributions
hower-0.047
 Synd-0.044
 conduc-0.042
 NIC-0.042
 Sting-0.042
Token Hy
Token ID6707
Feature activation-3.82e-01

Pos logit contributions
umenthal+0.019
utra+0.019
redit+0.018
ople+0.017
��+0.017
Neg logit contributions
hower-0.014
 Synd-0.013
 NIC-0.013
 Sting-0.012
 conduc-0.012
Tokenatt
Token ID1078
Feature activation-1.03e+00

Pos logit contributions
umenthal+0.726
utra+0.707
redit+0.679
ounty+0.641
ople+0.640
Neg logit contributions
hower-0.684
 Synd-0.640
 NIC-0.627
 conduc-0.619
 Sting-0.619
Token Reg
Token ID3310
Feature activation+1.17e-01

Pos logit contributions
umenthal+1.779
utra+1.745
redit+1.659
ople+1.600
ounty+1.579
Neg logit contributions
hower-1.932
 Synd-1.779
 conduc-1.756
 NIC-1.747
wal-1.746
Tokenency
Token ID1387
Feature activation+5.75e-02

Pos logit contributions
hower+0.211
 Synd+0.197
 NIC+0.191
 Sting+0.190
 conduc+0.189
Neg logit contributions
umenthal-0.220
utra-0.212
redit-0.204
ople-0.195
��-0.194
Token Irvine
Token ID37277
Feature activation+1.48e-01

Pos logit contributions
hower+0.096
 Synd+0.089
 NIC+0.087
 Sting+0.087
 conduc+0.086
Neg logit contributions
umenthal-0.113
utra-0.110
redit-0.105
ople-0.102
ounty-0.101
Token,
Token ID11
Feature activation-1.20e-01

Pos logit contributions
hower+0.241
 Synd+0.224
 NIC+0.216
 Sting+0.214
 conduc+0.212
Neg logit contributions
umenthal-0.298
utra-0.291
redit-0.280
ounty-0.270
ople-0.270
Token November
Token ID3389
Feature activation-1.42e-01

Pos logit contributions
umenthal+0.269
utra+0.261
redit+0.253
ounty+0.244
ople+0.242
Neg logit contributions
hower-0.171
 Synd-0.156
 Sting-0.147
 Spac-0.147
 conduc-0.147
Token
Token ID198
Feature activation-3.59e-02

Pos logit contributions
umenthal+0.161
utra+0.153
redit+0.147
ounty+0.142
��+0.141
Neg logit contributions
hower-0.149
 Synd-0.136
 conduc-0.135
 NIC-0.133
 Sting-0.132
Token
Token ID198
Feature activation-6.36e-02

Pos logit contributions
umenthal+0.071
utra+0.069
redit+0.066
ople+0.064
ounty+0.064
Neg logit contributions
hower-0.062
 conduc-0.060
 Synd-0.056
 Sting-0.054
 Huck-0.054
TokenAn
Token ID2025
Feature activation+2.38e-01

Pos logit contributions
umenthal+0.103
utra+0.098
redit+0.093
ounty+0.089
��+0.088
Neg logit contributions
hower-0.134
 Synd-0.126
 conduc-0.123
 NIC-0.123
 Sting-0.122
Token ob
Token ID909
Feature activation-1.75e-02

Pos logit contributions
hower+0.406
 Synd+0.373
 conduc+0.362
 Sting+0.361
 NIC+0.361
Neg logit contributions
umenthal-0.476
utra-0.460
redit-0.442
ounty-0.424
��-0.423
Tokenit
Token ID270
Feature activation-3.95e-02

Pos logit contributions
umenthal+0.028
utra+0.026
redit+0.025
��+0.024
ounty+0.024
Neg logit contributions
hower-0.037
 Synd-0.035
 NIC-0.034
 Sting-0.034
 conduc-0.034
Tokenuary
Token ID2838
Feature activation-1.03e+00

Pos logit contributions
umenthal+0.064
utra+0.061
redit+0.059
��+0.055
ople+0.055
Neg logit contributions
hower-0.082
 Synd-0.078
 NIC-0.076
 Sting-0.076
 Spac-0.075
Token for
Token ID329
Feature activation+4.15e-02

Pos logit contributions
umenthal+1.996
utra+1.910
redit+1.861
ople+1.759
��+1.750
Neg logit contributions
hower-1.765
 Synd-1.629
 NIC-1.575
 conduc-1.571
 Sting-1.567
Token a
Token ID257
Feature activation-7.04e-02

Pos logit contributions
hower+0.067
 Synd+0.061
 NIC+0.060
 conduc+0.059
 Spac+0.059
Neg logit contributions
umenthal-0.087
utra-0.083
redit-0.080
ounty-0.077
��-0.077
Token theater
Token ID13766
Feature activation-7.10e-02

Pos logit contributions
umenthal+0.139
utra+0.133
redit+0.128
ounty+0.123
��+0.123
Neg logit contributions
hower-0.121
 Synd-0.111
 NIC-0.107
 conduc-0.106
wal-0.106
Token employee
Token ID6538
Feature activation+5.90e-02

Pos logit contributions
umenthal+0.135
utra+0.130
redit+0.125
��+0.120
ople+0.120
Neg logit contributions
hower-0.126
 Synd-0.118
 NIC-0.115
 Sting-0.114
 Spac-0.113
Token reported
Token ID2098
Feature activation+3.81e-01

Pos logit contributions
hower+0.101
 Synd+0.092
 NIC+0.089
 conduc+0.088
 Sting+0.088
Neg logit contributions
umenthal-0.118
utra-0.113
redit-0.109
ounty-0.104
ople-0.104
Token Anders
Token ID25519
Feature activation+9.72e-02

Pos logit contributions
umenthal+0.360
utra+0.346
redit+0.335
��+0.326
ople+0.322
Neg logit contributions
hower-0.260
 Synd-0.240
 NIC-0.232
 conduc-0.232
 Sting-0.231
Token,
Token ID11
Feature activation+8.19e-02

Pos logit contributions
hower+0.135
 Synd+0.125
 NIC+0.120
 Sting+0.119
 Spac+0.117
Neg logit contributions
umenthal-0.225
utra-0.216
redit-0.210
��-0.204
ounty-0.201
Token "
Token ID366
Feature activation+2.81e-02

Pos logit contributions
hower+0.138
 Synd+0.127
 NIC+0.121
 conduc+0.120
 Sting+0.120
Neg logit contributions
umenthal-0.165
utra-0.159
redit-0.153
ople-0.148
ounty-0.147
Tokenlet
Token ID1616
Feature activation-2.58e-02

Pos logit contributions
hower+0.053
 Synd+0.049
 conduc+0.048
 NIC+0.048
 Sting+0.047
Neg logit contributions
umenthal-0.051
utra-0.049
redit-0.047
ounty-0.045
��-0.045
Token me
Token ID502
Feature activation-3.71e-01

Pos logit contributions
umenthal+0.054
utra+0.052
redit+0.050
��+0.049
ounty+0.048
Neg logit contributions
hower-0.041
 Synd-0.039
 NIC-0.037
 Sting-0.037
 Spac-0.036
Token count
Token ID954
Feature activation-1.02e+00

Pos logit contributions
umenthal+0.659
utra+0.633
redit+0.617
ople+0.594
ounty+0.582
Neg logit contributions
hower-0.707
 Synd-0.655
 conduc-0.636
 NIC-0.628
 Sting-0.624
Token quickly
Token ID2952
Feature activation+1.87e-01

Pos logit contributions
umenthal+1.986
utra+1.934
redit+1.848
ople+1.778
ounty+1.773
Neg logit contributions
hower-1.740
 Synd-1.584
 conduc-1.562
 NIC-1.541
wal-1.523
Token."
Token ID526
Feature activation-3.00e-01

Pos logit contributions
hower+0.310
 Synd+0.285
 NIC+0.278
 conduc+0.276
 Sting+0.274
Neg logit contributions
umenthal-0.383
utra-0.368
redit-0.356
ople-0.340
��-0.339
Token P
Token ID350
Feature activation-2.30e-02

Pos logit contributions
umenthal+0.555
utra+0.539
redit+0.511
ople+0.497
��+0.493
Neg logit contributions
hower-0.542
 conduc-0.504
 Synd-0.493
wal-0.477
 NIC-0.475
Tokenause
Token ID682
Feature activation-3.32e-01

Pos logit contributions
umenthal+0.036
utra+0.034
redit+0.033
ople+0.031
ounty+0.030
Neg logit contributions
hower-0.050
 Synd-0.047
 NIC-0.046
 Sting-0.046
 conduc-0.046
Token.
Token ID13
Feature activation-2.90e-02

Pos logit contributions
umenthal+0.704
utra+0.684
redit+0.654
��+0.634
ople+0.632
Neg logit contributions
hower-0.524
 Synd-0.479
 conduc-0.463
 NIC-0.460
 Sting-0.457
Token,
Token ID11
Feature activation-7.72e-02

Pos logit contributions
umenthal+0.855
utra+0.823
redit+0.796
��+0.769
ounty+0.759
Neg logit contributions
hower-0.647
 Synd-0.599
 NIC-0.576
 conduc-0.575
 Sting-0.566
Token Liam
Token ID31684
Feature activation+4.78e-02

Pos logit contributions
umenthal+0.156
utra+0.149
redit+0.145
��+0.139
ounty+0.138
Neg logit contributions
hower-0.129
 Synd-0.119
 NIC-0.114
 conduc-0.114
 Sting-0.113
Token,
Token ID11
Feature activation-2.15e-02

Pos logit contributions
hower+0.074
 Synd+0.069
 NIC+0.067
 Sting+0.066
 conduc+0.066
Neg logit contributions
umenthal-0.102
utra-0.098
redit-0.095
��-0.092
ounty-0.091
Token Stella
Token ID45856
Feature activation-3.99e-01

Pos logit contributions
umenthal+0.041
utra+0.039
redit+0.038
��+0.036
ople+0.036
Neg logit contributions
hower-0.039
 Synd-0.036
 conduc-0.035
 NIC-0.034
wal-0.034
Token,
Token ID11
Feature activation-5.19e-02

Pos logit contributions
umenthal+0.860
utra+0.829
redit+0.798
��+0.779
ople+0.766
Neg logit contributions
hower-0.611
 Synd-0.568
 NIC-0.549
 Sting-0.542
 conduc-0.540
Token Hatt
Token ID48509
Feature activation-1.02e+00

Pos logit contributions
umenthal+0.106
utra+0.102
redit+0.098
��+0.095
ople+0.094
Neg logit contributions
hower-0.086
 Synd-0.079
 conduc-0.076
 NIC-0.076
wal-0.074
Tokenie
Token ID494
Feature activation-1.82e-01

Pos logit contributions
umenthal+1.516
utra+1.448
redit+1.373
ople+1.299
ounty+1.281
Neg logit contributions
hower-2.231
 Synd-2.108
 NIC-2.089
 conduc-2.085
 Sting-2.058
Token and
Token ID290
Feature activation+9.11e-02

Pos logit contributions
umenthal+0.399
utra+0.384
redit+0.371
��+0.360
ounty+0.356
Neg logit contributions
hower-0.273
 Synd-0.255
 NIC-0.245
 Sting-0.243
 Spac-0.240
Token Finn
Token ID15368
Feature activation-2.14e-02

Pos logit contributions
hower+0.147
 Synd+0.135
 conduc+0.130
 NIC+0.130
wal+0.128
Neg logit contributions
umenthal-0.188
utra-0.181
redit-0.175
��-0.169
ople-0.167
Token,
Token ID11
Feature activation+5.65e-02

Pos logit contributions
umenthal+0.045
utra+0.043
redit+0.041
��+0.040
ounty+0.040
Neg logit contributions
hower-0.034
 Synd-0.032
 NIC-0.031
 Sting-0.030
 conduc-0.030
Token and
Token ID290
Feature activation+8.53e-02

Pos logit contributions
hower+0.102
 Synd+0.096
 NIC+0.092
 Sting+0.092
 conduc+0.091
Neg logit contributions
umenthal-0.106
utra-0.102
redit-0.098
��-0.094
ounty-0.093
Token YOUR
Token ID16592
Feature activation-1.72e-01

Pos logit contributions
umenthal+0.156
utra+0.149
redit+0.145
��+0.139
ounty+0.138
Neg logit contributions
hower-0.129
 Synd-0.119
 conduc-0.115
 NIC-0.114
 Sting-0.113
Token bed
Token ID3996
Feature activation-2.77e-01

Pos logit contributions
umenthal+0.319
utra+0.307
redit+0.291
��+0.284
ople+0.282
Neg logit contributions
hower-0.312
 Synd-0.294
 conduc-0.285
 Sting-0.281
 Huck-0.279
Token?"
Token ID1701
Feature activation-2.01e-01

Pos logit contributions
umenthal+0.457
utra+0.434
redit+0.416
��+0.394
ople+0.394
Neg logit contributions
hower-0.560
 Synd-0.532
 conduc-0.517
 NIC-0.515
 Sting-0.513
Token she
Token ID673
Feature activation-8.98e-02

Pos logit contributions
umenthal+0.384
utra+0.367
redit+0.353
��+0.342
ounty+0.339
Neg logit contributions
hower-0.358
 Synd-0.334
 NIC-0.325
 Sting-0.322
 conduc-0.321
Token te
Token ID573
Feature activation+1.50e-01

Pos logit contributions
umenthal+0.201
utra+0.193
redit+0.186
ople+0.181
ounty+0.179
Neg logit contributions
hower-0.132
 Synd-0.121
 NIC-0.115
 conduc-0.114
 Spac-0.113
Tokenases
Token ID1386
Feature activation-1.02e+00

Pos logit contributions
hower+0.307
 Synd+0.290
 NIC+0.282
 Sting+0.281
 Spac+0.279
Neg logit contributions
umenthal-0.251
utra-0.235
redit-0.227
ople-0.217
ounty-0.214
Token.
Token ID13
Feature activation-5.16e-02

Pos logit contributions
umenthal+2.092
utra+2.026
redit+1.916
ople+1.870
ounty+1.858
Neg logit contributions
hower-1.643
 Synd-1.492
 conduc-1.448
wal-1.417
 NIC-1.411
Token
Token ID198
Feature activation-1.33e-01

Pos logit contributions
umenthal+0.108
utra+0.104
redit+0.100
��+0.097
ounty+0.096
Neg logit contributions
hower-0.082
 Synd-0.077
 NIC-0.074
 Sting-0.073
 conduc-0.073
Token
Token ID198
Feature activation-1.48e-01

Pos logit contributions
umenthal+0.255
utra+0.251
redit+0.238
ople+0.232
ounty+0.231
Neg logit contributions
hower-0.236
 conduc-0.233
 Synd-0.212
 Huck-0.205
 Sting-0.205
TokenI
Token ID40
Feature activation+3.97e-02

Pos logit contributions
umenthal+0.256
utra+0.244
redit+0.233
��+0.224
ounty+0.221
Neg logit contributions
hower-0.291
 Synd-0.276
 NIC-0.268
 Sting-0.266
 conduc-0.264
Token tried
Token ID3088
Feature activation-1.56e-01

Pos logit contributions
hower+0.077
 Synd+0.072
 conduc+0.070
 NIC+0.070
 Sting+0.069
Neg logit contributions
umenthal-0.070
utra-0.067
redit-0.064
ople-0.061
ounty-0.061
Token
Token ID447
Feature activation-7.20e-02

Pos logit contributions
umenthal+0.116
utra+0.111
redit+0.107
��+0.103
ounty+0.103
Neg logit contributions
hower-0.098
 Synd-0.090
 NIC-0.087
 conduc-0.087
 Sting-0.087
Token
Token ID251
Feature activation+6.11e-02

Pos logit contributions
umenthal+0.138
utra+0.132
redit+0.127
��+0.121
ounty+0.120
Neg logit contributions
hower-0.128
 Synd-0.121
 Sting-0.117
 NIC-0.116
 Huck-0.115
Token The
Token ID383
Feature activation-8.13e-02

Pos logit contributions
hower+0.102
 Synd+0.093
 NIC+0.091
 conduc+0.091
 Sting+0.090
Neg logit contributions
umenthal-0.124
utra-0.118
redit-0.114
ounty-0.110
ople-0.110
Token ob
Token ID909
Feature activation-2.42e-01

Pos logit contributions
umenthal+0.160
utra+0.152
redit+0.146
��+0.141
ounty+0.140
Neg logit contributions
hower-0.141
 Synd-0.132
 NIC-0.127
 Sting-0.126
 Spac-0.126
Tokenit
Token ID270
Feature activation+9.30e-03

Pos logit contributions
umenthal+0.384
utra+0.364
redit+0.347
��+0.330
ounty+0.326
Neg logit contributions
hower-0.511
 Synd-0.487
 NIC-0.474
 Sting-0.472
 Spac-0.468
Tokenuary
Token ID2838
Feature activation-1.00e+00

Pos logit contributions
hower+0.018
 Synd+0.017
 NIC+0.017
 Sting+0.016
 Spac+0.016
Neg logit contributions
umenthal-0.016
utra-0.016
redit-0.015
��-0.014
ounty-0.014
Token reported
Token ID2098
Feature activation+3.58e-01

Pos logit contributions
umenthal+2.010
utra+1.926
redit+1.869
ople+1.761
ounty+1.755
Neg logit contributions
hower-1.692
 Synd-1.557
 NIC-1.494
 conduc-1.484
 Spac-1.474
Token that
Token ID326
Feature activation+1.18e-01

Pos logit contributions
hower+0.579
 Synd+0.541
 NIC+0.524
 Sting+0.518
 Spac+0.515
Neg logit contributions
umenthal-0.744
utra-0.713
redit-0.687
��-0.664
ounty-0.659
Token Danny
Token ID15105
Feature activation+6.26e-02

Pos logit contributions
hower+0.202
 Synd+0.187
 NIC+0.182
 Spac+0.179
 conduc+0.179
Neg logit contributions
umenthal-0.234
utra-0.223
redit-0.215
ople-0.206
��-0.206
Token Daniels
Token ID28162
Feature activation-1.97e-01

Pos logit contributions
hower+0.110
 Synd+0.103
 NIC+0.100
 Sting+0.100
 conduc+0.099
Neg logit contributions
umenthal-0.121
utra-0.115
redit-0.111
��-0.107
ople-0.106
Token,
Token ID11
Feature activation+3.67e-02

Pos logit contributions
umenthal+0.391
utra+0.376
redit+0.361
��+0.350
ople+0.344
Neg logit contributions
hower-0.331
 Synd-0.309
 NIC-0.299
 Sting-0.296
 conduc-0.296
Token chain
Token ID6333
Feature activation-1.83e-01

Pos logit contributions
umenthal+0.188
utra+0.182
redit+0.173
��+0.169
ounty+0.167
Neg logit contributions
hower-0.152
 Synd-0.142
 Sting-0.137
 NIC-0.137
 Huck-0.135
Token.
Token ID13
Feature activation-3.66e-02

Pos logit contributions
umenthal+0.393
utra+0.383
redit+0.365
ounty+0.353
��+0.352
Neg logit contributions
hower-0.280
 Synd-0.259
 NIC-0.250
 Sting-0.250
 conduc-0.246
Token
Token ID198
Feature activation+3.42e-02

Pos logit contributions
umenthal+0.073
utra+0.071
redit+0.068
ounty+0.066
��+0.065
Neg logit contributions
hower-0.062
 Synd-0.056
 conduc-0.055
 NIC-0.054
 Sting-0.054
Token
Token ID198
Feature activation-1.04e-03

Pos logit contributions
hower+0.059
 conduc+0.057
 Synd+0.053
 Sting+0.051
 NIC+0.051
Neg logit contributions
umenthal-0.067
utra-0.066
redit-0.063
ople-0.061
ounty-0.061
TokenM
Token ID44
Feature activation-6.17e-04

Pos logit contributions
umenthal+0.002
utra+0.002
redit+0.002
��+0.002
ounty+0.002
Neg logit contributions
hower-0.002
 Synd-0.002
 conduc-0.002
 NIC-0.002
 Sting-0.002
Tokenounds
Token ID3733
Feature activation-9.93e-01

Pos logit contributions
umenthal+0.001
utra+0.001
redit+0.001
��+0.001
ounty+0.001
Neg logit contributions
hower-0.001
 Synd-0.001
 NIC-0.001
 Sting-0.001
 Spac-0.001
Token of
Token ID286
Feature activation-6.82e-02

Pos logit contributions
umenthal+1.888
utra+1.849
redit+1.757
ounty+1.676
��+1.670
Neg logit contributions
hower-1.735
 Synd-1.653
 NIC-1.580
 Spac-1.567
 conduc-1.565
Token tail
Token ID7894
Feature activation+4.95e-01

Pos logit contributions
umenthal+0.123
utra+0.118
redit+0.112
��+0.108
ounty+0.107
Neg logit contributions
hower-0.129
 Synd-0.121
 NIC-0.117
 Sting-0.116
 Spac-0.116
Tokenings
Token ID654
Feature activation+6.26e-02

Pos logit contributions
hower+0.941
 Synd+0.890
 NIC+0.867
 Sting+0.851
 Spac+0.847
Neg logit contributions
umenthal-0.884
utra-0.846
redit-0.811
ounty-0.775
��-0.774
Token from
Token ID422
Feature activation+1.44e-01

Pos logit contributions
hower+0.110
 Synd+0.106
 NIC+0.102
 Spac+0.101
 Sting+0.100
Neg logit contributions
umenthal-0.119
utra-0.116
redit-0.110
ounty-0.105
��-0.105
Token mineral
Token ID18352
Feature activation-2.95e-01

Pos logit contributions
hower+0.238
 Synd+0.222
 NIC+0.214
 Sting+0.210
 Spac+0.210
Neg logit contributions
umenthal-0.294
utra-0.284
redit-0.273
��-0.263
ounty-0.261
Token.
Token ID13
Feature activation+1.13e-01

Pos logit contributions
umenthal+0.237
utra+0.228
redit+0.220
��+0.212
ounty+0.212
Neg logit contributions
hower-0.156
 Synd-0.145
 NIC-0.139
 conduc-0.137
 Sting-0.137
Token
Token ID198
Feature activation+4.44e-02

Pos logit contributions
hower+0.194
 Synd+0.178
 conduc+0.175
 NIC+0.173
 Sting+0.171
Neg logit contributions
umenthal-0.221
utra-0.214
redit-0.205
��-0.198
ople-0.197
Token
Token ID198
Feature activation+5.44e-03

Pos logit contributions
hower+0.077
 conduc+0.075
 Synd+0.070
 Sting+0.067
 Huck+0.067
Neg logit contributions
umenthal-0.087
utra-0.085
redit-0.081
ople-0.079
ounty-0.078
TokenD
Token ID35
Feature activation+1.71e-01

Pos logit contributions
hower+0.011
 Synd+0.010
 NIC+0.010
 Sting+0.010
 conduc+0.010
Neg logit contributions
umenthal-0.009
utra-0.009
redit-0.009
��-0.008
ounty-0.008
Tokenry
Token ID563
Feature activation-1.87e-01

Pos logit contributions
hower+0.356
 Synd+0.339
 NIC+0.329
 Sting+0.328
 Spac+0.324
Neg logit contributions
umenthal-0.279
utra-0.264
redit-0.252
��-0.239
ounty-0.238
TokenHop
Token ID23483
Feature activation-9.86e-01

Pos logit contributions
umenthal+0.323
utra+0.307
redit+0.294
��+0.283
ounty+0.278
Neg logit contributions
hower-0.368
 Synd-0.349
 NIC-0.339
 Sting-0.336
 conduc-0.333
Token's
Token ID338
Feature activation+4.80e-02

Pos logit contributions
umenthal+1.785
utra+1.714
redit+1.644
ople+1.581
��+1.567
Neg logit contributions
hower-1.845
 Synd-1.689
 conduc-1.661
 NIC-1.658
 Sting-1.646
Token beer
Token ID6099
Feature activation-5.58e-02

Pos logit contributions
hower+0.085
 Synd+0.079
 NIC+0.077
 Sting+0.076
wal+0.076
Neg logit contributions
umenthal-0.092
utra-0.089
redit-0.085
��-0.082
ounty-0.081
Token names
Token ID3891
Feature activation+4.09e-01

Pos logit contributions
umenthal+0.097
utra+0.093
redit+0.089
��+0.086
ople+0.084
Neg logit contributions
hower-0.109
 Synd-0.102
 NIC-0.099
 Sting-0.099
 Spac-0.098
Token give
Token ID1577
Feature activation+4.21e-02

Pos logit contributions
hower+0.681
 Synd+0.632
 NIC+0.610
 Sting+0.603
 conduc+0.600
Neg logit contributions
umenthal-0.831
utra-0.807
redit-0.771
��-0.743
ounty-0.736
Token me
Token ID502
Feature activation-2.70e-02

Pos logit contributions
hower+0.068
 Synd+0.063
 NIC+0.060
 Sting+0.060
 conduc+0.060
Neg logit contributions
umenthal-0.088
utra-0.085
redit-0.082
��-0.079
ounty-0.078
Token chants
Token ID40727
Feature activation-5.19e-02

Pos logit contributions
hower+0.137
 Synd+0.125
 NIC+0.121
 conduc+0.119
 Sting+0.118
Neg logit contributions
umenthal-0.168
utra-0.160
redit-0.154
ople-0.149
��-0.148
Token of
Token ID286
Feature activation+5.29e-02

Pos logit contributions
umenthal+0.100
utra+0.096
redit+0.092
��+0.088
ounty+0.088
Neg logit contributions
hower-0.091
 Synd-0.086
 NIC-0.083
 Spac-0.082
 Sting-0.082
Token
Token ID564
Feature activation+1.65e-01

Pos logit contributions
hower+0.078
 Synd+0.072
 NIC+0.070
 Sting+0.069
 Spac+0.069
Neg logit contributions
umenthal-0.117
utra-0.113
redit-0.109
��-0.107
ople-0.105
Token
Token ID250
Feature activation-9.25e-02

Pos logit contributions
hower+0.314
 Synd+0.300
 NIC+0.289
 Sting+0.286
 conduc+0.286
Neg logit contributions
umenthal-0.295
utra-0.282
��-0.269
redit-0.268
ounty-0.257
TokenRes
Token ID4965
Feature activation+2.70e-01

Pos logit contributions
umenthal+0.158
utra+0.150
redit+0.143
��+0.137
ounty+0.137
Neg logit contributions
hower-0.184
 Synd-0.176
 NIC-0.170
 Sting-0.169
 Spac-0.167
Tokenign
Token ID570
Feature activation-9.84e-01

Pos logit contributions
hower+0.523
 Synd+0.497
 NIC+0.484
 Sting+0.479
 Spac+0.478
Neg logit contributions
umenthal-0.475
utra-0.454
redit-0.434
��-0.410
ounty-0.409
Token!
Token ID0
Feature activation+1.06e-01

Pos logit contributions
umenthal+1.952
utra+1.890
redit+1.818
ounty+1.746
��+1.743
Neg logit contributions
hower-1.641
 Synd-1.540
 NIC-1.497
 conduc-1.471
 Spac-1.468
Token Res
Token ID1874
Feature activation+2.18e-01

Pos logit contributions
hower+0.188
 Synd+0.176
 NIC+0.171
 Sting+0.170
 Spac+0.168
Neg logit contributions
umenthal-0.206
utra-0.196
redit-0.190
��-0.182
ounty-0.181
Tokenign
Token ID570
Feature activation-8.14e-01

Pos logit contributions
hower+0.509
 Synd+0.488
 NIC+0.478
 Sting+0.475
 Spac+0.472
Neg logit contributions
umenthal-0.295
utra-0.279
redit-0.265
ounty-0.244
��-0.244
Token!
Token ID0
Feature activation+7.92e-02

Pos logit contributions
umenthal+1.560
utra+1.497
redit+1.449
ople+1.385
ounty+1.383
Neg logit contributions
hower-1.418
 Synd-1.336
 NIC-1.297
 Spac-1.286
 Sting-1.282
Token
Token ID447
Feature activation+4.00e-02

Pos logit contributions
hower+0.142
 Synd+0.133
 NIC+0.129
 Sting+0.128
 Spac+0.127
Neg logit contributions
umenthal-0.151
utra-0.144
redit-0.139
��-0.134
ounty-0.133
Token,
Token ID11
Feature activation+1.29e-02

Pos logit contributions
umenthal+0.346
utra+0.334
redit+0.324
ople+0.310
��+0.308
Neg logit contributions
hower-0.271
 Synd-0.250
 NIC-0.242
 Sting-0.238
 conduc-0.238
Token and
Token ID290
Feature activation-1.03e-02

Pos logit contributions
hower+0.021
 Synd+0.020
 NIC+0.019
 Sting+0.019
 conduc+0.019
Neg logit contributions
umenthal-0.026
utra-0.025
redit-0.024
��-0.023
ople-0.023
Token they
Token ID484
Feature activation-2.49e-02

Pos logit contributions
umenthal+0.020
utra+0.020
redit+0.019
ople+0.018
��+0.018
Neg logit contributions
hower-0.017
 Synd-0.016
 NIC-0.016
 Huck-0.015
 conduc-0.015
Token are
Token ID389
Feature activation-1.69e-01

Pos logit contributions
umenthal+0.048
utra+0.047
redit+0.045
ople+0.043
ounty+0.043
Neg logit contributions
hower-0.043
 Synd-0.041
 NIC-0.039
 Huck-0.039
 Sting-0.039
Token not
Token ID407
Feature activation-1.92e-01

Pos logit contributions
umenthal+0.332
utra+0.325
redit+0.312
ople+0.298
ounty+0.295
Neg logit contributions
hower-0.289
 Synd-0.265
 NIC-0.255
 Sting-0.252
 Spac-0.251
Token accurately
Token ID14351
Feature activation-9.67e-01

Pos logit contributions
umenthal+0.365
utra+0.356
redit+0.342
ople+0.326
ounty+0.325
Neg logit contributions
hower-0.345
 Synd-0.317
 NIC-0.306
 Sting-0.302
 Spac-0.302
Token rendered
Token ID15111
Feature activation+3.94e-02

Pos logit contributions
umenthal+1.727
utra+1.679
redit+1.605
ounty+1.517
ople+1.514
Neg logit contributions
hower-1.844
 Synd-1.695
 NIC-1.649
 Sting-1.624
 conduc-1.618
Token as
Token ID355
Feature activation-3.65e-02

Pos logit contributions
hower+0.062
 Synd+0.056
 NIC+0.055
 Sting+0.054
 conduc+0.054
Neg logit contributions
umenthal-0.084
utra-0.081
redit-0.078
ople-0.075
��-0.075
Token Bra
Token ID9718
Feature activation-6.50e-02

Pos logit contributions
umenthal+0.071
utra+0.069
redit+0.065
��+0.063
ople+0.062
Neg logit contributions
hower-0.065
 Synd-0.060
 NIC-0.058
 Sting-0.057
 Huck-0.057
Tokenille
Token ID8270
Feature activation-4.83e-02

Pos logit contributions
umenthal+0.121
utra+0.116
redit+0.111
ople+0.107
ounty+0.106
Neg logit contributions
hower-0.121
 Synd-0.113
 conduc-0.112
 NIC-0.110
 Sting-0.109
Token numbers
Token ID3146
Feature activation-1.97e-01

Pos logit contributions
umenthal+0.097
utra+0.093
redit+0.090
ople+0.087
��+0.086
Neg logit contributions
hower-0.082
 Synd-0.075
 NIC-0.073
 Sting-0.072
 Huck-0.072
Token
Token ID198
Feature activation-5.45e-02

Pos logit contributions
umenthal+0.130
utra+0.129
redit+0.122
ople+0.119
ounty+0.118
Neg logit contributions
hower-0.121
 conduc-0.119
 Synd-0.109
 Sting-0.105
 Huck-0.105
TokenCustomer
Token ID44939
Feature activation-8.68e-02

Pos logit contributions
umenthal+0.094
utra+0.090
redit+0.085
��+0.082
ounty+0.081
Neg logit contributions
hower-0.108
 Synd-0.102
 NIC-0.099
 Sting-0.099
 Spac-0.098
Token:
Token ID25
Feature activation-7.08e-02

Pos logit contributions
umenthal+0.174
utra+0.164
redit+0.160
��+0.153
ounty+0.153
Neg logit contributions
hower-0.148
 Synd-0.140
 NIC-0.135
 Sting-0.135
wal-0.133
Token *
Token ID1635
Feature activation+1.49e-01

Pos logit contributions
umenthal+0.149
utra+0.143
redit+0.138
��+0.134
ounty+0.132
Neg logit contributions
hower-0.113
 Synd-0.105
 NIC-0.102
 Sting-0.101
 Spac-0.099
Tokenmen
Token ID3653
Feature activation-2.87e-01

Pos logit contributions
hower+0.282
 Synd+0.267
 NIC+0.258
 Sting+0.255
 conduc+0.255
Neg logit contributions
umenthal-0.268
utra-0.258
redit-0.246
��-0.236
ople-0.235
Tokenacing
Token ID4092
Feature activation-9.66e-01

Pos logit contributions
umenthal+0.469
utra+0.450
redit+0.423
ople+0.409
��+0.402
Neg logit contributions
hower-0.587
 Synd-0.564
 NIC-0.544
 Sting-0.541
 Spac-0.538
Token*
Token ID9
Feature activation-3.68e-03

Pos logit contributions
umenthal+1.768
utra+1.706
redit+1.608
ople+1.589
ounty+1.547
Neg logit contributions
hower-1.766
 Synd-1.648
 NIC-1.592
 conduc-1.562
 Spac-1.552
Token
Token ID564
Feature activation-1.72e-01

Pos logit contributions
umenthal+0.007
utra+0.007
redit+0.007
��+0.007
ople+0.007
Neg logit contributions
hower-0.006
 Synd-0.006
 NIC-0.006
 Sting-0.006
 conduc-0.005
Token
Token ID250
Feature activation-2.16e-01

Pos logit contributions
umenthal+0.279
utra+0.264
redit+0.253
��+0.239
ounty+0.237
Neg logit contributions
hower-0.355
 Synd-0.337
 NIC-0.328
 Sting-0.327
 Huck-0.324
TokenI
Token ID40
Feature activation-6.02e-02

Pos logit contributions
umenthal+0.373
utra+0.363
redit+0.343
ople+0.328
��+0.327
Neg logit contributions
hower-0.422
 Synd-0.401
 NIC-0.389
 conduc-0.385
 Sting-0.384
Token
Token ID447
Feature activation-3.59e-02

Pos logit contributions
umenthal+0.113
utra+0.109
redit+0.105
��+0.100
ounty+0.100
Neg logit contributions
hower-0.109
 Synd-0.102
 NIC-0.098
 conduc-0.098
 Sting-0.098
Token P
Token ID350
Feature activation+4.30e-02

Pos logit contributions
hower+0.147
 Synd+0.138
 NIC+0.134
 Sting+0.133
 Huck+0.132
Neg logit contributions
umenthal-0.126
utra-0.120
redit-0.114
��-0.110
ounty-0.109
Tokenilar
Token ID1794
Feature activation-4.25e-01

Pos logit contributions
hower+0.086
 Synd+0.081
 NIC+0.079
 Sting+0.079
 conduc+0.079
Neg logit contributions
umenthal-0.073
utra-0.070
redit-0.067
ople-0.063
��-0.063
Token among
Token ID1871
Feature activation+2.43e-01

Pos logit contributions
umenthal+0.821
utra+0.796
redit+0.752
ounty+0.730
ople+0.729
Neg logit contributions
hower-0.741
 Synd-0.697
 NIC-0.672
 conduc-0.663
 Sting-0.662
Token the
Token ID262
Feature activation-1.31e-02

Pos logit contributions
hower+0.374
 Synd+0.345
 NIC+0.332
 Sting+0.330
 conduc+0.325
Neg logit contributions
umenthal-0.526
utra-0.505
redit-0.486
��-0.473
ounty-0.468
Token az
Token ID35560
Feature activation-5.16e-03

Pos logit contributions
umenthal+0.024
utra+0.023
redit+0.022
��+0.021
ounty+0.021
Neg logit contributions
hower-0.024
 Synd-0.023
 NIC-0.022
 Sting-0.022
 conduc-0.021
Tokenure
Token ID495
Feature activation-9.62e-01

Pos logit contributions
umenthal+0.007
utra+0.007
redit+0.007
ounty+0.006
ople+0.006
Neg logit contributions
hower-0.012
 Synd-0.011
 NIC-0.011
 Sting-0.011
 conduc-0.011
Token channels
Token ID9619
Feature activation+1.49e-01

Pos logit contributions
umenthal+1.728
utra+1.651
redit+1.550
yx+1.524
ounty+1.510
Neg logit contributions
hower-1.812
 Synd-1.658
 NIC-1.611
 conduc-1.606
 Sting-1.605
Token and
Token ID290
Feature activation-1.11e-01

Pos logit contributions
hower+0.232
 Synd+0.216
 NIC+0.208
 Sting+0.207
 Huck+0.204
Neg logit contributions
umenthal-0.318
utra-0.306
redit-0.294
��-0.285
ounty-0.284
Token white
Token ID2330
Feature activation-1.87e-01

Pos logit contributions
umenthal+0.213
utra+0.205
redit+0.194
��+0.189
ounty+0.187
Neg logit contributions
hower-0.197
 Synd-0.182
 NIC-0.176
 Sting-0.175
 Huck-0.173
Token-
Token ID12
Feature activation+2.49e-02

Pos logit contributions
umenthal+0.329
utra+0.313
redit+0.297
��+0.286
ople+0.285
Neg logit contributions
hower-0.363
 Synd-0.339
 NIC-0.328
 Sting-0.327
 Spac-0.326
Tokensand
Token ID38142
Feature activation-2.62e-01

Pos logit contributions
hower+0.048
 Synd+0.046
 NIC+0.045
 Sting+0.045
 conduc+0.044
Neg logit contributions
umenthal-0.043
utra-0.041
redit-0.040
��-0.038
ounty-0.038
Token front
Token ID2166
Feature activation-6.38e-01

Pos logit contributions
umenthal+0.109
utra+0.105
redit+0.101
ounty+0.097
ople+0.097
Neg logit contributions
hower-0.091
 Synd-0.085
 NIC-0.083
 Sting-0.081
 Spac-0.081
Token and
Token ID290
Feature activation-1.02e-01

Pos logit contributions
umenthal+1.271
utra+1.220
redit+1.171
��+1.130
ople+1.121
Neg logit contributions
hower-1.093
 Synd-1.020
 NIC-0.981
 Sting-0.979
 Spac-0.962
Token center
Token ID3641
Feature activation-1.19e-01

Pos logit contributions
umenthal+0.193
utra+0.186
redit+0.181
ople+0.172
ounty+0.169
Neg logit contributions
hower-0.186
 Synd-0.170
 NIC-0.165
 conduc-0.164
 Sting-0.164
Token.
Token ID13
Feature activation-7.32e-02

Pos logit contributions
umenthal+0.254
utra+0.246
redit+0.236
ounty+0.227
ople+0.226
Neg logit contributions
hower-0.184
 Synd-0.172
 NIC-0.166
 Sting-0.165
 Spac-0.162
Token Dec
Token ID4280
Feature activation+1.33e-01

Pos logit contributions
umenthal+0.135
utra+0.131
redit+0.125
ople+0.120
ounty+0.120
Neg logit contributions
hower-0.133
 conduc-0.123
 Synd-0.121
 NIC-0.118
wal-0.117
Tokenades
Token ID2367
Feature activation-9.52e-01

Pos logit contributions
hower+0.277
 Synd+0.263
 NIC+0.256
 Sting+0.255
 Spac+0.252
Neg logit contributions
umenthal-0.217
utra-0.207
redit-0.199
ounty-0.187
��-0.186
Token ago
Token ID2084
Feature activation+5.15e-02

Pos logit contributions
umenthal+1.865
utra+1.827
redit+1.736
ounty+1.671
��+1.642
Neg logit contributions
hower-1.596
 Synd-1.529
 NIC-1.471
 Spac-1.450
 Sting-1.444
Token,
Token ID11
Feature activation-2.44e-02

Pos logit contributions
hower+0.072
 Synd+0.066
 NIC+0.064
 Sting+0.063
 Spac+0.062
Neg logit contributions
umenthal-0.118
utra-0.114
redit-0.110
��-0.107
ounty-0.106
Token scholars
Token ID9123
Feature activation-4.76e-01

Pos logit contributions
umenthal+0.051
utra+0.049
redit+0.047
��+0.045
ounty+0.045
Neg logit contributions
hower-0.039
 Synd-0.036
 NIC-0.034
 conduc-0.034
 Spac-0.034
Token demolished
Token ID33359
Feature activation+1.46e-01

Pos logit contributions
umenthal+0.904
utra+0.872
redit+0.843
��+0.803
ople+0.797
Neg logit contributions
hower-0.849
 Synd-0.795
 NIC-0.762
 conduc-0.760
 Sting-0.758
Token claims
Token ID3667
Feature activation-4.15e-02

Pos logit contributions
hower+0.232
 Synd+0.216
 NIC+0.207
 Sting+0.205
 Spac+0.205
Neg logit contributions
umenthal-0.306
utra-0.296
redit-0.285
��-0.276
ounty-0.272
Token.
Token ID13
Feature activation+4.64e-02

Pos logit contributions
umenthal+0.205
utra+0.202
redit+0.192
ople+0.187
ounty+0.186
Neg logit contributions
hower-0.154
 conduc-0.152
 Synd-0.139
 Sting-0.135
 Huck-0.135
Tokenyoutube
Token ID11604
Feature activation-2.58e-01

Pos logit contributions
hower+0.088
 Synd+0.084
 NIC+0.082
 Sting+0.082
 Huck+0.081
Neg logit contributions
umenthal-0.082
utra-0.079
redit-0.076
ople-0.072
��-0.072
Token.
Token ID13
Feature activation+6.73e-02

Pos logit contributions
umenthal+0.537
utra+0.533
redit+0.508
ople+0.493
ounty+0.490
Neg logit contributions
 conduc-0.410
hower-0.409
 Synd-0.368
 Huck-0.361
 NIC-0.360
Tokencom
Token ID785
Feature activation-7.54e-02

Pos logit contributions
hower+0.153
 conduc+0.146
 Synd+0.145
 NIC+0.142
 Sting+0.142
Neg logit contributions
umenthal-0.096
utra-0.092
redit-0.087
ople-0.082
ounty-0.080
Token/
Token ID14
Feature activation+1.18e-01

Pos logit contributions
umenthal+0.146
utra+0.139
redit+0.134
��+0.130
ounty+0.128
Neg logit contributions
hower-0.132
 Synd-0.124
 NIC-0.121
 Sting-0.120
 Spac-0.119
Tokenchannel
Token ID17620
Feature activation-9.45e-01

Pos logit contributions
hower+0.256
 Synd+0.243
 NIC+0.239
 Sting+0.237
 Spac+0.235
Neg logit contributions
umenthal-0.180
utra-0.170
redit-0.163
��-0.154
ounty-0.152
Token/
Token ID14
Feature activation+9.39e-02

Pos logit contributions
umenthal+1.592
utra+1.557
redit+1.509
ople+1.435
ounty+1.427
Neg logit contributions
 conduc-1.876
hower-1.872
 Synd-1.718
 NIC-1.678
 Huck-1.670
TokenUC
Token ID9598
Feature activation+2.17e-01

Pos logit contributions
hower+0.150
 Synd+0.142
 NIC+0.136
 conduc+0.135
 Sting+0.135
Neg logit contributions
umenthal-0.197
utra-0.190
redit-0.184
ople-0.178
��-0.178
Token_
Token ID62
Feature activation+1.15e-01

Pos logit contributions
hower+0.437
 Synd+0.415
 NIC+0.404
 Sting+0.402
 conduc+0.397
Neg logit contributions
umenthal-0.367
utra-0.351
redit-0.335
��-0.319
ounty-0.317
TokenK
Token ID42
Feature activation+9.62e-02

Pos logit contributions
hower+0.228
 Synd+0.217
 NIC+0.211
 Sting+0.210
 conduc+0.208
Neg logit contributions
umenthal-0.197
utra-0.189
redit-0.180
��-0.172
ounty-0.171
Tokenk
Token ID74
Feature activation-2.43e-01

Pos logit contributions
hower+0.194
 Synd+0.184
 NIC+0.179
 Sting+0.178
 conduc+0.176
Neg logit contributions
umenthal-0.162
utra-0.155
redit-0.148
��-0.141
ounty-0.139
Token =
Token ID796
Feature activation-4.85e-02

Pos logit contributions
umenthal+0.206
utra+0.199
redit+0.191
ounty+0.183
��+0.183
Neg logit contributions
hower-0.205
 Synd-0.193
 conduc-0.185
 Sting-0.185
 NIC-0.184
Token NO
Token ID8005
Feature activation+4.05e-02

Pos logit contributions
umenthal+0.097
utra+0.093
redit+0.090
��+0.087
ounty+0.086
Neg logit contributions
hower-0.082
 Synd-0.076
 Sting-0.072
 NIC-0.072
 Spac-0.072
TokenBLE
Token ID19146
Feature activation-2.61e-01

Pos logit contributions
hower+0.065
 Synd+0.062
 conduc+0.058
 Sting+0.058
 Spac+0.058
Neg logit contributions
umenthal-0.083
utra-0.080
redit-0.077
��-0.076
ounty-0.075
Token (
Token ID357
Feature activation-4.06e-02

Pos logit contributions
umenthal+0.533
utra+0.513
redit+0.499
��+0.486
ounty+0.480
Neg logit contributions
hower-0.417
 Synd-0.397
 Sting-0.375
 conduc-0.374
 Spac-0.371
Tokens
Token ID82
Feature activation-2.21e-01

Pos logit contributions
umenthal+0.072
utra+0.070
redit+0.067
ounty+0.064
��+0.064
Neg logit contributions
hower-0.077
 Synd-0.073
 NIC-0.071
 Sting-0.070
 conduc-0.070
Tokenounds
Token ID3733
Feature activation-9.44e-01

Pos logit contributions
umenthal+0.356
utra+0.345
redit+0.323
ople+0.309
ounty+0.309
Neg logit contributions
hower-0.460
 Synd-0.438
 NIC-0.427
 Sting-0.424
 Spac-0.421
Token like
Token ID588
Feature activation-4.05e-03

Pos logit contributions
umenthal+1.883
utra+1.820
redit+1.736
��+1.674
ounty+1.650
Neg logit contributions
hower-1.574
 Synd-1.451
wal-1.374
 Spac-1.373
 NIC-1.368
Token definition
Token ID6770
Feature activation+3.27e-02

Pos logit contributions
umenthal+0.008
utra+0.008
redit+0.008
��+0.007
ounty+0.007
Neg logit contributions
hower-0.007
 Synd-0.006
 NIC-0.006
 Sting-0.006
wal-0.006
Token by
Token ID416
Feature activation-1.04e-01

Pos logit contributions
hower+0.052
 Synd+0.048
 conduc+0.046
 Sting+0.046
 NIC+0.045
Neg logit contributions
umenthal-0.068
utra-0.066
redit-0.063
��-0.062
ounty-0.061
Token example
Token ID1672
Feature activation-6.85e-02

Pos logit contributions
umenthal+0.211
utra+0.204
redit+0.198
��+0.191
ople+0.188
Neg logit contributions
hower-0.174
 Synd-0.158
 conduc-0.152
 NIC-0.151
 Sting-0.151
Token,
Token ID11
Feature activation-2.44e-01

Pos logit contributions
umenthal+0.151
utra+0.146
redit+0.141
��+0.136
ounty+0.136
Neg logit contributions
hower-0.101
 Synd-0.092
 conduc-0.088
 NIC-0.087
 Sting-0.087
Token the
Token ID262
Feature activation-1.29e-02

Pos logit contributions
umenthal+0.353
utra+0.338
redit+0.331
ounty+0.319
ople+0.318
Neg logit contributions
hower-0.263
 Synd-0.244
 conduc-0.235
 NIC-0.233
 Sting-0.233
Token polls
Token ID9231
Feature activation+3.45e-01

Pos logit contributions
umenthal+0.027
utra+0.027
redit+0.026
ounty+0.025
ople+0.025
Neg logit contributions
hower-0.020
 Synd-0.018
 conduc-0.017
 Sting-0.017
 Huck-0.017
Token,
Token ID11
Feature activation-4.70e-02

Pos logit contributions
hower+0.572
 Synd+0.534
 NIC+0.516
 Sting+0.510
 Huck+0.509
Neg logit contributions
umenthal-0.697
utra-0.679
redit-0.653
ople-0.627
ounty-0.627
Token will
Token ID481
Feature activation-2.33e-02

Pos logit contributions
umenthal+0.088
utra+0.085
redit+0.083
ounty+0.079
ople+0.078
Neg logit contributions
hower-0.085
 Synd-0.079
 NIC-0.077
 conduc-0.076
 Spac-0.076
Token cons
Token ID762
Feature activation-1.66e-01

Pos logit contributions
umenthal+0.044
utra+0.042
redit+0.041
ople+0.040
ounty+0.039
Neg logit contributions
hower-0.042
 Synd-0.039
 Huck-0.038
 conduc-0.037
 Spac-0.037
Tokenign
Token ID570
Feature activation-9.41e-01

Pos logit contributions
umenthal+0.197
utra+0.185
redit+0.175
ople+0.159
ounty+0.158
Neg logit contributions
hower-0.417
 Synd-0.401
 Sting-0.391
 NIC-0.390
 Huck-0.389
Token their
Token ID511
Feature activation+1.97e-02

Pos logit contributions
umenthal+1.680
utra+1.609
redit+1.547
ounty+1.486
ople+1.452
Neg logit contributions
hower-1.780
 Synd-1.652
 NIC-1.599
 conduc-1.589
 Spac-1.579
Token votes
Token ID5690
Feature activation-3.18e-02

Pos logit contributions
hower+0.035
 Synd+0.032
 Spac+0.031
 conduc+0.030
 NIC+0.030
Neg logit contributions
umenthal-0.038
utra-0.036
redit-0.035
ounty-0.034
ople-0.034
Token into
Token ID656
Feature activation+1.56e-01

Pos logit contributions
umenthal+0.061
utra+0.058
redit+0.056
ounty+0.054
ople+0.053
Neg logit contributions
hower-0.056
 Synd-0.053
 NIC-0.052
 conduc-0.051
 Sting-0.051
Token computers
Token ID9061
Feature activation-1.03e-01

Pos logit contributions
hower+0.244
 Synd+0.225
 NIC+0.217
 Sting+0.213
 conduc+0.213
Neg logit contributions
umenthal-0.333
utra-0.322
redit-0.311
ounty-0.298
��-0.298
Token that
Token ID326
Feature activation-1.64e-02

Pos logit contributions
umenthal+0.206
utra+0.198
redit+0.191
ounty+0.182
ople+0.181
Neg logit contributions
hower-0.175
 Synd-0.163
 NIC-0.157
 Huck-0.155
 conduc-0.155
Token4
Token ID19
Feature activation-5.61e-02

Pos logit contributions
umenthal+0.117
utra+0.109
redit+0.105
��+0.100
ounty+0.098
Neg logit contributions
hower-0.171
 Synd-0.163
 NIC-0.159
 Sting-0.159
wal-0.158
Token.
Token ID13
Feature activation+2.21e-01

Pos logit contributions
umenthal+0.116
utra+0.115
redit+0.109
ople+0.106
ounty+0.106
Neg logit contributions
hower-0.091
 conduc-0.090
 Synd-0.082
 Huck-0.079
 Sting-0.078
Token Make
Token ID6889
Feature activation-6.07e-02

Pos logit contributions
hower+0.384
 Synd+0.350
 conduc+0.349
 NIC+0.338
wal+0.334
Neg logit contributions
umenthal-0.421
utra-0.417
redit-0.399
ople-0.382
ounty-0.381
Token 6
Token ID718
Feature activation-1.82e-01

Pos logit contributions
umenthal+0.120
utra+0.116
redit+0.111
��+0.108
ople+0.107
Neg logit contributions
hower-0.104
 Synd-0.097
 NIC-0.094
 Sting-0.093
 conduc-0.092
Token p
Token ID279
Feature activation-3.02e-01

Pos logit contributions
umenthal+0.312
utra+0.301
redit+0.285
��+0.277
ounty+0.276
Neg logit contributions
hower-0.360
 Synd-0.337
 NIC-0.326
 Sting-0.325
 conduc-0.324
Tokenatt
Token ID1078
Feature activation-9.41e-01

Pos logit contributions
umenthal+0.511
utra+0.487
redit+0.468
ople+0.452
ounty+0.449
Neg logit contributions
hower-0.610
 Synd-0.573
 NIC-0.557
 Sting-0.557
 conduc-0.552
Tokenies
Token ID444
Feature activation-8.91e-02

Pos logit contributions
umenthal+1.888
utra+1.828
redit+1.777
ople+1.733
ounty+1.696
Neg logit contributions
hower-1.554
 Synd-1.458
 conduc-1.419
 Sting-1.400
 NIC-1.381
Token.
Token ID13
Feature activation+1.21e-01

Pos logit contributions
umenthal+0.179
utra+0.177
redit+0.168
ounty+0.163
ople+0.162
Neg logit contributions
hower-0.144
 Synd-0.136
 conduc-0.131
 Sting-0.129
 NIC-0.129
Token
Token ID198
Feature activation-1.39e-01

Pos logit contributions
hower+0.206
 conduc+0.189
 Synd+0.188
 NIC+0.180
wal+0.179
Neg logit contributions
umenthal-0.233
utra-0.232
redit-0.222
ople-0.213
ounty-0.212
Token
Token ID198
Feature activation-1.05e-01

Pos logit contributions
umenthal+0.268
utra+0.264
redit+0.251
ople+0.244
ounty+0.243
Neg logit contributions
hower-0.243
 conduc-0.238
 Synd-0.219
 Huck-0.212
 Sting-0.211
Token5
Token ID20
Feature activation+6.95e-02

Pos logit contributions
umenthal+0.148
utra+0.137
redit+0.131
��+0.124
ople+0.120
Neg logit contributions
hower-0.243
 Synd-0.231
 NIC-0.226
 Sting-0.225
wal-0.224
Token sound
Token ID2128
Feature activation-3.06e-01

Pos logit contributions
hower+0.134
 Synd+0.124
 NIC+0.119
 Sting+0.119
wal+0.119
Neg logit contributions
umenthal-0.149
utra-0.144
redit-0.140
��-0.134
ople-0.133
Token system
Token ID1080
Feature activation-2.55e-02

Pos logit contributions
umenthal+0.523
utra+0.499
redit+0.483
��+0.459
ople+0.455
Neg logit contributions
hower-0.596
 Synd-0.570
 Sting-0.552
 Spac-0.550
 NIC-0.547
Token.
Token ID13
Feature activation+3.35e-02

Pos logit contributions
umenthal+0.054
utra+0.052
redit+0.051
��+0.049
ople+0.049
Neg logit contributions
hower-0.040
 Synd-0.037
 NIC-0.035
 Sting-0.035
 Spac-0.035
Token
Token ID198
Feature activation-1.57e-01

Pos logit contributions
hower+0.059
 Synd+0.055
 NIC+0.053
 conduc+0.052
 Sting+0.052
Neg logit contributions
umenthal-0.065
utra-0.062
redit-0.060
��-0.058
ople-0.057
Token
Token ID198
Feature activation-1.19e-01

Pos logit contributions
umenthal+0.304
utra+0.299
redit+0.283
ople+0.276
ounty+0.275
Neg logit contributions
hower-0.275
 conduc-0.269
 Synd-0.247
 Huck-0.239
 Sting-0.239
TokenRated
Token ID15322
Feature activation+1.09e+00

Pos logit contributions
umenthal+0.190
utra+0.185
redit+0.176
ounty+0.167
ople+0.166
Neg logit contributions
hower-0.249
 conduc-0.239
 Synd-0.233
 Sting-0.228
 NIC-0.227
Token 5
Token ID642
Feature activation-2.10e-01

Pos logit contributions
hower+2.092
 Synd+1.983
 NIC+1.926
 Sting+1.916
 Spac+1.893
Neg logit contributions
umenthal-1.934
utra-1.839
redit-1.764
��-1.698
ounty-1.674
Token out
Token ID503
Feature activation-2.03e-01

Pos logit contributions
umenthal+0.427
utra+0.422
redit+0.407
yx+0.392
ople+0.390
Neg logit contributions
hower-0.343
 conduc-0.331
 Synd-0.303
 Huck-0.297
 Sting-0.293
Token of
Token ID286
Feature activation-9.50e-02

Pos logit contributions
umenthal+0.377
utra+0.369
redit+0.349
ople+0.337
ounty+0.333
Neg logit contributions
hower-0.371
 conduc-0.357
 Synd-0.341
 NIC-0.329
 Sting-0.328
Token 5
Token ID642
Feature activation-1.95e-01

Pos logit contributions
umenthal+0.145
utra+0.140
redit+0.133
ople+0.128
ounty+0.126
Neg logit contributions
hower-0.204
 conduc-0.195
 Synd-0.191
 Sting-0.185
 Huck-0.185
Token by
Token ID416
Feature activation-1.77e-01

Pos logit contributions
umenthal+0.413
utra+0.410
redit+0.394
ople+0.383
yx+0.379
Neg logit contributions
hower-0.305
 conduc-0.297
 Synd-0.268
 NIC-0.257
 Sting-0.257
Token way
Token ID835
Feature activation-2.43e-02

Pos logit contributions
umenthal+0.022
utra+0.021
redit+0.021
��+0.020
ounty+0.020
Neg logit contributions
hower-0.018
 Synd-0.017
 Sting-0.016
 Huck-0.016
 conduc-0.016
Token I
Token ID314
Feature activation-1.51e-01

Pos logit contributions
umenthal+0.048
utra+0.047
redit+0.046
��+0.044
ounty+0.043
Neg logit contributions
hower-0.041
 Synd-0.037
 Huck-0.036
 Sting-0.036
 conduc-0.036
Token'm
Token ID1101
Feature activation+1.68e-02

Pos logit contributions
umenthal+0.282
utra+0.272
redit+0.266
ople+0.251
ounty+0.250
Neg logit contributions
hower-0.277
 Synd-0.255
 NIC-0.245
 conduc-0.245
 Sting-0.245
Token building
Token ID2615
Feature activation-7.01e-02

Pos logit contributions
hower+0.030
 Synd+0.027
 NIC+0.026
 conduc+0.026
 Huck+0.026
Neg logit contributions
umenthal-0.032
utra-0.031
redit-0.030
��-0.029
ople-0.029
Token this
Token ID428
Feature activation+3.17e-02

Pos logit contributions
umenthal+0.143
utra+0.139
redit+0.135
��+0.129
ounty+0.127
Neg logit contributions
hower-0.116
 Synd-0.105
 NIC-0.101
 conduc-0.101
 Sting-0.101
Token ROM
Token ID21224
Feature activation+6.69e-01

Pos logit contributions
hower+0.052
 Synd+0.047
 Sting+0.045
 conduc+0.045
 NIC+0.045
Neg logit contributions
umenthal-0.064
utra-0.063
redit-0.061
��-0.058
ople-0.058
Token,
Token ID11
Feature activation-6.60e-02

Pos logit contributions
hower+1.049
 Synd+0.966
 NIC+0.931
 Sting+0.924
 conduc+0.914
Neg logit contributions
umenthal-1.426
utra-1.377
redit-1.332
��-1.288
ople-1.272
Token moves
Token ID6100
Feature activation+4.46e-01

Pos logit contributions
umenthal+0.130
utra+0.126
redit+0.121
��+0.116
ople+0.116
Neg logit contributions
hower-0.114
 Synd-0.104
 conduc-0.101
 NIC-0.100
 Sting-0.100
Token /
Token ID1220
Feature activation-9.52e-02

Pos logit contributions
hower+0.716
 Synd+0.670
 Sting+0.644
 NIC+0.639
 Huck+0.634
Neg logit contributions
umenthal-0.927
utra-0.897
redit-0.865
��-0.833
ople-0.822
Tokens
Token ID82
Feature activation-1.35e-02

Pos logit contributions
umenthal+0.159
utra+0.153
redit+0.148
��+0.140
ople+0.139
Neg logit contributions
hower-0.192
 Synd-0.180
 Sting-0.174
 NIC-0.174
 Huck-0.173
Tokenbin
Token ID8800
Feature activation-9.07e-02

Pos logit contributions
umenthal+0.027
utra+0.026
redit+0.025
ople+0.024
ounty+0.024
Neg logit contributions
hower-0.023
 Synd-0.021
 NIC-0.021
 Sting-0.021
 Huck-0.020
Token grappling
Token ID36762
Feature activation-5.11e-01

Pos logit contributions
hower+0.132
 Synd+0.121
 NIC+0.118
 Sting+0.116
 conduc+0.116
Neg logit contributions
umenthal-0.182
utra-0.176
redit-0.170
��-0.164
ople-0.163
Token hook
Token ID8011
Feature activation+2.02e-01

Pos logit contributions
umenthal+0.998
utra+0.963
redit+0.924
ounty+0.890
ople+0.887
Neg logit contributions
hower-0.873
 Synd-0.801
 NIC-0.774
 Huck-0.759
 Spac-0.757
Token aw
Token ID3253
Feature activation-2.98e-02

Pos logit contributions
hower+0.334
 Synd+0.304
 NIC+0.296
 Sting+0.289
 conduc+0.289
Neg logit contributions
umenthal-0.412
utra-0.398
redit-0.384
��-0.372
ople-0.369
Tokenes
Token ID274
Feature activation+1.97e-01

Pos logit contributions
umenthal+0.053
utra+0.050
redit+0.049
ounty+0.046
��+0.046
Neg logit contributions
hower-0.058
 Synd-0.054
 NIC-0.053
 Sting-0.053
 Spac-0.052
Tokenomen
Token ID3674
Feature activation-1.85e-01

Pos logit contributions
hower+0.473
 Synd+0.455
 NIC+0.449
 Sting+0.445
 Spac+0.440
Neg logit contributions
umenthal-0.251
utra-0.237
redit-0.222
��-0.209
ounty-0.207
Tokeness
Token ID408
Feature activation+6.46e-01

Pos logit contributions
umenthal+0.241
utra+0.228
redit+0.214
ople+0.202
ounty+0.198
Neg logit contributions
hower-0.442
 Synd-0.426
 conduc-0.423
 NIC-0.418
 Sting-0.414
Token!
Token ID0
Feature activation-1.65e-01

Pos logit contributions
hower+0.954
 Synd+0.883
 NIC+0.854
 Sting+0.844
 conduc+0.830
Neg logit contributions
umenthal-1.434
utra-1.382
redit-1.336
��-1.295
ounty-1.284
Token Having
Token ID11136
Feature activation+1.13e-01

Pos logit contributions
umenthal+0.308
utra+0.295
redit+0.283
��+0.274
ounty+0.270
Neg logit contributions
hower-0.300
 Synd-0.279
 NIC-0.271
 Sting-0.269
 conduc-0.269
Token done
Token ID1760
Feature activation+3.17e-01

Pos logit contributions
hower+0.188
 Synd+0.169
 NIC+0.165
 conduc+0.163
 Sting+0.162
Neg logit contributions
umenthal-0.233
utra-0.224
redit-0.216
��-0.208
ounty-0.206
Token any
Token ID597
Feature activation-4.36e-02

Pos logit contributions
hower+0.513
 Synd+0.468
 NIC+0.453
 Sting+0.449
 conduc+0.445
Neg logit contributions
umenthal-0.661
utra-0.637
redit-0.619
��-0.595
ople-0.589
Token%
Token ID4
Feature activation-1.66e-01

Pos logit contributions
umenthal+0.088
utra+0.084
redit+0.082
��+0.080
ounty+0.078
Neg logit contributions
hower-0.073
 Synd-0.067
 NIC-0.065
 Sting-0.065
 conduc-0.064
Token()
Token ID3419
Feature activation-6.53e-02

Pos logit contributions
umenthal+0.069
utra+0.066
redit+0.062
ople+0.059
ounty+0.058
Neg logit contributions
hower-0.118
 conduc-0.113
 Synd-0.111
 Sting-0.107
 NIC-0.107
Token'
Token ID6
Feature activation-1.83e-01

Pos logit contributions
umenthal+0.114
utra+0.108
redit+0.104
��+0.100
ounty+0.098
Neg logit contributions
hower-0.127
 Synd-0.121
 NIC-0.119
 Sting-0.117
 Spac-0.116
Tokens
Token ID82
Feature activation-7.86e-02

Pos logit contributions
umenthal+0.318
utra+0.308
redit+0.292
ople+0.281
ounty+0.280
Neg logit contributions
hower-0.363
 conduc-0.346
 Synd-0.334
 NIC-0.325
 Huck-0.324
Token frame
Token ID5739
Feature activation+2.34e-01

Pos logit contributions
umenthal+0.144
utra+0.138
redit+0.132
��+0.128
ople+0.126
Neg logit contributions
hower-0.146
 Synd-0.137
 Sting-0.132
 NIC-0.131
 conduc-0.131
Token |
Token ID930
Feature activation+7.92e-02

Pos logit contributions
hower+0.336
 Synd+0.317
 NIC+0.306
 Sting+0.303
 Spac+0.298
Neg logit contributions
umenthal-0.530
utra-0.508
redit-0.493
��-0.476
ounty-0.471
Token void
Token ID7951
Feature activation+6.40e-01

Pos logit contributions
hower+0.156
 Synd+0.147
 NIC+0.142
 Sting+0.142
 conduc+0.140
Neg logit contributions
umenthal-0.137
utra-0.130
redit-0.125
��-0.120
ounty-0.118
Token *
Token ID1635
Feature activation+2.49e-01

Pos logit contributions
hower+0.990
 Synd+0.920
 NIC+0.886
 Sting+0.879
 conduc+0.867
Neg logit contributions
umenthal-1.381
utra-1.326
redit-1.282
��-1.241
ounty-1.231
Tokenarg
Token ID853
Feature activation-4.58e-01

Pos logit contributions
hower+0.556
 Synd+0.532
 NIC+0.519
 Sting+0.517
wal+0.516
Neg logit contributions
umenthal-0.360
utra-0.337
redit-0.320
��-0.305
ounty-0.298
Token |
Token ID930
Feature activation+1.45e-01

Pos logit contributions
umenthal+0.874
utra+0.846
redit+0.805
ople+0.776
yx+0.776
Neg logit contributions
hower-0.824
 conduc-0.779
 Synd-0.732
 Huck-0.709
 Sting-0.706
Token +
Token ID1343
Feature activation+1.32e-01

Pos logit contributions
hower+0.193
 Synd+0.179
 NIC+0.172
 Sting+0.171
 Spac+0.168
Neg logit contributions
umenthal-0.345
utra-0.332
redit-0.322
��-0.312
ounty-0.309
Token--------------------
Token ID19351
Feature activation+1.81e-01

Pos logit contributions
hower+0.257
 Synd+0.243
 conduc+0.241
 Sting+0.236
 Huck+0.235
Neg logit contributions
umenthal-0.230
utra-0.222
redit-0.209
ounty-0.202
��-0.200
Token |
Token ID930
Feature activation+1.63e-01

Pos logit contributions
umenthal+0.868
utra+0.834
redit+0.803
��+0.778
ounty+0.769
Neg logit contributions
hower-0.714
 Synd-0.672
 NIC-0.650
 Sting-0.648
 Spac-0.638
Token +
Token ID1343
Feature activation+2.01e-01

Pos logit contributions
hower+0.262
 Synd+0.244
 NIC+0.234
 Sting+0.233
 conduc+0.230
Neg logit contributions
umenthal-0.339
utra-0.326
redit-0.315
��-0.306
ounty-0.302
Token--------------------
Token ID19351
Feature activation+2.58e-01

Pos logit contributions
hower+0.391
 Synd+0.368
 conduc+0.363
 Sting+0.358
 NIC+0.356
Neg logit contributions
umenthal-0.354
utra-0.340
redit-0.322
ounty-0.310
��-0.308
Token--+
Token ID44785
Feature activation+1.49e-03

Pos logit contributions
hower+0.466
 Synd+0.441
 conduc+0.438
 Sting+0.431
 NIC+0.427
Neg logit contributions
umenthal-0.481
utra-0.470
redit-0.446
ounty-0.434
ople-0.430
Token |
Token ID930
Feature activation+2.86e-01

Pos logit contributions
hower+0.002
 Synd+0.002
 NIC+0.002
 Sting+0.002
 Spac+0.002
Neg logit contributions
umenthal-0.003
utra-0.003
redit-0.003
��-0.003
ounty-0.003
Token void
Token ID7951
Feature activation+7.27e-01

Pos logit contributions
hower+0.506
 Synd+0.475
 NIC+0.459
 Sting+0.457
 Spac+0.451
Neg logit contributions
umenthal-0.553
utra-0.528
redit-0.509
��-0.493
ounty-0.486
Token *
Token ID1635
Feature activation+2.80e-01

Pos logit contributions
hower+1.022
 Synd+0.960
 NIC+0.928
 Sting+0.923
 Spac+0.906
Neg logit contributions
umenthal-1.646
utra-1.590
redit-1.536
��-1.494
ounty-1.462
Tokenbuf
Token ID29325
Feature activation-5.68e-01

Pos logit contributions
hower+0.626
 Synd+0.598
 NIC+0.583
 Sting+0.580
wal+0.578
Neg logit contributions
umenthal-0.409
utra-0.383
redit-0.363
��-0.347
ounty-0.339
Token |
Token ID930
Feature activation+2.25e-01

Pos logit contributions
umenthal+1.153
utra+1.102
redit+1.066
��+1.031
ounty+1.023
Neg logit contributions
hower-0.948
 Synd-0.883
 NIC-0.852
 Sting-0.847
 Spac-0.838
Token +
Token ID1343
Feature activation+2.54e-01

Pos logit contributions
hower+0.333
 Synd+0.309
 NIC+0.296
 Sting+0.294
 Spac+0.289
Neg logit contributions
umenthal-0.498
utra-0.479
redit-0.464
��-0.451
ounty-0.446
Token--------------------
Token ID19351
Feature activation+2.72e-01

Pos logit contributions
hower+0.492
 Synd+0.465
 conduc+0.459
 Sting+0.452
 NIC+0.448
Neg logit contributions
umenthal-0.449
utra-0.431
redit-0.408
ounty-0.393
��-0.390
Token when
Token ID618
Feature activation+1.72e-02

Pos logit contributions
hower+0.190
 Synd+0.176
 NIC+0.169
 conduc+0.169
 Sting+0.168
Neg logit contributions
umenthal-0.242
utra-0.231
redit-0.224
��-0.215
ounty-0.214
Token many
Token ID867
Feature activation+8.13e-02

Pos logit contributions
hower+0.028
 Synd+0.026
 NIC+0.025
 conduc+0.025
 Spac+0.024
Neg logit contributions
umenthal-0.036
utra-0.034
redit-0.033
��-0.032
ounty-0.032
Token politicians
Token ID7602
Feature activation-3.96e-01

Pos logit contributions
hower+0.140
 Synd+0.130
 Spac+0.124
 Sting+0.124
 NIC+0.124
Neg logit contributions
umenthal-0.161
utra-0.154
redit-0.149
��-0.143
ounty-0.142
Token shy
Token ID15800
Feature activation+4.39e-02

Pos logit contributions
umenthal+0.747
utra+0.724
redit+0.703
ounty+0.662
ople+0.662
Neg logit contributions
hower-0.707
 Synd-0.656
 NIC-0.633
 Sting-0.631
 conduc-0.630
Token away
Token ID1497
Feature activation+9.99e-02

Pos logit contributions
hower+0.076
 Synd+0.071
 NIC+0.069
 Sting+0.068
 Spac+0.067
Neg logit contributions
umenthal-0.087
utra-0.083
redit-0.080
��-0.077
ounty-0.076
Token from
Token ID422
Feature activation-6.97e-03

Pos logit contributions
hower+0.173
 Synd+0.162
 NIC+0.157
 Sting+0.156
 Spac+0.154
Neg logit contributions
umenthal-0.197
utra-0.188
redit-0.181
��-0.175
ounty-0.173
Token the
Token ID262
Feature activation-3.90e-02

Pos logit contributions
umenthal+0.013
utra+0.013
redit+0.012
��+0.012
ounty+0.012
Neg logit contributions
hower-0.013
 Synd-0.012
 NIC-0.011
 Sting-0.011
 Spac-0.011
Token long
Token ID890
Feature activation+3.97e-01

Pos logit contributions
umenthal+0.075
utra+0.073
redit+0.069
��+0.067
ounty+0.066
Neg logit contributions
hower-0.069
 Synd-0.064
 Spac-0.061
 NIC-0.061
 Sting-0.061
Token suffering
Token ID7195
Feature activation-3.72e-01

Pos logit contributions
hower+0.741
 Synd+0.701
 NIC+0.680
 Sting+0.676
 Spac+0.670
Neg logit contributions
umenthal-0.726
utra-0.692
redit-0.665
��-0.640
ounty-0.632
Token of
Token ID286
Feature activation+1.45e-02

Pos logit contributions
umenthal+0.703
utra+0.681
redit+0.651
ounty+0.624
ople+0.623
Neg logit contributions
hower-0.658
 Synd-0.601
 Spac-0.588
 conduc-0.583
 Sting-0.581
Token our
Token ID674
Feature activation+1.83e-01

Pos logit contributions
hower+0.025
 Synd+0.023
 conduc+0.022
 Spac+0.022
 Sting+0.022
Neg logit contributions
umenthal-0.029
utra-0.027
redit-0.026
��-0.025
ounty-0.025
Token believe
Token ID1975
Feature activation+5.24e-02

Pos logit contributions
umenthal+0.015
utra+0.015
redit+0.014
ople+0.014
ounty+0.014
Neg logit contributions
hower-0.016
 Synd-0.015
 NIC-0.014
 Sting-0.014
 conduc-0.014
Token about
Token ID546
Feature activation+1.55e-02

Pos logit contributions
hower+0.074
 Synd+0.068
 NIC+0.066
 Sting+0.065
 Spac+0.064
Neg logit contributions
umenthal-0.120
utra-0.116
redit-0.112
��-0.109
ounty-0.108
Token the
Token ID262
Feature activation-7.91e-02

Pos logit contributions
hower+0.026
 Synd+0.024
 Sting+0.023
 NIC+0.023
 conduc+0.023
Neg logit contributions
umenthal-0.032
utra-0.031
redit-0.030
��-0.029
ounty-0.028
Token NDP
Token ID20392
Feature activation-1.33e-01

Pos logit contributions
umenthal+0.163
utra+0.158
redit+0.152
��+0.147
ople+0.145
Neg logit contributions
hower-0.128
 Synd-0.118
 Sting-0.114
 NIC-0.114
 Spac-0.113
Token,
Token ID11
Feature activation-5.06e-02

Pos logit contributions
umenthal+0.295
utra+0.284
redit+0.275
��+0.267
ounty+0.264
Neg logit contributions
hower-0.197
 Synd-0.183
 NIC-0.176
 Sting-0.175
 Spac-0.172
Token we
Token ID356
Feature activation-2.33e-02

Pos logit contributions
umenthal+0.104
utra+0.100
redit+0.097
��+0.094
ounty+0.093
Neg logit contributions
hower-0.083
 Synd-0.076
 Sting-0.074
 NIC-0.073
 conduc-0.073
Token
Token ID447
Feature activation-5.78e-03

Pos logit contributions
umenthal+0.046
utra+0.045
redit+0.043
��+0.041
ople+0.041
Neg logit contributions
hower-0.040
 Synd-0.037
 Sting-0.036
 NIC-0.036
 Spac-0.036
Token
Token ID247
Feature activation-1.36e-01

Pos logit contributions
umenthal+0.009
utra+0.008
redit+0.008
��+0.008
ounty+0.008
Neg logit contributions
hower-0.013
 Synd-0.012
 NIC-0.012
 conduc-0.012
 Sting-0.012
Tokenre
Token ID260
Feature activation-1.60e-01

Pos logit contributions
umenthal+0.237
utra+0.229
redit+0.221
ople+0.209
��+0.208
Neg logit contributions
hower-0.265
 Synd-0.249
 NIC-0.242
 Sting-0.241
 Spac-0.239
Token not
Token ID407
Feature activation-7.23e-02

Pos logit contributions
umenthal+0.335
utra+0.325
redit+0.315
ople+0.305
��+0.303
Neg logit contributions
hower-0.253
 Synd-0.232
 Sting-0.226
 NIC-0.226
 conduc-0.225
Token enemies
Token ID5775
Feature activation+1.81e-01

Pos logit contributions
umenthal+0.141
utra+0.136
redit+0.132
ople+0.128
��+0.127
Neg logit contributions
hower-0.125
 Synd-0.114
 NIC-0.112
 Spac-0.111
 Sting-0.111
Token
Token ID198
Feature activation+1.05e-01

Pos logit contributions
hower+0.117
 conduc+0.112
 Synd+0.105
 Sting+0.101
 NIC+0.101
Neg logit contributions
umenthal-0.135
utra-0.133
redit-0.126
ople-0.123
ounty-0.122
TokenAddress
Token ID20231
Feature activation-1.73e-01

Pos logit contributions
hower+0.281
 Synd+0.270
 NIC+0.265
 Sting+0.264
 Spac+0.262
Neg logit contributions
umenthal-0.108
utra-0.098
redit-0.091
��-0.085
ounty-0.082
Token:
Token ID25
Feature activation-6.24e-02

Pos logit contributions
umenthal+0.295
utra+0.290
redit+0.273
ople+0.261
ounty+0.259
Neg logit contributions
hower-0.343
 conduc-0.326
 Synd-0.314
 NIC-0.305
 Sting-0.304
Token Sh
Token ID911
Feature activation+1.75e-01

Pos logit contributions
umenthal+0.129
utra+0.125
redit+0.119
ople+0.115
��+0.114
Neg logit contributions
hower-0.102
 Synd-0.092
 conduc-0.091
 NIC-0.089
 Sting-0.089
Tokenri
Token ID380
Feature activation+3.27e-02

Pos logit contributions
hower+0.355
 Synd+0.337
 NIC+0.327
 Sting+0.326
 conduc+0.325
Neg logit contributions
umenthal-0.297
utra-0.284
redit-0.272
ople-0.257
ounty-0.257
Token Pre
Token ID3771
Feature activation-1.68e-02

Pos logit contributions
hower+0.052
 Synd+0.048
 conduc+0.046
 NIC+0.046
 Sting+0.046
Neg logit contributions
umenthal-0.069
utra-0.067
redit-0.064
��-0.062
ople-0.062
Tokenmb
Token ID2022
Feature activation+2.81e-01

Pos logit contributions
umenthal+0.028
utra+0.027
redit+0.026
��+0.024
ople+0.024
Neg logit contributions
hower-0.034
 Synd-0.032
 conduc-0.031
 Sting-0.031
 Spac-0.031
Tokenhi
Token ID5303
Feature activation+1.72e-01

Pos logit contributions
hower+0.575
 Synd+0.544
 Sting+0.528
 NIC+0.527
 conduc+0.526
Neg logit contributions
umenthal-0.468
utra-0.452
redit-0.428
ounty-0.404
ople-0.401
Tokenks
Token ID591
Feature activation+2.60e-01

Pos logit contributions
hower+0.251
 Synd+0.233
 NIC+0.224
 Sting+0.222
 conduc+0.220
Neg logit contributions
umenthal-0.382
utra-0.371
redit-0.358
��-0.345
ople-0.344
Tokenhu
Token ID13415
Feature activation-1.71e-01

Pos logit contributions
hower+0.529
 Synd+0.502
 NIC+0.488
 Sting+0.485
 conduc+0.480
Neg logit contributions
umenthal-0.435
utra-0.420
redit-0.397
��-0.375
ounty-0.375
Tokenji
Token ID7285
Feature activation+1.51e-01

Pos logit contributions
umenthal+0.379
utra+0.371
redit+0.356
ople+0.343
��+0.343
Neg logit contributions
hower-0.253
 Synd-0.233
 NIC-0.224
 conduc-0.222
 Sting-0.221
Token <
Token ID1279
Feature activation-6.62e-01

Pos logit contributions
umenthal+0.727
utra+0.694
redit+0.671
��+0.644
ounty+0.639
Neg logit contributions
hower-0.650
 Synd-0.618
 NIC-0.600
 Sting-0.595
 Spac-0.586
Tokenmet
Token ID4164
Feature activation-1.84e-01

Pos logit contributions
umenthal+1.184
utra+1.133
redit+1.097
ople+1.038
��+1.033
Neg logit contributions
hower-1.250
 Synd-1.187
 NIC-1.161
 Sting-1.153
 conduc-1.147
Tokenros
Token ID4951
Feature activation+3.51e-01

Pos logit contributions
umenthal+0.236
utra+0.221
redit+0.207
��+0.200
ounty+0.191
Neg logit contributions
hower-0.444
 Synd-0.425
 NIC-0.414
 Sting-0.413
wal-0.412
Tokenxual
Token ID5541
Feature activation-3.74e-01

Pos logit contributions
hower+0.611
 Synd+0.579
 conduc+0.573
 Sting+0.565
 NIC+0.565
Neg logit contributions
umenthal-0.692
utra-0.668
redit-0.638
ople-0.610
ounty-0.609
Tokenth
Token ID400
Feature activation+2.39e-01

Pos logit contributions
umenthal+0.825
utra+0.801
redit+0.765
ople+0.747
ounty+0.740
Neg logit contributions
hower-0.557
 conduc-0.557
 NIC-0.519
 Synd-0.515
 Sting-0.504
Tokenug
Token ID1018
Feature activation+1.38e-01

Pos logit contributions
hower+0.640
 Synd+0.619
 NIC+0.607
 conduc+0.607
 Sting+0.605
Neg logit contributions
umenthal-0.247
utra-0.230
redit-0.211
ounty-0.191
��-0.191
Token>
Token ID29
Feature activation-3.92e-01

Pos logit contributions
hower+0.319
 conduc+0.307
 Synd+0.301
 NIC+0.295
 Sting+0.293
Neg logit contributions
umenthal-0.194
utra-0.186
redit-0.174
ounty-0.164
ople-0.164
Token le
Token ID443
Feature activation+1.42e-01

Pos logit contributions
umenthal+0.691
utra+0.659
redit+0.633
��+0.600
ople+0.598
Neg logit contributions
hower-0.757
 Synd-0.707
 NIC-0.683
 Sting-0.679
 conduc-0.674
Tokenl
Token ID75
Feature activation-1.94e-01

Pos logit contributions
hower+0.287
 Synd+0.273
 Sting+0.266
wal+0.265
 NIC+0.265
Neg logit contributions
umenthal-0.237
utra-0.224
redit-0.215
��-0.211
ounty-0.203
Token m
Token ID285
Feature activation-2.62e-01

Pos logit contributions
umenthal+0.299
utra+0.283
redit+0.269
��+0.255
ople+0.253
Neg logit contributions
hower-0.418
 Synd-0.396
 NIC-0.386
 Sting-0.384
 Spac-0.380
Token2
Token ID17
Feature activation-9.84e-02

Pos logit contributions
umenthal+0.462
utra+0.438
redit+0.422
��+0.412
ounty+0.400
Neg logit contributions
hower-0.506
 Synd-0.478
 NIC-0.463
 Sting-0.461
wal-0.457
Token see
Token ID766
Feature activation+1.50e-02

Pos logit contributions
hower+0.037
 Synd+0.033
 conduc+0.033
 NIC+0.032
 Sting+0.032
Neg logit contributions
umenthal-0.036
utra-0.035
redit-0.034
ople-0.032
ounty-0.032
Token what
Token ID644
Feature activation-2.48e-02

Pos logit contributions
hower+0.028
 Synd+0.025
 conduc+0.025
 NIC+0.024
wal+0.024
Neg logit contributions
umenthal-0.028
utra-0.027
redit-0.026
ople-0.024
ounty-0.024
Token booths
Token ID45496
Feature activation-1.38e-01

Pos logit contributions
umenthal+0.047
utra+0.045
redit+0.043
ople+0.042
ounty+0.041
Neg logit contributions
hower-0.045
 Synd-0.041
 conduc-0.040
 NIC-0.040
 Sting-0.039
Token might
Token ID1244
Feature activation+2.11e-01

Pos logit contributions
umenthal+0.266
utra+0.257
redit+0.248
ounty+0.238
ople+0.235
Neg logit contributions
hower-0.245
 Synd-0.225
 conduc-0.218
 NIC-0.216
wal-0.215
Token be
Token ID307
Feature activation+7.32e-04

Pos logit contributions
hower+0.395
 Synd+0.371
 conduc+0.359
 Huck+0.357
 NIC+0.356
Neg logit contributions
umenthal-0.376
utra-0.366
redit-0.354
ople-0.339
ounty-0.339
Token worth
Token ID2861
Feature activation+4.18e-01

Pos logit contributions
hower+0.001
 Synd+0.001
 NIC+0.001
 Sting+0.001
 conduc+0.001
Neg logit contributions
umenthal-0.001
utra-0.001
redit-0.001
ounty-0.001
ople-0.001
Token waiting
Token ID4953
Feature activation-3.64e-01

Pos logit contributions
hower+0.698
 Synd+0.640
 NIC+0.615
 conduc+0.609
 Sting+0.609
Neg logit contributions
umenthal-0.856
utra-0.820
redit-0.790
ounty-0.763
��-0.759
Token on
Token ID319
Feature activation-2.50e-02

Pos logit contributions
umenthal+0.733
utra+0.701
redit+0.675
��+0.649
ounty+0.649
Neg logit contributions
hower-0.611
 Synd-0.569
 Sting-0.546
 NIC-0.545
 conduc-0.544
Token for
Token ID329
Feature activation-4.92e-02

Pos logit contributions
umenthal+0.053
utra+0.052
redit+0.050
ounty+0.048
ople+0.048
Neg logit contributions
hower-0.039
 Synd-0.036
 NIC-0.034
 Sting-0.034
 conduc-0.034
Token your
Token ID534
Feature activation+2.19e-01

Pos logit contributions
umenthal+0.104
utra+0.101
redit+0.098
ounty+0.094
ople+0.093
Neg logit contributions
hower-0.077
 Synd-0.069
 conduc-0.067
 NIC-0.066
 Huck-0.066
Token particular
Token ID1948
Feature activation-4.86e-01

Pos logit contributions
hower+0.378
 Synd+0.352
 NIC+0.339
 Sting+0.338
 conduc+0.336
Neg logit contributions
umenthal-0.428
utra-0.410
redit-0.394
��-0.381
ounty-0.381
TokenIt
Token ID1026
Feature activation-1.94e-02

Pos logit contributions
umenthal+0.409
utra+0.397
redit+0.372
ople+0.360
��+0.359
Neg logit contributions
hower-0.527
 conduc-0.493
 Synd-0.492
 Sting-0.478
 Huck-0.473
Token's
Token ID338
Feature activation+1.65e-01

Pos logit contributions
umenthal+0.040
utra+0.038
redit+0.037
ople+0.035
ounty+0.035
Neg logit contributions
hower-0.032
 Synd-0.030
 NIC-0.029
 Spac-0.029
 Sting-0.029
Token making
Token ID1642
Feature activation+3.96e-02

Pos logit contributions
hower+0.281
 Synd+0.259
 conduc+0.247
 Spac+0.247
 Sting+0.246
Neg logit contributions
umenthal-0.328
utra-0.322
redit-0.304
ople-0.292
��-0.289
Token a
Token ID257
Feature activation+1.58e-02

Pos logit contributions
hower+0.068
 Synd+0.062
 conduc+0.060
 Sting+0.059
 Spac+0.059
Neg logit contributions
umenthal-0.079
utra-0.076
redit-0.072
��-0.071
ople-0.070
Token very
Token ID845
Feature activation-2.48e-02

Pos logit contributions
hower+0.026
 Synd+0.025
 conduc+0.023
 Sting+0.023
 NIC+0.023
Neg logit contributions
umenthal-0.032
utra-0.031
redit-0.030
��-0.029
ople-0.028
Token powerful
Token ID3665
Feature activation-1.38e-01

Pos logit contributions
umenthal+0.048
utra+0.046
redit+0.044
��+0.043
ople+0.043
Neg logit contributions
hower-0.044
 Synd-0.041
 Sting-0.039
 Spac-0.038
 conduc-0.038
Token statement
Token ID2643
Feature activation+7.94e-01

Pos logit contributions
umenthal+0.232
utra+0.229
redit+0.216
��+0.209
ople+0.205
Neg logit contributions
hower-0.271
 Synd-0.256
 conduc-0.244
 Sting-0.242
 Spac-0.240
Token about
Token ID546
Feature activation+7.38e-02

Pos logit contributions
hower+1.381
 Synd+1.298
 NIC+1.252
 Sting+1.247
 Spac+1.231
Neg logit contributions
umenthal-1.555
utra-1.492
redit-1.434
��-1.377
ounty-1.366
Token the
Token ID262
Feature activation-1.18e-02

Pos logit contributions
hower+0.129
 Synd+0.120
 Sting+0.114
wal+0.113
 conduc+0.113
Neg logit contributions
umenthal-0.142
utra-0.139
redit-0.131
��-0.126
ople-0.125
Token importance
Token ID6817
Feature activation-2.58e-01

Pos logit contributions
umenthal+0.022
utra+0.022
redit+0.021
��+0.020
ounty+0.019
Neg logit contributions
hower-0.021
 Synd-0.020
 Sting-0.019
wal-0.019
 Spac-0.019
Token of
Token ID286
Feature activation-2.84e-02

Pos logit contributions
umenthal+0.537
utra+0.513
redit+0.497
��+0.480
ounty+0.475
Neg logit contributions
hower-0.421
 Synd-0.394
 NIC-0.382
 Sting-0.378
 Huck-0.372
Token was
Token ID373
Feature activation+1.03e-01

Pos logit contributions
hower+0.187
 Synd+0.175
 NIC+0.169
 Sting+0.166
 conduc+0.165
Neg logit contributions
umenthal-0.195
utra-0.187
redit-0.181
ounty-0.172
ople-0.172
Token be
Token ID307
Feature activation+1.23e-01

Pos logit contributions
hower+0.195
 Synd+0.181
 NIC+0.175
 Sting+0.173
 conduc+0.172
Neg logit contributions
umenthal-0.186
utra-0.179
redit-0.172
ople-0.164
ounty-0.163
Tokenheaded
Token ID15353
Feature activation+3.36e-01

Pos logit contributions
hower+0.286
 Synd+0.272
 NIC+0.265
 Sting+0.264
 Spac+0.263
Neg logit contributions
umenthal-0.169
utra-0.159
redit-0.152
ounty-0.142
ople-0.141
Token by
Token ID416
Feature activation+1.41e-01

Pos logit contributions
hower+0.528
 Synd+0.486
 NIC+0.467
 Sting+0.460
 conduc+0.460
Neg logit contributions
umenthal-0.718
utra-0.687
redit-0.666
ounty-0.639
ople-0.636
Token spectators
Token ID31704
Feature activation+2.76e-01

Pos logit contributions
hower+0.228
 Synd+0.206
 NIC+0.197
wal+0.196
 conduc+0.195
Neg logit contributions
umenthal-0.295
utra-0.284
redit-0.275
ounty-0.264
ople-0.263
Token after
Token ID706
Feature activation-1.81e-01

Pos logit contributions
hower+0.443
 Synd+0.409
 NIC+0.397
 Sting+0.390
 conduc+0.390
Neg logit contributions
umenthal-0.574
utra-0.550
redit-0.533
ople-0.513
ounty-0.513
Token he
Token ID339
Feature activation+1.21e-01

Pos logit contributions
umenthal+0.362
utra+0.348
redit+0.332
ople+0.323
ounty+0.321
Neg logit contributions
hower-0.309
 Synd-0.283
 NIC-0.273
 conduc-0.269
 Spac-0.268
Token had
Token ID550
Feature activation+1.36e-01

Pos logit contributions
hower+0.224
 Synd+0.211
 NIC+0.204
 Sting+0.201
 conduc+0.200
Neg logit contributions
umenthal-0.222
utra-0.213
redit-0.206
ounty-0.196
ople-0.195
Token stabbed
Token ID21512
Feature activation-1.78e-01

Pos logit contributions
hower+0.249
 Synd+0.227
 NIC+0.221
 conduc+0.218
 Spac+0.218
Neg logit contributions
umenthal-0.256
utra-0.248
redit-0.238
ounty-0.227
ople-0.227
Token a
Token ID257
Feature activation+2.79e-02

Pos logit contributions
umenthal+0.359
utra+0.346
redit+0.336
ople+0.322
ounty+0.321
Neg logit contributions
hower-0.304
 Synd-0.266
wal-0.257
 conduc-0.257
 NIC-0.257
Token player
Token ID2137
Feature activation+5.19e-01

Pos logit contributions
hower+0.053
 Synd+0.049
 NIC+0.047
wal+0.047
 conduc+0.047
Neg logit contributions
umenthal-0.050
utra-0.048
redit-0.046
ople-0.044
��-0.044
Token wondered
Token ID14028
Feature activation-2.82e-01

Pos logit contributions
hower+0.060
 Synd+0.056
 NIC+0.054
 Sting+0.054
 Spac+0.054
Neg logit contributions
umenthal-0.069
utra-0.066
redit-0.064
��-0.062
ople-0.061
Token why
Token ID1521
Feature activation-7.45e-02

Pos logit contributions
umenthal+0.533
utra+0.523
redit+0.496
ople+0.474
ounty+0.471
Neg logit contributions
hower-0.501
 Synd-0.459
 NIC-0.450
 conduc-0.449
 Sting-0.443
Token she
Token ID673
Feature activation-4.60e-02

Pos logit contributions
umenthal+0.138
utra+0.133
redit+0.127
��+0.122
ounty+0.121
Neg logit contributions
hower-0.137
 Synd-0.126
 conduc-0.123
 NIC-0.122
 Sting-0.121
Token was
Token ID373
Feature activation+2.88e-03

Pos logit contributions
umenthal+0.081
utra+0.079
redit+0.076
ople+0.071
ounty+0.071
Neg logit contributions
hower-0.088
 Synd-0.082
 NIC-0.079
 conduc-0.078
 Sting-0.078
Token on
Token ID319
Feature activation-2.50e-02

Pos logit contributions
hower+0.005
 Synd+0.005
 NIC+0.004
 conduc+0.004
 Spac+0.004
Neg logit contributions
umenthal-0.006
utra-0.005
redit-0.005
ople-0.005
ounty-0.005
Token the
Token ID262
Feature activation-8.08e-02

Pos logit contributions
umenthal+0.054
utra+0.052
redit+0.051
��+0.048
ounty+0.048
Neg logit contributions
hower-0.039
 Synd-0.035
 NIC-0.034
 Sting-0.034
 conduc-0.034
Token sidelines
Token ID29394
Feature activation-1.18e-02

Pos logit contributions
umenthal+0.152
utra+0.146
redit+0.141
��+0.134
ounty+0.134
Neg logit contributions
hower-0.147
 Synd-0.135
 Sting-0.131
 NIC-0.131
 conduc-0.130
Token.
Token ID13
Feature activation-2.54e-01

Pos logit contributions
umenthal+0.025
utra+0.024
redit+0.023
ounty+0.022
��+0.022
Neg logit contributions
hower-0.019
 Synd-0.018
 NIC-0.017
 Sting-0.017
 conduc-0.017
Token
Token ID198
Feature activation-8.90e-02

Pos logit contributions
umenthal+0.487
utra+0.474
redit+0.449
��+0.436
ople+0.436
Neg logit contributions
hower-0.445
 conduc-0.405
 Synd-0.404
 NIC-0.392
 Sting-0.391
Token
Token ID198
Feature activation-1.46e-01

Pos logit contributions
umenthal+0.176
utra+0.173
redit+0.164
ounty+0.159
ople+0.159
Neg logit contributions
hower-0.154
 conduc-0.147
 Synd-0.138
 Sting-0.134
 Huck-0.134
TokenM
Token ID44
Feature activation-3.63e-02

Pos logit contributions
umenthal+0.245
utra+0.237
redit+0.223
��+0.214
ounty+0.214
Neg logit contributions
hower-0.295
 Synd-0.277
 conduc-0.270
 Sting-0.269
 NIC-0.268
TokenL
Token ID43
Feature activation-5.75e-02

Pos logit contributions
umenthal+0.025
utra+0.024
redit+0.022
��+0.021
ounty+0.021
Neg logit contributions
hower-0.030
 Synd-0.028
 NIC-0.027
 Sting-0.027
 conduc-0.027
Tokenever
Token ID964
Feature activation-1.03e-01

Pos logit contributions
umenthal+0.070
utra+0.067
redit+0.060
ounty+0.056
ople+0.055
Neg logit contributions
hower-0.144
 Synd-0.142
 conduc-0.138
 NIC-0.138
 Sting-0.137
Token,
Token ID11
Feature activation-4.52e-02

Pos logit contributions
umenthal+0.206
utra+0.200
redit+0.194
ounty+0.184
ople+0.184
Neg logit contributions
hower-0.171
 Synd-0.158
 NIC-0.153
 conduc-0.151
 Huck-0.149
Token like
Token ID588
Feature activation+3.53e-02

Pos logit contributions
umenthal+0.094
utra+0.092
redit+0.089
��+0.085
ounty+0.085
Neg logit contributions
hower-0.072
 Synd-0.065
 conduc-0.063
 NIC-0.063
wal-0.062
Token Gibbs
Token ID41071
Feature activation-2.71e-01

Pos logit contributions
hower+0.056
 Synd+0.052
 NIC+0.051
 conduc+0.050
 Sting+0.050
Neg logit contributions
umenthal-0.074
utra-0.071
redit-0.068
��-0.066
ople-0.066
Token,
Token ID11
Feature activation-1.09e-01

Pos logit contributions
umenthal+0.590
utra+0.574
redit+0.552
��+0.533
ople+0.528
Neg logit contributions
hower-0.403
 Synd-0.376
 NIC-0.362
 Sting-0.357
 conduc-0.356
Token was
Token ID373
Feature activation+6.09e-02

Pos logit contributions
umenthal+0.216
utra+0.209
redit+0.203
ounty+0.192
��+0.192
Neg logit contributions
hower-0.185
 Synd-0.170
 NIC-0.164
 conduc-0.163
 Sting-0.161
Token keen
Token ID13795
Feature activation-1.29e-01

Pos logit contributions
hower+0.108
 Synd+0.097
 conduc+0.094
 NIC+0.093
wal+0.092
Neg logit contributions
umenthal-0.117
utra-0.116
redit-0.110
ople-0.106
��-0.105
Token to
Token ID284
Feature activation+6.12e-02

Pos logit contributions
umenthal+0.263
utra+0.251
redit+0.243
��+0.235
ounty+0.232
Neg logit contributions
hower-0.213
 Synd-0.201
 NIC-0.195
 Sting-0.194
 Huck-0.190
Token return
Token ID1441
Feature activation+6.53e-02

Pos logit contributions
hower+0.112
 Synd+0.102
 conduc+0.100
 Huck+0.098
 NIC+0.098
Neg logit contributions
umenthal-0.114
utra-0.109
redit-0.107
ounty-0.103
ople-0.101
Token home
Token ID1363
Feature activation-2.00e-01

Pos logit contributions
hower+0.121
 Synd+0.111
 NIC+0.108
 conduc+0.108
 Sting+0.106
Neg logit contributions
umenthal-0.121
utra-0.117
redit-0.112
��-0.107
ounty-0.107
Token up
Token ID510
Feature activation+1.24e-02

Pos logit contributions
umenthal+0.335
utra+0.322
redit+0.311
��+0.293
ounty+0.292
Neg logit contributions
hower-0.340
 Synd-0.308
 NIC-0.301
 conduc-0.299
 Sting-0.295
Token becoming
Token ID5033
Feature activation-1.21e-02

Pos logit contributions
hower+0.021
 Synd+0.019
 NIC+0.018
 Sting+0.018
 conduc+0.018
Neg logit contributions
umenthal-0.025
utra-0.024
redit-0.023
��-0.022
ounty-0.022
Token my
Token ID616
Feature activation+1.28e-01

Pos logit contributions
umenthal+0.025
utra+0.024
redit+0.023
��+0.022
ounty+0.022
Neg logit contributions
hower-0.020
 Synd-0.018
 NIC-0.018
 conduc-0.017
 Spac-0.017
Token focal
Token ID25397
Feature activation-1.29e-01

Pos logit contributions
hower+0.215
 Synd+0.195
 NIC+0.189
 conduc+0.186
 Sting+0.186
Neg logit contributions
umenthal-0.255
utra-0.247
redit-0.239
��-0.229
ople-0.227
Token point
Token ID966
Feature activation+2.00e-02

Pos logit contributions
umenthal+0.247
utra+0.237
redit+0.230
��+0.221
ounty+0.217
Neg logit contributions
hower-0.228
 Synd-0.214
 Sting-0.204
 NIC-0.203
 Huck-0.203
Token for
Token ID329
Feature activation-6.93e-02

Pos logit contributions
hower+0.032
 Synd+0.030
 NIC+0.029
 conduc+0.029
 Sting+0.029
Neg logit contributions
umenthal-0.041
utra-0.040
redit-0.038
��-0.036
ounty-0.036
Token his
Token ID465
Feature activation-6.03e-03

Pos logit contributions
umenthal+0.146
utra+0.140
redit+0.136
��+0.130
ounty+0.129
Neg logit contributions
hower-0.112
 Synd-0.100
 NIC-0.097
 conduc-0.096
 Spac-0.095
Token outlandish
Token ID47284
Feature activation+1.83e-01

Pos logit contributions
umenthal+0.012
utra+0.012
redit+0.011
��+0.011
ounty+0.011
Neg logit contributions
hower-0.010
 Synd-0.009
 NIC-0.009
 conduc-0.009
 Sting-0.009
Token offense
Token ID6907
Feature activation-2.92e-01

Pos logit contributions
hower+0.321
 Synd+0.293
 NIC+0.285
 conduc+0.282
 Sting+0.282
Neg logit contributions
umenthal-0.356
utra-0.339
redit-0.330
��-0.317
ounty-0.315
Token and
Token ID290
Feature activation+3.26e-02

Pos logit contributions
umenthal+0.614
utra+0.594
redit+0.571
��+0.550
ounty+0.547
Neg logit contributions
hower-0.455
 Synd-0.415
 NIC-0.408
 conduc-0.407
 Sting-0.402
Token hard
Token ID1327
Feature activation+3.05e-01

Pos logit contributions
hower+0.056
 Synd+0.050
 NIC+0.049
 Sting+0.048
 Spac+0.048
Neg logit contributions
umenthal-0.065
utra-0.063
redit-0.060
ounty-0.058
��-0.058
TokenWho
Token ID8241
Feature activation+4.42e-01

Pos logit contributions
hower+0.259
 Synd+0.238
 conduc+0.238
 Sting+0.234
 NIC+0.234
Neg logit contributions
umenthal-0.198
utra-0.194
redit-0.183
ounty-0.175
��-0.173
Token are
Token ID389
Feature activation+3.16e-02

Pos logit contributions
hower+0.740
 Synd+0.682
 NIC+0.663
 Sting+0.661
 Spac+0.656
Neg logit contributions
umenthal-0.896
utra-0.866
redit-0.841
��-0.798
ounty-0.796
Token the
Token ID262
Feature activation+1.23e-02

Pos logit contributions
hower+0.050
 Synd+0.044
 Sting+0.043
 NIC+0.043
 Spac+0.042
Neg logit contributions
umenthal-0.068
utra-0.066
redit-0.064
��-0.062
ounty-0.061
Token Anti
Token ID12297
Feature activation+3.89e-01

Pos logit contributions
hower+0.020
 Synd+0.017
 Spac+0.017
 NIC+0.017
 Sting+0.017
Neg logit contributions
umenthal-0.025
utra-0.025
redit-0.024
��-0.023
ople-0.023
Token-
Token ID12
Feature activation-1.81e-01

Pos logit contributions
hower+0.766
 conduc+0.697
 Synd+0.695
 NIC+0.688
 Sting+0.681
Neg logit contributions
umenthal-0.676
utra-0.650
redit-0.631
ople-0.599
ounty-0.597
TokenAuthor
Token ID13838
Feature activation-7.20e-01

Pos logit contributions
umenthal+0.363
utra+0.348
redit+0.339
��+0.323
ounty+0.323
Neg logit contributions
hower-0.308
 Synd-0.283
 NIC-0.277
 Sting-0.276
 Huck-0.272
Tokenitarian
Token ID8353
Feature activation-2.48e-02

Pos logit contributions
umenthal+1.656
utra+1.621
redit+1.587
ounty+1.526
ople+1.508
Neg logit contributions
hower-1.010
 conduc-0.934
 Synd-0.896
 Sting-0.890
 NIC-0.887
Token Movement
Token ID15477
Feature activation-7.53e-02

Pos logit contributions
umenthal+0.056
utra+0.054
redit+0.053
��+0.051
ople+0.050
Neg logit contributions
hower-0.036
 Synd-0.032
 NIC-0.031
 Sting-0.031
 conduc-0.031
Token (
Token ID357
Feature activation+1.96e-02

Pos logit contributions
umenthal+0.148
utra+0.143
redit+0.138
��+0.133
ople+0.132
Neg logit contributions
hower-0.130
 Synd-0.119
 NIC-0.115
 Sting-0.115
 Huck-0.114
TokenAK
Token ID10206
Feature activation+2.28e-01

Pos logit contributions
hower+0.043
 Synd+0.040
 NIC+0.039
 Sting+0.039
 Huck+0.039
Neg logit contributions
umenthal-0.030
utra-0.028
redit-0.027
��-0.026
ounty-0.025
Token)
Token ID8
Feature activation+4.79e-02

Pos logit contributions
hower+0.319
 Synd+0.298
 NIC+0.285
 Sting+0.283
 Spac+0.278
Neg logit contributions
umenthal-0.522
utra-0.502
redit-0.486
��-0.473
ounty-0.467
Token gran
Token ID19468
Feature activation-3.95e-01

Pos logit contributions
umenthal+0.097
utra+0.093
redit+0.089
��+0.086
ounty+0.085
Neg logit contributions
hower-0.096
 Synd-0.089
 Sting-0.087
 NIC-0.087
 Spac-0.086
Tokenularity
Token ID33737
Feature activation-2.91e-01

Pos logit contributions
umenthal+0.737
utra+0.712
redit+0.679
ounty+0.648
ople+0.643
Neg logit contributions
hower-0.731
 Synd-0.668
 Sting-0.655
 Huck-0.648
 NIC-0.645
Token
Token ID198
Feature activation-5.25e-02

Pos logit contributions
umenthal+0.621
utra+0.603
redit+0.578
��+0.560
ounty+0.556
Neg logit contributions
hower-0.450
 Synd-0.409
 Sting-0.397
 NIC-0.396
 conduc-0.393
Token
Token ID198
Feature activation+1.31e-03

Pos logit contributions
umenthal+0.103
utra+0.101
redit+0.096
ople+0.093
ounty+0.093
Neg logit contributions
hower-0.091
 conduc-0.088
 Synd-0.081
 Huck-0.079
 NIC-0.079
TokenCon
Token ID3103
Feature activation+2.21e-01

Pos logit contributions
hower+0.003
 Synd+0.003
 NIC+0.002
 Sting+0.002
 conduc+0.002
Neg logit contributions
umenthal-0.002
utra-0.002
redit-0.002
��-0.002
ounty-0.002
Tokencurrency
Token ID34415
Feature activation-7.28e-01

Pos logit contributions
hower+0.516
 Synd+0.494
 NIC+0.483
 Sting+0.480
 conduc+0.476
Neg logit contributions
umenthal-0.302
utra-0.285
redit-0.269
��-0.255
ounty-0.250
Token,
Token ID11
Feature activation-3.87e-02

Pos logit contributions
umenthal+1.473
utra+1.425
redit+1.366
��+1.312
ounty+1.310
Neg logit contributions
hower-1.211
 Synd-1.086
 Sting-1.061
 NIC-1.058
 conduc-1.052
Token Peters
Token ID15722
Feature activation-4.37e-01

Pos logit contributions
umenthal+0.077
utra+0.074
redit+0.071
��+0.069
ounty+0.068
Neg logit contributions
hower-0.067
 Synd-0.061
 NIC-0.059
 Sting-0.059
wal-0.058
Tokenons
Token ID684
Feature activation-2.82e-01

Pos logit contributions
umenthal+0.859
utra+0.830
redit+0.792
��+0.759
ople+0.754
Neg logit contributions
hower-0.748
 Synd-0.703
 NIC-0.679
 Sting-0.676
 Spac-0.671
Token solution
Token ID4610
Feature activation-5.26e-01

Pos logit contributions
umenthal+0.586
utra+0.570
redit+0.544
��+0.525
ounty+0.520
Neg logit contributions
hower-0.451
 Synd-0.420
 NIC-0.406
 Sting-0.405
 Spac-0.400
Token
Token ID198
Feature activation-8.82e-02

Pos logit contributions
umenthal+1.090
utra+1.056
redit+1.009
��+0.987
ounty+0.976
Neg logit contributions
hower-0.847
 Synd-0.772
 NIC-0.747
 Sting-0.745
 conduc-0.738
Token chain
Token ID6333
Feature activation-1.83e-01

Pos logit contributions
umenthal+0.188
utra+0.182
redit+0.173
��+0.169
ounty+0.167
Neg logit contributions
hower-0.152
 Synd-0.142
 Sting-0.137
 NIC-0.137
 Huck-0.135
Token.
Token ID13
Feature activation-3.66e-02

Pos logit contributions
umenthal+0.393
utra+0.383
redit+0.365
ounty+0.353
��+0.352
Neg logit contributions
hower-0.280
 Synd-0.259
 NIC-0.250
 Sting-0.250
 conduc-0.246
Token
Token ID198
Feature activation+3.42e-02

Pos logit contributions
umenthal+0.073
utra+0.071
redit+0.068
ounty+0.066
��+0.065
Neg logit contributions
hower-0.062
 Synd-0.056
 conduc-0.055
 NIC-0.054
 Sting-0.054
Token
Token ID198
Feature activation-1.04e-03

Pos logit contributions
hower+0.059
 conduc+0.057
 Synd+0.053
 Sting+0.051
 NIC+0.051
Neg logit contributions
umenthal-0.067
utra-0.066
redit-0.063
ople-0.061
ounty-0.061
TokenM
Token ID44
Feature activation-6.17e-04

Pos logit contributions
umenthal+0.002
utra+0.002
redit+0.002
��+0.002
ounty+0.002
Neg logit contributions
hower-0.002
 Synd-0.002
 conduc-0.002
 NIC-0.002
 Sting-0.002
Tokenounds
Token ID3733
Feature activation-9.93e-01

Pos logit contributions
umenthal+0.001
utra+0.001
redit+0.001
��+0.001
ounty+0.001
Neg logit contributions
hower-0.001
 Synd-0.001
 NIC-0.001
 Sting-0.001
 Spac-0.001
Token of
Token ID286
Feature activation-6.82e-02

Pos logit contributions
umenthal+1.888
utra+1.849
redit+1.757
ounty+1.676
��+1.670
Neg logit contributions
hower-1.735
 Synd-1.653
 NIC-1.580
 Spac-1.567
 conduc-1.565
Token tail
Token ID7894
Feature activation+4.95e-01

Pos logit contributions
umenthal+0.123
utra+0.118
redit+0.112
��+0.108
ounty+0.107
Neg logit contributions
hower-0.129
 Synd-0.121
 NIC-0.117
 Sting-0.116
 Spac-0.116
Tokenings
Token ID654
Feature activation+6.26e-02

Pos logit contributions
hower+0.941
 Synd+0.890
 NIC+0.867
 Sting+0.851
 Spac+0.847
Neg logit contributions
umenthal-0.884
utra-0.846
redit-0.811
ounty-0.775
��-0.774
Token from
Token ID422
Feature activation+1.44e-01

Pos logit contributions
hower+0.110
 Synd+0.106
 NIC+0.102
 Spac+0.101
 Sting+0.100
Neg logit contributions
umenthal-0.119
utra-0.116
redit-0.110
ounty-0.105
��-0.105
Token mineral
Token ID18352
Feature activation-2.95e-01

Pos logit contributions
hower+0.238
 Synd+0.222
 NIC+0.214
 Sting+0.210
 Spac+0.210
Neg logit contributions
umenthal-0.294
utra-0.284
redit-0.273
��-0.263
ounty-0.261
Token also
Token ID635
Feature activation+4.23e-02

Pos logit contributions
hower+0.694
 Synd+0.648
 NIC+0.625
 Sting+0.625
 conduc+0.616
Neg logit contributions
umenthal-0.827
utra-0.799
redit-0.764
��-0.737
ounty-0.732
Token that
Token ID326
Feature activation+1.55e-01

Pos logit contributions
hower+0.069
 Synd+0.065
 NIC+0.062
 Sting+0.062
wal+0.061
Neg logit contributions
umenthal-0.087
utra-0.084
redit-0.080
��-0.078
ounty-0.077
Token it
Token ID340
Feature activation+4.56e-02

Pos logit contributions
hower+0.258
 Synd+0.237
 Sting+0.227
 NIC+0.226
 conduc+0.225
Neg logit contributions
umenthal-0.314
utra-0.304
redit-0.290
ople-0.282
��-0.282
Token is
Token ID318
Feature activation+7.18e-02

Pos logit contributions
hower+0.074
 Synd+0.068
 Sting+0.065
 NIC+0.065
 conduc+0.064
Neg logit contributions
umenthal-0.095
utra-0.092
redit-0.089
ople-0.085
��-0.084
Token optim
Token ID6436
Feature activation-5.18e-01

Pos logit contributions
hower+0.125
 Synd+0.114
 Sting+0.109
wal+0.108
 NIC+0.107
Neg logit contributions
umenthal-0.142
utra-0.138
redit-0.132
��-0.127
ople-0.126
Tokenised
Token ID1417
Feature activation-7.41e-01

Pos logit contributions
umenthal+0.838
utra+0.795
redit+0.753
ople+0.714
��+0.709
Neg logit contributions
hower-1.090
 Synd-1.028
 Sting-0.993
 NIC-0.987
 conduc-0.987
Token for
Token ID329
Feature activation+2.78e-02

Pos logit contributions
umenthal+1.404
utra+1.363
redit+1.282
��+1.232
ople+1.231
Neg logit contributions
hower-1.323
 Synd-1.215
 Sting-1.173
 Huck-1.162
 conduc-1.161
Token programs
Token ID4056
Feature activation-2.29e-01

Pos logit contributions
hower+0.048
 Synd+0.044
 Sting+0.042
wal+0.042
 NIC+0.041
Neg logit contributions
umenthal-0.055
utra-0.053
redit-0.050
��-0.049
ople-0.048
Token where
Token ID810
Feature activation+5.83e-02

Pos logit contributions
umenthal+0.455
utra+0.437
redit+0.418
��+0.408
ounty+0.405
Neg logit contributions
hower-0.387
 Synd-0.359
 Sting-0.349
 NIC-0.345
wal-0.343
Token
Token ID564
Feature activation-2.56e-02

Pos logit contributions
hower+0.097
 Synd+0.088
 Sting+0.084
wal+0.084
 NIC+0.083
Neg logit contributions
umenthal-0.120
utra-0.115
redit-0.109
��-0.108
ople-0.107
Token
Token ID250
Feature activation+4.40e-02

Pos logit contributions
umenthal+0.042
utra+0.040
redit+0.039
��+0.039
ounty+0.037
Neg logit contributions
hower-0.052
 Synd-0.049
 NIC-0.048
 Sting-0.048
 conduc-0.047
Token
Token ID198
Feature activation-2.25e-01

Pos logit contributions
umenthal+0.428
utra+0.422
redit+0.401
ople+0.390
ounty+0.387
Neg logit contributions
hower-0.389
 conduc-0.380
 Synd-0.349
 Huck-0.339
 Sting-0.338
TokenPart
Token ID7841
Feature activation-5.44e-01

Pos logit contributions
umenthal+0.373
utra+0.360
redit+0.341
��+0.324
ounty+0.323
Neg logit contributions
hower-0.464
 Synd-0.438
 NIC-0.425
 conduc-0.425
 Sting-0.425
Token 4
Token ID604
Feature activation-1.70e-01

Pos logit contributions
umenthal+1.029
utra+0.987
redit+0.946
��+0.916
ople+0.903
Neg logit contributions
hower-0.975
 Synd-0.920
 NIC-0.892
 Sting-0.887
 Spac-0.875
Token.
Token ID13
Feature activation+5.42e-02

Pos logit contributions
umenthal+0.397
utra+0.383
redit+0.371
��+0.361
ounty+0.357
Neg logit contributions
hower-0.233
 Synd-0.215
 NIC-0.206
 Sting-0.205
 Spac-0.201
Token S
Token ID311
Feature activation-1.48e-01

Pos logit contributions
hower+0.096
 Synd+0.090
 NIC+0.087
 conduc+0.087
 Sting+0.086
Neg logit contributions
umenthal-0.104
utra-0.100
redit-0.096
��-0.092
ople-0.092
Tokenucker
Token ID12603
Feature activation-8.49e-01

Pos logit contributions
umenthal+0.219
utra+0.213
redit+0.198
ounty+0.187
ople+0.186
Neg logit contributions
hower-0.329
 Synd-0.312
 NIC-0.305
 Sting-0.302
 conduc-0.300
Token Waves
Token ID42733
Feature activation-6.08e-02

Pos logit contributions
umenthal+1.636
utra+1.593
redit+1.521
ople+1.474
��+1.459
Neg logit contributions
hower-1.458
 Synd-1.365
 NIC-1.335
 conduc-1.334
 Sting-1.291
Token
Token ID198
Feature activation-2.88e-01

Pos logit contributions
umenthal+0.128
utra+0.124
redit+0.119
��+0.115
ople+0.115
Neg logit contributions
hower-0.095
 Synd-0.089
 NIC-0.086
 Sting-0.085
 conduc-0.085
Token
Token ID198
Feature activation-2.48e-01

Pos logit contributions
umenthal+0.549
utra+0.541
redit+0.513
ople+0.501
ounty+0.498
Neg logit contributions
hower-0.512
 conduc-0.505
 Synd-0.459
 Huck-0.446
 Sting-0.444
TokenThe
Token ID464
Feature activation-1.53e-01

Pos logit contributions
umenthal+0.407
utra+0.397
redit+0.374
ounty+0.359
ople+0.355
Neg logit contributions
hower-0.513
 Synd-0.481
 conduc-0.473
 NIC-0.469
 Sting-0.465
Token normal
Token ID3487
Feature activation-3.28e-01

Pos logit contributions
umenthal+0.298
utra+0.292
redit+0.276
��+0.266
ounty+0.265
Neg logit contributions
hower-0.265
 Synd-0.245
 NIC-0.236
 conduc-0.234
wal-0.233