TOP ACTIVATIONS
MAX = 1.557

. Story continues below advertisement Story continues
. Story continues below advertisement He eventually
. Story continues below advertisement " Interestingly
handling of Mam ad ou Sak ho drugs test Read more
heavy fighting with al - Sh ab ab on Tuesday .
And ro Pan Advertisement As you might
classic . A ub urn did not field a
. Story continues below advertisement He participated
. Story continues below advertisement It 's
so against their bourgeois political guarant or ( the PC F
is little agreement on how best to proceed .
them several years to read just prices to cover all the
zip file . file . A S 3 bucket
been laid by al - Sh ab ab , which is
3 D Gallery Advertisement Honestly , this
your putting routine when up close to the basket . This
Pixel pipe Advertisement If you 're
Ret ro Camera Advertisement If you 've
ible recruitment by al - Sh ab ab since 2010 .
. Story continues below advertisement Story continues

MIN ACTIVATIONS
MIN = -1.427

Andre Gre ipel ( L otto - Bel isol Team )
. Best of luck to both parties .
density Grid , const Vector < Part icle > & particles
the stabbing accuracy of Er rol Sp ence Jr
7 ff 3 c 3 ff 0 d 247 409 bd
] :: From Ar gb ($ FC 1 [ 0 ],
was NASA s seventh consecutive lunar mission failure .
today , C NET has learned . Reg ulators
$ FS 1 ,[ System . Draw ing . Font Style
numbers are growing through word of mouth and social media ,
. Even Darrell Duff ie , the sole economist
disp ersion as such ! Ont ology returned to the Meg
V IO LET , La .-- A W G NO viewer
and then manually split them into MP 3 s pse
16 d 7 ff 3 c 3 ff 0 d 247
3 c 3 ff 0 d 247 409 bd 28 f
Get The Times of Israel 's Daily Edition by
. Its normal range is [- 1 , + 1 ],
best and the best of luck in the coming days .
since February 2013 , was forced out by President Barack Obama

INTERVAL 0.811 - 1.557
CONTAINS 0.207%

st aring at air / being buried alive ."[ 155 ]
with not epad to your undead heart 's content . This
Yad av . A case was earlier filed against Vin od
that there s only room for one on a
All entries are cleared by pressing the Clear button . If

INTERVAL 0.065 - 0.811
CONTAINS 39.200%

s fall into the sea and glaciers retreat further . (
dou jin video game maker Jun ya O ta or Z
if it includes the derogatory information , how is the Court
considers them as a federal region . It has
I should be most grateful if readers would be

INTERVAL -0.681 - 0.065
CONTAINS 59.859%

for two reasons . Instead of gloss ing over these details
a sleek four wheel er powered by a 8 80 hp
and innovative solutions to the Department of Defense s
is also my mistake and our mistake and we all feel
, if they say anything at all . And HP suggests

INTERVAL -1.427 - -0.681
CONTAINS 0.735%

the basket . So get out there , and
, the old Pages made it possible to link text boxes
perhaps they shouldn 't celebrate too quickly . And don 't
1 st , in a surprise visit to B agram Air
methyl - 7 -( 2 , 6 - dim eth oxy
Token.
Token ID13
Feature activation+1.83e-01

Pos logit contributions
 Interpret+2.200
utton+2.178
uni+2.158
 taunt+2.126
 injure+2.124
Neg logit contributions
sein-1.877
 Stamp-1.859
idium-1.826
Disclaimer-1.807
agate-1.744
Token
Token ID198
Feature activation+1.19e-01

Pos logit contributions
 Interpret+0.646
utton+0.643
uni+0.636
 injure+0.626
 taunt+0.622
Neg logit contributions
 Stamp-0.671
sein-0.663
Disclaimer-0.646
idium-0.641
-0.618
Token
Token ID198
Feature activation+6.61e-02

Pos logit contributions
 Interpret+0.364
utton+0.356
uni+0.353
uti+0.343
 taunt+0.340
Neg logit contributions
 Stamp-0.494
sein-0.493
idium-0.486
Disclaimer-0.479
-0.466
TokenStory
Token ID11605
Feature activation+7.40e-01

Pos logit contributions
 Interpret+0.273
utton+0.268
uni+0.265
 injure+0.265
 taunt+0.263
Neg logit contributions
sein-0.204
 Stamp-0.202
Disclaimer-0.201
idium-0.197
-0.189
Token continues
Token ID4477
Feature activation+6.09e-01

Pos logit contributions
 Interpret+1.830
utton+1.823
uni+1.710
uti+1.693
 taunt+1.688
Neg logit contributions
 Stamp-3.469
idium-3.464
sein-3.408
-3.382
Disclaimer-3.371
Token below
Token ID2174
Feature activation+1.56e+00

Pos logit contributions
 Interpret+1.232
utton+1.203
uni+1.167
 taunt+1.146
 injure+1.136
Neg logit contributions
 Stamp-3.141
sein-3.136
idium-3.091
Disclaimer-3.060
-3.025
Token advertisement
Token ID15422
Feature activation+4.63e-01

Pos logit contributions
utton+4.456
 Interpret+4.438
 taunt+4.411
uni+4.135
 injure+4.102
Neg logit contributions
sein-6.451
 Stamp-6.420
idium-6.416
-6.306
Disclaimer-6.181
Token
Token ID198
Feature activation-5.97e-02

Pos logit contributions
 Interpret+1.342
utton+1.305
uni+1.287
uti+1.247
 taunt+1.229
Neg logit contributions
 Stamp-1.977
idium-1.970
sein-1.970
Disclaimer-1.913
agate-1.880
Token
Token ID198
Feature activation+2.08e-01

Pos logit contributions
 Stamp+0.239
sein+0.238
idium+0.233
Disclaimer+0.232
agate+0.224
Neg logit contributions
 Interpret-0.191
utton-0.188
uni-0.187
uti-0.182
 injure-0.181
TokenStory
Token ID11605
Feature activation+7.44e-01

Pos logit contributions
 Interpret+1.072
utton+1.059
uni+1.052
 injure+1.041
 taunt+1.037
Neg logit contributions
sein-0.431
 Stamp-0.429
Disclaimer-0.415
idium-0.411
-0.381
Token continues
Token ID4477
Feature activation+4.39e-01

Pos logit contributions
utton+1.837
 Interpret+1.827
uni+1.717
 taunt+1.713
uti+1.712
Neg logit contributions
 Stamp-3.472
idium-3.465
sein-3.431
Disclaimer-3.401
-3.398
Token.
Token ID13
Feature activation+8.98e-02

Pos logit contributions
 Stamp+0.484
sein+0.471
idium+0.454
Disclaimer+0.452
 spinning+0.437
Neg logit contributions
 Interpret-0.672
utton-0.667
uni-0.662
uti-0.646
 injure-0.645
Token
Token ID198
Feature activation-5.15e-02

Pos logit contributions
utton+0.319
 Interpret+0.319
uni+0.316
 injure+0.311
 taunt+0.309
Neg logit contributions
 Stamp-0.327
sein-0.323
Disclaimer-0.315
idium-0.311
-0.302
Token
Token ID198
Feature activation-8.37e-02

Pos logit contributions
 Stamp+0.208
sein+0.208
idium+0.204
Disclaimer+0.203
agate+0.196
Neg logit contributions
 Interpret-0.163
utton-0.160
uni-0.159
uti-0.154
 taunt-0.153
TokenStory
Token ID11605
Feature activation+7.93e-01

Pos logit contributions
sein+0.254
 Stamp+0.251
Disclaimer+0.250
idium+0.244
+0.236
Neg logit contributions
 Interpret-0.350
utton-0.345
uni-0.341
 injure-0.340
 taunt-0.338
Token continues
Token ID4477
Feature activation+3.81e-01

Pos logit contributions
utton+1.925
 Interpret+1.916
uni+1.804
uti+1.802
 taunt+1.793
Neg logit contributions
idium-3.725
 Stamp-3.720
sein-3.665
Disclaimer-3.631
-3.628
Token below
Token ID2174
Feature activation+1.50e+00

Pos logit contributions
 Interpret+0.816
utton+0.800
uni+0.782
 taunt+0.758
 injure+0.758
Neg logit contributions
 Stamp-1.926
sein-1.922
idium-1.893
Disclaimer-1.880
-1.843
Token advertisement
Token ID15422
Feature activation+5.15e-01

Pos logit contributions
 Interpret+4.331
utton+4.325
 taunt+4.290
uni+4.060
 injure+4.034
Neg logit contributions
sein-6.140
 Stamp-6.125
idium-6.121
-5.994
Disclaimer-5.919
Token
Token ID198
Feature activation+1.98e-02

Pos logit contributions
 Interpret+1.478
utton+1.440
uni+1.421
uti+1.374
 taunt+1.352
Neg logit contributions
 Stamp-2.210
idium-2.198
sein-2.196
Disclaimer-2.136
-2.098
Token
Token ID198
Feature activation+2.99e-01

Pos logit contributions
 Interpret+0.062
utton+0.061
uni+0.060
uti+0.059
 taunt+0.058
Neg logit contributions
 Stamp-0.080
sein-0.080
idium-0.079
Disclaimer-0.078
agate-0.076
TokenHe
Token ID1544
Feature activation+9.57e-02

Pos logit contributions
 Interpret+1.153
utton+1.140
uni+1.131
uti+1.107
 injure+1.102
Neg logit contributions
 Stamp-1.001
sein-0.999
idium-0.973
Disclaimer-0.967
agate-0.927
Token eventually
Token ID4191
Feature activation+3.62e-02

Pos logit contributions
 Interpret+0.393
utton+0.388
uni+0.386
uti+0.378
 injure+0.378
Neg logit contributions
sein-0.296
 Stamp-0.296
idium-0.287
Disclaimer-0.286
-0.272
Token.
Token ID13
Feature activation+8.90e-02

Pos logit contributions
 Stamp+0.590
sein+0.583
Disclaimer+0.562
idium+0.559
agate+0.536
Neg logit contributions
 Interpret-0.829
utton-0.823
uni-0.816
 injure-0.805
 taunt-0.803
Token
Token ID198
Feature activation+1.22e-01

Pos logit contributions
 Interpret+0.306
utton+0.305
uni+0.303
 injure+0.298
 taunt+0.295
Neg logit contributions
 Stamp-0.334
sein-0.332
Disclaimer-0.323
idium-0.321
-0.310
Token
Token ID198
Feature activation+6.03e-02

Pos logit contributions
 Interpret+0.372
utton+0.364
uni+0.361
uti+0.350
 taunt+0.347
Neg logit contributions
 Stamp-0.504
sein-0.503
idium-0.497
Disclaimer-0.488
agate-0.476
TokenStory
Token ID11605
Feature activation+6.30e-01

Pos logit contributions
 Interpret+0.253
utton+0.249
uni+0.248
 injure+0.244
 taunt+0.243
Neg logit contributions
sein-0.182
 Stamp-0.181
Disclaimer-0.177
idium-0.176
-0.167
Token continues
Token ID4477
Feature activation+4.16e-01

Pos logit contributions
 Interpret+1.617
utton+1.609
uni+1.532
 taunt+1.513
uti+1.512
Neg logit contributions
 Stamp-2.900
idium-2.887
sein-2.865
Disclaimer-2.828
-2.820
Token below
Token ID2174
Feature activation+1.44e+00

Pos logit contributions
 Interpret+0.905
utton+0.884
uni+0.863
 taunt+0.840
 injure+0.838
Neg logit contributions
sein-2.089
 Stamp-2.089
idium-2.056
Disclaimer-2.043
-2.000
Token advertisement
Token ID15422
Feature activation+3.56e-01

Pos logit contributions
utton+4.234
 Interpret+4.213
 taunt+4.196
 injure+3.910
uni+3.891
Neg logit contributions
idium-5.923
sein-5.917
 Stamp-5.902
-5.760
Disclaimer-5.675
Token
Token ID198
Feature activation+1.20e-01

Pos logit contributions
 Interpret+1.054
utton+1.028
uni+1.017
uti+0.987
 taunt+0.975
Neg logit contributions
 Stamp-1.503
sein-1.497
idium-1.490
Disclaimer-1.455
agate-1.423
Token
Token ID198
Feature activation+3.22e-01

Pos logit contributions
 Interpret+0.370
utton+0.362
uni+0.358
uti+0.348
 taunt+0.345
Neg logit contributions
 Stamp-0.497
sein-0.497
idium-0.490
Disclaimer-0.482
agate-0.470
Token"
Token ID1
Feature activation+5.14e-02

Pos logit contributions
 Interpret+1.262
utton+1.251
uni+1.241
uti+1.216
 taunt+1.205
Neg logit contributions
 Stamp-1.054
sein-1.048
idium-1.022
Disclaimer-1.009
agate-0.970
TokenInterestingly
Token ID33092
Feature activation-2.04e-01

Pos logit contributions
 Interpret+0.197
utton+0.195
uni+0.193
 injure+0.190
uti+0.189
Neg logit contributions
 Stamp-0.172
sein-0.172
Disclaimer-0.168
idium-0.167
-0.160
Token handling
Token ID9041
Feature activation+4.86e-01

Pos logit contributions
 Interpret+1.878
utton+1.847
uni+1.827
 injure+1.789
 taunt+1.789
Neg logit contributions
sein-1.987
 Stamp-1.979
idium-1.937
Disclaimer-1.931
agate-1.855
Token of
Token ID286
Feature activation+3.69e-02

Pos logit contributions
 Interpret+1.619
utton+1.606
uni+1.598
uti+1.552
 injure+1.542
Neg logit contributions
 Stamp-1.881
sein-1.865
idium-1.824
Disclaimer-1.822
-1.753
Token Mam
Token ID29926
Feature activation+5.12e-01

Pos logit contributions
 Interpret+0.127
utton+0.127
uni+0.126
uti+0.122
 injure+0.120
Neg logit contributions
 Stamp-0.140
sein-0.137
Disclaimer-0.134
idium-0.133
 spinning-0.128
Tokenad
Token ID324
Feature activation-2.96e-02

Pos logit contributions
 Interpret+1.265
utton+1.238
uni+1.235
uti+1.187
 injure+1.171
Neg logit contributions
sein-2.426
 Stamp-2.418
Disclaimer-2.376
idium-2.362
-2.296
Tokenou
Token ID280
Feature activation+6.08e-01

Pos logit contributions
sein+0.089
 Stamp+0.089
idium+0.086
Disclaimer+0.085
agate+0.082
Neg logit contributions
 Interpret-0.124
utton-0.123
uni-0.121
 taunt-0.119
 injure-0.119
Token Sak
Token ID13231
Feature activation+1.43e+00

Pos logit contributions
utton+1.834
 Interpret+1.820
 injure+1.796
uti+1.791
uni+1.759
Neg logit contributions
sein-2.508
Disclaimer-2.489
idium-2.467
 Stamp-2.445
agate-2.417
Tokenho
Token ID8873
Feature activation+3.21e-01

Pos logit contributions
uni+2.127
utton+2.094
 Interpret+2.051
uti+1.976
iesta+1.871
Neg logit contributions
sein-8.098
idium-8.075
 Stamp-8.073
Disclaimer-7.940
-7.812
Token drugs
Token ID5010
Feature activation+3.28e-01

Pos logit contributions
 Interpret+1.211
utton+1.199
uni+1.190
uti+1.160
 injure+1.154
Neg logit contributions
 Stamp-1.103
sein-1.096
Disclaimer-1.067
idium-1.067
-1.025
Token test
Token ID1332
Feature activation+6.47e-01

Pos logit contributions
 Interpret+1.163
utton+1.149
uni+1.146
uti+1.115
 injure+1.107
Neg logit contributions
 Stamp-1.201
sein-1.192
idium-1.162
Disclaimer-1.161
agate-1.117
Token Read
Token ID4149
Feature activation+6.59e-01

Pos logit contributions
 Interpret+2.199
utton+2.166
uni+2.142
uti+2.094
 injure+2.091
Neg logit contributions
sein-2.457
 Stamp-2.452
idium-2.396
Disclaimer-2.390
agate-2.296
Token more
Token ID517
Feature activation+3.43e-01

Pos logit contributions
 Interpret+2.221
utton+2.158
uni+2.156
 taunt+2.113
iesta+2.111
Neg logit contributions
sein-2.451
 Stamp-2.447
idium-2.441
Disclaimer-2.390
-2.378
Token heavy
Token ID4334
Feature activation-3.82e-01

Pos logit contributions
 Stamp+0.569
sein+0.563
idium+0.546
Disclaimer+0.544
 spinning+0.524
Neg logit contributions
 Interpret-0.679
utton-0.667
uni-0.662
uti-0.646
 injure-0.644
Token fighting
Token ID4330
Feature activation-1.77e-01

Pos logit contributions
 Stamp+1.408
sein+1.377
idium+1.350
Disclaimer+1.345
 spinning+1.329
Neg logit contributions
 Interpret-1.360
utton-1.323
uni-1.318
uti-1.284
 injure-1.267
Token with
Token ID351
Feature activation-3.33e-01

Pos logit contributions
 Stamp+0.562
sein+0.554
Disclaimer+0.541
idium+0.537
 spinning+0.518
Neg logit contributions
 Interpret-0.716
utton-0.709
uni-0.700
uti-0.685
 injure-0.678
Token al
Token ID435
Feature activation+1.63e-01

Pos logit contributions
 Stamp+1.033
sein+1.006
Disclaimer+0.977
idium+0.971
 spinning+0.946
Neg logit contributions
 Interpret-1.374
utton-1.356
uni-1.346
uti-1.322
 injure-1.319
Token-
Token ID12
Feature activation+7.10e-04

Pos logit contributions
 Interpret+0.669
utton+0.662
uni+0.657
uti+0.643
 injure+0.641
Neg logit contributions
sein-0.503
 Stamp-0.502
idium-0.489
Disclaimer-0.486
agate-0.464
TokenSh
Token ID2484
Feature activation+1.43e+00

Pos logit contributions
 Interpret+0.003
utton+0.003
uni+0.003
uti+0.003
 injure+0.003
Neg logit contributions
 Stamp-0.002
sein-0.002
Disclaimer-0.002
idium-0.002
-0.002
Tokenab
Token ID397
Feature activation+4.11e-01

Pos logit contributions
utton+4.168
uni+4.032
 Interpret+3.983
uti+3.869
ators+3.727
Neg logit contributions
 Stamp-6.133
sein-5.998
Disclaimer-5.930
-5.833
idium-5.800
Tokenab
Token ID397
Feature activation+3.71e-01

Pos logit contributions
 Interpret+1.383
utton+1.365
uni+1.359
uti+1.318
 taunt+1.307
Neg logit contributions
 Stamp-1.581
sein-1.565
Disclaimer-1.532
idium-1.528
-1.481
Token on
Token ID319
Feature activation-2.13e-01

Pos logit contributions
 Interpret+1.396
utton+1.379
uni+1.367
uti+1.338
 injure+1.335
Neg logit contributions
 Stamp-1.272
sein-1.271
idium-1.237
Disclaimer-1.234
agate-1.181
Token Tuesday
Token ID3431
Feature activation+1.03e-01

Pos logit contributions
 Stamp+0.735
sein+0.718
Disclaimer+0.698
idium+0.698
 spinning+0.669
Neg logit contributions
 Interpret-0.809
utton-0.802
uni-0.793
uti-0.779
 injure-0.775
Token.
Token ID13
Feature activation-4.81e-02

Pos logit contributions
 Interpret+0.426
utton+0.419
uni+0.417
uti+0.409
 injure+0.407
Neg logit contributions
 Stamp-0.319
sein-0.317
idium-0.308
Disclaimer-0.306
agate-0.292
TokenAnd
Token ID1870
Feature activation+2.48e-01

Pos logit contributions
 Interpret+0.367
utton+0.363
uni+0.360
uti+0.353
 injure+0.352
Neg logit contributions
 Stamp-0.269
sein-0.269
idium-0.261
Disclaimer-0.260
agate-0.247
Tokenro
Token ID305
Feature activation-1.24e-01

Pos logit contributions
 Interpret+0.901
utton+0.889
uni+0.882
uti+0.862
 injure+0.860
Neg logit contributions
 Stamp-0.885
sein-0.884
idium-0.861
Disclaimer-0.859
agate-0.823
TokenPan
Token ID15730
Feature activation+1.62e-01

Pos logit contributions
 Stamp+0.404
sein+0.402
idium+0.392
Disclaimer+0.391
agate+0.372
Neg logit contributions
 Interpret-0.490
utton-0.484
uni-0.480
 injure-0.472
uti-0.471
Token
Token ID198
Feature activation+1.70e-01

Pos logit contributions
 Interpret+0.572
utton+0.563
uni+0.559
 injure+0.549
uti+0.547
Neg logit contributions
 Stamp-0.597
sein-0.594
idium-0.581
Disclaimer-0.579
agate-0.558
Token
Token ID198
Feature activation+3.58e-01

Pos logit contributions
 Interpret+0.519
utton+0.507
uni+0.504
uti+0.488
 taunt+0.484
Neg logit contributions
 Stamp-0.705
sein-0.705
idium-0.695
Disclaimer-0.682
agate-0.665
TokenAdvertisement
Token ID4723
Feature activation+1.43e+00

Pos logit contributions
 Interpret+1.455
utton+1.446
uni+1.439
uti+1.404
 taunt+1.392
Neg logit contributions
 Stamp-1.117
sein-1.108
idium-1.078
Disclaimer-1.056
agate-1.020
Token
Token ID198
Feature activation-1.42e-01

Pos logit contributions
 Interpret+3.413
uni+3.218
utton+3.184
 =================================================================+3.095
uti+2.962
Neg logit contributions
idium-6.713
���-6.633
sein-6.572
 Stamp-6.546
-6.373
Token
Token ID198
Feature activation-1.16e-01

Pos logit contributions
 Stamp+0.571
sein+0.571
idium+0.559
Disclaimer+0.556
agate+0.537
Neg logit contributions
 Interpret-0.454
utton-0.446
uni-0.442
uti-0.431
 injure-0.428
TokenAs
Token ID1722
Feature activation-1.14e-01

Pos logit contributions
sein+0.380
 Stamp+0.380
Disclaimer+0.373
idium+0.369
+0.353
Neg logit contributions
 Interpret-0.456
utton-0.451
uni-0.446
 injure-0.440
uti-0.438
Token you
Token ID345
Feature activation-9.53e-02

Pos logit contributions
 Stamp+0.361
sein+0.359
idium+0.348
Disclaimer+0.348
agate+0.331
Neg logit contributions
 Interpret-0.464
utton-0.458
uni-0.454
uti-0.446
 injure-0.446
Token might
Token ID1244
Feature activation-7.11e-02

Pos logit contributions
 Stamp+0.336
sein+0.335
idium+0.326
Disclaimer+0.325
agate+0.312
Neg logit contributions
 Interpret-0.350
utton-0.346
uni-0.343
uti-0.336
 injure-0.334
Token classic
Token ID6833
Feature activation+9.80e-02

Pos logit contributions
 Interpret+0.463
utton+0.456
uni+0.453
uti+0.443
 injure+0.442
Neg logit contributions
 Stamp-0.411
sein-0.410
idium-0.400
Disclaimer-0.398
-0.381
Token.
Token ID13
Feature activation-7.87e-03

Pos logit contributions
 Interpret+0.356
utton+0.351
uni+0.349
uti+0.341
 injure+0.339
Neg logit contributions
 Stamp-0.350
sein-0.349
idium-0.341
Disclaimer-0.339
-0.326
Token
Token ID198
Feature activation-2.37e-01

Pos logit contributions
 Stamp+0.028
sein+0.027
Disclaimer+0.027
idium+0.026
+0.025
Neg logit contributions
 Interpret-0.029
utton-0.029
uni-0.029
 injure-0.028
uti-0.028
Token
Token ID198
Feature activation-2.12e-01

Pos logit contributions
 Stamp+0.922
sein+0.921
Disclaimer+0.901
idium+0.895
+0.863
Neg logit contributions
 Interpret-0.779
utton-0.770
uni-0.762
 injure-0.748
uti-0.745
TokenA
Token ID32
Feature activation+8.09e-02

Pos logit contributions
sein+0.706
 Stamp+0.703
Disclaimer+0.700
idium+0.680
+0.666
Neg logit contributions
 Interpret-0.825
utton-0.807
uni-0.799
 injure-0.797
 taunt-0.791
Tokenub
Token ID549
Feature activation+1.41e+00

Pos logit contributions
 Interpret+0.262
utton+0.255
uni+0.253
uti+0.247
 injure+0.247
Neg logit contributions
 Stamp-0.321
sein-0.320
idium-0.314
Disclaimer-0.311
-0.301
Tokenurn
Token ID700
Feature activation+8.60e-01

Pos logit contributions
utton+2.052
uni+1.988
 Interpret+1.964
uti+1.761
iesta+1.655
Neg logit contributions
sein-7.960
 Stamp-7.911
Disclaimer-7.864
idium-7.836
-7.650
Token did
Token ID750
Feature activation+3.06e-01

Pos logit contributions
 Interpret+3.322
utton+3.290
uni+3.260
 injure+3.214
uti+3.202
Neg logit contributions
sein-2.844
 Stamp-2.819
Disclaimer-2.761
idium-2.739
agate-2.643
Token not
Token ID407
Feature activation+3.33e-01

Pos logit contributions
 Interpret+1.121
utton+1.100
uni+1.094
uti+1.074
 taunt+1.059
Neg logit contributions
 Stamp-1.088
sein-1.083
idium-1.055
Disclaimer-1.049
-1.019
Token field
Token ID2214
Feature activation+9.07e-03

Pos logit contributions
 Interpret+1.070
utton+1.057
uni+1.047
uti+1.022
 injure+1.007
Neg logit contributions
 Stamp-1.327
sein-1.321
idium-1.291
Disclaimer-1.285
agate-1.240
Token a
Token ID257
Feature activation-3.20e-02

Pos logit contributions
 Interpret+0.037
utton+0.037
uni+0.036
uti+0.036
 injure+0.035
Neg logit contributions
 Stamp-0.029
sein-0.028
idium-0.027
Disclaimer-0.027
-0.026
Token.
Token ID13
Feature activation-1.21e-01

Pos logit contributions
 Stamp+0.528
sein+0.520
idium+0.507
Disclaimer+0.503
+0.474
Neg logit contributions
 Interpret-0.919
utton-0.913
uni-0.904
 injure-0.886
uti-0.885
Token
Token ID198
Feature activation+1.60e-01

Pos logit contributions
sein+0.389
 Stamp+0.387
idium+0.374
Disclaimer+0.372
+0.355
Neg logit contributions
 Interpret-0.483
utton-0.482
uni-0.477
 injure-0.469
 taunt-0.467
Token
Token ID198
Feature activation+6.42e-02

Pos logit contributions
 Interpret+0.487
utton+0.476
uni+0.472
uti+0.458
 taunt+0.453
Neg logit contributions
 Stamp-0.666
sein-0.664
idium-0.656
Disclaimer-0.645
-0.629
TokenStory
Token ID11605
Feature activation+6.79e-01

Pos logit contributions
 Interpret+0.261
utton+0.257
uni+0.255
 injure+0.252
 taunt+0.251
Neg logit contributions
sein-0.202
 Stamp-0.201
Disclaimer-0.198
idium-0.195
-0.187
Token continues
Token ID4477
Feature activation+3.02e-01

Pos logit contributions
utton+1.672
 Interpret+1.670
uni+1.572
uti+1.569
 taunt+1.564
Neg logit contributions
 Stamp-3.165
idium-3.163
sein-3.126
Disclaimer-3.087
-3.084
Token below
Token ID2174
Feature activation+1.41e+00

Pos logit contributions
 Interpret+0.656
utton+0.643
uni+0.630
 injure+0.609
 taunt+0.609
Neg logit contributions
 Stamp-1.514
sein-1.511
idium-1.488
Disclaimer-1.478
-1.447
Token advertisement
Token ID15422
Feature activation+4.65e-01

Pos logit contributions
 Interpret+4.177
utton+4.161
 taunt+4.122
uni+3.949
 injure+3.910
Neg logit contributions
sein-5.708
 Stamp-5.696
idium-5.682
-5.556
Disclaimer-5.512
Token
Token ID198
Feature activation-1.08e-01

Pos logit contributions
 Interpret+1.350
utton+1.318
uni+1.301
uti+1.259
 taunt+1.240
Neg logit contributions
 Stamp-1.987
sein-1.974
idium-1.972
Disclaimer-1.920
-1.883
Token
Token ID198
Feature activation+7.19e-02

Pos logit contributions
 Stamp+0.429
sein+0.429
idium+0.419
Disclaimer+0.418
agate+0.403
Neg logit contributions
 Interpret-0.347
utton-0.342
uni-0.338
uti-0.330
 injure-0.329
TokenHe
Token ID1544
Feature activation-5.78e-02

Pos logit contributions
 Interpret+0.283
utton+0.279
uni+0.276
 injure+0.273
 taunt+0.272
Neg logit contributions
sein-0.235
 Stamp-0.234
Disclaimer-0.231
idium-0.227
-0.219
Token participated
Token ID14888
Feature activation-3.02e-01

Pos logit contributions
 Stamp+0.179
sein+0.177
idium+0.172
Disclaimer+0.171
agate+0.164
Neg logit contributions
 Interpret-0.238
utton-0.235
uni-0.233
uti-0.228
 injure-0.227
Token.
Token ID13
Feature activation+1.82e-01

Pos logit contributions
 Stamp+0.621
sein+0.614
Disclaimer+0.596
idium+0.593
agate+0.563
Neg logit contributions
 Interpret-0.887
utton-0.881
uni-0.876
 injure-0.859
uti-0.858
Token
Token ID198
Feature activation+1.23e-01

Pos logit contributions
 Interpret+0.678
utton+0.673
uni+0.667
uti+0.652
 injure+0.652
Neg logit contributions
 Stamp-0.630
sein-0.627
Disclaimer-0.610
idium-0.609
agate-0.582
Token
Token ID198
Feature activation+5.06e-02

Pos logit contributions
 Interpret+0.374
utton+0.366
uni+0.363
uti+0.352
 taunt+0.349
Neg logit contributions
 Stamp-0.510
sein-0.508
idium-0.502
Disclaimer-0.494
-0.481
TokenStory
Token ID11605
Feature activation+8.44e-01

Pos logit contributions
 Interpret+0.212
utton+0.209
uni+0.207
 injure+0.205
 taunt+0.204
Neg logit contributions
sein-0.152
 Stamp-0.152
Disclaimer-0.149
idium-0.147
-0.140
Token continues
Token ID4477
Feature activation+3.03e-01

Pos logit contributions
utton+2.036
 Interpret+2.031
uni+1.899
 taunt+1.898
uti+1.895
Neg logit contributions
idium-3.991
 Stamp-3.976
-3.919
sein-3.914
Disclaimer-3.879
Token below
Token ID2174
Feature activation+1.41e+00

Pos logit contributions
 Interpret+0.674
utton+0.661
uni+0.647
 injure+0.626
uti+0.626
Neg logit contributions
 Stamp-1.509
sein-1.508
idium-1.483
Disclaimer-1.476
-1.439
Token advertisement
Token ID15422
Feature activation+4.50e-01

Pos logit contributions
 Interpret+4.156
utton+4.151
 taunt+4.109
uni+3.865
 injure+3.858
Neg logit contributions
idium-5.767
sein-5.753
 Stamp-5.738
-5.611
Disclaimer-5.518
Token
Token ID198
Feature activation+1.89e-01

Pos logit contributions
 Interpret+1.308
utton+1.277
uni+1.263
uti+1.224
 taunt+1.205
Neg logit contributions
 Stamp-1.920
sein-1.907
idium-1.905
Disclaimer-1.855
-1.818
Token
Token ID198
Feature activation+4.25e-01

Pos logit contributions
 Interpret+0.571
utton+0.559
uni+0.553
uti+0.536
 taunt+0.531
Neg logit contributions
 Stamp-0.789
sein-0.788
idium-0.780
Disclaimer-0.764
agate-0.747
TokenIt
Token ID1026
Feature activation-5.37e-02

Pos logit contributions
 Interpret+1.634
utton+1.617
uni+1.607
uti+1.572
 taunt+1.553
Neg logit contributions
 Stamp-1.428
sein-1.418
idium-1.385
Disclaimer-1.359
agate-1.314
Token's
Token ID338
Feature activation+1.60e-01

Pos logit contributions
 Stamp+0.180
sein+0.179
Disclaimer+0.174
idium+0.174
agate+0.167
Neg logit contributions
 Interpret-0.207
utton-0.204
uni-0.202
 injure-0.198
uti-0.198
Token so
Token ID523
Feature activation+2.27e-01

Pos logit contributions
 Interpret+1.384
utton+1.364
uni+1.357
uti+1.329
 injure+1.325
Neg logit contributions
sein-1.278
 Stamp-1.278
idium-1.248
Disclaimer-1.244
agate-1.191
Token against
Token ID1028
Feature activation+8.53e-02

Pos logit contributions
 Interpret+0.890
utton+0.879
uni+0.872
uti+0.855
 injure+0.853
Neg logit contributions
 Stamp-0.743
sein-0.743
idium-0.722
Disclaimer-0.720
agate-0.687
Token their
Token ID511
Feature activation+3.70e-02

Pos logit contributions
 Interpret+0.330
utton+0.326
uni+0.322
 injure+0.316
uti+0.316
Neg logit contributions
 Stamp-0.286
sein-0.284
idium-0.277
Disclaimer-0.276
-0.263
Token bourgeois
Token ID19749
Feature activation+6.06e-01

Pos logit contributions
 Interpret+0.133
utton+0.132
uni+0.130
uti+0.128
 injure+0.127
Neg logit contributions
 Stamp-0.134
sein-0.133
idium-0.130
Disclaimer-0.130
-0.124
Token political
Token ID1964
Feature activation+3.61e-01

Pos logit contributions
 Interpret+2.263
uni+2.216
utton+2.208
 taunt+2.164
uti+2.158
Neg logit contributions
 Stamp-2.087
sein-2.074
idium-2.031
Disclaimer-2.020
agate-1.951
Token guarant
Token ID5505
Feature activation+1.41e+00

Pos logit contributions
 Interpret+1.321
utton+1.294
uni+1.290
 taunt+1.260
uti+1.258
Neg logit contributions
 Stamp-1.278
sein-1.273
idium-1.244
Disclaimer-1.237
agate-1.193
Tokenor
Token ID273
Feature activation+2.98e-01

Pos logit contributions
 Interpret+3.913
utton+3.902
uti+3.845
uni+3.824
 taunt+3.713
Neg logit contributions
 Stamp-6.072
sein-5.929
Disclaimer-5.925
idium-5.817
-5.724
Token (
Token ID357
Feature activation+1.96e-02

Pos logit contributions
 Interpret+1.231
utton+1.218
uni+1.208
uti+1.184
 injure+1.182
Neg logit contributions
 Stamp-0.916
sein-0.915
idium-0.888
Disclaimer-0.885
agate-0.842
Tokenthe
Token ID1169
Feature activation-1.73e-01

Pos logit contributions
 Interpret+0.071
utton+0.070
uni+0.069
 injure+0.068
 taunt+0.068
Neg logit contributions
sein-0.071
 Stamp-0.071
Disclaimer-0.069
idium-0.069
agate-0.066
Token PC
Token ID4217
Feature activation-3.10e-01

Pos logit contributions
 Stamp+0.595
sein+0.593
idium+0.574
Disclaimer+0.573
+0.546
Neg logit contributions
 Interpret-0.653
utton-0.649
uni-0.639
 injure-0.630
uti-0.628
TokenF
Token ID37
Feature activation+1.57e-01

Pos logit contributions
 Stamp+0.913
sein+0.911
idium+0.881
Disclaimer+0.872
+0.832
Neg logit contributions
 Interpret-1.307
utton-1.301
uni-1.284
 injure-1.266
uti-1.265
Token is
Token ID318
Feature activation+8.64e-03

Pos logit contributions
 Interpret+0.603
utton+0.594
uni+0.589
uti+0.579
 injure+0.576
Neg logit contributions
 Stamp-0.552
sein-0.552
idium-0.538
Disclaimer-0.536
-0.516
Token little
Token ID1310
Feature activation-1.94e-01

Pos logit contributions
 Interpret+0.037
utton+0.037
uni+0.037
uti+0.036
 injure+0.036
Neg logit contributions
 Stamp-0.025
sein-0.025
idium-0.024
Disclaimer-0.024
-0.023
Token agreement
Token ID4381
Feature activation+5.84e-02

Pos logit contributions
 Stamp+0.657
sein+0.647
Disclaimer+0.635
idium+0.631
+0.605
Neg logit contributions
 Interpret-0.733
utton-0.731
uni-0.725
uti-0.713
 injure-0.704
Token on
Token ID319
Feature activation-3.27e-01

Pos logit contributions
 Interpret+0.243
utton+0.240
uni+0.239
uti+0.234
 injure+0.234
Neg logit contributions
 Stamp-0.177
sein-0.177
Disclaimer-0.171
idium-0.171
-0.164
Token how
Token ID703
Feature activation+1.54e-01

Pos logit contributions
 Stamp+1.103
sein+1.060
Disclaimer+1.039
idium+1.029
+1.000
Neg logit contributions
utton-1.242
 Interpret-1.242
uni-1.239
uti-1.228
 taunt-1.217
Token best
Token ID1266
Feature activation+1.38e+00

Pos logit contributions
 Interpret+0.673
utton+0.667
uni+0.662
uti+0.650
 taunt+0.648
Neg logit contributions
 Stamp-0.440
sein-0.437
Disclaimer-0.423
idium-0.423
-0.400
Token to
Token ID284
Feature activation+2.86e-01

Pos logit contributions
 Interpret+5.113
utton+5.040
uni+5.036
 injure+4.902
uti+4.804
Neg logit contributions
sein-4.707
 Stamp-4.644
idium-4.601
Disclaimer-4.488
agate-4.398
Token proceed
Token ID5120
Feature activation-2.11e-01

Pos logit contributions
 Interpret+0.971
utton+0.955
uni+0.948
 injure+0.926
uti+0.923
Neg logit contributions
sein-1.093
 Stamp-1.091
idium-1.066
Disclaimer-1.064
agate-1.022
Token.
Token ID13
Feature activation-2.86e-02

Pos logit contributions
 Stamp+0.621
sein+0.616
idium+0.599
Disclaimer+0.598
+0.570
Neg logit contributions
 Interpret-0.894
utton-0.886
uni-0.877
uti-0.863
 injure-0.860
Token
Token ID198
Feature activation-2.33e-01

Pos logit contributions
 Stamp+0.095
sein+0.093
Disclaimer+0.091
idium+0.090
+0.087
Neg logit contributions
utton-0.111
 Interpret-0.111
uni-0.110
uti-0.108
 injure-0.108
Token
Token ID198
Feature activation-2.31e-01

Pos logit contributions
 Stamp+0.910
sein+0.909
Disclaimer+0.888
idium+0.884
agate+0.851
Neg logit contributions
 Interpret-0.767
utton-0.758
uni-0.750
 injure-0.735
uti-0.733
Token them
Token ID606
Feature activation-9.65e-02

Pos logit contributions
 Stamp+0.578
sein+0.573
idium+0.556
Disclaimer+0.556
+0.530
Neg logit contributions
 Interpret-0.651
utton-0.645
uni-0.639
uti-0.627
 injure-0.623
Token several
Token ID1811
Feature activation-1.04e-01

Pos logit contributions
 Stamp+0.316
sein+0.315
idium+0.307
Disclaimer+0.306
agate+0.292
Neg logit contributions
 Interpret-0.379
utton-0.374
uni-0.371
uti-0.364
 injure-0.363
Token years
Token ID812
Feature activation-1.45e-01

Pos logit contributions
 Stamp+0.332
sein+0.329
Disclaimer+0.320
idium+0.319
+0.304
Neg logit contributions
 Interpret-0.417
utton-0.412
uni-0.409
uti-0.400
 injure-0.400
Token to
Token ID284
Feature activation+9.53e-02

Pos logit contributions
 Stamp+0.450
sein+0.448
Disclaimer+0.434
idium+0.431
 spinning+0.410
Neg logit contributions
 Interpret-0.601
uni-0.593
utton-0.590
 injure-0.574
uti-0.572
Token read
Token ID1100
Feature activation+1.38e-02

Pos logit contributions
 Interpret+0.314
utton+0.307
uni+0.305
 injure+0.300
 taunt+0.299
Neg logit contributions
sein-0.373
 Stamp-0.372
idium-0.364
Disclaimer-0.364
-0.350
Tokenjust
Token ID3137
Feature activation+1.37e+00

Pos logit contributions
 Interpret+0.034
utton+0.034
uni+0.033
uti+0.032
 taunt+0.032
Neg logit contributions
 Stamp-0.065
sein-0.065
idium-0.064
Disclaimer-0.064
agate-0.062
Token prices
Token ID4536
Feature activation+2.37e-01

Pos logit contributions
 Interpret+5.020
utton+4.871
uni+4.812
uti+4.803
 injure+4.787
Neg logit contributions
sein-4.651
idium-4.454
Disclaimer-4.440
 Stamp-4.407
-4.394
Token to
Token ID284
Feature activation+1.51e-01

Pos logit contributions
 Interpret+0.955
utton+0.941
uni+0.934
uti+0.920
 injure+0.916
Neg logit contributions
sein-0.748
 Stamp-0.743
idium-0.727
Disclaimer-0.725
-0.694
Token cover
Token ID3002
Feature activation-3.17e-01

Pos logit contributions
 Interpret+0.519
utton+0.510
uni+0.506
 injure+0.498
uti+0.496
Neg logit contributions
sein-0.570
 Stamp-0.565
idium-0.555
Disclaimer-0.554
agate-0.532
Token all
Token ID477
Feature activation-2.65e-01

Pos logit contributions
 Stamp+1.028
sein+1.014
idium+0.986
Disclaimer+0.981
 spinning+0.941
Neg logit contributions
 Interpret-1.260
utton-1.248
uni-1.239
uti-1.209
 injure-1.204
Token the
Token ID262
Feature activation+2.37e-01

Pos logit contributions
 Stamp+0.851
sein+0.827
Disclaimer+0.814
idium+0.814
 spinning+0.775
Neg logit contributions
 Interpret-1.056
utton-1.045
uni-1.043
uti-1.020
 taunt-1.011
Tokenzip
Token ID13344
Feature activation-5.36e-02

Pos logit contributions
 Interpret+0.816
utton+0.805
uni+0.797
uti+0.778
 injure+0.778
Neg logit contributions
sein-0.926
 Stamp-0.925
Disclaimer-0.902
idium-0.901
agate-0.867
Token file
Token ID2393
Feature activation+1.45e-01

Pos logit contributions
 Stamp+0.162
sein+0.160
Disclaimer+0.156
idium+0.156
agate+0.149
Neg logit contributions
 Interpret-0.223
utton-0.221
uni-0.220
 injure-0.215
uti-0.215
Token.
Token ID13
Feature activation+1.65e-02

Pos logit contributions
 Interpret+0.576
utton+0.570
uni+0.565
uti+0.554
 injure+0.553
Neg logit contributions
 Stamp-0.470
sein-0.468
idium-0.455
Disclaimer-0.454
agate-0.434
Token
Token ID198
Feature activation-5.84e-01

Pos logit contributions
 Interpret+0.063
utton+0.063
uni+0.062
uti+0.061
 injure+0.061
Neg logit contributions
 Stamp-0.056
sein-0.055
Disclaimer-0.054
idium-0.054
agate-0.052
Token
Token ID198
Feature activation-5.80e-01

Pos logit contributions
 Stamp+2.166
sein+2.158
Disclaimer+2.133
idium+2.072
+2.009
Neg logit contributions
 Interpret-2.010
utton-2.003
 injure-1.980
uni-1.980
 taunt-1.949
Tokenfile
Token ID7753
Feature activation+1.37e+00

Pos logit contributions
sein+1.796
Disclaimer+1.791
 Stamp+1.757
idium+1.718
agate+1.656
Neg logit contributions
 Interpret-2.348
utton-2.325
 injure-2.304
uni-2.290
 taunt-2.289
Token.
Token ID13
Feature activation+3.56e-01

Pos logit contributions
 Interpret+4.702
utton+4.561
uni+4.534
uti+4.395
 taunt+4.366
Neg logit contributions
sein-4.959
idium-4.836
 Stamp-4.769
Disclaimer-4.706
-4.660
Token A
Token ID317
Feature activation-4.45e-01

Pos logit contributions
 Interpret+1.503
utton+1.480
uni+1.468
uti+1.439
 injure+1.438
Neg logit contributions
sein-1.066
 Stamp-1.061
idium-1.035
Disclaimer-1.027
-0.980
Token S
Token ID311
Feature activation-1.93e-01

Pos logit contributions
 Stamp+1.582
sein+1.539
Disclaimer+1.530
idium+1.496
 spinning+1.492
Neg logit contributions
 Interpret-1.609
uni-1.585
utton-1.579
 injure-1.547
uti-1.536
Token3
Token ID18
Feature activation-5.09e-01

Pos logit contributions
 Stamp+0.581
sein+0.577
Disclaimer+0.566
idium+0.565
+0.538
Neg logit contributions
 Interpret-0.809
uni-0.793
utton-0.792
 injure-0.777
 taunt-0.774
Token bucket
Token ID19236
Feature activation-5.05e-02

Pos logit contributions
 Stamp+2.054
Disclaimer+1.993
sein+1.980
idium+1.970
 spinning+1.928
Neg logit contributions
 Interpret-1.569
utton-1.567
uni-1.544
 injure-1.518
 taunt-1.491
Token been
Token ID587
Feature activation+3.07e-01

Pos logit contributions
sein+0.392
 Stamp+0.391
idium+0.378
Disclaimer+0.375
agate+0.355
Neg logit contributions
 Interpret-0.648
utton-0.643
uni-0.636
uti-0.624
 taunt-0.618
Token laid
Token ID8104
Feature activation+3.68e-02

Pos logit contributions
 Interpret+1.192
utton+1.173
uni+1.167
 injure+1.146
uti+1.146
Neg logit contributions
 Stamp-1.018
sein-1.017
idium-0.989
Disclaimer-0.987
-0.943
Token by
Token ID416
Feature activation+7.28e-02

Pos logit contributions
 Interpret+0.137
utton+0.135
uni+0.134
uti+0.131
 injure+0.131
Neg logit contributions
 Stamp-0.128
sein-0.127
idium-0.124
Disclaimer-0.124
agate-0.118
Token al
Token ID435
Feature activation+1.27e-01

Pos logit contributions
 Interpret+0.286
utton+0.283
uni+0.280
uti+0.274
 injure+0.273
Neg logit contributions
 Stamp-0.239
sein-0.238
idium-0.231
Disclaimer-0.230
agate-0.220
Token-
Token ID12
Feature activation-2.79e-03

Pos logit contributions
 Interpret+0.522
utton+0.516
uni+0.512
uti+0.501
 injure+0.500
Neg logit contributions
sein-0.390
 Stamp-0.389
idium-0.378
Disclaimer-0.376
agate-0.359
TokenSh
Token ID2484
Feature activation+1.36e+00

Pos logit contributions
 Stamp+0.010
sein+0.010
Disclaimer+0.009
idium+0.009
+0.009
Neg logit contributions
 Interpret-0.011
utton-0.010
uni-0.010
uti-0.010
 injure-0.010
Tokenab
Token ID397
Feature activation+3.00e-01

Pos logit contributions
utton+4.014
uni+3.905
 Interpret+3.843
uti+3.733
ators+3.609
Neg logit contributions
 Stamp-5.746
sein-5.592
Disclaimer-5.544
idium-5.453
-5.440
Tokenab
Token ID397
Feature activation+3.39e-01

Pos logit contributions
 Interpret+1.024
utton+1.012
uni+1.006
uti+0.978
 taunt+0.971
Neg logit contributions
 Stamp-1.140
sein-1.130
Disclaimer-1.105
idium-1.104
-1.065
Token,
Token ID11
Feature activation+1.79e-01

Pos logit contributions
 Interpret+1.326
utton+1.306
uni+1.301
uti+1.271
 injure+1.271
Neg logit contributions
sein-1.116
 Stamp-1.111
idium-1.079
Disclaimer-1.078
agate-1.032
Token which
Token ID543
Feature activation+2.60e-01

Pos logit contributions
 Interpret+0.677
utton+0.669
uni+0.663
uti+0.649
 injure+0.649
Neg logit contributions
sein-0.615
 Stamp-0.612
idium-0.597
Disclaimer-0.595
agate-0.570
Token is
Token ID318
Feature activation+2.31e-01

Pos logit contributions
 Interpret+0.915
utton+0.903
uni+0.894
uti+0.874
 injure+0.872
Neg logit contributions
 Stamp-0.955
sein-0.953
idium-0.929
Disclaimer-0.928
agate-0.890
Token3
Token ID18
Feature activation+1.28e-01

Pos logit contributions
sein+0.399
 Stamp+0.399
Disclaimer+0.389
idium+0.386
+0.368
Neg logit contributions
 Interpret-0.540
utton-0.533
uni-0.528
 injure-0.521
uti-0.519
TokenD
Token ID35
Feature activation-1.37e-01

Pos logit contributions
 Interpret+0.552
utton+0.546
uni+0.542
uti+0.532
 injure+0.531
Neg logit contributions
 Stamp-0.373
sein-0.372
idium-0.360
Disclaimer-0.359
agate-0.340
Token Gallery
Token ID12917
Feature activation-3.19e-01

Pos logit contributions
 Stamp+0.452
sein+0.445
Disclaimer+0.434
idium+0.433
agate+0.414
Neg logit contributions
 Interpret-0.533
utton-0.529
uni-0.525
 injure-0.515
uti-0.514
Token
Token ID198
Feature activation-1.58e-01

Pos logit contributions
 Stamp+1.210
sein+1.178
Disclaimer+1.158
idium+1.156
agate+1.111
Neg logit contributions
 Interpret-1.092
utton-1.081
uni-1.074
 injure-1.064
uti-1.050
Token
Token ID198
Feature activation+8.02e-02

Pos logit contributions
 Stamp+0.628
sein+0.628
idium+0.613
Disclaimer+0.612
agate+0.589
Neg logit contributions
 Interpret-0.511
utton-0.504
uni-0.499
uti-0.487
 injure-0.486
TokenAdvertisement
Token ID4723
Feature activation+1.34e+00

Pos logit contributions
 Interpret+0.337
utton+0.333
uni+0.331
uti+0.325
 injure+0.324
Neg logit contributions
 Stamp-0.241
sein-0.240
idium-0.233
Disclaimer-0.233
agate-0.221
Token
Token ID198
Feature activation-5.09e-01

Pos logit contributions
 Interpret+3.194
uni+3.039
utton+3.025
 =================================================================+2.909
uti+2.810
Neg logit contributions
idium-6.283
���-6.229
sein-6.161
 Stamp-6.150
-5.963
Token
Token ID198
Feature activation-5.10e-01

Pos logit contributions
 Stamp+1.935
sein+1.927
Disclaimer+1.896
idium+1.862
+1.799
Neg logit contributions
 Interpret-1.715
utton-1.703
uni-1.682
 injure-1.671
uti-1.653
TokenHonestly
Token ID40817
Feature activation-4.69e-01

Pos logit contributions
 Stamp+1.551
sein+1.542
Disclaimer+1.530
idium+1.492
+1.427
Neg logit contributions
 Interpret-2.121
utton-2.092
uni-2.065
 injure-2.063
 taunt-2.046
Token,
Token ID11
Feature activation-2.11e-01

Pos logit contributions
 Stamp+1.173
sein+1.153
Disclaimer+1.116
idium+1.103
 spinning+1.046
Neg logit contributions
utton-2.204
 Interpret-2.184
uni-2.155
uti-2.141
 taunt-2.121
Token this
Token ID428
Feature activation-2.07e-01

Pos logit contributions
 Stamp+0.716
sein+0.709
idium+0.690
Disclaimer+0.689
agate+0.658
Neg logit contributions
 Interpret-0.806
utton-0.799
uni-0.792
uti-0.777
 injure-0.771
Token your
Token ID534
Feature activation+2.26e-01

Pos logit contributions
 Stamp+1.339
sein+1.326
Disclaimer+1.285
idium+1.281
 spinning+1.224
Neg logit contributions
 Interpret-1.787
utton-1.779
uni-1.771
uti-1.731
 taunt-1.715
Token putting
Token ID5137
Feature activation+3.42e-01

Pos logit contributions
 Interpret+0.832
utton+0.819
uni+0.812
 injure+0.794
uti+0.794
Neg logit contributions
sein-0.800
 Stamp-0.800
idium-0.779
Disclaimer-0.776
-0.746
Token routine
Token ID8027
Feature activation-2.61e-02

Pos logit contributions
 Interpret+1.266
utton+1.247
uni+1.236
 injure+1.211
 taunt+1.211
Neg logit contributions
sein-1.198
 Stamp-1.195
idium-1.162
Disclaimer-1.161
agate-1.116
Token when
Token ID618
Feature activation+3.20e-01

Pos logit contributions
 Stamp+0.082
sein+0.082
idium+0.079
Disclaimer+0.079
agate+0.075
Neg logit contributions
 Interpret-0.106
utton-0.104
uni-0.104
uti-0.102
 injure-0.101
Token up
Token ID510
Feature activation+2.16e-01

Pos logit contributions
 Interpret+1.208
utton+1.190
uni+1.181
 injure+1.158
uti+1.155
Neg logit contributions
sein-1.101
 Stamp-1.095
idium-1.068
Disclaimer-1.062
-1.023
Token close
Token ID1969
Feature activation+1.33e+00

Pos logit contributions
 Interpret+0.874
utton+0.864
uni+0.857
uti+0.840
 injure+0.839
Neg logit contributions
 Stamp-0.679
sein-0.679
idium-0.659
Disclaimer-0.657
agate-0.626
Token to
Token ID284
Feature activation+2.87e-01

Pos logit contributions
 Interpret+4.955
utton+4.928
uni+4.885
 injure+4.872
 taunt+4.846
Neg logit contributions
sein-4.514
 Stamp-4.456
idium-4.331
Disclaimer-4.313
-4.192
Token the
Token ID262
Feature activation+1.64e-02

Pos logit contributions
 Interpret+1.089
utton+1.075
uni+1.065
 injure+1.044
uti+1.043
Neg logit contributions
sein-0.980
 Stamp-0.979
idium-0.953
Disclaimer-0.949
-0.911
Token basket
Token ID7988
Feature activation+9.22e-02

Pos logit contributions
 Interpret+0.058
utton+0.058
uni+0.057
uti+0.056
 injure+0.055
Neg logit contributions
 Stamp-0.060
sein-0.060
Disclaimer-0.058
idium-0.058
agate-0.056
Token.
Token ID13
Feature activation-2.08e-01

Pos logit contributions
 Interpret+0.380
utton+0.375
uni+0.372
uti+0.365
 injure+0.364
Neg logit contributions
 Stamp-0.285
sein-0.284
idium-0.276
Disclaimer-0.275
agate-0.261
Token This
Token ID770
Feature activation-1.05e-01

Pos logit contributions
 Stamp+0.728
sein+0.722
Disclaimer+0.705
idium+0.701
+0.671
Neg logit contributions
 Interpret-0.769
utton-0.766
uni-0.760
uti-0.743
 injure-0.742
Token
Token ID198
Feature activation+2.44e-02

Pos logit contributions
 Interpret+0.148
utton+0.145
uni+0.144
uti+0.140
 taunt+0.139
Neg logit contributions
 Stamp-0.197
sein-0.197
idium-0.194
Disclaimer-0.191
agate-0.186
TokenPixel
Token ID40809
Feature activation+7.22e-02

Pos logit contributions
 Interpret+0.100
utton+0.099
uni+0.099
uti+0.097
 injure+0.096
Neg logit contributions
 Stamp-0.075
sein-0.075
idium-0.073
Disclaimer-0.073
agate-0.069
Tokenpipe
Token ID34360
Feature activation+4.64e-02

Pos logit contributions
 Interpret+0.284
utton+0.281
uni+0.278
 injure+0.273
uti+0.273
Neg logit contributions
 Stamp-0.236
sein-0.234
idium-0.229
Disclaimer-0.228
agate-0.218
Token
Token ID198
Feature activation+4.72e-01

Pos logit contributions
 Interpret+0.157
utton+0.155
uni+0.154
 injure+0.151
uti+0.150
Neg logit contributions
 Stamp-0.178
sein-0.176
idium-0.173
Disclaimer-0.172
agate-0.166
Token
Token ID198
Feature activation+5.62e-01

Pos logit contributions
 Interpret+1.345
utton+1.309
uni+1.306
uti+1.249
 taunt+1.238
Neg logit contributions
 Stamp-2.036
sein-2.033
idium-2.026
Disclaimer-1.957
agate-1.931
TokenAdvertisement
Token ID4723
Feature activation+1.31e+00

Pos logit contributions
utton+2.244
 Interpret+2.235
uni+2.234
uti+2.166
 taunt+2.138
Neg logit contributions
 Stamp-1.795
sein-1.774
idium-1.719
Disclaimer-1.671
-1.624
Token
Token ID198
Feature activation-4.37e-01

Pos logit contributions
 Interpret+3.117
uni+2.989
utton+2.958
 =================================================================+2.845
uti+2.764
Neg logit contributions
idium-6.160
���-6.122
 Stamp-6.032
sein-6.028
-5.809
Token
Token ID198
Feature activation-4.14e-01

Pos logit contributions
 Stamp+1.679
sein+1.673
Disclaimer+1.644
idium+1.621
+1.565
Neg logit contributions
 Interpret-1.457
utton-1.445
uni-1.427
 injure-1.411
uti-1.400
TokenIf
Token ID1532
Feature activation-3.99e-01

Pos logit contributions
 Stamp+1.148
sein+1.144
Disclaimer+1.137
idium+1.103
+1.057
Neg logit contributions
 Interpret-1.828
utton-1.805
uni-1.782
 injure-1.781
 taunt-1.768
Token you
Token ID345
Feature activation-1.16e-01

Pos logit contributions
 Stamp+1.277
sein+1.238
Disclaimer+1.211
idium+1.210
 spinning+1.153
Neg logit contributions
 Interpret-1.611
utton-1.601
uni-1.585
uti-1.555
 injure-1.553
Token're
Token ID821
Feature activation-7.35e-03

Pos logit contributions
 Stamp+0.387
sein+0.383
idium+0.374
Disclaimer+0.373
agate+0.358
Neg logit contributions
 Interpret-0.449
utton-0.444
uni-0.440
uti-0.433
 injure-0.428
TokenRet
Token ID9781
Feature activation-3.81e-01

Pos logit contributions
sein+0.579
 Stamp+0.577
Disclaimer+0.568
idium+0.560
+0.536
Neg logit contributions
 Interpret-0.770
utton-0.757
uni-0.749
 injure-0.743
 taunt-0.739
Tokenro
Token ID305
Feature activation-3.33e-01

Pos logit contributions
sein+1.288
 Stamp+1.277
idium+1.256
Disclaimer+1.231
agate+1.201
Neg logit contributions
 Interpret-1.473
utton-1.424
 injure-1.414
 taunt-1.413
uni-1.403
Token Camera
Token ID20432
Feature activation-6.26e-03

Pos logit contributions
 Stamp+1.085
sein+1.074
idium+1.049
Disclaimer+1.049
agate+0.999
Neg logit contributions
 Interpret-1.311
utton-1.296
uni-1.283
 injure-1.269
uti-1.260
Token
Token ID198
Feature activation-2.54e-01

Pos logit contributions
 Stamp+0.023
sein+0.023
idium+0.023
Disclaimer+0.023
agate+0.022
Neg logit contributions
 Interpret-0.022
utton-0.022
uni-0.021
 injure-0.021
uti-0.021
Token
Token ID198
Feature activation+4.12e-02

Pos logit contributions
 Stamp+0.993
sein+0.992
Disclaimer+0.970
idium+0.965
agate+0.929
Neg logit contributions
 Interpret-0.832
utton-0.822
uni-0.813
 injure-0.797
uti-0.795
TokenAdvertisement
Token ID4723
Feature activation+1.30e+00

Pos logit contributions
 Interpret+0.174
utton+0.172
uni+0.170
 injure+0.167
uti+0.167
Neg logit contributions
sein-0.123
 Stamp-0.123
Disclaimer-0.119
idium-0.119
-0.113
Token
Token ID198
Feature activation-3.83e-01

Pos logit contributions
 Interpret+3.130
uni+2.977
utton+2.956
 =================================================================+2.845
uti+2.770
Neg logit contributions
idium-6.102
���-6.040
sein-5.982
 Stamp-5.974
-5.786
Token
Token ID198
Feature activation-3.94e-01

Pos logit contributions
 Stamp+1.485
sein+1.480
Disclaimer+1.451
idium+1.436
+1.385
Neg logit contributions
 Interpret-1.267
utton-1.255
uni-1.241
 injure-1.223
uti-1.216
TokenIf
Token ID1532
Feature activation-3.78e-01

Pos logit contributions
sein+1.152
 Stamp+1.152
Disclaimer+1.146
idium+1.107
+1.065
Neg logit contributions
 Interpret-1.677
utton-1.660
uni-1.636
 injure-1.634
 taunt-1.620
Token you
Token ID345
Feature activation-1.31e-02

Pos logit contributions
 Stamp+1.207
sein+1.178
Disclaimer+1.151
idium+1.149
 spinning+1.100
Neg logit contributions
 Interpret-1.518
utton-1.512
uni-1.495
 injure-1.463
uti-1.462
Token've
Token ID1053
Feature activation+2.02e-01

Pos logit contributions
 Stamp+0.044
sein+0.044
idium+0.043
Disclaimer+0.042
agate+0.041
Neg logit contributions
 Interpret-0.050
utton-0.050
uni-0.049
uti-0.048
 injure-0.048
Tokenible
Token ID856
Feature activation-2.48e-01

Pos logit contributions
sein+1.645
 Stamp+1.606
idium+1.598
Disclaimer+1.576
agate+1.555
Neg logit contributions
 Interpret-0.971
utton-0.930
uni-0.917
 injure-0.915
 taunt-0.897
Token recruitment
Token ID19624
Feature activation-1.98e-01

Pos logit contributions
 Stamp+1.051
sein+1.045
Disclaimer+1.018
idium+1.015
+0.984
Neg logit contributions
 Interpret-0.744
utton-0.729
uni-0.721
uti-0.702
 injure-0.697
Token by
Token ID416
Feature activation-2.42e-01

Pos logit contributions
 Stamp+0.734
sein+0.723
Disclaimer+0.701
idium+0.700
 spinning+0.678
Neg logit contributions
 Interpret-0.701
utton-0.689
uni-0.687
uti-0.672
 injure-0.664
Token al
Token ID435
Feature activation+5.31e-02

Pos logit contributions
 Stamp+0.809
sein+0.795
Disclaimer+0.772
idium+0.769
 spinning+0.746
Neg logit contributions
 Interpret-0.934
utton-0.923
uni-0.913
uti-0.898
 injure-0.891
Token-
Token ID12
Feature activation+2.36e-02

Pos logit contributions
 Interpret+0.219
utton+0.217
uni+0.215
uti+0.211
 injure+0.210
Neg logit contributions
sein-0.163
 Stamp-0.163
idium-0.158
Disclaimer-0.157
agate-0.150
TokenSh
Token ID2484
Feature activation+1.29e+00

Pos logit contributions
 Interpret+0.089
utton+0.088
uni+0.086
uti+0.085
 injure+0.085
Neg logit contributions
 Stamp-0.082
sein-0.081
Disclaimer-0.080
idium-0.078
-0.075
Tokenab
Token ID397
Feature activation+2.88e-01

Pos logit contributions
utton+3.877
uni+3.768
 Interpret+3.734
uti+3.624
iesta+3.494
Neg logit contributions
 Stamp-5.433
sein-5.302
Disclaimer-5.233
idium-5.165
-5.133
Tokenab
Token ID397
Feature activation+2.86e-01

Pos logit contributions
 Interpret+0.986
utton+0.974
uni+0.968
uti+0.942
 taunt+0.935
Neg logit contributions
 Stamp-1.093
sein-1.085
Disclaimer-1.060
idium-1.060
-1.021
Token since
Token ID1201
Feature activation-3.47e-01

Pos logit contributions
 Interpret+1.114
utton+1.102
uni+1.091
uti+1.071
 injure+1.067
Neg logit contributions
 Stamp-0.949
sein-0.945
idium-0.922
Disclaimer-0.918
agate-0.876
Token 2010
Token ID3050
Feature activation-4.15e-02

Pos logit contributions
 Stamp+1.077
sein+1.054
Disclaimer+1.020
idium+1.014
 spinning+0.986
Neg logit contributions
 Interpret-1.426
utton-1.419
uni-1.403
uti-1.376
 injure-1.369
Token.
Token ID13
Feature activation+1.87e-01

Pos logit contributions
 Stamp+0.124
sein+0.123
idium+0.120
Disclaimer+0.120
agate+0.114
Neg logit contributions
 Interpret-0.175
utton-0.172
uni-0.171
 injure-0.168
uti-0.167
Token.
Token ID13
Feature activation+1.87e-01

Pos logit contributions
 Stamp+0.626
sein+0.621
Disclaimer+0.602
idium+0.600
agate+0.571
Neg logit contributions
 Interpret-0.855
utton-0.843
uni-0.842
uti-0.821
 taunt-0.818
Token
Token ID198
Feature activation+2.28e-01

Pos logit contributions
 Interpret+0.644
utton+0.639
uni+0.633
 injure+0.618
uti+0.618
Neg logit contributions
 Stamp-0.704
sein-0.701
Disclaimer-0.683
idium-0.682
-0.655
Token
Token ID198
Feature activation+2.04e-01

Pos logit contributions
 Interpret+0.684
utton+0.668
uni+0.662
uti+0.642
 taunt+0.633
Neg logit contributions
 Stamp-0.958
sein-0.956
idium-0.946
Disclaimer-0.925
agate-0.906
TokenStory
Token ID11605
Feature activation+3.99e-01

Pos logit contributions
 Interpret+0.788
utton+0.778
uni+0.772
uti+0.756
 injure+0.756
Neg logit contributions
sein-0.678
 Stamp-0.678
idium-0.659
Disclaimer-0.659
-0.628
Token continues
Token ID4477
Feature activation+4.89e-01

Pos logit contributions
 Interpret+1.124
utton+1.104
uni+1.090
uti+1.058
 taunt+1.054
Neg logit contributions
 Stamp-1.750
sein-1.747
idium-1.715
Disclaimer-1.708
-1.659
Token below
Token ID2174
Feature activation+1.29e+00

Pos logit contributions
 Interpret+1.029
utton+1.002
uni+0.975
 taunt+0.955
 injure+0.954
Neg logit contributions
sein-2.489
 Stamp-2.484
idium-2.450
Disclaimer-2.441
-2.397
Token advertisement
Token ID15422
Feature activation+2.94e-01

Pos logit contributions
utton+3.955
 Interpret+3.919
 taunt+3.822
uni+3.697
 injure+3.628
Neg logit contributions
sein-5.245
idium-5.229
 Stamp-5.109
-5.108
Disclaimer-4.988
Token
Token ID198
Feature activation+1.02e-01

Pos logit contributions
 Interpret+0.881
utton+0.858
uni+0.846
uti+0.824
 taunt+0.816
Neg logit contributions
sein-1.231
 Stamp-1.229
idium-1.222
Disclaimer-1.194
agate-1.168
Token
Token ID198
Feature activation+4.50e-01

Pos logit contributions
 Interpret+0.319
utton+0.312
uni+0.309
uti+0.301
 taunt+0.299
Neg logit contributions
 Stamp-0.419
sein-0.419
idium-0.412
Disclaimer-0.406
agate-0.395
TokenStory
Token ID11605
Feature activation+5.31e-01

Pos logit contributions
 Interpret+2.178
utton+2.163
uni+2.148
uti+2.114
 taunt+2.095
Neg logit contributions
 Stamp-1.059
sein-1.050
idium-1.012
Disclaimer-0.991
agate-0.942
Token continues
Token ID4477
Feature activation+3.97e-01

Pos logit contributions
 Interpret+1.382
utton+1.380
uni+1.319
 taunt+1.298
uti+1.290
Neg logit contributions
 Stamp-2.424
sein-2.413
idium-2.409
Disclaimer-2.377
-2.354
Token Andre
Token ID10948
Feature activation+2.60e-01

Pos logit contributions
 Interpret+0.312
utton+0.309
uni+0.306
uti+0.301
 injure+0.300
Neg logit contributions
 Stamp-0.254
sein-0.251
Disclaimer-0.244
idium-0.243
-0.233
Token Gre
Token ID11955
Feature activation+6.56e-02

Pos logit contributions
 Interpret+0.914
utton+0.897
uni+0.895
uti+0.870
 injure+0.864
Neg logit contributions
sein-0.961
 Stamp-0.956
Disclaimer-0.940
idium-0.938
-0.904
Tokenipel
Token ID40634
Feature activation+2.44e-01

Pos logit contributions
 Interpret+0.319
utton+0.316
uni+0.314
uti+0.309
 injure+0.308
Neg logit contributions
sein-0.153
 Stamp-0.153
idium-0.147
Disclaimer-0.146
-0.137
Token (
Token ID357
Feature activation+1.22e-01

Pos logit contributions
 Interpret+0.907
utton+0.897
uni+0.889
uti+0.871
 injure+0.867
Neg logit contributions
 Stamp-0.850
sein-0.846
idium-0.824
Disclaimer-0.823
-0.787
TokenL
Token ID43
Feature activation-2.28e-01

Pos logit contributions
 Interpret+0.497
utton+0.491
uni+0.489
uti+0.478
 injure+0.476
Neg logit contributions
sein-0.384
 Stamp-0.384
idium-0.373
Disclaimer-0.369
agate-0.354
Tokenotto
Token ID17631
Feature activation-1.43e+00

Pos logit contributions
 Stamp+0.737
sein+0.732
idium+0.723
Disclaimer+0.715
agate+0.692
Neg logit contributions
 Interpret-0.902
utton-0.881
uni-0.871
 taunt-0.867
 injure-0.865
Token -
Token ID532
Feature activation-3.49e-01

Pos logit contributions
 Stamp+4.041
Disclaimer+3.939
idium+3.772
+3.760
 spinning+3.743
Neg logit contributions
 Interpret-5.872
ername-5.768
utton-5.759
uni-5.707
 taunt-5.674
Token Bel
Token ID3944
Feature activation-2.12e-02

Pos logit contributions
 Stamp+1.073
sein+1.034
Disclaimer+1.025
idium+1.011
 spinning+0.979
Neg logit contributions
 Interpret-1.427
utton-1.422
uti-1.408
uni-1.405
 taunt-1.388
Tokenisol
Token ID30152
Feature activation-3.75e-01

Pos logit contributions
sein+0.085
 Stamp+0.084
idium+0.083
Disclaimer+0.082
agate+0.080
Neg logit contributions
 Interpret-0.067
utton-0.066
uni-0.065
 injure-0.064
 taunt-0.064
Token Team
Token ID4816
Feature activation-3.87e-01

Pos logit contributions
 Stamp+0.933
sein+0.903
Disclaimer+0.884
idium+0.882
 spinning+0.820
Neg logit contributions
 Interpret-1.764
utton-1.745
uni-1.737
 taunt-1.709
uti-1.708
Token)
Token ID8
Feature activation-3.51e-01

Pos logit contributions
 Stamp+0.991
sein+0.952
Disclaimer+0.946
idium+0.930
 spinning+0.884
Neg logit contributions
 Interpret-1.786
utton-1.768
uni-1.748
uti-1.735
 taunt-1.733
Token.
Token ID13
Feature activation-1.70e-01

Pos logit contributions
 Stamp+0.858
sein+0.849
Disclaimer+0.826
idium+0.825
+0.790
Neg logit contributions
 Interpret-0.919
utton-0.914
uni-0.905
uti-0.887
 injure-0.882
Token
Token ID198
Feature activation+1.57e-01

Pos logit contributions
 Stamp+0.574
sein+0.569
Disclaimer+0.556
idium+0.549
+0.527
Neg logit contributions
 Interpret-0.650
utton-0.649
uni-0.642
uti-0.630
 injure-0.628
Token
Token ID198
Feature activation+1.51e-01

Pos logit contributions
 Interpret+0.479
utton+0.470
uni+0.466
uti+0.451
 taunt+0.448
Neg logit contributions
sein-0.650
 Stamp-0.650
idium-0.642
Disclaimer-0.629
agate-0.615
TokenBest
Token ID13014
Feature activation+2.18e-01

Pos logit contributions
 Interpret+0.599
utton+0.591
uni+0.585
 injure+0.574
uti+0.574
Neg logit contributions
sein-0.490
 Stamp-0.489
Disclaimer-0.476
idium-0.475
-0.452
Token of
Token ID286
Feature activation-8.45e-01

Pos logit contributions
 Interpret+0.782
utton+0.777
uni+0.767
uti+0.751
 injure+0.749
Neg logit contributions
 Stamp-0.789
sein-0.785
Disclaimer-0.767
idium-0.763
-0.734
Token luck
Token ID8458
Feature activation-1.41e+00

Pos logit contributions
 Stamp+2.472
sein+2.435
Disclaimer+2.350
idium+2.341
+2.260
Neg logit contributions
 Interpret-3.568
utton-3.530
uni-3.520
uti-3.449
 injure-3.418
Token to
Token ID284
Feature activation-6.67e-01

Pos logit contributions
 Stamp+4.226
Disclaimer+4.033
sein+4.024
 spinning+4.023
 Stard+3.890
Neg logit contributions
utton-5.674
uni-5.579
 Interpret-5.547
uti-5.520
 injure-5.397
Token both
Token ID1111
Feature activation-1.99e-01

Pos logit contributions
 Stamp+2.188
sein+2.152
Disclaimer+2.084
idium+2.074
 spinning+2.014
Neg logit contributions
 Interpret-2.596
utton-2.593
uni-2.548
uti-2.530
 injure-2.497
Token parties
Token ID4671
Feature activation-1.95e-01

Pos logit contributions
 Stamp+0.714
sein+0.703
idium+0.684
Disclaimer+0.684
 spinning+0.661
Neg logit contributions
utton-0.720
uni-0.717
 Interpret-0.716
uti-0.705
 injure-0.691
Token.
Token ID13
Feature activation-4.80e-01

Pos logit contributions
 Stamp+0.616
sein+0.615
idium+0.596
Disclaimer+0.595
agate+0.567
Neg logit contributions
 Interpret-0.786
utton-0.781
uni-0.775
uti-0.759
 injure-0.754
Token
Token ID198
Feature activation-1.64e-01

Pos logit contributions
 Stamp+1.678
sein+1.653
Disclaimer+1.627
idium+1.589
+1.543
Neg logit contributions
utton-1.780
 Interpret-1.757
uni-1.752
uti-1.720
 injure-1.708
Token density
Token ID12109
Feature activation-4.63e-01

Pos logit contributions
sein+1.452
 Stamp+1.427
Disclaimer+1.416
idium+1.404
agate+1.333
Neg logit contributions
 Interpret-1.863
utton-1.821
uti-1.777
 injure-1.774
uni-1.772
TokenGrid
Token ID41339
Feature activation-6.97e-01

Pos logit contributions
 Stamp+1.247
sein+1.245
Disclaimer+1.211
idium+1.207
agate+1.133
Neg logit contributions
 Interpret-2.086
utton-2.063
uni-2.033
 injure-2.012
uti-2.010
Token,
Token ID837
Feature activation+1.83e-01

Pos logit contributions
 Stamp+1.928
sein+1.901
Disclaimer+1.874
idium+1.857
agate+1.750
Neg logit contributions
 Interpret-3.088
utton-3.052
uni-2.975
 injure-2.967
uti-2.953
Token const
Token ID1500
Feature activation-1.18e+00

Pos logit contributions
 Interpret+0.721
utton+0.713
uni+0.707
uti+0.693
 injure+0.692
Neg logit contributions
 Stamp-0.595
sein-0.594
idium-0.577
Disclaimer-0.576
agate-0.549
Token Vector
Token ID20650
Feature activation-9.79e-01

Pos logit contributions
 Stamp+3.521
Disclaimer+3.381
sein+3.304
idium+3.288
 spinning+3.219
Neg logit contributions
utton-4.735
uti-4.716
 Interpret-4.664
 injure-4.637
uni-4.622
Token<
Token ID27
Feature activation-1.39e+00

Pos logit contributions
 Stamp+2.626
Disclaimer+2.555
sein+2.527
idium+2.515
agate+2.407
Neg logit contributions
 Interpret-4.309
utton-4.297
uti-4.217
uni-4.205
 taunt-4.173
Token Part
Token ID2142
Feature activation-8.97e-01

Pos logit contributions
 Stamp+3.723
Disclaimer+3.578
sein+3.541
idium+3.475
 spinning+3.288
Neg logit contributions
utton-5.993
 Interpret-5.926
 injure-5.879
uti-5.862
 taunt-5.804
Tokenicle
Token ID1548
Feature activation-6.08e-01

Pos logit contributions
 Stamp+3.371
idium+3.359
sein+3.346
Disclaimer+3.320
agate+3.164
Neg logit contributions
 Interpret-3.041
utton-2.971
 injure-2.895
uni-2.893
uti-2.875
Token >
Token ID1875
Feature activation-8.87e-01

Pos logit contributions
 Stamp+2.069
sein+2.031
Disclaimer+2.021
idium+2.000
agate+1.912
Neg logit contributions
 Interpret-2.290
utton-2.284
uni-2.231
uti-2.213
 injure-2.193
Token &
Token ID1222
Feature activation-3.60e-01

Pos logit contributions
 Stamp+2.769
sein+2.760
Disclaimer+2.704
idium+2.653
 spinning+2.570
Neg logit contributions
 Interpret-3.561
utton-3.528
uti-3.489
uni-3.462
 injure-3.347
Token particles
Token ID13166
Feature activation-5.05e-01

Pos logit contributions
sein+1.125
 Stamp+1.124
Disclaimer+1.095
idium+1.086
agate+1.039
Neg logit contributions
 Interpret-1.472
utton-1.449
uni-1.420
uti-1.415
 injure-1.395
Token the
Token ID262
Feature activation-1.60e-01

Pos logit contributions
 Interpret+0.654
utton+0.647
uni+0.642
 injure+0.629
uti+0.628
Neg logit contributions
 Stamp-0.485
sein-0.485
idium-0.470
Disclaimer-0.469
agate-0.447
Token stabbing
Token ID29418
Feature activation+6.15e-01

Pos logit contributions
 Stamp+0.579
sein+0.576
idium+0.562
Disclaimer+0.561
+0.537
Neg logit contributions
 Interpret-0.574
utton-0.569
uni-0.564
uti-0.555
 injure-0.545
Token accuracy
Token ID9922
Feature activation+7.79e-02

Pos logit contributions
 Interpret+2.258
utton+2.201
uni+2.180
 injure+2.177
 taunt+2.174
Neg logit contributions
sein-2.181
 Stamp-2.166
idium-2.110
Disclaimer-2.096
agate-2.029
Token of
Token ID286
Feature activation+1.28e-01

Pos logit contributions
 Interpret+0.305
utton+0.303
uni+0.300
uti+0.294
 injure+0.291
Neg logit contributions
 Stamp-0.256
sein-0.254
idium-0.247
Disclaimer-0.246
-0.235
Token Er
Token ID5256
Feature activation+4.01e-02

Pos logit contributions
 Interpret+0.624
utton+0.618
uni+0.615
uti+0.605
 injure+0.602
Neg logit contributions
 Stamp-0.299
sein-0.298
idium-0.286
Disclaimer-0.285
-0.267
Tokenrol
Token ID3225
Feature activation-1.34e+00

Pos logit contributions
 Interpret+0.140
utton+0.138
uni+0.136
 injure+0.134
 taunt+0.133
Neg logit contributions
 Stamp-0.149
sein-0.149
idium-0.146
Disclaimer-0.145
agate-0.139
Token Sp
Token ID1338
Feature activation-2.54e-01

Pos logit contributions
 Stamp+3.963
sein+3.765
 spinning+3.547
idium+3.536
 Alph+3.515
Neg logit contributions
utton-5.618
 Interpret-5.587
uni-5.517
 taunt-5.471
 injure-5.427
Tokenence
Token ID594
Feature activation-2.57e-01

Pos logit contributions
sein+0.656
 Stamp+0.647
idium+0.635
Disclaimer+0.624
oodle+0.588
Neg logit contributions
 Interpret-1.174
utton-1.151
 taunt-1.144
 injure-1.140
uni-1.140
Token Jr
Token ID7504
Feature activation+3.03e-01

Pos logit contributions
 Stamp+0.740
sein+0.736
idium+0.714
Disclaimer+0.706
+0.689
Neg logit contributions
 Interpret-1.100
utton-1.087
uni-1.081
uti-1.058
 taunt-1.057
Token
Token ID447
Feature activation-2.93e-01

Pos logit contributions
 Interpret+1.248
utton+1.231
uni+1.223
uti+1.197
 injure+1.196
Neg logit contributions
 Stamp-0.933
sein-0.932
idium-0.906
Disclaimer-0.902
-0.862
Token
Token ID247
Feature activation-2.81e-01

Pos logit contributions
sein+0.835
 Stamp+0.826
idium+0.803
Disclaimer+0.782
agate+0.768
Neg logit contributions
 Interpret-1.284
utton-1.248
uni-1.240
 injure-1.233
 taunt-1.227
Token7
Token ID22
Feature activation-9.68e-01

Pos logit contributions
 Stamp+3.071
sein+3.005
Disclaimer+2.961
idium+2.945
+2.823
Neg logit contributions
 Interpret-4.378
utton-4.285
uni-4.278
 injure-4.234
 taunt-4.221
Tokenff
Token ID487
Feature activation-1.08e+00

Pos logit contributions
 Stamp+3.217
sein+3.147
idium+3.084
Disclaimer+3.082
+2.941
Neg logit contributions
 Interpret-3.547
uni-3.511
utton-3.485
uti-3.455
 injure-3.423
Token3
Token ID18
Feature activation-9.16e-01

Pos logit contributions
 Stamp+3.226
sein+3.153
idium+3.096
Disclaimer+3.078
agate+2.953
Neg logit contributions
 Interpret-4.356
utton-4.259
uni-4.256
 injure-4.200
uti-4.199
Tokenc
Token ID66
Feature activation-1.26e+00

Pos logit contributions
 Stamp+3.048
sein+3.000
idium+2.963
Disclaimer+2.942
agate+2.830
Neg logit contributions
 Interpret-3.351
utton-3.310
uni-3.291
uti-3.241
 taunt-3.228
Token3
Token ID18
Feature activation-1.03e+00

Pos logit contributions
 Stamp+3.589
sein+3.540
idium+3.519
Disclaimer+3.473
agate+3.310
Neg logit contributions
 Interpret-5.155
utton-5.063
uni-5.013
 injure-4.986
 taunt-4.986
Tokenff
Token ID487
Feature activation-1.32e+00

Pos logit contributions
 Stamp+3.422
sein+3.375
idium+3.346
Disclaimer+3.309
agate+3.190
Neg logit contributions
 Interpret-3.727
utton-3.682
uni-3.652
uti-3.605
 taunt-3.604
Token0
Token ID15
Feature activation-8.58e-01

Pos logit contributions
 Stamp+3.886
sein+3.814
idium+3.780
Disclaimer+3.724
agate+3.604
Neg logit contributions
 Interpret-5.275
utton-5.132
uni-5.128
 injure-5.077
uti-5.067
Tokend
Token ID67
Feature activation-1.24e+00

Pos logit contributions
 Stamp+2.963
sein+2.883
idium+2.859
Disclaimer+2.838
+2.721
Neg logit contributions
 Interpret-3.091
utton-3.059
uni-3.054
 injure-2.992
 taunt-2.978
Token247
Token ID23753
Feature activation-9.39e-01

Pos logit contributions
 Stamp+3.533
sein+3.474
idium+3.427
Disclaimer+3.389
agate+3.297
Neg logit contributions
 Interpret-5.132
utton-5.022
uni-4.985
 injure-4.975
 taunt-4.951
Token409
Token ID29416
Feature activation-7.35e-01

Pos logit contributions
 Stamp+3.013
sein+2.970
idium+2.959
Disclaimer+2.903
agate+2.818
Neg logit contributions
 Interpret-3.571
uni-3.495
utton-3.474
 taunt-3.418
 injure-3.416
Tokenbd
Token ID17457
Feature activation-7.50e-01

Pos logit contributions
 Stamp+2.387
sein+2.356
idium+2.317
Disclaimer+2.302
agate+2.212
Neg logit contributions
 Interpret-2.821
uni-2.779
utton-2.756
uti-2.710
 injure-2.698
Token]
Token ID60
Feature activation-7.68e-01

Pos logit contributions
 Stamp+0.833
sein+0.819
Disclaimer+0.787
idium+0.778
agate+0.737
Neg logit contributions
 Interpret-1.711
utton-1.704
uni-1.693
uti-1.681
 injure-1.674
Token::
Token ID3712
Feature activation-9.40e-01

Pos logit contributions
 Stamp+1.693
sein+1.668
Disclaimer+1.642
idium+1.567
agate+1.483
Neg logit contributions
 Interpret-3.814
utton-3.789
uni-3.775
 injure-3.745
 taunt-3.718
TokenFrom
Token ID4863
Feature activation-3.13e-01

Pos logit contributions
Disclaimer+2.387
 Stamp+2.378
sein+2.358
idium+2.265
+2.124
Neg logit contributions
utton-4.330
 Interpret-4.301
 injure-4.292
uti-4.272
 taunt-4.241
TokenAr
Token ID3163
Feature activation-8.14e-01

Pos logit contributions
 Stamp+0.820
sein+0.819
Disclaimer+0.799
idium+0.786
agate+0.747
Neg logit contributions
 Interpret-1.429
utton-1.419
uni-1.400
 injure-1.395
 taunt-1.390
Tokengb
Token ID22296
Feature activation-8.79e-02

Pos logit contributions
sein+3.562
 Stamp+3.494
Disclaimer+3.477
idium+3.474
agate+3.361
Neg logit contributions
 Interpret-2.313
utton-2.232
 injure-2.222
 taunt-2.199
uni-2.172
Token($
Token ID16763
Feature activation-1.30e+00

Pos logit contributions
 Stamp+0.264
sein+0.264
idium+0.256
Disclaimer+0.255
agate+0.242
Neg logit contributions
 Interpret-0.369
utton-0.365
uni-0.362
uti-0.355
 injure-0.354
TokenFC
Token ID4851
Feature activation-6.47e-01

Pos logit contributions
Disclaimer+4.164
sein+4.113
 Stamp+4.063
idium+4.033
Merit+3.909
Neg logit contributions
 Interpret-5.006
 injure-4.955
uti-4.836
utton-4.821
 taunt-4.816
Token1
Token ID16
Feature activation-9.67e-01

Pos logit contributions
 Stamp+1.783
sein+1.771
Disclaimer+1.717
idium+1.711
agate+1.607
Neg logit contributions
 Interpret-2.869
utton-2.830
uni-2.798
 injure-2.788
 taunt-2.769
Token[
Token ID58
Feature activation-6.86e-01

Pos logit contributions
 Stamp+2.374
sein+2.351
Disclaimer+2.277
idium+2.264
+2.133
Neg logit contributions
 Interpret-4.512
utton-4.493
uni-4.420
 injure-4.391
 taunt-4.360
Token0
Token ID15
Feature activation-4.87e-01

Pos logit contributions
 Stamp+1.671
sein+1.634
Disclaimer+1.628
idium+1.587
agate+1.484
Neg logit contributions
 Interpret-3.238
utton-3.192
 injure-3.174
 taunt-3.167
uti-3.142
Token],
Token ID4357
Feature activation-8.88e-01

Pos logit contributions
 Stamp+0.993
sein+0.989
idium+0.939
Disclaimer+0.934
agate+0.867
Neg logit contributions
 Interpret-2.502
utton-2.476
uni-2.439
 injure-2.427
uti-2.426
Token was
Token ID373
Feature activation+1.09e-01

Pos logit contributions
 Stamp+0.595
sein+0.590
idium+0.573
Disclaimer+0.568
+0.538
Neg logit contributions
 Interpret-0.968
utton-0.965
uni-0.957
uti-0.946
 injure-0.935
Token NASA
Token ID8884
Feature activation+9.87e-02

Pos logit contributions
 Interpret+0.444
utton+0.439
uni+0.436
uti+0.428
 injure+0.426
Neg logit contributions
 Stamp-0.337
sein-0.337
idium-0.327
Disclaimer-0.326
-0.310
Token
Token ID447
Feature activation+5.77e-02

Pos logit contributions
 Interpret+0.436
utton+0.432
uni+0.428
uti+0.423
 injure+0.422
Neg logit contributions
 Stamp-0.273
sein-0.272
idium-0.265
Disclaimer-0.264
-0.251
Token
Token ID247
Feature activation+1.74e-01

Pos logit contributions
 Interpret+0.234
utton+0.230
uni+0.228
uti+0.224
 injure+0.224
Neg logit contributions
sein-0.182
 Stamp-0.182
idium-0.177
Disclaimer-0.176
-0.168
Tokens
Token ID82
Feature activation-2.66e-01

Pos logit contributions
 Interpret+0.671
utton+0.666
uni+0.661
uti+0.645
 taunt+0.639
Neg logit contributions
 Stamp-0.581
sein-0.574
idium-0.564
Disclaimer-0.560
agate-0.535
Token seventh
Token ID14024
Feature activation-1.30e+00

Pos logit contributions
 Stamp+0.888
sein+0.885
idium+0.862
Disclaimer+0.858
+0.819
Neg logit contributions
 Interpret-1.015
utton-1.009
uni-0.990
uti-0.986
 injure-0.976
Token consecutive
Token ID12785
Feature activation-8.57e-02

Pos logit contributions
 Stamp+2.142
sein+2.125
Disclaimer+1.969
idium+1.967
 spinning+1.850
Neg logit contributions
 Interpret-7.092
utton-7.005
uti-6.987
uni-6.961
 injure-6.893
Token lunar
Token ID25572
Feature activation+1.32e-01

Pos logit contributions
 Stamp+0.333
sein+0.333
idium+0.325
Disclaimer+0.324
agate+0.312
Neg logit contributions
 Interpret-0.284
utton-0.280
uni-0.277
uti-0.270
 injure-0.269
Token mission
Token ID4365
Feature activation+3.31e-03

Pos logit contributions
 Interpret+0.411
utton+0.403
uni+0.399
 taunt+0.392
 injure+0.389
Neg logit contributions
 Stamp-0.536
sein-0.534
idium-0.526
Disclaimer-0.524
agate-0.506
Token failure
Token ID5287
Feature activation-5.46e-01

Pos logit contributions
 Interpret+0.011
utton+0.011
uni+0.010
uti+0.010
 injure+0.010
Neg logit contributions
 Stamp-0.013
sein-0.013
idium-0.013
Disclaimer-0.013
agate-0.012
Token.
Token ID13
Feature activation-1.77e-01

Pos logit contributions
 Stamp+1.552
sein+1.547
Disclaimer+1.498
idium+1.491
+1.423
Neg logit contributions
 Interpret-2.344
utton-2.318
uni-2.313
uti-2.291
 taunt-2.256
Token today
Token ID1909
Feature activation-1.61e-01

Pos logit contributions
 Interpret+1.148
utton+1.130
uni+1.120
uti+1.101
 injure+1.098
Neg logit contributions
sein-1.038
 Stamp-1.037
Disclaimer-1.009
idium-1.009
-0.967
Token,
Token ID11
Feature activation-2.39e-02

Pos logit contributions
 Stamp+0.483
sein+0.481
idium+0.466
Disclaimer+0.465
agate+0.441
Neg logit contributions
 Interpret-0.680
utton-0.672
uni-0.668
uti-0.654
 injure-0.652
Token C
Token ID327
Feature activation-2.04e-03

Pos logit contributions
 Stamp+0.075
sein+0.074
idium+0.072
Disclaimer+0.072
agate+0.068
Neg logit contributions
 Interpret-0.098
utton-0.097
uni-0.096
uti-0.094
 injure-0.094
TokenNET
Token ID12884
Feature activation-7.13e-01

Pos logit contributions
 Stamp+0.006
sein+0.006
idium+0.006
Disclaimer+0.006
+0.005
Neg logit contributions
 Interpret-0.009
utton-0.009
uni-0.009
uti-0.008
 injure-0.008
Token has
Token ID468
Feature activation-7.17e-01

Pos logit contributions
 Stamp+2.441
idium+2.367
sein+2.364
Disclaimer+2.338
 spinning+2.216
Neg logit contributions
 Interpret-2.659
utton-2.642
uti-2.601
uni-2.584
 taunt-2.568
Token learned
Token ID4499
Feature activation-1.29e+00

Pos logit contributions
 Stamp+2.294
sein+2.244
idium+2.227
Disclaimer+2.214
 spinning+2.101
Neg logit contributions
 Interpret-2.875
utton-2.847
uni-2.783
uti-2.773
 injure-2.717
Token.
Token ID13
Feature activation-1.15e-01

Pos logit contributions
 Stamp+3.201
idium+3.127
Disclaimer+3.064
sein+3.016
 spinning+2.894
Neg logit contributions
 Interpret-5.872
utton-5.852
uti-5.775
uni-5.721
 injure-5.662
Token
Token ID198
Feature activation-1.83e-01

Pos logit contributions
 Stamp+0.394
sein+0.389
Disclaimer+0.381
idium+0.377
agate+0.360
Neg logit contributions
utton-0.430
 Interpret-0.430
uni-0.427
 injure-0.422
 taunt-0.419
Token
Token ID198
Feature activation-1.60e-01

Pos logit contributions
 Stamp+0.721
sein+0.720
Disclaimer+0.704
idium+0.702
+0.676
Neg logit contributions
 Interpret-0.593
utton-0.585
uni-0.579
uti-0.565
 injure-0.565
TokenReg
Token ID8081
Feature activation-1.94e-01

Pos logit contributions
sein+0.492
 Stamp+0.490
Disclaimer+0.484
idium+0.478
+0.455
Neg logit contributions
 Interpret-0.662
utton-0.653
uni-0.646
 injure-0.641
 taunt-0.638
Tokenulators
Token ID24325
Feature activation+6.07e-02

Pos logit contributions
sein+0.631
 Stamp+0.630
idium+0.625
Disclaimer+0.608
agate+0.586
Neg logit contributions
 Interpret-0.760
utton-0.750
uni-0.747
 injure-0.732
 taunt-0.731
Token$
Token ID3
Feature activation-8.35e-01

Pos logit contributions
 Stamp+1.194
sein+1.174
Disclaimer+1.150
idium+1.133
agate+1.085
Neg logit contributions
 Interpret-1.516
utton-1.510
uni-1.499
uti-1.482
 injure-1.477
TokenFS
Token ID10652
Feature activation-7.48e-01

Pos logit contributions
sein+2.717
Disclaimer+2.692
 Stamp+2.688
idium+2.562
agate+2.485
Neg logit contributions
 Interpret-3.220
 injure-3.213
utton-3.139
 taunt-3.131
uni-3.096
Token1
Token ID16
Feature activation-8.84e-01

Pos logit contributions
 Stamp+1.977
sein+1.938
Disclaimer+1.906
idium+1.854
agate+1.775
Neg logit contributions
 Interpret-3.396
utton-3.331
 injure-3.317
uni-3.306
uti-3.284
Token,[
Token ID17414
Feature activation-8.04e-01

Pos logit contributions
 Stamp+2.342
sein+2.281
Disclaimer+2.234
idium+2.189
agate+2.087
Neg logit contributions
utton-3.980
 Interpret-3.971
uni-3.930
uti-3.868
 injure-3.861
TokenSystem
Token ID11964
Feature activation-7.34e-01

Pos logit contributions
 Stamp+2.401
sein+2.358
Disclaimer+2.343
idium+2.272
agate+2.173
Neg logit contributions
 Interpret-3.336
utton-3.295
 injure-3.278
 taunt-3.264
uti-3.260
Token.
Token ID13
Feature activation-1.29e+00

Pos logit contributions
 Stamp+2.168
sein+2.157
Disclaimer+2.130
idium+2.060
agate+1.968
Neg logit contributions
 Interpret-3.075
utton-3.056
 injure-3.036
uni-3.030
 taunt-3.000
TokenDraw
Token ID25302
Feature activation-4.95e-01

Pos logit contributions
Disclaimer+4.450
 Stamp+4.402
sein+4.347
idium+4.111
+4.072
Neg logit contributions
 injure-4.603
uti-4.572
utton-4.557
uni-4.556
 taunt-4.550
Tokening
Token ID278
Feature activation-5.12e-01

Pos logit contributions
 Stamp+1.514
sein+1.514
Disclaimer+1.465
idium+1.464
agate+1.394
Neg logit contributions
 Interpret-2.047
utton-2.022
uni-2.008
 injure-1.972
uti-1.970
Token.
Token ID13
Feature activation-1.10e+00

Pos logit contributions
 Stamp+1.588
sein+1.572
Disclaimer+1.546
idium+1.520
+1.456
Neg logit contributions
 Interpret-2.097
utton-2.077
uni-2.064
 injure-2.035
uti-2.032
TokenFont
Token ID23252
Feature activation-3.83e-01

Pos logit contributions
Disclaimer+2.996
 Stamp+2.961
sein+2.889
idium+2.772
+2.654
Neg logit contributions
 injure-4.820
 Interpret-4.786
utton-4.742
uti-4.728
 taunt-4.712
TokenStyle
Token ID21466
Feature activation-4.63e-01

Pos logit contributions
 Stamp+1.052
sein+1.041
Disclaimer+1.013
idium+0.999
agate+0.953
Neg logit contributions
 Interpret-1.709
utton-1.689
uni-1.682
uti-1.660
 injure-1.658
Token numbers
Token ID3146
Feature activation+9.45e-03

Pos logit contributions
 Interpret+0.451
utton+0.446
uni+0.443
uti+0.433
 injure+0.431
Neg logit contributions
 Stamp-0.411
sein-0.408
idium-0.398
Disclaimer-0.397
agate-0.379
Token are
Token ID389
Feature activation+2.39e-01

Pos logit contributions
 Interpret+0.034
utton+0.033
uni+0.033
uti+0.032
 injure+0.032
Neg logit contributions
 Stamp-0.034
sein-0.034
idium-0.033
Disclaimer-0.033
agate-0.032
Token growing
Token ID3957
Feature activation-5.18e-02

Pos logit contributions
 Interpret+0.894
utton+0.881
uni+0.875
 taunt+0.862
uti+0.861
Neg logit contributions
sein-0.826
 Stamp-0.820
idium-0.809
Disclaimer-0.801
-0.772
Token through
Token ID832
Feature activation+2.76e-01

Pos logit contributions
sein+0.158
 Stamp+0.158
idium+0.153
Disclaimer+0.153
+0.146
Neg logit contributions
 Interpret-0.214
utton-0.212
uni-0.210
uti-0.206
 injure-0.206
Token word
Token ID1573
Feature activation-3.63e-01

Pos logit contributions
 Interpret+1.040
utton+1.024
uni+1.015
uti+0.994
 injure+0.993
Neg logit contributions
sein-0.953
 Stamp-0.946
idium-0.928
Disclaimer-0.920
agate-0.881
Token of
Token ID286
Feature activation-1.29e+00

Pos logit contributions
 Stamp+1.275
sein+1.250
idium+1.225
Disclaimer+1.218
 spinning+1.192
Neg logit contributions
uti-1.340
uni-1.339
 Interpret-1.335
utton-1.316
 injure-1.281
Token mouth
Token ID5422
Feature activation+1.90e-01

Pos logit contributions
 Stamp+4.116
Disclaimer+4.035
sein+3.985
 spinning+3.973
 instantaneous+3.764
Neg logit contributions
uni-4.980
uti-4.904
utton-4.852
 Interpret-4.821
 eleph-4.742
Token and
Token ID290
Feature activation-1.56e-01

Pos logit contributions
 Interpret+0.772
utton+0.764
uni+0.755
uti+0.741
 taunt+0.741
Neg logit contributions
sein-0.596
 Stamp-0.594
idium-0.579
Disclaimer-0.577
agate-0.549
Token social
Token ID1919
Feature activation-8.08e-02

Pos logit contributions
 Stamp+0.571
sein+0.562
Disclaimer+0.549
idium+0.548
+0.527
Neg logit contributions
 Interpret-0.554
utton-0.550
uni-0.546
uti-0.534
 injure-0.529
Token media
Token ID2056
Feature activation+4.27e-02

Pos logit contributions
 Stamp+0.299
sein+0.296
Disclaimer+0.291
idium+0.288
+0.280
Neg logit contributions
 Interpret-0.284
utton-0.278
uni-0.277
uti-0.273
 injure-0.268
Token,
Token ID11
Feature activation-2.40e-01

Pos logit contributions
 Interpret+0.174
utton+0.172
uni+0.171
uti+0.168
 injure+0.167
Neg logit contributions
 Stamp-0.133
sein-0.132
idium-0.128
Disclaimer-0.128
-0.122
Token.
Token ID13
Feature activation+1.11e-01

Pos logit contributions
 Interpret+1.242
utton+1.226
uni+1.213
uti+1.188
 injure+1.187
Neg logit contributions
 Stamp-1.075
sein-1.071
idium-1.044
Disclaimer-1.040
agate-0.996
Token
Token ID447
Feature activation+6.58e-02

Pos logit contributions
 Interpret+0.471
utton+0.467
uni+0.464
uti+0.455
 injure+0.455
Neg logit contributions
 Stamp-0.326
sein-0.325
Disclaimer-0.315
idium-0.315
-0.299
Token
Token ID251
Feature activation+8.23e-02

Pos logit contributions
 Interpret+0.170
utton+0.169
uni+0.166
uti+0.161
 injure+0.161
Neg logit contributions
sein-0.302
 Stamp-0.302
Disclaimer-0.296
idium-0.296
-0.287
Token Even
Token ID3412
Feature activation-2.06e-01

Pos logit contributions
 Interpret+0.291
utton+0.287
uni+0.284
uti+0.278
 injure+0.277
Neg logit contributions
 Stamp-0.302
sein-0.301
idium-0.294
Disclaimer-0.293
agate-0.281
Token Darrell
Token ID44882
Feature activation-6.81e-01

Pos logit contributions
 Stamp+0.665
sein+0.662
idium+0.644
Disclaimer+0.642
+0.612
Neg logit contributions
 Interpret-0.819
utton-0.810
uni-0.805
uti-0.789
 injure-0.786
Token Duff
Token ID50209
Feature activation-1.28e+00

Pos logit contributions
 Stamp+1.172
sein+1.120
idium+1.038
Disclaimer+1.000
+0.993
Neg logit contributions
 Interpret-3.698
uni-3.692
utton-3.681
uti-3.621
 injure-3.595
Tokenie
Token ID494
Feature activation-8.75e-01

Pos logit contributions
sein+4.744
 Stamp+4.684
idium+4.588
Disclaimer+4.543
agate+4.477
Neg logit contributions
 Interpret-4.335
 eleph-4.212
uni-4.186
 taunt-4.177
utton-4.173
Token,
Token ID11
Feature activation-2.73e-01

Pos logit contributions
sein+2.536
 Stamp+2.529
idium+2.448
Disclaimer+2.425
 spinning+2.337
Neg logit contributions
 Interpret-3.638
utton-3.631
uni-3.603
 injure-3.544
uti-3.534
Token the
Token ID262
Feature activation-1.56e-01

Pos logit contributions
 Stamp+0.938
sein+0.930
Disclaimer+0.909
idium+0.907
+0.869
Neg logit contributions
 Interpret-1.022
utton-1.009
uni-1.006
uti-0.986
 injure-0.980
Token sole
Token ID6195
Feature activation-4.25e-01

Pos logit contributions
 Stamp+0.536
sein+0.531
idium+0.516
Disclaimer+0.516
+0.490
Neg logit contributions
 Interpret-0.584
utton-0.579
uni-0.574
uti-0.565
 injure-0.560
Token economist
Token ID17646
Feature activation-1.88e-01

Pos logit contributions
 Stamp+1.454
sein+1.448
idium+1.408
Disclaimer+1.404
+1.340
Neg logit contributions
 Interpret-1.594
utton-1.579
uni-1.565
uti-1.537
 injure-1.529
Token disp
Token ID4596
Feature activation-2.68e-01

Pos logit contributions
 Stamp+0.729
sein+0.725
idium+0.700
Disclaimer+0.700
agate+0.669
Neg logit contributions
 Interpret-0.911
utton-0.911
uni-0.897
 injure-0.884
 taunt-0.876
Tokenersion
Token ID6900
Feature activation-1.44e-01

Pos logit contributions
sein+1.085
 Stamp+1.080
idium+1.077
Disclaimer+1.049
agate+1.028
Neg logit contributions
 Interpret-0.842
utton-0.811
uni-0.810
 injure-0.805
 taunt-0.804
Token as
Token ID355
Feature activation+2.80e-02

Pos logit contributions
 Stamp+0.470
sein+0.469
idium+0.457
Disclaimer+0.455
agate+0.433
Neg logit contributions
 Interpret-0.564
utton-0.556
uni-0.552
 injure-0.542
uti-0.540
Token such
Token ID884
Feature activation+2.82e-01

Pos logit contributions
 Interpret+0.109
utton+0.108
uni+0.107
 injure+0.105
uti+0.105
Neg logit contributions
 Stamp-0.092
sein-0.092
idium-0.090
Disclaimer-0.089
agate-0.085
Token!
Token ID0
Feature activation+4.48e-03

Pos logit contributions
 Interpret+1.102
utton+1.089
uni+1.080
uti+1.058
 injure+1.055
Neg logit contributions
 Stamp-0.931
sein-0.930
idium-0.904
Disclaimer-0.902
agate-0.861
TokenOnt
Token ID45984
Feature activation-1.28e+00

Pos logit contributions
 Interpret+0.017
utton+0.017
uni+0.017
 injure+0.017
uti+0.017
Neg logit contributions
 Stamp-0.015
sein-0.015
Disclaimer-0.014
idium-0.014
agate-0.014
Tokenology
Token ID1435
Feature activation-2.00e-01

Pos logit contributions
sein+3.960
idium+3.896
 Stamp+3.876
Disclaimer+3.712
agate+3.663
Neg logit contributions
 Interpret-5.023
utton-4.855
uni-4.832
 injure-4.797
 taunt-4.785
Token returned
Token ID4504
Feature activation+3.25e-01

Pos logit contributions
 Stamp+0.656
sein+0.654
idium+0.637
Disclaimer+0.633
agate+0.604
Neg logit contributions
 Interpret-0.782
utton-0.778
uni-0.769
uti-0.755
 injure-0.754
Token to
Token ID284
Feature activation+2.77e-01

Pos logit contributions
 Interpret+1.296
utton+1.281
uni+1.274
uti+1.247
 taunt+1.245
Neg logit contributions
sein-1.043
 Stamp-1.038
idium-1.013
Disclaimer-1.006
agate-0.965
Token the
Token ID262
Feature activation+1.16e-01

Pos logit contributions
 Interpret+1.046
utton+1.035
uni+1.023
uti+1.003
 injure+1.001
Neg logit contributions
 Stamp-0.947
sein-0.945
idium-0.918
Disclaimer-0.917
-0.877
Token Meg
Token ID8336
Feature activation+3.49e-02

Pos logit contributions
 Interpret+0.422
utton+0.418
uni+0.413
uti+0.404
 injure+0.404
Neg logit contributions
 Stamp-0.415
sein-0.414
idium-0.402
Disclaimer-0.401
agate-0.384
TokenV
Token ID53
Feature activation+2.48e-01

Pos logit contributions
Disclaimer+0.975
sein+0.962
 Stamp+0.946
idium+0.926
+0.905
Neg logit contributions
 Interpret-1.203
utton-1.189
 injure-1.178
 taunt-1.172
uni-1.165
TokenIO
Token ID9399
Feature activation-2.31e-01

Pos logit contributions
 Interpret+1.023
utton+1.015
uni+1.009
uti+0.988
 injure+0.983
Neg logit contributions
sein-0.758
 Stamp-0.757
idium-0.732
Disclaimer-0.731
-0.695
TokenLET
Token ID28882
Feature activation+2.20e-01

Pos logit contributions
 Stamp+0.635
sein+0.625
Disclaimer+0.608
idium+0.604
agate+0.573
Neg logit contributions
 Interpret-1.029
utton-1.028
uni-1.024
uti-0.996
 taunt-0.993
Token,
Token ID11
Feature activation-2.80e-01

Pos logit contributions
 Interpret+0.803
utton+0.803
uni+0.790
 taunt+0.777
uti+0.777
Neg logit contributions
 Stamp-0.779
sein-0.770
Disclaimer-0.753
idium-0.747
-0.718
Token La
Token ID4689
Feature activation-1.01e-01

Pos logit contributions
 Stamp+0.914
sein+0.899
Disclaimer+0.885
idium+0.875
+0.833
Neg logit contributions
utton-1.111
 Interpret-1.095
uni-1.090
 taunt-1.067
uti-1.066
Token.--
Token ID9816
Feature activation-1.27e+00

Pos logit contributions
 Stamp+0.294
Disclaimer+0.287
sein+0.287
idium+0.281
+0.268
Neg logit contributions
 Interpret-0.428
utton-0.428
uni-0.421
 taunt-0.416
 injure-0.412
Token A
Token ID317
Feature activation-1.39e-01

Pos logit contributions
 Stamp+3.954
Disclaimer+3.953
sein+3.749
+3.673
idium+3.617
Neg logit contributions
utton-5.047
 injure-4.926
uni-4.853
 taunt-4.852
uti-4.850
Token W
Token ID370
Feature activation+1.09e-03

Pos logit contributions
 Stamp+0.536
sein+0.524
idium+0.514
Disclaimer+0.512
 spinning+0.499
Neg logit contributions
 Interpret-0.462
utton-0.460
uni-0.453
uti-0.445
 injure-0.441
TokenG
Token ID38
Feature activation+3.23e-02

Pos logit contributions
 Interpret+0.004
utton+0.004
uni+0.004
 taunt+0.004
 injure+0.004
Neg logit contributions
 Stamp-0.004
sein-0.004
idium-0.004
Disclaimer-0.004
-0.003
TokenNO
Token ID15285
Feature activation-2.52e-01

Pos logit contributions
 Interpret+0.107
utton+0.106
uni+0.105
uti+0.103
 taunt+0.103
Neg logit contributions
 Stamp-0.125
sein-0.124
Disclaimer-0.122
idium-0.121
-0.116
Token viewer
Token ID19091
Feature activation+1.57e-01

Pos logit contributions
 Stamp+0.946
sein+0.930
idium+0.909
Disclaimer+0.908
+0.879
Neg logit contributions
utton-0.857
 Interpret-0.853
uni-0.847
uti-0.831
 taunt-0.821
Token and
Token ID290
Feature activation-1.98e-01

Pos logit contributions
 Stamp+0.330
sein+0.326
idium+0.319
Disclaimer+0.317
agate+0.303
Neg logit contributions
 Interpret-0.430
utton-0.426
uni-0.422
uti-0.415
 injure-0.414
Token then
Token ID788
Feature activation+1.02e-01

Pos logit contributions
 Stamp+0.642
sein+0.638
idium+0.622
Disclaimer+0.618
agate+0.593
Neg logit contributions
 Interpret-0.782
utton-0.777
uni-0.768
uti-0.755
 injure-0.751
Token manually
Token ID14500
Feature activation+6.52e-02

Pos logit contributions
 Interpret+0.406
utton+0.400
uni+0.397
 injure+0.389
uti+0.389
Neg logit contributions
sein-0.329
 Stamp-0.329
idium-0.319
Disclaimer-0.319
-0.304
Token split
Token ID6626
Feature activation-1.10e-01

Pos logit contributions
 Interpret+0.265
utton+0.262
uni+0.260
uti+0.255
 injure+0.254
Neg logit contributions
 Stamp-0.205
sein-0.205
idium-0.199
Disclaimer-0.198
agate-0.189
Token them
Token ID606
Feature activation-6.03e-01

Pos logit contributions
 Stamp+0.363
sein+0.360
idium+0.351
Disclaimer+0.349
+0.336
Neg logit contributions
 Interpret-0.429
utton-0.424
uni-0.422
uti-0.414
 injure-0.412
Token into
Token ID656
Feature activation-1.27e+00

Pos logit contributions
 Stamp+1.912
sein+1.908
idium+1.857
Disclaimer+1.834
 spinning+1.786
Neg logit contributions
 Interpret-2.397
utton-2.380
uni-2.357
uti-2.301
 injure-2.301
Token MP
Token ID4904
Feature activation-3.94e-01

Pos logit contributions
 Stamp+4.169
sein+3.968
 spinning+3.899
idium+3.898
 instantaneous+3.812
Neg logit contributions
uni-4.802
utton-4.772
uti-4.704
 Interpret-4.703
 injure-4.600
Token3
Token ID18
Feature activation-3.16e-03

Pos logit contributions
 Stamp+1.049
sein+1.039
idium+0.999
Disclaimer+0.994
agate+0.950
Neg logit contributions
 Interpret-1.788
utton-1.772
uni-1.754
uti-1.731
 taunt-1.720
Tokens
Token ID82
Feature activation-3.54e-01

Pos logit contributions
 Stamp+0.010
sein+0.010
idium+0.010
Disclaimer+0.010
+0.009
Neg logit contributions
 Interpret-0.013
utton-0.013
uni-0.013
uti-0.012
 injure-0.012
Token
Token ID960
Feature activation-3.33e-01

Pos logit contributions
 Stamp+1.082
sein+1.071
idium+1.043
Disclaimer+1.038
+0.993
Neg logit contributions
 Interpret-1.466
utton-1.453
uni-1.440
 injure-1.412
uti-1.409
Tokenpse
Token ID7752
Feature activation-2.59e-01

Pos logit contributions
sein+1.223
 Stamp+1.223
Disclaimer+1.194
idium+1.189
+1.148
Neg logit contributions
 Interpret-1.172
utton-1.151
uni-1.133
 taunt-1.124
 injure-1.122
Token16
Token ID1433
Feature activation-5.82e-01

Pos logit contributions
 Stamp+1.980
sein+1.961
idium+1.922
Disclaimer+1.914
+1.831
Neg logit contributions
 Interpret-2.928
utton-2.885
uni-2.856
 injure-2.824
 taunt-2.819
Tokend
Token ID67
Feature activation-1.06e+00

Pos logit contributions
 Stamp+1.899
sein+1.886
idium+1.832
Disclaimer+1.830
+1.752
Neg logit contributions
 Interpret-2.242
uni-2.206
utton-2.204
uti-2.170
 injure-2.157
Token7
Token ID22
Feature activation-9.68e-01

Pos logit contributions
 Stamp+3.071
sein+3.005
Disclaimer+2.961
idium+2.945
+2.823
Neg logit contributions
 Interpret-4.378
utton-4.285
uni-4.278
 injure-4.234
 taunt-4.221
Tokenff
Token ID487
Feature activation-1.08e+00

Pos logit contributions
 Stamp+3.217
sein+3.147
idium+3.084
Disclaimer+3.082
+2.941
Neg logit contributions
 Interpret-3.547
uni-3.511
utton-3.485
uti-3.455
 injure-3.423
Token3
Token ID18
Feature activation-9.16e-01

Pos logit contributions
 Stamp+3.226
sein+3.153
idium+3.096
Disclaimer+3.078
agate+2.953
Neg logit contributions
 Interpret-4.356
utton-4.259
uni-4.256
 injure-4.200
uti-4.199
Tokenc
Token ID66
Feature activation-1.26e+00

Pos logit contributions
 Stamp+3.048
sein+3.000
idium+2.963
Disclaimer+2.942
agate+2.830
Neg logit contributions
 Interpret-3.351
utton-3.310
uni-3.291
uti-3.241
 taunt-3.228
Token3
Token ID18
Feature activation-1.03e+00

Pos logit contributions
 Stamp+3.589
sein+3.540
idium+3.519
Disclaimer+3.473
agate+3.310
Neg logit contributions
 Interpret-5.155
utton-5.063
uni-5.013
 injure-4.986
 taunt-4.986
Tokenff
Token ID487
Feature activation-1.32e+00

Pos logit contributions
 Stamp+3.422
sein+3.375
idium+3.346
Disclaimer+3.309
agate+3.190
Neg logit contributions
 Interpret-3.727
utton-3.682
uni-3.652
uti-3.605
 taunt-3.604
Token0
Token ID15
Feature activation-8.58e-01

Pos logit contributions
 Stamp+3.886
sein+3.814
idium+3.780
Disclaimer+3.724
agate+3.604
Neg logit contributions
 Interpret-5.275
utton-5.132
uni-5.128
 injure-5.077
uti-5.067
Tokend
Token ID67
Feature activation-1.24e+00

Pos logit contributions
 Stamp+2.963
sein+2.883
idium+2.859
Disclaimer+2.838
+2.721
Neg logit contributions
 Interpret-3.091
utton-3.059
uni-3.054
 injure-2.992
 taunt-2.978
Token247
Token ID23753
Feature activation-9.39e-01

Pos logit contributions
 Stamp+3.533
sein+3.474
idium+3.427
Disclaimer+3.389
agate+3.297
Neg logit contributions
 Interpret-5.132
utton-5.022
uni-4.985
 injure-4.975
 taunt-4.951
Token3
Token ID18
Feature activation-9.16e-01

Pos logit contributions
 Stamp+3.226
sein+3.153
idium+3.096
Disclaimer+3.078
agate+2.953
Neg logit contributions
 Interpret-4.356
utton-4.259
uni-4.256
 injure-4.200
uti-4.199
Tokenc
Token ID66
Feature activation-1.26e+00

Pos logit contributions
 Stamp+3.048
sein+3.000
idium+2.963
Disclaimer+2.942
agate+2.830
Neg logit contributions
 Interpret-3.351
utton-3.310
uni-3.291
uti-3.241
 taunt-3.228
Token3
Token ID18
Feature activation-1.03e+00

Pos logit contributions
 Stamp+3.589
sein+3.540
idium+3.519
Disclaimer+3.473
agate+3.310
Neg logit contributions
 Interpret-5.155
utton-5.063
uni-5.013
 injure-4.986
 taunt-4.986
Tokenff
Token ID487
Feature activation-1.32e+00

Pos logit contributions
 Stamp+3.422
sein+3.375
idium+3.346
Disclaimer+3.309
agate+3.190
Neg logit contributions
 Interpret-3.727
utton-3.682
uni-3.652
uti-3.605
 taunt-3.604
Token0
Token ID15
Feature activation-8.58e-01

Pos logit contributions
 Stamp+3.886
sein+3.814
idium+3.780
Disclaimer+3.724
agate+3.604
Neg logit contributions
 Interpret-5.275
utton-5.132
uni-5.128
 injure-5.077
uti-5.067
Tokend
Token ID67
Feature activation-1.24e+00

Pos logit contributions
 Stamp+2.963
sein+2.883
idium+2.859
Disclaimer+2.838
+2.721
Neg logit contributions
 Interpret-3.091
utton-3.059
uni-3.054
 injure-2.992
 taunt-2.978
Token247
Token ID23753
Feature activation-9.39e-01

Pos logit contributions
 Stamp+3.533
sein+3.474
idium+3.427
Disclaimer+3.389
agate+3.297
Neg logit contributions
 Interpret-5.132
utton-5.022
uni-4.985
 injure-4.975
 taunt-4.951
Token409
Token ID29416
Feature activation-7.35e-01

Pos logit contributions
 Stamp+3.013
sein+2.970
idium+2.959
Disclaimer+2.903
agate+2.818
Neg logit contributions
 Interpret-3.571
uni-3.495
utton-3.474
 taunt-3.418
 injure-3.416
Tokenbd
Token ID17457
Feature activation-7.50e-01

Pos logit contributions
 Stamp+2.387
sein+2.356
idium+2.317
Disclaimer+2.302
agate+2.212
Neg logit contributions
 Interpret-2.821
uni-2.779
utton-2.756
uti-2.710
 injure-2.698
Token28
Token ID2078
Feature activation-6.97e-01

Pos logit contributions
 Stamp+2.186
sein+2.157
idium+2.110
Disclaimer+2.110
agate+2.012
Neg logit contributions
 Interpret-3.129
utton-3.083
uni-3.069
 injure-3.027
 taunt-3.017
Tokenf
Token ID69
Feature activation-7.03e-01

Pos logit contributions
 Stamp+2.240
sein+2.230
idium+2.178
Disclaimer+2.166
agate+2.076
Neg logit contributions
 Interpret-2.693
uni-2.646
utton-2.645
uti-2.594
 injure-2.590
Token
Token ID198
Feature activation-1.51e-01

Pos logit contributions
 Stamp+0.652
sein+0.646
Disclaimer+0.637
idium+0.624
+0.606
Neg logit contributions
utton-0.649
 Interpret-0.645
uni-0.642
 injure-0.631
uti-0.627
Token
Token ID198
Feature activation-1.74e-01

Pos logit contributions
 Stamp+0.596
sein+0.596
Disclaimer+0.582
idium+0.581
+0.559
Neg logit contributions
 Interpret-0.491
utton-0.484
uni-0.479
 injure-0.468
uti-0.468
TokenGet
Token ID3855
Feature activation+5.67e-02

Pos logit contributions
sein+0.546
Disclaimer+0.541
 Stamp+0.539
idium+0.528
+0.508
Neg logit contributions
 Interpret-0.705
utton-0.692
uni-0.685
 injure-0.680
 taunt-0.677
Token The
Token ID383
Feature activation-7.50e-01

Pos logit contributions
 Interpret+0.247
utton+0.243
uni+0.242
 taunt+0.238
 injure+0.238
Neg logit contributions
sein-0.163
 Stamp-0.161
idium-0.158
Disclaimer-0.153
agate-0.147
Token Times
Token ID3782
Feature activation-6.11e-01

Pos logit contributions
 Stamp+2.236
sein+2.172
Disclaimer+2.151
idium+2.107
agate+2.010
Neg logit contributions
utton-3.162
uni-3.153
 Interpret-3.139
uti-3.082
 injure-3.059
Token of
Token ID286
Feature activation-1.23e+00

Pos logit contributions
 Stamp+2.246
Disclaimer+2.190
sein+2.188
idium+2.163
+2.077
Neg logit contributions
utton-2.134
uni-2.117
 Interpret-2.109
 injure-2.093
uti-2.058
Token Israel
Token ID2692
Feature activation-3.24e-01

Pos logit contributions
 Stamp+2.722
sein+2.652
Disclaimer+2.569
idium+2.538
agate+2.359
Neg logit contributions
 Interpret-6.035
utton-6.030
uni-5.964
 injure-5.925
 taunt-5.901
Token's
Token ID338
Feature activation-6.75e-01

Pos logit contributions
sein+0.960
 Stamp+0.958
Disclaimer+0.927
idium+0.926
+0.893
Neg logit contributions
 Interpret-1.376
utton-1.353
uni-1.344
 taunt-1.322
uti-1.317
Token Daily
Token ID6714
Feature activation-1.09e-01

Pos logit contributions
 Stamp+1.544
sein+1.503
Disclaimer+1.455
idium+1.436
agate+1.347
Neg logit contributions
 Interpret-3.315
utton-3.312
uni-3.291
uti-3.237
 injure-3.227
Token Edition
Token ID5061
Feature activation-1.33e-01

Pos logit contributions
sein+0.365
idium+0.360
+0.342
 Stamp+0.342
Disclaimer+0.341
Neg logit contributions
 Interpret-0.439
uni-0.414
utton-0.413
 taunt-0.412
uti-0.410
Token by
Token ID416
Feature activation-5.21e-01

Pos logit contributions
sein+0.467
 Stamp+0.467
idium+0.455
Disclaimer+0.452
+0.438
Neg logit contributions
 Interpret-0.491
utton-0.481
uni-0.480
uti-0.470
 taunt-0.468
Token.
Token ID13
Feature activation+8.00e-03

Pos logit contributions
 Stamp+0.682
sein+0.677
idium+0.657
Disclaimer+0.655
+0.629
Neg logit contributions
 Interpret-0.870
utton-0.865
uni-0.856
uti-0.842
 injure-0.838
Token Its
Token ID6363
Feature activation+5.30e-01

Pos logit contributions
 Interpret+0.030
utton+0.029
uni+0.029
uti+0.029
 injure+0.028
Neg logit contributions
 Stamp-0.028
sein-0.028
Disclaimer-0.027
idium-0.027
agate-0.026
Token normal
Token ID3487
Feature activation+2.17e-01

Pos logit contributions
 Interpret+1.916
utton+1.874
uni+1.856
 taunt+1.818
 injure+1.813
Neg logit contributions
sein-1.907
 Stamp-1.899
idium-1.874
Disclaimer-1.852
agate-1.783
Token range
Token ID2837
Feature activation+2.91e-02

Pos logit contributions
 Interpret+0.867
utton+0.856
uni+0.849
uti+0.832
 injure+0.830
Neg logit contributions
 Stamp-0.693
sein-0.692
idium-0.673
Disclaimer-0.670
agate-0.639
Token is
Token ID318
Feature activation-2.75e-02

Pos logit contributions
 Interpret+0.105
utton+0.103
uni+0.102
uti+0.100
 injure+0.100
Neg logit contributions
 Stamp-0.105
sein-0.104
idium-0.101
Disclaimer-0.101
-0.097
Token [-
Token ID25915
Feature activation-1.23e+00

Pos logit contributions
 Stamp+0.097
sein+0.097
idium+0.094
Disclaimer+0.094
+0.090
Neg logit contributions
 Interpret-0.101
utton-0.100
uni-0.099
uti-0.097
 injure-0.096
Token1
Token ID16
Feature activation-6.09e-01

Pos logit contributions
sein+2.856
 Stamp+2.786
idium+2.741
Disclaimer+2.715
+2.532
Neg logit contributions
 Interpret-5.730
utton-5.647
 injure-5.615
 taunt-5.592
uti-5.585
Token,
Token ID11
Feature activation-9.78e-02

Pos logit contributions
sein+1.466
 Stamp+1.464
idium+1.404
Disclaimer+1.397
+1.324
Neg logit contributions
 Interpret-2.899
utton-2.861
uni-2.823
uti-2.800
 injure-2.788
Token+
Token ID10
Feature activation-1.60e-01

Pos logit contributions
sein+0.279
 Stamp+0.279
idium+0.269
Disclaimer+0.269
agate+0.254
Neg logit contributions
 Interpret-0.425
utton-0.421
uni-0.416
uti-0.411
 injure-0.408
Token1
Token ID16
Feature activation-6.57e-01

Pos logit contributions
 Stamp+0.442
sein+0.442
Disclaimer+0.427
idium+0.426
+0.402
Neg logit contributions
 Interpret-0.709
utton-0.700
uni-0.694
uti-0.684
 injure-0.684
Token],
Token ID4357
Feature activation-5.57e-01

Pos logit contributions
 Stamp+1.430
sein+1.398
idium+1.349
Disclaimer+1.319
+1.249
Neg logit contributions
 Interpret-3.273
utton-3.224
uni-3.190
uti-3.164
 injure-3.162
Token best
Token ID1266
Feature activation-1.37e-01

Pos logit contributions
 Stamp+0.797
sein+0.797
idium+0.770
Disclaimer+0.768
+0.733
Neg logit contributions
 Interpret-0.953
utton-0.944
uni-0.934
uti-0.916
 injure-0.912
Token and
Token ID290
Feature activation-3.13e-01

Pos logit contributions
 Stamp+0.469
sein+0.468
idium+0.458
Disclaimer+0.457
+0.441
Neg logit contributions
 Interpret-0.513
utton-0.506
uni-0.502
uti-0.492
 injure-0.490
Token the
Token ID262
Feature activation-3.89e-01

Pos logit contributions
 Stamp+1.118
sein+1.112
idium+1.087
Disclaimer+1.086
+1.041
Neg logit contributions
 Interpret-1.134
utton-1.127
uni-1.108
uti-1.087
 injure-1.075
Token best
Token ID1266
Feature activation-3.42e-01

Pos logit contributions
 Stamp+1.310
sein+1.300
idium+1.268
Disclaimer+1.260
+1.204
Neg logit contributions
 Interpret-1.492
utton-1.479
uni-1.459
uti-1.430
 injure-1.427
Token of
Token ID286
Feature activation-6.45e-01

Pos logit contributions
 Stamp+1.185
sein+1.179
idium+1.157
Disclaimer+1.150
+1.114
Neg logit contributions
 Interpret-1.270
utton-1.252
uni-1.248
uti-1.224
 injure-1.213
Token luck
Token ID8458
Feature activation-1.22e+00

Pos logit contributions
 Stamp+2.015
sein+2.006
idium+1.971
Disclaimer+1.944
+1.884
Neg logit contributions
 Interpret-2.605
utton-2.583
uni-2.546
uti-2.508
 injure-2.478
Token in
Token ID287
Feature activation-1.37e-01

Pos logit contributions
 Stamp+3.677
sein+3.597
 spinning+3.581
Disclaimer+3.555
idium+3.514
Neg logit contributions
 Interpret-4.861
uni-4.798
utton-4.777
 injure-4.658
uti-4.638
Token the
Token ID262
Feature activation-1.34e-01

Pos logit contributions
 Stamp+0.471
sein+0.466
idium+0.455
Disclaimer+0.453
agate+0.438
Neg logit contributions
 Interpret-0.518
utton-0.515
uni-0.509
uti-0.497
 injure-0.497
Token coming
Token ID2406
Feature activation-1.60e-02

Pos logit contributions
 Stamp+0.465
sein+0.458
idium+0.449
Disclaimer+0.444
agate+0.428
Neg logit contributions
 Interpret-0.504
utton-0.501
uni-0.496
uti-0.487
 injure-0.484
Token days
Token ID1528
Feature activation-6.19e-02

Pos logit contributions
 Stamp+0.054
sein+0.054
idium+0.052
Disclaimer+0.052
agate+0.050
Neg logit contributions
 Interpret-0.061
utton-0.061
uni-0.060
uti-0.059
 injure-0.059
Token.
Token ID13
Feature activation-3.02e-01

Pos logit contributions
 Stamp+0.185
sein+0.184
idium+0.179
Disclaimer+0.178
+0.170
Neg logit contributions
 Interpret-0.261
utton-0.258
uni-0.257
uti-0.251
 injure-0.251
Token since
Token ID1201
Feature activation-4.80e-01

Pos logit contributions
 Stamp+1.468
sein+1.462
idium+1.419
Disclaimer+1.409
agate+1.337
Neg logit contributions
 Interpret-2.265
utton-2.251
uni-2.236
uti-2.184
 injure-2.179
Token February
Token ID3945
Feature activation-2.46e-02

Pos logit contributions
 Stamp+1.472
sein+1.460
idium+1.417
Disclaimer+1.408
 spinning+1.350
Neg logit contributions
 Interpret-1.983
utton-1.974
uni-1.950
uti-1.910
 injure-1.903
Token 2013
Token ID2211
Feature activation-2.78e-02

Pos logit contributions
 Stamp+0.073
sein+0.073
idium+0.071
Disclaimer+0.070
agate+0.067
Neg logit contributions
 Interpret-0.104
utton-0.103
uni-0.102
uti-0.100
 injure-0.100
Token,
Token ID11
Feature activation-2.72e-02

Pos logit contributions
 Stamp+0.071
sein+0.071
idium+0.068
Disclaimer+0.068
agate+0.064
Neg logit contributions
 Interpret-0.129
utton-0.128
uni-0.127
uti-0.125
 injure-0.125
Token was
Token ID373
Feature activation-8.12e-02

Pos logit contributions
 Stamp+0.088
sein+0.088
idium+0.085
Disclaimer+0.085
agate+0.081
Neg logit contributions
 Interpret-0.108
utton-0.106
uni-0.105
uti-0.103
 injure-0.103
Token forced
Token ID4137
Feature activation-1.22e+00

Pos logit contributions
 Stamp+0.243
sein+0.243
idium+0.236
Disclaimer+0.235
agate+0.223
Neg logit contributions
 Interpret-0.341
utton-0.337
uni-0.335
uti-0.328
 injure-0.327
Token out
Token ID503
Feature activation-4.56e-01

Pos logit contributions
sein+3.075
 Stamp+3.032
idium+3.020
 spinning+2.938
Disclaimer+2.920
Neg logit contributions
 Interpret-5.571
utton-5.439
uni-5.401
 injure-5.192
uti-5.180
Token by
Token ID416
Feature activation+2.72e-02

Pos logit contributions
 Stamp+1.638
sein+1.619
idium+1.594
Disclaimer+1.575
 spinning+1.523
Neg logit contributions
utton-1.635
 Interpret-1.634
uni-1.625
uti-1.557
 injure-1.556
Token President
Token ID1992
Feature activation-1.42e-01

Pos logit contributions
 Interpret+0.101
utton+0.100
uni+0.099
uti+0.097
 injure+0.097
Neg logit contributions
 Stamp-0.094
sein-0.094
idium-0.092
Disclaimer-0.091
agate-0.087
Token Barack
Token ID8732
Feature activation-1.00e-01

Pos logit contributions
 Stamp+0.397
sein+0.394
idium+0.379
Disclaimer+0.378
agate+0.358
Neg logit contributions
 Interpret-0.630
utton-0.623
uni-0.622
 injure-0.608
uti-0.608
Token Obama
Token ID2486
Feature activation-3.00e-01

Pos logit contributions
 Stamp+0.285
sein+0.285
idium+0.277
Disclaimer+0.275
agate+0.261
Neg logit contributions
 Interpret-0.437
utton-0.432
uni-0.429
uti-0.421
 injure-0.420
Tokenst
Token ID301
Feature activation-2.62e-01

Pos logit contributions
 Interpret+0.614
utton+0.603
uni+0.597
uti+0.586
 injure+0.585
Neg logit contributions
sein-0.604
 Stamp-0.602
idium-0.587
Disclaimer-0.586
agate-0.562
Tokenaring
Token ID1723
Feature activation+5.58e-01

Pos logit contributions
sein+0.836
 Stamp+0.831
idium+0.808
Disclaimer+0.799
agate+0.777
Neg logit contributions
 Interpret-1.054
utton-1.026
uni-1.016
 taunt-1.013
 injure-1.009
Token at
Token ID379
Feature activation-7.37e-02

Pos logit contributions
 Interpret+2.109
utton+2.078
uni+2.066
 taunt+2.022
 injure+2.020
Neg logit contributions
sein-1.904
 Stamp-1.898
idium-1.848
Disclaimer-1.848
-1.767
Token air
Token ID1633
Feature activation+2.71e-01

Pos logit contributions
 Stamp+0.258
sein+0.255
idium+0.250
Disclaimer+0.247
 spinning+0.239
Neg logit contributions
 Interpret-0.273
utton-0.272
uni-0.268
uti-0.265
 injure-0.261
Token/
Token ID14
Feature activation+2.76e-01

Pos logit contributions
 Interpret+1.007
utton+0.995
uni+0.987
uti+0.969
 injure+0.958
Neg logit contributions
 Stamp-0.944
sein-0.942
idium-0.917
Disclaimer-0.916
agate-0.879
Tokenbeing
Token ID11873
Feature activation+8.48e-01

Pos logit contributions
 Interpret+1.004
utton+0.989
uni+0.979
uti+0.960
 injure+0.958
Neg logit contributions
sein-0.988
 Stamp-0.987
idium-0.962
Disclaimer-0.960
agate-0.920
Token buried
Token ID11694
Feature activation+5.62e-01

Pos logit contributions
 Interpret+3.262
uni+3.215
utton+3.205
 taunt+3.176
 injure+3.155
Neg logit contributions
sein-2.799
 Stamp-2.762
idium-2.742
Disclaimer-2.701
-2.601
Token alive
Token ID6776
Feature activation+4.77e-01

Pos logit contributions
 Interpret+2.063
utton+2.029
uni+2.019
 injure+1.980
 taunt+1.976
Neg logit contributions
 Stamp-1.974
sein-1.967
idium-1.923
Disclaimer-1.915
-1.839
Token."[
Token ID18161
Feature activation+2.46e-01

Pos logit contributions
 Interpret+1.823
utton+1.796
uni+1.783
uti+1.744
 injure+1.744
Neg logit contributions
 Stamp-1.611
sein-1.610
idium-1.567
Disclaimer-1.561
agate-1.494
Token155
Token ID18742
Feature activation-2.41e-01

Pos logit contributions
 Interpret+0.840
utton+0.829
uni+0.820
 injure+0.801
uti+0.801
Neg logit contributions
sein-0.930
 Stamp-0.927
Disclaimer-0.907
idium-0.906
-0.868
Token]
Token ID60
Feature activation-3.86e-01

Pos logit contributions
sein+0.550
 Stamp+0.544
Disclaimer+0.530
idium+0.522
+0.482
Neg logit contributions
 Interpret-1.182
utton-1.173
uni-1.166
 injure-1.148
uti-1.148
Token with
Token ID351
Feature activation+1.96e-01

Pos logit contributions
 Interpret+3.163
utton+3.089
uni+3.071
 taunt+3.023
uti+3.014
Neg logit contributions
sein-2.659
 Stamp-2.613
idium-2.580
Disclaimer-2.574
-2.464
Token not
Token ID407
Feature activation+4.42e-02

Pos logit contributions
 Interpret+0.761
utton+0.758
uni+0.752
uti+0.740
 injure+0.733
Neg logit contributions
 Stamp-0.656
sein-0.643
Disclaimer-0.626
idium-0.625
agate-0.596
Tokenepad
Token ID47852
Feature activation+3.77e-01

Pos logit contributions
 Interpret+0.162
utton+0.159
uni+0.157
 injure+0.154
uti+0.154
Neg logit contributions
 Stamp-0.157
sein-0.156
idium-0.153
Disclaimer-0.152
-0.145
Token to
Token ID284
Feature activation-6.92e-02

Pos logit contributions
 Interpret+1.483
utton+1.468
uni+1.455
uti+1.427
 injure+1.424
Neg logit contributions
 Stamp-1.233
sein-1.228
idium-1.193
Disclaimer-1.193
agate-1.136
Token your
Token ID534
Feature activation+2.18e-01

Pos logit contributions
 Stamp+0.257
sein+0.252
Disclaimer+0.247
idium+0.246
agate+0.237
Neg logit contributions
 Interpret-0.241
utton-0.241
uni-0.238
uti-0.234
 injure-0.229
Token undead
Token ID26805
Feature activation+8.77e-01

Pos logit contributions
 Interpret+0.823
utton+0.813
uni+0.806
uti+0.789
 injure+0.788
Neg logit contributions
 Stamp-0.748
sein-0.747
idium-0.727
Disclaimer-0.725
agate-0.694
Token heart
Token ID2612
Feature activation+4.22e-01

Pos logit contributions
 Interpret+3.201
utton+3.156
uni+3.116
 taunt+3.099
 injure+3.074
Neg logit contributions
sein-3.096
 Stamp-3.064
Disclaimer-2.997
idium-2.991
-2.889
Token's
Token ID338
Feature activation+4.73e-02

Pos logit contributions
 Interpret+1.771
utton+1.762
uni+1.742
 injure+1.711
 taunt+1.708
Neg logit contributions
sein-1.256
 Stamp-1.253
idium-1.220
Disclaimer-1.217
agate-1.155
Token content
Token ID2695
Feature activation+2.18e-02

Pos logit contributions
 Interpret+0.230
utton+0.226
uni+0.226
uti+0.223
 injure+0.221
Neg logit contributions
 Stamp-0.112
sein-0.110
Disclaimer-0.106
idium-0.106
agate-0.099
Token.
Token ID13
Feature activation+2.24e-01

Pos logit contributions
 Interpret+0.091
utton+0.090
uni+0.089
uti+0.088
 injure+0.087
Neg logit contributions
 Stamp-0.067
sein-0.066
Disclaimer-0.064
idium-0.064
agate-0.061
Token This
Token ID770
Feature activation+2.17e-01

Pos logit contributions
 Interpret+0.846
utton+0.827
uni+0.821
uti+0.802
 taunt+0.802
Neg logit contributions
sein-0.769
 Stamp-0.763
idium-0.752
Disclaimer-0.743
-0.715
Token Yad
Token ID41449
Feature activation+9.60e-02

Pos logit contributions
 Stamp+0.766
sein+0.746
Disclaimer+0.722
idium+0.715
 spinning+0.686
Neg logit contributions
 Interpret-0.916
utton-0.907
uni-0.899
uti-0.883
 injure-0.870
Tokenav
Token ID615
Feature activation-7.65e-03

Pos logit contributions
 Interpret+0.279
utton+0.275
uni+0.272
uti+0.264
 injure+0.263
Neg logit contributions
 Stamp-0.412
sein-0.411
idium-0.402
Disclaimer-0.402
-0.388
Token.
Token ID13
Feature activation-1.48e-01

Pos logit contributions
 Stamp+0.024
sein+0.024
idium+0.024
Disclaimer+0.024
agate+0.023
Neg logit contributions
 Interpret-0.031
utton-0.030
uni-0.030
uti-0.029
 injure-0.029
Token A
Token ID317
Feature activation+1.18e-01

Pos logit contributions
 Stamp+0.536
sein+0.532
Disclaimer+0.519
idium+0.511
+0.495
Neg logit contributions
 Interpret-0.526
utton-0.525
uni-0.519
 injure-0.510
uti-0.509
Token case
Token ID1339
Feature activation+1.19e-01

Pos logit contributions
 Interpret+0.431
utton+0.425
uni+0.422
uti+0.411
 injure+0.411
Neg logit contributions
 Stamp-0.418
sein-0.415
Disclaimer-0.403
idium-0.402
agate-0.385
Token was
Token ID373
Feature activation+9.26e-01

Pos logit contributions
 Interpret+0.411
utton+0.406
uni+0.401
uti+0.391
 injure+0.391
Neg logit contributions
 Stamp-0.448
sein-0.448
idium-0.437
Disclaimer-0.436
agate-0.419
Token earlier
Token ID2961
Feature activation+6.41e-01

Pos logit contributions
 Interpret+3.298
uni+3.222
utton+3.213
uti+3.212
 taunt+3.200
Neg logit contributions
sein-3.260
 Stamp-3.252
idium-3.180
Disclaimer-3.177
-3.137
Token filed
Token ID5717
Feature activation+4.07e-01

Pos logit contributions
 Interpret+2.458
utton+2.410
uni+2.408
 taunt+2.392
uti+2.389
Neg logit contributions
sein-2.115
 Stamp-2.103
Disclaimer-2.056
idium-2.053
-2.010
Token against
Token ID1028
Feature activation+3.65e-02

Pos logit contributions
 Interpret+1.585
utton+1.563
uni+1.552
uti+1.535
 taunt+1.529
Neg logit contributions
sein-1.330
 Stamp-1.325
idium-1.294
Disclaimer-1.290
-1.241
Token Vin
Token ID11820
Feature activation+2.28e-01

Pos logit contributions
 Interpret+0.148
utton+0.146
uni+0.145
uti+0.142
 injure+0.141
Neg logit contributions
 Stamp-0.116
sein-0.115
idium-0.110
Disclaimer-0.110
agate-0.105
Tokenod
Token ID375
Feature activation-7.72e-02

Pos logit contributions
utton+0.706
 Interpret+0.704
uni+0.699
uti+0.683
 taunt+0.667
Neg logit contributions
 Stamp-0.934
sein-0.924
Disclaimer-0.908
idium-0.907
-0.879
Token that
Token ID326
Feature activation+3.98e-01

Pos logit contributions
 Interpret+0.671
utton+0.663
uni+0.657
uti+0.643
 injure+0.642
Neg logit contributions
 Stamp-0.630
sein-0.630
idium-0.613
Disclaimer-0.612
agate-0.586
Token there
Token ID612
Feature activation-9.93e-02

Pos logit contributions
 Interpret+1.465
utton+1.446
uni+1.432
 injure+1.403
uti+1.401
Neg logit contributions
sein-1.399
 Stamp-1.395
idium-1.366
Disclaimer-1.356
agate-1.304
Token
Token ID447
Feature activation+2.82e-01

Pos logit contributions
sein+0.317
 Stamp+0.316
idium+0.307
Disclaimer+0.307
+0.300
Neg logit contributions
 Interpret-0.398
utton-0.389
uni-0.386
uti-0.381
 taunt-0.379
Token
Token ID247
Feature activation+8.46e-02

Pos logit contributions
 Interpret+1.103
utton+1.088
uni+1.080
uti+1.058
 injure+1.057
Neg logit contributions
sein-0.929
 Stamp-0.929
idium-0.903
Disclaimer-0.900
-0.861
Tokens
Token ID82
Feature activation+2.13e-01

Pos logit contributions
 Interpret+0.326
utton+0.321
uni+0.319
 injure+0.312
uti+0.312
Neg logit contributions
sein-0.284
 Stamp-0.283
idium-0.276
Disclaimer-0.275
agate-0.264
Token only
Token ID691
Feature activation+1.02e+00

Pos logit contributions
 Interpret+0.875
utton+0.866
uni+0.859
uti+0.842
 injure+0.841
Neg logit contributions
 Stamp-0.660
sein-0.659
idium-0.640
Disclaimer-0.638
agate-0.608
Token room
Token ID2119
Feature activation+2.53e-01

Pos logit contributions
 Interpret+3.697
utton+3.672
uni+3.582
 injure+3.567
 taunt+3.562
Neg logit contributions
sein-3.563
 Stamp-3.499
idium-3.464
Disclaimer-3.429
agate-3.321
Token for
Token ID329
Feature activation+5.30e-01

Pos logit contributions
 Interpret+0.951
utton+0.932
uni+0.929
uti+0.910
 injure+0.906
Neg logit contributions
sein-0.879
 Stamp-0.878
Disclaimer-0.852
idium-0.851
-0.815
Token one
Token ID530
Feature activation+1.04e-02

Pos logit contributions
 Interpret+1.996
utton+1.976
uni+1.949
 injure+1.916
 taunt+1.914
Neg logit contributions
sein-1.813
 Stamp-1.800
idium-1.770
Disclaimer-1.756
agate-1.689
Token on
Token ID319
Feature activation+1.11e-01

Pos logit contributions
 Interpret+0.035
utton+0.034
uni+0.034
uti+0.033
 injure+0.033
Neg logit contributions
 Stamp-0.040
sein-0.040
Disclaimer-0.039
idium-0.039
-0.037
Token a
Token ID257
Feature activation+5.80e-02

Pos logit contributions
 Interpret+0.442
utton+0.436
uni+0.434
uti+0.425
 taunt+0.422
Neg logit contributions
 Stamp-0.361
sein-0.357
Disclaimer-0.347
idium-0.346
-0.331
Token All
Token ID1439
Feature activation+7.98e-02

Pos logit contributions
 Interpret+0.691
utton+0.680
uni+0.675
uti+0.661
 injure+0.658
Neg logit contributions
sein-0.651
 Stamp-0.650
idium-0.634
Disclaimer-0.630
agate-0.605
Token entries
Token ID12784
Feature activation+1.09e-01

Pos logit contributions
 Interpret+0.298
utton+0.296
uni+0.293
 injure+0.287
uti+0.286
Neg logit contributions
 Stamp-0.276
sein-0.274
Disclaimer-0.268
idium-0.268
-0.256
Token are
Token ID389
Feature activation+2.05e-01

Pos logit contributions
 Interpret+0.397
utton+0.392
uni+0.388
uti+0.380
 injure+0.379
Neg logit contributions
 Stamp-0.387
sein-0.386
idium-0.376
Disclaimer-0.376
agate-0.360
Token cleared
Token ID12539
Feature activation+1.45e-01

Pos logit contributions
 Interpret+0.794
utton+0.779
uni+0.774
uti+0.758
 taunt+0.756
Neg logit contributions
sein-0.686
 Stamp-0.679
idium-0.665
Disclaimer-0.660
-0.635
Token by
Token ID416
Feature activation+4.05e-01

Pos logit contributions
 Interpret+0.571
utton+0.560
uni+0.557
uti+0.547
 taunt+0.545
Neg logit contributions
sein-0.478
 Stamp-0.474
idium-0.465
Disclaimer-0.461
-0.443
Token pressing
Token ID12273
Feature activation+8.76e-01

Pos logit contributions
 Interpret+1.563
utton+1.534
uni+1.526
uti+1.493
 taunt+1.492
Neg logit contributions
sein-1.360
 Stamp-1.348
idium-1.316
Disclaimer-1.307
-1.265
Token the
Token ID262
Feature activation+6.43e-01

Pos logit contributions
 Interpret+3.277
utton+3.174
uni+3.137
 taunt+3.100
uti+3.099
Neg logit contributions
sein-3.051
 Stamp-3.011
idium-2.969
Disclaimer-2.930
-2.844
Token Clear
Token ID11459
Feature activation-1.77e-01

Pos logit contributions
 Interpret+2.440
utton+2.366
uni+2.346
 taunt+2.311
uti+2.307
Neg logit contributions
sein-2.223
 Stamp-2.186
idium-2.159
Disclaimer-2.126
-2.073
Token button
Token ID4936
Feature activation+9.78e-02

Pos logit contributions
 Stamp+0.556
sein+0.545
Disclaimer+0.534
idium+0.530
agate+0.508
Neg logit contributions
utton-0.719
 Interpret-0.719
uni-0.713
 injure-0.700
uti-0.696
Token.
Token ID13
Feature activation+6.00e-02

Pos logit contributions
 Interpret+0.393
utton+0.389
uni+0.386
uti+0.378
 injure+0.377
Neg logit contributions
 Stamp-0.311
sein-0.310
idium-0.301
Disclaimer-0.300
agate-0.287
Token If
Token ID1002
Feature activation+2.79e-01

Pos logit contributions
 Interpret+0.217
utton+0.214
uni+0.212
uti+0.207
 injure+0.207
Neg logit contributions
 Stamp-0.216
sein-0.215
idium-0.210
Disclaimer-0.209
agate-0.201
Tokens
Token ID82
Feature activation-9.18e-02

Pos logit contributions
 Interpret+0.438
utton+0.432
uni+0.429
uti+0.422
 taunt+0.417
Neg logit contributions
 Stamp-0.373
sein-0.373
Disclaimer-0.363
idium-0.363
agate-0.346
Token fall
Token ID2121
Feature activation+1.26e-01

Pos logit contributions
 Stamp+0.319
sein+0.317
idium+0.308
Disclaimer+0.307
agate+0.298
Neg logit contributions
 Interpret-0.342
utton-0.337
uni-0.335
uti-0.329
 taunt-0.322
Token into
Token ID656
Feature activation-1.62e-02

Pos logit contributions
 Interpret+0.492
utton+0.486
uni+0.483
 injure+0.473
uti+0.473
Neg logit contributions
sein-0.414
 Stamp-0.413
idium-0.402
Disclaimer-0.401
-0.383
Token the
Token ID262
Feature activation+2.75e-01

Pos logit contributions
sein+0.053
 Stamp+0.053
Disclaimer+0.050
idium+0.050
agate+0.049
Neg logit contributions
 Interpret-0.063
utton-0.063
uni-0.062
uti-0.061
 taunt-0.061
Token sea
Token ID5417
Feature activation+1.30e-01

Pos logit contributions
 Interpret+0.974
utton+0.963
uni+0.953
uti+0.933
 injure+0.929
Neg logit contributions
 Stamp-1.005
sein-1.004
idium-0.978
Disclaimer-0.976
agate-0.936
Token and
Token ID290
Feature activation+1.47e-01

Pos logit contributions
 Interpret+0.535
utton+0.529
uni+0.525
uti+0.515
 injure+0.513
Neg logit contributions
 Stamp-0.401
sein-0.401
idium-0.389
Disclaimer-0.388
agate-0.369
Token glaciers
Token ID40509
Feature activation+4.23e-02

Pos logit contributions
 Interpret+0.533
utton+0.526
uni+0.521
uti+0.510
 injure+0.509
Neg logit contributions
 Stamp-0.526
sein-0.525
idium-0.511
Disclaimer-0.510
agate-0.489
Token retreat
Token ID13703
Feature activation-2.04e-01

Pos logit contributions
 Interpret+0.156
utton+0.154
uni+0.153
uti+0.150
 taunt+0.148
Neg logit contributions
 Stamp-0.148
sein-0.148
idium-0.144
Disclaimer-0.143
agate-0.138
Token further
Token ID2252
Feature activation-5.20e-02

Pos logit contributions
 Stamp+0.636
sein+0.633
idium+0.615
Disclaimer+0.613
agate+0.584
Neg logit contributions
 Interpret-0.833
utton-0.823
uni-0.816
uti-0.801
 injure-0.797
Token.
Token ID13
Feature activation-1.14e-01

Pos logit contributions
 Stamp+0.200
sein+0.200
idium+0.195
Disclaimer+0.194
agate+0.187
Neg logit contributions
 Interpret-0.174
utton-0.172
uni-0.170
uti-0.166
 injure-0.165
Token (
Token ID357
Feature activation-1.64e-01

Pos logit contributions
 Stamp+0.409
sein+0.404
Disclaimer+0.396
idium+0.392
+0.379
Neg logit contributions
utton-0.408
 Interpret-0.408
uni-0.404
 injure-0.397
uti-0.397
Token dou
Token ID2255
Feature activation-2.19e-01

Pos logit contributions
 Stamp+1.350
sein+1.326
Disclaimer+1.280
idium+1.260
 spinning+1.239
Neg logit contributions
utton-1.451
uni-1.439
 Interpret-1.439
uti-1.417
 injure-1.401
Tokenjin
Token ID18594
Feature activation-1.71e-01

Pos logit contributions
sein+0.804
 Stamp+0.801
Disclaimer+0.778
idium+0.774
+0.750
Neg logit contributions
 Interpret-0.770
utton-0.767
uni-0.743
uti-0.739
 taunt-0.736
Token video
Token ID2008
Feature activation-3.86e-01

Pos logit contributions
 Stamp+0.626
sein+0.619
Disclaimer+0.602
idium+0.601
 spinning+0.581
Neg logit contributions
 Interpret-0.599
utton-0.598
uni-0.589
uti-0.581
 injure-0.575
Token game
Token ID983
Feature activation-8.53e-02

Pos logit contributions
 Stamp+1.531
sein+1.503
Disclaimer+1.461
idium+1.456
 spinning+1.422
Neg logit contributions
 Interpret-1.246
uni-1.227
utton-1.227
uti-1.204
 injure-1.191
Token maker
Token ID16009
Feature activation+3.94e-02

Pos logit contributions
 Stamp+0.295
sein+0.292
Disclaimer+0.282
idium+0.282
 spinning+0.271
Neg logit contributions
 Interpret-0.318
utton-0.318
uni-0.314
uti-0.309
 injure-0.306
Token Jun
Token ID7653
Feature activation+3.20e-01

Pos logit contributions
 Interpret+0.157
utton+0.156
uni+0.154
uti+0.151
 injure+0.151
Neg logit contributions
 Stamp-0.126
sein-0.126
idium-0.122
Disclaimer-0.122
agate-0.116
Tokenya
Token ID3972
Feature activation-2.05e-01

Pos logit contributions
 Interpret+1.200
utton+1.184
uni+1.182
uti+1.157
 taunt+1.146
Neg logit contributions
 Stamp-1.098
sein-1.095
idium-1.066
Disclaimer-1.066
-1.021
Token O
Token ID440
Feature activation-1.14e-01

Pos logit contributions
 Stamp+0.682
sein+0.677
idium+0.659
Disclaimer+0.659
agate+0.627
Neg logit contributions
 Interpret-0.796
utton-0.791
uni-0.779
 injure-0.766
uti-0.766
Tokenta
Token ID8326
Feature activation+4.72e-01

Pos logit contributions
sein+0.392
 Stamp+0.392
idium+0.383
Disclaimer+0.381
+0.364
Neg logit contributions
 Interpret-0.433
utton-0.426
uni-0.418
 injure-0.415
uti-0.413
Token or
Token ID393
Feature activation+1.21e-01

Pos logit contributions
 Interpret+1.824
uni+1.796
utton+1.792
uti+1.751
 taunt+1.739
Neg logit contributions
sein-1.580
 Stamp-1.572
idium-1.536
Disclaimer-1.528
agate-1.460
Token Z
Token ID1168
Feature activation+4.51e-01

Pos logit contributions
 Interpret+0.485
utton+0.481
uni+0.476
uti+0.467
 injure+0.466
Neg logit contributions
sein-0.386
 Stamp-0.386
idium-0.373
Disclaimer-0.372
-0.355
Token if
Token ID611
Feature activation-8.95e-02

Pos logit contributions
 Stamp+0.083
sein+0.082
idium+0.080
Disclaimer+0.080
agate+0.076
Neg logit contributions
 Interpret-0.104
utton-0.103
uni-0.102
uti-0.100
 injure-0.100
Token it
Token ID340
Feature activation+2.05e-01

Pos logit contributions
 Stamp+0.311
sein+0.306
idium+0.299
Disclaimer+0.299
agate+0.286
Neg logit contributions
 Interpret-0.332
utton-0.331
uni-0.330
uti-0.321
 taunt-0.319
Token includes
Token ID3407
Feature activation+6.68e-02

Pos logit contributions
 Interpret+0.797
utton+0.788
uni+0.781
uti+0.765
 injure+0.763
Neg logit contributions
 Stamp-0.681
sein-0.681
idium-0.662
Disclaimer-0.660
agate-0.631
Token the
Token ID262
Feature activation+1.41e-01

Pos logit contributions
 Interpret+0.255
utton+0.253
uni+0.251
uti+0.245
 injure+0.244
Neg logit contributions
 Stamp-0.226
sein-0.224
idium-0.219
Disclaimer-0.219
agate-0.209
Token derogatory
Token ID44094
Feature activation+3.89e-01

Pos logit contributions
 Interpret+0.502
utton+0.496
uni+0.494
uti+0.483
 injure+0.480
Neg logit contributions
 Stamp-0.516
sein-0.512
idium-0.501
Disclaimer-0.500
agate-0.478
Token information
Token ID1321
Feature activation+1.78e-01

Pos logit contributions
 Interpret+1.367
utton+1.341
uni+1.326
 taunt+1.310
 injure+1.302
Neg logit contributions
sein-1.444
 Stamp-1.428
idium-1.403
Disclaimer-1.395
-1.344
Token,
Token ID11
Feature activation+1.77e-01

Pos logit contributions
 Interpret+0.713
utton+0.704
uni+0.698
uti+0.685
 injure+0.684
Neg logit contributions
 Stamp-0.568
sein-0.568
idium-0.552
Disclaimer-0.550
agate-0.525
Token how
Token ID703
Feature activation-7.08e-02

Pos logit contributions
 Interpret+0.648
utton+0.640
uni+0.635
uti+0.621
 injure+0.618
Neg logit contributions
 Stamp-0.628
sein-0.626
idium-0.611
Disclaimer-0.610
agate-0.583
Token is
Token ID318
Feature activation-1.78e-01

Pos logit contributions
 Stamp+0.246
sein+0.246
idium+0.239
Disclaimer+0.239
agate+0.229
Neg logit contributions
 Interpret-0.263
utton-0.260
uni-0.257
uti-0.252
 injure-0.251
Token the
Token ID262
Feature activation-7.93e-02

Pos logit contributions
 Stamp+0.626
sein+0.616
Disclaimer+0.605
idium+0.602
 spinning+0.575
Neg logit contributions
utton-0.648
 Interpret-0.643
uni-0.638
uti-0.635
 taunt-0.621
Token Court
Token ID3078
Feature activation+6.93e-02

Pos logit contributions
 Stamp+0.301
sein+0.297
Disclaimer+0.291
idium+0.289
agate+0.278
Neg logit contributions
 Interpret-0.268
utton-0.268
uni-0.267
uti-0.262
 injure-0.257
Token considers
Token ID14358
Feature activation-2.99e-01

Pos logit contributions
 Stamp+0.817
sein+0.813
idium+0.793
Disclaimer+0.793
 spinning+0.760
Neg logit contributions
 Interpret-0.710
utton-0.702
uni-0.690
uti-0.677
 injure-0.669
Token them
Token ID606
Feature activation-3.56e-01

Pos logit contributions
 Stamp+1.011
sein+0.992
Disclaimer+0.970
idium+0.966
 spinning+0.932
Neg logit contributions
 Interpret-1.146
utton-1.138
uni-1.120
uti-1.110
 injure-1.092
Token as
Token ID355
Feature activation-3.03e-01

Pos logit contributions
 Stamp+1.113
sein+1.113
Disclaimer+1.085
idium+1.072
+1.022
Neg logit contributions
 Interpret-1.443
utton-1.438
uni-1.418
uti-1.395
 injure-1.381
Token a
Token ID257
Feature activation+3.74e-02

Pos logit contributions
 Stamp+0.980
sein+0.973
Disclaimer+0.948
idium+0.940
 spinning+0.900
Neg logit contributions
 Interpret-1.201
utton-1.196
uni-1.176
uti-1.155
 injure-1.145
Token federal
Token ID2717
Feature activation-2.40e-01

Pos logit contributions
 Interpret+0.141
utton+0.140
uni+0.139
uti+0.136
 injure+0.135
Neg logit contributions
 Stamp-0.128
sein-0.128
idium-0.124
Disclaimer-0.124
-0.118
Token region
Token ID3814
Feature activation+1.22e-01

Pos logit contributions
 Stamp+0.849
sein+0.846
idium+0.823
Disclaimer+0.822
+0.785
Neg logit contributions
 Interpret-0.879
utton-0.872
uni-0.864
uti-0.843
 injure-0.840
Token.
Token ID13
Feature activation+1.47e-01

Pos logit contributions
 Interpret+0.492
utton+0.485
uni+0.482
uti+0.473
 injure+0.472
Neg logit contributions
sein-0.388
 Stamp-0.388
idium-0.376
Disclaimer-0.375
agate-0.359
Token
Token ID198
Feature activation+3.90e-01

Pos logit contributions
utton+0.523
 Interpret+0.522
uni+0.515
 injure+0.509
uti+0.507
Neg logit contributions
 Stamp-0.537
sein-0.533
Disclaimer-0.521
idium-0.517
-0.497
Token
Token ID198
Feature activation+3.59e-01

Pos logit contributions
 Interpret+1.146
utton+1.119
uni+1.112
uti+1.073
 taunt+1.055
Neg logit contributions
 Stamp-1.655
sein-1.650
idium-1.639
Disclaimer-1.593
agate-1.572
TokenIt
Token ID1026
Feature activation+3.20e-01

Pos logit contributions
 Interpret+1.389
utton+1.372
uni+1.360
uti+1.332
 injure+1.330
Neg logit contributions
 Stamp-1.196
sein-1.195
idium-1.162
Disclaimer-1.160
-1.107
Token has
Token ID468
Feature activation+2.39e-01

Pos logit contributions
 Interpret+1.155
utton+1.141
uni+1.132
uti+1.107
 injure+1.104
Neg logit contributions
 Stamp-1.148
sein-1.147
idium-1.118
Disclaimer-1.116
-1.070
Token
Token ID198
Feature activation-6.35e-03

Pos logit contributions
 Stamp+0.577
sein+0.570
Disclaimer+0.560
idium+0.551
+0.531
Neg logit contributions
utton-0.617
 Interpret-0.614
uni-0.613
 injure-0.603
uti-0.600
Token
Token ID198
Feature activation-3.11e-02

Pos logit contributions
 Stamp+0.026
sein+0.026
idium+0.025
Disclaimer+0.025
agate+0.024
Neg logit contributions
 Interpret-0.020
utton-0.020
uni-0.020
uti-0.019
 injure-0.019
TokenI
Token ID40
Feature activation+4.87e-02

Pos logit contributions
sein+0.102
 Stamp+0.101
Disclaimer+0.099
idium+0.099
+0.094
Neg logit contributions
 Interpret-0.122
utton-0.121
uni-0.120
 injure-0.118
 taunt-0.118
Token should
Token ID815
Feature activation-2.43e-01

Pos logit contributions
 Interpret+0.179
utton+0.177
uni+0.175
uti+0.171
 injure+0.171
Neg logit contributions
sein-0.172
 Stamp-0.172
idium-0.167
Disclaimer-0.167
-0.160
Token be
Token ID307
Feature activation-2.36e-01

Pos logit contributions
 Stamp+0.858
sein+0.851
idium+0.833
Disclaimer+0.831
agate+0.797
Neg logit contributions
 Interpret-0.888
utton-0.882
uni-0.868
uti-0.853
 injure-0.841
Token most
Token ID749
Feature activation+4.97e-01

Pos logit contributions
 Stamp+0.793
sein+0.791
Disclaimer+0.770
idium+0.769
agate+0.734
Neg logit contributions
 Interpret-0.904
utton-0.895
uni-0.884
uti-0.866
 injure-0.865
Token grateful
Token ID14066
Feature activation-1.05e-01

Pos logit contributions
 Interpret+1.760
uni+1.726
utton+1.709
 injure+1.696
 taunt+1.694
Neg logit contributions
sein-1.809
 Stamp-1.807
idium-1.764
Disclaimer-1.750
agate-1.713
Token if
Token ID611
Feature activation+7.17e-02

Pos logit contributions
 Stamp+0.326
sein+0.325
Disclaimer+0.316
idium+0.315
+0.298
Neg logit contributions
 Interpret-0.431
utton-0.425
uni-0.422
uti-0.416
 injure-0.415
Token readers
Token ID7183
Feature activation+4.68e-01

Pos logit contributions
 Interpret+0.281
utton+0.279
uni+0.276
 injure+0.272
uti+0.271
Neg logit contributions
 Stamp-0.234
sein-0.233
Disclaimer-0.228
idium-0.227
-0.215
Token would
Token ID561
Feature activation+1.69e-01

Pos logit contributions
 Interpret+1.562
utton+1.530
uni+1.514
 injure+1.491
 taunt+1.482
Neg logit contributions
sein-1.817
 Stamp-1.813
Disclaimer-1.768
idium-1.761
-1.713
Token be
Token ID307
Feature activation+1.07e-01

Pos logit contributions
 Interpret+0.574
utton+0.565
uni+0.560
uti+0.547
 injure+0.547
Neg logit contributions
 Stamp-0.642
sein-0.642
idium-0.626
Disclaimer-0.625
-0.601
Token for
Token ID329
Feature activation+1.94e-01

Pos logit contributions
 Interpret+0.342
utton+0.338
uni+0.335
uti+0.329
 injure+0.328
Neg logit contributions
 Stamp-0.264
sein-0.264
idium-0.255
Disclaimer-0.255
-0.243
Token two
Token ID734
Feature activation+1.23e-01

Pos logit contributions
 Interpret+0.815
utton+0.805
uni+0.800
uti+0.784
 injure+0.783
Neg logit contributions
sein-0.583
 Stamp-0.582
idium-0.564
Disclaimer-0.563
agate-0.535
Token reasons
Token ID3840
Feature activation+2.52e-02

Pos logit contributions
 Interpret+0.345
utton+0.342
uni+0.338
uti+0.330
 injure+0.326
Neg logit contributions
 Stamp-0.541
sein-0.536
Disclaimer-0.528
idium-0.527
-0.509
Token.
Token ID13
Feature activation-1.94e-01

Pos logit contributions
 Interpret+0.096
utton+0.095
uni+0.094
uti+0.093
 injure+0.092
Neg logit contributions
 Stamp-0.085
sein-0.084
idium-0.082
Disclaimer-0.082
agate-0.079
Token Instead
Token ID5455
Feature activation-8.68e-02

Pos logit contributions
 Stamp+0.519
sein+0.514
Disclaimer+0.497
idium+0.496
agate+0.468
Neg logit contributions
 Interpret-0.875
utton-0.871
uni-0.864
uti-0.850
 injure-0.848
Token of
Token ID286
Feature activation-1.04e-01

Pos logit contributions
 Stamp+0.333
sein+0.331
idium+0.326
Disclaimer+0.323
+0.313
Neg logit contributions
 Interpret-0.290
utton-0.289
uni-0.286
uti-0.281
 injure-0.279
Token gloss
Token ID21194
Feature activation-7.54e-02

Pos logit contributions
 Stamp+0.313
sein+0.309
idium+0.301
Disclaimer+0.301
 spinning+0.286
Neg logit contributions
 Interpret-0.438
utton-0.436
uni-0.432
uti-0.425
 injure-0.422
Tokening
Token ID278
Feature activation+8.94e-02

Pos logit contributions
sein+0.219
 Stamp+0.219
Disclaimer+0.210
idium+0.210
 spinning+0.200
Neg logit contributions
 Interpret-0.324
uni-0.320
utton-0.320
 injure-0.314
uti-0.312
Token over
Token ID625
Feature activation+4.99e-01

Pos logit contributions
 Interpret+0.390
utton+0.387
uni+0.384
uti+0.379
 injure+0.376
Neg logit contributions
 Stamp-0.254
sein-0.253
Disclaimer-0.245
idium-0.245
agate-0.232
Token these
Token ID777
Feature activation+1.64e-01

Pos logit contributions
 Interpret+1.891
utton+1.859
uni+1.840
 taunt+1.805
uti+1.802
Neg logit contributions
sein-1.701
 Stamp-1.693
idium-1.656
Disclaimer-1.649
-1.580
Token details
Token ID3307
Feature activation+1.54e-01

Pos logit contributions
 Interpret+0.608
utton+0.600
uni+0.594
uti+0.581
 injure+0.580
Neg logit contributions
sein-0.573
 Stamp-0.572
idium-0.558
Disclaimer-0.556
agate-0.533
Token a
Token ID257
Feature activation+1.88e-01

Pos logit contributions
 Stamp+0.016
sein+0.015
idium+0.015
Disclaimer+0.015
agate+0.014
Neg logit contributions
 Interpret-0.019
utton-0.019
uni-0.019
uti-0.018
 injure-0.018
Token sleek
Token ID33143
Feature activation+1.69e-01

Pos logit contributions
 Interpret+0.709
utton+0.701
uni+0.694
 injure+0.679
uti+0.679
Neg logit contributions
 Stamp-0.647
sein-0.639
idium-0.623
Disclaimer-0.622
-0.594
Token four
Token ID1440
Feature activation+8.50e-02

Pos logit contributions
 Interpret+0.653
utton+0.646
uni+0.640
 injure+0.627
uti+0.626
Neg logit contributions
 Stamp-0.565
sein-0.563
idium-0.548
Disclaimer-0.546
agate-0.522
Token wheel
Token ID7825
Feature activation+1.66e-01

Pos logit contributions
 Interpret+0.356
utton+0.351
uni+0.346
 injure+0.339
uti+0.338
Neg logit contributions
 Stamp-0.260
sein-0.258
idium-0.250
Disclaimer-0.249
-0.240
Tokener
Token ID263
Feature activation-3.81e-01

Pos logit contributions
 Interpret+0.675
utton+0.668
uni+0.662
uti+0.649
 injure+0.648
Neg logit contributions
 Stamp-0.521
sein-0.520
idium-0.502
Disclaimer-0.502
agate-0.479
Token powered
Token ID13232
Feature activation-3.37e-02

Pos logit contributions
 Stamp+1.213
sein+1.199
Disclaimer+1.167
idium+1.166
 spinning+1.129
Neg logit contributions
 Interpret-1.521
utton-1.506
uni-1.490
 injure-1.455
uti-1.454
Token by
Token ID416
Feature activation+1.25e-01

Pos logit contributions
 Stamp+0.112
sein+0.112
Disclaimer+0.109
idium+0.109
+0.104
Neg logit contributions
 Interpret-0.130
utton-0.129
uni-0.128
uti-0.125
 injure-0.124
Token a
Token ID257
Feature activation+6.95e-02

Pos logit contributions
 Interpret+0.506
utton+0.500
uni+0.496
 injure+0.487
uti+0.486
Neg logit contributions
 Stamp-0.397
sein-0.394
idium-0.385
Disclaimer-0.382
agate-0.366
Token 8
Token ID807
Feature activation-3.04e-02

Pos logit contributions
 Interpret+0.257
utton+0.254
uni+0.251
 injure+0.247
uti+0.246
Neg logit contributions
 Stamp-0.244
sein-0.242
idium-0.236
Disclaimer-0.235
-0.226
Token80
Token ID1795
Feature activation+3.13e-02

Pos logit contributions
 Stamp+0.096
sein+0.096
idium+0.093
Disclaimer+0.093
+0.089
Neg logit contributions
 Interpret-0.123
utton-0.121
uni-0.120
 injure-0.118
 taunt-0.117
Tokenhp
Token ID24831
Feature activation-3.99e-01

Pos logit contributions
 Interpret+0.111
utton+0.108
uni+0.107
 injure+0.106
 taunt+0.105
Neg logit contributions
 Stamp-0.115
sein-0.114
idium-0.112
Disclaimer-0.112
-0.108
Token and
Token ID290
Feature activation-4.31e-01

Pos logit contributions
 Stamp+1.213
sein+1.201
Disclaimer+1.180
idium+1.161
 spinning+1.125
Neg logit contributions
 Interpret-1.598
uni-1.590
utton-1.585
uti-1.542
 injure-1.534
Token innovative
Token ID13097
Feature activation-8.76e-02

Pos logit contributions
 Stamp+1.529
sein+1.529
Disclaimer+1.487
idium+1.477
 spinning+1.426
Neg logit contributions
 Interpret-1.544
utton-1.544
uni-1.526
uti-1.490
 injure-1.469
Token solutions
Token ID8136
Feature activation+6.51e-02

Pos logit contributions
 Stamp+0.321
sein+0.320
idium+0.312
Disclaimer+0.311
+0.299
Neg logit contributions
 Interpret-0.310
utton-0.306
uni-0.303
uti-0.296
 injure-0.295
Token to
Token ID284
Feature activation-1.11e-01

Pos logit contributions
 Interpret+0.261
utton+0.259
uni+0.256
uti+0.251
 injure+0.251
Neg logit contributions
sein-0.207
 Stamp-0.207
Disclaimer-0.201
idium-0.200
-0.191
Token the
Token ID262
Feature activation-9.68e-02

Pos logit contributions
 Stamp+0.376
sein+0.375
idium+0.365
Disclaimer+0.364
agate+0.348
Neg logit contributions
 Interpret-0.423
utton-0.420
uni-0.415
uti-0.406
 injure-0.404
Token Department
Token ID2732
Feature activation-4.51e-01

Pos logit contributions
 Stamp+0.342
sein+0.342
idium+0.333
Disclaimer+0.332
agate+0.317
Neg logit contributions
 Interpret-0.355
utton-0.351
uni-0.348
uti-0.340
 injure-0.339
Token of
Token ID286
Feature activation-1.45e-01

Pos logit contributions
 Stamp+1.606
idium+1.550
sein+1.544
+1.506
Disclaimer+1.505
Neg logit contributions
utton-1.609
uni-1.601
 Interpret-1.595
uti-1.578
 injure-1.573
Token Defense
Token ID5947
Feature activation-8.75e-02

Pos logit contributions
 Stamp+0.478
sein+0.473
idium+0.462
Disclaimer+0.458
agate+0.438
Neg logit contributions
utton-0.567
 Interpret-0.566
uni-0.562
uti-0.550
 injure-0.548
Token
Token ID447
Feature activation-6.09e-02

Pos logit contributions
 Stamp+0.260
sein+0.259
idium+0.252
Disclaimer+0.251
+0.239
Neg logit contributions
 Interpret-0.368
utton-0.365
uni-0.362
uti-0.355
 injure-0.354
Token
Token ID247
Feature activation-1.07e-01

Pos logit contributions
sein+0.261
 Stamp+0.259
idium+0.255
Disclaimer+0.254
agate+0.248
Neg logit contributions
 Interpret-0.177
utton-0.174
uni-0.172
 injure-0.169
 taunt-0.168
Tokens
Token ID82
Feature activation+3.92e-04

Pos logit contributions
sein+0.331
 Stamp+0.330
idium+0.320
Disclaimer+0.320
agate+0.305
Neg logit contributions
 Interpret-0.437
utton-0.433
uni-0.430
 injure-0.423
uti-0.422
Token is
Token ID318
Feature activation-4.21e-02

Pos logit contributions
sein+0.379
 Stamp+0.378
Disclaimer+0.367
idium+0.366
+0.352
Neg logit contributions
 Interpret-0.477
utton-0.471
uni-0.467
uti-0.456
 injure-0.453
Token also
Token ID635
Feature activation+4.47e-02

Pos logit contributions
 Stamp+0.142
sein+0.141
Disclaimer+0.138
idium+0.137
+0.132
Neg logit contributions
 Interpret-0.161
utton-0.159
uni-0.158
uti-0.154
 injure-0.153
Token my
Token ID616
Feature activation-2.92e-01

Pos logit contributions
 Interpret+0.175
utton+0.173
uni+0.171
uti+0.168
 injure+0.167
Neg logit contributions
 Stamp-0.147
sein-0.146
Disclaimer-0.142
idium-0.142
-0.136
Token mistake
Token ID7457
Feature activation-4.83e-01

Pos logit contributions
 Stamp+1.136
sein+1.125
Disclaimer+1.119
idium+1.105
 spinning+1.064
Neg logit contributions
 Interpret-0.952
utton-0.948
uni-0.936
uti-0.918
 injure-0.908
Token and
Token ID290
Feature activation+1.07e-01

Pos logit contributions
sein+1.444
 Stamp+1.426
Disclaimer+1.403
idium+1.382
+1.341
Neg logit contributions
 Interpret-2.011
utton-1.992
uni-1.980
uti-1.946
 injure-1.925
Token our
Token ID674
Feature activation+4.42e-02

Pos logit contributions
 Interpret+0.412
utton+0.407
uni+0.403
uti+0.396
 injure+0.392
Neg logit contributions
 Stamp-0.355
sein-0.354
Disclaimer-0.345
idium-0.344
-0.329
Token mistake
Token ID7457
Feature activation-5.15e-01

Pos logit contributions
 Interpret+0.147
utton+0.146
uni+0.145
uti+0.142
 injure+0.139
Neg logit contributions
 Stamp-0.171
sein-0.169
Disclaimer-0.167
idium-0.166
 spinning-0.159
Token and
Token ID290
Feature activation+3.67e-01

Pos logit contributions
sein+1.554
 Stamp+1.534
Disclaimer+1.507
idium+1.490
+1.441
Neg logit contributions
 Interpret-2.135
utton-2.103
uni-2.087
uti-2.065
 injure-2.032
Token we
Token ID356
Feature activation-2.19e-01

Pos logit contributions
 Interpret+1.324
utton+1.306
uni+1.295
uti+1.266
 injure+1.264
Neg logit contributions
 Stamp-1.319
sein-1.318
idium-1.284
Disclaimer-1.280
agate-1.228
Token all
Token ID477
Feature activation-1.38e-01

Pos logit contributions
 Stamp+0.795
sein+0.794
Disclaimer+0.776
idium+0.774
agate+0.746
Neg logit contributions
 Interpret-0.776
utton-0.767
uni-0.757
uti-0.743
 injure-0.732
Token feel
Token ID1254
Feature activation-2.49e-02

Pos logit contributions
 Stamp+0.520
sein+0.519
Disclaimer+0.508
idium+0.506
agate+0.489
Neg logit contributions
 Interpret-0.473
utton-0.467
uni-0.460
uti-0.451
 taunt-0.445
Token,
Token ID11
Feature activation-2.73e-01

Pos logit contributions
 Interpret+0.717
utton+0.708
uni+0.699
uti+0.690
 taunt+0.688
Neg logit contributions
sein-0.592
 Stamp-0.585
idium-0.575
Disclaimer-0.571
-0.547
Token if
Token ID611
Feature activation-3.21e-01

Pos logit contributions
 Stamp+0.842
sein+0.835
Disclaimer+0.812
idium+0.812
+0.769
Neg logit contributions
 Interpret-1.121
utton-1.108
uni-1.102
uti-1.079
 injure-1.074
Token they
Token ID484
Feature activation-8.67e-02

Pos logit contributions
 Stamp+1.138
sein+1.126
Disclaimer+1.103
idium+1.093
 spinning+1.051
Neg logit contributions
 Interpret-1.176
utton-1.158
uni-1.153
uti-1.126
 injure-1.120
Token say
Token ID910
Feature activation+3.25e-01

Pos logit contributions
 Stamp+0.328
sein+0.327
idium+0.319
Disclaimer+0.318
agate+0.307
Neg logit contributions
 Interpret-0.296
utton-0.292
uni-0.289
uti-0.283
 injure-0.280
Token anything
Token ID1997
Feature activation-3.37e-02

Pos logit contributions
 Interpret+1.190
utton+1.175
uni+1.164
uti+1.141
 injure+1.140
Neg logit contributions
sein-1.148
 Stamp-1.140
idium-1.116
Disclaimer-1.113
-1.072
Token at
Token ID379
Feature activation-7.37e-02

Pos logit contributions
sein+0.112
 Stamp+0.112
idium+0.109
Disclaimer+0.108
agate+0.104
Neg logit contributions
 Interpret-0.130
utton-0.129
uni-0.128
 taunt-0.125
uti-0.125
Token all
Token ID477
Feature activation-2.54e-01

Pos logit contributions
 Stamp+0.226
sein+0.222
idium+0.217
Disclaimer+0.216
agate+0.205
Neg logit contributions
 Interpret-0.306
utton-0.302
uni-0.300
uti-0.297
 injure-0.295
Token.
Token ID13
Feature activation+2.70e-02

Pos logit contributions
 Stamp+0.750
sein+0.745
Disclaimer+0.724
idium+0.723
agate+0.687
Neg logit contributions
 Interpret-1.075
utton-1.063
uni-1.054
uti-1.035
 injure-1.032
Token And
Token ID843
Feature activation-2.41e-02

Pos logit contributions
 Interpret+0.101
utton+0.099
uni+0.099
uti+0.097
 injure+0.096
Neg logit contributions
 Stamp-0.094
sein-0.093
idium-0.091
Disclaimer-0.091
agate-0.087
Token HP
Token ID6574
Feature activation-1.06e-01

Pos logit contributions
 Stamp+0.076
sein+0.076
idium+0.074
Disclaimer+0.073
agate+0.070
Neg logit contributions
 Interpret-0.097
utton-0.096
uni-0.096
uti-0.094
 injure-0.093
Token suggests
Token ID5644
Feature activation+1.07e-01

Pos logit contributions
 Stamp+0.342
sein+0.341
idium+0.331
Disclaimer+0.331
agate+0.314
Neg logit contributions
 Interpret-0.422
utton-0.418
uni-0.414
uti-0.406
 injure-0.405
Token the
Token ID262
Feature activation+3.91e-01

Pos logit contributions
 Interpret+0.280
utton+0.276
uni+0.274
uti+0.269
 injure+0.267
Neg logit contributions
 Stamp-0.232
sein-0.230
idium-0.224
Disclaimer-0.224
agate-0.213
Token basket
Token ID7988
Feature activation-2.29e-02

Pos logit contributions
 Interpret+1.370
utton+1.349
uni+1.337
uti+1.306
 injure+1.305
Neg logit contributions
sein-1.449
 Stamp-1.445
idium-1.412
Disclaimer-1.407
-1.357
Token.
Token ID13
Feature activation-3.19e-02

Pos logit contributions
 Stamp+0.068
sein+0.067
Disclaimer+0.065
idium+0.065
agate+0.061
Neg logit contributions
 Interpret-0.098
utton-0.096
uni-0.096
uti-0.094
 injure-0.093
Token
Token ID198
Feature activation-3.68e-01

Pos logit contributions
 Stamp+0.110
sein+0.109
idium+0.106
Disclaimer+0.106
agate+0.102
Neg logit contributions
 Interpret-0.120
utton-0.119
uni-0.118
uti-0.115
 injure-0.115
Token
Token ID198
Feature activation-3.25e-01

Pos logit contributions
 Stamp+1.415
sein+1.413
Disclaimer+1.387
idium+1.369
agate+1.320
Neg logit contributions
 Interpret-1.227
utton-1.215
uni-1.201
 injure-1.185
uti-1.176
TokenSo
Token ID2396
Feature activation-7.04e-01

Pos logit contributions
sein+1.058
 Stamp+1.057
Disclaimer+1.053
idium+1.026
+0.983
Neg logit contributions
 Interpret-1.276
utton-1.264
uni-1.248
 injure-1.235
 taunt-1.231
Token get
Token ID651
Feature activation-6.06e-01

Pos logit contributions
 Stamp+2.328
sein+2.282
Disclaimer+2.244
idium+2.221
 spinning+2.141
Neg logit contributions
 Interpret-2.698
utton-2.682
uni-2.672
uti-2.612
 taunt-2.561
Token out
Token ID503
Feature activation-9.64e-01

Pos logit contributions
 Stamp+1.933
sein+1.864
Disclaimer+1.851
idium+1.831
 spinning+1.778
Neg logit contributions
utton-2.429
 Interpret-2.420
uni-2.397
uti-2.356
 injure-2.315
Token there
Token ID612
Feature activation+2.29e-01

Pos logit contributions
 Stamp+3.407
sein+3.297
Disclaimer+3.195
idium+3.192
 spinning+3.171
Neg logit contributions
utton-3.482
 Interpret-3.462
uni-3.397
 injure-3.279
uti-3.275
Token,
Token ID11
Feature activation-4.82e-01

Pos logit contributions
 Interpret+0.922
utton+0.911
uni+0.903
 injure+0.891
 taunt+0.889
Neg logit contributions
sein-0.724
 Stamp-0.716
idium-0.702
Disclaimer-0.700
-0.668
Token and
Token ID290
Feature activation+3.28e-02

Pos logit contributions
 Stamp+1.770
sein+1.706
Disclaimer+1.685
idium+1.676
 spinning+1.634
Neg logit contributions
 Interpret-1.702
utton-1.693
uni-1.685
uti-1.650
 taunt-1.592
Token,
Token ID11
Feature activation+3.47e-01

Pos logit contributions
 Interpret+0.556
utton+0.550
uni+0.547
uti+0.537
 injure+0.536
Neg logit contributions
 Stamp-0.318
sein-0.317
idium-0.306
Disclaimer-0.305
agate-0.288
Token the
Token ID262
Feature activation+1.63e-01

Pos logit contributions
 Interpret+1.311
utton+1.286
uni+1.268
 injure+1.251
 taunt+1.243
Neg logit contributions
sein-1.198
 Stamp-1.174
idium-1.161
Disclaimer-1.152
-1.111
Token old
Token ID1468
Feature activation+2.71e-01

Pos logit contributions
 Interpret+0.634
utton+0.626
uni+0.621
uti+0.608
 injure+0.607
Neg logit contributions
sein-0.542
 Stamp-0.542
idium-0.527
Disclaimer-0.526
agate-0.502
Token Pages
Token ID28221
Feature activation-1.56e-01

Pos logit contributions
 Interpret+1.108
utton+1.092
uni+1.083
uti+1.059
 taunt+1.058
Neg logit contributions
sein-0.846
 Stamp-0.838
idium-0.821
Disclaimer-0.814
-0.781
Token made
Token ID925
Feature activation-1.46e-01

Pos logit contributions
 Stamp+0.535
sein+0.527
Disclaimer+0.516
idium+0.511
agate+0.492
Neg logit contributions
 Interpret-0.588
utton-0.585
uni-0.581
uti-0.569
 injure-0.567
Token it
Token ID340
Feature activation-7.49e-01

Pos logit contributions
 Stamp+0.494
sein+0.486
Disclaimer+0.476
idium+0.476
+0.453
Neg logit contributions
 Interpret-0.559
utton-0.554
uni-0.551
uti-0.541
 injure-0.537
Token possible
Token ID1744
Feature activation-3.70e-02

Pos logit contributions
 Stamp+2.332
sein+2.300
Disclaimer+2.264
idium+2.221
 spinning+2.157
Neg logit contributions
 Interpret-3.005
utton-2.996
uni-2.927
uti-2.918
 taunt-2.858
Token to
Token ID284
Feature activation+2.73e-01

Pos logit contributions
sein+0.119
 Stamp+0.119
Disclaimer+0.116
idium+0.115
+0.110
Neg logit contributions
 Interpret-0.147
utton-0.145
uni-0.144
uti-0.141
 injure-0.141
Token link
Token ID2792
Feature activation+5.13e-02

Pos logit contributions
 Interpret+0.937
utton+0.923
uni+0.915
uti+0.893
 injure+0.893
Neg logit contributions
sein-1.031
 Stamp-1.030
idium-1.005
Disclaimer-1.003
agate-0.963
Token text
Token ID2420
Feature activation-3.30e-01

Pos logit contributions
 Interpret+0.202
utton+0.199
uni+0.198
uti+0.194
 injure+0.193
Neg logit contributions
 Stamp-0.168
sein-0.168
idium-0.163
Disclaimer-0.162
agate-0.155
Token boxes
Token ID10559
Feature activation-1.33e-01

Pos logit contributions
 Stamp+1.047
sein+1.039
idium+1.010
Disclaimer+1.009
agate+0.962
Neg logit contributions
 Interpret-1.330
utton-1.313
uni-1.306
uti-1.278
 injure-1.277
Token perhaps
Token ID3737
Feature activation+1.50e-01

Pos logit contributions
 Stamp+1.309
sein+1.294
Disclaimer+1.260
idium+1.258
agate+1.210
Neg logit contributions
 Interpret-1.584
utton-1.547
uni-1.544
uti-1.514
 injure-1.502
Token they
Token ID484
Feature activation-1.79e-01

Pos logit contributions
 Interpret+0.562
utton+0.555
uni+0.551
uti+0.540
 injure+0.537
Neg logit contributions
 Stamp-0.522
sein-0.519
idium-0.505
Disclaimer-0.505
agate-0.483
Token shouldn
Token ID6584
Feature activation-1.28e-01

Pos logit contributions
 Stamp+0.571
sein+0.567
idium+0.550
Disclaimer+0.548
agate+0.528
Neg logit contributions
 Interpret-0.713
utton-0.706
uni-0.699
uti-0.685
 injure-0.673
Token't
Token ID470
Feature activation-5.80e-02

Pos logit contributions
 Stamp+0.419
sein+0.419
Disclaimer+0.406
idium+0.406
agate+0.388
Neg logit contributions
 Interpret-0.500
utton-0.494
uni-0.489
uti-0.480
 injure-0.480
Token celebrate
Token ID10648
Feature activation+2.04e-02

Pos logit contributions
 Stamp+0.202
sein+0.200
idium+0.195
Disclaimer+0.193
agate+0.188
Neg logit contributions
 Interpret-0.215
utton-0.213
uni-0.211
uti-0.209
 injure-0.201
Token too
Token ID1165
Feature activation-7.59e-01

Pos logit contributions
 Interpret+0.079
utton+0.078
uni+0.078
uti+0.076
 injure+0.076
Neg logit contributions
 Stamp-0.068
sein-0.067
idium-0.065
Disclaimer-0.065
agate-0.062
Token quickly
Token ID2952
Feature activation-1.09e-01

Pos logit contributions
 Stamp+1.774
sein+1.751
idium+1.661
Disclaimer+1.654
 spinning+1.573
Neg logit contributions
 Interpret-3.681
utton-3.633
uni-3.614
uti-3.554
 injure-3.535
Token.
Token ID13
Feature activation-5.56e-02

Pos logit contributions
 Stamp+0.340
sein+0.337
idium+0.327
Disclaimer+0.327
agate+0.310
Neg logit contributions
 Interpret-0.444
utton-0.438
uni-0.437
uti-0.428
 injure-0.425
Token And
Token ID843
Feature activation+1.10e-01

Pos logit contributions
 Stamp+0.208
sein+0.205
Disclaimer+0.201
idium+0.198
+0.191
Neg logit contributions
utton-0.193
 Interpret-0.193
uni-0.191
 injure-0.188
uti-0.187
Token don
Token ID836
Feature activation+9.76e-02

Pos logit contributions
 Interpret+0.407
utton+0.402
uni+0.398
uti+0.390
 injure+0.389
Neg logit contributions
 Stamp-0.384
sein-0.384
idium-0.374
Disclaimer-0.373
agate-0.357
Token't
Token ID470
Feature activation+7.40e-03

Pos logit contributions
 Interpret+0.369
utton+0.365
uni+0.361
uti+0.353
 taunt+0.351
Neg logit contributions
 Stamp-0.334
sein-0.333
idium-0.327
Disclaimer-0.323
agate-0.310
Token 1
Token ID352
Feature activation-4.38e-01

Pos logit contributions
 Interpret+0.389
utton+0.378
uni+0.373
 taunt+0.364
uti+0.364
Neg logit contributions
sein-0.456
 Stamp-0.453
idium-0.448
Disclaimer-0.442
agate-0.432
Tokenst
Token ID301
Feature activation-6.74e-01

Pos logit contributions
 Stamp+1.079
sein+1.076
idium+1.039
Disclaimer+1.012
+0.993
Neg logit contributions
 Interpret-2.050
utton-2.030
uni-2.020
 injure-1.995
uti-1.985
Token,
Token ID11
Feature activation-4.06e-01

Pos logit contributions
sein+1.722
 Stamp+1.703
idium+1.639
Disclaimer+1.622
+1.578
Neg logit contributions
 Interpret-3.083
utton-3.065
uni-3.028
 injure-2.988
uti-2.981
Token in
Token ID287
Feature activation-1.88e-01

Pos logit contributions
 Stamp+1.157
sein+1.139
idium+1.104
Disclaimer+1.103
 spinning+1.040
Neg logit contributions
 Interpret-1.759
utton-1.747
uni-1.736
uti-1.701
 injure-1.689
Token a
Token ID257
Feature activation-2.01e-01

Pos logit contributions
 Stamp+0.608
sein+0.604
idium+0.584
Disclaimer+0.584
+0.554
Neg logit contributions
 Interpret-0.743
utton-0.737
uni-0.731
uti-0.718
 injure-0.715
Token surprise
Token ID5975
Feature activation-7.97e-01

Pos logit contributions
 Stamp+0.726
sein+0.719
idium+0.699
Disclaimer+0.698
agate+0.668
Neg logit contributions
 Interpret-0.722
utton-0.716
uni-0.709
uti-0.695
 injure-0.690
Token visit
Token ID3187
Feature activation+7.97e-02

Pos logit contributions
 Stamp+2.715
sein+2.643
Disclaimer+2.571
idium+2.547
 spinning+2.515
Neg logit contributions
uni-2.978
 Interpret-2.976
utton-2.967
uti-2.935
 injure-2.825
Token to
Token ID284
Feature activation+1.16e-03

Pos logit contributions
 Interpret+0.325
utton+0.322
uni+0.319
uti+0.313
 injure+0.312
Neg logit contributions
 Stamp-0.248
sein-0.248
idium-0.241
Disclaimer-0.240
agate-0.229
Token B
Token ID347
Feature activation-3.54e-02

Pos logit contributions
 Interpret+0.005
utton+0.005
uni+0.005
uti+0.005
 injure+0.005
Neg logit contributions
 Stamp-0.004
sein-0.004
idium-0.004
Disclaimer-0.004
agate-0.003
Tokenagram
Token ID6713
Feature activation-1.25e-01

Pos logit contributions
sein+0.093
 Stamp+0.092
idium+0.090
Disclaimer+0.090
agate+0.084
Neg logit contributions
 Interpret-0.163
utton-0.159
uni-0.158
uti-0.156
 taunt-0.156
Token Air
Token ID3701
Feature activation+4.71e-01

Pos logit contributions
 Stamp+0.377
sein+0.377
idium+0.366
Disclaimer+0.365
+0.347
Neg logit contributions
 Interpret-0.519
utton-0.515
uni-0.509
uti-0.503
 injure-0.502
Tokenmethyl
Token ID43654
Feature activation-4.19e-01

Pos logit contributions
 Interpret+0.128
utton+0.126
uni+0.125
uti+0.122
 injure+0.122
Neg logit contributions
 Stamp-0.109
sein-0.109
idium-0.107
Disclaimer-0.106
agate-0.102
Token-
Token ID12
Feature activation-4.47e-01

Pos logit contributions
 Stamp+1.340
sein+1.340
idium+1.316
Disclaimer+1.274
agate+1.253
Neg logit contributions
 Interpret-1.680
utton-1.650
uti-1.614
uni-1.613
 injure-1.602
Token7
Token ID22
Feature activation-4.15e-01

Pos logit contributions
 Stamp+1.323
sein+1.322
idium+1.302
Disclaimer+1.293
agate+1.235
Neg logit contributions
 Interpret-1.884
utton-1.855
uni-1.828
 injure-1.818
uti-1.812
Token-(
Token ID30420
Feature activation-5.52e-01

Pos logit contributions
 Stamp+1.121
sein+1.116
Disclaimer+1.082
idium+1.082
+1.021
Neg logit contributions
 Interpret-1.855
utton-1.845
uni-1.812
uti-1.796
 injure-1.786
Token2
Token ID17
Feature activation-2.17e-01

Pos logit contributions
sein+1.506
 Stamp+1.482
idium+1.471
Disclaimer+1.462
agate+1.394
Neg logit contributions
 Interpret-2.441
utton-2.408
uni-2.365
 injure-2.358
 taunt-2.354
Token,
Token ID11
Feature activation-7.46e-01

Pos logit contributions
sein+0.598
 Stamp+0.593
idium+0.580
Disclaimer+0.573
+0.549
Neg logit contributions
 Interpret-0.963
utton-0.954
uni-0.943
uti-0.931
 taunt-0.923
Token6
Token ID21
Feature activation-4.20e-01

Pos logit contributions
sein+1.888
idium+1.882
Disclaimer+1.828
 Stamp+1.828
+1.759
Neg logit contributions
 Interpret-3.434
utton-3.376
uni-3.322
 taunt-3.318
uti-3.312
Token-
Token ID12
Feature activation-5.49e-01

Pos logit contributions
sein+1.160
 Stamp+1.140
idium+1.127
Disclaimer+1.113
+1.067
Neg logit contributions
 Interpret-1.861
utton-1.831
uni-1.812
uti-1.799
 injure-1.782
Tokendim
Token ID27740
Feature activation-5.34e-01

Pos logit contributions
idium+1.932
sein+1.931
 Stamp+1.898
Disclaimer+1.883
agate+1.826
Neg logit contributions
 Interpret-2.009
utton-1.963
 injure-1.926
uni-1.918
uti-1.918
Tokeneth
Token ID2788
Feature activation-7.91e-01

Pos logit contributions
idium+1.489
sein+1.452
 Stamp+1.451
Disclaimer+1.386
agate+1.375
Neg logit contributions
 Interpret-2.358
 injure-2.304
utton-2.284
 taunt-2.264
uni-2.256
Tokenoxy
Token ID23536
Feature activation-9.80e-01

Pos logit contributions
idium+1.891
sein+1.818
 Stamp+1.759
Disclaimer+1.670
agate+1.657
Neg logit contributions
 Interpret-3.892
 injure-3.786
 taunt-3.739
utton-3.726
uni-3.712