TOP ACTIVATIONS
MAX = 1.242

NFL : M ord ica is the fastest back
year - old , m op - headed father of three
number called The Count , which is
. IGN : Sure . Before we move on
DOWN UP ON PAR AM OUNT P ICT URES RE QUEST
episodes of 19 Kids and Count ing from the air .
got that ? Sure , I got it .
DOWN UP ON PAR AM OUNT P ICT URES RE QUEST
s )} 6 . N xd 4 { ( 7 s
A bitters weet frenzy ? Sure , why not ?
pressure in Congress for her impeachment . The Brazilian
former President Barack Obama Barack Hussein Obama With low birth rate
former bankruptcy attorney the most potent -- and dangerous -- American
the two m aint ained frequent , direct contact
of the government of President Dil ma Rousse ff and no
of cash . ( Techn ically , no Alberta budget for
Cuba present . Count ries that once acquies ced
to the walls .. n ail the furniture down and strap
and even before the first pair of sneakers sque ak on
for consoles such as NES , Super NES and

MIN ACTIVATIONS
MIN = -1.133

, ($ i * $ FontSize * 2 + $ s
and evolution . TL ; DR : I propose we work
program , marks a new chapter in appreciation of more adventurous
must consider the pros and cons . For example , maybe
universe . We can 't wait for you to see it
her tightly . She gr ins at me . God ,
devices on all previous Mars ro vers , which have examined
the other . The old guard of medicine also turned to
- master . zip . Wait until you see the add
" Def endants ' failure to provide any form of
judge , so his administration turned its eye to a different
S Im person ation Sc am Reporting form on
Bills : G Richie Inc ogn ito Contract :
nowhere , the mouse Sp az team first qualified for MLG
Civil War was a dark chapter in America 's history .
a murder mystery with a minor romantic sub plot called An
Manson , representing the local chapter of the NAACP , has
McDonnell in return were relatively minor , such as introducing Williams
possession and possession of drug parap hern alia . It
, Go ce 's imminent descent is interesting on a number

INTERVAL 0.648 - 1.242
CONTAINS 0.155%

a ledge or drop - off where the only alternative appears
do you then develop a core curriculum ? A
Among those ens n ared by the wide - ranging
t find an answer to this question .
on all the best - of lists , even influenced fashion

INTERVAL 0.055 - 0.648
CONTAINS 35.977%

shit hits the fan . Luckily the Work place Safety and
they are connected to the Olympics . Even pubs can 't
itionally while others will have conditions , such as going through
go 3 D , housed in what was then the N
his ses , and compression that characterize such sound files are

INTERVAL -0.539 - 0.055
CONTAINS 63.352%

and down , and you realize , in a moment of
to the Motion Picture Academy and to the filmmakers behind the
7 days , implementing methods add Points ( user , points
to be more gracious than his legion of Trump kins ."
the fictional book mentioned in the 1984 movie . From the

INTERVAL -1.133 - -0.539
CONTAINS 0.516%

, it added .- Trade Arab ia News Service <|endoftext|> By
of the post - 1945 world order . With
document , but the current version of Pages doesn 't .
that Team USA is so turned off by his character that
or movement , dressing and grooming , and possibly certain behaviors
Token NFL
Token ID5134
Feature activation-1.69e-01

Pos logit contributions
lihood+0.091
VERTISEMENT+0.077
+0.076
archives+0.070
iso+0.070
Neg logit contributions
 crooked-0.073
 negligent-0.071
 mutated-0.071
 tablet-0.069
 distracted-0.066
Token:
Token ID25
Feature activation+3.04e-01

Pos logit contributions
 crooked+0.424
 negligent+0.413
 mutated+0.413
 tablet+0.405
 distracted+0.389
Neg logit contributions
lihood-0.490
VERTISEMENT-0.412
-0.409
iso-0.381
archives-0.379
Token
Token ID564
Feature activation-8.88e-02

Pos logit contributions
lihood+0.854
VERTISEMENT+0.709
+0.697
archives+0.644
iso+0.641
Neg logit contributions
 crooked-0.806
 negligent-0.788
 mutated-0.788
 tablet-0.766
 distracted-0.737
Token
Token ID250
Feature activation-1.09e-01

Pos logit contributions
 crooked+0.164
 mutated+0.160
 negligent+0.159
 tablet+0.153
 distracted+0.144
Neg logit contributions
lihood-0.319
VERTISEMENT-0.278
-0.273
archives-0.258
iso-0.257
TokenM
Token ID44
Feature activation+2.34e-01

Pos logit contributions
 crooked+0.307
 negligent+0.300
 mutated+0.297
 tablet+0.289
vered+0.284
Neg logit contributions
lihood-0.283
VERTISEMENT-0.234
-0.224
archives-0.206
iso-0.204
Tokenord
Token ID585
Feature activation+1.24e+00

Pos logit contributions
lihood+0.827
VERTISEMENT+0.719
+0.708
archives+0.665
iso+0.659
Neg logit contributions
 crooked-0.449
 mutated-0.437
 negligent-0.436
 tablet-0.422
vered-0.399
Tokenica
Token ID3970
Feature activation+4.86e-02

Pos logit contributions
lihood+3.468
VERTISEMENT+2.910
+2.839
iso+2.660
archives+2.648
Neg logit contributions
 crooked-3.250
 mutated-3.199
 negligent-3.198
 tablet-3.095
 distracted-2.978
Token is
Token ID318
Feature activation+5.37e-02

Pos logit contributions
lihood+0.159
VERTISEMENT+0.137
+0.134
archives+0.126
iso+0.125
Neg logit contributions
 crooked-0.107
 mutated-0.105
 negligent-0.104
 tablet-0.100
 distracted-0.097
Token the
Token ID262
Feature activation+4.15e-02

Pos logit contributions
lihood+0.176
VERTISEMENT+0.151
+0.148
iso+0.139
archives+0.139
Neg logit contributions
 crooked-0.119
 mutated-0.115
 negligent-0.115
 tablet-0.111
 toe-0.107
Token fastest
Token ID14162
Feature activation-2.35e-01

Pos logit contributions
lihood+0.133
VERTISEMENT+0.114
+0.111
archives+0.105
iso+0.104
Neg logit contributions
 crooked-0.097
 mutated-0.093
 negligent-0.093
 tablet-0.091
 toe-0.087
Token back
Token ID736
Feature activation+1.38e-01

Pos logit contributions
 crooked+0.520
 mutated+0.507
 negligent+0.499
 tablet+0.492
 toe+0.467
Neg logit contributions
lihood-0.774
VERTISEMENT-0.668
-0.656
archives-0.616
iso-0.613
Tokenyear
Token ID1941
Feature activation-3.44e-02

Pos logit contributions
lihood+0.425
VERTISEMENT+0.352
+0.333
archives+0.300
iso+0.294
Neg logit contributions
 crooked-0.480
 mutated-0.473
 negligent-0.472
 tablet-0.465
vered-0.447
Token-
Token ID12
Feature activation+1.82e-02

Pos logit contributions
 crooked+0.095
 mutated+0.094
 negligent+0.094
 tablet+0.091
 distracted+0.088
Neg logit contributions
lihood-0.092
VERTISEMENT-0.077
-0.074
archives-0.068
iso-0.068
Tokenold
Token ID727
Feature activation+2.56e-01

Pos logit contributions
lihood+0.062
VERTISEMENT+0.054
+0.053
archives+0.049
iso+0.049
Neg logit contributions
 crooked-0.037
 mutated-0.036
 negligent-0.036
 tablet-0.034
vered-0.033
Token,
Token ID11
Feature activation-1.95e-01

Pos logit contributions
lihood+0.827
VERTISEMENT+0.714
+0.697
archives+0.653
iso+0.648
Neg logit contributions
 crooked-0.583
 negligent-0.566
 mutated-0.565
 tablet-0.554
 distracted-0.525
Token m
Token ID285
Feature activation+1.27e-01

Pos logit contributions
 crooked+0.450
 negligent+0.437
 mutated+0.435
 tablet+0.425
 distracted+0.407
Neg logit contributions
lihood-0.632
VERTISEMENT-0.539
-0.526
archives-0.493
iso-0.491
Tokenop
Token ID404
Feature activation+1.21e+00

Pos logit contributions
lihood+0.422
VERTISEMENT+0.358
+0.352
archives+0.327
iso+0.319
Neg logit contributions
 crooked-0.275
 negligent-0.269
 mutated-0.267
 tablet-0.264
vered-0.256
Token-
Token ID12
Feature activation-2.06e-02

Pos logit contributions
lihood+3.540
VERTISEMENT+2.992
+2.930
archives+2.717
iso+2.703
Neg logit contributions
 crooked-3.046
 mutated-2.987
 negligent-2.982
 tablet-2.896
 distracted-2.774
Tokenheaded
Token ID15353
Feature activation+1.57e-01

Pos logit contributions
 crooked+0.046
 negligent+0.045
 mutated+0.045
 tablet+0.043
 toe+0.042
Neg logit contributions
lihood-0.067
VERTISEMENT-0.057
-0.056
iso-0.051
archives-0.051
Token father
Token ID2988
Feature activation-8.60e-02

Pos logit contributions
lihood+0.510
VERTISEMENT+0.440
+0.431
archives+0.403
iso+0.401
Neg logit contributions
 crooked-0.357
 mutated-0.346
 negligent-0.346
 tablet-0.338
 distracted-0.321
Token of
Token ID286
Feature activation-5.51e-02

Pos logit contributions
 crooked+0.223
 negligent+0.219
 mutated+0.219
 tablet+0.213
 distracted+0.205
Neg logit contributions
lihood-0.247
VERTISEMENT-0.209
-0.202
archives-0.187
iso-0.186
Token three
Token ID1115
Feature activation-3.03e-01

Pos logit contributions
 crooked+0.126
 mutated+0.123
 negligent+0.122
 tablet+0.120
 distracted+0.114
Neg logit contributions
lihood-0.178
VERTISEMENT-0.151
-0.148
iso-0.139
archives-0.138
Token number
Token ID1271
Feature activation+2.56e-01

Pos logit contributions
 crooked+0.458
 mutated+0.451
 negligent+0.447
 tablet+0.439
 distracted+0.418
Neg logit contributions
lihood-0.562
VERTISEMENT-0.479
-0.471
archives-0.442
iso-0.428
Token called
Token ID1444
Feature activation+1.75e-01

Pos logit contributions
lihood+0.745
VERTISEMENT+0.633
+0.616
archives+0.574
iso+0.568
Neg logit contributions
 crooked-0.658
 mutated-0.646
 negligent-0.642
 tablet-0.625
 distracted-0.601
Token
Token ID564
Feature activation+2.16e-01

Pos logit contributions
lihood+0.501
VERTISEMENT+0.420
+0.411
archives+0.386
iso+0.381
Neg logit contributions
 crooked-0.452
 mutated-0.442
 negligent-0.441
 tablet-0.427
 toe-0.411
Token
Token ID250
Feature activation+2.78e-03

Pos logit contributions
lihood+0.793
VERTISEMENT+0.693
+0.677
archives+0.647
iso+0.642
Neg logit contributions
 crooked-0.387
 negligent-0.375
 mutated-0.373
 tablet-0.357
vered-0.338
TokenThe
Token ID464
Feature activation-2.09e-01

Pos logit contributions
lihood+0.008
VERTISEMENT+0.006
+0.006
archives+0.006
iso+0.006
Neg logit contributions
 crooked-0.008
 mutated-0.007
 negligent-0.007
 tablet-0.007
vered-0.007
Token Count
Token ID2764
Feature activation+1.19e+00

Pos logit contributions
 crooked+0.487
 mutated+0.473
 negligent+0.470
 tablet+0.457
 Went+0.439
Neg logit contributions
lihood-0.653
VERTISEMENT-0.558
-0.547
archives-0.516
iso-0.503
Token,
Token ID11
Feature activation-3.65e-01

Pos logit contributions
lihood+3.259
VERTISEMENT+2.722
+2.656
archives+2.448
iso+2.427
Neg logit contributions
 crooked-3.231
 negligent-3.167
 mutated-3.164
 tablet-3.083
 distracted-2.961
Token
Token ID447
Feature activation-1.00e-01

Pos logit contributions
 crooked+0.895
 mutated+0.869
 negligent+0.864
 tablet+0.837
 toe+0.804
Neg logit contributions
lihood-1.111
VERTISEMENT-0.950
-0.908
archives-0.866
iso-0.853
Token
Token ID251
Feature activation-4.66e-02

Pos logit contributions
 crooked+0.169
 negligent+0.163
 mutated+0.162
 tablet+0.155
vered+0.146
Neg logit contributions
lihood-0.377
VERTISEMENT-0.331
-0.323
iso-0.309
archives-0.309
Token which
Token ID543
Feature activation-1.73e-01

Pos logit contributions
 crooked+0.117
 mutated+0.113
 negligent+0.113
 tablet+0.110
 distracted+0.105
Neg logit contributions
lihood-0.139
VERTISEMENT-0.119
-0.115
archives-0.108
iso-0.107
Token is
Token ID318
Feature activation-1.51e-01

Pos logit contributions
 crooked+0.401
 mutated+0.396
 negligent+0.388
 tablet+0.377
 distracted+0.363
Neg logit contributions
lihood-0.544
VERTISEMENT-0.473
-0.460
archives-0.428
iso-0.427
Token.
Token ID13
Feature activation+2.90e-02

Pos logit contributions
 crooked+0.399
 mutated+0.397
 negligent+0.389
 tablet+0.383
 distracted+0.370
Neg logit contributions
lihood-0.457
VERTISEMENT-0.380
-0.377
archives-0.352
iso-0.344
Token
Token ID198
Feature activation+1.29e-01

Pos logit contributions
lihood+0.083
VERTISEMENT+0.070
+0.068
archives+0.064
iso+0.062
Neg logit contributions
 crooked-0.074
 mutated-0.073
 negligent-0.073
 tablet-0.071
 Went-0.070
Token
Token ID198
Feature activation+4.76e-02

Pos logit contributions
lihood+0.343
VERTISEMENT+0.286
+0.277
archives+0.254
iso+0.252
Neg logit contributions
 crooked-0.360
 mutated-0.353
 negligent-0.353
 tablet-0.344
vered-0.331
TokenIGN
Token ID16284
Feature activation-7.03e-02

Pos logit contributions
lihood+0.168
VERTISEMENT+0.143
+0.141
archives+0.133
iso+0.132
Neg logit contributions
 crooked-0.090
 negligent-0.088
 mutated-0.088
 tablet-0.086
vered-0.082
Token:
Token ID25
Feature activation+1.06e-01

Pos logit contributions
 crooked+0.185
 mutated+0.181
 negligent+0.181
 tablet+0.176
vered+0.170
Neg logit contributions
lihood-0.196
VERTISEMENT-0.166
-0.159
archives-0.148
iso-0.147
Token Sure
Token ID10889
Feature activation+1.14e+00

Pos logit contributions
lihood+0.314
VERTISEMENT+0.268
+0.262
archives+0.246
iso+0.241
Neg logit contributions
 crooked-0.263
 mutated-0.258
 negligent-0.256
 tablet-0.250
 Went-0.242
Token.
Token ID13
Feature activation+1.97e-01

Pos logit contributions
lihood+3.117
VERTISEMENT+2.604
+2.543
archives+2.336
iso+2.327
Neg logit contributions
 crooked-3.050
 negligent-2.990
 mutated-2.986
 tablet-2.901
vered-2.788
Token Before
Token ID7413
Feature activation+1.48e-01

Pos logit contributions
lihood+0.566
VERTISEMENT+0.478
+0.466
archives+0.435
iso+0.428
Neg logit contributions
 crooked-0.506
 mutated-0.497
 negligent-0.494
 tablet-0.483
 Went-0.468
Token we
Token ID356
Feature activation+1.83e-01

Pos logit contributions
lihood+0.439
VERTISEMENT+0.383
+0.372
archives+0.346
iso+0.342
Neg logit contributions
 crooked-0.362
 mutated-0.359
 negligent-0.352
 tablet-0.350
 distracted-0.333
Token move
Token ID1445
Feature activation-1.03e-01

Pos logit contributions
lihood+0.629
VERTISEMENT+0.547
+0.537
archives+0.504
iso+0.501
Neg logit contributions
 crooked-0.372
 mutated-0.364
 negligent-0.361
 tablet-0.351
 distracted-0.333
Token on
Token ID319
Feature activation+1.16e-01

Pos logit contributions
 crooked+0.239
 mutated+0.233
 negligent+0.232
 tablet+0.229
 toe+0.221
Neg logit contributions
lihood-0.319
VERTISEMENT-0.279
-0.272
archives-0.254
iso-0.250
Token DOWN
Token ID30320
Feature activation+2.83e-01

Pos logit contributions
lihood+0.162
VERTISEMENT+0.134
+0.131
archives+0.124
iso+0.121
Neg logit contributions
 crooked-0.183
 mutated-0.180
 negligent-0.178
 tablet-0.173
 toe-0.168
Token UP
Token ID15958
Feature activation+3.43e-01

Pos logit contributions
lihood+0.750
VERTISEMENT+0.618
+0.604
archives+0.577
iso+0.568
Neg logit contributions
 crooked-0.785
 mutated-0.775
 negligent-0.766
 tablet-0.750
vered-0.721
TokenON
Token ID1340
Feature activation+3.02e-01

Pos logit contributions
lihood+0.897
VERTISEMENT+0.739
+0.718
archives+0.680
iso+0.677
Neg logit contributions
 crooked-0.969
 mutated-0.954
 negligent-0.945
 tablet-0.923
 distracted-0.888
Token PAR
Token ID29463
Feature activation+6.11e-02

Pos logit contributions
lihood+0.865
VERTISEMENT+0.732
+0.720
archives+0.671
iso+0.664
Neg logit contributions
 crooked-0.773
 mutated-0.760
 negligent-0.755
 tablet-0.734
 Went-0.700
TokenAM
Token ID2390
Feature activation+7.55e-02

Pos logit contributions
lihood+0.168
VERTISEMENT+0.140
+0.136
archives+0.132
iso+0.126
Neg logit contributions
 mutated-0.159
 crooked-0.159
 negligent-0.157
 tablet-0.154
 Went-0.148
TokenOUNT
Token ID28270
Feature activation+1.12e+00

Pos logit contributions
lihood+0.211
VERTISEMENT+0.174
+0.172
archives+0.158
iso+0.156
Neg logit contributions
 crooked-0.201
 mutated-0.199
 negligent-0.197
 tablet-0.193
vered-0.184
Token P
Token ID350
Feature activation-1.09e-01

Pos logit contributions
lihood+2.999
VERTISEMENT+2.483
+2.429
archives+2.254
iso+2.239
Neg logit contributions
 crooked-3.091
 mutated-3.035
 negligent-3.032
 tablet-2.948
vered-2.836
TokenICT
Token ID18379
Feature activation+3.12e-01

Pos logit contributions
 crooked+0.269
 mutated+0.266
 negligent+0.266
 tablet+0.259
vered+0.250
Neg logit contributions
lihood-0.321
VERTISEMENT-0.268
-0.265
archives-0.247
iso-0.240
TokenURES
Token ID29514
Feature activation+3.22e-03

Pos logit contributions
lihood+0.809
VERTISEMENT+0.665
+0.647
archives+0.592
iso+0.590
Neg logit contributions
 crooked-0.888
 mutated-0.878
 negligent-0.875
 tablet-0.847
 toe-0.817
Token RE
Token ID4526
Feature activation+4.56e-01

Pos logit contributions
lihood+0.009
VERTISEMENT+0.007
+0.007
archives+0.007
iso+0.007
Neg logit contributions
 crooked-0.009
 mutated-0.009
 negligent-0.008
 tablet-0.008
 Went-0.008
TokenQUEST
Token ID35780
Feature activation-3.03e-02

Pos logit contributions
lihood+1.401
VERTISEMENT+1.175
+1.152
iso+1.080
archives+1.076
Neg logit contributions
 crooked-1.083
 mutated-1.069
 negligent-1.060
 tablet-1.031
vered-1.007
Token episodes
Token ID8640
Feature activation-3.32e-01

Pos logit contributions
lihood+0.160
VERTISEMENT+0.137
+0.133
iso+0.124
archives+0.123
Neg logit contributions
 crooked-0.129
 negligent-0.128
 mutated-0.128
 tablet-0.123
 toe-0.119
Token of
Token ID286
Feature activation-4.41e-02

Pos logit contributions
 crooked+0.836
 mutated+0.824
 negligent+0.819
 tablet+0.789
 toe+0.766
Neg logit contributions
lihood-0.980
VERTISEMENT-0.842
-0.803
archives-0.751
iso-0.741
Token 19
Token ID678
Feature activation-4.95e-02

Pos logit contributions
 crooked+0.106
 negligent+0.104
 mutated+0.103
 tablet+0.099
 distracted+0.096
Neg logit contributions
lihood-0.136
VERTISEMENT-0.115
-0.114
archives-0.106
iso-0.106
Token Kids
Token ID17476
Feature activation+1.68e-02

Pos logit contributions
 crooked+0.119
 negligent+0.116
 mutated+0.115
 tablet+0.109
 distracted+0.107
Neg logit contributions
lihood-0.157
VERTISEMENT-0.130
-0.128
iso-0.120
AGE-0.118
Token and
Token ID290
Feature activation-1.55e-01

Pos logit contributions
lihood+0.050
VERTISEMENT+0.042
+0.040
archives+0.039
iso+0.038
Neg logit contributions
 crooked-0.042
 mutated-0.042
 negligent-0.041
 tablet-0.040
 Went-0.039
Token Count
Token ID2764
Feature activation+1.11e+00

Pos logit contributions
 crooked+0.389
 mutated+0.386
 negligent+0.382
 tablet+0.373
 distracted+0.362
Neg logit contributions
lihood-0.464
VERTISEMENT-0.382
-0.375
iso-0.345
archives-0.342
Tokening
Token ID278
Feature activation+1.50e-01

Pos logit contributions
lihood+2.281
VERTISEMENT+1.758
+1.705
iso+1.488
archives+1.476
Neg logit contributions
 crooked-3.803
 negligent-3.748
 mutated-3.729
 tablet-3.662
 toe-3.550
Token from
Token ID422
Feature activation-2.68e-01

Pos logit contributions
lihood+0.440
VERTISEMENT+0.370
+0.361
archives+0.337
iso+0.329
Neg logit contributions
 crooked-0.389
 mutated-0.380
 negligent-0.380
 tablet-0.367
 distracted-0.352
Token the
Token ID262
Feature activation-2.46e-01

Pos logit contributions
 crooked+0.572
 mutated+0.559
 negligent+0.558
 tablet+0.543
 distracted+0.513
Neg logit contributions
lihood-0.897
VERTISEMENT-0.777
-0.764
archives-0.705
iso-0.704
Token air
Token ID1633
Feature activation-1.97e-01

Pos logit contributions
 crooked+0.580
 negligent+0.566
 mutated+0.565
 tablet+0.555
 distracted+0.524
Neg logit contributions
lihood-0.782
VERTISEMENT-0.668
-0.656
archives-0.600
AGE-0.600
Token.
Token ID13
Feature activation-3.53e-02

Pos logit contributions
 crooked+0.495
 mutated+0.487
 negligent+0.485
 tablet+0.475
 distracted+0.455
Neg logit contributions
lihood-0.569
VERTISEMENT-0.491
-0.472
archives-0.438
AGE-0.428
Token got
Token ID1392
Feature activation-2.12e-01

Pos logit contributions
 crooked+0.192
 mutated+0.187
 negligent+0.186
 tablet+0.181
 distracted+0.173
Neg logit contributions
lihood-0.277
VERTISEMENT-0.240
-0.234
iso-0.218
archives-0.218
Token that
Token ID326
Feature activation+1.14e-01

Pos logit contributions
 crooked+0.503
 mutated+0.489
 negligent+0.488
 tablet+0.471
 distracted+0.460
Neg logit contributions
lihood-0.667
VERTISEMENT-0.576
-0.557
archives-0.521
iso-0.520
Token?
Token ID30
Feature activation-5.95e-03

Pos logit contributions
lihood+0.346
VERTISEMENT+0.296
+0.290
archives+0.271
iso+0.268
Neg logit contributions
 crooked-0.281
 mutated-0.274
 negligent-0.271
 tablet-0.265
 toe-0.254
Token
Token ID198
Feature activation+3.63e-01

Pos logit contributions
 crooked+0.015
 mutated+0.015
 negligent+0.015
 tablet+0.014
 Went+0.014
Neg logit contributions
lihood-0.017
VERTISEMENT-0.015
-0.014
archives-0.013
iso-0.013
Token
Token ID198
Feature activation+3.21e-01

Pos logit contributions
lihood+0.951
VERTISEMENT+0.788
+0.766
archives+0.701
iso+0.696
Neg logit contributions
 crooked-1.026
 mutated-1.007
 negligent-1.006
 tablet-0.981
vered-0.944
TokenSure
Token ID19457
Feature activation+1.07e+00

Pos logit contributions
lihood+1.317
VERTISEMENT+1.164
+1.147
archives+1.092
iso+1.084
Neg logit contributions
 crooked-0.425
 negligent-0.408
 mutated-0.407
 tablet-0.386
vered-0.362
Token,
Token ID11
Feature activation+7.49e-03

Pos logit contributions
lihood+2.913
VERTISEMENT+2.425
+2.370
archives+2.181
iso+2.166
Neg logit contributions
 crooked-2.932
 mutated-2.877
 negligent-2.874
 tablet-2.800
 distracted-2.690
Token I
Token ID314
Feature activation-8.07e-02

Pos logit contributions
lihood+0.023
VERTISEMENT+0.020
+0.019
archives+0.018
iso+0.018
Neg logit contributions
 crooked-0.018
 negligent-0.018
 mutated-0.018
 tablet-0.017
 distracted-0.016
Token got
Token ID1392
Feature activation-1.48e-02

Pos logit contributions
 crooked+0.176
 mutated+0.172
 negligent+0.170
 tablet+0.166
 distracted+0.159
Neg logit contributions
lihood-0.265
VERTISEMENT-0.230
-0.225
archives-0.210
iso-0.210
Token it
Token ID340
Feature activation-3.33e-02

Pos logit contributions
 crooked+0.034
 mutated+0.033
 negligent+0.033
 tablet+0.032
 distracted+0.031
Neg logit contributions
lihood-0.048
VERTISEMENT-0.041
-0.040
iso-0.038
archives-0.038
Token.
Token ID13
Feature activation+1.97e-01

Pos logit contributions
 crooked+0.086
 mutated+0.084
 negligent+0.083
 tablet+0.081
 distracted+0.078
Neg logit contributions
lihood-0.097
VERTISEMENT-0.083
-0.080
archives-0.074
iso-0.074
Token DOWN
Token ID30320
Feature activation+5.31e-01

Pos logit contributions
lihood+1.147
VERTISEMENT+1.003
+0.991
archives+0.951
iso+0.945
Neg logit contributions
 crooked-0.513
 mutated-0.504
 negligent-0.495
 tablet-0.474
 Went-0.445
Token UP
Token ID15958
Feature activation+4.66e-01

Pos logit contributions
lihood+1.318
VERTISEMENT+1.064
+1.043
archives+0.976
iso+0.963
Neg logit contributions
 crooked-1.566
 mutated-1.545
 negligent-1.537
 tablet-1.503
vered-1.450
TokenON
Token ID1340
Feature activation+3.55e-01

Pos logit contributions
lihood+1.218
VERTISEMENT+0.990
+0.972
archives+0.911
iso+0.908
Neg logit contributions
 crooked-1.311
 mutated-1.291
 negligent-1.289
 tablet-1.256
 distracted-1.205
Token PAR
Token ID29463
Feature activation+2.50e-01

Pos logit contributions
lihood+0.852
VERTISEMENT+0.693
+0.683
archives+0.618
iso+0.612
Neg logit contributions
 crooked-1.072
 mutated-1.057
 negligent-1.051
 tablet-1.028
 distracted-0.987
TokenAM
Token ID2390
Feature activation+1.35e-01

Pos logit contributions
lihood+0.692
VERTISEMENT+0.569
+0.552
archives+0.520
iso+0.509
Neg logit contributions
 crooked-0.653
 negligent-0.644
 mutated-0.643
 tablet-0.635
vered-0.608
TokenOUNT
Token ID28270
Feature activation+1.04e+00

Pos logit contributions
lihood+0.315
VERTISEMENT+0.250
+0.245
archives+0.222
iso+0.221
Neg logit contributions
 crooked-0.417
 negligent-0.410
 mutated-0.410
 tablet-0.402
vered-0.387
Token P
Token ID350
Feature activation+1.86e-02

Pos logit contributions
lihood+2.914
VERTISEMENT+2.437
+2.390
archives+2.206
iso+2.191
Neg logit contributions
 crooked-2.762
 mutated-2.709
 negligent-2.705
 tablet-2.635
 distracted-2.523
TokenICT
Token ID18379
Feature activation+3.18e-01

Pos logit contributions
lihood+0.057
VERTISEMENT+0.048
+0.047
archives+0.044
iso+0.043
Neg logit contributions
 crooked-0.044
 negligent-0.044
 mutated-0.044
 tablet-0.042
vered-0.040
TokenURES
Token ID29514
Feature activation+4.15e-02

Pos logit contributions
lihood+0.800
VERTISEMENT+0.649
+0.624
iso+0.575
archives+0.570
Neg logit contributions
 crooked-0.931
 negligent-0.915
 mutated-0.915
 tablet-0.885
vered-0.862
Token RE
Token ID4526
Feature activation+5.45e-01

Pos logit contributions
lihood+0.116
VERTISEMENT+0.098
+0.096
archives+0.089
iso+0.089
Neg logit contributions
 crooked-0.108
 mutated-0.107
 negligent-0.106
 tablet-0.103
 distracted-0.099
TokenQUEST
Token ID35780
Feature activation-1.19e-01

Pos logit contributions
lihood+1.551
VERTISEMENT+1.307
+1.278
archives+1.182
iso+1.174
Neg logit contributions
 crooked-1.414
 mutated-1.385
 negligent-1.384
 tablet-1.346
 distracted-1.291
Tokens
Token ID82
Feature activation+1.52e-01

Pos logit contributions
 crooked+0.195
 negligent+0.192
 mutated+0.191
 tablet+0.188
 distracted+0.181
Neg logit contributions
lihood-0.190
VERTISEMENT-0.159
-0.153
archives-0.140
iso-0.138
Token)}
Token ID38165
Feature activation+1.79e-01

Pos logit contributions
lihood+0.396
VERTISEMENT+0.327
+0.318
archives+0.291
iso+0.289
Neg logit contributions
 crooked-0.434
 mutated-0.426
 negligent-0.426
 tablet-0.415
 distracted-0.400
Token 6
Token ID718
Feature activation+2.76e-01

Pos logit contributions
lihood+0.467
VERTISEMENT+0.384
+0.372
archives+0.337
iso+0.333
Neg logit contributions
 crooked-0.513
 mutated-0.508
 negligent-0.507
 tablet-0.493
 toe-0.473
Token.
Token ID13
Feature activation+2.84e-02

Pos logit contributions
lihood+0.773
VERTISEMENT+0.650
+0.631
archives+0.582
iso+0.577
Neg logit contributions
 crooked-0.728
 mutated-0.713
 negligent-0.713
 tablet-0.694
vered-0.667
Token N
Token ID399
Feature activation+2.37e-01

Pos logit contributions
lihood+0.090
VERTISEMENT+0.077
+0.075
archives+0.070
iso+0.069
Neg logit contributions
 crooked-0.066
 mutated-0.065
 negligent-0.064
 tablet-0.062
 toe-0.060
Tokenxd
Token ID24954
Feature activation+1.04e+00

Pos logit contributions
lihood+0.796
VERTISEMENT+0.691
+0.669
archives+0.620
iso+0.613
Neg logit contributions
 crooked-0.496
 negligent-0.483
 mutated-0.483
 tablet-0.473
vered-0.456
Token4
Token ID19
Feature activation+2.40e-01

Pos logit contributions
lihood+2.356
VERTISEMENT+1.884
+1.799
archives+1.622
iso+1.599
Neg logit contributions
 crooked-3.338
 mutated-3.293
 negligent-3.286
 tablet-3.217
 distracted-3.111
Token {
Token ID1391
Feature activation+1.42e-01

Pos logit contributions
lihood+0.618
VERTISEMENT+0.506
+0.490
archives+0.450
iso+0.447
Neg logit contributions
 crooked-0.696
 negligent-0.685
 mutated-0.685
 tablet-0.667
 toe-0.642
Token(
Token ID7
Feature activation-2.40e-01

Pos logit contributions
lihood+0.420
VERTISEMENT+0.356
+0.344
archives+0.320
iso+0.318
Neg logit contributions
 crooked-0.352
 mutated-0.345
 negligent-0.345
 tablet-0.335
vered-0.321
Token7
Token ID22
Feature activation+1.05e-01

Pos logit contributions
 crooked+0.671
 negligent+0.661
 mutated+0.660
 tablet+0.646
 toe+0.619
Neg logit contributions
lihood-0.623
VERTISEMENT-0.519
-0.490
archives-0.455
iso-0.447
Tokens
Token ID82
Feature activation+1.27e-01

Pos logit contributions
lihood+0.281
VERTISEMENT+0.233
+0.226
archives+0.207
iso+0.206
Neg logit contributions
 crooked-0.288
 mutated-0.284
 negligent-0.283
 tablet-0.277
 distracted-0.267
Token A
Token ID317
Feature activation+1.53e-01

Pos logit contributions
lihood+0.482
VERTISEMENT+0.399
+0.393
archives+0.362
iso+0.357
Neg logit contributions
 crooked-0.451
 negligent-0.441
 mutated-0.438
 tablet-0.429
 Went-0.422
Token bitters
Token ID48666
Feature activation+4.21e-01

Pos logit contributions
lihood+0.486
VERTISEMENT+0.411
+0.408
archives+0.382
iso+0.378
Neg logit contributions
 crooked-0.355
 negligent-0.346
 mutated-0.345
 tablet-0.335
 distracted-0.321
Tokenweet
Token ID7277
Feature activation-7.88e-03

Pos logit contributions
lihood+1.764
VERTISEMENT+1.571
+1.549
archives+1.476
iso+1.469
Neg logit contributions
 crooked-0.529
 mutated-0.508
 negligent-0.505
 tablet-0.478
 distracted-0.435
Token frenzy
Token ID31934
Feature activation+2.43e-01

Pos logit contributions
 crooked+0.019
 mutated+0.018
 negligent+0.018
 tablet+0.018
 toe+0.017
Neg logit contributions
lihood-0.025
VERTISEMENT-0.021
-0.021
archives-0.019
iso-0.019
Token?
Token ID30
Feature activation+3.13e-02

Pos logit contributions
lihood+0.715
VERTISEMENT+0.603
+0.592
archives+0.550
iso+0.548
Neg logit contributions
 crooked-0.611
 mutated-0.600
 negligent-0.598
 tablet-0.580
 distracted-0.560
Token Sure
Token ID10889
Feature activation+1.04e+00

Pos logit contributions
lihood+0.092
VERTISEMENT+0.078
+0.076
archives+0.071
iso+0.070
Neg logit contributions
 crooked-0.078
 negligent-0.076
 mutated-0.075
 tablet-0.074
 Went-0.073
Token,
Token ID11
Feature activation+3.15e-02

Pos logit contributions
lihood+2.806
VERTISEMENT+2.335
+2.282
archives+2.101
iso+2.087
Neg logit contributions
 crooked-2.845
 mutated-2.792
 negligent-2.789
 tablet-2.717
 distracted-2.612
Token why
Token ID1521
Feature activation-3.24e-01

Pos logit contributions
lihood+0.093
VERTISEMENT+0.079
+0.076
iso+0.071
archives+0.071
Neg logit contributions
 crooked-0.081
 negligent-0.079
 mutated-0.078
 tablet-0.077
 distracted-0.074
Token not
Token ID407
Feature activation+2.11e-01

Pos logit contributions
 crooked+0.697
 negligent+0.679
 mutated+0.669
 tablet+0.656
 toe+0.631
Neg logit contributions
lihood-1.078
VERTISEMENT-0.949
-0.915
archives-0.863
iso-0.853
Token?
Token ID30
Feature activation+1.48e-02

Pos logit contributions
lihood+0.622
VERTISEMENT+0.530
+0.511
archives+0.477
iso+0.477
Neg logit contributions
 crooked-0.534
 negligent-0.521
 mutated-0.520
 tablet-0.509
 toe-0.488
Token
Token ID198
Feature activation+1.30e-01

Pos logit contributions
lihood+0.042
VERTISEMENT+0.035
+0.034
archives+0.032
iso+0.032
Neg logit contributions
 crooked-0.038
 negligent-0.038
 mutated-0.037
 tablet-0.036
 Went-0.036
Token pressure
Token ID3833
Feature activation+1.68e-01

Pos logit contributions
lihood+0.740
VERTISEMENT+0.636
+0.614
iso+0.583
archives+0.580
Neg logit contributions
 crooked-0.561
 mutated-0.542
 negligent-0.541
 tablet-0.532
 toe-0.513
Token in
Token ID287
Feature activation+2.24e-01

Pos logit contributions
lihood+0.496
VERTISEMENT+0.418
+0.404
archives+0.379
iso+0.375
Neg logit contributions
 crooked-0.426
 mutated-0.416
 negligent-0.415
 tablet-0.407
 toe-0.395
Token Congress
Token ID3162
Feature activation+1.09e-01

Pos logit contributions
lihood+0.686
VERTISEMENT+0.586
+0.569
iso+0.534
archives+0.528
Neg logit contributions
 crooked-0.544
 mutated-0.528
 negligent-0.528
 tablet-0.517
 toe-0.495
Token for
Token ID329
Feature activation+3.36e-02

Pos logit contributions
lihood+0.320
VERTISEMENT+0.271
+0.260
archives+0.244
iso+0.240
Neg logit contributions
 crooked-0.277
 negligent-0.271
 mutated-0.270
 tablet-0.263
 distracted-0.255
Token her
Token ID607
Feature activation-1.90e-01

Pos logit contributions
lihood+0.109
VERTISEMENT+0.096
+0.093
archives+0.088
iso+0.087
Neg logit contributions
 crooked-0.075
 negligent-0.072
 mutated-0.072
 tablet-0.070
 toe-0.067
Token impeachment
Token ID30258
Feature activation+1.03e+00

Pos logit contributions
 crooked+0.484
 negligent+0.474
 mutated+0.467
 tablet+0.459
 toe+0.445
Neg logit contributions
lihood-0.562
VERTISEMENT-0.483
-0.463
archives-0.439
iso-0.431
Token.
Token ID13
Feature activation+1.95e-01

Pos logit contributions
lihood+2.892
VERTISEMENT+2.426
+2.366
archives+2.194
iso+2.171
Neg logit contributions
 crooked-2.721
 negligent-2.667
 mutated-2.663
 tablet-2.601
 distracted-2.490
Token
Token ID198
Feature activation-3.77e-02

Pos logit contributions
lihood+0.547
VERTISEMENT+0.449
+0.434
archives+0.408
iso+0.406
Neg logit contributions
 crooked-0.518
 negligent-0.502
 mutated-0.502
 tablet-0.489
 Went-0.474
Token
Token ID198
Feature activation-3.58e-02

Pos logit contributions
 crooked+0.102
 negligent+0.100
 mutated+0.100
 tablet+0.097
vered+0.094
Neg logit contributions
lihood-0.103
VERTISEMENT-0.086
-0.083
archives-0.077
iso-0.076
TokenThe
Token ID464
Feature activation+1.29e-01

Pos logit contributions
 crooked+0.097
 negligent+0.095
 mutated+0.094
 tablet+0.092
vered+0.092
Neg logit contributions
lihood-0.096
VERTISEMENT-0.078
-0.074
archives-0.069
iso-0.069
Token Brazilian
Token ID17036
Feature activation+4.19e-01

Pos logit contributions
lihood+0.406
VERTISEMENT+0.351
+0.337
archives+0.320
iso+0.314
Neg logit contributions
 crooked-0.302
 negligent-0.291
 mutated-0.291
 tablet-0.284
 toe-0.272
Token former
Token ID1966
Feature activation+9.17e-02

Pos logit contributions
lihood+0.452
VERTISEMENT+0.396
+0.383
archives+0.364
iso+0.360
Neg logit contributions
 crooked-0.295
 mutated-0.287
 negligent-0.286
 tablet-0.277
 distracted-0.264
Token President
Token ID1992
Feature activation-1.83e-01

Pos logit contributions
lihood+0.271
VERTISEMENT+0.231
+0.224
archives+0.212
iso+0.209
Neg logit contributions
 crooked-0.232
 mutated-0.225
 negligent-0.225
 tablet-0.219
 distracted-0.210
Token Barack
Token ID8732
Feature activation-4.50e-02

Pos logit contributions
 crooked+0.485
 negligent+0.470
 mutated+0.465
 tablet+0.452
 distracted+0.436
Neg logit contributions
lihood-0.529
VERTISEMENT-0.444
-0.425
iso-0.398
archives-0.397
Token Obama
Token ID2486
Feature activation+2.12e-01

Pos logit contributions
 crooked+0.107
 negligent+0.105
 mutated+0.104
 tablet+0.101
 distracted+0.097
Neg logit contributions
lihood-0.138
VERTISEMENT-0.120
-0.115
archives-0.107
iso-0.107
Token Barack
Token ID8732
Feature activation+5.51e-01

Pos logit contributions
lihood+0.598
VERTISEMENT+0.500
+0.475
archives+0.448
iso+0.446
Neg logit contributions
 crooked-0.569
 negligent-0.560
 mutated-0.559
 tablet-0.544
 distracted-0.528
Token Hussein
Token ID22983
Feature activation+1.02e+00

Pos logit contributions
lihood+1.638
VERTISEMENT+1.387
+1.368
archives+1.270
iso+1.263
Neg logit contributions
 crooked-1.353
 mutated-1.324
 negligent-1.320
 tablet-1.286
vered-1.236
Token Obama
Token ID2486
Feature activation+5.57e-01

Pos logit contributions
lihood+2.860
VERTISEMENT+2.382
+2.350
archives+2.203
iso+2.155
Neg logit contributions
 crooked-2.665
 mutated-2.624
 negligent-2.600
 tablet-2.547
vered-2.453
TokenWith
Token ID3152
Feature activation+4.13e-01

Pos logit contributions
lihood+1.436
VERTISEMENT+1.177
+1.148
archives+1.047
iso+1.042
Neg logit contributions
 crooked-1.602
 negligent-1.576
 mutated-1.574
 tablet-1.536
 distracted-1.479
Token low
Token ID1877
Feature activation+1.67e-02

Pos logit contributions
lihood+1.280
VERTISEMENT+1.058
+1.036
iso+0.973
archives+0.972
Neg logit contributions
 crooked-1.015
 negligent-1.009
 mutated-1.000
 tablet-0.973
 distracted-0.945
Token birth
Token ID4082
Feature activation+7.20e-02

Pos logit contributions
lihood+0.049
VERTISEMENT+0.042
+0.039
archives+0.038
iso+0.037
Neg logit contributions
 mutated-0.041
 crooked-0.041
 negligent-0.041
 tablet-0.040
 distracted-0.039
Token rate
Token ID2494
Feature activation+2.98e-01

Pos logit contributions
lihood+0.202
VERTISEMENT+0.178
+0.169
iso+0.155
archives+0.154
Neg logit contributions
 crooked-0.186
 negligent-0.183
 mutated-0.183
 tablet-0.177
vered-0.173
Token former
Token ID1966
Feature activation+1.39e-01

Pos logit contributions
lihood+0.160
VERTISEMENT+0.139
+0.136
iso+0.126
archives+0.126
Neg logit contributions
 crooked-0.118
 negligent-0.114
 mutated-0.113
 tablet-0.111
 toe-0.106
Token bankruptcy
Token ID17235
Feature activation-4.22e-02

Pos logit contributions
lihood+0.453
VERTISEMENT+0.392
+0.382
archives+0.361
iso+0.358
Neg logit contributions
 crooked-0.311
 negligent-0.301
 mutated-0.299
 tablet-0.291
 toe-0.279
Token attorney
Token ID6136
Feature activation-2.33e-02

Pos logit contributions
 crooked+0.095
 negligent+0.093
 mutated+0.092
 tablet+0.090
 distracted+0.085
Neg logit contributions
lihood-0.139
VERTISEMENT-0.119
-0.116
archives-0.109
iso-0.108
Token the
Token ID262
Feature activation+5.16e-02

Pos logit contributions
 crooked+0.055
 negligent+0.053
 mutated+0.053
 tablet+0.051
 toe+0.049
Neg logit contributions
lihood-0.073
VERTISEMENT-0.063
-0.061
iso-0.057
archives-0.057
Token most
Token ID749
Feature activation+5.56e-02

Pos logit contributions
lihood+0.154
VERTISEMENT+0.132
+0.127
iso+0.119
archives+0.119
Neg logit contributions
 crooked-0.131
 negligent-0.127
 mutated-0.127
 tablet-0.125
 toe-0.119
Token potent
Token ID16739
Feature activation+1.01e+00

Pos logit contributions
lihood+0.197
VERTISEMENT+0.173
+0.169
iso+0.161
archives+0.159
Neg logit contributions
 crooked-0.111
 negligent-0.107
 mutated-0.106
 tablet-0.101
 distracted-0.097
Token--
Token ID438
Feature activation+2.37e-01

Pos logit contributions
lihood+3.160
VERTISEMENT+2.701
+2.649
archives+2.471
iso+2.458
Neg logit contributions
 crooked-2.336
 mutated-2.285
 negligent-2.283
 tablet-2.212
 distracted-2.108
Tokenand
Token ID392
Feature activation+1.76e-01

Pos logit contributions
lihood+0.635
VERTISEMENT+0.531
+0.508
archives+0.465
iso+0.465
Neg logit contributions
 crooked-0.660
 negligent-0.646
 mutated-0.645
 tablet-0.629
vered-0.607
Token dangerous
Token ID4923
Feature activation+9.45e-02

Pos logit contributions
lihood+0.581
VERTISEMENT+0.504
+0.494
iso+0.464
archives+0.462
Neg logit contributions
 crooked-0.387
 negligent-0.376
 mutated-0.375
 tablet-0.359
 distracted-0.346
Token--
Token ID438
Feature activation+3.20e-01

Pos logit contributions
lihood+0.221
VERTISEMENT+0.180
+0.175
archives+0.158
iso+0.157
Neg logit contributions
 crooked-0.298
 mutated-0.291
 negligent-0.290
 tablet-0.285
 distracted-0.276
TokenAmerican
Token ID7437
Feature activation+1.41e-01

Pos logit contributions
lihood+0.955
VERTISEMENT+0.820
+0.789
archives+0.718
iso+0.717
Neg logit contributions
 crooked-0.792
 negligent-0.774
 mutated-0.769
 tablet-0.752
vered-0.725
Token the
Token ID262
Feature activation-7.19e-02

Pos logit contributions
 crooked+0.199
 negligent+0.194
 mutated+0.191
 tablet+0.184
 distracted+0.179
Neg logit contributions
lihood-0.288
VERTISEMENT-0.246
-0.242
archives-0.230
iso-0.224
Token two
Token ID734
Feature activation-1.04e-02

Pos logit contributions
 crooked+0.151
 negligent+0.146
 mutated+0.143
 tablet+0.140
 distracted+0.133
Neg logit contributions
lihood-0.246
VERTISEMENT-0.215
-0.210
archives-0.198
iso-0.194
Token
Token ID564
Feature activation+3.16e-01

Pos logit contributions
 crooked+0.024
 negligent+0.023
 mutated+0.023
 tablet+0.022
 distracted+0.021
Neg logit contributions
lihood-0.034
VERTISEMENT-0.029
-0.029
iso-0.027
archives-0.027
Token
Token ID250
Feature activation-1.94e-02

Pos logit contributions
lihood+1.343
VERTISEMENT+1.199
+1.179
archives+1.127
iso+1.123
Neg logit contributions
 crooked-0.377
 mutated-0.361
 negligent-0.360
 tablet-0.336
vered-0.306
Tokenm
Token ID76
Feature activation+2.35e-01

Pos logit contributions
 crooked+0.049
 negligent+0.048
 mutated+0.047
 tablet+0.046
vered+0.046
Neg logit contributions
lihood-0.056
VERTISEMENT-0.048
-0.046
archives-0.043
iso-0.042
Tokenaint
Token ID2913
Feature activation+1.01e+00

Pos logit contributions
lihood+0.711
VERTISEMENT+0.605
+0.594
archives+0.548
iso+0.541
Neg logit contributions
 crooked-0.571
 negligent-0.558
 mutated-0.556
 tablet-0.538
vered-0.519
Tokenained
Token ID1328
Feature activation-3.63e-01

Pos logit contributions
lihood+3.422
VERTISEMENT+2.972
+2.931
archives+2.746
iso+2.733
Neg logit contributions
 crooked-2.034
 mutated-1.986
 negligent-1.985
 tablet-1.917
 distracted-1.804
Token frequent
Token ID10792
Feature activation-3.05e-01

Pos logit contributions
 crooked+0.820
 negligent+0.800
 mutated+0.786
 tablet+0.760
 toe+0.730
Neg logit contributions
lihood-1.172
VERTISEMENT-1.014
-1.004
archives-0.946
iso-0.943
Token,
Token ID11
Feature activation-2.78e-01

Pos logit contributions
 crooked+0.726
 negligent+0.711
 mutated+0.703
 tablet+0.684
 toe+0.661
Neg logit contributions
lihood-0.944
VERTISEMENT-0.816
-0.794
archives-0.749
iso-0.745
Token direct
Token ID1277
Feature activation-2.41e-01

Pos logit contributions
 crooked+0.597
 negligent+0.580
 mutated+0.575
 tablet+0.555
 distracted+0.534
Neg logit contributions
lihood-0.938
VERTISEMENT-0.808
-0.796
archives-0.752
iso-0.746
Token contact
Token ID2800
Feature activation-1.47e-01

Pos logit contributions
 crooked+0.574
 negligent+0.562
 mutated+0.552
 tablet+0.540
 toe+0.523
Neg logit contributions
lihood-0.746
VERTISEMENT-0.640
-0.628
iso-0.593
archives-0.589
Token of
Token ID286
Feature activation-2.44e-02

Pos logit contributions
lihood+0.223
VERTISEMENT+0.191
+0.183
archives+0.171
iso+0.171
Neg logit contributions
 crooked-0.194
 negligent-0.189
 mutated-0.189
 tablet-0.187
 toe-0.178
Token the
Token ID262
Feature activation+1.99e-02

Pos logit contributions
 crooked+0.058
 mutated+0.056
 negligent+0.056
 tablet+0.055
 toe+0.052
Neg logit contributions
lihood-0.076
VERTISEMENT-0.066
-0.064
iso-0.060
archives-0.059
Token government
Token ID1230
Feature activation+3.40e-01

Pos logit contributions
lihood+0.064
VERTISEMENT+0.055
+0.054
archives+0.050
iso+0.050
Neg logit contributions
 crooked-0.045
 mutated-0.044
 negligent-0.044
 tablet-0.043
 toe-0.041
Token of
Token ID286
Feature activation-8.01e-02

Pos logit contributions
lihood+0.982
VERTISEMENT+0.826
+0.800
archives+0.745
iso+0.738
Neg logit contributions
 crooked-0.882
 negligent-0.861
 mutated-0.859
 tablet-0.843
 distracted-0.805
Token President
Token ID1992
Feature activation+4.36e-02

Pos logit contributions
 crooked+0.206
 negligent+0.200
 mutated+0.198
 tablet+0.193
 distracted+0.187
Neg logit contributions
lihood-0.234
VERTISEMENT-0.199
-0.192
archives-0.179
iso-0.179
Token Dil
Token ID29328
Feature activation+1.00e+00

Pos logit contributions
lihood+0.143
VERTISEMENT+0.124
+0.118
archives+0.113
iso+0.111
Neg logit contributions
 crooked-0.096
 negligent-0.093
 mutated-0.093
 tablet-0.090
 distracted-0.086
Tokenma
Token ID2611
Feature activation+6.49e-02

Pos logit contributions
lihood+2.691
VERTISEMENT+2.239
+2.184
archives+2.005
iso+1.989
Neg logit contributions
 crooked-2.760
 mutated-2.710
 negligent-2.707
 tablet-2.638
 distracted-2.536
Token Rousse
Token ID42849
Feature activation+3.55e-01

Pos logit contributions
lihood+0.176
VERTISEMENT+0.145
+0.140
archives+0.133
iso+0.129
Neg logit contributions
 crooked-0.181
 negligent-0.177
 mutated-0.175
 tablet-0.173
 distracted-0.166
Tokenff
Token ID487
Feature activation+4.93e-01

Pos logit contributions
lihood+1.214
VERTISEMENT+1.056
+1.034
archives+0.973
iso+0.966
Neg logit contributions
 crooked-0.716
 mutated-0.697
 negligent-0.697
 tablet-0.672
vered-0.636
Token and
Token ID290
Feature activation+9.56e-02

Pos logit contributions
lihood+1.403
VERTISEMENT+1.180
+1.141
archives+1.069
iso+1.048
Neg logit contributions
 crooked-1.299
 negligent-1.272
 mutated-1.264
 tablet-1.239
 distracted-1.189
Token no
Token ID645
Feature activation-1.19e-01

Pos logit contributions
lihood+0.296
VERTISEMENT+0.259
+0.252
archives+0.234
iso+0.232
Neg logit contributions
 crooked-0.225
 negligent-0.220
 mutated-0.218
 tablet-0.213
 toe-0.204
Token of
Token ID286
Feature activation+2.46e-01

Pos logit contributions
 crooked+0.037
 negligent+0.036
 mutated+0.035
 tablet+0.035
 toe+0.033
Neg logit contributions
lihood-0.046
VERTISEMENT-0.039
-0.038
archives-0.035
iso-0.035
Token cash
Token ID5003
Feature activation-1.58e-01

Pos logit contributions
lihood+0.744
VERTISEMENT+0.629
+0.612
archives+0.569
iso+0.561
Neg logit contributions
 crooked-0.617
 mutated-0.601
 negligent-0.597
 tablet-0.586
 distracted-0.561
Token.
Token ID13
Feature activation+9.22e-02

Pos logit contributions
 crooked+0.380
 negligent+0.373
 mutated+0.372
 tablet+0.364
 toe+0.353
Neg logit contributions
lihood-0.484
VERTISEMENT-0.406
-0.399
archives-0.373
iso-0.366
Token (
Token ID357
Feature activation+1.04e-01

Pos logit contributions
lihood+0.259
VERTISEMENT+0.217
+0.216
archives+0.198
iso+0.196
Neg logit contributions
 crooked-0.240
 negligent-0.234
 mutated-0.232
 Went-0.225
 tablet-0.225
TokenTechn
Token ID25574
Feature activation+1.19e-01

Pos logit contributions
lihood+0.276
VERTISEMENT+0.225
+0.218
iso+0.200
archives+0.200
Neg logit contributions
 crooked-0.289
 mutated-0.281
 negligent-0.281
 tablet-0.273
 Went-0.267
Tokenically
Token ID1146
Feature activation+9.99e-01

Pos logit contributions
lihood+0.311
VERTISEMENT+0.250
+0.245
archives+0.224
AGE+0.223
Neg logit contributions
 crooked-0.344
 mutated-0.337
 negligent-0.336
 tablet-0.329
 distracted-0.318
Token,
Token ID11
Feature activation+1.88e-01

Pos logit contributions
lihood+2.713
VERTISEMENT+2.270
+2.215
archives+2.033
iso+2.032
Neg logit contributions
 crooked-2.766
 negligent-2.703
 mutated-2.693
 tablet-2.620
 distracted-2.540
Token no
Token ID645
Feature activation+4.80e-02

Pos logit contributions
lihood+0.581
VERTISEMENT+0.501
+0.490
archives+0.452
iso+0.448
Neg logit contributions
 crooked-0.453
 negligent-0.440
 mutated-0.435
 tablet-0.425
 distracted-0.412
Token Alberta
Token ID15555
Feature activation-1.36e-01

Pos logit contributions
lihood+0.153
VERTISEMENT+0.130
+0.127
archives+0.118
iso+0.117
Neg logit contributions
 crooked-0.113
 mutated-0.109
 negligent-0.107
 tablet-0.106
 distracted-0.101
Token budget
Token ID4466
Feature activation-1.83e-01

Pos logit contributions
 crooked+0.316
 mutated+0.305
 negligent+0.303
 tablet+0.301
 distracted+0.287
Neg logit contributions
lihood-0.428
VERTISEMENT-0.366
-0.354
archives-0.332
iso-0.329
Token for
Token ID329
Feature activation-1.91e-01

Pos logit contributions
 crooked+0.400
 mutated+0.386
 negligent+0.381
 tablet+0.380
 toe+0.360
Neg logit contributions
lihood-0.611
VERTISEMENT-0.523
-0.515
archives-0.474
iso-0.472
Token Cuba
Token ID14159
Feature activation-7.17e-01

Pos logit contributions
 crooked+1.231
 negligent+1.185
 mutated+1.183
 tablet+1.151
 toe+1.128
Neg logit contributions
lihood-1.689
VERTISEMENT-1.494
-1.454
archives-1.369
iso-1.344
Token present
Token ID1944
Feature activation-8.16e-02

Pos logit contributions
 crooked+1.722
 mutated+1.693
 negligent+1.692
 tablet+1.641
 toe+1.615
Neg logit contributions
lihood-2.177
VERTISEMENT-1.888
-1.810
archives-1.706
iso-1.666
Token.
Token ID13
Feature activation+1.03e-02

Pos logit contributions
 crooked+0.205
 negligent+0.202
 mutated+0.200
 tablet+0.195
 distracted+0.190
Neg logit contributions
lihood-0.236
VERTISEMENT-0.205
-0.197
archives-0.188
iso-0.182
Token
Token ID198
Feature activation+8.84e-03

Pos logit contributions
lihood+0.028
VERTISEMENT+0.023
+0.023
archives+0.021
iso+0.021
Neg logit contributions
 crooked-0.027
 mutated-0.027
 negligent-0.027
 tablet-0.026
 Went-0.025
Token
Token ID198
Feature activation-2.95e-02

Pos logit contributions
lihood+0.024
VERTISEMENT+0.020
+0.019
archives+0.018
iso+0.018
Neg logit contributions
 crooked-0.024
 mutated-0.024
 negligent-0.024
 tablet-0.023
vered-0.022
TokenCount
Token ID12332
Feature activation+9.96e-01

Pos logit contributions
 crooked+0.077
 negligent+0.075
 mutated+0.075
 tablet+0.073
vered+0.073
Neg logit contributions
lihood-0.080
VERTISEMENT-0.067
-0.064
archives-0.059
iso-0.058
Tokenries
Token ID1678
Feature activation-3.66e-02

Pos logit contributions
lihood+2.515
VERTISEMENT+2.089
+2.023
archives+1.846
iso+1.834
Neg logit contributions
 crooked-2.900
 negligent-2.844
 mutated-2.844
 tablet-2.778
 distracted-2.686
Token that
Token ID326
Feature activation-3.44e-01

Pos logit contributions
 crooked+0.086
 mutated+0.085
 negligent+0.084
 tablet+0.082
 distracted+0.079
Neg logit contributions
lihood-0.113
VERTISEMENT-0.098
-0.096
archives-0.089
iso-0.088
Token once
Token ID1752
Feature activation-3.53e-01

Pos logit contributions
 crooked+0.740
 mutated+0.732
 negligent+0.722
 tablet+0.705
 toe+0.693
Neg logit contributions
lihood-1.119
VERTISEMENT-0.988
-0.960
archives-0.905
iso-0.896
Token acquies
Token ID47752
Feature activation-4.20e-01

Pos logit contributions
 crooked+0.690
 mutated+0.681
 negligent+0.662
 tablet+0.648
 toe+0.636
Neg logit contributions
lihood-1.227
VERTISEMENT-1.083
-1.054
archives-1.007
iso-0.997
Tokenced
Token ID771
Feature activation-2.83e-01

Pos logit contributions
 crooked+0.678
 mutated+0.661
 negligent+0.650
 tablet+0.629
vered+0.612
Neg logit contributions
lihood-1.606
VERTISEMENT-1.417
-1.394
archives-1.323
iso-1.315
Token to
Token ID284
Feature activation+8.60e-02

Pos logit contributions
lihood+1.321
VERTISEMENT+1.126
+1.103
archives+1.028
iso+1.011
Neg logit contributions
 crooked-1.100
 mutated-1.067
 negligent-1.067
 tablet-1.040
 toe-0.994
Token the
Token ID262
Feature activation+2.08e-01

Pos logit contributions
lihood+0.273
VERTISEMENT+0.232
+0.228
archives+0.214
iso+0.208
Neg logit contributions
 crooked-0.202
 negligent-0.195
 mutated-0.193
 tablet-0.191
 toe-0.182
Token walls
Token ID7714
Feature activation-5.14e-03

Pos logit contributions
lihood+0.701
VERTISEMENT+0.604
+0.594
archives+0.561
iso+0.549
Neg logit contributions
 crooked-0.444
 mutated-0.429
 negligent-0.428
 tablet-0.421
 toe-0.398
Token..
Token ID492
Feature activation+4.35e-02

Pos logit contributions
 crooked+0.013
 mutated+0.012
 negligent+0.012
 tablet+0.012
 toe+0.012
Neg logit contributions
lihood-0.015
VERTISEMENT-0.013
-0.013
archives-0.012
iso-0.012
Tokenn
Token ID77
Feature activation+9.97e-02

Pos logit contributions
lihood+0.130
VERTISEMENT+0.109
+0.107
archives+0.100
iso+0.098
Neg logit contributions
 crooked-0.109
 negligent-0.106
 mutated-0.106
 tablet-0.104
 toe-0.099
Tokenail
Token ID603
Feature activation+9.92e-01

Pos logit contributions
lihood+0.310
VERTISEMENT+0.262
+0.260
archives+0.242
iso+0.237
Neg logit contributions
 crooked-0.235
 mutated-0.228
 negligent-0.228
 tablet-0.224
vered-0.213
Token the
Token ID262
Feature activation+2.03e-01

Pos logit contributions
lihood+3.023
VERTISEMENT+2.581
+2.533
archives+2.351
iso+2.324
Neg logit contributions
 crooked-2.412
 mutated-2.347
 negligent-2.347
 tablet-2.287
 distracted-2.178
Token furniture
Token ID13091
Feature activation-2.85e-02

Pos logit contributions
lihood+0.649
VERTISEMENT+0.557
+0.547
archives+0.511
iso+0.503
Neg logit contributions
 crooked-0.468
 negligent-0.453
 mutated-0.453
 tablet-0.445
 toe-0.423
Token down
Token ID866
Feature activation+6.98e-02

Pos logit contributions
 crooked+0.074
 negligent+0.071
 mutated+0.070
 tablet+0.069
 toe+0.067
Neg logit contributions
lihood-0.084
VERTISEMENT-0.072
-0.070
archives-0.065
iso-0.063
Token and
Token ID290
Feature activation+8.13e-02

Pos logit contributions
lihood+0.203
VERTISEMENT+0.173
+0.169
archives+0.159
iso+0.154
Neg logit contributions
 crooked-0.181
 negligent-0.174
 mutated-0.173
 tablet-0.171
 toe-0.164
Token strap
Token ID23016
Feature activation+1.33e-01

Pos logit contributions
lihood+0.260
VERTISEMENT+0.223
+0.219
archives+0.204
iso+0.201
Neg logit contributions
 crooked-0.187
 negligent-0.181
 mutated-0.181
 tablet-0.178
 toe-0.170
Token and
Token ID290
Feature activation-3.37e-02

Pos logit contributions
 crooked+0.288
 mutated+0.279
 negligent+0.277
 tablet+0.272
 distracted+0.260
Neg logit contributions
lihood-0.332
VERTISEMENT-0.283
-0.282
archives-0.258
iso-0.254
Token even
Token ID772
Feature activation+9.36e-02

Pos logit contributions
 crooked+0.080
 negligent+0.077
 mutated+0.077
 tablet+0.076
 toe+0.073
Neg logit contributions
lihood-0.104
VERTISEMENT-0.089
-0.089
iso-0.082
archives-0.082
Token before
Token ID878
Feature activation+3.88e-02

Pos logit contributions
lihood+0.294
VERTISEMENT+0.253
+0.250
iso+0.232
archives+0.231
Neg logit contributions
 crooked-0.217
 negligent-0.211
 mutated-0.210
 tablet-0.206
 toe-0.198
Token the
Token ID262
Feature activation+7.36e-02

Pos logit contributions
lihood+0.120
VERTISEMENT+0.104
+0.103
archives+0.095
iso+0.094
Neg logit contributions
 crooked-0.092
 mutated-0.089
 negligent-0.089
 tablet-0.087
 toe-0.083
Token first
Token ID717
Feature activation+1.40e-01

Pos logit contributions
lihood+0.235
VERTISEMENT+0.202
+0.200
archives+0.185
iso+0.183
Neg logit contributions
 crooked-0.169
 mutated-0.165
 negligent-0.164
 tablet-0.161
 toe-0.153
Token pair
Token ID5166
Feature activation+9.87e-01

Pos logit contributions
lihood+0.432
VERTISEMENT+0.370
+0.364
archives+0.338
iso+0.334
Neg logit contributions
 crooked-0.339
 mutated-0.330
 negligent-0.330
 tablet-0.324
 toe-0.309
Token of
Token ID286
Feature activation-1.61e-01

Pos logit contributions
lihood+2.633
VERTISEMENT+2.181
+2.128
archives+1.962
iso+1.946
Neg logit contributions
 crooked-2.716
 negligent-2.675
 mutated-2.668
 tablet-2.590
vered-2.496
Token sneakers
Token ID42649
Feature activation+5.58e-03

Pos logit contributions
 crooked+0.391
 mutated+0.381
 negligent+0.379
 tablet+0.374
 toe+0.360
Neg logit contributions
lihood-0.491
VERTISEMENT-0.423
-0.415
archives-0.382
iso-0.379
Token sque
Token ID35776
Feature activation+2.70e-01

Pos logit contributions
lihood+0.017
VERTISEMENT+0.014
+0.014
archives+0.013
iso+0.013
Neg logit contributions
 crooked-0.014
 mutated-0.014
 negligent-0.013
 tablet-0.013
 toe-0.013
Tokenak
Token ID461
Feature activation-1.18e-01

Pos logit contributions
lihood+1.115
VERTISEMENT+0.999
+0.983
archives+0.927
iso+0.922
Neg logit contributions
 crooked-0.357
 mutated-0.345
 negligent-0.337
 tablet-0.322
vered-0.316
Token on
Token ID319
Feature activation+7.60e-02

Pos logit contributions
 crooked+0.295
 negligent+0.282
 mutated+0.282
 tablet+0.276
 toe+0.273
Neg logit contributions
lihood-0.353
VERTISEMENT-0.305
-0.300
archives-0.281
iso-0.274
Token for
Token ID329
Feature activation+7.27e-02

Pos logit contributions
 crooked+0.437
 negligent+0.429
 mutated+0.427
 tablet+0.415
 distracted+0.400
Neg logit contributions
lihood-0.451
VERTISEMENT-0.376
-0.371
archives-0.341
iso-0.337
Token consoles
Token ID22629
Feature activation-1.49e-01

Pos logit contributions
lihood+0.230
VERTISEMENT+0.195
+0.191
iso+0.178
archives+0.178
Neg logit contributions
 crooked-0.167
 mutated-0.164
 negligent-0.164
 tablet-0.162
 distracted-0.152
Token such
Token ID884
Feature activation+2.29e-01

Pos logit contributions
 crooked+0.322
 mutated+0.310
 tablet+0.310
 negligent+0.309
vered+0.287
Neg logit contributions
lihood-0.510
VERTISEMENT-0.423
-0.418
iso-0.393
archives-0.393
Token as
Token ID355
Feature activation-2.05e-01

Pos logit contributions
lihood+0.649
VERTISEMENT+0.547
+0.528
archives+0.490
iso+0.486
Neg logit contributions
 crooked-0.599
 mutated-0.588
 negligent-0.587
 tablet-0.574
vered-0.548
Token NES
Token ID31925
Feature activation+4.55e-02

Pos logit contributions
 crooked+0.455
 tablet+0.444
 negligent+0.436
 mutated+0.432
vered+0.403
Neg logit contributions
lihood-0.663
VERTISEMENT-0.596
-0.551
AGE-0.520
archives-0.516
Token
Token ID8151
Feature activation+9.74e-01

Pos logit contributions
lihood+0.129
VERTISEMENT+0.106
+0.100
archives+0.095
iso+0.093
Neg logit contributions
 crooked-0.122
 tablet-0.119
 mutated-0.118
 negligent-0.118
vered-0.111
Token,
Token ID11
Feature activation+1.33e-02

Pos logit contributions
lihood+2.733
VERTISEMENT+2.272
+2.168
iso+2.014
archives+2.011
Neg logit contributions
 crooked-2.645
 mutated-2.569
 negligent-2.562
 tablet-2.560
 toe-2.401
Token Super
Token ID3115
Feature activation+1.26e-01

Pos logit contributions
lihood+0.042
VERTISEMENT+0.035
+0.035
archives+0.032
iso+0.032
Neg logit contributions
 crooked-0.031
 mutated-0.030
 negligent-0.030
 tablet-0.030
 distracted-0.028
Token NES
Token ID31925
Feature activation+1.06e-02

Pos logit contributions
lihood+0.421
VERTISEMENT+0.350
+0.331
archives+0.313
OTOS+0.306
Neg logit contributions
 crooked-0.280
 tablet-0.278
 negligent-0.269
 mutated-0.267
vered-0.251
Token
Token ID8151
Feature activation+8.80e-01

Pos logit contributions
lihood+0.029
VERTISEMENT+0.023
+0.023
archives+0.021
iso+0.021
Neg logit contributions
 crooked-0.029
 tablet-0.028
 mutated-0.028
 negligent-0.028
vered-0.027
Token and
Token ID290
Feature activation-1.25e-02

Pos logit contributions
lihood+2.418
VERTISEMENT+2.003
+1.940
archives+1.791
iso+1.784
Neg logit contributions
 crooked-2.413
 mutated-2.357
 negligent-2.345
 tablet-2.322
 toe-2.199
Token,
Token ID11
Feature activation-1.51e-01

Pos logit contributions
 crooked+1.457
 negligent+1.415
 mutated+1.411
 tablet+1.384
vered+1.339
Neg logit contributions
lihood-1.570
VERTISEMENT-1.340
-1.278
archives-1.188
iso-1.153
Token($
Token ID16763
Feature activation-4.03e-01

Pos logit contributions
 crooked+0.434
 negligent+0.424
 mutated+0.423
 tablet+0.414
 toe+0.398
Neg logit contributions
lihood-0.385
VERTISEMENT-0.316
-0.310
archives-0.282
iso-0.277
Tokeni
Token ID72
Feature activation-1.14e-01

Pos logit contributions
 crooked+1.098
 negligent+1.072
 mutated+1.069
 tablet+1.061
vered+1.020
Neg logit contributions
lihood-1.089
VERTISEMENT-0.912
-0.887
archives-0.802
AGE-0.786
Token*
Token ID9
Feature activation-6.03e-02

Pos logit contributions
 crooked+0.323
 mutated+0.317
 negligent+0.317
 tablet+0.309
 distracted+0.297
Neg logit contributions
lihood-0.300
VERTISEMENT-0.248
-0.242
archives-0.222
iso-0.220
Token$
Token ID3
Feature activation-2.58e-01

Pos logit contributions
 crooked+0.163
 negligent+0.158
 mutated+0.158
 tablet+0.158
vered+0.151
Neg logit contributions
lihood-0.165
VERTISEMENT-0.138
-0.134
archives-0.122
iso-0.118
TokenFontSize
Token ID38160
Feature activation-1.13e+00

Pos logit contributions
 crooked+0.550
 negligent+0.534
 mutated+0.531
 tablet+0.528
vered+0.503
Neg logit contributions
lihood-0.852
VERTISEMENT-0.738
-0.718
archives-0.667
iso-0.654
Token*
Token ID9
Feature activation-9.88e-02

Pos logit contributions
 crooked+2.513
 mutated+2.500
 negligent+2.447
 tablet+2.425
vered+2.305
Neg logit contributions
lihood-3.525
VERTISEMENT-2.990
-2.962
archives-2.756
iso-2.682
Token2
Token ID17
Feature activation-1.90e-01

Pos logit contributions
 crooked+0.265
 negligent+0.258
 mutated+0.258
 tablet+0.257
vered+0.246
Neg logit contributions
lihood-0.268
VERTISEMENT-0.224
-0.217
archives-0.200
iso-0.193
Token +
Token ID1343
Feature activation+3.09e-01

Pos logit contributions
 crooked+0.452
 mutated+0.443
 negligent+0.440
 tablet+0.430
 toe+0.410
Neg logit contributions
lihood-0.578
VERTISEMENT-0.492
-0.480
archives-0.447
iso-0.437
Token $
Token ID720
Feature activation-1.30e-01

Pos logit contributions
lihood+0.762
VERTISEMENT+0.624
+0.617
archives+0.557
iso+0.532
Neg logit contributions
 crooked-0.919
 mutated-0.895
 negligent-0.894
 tablet-0.886
 toe-0.848
Tokens
Token ID82
Feature activation-2.60e-01

Pos logit contributions
 crooked+0.285
 negligent+0.278
 mutated+0.275
 tablet+0.274
vered+0.263
Neg logit contributions
lihood-0.420
VERTISEMENT-0.361
-0.352
archives-0.326
 Ramirez-0.317
Token and
Token ID290
Feature activation-8.12e-02

Pos logit contributions
 crooked+0.136
 negligent+0.133
 mutated+0.133
 tablet+0.131
 distracted+0.125
Neg logit contributions
lihood-0.157
VERTISEMENT-0.133
-0.129
archives-0.122
iso-0.120
Token evolution
Token ID6954
Feature activation-5.60e-02

Pos logit contributions
 crooked+0.189
 negligent+0.186
 mutated+0.186
 tablet+0.180
 toe+0.172
Neg logit contributions
lihood-0.252
VERTISEMENT-0.219
-0.214
archives-0.201
iso-0.199
Token.
Token ID13
Feature activation+3.46e-02

Pos logit contributions
 crooked+0.141
 mutated+0.138
 negligent+0.137
 tablet+0.135
 distracted+0.129
Neg logit contributions
lihood-0.164
VERTISEMENT-0.139
-0.135
archives-0.127
iso-0.125
Token TL
Token ID24811
Feature activation-2.98e-01

Pos logit contributions
lihood+0.099
VERTISEMENT+0.085
+0.082
archives+0.077
iso+0.075
Neg logit contributions
 crooked-0.088
 negligent-0.086
 mutated-0.086
 tablet-0.083
 Went-0.081
Token;
Token ID26
Feature activation-2.36e-01

Pos logit contributions
 crooked+0.661
 negligent+0.641
 mutated+0.641
 tablet+0.638
vered+0.606
Neg logit contributions
lihood-0.961
VERTISEMENT-0.797
-0.796
archives-0.742
iso-0.741
TokenDR
Token ID7707
Feature activation-1.11e+00

Pos logit contributions
 crooked+0.572
 mutated+0.568
 negligent+0.563
 tablet+0.544
vered+0.526
Neg logit contributions
lihood-0.715
VERTISEMENT-0.602
-0.585
iso-0.558
archives-0.545
Token:
Token ID25
Feature activation-1.56e-02

Pos logit contributions
 crooked+2.729
 negligent+2.667
 mutated+2.662
 tablet+2.626
 Went+2.549
Neg logit contributions
lihood-3.152
VERTISEMENT-2.710
-2.640
iso-2.431
archives-2.427
Token I
Token ID314
Feature activation-1.47e-01

Pos logit contributions
 crooked+0.039
 negligent+0.038
 mutated+0.038
 tablet+0.037
 distracted+0.036
Neg logit contributions
lihood-0.046
VERTISEMENT-0.040
-0.038
archives-0.036
iso-0.035
Token propose
Token ID18077
Feature activation+4.50e-03

Pos logit contributions
 crooked+0.337
 mutated+0.333
 negligent+0.330
 tablet+0.321
vered+0.308
Neg logit contributions
lihood-0.461
VERTISEMENT-0.399
-0.386
archives-0.363
iso-0.361
Token we
Token ID356
Feature activation-1.00e-01

Pos logit contributions
lihood+0.014
VERTISEMENT+0.012
+0.012
archives+0.011
iso+0.011
Neg logit contributions
 crooked-0.010
 negligent-0.010
 mutated-0.010
 tablet-0.010
 distracted-0.010
Token work
Token ID670
Feature activation-1.33e-01

Pos logit contributions
 crooked+0.230
 mutated+0.227
 negligent+0.226
 tablet+0.220
 toe+0.210
Neg logit contributions
lihood-0.315
VERTISEMENT-0.272
-0.263
archives-0.247
iso-0.245
Token program
Token ID1430
Feature activation-9.33e-02

Pos logit contributions
 crooked+0.598
 negligent+0.584
 mutated+0.580
 tablet+0.580
 distracted+0.543
Neg logit contributions
lihood-0.761
VERTISEMENT-0.659
-0.636
iso-0.592
archives-0.590
Token,
Token ID11
Feature activation-1.26e-01

Pos logit contributions
 crooked+0.251
 mutated+0.247
 negligent+0.246
 tablet+0.242
 distracted+0.232
Neg logit contributions
lihood-0.257
VERTISEMENT-0.216
-0.210
iso-0.193
archives-0.193
Token marks
Token ID8849
Feature activation-9.28e-02

Pos logit contributions
 crooked+0.234
 mutated+0.229
 negligent+0.226
 tablet+0.220
 distracted+0.209
Neg logit contributions
lihood-0.456
VERTISEMENT-0.403
-0.394
iso-0.371
archives-0.371
Token a
Token ID257
Feature activation+2.03e-03

Pos logit contributions
 crooked+0.203
 mutated+0.198
 negligent+0.197
 tablet+0.192
 distracted+0.182
Neg logit contributions
lihood-0.303
VERTISEMENT-0.263
-0.257
iso-0.240
archives-0.239
Token new
Token ID649
Feature activation-3.40e-02

Pos logit contributions
lihood+0.006
VERTISEMENT+0.005
+0.005
archives+0.005
iso+0.005
Neg logit contributions
 crooked-0.005
 mutated-0.005
 negligent-0.005
 tablet-0.004
 toe-0.004
Token chapter
Token ID6843
Feature activation-1.09e+00

Pos logit contributions
 crooked+0.082
 mutated+0.080
 negligent+0.080
 tablet+0.078
 toe+0.075
Neg logit contributions
lihood-0.104
VERTISEMENT-0.090
-0.088
archives-0.083
iso-0.082
Token in
Token ID287
Feature activation+3.10e-02

Pos logit contributions
 crooked+2.757
 mutated+2.708
 negligent+2.676
 tablet+2.664
 toe+2.555
Neg logit contributions
lihood-3.095
VERTISEMENT-2.711
-2.603
archives-2.451
iso-2.373
Token appreciation
Token ID19163
Feature activation-2.29e-01

Pos logit contributions
lihood+0.098
VERTISEMENT+0.085
+0.083
iso+0.077
archives+0.076
Neg logit contributions
 crooked-0.072
 mutated-0.070
 negligent-0.070
 tablet-0.068
 distracted-0.065
Token of
Token ID286
Feature activation-2.51e-02

Pos logit contributions
 crooked+0.568
 negligent+0.552
 mutated+0.552
 tablet+0.545
 distracted+0.519
Neg logit contributions
lihood-0.688
VERTISEMENT-0.589
-0.571
iso-0.530
archives-0.526
Token more
Token ID517
Feature activation-2.21e-01

Pos logit contributions
 crooked+0.057
 mutated+0.055
 negligent+0.055
 tablet+0.053
 distracted+0.051
Neg logit contributions
lihood-0.081
VERTISEMENT-0.071
-0.069
iso-0.064
archives-0.064
Token adventurous
Token ID39770
Feature activation-1.84e-01

Pos logit contributions
 crooked+0.540
 mutated+0.528
 negligent+0.524
 tablet+0.510
 distracted+0.493
Neg logit contributions
lihood-0.673
VERTISEMENT-0.586
-0.565
iso-0.528
archives-0.520
Token must
Token ID1276
Feature activation+1.18e-01

Pos logit contributions
lihood+0.195
VERTISEMENT+0.170
+0.167
archives+0.158
iso+0.155
Neg logit contributions
 crooked-0.114
 negligent-0.111
 mutated-0.111
 tablet-0.107
 toe-0.101
Token consider
Token ID2074
Feature activation+7.76e-02

Pos logit contributions
lihood+0.382
VERTISEMENT+0.332
+0.326
archives+0.305
iso+0.302
Neg logit contributions
 crooked-0.261
 negligent-0.256
 mutated-0.256
 tablet-0.249
 toe-0.239
Token the
Token ID262
Feature activation-1.87e-01

Pos logit contributions
lihood+0.242
VERTISEMENT+0.211
+0.206
archives+0.193
iso+0.192
Neg logit contributions
 crooked-0.179
 negligent-0.177
 mutated-0.176
 tablet-0.169
 toe-0.162
Token pros
Token ID10360
Feature activation-5.26e-01

Pos logit contributions
 crooked+0.456
 negligent+0.451
 mutated+0.448
 tablet+0.433
 distracted+0.413
Neg logit contributions
lihood-0.557
VERTISEMENT-0.480
-0.471
archives-0.444
iso-0.437
Token and
Token ID290
Feature activation+2.66e-01

Pos logit contributions
 crooked+1.284
 negligent+1.277
 mutated+1.236
 tablet+1.222
 toe+1.183
Neg logit contributions
lihood-1.539
VERTISEMENT-1.329
-1.292
archives-1.227
AGE-1.167
Token cons
Token ID762
Feature activation-1.07e+00

Pos logit contributions
lihood+0.981
VERTISEMENT+0.863
+0.840
iso+0.800
archives+0.795
Neg logit contributions
 crooked-0.475
 negligent-0.463
 mutated-0.453
 tablet-0.435
 toe-0.414
Token.
Token ID13
Feature activation+6.26e-02

Pos logit contributions
 crooked+2.514
 negligent+2.491
 mutated+2.453
 tablet+2.405
 toe+2.326
Neg logit contributions
lihood-3.186
VERTISEMENT-2.721
-2.678
archives-2.541
iso-2.455
Token For
Token ID1114
Feature activation-2.01e-02

Pos logit contributions
lihood+0.181
VERTISEMENT+0.154
+0.151
archives+0.141
iso+0.138
Neg logit contributions
 crooked-0.157
 negligent-0.155
 mutated-0.154
 tablet-0.149
 Went-0.145
Token example
Token ID1672
Feature activation+1.38e-03

Pos logit contributions
 crooked+0.048
 negligent+0.047
 mutated+0.047
 tablet+0.045
 distracted+0.043
Neg logit contributions
lihood-0.061
VERTISEMENT-0.053
-0.052
archives-0.048
iso-0.048
Token,
Token ID11
Feature activation-2.61e-02

Pos logit contributions
lihood+0.004
VERTISEMENT+0.003
+0.003
archives+0.003
iso+0.003
Neg logit contributions
 crooked-0.004
 negligent-0.004
 mutated-0.004
 tablet-0.004
 distracted-0.004
Token maybe
Token ID3863
Feature activation-6.00e-02

Pos logit contributions
 crooked+0.060
 negligent+0.059
 mutated+0.058
 tablet+0.057
 toe+0.054
Neg logit contributions
lihood-0.082
VERTISEMENT-0.072
-0.070
archives-0.065
iso-0.065
Token universe
Token ID6881
Feature activation-1.20e-01

Pos logit contributions
 crooked+0.143
 mutated+0.141
 negligent+0.139
 tablet+0.138
 toe+0.130
Neg logit contributions
lihood-0.193
VERTISEMENT-0.167
-0.163
archives-0.152
iso-0.152
Token.
Token ID13
Feature activation-1.54e-01

Pos logit contributions
 crooked+0.298
 mutated+0.296
 tablet+0.292
 negligent+0.290
 toe+0.274
Neg logit contributions
lihood-0.360
VERTISEMENT-0.304
-0.297
archives-0.273
iso-0.273
Token We
Token ID775
Feature activation-1.81e-01

Pos logit contributions
 crooked+0.375
 mutated+0.367
 negligent+0.365
 tablet+0.357
 Went+0.351
Neg logit contributions
lihood-0.459
VERTISEMENT-0.392
-0.382
archives-0.356
iso-0.348
Token can
Token ID460
Feature activation-2.87e-01

Pos logit contributions
 crooked+0.396
 mutated+0.390
 negligent+0.385
 tablet+0.376
 distracted+0.359
Neg logit contributions
lihood-0.593
VERTISEMENT-0.516
-0.503
archives-0.471
iso-0.467
Token't
Token ID470
Feature activation-6.96e-02

Pos logit contributions
 crooked+0.693
 mutated+0.675
 negligent+0.675
 tablet+0.664
 toe+0.638
Neg logit contributions
lihood-0.878
VERTISEMENT-0.757
-0.735
archives-0.684
iso-0.674
Token wait
Token ID4043
Feature activation-1.05e+00

Pos logit contributions
 crooked+0.151
 negligent+0.149
 mutated+0.147
 tablet+0.144
 toe+0.139
Neg logit contributions
lihood-0.229
VERTISEMENT-0.201
-0.195
archives-0.181
iso-0.180
Token for
Token ID329
Feature activation+2.55e-01

Pos logit contributions
 crooked+2.515
 mutated+2.459
 negligent+2.438
 tablet+2.394
 toe+2.321
Neg logit contributions
lihood-3.147
VERTISEMENT-2.737
-2.665
archives-2.490
iso-2.425
Token you
Token ID345
Feature activation-3.33e-01

Pos logit contributions
lihood+0.755
VERTISEMENT+0.644
+0.625
archives+0.581
iso+0.574
Neg logit contributions
 crooked-0.642
 mutated-0.631
 negligent-0.627
 tablet-0.614
 distracted-0.586
Token to
Token ID284
Feature activation-3.88e-02

Pos logit contributions
 crooked+0.799
 negligent+0.780
 mutated+0.778
 tablet+0.769
 distracted+0.734
Neg logit contributions
lihood-1.030
VERTISEMENT-0.873
-0.852
archives-0.791
iso-0.785
Token see
Token ID766
Feature activation+2.18e-01

Pos logit contributions
 crooked+0.090
 mutated+0.088
 negligent+0.088
 tablet+0.086
 toe+0.082
Neg logit contributions
lihood-0.122
VERTISEMENT-0.104
-0.102
archives-0.095
iso-0.095
Token it
Token ID340
Feature activation-2.58e-01

Pos logit contributions
lihood+0.684
VERTISEMENT+0.590
+0.573
archives+0.537
iso+0.534
Neg logit contributions
 crooked-0.505
 mutated-0.496
 negligent-0.492
 tablet-0.480
 distracted-0.456
Token her
Token ID607
Feature activation-1.11e-01

Pos logit contributions
lihood+0.163
VERTISEMENT+0.140
+0.138
iso+0.128
archives+0.128
Neg logit contributions
 crooked-0.115
 negligent-0.111
 mutated-0.110
 tablet-0.108
 toe-0.104
Token tightly
Token ID17707
Feature activation-4.58e-01

Pos logit contributions
 crooked+0.230
 negligent+0.217
 mutated+0.216
 tablet+0.213
 toe+0.205
Neg logit contributions
lihood-0.390
VERTISEMENT-0.340
-0.335
archives-0.312
iso-0.312
Token.
Token ID13
Feature activation+1.59e-01

Pos logit contributions
 crooked+1.137
 negligent+1.091
 mutated+1.083
 tablet+1.057
 distracted+1.044
Neg logit contributions
lihood-1.390
VERTISEMENT-1.196
-1.180
iso-1.073
archives-1.072
Token She
Token ID1375
Feature activation+3.04e-01

Pos logit contributions
lihood+0.448
VERTISEMENT+0.376
+0.368
archives+0.338
iso+0.334
Neg logit contributions
 crooked-0.418
 negligent-0.407
 mutated-0.404
 tablet-0.398
 Went-0.385
Token gr
Token ID1036
Feature activation-2.87e-03

Pos logit contributions
lihood+0.945
VERTISEMENT+0.809
+0.790
archives+0.734
iso+0.729
Neg logit contributions
 crooked-0.726
 mutated-0.702
 negligent-0.702
 tablet-0.681
 distracted-0.657
Tokenins
Token ID1040
Feature activation-1.01e+00

Pos logit contributions
 crooked+0.009
 mutated+0.009
 negligent+0.009
 tablet+0.009
vered+0.009
Neg logit contributions
lihood-0.007
VERTISEMENT-0.005
-0.005
AGE-0.004
archives-0.004
Token at
Token ID379
Feature activation-6.76e-02

Pos logit contributions
 crooked+2.477
 negligent+2.330
 mutated+2.326
 distracted+2.244
 tablet+2.231
Neg logit contributions
lihood-3.134
VERTISEMENT-2.706
-2.645
iso-2.468
archives-2.466
Token me
Token ID502
Feature activation-1.43e-02

Pos logit contributions
 crooked+0.191
 negligent+0.187
 mutated+0.186
 tablet+0.183
 distracted+0.178
Neg logit contributions
lihood-0.182
VERTISEMENT-0.153
-0.147
archives-0.135
iso-0.133
Token.
Token ID13
Feature activation-4.92e-03

Pos logit contributions
 crooked+0.037
 negligent+0.035
 mutated+0.035
 tablet+0.034
 distracted+0.033
Neg logit contributions
lihood-0.042
VERTISEMENT-0.036
-0.035
archives-0.033
iso-0.033
Token God
Token ID1793
Feature activation+3.01e-01

Pos logit contributions
 crooked+0.013
 negligent+0.013
 mutated+0.013
 tablet+0.013
 Went+0.012
Neg logit contributions
lihood-0.014
VERTISEMENT-0.011
-0.011
archives-0.010
iso-0.010
Token,
Token ID11
Feature activation+5.77e-02

Pos logit contributions
lihood+0.846
VERTISEMENT+0.704
+0.693
archives+0.640
iso+0.622
Neg logit contributions
 crooked-0.809
 mutated-0.791
 negligent-0.790
 tablet-0.776
 distracted-0.742
Token devices
Token ID4410
Feature activation+3.93e-02

Pos logit contributions
 crooked+0.467
 mutated+0.458
 tablet+0.457
 negligent+0.454
 toe+0.435
Neg logit contributions
lihood-0.546
VERTISEMENT-0.476
-0.456
archives-0.424
iso-0.419
Token on
Token ID319
Feature activation-2.07e-01

Pos logit contributions
lihood+0.114
VERTISEMENT+0.098
+0.095
archives+0.088
iso+0.087
Neg logit contributions
 crooked-0.100
 mutated-0.099
 negligent-0.098
 tablet-0.096
 distracted-0.092
Token all
Token ID477
Feature activation+7.28e-02

Pos logit contributions
 crooked+0.456
 mutated+0.446
 negligent+0.442
 tablet+0.441
 toe+0.412
Neg logit contributions
lihood-0.671
VERTISEMENT-0.589
-0.575
archives-0.536
iso-0.530
Token previous
Token ID2180
Feature activation+2.73e-01

Pos logit contributions
lihood+0.228
VERTISEMENT+0.197
+0.190
archives+0.178
iso+0.177
Neg logit contributions
 crooked-0.170
 mutated-0.166
 negligent-0.164
 tablet-0.164
 toe-0.154
Token Mars
Token ID8706
Feature activation-1.66e-01

Pos logit contributions
lihood+0.827
VERTISEMENT+0.714
+0.689
archives+0.644
iso+0.639
Neg logit contributions
 crooked-0.666
 mutated-0.653
 negligent-0.647
 tablet-0.640
 toe-0.608
Token ro
Token ID686
Feature activation-9.93e-01

Pos logit contributions
 crooked+0.426
 mutated+0.418
 tablet+0.415
 negligent+0.410
 toe+0.396
Neg logit contributions
lihood-0.477
VERTISEMENT-0.410
-0.394
archives-0.364
iso-0.359
Tokenvers
Token ID690
Feature activation-7.47e-02

Pos logit contributions
 crooked+1.611
vered+1.584
 negligent+1.550
 mutated+1.540
 tablet+1.522
Neg logit contributions
lihood-3.722
VERTISEMENT-3.300
-3.163
AGE-2.983
archives-2.919
Token,
Token ID11
Feature activation-2.59e-02

Pos logit contributions
 crooked+0.189
 mutated+0.187
 negligent+0.186
 tablet+0.181
 distracted+0.175
Neg logit contributions
lihood-0.217
VERTISEMENT-0.186
-0.181
archives-0.168
iso-0.165
Token which
Token ID543
Feature activation+6.30e-02

Pos logit contributions
 crooked+0.062
 mutated+0.062
 negligent+0.061
 tablet+0.060
 toe+0.057
Neg logit contributions
lihood-0.079
VERTISEMENT-0.069
-0.066
archives-0.062
iso-0.061
Token have
Token ID423
Feature activation-1.46e-01

Pos logit contributions
lihood+0.217
VERTISEMENT+0.189
+0.186
archives+0.175
iso+0.173
Neg logit contributions
 crooked-0.126
 mutated-0.124
 negligent-0.122
 tablet-0.120
 distracted-0.113
Token examined
Token ID11068
Feature activation+1.73e-01

Pos logit contributions
 crooked+0.289
 mutated+0.284
 negligent+0.280
 tablet+0.273
 distracted+0.259
Neg logit contributions
lihood-0.508
VERTISEMENT-0.444
-0.437
archives-0.411
iso-0.408
Token the
Token ID262
Feature activation-8.75e-02

Pos logit contributions
lihood+0.379
VERTISEMENT+0.324
+0.320
archives+0.299
iso+0.295
Neg logit contributions
 crooked-0.286
 mutated-0.281
 negligent-0.281
 tablet-0.269
 distracted-0.260
Token other
Token ID584
Feature activation-2.09e-01

Pos logit contributions
 crooked+0.209
 negligent+0.206
 mutated+0.206
 tablet+0.198
 distracted+0.190
Neg logit contributions
lihood-0.271
VERTISEMENT-0.233
-0.229
archives-0.215
iso-0.212
Token.
Token ID13
Feature activation-2.36e-02

Pos logit contributions
 crooked+0.520
 mutated+0.516
 negligent+0.513
 tablet+0.499
 distracted+0.481
Neg logit contributions
lihood-0.622
VERTISEMENT-0.520
-0.517
archives-0.491
iso-0.481
Token The
Token ID383
Feature activation-1.27e-01

Pos logit contributions
 crooked+0.061
 mutated+0.060
 negligent+0.060
 tablet+0.058
 Went+0.057
Neg logit contributions
lihood-0.066
VERTISEMENT-0.056
-0.055
archives-0.051
iso-0.050
Token old
Token ID1468
Feature activation-2.32e-02

Pos logit contributions
 crooked+0.293
 mutated+0.289
 negligent+0.288
 tablet+0.278
 distracted+0.265
Neg logit contributions
lihood-0.402
VERTISEMENT-0.345
-0.337
archives-0.317
iso-0.314
Token guard
Token ID4860
Feature activation-9.86e-01

Pos logit contributions
 crooked+0.054
 mutated+0.053
 negligent+0.053
 tablet+0.051
 toe+0.049
Neg logit contributions
lihood-0.073
VERTISEMENT-0.062
-0.060
archives-0.058
iso-0.057
Token of
Token ID286
Feature activation-2.66e-01

Pos logit contributions
 crooked+2.138
 mutated+2.127
 negligent+2.082
 tablet+1.992
 distracted+1.936
Neg logit contributions
lihood-3.206
VERTISEMENT-2.775
-2.743
archives-2.603
iso-2.557
Token medicine
Token ID9007
Feature activation-9.98e-02

Pos logit contributions
 crooked+0.599
 negligent+0.582
 mutated+0.581
 tablet+0.558
 toe+0.534
Neg logit contributions
lihood-0.867
VERTISEMENT-0.739
-0.726
archives-0.684
iso-0.681
Token also
Token ID635
Feature activation-2.11e-01

Pos logit contributions
 crooked+0.205
 mutated+0.201
 negligent+0.200
 tablet+0.191
 distracted+0.184
Neg logit contributions
lihood-0.343
VERTISEMENT-0.296
-0.292
archives-0.274
iso-0.274
Token turned
Token ID2900
Feature activation-5.32e-01

Pos logit contributions
 crooked+0.391
 mutated+0.385
 negligent+0.378
 tablet+0.360
 distracted+0.347
Neg logit contributions
lihood-0.764
VERTISEMENT-0.676
-0.663
archives-0.627
iso-0.625
Token to
Token ID284
Feature activation+3.31e-01

Pos logit contributions
 crooked+1.268
 negligent+1.242
 mutated+1.238
 tablet+1.193
 toe+1.156
Neg logit contributions
lihood-1.639
VERTISEMENT-1.413
-1.402
archives-1.310
iso-1.298
Token-
Token ID12
Feature activation-2.17e-01

Pos logit contributions
 crooked+0.891
 mutated+0.879
 negligent+0.874
 tablet+0.867
vered+0.829
Neg logit contributions
lihood-1.044
VERTISEMENT-0.883
-0.861
archives-0.778
AGE-0.774
Tokenmaster
Token ID9866
Feature activation-6.18e-02

Pos logit contributions
 crooked+0.528
 mutated+0.525
 negligent+0.522
 tablet+0.519
vered+0.503
Neg logit contributions
lihood-0.637
VERTISEMENT-0.541
-0.523
archives-0.474
AGE-0.472
Token.
Token ID13
Feature activation-8.73e-03

Pos logit contributions
 crooked+0.156
 negligent+0.154
 mutated+0.154
 tablet+0.152
vered+0.146
Neg logit contributions
lihood-0.178
VERTISEMENT-0.151
-0.146
archives-0.133
AGE-0.133
Tokenzip
Token ID13344
Feature activation-5.19e-01

Pos logit contributions
 crooked+0.023
 mutated+0.022
 negligent+0.022
 tablet+0.022
vered+0.021
Neg logit contributions
lihood-0.025
VERTISEMENT-0.021
-0.020
AGE-0.018
archives-0.018
Token.
Token ID13
Feature activation+6.27e-03

Pos logit contributions
 crooked+1.319
 mutated+1.292
 negligent+1.286
 tablet+1.270
 toe+1.205
Neg logit contributions
lihood-1.505
VERTISEMENT-1.263
-1.243
archives-1.128
iso-1.120
Token Wait
Token ID16314
Feature activation-9.78e-01

Pos logit contributions
lihood+0.019
VERTISEMENT+0.016
+0.016
archives+0.015
iso+0.014
Neg logit contributions
 crooked-0.015
 mutated-0.015
 negligent-0.015
 tablet-0.014
vered-0.014
Token until
Token ID1566
Feature activation-4.60e-02

Pos logit contributions
 crooked+2.132
 negligent+2.104
 mutated+2.089
 tablet+2.075
 distracted+1.965
Neg logit contributions
lihood-3.118
-2.661
VERTISEMENT-2.646
archives-2.545
AGE-2.424
Token you
Token ID345
Feature activation+3.24e-02

Pos logit contributions
 crooked+0.106
 mutated+0.104
 negligent+0.103
 tablet+0.103
 distracted+0.097
Neg logit contributions
lihood-0.145
-0.122
VERTISEMENT-0.122
archives-0.115
AGE-0.112
Token see
Token ID766
Feature activation+3.34e-01

Pos logit contributions
lihood+0.107
VERTISEMENT+0.092
+0.091
archives+0.086
iso+0.084
Neg logit contributions
 crooked-0.069
 mutated-0.069
 negligent-0.068
 tablet-0.066
 toe-0.063
Token the
Token ID262
Feature activation+5.30e-02

Pos logit contributions
lihood+1.035
VERTISEMENT+0.878
+0.867
archives+0.812
iso+0.797
Neg logit contributions
 crooked-0.787
 mutated-0.773
 negligent-0.769
 tablet-0.753
 distracted-0.714
Token add
Token ID751
Feature activation+1.30e-01

Pos logit contributions
lihood+0.171
VERTISEMENT+0.144
+0.144
archives+0.135
iso+0.131
Neg logit contributions
 crooked-0.120
 mutated-0.117
 negligent-0.116
 tablet-0.115
 toe-0.108
Token
Token ID198
Feature activation-1.94e-02

Pos logit contributions
lihood+0.061
VERTISEMENT+0.051
+0.049
archives+0.045
iso+0.045
Neg logit contributions
 crooked-0.063
 mutated-0.062
 negligent-0.062
 tablet-0.060
vered-0.058
Token"
Token ID1
Feature activation-5.32e-02

Pos logit contributions
 crooked+0.050
 negligent+0.050
 mutated+0.049
 tablet+0.048
vered+0.048
Neg logit contributions
lihood-0.054
VERTISEMENT-0.045
-0.043
archives-0.040
iso-0.040
TokenDef
Token ID7469
Feature activation-8.88e-02

Pos logit contributions
 negligent+0.141
 crooked+0.141
 mutated+0.138
 tablet+0.134
vered+0.132
Neg logit contributions
lihood-0.145
VERTISEMENT-0.121
-0.118
archives-0.108
iso-0.106
Tokenendants
Token ID30841
Feature activation-2.65e-01

Pos logit contributions
 negligent+0.168
 crooked+0.167
 mutated+0.162
 tablet+0.158
vered+0.151
Neg logit contributions
lihood-0.312
VERTISEMENT-0.273
-0.271
archives-0.252
AGE-0.251
Token'
Token ID6
Feature activation-4.38e-01

Pos logit contributions
 crooked+0.580
 negligent+0.577
 mutated+0.570
 tablet+0.551
 distracted+0.530
Neg logit contributions
lihood-0.857
VERTISEMENT-0.751
-0.734
archives-0.688
iso-0.682
Token failure
Token ID5287
Feature activation-9.76e-01

Pos logit contributions
 negligent+0.946
 crooked+0.935
 mutated+0.917
 tablet+0.887
 distracted+0.853
Neg logit contributions
lihood-1.424
VERTISEMENT-1.247
-1.238
archives-1.155
iso-1.154
Token to
Token ID284
Feature activation-1.32e-01

Pos logit contributions
 negligent+2.442
 crooked+2.436
 mutated+2.384
 tablet+2.330
 distracted+2.263
Neg logit contributions
lihood-2.803
VERTISEMENT-2.400
-2.354
archives-2.194
iso-2.146
Token provide
Token ID2148
Feature activation+2.09e-01

Pos logit contributions
 crooked+0.295
 negligent+0.292
 mutated+0.287
 tablet+0.281
 toe+0.274
Neg logit contributions
lihood-0.423
VERTISEMENT-0.369
-0.363
archives-0.338
iso-0.334
Token any
Token ID597
Feature activation-1.02e-01

Pos logit contributions
lihood+0.664
VERTISEMENT+0.571
+0.564
archives+0.528
iso+0.524
Neg logit contributions
 crooked-0.475
 negligent-0.473
 mutated-0.462
 tablet-0.448
 distracted-0.431
Token form
Token ID1296
Feature activation-2.00e-01

Pos logit contributions
 crooked+0.228
 negligent+0.228
 mutated+0.222
 tablet+0.216
 toe+0.207
Neg logit contributions
lihood-0.328
VERTISEMENT-0.282
-0.279
archives-0.263
iso-0.257
Token of
Token ID286
Feature activation-2.68e-01

Pos logit contributions
 crooked+0.526
 negligent+0.518
 mutated+0.516
 tablet+0.503
 toe+0.482
Neg logit contributions
lihood-0.557
VERTISEMENT-0.467
-0.457
archives-0.423
iso-0.417
Token judge
Token ID5052
Feature activation+1.42e-01

Pos logit contributions
lihood+0.096
VERTISEMENT+0.083
+0.082
archives+0.076
iso+0.076
Neg logit contributions
 crooked-0.066
 negligent-0.064
 mutated-0.064
 tablet-0.062
 distracted-0.059
Token,
Token ID11
Feature activation+3.82e-02

Pos logit contributions
lihood+0.402
VERTISEMENT+0.346
+0.332
archives+0.311
AGE+0.304
Neg logit contributions
 crooked-0.372
 negligent-0.367
 mutated-0.364
 tablet-0.353
 distracted-0.344
Token so
Token ID523
Feature activation-1.04e-01

Pos logit contributions
lihood+0.119
VERTISEMENT+0.103
+0.099
archives+0.093
iso+0.092
Neg logit contributions
 crooked-0.091
 negligent-0.089
 mutated-0.088
 tablet-0.086
 distracted-0.083
Token his
Token ID465
Feature activation-1.35e-01

Pos logit contributions
 crooked+0.254
 negligent+0.248
 mutated+0.245
 tablet+0.239
 distracted+0.230
Neg logit contributions
lihood-0.310
VERTISEMENT-0.268
-0.265
archives-0.245
iso-0.243
Token administration
Token ID3662
Feature activation-7.77e-02

Pos logit contributions
 crooked+0.315
 negligent+0.308
 mutated+0.307
 tablet+0.299
 distracted+0.285
Neg logit contributions
lihood-0.421
VERTISEMENT-0.369
-0.360
archives-0.335
iso-0.332
Token turned
Token ID2900
Feature activation-9.71e-01

Pos logit contributions
 crooked+0.157
 mutated+0.154
 negligent+0.152
 tablet+0.147
 distracted+0.141
Neg logit contributions
lihood-0.267
VERTISEMENT-0.235
-0.231
archives-0.216
AGE-0.214
Token its
Token ID663
Feature activation-1.06e-01

Pos logit contributions
 crooked+2.312
 negligent+2.279
 mutated+2.245
 tablet+2.184
 distracted+2.114
Neg logit contributions
lihood-2.964
VERTISEMENT-2.621
-2.578
archives-2.369
iso-2.324
Token eye
Token ID4151
Feature activation+3.70e-01

Pos logit contributions
 crooked+0.277
 negligent+0.271
 mutated+0.270
 tablet+0.266
 toe+0.258
Neg logit contributions
lihood-0.307
VERTISEMENT-0.262
-0.254
archives-0.235
iso-0.234
Token to
Token ID284
Feature activation-6.69e-02

Pos logit contributions
lihood+1.045
VERTISEMENT+0.886
+0.861
archives+0.801
iso+0.786
Neg logit contributions
 crooked-0.976
 negligent-0.952
 mutated-0.952
 tablet-0.927
 distracted-0.893
Token a
Token ID257
Feature activation-1.81e-01

Pos logit contributions
 crooked+0.153
 negligent+0.149
 mutated+0.148
 tablet+0.145
 distracted+0.138
Neg logit contributions
lihood-0.212
VERTISEMENT-0.185
-0.182
archives-0.168
iso-0.167
Token different
Token ID1180
Feature activation+2.89e-01

Pos logit contributions
 crooked+0.411
 negligent+0.398
 mutated+0.397
 tablet+0.388
 distracted+0.368
Neg logit contributions
lihood-0.579
VERTISEMENT-0.503
-0.496
archives-0.464
iso-0.459
TokenS
Token ID50
Feature activation+1.51e-01

Pos logit contributions
 crooked+0.581
 negligent+0.572
 tablet+0.566
 mutated+0.565
vered+0.539
Neg logit contributions
lihood-0.577
-0.473
VERTISEMENT-0.472
archives-0.433
iso-0.431
Token Im
Token ID1846
Feature activation+4.93e-01

Pos logit contributions
lihood+0.440
VERTISEMENT+0.367
+0.364
archives+0.338
iso+0.334
Neg logit contributions
 crooked-0.382
 negligent-0.374
 mutated-0.369
 tablet-0.368
 distracted-0.351
Tokenperson
Token ID6259
Feature activation+1.05e-01

Pos logit contributions
lihood+1.524
VERTISEMENT+1.297
+1.265
iso+1.161
archives+1.161
Neg logit contributions
 crooked-1.171
 negligent-1.150
 tablet-1.127
 mutated-1.122
vered-1.067
Tokenation
Token ID341
Feature activation+3.20e-02

Pos logit contributions
lihood+0.326
VERTISEMENT+0.281
+0.276
archives+0.255
iso+0.250
Neg logit contributions
 crooked-0.244
 mutated-0.238
 negligent-0.237
 tablet-0.236
 distracted-0.224
Token Sc
Token ID1446
Feature activation+4.24e-03

Pos logit contributions
lihood+0.089
+0.075
VERTISEMENT+0.074
archives+0.069
iso+0.068
Neg logit contributions
 crooked-0.084
 negligent-0.084
 tablet-0.082
 mutated-0.082
 distracted-0.079
Tokenam
Token ID321
Feature activation-9.57e-01

Pos logit contributions
lihood+0.014
+0.012
VERTISEMENT+0.012
archives+0.011
AGE+0.011
Neg logit contributions
 crooked-0.009
 negligent-0.009
 tablet-0.009
 mutated-0.009
vered-0.008
Token Reporting
Token ID29595
Feature activation+3.42e-01

Pos logit contributions
 crooked+2.282
 tablet+2.271
 negligent+2.259
 mutated+2.191
 Went+2.154
Neg logit contributions
lihood-2.813
-2.401
VERTISEMENT-2.311
archives-2.200
iso-2.149
Token
Token ID447
Feature activation-2.59e-01

Pos logit contributions
lihood+0.959
VERTISEMENT+0.802
+0.798
archives+0.730
iso+0.722
Neg logit contributions
 crooked-0.907
 negligent-0.889
 tablet-0.880
 mutated-0.880
 distracted-0.833
Token
Token ID251
Feature activation-1.95e-03

Pos logit contributions
 crooked+0.458
 negligent+0.447
 mutated+0.444
 tablet+0.427
vered+0.404
Neg logit contributions
lihood-0.947
VERTISEMENT-0.828
-0.813
archives-0.773
iso-0.767
Token form
Token ID1296
Feature activation-6.64e-02

Pos logit contributions
 crooked+0.005
 negligent+0.005
 tablet+0.005
 mutated+0.005
 distracted+0.004
Neg logit contributions
lihood-0.006
-0.005
VERTISEMENT-0.005
archives-0.005
iso-0.004
Token on
Token ID319
Feature activation+1.18e-01

Pos logit contributions
 crooked+0.164
 mutated+0.161
 negligent+0.160
 tablet+0.159
 distracted+0.150
Neg logit contributions
lihood-0.199
VERTISEMENT-0.165
-0.165
archives-0.152
iso-0.152
Token Bills
Token ID20209
Feature activation-5.89e-01

Pos logit contributions
 crooked+0.838
 mutated+0.813
 negligent+0.809
 tablet+0.759
 Went+0.747
Neg logit contributions
lihood-1.633
VERTISEMENT-1.432
-1.412
archives-1.336
iso-1.330
Token:
Token ID25
Feature activation+3.34e-01

Pos logit contributions
 crooked+1.513
 negligent+1.478
 mutated+1.475
 tablet+1.433
 toe+1.397
Neg logit contributions
lihood-1.671
VERTISEMENT-1.438
-1.397
iso-1.303
archives-1.293
Token G
Token ID402
Feature activation+2.05e-01

Pos logit contributions
lihood+1.064
VERTISEMENT+0.914
+0.898
archives+0.841
iso+0.833
Neg logit contributions
 crooked-0.759
 negligent-0.739
 mutated-0.737
 tablet-0.714
 Went-0.685
Token Richie
Token ID49209
Feature activation+1.63e-01

Pos logit contributions
lihood+0.652
VERTISEMENT+0.557
+0.543
archives+0.511
iso+0.505
Neg logit contributions
 crooked-0.468
 mutated-0.457
 negligent-0.455
 tablet-0.443
 toe-0.425
Token Inc
Token ID3457
Feature activation+1.12e-01

Pos logit contributions
lihood+0.393
VERTISEMENT+0.320
+0.309
archives+0.281
iso+0.278
Neg logit contributions
 crooked-0.494
 negligent-0.486
 mutated-0.486
 tablet-0.474
 distracted-0.457
Tokenogn
Token ID2360
Feature activation-9.46e-01

Pos logit contributions
lihood+0.425
+0.373
VERTISEMENT+0.373
archives+0.346
AGE+0.339
Neg logit contributions
 negligent-0.182
 crooked-0.181
 mutated-0.180
 tablet-0.170
vered-0.167
Tokenito
Token ID10094
Feature activation+2.85e-02

Pos logit contributions
 crooked+2.325
 mutated+2.295
 tablet+2.282
 negligent+2.282
athan+2.189
Neg logit contributions
lihood-2.761
VERTISEMENT-2.338
-2.222
archives-2.076
AGE-1.985
Token
Token ID198
Feature activation+6.50e-03

Pos logit contributions
lihood+0.080
VERTISEMENT+0.067
+0.065
archives+0.061
iso+0.060
Neg logit contributions
 crooked-0.076
 mutated-0.074
 negligent-0.074
 tablet-0.072
 toe-0.070
Token
Token ID198
Feature activation-1.58e-02

Pos logit contributions
lihood+0.017
VERTISEMENT+0.014
+0.014
archives+0.013
iso+0.013
Neg logit contributions
 crooked-0.018
 negligent-0.018
 mutated-0.018
 tablet-0.017
vered-0.017
TokenContract
Token ID45845
Feature activation+4.09e-01

Pos logit contributions
 crooked+0.040
 negligent+0.040
 mutated+0.039
 tablet+0.038
vered+0.038
Neg logit contributions
lihood-0.044
VERTISEMENT-0.037
-0.036
iso-0.033
archives-0.033
Token:
Token ID25
Feature activation+1.88e-01

Pos logit contributions
lihood+1.175
VERTISEMENT+0.983
+0.966
archives+0.895
iso+0.888
Neg logit contributions
 crooked-1.053
 mutated-1.036
 negligent-1.031
 tablet-1.007
vered-0.966
Token nowhere
Token ID12062
Feature activation+2.72e-01

Pos logit contributions
lihood+1.166
VERTISEMENT+1.002
+0.975
archives+0.909
iso+0.907
Neg logit contributions
 crooked-0.893
 negligent-0.873
 mutated-0.872
 tablet-0.843
 toe-0.810
Token,
Token ID11
Feature activation+2.02e-01

Pos logit contributions
lihood+0.759
VERTISEMENT+0.650
+0.623
archives+0.590
iso+0.574
Neg logit contributions
 crooked-0.724
 mutated-0.707
 negligent-0.706
 tablet-0.685
 toe-0.665
Token the
Token ID262
Feature activation+1.69e-01

Pos logit contributions
lihood+0.610
VERTISEMENT+0.527
+0.507
archives+0.478
iso+0.473
Neg logit contributions
 crooked-0.490
 mutated-0.480
 negligent-0.479
 tablet-0.462
 distracted-0.446
Token mouse
Token ID10211
Feature activation+2.07e-01

Pos logit contributions
lihood+0.618
VERTISEMENT+0.547
+0.533
archives+0.504
iso+0.501
Neg logit contributions
 crooked-0.306
 mutated-0.297
 negligent-0.296
 tablet-0.284
 distracted-0.267
TokenSp
Token ID4561
Feature activation-8.07e-02

Pos logit contributions
lihood+0.659
VERTISEMENT+0.564
+0.550
archives+0.514
iso+0.507
Neg logit contributions
 crooked-0.477
 mutated-0.470
 negligent-0.464
 tablet-0.455
 toe-0.435
Tokenaz
Token ID1031
Feature activation-9.37e-01

Pos logit contributions
 crooked+0.196
 mutated+0.194
 negligent+0.189
 tablet+0.188
 toe+0.181
Neg logit contributions
lihood-0.243
VERTISEMENT-0.208
-0.202
archives-0.187
AGE-0.182
Token team
Token ID1074
Feature activation+2.85e-02

Pos logit contributions
 mutated+2.109
 crooked+2.096
 negligent+2.050
 tablet+2.037
 toe+1.935
Neg logit contributions
lihood-2.948
VERTISEMENT-2.560
-2.470
archives-2.314
iso-2.261
Token first
Token ID717
Feature activation+5.14e-02

Pos logit contributions
lihood+0.095
VERTISEMENT+0.083
+0.081
archives+0.076
iso+0.076
Neg logit contributions
 crooked-0.060
 mutated-0.060
 negligent-0.058
 tablet-0.057
 distracted-0.054
Token qualified
Token ID10617
Feature activation+5.28e-01

Pos logit contributions
lihood+0.184
VERTISEMENT+0.162
+0.159
archives+0.150
iso+0.149
Neg logit contributions
 crooked-0.097
 mutated-0.095
 negligent-0.094
 tablet-0.090
 distracted-0.086
Token for
Token ID329
Feature activation+3.10e-01

Pos logit contributions
lihood+1.516
VERTISEMENT+1.286
+1.247
archives+1.164
iso+1.148
Neg logit contributions
 crooked-1.367
 mutated-1.336
 negligent-1.334
 tablet-1.299
 toe-1.250
Token MLG
Token ID48144
Feature activation+4.82e-01

Pos logit contributions
lihood+0.960
VERTISEMENT+0.824
+0.799
archives+0.746
iso+0.743
Neg logit contributions
 crooked-0.734
 mutated-0.717
 negligent-0.716
 tablet-0.697
 toe-0.664
Token Civil
Token ID7511
Feature activation-1.52e-01

Pos logit contributions
lihood+1.247
VERTISEMENT+1.060
+1.033
archives+0.970
iso+0.967
Neg logit contributions
 crooked-0.930
 mutated-0.907
 negligent-0.905
 tablet-0.886
 distracted-0.839
Token War
Token ID1810
Feature activation-4.92e-02

Pos logit contributions
 crooked+0.245
 negligent+0.238
 mutated+0.236
 tablet+0.231
vered+0.215
Neg logit contributions
lihood-0.583
VERTISEMENT-0.513
-0.498
archives-0.473
iso-0.468
Token was
Token ID373
Feature activation-2.35e-01

Pos logit contributions
 crooked+0.107
 mutated+0.106
 negligent+0.104
 tablet+0.101
 distracted+0.097
Neg logit contributions
lihood-0.164
VERTISEMENT-0.140
-0.138
iso-0.129
archives-0.128
Token a
Token ID257
Feature activation-9.34e-02

Pos logit contributions
 crooked+0.519
 mutated+0.507
 negligent+0.507
 tablet+0.483
 distracted+0.468
Neg logit contributions
lihood-0.780
VERTISEMENT-0.670
-0.654
iso-0.618
archives-0.609
Token dark
Token ID3223
Feature activation+2.21e-01

Pos logit contributions
 crooked+0.198
 negligent+0.193
 mutated+0.192
 tablet+0.185
 distracted+0.177
Neg logit contributions
lihood-0.320
VERTISEMENT-0.275
-0.270
iso-0.256
archives-0.254
Token chapter
Token ID6843
Feature activation-9.37e-01

Pos logit contributions
lihood+0.651
VERTISEMENT+0.548
+0.533
archives+0.498
iso+0.496
Neg logit contributions
 crooked-0.567
 mutated-0.553
 negligent-0.549
 tablet-0.545
 distracted-0.516
Token in
Token ID287
Feature activation+2.58e-02

Pos logit contributions
 crooked+2.377
 mutated+2.375
 negligent+2.329
 tablet+2.308
 distracted+2.201
Neg logit contributions
lihood-2.710
VERTISEMENT-2.355
-2.252
archives-2.090
iso-2.081
Token America
Token ID2253
Feature activation+2.60e-01

Pos logit contributions
lihood+0.083
VERTISEMENT+0.071
+0.069
iso+0.065
archives+0.064
Neg logit contributions
 crooked-0.060
 mutated-0.057
 negligent-0.057
 tablet-0.057
 distracted-0.053
Token's
Token ID338
Feature activation-3.63e-02

Pos logit contributions
lihood+0.647
VERTISEMENT+0.522
+0.502
iso+0.460
archives+0.456
Neg logit contributions
 crooked-0.786
 mutated-0.773
 negligent-0.769
 tablet-0.755
 distracted-0.727
Token history
Token ID2106
Feature activation+2.00e-01

Pos logit contributions
 crooked+0.103
 negligent+0.101
 mutated+0.100
 tablet+0.099
 distracted+0.094
Neg logit contributions
lihood-0.098
VERTISEMENT-0.081
-0.078
iso-0.073
archives-0.072
Token.
Token ID13
Feature activation-1.06e-01

Pos logit contributions
lihood+0.575
VERTISEMENT+0.485
+0.470
iso+0.435
archives+0.434
Neg logit contributions
 crooked-0.525
 mutated-0.518
 negligent-0.513
 tablet-0.503
 distracted-0.483
Token a
Token ID257
Feature activation+4.22e-02

Pos logit contributions
lihood+0.350
VERTISEMENT+0.299
+0.293
iso+0.275
archives+0.275
Neg logit contributions
 crooked-0.230
 mutated-0.226
 negligent-0.223
 tablet-0.216
 distracted-0.207
Token murder
Token ID5123
Feature activation+2.50e-01

Pos logit contributions
lihood+0.136
VERTISEMENT+0.116
+0.114
archives+0.107
iso+0.107
Neg logit contributions
 crooked-0.096
 mutated-0.094
 negligent-0.093
 tablet-0.091
 toe-0.086
Token mystery
Token ID10715
Feature activation+1.45e-01

Pos logit contributions
lihood+0.670
VERTISEMENT+0.554
+0.545
archives+0.493
iso+0.493
Neg logit contributions
 crooked-0.707
 mutated-0.692
 negligent-0.692
 tablet-0.680
 toe-0.649
Token with
Token ID351
Feature activation-1.17e-01

Pos logit contributions
lihood+0.415
VERTISEMENT+0.347
+0.341
archives+0.313
iso+0.312
Neg logit contributions
 crooked-0.382
 mutated-0.379
 negligent-0.374
 tablet-0.366
 distracted-0.350
Token a
Token ID257
Feature activation-7.42e-02

Pos logit contributions
 crooked+0.261
 mutated+0.253
 negligent+0.249
 tablet+0.239
 toe+0.231
Neg logit contributions
lihood-0.384
VERTISEMENT-0.328
-0.321
archives-0.305
iso-0.304
Token minor
Token ID4159
Feature activation-9.12e-01

Pos logit contributions
 crooked+0.164
 mutated+0.159
 negligent+0.158
 tablet+0.153
 toe+0.146
Neg logit contributions
lihood-0.245
VERTISEMENT-0.210
-0.207
archives-0.196
iso-0.193
Token romantic
Token ID14348
Feature activation+1.20e-01

Pos logit contributions
 crooked+2.192
 mutated+2.160
 negligent+2.123
 tablet+2.051
 toe+2.007
Neg logit contributions
lihood-2.785
VERTISEMENT-2.391
-2.348
archives-2.249
OTOS-2.152
Token sub
Token ID850
Feature activation+3.39e-01

Pos logit contributions
lihood+0.369
VERTISEMENT+0.314
+0.311
archives+0.292
iso+0.283
Neg logit contributions
 crooked-0.285
 mutated-0.279
 negligent-0.278
 tablet-0.270
 toe-0.258
Tokenplot
Token ID29487
Feature activation-4.51e-02

Pos logit contributions
lihood+0.870
VERTISEMENT+0.720
+0.703
archives+0.641
AGE+0.625
Neg logit contributions
 crooked-0.974
 mutated-0.962
 negligent-0.961
 tablet-0.942
 toe-0.907
Token called
Token ID1444
Feature activation+8.21e-02

Pos logit contributions
 crooked+0.119
 mutated+0.118
 negligent+0.116
 tablet+0.113
 distracted+0.109
Neg logit contributions
lihood-0.127
VERTISEMENT-0.107
-0.105
archives-0.098
iso-0.096
Token An
Token ID1052
Feature activation-1.10e-01

Pos logit contributions
lihood+0.239
VERTISEMENT+0.202
+0.198
archives+0.184
iso+0.184
Neg logit contributions
 crooked-0.209
 negligent-0.204
 mutated-0.203
 tablet-0.196
 distracted-0.190
Token Manson
Token ID41645
Feature activation-2.82e-01

Pos logit contributions
 crooked+0.049
 negligent+0.047
 mutated+0.047
 tablet+0.046
 Went+0.044
Neg logit contributions
lihood-0.065
VERTISEMENT-0.056
-0.055
archives-0.051
iso-0.050
Token,
Token ID11
Feature activation+2.00e-01

Pos logit contributions
 crooked+0.724
 mutated+0.712
 negligent+0.709
 tablet+0.689
 distracted+0.666
Neg logit contributions
lihood-0.810
VERTISEMENT-0.685
-0.670
archives-0.617
iso-0.605
Token representing
Token ID10200
Feature activation-1.20e-01

Pos logit contributions
lihood+0.644
VERTISEMENT+0.558
+0.542
archives+0.506
iso+0.505
Neg logit contributions
 crooked-0.452
 negligent-0.439
 mutated-0.438
 tablet-0.426
 distracted-0.407
Token the
Token ID262
Feature activation+5.71e-02

Pos logit contributions
 crooked+0.278
 negligent+0.271
 mutated+0.270
 tablet+0.261
 distracted+0.250
Neg logit contributions
lihood-0.378
VERTISEMENT-0.326
-0.316
archives-0.296
iso-0.296
Token local
Token ID1957
Feature activation+3.07e-01

Pos logit contributions
lihood+0.181
VERTISEMENT+0.157
+0.152
archives+0.141
iso+0.141
Neg logit contributions
 crooked-0.132
 negligent-0.129
 mutated-0.129
 tablet-0.125
 distracted-0.120
Token chapter
Token ID6843
Feature activation-8.99e-01

Pos logit contributions
lihood+0.933
VERTISEMENT+0.799
+0.773
archives+0.718
iso+0.716
Neg logit contributions
 crooked-0.753
 negligent-0.733
 mutated-0.730
 tablet-0.719
 distracted-0.684
Token of
Token ID286
Feature activation+4.70e-02

Pos logit contributions
 crooked+2.158
 negligent+2.125
 mutated+2.121
 tablet+2.070
 distracted+1.989
Neg logit contributions
lihood-2.672
VERTISEMENT-2.317
-2.214
archives-2.070
iso-2.021
Token the
Token ID262
Feature activation+9.93e-02

Pos logit contributions
lihood+0.146
VERTISEMENT+0.126
+0.123
iso+0.115
archives+0.114
Neg logit contributions
 crooked-0.110
 mutated-0.107
 negligent-0.107
 tablet-0.103
 Went-0.100
Token NAACP
Token ID39829
Feature activation-4.98e-02

Pos logit contributions
lihood+0.313
VERTISEMENT+0.269
+0.261
iso+0.244
archives+0.243
Neg logit contributions
 crooked-0.231
 mutated-0.226
 negligent-0.225
 tablet-0.218
 distracted-0.209
Token,
Token ID11
Feature activation+4.56e-02

Pos logit contributions
 crooked+0.131
 negligent+0.128
 mutated+0.128
 tablet+0.126
 distracted+0.121
Neg logit contributions
lihood-0.139
VERTISEMENT-0.119
-0.113
archives-0.105
iso-0.104
Token has
Token ID468
Feature activation+3.76e-02

Pos logit contributions
lihood+0.151
VERTISEMENT+0.131
+0.128
archives+0.120
iso+0.119
Neg logit contributions
 crooked-0.099
 mutated-0.096
 negligent-0.096
 tablet-0.093
 distracted-0.089
Token McDonnell
Token ID39441
Feature activation-3.70e-01

Pos logit contributions
 crooked+0.085
 negligent+0.083
 mutated+0.081
 tablet+0.079
 distracted+0.076
Neg logit contributions
lihood-0.123
VERTISEMENT-0.107
-0.104
archives-0.098
iso-0.097
Token in
Token ID287
Feature activation+1.84e-01

Pos logit contributions
 crooked+0.916
 negligent+0.904
 mutated+0.887
 tablet+0.869
 distracted+0.849
Neg logit contributions
lihood-1.107
VERTISEMENT-0.951
-0.930
archives-0.866
iso-0.845
Token return
Token ID1441
Feature activation-5.31e-01

Pos logit contributions
lihood+0.575
VERTISEMENT+0.497
+0.485
archives+0.451
iso+0.449
Neg logit contributions
 crooked-0.429
 negligent-0.420
 mutated-0.415
 tablet-0.406
 distracted-0.388
Token were
Token ID547
Feature activation+8.52e-02

Pos logit contributions
 crooked+1.192
 negligent+1.173
 mutated+1.157
 tablet+1.118
 distracted+1.107
Neg logit contributions
lihood-1.689
VERTISEMENT-1.470
-1.456
archives-1.371
iso-1.344
Token relatively
Token ID5365
Feature activation-2.30e-01

Pos logit contributions
lihood+0.283
VERTISEMENT+0.245
+0.241
archives+0.226
iso+0.226
Neg logit contributions
 crooked-0.186
 negligent-0.184
 mutated-0.178
 tablet-0.173
 distracted-0.166
Token minor
Token ID4159
Feature activation-8.98e-01

Pos logit contributions
 crooked+0.578
 negligent+0.575
 mutated+0.553
 tablet+0.541
 distracted+0.527
Neg logit contributions
lihood-0.689
VERTISEMENT-0.596
-0.584
archives-0.539
iso-0.538
Token,
Token ID11
Feature activation-1.67e-01

Pos logit contributions
 negligent+2.189
 crooked+2.183
 mutated+2.107
 tablet+2.072
 distracted+2.022
Neg logit contributions
lihood-2.710
VERTISEMENT-2.297
-2.255
iso-2.119
archives-2.076
Token such
Token ID884
Feature activation+1.98e-01

Pos logit contributions
 crooked+0.407
 negligent+0.404
 mutated+0.395
 tablet+0.386
 distracted+0.373
Neg logit contributions
lihood-0.512
VERTISEMENT-0.440
-0.428
archives-0.398
iso-0.396
Token as
Token ID355
Feature activation-1.93e-01

Pos logit contributions
lihood+0.543
VERTISEMENT+0.453
+0.443
archives+0.408
iso+0.405
Neg logit contributions
 crooked-0.533
 mutated-0.523
 negligent-0.522
 tablet-0.508
vered-0.488
Token introducing
Token ID16118
Feature activation-1.41e-01

Pos logit contributions
 crooked+0.422
 negligent+0.416
 mutated+0.401
 tablet+0.397
 distracted+0.382
Neg logit contributions
lihood-0.627
VERTISEMENT-0.551
-0.545
iso-0.509
archives-0.507
Token Williams
Token ID6484
Feature activation-3.32e-01

Pos logit contributions
 crooked+0.329
 negligent+0.317
 mutated+0.313
 tablet+0.309
 distracted+0.297
Neg logit contributions
lihood-0.449
VERTISEMENT-0.388
-0.380
archives-0.354
iso-0.353
Token possession
Token ID7797
Feature activation-4.13e-01

Pos logit contributions
 crooked+0.169
 negligent+0.168
 mutated+0.163
 tablet+0.162
 distracted+0.155
Neg logit contributions
lihood-0.213
VERTISEMENT-0.182
-0.177
iso-0.163
archives-0.163
Token and
Token ID290
Feature activation+7.83e-02

Pos logit contributions
 crooked+1.070
 negligent+1.067
 mutated+1.046
 tablet+1.015
 distracted+0.994
Neg logit contributions
lihood-1.193
VERTISEMENT-1.015
-0.981
iso-0.903
archives-0.902
Token possession
Token ID7797
Feature activation-4.09e-01

Pos logit contributions
lihood+0.257
VERTISEMENT+0.223
+0.220
archives+0.205
iso+0.204
Neg logit contributions
 crooked-0.171
 negligent-0.169
 mutated-0.166
 tablet-0.161
 distracted-0.155
Token of
Token ID286
Feature activation+2.68e-02

Pos logit contributions
 crooked+1.016
 negligent+1.014
 mutated+0.995
 tablet+0.963
 distracted+0.940
Neg logit contributions
lihood-1.219
VERTISEMENT-1.040
-1.010
archives-0.935
iso-0.933
Token drug
Token ID2563
Feature activation-5.31e-02

Pos logit contributions
lihood+0.090
VERTISEMENT+0.079
+0.077
archives+0.072
iso+0.072
Neg logit contributions
 crooked-0.058
 negligent-0.057
 mutated-0.056
 tablet-0.055
 distracted-0.053
Token parap
Token ID41406
Feature activation-8.97e-01

Pos logit contributions
 crooked+0.139
 negligent+0.137
 tablet+0.137
 mutated+0.137
 distracted+0.129
Neg logit contributions
lihood-0.155
VERTISEMENT-0.131
-0.124
archives-0.116
AGE-0.114
Tokenhern
Token ID2881
Feature activation+1.41e-01

Pos logit contributions
 crooked+1.897
 tablet+1.879
 negligent+1.878
 mutated+1.870
vered+1.785
Neg logit contributions
lihood-2.766
VERTISEMENT-2.549
-2.451
archives-2.193
OTOS-2.160
Tokenalia
Token ID9752
Feature activation-2.79e-01

Pos logit contributions
lihood+0.379
VERTISEMENT+0.318
+0.307
archives+0.278
iso+0.276
Neg logit contributions
 crooked-0.387
 negligent-0.381
 mutated-0.381
 tablet-0.374
vered-0.360
Token.
Token ID13
Feature activation+1.82e-01

Pos logit contributions
 crooked+0.711
 negligent+0.709
 mutated+0.693
 tablet+0.678
 distracted+0.658
Neg logit contributions
lihood-0.816
VERTISEMENT-0.700
-0.681
archives-0.625
iso-0.623
Token It
Token ID632
Feature activation+7.44e-03

Pos logit contributions
lihood+0.524
VERTISEMENT+0.441
+0.430
archives+0.398
iso+0.392
Neg logit contributions
 crooked-0.465
 negligent-0.458
 mutated-0.454
 tablet-0.442
 Went-0.426
Token
Token ID447
Feature activation-1.43e-01

Pos logit contributions
lihood+0.024
VERTISEMENT+0.021
+0.020
archives+0.019
AGE+0.019
Neg logit contributions
 crooked-0.017
 mutated-0.017
 negligent-0.016
 tablet-0.016
 distracted-0.015
Token,
Token ID11
Feature activation-1.04e-01

Pos logit contributions
 crooked+0.909
 mutated+0.897
 negligent+0.897
 tablet+0.876
 distracted+0.841
Neg logit contributions
lihood-0.863
VERTISEMENT-0.721
-0.700
archives-0.645
iso-0.638
Token Go
Token ID1514
Feature activation-6.41e-01

Pos logit contributions
 crooked+0.237
 negligent+0.235
 mutated+0.234
 tablet+0.228
 distracted+0.217
Neg logit contributions
lihood-0.330
VERTISEMENT-0.288
-0.279
archives-0.261
iso-0.259
Tokence
Token ID344
Feature activation+7.55e-02

Pos logit contributions
 crooked+1.523
 negligent+1.501
 mutated+1.492
 tablet+1.465
vered+1.420
Neg logit contributions
lihood-1.925
VERTISEMENT-1.680
-1.613
archives-1.479
AGE-1.460
Token's
Token ID338
Feature activation-1.51e-01

Pos logit contributions
lihood+0.228
VERTISEMENT+0.195
+0.190
archives+0.176
iso+0.175
Neg logit contributions
 crooked-0.183
 mutated-0.182
 negligent-0.181
 tablet-0.177
 distracted-0.169
Token imminent
Token ID21161
Feature activation-1.03e-01

Pos logit contributions
 crooked+0.340
 mutated+0.334
 negligent+0.334
 tablet+0.324
 distracted+0.307
Neg logit contributions
lihood-0.485
VERTISEMENT-0.423
-0.412
archives-0.389
iso-0.382
Token descent
Token ID18598
Feature activation-8.91e-01

Pos logit contributions
 crooked+0.233
 negligent+0.230
 mutated+0.229
 tablet+0.222
 toe+0.212
Neg logit contributions
lihood-0.325
VERTISEMENT-0.288
-0.280
archives-0.264
iso-0.258
Token is
Token ID318
Feature activation-1.61e-01

Pos logit contributions
 crooked+1.928
 mutated+1.906
 negligent+1.893
 tablet+1.865
 distracted+1.777
Neg logit contributions
lihood-2.863
VERTISEMENT-2.549
-2.525
archives-2.357
iso-2.276
Token interesting
Token ID3499
Feature activation-1.79e-01

Pos logit contributions
 crooked+0.353
 negligent+0.350
 mutated+0.346
 tablet+0.336
 distracted+0.321
Neg logit contributions
lihood-0.522
VERTISEMENT-0.462
-0.448
archives-0.420
iso-0.415
Token on
Token ID319
Feature activation-9.21e-02

Pos logit contributions
 crooked+0.420
 mutated+0.414
 negligent+0.412
 tablet+0.405
 distracted+0.387
Neg logit contributions
lihood-0.553
VERTISEMENT-0.476
-0.462
archives-0.431
iso-0.428
Token a
Token ID257
Feature activation+4.12e-02

Pos logit contributions
 crooked+0.196
 negligent+0.193
 mutated+0.193
 tablet+0.190
 distracted+0.179
Neg logit contributions
lihood-0.306
VERTISEMENT-0.266
-0.259
archives-0.242
iso-0.242
Token number
Token ID1271
Feature activation+2.48e-01

Pos logit contributions
lihood+0.127
VERTISEMENT+0.108
+0.106
archives+0.099
iso+0.099
Neg logit contributions
 crooked-0.098
 negligent-0.096
 mutated-0.096
 tablet-0.094
 toe-0.090
Token a
Token ID257
Feature activation-1.61e-01

Pos logit contributions
 crooked+0.141
 negligent+0.136
 mutated+0.136
 tablet+0.133
 toe+0.128
Neg logit contributions
lihood-0.193
VERTISEMENT-0.165
-0.163
archives-0.152
iso-0.150
Token ledge
Token ID35614
Feature activation-1.75e-01

Pos logit contributions
 crooked+0.357
 negligent+0.344
 mutated+0.344
 tablet+0.334
 toe+0.320
Neg logit contributions
lihood-0.529
VERTISEMENT-0.456
-0.451
archives-0.426
iso-0.420
Token or
Token ID393
Feature activation-9.09e-02

Pos logit contributions
 crooked+0.438
 mutated+0.418
 negligent+0.417
 tablet+0.411
 toe+0.395
Neg logit contributions
lihood-0.524
VERTISEMENT-0.444
-0.441
archives-0.411
iso-0.408
Token drop
Token ID4268
Feature activation+1.12e-01

Pos logit contributions
 crooked+0.210
 negligent+0.201
 mutated+0.201
 tablet+0.198
 toe+0.190
Neg logit contributions
lihood-0.289
VERTISEMENT-0.248
-0.247
archives-0.232
iso-0.228
Token-
Token ID12
Feature activation+1.04e-01

Pos logit contributions
lihood+0.340
VERTISEMENT+0.292
+0.287
archives+0.272
iso+0.266
Neg logit contributions
 crooked-0.273
 negligent-0.264
 mutated-0.262
 tablet-0.261
 toe-0.251
Tokenoff
Token ID2364
Feature activation+6.78e-01

Pos logit contributions
lihood+0.305
VERTISEMENT+0.263
+0.255
archives+0.234
iso+0.229
Neg logit contributions
 crooked-0.261
 negligent-0.256
 mutated-0.252
 tablet-0.251
 toe-0.240
Token where
Token ID810
Feature activation+1.89e-01

Pos logit contributions
lihood+2.720
VERTISEMENT+2.415
+2.389
archives+2.272
iso+2.256
Neg logit contributions
 crooked-0.981
 negligent-0.939
 mutated-0.935
 tablet-0.899
 toe-0.826
Token the
Token ID262
Feature activation+3.33e-02

Pos logit contributions
lihood+0.586
VERTISEMENT+0.500
+0.492
archives+0.462
iso+0.459
Neg logit contributions
 crooked-0.451
 negligent-0.438
 mutated-0.438
 tablet-0.422
 toe-0.409
Token only
Token ID691
Feature activation-1.73e-01

Pos logit contributions
lihood+0.104
VERTISEMENT+0.089
+0.087
archives+0.082
iso+0.081
Neg logit contributions
 crooked-0.079
 negligent-0.077
 mutated-0.077
 tablet-0.075
 toe-0.072
Token alternative
Token ID5559
Feature activation-3.08e-02

Pos logit contributions
 crooked+0.408
 negligent+0.394
 mutated+0.392
 tablet+0.381
 toe+0.369
Neg logit contributions
lihood-0.539
VERTISEMENT-0.466
-0.462
archives-0.436
iso-0.427
Token appears
Token ID3568
Feature activation+1.04e-01

Pos logit contributions
 crooked+0.070
 negligent+0.068
 mutated+0.068
 tablet+0.066
 toe+0.064
Neg logit contributions
lihood-0.098
VERTISEMENT-0.084
-0.084
archives-0.079
iso-0.077
Token do
Token ID466
Feature activation+2.87e-01

Pos logit contributions
 crooked+0.308
 negligent+0.303
 mutated+0.301
 tablet+0.291
 distracted+0.279
Neg logit contributions
lihood-0.445
VERTISEMENT-0.384
-0.376
archives-0.354
iso-0.351
Token you
Token ID345
Feature activation+1.27e-01

Pos logit contributions
lihood+0.813
VERTISEMENT+0.680
+0.664
archives+0.618
iso+0.616
Neg logit contributions
 crooked-0.753
 negligent-0.742
 mutated-0.740
 tablet-0.718
 distracted-0.692
Token then
Token ID788
Feature activation+1.19e-01

Pos logit contributions
lihood+0.392
VERTISEMENT+0.339
+0.333
archives+0.309
iso+0.305
Neg logit contributions
 crooked-0.300
 mutated-0.293
 negligent-0.292
 tablet-0.286
 toe-0.276
Token develop
Token ID1205
Feature activation+2.46e-01

Pos logit contributions
lihood+0.376
VERTISEMENT+0.328
+0.321
archives+0.300
iso+0.297
Neg logit contributions
 crooked-0.272
 mutated-0.265
 negligent-0.265
 tablet-0.259
 toe-0.248
Token a
Token ID257
Feature activation+1.63e-01

Pos logit contributions
lihood+0.789
VERTISEMENT+0.677
+0.667
archives+0.625
iso+0.622
Neg logit contributions
 crooked-0.552
 negligent-0.541
 mutated-0.540
 tablet-0.523
 distracted-0.498
Token core
Token ID4755
Feature activation+7.38e-01

Pos logit contributions
lihood+0.509
VERTISEMENT+0.433
+0.427
archives+0.403
iso+0.399
Neg logit contributions
 crooked-0.381
 negligent-0.374
 mutated-0.373
 tablet-0.362
 toe-0.346
Token curriculum
Token ID20583
Feature activation-1.69e-01

Pos logit contributions
lihood+2.497
VERTISEMENT+2.163
+2.127
archives+2.000
iso+1.987
Neg logit contributions
 crooked-1.522
 negligent-1.487
 mutated-1.487
 tablet-1.437
 distracted-1.358
Token?
Token ID30
Feature activation+8.66e-02

Pos logit contributions
 crooked+0.411
 negligent+0.406
 mutated+0.402
 tablet+0.396
 distracted+0.377
Neg logit contributions
lihood-0.513
VERTISEMENT-0.433
-0.427
archives-0.397
iso-0.391
Token
Token ID198
Feature activation+2.85e-01

Pos logit contributions
lihood+0.241
VERTISEMENT+0.201
+0.196
archives+0.182
iso+0.179
Neg logit contributions
 crooked-0.229
 negligent-0.226
 mutated-0.224
 tablet-0.220
 Went-0.211
Token
Token ID198
Feature activation+2.26e-01

Pos logit contributions
lihood+0.737
VERTISEMENT+0.608
+0.593
archives+0.542
iso+0.539
Neg logit contributions
 crooked-0.815
 mutated-0.801
 negligent-0.800
 tablet-0.780
 distracted-0.751
TokenA
Token ID32
Feature activation+1.69e-01

Pos logit contributions
lihood+0.599
VERTISEMENT+0.494
+0.477
archives+0.440
iso+0.436
Neg logit contributions
 crooked-0.629
 negligent-0.618
 mutated-0.616
 tablet-0.602
vered-0.583
Token
Token ID198
Feature activation-5.99e-03

Pos logit contributions
 crooked+0.017
 negligent+0.017
 mutated+0.017
 tablet+0.016
vered+0.016
Neg logit contributions
lihood-0.017
VERTISEMENT-0.014
-0.014
archives-0.012
iso-0.012
TokenAmong
Token ID14311
Feature activation+3.76e-01

Pos logit contributions
 crooked+0.016
 negligent+0.016
 mutated+0.016
 tablet+0.015
vered+0.015
Neg logit contributions
lihood-0.016
VERTISEMENT-0.013
-0.012
iso-0.012
 Ramirez-0.012
Token those
Token ID883
Feature activation+1.63e-01

Pos logit contributions
lihood+1.186
VERTISEMENT+1.026
+0.996
archives+0.937
iso+0.927
Neg logit contributions
 crooked-0.871
 negligent-0.844
 mutated-0.836
 tablet-0.815
 distracted-0.782
Token ens
Token ID3140
Feature activation-1.44e-01

Pos logit contributions
lihood+0.553
VERTISEMENT+0.486
+0.471
archives+0.446
iso+0.442
Neg logit contributions
 crooked-0.339
 negligent-0.330
 mutated-0.328
 tablet-0.315
 distracted-0.303
Tokenn
Token ID77
Feature activation+1.46e-01

Pos logit contributions
 crooked+0.320
 mutated+0.315
 negligent+0.310
 tablet+0.303
vered+0.294
Neg logit contributions
lihood-0.460
VERTISEMENT-0.402
-0.387
archives-0.365
iso-0.362
Tokenared
Token ID1144
Feature activation+7.17e-01

Pos logit contributions
lihood+0.529
VERTISEMENT+0.472
+0.463
archives+0.436
AGE+0.431
Neg logit contributions
 crooked-0.257
 mutated-0.252
 negligent-0.247
 tablet-0.238
vered-0.234
Token by
Token ID416
Feature activation+1.05e-01

Pos logit contributions
lihood+2.125
VERTISEMENT+1.818
+1.760
archives+1.646
iso+1.634
Neg logit contributions
 crooked-1.800
 mutated-1.757
 negligent-1.752
 tablet-1.700
 distracted-1.639
Token the
Token ID262
Feature activation-7.82e-02

Pos logit contributions
lihood+0.334
VERTISEMENT+0.287
+0.279
archives+0.263
iso+0.261
Neg logit contributions
 crooked-0.244
 negligent-0.235
 mutated-0.234
 tablet-0.226
 distracted-0.217
Token wide
Token ID3094
Feature activation+2.96e-01

Pos logit contributions
 crooked+0.182
 negligent+0.175
 mutated+0.174
 tablet+0.169
 distracted+0.162
Neg logit contributions
lihood-0.253
VERTISEMENT-0.218
-0.209
archives-0.199
iso-0.196
Token-
Token ID12
Feature activation-6.10e-02

Pos logit contributions
lihood+0.839
VERTISEMENT+0.707
+0.670
iso+0.632
archives+0.627
Neg logit contributions
 crooked-0.795
 mutated-0.769
 negligent-0.769
 tablet-0.758
 toe-0.728
Tokenranging
Token ID32319
Feature activation+3.07e-01

Pos logit contributions
 crooked+0.170
 mutated+0.165
 negligent+0.164
 tablet+0.162
vered+0.158
Neg logit contributions
lihood-0.166
VERTISEMENT-0.139
-0.130
AGE-0.118
iso-0.118
Token
Token ID447
Feature activation+1.04e-01

Pos logit contributions
lihood+0.384
VERTISEMENT+0.329
+0.316
archives+0.297
iso+0.293
Neg logit contributions
 crooked-0.324
 negligent-0.319
 mutated-0.318
 tablet-0.310
 distracted-0.299
Token
Token ID247
Feature activation-5.51e-02

Pos logit contributions
lihood+0.337
VERTISEMENT+0.289
+0.280
archives+0.264
iso+0.263
Neg logit contributions
 crooked-0.230
 negligent-0.224
 mutated-0.224
 tablet-0.216
vered-0.209
Tokent
Token ID83
Feature activation-7.29e-03

Pos logit contributions
 crooked+0.136
 negligent+0.134
 mutated+0.133
 tablet+0.131
vered+0.128
Neg logit contributions
lihood-0.161
VERTISEMENT-0.139
-0.136
archives-0.126
AGE-0.123
Token find
Token ID1064
Feature activation+8.14e-02

Pos logit contributions
 crooked+0.016
 negligent+0.015
 mutated+0.015
 tablet+0.015
 distracted+0.014
Neg logit contributions
lihood-0.024
VERTISEMENT-0.021
-0.021
archives-0.019
iso-0.019
Token an
Token ID281
Feature activation+3.95e-02

Pos logit contributions
lihood+0.257
VERTISEMENT+0.226
+0.218
archives+0.206
iso+0.202
Neg logit contributions
 crooked-0.188
 negligent-0.183
 mutated-0.181
 tablet-0.178
 distracted-0.170
Token answer
Token ID3280
Feature activation+6.85e-01

Pos logit contributions
lihood+0.139
VERTISEMENT+0.121
+0.120
archives+0.113
iso+0.109
Neg logit contributions
 crooked-0.077
 negligent-0.075
 mutated-0.074
 tablet-0.073
 unfocused-0.068
Token to
Token ID284
Feature activation-9.06e-02

Pos logit contributions
lihood+1.994
VERTISEMENT+1.695
+1.653
archives+1.537
iso+1.513
Neg logit contributions
 crooked-1.752
 negligent-1.714
 mutated-1.709
 tablet-1.667
 distracted-1.601
Token this
Token ID428
Feature activation-1.46e-02

Pos logit contributions
 crooked+0.201
 negligent+0.195
 mutated+0.194
 tablet+0.189
 distracted+0.182
Neg logit contributions
lihood-0.295
VERTISEMENT-0.260
-0.247
archives-0.232
iso-0.231
Token question
Token ID1808
Feature activation-7.75e-02

Pos logit contributions
 crooked+0.037
 negligent+0.035
 mutated+0.035
 tablet+0.035
 distracted+0.033
Neg logit contributions
lihood-0.044
VERTISEMENT-0.037
-0.036
archives-0.034
iso-0.034
Token.
Token ID13
Feature activation-6.95e-02

Pos logit contributions
 crooked+0.194
 negligent+0.189
 mutated+0.188
 tablet+0.184
 distracted+0.178
Neg logit contributions
lihood-0.230
VERTISEMENT-0.197
-0.191
archives-0.179
iso-0.175
Token
Token ID198
Feature activation-8.83e-02

Pos logit contributions
 crooked+0.179
 negligent+0.176
 mutated+0.175
 tablet+0.170
 Went+0.168
Neg logit contributions
lihood-0.197
VERTISEMENT-0.168
-0.162
archives-0.150
iso-0.147
Token on
Token ID319
Feature activation-2.06e-02

Pos logit contributions
lihood+0.400
VERTISEMENT+0.341
+0.326
archives+0.312
iso+0.307
Neg logit contributions
 crooked-0.343
 mutated-0.333
 negligent-0.329
 tablet-0.320
 toe-0.312
Token all
Token ID477
Feature activation-1.77e-02

Pos logit contributions
 crooked+0.050
 mutated+0.048
 negligent+0.048
 tablet+0.047
 toe+0.045
Neg logit contributions
lihood-0.064
VERTISEMENT-0.054
-0.053
archives-0.050
iso-0.049
Token the
Token ID262
Feature activation-1.52e-02

Pos logit contributions
 crooked+0.043
 mutated+0.042
 negligent+0.041
 tablet+0.041
 distracted+0.038
Neg logit contributions
lihood-0.055
VERTISEMENT-0.046
-0.045
archives-0.042
iso-0.042
Token best
Token ID1266
Feature activation-2.07e-01

Pos logit contributions
 crooked+0.036
 mutated+0.035
 negligent+0.035
 tablet+0.034
 toe+0.033
Neg logit contributions
lihood-0.048
VERTISEMENT-0.041
-0.040
archives-0.037
iso-0.037
Token-
Token ID12
Feature activation+2.91e-01

Pos logit contributions
 crooked+0.515
 mutated+0.501
 negligent+0.496
 tablet+0.493
 toe+0.468
Neg logit contributions
lihood-0.633
VERTISEMENT-0.530
-0.512
archives-0.473
iso-0.469
Tokenof
Token ID1659
Feature activation+9.14e-01

Pos logit contributions
lihood+0.999
VERTISEMENT+0.873
+0.843
iso+0.774
archives+0.773
Neg logit contributions
 crooked-0.589
 mutated-0.581
 negligent-0.576
 tablet-0.565
vered-0.538
Token lists
Token ID8341
Feature activation+6.99e-01

Pos logit contributions
lihood+2.959
VERTISEMENT+2.552
+2.499
archives+2.335
iso+2.321
Neg logit contributions
 crooked-2.024
 mutated-1.978
 negligent-1.971
 tablet-1.916
 distracted-1.816
Token,
Token ID11
Feature activation-6.64e-02

Pos logit contributions
lihood+1.911
VERTISEMENT+1.602
+1.561
archives+1.433
iso+1.428
Neg logit contributions
 crooked-1.913
 mutated-1.877
 negligent-1.865
 tablet-1.821
 distracted-1.752
Token even
Token ID772
Feature activation-1.53e-01

Pos logit contributions
 crooked+0.161
 mutated+0.160
 negligent+0.156
 tablet+0.152
 distracted+0.147
Neg logit contributions
lihood-0.204
VERTISEMENT-0.174
-0.169
archives-0.158
iso-0.157
Token influenced
Token ID12824
Feature activation+1.21e-02

Pos logit contributions
 crooked+0.314
 mutated+0.309
 negligent+0.302
 tablet+0.291
 distracted+0.281
Neg logit contributions
lihood-0.525
VERTISEMENT-0.459
-0.447
archives-0.422
iso-0.422
Token fashion
Token ID6977
Feature activation-2.12e-01

Pos logit contributions
lihood+0.038
VERTISEMENT+0.033
+0.032
iso+0.030
archives+0.030
Neg logit contributions
 crooked-0.028
 mutated-0.027
 negligent-0.027
 tablet-0.027
 distracted-0.025
Token shit
Token ID7510
Feature activation+1.56e-01

Pos logit contributions
lihood+0.114
VERTISEMENT+0.098
+0.097
archives+0.090
iso+0.090
Neg logit contributions
 crooked-0.083
 negligent-0.081
 mutated-0.080
 tablet-0.078
 distracted-0.075
Token hits
Token ID7127
Feature activation+1.83e-01

Pos logit contributions
lihood+0.501
VERTISEMENT+0.431
+0.419
archives+0.393
iso+0.390
Neg logit contributions
 crooked-0.355
 mutated-0.347
 negligent-0.342
 tablet-0.334
 distracted-0.319
Token the
Token ID262
Feature activation-4.01e-02

Pos logit contributions
lihood+0.573
VERTISEMENT+0.488
+0.479
archives+0.448
iso+0.448
Neg logit contributions
 crooked-0.433
 mutated-0.422
 negligent-0.420
 tablet-0.407
 toe-0.390
Token fan
Token ID4336
Feature activation+1.71e-01

Pos logit contributions
 crooked+0.079
 negligent+0.078
 mutated+0.077
 tablet+0.075
 toe+0.071
Neg logit contributions
lihood-0.141
VERTISEMENT-0.124
-0.120
iso-0.113
archives-0.113
Token.
Token ID13
Feature activation+3.77e-03

Pos logit contributions
lihood+0.498
VERTISEMENT+0.426
+0.416
archives+0.387
iso+0.384
Neg logit contributions
 crooked-0.434
 negligent-0.424
 mutated-0.423
 tablet-0.410
 distracted-0.396
Token Luckily
Token ID24262
Feature activation+2.92e-01

Pos logit contributions
lihood+0.011
VERTISEMENT+0.009
+0.009
archives+0.008
iso+0.008
Neg logit contributions
 crooked-0.010
 negligent-0.010
 mutated-0.009
 tablet-0.009
 Went-0.009
Token the
Token ID262
Feature activation+3.85e-01

Pos logit contributions
lihood+0.819
VERTISEMENT+0.693
+0.681
archives+0.626
iso+0.623
Neg logit contributions
 crooked-0.779
 negligent-0.769
 mutated-0.762
 tablet-0.740
 distracted-0.722
Token Work
Token ID5521
Feature activation+8.67e-02

Pos logit contributions
lihood+1.209
VERTISEMENT+1.038
+1.018
archives+0.947
iso+0.944
Neg logit contributions
 crooked-0.899
 negligent-0.882
 mutated-0.877
 tablet-0.847
 distracted-0.817
Tokenplace
Token ID5372
Feature activation+2.44e-01

Pos logit contributions
lihood+0.285
+0.249
VERTISEMENT+0.248
archives+0.229
iso+0.225
Neg logit contributions
 crooked-0.182
 negligent-0.180
 mutated-0.177
 tablet-0.173
 distracted-0.167
Token Safety
Token ID11233
Feature activation+2.55e-01

Pos logit contributions
lihood+0.694
VERTISEMENT+0.583
+0.574
archives+0.529
iso+0.524
Neg logit contributions
 crooked-0.630
 negligent-0.627
 mutated-0.622
 tablet-0.603
 distracted-0.587
Token and
Token ID290
Feature activation+6.67e-02

Pos logit contributions
lihood+0.716
VERTISEMENT+0.601
+0.589
archives+0.541
iso+0.536
Neg logit contributions
 crooked-0.677
 negligent-0.670
 mutated-0.667
 tablet-0.652
 distracted-0.628
Token they
Token ID484
Feature activation-8.81e-03

Pos logit contributions
 crooked+0.175
 negligent+0.170
 mutated+0.169
 tablet+0.165
 distracted+0.159
Neg logit contributions
lihood-0.231
VERTISEMENT-0.199
-0.194
archives-0.181
iso-0.179
Token are
Token ID389
Feature activation+7.56e-03

Pos logit contributions
 crooked+0.020
 mutated+0.019
 negligent+0.019
 tablet+0.018
 toe+0.018
Neg logit contributions
lihood-0.028
VERTISEMENT-0.025
-0.024
archives-0.023
iso-0.022
Token connected
Token ID5884
Feature activation-9.73e-02

Pos logit contributions
lihood+0.025
VERTISEMENT+0.021
+0.021
iso+0.020
archives+0.020
Neg logit contributions
 crooked-0.017
 negligent-0.016
 mutated-0.016
 tablet-0.016
 distracted-0.015
Token to
Token ID284
Feature activation+8.61e-02

Pos logit contributions
 crooked+0.246
 negligent+0.240
 mutated+0.240
 tablet+0.234
 toe+0.224
Neg logit contributions
lihood-0.285
VERTISEMENT-0.242
-0.235
iso-0.219
archives-0.219
Token the
Token ID262
Feature activation+2.81e-02

Pos logit contributions
lihood+0.272
VERTISEMENT+0.235
+0.228
archives+0.214
iso+0.214
Neg logit contributions
 crooked-0.200
 negligent-0.194
 mutated-0.193
 tablet-0.187
 distracted-0.180
Token Olympics
Token ID14935
Feature activation+2.15e-01

Pos logit contributions
lihood+0.082
VERTISEMENT+0.070
+0.068
archives+0.063
iso+0.062
Neg logit contributions
 crooked-0.072
 negligent-0.071
 mutated-0.070
 tablet-0.069
 distracted-0.066
Token.
Token ID13
Feature activation+1.08e-01

Pos logit contributions
lihood+0.618
VERTISEMENT+0.524
+0.508
archives+0.469
iso+0.467
Neg logit contributions
 crooked-0.556
 negligent-0.545
 mutated-0.540
 tablet-0.533
 distracted-0.509
Token Even
Token ID3412
Feature activation-1.74e-02

Pos logit contributions
lihood+0.311
VERTISEMENT+0.264
+0.258
archives+0.239
iso+0.234
Neg logit contributions
 crooked-0.278
 negligent-0.270
 mutated-0.269
 tablet-0.263
 Went-0.254
Token pubs
Token ID46972
Feature activation+1.36e-01

Pos logit contributions
 crooked+0.040
 negligent+0.039
 mutated+0.039
 tablet+0.038
 distracted+0.036
Neg logit contributions
lihood-0.055
VERTISEMENT-0.048
-0.047
archives-0.044
iso-0.043
Token can
Token ID460
Feature activation+6.74e-02

Pos logit contributions
lihood+0.415
VERTISEMENT+0.354
+0.345
archives+0.321
iso+0.319
Neg logit contributions
 crooked-0.331
 negligent-0.324
 mutated-0.323
 tablet-0.313
 distracted-0.301
Token't
Token ID470
Feature activation-7.91e-02

Pos logit contributions
lihood+0.213
VERTISEMENT+0.184
+0.180
archives+0.167
iso+0.165
Neg logit contributions
 crooked-0.156
 mutated-0.151
 negligent-0.151
 tablet-0.148
 toe-0.143
Tokenitionally
Token ID8736
Feature activation-2.54e-01

Pos logit contributions
 crooked+1.617
 mutated+1.607
 tablet+1.596
 negligent+1.575
vered+1.539
Neg logit contributions
lihood-1.939
VERTISEMENT-1.679
-1.602
archives-1.473
OTOS-1.463
Token while
Token ID981
Feature activation-6.80e-02

Pos logit contributions
 crooked+0.639
 mutated+0.631
 negligent+0.629
 tablet+0.611
 distracted+0.587
Neg logit contributions
lihood-0.747
VERTISEMENT-0.638
-0.621
archives-0.583
iso-0.575
Token others
Token ID1854
Feature activation-2.42e-01

Pos logit contributions
 crooked+0.147
 negligent+0.144
 mutated+0.143
 tablet+0.139
 distracted+0.133
Neg logit contributions
lihood-0.225
VERTISEMENT-0.196
-0.192
archives-0.181
iso-0.180
Token will
Token ID481
Feature activation-2.56e-02

Pos logit contributions
 crooked+0.437
 mutated+0.425
 negligent+0.423
 tablet+0.410
 toe+0.387
Neg logit contributions
lihood-0.882
VERTISEMENT-0.783
-0.769
archives-0.727
iso-0.723
Token have
Token ID423
Feature activation+6.82e-02

Pos logit contributions
 crooked+0.056
 mutated+0.055
 negligent+0.055
 tablet+0.054
 toe+0.052
Neg logit contributions
lihood-0.083
VERTISEMENT-0.072
-0.071
archives-0.067
iso-0.066
Token conditions
Token ID3403
Feature activation+7.34e-02

Pos logit contributions
lihood+0.206
+0.173
VERTISEMENT+0.173
archives+0.161
iso+0.159
Neg logit contributions
 crooked-0.167
 mutated-0.165
 negligent-0.164
 tablet-0.161
 toe-0.154
Token,
Token ID11
Feature activation-2.87e-02

Pos logit contributions
lihood+0.233
VERTISEMENT+0.199
+0.195
archives+0.184
iso+0.180
Neg logit contributions
 crooked-0.169
 mutated-0.167
 negligent-0.167
 tablet-0.161
 toe-0.154
Token such
Token ID884
Feature activation+3.29e-01

Pos logit contributions
 crooked+0.064
 mutated+0.063
 negligent+0.063
 tablet+0.061
 toe+0.058
Neg logit contributions
lihood-0.093
VERTISEMENT-0.080
-0.079
archives-0.074
iso-0.073
Token as
Token ID355
Feature activation-3.99e-02

Pos logit contributions
lihood+0.908
VERTISEMENT+0.759
+0.742
archives+0.684
iso+0.680
Neg logit contributions
 crooked-0.883
 mutated-0.866
 negligent-0.865
 tablet-0.842
 distracted-0.808
Token going
Token ID1016
Feature activation+3.29e-01

Pos logit contributions
 crooked+0.091
 negligent+0.090
 mutated+0.089
 tablet+0.087
 toe+0.083
Neg logit contributions
lihood-0.125
VERTISEMENT-0.109
-0.109
archives-0.101
iso-0.100
Token through
Token ID832
Feature activation-2.70e-01

Pos logit contributions
lihood+0.994
VERTISEMENT+0.852
+0.834
archives+0.785
iso+0.770
Neg logit contributions
 crooked-0.811
 negligent-0.789
 mutated-0.786
 tablet-0.770
 toe-0.745
Token go
Token ID467
Feature activation-3.85e-01

Pos logit contributions
 crooked+0.451
 mutated+0.445
 negligent+0.439
 tablet+0.432
 toe+0.416
Neg logit contributions
lihood-0.652
VERTISEMENT-0.561
-0.550
archives-0.513
iso-0.507
Token 3
Token ID513
Feature activation-3.57e-01

Pos logit contributions
 crooked+0.877
 mutated+0.860
 negligent+0.842
 tablet+0.837
 toe+0.816
Neg logit contributions
lihood-1.244
VERTISEMENT-1.053
-1.028
archives-0.980
iso-0.965
TokenD
Token ID35
Feature activation-3.24e-01

Pos logit contributions
 crooked+1.001
 negligent+0.973
 mutated+0.972
 tablet+0.967
 toe+0.936
Neg logit contributions
lihood-0.963
VERTISEMENT-0.770
-0.755
iso-0.699
archives-0.695
Token,
Token ID11
Feature activation-7.73e-02

Pos logit contributions
 crooked+0.789
 mutated+0.779
 negligent+0.761
 tablet+0.751
 toe+0.716
Neg logit contributions
lihood-0.988
VERTISEMENT-0.843
-0.825
archives-0.768
iso-0.767
Token housed
Token ID23707
Feature activation-4.90e-02

Pos logit contributions
 crooked+0.202
 mutated+0.199
 negligent+0.196
 tablet+0.193
 toe+0.185
Neg logit contributions
lihood-0.221
VERTISEMENT-0.187
-0.182
archives-0.169
iso-0.167
Token in
Token ID287
Feature activation+1.25e-01

Pos logit contributions
 crooked+0.129
 mutated+0.128
 negligent+0.126
 tablet+0.124
 toe+0.120
Neg logit contributions
lihood-0.138
VERTISEMENT-0.116
-0.113
archives-0.105
iso-0.105
Token what
Token ID644
Feature activation-8.44e-02

Pos logit contributions
lihood+0.411
VERTISEMENT+0.353
+0.346
archives+0.322
iso+0.321
Neg logit contributions
 crooked-0.273
 mutated-0.267
 negligent-0.264
 tablet-0.259
 toe-0.244
Token was
Token ID373
Feature activation-1.59e-01

Pos logit contributions
 crooked+0.190
 mutated+0.187
 negligent+0.183
 tablet+0.181
 distracted+0.170
Neg logit contributions
lihood-0.272
VERTISEMENT-0.236
-0.230
archives-0.215
iso-0.213
Token then
Token ID788
Feature activation-1.92e-01

Pos logit contributions
 crooked+0.314
 mutated+0.309
 negligent+0.305
 tablet+0.296
 distracted+0.278
Neg logit contributions
lihood-0.556
VERTISEMENT-0.486
-0.475
archives-0.448
iso-0.445
Token the
Token ID262
Feature activation+1.01e-01

Pos logit contributions
 crooked+0.422
 mutated+0.414
 negligent+0.408
 tablet+0.401
 distracted+0.375
Neg logit contributions
lihood-0.630
VERTISEMENT-0.543
-0.528
archives-0.496
iso-0.494
Token N
Token ID399
Feature activation-7.32e-02

Pos logit contributions
lihood+0.325
VERTISEMENT+0.280
+0.274
archives+0.256
iso+0.252
Neg logit contributions
 crooked-0.231
 mutated-0.227
 negligent-0.225
 tablet-0.221
 toe-0.208
Token his
Token ID465
Feature activation-1.96e-01

Pos logit contributions
 crooked+0.013
 mutated+0.013
 negligent+0.013
 tablet+0.013
 distracted+0.012
Neg logit contributions
lihood-0.016
VERTISEMENT-0.013
-0.013
iso-0.012
archives-0.012
Tokenses
Token ID8448
Feature activation-2.61e-01

Pos logit contributions
 crooked+0.434
 mutated+0.421
 negligent+0.420
 tablet+0.414
 distracted+0.390
Neg logit contributions
lihood-0.638
VERTISEMENT-0.548
-0.540
archives-0.502
iso-0.500
Token,
Token ID11
Feature activation-2.53e-01

Pos logit contributions
 crooked+0.687
 mutated+0.673
 negligent+0.662
 tablet+0.651
 distracted+0.629
Neg logit contributions
lihood-0.737
VERTISEMENT-0.620
-0.607
archives-0.569
iso-0.562
Token and
Token ID290
Feature activation-6.99e-02

Pos logit contributions
 crooked+0.608
 mutated+0.596
 negligent+0.586
 tablet+0.575
 distracted+0.552
Neg logit contributions
lihood-0.780
VERTISEMENT-0.662
-0.644
archives-0.606
iso-0.599
Token compression
Token ID19794
Feature activation-1.45e-01

Pos logit contributions
 crooked+0.154
 mutated+0.151
 negligent+0.149
 tablet+0.145
 distracted+0.138
Neg logit contributions
lihood-0.229
VERTISEMENT-0.196
-0.192
archives-0.181
iso-0.180
Token that
Token ID326
Feature activation+7.72e-02

Pos logit contributions
 crooked+0.346
 mutated+0.341
 negligent+0.334
 tablet+0.327
 distracted+0.316
Neg logit contributions
lihood-0.450
VERTISEMENT-0.381
-0.377
archives-0.349
iso-0.346
Token characterize
Token ID34404
Feature activation+1.69e-01

Pos logit contributions
lihood+0.242
VERTISEMENT+0.207
+0.203
archives+0.190
iso+0.188
Neg logit contributions
 crooked-0.180
 mutated-0.177
 negligent-0.175
 tablet-0.172
 distracted-0.164
Token such
Token ID884
Feature activation+8.53e-02

Pos logit contributions
lihood+0.534
VERTISEMENT+0.459
+0.449
archives+0.419
iso+0.416
Neg logit contributions
 crooked-0.389
 mutated-0.379
 negligent-0.378
 tablet-0.368
 distracted-0.349
Token sound
Token ID2128
Feature activation-2.68e-01

Pos logit contributions
lihood+0.255
VERTISEMENT+0.217
+0.212
archives+0.195
iso+0.195
Neg logit contributions
 crooked-0.212
 mutated-0.208
 negligent-0.207
 tablet-0.201
 distracted-0.192
Token files
Token ID3696
Feature activation-7.55e-02

Pos logit contributions
 crooked+0.648
 mutated+0.643
 negligent+0.632
 tablet+0.616
 distracted+0.590
Neg logit contributions
lihood-0.797
VERTISEMENT-0.689
-0.670
iso-0.624
archives-0.611
Token are
Token ID389
Feature activation+8.47e-02

Pos logit contributions
 crooked+0.171
 mutated+0.170
 negligent+0.167
 tablet+0.162
 distracted+0.156
Neg logit contributions
lihood-0.239
VERTISEMENT-0.209
-0.205
archives-0.190
iso-0.189
Token and
Token ID290
Feature activation-3.39e-02

Pos logit contributions
 crooked+0.510
 negligent+0.491
 mutated+0.490
 tablet+0.490
 toe+0.467
Neg logit contributions
lihood-0.592
VERTISEMENT-0.509
-0.501
archives-0.465
iso-0.461
Token down
Token ID866
Feature activation-2.66e-01

Pos logit contributions
 crooked+0.073
 negligent+0.070
 mutated+0.070
 tablet+0.069
 toe+0.066
Neg logit contributions
lihood-0.114
VERTISEMENT-0.099
-0.097
archives-0.091
iso-0.090
Token,
Token ID11
Feature activation-2.49e-02

Pos logit contributions
 crooked+0.640
 negligent+0.617
 mutated+0.611
 tablet+0.611
 toe+0.578
Neg logit contributions
lihood-0.804
-0.688
VERTISEMENT-0.688
archives-0.643
iso-0.635
Token and
Token ID290
Feature activation+8.46e-03

Pos logit contributions
 crooked+0.059
 negligent+0.057
 mutated+0.057
 tablet+0.056
 distracted+0.054
Neg logit contributions
lihood-0.078
VERTISEMENT-0.067
-0.065
archives-0.061
iso-0.060
Token you
Token ID345
Feature activation-1.30e-02

Pos logit contributions
lihood+0.028
VERTISEMENT+0.024
+0.024
archives+0.022
iso+0.022
Neg logit contributions
 crooked-0.019
 negligent-0.018
 mutated-0.018
 tablet-0.017
 toe-0.017
Token realize
Token ID6537
Feature activation-3.29e-01

Pos logit contributions
 crooked+0.026
 mutated+0.025
 negligent+0.025
 tablet+0.025
 distracted+0.023
Neg logit contributions
lihood-0.045
VERTISEMENT-0.039
-0.039
archives-0.036
iso-0.036
Token,
Token ID11
Feature activation+6.88e-02

Pos logit contributions
 crooked+0.753
 mutated+0.728
 negligent+0.726
 tablet+0.706
 distracted+0.680
Neg logit contributions
lihood-1.046
VERTISEMENT-0.926
-0.886
archives-0.832
iso-0.821
Token in
Token ID287
Feature activation+1.65e-01

Pos logit contributions
lihood+0.208
VERTISEMENT+0.178
+0.172
archives+0.161
iso+0.158
Neg logit contributions
 crooked-0.172
 mutated-0.167
 negligent-0.166
 tablet-0.162
 distracted-0.156
Token a
Token ID257
Feature activation+6.51e-02

Pos logit contributions
lihood+0.526
VERTISEMENT+0.455
+0.442
archives+0.415
iso+0.410
Neg logit contributions
 crooked-0.380
 mutated-0.369
 negligent-0.369
 tablet-0.360
 distracted-0.343
Token moment
Token ID2589
Feature activation+6.61e-02

Pos logit contributions
lihood+0.214
VERTISEMENT+0.185
+0.180
archives+0.171
iso+0.169
Neg logit contributions
 crooked-0.144
 negligent-0.141
 mutated-0.140
 tablet-0.136
 distracted-0.130
Token of
Token ID286
Feature activation+1.28e-01

Pos logit contributions
lihood+0.185
VERTISEMENT+0.156
+0.151
archives+0.141
iso+0.138
Neg logit contributions
 crooked-0.178
 mutated-0.173
 negligent-0.172
 tablet-0.168
 distracted-0.164
Token to
Token ID284
Feature activation+4.74e-02

Pos logit contributions
lihood+0.105
VERTISEMENT+0.089
+0.086
archives+0.081
iso+0.079
Neg logit contributions
 crooked-0.089
 negligent-0.088
 mutated-0.087
 tablet-0.085
 distracted-0.082
Token the
Token ID262
Feature activation-1.23e-01

Pos logit contributions
lihood+0.153
VERTISEMENT+0.133
+0.129
archives+0.121
iso+0.120
Neg logit contributions
 crooked-0.108
 negligent-0.105
 mutated-0.103
 tablet-0.102
 distracted-0.097
Token Motion
Token ID20843
Feature activation-6.60e-02

Pos logit contributions
 crooked+0.271
 negligent+0.264
 mutated+0.261
 tablet+0.255
 distracted+0.243
Neg logit contributions
lihood-0.409
VERTISEMENT-0.355
-0.344
archives-0.324
iso-0.320
Token Picture
Token ID17741
Feature activation+7.59e-01

Pos logit contributions
 crooked+0.166
 negligent+0.165
 mutated+0.164
 tablet+0.159
vered+0.155
Neg logit contributions
lihood-0.194
VERTISEMENT-0.169
-0.159
archives-0.148
iso-0.146
Token Academy
Token ID8581
Feature activation-3.55e-02

Pos logit contributions
lihood+1.727
VERTISEMENT+1.441
+1.382
iso+1.229
archives+1.217
Neg logit contributions
 crooked-2.395
 negligent-2.357
 mutated-2.344
 tablet-2.319
 distracted-2.225
Token and
Token ID290
Feature activation-1.95e-02

Pos logit contributions
 crooked+0.091
 negligent+0.090
 mutated+0.088
 tablet+0.087
 distracted+0.083
Neg logit contributions
lihood-0.104
VERTISEMENT-0.087
-0.085
archives-0.078
iso-0.078
Token to
Token ID284
Feature activation-2.13e-01

Pos logit contributions
 crooked+0.046
 negligent+0.045
 mutated+0.045
 tablet+0.043
 distracted+0.042
Neg logit contributions
lihood-0.060
VERTISEMENT-0.052
-0.051
archives-0.048
iso-0.047
Token the
Token ID262
Feature activation-2.13e-01

Pos logit contributions
 crooked+0.500
 negligent+0.484
 mutated+0.479
 tablet+0.468
 distracted+0.450
Neg logit contributions
lihood-0.665
VERTISEMENT-0.580
-0.560
archives-0.528
iso-0.520
Token filmmakers
Token ID28303
Feature activation+4.19e-02

Pos logit contributions
 crooked+0.495
 negligent+0.481
 mutated+0.476
 tablet+0.466
 distracted+0.444
Neg logit contributions
lihood-0.680
VERTISEMENT-0.589
-0.572
archives-0.537
iso-0.528
Token behind
Token ID2157
Feature activation-4.08e-01

Pos logit contributions
lihood+0.125
VERTISEMENT+0.108
+0.104
archives+0.097
iso+0.096
Neg logit contributions
 crooked-0.103
 negligent-0.102
 mutated-0.101
 tablet-0.098
 distracted-0.095
Token the
Token ID262
Feature activation+5.58e-02

Pos logit contributions
 crooked+0.944
 negligent+0.915
 mutated+0.909
 tablet+0.875
 distracted+0.842
Neg logit contributions
lihood-1.273
VERTISEMENT-1.110
-1.073
archives-1.017
iso-1.011
Token 7
Token ID767
Feature activation-1.82e-01

Pos logit contributions
lihood+0.262
VERTISEMENT+0.220
+0.216
archives+0.202
iso+0.201
Neg logit contributions
 crooked-0.222
 mutated-0.218
 negligent-0.217
 tablet-0.213
 toe-0.203
Token days
Token ID1528
Feature activation+6.55e-02

Pos logit contributions
 crooked+0.450
 mutated+0.438
 negligent+0.436
 tablet+0.435
 distracted+0.408
Neg logit contributions
lihood-0.551
VERTISEMENT-0.454
-0.447
iso-0.420
archives-0.417
Token,
Token ID11
Feature activation-1.11e-01

Pos logit contributions
lihood+0.189
VERTISEMENT+0.160
+0.155
archives+0.144
iso+0.143
Neg logit contributions
 crooked-0.170
 mutated-0.167
 negligent-0.165
 tablet-0.162
 distracted-0.157
Token implementing
Token ID15427
Feature activation-1.31e-01

Pos logit contributions
 crooked+0.267
 mutated+0.263
 negligent+0.259
 tablet+0.256
 distracted+0.243
Neg logit contributions
lihood-0.345
VERTISEMENT-0.294
-0.289
archives-0.267
iso-0.265
Token methods
Token ID5050
Feature activation+1.15e-01

Pos logit contributions
 crooked+0.297
 mutated+0.291
 negligent+0.288
 tablet+0.282
 distracted+0.269
Neg logit contributions
lihood-0.419
VERTISEMENT-0.362
-0.359
archives-0.334
iso-0.333
Token add
Token ID751
Feature activation+3.06e-02

Pos logit contributions
lihood+0.353
VERTISEMENT+0.302
+0.295
archives+0.276
iso+0.274
Neg logit contributions
 crooked-0.280
 mutated-0.275
 negligent-0.274
 tablet-0.265
 distracted-0.255
TokenPoints
Token ID40710
Feature activation-2.05e-01

Pos logit contributions
lihood+0.087
VERTISEMENT+0.074
+0.073
archives+0.068
iso+0.067
Neg logit contributions
 crooked-0.079
 mutated-0.077
 negligent-0.076
 tablet-0.075
 distracted-0.072
Token(
Token ID7
Feature activation-3.35e-01

Pos logit contributions
 crooked+0.559
 mutated+0.547
 negligent+0.543
 tablet+0.532
 distracted+0.510
Neg logit contributions
lihood-0.548
VERTISEMENT-0.460
-0.454
archives-0.420
iso-0.418
Tokenuser
Token ID7220
Feature activation-1.92e-01

Pos logit contributions
 crooked+0.829
 mutated+0.821
 negligent+0.807
 tablet+0.801
 distracted+0.759
Neg logit contributions
lihood-0.961
VERTISEMENT-0.819
-0.799
archives-0.734
iso-0.726
Token,
Token ID11
Feature activation-4.78e-02

Pos logit contributions
 crooked+0.499
 mutated+0.493
 negligent+0.487
 tablet+0.481
 distracted+0.462
Neg logit contributions
lihood-0.532
VERTISEMENT-0.446
-0.443
archives-0.409
iso-0.407
Token points
Token ID2173
Feature activation-5.82e-02

Pos logit contributions
 crooked+0.112
 mutated+0.111
 negligent+0.109
 tablet+0.108
 distracted+0.102
Neg logit contributions
lihood-0.148
VERTISEMENT-0.126
-0.125
archives-0.115
iso-0.113
Token to
Token ID284
Feature activation-3.73e-02

Pos logit contributions
lihood+0.760
VERTISEMENT+0.631
+0.618
archives+0.568
iso+0.566
Neg logit contributions
 crooked-0.759
 mutated-0.746
 negligent-0.745
 tablet-0.725
vered-0.696
Token be
Token ID307
Feature activation-9.31e-02

Pos logit contributions
 crooked+0.086
 mutated+0.083
 negligent+0.083
 tablet+0.081
 toe+0.079
Neg logit contributions
lihood-0.119
VERTISEMENT-0.104
-0.101
archives-0.094
iso-0.093
Token more
Token ID517
Feature activation-1.10e-01

Pos logit contributions
 crooked+0.190
 negligent+0.185
 mutated+0.182
 tablet+0.175
 distracted+0.170
Neg logit contributions
lihood-0.323
VERTISEMENT-0.283
-0.277
iso-0.260
archives-0.259
Token gracious
Token ID43210
Feature activation-3.68e-01

Pos logit contributions
 crooked+0.219
 negligent+0.212
 mutated+0.208
 tablet+0.202
 toe+0.195
Neg logit contributions
lihood-0.390
VERTISEMENT-0.345
-0.338
iso-0.318
archives-0.316
Token than
Token ID621
Feature activation+3.26e-01

Pos logit contributions
 crooked+0.907
 negligent+0.883
 mutated+0.874
 tablet+0.849
 distracted+0.827
Neg logit contributions
lihood-1.096
VERTISEMENT-0.953
-0.930
archives-0.868
iso-0.853
Token his
Token ID465
Feature activation-8.57e-02

Pos logit contributions
lihood+0.956
VERTISEMENT+0.827
+0.800
iso+0.746
archives+0.744
Neg logit contributions
 crooked-0.832
 negligent-0.811
 mutated-0.803
 tablet-0.777
 distracted-0.754
Token legion
Token ID41453
Feature activation-7.05e-02

Pos logit contributions
 crooked+0.201
 negligent+0.192
 mutated+0.191
 tablet+0.186
 toe+0.178
Neg logit contributions
lihood-0.272
VERTISEMENT-0.238
-0.229
iso-0.218
archives-0.216
Token of
Token ID286
Feature activation-3.81e-01

Pos logit contributions
 crooked+0.189
 mutated+0.185
 negligent+0.185
 tablet+0.180
 distracted+0.173
Neg logit contributions
lihood-0.195
VERTISEMENT-0.163
-0.159
archives-0.147
iso-0.146
Token Trump
Token ID1301
Feature activation+2.76e-01

Pos logit contributions
 crooked+0.801
 negligent+0.762
 mutated+0.762
 tablet+0.732
 toe+0.706
Neg logit contributions
lihood-1.308
VERTISEMENT-1.150
-1.112
iso-1.062
archives-1.053
Tokenkins
Token ID5331
Feature activation-3.80e-01

Pos logit contributions
lihood+0.978
VERTISEMENT+0.855
+0.828
iso+0.781
archives+0.779
Neg logit contributions
 crooked-0.542
 mutated-0.522
 negligent-0.519
 tablet-0.507
 toe-0.484
Token."
Token ID526
Feature activation-7.19e-02

Pos logit contributions
 crooked+0.897
 mutated+0.870
 negligent+0.870
 tablet+0.840
 distracted+0.823
Neg logit contributions
lihood-1.169
VERTISEMENT-1.026
-0.997
archives-0.926
iso-0.920
Token the
Token ID262
Feature activation+1.34e-01

Pos logit contributions
lihood+0.318
VERTISEMENT+0.273
+0.268
iso+0.250
archives+0.246
Neg logit contributions
 crooked-0.232
 mutated-0.228
 negligent-0.224
 tablet-0.219
 toe-0.209
Token fictional
Token ID19812
Feature activation+1.54e-01

Pos logit contributions
lihood+0.431
VERTISEMENT+0.370
+0.365
iso+0.337
archives+0.336
Neg logit contributions
 crooked-0.307
 mutated-0.303
 negligent-0.297
 tablet-0.290
 toe-0.276
Token book
Token ID1492
Feature activation-2.92e-03

Pos logit contributions
lihood+0.478
VERTISEMENT+0.412
+0.403
iso+0.373
archives+0.372
Neg logit contributions
 crooked-0.362
 mutated-0.359
 negligent-0.349
 tablet-0.343
 toe-0.326
Token mentioned
Token ID4750
Feature activation+1.15e-01

Pos logit contributions
 crooked+0.007
 mutated+0.007
 negligent+0.007
 tablet+0.007
vered+0.007
Neg logit contributions
lihood-0.008
VERTISEMENT-0.007
-0.007
iso-0.007
archives-0.006
Token in
Token ID287
Feature activation+5.47e-02

Pos logit contributions
lihood+0.324
VERTISEMENT+0.275
+0.266
archives+0.247
iso+0.247
Neg logit contributions
 crooked-0.305
 mutated-0.300
 negligent-0.296
 tablet-0.290
 distracted-0.278
Token the
Token ID262
Feature activation-1.55e-02

Pos logit contributions
lihood+0.170
VERTISEMENT+0.146
+0.142
iso+0.132
archives+0.131
Neg logit contributions
 crooked-0.128
 mutated-0.126
 negligent-0.124
 tablet-0.122
vered-0.117
Token 1984
Token ID12844
Feature activation+2.27e-01

Pos logit contributions
 crooked+0.036
 mutated+0.036
 negligent+0.035
 tablet+0.035
 toe+0.033
Neg logit contributions
lihood-0.049
VERTISEMENT-0.042
-0.041
archives-0.038
iso-0.037
Token movie
Token ID3807
Feature activation+8.00e-03

Pos logit contributions
lihood+0.680
VERTISEMENT+0.578
+0.565
archives+0.523
iso+0.518
Neg logit contributions
 crooked-0.560
 mutated-0.553
 negligent-0.547
 tablet-0.535
 distracted-0.509
Token.
Token ID13
Feature activation+1.27e-02

Pos logit contributions
lihood+0.023
VERTISEMENT+0.020
+0.019
archives+0.018
iso+0.018
Neg logit contributions
 crooked-0.020
 mutated-0.020
 negligent-0.020
 tablet-0.019
vered-0.019
Token From
Token ID3574
Feature activation-1.28e-01

Pos logit contributions
lihood+0.037
VERTISEMENT+0.031
+0.030
archives+0.028
iso+0.028
Neg logit contributions
 crooked-0.032
 mutated-0.032
 negligent-0.032
 tablet-0.031
 Went-0.030
Token the
Token ID262
Feature activation+1.49e-01

Pos logit contributions
 crooked+0.292
 mutated+0.287
 negligent+0.282
 tablet+0.275
 toe+0.262
Neg logit contributions
lihood-0.406
VERTISEMENT-0.351
-0.346
iso-0.321
archives-0.320
Token,
Token ID11
Feature activation-4.33e-02

Pos logit contributions
lihood+0.560
VERTISEMENT+0.469
+0.457
archives+0.422
iso+0.419
Neg logit contributions
 crooked-0.512
 mutated-0.503
 negligent-0.501
 tablet-0.491
vered-0.469
Token it
Token ID340
Feature activation-3.80e-02

Pos logit contributions
 crooked+0.105
 negligent+0.103
 mutated+0.103
 tablet+0.100
 distracted+0.096
Neg logit contributions
lihood-0.132
VERTISEMENT-0.113
-0.109
archives-0.102
iso-0.101
Token added
Token ID2087
Feature activation+3.17e-01

Pos logit contributions
 crooked+0.070
 mutated+0.069
 negligent+0.068
 tablet+0.066
 distracted+0.062
Neg logit contributions
lihood-0.138
VERTISEMENT-0.121
-0.118
archives-0.111
iso-0.110
Token.-
Token ID7874
Feature activation-3.70e-01

Pos logit contributions
lihood+0.871
VERTISEMENT+0.726
+0.710
archives+0.654
iso+0.650
Neg logit contributions
 crooked-0.856
 mutated-0.840
 negligent-0.839
 tablet-0.817
 distracted-0.785
TokenTrade
Token ID35965
Feature activation+7.29e-02

Pos logit contributions
 crooked+0.945
 negligent+0.928
 mutated+0.922
 tablet+0.906
vered+0.863
Neg logit contributions
lihood-1.067
VERTISEMENT-0.893
-0.874
archives-0.801
iso-0.800
TokenArab
Token ID31602
Feature activation-5.88e-01

Pos logit contributions
lihood+0.201
VERTISEMENT+0.168
+0.163
iso+0.149
archives+0.149
Neg logit contributions
 crooked-0.198
 negligent-0.193
 mutated-0.193
 tablet-0.189
 toe-0.181
Tokenia
Token ID544
Feature activation-1.34e-01

Pos logit contributions
 crooked+1.436
 tablet+1.399
 negligent+1.398
 mutated+1.398
 distracted+1.332
Neg logit contributions
lihood-1.738
VERTISEMENT-1.461
-1.433
AGE-1.294
archives-1.293
Token News
Token ID3000
Feature activation-9.40e-02

Pos logit contributions
 crooked+0.340
 negligent+0.332
 mutated+0.331
 tablet+0.328
 distracted+0.311
Neg logit contributions
lihood-0.389
VERTISEMENT-0.329
-0.320
iso-0.296
archives-0.295
Token Service
Token ID4809
Feature activation-2.63e-01

Pos logit contributions
 crooked+0.242
 mutated+0.238
 negligent+0.237
 tablet+0.234
 distracted+0.223
Neg logit contributions
lihood-0.267
VERTISEMENT-0.226
-0.219
archives-0.203
iso-0.202
Token<|endoftext|>
Token ID50256
Feature activation+7.14e-02

Pos logit contributions
 crooked+0.671
 mutated+0.657
 negligent+0.657
 tablet+0.647
 distracted+0.615
Neg logit contributions
lihood-0.756
VERTISEMENT-0.632
-0.622
archives-0.572
iso-0.570
TokenBy
Token ID3886
Feature activation-1.72e-01

Pos logit contributions
lihood+0.197
VERTISEMENT+0.163
+0.156
archives+0.143
iso+0.143
Neg logit contributions
 crooked-0.189
 mutated-0.186
 negligent-0.186
 tablet-0.182
vered-0.178
Token of
Token ID286
Feature activation+1.63e-01

Pos logit contributions
 crooked+0.169
 mutated+0.165
 negligent+0.164
 tablet+0.159
 distracted+0.154
Neg logit contributions
lihood-0.188
VERTISEMENT-0.160
-0.153
archives-0.144
iso-0.143
Token the
Token ID262
Feature activation+2.74e-02

Pos logit contributions
lihood+0.543
VERTISEMENT+0.475
+0.460
iso+0.433
archives+0.432
Neg logit contributions
 crooked-0.352
 negligent-0.341
 mutated-0.341
 tablet-0.327
 toe-0.316
Token post
Token ID1281
Feature activation-3.60e-01

Pos logit contributions
lihood+0.092
VERTISEMENT+0.080
+0.077
archives+0.073
iso+0.073
Neg logit contributions
 crooked-0.059
 mutated-0.057
 negligent-0.057
 tablet-0.055
 distracted-0.053
Token-
Token ID12
Feature activation+1.46e-01

Pos logit contributions
 crooked+1.006
 negligent+0.981
 mutated+0.977
 tablet+0.957
 distracted+0.932
Neg logit contributions
lihood-0.958
VERTISEMENT-0.808
-0.762
archives-0.693
iso-0.690
Token1945
Token ID41931
Feature activation+2.34e-01

Pos logit contributions
lihood+0.535
VERTISEMENT+0.468
+0.450
archives+0.423
iso+0.421
Neg logit contributions
 crooked-0.267
 negligent-0.258
 mutated-0.257
 tablet-0.248
 distracted-0.236
Token world
Token ID995
Feature activation-5.54e-01

Pos logit contributions
lihood+0.775
VERTISEMENT+0.672
+0.651
archives+0.616
iso+0.612
Neg logit contributions
 crooked-0.509
 mutated-0.494
 negligent-0.491
 tablet-0.477
 distracted-0.455
Token order
Token ID1502
Feature activation+2.98e-01

Pos logit contributions
 crooked+1.493
 mutated+1.457
 negligent+1.446
 tablet+1.434
 toe+1.382
Neg logit contributions
lihood-1.528
VERTISEMENT-1.302
-1.228
archives-1.158
iso-1.143
Token.
Token ID13
Feature activation+6.86e-02

Pos logit contributions
lihood+0.876
VERTISEMENT+0.745
+0.728
archives+0.683
iso+0.673
Neg logit contributions
 crooked-0.754
 mutated-0.743
 negligent-0.739
 tablet-0.717
 distracted-0.691
Token
Token ID447
Feature activation-4.90e-02

Pos logit contributions
lihood+0.191
VERTISEMENT+0.161
+0.156
iso+0.145
archives+0.145
Neg logit contributions
 crooked-0.179
 negligent-0.176
 mutated-0.175
 tablet-0.169
 Went-0.166
Token
Token ID251
Feature activation+6.76e-02

Pos logit contributions
 crooked+0.100
 negligent+0.096
 mutated+0.096
 tablet+0.093
vered+0.089
Neg logit contributions
lihood-0.167
VERTISEMENT-0.143
-0.141
archives-0.133
iso-0.133
Token With
Token ID2080
Feature activation+5.99e-02

Pos logit contributions
lihood+0.187
VERTISEMENT+0.157
+0.153
iso+0.143
archives+0.142
Neg logit contributions
 crooked-0.178
 negligent-0.174
 mutated-0.174
 tablet-0.168
 Went-0.165
Token document
Token ID3188
Feature activation-2.09e-01

Pos logit contributions
 crooked+0.353
 mutated+0.346
 tablet+0.344
 negligent+0.338
 toe+0.323
Neg logit contributions
lihood-0.423
VERTISEMENT-0.360
-0.359
archives-0.322
iso-0.319
Token,
Token ID11
Feature activation-1.41e-01

Pos logit contributions
 crooked+0.513
 mutated+0.503
 negligent+0.495
 tablet+0.493
 toe+0.469
Neg logit contributions
lihood-0.632
VERTISEMENT-0.536
-0.527
archives-0.484
iso-0.478
Token but
Token ID475
Feature activation-6.26e-02

Pos logit contributions
 crooked+0.368
 mutated+0.362
 negligent+0.358
 tablet+0.355
 distracted+0.338
Neg logit contributions
lihood-0.408
VERTISEMENT-0.343
-0.336
archives-0.307
iso-0.304
Token the
Token ID262
Feature activation+3.89e-03

Pos logit contributions
 crooked+0.138
 mutated+0.135
 negligent+0.134
 tablet+0.131
 distracted+0.125
Neg logit contributions
lihood-0.204
VERTISEMENT-0.177
-0.174
archives-0.161
iso-0.161
Token current
Token ID1459
Feature activation+8.90e-02

Pos logit contributions
lihood+0.013
VERTISEMENT+0.012
+0.012
archives+0.011
iso+0.011
Neg logit contributions
 crooked-0.008
 mutated-0.008
 negligent-0.008
 tablet-0.008
 distracted-0.007
Token version
Token ID2196
Feature activation-6.06e-01

Pos logit contributions
lihood+0.295
VERTISEMENT+0.255
+0.252
archives+0.234
iso+0.233
Neg logit contributions
 crooked-0.192
 mutated-0.187
 negligent-0.185
 tablet-0.184
 distracted-0.172
Token of
Token ID286
Feature activation-1.53e-01

Pos logit contributions
 crooked+1.440
 mutated+1.409
 negligent+1.384
 tablet+1.368
 distracted+1.306
Neg logit contributions
lihood-1.875
VERTISEMENT-1.620
-1.586
archives-1.449
iso-1.441
Token Pages
Token ID28221
Feature activation-2.64e-01

Pos logit contributions
 crooked+0.346
 mutated+0.338
 negligent+0.335
 tablet+0.329
 distracted+0.313
Neg logit contributions
lihood-0.486
VERTISEMENT-0.412
-0.412
iso-0.378
archives-0.378
Token doesn
Token ID1595
Feature activation-1.49e-01

Pos logit contributions
 crooked+0.615
 mutated+0.601
 negligent+0.594
 tablet+0.584
 distracted+0.557
Neg logit contributions
lihood-0.829
VERTISEMENT-0.709
-0.698
archives-0.643
iso-0.640
Token't
Token ID470
Feature activation+4.18e-02

Pos logit contributions
 crooked+0.380
 mutated+0.371
 negligent+0.371
 tablet+0.362
vered+0.349
Neg logit contributions
lihood-0.430
VERTISEMENT-0.365
-0.349
archives-0.324
iso-0.320
Token.
Token ID13
Feature activation-9.61e-02

Pos logit contributions
lihood+0.130
VERTISEMENT+0.111
+0.109
archives+0.101
iso+0.100
Neg logit contributions
 crooked-0.099
 mutated-0.096
 negligent-0.096
 tablet-0.094
 toe-0.090
Token that
Token ID326
Feature activation-1.58e-02

Pos logit contributions
 crooked+0.879
 negligent+0.864
 mutated+0.838
 tablet+0.815
 toe+0.791
Neg logit contributions
lihood-1.255
VERTISEMENT-1.124
-1.072
iso-1.011
archives-1.002
Token Team
Token ID4816
Feature activation-1.31e-01

Pos logit contributions
 crooked+0.036
 negligent+0.035
 mutated+0.034
 tablet+0.033
 distracted+0.032
Neg logit contributions
lihood-0.050
VERTISEMENT-0.044
-0.043
iso-0.040
archives-0.040
Token USA
Token ID4916
Feature activation-2.76e-01

Pos logit contributions
 crooked+0.348
 negligent+0.341
 mutated+0.340
 tablet+0.330
vered+0.318
Neg logit contributions
lihood-0.363
VERTISEMENT-0.304
-0.298
archives-0.275
iso-0.273
Token is
Token ID318
Feature activation+6.90e-02

Pos logit contributions
 crooked+0.573
 negligent+0.560
 mutated+0.559
 tablet+0.534
 distracted+0.515
Neg logit contributions
lihood-0.934
VERTISEMENT-0.811
-0.794
archives-0.745
iso-0.744
Token so
Token ID523
Feature activation-1.00e-01

Pos logit contributions
lihood+0.229
VERTISEMENT+0.199
+0.195
iso+0.182
archives+0.182
Neg logit contributions
 crooked-0.148
 negligent-0.145
 mutated-0.143
 tablet-0.138
 distracted-0.132
Token turned
Token ID2900
Feature activation-5.46e-01

Pos logit contributions
 crooked+0.184
 negligent+0.179
 mutated+0.176
 tablet+0.169
 distracted+0.161
Neg logit contributions
lihood-0.365
VERTISEMENT-0.320
-0.314
iso-0.296
archives-0.296
Token off
Token ID572
Feature activation+1.72e-01

Pos logit contributions
 crooked+1.247
 negligent+1.219
 mutated+1.198
 tablet+1.164
 toe+1.136
Neg logit contributions
lihood-1.755
VERTISEMENT-1.506
-1.474
archives-1.398
AGE-1.365
Token by
Token ID416
Feature activation+2.21e-01

Pos logit contributions
lihood+0.459
VERTISEMENT+0.391
+0.376
archives+0.346
iso+0.343
Neg logit contributions
 crooked-0.480
 negligent-0.471
 mutated-0.467
 tablet-0.458
 distracted-0.443
Token his
Token ID465
Feature activation-1.53e-01

Pos logit contributions
lihood+0.711
VERTISEMENT+0.620
+0.607
iso+0.565
archives+0.565
Neg logit contributions
 crooked-0.499
 negligent-0.486
 mutated-0.481
 tablet-0.465
 distracted-0.448
Token character
Token ID2095
Feature activation+1.76e-01

Pos logit contributions
 crooked+0.328
 negligent+0.317
 mutated+0.312
 tablet+0.304
 distracted+0.292
Neg logit contributions
lihood-0.513
VERTISEMENT-0.450
-0.442
archives-0.417
iso-0.416
Token that
Token ID326
Feature activation-1.50e-01

Pos logit contributions
lihood+0.552
VERTISEMENT+0.477
+0.472
archives+0.438
iso+0.430
Neg logit contributions
 crooked-0.406
 negligent-0.402
 mutated-0.399
 tablet-0.383
 distracted-0.369
Token or
Token ID393
Feature activation+1.24e-01

Pos logit contributions
lihood+0.275
VERTISEMENT+0.231
+0.225
archives+0.209
iso+0.204
Neg logit contributions
 crooked-0.255
 mutated-0.250
 negligent-0.248
 tablet-0.239
 distracted-0.233
Token movement
Token ID3356
Feature activation-3.54e-02

Pos logit contributions
lihood+0.383
VERTISEMENT+0.326
+0.323
archives+0.304
iso+0.300
Neg logit contributions
 crooked-0.299
 negligent-0.290
 mutated-0.289
 tablet-0.280
 toe-0.270
Token,
Token ID11
Feature activation+1.47e-01

Pos logit contributions
 crooked+0.091
 negligent+0.089
 mutated+0.088
 tablet+0.086
 distracted+0.084
Neg logit contributions
lihood-0.103
VERTISEMENT-0.087
-0.086
archives-0.080
iso-0.078
Token dressing
Token ID18544
Feature activation-1.06e-01

Pos logit contributions
lihood+0.474
VERTISEMENT+0.407
+0.403
archives+0.380
iso+0.373
Neg logit contributions
 crooked-0.338
 negligent-0.327
 mutated-0.327
 tablet-0.317
 toe-0.305
Token and
Token ID290
Feature activation+2.57e-02

Pos logit contributions
 crooked+0.261
 mutated+0.252
 negligent+0.250
 tablet+0.245
 toe+0.238
Neg logit contributions
lihood-0.323
VERTISEMENT-0.275
-0.272
archives-0.257
iso-0.249
Token grooming
Token ID41836
Feature activation-5.59e-01

Pos logit contributions
lihood+0.080
VERTISEMENT+0.069
+0.068
archives+0.063
iso+0.063
Neg logit contributions
 crooked-0.061
 negligent-0.059
 mutated-0.059
 tablet-0.058
 toe-0.056
Token,
Token ID11
Feature activation+8.66e-02

Pos logit contributions
 crooked+1.411
 negligent+1.371
 mutated+1.367
 tablet+1.330
 toe+1.296
Neg logit contributions
lihood-1.663
-1.371
VERTISEMENT-1.343
archives-1.278
iso-1.250
Token and
Token ID290
Feature activation-1.56e-02

Pos logit contributions
lihood+0.274
VERTISEMENT+0.235
+0.232
archives+0.217
iso+0.216
Neg logit contributions
 crooked-0.203
 negligent-0.197
 mutated-0.196
 tablet-0.193
 toe-0.185
Token possibly
Token ID5457
Feature activation+7.16e-02

Pos logit contributions
 crooked+0.036
 negligent+0.035
 mutated+0.035
 tablet+0.034
 toe+0.033
Neg logit contributions
lihood-0.049
VERTISEMENT-0.043
-0.042
archives-0.039
iso-0.039
Token certain
Token ID1728
Feature activation+5.26e-01

Pos logit contributions
lihood+0.225
VERTISEMENT+0.194
+0.192
iso+0.180
archives+0.179
Neg logit contributions
 crooked-0.167
 mutated-0.162
 negligent-0.162
 tablet-0.157
 toe-0.151
Token behaviors
Token ID14301
Feature activation-3.23e-02

Pos logit contributions
lihood+1.571
VERTISEMENT+1.335
+1.309
archives+1.223
iso+1.215
Neg logit contributions
 crooked-1.312
 mutated-1.284
 negligent-1.280
 tablet-1.243
 distracted-1.190