TOP ACTIVATIONS
MAX = 1.797

but decided that what the hell I 'll just put it
How in the hell could a man enjoy being
: What in hell was the point of kick
your desk to impress the hell out of your friends .
of Trump supporters . The fact that he lied every other
a rat and get the heck off this sinking ship of
a woman of a certain age who actually acts her age
also cite her vote in favor of a 2011 Republican bill
1 July 2017 , if Labor wins the election . Given
, What the hell . Maybe I should give
: " Why in the hell do journalists insist on coming
Honestly , the fact that we still have bank
still wonder why in the hell he would ever paraph rase
The fact that a device like the
and another for star Matthew Mc Con aug hey 's performance
kids to know . The fact that it treats homosexuality as
from those behaviors . The fact is that in his general
game , you learn a hell of a lot doing that
it . The fact that Que er men would
in the first quarter to lobby on domestic spying legislation and

MIN ACTIVATIONS
MIN = -1.802

all . ( UPDATE : Sherman suffered a rupt
For the second straight year , Breaking Bad
. PT . Update , 11 : 15 a
the crazy Finnish crowd at ASUS R OG just over five
$ s Format ) } else { $ font 2 =
: companies such as Con oco Phill ips and Marathon have
web site . Update : Bungie responds to the
intro vert online ? <|endoftext|> Update : You can find a
: Jackson gay rights leader accused of burning down own home
path )" break } } else { $ SR = [
free newsletters . Update , 3 : 01 p
( 0 s )} c xd 4 { ( 341 s
Select Width , Height } else { Write - Warning -
- order through UPDATE : Looks like only a
not know what it is like to be a member of
what it feels like to attain the child
cigarette brands , saw its stock decline more than 4 percent
said . C EL AC IS B ORN
LINE | Do you feel like a terrible person for doing
that have traditionally been fre ew he eling forums where users

INTERVAL 0.897 - 1.797
CONTAINS 0.161%

the first few hours mentioned above , your body goes into
be looking to g ank immedi at ly after 6 .
Driver ," the 1976 Martin Sc ors ese film that 's
Gilbert was on the ice last year . Gil
general to oppose the confirmation of Mer rick Garland to the

INTERVAL -0.003 - 0.897
CONTAINS 50.004%

In fact , nobody in our company part akes in marijuana
at 1 pm . Email questions to your travel s @
hit if L aine hits his offensive stride to go along
shelling out to the cable company each month .
life and it was all thanks to this guy . Not

INTERVAL -0.903 - -0.003
CONTAINS 49.637%

a much wider perspective from a very small lens . Image
irc ulating for the spring election ," L ask er said
by Russia need to be met by American strength . If
it 's true , & I am UN B EL IE
ā m ai ā Hal ep ā iw i l ā

INTERVAL -1.802 - -0.903
CONTAINS 0.197%

S . economist Paul Krugman dubbed the revision from an original
106 ] In his first three games he shot 25 -
fn undo _ move ( & mut self ) { let
morning and afternoon peaks has increased by nearly 50 %. But
top among the industry 's HD 78 70 based graphics cards
Token but
Token ID475
Feature activation-7.68e-04

Pos logit contributions
wit+0.192
 Hud+0.190
 Ow+0.190
earable+0.188
hw+0.188
Neg logit contributions
advertisement-0.177
bourg-0.168
-0.162
quished-0.155
Lex-0.137
Token decided
Token ID3066
Feature activation+8.27e-02

Pos logit contributions
advertisement+0.002
bourg+0.002
+0.002
quished+0.002
 ASUS+0.002
Neg logit contributions
wit-0.002
 Hud-0.002
 Ow-0.002
earable-0.002
hw-0.002
Token that
Token ID326
Feature activation-2.37e-01

Pos logit contributions
wit+0.206
 Hud+0.204
 Ow+0.204
earable+0.201
hw+0.201
Neg logit contributions
advertisement-0.306
bourg-0.294
-0.286
quished-0.277
Lex-0.251
Token what
Token ID644
Feature activation-1.24e-01

Pos logit contributions
advertisement+0.710
bourg+0.677
+0.652
quished+0.625
 ASUS+0.559
Neg logit contributions
wit-0.758
 Hud-0.748
earable-0.745
 Ow-0.744
hw-0.740
Token the
Token ID262
Feature activation+1.02e-01

Pos logit contributions
advertisement+0.372
bourg+0.354
+0.341
quished+0.325
 ASUS+0.294
Neg logit contributions
wit-0.396
hw-0.388
 Hud-0.388
earable-0.388
 Ow-0.387
Token hell
Token ID5968
Feature activation+1.80e+00

Pos logit contributions
wit+0.288
 Hud+0.284
earable+0.283
 Ow+0.283
hw+0.281
Neg logit contributions
advertisement-0.342
bourg-0.325
-0.316
quished-0.305
 ASUS-0.275
Token I
Token ID314
Feature activation-2.11e-01

Pos logit contributions
 Ow+5.222
wit+5.208
 Hud+5.094
izable+5.067
hw+5.032
Neg logit contributions
advertisement-5.708
bourg-5.381
-5.234
quished-5.073
ipel-4.500
Token'll
Token ID1183
Feature activation-3.71e-02

Pos logit contributions
advertisement+0.706
bourg+0.686
+0.662
quished+0.632
Lex+0.578
Neg logit contributions
wit-0.588
 Hud-0.581
earable-0.577
reads-0.575
 Ow-0.575
Token just
Token ID655
Feature activation-3.67e-01

Pos logit contributions
advertisement+0.131
bourg+0.127
+0.123
quished+0.117
 ASUS+0.108
Neg logit contributions
wit-0.097
 Hud-0.096
 Ow-0.095
earable-0.095
reads-0.095
Token put
Token ID1234
Feature activation-5.31e-01

Pos logit contributions
advertisement+1.266
bourg+1.224
+1.179
quished+1.122
 desp+1.047
Neg logit contributions
wit-0.977
 Hud-0.969
earable-0.962
reads-0.960
 Ow-0.958
Token it
Token ID340
Feature activation-2.63e-01

Pos logit contributions
advertisement+1.596
bourg+1.518
+1.480
quished+1.401
 ASUS+1.267
Neg logit contributions
wit-1.668
earable-1.648
 Hud-1.642
reads-1.636
 Ow-1.632
Token
Token ID198
Feature activation-2.53e-02

Pos logit contributions
advertisement+0.665
bourg+0.629
+0.616
quished+0.583
 ASUS+0.527
Neg logit contributions
wit-0.622
 Hud-0.613
earable-0.612
hw-0.608
 Ow-0.608
Token
Token ID198
Feature activation+4.20e-02

Pos logit contributions
advertisement+0.083
bourg+0.079
+0.076
quished+0.073
Lex+0.066
Neg logit contributions
wit-0.074
 Hud-0.074
 Ow-0.073
earable-0.073
hw-0.073
TokenHow
Token ID2437
Feature activation-3.89e-03

Pos logit contributions
wit+0.128
 Hud+0.127
 Ow+0.127
earable+0.126
hw+0.125
Neg logit contributions
advertisement-0.134
bourg-0.126
-0.123
quished-0.117
Lex-0.105
Token in
Token ID287
Feature activation+9.04e-02

Pos logit contributions
advertisement+0.014
bourg+0.013
+0.013
quished+0.013
Lex+0.011
Neg logit contributions
wit-0.010
 Hud-0.010
 Ow-0.010
earable-0.010
reads-0.010
Token the
Token ID262
Feature activation+2.11e-01

Pos logit contributions
wit+0.270
 Hud+0.269
 Ow+0.268
earable+0.267
hw+0.266
Neg logit contributions
advertisement-0.288
bourg-0.275
-0.267
quished-0.256
Lex-0.228
Token hell
Token ID5968
Feature activation+1.72e+00

Pos logit contributions
wit+0.589
 Hud+0.583
 Ow+0.581
earable+0.578
reads+0.574
Neg logit contributions
advertisement-0.718
bourg-0.687
-0.668
quished-0.642
Lex-0.579
Token could
Token ID714
Feature activation+3.84e-01

Pos logit contributions
wit+4.022
 Ow+3.986
 Hud+3.887
hw+3.856
earable+3.795
Neg logit contributions
advertisement-6.568
bourg-6.202
-6.102
quished-5.981
ipel-5.375
Token a
Token ID257
Feature activation-9.64e-03

Pos logit contributions
wit+1.202
 Hud+1.194
 Ow+1.190
earable+1.179
hw+1.176
Neg logit contributions
advertisement-1.174
bourg-1.117
-1.084
quished-1.039
Lex-0.918
Token man
Token ID582
Feature activation+9.38e-03

Pos logit contributions
advertisement+0.033
bourg+0.031
+0.031
quished+0.029
Lex+0.026
Neg logit contributions
wit-0.027
 Hud-0.026
 Ow-0.026
earable-0.026
hw-0.026
Token enjoy
Token ID2883
Feature activation-4.74e-01

Pos logit contributions
wit+0.026
 Hud+0.026
 Ow+0.026
earable+0.026
reads+0.026
Neg logit contributions
advertisement-0.032
bourg-0.030
-0.029
quished-0.028
 ASUS-0.025
Token being
Token ID852
Feature activation+2.17e-01

Pos logit contributions
advertisement+1.464
bourg+1.379
+1.352
quished+1.264
 desp+1.149
Neg logit contributions
wit-1.467
 Hud-1.445
earable-1.435
 Ow-1.435
hw-1.429
Token :
Token ID1058
Feature activation-2.84e-01

Pos logit contributions
wit+1.591
 Hud+1.575
 Ow+1.572
earable+1.554
hw+1.552
Neg logit contributions
advertisement-1.573
bourg-1.507
-1.459
quished-1.406
Lex-1.239
Token
Token ID198
Feature activation+1.96e-01

Pos logit contributions
advertisement+0.901
bourg+0.854
+0.827
quished+0.794
 ASUS+0.710
Neg logit contributions
wit-0.857
 Hud-0.850
 Ow-0.847
earable-0.846
reads-0.839
Token
Token ID198
Feature activation+1.58e-01

Pos logit contributions
wit+0.563
 Hud+0.558
 Ow+0.556
earable+0.550
hw+0.549
Neg logit contributions
advertisement-0.652
bourg-0.624
-0.607
quished-0.586
Lex-0.521
TokenWhat
Token ID2061
Feature activation-1.24e-01

Pos logit contributions
wit+0.474
 Hud+0.472
 Ow+0.471
earable+0.467
hw+0.463
Neg logit contributions
advertisement-0.505
bourg-0.479
-0.465
quished-0.445
Lex-0.398
Token in
Token ID287
Feature activation-4.02e-02

Pos logit contributions
advertisement+0.409
bourg+0.387
+0.377
quished+0.360
 ASUS+0.328
Neg logit contributions
wit-0.356
 Hud-0.352
earable-0.352
 Ow-0.351
hw-0.349
Token hell
Token ID5968
Feature activation+1.65e+00

Pos logit contributions
advertisement+0.129
bourg+0.123
+0.120
quished+0.114
 ASUS+0.103
Neg logit contributions
wit-0.119
 Hud-0.119
 Ow-0.118
earable-0.118
reads-0.117
Token was
Token ID373
Feature activation+1.57e-01

Pos logit contributions
wit+4.506
 Ow+4.485
 Hud+4.381
hw+4.334
earable+4.288
Neg logit contributions
advertisement-5.497
bourg-5.212
-5.130
quished-4.918
ipel-4.422
Token the
Token ID262
Feature activation+4.14e-02

Pos logit contributions
wit+0.516
 Hud+0.512
 Ow+0.510
earable+0.506
hw+0.505
Neg logit contributions
advertisement-0.458
bourg-0.435
-0.421
quished-0.403
Lex-0.353
Token point
Token ID966
Feature activation+5.16e-01

Pos logit contributions
wit+0.113
 Hud+0.112
earable+0.111
hw+0.111
 Ow+0.111
Neg logit contributions
advertisement-0.144
bourg-0.136
-0.134
quished-0.127
 ASUS-0.117
Token of
Token ID286
Feature activation-2.74e-01

Pos logit contributions
wit+1.833
 Ow+1.826
 Hud+1.821
earable+1.793
hw+1.785
Neg logit contributions
advertisement-1.351
bourg-1.285
-1.237
quished-1.178
ipel-1.013
Token kick
Token ID4829
Feature activation-1.90e-01

Pos logit contributions
advertisement+0.857
bourg+0.806
+0.788
quished+0.746
 ASUS+0.675
Neg logit contributions
wit-0.841
 Hud-0.832
earable-0.829
reads-0.826
hw-0.826
Token your
Token ID534
Feature activation-1.42e-01

Pos logit contributions
advertisement+0.330
bourg+0.313
+0.305
quished+0.294
 ASUS+0.267
Neg logit contributions
wit-0.275
 Hud-0.268
 Ow-0.268
earable-0.267
reads-0.266
Token desk
Token ID6915
Feature activation-1.41e-01

Pos logit contributions
advertisement+0.434
bourg+0.408
+0.400
quished+0.383
 ASUS+0.346
Neg logit contributions
wit-0.447
 Ow-0.433
earable-0.432
reads-0.432
hw-0.431
Token to
Token ID284
Feature activation-2.73e-02

Pos logit contributions
advertisement+0.442
bourg+0.421
+0.409
quished+0.393
 ASUS+0.348
Neg logit contributions
wit-0.432
 Hud-0.428
 Ow-0.427
earable-0.424
hw-0.422
Token impress
Token ID14947
Feature activation+4.72e-01

Pos logit contributions
advertisement+0.097
bourg+0.093
+0.090
quished+0.086
 ASUS+0.079
Neg logit contributions
wit-0.072
 Hud-0.071
 Ow-0.071
earable-0.071
reads-0.071
Token the
Token ID262
Feature activation+1.12e-01

Pos logit contributions
wit+1.275
 Hud+1.265
 Ow+1.260
earable+1.245
hw+1.243
Neg logit contributions
advertisement-1.645
bourg-1.576
-1.536
quished-1.479
Lex-1.330
Token hell
Token ID5968
Feature activation+1.64e+00

Pos logit contributions
wit+0.316
 Hud+0.312
 Ow+0.311
earable+0.310
hw+0.308
Neg logit contributions
advertisement-0.375
bourg-0.357
-0.348
quished-0.336
Lex-0.300
Token out
Token ID503
Feature activation+6.37e-01

Pos logit contributions
wit+5.219
 Ow+4.965
 Hud+4.917
hw+4.838
enance+4.787
Neg logit contributions
advertisement-5.030
bourg-4.879
-4.444
quished-4.417
 ASUS-3.876
Token of
Token ID286
Feature activation+3.40e-01

Pos logit contributions
wit+2.278
 Ow+2.227
 Hud+2.208
hw+2.187
earable+2.172
Neg logit contributions
advertisement-1.708
bourg-1.611
-1.552
quished-1.453
Lex-1.273
Token your
Token ID534
Feature activation+1.39e-01

Pos logit contributions
wit+0.979
 Hud+0.973
 Ow+0.970
earable+0.958
hw+0.955
Neg logit contributions
advertisement-1.126
bourg-1.077
-1.047
quished-1.007
Lex-0.898
Token friends
Token ID2460
Feature activation+8.19e-01

Pos logit contributions
wit+0.368
 Hud+0.363
 Ow+0.362
earable+0.359
hw+0.358
Neg logit contributions
advertisement-0.495
bourg-0.474
-0.462
quished-0.446
Lex-0.402
Token.
Token ID13
Feature activation+1.98e-01

Pos logit contributions
wit+2.390
 Ow+2.389
 Hud+2.388
hw+2.346
earable+2.338
Neg logit contributions
advertisement-2.650
bourg-2.540
-2.446
quished-2.351
Lex-2.094
Token of
Token ID286
Feature activation-4.58e-02

Pos logit contributions
advertisement+0.268
bourg+0.252
+0.242
quished+0.230
Lex+0.195
Neg logit contributions
wit-0.400
 Hud-0.397
 Ow-0.396
earable-0.394
hw-0.392
Token Trump
Token ID1301
Feature activation+3.46e-01

Pos logit contributions
advertisement+0.152
bourg+0.146
+0.141
quished+0.135
 Secret+0.123
Neg logit contributions
wit-0.131
earable-0.130
 Hud-0.129
 Ow-0.128
hw-0.128
Token supporters
Token ID5941
Feature activation+5.33e-01

Pos logit contributions
wit+0.873
 Hud+0.865
 Ow+0.862
earable+0.853
hw+0.849
Neg logit contributions
advertisement-1.267
bourg-1.216
-1.186
quished-1.145
Lex-1.036
Token.
Token ID13
Feature activation+9.12e-03

Pos logit contributions
wit+1.631
 Hud+1.619
 Ow+1.613
hw+1.593
earable+1.592
Neg logit contributions
advertisement-1.671
bourg-1.586
-1.540
quished-1.482
Lex-1.311
Token The
Token ID383
Feature activation+6.60e-02

Pos logit contributions
wit+0.028
earable+0.028
 Hud+0.028
 Ow+0.028
hw+0.028
Neg logit contributions
advertisement-0.028
bourg-0.027
-0.026
quished-0.025
 Secret-0.022
Token fact
Token ID1109
Feature activation+1.62e+00

Pos logit contributions
wit+0.185
 Hud+0.183
earable+0.183
 Ow+0.182
hw+0.181
Neg logit contributions
advertisement-0.223
bourg-0.213
-0.207
quished-0.199
 Secret-0.180
Token that
Token ID326
Feature activation+5.54e-01

Pos logit contributions
wit+4.626
 Hud+4.620
reads+4.534
 Ow+4.489
hw+4.409
Neg logit contributions
advertisement-5.368
bourg-5.017
-4.811
quished-4.716
xi-4.207
Token he
Token ID339
Feature activation-5.59e-03

Pos logit contributions
wit+1.637
 Hud+1.626
 Ow+1.620
earable+1.603
hw+1.599
Neg logit contributions
advertisement-1.793
bourg-1.710
-1.662
quished-1.598
Lex-1.422
Token lied
Token ID19837
Feature activation+4.38e-01

Pos logit contributions
advertisement+0.018
bourg+0.017
+0.017
quished+0.016
 desp+0.014
Neg logit contributions
wit-0.016
 Hud-0.016
earable-0.016
 Ow-0.016
hw-0.016
Token every
Token ID790
Feature activation+5.61e-01

Pos logit contributions
wit+1.262
 Hud+1.253
 Ow+1.246
hw+1.231
earable+1.228
Neg logit contributions
advertisement-1.449
bourg-1.382
-1.343
quished-1.295
Lex-1.154
Token other
Token ID584
Feature activation-3.75e-02

Pos logit contributions
wit+1.867
 Hud+1.854
 Ow+1.844
hw+1.827
earable+1.824
Neg logit contributions
advertisement-1.605
bourg-1.525
-1.467
quished-1.419
Lex-1.231
Token a
Token ID257
Feature activation-7.94e-02

Pos logit contributions
advertisement+0.176
bourg+0.167
+0.162
quished+0.156
 ASUS+0.139
Neg logit contributions
wit-0.173
 Hud-0.172
earable-0.172
 Ow-0.171
hw-0.170
Token rat
Token ID4227
Feature activation+1.58e-01

Pos logit contributions
advertisement+0.265
bourg+0.252
+0.246
quished+0.237
 ASUS+0.213
Neg logit contributions
wit-0.224
earable-0.223
reads-0.221
 Hud-0.221
 Ow-0.221
Token and
Token ID290
Feature activation-1.39e-01

Pos logit contributions
wit+0.493
 Hud+0.491
 Ow+0.488
earable+0.486
hw+0.483
Neg logit contributions
advertisement-0.485
bourg-0.462
-0.449
quished-0.430
Lex-0.380
Token get
Token ID651
Feature activation-1.14e-01

Pos logit contributions
advertisement+0.484
bourg+0.466
+0.450
quished+0.430
 ASUS+0.397
Neg logit contributions
wit-0.372
 Hud-0.369
earable-0.367
hw-0.366
 Ow-0.366
Token the
Token ID262
Feature activation+1.19e-01

Pos logit contributions
advertisement+0.385
bourg+0.365
+0.356
quished+0.342
 ASUS+0.315
Neg logit contributions
wit-0.317
earable-0.316
reads-0.314
 Hud-0.313
hw-0.312
Token heck
Token ID22574
Feature activation+1.60e+00

Pos logit contributions
wit+0.332
earable+0.328
 Hud+0.327
hw+0.326
reads+0.325
Neg logit contributions
advertisement-0.403
bourg-0.383
-0.372
quished-0.359
 ASUS-0.327
Token off
Token ID572
Feature activation+6.88e-01

Pos logit contributions
wit+4.998
 Ow+4.925
 Hud+4.889
hw+4.803
enance+4.754
Neg logit contributions
advertisement-4.876
bourg-4.676
-4.385
quished-4.244
Lex-3.789
Token this
Token ID428
Feature activation+2.45e-01

Pos logit contributions
wit+2.108
 Ow+2.085
 Hud+2.081
earable+2.043
hw+2.027
Neg logit contributions
advertisement-2.148
bourg-2.072
-2.023
quished-1.917
xi-1.698
Token sinking
Token ID27141
Feature activation+8.33e-02

Pos logit contributions
wit+0.700
 Hud+0.694
 Ow+0.692
earable+0.685
hw+0.683
Neg logit contributions
advertisement-0.815
bourg-0.778
-0.757
quished-0.728
Lex-0.651
Token ship
Token ID4074
Feature activation-2.99e-01

Pos logit contributions
wit+0.244
 Hud+0.242
 Ow+0.241
earable+0.238
hw+0.237
Neg logit contributions
advertisement-0.273
bourg-0.260
-0.253
quished-0.243
Lex-0.216
Token of
Token ID286
Feature activation-1.39e-01

Pos logit contributions
advertisement+0.898
bourg+0.851
+0.821
quished+0.794
Lex+0.697
Neg logit contributions
wit-0.946
 Hud-0.943
 Ow-0.937
earable-0.935
hw-0.931
Tokena
Token ID64
Feature activation-1.59e-01

Pos logit contributions
wit+0.671
 Hud+0.667
 Ow+0.664
earable+0.658
hw+0.654
Neg logit contributions
advertisement-0.823
bourg-0.785
-0.764
quished-0.735
Lex-0.661
Token woman
Token ID2415
Feature activation+3.99e-02

Pos logit contributions
advertisement+0.543
bourg+0.517
+0.505
quished+0.483
 Secret+0.448
Neg logit contributions
earable-0.432
wit-0.431
 Ow-0.431
 Hud-0.428
reads-0.427
Token of
Token ID286
Feature activation-2.42e-01

Pos logit contributions
wit+0.116
 Hud+0.115
 Ow+0.115
earable+0.115
hw+0.114
Neg logit contributions
advertisement-0.131
bourg-0.124
-0.121
quished-0.116
 Secret-0.104
Token a
Token ID257
Feature activation+2.98e-01

Pos logit contributions
advertisement+0.840
bourg+0.803
+0.781
quished+0.746
 Secret+0.691
Neg logit contributions
wit-0.646
earable-0.645
 Ow-0.643
hw-0.641
 Hud-0.641
Token certain
Token ID1728
Feature activation+3.80e-01

Pos logit contributions
wit+0.781
 Hud+0.775
 Ow+0.773
earable+0.770
hw+0.763
Neg logit contributions
advertisement-1.060
bourg-1.017
-0.990
quished-0.952
Lex-0.862
Token age
Token ID2479
Feature activation+1.58e+00

Pos logit contributions
wit+1.055
 Hud+1.048
 Ow+1.046
earable+1.039
hw+1.032
Neg logit contributions
advertisement-1.293
bourg-1.238
-1.199
quished-1.151
Lex-1.037
Token who
Token ID508
Feature activation-2.80e-01

Pos logit contributions
wit+4.537
 Ow+4.422
 Hud+4.413
hw+4.377
reads+4.340
Neg logit contributions
advertisement-5.151
bourg-4.951
-4.848
quished-4.734
Lex-4.156
Token actually
Token ID1682
Feature activation-1.24e-01

Pos logit contributions
advertisement+0.895
bourg+0.857
+0.831
quished+0.791
 desp+0.726
Neg logit contributions
wit-0.831
earable-0.823
 Hud-0.818
 Ow-0.817
hw-0.811
Token acts
Token ID6529
Feature activation-2.71e-01

Pos logit contributions
advertisement+0.416
bourg+0.396
+0.386
quished+0.367
 desp+0.339
Neg logit contributions
wit-0.353
 Hud-0.349
earable-0.348
 Ow-0.347
hw-0.343
Token her
Token ID607
Feature activation-5.73e-02

Pos logit contributions
advertisement+0.728
bourg+0.690
+0.657
quished+0.627
 desp+0.558
Neg logit contributions
earable-0.937
wit-0.935
 Hud-0.933
 Ow-0.930
hw-0.922
Token age
Token ID2479
Feature activation+1.05e+00

Pos logit contributions
advertisement+0.206
bourg+0.197
+0.192
quished+0.185
xi+0.168
Neg logit contributions
wit-0.148
 Hud-0.147
 Ow-0.146
earable-0.146
hw-0.145
Token also
Token ID635
Feature activation-5.59e-02

Pos logit contributions
wit+1.464
 Hud+1.454
 Ow+1.446
earable+1.429
hw+1.425
Neg logit contributions
advertisement-1.891
bourg-1.811
-1.763
quished-1.701
Lex-1.532
Token cite
Token ID21729
Feature activation+2.19e-01

Pos logit contributions
advertisement+0.195
bourg+0.188
+0.182
quished+0.175
 desp+0.160
Neg logit contributions
wit-0.149
 Ow-0.148
 Hud-0.147
earable-0.147
hw-0.146
Token her
Token ID607
Feature activation+3.97e-02

Pos logit contributions
wit+0.625
earable+0.615
 Ow+0.614
 Hud+0.612
hw+0.610
Neg logit contributions
advertisement-0.734
bourg-0.699
-0.679
quished-0.654
 ASUS-0.590
Token vote
Token ID3015
Feature activation+6.54e-02

Pos logit contributions
wit+0.101
 Hud+0.100
 Ow+0.100
earable+0.099
hw+0.099
Neg logit contributions
advertisement-0.145
bourg-0.138
-0.135
quished-0.130
 ASUS-0.118
Token in
Token ID287
Feature activation+2.83e-01

Pos logit contributions
wit+0.202
 Hud+0.201
 Ow+0.200
earable+0.199
hw+0.198
Neg logit contributions
advertisement-0.203
bourg-0.193
-0.187
quished-0.180
Lex-0.159
Token favor
Token ID2661
Feature activation+1.56e+00

Pos logit contributions
wit+0.834
 Hud+0.824
earable+0.821
 Ow+0.821
hw+0.814
Neg logit contributions
advertisement-0.918
bourg-0.875
-0.850
quished-0.815
Lex-0.730
Token of
Token ID286
Feature activation+7.42e-01

Pos logit contributions
wit+5.171
 Hud+5.165
 Ow+5.085
hw+5.037
enance+5.026
Neg logit contributions
advertisement-4.241
bourg-4.084
-3.864
quished-3.791
ipel-3.330
Token a
Token ID257
Feature activation+2.69e-01

Pos logit contributions
wit+2.089
 Hud+2.077
 Ow+2.068
earable+2.044
hw+2.039
Neg logit contributions
advertisement-2.502
bourg-2.393
-2.327
quished-2.242
Lex-2.005
Token 2011
Token ID2813
Feature activation+1.72e-01

Pos logit contributions
wit+0.731
 Ow+0.719
earable+0.717
 Hud+0.717
reads+0.711
Neg logit contributions
advertisement-0.937
bourg-0.894
-0.875
quished-0.840
Lex-0.758
Token Republican
Token ID3415
Feature activation+5.98e-01

Pos logit contributions
wit+0.463
 Ow+0.455
earable+0.455
 Hud+0.454
hw+0.451
Neg logit contributions
advertisement-0.603
bourg-0.576
-0.562
quished-0.540
 ASUS-0.489
Token bill
Token ID2855
Feature activation+1.83e-01

Pos logit contributions
wit+1.571
 Hud+1.565
 Ow+1.554
earable+1.532
hw+1.529
Neg logit contributions
advertisement-2.126
bourg-2.042
-1.986
quished-1.923
Lex-1.729
Token 1
Token ID352
Feature activation-1.76e-01

Pos logit contributions
advertisement+1.041
bourg+0.999
+0.971
quished+0.909
 ASUS+0.838
Neg logit contributions
wit-0.995
earable-0.990
hw-0.980
 Hud-0.973
reads-0.972
Token July
Token ID2901
Feature activation+1.86e-01

Pos logit contributions
advertisement+0.526
bourg+0.496
+0.486
quished+0.460
xi+0.411
Neg logit contributions
wit-0.562
 Hud-0.558
earable-0.558
 Ow-0.556
reads-0.552
Token 2017
Token ID2177
Feature activation-4.80e-01

Pos logit contributions
wit+0.544
 Hud+0.541
 Ow+0.540
hw+0.532
earable+0.529
Neg logit contributions
advertisement-0.608
bourg-0.578
-0.562
quished-0.546
Lex-0.483
Token,
Token ID11
Feature activation+2.30e-01

Pos logit contributions
advertisement+1.333
bourg+1.279
+1.227
quished+1.163
xi+1.020
Neg logit contributions
wit-1.615
 Hud-1.608
earable-1.604
 Ow-1.599
reads-1.580
Token if
Token ID611
Feature activation+2.17e-01

Pos logit contributions
wit+0.727
 Hud+0.720
 Ow+0.718
earable+0.715
hw+0.711
Neg logit contributions
advertisement-0.696
bourg-0.663
-0.642
quished-0.612
Lex-0.542
Token Labor
Token ID7882
Feature activation+1.55e+00

Pos logit contributions
wit+0.649
 Hud+0.640
earable+0.638
 Ow+0.638
hw+0.635
Neg logit contributions
advertisement-0.694
bourg-0.662
-0.642
quished-0.613
 ASUS-0.550
Token wins
Token ID7864
Feature activation+4.45e-01

Pos logit contributions
wit+3.908
 Hud+3.864
 Ow+3.857
earable+3.792
hw+3.736
Neg logit contributions
advertisement-5.587
bourg-5.359
-5.224
quished-5.170
ipel-4.596
Token the
Token ID262
Feature activation+2.97e-01

Pos logit contributions
wit+1.312
 Hud+1.299
 Ow+1.296
earable+1.286
hw+1.282
Neg logit contributions
advertisement-1.446
bourg-1.381
-1.340
quished-1.288
Lex-1.148
Token election
Token ID3071
Feature activation+6.21e-01

Pos logit contributions
wit+0.916
 Hud+0.906
 Ow+0.903
earable+0.898
hw+0.895
Neg logit contributions
advertisement-0.922
bourg-0.878
-0.853
quished-0.816
Lex-0.724
Token.
Token ID13
Feature activation-5.22e-03

Pos logit contributions
wit+1.936
 Hud+1.930
 Ow+1.922
earable+1.893
hw+1.890
Neg logit contributions
advertisement-1.904
bourg-1.815
-1.756
quished-1.690
Lex-1.499
Token Given
Token ID11259
Feature activation+2.36e-02

Pos logit contributions
advertisement+0.015
bourg+0.014
+0.014
quished+0.013
 ASUS+0.012
Neg logit contributions
wit-0.017
earable-0.017
 Hud-0.017
hw-0.017
 Ow-0.017
Token,
Token ID11
Feature activation-2.71e-01

Pos logit contributions
advertisement+0.589
bourg+0.558
+0.543
quished+0.517
Lex+0.456
Neg logit contributions
wit-0.640
 Hud-0.634
 Ow-0.631
earable-0.629
hw-0.625
Token
Token ID564
Feature activation+2.54e-02

Pos logit contributions
advertisement+0.783
bourg+0.730
+0.718
quished+0.677
 ASUS+0.600
Neg logit contributions
wit-0.897
earable-0.882
 Hud-0.879
reads-0.878
hw-0.873
Token
Token ID250
Feature activation+1.77e-02

Pos logit contributions
wit+0.089
 Hud+0.089
 Ow+0.088
earable+0.088
hw+0.087
Neg logit contributions
advertisement-0.068
bourg-0.064
-0.062
quished-0.059
Lex-0.051
TokenWhat
Token ID2061
Feature activation-2.89e-01

Pos logit contributions
wit+0.050
 Hud+0.050
 Ow+0.050
earable+0.049
hw+0.049
Neg logit contributions
advertisement-0.059
bourg-0.056
-0.055
quished-0.052
Lex-0.047
Token the
Token ID262
Feature activation+1.23e-01

Pos logit contributions
advertisement+0.939
bourg+0.888
+0.861
quished+0.822
 ASUS+0.743
Neg logit contributions
wit-0.842
 Hud-0.838
earable-0.832
 Ow-0.829
reads-0.827
Token hell
Token ID5968
Feature activation+1.55e+00

Pos logit contributions
wit+0.351
 Hud+0.347
earable+0.345
 Ow+0.345
reads+0.342
Neg logit contributions
advertisement-0.414
bourg-0.394
-0.384
quished-0.369
Lex-0.331
Token.
Token ID13
Feature activation-1.80e-02

Pos logit contributions
wit+4.357
 Ow+4.331
 Hud+4.256
hw+4.222
izable+4.159
Neg logit contributions
advertisement-5.089
bourg-4.892
-4.714
quished-4.584
Lex-4.081
Token Maybe
Token ID6674
Feature activation-2.15e-01

Pos logit contributions
advertisement+0.059
bourg+0.056
+0.055
quished+0.052
Lex+0.047
Neg logit contributions
wit-0.052
 Hud-0.052
earable-0.051
 Ow-0.051
hw-0.051
Token I
Token ID314
Feature activation+7.86e-02

Pos logit contributions
advertisement+0.635
bourg+0.598
+0.580
quished+0.546
 ASUS+0.492
Neg logit contributions
wit-0.697
 Hud-0.686
earable-0.685
hw-0.682
 Ow-0.681
Token should
Token ID815
Feature activation-6.18e-01

Pos logit contributions
wit+0.228
 Hud+0.226
 Ow+0.224
earable+0.224
reads+0.222
Neg logit contributions
advertisement-0.258
bourg-0.247
-0.240
quished-0.230
Lex-0.206
Token give
Token ID1577
Feature activation-2.60e-01

Pos logit contributions
advertisement+2.032
bourg+1.925
+1.853
quished+1.748
 desp+1.632
Neg logit contributions
wit-1.786
 Hud-1.760
earable-1.757
 Ow-1.749
reads-1.740
Token:
Token ID25
Feature activation+8.48e-02

Pos logit contributions
advertisement+0.484
bourg+0.458
+0.445
quished+0.425
 Secret+0.375
Neg logit contributions
wit-0.529
 Hud-0.523
 Ow-0.522
earable-0.521
hw-0.518
Token "
Token ID366
Feature activation+1.88e-01

Pos logit contributions
wit+0.294
 Hud+0.292
 Ow+0.290
earable+0.289
hw+0.288
Neg logit contributions
advertisement-0.231
bourg-0.218
-0.211
quished-0.201
Lex-0.174
TokenWhy
Token ID5195
Feature activation-2.31e-01

Pos logit contributions
wit+0.582
 Hud+0.579
 Ow+0.577
earable+0.573
hw+0.570
Neg logit contributions
advertisement-0.583
bourg-0.553
-0.537
quished-0.513
Lex-0.458
Token in
Token ID287
Feature activation+1.07e-01

Pos logit contributions
advertisement+0.797
bourg+0.778
+0.748
quished+0.711
 desp+0.659
Neg logit contributions
wit-0.615
earable-0.614
 Hud-0.608
 Ow-0.604
hw-0.600
Token the
Token ID262
Feature activation+2.23e-01

Pos logit contributions
wit+0.321
 Hud+0.318
earable+0.316
 Ow+0.316
reads+0.314
Neg logit contributions
advertisement-0.338
bourg-0.323
-0.314
quished-0.300
Lex-0.268
Token hell
Token ID5968
Feature activation+1.54e+00

Pos logit contributions
wit+0.620
 Hud+0.613
 Ow+0.611
earable+0.608
hw+0.605
Neg logit contributions
advertisement-0.758
bourg-0.726
-0.706
quished-0.678
Lex-0.611
Token do
Token ID466
Feature activation+1.35e-01

Pos logit contributions
wit+3.972
 Ow+3.913
 Hud+3.844
hw+3.809
reads+3.760
Neg logit contributions
advertisement-5.556
bourg-5.240
-5.177
quished-5.056
ipel-4.534
Token journalists
Token ID9046
Feature activation-2.54e-01

Pos logit contributions
wit+0.446
earable+0.440
 Hud+0.439
 Ow+0.435
hw+0.435
Neg logit contributions
advertisement-0.386
bourg-0.370
-0.357
quished-0.339
 Secret-0.300
Token insist
Token ID16361
Feature activation+2.60e-01

Pos logit contributions
advertisement+0.832
bourg+0.799
+0.767
quished+0.723
 desp+0.677
Neg logit contributions
 Hud-0.725
wit-0.723
earable-0.723
 Ow-0.721
reads-0.714
Token on
Token ID319
Feature activation-1.18e-01

Pos logit contributions
wit+0.756
 Hud+0.750
 Ow+0.748
earable+0.740
hw+0.738
Neg logit contributions
advertisement-0.853
bourg-0.814
-0.791
quished-0.762
Lex-0.679
Token coming
Token ID2406
Feature activation-2.83e-01

Pos logit contributions
advertisement+0.356
bourg+0.337
+0.327
quished+0.310
Lex+0.278
Neg logit contributions
wit-0.373
 Hud-0.370
earable-0.369
 Ow-0.367
hw-0.366
Token
Token ID447
Feature activation-9.95e-02

Pos logit contributions
advertisement+0.072
bourg+0.069
+0.067
quished+0.064
Lex+0.058
Neg logit contributions
wit-0.059
 Hud-0.059
 Ow-0.059
earable-0.058
hw-0.058
Token
Token ID250
Feature activation+1.91e-01

Pos logit contributions
advertisement+0.245
bourg+0.228
+0.222
quished+0.209
Lex+0.177
Neg logit contributions
wit-0.373
 Hud-0.369
 Ow-0.368
earable-0.367
hw-0.365
TokenHonestly
Token ID40817
Feature activation+8.42e-02

Pos logit contributions
wit+0.589
 Hud+0.585
 Ow+0.583
earable+0.578
hw+0.576
Neg logit contributions
advertisement-0.595
bourg-0.565
-0.549
quished-0.525
Lex-0.467
Token,
Token ID11
Feature activation+5.63e-02

Pos logit contributions
wit+0.276
 Hud+0.274
 Ow+0.273
earable+0.271
hw+0.270
Neg logit contributions
advertisement-0.246
bourg-0.233
-0.225
quished-0.216
Lex-0.189
Token the
Token ID262
Feature activation+2.32e-01

Pos logit contributions
wit+0.176
 Hud+0.175
 Ow+0.174
earable+0.173
hw+0.173
Neg logit contributions
advertisement-0.172
bourg-0.164
-0.159
quished-0.152
Lex-0.134
Token fact
Token ID1109
Feature activation+1.49e+00

Pos logit contributions
wit+0.633
 Hud+0.623
 Ow+0.620
earable+0.620
hw+0.619
Neg logit contributions
advertisement-0.808
bourg-0.770
-0.751
quished-0.719
 Secret-0.653
Token that
Token ID326
Feature activation+3.99e-01

Pos logit contributions
 Hud+4.181
wit+4.158
 Ow+4.087
reads+4.018
earable+3.975
Neg logit contributions
advertisement-4.970
bourg-4.800
quished-4.555
-4.500
xi-4.005
Token we
Token ID356
Feature activation+1.08e-01

Pos logit contributions
wit+1.201
 Hud+1.194
 Ow+1.189
earable+1.177
hw+1.172
Neg logit contributions
advertisement-1.267
bourg-1.209
-1.174
quished-1.129
Lex-1.001
Token still
Token ID991
Feature activation+1.18e-01

Pos logit contributions
wit+0.292
 Hud+0.288
 Ow+0.287
reads+0.285
hw+0.284
Neg logit contributions
advertisement-0.374
bourg-0.358
-0.350
quished-0.335
Lex-0.303
Token have
Token ID423
Feature activation+6.05e-01

Pos logit contributions
wit+0.321
 Ow+0.315
 Hud+0.314
earable+0.313
hw+0.313
Neg logit contributions
advertisement-0.412
bourg-0.393
-0.383
quished-0.365
 desp-0.334
Token bank
Token ID3331
Feature activation-5.60e-02

Pos logit contributions
wit+1.755
 Hud+1.748
 Ow+1.741
earable+1.720
hw+1.712
Neg logit contributions
advertisement-1.985
bourg-1.903
-1.846
quished-1.783
Lex-1.585
Token still
Token ID991
Feature activation+7.30e-03

Pos logit contributions
wit+0.075
 Hud+0.075
 Ow+0.074
earable+0.074
hw+0.074
Neg logit contributions
advertisement-0.088
bourg-0.085
-0.082
quished-0.079
 ASUS-0.071
Token wonder
Token ID4240
Feature activation+1.48e-01

Pos logit contributions
wit+0.020
 Hud+0.019
 Ow+0.019
earable+0.019
hw+0.019
Neg logit contributions
advertisement-0.025
bourg-0.024
-0.024
quished-0.023
 desp-0.021
Token why
Token ID1521
Feature activation-2.56e-01

Pos logit contributions
wit+0.431
 Hud+0.428
 Ow+0.427
earable+0.422
hw+0.421
Neg logit contributions
advertisement-0.487
bourg-0.465
-0.452
quished-0.434
Lex-0.388
Token in
Token ID287
Feature activation+1.34e-01

Pos logit contributions
advertisement+0.762
bourg+0.735
+0.700
quished+0.664
 Secret+0.608
Neg logit contributions
wit-0.815
earable-0.807
hw-0.796
reads-0.795
 Hud-0.793
Token the
Token ID262
Feature activation+9.83e-03

Pos logit contributions
wit+0.407
 Hud+0.403
 Ow+0.402
earable+0.400
hw+0.398
Neg logit contributions
advertisement-0.423
bourg-0.403
-0.392
quished-0.374
Lex-0.333
Token hell
Token ID5968
Feature activation+1.48e+00

Pos logit contributions
wit+0.029
 Hud+0.028
 Ow+0.028
hw+0.028
earable+0.028
Neg logit contributions
advertisement-0.032
bourg-0.031
-0.030
quished-0.029
 ASUS-0.026
Token he
Token ID339
Feature activation-7.56e-03

Pos logit contributions
 Ow+4.046
wit+4.033
 Hud+4.018
hw+3.910
earable+3.864
Neg logit contributions
advertisement-5.114
bourg-4.818
-4.712
quished-4.564
ipel-4.081
Token would
Token ID561
Feature activation+2.30e-01

Pos logit contributions
advertisement+0.024
bourg+0.023
+0.023
quished+0.022
 desp+0.020
Neg logit contributions
wit-0.022
 Hud-0.022
 Ow-0.022
earable-0.022
hw-0.022
Token ever
Token ID1683
Feature activation+7.64e-01

Pos logit contributions
wit+0.603
 Hud+0.598
 Ow+0.596
earable+0.590
hw+0.587
Neg logit contributions
advertisement-0.822
bourg-0.789
-0.767
quished-0.740
Lex-0.668
Token paraph
Token ID38624
Feature activation-5.34e-01

Pos logit contributions
wit+1.966
 Hud+1.948
 Ow+1.937
earable+1.911
hw+1.911
Neg logit contributions
advertisement-2.764
bourg-2.644
-2.594
quished-2.522
Lex-2.256
Tokenrase
Token ID22789
Feature activation+3.11e-01

Pos logit contributions
advertisement+1.464
bourg+1.360
+1.277
quished+1.226
antic+1.146
Neg logit contributions
 Hud-1.839
 Ow-1.832
wit-1.796
earable-1.779
hw-1.757
Token
Token ID447
Feature activation+1.58e-01

Pos logit contributions
advertisement+0.048
bourg+0.046
+0.045
quished+0.043
Lex+0.038
Neg logit contributions
wit-0.042
 Hud-0.041
 Ow-0.041
earable-0.041
hw-0.041
Token
Token ID251
Feature activation-1.21e-01

Pos logit contributions
wit+0.583
 Hud+0.580
 Ow+0.578
earable+0.574
hw+0.572
Neg logit contributions
advertisement-0.395
bourg-0.371
-0.358
quished-0.339
Lex-0.289
Token
Token ID198
Feature activation-2.56e-01

Pos logit contributions
advertisement+0.389
bourg+0.368
+0.361
quished+0.342
xi+0.307
Neg logit contributions
wit-0.361
 Hud-0.356
earable-0.354
hw-0.353
 Ow-0.353
Token
Token ID198
Feature activation-2.05e-01

Pos logit contributions
advertisement+0.822
bourg+0.776
+0.753
quished+0.715
Lex+0.650
Neg logit contributions
wit-0.765
 Hud-0.761
 Ow-0.756
earable-0.756
hw-0.748
TokenThe
Token ID464
Feature activation+1.66e-01

Pos logit contributions
advertisement+0.719
bourg+0.673
+0.662
quished+0.628
Lex+0.577
Neg logit contributions
wit-0.557
 Hud-0.556
 Ow-0.554
earable-0.547
hw-0.541
Token fact
Token ID1109
Feature activation+1.48e+00

Pos logit contributions
wit+0.471
 Hud+0.462
 Ow+0.461
hw+0.459
earable+0.458
Neg logit contributions
advertisement-0.557
bourg-0.527
-0.517
quished-0.494
 Secret-0.447
Token that
Token ID326
Feature activation+3.10e-01

Pos logit contributions
 Hud+4.158
wit+4.139
 Ow+4.129
reads+4.093
earable+3.984
Neg logit contributions
advertisement-4.886
bourg-4.746
-4.481
quished-4.426
Lex-3.982
Token a
Token ID257
Feature activation-1.01e-02

Pos logit contributions
wit+0.918
 Hud+0.907
 Ow+0.903
earable+0.896
hw+0.895
Neg logit contributions
advertisement-1.006
bourg-0.957
-0.933
quished-0.893
Lex-0.795
Token device
Token ID3335
Feature activation-1.39e-01

Pos logit contributions
advertisement+0.034
bourg+0.033
+0.032
quished+0.030
xi+0.028
Neg logit contributions
wit-0.028
 Hud-0.028
 Ow-0.028
hw-0.027
reads-0.027
Token like
Token ID588
Feature activation+2.46e-01

Pos logit contributions
advertisement+0.446
bourg+0.422
+0.412
quished+0.390
 ASUS+0.351
Neg logit contributions
wit-0.419
 Hud-0.412
 Ow-0.410
hw-0.408
earable-0.408
Token the
Token ID262
Feature activation-6.84e-06

Pos logit contributions
wit+0.775
 Hud+0.771
 Ow+0.768
earable+0.761
hw+0.758
Neg logit contributions
advertisement-0.749
bourg-0.714
-0.691
quished-0.664
Lex-0.585
Tokenand
Token ID392
Feature activation+1.84e-01

Pos logit contributions
advertisement+0.768
bourg+0.724
+0.701
quished+0.668
Lex+0.613
Neg logit contributions
wit-0.669
 Hud-0.668
 Ow-0.667
earable-0.661
hw-0.653
Token another
Token ID1194
Feature activation+2.55e-01

Pos logit contributions
wit+0.543
 Hud+0.538
 Ow+0.536
earable+0.533
hw+0.531
Neg logit contributions
advertisement-0.596
bourg-0.567
-0.550
quished-0.528
Lex-0.472
Token for
Token ID329
Feature activation+3.53e-01

Pos logit contributions
wit+0.781
 Hud+0.775
 Ow+0.773
earable+0.765
hw+0.763
Neg logit contributions
advertisement-0.794
bourg-0.758
-0.736
quished-0.706
Lex-0.625
Token star
Token ID3491
Feature activation+1.42e-01

Pos logit contributions
wit+1.041
 Hud+1.034
 Ow+1.030
earable+1.019
hw+1.015
Neg logit contributions
advertisement-1.143
bourg-1.094
-1.063
quished-1.022
Lex-0.909
Token Matthew
Token ID9308
Feature activation+1.66e-01

Pos logit contributions
wit+0.410
 Ow+0.407
 Hud+0.406
earable+0.406
hw+0.403
Neg logit contributions
advertisement-0.469
bourg-0.447
-0.436
quished-0.417
Lex-0.375
Token Mc
Token ID1982
Feature activation+1.48e+00

Pos logit contributions
wit+0.321
 Hud+0.317
 Ow+0.316
earable+0.311
hw+0.310
Neg logit contributions
advertisement-0.709
bourg-0.685
-0.670
quished-0.651
Lex-0.598
TokenCon
Token ID3103
Feature activation+2.14e-03

Pos logit contributions
 Hud+4.614
wit+4.600
 Ow+4.484
reads+4.438
hw+4.434
Neg logit contributions
advertisement-4.470
bourg-4.268
-4.121
quished-3.905
 ASUS-3.600
Tokenaug
Token ID7493
Feature activation+6.19e-02

Pos logit contributions
 Hud+0.007
reads+0.007
wit+0.007
 Ow+0.007
earable+0.007
Neg logit contributions
advertisement-0.006
bourg-0.006
-0.006
quished-0.005
antic-0.005
Tokenhey
Token ID20342
Feature activation-3.31e-01

Pos logit contributions
wit+0.114
 Hud+0.113
 Ow+0.111
reads+0.111
earable+0.111
Neg logit contributions
advertisement-0.266
bourg-0.259
-0.252
quished-0.244
xi-0.227
Token's
Token ID338
Feature activation-4.31e-01

Pos logit contributions
advertisement+1.100
bourg+1.050
+1.015
quished+0.976
xi+0.874
Neg logit contributions
wit-0.951
 Hud-0.943
 Ow-0.940
earable-0.934
hw-0.929
Token performance
Token ID2854
Feature activation-3.44e-01

Pos logit contributions
advertisement+1.393
bourg+1.327
+1.291
quished+1.233
 Secret+1.121
Neg logit contributions
wit-1.268
 Ow-1.242
 Hud-1.242
hw-1.238
earable-1.236
Token kids
Token ID3988
Feature activation-2.41e-01

Pos logit contributions
advertisement+1.098
bourg+1.054
+1.017
quished+0.960
 Secret+0.881
Neg logit contributions
wit-1.040
earable-1.034
hw-1.022
 Hud-1.016
 Ow-1.016
Token to
Token ID284
Feature activation-3.29e-01

Pos logit contributions
advertisement+0.696
bourg+0.659
+0.638
quished+0.607
Lex+0.534
Neg logit contributions
wit-0.794
 Hud-0.789
 Ow-0.786
earable-0.785
hw-0.780
Token know
Token ID760
Feature activation-3.12e-01

Pos logit contributions
advertisement+1.200
bourg+1.162
+1.107
quished+1.058
 desp+0.995
Neg logit contributions
wit-0.826
 Hud-0.816
 Ow-0.816
earable-0.816
hw-0.810
Token.
Token ID13
Feature activation-1.48e-01

Pos logit contributions
advertisement+0.957
bourg+0.909
+0.877
quished+0.844
Lex+0.749
Neg logit contributions
wit-0.970
 Hud-0.962
earable-0.958
 Ow-0.958
hw-0.951
Token The
Token ID383
Feature activation+4.86e-02

Pos logit contributions
advertisement+0.441
bourg+0.423
+0.409
quished+0.391
 Secret+0.350
Neg logit contributions
wit-0.469
earable-0.465
hw-0.461
 Hud-0.459
 Ow-0.458
Token fact
Token ID1109
Feature activation+1.47e+00

Pos logit contributions
wit+0.135
earable+0.133
hw+0.132
 Hud+0.132
 Ow+0.132
Neg logit contributions
advertisement-0.165
bourg-0.159
-0.154
quished-0.148
 Secret-0.135
Token that
Token ID326
Feature activation+3.53e-01

Pos logit contributions
 Hud+4.175
wit+4.160
 Ow+4.093
reads+4.082
hw+3.948
Neg logit contributions
advertisement-4.936
bourg-4.653
-4.478
quished-4.391
ipel-3.893
Token it
Token ID340
Feature activation+1.48e-02

Pos logit contributions
wit+1.099
 Hud+1.089
 Ow+1.086
earable+1.079
hw+1.075
Neg logit contributions
advertisement-1.089
bourg-1.037
-1.006
quished-0.964
Lex-0.853
Token treats
Token ID18432
Feature activation-7.16e-01

Pos logit contributions
wit+0.045
earable+0.045
 Hud+0.045
 Ow+0.045
hw+0.044
Neg logit contributions
advertisement-0.046
bourg-0.044
-0.042
quished-0.041
 desp-0.037
Token homosexuality
Token ID19822
Feature activation-1.53e-01

Pos logit contributions
advertisement+2.170
bourg+2.092
+2.013
quished+1.899
 Secret+1.744
Neg logit contributions
wit-2.206
earable-2.200
hw-2.167
 Ow-2.149
reads-2.139
Token as
Token ID355
Feature activation-5.97e-01

Pos logit contributions
advertisement+0.442
bourg+0.420
+0.403
quished+0.386
 desp+0.340
Neg logit contributions
wit-0.505
 Hud-0.500
 Ow-0.499
earable-0.499
hw-0.494
Token from
Token ID422
Feature activation-1.62e-01

Pos logit contributions
advertisement+0.207
bourg+0.196
+0.190
quished+0.181
Lex+0.157
Neg logit contributions
wit-0.259
 Hud-0.258
 Ow-0.257
earable-0.255
hw-0.254
Token those
Token ID883
Feature activation+5.40e-01

Pos logit contributions
advertisement+0.497
bourg+0.470
+0.457
quished+0.432
 desp+0.390
Neg logit contributions
wit-0.507
 Ow-0.501
earable-0.501
 Hud-0.500
hw-0.499
Token behaviors
Token ID14301
Feature activation+2.07e-02

Pos logit contributions
wit+1.499
 Hud+1.489
 Ow+1.482
earable+1.466
hw+1.462
Neg logit contributions
advertisement-1.844
bourg-1.765
-1.718
quished-1.656
Lex-1.486
Token.
Token ID13
Feature activation-2.84e-01

Pos logit contributions
wit+0.064
 Hud+0.063
 Ow+0.063
earable+0.063
hw+0.062
Neg logit contributions
advertisement-0.064
bourg-0.061
-0.060
quished-0.057
Lex-0.050
Token The
Token ID383
Feature activation-4.35e-02

Pos logit contributions
advertisement+0.877
bourg+0.826
+0.813
quished+0.763
 Secret+0.691
Neg logit contributions
wit-0.879
earable-0.869
 Hud-0.860
hw-0.859
 Ow-0.858
Token fact
Token ID1109
Feature activation+1.47e+00

Pos logit contributions
advertisement+0.150
bourg+0.142
+0.139
quished+0.134
 Secret+0.121
Neg logit contributions
wit-0.120
earable-0.118
 Hud-0.118
 Ow-0.117
hw-0.117
Token is
Token ID318
Feature activation+2.22e-01

Pos logit contributions
 Hud+4.205
wit+4.193
reads+4.129
 Ow+4.112
hw+4.001
Neg logit contributions
advertisement-4.838
bourg-4.611
-4.405
quished-4.340
Lex-3.849
Token that
Token ID326
Feature activation-5.27e-02

Pos logit contributions
wit+0.713
 Hud+0.708
 Ow+0.706
earable+0.700
hw+0.698
Neg logit contributions
advertisement-0.660
bourg-0.627
-0.608
quished-0.582
Lex-0.512
Token in
Token ID287
Feature activation-2.29e-01

Pos logit contributions
advertisement+0.167
bourg+0.159
+0.155
quished+0.148
 ASUS+0.132
Neg logit contributions
wit-0.159
earable-0.156
 Hud-0.156
 Ow-0.156
hw-0.155
Token his
Token ID465
Feature activation-1.08e-01

Pos logit contributions
advertisement+0.748
bourg+0.708
+0.697
quished+0.664
 Secret+0.600
Neg logit contributions
wit-0.665
earable-0.659
 Hud-0.656
 Ow-0.652
hw-0.648
Token general
Token ID2276
Feature activation+2.59e-04

Pos logit contributions
advertisement+0.344
bourg+0.327
+0.319
quished+0.304
 Secret+0.273
Neg logit contributions
wit-0.325
 Hud-0.320
 Ow-0.319
earable-0.319
hw-0.317
Token game
Token ID983
Feature activation+1.14e-01

Pos logit contributions
wit+0.195
 Hud+0.194
 Ow+0.193
earable+0.192
hw+0.191
Neg logit contributions
advertisement-0.184
bourg-0.175
-0.170
quished-0.162
Lex-0.143
Token,
Token ID11
Feature activation-2.87e-02

Pos logit contributions
wit+0.370
 Hud+0.367
 Ow+0.366
earable+0.363
hw+0.362
Neg logit contributions
advertisement-0.339
bourg-0.322
-0.312
quished-0.298
Lex-0.262
Token you
Token ID345
Feature activation+5.82e-02

Pos logit contributions
advertisement+0.086
bourg+0.082
+0.079
quished+0.076
 ASUS+0.067
Neg logit contributions
wit-0.091
 Hud-0.090
 Ow-0.090
earable-0.090
hw-0.089
Token learn
Token ID2193
Feature activation+1.24e-01

Pos logit contributions
wit+0.159
 Hud+0.157
 Ow+0.157
reads+0.155
earable+0.155
Neg logit contributions
advertisement-0.201
bourg-0.192
-0.186
quished-0.178
 ASUS-0.162
Token a
Token ID257
Feature activation+2.26e-01

Pos logit contributions
wit+0.365
 Hud+0.361
 Ow+0.359
earable+0.359
hw+0.357
Neg logit contributions
advertisement-0.402
bourg-0.384
-0.372
quished-0.357
 ASUS-0.320
Token hell
Token ID5968
Feature activation+1.47e+00

Pos logit contributions
wit+0.657
 Hud+0.650
 Ow+0.648
earable+0.647
hw+0.646
Neg logit contributions
advertisement-0.742
bourg-0.704
-0.685
quished-0.652
 Secret-0.591
Token of
Token ID286
Feature activation+1.69e-01

Pos logit contributions
wit+4.929
 Ow+4.775
hw+4.697
 Hud+4.696
reads+4.673
Neg logit contributions
advertisement-4.143
bourg-4.023
-3.682
quished-3.546
Lex-3.236
Token a
Token ID257
Feature activation+2.53e-01

Pos logit contributions
wit+0.476
 Hud+0.476
earable+0.473
 Ow+0.471
hw+0.468
Neg logit contributions
advertisement-0.570
bourg-0.541
-0.529
quished-0.509
 Secret-0.455
Token lot
Token ID1256
Feature activation+3.56e-01

Pos logit contributions
wit+0.730
 Hud+0.724
 Ow+0.721
earable+0.719
hw+0.715
Neg logit contributions
advertisement-0.838
bourg-0.798
-0.775
quished-0.743
Lex-0.665
Token doing
Token ID1804
Feature activation+5.13e-01

Pos logit contributions
wit+1.005
 Hud+0.996
 Ow+0.994
hw+0.978
earable+0.978
Neg logit contributions
advertisement-1.199
bourg-1.145
-1.117
quished-1.078
Lex-0.962
Token that
Token ID326
Feature activation+1.79e-01

Pos logit contributions
wit+1.455
 Hud+1.447
 Ow+1.443
earable+1.425
hw+1.421
Neg logit contributions
advertisement-1.713
bourg-1.643
-1.598
quished-1.541
Lex-1.376
Token it
Token ID340
Feature activation-1.98e-01

Pos logit contributions
advertisement+0.482
bourg+0.458
+0.445
quished+0.426
 Secret+0.381
Neg logit contributions
wit-0.482
earable-0.474
 Ow-0.472
 Hud-0.471
hw-0.468
Token.
Token ID13
Feature activation+3.85e-02

Pos logit contributions
advertisement+0.600
bourg+0.570
+0.552
quished+0.530
Lex+0.470
Neg logit contributions
wit-0.624
 Hud-0.618
 Ow-0.617
earable-0.616
hw-0.609
Token
Token ID198
Feature activation+2.26e-01

Pos logit contributions
wit+0.122
earable+0.120
 Hud+0.120
 Ow+0.119
hw+0.119
Neg logit contributions
advertisement-0.116
bourg-0.111
-0.107
quished-0.103
Lex-0.091
Token
Token ID198
Feature activation+2.71e-01

Pos logit contributions
wit+0.644
 Hud+0.637
 Ow+0.636
earable+0.627
hw+0.627
Neg logit contributions
advertisement-0.759
bourg-0.726
-0.708
quished-0.685
Lex-0.606
TokenThe
Token ID464
Feature activation+8.57e-02

Pos logit contributions
wit+0.841
 Hud+0.836
 Ow+0.833
earable+0.827
hw+0.823
Neg logit contributions
advertisement-0.835
bourg-0.794
-0.770
quished-0.738
Lex-0.655
Token fact
Token ID1109
Feature activation+1.46e+00

Pos logit contributions
wit+0.239
earable+0.235
 Hud+0.234
 Ow+0.234
hw+0.233
Neg logit contributions
advertisement-0.291
bourg-0.278
-0.270
quished-0.261
 Secret-0.235
Token that
Token ID326
Feature activation+4.40e-01

Pos logit contributions
 Hud+4.177
wit+4.132
 Ow+4.087
reads+4.084
hw+3.970
Neg logit contributions
advertisement-4.889
bourg-4.619
-4.454
quished-4.323
Lex-3.866
Token Que
Token ID4670
Feature activation-1.20e-01

Pos logit contributions
wit+1.352
 Hud+1.340
 Ow+1.336
earable+1.326
hw+1.321
Neg logit contributions
advertisement-1.374
bourg-1.309
-1.270
quished-1.219
Lex-1.080
Tokener
Token ID263
Feature activation-3.22e-01

Pos logit contributions
advertisement+0.346
bourg+0.330
+0.318
quished+0.306
Lex+0.266
Neg logit contributions
wit-0.397
 Hud-0.394
 Ow-0.393
earable-0.391
hw-0.388
Token men
Token ID1450
Feature activation-1.04e-02

Pos logit contributions
advertisement+1.134
bourg+1.093
+1.063
quished+1.027
 Secret+0.928
Neg logit contributions
wit-0.847
earable-0.838
 Hud-0.831
 Ow-0.831
hw-0.826
Token would
Token ID561
Feature activation+5.02e-02

Pos logit contributions
advertisement+0.036
bourg+0.034
+0.033
quished+0.032
 ASUS+0.029
Neg logit contributions
wit-0.029
 Ow-0.029
 Hud-0.029
earable-0.028
hw-0.028
Token in
Token ID287
Feature activation-4.26e-01

Pos logit contributions
advertisement+0.886
bourg+0.840
+0.816
quished+0.775
Lex+0.686
Neg logit contributions
wit-0.933
 Hud-0.926
 Ow-0.922
earable-0.921
hw-0.914
Token the
Token ID262
Feature activation+8.50e-02

Pos logit contributions
advertisement+1.445
bourg+1.372
+1.335
quished+1.273
 Secret+1.166
Neg logit contributions
wit-1.190
 Hud-1.174
earable-1.172
hw-1.169
reads-1.162
Token first
Token ID717
Feature activation+6.90e-01

Pos logit contributions
wit+0.272
 Hud+0.269
 Ow+0.268
earable+0.267
hw+0.266
Neg logit contributions
advertisement-0.255
bourg-0.242
-0.235
quished-0.223
 ASUS-0.198
Token quarter
Token ID3860
Feature activation-6.85e-01

Pos logit contributions
wit+2.031
 Ow+2.028
 Hud+2.020
earable+1.979
reads+1.977
Neg logit contributions
advertisement-2.226
bourg-2.134
-2.077
quished-2.021
Lex-1.790
Token to
Token ID284
Feature activation-1.65e-01

Pos logit contributions
advertisement+1.829
bourg+1.712
+1.659
quished+1.565
 ASUS+1.375
Neg logit contributions
wit-2.396
 Hud-2.388
earable-2.382
 Ow-2.358
hw-2.358
Token lobby
Token ID10866
Feature activation+1.45e+00

Pos logit contributions
advertisement+0.615
bourg+0.594
+0.577
quished+0.551
 ASUS+0.511
Neg logit contributions
wit-0.402
 Hud-0.400
reads-0.396
earable-0.396
 Ow-0.395
Token on
Token ID319
Feature activation+7.43e-01

Pos logit contributions
 Ow+4.272
 Hud+4.259
wit+4.250
hw+4.141
earable+4.125
Neg logit contributions
advertisement-4.584
-4.349
bourg-4.342
quished-4.170
ipel-3.671
Token domestic
Token ID5928
Feature activation+1.88e-01

Pos logit contributions
wit+1.755
 Hud+1.741
 Ow+1.737
earable+1.705
hw+1.702
Neg logit contributions
advertisement-2.839
bourg-2.730
-2.672
quished-2.594
Lex-2.344
Token spying
Token ID23649
Feature activation-2.51e-01

Pos logit contributions
wit+0.545
 Hud+0.540
 Ow+0.538
earable+0.534
hw+0.532
Neg logit contributions
advertisement-0.617
bourg-0.589
-0.573
quished-0.550
Lex-0.492
Token legislation
Token ID5520
Feature activation-3.81e-02

Pos logit contributions
advertisement+0.865
bourg+0.826
+0.801
quished+0.764
 ASUS+0.700
Neg logit contributions
wit-0.689
 Hud-0.680
earable-0.678
 Ow-0.677
hw-0.671
Token and
Token ID290
Feature activation+2.92e-02

Pos logit contributions
advertisement+0.115
bourg+0.109
+0.106
quished+0.101
Lex+0.090
Neg logit contributions
wit-0.121
 Hud-0.120
 Ow-0.119
earable-0.119
hw-0.118
Token all
Token ID477
Feature activation-4.99e-02

Pos logit contributions
advertisement+1.231
bourg+1.165
+1.141
quished+1.088
 ASUS+0.987
Neg logit contributions
wit-1.113
earable-1.092
 Hud-1.089
hw-1.088
 Ow-1.083
Token.
Token ID13
Feature activation+2.94e-01

Pos logit contributions
advertisement+0.150
bourg+0.143
+0.139
quished+0.133
Lex+0.117
Neg logit contributions
wit-0.158
 Hud-0.156
 Ow-0.156
earable-0.156
reads-0.155
Token
Token ID198
Feature activation+2.09e-01

Pos logit contributions
wit+0.868
 Hud+0.859
 Ow+0.855
earable+0.852
hw+0.848
Neg logit contributions
advertisement-0.950
bourg-0.905
-0.881
quished-0.846
Lex-0.755
Token
Token ID198
Feature activation+1.70e-01

Pos logit contributions
wit+0.597
 Hud+0.592
 Ow+0.590
earable+0.583
hw+0.582
Neg logit contributions
advertisement-0.694
bourg-0.664
-0.647
quished-0.624
Lex-0.555
Token(
Token ID7
Feature activation-4.40e-01

Pos logit contributions
wit+0.549
 Hud+0.548
 Ow+0.546
earable+0.541
hw+0.537
Neg logit contributions
advertisement-0.512
bourg-0.477
-0.461
quished-0.442
Lex-0.397
Token UPDATE
Token ID35717
Feature activation-1.80e+00

Pos logit contributions
advertisement+1.379
bourg+1.268
+1.247
quished+1.193
Lex+1.092
Neg logit contributions
wit-1.352
earable-1.337
 Hud-1.331
 Ow-1.329
hw-1.309
Token:
Token ID25
Feature activation-4.03e-02

Pos logit contributions
advertisement+4.160
bourg+3.795
+3.753
quished+3.491
 Secret+3.147
Neg logit contributions
wit-6.693
earable-6.633
izable-6.500
hw-6.499
rower-6.470
Token Sherman
Token ID23198
Feature activation+2.75e-01

Pos logit contributions
advertisement+0.127
bourg+0.121
+0.118
quished+0.113
 ASUS+0.102
Neg logit contributions
wit-0.122
earable-0.120
 Hud-0.120
hw-0.119
 Ow-0.119
Token suffered
Token ID6989
Feature activation-4.79e-01

Pos logit contributions
wit+0.820
 Hud+0.814
 Ow+0.811
earable+0.803
hw+0.801
Neg logit contributions
advertisement-0.886
bourg-0.845
-0.821
quished-0.789
Lex-0.702
Token a
Token ID257
Feature activation-3.00e-01

Pos logit contributions
advertisement+1.506
bourg+1.447
+1.386
quished+1.322
 ASUS+1.222
Neg logit contributions
wit-1.438
earable-1.413
hw-1.409
reads-1.405
 Hud-1.391
Token rupt
Token ID31580
Feature activation-4.34e-01

Pos logit contributions
advertisement+0.988
bourg+0.937
+0.923
quished+0.873
 Secret+0.804
Neg logit contributions
wit-0.861
earable-0.842
hw-0.841
reads-0.839
 Hud-0.837
Token
Token ID198
Feature activation-9.59e-02

Pos logit contributions
wit+0.026
 Hud+0.026
earable+0.026
 Ow+0.026
hw+0.025
Neg logit contributions
advertisement-0.027
bourg-0.026
-0.025
quished-0.024
Lex-0.021
Token
Token ID198
Feature activation-1.93e-02

Pos logit contributions
advertisement+0.313
bourg+0.297
+0.288
quished+0.276
Lex+0.248
Neg logit contributions
wit-0.281
 Hud-0.279
 Ow-0.278
earable-0.277
hw-0.275
TokenFor
Token ID1890
Feature activation+3.52e-01

Pos logit contributions
advertisement+0.059
bourg+0.055
+0.053
quished+0.051
Lex+0.046
Neg logit contributions
wit-0.061
 Ow-0.061
 Hud-0.061
earable-0.061
hw-0.060
Token the
Token ID262
Feature activation-4.71e-01

Pos logit contributions
wit+1.027
 Hud+1.017
 Ow+1.014
earable+1.007
hw+1.002
Neg logit contributions
advertisement-1.153
bourg-1.101
-1.069
quished-1.028
Lex-0.917
Token second
Token ID1218
Feature activation-2.93e-01

Pos logit contributions
advertisement+1.378
bourg+1.284
+1.247
quished+1.198
 Secret+1.076
Neg logit contributions
wit-1.533
earable-1.512
hw-1.507
 Hud-1.502
reads-1.502
Token straight
Token ID3892
Feature activation-1.79e+00

Pos logit contributions
advertisement+0.893
bourg+0.845
+0.820
quished+0.782
 ASUS+0.696
Neg logit contributions
wit-0.926
 Hud-0.916
 Ow-0.911
earable-0.908
hw-0.906
Token year
Token ID614
Feature activation-7.97e-01

Pos logit contributions
advertisement+4.743
bourg+4.253
+4.198
quished+3.831
 Secret+3.778
Neg logit contributions
wit-6.035
hw-5.913
earable-5.906
eele-5.885
ullah-5.876
Token,
Token ID11
Feature activation-3.28e-01

Pos logit contributions
advertisement+2.161
bourg+2.042
+1.965
quished+1.889
 ASUS+1.670
Neg logit contributions
wit-2.722
earable-2.694
hw-2.683
 Hud-2.672
 Ow-2.670
Token Breaking
Token ID24942
Feature activation-5.23e-01

Pos logit contributions
advertisement+1.045
bourg+0.991
+0.948
quished+0.922
 Secret+0.835
Neg logit contributions
wit-0.985
earable-0.968
hw-0.962
 Ow-0.958
reads-0.955
Token Bad
Token ID7772
Feature activation+2.64e-01

Pos logit contributions
advertisement+2.118
bourg+2.074
+1.976
quished+1.897
 Secret+1.790
Neg logit contributions
earable-1.101
wit-1.090
hw-1.090
izable-1.072
reads-1.066
Token
Token ID960
Feature activation+4.33e-01

Pos logit contributions
wit+0.748
 Hud+0.741
 Ow+0.738
earable+0.732
hw+0.729
Neg logit contributions
advertisement-0.885
bourg-0.846
-0.822
quished-0.792
Lex-0.709
Token.
Token ID13
Feature activation-2.31e-01

Pos logit contributions
advertisement+0.722
bourg+0.686
+0.661
quished+0.638
Lex+0.560
Neg logit contributions
wit-0.772
 Hud-0.769
 Ow-0.766
earable-0.759
hw-0.757
Token PT
Token ID19310
Feature activation-6.26e-01

Pos logit contributions
advertisement+0.588
bourg+0.557
+0.526
quished+0.510
Lex+0.440
Neg logit contributions
wit-0.841
 Hud-0.838
 Ow-0.832
earable-0.832
hw-0.829
Token.
Token ID13
Feature activation-3.36e-01

Pos logit contributions
advertisement+1.880
bourg+1.790
+1.698
quished+1.639
 ASUS+1.468
Neg logit contributions
 Hud-1.985
wit-1.980
 Ow-1.956
hw-1.956
earable-1.956
Token
Token ID198
Feature activation+1.22e-01

Pos logit contributions
advertisement+1.031
bourg+0.972
+0.931
quished+0.894
 ASUS+0.803
Neg logit contributions
wit-1.054
 Hud-1.050
hw-1.045
 Ow-1.044
earable-1.041
Token
Token ID198
Feature activation+3.37e-02

Pos logit contributions
wit+0.349
 Hud+0.345
 Ow+0.344
earable+0.341
hw+0.340
Neg logit contributions
advertisement-0.404
bourg-0.387
-0.377
quished-0.363
Lex-0.323
TokenUpdate
Token ID10260
Feature activation-1.66e+00

Pos logit contributions
wit+0.104
 Hud+0.104
 Ow+0.104
earable+0.103
hw+0.102
Neg logit contributions
advertisement-0.105
bourg-0.099
-0.096
quished-0.092
Lex-0.082
Token,
Token ID11
Feature activation-3.33e-01

Pos logit contributions
advertisement+4.103
bourg+3.832
+3.706
quished+3.587
 Secret+3.202
Neg logit contributions
wit-5.916
earable-5.834
 Hud-5.775
 Ow-5.764
izable-5.731
Token 11
Token ID1367
Feature activation-1.31e-01

Pos logit contributions
advertisement+0.983
bourg+0.933
+0.905
quished+0.860
 Secret+0.767
Neg logit contributions
wit-1.081
earable-1.065
 Hud-1.061
 Ow-1.054
hw-1.052
Token:
Token ID25
Feature activation-2.63e-01

Pos logit contributions
advertisement+0.389
bourg+0.368
+0.357
quished+0.342
Lex+0.301
Neg logit contributions
wit-0.425
 Hud-0.421
 Ow-0.419
earable-0.417
hw-0.415
Token15
Token ID1314
Feature activation-1.27e-01

Pos logit contributions
advertisement+0.885
bourg+0.835
+0.814
quished+0.759
Lex+0.708
Neg logit contributions
wit-0.748
earable-0.741
 Hud-0.732
 Ow-0.729
hw-0.729
Token a
Token ID257
Feature activation-4.71e-01

Pos logit contributions
advertisement+0.445
bourg+0.427
+0.416
quished+0.401
Lex+0.362
Neg logit contributions
wit-0.338
 Hud-0.334
 Ow-0.332
earable-0.331
hw-0.329
Token the
Token ID262
Feature activation+3.28e-02

Pos logit contributions
wit+0.909
 Hud+0.903
 Ow+0.900
earable+0.890
hw+0.888
Neg logit contributions
advertisement-1.028
bourg-0.982
-0.955
quished-0.918
Lex-0.818
Token crazy
Token ID7165
Feature activation+3.20e-01

Pos logit contributions
wit+0.099
 Hud+0.098
 Ow+0.098
earable+0.097
hw+0.097
Neg logit contributions
advertisement-0.104
bourg-0.099
-0.097
quished-0.093
Lex-0.082
Token Finnish
Token ID26838
Feature activation+1.76e-01

Pos logit contributions
wit+0.796
 Hud+0.791
 Ow+0.789
hw+0.775
earable+0.767
Neg logit contributions
advertisement-1.185
bourg-1.145
-1.111
quished-1.073
Lex-0.967
Token crowd
Token ID4315
Feature activation+2.09e-02

Pos logit contributions
wit+0.425
 Hud+0.422
 Ow+0.421
hw+0.414
earable+0.414
Neg logit contributions
advertisement-0.666
bourg-0.642
-0.626
quished-0.605
Lex-0.548
Token at
Token ID379
Feature activation-4.67e-02

Pos logit contributions
wit+0.065
 Hud+0.065
 Ow+0.065
earable+0.064
hw+0.064
Neg logit contributions
advertisement-0.064
bourg-0.061
-0.059
quished-0.057
Lex-0.050
Token ASUS
Token ID38100
Feature activation-1.65e+00

Pos logit contributions
advertisement+0.136
bourg+0.130
+0.125
quished+0.120
Lex+0.105
Neg logit contributions
wit-0.153
 Hud-0.152
 Ow-0.152
hw-0.150
earable-0.150
Token R
Token ID371
Feature activation+3.39e-02

Pos logit contributions
advertisement+4.676
bourg+4.591
+4.280
quished+4.037
 ASUS+3.819
Neg logit contributions
reads-5.370
wit-5.321
enance-5.265
earable-5.259
izable-5.201
TokenOG
Token ID7730
Feature activation-4.57e-01

Pos logit contributions
wit+0.112
 Hud+0.112
 Ow+0.111
earable+0.110
hw+0.110
Neg logit contributions
advertisement-0.098
bourg-0.093
-0.090
quished-0.086
Lex-0.075
Token just
Token ID655
Feature activation-6.51e-03

Pos logit contributions
advertisement+1.382
bourg+1.318
+1.276
quished+1.222
Lex+1.102
Neg logit contributions
wit-1.431
 Hud-1.415
 Ow-1.413
reads-1.413
earable-1.407
Token over
Token ID625
Feature activation+7.57e-03

Pos logit contributions
advertisement+0.021
bourg+0.020
+0.019
quished+0.018
Lex+0.016
Neg logit contributions
wit-0.020
 Hud-0.019
 Ow-0.019
earable-0.019
hw-0.019
Token five
Token ID1936
Feature activation+1.77e-02

Pos logit contributions
wit+0.021
 Hud+0.021
 Ow+0.021
earable+0.021
hw+0.020
Neg logit contributions
advertisement-0.026
bourg-0.025
-0.024
quished-0.023
 ASUS-0.021
Token $
Token ID720
Feature activation+9.60e-02

Pos logit contributions
advertisement+0.487
bourg+0.465
+0.458
quished+0.439
 ASUS+0.404
Neg logit contributions
wit-0.304
earable-0.300
 Hud-0.299
 Ow-0.296
hw-0.294
Tokens
Token ID82
Feature activation-5.57e-02

Pos logit contributions
wit+0.272
 Hud+0.271
 Ow+0.270
earable+0.268
hw+0.264
Neg logit contributions
advertisement-0.323
bourg-0.308
-0.300
quished-0.287
Lex-0.258
TokenFormat
Token ID26227
Feature activation+2.70e-01

Pos logit contributions
advertisement+0.203
bourg+0.195
+0.190
quished+0.183
xi+0.167
Neg logit contributions
wit-0.142
 Ow-0.141
 Hud-0.141
earable-0.139
hw-0.137
Token)
Token ID8
Feature activation-3.92e-01

Pos logit contributions
wit+0.749
 Hud+0.743
 Ow+0.740
earable+0.733
hw+0.730
Neg logit contributions
advertisement-0.920
bourg-0.880
-0.856
quished-0.824
Lex-0.740
Token }
Token ID1782
Feature activation-6.69e-01

Pos logit contributions
advertisement+1.078
bourg+1.013
+0.983
quished+0.937
 Secret+0.833
Neg logit contributions
wit-1.335
earable-1.330
 Hud-1.313
hw-1.304
reads-1.299
Token else
Token ID2073
Feature activation-1.63e+00

Pos logit contributions
advertisement+1.870
bourg+1.764
+1.702
quished+1.621
 ASUS+1.465
Neg logit contributions
earable-2.246
wit-2.243
 Hud-2.185
hw-2.175
reads-2.172
Token {
Token ID1391
Feature activation+2.19e-01

Pos logit contributions
advertisement+4.174
bourg+3.800
+3.793
quished+3.740
 Secret+3.250
Neg logit contributions
earable-5.631
wit-5.512
 Hud-5.486
reads-5.473
hw-5.469
Token $
Token ID720
Feature activation-9.10e-02

Pos logit contributions
wit+0.656
 Hud+0.650
 Ow+0.648
earable+0.643
hw+0.641
Neg logit contributions
advertisement-0.700
bourg-0.667
-0.648
quished-0.622
Lex-0.554
Tokenfont
Token ID10331
Feature activation-6.94e-02

Pos logit contributions
advertisement+0.263
bourg+0.249
+0.241
quished+0.230
Lex+0.202
Neg logit contributions
wit-0.300
 Hud-0.299
 Ow-0.298
earable-0.295
reads-0.292
Token2
Token ID17
Feature activation-5.94e-01

Pos logit contributions
advertisement+0.218
bourg+0.207
+0.202
quished+0.193
Lex+0.172
Neg logit contributions
wit-0.212
 Hud-0.210
 Ow-0.210
earable-0.208
hw-0.207
Token =
Token ID796
Feature activation+4.18e-02

Pos logit contributions
advertisement+1.305
bourg+1.189
+1.147
quished+1.050
xi+0.904
Neg logit contributions
 Hud-2.370
wit-2.365
earable-2.360
 Ow-2.350
hw-2.328
Token:
Token ID25
Feature activation-1.70e-04

Pos logit contributions
advertisement+0.968
bourg+0.919
+0.884
quished+0.855
xi+0.749
Neg logit contributions
wit-1.028
 Hud-1.018
earable-1.018
 Ow-1.016
hw-1.007
Token companies
Token ID2706
Feature activation-1.02e-01

Pos logit contributions
advertisement+0.001
bourg+0.000
+0.000
quished+0.000
Lex+0.000
Neg logit contributions
wit-0.001
 Ow-0.001
earable-0.001
 Hud-0.001
hw-0.001
Token such
Token ID884
Feature activation-1.58e-02

Pos logit contributions
advertisement+0.330
bourg+0.316
+0.304
quished+0.292
 desp+0.263
Neg logit contributions
wit-0.301
earable-0.297
reads-0.296
 Ow-0.296
 Hud-0.296
Token as
Token ID355
Feature activation+7.35e-02

Pos logit contributions
advertisement+0.045
bourg+0.043
+0.041
quished+0.040
Lex+0.035
Neg logit contributions
wit-0.053
 Hud-0.052
 Ow-0.052
earable-0.052
hw-0.051
Token Con
Token ID1482
Feature activation+3.84e-01

Pos logit contributions
wit+0.259
 Hud+0.258
 Ow+0.257
earable+0.255
hw+0.254
Neg logit contributions
advertisement-0.196
bourg-0.185
-0.179
quished-0.170
Lex-0.147
Tokenoco
Token ID25634
Feature activation-1.63e+00

Pos logit contributions
wit+1.213
 Hud+1.202
 Ow+1.191
hw+1.178
reads+1.174
Neg logit contributions
advertisement-1.171
bourg-1.115
-1.087
quished-1.033
 ASUS-0.912
TokenPhill
Token ID41970
Feature activation-1.15e+00

Pos logit contributions
advertisement+4.145
bourg+3.900
+3.815
quished+3.544
 desp+3.249
Neg logit contributions
reads-5.728
earable-5.711
hw-5.631
enance-5.564
eele-5.506
Tokenips
Token ID2419
Feature activation-5.74e-01

Pos logit contributions
advertisement+3.487
bourg+3.342
+3.168
quished+3.029
xi+2.766
Neg logit contributions
 Ow-3.524
reads-3.489
 Hud-3.489
hw-3.468
earable-3.460
Token and
Token ID290
Feature activation-2.38e-01

Pos logit contributions
advertisement+1.717
bourg+1.664
+1.579
quished+1.508
 Secret+1.349
Neg logit contributions
reads-1.809
wit-1.804
earable-1.792
hw-1.789
 Hud-1.773
Token Marathon
Token ID24828
Feature activation-4.28e-02

Pos logit contributions
advertisement+0.660
bourg+0.628
+0.605
quished+0.581
 ASUS+0.510
Neg logit contributions
wit-0.809
earable-0.801
hw-0.797
reads-0.794
 Ow-0.794
Token have
Token ID423
Feature activation+8.78e-02

Pos logit contributions
advertisement+0.121
bourg+0.114
+0.111
quished+0.106
Lex+0.092
Neg logit contributions
wit-0.144
earable-0.143
 Hud-0.142
reads-0.141
 Ow-0.141
Token web
Token ID3992
Feature activation+2.64e-01

Pos logit contributions
advertisement+0.603
bourg+0.573
+0.552
quished+0.530
Lex+0.471
Neg logit contributions
wit-0.599
 Hud-0.592
 Ow-0.590
earable-0.588
hw-0.586
Token site
Token ID2524
Feature activation-2.21e-01

Pos logit contributions
wit+0.694
 Hud+0.687
 Ow+0.684
earable+0.676
hw+0.675
Neg logit contributions
advertisement-0.940
bourg-0.901
-0.880
quished-0.849
Lex-0.762
Token.
Token ID13
Feature activation-1.45e-01

Pos logit contributions
advertisement+0.664
bourg+0.628
+0.606
quished+0.580
Lex+0.514
Neg logit contributions
wit-0.705
 Hud-0.700
 Ow-0.696
earable-0.694
hw-0.691
Token
Token ID198
Feature activation-6.25e-02

Pos logit contributions
advertisement+0.439
bourg+0.417
+0.403
quished+0.387
 ASUS+0.343
Neg logit contributions
wit-0.456
 Hud-0.450
earable-0.450
 Ow-0.449
hw-0.448
Token
Token ID198
Feature activation-1.14e-01

Pos logit contributions
advertisement+0.203
bourg+0.193
+0.187
quished+0.179
Lex+0.161
Neg logit contributions
wit-0.184
 Hud-0.183
 Ow-0.182
earable-0.181
hw-0.180
TokenUpdate
Token ID10260
Feature activation-1.61e+00

Pos logit contributions
advertisement+0.339
bourg+0.319
+0.307
quished+0.295
Lex+0.262
Neg logit contributions
wit-0.371
 Hud-0.371
 Ow-0.370
earable-0.366
hw-0.364
Token:
Token ID25
Feature activation-2.97e-01

Pos logit contributions
advertisement+4.007
bourg+3.765
+3.608
quished+3.415
 Secret+3.080
Neg logit contributions
wit-5.715
earable-5.680
 Ow-5.576
izable-5.563
 Hud-5.561
Token Bungie
Token ID47593
Feature activation-3.89e-01

Pos logit contributions
advertisement+0.650
bourg+0.606
+0.574
quished+0.540
 ASUS+0.455
Neg logit contributions
wit-1.190
earable-1.178
 Hud-1.177
 Ow-1.171
hw-1.170
Token responds
Token ID20067
Feature activation-5.72e-01

Pos logit contributions
advertisement+1.210
bourg+1.163
+1.105
quished+1.068
 Secret+0.977
Neg logit contributions
wit-1.187
earable-1.167
hw-1.164
 Hud-1.163
reads-1.156
Token to
Token ID284
Feature activation-1.32e-01

Pos logit contributions
advertisement+1.500
bourg+1.402
+1.355
quished+1.292
Lex+1.117
Neg logit contributions
wit-2.038
 Hud-2.022
earable-2.022
 Ow-2.013
hw-1.995
Token the
Token ID262
Feature activation-9.35e-02

Pos logit contributions
advertisement+0.420
bourg+0.396
+0.384
quished+0.368
 Secret+0.332
Neg logit contributions
wit-0.397
earable-0.391
 Ow-0.390
 Hud-0.389
hw-0.389
Token intro
Token ID18951
Feature activation+4.90e-01

Pos logit contributions
wit+0.109
 Hud+0.108
 Ow+0.107
hw+0.106
reads+0.106
Neg logit contributions
advertisement-0.131
bourg-0.124
-0.120
quished-0.117
 ASUS-0.105
Tokenvert
Token ID1851
Feature activation+6.24e-01

Pos logit contributions
wit+1.273
 Hud+1.259
 Ow+1.255
earable+1.241
hw+1.237
Neg logit contributions
advertisement-1.763
bourg-1.692
-1.646
quished-1.590
Lex-1.436
Token online
Token ID2691
Feature activation-1.08e-01

Pos logit contributions
wit+2.070
 Ow+2.044
 Hud+2.041
earable+2.022
hw+2.009
Neg logit contributions
advertisement-1.779
bourg-1.715
-1.647
quished-1.588
Lex-1.377
Token?
Token ID30
Feature activation-8.88e-02

Pos logit contributions
advertisement+0.310
bourg+0.293
+0.284
quished+0.271
Lex+0.238
Neg logit contributions
wit-0.358
 Hud-0.356
 Ow-0.354
earable-0.352
hw-0.351
Token<|endoftext|>
Token ID50256
Feature activation+1.08e-01

Pos logit contributions
advertisement+0.297
bourg+0.283
+0.274
quished+0.263
 ASUS+0.238
Neg logit contributions
wit-0.253
 Hud-0.250
 Ow-0.249
earable-0.248
hw-0.247
TokenUpdate
Token ID10260
Feature activation-1.55e+00

Pos logit contributions
wit+0.336
 Hud+0.335
 Ow+0.334
earable+0.330
hw+0.329
Neg logit contributions
advertisement-0.334
bourg-0.316
-0.307
quished-0.294
Lex-0.261
Token:
Token ID25
Feature activation+2.74e-02

Pos logit contributions
advertisement+4.083
bourg+3.844
+3.691
quished+3.491
 Secret+3.196
Neg logit contributions
wit-5.272
earable-5.256
 Hud-5.147
izable-5.131
 Ow-5.124
Token You
Token ID921
Feature activation-3.85e-01

Pos logit contributions
wit+0.085
 Hud+0.084
earable+0.084
 Ow+0.084
hw+0.083
Neg logit contributions
advertisement-0.085
bourg-0.081
-0.078
quished-0.075
 ASUS-0.067
Token can
Token ID460
Feature activation+8.14e-02

Pos logit contributions
advertisement+1.400
bourg+1.361
+1.319
quished+1.268
 ASUS+1.178
Neg logit contributions
wit-0.948
earable-0.938
 Hud-0.933
reads-0.929
hw-0.923
Token find
Token ID1064
Feature activation+4.37e-01

Pos logit contributions
wit+0.221
 Hud+0.218
 Ow+0.217
earable+0.216
hw+0.215
Neg logit contributions
advertisement-0.283
bourg-0.272
-0.263
quished-0.252
 ASUS-0.231
Token a
Token ID257
Feature activation-4.25e-01

Pos logit contributions
wit+1.261
 Hud+1.258
 Ow+1.254
earable+1.234
hw+1.232
Neg logit contributions
advertisement-1.440
bourg-1.379
-1.341
quished-1.290
Lex-1.148
Token:
Token ID25
Feature activation+3.20e-01

Pos logit contributions
wit+0.829
 Hud+0.824
 Ow+0.820
hw+0.810
reads+0.809
Neg logit contributions
advertisement-0.712
bourg-0.676
-0.656
quished-0.627
Lex-0.545
Token Jackson
Token ID6612
Feature activation-5.06e-01

Pos logit contributions
wit+1.015
 Hud+1.009
 Ow+1.006
earable+0.994
hw+0.992
Neg logit contributions
advertisement-0.964
bourg-0.918
-0.890
quished-0.853
Lex-0.750
Token gay
Token ID5650
Feature activation-4.70e-01

Pos logit contributions
advertisement+1.628
bourg+1.567
+1.512
quished+1.459
 Secret+1.306
Neg logit contributions
wit-1.480
earable-1.463
 Hud-1.455
hw-1.451
 Ow-1.435
Token rights
Token ID2489
Feature activation-2.43e-01

Pos logit contributions
advertisement+1.478
bourg+1.414
+1.372
quished+1.318
Lex+1.173
Neg logit contributions
wit-1.420
 Hud-1.410
 Ow-1.403
earable-1.399
hw-1.393
Token leader
Token ID3554
Feature activation-2.58e-01

Pos logit contributions
advertisement+0.856
bourg+0.818
+0.788
quished+0.767
Lex+0.693
Neg logit contributions
wit-0.649
 Hud-0.640
earable-0.636
 Ow-0.636
hw-0.632
Token accused
Token ID5371
Feature activation-1.51e+00

Pos logit contributions
advertisement+0.873
bourg+0.834
+0.806
quished+0.778
 Secret+0.705
Neg logit contributions
wit-0.722
earable-0.711
hw-0.707
 Hud-0.706
 Ow-0.701
Token of
Token ID286
Feature activation-4.15e-01

Pos logit contributions
advertisement+3.780
bourg+3.467
+3.239
quished+3.183
 Secret+2.810
Neg logit contributions
wit-5.406
earable-5.402
hw-5.374
reads-5.357
 Hud-5.317
Token burning
Token ID9482
Feature activation-2.93e-01

Pos logit contributions
advertisement+1.288
bourg+1.220
+1.178
quished+1.125
 Secret+1.018
Neg logit contributions
wit-1.282
earable-1.267
 Hud-1.260
hw-1.256
reads-1.253
Token down
Token ID866
Feature activation-7.16e-01

Pos logit contributions
advertisement+0.903
bourg+0.860
+0.830
quished+0.798
 Secret+0.710
Neg logit contributions
wit-0.907
 Hud-0.892
earable-0.891
hw-0.888
 Ow-0.887
Token own
Token ID898
Feature activation-2.38e-01

Pos logit contributions
advertisement+2.186
bourg+2.086
+1.986
quished+1.927
 Secret+1.771
Neg logit contributions
wit-2.202
earable-2.179
hw-2.160
reads-2.150
 Hud-2.143
Token home
Token ID1363
Feature activation-1.51e-01

Pos logit contributions
advertisement+0.727
bourg+0.693
+0.668
quished+0.643
Lex+0.572
Neg logit contributions
wit-0.747
 Hud-0.738
 Ow-0.735
earable-0.734
hw-0.730
Tokenpath
Token ID6978
Feature activation+3.94e-01

Pos logit contributions
wit+0.416
 Hud+0.413
 Ow+0.413
earable+0.407
hw+0.405
Neg logit contributions
advertisement-0.548
bourg-0.524
-0.509
quished-0.490
Lex-0.442
Token)"
Token ID16725
Feature activation-2.16e-01

Pos logit contributions
wit+0.968
 Hud+0.961
 Ow+0.956
earable+0.943
hw+0.941
Neg logit contributions
advertisement-1.470
bourg-1.414
-1.379
quished-1.336
Lex-1.207
Token break
Token ID2270
Feature activation-8.88e-01

Pos logit contributions
advertisement+0.617
bourg+0.581
+0.564
quished+0.536
 ASUS+0.479
Neg logit contributions
wit-0.719
earable-0.709
 Hud-0.707
hw-0.702
 Ow-0.702
Token }
Token ID1782
Feature activation-4.75e-01

Pos logit contributions
advertisement+2.343
bourg+2.173
+2.093
quished+2.010
 ASUS+1.837
Neg logit contributions
wit-3.099
earable-3.099
hw-3.023
reads-3.011
 Hud-3.005
Token }
Token ID1782
Feature activation-4.13e-01

Pos logit contributions
advertisement+1.387
bourg+1.297
+1.256
quished+1.200
 ASUS+1.107
Neg logit contributions
earable-1.545
wit-1.542
reads-1.498
hw-1.491
 Hud-1.489
Token else
Token ID2073
Feature activation-1.50e+00

Pos logit contributions
advertisement+1.222
bourg+1.146
+1.109
quished+1.064
 ASUS+0.969
Neg logit contributions
wit-1.331
earable-1.329
reads-1.292
 Hud-1.288
hw-1.287
Token {
Token ID1391
Feature activation+3.48e-01

Pos logit contributions
advertisement+4.018
bourg+3.707
+3.671
quished+3.578
 Secret+3.135
Neg logit contributions
earable-5.026
wit-4.933
enance-4.901
reads-4.890
 Hud-4.888
Token $
Token ID720
Feature activation+7.04e-02

Pos logit contributions
wit+1.085
 Hud+1.080
 Ow+1.078
hw+1.063
earable+1.062
Neg logit contributions
advertisement-1.069
bourg-1.018
-0.988
quished-0.947
Lex-0.836
TokenSR
Token ID12562
Feature activation-3.52e-01

Pos logit contributions
wit+0.190
 Hud+0.190
 Ow+0.190
earable+0.187
reads+0.185
Neg logit contributions
advertisement-0.245
bourg-0.233
-0.228
quished-0.219
xi-0.198
Token =
Token ID796
Feature activation+1.01e-01

Pos logit contributions
advertisement+1.000
bourg+0.936
+0.915
quished+0.871
 ASUS+0.762
Neg logit contributions
wit-1.176
 Hud-1.170
 Ow-1.162
earable-1.161
hw-1.147
Token [
Token ID685
Feature activation-5.80e-02

Pos logit contributions
wit+0.276
earable+0.272
 Hud+0.272
 Ow+0.270
hw+0.269
Neg logit contributions
advertisement-0.346
bourg-0.330
-0.325
quished-0.310
 ASUS-0.281
Token free
Token ID1479
Feature activation+4.91e-01

Pos logit contributions
wit+0.926
 Ow+0.915
 Hud+0.909
hw+0.891
earable+0.883
Neg logit contributions
advertisement-2.626
bourg-2.560
-2.531
quished-2.466
xi-2.258
Token newsletters
Token ID16983
Feature activation+1.43e-01

Pos logit contributions
 Ow+1.070
wit+1.066
 Hud+1.061
earable+1.040
reads+1.034
Neg logit contributions
advertisement-1.962
bourg-1.911
-1.867
quished-1.819
xi-1.639
Token.
Token ID13
Feature activation+3.71e-01

Pos logit contributions
wit+0.446
 Hud+0.444
 Ow+0.443
earable+0.439
hw+0.437
Neg logit contributions
advertisement-0.443
bourg-0.420
-0.407
quished-0.389
Lex-0.346
Token
Token ID198
Feature activation+3.60e-01

Pos logit contributions
wit+0.982
 Hud+0.974
 Ow+0.970
earable+0.960
hw+0.957
Neg logit contributions
advertisement-1.316
bourg-1.260
-1.228
quished-1.184
Lex-1.068
Token
Token ID198
Feature activation+2.89e-01

Pos logit contributions
wit+1.006
 Hud+0.994
 Ow+0.994
hw+0.976
reads+0.975
Neg logit contributions
advertisement-1.217
bourg-1.173
-1.146
quished-1.113
ipel-0.990
TokenUpdate
Token ID10260
Feature activation-1.49e+00

Pos logit contributions
wit+0.917
 Hud+0.905
 Ow+0.902
hw+0.896
earable+0.895
Neg logit contributions
advertisement-0.864
bourg-0.833
-0.808
quished-0.779
 ASUS-0.682
Token,
Token ID11
Feature activation-1.44e-01

Pos logit contributions
advertisement+3.799
bourg+3.578
+3.447
quished+3.225
 Secret+2.947
Neg logit contributions
wit-5.200
earable-5.199
 Hud-5.096
 Ow-5.092
izable-5.086
Token 3
Token ID513
Feature activation-1.73e-01

Pos logit contributions
advertisement+0.434
bourg+0.413
+0.400
quished+0.381
 ASUS+0.339
Neg logit contributions
wit-0.457
 Hud-0.450
earable-0.450
 Ow-0.449
hw-0.446
Token:
Token ID25
Feature activation-4.87e-02

Pos logit contributions
advertisement+0.465
bourg+0.437
+0.424
quished+0.402
Lex+0.351
Neg logit contributions
wit-0.607
 Hud-0.603
earable-0.599
 Ow-0.599
hw-0.592
Token01
Token ID486
Feature activation+1.63e-02

Pos logit contributions
advertisement+0.166
bourg+0.158
+0.154
quished+0.145
Lex+0.134
Neg logit contributions
wit-0.136
earable-0.134
 Hud-0.134
 Ow-0.133
hw-0.132
Token p
Token ID279
Feature activation-7.87e-02

Pos logit contributions
wit+0.041
 Hud+0.040
 Ow+0.040
earable+0.040
hw+0.040
Neg logit contributions
advertisement-0.060
bourg-0.058
-0.056
quished-0.054
Lex-0.049
Token(
Token ID7
Feature activation-4.47e-01

Pos logit contributions
advertisement+0.686
bourg+0.655
+0.639
quished+0.608
Lex+0.562
Neg logit contributions
 Hud-0.496
wit-0.496
 Ow-0.491
earable-0.488
hw-0.482
Token0
Token ID15
Feature activation+1.01e-02

Pos logit contributions
advertisement+1.508
bourg+1.431
+1.407
quished+1.334
xi+1.220
Neg logit contributions
wit-1.251
 Hud-1.235
earable-1.235
 Ow-1.223
reads-1.207
Tokens
Token ID82
Feature activation+8.53e-02

Pos logit contributions
wit+0.030
 Hud+0.030
 Ow+0.030
earable+0.030
hw+0.030
Neg logit contributions
advertisement-0.032
bourg-0.031
-0.030
quished-0.028
Lex-0.025
Token)}
Token ID38165
Feature activation-6.83e-02

Pos logit contributions
wit+0.189
 Hud+0.188
 Ow+0.187
earable+0.184
hw+0.183
Neg logit contributions
advertisement-0.339
bourg-0.326
-0.319
quished-0.308
Lex-0.282
Token c
Token ID269
Feature activation-2.47e-02

Pos logit contributions
advertisement+0.219
bourg+0.211
+0.204
quished+0.195
 ASUS+0.177
Neg logit contributions
wit-0.202
earable-0.200
 Hud-0.198
hw-0.197
reads-0.196
Tokenxd
Token ID24954
Feature activation-1.48e+00

Pos logit contributions
advertisement+0.073
bourg+0.070
+0.067
quished+0.064
xi+0.057
Neg logit contributions
wit-0.079
 Hud-0.079
earable-0.079
 Ow-0.079
hw-0.078
Token4
Token ID19
Feature activation-4.21e-01

Pos logit contributions
advertisement+3.787
bourg+3.659
+3.603
quished+3.310
xi+3.112
Neg logit contributions
earable-5.120
wit-5.101
 Hud-5.049
 Ow-5.002
enance-4.942
Token {
Token ID1391
Feature activation-7.19e-02

Pos logit contributions
advertisement+1.194
bourg+1.144
+1.107
quished+1.066
 ASUS+0.927
Neg logit contributions
wit-1.391
earable-1.384
 Hud-1.379
reads-1.364
hw-1.358
Token(
Token ID7
Feature activation-4.35e-01

Pos logit contributions
advertisement+0.260
bourg+0.249
+0.242
quished+0.232
Lex+0.213
Neg logit contributions
wit-0.185
 Hud-0.184
 Ow-0.183
earable-0.182
hw-0.180
Token341
Token ID33660
Feature activation+7.67e-02

Pos logit contributions
advertisement+1.476
bourg+1.403
+1.378
quished+1.304
xi+1.196
Neg logit contributions
wit-1.207
 Hud-1.195
earable-1.192
 Ow-1.185
hw-1.165
Tokens
Token ID82
Feature activation+3.59e-02

Pos logit contributions
wit+0.228
 Hud+0.226
 Ow+0.225
earable+0.223
hw+0.222
Neg logit contributions
advertisement-0.247
bourg-0.236
-0.229
quished-0.220
Lex-0.196
Token Select
Token ID9683
Feature activation+2.66e-01

Pos logit contributions
advertisement+0.518
bourg+0.486
+0.478
quished+0.444
 ASUS+0.389
Neg logit contributions
wit-0.766
earable-0.761
 Hud-0.758
 Ow-0.754
reads-0.752
Token Width
Token ID38807
Feature activation-7.87e-02

Pos logit contributions
wit+0.851
 Hud+0.846
 Ow+0.842
earable+0.834
hw+0.833
Neg logit contributions
advertisement-0.795
bourg-0.757
-0.732
quished-0.702
Lex-0.616
Token,
Token ID11
Feature activation-6.20e-01

Pos logit contributions
advertisement+0.230
bourg+0.219
+0.212
quished+0.203
 ASUS+0.179
Neg logit contributions
wit-0.256
 Hud-0.254
 Ow-0.253
earable-0.252
hw-0.250
TokenHeight
Token ID23106
Feature activation-3.27e-01

Pos logit contributions
advertisement+1.753
bourg+1.656
+1.646
quished+1.515
 Secret+1.388
Neg logit contributions
wit-2.072
earable-2.031
 Hud-2.021
 Ow-2.016
reads-2.012
Token }
Token ID1782
Feature activation-3.95e-01

Pos logit contributions
advertisement+0.954
bourg+0.904
+0.879
quished+0.839
 ASUS+0.749
Neg logit contributions
wit-1.065
earable-1.051
 Hud-1.050
 Ow-1.043
hw-1.041
Token else
Token ID2073
Feature activation-1.47e+00

Pos logit contributions
advertisement+1.141
bourg+1.072
+1.040
quished+0.990
 ASUS+0.903
Neg logit contributions
wit-1.303
earable-1.298
reads-1.263
hw-1.259
 Hud-1.256
Token {
Token ID1391
Feature activation+5.47e-01

Pos logit contributions
advertisement+3.949
bourg+3.628
+3.609
quished+3.484
 Secret+3.096
Neg logit contributions
earable-4.921
wit-4.830
reads-4.797
enance-4.792
 Hud-4.772
Token Write
Token ID19430
Feature activation-3.40e-02

Pos logit contributions
wit+1.299
 Hud+1.293
 Ow+1.290
hw+1.265
earable+1.261
Neg logit contributions
advertisement-2.083
bourg-2.002
-1.954
quished-1.892
Lex-1.716
Token-
Token ID12
Feature activation-4.05e-01

Pos logit contributions
advertisement+0.103
bourg+0.099
+0.095
quished+0.091
xi+0.080
Neg logit contributions
wit-0.107
 Hud-0.107
 Ow-0.106
earable-0.105
hw-0.105
TokenWarning
Token ID20361
Feature activation-4.53e-01

Pos logit contributions
advertisement+1.304
bourg+1.224
+1.191
quished+1.142
Lex+1.040
Neg logit contributions
wit-1.212
 Hud-1.208
 Ow-1.206
earable-1.192
reads-1.174
Token -
Token ID532
Feature activation-5.99e-01

Pos logit contributions
advertisement+1.274
bourg+1.196
+1.170
quished+1.116
 ASUS+0.978
Neg logit contributions
wit-1.527
earable-1.504
 Hud-1.504
 Ow-1.501
hw-1.493
Token-
Token ID12
Feature activation+1.05e-01

Pos logit contributions
advertisement+1.089
bourg+1.035
+1.006
quished+0.962
 ASUS+0.858
Neg logit contributions
wit-1.082
 Ow-1.074
 Hud-1.074
earable-1.065
reads-1.059
Tokenorder
Token ID2875
Feature activation+1.16e-01

Pos logit contributions
wit+0.297
 Hud+0.294
 Ow+0.293
earable+0.290
hw+0.290
Neg logit contributions
advertisement-0.355
bourg-0.340
-0.331
quished-0.319
Lex-0.285
Token through
Token ID832
Feature activation+2.53e-03

Pos logit contributions
wit+0.326
 Hud+0.323
 Ow+0.322
earable+0.318
hw+0.317
Neg logit contributions
advertisement-0.393
bourg-0.376
-0.366
quished-0.352
Lex-0.316
Token
Token ID198
Feature activation-4.54e-01

Pos logit contributions
wit+0.008
 Hud+0.008
 Ow+0.008
earable+0.008
hw+0.008
Neg logit contributions
advertisement-0.007
bourg-0.007
-0.007
quished-0.007
 ASUS-0.006
Token
Token ID198
Feature activation-4.28e-01

Pos logit contributions
advertisement+1.421
bourg+1.338
+1.285
quished+1.208
Lex+1.115
Neg logit contributions
wit-1.387
 Hud-1.385
earable-1.380
 Ow-1.375
hw-1.360
TokenUPDATE
Token ID16977
Feature activation-1.43e+00

Pos logit contributions
advertisement+1.402
bourg+1.307
+1.269
quished+1.209
Lex+1.104
Neg logit contributions
 Ow-1.272
 Hud-1.268
wit-1.256
earable-1.247
hw-1.221
Token:
Token ID25
Feature activation-2.24e-01

Pos logit contributions
advertisement+3.745
bourg+3.463
+3.452
quished+3.195
 ASUS+2.922
Neg logit contributions
wit-4.949
earable-4.942
 Hud-4.817
izable-4.810
 Ow-4.804
Token Looks
Token ID29403
Feature activation-7.00e-01

Pos logit contributions
advertisement+0.674
bourg+0.637
+0.617
quished+0.591
 ASUS+0.531
Neg logit contributions
wit-0.713
 Hud-0.706
earable-0.703
 Ow-0.702
hw-0.698
Token like
Token ID588
Feature activation-9.91e-01

Pos logit contributions
advertisement+1.681
bourg+1.578
+1.514
quished+1.424
 ASUS+1.235
Neg logit contributions
wit-2.647
 Hud-2.618
earable-2.604
 Ow-2.604
hw-2.591
Token only
Token ID691
Feature activation-8.56e-02

Pos logit contributions
advertisement+2.796
bourg+2.626
+2.503
quished+2.406
 ASUS+2.276
Neg logit contributions
wit-3.290
earable-3.253
hw-3.217
 Ow-3.204
 Hud-3.194
Token a
Token ID257
Feature activation-6.81e-01

Pos logit contributions
advertisement+0.259
bourg+0.246
+0.237
quished+0.227
 ASUS+0.206
Neg logit contributions
wit-0.271
 Hud-0.267
 Ow-0.267
earable-0.266
hw-0.265
Token not
Token ID407
Feature activation-8.81e-02

Pos logit contributions
wit+0.321
 Hud+0.318
 Ow+0.317
earable+0.315
hw+0.312
Neg logit contributions
advertisement-0.379
bourg-0.361
-0.352
quished-0.337
Lex-0.303
Token know
Token ID760
Feature activation-3.49e-01

Pos logit contributions
advertisement+0.305
bourg+0.293
+0.283
quished+0.270
 desp+0.249
Neg logit contributions
wit-0.237
 Hud-0.235
 Ow-0.235
earable-0.235
reads-0.233
Token what
Token ID644
Feature activation-3.42e-01

Pos logit contributions
advertisement+1.111
bourg+1.050
+1.018
quished+0.973
 ASUS+0.880
Neg logit contributions
wit-1.044
earable-1.034
 Hud-1.029
hw-1.023
 Ow-1.022
Token it
Token ID340
Feature activation-5.41e-01

Pos logit contributions
advertisement+1.018
bourg+0.964
+0.933
quished+0.888
 ASUS+0.791
Neg logit contributions
wit-1.099
 Hud-1.085
 Ow-1.085
earable-1.081
hw-1.075
Token is
Token ID318
Feature activation-7.01e-01

Pos logit contributions
advertisement+1.553
bourg+1.479
+1.443
quished+1.347
xi+1.205
Neg logit contributions
wit-1.792
 Ow-1.763
 Hud-1.759
earable-1.757
hw-1.750
Token like
Token ID588
Feature activation-1.43e+00

Pos logit contributions
advertisement+1.892
bourg+1.783
+1.717
quished+1.612
 Secret+1.433
Neg logit contributions
wit-2.437
 Hud-2.400
earable-2.391
 Ow-2.385
hw-2.380
Token to
Token ID284
Feature activation+2.96e-01

Pos logit contributions
advertisement+3.369
bourg+3.044
+2.933
quished+2.736
 desp+2.429
Neg logit contributions
wit-5.385
earable-5.368
 Hud-5.358
 Ow-5.349
hw-5.339
Token be
Token ID307
Feature activation+4.25e-02

Pos logit contributions
wit+0.809
 Hud+0.802
 Ow+0.800
earable+0.792
hw+0.789
Neg logit contributions
advertisement-1.024
bourg-0.980
-0.953
quished-0.918
Lex-0.825
Token a
Token ID257
Feature activation-2.97e-01

Pos logit contributions
wit+0.122
 Hud+0.121
 Ow+0.121
earable+0.120
hw+0.120
Neg logit contributions
advertisement-0.141
bourg-0.134
-0.130
quished-0.126
Lex-0.112
Token member
Token ID2888
Feature activation-5.07e-01

Pos logit contributions
advertisement+0.979
bourg+0.933
+0.910
quished+0.875
 desp+0.789
Neg logit contributions
wit-0.849
 Hud-0.845
earable-0.841
 Ow-0.840
hw-0.838
Token of
Token ID286
Feature activation+5.98e-02

Pos logit contributions
advertisement+1.209
bourg+1.125
+1.089
quished+1.032
Lex+0.863
Neg logit contributions
wit-1.926
 Hud-1.916
 Ow-1.909
earable-1.902
hw-1.894
Token
Token ID564
Feature activation-9.53e-02

Pos logit contributions
advertisement+1.427
bourg+1.344
+1.274
quished+1.223
Paris+1.121
Neg logit contributions
wit-1.360
earable-1.353
hw-1.345
 Ow-1.345
reads-1.339
Token
Token ID250
Feature activation-7.65e-02

Pos logit contributions
advertisement+0.234
bourg+0.216
+0.210
quished+0.196
xi+0.168
Neg logit contributions
wit-0.357
 Hud-0.356
 Ow-0.355
earable-0.352
hw-0.351
Tokenwhat
Token ID10919
Feature activation+9.10e-03

Pos logit contributions
advertisement+0.264
bourg+0.251
+0.243
quished+0.233
Paris+0.211
Neg logit contributions
 Hud-0.211
 Ow-0.211
wit-0.210
earable-0.208
hw-0.206
Token it
Token ID340
Feature activation-2.34e-01

Pos logit contributions
wit+0.028
 Ow+0.028
 Hud+0.028
earable+0.027
hw+0.027
Neg logit contributions
advertisement-0.028
bourg-0.027
-0.026
quished-0.025
Paris-0.022
Token feels
Token ID5300
Feature activation-4.53e-01

Pos logit contributions
advertisement+0.695
bourg+0.660
+0.644
quished+0.609
xi+0.539
Neg logit contributions
wit-0.756
 Ow-0.744
 Hud-0.744
earable-0.740
hw-0.738
Token like
Token ID588
Feature activation-1.43e+00

Pos logit contributions
advertisement+1.116
bourg+1.048
+1.009
quished+0.952
xi+0.810
Neg logit contributions
wit-1.690
 Hud-1.678
 Ow-1.673
earable-1.663
hw-1.660
Token to
Token ID284
Feature activation+1.68e-01

Pos logit contributions
advertisement+3.499
bourg+3.191
+3.098
quished+2.856
 desp+2.550
Neg logit contributions
earable-5.234
wit-5.227
hw-5.168
 Hud-5.165
 Ow-5.159
Token attain
Token ID18188
Feature activation-1.83e-01

Pos logit contributions
wit+0.461
 Hud+0.458
 Ow+0.457
earable+0.453
hw+0.451
Neg logit contributions
advertisement-0.577
bourg-0.551
-0.535
quished-0.512
xi-0.463
Token the
Token ID262
Feature activation+1.12e-01

Pos logit contributions
advertisement+0.606
bourg+0.570
+0.548
quished+0.526
 Secret+0.479
Neg logit contributions
wit-0.531
 Hud-0.526
 Ow-0.526
earable-0.524
hw-0.524
Token child
Token ID1200
Feature activation-2.22e-01

Pos logit contributions
wit+0.305
 Hud+0.301
 Ow+0.301
hw+0.299
earable+0.298
Neg logit contributions
advertisement-0.390
bourg-0.369
-0.359
quished-0.345
xi-0.313
Token
Token ID447
Feature activation+2.46e-02

Pos logit contributions
advertisement+0.742
bourg+0.705
+0.692
quished+0.659
xi+0.598
Neg logit contributions
 Hud-0.626
 Ow-0.624
wit-0.621
hw-0.618
earable-0.617
Token cigarette
Token ID17779
Feature activation-6.56e-02

Pos logit contributions
wit+0.711
 Hud+0.707
 Ow+0.705
earable+0.698
hw+0.695
Neg logit contributions
advertisement-0.716
bourg-0.681
-0.662
quished-0.635
Lex-0.561
Token brands
Token ID11394
Feature activation-1.73e-01

Pos logit contributions
advertisement+0.203
bourg+0.193
+0.187
quished+0.179
Lex+0.159
Neg logit contributions
wit-0.203
 Hud-0.201
 Ow-0.200
earable-0.199
hw-0.199
Token,
Token ID11
Feature activation-1.15e-01

Pos logit contributions
advertisement+0.548
bourg+0.524
+0.507
quished+0.487
Lex+0.432
Neg logit contributions
wit-0.522
 Hud-0.517
 Ow-0.516
earable-0.512
hw-0.509
Token saw
Token ID2497
Feature activation-7.64e-01

Pos logit contributions
advertisement+0.369
bourg+0.352
+0.342
quished+0.329
Lex+0.293
Neg logit contributions
wit-0.340
 Hud-0.338
 Ow-0.336
earable-0.333
hw-0.332
Token its
Token ID663
Feature activation-5.41e-01

Pos logit contributions
advertisement+2.302
bourg+2.173
+2.105
quished+2.018
 ASUS+1.829
Neg logit contributions
wit-2.391
earable-2.355
hw-2.351
 Hud-2.339
 Ow-2.330
Token stock
Token ID4283
Feature activation-1.42e+00

Pos logit contributions
advertisement+1.569
bourg+1.489
+1.435
quished+1.366
 ASUS+1.217
Neg logit contributions
wit-1.765
earable-1.743
 Hud-1.742
 Ow-1.736
hw-1.736
Token decline
Token ID7794
Feature activation-1.24e+00

Pos logit contributions
advertisement+4.145
bourg+4.021
+3.859
quished+3.621
xi+3.388
Neg logit contributions
earable-4.392
 Hud-4.377
reads-4.347
wit-4.344
hw-4.305
Token more
Token ID517
Feature activation-9.28e-02

Pos logit contributions
advertisement+3.561
bourg+3.462
+3.195
quished+3.046
 Secret+2.856
Neg logit contributions
earable-3.994
wit-3.942
 Hud-3.851
reads-3.835
hw-3.829
Token than
Token ID621
Feature activation+2.17e-01

Pos logit contributions
advertisement+0.348
bourg+0.335
+0.325
quished+0.315
Lex+0.286
Neg logit contributions
wit-0.226
 Hud-0.224
earable-0.222
 Ow-0.222
hw-0.220
Token 4
Token ID604
Feature activation-2.37e-01

Pos logit contributions
wit+0.703
 Hud+0.697
 Ow+0.695
earable+0.690
hw+0.688
Neg logit contributions
advertisement-0.639
bourg-0.607
-0.588
quished-0.562
Lex-0.495
Token percent
Token ID1411
Feature activation-3.06e-01

Pos logit contributions
advertisement+0.801
bourg+0.763
+0.745
quished+0.716
Lex+0.643
Neg logit contributions
wit-0.667
 Hud-0.661
 Ow-0.658
earable-0.655
hw-0.651
Token said
Token ID531
Feature activation-5.18e-02

Pos logit contributions
advertisement+0.304
bourg+0.290
+0.281
quished+0.271
 Secret+0.247
Neg logit contributions
wit-0.247
earable-0.243
 Hud-0.243
hw-0.242
 Ow-0.242
Token.
Token ID13
Feature activation+3.98e-03

Pos logit contributions
advertisement+0.159
bourg+0.151
+0.146
quished+0.140
Lex+0.124
Neg logit contributions
wit-0.162
 Hud-0.161
 Ow-0.160
earable-0.160
hw-0.159
Token
Token ID198
Feature activation-3.29e-03

Pos logit contributions
wit+0.012
 Hud+0.012
 Ow+0.012
earable+0.011
hw+0.011
Neg logit contributions
advertisement-0.013
bourg-0.012
-0.012
quished-0.012
Lex-0.010
Token
Token ID198
Feature activation-3.90e-02

Pos logit contributions
advertisement+0.011
bourg+0.010
+0.010
quished+0.010
Lex+0.009
Neg logit contributions
wit-0.010
 Hud-0.009
 Ow-0.009
earable-0.009
hw-0.009
TokenC
Token ID34
Feature activation+1.34e-01

Pos logit contributions
advertisement+0.121
bourg+0.114
+0.110
quished+0.105
Lex+0.095
Neg logit contributions
wit-0.121
 Ow-0.121
 Hud-0.121
earable-0.120
hw-0.119
TokenEL
Token ID3698
Feature activation-1.41e+00

Pos logit contributions
 Hud+0.144
wit+0.144
 Ow+0.143
earable+0.140
reads+0.137
Neg logit contributions
advertisement-0.678
bourg-0.661
-0.649
quished-0.631
xi-0.590
TokenAC
Token ID2246
Feature activation-2.65e-01

Pos logit contributions
advertisement+3.475
bourg+3.343
+3.132
quished+3.004
 Secret+2.695
Neg logit contributions
wit-5.034
 Hud-5.025
earable-5.013
reads-5.006
 Ow-4.968
Token IS
Token ID3180
Feature activation-1.81e-01

Pos logit contributions
advertisement+0.859
bourg+0.819
+0.795
quished+0.766
Lex+0.684
Neg logit contributions
wit-0.782
 Hud-0.775
 Ow-0.771
earable-0.766
hw-0.765
Token B
Token ID347
Feature activation+2.26e-01

Pos logit contributions
advertisement+0.557
bourg+0.530
+0.514
quished+0.491
 ASUS+0.437
Neg logit contributions
wit-0.561
 Hud-0.555
 Ow-0.552
earable-0.551
hw-0.547
TokenORN
Token ID30649
Feature activation+1.58e-01

Pos logit contributions
wit+0.690
 Hud+0.684
 Ow+0.681
earable+0.674
hw+0.674
Neg logit contributions
advertisement-0.710
bourg-0.676
-0.657
quished-0.631
Lex-0.559
Token
Token ID198
Feature activation+1.95e-01

Pos logit contributions
wit+0.470
 Hud+0.466
 Ow+0.466
earable+0.459
hw+0.458
Neg logit contributions
advertisement-0.509
bourg-0.486
-0.472
quished-0.455
Lex-0.404
TokenLINE
Token ID24027
Feature activation-2.82e-02

Pos logit contributions
advertisement+1.713
bourg+1.615
+1.581
quished+1.489
Lex+1.369
Neg logit contributions
wit-1.548
 Hud-1.535
 Ow-1.529
earable-1.514
hw-1.511
Token |
Token ID930
Feature activation-6.63e-02

Pos logit contributions
advertisement+0.097
bourg+0.093
+0.091
quished+0.088
ipel+0.078
Neg logit contributions
wit-0.077
 Ow-0.077
 Hud-0.077
hw-0.076
earable-0.076
Token Do
Token ID2141
Feature activation-6.71e-02

Pos logit contributions
advertisement+0.221
bourg+0.210
+0.204
quished+0.195
Lex+0.177
Neg logit contributions
wit-0.191
earable-0.187
 Hud-0.187
hw-0.186
 Ow-0.185
Token you
Token ID345
Feature activation-7.15e-02

Pos logit contributions
advertisement+0.193
bourg+0.183
+0.177
quished+0.169
Lex+0.149
Neg logit contributions
wit-0.223
 Hud-0.220
 Ow-0.219
earable-0.219
hw-0.217
Token feel
Token ID1254
Feature activation-3.19e-01

Pos logit contributions
advertisement+0.241
bourg+0.231
+0.225
quished+0.215
 desp+0.196
Neg logit contributions
wit-0.200
 Hud-0.198
earable-0.197
 Ow-0.197
reads-0.196
Token like
Token ID588
Feature activation-1.40e+00

Pos logit contributions
advertisement+0.922
bourg+0.883
+0.850
quished+0.818
 desp+0.728
Neg logit contributions
wit-1.047
earable-1.035
 Hud-1.026
 Ow-1.024
hw-1.020
Token a
Token ID257
Feature activation-1.72e-01

Pos logit contributions
advertisement+4.015
bourg+3.767
+3.678
quished+3.495
 Secret+3.254
Neg logit contributions
wit-4.455
earable-4.444
hw-4.366
reads-4.337
 Ow-4.286
Token terrible
Token ID7818
Feature activation-2.78e-01

Pos logit contributions
advertisement+0.558
bourg+0.531
+0.520
quished+0.498
 Secret+0.447
Neg logit contributions
wit-0.503
earable-0.497
 Ow-0.497
 Hud-0.496
reads-0.492
Token person
Token ID1048
Feature activation-3.41e-01

Pos logit contributions
advertisement+0.874
bourg+0.825
+0.812
quished+0.774
 desp+0.704
Neg logit contributions
wit-0.838
 Hud-0.826
earable-0.824
 Ow-0.823
hw-0.821
Token for
Token ID329
Feature activation-3.00e-01

Pos logit contributions
advertisement+0.984
bourg+0.927
+0.905
quished+0.860
Lex+0.754
Neg logit contributions
wit-1.126
 Hud-1.118
 Ow-1.114
earable-1.110
hw-1.101
Token doing
Token ID1804
Feature activation+1.01e-01

Pos logit contributions
advertisement+0.900
bourg+0.850
+0.831
quished+0.783
 desp+0.705
Neg logit contributions
wit-0.958
earable-0.949
 Hud-0.941
 Ow-0.939
reads-0.934
Token that
Token ID326
Feature activation-2.06e-01

Pos logit contributions
advertisement+0.693
bourg+0.660
+0.638
quished+0.610
 ASUS+0.542
Neg logit contributions
wit-0.728
 Hud-0.719
 Ow-0.715
earable-0.714
reads-0.709
Token have
Token ID423
Feature activation-3.43e-02

Pos logit contributions
advertisement+0.716
bourg+0.687
+0.665
quished+0.641
 ASUS+0.584
Neg logit contributions
wit-0.559
 Ow-0.547
earable-0.546
reads-0.546
 Hud-0.546
Token traditionally
Token ID16083
Feature activation-4.93e-01

Pos logit contributions
advertisement+0.111
bourg+0.106
+0.103
quished+0.099
 ASUS+0.089
Neg logit contributions
wit-0.101
 Ow-0.099
 Hud-0.099
earable-0.099
reads-0.099
Token been
Token ID587
Feature activation+9.79e-02

Pos logit contributions
advertisement+1.587
bourg+1.504
+1.461
quished+1.403
 desp+1.283
Neg logit contributions
wit-1.459
earable-1.431
 Ow-1.427
reads-1.425
 Hud-1.423
Token fre
Token ID2030
Feature activation+1.05e-01

Pos logit contributions
wit+0.279
 Hud+0.276
 Ow+0.275
earable+0.273
hw+0.272
Neg logit contributions
advertisement-0.327
bourg-0.313
-0.303
quished-0.293
 ASUS-0.262
Tokenew
Token ID413
Feature activation-1.39e+00

Pos logit contributions
wit+0.292
 Hud+0.290
 Ow+0.288
earable+0.285
reads+0.282
Neg logit contributions
advertisement-0.359
bourg-0.344
-0.334
quished-0.323
antic-0.290
Tokenhe
Token ID258
Feature activation-8.39e-01

Pos logit contributions
advertisement+4.269
bourg+4.093
quished+3.901
+3.743
antic+3.551
Neg logit contributions
wit-4.064
 Ow-3.954
 Hud-3.946
earable-3.873
reads-3.809
Tokeneling
Token ID10809
Feature activation-7.21e-02

Pos logit contributions
advertisement+2.509
bourg+2.366
+2.230
quished+2.227
antic+1.963
Neg logit contributions
 Ow-2.679
 Hud-2.670
reads-2.618
earable-2.614
wit-2.610
Token forums
Token ID14216
Feature activation-5.38e-01

Pos logit contributions
advertisement+0.228
bourg+0.217
+0.209
quished+0.203
 Secret+0.180
Neg logit contributions
wit-0.219
 Hud-0.216
 Ow-0.215
earable-0.214
hw-0.213
Token where
Token ID810
Feature activation-2.14e-01

Pos logit contributions
advertisement+1.594
bourg+1.513
+1.457
quished+1.395
 Secret+1.235
Neg logit contributions
wit-1.728
 Hud-1.716
 Ow-1.708
earable-1.705
reads-1.691
Token users
Token ID2985
Feature activation-2.71e-01

Pos logit contributions
advertisement+0.666
bourg+0.631
+0.606
quished+0.581
 Secret+0.527
Neg logit contributions
wit-0.659
 Hud-0.651
 Ow-0.649
hw-0.648
earable-0.648
Token the
Token ID262
Feature activation-3.55e-02

Pos logit contributions
advertisement+0.366
bourg+0.348
+0.339
quished+0.325
Lex+0.289
Neg logit contributions
wit-0.337
 Hud-0.335
 Ow-0.333
earable-0.331
hw-0.329
Token first
Token ID717
Feature activation+3.21e-01

Pos logit contributions
advertisement+0.108
bourg+0.102
+0.099
quished+0.095
Lex+0.084
Neg logit contributions
wit-0.112
 Hud-0.111
 Ow-0.110
earable-0.110
hw-0.109
Token few
Token ID1178
Feature activation-3.44e-01

Pos logit contributions
wit+0.930
 Hud+0.924
 Ow+0.922
earable+0.912
hw+0.906
Neg logit contributions
advertisement-1.054
bourg-1.012
-0.983
quished-0.949
Lex-0.846
Token hours
Token ID2250
Feature activation-2.88e-01

Pos logit contributions
advertisement+0.942
bourg+0.886
+0.866
quished+0.817
 ASUS+0.712
Neg logit contributions
wit-1.189
 Hud-1.174
 Ow-1.170
earable-1.167
hw-1.160
Token mentioned
Token ID4750
Feature activation+4.56e-01

Pos logit contributions
advertisement+0.780
bourg+0.734
+0.714
quished+0.677
 ASUS+0.588
Neg logit contributions
wit-0.999
 Hud-0.996
 Ow-0.989
earable-0.986
hw-0.984
Token above
Token ID2029
Feature activation+9.40e-01

Pos logit contributions
wit+1.324
 Hud+1.316
 Ow+1.310
earable+1.296
hw+1.294
Neg logit contributions
advertisement-1.494
bourg-1.430
-1.391
quished-1.332
Lex-1.192
Token,
Token ID11
Feature activation+2.54e-01

Pos logit contributions
wit+2.948
 Ow+2.892
 Hud+2.875
earable+2.850
hw+2.828
Neg logit contributions
advertisement-2.879
bourg-2.762
-2.658
quished-2.553
Lex-2.286
Token your
Token ID534
Feature activation+9.47e-02

Pos logit contributions
wit+0.768
 Hud+0.762
 Ow+0.759
earable+0.753
hw+0.750
Neg logit contributions
advertisement-0.802
bourg-0.765
-0.742
quished-0.713
Lex-0.633
Token body
Token ID1767
Feature activation+4.85e-01

Pos logit contributions
wit+0.272
 Hud+0.269
 Ow+0.268
earable+0.266
hw+0.266
Neg logit contributions
advertisement-0.314
bourg-0.299
-0.292
quished-0.280
Lex-0.250
Token goes
Token ID2925
Feature activation+9.86e-02

Pos logit contributions
wit+1.367
 Hud+1.357
 Ow+1.355
reads+1.335
earable+1.333
Neg logit contributions
advertisement-1.633
bourg-1.568
-1.522
quished-1.469
Lex-1.310
Token into
Token ID656
Feature activation-2.69e-01

Pos logit contributions
wit+0.309
 Hud+0.306
 Ow+0.304
earable+0.303
hw+0.301
Neg logit contributions
advertisement-0.302
bourg-0.287
-0.279
quished-0.267
Lex-0.236
Token be
Token ID307
Feature activation+3.68e-02

Pos logit contributions
advertisement+0.684
bourg+0.660
+0.634
quished+0.605
 desp+0.559
Neg logit contributions
wit-0.557
 Hud-0.551
earable-0.549
 Ow-0.548
hw-0.542
Token looking
Token ID2045
Feature activation-1.36e-01

Pos logit contributions
wit+0.091
earable+0.090
 Hud+0.090
 Ow+0.089
hw+0.088
Neg logit contributions
advertisement-0.136
bourg-0.131
-0.128
quished-0.123
 desp-0.113
Token to
Token ID284
Feature activation+3.09e-01

Pos logit contributions
advertisement+0.398
bourg+0.377
+0.366
quished+0.349
Lex+0.307
Neg logit contributions
wit-0.447
 Hud-0.443
 Ow-0.441
earable-0.439
hw-0.437
Token g
Token ID308
Feature activation+3.30e-02

Pos logit contributions
wit+0.805
 Hud+0.797
 Ow+0.794
earable+0.790
hw+0.786
Neg logit contributions
advertisement-1.105
bourg-1.060
-1.033
quished-0.993
Lex-0.900
Tokenank
Token ID962
Feature activation+3.84e-01

Pos logit contributions
 Hud+0.089
 Ow+0.089
wit+0.088
earable+0.088
hw+0.087
Neg logit contributions
advertisement-0.115
bourg-0.110
-0.106
quished-0.103
xi-0.093
Token immedi
Token ID2729
Feature activation+9.23e-01

Pos logit contributions
wit+1.107
 Hud+1.099
 Ow+1.095
earable+1.085
hw+1.081
Neg logit contributions
advertisement-1.268
bourg-1.211
-1.177
quished-1.132
Lex-1.012
Tokenat
Token ID265
Feature activation-4.57e-01

Pos logit contributions
wit+2.777
 Ow+2.703
 Hud+2.684
hw+2.684
reads+2.669
Neg logit contributions
advertisement-2.950
bourg-2.827
-2.736
quished-2.648
ipel-2.329
Tokenly
Token ID306
Feature activation+4.27e-01

Pos logit contributions
advertisement+1.563
bourg+1.495
+1.462
quished+1.407
Lex+1.271
Neg logit contributions
 Ow-1.237
 Hud-1.237
earable-1.232
wit-1.231
reads-1.218
Token after
Token ID706
Feature activation-3.56e-01

Pos logit contributions
wit+1.331
 Hud+1.320
 Ow+1.317
earable+1.300
hw+1.299
Neg logit contributions
advertisement-1.315
bourg-1.251
-1.217
quished-1.167
Lex-1.029
Token 6
Token ID718
Feature activation+4.00e-01

Pos logit contributions
advertisement+1.107
bourg+1.048
+1.012
quished+0.964
 Secret+0.874
Neg logit contributions
wit-1.089
earable-1.085
 Hud-1.077
hw-1.074
 Ow-1.074
Token.
Token ID13
Feature activation+2.48e-01

Pos logit contributions
wit+1.279
 Hud+1.269
 Ow+1.266
earable+1.254
hw+1.251
Neg logit contributions
advertisement-1.198
bourg-1.139
-1.104
quished-1.057
Lex-0.931
Token Driver
Token ID12434
Feature activation+1.89e-01

Pos logit contributions
 Hud+0.819
wit+0.818
 Ow+0.815
earable+0.804
reads+0.800
Neg logit contributions
advertisement-0.847
bourg-0.807
-0.784
quished-0.752
Lex-0.664
Token,"
Token ID553
Feature activation+3.99e-01

Pos logit contributions
wit+0.522
 Hud+0.513
 Ow+0.512
earable+0.510
hw+0.508
Neg logit contributions
advertisement-0.652
bourg-0.627
-0.605
quished-0.582
 Secret-0.525
Token the
Token ID262
Feature activation+2.63e-01

Pos logit contributions
wit+1.349
 Hud+1.343
 Ow+1.338
earable+1.326
hw+1.321
Neg logit contributions
advertisement-1.122
bourg-1.061
-1.029
quished-0.981
Lex-0.854
Token 1976
Token ID15408
Feature activation+3.47e-01

Pos logit contributions
wit+0.783
 Hud+0.778
 Ow+0.775
earable+0.768
hw+0.765
Neg logit contributions
advertisement-0.843
bourg-0.804
-0.782
quished-0.751
Lex-0.668
Token Martin
Token ID5780
Feature activation+2.75e-01

Pos logit contributions
wit+1.001
 Hud+1.001
 Ow+0.994
earable+0.981
hw+0.976
Neg logit contributions
advertisement-1.145
bourg-1.090
-1.063
quished-1.025
Lex-0.914
Token Sc
Token ID1446
Feature activation+1.10e+00

Pos logit contributions
wit+0.844
 Hud+0.832
 Ow+0.831
earable+0.828
hw+0.823
Neg logit contributions
advertisement-0.857
bourg-0.818
-0.791
quished-0.761
Lex-0.674
Tokenors
Token ID669
Feature activation-2.83e-01

Pos logit contributions
wit+2.589
 Hud+2.519
hw+2.513
reads+2.500
 Ow+2.490
Neg logit contributions
advertisement-4.220
bourg-4.043
-3.962
quished-3.840
 ASUS-3.531
Tokenese
Token ID2771
Feature activation-5.23e-02

Pos logit contributions
advertisement+1.035
bourg+1.001
+0.961
quished+0.933
xi+0.862
Neg logit contributions
 Ow-0.708
wit-0.700
earable-0.698
 Hud-0.698
reads-0.688
Token film
Token ID2646
Feature activation+7.79e-02

Pos logit contributions
advertisement+0.172
bourg+0.164
+0.159
quished+0.153
Lex+0.136
Neg logit contributions
wit-0.152
 Hud-0.151
 Ow-0.150
earable-0.149
hw-0.148
Token that
Token ID326
Feature activation-4.34e-01

Pos logit contributions
wit+0.239
 Hud+0.236
 Ow+0.236
earable+0.235
hw+0.234
Neg logit contributions
advertisement-0.242
bourg-0.231
-0.224
quished-0.216
Lex-0.191
Token's
Token ID338
Feature activation-2.41e-01

Pos logit contributions
advertisement+1.412
bourg+1.367
+1.310
quished+1.267
 Secret+1.153
Neg logit contributions
wit-1.257
earable-1.238
hw-1.224
 Ow-1.214
 Hud-1.211
Token Gilbert
Token ID24023
Feature activation+3.58e-01

Pos logit contributions
wit+0.443
 Ow+0.437
earable+0.437
 Hud+0.435
reads+0.435
Neg logit contributions
advertisement-0.487
bourg-0.463
-0.452
quished-0.431
Lex-0.386
Token was
Token ID373
Feature activation+3.19e-01

Pos logit contributions
wit+1.042
 Hud+1.038
 Ow+1.033
hw+1.019
earable+1.018
Neg logit contributions
advertisement-1.175
bourg-1.119
-1.085
quished-1.047
Lex-0.934
Token on
Token ID319
Feature activation+4.87e-01

Pos logit contributions
wit+0.966
 Ow+0.953
 Hud+0.951
hw+0.947
earable+0.946
Neg logit contributions
advertisement-1.012
bourg-0.961
-0.936
quished-0.891
 ASUS-0.800
Token the
Token ID262
Feature activation+4.44e-01

Pos logit contributions
wit+1.412
 Hud+1.407
 Ow+1.398
earable+1.385
hw+1.379
Neg logit contributions
advertisement-1.601
bourg-1.528
-1.487
quished-1.430
Lex-1.276
Token ice
Token ID4771
Feature activation-3.74e-01

Pos logit contributions
wit+1.461
 Hud+1.436
 Ow+1.436
earable+1.435
hw+1.434
Neg logit contributions
advertisement-1.295
bourg-1.221
-1.191
quished-1.126
 ASUS-0.999
Token last
Token ID938
Feature activation+9.34e-01

Pos logit contributions
advertisement+1.089
bourg+1.022
+1.006
quished+0.942
Lex+0.841
Neg logit contributions
wit-1.219
earable-1.209
 Hud-1.203
 Ow-1.202
reads-1.199
Token year
Token ID614
Feature activation-1.62e-01

Pos logit contributions
 Hud+2.174
wit+2.173
 Ow+2.143
earable+2.107
hw+2.095
Neg logit contributions
advertisement-3.595
bourg-3.473
-3.371
quished-3.285
Lex-2.967
Token.
Token ID13
Feature activation+2.59e-01

Pos logit contributions
advertisement+0.487
bourg+0.461
+0.449
quished+0.428
Lex+0.377
Neg logit contributions
wit-0.515
 Hud-0.511
 Ow-0.510
earable-0.508
hw-0.504
Token
Token ID198
Feature activation+2.41e-01

Pos logit contributions
wit+0.796
 Hud+0.784
 Ow+0.784
earable+0.783
hw+0.780
Neg logit contributions
advertisement-0.811
bourg-0.768
-0.752
quished-0.714
Lex-0.637
Token
Token ID198
Feature activation+2.59e-01

Pos logit contributions
wit+0.691
 Hud+0.685
 Ow+0.683
earable+0.675
hw+0.674
Neg logit contributions
advertisement-0.803
bourg-0.769
-0.748
quished-0.723
Lex-0.642
TokenGil
Token ID40747
Feature activation+1.39e-01

Pos logit contributions
wit+0.808
 Ow+0.807
 Hud+0.804
earable+0.797
hw+0.793
Neg logit contributions
advertisement-0.799
bourg-0.748
-0.732
quished-0.693
Lex-0.624
Token general
Token ID2276
Feature activation+3.00e-01

Pos logit contributions
wit+0.072
 Hud+0.071
 Ow+0.071
earable+0.070
hw+0.070
Neg logit contributions
advertisement-0.114
bourg-0.110
-0.107
quished-0.103
Lex-0.094
Token to
Token ID284
Feature activation+8.76e-02

Pos logit contributions
wit+1.034
 Hud+1.030
 Ow+1.027
earable+1.013
hw+1.011
Neg logit contributions
advertisement-0.821
bourg-0.777
-0.750
quished-0.718
Lex-0.619
Token oppose
Token ID12856
Feature activation+5.42e-01

Pos logit contributions
wit+0.225
 Hud+0.223
 Ow+0.222
earable+0.220
hw+0.219
Neg logit contributions
advertisement-0.316
bourg-0.305
-0.296
quished-0.285
Lex-0.259
Token the
Token ID262
Feature activation+3.65e-01

Pos logit contributions
wit+1.532
 Hud+1.517
 Ow+1.512
earable+1.500
hw+1.495
Neg logit contributions
advertisement-1.822
bourg-1.743
-1.694
quished-1.631
Lex-1.462
Token confirmation
Token ID12641
Feature activation+3.93e-01

Pos logit contributions
wit+1.086
 Hud+1.076
 Ow+1.073
earable+1.065
hw+1.062
Neg logit contributions
advertisement-1.175
bourg-1.121
-1.089
quished-1.045
Lex-0.932
Token of
Token ID286
Feature activation+1.09e+00

Pos logit contributions
wit+1.430
 Hud+1.423
 Ow+1.414
earable+1.401
hw+1.397
Neg logit contributions
advertisement-1.001
bourg-0.948
-0.913
quished-0.864
Lex-0.740
Token Mer
Token ID4638
Feature activation+2.15e-01

Pos logit contributions
 Hud+2.864
 Ow+2.848
wit+2.845
earable+2.772
hw+2.755
Neg logit contributions
advertisement-3.876
bourg-3.700
-3.631
quished-3.471
xi-3.122
Tokenrick
Token ID5557
Feature activation+4.54e-01

Pos logit contributions
wit+0.732
 Hud+0.730
 Ow+0.728
earable+0.723
hw+0.719
Neg logit contributions
advertisement-0.593
bourg-0.563
-0.542
quished-0.519
Lex-0.451
Token Garland
Token ID35466
Feature activation-4.49e-02

Pos logit contributions
wit+0.753
 Hud+0.744
 Ow+0.739
earable+0.723
hw+0.717
Neg logit contributions
advertisement-2.062
bourg-1.992
-1.956
quished-1.902
Lex-1.756
Token to
Token ID284
Feature activation+9.72e-02

Pos logit contributions
advertisement+0.128
bourg+0.122
+0.118
quished+0.112
Lex+0.098
Neg logit contributions
wit-0.150
 Hud-0.149
 Ow-0.148
earable-0.147
hw-0.147
Token the
Token ID262
Feature activation+2.20e-01

Pos logit contributions
wit+0.283
 Hud+0.280
 Ow+0.279
earable+0.278
hw+0.277
Neg logit contributions
advertisement-0.317
bourg-0.304
-0.294
quished-0.282
 ASUS-0.254
Token In
Token ID554
Feature activation-4.44e-02

Pos logit contributions
advertisement+1.149
bourg+1.076
+1.057
quished+0.991
 Secret+0.914
Neg logit contributions
wit-1.153
earable-1.149
hw-1.133
 Hud-1.133
reads-1.133
Token fact
Token ID1109
Feature activation+5.68e-01

Pos logit contributions
advertisement+0.156
bourg+0.149
+0.146
quished+0.139
xi+0.127
Neg logit contributions
wit-0.118
earable-0.117
 Hud-0.117
 Ow-0.116
hw-0.116
Token,
Token ID11
Feature activation+8.34e-02

Pos logit contributions
wit+1.823
 Hud+1.813
 Ow+1.808
hw+1.783
earable+1.774
Neg logit contributions
advertisement-1.692
bourg-1.607
-1.551
quished-1.485
Lex-1.304
Token nobody
Token ID8168
Feature activation-1.03e-01

Pos logit contributions
wit+0.267
earable+0.264
 Hud+0.263
 Ow+0.262
reads+0.262
Neg logit contributions
advertisement-0.250
bourg-0.236
-0.229
quished-0.218
Lex-0.195
Token in
Token ID287
Feature activation+2.41e-01

Pos logit contributions
advertisement+0.340
bourg+0.323
+0.315
quished+0.300
Lex+0.271
Neg logit contributions
wit-0.298
 Hud-0.295
 Ow-0.294
earable-0.293
hw-0.291
Token our
Token ID674
Feature activation+1.68e-01

Pos logit contributions
wit+0.711
 Hud+0.704
 Ow+0.702
earable+0.697
hw+0.694
Neg logit contributions
advertisement-0.782
bourg-0.746
-0.725
quished-0.696
Lex-0.621
Token company
Token ID1664
Feature activation+1.02e-01

Pos logit contributions
wit+0.425
 Hud+0.419
 Ow+0.418
earable+0.414
hw+0.414
Neg logit contributions
advertisement-0.619
bourg-0.592
-0.578
quished-0.556
Lex-0.505
Token part
Token ID636
Feature activation+1.67e-01

Pos logit contributions
wit+0.293
 Hud+0.290
 Ow+0.289
earable+0.286
hw+0.285
Neg logit contributions
advertisement-0.342
bourg-0.327
-0.318
quished-0.305
Lex-0.274
Tokenakes
Token ID1124
Feature activation+1.07e-01

Pos logit contributions
wit+0.429
 Hud+0.425
 Ow+0.423
earable+0.419
hw+0.417
Neg logit contributions
advertisement-0.608
bourg-0.583
-0.568
quished-0.549
Lex-0.496
Token in
Token ID287
Feature activation-5.71e-02

Pos logit contributions
wit+0.348
 Hud+0.346
 Ow+0.344
earable+0.341
hw+0.341
Neg logit contributions
advertisement-0.316
bourg-0.301
-0.291
quished-0.279
Lex-0.245
Token marijuana
Token ID5727
Feature activation-2.75e-01

Pos logit contributions
advertisement+0.183
bourg+0.174
+0.169
quished+0.161
 ASUS+0.144
Neg logit contributions
wit-0.171
 Hud-0.169
 Ow-0.168
earable-0.168
hw-0.167
Token at
Token ID379
Feature activation-7.01e-02

Pos logit contributions
advertisement+0.255
bourg+0.242
+0.234
quished+0.224
Lex+0.197
Neg logit contributions
wit-0.286
 Hud-0.285
 Ow-0.283
earable-0.282
hw-0.280
Token 1
Token ID352
Feature activation+1.72e-02

Pos logit contributions
advertisement+0.209
bourg+0.199
+0.193
quished+0.184
 ASUS+0.162
Neg logit contributions
wit-0.224
 Hud-0.222
 Ow-0.221
earable-0.221
hw-0.219
Tokenpm
Token ID4426
Feature activation-1.88e-01

Pos logit contributions
wit+0.057
 Hud+0.057
 Ow+0.057
earable+0.056
reads+0.056
Neg logit contributions
advertisement-0.050
bourg-0.047
-0.045
quished-0.043
Lex-0.038
Token.
Token ID13
Feature activation+3.50e-01

Pos logit contributions
advertisement+0.529
bourg+0.505
+0.485
quished+0.462
 ASUS+0.407
Neg logit contributions
wit-0.630
 Hud-0.626
earable-0.622
 Ow-0.621
reads-0.618
Token Email
Token ID9570
Feature activation+6.04e-01

Pos logit contributions
wit+1.039
 Hud+1.030
 Ow+1.025
earable+1.021
hw+1.016
Neg logit contributions
advertisement-1.126
bourg-1.075
-1.044
quished-1.001
Lex-0.892
Token questions
Token ID2683
Feature activation+4.73e-01

Pos logit contributions
wit+1.685
 Hud+1.672
 Ow+1.667
earable+1.647
hw+1.643
Neg logit contributions
advertisement-2.053
bourg-1.965
-1.913
quished-1.844
Lex-1.650
Token to
Token ID284
Feature activation+7.17e-02

Pos logit contributions
wit+1.441
 Hud+1.423
 Ow+1.422
hw+1.408
reads+1.401
Neg logit contributions
advertisement-1.487
bourg-1.419
-1.377
quished-1.325
Lex-1.166
Token your
Token ID534
Feature activation+8.91e-02

Pos logit contributions
wit+0.196
earable+0.194
 Hud+0.193
 Ow+0.192
hw+0.191
Neg logit contributions
advertisement-0.248
bourg-0.237
-0.230
quished-0.221
 Secret-0.201
Tokentravel
Token ID35927
Feature activation+5.81e-03

Pos logit contributions
wit+0.264
 Hud+0.261
 Ow+0.260
earable+0.258
hw+0.257
Neg logit contributions
advertisement-0.289
bourg-0.275
-0.267
quished-0.257
Lex-0.229
Tokens
Token ID82
Feature activation+1.32e-02

Pos logit contributions
wit+0.012
 Hud+0.012
 Ow+0.012
earable+0.012
hw+0.012
Neg logit contributions
advertisement-0.024
bourg-0.023
-0.022
quished-0.021
Lex-0.020
Token@
Token ID31
Feature activation-1.77e-01

Pos logit contributions
wit+0.028
 Hud+0.027
 Ow+0.027
earable+0.027
hw+0.026
Neg logit contributions
advertisement-0.054
bourg-0.052
-0.051
quished-0.049
Lex-0.045
Token hit
Token ID2277
Feature activation+4.74e-01

Pos logit contributions
advertisement+0.087
bourg+0.084
+0.081
quished+0.077
 desp+0.071
Neg logit contributions
wit-0.070
 Ow-0.069
 Hud-0.069
reads-0.069
hw-0.068
Token if
Token ID611
Feature activation+2.42e-01

Pos logit contributions
wit+1.482
 Hud+1.475
 Ow+1.467
earable+1.449
hw+1.448
Neg logit contributions
advertisement-1.448
bourg-1.381
-1.340
quished-1.287
Lex-1.133
Token L
Token ID406
Feature activation+6.59e-01

Pos logit contributions
wit+0.801
 Ow+0.793
 Hud+0.791
earable+0.789
reads+0.786
Neg logit contributions
advertisement-0.695
bourg-0.655
-0.637
quished-0.608
Lex-0.534
Tokenaine
Token ID5718
Feature activation+3.88e-01

Pos logit contributions
wit+1.710
earable+1.680
 Hud+1.679
hw+1.674
 Ow+1.653
Neg logit contributions
advertisement-2.346
bourg-2.255
-2.221
quished-2.152
 ASUS-1.934
Token hits
Token ID7127
Feature activation+5.27e-01

Pos logit contributions
wit+1.090
 Hud+1.076
 Ow+1.075
earable+1.068
hw+1.062
Neg logit contributions
advertisement-1.316
bourg-1.256
-1.226
quished-1.174
Lex-1.056
Token his
Token ID465
Feature activation+3.74e-01

Pos logit contributions
wit+1.541
 Hud+1.534
 Ow+1.526
earable+1.511
hw+1.505
Neg logit contributions
advertisement-1.717
bourg-1.640
-1.594
quished-1.536
Lex-1.366
Token offensive
Token ID5859
Feature activation+1.20e-01

Pos logit contributions
wit+0.895
 Ow+0.881
 Hud+0.880
earable+0.873
hw+0.871
Neg logit contributions
advertisement-1.418
bourg-1.359
-1.329
quished-1.282
Lex-1.172
Token stride
Token ID33769
Feature activation+4.50e-01

Pos logit contributions
wit+0.310
 Ow+0.305
 Hud+0.304
earable+0.303
hw+0.302
Neg logit contributions
advertisement-0.431
bourg-0.412
-0.405
quished-0.388
Lex-0.353
Token to
Token ID284
Feature activation+5.65e-02

Pos logit contributions
wit+1.365
 Hud+1.355
 Ow+1.349
hw+1.333
earable+1.333
Neg logit contributions
advertisement-1.423
bourg-1.358
-1.316
quished-1.269
Lex-1.123
Token go
Token ID467
Feature activation-7.91e-02

Pos logit contributions
wit+0.147
 Ow+0.145
 Hud+0.145
reads+0.144
earable+0.144
Neg logit contributions
advertisement-0.203
bourg-0.194
-0.189
quished-0.180
 ASUS-0.165
Token along
Token ID1863
Feature activation-2.19e-01

Pos logit contributions
advertisement+0.254
bourg+0.244
+0.238
quished+0.227
Lex+0.205
Neg logit contributions
wit-0.232
earable-0.229
 Ow-0.228
 Hud-0.227
reads-0.227
Token shelling
Token ID38406
Feature activation+3.28e-03

Pos logit contributions
wit+0.330
 Hud+0.326
 Ow+0.325
earable+0.324
hw+0.323
Neg logit contributions
advertisement-0.358
bourg-0.341
-0.332
quished-0.317
Lex-0.284
Token out
Token ID503
Feature activation+3.23e-01

Pos logit contributions
wit+0.011
 Hud+0.011
 Ow+0.011
earable+0.011
hw+0.011
Neg logit contributions
advertisement-0.009
bourg-0.008
-0.008
quished-0.008
Lex-0.007
Token to
Token ID284
Feature activation+3.42e-01

Pos logit contributions
wit+1.031
 Hud+1.022
 Ow+1.020
earable+1.009
hw+1.006
Neg logit contributions
advertisement-0.968
bourg-0.922
-0.896
quished-0.857
Lex-0.754
Token the
Token ID262
Feature activation+1.97e-01

Pos logit contributions
wit+0.882
 Hud+0.876
 Ow+0.871
earable+0.858
hw+0.856
Neg logit contributions
advertisement-1.238
bourg-1.187
-1.159
quished-1.119
Lex-1.006
Token cable
Token ID7862
Feature activation+1.85e-02

Pos logit contributions
wit+0.592
 Hud+0.588
 Ow+0.586
earable+0.580
hw+0.578
Neg logit contributions
advertisement-0.627
bourg-0.598
-0.581
quished-0.558
Lex-0.495
Token company
Token ID1664
Feature activation+2.40e-02

Pos logit contributions
wit+0.053
 Hud+0.052
 Ow+0.052
earable+0.052
hw+0.052
Neg logit contributions
advertisement-0.062
bourg-0.059
-0.057
quished-0.055
Lex-0.049
Token each
Token ID1123
Feature activation-5.18e-01

Pos logit contributions
wit+0.075
 Hud+0.074
 Ow+0.074
earable+0.073
hw+0.073
Neg logit contributions
advertisement-0.074
bourg-0.070
-0.068
quished-0.065
Lex-0.058
Token month
Token ID1227
Feature activation-2.09e-01

Pos logit contributions
advertisement+1.507
bourg+1.418
+1.356
quished+1.289
 ASUS+1.167
Neg logit contributions
wit-1.683
earable-1.674
hw-1.666
 Hud-1.662
 Ow-1.660
Token.
Token ID13
Feature activation+3.66e-01

Pos logit contributions
advertisement+0.637
bourg+0.604
+0.587
quished+0.560
 ASUS+0.496
Neg logit contributions
wit-0.656
 Hud-0.653
 Ow-0.649
earable-0.648
hw-0.644
Token
Token ID198
Feature activation+4.59e-02

Pos logit contributions
wit+1.098
 Hud+1.088
 Ow+1.084
earable+1.078
hw+1.073
Neg logit contributions
advertisement-1.170
bourg-1.114
-1.083
quished-1.038
Lex-0.925
Token
Token ID198
Feature activation+8.29e-02

Pos logit contributions
wit+0.134
 Hud+0.133
 Ow+0.132
earable+0.131
hw+0.130
Neg logit contributions
advertisement-0.151
bourg-0.144
-0.140
quished-0.134
Lex-0.120
Token life
Token ID1204
Feature activation-6.47e-02

Pos logit contributions
advertisement+0.534
bourg+0.506
+0.490
quished+0.466
 ASUS+0.409
Neg logit contributions
wit-0.647
 Hud-0.638
 Ow-0.636
earable-0.633
hw-0.632
Token and
Token ID290
Feature activation-6.79e-02

Pos logit contributions
advertisement+0.190
bourg+0.179
+0.174
quished+0.166
 ASUS+0.146
Neg logit contributions
wit-0.211
 Hud-0.210
earable-0.208
 Ow-0.208
hw-0.207
Token it
Token ID340
Feature activation+8.99e-02

Pos logit contributions
advertisement+0.211
bourg+0.201
+0.194
quished+0.185
 ASUS+0.167
Neg logit contributions
wit-0.209
 Hud-0.206
earable-0.205
 Ow-0.205
hw-0.204
Token was
Token ID373
Feature activation-1.61e-01

Pos logit contributions
wit+0.280
 Hud+0.278
 Ow+0.276
earable+0.276
hw+0.273
Neg logit contributions
advertisement-0.276
bourg-0.263
-0.255
quished-0.243
 ASUS-0.216
Token all
Token ID477
Feature activation-6.42e-01

Pos logit contributions
advertisement+0.520
bourg+0.497
+0.481
quished+0.461
 ASUS+0.416
Neg logit contributions
wit-0.474
 Hud-0.467
 Ow-0.464
earable-0.464
reads-0.463
Token thanks
Token ID5176
Feature activation+1.55e-01

Pos logit contributions
advertisement+2.031
bourg+1.930
+1.852
quished+1.781
 ASUS+1.623
Neg logit contributions
wit-1.927
earable-1.891
hw-1.886
 Hud-1.885
reads-1.883
Token to
Token ID284
Feature activation+4.73e-01

Pos logit contributions
wit+0.555
 Ow+0.554
 Hud+0.553
earable+0.546
reads+0.544
Neg logit contributions
advertisement-0.408
bourg-0.383
-0.372
quished-0.350
ipel-0.301
Token this
Token ID428
Feature activation+7.24e-02

Pos logit contributions
wit+1.402
 Hud+1.398
 Ow+1.394
earable+1.372
hw+1.367
Neg logit contributions
advertisement-1.522
bourg-1.456
-1.413
quished-1.363
Lex-1.206
Token guy
Token ID3516
Feature activation+5.65e-01

Pos logit contributions
wit+0.209
 Hud+0.207
 Ow+0.206
earable+0.205
hw+0.204
Neg logit contributions
advertisement-0.239
bourg-0.228
-0.221
quished-0.212
 ASUS-0.191
Token.
Token ID13
Feature activation+1.57e-01

Pos logit contributions
wit+1.667
 Ow+1.659
 Hud+1.658
hw+1.614
earable+1.611
Neg logit contributions
advertisement-1.830
bourg-1.755
-1.700
quished-1.652
xi-1.450
Token Not
Token ID1892
Feature activation+7.92e-02

Pos logit contributions
wit+0.473
 Hud+0.467
earable+0.466
 Ow+0.464
hw+0.463
Neg logit contributions
advertisement-0.499
bourg-0.474
-0.462
quished-0.440
 ASUS-0.395
Token a
Token ID257
Feature activation-1.39e-01

Pos logit contributions
advertisement+0.482
bourg+0.462
+0.447
quished+0.427
 ASUS+0.393
Neg logit contributions
wit-0.412
 Hud-0.405
earable-0.402
 Ow-0.402
hw-0.401
Token much
Token ID881
Feature activation+1.54e-01

Pos logit contributions
advertisement+0.477
bourg+0.456
+0.445
quished+0.423
 Secret+0.390
Neg logit contributions
wit-0.383
 Ow-0.375
hw-0.375
 Hud-0.374
reads-0.374
Token wider
Token ID10595
Feature activation+1.34e-01

Pos logit contributions
wit+0.395
 Hud+0.391
 Ow+0.390
earable+0.386
hw+0.385
Neg logit contributions
advertisement-0.557
bourg-0.535
-0.521
quished-0.503
Lex-0.455
Token perspective
Token ID6650
Feature activation+2.38e-02

Pos logit contributions
wit+0.257
 Hud+0.251
 Ow+0.250
earable+0.248
hw+0.247
Neg logit contributions
advertisement-0.576
bourg-0.555
-0.542
quished-0.525
 ASUS-0.489
Token from
Token ID422
Feature activation+9.91e-02

Pos logit contributions
wit+0.075
 Hud+0.074
 Ow+0.074
earable+0.074
hw+0.073
Neg logit contributions
advertisement-0.072
bourg-0.069
-0.067
quished-0.064
 ASUS-0.056
Token a
Token ID257
Feature activation-2.19e-02

Pos logit contributions
wit+0.292
 Hud+0.287
 Ow+0.285
hw+0.285
earable+0.284
Neg logit contributions
advertisement-0.322
bourg-0.307
-0.299
quished-0.284
 ASUS-0.259
Token very
Token ID845
Feature activation+4.75e-02

Pos logit contributions
advertisement+0.075
bourg+0.071
+0.070
quished+0.066
 ASUS+0.061
Neg logit contributions
wit-0.060
 Hud-0.060
 Ow-0.059
earable-0.059
hw-0.059
Token small
Token ID1402
Feature activation+2.80e-01

Pos logit contributions
wit+0.136
 Hud+0.134
 Ow+0.133
earable+0.133
hw+0.132
Neg logit contributions
advertisement-0.157
bourg-0.150
-0.146
quished-0.140
 ASUS-0.127
Token lens
Token ID10317
Feature activation-5.60e-01

Pos logit contributions
wit+0.757
 Hud+0.750
 Ow+0.747
earable+0.740
hw+0.738
Neg logit contributions
advertisement-0.979
bourg-0.937
-0.913
quished-0.879
Lex-0.791
Token.
Token ID13
Feature activation+3.09e-01

Pos logit contributions
advertisement+1.705
bourg+1.606
+1.547
quished+1.469
xi+1.351
Neg logit contributions
earable-1.738
 Hud-1.729
wit-1.719
reads-1.715
hw-1.701
Token Image
Token ID7412
Feature activation-3.99e-01

Pos logit contributions
wit+1.004
 Hud+0.992
 Ow+0.987
earable+0.986
hw+0.982
Neg logit contributions
advertisement-0.907
bourg-0.861
-0.835
quished-0.795
 ASUS-0.704
Tokenirc
Token ID1980
Feature activation-1.89e-01

Pos logit contributions
wit+0.045
 Hud+0.045
 Ow+0.045
earable+0.044
hw+0.044
Neg logit contributions
advertisement-0.055
bourg-0.053
-0.051
quished-0.049
Lex-0.044
Tokenulating
Token ID8306
Feature activation-1.41e-01

Pos logit contributions
advertisement+0.869
bourg+0.849
+0.831
quished+0.801
antic+0.748
Neg logit contributions
earable-0.288
reads-0.286
wit-0.285
 Hud-0.284
 Ow-0.282
Token for
Token ID329
Feature activation+3.82e-01

Pos logit contributions
advertisement+0.445
bourg+0.428
+0.411
quished+0.392
 Secret+0.356
Neg logit contributions
wit-0.421
earable-0.420
hw-0.414
reads-0.413
 Hud-0.412
Token the
Token ID262
Feature activation-9.17e-02

Pos logit contributions
wit+1.077
earable+1.062
 Hud+1.060
 Ow+1.057
hw+1.050
Neg logit contributions
advertisement-1.281
bourg-1.231
-1.193
quished-1.146
Lex-1.029
Token spring
Token ID6076
Feature activation+1.29e-01

Pos logit contributions
advertisement+0.299
bourg+0.287
+0.276
quished+0.264
 Secret+0.245
Neg logit contributions
wit-0.264
earable-0.263
hw-0.260
reads-0.258
 Hud-0.256
Token election
Token ID3071
Feature activation-2.19e-01

Pos logit contributions
earable+0.375
wit+0.373
 Hud+0.368
 Ow+0.366
hw+0.365
Neg logit contributions
advertisement-0.418
bourg-0.407
-0.391
quished-0.373
 Secret-0.337
Token,"
Token ID553
Feature activation+1.82e-01

Pos logit contributions
advertisement+0.693
bourg+0.672
+0.638
quished+0.614
 Secret+0.557
Neg logit contributions
wit-0.652
earable-0.652
 Hud-0.642
hw-0.642
reads-0.639
Token L
Token ID406
Feature activation+2.26e-01

Pos logit contributions
wit+0.600
 Hud+0.594
 Ow+0.592
earable+0.590
hw+0.588
Neg logit contributions
advertisement-0.528
bourg-0.501
-0.483
quished-0.463
Lex-0.405
Tokenask
Token ID2093
Feature activation+2.03e-02

Pos logit contributions
wit+0.647
 Hud+0.643
 Ow+0.643
earable+0.635
hw+0.630
Neg logit contributions
advertisement-0.749
bourg-0.715
-0.689
quished-0.666
xi-0.598
Tokener
Token ID263
Feature activation-6.35e-02

Pos logit contributions
wit+0.057
 Hud+0.057
 Ow+0.057
earable+0.056
reads+0.056
Neg logit contributions
advertisement-0.068
bourg-0.065
-0.063
quished-0.061
xi-0.055
Token said
Token ID531
Feature activation-1.28e-01

Pos logit contributions
advertisement+0.219
bourg+0.210
+0.204
quished+0.197
xi+0.177
Neg logit contributions
wit-0.173
 Hud-0.172
earable-0.171
 Ow-0.171
hw-0.169
Token by
Token ID416
Feature activation-2.22e-02

Pos logit contributions
wit+0.058
 Hud+0.057
 Ow+0.057
earable+0.057
hw+0.057
Neg logit contributions
advertisement-0.057
bourg-0.054
-0.052
quished-0.050
Lex-0.044
Token Russia
Token ID3284
Feature activation+9.98e-02

Pos logit contributions
advertisement+0.066
bourg+0.063
+0.061
quished+0.058
 Secret+0.052
Neg logit contributions
wit-0.071
earable-0.070
 Ow-0.070
 Hud-0.069
hw-0.069
Token need
Token ID761
Feature activation-1.40e-02

Pos logit contributions
wit+0.306
 Hud+0.303
 Ow+0.302
earable+0.300
hw+0.299
Neg logit contributions
advertisement-0.312
bourg-0.297
-0.289
quished-0.277
Lex-0.245
Token to
Token ID284
Feature activation+1.12e-01

Pos logit contributions
advertisement+0.038
bourg+0.036
+0.034
quished+0.033
Lex+0.028
Neg logit contributions
wit-0.049
 Hud-0.048
 Ow-0.048
hw-0.048
earable-0.048
Token be
Token ID307
Feature activation-1.95e-03

Pos logit contributions
wit+0.340
 Hud+0.338
 Ow+0.337
earable+0.335
hw+0.333
Neg logit contributions
advertisement-0.354
bourg-0.338
-0.328
quished-0.313
Lex-0.279
Token met
Token ID1138
Feature activation-5.69e-01

Pos logit contributions
advertisement+0.006
bourg+0.006
+0.006
quished+0.006
Lex+0.005
Neg logit contributions
wit-0.006
 Hud-0.006
 Ow-0.006
earable-0.006
hw-0.006
Token by
Token ID416
Feature activation+1.63e-01

Pos logit contributions
advertisement+1.809
bourg+1.724
+1.664
quished+1.603
 desp+1.444
Neg logit contributions
earable-1.694
wit-1.680
 Hud-1.677
 Ow-1.675
reads-1.650
Token American
Token ID1605
Feature activation+1.57e-02

Pos logit contributions
wit+0.492
earable+0.487
 Hud+0.483
 Ow+0.483
hw+0.481
Neg logit contributions
advertisement-0.519
bourg-0.496
-0.479
quished-0.458
 Secret-0.412
Token strength
Token ID4202
Feature activation-1.52e-01

Pos logit contributions
wit+0.037
earable+0.036
 Hud+0.036
 Ow+0.036
hw+0.036
Neg logit contributions
advertisement-0.061
bourg-0.059
-0.057
quished-0.055
 Secret-0.050
Token.
Token ID13
Feature activation-1.14e-01

Pos logit contributions
advertisement+0.456
bourg+0.435
+0.420
quished+0.401
Lex+0.355
Neg logit contributions
wit-0.485
 Hud-0.483
 Ow-0.480
earable-0.480
hw-0.476
Token If
Token ID1002
Feature activation-2.08e-01

Pos logit contributions
advertisement+0.356
bourg+0.339
+0.331
quished+0.312
 Secret+0.283
Neg logit contributions
wit-0.348
earable-0.345
 Hud-0.340
 Ow-0.340
hw-0.340
Token it
Token ID340
Feature activation+3.60e-02

Pos logit contributions
wit+0.725
 Hud+0.719
 Ow+0.716
earable+0.712
hw+0.709
Neg logit contributions
advertisement-0.699
bourg-0.665
-0.644
quished-0.617
Lex-0.546
Token's
Token ID338
Feature activation-1.37e-01

Pos logit contributions
wit+0.111
 Hud+0.110
earable+0.110
 Ow+0.109
hw+0.109
Neg logit contributions
advertisement-0.111
bourg-0.106
-0.103
quished-0.098
 ASUS-0.088
Token true
Token ID2081
Feature activation-2.46e-01

Pos logit contributions
advertisement+0.438
bourg+0.418
+0.405
quished+0.385
 ASUS+0.353
Neg logit contributions
wit-0.407
earable-0.400
 Hud-0.399
reads-0.398
 Ow-0.396
Token,
Token ID11
Feature activation-2.95e-01

Pos logit contributions
advertisement+0.731
bourg+0.694
+0.672
quished+0.644
 ASUS+0.571
Neg logit contributions
wit-0.784
 Hud-0.778
earable-0.774
 Ow-0.773
reads-0.769
Token &
Token ID1222
Feature activation-2.01e-01

Pos logit contributions
advertisement+0.872
bourg+0.828
+0.800
quished+0.763
 ASUS+0.690
Neg logit contributions
wit-0.947
earable-0.936
reads-0.930
 Hud-0.929
 Ow-0.929
Token I
Token ID314
Feature activation-2.91e-01

Pos logit contributions
advertisement+0.649
bourg+0.620
+0.599
quished+0.576
 ASUS+0.518
Neg logit contributions
wit-0.590
 Hud-0.585
earable-0.583
 Ow-0.583
hw-0.578
Token am
Token ID716
Feature activation-2.55e-02

Pos logit contributions
advertisement+0.940
bourg+0.911
+0.875
quished+0.840
 ASUS+0.772
Neg logit contributions
wit-0.839
earable-0.833
 Hud-0.827
 Ow-0.821
reads-0.820
Token UN
Token ID4725
Feature activation-3.63e-01

Pos logit contributions
advertisement+0.087
bourg+0.085
+0.082
quished+0.079
 desp+0.072
Neg logit contributions
wit-0.069
earable-0.069
 Hud-0.068
 Ow-0.067
reads-0.067
TokenB
Token ID33
Feature activation-1.32e-01

Pos logit contributions
advertisement+1.144
bourg+1.075
+1.053
quished+1.011
Lex+0.912
Neg logit contributions
 Hud-1.103
wit-1.095
earable-1.090
 Ow-1.088
reads-1.075
TokenEL
Token ID3698
Feature activation-5.82e-01

Pos logit contributions
advertisement+0.330
bourg+0.306
+0.295
quished+0.286
 ASUS+0.241
Neg logit contributions
wit-0.489
 Hud-0.484
hw-0.480
 Ow-0.479
earable-0.476
TokenIE
Token ID10008
Feature activation+1.38e-01

Pos logit contributions
advertisement+1.882
bourg+1.775
+1.717
quished+1.681
 ASUS+1.501
Neg logit contributions
wit-1.720
hw-1.699
reads-1.692
 Hud-1.691
earable-1.685
Tokenā
Token ID10235
Feature activation+2.18e-01

Pos logit contributions
advertisement+0.087
bourg+0.084
+0.081
quished+0.079
xi+0.073
Neg logit contributions
 Hud-0.042
wit-0.042
 Ow-0.041
earable-0.041
hw-0.040
Token m
Token ID285
Feature activation-2.85e-01

Pos logit contributions
wit+0.662
 Hud+0.657
 Ow+0.654
earable+0.653
reads+0.647
Neg logit contributions
advertisement-0.686
bourg-0.654
-0.637
quished-0.608
xi-0.542
Tokenai
Token ID1872
Feature activation+1.06e-01

Pos logit contributions
advertisement+0.984
bourg+0.941
+0.921
quished+0.883
antic+0.805
Neg logit contributions
 Hud-0.764
wit-0.762
 Ow-0.761
earable-0.759
reads-0.745
Tokenā
Token ID10235
Feature activation-2.25e-02

Pos logit contributions
wit+0.281
 Hud+0.279
earable+0.278
 Ow+0.278
reads+0.275
Neg logit contributions
advertisement-0.371
bourg-0.355
-0.347
quished-0.333
xi-0.301
Token Hal
Token ID11023
Feature activation-3.34e-01

Pos logit contributions
advertisement+0.074
bourg+0.071
+0.069
quished+0.066
xi+0.059
Neg logit contributions
wit-0.065
 Hud-0.065
 Ow-0.064
earable-0.064
reads-0.064
Tokenep
Token ID538
Feature activation-2.77e-01

Pos logit contributions
advertisement+0.944
bourg+0.887
+0.868
quished+0.822
xi+0.732
Neg logit contributions
 Hud-1.110
 Ow-1.105
earable-1.102
wit-1.101
reads-1.082
Tokenā
Token ID10235
Feature activation+3.87e-02

Pos logit contributions
advertisement+1.097
bourg+1.050
+1.023
quished+0.993
ipel+0.917
Neg logit contributions
 Hud-0.608
wit-0.607
 Ow-0.603
earable-0.598
hw-0.585
Tokeniw
Token ID14246
Feature activation+7.45e-02

Pos logit contributions
wit+0.122
 Hud+0.122
 Ow+0.121
earable+0.121
reads+0.120
Neg logit contributions
advertisement-0.117
bourg-0.111
-0.108
quished-0.103
xi-0.091
Tokeni
Token ID72
Feature activation-3.08e-01

Pos logit contributions
 Hud+0.195
wit+0.195
 Ow+0.194
earable+0.192
reads+0.190
Neg logit contributions
advertisement-0.264
bourg-0.252
-0.245
quished-0.237
xi-0.214
Token l
Token ID300
Feature activation-5.24e-02

Pos logit contributions
advertisement+1.032
bourg+0.982
+0.963
quished+0.915
xi+0.836
Neg logit contributions
wit-0.855
earable-0.854
 Hud-0.845
reads-0.837
 Ow-0.837
Tokenā
Token ID10235
Feature activation+1.18e-01

Pos logit contributions
advertisement+0.171
bourg+0.162
+0.158
quished+0.151
xi+0.137
Neg logit contributions
 Hud-0.154
wit-0.153
 Ow-0.151
earable-0.151
reads-0.149
TokenS
Token ID50
Feature activation-1.72e-01

Pos logit contributions
advertisement+1.361
bourg+1.292
+1.242
quished+1.184
Lex+1.036
Neg logit contributions
wit-1.715
 Hud-1.715
 Ow-1.701
earable-1.694
hw-1.685
Token.
Token ID13
Feature activation-2.82e-01

Pos logit contributions
advertisement+0.526
bourg+0.502
+0.486
quished+0.467
Lex+0.412
Neg logit contributions
wit-0.539
 Hud-0.535
 Ow-0.533
earable-0.528
hw-0.527
Token economist
Token ID17646
Feature activation+4.38e-01

Pos logit contributions
advertisement+0.947
bourg+0.911
+0.881
quished+0.846
 Secret+0.769
Neg logit contributions
wit-0.791
 Hud-0.781
 Ow-0.778
earable-0.777
hw-0.775
Token Paul
Token ID3362
Feature activation-1.13e-02

Pos logit contributions
wit+1.024
 Hud+1.014
 Ow+1.009
earable+0.998
hw+0.994
Neg logit contributions
advertisement-1.685
bourg-1.620
-1.582
quished-1.530
Lex-1.392
Token Krugman
Token ID46311
Feature activation-2.19e-01

Pos logit contributions
advertisement+0.038
bourg+0.037
+0.036
quished+0.034
xi+0.031
Neg logit contributions
wit-0.031
earable-0.031
reads-0.031
hw-0.031
 Ow-0.031
Token dubbed
Token ID17494
Feature activation-1.02e+00

Pos logit contributions
advertisement+0.711
bourg+0.682
+0.664
quished+0.634
 ASUS+0.565
Neg logit contributions
wit-0.639
 Hud-0.633
 Ow-0.630
earable-0.628
hw-0.624
Token the
Token ID262
Feature activation-1.15e-01

Pos logit contributions
advertisement+3.010
bourg+2.857
+2.779
quished+2.560
 Secret+2.411
Neg logit contributions
wit-3.220
earable-3.136
hw-3.127
reads-3.116
 Ow-3.115
Token revision
Token ID18440
Feature activation-7.72e-01

Pos logit contributions
advertisement+0.387
bourg+0.369
+0.360
quished+0.342
 ASUS+0.312
Neg logit contributions
wit-0.325
 Hud-0.318
earable-0.317
hw-0.317
 Ow-0.317
Token from
Token ID422
Feature activation+2.34e-02

Pos logit contributions
advertisement+2.192
bourg+2.105
+2.044
quished+1.915
ipel+1.698
Neg logit contributions
wit-2.532
 Ow-2.511
 Hud-2.503
earable-2.491
hw-2.486
Token an
Token ID281
Feature activation+1.46e-01

Pos logit contributions
wit+0.072
 Hud+0.070
 Ow+0.070
earable+0.070
hw+0.070
Neg logit contributions
advertisement-0.073
bourg-0.070
-0.068
quished-0.065
 ASUS-0.058
Token original
Token ID2656
Feature activation+9.05e-02

Pos logit contributions
wit+0.436
 Hud+0.432
 Ow+0.428
earable+0.426
hw+0.425
Neg logit contributions
advertisement-0.466
bourg-0.443
-0.430
quished-0.414
 ASUS-0.369
Token106
Token ID15801
Feature activation-6.21e-02

Pos logit contributions
advertisement+1.103
bourg+1.051
+1.027
quished+0.995
xi+0.924
Neg logit contributions
 Ow-0.596
wit-0.592
earable-0.589
 Hud-0.586
reads-0.570
Token]
Token ID60
Feature activation+4.31e-02

Pos logit contributions
advertisement+0.260
bourg+0.251
+0.245
quished+0.238
Lex+0.219
Neg logit contributions
wit-0.124
 Hud-0.122
 Ow-0.122
earable-0.121
hw-0.120
Token In
Token ID554
Feature activation-3.42e-01

Pos logit contributions
wit+0.137
 Hud+0.135
earable+0.134
 Ow+0.134
hw+0.134
Neg logit contributions
advertisement-0.130
bourg-0.124
-0.120
quished-0.115
Lex-0.102
Token his
Token ID465
Feature activation-1.58e-01

Pos logit contributions
advertisement+1.122
bourg+1.074
+1.039
quished+1.002
Lex+0.898
Neg logit contributions
wit-0.988
 Hud-0.982
earable-0.973
 Ow-0.969
hw-0.965
Token first
Token ID717
Feature activation-5.72e-02

Pos logit contributions
advertisement+0.441
bourg+0.416
+0.402
quished+0.384
Lex+0.336
Neg logit contributions
wit-0.540
 Hud-0.533
 Ow-0.530
earable-0.528
hw-0.527
Token three
Token ID1115
Feature activation-9.38e-01

Pos logit contributions
advertisement+0.208
bourg+0.199
+0.194
quished+0.187
Lex+0.170
Neg logit contributions
wit-0.146
 Hud-0.145
 Ow-0.144
earable-0.142
hw-0.142
Token games
Token ID1830
Feature activation-5.26e-01

Pos logit contributions
advertisement+3.090
bourg+2.913
+2.835
quished+2.703
 Secret+2.521
Neg logit contributions
wit-2.713
 Hud-2.595
hw-2.590
earable-2.577
eele-2.555
Token he
Token ID339
Feature activation-1.56e-01

Pos logit contributions
advertisement+1.526
bourg+1.443
+1.406
quished+1.351
Lex+1.185
Neg logit contributions
wit-1.720
 Hud-1.697
earable-1.691
hw-1.683
reads-1.682
Token shot
Token ID2823
Feature activation-3.07e-01

Pos logit contributions
advertisement+0.492
bourg+0.473
+0.457
quished+0.440
Lex+0.392
Neg logit contributions
wit-0.469
 Hud-0.462
earable-0.461
 Ow-0.460
reads-0.458
Token 25
Token ID1679
Feature activation+1.27e-01

Pos logit contributions
advertisement+0.915
bourg+0.872
+0.838
quished+0.803
Lex+0.714
Neg logit contributions
wit-0.983
 Hud-0.969
earable-0.966
 Ow-0.963
hw-0.961
Token-
Token ID12
Feature activation+1.18e-01

Pos logit contributions
wit+0.387
 Hud+0.385
 Ow+0.384
earable+0.379
reads+0.378
Neg logit contributions
advertisement-0.399
bourg-0.380
-0.370
quished-0.355
Lex-0.315
Token fn
Token ID24714
Feature activation-6.93e-01

Pos logit contributions
advertisement+1.060
bourg+0.984
+0.957
quished+0.905
 Secret+0.811
Neg logit contributions
wit-1.367
earable-1.345
 Hud-1.345
reads-1.344
 Ow-1.340
Token undo
Token ID23981
Feature activation-5.22e-01

Pos logit contributions
advertisement+2.454
bourg+2.314
+2.251
quished+2.122
 desp+2.039
Neg logit contributions
earable-1.807
wit-1.801
hw-1.794
 Ow-1.788
 Hud-1.768
Token_
Token ID62
Feature activation+5.42e-02

Pos logit contributions
advertisement+1.897
bourg+1.801
+1.761
quished+1.672
Lex+1.546
Neg logit contributions
wit-1.332
 Ow-1.331
 Hud-1.329
earable-1.305
reads-1.296
Tokenmove
Token ID21084
Feature activation-1.36e-01

Pos logit contributions
 Hud+0.136
 Ow+0.136
wit+0.135
earable+0.133
hw+0.131
Neg logit contributions
advertisement-0.201
bourg-0.191
-0.185
quished-0.178
Lex-0.163
Token (
Token ID357
Feature activation-4.77e-01

Pos logit contributions
advertisement+0.382
bourg+0.361
+0.349
quished+0.334
Lex+0.292
Neg logit contributions
wit-0.460
 Hud-0.458
 Ow-0.457
earable-0.452
hw-0.451
Token &
Token ID1222
Feature activation-1.03e+00

Pos logit contributions
advertisement+1.349
bourg+1.272
+1.237
quished+1.180
Lex+1.045
Neg logit contributions
wit-1.571
 Hud-1.566
earable-1.558
 Ow-1.557
reads-1.545
Token mut
Token ID4517
Feature activation-9.51e-01

Pos logit contributions
advertisement+2.859
bourg+2.679
+2.658
quished+2.456
 Secret+2.245
Neg logit contributions
 Hud-3.378
earable-3.370
 Ow-3.356
wit-3.352
reads-3.349
Token self
Token ID2116
Feature activation-1.02e-01

Pos logit contributions
advertisement+2.431
bourg+2.280
+2.274
quished+2.022
 desp+1.977
Neg logit contributions
earable-3.276
reads-3.261
 Hud-3.237
wit-3.232
 Ow-3.199
Token )
Token ID1267
Feature activation-1.09e+00

Pos logit contributions
advertisement+0.312
bourg+0.295
+0.287
quished+0.275
Lex+0.244
Neg logit contributions
wit-0.318
 Hud-0.316
 Ow-0.315
earable-0.313
hw-0.311
Token {
Token ID1391
Feature activation-5.48e-02

Pos logit contributions
advertisement+3.023
bourg+2.775
+2.710
quished+2.499
 Secret+2.285
Neg logit contributions
earable-3.731
wit-3.712
 Hud-3.701
hw-3.649
reads-3.646
Token let
Token ID1309
Feature activation-1.04e-01

Pos logit contributions
advertisement+0.168
bourg+0.161
+0.155
quished+0.148
 Secret+0.134
Neg logit contributions
wit-0.170
earable-0.168
 Hud-0.167
 Ow-0.166
hw-0.166
Token morning
Token ID3329
Feature activation-1.19e-01

Pos logit contributions
wit+0.039
 Hud+0.039
earable+0.038
hw+0.038
 Ow+0.038
Neg logit contributions
advertisement-0.040
bourg-0.038
-0.037
quished-0.035
Lex-0.031
Token and
Token ID290
Feature activation+1.20e-01

Pos logit contributions
advertisement+0.365
bourg+0.347
+0.338
quished+0.321
 ASUS+0.285
Neg logit contributions
wit-0.372
 Hud-0.368
earable-0.366
 Ow-0.366
hw-0.363
Token afternoon
Token ID6672
Feature activation-1.47e-01

Pos logit contributions
wit+0.379
 Hud+0.374
 Ow+0.373
earable+0.371
reads+0.369
Neg logit contributions
advertisement-0.365
bourg-0.348
-0.335
quished-0.321
Lex-0.284
Token peaks
Token ID25740
Feature activation-1.94e-01

Pos logit contributions
advertisement+0.452
bourg+0.430
+0.419
quished+0.399
Lex+0.354
Neg logit contributions
wit-0.458
 Hud-0.454
 Ow-0.450
earable-0.450
hw-0.447
Token has
Token ID468
Feature activation-5.00e-01

Pos logit contributions
advertisement+0.567
bourg+0.538
+0.521
quished+0.496
Lex+0.437
Neg logit contributions
wit-0.633
 Hud-0.628
earable-0.625
 Ow-0.624
hw-0.619
Token increased
Token ID3220
Feature activation-9.30e-01

Pos logit contributions
advertisement+1.644
bourg+1.569
+1.541
quished+1.467
Lex+1.325
Neg logit contributions
wit-1.431
earable-1.428
 Hud-1.419
 Ow-1.397
reads-1.394
Token by
Token ID416
Feature activation-4.44e-01

Pos logit contributions
advertisement+2.894
bourg+2.766
+2.623
quished+2.512
 desp+2.311
Neg logit contributions
earable-2.842
wit-2.821
 Hud-2.755
reads-2.747
hw-2.718
Token nearly
Token ID3016
Feature activation-7.18e-01

Pos logit contributions
advertisement+1.308
bourg+1.238
+1.186
quished+1.125
 Secret+1.023
Neg logit contributions
wit-1.443
earable-1.423
 Hud-1.414
hw-1.409
reads-1.397
Token 50
Token ID2026
Feature activation-3.82e-01

Pos logit contributions
advertisement+2.068
bourg+1.927
+1.848
quished+1.744
 ASUS+1.586
Neg logit contributions
wit-2.376
earable-2.360
 Hud-2.339
hw-2.337
reads-2.311
Token%.
Token ID7225
Feature activation-1.16e-01

Pos logit contributions
advertisement+0.792
bourg+0.735
+0.703
quished+0.656
xi+0.536
Neg logit contributions
wit-1.575
 Hud-1.567
earable-1.561
 Ow-1.559
hw-1.547
Token But
Token ID887
Feature activation+4.66e-01

Pos logit contributions
advertisement+0.365
bourg+0.346
+0.338
quished+0.321
Lex+0.286
Neg logit contributions
wit-0.354
 Hud-0.348
earable-0.347
 Ow-0.346
hw-0.345
Token top
Token ID1353
Feature activation+4.28e-01

Pos logit contributions
advertisement+0.305
bourg+0.290
+0.282
quished+0.269
 ASUS+0.240
Neg logit contributions
wit-0.312
 Hud-0.309
 Ow-0.309
earable-0.306
hw-0.305
Token among
Token ID1871
Feature activation-6.84e-02

Pos logit contributions
wit+1.487
 Hud+1.480
 Ow+1.471
hw+1.462
earable+1.457
Neg logit contributions
advertisement-1.154
bourg-1.099
-1.058
quished-1.010
Lex-0.870
Token the
Token ID262
Feature activation-1.12e-01

Pos logit contributions
advertisement+0.217
bourg+0.207
+0.201
quished+0.193
Lex+0.171
Neg logit contributions
wit-0.207
 Hud-0.205
 Ow-0.205
earable-0.203
hw-0.202
Token industry
Token ID2831
Feature activation+1.95e-01

Pos logit contributions
advertisement+0.361
bourg+0.342
+0.335
quished+0.321
 ASUS+0.290
Neg logit contributions
wit-0.332
 Ow-0.326
 Hud-0.326
earable-0.325
reads-0.322
Token's
Token ID338
Feature activation-1.25e-01

Pos logit contributions
wit+0.581
 Hud+0.575
 Ow+0.573
earable+0.569
hw+0.566
Neg logit contributions
advertisement-0.627
bourg-0.598
-0.581
quished-0.558
Lex-0.497
Token HD
Token ID5572
Feature activation-9.72e-01

Pos logit contributions
advertisement+0.407
bourg+0.386
+0.378
quished+0.361
 ASUS+0.329
Neg logit contributions
wit-0.367
 Hud-0.360
 Ow-0.360
earable-0.360
reads-0.357
Token 78
Token ID8699
Feature activation-2.75e-01

Pos logit contributions
advertisement+3.146
bourg+2.959
+2.957
quished+2.828
 ASUS+2.602
Neg logit contributions
wit-2.825
earable-2.798
 Hud-2.766
reads-2.746
 Ow-2.745
Token70
Token ID2154
Feature activation-2.27e-01

Pos logit contributions
advertisement+1.019
bourg+0.975
+0.957
quished+0.919
xi+0.837
Neg logit contributions
wit-0.682
 Hud-0.677
 Ow-0.674
earable-0.673
reads-0.666
Token based
Token ID1912
Feature activation+2.52e-01

Pos logit contributions
advertisement+0.763
bourg+0.726
+0.710
quished+0.684
 ASUS+0.617
Neg logit contributions
wit-0.646
 Hud-0.636
 Ow-0.635
earable-0.634
reads-0.628
Token graphics
Token ID9382
Feature activation-5.25e-01

Pos logit contributions
wit+0.715
 Hud+0.710
 Ow+0.708
earable+0.700
hw+0.699
Neg logit contributions
advertisement-0.846
bourg-0.810
-0.787
quished-0.757
Lex-0.678
Token cards
Token ID4116
Feature activation-6.37e-01

Pos logit contributions
advertisement+1.876
bourg+1.787
+1.754
quished+1.685
 ASUS+1.543
Neg logit contributions
wit-1.373
earable-1.348
 Ow-1.345
 Hud-1.339
reads-1.339