TOP ACTIVATIONS
MAX = 1.410

child being eaten alive by As mode us . The player
and replaced it with a h odge p odge of new
gets worse , turns into sl ush , and then ,
at Washington University in St . Louis , has also studied
via remote control or rear - mounted buttons , and its
becoming excessively sensitive to perceived sl ights . Three
we begin to see some sl own ess :
Four teenth Amendment 's equal protection clause granted constitutional protections to
supporting her . S ins of omission are as
You tell Res a ech all you know ,
I could not detach the rear clips and for fear of
psychology at Washington University in St . Louis , has also
existing regime with a new short - term disability plan .
sound commercial banking in pretty short order . (" Power tends
the entire defense into a h uddle and delivered a colorful
2 K . S ensitivity to doubled CO 2
s vision for , in her words , a
for the next generation . Rest raining future debt is a
or called female Hillary supporters sl uts and wh ores ?
're almost signed up for News Alert s Keep an eye

MIN ACTIVATIONS
MIN = -1.216

was iconic ( more on that later ), but he couldn
for ds cross ing highly traff icked thorough
's financial counselling service or call K ild on an for
take unto thy self . De uter onomy 20 : 12
in Monday 's incident should call the Long view Police Department
be another place like the White Rabbit .
pay as many bills as it has daily tax receipts .
, 055 , and JPMorgan Chase was the subject of 20
greatest Survivor fan would break the CD in two after playing
; density Grid . Init ( 0 . 0 f )
some universe , but to beat a boot c amped Na
outside of the maze . Doors ' width was set to
signed , and last but not least , remaining Cors airs
CS 3 IV IP 9 ib - Kirk Cousins (@ K
count | | ---- -+ -------------------------------- + ------- | | 1
bers ide and we will take action against those intent on
to prevent people ( W AB L members ) from just
But Mozilla succeeded in breaking the lock Microsoft 's Internet Explorer
erson ic Flight : Breaking the Sound Barrier and Beyond .
money for the likes of De Andre Jordan or La Marcus

INTERVAL 0.754 - 1.410
CONTAINS 0.156%

abyss . The others escape and find refuge in the Elven
has banned the use of the three ne onic otin oid
he has accepted Tal ul ah back into his life .
of the Korea Institute of S & T Evaluation and Planning
future for Tol ui 's eldest son and bestowed on the

INTERVAL 0.097 - 0.754
CONTAINS 33.510%

the tests . All three lenses , at both
cl ojure . l ibraries table . 1 2
. Com , San Antonio Woman , Gam bit Weekly of
tickets purchased before the bombings . He said an FBI agent
have B iz arro Green Arrow in the radius , which

INTERVAL -0.560 - 0.097
CONTAINS 65.211%

really something we can look at for holding EV O in
made major investments in basic science , too .
for women , but according to amd news , neither the
the country can still be molded into a sharp campaign issue
that time [ 10 : 47 ] < met ros xual

INTERVAL -1.216 - -0.560
CONTAINS 1.123%

. To make a long story short I got this while
just p ore over the real estate ads looking at ranch
provided for , and which shall be established by Law
see that at this point in her very lengthy career she
monument honoring an African American or a woman to be installed
Token child
Token ID1200
Feature activation-9.27e-02

Pos logit contributions
��+0.810
oons+0.789
 Authentication+0.779
 Preferences+0.775
elle+0.773
Neg logit contributions
river-0.698
evil-0.668
inav-0.618
thumbnails-0.616
tackle-0.607
Token being
Token ID852
Feature activation+3.16e-01

Pos logit contributions
��+0.755
oons+0.736
 Authentication+0.729
 Preferences+0.713
elle+0.707
Neg logit contributions
river-1.142
evil-1.112
inav-1.036
thumbnails-1.018
tackle-1.017
Token eaten
Token ID17065
Feature activation+3.60e-01

Pos logit contributions
river+2.612
evil+2.519
inav+2.201
alid+2.110
thumbnails+2.102
Neg logit contributions
��-4.009
 Authentication-3.806
oons-3.789
 Preferences-3.755
 Regulatory-3.688
Token alive
Token ID6776
Feature activation+7.15e-03

Pos logit contributions
river+2.440
evil+2.348
inav+1.960
phia+1.828
thumbnails+1.798
Neg logit contributions
��-5.123
 Authentication-4.899
oons-4.885
 Preferences-4.818
 Regulatory-4.814
Token by
Token ID416
Feature activation+2.06e-01

Pos logit contributions
river+0.082
evil+0.079
inav+0.073
thumbnails+0.072
tackle+0.071
Neg logit contributions
��-0.067
oons-0.064
 Authentication-0.063
 Preferences-0.062
illion-0.061
Token As
Token ID1081
Feature activation+1.41e+00

Pos logit contributions
river+2.104
evil+2.055
inav+1.840
alid+1.801
thumbnails+1.791
Neg logit contributions
��-2.127
oons-2.051
 Authentication-2.040
 Preferences-2.030
illion-2.002
Tokenmode
Token ID14171
Feature activation+5.94e-01

Pos logit contributions
river+8.072
evil+8.014
alid+6.641
inav+6.289
fa+6.213
Neg logit contributions
���-19.375
��-19.122
ThumbnailImage-19.043
-18.375
 eleph-17.654
Tokenus
Token ID385
Feature activation+3.54e-01

Pos logit contributions
river+5.108
evil+5.032
inav+4.207
alid+4.137
tre+4.136
Neg logit contributions
���-7.475
��-7.394
 Authentication-7.011
ThumbnailImage-6.958
 Preferences-6.955
Token.
Token ID13
Feature activation+2.06e-02

Pos logit contributions
river+2.843
evil+2.705
inav+2.323
tre+2.254
alid+2.248
Neg logit contributions
��-4.686
���-4.441
 Authentication-4.324
namese-4.318
oons-4.307
Token The
Token ID383
Feature activation+2.17e-01

Pos logit contributions
river+0.250
evil+0.242
inav+0.224
thumbnails+0.219
tackle+0.218
Neg logit contributions
��-0.177
oons-0.170
 Authentication-0.168
 Preferences-0.165
illion-0.162
Token player
Token ID2137
Feature activation-4.65e-02

Pos logit contributions
river+2.126
evil+2.054
inav+1.825
thumbnails+1.782
tackle+1.764
Neg logit contributions
��-2.428
oons-2.305
 Authentication-2.283
 Preferences-2.246
namese-2.207
Token and
Token ID290
Feature activation-2.58e-01

Pos logit contributions
��+3.887
oons+3.856
 Preferences+3.822
 Authentication+3.800
illion+3.712
Neg logit contributions
river-4.755
evil-4.652
inav-4.274
tackle-4.255
thumbnails-4.190
Token replaced
Token ID6928
Feature activation-3.45e-01

Pos logit contributions
��+2.656
 Authentication+2.590
 Preferences+2.582
oons+2.574
 Regulatory+2.557
Neg logit contributions
river-2.612
evil-2.545
inav-2.322
tackle-2.249
thumbnails-2.245
Token it
Token ID340
Feature activation-3.66e-01

Pos logit contributions
��+3.192
 Preferences+3.086
 Authentication+3.065
oons+3.060
 Regulatory+3.032
Neg logit contributions
river-3.816
evil-3.720
inav-3.456
tackle-3.335
thumbnails-3.326
Token with
Token ID351
Feature activation-4.88e-01

Pos logit contributions
��+3.360
 Authentication+3.230
 Preferences+3.220
oons+3.218
elle+3.167
Neg logit contributions
river-4.075
evil-3.968
inav-3.648
tackle-3.569
thumbnails-3.563
Token a
Token ID257
Feature activation-4.18e-01

Pos logit contributions
��+4.094
 Preferences+4.075
 Authentication+4.027
 Regulatory+4.008
oons+3.993
Neg logit contributions
river-5.646
evil-5.610
inav-5.152
tackle-5.112
thumbnails-5.025
Token h
Token ID289
Feature activation+1.32e+00

Pos logit contributions
��+3.759
 Regulatory+3.690
 Preferences+3.686
 Authentication+3.676
oons+3.623
Neg logit contributions
river-4.650
evil-4.595
tackle-4.156
inav-4.130
thumbnails-4.125
Tokenodge
Token ID9728
Feature activation+4.94e-02

Pos logit contributions
river+13.931
evil+13.326
alid+13.213
ivable+12.656
inav+12.357
Neg logit contributions
��-10.383
���-9.619
 Authentication-9.619
 Preferences-9.349
theless-8.961
Tokenp
Token ID79
Feature activation-4.18e-02

Pos logit contributions
river+0.514
evil+0.495
inav+0.452
thumbnails+0.442
tackle+0.440
Neg logit contributions
��-0.509
oons-0.493
 Authentication-0.490
 Preferences-0.485
 Regulatory-0.473
Tokenodge
Token ID9728
Feature activation-1.60e-01

Pos logit contributions
��+0.328
oons+0.314
 Authentication+0.312
 Preferences+0.307
illion+0.298
Neg logit contributions
river-0.537
evil-0.520
inav-0.485
thumbnails-0.476
tackle-0.474
Token of
Token ID286
Feature activation-1.27e-01

Pos logit contributions
��+1.644
oons+1.602
 Authentication+1.598
 Preferences+1.577
 Regulatory+1.563
Neg logit contributions
river-1.642
evil-1.600
inav-1.448
thumbnails-1.422
tackle-1.419
Token new
Token ID649
Feature activation-1.26e-02

Pos logit contributions
��+1.222
oons+1.187
 Authentication+1.182
 Preferences+1.174
 Regulatory+1.148
Neg logit contributions
river-1.397
evil-1.351
inav-1.241
tackle-1.216
thumbnails-1.214
Token gets
Token ID3011
Feature activation+9.33e-02

Pos logit contributions
river+0.533
evil+0.508
inav+0.460
thumbnails+0.450
tackle+0.447
Neg logit contributions
��-0.612
 Authentication-0.589
oons-0.587
 Preferences-0.582
namese-0.568
Token worse
Token ID4785
Feature activation+2.95e-02

Pos logit contributions
river+0.974
evil+0.939
inav+0.852
thumbnails+0.834
tackle+0.828
Neg logit contributions
��-0.962
 Authentication-0.928
oons-0.927
 Preferences-0.914
 Regulatory-0.890
Token,
Token ID11
Feature activation+2.64e-01

Pos logit contributions
river+0.285
evil+0.272
inav+0.247
thumbnails+0.240
tackle+0.239
Neg logit contributions
��-0.328
oons-0.318
 Authentication-0.317
 Preferences-0.313
 Regulatory-0.307
Token turns
Token ID4962
Feature activation+2.44e-01

Pos logit contributions
river+2.610
evil+2.475
inav+2.222
thumbnails+2.203
tackle+2.162
Neg logit contributions
��-2.870
oons-2.774
 Authentication-2.771
 Preferences-2.715
illion-2.663
Token into
Token ID656
Feature activation+1.54e-01

Pos logit contributions
river+2.718
evil+2.592
inav+2.325
thumbnails+2.291
tackle+2.280
Neg logit contributions
��-2.369
 Authentication-2.296
oons-2.257
 Preferences-2.242
illion-2.202
Token sl
Token ID1017
Feature activation+1.28e+00

Pos logit contributions
river+1.657
evil+1.583
inav+1.448
tackle+1.411
thumbnails+1.407
Neg logit contributions
��-1.536
 Authentication-1.478
oons-1.471
 Preferences-1.456
illion-1.423
Tokenush
Token ID1530
Feature activation+1.77e-01

Pos logit contributions
river+10.649
evil+9.705
alid+9.599
ivable+8.580
angle+8.576
Neg logit contributions
��-13.336
 Authentication-13.145
 Preferences-12.800
 Regulatory-12.756
soever-12.469
Token,
Token ID11
Feature activation+2.81e-01

Pos logit contributions
river+1.791
evil+1.709
inav+1.539
alid+1.496
tackle+1.488
Neg logit contributions
��-1.898
 Authentication-1.836
oons-1.814
 Preferences-1.800
 Regulatory-1.771
Token and
Token ID290
Feature activation+3.53e-01

Pos logit contributions
river+2.714
evil+2.582
inav+2.309
thumbnails+2.279
tackle+2.249
Neg logit contributions
��-3.095
 Authentication-2.995
oons-2.972
 Preferences-2.941
illion-2.878
Token then
Token ID788
Feature activation+2.20e-01

Pos logit contributions
river+3.544
evil+3.357
inav+3.002
thumbnails+2.972
phia+2.943
Neg logit contributions
��-3.755
 Authentication-3.623
oons-3.569
 Preferences-3.524
namese-3.446
Token,
Token ID11
Feature activation+3.65e-01

Pos logit contributions
river+2.268
evil+2.167
inav+1.967
thumbnails+1.923
tackle+1.892
Neg logit contributions
��-2.315
 Authentication-2.234
oons-2.199
 Preferences-2.176
illion-2.137
Token at
Token ID379
Feature activation+2.07e-01

Pos logit contributions
��+2.649
oons+2.584
 Authentication+2.540
 Preferences+2.528
 Regulatory+2.497
Neg logit contributions
river-3.318
evil-3.217
thumbnails-3.004
inav-2.993
tackle-2.935
Token Washington
Token ID2669
Feature activation+1.05e-01

Pos logit contributions
river+2.334
evil+2.224
inav+2.073
phia+1.993
tackle+1.980
Neg logit contributions
��-1.982
oons-1.880
 Authentication-1.857
 Preferences-1.831
illion-1.831
Token University
Token ID2059
Feature activation+3.43e-01

Pos logit contributions
river+1.219
evil+1.173
inav+1.087
phia+1.062
tackle+1.055
Neg logit contributions
��-0.955
oons-0.919
 Authentication-0.912
 Preferences-0.902
illion-0.884
Token in
Token ID287
Feature activation+2.24e-01

Pos logit contributions
river+3.040
evil+2.844
inav+2.571
tackle+2.491
phia+2.460
Neg logit contributions
��-4.000
oons-3.863
 Authentication-3.851
 Preferences-3.798
elle-3.737
Token St
Token ID520
Feature activation+1.15e+00

Pos logit contributions
river+2.532
evil+2.338
inav+2.182
thumbnails+2.097
tackle+2.094
Neg logit contributions
��-2.156
oons-2.070
 Authentication-2.060
 Preferences-2.049
namese-2.006
Token.
Token ID13
Feature activation+1.22e+00

Pos logit contributions
river+6.790
evil+5.786
alid+4.869
inav+4.469
tre+4.365
Neg logit contributions
��-15.970
���-15.932
theless-15.368
 Authentication-15.243
 Preferences-14.820
Token Louis
Token ID5593
Feature activation-3.10e-02

Pos logit contributions
river+4.935
evil+3.674
 Leh+3.304
Le+3.080
 Dise+2.997
Neg logit contributions
��-18.446
namese-17.522
���-17.254
��-17.104
theless-17.099
Token,
Token ID11
Feature activation-9.74e-02

Pos logit contributions
��+0.329
oons+0.319
 Authentication+0.318
 Preferences+0.314
elle+0.307
Neg logit contributions
river-0.312
evil-0.300
inav-0.274
thumbnails-0.267
tackle-0.266
Token has
Token ID468
Feature activation-1.22e-01

Pos logit contributions
��+0.983
oons+0.953
 Authentication+0.950
 Preferences+0.939
 Regulatory+0.919
Neg logit contributions
river-1.028
evil-0.992
inav-0.911
thumbnails-0.889
tackle-0.885
Token also
Token ID635
Feature activation+2.16e-01

Pos logit contributions
��+1.213
oons+1.177
 Authentication+1.172
 Preferences+1.158
 Regulatory+1.134
Neg logit contributions
river-1.301
evil-1.257
inav-1.151
thumbnails-1.138
tackle-1.126
Token studied
Token ID9713
Feature activation-2.15e-01

Pos logit contributions
river+2.175
evil+2.083
inav+1.925
tackle+1.844
alid+1.836
Neg logit contributions
��-2.287
oons-2.201
 Authentication-2.191
 Preferences-2.177
illion-2.113
Token via
Token ID2884
Feature activation+1.16e-01

Pos logit contributions
��+0.921
oons+0.894
 Authentication+0.889
 Preferences+0.885
elle+0.860
Neg logit contributions
river-0.980
evil-0.947
inav-0.871
thumbnails-0.853
tackle-0.847
Token remote
Token ID6569
Feature activation+4.98e-02

Pos logit contributions
river+1.250
evil+1.196
inav+1.100
thumbnails+1.076
tackle+1.073
Neg logit contributions
��-1.149
oons-1.103
 Authentication-1.092
 Preferences-1.063
namese-1.060
Token control
Token ID1630
Feature activation-1.05e-01

Pos logit contributions
river+0.630
evil+0.608
inav+0.567
thumbnails+0.556
tackle+0.555
Neg logit contributions
��-0.404
oons-0.386
 Authentication-0.381
 Preferences-0.374
illion-0.367
Token or
Token ID393
Feature activation-1.71e-01

Pos logit contributions
��+1.074
oons+1.048
 Authentication+1.046
 Preferences+1.041
elle+1.008
Neg logit contributions
river-1.091
evil-1.061
inav-0.970
thumbnails-0.949
tackle-0.941
Token rear
Token ID8286
Feature activation+9.16e-01

Pos logit contributions
��+1.584
 Authentication+1.557
 Preferences+1.551
oons+1.547
 Regulatory+1.496
Neg logit contributions
river-1.897
evil-1.850
inav-1.712
thumbnails-1.663
tackle-1.653
Token-
Token ID12
Feature activation+1.21e+00

Pos logit contributions
river+8.925
evil+8.267
tackle+7.606
tre+7.463
inav+7.432
Neg logit contributions
��-9.244
namese-8.290
oons-8.283
 Authentication-8.144
illion-8.103
Tokenmounted
Token ID29728
Feature activation+4.28e-01

Pos logit contributions
river+12.177
evil+11.509
tackle+10.980
tre+10.692
angle+10.358
Neg logit contributions
��-11.282
namese-9.979
 Regulatory-9.779
 Authentication-9.760
 Preferences-9.748
Token buttons
Token ID12163
Feature activation-1.03e-01

Pos logit contributions
river+4.537
evil+4.214
inav+3.858
thumbnails+3.816
tackle+3.815
Neg logit contributions
��-4.381
oons-4.030
illion-4.017
namese-3.991
 Authentication-3.979
Token,
Token ID11
Feature activation-1.16e-01

Pos logit contributions
��+1.084
oons+1.052
 Authentication+1.050
 Preferences+1.038
elle+1.018
Neg logit contributions
river-1.034
evil-0.998
inav-0.914
thumbnails-0.893
tackle-0.888
Token and
Token ID290
Feature activation+1.85e-01

Pos logit contributions
��+1.150
oons+1.121
 Authentication+1.121
 Preferences+1.111
elle+1.079
Neg logit contributions
river-1.231
evil-1.189
inav-1.098
thumbnails-1.065
tackle-1.060
Token its
Token ID663
Feature activation+2.19e-01

Pos logit contributions
river+1.942
evil+1.866
inav+1.695
thumbnails+1.663
tackle+1.661
Neg logit contributions
��-1.893
oons-1.807
 Authentication-1.784
 Preferences-1.741
illion-1.740
Token becoming
Token ID5033
Feature activation+2.90e-01

Pos logit contributions
river+1.640
evil+1.587
inav+1.419
thumbnails+1.405
tackle+1.403
Neg logit contributions
��-1.603
 Authentication-1.541
oons-1.541
 Preferences-1.520
namese-1.480
Token excessively
Token ID40437
Feature activation-1.74e-02

Pos logit contributions
river+2.970
evil+2.867
thumbnails+2.567
tackle+2.530
inav+2.525
Neg logit contributions
��-2.993
 Authentication-2.863
oons-2.837
 Preferences-2.800
illion-2.757
Token sensitive
Token ID8564
Feature activation+1.31e-01

Pos logit contributions
��+0.165
oons+0.159
 Authentication+0.158
 Preferences+0.156
illion+0.152
Neg logit contributions
river-0.196
evil-0.189
inav-0.174
thumbnails-0.171
tackle-0.170
Token to
Token ID284
Feature activation-6.18e-02

Pos logit contributions
river+1.277
evil+1.213
inav+1.098
thumbnails+1.077
tackle+1.075
Neg logit contributions
��-1.451
 Authentication-1.404
oons-1.391
 Preferences-1.375
 Regulatory-1.352
Token perceived
Token ID11067
Feature activation+1.89e-01

Pos logit contributions
��+0.593
oons+0.573
 Authentication+0.570
 Preferences+0.564
elle+0.550
Neg logit contributions
river-0.688
evil-0.662
inav-0.612
thumbnails-0.598
tackle-0.595
Token sl
Token ID1017
Feature activation+1.20e+00

Pos logit contributions
river+2.104
evil+2.051
inav+1.873
thumbnails+1.819
tackle+1.806
Neg logit contributions
��-1.775
oons-1.706
 Authentication-1.695
 Preferences-1.654
illion-1.647
Tokenights
Token ID2337
Feature activation+1.15e-02

Pos logit contributions
river+7.825
evil+6.935
alid+6.748
ivable+6.090
inav+6.020
Neg logit contributions
��-14.989
 Authentication-14.701
 Regulatory-14.428
soever-14.388
theless-14.295
Token.
Token ID13
Feature activation-8.91e-02

Pos logit contributions
river+0.114
evil+0.109
inav+0.099
thumbnails+0.097
tackle+0.097
Neg logit contributions
��-0.125
oons-0.121
 Authentication-0.121
 Preferences-0.119
 Regulatory-0.117
Token
Token ID198
Feature activation-1.59e-01

Pos logit contributions
��+0.867
oons+0.840
 Authentication+0.838
 Preferences+0.828
 Regulatory+0.808
Neg logit contributions
river-0.976
evil-0.942
inav-0.868
thumbnails-0.849
tackle-0.845
Token
Token ID198
Feature activation-1.85e-01

Pos logit contributions
oons+1.427
 Authentication+1.423
��+1.422
 Preferences+1.406
elle+1.384
Neg logit contributions
river-1.789
evil-1.727
inav-1.618
thumbnails-1.581
phia-1.573
TokenThree
Token ID12510
Feature activation-2.80e-01

Pos logit contributions
��+1.728
oons+1.678
 Authentication+1.648
 Preferences+1.644
elle+1.628
Neg logit contributions
river-2.065
evil-1.989
inav-1.866
thumbnails-1.809
phia-1.799
Token we
Token ID356
Feature activation+4.79e-02

Pos logit contributions
��+0.590
oons+0.571
 Authentication+0.569
 Preferences+0.563
illion+0.550
Neg logit contributions
river-0.636
evil-0.613
inav-0.564
thumbnails-0.550
tackle-0.548
Token begin
Token ID2221
Feature activation+2.11e-01

Pos logit contributions
river+0.485
evil+0.467
inav+0.426
thumbnails+0.415
tackle+0.414
Neg logit contributions
��-0.508
oons-0.491
 Authentication-0.489
 Preferences-0.483
illion-0.473
Token to
Token ID284
Feature activation+5.71e-02

Pos logit contributions
river+2.169
evil+2.056
inav+1.862
thumbnails+1.826
tackle+1.818
Neg logit contributions
��-2.211
oons-2.146
 Authentication-2.131
 Preferences-2.105
illion-2.061
Token see
Token ID766
Feature activation+1.60e-01

Pos logit contributions
river+0.598
evil+0.577
inav+0.527
tackle+0.512
thumbnails+0.512
Neg logit contributions
��-0.587
oons-0.566
 Authentication-0.561
 Preferences-0.556
illion-0.543
Token some
Token ID617
Feature activation+2.17e-01

Pos logit contributions
river+1.678
evil+1.609
inav+1.463
thumbnails+1.435
tackle+1.424
Neg logit contributions
��-1.623
oons-1.574
 Authentication-1.553
 Preferences-1.532
illion-1.504
Token sl
Token ID1017
Feature activation+1.19e+00

Pos logit contributions
river+2.256
evil+2.172
inav+1.992
thumbnails+1.929
tackle+1.912
Neg logit contributions
��-2.201
oons-2.133
 Authentication-2.096
 Preferences-2.063
illion-2.040
Tokenown
Token ID593
Feature activation+1.22e-01

Pos logit contributions
river+9.413
evil+8.882
alid+8.015
inav+7.859
ivable+7.843
Neg logit contributions
��-13.184
 Regulatory-13.088
 Authentication-13.036
soever-12.923
theless-12.881
Tokeness
Token ID408
Feature activation+2.58e-01

Pos logit contributions
river+2.012
evil+1.964
inav+1.851
thumbnails+1.816
alid+1.814
Neg logit contributions
��-0.525
 Authentication-0.485
oons-0.465
 Preferences-0.464
 Regulatory-0.440
Token:
Token ID25
Feature activation-9.39e-03

Pos logit contributions
river+2.376
evil+2.271
inav+2.009
phia+1.958
thumbnails+1.955
Neg logit contributions
��-2.916
oons-2.852
 Authentication-2.824
 Preferences-2.783
 Regulatory-2.746
Token
Token ID198
Feature activation+1.68e-01

Pos logit contributions
��+0.095
oons+0.092
 Authentication+0.091
 Preferences+0.090
illion+0.088
Neg logit contributions
river-0.100
evil-0.096
inav-0.088
thumbnails-0.086
tackle-0.085
Token
Token ID198
Feature activation-1.25e-02

Pos logit contributions
river+1.676
evil+1.609
inav+1.446
thumbnails+1.417
tackle+1.409
Neg logit contributions
��-1.862
oons-1.756
 Authentication-1.745
 Preferences-1.726
namese-1.706
Token Four
Token ID6675
Feature activation+8.32e-02

Pos logit contributions
river+7.391
evil+7.046
phia+6.418
 Leh+6.280
inav+6.277
Neg logit contributions
��-8.093
oons-7.648
illion-7.544
namese-7.429
ework-7.383
Tokenteenth
Token ID20283
Feature activation+5.51e-01

Pos logit contributions
river+0.982
evil+0.949
inav+0.877
thumbnails+0.859
tackle+0.853
Neg logit contributions
��-0.744
oons-0.710
 Authentication-0.707
 Preferences-0.698
illion-0.681
Token Amendment
Token ID8441
Feature activation+3.19e-01

Pos logit contributions
river+4.888
evil+4.633
alid+4.114
inav+3.999
BILITY+3.965
Neg logit contributions
��-6.231
oons-5.815
namese-5.758
illion-5.658
 Preferences-5.633
Token's
Token ID338
Feature activation+6.22e-01

Pos logit contributions
river+3.098
evil+2.969
inav+2.650
tackle+2.603
thumbnails+2.591
Neg logit contributions
��-3.497
oons-3.321
 Authentication-3.240
illion-3.239
namese-3.218
Token equal
Token ID4961
Feature activation+3.16e-01

Pos logit contributions
river+5.434
evil+5.207
phia+4.591
inav+4.586
tre+4.536
Neg logit contributions
��-6.889
oons-6.642
illion-6.492
namese-6.368
 Authentication-6.217
Token protection
Token ID4800
Feature activation+1.19e+00

Pos logit contributions
river+3.086
evil+2.927
inav+2.707
tackle+2.603
phia+2.576
Neg logit contributions
��-3.510
oons-3.372
 Authentication-3.216
 Preferences-3.190
illion-3.176
Token clause
Token ID13444
Feature activation+5.42e-01

Pos logit contributions
river+8.624
evil+8.178
BILITY+7.439
 dise+7.332
phia+7.202
Neg logit contributions
��-13.387
oons-12.129
illion-12.019
 Regulatory-11.782
��-11.595
Token granted
Token ID7520
Feature activation+5.33e-01

Pos logit contributions
river+4.779
evil+4.610
inav+4.036
 dise+3.967
phia+3.964
Neg logit contributions
��-6.181
oons-5.821
 Authentication-5.727
namese-5.674
illion-5.667
Token constitutional
Token ID9758
Feature activation+9.22e-01

Pos logit contributions
river+5.151
evil+4.905
phia+4.355
tre+4.262
inav+4.237
Neg logit contributions
��-5.671
oons-5.423
 Authentication-5.367
namese-5.328
illion-5.309
Token protections
Token ID15018
Feature activation+5.01e-01

Pos logit contributions
river+7.312
evil+6.797
tre+6.053
 dise+5.877
inav+5.850
Neg logit contributions
��-10.652
oons-9.882
illion-9.665
namese-9.529
theless-9.370
Token to
Token ID284
Feature activation+6.38e-01

Pos logit contributions
river+4.410
evil+4.244
inav+3.647
phia+3.575
alid+3.518
Neg logit contributions
��-5.783
oons-5.571
 Authentication-5.548
 Regulatory-5.499
 Preferences-5.418
Token supporting
Token ID6493
Feature activation+2.60e-01

Pos logit contributions
river+2.860
evil+2.777
inav+2.516
tackle+2.460
thumbnails+2.452
Neg logit contributions
��-2.835
 Authentication-2.706
oons-2.702
 Preferences-2.655
illion-2.605
Token her
Token ID607
Feature activation+3.53e-01

Pos logit contributions
river+2.860
evil+2.755
inav+2.528
tackle+2.434
thumbnails+2.429
Neg logit contributions
��-2.540
oons-2.421
 Authentication-2.414
 Preferences-2.371
namese-2.331
Token.
Token ID13
Feature activation+2.23e-01

Pos logit contributions
river+3.366
evil+3.242
inav+2.922
tackle+2.821
alid+2.758
Neg logit contributions
��-3.968
oons-3.737
 Authentication-3.736
 Preferences-3.654
namese-3.640
Token
Token ID198
Feature activation-2.29e-01

Pos logit contributions
river+2.253
evil+2.174
inav+1.982
thumbnails+1.916
alid+1.896
Neg logit contributions
��-2.361
oons-2.255
namese-2.201
 Authentication-2.196
illion-2.181
Token
Token ID198
Feature activation+2.65e-01

Pos logit contributions
oons+1.961
 Authentication+1.953
 Preferences+1.929
elle+1.914
��+1.907
Neg logit contributions
river-2.647
evil-2.558
inav-2.412
thumbnails-2.361
phia-2.360
TokenS
Token ID50
Feature activation+1.19e+00

Pos logit contributions
river+2.869
evil+2.775
inav+2.483
thumbnails+2.461
tackle+2.451
Neg logit contributions
��-2.643
 Authentication-2.575
oons-2.533
 Preferences-2.516
namese-2.479
Tokenins
Token ID1040
Feature activation+1.60e-01

Pos logit contributions
evil+12.046
river+11.527
alid+10.872
inav+10.488
tre+10.146
Neg logit contributions
 Authentication-10.370
��-10.267
 Regulatory-9.914
 Preferences-9.636
theless-9.431
Token of
Token ID286
Feature activation-1.65e-01

Pos logit contributions
river+1.616
evil+1.568
inav+1.418
tackle+1.372
thumbnails+1.372
Neg logit contributions
��-1.698
 Authentication-1.635
oons-1.635
 Preferences-1.603
 Regulatory-1.581
Token omission
Token ID35725
Feature activation-3.91e-02

Pos logit contributions
��+1.564
 Authentication+1.505
oons+1.496
 Preferences+1.489
 Regulatory+1.450
Neg logit contributions
river-1.859
evil-1.784
inav-1.646
thumbnails-1.634
tackle-1.622
Token are
Token ID389
Feature activation-1.92e-01

Pos logit contributions
��+0.403
oons+0.391
 Authentication+0.389
 Preferences+0.385
illion+0.377
Neg logit contributions
river-0.405
evil-0.390
inav-0.358
thumbnails-0.350
tackle-0.348
Token as
Token ID355
Feature activation-4.57e-02

Pos logit contributions
��+1.904
oons+1.837
 Authentication+1.825
 Preferences+1.809
elle+1.780
Neg logit contributions
river-2.038
evil-1.955
inav-1.824
tackle-1.787
thumbnails-1.776
Token
Token ID198
Feature activation+2.84e-01

Pos logit contributions
river+2.392
evil+2.315
inav+2.102
thumbnails+2.051
tackle+2.035
Neg logit contributions
��-2.138
oons-2.023
namese-1.984
 Authentication-1.968
illion-1.964
Token
Token ID198
Feature activation+4.80e-02

Pos logit contributions
river+2.674
evil+2.576
inav+2.264
thumbnails+2.211
tre+2.207
Neg logit contributions
��-3.342
oons-3.075
 Authentication-3.060
namese-3.039
 Preferences-3.035
TokenYou
Token ID1639
Feature activation+1.64e-01

Pos logit contributions
river+0.530
evil+0.512
inav+0.467
thumbnails+0.459
tackle+0.458
Neg logit contributions
��-0.467
oons-0.450
 Authentication-0.450
 Preferences-0.444
 Regulatory-0.433
Token tell
Token ID1560
Feature activation+2.19e-01

Pos logit contributions
river+1.643
evil+1.590
inav+1.434
tackle+1.398
thumbnails+1.386
Neg logit contributions
��-1.763
oons-1.683
 Authentication-1.664
 Preferences-1.659
namese-1.624
Token Res
Token ID1874
Feature activation+5.48e-01

Pos logit contributions
river+2.454
evil+2.377
inav+2.155
tackle+2.116
alid+2.107
Neg logit contributions
��-2.052
oons-1.983
 Authentication-1.946
 Preferences-1.929
namese-1.924
Tokena
Token ID64
Feature activation+1.15e+00

Pos logit contributions
river+5.022
evil+4.867
alid+4.432
inav+4.255
tackle+4.116
Neg logit contributions
��-6.423
 Preferences-6.015
 Regulatory-5.877
 Authentication-5.853
oons-5.809
Tokenech
Token ID3055
Feature activation+3.53e-01

Pos logit contributions
evil+9.318
river+8.494
alid+7.666
phia+7.378
tackle+7.222
Neg logit contributions
��-13.482
���-12.494
 Preferences-12.377
ThumbnailImage-12.030
 Authentication-11.904
Token all
Token ID477
Feature activation+2.68e-02

Pos logit contributions
river+3.318
evil+3.170
inav+2.806
alid+2.789
tackle+2.742
Neg logit contributions
��-3.846
oons-3.760
 Authentication-3.728
 Preferences-3.695
namese-3.667
Token you
Token ID345
Feature activation-3.47e-03

Pos logit contributions
river+0.269
evil+0.258
inav+0.235
thumbnails+0.230
tackle+0.229
Neg logit contributions
��-0.287
oons-0.278
 Authentication-0.276
 Preferences-0.273
illion-0.268
Token know
Token ID760
Feature activation+1.87e-01

Pos logit contributions
��+0.033
oons+0.032
 Authentication+0.032
 Preferences+0.032
illion+0.031
Neg logit contributions
river-0.039
evil-0.037
inav-0.034
thumbnails-0.033
tackle-0.033
Token,
Token ID11
Feature activation+3.15e-01

Pos logit contributions
river+1.767
evil+1.685
inav+1.524
thumbnails+1.480
tackle+1.478
Neg logit contributions
��-2.101
oons-2.023
 Authentication-2.012
 Preferences-1.992
 Regulatory-1.948
Token I
Token ID314
Feature activation+3.63e-01

Pos logit contributions
river+0.971
evil+0.942
inav+0.862
thumbnails+0.845
tackle+0.845
Neg logit contributions
��-0.810
oons-0.779
 Authentication-0.769
 Preferences-0.760
namese-0.748
Token could
Token ID714
Feature activation-2.04e-01

Pos logit contributions
river+3.330
evil+3.226
inav+2.882
alid+2.771
thumbnails+2.744
Neg logit contributions
��-4.263
oons-3.968
 Preferences-3.923
 Authentication-3.918
namese-3.883
Token not
Token ID407
Feature activation+5.36e-02

Pos logit contributions
��+1.961
oons+1.951
 Authentication+1.938
 Preferences+1.905
illion+1.900
Neg logit contributions
river-2.183
evil-2.092
inav-1.955
thumbnails-1.923
tackle-1.885
Token detach
Token ID48224
Feature activation+4.94e-01

Pos logit contributions
river+0.544
evil+0.523
inav+0.475
tackle+0.464
thumbnails+0.462
Neg logit contributions
��-0.573
oons-0.551
 Authentication-0.547
 Preferences-0.542
illion-0.529
Token the
Token ID262
Feature activation+3.87e-01

Pos logit contributions
river+4.782
evil+4.604
tackle+4.038
inav+3.981
thumbnails+3.960
Neg logit contributions
��-5.228
oons-5.030
 Authentication-4.987
 Preferences-4.970
namese-4.902
Token rear
Token ID8286
Feature activation+1.15e+00

Pos logit contributions
river+3.983
evil+3.891
inav+3.483
tackle+3.394
alid+3.353
Neg logit contributions
��-3.966
oons-3.772
namese-3.693
 Authentication-3.656
 Preferences-3.648
Token clips
Token ID19166
Feature activation-3.07e-03

Pos logit contributions
river+10.675
evil+10.365
tackle+9.170
 Leh+9.137
fa+8.964
Neg logit contributions
��-10.699
namese-9.889
oons-9.796
 Regulatory-9.548
 Preferences-9.510
Token and
Token ID290
Feature activation+1.30e-01

Pos logit contributions
��+0.031
oons+0.030
 Authentication+0.030
 Preferences+0.030
illion+0.029
Neg logit contributions
river-0.032
evil-0.031
inav-0.028
thumbnails-0.028
tackle-0.028
Token for
Token ID329
Feature activation+8.46e-01

Pos logit contributions
river+1.372
evil+1.327
inav+1.203
thumbnails+1.189
tackle+1.183
Neg logit contributions
��-1.323
oons-1.266
 Authentication-1.251
 Preferences-1.242
namese-1.215
Token fear
Token ID3252
Feature activation+3.41e-01

Pos logit contributions
evil+7.335
river+7.190
thumbnails+6.374
inav+6.301
tackle+6.226
Neg logit contributions
��-8.664
oons-8.487
 Preferences-8.336
elle-8.205
illion-8.174
Token of
Token ID286
Feature activation-8.44e-02

Pos logit contributions
river+3.071
evil+2.965
inav+2.622
thumbnails+2.564
tackle+2.558
Neg logit contributions
��-3.903
oons-3.754
 Authentication-3.749
 Preferences-3.742
namese-3.650
Token psychology
Token ID15119
Feature activation-2.93e-01

Pos logit contributions
��+2.091
 Authentication+2.035
oons+2.027
 Preferences+2.004
 Regulatory+1.996
Neg logit contributions
river-2.290
evil-2.207
inav-2.022
tackle-2.006
thumbnails-1.995
Token at
Token ID379
Feature activation+2.07e-01

Pos logit contributions
��+2.649
oons+2.584
 Authentication+2.540
 Preferences+2.528
 Regulatory+2.497
Neg logit contributions
river-3.318
evil-3.217
thumbnails-3.004
inav-2.993
tackle-2.935
Token Washington
Token ID2669
Feature activation+1.05e-01

Pos logit contributions
river+2.334
evil+2.224
inav+2.073
phia+1.993
tackle+1.980
Neg logit contributions
��-1.982
oons-1.880
 Authentication-1.857
 Preferences-1.831
illion-1.831
Token University
Token ID2059
Feature activation+3.43e-01

Pos logit contributions
river+1.219
evil+1.173
inav+1.087
phia+1.062
tackle+1.055
Neg logit contributions
��-0.955
oons-0.919
 Authentication-0.912
 Preferences-0.902
illion-0.884
Token in
Token ID287
Feature activation+2.24e-01

Pos logit contributions
river+3.040
evil+2.844
inav+2.571
tackle+2.491
phia+2.460
Neg logit contributions
��-4.000
oons-3.863
 Authentication-3.851
 Preferences-3.798
elle-3.737
Token St
Token ID520
Feature activation+1.15e+00

Pos logit contributions
river+2.532
evil+2.338
inav+2.182
thumbnails+2.097
tackle+2.094
Neg logit contributions
��-2.156
oons-2.070
 Authentication-2.060
 Preferences-2.049
namese-2.006
Token.
Token ID13
Feature activation+1.22e+00

Pos logit contributions
river+6.790
evil+5.786
alid+4.869
inav+4.469
tre+4.365
Neg logit contributions
��-15.970
���-15.932
theless-15.368
 Authentication-15.243
 Preferences-14.820
Token Louis
Token ID5593
Feature activation-3.10e-02

Pos logit contributions
river+4.935
evil+3.674
 Leh+3.304
Le+3.080
 Dise+2.997
Neg logit contributions
��-18.446
namese-17.522
���-17.254
��-17.104
theless-17.099
Token,
Token ID11
Feature activation-9.74e-02

Pos logit contributions
��+0.329
oons+0.319
 Authentication+0.318
 Preferences+0.314
elle+0.307
Neg logit contributions
river-0.312
evil-0.300
inav-0.274
thumbnails-0.267
tackle-0.266
Token has
Token ID468
Feature activation-1.22e-01

Pos logit contributions
��+0.983
oons+0.953
 Authentication+0.950
 Preferences+0.939
 Regulatory+0.919
Neg logit contributions
river-1.028
evil-0.992
inav-0.911
thumbnails-0.889
tackle-0.885
Token also
Token ID635
Feature activation+2.16e-01

Pos logit contributions
��+1.213
oons+1.177
 Authentication+1.172
 Preferences+1.158
 Regulatory+1.134
Neg logit contributions
river-1.301
evil-1.257
inav-1.151
thumbnails-1.138
tackle-1.126
Token existing
Token ID4683
Feature activation+5.19e-02

Pos logit contributions
river+2.010
evil+1.967
inav+1.805
thumbnails+1.753
alid+1.727
Neg logit contributions
��-1.801
oons-1.744
 Authentication-1.726
illion-1.685
namese-1.680
Token regime
Token ID7142
Feature activation+1.83e-01

Pos logit contributions
river+0.561
evil+0.542
inav+0.499
thumbnails+0.486
tackle+0.482
Neg logit contributions
��-0.512
oons-0.495
 Authentication-0.491
 Preferences-0.482
illion-0.476
Token with
Token ID351
Feature activation+1.66e-01

Pos logit contributions
river+1.773
evil+1.696
inav+1.544
thumbnails+1.502
tackle+1.502
Neg logit contributions
��-1.998
oons-1.943
 Authentication-1.930
 Preferences-1.875
namese-1.863
Token a
Token ID257
Feature activation-6.82e-03

Pos logit contributions
river+1.761
evil+1.689
inav+1.555
thumbnails+1.525
alid+1.514
Neg logit contributions
��-1.652
oons-1.600
 Authentication-1.584
 Preferences-1.541
illion-1.539
Token new
Token ID649
Feature activation+5.39e-03

Pos logit contributions
��+0.068
oons+0.065
 Authentication+0.065
 Preferences+0.064
illion+0.063
Neg logit contributions
river-0.074
evil-0.071
inav-0.065
thumbnails-0.064
tackle-0.063
Token short
Token ID1790
Feature activation+1.14e+00

Pos logit contributions
river+0.057
evil+0.055
inav+0.050
thumbnails+0.049
tackle+0.049
Neg logit contributions
��-0.055
oons-0.053
 Authentication-0.053
 Preferences-0.052
illion-0.051
Token-
Token ID12
Feature activation+1.01e+00

Pos logit contributions
river+7.201
evil+7.115
inav+6.579
 awoken+6.030
 dise+5.911
Neg logit contributions
��-13.527
 Authentication-13.371
oons-12.986
 Preferences-12.919
namese-12.864
Tokenterm
Token ID4354
Feature activation+6.90e-01

Pos logit contributions
river+7.239
evil+7.077
tre+5.809
tackle+5.741
class+5.674
Neg logit contributions
��-12.398
 Authentication-11.728
namese-11.715
 Preferences-11.663
oons-11.409
Token disability
Token ID14013
Feature activation+6.60e-01

Pos logit contributions
river+6.535
evil+6.374
inav+5.949
alid+5.634
 dise+5.593
Neg logit contributions
��-7.064
oons-6.762
 Authentication-6.689
namese-6.541
 Preferences-6.407
Token plan
Token ID1410
Feature activation+1.68e-02

Pos logit contributions
river+6.116
evil+5.815
inav+5.476
tackle+5.111
alid+5.097
Neg logit contributions
��-7.171
oons-6.816
 Authentication-6.596
namese-6.511
elle-6.434
Token.
Token ID13
Feature activation+1.27e-01

Pos logit contributions
river+0.166
evil+0.159
inav+0.145
thumbnails+0.141
tackle+0.140
Neg logit contributions
��-0.183
oons-0.177
 Authentication-0.176
 Preferences-0.174
illion-0.171
Token sound
Token ID2128
Feature activation+1.28e-01

Pos logit contributions
��+0.526
oons+0.510
 Authentication+0.507
 Preferences+0.502
 Regulatory+0.490
Neg logit contributions
river-0.635
evil-0.612
inav-0.567
thumbnails-0.553
tackle-0.553
Token commercial
Token ID5068
Feature activation+2.26e-01

Pos logit contributions
river+1.412
evil+1.367
inav+1.261
thumbnails+1.232
phia+1.209
Neg logit contributions
��-1.235
oons-1.181
 Authentication-1.177
 Preferences-1.150
illion-1.132
Token banking
Token ID11754
Feature activation-1.12e-01

Pos logit contributions
river+2.352
evil+2.285
inav+2.112
thumbnails+2.049
phia+2.005
Neg logit contributions
��-2.324
oons-2.201
 Authentication-2.185
 Preferences-2.121
illion-2.120
Token in
Token ID287
Feature activation+2.29e-01

Pos logit contributions
��+1.162
oons+1.134
 Authentication+1.129
 Preferences+1.121
 Regulatory+1.096
Neg logit contributions
river-1.138
evil-1.092
inav-1.003
tackle-0.983
thumbnails-0.982
Token pretty
Token ID2495
Feature activation+6.54e-02

Pos logit contributions
river+2.378
evil+2.305
inav+2.087
thumbnails+2.046
phia+2.023
Neg logit contributions
��-2.337
 Authentication-2.229
oons-2.220
namese-2.170
elle-2.161
Token short
Token ID1790
Feature activation+1.14e+00

Pos logit contributions
river+0.623
evil+0.600
inav+0.544
thumbnails+0.528
tackle+0.521
Neg logit contributions
��-0.733
oons-0.708
 Authentication-0.706
 Preferences-0.695
namese-0.683
Token order
Token ID1502
Feature activation-2.50e-01

Pos logit contributions
river+6.323
evil+5.889
 dise+5.404
 Leh+5.287
phia+4.906
Neg logit contributions
��-14.599
 Authentication-14.367
namese-14.149
 Preferences-14.055
oons-13.711
Token.
Token ID13
Feature activation-9.37e-02

Pos logit contributions
��+2.574
oons+2.508
 Preferences+2.475
 Authentication+2.474
illion+2.439
Neg logit contributions
river-2.533
evil-2.451
inav-2.272
thumbnails-2.239
tackle-2.225
Token ("
Token ID5855
Feature activation+5.70e-02

Pos logit contributions
��+0.830
oons+0.801
 Authentication+0.800
 Preferences+0.791
 Regulatory+0.772
Neg logit contributions
river-1.108
evil-1.070
inav-0.994
thumbnails-0.972
tackle-0.972
TokenPower
Token ID13434
Feature activation+2.07e-01

Pos logit contributions
river+0.625
evil+0.605
inav+0.551
thumbnails+0.542
tackle+0.537
Neg logit contributions
��-0.560
oons-0.540
 Authentication-0.537
 Preferences-0.527
namese-0.520
Token tends
Token ID12444
Feature activation+2.06e-01

Pos logit contributions
river+2.080
evil+1.997
inav+1.799
thumbnails+1.771
tackle+1.765
Neg logit contributions
��-2.214
oons-2.114
 Authentication-2.093
namese-2.064
 Preferences-2.049
Token the
Token ID262
Feature activation+1.38e-01

Pos logit contributions
��+0.809
oons+0.785
 Authentication+0.781
 Preferences+0.771
illion+0.753
Neg logit contributions
river-0.948
evil-0.917
inav-0.848
thumbnails-0.828
phia-0.820
Token entire
Token ID2104
Feature activation-6.14e-02

Pos logit contributions
river+1.524
evil+1.479
inav+1.346
tackle+1.335
thumbnails+1.323
Neg logit contributions
��-1.328
oons-1.269
 Authentication-1.267
 Preferences-1.253
illion-1.231
Token defense
Token ID3761
Feature activation+2.32e-01

Pos logit contributions
��+0.588
oons+0.568
 Authentication+0.566
 Preferences+0.558
 Regulatory+0.545
Neg logit contributions
river-0.683
evil-0.660
inav-0.608
thumbnails-0.596
tackle-0.590
Token into
Token ID656
Feature activation-4.64e-03

Pos logit contributions
river+2.390
evil+2.298
inav+2.072
tackle+2.044
thumbnails+2.033
Neg logit contributions
��-2.416
oons-2.332
 Authentication-2.319
 Preferences-2.281
 Regulatory-2.233
Token a
Token ID257
Feature activation+2.44e-01

Pos logit contributions
��+0.046
oons+0.045
 Authentication+0.044
 Preferences+0.044
illion+0.043
Neg logit contributions
river-0.050
evil-0.048
inav-0.044
thumbnails-0.043
tackle-0.043
Token h
Token ID289
Feature activation+1.14e+00

Pos logit contributions
river+2.580
evil+2.471
inav+2.266
tackle+2.247
thumbnails+2.222
Neg logit contributions
��-2.444
oons-2.384
 Authentication-2.352
 Preferences-2.325
illion-2.287
Tokenuddle
Token ID24500
Feature activation+1.31e-01

Pos logit contributions
river+7.365
evil+6.960
alid+6.363
tackle+6.326
angle+6.287
Neg logit contributions
��-14.161
���-14.022
 Preferences-13.456
namese-13.429
 Authentication-13.148
Token and
Token ID290
Feature activation-1.78e-01

Pos logit contributions
river+1.323
evil+1.259
inav+1.145
tackle+1.115
thumbnails+1.111
Neg logit contributions
��-1.400
oons-1.355
 Authentication-1.351
 Preferences-1.331
 Regulatory-1.305
Token delivered
Token ID6793
Feature activation+1.54e-01

Pos logit contributions
��+1.794
 Authentication+1.741
oons+1.740
 Preferences+1.714
elle+1.694
Neg logit contributions
river-1.868
evil-1.796
inav-1.657
thumbnails-1.607
phia-1.593
Token a
Token ID257
Feature activation+5.40e-01

Pos logit contributions
river+1.672
evil+1.599
inav+1.452
tackle+1.443
thumbnails+1.435
Neg logit contributions
��-1.535
oons-1.476
 Authentication-1.468
 Preferences-1.449
illion-1.419
Token colorful
Token ID20239
Feature activation+1.89e-01

Pos logit contributions
river+5.503
evil+5.169
tackle+4.827
 Leh+4.671
 dise+4.632
Neg logit contributions
��-5.507
oons-5.291
 Authentication-5.186
 Preferences-5.138
illion-4.970
Token2
Token ID17
Feature activation+1.08e-01

Pos logit contributions
��+0.061
oons+0.059
 Authentication+0.059
 Preferences+0.058
illion+0.057
Neg logit contributions
river-0.053
evil-0.051
inav-0.046
thumbnails-0.045
tackle-0.045
Token K
Token ID509
Feature activation+8.92e-02

Pos logit contributions
river+1.216
evil+1.172
inav+1.071
thumbnails+1.052
tackle+1.048
Neg logit contributions
��-1.028
oons-0.985
 Authentication-0.983
 Preferences-0.966
namese-0.949
Token.
Token ID13
Feature activation+2.03e-01

Pos logit contributions
river+0.872
evil+0.834
inav+0.756
thumbnails+0.734
tackle+0.732
Neg logit contributions
��-0.977
oons-0.945
 Authentication-0.944
 Preferences-0.931
namese-0.912
Token
Token ID198
Feature activation+2.51e-01

Pos logit contributions
river+2.253
evil+2.174
inav+1.985
thumbnails+1.939
tackle+1.920
Neg logit contributions
��-1.947
oons-1.859
namese-1.824
 Authentication-1.801
illion-1.795
Token
Token ID198
Feature activation+1.60e-01

Pos logit contributions
river+2.395
evil+2.297
inav+2.046
thumbnails+1.988
tackle+1.975
Neg logit contributions
��-2.910
oons-2.704
 Authentication-2.679
namese-2.665
 Preferences-2.654
TokenS
Token ID50
Feature activation+1.13e+00

Pos logit contributions
river+1.777
evil+1.720
inav+1.548
thumbnails+1.535
tackle+1.529
Neg logit contributions
��-1.544
 Authentication-1.490
oons-1.485
 Preferences-1.456
namese-1.447
Tokenensitivity
Token ID40545
Feature activation+5.42e-01

Pos logit contributions
evil+10.057
river+10.053
alid+9.195
inav+9.002
phia+8.814
Neg logit contributions
��-10.689
 Authentication-10.496
 Regulatory-9.936
namese-9.759
 Tess-9.666
Token to
Token ID284
Feature activation+2.93e-02

Pos logit contributions
river+4.956
evil+4.617
 dise+4.169
alid+4.127
thumbnails+4.092
Neg logit contributions
��-5.985
oons-5.679
namese-5.601
 Authentication-5.597
illion-5.593
Token doubled
Token ID15229
Feature activation+1.17e-01

Pos logit contributions
river+0.311
evil+0.300
inav+0.275
thumbnails+0.268
tackle+0.267
Neg logit contributions
��-0.296
oons-0.286
 Authentication-0.284
 Preferences-0.280
illion-0.276
Token CO
Token ID7375
Feature activation+2.56e-01

Pos logit contributions
river+1.193
evil+1.147
inav+1.047
thumbnails+1.009
tackle+1.007
Neg logit contributions
��-1.244
oons-1.190
 Authentication-1.190
 Preferences-1.170
elle-1.150
Token2
Token ID17
Feature activation+6.13e-01

Pos logit contributions
river+2.130
evil+2.049
inav+1.804
thumbnails+1.751
tackle+1.743
Neg logit contributions
��-3.144
oons-3.057
 Authentication-3.017
illion-2.979
 Preferences-2.969
Tokens
Token ID82
Feature activation+1.85e-01

Pos logit contributions
river+1.724
evil+1.639
inav+1.484
thumbnails+1.444
tackle+1.441
Neg logit contributions
��-1.994
oons-1.934
 Authentication-1.925
 Preferences-1.888
illion-1.862
Token vision
Token ID5761
Feature activation+2.47e-01

Pos logit contributions
river+1.928
evil+1.844
inav+1.701
phia+1.637
thumbnails+1.636
Neg logit contributions
��-1.912
oons-1.832
 Authentication-1.808
 Preferences-1.777
illion-1.762
Token for
Token ID329
Feature activation-2.49e-02

Pos logit contributions
river+2.302
evil+2.159
inav+1.954
alid+1.894
tackle+1.879
Neg logit contributions
��-2.785
 Authentication-2.714
oons-2.701
 Preferences-2.644
namese-2.611
Token,
Token ID11
Feature activation+7.41e-01

Pos logit contributions
��+0.237
oons+0.229
 Authentication+0.228
 Preferences+0.225
illion+0.219
Neg logit contributions
river-0.278
evil-0.268
inav-0.247
thumbnails-0.242
tackle-0.241
Token in
Token ID287
Feature activation+3.34e-01

Pos logit contributions
river+6.043
evil+5.606
inav+5.161
tackle+5.050
alid+4.926
Neg logit contributions
��-8.658
oons-8.412
illion-8.143
namese-8.137
 Authentication-8.003
Token her
Token ID607
Feature activation+1.12e+00

Pos logit contributions
river+3.520
evil+3.397
inav+3.081
phia+3.017
tackle+3.008
Neg logit contributions
��-3.315
oons-3.130
 Authentication-3.100
namese-3.082
 Preferences-3.040
Token words
Token ID2456
Feature activation+1.86e-01

Pos logit contributions
river+7.442
evil+7.087
phia+6.331
inav+6.245
 dise+6.038
Neg logit contributions
��-13.849
namese-12.346
oons-12.258
 Authentication-12.248
illion-12.208
Token,
Token ID11
Feature activation+1.91e-01

Pos logit contributions
river+1.760
evil+1.667
inav+1.524
thumbnails+1.466
tackle+1.460
Neg logit contributions
��-2.075
oons-2.026
 Authentication-2.021
 Preferences-1.990
illion-1.963
Token
Token ID564
Feature activation-2.88e-03

Pos logit contributions
river+2.016
evil+1.931
inav+1.787
alid+1.718
thumbnails+1.716
Neg logit contributions
��-1.916
oons-1.861
 Authentication-1.842
 Preferences-1.810
illion-1.785
Token
Token ID250
Feature activation-1.08e-02

Pos logit contributions
��+0.018
oons+0.017
 Authentication+0.017
 Preferences+0.017
illion+0.016
Neg logit contributions
river-0.042
evil-0.040
inav-0.038
thumbnails-0.037
tackle-0.037
Tokena
Token ID64
Feature activation+1.99e-02

Pos logit contributions
��+0.108
oons+0.104
 Authentication+0.104
 Preferences+0.102
illion+0.100
Neg logit contributions
river-0.116
evil-0.112
inav-0.102
thumbnails-0.100
tackle-0.100
Token for
Token ID329
Feature activation+1.10e-01

Pos logit contributions
river+3.137
evil+3.055
inav+2.636
tackle+2.623
phia+2.560
Neg logit contributions
��-4.158
 Authentication-3.941
oons-3.904
 Regulatory-3.834
 Preferences-3.831
Token the
Token ID262
Feature activation+5.60e-01

Pos logit contributions
river+1.169
evil+1.133
inav+1.031
tackle+1.010
thumbnails+1.007
Neg logit contributions
��-1.103
oons-1.052
 Authentication-1.051
 Preferences-1.042
elle-1.018
Token next
Token ID1306
Feature activation+2.40e-01

Pos logit contributions
river+4.998
evil+4.800
 dise+4.277
inav+4.241
 Leh+4.214
Neg logit contributions
��-6.273
 Authentication-5.889
oons-5.882
 Preferences-5.865
illion-5.826
Token generation
Token ID5270
Feature activation+1.26e-01

Pos logit contributions
river+2.389
evil+2.270
inav+2.043
tackle+2.020
alid+2.001
Neg logit contributions
��-2.596
 Authentication-2.479
oons-2.476
 Preferences-2.452
namese-2.399
Token.
Token ID13
Feature activation+3.07e-01

Pos logit contributions
river+1.167
evil+1.115
inav+1.003
thumbnails+0.978
tackle+0.977
Neg logit contributions
��-1.452
 Authentication-1.400
oons-1.396
 Preferences-1.381
 Regulatory-1.355
Token Rest
Token ID8324
Feature activation+1.12e+00

Pos logit contributions
river+3.242
evil+3.129
inav+2.835
 Leh+2.772
tackle+2.770
Neg logit contributions
��-3.113
oons-2.904
namese-2.900
illion-2.830
 Authentication-2.813
Tokenraining
Token ID24674
Feature activation+1.62e-01

Pos logit contributions
evil+9.436
river+9.392
inav+8.102
tackle+8.078
ivable+7.803
Neg logit contributions
��-12.042
 Authentication-10.994
oons-10.906
namese-10.766
theless-10.746
Token future
Token ID2003
Feature activation+7.15e-03

Pos logit contributions
river+1.760
evil+1.697
inav+1.541
tackle+1.508
thumbnails+1.503
Neg logit contributions
��-1.614
oons-1.534
 Authentication-1.533
 Preferences-1.503
namese-1.478
Token debt
Token ID5057
Feature activation+3.66e-01

Pos logit contributions
river+0.071
evil+0.068
inav+0.062
thumbnails+0.060
tackle+0.060
Neg logit contributions
��-0.078
oons-0.075
 Authentication-0.075
 Preferences-0.074
illion-0.072
Token is
Token ID318
Feature activation-6.38e-02

Pos logit contributions
river+3.917
evil+3.743
inav+3.413
thumbnails+3.369
tackle+3.284
Neg logit contributions
��-3.698
 Authentication-3.502
oons-3.405
 Preferences-3.361
theless-3.350
Token a
Token ID257
Feature activation+8.82e-02

Pos logit contributions
��+0.619
oons+0.599
 Authentication+0.597
 Preferences+0.589
illion+0.576
Neg logit contributions
river-0.698
evil-0.673
inav-0.622
thumbnails-0.608
tackle-0.605
Token or
Token ID393
Feature activation-1.78e-01

Pos logit contributions
river+1.562
evil+1.513
inav+1.376
thumbnails+1.350
tackle+1.340
Neg logit contributions
��-1.514
 Authentication-1.449
oons-1.444
 Preferences-1.423
 Regulatory-1.398
Token called
Token ID1444
Feature activation-2.65e-01

Pos logit contributions
��+1.717
oons+1.667
 Authentication+1.658
 Preferences+1.646
illion+1.605
Neg logit contributions
river-1.952
evil-1.878
inav-1.746
tackle-1.705
thumbnails-1.693
Token female
Token ID4048
Feature activation+1.36e-01

Pos logit contributions
��+2.353
oons+2.320
 Authentication+2.294
 Preferences+2.269
elle+2.234
Neg logit contributions
river-3.040
evil-2.919
inav-2.777
tackle-2.702
thumbnails-2.697
Token Hillary
Token ID5073
Feature activation+6.51e-02

Pos logit contributions
river+1.490
evil+1.436
inav+1.314
thumbnails+1.286
phia+1.279
Neg logit contributions
��-1.324
oons-1.277
 Authentication-1.260
 Preferences-1.238
 Regulatory-1.224
Token supporters
Token ID5941
Feature activation+1.80e-01

Pos logit contributions
river+0.873
evil+0.850
inav+0.790
thumbnails+0.778
phia+0.773
Neg logit contributions
��-0.470
oons-0.449
 Authentication-0.445
 Preferences-0.436
 Regulatory-0.426
Token sl
Token ID1017
Feature activation+1.11e+00

Pos logit contributions
river+1.863
evil+1.812
inav+1.619
thumbnails+1.584
phia+1.560
Neg logit contributions
��-1.868
oons-1.797
 Authentication-1.797
 Preferences-1.761
 Regulatory-1.739
Tokenuts
Token ID5500
Feature activation+4.25e-01

Pos logit contributions
river+8.272
evil+7.792
alid+7.356
ivable+6.990
avering+6.780
Neg logit contributions
 Regulatory-12.818
 Authentication-12.793
��-12.791
 Preferences-12.525
theless-12.220
Token and
Token ID290
Feature activation+4.15e-02

Pos logit contributions
river+3.774
evil+3.673
inav+3.193
alid+3.089
thumbnails+3.084
Neg logit contributions
��-4.835
 Authentication-4.759
 Regulatory-4.652
oons-4.619
 Preferences-4.613
Token wh
Token ID348
Feature activation+2.28e-01

Pos logit contributions
river+0.437
evil+0.422
inav+0.385
thumbnails+0.377
tackle+0.375
Neg logit contributions
��-0.423
oons-0.408
 Authentication-0.407
 Preferences-0.402
illion-0.393
Tokenores
Token ID2850
Feature activation+1.41e-01

Pos logit contributions
river+1.976
evil+1.891
inav+1.679
thumbnails+1.641
alid+1.630
Neg logit contributions
��-2.757
 Authentication-2.676
 Preferences-2.632
oons-2.622
 Regulatory-2.587
Token?
Token ID30
Feature activation+2.46e-01

Pos logit contributions
river+1.368
evil+1.323
inav+1.185
thumbnails+1.152
phia+1.139
Neg logit contributions
��-1.552
 Authentication-1.507
oons-1.496
 Preferences-1.481
 Regulatory-1.464
Token're
Token ID821
Feature activation+1.13e-01

Pos logit contributions
river+1.360
evil+1.310
inav+1.186
thumbnails+1.157
tackle+1.149
Neg logit contributions
��-1.499
oons-1.444
 Authentication-1.427
 Preferences-1.414
illion-1.387
Token almost
Token ID2048
Feature activation-3.99e-01

Pos logit contributions
river+1.219
evil+1.173
inav+1.048
thumbnails+1.038
phia+1.037
Neg logit contributions
��-1.135
oons-1.090
 Preferences-1.065
 Authentication-1.063
namese-1.048
Token signed
Token ID4488
Feature activation+4.60e-01

Pos logit contributions
 Authentication+3.294
oons+3.292
��+3.247
elle+3.208
illion+3.157
Neg logit contributions
river-4.742
evil-4.634
inav-4.433
thumbnails-4.282
tackle-4.232
Token up
Token ID510
Feature activation+4.25e-01

Pos logit contributions
river+3.552
evil+3.330
thumbnails+2.921
inav+2.842
tackle+2.819
Neg logit contributions
��-6.095
oons-5.620
namese-5.568
 Authentication-5.529
 Regulatory-5.522
Token for
Token ID329
Feature activation+4.86e-01

Pos logit contributions
river+3.639
evil+3.306
phia+2.836
inav+2.822
 Leh+2.807
Neg logit contributions
��-5.233
oons-4.879
namese-4.843
 Authentication-4.739
ework-4.724
Token News
Token ID3000
Feature activation+1.11e+00

Pos logit contributions
river+4.661
evil+4.494
inav+4.036
thumbnails+4.033
 Leh+4.023
Neg logit contributions
��-5.183
oons-4.932
illion-4.880
namese-4.745
elle-4.699
Token Alert
Token ID23276
Feature activation+1.57e-01

Pos logit contributions
river+10.746
evil+9.823
thumbnails+9.470
tackle+9.241
 Leh+9.169
Neg logit contributions
��-10.187
namese-9.397
oons-9.121
illion-8.822
 Authentication-8.704
Tokens
Token ID82
Feature activation+4.10e-01

Pos logit contributions
river+1.381
evil+1.316
inav+1.182
thumbnails+1.161
alid+1.135
Neg logit contributions
��-1.881
oons-1.792
namese-1.767
illion-1.766
 Authentication-1.765
Token Keep
Token ID9175
Feature activation+3.73e-01

Pos logit contributions
river+3.533
evil+3.344
thumbnails+2.996
inav+2.960
phia+2.957
Neg logit contributions
��-4.841
oons-4.610
namese-4.546
illion-4.522
 Preferences-4.371
Token an
Token ID281
Feature activation+4.35e-01

Pos logit contributions
river+3.841
evil+3.736
thumbnails+3.346
inav+3.286
tackle+3.282
Neg logit contributions
��-3.883
oons-3.685
illion-3.570
namese-3.549
 Preferences-3.527
Token eye
Token ID4151
Feature activation-3.62e-01

Pos logit contributions
river+3.920
evil+3.705
thumbnails+3.342
alid+3.287
inav+3.168
Neg logit contributions
��-5.190
oons-4.872
namese-4.701
 Preferences-4.655
illion-4.648
Token was
Token ID373
Feature activation-3.93e-01

Pos logit contributions
river+1.914
evil+1.825
inav+1.661
thumbnails+1.632
tackle+1.624
Neg logit contributions
��-2.149
oons-2.085
 Authentication-2.068
 Preferences-2.055
 Regulatory-2.013
Token iconic
Token ID14133
Feature activation+2.69e-03

Pos logit contributions
��+3.520
 Authentication+3.434
oons+3.406
 Preferences+3.347
namese+3.328
Neg logit contributions
river-4.375
evil-4.274
inav-4.061
phia-3.871
tackle-3.848
Token (
Token ID357
Feature activation-2.66e-01

Pos logit contributions
river+0.027
evil+0.026
inav+0.024
thumbnails+0.023
tackle+0.023
Neg logit contributions
��-0.029
oons-0.028
 Authentication-0.028
 Preferences-0.027
illion-0.027
Tokenmore
Token ID3549
Feature activation-7.29e-01

Pos logit contributions
��+2.551
oons+2.485
 Authentication+2.454
 Preferences+2.417
illion+2.413
Neg logit contributions
river-2.842
evil-2.724
inav-2.587
thumbnails-2.482
phia-2.475
Token on
Token ID319
Feature activation-5.72e-01

Pos logit contributions
oons+6.003
��+5.979
 Authentication+5.865
 Preferences+5.802
elle+5.794
Neg logit contributions
river-7.763
evil-7.688
inav-7.413
alid-7.073
phia-7.058
Token that
Token ID326
Feature activation-1.22e+00

Pos logit contributions
��+4.656
oons+4.596
 Authentication+4.567
 Preferences+4.547
 Regulatory+4.480
Neg logit contributions
river-6.663
evil-6.441
inav-6.028
phia-5.928
thumbnails-5.806
Token later
Token ID1568
Feature activation+1.32e-02

Pos logit contributions
 Preferences+7.394
elle+7.276
draft+7.251
 Regulatory+7.219
 Colleg+7.147
Neg logit contributions
river-13.469
evil-13.379
inav-12.902
thumbnails-12.369
alid-12.161
Token),
Token ID828
Feature activation-8.60e-02

Pos logit contributions
river+0.123
evil+0.118
inav+0.107
thumbnails+0.104
tackle+0.103
Neg logit contributions
��-0.151
oons-0.147
 Authentication-0.146
 Preferences-0.144
illion-0.141
Token but
Token ID475
Feature activation-2.01e-01

Pos logit contributions
��+0.848
oons+0.818
 Authentication+0.816
 Preferences+0.805
illion+0.789
Neg logit contributions
river-0.931
evil-0.898
inav-0.829
thumbnails-0.807
tackle-0.803
Token he
Token ID339
Feature activation-3.58e-02

Pos logit contributions
��+1.871
oons+1.817
 Authentication+1.812
 Preferences+1.782
 Regulatory+1.739
Neg logit contributions
river-2.255
evil-2.195
inav-2.036
thumbnails-1.985
tackle-1.960
Token couldn
Token ID3521
Feature activation+1.77e-01

Pos logit contributions
��+0.384
oons+0.373
 Authentication+0.371
 Preferences+0.366
illion+0.360
Neg logit contributions
river-0.358
evil-0.344
inav-0.314
thumbnails-0.306
tackle-0.304
Token for
Token ID329
Feature activation+4.96e-02

Pos logit contributions
��+2.767
 Preferences+2.666
oons+2.663
 Authentication+2.626
elle+2.582
Neg logit contributions
river-3.733
evil-3.662
inav-3.365
tackle-3.308
alid-3.301
Tokends
Token ID9310
Feature activation-1.87e-01

Pos logit contributions
river+0.527
evil+0.505
inav+0.463
thumbnails+0.452
tackle+0.448
Neg logit contributions
��-0.503
 Authentication-0.484
oons-0.484
 Preferences-0.479
 Regulatory-0.466
Token
Token ID447
Feature activation+3.43e-02

Pos logit contributions
��+1.932
oons+1.872
 Authentication+1.858
 Preferences+1.838
elle+1.813
Neg logit contributions
river-1.908
evil-1.856
inav-1.706
thumbnails-1.664
tackle-1.660
Token
Token ID251
Feature activation-1.49e-01

Pos logit contributions
river+0.512
evil+0.499
inav+0.469
thumbnails+0.462
tackle+0.461
Neg logit contributions
��-0.194
oons-0.188
 Authentication-0.186
 Preferences-0.181
illion-0.174
Token
Token ID960
Feature activation-3.67e-01

Pos logit contributions
��+1.504
oons+1.457
 Authentication+1.446
 Preferences+1.431
elle+1.406
Neg logit contributions
river-1.558
evil-1.514
inav-1.396
thumbnails-1.355
tackle-1.355
Tokencross
Token ID19692
Feature activation-1.19e+00

Pos logit contributions
��+3.656
oons+3.578
 Authentication+3.491
elle+3.469
 Preferences+3.461
Neg logit contributions
river-3.659
evil-3.589
inav-3.381
phia-3.246
alid-3.229
Tokening
Token ID278
Feature activation-7.89e-01

Pos logit contributions
oons+9.153
elle+8.854
draft+8.838
��+8.810
illion+8.579
Neg logit contributions
evil-11.675
inav-11.189
river-11.166
alid-10.694
tackle-10.637
Token highly
Token ID4047
Feature activation-3.16e-01

Pos logit contributions
��+6.773
oons+6.598
 Preferences+6.540
 Regulatory+6.510
 Authentication+6.504
Neg logit contributions
evil-8.128
river-7.759
inav-7.591
tackle-7.459
phia-7.374
Token traff
Token ID11878
Feature activation-1.94e-01

Pos logit contributions
��+3.136
 Authentication+3.052
oons+3.035
 Preferences+3.012
 Regulatory+2.973
Neg logit contributions
river-3.256
evil-3.199
inav-2.945
phia-2.866
tackle-2.865
Tokenicked
Token ID9484
Feature activation-3.41e-01

Pos logit contributions
��+1.558
oons+1.541
 Authentication+1.512
 Preferences+1.489
illion+1.477
Neg logit contributions
river-2.406
evil-2.333
inav-2.183
tackle-2.138
thumbnails-2.134
Token thorough
Token ID9321
Feature activation-1.72e-01

Pos logit contributions
��+3.128
oons+3.029
 Authentication+3.006
 Preferences+2.988
 Regulatory+2.937
Neg logit contributions
evil-3.707
river-3.706
inav-3.402
tackle-3.370
thumbnails-3.351
Token's
Token ID338
Feature activation-6.54e-01

Pos logit contributions
 Preferences+4.655
��+4.645
 Authentication+4.626
 Regulatory+4.578
oons+4.577
Neg logit contributions
river-5.354
evil-5.352
thumbnails-4.971
inav-4.902
tackle-4.777
Token financial
Token ID3176
Feature activation-4.52e-01

Pos logit contributions
 Preferences+5.713
 Regulatory+5.662
 Authentication+5.661
 Prem+5.536
��+5.530
Neg logit contributions
evil-7.089
river-7.087
thumbnails-6.611
inav-6.521
tackle-6.421
Token counselling
Token ID47586
Feature activation-2.95e-01

Pos logit contributions
��+4.161
 Preferences+4.143
 Authentication+4.072
oons+4.054
 Regulatory+4.003
Neg logit contributions
river-4.841
evil-4.813
thumbnails-4.350
inav-4.309
alid-4.245
Token service
Token ID2139
Feature activation-4.13e-01

Pos logit contributions
��+2.799
oons+2.780
 Preferences+2.752
 Authentication+2.740
 Regulatory+2.701
Neg logit contributions
river-3.174
evil-3.125
inav-2.875
thumbnails-2.867
phia-2.793
Token or
Token ID393
Feature activation-6.99e-01

Pos logit contributions
��+3.910
oons+3.891
 Authentication+3.869
 Preferences+3.858
 Regulatory+3.769
Neg logit contributions
river-4.396
evil-4.346
inav-3.985
thumbnails-3.970
tackle-3.863
Token call
Token ID869
Feature activation-1.15e+00

Pos logit contributions
 Preferences+5.792
 Authentication+5.767
 Prem+5.622
��+5.546
oons+5.539
Neg logit contributions
evil-7.811
river-7.738
inav-7.365
thumbnails-7.209
alid-7.094
Token K
Token ID509
Feature activation-4.99e-01

Pos logit contributions
 Authentication+7.432
 Prem+7.299
 Preferences+7.214
 Regulatory+7.134
��+7.053
Neg logit contributions
evil-13.717
river-13.193
inav-13.107
alid-12.961
tackle-12.851
Tokenild
Token ID688
Feature activation-4.35e-04

Pos logit contributions
oons+4.008
��+3.957
elle+3.895
illion+3.866
 Authentication+3.710
Neg logit contributions
river-6.039
evil-5.805
thumbnails-5.568
phia-5.445
inav-5.400
Tokenon
Token ID261
Feature activation-7.35e-02

Pos logit contributions
��+0.006
oons+0.005
 Authentication+0.005
 Preferences+0.005
illion+0.005
Neg logit contributions
river-0.003
evil-0.003
inav-0.003
thumbnails-0.003
tackle-0.003
Tokenan
Token ID272
Feature activation-4.81e-01

Pos logit contributions
��+0.910
oons+0.894
 Authentication+0.888
 Preferences+0.877
illion+0.868
Neg logit contributions
river-0.605
evil-0.575
inav-0.514
thumbnails-0.499
tackle-0.496
Token for
Token ID329
Feature activation-9.22e-01

Pos logit contributions
��+4.434
 Authentication+4.389
 Preferences+4.378
oons+4.310
 Regulatory+4.299
Neg logit contributions
river-5.248
evil-5.115
inav-4.805
thumbnails-4.671
tackle-4.588
Token take
Token ID1011
Feature activation-4.91e-01

Pos logit contributions
oons+4.114
 Authentication+4.098
��+4.076
elle+4.073
 Preferences+4.025
Neg logit contributions
river-4.325
evil-4.057
inav-3.977
thumbnails-3.896
phia-3.830
Token unto
Token ID12722
Feature activation-2.63e-01

Pos logit contributions
��+4.306
elle+4.265
 Authentication+4.234
oons+4.225
 Preferences+4.197
Neg logit contributions
river-5.226
evil-5.078
inav-4.911
thumbnails-4.841
tackle-4.742
Token thy
Token ID11906
Feature activation-1.65e-01

Pos logit contributions
��+2.642
 Authentication+2.569
oons+2.553
 Preferences+2.522
elle+2.481
Neg logit contributions
river-2.721
evil-2.593
inav-2.449
thumbnails-2.387
tackle-2.357
Tokenself
Token ID944
Feature activation-6.42e-02

Pos logit contributions
��+1.475
 Authentication+1.419
oons+1.418
 Preferences+1.397
elle+1.375
Neg logit contributions
river-1.917
evil-1.838
inav-1.735
thumbnails-1.711
tackle-1.682
Token.
Token ID13
Feature activation+4.20e-02

Pos logit contributions
��+0.671
oons+0.650
 Authentication+0.647
 Preferences+0.640
illion+0.627
Neg logit contributions
river-0.654
evil-0.630
inav-0.579
thumbnails-0.565
tackle-0.561
Token De
Token ID1024
Feature activation-1.15e+00

Pos logit contributions
river+0.424
evil+0.408
inav+0.370
thumbnails+0.362
tackle+0.359
Neg logit contributions
��-0.447
oons-0.432
 Authentication-0.429
 Preferences-0.425
illion-0.417
Tokenuter
Token ID11894
Feature activation-2.27e-01

Pos logit contributions
arest+9.946
��+9.582
elle+9.455
 Authentication+9.191
oons+9.147
Neg logit contributions
river-10.574
tackle-10.153
thumbnails-10.001
inav-9.960
evil-9.885
Tokenonomy
Token ID30565
Feature activation-1.07e-01

Pos logit contributions
��+3.463
oons+3.428
 Authentication+3.388
 Preferences+3.385
elle+3.332
Neg logit contributions
river-1.208
evil-1.087
inav-0.947
tackle-0.892
phia-0.833
Token 20
Token ID1160
Feature activation-5.85e-02

Pos logit contributions
��+0.908
oons+0.883
 Authentication+0.879
 Preferences+0.870
 Regulatory+0.844
Neg logit contributions
river-1.287
evil-1.249
inav-1.165
thumbnails-1.139
tackle-1.138
Token:
Token ID25
Feature activation-2.82e-01

Pos logit contributions
��+0.596
oons+0.577
 Authentication+0.574
 Preferences+0.569
 Regulatory+0.555
Neg logit contributions
river-0.614
evil-0.590
inav-0.543
thumbnails-0.529
tackle-0.528
Token 12
Token ID1105
Feature activation+7.61e-02

Pos logit contributions
��+2.645
 Authentication+2.631
oons+2.622
 Preferences+2.591
 Regulatory+2.529
Neg logit contributions
river-3.040
evil-2.942
inav-2.763
phia-2.686
thumbnails-2.678
Token in
Token ID287
Feature activation-5.31e-01

Pos logit contributions
��+3.236
oons+3.225
 Authentication+3.195
 Preferences+3.185
elle+3.120
Neg logit contributions
river-3.654
evil-3.535
inav-3.285
thumbnails-3.210
tackle-3.201
Token Monday
Token ID3321
Feature activation+4.82e-02

Pos logit contributions
��+4.300
oons+4.277
 Authentication+4.131
 Preferences+4.103
elle+4.029
Neg logit contributions
river-6.363
evil-6.110
inav-5.816
thumbnails-5.764
tackle-5.575
Token's
Token ID338
Feature activation-3.59e-02

Pos logit contributions
river+0.457
evil+0.437
inav+0.394
thumbnails+0.384
tackle+0.384
Neg logit contributions
��-0.542
oons-0.526
 Authentication-0.523
 Preferences-0.518
illion-0.507
Token incident
Token ID4519
Feature activation-5.14e-02

Pos logit contributions
��+0.287
oons+0.275
 Authentication+0.273
 Preferences+0.269
illion+0.262
Neg logit contributions
river-0.456
evil-0.442
inav-0.412
thumbnails-0.404
tackle-0.402
Token should
Token ID815
Feature activation-5.41e-01

Pos logit contributions
��+0.546
oons+0.530
 Authentication+0.528
 Preferences+0.522
elle+0.511
Neg logit contributions
river-0.516
evil-0.497
inav-0.454
thumbnails-0.443
tackle-0.440
Token call
Token ID869
Feature activation-1.12e+00

Pos logit contributions
 Authentication+4.016
oons+3.982
��+3.964
 Preferences+3.940
 Regulatory+3.823
Neg logit contributions
river-6.731
evil-6.558
inav-6.288
thumbnails-6.123
alid-5.979
Token the
Token ID262
Feature activation-9.36e-01

Pos logit contributions
 Regulatory+7.667
 Authentication+7.666
 Tess+7.473
 Prem+7.447
 Directorate+7.425
Neg logit contributions
evil-12.314
inav-12.134
thumbnails-11.876
river-11.654
alid-11.577
Token Long
Token ID5882
Feature activation+5.51e-02

Pos logit contributions
 Regulatory+6.270
 Authentication+6.245
 Tess+6.179
 Directorate+6.153
 Cummings+6.144
Neg logit contributions
evil-10.934
river-10.872
inav-10.517
thumbnails-10.429
phia-10.127
Tokenview
Token ID1177
Feature activation-4.70e-01

Pos logit contributions
river+0.398
evil+0.374
inav+0.327
thumbnails+0.315
tackle+0.313
Neg logit contributions
��-0.747
oons-0.727
 Authentication-0.725
 Preferences-0.718
illion-0.706
Token Police
Token ID4287
Feature activation-1.65e-01

Pos logit contributions
��+3.525
 Authentication+3.464
 Preferences+3.441
oons+3.441
 Regulatory+3.386
Neg logit contributions
river-5.911
evil-5.826
inav-5.475
thumbnails-5.363
tackle-5.309
Token Department
Token ID2732
Feature activation-6.05e-01

Pos logit contributions
��+1.579
 Authentication+1.540
oons+1.531
 Preferences+1.514
 Regulatory+1.492
Neg logit contributions
river-1.811
evil-1.763
inav-1.628
thumbnails-1.605
tackle-1.576
Token be
Token ID307
Feature activation-1.57e-01

Pos logit contributions
river+3.974
evil+3.846
inav+3.449
thumbnails+3.415
tackle+3.366
Neg logit contributions
��-4.074
 Authentication-3.859
oons-3.854
 Preferences-3.784
namese-3.783
Token another
Token ID1194
Feature activation-1.74e-01

Pos logit contributions
��+1.549
oons+1.504
 Authentication+1.486
 Preferences+1.467
elle+1.449
Neg logit contributions
river-1.677
evil-1.612
inav-1.499
tackle-1.451
thumbnails-1.449
Token place
Token ID1295
Feature activation-2.42e-01

Pos logit contributions
��+1.696
oons+1.665
 Authentication+1.653
 Preferences+1.636
 Regulatory+1.603
Neg logit contributions
river-1.868
evil-1.799
inav-1.657
thumbnails-1.621
tackle-1.615
Token like
Token ID588
Feature activation+8.97e-02

Pos logit contributions
��+2.144
oons+2.099
 Authentication+2.073
 Preferences+2.056
elle+2.026
Neg logit contributions
river-2.800
evil-2.706
inav-2.537
tackle-2.467
thumbnails-2.465
Token the
Token ID262
Feature activation+5.50e-02

Pos logit contributions
river+1.011
evil+0.975
inav+0.901
thumbnails+0.881
tackle+0.874
Neg logit contributions
��-0.850
oons-0.817
 Authentication-0.806
 Preferences-0.792
namese-0.785
Token White
Token ID2635
Feature activation-1.11e+00

Pos logit contributions
river+0.614
evil+0.596
inav+0.548
thumbnails+0.536
tackle+0.531
Neg logit contributions
��-0.526
oons-0.504
 Authentication-0.501
 Preferences-0.492
illion-0.485
Token Rabbit
Token ID25498
Feature activation-6.31e-01

Pos logit contributions
elle+8.666
 Preferences+8.065
��+7.992
 Cummings+7.895
 Tess+7.888
Neg logit contributions
evil-11.698
river-11.540
inav-11.310
BILITY-11.048
thumbnails-10.854
Token.
Token ID13
Feature activation-3.85e-02

Pos logit contributions
oons+5.646
 Preferences+5.617
 Authentication+5.599
��+5.579
 Regulatory+5.505
Neg logit contributions
river-6.688
evil-6.592
inav-6.167
thumbnails-5.984
tackle-5.928
Token
Token ID447
Feature activation+1.11e-01

Pos logit contributions
��+0.381
oons+0.369
 Authentication+0.367
 Preferences+0.363
illion+0.354
Neg logit contributions
river-0.415
evil-0.400
inav-0.368
thumbnails-0.359
tackle-0.358
Token
Token ID251
Feature activation+1.74e-02

Pos logit contributions
river+1.923
evil+1.898
inav+1.787
thumbnails+1.764
tackle+1.759
Neg logit contributions
��-0.326
oons-0.322
 Authentication-0.307
 Preferences-0.291
illion-0.280
Token
Token ID198
Feature activation+8.82e-02

Pos logit contributions
river+0.184
evil+0.178
inav+0.162
thumbnails+0.159
tackle+0.158
Neg logit contributions
��-0.176
oons-0.170
 Authentication-0.168
 Preferences-0.166
illion-0.163
Token pay
Token ID1414
Feature activation+4.57e-01

Pos logit contributions
river+0.750
evil+0.726
inav+0.661
tackle+0.645
thumbnails+0.644
Neg logit contributions
��-0.682
oons-0.653
 Authentication-0.649
 Preferences-0.640
 Regulatory-0.625
Token as
Token ID355
Feature activation-3.14e-01

Pos logit contributions
river+4.764
evil+4.566
inav+4.078
phia+4.031
 Leh+4.006
Neg logit contributions
��-4.692
oons-4.435
 Authentication-4.389
namese-4.313
 Preferences-4.233
Token many
Token ID867
Feature activation+2.53e-01

Pos logit contributions
��+3.008
illion+2.919
oons+2.915
 Preferences+2.914
 Authentication+2.896
Neg logit contributions
river-3.371
evil-3.229
inav-3.033
thumbnails-2.946
tackle-2.934
Token bills
Token ID9024
Feature activation-1.44e-02

Pos logit contributions
river+2.785
evil+2.677
inav+2.404
thumbnails+2.403
tackle+2.367
Neg logit contributions
��-2.472
oons-2.362
 Authentication-2.340
 Preferences-2.291
namese-2.258
Token as
Token ID355
Feature activation-1.54e-01

Pos logit contributions
��+0.154
oons+0.149
 Authentication+0.148
 Preferences+0.147
illion+0.144
Neg logit contributions
river-0.144
evil-0.139
inav-0.127
thumbnails-0.124
tackle-0.123
Token it
Token ID340
Feature activation-1.10e+00

Pos logit contributions
��+1.443
oons+1.404
 Authentication+1.395
 Preferences+1.387
illion+1.363
Neg logit contributions
river-1.714
evil-1.654
inav-1.538
tackle-1.497
thumbnails-1.497
Token has
Token ID468
Feature activation-5.00e-01

Pos logit contributions
illion+7.677
elle+7.667
 signs+7.572
 drafts+7.271
 Cummings+7.231
Neg logit contributions
river-12.368
tackle-12.055
evil-12.042
inav-11.614
thumbnails-11.339
Token daily
Token ID4445
Feature activation-7.35e-02

Pos logit contributions
��+4.149
illion+4.136
 Authentication+4.128
 Preferences+4.125
elle+4.055
Neg logit contributions
river-5.563
evil-5.491
inav-5.285
tackle-5.225
thumbnails-5.077
Token tax
Token ID1687
Feature activation-3.84e-01

Pos logit contributions
��+0.761
oons+0.737
 Authentication+0.735
 Preferences+0.730
illion+0.716
Neg logit contributions
river-0.754
evil-0.728
inav-0.668
thumbnails-0.652
tackle-0.650
Token receipts
Token ID29946
Feature activation-7.20e-02

Pos logit contributions
��+2.884
 Preferences+2.824
oons+2.792
 Authentication+2.780
illion+2.745
Neg logit contributions
river-4.932
evil-4.785
inav-4.452
thumbnails-4.413
tackle-4.383
Token.
Token ID13
Feature activation-5.03e-02

Pos logit contributions
��+0.764
oons+0.742
 Authentication+0.739
 Preferences+0.731
illion+0.719
Neg logit contributions
river-0.723
evil-0.697
inav-0.638
thumbnails-0.625
tackle-0.619
Token,
Token ID11
Feature activation-1.91e-01

Pos logit contributions
��+0.457
oons+0.445
 Authentication+0.443
 Preferences+0.438
illion+0.430
Neg logit contributions
river-0.430
evil-0.413
inav-0.378
thumbnails-0.368
tackle-0.367
Token055
Token ID47838
Feature activation-3.41e-02

Pos logit contributions
��+1.958
oons+1.900
 Authentication+1.884
 Preferences+1.858
illion+1.839
Neg logit contributions
river-1.975
evil-1.898
inav-1.743
thumbnails-1.707
tackle-1.695
Token,
Token ID11
Feature activation+1.10e-01

Pos logit contributions
��+0.313
oons+0.301
 Authentication+0.300
 Preferences+0.296
illion+0.289
Neg logit contributions
river-0.393
evil-0.380
inav-0.351
thumbnails-0.344
tackle-0.343
Token and
Token ID290
Feature activation+2.51e-02

Pos logit contributions
river+1.057
evil+1.007
inav+0.916
tackle+0.891
thumbnails+0.889
Neg logit contributions
��-1.215
oons-1.182
 Authentication-1.164
 Preferences-1.149
namese-1.135
Token JPMorgan
Token ID43318
Feature activation-3.31e-01

Pos logit contributions
river+0.320
evil+0.309
inav+0.288
thumbnails+0.282
tackle+0.281
Neg logit contributions
��-0.201
oons-0.194
 Authentication-0.191
 Preferences-0.187
namese-0.183
Token Chase
Token ID16346
Feature activation-1.10e+00

Pos logit contributions
��+2.441
 Authentication+2.387
 Preferences+2.338
oons+2.328
 Regulatory+2.303
Neg logit contributions
river-4.319
evil-4.204
inav-3.936
thumbnails-3.882
tackle-3.831
Token was
Token ID373
Feature activation-1.55e-01

Pos logit contributions
elle+7.791
 Authentication+7.791
 Colleg+7.764
illion+7.756
 Regulatory+7.677
Neg logit contributions
evil-11.884
river-11.872
inav-11.229
thumbnails-11.133
tackle-10.972
Token the
Token ID262
Feature activation-1.97e-01

Pos logit contributions
��+1.463
 Authentication+1.421
oons+1.407
 Preferences+1.397
 Regulatory+1.376
Neg logit contributions
river-1.721
evil-1.667
inav-1.558
thumbnails-1.512
tackle-1.506
Token subject
Token ID2426
Feature activation-5.01e-01

Pos logit contributions
��+1.791
 Authentication+1.750
oons+1.734
 Preferences+1.731
 Regulatory+1.701
Neg logit contributions
river-2.240
evil-2.166
inav-2.010
thumbnails-1.965
tackle-1.962
Token of
Token ID286
Feature activation-2.73e-01

Pos logit contributions
��+4.657
illion+4.525
elle+4.450
oons+4.446
 Authentication+4.412
Neg logit contributions
river-5.353
evil-5.276
inav-4.963
thumbnails-4.906
tackle-4.784
Token 20
Token ID1160
Feature activation-2.38e-02

Pos logit contributions
��+2.226
 Authentication+2.176
 Preferences+2.160
oons+2.144
 Regulatory+2.107
Neg logit contributions
river-3.329
evil-3.236
inav-3.041
thumbnails-2.979
tackle-2.972
Token greatest
Token ID6000
Feature activation-6.90e-03

Pos logit contributions
river+0.345
evil+0.333
inav+0.306
thumbnails+0.299
tackle+0.297
Neg logit contributions
��-0.314
oons-0.302
 Authentication-0.300
 Preferences-0.296
illion-0.291
Token Survivor
Token ID23740
Feature activation+6.13e-02

Pos logit contributions
��+0.069
oons+0.066
 Authentication+0.066
 Preferences+0.065
illion+0.064
Neg logit contributions
river-0.074
evil-0.072
inav-0.066
thumbnails-0.064
tackle-0.064
Token fan
Token ID4336
Feature activation-5.57e-02

Pos logit contributions
river+0.672
evil+0.650
inav+0.597
thumbnails+0.581
tackle+0.579
Neg logit contributions
��-0.598
oons-0.574
 Authentication-0.571
 Preferences-0.565
illion-0.553
Token would
Token ID561
Feature activation-1.11e-01

Pos logit contributions
��+0.550
oons+0.533
 Authentication+0.529
 Preferences+0.524
illion+0.511
Neg logit contributions
river-0.603
evil-0.580
inav-0.535
thumbnails-0.522
tackle-0.522
Token break
Token ID2270
Feature activation-3.78e-01

Pos logit contributions
��+1.095
oons+1.073
 Authentication+1.062
 Preferences+1.059
illion+1.038
Neg logit contributions
river-1.188
evil-1.138
inav-1.053
thumbnails-1.042
tackle-1.031
Token the
Token ID262
Feature activation-1.10e+00

Pos logit contributions
 Preferences+3.288
��+3.272
oons+3.261
 Authentication+3.208
elle+3.127
Neg logit contributions
river-4.350
evil-4.185
inav-3.961
tackle-3.847
thumbnails-3.828
Token CD
Token ID6458
Feature activation-8.25e-03

Pos logit contributions
oons+8.599
 Preferences+8.276
 Prem+8.262
 Authentication+8.018
illion+7.950
Neg logit contributions
river-12.116
evil-11.341
tackle-10.770
inav-10.452
thumbnails-10.415
Token in
Token ID287
Feature activation-3.76e-01

Pos logit contributions
��+0.087
oons+0.083
 Authentication+0.083
 Preferences+0.082
namese+0.080
Neg logit contributions
river-0.085
evil-0.081
inav-0.074
thumbnails-0.073
tackle-0.072
Token two
Token ID734
Feature activation+8.21e-02

Pos logit contributions
��+3.352
 Preferences+3.320
oons+3.301
 Authentication+3.252
elle+3.161
Neg logit contributions
river-4.281
evil-4.115
inav-3.854
thumbnails-3.778
tackle-3.736
Token after
Token ID706
Feature activation-4.25e-01

Pos logit contributions
river+0.832
evil+0.800
inav+0.726
thumbnails+0.707
tackle+0.706
Neg logit contributions
��-0.871
oons-0.838
 Authentication-0.837
 Preferences-0.823
 Regulatory-0.809
Token playing
Token ID2712
Feature activation-4.19e-02

Pos logit contributions
��+3.686
oons+3.650
 Authentication+3.555
illion+3.545
 Preferences+3.543
Neg logit contributions
river-4.902
evil-4.687
inav-4.311
thumbnails-4.310
tackle-4.283
Token ;
Token ID2162
Feature activation-9.24e-01

Pos logit contributions
river+2.056
evil+1.957
thumbnails+1.722
inav+1.689
alid+1.678
Neg logit contributions
��-3.179
oons-3.105
 Authentication-3.063
illion-3.012
 Preferences-3.000
Token density
Token ID12109
Feature activation-1.27e-01

Pos logit contributions
 Preferences+7.442
 Authentication+7.314
oons+7.175
��+7.070
 Tess+7.049
Neg logit contributions
river-10.147
evil-9.803
inav-9.444
phia-9.221
tackle-9.092
TokenGrid
Token ID41339
Feature activation-1.75e-01

Pos logit contributions
��+1.144
oons+1.110
 Authentication+1.100
 Preferences+1.098
elle+1.077
Neg logit contributions
river-1.452
evil-1.411
inav-1.326
thumbnails-1.279
tackle-1.275
Token.
Token ID13
Feature activation-5.04e-01

Pos logit contributions
��+1.879
oons+1.832
 Authentication+1.814
 Preferences+1.807
elle+1.776
Neg logit contributions
river-1.702
evil-1.650
inav-1.529
tackle-1.470
thumbnails-1.466
TokenInit
Token ID31768
Feature activation-2.56e-01

Pos logit contributions
��+4.420
oons+4.306
 Preferences+4.272
 Authentication+4.244
elle+4.225
Neg logit contributions
river-5.537
evil-5.331
inav-5.210
phia-4.937
thumbnails-4.805
Token(
Token ID7
Feature activation-1.09e+00

Pos logit contributions
��+2.444
oons+2.358
 Authentication+2.341
 Preferences+2.331
elle+2.290
Neg logit contributions
river-2.790
evil-2.721
inav-2.522
tackle-2.446
thumbnails-2.433
Token 0
Token ID657
Feature activation+3.99e-01

Pos logit contributions
��+8.552
 Preferences+8.530
 Tess+8.384
oons+8.368
 Authentication+8.290
Neg logit contributions
river-11.282
evil-10.907
inav-10.647
phia-10.190
tackle-10.187
Token.
Token ID13
Feature activation-1.08e-01

Pos logit contributions
river+3.271
evil+3.115
inav+2.664
thumbnails+2.657
alid+2.656
Neg logit contributions
��-4.814
 Preferences-4.670
oons-4.650
 Authentication-4.648
namese-4.640
Token0
Token ID15
Feature activation+3.97e-01

Pos logit contributions
��+0.982
oons+0.948
 Authentication+0.944
 Preferences+0.931
illion+0.907
Neg logit contributions
river-1.246
evil-1.203
inav-1.115
thumbnails-1.089
tackle-1.086
Tokenf
Token ID69
Feature activation-1.77e-01

Pos logit contributions
river+3.129
evil+2.965
inav+2.552
alid+2.496
thumbnails+2.493
Neg logit contributions
��-4.917
oons-4.800
 Preferences-4.792
 Authentication-4.788
namese-4.751
Token )
Token ID1267
Feature activation+1.69e-01

Pos logit contributions
��+1.992
oons+1.932
 Authentication+1.916
 Preferences+1.896
illion+1.872
Neg logit contributions
river-1.669
evil-1.605
inav-1.454
tackle-1.421
thumbnails-1.418
Token some
Token ID617
Feature activation-1.72e-01

Pos logit contributions
��+5.213
 Authentication+5.210
oons+5.203
 Preferences+5.192
 Prem+5.085
Neg logit contributions
river-7.564
evil-7.369
tackle-6.919
inav-6.807
thumbnails-6.746
Token universe
Token ID6881
Feature activation+5.21e-02

Pos logit contributions
��+1.698
oons+1.666
 Authentication+1.654
 Preferences+1.644
 Regulatory+1.606
Neg logit contributions
river-1.835
evil-1.770
inav-1.622
tackle-1.591
thumbnails-1.590
Token,
Token ID11
Feature activation-3.39e-01

Pos logit contributions
river+0.531
evil+0.511
inav+0.466
thumbnails+0.455
tackle+0.452
Neg logit contributions
��-0.550
oons-0.532
 Authentication-0.529
 Preferences-0.522
 Regulatory-0.511
Token but
Token ID475
Feature activation-3.84e-01

Pos logit contributions
��+2.967
 Authentication+2.911
oons+2.901
 Preferences+2.861
 Regulatory+2.777
Neg logit contributions
river-3.927
evil-3.771
inav-3.484
thumbnails-3.432
tackle-3.424
Token to
Token ID284
Feature activation+2.90e-01

Pos logit contributions
��+3.436
 Authentication+3.408
oons+3.378
 Preferences+3.372
 Regulatory+3.264
Neg logit contributions
river-4.281
evil-4.170
inav-3.817
tackle-3.809
thumbnails-3.792
Token beat
Token ID4405
Feature activation-1.09e+00

Pos logit contributions
river+2.848
evil+2.774
inav+2.486
tackle+2.430
phia+2.419
Neg logit contributions
��-3.135
 Authentication-2.981
oons-2.950
 Preferences-2.923
illion-2.905
Token a
Token ID257
Feature activation-6.25e-01

Pos logit contributions
 Colleg+8.204
 Prem+8.114
 Tess+8.057
oons+8.039
 Authentication+7.826
Neg logit contributions
river-11.342
evil-11.034
tackle-10.515
alid-10.208
thumbnails-10.127
Token boot
Token ID6297
Feature activation+7.34e-02

Pos logit contributions
��+4.962
oons+4.905
 Colleg+4.904
 Prem+4.850
 Authentication+4.844
Neg logit contributions
river-7.153
evil-6.988
thumbnails-6.560
tackle-6.516
alid-6.308
Tokenc
Token ID66
Feature activation+3.62e-01

Pos logit contributions
river+0.593
evil+0.565
inav+0.499
thumbnails+0.482
tackle+0.482
Neg logit contributions
��-0.936
 Authentication-0.905
oons-0.904
 Preferences-0.897
 Regulatory-0.879
Tokenamped
Token ID13322
Feature activation-5.97e-01

Pos logit contributions
river+3.727
evil+3.493
inav+3.190
tackle+3.104
alid+3.008
Neg logit contributions
��-3.960
 Authentication-3.759
 Preferences-3.735
oons-3.678
 Regulatory-3.603
Token Na
Token ID11013
Feature activation-5.42e-01

Pos logit contributions
 Authentication+5.000
oons+4.995
��+4.951
 Colleg+4.937
 Prem+4.905
Neg logit contributions
river-6.557
evil-6.409
thumbnails-6.046
tackle-5.982
inav-5.798
Token outside
Token ID2354
Feature activation-5.72e-01

Pos logit contributions
��+2.037
oons+1.984
 Authentication+1.983
 Preferences+1.964
 Regulatory+1.910
Neg logit contributions
river-2.447
evil-2.360
inav-2.201
thumbnails-2.158
tackle-2.124
Token of
Token ID286
Feature activation+9.12e-02

Pos logit contributions
��+4.896
oons+4.892
 Preferences+4.839
 Authentication+4.795
elle+4.756
Neg logit contributions
river-6.257
evil-6.094
inav-5.788
thumbnails-5.746
phia-5.641
Token the
Token ID262
Feature activation-4.49e-02

Pos logit contributions
river+0.967
evil+0.931
inav+0.848
thumbnails+0.832
tackle+0.826
Neg logit contributions
��-0.921
oons-0.889
 Authentication-0.889
 Preferences-0.875
illion-0.858
Token maze
Token ID31237
Feature activation-1.80e-01

Pos logit contributions
��+0.428
oons+0.414
 Authentication+0.412
 Preferences+0.407
 Regulatory+0.396
Neg logit contributions
river-0.501
evil-0.483
inav-0.445
thumbnails-0.436
tackle-0.434
Token.
Token ID13
Feature activation-5.73e-01

Pos logit contributions
��+1.832
oons+1.798
 Authentication+1.780
 Preferences+1.765
elle+1.737
Neg logit contributions
river-1.857
evil-1.795
inav-1.662
thumbnails-1.618
tackle-1.606
Token Doors
Token ID33775
Feature activation-1.08e+00

Pos logit contributions
 Preferences+4.563
 Authentication+4.497
oons+4.430
 Regulatory+4.376
 Prem+4.293
Neg logit contributions
river-6.976
evil-6.794
inav-6.502
tackle-6.393
alid-6.179
Token'
Token ID6
Feature activation-2.66e-01

Pos logit contributions
��+8.109
 Authentication+8.088
 Regulatory+7.831
oons+7.824
 Preferences+7.822
Neg logit contributions
river-11.893
evil-11.330
inav-11.104
thumbnails-10.973
tackle-10.611
Token width
Token ID9647
Feature activation-9.98e-02

Pos logit contributions
��+2.604
 Authentication+2.535
 Preferences+2.520
oons+2.514
 Regulatory+2.442
Neg logit contributions
river-2.879
evil-2.755
inav-2.549
thumbnails-2.470
phia-2.454
Token was
Token ID373
Feature activation-2.37e-01

Pos logit contributions
��+1.080
oons+1.050
 Authentication+1.040
 Preferences+1.032
elle+1.008
Neg logit contributions
river-0.987
evil-0.951
inav-0.868
thumbnails-0.845
tackle-0.842
Token set
Token ID900
Feature activation-2.04e-01

Pos logit contributions
��+2.034
 Authentication+1.978
oons+1.977
 Preferences+1.952
 Regulatory+1.903
Neg logit contributions
river-2.796
evil-2.743
inav-2.580
tackle-2.487
thumbnails-2.480
Token to
Token ID284
Feature activation-2.26e-01

Pos logit contributions
��+1.944
oons+1.890
 Authentication+1.863
 Preferences+1.856
namese+1.801
Neg logit contributions
river-2.240
evil-2.175
inav-2.018
tackle-1.964
thumbnails-1.951
Token signed
Token ID4488
Feature activation+2.84e-02

Pos logit contributions
oons+3.984
��+3.964
 Authentication+3.932
illion+3.869
elle+3.828
Neg logit contributions
river-5.053
evil-4.731
thumbnails-4.448
tackle-4.426
inav-4.379
Token,
Token ID11
Feature activation-2.48e-02

Pos logit contributions
river+0.277
evil+0.266
inav+0.242
thumbnails+0.235
tackle+0.234
Neg logit contributions
��-0.311
oons-0.301
 Authentication-0.299
 Preferences-0.296
 Regulatory-0.289
Token and
Token ID290
Feature activation-1.71e-01

Pos logit contributions
��+0.261
oons+0.253
 Authentication+0.251
 Preferences+0.249
illion+0.243
Neg logit contributions
river-0.253
evil-0.243
inav-0.222
thumbnails-0.217
tackle-0.216
Token last
Token ID938
Feature activation-2.48e-01

Pos logit contributions
��+1.602
 Authentication+1.558
oons+1.557
 Preferences+1.542
 Regulatory+1.507
Neg logit contributions
river-1.929
evil-1.852
inav-1.706
thumbnails-1.677
tackle-1.673
Token but
Token ID475
Feature activation-5.14e-01

Pos logit contributions
��+2.186
 Authentication+2.119
oons+2.116
 Preferences+2.084
elle+2.071
Neg logit contributions
river-2.883
evil-2.781
inav-2.606
thumbnails-2.583
tackle-2.522
Token not
Token ID407
Feature activation-1.08e+00

Pos logit contributions
��+4.239
oons+4.069
 Preferences+4.069
 Authentication+4.033
elle+3.977
Neg logit contributions
river-6.029
evil-5.830
inav-5.655
thumbnails-5.374
tackle-5.349
Token least
Token ID1551
Feature activation-2.91e-01

Pos logit contributions
elle+5.898
arest+5.887
 Prem+5.825
 Preferences+5.526
 Tess+5.509
Neg logit contributions
river-13.730
inav-13.713
evil-13.138
tackle-13.061
thumbnails-12.680
Token,
Token ID11
Feature activation+1.12e-01

Pos logit contributions
��+2.823
 Authentication+2.747
oons+2.741
 Preferences+2.721
 Regulatory+2.657
Neg logit contributions
river-3.135
evil-3.017
inav-2.773
thumbnails-2.728
tackle-2.697
Token remaining
Token ID5637
Feature activation+6.67e-02

Pos logit contributions
river+1.185
evil+1.149
inav+1.053
thumbnails+1.027
tackle+1.020
Neg logit contributions
��-1.127
oons-1.085
 Authentication-1.075
 Preferences-1.061
elle-1.042
Token Cors
Token ID26978
Feature activation-2.36e-01

Pos logit contributions
river+0.742
evil+0.716
inav+0.654
thumbnails+0.642
tackle+0.638
Neg logit contributions
��-0.644
oons-0.616
 Authentication-0.614
 Preferences-0.604
illion-0.594
Tokenairs
Token ID3468
Feature activation-1.54e-01

Pos logit contributions
��+2.298
oons+2.292
 Authentication+2.230
illion+2.195
 Preferences+2.188
Neg logit contributions
river-2.530
evil-2.426
inav-2.232
thumbnails-2.211
tackle-2.199
TokenCS
Token ID7902
Feature activation+2.84e-01

Pos logit contributions
river+3.820
evil+3.696
inav+3.327
tackle+3.256
alid+3.256
Neg logit contributions
��-3.273
 Authentication-3.192
 Preferences-3.168
 Regulatory-3.086
oons-3.065
Token3
Token ID18
Feature activation+2.21e-01

Pos logit contributions
river+3.151
evil+3.048
inav+2.761
alid+2.691
tackle+2.688
Neg logit contributions
��-2.741
 Authentication-2.662
 Preferences-2.645
oons-2.579
 Regulatory-2.575
TokenIV
Token ID3824
Feature activation-2.24e-02

Pos logit contributions
river+2.484
evil+2.411
inav+2.186
tackle+2.133
thumbnails+2.130
Neg logit contributions
��-2.094
 Authentication-2.030
 Preferences-2.011
oons-1.983
 Regulatory-1.959
TokenIP
Token ID4061
Feature activation+4.17e-01

Pos logit contributions
��+0.215
oons+0.208
 Authentication+0.207
 Preferences+0.205
illion+0.200
Neg logit contributions
river-0.249
evil-0.240
inav-0.221
thumbnails-0.216
tackle-0.215
Token9
Token ID24
Feature activation+7.71e-02

Pos logit contributions
river+4.621
evil+4.458
inav+4.004
alid+3.947
tre+3.942
Neg logit contributions
��-3.945
 Authentication-3.849
 Preferences-3.839
 Regulatory-3.729
oons-3.670
Tokenib
Token ID571
Feature activation-1.08e+00

Pos logit contributions
river+0.866
evil+0.837
inav+0.766
thumbnails+0.748
tackle+0.747
Neg logit contributions
��-0.735
 Authentication-0.709
oons-0.705
 Preferences-0.701
 Regulatory-0.683
Token -
Token ID532
Feature activation+1.20e-01

Pos logit contributions
 Authentication+6.779
oons+6.625
 Tess+6.394
elle+6.369
 Preferences+6.310
Neg logit contributions
river-12.811
inav-12.709
evil-12.616
thumbnails-11.821
phia-11.787
Token Kirk
Token ID13855
Feature activation-9.09e-02

Pos logit contributions
river+1.422
evil+1.368
inav+1.268
tackle+1.244
thumbnails+1.235
Neg logit contributions
��-1.067
oons-1.017
 Authentication-1.012
 Preferences-1.002
namese-0.982
Token Cousins
Token ID32644
Feature activation-2.53e-01

Pos logit contributions
��+0.792
oons+0.761
 Authentication+0.757
 Preferences+0.747
illion+0.727
Neg logit contributions
river-1.092
evil-1.056
inav-0.979
thumbnails-0.959
tackle-0.956
Token (@
Token ID4275
Feature activation-5.13e-02

Pos logit contributions
��+1.979
oons+1.919
 Authentication+1.910
 Preferences+1.876
illion+1.839
Neg logit contributions
river-3.219
evil-3.132
inav-2.921
thumbnails-2.866
tackle-2.840
TokenK
Token ID42
Feature activation-4.03e-01

Pos logit contributions
��+0.446
oons+0.430
 Authentication+0.426
 Preferences+0.420
illion+0.411
Neg logit contributions
river-0.615
evil-0.596
inav-0.554
thumbnails-0.542
tackle-0.539
Token count
Token ID954
Feature activation-6.48e-02

Pos logit contributions
��+2.053
 Authentication+2.008
oons+1.998
 Preferences+1.988
 Regulatory+1.929
Neg logit contributions
river-2.415
evil-2.348
inav-2.170
tackle-2.111
thumbnails-2.111
Token |
Token ID930
Feature activation-3.78e-01

Pos logit contributions
��+0.571
oons+0.550
 Authentication+0.549
 Preferences+0.541
 Regulatory+0.526
Neg logit contributions
river-0.768
evil-0.744
inav-0.691
thumbnails-0.675
tackle-0.673
Token |
Token ID930
Feature activation-1.90e-01

Pos logit contributions
��+3.076
 Authentication+3.033
 Preferences+3.021
oons+2.974
 Regulatory+2.897
Neg logit contributions
river-4.541
evil-4.436
inav-4.179
tackle-4.064
phia-4.058
Token ----
Token ID13498
Feature activation-5.57e-01

Pos logit contributions
��+1.486
 Authentication+1.438
oons+1.436
 Preferences+1.426
 Regulatory+1.372
Neg logit contributions
river-2.414
evil-2.338
inav-2.189
thumbnails-2.150
tackle-2.150
Token-+
Token ID19529
Feature activation-6.17e-01

Pos logit contributions
��+5.313
 Preferences+5.155
 Authentication+5.137
oons+5.115
elle+4.926
Neg logit contributions
river-5.851
evil-5.596
inav-5.280
thumbnails-5.232
tackle-5.162
Token--------------------------------
Token ID3880
Feature activation-1.05e+00

Pos logit contributions
��+4.562
 Authentication+4.348
 Preferences+4.343
oons+4.290
illion+4.132
Neg logit contributions
river-7.748
evil-7.490
inav-7.175
thumbnails-7.084
tackle-6.920
Token+
Token ID10
Feature activation-8.05e-01

Pos logit contributions
��+7.721
 Preferences+7.531
oons+7.468
 Authentication+7.247
 Prem+6.980
Neg logit contributions
river-11.889
evil-11.405
inav-11.080
thumbnails-10.932
tackle-10.729
Token-------
Token ID26866
Feature activation-6.36e-01

Pos logit contributions
��+6.178
 Preferences+5.970
 Authentication+5.960
oons+5.905
 Regulatory+5.710
Neg logit contributions
river-9.233
evil-8.989
inav-8.657
thumbnails-8.509
alid-8.356
Token|
Token ID91
Feature activation-3.71e-01

Pos logit contributions
��+4.034
oons+3.881
 Authentication+3.843
 Preferences+3.842
 Regulatory+3.654
Neg logit contributions
river-8.405
evil-8.252
inav-7.839
thumbnails-7.790
tackle-7.733
Token |
Token ID930
Feature activation+1.25e-01

Pos logit contributions
��+2.997
 Authentication+2.923
 Preferences+2.902
oons+2.877
illion+2.770
Neg logit contributions
river-4.538
evil-4.414
inav-4.160
thumbnails-4.097
tackle-4.066
Token 1
Token ID352
Feature activation-1.26e-01

Pos logit contributions
river+1.197
evil+1.148
inav+1.037
thumbnails+1.002
tackle+0.995
Neg logit contributions
��-1.391
oons-1.346
 Authentication-1.335
 Preferences-1.311
illion-1.296
Tokenbers
Token ID1213
Feature activation-8.92e-02

Pos logit contributions
oons+2.676
��+2.611
 Preferences+2.560
 Authentication+2.556
elle+2.554
Neg logit contributions
river-2.461
evil-2.378
tackle-2.122
inav-2.122
thumbnails-2.100
Tokenide
Token ID485
Feature activation-9.69e-02

Pos logit contributions
��+0.732
oons+0.715
 Authentication+0.703
 Preferences+0.697
illion+0.683
Neg logit contributions
river-1.098
evil-1.064
inav-0.992
thumbnails-0.978
tackle-0.972
Token and
Token ID290
Feature activation-6.82e-01

Pos logit contributions
��+1.013
oons+0.981
 Authentication+0.981
 Preferences+0.973
 Regulatory+0.949
Neg logit contributions
river-0.989
evil-0.956
inav-0.876
thumbnails-0.852
tackle-0.848
Token we
Token ID356
Feature activation-5.58e-01

Pos logit contributions
 Authentication+5.904
 Preferences+5.743
 Regulatory+5.667
��+5.660
 Prem+5.657
Neg logit contributions
river-7.416
evil-7.317
inav-6.885
phia-6.674
thumbnails-6.667
Token will
Token ID481
Feature activation-6.20e-01

Pos logit contributions
 Authentication+4.991
 Regulatory+4.857
oons+4.839
��+4.838
elle+4.720
Neg logit contributions
river-5.969
evil-5.731
thumbnails-5.523
inav-5.353
phia-5.346
Token take
Token ID1011
Feature activation-1.05e+00

Pos logit contributions
oons+4.888
 Authentication+4.843
 Regulatory+4.769
 Preferences+4.764
namese+4.685
Neg logit contributions
river-7.180
evil-6.911
thumbnails-6.729
inav-6.579
phia-6.413
Token action
Token ID2223
Feature activation-4.98e-01

Pos logit contributions
 Authentication+6.759
 Regulatory+6.449
 Preferences+6.428
 Tess+6.343
��+6.291
Neg logit contributions
river-12.918
evil-12.249
inav-12.026
phia-11.422
tre-11.320
Token against
Token ID1028
Feature activation-6.13e-01

Pos logit contributions
��+4.356
oons+4.339
 Preferences+4.330
 Authentication+4.253
namese+4.219
Neg logit contributions
river-5.447
evil-5.319
inav-5.059
phia-4.947
thumbnails-4.885
Token those
Token ID883
Feature activation-2.88e-01

Pos logit contributions
oons+3.758
��+3.726
 Authentication+3.726
 Preferences+3.630
 Regulatory+3.628
Neg logit contributions
river-8.291
evil-7.829
inav-7.645
phia-7.560
tackle-7.466
Token intent
Token ID6824
Feature activation+2.30e-02

Pos logit contributions
��+2.713
oons+2.690
 Authentication+2.648
 Preferences+2.629
elle+2.576
Neg logit contributions
river-3.151
evil-3.014
inav-2.836
tackle-2.742
thumbnails-2.737
Token on
Token ID319
Feature activation-3.85e-01

Pos logit contributions
river+0.235
evil+0.226
inav+0.206
thumbnails+0.201
tackle+0.200
Neg logit contributions
��-0.243
 Authentication-0.234
oons-0.234
 Preferences-0.232
 Regulatory-0.226
Token to
Token ID284
Feature activation-1.26e-01

Pos logit contributions
river+0.565
evil+0.543
inav+0.499
thumbnails+0.488
tackle+0.486
Neg logit contributions
��-0.523
oons-0.503
 Authentication-0.499
 Preferences-0.493
illion-0.484
Token prevent
Token ID2948
Feature activation-5.17e-01

Pos logit contributions
��+1.244
oons+1.216
 Authentication+1.211
 Preferences+1.196
 Regulatory+1.171
Neg logit contributions
river-1.351
evil-1.297
inav-1.196
thumbnails-1.171
tackle-1.158
Token people
Token ID661
Feature activation-1.86e-01

Pos logit contributions
��+4.605
 Regulatory+4.514
 Authentication+4.503
oons+4.480
 Preferences+4.422
Neg logit contributions
river-5.578
evil-5.513
inav-5.179
thumbnails-5.079
tackle-5.018
Token (
Token ID357
Feature activation-3.47e-01

Pos logit contributions
��+1.810
oons+1.748
 Authentication+1.742
 Preferences+1.722
 Regulatory+1.688
Neg logit contributions
river-1.990
evil-1.940
inav-1.795
thumbnails-1.766
tackle-1.750
TokenW
Token ID54
Feature activation-2.67e-01

Pos logit contributions
��+3.463
oons+3.398
elle+3.289
illion+3.289
 Authentication+3.287
Neg logit contributions
river-3.491
evil-3.367
inav-3.203
thumbnails-3.074
alid-3.073
TokenAB
Token ID6242
Feature activation-1.04e+00

Pos logit contributions
��+2.241
oons+2.237
 Authentication+2.139
illion+2.128
elle+2.116
Neg logit contributions
river-3.200
evil-3.099
inav-2.905
thumbnails-2.873
tackle-2.850
TokenL
Token ID43
Feature activation-6.86e-01

Pos logit contributions
oons+4.526
��+4.429
elle+4.149
illion+3.947
 Authentication+3.926
Neg logit contributions
river-14.772
evil-14.649
inav-14.040
tackle-14.039
phia-13.947
Token members
Token ID1866
Feature activation-4.19e-01

Pos logit contributions
��+6.273
 Authentication+6.270
oons+6.264
 Preferences+6.175
 Regulatory+6.147
Neg logit contributions
river-7.082
evil-7.030
inav-6.454
thumbnails-6.364
tackle-6.331
Token)
Token ID8
Feature activation-1.33e-01

Pos logit contributions
��+4.110
 Authentication+4.003
oons+3.966
 Preferences+3.931
illion+3.858
Neg logit contributions
river-4.325
evil-4.199
inav-3.910
thumbnails-3.909
tackle-3.828
Token from
Token ID422
Feature activation-2.22e-01

Pos logit contributions
��+1.312
oons+1.271
 Authentication+1.265
 Preferences+1.249
 Regulatory+1.222
Neg logit contributions
river-1.418
evil-1.374
inav-1.268
thumbnails-1.239
tackle-1.234
Token just
Token ID655
Feature activation-2.42e-01

Pos logit contributions
��+2.109
oons+2.062
 Authentication+2.044
 Preferences+2.021
 Regulatory+1.992
Neg logit contributions
river-2.429
evil-2.357
inav-2.194
thumbnails-2.148
tackle-2.140
Token But
Token ID887
Feature activation+6.70e-02

Pos logit contributions
��+0.697
oons+0.674
 Authentication+0.673
 Preferences+0.664
 Regulatory+0.646
Neg logit contributions
river-0.958
evil-0.926
inav-0.860
thumbnails-0.843
tackle-0.840
Token Mozilla
Token ID29258
Feature activation+2.90e-01

Pos logit contributions
river+0.709
evil+0.682
inav+0.624
thumbnails+0.609
tackle+0.607
Neg logit contributions
��-0.683
oons-0.656
 Authentication-0.650
 Preferences-0.642
illion-0.630
Token succeeded
Token ID14131
Feature activation-1.19e-01

Pos logit contributions
river+2.731
evil+2.589
inav+2.364
thumbnails+2.292
tackle+2.284
Neg logit contributions
��-3.268
oons-3.102
 Authentication-3.058
 Preferences-3.035
namese-3.014
Token in
Token ID287
Feature activation+1.78e-02

Pos logit contributions
��+1.216
oons+1.187
 Authentication+1.180
 Preferences+1.168
illion+1.145
Neg logit contributions
river-1.238
evil-1.196
inav-1.096
thumbnails-1.076
tackle-1.060
Token breaking
Token ID7163
Feature activation-3.22e-01

Pos logit contributions
river+0.195
evil+0.188
inav+0.173
thumbnails+0.169
tackle+0.168
Neg logit contributions
��-0.174
oons-0.168
 Authentication-0.167
 Preferences-0.165
illion-0.161
Token the
Token ID262
Feature activation-1.04e+00

Pos logit contributions
��+2.823
oons+2.811
 Preferences+2.809
 Authentication+2.797
 Regulatory+2.722
Neg logit contributions
river-3.671
evil-3.535
inav-3.323
tackle-3.224
thumbnails-3.210
Token lock
Token ID5793
Feature activation-7.90e-01

Pos logit contributions
 Preferences+8.495
oons+8.372
 Authentication+8.212
 Regulatory+8.088
 Tess+8.009
Neg logit contributions
river-11.474
evil-10.390
tackle-10.244
inav-9.740
tre-9.644
Token Microsoft
Token ID5413
Feature activation-8.95e-02

Pos logit contributions
oons+6.368
��+6.342
elle+6.293
 Authentication+6.247
 Preferences+6.231
Neg logit contributions
river-8.629
evil-8.367
inav-8.039
thumbnails-7.882
phia-7.789
Token's
Token ID338
Feature activation-1.19e-01

Pos logit contributions
��+0.937
oons+0.913
 Authentication+0.911
 Preferences+0.899
illion+0.882
Neg logit contributions
river-0.912
evil-0.874
inav-0.803
thumbnails-0.780
tackle-0.777
Token Internet
Token ID4455
Feature activation-3.69e-01

Pos logit contributions
��+1.128
 Authentication+1.099
oons+1.095
 Preferences+1.083
 Regulatory+1.057
Neg logit contributions
river-1.328
evil-1.278
inav-1.185
thumbnails-1.156
tackle-1.154
Token Explorer
Token ID19142
Feature activation-1.23e-01

Pos logit contributions
 Authentication+3.698
 Preferences+3.648
��+3.613
oons+3.595
 Regulatory+3.536
Neg logit contributions
river-3.834
evil-3.662
inav-3.441
alid-3.370
thumbnails-3.365
Tokenerson
Token ID882
Feature activation-1.30e-01

Pos logit contributions
river+0.740
evil+0.715
inav+0.662
thumbnails+0.643
tackle+0.642
Neg logit contributions
��-0.590
oons-0.566
 Authentication-0.563
 Preferences-0.556
 Regulatory-0.541
Tokenic
Token ID291
Feature activation+1.94e-01

Pos logit contributions
��+1.180
oons+1.150
 Authentication+1.136
 Preferences+1.125
illion+1.101
Neg logit contributions
river-1.511
evil-1.456
inav-1.353
thumbnails-1.319
tackle-1.319
Token Flight
Token ID13365
Feature activation+2.72e-01

Pos logit contributions
river+2.191
evil+2.102
inav+1.935
thumbnails+1.873
phia+1.872
Neg logit contributions
��-1.837
oons-1.732
 Authentication-1.716
 Preferences-1.696
illion-1.674
Token:
Token ID25
Feature activation-2.32e-01

Pos logit contributions
river+2.863
evil+2.754
inav+2.484
phia+2.421
alid+2.421
Neg logit contributions
��-2.773
oons-2.589
 Authentication-2.586
 Preferences-2.544
illion-2.531
Token Breaking
Token ID24942
Feature activation-2.05e-01

Pos logit contributions
��+1.938
 Authentication+1.904
oons+1.884
 Preferences+1.868
 Regulatory+1.817
Neg logit contributions
river-2.823
evil-2.727
inav-2.545
tackle-2.507
thumbnails-2.499
Token the
Token ID262
Feature activation-1.03e+00

Pos logit contributions
��+1.856
oons+1.828
 Authentication+1.817
 Preferences+1.808
 Regulatory+1.757
Neg logit contributions
river-2.338
evil-2.270
inav-2.110
thumbnails-2.056
tackle-2.053
Token Sound
Token ID9506
Feature activation-4.16e-01

Pos logit contributions
 Preferences+9.103
 Authentication+9.080
 Prem+8.913
 Regulatory+8.897
oons+8.808
Neg logit contributions
river-10.735
evil-10.043
tackle-9.610
thumbnails-9.316
inav-9.304
Token Barrier
Token ID32804
Feature activation-5.35e-01

Pos logit contributions
 Authentication+3.513
��+3.500
 Preferences+3.492
oons+3.462
 Regulatory+3.415
Neg logit contributions
river-4.842
evil-4.736
inav-4.422
thumbnails-4.299
tackle-4.290
Token and
Token ID290
Feature activation+8.41e-03

Pos logit contributions
 Authentication+5.034
oons+4.994
��+4.952
 Preferences+4.938
elle+4.881
Neg logit contributions
river-5.611
evil-5.468
inav-5.111
tackle-5.024
thumbnails-4.990
Token Beyond
Token ID12197
Feature activation-7.10e-02

Pos logit contributions
river+0.101
evil+0.097
inav+0.090
thumbnails+0.088
tackle+0.088
Neg logit contributions
��-0.074
oons-0.071
 Authentication-0.070
 Preferences-0.069
illion-0.068
Token.
Token ID13
Feature activation+3.36e-01

Pos logit contributions
��+0.779
oons+0.757
 Authentication+0.754
 Preferences+0.746
illion+0.730
Neg logit contributions
river-0.691
evil-0.663
inav-0.603
thumbnails-0.588
tackle-0.586
Token money
Token ID1637
Feature activation-3.52e-01

Pos logit contributions
river+0.275
evil+0.264
inav+0.242
thumbnails+0.236
tackle+0.235
Neg logit contributions
��-0.283
oons-0.273
 Authentication-0.272
 Preferences-0.269
illion-0.263
Token for
Token ID329
Feature activation-5.00e-01

Pos logit contributions
oons+3.356
��+3.352
 Preferences+3.324
 Authentication+3.276
illion+3.244
Neg logit contributions
river-3.742
evil-3.639
inav-3.351
thumbnails-3.302
tackle-3.252
Token the
Token ID262
Feature activation+3.97e-02

Pos logit contributions
oons+4.111
 Preferences+4.076
��+4.076
 Authentication+4.048
 Tess+3.914
Neg logit contributions
river-5.967
evil-5.738
inav-5.356
tackle-5.279
thumbnails-5.214
Token likes
Token ID7832
Feature activation-2.94e-01

Pos logit contributions
river+0.444
evil+0.429
inav+0.395
thumbnails+0.387
tackle+0.384
Neg logit contributions
��-0.378
oons-0.362
 Authentication-0.360
 Preferences-0.355
illion-0.349
Token of
Token ID286
Feature activation-6.65e-01

Pos logit contributions
oons+2.708
 Authentication+2.689
��+2.681
 Preferences+2.680
elle+2.653
Neg logit contributions
river-3.195
evil-3.089
inav-2.891
thumbnails-2.810
phia-2.802
Token De
Token ID1024
Feature activation-1.03e+00

Pos logit contributions
��+5.351
 Preferences+5.340
oons+5.307
 Tess+5.293
 Authentication+5.265
Neg logit contributions
river-7.779
evil-7.473
inav-6.932
thumbnails-6.910
tackle-6.774
TokenAndre
Token ID31258
Feature activation-7.47e-01

Pos logit contributions
oons+8.298
��+8.266
elle+8.039
 Tess+7.843
illion+7.764
Neg logit contributions
river-10.940
evil-10.204
thumbnails-10.120
inav-9.682
phia-9.670
Token Jordan
Token ID8078
Feature activation-2.84e-01

Pos logit contributions
 Preferences+4.771
 Regulatory+4.733
��+4.717
elle+4.697
 Authentication+4.694
Neg logit contributions
river-10.115
evil-9.395
thumbnails-9.273
inav-9.257
tackle-8.902
Token or
Token ID393
Feature activation-8.17e-01

Pos logit contributions
��+2.837
oons+2.763
 Authentication+2.716
 Preferences+2.704
elle+2.682
Neg logit contributions
river-2.995
evil-2.914
inav-2.674
thumbnails-2.637
tackle-2.595
Token La
Token ID4689
Feature activation-6.08e-02

Pos logit contributions
 Tess+6.340
 Preferences+6.297
 Authentication+6.280
��+6.137
elle+6.110
Neg logit contributions
river-9.401
evil-9.205
inav-8.596
thumbnails-8.445
tackle-8.217
TokenMarcus
Token ID35110
Feature activation-3.47e-01

Pos logit contributions
��+0.434
oons+0.415
 Authentication+0.412
 Preferences+0.406
elle+0.394
Neg logit contributions
river-0.823
evil-0.799
inav-0.750
thumbnails-0.737
tackle-0.733
Token abyss
Token ID37678
Feature activation+2.11e-01

Pos logit contributions
river+1.774
evil+1.711
inav+1.558
alid+1.503
tackle+1.502
Neg logit contributions
��-1.523
oons-1.467
 Authentication-1.454
 Preferences-1.435
illion-1.426
Token.
Token ID13
Feature activation+5.21e-01

Pos logit contributions
river+2.011
evil+1.933
inav+1.694
alid+1.676
tackle+1.647
Neg logit contributions
��-2.371
 Authentication-2.300
oons-2.275
 Preferences-2.274
namese-2.234
Token The
Token ID383
Feature activation+2.33e-01

Pos logit contributions
river+5.397
evil+5.317
inav+4.642
 Leh+4.636
tre+4.565
Neg logit contributions
��-5.255
namese-5.010
oons-4.902
illion-4.825
ework-4.782
Token others
Token ID1854
Feature activation+2.73e-01

Pos logit contributions
river+2.587
evil+2.525
inav+2.283
tackle+2.211
alid+2.198
Neg logit contributions
��-2.222
oons-2.130
 Authentication-2.098
 Preferences-2.084
namese-2.057
Token escape
Token ID6654
Feature activation+5.71e-01

Pos logit contributions
river+2.586
evil+2.508
inav+2.220
tackle+2.166
alid+2.129
Neg logit contributions
��-3.080
oons-2.973
 Preferences-2.944
 Authentication-2.934
namese-2.882
Token and
Token ID290
Feature activation+7.54e-01

Pos logit contributions
river+5.450
evil+5.171
inav+4.445
 dise+4.306
alid+4.303
Neg logit contributions
��-6.384
oons-6.120
namese-6.080
 Authentication-6.038
 Preferences-5.910
Token find
Token ID1064
Feature activation+2.49e-03

Pos logit contributions
river+7.020
evil+6.966
 Leh+6.062
tackle+6.004
inav+5.987
Neg logit contributions
��-7.849
oons-7.623
 Preferences-7.432
namese-7.407
 Authentication-7.271
Token refuge
Token ID5473
Feature activation+1.96e-01

Pos logit contributions
river+0.028
evil+0.027
inav+0.025
thumbnails+0.025
tackle+0.025
Neg logit contributions
��-0.023
oons-0.022
 Authentication-0.022
 Preferences-0.022
illion-0.021
Token in
Token ID287
Feature activation+1.38e-01

Pos logit contributions
river+1.884
evil+1.801
inav+1.605
phia+1.547
tackle+1.541
Neg logit contributions
��-2.192
 Authentication-2.132
oons-2.127
 Preferences-2.091
 Regulatory-2.068
Token the
Token ID262
Feature activation+1.52e-01

Pos logit contributions
river+1.537
evil+1.481
inav+1.354
phia+1.306
thumbnails+1.302
Neg logit contributions
��-1.320
oons-1.275
 Authentication-1.260
 Preferences-1.240
illion-1.228
Token Elven
Token ID40748
Feature activation+1.93e-02

Pos logit contributions
river+1.803
evil+1.741
inav+1.611
phia+1.556
tackle+1.548
Neg logit contributions
��-1.366
oons-1.309
 Authentication-1.290
 Preferences-1.269
illion-1.259
Token has
Token ID468
Feature activation+1.14e-01

Pos logit contributions
��+0.990
oons+0.967
 Authentication+0.962
 Preferences+0.955
 Regulatory+0.941
Neg logit contributions
river-1.017
evil-0.981
inav-0.898
thumbnails-0.879
tackle-0.875
Token banned
Token ID9301
Feature activation+5.01e-01

Pos logit contributions
river+1.215
evil+1.169
inav+1.075
tackle+1.041
thumbnails+1.040
Neg logit contributions
��-1.153
oons-1.107
 Authentication-1.099
 Preferences-1.084
illion-1.059
Token the
Token ID262
Feature activation+6.16e-01

Pos logit contributions
river+5.097
evil+4.863
inav+4.406
alid+4.276
phia+4.243
Neg logit contributions
��-5.058
oons-4.872
 Authentication-4.869
 Preferences-4.728
elle-4.721
Token use
Token ID779
Feature activation+2.05e-01

Pos logit contributions
river+5.194
evil+5.119
inav+4.553
alid+4.318
phia+4.286
Neg logit contributions
��-7.124
oons-6.819
illion-6.715
 Authentication-6.660
 Preferences-6.612
Token of
Token ID286
Feature activation+5.62e-01

Pos logit contributions
river+1.895
evil+1.819
inav+1.645
thumbnails+1.597
alid+1.577
Neg logit contributions
��-2.335
 Authentication-2.256
oons-2.250
 Preferences-2.220
elle-2.177
Token the
Token ID262
Feature activation+8.45e-01

Pos logit contributions
river+5.778
evil+5.493
inav+5.062
alid+4.963
 dise+4.869
Neg logit contributions
��-5.522
oons-5.442
 Authentication-5.332
illion-5.229
elle-5.183
Token three
Token ID1115
Feature activation+5.42e-01

Pos logit contributions
river+8.263
evil+8.152
inav+7.434
alid+7.171
 dise+7.093
Neg logit contributions
��-8.058
oons-7.789
elle-7.517
 Authentication-7.517
illion-7.503
Token ne
Token ID497
Feature activation-6.45e-02

Pos logit contributions
river+5.551
evil+5.489
inav+4.944
alid+4.745
thumbnails+4.649
Neg logit contributions
��-5.344
oons-5.203
 Authentication-5.149
 Preferences-5.123
namese-4.938
Tokenonic
Token ID9229
Feature activation-1.61e-01

Pos logit contributions
��+0.950
oons+0.934
 Authentication+0.928
 Preferences+0.920
illion+0.912
Neg logit contributions
river-0.377
evil-0.348
inav-0.298
thumbnails-0.288
tackle-0.283
Tokenotin
Token ID41403
Feature activation-7.45e-01

Pos logit contributions
��+1.225
oons+1.172
 Authentication+1.159
 Preferences+1.144
illion+1.117
Neg logit contributions
river-2.107
evil-2.042
inav-1.908
thumbnails-1.877
tackle-1.868
Tokenoid
Token ID1868
Feature activation-4.06e-02

Pos logit contributions
oons+2.510
illion+2.076
 Authentication+2.041
elle+1.974
��+1.905
Neg logit contributions
river-12.443
evil-11.915
tackle-11.616
inav-11.584
thumbnails-11.514
Token he
Token ID339
Feature activation-2.42e-01

Pos logit contributions
��+0.714
oons+0.691
 Authentication+0.687
 Preferences+0.680
illion+0.663
Neg logit contributions
river-0.903
evil-0.873
inav-0.809
thumbnails-0.793
tackle-0.789
Token has
Token ID468
Feature activation-1.86e-01

Pos logit contributions
��+2.375
oons+2.366
 Authentication+2.333
 Preferences+2.307
elle+2.291
Neg logit contributions
river-2.545
evil-2.447
inav-2.264
thumbnails-2.215
tackle-2.212
Token accepted
Token ID6292
Feature activation+4.61e-01

Pos logit contributions
��+1.880
 Authentication+1.832
oons+1.832
 Preferences+1.817
elle+1.781
Neg logit contributions
river-1.922
evil-1.857
inav-1.710
thumbnails-1.676
tackle-1.670
Token Tal
Token ID7193
Feature activation+4.70e-01

Pos logit contributions
river+4.629
evil+4.366
inav+3.954
 Leh+3.917
thumbnails+3.849
Neg logit contributions
��-4.828
oons-4.647
 Authentication-4.518
 Preferences-4.466
namese-4.450
Tokenul
Token ID377
Feature activation+5.85e-01

Pos logit contributions
river+5.736
evil+5.550
inav+5.155
alid+5.115
thumbnails+5.021
Neg logit contributions
��-4.146
 Authentication-3.773
 Preferences-3.717
���-3.570
namese-3.567
Tokenah
Token ID993
Feature activation+7.79e-01

Pos logit contributions
river+4.681
evil+4.580
inav+4.003
alid+3.844
tre+3.727
Neg logit contributions
��-7.412
���-7.105
 Preferences-7.033
 Authentication-6.995
oons-6.905
Token back
Token ID736
Feature activation+2.26e-01

Pos logit contributions
river+5.461
evil+4.602
 Leh+4.370
phia+4.237
alid+4.028
Neg logit contributions
oons-9.871
��-9.678
namese-9.430
 Authentication-9.386
 Preferences-9.212
Token into
Token ID656
Feature activation+1.26e-01

Pos logit contributions
river+2.235
evil+2.113
inav+1.884
thumbnails+1.847
phia+1.842
Neg logit contributions
��-2.485
 Authentication-2.386
oons-2.362
 Preferences-2.333
 Regulatory-2.300
Token his
Token ID465
Feature activation-2.01e-01

Pos logit contributions
river+1.398
evil+1.358
inav+1.246
thumbnails+1.218
tackle+1.200
Neg logit contributions
��-1.222
oons-1.171
 Authentication-1.161
 Preferences-1.143
illion-1.123
Token life
Token ID1204
Feature activation-1.08e-01

Pos logit contributions
��+1.814
oons+1.784
 Authentication+1.768
 Preferences+1.752
 Regulatory+1.703
Neg logit contributions
river-2.321
evil-2.223
inav-2.057
thumbnails-2.022
tackle-2.022
Token.
Token ID13
Feature activation+1.27e-02

Pos logit contributions
��+1.165
oons+1.137
 Authentication+1.128
 Preferences+1.122
elle+1.098
Neg logit contributions
river-1.063
evil-1.030
inav-0.937
thumbnails-0.920
tackle-0.910
Token of
Token ID286
Feature activation+1.10e-01

Pos logit contributions
��+3.103
oons+3.035
 Authentication+2.963
 Preferences+2.959
elle+2.902
Neg logit contributions
river-3.374
evil-3.276
inav-3.003
thumbnails-2.996
tackle-2.971
Token the
Token ID262
Feature activation+1.37e-01

Pos logit contributions
river+1.218
evil+1.170
inav+1.085
thumbnails+1.045
phia+1.039
Neg logit contributions
��-1.066
oons-1.022
 Authentication-1.017
 Preferences-1.002
illion-0.990
Token Korea
Token ID4969
Feature activation+3.20e-01

Pos logit contributions
river+1.571
evil+1.523
inav+1.416
thumbnails+1.369
phia+1.355
Neg logit contributions
��-1.259
oons-1.198
 Authentication-1.189
 Preferences-1.174
illion-1.158
Token Institute
Token ID5136
Feature activation-8.55e-02

Pos logit contributions
river+3.543
evil+3.380
inav+3.140
thumbnails+3.022
phia+3.000
Neg logit contributions
��-3.083
oons-2.896
 Authentication-2.881
 Preferences-2.833
illion-2.814
Token of
Token ID286
Feature activation+8.06e-02

Pos logit contributions
��+0.886
oons+0.864
 Authentication+0.859
 Preferences+0.851
 Regulatory+0.834
Neg logit contributions
river-0.881
evil-0.848
inav-0.776
thumbnails-0.762
tackle-0.759
Token S
Token ID311
Feature activation+1.07e+00

Pos logit contributions
river+0.956
evil+0.925
inav+0.856
thumbnails+0.836
alid+0.824
Neg logit contributions
��-0.718
oons-0.684
 Authentication-0.673
 Preferences-0.664
illion-0.656
Token&
Token ID5
Feature activation-5.76e-02

Pos logit contributions
river+9.689
evil+9.676
inav+8.493
ivable+8.331
tre+8.234
Neg logit contributions
��-10.367
 Authentication-10.267
theless-9.893
 Tess-9.744
 Preferences-9.698
TokenT
Token ID51
Feature activation-2.08e-01

Pos logit contributions
��+0.525
oons+0.506
 Authentication+0.504
 Preferences+0.497
illion+0.485
Neg logit contributions
river-0.667
evil-0.644
inav-0.596
thumbnails-0.584
tackle-0.581
Token Evaluation
Token ID34959
Feature activation+3.84e-02

Pos logit contributions
��+2.057
oons+2.020
 Authentication+2.007
 Preferences+1.986
 Regulatory+1.948
Neg logit contributions
river-2.231
evil-2.146
inav-1.972
thumbnails-1.956
tackle-1.947
Token and
Token ID290
Feature activation-8.36e-02

Pos logit contributions
river+0.377
evil+0.361
inav+0.329
thumbnails+0.319
tackle+0.318
Neg logit contributions
��-0.419
oons-0.405
 Authentication-0.403
 Preferences-0.398
illion-0.391
Token Planning
Token ID21913
Feature activation+1.06e-01

Pos logit contributions
��+0.743
 Authentication+0.725
oons+0.720
 Preferences+0.715
 Regulatory+0.699
Neg logit contributions
river-0.981
evil-0.951
inav-0.880
thumbnails-0.865
tackle-0.862
Token future
Token ID2003
Feature activation+3.32e-02

Pos logit contributions
��+0.170
oons+0.164
 Authentication+0.163
 Preferences+0.161
illion+0.157
Neg logit contributions
river-0.199
evil-0.192
inav-0.177
thumbnails-0.174
tackle-0.173
Token for
Token ID329
Feature activation+3.06e-02

Pos logit contributions
river+0.343
evil+0.330
inav+0.301
tackle+0.292
thumbnails+0.292
Neg logit contributions
��-0.347
oons-0.335
 Authentication-0.334
 Preferences-0.330
 Regulatory-0.322
Token Tol
Token ID20054
Feature activation-4.73e-02

Pos logit contributions
river+0.355
evil+0.345
inav+0.318
thumbnails+0.310
tackle+0.309
Neg logit contributions
��-0.277
oons-0.267
 Authentication-0.265
 Preferences-0.262
illion-0.255
Tokenui
Token ID9019
Feature activation+2.17e-01

Pos logit contributions
��+0.485
oons+0.471
 Authentication+0.468
 Preferences+0.462
illion+0.454
Neg logit contributions
river-0.493
evil-0.474
inav-0.435
thumbnails-0.426
tackle-0.422
Token's
Token ID338
Feature activation+1.51e-01

Pos logit contributions
river+2.155
evil+2.057
inav+1.854
phia+1.794
alid+1.788
Neg logit contributions
��-2.329
 Authentication-2.271
oons-2.267
 Preferences-2.241
 Regulatory-2.192
Token eldest
Token ID34331
Feature activation+8.72e-01

Pos logit contributions
river+1.481
evil+1.423
inav+1.290
phia+1.232
alid+1.230
Neg logit contributions
��-1.648
oons-1.597
 Authentication-1.585
 Preferences-1.570
illion-1.541
Token son
Token ID3367
Feature activation+1.91e-01

Pos logit contributions
river+5.677
evil+5.212
inav+4.174
 Leh+4.096
tre+4.072
Neg logit contributions
��-11.275
 Authentication-11.096
oons-10.958
 Preferences-10.879
namese-10.838
Token and
Token ID290
Feature activation+1.50e-01

Pos logit contributions
river+1.978
evil+1.900
inav+1.735
alid+1.672
tackle+1.672
Neg logit contributions
��-1.949
oons-1.896
 Authentication-1.890
 Preferences-1.861
namese-1.827
Token bestowed
Token ID40293
Feature activation-1.66e-01

Pos logit contributions
river+1.592
evil+1.532
inav+1.401
tackle+1.354
thumbnails+1.348
Neg logit contributions
��-1.528
oons-1.485
 Authentication-1.473
 Preferences-1.450
 Regulatory-1.419
Token on
Token ID319
Feature activation+4.30e-02

Pos logit contributions
��+1.537
oons+1.487
 Authentication+1.479
 Preferences+1.468
 Regulatory+1.430
Neg logit contributions
river-1.857
evil-1.803
inav-1.667
thumbnails-1.656
tackle-1.639
Token the
Token ID262
Feature activation+5.06e-01

Pos logit contributions
river+0.492
evil+0.475
inav+0.437
thumbnails+0.426
tackle+0.426
Neg logit contributions
��-0.400
oons-0.386
 Authentication-0.384
 Preferences-0.379
illion-0.369
Token the
Token ID262
Feature activation-7.38e-02

Pos logit contributions
river+1.613
evil+1.551
inav+1.427
thumbnails+1.389
tackle+1.381
Neg logit contributions
��-1.461
oons-1.414
 Authentication-1.390
 Preferences-1.367
illion-1.362
Token tests
Token ID5254
Feature activation+1.88e-02

Pos logit contributions
��+0.711
oons+0.687
 Authentication+0.685
 Preferences+0.677
 Regulatory+0.660
Neg logit contributions
river-0.817
evil-0.788
inav-0.725
thumbnails-0.710
tackle-0.708
Token.
Token ID13
Feature activation-9.68e-02

Pos logit contributions
river+0.183
evil+0.176
inav+0.160
thumbnails+0.155
tackle+0.155
Neg logit contributions
��-0.206
oons-0.199
 Authentication-0.199
 Preferences-0.196
illion-0.192
Token
Token ID198
Feature activation-2.05e-01

Pos logit contributions
��+0.912
oons+0.881
 Authentication+0.880
 Preferences+0.870
 Regulatory+0.847
Neg logit contributions
river-1.091
evil-1.052
inav-0.972
thumbnails-0.951
tackle-0.947
Token
Token ID198
Feature activation-9.39e-02

Pos logit contributions
��+1.948
oons+1.888
 Authentication+1.885
 Preferences+1.870
elle+1.829
Neg logit contributions
river-2.250
evil-2.172
inav-2.009
thumbnails-1.966
tackle-1.961
TokenAll
Token ID3237
Feature activation+1.49e-01

Pos logit contributions
��+0.907
oons+0.877
 Authentication+0.870
 Preferences+0.861
illion+0.843
Neg logit contributions
river-1.032
evil-0.995
inav-0.920
thumbnails-0.897
tackle-0.892
Token three
Token ID1115
Feature activation+1.87e-01

Pos logit contributions
river+1.546
evil+1.484
inav+1.365
thumbnails+1.333
tackle+1.317
Neg logit contributions
��-1.534
oons-1.490
 Authentication-1.475
 Preferences-1.450
illion-1.441
Token lenses
Token ID18405
Feature activation+4.39e-02

Pos logit contributions
river+1.935
evil+1.857
inav+1.692
thumbnails+1.651
tackle+1.639
Neg logit contributions
��-1.922
oons-1.849
 Authentication-1.837
 Preferences-1.803
elle-1.784
Token,
Token ID11
Feature activation+3.08e-01

Pos logit contributions
river+0.437
evil+0.421
inav+0.383
thumbnails+0.372
tackle+0.372
Neg logit contributions
��-0.471
oons-0.456
 Authentication-0.454
 Preferences-0.449
illion-0.439
Token at
Token ID379
Feature activation+3.51e-02

Pos logit contributions
river+3.168
evil+3.056
inav+2.776
thumbnails+2.747
tackle+2.729
Neg logit contributions
��-3.133
oons-2.996
 Authentication-2.975
 Preferences-2.912
illion-2.905
Token both
Token ID1111
Feature activation+1.58e-01

Pos logit contributions
river+0.380
evil+0.366
inav+0.337
thumbnails+0.329
tackle+0.327
Neg logit contributions
��-0.347
oons-0.335
 Authentication-0.333
 Preferences-0.328
illion-0.321
Token cl
Token ID537
Feature activation+8.59e-02

Pos logit contributions
river+1.130
evil+1.086
inav+0.995
thumbnails+0.977
tackle+0.971
Neg logit contributions
��-0.971
oons-0.939
 Authentication-0.927
 Preferences-0.910
illion-0.896
Tokenojure
Token ID32511
Feature activation-1.56e-01

Pos logit contributions
river+0.881
evil+0.845
inav+0.771
tackle+0.743
thumbnails+0.742
Neg logit contributions
��-0.914
 Authentication-0.877
oons-0.870
 Preferences-0.866
 Regulatory-0.852
Token.
Token ID13
Feature activation-3.03e-01

Pos logit contributions
��+1.688
 Authentication+1.640
oons+1.635
 Preferences+1.623
illion+1.590
Neg logit contributions
river-1.515
evil-1.470
inav-1.344
thumbnails-1.313
tackle-1.308
Tokenl
Token ID75
Feature activation+1.98e-01

Pos logit contributions
��+2.772
oons+2.671
 Preferences+2.658
 Authentication+2.645
illion+2.603
Neg logit contributions
river-3.384
evil-3.288
inav-3.103
phia-2.988
thumbnails-2.968
Tokenibraries
Token ID11127
Feature activation-8.14e-02

Pos logit contributions
river+2.474
evil+2.374
inav+2.192
thumbnails+2.161
tackle+2.144
Neg logit contributions
��-1.684
 Authentication-1.578
 Preferences-1.558
oons-1.550
 Regulatory-1.531
Token table
Token ID3084
Feature activation+1.91e-01

Pos logit contributions
��+0.768
oons+0.741
 Authentication+0.740
 Preferences+0.731
illion+0.713
Neg logit contributions
river-0.914
evil-0.884
inav-0.817
thumbnails-0.798
tackle-0.794
Token.
Token ID13
Feature activation-1.56e-01

Pos logit contributions
river+1.803
evil+1.692
inav+1.526
thumbnails+1.489
tackle+1.482
Neg logit contributions
��-2.163
oons-2.095
 Authentication-2.063
 Preferences-2.043
 Regulatory-2.021
Token
Token ID198
Feature activation+1.93e-01

Pos logit contributions
��+1.450
 Authentication+1.410
oons+1.406
 Preferences+1.394
 Regulatory+1.352
Neg logit contributions
river-1.756
evil-1.695
inav-1.572
thumbnails-1.536
tackle-1.536
Token
Token ID198
Feature activation+4.13e-02

Pos logit contributions
river+1.888
evil+1.811
inav+1.621
thumbnails+1.589
tackle+1.575
Neg logit contributions
��-2.175
oons-2.037
 Authentication-2.025
 Preferences-2.004
namese-1.988
Token1
Token ID16
Feature activation+2.10e-01

Pos logit contributions
river+0.434
evil+0.418
inav+0.380
thumbnails+0.373
tackle+0.371
Neg logit contributions
��-0.424
oons-0.411
 Authentication-0.409
 Preferences-0.404
 Regulatory-0.396
Token 2
Token ID362
Feature activation+1.78e-01

Pos logit contributions
river+1.832
evil+1.764
inav+1.536
thumbnails+1.529
alid+1.507
Neg logit contributions
��-2.496
oons-2.448
 Authentication-2.416
 Preferences-2.365
illion-2.355
Token.
Token ID13
Feature activation+3.54e-02

Pos logit contributions
river+2.946
evil+2.767
inav+2.535
phia+2.495
thumbnails+2.477
Neg logit contributions
��-3.161
 Authentication-3.010
oons-3.003
 Preferences-2.985
namese-2.917
TokenCom
Token ID5377
Feature activation+1.36e-02

Pos logit contributions
river+0.456
evil+0.442
inav+0.411
thumbnails+0.405
tackle+0.402
Neg logit contributions
��-0.278
oons-0.265
 Authentication-0.263
 Preferences-0.258
illion-0.252
Token,
Token ID11
Feature activation+2.30e-01

Pos logit contributions
river+0.132
evil+0.126
inav+0.115
thumbnails+0.112
tackle+0.111
Neg logit contributions
��-0.151
oons-0.146
 Authentication-0.145
 Preferences-0.143
illion-0.140
Token San
Token ID2986
Feature activation+2.92e-01

Pos logit contributions
river+2.613
evil+2.479
inav+2.301
thumbnails+2.240
phia+2.218
Neg logit contributions
��-2.163
oons-2.071
 Authentication-2.019
namese-1.990
 Preferences-1.989
Token Antonio
Token ID11366
Feature activation-4.47e-02

Pos logit contributions
river+2.424
evil+2.249
inav+1.986
thumbnails+1.940
phia+1.924
Neg logit contributions
��-3.654
oons-3.519
 Authentication-3.502
 Preferences-3.463
 Regulatory-3.404
Token Woman
Token ID14381
Feature activation+2.57e-01

Pos logit contributions
��+0.275
oons+0.261
 Authentication+0.259
 Preferences+0.254
illion+0.244
Neg logit contributions
river-0.649
evil-0.632
inav-0.594
thumbnails-0.585
tackle-0.583
Token,
Token ID11
Feature activation+9.18e-02

Pos logit contributions
river+2.566
evil+2.431
inav+2.199
thumbnails+2.176
phia+2.174
Neg logit contributions
��-2.739
oons-2.641
 Authentication-2.607
 Preferences-2.582
namese-2.551
Token Gam
Token ID14014
Feature activation-1.69e-01

Pos logit contributions
river+1.049
evil+1.004
inav+0.929
thumbnails+0.906
tackle+0.901
Neg logit contributions
��-0.860
oons-0.828
 Authentication-0.816
 Preferences-0.805
illion-0.793
Tokenbit
Token ID2545
Feature activation-2.12e-02

Pos logit contributions
��+1.515
oons+1.484
 Authentication+1.457
elle+1.439
illion+1.433
Neg logit contributions
river-1.954
evil-1.884
inav-1.739
thumbnails-1.713
tackle-1.705
Token Weekly
Token ID18168
Feature activation+2.21e-01

Pos logit contributions
��+0.213
oons+0.206
 Authentication+0.205
 Preferences+0.203
illion+0.198
Neg logit contributions
river-0.227
evil-0.218
inav-0.200
thumbnails-0.196
tackle-0.195
Token of
Token ID286
Feature activation-3.40e-01

Pos logit contributions
river+2.042
evil+1.932
inav+1.762
phia+1.709
thumbnails+1.703
Neg logit contributions
��-2.517
oons-2.419
 Authentication-2.405
 Preferences-2.362
namese-2.345
Token tickets
Token ID8587
Feature activation-2.98e-01

Pos logit contributions
��+0.591
oons+0.573
 Authentication+0.570
 Preferences+0.563
illion+0.550
Neg logit contributions
river-0.663
evil-0.640
inav-0.590
thumbnails-0.577
tackle-0.573
Token purchased
Token ID8155
Feature activation+5.31e-03

Pos logit contributions
��+2.901
oons+2.807
 Authentication+2.804
 Preferences+2.763
illion+2.754
Neg logit contributions
river-3.162
evil-3.066
inav-2.884
tackle-2.834
thumbnails-2.826
Token before
Token ID878
Feature activation-1.86e-01

Pos logit contributions
river+0.057
evil+0.055
inav+0.050
thumbnails+0.049
tackle+0.049
Neg logit contributions
��-0.053
oons-0.052
 Authentication-0.051
 Preferences-0.051
illion-0.049
Token the
Token ID262
Feature activation-3.72e-01

Pos logit contributions
��+1.657
 Authentication+1.612
oons+1.609
 Preferences+1.576
 Regulatory+1.546
Neg logit contributions
river-2.159
evil-2.109
inav-1.943
thumbnails-1.932
tackle-1.897
Token bombings
Token ID26579
Feature activation-2.79e-01

Pos logit contributions
��+3.675
 Authentication+3.662
oons+3.617
 Preferences+3.555
 Regulatory+3.507
Neg logit contributions
river-3.895
evil-3.704
thumbnails-3.415
inav-3.396
tackle-3.361
Token.
Token ID13
Feature activation+2.24e-01

Pos logit contributions
��+2.830
oons+2.756
 Authentication+2.739
 Preferences+2.691
illion+2.683
Neg logit contributions
river-2.869
evil-2.756
inav-2.566
thumbnails-2.526
tackle-2.491
Token He
Token ID679
Feature activation+9.27e-02

Pos logit contributions
river+2.564
evil+2.475
inav+2.277
thumbnails+2.244
 Leh+2.218
Neg logit contributions
��-2.090
oons-1.969
namese-1.919
illion-1.908
 Authentication-1.878
Token said
Token ID531
Feature activation+3.82e-01

Pos logit contributions
river+0.897
evil+0.859
inav+0.784
thumbnails+0.758
tackle+0.753
Neg logit contributions
��-1.029
oons-0.988
 Authentication-0.984
 Preferences-0.977
 Regulatory-0.955
Token an
Token ID281
Feature activation-9.70e-02

Pos logit contributions
river+3.846
evil+3.661
inav+3.304
 Leh+3.291
thumbnails+3.268
Neg logit contributions
��-4.001
oons-3.806
 Preferences-3.750
 Authentication-3.732
illion-3.717
Token FBI
Token ID5349
Feature activation-2.68e-01

Pos logit contributions
��+1.066
oons+1.037
 Authentication+1.036
 Preferences+1.021
elle+1.001
Neg logit contributions
river-0.939
evil-0.900
inav-0.820
thumbnails-0.800
tackle-0.797
Token agent
Token ID5797
Feature activation-2.08e-01

Pos logit contributions
��+2.005
 Authentication+1.977
oons+1.929
 Preferences+1.916
elle+1.884
Neg logit contributions
river-3.447
evil-3.389
thumbnails-3.120
inav-3.119
tackle-3.085
Token have
Token ID423
Feature activation+3.71e-02

Pos logit contributions
river+2.122
evil+2.055
inav+1.838
thumbnails+1.812
tackle+1.812
Neg logit contributions
��-2.191
oons-2.072
 Authentication-2.063
 Preferences-2.031
namese-2.013
Token B
Token ID347
Feature activation-9.03e-02

Pos logit contributions
river+0.421
evil+0.406
inav+0.374
thumbnails+0.367
tackle+0.365
Neg logit contributions
��-0.348
oons-0.335
 Authentication-0.332
 Preferences-0.328
illion-0.321
Tokeniz
Token ID528
Feature activation-1.84e-01

Pos logit contributions
��+0.798
oons+0.779
 Authentication+0.764
 Preferences+0.753
illion+0.746
Neg logit contributions
river-1.063
evil-1.027
inav-0.955
thumbnails-0.936
tackle-0.933
Tokenarro
Token ID34852
Feature activation-2.29e-01

Pos logit contributions
��+0.902
oons+0.872
 Authentication+0.845
 Preferences+0.819
illion+0.808
Neg logit contributions
river-2.859
evil-2.794
inav-2.672
phia-2.613
thumbnails-2.611
Token Green
Token ID3469
Feature activation+4.90e-02

Pos logit contributions
��+2.219
 Authentication+2.158
oons+2.157
 Preferences+2.137
illion+2.086
Neg logit contributions
river-2.469
evil-2.384
inav-2.242
thumbnails-2.156
phia-2.136
Token Arrow
Token ID19408
Feature activation+1.65e-01

Pos logit contributions
river+0.526
evil+0.507
inav+0.462
thumbnails+0.452
tackle+0.452
Neg logit contributions
��-0.490
oons-0.473
 Authentication-0.471
 Preferences-0.466
namese-0.456
Token in
Token ID287
Feature activation-7.79e-02

Pos logit contributions
river+1.645
evil+1.571
inav+1.415
tackle+1.394
thumbnails+1.388
Neg logit contributions
��-1.798
oons-1.722
 Authentication-1.705
 Preferences-1.689
namese-1.674
Token the
Token ID262
Feature activation-3.20e-01

Pos logit contributions
��+0.776
oons+0.752
 Authentication+0.749
 Preferences+0.742
illion+0.722
Neg logit contributions
river-0.834
evil-0.804
inav-0.742
thumbnails-0.721
tackle-0.718
Token radius
Token ID16874
Feature activation+3.19e-01

Pos logit contributions
 Authentication+2.824
��+2.818
oons+2.798
 Preferences+2.792
 Regulatory+2.678
Neg logit contributions
river-3.661
evil-3.524
inav-3.322
phia-3.191
tackle-3.182
Token,
Token ID11
Feature activation-2.11e-01

Pos logit contributions
river+2.994
evil+2.804
inav+2.482
tackle+2.458
thumbnails+2.444
Neg logit contributions
��-3.650
 Authentication-3.464
oons-3.444
 Preferences-3.384
namese-3.382
Token which
Token ID543
Feature activation-1.27e-01

Pos logit contributions
��+2.070
 Authentication+2.043
oons+2.036
 Preferences+2.024
 Regulatory+1.971
Neg logit contributions
river-2.220
evil-2.146
inav-1.993
thumbnails-1.924
phia-1.924
Token really
Token ID1107
Feature activation+1.92e-01

Pos logit contributions
river+1.765
evil+1.695
inav+1.549
thumbnails+1.509
tackle+1.489
Neg logit contributions
��-1.832
oons-1.754
 Authentication-1.746
 Preferences-1.723
illion-1.689
Token something
Token ID1223
Feature activation-5.64e-02

Pos logit contributions
river+1.920
evil+1.838
inav+1.684
thumbnails+1.638
tackle+1.617
Neg logit contributions
��-2.079
oons-1.996
 Authentication-1.980
 Preferences-1.958
illion-1.921
Token we
Token ID356
Feature activation+2.31e-01

Pos logit contributions
��+0.571
oons+0.555
 Authentication+0.552
 Preferences+0.547
illion+0.534
Neg logit contributions
river-0.594
evil-0.572
inav-0.525
thumbnails-0.514
tackle-0.512
Token can
Token ID460
Feature activation+4.70e-01

Pos logit contributions
river+2.232
evil+2.174
inav+1.980
thumbnails+1.883
tackle+1.880
Neg logit contributions
��-2.574
 Authentication-2.425
oons-2.424
 Preferences-2.407
namese-2.339
Token look
Token ID804
Feature activation+1.53e-02

Pos logit contributions
river+4.205
evil+4.133
inav+3.728
tackle+3.629
alid+3.556
Neg logit contributions
��-5.577
 Preferences-5.048
 Authentication-5.028
oons-5.026
 Regulatory-4.910
Token at
Token ID379
Feature activation-2.12e-01

Pos logit contributions
river+0.167
evil+0.161
inav+0.148
thumbnails+0.143
tackle+0.143
Neg logit contributions
��-0.150
 Authentication-0.144
oons-0.144
 Preferences-0.142
 Regulatory-0.139
Token for
Token ID329
Feature activation+1.68e-01

Pos logit contributions
��+2.110
 Authentication+2.065
oons+2.063
 Preferences+2.048
illion+2.003
Neg logit contributions
river-2.212
evil-2.155
thumbnails-1.954
inav-1.946
phia-1.921
Token holding
Token ID4769
Feature activation+1.77e-01

Pos logit contributions
river+1.779
evil+1.727
inav+1.594
thumbnails+1.540
tackle+1.534
Neg logit contributions
��-1.683
oons-1.617
 Authentication-1.599
 Preferences-1.578
elle-1.558
Token EV
Token ID8696
Feature activation-2.34e-02

Pos logit contributions
river+1.863
evil+1.791
inav+1.650
thumbnails+1.589
tackle+1.581
Neg logit contributions
��-1.836
oons-1.749
 Authentication-1.738
 Preferences-1.709
illion-1.680
TokenO
Token ID46
Feature activation+4.19e-01

Pos logit contributions
��+0.234
oons+0.227
 Authentication+0.226
 Preferences+0.223
illion+0.218
Neg logit contributions
river-0.250
evil-0.240
inav-0.221
thumbnails-0.216
tackle-0.216
Token in
Token ID287
Feature activation+3.28e-01

Pos logit contributions
river+4.019
evil+3.869
inav+3.519
phia+3.368
thumbnails+3.365
Neg logit contributions
��-4.570
oons-4.380
 Authentication-4.342
elle-4.250
 Preferences-4.209
Token made
Token ID925
Feature activation+1.55e-01

Pos logit contributions
��+0.185
oons+0.179
 Authentication+0.178
 Preferences+0.176
illion+0.172
Neg logit contributions
river-0.197
evil-0.190
inav-0.174
thumbnails-0.171
tackle-0.170
Token major
Token ID1688
Feature activation+2.05e-01

Pos logit contributions
river+1.625
evil+1.551
inav+1.428
tackle+1.391
thumbnails+1.385
Neg logit contributions
��-1.591
oons-1.525
 Authentication-1.517
 Preferences-1.490
illion-1.464
Token investments
Token ID11115
Feature activation+2.79e-02

Pos logit contributions
river+1.931
evil+1.829
inav+1.679
thumbnails+1.625
tackle+1.605
Neg logit contributions
��-2.322
oons-2.226
 Authentication-2.222
 Preferences-2.174
namese-2.152
Token in
Token ID287
Feature activation+2.66e-01

Pos logit contributions
river+0.277
evil+0.265
inav+0.242
thumbnails+0.235
tackle+0.235
Neg logit contributions
��-0.302
oons-0.292
 Authentication-0.291
 Preferences-0.288
illion-0.282
Token basic
Token ID4096
Feature activation-6.68e-02

Pos logit contributions
river+2.875
evil+2.761
inav+2.554
thumbnails+2.471
tackle+2.461
Neg logit contributions
��-2.644
oons-2.481
 Authentication-2.450
 Preferences-2.433
illion-2.421
Token science
Token ID3783
Feature activation-2.50e-01

Pos logit contributions
��+0.540
oons+0.523
 Authentication+0.521
 Preferences+0.513
 Regulatory+0.500
Neg logit contributions
river-0.838
evil-0.813
inav-0.756
thumbnails-0.743
tackle-0.741
Token,
Token ID11
Feature activation+2.83e-01

Pos logit contributions
��+2.427
oons+2.412
 Authentication+2.380
 Preferences+2.375
 Regulatory+2.320
Neg logit contributions
river-2.639
evil-2.583
inav-2.351
thumbnails-2.344
phia-2.298
Token too
Token ID1165
Feature activation-2.37e-01

Pos logit contributions
river+3.029
evil+2.901
inav+2.658
thumbnails+2.612
tackle+2.598
Neg logit contributions
��-2.887
oons-2.692
 Authentication-2.665
 Preferences-2.659
namese-2.620
Token.
Token ID13
Feature activation+8.18e-02

Pos logit contributions
��+2.460
oons+2.371
 Authentication+2.345
 Preferences+2.341
illion+2.298
Neg logit contributions
river-2.403
evil-2.325
inav-2.128
thumbnails-2.125
tackle-2.088
Token
Token ID198
Feature activation-1.57e-01

Pos logit contributions
river+0.876
evil+0.843
inav+0.777
thumbnails+0.751
alid+0.744
Neg logit contributions
��-0.825
oons-0.785
 Authentication-0.778
 Preferences-0.763
namese-0.761
Token
Token ID198
Feature activation+1.23e-01

Pos logit contributions
oons+1.408
 Authentication+1.400
��+1.392
 Preferences+1.381
elle+1.364
Neg logit contributions
river-1.779
evil-1.718
inav-1.611
thumbnails-1.568
tackle-1.565
Token for
Token ID329
Feature activation+1.26e-02

Pos logit contributions
river+1.175
evil+1.126
inav+1.015
thumbnails+0.990
tackle+0.986
Neg logit contributions
��-1.327
 Authentication-1.275
oons-1.272
 Preferences-1.262
 Regulatory-1.231
Token women
Token ID1466
Feature activation+4.76e-01

Pos logit contributions
river+0.135
evil+0.130
inav+0.119
thumbnails+0.117
tackle+0.116
Neg logit contributions
��-0.127
oons-0.123
 Authentication-0.122
 Preferences-0.121
illion-0.118
Token,
Token ID11
Feature activation+2.03e-01

Pos logit contributions
river+4.297
evil+4.065
inav+3.574
thumbnails+3.502
tackle+3.472
Neg logit contributions
��-5.418
oons-5.215
 Authentication-5.196
 Preferences-5.131
 Regulatory-5.049
Token but
Token ID475
Feature activation+1.19e-01

Pos logit contributions
river+2.068
evil+2.016
inav+1.809
thumbnails+1.780
tackle+1.759
Neg logit contributions
��-2.136
oons-2.046
 Authentication-2.036
 Preferences-2.016
illion-1.975
Token according
Token ID1864
Feature activation-1.03e-01

Pos logit contributions
river+1.269
evil+1.222
inav+1.118
thumbnails+1.086
tackle+1.085
Neg logit contributions
��-1.197
oons-1.153
 Authentication-1.138
 Preferences-1.128
illion-1.108
Token to
Token ID284
Feature activation+9.49e-02

Pos logit contributions
��+1.013
 Authentication+0.992
oons+0.989
 Preferences+0.978
 Regulatory+0.958
Neg logit contributions
river-1.101
evil-1.062
inav-0.984
tackle-0.954
thumbnails-0.954
Token amd
Token ID39971
Feature activation-6.34e-02

Pos logit contributions
river+1.052
evil+1.020
inav+0.936
thumbnails+0.909
tackle+0.904
Neg logit contributions
��-0.910
oons-0.880
 Authentication-0.866
 Preferences-0.857
illion-0.848
Tokennews
Token ID10827
Feature activation-1.28e-01

Pos logit contributions
��+0.623
oons+0.602
 Authentication+0.599
 Preferences+0.592
illion+0.578
Neg logit contributions
river-0.690
evil-0.666
inav-0.612
thumbnails-0.599
tackle-0.596
Token,
Token ID11
Feature activation-1.52e-01

Pos logit contributions
��+1.359
oons+1.323
 Authentication+1.312
 Preferences+1.298
elle+1.275
Neg logit contributions
river-1.283
evil-1.233
inav-1.128
thumbnails-1.100
tackle-1.095
Token neither
Token ID6159
Feature activation-1.14e-01

Pos logit contributions
��+1.411
oons+1.361
 Authentication+1.360
 Preferences+1.339
 Regulatory+1.313
Neg logit contributions
river-1.726
evil-1.669
inav-1.542
thumbnails-1.515
tackle-1.509
Token the
Token ID262
Feature activation+1.59e-01

Pos logit contributions
��+1.039
oons+1.007
 Authentication+1.006
 Preferences+0.991
 Regulatory+0.971
Neg logit contributions
river-1.316
evil-1.271
inav-1.180
thumbnails-1.155
tackle-1.150
Token the
Token ID262
Feature activation+2.22e-01

Pos logit contributions
river+3.365
evil+3.258
inav+2.872
thumbnails+2.814
alid+2.801
Neg logit contributions
��-3.732
 Authentication-3.617
oons-3.601
 Preferences-3.575
 Regulatory-3.523
Token country
Token ID1499
Feature activation+3.32e-01

Pos logit contributions
river+2.219
evil+2.142
inav+1.950
thumbnails+1.901
phia+1.889
Neg logit contributions
��-2.349
oons-2.270
 Authentication-2.247
 Preferences-2.218
namese-2.198
Token can
Token ID460
Feature activation+2.87e-01

Pos logit contributions
river+3.183
evil+3.044
inav+2.751
thumbnails+2.652
tackle+2.647
Neg logit contributions
��-3.650
 Authentication-3.510
oons-3.469
 Regulatory-3.423
 Preferences-3.422
Token still
Token ID991
Feature activation+4.69e-01

Pos logit contributions
river+2.793
evil+2.718
inav+2.448
tackle+2.359
alid+2.339
Neg logit contributions
��-3.183
 Authentication-3.015
oons-2.979
 Preferences-2.972
 Regulatory-2.946
Token be
Token ID307
Feature activation+4.22e-01

Pos logit contributions
river+4.415
evil+4.307
inav+3.840
tackle+3.707
tre+3.699
Neg logit contributions
��-5.278
 Authentication-4.946
oons-4.878
 Preferences-4.851
 Regulatory-4.822
Token molded
Token ID46725
Feature activation-1.05e-01

Pos logit contributions
river+4.168
evil+4.066
inav+3.560
thumbnails+3.538
 dise+3.440
Neg logit contributions
��-4.518
oons-4.340
 Preferences-4.285
 Authentication-4.282
 Regulatory-4.163
Token into
Token ID656
Feature activation+1.86e-01

Pos logit contributions
��+0.952
oons+0.928
 Authentication+0.920
 Preferences+0.915
illion+0.890
Neg logit contributions
river-1.196
evil-1.159
inav-1.078
thumbnails-1.055
tackle-1.052
Token a
Token ID257
Feature activation+7.17e-02

Pos logit contributions
river+2.017
evil+1.948
inav+1.780
thumbnails+1.755
tackle+1.710
Neg logit contributions
��-1.851
oons-1.760
 Authentication-1.748
 Preferences-1.712
elle-1.693
Token sharp
Token ID7786
Feature activation+1.48e-01

Pos logit contributions
river+0.797
evil+0.767
inav+0.706
thumbnails+0.693
tackle+0.686
Neg logit contributions
��-0.692
oons-0.666
 Authentication-0.662
 Preferences-0.651
elle-0.637
Token campaign
Token ID1923
Feature activation+1.13e-01

Pos logit contributions
river+1.694
evil+1.628
inav+1.498
thumbnails+1.468
tackle+1.461
Neg logit contributions
��-1.366
 Authentication-1.318
oons-1.315
 Preferences-1.288
namese-1.280
Token issue
Token ID2071
Feature activation-1.88e-02

Pos logit contributions
river+1.216
evil+1.157
inav+1.059
thumbnails+1.036
tackle+1.034
Neg logit contributions
��-1.136
 Authentication-1.085
oons-1.083
 Preferences-1.065
elle-1.046
Token that
Token ID326
Feature activation+1.55e-01

Pos logit contributions
river+3.886
evil+3.689
inav+3.390
tackle+3.356
alid+3.306
Neg logit contributions
��-3.781
oons-3.663
 Authentication-3.606
 Preferences-3.563
namese-3.518
Token time
Token ID640
Feature activation+2.08e-01

Pos logit contributions
river+1.774
evil+1.703
inav+1.554
tackle+1.542
thumbnails+1.532
Neg logit contributions
��-1.446
oons-1.381
 Authentication-1.363
 Preferences-1.345
namese-1.320
Token [
Token ID685
Feature activation-2.34e-01

Pos logit contributions
river+2.028
evil+1.947
inav+1.737
tackle+1.716
thumbnails+1.710
Neg logit contributions
��-2.267
oons-2.168
 Authentication-2.158
 Preferences-2.133
namese-2.108
Token10
Token ID940
Feature activation-2.82e-01

Pos logit contributions
��+2.350
oons+2.331
 Authentication+2.303
 Preferences+2.269
elle+2.246
Neg logit contributions
river-2.399
evil-2.301
inav-2.174
phia-2.098
tackle-2.050
Token:
Token ID25
Feature activation-9.26e-02

Pos logit contributions
��+2.668
oons+2.638
 Authentication+2.598
 Preferences+2.581
illion+2.523
Neg logit contributions
river-3.125
evil-2.971
inav-2.768
phia-2.698
thumbnails-2.696
Token47
Token ID2857
Feature activation-2.53e-01

Pos logit contributions
��+0.941
oons+0.922
 Authentication+0.918
 Preferences+0.906
elle+0.889
Neg logit contributions
river-0.959
evil-0.924
inav-0.856
thumbnails-0.828
tackle-0.825
Token]
Token ID60
Feature activation-2.53e-01

Pos logit contributions
��+2.596
oons+2.542
 Authentication+2.538
 Preferences+2.497
elle+2.447
Neg logit contributions
river-2.587
evil-2.482
inav-2.295
thumbnails-2.243
phia-2.224
Token <
Token ID1279
Feature activation+1.03e-01

Pos logit contributions
��+2.350
 Authentication+2.316
oons+2.307
 Preferences+2.269
 Regulatory+2.216
Neg logit contributions
river-2.851
evil-2.718
inav-2.541
tackle-2.473
phia-2.463
Tokenmet
Token ID4164
Feature activation-1.00e-01

Pos logit contributions
river+1.137
evil+1.106
inav+1.004
thumbnails+0.984
tackle+0.981
Neg logit contributions
��-1.008
 Authentication-0.976
oons-0.966
 Preferences-0.961
 Regulatory-0.938
Tokenros
Token ID4951
Feature activation+2.88e-02

Pos logit contributions
��+1.024
oons+1.009
 Authentication+0.992
illion+0.977
elle+0.977
Neg logit contributions
river-1.032
evil-0.985
inav-0.907
thumbnails-0.892
tackle-0.879
Tokenxual
Token ID5541
Feature activation+2.02e-01

Pos logit contributions
river+0.312
evil+0.301
inav+0.275
thumbnails+0.269
tackle+0.269
Neg logit contributions
��-0.285
 Authentication-0.275
oons-0.274
 Preferences-0.272
 Regulatory-0.265
Token.
Token ID13
Feature activation+1.80e-01

Pos logit contributions
��+1.204
oons+1.174
 Authentication+1.163
 Preferences+1.157
elle+1.133
Neg logit contributions
river-1.135
evil-1.092
inav-1.000
thumbnails-0.977
tackle-0.969
Token To
Token ID1675
Feature activation+1.22e-01

Pos logit contributions
river+1.956
evil+1.891
inav+1.729
thumbnails+1.697
tackle+1.670
Neg logit contributions
��-1.767
oons-1.677
 Authentication-1.650
namese-1.641
illion-1.629
Token make
Token ID787
Feature activation-1.50e-01

Pos logit contributions
river+1.280
evil+1.247
inav+1.132
tackle+1.113
thumbnails+1.106
Neg logit contributions
��-1.238
 Authentication-1.180
oons-1.178
 Preferences-1.163
illion-1.143
Token a
Token ID257
Feature activation-2.69e-01

Pos logit contributions
��+1.424
oons+1.393
 Authentication+1.388
 Preferences+1.376
illion+1.332
Neg logit contributions
river-1.655
evil-1.599
inav-1.477
thumbnails-1.438
phia-1.430
Token long
Token ID890
Feature activation+1.71e-01

Pos logit contributions
��+2.611
oons+2.529
 Preferences+2.518
 Authentication+2.510
namese+2.445
Neg logit contributions
river-2.906
evil-2.821
inav-2.578
thumbnails-2.564
tackle-2.504
Token story
Token ID1621
Feature activation-7.50e-01

Pos logit contributions
river+2.169
evil+2.079
inav+1.938
tackle+1.894
thumbnails+1.865
Neg logit contributions
��-1.386
oons-1.323
 Authentication-1.319
 Preferences-1.297
illion-1.267
Token short
Token ID1790
Feature activation+3.28e-02

Pos logit contributions
��+3.220
 Authentication+3.052
 Regulatory+3.048
 Prem+2.971
 Preferences+2.948
Neg logit contributions
river-11.198
evil-11.082
inav-10.748
tackle-10.547
phia-10.373
Token I
Token ID314
Feature activation+1.96e-01

Pos logit contributions
river+0.327
evil+0.313
inav+0.285
thumbnails+0.278
tackle+0.277
Neg logit contributions
��-0.354
oons-0.342
 Authentication-0.341
 Preferences-0.337
illion-0.330
Token got
Token ID1392
Feature activation+3.05e-02

Pos logit contributions
river+1.859
evil+1.785
inav+1.614
thumbnails+1.555
phia+1.551
Neg logit contributions
��-2.248
oons-2.125
 Authentication-2.108
 Preferences-2.090
namese-2.064
Token this
Token ID428
Feature activation+2.25e-01

Pos logit contributions
river+0.329
evil+0.318
inav+0.292
thumbnails+0.285
tackle+0.284
Neg logit contributions
��-0.303
oons-0.292
 Authentication-0.290
 Preferences-0.286
illion-0.281
Token while
Token ID981
Feature activation+4.70e-02

Pos logit contributions
river+2.360
evil+2.253
inav+2.073
tackle+1.989
phia+1.977
Neg logit contributions
��-2.343
oons-2.213
 Authentication-2.200
 Preferences-2.143
namese-2.123
Token just
Token ID655
Feature activation+4.54e-02

Pos logit contributions
river+1.548
evil+1.497
inav+1.355
tackle+1.323
phia+1.313
Neg logit contributions
��-1.579
 Authentication-1.496
oons-1.488
 Preferences-1.482
namese-1.441
Token p
Token ID279
Feature activation+1.70e-01

Pos logit contributions
river+0.479
evil+0.461
inav+0.422
thumbnails+0.411
tackle+0.410
Neg logit contributions
��-0.465
oons-0.447
 Authentication-0.446
 Preferences-0.441
illion-0.430
Tokenore
Token ID382
Feature activation+2.22e-01

Pos logit contributions
river+1.738
evil+1.673
inav+1.511
tackle+1.479
thumbnails+1.468
Neg logit contributions
��-1.819
 Authentication-1.747
 Preferences-1.727
oons-1.724
 Regulatory-1.685
Token over
Token ID625
Feature activation-7.22e-02

Pos logit contributions
river+2.365
evil+2.274
inav+2.033
thumbnails+2.001
tackle+1.992
Neg logit contributions
��-2.280
 Authentication-2.176
oons-2.144
 Preferences-2.112
illion-2.082
Token the
Token ID262
Feature activation-2.42e-01

Pos logit contributions
��+0.675
oons+0.655
 Authentication+0.650
 Preferences+0.644
 Regulatory+0.630
Neg logit contributions
river-0.812
evil-0.788
inav-0.729
tackle-0.710
thumbnails-0.710
Token real
Token ID1103
Feature activation-6.62e-01

Pos logit contributions
��+2.224
oons+2.199
 Authentication+2.173
 Preferences+2.172
 Regulatory+2.139
Neg logit contributions
river-2.703
evil-2.621
inav-2.438
tackle-2.384
thumbnails-2.354
Token estate
Token ID7964
Feature activation-7.89e-02

Pos logit contributions
oons+5.719
��+5.545
 Regulatory+5.514
illion+5.502
 Preferences+5.478
Neg logit contributions
river-7.125
evil-6.807
inav-6.548
tackle-6.416
alid-6.241
Token ads
Token ID9011
Feature activation-2.26e-01

Pos logit contributions
��+0.802
oons+0.786
 Authentication+0.778
 Preferences+0.774
 Regulatory+0.757
Neg logit contributions
river-0.816
evil-0.790
inav-0.728
tackle-0.709
thumbnails-0.706
Token looking
Token ID2045
Feature activation+4.83e-01

Pos logit contributions
��+2.250
oons+2.239
 Authentication+2.180
 Preferences+2.175
illion+2.157
Neg logit contributions
river-2.329
evil-2.274
inav-2.095
tackle-2.066
thumbnails-2.059
Token at
Token ID379
Feature activation+1.55e-02

Pos logit contributions
river+4.230
evil+4.046
inav+3.491
phia+3.373
 dise+3.345
Neg logit contributions
��-5.723
 Authentication-5.538
oons-5.405
 Preferences-5.392
 Regulatory-5.383
Token ranch
Token ID28752
Feature activation-6.75e-01

Pos logit contributions
river+0.172
evil+0.166
inav+0.153
thumbnails+0.150
tackle+0.149
Neg logit contributions
��-0.150
oons-0.144
 Authentication-0.144
 Preferences-0.142
illion-0.139
Token provided
Token ID2810
Feature activation-7.08e-02

Pos logit contributions
��+4.735
 Authentication+4.644
elle+4.589
 Preferences+4.569
oons+4.561
Neg logit contributions
river-6.940
evil-6.690
inav-6.447
thumbnails-6.268
phia-6.235
Token for
Token ID329
Feature activation+8.18e-02

Pos logit contributions
��+0.752
oons+0.728
 Authentication+0.726
 Preferences+0.718
illion+0.703
Neg logit contributions
river-0.714
evil-0.686
inav-0.629
thumbnails-0.613
tackle-0.609
Token,
Token ID11
Feature activation+8.48e-02

Pos logit contributions
river+0.781
evil+0.749
inav+0.678
thumbnails+0.657
tackle+0.656
Neg logit contributions
��-0.912
oons-0.887
 Authentication-0.879
 Preferences-0.866
namese-0.855
Token and
Token ID290
Feature activation-3.64e-01

Pos logit contributions
river+0.841
evil+0.804
inav+0.728
thumbnails+0.717
tackle+0.711
Neg logit contributions
��-0.915
oons-0.886
 Authentication-0.875
 Preferences-0.863
illion-0.856
Token which
Token ID543
Feature activation-3.89e-01

Pos logit contributions
��+3.261
 Authentication+3.215
 Preferences+3.214
oons+3.188
 Regulatory+3.116
Neg logit contributions
river-4.126
evil-4.032
inav-3.773
phia-3.692
tackle-3.660
Token shall
Token ID2236
Feature activation-7.82e-01

Pos logit contributions
 Authentication+3.701
��+3.662
 Preferences+3.642
oons+3.574
 Regulatory+3.557
Neg logit contributions
river-4.273
evil-4.072
inav-3.811
thumbnails-3.724
tackle-3.693
Token be
Token ID307
Feature activation-2.59e-01

Pos logit contributions
 Authentication+6.194
 Preferences+6.170
oons+6.113
 Regulatory+6.112
 Prem+5.990
Neg logit contributions
river-9.051
evil-8.597
thumbnails-8.156
inav-8.109
phia-8.032
Token established
Token ID4920
Feature activation-7.65e-02

Pos logit contributions
��+2.388
 Authentication+2.337
oons+2.315
 Preferences+2.301
 Regulatory+2.259
Neg logit contributions
river-2.937
evil-2.814
inav-2.647
tackle-2.571
thumbnails-2.569
Token by
Token ID416
Feature activation-4.71e-01

Pos logit contributions
��+0.740
oons+0.716
 Authentication+0.714
 Preferences+0.705
 Regulatory+0.688
Neg logit contributions
river-0.842
evil-0.813
inav-0.750
thumbnails-0.732
tackle-0.730
Token Law
Token ID3854
Feature activation+2.26e-01

Pos logit contributions
��+3.543
 Authentication+3.541
 Regulatory+3.408
 Preferences+3.392
oons+3.368
Neg logit contributions
river-6.083
evil-5.831
inav-5.573
tackle-5.507
thumbnails-5.397
Token
Token ID1399
Feature activation-1.24e-01

Pos logit contributions
river+2.084
evil+1.982
inav+1.773
alid+1.731
thumbnails+1.717
Neg logit contributions
��-2.565
 Authentication-2.480
oons-2.478
 Preferences-2.430
namese-2.423
Token see
Token ID766
Feature activation-2.11e-01

Pos logit contributions
river+1.186
evil+1.151
inav+1.043
tackle+1.016
thumbnails+1.009
Neg logit contributions
��-1.300
oons-1.244
 Authentication-1.241
 Preferences-1.234
illion-1.200
Token that
Token ID326
Feature activation-4.61e-01

Pos logit contributions
��+1.946
oons+1.898
 Authentication+1.882
 Preferences+1.846
elle+1.822
Neg logit contributions
river-2.381
evil-2.302
inav-2.121
thumbnails-2.090
tackle-2.088
Token at
Token ID379
Feature activation+2.98e-01

Pos logit contributions
��+4.208
oons+4.190
 Authentication+4.167
illion+4.150
elle+4.134
Neg logit contributions
river-4.875
evil-4.759
inav-4.395
thumbnails-4.367
alid-4.344
Token this
Token ID428
Feature activation+3.76e-01

Pos logit contributions
river+3.067
evil+2.928
inav+2.713
thumbnails+2.614
tackle+2.610
Neg logit contributions
��-3.032
oons-2.926
 Authentication-2.925
 Preferences-2.894
namese-2.831
Token point
Token ID966
Feature activation+8.54e-02

Pos logit contributions
river+4.367
evil+4.190
thumbnails+3.847
inav+3.846
tackle+3.740
Neg logit contributions
��-3.405
 Authentication-3.190
oons-3.179
 Preferences-3.105
namese-3.022
Token in
Token ID287
Feature activation-5.80e-01

Pos logit contributions
river+0.813
evil+0.777
inav+0.706
thumbnails+0.687
tackle+0.681
Neg logit contributions
��-0.956
oons-0.926
 Authentication-0.922
 Preferences-0.911
namese-0.891
Token her
Token ID607
Feature activation-1.88e-01

Pos logit contributions
��+4.438
oons+4.397
elle+4.315
 Authentication+4.285
illion+4.267
Neg logit contributions
river-6.980
evil-6.690
inav-6.363
tackle-6.330
alid-6.287
Token very
Token ID845
Feature activation-8.77e-02

Pos logit contributions
��+1.421
oons+1.379
 Authentication+1.364
 Preferences+1.353
 Regulatory+1.318
Neg logit contributions
river-2.421
evil-2.352
inav-2.200
thumbnails-2.158
tackle-2.157
Token lengthy
Token ID16452
Feature activation+1.04e-01

Pos logit contributions
��+0.817
oons+0.791
 Authentication+0.785
 Preferences+0.776
illion+0.760
Neg logit contributions
river-0.993
evil-0.960
inav-0.888
thumbnails-0.867
tackle-0.866
Token career
Token ID3451
Feature activation-1.73e-01

Pos logit contributions
river+1.284
evil+1.239
inav+1.151
thumbnails+1.127
tackle+1.124
Neg logit contributions
��-0.880
oons-0.840
 Authentication-0.836
 Preferences-0.820
illion-0.800
Token she
Token ID673
Feature activation+9.82e-03

Pos logit contributions
��+1.793
oons+1.746
 Authentication+1.726
 Preferences+1.713
elle+1.694
Neg logit contributions
river-1.747
evil-1.707
inav-1.547
thumbnails-1.524
tackle-1.516
Token monument
Token ID17757
Feature activation-2.27e-01

Pos logit contributions
��+0.113
oons+0.109
 Authentication+0.109
 Preferences+0.108
illion+0.105
Neg logit contributions
river-0.131
evil-0.126
inav-0.116
thumbnails-0.113
tackle-0.113
Token honoring
Token ID38276
Feature activation+7.02e-02

Pos logit contributions
��+2.143
oons+2.089
 Authentication+2.081
illion+2.027
elle+2.026
Neg logit contributions
river-2.508
evil-2.414
inav-2.258
thumbnails-2.203
tackle-2.187
Token an
Token ID281
Feature activation+1.25e-02

Pos logit contributions
river+0.759
evil+0.732
inav+0.671
thumbnails+0.651
tackle+0.650
Neg logit contributions
��-0.696
oons-0.672
 Authentication-0.668
 Preferences-0.661
illion-0.647
Token African
Token ID5510
Feature activation+2.45e-01

Pos logit contributions
river+0.134
evil+0.129
inav+0.119
thumbnails+0.116
tackle+0.116
Neg logit contributions
��-0.124
oons-0.120
 Authentication-0.120
 Preferences-0.118
illion-0.116
Token American
Token ID1605
Feature activation+1.03e-01

Pos logit contributions
river+2.142
evil+2.013
inav+1.811
phia+1.740
tackle+1.727
Neg logit contributions
��-2.941
oons-2.891
 Preferences-2.826
 Authentication-2.825
 Regulatory-2.778
Token or
Token ID393
Feature activation-6.71e-01

Pos logit contributions
river+1.137
evil+1.090
inav+1.001
thumbnails+0.975
tackle+0.974
Neg logit contributions
��-1.004
oons-0.970
 Authentication-0.959
 Preferences-0.951
 Regulatory-0.929
Token a
Token ID257
Feature activation-3.35e-01

Pos logit contributions
��+5.633
elle+5.445
 Authentication+5.396
namese+5.394
oons+5.355
Neg logit contributions
river-7.552
evil-7.359
tackle-6.960
inav-6.838
thumbnails-6.819
Token woman
Token ID2415
Feature activation+1.84e-01

Pos logit contributions
��+2.971
 Authentication+2.880
 Preferences+2.871
oons+2.865
elle+2.829
Neg logit contributions
river-3.845
evil-3.752
tackle-3.445
thumbnails-3.439
inav-3.438
Token to
Token ID284
Feature activation-3.18e-01

Pos logit contributions
river+1.807
evil+1.726
inav+1.554
tackle+1.511
phia+1.509
Neg logit contributions
��-2.011
oons-1.953
 Authentication-1.935
 Preferences-1.920
 Regulatory-1.882
Token be
Token ID307
Feature activation+1.95e-02

Pos logit contributions
��+2.917
oons+2.837
 Authentication+2.811
 Preferences+2.783
illion+2.746
Neg logit contributions
river-3.604
evil-3.455
inav-3.212
tackle-3.159
thumbnails-3.132
Token installed
Token ID6589
Feature activation-7.36e-02

Pos logit contributions
river+0.197
evil+0.189
inav+0.173
thumbnails+0.169
tackle+0.168
Neg logit contributions
��-0.207
oons-0.201
 Authentication-0.200
 Preferences-0.197
illion-0.193