TOP ACTIVATIONS
MAX = 18.493

B . C . Oil and Gas Commission .
x . x . x . 130 120 45 25 x
his privately held Chief Oil & Gas continues to drill and
x . x . x . 140 10 59 272 x
x . x . x . 110 11 190 34 x
capacity : https :// bit co int alk . org /
fulfill that in oil - and gas - related matters .
prepar atory step in oil and gas exploration , consultation with
defenses away from home in Green Bay , Kansas City and
, says physicist Y annis Sem ert z idis
with flagship design of Lightning series , R 78 70 Hawk
x . x . x . 138 11 30 277 x
. H app conc urs : " The semi
're sorry , but something went wrong Be the first to
conduct seismic testing for oil and gas in the north but
x . x . x . 160 100 10 94 x
game , joining Joe Smith .[ 110 ] The performance earned
into law limiting the use or purchase of guns the NRA
x . x . x . 141 112 08 80 x
x . x . x . 120 10 25 290 x

MIN ACTIVATIONS
MIN = -15.319

verb , the story of Am non and Tam ar ,
included a parody of a Pit bull and Ke $ ha
scenes after a blaze engulfed 1 , 800 hectares of forest
and to continue paying my tit hes to the church and
Service at 1 - 800 - 8 76 - 45 00
story of Am non and Tam ar , which is explicitly
to un earth a few barg ains along the way .
for the United Nations Office for the Coord ination of Human
of our old Pont iac Bon ne ville . Sally Ye
offers for The New York Times 's products and services .
United States . He met his girlfriend here , and talks
ran za s che c kered history at the
soccer scholarship , Brown paid his own way on the Louisville
spirit . Contact us at editors @ time .
Bar ack Obama is the worst ne ­ go ­ ti
, you can do A OE damage , you can buff
A Novel ( New York : St . Martin
f , which features Josh Em ett , Mark Wall bank
Online news team ? Email us at tips @ the -
for The New York Times 's products and services . Thank

INTERVAL 10.040 - 18.493
CONTAINS 0.072%

hand . So what can Dick do to get around that
that 9 / 11 was an inside job , that the
with Pro pe ller Blade Technology and two 8 mm Super
on the cost of oil and a faulty presumption that renewable
academic departments and a National Science Foundation Engineering Research Center .

INTERVAL 1.587 - 10.040
CONTAINS 27.614%

work like his version , but I m still
of speed ) and Rally ( for maximum strength ). Ch
Ferry because no one else studying and re - stud ying
- 0 and I was captain of the side and was
- t une their graphics cards . Comp eting with flagship

INTERVAL -6.866 - 1.587
CONTAINS 71.464%

representing density to a grid simpl ifies computing spatial derivatives .
s not . This process could be visual ized kind of
cards both online and in - store . With
Mar quez McC ool , 49 Brend a McC
. curfew . It 's all been worth it

INTERVAL -15.319 - -6.866
CONTAINS 0.849%

hours . Hundreds of thousands of Europeans were left without gas
shirt on fire , the Brad enton Herald reported .
Xbox 360 and the " Xbox 720 ," according to the
fueling speculation that a deal was imminent . Reports confirmed that
butter : Bl end , adding more almond milk
Token B
Token ID347
Feature activation+4.09e+00

Pos logit contributions
hyde+1.681
\.+1.645
netflix+1.555
ergus+1.522
holding+1.521
Neg logit contributions
iga-1.929
 Drac-1.881
neys-1.712
 sqor-1.686
ngth-1.676
Token.
Token ID13
Feature activation+4.33e+00

Pos logit contributions
\.+2.658
hyde+2.652
ooth+2.433
ergus+2.418
netflix+2.416
Neg logit contributions
 Drac-3.653
iga-3.634
 sqor-3.273
neys-3.270
ngth-3.249
TokenC
Token ID34
Feature activation+4.46e+00

Pos logit contributions
\.+1.864
hyde+1.730
holding+1.564
netflix+1.551
ergus+1.493
Neg logit contributions
iga-4.846
 Drac-4.778
ngth-4.635
 sqor-4.502
neys-4.472
Token.
Token ID13
Feature activation+4.92e+00

Pos logit contributions
hyde+3.053
\.+2.997
netflix+2.784
ergus+2.755
holding+2.727
Neg logit contributions
iga-3.880
 Drac-3.800
neys-3.472
 sqor-3.418
ngth-3.387
Token Oil
Token ID11474
Feature activation+2.89e+00

Pos logit contributions
hyde+3.610
\.+3.523
netflix+3.280
ergus+3.272
holding+3.249
Neg logit contributions
iga-4.033
 Drac-3.940
neys-3.595
 sqor-3.542
ngth-3.509
Token and
Token ID290
Feature activation+1.85e+01

Pos logit contributions
hyde+2.027
\.+1.978
netflix+1.870
ergus+1.821
holding+1.819
Neg logit contributions
iga-2.499
 Drac-2.438
neys-2.223
 sqor-2.187
ngth-2.172
Token Gas
Token ID14345
Feature activation-9.16e-01

Pos logit contributions
\.+5.763
hyde+5.542
holding+4.915
ergus+4.567
netflix+4.351
Neg logit contributions
ngth-19.070
iga-18.734
neys-18.561
 Drac-18.296
 sqor-18.018
Token Commission
Token ID4513
Feature activation-2.08e+00

Pos logit contributions
iga+0.710
 Drac+0.690
neys+0.617
 sqor+0.596
ngth+0.592
Neg logit contributions
hyde-0.728
\.-0.715
netflix-0.682
ergus-0.660
holding-0.659
Token.
Token ID13
Feature activation+1.68e+00

Pos logit contributions
iga+1.713
 Drac+1.663
neys+1.504
 sqor+1.485
ngth+1.479
Neg logit contributions
hyde-1.520
\.-1.478
netflix-1.429
ergus-1.384
ooth-1.381
Token
Token ID198
Feature activation+1.20e+00

Pos logit contributions
hyde+1.177
\.+1.150
netflix+1.087
ergus+1.060
holding+1.059
Neg logit contributions
iga-1.443
 Drac-1.408
neys-1.284
 sqor-1.265
ngth-1.257
Token
Token ID198
Feature activation+2.68e+00

Pos logit contributions
\.+0.805
hyde+0.796
netflix+0.742
holding+0.740
ooth+0.720
Neg logit contributions
iga-1.062
 Drac-1.047
ngth-0.973
 sqor-0.957
neys-0.953
Token x
Token ID2124
Feature activation+4.07e-01

Pos logit contributions
iga+3.597
 Drac+3.517
neys+3.077
 sqor+3.019
 WS+2.907
Neg logit contributions
hyde-4.397
\.-4.192
netflix-4.088
ergus-4.029
holding-3.961
Token.
Token ID13
Feature activation+3.58e+00

Pos logit contributions
\.+0.276
hyde+0.275
netflix+0.255
holding+0.251
ergus+0.248
Neg logit contributions
iga-0.356
 Drac-0.351
neys-0.320
ngth-0.320
 sqor-0.318
Tokenx
Token ID87
Feature activation+2.21e+00

Pos logit contributions
\.+1.951
hyde+1.886
holding+1.746
netflix+1.736
ooth+1.691
Neg logit contributions
 Drac-3.613
iga-3.555
ngth-3.366
 sqor-3.346
neys-3.282
Token.
Token ID13
Feature activation+4.37e+00

Pos logit contributions
\.+1.457
hyde+1.409
netflix+1.315
holding+1.306
ooth+1.279
Neg logit contributions
iga-1.985
 Drac-1.973
ngth-1.836
 sqor-1.810
neys-1.798
Tokenx
Token ID87
Feature activation+7.75e+00

Pos logit contributions
\.+2.339
hyde+2.231
holding+2.078
netflix+2.045
ooth+1.993
Neg logit contributions
 Drac-4.459
iga-4.367
ngth-4.172
 sqor-4.155
neys-4.051
Token.
Token ID13
Feature activation+1.82e+01

Pos logit contributions
\.+4.529
hyde+3.961
holding+3.908
netflix+3.708
ooth+3.599
Neg logit contributions
 Drac-7.449
ngth-7.253
iga-7.208
 sqor-7.003
neys-6.816
Token130
Token ID12952
Feature activation-1.33e-01

Pos logit contributions
\.+6.932
holding+6.737
hyde+6.045
mouth+5.310
netflix+5.096
Neg logit contributions
 Drac-18.245
ngth-17.440
neys-16.903
iga-16.889
 sqor-16.825
Token 120
Token ID7982
Feature activation-1.43e+00

Pos logit contributions
iga+0.112
 Drac+0.110
neys+0.099
 sqor+0.098
ngth+0.097
Neg logit contributions
hyde-0.097
\.-0.094
netflix-0.089
ergus-0.087
holding-0.087
Token45
Token ID2231
Feature activation+3.17e+00

Pos logit contributions
iga+1.138
 Drac+1.106
neys+0.998
 sqor+0.976
ngth+0.960
Neg logit contributions
hyde-1.107
\.-1.071
netflix-1.027
ergus-1.008
ooth-1.000
Token25
Token ID1495
Feature activation-4.30e+00

Pos logit contributions
\.+2.178
hyde+2.131
netflix+1.992
holding+1.981
ooth+1.940
Neg logit contributions
iga-2.719
 Drac-2.673
ngth-2.487
 sqor-2.453
neys-2.442
Token x
Token ID2124
Feature activation+1.09e+00

Pos logit contributions
iga+3.090
 Drac+3.014
neys+2.655
 sqor+2.600
ngth+2.501
Neg logit contributions
hyde-3.652
\.-3.501
netflix-3.404
ergus-3.341
holding-3.295
Token his
Token ID465
Feature activation+8.39e-01

Pos logit contributions
hyde+3.898
\.+3.858
netflix+3.652
holding+3.622
ergus+3.559
Neg logit contributions
iga-5.368
 Drac-5.200
ngth-4.802
neys-4.786
 sqor-4.773
Token privately
Token ID17092
Feature activation-4.02e+00

Pos logit contributions
hyde+0.610
\.+0.596
netflix+0.565
ergus+0.551
holding+0.550
Neg logit contributions
iga-0.702
 Drac-0.685
neys-0.622
 sqor-0.613
ngth-0.608
Token held
Token ID2714
Feature activation+3.56e+00

Pos logit contributions
iga+3.093
 Drac+3.003
neys+2.708
 sqor+2.644
ngth+2.581
Neg logit contributions
hyde-3.195
\.-3.032
netflix-2.968
ergus-2.908
ooth-2.880
Token Chief
Token ID5953
Feature activation-2.56e+00

Pos logit contributions
hyde+2.518
\.+2.480
netflix+2.336
holding+2.332
ergus+2.287
Neg logit contributions
iga-2.987
 Drac-2.901
neys-2.660
ngth-2.621
 sqor-2.612
Token Oil
Token ID11474
Feature activation+5.18e+00

Pos logit contributions
iga+2.156
 Drac+2.094
neys+1.884
 sqor+1.852
ngth+1.840
Neg logit contributions
hyde-1.867
\.-1.807
netflix-1.734
ergus-1.675
ooth-1.668
Token &
Token ID1222
Feature activation+1.80e+01

Pos logit contributions
hyde+3.721
\.+3.633
holding+3.442
ergus+3.409
netflix+3.403
Neg logit contributions
iga-4.266
 Drac-4.158
neys-3.834
 sqor-3.810
ngth-3.755
Token Gas
Token ID14345
Feature activation-2.02e-01

Pos logit contributions
hyde+7.525
\.+7.488
holding+7.352
netflix+6.456
ergus+6.385
Neg logit contributions
ngth-16.779
iga-16.233
neys-16.023
 Drac-15.885
 sqor-15.393
Token continues
Token ID4477
Feature activation+1.66e+00

Pos logit contributions
iga+0.165
 Drac+0.161
neys+0.145
 sqor+0.142
ngth+0.142
Neg logit contributions
hyde-0.151
\.-0.148
netflix-0.142
ergus-0.137
holding-0.136
Token to
Token ID284
Feature activation-1.21e+00

Pos logit contributions
hyde+1.247
\.+1.219
netflix+1.157
ergus+1.130
holding+1.126
Neg logit contributions
iga-1.349
 Drac-1.314
neys-1.191
 sqor-1.171
ngth-1.163
Token drill
Token ID16007
Feature activation+5.71e+00

Pos logit contributions
iga+1.018
 Drac+0.987
neys+0.903
 sqor+0.879
ngth+0.876
Neg logit contributions
hyde-0.884
\.-0.862
netflix-0.820
ergus-0.804
holding-0.798
Token and
Token ID290
Feature activation+1.91e+00

Pos logit contributions
hyde+4.220
\.+4.202
netflix+3.951
holding+3.932
ergus+3.912
Neg logit contributions
iga-4.455
 Drac-4.311
neys-3.935
ngth-3.930
 sqor-3.928
Token x
Token ID2124
Feature activation-1.10e+00

Pos logit contributions
iga+3.580
 Drac+3.500
neys+3.050
 sqor+2.992
 WS+2.891
Neg logit contributions
hyde-4.389
\.-4.190
netflix-4.089
ergus-4.011
ooth-3.957
Token.
Token ID13
Feature activation+2.74e+00

Pos logit contributions
iga+0.938
 Drac+0.914
neys+0.832
 sqor+0.819
ngth+0.812
Neg logit contributions
hyde-0.785
\.-0.764
netflix-0.725
ergus-0.708
holding-0.705
Tokenx
Token ID87
Feature activation+4.50e-01

Pos logit contributions
\.+1.524
hyde+1.492
netflix+1.370
holding+1.369
ooth+1.333
Neg logit contributions
 Drac-2.727
iga-2.701
ngth-2.527
 sqor-2.521
neys-2.481
Token.
Token ID13
Feature activation+5.16e+00

Pos logit contributions
hyde+0.306
\.+0.305
netflix+0.283
holding+0.278
ergus+0.275
Neg logit contributions
iga-0.393
 Drac-0.387
neys-0.353
ngth-0.352
 sqor-0.351
Tokenx
Token ID87
Feature activation+9.26e+00

Pos logit contributions
\.+2.709
hyde+2.566
holding+2.397
netflix+2.332
ooth+2.298
Neg logit contributions
 Drac-5.330
iga-5.145
 sqor-4.995
ngth-4.995
neys-4.808
Token.
Token ID13
Feature activation+1.74e+01

Pos logit contributions
\.+5.182
hyde+4.553
holding+4.445
netflix+4.133
ooth+4.104
Neg logit contributions
 Drac-8.981
ngth-8.768
iga-8.584
 sqor-8.464
 confir-8.170
Token140
Token ID15187
Feature activation+1.51e+00

Pos logit contributions
\.+6.068
holding+5.523
hyde+5.311
mouth+4.788
ooth+4.627
Neg logit contributions
 Drac-17.938
ngth-17.460
iga-16.925
 Vaugh-16.785
 sqor-16.726
Token 10
Token ID838
Feature activation-2.13e+00

Pos logit contributions
hyde+1.048
\.+1.031
netflix+0.969
holding+0.949
ergus+0.944
Neg logit contributions
iga-1.302
 Drac-1.265
neys-1.159
 sqor-1.145
ngth-1.145
Token59
Token ID3270
Feature activation+8.38e-01

Pos logit contributions
iga+1.712
 Drac+1.653
neys+1.497
 sqor+1.458
ngth+1.424
Neg logit contributions
hyde-1.631
\.-1.568
netflix-1.516
ergus-1.488
ooth-1.474
Token272
Token ID29807
Feature activation-1.36e+00

Pos logit contributions
hyde+0.650
\.+0.635
netflix+0.605
ergus+0.592
holding+0.591
Neg logit contributions
iga-0.661
 Drac-0.642
neys-0.579
 sqor-0.569
ngth-0.565
Token x
Token ID2124
Feature activation-4.09e-01

Pos logit contributions
iga+1.045
 Drac+1.017
neys+0.916
 sqor+0.901
ngth+0.894
Neg logit contributions
hyde-1.081
\.-1.059
netflix-1.009
ergus-0.987
holding-0.985
Token x
Token ID2124
Feature activation-2.51e-02

Pos logit contributions
iga+2.760
 Drac+2.699
neys+2.382
 sqor+2.339
ngth+2.259
Neg logit contributions
hyde-3.194
\.-3.076
netflix-2.983
ergus-2.917
holding-2.897
Token.
Token ID13
Feature activation+3.98e+00

Pos logit contributions
iga+0.022
 Drac+0.021
neys+0.019
ngth+0.019
 sqor+0.019
Neg logit contributions
hyde-0.017
\.-0.017
netflix-0.016
holding-0.016
ergus-0.016
Tokenx
Token ID87
Feature activation+1.83e+00

Pos logit contributions
\.+2.152
hyde+2.056
holding+1.910
netflix+1.891
ooth+1.852
Neg logit contributions
 Drac-4.034
iga-3.964
ngth-3.781
 sqor-3.755
neys-3.666
Token.
Token ID13
Feature activation+8.93e+00

Pos logit contributions
\.+1.214
hyde+1.181
netflix+1.100
holding+1.091
ooth+1.071
Neg logit contributions
iga-1.635
 Drac-1.620
ngth-1.502
 sqor-1.484
neys-1.475
Tokenx
Token ID87
Feature activation+9.02e+00

Pos logit contributions
\.+4.273
hyde+3.883
holding+3.700
netflix+3.573
ooth+3.499
Neg logit contributions
 Drac-9.434
ngth-9.068
 sqor-9.018
iga-9.003
 confir-8.490
Token.
Token ID13
Feature activation+1.64e+01

Pos logit contributions
\.+5.200
hyde+4.489
holding+4.296
netflix+4.142
ooth+4.024
Neg logit contributions
ngth-8.646
 Drac-8.624
iga-8.484
 sqor-8.239
 confir-7.886
Token110
Token ID11442
Feature activation+1.43e+00

Pos logit contributions
\.+7.446
hyde+7.213
holding+6.460
netflix+5.687
ooth+5.523
Neg logit contributions
iga-15.816
ngth-15.517
 Drac-15.486
 sqor-14.940
neys-14.484
Token 11
Token ID1367
Feature activation-1.59e+00

Pos logit contributions
hyde+0.987
\.+0.968
netflix+0.910
holding+0.890
ergus+0.888
Neg logit contributions
iga-1.244
 Drac-1.212
neys-1.109
 sqor-1.096
ngth-1.092
Token190
Token ID19782
Feature activation+5.71e-02

Pos logit contributions
iga+1.254
 Drac+1.215
neys+1.096
 sqor+1.069
ngth+1.049
Neg logit contributions
hyde-1.231
\.-1.194
netflix-1.148
ergus-1.130
ooth-1.118
Token34
Token ID2682
Feature activation-3.50e+00

Pos logit contributions
hyde+0.043
\.+0.043
netflix+0.040
holding+0.039
ergus+0.039
Neg logit contributions
iga-0.046
 Drac-0.045
neys-0.040
 sqor-0.040
ngth-0.040
Token x
Token ID2124
Feature activation+2.74e-01

Pos logit contributions
iga+2.556
 Drac+2.487
neys+2.206
 sqor+2.156
ngth+2.086
Neg logit contributions
hyde-2.930
\.-2.821
netflix-2.733
ergus-2.683
holding-2.655
Token capacity
Token ID5339
Feature activation-3.23e-01

Pos logit contributions
iga+3.523
 Drac+3.368
neys+3.113
 sqor+3.038
ngth+3.026
Neg logit contributions
hyde-2.586
\.-2.471
netflix-2.411
ergus-2.329
ooth-2.251
Token:
Token ID25
Feature activation+3.48e+00

Pos logit contributions
iga+0.270
 Drac+0.262
neys+0.238
 sqor+0.235
ngth+0.232
Neg logit contributions
hyde-0.237
\.-0.230
netflix-0.220
ergus-0.215
ooth-0.212
Token https
Token ID3740
Feature activation-1.53e+00

Pos logit contributions
hyde+2.419
\.+2.379
netflix+2.234
holding+2.194
ergus+2.181
Neg logit contributions
iga-2.989
 Drac-2.911
neys-2.669
 sqor-2.622
ngth-2.615
Token://
Token ID1378
Feature activation+6.17e-02

Pos logit contributions
iga+1.333
 Drac+1.297
neys+1.186
 sqor+1.163
ngth+1.152
Neg logit contributions
hyde-1.059
\.-1.026
netflix-0.976
ergus-0.953
holding-0.947
Tokenbit
Token ID2545
Feature activation+5.34e+00

Pos logit contributions
hyde+0.045
\.+0.044
netflix+0.041
ergus+0.041
ooth+0.041
Neg logit contributions
iga-0.052
 Drac-0.050
neys-0.046
 sqor-0.045
ngth-0.044
Tokenco
Token ID1073
Feature activation+1.53e+01

Pos logit contributions
hyde+3.550
\.+3.537
holding+3.259
netflix+3.251
ergus+3.181
Neg logit contributions
iga-4.679
 Drac-4.666
 sqor-4.194
neys-4.175
ngth-4.167
Tokenint
Token ID600
Feature activation+4.64e+00

Pos logit contributions
\.+6.740
hyde+5.961
ertodd+5.940
ooth+5.839
ergus+5.780
Neg logit contributions
 Drac-15.775
ngth-14.360
 sqor-14.132
iga-13.997
neys-13.773
Tokenalk
Token ID971
Feature activation-2.58e+00

Pos logit contributions
hyde+2.106
\.+2.100
netflix+1.919
ooth+1.904
holding+1.872
Neg logit contributions
iga-4.992
 Drac-4.963
neys-4.581
 sqor-4.550
ngth-4.546
Token.
Token ID13
Feature activation+4.48e+00

Pos logit contributions
iga+2.191
 Drac+2.134
neys+1.943
 sqor+1.913
ngth+1.899
Neg logit contributions
hyde-1.838
\.-1.789
netflix-1.699
ergus-1.657
holding-1.652
Tokenorg
Token ID2398
Feature activation+3.22e+00

Pos logit contributions
\.+1.802
hyde+1.791
holding+1.582
ooth+1.533
netflix+1.529
Neg logit contributions
 Drac-5.053
iga-5.018
neys-4.716
ngth-4.710
 sqor-4.626
Token/
Token ID14
Feature activation+3.30e+00

Pos logit contributions
\.+2.382
hyde+2.300
netflix+2.150
holding+2.141
ooth+2.124
Neg logit contributions
iga-2.634
 Drac-2.578
ngth-2.386
neys-2.361
 sqor-2.286
Token fulfill
Token ID14658
Feature activation+6.03e-01

Pos logit contributions
iga+0.695
 Drac+0.677
neys+0.615
 sqor+0.604
ngth+0.600
Neg logit contributions
hyde-0.618
\.-0.604
netflix-0.574
ergus-0.560
holding-0.559
Token that
Token ID326
Feature activation+8.36e-01

Pos logit contributions
hyde+0.449
\.+0.438
netflix+0.417
ergus+0.405
holding+0.403
Neg logit contributions
iga-0.495
 Drac-0.483
neys-0.437
 sqor-0.429
ngth-0.425
Token in
Token ID287
Feature activation-2.70e+00

Pos logit contributions
hyde+0.620
\.+0.606
netflix+0.576
ergus+0.562
holding+0.560
Neg logit contributions
iga-0.687
 Drac-0.670
neys-0.607
 sqor-0.598
ngth-0.593
Token oil
Token ID3056
Feature activation+1.49e+00

Pos logit contributions
iga+2.184
 Drac+2.101
neys+1.904
 sqor+1.846
ngth+1.828
Neg logit contributions
hyde-2.037
\.-1.996
netflix-1.916
ergus-1.839
ooth-1.824
Token-
Token ID12
Feature activation+4.69e+00

Pos logit contributions
hyde+1.063
\.+1.038
netflix+0.983
ergus+0.959
holding+0.956
Neg logit contributions
iga-1.266
 Drac-1.234
neys-1.124
 sqor-1.106
ngth-1.098
Token and
Token ID290
Feature activation+1.52e+01

Pos logit contributions
hyde+3.241
\.+3.152
netflix+3.037
holding+3.015
ergus+2.935
Neg logit contributions
iga-3.988
 Drac-3.945
neys-3.595
ngth-3.566
 sqor-3.563
Token gas
Token ID3623
Feature activation-4.43e+00

Pos logit contributions
\.+5.780
hyde+5.606
holding+5.399
netflix+5.020
ergus+4.748
Neg logit contributions
 Drac-15.380
ngth-15.185
iga-15.029
neys-14.884
 sqor-14.262
Token-
Token ID12
Feature activation+1.43e+00

Pos logit contributions
iga+3.438
 Drac+3.305
neys+2.966
 sqor+2.856
ngth+2.823
Neg logit contributions
hyde-3.431
\.-3.329
netflix-3.291
ooth-3.122
ergus-3.118
Tokenrelated
Token ID5363
Feature activation+1.71e+00

Pos logit contributions
hyde+1.032
\.+1.008
netflix+0.956
holding+0.933
ergus+0.932
Neg logit contributions
iga-1.198
 Drac-1.169
neys-1.062
 sqor-1.047
ngth-1.040
Token matters
Token ID6067
Feature activation+1.88e+00

Pos logit contributions
hyde+1.345
\.+1.318
netflix+1.252
holding+1.227
ergus+1.227
Neg logit contributions
iga-1.323
 Drac-1.287
neys-1.160
 sqor-1.143
ngth-1.136
Token.
Token ID13
Feature activation+4.67e-01

Pos logit contributions
hyde+1.316
\.+1.289
netflix+1.216
holding+1.187
ergus+1.186
Neg logit contributions
iga-1.618
 Drac-1.580
neys-1.439
 sqor-1.420
ngth-1.411
Token prepar
Token ID9198
Feature activation-2.86e+00

Pos logit contributions
hyde+1.018
\.+0.995
netflix+0.943
ergus+0.921
holding+0.919
Neg logit contributions
iga-1.149
 Drac-1.120
neys-1.017
 sqor-1.001
ngth-0.994
Tokenatory
Token ID2870
Feature activation-2.80e+00

Pos logit contributions
iga+1.589
 Drac+1.508
neys+1.329
 sqor+1.256
ngth+1.255
Neg logit contributions
hyde-2.882
\.-2.820
netflix-2.717
holding-2.662
ooth-2.659
Token step
Token ID2239
Feature activation+2.98e+00

Pos logit contributions
iga+2.294
 Drac+2.253
neys+2.042
 sqor+2.014
ngth+1.960
Neg logit contributions
hyde-2.086
\.-2.003
netflix-1.922
ergus-1.875
ooth-1.859
Token in
Token ID287
Feature activation+2.08e+00

Pos logit contributions
hyde+2.122
\.+2.082
netflix+1.964
holding+1.927
ergus+1.920
Neg logit contributions
iga-2.514
 Drac-2.452
neys-2.226
 sqor-2.203
ngth-2.186
Token oil
Token ID3056
Feature activation+1.05e+00

Pos logit contributions
hyde+1.509
\.+1.475
netflix+1.398
ergus+1.365
holding+1.363
Neg logit contributions
iga-1.737
 Drac-1.694
neys-1.539
 sqor-1.516
ngth-1.506
Token and
Token ID290
Feature activation+1.50e+01

Pos logit contributions
hyde+0.775
\.+0.756
netflix+0.718
ergus+0.699
holding+0.696
Neg logit contributions
iga-0.873
 Drac-0.849
neys-0.772
 sqor-0.757
ngth-0.753
Token gas
Token ID3623
Feature activation-1.44e+00

Pos logit contributions
\.+5.900
hyde+5.748
holding+5.341
netflix+5.161
ergus+4.987
Neg logit contributions
 Drac-15.222
iga-15.017
ngth-14.657
neys-14.618
 sqor-13.974
Token exploration
Token ID13936
Feature activation+1.48e+00

Pos logit contributions
iga+1.094
 Drac+1.060
neys+0.952
ngth+0.919
 sqor+0.919
Neg logit contributions
hyde-1.156
\.-1.126
netflix-1.088
ergus-1.053
ooth-1.047
Token,
Token ID11
Feature activation+3.46e+00

Pos logit contributions
hyde+1.055
\.+1.036
netflix+0.978
holding+0.956
ergus+0.956
Neg logit contributions
iga-1.259
 Drac-1.228
neys-1.118
 sqor-1.102
ngth-1.095
Token consultation
Token ID18103
Feature activation+6.25e-01

Pos logit contributions
hyde+2.446
\.+2.407
netflix+2.271
holding+2.234
ergus+2.224
Neg logit contributions
iga-2.932
 Drac-2.860
neys-2.589
 sqor-2.572
ngth-2.554
Token with
Token ID351
Feature activation+2.32e+00

Pos logit contributions
hyde+0.454
\.+0.442
netflix+0.420
ergus+0.409
holding+0.408
Neg logit contributions
iga-0.524
 Drac-0.511
neys-0.465
 sqor-0.457
ngth-0.454
Token defenses
Token ID18370
Feature activation+2.18e-01

Pos logit contributions
iga+1.907
 Drac+1.849
neys+1.651
 sqor+1.629
ngth+1.580
Neg logit contributions
hyde-2.120
\.-2.036
netflix-1.953
ergus-1.915
ooth-1.892
Token away
Token ID1497
Feature activation-2.83e+00

Pos logit contributions
hyde+0.159
\.+0.155
netflix+0.147
ergus+0.144
holding+0.143
Neg logit contributions
iga-0.182
 Drac-0.177
neys-0.161
 sqor-0.159
ngth-0.157
Token from
Token ID422
Feature activation-4.88e+00

Pos logit contributions
iga+2.280
 Drac+2.198
neys+1.983
 sqor+1.969
ngth+1.929
Neg logit contributions
hyde-2.158
\.-2.058
netflix-2.039
ergus-1.957
holding-1.933
Token home
Token ID1363
Feature activation+8.38e-01

Pos logit contributions
iga+3.375
 Drac+3.214
neys+2.826
 sqor+2.800
ngth+2.704
Neg logit contributions
hyde-4.290
\.-4.127
netflix-4.024
ergus-3.927
holding-3.853
Token in
Token ID287
Feature activation+2.96e-01

Pos logit contributions
hyde+0.604
\.+0.590
netflix+0.559
ergus+0.545
holding+0.544
Neg logit contributions
iga-0.706
 Drac-0.689
neys-0.627
 sqor-0.617
ngth-0.613
Token Green
Token ID3469
Feature activation+1.48e+01

Pos logit contributions
hyde+0.217
\.+0.212
netflix+0.201
ergus+0.196
holding+0.195
Neg logit contributions
iga-0.246
 Drac-0.240
neys-0.218
 sqor-0.214
ngth-0.212
Token Bay
Token ID4696
Feature activation+4.14e+00

Pos logit contributions
\.+3.352
hyde+3.146
mouth+2.330
holding+2.242
ooth+2.146
Neg logit contributions
 Drac-17.536
iga-17.412
ngth-17.397
 sqor-16.873
neys-16.634
Token,
Token ID11
Feature activation-3.56e-01

Pos logit contributions
hyde+2.755
\.+2.743
netflix+2.531
holding+2.490
ergus+2.474
Neg logit contributions
iga-3.629
 Drac-3.583
neys-3.273
ngth-3.232
 sqor-3.208
Token Kansas
Token ID9470
Feature activation-2.16e+00

Pos logit contributions
iga+0.305
 Drac+0.297
neys+0.270
 sqor+0.265
ngth+0.262
Neg logit contributions
hyde-0.253
\.-0.246
netflix-0.235
ergus-0.229
holding-0.226
Token City
Token ID2254
Feature activation+2.40e+00

Pos logit contributions
iga+2.010
 Drac+1.933
neys+1.784
 sqor+1.753
ngth+1.715
Neg logit contributions
hyde-1.376
\.-1.311
netflix-1.260
ergus-1.224
holding-1.211
Token and
Token ID290
Feature activation-3.57e+00

Pos logit contributions
hyde+1.659
\.+1.622
netflix+1.530
ergus+1.492
holding+1.491
Neg logit contributions
iga-2.085
 Drac-2.039
neys-1.859
 sqor-1.832
ngth-1.823
Token,
Token ID11
Feature activation+9.49e-01

Pos logit contributions
iga+1.308
 Drac+1.277
neys+1.149
 sqor+1.138
ngth+1.113
Neg logit contributions
hyde-1.244
\.-1.192
netflix-1.164
holding-1.130
ergus-1.126
Token
Token ID447
Feature activation-8.51e-01

Pos logit contributions
hyde+0.655
\.+0.641
netflix+0.605
ergus+0.589
holding+0.589
Neg logit contributions
iga-0.827
 Drac-0.808
neys-0.737
 sqor-0.727
ngth-0.722
Token
Token ID251
Feature activation+2.50e-01

Pos logit contributions
iga+0.871
 Drac+0.855
neys+0.791
 sqor+0.783
ngth+0.780
Neg logit contributions
hyde-0.459
\.-0.446
netflix-0.411
ergus-0.400
holding-0.400
Token says
Token ID1139
Feature activation+4.81e+00

Pos logit contributions
hyde+0.178
\.+0.174
netflix+0.165
ergus+0.161
holding+0.160
Neg logit contributions
iga-0.213
 Drac-0.208
neys-0.189
 sqor-0.186
ngth-0.185
Token physicist
Token ID33013
Feature activation+8.15e+00

Pos logit contributions
hyde+3.399
\.+3.350
netflix+3.122
ergus+3.100
holding+3.084
Neg logit contributions
iga-4.026
 Drac-3.900
neys-3.582
 sqor-3.531
ngth-3.518
Token Y
Token ID575
Feature activation+1.48e+01

Pos logit contributions
\.+5.440
hyde+5.345
ergus+5.006
netflix+4.949
holding+4.948
Neg logit contributions
iga-6.830
 Drac-6.502
neys-6.128
ngth-6.123
 sqor-6.114
Tokenannis
Token ID45017
Feature activation+3.96e+00

Pos logit contributions
\.+8.421
hyde+8.112
ergus+8.005
ooth+7.914
onel+7.422
Neg logit contributions
 Drac-12.816
iga-11.737
 sqor-11.542
ngth-11.322
neys-10.964
Token Sem
Token ID12449
Feature activation+2.06e+00

Pos logit contributions
hyde+2.902
\.+2.878
netflix+2.682
ergus+2.657
holding+2.639
Neg logit contributions
iga-3.206
 Drac-3.095
neys-2.866
 sqor-2.835
ngth-2.815
Tokenert
Token ID861
Feature activation+9.03e-01

Pos logit contributions
hyde+1.494
\.+1.463
netflix+1.380
ergus+1.358
holding+1.355
Neg logit contributions
iga-1.698
 Drac-1.665
neys-1.507
 sqor-1.501
ngth-1.483
Tokenz
Token ID89
Feature activation-1.50e+00

Pos logit contributions
hyde+0.673
\.+0.659
netflix+0.625
ergus+0.611
holding+0.610
Neg logit contributions
iga-0.737
 Drac-0.719
neys-0.651
 sqor-0.641
ngth-0.637
Tokenidis
Token ID29207
Feature activation-2.36e+00

Pos logit contributions
iga+1.225
 Drac+1.181
neys+1.065
ngth+1.048
 sqor+1.047
Neg logit contributions
hyde-1.125
\.-1.092
netflix-1.052
holding-1.018
ergus-1.017
Token with
Token ID351
Feature activation+2.52e+00

Pos logit contributions
hyde+1.550
\.+1.520
netflix+1.428
holding+1.398
ergus+1.398
Neg logit contributions
iga-1.890
 Drac-1.835
neys-1.676
 sqor-1.649
ngth-1.644
Token flagship
Token ID21336
Feature activation+3.04e+00

Pos logit contributions
hyde+1.764
\.+1.733
netflix+1.633
holding+1.596
ergus+1.592
Neg logit contributions
iga-2.163
 Drac-2.104
neys-1.923
 sqor-1.890
ngth-1.887
Token design
Token ID1486
Feature activation+8.74e+00

Pos logit contributions
hyde+2.088
\.+2.062
netflix+1.929
holding+1.899
ooth+1.883
Neg logit contributions
iga-2.626
 Drac-2.549
neys-2.330
ngth-2.298
 sqor-2.292
Token of
Token ID286
Feature activation+2.65e+00

Pos logit contributions
hyde+6.162
\.+6.129
holding+5.801
ooth+5.644
netflix+5.636
Neg logit contributions
iga-6.902
 Drac-6.690
ngth-6.058
neys-6.041
 sqor-5.930
Token Lightning
Token ID12469
Feature activation+1.05e+01

Pos logit contributions
hyde+1.918
\.+1.879
netflix+1.776
holding+1.735
ergus+1.734
Neg logit contributions
iga-2.221
 Drac-2.161
neys-1.969
 sqor-1.938
ngth-1.930
Token series
Token ID2168
Feature activation+1.47e+01

Pos logit contributions
\.+6.019
hyde+5.994
netflix+5.398
holding+5.396
ooth+5.386
Neg logit contributions
iga-9.557
 Drac-9.175
neys-8.749
ngth-8.709
 sqor-8.483
Token,
Token ID11
Feature activation+5.26e+00

Pos logit contributions
\.+8.625
hyde+8.286
holding+7.871
ooth+7.675
netflix+7.413
Neg logit contributions
iga-12.080
 Drac-11.656
ngth-11.057
neys-11.027
 sqor-10.530
Token R
Token ID371
Feature activation+2.87e+00

Pos logit contributions
hyde+3.854
\.+3.792
netflix+3.541
holding+3.521
ooth+3.482
Neg logit contributions
iga-4.266
 Drac-4.148
neys-3.795
ngth-3.694
 sqor-3.685
Token78
Token ID3695
Feature activation+9.71e+00

Pos logit contributions
hyde+1.752
\.+1.711
netflix+1.588
ooth+1.555
ergus+1.555
Neg logit contributions
iga-2.701
 Drac-2.662
neys-2.441
 sqor-2.418
ngth-2.407
Token70
Token ID2154
Feature activation+6.73e+00

Pos logit contributions
\.+4.909
hyde+4.780
holding+4.165
ooth+4.055
netflix+3.987
Neg logit contributions
 Drac-9.537
iga-9.439
ngth-9.131
neys-8.932
 sqor-8.664
Token Hawk
Token ID26698
Feature activation+1.14e+01

Pos logit contributions
hyde+4.579
\.+4.508
netflix+4.181
ooth+4.138
holding+4.110
Neg logit contributions
iga-5.738
 Drac-5.583
neys-5.165
ngth-5.071
 sqor-5.029
Token x
Token ID2124
Feature activation-2.81e-01

Pos logit contributions
iga+0.981
 Drac+0.954
neys+0.860
 sqor+0.846
ngth+0.841
Neg logit contributions
hyde-1.006
\.-0.987
netflix-0.939
ergus-0.918
holding-0.917
Token.
Token ID13
Feature activation+2.82e+00

Pos logit contributions
iga+0.243
 Drac+0.238
neys+0.217
 sqor+0.215
ngth+0.215
Neg logit contributions
hyde-0.195
\.-0.193
netflix-0.181
holding-0.177
ergus-0.176
Tokenx
Token ID87
Feature activation+1.82e+00

Pos logit contributions
\.+1.564
hyde+1.525
holding+1.402
netflix+1.400
ooth+1.365
Neg logit contributions
 Drac-2.813
iga-2.784
ngth-2.614
 sqor-2.604
neys-2.560
Token.
Token ID13
Feature activation+4.83e+00

Pos logit contributions
\.+1.206
hyde+1.178
netflix+1.096
holding+1.087
ooth+1.065
Neg logit contributions
iga-1.625
 Drac-1.618
ngth-1.497
 sqor-1.480
neys-1.472
Tokenx
Token ID87
Feature activation+9.88e+00

Pos logit contributions
\.+2.666
hyde+2.553
holding+2.380
netflix+2.273
ooth+2.267
Neg logit contributions
 Drac-4.856
iga-4.688
ngth-4.558
 sqor-4.532
neys-4.385
Token.
Token ID13
Feature activation+1.47e+01

Pos logit contributions
\.+5.633
hyde+4.976
holding+4.868
ooth+4.317
netflix+4.317
Neg logit contributions
 Drac-9.528
ngth-9.379
iga-9.008
 sqor-8.917
 confir-8.678
Token138
Token ID20107
Feature activation+4.03e+00

Pos logit contributions
\.+8.282
hyde+7.742
holding+7.643
mouth+6.656
ooth+6.598
Neg logit contributions
 Drac-13.031
ngth-12.823
iga-12.052
 sqor-11.949
 Vaugh-11.706
Token 11
Token ID1367
Feature activation+1.70e-01

Pos logit contributions
\.+2.615
hyde+2.600
netflix+2.396
holding+2.381
ergus+2.320
Neg logit contributions
iga-3.602
 Drac-3.477
ngth-3.257
neys-3.238
 sqor-3.217
Token30
Token ID1270
Feature activation-4.33e+00

Pos logit contributions
hyde+0.127
\.+0.124
netflix+0.118
ergus+0.115
holding+0.115
Neg logit contributions
iga-0.139
 Drac-0.135
neys-0.122
 sqor-0.120
ngth-0.119
Token277
Token ID27019
Feature activation-2.56e+00

Pos logit contributions
iga+3.296
 Drac+3.154
neys+2.828
 sqor+2.762
ngth+2.702
Neg logit contributions
hyde-3.416
\.-3.292
netflix-3.175
ergus-3.145
ooth-3.108
Token x
Token ID2124
Feature activation-3.79e-02

Pos logit contributions
iga+1.909
 Drac+1.860
neys+1.659
 sqor+1.628
ngth+1.592
Neg logit contributions
hyde-2.101
\.-2.036
netflix-1.957
ergus-1.919
holding-1.906
Token.
Token ID13
Feature activation-1.58e+00

Pos logit contributions
hyde+2.058
\.+2.028
netflix+1.911
holding+1.864
ooth+1.857
Neg logit contributions
iga-2.566
 Drac-2.507
neys-2.298
 sqor-2.261
ngth-2.258
Token
Token ID198
Feature activation-1.63e+00

Pos logit contributions
iga+1.314
 Drac+1.279
neys+1.159
 sqor+1.124
ngth+1.121
Neg logit contributions
hyde-1.163
\.-1.135
netflix-1.073
ooth-1.055
ergus-1.054
Token
Token ID198
Feature activation-7.38e-01

Pos logit contributions
iga+1.354
 Drac+1.318
neys+1.198
 sqor+1.176
ngth+1.164
Neg logit contributions
hyde-1.191
\.-1.159
netflix-1.102
ergus-1.077
holding-1.071
TokenH
Token ID39
Feature activation-2.58e+00

Pos logit contributions
iga+0.643
 Drac+0.623
neys+0.572
 sqor+0.555
ngth+0.551
Neg logit contributions
hyde-0.514
\.-0.495
netflix-0.470
ergus-0.459
ooth-0.458
Tokenapp
Token ID1324
Feature activation+1.50e+00

Pos logit contributions
iga+2.250
 Drac+2.127
neys+1.974
 sqor+1.918
ngth+1.905
Neg logit contributions
hyde-1.814
\.-1.758
netflix-1.687
holding-1.624
ergus-1.604
Token conc
Token ID1673
Feature activation+1.46e+01

Pos logit contributions
hyde+1.394
\.+1.370
netflix+1.314
ergus+1.291
holding+1.289
Neg logit contributions
iga-0.955
 Drac-0.925
neys-0.812
 sqor-0.797
ngth-0.789
Tokenurs
Token ID1834
Feature activation+2.54e+00

Pos logit contributions
\.+9.810
ooth+9.122
holding+9.073
ergus+8.994
hyde+8.883
Neg logit contributions
 Drac-10.988
iga-10.660
neys-9.869
ngth-9.817
 sqor-9.747
Token:
Token ID25
Feature activation+1.71e+00

Pos logit contributions
hyde+1.768
\.+1.737
netflix+1.635
holding+1.601
ergus+1.601
Neg logit contributions
iga-2.183
 Drac-2.132
neys-1.951
 sqor-1.919
ngth-1.917
Token "
Token ID366
Feature activation+4.05e-01

Pos logit contributions
hyde+1.165
\.+1.140
netflix+1.076
holding+1.051
ergus+1.048
Neg logit contributions
iga-1.505
 Drac-1.467
neys-1.343
 sqor-1.325
ngth-1.316
TokenThe
Token ID464
Feature activation+1.76e+00

Pos logit contributions
hyde+0.285
\.+0.278
netflix+0.264
ergus+0.257
holding+0.256
Neg logit contributions
iga-0.347
 Drac-0.339
neys-0.309
 sqor-0.304
ngth-0.302
Token semi
Token ID10663
Feature activation-4.38e-01

Pos logit contributions
hyde+1.263
\.+1.236
netflix+1.169
holding+1.144
ergus+1.140
Neg logit contributions
iga-1.489
 Drac-1.449
neys-1.320
 sqor-1.300
ngth-1.296
Token're
Token ID821
Feature activation-4.81e-01

Pos logit contributions
iga+4.798
 Drac+4.616
neys+4.237
ngth+4.124
 sqor+4.072
Neg logit contributions
hyde-4.421
\.-4.321
netflix-4.150
ergus-4.130
holding-3.997
Token sorry
Token ID7926
Feature activation+3.25e-01

Pos logit contributions
iga+0.380
 Drac+0.370
neys+0.332
 sqor+0.326
ngth+0.326
Neg logit contributions
hyde-0.372
\.-0.366
netflix-0.348
ergus-0.339
holding-0.339
Token,
Token ID11
Feature activation+6.92e-01

Pos logit contributions
hyde+0.227
\.+0.221
netflix+0.209
ergus+0.204
holding+0.204
Neg logit contributions
iga-0.281
 Drac-0.275
neys-0.250
 sqor-0.247
ngth-0.244
Token but
Token ID475
Feature activation-1.61e+00

Pos logit contributions
hyde+0.472
\.+0.461
netflix+0.435
ergus+0.424
holding+0.423
Neg logit contributions
iga-0.608
 Drac-0.594
neys-0.544
 sqor-0.534
ngth-0.533
Token something
Token ID1223
Feature activation+2.81e+00

Pos logit contributions
iga+1.300
 Drac+1.275
neys+1.143
 sqor+1.129
ngth+1.104
Neg logit contributions
hyde-1.218
\.-1.183
netflix-1.135
ergus-1.098
holding-1.084
Token went
Token ID1816
Feature activation+1.46e+01

Pos logit contributions
hyde+1.697
\.+1.659
netflix+1.552
ergus+1.515
holding+1.511
Neg logit contributions
iga-2.677
 Drac-2.616
neys-2.412
 sqor-2.409
ngth-2.376
Token wrong
Token ID2642
Feature activation-9.84e-01

Pos logit contributions
\.+6.665
hyde+5.845
netflix+5.748
ergus+5.447
ooth+5.438
Neg logit contributions
iga-14.931
 sqor-14.930
 Drac-14.484
neys-14.167
ngth-14.090
Token Be
Token ID1355
Feature activation-6.05e+00

Pos logit contributions
iga+0.812
 Drac+0.796
neys+0.715
 sqor+0.702
ngth+0.692
Neg logit contributions
hyde-0.727
\.-0.702
netflix-0.674
holding-0.659
ergus-0.656
Token the
Token ID262
Feature activation-1.93e+00

Pos logit contributions
iga+4.553
 Drac+4.369
neys+3.943
 sqor+3.755
ngth+3.742
Neg logit contributions
\.-4.735
netflix-4.653
hyde-4.613
ergus-4.371
holding-4.326
Token first
Token ID717
Feature activation-1.31e+00

Pos logit contributions
iga+1.435
 Drac+1.400
neys+1.238
 sqor+1.229
 WS+1.192
Neg logit contributions
hyde-1.572
\.-1.537
netflix-1.472
ergus-1.434
holding-1.428
Token to
Token ID284
Feature activation+4.86e+00

Pos logit contributions
iga+0.997
 Drac+0.981
neys+0.877
 sqor+0.859
ngth+0.851
Neg logit contributions
hyde-1.045
\.-1.022
netflix-0.976
holding-0.956
ergus-0.952
Token conduct
Token ID3189
Feature activation+1.27e+00

Pos logit contributions
iga+0.946
 Drac+0.923
neys+0.838
 sqor+0.813
ngth+0.810
Neg logit contributions
hyde-0.844
\.-0.820
netflix-0.782
ergus-0.764
holding-0.759
Token seismic
Token ID37463
Feature activation+3.43e-01

Pos logit contributions
hyde+0.937
\.+0.918
netflix+0.869
holding+0.850
ergus+0.850
Neg logit contributions
iga-1.048
 Drac-1.022
neys-0.927
 sqor-0.914
ngth-0.908
Token testing
Token ID4856
Feature activation-4.06e+00

Pos logit contributions
hyde+0.264
\.+0.258
netflix+0.246
ergus+0.240
holding+0.239
Neg logit contributions
iga-0.272
 Drac-0.265
neys-0.240
 sqor-0.235
ngth-0.234
Token for
Token ID329
Feature activation+3.15e+00

Pos logit contributions
iga+3.125
 Drac+3.067
neys+2.764
 sqor+2.688
ngth+2.641
Neg logit contributions
hyde-3.236
\.-3.068
netflix-2.982
ergus-2.900
ooth-2.885
Token oil
Token ID3056
Feature activation+1.45e+00

Pos logit contributions
hyde+2.233
\.+2.192
netflix+2.071
holding+2.046
ergus+2.027
Neg logit contributions
iga-2.655
 Drac-2.570
neys-2.350
 sqor-2.336
ngth-2.324
Token and
Token ID290
Feature activation+1.43e+01

Pos logit contributions
hyde+1.119
\.+1.094
netflix+1.041
ergus+1.016
holding+1.014
Neg logit contributions
iga-1.150
 Drac-1.120
neys-1.012
 sqor-0.994
ngth-0.988
Token gas
Token ID3623
Feature activation+1.19e-01

Pos logit contributions
\.+5.992
hyde+5.858
holding+5.565
netflix+5.174
ooth+5.114
Neg logit contributions
iga-14.226
 Drac-14.045
ngth-13.761
neys-13.716
 sqor-13.137
Token in
Token ID287
Feature activation+4.70e-01

Pos logit contributions
hyde+0.095
\.+0.093
netflix+0.089
ergus+0.086
ooth+0.086
Neg logit contributions
iga-0.091
 Drac-0.088
neys-0.079
 sqor-0.078
ngth-0.077
Token the
Token ID262
Feature activation-2.72e+00

Pos logit contributions
hyde+0.352
\.+0.343
netflix+0.327
ergus+0.317
holding+0.316
Neg logit contributions
iga-0.384
 Drac-0.376
neys-0.339
 sqor-0.332
ngth-0.329
Token north
Token ID5093
Feature activation+3.51e-01

Pos logit contributions
iga+2.279
 Drac+2.252
neys+2.001
 sqor+1.970
 WS+1.891
Neg logit contributions
hyde-2.007
\.-1.896
netflix-1.836
ergus-1.758
ooth-1.757
Token but
Token ID475
Feature activation+4.36e+00

Pos logit contributions
hyde+0.265
\.+0.255
netflix+0.244
ergus+0.238
ooth+0.237
Neg logit contributions
iga-0.287
 Drac-0.278
neys-0.250
 sqor-0.248
ngth-0.245
Token x
Token ID2124
Feature activation-4.33e-01

Pos logit contributions
iga+2.544
 Drac+2.481
neys+2.197
 sqor+2.156
ngth+2.086
Neg logit contributions
hyde-2.916
\.-2.811
netflix-2.717
ergus-2.665
holding-2.637
Token.
Token ID13
Feature activation+2.90e+00

Pos logit contributions
iga+0.373
 Drac+0.365
neys+0.333
 sqor+0.329
ngth+0.328
Neg logit contributions
hyde-0.302
\.-0.297
netflix-0.279
holding-0.273
ergus-0.272
Tokenx
Token ID87
Feature activation+1.83e+00

Pos logit contributions
\.+1.615
hyde+1.568
holding+1.446
netflix+1.441
ooth+1.406
Neg logit contributions
 Drac-2.893
iga-2.865
ngth-2.691
 sqor-2.678
neys-2.632
Token.
Token ID13
Feature activation+5.32e+00

Pos logit contributions
\.+1.212
hyde+1.177
netflix+1.099
holding+1.090
ooth+1.069
Neg logit contributions
iga-1.637
 Drac-1.625
ngth-1.506
 sqor-1.487
neys-1.479
Tokenx
Token ID87
Feature activation+7.24e+00

Pos logit contributions
\.+2.806
hyde+2.615
holding+2.464
netflix+2.405
ooth+2.368
Neg logit contributions
 Drac-5.516
iga-5.329
 sqor-5.170
ngth-5.164
neys-4.966
Token.
Token ID13
Feature activation+1.43e+01

Pos logit contributions
\.+4.351
hyde+3.846
holding+3.656
ooth+3.562
netflix+3.511
Neg logit contributions
 Drac-6.974
iga-6.684
ngth-6.598
 sqor-6.444
neys-6.298
Token160
Token ID14198
Feature activation+2.21e-01

Pos logit contributions
\.+7.460
hyde+6.670
holding+6.400
ooth+6.307
netflix+5.618
Neg logit contributions
 Drac-13.898
iga-12.792
ngth-12.667
neys-12.353
 sqor-12.181
Token 100
Token ID1802
Feature activation+6.63e+00

Pos logit contributions
hyde+0.156
\.+0.152
netflix+0.144
ergus+0.141
holding+0.141
Neg logit contributions
iga-0.189
 Drac-0.184
neys-0.168
 sqor-0.165
ngth-0.164
Token10
Token ID940
Feature activation+1.03e+00

Pos logit contributions
\.+4.519
hyde+4.429
holding+4.118
netflix+4.093
ooth+3.949
Neg logit contributions
iga-5.579
 Drac-5.524
ngth-5.226
 sqor-5.103
neys-5.030
Token94
Token ID5824
Feature activation-6.47e-01

Pos logit contributions
hyde+0.741
\.+0.730
netflix+0.687
holding+0.672
ergus+0.670
Neg logit contributions
iga-0.870
 Drac-0.850
neys-0.774
 sqor-0.764
ngth-0.762
Token x
Token ID2124
Feature activation-6.74e-02

Pos logit contributions
iga+0.504
 Drac+0.490
neys+0.443
 sqor+0.436
ngth+0.436
Neg logit contributions
hyde-0.505
\.-0.497
netflix-0.471
holding-0.461
ergus-0.460
Token game
Token ID983
Feature activation+6.86e-01

Pos logit contributions
\.+2.308
hyde+2.285
netflix+2.137
holding+2.066
ooth+2.057
Neg logit contributions
iga-4.496
 Drac-4.412
ngth-4.137
neys-4.133
 sqor-4.054
Token,
Token ID11
Feature activation+1.81e+00

Pos logit contributions
hyde+0.502
\.+0.492
netflix+0.466
ergus+0.455
holding+0.455
Neg logit contributions
iga-0.569
 Drac-0.555
neys-0.504
 sqor-0.496
ngth-0.493
Token joining
Token ID9679
Feature activation+1.91e+00

Pos logit contributions
hyde+1.222
\.+1.205
netflix+1.133
holding+1.108
ergus+1.105
Neg logit contributions
iga-1.588
 Drac-1.547
neys-1.423
 sqor-1.404
ngth-1.401
Token Joe
Token ID5689
Feature activation-4.11e-02

Pos logit contributions
hyde+1.383
\.+1.357
netflix+1.278
ergus+1.255
holding+1.253
Neg logit contributions
iga-1.601
 Drac-1.557
neys-1.422
 sqor-1.398
ngth-1.394
Token Smith
Token ID4176
Feature activation+2.81e+00

Pos logit contributions
iga+0.035
 Drac+0.034
neys+0.031
 sqor+0.030
ngth+0.030
Neg logit contributions
hyde-0.029
\.-0.029
netflix-0.027
ergus-0.026
holding-0.026
Token.[
Token ID3693
Feature activation+1.43e+01

Pos logit contributions
hyde+1.942
\.+1.925
netflix+1.794
holding+1.765
ergus+1.760
Neg logit contributions
iga-2.405
 Drac-2.359
neys-2.165
ngth-2.132
 sqor-2.130
Token110
Token ID11442
Feature activation-4.81e+00

Pos logit contributions
hyde+7.627
\.+7.516
holding+7.020
netflix+6.805
ergus+6.372
Neg logit contributions
iga-12.897
 Drac-12.809
ngth-11.844
neys-11.795
 sqor-11.703
Token]
Token ID60
Feature activation-5.90e-01

Pos logit contributions
iga+4.425
 Drac+4.334
neys+3.939
 sqor+3.933
ngth+3.865
Neg logit contributions
hyde-3.010
\.-2.828
netflix-2.780
ergus-2.631
ooth-2.630
Token The
Token ID383
Feature activation+1.57e+00

Pos logit contributions
iga+0.533
 Drac+0.522
neys+0.474
 sqor+0.465
ngth+0.462
Neg logit contributions
hyde-0.392
\.-0.379
netflix-0.360
ergus-0.350
ooth-0.349
Token performance
Token ID2854
Feature activation+3.28e+00

Pos logit contributions
hyde+1.105
\.+1.092
netflix+1.025
ergus+1.008
holding+1.004
Neg logit contributions
iga-1.330
 Drac-1.293
neys-1.183
ngth-1.166
 sqor-1.165
Token earned
Token ID7366
Feature activation-1.06e+00

Pos logit contributions
\.+2.253
hyde+2.249
netflix+2.077
holding+2.062
ergus+2.044
Neg logit contributions
iga-2.806
 Drac-2.747
neys-2.528
ngth-2.508
 sqor-2.500
Token into
Token ID656
Feature activation-2.60e+00

Pos logit contributions
hyde+1.174
\.+1.148
netflix+1.087
ergus+1.060
holding+1.060
Neg logit contributions
iga-1.379
 Drac-1.345
neys-1.224
 sqor-1.206
ngth-1.198
Token law
Token ID1099
Feature activation+4.89e+00

Pos logit contributions
iga+1.369
 Drac+1.305
neys+1.090
 sqor+1.036
ngth+1.028
Neg logit contributions
hyde-2.699
\.-2.631
netflix-2.585
ooth-2.520
holding-2.501
Token limiting
Token ID15637
Feature activation+4.53e+00

Pos logit contributions
hyde+3.247
\.+3.234
netflix+3.015
holding+2.994
ergus+2.936
Neg logit contributions
iga-4.306
 Drac-4.169
neys-3.837
 sqor-3.807
ngth-3.789
Token the
Token ID262
Feature activation+4.65e+00

Pos logit contributions
hyde+2.978
\.+2.948
holding+2.748
netflix+2.740
ergus+2.695
Neg logit contributions
iga-3.999
 Drac-3.922
neys-3.604
ngth-3.590
 sqor-3.589
Token use
Token ID779
Feature activation+5.80e+00

Pos logit contributions
hyde+3.075
\.+3.041
holding+2.836
netflix+2.835
ergus+2.770
Neg logit contributions
iga-4.087
 Drac-4.021
neys-3.690
ngth-3.647
 sqor-3.642
Token or
Token ID393
Feature activation+1.41e+01

Pos logit contributions
hyde+4.033
\.+4.016
netflix+3.751
holding+3.715
ergus+3.637
Neg logit contributions
iga-4.843
 Drac-4.791
neys-4.322
 sqor-4.295
ngth-4.258
Token purchase
Token ID5001
Feature activation+4.62e+00

Pos logit contributions
hyde+7.537
\.+7.505
holding+7.234
ergus+6.609
ooth+6.542
Neg logit contributions
 Drac-12.810
iga-12.381
neys-11.797
ngth-11.741
 sqor-11.687
Token of
Token ID286
Feature activation+3.98e+00

Pos logit contributions
hyde+3.355
\.+3.297
netflix+3.103
holding+3.077
ergus+3.022
Neg logit contributions
iga-3.809
 Drac-3.747
neys-3.374
 sqor-3.360
ngth-3.322
Token guns
Token ID6541
Feature activation-1.37e+00

Pos logit contributions
hyde+2.673
\.+2.669
netflix+2.469
holding+2.459
ooth+2.430
Neg logit contributions
iga-3.447
 Drac-3.386
neys-3.098
ngth-3.084
 sqor-3.080
Token the
Token ID262
Feature activation+1.24e+00

Pos logit contributions
iga+1.134
 Drac+1.100
neys+1.002
 sqor+0.976
ngth+0.966
Neg logit contributions
hyde-1.021
\.-0.981
netflix-0.940
ergus-0.915
ooth-0.910
Token NRA
Token ID23962
Feature activation+4.97e+00

Pos logit contributions
hyde+0.916
\.+0.898
netflix+0.851
ergus+0.831
holding+0.831
Neg logit contributions
iga-1.016
 Drac-0.990
neys-0.898
 sqor-0.884
ngth-0.880
Token x
Token ID2124
Feature activation-4.09e-01

Pos logit contributions
iga+1.045
 Drac+1.017
neys+0.916
 sqor+0.901
ngth+0.894
Neg logit contributions
hyde-1.081
\.-1.059
netflix-1.009
ergus-0.987
holding-0.985
Token.
Token ID13
Feature activation+3.61e+00

Pos logit contributions
iga+0.352
 Drac+0.345
neys+0.314
 sqor+0.311
ngth+0.310
Neg logit contributions
hyde-0.284
\.-0.280
netflix-0.263
holding-0.257
ergus-0.256
Tokenx
Token ID87
Feature activation+1.36e+00

Pos logit contributions
\.+1.974
hyde+1.902
holding+1.759
netflix+1.748
ooth+1.707
Neg logit contributions
 Drac-3.642
iga-3.586
ngth-3.397
 sqor-3.384
neys-3.306
Token.
Token ID13
Feature activation+5.99e+00

Pos logit contributions
\.+0.906
hyde+0.890
netflix+0.828
holding+0.819
ooth+0.805
Neg logit contributions
iga-1.202
 Drac-1.190
ngth-1.096
 sqor-1.087
neys-1.085
Tokenx
Token ID87
Feature activation+8.23e+00

Pos logit contributions
\.+3.084
hyde+2.870
holding+2.699
netflix+2.626
ooth+2.603
Neg logit contributions
 Drac-6.236
iga-5.983
 sqor-5.872
ngth-5.863
neys-5.617
Token.
Token ID13
Feature activation+1.41e+01

Pos logit contributions
\.+4.688
hyde+4.117
holding+4.006
netflix+3.889
ooth+3.880
Neg logit contributions
 Drac-7.934
ngth-7.704
iga-7.626
 sqor-7.479
neys-7.168
Token141
Token ID23756
Feature activation+1.57e+00

Pos logit contributions
\.+5.817
ooth+5.340
hyde+5.239
holding+5.163
mouth+5.071
Neg logit contributions
 Drac-14.221
iga-13.515
ngth-13.467
 sqor-13.435
 Vaugh-13.072
Token 112
Token ID13539
Feature activation+1.82e+00

Pos logit contributions
hyde+1.071
\.+1.056
netflix+0.990
holding+0.968
ergus+0.962
Neg logit contributions
iga-1.373
 Drac-1.335
neys-1.226
ngth-1.212
 sqor-1.211
Token08
Token ID2919
Feature activation+3.60e+00

Pos logit contributions
hyde+1.319
\.+1.300
netflix+1.221
holding+1.200
ergus+1.187
Neg logit contributions
iga-1.516
 Drac-1.482
neys-1.347
 sqor-1.336
ngth-1.334
Token80
Token ID1795
Feature activation-4.50e-01

Pos logit contributions
\.+2.486
hyde+2.446
netflix+2.275
holding+2.261
ooth+2.217
Neg logit contributions
iga-3.074
 Drac-3.029
ngth-2.812
 sqor-2.781
neys-2.756
Token x
Token ID2124
Feature activation-8.84e-02

Pos logit contributions
iga+0.353
 Drac+0.343
neys+0.311
ngth+0.307
 sqor+0.306
Neg logit contributions
hyde-0.348
\.-0.344
netflix-0.325
holding-0.319
ergus-0.318
Token x
Token ID2124
Feature activation-4.69e-01

Pos logit contributions
iga+1.106
 Drac+1.076
neys+0.969
 sqor+0.952
ngth+0.944
Neg logit contributions
hyde-1.149
\.-1.125
netflix-1.072
ergus-1.049
holding-1.047
Token.
Token ID13
Feature activation+3.11e+00

Pos logit contributions
iga+0.404
 Drac+0.395
neys+0.360
 sqor+0.356
ngth+0.355
Neg logit contributions
hyde-0.327
\.-0.322
netflix-0.303
holding-0.296
ergus-0.295
Tokenx
Token ID87
Feature activation+1.02e+00

Pos logit contributions
\.+1.717
hyde+1.667
netflix+1.536
holding+1.534
ooth+1.497
Neg logit contributions
 Drac-3.117
iga-3.075
ngth-2.894
 sqor-2.882
neys-2.838
Token.
Token ID13
Feature activation+4.44e+00

Pos logit contributions
\.+0.686
hyde+0.679
netflix+0.631
holding+0.622
ooth+0.614
Neg logit contributions
iga-0.900
 Drac-0.890
ngth-0.814
neys-0.810
 sqor-0.810
Tokenx
Token ID87
Feature activation+5.87e+00

Pos logit contributions
\.+2.366
hyde+2.255
holding+2.090
netflix+2.082
ooth+2.036
Neg logit contributions
 Drac-4.535
iga-4.438
 sqor-4.246
ngth-4.239
neys-4.097
Token.
Token ID13
Feature activation+1.40e+01

Pos logit contributions
\.+3.600
hyde+3.268
holding+3.119
netflix+3.067
ooth+3.014
Neg logit contributions
 Drac-5.509
iga-5.456
ngth-5.282
 sqor-5.177
neys-4.973
Token120
Token ID10232
Feature activation+5.20e-01

Pos logit contributions
\.+7.591
holding+7.018
hyde+6.918
ooth+6.480
netflix+6.411
Neg logit contributions
iga-12.934
 Drac-12.507
 sqor-12.055
ngth-11.986
neys-11.262
Token 10
Token ID838
Feature activation-3.20e+00

Pos logit contributions
hyde+0.367
\.+0.359
netflix+0.340
ergus+0.331
holding+0.331
Neg logit contributions
iga-0.445
 Drac-0.434
neys-0.396
 sqor-0.390
ngth-0.387
Token25
Token ID1495
Feature activation-1.94e+00

Pos logit contributions
iga+2.472
 Drac+2.380
neys+2.151
 sqor+2.073
ngth+2.023
Neg logit contributions
hyde-2.525
\.-2.433
netflix-2.358
ergus-2.324
ooth-2.301
Token290
Token ID24369
Feature activation-2.71e+00

Pos logit contributions
iga+1.508
 Drac+1.449
neys+1.301
 sqor+1.275
ngth+1.257
Neg logit contributions
hyde-1.526
\.-1.478
netflix-1.422
ergus-1.400
holding-1.386
Token x
Token ID2124
Feature activation+2.27e-01

Pos logit contributions
iga+2.025
 Drac+1.970
neys+1.759
 sqor+1.728
ngth+1.691
Neg logit contributions
hyde-2.225
\.-2.159
netflix-2.075
ergus-2.031
holding-2.020
Token verb
Token ID15942
Feature activation-3.10e+00

Pos logit contributions
iga+1.934
 Drac+1.903
neys+1.700
 sqor+1.673
ngth+1.623
Neg logit contributions
hyde-1.833
\.-1.791
netflix-1.737
ergus-1.673
ooth-1.669
Token,
Token ID11
Feature activation-3.54e+00

Pos logit contributions
iga+2.541
 Drac+2.445
neys+2.227
 sqor+2.198
ngth+2.162
Neg logit contributions
hyde-2.297
\.-2.234
netflix-2.155
ergus-2.093
ooth-2.073
Token the
Token ID262
Feature activation-3.37e+00

Pos logit contributions
iga+2.701
 Drac+2.664
neys+2.367
 sqor+2.323
ngth+2.283
Neg logit contributions
hyde-2.801
\.-2.715
netflix-2.644
ergus-2.555
ooth-2.547
Token story
Token ID1621
Feature activation-4.85e+00

Pos logit contributions
iga+2.715
 Drac+2.673
neys+2.405
 sqor+2.358
ngth+2.301
Neg logit contributions
hyde-2.522
\.-2.455
netflix-2.381
ooth-2.281
ergus-2.279
Token of
Token ID286
Feature activation-5.32e+00

Pos logit contributions
iga+3.584
 Drac+3.502
neys+3.119
 sqor+3.002
ngth+2.989
Neg logit contributions
hyde-3.917
\.-3.794
netflix-3.692
ooth-3.574
ergus-3.557
Token Am
Token ID1703
Feature activation-1.53e+01

Pos logit contributions
 Drac+4.228
iga+4.213
neys+3.707
 sqor+3.598
ngth+3.489
Neg logit contributions
hyde-3.992
\.-3.865
netflix-3.730
ooth-3.598
holding-3.565
Tokennon
Token ID13159
Feature activation-4.28e+00

Pos logit contributions
iga+12.112
 Drac+10.871
neys+10.217
 sqor+9.682
 WS+9.276
Neg logit contributions
hyde-10.088
netflix-9.944
\.-9.780
ertodd-9.287
ergus-9.191
Token and
Token ID290
Feature activation-4.67e+00

Pos logit contributions
iga+3.482
 Drac+3.371
neys+3.063
 sqor+2.999
ngth+2.923
Neg logit contributions
hyde-3.134
\.-3.047
netflix-2.990
ergus-2.842
holding-2.834
Token Tam
Token ID11552
Feature activation-1.40e+01

Pos logit contributions
iga+3.742
 Drac+3.696
neys+3.265
 sqor+3.219
ngth+3.108
Neg logit contributions
hyde-3.509
\.-3.381
netflix-3.275
holding-3.149
ooth-3.127
Tokenar
Token ID283
Feature activation-9.83e+00

Pos logit contributions
iga+11.399
 Drac+10.486
neys+9.986
ngth+9.027
 sqor+9.012
Neg logit contributions
hyde-8.915
\.-8.541
netflix-8.533
ertodd-8.136
ooth-8.095
Token,
Token ID11
Feature activation-2.85e+00

Pos logit contributions
iga+7.610
 Drac+7.173
neys+6.674
 sqor+6.377
ngth+6.157
Neg logit contributions
hyde-7.105
\.-6.854
netflix-6.825
ertodd-6.620
ergus-6.501
Token included
Token ID3017
Feature activation+3.28e+00

Pos logit contributions
iga+0.635
 Drac+0.618
neys+0.561
 sqor+0.550
ngth+0.544
Neg logit contributions
hyde-0.555
\.-0.540
netflix-0.513
ergus-0.500
holding-0.499
Token a
Token ID257
Feature activation-1.69e+00

Pos logit contributions
hyde+2.404
\.+2.370
netflix+2.232
holding+2.201
ergus+2.186
Neg logit contributions
iga-2.691
 Drac-2.617
neys-2.371
 sqor-2.355
ngth-2.341
Token parody
Token ID29695
Feature activation-3.17e+00

Pos logit contributions
iga+1.363
 Drac+1.332
neys+1.208
 sqor+1.185
ngth+1.169
Neg logit contributions
hyde-1.271
\.-1.234
netflix-1.176
ergus-1.151
ooth-1.146
Token of
Token ID286
Feature activation+2.12e+00

Pos logit contributions
iga+2.416
 Drac+2.366
neys+2.129
 sqor+2.062
ngth+2.061
Neg logit contributions
hyde-2.514
\.-2.442
netflix-2.319
ergus-2.294
holding-2.281
Token a
Token ID257
Feature activation-4.97e-01

Pos logit contributions
hyde+1.580
\.+1.550
netflix+1.470
ergus+1.436
holding+1.434
Neg logit contributions
iga-1.727
 Drac-1.682
neys-1.525
 sqor-1.508
ngth-1.499
Token Pit
Token ID16889
Feature activation-1.50e+01

Pos logit contributions
iga+0.394
 Drac+0.385
neys+0.348
 sqor+0.340
ngth+0.336
Neg logit contributions
hyde-0.382
\.-0.373
netflix-0.354
ergus-0.347
holding-0.346
Tokenbull
Token ID16308
Feature activation+2.73e+00

Pos logit contributions
iga+10.760
 Drac+10.403
neys+10.183
 sqor+8.735
 Patron+8.687
Neg logit contributions
hyde-11.040
netflix-10.590
\.-10.456
holding-9.779
ergus-9.731
Token and
Token ID290
Feature activation-2.15e+00

Pos logit contributions
hyde+2.043
\.+2.004
netflix+1.886
holding+1.857
ergus+1.855
Neg logit contributions
iga-2.211
 Drac-2.150
neys-1.955
ngth-1.935
 sqor-1.928
Token Ke
Token ID3873
Feature activation-5.70e+00

Pos logit contributions
iga+1.757
 Drac+1.735
neys+1.561
 sqor+1.512
ngth+1.500
Neg logit contributions
hyde-1.597
\.-1.546
netflix-1.481
ergus-1.443
ooth-1.434
Token$
Token ID3
Feature activation+2.20e+00

Pos logit contributions
iga+4.721
 Drac+4.436
neys+4.089
ngth+3.873
 sqor+3.719
Neg logit contributions
hyde-4.133
\.-4.039
netflix-3.876
holding-3.742
ergus-3.734
Tokenha
Token ID3099
Feature activation-2.93e+00

Pos logit contributions
hyde+1.815
\.+1.773
ergus+1.667
netflix+1.666
ooth+1.659
Neg logit contributions
iga-1.608
 Drac-1.588
neys-1.404
ngth-1.398
 sqor-1.395
Token scenes
Token ID8188
Feature activation-3.20e+00

Pos logit contributions
iga+2.323
 Drac+2.274
neys+2.034
 sqor+1.995
ngth+1.922
Neg logit contributions
hyde-2.387
\.-2.307
netflix-2.218
ooth-2.177
ergus-2.172
Token after
Token ID706
Feature activation-1.13e+00

Pos logit contributions
iga+2.565
 Drac+2.496
neys+2.252
 sqor+2.199
ngth+2.132
Neg logit contributions
hyde-2.431
\.-2.340
netflix-2.248
ergus-2.217
ooth-2.193
Token a
Token ID257
Feature activation-2.84e-01

Pos logit contributions
iga+0.917
 Drac+0.898
neys+0.811
 sqor+0.795
ngth+0.782
Neg logit contributions
hyde-0.853
\.-0.835
netflix-0.791
ergus-0.773
ooth-0.770
Token blaze
Token ID31259
Feature activation-3.13e+00

Pos logit contributions
iga+0.223
 Drac+0.218
neys+0.196
 sqor+0.193
ngth+0.191
Neg logit contributions
hyde-0.222
\.-0.217
netflix-0.206
ergus-0.202
holding-0.201
Token engulfed
Token ID40997
Feature activation-4.37e+00

Pos logit contributions
iga+2.391
 Drac+2.342
neys+2.098
 sqor+2.001
 WS+1.948
Neg logit contributions
hyde-2.497
\.-2.402
netflix-2.322
ergus-2.258
ooth-2.247
Token 1
Token ID352
Feature activation-1.49e+01

Pos logit contributions
iga+3.439
 Drac+3.421
neys+3.049
 sqor+2.952
ngth+2.870
Neg logit contributions
hyde-3.361
\.-3.229
netflix-3.090
ergus-3.012
ooth-3.000
Token,
Token ID11
Feature activation-6.66e+00

Pos logit contributions
iga+9.399
 Drac+8.966
 sqor+8.364
neys+7.877
 WS+7.260
Neg logit contributions
hyde-11.548
ergus-10.992
netflix-10.976
ertodd-10.863
\.-10.776
Token800
Token ID7410
Feature activation-8.88e+00

Pos logit contributions
iga+4.951
 Drac+4.705
neys+4.275
 sqor+4.161
ngth+3.963
Neg logit contributions
hyde-5.188
\.-4.974
ergus-4.895
netflix-4.847
holding-4.814
Token hectares
Token ID43025
Feature activation-3.85e+00

Pos logit contributions
iga+5.669
 Drac+5.469
neys+4.888
 sqor+4.680
ngth+4.374
Neg logit contributions
hyde-7.785
\.-7.450
netflix-7.311
ertodd-7.186
ergus-7.113
Token of
Token ID286
Feature activation-2.37e+00

Pos logit contributions
iga+2.929
 Drac+2.863
neys+2.535
 sqor+2.497
ngth+2.466
Neg logit contributions
hyde-3.094
\.-2.953
netflix-2.830
ergus-2.793
ooth-2.771
Token forest
Token ID8222
Feature activation+2.39e+00

Pos logit contributions
iga+1.960
 Drac+1.935
neys+1.724
 sqor+1.701
ngth+1.676
Neg logit contributions
hyde-1.750
\.-1.691
netflix-1.608
ergus-1.567
ooth-1.555
Token and
Token ID290
Feature activation+5.31e-01

Pos logit contributions
hyde+1.219
\.+1.203
netflix+1.129
holding+1.105
ergus+1.100
Neg logit contributions
iga-1.515
 Drac-1.471
neys-1.349
 sqor-1.334
ngth-1.329
Token to
Token ID284
Feature activation-4.63e-01

Pos logit contributions
hyde+0.383
\.+0.374
netflix+0.354
ergus+0.346
holding+0.345
Neg logit contributions
iga-0.447
 Drac-0.436
neys-0.396
 sqor-0.390
ngth-0.388
Token continue
Token ID2555
Feature activation+7.20e-01

Pos logit contributions
iga+0.393
 Drac+0.384
neys+0.350
 sqor+0.343
ngth+0.341
Neg logit contributions
hyde-0.330
\.-0.322
netflix-0.306
ergus-0.298
holding-0.297
Token paying
Token ID5989
Feature activation-4.59e+00

Pos logit contributions
hyde+0.538
\.+0.524
netflix+0.499
ergus+0.487
holding+0.485
Neg logit contributions
iga-0.588
 Drac-0.573
neys-0.519
 sqor-0.510
ngth-0.506
Token my
Token ID616
Feature activation-8.44e+00

Pos logit contributions
iga+3.560
 Drac+3.524
neys+3.111
 sqor+3.003
ngth+2.978
Neg logit contributions
hyde-3.522
\.-3.427
netflix-3.327
ergus-3.280
ooth-3.242
Token tit
Token ID5259
Feature activation-1.46e+01

Pos logit contributions
iga+6.425
 Drac+6.414
neys+5.591
 sqor+5.449
 WS+5.383
Neg logit contributions
hyde-6.517
\.-6.013
netflix-5.975
ergus-5.900
ooth-5.774
Tokenhes
Token ID956
Feature activation-2.92e+00

Pos logit contributions
iga+9.497
 Drac+9.183
neys+8.482
 sqor+7.761
ngth+7.390
Neg logit contributions
hyde-11.389
\.-11.066
netflix-11.029
ergus-10.792
holding-10.419
Token to
Token ID284
Feature activation+3.41e+00

Pos logit contributions
iga+2.349
 Drac+2.293
neys+2.070
 sqor+2.009
ngth+1.985
Neg logit contributions
hyde-2.214
\.-2.136
netflix-2.067
ergus-2.024
ooth-1.991
Token the
Token ID262
Feature activation+9.89e-01

Pos logit contributions
hyde+2.348
\.+2.321
netflix+2.174
holding+2.124
ergus+2.113
Neg logit contributions
iga-2.945
 Drac-2.849
neys-2.603
 sqor-2.599
ngth-2.598
Token church
Token ID4928
Feature activation-8.99e-01

Pos logit contributions
hyde+0.661
\.+0.647
netflix+0.609
holding+0.593
ergus+0.593
Neg logit contributions
iga-0.882
 Drac-0.859
neys-0.788
 sqor-0.777
ngth-0.774
Token and
Token ID290
Feature activation+1.02e+00

Pos logit contributions
iga+0.751
 Drac+0.735
neys+0.669
 sqor+0.653
ngth+0.646
Neg logit contributions
hyde-0.653
\.-0.637
netflix-0.608
ergus-0.594
ooth-0.585
Token Service
Token ID4809
Feature activation-3.97e+00

Pos logit contributions
iga+3.052
 Drac+3.013
neys+2.738
 sqor+2.639
ngth+2.599
Neg logit contributions
hyde-2.507
\.-2.433
netflix-2.307
ooth-2.226
holding-2.221
Token at
Token ID379
Feature activation-3.43e+00

Pos logit contributions
iga+3.074
 Drac+2.998
neys+2.679
 sqor+2.628
ngth+2.525
Neg logit contributions
hyde-3.098
\.-3.014
netflix-2.920
ooth-2.825
ergus-2.810
Token 1
Token ID352
Feature activation-4.97e+00

Pos logit contributions
iga+2.638
 Drac+2.607
neys+2.311
 sqor+2.251
 WS+2.198
Neg logit contributions
hyde-2.651
\.-2.611
netflix-2.529
holding-2.437
ergus-2.405
Token-
Token ID12
Feature activation-1.14e+01

Pos logit contributions
iga+3.611
 Drac+3.563
neys+3.154
 sqor+3.113
 WS+2.941
Neg logit contributions
hyde-3.954
\.-3.883
netflix-3.825
holding-3.663
ergus-3.660
Token800
Token ID7410
Feature activation-5.35e+00

Pos logit contributions
iga+7.703
 Drac+7.386
neys+6.988
 WS+6.589
 sqor+6.339
Neg logit contributions
\.-8.854
hyde-8.572
netflix-8.500
ergus-8.233
holding-8.182
Token-
Token ID12
Feature activation-1.40e+01

Pos logit contributions
iga+4.108
 Drac+3.962
neys+3.550
 sqor+3.495
ngth+3.349
Neg logit contributions
hyde-4.197
\.-4.088
netflix-3.939
ergus-3.793
ooth-3.751
Token8
Token ID23
Feature activation-8.67e-01

Pos logit contributions
iga+10.039
 Drac+9.071
neys+8.659
 WS+8.129
 sqor+8.049
Neg logit contributions
\.-9.927
hyde-9.874
netflix-9.478
ertodd-9.219
holding-9.084
Token76
Token ID4304
Feature activation-7.96e+00

Pos logit contributions
iga+0.688
 Drac+0.671
neys+0.606
 sqor+0.597
ngth+0.593
Neg logit contributions
hyde-0.665
\.-0.651
netflix-0.619
ergus-0.605
holding-0.604
Token-
Token ID12
Feature activation-7.55e+00

Pos logit contributions
iga+5.726
 Drac+5.352
neys+4.841
 WS+4.615
 sqor+4.589
Neg logit contributions
hyde-6.655
\.-6.173
netflix-6.119
ergus-6.109
ooth-5.962
Token45
Token ID2231
Feature activation-2.18e+00

Pos logit contributions
iga+5.505
 Drac+5.336
neys+4.762
 WS+4.570
 sqor+4.447
Neg logit contributions
hyde-6.054
\.-5.665
netflix-5.619
ergus-5.616
ertodd-5.601
Token00
Token ID405
Feature activation-6.29e+00

Pos logit contributions
iga+1.771
 Drac+1.715
neys+1.556
 sqor+1.509
ngth+1.482
Neg logit contributions
hyde-1.660
\.-1.591
netflix-1.532
ergus-1.503
ooth-1.485
Token story
Token ID1621
Feature activation-4.85e+00

Pos logit contributions
iga+2.715
 Drac+2.673
neys+2.405
 sqor+2.358
ngth+2.301
Neg logit contributions
hyde-2.522
\.-2.455
netflix-2.381
ooth-2.281
ergus-2.279
Token of
Token ID286
Feature activation-5.32e+00

Pos logit contributions
iga+3.584
 Drac+3.502
neys+3.119
 sqor+3.002
ngth+2.989
Neg logit contributions
hyde-3.917
\.-3.794
netflix-3.692
ooth-3.574
ergus-3.557
Token Am
Token ID1703
Feature activation-1.53e+01

Pos logit contributions
 Drac+4.228
iga+4.213
neys+3.707
 sqor+3.598
ngth+3.489
Neg logit contributions
hyde-3.992
\.-3.865
netflix-3.730
ooth-3.598
holding-3.565
Tokennon
Token ID13159
Feature activation-4.28e+00

Pos logit contributions
iga+12.112
 Drac+10.871
neys+10.217
 sqor+9.682
 WS+9.276
Neg logit contributions
hyde-10.088
netflix-9.944
\.-9.780
ertodd-9.287
ergus-9.191
Token and
Token ID290
Feature activation-4.67e+00

Pos logit contributions
iga+3.482
 Drac+3.371
neys+3.063
 sqor+2.999
ngth+2.923
Neg logit contributions
hyde-3.134
\.-3.047
netflix-2.990
ergus-2.842
holding-2.834
Token Tam
Token ID11552
Feature activation-1.40e+01

Pos logit contributions
iga+3.742
 Drac+3.696
neys+3.265
 sqor+3.219
ngth+3.108
Neg logit contributions
hyde-3.509
\.-3.381
netflix-3.275
holding-3.149
ooth-3.127
Tokenar
Token ID283
Feature activation-9.83e+00

Pos logit contributions
iga+11.399
 Drac+10.486
neys+9.986
ngth+9.027
 sqor+9.012
Neg logit contributions
hyde-8.915
\.-8.541
netflix-8.533
ertodd-8.136
ooth-8.095
Token,
Token ID11
Feature activation-2.85e+00

Pos logit contributions
iga+7.610
 Drac+7.173
neys+6.674
 sqor+6.377
ngth+6.157
Neg logit contributions
hyde-7.105
\.-6.854
netflix-6.825
ertodd-6.620
ergus-6.501
Token which
Token ID543
Feature activation-3.18e+00

Pos logit contributions
iga+2.309
 Drac+2.263
neys+2.049
 sqor+2.004
ngth+1.965
Neg logit contributions
hyde-2.124
\.-2.054
netflix-1.988
ergus-1.919
ooth-1.913
Token is
Token ID318
Feature activation-9.29e-02

Pos logit contributions
iga+2.538
 Drac+2.487
neys+2.218
 sqor+2.170
ngth+2.113
Neg logit contributions
hyde-2.440
\.-2.368
netflix-2.283
ergus-2.202
ooth-2.201
Token explicitly
Token ID11777
Feature activation+4.28e+00

Pos logit contributions
iga+0.076
 Drac+0.074
neys+0.067
 sqor+0.066
ngth+0.066
Neg logit contributions
hyde-0.069
\.-0.067
netflix-0.064
ergus-0.062
holding-0.062
Token to
Token ID284
Feature activation-1.83e+00

Pos logit contributions
hyde+1.497
\.+1.464
netflix+1.389
ergus+1.357
holding+1.355
Neg logit contributions
iga-1.647
 Drac-1.606
neys-1.455
 sqor-1.432
ngth-1.423
Token un
Token ID555
Feature activation+1.26e+00

Pos logit contributions
iga+1.599
 Drac+1.551
neys+1.419
 sqor+1.391
ngth+1.380
Neg logit contributions
hyde-1.274
\.-1.240
netflix-1.178
ergus-1.152
holding-1.144
Tokenearth
Token ID16442
Feature activation+2.64e+00

Pos logit contributions
hyde+0.861
\.+0.841
netflix+0.797
holding+0.775
ergus+0.774
Neg logit contributions
iga-1.100
 Drac-1.076
neys-0.980
 sqor-0.972
ngth-0.963
Token a
Token ID257
Feature activation-3.18e+00

Pos logit contributions
hyde+1.836
\.+1.819
netflix+1.697
holding+1.677
ooth+1.669
Neg logit contributions
iga-2.245
 Drac-2.187
neys-2.008
ngth-1.989
 sqor-1.987
Token few
Token ID1178
Feature activation-1.59e+00

Pos logit contributions
iga+2.571
 Drac+2.504
neys+2.233
 sqor+2.218
ngth+2.153
Neg logit contributions
hyde-2.427
\.-2.340
netflix-2.245
ergus-2.194
holding-2.150
Token barg
Token ID12828
Feature activation-1.40e+01

Pos logit contributions
iga+1.298
 Drac+1.265
neys+1.144
 sqor+1.124
ngth+1.114
Neg logit contributions
hyde-1.187
\.-1.153
netflix-1.103
ergus-1.076
holding-1.067
Tokenains
Token ID1299
Feature activation-3.83e-01

Pos logit contributions
iga+7.173
 Drac+6.083
neys+5.671
ngth+4.963
 sqor+4.938
Neg logit contributions
hyde-13.903
\.-13.102
ergus-12.562
netflix-12.509
holding-12.468
Token along
Token ID1863
Feature activation-3.15e+00

Pos logit contributions
iga+0.321
 Drac+0.313
neys+0.284
 sqor+0.280
ngth+0.277
Neg logit contributions
hyde-0.278
\.-0.271
netflix-0.257
ergus-0.251
holding-0.250
Token the
Token ID262
Feature activation-3.49e+00

Pos logit contributions
iga+2.482
 Drac+2.423
neys+2.156
 sqor+2.127
ngth+2.098
Neg logit contributions
hyde-2.469
\.-2.381
netflix-2.281
ergus-2.222
ooth-2.205
Token way
Token ID835
Feature activation+1.06e+00

Pos logit contributions
iga+2.550
 Drac+2.532
neys+2.241
 sqor+2.190
ngth+2.117
Neg logit contributions
hyde-2.897
\.-2.792
netflix-2.676
ergus-2.622
ooth-2.591
Token.
Token ID13
Feature activation-6.20e-01

Pos logit contributions
hyde+0.745
\.+0.730
netflix+0.689
holding+0.672
ergus+0.672
Neg logit contributions
iga-0.905
 Drac-0.884
neys-0.805
 sqor-0.793
ngth-0.789
Token for
Token ID329
Feature activation-2.29e+00

Pos logit contributions
iga+1.293
 Drac+1.269
neys+1.126
 sqor+1.121
ngth+1.109
Neg logit contributions
hyde-1.306
\.-1.269
netflix-1.213
ergus-1.179
ooth-1.177
Token the
Token ID262
Feature activation-4.31e+00

Pos logit contributions
iga+1.840
 Drac+1.818
neys+1.633
 sqor+1.602
ngth+1.575
Neg logit contributions
hyde-1.739
\.-1.684
netflix-1.596
ergus-1.564
ooth-1.548
Token United
Token ID1578
Feature activation-7.63e+00

Pos logit contributions
iga+3.447
 Drac+3.412
neys+3.048
 sqor+3.014
 WS+2.909
Neg logit contributions
hyde-3.261
\.-3.124
netflix-2.975
ergus-2.929
ooth-2.877
Token Nations
Token ID7973
Feature activation-2.68e+00

Pos logit contributions
iga+4.255
 Drac+4.178
neys+3.507
 sqor+3.449
ngth+3.333
Neg logit contributions
hyde-7.438
\.-7.243
netflix-6.983
ergus-6.887
ooth-6.769
Token Office
Token ID4452
Feature activation-5.83e+00

Pos logit contributions
iga+2.540
 Drac+2.485
neys+2.268
 sqor+2.251
ngth+2.238
Neg logit contributions
hyde-1.647
\.-1.567
netflix-1.500
ergus-1.458
ooth-1.431
Token for
Token ID329
Feature activation-1.38e+01

Pos logit contributions
iga+4.185
 Drac+4.026
 sqor+3.606
ngth+3.603
neys+3.603
Neg logit contributions
hyde-4.843
\.-4.608
netflix-4.454
ooth-4.358
ergus-4.342
Token the
Token ID262
Feature activation-9.77e+00

Pos logit contributions
iga+8.477
 Drac+8.209
neys+7.386
 WS+6.821
 Patron+6.671
Neg logit contributions
hyde-11.830
netflix-11.525
\.-11.433
ertodd-11.310
ooth-11.142
Token Coord
Token ID22819
Feature activation-4.35e+00

Pos logit contributions
iga+6.054
 Drac+5.884
neys+5.043
 WS+4.810
ngth+4.651
Neg logit contributions
hyde-8.884
\.-8.615
netflix-8.294
ergus-8.251
ertodd-8.243
Tokenination
Token ID1883
Feature activation-3.40e+00

Pos logit contributions
iga+3.121
 Drac+2.986
neys+2.671
 sqor+2.583
ngth+2.578
Neg logit contributions
hyde-3.677
\.-3.600
netflix-3.472
ooth-3.375
ergus-3.366
Token of
Token ID286
Feature activation-8.57e+00

Pos logit contributions
iga+2.594
 Drac+2.497
neys+2.263
 sqor+2.223
ngth+2.191
Neg logit contributions
hyde-2.726
\.-2.633
netflix-2.543
ergus-2.474
ooth-2.459
Token Human
Token ID5524
Feature activation-3.86e+00

Pos logit contributions
iga+3.149
 Drac+3.014
neys+2.363
ngth+2.361
 WS+2.142
Neg logit contributions
hyde-9.792
netflix-9.769
\.-9.699
ergus-9.466
ooth-9.371
Token of
Token ID286
Feature activation+1.04e+00

Pos logit contributions
iga+1.492
 Drac+1.451
neys+1.304
 sqor+1.269
ngth+1.260
Neg logit contributions
hyde-1.472
\.-1.424
netflix-1.365
ergus-1.338
ooth-1.327
Token our
Token ID674
Feature activation-4.28e+00

Pos logit contributions
hyde+0.750
\.+0.730
netflix+0.694
ergus+0.675
holding+0.674
Neg logit contributions
iga-0.876
 Drac-0.857
neys-0.777
 sqor-0.762
ngth-0.755
Token old
Token ID1468
Feature activation-4.65e+00

Pos logit contributions
iga+3.749
 Drac+3.673
neys+3.364
 sqor+3.234
ngth+3.134
Neg logit contributions
hyde-2.976
\.-2.819
netflix-2.720
ergus-2.628
ooth-2.595
Token Pont
Token ID26345
Feature activation-2.84e+00

Pos logit contributions
iga+3.972
 Drac+3.869
neys+3.546
 sqor+3.365
ngth+3.322
Neg logit contributions
hyde-3.330
\.-3.169
netflix-3.011
ergus-2.949
ooth-2.910
Tokeniac
Token ID9607
Feature activation-4.12e+00

Pos logit contributions
iga+2.850
 Drac+2.790
neys+2.594
 sqor+2.495
ngth+2.477
Neg logit contributions
hyde-1.617
\.-1.542
netflix-1.438
holding-1.397
ooth-1.382
Token Bon
Token ID7979
Feature activation-1.37e+01

Pos logit contributions
iga+3.387
 Drac+3.290
neys+2.968
 sqor+2.855
ngth+2.801
Neg logit contributions
hyde-3.079
\.-2.975
netflix-2.916
ergus-2.773
holding-2.768
Tokenne
Token ID710
Feature activation-5.86e+00

Pos logit contributions
iga+9.236
 Drac+8.722
neys+8.492
 Patron+6.905
chuk+6.829
Neg logit contributions
hyde-11.333
\.-11.063
ertodd-10.506
ergus-10.376
netflix-10.354
Tokenville
Token ID4244
Feature activation-2.05e+00

Pos logit contributions
iga+4.962
 Drac+4.789
neys+4.401
 sqor+4.158
ngth+4.113
Neg logit contributions
hyde-4.138
\.-4.029
netflix-3.892
ooth-3.751
holding-3.748
Token.
Token ID13
Feature activation+1.36e+00

Pos logit contributions
iga+1.727
 Drac+1.677
neys+1.522
 sqor+1.479
ngth+1.461
Neg logit contributions
hyde-1.498
\.-1.449
netflix-1.403
ergus-1.348
holding-1.340
Token Sally
Token ID25737
Feature activation+4.64e+00

Pos logit contributions
hyde+0.935
\.+0.919
netflix+0.864
holding+0.844
ergus+0.841
Neg logit contributions
iga-1.183
 Drac-1.152
neys-1.054
 sqor-1.047
ngth-1.040
Token Ye
Token ID11609
Feature activation-1.03e+00

Pos logit contributions
hyde+3.374
\.+3.306
netflix+3.083
ergus+3.054
ooth+3.036
Neg logit contributions
iga-3.792
 Drac-3.681
neys-3.378
 sqor-3.344
ngth-3.330
Token offers
Token ID4394
Feature activation-9.08e+00

Pos logit contributions
iga+3.719
 Drac+3.563
neys+3.191
 sqor+3.104
ngth+3.026
Neg logit contributions
hyde-4.837
\.-4.618
netflix-4.580
ergus-4.402
holding-4.333
Token for
Token ID329
Feature activation-4.65e+00

Pos logit contributions
iga+5.984
 Drac+5.745
neys+5.157
 sqor+5.033
 WS+4.903
Neg logit contributions
hyde-7.793
\.-7.326
netflix-7.245
ergus-7.118
ooth-6.933
Token The
Token ID383
Feature activation-1.12e+01

Pos logit contributions
iga+3.877
 Drac+3.749
neys+3.422
 sqor+3.338
ngth+3.288
Neg logit contributions
hyde-3.393
\.-3.275
netflix-3.160
ergus-3.062
ooth-3.026
Token New
Token ID968
Feature activation-4.73e-01

Pos logit contributions
iga+6.951
 Drac+6.824
neys+6.132
 WS+6.038
 Wiz+5.540
Neg logit contributions
hyde-10.070
netflix-9.554
\.-9.337
ergus-9.156
ooth-9.006
Token York
Token ID1971
Feature activation-1.26e-01

Pos logit contributions
iga+0.432
 Drac+0.423
neys+0.386
 sqor+0.383
ngth+0.381
Neg logit contributions
hyde-0.307
\.-0.300
netflix-0.281
ergus-0.275
holding-0.274
Token Times
Token ID3782
Feature activation-1.35e+01

Pos logit contributions
iga+0.109
 Drac+0.107
neys+0.097
ngth+0.097
 sqor+0.096
Neg logit contributions
hyde-0.087
\.-0.085
netflix-0.080
holding-0.079
ergus-0.078
Token's
Token ID338
Feature activation-1.28e+01

Pos logit contributions
iga+8.251
 Drac+7.639
neys+6.752
 WS+6.585
 Patron+6.266
Neg logit contributions
hyde-11.771
netflix-11.025
\.-10.793
ergus-10.693
holding-10.362
Token products
Token ID3186
Feature activation-9.13e+00

Pos logit contributions
iga+7.953
 Drac+7.416
neys+6.447
 WS+6.341
 Patron+6.275
Neg logit contributions
hyde-11.345
\.-10.723
netflix-10.676
ergus-10.256
ooth-10.019
Token and
Token ID290
Feature activation+3.03e+00

Pos logit contributions
iga+6.388
 Drac+6.009
neys+5.495
 WS+5.264
 sqor+5.242
Neg logit contributions
hyde-7.668
\.-7.029
ergus-6.987
netflix-6.848
ooth-6.802
Token services
Token ID2594
Feature activation-1.23e+01

Pos logit contributions
\.+1.239
hyde+1.153
holding+1.059
ergus+1.049
ooth+1.033
Neg logit contributions
iga-3.471
 Drac-3.421
ngth-3.238
neys-3.158
 sqor-3.129
Token.
Token ID13
Feature activation-8.46e+00

Pos logit contributions
iga+8.055
 Drac+7.487
 WS+6.506
neys+6.475
 sqor+6.366
Neg logit contributions
hyde-10.800
netflix-9.810
ergus-9.585
ooth-9.329
\.-9.317
Token United
Token ID1578
Feature activation-1.72e+00

Pos logit contributions
hyde+1.172
\.+1.150
netflix+1.084
holding+1.061
ooth+1.056
Neg logit contributions
iga-1.376
 Drac-1.338
neys-1.221
ngth-1.207
 sqor-1.205
Token States
Token ID1829
Feature activation-2.00e+00

Pos logit contributions
iga+1.218
 Drac+1.173
neys+1.041
 sqor+1.017
ngth+0.994
Neg logit contributions
hyde-1.471
\.-1.447
netflix-1.390
ergus-1.357
holding-1.343
Token.
Token ID13
Feature activation-1.34e-01

Pos logit contributions
iga+1.660
 Drac+1.603
neys+1.459
 sqor+1.430
ngth+1.410
Neg logit contributions
hyde-1.474
\.-1.424
netflix-1.374
ergus-1.347
holding-1.326
Token He
Token ID679
Feature activation-2.68e-01

Pos logit contributions
iga+0.117
 Drac+0.114
neys+0.104
 sqor+0.102
ngth+0.101
Neg logit contributions
hyde-0.092
\.-0.090
netflix-0.085
ergus-0.083
holding-0.082
Token met
Token ID1138
Feature activation-4.59e-01

Pos logit contributions
iga+0.222
 Drac+0.216
neys+0.196
 sqor+0.192
ngth+0.191
Neg logit contributions
hyde-0.198
\.-0.193
netflix-0.183
ergus-0.179
holding-0.178
Token his
Token ID465
Feature activation-1.35e+01

Pos logit contributions
iga+0.389
 Drac+0.380
neys+0.345
 sqor+0.338
ngth+0.336
Neg logit contributions
hyde-0.329
\.-0.321
netflix-0.303
ergus-0.296
holding-0.295
Token girlfriend
Token ID11077
Feature activation+9.68e-01

Pos logit contributions
iga+9.476
 Drac+9.128
neys+8.070
 WS+7.723
 Patron+7.717
Neg logit contributions
hyde-10.558
\.-9.873
netflix-9.588
ergus-9.367
ertodd-9.173
Token here
Token ID994
Feature activation+1.55e+00

Pos logit contributions
hyde+0.682
\.+0.669
netflix+0.631
holding+0.616
ergus+0.616
Neg logit contributions
iga-0.828
 Drac-0.807
neys-0.736
 sqor-0.726
ngth-0.723
Token,
Token ID11
Feature activation+1.74e+00

Pos logit contributions
hyde+1.088
\.+1.069
netflix+1.008
holding+0.984
ergus+0.982
Neg logit contributions
iga-1.322
 Drac-1.291
neys-1.176
 sqor-1.158
ngth-1.154
Token and
Token ID290
Feature activation-9.42e-02

Pos logit contributions
hyde+1.218
\.+1.198
netflix+1.130
holding+1.105
ergus+1.102
Neg logit contributions
iga-1.494
 Drac-1.454
neys-1.327
 sqor-1.312
ngth-1.308
Token talks
Token ID6130
Feature activation-3.12e+00

Pos logit contributions
iga+0.082
 Drac+0.080
neys+0.073
 sqor+0.072
ngth+0.071
Neg logit contributions
hyde-0.065
\.-0.063
netflix-0.060
ergus-0.059
holding-0.058
Tokenran
Token ID2596
Feature activation-2.43e+00

Pos logit contributions
hyde+0.658
\.+0.643
netflix+0.610
ergus+0.595
holding+0.594
Neg logit contributions
iga-0.757
 Drac-0.738
neys-0.671
 sqor-0.661
ngth-0.656
Tokenza
Token ID4496
Feature activation-1.32e+00

Pos logit contributions
iga+1.624
 Drac+1.551
neys+1.373
 sqor+1.345
ngth+1.290
Neg logit contributions
hyde-2.187
\.-2.146
netflix-2.077
ergus-2.022
holding-2.017
Token
Token ID447
Feature activation+4.81e-02

Pos logit contributions
iga+1.113
 Drac+1.079
neys+0.978
 sqor+0.961
ngth+0.951
Neg logit contributions
hyde-0.956
\.-0.933
netflix-0.893
ergus-0.869
holding-0.867
Token
Token ID247
Feature activation-5.99e+00

Pos logit contributions
hyde+0.028
\.+0.027
netflix+0.025
ergus+0.024
holding+0.024
Neg logit contributions
iga-0.047
 Drac-0.046
neys-0.043
 sqor-0.042
ngth-0.042
Tokens
Token ID82
Feature activation+7.39e-01

Pos logit contributions
iga+4.405
 Drac+4.130
neys+3.759
 sqor+3.639
ngth+3.578
Neg logit contributions
hyde-4.820
\.-4.763
netflix-4.578
ergus-4.473
ooth-4.420
Token che
Token ID1125
Feature activation-1.35e+01

Pos logit contributions
hyde+0.556
\.+0.544
netflix+0.517
ergus+0.505
holding+0.504
Neg logit contributions
iga-0.598
 Drac-0.583
neys-0.528
 sqor-0.519
ngth-0.516
Tokenc
Token ID66
Feature activation-6.26e+00

Pos logit contributions
iga+12.257
 Drac+11.364
neys+11.034
ngth+10.342
 sqor+10.045
Neg logit contributions
hyde-7.701
netflix-7.223
\.-7.123
ergus-6.925
ertodd-6.813
Tokenkered
Token ID28970
Feature activation+2.45e+00

Pos logit contributions
iga+3.507
 Drac+3.356
neys+2.947
 sqor+2.800
ngth+2.736
Neg logit contributions
hyde-6.178
\.-6.000
netflix-5.808
ergus-5.666
holding-5.627
Token history
Token ID2106
Feature activation-4.76e-01

Pos logit contributions
hyde+1.767
\.+1.733
netflix+1.640
ergus+1.601
holding+1.597
Neg logit contributions
iga-2.057
 Drac-2.005
neys-1.828
ngth-1.799
 sqor-1.799
Token at
Token ID379
Feature activation+2.12e+00

Pos logit contributions
iga+0.388
 Drac+0.378
neys+0.342
 sqor+0.336
ngth+0.333
Neg logit contributions
hyde-0.358
\.-0.347
netflix-0.332
ergus-0.324
ooth-0.322
Token the
Token ID262
Feature activation+7.59e-02

Pos logit contributions
hyde+1.536
\.+1.504
netflix+1.426
holding+1.390
ergus+1.389
Neg logit contributions
iga-1.767
 Drac-1.725
neys-1.566
 sqor-1.545
ngth-1.539
Token soccer
Token ID11783
Feature activation-9.74e-01

Pos logit contributions
hyde+0.655
\.+0.640
netflix+0.607
ergus+0.592
holding+0.591
Neg logit contributions
iga-0.755
 Drac-0.736
neys-0.669
 sqor-0.658
ngth-0.653
Token scholarship
Token ID18652
Feature activation+8.28e-01

Pos logit contributions
iga+0.699
 Drac+0.676
neys+0.602
 sqor+0.587
ngth+0.582
Neg logit contributions
hyde-0.828
\.-0.806
netflix-0.779
ergus-0.756
holding-0.748
Token,
Token ID11
Feature activation-9.50e-01

Pos logit contributions
hyde+0.583
\.+0.569
netflix+0.539
ergus+0.526
holding+0.524
Neg logit contributions
iga-0.712
 Drac-0.694
neys-0.632
 sqor-0.623
ngth-0.619
Token Brown
Token ID4373
Feature activation+3.55e+00

Pos logit contributions
iga+0.856
 Drac+0.835
neys+0.760
 sqor+0.749
ngth+0.746
Neg logit contributions
hyde-0.636
\.-0.612
netflix-0.582
ergus-0.565
holding-0.561
Token paid
Token ID3432
Feature activation-7.81e+00

Pos logit contributions
hyde+2.540
\.+2.519
netflix+2.326
ergus+2.297
ooth+2.294
Neg logit contributions
iga-2.935
 Drac-2.898
neys-2.612
 sqor-2.608
ngth-2.597
Token his
Token ID465
Feature activation-1.32e+01

Pos logit contributions
iga+5.860
 Drac+5.776
 sqor+4.933
neys+4.874
ngth+4.800
Neg logit contributions
hyde-6.141
\.-5.866
netflix-5.742
ergus-5.587
holding-5.527
Token own
Token ID898
Feature activation-5.90e+00

Pos logit contributions
iga+9.121
 Drac+8.820
 sqor+7.594
neys+7.586
 WS+7.275
Neg logit contributions
hyde-10.824
\.-9.785
netflix-9.635
ergus-9.399
ertodd-9.356
Token way
Token ID835
Feature activation+2.60e+00

Pos logit contributions
iga+4.214
 Drac+4.110
neys+3.576
 sqor+3.529
ngth+3.422
Neg logit contributions
hyde-4.997
\.-4.785
netflix-4.564
ergus-4.527
holding-4.421
Token on
Token ID319
Feature activation-3.89e+00

Pos logit contributions
hyde+1.813
\.+1.793
netflix+1.680
holding+1.647
ergus+1.640
Neg logit contributions
iga-2.221
 Drac-2.179
neys-1.985
 sqor-1.953
ngth-1.948
Token the
Token ID262
Feature activation-6.86e-01

Pos logit contributions
iga+3.104
 Drac+3.044
neys+2.704
 sqor+2.658
ngth+2.610
Neg logit contributions
hyde-3.010
\.-2.879
netflix-2.791
ergus-2.697
holding-2.677
Token Louisville
Token ID21525
Feature activation-2.61e+00

Pos logit contributions
iga+0.537
 Drac+0.525
neys+0.471
 sqor+0.460
ngth+0.454
Neg logit contributions
hyde-0.538
\.-0.522
netflix-0.500
ergus-0.489
holding-0.487
Token spirit
Token ID4437
Feature activation-1.69e+00

Pos logit contributions
iga+2.389
 Drac+2.338
neys+2.097
 sqor+2.062
ngth+2.017
Neg logit contributions
hyde-2.147
\.-2.048
netflix-1.971
ergus-1.921
ooth-1.883
Token.
Token ID13
Feature activation+2.81e+00

Pos logit contributions
iga+1.389
 Drac+1.348
neys+1.219
 sqor+1.206
ngth+1.189
Neg logit contributions
hyde-1.268
\.-1.220
netflix-1.171
ergus-1.141
ooth-1.133
Token
Token ID198
Feature activation-3.49e+00

Pos logit contributions
hyde+1.914
\.+1.896
netflix+1.770
holding+1.735
ergus+1.730
Neg logit contributions
iga-2.452
 Drac-2.382
neys-2.184
 sqor-2.172
ngth-2.160
Token
Token ID198
Feature activation-3.49e-01

Pos logit contributions
iga+2.788
 Drac+2.693
neys+2.444
 sqor+2.364
ngth+2.287
Neg logit contributions
hyde-2.695
\.-2.567
netflix-2.483
ergus-2.439
ooth-2.409
TokenContact
Token ID17829
Feature activation-8.10e+00

Pos logit contributions
iga+0.299
 Drac+0.289
neys+0.266
 sqor+0.259
ngth+0.257
Neg logit contributions
hyde-0.247
\.-0.238
netflix-0.227
ergus-0.221
ooth-0.221
Token us
Token ID514
Feature activation-1.31e+01

Pos logit contributions
iga+6.491
 Drac+6.364
neys+5.715
 sqor+5.588
 WS+5.504
Neg logit contributions
hyde-5.844
\.-5.625
netflix-5.471
ooth-5.268
holding-5.219
Token at
Token ID379
Feature activation-4.73e+00

Pos logit contributions
iga+8.215
 Drac+7.898
neys+6.815
 sqor+6.665
 WS+6.623
Neg logit contributions
hyde-10.786
\.-10.368
ertodd-9.899
netflix-9.842
holding-9.814
Token editors
Token ID15719
Feature activation-5.93e+00

Pos logit contributions
iga+3.694
 Drac+3.621
neys+3.206
 sqor+3.142
ngth+3.091
Neg logit contributions
hyde-3.627
\.-3.569
holding-3.321
netflix-3.306
ooth-3.297
Token@
Token ID31
Feature activation+4.67e+00

Pos logit contributions
iga+4.596
 Drac+4.339
neys+4.019
ngth+3.902
 sqor+3.839
Neg logit contributions
hyde-4.430
\.-4.367
netflix-4.115
holding-4.051
ergus-4.034
Tokentime
Token ID2435
Feature activation-6.46e-01

Pos logit contributions
hyde+3.219
\.+3.117
netflix+2.973
ergus+2.898
holding+2.878
Neg logit contributions
iga-4.027
 Drac-3.985
 sqor-3.627
neys-3.564
ngth-3.540
Token.
Token ID13
Feature activation+2.89e+00

Pos logit contributions
iga+0.545
 Drac+0.532
neys+0.483
 sqor+0.476
ngth+0.474
Neg logit contributions
hyde-0.465
\.-0.453
netflix-0.428
ergus-0.419
holding-0.419
TokenBar
Token ID10374
Feature activation+2.27e+00

Pos logit contributions
iga+0.136
 Drac+0.133
neys+0.121
 sqor+0.119
ngth+0.118
Neg logit contributions
hyde-0.110
\.-0.107
netflix-0.102
ergus-0.099
ooth-0.098
Tokenack
Token ID441
Feature activation-6.66e+00

Pos logit contributions
hyde+1.526
\.+1.496
netflix+1.403
ergus+1.371
holding+1.371
Neg logit contributions
iga-2.017
 Drac-1.974
neys-1.795
 sqor-1.781
ngth-1.772
Token Obama
Token ID2486
Feature activation-4.87e+00

Pos logit contributions
iga+5.394
 Drac+5.202
neys+4.790
 sqor+4.474
 WS+4.461
Neg logit contributions
hyde-4.887
\.-4.653
netflix-4.580
ertodd-4.380
ergus-4.331
Token is
Token ID318
Feature activation-6.83e+00

Pos logit contributions
iga+3.759
 Drac+3.698
neys+3.332
 sqor+3.206
ngth+3.133
Neg logit contributions
hyde-3.745
\.-3.592
netflix-3.518
ergus-3.414
holding-3.386
Token the
Token ID262
Feature activation-6.70e+00

Pos logit contributions
iga+5.224
 Drac+5.164
neys+4.652
 sqor+4.435
 WS+4.422
Neg logit contributions
hyde-5.183
\.-4.965
netflix-4.818
ertodd-4.678
ergus-4.677
Token worst
Token ID5290
Feature activation-1.31e+01

Pos logit contributions
 Drac+5.379
iga+5.299
neys+4.749
 sqor+4.689
 WS+4.609
Neg logit contributions
hyde-4.834
\.-4.677
netflix-4.570
ooth-4.413
ergus-4.412
Token ne
Token ID497
Feature activation-5.34e+00

Pos logit contributions
iga+9.090
 Drac+8.976
neys+7.933
 WS+7.857
 sqor+7.455
Neg logit contributions
hyde-9.837
ertodd-9.478
netflix-9.401
\.-9.187
ooth-9.154
Token­
Token ID3907
Feature activation+1.47e+00

Pos logit contributions
iga+4.215
 Drac+3.990
neys+3.626
ngth+3.523
 sqor+3.394
Neg logit contributions
hyde-4.090
\.-3.943
netflix-3.884
ooth-3.678
holding-3.677
Tokengo
Token ID2188
Feature activation-2.51e+00

Pos logit contributions
hyde+0.840
\.+0.817
netflix+0.761
ergus+0.739
holding+0.737
Neg logit contributions
iga-1.458
 Drac-1.431
neys-1.318
 sqor-1.307
ngth-1.299
Token­
Token ID3907
Feature activation+6.01e+00

Pos logit contributions
iga+2.067
 Drac+1.997
neys+1.816
 sqor+1.769
ngth+1.762
Neg logit contributions
hyde-1.850
\.-1.803
netflix-1.725
holding-1.684
ooth-1.676
Tokenti
Token ID20259
Feature activation-1.62e+00

Pos logit contributions
hyde+3.924
\.+3.885
netflix+3.568
ergus+3.528
holding+3.527
Neg logit contributions
 Drac-5.343
iga-5.234
 sqor-4.843
ngth-4.772
neys-4.686
Token,
Token ID11
Feature activation-2.75e+00

Pos logit contributions
hyde+0.657
\.+0.642
netflix+0.607
ergus+0.593
holding+0.592
Neg logit contributions
iga-0.785
 Drac-0.765
neys-0.697
 sqor-0.687
ngth-0.683
Token you
Token ID345
Feature activation+1.89e+00

Pos logit contributions
iga+2.264
 Drac+2.240
neys+2.007
 sqor+1.956
ngth+1.938
Neg logit contributions
hyde-2.030
\.-1.957
netflix-1.882
ergus-1.833
holding-1.812
Token can
Token ID460
Feature activation-4.93e+00

Pos logit contributions
hyde+1.247
\.+1.225
netflix+1.149
holding+1.123
ergus+1.118
Neg logit contributions
iga-1.695
 Drac-1.654
neys-1.518
 sqor-1.505
ngth-1.493
Token do
Token ID466
Feature activation-6.60e+00

Pos logit contributions
iga+3.832
 Drac+3.740
neys+3.341
 sqor+3.188
ngth+3.164
Neg logit contributions
hyde-3.867
\.-3.712
netflix-3.630
ergus-3.540
holding-3.487
Token A
Token ID317
Feature activation-7.07e+00

Pos logit contributions
iga+5.130
 Drac+5.125
neys+4.466
 sqor+4.344
ngth+4.315
Neg logit contributions
hyde-5.100
\.-4.846
netflix-4.787
ergus-4.528
ertodd-4.470
TokenOE
Token ID27799
Feature activation-1.31e+01

Pos logit contributions
iga+6.051
 Drac+5.789
neys+5.290
ngth+5.176
 sqor+5.103
Neg logit contributions
hyde-4.843
\.-4.760
netflix-4.590
holding-4.353
ertodd-4.347
Token damage
Token ID2465
Feature activation+1.60e+00

Pos logit contributions
 Drac+9.279
iga+8.988
neys+8.056
 WS+7.744
 sqor+7.664
Neg logit contributions
hyde-9.844
netflix-9.636
\.-9.361
ertodd-9.301
ergus-8.899
Token,
Token ID11
Feature activation-3.53e+00

Pos logit contributions
hyde+1.124
\.+1.104
netflix+1.040
ergus+1.016
holding+1.014
Neg logit contributions
iga-1.376
 Drac-1.341
neys-1.224
 sqor-1.205
ngth-1.202
Token you
Token ID345
Feature activation+1.44e+00

Pos logit contributions
iga+2.882
 Drac+2.850
neys+2.536
 sqor+2.491
ngth+2.468
Neg logit contributions
hyde-2.634
\.-2.534
netflix-2.447
ergus-2.381
holding-2.347
Token can
Token ID460
Feature activation-6.09e+00

Pos logit contributions
hyde+0.951
\.+0.930
netflix+0.875
holding+0.852
ergus+0.851
Neg logit contributions
iga-1.292
 Drac-1.263
neys-1.157
 sqor-1.142
ngth-1.134
Token buff
Token ID6940
Feature activation-4.58e+00

Pos logit contributions
iga+5.027
 Drac+4.939
neys+4.411
 sqor+4.240
ngth+4.209
Neg logit contributions
hyde-4.426
\.-4.212
netflix-4.144
ergus-4.043
holding-3.952
Token A
Token ID317
Feature activation-7.31e+00

Pos logit contributions
iga+3.833
 Drac+3.794
neys+3.358
 sqor+3.231
ngth+3.176
Neg logit contributions
hyde-3.846
\.-3.629
netflix-3.612
ooth-3.442
ergus-3.429
Token Novel
Token ID24467
Feature activation-4.50e+00

Pos logit contributions
iga+5.529
 Drac+5.445
neys+4.840
 sqor+4.636
ngth+4.607
Neg logit contributions
hyde-5.735
\.-5.460
netflix-5.346
ergus-5.078
ertodd-5.071
Token (
Token ID357
Feature activation-5.47e+00

Pos logit contributions
iga+3.653
 Drac+3.513
neys+3.177
 sqor+3.077
ngth+3.072
Neg logit contributions
hyde-3.354
\.-3.285
netflix-3.157
holding-3.032
ooth-3.025
TokenNew
Token ID3791
Feature activation-3.81e+00

Pos logit contributions
iga+4.546
 Drac+4.430
neys+4.027
 sqor+3.899
ngth+3.813
Neg logit contributions
hyde-3.916
\.-3.716
netflix-3.592
ergus-3.512
ertodd-3.463
Token York
Token ID1971
Feature activation-9.33e+00

Pos logit contributions
iga+3.154
 Drac+3.096
neys+2.860
ngth+2.684
 sqor+2.639
Neg logit contributions
hyde-2.759
\.-2.666
netflix-2.621
holding-2.473
ergus-2.465
Token:
Token ID25
Feature activation-1.29e+01

Pos logit contributions
iga+6.197
 Drac+5.940
neys+5.222
 sqor+5.070
 WS+5.024
Neg logit contributions
hyde-7.866
netflix-7.418
\.-7.408
ergus-7.174
holding-7.158
Token St
Token ID520
Feature activation+2.90e-01

Pos logit contributions
 Drac+9.539
iga+9.438
 WS+8.173
neys+8.005
 Wiz+7.969
Neg logit contributions
hyde-10.017
netflix-9.262
ergus-8.839
\.-8.682
ertodd-8.674
Token.
Token ID13
Feature activation-1.72e+00

Pos logit contributions
hyde+0.209
\.+0.203
netflix+0.194
holding+0.189
ergus+0.188
Neg logit contributions
iga-0.245
 Drac-0.238
neys-0.217
 sqor-0.213
ngth-0.211
Token Martin
Token ID5780
Feature activation-1.74e+00

Pos logit contributions
iga+1.355
 Drac+1.329
neys+1.198
 sqor+1.156
ngth+1.143
Neg logit contributions
hyde-1.329
\.-1.291
netflix-1.249
holding-1.216
ergus-1.206
Token
Token ID447
Feature activation-1.38e+00

Pos logit contributions
iga+1.427
 Drac+1.394
neys+1.254
 sqor+1.222
ngth+1.203
Neg logit contributions
hyde-1.300
\.-1.253
netflix-1.208
holding-1.175
ergus-1.173
Token
Token ID247
Feature activation-2.53e+00

Pos logit contributions
iga+1.328
 Drac+1.296
neys+1.190
ngth+1.173
 sqor+1.169
Neg logit contributions
hyde-0.825
\.-0.795
netflix-0.766
holding-0.728
ergus-0.725
Tokenf
Token ID69
Feature activation-3.72e+00

Pos logit contributions
iga+1.827
 Drac+1.775
neys+1.628
 sqor+1.590
ngth+1.589
Neg logit contributions
hyde-1.430
\.-1.395
netflix-1.318
ergus-1.285
ooth-1.278
Token,
Token ID11
Feature activation-1.70e+00

Pos logit contributions
iga+3.029
 Drac+2.914
neys+2.661
 sqor+2.559
ngth+2.502
Neg logit contributions
hyde-2.791
\.-2.700
netflix-2.574
ergus-2.506
ooth-2.503
Token which
Token ID543
Feature activation-1.25e+00

Pos logit contributions
iga+1.403
 Drac+1.372
neys+1.241
 sqor+1.213
ngth+1.201
Neg logit contributions
hyde-1.259
\.-1.219
netflix-1.154
ergus-1.135
ooth-1.131
Token features
Token ID3033
Feature activation-5.29e-01

Pos logit contributions
iga+1.004
 Drac+0.979
neys+0.884
 sqor+0.863
ngth+0.855
Neg logit contributions
hyde-0.957
\.-0.930
netflix-0.884
ergus-0.867
holding-0.865
Token Josh
Token ID8518
Feature activation-3.05e+00

Pos logit contributions
iga+0.438
 Drac+0.430
neys+0.389
 sqor+0.381
ngth+0.378
Neg logit contributions
hyde-0.389
\.-0.379
netflix-0.359
ergus-0.350
ooth-0.349
Token Em
Token ID2295
Feature activation-1.29e+01

Pos logit contributions
iga+2.515
 Drac+2.452
neys+2.190
 sqor+2.134
ngth+2.121
Neg logit contributions
hyde-2.244
\.-2.179
netflix-2.072
holding-2.032
ergus-2.014
Tokenett
Token ID3087
Feature activation-5.60e+00

Pos logit contributions
iga+10.398
neys+9.367
 Drac+9.312
chuk+8.548
 sqor+8.380
Neg logit contributions
hyde-8.669
\.-8.442
netflix-8.005
ertodd-7.685
holding-7.555
Token,
Token ID11
Feature activation+1.70e+00

Pos logit contributions
iga+4.451
 Drac+4.274
neys+3.933
 sqor+3.732
ngth+3.710
Neg logit contributions
hyde-4.153
\.-4.081
netflix-3.915
ooth-3.754
ergus-3.725
Token Mark
Token ID2940
Feature activation+4.11e-01

Pos logit contributions
hyde+1.235
\.+1.209
netflix+1.146
ergus+1.121
holding+1.118
Neg logit contributions
iga-1.418
 Drac-1.380
neys-1.257
 sqor-1.240
ngth-1.234
Token Wall
Token ID5007
Feature activation-1.22e+00

Pos logit contributions
hyde+0.303
\.+0.297
netflix+0.282
ergus+0.274
holding+0.274
Neg logit contributions
iga-0.340
 Drac-0.332
neys-0.300
 sqor-0.295
ngth-0.293
Tokenbank
Token ID17796
Feature activation+1.27e+00

Pos logit contributions
iga+1.016
 Drac+0.972
neys+0.899
ngth+0.866
 sqor+0.863
Neg logit contributions
hyde-0.903
\.-0.883
netflix-0.849
ergus-0.817
holding-0.809
Token Online
Token ID7467
Feature activation-2.24e+00

Pos logit contributions
hyde+0.950
\.+0.935
netflix+0.862
ooth+0.820
ergus+0.818
Neg logit contributions
iga-2.153
 Drac-2.103
neys-1.951
 sqor-1.950
ngth-1.950
Token news
Token ID1705
Feature activation-5.11e+00

Pos logit contributions
iga+1.959
 Drac+1.902
neys+1.750
 sqor+1.702
ngth+1.689
Neg logit contributions
hyde-1.554
\.-1.477
netflix-1.386
holding-1.374
ergus-1.372
Token team
Token ID1074
Feature activation-6.00e+00

Pos logit contributions
iga+3.233
 Drac+3.087
neys+2.701
 sqor+2.647
 WS+2.469
Neg logit contributions
hyde-4.776
\.-4.615
netflix-4.413
ergus-4.322
ooth-4.263
Token?
Token ID30
Feature activation-5.80e+00

Pos logit contributions
 Drac+4.234
iga+4.219
neys+3.709
 sqor+3.579
 WS+3.450
Neg logit contributions
hyde-5.040
\.-4.672
ergus-4.585
netflix-4.549
holding-4.523
Token Email
Token ID9570
Feature activation-8.54e+00

Pos logit contributions
iga+4.212
 Drac+4.073
 sqor+3.591
neys+3.569
 WS+3.491
Neg logit contributions
hyde-4.727
\.-4.538
netflix-4.355
ooth-4.324
holding-4.308
Token us
Token ID514
Feature activation-1.29e+01

Pos logit contributions
iga+4.318
 Drac+4.261
neys+3.651
 sqor+3.500
 WS+3.375
Neg logit contributions
hyde-8.512
\.-8.052
holding-7.906
ooth-7.869
netflix-7.862
Token at
Token ID379
Feature activation-4.46e+00

Pos logit contributions
 Drac+7.096
iga+7.082
neys+5.868
 WS+5.668
 Wiz+5.185
Neg logit contributions
hyde-12.162
ertodd-11.133
\.-11.128
holding-11.059
ooth-11.020
Token tips
Token ID9040
Feature activation-1.14e+01

Pos logit contributions
iga+1.557
 Drac+1.432
neys+1.132
 sqor+1.023
 WS+1.001
Neg logit contributions
hyde-5.330
\.-5.293
ooth-5.056
ertodd-5.036
holding-5.011
Token@
Token ID31
Feature activation+3.28e+00

Pos logit contributions
iga+7.605
 Drac+6.859
neys+6.343
ngth+6.007
 sqor+5.979
Neg logit contributions
\.-9.004
hyde-8.768
holding-8.615
ertodd-8.578
 Canaver-8.485
Tokenthe
Token ID1169
Feature activation+8.44e-01

Pos logit contributions
hyde+2.225
\.+2.156
netflix+2.093
holding+2.015
ergus+1.983
Neg logit contributions
iga-2.899
 Drac-2.879
neys-2.592
ngth-2.558
 sqor-2.557
Token-
Token ID12
Feature activation+4.61e+00

Pos logit contributions
hyde+0.589
\.+0.578
netflix+0.550
holding+0.539
ooth+0.530
Neg logit contributions
iga-0.725
 Drac-0.723
neys-0.647
 sqor-0.645
ngth-0.643
Token for
Token ID329
Feature activation-4.65e+00

Pos logit contributions
iga+5.984
 Drac+5.745
neys+5.157
 sqor+5.033
 WS+4.903
Neg logit contributions
hyde-7.793
\.-7.326
netflix-7.245
ergus-7.118
ooth-6.933
Token The
Token ID383
Feature activation-1.12e+01

Pos logit contributions
iga+3.877
 Drac+3.749
neys+3.422
 sqor+3.338
ngth+3.288
Neg logit contributions
hyde-3.393
\.-3.275
netflix-3.160
ergus-3.062
ooth-3.026
Token New
Token ID968
Feature activation-4.73e-01

Pos logit contributions
iga+6.951
 Drac+6.824
neys+6.132
 WS+6.038
 Wiz+5.540
Neg logit contributions
hyde-10.070
netflix-9.554
\.-9.337
ergus-9.156
ooth-9.006
Token York
Token ID1971
Feature activation-1.26e-01

Pos logit contributions
iga+0.432
 Drac+0.423
neys+0.386
 sqor+0.383
ngth+0.381
Neg logit contributions
hyde-0.307
\.-0.300
netflix-0.281
ergus-0.275
holding-0.274
Token Times
Token ID3782
Feature activation-1.35e+01

Pos logit contributions
iga+0.109
 Drac+0.107
neys+0.097
ngth+0.097
 sqor+0.096
Neg logit contributions
hyde-0.087
\.-0.085
netflix-0.080
holding-0.079
ergus-0.078
Token's
Token ID338
Feature activation-1.28e+01

Pos logit contributions
iga+8.251
 Drac+7.639
neys+6.752
 WS+6.585
 Patron+6.266
Neg logit contributions
hyde-11.771
netflix-11.025
\.-10.793
ergus-10.693
holding-10.362
Token products
Token ID3186
Feature activation-9.13e+00

Pos logit contributions
iga+7.953
 Drac+7.416
neys+6.447
 WS+6.341
 Patron+6.275
Neg logit contributions
hyde-11.345
\.-10.723
netflix-10.676
ergus-10.256
ooth-10.019
Token and
Token ID290
Feature activation+3.03e+00

Pos logit contributions
iga+6.388
 Drac+6.009
neys+5.495
 WS+5.264
 sqor+5.242
Neg logit contributions
hyde-7.668
\.-7.029
ergus-6.987
netflix-6.848
ooth-6.802
Token services
Token ID2594
Feature activation-1.23e+01

Pos logit contributions
\.+1.239
hyde+1.153
holding+1.059
ergus+1.049
ooth+1.033
Neg logit contributions
iga-3.471
 Drac-3.421
ngth-3.238
neys-3.158
 sqor-3.129
Token.
Token ID13
Feature activation-8.46e+00

Pos logit contributions
iga+8.055
 Drac+7.487
 WS+6.506
neys+6.475
 sqor+6.366
Neg logit contributions
hyde-10.800
netflix-9.810
ergus-9.585
ooth-9.329
\.-9.317
Token Thank
Token ID6952
Feature activation-3.17e+00

Pos logit contributions
iga+5.104
 Drac+4.951
neys+4.181
 sqor+4.174
 WS+4.060
Neg logit contributions
hyde-8.007
\.-7.597
netflix-7.440
ergus-7.322
holding-7.102
Token hand
Token ID1021
Feature activation+1.54e+00

Pos logit contributions
iga+1.758
 Drac+1.726
neys+1.546
 sqor+1.529
ngth+1.507
Neg logit contributions
hyde-1.508
\.-1.465
netflix-1.389
ergus-1.360
holding-1.352
Token.
Token ID13
Feature activation+2.20e+00

Pos logit contributions
hyde+1.127
\.+1.103
netflix+1.044
ergus+1.020
holding+1.018
Neg logit contributions
iga-1.285
 Drac-1.254
neys-1.139
 sqor-1.122
ngth-1.115
Token So
Token ID1406
Feature activation+1.08e+00

Pos logit contributions
hyde+1.426
\.+1.417
netflix+1.319
holding+1.295
ergus+1.274
Neg logit contributions
iga-1.985
 Drac-1.920
neys-1.781
 sqor-1.778
ngth-1.775
Token what
Token ID644
Feature activation+6.70e+00

Pos logit contributions
hyde+0.740
\.+0.725
netflix+0.683
holding+0.668
ergus+0.665
Neg logit contributions
iga-0.947
 Drac-0.921
neys-0.845
 sqor-0.832
ngth-0.829
Token can
Token ID460
Feature activation+4.28e+00

Pos logit contributions
hyde+4.064
\.+4.061
netflix+3.770
holding+3.746
ooth+3.713
Neg logit contributions
iga-5.996
 Drac-5.804
neys-5.455
 sqor-5.373
ngth-5.367
Token Dick
Token ID11740
Feature activation+1.08e+01

Pos logit contributions
hyde+2.682
\.+2.641
netflix+2.457
holding+2.411
ooth+2.405
Neg logit contributions
iga-3.910
 Drac-3.785
neys-3.530
 sqor-3.502
ngth-3.482
Token do
Token ID466
Feature activation+1.16e+00

Pos logit contributions
hyde+5.621
\.+5.356
netflix+4.774
ooth+4.763
ergus+4.645
Neg logit contributions
iga-10.573
 Drac-10.184
neys-9.705
ngth-9.668
 sqor-9.658
Token to
Token ID284
Feature activation+5.16e+00

Pos logit contributions
hyde+0.853
\.+0.839
netflix+0.793
holding+0.777
ergus+0.775
Neg logit contributions
iga-0.946
 Drac-0.924
neys-0.839
 sqor-0.825
ngth-0.823
Token get
Token ID651
Feature activation+1.11e+00

Pos logit contributions
hyde+3.483
\.+3.440
netflix+3.143
ooth+3.116
holding+3.104
Neg logit contributions
iga-4.402
 Drac-4.305
neys-3.991
ngth-3.980
 sqor-3.970
Token around
Token ID1088
Feature activation+1.16e-01

Pos logit contributions
hyde+0.807
\.+0.792
netflix+0.748
holding+0.734
ergus+0.730
Neg logit contributions
iga-0.929
 Drac-0.903
neys-0.828
 sqor-0.815
ngth-0.810
Token that
Token ID326
Feature activation+5.72e-01

Pos logit contributions
hyde+0.086
\.+0.084
netflix+0.079
ergus+0.078
holding+0.077
Neg logit contributions
iga-0.095
 Drac-0.093
neys-0.084
 sqor-0.083
ngth-0.082
Token that
Token ID326
Feature activation+4.14e+00

Pos logit contributions
hyde+0.339
\.+0.331
netflix+0.314
ergus+0.305
holding+0.303
Neg logit contributions
iga-0.380
 Drac-0.373
neys-0.336
 sqor-0.329
ngth-0.329
Token 9
Token ID860
Feature activation-4.96e-01

Pos logit contributions
hyde+2.757
\.+2.716
netflix+2.542
holding+2.508
ergus+2.496
Neg logit contributions
iga-3.633
 Drac-3.509
neys-3.237
 sqor-3.235
ngth-3.219
Token/
Token ID14
Feature activation-3.27e+00

Pos logit contributions
iga+0.364
 Drac+0.352
neys+0.315
 sqor+0.309
ngth+0.303
Neg logit contributions
hyde-0.412
\.-0.402
netflix-0.391
holding-0.378
ergus-0.378
Token11
Token ID1157
Feature activation+1.34e+00

Pos logit contributions
iga+2.670
 Drac+2.555
neys+2.341
 sqor+2.264
ngth+2.226
Neg logit contributions
hyde-2.478
\.-2.388
netflix-2.291
ergus-2.238
holding-2.213
Token was
Token ID373
Feature activation+2.97e+00

Pos logit contributions
hyde+0.955
\.+0.935
netflix+0.881
ergus+0.862
holding+0.860
Neg logit contributions
iga-1.139
 Drac-1.112
neys-1.011
 sqor-0.999
ngth-0.992
Token an
Token ID281
Feature activation+1.10e+01

Pos logit contributions
hyde+2.067
\.+2.029
netflix+1.905
ergus+1.867
holding+1.865
Neg logit contributions
iga-2.542
 Drac-2.470
 sqor-2.269
neys-2.263
ngth-2.254
Token inside
Token ID2641
Feature activation+5.55e+00

Pos logit contributions
hyde+6.195
\.+6.107
ooth+5.784
ergus+5.619
holding+5.543
Neg logit contributions
iga-9.780
 sqor-9.354
 Drac-9.340
ngth-9.331
neys-8.888
Token job
Token ID1693
Feature activation+3.60e+00

Pos logit contributions
hyde+3.081
\.+3.074
holding+2.771
ooth+2.746
ergus+2.739
Neg logit contributions
iga-5.479
 Drac-5.299
neys-4.932
 sqor-4.919
ngth-4.884
Token,
Token ID11
Feature activation+3.09e+00

Pos logit contributions
hyde+2.488
\.+2.435
netflix+2.263
holding+2.242
ergus+2.227
Neg logit contributions
iga-3.112
 Drac-3.034
neys-2.759
 sqor-2.751
ngth-2.727
Token that
Token ID326
Feature activation+8.35e-01

Pos logit contributions
hyde+2.107
\.+2.071
netflix+1.954
holding+1.914
ergus+1.901
Neg logit contributions
iga-2.690
 Drac-2.608
neys-2.389
 sqor-2.368
ngth-2.360
Token the
Token ID262
Feature activation+1.43e+00

Pos logit contributions
hyde+0.583
\.+0.569
netflix+0.539
ergus+0.525
holding+0.523
Neg logit contributions
iga-0.722
 Drac-0.705
neys-0.643
 sqor-0.633
ngth-0.628
Token with
Token ID351
Feature activation+7.89e-01

Pos logit contributions
hyde+2.766
\.+2.745
netflix+2.525
holding+2.514
ergus+2.496
Neg logit contributions
iga-3.551
 Drac-3.441
neys-3.154
ngth-3.116
 sqor-3.107
Token Pro
Token ID1041
Feature activation-1.38e+00

Pos logit contributions
hyde+0.552
\.+0.537
netflix+0.513
ergus+0.498
holding+0.494
Neg logit contributions
iga-0.681
 Drac-0.671
neys-0.605
 sqor-0.597
ngth-0.594
Tokenpe
Token ID431
Feature activation+1.01e+00

Pos logit contributions
iga+0.944
 Drac+0.913
neys+0.812
 sqor+0.794
ngth+0.788
Neg logit contributions
hyde-1.220
\.-1.196
netflix-1.145
ergus-1.124
holding-1.121
Tokenller
Token ID6051
Feature activation+2.83e+00

Pos logit contributions
hyde+0.762
\.+0.752
netflix+0.709
holding+0.694
ergus+0.694
Neg logit contributions
iga-0.811
 Drac-0.796
neys-0.720
 sqor-0.709
ngth-0.706
TokenBlade
Token ID47520
Feature activation+1.14e+01

Pos logit contributions
hyde+2.254
\.+2.209
netflix+2.101
ergus+2.056
holding+2.055
Neg logit contributions
iga-2.161
 Drac-2.102
neys-1.893
 sqor-1.864
ngth-1.849
Token Technology
Token ID8987
Feature activation+1.11e+01

Pos logit contributions
\.+5.311
hyde+5.250
ooth+4.619
holding+4.567
ergus+4.440
Neg logit contributions
iga-11.712
 Drac-11.198
ngth-10.646
neys-10.612
 sqor-10.546
Token and
Token ID290
Feature activation+2.10e+00

Pos logit contributions
\.+6.765
hyde+6.486
holding+5.970
ooth+5.949
ergus+5.822
Neg logit contributions
iga-9.787
 Drac-9.517
neys-8.942
ngth-8.768
 sqor-8.611
Token two
Token ID734
Feature activation+3.77e+00

Pos logit contributions
hyde+1.524
\.+1.491
netflix+1.411
ergus+1.378
holding+1.376
Neg logit contributions
iga-1.755
 Drac-1.710
neys-1.556
 sqor-1.531
ngth-1.521
Token 8
Token ID807
Feature activation+3.77e+00

Pos logit contributions
hyde+2.657
\.+2.623
netflix+2.434
holding+2.408
ooth+2.394
Neg logit contributions
iga-3.205
 Drac-3.093
neys-2.848
ngth-2.800
 sqor-2.798
Tokenmm
Token ID3020
Feature activation+4.62e+00

Pos logit contributions
hyde+2.347
\.+2.305
netflix+2.147
holding+2.091
ergus+2.090
Neg logit contributions
iga-3.526
 Drac-3.449
neys-3.174
 sqor-3.120
ngth-3.117
Token Super
Token ID3115
Feature activation+9.70e-01

Pos logit contributions
hyde+3.252
\.+3.234
netflix+2.990
holding+2.987
ooth+2.978
Neg logit contributions
iga-3.903
 Drac-3.732
neys-3.457
ngth-3.404
 sqor-3.382
Token on
Token ID319
Feature activation+1.69e+00

Pos logit contributions
iga+0.240
 Drac+0.232
neys+0.212
 sqor+0.205
ngth+0.205
Neg logit contributions
hyde-0.230
\.-0.221
netflix-0.213
ergus-0.208
holding-0.208
Token the
Token ID262
Feature activation+1.87e+00

Pos logit contributions
hyde+1.223
\.+1.196
netflix+1.133
ergus+1.106
holding+1.105
Neg logit contributions
iga-1.412
 Drac-1.376
neys-1.251
 sqor-1.232
ngth-1.225
Token cost
Token ID1575
Feature activation+2.35e+00

Pos logit contributions
hyde+1.343
\.+1.316
netflix+1.242
holding+1.214
ergus+1.212
Neg logit contributions
iga-1.577
 Drac-1.536
neys-1.400
 sqor-1.378
ngth-1.375
Token of
Token ID286
Feature activation+1.50e+00

Pos logit contributions
hyde+1.787
\.+1.748
netflix+1.659
holding+1.624
ergus+1.618
Neg logit contributions
iga-1.881
 Drac-1.836
neys-1.661
 sqor-1.636
ngth-1.626
Token oil
Token ID3056
Feature activation+1.35e+00

Pos logit contributions
hyde+1.092
\.+1.068
netflix+1.012
ergus+0.988
holding+0.987
Neg logit contributions
iga-1.245
 Drac-1.214
neys-1.102
 sqor-1.085
ngth-1.078
Token and
Token ID290
Feature activation+1.01e+01

Pos logit contributions
hyde+0.953
\.+0.931
netflix+0.881
ergus+0.859
holding+0.858
Neg logit contributions
iga-1.149
 Drac-1.121
neys-1.021
 sqor-1.006
ngth-0.999
Token a
Token ID257
Feature activation+2.47e+00

Pos logit contributions
\.+5.596
hyde+5.517
holding+5.160
netflix+5.056
ooth+4.905
Neg logit contributions
iga-9.299
 Drac-9.107
ngth-8.769
neys-8.709
 sqor-8.351
Token faulty
Token ID30954
Feature activation+2.88e+00

Pos logit contributions
hyde+1.797
\.+1.762
netflix+1.667
holding+1.630
ergus+1.624
Neg logit contributions
iga-2.050
 Drac-2.002
neys-1.820
 sqor-1.791
ngth-1.789
Token presumption
Token ID35467
Feature activation-1.82e+00

Pos logit contributions
hyde+2.057
\.+2.020
netflix+1.907
holding+1.879
ergus+1.859
Neg logit contributions
iga-2.421
 Drac-2.360
neys-2.153
ngth-2.132
 sqor-2.123
Token that
Token ID326
Feature activation+3.67e+00

Pos logit contributions
iga+1.458
 Drac+1.418
neys+1.275
 sqor+1.256
ngth+1.246
Neg logit contributions
hyde-1.392
\.-1.339
netflix-1.293
ergus-1.267
ooth-1.252
Token renewable
Token ID15713
Feature activation-6.89e-01

Pos logit contributions
hyde+2.495
\.+2.470
netflix+2.318
holding+2.277
ooth+2.252
Neg logit contributions
iga-3.170
 Drac-3.090
neys-2.838
ngth-2.822
 sqor-2.803
Token academic
Token ID8233
Feature activation+6.53e+00

Pos logit contributions
hyde+0.696
\.+0.681
netflix+0.639
holding+0.624
ergus+0.624
Neg logit contributions
iga-0.893
 Drac-0.872
neys-0.796
 sqor-0.784
ngth-0.780
Token departments
Token ID13346
Feature activation+5.56e+00

Pos logit contributions
\.+4.029
hyde+4.013
holding+3.698
netflix+3.612
ooth+3.607
Neg logit contributions
iga-5.937
 Drac-5.775
neys-5.313
ngth-5.309
 sqor-5.214
Token and
Token ID290
Feature activation+5.56e+00

Pos logit contributions
\.+3.607
hyde+3.600
netflix+3.309
holding+3.278
ooth+3.233
Neg logit contributions
iga-4.900
 Drac-4.840
neys-4.389
 sqor-4.340
ngth-4.338
Token a
Token ID257
Feature activation+1.00e-01

Pos logit contributions
\.+3.672
hyde+3.647
holding+3.389
ooth+3.333
netflix+3.315
Neg logit contributions
iga-4.800
 Drac-4.654
neys-4.316
ngth-4.269
 sqor-4.240
Token National
Token ID2351
Feature activation+4.23e+00

Pos logit contributions
hyde+0.073
\.+0.071
netflix+0.067
ergus+0.066
holding+0.066
Neg logit contributions
iga-0.084
 Drac-0.082
neys-0.074
 sqor-0.073
ngth-0.073
Token Science
Token ID5800
Feature activation+1.01e+01

Pos logit contributions
hyde+3.049
\.+3.039
netflix+2.804
holding+2.797
ooth+2.784
Neg logit contributions
iga-3.415
 Drac-3.328
neys-3.099
ngth-3.064
 sqor-3.045
Token Foundation
Token ID5693
Feature activation-5.04e+00

Pos logit contributions
\.+4.573
hyde+4.550
netflix+4.185
ooth+4.079
ergus+4.008
Neg logit contributions
iga-10.324
 Drac-10.195
neys-9.646
 sqor-9.567
ngth-9.375
Token Engineering
Token ID14044
Feature activation-4.93e-01

Pos logit contributions
iga+4.139
 Drac+4.028
 sqor+3.563
neys+3.559
ngth+3.529
Neg logit contributions
hyde-3.721
\.-3.549
netflix-3.481
ergus-3.380
holding-3.288
Token Research
Token ID4992
Feature activation+2.65e+00

Pos logit contributions
iga+0.400
 Drac+0.390
neys+0.351
 sqor+0.346
ngth+0.342
Neg logit contributions
hyde-0.372
\.-0.362
netflix-0.346
ergus-0.337
holding-0.335
Token Center
Token ID3337
Feature activation+6.05e-01

Pos logit contributions
hyde+2.127
\.+2.096
netflix+1.975
holding+1.955
ooth+1.951
Neg logit contributions
iga-1.975
 Drac-1.928
neys-1.745
ngth-1.712
 sqor-1.712
Token.
Token ID13
Feature activation-6.90e-01

Pos logit contributions
hyde+0.433
\.+0.422
netflix+0.401
ergus+0.391
holding+0.390
Neg logit contributions
iga-0.514
 Drac-0.501
neys-0.456
 sqor-0.449
ngth-0.446
Token work
Token ID670
Feature activation+5.52e+00

Pos logit contributions
hyde+5.098
\.+4.992
netflix+4.677
ooth+4.649
holding+4.570
Neg logit contributions
iga-6.530
 Drac-6.321
 sqor-5.891
ngth-5.791
neys-5.757
Token like
Token ID588
Feature activation+5.21e+00

Pos logit contributions
\.+3.638
hyde+3.622
netflix+3.353
ooth+3.331
ergus+3.285
Neg logit contributions
iga-4.820
 Drac-4.695
neys-4.328
ngth-4.296
 sqor-4.289
Token his
Token ID465
Feature activation+5.92e+00

Pos logit contributions
hyde+3.429
\.+3.404
netflix+3.200
ooth+3.142
holding+3.122
Neg logit contributions
iga-4.588
 Drac-4.408
neys-4.140
ngth-4.085
 sqor-4.065
Token version
Token ID2196
Feature activation+5.03e+00

Pos logit contributions
hyde+3.967
\.+3.871
ooth+3.601
netflix+3.596
ergus+3.571
Neg logit contributions
iga-5.148
 Drac-4.946
neys-4.622
ngth-4.583
 sqor-4.540
Token,
Token ID11
Feature activation+3.14e+00

Pos logit contributions
hyde+3.306
\.+3.243
netflix+3.009
ooth+2.968
holding+2.956
Neg logit contributions
iga-4.470
 Drac-4.370
neys-4.007
 sqor-3.995
ngth-3.936
Token but
Token ID475
Feature activation+2.23e+00

Pos logit contributions
hyde+2.061
\.+2.021
netflix+1.888
holding+1.849
ooth+1.845
Neg logit contributions
iga-2.824
 Drac-2.749
neys-2.531
 sqor-2.501
ngth-2.486
Token I
Token ID314
Feature activation-1.10e+00

Pos logit contributions
hyde+1.511
\.+1.486
netflix+1.389
holding+1.361
ergus+1.359
Neg logit contributions
iga-1.963
 Drac-1.909
neys-1.758
 sqor-1.729
ngth-1.726
Token
Token ID447
Feature activation+3.37e+00

Pos logit contributions
iga+0.957
 Drac+0.931
neys+0.849
 sqor+0.834
ngth+0.833
Neg logit contributions
hyde-0.767
\.-0.746
netflix-0.715
ergus-0.696
holding-0.689
Token
Token ID247
Feature activation+3.81e-02

Pos logit contributions
hyde+1.759
\.+1.741
ergus+1.559
netflix+1.553
holding+1.527
Neg logit contributions
iga-3.444
 Drac-3.437
neys-3.203
ngth-3.154
 sqor-3.127
Tokenm
Token ID76
Feature activation-2.77e+00

Pos logit contributions
hyde+0.023
\.+0.022
netflix+0.021
ergus+0.020
ooth+0.020
Neg logit contributions
iga-0.037
 Drac-0.036
neys-0.033
 sqor-0.032
ngth-0.032
Token still
Token ID991
Feature activation+3.28e+00

Pos logit contributions
iga+2.251
 Drac+2.192
neys+1.965
 sqor+1.927
ngth+1.914
Neg logit contributions
hyde-2.064
\.-2.025
netflix-1.958
ergus-1.884
ooth-1.855
Token of
Token ID286
Feature activation+4.05e+00

Pos logit contributions
hyde+1.719
\.+1.679
netflix+1.584
holding+1.548
ergus+1.545
Neg logit contributions
iga-2.091
 Drac-2.038
neys-1.861
 sqor-1.834
ngth-1.821
Token speed
Token ID2866
Feature activation+5.00e+00

Pos logit contributions
hyde+2.766
\.+2.715
holding+2.525
netflix+2.516
ooth+2.477
Neg logit contributions
iga-3.517
 Drac-3.392
neys-3.146
 sqor-3.116
ngth-3.106
Token)
Token ID8
Feature activation+5.79e+00

Pos logit contributions
hyde+3.378
\.+3.315
netflix+3.097
holding+3.081
ooth+3.038
Neg logit contributions
iga-4.345
 Drac-4.248
neys-3.886
 sqor-3.854
ngth-3.831
Token and
Token ID290
Feature activation+1.76e+00

Pos logit contributions
hyde+3.805
\.+3.767
netflix+3.502
holding+3.447
ooth+3.424
Neg logit contributions
iga-5.094
 Drac-4.965
neys-4.567
 sqor-4.566
ngth-4.508
Token Rally
Token ID27752
Feature activation+8.00e+00

Pos logit contributions
hyde+1.334
\.+1.308
netflix+1.237
holding+1.214
ergus+1.211
Neg logit contributions
iga-1.403
 Drac-1.365
neys-1.240
 sqor-1.221
ngth-1.213
Token (
Token ID357
Feature activation+3.65e+00

Pos logit contributions
hyde+5.367
\.+5.255
holding+4.866
ooth+4.807
netflix+4.742
Neg logit contributions
iga-6.753
 Drac-6.625
 sqor-6.196
neys-6.066
ngth-6.044
Tokenfor
Token ID1640
Feature activation-8.54e-01

Pos logit contributions
hyde+2.611
\.+2.555
netflix+2.414
holding+2.391
ooth+2.348
Neg logit contributions
iga-3.061
 Drac-3.000
neys-2.734
 sqor-2.709
ngth-2.679
Token maximum
Token ID5415
Feature activation-1.18e+00

Pos logit contributions
iga+0.712
 Drac+0.699
neys+0.627
 sqor+0.618
ngth+0.610
Neg logit contributions
hyde-0.624
\.-0.607
netflix-0.585
ergus-0.567
ooth-0.558
Token strength
Token ID4202
Feature activation+3.69e+00

Pos logit contributions
iga+0.916
 Drac+0.897
neys+0.800
 sqor+0.788
ngth+0.779
Neg logit contributions
hyde-0.929
\.-0.908
netflix-0.873
ergus-0.852
ooth-0.840
Token).
Token ID737
Feature activation+5.92e+00

Pos logit contributions
hyde+2.621
\.+2.575
netflix+2.407
holding+2.389
ergus+2.360
Neg logit contributions
iga-3.122
 Drac-3.041
neys-2.782
 sqor-2.731
ngth-2.713
Token Ch
Token ID609
Feature activation-1.61e+00

Pos logit contributions
\.+3.956
hyde+3.915
holding+3.676
ooth+3.576
netflix+3.561
Neg logit contributions
iga-5.026
 Drac-4.824
 sqor-4.625
ngth-4.600
neys-4.537
Token Ferry
Token ID42609
Feature activation+7.95e+00

Pos logit contributions
iga+0.998
 Drac+0.965
neys+0.869
 sqor+0.852
ngth+0.843
Neg logit contributions
hyde-1.004
\.-0.977
netflix-0.935
ergus-0.909
holding-0.906
Token because
Token ID780
Feature activation-1.85e-01

Pos logit contributions
hyde+5.077
\.+5.017
holding+4.585
ooth+4.542
netflix+4.514
Neg logit contributions
iga-6.879
 Drac-6.831
neys-6.260
ngth-6.248
 sqor-6.080
Token no
Token ID645
Feature activation-1.66e+00

Pos logit contributions
iga+0.160
 Drac+0.156
neys+0.142
 sqor+0.140
ngth+0.139
Neg logit contributions
hyde-0.129
\.-0.126
netflix-0.119
ergus-0.116
holding-0.115
Token one
Token ID530
Feature activation+1.95e+00

Pos logit contributions
iga+1.434
 Drac+1.385
neys+1.269
 sqor+1.250
ngth+1.236
Neg logit contributions
hyde-1.167
\.-1.135
netflix-1.087
ergus-1.055
ooth-1.047
Token else
Token ID2073
Feature activation-9.41e-01

Pos logit contributions
hyde+1.400
\.+1.385
netflix+1.294
holding+1.279
ergus+1.268
Neg logit contributions
iga-1.623
 Drac-1.580
neys-1.437
 sqor-1.428
ngth-1.422
Token studying
Token ID11065
Feature activation+2.11e+00

Pos logit contributions
iga+0.759
 Drac+0.739
neys+0.669
 sqor+0.655
ngth+0.651
Neg logit contributions
hyde-0.714
\.-0.695
netflix-0.664
ergus-0.647
holding-0.643
Token and
Token ID290
Feature activation-2.76e+00

Pos logit contributions
hyde+1.484
\.+1.458
netflix+1.374
ergus+1.346
holding+1.344
Neg logit contributions
iga-1.791
 Drac-1.757
neys-1.600
 sqor-1.576
ngth-1.571
Token re
Token ID302
Feature activation-3.05e+00

Pos logit contributions
iga+2.159
 Drac+2.085
neys+1.880
 sqor+1.844
ngth+1.809
Neg logit contributions
hyde-2.175
\.-2.091
netflix-2.030
ergus-1.967
ooth-1.939
Token-
Token ID12
Feature activation-2.29e+00

Pos logit contributions
iga+2.489
 Drac+2.396
neys+2.196
ngth+2.130
 sqor+2.123
Neg logit contributions
hyde-2.283
\.-2.212
netflix-2.103
ergus-2.038
ooth-2.018
Tokenstud
Token ID19149
Feature activation+4.62e-01

Pos logit contributions
iga+1.869
 Drac+1.796
neys+1.646
 sqor+1.598
ngth+1.583
Neg logit contributions
hyde-1.717
\.-1.665
netflix-1.579
ergus-1.544
ooth-1.526
Tokenying
Token ID1112
Feature activation-3.92e-01

Pos logit contributions
hyde+0.344
\.+0.339
netflix+0.321
ergus+0.315
holding+0.315
Neg logit contributions
iga-0.374
 Drac-0.366
neys-0.330
 sqor-0.327
ngth-0.327
Token-
Token ID12
Feature activation-4.18e+00

Pos logit contributions
iga+2.929
 Drac+2.779
neys+2.471
 sqor+2.434
ngth+2.342
Neg logit contributions
hyde-2.957
\.-2.794
netflix-2.755
holding-2.653
ooth-2.642
Token0
Token ID15
Feature activation-9.24e-01

Pos logit contributions
iga+3.430
 Drac+3.225
neys+2.983
 sqor+2.874
ngth+2.821
Neg logit contributions
hyde-3.092
\.-2.970
netflix-2.816
ergus-2.758
ooth-2.742
Token and
Token ID290
Feature activation-3.82e+00

Pos logit contributions
iga+0.768
 Drac+0.744
neys+0.673
 sqor+0.663
ngth+0.650
Neg logit contributions
hyde-0.684
\.-0.660
netflix-0.635
ergus-0.614
ooth-0.612
Token I
Token ID314
Feature activation-1.19e+00

Pos logit contributions
iga+3.173
 Drac+3.111
neys+2.779
 sqor+2.700
ngth+2.624
Neg logit contributions
hyde-2.842
\.-2.682
netflix-2.606
ergus-2.512
ooth-2.489
Token was
Token ID373
Feature activation-1.59e+00

Pos logit contributions
iga+1.042
 Drac+1.012
neys+0.928
 sqor+0.909
ngth+0.899
Neg logit contributions
hyde-0.823
\.-0.797
netflix-0.758
ergus-0.740
holding-0.734
Token captain
Token ID10654
Feature activation+1.96e+00

Pos logit contributions
iga+1.305
 Drac+1.264
neys+1.143
 sqor+1.116
ngth+1.100
Neg logit contributions
hyde-1.190
\.-1.150
netflix-1.106
ergus-1.069
holding-1.058
Token of
Token ID286
Feature activation+1.75e+00

Pos logit contributions
hyde+1.413
\.+1.387
netflix+1.307
holding+1.286
ergus+1.281
Neg logit contributions
iga-1.636
 Drac-1.598
neys-1.455
 sqor-1.431
ngth-1.428
Token the
Token ID262
Feature activation-9.27e-01

Pos logit contributions
hyde+1.252
\.+1.226
netflix+1.156
ergus+1.131
holding+1.130
Neg logit contributions
iga-1.486
 Drac-1.450
neys-1.321
 sqor-1.302
ngth-1.297
Token side
Token ID1735
Feature activation-1.98e+00

Pos logit contributions
iga+0.737
 Drac+0.718
neys+0.645
 sqor+0.633
ngth+0.620
Neg logit contributions
hyde-0.720
\.-0.695
netflix-0.668
ergus-0.649
holding-0.647
Token and
Token ID290
Feature activation-2.02e+00

Pos logit contributions
iga+1.618
 Drac+1.574
neys+1.424
 sqor+1.396
ngth+1.371
Neg logit contributions
hyde-1.484
\.-1.426
netflix-1.379
ergus-1.337
ooth-1.325
Token was
Token ID373
Feature activation-2.49e+00

Pos logit contributions
iga+1.715
 Drac+1.671
neys+1.509
 sqor+1.475
ngth+1.446
Neg logit contributions
hyde-1.474
\.-1.416
netflix-1.361
ergus-1.314
holding-1.299
Token-
Token ID12
Feature activation+3.97e+00

Pos logit contributions
\.+3.455
hyde+3.445
ooth+3.088
holding+3.048
netflix+2.884
Neg logit contributions
iga-8.845
 Drac-8.757
 sqor-8.199
ngth-8.148
neys-8.105
Tokent
Token ID83
Feature activation+6.08e+00

Pos logit contributions
hyde+2.510
\.+2.450
holding+2.280
ooth+2.259
netflix+2.258
Neg logit contributions
iga-3.638
 Drac-3.634
ngth-3.317
 sqor-3.304
neys-3.281
Tokenune
Token ID1726
Feature activation+3.31e+00

Pos logit contributions
hyde+5.380
\.+5.335
ooth+5.192
ergus+4.916
netflix+4.869
Neg logit contributions
 Drac-4.030
iga-3.876
 sqor-3.674
ngth-3.553
neys-3.401
Token their
Token ID511
Feature activation+2.21e+00

Pos logit contributions
hyde+2.238
\.+2.206
netflix+2.039
holding+2.019
ooth+2.012
Neg logit contributions
iga-2.888
 Drac-2.821
neys-2.594
ngth-2.560
 sqor-2.535
Token graphics
Token ID9382
Feature activation+4.97e+00

Pos logit contributions
hyde+1.515
\.+1.494
netflix+1.385
holding+1.369
ergus+1.361
Neg logit contributions
iga-1.913
 Drac-1.866
neys-1.709
ngth-1.683
 sqor-1.678
Token cards
Token ID4116
Feature activation+5.57e+00

Pos logit contributions
\.+3.148
hyde+3.146
holding+2.859
netflix+2.836
ooth+2.831
Neg logit contributions
iga-4.461
 Drac-4.382
neys-4.063
ngth-3.999
 sqor-3.977
Token.
Token ID13
Feature activation+5.30e+00

Pos logit contributions
hyde+3.665
\.+3.653
holding+3.357
netflix+3.322
ooth+3.293
Neg logit contributions
iga-4.823
 Drac-4.757
neys-4.363
ngth-4.343
 sqor-4.277
Token Comp
Token ID3082
Feature activation+2.58e+00

Pos logit contributions
hyde+3.589
\.+3.559
netflix+3.294
holding+3.271
ergus+3.228
Neg logit contributions
iga-4.568
 Drac-4.440
neys-4.103
ngth-4.035
 sqor-4.028
Tokeneting
Token ID13629
Feature activation+2.20e+00

Pos logit contributions
hyde+1.485
\.+1.446
netflix+1.349
ergus+1.315
holding+1.313
Neg logit contributions
iga-2.535
 Drac-2.486
neys-2.287
 sqor-2.268
ngth-2.254
Token with
Token ID351
Feature activation+2.52e+00

Pos logit contributions
hyde+1.550
\.+1.520
netflix+1.428
holding+1.398
ergus+1.398
Neg logit contributions
iga-1.890
 Drac-1.835
neys-1.676
 sqor-1.649
ngth-1.644
Token flagship
Token ID21336
Feature activation+3.04e+00

Pos logit contributions
hyde+1.764
\.+1.733
netflix+1.633
holding+1.596
ergus+1.592
Neg logit contributions
iga-2.163
 Drac-2.104
neys-1.923
 sqor-1.890
ngth-1.887
Token representing
Token ID10200
Feature activation-1.39e+00

Pos logit contributions
iga+2.345
 Drac+2.294
neys+2.060
 sqor+2.046
ngth+1.990
Neg logit contributions
hyde-2.041
\.-1.976
netflix-1.917
ergus-1.850
ooth-1.813
Token density
Token ID12109
Feature activation+2.29e+00

Pos logit contributions
iga+1.280
 Drac+1.253
neys+1.143
 sqor+1.133
ngth+1.114
Neg logit contributions
hyde-0.883
\.-0.858
netflix-0.818
ergus-0.793
ooth-0.780
Token to
Token ID284
Feature activation-5.19e-01

Pos logit contributions
hyde+1.631
\.+1.596
netflix+1.504
holding+1.477
ergus+1.466
Neg logit contributions
iga-1.938
 Drac-1.900
neys-1.731
 sqor-1.703
ngth-1.695
Token a
Token ID257
Feature activation-1.81e+00

Pos logit contributions
iga+0.428
 Drac+0.415
neys+0.375
 sqor+0.370
ngth+0.365
Neg logit contributions
hyde-0.386
\.-0.376
netflix-0.362
ergus-0.350
holding-0.347
Token grid
Token ID10706
Feature activation-1.13e+00

Pos logit contributions
iga+1.463
 Drac+1.412
neys+1.268
 sqor+1.257
ngth+1.227
Neg logit contributions
hyde-1.381
\.-1.344
netflix-1.293
ergus-1.259
ooth-1.241
Token simpl
Token ID7106
Feature activation-2.97e-01

Pos logit contributions
iga+0.916
 Drac+0.884
neys+0.806
 sqor+0.784
ngth+0.784
Neg logit contributions
hyde-0.847
\.-0.827
netflix-0.795
ergus-0.774
ooth-0.762
Tokenifies
Token ID6945
Feature activation-4.12e+00

Pos logit contributions
iga+0.194
 Drac+0.186
neys+0.164
 sqor+0.159
ngth+0.158
Neg logit contributions
hyde-0.274
\.-0.267
netflix-0.257
ergus-0.252
holding-0.251
Token computing
Token ID14492
Feature activation-4.32e+00

Pos logit contributions
iga+3.334
 Drac+3.216
neys+2.869
 sqor+2.854
ngth+2.746
Neg logit contributions
hyde-3.151
\.-3.008
netflix-2.978
ergus-2.822
holding-2.741
Token spatial
Token ID21739
Feature activation+2.03e-01

Pos logit contributions
iga+3.492
 Drac+3.358
neys+3.018
 sqor+2.970
ngth+2.916
Neg logit contributions
hyde-3.258
\.-3.152
netflix-3.124
ergus-2.961
holding-2.916
Token derivatives
Token ID28486
Feature activation-4.19e-01

Pos logit contributions
hyde+0.153
\.+0.149
netflix+0.143
ergus+0.139
holding+0.138
Neg logit contributions
iga-0.164
 Drac-0.160
neys-0.145
 sqor-0.143
ngth-0.141
Token.
Token ID13
Feature activation-2.89e+00

Pos logit contributions
iga+0.350
 Drac+0.339
neys+0.308
 sqor+0.303
ngth+0.301
Neg logit contributions
hyde-0.308
\.-0.298
netflix-0.286
ergus-0.279
holding-0.277
Tokens
Token ID82
Feature activation+3.88e-01

Pos logit contributions
iga+0.349
 Drac+0.338
neys+0.305
 sqor+0.299
ngth+0.298
Neg logit contributions
hyde-0.357
\.-0.348
netflix-0.333
ergus-0.325
holding-0.324
Token not
Token ID407
Feature activation-8.09e-01

Pos logit contributions
hyde+0.276
\.+0.269
netflix+0.255
ergus+0.249
holding+0.248
Neg logit contributions
iga-0.331
 Drac-0.323
neys-0.294
 sqor-0.289
ngth-0.287
Token.
Token ID13
Feature activation-1.19e+00

Pos logit contributions
iga+0.685
 Drac+0.666
neys+0.602
 sqor+0.593
ngth+0.588
Neg logit contributions
hyde-0.581
\.-0.563
netflix-0.541
ergus-0.525
holding-0.523
Token This
Token ID770
Feature activation-2.99e+00

Pos logit contributions
iga+0.992
 Drac+0.971
neys+0.878
 sqor+0.860
ngth+0.853
Neg logit contributions
hyde-0.862
\.-0.835
netflix-0.800
ergus-0.778
ooth-0.773
Token process
Token ID1429
Feature activation+4.95e-01

Pos logit contributions
iga+2.343
 Drac+2.263
neys+2.039
 sqor+2.009
ngth+1.963
Neg logit contributions
hyde-2.341
\.-2.268
netflix-2.189
ergus-2.122
ooth-2.102
Token could
Token ID714
Feature activation-2.96e+00

Pos logit contributions
hyde+0.371
\.+0.363
netflix+0.345
ergus+0.337
holding+0.336
Neg logit contributions
iga-0.403
 Drac-0.393
neys-0.356
 sqor-0.350
ngth-0.348
Token be
Token ID307
Feature activation+2.35e+00

Pos logit contributions
iga+2.518
 Drac+2.412
neys+2.219
 sqor+2.164
ngth+2.152
Neg logit contributions
hyde-2.114
\.-2.039
netflix-1.952
ergus-1.913
holding-1.913
Token visual
Token ID5874
Feature activation+6.07e-01

Pos logit contributions
hyde+1.690
\.+1.661
netflix+1.564
holding+1.536
ergus+1.536
Neg logit contributions
iga-1.962
 Drac-1.921
neys-1.745
 sqor-1.731
ngth-1.723
Tokenized
Token ID1143
Feature activation-1.63e+00

Pos logit contributions
hyde+0.408
\.+0.398
netflix+0.375
ergus+0.365
holding+0.364
Neg logit contributions
iga-0.542
 Drac-0.529
neys-0.484
 sqor-0.477
ngth-0.473
Token kind
Token ID1611
Feature activation-3.37e+00

Pos logit contributions
iga+1.346
 Drac+1.304
neys+1.183
 sqor+1.163
ngth+1.149
Neg logit contributions
hyde-1.207
\.-1.166
netflix-1.116
ergus-1.086
holding-1.084
Token of
Token ID286
Feature activation+1.20e+00

Pos logit contributions
iga+2.562
 Drac+2.445
neys+2.236
 sqor+2.194
ngth+2.151
Neg logit contributions
hyde-2.719
\.-2.630
netflix-2.540
ergus-2.472
ooth-2.452
Token cards
Token ID4116
Feature activation-8.71e-01

Pos logit contributions
hyde+1.298
\.+1.270
netflix+1.197
ergus+1.167
holding+1.166
Neg logit contributions
iga-1.676
 Drac-1.637
neys-1.495
 sqor-1.475
ngth-1.466
Token both
Token ID1111
Feature activation-1.60e+00

Pos logit contributions
iga+0.728
 Drac+0.710
neys+0.645
 sqor+0.634
ngth+0.625
Neg logit contributions
hyde-0.635
\.-0.615
netflix-0.587
ergus-0.573
holding-0.568
Token online
Token ID2691
Feature activation+7.43e-01

Pos logit contributions
iga+1.433
 Drac+1.400
neys+1.276
 sqor+1.257
ngth+1.240
Neg logit contributions
hyde-1.071
\.-1.034
netflix-0.981
ergus-0.959
holding-0.949
Token and
Token ID290
Feature activation-2.50e+00

Pos logit contributions
hyde+0.536
\.+0.521
netflix+0.496
ergus+0.484
holding+0.480
Neg logit contributions
iga-0.628
 Drac-0.611
neys-0.556
 sqor-0.546
ngth-0.542
Token in
Token ID287
Feature activation+1.10e+00

Pos logit contributions
iga+2.030
 Drac+1.971
neys+1.786
 sqor+1.763
ngth+1.708
Neg logit contributions
hyde-1.868
\.-1.809
netflix-1.733
ergus-1.699
holding-1.681
Token-
Token ID12
Feature activation-4.96e+00

Pos logit contributions
hyde+0.832
\.+0.814
netflix+0.773
ergus+0.756
holding+0.755
Neg logit contributions
iga-0.881
 Drac-0.859
neys-0.777
 sqor-0.765
ngth-0.760
Tokenstore
Token ID8095
Feature activation-1.49e+00

Pos logit contributions
iga+2.551
 Drac+2.347
neys+2.040
 sqor+1.911
ngth+1.847
Neg logit contributions
hyde-5.136
\.-4.999
netflix-4.896
ergus-4.795
ooth-4.699
Token.
Token ID13
Feature activation+2.72e-02

Pos logit contributions
iga+1.230
 Drac+1.188
neys+1.085
 sqor+1.061
ngth+1.041
Neg logit contributions
hyde-1.106
\.-1.063
netflix-1.034
ergus-1.006
holding-0.988
Token
Token ID198
Feature activation+1.77e+00

Pos logit contributions
hyde+0.020
\.+0.019
netflix+0.018
ergus+0.018
holding+0.018
Neg logit contributions
iga-0.023
 Drac-0.022
neys-0.020
 sqor-0.020
ngth-0.020
Token
Token ID198
Feature activation+1.30e+00

Pos logit contributions
\.+1.165
hyde+1.136
holding+1.070
netflix+1.064
ooth+1.030
Neg logit contributions
iga-1.579
 Drac-1.560
ngth-1.470
 sqor-1.438
neys-1.423
TokenWith
Token ID3152
Feature activation+4.30e-01

Pos logit contributions
hyde+0.898
\.+0.882
netflix+0.832
holding+0.812
ergus+0.809
Neg logit contributions
iga-1.130
 Drac-1.106
neys-1.006
 sqor-0.995
ngth-0.991
Token Mar
Token ID1526
Feature activation-3.97e-01

Pos logit contributions
hyde+3.122
\.+3.051
netflix+2.862
ergus+2.820
holding+2.819
Neg logit contributions
iga-3.467
 Drac-3.376
neys-3.087
 sqor-3.066
ngth-3.031
Tokenquez
Token ID22281
Feature activation-1.74e+00

Pos logit contributions
iga+0.302
 Drac+0.292
neys+0.264
 sqor+0.257
ngth+0.255
Neg logit contributions
hyde-0.319
\.-0.313
netflix-0.298
holding-0.292
ergus-0.292
Token McC
Token ID5108
Feature activation+4.07e+00

Pos logit contributions
iga+1.466
 Drac+1.433
neys+1.289
 sqor+1.261
ngth+1.242
Neg logit contributions
hyde-1.263
\.-1.226
netflix-1.178
ergus-1.143
holding-1.134
Tokenool
Token ID970
Feature activation-1.78e+00

Pos logit contributions
hyde+2.903
\.+2.867
netflix+2.695
ooth+2.673
ergus+2.671
Neg logit contributions
iga-3.362
 Drac-3.323
neys-3.000
 sqor-2.986
ngth-2.953
Token,
Token ID11
Feature activation-6.80e-01

Pos logit contributions
iga+1.507
 Drac+1.461
neys+1.335
 sqor+1.296
ngth+1.285
Neg logit contributions
hyde-1.274
\.-1.245
netflix-1.196
ergus-1.154
holding-1.144
Token 49
Token ID5125
Feature activation-3.59e+00

Pos logit contributions
iga+0.561
 Drac+0.548
neys+0.496
 sqor+0.488
ngth+0.483
Neg logit contributions
hyde-0.503
\.-0.490
netflix-0.466
ergus-0.454
holding-0.452
Token
Token ID198
Feature activation-3.22e+00

Pos logit contributions
iga+2.822
 Drac+2.731
neys+2.488
 sqor+2.422
ngth+2.399
Neg logit contributions
hyde-2.725
\.-2.641
netflix-2.594
ergus-2.500
ooth-2.498
Token
Token ID198
Feature activation-8.37e-01

Pos logit contributions
iga+2.591
 Drac+2.509
neys+2.275
 sqor+2.208
ngth+2.154
Neg logit contributions
hyde-2.461
\.-2.351
netflix-2.264
ergus-2.225
ooth-2.204
TokenBrend
Token ID48015
Feature activation-2.61e+00

Pos logit contributions
iga+0.864
 Drac+0.842
neys+0.786
 sqor+0.769
ngth+0.762
Neg logit contributions
hyde-0.446
\.-0.425
netflix-0.397
ergus-0.387
ooth-0.384
Tokena
Token ID64
Feature activation+1.45e+00

Pos logit contributions
iga+2.123
 Drac+2.053
neys+1.867
 sqor+1.826
ngth+1.800
Neg logit contributions
hyde-1.959
\.-1.901
netflix-1.819
holding-1.766
ergus-1.764
Token McC
Token ID5108
Feature activation+4.99e+00

Pos logit contributions
hyde+0.894
\.+0.869
netflix+0.817
ergus+0.793
holding+0.792
Neg logit contributions
iga-1.365
 Drac-1.336
neys-1.227
 sqor-1.210
ngth-1.203
Token.
Token ID13
Feature activation+8.90e-01

Pos logit contributions
iga+0.781
 Drac+0.759
neys+0.691
 sqor+0.676
ngth+0.670
Neg logit contributions
hyde-0.687
\.-0.669
netflix-0.638
ergus-0.622
holding-0.617
Token curfew
Token ID49565
Feature activation+1.87e-01

Pos logit contributions
hyde+0.635
\.+0.622
netflix+0.588
holding+0.574
ergus+0.573
Neg logit contributions
iga-0.754
 Drac-0.737
neys-0.670
 sqor-0.661
ngth-0.658
Token.
Token ID13
Feature activation+3.09e+00

Pos logit contributions
hyde+0.137
\.+0.133
netflix+0.127
ergus+0.124
holding+0.123
Neg logit contributions
iga-0.156
 Drac-0.152
neys-0.138
 sqor-0.136
ngth-0.135
Token
Token ID198
Feature activation+3.57e+00

Pos logit contributions
hyde+2.161
\.+2.128
netflix+2.008
holding+1.961
ergus+1.943
Neg logit contributions
iga-2.644
 Drac-2.569
neys-2.347
 sqor-2.331
ngth-2.319
Token
Token ID198
Feature activation-1.17e+00

Pos logit contributions
\.+2.306
hyde+2.228
holding+2.117
netflix+2.105
ergus+2.018
Neg logit contributions
iga-3.216
 Drac-3.189
ngth-3.038
 sqor-2.936
neys-2.909
TokenIt
Token ID1026
Feature activation-6.18e-01

Pos logit contributions
iga+1.000
 Drac+0.965
neys+0.888
 sqor+0.860
ngth+0.852
Neg logit contributions
hyde-0.833
\.-0.802
netflix-0.761
ergus-0.749
ooth-0.744
Token's
Token ID338
Feature activation-1.36e+00

Pos logit contributions
iga+0.513
 Drac+0.499
neys+0.454
 sqor+0.444
ngth+0.441
Neg logit contributions
hyde-0.453
\.-0.441
netflix-0.419
ergus-0.409
holding-0.409
Token all
Token ID477
Feature activation+5.06e+00

Pos logit contributions
iga+1.142
 Drac+1.115
neys+1.007
 sqor+0.988
ngth+0.974
Neg logit contributions
hyde-0.989
\.-0.960
netflix-0.912
ergus-0.886
ooth-0.881
Token been
Token ID587
Feature activation+1.90e+00

Pos logit contributions
hyde+3.417
\.+3.369
netflix+3.191
holding+3.159
ergus+3.142
Neg logit contributions
iga-4.305
 Drac-4.200
neys-3.878
ngth-3.861
 sqor-3.847
Token worth
Token ID2861
Feature activation+8.77e+00

Pos logit contributions
hyde+1.376
\.+1.352
netflix+1.278
holding+1.253
ergus+1.250
Neg logit contributions
iga-1.582
 Drac-1.540
neys-1.405
 sqor-1.387
ngth-1.385
Token it
Token ID340
Feature activation-7.19e-01

Pos logit contributions
hyde+5.326
\.+5.202
holding+4.908
netflix+4.871
ergus+4.821
Neg logit contributions
iga-7.899
 Drac-7.656
ngth-7.239
neys-7.104
 sqor-7.072
Token hours
Token ID2250
Feature activation-2.17e+00

Pos logit contributions
iga+2.098
 Drac+2.038
neys+1.809
 sqor+1.756
ngth+1.709
Neg logit contributions
hyde-2.601
\.-2.520
netflix-2.484
ergus-2.419
ooth-2.417
Token.
Token ID13
Feature activation-1.24e+00

Pos logit contributions
iga+1.787
 Drac+1.737
neys+1.568
 sqor+1.539
ngth+1.529
Neg logit contributions
hyde-1.606
\.-1.541
netflix-1.488
ergus-1.456
ooth-1.447
Token Hundreds
Token ID32636
Feature activation-1.41e-01

Pos logit contributions
iga+1.037
 Drac+1.013
neys+0.917
 sqor+0.895
ngth+0.886
Neg logit contributions
hyde-0.909
\.-0.878
netflix-0.842
ergus-0.820
ooth-0.818
Token of
Token ID286
Feature activation-4.91e+00

Pos logit contributions
iga+0.110
 Drac+0.107
neys+0.096
ngth+0.094
 sqor+0.094
Neg logit contributions
hyde-0.111
\.-0.107
netflix-0.102
ergus-0.101
ooth-0.100
Token thousands
Token ID4138
Feature activation-5.00e+00

Pos logit contributions
iga+4.125
 Drac+4.044
neys+3.617
 sqor+3.562
 WS+3.394
Neg logit contributions
hyde-3.580
\.-3.397
netflix-3.366
ergus-3.212
ooth-3.189
Token of
Token ID286
Feature activation-8.54e+00

Pos logit contributions
iga+3.635
 Drac+3.504
neys+3.093
ngth+3.076
 sqor+2.984
Neg logit contributions
hyde-4.190
\.-3.964
ergus-3.872
netflix-3.847
ooth-3.830
Token Europeans
Token ID20006
Feature activation+5.08e-01

Pos logit contributions
iga+6.471
 Drac+6.298
neys+5.478
 sqor+5.280
 WS+5.217
Neg logit contributions
hyde-6.805
netflix-6.430
\.-6.282
ergus-6.146
ooth-6.092
Token were
Token ID547
Feature activation+3.57e+00

Pos logit contributions
hyde+0.375
\.+0.366
netflix+0.348
ergus+0.339
holding+0.339
Neg logit contributions
iga-0.419
 Drac-0.408
neys-0.370
 sqor-0.365
ngth-0.362
Token left
Token ID1364
Feature activation-9.98e-01

Pos logit contributions
hyde+2.400
\.+2.389
netflix+2.202
holding+2.196
ergus+2.163
Neg logit contributions
iga-3.082
 Drac-3.050
ngth-2.802
neys-2.798
 sqor-2.792
Token without
Token ID1231
Feature activation+1.03e+00

Pos logit contributions
iga+0.816
 Drac+0.787
neys+0.714
 sqor+0.695
ngth+0.691
Neg logit contributions
hyde-0.750
\.-0.730
netflix-0.699
ergus-0.682
ooth-0.681
Token gas
Token ID3623
Feature activation-3.18e+00

Pos logit contributions
hyde+0.697
\.+0.680
netflix+0.642
ergus+0.625
holding+0.624
Neg logit contributions
iga-0.919
 Drac-0.898
neys-0.820
 sqor-0.809
ngth-0.804
Token shirt
Token ID10147
Feature activation+1.26e+00

Pos logit contributions
hyde+1.069
\.+1.053
netflix+0.997
holding+0.978
ergus+0.974
Neg logit contributions
iga-1.188
 Drac-1.151
neys-1.048
 sqor-1.034
ngth-1.034
Token on
Token ID319
Feature activation+4.00e-03

Pos logit contributions
hyde+0.973
\.+0.955
netflix+0.908
holding+0.888
ergus+0.886
Neg logit contributions
iga-0.989
 Drac-0.961
neys-0.866
 sqor-0.856
ngth-0.854
Token fire
Token ID2046
Feature activation-2.30e+00

Pos logit contributions
hyde+0.003
\.+0.003
netflix+0.003
ergus+0.003
holding+0.003
Neg logit contributions
iga-0.003
 Drac-0.003
neys-0.002
 sqor-0.002
ngth-0.002
Token,
Token ID11
Feature activation-3.17e+00

Pos logit contributions
iga+1.917
 Drac+1.876
neys+1.703
 sqor+1.670
ngth+1.642
Neg logit contributions
hyde-1.670
\.-1.610
netflix-1.545
ergus-1.509
ooth-1.505
Token the
Token ID262
Feature activation+2.44e+00

Pos logit contributions
iga+2.570
 Drac+2.537
neys+2.272
 sqor+2.209
ngth+2.181
Neg logit contributions
hyde-2.379
\.-2.302
netflix-2.228
ergus-2.168
ooth-2.135
Token Brad
Token ID8114
Feature activation-8.21e+00

Pos logit contributions
hyde+2.060
\.+2.035
netflix+1.917
holding+1.908
ergus+1.892
Neg logit contributions
iga-1.722
 Drac-1.653
neys-1.493
ngth-1.479
 sqor-1.468
Tokenenton
Token ID26673
Feature activation-7.50e-01

Pos logit contributions
iga+6.550
 Drac+6.307
neys+5.804
ngth+5.317
 sqor+5.270
Neg logit contributions
hyde-6.105
\.-6.009
netflix-5.792
holding-5.472
ooth-5.464
Token Herald
Token ID18277
Feature activation-1.92e+00

Pos logit contributions
iga+0.586
 Drac+0.570
neys+0.514
 sqor+0.499
ngth+0.493
Neg logit contributions
hyde-0.589
\.-0.574
netflix-0.554
ergus-0.536
holding-0.535
Token reported
Token ID2098
Feature activation-2.93e+00

Pos logit contributions
iga+1.454
 Drac+1.406
neys+1.267
 sqor+1.229
ngth+1.204
Neg logit contributions
hyde-1.559
\.-1.507
netflix-1.463
ergus-1.425
holding-1.419
Token.
Token ID13
Feature activation-2.12e+00

Pos logit contributions
iga+2.354
 Drac+2.274
neys+2.044
 sqor+2.017
ngth+1.986
Neg logit contributions
hyde-2.216
\.-2.104
netflix-2.079
ooth-2.021
ergus-1.997
Token
Token ID198
Feature activation-1.40e+00

Pos logit contributions
iga+1.800
 Drac+1.770
neys+1.601
 sqor+1.546
ngth+1.528
Neg logit contributions
hyde-1.530
\.-1.472
netflix-1.419
ergus-1.381
ooth-1.378
Token Xbox
Token ID9445
Feature activation-6.00e+00

Pos logit contributions
iga+0.252
 Drac+0.245
neys+0.220
 sqor+0.216
ngth+0.214
Neg logit contributions
hyde-0.263
\.-0.256
netflix-0.244
ergus-0.240
holding-0.238
Token 360
Token ID11470
Feature activation-1.78e+00

Pos logit contributions
iga+4.038
 Drac+3.785
neys+3.421
ngth+3.212
 sqor+3.203
Neg logit contributions
hyde-5.382
\.-5.130
netflix-4.936
ergus-4.893
holding-4.822
Token and
Token ID290
Feature activation-3.91e+00

Pos logit contributions
iga+1.486
 Drac+1.430
neys+1.298
 sqor+1.268
ngth+1.261
Neg logit contributions
hyde-1.326
\.-1.270
netflix-1.218
ergus-1.194
holding-1.179
Token the
Token ID262
Feature activation-3.61e+00

Pos logit contributions
iga+3.066
 Drac+2.898
neys+2.629
ngth+2.515
 sqor+2.491
Neg logit contributions
hyde-3.135
\.-2.979
ergus-2.844
netflix-2.810
holding-2.807
Token "
Token ID366
Feature activation-1.54e+00

Pos logit contributions
iga+2.884
 Drac+2.764
neys+2.500
 sqor+2.390
ngth+2.389
Neg logit contributions
hyde-2.833
\.-2.683
ergus-2.564
netflix-2.539
ooth-2.523
TokenXbox
Token ID43377
Feature activation-6.97e+00

Pos logit contributions
iga+1.294
 Drac+1.242
neys+1.137
 sqor+1.110
ngth+1.102
Neg logit contributions
hyde-1.120
\.-1.087
ergus-1.015
netflix-1.015
ooth-1.007
Token 720
Token ID26250
Feature activation-1.10e+00

Pos logit contributions
iga+5.189
 Drac+4.931
neys+4.475
 WS+4.228
ngth+4.214
Neg logit contributions
hyde-5.666
\.-5.343
ergus-5.093
netflix-5.073
holding-5.052
Token,"
Token ID553
Feature activation-2.06e+00

Pos logit contributions
iga+0.914
 Drac+0.886
neys+0.805
 sqor+0.790
ngth+0.783
Neg logit contributions
hyde-0.803
\.-0.778
netflix-0.741
ergus-0.730
holding-0.718
Token according
Token ID1864
Feature activation-3.70e+00

Pos logit contributions
iga+1.730
 Drac+1.687
neys+1.537
 sqor+1.506
ngth+1.487
Neg logit contributions
hyde-1.500
\.-1.457
netflix-1.385
ergus-1.358
ooth-1.345
Token to
Token ID284
Feature activation-4.73e+00

Pos logit contributions
iga+2.857
 Drac+2.778
neys+2.485
 sqor+2.467
ngth+2.371
Neg logit contributions
hyde-2.925
\.-2.828
netflix-2.699
ergus-2.655
ooth-2.642
Token the
Token ID262
Feature activation-2.53e+00

Pos logit contributions
iga+3.830
 Drac+3.750
neys+3.390
 sqor+3.381
 WS+3.222
Neg logit contributions
hyde-3.517
\.-3.409
netflix-3.269
ooth-3.168
ergus-3.153
Token fueling
Token ID43881
Feature activation-2.36e+00

Pos logit contributions
hyde+1.368
\.+1.345
netflix+1.263
holding+1.236
ergus+1.231
Neg logit contributions
iga-1.765
 Drac-1.725
neys-1.578
 sqor-1.561
ngth-1.553
Token speculation
Token ID13367
Feature activation+2.26e+00

Pos logit contributions
iga+1.657
 Drac+1.623
neys+1.425
 sqor+1.401
ngth+1.362
Neg logit contributions
hyde-2.033
\.-1.978
netflix-1.895
ergus-1.858
ooth-1.858
Token that
Token ID326
Feature activation+2.02e+00

Pos logit contributions
hyde+1.590
\.+1.565
netflix+1.472
ergus+1.438
holding+1.436
Neg logit contributions
iga-1.931
 Drac-1.883
neys-1.723
 sqor-1.692
ngth-1.685
Token a
Token ID257
Feature activation+1.32e+00

Pos logit contributions
hyde+1.377
\.+1.352
netflix+1.273
holding+1.242
ergus+1.242
Neg logit contributions
iga-1.760
 Drac-1.720
neys-1.575
 sqor-1.554
ngth-1.546
Token deal
Token ID1730
Feature activation-3.40e+00

Pos logit contributions
hyde+0.969
\.+0.949
netflix+0.900
ergus+0.878
holding+0.878
Neg logit contributions
iga-1.094
 Drac-1.066
neys-0.968
 sqor-0.953
ngth-0.948
Token was
Token ID373
Feature activation-7.16e+00

Pos logit contributions
iga+2.703
 Drac+2.607
neys+2.359
 sqor+2.290
ngth+2.266
Neg logit contributions
hyde-2.629
\.-2.520
netflix-2.466
ergus-2.374
ooth-2.370
Token imminent
Token ID21161
Feature activation-1.34e+00

Pos logit contributions
iga+5.326
 Drac+5.218
neys+4.759
 sqor+4.421
 WS+4.313
Neg logit contributions
hyde-5.676
\.-5.423
netflix-5.302
ooth-5.040
ertodd-5.032
Token.
Token ID13
Feature activation+2.29e+00

Pos logit contributions
iga+1.114
 Drac+1.081
neys+0.978
 sqor+0.968
ngth+0.954
Neg logit contributions
hyde-0.992
\.-0.949
netflix-0.920
ergus-0.893
ooth-0.893
Token Reports
Token ID17905
Feature activation-1.65e+00

Pos logit contributions
hyde+1.572
\.+1.550
netflix+1.455
holding+1.420
ergus+1.417
Neg logit contributions
iga-1.988
 Drac-1.938
neys-1.773
 sqor-1.761
ngth-1.746
Token confirmed
Token ID4999
Feature activation+1.09e+00

Pos logit contributions
iga+1.372
 Drac+1.337
neys+1.207
 sqor+1.200
ngth+1.172
Neg logit contributions
hyde-1.208
\.-1.162
netflix-1.115
ergus-1.089
ooth-1.082
Token that
Token ID326
Feature activation-1.07e+00

Pos logit contributions
hyde+0.758
\.+0.741
netflix+0.701
ergus+0.683
holding+0.682
Neg logit contributions
iga-0.938
 Drac-0.915
neys-0.835
 sqor-0.822
ngth-0.817
Token butter
Token ID9215
Feature activation-8.25e-01

Pos logit contributions
hyde+3.371
\.+3.299
netflix+3.101
ergus+3.070
holding+3.063
Neg logit contributions
iga-3.622
 Drac-3.553
neys-3.213
ngth-3.168
 sqor-3.163
Token:
Token ID25
Feature activation-3.41e+00

Pos logit contributions
iga+0.681
 Drac+0.664
neys+0.598
 sqor+0.592
ngth+0.592
Neg logit contributions
hyde-0.603
\.-0.587
netflix-0.566
ergus-0.550
ooth-0.546
Token
Token ID198
Feature activation-2.17e+00

Pos logit contributions
iga+2.651
 Drac+2.575
neys+2.309
 sqor+2.272
ngth+2.258
Neg logit contributions
hyde-2.621
\.-2.576
netflix-2.477
ergus-2.413
ooth-2.396
Token
Token ID198
Feature activation-5.82e-01

Pos logit contributions
iga+1.789
 Drac+1.738
neys+1.577
 sqor+1.542
ngth+1.519
Neg logit contributions
hyde-1.617
\.-1.563
netflix-1.496
ergus-1.463
ooth-1.451
TokenBl
Token ID3629
Feature activation+9.44e-01

Pos logit contributions
iga+0.485
 Drac+0.470
neys+0.428
 sqor+0.419
ngth+0.416
Neg logit contributions
hyde-0.425
\.-0.412
netflix-0.395
ergus-0.384
ooth-0.382
Tokenend
Token ID437
Feature activation-7.48e+00

Pos logit contributions
hyde+0.732
\.+0.718
netflix+0.683
ooth+0.670
ergus+0.669
Neg logit contributions
iga-0.739
 Drac-0.723
neys-0.650
 sqor-0.642
ngth-0.639
Token,
Token ID11
Feature activation-1.16e+00

Pos logit contributions
iga+5.618
 Drac+5.419
neys+4.865
 sqor+4.706
ngth+4.637
Neg logit contributions
hyde-5.727
\.-5.500
netflix-5.487
holding-5.252
ergus-5.245
Token adding
Token ID4375
Feature activation-1.95e-01

Pos logit contributions
iga+0.924
 Drac+0.900
neys+0.813
 sqor+0.798
ngth+0.783
Neg logit contributions
hyde-0.883
\.-0.859
netflix-0.825
ergus-0.804
holding-0.795
Token more
Token ID517
Feature activation-1.88e+00

Pos logit contributions
iga+0.161
 Drac+0.157
neys+0.142
 sqor+0.139
ngth+0.138
Neg logit contributions
hyde-0.145
\.-0.141
netflix-0.135
ergus-0.131
ooth-0.130
Token almond
Token ID26948
Feature activation+3.46e+00

Pos logit contributions
iga+1.517
 Drac+1.469
neys+1.336
 sqor+1.297
ngth+1.279
Neg logit contributions
hyde-1.431
\.-1.395
netflix-1.342
ergus-1.297
ooth-1.295
Token milk
Token ID7545
Feature activation+2.09e+00

Pos logit contributions
hyde+2.187
\.+2.126
netflix+1.989
ergus+1.948
holding+1.946
Neg logit contributions
iga-3.199
 Drac-3.143
neys-2.872
 sqor-2.839
ngth-2.835