TOP ACTIVATIONS
MAX = 8.805

. 0 and later : /* vim 600 : set fold
} End { Try { # Close Graphics $ image .
be removed . Example : /* vi : set dir =
as follows . # !/ bin / sh /
23 24 25 26 27 # loop over the input image
ma directive . # pr ag ma surface surf
thus a line like : /* vi : ts = 4
( can be empty ) Example : vi : no ai
version 5 . 7 : /* vim < 570 : set
of like public izing mar ath ons to promote j ogging
Atomic < T > { /// Create a new , null
break } } Process { # Split our string as it
instead of playing in second division in Turkey ? I can
any text or empty Example : /* vim : set a
any % on two mar ath ons , it 's time
Socialist Party in the then - current crisis of capitalism ,
a look at the then - top " jin x ed
This line is OK : /* vi : set ts =
A . Wind y Daniels and his
) ( Example : Your + 1 shield keeps

MIN ACTIVATIONS
MIN = -7.934

and we wish the process well ." Peter Fan
, which count for at least 1 . 5 times their
utils import paths import n umpy as np import arg parse
is to give the hall its look , and Bour dain
hand out <|endoftext|> Or at least 26 of 27 : THE
g ave the movement its vitality . Better the multiple
been . X - Men : Apocalypse arrives on Blu -
. The Legend of Zelda : O car ina of Time
Play of a popular Halo : Reach map .
for it . If at least this body can be closer
to tell people that at least he did this .. .
ie for the X - Men that could ve
the maps exclusive to Halo : Combat Ev olved PC .
To celebrate the upcoming Halo : The Master Chief Collection ,
been dark for a little while . So , I got
promotional offers for programs that support The Nation s
. Leave it for at least an hour . ( No
and mix well . Season with salt , pepper and chill
data available to give policymakers accurate estimates of monet ized damages
13 . Resident Evil 4 Mis ao

INTERVAL 4.620 - 8.805
CONTAINS 0.080%

with activity when PM Modi caus ally walked in to eat
t got the refugee crisis under control on a
41 z BLACK FO OT T OC AK NOW LED G
back typically in a ten minute window . Path
pul py space opera novels scattered around crew quarters . And

INTERVAL 0.436 - 4.620
CONTAINS 35.442%

25 to 30 years ago . The report concluded that Ike
she said . Though she misses the " int
her . M achel le expresses that ,
uments H ashes And Block Basics . I mentioned that
were played on a half - court in the yard ,

INTERVAL -3.749 - 0.436
CONTAINS 64.106%

I wanted to get an accurate picture of its demographics and
to keep this contest in - un iverse . If you
s Day . For 45 years ,
he went down , didn 't have anything more serious than
definitive . In the interim , this

INTERVAL -7.934 - -3.749
CONTAINS 0.371%

held until the facts are sufficiently clear for the North Korean
in suicide rates among persons ages 10 years and older ,
him . I want everyone to be respectful , and let
permission . <|endoftext|> The S ver ig es Ri ks bank
. Rachel Salt z c alls it " a kind of

Token.
Token ID13
Feature activation-6.94e-01

Pos logit contributions
+0.278
 Hed+0.257
 Holly+0.242
 Polly+0.241
Pie+0.240
Neg logit contributions
 overwhelm-0.283
 overpower-0.254
 confuse-0.251
 overshadow-0.238
secut-0.237
Token0
Token ID15
Feature activation-7.20e-01

Pos logit contributions
+0.523
 Hed+0.481
 Holly+0.450
 Polly+0.449
Pie+0.448
Neg logit contributions
 overwhelm-0.611
 overpower-0.551
 confuse-0.544
 overshadow-0.518
secut-0.515
Token and
Token ID290
Feature activation-8.32e-01

Pos logit contributions
+0.552
 Hed+0.510
 Holly+0.478
 Polly+0.477
Pie+0.475
Neg logit contributions
 overwhelm-0.619
 overpower-0.558
 confuse-0.551
 overshadow-0.524
secut-0.522
Token later
Token ID1568
Feature activation+2.38e+00

Pos logit contributions
+0.693
 Hed+0.644
 Holly+0.607
 Polly+0.606
Pie+0.604
Neg logit contributions
 overwhelm-0.660
 overpower-0.589
 confuse-0.580
 overshadow-0.550
secut-0.549
Token:
Token ID25
Feature activation+2.00e+00

Pos logit contributions
 overwhelm+1.983
 overpower+1.784
 confuse+1.768
 overshadow+1.658
secut+1.653
Neg logit contributions
-1.925
 Hed-1.735
ertodd-1.646
 Holly-1.636
Ton-1.636
Token /*
Token ID11900
Feature activation+8.81e+00

Pos logit contributions
 overwhelm+1.524
 overpower+1.356
 confuse+1.339
 overshadow+1.254
secut+1.248
Neg logit contributions
-1.749
 Hed-1.604
 Holly-1.519
Pie-1.518
 Polly-1.517
Token vim
Token ID43907
Feature activation-1.40e+00

Pos logit contributions
 overwhelm+5.872
 overpower+5.192
 confuse+5.185
secut+4.702
 overshadow+4.684
Neg logit contributions
-8.255
 Hed-7.377
ertodd-7.215
Ton-7.087
 Katz-7.073
Token600
Token ID8054
Feature activation-2.46e+00

Pos logit contributions
+1.020
 Hed+0.939
 Holly+0.880
Pie+0.876
 Polly+0.875
Neg logit contributions
 overwhelm-1.252
 overpower-1.136
 confuse-1.117
 overshadow-1.071
secut-1.064
Token:
Token ID25
Feature activation+1.46e+00

Pos logit contributions
+1.870
 Hed+1.739
 Holly+1.638
Pie+1.627
 Polly+1.624
Neg logit contributions
 overwhelm-2.093
 overpower-1.884
 confuse-1.857
 overshadow-1.779
secut-1.766
Token set
Token ID900
Feature activation-9.02e-01

Pos logit contributions
 overwhelm+1.206
 overpower+1.081
 confuse+1.068
 overshadow+1.010
secut+1.006
Neg logit contributions
-1.186
 Hed-1.087
 Holly-1.024
 Polly-1.023
Pie-1.020
Token fold
Token ID5591
Feature activation+2.50e-01

Pos logit contributions
+0.754
 Hed+0.702
 Holly+0.661
 Polly+0.660
Pie+0.659
Neg logit contributions
 overwhelm-0.710
 overpower-0.632
 confuse-0.624
 overshadow-0.589
secut-0.587
Token }
Token ID1782
Feature activation+5.20e+00

Pos logit contributions
 overwhelm+3.786
 overpower+3.323
 confuse+3.317
 overshadow+3.070
secut+3.057
Neg logit contributions
-4.882
 Hed-4.486
 Polly-4.274
Ton-4.252
 Holly-4.242
Token End
Token ID5268
Feature activation+5.13e+00

Pos logit contributions
 overwhelm+4.018
 overpower+3.566
 confuse+3.555
 overshadow+3.304
secut+3.303
Neg logit contributions
-4.470
 Hed-4.029
ertodd-3.820
Pie-3.819
 Holly-3.818
Token {
Token ID1391
Feature activation+2.78e+00

Pos logit contributions
 overwhelm+3.914
 overpower+3.481
 confuse+3.470
secut+3.253
 overshadow+3.228
Neg logit contributions
-4.424
 Hed-4.029
 Holly-3.828
 Polly-3.824
Pie-3.824
Token Try
Token ID9993
Feature activation+3.78e+00

Pos logit contributions
 overwhelm+2.160
 overpower+1.922
 confuse+1.900
 overshadow+1.786
secut+1.780
Neg logit contributions
-2.372
 Hed-2.184
 Holly-2.067
 Polly-2.065
Pie-2.060
Token {
Token ID1391
Feature activation+3.29e+00

Pos logit contributions
 overwhelm+2.863
 overpower+2.542
 confuse+2.515
 overshadow+2.358
secut+2.357
Neg logit contributions
-3.302
 Hed-3.051
 Holly-2.884
 Polly-2.879
Pie-2.868
Token #
Token ID1303
Feature activation+7.88e+00

Pos logit contributions
 overwhelm+2.667
 overpower+2.384
 confuse+2.362
 overshadow+2.225
secut+2.217
Neg logit contributions
-2.701
 Hed-2.474
 Holly-2.334
 Polly-2.334
Pie-2.329
Token Close
Token ID13872
Feature activation+1.56e+00

Pos logit contributions
 overwhelm+6.391
 confuse+5.763
 overpower+5.731
secut+5.302
 overshadow+5.296
Neg logit contributions
-6.348
 Hed-5.532
ertodd-5.414
Ton-5.284
 sidx-5.234
Token Graphics
Token ID19840
Feature activation+9.59e-01

Pos logit contributions
 overwhelm+1.217
 overpower+1.082
 confuse+1.070
 overshadow+1.008
secut+0.998
Neg logit contributions
-1.337
 Hed-1.226
 Polly-1.162
 Holly-1.160
Pie-1.157
Token $
Token ID720
Feature activation+2.51e+00

Pos logit contributions
 overwhelm+0.782
 overpower+0.700
 confuse+0.691
 overshadow+0.654
secut+0.651
Neg logit contributions
-0.784
 Hed-0.722
 Holly-0.681
 Polly-0.680
Pie-0.678
Tokenimage
Token ID9060
Feature activation+5.08e-01

Pos logit contributions
 overwhelm+2.234
 overpower+2.019
 confuse+2.006
secut+1.902
 overshadow+1.896
Neg logit contributions
-1.846
 Hed-1.687
 Polly-1.577
 Holly-1.569
 Katz-1.561
Token.
Token ID13
Feature activation-1.70e+00

Pos logit contributions
 overwhelm+0.382
 overpower+0.339
 confuse+0.334
 overshadow+0.314
secut+0.313
Neg logit contributions
-0.448
 Hed-0.415
 Holly-0.393
 Polly-0.392
Pie-0.391
Token be
Token ID307
Feature activation-5.72e-01

Pos logit contributions
+0.637
 Hed+0.601
 Holly+0.567
 Polly+0.566
 Katz+0.564
Neg logit contributions
 overwhelm-0.578
 overpower-0.513
 confuse-0.498
secut-0.485
 overshadow-0.477
Token removed
Token ID4615
Feature activation-2.09e-01

Pos logit contributions
+0.508
 Hed+0.478
 Polly+0.453
 Holly+0.453
 Katz+0.449
Neg logit contributions
 overwhelm-0.419
 overpower-0.367
 confuse-0.360
 overshadow-0.341
secut-0.339
Token.
Token ID13
Feature activation+1.42e+00

Pos logit contributions
+0.177
 Hed+0.165
 Polly+0.156
 Holly+0.156
Pie+0.154
Neg logit contributions
 overwhelm-0.164
 overpower-0.145
 confuse-0.143
 overshadow-0.135
secut-0.135
Token Example
Token ID17934
Feature activation+4.77e+00

Pos logit contributions
 overwhelm+1.228
 overpower+1.108
 confuse+1.097
 overshadow+1.039
secut+1.035
Neg logit contributions
-1.088
 Hed-0.988
 Holly-0.928
 Polly-0.926
Pie-0.926
Token:
Token ID25
Feature activation+6.25e+00

Pos logit contributions
 overwhelm+3.915
 overpower+3.518
 confuse+3.497
 overshadow+3.262
secut+3.247
Neg logit contributions
-3.870
 Hed-3.482
 Holly-3.288
 Polly-3.284
Ton-3.284
Token /*
Token ID11900
Feature activation+7.50e+00

Pos logit contributions
 overwhelm+4.608
 overpower+4.110
 confuse+4.089
 overshadow+3.774
secut+3.729
Neg logit contributions
-5.580
 Hed-4.986
ertodd-4.783
Ton-4.774
Pie-4.756
Token vi
Token ID25357
Feature activation-6.11e-01

Pos logit contributions
 overwhelm+4.966
 overpower+4.381
 confuse+4.363
 overshadow+3.966
secut+3.944
Neg logit contributions
-7.151
 Hed-6.458
ertodd-6.230
Ton-6.208
Pie-6.180
Token:
Token ID25
Feature activation+4.53e-01

Pos logit contributions
+0.452
 Hed+0.414
 Holly+0.388
Pie+0.387
 Polly+0.386
Neg logit contributions
 overwhelm-0.543
 overpower-0.491
 confuse-0.485
 overshadow-0.462
secut-0.461
Tokenset
Token ID2617
Feature activation-5.74e-01

Pos logit contributions
 overwhelm+0.454
 overpower+0.415
 confuse+0.411
 overshadow+0.393
secut+0.392
Neg logit contributions
-0.286
 Hed-0.257
 Holly-0.237
 Polly-0.237
Pie-0.236
Token dir
Token ID26672
Feature activation-5.06e-01

Pos logit contributions
+0.452
 Hed+0.420
 Holly+0.393
 Polly+0.393
Pie+0.391
Neg logit contributions
 overwhelm-0.480
 overpower-0.431
 confuse-0.426
 overshadow-0.403
secut-0.402
Token=
Token ID28
Feature activation+7.95e-01

Pos logit contributions
+0.433
 Hed+0.400
 Holly+0.379
Pie+0.379
 Polly+0.378
Neg logit contributions
 overwhelm-0.392
 overpower-0.349
 confuse-0.344
 overshadow-0.325
secut-0.322
Token as
Token ID355
Feature activation-4.16e-01

Pos logit contributions
+2.015
 Hed+1.917
 Holly+1.835
 Polly+1.830
Pie+1.826
Neg logit contributions
 overwhelm-0.990
 overpower-0.826
 confuse-0.811
 overshadow-0.753
secut-0.750
Token follows
Token ID5679
Feature activation+2.08e+00

Pos logit contributions
+0.347
 Hed+0.322
 Holly+0.304
 Polly+0.303
Pie+0.302
Neg logit contributions
 overwhelm-0.330
 overpower-0.295
 confuse-0.291
 overshadow-0.275
secut-0.274
Token.
Token ID13
Feature activation+1.04e+00

Pos logit contributions
 overwhelm+1.779
 overpower+1.602
 confuse+1.584
 overshadow+1.499
secut+1.496
Neg logit contributions
-1.614
 Hed-1.472
 Holly-1.382
 Polly-1.379
Pie-1.374
Token
Token ID198
Feature activation+1.72e+00

Pos logit contributions
 overwhelm+1.048
 overpower+0.960
 confuse+0.950
 overshadow+0.909
secut+0.907
Neg logit contributions
-0.650
 Hed-0.581
 Holly-0.537
 Polly-0.536
Pie-0.534
Token
Token ID198
Feature activation+2.11e+00

Pos logit contributions
 overwhelm+1.664
 overpower+1.524
 confuse+1.504
secut+1.439
 surpass+1.423
Neg logit contributions
-1.144
 Hed-1.005
ertodd-0.933
 Holly-0.928
 Polly-0.925
Token#
Token ID2
Feature activation+7.46e+00

Pos logit contributions
 overwhelm+1.610
 overpower+1.432
 confuse+1.411
secut+1.328
 overshadow+1.323
Neg logit contributions
-1.823
 Hed-1.696
 Polly-1.605
 Holly-1.597
 Katz-1.593
Token!/
Token ID48443
Feature activation-4.62e-01

Pos logit contributions
 overwhelm+5.543
 overpower+4.995
 confuse+4.902
 overshadow+4.558
secut+4.479
Neg logit contributions
-6.448
 Hed-5.822
ertodd-5.587
 Katz-5.526
 Polly-5.525
Tokenbin
Token ID8800
Feature activation-1.45e+00

Pos logit contributions
+0.163
 Hed+0.133
 Holly+0.113
 Polly+0.112
Pie+0.111
Neg logit contributions
 overwhelm-0.591
 overpower-0.552
 confuse-0.548
 overshadow-0.529
secut-0.528
Token/
Token ID14
Feature activation-2.30e+00

Pos logit contributions
+0.953
 Hed+0.861
 Holly+0.798
 Polly+0.795
Pie+0.793
Neg logit contributions
 overwhelm-1.417
 overpower-1.291
 confuse-1.279
 overshadow-1.225
secut-1.221
Tokensh
Token ID1477
Feature activation-6.88e-01

Pos logit contributions
+1.881
 Hed+1.737
 Holly+1.635
Pie+1.634
 Polly+1.629
Neg logit contributions
 overwhelm-1.860
 overpower-1.662
 confuse-1.639
 overshadow-1.559
secut-1.546
Token /
Token ID1220
Feature activation+6.49e-01

Pos logit contributions
+0.566
 Hed+0.527
 Holly+0.499
 Polly+0.496
Pie+0.493
Neg logit contributions
 overwhelm-0.551
 overpower-0.492
 confuse-0.485
secut-0.463
 overshadow-0.460
Token 23
Token ID2242
Feature activation+1.74e-01

Pos logit contributions
 overwhelm+0.742
 overpower+0.655
 confuse+0.647
 overshadow+0.608
secut+0.605
Neg logit contributions
-0.903
 Hed-0.837
 Holly-0.795
 Polly-0.793
Pie-0.792
Token 24
Token ID1987
Feature activation+1.35e+00

Pos logit contributions
 overwhelm+0.112
 overpower+0.097
 confuse+0.096
 overshadow+0.090
secut+0.089
Neg logit contributions
-0.170
 Hed-0.159
 Holly-0.152
 Polly-0.152
Pie-0.152
Token 25
Token ID1679
Feature activation+1.09e+00

Pos logit contributions
 overwhelm+1.045
 overpower+0.932
 confuse+0.919
 overshadow+0.863
secut+0.858
Neg logit contributions
-1.169
 Hed-1.075
 Holly-1.015
 Polly-1.014
Pie-1.013
Token 26
Token ID2608
Feature activation+1.42e+00

Pos logit contributions
 overwhelm+0.771
 overpower+0.678
 confuse+0.666
 overshadow+0.628
secut+0.626
Neg logit contributions
-1.008
 Hed-0.940
 Holly-0.895
Pie-0.891
 Polly-0.891
Token 27
Token ID2681
Feature activation+2.16e+00

Pos logit contributions
 overwhelm+1.055
 overpower+0.935
 confuse+0.921
 overshadow+0.864
secut+0.860
Neg logit contributions
-1.270
 Hed-1.178
 Holly-1.116
 Polly-1.115
Pie-1.112
Token #
Token ID1303
Feature activation+7.23e+00

Pos logit contributions
 overwhelm+1.651
 overpower+1.467
 confuse+1.444
 overshadow+1.354
secut+1.353
Neg logit contributions
-1.876
 Hed-1.731
 Holly-1.640
 Polly-1.637
Pie-1.636
Token loop
Token ID9052
Feature activation+1.14e+00

Pos logit contributions
 overwhelm+5.519
 overpower+4.925
 confuse+4.924
secut+4.488
 overshadow+4.481
Neg logit contributions
-6.162
 Hed-5.574
ertodd-5.301
 Holly-5.211
 Polly-5.210
Token over
Token ID625
Feature activation-2.01e+00

Pos logit contributions
 overwhelm+0.698
 overpower+0.601
 confuse+0.590
 overshadow+0.546
secut+0.543
Neg logit contributions
-1.158
 Hed-1.084
 Holly-1.035
 Polly-1.034
Pie-1.032
Token the
Token ID262
Feature activation-3.77e-01

Pos logit contributions
+1.640
 Hed+1.521
 Holly+1.439
 Polly+1.431
 Katz+1.420
Neg logit contributions
 overwhelm-1.625
 overpower-1.456
 confuse-1.434
 overshadow-1.364
secut-1.358
Token input
Token ID5128
Feature activation+7.26e-01

Pos logit contributions
+0.348
 Hed+0.324
 Holly+0.308
 Polly+0.308
Pie+0.307
Neg logit contributions
 overwhelm-0.266
 overpower-0.234
 confuse-0.231
 overshadow-0.216
secut-0.215
Token image
Token ID2939
Feature activation+4.71e-01

Pos logit contributions
 overwhelm+0.685
 overpower+0.623
 confuse+0.617
 overshadow+0.588
secut+0.583
Neg logit contributions
-0.502
 Hed-0.454
Pie-0.423
 Polly-0.422
 Holly-0.421
Tokenma
Token ID2611
Feature activation-2.02e+00

Pos logit contributions
+0.287
 Hed+0.249
 Holly+0.223
 Polly+0.223
Pie+0.220
Neg logit contributions
 overwhelm-0.691
 overpower-0.639
 confuse-0.634
 overshadow-0.611
secut-0.609
Token directive
Token ID22644
Feature activation-2.25e+00

Pos logit contributions
+1.618
 Hed+1.509
 Polly+1.414
 Holly+1.407
Pie+1.406
Neg logit contributions
 overwhelm-1.652
 overpower-1.478
 confuse-1.458
secut-1.390
 overshadow-1.377
Token.
Token ID13
Feature activation+2.22e-02

Pos logit contributions
+1.775
 Hed+1.650
 Polly+1.552
 Holly+1.546
Pie+1.536
Neg logit contributions
 overwhelm-1.867
 overpower-1.673
 confuse-1.650
secut-1.575
 overshadow-1.566
Token
Token ID198
Feature activation-4.43e-01

Pos logit contributions
 overwhelm+0.021
 overpower+0.019
 confuse+0.019
 overshadow+0.018
secut+0.018
Neg logit contributions
-0.015
 Hed-0.014
 Holly-0.013
 Polly-0.013
Pie-0.013
Token
Token ID198
Feature activation-2.51e-01

Pos logit contributions
+0.270
 Hed+0.240
 Holly+0.221
 Polly+0.220
Pie+0.218
Neg logit contributions
 overwhelm-0.453
 overpower-0.415
 confuse-0.411
secut-0.393
 overshadow-0.393
Token#
Token ID2
Feature activation+7.18e+00

Pos logit contributions
+0.185
 Hed+0.169
Pie+0.159
 Holly+0.158
 Polly+0.158
Neg logit contributions
 overwhelm-0.224
 overpower-0.203
 confuse-0.201
 overshadow-0.191
secut-0.190
Tokenpr
Token ID1050
Feature activation-2.27e+00

Pos logit contributions
 overwhelm+5.111
 confuse+4.559
 overpower+4.528
 overshadow+4.196
secut+4.127
Neg logit contributions
-6.469
 Hed-5.899
 Holly-5.596
ertodd-5.558
 Polly-5.552
Tokenag
Token ID363
Feature activation-3.91e+00

Pos logit contributions
+2.014
 Hed+1.897
 Polly+1.822
 Katz+1.805
 Holly+1.798
Neg logit contributions
 overwhelm-1.615
 overpower-1.420
 confuse-1.385
 overshadow-1.313
secut-1.313
Tokenma
Token ID2611
Feature activation-2.17e+00

Pos logit contributions
+1.746
 Hed+1.506
 Polly+1.326
 Holly+1.325
 Katz+1.308
Neg logit contributions
 overwhelm-4.618
 overpower-4.283
 confuse-4.221
secut-4.091
 overshadow-4.089
Token surface
Token ID4417
Feature activation-1.01e+00

Pos logit contributions
+1.876
 Hed+1.779
 Holly+1.668
 Polly+1.666
Pie+1.664
Neg logit contributions
 overwhelm-1.617
 overpower-1.428
 confuse-1.409
secut-1.343
 overshadow-1.314
Token surf
Token ID9053
Feature activation+2.17e-01

Pos logit contributions
+0.842
 Hed+0.779
Pie+0.734
 Holly+0.733
 Polly+0.732
Neg logit contributions
 overwhelm-0.812
 overpower-0.726
 confuse-0.714
secut-0.675
 overshadow-0.675
Token thus
Token ID4145
Feature activation+4.97e-01

Pos logit contributions
 overwhelm+0.064
 overpower+0.057
 confuse+0.056
 overshadow+0.053
secut+0.053
Neg logit contributions
-0.070
 Hed-0.065
 Holly-0.061
 Polly-0.061
Pie-0.061
Token a
Token ID257
Feature activation+3.31e-01

Pos logit contributions
 overwhelm+0.388
 overpower+0.346
 confuse+0.342
 overshadow+0.322
secut+0.321
Neg logit contributions
-0.423
 Hed-0.390
 Holly-0.369
 Polly-0.368
Pie-0.368
Token line
Token ID1627
Feature activation+2.67e+00

Pos logit contributions
 overwhelm+0.248
 overpower+0.219
 confuse+0.216
 overshadow+0.203
secut+0.202
Neg logit contributions
-0.292
 Hed-0.272
 Holly-0.257
 Polly-0.257
Pie-0.256
Token like
Token ID588
Feature activation+1.78e+00

Pos logit contributions
 overwhelm+2.046
 overpower+1.820
 confuse+1.796
 overshadow+1.691
secut+1.684
Neg logit contributions
-2.325
 Hed-2.138
 Polly-2.022
 Holly-2.020
Pie-2.019
Token:
Token ID25
Feature activation+2.56e+00

Pos logit contributions
 overwhelm+1.433
 overpower+1.280
 confuse+1.268
 overshadow+1.191
secut+1.187
Neg logit contributions
-1.476
 Hed-1.345
 Holly-1.273
Pie-1.270
 Polly-1.270
Token /*
Token ID11900
Feature activation+7.03e+00

Pos logit contributions
 overwhelm+1.988
 overpower+1.774
 confuse+1.753
 overshadow+1.646
secut+1.637
Neg logit contributions
-2.200
 Hed-2.010
 Holly-1.904
 Polly-1.899
Pie-1.897
Token vi
Token ID25357
Feature activation-3.68e-01

Pos logit contributions
 overwhelm+4.823
 overpower+4.259
 confuse+4.243
 overshadow+3.880
secut+3.864
Neg logit contributions
-6.555
 Hed-5.940
ertodd-5.759
 Katz-5.667
Ton-5.663
Token:
Token ID25
Feature activation-1.79e-01

Pos logit contributions
+0.285
 Hed+0.262
 Holly+0.246
 Polly+0.245
Pie+0.245
Neg logit contributions
 overwhelm-0.314
 overpower-0.283
 confuse-0.279
 overshadow-0.265
secut-0.265
Tokents
Token ID912
Feature activation-5.37e-01

Pos logit contributions
+0.149
 Hed+0.138
 Holly+0.130
 Polly+0.130
Pie+0.129
Neg logit contributions
 overwhelm-0.143
 overpower-0.128
 confuse-0.126
 overshadow-0.119
secut-0.119
Token=
Token ID28
Feature activation+1.92e+00

Pos logit contributions
+0.445
 Hed+0.415
 Holly+0.391
Pie+0.390
 Polly+0.389
Neg logit contributions
 overwhelm-0.425
 overpower-0.380
 confuse-0.375
 overshadow-0.355
secut-0.353
Token4
Token ID19
Feature activation+6.95e-01

Pos logit contributions
 overwhelm+1.719
 overpower+1.558
 confuse+1.539
 overshadow+1.464
secut+1.461
Neg logit contributions
-1.409
 Hed-1.283
 Holly-1.200
 Polly-1.196
ertodd-1.188
Token (
Token ID357
Feature activation+2.03e-01

Pos logit contributions
 overwhelm+0.142
 overpower+0.127
 confuse+0.125
 overshadow+0.119
secut+0.118
Neg logit contributions
-0.138
 Hed-0.128
 Holly-0.120
 Polly-0.120
Pie-0.120
Tokencan
Token ID5171
Feature activation+3.24e-01

Pos logit contributions
 overwhelm+0.182
 overpower+0.165
 confuse+0.163
 overshadow+0.155
secut+0.154
Neg logit contributions
-0.150
 Hed-0.137
 Holly-0.128
 Polly-0.128
Pie-0.128
Token be
Token ID307
Feature activation+9.41e-01

Pos logit contributions
 overwhelm+0.256
 overpower+0.228
 confuse+0.224
 overshadow+0.213
secut+0.212
Neg logit contributions
-0.270
 Hed-0.250
 Holly-0.236
 Polly-0.235
Pie-0.235
Token empty
Token ID6565
Feature activation+2.94e-01

Pos logit contributions
 overwhelm+0.694
 overpower+0.612
 confuse+0.603
 overshadow+0.570
secut+0.565
Neg logit contributions
-0.837
 Hed-0.782
 Holly-0.741
 Polly-0.739
Pie-0.736
Token)
Token ID8
Feature activation+2.03e+00

Pos logit contributions
 overwhelm+0.239
 overpower+0.214
 confuse+0.211
 overshadow+0.200
secut+0.199
Neg logit contributions
-0.240
 Hed-0.222
 Holly-0.209
 Polly-0.209
Pie-0.208
Token Example
Token ID17934
Feature activation+6.83e+00

Pos logit contributions
 overwhelm+1.742
 overpower+1.571
 confuse+1.554
 overshadow+1.471
secut+1.464
Neg logit contributions
-1.581
 Hed-1.435
 Holly-1.348
 Polly-1.346
Pie-1.344
Token:
Token ID25
Feature activation+4.66e+00

Pos logit contributions
 overwhelm+5.458
 overpower+4.898
 confuse+4.847
 overshadow+4.512
secut+4.457
Neg logit contributions
-5.666
 Hed-5.014
ertodd-4.804
Ton-4.779
 Polly-4.753
Token vi
Token ID25357
Feature activation+1.22e+00

Pos logit contributions
 overwhelm+3.963
 overpower+3.573
 confuse+3.542
 overshadow+3.328
secut+3.302
Neg logit contributions
-3.667
 Hed-3.275
ertodd-3.101
Ton-3.090
 Polly-3.089
Token:
Token ID25
Feature activation+1.57e+00

Pos logit contributions
 overwhelm+1.037
 overpower+0.933
 confuse+0.923
 overshadow+0.874
secut+0.870
Neg logit contributions
-0.958
 Hed-0.876
 Holly-0.824
 Polly-0.823
Pie-0.821
Tokenno
Token ID3919
Feature activation+5.77e-01

Pos logit contributions
 overwhelm+1.392
 overpower+1.259
 confuse+1.245
 overshadow+1.183
secut+1.179
Neg logit contributions
-1.166
 Hed-1.061
 Holly-0.994
 Polly-0.993
Pie-0.988
Tokenai
Token ID1872
Feature activation+2.63e+00

Pos logit contributions
 overwhelm+0.471
 overpower+0.421
 confuse+0.416
 overshadow+0.394
secut+0.391
Neg logit contributions
-0.470
 Hed-0.434
 Holly-0.410
Pie-0.409
 Polly-0.408
Token version
Token ID2196
Feature activation+1.14e+00

Pos logit contributions
 overwhelm+0.068
 overpower+0.061
 confuse+0.060
 overshadow+0.057
secut+0.057
Neg logit contributions
-0.068
 Hed-0.063
 Holly-0.060
 Polly-0.059
Pie-0.059
Token 5
Token ID642
Feature activation-7.00e-01

Pos logit contributions
 overwhelm+0.943
 overpower+0.846
 confuse+0.836
 overshadow+0.791
secut+0.788
Neg logit contributions
-0.919
 Hed-0.845
 Holly-0.796
 Polly-0.794
Pie-0.793
Token.
Token ID13
Feature activation-8.28e-01

Pos logit contributions
+0.551
 Hed+0.512
 Holly+0.481
 Polly+0.480
Pie+0.478
Neg logit contributions
 overwhelm-0.585
 overpower-0.525
 confuse-0.519
 overshadow-0.492
secut-0.492
Token7
Token ID22
Feature activation-7.27e-01

Pos logit contributions
+0.614
 Hed+0.564
 Holly+0.528
Pie+0.526
 Polly+0.526
Neg logit contributions
 overwhelm-0.737
 overpower-0.666
 confuse-0.658
 overshadow-0.627
secut-0.623
Token:
Token ID25
Feature activation+6.65e-01

Pos logit contributions
+0.557
 Hed+0.514
 Holly+0.482
 Polly+0.482
Pie+0.480
Neg logit contributions
 overwhelm-0.625
 overpower-0.563
 confuse-0.556
 overshadow-0.529
secut-0.528
Token /*
Token ID11900
Feature activation+6.78e+00

Pos logit contributions
 overwhelm+0.496
 overpower+0.440
 confuse+0.434
 overshadow+0.408
secut+0.406
Neg logit contributions
-0.589
 Hed-0.546
 Holly-0.517
 Polly-0.516
Pie-0.515
Token vim
Token ID43907
Feature activation-6.68e-01

Pos logit contributions
 overwhelm+4.461
 overpower+3.916
 confuse+3.907
 overshadow+3.560
secut+3.525
Neg logit contributions
-6.547
 Hed-5.950
ertodd-5.753
Ton-5.698
 Holly-5.695
Token<
Token ID27
Feature activation+1.89e-01

Pos logit contributions
+0.543
 Hed+0.501
 Holly+0.472
Pie+0.472
 Polly+0.471
Neg logit contributions
 overwhelm-0.545
 overpower-0.489
 confuse-0.482
 overshadow-0.456
secut-0.454
Token570
Token ID39254
Feature activation+3.25e-02

Pos logit contributions
 overwhelm+0.174
 overpower+0.158
 confuse+0.156
 overshadow+0.148
secut+0.148
Neg logit contributions
-0.134
 Hed-0.122
 Holly-0.114
 Polly-0.114
Pie-0.114
Token:
Token ID25
Feature activation+8.53e-01

Pos logit contributions
 overwhelm+0.029
 overpower+0.026
 confuse+0.026
 overshadow+0.025
secut+0.025
Neg logit contributions
-0.024
 Hed-0.022
 Holly-0.021
 Polly-0.021
Pie-0.021
Token set
Token ID900
Feature activation+1.36e+00

Pos logit contributions
 overwhelm+0.655
 overpower+0.583
 confuse+0.575
 overshadow+0.542
secut+0.539
Neg logit contributions
-0.738
 Hed-0.682
 Holly-0.646
 Polly-0.645
Pie-0.643
Token of
Token ID286
Feature activation-7.61e-01

Pos logit contributions
+2.097
 Hed+1.952
 Polly+1.829
 Holly+1.824
 Katz+1.815
Neg logit contributions
 overwhelm-2.099
 overpower-1.880
 confuse-1.851
 overshadow-1.748
secut-1.745
Token like
Token ID588
Feature activation+8.90e-01

Pos logit contributions
+0.681
 Hed+0.636
 Holly+0.604
 Polly+0.603
 Katz+0.599
Neg logit contributions
 overwhelm-0.557
 overpower-0.490
 confuse-0.485
secut-0.457
 overshadow-0.453
Token public
Token ID1171
Feature activation-3.06e-01

Pos logit contributions
 overwhelm+0.681
 overpower+0.607
 confuse+0.596
 overshadow+0.565
secut+0.558
Neg logit contributions
-0.776
 Hed-0.711
Pie-0.675
 Holly-0.672
 Polly-0.672
Tokenizing
Token ID2890
Feature activation-8.83e-01

Pos logit contributions
+0.267
 Hed+0.247
 Holly+0.234
 Polly+0.233
Pie+0.233
Neg logit contributions
 overwhelm-0.233
 overpower-0.207
 confuse-0.204
 overshadow-0.192
secut-0.191
Token mar
Token ID1667
Feature activation-1.61e-01

Pos logit contributions
+0.728
 Hed+0.671
 Holly+0.633
 Polly+0.632
Pie+0.630
Neg logit contributions
 overwhelm-0.714
 overpower-0.639
 confuse-0.631
 overshadow-0.596
secut-0.594
Tokenath
Token ID776
Feature activation+6.76e+00

Pos logit contributions
+0.117
 Hed+0.107
 Holly+0.100
 Polly+0.099
Pie+0.099
Neg logit contributions
 overwhelm-0.146
 overpower-0.133
 confuse-0.131
 overshadow-0.125
secut-0.125
Tokenons
Token ID684
Feature activation+2.01e-01

Pos logit contributions
 overwhelm+6.389
 confuse+5.839
 overpower+5.787
 overshadow+5.527
secut+5.389
Neg logit contributions
-4.546
 Hed-3.999
 Holly-3.780
ertodd-3.743
Pie-3.691
Token to
Token ID284
Feature activation-2.94e-01

Pos logit contributions
 overwhelm+0.162
 overpower+0.145
 confuse+0.143
 overshadow+0.135
secut+0.135
Neg logit contributions
-0.166
 Hed-0.153
 Holly-0.144
 Polly-0.144
Pie-0.144
Token promote
Token ID7719
Feature activation+9.56e-01

Pos logit contributions
+0.257
 Hed+0.238
 Holly+0.226
 Polly+0.226
Pie+0.225
Neg logit contributions
 overwhelm-0.221
 overpower-0.197
 confuse-0.194
 overshadow-0.182
secut-0.182
Token j
Token ID474
Feature activation+6.98e-01

Pos logit contributions
 overwhelm+0.738
 overpower+0.656
 confuse+0.645
 overshadow+0.609
secut+0.601
Neg logit contributions
-0.831
 Hed-0.758
Pie-0.721
ertodd-0.719
 Holly-0.715
Tokenogging
Token ID30853
Feature activation-5.56e-01

Pos logit contributions
 overwhelm+0.539
 overpower+0.479
 confuse+0.473
secut+0.445
 overshadow+0.445
Neg logit contributions
-0.605
 Hed-0.557
 Holly-0.525
 Polly-0.524
Pie-0.521
Token Atomic
Token ID28976
Feature activation+2.13e-01

Pos logit contributions
 overwhelm+0.450
 overpower+0.414
 confuse+0.411
 overshadow+0.393
secut+0.392
Neg logit contributions
-0.243
 Hed-0.212
Pie-0.194
 Holly-0.194
 Polly-0.193
Token <
Token ID1279
Feature activation-2.05e+00

Pos logit contributions
 overwhelm+0.177
 overpower+0.159
 confuse+0.157
 overshadow+0.149
secut+0.148
Neg logit contributions
-0.171
 Hed-0.157
 Holly-0.148
 Polly-0.148
Pie-0.147
Token T
Token ID309
Feature activation-1.43e+00

Pos logit contributions
+1.418
 Hed+1.293
 Polly+1.208
 Holly+1.204
Pie+1.199
Neg logit contributions
 overwhelm-1.916
 overpower-1.740
 confuse-1.722
secut-1.647
 overshadow-1.644
Token >
Token ID1875
Feature activation+4.86e-04

Pos logit contributions
+0.918
 Hed+0.825
 Holly+0.764
 Polly+0.762
Pie+0.760
Neg logit contributions
 overwhelm-1.421
 overpower-1.299
 confuse-1.286
 overshadow-1.230
secut-1.226
Token {
Token ID1391
Feature activation+1.84e+00

Pos logit contributions
 overwhelm+0.000
 overpower+0.000
 confuse+0.000
 overshadow+0.000
secut+0.000
Neg logit contributions
-0.000
 Hed-0.000
 Holly-0.000
 Polly-0.000
Pie-0.000
Token ///
Token ID34013
Feature activation+6.74e+00

Pos logit contributions
 overwhelm+1.445
 overpower+1.288
 confuse+1.280
 overshadow+1.199
secut+1.191
Neg logit contributions
-1.560
 Hed-1.434
Pie-1.353
 Holly-1.353
 Polly-1.348
Token Create
Token ID13610
Feature activation-1.61e-01

Pos logit contributions
 overwhelm+5.673
 confuse+5.131
 overpower+5.114
 surpass+4.737
 overshadow+4.719
Neg logit contributions
-5.300
 Hed-4.508
ertodd-4.390
Pie-4.384
Ton-4.296
Token a
Token ID257
Feature activation-1.07e+00

Pos logit contributions
+0.132
 Hed+0.121
 Holly+0.114
Pie+0.113
 Polly+0.113
Neg logit contributions
 overwhelm-0.132
 overpower-0.118
 confuse-0.117
 overshadow-0.110
secut-0.109
Token new
Token ID649
Feature activation-1.36e+00

Pos logit contributions
+0.909
 Hed+0.841
 Holly+0.794
 Polly+0.793
Pie+0.791
Neg logit contributions
 overwhelm-0.830
 overpower-0.740
 confuse-0.730
 overshadow-0.688
secut-0.686
Token,
Token ID11
Feature activation-6.78e-01

Pos logit contributions
+0.964
 Hed+0.876
 Holly+0.817
 Polly+0.816
Pie+0.813
Neg logit contributions
 overwhelm-1.255
 overpower-1.140
 confuse-1.127
 overshadow-1.074
secut-1.071
Token null
Token ID9242
Feature activation-1.71e+00

Pos logit contributions
+0.607
 Hed+0.563
 Holly+0.533
 Polly+0.533
Pie+0.532
Neg logit contributions
 overwhelm-0.500
 overpower-0.442
 confuse-0.436
 overshadow-0.409
secut-0.407
Token break
Token ID2270
Feature activation+3.79e+00

Pos logit contributions
 overwhelm+1.202
 overpower+1.067
 confuse+1.050
 overshadow+0.989
secut+0.987
Neg logit contributions
-1.402
 Hed-1.303
 Holly-1.235
 Polly-1.232
Pie-1.229
Token }
Token ID1782
Feature activation+4.31e+00

Pos logit contributions
 overwhelm+2.767
 overpower+2.440
 confuse+2.422
 overshadow+2.261
secut+2.247
Neg logit contributions
-3.418
 Hed-3.150
 Polly-2.996
Pie-2.981
Ton-2.977
Token }
Token ID1782
Feature activation+4.09e+00

Pos logit contributions
 overwhelm+3.203
 overpower+2.839
 confuse+2.814
 overshadow+2.619
secut+2.599
Neg logit contributions
-3.845
 Hed-3.499
 Polly-3.330
Pie-3.326
Ton-3.322
Token Process
Token ID10854
Feature activation+3.62e-01

Pos logit contributions
 overwhelm+3.183
 overpower+2.837
 confuse+2.813
 overshadow+2.627
secut+2.611
Neg logit contributions
-3.502
 Hed-3.175
 Polly-3.012
Pie-3.009
 Holly-3.007
Token {
Token ID1391
Feature activation+2.25e+00

Pos logit contributions
 overwhelm+0.293
 overpower+0.262
 confuse+0.258
 overshadow+0.244
secut+0.243
Neg logit contributions
-0.298
 Hed-0.275
 Holly-0.259
 Polly-0.259
Pie-0.258
Token #
Token ID1303
Feature activation+6.71e+00

Pos logit contributions
 overwhelm+1.838
 overpower+1.645
 confuse+1.627
 overshadow+1.535
secut+1.528
Neg logit contributions
-1.844
 Hed-1.692
 Holly-1.596
 Polly-1.594
Pie-1.591
Token Split
Token ID27758
Feature activation+1.83e+00

Pos logit contributions
 overwhelm+5.578
 confuse+5.010
 overpower+4.999
secut+4.651
 overshadow+4.634
Neg logit contributions
-5.268
 Hed-4.690
ertodd-4.492
Ton-4.423
 Polly-4.418
Token our
Token ID674
Feature activation+3.84e-01

Pos logit contributions
 overwhelm+1.484
 overpower+1.327
 confuse+1.316
 overshadow+1.240
secut+1.223
Neg logit contributions
-1.519
 Hed-1.378
 Polly-1.302
ertodd-1.302
 Holly-1.299
Token string
Token ID4731
Feature activation-8.32e-02

Pos logit contributions
 overwhelm+0.313
 overpower+0.280
 confuse+0.277
 overshadow+0.262
secut+0.259
Neg logit contributions
-0.316
 Hed-0.289
 Holly-0.272
 Polly-0.272
Pie-0.272
Token as
Token ID355
Feature activation+7.65e-01

Pos logit contributions
+0.071
 Hed+0.066
 Holly+0.062
Pie+0.062
 Polly+0.062
Neg logit contributions
 overwhelm-0.065
 overpower-0.058
 confuse-0.057
 overshadow-0.054
secut-0.054
Token it
Token ID340
Feature activation+1.77e-01

Pos logit contributions
 overwhelm+0.605
 overpower+0.539
 confuse+0.533
 overshadow+0.502
secut+0.500
Neg logit contributions
-0.645
 Hed-0.594
 Holly-0.561
 Polly-0.560
Pie-0.559
Token instead
Token ID2427
Feature activation+1.12e-01

Pos logit contributions
 overwhelm+1.565
 overpower+1.397
 confuse+1.380
 overshadow+1.307
secut+1.298
Neg logit contributions
-1.614
 Hed-1.478
 Polly-1.398
 Holly-1.398
Pie-1.397
Token of
Token ID286
Feature activation+7.65e-01

Pos logit contributions
 overwhelm+0.090
 overpower+0.080
 confuse+0.079
 overshadow+0.075
secut+0.075
Neg logit contributions
-0.092
 Hed-0.085
 Holly-0.080
 Polly-0.080
Pie-0.080
Token playing
Token ID2712
Feature activation+1.32e+00

Pos logit contributions
 overwhelm+0.605
 overpower+0.541
 confuse+0.533
 overshadow+0.504
secut+0.501
Neg logit contributions
-0.646
 Hed-0.591
Pie-0.561
 Polly-0.560
 Holly-0.559
Token in
Token ID287
Feature activation+9.02e-01

Pos logit contributions
 overwhelm+0.993
 overpower+0.882
 confuse+0.868
 overshadow+0.818
secut+0.814
Neg logit contributions
-1.171
 Hed-1.076
ertodd-1.023
Pie-1.023
 Polly-1.022
Token second
Token ID1218
Feature activation+3.94e+00

Pos logit contributions
 overwhelm+0.691
 overpower+0.615
 confuse+0.605
 overshadow+0.571
secut+0.569
Neg logit contributions
-0.782
 Hed-0.719
Pie-0.683
 Polly-0.682
 Holly-0.681
Token division
Token ID7297
Feature activation+6.66e+00

Pos logit contributions
 overwhelm+2.205
 overpower+1.882
 confuse+1.858
 overshadow+1.701
secut+1.680
Neg logit contributions
-4.197
 Hed-3.921
Pie-3.773
ertodd-3.773
 Holly-3.759
Token in
Token ID287
Feature activation+4.83e-01

Pos logit contributions
 overwhelm+5.045
 overpower+4.461
 confuse+4.435
 overshadow+4.203
secut+4.201
Neg logit contributions
-5.680
 Hed-5.123
ertodd-5.029
Ton-4.960
Pie-4.905
Token Turkey
Token ID7137
Feature activation+4.19e-01

Pos logit contributions
 overwhelm+0.347
 overpower+0.306
 confuse+0.301
 overshadow+0.282
secut+0.281
Neg logit contributions
-0.442
 Hed-0.409
 Polly-0.388
 Holly-0.388
Pie-0.388
Token?
Token ID30
Feature activation-6.83e-02

Pos logit contributions
 overwhelm+0.350
 overpower+0.314
 confuse+0.310
 overshadow+0.294
secut+0.293
Neg logit contributions
-0.334
 Hed-0.307
 Holly-0.289
 Polly-0.289
Pie-0.288
Token I
Token ID314
Feature activation-1.29e+00

Pos logit contributions
+0.052
 Hed+0.048
 Holly+0.045
 Polly+0.045
Pie+0.045
Neg logit contributions
 overwhelm-0.059
 overpower-0.054
 confuse-0.053
 overshadow-0.050
secut-0.050
Token can
Token ID460
Feature activation+8.79e-01

Pos logit contributions
+1.092
 Hed+1.008
 Holly+0.950
 Polly+0.949
Pie+0.947
Neg logit contributions
 overwhelm-1.012
 overpower-0.903
 confuse-0.886
 overshadow-0.843
secut-0.839
Token any
Token ID597
Feature activation-1.95e+00

Pos logit contributions
 overwhelm+0.549
 overpower+0.486
 confuse+0.479
 overshadow+0.452
secut+0.449
Neg logit contributions
-0.637
 Hed-0.592
 Holly-0.560
 Polly-0.559
Pie-0.559
Token text
Token ID2420
Feature activation+1.37e-01

Pos logit contributions
+1.513
 Hed+1.437
 Holly+1.353
 Polly+1.350
Pie+1.344
Neg logit contributions
 overwhelm-1.610
 overpower-1.442
 confuse-1.417
 overshadow-1.353
secut-1.341
Token or
Token ID393
Feature activation-2.33e+00

Pos logit contributions
 overwhelm+0.110
 overpower+0.098
 confuse+0.097
 overshadow+0.092
secut+0.091
Neg logit contributions
-0.112
 Hed-0.104
 Holly-0.098
 Polly-0.098
Pie-0.098
Token empty
Token ID6565
Feature activation-1.39e+00

Pos logit contributions
+2.264
 Hed+2.166
 Holly+2.062
 Polly+2.061
Pie+2.053
Neg logit contributions
 overwhelm-1.455
 overpower-1.259
 confuse-1.227
 overshadow-1.151
secut-1.146
Token Example
Token ID17934
Feature activation+4.72e+00

Pos logit contributions
+1.099
 Hed+1.027
 Holly+0.969
 Polly+0.967
Pie+0.963
Neg logit contributions
 overwhelm-1.146
 overpower-1.027
 confuse-1.013
 overshadow-0.964
secut-0.958
Token:
Token ID25
Feature activation+6.61e+00

Pos logit contributions
 overwhelm+3.864
 overpower+3.472
 confuse+3.442
 overshadow+3.230
secut+3.221
Neg logit contributions
-3.810
 Hed-3.454
 Polly-3.260
 Holly-3.260
Ton-3.258
Token /*
Token ID11900
Feature activation+5.67e+00

Pos logit contributions
 overwhelm+4.493
 overpower+3.964
 confuse+3.922
 overshadow+3.599
secut+3.590
Neg logit contributions
-6.278
 Hed-5.672
Ton-5.457
ertodd-5.446
 Polly-5.416
Token vim
Token ID43907
Feature activation-1.09e+00

Pos logit contributions
 overwhelm+4.865
 overpower+4.422
 confuse+4.384
secut+4.152
 overshadow+4.116
Neg logit contributions
-4.324
 Hed-3.872
ertodd-3.659
 Polly-3.634
 Holly-3.628
Token:
Token ID25
Feature activation-4.77e-01

Pos logit contributions
+0.800
 Hed+0.740
 Holly+0.694
Pie+0.693
 Polly+0.692
Neg logit contributions
 overwhelm-0.962
 overpower-0.869
 confuse-0.856
 overshadow-0.818
secut-0.814
Token set
Token ID900
Feature activation-8.21e-01

Pos logit contributions
+0.313
 Hed+0.284
 Holly+0.264
Pie+0.263
 Polly+0.262
Neg logit contributions
 overwhelm-0.464
 overpower-0.423
 confuse-0.418
 overshadow-0.400
secut-0.398
Token a
Token ID257
Feature activation-4.14e-01

Pos logit contributions
+0.662
 Hed+0.616
 Holly+0.581
Pie+0.580
 Polly+0.578
Neg logit contributions
 overwhelm-0.671
 overpower-0.599
 confuse-0.592
 overshadow-0.561
secut-0.558
Token any
Token ID597
Feature activation-5.85e-01

Pos logit contributions
+0.647
 Hed+0.598
 Holly+0.564
 Polly+0.563
Pie+0.561
Neg logit contributions
 overwhelm-0.621
 overpower-0.555
 confuse-0.548
 overshadow-0.518
secut-0.516
Token%
Token ID4
Feature activation-4.48e+00

Pos logit contributions
+0.500
 Hed+0.463
 Holly+0.438
 Polly+0.437
Pie+0.435
Neg logit contributions
 overwhelm-0.452
 overpower-0.403
 confuse-0.397
 overshadow-0.375
secut-0.372
Token on
Token ID319
Feature activation-9.49e-01

Pos logit contributions
+3.714
 Hed+3.438
 Holly+3.265
Pie+3.242
 Polly+3.240
Neg logit contributions
 overwhelm-3.474
 overpower-3.111
 confuse-3.055
 overshadow-2.908
secut-2.881
Token two
Token ID734
Feature activation+1.41e+00

Pos logit contributions
+0.780
 Hed+0.721
 Holly+0.680
 Polly+0.679
Pie+0.675
Neg logit contributions
 overwhelm-0.766
 overpower-0.686
 confuse-0.677
 overshadow-0.642
secut-0.637
Token mar
Token ID1667
Feature activation+7.68e-01

Pos logit contributions
 overwhelm+1.068
 overpower+0.948
 confuse+0.934
 overshadow+0.879
secut+0.876
Neg logit contributions
-1.235
 Hed-1.142
 Holly-1.081
 Polly-1.079
Pie-1.079
Tokenath
Token ID776
Feature activation+6.61e+00

Pos logit contributions
 overwhelm+0.686
 overpower+0.620
 confuse+0.617
secut+0.586
 overshadow+0.583
Neg logit contributions
-0.569
 Hed-0.516
 Holly-0.484
 Polly-0.482
Pie-0.480
Tokenons
Token ID684
Feature activation+1.03e+00

Pos logit contributions
 overwhelm+6.339
 confuse+5.792
 overpower+5.772
 overshadow+5.476
secut+5.433
Neg logit contributions
-4.342
 Hed-3.807
 Holly-3.640
ertodd-3.625
 Katz-3.595
Token,
Token ID11
Feature activation+6.41e-01

Pos logit contributions
 overwhelm+0.830
 overpower+0.743
 confuse+0.732
 overshadow+0.693
secut+0.689
Neg logit contributions
-0.848
 Hed-0.779
 Holly-0.733
 Polly-0.732
Pie-0.732
Token it
Token ID340
Feature activation-1.39e-01

Pos logit contributions
 overwhelm+0.520
 overpower+0.466
 confuse+0.460
 overshadow+0.435
secut+0.432
Neg logit contributions
-0.528
 Hed-0.484
 Holly-0.456
Pie-0.456
 Polly-0.456
Token's
Token ID338
Feature activation+1.43e+00

Pos logit contributions
+0.122
 Hed+0.114
 Holly+0.108
 Polly+0.107
Pie+0.107
Neg logit contributions
 overwhelm-0.104
 overpower-0.092
 confuse-0.091
secut-0.085
 overshadow-0.085
Token time
Token ID640
Feature activation+3.45e+00

Pos logit contributions
 overwhelm+1.032
 overpower+0.917
 confuse+0.895
 overshadow+0.843
secut+0.829
Neg logit contributions
-1.306
 Hed-1.200
ertodd-1.146
Pie-1.143
 Polly-1.137
Token Socialist
Token ID21773
Feature activation+2.39e+00

Pos logit contributions
 overwhelm+1.253
 overpower+1.116
 confuse+1.102
 overshadow+1.036
secut+1.024
Neg logit contributions
-1.388
 Hed-1.262
Pie-1.204
ertodd-1.200
 Holly-1.193
Token Party
Token ID3615
Feature activation+2.34e-01

Pos logit contributions
 overwhelm+2.258
 overpower+2.056
 confuse+2.036
 overshadow+1.943
secut+1.925
Neg logit contributions
-1.637
 Hed-1.467
Pie-1.368
 Holly-1.365
 Polly-1.362
Token in
Token ID287
Feature activation+2.99e-01

Pos logit contributions
 overwhelm+0.184
 overpower+0.164
 confuse+0.162
 overshadow+0.153
secut+0.152
Neg logit contributions
-0.198
 Hed-0.183
 Holly-0.173
 Polly-0.172
Pie-0.172
Token the
Token ID262
Feature activation-2.53e-01

Pos logit contributions
 overwhelm+0.231
 overpower+0.206
 confuse+0.202
 overshadow+0.191
secut+0.189
Neg logit contributions
-0.260
 Hed-0.238
Pie-0.226
 Holly-0.226
 Polly-0.225
Token then
Token ID788
Feature activation+3.35e+00

Pos logit contributions
+0.219
 Hed+0.200
Pie+0.190
ertodd+0.189
 Holly+0.189
Neg logit contributions
 overwhelm-0.197
 overpower-0.176
 confuse-0.173
 overshadow-0.163
secut-0.161
Token-
Token ID12
Feature activation+6.55e+00

Pos logit contributions
 overwhelm+2.540
 overpower+2.277
 confuse+2.204
 overshadow+2.103
secut+2.046
Neg logit contributions
-2.967
 Hed-2.679
ertodd-2.572
Pie-2.547
Ton-2.546
Tokencurrent
Token ID14421
Feature activation+4.38e-01

Pos logit contributions
 overwhelm+5.476
 overpower+4.980
 confuse+4.866
secut+4.663
 overshadow+4.636
Neg logit contributions
-5.043
 Hed-4.584
ertodd-4.449
 Holly-4.267
 sidx-4.240
Token crisis
Token ID4902
Feature activation-6.29e-02

Pos logit contributions
 overwhelm+0.319
 overpower+0.283
 confuse+0.278
 overshadow+0.261
secut+0.256
Neg logit contributions
-0.399
 Hed-0.367
Pie-0.349
ertodd-0.348
 Holly-0.348
Token of
Token ID286
Feature activation-1.03e+00

Pos logit contributions
+0.052
 Hed+0.048
 Holly+0.045
 Polly+0.045
Pie+0.045
Neg logit contributions
 overwhelm-0.051
 overpower-0.045
 confuse-0.045
 overshadow-0.042
secut-0.042
Token capitalism
Token ID12129
Feature activation-1.22e+00

Pos logit contributions
+0.866
 Hed+0.800
 Holly+0.755
 Polly+0.754
Pie+0.752
Neg logit contributions
 overwhelm-0.817
 overpower-0.729
 confuse-0.720
 overshadow-0.680
secut-0.678
Token,
Token ID11
Feature activation-4.51e-01

Pos logit contributions
+0.975
 Hed+0.903
 Holly+0.850
 Polly+0.849
Pie+0.843
Neg logit contributions
 overwhelm-1.006
 overpower-0.903
 confuse-0.893
secut-0.844
 overshadow-0.842
Token a
Token ID257
Feature activation-4.52e-02

Pos logit contributions
 overwhelm+1.008
 overpower+0.900
 confuse+0.885
 overshadow+0.837
secut+0.828
Neg logit contributions
-1.104
 Hed-1.009
ertodd-0.955
Pie-0.954
 Holly-0.949
Token look
Token ID804
Feature activation+1.76e-01

Pos logit contributions
+0.039
 Hed+0.036
 Holly+0.034
 Polly+0.034
Pie+0.034
Neg logit contributions
 overwhelm-0.035
 overpower-0.031
 confuse-0.031
 overshadow-0.029
secut-0.029
Token at
Token ID379
Feature activation-3.47e-01

Pos logit contributions
 overwhelm+0.134
 overpower+0.119
 confuse+0.117
 overshadow+0.110
secut+0.110
Neg logit contributions
-0.153
 Hed-0.141
 Holly-0.134
 Polly-0.134
Pie-0.133
Token the
Token ID262
Feature activation+1.62e+00

Pos logit contributions
+0.293
 Hed+0.270
 Holly+0.255
 Polly+0.255
Pie+0.254
Neg logit contributions
 overwhelm-0.273
 overpower-0.244
 confuse-0.240
 overshadow-0.227
secut-0.226
Token then
Token ID788
Feature activation+4.27e+00

Pos logit contributions
 overwhelm+1.294
 overpower+1.165
 confuse+1.144
 overshadow+1.079
secut+1.063
Neg logit contributions
-1.360
 Hed-1.230
ertodd-1.173
Pie-1.169
 Polly-1.162
Token-
Token ID12
Feature activation+6.52e+00

Pos logit contributions
 overwhelm+3.332
 overpower+3.005
 confuse+2.914
 overshadow+2.791
secut+2.749
Neg logit contributions
-3.660
 Hed-3.298
ertodd-3.144
Pie-3.120
Ton-3.110
Tokentop
Token ID4852
Feature activation+2.51e+00

Pos logit contributions
 overwhelm+5.312
 overpower+4.848
 confuse+4.729
secut+4.625
 overshadow+4.513
Neg logit contributions
-5.054
 Hed-4.631
ertodd-4.454
 Holly-4.295
 Polly-4.285
Token "
Token ID366
Feature activation+1.21e+00

Pos logit contributions
 overwhelm+2.034
 overpower+1.831
 confuse+1.802
secut+1.734
 overshadow+1.713
Neg logit contributions
-2.052
 Hed-1.862
 Holly-1.746
Pie-1.746
 Polly-1.743
Tokenjin
Token ID18594
Feature activation+1.20e+00

Pos logit contributions
 overwhelm+1.083
 overpower+0.980
 confuse+0.969
secut+0.924
 overshadow+0.921
Neg logit contributions
-0.889
 Hed-0.812
 Polly-0.759
 Holly-0.758
ertodd-0.751
Tokenx
Token ID87
Feature activation+8.29e-01

Pos logit contributions
 overwhelm+1.195
 overpower+1.093
 confuse+1.081
 overshadow+1.035
secut+1.033
Neg logit contributions
-0.768
 Hed-0.690
 Holly-0.637
 Polly-0.635
Pie-0.635
Tokened
Token ID276
Feature activation+8.59e-01

Pos logit contributions
 overwhelm+0.679
 overpower+0.608
 confuse+0.600
 overshadow+0.568
secut+0.565
Neg logit contributions
-0.675
 Hed-0.620
 Holly-0.585
 Polly-0.584
Pie-0.583
Token This
Token ID770
Feature activation+1.74e-01

Pos logit contributions
 overwhelm+0.984
 overpower+0.888
 confuse+0.879
 overshadow+0.832
secut+0.829
Neg logit contributions
-0.882
 Hed-0.801
 Holly-0.753
 Polly-0.751
Pie-0.751
Token line
Token ID1627
Feature activation+3.41e+00

Pos logit contributions
 overwhelm+0.140
 overpower+0.125
 confuse+0.123
 overshadow+0.117
secut+0.117
Neg logit contributions
-0.143
 Hed-0.133
 Holly-0.125
 Polly-0.125
Pie-0.125
Token is
Token ID318
Feature activation+9.99e-01

Pos logit contributions
 overwhelm+2.655
 overpower+2.371
 confuse+2.352
 overshadow+2.197
secut+2.177
Neg logit contributions
-2.946
 Hed-2.684
Pie-2.551
Ton-2.542
 Holly-2.536
Token OK
Token ID7477
Feature activation+9.12e-02

Pos logit contributions
 overwhelm+0.738
 overpower+0.652
 confuse+0.643
 overshadow+0.604
secut+0.602
Neg logit contributions
-0.893
 Hed-0.829
 Holly-0.786
 Polly-0.785
Pie-0.782
Token:
Token ID25
Feature activation+1.91e+00

Pos logit contributions
 overwhelm+0.069
 overpower+0.061
 confuse+0.060
 overshadow+0.056
secut+0.056
Neg logit contributions
-0.080
 Hed-0.075
 Holly-0.071
 Polly-0.071
Pie-0.070
Token /*
Token ID11900
Feature activation+6.46e+00

Pos logit contributions
 overwhelm+1.439
 overpower+1.279
 confuse+1.264
 overshadow+1.184
secut+1.177
Neg logit contributions
-1.680
 Hed-1.543
 Holly-1.463
Pie-1.460
 Polly-1.459
Token vi
Token ID25357
Feature activation-8.67e-01

Pos logit contributions
 overwhelm+4.349
 overpower+3.846
 confuse+3.810
 overshadow+3.488
secut+3.462
Neg logit contributions
-6.144
 Hed-5.566
ertodd-5.366
Ton-5.339
Pie-5.323
Token:
Token ID25
Feature activation-9.15e-02

Pos logit contributions
+0.639
 Hed+0.586
 Holly+0.549
Pie+0.549
 Polly+0.547
Neg logit contributions
 overwhelm-0.769
 overpower-0.697
 confuse-0.687
 overshadow-0.655
secut-0.655
Tokenset
Token ID2617
Feature activation-9.47e-01

Pos logit contributions
+0.070
 Hed+0.064
 Holly+0.060
 Polly+0.060
Pie+0.060
Neg logit contributions
 overwhelm-0.079
 overpower-0.072
 confuse-0.071
 overshadow-0.067
secut-0.067
Token ts
Token ID40379
Feature activation+8.61e-02

Pos logit contributions
+0.789
 Hed+0.736
Pie+0.693
 Polly+0.692
 Holly+0.692
Neg logit contributions
 overwhelm-0.749
 overpower-0.667
 confuse-0.657
 overshadow-0.621
secut-0.616
Token=
Token ID28
Feature activation+1.34e+00

Pos logit contributions
 overwhelm+0.065
 overpower+0.057
 confuse+0.057
 overshadow+0.053
secut+0.053
Neg logit contributions
-0.076
 Hed-0.070
 Holly-0.066
Pie-0.066
 Polly-0.066
TokenA
Token ID32
Feature activation-1.60e+00

Pos logit contributions
+2.091
 Hed+1.917
 Holly+1.777
 Katz+1.767
 Polly+1.766
Neg logit contributions
 overwhelm-3.090
 overpower-2.807
 confuse-2.783
 overshadow-2.663
secut-2.661
Token.
Token ID13
Feature activation-1.18e+00

Pos logit contributions
+1.310
 Hed+1.211
 Holly+1.139
 Polly+1.137
 Katz+1.133
Neg logit contributions
 overwhelm-1.299
 overpower-1.159
 confuse-1.146
 overshadow-1.086
secut-1.078
Token
Token ID564
Feature activation-1.32e+00

Pos logit contributions
+0.844
 Hed+0.766
 Holly+0.716
 Polly+0.715
Pie+0.713
Neg logit contributions
 overwhelm-1.082
 overpower-0.982
 confuse-0.971
 overshadow-0.925
secut-0.922
Token
Token ID250
Feature activation+1.07e+00

Pos logit contributions
+1.197
 Hed+1.106
 Holly+1.052
 Polly+1.051
Pie+1.050
Neg logit contributions
 overwhelm-0.956
 overpower-0.845
 confuse-0.832
 overshadow-0.783
secut-0.781
TokenWind
Token ID8731
Feature activation-7.02e-02

Pos logit contributions
 overwhelm+1.020
 overpower+0.933
 confuse+0.923
secut+0.883
 overshadow+0.878
Neg logit contributions
-0.716
 Hed-0.641
 Polly-0.598
 Holly-0.595
ertodd-0.595
Tokeny
Token ID88
Feature activation+6.44e+00

Pos logit contributions
+0.054
 Hed+0.049
 Holly+0.046
 Polly+0.046
Pie+0.046
Neg logit contributions
 overwhelm-0.061
 overpower-0.055
 confuse-0.054
 overshadow-0.052
secut-0.052
Token
Token ID447
Feature activation+2.54e-01

Pos logit contributions
 overwhelm+5.876
 overpower+5.355
 confuse+5.295
 overshadow+5.112
secut+5.032
Neg logit contributions
-4.450
 Hed-3.833
ertodd-3.748
Pie-3.620
 Polly-3.603
Token
Token ID251
Feature activation-5.76e-01

Pos logit contributions
 overwhelm+0.172
 overpower+0.151
 confuse+0.148
 overshadow+0.138
secut+0.138
Neg logit contributions
-0.242
 Hed-0.225
 Holly-0.214
 Polly-0.214
Pie-0.214
Token Daniels
Token ID28162
Feature activation-9.24e-01

Pos logit contributions
+0.428
 Hed+0.390
 Holly+0.365
 Polly+0.364
Pie+0.364
Neg logit contributions
 overwhelm-0.513
 overpower-0.464
 confuse-0.459
 overshadow-0.437
secut-0.435
Token and
Token ID290
Feature activation+7.79e-01

Pos logit contributions
+0.730
 Hed+0.668
 Holly+0.629
 Polly+0.628
Pie+0.627
Neg logit contributions
 overwhelm-0.778
 overpower-0.700
 confuse-0.691
 overshadow-0.656
secut-0.652
Token his
Token ID465
Feature activation+1.19e+00

Pos logit contributions
 overwhelm+0.607
 overpower+0.544
 confuse+0.535
 overshadow+0.504
secut+0.502
Neg logit contributions
-0.665
 Hed-0.606
Pie-0.577
ertodd-0.573
 Polly-0.572
Token )
Token ID1267
Feature activation+1.40e+00

Pos logit contributions
+1.941
 Hed+1.821
 Polly+1.711
 Holly+1.708
Pie+1.694
Neg logit contributions
 overwhelm-1.812
 overpower-1.607
 confuse-1.580
 overshadow-1.502
secut-1.496
Token
Token ID198
Feature activation-2.16e-01

Pos logit contributions
 overwhelm+1.218
 overpower+1.099
 confuse+1.087
 overshadow+1.029
secut+1.024
Neg logit contributions
-1.071
 Hed-0.972
 Holly-0.915
 Polly-0.912
Pie-0.910
Token
Token ID198
Feature activation-7.72e-01

Pos logit contributions
+0.135
 Hed+0.119
 Holly+0.110
ertodd+0.110
 Polly+0.109
Neg logit contributions
 overwhelm-0.218
 overpower-0.200
 confuse-0.198
secut-0.189
 overshadow-0.188
Token(
Token ID7
Feature activation+3.40e+00

Pos logit contributions
+0.552
 Hed+0.503
Pie+0.470
 Holly+0.468
 Polly+0.468
Neg logit contributions
 overwhelm-0.707
 overpower-0.642
 confuse-0.634
 overshadow-0.605
secut-0.602
TokenExample
Token ID16281
Feature activation+6.18e+00

Pos logit contributions
 overwhelm+2.722
 overpower+2.435
 confuse+2.424
secut+2.271
 overshadow+2.267
Neg logit contributions
-2.783
 Hed-2.556
 Holly-2.431
 Polly-2.416
 Katz-2.405
Token:
Token ID25
Feature activation+6.38e+00

Pos logit contributions
 overwhelm+4.369
 overpower+3.903
 confuse+3.834
 surpass+3.627
rawdownloadcloneembedreportprint+3.605
Neg logit contributions
-5.382
 Hed-4.802
ertodd-4.692
Ton-4.557
 Holly-4.552
Token Your
Token ID3406
Feature activation+3.04e+00

Pos logit contributions
 overwhelm+3.499
 overpower+2.953
 confuse+2.935
secut+2.649
 overshadow+2.647
Neg logit contributions
-6.824
 Hed-6.248
Pie-6.065
ertodd-6.043
Ton-6.022
Token +
Token ID1343
Feature activation+5.89e-01

Pos logit contributions
 overwhelm+2.654
 overpower+2.399
 confuse+2.385
 overshadow+2.249
secut+2.235
Neg logit contributions
-2.330
 Hed-2.091
 Holly-1.978
Pie-1.973
 Polly-1.965
Token1
Token ID16
Feature activation+4.17e+00

Pos logit contributions
 overwhelm+0.490
 overpower+0.441
 confuse+0.436
secut+0.412
 overshadow+0.411
Neg logit contributions
-0.471
 Hed-0.431
 Holly-0.405
 Polly-0.405
Pie-0.403
Token shield
Token ID7614
Feature activation+3.45e+00

Pos logit contributions
 overwhelm+2.030
 overpower+1.673
 confuse+1.649
 overshadow+1.455
secut+1.423
Neg logit contributions
-4.892
 Hed-4.473
Pie-4.383
ertodd-4.351
 Holly-4.339
Token keeps
Token ID7622
Feature activation+2.18e-01

Pos logit contributions
 overwhelm+2.465
 confuse+2.196
 overpower+2.195
secut+2.046
 overshadow+2.015
Neg logit contributions
-3.206
 Hed-2.878
Pie-2.778
ertodd-2.766
 Holly-2.760
Token and
Token ID290
Feature activation-3.07e-01

Pos logit contributions
+0.551
 Hed+0.511
 Holly+0.483
 Polly+0.481
Pie+0.479
Neg logit contributions
 overwhelm-0.537
 overpower-0.480
 confuse-0.474
secut-0.447
 overshadow-0.447
Token we
Token ID356
Feature activation-1.10e-02

Pos logit contributions
+0.249
 Hed+0.227
 Polly+0.214
Pie+0.214
 Holly+0.213
Neg logit contributions
 overwhelm-0.254
 overpower-0.227
 confuse-0.225
 overshadow-0.213
secut-0.211
Token wish
Token ID4601
Feature activation+9.39e-01

Pos logit contributions
+0.010
 Hed+0.009
 Holly+0.008
Pie+0.008
 Polly+0.008
Neg logit contributions
 overwhelm-0.009
 overpower-0.008
 confuse-0.007
 overshadow-0.007
secut-0.007
Token the
Token ID262
Feature activation-1.35e+00

Pos logit contributions
 overwhelm+0.726
 overpower+0.645
 confuse+0.637
 overshadow+0.600
secut+0.598
Neg logit contributions
-0.809
 Hed-0.741
Pie-0.700
 Polly-0.698
 Katz-0.697
Token process
Token ID1429
Feature activation-1.75e+00

Pos logit contributions
+1.127
 Hed+1.044
 Holly+0.987
 Polly+0.985
Pie+0.982
Neg logit contributions
 overwhelm-1.070
 overpower-0.956
 confuse-0.944
 overshadow-0.891
secut-0.888
Token well
Token ID880
Feature activation-7.93e+00

Pos logit contributions
+1.483
 Hed+1.379
 Polly+1.303
 Holly+1.303
Pie+1.293
Neg logit contributions
 overwhelm-1.370
 overpower-1.220
 confuse-1.206
secut-1.135
 overshadow-1.135
Token."
Token ID526
Feature activation+7.53e-01

Pos logit contributions
+5.967
 Hed+5.706
 Polly+5.424
 Holly+5.407
 Carey+5.226
Neg logit contributions
 overwhelm-6.458
 overpower-5.844
 confuse-5.699
ierrez-5.474
secut-5.322
Token
Token ID198
Feature activation+4.91e-01

Pos logit contributions
 overwhelm+0.671
 overpower+0.606
 confuse+0.601
secut+0.574
 overshadow+0.572
Neg logit contributions
-0.553
 Hed-0.501
Pie-0.468
 Holly-0.467
 Polly-0.467
Token
Token ID198
Feature activation+2.97e-01

Pos logit contributions
 overwhelm+0.483
 overpower+0.443
 confuse+0.438
secut+0.419
 overshadow+0.416
Neg logit contributions
-0.317
 Hed-0.278
ertodd-0.257
 Holly-0.256
 Polly-0.255
TokenPeter
Token ID19727
Feature activation-4.89e-01

Pos logit contributions
 overwhelm+0.275
 overpower+0.250
 confuse+0.248
secut+0.238
 overshadow+0.235
Neg logit contributions
-0.205
 Hed-0.188
 Polly-0.174
 Holly-0.174
 Katz-0.173
Token Fan
Token ID13836
Feature activation+1.23e+00

Pos logit contributions
+0.326
 Hed+0.293
 Holly+0.272
 Polly+0.272
Pie+0.271
Neg logit contributions
 overwhelm-0.472
 overpower-0.431
 confuse-0.426
 overshadow-0.407
secut-0.406
Token,
Token ID11
Feature activation-1.60e-01

Pos logit contributions
+0.627
 Hed+0.577
 Holly+0.543
 Polly+0.542
Pie+0.540
Neg logit contributions
 overwhelm-0.652
 overpower-0.585
 confuse-0.578
 overshadow-0.547
secut-0.546
Token which
Token ID543
Feature activation-2.04e+00

Pos logit contributions
+0.134
 Hed+0.124
 Holly+0.117
 Polly+0.117
Pie+0.116
Neg logit contributions
 overwhelm-0.127
 overpower-0.114
 confuse-0.112
 overshadow-0.106
secut-0.105
Token count
Token ID954
Feature activation-1.85e-01

Pos logit contributions
+1.742
 Hed+1.633
 Holly+1.539
 Polly+1.535
 Katz+1.527
Neg logit contributions
 overwhelm-1.567
 overpower-1.397
 confuse-1.376
secut-1.301
 overshadow-1.295
Token for
Token ID329
Feature activation-1.37e+00

Pos logit contributions
+0.172
 Hed+0.160
 Holly+0.152
 Polly+0.152
Pie+0.152
Neg logit contributions
 overwhelm-0.130
 overpower-0.114
 confuse-0.112
 overshadow-0.105
secut-0.104
Token at
Token ID379
Feature activation-1.96e+00

Pos logit contributions
+1.220
 Hed+1.145
 Holly+1.082
 Polly+1.080
Pie+1.075
Neg logit contributions
 overwhelm-1.006
 overpower-0.887
 confuse-0.876
secut-0.823
 overshadow-0.822
Token least
Token ID1551
Feature activation-7.64e+00

Pos logit contributions
+1.469
 Hed+1.359
 Holly+1.268
 Polly+1.263
 Katz+1.261
Neg logit contributions
 overwhelm-1.725
 overpower-1.554
 confuse-1.542
 overshadow-1.466
secut-1.448
Token 1
Token ID352
Feature activation-8.03e-01

Pos logit contributions
+5.465
 Hed+5.425
 Katz+5.031
 Holly+5.010
 Carey+4.990
Neg logit contributions
 overwhelm-6.342
 confuse-5.627
 overpower-5.622
secut-5.483
Reviewer-5.369
Token.
Token ID13
Feature activation-2.41e+00

Pos logit contributions
+0.672
 Hed+0.621
 Holly+0.586
 Polly+0.584
Pie+0.584
Neg logit contributions
 overwhelm-0.639
 overpower-0.570
 confuse-0.563
 overshadow-0.532
secut-0.529
Token5
Token ID20
Feature activation-3.32e+00

Pos logit contributions
+1.694
 Hed+1.544
 Holly+1.433
 Polly+1.430
Pie+1.429
Neg logit contributions
 overwhelm-2.254
 overpower-2.049
 confuse-2.026
 overshadow-1.935
secut-1.919
Token times
Token ID1661
Feature activation-1.46e+00

Pos logit contributions
+3.022
 Hed+2.852
 Holly+2.676
Pie+2.675
 Katz+2.659
Neg logit contributions
 overwhelm-2.346
 overpower-2.051
 confuse-2.028
 overshadow-1.899
secut-1.882
Token their
Token ID511
Feature activation+1.13e+00

Pos logit contributions
+1.233
 Hed+1.157
 Holly+1.088
 Polly+1.085
 Katz+1.082
Neg logit contributions
 overwhelm-1.131
 overpower-1.005
 confuse-0.994
 overshadow-0.938
secut-0.933
Tokenutils
Token ID26791
Feature activation-3.25e+00

Pos logit contributions
+3.003
 Hed+2.692
Pie+2.610
ertodd+2.602
 Katz+2.546
Neg logit contributions
 overwhelm-4.405
 overpower-4.029
 confuse-3.974
 overshadow-3.871
secut-3.743
Token import
Token ID1330
Feature activation-2.87e+00

Pos logit contributions
+2.845
 Hed+2.680
 Polly+2.577
 Holly+2.551
 Katz+2.529
Neg logit contributions
 overwhelm-2.325
 overpower-2.050
 confuse-2.012
 overshadow-1.916
secut-1.913
Token paths
Token ID13532
Feature activation-1.71e+00

Pos logit contributions
+1.766
 Hed+1.590
 Holly+1.472
 Polly+1.471
Pie+1.453
Neg logit contributions
 overwhelm-2.903
 overpower-2.662
 confuse-2.629
 overshadow-2.527
secut-2.521
Token import
Token ID1330
Feature activation-2.69e+00

Pos logit contributions
+1.568
 Hed+1.475
 Holly+1.408
 Polly+1.405
Pie+1.397
Neg logit contributions
 overwhelm-1.182
 overpower-1.036
 confuse-1.018
 overshadow-0.957
secut-0.956
Token n
Token ID299
Feature activation-2.09e+00

Pos logit contributions
+2.539
 Hed+2.375
 Polly+2.276
 Holly+2.270
 Katz+2.250
Neg logit contributions
 overwhelm-1.811
 overpower-1.586
 confuse-1.551
secut-1.468
 overshadow-1.461
Tokenumpy
Token ID32152
Feature activation-7.56e+00

Pos logit contributions
+1.024
 Hed+0.891
 Holly+0.800
 Polly+0.794
Pie+0.788
Neg logit contributions
 overwhelm-2.397
 overpower-2.219
 confuse-2.200
secut-2.117
 overshadow-2.114
Token as
Token ID355
Feature activation-3.55e+00

Pos logit contributions
+4.739
 Hed+4.451
 Polly+4.212
 Katz+4.169
 Holly+4.137
Neg logit contributions
 overwhelm-6.963
 overpower-6.291
 confuse-6.271
 overshadow-6.111
secut-6.105
Token np
Token ID45941
Feature activation-2.96e+00

Pos logit contributions
+2.431
 Hed+2.217
 Polly+2.091
 Holly+2.070
Pie+2.057
Neg logit contributions
 overwhelm-3.318
 overpower-3.021
 confuse-2.991
secut-2.862
 overshadow-2.853
Token import
Token ID1330
Feature activation-2.04e+00

Pos logit contributions
+2.614
 Hed+2.477
 Holly+2.361
 Polly+2.358
 Katz+2.330
Neg logit contributions
 overwhelm-2.110
 overpower-1.852
 confuse-1.818
secut-1.742
 overshadow-1.728
Token arg
Token ID1822
Feature activation-2.66e+00

Pos logit contributions
+1.801
 Hed+1.670
 Polly+1.586
 Holly+1.585
 Katz+1.574
Neg logit contributions
 overwhelm-1.519
 overpower-1.345
 confuse-1.324
 overshadow-1.254
secut-1.253
Tokenparse
Token ID29572
Feature activation-7.86e-01

Pos logit contributions
+1.842
 Hed+1.658
Pie+1.569
 Polly+1.566
 Holly+1.560
Neg logit contributions
 overwhelm-2.480
 overpower-2.253
 confuse-2.228
 overshadow-2.145
secut-2.106
Token is
Token ID318
Feature activation-1.78e+00

Pos logit contributions
 overwhelm+0.422
 overpower+0.378
 confuse+0.374
 overshadow+0.353
secut+0.351
Neg logit contributions
-0.416
 Hed-0.381
 Holly-0.360
 Polly-0.359
Pie-0.358
Token to
Token ID284
Feature activation+4.74e-01

Pos logit contributions
+1.520
 Hed+1.412
 Holly+1.335
 Polly+1.332
Pie+1.329
Neg logit contributions
 overwhelm-1.381
 overpower-1.224
 confuse-1.211
secut-1.139
 overshadow-1.139
Token give
Token ID1577
Feature activation-2.54e+00

Pos logit contributions
 overwhelm+0.367
 overpower+0.327
 confuse+0.322
 overshadow+0.304
secut+0.303
Neg logit contributions
-0.407
 Hed-0.376
 Holly-0.356
 Polly-0.355
Pie-0.355
Token the
Token ID262
Feature activation-1.91e+00

Pos logit contributions
+2.056
 Hed+1.900
 Holly+1.790
 Polly+1.785
Pie+1.784
Neg logit contributions
 overwhelm-2.075
 overpower-1.859
 confuse-1.838
 overshadow-1.738
secut-1.735
Token hall
Token ID6899
Feature activation-3.16e+00

Pos logit contributions
+1.662
 Hed+1.537
 Holly+1.454
 Polly+1.452
Pie+1.449
Neg logit contributions
 overwhelm-1.459
 overpower-1.297
 confuse-1.279
 overshadow-1.204
secut-1.198
Token its
Token ID663
Feature activation-7.25e+00

Pos logit contributions
+2.675
 Hed+2.492
 Holly+2.362
 Polly+2.354
Pie+2.352
Neg logit contributions
 overwhelm-2.456
 overpower-2.176
 confuse-2.154
secut-2.035
 overshadow-2.025
Token look
Token ID804
Feature activation-6.33e-01

Pos logit contributions
+5.960
 Hed+5.763
 Katz+5.520
 Holly+5.512
 Polly+5.435
Neg logit contributions
 overwhelm-5.396
 confuse-4.714
 overpower-4.659
secut-4.565
 overshadow-4.416
Token,
Token ID11
Feature activation-7.38e-01

Pos logit contributions
+0.485
 Hed+0.444
 Holly+0.417
 Polly+0.416
Pie+0.415
Neg logit contributions
 overwhelm-0.548
 overpower-0.494
 confuse-0.488
 overshadow-0.463
secut-0.462
Token and
Token ID290
Feature activation-6.77e-01

Pos logit contributions
+0.609
 Hed+0.561
 Holly+0.530
 Polly+0.529
Pie+0.527
Neg logit contributions
 overwhelm-0.596
 overpower-0.533
 confuse-0.527
 overshadow-0.498
secut-0.496
Token Bour
Token ID20576
Feature activation+9.76e-01

Pos logit contributions
+0.557
 Hed+0.512
 Holly+0.482
 Polly+0.482
Pie+0.481
Neg logit contributions
 overwhelm-0.550
 overpower-0.492
 confuse-0.486
 overshadow-0.460
secut-0.457
Tokendain
Token ID27162
Feature activation-8.18e-01

Pos logit contributions
 overwhelm+0.998
 overpower+0.915
 confuse+0.906
secut+0.866
 overshadow+0.865
Neg logit contributions
-0.595
 Hed-0.535
 Holly-0.493
 Polly-0.490
Pie-0.486
Token hand
Token ID1021
Feature activation-1.05e-02

Pos logit contributions
+0.684
 Hed+0.621
 Holly+0.580
Pie+0.579
 Polly+0.579
Neg logit contributions
 overwhelm-0.852
 overpower-0.771
 confuse-0.763
 overshadow-0.726
secut-0.724
Tokenout
Token ID448
Feature activation-8.49e-01

Pos logit contributions
+0.008
 Hed+0.007
 Polly+0.007
Pie+0.007
 Holly+0.007
Neg logit contributions
 overwhelm-0.009
 overpower-0.008
 confuse-0.008
 overshadow-0.008
secut-0.008
Token<|endoftext|>
Token ID50256
Feature activation-9.02e-01

Pos logit contributions
+0.578
 Hed+0.522
 Holly+0.485
 Polly+0.485
Pie+0.483
Neg logit contributions
 overwhelm-0.809
 overpower-0.737
 confuse-0.729
 overshadow-0.696
secut-0.693
TokenOr
Token ID5574
Feature activation-5.04e-01

Pos logit contributions
+0.627
 Hed+0.568
 Holly+0.529
Pie+0.529
 Polly+0.528
Neg logit contributions
 overwhelm-0.846
 overpower-0.769
 confuse-0.761
 overshadow-0.726
secut-0.722
Token at
Token ID379
Feature activation-6.23e-03

Pos logit contributions
+0.415
 Hed+0.384
 Holly+0.362
 Polly+0.362
Pie+0.360
Neg logit contributions
 overwhelm-0.407
 overpower-0.364
 confuse-0.359
 overshadow-0.340
secut-0.338
Token least
Token ID1551
Feature activation-7.08e+00

Pos logit contributions
+0.005
 Hed+0.004
 Holly+0.004
 Polly+0.004
Pie+0.004
Neg logit contributions
 overwhelm-0.005
 overpower-0.005
 confuse-0.005
 overshadow-0.005
secut-0.005
Token 26
Token ID2608
Feature activation-1.56e+00

Pos logit contributions
+5.306
 Hed+5.160
 Holly+4.916
 Polly+4.880
 Katz+4.834
Neg logit contributions
 overwhelm-5.694
 overpower-5.117
 confuse-4.930
secut-4.885
 overshadow-4.796
Token of
Token ID286
Feature activation+9.56e-01

Pos logit contributions
+1.350
 Hed+1.253
 Holly+1.187
 Polly+1.184
Pie+1.181
Neg logit contributions
 overwhelm-1.197
 overpower-1.062
 confuse-1.046
 overshadow-0.988
secut-0.983
Token 27
Token ID2681
Feature activation-6.97e-01

Pos logit contributions
 overwhelm+0.775
 overpower+0.694
 confuse+0.685
 overshadow+0.647
secut+0.644
Neg logit contributions
-0.788
 Hed-0.721
 Holly-0.679
Pie-0.679
 Polly-0.678
Token:
Token ID25
Feature activation-1.62e+00

Pos logit contributions
+0.592
 Hed+0.548
 Holly+0.518
 Polly+0.517
Pie+0.515
Neg logit contributions
 overwhelm-0.545
 overpower-0.485
 confuse-0.479
 overshadow-0.452
secut-0.450
TokenTHE
Token ID10970
Feature activation-8.70e-01

Pos logit contributions
+1.155
 Hed+1.061
 Holly+0.994
 Polly+0.989
Pie+0.982
Neg logit contributions
 overwhelm-1.485
 overpower-1.347
 confuse-1.329
 overshadow-1.267
secut-1.262
Token
Token ID198
Feature activation+7.51e-01

Pos logit contributions
 overwhelm+0.752
 overpower+0.688
 confuse+0.679
secut+0.651
 overshadow+0.646
Neg logit contributions
-0.490
 Hed-0.433
 Holly-0.399
 Polly-0.398
ertodd-0.398
Tokeng
Token ID70
Feature activation+3.40e+00

Pos logit contributions
 overwhelm+0.686
 overpower+0.621
 confuse+0.614
 overshadow+0.585
secut+0.585
Neg logit contributions
-0.540
 Hed-0.492
 Holly-0.461
 Polly-0.459
Pie-0.455
Tokenave
Token ID1015
Feature activation+9.37e-01

Pos logit contributions
 overwhelm+2.713
 overpower+2.421
 confuse+2.386
 overshadow+2.249
secut+2.232
Neg logit contributions
-2.882
 Hed-2.595
 Holly-2.479
Pie-2.472
 Polly-2.469
Token the
Token ID262
Feature activation-9.37e-01

Pos logit contributions
 overwhelm+0.764
 overpower+0.685
 confuse+0.676
 overshadow+0.639
secut+0.635
Neg logit contributions
-0.769
 Hed-0.705
 Holly-0.664
Pie-0.663
 Polly-0.661
Token movement
Token ID3356
Feature activation-1.14e+00

Pos logit contributions
+0.815
 Hed+0.755
 Holly+0.714
 Polly+0.713
Pie+0.712
Neg logit contributions
 overwhelm-0.714
 overpower-0.634
 confuse-0.626
 overshadow-0.589
secut-0.587
Token its
Token ID663
Feature activation-6.99e+00

Pos logit contributions
+0.974
 Hed+0.904
 Polly+0.855
 Holly+0.855
Pie+0.851
Neg logit contributions
 overwhelm-0.873
 overpower-0.777
 confuse-0.767
 overshadow-0.723
secut-0.722
Token vitality
Token ID41687
Feature activation-5.53e-01

Pos logit contributions
+6.010
 Hed+5.738
 Polly+5.483
 Holly+5.432
 Katz+5.399
Neg logit contributions
 overwhelm-4.929
 confuse-4.304
 overpower-4.292
secut-4.141
 overshadow-3.991
Token.
Token ID13
Feature activation-5.97e-01

Pos logit contributions
+0.439
 Hed+0.404
 Holly+0.380
 Polly+0.380
Pie+0.378
Neg logit contributions
 overwhelm-0.462
 overpower-0.416
 confuse-0.411
 overshadow-0.389
secut-0.389
TokenBetter
Token ID28971
Feature activation-1.13e-01

Pos logit contributions
+0.431
 Hed+0.392
 Holly+0.366
 Polly+0.366
Pie+0.365
Neg logit contributions
 overwhelm-0.544
 overpower-0.493
 confuse-0.487
 overshadow-0.464
secut-0.462
Token the
Token ID262
Feature activation+6.61e-01

Pos logit contributions
+0.098
 Hed+0.091
 Holly+0.086
 Polly+0.086
Pie+0.086
Neg logit contributions
 overwhelm-0.086
 overpower-0.076
 confuse-0.075
 overshadow-0.071
secut-0.071
Token multiple
Token ID3294
Feature activation-2.69e-01

Pos logit contributions
 overwhelm+0.514
 overpower+0.459
 confuse+0.452
 overshadow+0.426
secut+0.421
Neg logit contributions
-0.567
 Hed-0.522
 Holly-0.495
Pie-0.493
 Polly-0.492
Token been
Token ID587
Feature activation+1.41e+00

Pos logit contributions
 overwhelm+1.724
 overpower+1.525
 confuse+1.502
 overshadow+1.416
secut+1.397
Neg logit contributions
-2.163
 Hed-1.997
Pie-1.905
 Holly-1.899
 Polly-1.895
Token.
Token ID13
Feature activation-6.33e-01

Pos logit contributions
 overwhelm+1.040
 overpower+0.926
 confuse+0.912
 overshadow+0.857
secut+0.852
Neg logit contributions
-1.257
 Hed-1.160
Pie-1.100
 Holly-1.099
 Polly-1.097
Token X
Token ID1395
Feature activation-1.36e+00

Pos logit contributions
+0.463
 Hed+0.421
 Holly+0.394
 Polly+0.393
Pie+0.393
Neg logit contributions
 overwhelm-0.570
 overpower-0.516
 confuse-0.511
 overshadow-0.486
secut-0.484
Token-
Token ID12
Feature activation-1.73e+00

Pos logit contributions
+1.075
 Hed+0.990
 Holly+0.930
 Polly+0.926
Pie+0.924
Neg logit contributions
 overwhelm-1.148
 overpower-1.030
 confuse-1.020
 overshadow-0.965
secut-0.963
TokenMen
Token ID10418
Feature activation-6.00e+00

Pos logit contributions
+1.197
 Hed+1.086
 Holly+1.013
Pie+1.012
 Polly+1.005
Neg logit contributions
 overwhelm-1.625
 overpower-1.476
 confuse-1.462
 overshadow-1.393
secut-1.391
Token:
Token ID25
Feature activation-6.87e+00

Pos logit contributions
+4.171
 Hed+3.877
 Holly+3.625
Pie+3.607
 Polly+3.596
Neg logit contributions
 overwhelm-5.380
 overpower-4.871
 confuse-4.851
secut-4.614
 overshadow-4.586
Token Apocalypse
Token ID21308
Feature activation-2.90e+00

Pos logit contributions
+4.480
 Hed+4.148
 Holly+3.840
 Carey+3.815
 Polly+3.796
Neg logit contributions
 overwhelm-6.449
 overpower-5.879
 confuse-5.820
 overshadow-5.562
secut-5.534
Token arrives
Token ID14443
Feature activation-1.69e+00

Pos logit contributions
+2.349
 Hed+2.181
 Holly+2.055
 Polly+2.051
Pie+2.036
Neg logit contributions
 overwhelm-2.347
 overpower-2.100
 confuse-2.077
secut-1.967
 overshadow-1.959
Token on
Token ID319
Feature activation-1.76e-01

Pos logit contributions
+1.379
 Hed+1.274
 Holly+1.202
 Polly+1.199
Pie+1.196
Neg logit contributions
 overwhelm-1.379
 overpower-1.235
 confuse-1.220
 overshadow-1.153
secut-1.149
Token Blu
Token ID12391
Feature activation-2.95e-01

Pos logit contributions
+0.104
 Hed+0.091
Pie+0.084
 Polly+0.084
 Holly+0.084
Neg logit contributions
 overwhelm-0.184
 overpower-0.169
 confuse-0.168
 overshadow-0.161
secut-0.160
Token-
Token ID12
Feature activation-1.10e+00

Pos logit contributions
+0.239
 Hed+0.219
 Holly+0.206
 Polly+0.206
Pie+0.205
Neg logit contributions
 overwhelm-0.243
 overpower-0.218
 confuse-0.215
 overshadow-0.203
secut-0.203
Token.
Token ID13
Feature activation+1.50e+00

Pos logit contributions
 overwhelm+2.438
 overpower+2.176
 confuse+2.132
secut+1.999
 surpass+1.995
Neg logit contributions
-3.033
 Hed-2.768
ertodd-2.688
 Holly-2.627
Pie-2.614
Token The
Token ID383
Feature activation-4.56e-01

Pos logit contributions
 overwhelm+1.346
 overpower+1.220
 confuse+1.209
secut+1.148
 overshadow+1.146
Neg logit contributions
-1.076
 Hed-0.971
ertodd-0.915
Pie-0.912
 Holly-0.906
Token Legend
Token ID9883
Feature activation-1.34e+00

Pos logit contributions
+0.279
 Hed+0.249
 Holly+0.229
 Polly+0.229
Pie+0.228
Neg logit contributions
 overwhelm-0.466
 overpower-0.427
 confuse-0.423
 overshadow-0.405
secut-0.404
Token of
Token ID286
Feature activation+9.36e-01

Pos logit contributions
+1.114
 Hed+1.030
 Holly+0.972
 Polly+0.970
Pie+0.968
Neg logit contributions
 overwhelm-1.065
 overpower-0.952
 confuse-0.938
 overshadow-0.887
secut-0.885
Token Zelda
Token ID22166
Feature activation-4.48e+00

Pos logit contributions
 overwhelm+0.891
 overpower+0.812
 confuse+0.803
 overshadow+0.766
secut+0.763
Neg logit contributions
-0.636
 Hed-0.574
 Holly-0.533
Pie-0.533
 Polly-0.533
Token:
Token ID25
Feature activation-6.84e+00

Pos logit contributions
+3.108
 Hed+2.909
Pie+2.696
 Holly+2.686
 Polly+2.679
Neg logit contributions
 overwhelm-4.102
 overpower-3.708
 confuse-3.699
secut-3.518
 overshadow-3.504
Token O
Token ID440
Feature activation-1.11e+00

Pos logit contributions
+4.113
 Hed+3.836
 Katz+3.522
 Holly+3.477
Pie+3.476
Neg logit contributions
 overwhelm-6.871
 overpower-6.242
 confuse-6.216
 overshadow-5.956
ierrez-5.945
Tokencar
Token ID7718
Feature activation-9.67e-01

Pos logit contributions
+0.862
 Hed+0.787
 Holly+0.739
 Polly+0.739
Pie+0.737
Neg logit contributions
 overwhelm-0.946
 overpower-0.854
 confuse-0.844
 overshadow-0.798
secut-0.797
Tokenina
Token ID1437
Feature activation-4.64e+00

Pos logit contributions
+0.603
 Hed+0.548
 Polly+0.508
Pie+0.506
 Holly+0.505
Neg logit contributions
 overwhelm-0.971
 overpower-0.890
 confuse-0.880
 overshadow-0.843
secut-0.836
Token of
Token ID286
Feature activation-3.29e+00

Pos logit contributions
+3.583
 Hed+3.383
 Polly+3.171
Pie+3.160
 Holly+3.159
Neg logit contributions
 overwhelm-3.881
 overpower-3.494
 confuse-3.442
 overshadow-3.282
secut-3.273
Token Time
Token ID3862
Feature activation-4.51e-01

Pos logit contributions
+2.537
 Hed+2.372
 Holly+2.242
Pie+2.221
 Polly+2.216
Neg logit contributions
 overwhelm-2.793
 overpower-2.518
 confuse-2.484
 overshadow-2.359
secut-2.340
Token Play
Token ID3811
Feature activation+6.11e-02

Pos logit contributions
+0.406
 Hed+0.367
 Holly+0.342
 Polly+0.341
Pie+0.340
Neg logit contributions
 overwhelm-0.578
 overpower-0.527
 confuse-0.521
 overshadow-0.497
secut-0.496
Token of
Token ID286
Feature activation+1.04e+00

Pos logit contributions
 overwhelm+0.049
 overpower+0.044
 confuse+0.043
 overshadow+0.041
secut+0.041
Neg logit contributions
-0.051
 Hed-0.047
 Holly-0.044
 Polly-0.044
Pie-0.044
Token a
Token ID257
Feature activation+6.73e-01

Pos logit contributions
 overwhelm+0.837
 overpower+0.749
 confuse+0.738
 overshadow+0.697
secut+0.692
Neg logit contributions
-0.867
 Hed-0.794
Pie-0.749
 Holly-0.748
 Polly-0.747
Token popular
Token ID2968
Feature activation+2.01e+00

Pos logit contributions
 overwhelm+0.528
 overpower+0.471
 confuse+0.465
 overshadow+0.438
secut+0.435
Neg logit contributions
-0.573
 Hed-0.526
 Holly-0.496
Pie-0.496
 Polly-0.495
Token Halo
Token ID18597
Feature activation-7.60e-01

Pos logit contributions
 overwhelm+1.644
 overpower+1.481
 confuse+1.452
 overshadow+1.375
secut+1.354
Neg logit contributions
-1.639
 Hed-1.483
ertodd-1.417
Pie-1.409
Ton-1.399
Token:
Token ID25
Feature activation-6.80e+00

Pos logit contributions
+0.600
 Hed+0.551
 Holly+0.519
 Polly+0.518
Pie+0.517
Neg logit contributions
 overwhelm-0.640
 overpower-0.575
 confuse-0.569
 overshadow-0.539
secut-0.537
Token Reach
Token ID25146
Feature activation-2.94e+00

Pos logit contributions
+5.066
 Hed+4.718
 Holly+4.463
 Carey+4.436
 Polly+4.425
Neg logit contributions
 overwhelm-5.787
 confuse-5.191
 overpower-5.185
secut-4.947
 overshadow-4.928
Token map
Token ID3975
Feature activation+7.27e-01

Pos logit contributions
+2.355
 Hed+2.195
 Polly+2.067
 Holly+2.067
Pie+2.054
Neg logit contributions
 overwhelm-2.399
 overpower-2.146
 confuse-2.129
secut-2.020
 overshadow-2.007
Token.
Token ID13
Feature activation+4.36e-01

Pos logit contributions
 overwhelm+0.581
 overpower+0.521
 confuse+0.513
 overshadow+0.486
secut+0.481
Neg logit contributions
-0.608
 Hed-0.558
Pie-0.526
 Polly-0.526
 Holly-0.526
Token
Token ID198
Feature activation+1.56e+00

Pos logit contributions
 overwhelm+0.412
 overpower+0.375
 confuse+0.373
 overshadow+0.355
secut+0.353
Neg logit contributions
-0.296
 Hed-0.262
ertodd-0.247
Pie-0.245
 Holly-0.243
Token
Token ID198
Feature activation+1.43e+00

Pos logit contributions
 overwhelm+1.492
 overpower+1.367
 confuse+1.348
secut+1.290
 surpass+1.280
Neg logit contributions
-1.036
 Hed-0.910
ertodd-0.851
 Holly-0.840
 Polly-0.838
Token for
Token ID329
Feature activation-1.31e+00

Pos logit contributions
 overwhelm+0.984
 overpower+0.877
 confuse+0.861
 overshadow+0.815
secut+0.803
Neg logit contributions
-1.063
 Hed-0.970
Pie-0.922
ertodd-0.919
 Polly-0.915
Token it
Token ID340
Feature activation-1.51e+00

Pos logit contributions
+1.065
 Hed+0.991
 Holly+0.936
 Polly+0.932
Pie+0.924
Neg logit contributions
 overwhelm-1.067
 overpower-0.955
 confuse-0.948
secut-0.896
 overshadow-0.893
Token.
Token ID13
Feature activation+4.58e-01

Pos logit contributions
+1.235
 Hed+1.144
 Holly+1.079
 Polly+1.076
 Katz+1.068
Neg logit contributions
 overwhelm-1.219
 overpower-1.087
 confuse-1.081
 overshadow-1.019
secut-1.018
Token If
Token ID1002
Feature activation+7.41e-01

Pos logit contributions
 overwhelm+0.400
 overpower+0.361
 confuse+0.356
 overshadow+0.338
secut+0.337
Neg logit contributions
-0.348
 Hed-0.317
 Holly-0.297
 Polly-0.297
Pie-0.297
Token at
Token ID379
Feature activation-1.01e+00

Pos logit contributions
 overwhelm+0.601
 overpower+0.539
 confuse+0.530
 overshadow+0.502
secut+0.498
Neg logit contributions
-0.611
 Hed-0.560
Pie-0.528
 Holly-0.527
 Polly-0.527
Token least
Token ID1551
Feature activation-6.72e+00

Pos logit contributions
+0.883
 Hed+0.818
 Holly+0.774
 Polly+0.773
Pie+0.772
Neg logit contributions
 overwhelm-0.760
 overpower-0.674
 confuse-0.665
 overshadow-0.626
secut-0.623
Token this
Token ID428
Feature activation+1.94e-01

Pos logit contributions
+5.294
 Hed+5.082
 Holly+4.837
 Polly+4.764
 Katz+4.736
Neg logit contributions
 overwhelm-5.269
 overpower-4.691
 confuse-4.655
secut-4.495
 overshadow-4.420
Token body
Token ID1767
Feature activation+4.16e-01

Pos logit contributions
 overwhelm+0.153
 overpower+0.136
 confuse+0.135
 overshadow+0.127
secut+0.127
Neg logit contributions
-0.163
 Hed-0.151
 Holly-0.143
 Polly-0.142
Pie-0.142
Token can
Token ID460
Feature activation-1.16e+00

Pos logit contributions
 overwhelm+0.322
 overpower+0.287
 confuse+0.283
 overshadow+0.266
secut+0.266
Neg logit contributions
-0.357
 Hed-0.331
 Holly-0.313
 Polly-0.312
Pie-0.311
Token be
Token ID307
Feature activation+1.45e+00

Pos logit contributions
+0.998
 Hed+0.928
 Holly+0.877
 Polly+0.873
Pie+0.871
Neg logit contributions
 overwhelm-0.885
 overpower-0.785
 confuse-0.777
secut-0.734
 overshadow-0.731
Token closer
Token ID5699
Feature activation+2.11e+00

Pos logit contributions
 overwhelm+1.023
 overpower+0.905
 confuse+0.889
 overshadow+0.831
secut+0.821
Neg logit contributions
-1.353
 Hed-1.249
Pie-1.192
 Polly-1.186
ertodd-1.186
Token to
Token ID284
Feature activation+1.89e+00

Pos logit contributions
 overwhelm+0.009
 overpower+0.008
 confuse+0.008
 overshadow+0.007
secut+0.007
Neg logit contributions
-0.009
 Hed-0.008
 Holly-0.008
Pie-0.008
 Polly-0.008
Token tell
Token ID1560
Feature activation+1.70e+00

Pos logit contributions
 overwhelm+1.469
 overpower+1.307
 confuse+1.294
 overshadow+1.219
secut+1.196
Neg logit contributions
-1.646
 Hed-1.497
Pie-1.428
 Holly-1.417
Ton-1.412
Token people
Token ID661
Feature activation+5.80e-01

Pos logit contributions
 overwhelm+1.394
 overpower+1.249
 confuse+1.235
 overshadow+1.166
secut+1.161
Neg logit contributions
-1.388
 Hed-1.258
Pie-1.190
 Holly-1.186
ertodd-1.185
Token that
Token ID326
Feature activation-3.03e-01

Pos logit contributions
 overwhelm+0.453
 overpower+0.404
 confuse+0.399
 overshadow+0.376
secut+0.373
Neg logit contributions
-0.497
 Hed-0.455
Pie-0.431
 Holly-0.430
 Polly-0.430
Token at
Token ID379
Feature activation-1.15e+00

Pos logit contributions
+0.249
 Hed+0.230
 Holly+0.217
 Polly+0.217
Pie+0.216
Neg logit contributions
 overwhelm-0.244
 overpower-0.219
 confuse-0.216
 overshadow-0.204
secut-0.203
Token least
Token ID1551
Feature activation-6.69e+00

Pos logit contributions
+0.913
 Hed+0.841
 Holly+0.791
 Polly+0.790
Pie+0.787
Neg logit contributions
 overwhelm-0.963
 overpower-0.864
 confuse-0.855
 overshadow-0.810
secut-0.806
Token he
Token ID339
Feature activation+9.26e-01

Pos logit contributions
+5.118
 Hed+5.021
 Holly+4.750
 Polly+4.749
 Katz+4.719
Neg logit contributions
 overwhelm-5.325
 overpower-4.718
 confuse-4.648
secut-4.529
ierrez-4.396
Token did
Token ID750
Feature activation+1.06e-01

Pos logit contributions
 overwhelm+0.684
 overpower+0.608
 confuse+0.599
 overshadow+0.563
secut+0.557
Neg logit contributions
-0.833
 Hed-0.766
Pie-0.729
 Holly-0.727
 Polly-0.724
Token this
Token ID428
Feature activation+1.83e+00

Pos logit contributions
 overwhelm+0.086
 overpower+0.077
 confuse+0.076
 overshadow+0.072
secut+0.072
Neg logit contributions
-0.087
 Hed-0.080
Pie-0.075
 Holly-0.075
 Polly-0.075
Token..
Token ID492
Feature activation+2.14e+00

Pos logit contributions
 overwhelm+1.414
 overpower+1.262
 confuse+1.243
 overshadow+1.175
secut+1.161
Neg logit contributions
-1.593
 Hed-1.457
Pie-1.383
 Holly-1.380
 Polly-1.378
Token.
Token ID764
Feature activation+3.43e+00

Pos logit contributions
 overwhelm+1.775
 overpower+1.593
 confuse+1.578
 overshadow+1.491
secut+1.484
Neg logit contributions
-1.720
 Hed-1.564
 Holly-1.472
 Polly-1.472
Pie-1.468
Tokenie
Token ID494
Feature activation+1.08e+00

Pos logit contributions
 overwhelm+0.237
 overpower+0.214
 confuse+0.211
 overshadow+0.200
secut+0.199
Neg logit contributions
-0.218
 Hed-0.200
 Holly-0.188
 Polly-0.188
Pie-0.187
Token for
Token ID329
Feature activation+3.30e-01

Pos logit contributions
 overwhelm+0.857
 overpower+0.765
 confuse+0.756
 overshadow+0.713
secut+0.709
Neg logit contributions
-0.903
 Hed-0.831
 Holly-0.784
 Polly-0.782
Pie-0.782
Token the
Token ID262
Feature activation-7.56e-01

Pos logit contributions
 overwhelm+0.270
 overpower+0.242
 confuse+0.240
 overshadow+0.226
secut+0.225
Neg logit contributions
-0.269
 Hed-0.246
Pie-0.232
 Polly-0.231
ertodd-0.231
Token X
Token ID1395
Feature activation-2.37e+00

Pos logit contributions
+0.626
 Hed+0.573
 Holly+0.540
Pie+0.540
 Polly+0.540
Neg logit contributions
 overwhelm-0.611
 overpower-0.548
 confuse-0.542
 overshadow-0.510
secut-0.508
Token-
Token ID12
Feature activation-2.45e+00

Pos logit contributions
+1.810
 Hed+1.681
 Holly+1.572
Pie+1.553
 Polly+1.551
Neg logit contributions
 overwhelm-2.029
 overpower-1.815
 confuse-1.810
secut-1.710
 overshadow-1.710
TokenMen
Token ID10418
Feature activation-6.57e+00

Pos logit contributions
+1.667
 Hed+1.514
Pie+1.415
 Holly+1.408
Ton+1.397
Neg logit contributions
 overwhelm-2.323
 overpower-2.108
 confuse-2.090
 overshadow-1.992
secut-1.987
Token that
Token ID326
Feature activation-1.12e-01

Pos logit contributions
+4.909
 Hed+4.638
 Holly+4.343
Pie+4.343
 Polly+4.297
Neg logit contributions
 overwhelm-5.449
 overpower-4.885
 confuse-4.835
secut-4.586
 overshadow-4.565
Token could
Token ID714
Feature activation+1.06e-01

Pos logit contributions
+0.095
 Hed+0.087
 Holly+0.082
 Polly+0.082
Pie+0.082
Neg logit contributions
 overwhelm-0.088
 overpower-0.079
 confuse-0.078
 overshadow-0.073
secut-0.073
Token
Token ID447
Feature activation-3.61e-01

Pos logit contributions
 overwhelm+0.079
 overpower+0.070
 confuse+0.069
secut+0.065
 overshadow+0.065
Neg logit contributions
-0.093
 Hed-0.086
 Holly-0.081
 Polly-0.081
Pie-0.081
Token
Token ID247
Feature activation-4.34e-01

Pos logit contributions
+0.343
 Hed+0.319
 Holly+0.304
 Polly+0.304
Pie+0.303
Neg logit contributions
 overwhelm-0.245
 overpower-0.214
 confuse-0.212
 overshadow-0.197
secut-0.197
Tokenve
Token ID303
Feature activation+2.38e+00

Pos logit contributions
+0.295
 Hed+0.266
 Holly+0.248
 Polly+0.247
Pie+0.246
Neg logit contributions
 overwhelm-0.414
 overpower-0.377
 confuse-0.373
 overshadow-0.356
secut-0.356
Token the
Token ID262
Feature activation+2.10e-01

Pos logit contributions
 overwhelm+0.915
 overpower+0.819
 confuse+0.806
 overshadow+0.760
secut+0.753
Neg logit contributions
-0.999
 Hed-0.916
Pie-0.868
 Holly-0.864
ertodd-0.863
Token maps
Token ID8739
Feature activation-1.31e+00

Pos logit contributions
 overwhelm+0.160
 overpower+0.142
 confuse+0.140
 overshadow+0.132
secut+0.131
Neg logit contributions
-0.184
 Hed-0.169
 Holly-0.160
Pie-0.160
 Polly-0.160
Token exclusive
Token ID8568
Feature activation-2.74e+00

Pos logit contributions
+1.084
 Hed+1.004
 Holly+0.949
 Polly+0.945
Pie+0.942
Neg logit contributions
 overwhelm-1.058
 overpower-0.944
 confuse-0.933
secut-0.880
 overshadow-0.879
Token to
Token ID284
Feature activation+8.99e-01

Pos logit contributions
+2.212
 Hed+2.050
 Holly+1.933
 Polly+1.924
Pie+1.920
Neg logit contributions
 overwhelm-2.246
 overpower-2.015
 confuse-1.991
secut-1.880
 overshadow-1.879
Token Halo
Token ID18597
Feature activation-3.34e+00

Pos logit contributions
 overwhelm+0.721
 overpower+0.646
 confuse+0.639
 overshadow+0.602
secut+0.597
Neg logit contributions
-0.747
 Hed-0.680
Pie-0.645
ertodd-0.644
 Polly-0.641
Token:
Token ID25
Feature activation-6.56e+00

Pos logit contributions
+2.392
 Hed+2.242
 Holly+2.099
Pie+2.087
 Polly+2.083
Neg logit contributions
 overwhelm-2.969
 overpower-2.680
 confuse-2.668
secut-2.531
 overshadow-2.523
Token Combat
Token ID14487
Feature activation-4.65e+00

Pos logit contributions
+4.660
 Hed+4.345
 Holly+4.086
 Carey+4.068
 Katz+4.036
Neg logit contributions
 overwhelm-5.790
 overpower-5.235
 confuse-5.225
 overshadow-4.983
secut-4.955
Token Ev
Token ID4319
Feature activation+1.99e+00

Pos logit contributions
+1.966
 Hed+1.778
 Holly+1.612
 Polly+1.604
 Katz+1.542
Neg logit contributions
 overwhelm-5.461
 overpower-5.097
 confuse-5.084
secut-4.966
ierrez-4.915
Tokenolved
Token ID5634
Feature activation-8.36e-01

Pos logit contributions
 overwhelm+1.682
 overpower+1.512
 confuse+1.489
 overshadow+1.414
secut+1.408
Neg logit contributions
-1.565
 Hed-1.438
 Holly-1.355
 Polly-1.353
ertodd-1.345
Token PC
Token ID4217
Feature activation-3.31e-01

Pos logit contributions
+0.662
 Hed+0.608
 Holly+0.572
 Polly+0.571
Pie+0.569
Neg logit contributions
 overwhelm-0.702
 overpower-0.631
 confuse-0.623
 overshadow-0.591
secut-0.589
Token.
Token ID13
Feature activation-2.92e-01

Pos logit contributions
+0.270
 Hed+0.249
 Holly+0.234
 Polly+0.234
Pie+0.233
Neg logit contributions
 overwhelm-0.270
 overpower-0.242
 confuse-0.239
 overshadow-0.226
secut-0.225
TokenTo
Token ID2514
Feature activation+4.52e-01

Pos logit contributions
 overwhelm+0.192
 overpower+0.174
 confuse+0.173
secut+0.164
 overshadow+0.164
Neg logit contributions
-0.144
 Hed-0.131
 Polly-0.122
 Holly-0.122
ertodd-0.121
Token celebrate
Token ID10648
Feature activation-1.07e+00

Pos logit contributions
 overwhelm+0.352
 overpower+0.313
 confuse+0.310
 overshadow+0.292
secut+0.290
Neg logit contributions
-0.386
 Hed-0.357
 Holly-0.337
 Polly-0.337
Pie-0.336
Token the
Token ID262
Feature activation+7.47e-01

Pos logit contributions
+0.871
 Hed+0.807
 Holly+0.761
 Polly+0.758
Pie+0.756
Neg logit contributions
 overwhelm-0.867
 overpower-0.776
 confuse-0.767
secut-0.724
 overshadow-0.723
Token upcoming
Token ID7865
Feature activation-1.06e+00

Pos logit contributions
 overwhelm+0.586
 overpower+0.523
 confuse+0.515
 overshadow+0.487
secut+0.483
Neg logit contributions
-0.636
 Hed-0.583
Pie-0.551
 Polly-0.551
 Holly-0.550
Token Halo
Token ID18597
Feature activation-2.21e+00

Pos logit contributions
+0.889
 Hed+0.823
 Holly+0.777
 Polly+0.775
Pie+0.773
Neg logit contributions
 overwhelm-0.835
 overpower-0.745
 confuse-0.736
 overshadow-0.694
secut-0.692
Token:
Token ID25
Feature activation-6.49e+00

Pos logit contributions
+1.615
 Hed+1.484
 Holly+1.388
Pie+1.383
 Polly+1.383
Neg logit contributions
 overwhelm-1.985
 overpower-1.797
 confuse-1.781
 overshadow-1.691
secut-1.685
Token The
Token ID383
Feature activation-2.36e+00

Pos logit contributions
+4.596
 Hed+4.326
 Holly+4.059
 Carey+4.022
Pie+3.981
Neg logit contributions
 overwhelm-5.732
 overpower-5.188
 confuse-5.187
 overshadow-4.943
secut-4.916
Token Master
Token ID5599
Feature activation-1.28e+00

Pos logit contributions
+1.560
 Hed+1.469
 Holly+1.373
Pie+1.364
 Polly+1.363
Neg logit contributions
 overwhelm-2.229
 overpower-2.029
 confuse-2.012
 overshadow-1.929
secut-1.905
Token Chief
Token ID5953
Feature activation-2.15e+00

Pos logit contributions
+0.923
 Hed+0.827
 Holly+0.770
 Polly+0.769
Pie+0.767
Neg logit contributions
 overwhelm-1.180
 overpower-1.068
 confuse-1.053
 overshadow-1.009
secut-0.999
Token Collection
Token ID12251
Feature activation+1.88e+00

Pos logit contributions
+1.281
 Hed+1.148
 Holly+1.056
 Polly+1.054
Pie+1.050
Neg logit contributions
 overwhelm-2.217
 overpower-2.035
 confuse-2.017
 overshadow-1.930
secut-1.927
Token,
Token ID11
Feature activation-2.54e-01

Pos logit contributions
 overwhelm+1.512
 overpower+1.349
 confuse+1.326
 overshadow+1.263
secut+1.243
Neg logit contributions
-1.579
 Hed-1.430
ertodd-1.364
Pie-1.348
Ton-1.346
Token been
Token ID587
Feature activation-5.86e-01

Pos logit contributions
+1.299
 Hed+1.209
 Holly+1.147
 Polly+1.145
Pie+1.142
Neg logit contributions
 overwhelm-1.037
 overpower-0.915
 confuse-0.903
 overshadow-0.846
secut-0.844
Token dark
Token ID3223
Feature activation+1.92e+00

Pos logit contributions
+0.545
 Hed+0.508
 Holly+0.483
 Polly+0.483
Pie+0.481
Neg logit contributions
 overwhelm-0.410
 overpower-0.359
 confuse-0.355
 overshadow-0.331
secut-0.330
Token for
Token ID329
Feature activation-9.83e-01

Pos logit contributions
 overwhelm+1.510
 overpower+1.354
 confuse+1.325
 overshadow+1.253
secut+1.242
Neg logit contributions
-1.624
 Hed-1.484
Pie-1.410
 Polly-1.404
 Holly-1.399
Token a
Token ID257
Feature activation-2.87e+00

Pos logit contributions
+0.848
 Hed+0.783
 Holly+0.741
 Polly+0.739
Pie+0.739
Neg logit contributions
 overwhelm-0.758
 overpower-0.675
 confuse-0.665
 overshadow-0.627
secut-0.624
Token little
Token ID1310
Feature activation-1.52e+00

Pos logit contributions
+2.418
 Hed+2.246
 Holly+2.130
 Polly+2.124
 Katz+2.104
Neg logit contributions
 overwhelm-2.243
 overpower-1.998
 confuse-1.970
 overshadow-1.863
secut-1.843
Token while
Token ID981
Feature activation-6.46e+00

Pos logit contributions
+1.255
 Hed+1.164
 Holly+1.100
 Polly+1.099
Pie+1.088
Neg logit contributions
 overwhelm-1.219
 overpower-1.089
 confuse-1.077
 overshadow-1.016
secut-1.013
Token.
Token ID13
Feature activation-2.83e-01

Pos logit contributions
+4.785
 Hed+4.504
 Holly+4.247
 Polly+4.225
 Katz+4.157
Neg logit contributions
 overwhelm-5.438
 overpower-4.856
 confuse-4.848
 overshadow-4.573
secut-4.564
Token So
Token ID1406
Feature activation-1.69e-01

Pos logit contributions
+0.218
 Hed+0.199
 Holly+0.187
Pie+0.187
 Polly+0.186
Neg logit contributions
 overwhelm-0.243
 overpower-0.219
 confuse-0.217
 overshadow-0.206
secut-0.205
Token,
Token ID11
Feature activation-9.87e-01

Pos logit contributions
+0.134
 Hed+0.123
 Holly+0.115
 Polly+0.115
Pie+0.115
Neg logit contributions
 overwhelm-0.143
 overpower-0.129
 confuse-0.127
 overshadow-0.120
secut-0.120
Token I
Token ID314
Feature activation-9.70e-01

Pos logit contributions
+0.833
 Hed+0.771
 Holly+0.729
 Polly+0.727
Pie+0.723
Neg logit contributions
 overwhelm-0.778
 overpower-0.693
 confuse-0.684
 overshadow-0.645
secut-0.644
Token got
Token ID1392
Feature activation-9.72e-01

Pos logit contributions
+0.881
 Hed+0.819
 Holly+0.777
 Polly+0.776
Pie+0.773
Neg logit contributions
 overwhelm-0.700
 overpower-0.617
 confuse-0.609
 overshadow-0.571
secut-0.569
Token promotional
Token ID18767
Feature activation-2.80e+00

Pos logit contributions
+1.421
 Hed+1.368
 Polly+1.272
 Holly+1.263
Pie+1.243
Neg logit contributions
 overwhelm-2.554
 overpower-2.340
 confuse-2.319
 overshadow-2.235
secut-2.179
Token offers
Token ID4394
Feature activation-1.94e+00

Pos logit contributions
+2.187
 Hed+2.036
 Polly+1.909
 Holly+1.908
Pie+1.900
Neg logit contributions
 overwhelm-2.359
 overpower-2.119
 confuse-2.101
 overshadow-1.995
secut-1.976
Token for
Token ID329
Feature activation-1.89e+00

Pos logit contributions
+1.709
 Hed+1.599
 Polly+1.523
 Holly+1.515
Pie+1.509
Neg logit contributions
 overwhelm-1.455
 overpower-1.286
 confuse-1.265
 overshadow-1.200
secut-1.186
Token programs
Token ID4056
Feature activation-1.48e+00

Pos logit contributions
+1.621
 Hed+1.524
 Polly+1.456
 Holly+1.444
 Katz+1.438
Neg logit contributions
 overwhelm-1.433
 overpower-1.278
 confuse-1.255
 overshadow-1.184
secut-1.169
Token that
Token ID326
Feature activation-2.10e+00

Pos logit contributions
+1.149
 Hed+1.053
 Holly+0.989
 Polly+0.987
Pie+0.985
Neg logit contributions
 overwhelm-1.262
 overpower-1.137
 confuse-1.123
 overshadow-1.065
secut-1.061
Token support
Token ID1104
Feature activation-6.46e+00

Pos logit contributions
+1.692
 Hed+1.590
 Polly+1.504
 Holly+1.497
 Katz+1.486
Neg logit contributions
 overwhelm-1.686
 overpower-1.509
 confuse-1.487
 overshadow-1.415
secut-1.395
Token The
Token ID383
Feature activation-1.11e+00

Pos logit contributions
+4.243
 Hed+3.971
 Polly+3.745
 Carey+3.688
 Holly+3.661
Neg logit contributions
 overwhelm-6.180
 overpower-5.699
 confuse-5.637
 overshadow-5.404
secut-5.319
Token Nation
Token ID8741
Feature activation-1.21e+00

Pos logit contributions
+0.908
 Hed+0.835
 Holly+0.787
 Polly+0.785
Pie+0.784
Neg logit contributions
 overwhelm-0.910
 overpower-0.815
 confuse-0.805
 overshadow-0.761
secut-0.758
Token
Token ID447
Feature activation-8.51e-01

Pos logit contributions
+1.009
 Hed+0.931
 Polly+0.879
 Holly+0.879
Pie+0.875
Neg logit contributions
 overwhelm-0.968
 overpower-0.865
 confuse-0.854
 overshadow-0.807
secut-0.803
Token
Token ID247
Feature activation-1.50e+00

Pos logit contributions
+0.800
 Hed+0.742
 Polly+0.709
 Holly+0.707
Pie+0.706
Neg logit contributions
 overwhelm-0.586
 overpower-0.512
 confuse-0.509
 overshadow-0.475
secut-0.473
Tokens
Token ID82
Feature activation-1.87e+00

Pos logit contributions
+1.100
 Hed+1.012
Pie+0.949
 Polly+0.947
 Holly+0.942
Neg logit contributions
 overwhelm-1.351
 overpower-1.221
 confuse-1.206
 overshadow-1.161
secut-1.133
Token.
Token ID13
Feature activation+7.09e-01

Pos logit contributions
 overwhelm+2.232
 overpower+2.003
 confuse+1.968
 overshadow+1.856
secut+1.843
Neg logit contributions
-2.280
 Hed-2.076
Pie-1.965
 Holly-1.962
 Polly-1.956
Token Leave
Token ID17446
Feature activation+1.34e+00

Pos logit contributions
 overwhelm+0.628
 overpower+0.567
 confuse+0.562
 overshadow+0.532
secut+0.531
Neg logit contributions
-0.528
 Hed-0.477
ertodd-0.448
 Holly-0.447
Pie-0.447
Token it
Token ID340
Feature activation-4.83e-01

Pos logit contributions
 overwhelm+1.024
 overpower+0.914
 confuse+0.895
 overshadow+0.845
secut+0.837
Neg logit contributions
-1.168
 Hed-1.067
ertodd-1.012
Pie-1.012
 Holly-1.008
Token for
Token ID329
Feature activation+2.53e-01

Pos logit contributions
+0.422
 Hed+0.390
 Holly+0.369
 Polly+0.368
Pie+0.368
Neg logit contributions
 overwhelm-0.367
 overpower-0.327
 confuse-0.322
 overshadow-0.303
secut-0.301
Token at
Token ID379
Feature activation-2.54e+00

Pos logit contributions
 overwhelm+0.200
 overpower+0.179
 confuse+0.176
 overshadow+0.167
secut+0.165
Neg logit contributions
-0.214
 Hed-0.195
Pie-0.184
ertodd-0.184
 Holly-0.184
Token least
Token ID1551
Feature activation-6.45e+00

Pos logit contributions
+1.903
 Hed+1.745
 Holly+1.639
 Polly+1.633
Pie+1.624
Neg logit contributions
 overwhelm-2.227
 overpower-2.006
 confuse-1.987
 overshadow-1.893
secut-1.875
Token an
Token ID281
Feature activation-3.00e+00

Pos logit contributions
+4.542
 Hed+4.399
 Polly+4.165
 Holly+4.161
 Carey+4.082
Neg logit contributions
 overwhelm-5.567
 overpower-4.985
 confuse-4.953
secut-4.761
 overshadow-4.715
Token hour
Token ID1711
Feature activation-1.64e-01

Pos logit contributions
+2.432
 Hed+2.283
 Holly+2.147
 Polly+2.132
Pie+2.111
Neg logit contributions
 overwhelm-2.407
 overpower-2.139
 confuse-2.118
secut-2.015
 overshadow-2.008
Token.
Token ID13
Feature activation+1.51e+00

Pos logit contributions
+0.136
 Hed+0.126
 Holly+0.119
 Polly+0.118
Pie+0.118
Neg logit contributions
 overwhelm-0.131
 overpower-0.117
 confuse-0.116
 overshadow-0.109
secut-0.109
Token (
Token ID357
Feature activation+6.84e-01

Pos logit contributions
 overwhelm+1.350
 overpower+1.221
 confuse+1.210
secut+1.144
 overshadow+1.142
Neg logit contributions
-1.102
 Hed-0.986
ertodd-0.936
Pie-0.923
 Holly-0.921
TokenNo
Token ID2949
Feature activation+1.06e+00

Pos logit contributions
 overwhelm+0.631
 overpower+0.573
 confuse+0.568
secut+0.539
 overshadow+0.538
Neg logit contributions
-0.484
 Hed-0.440
 Polly-0.410
 Holly-0.409
 Katz-0.406
Token and
Token ID290
Feature activation-1.10e-01

Pos logit contributions
+1.588
 Hed+1.472
 Polly+1.391
 Holly+1.385
Pie+1.377
Neg logit contributions
 overwhelm-1.809
 overpower-1.627
 confuse-1.611
secut-1.538
 overshadow-1.536
Token mix
Token ID5022
Feature activation-5.20e-01

Pos logit contributions
+0.100
 Hed+0.093
 Holly+0.088
 Polly+0.088
Pie+0.088
Neg logit contributions
 overwhelm-0.080
 overpower-0.071
 confuse-0.070
 overshadow-0.065
secut-0.065
Token well
Token ID880
Feature activation-5.07e+00

Pos logit contributions
+0.481
 Hed+0.447
 Holly+0.425
 Polly+0.424
Pie+0.423
Neg logit contributions
 overwhelm-0.368
 overpower-0.324
 confuse-0.319
 overshadow-0.298
secut-0.297
Token.
Token ID13
Feature activation+1.42e-01

Pos logit contributions
+3.839
 Hed+3.590
 Polly+3.373
 Holly+3.361
 Katz+3.320
Neg logit contributions
 overwhelm-4.209
 overpower-3.765
 confuse-3.764
secut-3.603
 overshadow-3.582
Token Season
Token ID7369
Feature activation+4.70e-01

Pos logit contributions
 overwhelm+0.126
 overpower+0.114
 confuse+0.113
 overshadow+0.107
secut+0.107
Neg logit contributions
-0.106
 Hed-0.096
 Holly-0.090
 Polly-0.090
Pie-0.090
Token with
Token ID351
Feature activation-6.44e+00

Pos logit contributions
 overwhelm+0.350
 overpower+0.310
 confuse+0.306
 overshadow+0.287
secut+0.286
Neg logit contributions
-0.418
 Hed-0.387
 Holly-0.366
 Polly-0.366
Pie-0.365
Token salt
Token ID8268
Feature activation+9.16e-01

Pos logit contributions
+5.041
 Hed+4.861
 Holly+4.613
 Katz+4.571
 Polly+4.563
Neg logit contributions
 overwhelm-5.056
 confuse-4.469
 overpower-4.397
 overshadow-4.260
secut-4.256
Token,
Token ID11
Feature activation-1.43e+00

Pos logit contributions
 overwhelm+0.787
 overpower+0.710
 confuse+0.700
 overshadow+0.662
secut+0.659
Neg logit contributions
-0.713
 Hed-0.649
 Holly-0.610
Pie-0.609
 Polly-0.608
Token pepper
Token ID13385
Feature activation-3.07e-02

Pos logit contributions
+1.080
 Hed+0.990
 Holly+0.930
 Polly+0.927
Pie+0.921
Neg logit contributions
 overwhelm-1.253
 overpower-1.129
 confuse-1.118
 overshadow-1.064
secut-1.061
Token and
Token ID290
Feature activation-7.19e-02

Pos logit contributions
+0.023
 Hed+0.021
 Holly+0.020
 Polly+0.020
Pie+0.020
Neg logit contributions
 overwhelm-0.027
 overpower-0.024
 confuse-0.024
 overshadow-0.023
secut-0.022
Token chill
Token ID20493
Feature activation-1.74e+00

Pos logit contributions
+0.063
 Hed+0.058
 Holly+0.055
Pie+0.055
 Polly+0.055
Neg logit contributions
 overwhelm-0.055
 overpower-0.049
 confuse-0.048
 overshadow-0.045
secut-0.045
Token data
Token ID1366
Feature activation-8.99e-01

Pos logit contributions
+0.278
 Hed+0.256
 Holly+0.242
 Polly+0.241
Pie+0.241
Neg logit contributions
 overwhelm-0.271
 overpower-0.242
 confuse-0.239
 overshadow-0.226
secut-0.225
Token available
Token ID1695
Feature activation-1.33e-01

Pos logit contributions
+0.793
 Hed+0.740
 Holly+0.701
 Polly+0.698
Pie+0.696
Neg logit contributions
 overwhelm-0.669
 overpower-0.593
 confuse-0.586
secut-0.550
 overshadow-0.549
Token to
Token ID284
Feature activation+4.00e-02

Pos logit contributions
+0.109
 Hed+0.101
 Holly+0.095
 Polly+0.095
Pie+0.094
Neg logit contributions
 overwhelm-0.108
 overpower-0.097
 confuse-0.095
secut-0.090
 overshadow-0.090
Token give
Token ID1577
Feature activation-2.79e+00

Pos logit contributions
 overwhelm+0.032
 overpower+0.029
 confuse+0.029
 overshadow+0.027
secut+0.027
Neg logit contributions
-0.033
 Hed-0.030
 Holly-0.029
 Polly-0.028
Pie-0.028
Token policymakers
Token ID29484
Feature activation-2.13e+00

Pos logit contributions
+2.369
 Hed+2.240
 Holly+2.116
 Polly+2.109
Pie+2.109
Neg logit contributions
 overwhelm-2.125
 overpower-1.892
 confuse-1.863
secut-1.764
 overshadow-1.762
Token accurate
Token ID7187
Feature activation-6.43e+00

Pos logit contributions
+1.742
 Hed+1.647
 Holly+1.551
 Polly+1.549
Pie+1.541
Neg logit contributions
 overwhelm-1.685
 overpower-1.506
 confuse-1.486
secut-1.411
 overshadow-1.408
Token estimates
Token ID7746
Feature activation-3.81e-01

Pos logit contributions
+4.446
 Hed+4.283
 Polly+3.982
Pie+3.974
 Holly+3.929
Neg logit contributions
 overwhelm-5.595
 overpower-5.071
 confuse-5.022
secut-4.946
 overshadow-4.774
Token of
Token ID286
Feature activation-3.74e-01

Pos logit contributions
+0.318
 Hed+0.298
 Holly+0.280
 Polly+0.280
 Katz+0.278
Neg logit contributions
 overwhelm-0.301
 overpower-0.268
 confuse-0.265
secut-0.251
 overshadow-0.249
Token monet
Token ID32153
Feature activation-2.01e+00

Pos logit contributions
+0.317
 Hed+0.293
 Holly+0.277
 Polly+0.276
Pie+0.276
Neg logit contributions
 overwhelm-0.294
 overpower-0.262
 confuse-0.259
 overshadow-0.244
secut-0.243
Tokenized
Token ID1143
Feature activation-2.14e+00

Pos logit contributions
+1.732
 Hed+1.623
 Polly+1.514
 Holly+1.512
Pie+1.511
Neg logit contributions
 overwhelm-1.542
 overpower-1.376
 confuse-1.353
 overshadow-1.277
secut-1.263
Token damages
Token ID12616
Feature activation-1.63e+00

Pos logit contributions
+1.629
 Hed+1.515
 Polly+1.413
 Holly+1.412
Pie+1.399
Neg logit contributions
 overwhelm-1.845
 overpower-1.669
 confuse-1.649
secut-1.572
 overshadow-1.566
Token
Token ID198
Feature activation-5.92e-01

Pos logit contributions
+0.498
 Hed+0.451
 Holly+0.420
 Polly+0.419
Pie+0.418
Neg logit contributions
 overwhelm-0.684
 overpower-0.623
 confuse-0.616
 overshadow-0.588
secut-0.586
Token
Token ID198
Feature activation-2.20e-01

Pos logit contributions
+0.365
 Hed+0.323
 Holly+0.298
 Polly+0.297
Pie+0.295
Neg logit contributions
 overwhelm-0.602
 overpower-0.552
 confuse-0.546
secut-0.522
 overshadow-0.522
Token13
Token ID1485
Feature activation+2.21e+00

Pos logit contributions
+0.170
 Hed+0.156
 Polly+0.147
 Holly+0.147
ertodd+0.145
Neg logit contributions
 overwhelm-0.188
 overpower-0.169
 confuse-0.167
secut-0.159
 overshadow-0.158
Token.
Token ID13
Feature activation+6.67e-01

Pos logit contributions
 overwhelm+1.649
 overpower+1.473
 confuse+1.450
secut+1.357
 overshadow+1.350
Neg logit contributions
-1.917
 Hed-1.747
ertodd-1.679
 Holly-1.654
 Polly-1.647
Token Resident
Token ID22373
Feature activation-2.52e+00

Pos logit contributions
 overwhelm+0.606
 overpower+0.550
 confuse+0.544
 overshadow+0.517
secut+0.515
Neg logit contributions
-0.479
 Hed-0.434
 Holly-0.406
Pie-0.406
 Polly-0.405
Token Evil
Token ID10461
Feature activation-6.42e+00

Pos logit contributions
+1.857
 Hed+1.702
 Holly+1.593
 Polly+1.590
Pie+1.584
Neg logit contributions
 overwhelm-2.248
 overpower-2.036
 confuse-2.015
 overshadow-1.914
secut-1.908
Token 4
Token ID604
Feature activation-1.79e+00

Pos logit contributions
+4.203
 Hed+3.965
 Polly+3.701
 Katz+3.689
Pie+3.686
Neg logit contributions
 overwhelm-5.973
 overpower-5.426
 confuse-5.401
 overshadow-5.139
secut-5.105
Token
Token ID198
Feature activation-1.83e-01

Pos logit contributions
+1.154
 Hed+1.043
 Holly+0.963
 Polly+0.959
Pie+0.959
Neg logit contributions
 overwhelm-1.763
 overpower-1.610
 confuse-1.596
 overshadow-1.526
secut-1.521
Token
Token ID198
Feature activation-5.44e-01

Pos logit contributions
+0.114
 Hed+0.101
 Holly+0.093
 Polly+0.093
ertodd+0.092
Neg logit contributions
 overwhelm-0.184
 overpower-0.169
 confuse-0.167
secut-0.159
 overshadow-0.159
TokenMis
Token ID31281
Feature activation-9.34e-01

Pos logit contributions
+0.353
 Hed+0.319
 Polly+0.295
 Holly+0.294
ertodd+0.291
Neg logit contributions
 overwhelm-0.532
 overpower-0.486
 confuse-0.482
secut-0.460
 overshadow-0.459
Tokenao
Token ID5488
Feature activation+7.53e-01

Pos logit contributions
+0.720
 Hed+0.659
 Holly+0.619
 Polly+0.618
Pie+0.617
Neg logit contributions
 overwhelm-0.805
 overpower-0.726
 confuse-0.715
 overshadow-0.681
secut-0.676
Token with
Token ID351
Feature activation+3.28e-01

Pos logit contributions
 overwhelm+0.886
 overpower+0.774
 confuse+0.763
 overshadow+0.710
secut+0.706
Neg logit contributions
-1.267
 Hed-1.179
Pie-1.121
 Holly-1.121
 Katz-1.117
Token activity
Token ID3842
Feature activation-6.85e-01

Pos logit contributions
 overwhelm+0.254
 overpower+0.227
 confuse+0.223
 overshadow+0.210
secut+0.208
Neg logit contributions
-0.284
 Hed-0.260
 Holly-0.246
Pie-0.246
 Polly-0.245
Token when
Token ID618
Feature activation-1.29e+00

Pos logit contributions
+0.573
 Hed+0.530
 Holly+0.500
 Polly+0.500
Pie+0.498
Neg logit contributions
 overwhelm-0.544
 overpower-0.485
 confuse-0.480
secut-0.451
 overshadow-0.451
Token PM
Token ID3122
Feature activation-1.21e+00

Pos logit contributions
+0.949
 Hed+0.873
 Holly+0.816
 Polly+0.816
Pie+0.809
Neg logit contributions
 overwhelm-1.156
 overpower-1.043
 confuse-1.034
secut-0.982
 overshadow-0.981
Token Modi
Token ID14637
Feature activation+8.23e-01

Pos logit contributions
+0.739
 Hed+0.666
 Holly+0.612
 Polly+0.611
Pie+0.606
Neg logit contributions
 overwhelm-1.238
 overpower-1.134
 confuse-1.123
secut-1.075
 overshadow-1.073
Token caus
Token ID26846
Feature activation+5.38e+00

Pos logit contributions
 overwhelm+0.623
 overpower+0.555
 confuse+0.546
 overshadow+0.515
secut+0.507
Neg logit contributions
-0.724
 Hed-0.666
Pie-0.631
 Holly-0.630
 Polly-0.627
Tokenally
Token ID453
Feature activation-1.11e-01

Pos logit contributions
 overwhelm+3.468
 overpower+2.973
 confuse+2.957
 overshadow+2.761
secut+2.740
Neg logit contributions
-5.250
 Hed-4.839
ertodd-4.665
Pie-4.661
 Holly-4.654
Token walked
Token ID6807
Feature activation+6.32e-01

Pos logit contributions
+0.101
 Hed+0.094
 Holly+0.089
 Polly+0.089
Pie+0.089
Neg logit contributions
 overwhelm-0.080
 overpower-0.070
 confuse-0.069
 overshadow-0.065
secut-0.065
Token in
Token ID287
Feature activation+9.36e-01

Pos logit contributions
 overwhelm+0.476
 overpower+0.424
 confuse+0.417
 overshadow+0.394
secut+0.390
Neg logit contributions
-0.559
 Hed-0.513
Pie-0.487
 Holly-0.485
 Polly-0.484
Token to
Token ID284
Feature activation+6.68e-01

Pos logit contributions
 overwhelm+0.743
 overpower+0.663
 confuse+0.654
 overshadow+0.618
secut+0.615
Neg logit contributions
-0.786
 Hed-0.724
 Holly-0.684
 Polly-0.683
Pie-0.682
Token eat
Token ID4483
Feature activation+2.36e-01

Pos logit contributions
 overwhelm+0.521
 overpower+0.464
 confuse+0.458
 overshadow+0.432
secut+0.429
Neg logit contributions
-0.571
 Hed-0.526
 Holly-0.497
 Polly-0.496
Pie-0.496
Token
Token ID447
Feature activation+1.46e+00

Pos logit contributions
 overwhelm+1.426
 overpower+1.270
 confuse+1.251
secut+1.187
 overshadow+1.172
Neg logit contributions
-1.580
 Hed-1.459
 Holly-1.387
 Polly-1.381
ertodd-1.373
Token
Token ID247
Feature activation+7.29e-01

Pos logit contributions
 overwhelm+1.037
 overpower+0.913
 confuse+0.895
 overshadow+0.841
secut+0.831
Neg logit contributions
-1.353
 Hed-1.254
 Holly-1.190
 Polly-1.189
Pie-1.184
Tokent
Token ID83
Feature activation-4.94e-01

Pos logit contributions
 overwhelm+0.677
 overpower+0.615
 confuse+0.607
secut+0.583
 overshadow+0.580
Neg logit contributions
-0.515
 Hed-0.463
 Holly-0.434
 Polly-0.433
Pie-0.433
Token got
Token ID1392
Feature activation-2.67e-03

Pos logit contributions
+0.438
 Hed+0.407
 Holly+0.385
 Polly+0.385
Pie+0.383
Neg logit contributions
 overwhelm-0.367
 overpower-0.325
 confuse-0.321
 overshadow-0.301
secut-0.300
Token the
Token ID262
Feature activation-4.88e-01

Pos logit contributions
+0.002
 Hed+0.002
 Holly+0.002
 Polly+0.002
Pie+0.002
Neg logit contributions
 overwhelm-0.002
 overpower-0.002
 confuse-0.002
 overshadow-0.002
secut-0.002
Token refugee
Token ID12540
Feature activation+4.99e+00

Pos logit contributions
+0.433
 Hed+0.401
 Holly+0.380
 Polly+0.380
Pie+0.379
Neg logit contributions
 overwhelm-0.364
 overpower-0.323
 confuse-0.318
 overshadow-0.299
secut-0.298
Token crisis
Token ID4902
Feature activation-2.17e-01

Pos logit contributions
 overwhelm+3.131
 overpower+2.665
 confuse+2.554
 overshadow+2.394
secut+2.332
Neg logit contributions
-5.168
 Hed-4.634
ertodd-4.587
Pie-4.540
Ton-4.496
Token under
Token ID739
Feature activation+2.21e+00

Pos logit contributions
+0.193
 Hed+0.179
 Holly+0.169
 Polly+0.169
Pie+0.169
Neg logit contributions
 overwhelm-0.161
 overpower-0.142
 confuse-0.140
 overshadow-0.132
secut-0.131
Token control
Token ID1630
Feature activation+5.10e-01

Pos logit contributions
 overwhelm+1.417
 overpower+1.229
 confuse+1.208
 overshadow+1.123
secut+1.116
Neg logit contributions
-2.198
 Hed-2.051
 Holly-1.956
 Polly-1.954
Pie-1.953
Token on
Token ID319
Feature activation+1.10e+00

Pos logit contributions
 overwhelm+0.424
 overpower+0.380
 confuse+0.375
 overshadow+0.355
secut+0.354
Neg logit contributions
-0.409
 Hed-0.376
 Holly-0.354
 Polly-0.354
Pie-0.353
Token a
Token ID257
Feature activation-2.03e-01

Pos logit contributions
 overwhelm+0.815
 overpower+0.722
 confuse+0.711
 overshadow+0.669
secut+0.666
Neg logit contributions
-0.980
 Hed-0.907
 Holly-0.860
 Polly-0.859
Pie-0.859
Token41
Token ID3901
Feature activation+1.89e+00

Pos logit contributions
+0.181
 Hed+0.167
 Holly+0.157
 Polly+0.157
Pie+0.156
Neg logit contributions
 overwhelm-0.181
 overpower-0.162
 confuse-0.160
 overshadow-0.151
secut-0.150
Tokenz
Token ID89
Feature activation-4.27e-01

Pos logit contributions
 overwhelm+1.697
 overpower+1.543
 confuse+1.520
secut+1.463
 overshadow+1.442
Neg logit contributions
-1.368
 Hed-1.236
 Holly-1.146
 Polly-1.142
Ton-1.138
Token BLACK
Token ID31963
Feature activation+5.09e-01

Pos logit contributions
+0.327
 Hed+0.300
 Holly+0.281
 Polly+0.281
Pie+0.280
Neg logit contributions
 overwhelm-0.370
 overpower-0.333
 confuse-0.329
 overshadow-0.313
secut-0.312
TokenFO
Token ID6080
Feature activation+9.01e-01

Pos logit contributions
 overwhelm+0.481
 overpower+0.438
 confuse+0.433
 overshadow+0.413
secut+0.411
Neg logit contributions
-0.348
 Hed-0.315
 Polly-0.293
 Holly-0.293
Pie-0.292
TokenOT
Token ID2394
Feature activation-7.76e-01

Pos logit contributions
 overwhelm+0.903
 overpower+0.829
 confuse+0.820
secut+0.782
 overshadow+0.781
Neg logit contributions
-0.558
 Hed-0.503
 Polly-0.457
 Holly-0.457
ertodd-0.457
Token T
Token ID309
Feature activation+4.66e+00

Pos logit contributions
+0.573
 Hed+0.523
 Holly+0.490
 Polly+0.489
Pie+0.486
Neg logit contributions
 overwhelm-0.694
 overpower-0.628
 confuse-0.621
 overshadow-0.591
secut-0.589
TokenOC
Token ID4503
Feature activation+8.48e-01

Pos logit contributions
 overwhelm+3.993
 overpower+3.624
 confuse+3.570
secut+3.396
rawdownloadcloneembedreportprint+3.357
Neg logit contributions
-3.418
 Hed-3.177
ertodd-2.972
 Polly-2.925
 Holly-2.918
Token AK
Token ID15837
Feature activation+1.81e+00

Pos logit contributions
 overwhelm+0.755
 overpower+0.683
 confuse+0.675
 overshadow+0.640
secut+0.639
Neg logit contributions
-0.630
 Hed-0.572
Pie-0.536
 Holly-0.535
 Polly-0.534
TokenNOW
Token ID45669
Feature activation-2.94e+00

Pos logit contributions
 overwhelm+1.447
 overpower+1.293
 confuse+1.276
 overshadow+1.206
secut+1.199
Neg logit contributions
-1.500
 Hed-1.383
 Holly-1.303
 Polly-1.301
Pie-1.298
TokenLED
Token ID30465
Feature activation-5.70e-01

Pos logit contributions
+1.558
 Hed+1.369
 Holly+1.242
Pie+1.240
 Polly+1.230
Neg logit contributions
 overwhelm-3.260
 overpower-3.012
 confuse-2.988
 overshadow-2.888
secut-2.867
TokenG
Token ID38
Feature activation-6.78e-01

Pos logit contributions
+0.499
 Hed+0.462
 Holly+0.438
Pie+0.438
 Polly+0.437
Neg logit contributions
 overwhelm-0.433
 overpower-0.385
 confuse-0.380
 overshadow-0.359
secut-0.356
Token back
Token ID736
Feature activation+1.63e+00

Pos logit contributions
 overwhelm+0.599
 overpower+0.529
 confuse+0.521
 overshadow+0.489
secut+0.487
Neg logit contributions
-0.741
 Hed-0.686
 Holly-0.650
 Polly-0.649
Pie-0.649
Token typically
Token ID6032
Feature activation+2.20e+00

Pos logit contributions
 overwhelm+1.163
 overpower+1.026
 confuse+1.010
 overshadow+0.946
secut+0.941
Neg logit contributions
-1.505
 Hed-1.393
 Holly-1.323
Pie-1.321
 Polly-1.320
Token in
Token ID287
Feature activation+1.59e+00

Pos logit contributions
 overwhelm+1.525
 overpower+1.341
 confuse+1.319
 overshadow+1.234
secut+1.225
Neg logit contributions
-2.069
 Hed-1.906
Pie-1.817
 Holly-1.812
Ton-1.809
Token a
Token ID257
Feature activation+2.39e-01

Pos logit contributions
 overwhelm+1.156
 overpower+1.023
 confuse+1.008
 overshadow+0.943
secut+0.939
Neg logit contributions
-1.441
 Hed-1.329
Pie-1.261
 Holly-1.260
 Polly-1.255
Token ten
Token ID3478
Feature activation-5.17e-01

Pos logit contributions
 overwhelm+0.176
 overpower+0.156
 confuse+0.154
 overshadow+0.144
secut+0.143
Neg logit contributions
-0.215
 Hed-0.198
Pie-0.187
 Holly-0.187
 Polly-0.187
Token minute
Token ID5664
Feature activation+4.74e+00

Pos logit contributions
+0.434
 Hed+0.400
 Holly+0.377
 Polly+0.376
Pie+0.376
Neg logit contributions
 overwhelm-0.411
 overpower-0.367
 confuse-0.362
 overshadow-0.341
secut-0.341
Token window
Token ID4324
Feature activation+3.17e+00

Pos logit contributions
 overwhelm+3.784
 overpower+3.368
 confuse+3.318
 overshadow+3.092
secut+3.067
Neg logit contributions
-4.035
 Hed-3.546
Ton-3.445
ertodd-3.406
Pie-3.406
Token.
Token ID13
Feature activation+7.52e-01

Pos logit contributions
 overwhelm+2.526
 overpower+2.254
 confuse+2.221
 overshadow+2.107
secut+2.093
Neg logit contributions
-2.663
 Hed-2.427
Pie-2.301
 Holly-2.298
 Polly-2.289
Token
Token ID198
Feature activation+3.90e-01

Pos logit contributions
 overwhelm+0.712
 overpower+0.648
 confuse+0.643
secut+0.612
 overshadow+0.610
Neg logit contributions
-0.508
 Hed-0.452
ertodd-0.424
Pie-0.423
 Holly-0.421
Token
Token ID198
Feature activation+7.54e-02

Pos logit contributions
 overwhelm+0.384
 overpower+0.352
 confuse+0.348
secut+0.333
 overshadow+0.330
Neg logit contributions
-0.250
 Hed-0.221
ertodd-0.204
 Holly-0.203
 Polly-0.203
TokenPath
Token ID15235
Feature activation+1.42e+00

Pos logit contributions
 overwhelm+0.067
 overpower+0.061
 confuse+0.060
secut+0.057
 overshadow+0.057
Neg logit contributions
-0.055
 Hed-0.051
 Holly-0.047
 Polly-0.047
ertodd-0.047
Token pul
Token ID17472
Feature activation-3.98e-01

Pos logit contributions
+0.096
 Hed+0.089
 Holly+0.084
Pie+0.084
 Polly+0.084
Neg logit contributions
 overwhelm-0.090
 overpower-0.080
 confuse-0.079
 overshadow-0.074
secut-0.074
Tokenpy
Token ID9078
Feature activation-3.14e-01

Pos logit contributions
+0.266
 Hed+0.240
 Holly+0.223
 Polly+0.222
Pie+0.222
Neg logit contributions
 overwhelm-0.385
 overpower-0.351
 confuse-0.347
 overshadow-0.332
secut-0.331
Token space
Token ID2272
Feature activation+1.21e+00

Pos logit contributions
+0.257
 Hed+0.236
Pie+0.223
 Holly+0.222
 Polly+0.221
Neg logit contributions
 overwhelm-0.256
 overpower-0.230
 confuse-0.227
 overshadow-0.215
secut-0.213
Token opera
Token ID27296
Feature activation-5.43e-01

Pos logit contributions
 overwhelm+0.870
 overpower+0.766
 confuse+0.753
 overshadow+0.707
secut+0.697
Neg logit contributions
-1.101
 Hed-1.018
Pie-0.968
 Holly-0.967
ertodd-0.964
Token novels
Token ID16122
Feature activation+7.30e-01

Pos logit contributions
+0.424
 Hed+0.389
 Holly+0.365
 Polly+0.364
Pie+0.364
Neg logit contributions
 overwhelm-0.463
 overpower-0.417
 confuse-0.412
 overshadow-0.391
secut-0.389
Token scattered
Token ID16830
Feature activation+5.21e+00

Pos logit contributions
 overwhelm+0.614
 overpower+0.552
 confuse+0.545
 overshadow+0.518
secut+0.513
Neg logit contributions
-0.581
 Hed-0.529
 Holly-0.498
 Polly-0.498
Pie-0.497
Token around
Token ID1088
Feature activation-2.05e-01

Pos logit contributions
 overwhelm+3.218
 overpower+2.757
 confuse+2.694
 overshadow+2.538
secut+2.475
Neg logit contributions
-5.240
 Hed-4.896
ertodd-4.710
Pie-4.674
Ton-4.651
Token crew
Token ID5462
Feature activation-2.28e+00

Pos logit contributions
+0.168
 Hed+0.155
 Holly+0.146
 Polly+0.146
Pie+0.146
Neg logit contributions
 overwhelm-0.167
 overpower-0.149
 confuse-0.147
 overshadow-0.139
secut-0.139
Token quarters
Token ID13620
Feature activation-1.49e+00

Pos logit contributions
+1.953
 Hed+1.826
 Holly+1.728
 Polly+1.726
Pie+1.704
Neg logit contributions
 overwhelm-1.752
 overpower-1.563
 confuse-1.529
 overshadow-1.444
secut-1.439
Token.
Token ID13
Feature activation-1.65e-01

Pos logit contributions
+1.168
 Hed+1.078
 Polly+1.014
 Holly+1.013
Pie+1.007
Neg logit contributions
 overwhelm-1.250
 overpower-1.123
 confuse-1.110
secut-1.051
 overshadow-1.047
Token And
Token ID843
Feature activation+1.69e-01

Pos logit contributions
+0.114
 Hed+0.102
 Holly+0.095
ertodd+0.095
Pie+0.095
Neg logit contributions
 overwhelm-0.155
 overpower-0.140
 confuse-0.139
 overshadow-0.133
secut-0.132
Token 25
Token ID1679
Feature activation-2.40e+00

Pos logit contributions
+0.520
 Hed+0.482
 Holly+0.455
 Polly+0.453
Pie+0.453
Neg logit contributions
 overwhelm-0.508
 overpower-0.455
 confuse-0.449
 overshadow-0.424
secut-0.423
Token to
Token ID284
Feature activation+5.47e-01

Pos logit contributions
+1.837
 Hed+1.689
 Polly+1.582
 Holly+1.582
Pie+1.582
Neg logit contributions
 overwhelm-2.063
 overpower-1.856
 confuse-1.831
 overshadow-1.742
secut-1.738
Token 30
Token ID1542
Feature activation-4.17e-01

Pos logit contributions
 overwhelm+0.417
 overpower+0.370
 confuse+0.366
 overshadow+0.343
secut+0.342
Neg logit contributions
-0.478
 Hed-0.440
 Holly-0.417
 Polly-0.416
Pie-0.416
Token years
Token ID812
Feature activation-1.41e+00

Pos logit contributions
+0.335
 Hed+0.308
 Holly+0.290
 Polly+0.290
Pie+0.289
Neg logit contributions
 overwhelm-0.346
 overpower-0.310
 confuse-0.306
 overshadow-0.290
secut-0.289
Token ago
Token ID2084
Feature activation-1.45e+00

Pos logit contributions
+1.152
 Hed+1.067
 Holly+1.007
 Polly+1.007
Pie+1.000
Neg logit contributions
 overwhelm-1.138
 overpower-1.017
 confuse-1.004
 overshadow-0.948
secut-0.947
Token.
Token ID13
Feature activation+1.24e+00

Pos logit contributions
+1.143
 Hed+1.062
 Holly+1.001
 Polly+1.000
Pie+0.991
Neg logit contributions
 overwhelm-1.210
 overpower-1.085
 confuse-1.075
secut-1.017
 overshadow-1.012
Token The
Token ID383
Feature activation+5.49e-01

Pos logit contributions
 overwhelm+1.132
 overpower+1.027
 confuse+1.016
 overshadow+0.966
secut+0.965
Neg logit contributions
-0.889
 Hed-0.799
 Polly-0.751
 Holly-0.749
ertodd-0.747
Token report
Token ID989
Feature activation-2.30e+00

Pos logit contributions
 overwhelm+0.406
 overpower+0.361
 confuse+0.353
 overshadow+0.332
secut+0.329
Neg logit contributions
-0.493
 Hed-0.451
ertodd-0.430
 Polly-0.429
 Holly-0.429
Token concluded
Token ID8391
Feature activation+5.32e-01

Pos logit contributions
+1.823
 Hed+1.706
 Holly+1.609
 Polly+1.599
 Katz+1.595
Neg logit contributions
 overwhelm-1.905
 overpower-1.708
 confuse-1.688
secut-1.598
 overshadow-1.593
Token that
Token ID326
Feature activation+4.92e-01

Pos logit contributions
 overwhelm+0.449
 overpower+0.404
 confuse+0.399
 overshadow+0.378
secut+0.376
Neg logit contributions
-0.421
 Hed-0.384
 Polly-0.362
 Holly-0.362
Pie-0.360
Token Ike
Token ID34880
Feature activation+5.55e-01

Pos logit contributions
 overwhelm+0.382
 overpower+0.341
 confuse+0.335
 overshadow+0.317
secut+0.315
Neg logit contributions
-0.423
 Hed-0.388
 Holly-0.368
 Polly-0.368
Pie-0.367
Token she
Token ID673
Feature activation+1.54e+00

Pos logit contributions
+0.136
 Hed+0.123
 Holly+0.115
 Polly+0.115
Pie+0.115
Neg logit contributions
 overwhelm-0.169
 overpower-0.154
 confuse-0.152
 overshadow-0.145
secut-0.144
Token said
Token ID531
Feature activation+5.29e-02

Pos logit contributions
 overwhelm+1.320
 overpower+1.190
 confuse+1.178
 overshadow+1.115
secut+1.109
Neg logit contributions
-1.186
 Hed-1.079
Pie-1.014
 Holly-1.013
 Polly-1.009
Token.
Token ID13
Feature activation-1.94e-01

Pos logit contributions
 overwhelm+0.042
 overpower+0.038
 confuse+0.037
 overshadow+0.035
secut+0.035
Neg logit contributions
-0.044
 Hed-0.040
Pie-0.038
 Holly-0.038
 Polly-0.038
Token
Token ID198
Feature activation-7.19e-01

Pos logit contributions
+0.139
 Hed+0.126
 Holly+0.118
 Polly+0.118
Pie+0.117
Neg logit contributions
 overwhelm-0.179
 overpower-0.162
 confuse-0.160
 overshadow-0.153
secut-0.152
Token
Token ID198
Feature activation-6.30e-01

Pos logit contributions
+0.434
 Hed+0.386
 Holly+0.354
 Polly+0.353
Pie+0.351
Neg logit contributions
 overwhelm-0.739
 overpower-0.679
 confuse-0.672
 overshadow-0.642
secut-0.642
TokenThough
Token ID10915
Feature activation+1.55e+00

Pos logit contributions
+0.436
 Hed+0.396
 Holly+0.368
 Polly+0.368
Pie+0.365
Neg logit contributions
 overwhelm-0.591
 overpower-0.538
 confuse-0.532
 overshadow-0.507
secut-0.506
Token she
Token ID673
Feature activation+2.63e+00

Pos logit contributions
 overwhelm+1.335
 overpower+1.207
 confuse+1.194
 overshadow+1.136
secut+1.128
Neg logit contributions
-1.196
 Hed-1.076
Pie-1.014
ertodd-1.011
 Polly-1.008
Token misses
Token ID18297
Feature activation-4.68e-01

Pos logit contributions
 overwhelm+1.981
 overpower+1.764
 confuse+1.740
 overshadow+1.636
secut+1.615
Neg logit contributions
-2.324
 Hed-2.130
Pie-2.028
 Holly-2.019
 Polly-2.012
Token the
Token ID262
Feature activation+1.79e-01

Pos logit contributions
+0.380
 Hed+0.349
 Holly+0.329
 Polly+0.328
Pie+0.328
Neg logit contributions
 overwhelm-0.384
 overpower-0.344
 confuse-0.340
 overshadow-0.322
secut-0.320
Token "
Token ID366
Feature activation-1.77e+00

Pos logit contributions
 overwhelm+0.142
 overpower+0.127
 confuse+0.125
 overshadow+0.118
secut+0.116
Neg logit contributions
-0.152
 Hed-0.139
Pie-0.132
 Holly-0.131
 Polly-0.131
Tokenint
Token ID600
Feature activation-2.07e+00

Pos logit contributions
+1.324
 Hed+1.208
Pie+1.135
 Holly+1.134
 Polly+1.131
Neg logit contributions
 overwhelm-1.565
 overpower-1.415
 confuse-1.398
 overshadow-1.331
secut-1.319
Token her
Token ID607
Feature activation+7.61e-01

Pos logit contributions
+0.166
 Hed+0.155
 Holly+0.147
 Polly+0.147
 Katz+0.145
Neg logit contributions
 overwhelm-0.142
 overpower-0.126
 confuse-0.124
 overshadow-0.117
secut-0.117
Token.
Token ID13
Feature activation-1.06e+00

Pos logit contributions
 overwhelm+0.600
 overpower+0.536
 confuse+0.529
 overshadow+0.500
secut+0.498
Neg logit contributions
-0.638
 Hed-0.591
 Holly-0.559
 Polly-0.559
Pie-0.555
Token
Token ID198
Feature activation-9.06e-01

Pos logit contributions
+0.805
 Hed+0.751
 Holly+0.706
 Polly+0.705
 Katz+0.698
Neg logit contributions
 overwhelm-0.916
 overpower-0.828
 confuse-0.815
 overshadow-0.777
secut-0.774
Token
Token ID198
Feature activation-4.71e-01

Pos logit contributions
+0.554
 Hed+0.491
 Holly+0.452
 Polly+0.450
Pie+0.448
Neg logit contributions
 overwhelm-0.925
 overpower-0.849
 confuse-0.840
secut-0.803
 overshadow-0.803
TokenM
Token ID44
Feature activation-1.47e+00

Pos logit contributions
+0.330
 Hed+0.299
Pie+0.280
 Holly+0.279
 Polly+0.279
Neg logit contributions
 overwhelm-0.439
 overpower-0.399
 confuse-0.394
 overshadow-0.377
secut-0.374
Tokenachel
Token ID9636
Feature activation+1.03e+00

Pos logit contributions
+1.147
 Hed+1.053
 Holly+0.990
 Polly+0.987
Pie+0.985
Neg logit contributions
 overwhelm-1.249
 overpower-1.124
 confuse-1.111
 overshadow-1.054
secut-1.050
Tokenle
Token ID293
Feature activation+1.16e+00

Pos logit contributions
 overwhelm+0.865
 overpower+0.778
 confuse+0.769
 overshadow+0.728
secut+0.725
Neg logit contributions
-0.813
 Hed-0.745
 Holly-0.701
 Polly-0.699
Pie-0.699
Token expresses
Token ID27505
Feature activation-5.20e-01

Pos logit contributions
 overwhelm+0.995
 overpower+0.897
 confuse+0.887
 overshadow+0.839
secut+0.837
Neg logit contributions
-0.895
 Hed-0.815
 Holly-0.766
 Polly-0.765
Pie-0.765
Token that
Token ID326
Feature activation+1.40e+00

Pos logit contributions
+0.432
 Hed+0.402
 Polly+0.381
 Holly+0.381
 Katz+0.376
Neg logit contributions
 overwhelm-0.412
 overpower-0.368
 confuse-0.363
secut-0.345
 overshadow-0.345
Token,
Token ID11
Feature activation-1.00e+00

Pos logit contributions
 overwhelm+1.117
 overpower+0.998
 confuse+0.985
 overshadow+0.930
secut+0.927
Neg logit contributions
-1.163
 Hed-1.072
 Holly-1.012
 Polly-1.010
Pie-1.008
Token
Token ID564
Feature activation-9.08e-01

Pos logit contributions
+0.839
 Hed+0.788
 Polly+0.744
 Holly+0.743
 Katz+0.736
Neg logit contributions
 overwhelm-0.781
 overpower-0.696
 confuse-0.686
secut-0.652
 overshadow-0.646
Tokenuments
Token ID2886
Feature activation+1.24e+00

Pos logit contributions
 overwhelm+1.708
 overpower+1.534
 confuse+1.519
 overshadow+1.433
secut+1.429
Neg logit contributions
-1.615
 Hed-1.469
 Holly-1.385
Pie-1.385
 Polly-1.381
Token
Token ID784
Feature activation+4.58e-01

Pos logit contributions
 overwhelm+1.023
 overpower+0.917
 confuse+0.906
 overshadow+0.856
secut+0.850
Neg logit contributions
-1.002
 Hed-0.913
 Holly-0.861
Pie-0.860
 Polly-0.859
Token H
Token ID367
Feature activation+1.47e+00

Pos logit contributions
 overwhelm+0.383
 overpower+0.345
 confuse+0.340
 overshadow+0.322
secut+0.321
Neg logit contributions
-0.364
 Hed-0.334
 Holly-0.314
 Polly-0.313
Pie-0.313
Tokenashes
Token ID7465
Feature activation+4.93e-02

Pos logit contributions
 overwhelm+1.190
 overpower+1.066
 confuse+1.051
 overshadow+0.993
secut+0.990
Neg logit contributions
-1.206
 Hed-1.111
 Holly-1.048
 Polly-1.047
Pie-1.044
Token And
Token ID843
Feature activation+3.97e+00

Pos logit contributions
 overwhelm+0.042
 overpower+0.037
 confuse+0.037
 overshadow+0.035
secut+0.035
Neg logit contributions
-0.039
 Hed-0.036
 Holly-0.034
 Polly-0.034
Pie-0.034
Token Block
Token ID9726
Feature activation+1.75e+00

Pos logit contributions
 overwhelm+3.407
 overpower+3.075
 confuse+3.040
secut+2.864
 overshadow+2.856
Neg logit contributions
-3.068
 Hed-2.689
ertodd-2.597
Pie-2.574
Ton-2.552
Token Basics
Token ID45884
Feature activation+3.39e+00

Pos logit contributions
 overwhelm+1.551
 overpower+1.401
 confuse+1.386
 overshadow+1.317
secut+1.306
Neg logit contributions
-1.314
 Hed-1.180
 Polly-1.109
 Holly-1.103
Pie-1.103
Token.
Token ID13
Feature activation+1.05e+00

Pos logit contributions
 overwhelm+2.753
 overpower+2.459
 confuse+2.441
 overshadow+2.302
secut+2.254
Neg logit contributions
-2.786
 Hed-2.501
ertodd-2.374
Ton-2.373
Pie-2.365
Token I
Token ID314
Feature activation-2.23e+00

Pos logit contributions
 overwhelm+0.919
 overpower+0.829
 confuse+0.821
 overshadow+0.779
secut+0.774
Neg logit contributions
-0.791
 Hed-0.714
Pie-0.672
 Polly-0.669
 Holly-0.668
Token mentioned
Token ID4750
Feature activation-2.17e+00

Pos logit contributions
+1.881
 Hed+1.757
 Holly+1.663
 Polly+1.659
 Katz+1.654
Neg logit contributions
 overwhelm-1.719
 overpower-1.533
 confuse-1.504
secut-1.442
 overshadow-1.427
Token that
Token ID326
Feature activation-1.21e+00

Pos logit contributions
+1.752
 Hed+1.647
 Holly+1.559
 Polly+1.554
 Katz+1.547
Neg logit contributions
 overwhelm-1.739
 overpower-1.555
 confuse-1.542
secut-1.470
 overshadow-1.457
Token were
Token ID547
Feature activation-1.73e+00

Pos logit contributions
+1.113
 Hed+1.039
 Holly+0.987
 Polly+0.986
Pie+0.980
Neg logit contributions
 overwhelm-0.847
 overpower-0.746
 confuse-0.734
secut-0.690
 overshadow-0.686
Token played
Token ID2826
Feature activation-1.28e-02

Pos logit contributions
+1.617
 Hed+1.524
 Polly+1.453
 Holly+1.452
 Katz+1.441
Neg logit contributions
 overwhelm-1.186
 overpower-1.031
 confuse-1.023
secut-0.959
 overshadow-0.946
Token on
Token ID319
Feature activation-3.35e-01

Pos logit contributions
+0.011
 Hed+0.011
 Holly+0.010
 Polly+0.010
Pie+0.010
Neg logit contributions
 overwhelm-0.010
 overpower-0.008
 confuse-0.008
 overshadow-0.008
secut-0.008
Token a
Token ID257
Feature activation-7.84e-01

Pos logit contributions
+0.277
 Hed+0.257
 Holly+0.243
 Polly+0.242
 Katz+0.240
Neg logit contributions
 overwhelm-0.269
 overpower-0.240
 confuse-0.237
 overshadow-0.224
secut-0.223
Token half
Token ID2063
Feature activation+1.50e+00

Pos logit contributions
+0.689
 Hed+0.643
 Holly+0.610
 Polly+0.609
 Katz+0.605
Neg logit contributions
 overwhelm-0.588
 overpower-0.519
 confuse-0.514
 overshadow-0.482
secut-0.480
Token-
Token ID12
Feature activation+1.00e+00

Pos logit contributions
 overwhelm+1.149
 overpower+1.025
 confuse+1.007
 overshadow+0.950
secut+0.950
Neg logit contributions
-1.303
 Hed-1.204
 Holly-1.138
 Polly-1.136
Pie-1.135
Tokencourt
Token ID22230
Feature activation+2.28e+00

Pos logit contributions
 overwhelm+0.744
 overpower+0.659
 confuse+0.649
secut+0.611
 overshadow+0.611
Neg logit contributions
-0.893
 Hed-0.829
 Holly-0.786
 Polly-0.785
Pie-0.782
Token in
Token ID287
Feature activation-5.28e-01

Pos logit contributions
 overwhelm+1.611
 overpower+1.420
 confuse+1.392
 overshadow+1.310
secut+1.303
Neg logit contributions
-2.113
 Hed-1.959
 Holly-1.867
Pie-1.861
 Polly-1.860
Token the
Token ID262
Feature activation-8.68e-01

Pos logit contributions
+0.444
 Hed+0.411
 Holly+0.389
 Polly+0.388
 Katz+0.386
Neg logit contributions
 overwhelm-0.419
 overpower-0.373
 confuse-0.369
 overshadow-0.348
secut-0.346
Token yard
Token ID12699
Feature activation+8.88e-01

Pos logit contributions
+0.721
 Hed+0.668
 Holly+0.632
 Polly+0.630
 Katz+0.627
Neg logit contributions
 overwhelm-0.694
 overpower-0.618
 confuse-0.613
 overshadow-0.577
secut-0.576
Token,
Token ID11
Feature activation-5.96e-01

Pos logit contributions
 overwhelm+0.730
 overpower+0.655
 confuse+0.647
 overshadow+0.612
secut+0.609
Neg logit contributions
-0.719
 Hed-0.662
 Holly-0.623
 Polly-0.622
Pie-0.621
Token I
Token ID314
Feature activation-6.55e-01

Pos logit contributions
+0.691
 Hed+0.643
 Holly+0.607
 Polly+0.606
 Katz+0.602
Neg logit contributions
 overwhelm-0.674
 overpower-0.603
 confuse-0.597
secut-0.563
 overshadow-0.562
Token wanted
Token ID2227
Feature activation+1.38e+00

Pos logit contributions
+0.588
 Hed+0.546
 Holly+0.516
 Polly+0.516
Pie+0.515
Neg logit contributions
 overwhelm-0.480
 overpower-0.425
 confuse-0.418
 overshadow-0.393
secut-0.391
Token to
Token ID284
Feature activation+1.27e+00

Pos logit contributions
 overwhelm+1.095
 overpower+0.973
 confuse+0.958
 overshadow+0.907
secut+0.899
Neg logit contributions
-1.161
 Hed-1.064
 Holly-1.005
Pie-1.005
 Polly-1.004
Token get
Token ID651
Feature activation+2.45e-01

Pos logit contributions
 overwhelm+0.989
 overpower+0.879
 confuse+0.869
 overshadow+0.819
secut+0.807
Neg logit contributions
-1.094
 Hed-1.006
Pie-0.952
 Holly-0.952
 Polly-0.949
Token an
Token ID281
Feature activation-3.97e-01

Pos logit contributions
 overwhelm+0.187
 overpower+0.166
 confuse+0.164
 overshadow+0.155
secut+0.154
Neg logit contributions
-0.212
 Hed-0.196
 Holly-0.185
 Polly-0.185
Pie-0.185
Token accurate
Token ID7187
Feature activation-1.69e-01

Pos logit contributions
+0.340
 Hed+0.314
 Holly+0.297
 Polly+0.296
Pie+0.296
Neg logit contributions
 overwhelm-0.309
 overpower-0.275
 confuse-0.271
 overshadow-0.256
secut-0.254
Token picture
Token ID4286
Feature activation+1.54e+00

Pos logit contributions
+0.135
 Hed+0.124
 Holly+0.117
Pie+0.116
 Polly+0.116
Neg logit contributions
 overwhelm-0.142
 overpower-0.127
 confuse-0.126
 overshadow-0.119
secut-0.118
Token of
Token ID286
Feature activation+1.11e+00

Pos logit contributions
 overwhelm+1.171
 overpower+1.041
 confuse+1.025
 overshadow+0.968
secut+0.959
Neg logit contributions
-1.337
 Hed-1.234
 Holly-1.169
Pie-1.168
 Polly-1.166
Token its
Token ID663
Feature activation+1.71e-01

Pos logit contributions
 overwhelm+0.811
 overpower+0.717
 confuse+0.707
 overshadow+0.663
secut+0.660
Neg logit contributions
-0.998
 Hed-0.926
 Holly-0.878
 Polly-0.877
Pie-0.875
Token demographics
Token ID30084
Feature activation+8.95e-01

Pos logit contributions
 overwhelm+0.131
 overpower+0.116
 confuse+0.114
 overshadow+0.108
secut+0.107
Neg logit contributions
-0.149
 Hed-0.137
 Holly-0.130
Pie-0.130
 Polly-0.130
Token and
Token ID290
Feature activation+9.73e-01

Pos logit contributions
 overwhelm+0.758
 overpower+0.682
 confuse+0.673
 overshadow+0.639
secut+0.635
Neg logit contributions
-0.704
 Hed-0.645
 Holly-0.607
 Polly-0.605
Pie-0.604
Token to
Token ID284
Feature activation+6.82e-01

Pos logit contributions
 overwhelm+0.917
 overpower+0.816
 confuse+0.805
 overshadow+0.761
secut+0.757
Neg logit contributions
-1.003
 Hed-0.914
ertodd-0.868
 Katz-0.862
Pie-0.862
Token keep
Token ID1394
Feature activation-3.78e-01

Pos logit contributions
 overwhelm+0.529
 overpower+0.471
 confuse+0.465
 overshadow+0.438
secut+0.435
Neg logit contributions
-0.587
 Hed-0.541
 Holly-0.512
 Polly-0.511
Pie-0.511
Token this
Token ID428
Feature activation+8.14e-01

Pos logit contributions
+0.317
 Hed+0.289
Pie+0.273
 Holly+0.273
 Polly+0.273
Neg logit contributions
 overwhelm-0.302
 overpower-0.270
 confuse-0.267
 overshadow-0.252
secut-0.251
Token contest
Token ID8414
Feature activation+3.98e-01

Pos logit contributions
 overwhelm+0.648
 overpower+0.579
 confuse+0.572
 overshadow+0.540
secut+0.535
Neg logit contributions
-0.683
 Hed-0.627
Pie-0.591
 Holly-0.591
 Polly-0.590
Token in
Token ID287
Feature activation-1.30e-01

Pos logit contributions
 overwhelm+0.289
 overpower+0.255
 confuse+0.252
 overshadow+0.236
secut+0.235
Neg logit contributions
-0.361
 Hed-0.334
 Holly-0.317
Pie-0.317
 Polly-0.316
Token-
Token ID12
Feature activation-1.21e-01

Pos logit contributions
+0.115
 Hed+0.106
Pie+0.100
 Holly+0.100
 Polly+0.100
Neg logit contributions
 overwhelm-0.098
 overpower-0.087
 confuse-0.085
 overshadow-0.080
secut-0.080
Tokenun
Token ID403
Feature activation+2.76e-01

Pos logit contributions
+0.093
 Hed+0.086
 Holly+0.080
 Polly+0.080
Pie+0.080
Neg logit contributions
 overwhelm-0.104
 overpower-0.093
 confuse-0.092
 overshadow-0.087
secut-0.087
Tokeniverse
Token ID3997
Feature activation-9.92e-02

Pos logit contributions
 overwhelm+0.182
 overpower+0.159
 confuse+0.156
secut+0.146
 overshadow+0.145
Neg logit contributions
-0.268
 Hed-0.250
 Holly-0.238
 Polly-0.238
Pie-0.237
Token.
Token ID13
Feature activation-2.73e-01

Pos logit contributions
+0.083
 Hed+0.076
 Holly+0.072
 Polly+0.072
Pie+0.072
Neg logit contributions
 overwhelm-0.079
 overpower-0.071
 confuse-0.070
 overshadow-0.066
secut-0.066
TokenIf
Token ID1532
Feature activation-1.20e+00

Pos logit contributions
+0.210
 Hed+0.193
 Holly+0.181
 Polly+0.181
Pie+0.180
Neg logit contributions
 overwhelm-0.234
 overpower-0.211
 confuse-0.209
 overshadow-0.198
secut-0.197
Token you
Token ID345
Feature activation-1.10e+00

Pos logit contributions
+1.038
 Hed+0.963
 Holly+0.913
 Polly+0.911
Pie+0.907
Neg logit contributions
 overwhelm-0.918
 overpower-0.816
 confuse-0.806
 overshadow-0.757
secut-0.756
Token
Token ID447
Feature activation-1.38e-01

Pos logit contributions
+1.373
 Hed+1.288
 Holly+1.216
 Polly+1.213
Pie+1.211
Neg logit contributions
 overwhelm-1.190
 overpower-1.056
 confuse-1.041
secut-0.981
 overshadow-0.979
Token
Token ID247
Feature activation-1.12e+00

Pos logit contributions
+0.128
 Hed+0.119
 Holly+0.113
 Polly+0.113
Pie+0.113
Neg logit contributions
 overwhelm-0.097
 overpower-0.085
 confuse-0.084
 overshadow-0.079
secut-0.078
Tokens
Token ID82
Feature activation-5.10e-02

Pos logit contributions
+0.872
 Hed+0.797
 Holly+0.749
 Polly+0.748
Pie+0.746
Neg logit contributions
 overwhelm-0.957
 overpower-0.862
 confuse-0.852
 overshadow-0.808
secut-0.805
Token Day
Token ID3596
Feature activation-1.35e+00

Pos logit contributions
+0.042
 Hed+0.038
 Polly+0.036
Pie+0.036
 Holly+0.036
Neg logit contributions
 overwhelm-0.042
 overpower-0.037
 confuse-0.037
 overshadow-0.035
secut-0.035
Token.
Token ID13
Feature activation-4.43e-01

Pos logit contributions
+1.064
 Hed+0.980
 Holly+0.923
 Polly+0.921
Pie+0.918
Neg logit contributions
 overwhelm-1.128
 overpower-1.014
 confuse-1.002
 overshadow-0.949
secut-0.947
Token
Token ID198
Feature activation-6.38e-01

Pos logit contributions
+0.298
 Hed+0.268
 Holly+0.249
 Polly+0.249
Pie+0.249
Neg logit contributions
 overwhelm-0.425
 overpower-0.387
 confuse-0.384
 overshadow-0.366
secut-0.365
Token
Token ID198
Feature activation-9.32e-01

Pos logit contributions
+0.395
 Hed+0.350
 Holly+0.322
 Polly+0.321
ertodd+0.319
Neg logit contributions
 overwhelm-0.645
 overpower-0.592
 confuse-0.586
secut-0.560
 overshadow-0.559
TokenFor
Token ID1890
Feature activation-1.50e+00

Pos logit contributions
+0.657
 Hed+0.598
 Holly+0.558
 Polly+0.557
Pie+0.552
Neg logit contributions
 overwhelm-0.863
 overpower-0.783
 confuse-0.775
 overshadow-0.738
secut-0.736
Token 45
Token ID4153
Feature activation-2.43e+00

Pos logit contributions
+1.267
 Hed+1.184
 Holly+1.118
 Polly+1.116
 Katz+1.108
Neg logit contributions
 overwhelm-1.176
 overpower-1.046
 confuse-1.034
 overshadow-0.975
secut-0.974
Token years
Token ID812
Feature activation+8.27e-01

Pos logit contributions
+1.939
 Hed+1.780
 Holly+1.683
 Polly+1.678
Pie+1.674
Neg logit contributions
 overwhelm-2.020
 overpower-1.815
 confuse-1.788
 overshadow-1.698
secut-1.678
Token,
Token ID11
Feature activation-8.66e-01

Pos logit contributions
 overwhelm+0.700
 overpower+0.630
 confuse+0.623
 overshadow+0.592
secut+0.589
Neg logit contributions
-0.648
 Hed-0.592
Pie-0.558
 Holly-0.557
 Polly-0.556
Token he
Token ID339
Feature activation+8.05e-01

Pos logit contributions
 overwhelm+1.133
 overpower+1.023
 confuse+0.997
 overshadow+0.952
secut+0.939
Neg logit contributions
-1.188
 Hed-1.066
Pie-1.018
Ton-1.018
ertodd-1.013
Token went
Token ID1816
Feature activation+1.37e+00

Pos logit contributions
 overwhelm+0.611
 overpower+0.547
 confuse+0.535
 overshadow+0.506
secut+0.499
Neg logit contributions
-0.710
 Hed-0.648
Pie-0.618
Ton-0.614
 Holly-0.614
Token down
Token ID866
Feature activation+6.75e-02

Pos logit contributions
 overwhelm+0.982
 overpower+0.875
 confuse+0.850
 overshadow+0.813
secut+0.786
Neg logit contributions
-1.276
 Hed-1.163
Pie-1.114
Ton-1.110
ertodd-1.106
Token,
Token ID11
Feature activation+2.07e+00

Pos logit contributions
 overwhelm+0.056
 overpower+0.050
 confuse+0.050
 overshadow+0.047
secut+0.047
Neg logit contributions
-0.055
 Hed-0.050
Pie-0.047
 Polly-0.047
Ton-0.047
Token didn
Token ID1422
Feature activation+4.75e-01

Pos logit contributions
 overwhelm+1.529
 overpower+1.373
 confuse+1.328
 overshadow+1.269
secut+1.241
Neg logit contributions
-1.862
 Hed-1.680
Pie-1.619
Ton-1.608
ertodd-1.605
Token't
Token ID470
Feature activation-1.34e+00

Pos logit contributions
 overwhelm+0.383
 overpower+0.343
 confuse+0.338
 overshadow+0.319
secut+0.319
Neg logit contributions
-0.393
 Hed-0.361
 Holly-0.341
 Polly-0.340
Pie-0.339
Token have
Token ID423
Feature activation-6.40e-01

Pos logit contributions
+1.149
 Hed+1.080
 Holly+1.022
 Polly+1.020
 Katz+1.012
Neg logit contributions
 overwhelm-1.003
 overpower-0.888
 confuse-0.879
secut-0.838
 overshadow-0.823
Token anything
Token ID1997
Feature activation+7.48e-01

Pos logit contributions
+0.560
 Hed+0.515
Pie+0.488
 Holly+0.486
 Polly+0.486
Neg logit contributions
 overwhelm-0.488
 overpower-0.434
 confuse-0.426
 overshadow-0.403
secut-0.398
Token more
Token ID517
Feature activation-9.24e-01

Pos logit contributions
 overwhelm+0.584
 overpower+0.523
 confuse+0.512
 overshadow+0.487
secut+0.476
Neg logit contributions
-0.645
 Hed-0.589
Pie-0.560
 Polly-0.557
 Holly-0.556
Token serious
Token ID2726
Feature activation-2.14e-01

Pos logit contributions
+0.776
 Hed+0.716
 Holly+0.676
 Polly+0.675
Pie+0.674
Neg logit contributions
 overwhelm-0.732
 overpower-0.654
 confuse-0.645
 overshadow-0.609
secut-0.606
Token than
Token ID621
Feature activation-1.93e+00

Pos logit contributions
+0.181
 Hed+0.167
Pie+0.158
 Holly+0.158
 Polly+0.157
Neg logit contributions
 overwhelm-0.168
 overpower-0.150
 confuse-0.148
 overshadow-0.140
secut-0.138
Token definitive
Token ID17347
Feature activation+1.29e+00

Pos logit contributions
+0.239
 Hed+0.221
 Polly+0.209
 Holly+0.208
Pie+0.206
Neg logit contributions
 overwhelm-0.263
 overpower-0.235
 confuse-0.233
 overshadow-0.221
secut-0.221
Token.
Token ID13
Feature activation+8.17e-01

Pos logit contributions
 overwhelm+0.953
 overpower+0.843
 confuse+0.831
 overshadow+0.781
secut+0.776
Neg logit contributions
-1.162
 Hed-1.076
 Holly-1.019
 Polly-1.018
Pie-1.017
Token
Token ID447
Feature activation-4.42e-01

Pos logit contributions
 overwhelm+0.800
 overpower+0.729
 confuse+0.723
 overshadow+0.691
secut+0.690
Neg logit contributions
-0.530
 Hed-0.472
 Holly-0.439
Pie-0.438
 Polly-0.438
Token
Token ID251
Feature activation+1.62e+00

Pos logit contributions
+0.379
 Hed+0.351
 Holly+0.331
 Polly+0.331
Pie+0.330
Neg logit contributions
 overwhelm-0.343
 overpower-0.305
 confuse-0.301
 overshadow-0.283
secut-0.282
Token
Token ID198
Feature activation+2.01e-01

Pos logit contributions
 overwhelm+1.605
 overpower+1.466
 confuse+1.453
 overshadow+1.389
secut+1.387
Neg logit contributions
-1.027
 Hed-0.913
Pie-0.848
 Holly-0.845
 Polly-0.844
Token
Token ID198
Feature activation-3.15e-01

Pos logit contributions
 overwhelm+0.200
 overpower+0.183
 confuse+0.181
secut+0.173
 overshadow+0.172
Neg logit contributions
-0.128
 Hed-0.112
 Holly-0.104
ertodd-0.103
 Polly-0.103
TokenIn
Token ID818
Feature activation-7.84e-02

Pos logit contributions
+0.242
 Hed+0.223
 Holly+0.209
 Polly+0.209
 Katz+0.207
Neg logit contributions
 overwhelm-0.271
 overpower-0.244
 confuse-0.241
 overshadow-0.228
secut-0.228
Token the
Token ID262
Feature activation-1.27e-01

Pos logit contributions
+0.066
 Hed+0.061
 Holly+0.057
 Polly+0.057
Pie+0.057
Neg logit contributions
 overwhelm-0.062
 overpower-0.056
 confuse-0.055
 overshadow-0.052
secut-0.052
Token interim
Token ID19303
Feature activation-9.20e-02

Pos logit contributions
+0.107
 Hed+0.099
Pie+0.093
 Holly+0.093
 Polly+0.093
Neg logit contributions
 overwhelm-0.099
 overpower-0.089
 confuse-0.087
 overshadow-0.083
secut-0.082
Token,
Token ID11
Feature activation-7.03e-01

Pos logit contributions
+0.071
 Hed+0.065
Pie+0.061
 Holly+0.061
 Polly+0.061
Neg logit contributions
 overwhelm-0.079
 overpower-0.071
 confuse-0.071
 overshadow-0.067
secut-0.066
Token this
Token ID428
Feature activation-1.02e-01

Pos logit contributions
+0.587
 Hed+0.542
 Holly+0.512
 Polly+0.511
Pie+0.510
Neg logit contributions
 overwhelm-0.560
 overpower-0.500
 confuse-0.493
 overshadow-0.466
secut-0.464
Token held
Token ID2714
Feature activation-1.54e-03

Pos logit contributions
+0.079
 Hed+0.074
 Holly+0.070
 Polly+0.070
Pie+0.069
Neg logit contributions
 overwhelm-0.066
 overpower-0.058
 confuse-0.057
 overshadow-0.054
secut-0.053
Token until
Token ID1566
Feature activation-2.44e-02

Pos logit contributions
+0.001
 Hed+0.001
 Holly+0.001
 Polly+0.001
Pie+0.001
Neg logit contributions
 overwhelm-0.001
 overpower-0.001
 confuse-0.001
 overshadow-0.001
secut-0.001
Token the
Token ID262
Feature activation-3.53e-01

Pos logit contributions
+0.021
 Hed+0.019
 Holly+0.018
Pie+0.018
 Polly+0.018
Neg logit contributions
 overwhelm-0.019
 overpower-0.017
 confuse-0.017
 overshadow-0.016
secut-0.016
Token facts
Token ID6419
Feature activation+2.91e-01

Pos logit contributions
+0.306
 Hed+0.281
 Holly+0.267
Pie+0.266
 Polly+0.266
Neg logit contributions
 overwhelm-0.272
 overpower-0.243
 confuse-0.239
 overshadow-0.225
secut-0.224
Token are
Token ID389
Feature activation-2.01e+00

Pos logit contributions
 overwhelm+0.218
 overpower+0.193
 confuse+0.191
secut+0.181
 overshadow+0.179
Neg logit contributions
-0.253
 Hed-0.236
 Polly-0.224
 Holly-0.223
 Katz-0.222
Token sufficiently
Token ID17338
Feature activation-4.32e+00

Pos logit contributions
+1.837
 Hed+1.738
 Polly+1.651
 Holly+1.651
 Katz+1.644
Neg logit contributions
 overwhelm-1.399
 confuse-1.213
 overpower-1.207
secut-1.138
 overshadow-1.124
Token clear
Token ID1598
Feature activation-9.31e-01

Pos logit contributions
+3.920
 Hed+3.771
 Polly+3.574
 Holly+3.570
 Katz+3.566
Neg logit contributions
 overwhelm-2.924
 confuse-2.527
 overpower-2.480
secut-2.394
 overshadow-2.326
Token for
Token ID329
Feature activation-4.32e-01

Pos logit contributions
+0.731
 Hed+0.675
 Polly+0.632
 Holly+0.632
Pie+0.628
Neg logit contributions
 overwhelm-0.785
 overpower-0.705
 confuse-0.698
 overshadow-0.660
secut-0.659
Token the
Token ID262
Feature activation+6.37e-01

Pos logit contributions
+0.355
 Hed+0.328
 Holly+0.309
 Polly+0.309
Pie+0.308
Neg logit contributions
 overwhelm-0.350
 overpower-0.313
 confuse-0.309
 overshadow-0.292
secut-0.291
Token North
Token ID2258
Feature activation+1.16e+00

Pos logit contributions
 overwhelm+0.474
 overpower+0.420
 confuse+0.414
 overshadow+0.390
secut+0.388
Neg logit contributions
-0.568
 Hed-0.524
Pie-0.497
 Holly-0.497
 Polly-0.496
Token Korean
Token ID6983
Feature activation+2.26e+00

Pos logit contributions
 overwhelm+0.633
 overpower+0.534
 confuse+0.524
 overshadow+0.479
secut+0.476
Neg logit contributions
-1.256
 Hed-1.181
 Holly-1.132
 Polly-1.130
Pie-1.130
Token in
Token ID287
Feature activation-8.25e-01

Pos logit contributions
+0.320
 Hed+0.297
 Holly+0.280
Pie+0.279
 Polly+0.279
Neg logit contributions
 overwhelm-0.301
 overpower-0.268
 confuse-0.265
 overshadow-0.249
secut-0.249
Token suicide
Token ID7341
Feature activation-1.35e+00

Pos logit contributions
+0.656
 Hed+0.605
 Holly+0.569
 Polly+0.568
Pie+0.565
Neg logit contributions
 overwhelm-0.689
 overpower-0.619
 confuse-0.612
 overshadow-0.579
secut-0.578
Token rates
Token ID3965
Feature activation-8.22e-01

Pos logit contributions
+1.079
 Hed+1.006
 Holly+0.946
 Polly+0.942
 Katz+0.936
Neg logit contributions
 overwhelm-1.102
 overpower-0.989
 confuse-0.980
secut-0.926
 overshadow-0.923
Token among
Token ID1871
Feature activation+1.78e-01

Pos logit contributions
+0.712
 Hed+0.661
 Holly+0.625
 Polly+0.624
Pie+0.622
Neg logit contributions
 overwhelm-0.627
 overpower-0.558
 confuse-0.550
 overshadow-0.518
secut-0.517
Token persons
Token ID6506
Feature activation-5.59e-01

Pos logit contributions
 overwhelm+0.132
 overpower+0.117
 confuse+0.115
 overshadow+0.108
secut+0.108
Neg logit contributions
-0.159
 Hed-0.147
 Holly-0.139
 Polly-0.139
Pie-0.139
Token ages
Token ID9337
Feature activation-4.58e+00

Pos logit contributions
+0.455
 Hed+0.420
 Holly+0.397
 Polly+0.395
Pie+0.393
Neg logit contributions
 overwhelm-0.456
 overpower-0.409
 confuse-0.404
 overshadow-0.381
secut-0.381
Token 10
Token ID838
Feature activation+3.66e-01

Pos logit contributions
+3.394
 Hed+3.220
 Holly+3.035
 Polly+3.007
 Katz+2.998
Neg logit contributions
 overwhelm-3.901
 overpower-3.505
 confuse-3.472
secut-3.343
 overshadow-3.310
Token years
Token ID812
Feature activation-5.17e-01

Pos logit contributions
 overwhelm+0.309
 overpower+0.277
 confuse+0.274
 overshadow+0.260
secut+0.258
Neg logit contributions
-0.290
 Hed-0.265
 Polly-0.250
Pie-0.250
 Holly-0.249
Token and
Token ID290
Feature activation-6.57e-01

Pos logit contributions
+0.389
 Hed+0.355
 Holly+0.333
 Polly+0.332
Pie+0.331
Neg logit contributions
 overwhelm-0.456
 overpower-0.412
 confuse-0.407
 overshadow-0.387
secut-0.386
Token older
Token ID4697
Feature activation-4.83e-01

Pos logit contributions
+0.573
 Hed+0.539
 Holly+0.513
 Polly+0.511
 Katz+0.508
Neg logit contributions
 overwhelm-0.488
 overpower-0.432
 confuse-0.427
secut-0.403
 overshadow-0.398
Token,
Token ID11
Feature activation+4.25e-01

Pos logit contributions
+0.405
 Hed+0.373
 Holly+0.353
 Polly+0.352
Pie+0.351
Neg logit contributions
 overwhelm-0.383
 overpower-0.342
 confuse-0.337
 overshadow-0.319
secut-0.317
Token him
Token ID683
Feature activation+1.15e+00

Pos logit contributions
+0.434
 Hed+0.400
 Holly+0.377
 Polly+0.376
Pie+0.374
Neg logit contributions
 overwhelm-0.443
 overpower-0.397
 confuse-0.393
 overshadow-0.372
secut-0.370
Token.
Token ID13
Feature activation+6.09e-01

Pos logit contributions
 overwhelm+0.912
 overpower+0.815
 confuse+0.803
 overshadow+0.761
secut+0.756
Neg logit contributions
-0.961
 Hed-0.887
Pie-0.837
 Holly-0.837
 Polly-0.835
Token I
Token ID314
Feature activation-2.51e-01

Pos logit contributions
 overwhelm+0.528
 overpower+0.476
 confuse+0.470
 overshadow+0.446
secut+0.445
Neg logit contributions
-0.465
 Hed-0.425
 Holly-0.399
 Polly-0.399
Pie-0.399
Token want
Token ID765
Feature activation+1.51e+00

Pos logit contributions
+0.205
 Hed+0.189
 Holly+0.178
 Polly+0.178
Pie+0.177
Neg logit contributions
 overwhelm-0.204
 overpower-0.183
 confuse-0.180
 overshadow-0.170
secut-0.170
Token everyone
Token ID2506
Feature activation-7.67e-01

Pos logit contributions
 overwhelm+1.199
 overpower+1.071
 confuse+1.052
 overshadow+0.997
secut+0.992
Neg logit contributions
-1.263
 Hed-1.162
Pie-1.100
 Holly-1.095
 Polly-1.095
Token to
Token ID284
Feature activation-4.35e+00

Pos logit contributions
+0.625
 Hed+0.577
 Holly+0.544
 Polly+0.543
Pie+0.540
Neg logit contributions
 overwhelm-0.625
 overpower-0.559
 confuse-0.552
 overshadow-0.522
secut-0.521
Token be
Token ID307
Feature activation-3.81e+00

Pos logit contributions
+3.379
 Hed+3.179
 Holly+3.007
 Polly+3.005
 Katz+2.985
Neg logit contributions
 overwhelm-3.458
 overpower-3.092
 confuse-3.034
secut-2.953
Reviewer-2.880
Token respectful
Token ID25923
Feature activation-2.86e+00

Pos logit contributions
+3.281
 Hed+3.065
 Polly+2.935
 Holly+2.932
 Katz+2.901
Neg logit contributions
 overwhelm-2.862
 overpower-2.502
 confuse-2.492
secut-2.335
 overshadow-2.327
Token,
Token ID11
Feature activation-4.89e-01

Pos logit contributions
+2.244
 Hed+2.062
 Polly+1.939
 Holly+1.934
Pie+1.908
Neg logit contributions
 overwhelm-2.402
 overpower-2.161
 confuse-2.136
secut-2.019
 overshadow-2.010
Token and
Token ID290
Feature activation-4.24e-01

Pos logit contributions
+0.382
 Hed+0.351
 Holly+0.331
 Polly+0.330
Pie+0.328
Neg logit contributions
 overwhelm-0.415
 overpower-0.373
 confuse-0.369
 overshadow-0.350
secut-0.349
Token let
Token ID1309
Feature activation+3.54e-02

Pos logit contributions
+0.353
 Hed+0.326
 Holly+0.308
 Polly+0.308
Pie+0.306
Neg logit contributions
 overwhelm-0.338
 overpower-0.302
 confuse-0.298
 overshadow-0.281
secut-0.281
Token permission
Token ID7170
Feature activation-1.47e+00

Pos logit contributions
 overwhelm+0.597
 overpower+0.528
 confuse+0.521
 overshadow+0.491
secut+0.488
Neg logit contributions
-0.703
 Hed-0.654
 Holly-0.618
 Polly-0.618
 Katz-0.616
Token.
Token ID13
Feature activation+6.87e-01

Pos logit contributions
+1.139
 Hed+1.055
 Polly+1.000
 Holly+0.997
 Katz+0.987
Neg logit contributions
 overwhelm-1.257
 overpower-1.131
 confuse-1.119
 overshadow-1.059
secut-1.051
Token<|endoftext|>
Token ID50256
Feature activation-1.09e+00

Pos logit contributions
 overwhelm+0.542
 overpower+0.484
 confuse+0.477
 overshadow+0.449
secut+0.449
Neg logit contributions
-0.576
 Hed-0.532
 Holly-0.501
 Polly-0.499
Pie-0.499
TokenThe
Token ID464
Feature activation+4.75e-01

Pos logit contributions
+0.763
 Hed+0.689
Pie+0.645
 Holly+0.643
 Polly+0.641
Neg logit contributions
 overwhelm-1.025
 overpower-0.932
 confuse-0.922
 overshadow-0.880
secut-0.874
Token S
Token ID311
Feature activation+1.49e+00

Pos logit contributions
 overwhelm+0.388
 overpower+0.348
 confuse+0.343
 overshadow+0.325
secut+0.323
Neg logit contributions
-0.388
 Hed-0.355
 Holly-0.335
Pie-0.335
 Polly-0.334
Tokenver
Token ID332
Feature activation-4.34e+00

Pos logit contributions
 overwhelm+1.237
 overpower+1.110
 confuse+1.096
 overshadow+1.037
secut+1.034
Neg logit contributions
-1.202
 Hed-1.103
 Holly-1.040
 Polly-1.039
Pie-1.035
Tokenig
Token ID328
Feature activation-3.07e-01

Pos logit contributions
+3.980
 Hed+3.841
 Katz+3.668
 Holly+3.652
Pie+3.638
Neg logit contributions
 overwhelm-2.857
 overpower-2.533
 confuse-2.494
 overshadow-2.315
secut-2.261
Tokenes
Token ID274
Feature activation+7.61e-01

Pos logit contributions
+0.193
 Hed+0.174
 Holly+0.161
 Polly+0.160
Pie+0.160
Neg logit contributions
 overwhelm-0.307
 overpower-0.281
 confuse-0.278
 overshadow-0.266
secut-0.265
Token Ri
Token ID30385
Feature activation-7.77e-01

Pos logit contributions
 overwhelm+0.627
 overpower+0.562
 confuse+0.556
 overshadow+0.525
secut+0.523
Neg logit contributions
-0.617
 Hed-0.566
 Holly-0.534
 Polly-0.533
Pie-0.531
Tokenks
Token ID591
Feature activation-5.92e-01

Pos logit contributions
+0.590
 Hed+0.544
Pie+0.511
 Holly+0.510
 Polly+0.508
Neg logit contributions
 overwhelm-0.674
 overpower-0.609
 confuse-0.601
secut-0.571
 overshadow-0.570
Tokenbank
Token ID17796
Feature activation-7.76e-01

Pos logit contributions
+0.482
 Hed+0.446
 Holly+0.419
Pie+0.419
 Polly+0.418
Neg logit contributions
 overwhelm-0.483
 overpower-0.433
 confuse-0.427
 overshadow-0.404
secut-0.401
Token.
Token ID13
Feature activation-5.84e-01

Pos logit contributions
+0.391
 Hed+0.360
 Holly+0.340
 Polly+0.339
Pie+0.338
Neg logit contributions
 overwhelm-0.401
 overpower-0.359
 confuse-0.355
 overshadow-0.335
secut-0.334
Token Rachel
Token ID15984
Feature activation-2.12e-01

Pos logit contributions
+0.422
 Hed+0.384
 Holly+0.359
 Polly+0.358
Pie+0.357
Neg logit contributions
 overwhelm-0.531
 overpower-0.481
 confuse-0.476
 overshadow-0.453
secut-0.451
Token Salt
Token ID13754
Feature activation+6.09e-01

Pos logit contributions
+0.139
 Hed+0.125
 Holly+0.116
 Polly+0.116
Pie+0.115
Neg logit contributions
 overwhelm-0.207
 overpower-0.189
 confuse-0.187
 overshadow-0.179
secut-0.178
Tokenz
Token ID89
Feature activation+1.04e+00

Pos logit contributions
 overwhelm+0.527
 overpower+0.475
 confuse+0.470
 overshadow+0.447
secut+0.445
Neg logit contributions
-0.468
 Hed-0.425
 Holly-0.399
 Polly-0.399
Pie-0.397
Tokenc
Token ID66
Feature activation-1.05e+00

Pos logit contributions
 overwhelm+0.860
 overpower+0.771
 confuse+0.762
 overshadow+0.722
secut+0.717
Neg logit contributions
-0.842
 Hed-0.770
 Holly-0.725
 Polly-0.723
Pie-0.723
Tokenalls
Token ID5691
Feature activation-4.08e+00

Pos logit contributions
+0.820
 Hed+0.755
 Holly+0.710
 Polly+0.708
Pie+0.706
Neg logit contributions
 overwhelm-0.893
 overpower-0.803
 confuse-0.792
 overshadow-0.751
secut-0.750
Token it
Token ID340
Feature activation-9.78e-01

Pos logit contributions
+3.138
 Hed+2.952
 Holly+2.785
 Polly+2.770
 Katz+2.753
Neg logit contributions
 overwhelm-3.427
 overpower-3.080
 confuse-3.035
secut-2.885
 overshadow-2.881
Token "
Token ID366
Feature activation+9.03e-01

Pos logit contributions
+0.862
 Hed+0.804
 Holly+0.762
 Polly+0.761
 Katz+0.754
Neg logit contributions
 overwhelm-0.731
 overpower-0.647
 confuse-0.639
secut-0.600
 overshadow-0.600
Tokena
Token ID64
Feature activation+8.49e-03

Pos logit contributions
 overwhelm+0.776
 overpower+0.701
 confuse+0.691
secut+0.656
 overshadow+0.656
Neg logit contributions
-0.695
 Hed-0.636
 Holly-0.596
 Polly-0.596
Pie-0.592
Token kind
Token ID1611
Feature activation-3.23e+00

Pos logit contributions
 overwhelm+0.006
 overpower+0.006
 confuse+0.005
 overshadow+0.005
secut+0.005
Neg logit contributions
-0.008
 Hed-0.007
Pie-0.007
 Holly-0.007
 Polly-0.007
Token of
Token ID286
Feature activation-1.51e+00

Pos logit contributions
+2.652
 Hed+2.462
 Holly+2.316
 Polly+2.313
 Katz+2.287
Neg logit contributions
 overwhelm-2.599
 overpower-2.318
 confuse-2.290
 overshadow-2.166
secut-2.164