TOP ACTIVATIONS
MAX = 2.074

des Cors aires ay ant c id és de qu
Y alk ut Tal mud Torah , Genesis
Russian Mars probe , Ph obos - Gr unt , last
County residents who find themselves targeted by construction or home improvement
Due to staff cuts , attrition and low funding she is
- as it 's not unheard of that collectors have paid
obby M ook ( left ) was asked by NBC News
to differ among individuals and ethnic groups . One of the
Priv at Group , an uncons olid ated business group with
v ous et vot re f unt p ère ,
from homosexual to heterosexual ( Picture : Getty Images )
." How much debris does it take to kill
cl ares offic iel lement ser te ur du programme
plates stre wn on one side of a chain - link
why the two officers were targeted . The cops identified
Ki pling subway station in Et obic oke .
As the country becomes more polarized , so too do
for the history of the Universe and the possibility of life
just how many pieces of debris it takes to kill a
re e ux s ont pl or és et le

MIN ACTIVATIONS
MIN = -1.911

4 Release Date Del ayed Until At Le ast March Of
government is stepping in on mand ating parental leave . OS
standards for normal cy . Until I began to pay serious
he said . Although most suff erers experience normal flu symptoms
to you : Until these blatant and well -
or wash with . De hyd ration , starvation , malnutrition
to flow out of . Until the glacier retreat s to
with not knowing , is enough to get someone to open
for Dunn to return . Until Dunn visits the specialist ,
clubs attempted a coup . Until that moment , the general
We ve Had Enough (# а д
't figure it out fast enough ; and he
that was the sc ari est teen girl that I could
explain why the f ier cest crackdown s against the protest
, it s enough to shake the Earth .
many Kurds , who are demanding a bigger say in the
again : the " How To Stop Rape " article was
behave properly . <|endoftext|> by Erik Alt ieri , NOR ML
that the clinical improvements continued until week 12 during ta V
- cent piece Until 18 57 , the cent

INTERVAL 1.078 - 2.074
CONTAINS 0.162%

McDonnell would give me some pause when considering whether a ten
different things eliminating one does not have to eliminate the other
As Economy Minister Luis de Gu ind os revealed this
BCE the Persian king Xer xes sent his massive army across
exodus of would - be prospect ors dreaming of a better

INTERVAL 0.082 - 1.078
CONTAINS 38.767%

cheap shot , Crow der repeated himself . "
federal law enforcement ? It doesn t of course
that moving into a shelter is a better option .
" Indeed , it would now be a surprise if
monthly salary expectation of 6 , 000 h ry v nia

INTERVAL -0.915 - 0.082
CONTAINS 60.494%

thereby take possession of it . The
will pass a law that will ann ul all legal claims
didn 't go back to school in 2010 .
his 100 . G err ard s
traveling gra ff ito . For photographic evidence of

INTERVAL -1.911 - -0.915
CONTAINS 0.577%

the petition ers on whose plea the high court had passed
the power of the New York Old Guard ." He also
ous " and would force one side of our planet to
P ogue at the New York Times as well as by
re accused of having a tax - and - sp end
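
The four interval boundaries listed above appear consistent with splitting the range from MIN (-1.911) to MAX (2.074) into four equal-width bins. The following is a minimal sketch of that check, not the dashboard's actual code; the function name `equal_width_edges` and the assumption of equal-width binning are illustrative only.

```python
# Sketch: check that the reported interval edges match equal-width binning.
# Assumption (not stated in the source): the dashboard splits [MIN, MAX]
# into four bins of equal width.

def equal_width_edges(lo, hi, n_bins):
    """Return the n_bins + 1 edges of equal-width bins covering [lo, hi]."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]

# Values copied from the MIN / MAX and INTERVAL lines above.
reported_edges = [-1.911, -0.915, 0.082, 1.078, 2.074]
reported_shares = [0.577, 60.494, 38.767, 0.162]  # percent, lowest bin first

edges = equal_width_edges(-1.911, 2.074, 4)

# Compare against the reported edges, allowing for 3-decimal rounding.
edges_match = all(abs(a - b) < 1e-3 for a, b in zip(edges, reported_edges))
total_share = sum(reported_shares)

print(edges_match, round(total_share, 3))
```

Under this assumption each bin is about 0.996 wide, and the reported shares account for all sampled activations, with well under 1% falling in either extreme bin.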
Token des
Token ID748
Feature activation+1.24e+00

Pos logit contributions
asca+1.178
 rug+1.021
 CTR+0.961
+0.830
ワン+0.814
Neg logit contributions
ilation-1.121
ongyang-1.075
idding-1.059
orian-1.058
 Conclusion-1.041
Token Cors
Token ID26978
Feature activation+3.57e-01

Pos logit contributions
asca+4.477
 rug+3.972
 CTR+3.753
+3.256
ワン+3.203
Neg logit contributions
ilation-3.372
ongyang-3.247
idding-3.190
orian-3.160
 Conclusion-3.071
Tokenaires
Token ID17693
Feature activation+2.87e-02

Pos logit contributions
asca+1.517
 rug+1.357
 CTR+1.298
+1.172
ワン+1.156
Neg logit contributions
ilation-0.768
ongyang-0.720
idding-0.704
orian-0.702
 Conclusion-0.690
Token ay
Token ID38762
Feature activation+2.67e-01

Pos logit contributions
asca+0.098
 rug+0.086
 CTR+0.081
+0.071
ワン+0.070
Neg logit contributions
ilation-0.085
ongyang-0.081
idding-0.080
orian-0.080
 Conclusion-0.079
Tokenant
Token ID415
Feature activation+2.71e-01

Pos logit contributions
asca+1.196
 rug+1.076
 CTR+1.032
+0.936
ワン+0.921
Neg logit contributions
ilation-0.511
ongyang-0.476
orian-0.462
idding-0.462
 Conclusion-0.452
Token
Token ID39073
Feature activation+2.07e+00

Pos logit contributions
asca+0.894
 rug+0.775
 CTR+0.730
+0.633
ワン+0.620
Neg logit contributions
ilation-0.840
ongyang-0.804
idding-0.793
orian-0.792
 Conclusion-0.780
Tokenc
Token ID66
Feature activation+5.75e-01

Pos logit contributions
asca+7.299
 rug+6.394
 CTR+6.041
+5.292
 guid+5.174
Neg logit contributions
ilation-5.505
ongyang-5.299
idding-5.222
orian-5.172
 Protector-5.155
Tokenid
Token ID312
Feature activation-3.97e-01

Pos logit contributions
asca+2.423
 rug+2.139
 CTR+2.061
+1.842
ワン+1.797
Neg logit contributions
ilation-1.240
ongyang-1.185
 Conclusion-1.150
orian-1.150
idding-1.142
Tokenés
Token ID20954
Feature activation+7.04e-01

Pos logit contributions
ilation+0.903
ongyang+0.845
idding+0.832
orian+0.831
 Conclusion+0.811
Neg logit contributions
asca-1.636
 rug-1.464
 CTR-1.397
-1.257
ワン-1.242
Token de
Token ID390
Feature activation+8.84e-01

Pos logit contributions
asca+2.373
 rug+2.070
 CTR+1.952
+1.692
ワン+1.652
Neg logit contributions
ilation-2.107
ongyang-2.033
idding-1.992
orian-1.990
 Conclusion-1.952
Token qu
Token ID627
Feature activation+7.04e-02

Pos logit contributions
asca+2.956
 rug+2.590
 CTR+2.445
+2.092
ワン+2.046
Neg logit contributions
ilation-2.656
ongyang-2.560
idding-2.521
orian-2.513
 Conclusion-2.455
Token
Token ID250
Feature activation+1.19e-01

Pos logit contributions
ilation+0.766
ongyang+0.727
orian+0.721
idding+0.716
 Conclusion+0.708
Neg logit contributions
asca-0.942
 rug-0.821
 CTR-0.781
-0.682
ワン-0.679
TokenY
Token ID56
Feature activation+6.91e-02

Pos logit contributions
asca+0.431
 rug+0.379
 CTR+0.359
+0.317
ワン+0.312
Neg logit contributions
ilation-0.327
ongyang-0.311
orian-0.306
idding-0.306
 Conclusion-0.301
Tokenalk
Token ID971
Feature activation+2.01e-01

Pos logit contributions
asca+0.264
 rug+0.234
 CTR+0.223
+0.198
ワン+0.195
Neg logit contributions
ilation-0.178
ongyang-0.168
idding-0.166
orian-0.166
 Conclusion-0.162
Tokenut
Token ID315
Feature activation+6.06e-01

Pos logit contributions
asca+0.782
 rug+0.693
 CTR+0.659
+0.588
ワン+0.578
Neg logit contributions
ilation-0.502
ongyang-0.475
idding-0.467
orian-0.465
 Conclusion-0.458
Token Tal
Token ID7193
Feature activation+4.53e-01

Pos logit contributions
asca+2.266
 rug+1.999
 CTR+1.897
+1.681
ワン+1.650
Neg logit contributions
ilation-1.607
ongyang-1.527
idding-1.504
orian-1.497
 Conclusion-1.472
Tokenmud
Token ID41650
Feature activation+1.94e+00

Pos logit contributions
asca+1.679
 rug+1.478
 CTR+1.401
+1.240
ワン+1.214
Neg logit contributions
ilation-1.212
ongyang-1.154
idding-1.134
orian-1.130
 Conclusion-1.113
Token Torah
Token ID33366
Feature activation+1.35e+00

Pos logit contributions
asca+6.948
 rug+6.195
 CTR+5.840
 �+5.134
+5.083
Neg logit contributions
ilation-5.146
ongyang-5.031
idding-4.886
orian-4.755
 Conclusion-4.663
Token,
Token ID11
Feature activation+9.41e-04

Pos logit contributions
asca+4.950
 rug+4.399
 CTR+4.174
+3.654
 �+3.642
Neg logit contributions
ilation-3.564
ongyang-3.446
idding-3.373
orian-3.331
 Conclusion-3.257
Token
Token ID447
Feature activation-1.84e-01

Pos logit contributions
asca+0.003
 rug+0.003
 CTR+0.003
+0.002
ワン+0.002
Neg logit contributions
ilation-0.003
ongyang-0.003
orian-0.003
idding-0.003
 Conclusion-0.003
Token
Token ID251
Feature activation+4.23e-01

Pos logit contributions
ilation+0.462
ongyang+0.440
orian+0.436
idding+0.430
 Opposition+0.426
Neg logit contributions
asca-0.713
 rug-0.627
 CTR-0.597
-0.533
ワン-0.528
Token Genesis
Token ID18993
Feature activation+3.80e-01

Pos logit contributions
asca+1.534
 rug+1.348
 CTR+1.278
+1.128
ワン+1.109
Neg logit contributions
ilation-1.172
ongyang-1.114
idding-1.097
orian-1.096
 Conclusion-1.078
Token Russian
Token ID3394
Feature activation+1.97e-01

Pos logit contributions
asca+0.551
 rug+0.486
 CTR+0.461
+0.410
ワン+0.405
Neg logit contributions
ilation-0.397
ongyang-0.377
idding-0.370
orian-0.369
 Opposition-0.365
Token Mars
Token ID8706
Feature activation-1.65e-01

Pos logit contributions
asca+0.711
 rug+0.620
 CTR+0.585
+0.522
ワン+0.515
Neg logit contributions
ilation-0.547
ongyang-0.520
idding-0.512
orian-0.511
 Opposition-0.507
Token probe
Token ID12774
Feature activation+1.88e-01

Pos logit contributions
ilation+0.436
ongyang+0.410
orian+0.403
idding+0.403
 Opposition+0.399
Neg logit contributions
asca-0.621
 rug-0.547
 CTR-0.518
-0.463
ワン-0.457
Token,
Token ID11
Feature activation-2.69e-01

Pos logit contributions
asca+0.658
 rug+0.576
 CTR+0.546
+0.477
ワン+0.468
Neg logit contributions
ilation-0.545
ongyang-0.519
orian-0.512
idding-0.512
 Conclusion-0.502
Token Ph
Token ID1380
Feature activation+6.57e-01

Pos logit contributions
ilation+0.729
ongyang+0.687
idding+0.681
orian+0.676
 Conclusion+0.673
Neg logit contributions
asca-0.988
 rug-0.867
 CTR-0.818
-0.732
ワン-0.721
Tokenobos
Token ID49878
Feature activation+1.85e+00

Pos logit contributions
asca+2.382
 rug+2.085
 CTR+1.968
+1.739
ワン+1.682
Neg logit contributions
ilation-1.769
ongyang-1.692
 Conclusion-1.676
idding-1.675
orian-1.658
Token-
Token ID12
Feature activation+3.32e-01

Pos logit contributions
asca+6.342
 rug+5.622
 CTR+5.341
+4.590
 �+4.506
Neg logit contributions
ilation-5.239
ongyang-4.975
idding-4.945
orian-4.927
 Conclusion-4.811
TokenGr
Token ID8642
Feature activation+7.96e-01

Pos logit contributions
asca+0.990
 rug+0.846
 CTR+0.790
+0.672
ワン+0.653
Neg logit contributions
ilation-1.126
ongyang-1.084
orian-1.073
idding-1.071
 Conclusion-1.057
Tokenunt
Token ID2797
Feature activation-2.30e-01

Pos logit contributions
asca+3.944
 rug+3.460
 CTR+3.272
+3.106
 Birch+3.013
Neg logit contributions
ilation-1.126
ongyang-1.064
 Conclusion-1.030
orian-1.026
 Opposition-1.014
Token,
Token ID11
Feature activation+1.10e-01

Pos logit contributions
ilation+0.616
ongyang+0.580
orian+0.574
idding+0.573
 Opposition+0.565
Neg logit contributions
asca-0.854
 rug-0.750
 CTR-0.709
-0.634
ワン-0.629
Token last
Token ID938
Feature activation-1.41e-01

Pos logit contributions
asca+0.354
 rug+0.306
 CTR+0.288
+0.249
ワン+0.244
Neg logit contributions
ilation-0.346
ongyang-0.331
idding-0.327
orian-0.327
 Conclusion-0.322
Token County
Token ID3418
Feature activation-3.15e-01

Pos logit contributions
asca+1.611
 rug+1.391
 CTR+1.308
+1.133
ワン+1.112
Neg logit contributions
ilation-1.582
ongyang-1.511
idding-1.494
orian-1.493
 Conclusion-1.473
Token residents
Token ID5085
Feature activation-2.08e-01

Pos logit contributions
ilation+0.990
idding+0.941
ongyang+0.941
orian+0.937
 Opposition+0.931
Neg logit contributions
asca-1.018
 rug-0.877
 CTR-0.823
-0.719
ワン-0.711
Token who
Token ID508
Feature activation-2.97e-01

Pos logit contributions
ilation+0.582
ongyang+0.551
idding+0.548
orian+0.548
 Conclusion+0.541
Neg logit contributions
asca-0.745
 rug-0.646
 CTR-0.614
-0.550
ワン-0.542
Token find
Token ID1064
Feature activation-3.54e-01

Pos logit contributions
ilation+0.897
ongyang+0.852
idding+0.846
orian+0.843
 Conclusion+0.832
Neg logit contributions
asca-1.000
 rug-0.864
 CTR-0.819
-0.721
ワン-0.710
Token themselves
Token ID2405
Feature activation+6.98e-01

Pos logit contributions
ilation+0.960
ongyang+0.903
idding+0.901
orian+0.896
 Opposition+0.889
Neg logit contributions
asca-1.299
 rug-1.130
 CTR-1.074
-0.964
ワン-0.955
Token targeted
Token ID7977
Feature activation+1.84e+00

Pos logit contributions
asca+2.211
 rug+1.916
 CTR+1.793
+1.539
ワン+1.499
Neg logit contributions
ilation-2.246
ongyang-2.161
orian-2.121
idding-2.119
 Conclusion-2.092
Token by
Token ID416
Feature activation+2.79e-01

Pos logit contributions
asca+5.501
 rug+4.753
 CTR+4.415
 �+3.703
+3.661
Neg logit contributions
ilation-6.078
ongyang-5.888
orian-5.723
idding-5.685
 Conclusion-5.616
Token construction
Token ID5103
Feature activation-1.31e-01

Pos logit contributions
asca+0.935
 rug+0.809
 CTR+0.763
+0.667
ワン+0.656
Neg logit contributions
ilation-0.851
ongyang-0.811
idding-0.802
orian-0.801
 Conclusion-0.790
Token or
Token ID393
Feature activation+2.13e-01

Pos logit contributions
ilation+0.385
ongyang+0.365
idding+0.363
orian+0.361
 Opposition+0.358
Neg logit contributions
asca-0.455
 rug-0.391
 CTR-0.373
-0.327
ワン-0.322
Token home
Token ID1363
Feature activation+8.38e-02

Pos logit contributions
asca+0.702
 rug+0.607
 CTR+0.572
+0.498
ワン+0.489
Neg logit contributions
ilation-0.660
ongyang-0.630
idding-0.623
orian-0.622
 Conclusion-0.613
Token improvement
Token ID9025
Feature activation+1.05e-02

Pos logit contributions
asca+0.275
 rug+0.235
 CTR+0.222
+0.193
ワン+0.190
Neg logit contributions
ilation-0.263
ongyang-0.248
idding-0.248
orian-0.247
 Conclusion-0.243
Token Due
Token ID14444
Feature activation-3.87e-01

Pos logit contributions
asca+1.791
 rug+1.584
 CTR+1.501
+1.337
ワン+1.315
Neg logit contributions
ilation-1.241
ongyang-1.172
idding-1.156
orian-1.156
 Opposition-1.139
Token to
Token ID284
Feature activation+3.98e-01

Pos logit contributions
ilation+1.027
ongyang+0.968
idding+0.959
 Opposition+0.945
orian+0.941
Neg logit contributions
asca-1.456
 rug-1.274
 CTR-1.222
-1.088
ワン-1.076
Token staff
Token ID3085
Feature activation+3.74e-01

Pos logit contributions
asca+1.363
 rug+1.185
 CTR+1.119
+0.981
ワン+0.965
Neg logit contributions
ilation-1.184
ongyang-1.123
idding-1.113
orian-1.111
 Opposition-1.098
Token cuts
Token ID6630
Feature activation+1.07e+00

Pos logit contributions
asca+1.429
 rug+1.259
 CTR+1.201
+1.067
ワン+1.055
Neg logit contributions
ilation-0.967
ongyang-0.908
orian-0.898
idding-0.896
 Opposition-0.885
Token,
Token ID11
Feature activation+3.34e-01

Pos logit contributions
asca+3.729
 rug+3.272
 CTR+3.084
+2.712
 �+2.651
Neg logit contributions
ilation-3.098
ongyang-2.963
orian-2.895
idding-2.894
 Conclusion-2.855
Token attrition
Token ID48981
Feature activation+1.78e+00

Pos logit contributions
asca+1.139
 rug+0.991
 CTR+0.935
+0.819
ワン+0.805
Neg logit contributions
ilation-0.997
ongyang-0.949
idding-0.937
orian-0.937
 Conclusion-0.924
Token and
Token ID290
Feature activation+2.79e-01

Pos logit contributions
asca+5.726
 rug+5.012
 CTR+4.670
+4.055
 �+3.996
Neg logit contributions
ilation-5.479
ongyang-5.336
idding-5.179
orian-5.133
 Conclusion-5.087
Token low
Token ID1877
Feature activation+9.97e-03

Pos logit contributions
asca+0.950
 rug+0.822
 CTR+0.777
+0.682
ワン+0.672
Neg logit contributions
ilation-0.837
ongyang-0.794
orian-0.787
idding-0.787
 Opposition-0.778
Token funding
Token ID4918
Feature activation+7.80e-01

Pos logit contributions
asca+0.032
 rug+0.028
 CTR+0.026
+0.023
ワン+0.023
Neg logit contributions
ilation-0.031
ongyang-0.030
idding-0.030
orian-0.029
 Opposition-0.029
Token she
Token ID673
Feature activation+5.86e-01

Pos logit contributions
asca+2.859
 rug+2.515
 CTR+2.388
+2.110
ワン+2.080
Neg logit contributions
ilation-2.130
ongyang-2.020
idding-1.993
orian-1.991
 Conclusion-1.958
Token is
Token ID318
Feature activation+2.65e-01

Pos logit contributions
asca+1.972
 rug+1.715
 CTR+1.602
+1.407
ワン+1.371
Neg logit contributions
ilation-1.772
ongyang-1.702
idding-1.674
orian-1.667
 Conclusion-1.645
Token -
Token ID532
Feature activation+2.06e-01

Pos logit contributions
asca+1.195
 rug+1.060
 CTR+1.008
+0.899
ワン+0.884
Neg logit contributions
ilation-0.781
ongyang-0.740
idding-0.726
orian-0.725
 Conclusion-0.712
Token as
Token ID355
Feature activation+2.28e-01

Pos logit contributions
asca+0.680
 rug+0.591
 CTR+0.557
+0.482
ワン+0.471
Neg logit contributions
ilation-0.639
ongyang-0.611
idding-0.602
orian-0.600
 Conclusion-0.592
Token it
Token ID340
Feature activation+4.83e-01

Pos logit contributions
asca+0.758
 rug+0.658
 CTR+0.620
+0.539
ワン+0.528
Neg logit contributions
ilation-0.699
ongyang-0.668
orian-0.658
idding-0.658
 Conclusion-0.648
Token's
Token ID338
Feature activation+6.39e-02

Pos logit contributions
asca+1.626
 rug+1.416
 CTR+1.332
+1.153
ワン+1.125
Neg logit contributions
ilation-1.462
ongyang-1.402
idding-1.380
orian-1.373
 Conclusion-1.360
Token not
Token ID407
Feature activation+3.54e-01

Pos logit contributions
asca+0.219
 rug+0.192
 CTR+0.181
+0.158
ワン+0.154
Neg logit contributions
ilation-0.189
ongyang-0.181
idding-0.178
orian-0.178
 Conclusion-0.175
Token unheard
Token ID36511
Feature activation+1.74e+00

Pos logit contributions
asca+1.201
 rug+1.048
 CTR+0.988
+0.855
 �+0.835
Neg logit contributions
ilation-1.065
ongyang-1.019
idding-1.000
orian-1.000
 Conclusion-0.989
Token of
Token ID286
Feature activation+2.54e-01

Pos logit contributions
asca+6.188
 rug+5.470
 CTR+5.135
+4.531
 �+4.509
Neg logit contributions
ilation-4.742
ongyang-4.472
idding-4.403
orian-4.341
 Conclusion-4.315
Token that
Token ID326
Feature activation+1.58e-01

Pos logit contributions
asca+0.843
 rug+0.733
 CTR+0.690
+0.600
ワン+0.587
Neg logit contributions
ilation-0.777
ongyang-0.742
idding-0.731
orian-0.730
 Conclusion-0.721
Token collectors
Token ID26668
Feature activation+4.78e-01

Pos logit contributions
asca+0.525
 rug+0.456
 CTR+0.429
+0.373
ワン+0.366
Neg logit contributions
ilation-0.483
ongyang-0.461
idding-0.455
orian-0.454
 Conclusion-0.448
Token have
Token ID423
Feature activation-7.62e-03

Pos logit contributions
asca+1.508
 rug+1.313
 CTR+1.228
+1.054
ワン+1.027
Neg logit contributions
ilation-1.536
ongyang-1.481
idding-1.457
orian-1.454
 Conclusion-1.431
Token paid
Token ID3432
Feature activation+2.64e-01

Pos logit contributions
ilation+0.023
ongyang+0.022
idding+0.022
orian+0.022
 Conclusion+0.022
Neg logit contributions
asca-0.025
 rug-0.022
 CTR-0.021
-0.018
ワン-0.018
Tokenobby
Token ID11369
Feature activation-7.22e-02

Pos logit contributions
asca+2.408
 rug+2.117
 CTR+2.025
+1.795
ワン+1.756
Neg logit contributions
ilation-1.643
ongyang-1.560
orian-1.521
idding-1.519
 Conclusion-1.510
Token M
Token ID337
Feature activation+3.21e-01

Pos logit contributions
ilation+0.174
orian+0.163
ongyang+0.162
idding+0.160
 Conclusion+0.160
Neg logit contributions
asca-0.287
 rug-0.255
 CTR-0.243
-0.219
ワン-0.216
Tokenook
Token ID566
Feature activation+4.84e-01

Pos logit contributions
asca+1.066
 rug+0.907
 CTR+0.866
+0.751
 �+0.727
Neg logit contributions
ilation-0.988
ongyang-0.939
 Conclusion-0.926
orian-0.924
idding-0.921
Token (
Token ID357
Feature activation+1.09e-01

Pos logit contributions
asca+1.696
 rug+1.484
 CTR+1.403
+1.229
ワン+1.206
Neg logit contributions
ilation-1.402
ongyang-1.335
idding-1.316
orian-1.314
 Conclusion-1.293
Tokenleft
Token ID9464
Feature activation+6.80e-01

Pos logit contributions
asca+0.398
 rug+0.350
 CTR+0.331
+0.292
ワン+0.289
Neg logit contributions
ilation-0.302
ongyang-0.286
orian-0.282
idding-0.282
 Opposition-0.277
Token)
Token ID8
Feature activation+1.72e+00

Pos logit contributions
asca+2.450
 rug+2.154
 CTR+2.042
+1.799
ワン+1.765
Neg logit contributions
ilation-1.896
ongyang-1.803
idding-1.775
orian-1.771
 Conclusion-1.747
Token was
Token ID373
Feature activation+3.26e-01

Pos logit contributions
asca+5.601
 rug+4.913
 CTR+4.636
 �+3.961
+3.936
Neg logit contributions
ilation-5.267
ongyang-5.043
idding-4.994
orian-4.896
 Conclusion-4.867
Token asked
Token ID1965
Feature activation+3.41e-01

Pos logit contributions
asca+1.067
 rug+0.923
 CTR+0.869
+0.755
ワン+0.741
Neg logit contributions
ilation-1.019
ongyang-0.974
orian-0.962
idding-0.961
 Conclusion-0.947
Token by
Token ID416
Feature activation+7.66e-02

Pos logit contributions
asca+1.121
 rug+0.971
 CTR+0.915
+0.794
ワン+0.779
Neg logit contributions
ilation-1.056
ongyang-1.009
idding-0.996
orian-0.995
 Conclusion-0.981
Token NBC
Token ID9716
Feature activation-3.72e-01

Pos logit contributions
asca+0.271
 rug+0.237
 CTR+0.222
+0.198
ワン+0.196
Neg logit contributions
ilation-0.218
ongyang-0.207
orian-0.205
idding-0.204
 Opposition-0.203
Token News
Token ID3000
Feature activation+1.71e-01

Pos logit contributions
ilation+0.880
ongyang+0.830
orian+0.814
idding+0.813
 Opposition+0.802
Neg logit contributions
asca-1.500
 rug-1.333
 CTR-1.266
-1.144
ワン-1.132
Token to
Token ID284
Feature activation+6.81e-01

Pos logit contributions
asca+1.715
 rug+1.500
 CTR+1.420
+1.246
ワン+1.224
Neg logit contributions
ilation-1.411
ongyang-1.343
idding-1.324
orian-1.323
 Conclusion-1.302
Token differ
Token ID13238
Feature activation+7.63e-02

Pos logit contributions
asca+2.193
 rug+1.906
 CTR+1.793
+1.546
ワン+1.508
Neg logit contributions
ilation-2.147
ongyang-2.058
idding-2.029
orian-2.024
 Conclusion-1.997
Token among
Token ID1871
Feature activation+4.46e-01

Pos logit contributions
asca+0.241
 rug+0.207
 CTR+0.195
+0.169
ワン+0.167
Neg logit contributions
ilation-0.246
ongyang-0.234
orian-0.232
idding-0.231
 Conclusion-0.230
Token individuals
Token ID3925
Feature activation+4.97e-01

Pos logit contributions
asca+1.548
 rug+1.354
 CTR+1.280
+1.115
ワン+1.091
Neg logit contributions
ilation-1.306
ongyang-1.244
orian-1.226
idding-1.225
 Conclusion-1.203
Token and
Token ID290
Feature activation+2.87e-01

Pos logit contributions
asca+1.733
 rug+1.517
 CTR+1.436
+1.255
ワン+1.230
Neg logit contributions
ilation-1.444
ongyang-1.376
idding-1.354
orian-1.353
 Conclusion-1.329
Token ethnic
Token ID9450
Feature activation+1.72e+00

Pos logit contributions
asca+0.964
 rug+0.837
 CTR+0.791
+0.692
ワン+0.684
Neg logit contributions
ilation-0.867
ongyang-0.828
orian-0.818
idding-0.818
 Conclusion-0.807
Token groups
Token ID2628
Feature activation+4.41e-01

Pos logit contributions
asca+5.261
 rug+4.631
 CTR+4.296
+3.633
 裏�+3.586
Neg logit contributions
ilation-5.589
ongyang-5.389
idding-5.286
orian-5.265
 Conclusion-5.172
Token.
Token ID13
Feature activation+4.36e-01

Pos logit contributions
asca+1.621
 rug+1.428
 CTR+1.354
+1.198
ワン+1.176
Neg logit contributions
ilation-1.201
ongyang-1.141
idding-1.122
orian-1.121
 Conclusion-1.100
Token One
Token ID1881
Feature activation-5.03e-01

Pos logit contributions
asca+1.732
 rug+1.545
 CTR+1.492
+1.324
 �+1.304
Neg logit contributions
ilation-1.050
ongyang-1.007
orian-0.971
idding-0.971
emic-0.935
Token of
Token ID286
Feature activation+2.13e-01

Pos logit contributions
ilation+1.464
 Conclusion+1.374
ongyang+1.374
idding+1.373
orian+1.370
Neg logit contributions
asca-1.742
 rug-1.497
 CTR-1.420
-1.260
ワン-1.251
Token the
Token ID262
Feature activation+1.50e-01

Pos logit contributions
asca+0.753
 rug+0.654
 CTR+0.620
+0.548
ワン+0.544
Neg logit contributions
ilation-0.612
ongyang-0.577
idding-0.572
orian-0.569
 Opposition-0.566
Token Priv
Token ID9243
Feature activation+4.82e-01

Pos logit contributions
ilation+0.114
ongyang+0.108
idding+0.106
orian+0.106
 Conclusion+0.105
Neg logit contributions
asca-0.165
 rug-0.146
 CTR-0.138
-0.124
ワン-0.122
Tokenat
Token ID265
Feature activation+1.60e-01

Pos logit contributions
asca+1.914
 rug+1.662
 CTR+1.607
+1.419
 �+1.385
Neg logit contributions
ilation-1.166
ongyang-1.096
idding-1.080
 Conclusion-1.075
orian-1.074
Token Group
Token ID4912
Feature activation-2.95e-01

Pos logit contributions
asca+0.490
 rug+0.422
 CTR+0.395
+0.339
ワン+0.332
Neg logit contributions
ilation-0.529
ongyang-0.507
idding-0.501
orian-0.501
 Conclusion-0.494
Token,
Token ID11
Feature activation-1.41e-01

Pos logit contributions
ilation+0.738
ongyang+0.698
idding+0.686
orian+0.685
 Conclusion+0.675
Neg logit contributions
asca-1.144
 rug-1.012
 CTR-0.964
-0.864
ワン-0.854
Token an
Token ID281
Feature activation-2.43e-01

Pos logit contributions
ilation+0.406
ongyang+0.386
idding+0.381
orian+0.380
 Opposition+0.376
Neg logit contributions
asca-0.493
 rug-0.430
 CTR-0.407
-0.358
ワン-0.353
Token uncons
Token ID13962
Feature activation+1.71e+00

Pos logit contributions
ilation+0.679
ongyang+0.648
orian+0.637
idding+0.636
 Conclusion+0.626
Neg logit contributions
asca-0.871
 rug-0.770
 CTR-0.727
-0.635
ワン-0.622
Tokenolid
Token ID10180
Feature activation-3.77e-02

Pos logit contributions
asca+4.772
 rug+3.972
 CTR+3.733
+3.063
 �+2.970
Neg logit contributions
ilation-5.918
ongyang-5.797
 Conclusion-5.656
idding-5.656
orian-5.619
Tokenated
Token ID515
Feature activation-5.51e-02

Pos logit contributions
ilation+0.086
ongyang+0.080
idding+0.079
orian+0.079
 Conclusion+0.077
Neg logit contributions
asca-0.155
 rug-0.139
 CTR-0.133
-0.119
ワン-0.118
Token business
Token ID1597
Feature activation-3.72e-01

Pos logit contributions
ilation+0.166
ongyang+0.159
idding+0.156
orian+0.156
 Conclusion+0.154
Neg logit contributions
asca-0.186
 rug-0.163
 CTR-0.153
-0.133
ワン-0.130
Token group
Token ID1448
Feature activation-9.75e-02

Pos logit contributions
ilation+1.060
ongyang+1.009
idding+0.994
orian+0.993
 Conclusion+0.978
Neg logit contributions
asca-1.316
 rug-1.152
 CTR-1.091
-0.959
ワン-0.943
Token with
Token ID351
Feature activation+2.43e-01

Pos logit contributions
ilation+0.266
ongyang+0.253
idding+0.249
orian+0.249
 Conclusion+0.244
Neg logit contributions
asca-0.357
 rug-0.315
 CTR-0.299
-0.264
ワン-0.258
Token v
Token ID410
Feature activation+3.05e-01

Pos logit contributions
asca+0.945
 rug+0.820
 CTR+0.773
+0.672
ワン+0.659
Neg logit contributions
ilation-0.873
ongyang-0.835
idding-0.823
orian-0.822
 Conclusion-0.810
Tokenous
Token ID516
Feature activation-2.09e-01

Pos logit contributions
asca+1.173
 rug+1.031
 CTR+0.982
+0.873
ワン+0.855
Neg logit contributions
ilation-0.775
ongyang-0.736
idding-0.723
orian-0.721
 Conclusion-0.715
Token et
Token ID2123
Feature activation+8.04e-01

Pos logit contributions
ilation+0.649
ongyang+0.614
orian+0.610
idding+0.609
 Opposition+0.605
Neg logit contributions
asca-0.682
 rug-0.588
 CTR-0.558
-0.486
ワン-0.478
Token vot
Token ID1993
Feature activation+8.00e-01

Pos logit contributions
asca+2.600
 rug+2.253
 CTR+2.110
+1.818
ワン+1.778
Neg logit contributions
ilation-2.523
ongyang-2.435
idding-2.399
orian-2.380
 Conclusion-2.342
Tokenre
Token ID260
Feature activation+8.28e-01

Pos logit contributions
asca+2.807
 rug+2.465
 CTR+2.302
+2.027
ワン+1.977
Neg logit contributions
ilation-2.268
ongyang-2.182
idding-2.138
orian-2.127
 Conclusion-2.104
Token
Token ID39073
Feature activation+1.65e+00

Pos logit contributions
asca+2.740
 rug+2.396
 CTR+2.242
+1.937
ワン+1.894
Neg logit contributions
ilation-2.525
ongyang-2.439
idding-2.400
orian-2.381
 Conclusion-2.341
Tokenf
Token ID69
Feature activation+1.55e-01

Pos logit contributions
asca+5.734
 rug+4.983
 CTR+4.734
+4.124
ワン+4.025
Neg logit contributions
ilation-4.554
ongyang-4.401
idding-4.333
orian-4.298
 Protector-4.286
Tokenunt
Token ID2797
Feature activation-8.46e-02

Pos logit contributions
asca+0.566
 rug+0.495
 CTR+0.471
+0.414
ワン+0.407
Neg logit contributions
ilation-0.426
ongyang-0.406
idding-0.399
orian-0.399
 Conclusion-0.394
Token p
Token ID279
Feature activation-1.83e-01

Pos logit contributions
ilation+0.247
ongyang+0.235
idding+0.232
orian+0.232
 Conclusion+0.228
Neg logit contributions
asca-0.294
 rug-0.256
 CTR-0.243
-0.213
ワン-0.209
Tokenère
Token ID35979
Feature activation+7.32e-01

Pos logit contributions
ilation+0.433
ongyang+0.407
orian+0.401
idding+0.401
 Conclusion+0.391
Neg logit contributions
asca-0.734
 rug-0.656
 CTR-0.626
-0.561
ワン-0.555
Token,
Token ID11
Feature activation-1.35e-01

Pos logit contributions
asca+2.447
 rug+2.137
 CTR+2.006
+1.733
ワン+1.698
Neg logit contributions
ilation-2.219
ongyang-2.138
orian-2.099
idding-2.096
 Conclusion-2.055
Token from
Token ID422
Feature activation-4.36e-01

Pos logit contributions
ilation+2.447
ongyang+2.296
orian+2.285
idding+2.284
 Opposition+2.266
Neg logit contributions
asca-2.678
 rug-2.321
 CTR-2.173
-1.923
ワン-1.919
Token homosexual
Token ID11479
Feature activation-4.54e-03

Pos logit contributions
ilation+1.269
ongyang+1.190
idding+1.189
orian+1.188
 Opposition+1.184
Neg logit contributions
asca-1.524
 rug-1.317
 CTR-1.242
-1.106
ワン-1.093
Token to
Token ID284
Feature activation+4.69e-02

Pos logit contributions
ilation+0.013
ongyang+0.012
orian+0.012
idding+0.012
 Opposition+0.012
Neg logit contributions
asca-0.016
 rug-0.014
 CTR-0.014
-0.012
ワン-0.012
Token heterosexual
Token ID24026
Feature activation+3.28e-01

Pos logit contributions
asca+0.148
 rug+0.127
 CTR+0.119
+0.103
ワン+0.102
Neg logit contributions
ilation-0.152
ongyang-0.145
orian-0.144
idding-0.144
 Opposition-0.142
Token (
Token ID357
Feature activation+2.36e-01

Pos logit contributions
asca+1.198
 rug+1.054
 CTR+1.000
+0.883
ワン+0.868
Neg logit contributions
ilation-0.901
ongyang-0.856
idding-0.843
orian-0.842
 Conclusion-0.828
TokenPicture
Token ID28070
Feature activation+1.63e+00

Pos logit contributions
asca+0.985
 rug+0.881
 CTR+0.844
+0.760
ワン+0.748
Neg logit contributions
ilation-0.521
ongyang-0.489
idding-0.479
orian-0.478
 Conclusion-0.469
Token:
Token ID25
Feature activation-5.48e-02

Pos logit contributions
asca+5.143
 rug+4.532
 CTR+4.484
 �+3.981
+3.757
Neg logit contributions
ilation-4.990
ongyang-4.911
idding-4.712
orian-4.632
emis-4.592
Token Getty
Token ID7136
Feature activation+1.57e-01

Pos logit contributions
ilation+0.150
ongyang+0.142
orian+0.140
idding+0.140
 Opposition+0.139
Neg logit contributions
asca-0.201
 rug-0.175
 CTR-0.165
-0.148
ワン-0.146
Token Images
Token ID5382
Feature activation+6.04e-01

Pos logit contributions
asca+0.606
 rug+0.540
 CTR+0.510
+0.456
ワン+0.451
Neg logit contributions
ilation-0.400
ongyang-0.376
orian-0.370
idding-0.369
 Opposition-0.363
Token)
Token ID8
Feature activation+2.20e-01

Pos logit contributions
asca+2.207
 rug+1.942
 CTR+1.841
+1.627
ワン+1.601
Neg logit contributions
ilation-1.653
ongyang-1.569
idding-1.545
orian-1.544
 Conclusion-1.518
Token
Token ID198
Feature activation+2.68e-01

Pos logit contributions
asca+0.835
 rug+0.737
 CTR+0.697
+0.623
ワン+0.613
Neg logit contributions
ilation-0.574
ongyang-0.542
orian-0.533
idding-0.533
 Opposition-0.526
Token."
Token ID526
Feature activation-1.27e-01

Pos logit contributions
ilation+0.133
ongyang+0.125
idding+0.124
orian+0.124
 Opposition+0.122
Neg logit contributions
asca-0.196
 rug-0.173
 CTR-0.165
-0.147
ワン-0.146
Token
Token ID198
Feature activation+2.18e-01

Pos logit contributions
ilation+0.320
ongyang+0.301
orian+0.298
idding+0.298
 Opposition+0.296
Neg logit contributions
asca-0.492
 rug-0.438
 CTR-0.413
-0.372
ワン-0.368
Token
Token ID198
Feature activation+2.18e-01

Pos logit contributions
asca+0.749
 rug+0.653
 CTR+0.628
+0.542
 �+0.539
Neg logit contributions
ilation-0.641
ongyang-0.626
idding-0.602
orian-0.598
 Conclusion-0.586
TokenHow
Token ID2437
Feature activation+6.50e-01

Pos logit contributions
asca+0.770
 rug+0.674
 CTR+0.639
+0.561
ワン+0.552
Neg logit contributions
ilation-0.623
ongyang-0.593
idding-0.584
orian-0.584
 Conclusion-0.575
Token much
Token ID881
Feature activation-7.00e-01

Pos logit contributions
asca+2.191
 rug+1.912
 CTR+1.804
+1.560
ワン+1.521
Neg logit contributions
ilation-1.950
ongyang-1.864
orian-1.840
idding-1.828
 Conclusion-1.806
Token debris
Token ID16468
Feature activation+1.62e+00

Pos logit contributions
ilation+2.082
idding+1.982
ongyang+1.972
orian+1.960
 Opposition+1.957
Neg logit contributions
asca-2.356
 rug-2.019
 CTR-1.915
-1.718
ワン-1.717
Token does
Token ID857
Feature activation+7.26e-01

Pos logit contributions
asca+5.079
 rug+4.478
 CTR+4.182
 �+3.524
+3.501
Neg logit contributions
ilation-5.097
ongyang-4.924
orian-4.843
idding-4.798
 Conclusion-4.728
Token it
Token ID340
Feature activation+8.13e-01

Pos logit contributions
asca+2.432
 rug+2.130
 CTR+2.011
+1.735
 �+1.690
Neg logit contributions
ilation-2.194
ongyang-2.096
orian-2.067
idding-2.058
 Conclusion-2.029
Token take
Token ID1011
Feature activation+3.77e-01

Pos logit contributions
asca+2.525
 rug+2.191
 CTR+2.050
+1.738
 �+1.715
Neg logit contributions
ilation-2.642
ongyang-2.553
orian-2.517
idding-2.488
 Conclusion-2.467
Token to
Token ID284
Feature activation-6.36e-02

Pos logit contributions
asca+1.340
 rug+1.173
 CTR+1.111
+0.979
ワン+0.964
Neg logit contributions
ilation-1.073
ongyang-1.021
idding-1.006
orian-1.006
 Conclusion-0.988
Token kill
Token ID1494
Feature activation+4.67e-01

Pos logit contributions
ilation+0.212
ongyang+0.202
idding+0.201
orian+0.200
 Conclusion+0.197
Neg logit contributions
asca-0.195
 rug-0.166
 CTR-0.156
-0.134
ワン-0.132
Tokencl
Token ID565
Feature activation+4.72e-01

Pos logit contributions
asca+5.536
 rug+4.775
 CTR+4.561
+3.978
ワン+3.890
Neg logit contributions
ilation-4.174
ongyang-4.016
idding-3.956
 Protector-3.924
 Opposition-3.909
Tokenares
Token ID3565
Feature activation+5.30e-01

Pos logit contributions
asca+1.870
 rug+1.638
 CTR+1.571
+1.395
ワン+1.368
Neg logit contributions
ilation-1.141
ongyang-1.091
idding-1.064
 Conclusion-1.056
orian-1.056
Token offic
Token ID1163
Feature activation-5.96e-01

Pos logit contributions
asca+1.717
 rug+1.488
 CTR+1.398
+1.205
ワン+1.179
Neg logit contributions
ilation-1.667
ongyang-1.604
idding-1.578
orian-1.577
 Conclusion-1.547
Tokeniel
Token ID8207
Feature activation+5.81e-01

Pos logit contributions
ilation+1.473
ongyang+1.363
idding+1.358
orian+1.353
 Opposition+1.311
Neg logit contributions
asca-2.340
 rug-2.096
 CTR-1.997
-1.789
ワン-1.781
Tokenlement
Token ID1732
Feature activation+2.53e-01

Pos logit contributions
asca+2.228
 rug+1.963
 CTR+1.864
+1.657
ワン+1.618
Neg logit contributions
ilation-1.488
ongyang-1.426
orian-1.382
idding-1.382
 Conclusion-1.363
Token
Token ID39073
Feature activation+1.61e+00

Pos logit contributions
asca+0.832
 rug+0.721
 CTR+0.679
+0.589
ワン+0.576
Neg logit contributions
ilation-0.784
ongyang-0.752
idding-0.740
orian-0.740
 Conclusion-0.728
Tokenser
Token ID2655
Feature activation+4.39e-01

Pos logit contributions
asca+5.778
 rug+5.072
 CTR+4.829
+4.206
ワン+4.096
Neg logit contributions
ilation-4.309
ongyang-4.151
idding-4.096
orian-4.057
 Protector-4.016
Tokente
Token ID660
Feature activation+6.97e-01

Pos logit contributions
asca+1.681
 rug+1.486
 CTR+1.412
+1.257
ワン+1.239
Neg logit contributions
ilation-1.120
ongyang-1.066
idding-1.045
orian-1.044
 Conclusion-1.029
Tokenur
Token ID333
Feature activation+3.78e-01

Pos logit contributions
asca+2.362
 rug+2.059
 CTR+1.938
+1.684
ワン+1.640
Neg logit contributions
ilation-2.072
ongyang-2.009
orian-1.964
idding-1.964
 Conclusion-1.931
Token du
Token ID7043
Feature activation+3.66e-01

Pos logit contributions
asca+1.248
 rug+1.084
 CTR+1.020
+0.885
ワン+0.866
Neg logit contributions
ilation-1.164
ongyang-1.117
idding-1.099
orian-1.098
 Conclusion-1.080
Token programme
Token ID11383
Feature activation+3.54e-01

Pos logit contributions
asca+1.269
 rug+1.110
 CTR+1.049
+0.915
ワン+0.899
Neg logit contributions
ilation-1.067
ongyang-1.019
idding-1.005
orian-1.002
 Conclusion-0.985
Token plates
Token ID13375
Feature activation+3.47e-01

Pos logit contributions
asca+0.915
 rug+0.778
 CTR+0.743
+0.653
ワン+0.644
Neg logit contributions
ilation-0.837
ongyang-0.793
orian-0.789
idding-0.788
 Opposition-0.780
Token stre
Token ID1937
Feature activation+3.17e-01

Pos logit contributions
asca+1.188
 rug+1.035
 CTR+0.978
+0.856
ワン+0.841
Neg logit contributions
ilation-1.029
ongyang-0.980
idding-0.967
orian-0.966
 Conclusion-0.951
Tokenwn
Token ID675
Feature activation+2.02e-01

Pos logit contributions
asca+1.052
 rug+0.913
 CTR+0.864
+0.751
ワン+0.743
Neg logit contributions
ilation-0.977
ongyang-0.929
idding-0.925
orian-0.916
 Conclusion-0.905
Token on
Token ID319
Feature activation+2.63e-01

Pos logit contributions
asca+0.570
 rug+0.474
 CTR+0.448
+0.379
ワン+0.377
Neg logit contributions
ilation-0.720
ongyang-0.687
idding-0.685
orian-0.682
 Opposition-0.678
Token one
Token ID530
Feature activation-2.16e-01

Pos logit contributions
asca+0.921
 rug+0.791
 CTR+0.756
+0.669
ワン+0.665
Neg logit contributions
ilation-0.765
ongyang-0.723
idding-0.720
orian-0.717
 Opposition-0.711
Token side
Token ID1735
Feature activation+1.59e+00

Pos logit contributions
ilation+0.642
idding+0.599
ongyang+0.599
orian+0.597
 Conclusion+0.593
Neg logit contributions
asca-0.742
 rug-0.620
 CTR-0.597
-0.535
ワン-0.534
Token of
Token ID286
Feature activation+9.14e-01

Pos logit contributions
asca+5.707
 rug+5.160
 CTR+4.816
+4.210
 �+4.184
Neg logit contributions
ilation-4.290
ongyang-4.160
orian-4.031
idding-3.994
 Conclusion-3.963
Token a
Token ID257
Feature activation+6.54e-01

Pos logit contributions
asca+3.133
 rug+2.732
 CTR+2.581
+2.256
ワン+2.215
Neg logit contributions
ilation-2.708
ongyang-2.583
idding-2.546
orian-2.545
 Conclusion-2.506
Token chain
Token ID6333
Feature activation+5.52e-01

Pos logit contributions
asca+2.018
 rug+1.716
 CTR+1.614
+1.389
ワン+1.363
Neg logit contributions
ilation-2.169
ongyang-2.075
idding-2.052
orian-2.049
 Conclusion-2.021
Token-
Token ID12
Feature activation-9.04e-02

Pos logit contributions
asca+1.948
 rug+1.677
 CTR+1.599
+1.404
ワン+1.392
Neg logit contributions
ilation-1.602
ongyang-1.501
idding-1.496
orian-1.492
 Opposition-1.470
Tokenlink
Token ID8726
Feature activation+9.01e-01

Pos logit contributions
ilation+0.313
ongyang+0.298
idding+0.298
orian+0.296
 Opposition+0.291
Neg logit contributions
asca-0.267
 rug-0.224
 CTR-0.210
-0.177
ワン-0.177
Token why
Token ID1521
Feature activation+8.45e-01

Pos logit contributions
asca+0.840
 rug+0.735
 CTR+0.696
+0.610
ワン+0.599
Neg logit contributions
ilation-0.694
ongyang-0.661
idding-0.651
orian-0.651
 Conclusion-0.641
Token the
Token ID262
Feature activation+3.58e-01

Pos logit contributions
asca+2.969
 rug+2.622
 CTR+2.482
+2.159
 �+2.127
Neg logit contributions
ilation-2.415
ongyang-2.302
idding-2.245
orian-2.243
 Conclusion-2.206
Token two
Token ID734
Feature activation+1.62e-01

Pos logit contributions
asca+1.274
 rug+1.124
 CTR+1.063
+0.931
ワン+0.915
Neg logit contributions
ilation-1.008
ongyang-0.962
idding-0.946
orian-0.942
 Conclusion-0.928
Token officers
Token ID3790
Feature activation+1.69e-01

Pos logit contributions
asca+0.546
 rug+0.473
 CTR+0.447
+0.391
ワン+0.384
Neg logit contributions
ilation-0.488
ongyang-0.465
orian-0.459
idding-0.459
 Conclusion-0.452
Token were
Token ID547
Feature activation+2.82e-01

Pos logit contributions
asca+0.566
 rug+0.495
 CTR+0.466
+0.404
ワン+0.396
Neg logit contributions
ilation-0.515
ongyang-0.492
idding-0.484
orian-0.484
 Conclusion-0.476
Token targeted
Token ID7977
Feature activation+1.59e+00

Pos logit contributions
asca+0.936
 rug+0.815
 CTR+0.766
+0.666
ワン+0.651
Neg logit contributions
ilation-0.866
ongyang-0.829
idding-0.815
orian-0.813
 Conclusion-0.802
Token.
Token ID13
Feature activation+2.24e-01

Pos logit contributions
asca+5.683
 rug+5.025
 CTR+4.721
 �+4.192
+4.169
Neg logit contributions
ilation-4.293
ongyang-4.122
idding-3.991
orian-3.978
 Opposition-3.916
Token The
Token ID383
Feature activation+1.33e-01

Pos logit contributions
asca+0.869
 rug+0.770
 CTR+0.734
+0.654
ワン+0.644
Neg logit contributions
ilation-0.565
ongyang-0.534
idding-0.525
orian-0.524
 Conclusion-0.514
Token cops
Token ID14073
Feature activation+8.39e-02

Pos logit contributions
asca+0.470
 rug+0.412
 CTR+0.390
+0.342
ワン+0.336
Neg logit contributions
ilation-0.382
ongyang-0.363
idding-0.358
orian-0.358
 Conclusion-0.352
Token
Token ID851
Feature activation+8.03e-01

Pos logit contributions
asca+0.282
 rug+0.246
 CTR+0.232
+0.202
ワン+0.198
Neg logit contributions
ilation-0.254
ongyang-0.242
idding-0.239
orian-0.239
 Conclusion-0.235
Token identified
Token ID5174
Feature activation+2.53e-01

Pos logit contributions
asca+2.761
 rug+2.423
 CTR+2.298
+1.997
ワン+1.959
Neg logit contributions
ilation-2.343
ongyang-2.253
idding-2.195
orian-2.189
 Conclusion-2.153
Token Ki
Token ID21927
Feature activation+1.00e+00

Pos logit contributions
asca+2.277
 rug+1.999
 CTR+1.895
+1.670
ワン+1.640
Neg logit contributions
ilation-1.757
ongyang-1.671
idding-1.645
orian-1.644
 Conclusion-1.616
Tokenpling
Token ID11347
Feature activation+4.33e-01

Pos logit contributions
asca+2.679
 rug+2.220
 CTR+2.043
+1.693
ワン+1.611
Neg logit contributions
ilation-3.711
ongyang-3.567
orian-3.558
idding-3.550
 Conclusion-3.509
Token subway
Token ID19612
Feature activation+3.46e-01

Pos logit contributions
asca+1.427
 rug+1.237
 CTR+1.164
+1.016
ワン+1.002
Neg logit contributions
ilation-1.341
ongyang-1.280
idding-1.264
orian-1.262
 Conclusion-1.247
Token station
Token ID4429
Feature activation-2.82e-01

Pos logit contributions
asca+1.334
 rug+1.177
 CTR+1.121
+1.005
ワン+0.993
Neg logit contributions
ilation-0.878
ongyang-0.830
idding-0.817
orian-0.815
 Opposition-0.805
Token in
Token ID287
Feature activation+3.14e-01

Pos logit contributions
ilation+0.713
ongyang+0.672
idding+0.666
 Opposition+0.658
orian+0.658
Neg logit contributions
asca-1.081
 rug-0.957
 CTR-0.910
-0.816
ワン-0.809
Token Et
Token ID17906
Feature activation+1.57e+00

Pos logit contributions
asca+1.045
 rug+0.907
 CTR+0.855
+0.746
ワン+0.734
Neg logit contributions
ilation-0.964
ongyang-0.919
idding-0.907
orian-0.904
 Conclusion-0.896
Tokenobic
Token ID20803
Feature activation+1.44e+00

Pos logit contributions
asca+3.605
 rug+2.483
 CTR+2.171
 Birch+1.750
 �+1.664
Neg logit contributions
ilation-6.525
 Protector-6.360
 Conclusion-6.296
orian-6.280
ongyang-6.262
Tokenoke
Token ID2088
Feature activation+4.43e-01

Pos logit contributions
asca+5.141
 rug+4.367
 CTR+4.127
+3.661
 �+3.614
Neg logit contributions
ilation-4.025
ongyang-3.806
 Protector-3.759
orian-3.741
 Conclusion-3.740
Token.
Token ID13
Feature activation-8.03e-02

Pos logit contributions
asca+1.742
 rug+1.549
 CTR+1.476
+1.322
ワン+1.310
Neg logit contributions
ilation-1.086
ongyang-1.025
idding-1.010
orian-1.004
 Opposition-0.992
Token
Token ID198
Feature activation-1.03e-01

Pos logit contributions
ilation+0.201
ongyang+0.187
 Opposition+0.187
orian+0.186
idding+0.186
Neg logit contributions
asca-0.312
 rug-0.278
 CTR-0.262
-0.237
ワン-0.233
Token
Token ID198
Feature activation-7.40e-02

Pos logit contributions
ilation+0.289
ongyang+0.278
idding+0.271
orian+0.270
 Conclusion+0.265
Neg logit contributions
asca-0.368
 rug-0.323
 CTR-0.308
-0.270
 �-0.264
Token
Token ID198
Feature activation-1.31e-02

Pos logit contributions
ilation+0.038
ongyang+0.036
idding+0.035
orian+0.035
 Conclusion+0.035
Neg logit contributions
asca-0.048
 rug-0.042
 CTR-0.040
-0.035
 �-0.034
TokenAs
Token ID1722
Feature activation-6.27e-02

Pos logit contributions
ilation+0.035
ongyang+0.033
orian+0.033
idding+0.033
 Opposition+0.032
Neg logit contributions
asca-0.049
 rug-0.043
 CTR-0.041
-0.036
ワン-0.035
Token the
Token ID262
Feature activation+5.76e-01

Pos logit contributions
ilation+0.180
ongyang+0.170
idding+0.169
orian+0.169
 Opposition+0.167
Neg logit contributions
asca-0.221
 rug-0.193
 CTR-0.182
-0.161
ワン-0.158
Token country
Token ID1499
Feature activation+8.25e-01

Pos logit contributions
asca+2.025
 rug+1.789
 CTR+1.692
+1.469
 �+1.440
Neg logit contributions
ilation-1.641
ongyang-1.578
idding-1.550
orian-1.540
 Conclusion-1.506
Token becomes
Token ID4329
Feature activation+6.04e-01

Pos logit contributions
asca+2.889
 rug+2.537
 CTR+2.374
+2.066
 �+2.029
Neg logit contributions
ilation-2.385
ongyang-2.277
idding-2.241
orian-2.217
 Conclusion-2.203
Token more
Token ID517
Feature activation+1.57e+00

Pos logit contributions
asca+1.961
 rug+1.703
 CTR+1.598
+1.373
ワン+1.347
Neg logit contributions
ilation-1.892
ongyang-1.814
idding-1.783
orian-1.780
 Conclusion-1.763
Token polarized
Token ID46469
Feature activation+8.70e-01

Pos logit contributions
asca+5.088
 rug+4.503
 CTR+4.226
 �+3.552
 Birch+3.505
Neg logit contributions
ilation-4.767
ongyang-4.648
idding-4.527
 Conclusion-4.493
orian-4.475
Token,
Token ID11
Feature activation+5.79e-01

Pos logit contributions
asca+2.869
 rug+2.497
 CTR+2.345
+2.035
ワン+1.987
Neg logit contributions
ilation-2.680
ongyang-2.566
idding-2.523
orian-2.513
 Conclusion-2.485
Token so
Token ID523
Feature activation+4.71e-01

Pos logit contributions
asca+2.016
 rug+1.764
 CTR+1.675
+1.460
ワン+1.431
Neg logit contributions
ilation-1.680
ongyang-1.605
idding-1.576
orian-1.570
 Conclusion-1.548
Token too
Token ID1165
Feature activation-6.74e-01

Pos logit contributions
asca+1.688
 rug+1.482
 CTR+1.404
+1.235
ワン+1.211
Neg logit contributions
ilation-1.319
ongyang-1.256
idding-1.235
orian-1.234
 Conclusion-1.216
Token do
Token ID466
Feature activation+1.15e-02

Pos logit contributions
ilation+1.919
ongyang+1.809
idding+1.802
 Opposition+1.801
orian+1.797
Neg logit contributions
asca-2.353
 rug-2.050
 CTR-1.937
ワン-1.741
-1.730
Token for
Token ID329
Feature activation-1.17e-01

Pos logit contributions
ilation+1.389
ongyang+1.318
orian+1.312
idding+1.303
 Conclusion+1.298
Neg logit contributions
asca-1.553
 rug-1.349
 CTR-1.284
ワン-1.130
-1.116
Token the
Token ID262
Feature activation-1.05e-01

Pos logit contributions
ilation+0.348
ongyang+0.332
idding+0.328
orian+0.327
 Conclusion+0.323
Neg logit contributions
asca-0.397
 rug-0.345
 CTR-0.326
-0.285
ワン-0.280
Token history
Token ID2106
Feature activation-2.88e-01

Pos logit contributions
ilation+0.318
ongyang+0.304
idding+0.299
orian+0.299
 Conclusion+0.294
Neg logit contributions
asca-0.350
 rug-0.305
 CTR-0.287
-0.250
ワン-0.245
Token of
Token ID286
Feature activation-3.88e-01

Pos logit contributions
ilation+0.689
orian+0.646
ongyang+0.646
idding+0.636
 Conclusion+0.628
Neg logit contributions
asca-1.152
 rug-1.017
 CTR-0.974
-0.871
ワン-0.869
Token the
Token ID262
Feature activation-2.07e-01

Pos logit contributions
ilation+1.147
ongyang+1.091
orian+1.076
idding+1.075
 Opposition+1.071
Neg logit contributions
asca-1.333
 rug-1.151
 CTR-1.093
-0.963
ワン-0.953
Token Universe
Token ID11950
Feature activation+1.57e+00

Pos logit contributions
ilation+0.534
ongyang+0.505
orian+0.497
idding+0.497
 Conclusion+0.489
Neg logit contributions
asca-0.790
 rug-0.698
 CTR-0.664
-0.592
ワン-0.583
Token and
Token ID290
Feature activation+5.72e-03

Pos logit contributions
asca+5.717
 rug+5.048
 CTR+4.727
+4.209
 �+4.186
Neg logit contributions
ilation-4.200
ongyang-4.070
idding-3.955
orian-3.895
 Opposition-3.807
Token the
Token ID262
Feature activation-1.31e-01

Pos logit contributions
asca+0.019
 rug+0.016
 CTR+0.015
+0.013
ワン+0.013
Neg logit contributions
ilation-0.018
ongyang-0.017
idding-0.017
orian-0.017
 Conclusion-0.016
Token possibility
Token ID5885
Feature activation+3.20e-02

Pos logit contributions
ilation+0.398
ongyang+0.380
idding+0.375
orian+0.374
 Conclusion+0.369
Neg logit contributions
asca-0.440
 rug-0.383
 CTR-0.361
-0.314
ワン-0.309
Token of
Token ID286
Feature activation-1.66e-01

Pos logit contributions
asca+0.123
 rug+0.108
 CTR+0.103
+0.092
ワン+0.091
Neg logit contributions
ilation-0.082
ongyang-0.078
idding-0.076
orian-0.076
 Conclusion-0.075
Token life
Token ID1204
Feature activation+8.33e-02

Pos logit contributions
ilation+0.519
ongyang+0.495
idding+0.490
orian+0.489
 Conclusion+0.483
Neg logit contributions
asca-0.542
 rug-0.468
 CTR-0.441
-0.383
ワン-0.376
Token just
Token ID655
Feature activation+2.42e-01

Pos logit contributions
asca+0.084
 rug+0.073
 CTR+0.069
+0.060
ワン+0.059
Neg logit contributions
ilation-0.076
ongyang-0.072
idding-0.071
orian-0.071
 Conclusion-0.070
Token how
Token ID703
Feature activation+5.01e-01

Pos logit contributions
asca+0.799
 rug+0.693
 CTR+0.653
+0.568
ワン+0.557
Neg logit contributions
ilation-0.745
ongyang-0.712
idding-0.702
orian-0.702
 Conclusion-0.692
Token many
Token ID867
Feature activation-6.36e-02

Pos logit contributions
asca+1.689
 rug+1.476
 CTR+1.391
+1.207
ワン+1.182
Neg logit contributions
ilation-1.505
ongyang-1.441
orian-1.418
idding-1.417
 Conclusion-1.396
Token pieces
Token ID5207
Feature activation+5.55e-01

Pos logit contributions
ilation+0.173
ongyang+0.164
idding+0.162
orian+0.162
 Opposition+0.160
Neg logit contributions
asca-0.233
 rug-0.203
 CTR-0.193
-0.173
ワン-0.171
Token of
Token ID286
Feature activation+3.77e-01

Pos logit contributions
asca+2.136
 rug+1.890
 CTR+1.798
+1.603
ワン+1.580
Neg logit contributions
ilation-1.415
ongyang-1.340
idding-1.318
orian-1.316
 Conclusion-1.292
Token debris
Token ID16468
Feature activation+1.57e+00

Pos logit contributions
asca+1.164
 rug+0.994
 CTR+0.935
+0.805
ワン+0.793
Neg logit contributions
ilation-1.243
ongyang-1.192
idding-1.177
orian-1.176
 Opposition-1.163
Token it
Token ID340
Feature activation+5.25e-01

Pos logit contributions
asca+5.035
 rug+4.418
 CTR+4.160
+3.499
 �+3.477
Neg logit contributions
ilation-4.828
ongyang-4.622
orian-4.554
idding-4.526
 Conclusion-4.506
Token takes
Token ID2753
Feature activation+9.63e-02

Pos logit contributions
asca+1.565
 rug+1.331
 CTR+1.243
+1.057
ワン+1.029
Neg logit contributions
ilation-1.796
ongyang-1.726
idding-1.704
orian-1.702
 Conclusion-1.681
Token to
Token ID284
Feature activation-1.62e-01

Pos logit contributions
asca+0.341
 rug+0.298
 CTR+0.282
+0.249
ワン+0.246
Neg logit contributions
ilation-0.275
ongyang-0.261
orian-0.258
idding-0.258
 Opposition-0.253
Token kill
Token ID1494
Feature activation+3.06e-01

Pos logit contributions
ilation+0.543
ongyang+0.517
idding+0.516
orian+0.512
 Conclusion+0.506
Neg logit contributions
asca-0.497
 rug-0.419
 CTR-0.394
-0.340
ワン-0.336
Token a
Token ID257
Feature activation+9.13e-02

Pos logit contributions
asca+1.042
 rug+0.907
 CTR+0.856
+0.749
ワン+0.736
Neg logit contributions
ilation-0.914
ongyang-0.872
idding-0.860
orian-0.859
 Conclusion-0.847
Tokenre
Token ID260
Feature activation+1.01e+00

Pos logit contributions
asca+2.538
 rug+2.235
 CTR+2.134
+1.887
ワン+1.847
Neg logit contributions
ilation-1.726
ongyang-1.647
idding-1.623
orian-1.617
 Opposition-1.597
Token e
Token ID304
Feature activation+2.37e-01

Pos logit contributions
asca+3.449
 rug+3.024
 CTR+2.830
+2.467
ワン+2.406
Neg logit contributions
ilation-2.961
ongyang-2.860
idding-2.819
orian-2.783
 Conclusion-2.746
Tokenux
Token ID2821
Feature activation-2.28e-02

Pos logit contributions
asca+0.939
 rug+0.837
 CTR+0.799
+0.716
ワン+0.707
Neg logit contributions
ilation-0.578
ongyang-0.542
orian-0.534
idding-0.534
 Opposition-0.522
Token s
Token ID264
Feature activation+2.04e-01

Pos logit contributions
ilation+0.071
ongyang+0.068
orian+0.067
idding+0.067
 Conclusion+0.066
Neg logit contributions
asca-0.074
 rug-0.064
 CTR-0.061
-0.053
ワン-0.052
Tokenont
Token ID756
Feature activation+2.45e-01

Pos logit contributions
asca+0.891
 rug+0.802
 CTR+0.768
+0.695
ワン+0.687
Neg logit contributions
ilation-0.416
ongyang-0.388
idding-0.379
orian-0.379
 Conclusion-0.370
Token
Token ID39073
Feature activation+1.56e+00

Pos logit contributions
asca+0.789
 rug+0.681
 CTR+0.641
+0.554
ワン+0.544
Neg logit contributions
ilation-0.777
ongyang-0.743
idding-0.734
orian-0.733
 Conclusion-0.723
Tokenpl
Token ID489
Feature activation-8.81e-02

Pos logit contributions
asca+5.543
 rug+4.815
 CTR+4.586
+3.999
ワン+3.918
Neg logit contributions
ilation-4.192
ongyang-4.052
idding-3.967
 Protector-3.950
 Opposition-3.937
Tokenor
Token ID273
Feature activation+2.37e-01

Pos logit contributions
ilation+0.220
ongyang+0.207
idding+0.204
orian+0.204
 Conclusion+0.200
Neg logit contributions
asca-0.343
 rug-0.305
 CTR-0.290
-0.259
ワン-0.255
Tokenés
Token ID20954
Feature activation+5.53e-01

Pos logit contributions
asca+1.021
 rug+0.916
 CTR+0.877
+0.793
ワン+0.782
Neg logit contributions
ilation-0.492
ongyang-0.460
orian-0.450
idding-0.450
 Conclusion-0.440
Token et
Token ID2123
Feature activation+6.61e-01

Pos logit contributions
asca+1.829
 rug+1.590
 CTR+1.493
+1.293
ワン+1.263
Neg logit contributions
ilation-1.700
ongyang-1.638
idding-1.608
orian-1.604
 Conclusion-1.578
Token le
Token ID443
Feature activation+2.37e-01

Pos logit contributions
asca+2.156
 rug+1.870
 CTR+1.757
+1.520
ワン+1.487
Neg logit contributions
ilation-2.063
ongyang-1.980
idding-1.950
orian-1.943
 Conclusion-1.916
Token4
Token ID19
Feature activation-1.71e-01

Pos logit contributions
ilation+0.594
ongyang+0.561
orian+0.547
idding+0.545
 Conclusion+0.542
Neg logit contributions
asca-1.153
 rug-1.028
 CTR-0.988
-0.895
ワン-0.882
Token Release
Token ID13868
Feature activation-1.21e+00

Pos logit contributions
ilation+0.418
ongyang+0.394
idding+0.387
orian+0.387
 Conclusion+0.379
Neg logit contributions
asca-0.673
 rug-0.598
 CTR-0.570
-0.509
ワン-0.502
Token Date
Token ID7536
Feature activation-4.95e-01

Pos logit contributions
ilation+2.595
 Conclusion+2.475
 Opposition+2.428
 Protector+2.395
idding+2.385
Neg logit contributions
asca-5.072
 rug-4.554
 CTR-4.282
ワン-3.970
-3.946
Token Del
Token ID4216
Feature activation+1.19e-01

Pos logit contributions
ilation+1.337
ongyang+1.255
idding+1.251
 Conclusion+1.248
orian+1.246
Neg logit contributions
asca-1.826
 rug-1.605
 CTR-1.514
-1.354
ワン-1.338
Tokenayed
Token ID16548
Feature activation-3.76e-02

Pos logit contributions
asca+0.387
 rug+0.335
 CTR+0.315
+0.272
ワン+0.267
Neg logit contributions
ilation-0.376
ongyang-0.360
idding-0.355
orian-0.354
 Conclusion-0.350
Token Until
Token ID14303
Feature activation-1.91e+00

Pos logit contributions
ilation+0.096
ongyang+0.090
idding+0.089
orian+0.089
 Conclusion+0.088
Neg logit contributions
asca-0.144
 rug-0.128
 CTR-0.122
-0.109
ワン-0.107
Token At
Token ID1629
Feature activation-9.78e-01

Pos logit contributions
 Conclusion+4.106
ilation+4.062
 Opposition+4.023
 Protector+3.938
idding+3.846
Neg logit contributions
asca-7.755
 rug-6.937
 CTR-6.516
ワン-6.140
-6.024
Token Le
Token ID1004
Feature activation-6.90e-01

Pos logit contributions
ilation+2.141
orian+2.002
ongyang+1.990
 Conclusion+1.982
idding+1.979
Neg logit contributions
asca-4.060
 rug-3.652
 CTR-3.510
ワン-3.166
-3.158
Tokenast
Token ID459
Feature activation-1.44e+00

Pos logit contributions
ilation+1.693
idding+1.601
 Conclusion+1.591
orian+1.569
 Opposition+1.556
Neg logit contributions
asca-2.693
 rug-2.392
 CTR-2.325
-2.095
ワン-2.055
Token March
Token ID2805
Feature activation-3.70e-01

Pos logit contributions
ilation+3.198
 Conclusion+3.071
 Opposition+3.051
 Protector+3.003
orian+2.971
Neg logit contributions
asca-5.878
 rug-5.255
 CTR-4.957
ワン-4.540
-4.533
Token Of
Token ID3226
Feature activation-1.00e+00

Pos logit contributions
ilation+0.912
ongyang+0.861
idding+0.847
orian+0.846
 Conclusion+0.830
Neg logit contributions
asca-1.453
 rug-1.290
 CTR-1.230
-1.098
ワン-1.082
Token government
Token ID1230
Feature activation+2.13e-01

Pos logit contributions
ilation+0.024
ongyang+0.023
orian+0.023
idding+0.023
 Opposition+0.022
Neg logit contributions
asca-0.029
 rug-0.025
 CTR-0.024
-0.021
ワン-0.021
Token is
Token ID318
Feature activation+5.04e-01

Pos logit contributions
asca+0.722
 rug+0.628
 CTR+0.593
+0.517
ワン+0.508
Neg logit contributions
ilation-0.641
ongyang-0.611
idding-0.603
orian-0.603
 Conclusion-0.594
Token stepping
Token ID17413
Feature activation+5.94e-01

Pos logit contributions
asca+1.746
 rug+1.529
 CTR+1.443
+1.262
ワン+1.233
Neg logit contributions
ilation-1.475
ongyang-1.410
orian-1.382
idding-1.382
 Conclusion-1.362
Token in
Token ID287
Feature activation+5.50e-01

Pos logit contributions
asca+1.807
 rug+1.538
 CTR+1.446
+1.238
ワン+1.219
Neg logit contributions
ilation-1.995
ongyang-1.902
orian-1.895
idding-1.889
 Opposition-1.872
Token on
Token ID319
Feature activation+1.85e-02

Pos logit contributions
asca+1.940
 rug+1.701
 CTR+1.607
+1.411
ワン+1.383
Neg logit contributions
ilation-1.574
ongyang-1.503
idding-1.476
orian-1.476
 Conclusion-1.453
Token mand
Token ID6855
Feature activation-1.90e+00

Pos logit contributions
asca+0.063
 rug+0.054
 CTR+0.051
+0.045
ワン+0.044
Neg logit contributions
ilation-0.056
ongyang-0.053
idding-0.052
orian-0.052
 Opposition-0.052
Tokenating
Token ID803
Feature activation-1.04e+00

Pos logit contributions
ilation+1.555
idding+1.314
orian+1.292
ongyang+1.195
emic+1.104
Neg logit contributions
asca-10.408
 rug-9.663
 CTR-9.243
 裏�-8.883
ワン-8.871
Token parental
Token ID21694
Feature activation-1.39e-01

Pos logit contributions
ilation+3.076
orian+2.934
ongyang+2.919
idding+2.912
 Opposition+2.898
Neg logit contributions
asca-3.541
 rug-2.995
 CTR-2.828
-2.588
ワン-2.553
Token leave
Token ID2666
Feature activation+7.00e-02

Pos logit contributions
ilation+0.481
ongyang+0.459
idding+0.456
orian+0.455
 Opposition+0.451
Neg logit contributions
asca-0.410
 rug-0.344
 CTR-0.321
-0.276
ワン-0.271
Token.
Token ID13
Feature activation-6.39e-02

Pos logit contributions
asca+0.251
 rug+0.220
 CTR+0.209
+0.184
ワン+0.181
Neg logit contributions
ilation-0.196
ongyang-0.186
orian-0.184
idding-0.184
 Conclusion-0.181
Token OS
Token ID7294
Feature activation+4.18e-01

Pos logit contributions
ilation+0.145
ongyang+0.137
idding+0.134
orian+0.133
 Conclusion+0.129
Neg logit contributions
asca-0.263
 rug-0.235
 CTR-0.226
-0.202
ワン-0.199
Token standards
Token ID5423
Feature activation-1.44e-01

Pos logit contributions
asca+0.744
 rug+0.646
 CTR+0.610
+0.531
ワン+0.525
Neg logit contributions
ilation-0.667
ongyang-0.635
orian-0.627
idding-0.627
 Conclusion-0.619
Token for
Token ID329
Feature activation+4.06e-02

Pos logit contributions
ilation+0.366
ongyang+0.346
orian+0.341
idding+0.341
 Conclusion+0.336
Neg logit contributions
asca-0.554
 rug-0.489
 CTR-0.465
-0.415
ワン-0.412
Token normal
Token ID3487
Feature activation-3.28e-01

Pos logit contributions
asca+0.142
 rug+0.124
 CTR+0.117
+0.103
ワン+0.102
Neg logit contributions
ilation-0.117
ongyang-0.111
idding-0.110
orian-0.110
 Conclusion-0.109
Tokency
Token ID948
Feature activation-2.00e-01

Pos logit contributions
ilation+0.897
ongyang+0.843
idding+0.838
orian+0.836
 Conclusion+0.827
Neg logit contributions
asca-1.198
 rug-1.044
 CTR-0.993
-0.881
ワン-0.874
Token.
Token ID13
Feature activation-2.31e-01

Pos logit contributions
ilation+0.526
ongyang+0.492
orian+0.486
idding+0.486
 Conclusion+0.483
Neg logit contributions
asca-0.759
 rug-0.664
 CTR-0.634
-0.565
ワン-0.561
Token Until
Token ID14303
Feature activation-1.82e+00

Pos logit contributions
ilation+0.521
ongyang+0.486
idding+0.480
orian+0.480
 Conclusion+0.474
Neg logit contributions
asca-0.958
 rug-0.856
 CTR-0.815
-0.735
ワン-0.726
Token I
Token ID314
Feature activation+1.79e-01

Pos logit contributions
ilation+5.061
 Opposition+4.852
 Tah+4.797
 Conclusion+4.784
orian+4.782
Neg logit contributions
asca-6.250
 rug-5.255
 CTR-5.042
ワン-4.603
-4.562
Token began
Token ID2540
Feature activation+6.94e-01

Pos logit contributions
asca+0.597
 rug+0.521
 CTR+0.489
+0.424
ワン+0.416
Neg logit contributions
ilation-0.545
ongyang-0.522
idding-0.513
orian-0.513
 Conclusion-0.504
Token to
Token ID284
Feature activation+3.05e-01

Pos logit contributions
asca+2.359
 rug+2.070
 CTR+1.953
+1.689
ワン+1.656
Neg logit contributions
ilation-2.066
ongyang-1.980
orian-1.935
idding-1.933
 Conclusion-1.901
Token pay
Token ID1414
Feature activation+4.49e-01

Pos logit contributions
asca+0.949
 rug+0.823
 CTR+0.768
+0.654
ワン+0.639
Neg logit contributions
ilation-0.996
ongyang-0.960
orian-0.941
idding-0.941
 Opposition-0.926
Token serious
Token ID2726
Feature activation+5.57e-01

Pos logit contributions
asca+1.508
 rug+1.313
 CTR+1.241
+1.073
ワン+1.053
Neg logit contributions
ilation-1.362
ongyang-1.302
idding-1.280
orian-1.279
 Conclusion-1.261
Token he
Token ID339
Feature activation+1.16e-01

Pos logit contributions
ilation+0.722
ongyang+0.684
idding+0.670
 Opposition+0.669
orian+0.669
Neg logit contributions
asca-1.092
 rug-0.967
 CTR-0.918
-0.818
ワン-0.812
Token said
Token ID531
Feature activation-4.30e-01

Pos logit contributions
asca+0.426
 rug+0.376
 CTR+0.358
+0.316
ワン+0.312
Neg logit contributions
ilation-0.313
ongyang-0.296
orian-0.292
idding-0.292
 Opposition-0.286
Token.
Token ID13
Feature activation+8.72e-02

Pos logit contributions
ilation+1.046
ongyang+0.988
orian+0.978
idding+0.971
 Opposition+0.960
Neg logit contributions
asca-1.688
 rug-1.502
 CTR-1.445
-1.284
ワン-1.281
Token Although
Token ID4900
Feature activation-3.92e-01

Pos logit contributions
asca+0.330
 rug+0.291
 CTR+0.278
+0.246
ワン+0.242
Neg logit contributions
ilation-0.228
ongyang-0.216
idding-0.212
orian-0.212
 Conclusion-0.207
Token most
Token ID749
Feature activation-7.67e-02

Pos logit contributions
ilation+1.147
ongyang+1.091
idding+1.077
orian+1.076
 Conclusion+1.061
Neg logit contributions
asca-1.355
 rug-1.182
 CTR-1.117
-0.981
ワン-0.965
Token suff
Token ID3027
Feature activation-1.82e+00

Pos logit contributions
ilation+0.219
ongyang+0.209
idding+0.205
orian+0.205
 Conclusion+0.202
Neg logit contributions
asca-0.271
 rug-0.238
 CTR-0.225
-0.197
ワン-0.194
Tokenerers
Token ID19288
Feature activation-5.28e-01

Pos logit contributions
ilation+4.786
emic+4.571
orian+4.518
ongyang+4.466
idding+4.455
Neg logit contributions
asca-6.561
 rug-5.956
 CTR-5.598
ワン-5.193
-5.055
Token experience
Token ID1998
Feature activation-5.09e-01

Pos logit contributions
ilation+1.544
ongyang+1.466
orian+1.449
idding+1.448
 Conclusion+1.428
Neg logit contributions
asca-1.827
 rug-1.590
 CTR-1.508
-1.327
ワン-1.306
Token normal
Token ID3487
Feature activation-6.11e-01

Pos logit contributions
ilation+1.538
ongyang+1.460
idding+1.444
orian+1.444
 Conclusion+1.425
Neg logit contributions
asca-1.713
 rug-1.486
 CTR-1.403
-1.229
ワン-1.207
Token flu
Token ID6562
Feature activation-3.79e-01

Pos logit contributions
ilation+1.804
ongyang+1.714
idding+1.695
orian+1.694
 Opposition+1.670
Neg logit contributions
asca-2.101
 rug-1.827
 CTR-1.726
-1.518
ワン-1.492
Token symptoms
Token ID7460
Feature activation-9.17e-01

Pos logit contributions
ilation+1.126
ongyang+1.062
idding+1.055
orian+1.054
 Opposition+1.036
Neg logit contributions
asca-1.303
 rug-1.132
 CTR-1.070
-0.940
ワン-0.928
Token to
Token ID284
Feature activation+1.91e-01

Pos logit contributions
asca+0.730
 rug+0.641
 CTR+0.607
+0.535
ワン+0.526
Neg logit contributions
ilation-0.575
ongyang-0.547
idding-0.539
orian-0.538
 Conclusion-0.530
Token you
Token ID345
Feature activation+3.50e-01

Pos logit contributions
asca+0.653
 rug+0.567
 CTR+0.535
+0.469
ワン+0.461
Neg logit contributions
ilation-0.572
ongyang-0.544
orian-0.537
idding-0.537
 Conclusion-0.530
Token:
Token ID25
Feature activation+3.18e-01

Pos logit contributions
asca+1.333
 rug+1.180
 CTR+1.123
+0.998
ワン+0.982
Neg logit contributions
ilation-0.901
ongyang-0.854
idding-0.840
orian-0.840
 Conclusion-0.823
Token
Token ID198
Feature activation+2.76e-01

Pos logit contributions
asca+1.178
 rug+1.039
 CTR+0.987
+0.872
ワン+0.857
Neg logit contributions
ilation-0.856
ongyang-0.813
idding-0.800
orian-0.799
 Conclusion-0.784
Token
Token ID198
Feature activation+1.81e-01

Pos logit contributions
asca+0.941
 rug+0.818
 CTR+0.793
+0.688
 �+0.680
Neg logit contributions
ilation-0.814
ongyang-0.798
idding-0.767
orian-0.762
emic-0.743
TokenUntil
Token ID18273
Feature activation-1.81e+00

Pos logit contributions
asca+0.650
 rug+0.570
 CTR+0.541
+0.478
ワン+0.469
Neg logit contributions
ilation-0.505
ongyang-0.481
idding-0.473
orian-0.473
 Conclusion-0.464
Token these
Token ID777
Feature activation+1.95e-01

Pos logit contributions
ilation+5.052
 Conclusion+4.826
 Opposition+4.818
orian+4.764
idding+4.688
Neg logit contributions
asca-6.250
 rug-5.313
 CTR-5.000
ワン-4.563
-4.515
Token blatant
Token ID29000
Feature activation+1.69e-01

Pos logit contributions
asca+0.654
 rug+0.570
 CTR+0.538
+0.466
ワン+0.457
Neg logit contributions
ilation-0.594
ongyang-0.567
idding-0.559
orian-0.558
 Conclusion-0.549
Token and
Token ID290
Feature activation+1.43e-01

Pos logit contributions
asca+0.610
 rug+0.534
 CTR+0.506
+0.447
ワン+0.440
Neg logit contributions
ilation-0.474
ongyang-0.450
orian-0.443
idding-0.443
 Conclusion-0.437
Token well
Token ID880
Feature activation-5.75e-02

Pos logit contributions
asca+0.484
 rug+0.419
 CTR+0.396
+0.348
ワン+0.342
Neg logit contributions
ilation-0.428
ongyang-0.407
orian-0.402
idding-0.401
 Conclusion-0.396
Token-
Token ID12
Feature activation+1.67e-01

Pos logit contributions
ilation+0.192
orian+0.182
ongyang+0.181
idding+0.181
 Conclusion+0.180
Neg logit contributions
asca-0.177
 rug-0.148
 CTR-0.140
-0.121
ワン-0.119
Token or
Token ID393
Feature activation-1.22e-01

Pos logit contributions
asca+0.616
 rug+0.540
 CTR+0.509
+0.447
ワン+0.438
Neg logit contributions
ilation-0.495
ongyang-0.472
idding-0.465
orian-0.464
 Conclusion-0.455
Token wash
Token ID13502
Feature activation-3.19e-01

Pos logit contributions
ilation+0.381
ongyang+0.364
orian+0.359
idding+0.359
 Conclusion+0.354
Neg logit contributions
asca-0.397
 rug-0.343
 CTR-0.323
-0.281
ワン-0.277
Token with
Token ID351
Feature activation-8.77e-02

Pos logit contributions
ilation+0.915
ongyang+0.868
idding+0.858
orian+0.858
 Conclusion+0.848
Neg logit contributions
asca-1.114
 rug-0.970
 CTR-0.926
-0.813
ワン-0.809
Token.
Token ID13
Feature activation-3.30e-01

Pos logit contributions
ilation+0.262
ongyang+0.249
idding+0.246
orian+0.246
 Conclusion+0.242
Neg logit contributions
asca-0.298
 rug-0.260
 CTR-0.246
-0.215
ワン-0.211
Token De
Token ID1024
Feature activation+1.72e-01

Pos logit contributions
ilation+0.827
ongyang+0.783
idding+0.768
orian+0.768
 Conclusion+0.752
Neg logit contributions
asca-1.284
 rug-1.139
 CTR-1.086
-0.968
ワン-0.953
Tokenhyd
Token ID15511
Feature activation-1.75e+00

Pos logit contributions
asca+0.642
 rug+0.567
 CTR+0.539
+0.478
ワン+0.470
Neg logit contributions
ilation-0.459
ongyang-0.435
idding-0.428
orian-0.428
 Conclusion-0.420
Tokenration
Token ID1358
Feature activation-7.89e-01

Pos logit contributions
ilation+3.121
idding+2.791
ongyang+2.720
orian+2.697
emic+2.616
Neg logit contributions
asca-7.874
 rug-7.247
 CTR-6.942
ワン-6.387
-6.379
Token,
Token ID11
Feature activation-2.32e-01

Pos logit contributions
ilation+2.354
 Opposition+2.184
idding+2.183
ongyang+2.182
orian+2.177
Neg logit contributions
asca-2.663
 rug-2.293
 CTR-2.178
ワン-1.929
-1.919
Token starvation
Token ID32143
Feature activation-1.56e-01

Pos logit contributions
ilation+0.685
ongyang+0.652
idding+0.643
orian+0.643
 Conclusion+0.633
Neg logit contributions
asca-0.798
 rug-0.695
 CTR-0.657
-0.575
ワン-0.565
Token,
Token ID11
Feature activation-7.39e-02

Pos logit contributions
ilation+0.430
ongyang+0.401
orian+0.400
idding+0.396
 Opposition+0.394
Neg logit contributions
asca-0.575
 rug-0.501
 CTR-0.478
-0.422
ワン-0.418
Token malnutrition
Token ID49290
Feature activation-3.96e-01

Pos logit contributions
ilation+0.207
ongyang+0.196
orian+0.193
idding+0.193
 Conclusion+0.190
Neg logit contributions
asca-0.266
 rug-0.233
 CTR-0.221
-0.195
ワン-0.192
Token to
Token ID284
Feature activation+1.54e-01

Pos logit contributions
asca+0.998
 rug+0.878
 CTR+0.832
+0.734
ワン+0.722
Neg logit contributions
ilation-0.752
ongyang-0.715
orian-0.704
idding-0.704
 Conclusion-0.693
Token flow
Token ID5202
Feature activation+2.51e-01

Pos logit contributions
asca+0.493
 rug+0.425
 CTR+0.400
+0.345
ワン+0.338
Neg logit contributions
ilation-0.493
ongyang-0.473
orian-0.466
idding-0.466
 Conclusion-0.460
Token out
Token ID503
Feature activation+3.18e-01

Pos logit contributions
asca+0.844
 rug+0.734
 CTR+0.695
+0.608
ワン+0.599
Neg logit contributions
ilation-0.759
ongyang-0.722
idding-0.715
orian-0.712
 Conclusion-0.704
Token of
Token ID286
Feature activation+5.12e-01

Pos logit contributions
asca+1.172
 rug+1.034
 CTR+0.984
+0.872
ワン+0.859
Neg logit contributions
ilation-0.860
ongyang-0.813
idding-0.804
orian-0.801
 Conclusion-0.790
Token.
Token ID13
Feature activation+6.66e-02

Pos logit contributions
asca+1.934
 rug+1.709
 CTR+1.625
+1.443
ワン+1.420
Neg logit contributions
ilation-1.335
ongyang-1.265
idding-1.244
orian-1.244
 Conclusion-1.222
Token Until
Token ID14303
Feature activation-1.74e+00

Pos logit contributions
asca+0.275
 rug+0.246
 CTR+0.236
+0.212
ワン+0.208
Neg logit contributions
ilation-0.150
ongyang-0.142
orian-0.138
idding-0.138
 Conclusion-0.134
Token the
Token ID262
Feature activation+9.43e-02

Pos logit contributions
ilation+4.643
 Tah+4.438
orian+4.432
 Opposition+4.429
idding+4.392
Neg logit contributions
asca-6.170
 rug-5.246
 CTR-5.005
ワン-4.565
-4.534
Token glacier
Token ID44539
Feature activation-1.19e-01

Pos logit contributions
asca+0.338
 rug+0.296
 CTR+0.281
+0.248
ワン+0.244
Neg logit contributions
ilation-0.265
ongyang-0.251
idding-0.248
orian-0.248
 Conclusion-0.244
Token retreat
Token ID13703
Feature activation-9.04e-02

Pos logit contributions
ilation+0.364
ongyang+0.345
idding+0.343
orian+0.342
 Conclusion+0.337
Neg logit contributions
asca-0.395
 rug-0.341
 CTR-0.323
-0.283
ワン-0.280
Tokens
Token ID82
Feature activation-6.45e-02

Pos logit contributions
ilation+0.275
ongyang+0.260
orian+0.259
idding+0.258
 Conclusion+0.254
Neg logit contributions
asca-0.303
 rug-0.262
 CTR-0.249
-0.217
ワン-0.214
Token to
Token ID284
Feature activation-2.11e-01

Pos logit contributions
ilation+0.182
ongyang+0.172
idding+0.172
orian+0.171
 Conclusion+0.169
Neg logit contributions
asca-0.229
 rug-0.200
 CTR-0.190
-0.169
ワン-0.166
Token with
Token ID351
Feature activation-2.41e-02

Pos logit contributions
asca+0.356
 rug+0.312
 CTR+0.296
+0.262
ワン+0.258
Neg logit contributions
ilation-0.276
ongyang-0.261
idding-0.259
orian-0.259
 Conclusion-0.255
Token not
Token ID407
Feature activation+4.48e-01

Pos logit contributions
ilation+0.073
ongyang+0.070
idding+0.069
orian+0.069
 Conclusion+0.068
Neg logit contributions
asca-0.081
 rug-0.070
 CTR-0.066
-0.058
ワン-0.057
Token knowing
Token ID6970
Feature activation+5.12e-01

Pos logit contributions
asca+1.581
 rug+1.388
 CTR+1.305
+1.145
ワン+1.123
Neg logit contributions
ilation-1.276
ongyang-1.232
orian-1.197
idding-1.196
 Conclusion-1.178
Token,
Token ID11
Feature activation+1.07e-01

Pos logit contributions
asca+1.684
 rug+1.464
 CTR+1.376
+1.187
ワン+1.159
Neg logit contributions
ilation-1.586
ongyang-1.529
idding-1.500
orian-1.499
 Conclusion-1.475
Token is
Token ID318
Feature activation-1.97e-01

Pos logit contributions
asca+0.356
 rug+0.308
 CTR+0.291
+0.253
ワン+0.249
Neg logit contributions
ilation-0.329
ongyang-0.314
idding-0.310
orian-0.310
 Conclusion-0.305
Token enough
Token ID1576
Feature activation-1.74e+00

Pos logit contributions
ilation+0.587
ongyang+0.556
idding+0.550
orian+0.548
 Conclusion+0.543
Neg logit contributions
asca-0.673
 rug-0.586
 CTR-0.554
-0.484
ワン-0.477
Token to
Token ID284
Feature activation-2.00e-01

Pos logit contributions
ilation+4.254
 Opposition+3.950
idding+3.909
 Protector+3.908
 Conclusion+3.863
Neg logit contributions
asca-6.575
 rug-5.696
 CTR-5.584
-5.028
ワン-4.997
Token get
Token ID651
Feature activation-1.69e-01

Pos logit contributions
ilation+0.665
ongyang+0.637
idding+0.629
orian+0.629
 Conclusion+0.621
Neg logit contributions
asca-0.615
 rug-0.527
 CTR-0.494
-0.423
ワン-0.414
Token someone
Token ID2130
Feature activation-7.64e-01

Pos logit contributions
ilation+0.508
ongyang+0.481
idding+0.478
orian+0.477
 Opposition+0.473
Neg logit contributions
asca-0.570
 rug-0.491
 CTR-0.465
-0.410
ワン-0.405
Token to
Token ID284
Feature activation-1.44e-01

Pos logit contributions
ilation+2.183
idding+2.057
orian+2.049
ongyang+2.048
 Conclusion+2.031
Neg logit contributions
asca-2.664
 rug-2.304
 CTR-2.195
-1.963
ワン-1.937
Token open
Token ID1280
Feature activation+2.17e-01

Pos logit contributions
ilation+0.476
ongyang+0.456
idding+0.450
orian+0.450
 Conclusion+0.444
Neg logit contributions
asca-0.447
 rug-0.383
 CTR-0.359
-0.308
ワン-0.302
Token for
Token ID329
Feature activation+1.74e-01

Pos logit contributions
asca+1.868
 rug+1.636
 CTR+1.540
+1.330
ワン+1.297
Neg logit contributions
ilation-1.713
ongyang-1.648
idding-1.619
orian-1.612
 Conclusion-1.587
Token Dunn
Token ID30833
Feature activation-6.34e-01

Pos logit contributions
asca+0.603
 rug+0.529
 CTR+0.500
+0.435
ワン+0.427
Neg logit contributions
ilation-0.510
ongyang-0.488
idding-0.479
orian-0.479
 Conclusion-0.471
Token to
Token ID284
Feature activation-6.30e-02

Pos logit contributions
ilation+1.687
ongyang+1.577
idding+1.567
orian+1.566
 Conclusion+1.530
Neg logit contributions
asca-2.374
 rug-2.082
 CTR-1.991
-1.774
ワン-1.752
Token return
Token ID1441
Feature activation-5.91e-01

Pos logit contributions
ilation+0.192
ongyang+0.184
idding+0.181
orian+0.180
 Conclusion+0.178
Neg logit contributions
asca-0.210
 rug-0.184
 CTR-0.173
-0.150
ワン-0.147
Token.
Token ID13
Feature activation-8.96e-02

Pos logit contributions
ilation+1.588
orian+1.490
ongyang+1.479
idding+1.477
 Protector+1.465
Neg logit contributions
asca-2.168
 rug-1.887
 CTR-1.809
-1.616
ワン-1.614
Token Until
Token ID14303
Feature activation-1.73e+00

Pos logit contributions
ilation+0.225
ongyang+0.212
idding+0.209
orian+0.209
 Conclusion+0.205
Neg logit contributions
asca-0.348
 rug-0.309
 CTR-0.294
-0.262
ワン-0.258
Token Dunn
Token ID30833
Feature activation-2.62e-01

Pos logit contributions
ilation+4.927
 Opposition+4.695
orian+4.668
idding+4.640
 Conclusion+4.629
Neg logit contributions
asca-5.966
 rug-4.970
 CTR-4.721
ワン-4.356
-4.348
Token visits
Token ID11864
Feature activation+7.91e-02

Pos logit contributions
ilation+0.791
ongyang+0.755
idding+0.744
orian+0.744
 Conclusion+0.733
Neg logit contributions
asca-0.886
 rug-0.771
 CTR-0.727
-0.634
ワン-0.622
Token the
Token ID262
Feature activation-1.33e-01

Pos logit contributions
asca+0.269
 rug+0.234
 CTR+0.221
+0.193
ワン+0.190
Neg logit contributions
ilation-0.237
ongyang-0.226
idding-0.223
orian-0.223
 Conclusion-0.220
Token specialist
Token ID15670
Feature activation+3.50e-01

Pos logit contributions
ilation+0.359
ongyang+0.340
idding+0.335
orian+0.334
 Conclusion+0.330
Neg logit contributions
asca-0.490
 rug-0.430
 CTR-0.409
-0.363
ワン-0.357
Token,
Token ID11
Feature activation+2.24e-02

Pos logit contributions
asca+1.226
 rug+1.076
 CTR+1.016
+0.890
ワン+0.873
Neg logit contributions
ilation-1.007
ongyang-0.961
idding-0.945
orian-0.943
 Conclusion-0.929
Token clubs
Token ID9784
Feature activation-2.23e-01

Pos logit contributions
asca+1.046
 rug+0.912
 CTR+0.859
+0.745
ワン+0.729
Neg logit contributions
ilation-0.981
ongyang-0.938
idding-0.924
orian-0.923
 Conclusion-0.908
Token attempted
Token ID7482
Feature activation+4.59e-01

Pos logit contributions
ilation+0.661
ongyang+0.627
idding+0.620
orian+0.619
 Opposition+0.614
Neg logit contributions
asca-0.765
 rug-0.662
 CTR-0.628
-0.551
ワン-0.539
Token a
Token ID257
Feature activation+3.26e-02

Pos logit contributions
asca+1.631
 rug+1.427
 CTR+1.353
+1.190
ワン+1.169
Neg logit contributions
ilation-1.307
ongyang-1.242
idding-1.226
orian-1.224
 Conclusion-1.205
Token coup
Token ID12092
Feature activation+2.30e-01

Pos logit contributions
asca+0.110
 rug+0.095
 CTR+0.090
+0.079
ワン+0.077
Neg logit contributions
ilation-0.099
ongyang-0.094
idding-0.093
orian-0.093
 Conclusion-0.091
Token.
Token ID13
Feature activation-2.89e-01

Pos logit contributions
asca+0.852
 rug+0.751
 CTR+0.712
+0.631
ワン+0.620
Neg logit contributions
ilation-0.617
ongyang-0.587
orian-0.577
idding-0.576
 Conclusion-0.567
Token Until
Token ID14303
Feature activation-1.72e+00

Pos logit contributions
ilation+0.692
ongyang+0.646
idding+0.644
orian+0.641
 Opposition+0.638
Neg logit contributions
asca-1.157
 rug-1.028
 CTR-0.975
-0.877
ワン-0.863
Token that
Token ID326
Feature activation+2.82e-01

Pos logit contributions
ilation+4.668
 Opposition+4.596
 Protector+4.456
 Conclusion+4.437
orian+4.431
Neg logit contributions
asca-6.093
 rug-5.087
 CTR-4.884
ワン-4.429
-4.413
Token moment
Token ID2589
Feature activation-5.08e-01

Pos logit contributions
asca+0.965
 rug+0.847
 CTR+0.795
+0.696
ワン+0.680
Neg logit contributions
ilation-0.833
ongyang-0.797
idding-0.785
orian-0.784
 Conclusion-0.768
Token,
Token ID11
Feature activation-3.61e-01

Pos logit contributions
ilation+1.290
orian+1.198
 Opposition+1.194
idding+1.189
ongyang+1.186
Neg logit contributions
asca-1.967
 rug-1.713
 CTR-1.643
-1.468
ワン-1.457
Token the
Token ID262
Feature activation-8.77e-02

Pos logit contributions
ilation+1.054
orian+0.991
idding+0.988
ongyang+0.988
 Opposition+0.988
Neg logit contributions
asca-1.254
 rug-1.083
 CTR-1.023
-0.907
ワン-0.898
Token general
Token ID2276
Feature activation+5.84e-01

Pos logit contributions
ilation+0.258
ongyang+0.245
idding+0.242
orian+0.242
 Conclusion+0.239
Neg logit contributions
asca-0.303
 rug-0.264
 CTR-0.250
-0.219
ワン-0.215
TokenWe
Token ID1135
Feature activation-5.69e-01

Pos logit contributions
asca+1.586
 rug+1.397
 CTR+1.329
+1.191
ワン+1.166
Neg logit contributions
ilation-1.053
ongyang-0.999
idding-0.984
orian-0.980
 Conclusion-0.969
Token
Token ID447
Feature activation+1.55e-01

Pos logit contributions
ilation+1.541
ongyang+1.468
idding+1.438
orian+1.435
 Opposition+1.418
Neg logit contributions
asca-2.074
 rug-1.852
 CTR-1.748
-1.545
 裏�-1.521
Token
Token ID247
Feature activation+1.01e-01

Pos logit contributions
asca+0.527
 rug+0.459
 CTR+0.434
+0.379
ワン+0.372
Neg logit contributions
ilation-0.463
ongyang-0.442
idding-0.435
orian-0.435
 Conclusion-0.428
Tokenve
Token ID303
Feature activation-3.51e-01

Pos logit contributions
asca+0.344
 rug+0.300
 CTR+0.283
+0.248
ワン+0.243
Neg logit contributions
ilation-0.299
ongyang-0.285
idding-0.281
orian-0.281
 Conclusion-0.276
TokenHad
Token ID25383
Feature activation+9.87e-02

Pos logit contributions
ilation+0.974
ongyang+0.924
idding+0.907
orian+0.907
 Opposition+0.897
Neg logit contributions
asca-1.262
 rug-1.116
 CTR-1.056
-0.927
ワン-0.912
TokenEnough
Token ID47323
Feature activation-1.71e+00

Pos logit contributions
asca+0.360
 rug+0.317
 CTR+0.301
+0.266
ワン+0.261
Neg logit contributions
ilation-0.270
ongyang-0.256
orian-0.253
idding-0.252
 Opposition-0.248
Token (#
Token ID17426
Feature activation-3.15e-01

Pos logit contributions
ilation+4.611
 Opposition+4.300
ongyang+4.268
orian+4.211
 Protector+4.199
Neg logit contributions
asca-6.087
 rug-5.350
 CTR-5.111
ワン-4.450
 裏�-4.446
Token
Token ID140
Feature activation+3.38e-01

Pos logit contributions
ilation+0.884
ongyang+0.839
orian+0.825
idding+0.823
 Opposition+0.811
Neg logit contributions
asca-1.127
 rug-0.993
 CTR-0.940
-0.825
ワン-0.812
Token
Token ID251
Feature activation-2.15e-01

Pos logit contributions
asca+1.161
 rug+1.013
 CTR+0.957
+0.837
ワン+0.822
Neg logit contributions
ilation-1.000
ongyang-0.953
idding-0.940
orian-0.939
 Conclusion-0.925
Tokenа
Token ID16142
Feature activation+1.04e-01

Pos logit contributions
ilation+0.706
ongyang+0.677
orian+0.667
idding+0.667
 Conclusion+0.656
Neg logit contributions
asca-0.667
 rug-0.573
 CTR-0.536
-0.459
ワン-0.450
Tokenд
Token ID43666
Feature activation+1.34e-01

Pos logit contributions
asca+0.340
 rug+0.294
 CTR+0.277
+0.240
ワン+0.235
Neg logit contributions
ilation-0.327
ongyang-0.312
idding-0.308
orian-0.308
 Conclusion-0.304
Token't
Token ID470
Feature activation-4.69e-01

Pos logit contributions
ilation+2.106
ongyang+1.964
orian+1.956
idding+1.951
 Conclusion+1.926
Neg logit contributions
asca-3.568
 rug-3.181
 CTR-3.016
-2.710
ワン-2.686
Token figure
Token ID3785
Feature activation-1.76e-01

Pos logit contributions
ilation+1.470
ongyang+1.395
orian+1.388
idding+1.386
 Conclusion+1.371
Neg logit contributions
asca-1.527
 rug-1.311
 CTR-1.238
-1.076
ワン-1.060
Token it
Token ID340
Feature activation-1.46e-01

Pos logit contributions
ilation+0.541
ongyang+0.515
orian+0.512
idding+0.511
 Conclusion+0.504
Neg logit contributions
asca-0.583
 rug-0.501
 CTR-0.475
-0.413
ワン-0.409
Token out
Token ID503
Feature activation+3.73e-01

Pos logit contributions
ilation+0.493
ongyang+0.471
idding+0.467
orian+0.466
 Conclusion+0.463
Neg logit contributions
asca-0.436
 rug-0.369
 CTR-0.351
-0.296
ワン-0.293
Token fast
Token ID3049
Feature activation+1.64e-01

Pos logit contributions
asca+1.328
 rug+1.164
 CTR+1.102
+0.970
ワン+0.953
Neg logit contributions
ilation-1.059
ongyang-1.008
idding-0.993
orian-0.992
 Conclusion-0.976
Token enough
Token ID1576
Feature activation-1.70e+00

Pos logit contributions
asca+0.560
 rug+0.487
 CTR+0.463
+0.403
ワン+0.397
Neg logit contributions
ilation-0.489
ongyang-0.463
idding-0.461
orian-0.460
 Conclusion-0.455
Token
Token ID447
Feature activation+1.78e-01

Pos logit contributions
ilation+4.226
idding+3.987
orian+3.957
 Conclusion+3.954
ongyang+3.886
Neg logit contributions
asca-6.285
 rug-5.546
 CTR-5.324
ワン-4.771
-4.766
Token
Token ID251
Feature activation+1.51e-01

Pos logit contributions
asca+0.527
 rug+0.451
 CTR+0.422
+0.358
ワン+0.346
Neg logit contributions
ilation-0.606
ongyang-0.582
idding-0.576
orian-0.574
 Protector-0.565
Token;
Token ID26
Feature activation-2.82e-01

Pos logit contributions
asca+0.528
 rug+0.462
 CTR+0.437
+0.384
ワン+0.377
Neg logit contributions
ilation-0.434
ongyang-0.413
idding-0.407
orian-0.407
 Conclusion-0.401
Token and
Token ID290
Feature activation-2.60e-01

Pos logit contributions
ilation+0.780
orian+0.737
ongyang+0.737
 Conclusion+0.730
idding+0.724
Neg logit contributions
asca-1.018
 rug-0.887
 CTR-0.847
-0.749
ワン-0.743
Token he
Token ID339
Feature activation+1.13e-01

Pos logit contributions
ilation+0.710
orian+0.673
ongyang+0.671
 Conclusion+0.665
idding+0.664
Neg logit contributions
asca-0.942
 rug-0.825
 CTR-0.785
-0.698
ワン-0.689
Token that
Token ID326
Feature activation-3.13e-01

Pos logit contributions
ilation+0.180
ongyang+0.169
idding+0.168
orian+0.167
 Conclusion+0.165
Neg logit contributions
asca-0.258
 rug-0.227
 CTR-0.216
-0.193
ワン-0.190
Token was
Token ID373
Feature activation-4.72e-01

Pos logit contributions
ilation+0.886
ongyang+0.834
idding+0.832
orian+0.828
 Conclusion+0.821
Neg logit contributions
asca-1.112
 rug-0.970
 CTR-0.920
-0.818
ワン-0.811
Token the
Token ID262
Feature activation-2.58e-01

Pos logit contributions
ilation+1.370
ongyang+1.295
idding+1.291
orian+1.279
 Conclusion+1.275
Neg logit contributions
asca-1.631
 rug-1.414
 CTR-1.344
-1.189
ワン-1.182
Token sc
Token ID629
Feature activation-1.91e-01

Pos logit contributions
ilation+0.796
ongyang+0.755
idding+0.751
orian+0.748
 Conclusion+0.744
Neg logit contributions
asca-0.853
 rug-0.730
 CTR-0.691
-0.605
ワン-0.596
Tokenari
Token ID2743
Feature activation-5.53e-01

Pos logit contributions
ilation+0.443
ongyang+0.415
idding+0.409
orian+0.409
 Conclusion+0.400
Neg logit contributions
asca-0.779
 rug-0.696
 CTR-0.664
-0.596
ワン-0.589
Tokenest
Token ID395
Feature activation-1.69e+00

Pos logit contributions
ilation+0.884
ongyang+0.809
orian+0.785
idding+0.785
 Conclusion+0.760
Neg logit contributions
asca-2.649
 rug-2.406
 CTR-2.315
-2.118
ワン-2.093
Token teen
Token ID6036
Feature activation-6.66e-01

Pos logit contributions
ilation+4.220
 Conclusion+3.978
idding+3.967
 Opposition+3.964
 Protector+3.922
Neg logit contributions
asca-6.369
 rug-5.420
 CTR-5.250
-4.804
ワン-4.800
Token girl
Token ID2576
Feature activation-7.84e-01

Pos logit contributions
ilation+1.956
ongyang+1.840
idding+1.838
orian+1.837
 Protector+1.810
Neg logit contributions
asca-2.289
 rug-1.969
 CTR-1.889
-1.658
ワン-1.647
Token that
Token ID326
Feature activation-9.10e-02

Pos logit contributions
ilation+2.117
ongyang+1.989
idding+1.986
orian+1.976
 Protector+1.961
Neg logit contributions
asca-2.866
 rug-2.475
 CTR-2.387
-2.137
ワン-2.124
Token I
Token ID314
Feature activation-2.08e-01

Pos logit contributions
ilation+0.226
ongyang+0.213
idding+0.210
orian+0.209
 Conclusion+0.206
Neg logit contributions
asca-0.355
 rug-0.315
 CTR-0.300
-0.269
ワン-0.265
Token could
Token ID714
Feature activation-1.01e-01

Pos logit contributions
ilation+0.572
ongyang+0.543
orian+0.535
idding+0.535
 Conclusion+0.526
Neg logit contributions
asca-0.756
 rug-0.663
 CTR-0.631
-0.559
ワン-0.550
Token explain
Token ID4727
Feature activation-3.00e-01

Pos logit contributions
ilation+0.334
ongyang+0.318
idding+0.314
orian+0.314
 Conclusion+0.309
Neg logit contributions
asca-0.383
 rug-0.333
 CTR-0.315
-0.276
ワン-0.271
Token why
Token ID1521
Feature activation+4.79e-01

Pos logit contributions
ilation+0.799
ongyang+0.754
orian+0.748
idding+0.744
 Opposition+0.743
Neg logit contributions
asca-1.118
 rug-0.982
 CTR-0.931
-0.834
ワン-0.823
Token the
Token ID262
Feature activation+2.07e-02

Pos logit contributions
asca+1.725
 rug+1.521
 CTR+1.445
+1.259
 �+1.237
Neg logit contributions
ilation-1.334
ongyang-1.272
idding-1.248
orian-1.242
 Conclusion-1.220
Token f
Token ID277
Feature activation-5.07e-01

Pos logit contributions
asca+0.079
 rug+0.070
 CTR+0.067
+0.059
ワン+0.058
Neg logit contributions
ilation-0.053
ongyang-0.050
idding-0.050
orian-0.049
 Conclusion-0.049
Tokenier
Token ID959
Feature activation-7.45e-01

Pos logit contributions
ilation+1.254
ongyang+1.180
idding+1.169
orian+1.169
 Opposition+1.131
Neg logit contributions
asca-1.980
 rug-1.774
 CTR-1.690
-1.513
ワン-1.507
Tokencest
Token ID9165
Feature activation-1.68e+00

Pos logit contributions
ilation+1.791
ongyang+1.673
orian+1.654
idding+1.654
 Conclusion+1.625
Neg logit contributions
asca-2.965
 rug-2.644
 CTR-2.526
-2.274
ワン-2.236
Token crackdown
Token ID22671
Feature activation-8.35e-01

Pos logit contributions
ilation+4.008
 Opposition+4.000
orian+3.808
ongyang+3.791
 Conclusion+3.730
Neg logit contributions
asca-6.536
 rug-5.615
 CTR-5.364
-4.973
ワン-4.972
Tokens
Token ID82
Feature activation-5.56e-01

Pos logit contributions
ilation+2.421
ongyang+2.323
orian+2.294
idding+2.283
 Opposition+2.281
Neg logit contributions
asca-2.891
 rug-2.514
 CTR-2.377
-2.089
ワン-2.086
Token against
Token ID1028
Feature activation-1.83e-01

Pos logit contributions
ilation+1.624
ongyang+1.559
orian+1.535
idding+1.530
 Opposition+1.516
Neg logit contributions
asca-1.929
 rug-1.676
 CTR-1.583
-1.393
ワン-1.379
Token the
Token ID262
Feature activation-1.93e-01

Pos logit contributions
ilation+0.483
ongyang+0.458
idding+0.451
orian+0.450
 Conclusion+0.443
Neg logit contributions
asca-0.685
 rug-0.604
 CTR-0.574
-0.509
ワン-0.501
Token protest
Token ID5402
Feature activation-3.41e-01

Pos logit contributions
ilation+0.492
ongyang+0.466
idding+0.458
orian+0.458
 Conclusion+0.450
Neg logit contributions
asca-0.742
 rug-0.658
 CTR-0.626
-0.557
ワン-0.548
Token,
Token ID11
Feature activation-2.33e-01

Pos logit contributions
ilation+1.976
idding+1.803
ongyang+1.800
orian+1.789
 Opposition+1.781
Neg logit contributions
asca-3.212
 rug-2.856
 CTR-2.756
-2.467
ワン-2.456
Token it
Token ID340
Feature activation-2.38e-01

Pos logit contributions
ilation+0.678
ongyang+0.638
idding+0.634
orian+0.631
 Conclusion+0.628
Neg logit contributions
asca-0.807
 rug-0.705
 CTR-0.668
-0.590
ワン-0.586
Token
Token ID447
Feature activation-1.30e-01

Pos logit contributions
ilation+0.723
ongyang+0.684
orian+0.679
idding+0.677
 Conclusion+0.667
Neg logit contributions
asca-0.792
 rug-0.689
 CTR-0.655
-0.568
ワン-0.564
Token
Token ID247
Feature activation-4.01e-01

Pos logit contributions
ilation+0.353
ongyang+0.336
idding+0.331
orian+0.330
 Conclusion+0.324
Neg logit contributions
asca-0.475
 rug-0.418
 CTR-0.397
-0.351
ワン-0.344
Tokens
Token ID82
Feature activation-4.66e-01

Pos logit contributions
ilation+1.211
ongyang+1.140
idding+1.135
orian+1.133
 Conclusion+1.120
Neg logit contributions
asca-1.347
 rug-1.182
 CTR-1.111
-0.969
ワン-0.969
Token enough
Token ID1576
Feature activation-1.67e+00

Pos logit contributions
ilation+1.361
ongyang+1.291
idding+1.280
orian+1.276
 Conclusion+1.256
Neg logit contributions
asca-1.614
 rug-1.405
 CTR-1.333
-1.170
ワン-1.159
Token to
Token ID284
Feature activation-3.67e-01

Pos logit contributions
ilation+4.318
idding+4.038
ongyang+3.921
 Opposition+3.916
 Tah+3.905
Neg logit contributions
asca-6.072
 rug-5.306
 CTR-5.104
-4.685
ワン-4.644
Token shake
Token ID13279
Feature activation-1.15e-01

Pos logit contributions
ilation+1.185
ongyang+1.126
idding+1.122
orian+1.117
 Conclusion+1.102
Neg logit contributions
asca-1.163
 rug-0.991
 CTR-0.939
-0.812
ワン-0.800
Token the
Token ID262
Feature activation-3.91e-01

Pos logit contributions
ilation+0.334
ongyang+0.318
idding+0.314
orian+0.313
 Conclusion+0.310
Neg logit contributions
asca-0.396
 rug-0.344
 CTR-0.329
-0.290
ワン-0.285
Token Earth
Token ID3668
Feature activation+2.86e-01

Pos logit contributions
ilation+1.115
ongyang+1.055
idding+1.044
orian+1.042
 Opposition+1.034
Neg logit contributions
asca-1.374
 rug-1.190
 CTR-1.140
-1.011
ワン-0.996
Token.
Token ID13
Feature activation-2.72e-01

Pos logit contributions
asca+1.022
 rug+0.896
 CTR+0.849
+0.748
ワン+0.735
Neg logit contributions
ilation-0.807
ongyang-0.767
idding-0.756
orian-0.755
 Conclusion-0.743
Token many
Token ID867
Feature activation-4.10e-03

Pos logit contributions
ilation+1.103
ongyang+1.043
orian+1.040
 Opposition+1.037
idding+1.033
Neg logit contributions
asca-1.291
 rug-1.123
 CTR-1.073
-0.944
ワン-0.936
Token Kurds
Token ID23880
Feature activation+5.74e-01

Pos logit contributions
ilation+0.012
ongyang+0.011
idding+0.011
orian+0.011
 Conclusion+0.011
Neg logit contributions
asca-0.014
 rug-0.013
 CTR-0.012
-0.011
ワン-0.010
Token,
Token ID11
Feature activation-1.90e-01

Pos logit contributions
asca+2.031
 rug+1.786
 CTR+1.687
+1.478
ワン+1.449
Neg logit contributions
ilation-1.633
ongyang-1.559
idding-1.533
orian-1.527
 Conclusion-1.507
Token who
Token ID508
Feature activation-3.49e-01

Pos logit contributions
ilation+0.524
ongyang+0.496
orian+0.490
idding+0.490
 Opposition+0.483
Neg logit contributions
asca-0.690
 rug-0.604
 CTR-0.574
-0.508
ワン-0.501
Token are
Token ID389
Feature activation-4.45e-01

Pos logit contributions
ilation+1.064
ongyang+1.011
orian+1.002
idding+0.999
 Opposition+0.989
Neg logit contributions
asca-1.163
 rug-1.001
 CTR-0.952
-0.832
ワン-0.822
Token demanding
Token ID11334
Feature activation-1.66e+00

Pos logit contributions
ilation+1.349
ongyang+1.280
orian+1.276
idding+1.268
 Opposition+1.263
Neg logit contributions
asca-1.482
 rug-1.279
 CTR-1.211
-1.064
ワン-1.054
Token a
Token ID257
Feature activation-3.70e-01

Pos logit contributions
ilation+4.572
orian+4.336
 Tah+4.330
idding+4.328
 Opposition+4.308
Neg logit contributions
asca-5.763
 rug-4.919
 CTR-4.761
ワン-4.370
-4.328
Token bigger
Token ID5749
Feature activation-1.95e-01

Pos logit contributions
ilation+1.018
ongyang+0.960
idding+0.948
orian+0.948
 Conclusion+0.940
Neg logit contributions
asca-1.346
 rug-1.166
 CTR-1.118
-0.994
ワン-0.982
Token say
Token ID910
Feature activation-3.05e-02

Pos logit contributions
ilation+0.652
ongyang+0.624
idding+0.617
orian+0.617
 Conclusion+0.609
Neg logit contributions
asca-0.595
 rug-0.507
 CTR-0.477
-0.408
ワン-0.400
Token in
Token ID287
Feature activation+1.29e-01

Pos logit contributions
ilation+0.096
ongyang+0.092
orian+0.091
idding+0.091
 Opposition+0.090
Neg logit contributions
asca-0.099
 rug-0.085
 CTR-0.081
-0.069
ワン-0.069
Token the
Token ID262
Feature activation-1.02e-01

Pos logit contributions
asca+0.464
 rug+0.409
 CTR+0.387
+0.340
ワン+0.332
Neg logit contributions
ilation-0.362
ongyang-0.345
idding-0.339
orian-0.339
 Conclusion-0.333
Token again
Token ID757
Feature activation+8.08e-02

Pos logit contributions
asca+3.565
 rug+3.101
 CTR+2.899
+2.427
 �+2.421
Neg logit contributions
ilation-3.950
ongyang-3.830
idding-3.757
orian-3.748
 Protector-3.665
Token:
Token ID25
Feature activation+6.54e-01

Pos logit contributions
asca+0.291
 rug+0.254
 CTR+0.241
+0.213
ワン+0.211
Neg logit contributions
ilation-0.226
ongyang-0.214
orian-0.212
idding-0.211
 Conclusion-0.209
Token the
Token ID262
Feature activation+2.81e-01

Pos logit contributions
asca+2.537
 rug+2.270
 CTR+2.167
+1.911
 �+1.889
Neg logit contributions
ilation-1.623
ongyang-1.544
idding-1.508
orian-1.504
 Conclusion-1.457
Token "
Token ID366
Feature activation+1.66e-01

Pos logit contributions
asca+0.999
 rug+0.881
 CTR+0.834
+0.732
ワン+0.718
Neg logit contributions
ilation-0.791
ongyang-0.755
idding-0.743
orian-0.740
 Conclusion-0.727
TokenHow
Token ID2437
Feature activation+4.71e-01

Pos logit contributions
asca+0.610
 rug+0.537
 CTR+0.510
+0.451
ワン+0.443
Neg logit contributions
ilation-0.450
ongyang-0.428
idding-0.421
orian-0.420
 Conclusion-0.414
Token To
Token ID1675
Feature activation-1.65e+00

Pos logit contributions
asca+1.708
 rug+1.508
 CTR+1.425
+1.251
 �+1.226
Neg logit contributions
ilation-1.295
ongyang-1.236
orian-1.216
idding-1.210
 Conclusion-1.186
Token Stop
Token ID13707
Feature activation-2.53e-01

Pos logit contributions
ilation+4.211
 Conclusion+3.990
idding+3.949
 Protector+3.944
 Opposition+3.943
Neg logit contributions
asca-6.234
 rug-5.407
 CTR-5.126
ワン-4.684
-4.612
Token Rape
Token ID35286
Feature activation-1.02e+00

Pos logit contributions
ilation+0.533
ongyang+0.496
idding+0.490
orian+0.487
 Conclusion+0.480
Neg logit contributions
asca-1.083
 rug-0.971
 CTR-0.927
-0.841
ワン-0.833
Token"
Token ID1
Feature activation+1.14e-01

Pos logit contributions
ilation+2.676
 Conclusion+2.522
ongyang+2.494
 Opposition+2.491
 Protector+2.488
Neg logit contributions
asca-3.820
 rug-3.333
 CTR-3.141
ワン-2.862
-2.856
Token article
Token ID2708
Feature activation+8.19e-01

Pos logit contributions
asca+0.400
 rug+0.349
 CTR+0.330
+0.291
ワン+0.286
Neg logit contributions
ilation-0.327
ongyang-0.311
idding-0.307
orian-0.307
 Conclusion-0.303
Token was
Token ID373
Feature activation+2.10e-01

Pos logit contributions
asca+2.702
 rug+2.372
 CTR+2.237
+1.922
 �+1.889
Neg logit contributions
ilation-2.503
ongyang-2.408
idding-2.364
orian-2.353
 Opposition-2.301
Token behave
Token ID17438
Feature activation+3.61e-01

Pos logit contributions
ilation+0.366
ongyang+0.350
idding+0.346
orian+0.346
 Conclusion+0.341
Neg logit contributions
asca-0.370
 rug-0.319
 CTR-0.300
-0.259
ワン-0.254
Token properly
Token ID6105
Feature activation+3.52e-01

Pos logit contributions
asca+1.204
 rug+1.046
 CTR+0.987
+0.856
ワン+0.836
Neg logit contributions
ilation-1.104
ongyang-1.055
idding-1.040
orian-1.037
 Conclusion-1.022
Token.
Token ID13
Feature activation+5.41e-02

Pos logit contributions
asca+1.259
 rug+1.105
 CTR+1.047
+0.921
ワン+0.904
Neg logit contributions
ilation-0.989
ongyang-0.941
idding-0.926
orian-0.925
 Conclusion-0.910
Token<|endoftext|>
Token ID50256
Feature activation+2.95e-01

Pos logit contributions
asca+0.213
 rug+0.190
 CTR+0.183
+0.162
ワン+0.159
Neg logit contributions
ilation-0.132
ongyang-0.125
orian-0.122
idding-0.122
 Opposition-0.118
Tokenby
Token ID1525
Feature activation-6.42e-01

Pos logit contributions
asca+1.069
 rug+0.940
 CTR+0.891
+0.786
ワン+0.773
Neg logit contributions
ilation-0.815
ongyang-0.774
idding-0.762
orian-0.762
 Conclusion-0.749
Token Erik
Token ID22722
Feature activation-1.65e+00

Pos logit contributions
ilation+1.762
ongyang+1.673
orian+1.670
 Conclusion+1.661
 Opposition+1.656
Neg logit contributions
asca-2.311
 rug-2.012
 CTR-1.881
-1.703
ワン-1.702
Token Alt
Token ID12344
Feature activation-6.98e-01

Pos logit contributions
ilation+4.349
orian+4.109
idding+4.061
 Opposition+4.049
ongyang+4.013
Neg logit contributions
asca-5.987
 rug-5.286
 CTR-5.050
ワン-4.536
-4.516
Tokenieri
Token ID29864
Feature activation-9.19e-01

Pos logit contributions
ilation+2.230
orian+2.123
idding+2.117
ongyang+2.114
 Opposition+2.085
Neg logit contributions
asca-2.202
 rug-1.915
 CTR-1.814
-1.559
ワン-1.544
Token,
Token ID11
Feature activation-7.72e-01

Pos logit contributions
ilation+2.286
orian+2.173
ongyang+2.162
 Opposition+2.159
idding+2.147
Neg logit contributions
asca-3.516
 rug-3.091
 CTR-2.919
ワン-2.646
-2.643
Token NOR
Token ID25273
Feature activation-6.49e-01

Pos logit contributions
ilation+1.955
orian+1.864
 Opposition+1.858
ongyang+1.841
 Conclusion+1.833
Neg logit contributions
asca-2.925
 rug-2.575
 CTR-2.406
ワン-2.205
-2.194
TokenML
Token ID5805
Feature activation+6.60e-01

Pos logit contributions
ilation+2.079
ongyang+1.989
orian+1.988
idding+1.980
 Conclusion+1.965
Neg logit contributions
asca-2.035
 rug-1.763
 CTR-1.629
-1.405
ワン-1.404
Token that
Token ID326
Feature activation-2.24e-01

Pos logit contributions
ilation+0.996
ongyang+0.941
orian+0.934
idding+0.934
 Conclusion+0.925
Neg logit contributions
asca-1.125
 rug-0.975
 CTR-0.919
-0.815
ワン-0.807
Token the
Token ID262
Feature activation+1.96e-01

Pos logit contributions
ilation+0.670
ongyang+0.633
orian+0.627
idding+0.626
 Conclusion+0.623
Neg logit contributions
asca-0.753
 rug-0.654
 CTR-0.616
-0.548
ワン-0.546
Token clinical
Token ID8668
Feature activation-4.54e-01

Pos logit contributions
asca+0.645
 rug+0.559
 CTR+0.527
+0.457
ワン+0.448
Neg logit contributions
ilation-0.608
ongyang-0.582
idding-0.574
orian-0.573
 Conclusion-0.565
Token improvements
Token ID8561
Feature activation-9.84e-02

Pos logit contributions
ilation+1.325
ongyang+1.250
orian+1.241
idding+1.239
 Conclusion+1.234
Neg logit contributions
asca-1.572
 rug-1.359
 CTR-1.284
-1.145
ワン-1.138
Token continued
Token ID3767
Feature activation+3.48e-01

Pos logit contributions
ilation+0.301
ongyang+0.286
idding+0.283
orian+0.283
 Conclusion+0.280
Neg logit contributions
asca-0.328
 rug-0.284
 CTR-0.267
-0.236
ワン-0.233
Token until
Token ID1566
Feature activation-1.65e+00

Pos logit contributions
asca+1.072
 rug+0.919
 CTR+0.861
+0.738
ワン+0.722
Neg logit contributions
ilation-1.155
ongyang-1.107
idding-1.093
orian-1.092
 Conclusion-1.077
Token week
Token ID1285
Feature activation+1.70e-01

Pos logit contributions
ilation+4.499
 Conclusion+4.213
 Opposition+4.178
 Tah+4.168
emic+4.128
Neg logit contributions
asca-5.786
 rug-4.927
 CTR-4.743
ワン-4.501
-4.383
Token 12
Token ID1105
Feature activation-4.38e-01

Pos logit contributions
asca+0.566
 rug+0.492
 CTR+0.464
+0.402
ワン+0.394
Neg logit contributions
ilation-0.522
ongyang-0.499
idding-0.492
orian-0.491
 Conclusion-0.484
Token during
Token ID1141
Feature activation-5.85e-01

Pos logit contributions
ilation+1.183
ongyang+1.110
idding+1.106
 Conclusion+1.105
orian+1.099
Neg logit contributions
asca-1.602
 rug-1.412
 CTR-1.338
-1.199
ワン-1.185
Token ta
Token ID20486
Feature activation-2.48e-01

Pos logit contributions
ilation+1.686
ongyang+1.576
 Conclusion+1.559
idding+1.556
orian+1.550
Neg logit contributions
asca-2.035
 rug-1.767
 CTR-1.668
-1.510
ワン-1.501
TokenV
Token ID53
Feature activation-3.64e-02

Pos logit contributions
ilation+0.606
ongyang+0.580
idding+0.567
orian+0.560
 Conclusion+0.552
Neg logit contributions
asca-0.977
 rug-0.862
 CTR-0.824
-0.737
ワン-0.721
Token-
Token ID12
Feature activation-8.26e-02

Pos logit contributions
asca+0.052
 rug+0.045
 CTR+0.042
+0.037
ワン+0.037
Neg logit contributions
ilation-0.049
ongyang-0.047
orian-0.047
idding-0.046
 Opposition-0.046
Tokencent
Token ID1087
Feature activation-5.14e-01

Pos logit contributions
ilation+0.233
ongyang+0.219
orian+0.219
idding+0.217
 Conclusion+0.214
Neg logit contributions
asca-0.295
 rug-0.259
 CTR-0.246
-0.216
ワン-0.214
Token piece
Token ID3704
Feature activation+2.63e-01

Pos logit contributions
ilation+1.339
orian+1.253
ongyang+1.249
idding+1.231
 Conclusion+1.219
Neg logit contributions
asca-1.944
 rug-1.701
 CTR-1.629
-1.456
ワン-1.447
Token\n
Token ID198
Feature activation-1.39e-01

Pos logit contributions
asca+0.953
 rug+0.835
 CTR+0.793
+0.701
ワン+0.690
Neg logit contributions
ilation-0.730
ongyang-0.693
idding-0.684
orian-0.683
 Conclusion-0.674
Token\n
Token ID198
Feature activation-1.12e-01

Pos logit contributions
ilation+0.389
ongyang+0.374
idding+0.364
orian+0.363
 Conclusion+0.355
Neg logit contributions
asca-0.497
 rug-0.436
 CTR-0.416
-0.365
 �-0.357
TokenUntil
Token ID18273
Feature activation-1.64e+00

Pos logit contributions
ilation+0.306
ongyang+0.289
orian+0.286
idding+0.286
 Conclusion+0.280
Neg logit contributions
asca-0.410
 rug-0.360
 CTR-0.343
-0.302
ワン-0.297
Token 18
Token ID1248
Feature activation+7.46e-02

Pos logit contributions
ilation+4.453
orian+4.289
 Conclusion+4.257
 Opposition+4.251
idding+4.199
Neg logit contributions
asca-5.816
 rug-4.919
 CTR-4.742
ワン-4.298
-4.245
Token57
Token ID3553
Feature activation+1.04e-01

Pos logit contributions
asca+0.251
 rug+0.218
 CTR+0.206
+0.179
ワン+0.175
Neg logit contributions
ilation-0.226
ongyang-0.217
idding-0.213
orian-0.213
 Conclusion-0.209
Token,
Token ID11
Feature activation-1.95e-01

Pos logit contributions
asca+0.379
 rug+0.331
 CTR+0.315
+0.280
ワン+0.277
Neg logit contributions
ilation-0.285
ongyang-0.269
orian-0.269
idding-0.267
 Opposition-0.264
Token the
Token ID262
Feature activation-1.02e-01

Pos logit contributions
ilation+0.574
orian+0.546
ongyang+0.544
idding+0.540
 Opposition+0.537
Neg logit contributions
asca-0.665
 rug-0.574
 CTR-0.544
-0.482
ワン-0.478
Token cent
Token ID1247
Feature activation+2.54e-02

Pos logit contributions
ilation+0.313
ongyang+0.298
orian+0.297
idding+0.295
 Opposition+0.293
Neg logit contributions
asca-0.337
 rug-0.288
 CTR-0.273
-0.240
ワン-0.236
Token McDonnell
Token ID39441
Feature activation-1.58e-01

Pos logit contributions
ilation+0.582
ongyang+0.552
orian+0.548
idding+0.547
 Opposition+0.543
Neg logit contributions
asca-0.735
 rug-0.638
 CTR-0.605
-0.535
ワン-0.533
Token would
Token ID561
Feature activation+1.99e-01

Pos logit contributions
ilation+0.474
ongyang+0.449
orian+0.447
idding+0.446
 Conclusion+0.439
Neg logit contributions
asca-0.536
 rug-0.464
 CTR-0.442
-0.388
ワン-0.386
Token give
Token ID1577
Feature activation-1.96e-01

Pos logit contributions
asca+0.658
 rug+0.570
 CTR+0.538
+0.467
ワン+0.458
Neg logit contributions
ilation-0.614
ongyang-0.586
idding-0.578
orian-0.578
 Conclusion-0.570
Token me
Token ID502
Feature activation+3.88e-01

Pos logit contributions
ilation+0.531
ongyang+0.499
orian+0.496
idding+0.495
 Opposition+0.491
Neg logit contributions
asca-0.728
 rug-0.630
 CTR-0.602
-0.536
ワン-0.534
Token some
Token ID617
Feature activation+4.39e-01

Pos logit contributions
asca+1.380
 rug+1.207
 CTR+1.145
+1.007
ワン+0.991
Neg logit contributions
ilation-1.103
ongyang-1.048
idding-1.034
orian-1.034
 Conclusion-1.017
Token pause
Token ID14985
Feature activation+1.12e+00

Pos logit contributions
asca+1.553
 rug+1.356
 CTR+1.287
+1.131
ワン+1.112
Neg logit contributions
ilation-1.255
ongyang-1.192
idding-1.176
orian-1.176
 Conclusion-1.158
Token when
Token ID618
Feature activation+2.57e-02

Pos logit contributions
asca+4.053
 rug+3.588
 CTR+3.380
+2.992
 �+2.976
Neg logit contributions
ilation-3.014
ongyang-2.925
orian-2.838
idding-2.825
 Opposition-2.780
Token considering
Token ID6402
Feature activation+7.16e-02

Pos logit contributions
asca+0.097
 rug+0.085
 CTR+0.081
+0.072
ワン+0.071
Neg logit contributions
ilation-0.068
ongyang-0.064
idding-0.063
orian-0.063
 Conclusion-0.062
Token whether
Token ID1771
Feature activation+2.91e-01

Pos logit contributions
asca+0.245
 rug+0.214
 CTR+0.202
+0.177
ワン+0.174
Neg logit contributions
ilation-0.212
ongyang-0.202
idding-0.200
orian-0.199
 Conclusion-0.197
Token a
Token ID257
Feature activation+8.65e-02

Pos logit contributions
asca+1.003
 rug+0.877
 CTR+0.829
+0.724
ワン+0.710
Neg logit contributions
ilation-0.855
ongyang-0.817
idding-0.802
orian-0.802
 Conclusion-0.788
Token ten
Token ID3478
Feature activation+1.36e-01

Pos logit contributions
asca+0.303
 rug+0.264
 CTR+0.250
+0.220
ワン+0.216
Neg logit contributions
ilation-0.250
ongyang-0.238
idding-0.235
orian-0.235
 Conclusion-0.231
Token different
Token ID1180
Feature activation+3.59e-01

Pos logit contributions
ilation+0.555
ongyang+0.528
idding+0.523
orian+0.521
 Conclusion+0.517
Neg logit contributions
asca-0.645
 rug-0.559
 CTR-0.528
-0.465
ワン-0.457
Token things
Token ID1243
Feature activation+1.81e-01

Pos logit contributions
asca+1.173
 rug+1.023
 CTR+0.966
+0.828
 �+0.806
Neg logit contributions
ilation-1.119
ongyang-1.074
orian-1.057
idding-1.053
 Opposition-1.033
Token eliminating
Token ID18591
Feature activation+9.71e-01

Pos logit contributions
asca+0.634
 rug+0.555
 CTR+0.525
+0.461
ワン+0.453
Neg logit contributions
ilation-0.520
ongyang-0.495
idding-0.488
orian-0.487
 Conclusion-0.480
Token one
Token ID530
Feature activation-3.12e-02

Pos logit contributions
asca+3.112
 rug+2.755
 CTR+2.619
 �+2.158
+2.137
Neg logit contributions
ilation-2.979
ongyang-2.943
orian-2.860
idding-2.798
 Opposition-2.761
Token does
Token ID857
Feature activation+7.30e-01

Pos logit contributions
ilation+0.093
ongyang+0.089
idding+0.088
orian+0.088
 Conclusion+0.086
Neg logit contributions
asca-0.106
 rug-0.092
 CTR-0.087
-0.076
ワン-0.075
Token not
Token ID407
Feature activation+1.28e+00

Pos logit contributions
asca+2.630
 rug+2.339
 CTR+2.224
+1.934
 �+1.897
Neg logit contributions
ilation-1.996
ongyang-1.928
orian-1.879
idding-1.864
 Opposition-1.829
Token have
Token ID423
Feature activation+5.25e-01

Pos logit contributions
asca+3.821
 rug+3.410
 CTR+3.173
 �+2.665
 guid+2.586
Neg logit contributions
ilation-4.175
ongyang-4.092
orian-3.971
idding-3.902
 Opposition-3.881
Token to
Token ID284
Feature activation+1.05e-01

Pos logit contributions
asca+1.815
 rug+1.587
 CTR+1.501
+1.307
ワン+1.280
Neg logit contributions
ilation-1.539
ongyang-1.472
orian-1.445
idding-1.444
 Conclusion-1.421
Token eliminate
Token ID11005
Feature activation+1.13e+00

Pos logit contributions
asca+0.337
 rug+0.292
 CTR+0.275
+0.236
ワン+0.231
Neg logit contributions
ilation-0.331
ongyang-0.318
orian-0.313
idding-0.313
 Conclusion-0.308
Token the
Token ID262
Feature activation+1.16e-01

Pos logit contributions
asca+3.648
 rug+3.215
 CTR+3.034
+2.554
 �+2.525
Neg logit contributions
ilation-3.459
ongyang-3.354
orian-3.298
idding-3.258
 Protector-3.187
Token other
Token ID584
Feature activation+2.98e-01

Pos logit contributions
asca+0.396
 rug+0.342
 CTR+0.324
+0.284
ワン+0.279
Neg logit contributions
ilation-0.347
ongyang-0.330
idding-0.326
orian-0.325
 Conclusion-0.322
Token\n
Token ID198
Feature activation-2.30e-01

Pos logit contributions
ilation+0.652
ongyang+0.624
idding+0.610
orian+0.609
 Conclusion+0.597
Neg logit contributions
asca-0.850
 rug-0.746
 CTR-0.711
-0.626
ワン-0.611
TokenAs
Token ID1722
Feature activation-2.33e-01

Pos logit contributions
ilation+0.620
ongyang+0.587
orian+0.580
idding+0.580
 Conclusion+0.571
Neg logit contributions
asca-0.851
 rug-0.750
 CTR-0.712
-0.628
ワン-0.620
Token Economy
Token ID18493
Feature activation-7.50e-02

Pos logit contributions
ilation+0.690
ongyang+0.658
idding+0.650
orian+0.649
 Opposition+0.640
Neg logit contributions
asca-0.801
 rug-0.698
 CTR-0.660
-0.578
ワン-0.569
Token Minister
Token ID4139
Feature activation-3.62e-01

Pos logit contributions
ilation+0.199
ongyang+0.189
orian+0.186
idding+0.185
 Opposition+0.183
Neg logit contributions
asca-0.280
 rug-0.247
 CTR-0.235
-0.209
ワン-0.207
Token Luis
Token ID20894
Feature activation-1.03e-01

Pos logit contributions
ilation+0.926
idding+0.867
ongyang+0.866
orian+0.861
 Opposition+0.855
Neg logit contributions
asca-1.378
 rug-1.221
 CTR-1.171
-1.041
ワン-1.031
Token de
Token ID390
Feature activation+1.10e+00

Pos logit contributions
ilation+0.291
ongyang+0.275
idding+0.273
orian+0.271
 Opposition+0.268
Neg logit contributions
asca-0.364
 rug-0.319
 CTR-0.303
-0.268
ワン-0.264
Token Gu
Token ID1962
Feature activation-1.95e-01

Pos logit contributions
asca+4.791
 rug+4.316
 CTR+4.079
 Birch+3.677
 裏�+3.663
Neg logit contributions
ilation-2.162
ongyang-2.038
idding-1.994
orian-1.965
 Conclusion-1.911
Tokenind
Token ID521
Feature activation-3.20e-01

Pos logit contributions
ilation+0.467
ongyang+0.443
idding+0.434
orian+0.433
 Conclusion+0.428
Neg logit contributions
asca-0.779
 rug-0.687
 CTR-0.657
-0.586
ワン-0.577
Tokenos
Token ID418
Feature activation+7.17e-01

Pos logit contributions
ilation+0.828
ongyang+0.783
orian+0.770
idding+0.770
 Conclusion+0.754
Neg logit contributions
asca-1.219
 rug-1.081
 CTR-1.028
-0.915
ワン-0.901
Token revealed
Token ID4602
Feature activation-6.09e-02

Pos logit contributions
asca+2.544
 rug+2.226
 CTR+2.101
+1.845
ワン+1.808
Neg logit contributions
ilation-2.040
ongyang-1.945
idding-1.915
orian-1.908
 Conclusion-1.881
Token this
Token ID428
Feature activation+2.25e-01

Pos logit contributions
ilation+0.180
ongyang+0.172
orian+0.171
idding+0.170
 Opposition+0.168
Neg logit contributions
asca-0.208
 rug-0.181
 CTR-0.171
-0.150
ワン-0.148
Token BCE
Token ID37078
Feature activation+6.53e-02

Pos logit contributions
asca+0.030
 rug+0.027
 CTR+0.026
+0.023
ワン+0.023
Neg logit contributions
ilation-0.019
ongyang-0.018
idding-0.018
orian-0.018
 Conclusion-0.018
Token the
Token ID262
Feature activation+1.52e-01

Pos logit contributions
asca+0.248
 rug+0.220
 CTR+0.209
+0.186
ワン+0.183
Neg logit contributions
ilation-0.169
ongyang-0.159
idding-0.157
orian-0.157
 Conclusion-0.155
Token Persian
Token ID22035
Feature activation+7.97e-01

Pos logit contributions
asca+0.595
 rug+0.529
 CTR+0.503
+0.448
ワン+0.441
Neg logit contributions
ilation-0.380
ongyang-0.360
idding-0.353
orian-0.352
 Conclusion-0.345
Token king
Token ID5822
Feature activation-2.59e-02

Pos logit contributions
asca+2.960
 rug+2.638
 CTR+2.487
+2.197
ワン+2.145
Neg logit contributions
ilation-2.127
ongyang-2.027
idding-1.981
orian-1.971
 Conclusion-1.932
Token Xer
Token ID44264
Feature activation+7.73e-01

Pos logit contributions
ilation+0.067
ongyang+0.063
idding+0.062
orian+0.062
 Conclusion+0.061
Neg logit contributions
asca-0.099
 rug-0.087
 CTR-0.083
-0.074
ワン-0.073
Tokenxes
Token ID48169
Feature activation+1.40e+00

Pos logit contributions
asca+2.873
 rug+2.535
 CTR+2.397
+2.130
ワン+2.076
Neg logit contributions
ilation-2.069
ongyang-1.960
orian-1.934
idding-1.920
 Conclusion-1.900
Token sent
Token ID1908
Feature activation+7.27e-01

Pos logit contributions
asca+4.914
 rug+4.331
 CTR+4.034
+3.563
 �+3.498
Neg logit contributions
ilation-3.960
ongyang-3.854
orian-3.730
idding-3.712
 Conclusion-3.621
Token his
Token ID465
Feature activation-1.03e-01

Pos logit contributions
asca+2.495
 rug+2.180
 CTR+2.053
+1.794
ワン+1.752
Neg logit contributions
ilation-2.152
ongyang-2.057
orian-2.017
idding-2.017
 Conclusion-1.986
Token massive
Token ID4858
Feature activation+5.26e-01

Pos logit contributions
ilation+0.302
idding+0.285
ongyang+0.285
orian+0.284
 Protector+0.283
Neg logit contributions
asca-0.354
 rug-0.305
 CTR-0.290
-0.256
ワン-0.254
Token army
Token ID5428
Feature activation+2.29e-01

Pos logit contributions
asca+1.945
 rug+1.718
 CTR+1.629
+1.441
ワン+1.414
Neg logit contributions
ilation-1.420
ongyang-1.349
orian-1.324
idding-1.324
 Conclusion-1.299
Token across
Token ID1973
Feature activation+2.44e-01

Pos logit contributions
asca+0.759
 rug+0.658
 CTR+0.620
+0.539
ワン+0.529
Neg logit contributions
ilation-0.705
ongyang-0.673
idding-0.665
orian-0.664
 Conclusion-0.655
Token exodus
Token ID39524
Feature activation+1.64e-01

Pos logit contributions
ilation+0.609
ongyang+0.580
idding+0.569
orian+0.569
 Conclusion+0.560
Neg logit contributions
asca-0.816
 rug-0.719
 CTR-0.682
-0.602
ワン-0.591
Token of
Token ID286
Feature activation-9.88e-03

Pos logit contributions
asca+0.587
 rug+0.515
 CTR+0.487
+0.427
ワン+0.419
Neg logit contributions
ilation-0.464
ongyang-0.444
idding-0.435
orian-0.435
 Conclusion-0.428
Token would
Token ID561
Feature activation+4.71e-01

Pos logit contributions
ilation+0.030
ongyang+0.029
orian+0.029
idding+0.029
 Conclusion+0.028
Neg logit contributions
asca-0.033
 rug-0.028
 CTR-0.027
-0.023
ワン-0.023
Token-
Token ID12
Feature activation+3.92e-01

Pos logit contributions
asca+1.378
 rug+1.169
 CTR+1.101
+0.929
ワン+0.913
Neg logit contributions
ilation-1.636
ongyang-1.568
idding-1.555
orian-1.555
 Conclusion-1.530
Tokenbe
Token ID1350
Feature activation+5.05e-01

Pos logit contributions
asca+1.304
 rug+1.121
 CTR+1.058
+0.911
 �+0.889
Neg logit contributions
ilation-1.204
ongyang-1.144
idding-1.131
 Conclusion-1.125
orian-1.123
Token prospect
Token ID6034
Feature activation+1.12e+00

Pos logit contributions
asca+1.615
 rug+1.401
 CTR+1.313
+1.131
ワン+1.105
Neg logit contributions
ilation-1.614
ongyang-1.545
idding-1.523
orian-1.522
 Conclusion-1.500
Tokenors
Token ID669
Feature activation+2.38e-01

Pos logit contributions
asca+5.173
 rug+4.712
 CTR+4.510
+4.128
ワン+4.032
Neg logit contributions
ilation-1.875
ongyang-1.727
idding-1.704
 Conclusion-1.682
 Opposition-1.676
Token dreaming
Token ID28368
Feature activation+2.93e-01

Pos logit contributions
asca+0.799
 rug+0.696
 CTR+0.655
+0.570
ワン+0.559
Neg logit contributions
ilation-0.721
ongyang-0.690
idding-0.680
orian-0.679
 Conclusion-0.669
Token of
Token ID286
Feature activation+1.90e-01

Pos logit contributions
asca+1.092
 rug+0.963
 CTR+0.915
+0.812
ワン+0.799
Neg logit contributions
ilation-0.778
ongyang-0.737
idding-0.726
orian-0.725
 Conclusion-0.713
Token a
Token ID257
Feature activation-3.97e-02

Pos logit contributions
asca+0.647
 rug+0.565
 CTR+0.533
+0.465
ワン+0.455
Neg logit contributions
ilation-0.566
ongyang-0.541
orian-0.532
idding-0.532
 Conclusion-0.523
Token better
Token ID1365
Feature activation-4.42e-01

Pos logit contributions
ilation+0.125
ongyang+0.119
idding+0.118
orian+0.117
 Conclusion+0.116
Neg logit contributions
asca-0.129
 rug-0.111
 CTR-0.105
-0.091
ワン-0.090
Token cheap
Token ID7026
Feature activation-2.11e-01

Pos logit contributions
asca+0.791
 rug+0.677
 CTR+0.638
+0.551
ワン+0.541
Neg logit contributions
ilation-0.808
ongyang-0.773
orian-0.764
idding-0.763
 Conclusion-0.753
Token shot
Token ID2823
Feature activation+5.08e-01

Pos logit contributions
ilation+0.628
ongyang+0.601
idding+0.592
orian+0.591
 Opposition+0.584
Neg logit contributions
asca-0.720
 rug-0.622
 CTR-0.592
-0.520
ワン-0.512
Token,
Token ID11
Feature activation+4.01e-02

Pos logit contributions
asca+1.712
 rug+1.494
 CTR+1.406
+1.225
ワン+1.199
Neg logit contributions
ilation-1.529
ongyang-1.461
idding-1.438
orian-1.436
 Conclusion-1.418
Token Crow
Token ID15502
Feature activation-1.05e-01

Pos logit contributions
asca+0.121
 rug+0.102
 CTR+0.096
+0.083
ワン+0.082
Neg logit contributions
ilation-0.136
orian-0.129
ongyang-0.129
idding-0.128
 Opposition-0.127
Tokender
Token ID1082
Feature activation+5.68e-03

Pos logit contributions
ilation+0.282
ongyang+0.270
idding+0.262
 Protector+0.261
orian+0.261
Neg logit contributions
asca-0.394
 rug-0.343
 CTR-0.327
-0.289
 �-0.282
Token repeated
Token ID5100
Feature activation+3.95e-01

Pos logit contributions
asca+0.022
 rug+0.019
 CTR+0.018
+0.016
ワン+0.016
Neg logit contributions
ilation-0.015
ongyang-0.014
idding-0.014
orian-0.014
 Conclusion-0.013
Token himself
Token ID2241
Feature activation-3.18e-02

Pos logit contributions
asca+1.408
 rug+1.236
 CTR+1.171
+1.030
ワン+1.012
Neg logit contributions
ilation-1.113
ongyang-1.060
idding-1.043
orian-1.041
 Conclusion-1.026
Token.
Token ID13
Feature activation-1.46e-01

Pos logit contributions
ilation+0.086
ongyang+0.082
orian+0.081
idding+0.081
 Opposition+0.079
Neg logit contributions
asca-0.117
 rug-0.103
 CTR-0.098
-0.086
ワン-0.085
Token\n
Token ID198
Feature activation-4.63e-02

Pos logit contributions
ilation+0.374
orian+0.351
 Opposition+0.351
ongyang+0.350
idding+0.349
Neg logit contributions
asca-0.558
 rug-0.494
 CTR-0.464
-0.417
ワン-0.411
Token\n
Token ID198
Feature activation+2.14e-03

Pos logit contributions
ilation+0.133
ongyang+0.129
idding+0.125
orian+0.124
 Conclusion+0.122
Neg logit contributions
asca-0.163
 rug-0.143
 CTR-0.136
-0.119
 �-0.117
Token"
Token ID1
Feature activation-3.20e-01

Pos logit contributions
asca+0.008
 rug+0.007
 CTR+0.006
+0.006
ワン+0.006
Neg logit contributions
ilation-0.006
ongyang-0.006
orian-0.006
idding-0.006
 Conclusion-0.005
Token federal
Token ID2717
Feature activation-1.12e-02

Pos logit contributions
ilation+0.263
ongyang+0.252
orian+0.250
idding+0.249
 Conclusion+0.247
Neg logit contributions
asca-0.239
 rug-0.203
 CTR-0.190
-0.163
ワン-0.161
Token law
Token ID1099
Feature activation-5.11e-01

Pos logit contributions
ilation+0.028
ongyang+0.026
idding+0.026
orian+0.026
 Opposition+0.025
Neg logit contributions
asca-0.044
 rug-0.039
 CTR-0.037
-0.033
ワン-0.032
Token enforcement
Token ID5394
Feature activation-8.82e-01

Pos logit contributions
ilation+1.540
orian+1.477
ongyang+1.473
idding+1.464
 Opposition+1.445
Neg logit contributions
asca-1.725
 rug-1.467
 CTR-1.370
ワン-1.225
-1.202
Token?
Token ID30
Feature activation+4.44e-02

Pos logit contributions
ilation+2.454
ongyang+2.339
orian+2.314
idding+2.309
 Opposition+2.297
Neg logit contributions
asca-3.137
 rug-2.723
 CTR-2.572
-2.291
ワン-2.288
Token It
Token ID632
Feature activation-4.68e-02

Pos logit contributions
asca+0.162
 rug+0.143
 CTR+0.135
+0.120
ワン+0.118
Neg logit contributions
ilation-0.121
ongyang-0.114
idding-0.113
orian-0.113
 Conclusion-0.112
Token doesn
Token ID1595
Feature activation+1.01e-01

Pos logit contributions
ilation+0.138
ongyang+0.131
orian+0.130
idding+0.129
 Conclusion+0.128
Neg logit contributions
asca-0.160
 rug-0.140
 CTR-0.132
-0.116
ワン-0.115
Token
Token ID447
Feature activation+2.69e-01

Pos logit contributions
asca+0.372
 rug+0.327
 CTR+0.314
+0.277
 �+0.271
Neg logit contributions
ilation-0.274
ongyang-0.263
idding-0.256
orian-0.255
 Conclusion-0.250
Token
Token ID247
Feature activation+9.94e-02

Pos logit contributions
asca+0.916
 rug+0.802
 CTR+0.757
+0.662
 �+0.651
Neg logit contributions
ilation-0.799
ongyang-0.765
idding-0.754
orian-0.748
 Conclusion-0.740
Tokent
Token ID83
Feature activation+1.64e-02

Pos logit contributions
asca+0.336
 rug+0.292
 CTR+0.276
+0.241
ワン+0.236
Neg logit contributions
ilation-0.299
ongyang-0.285
idding-0.281
orian-0.281
 Conclusion-0.277
Token of
Token ID286
Feature activation+1.59e-01

Pos logit contributions
asca+0.061
 rug+0.054
 CTR+0.051
+0.045
ワン+0.045
Neg logit contributions
ilation-0.044
ongyang-0.042
idding-0.041
orian-0.041
 Conclusion-0.040
Token course
Token ID1781
Feature activation+1.40e-01

Pos logit contributions
asca+0.518
 rug+0.448
 CTR+0.421
+0.365
ワン+0.360
Neg logit contributions
ilation-0.498
ongyang-0.476
idding-0.469
orian-0.469
 Conclusion-0.463
Token that
Token ID326
Feature activation-1.92e-01

Pos logit contributions
ilation+0.265
ongyang+0.252
idding+0.249
orian+0.249
 Conclusion+0.245
Neg logit contributions
asca-0.324
 rug-0.283
 CTR-0.268
-0.236
ワン-0.232
Token moving
Token ID3867
Feature activation-1.64e-02

Pos logit contributions
ilation+0.561
ongyang+0.533
idding+0.526
orian+0.526
 Conclusion+0.518
Neg logit contributions
asca-0.663
 rug-0.578
 CTR-0.547
-0.480
ワン-0.473
Token into
Token ID656
Feature activation-2.70e-01

Pos logit contributions
ilation+0.050
ongyang+0.047
orian+0.047
idding+0.047
 Opposition+0.046
Neg logit contributions
asca-0.055
 rug-0.047
 CTR-0.045
-0.039
ワン-0.039
Token a
Token ID257
Feature activation+9.59e-02

Pos logit contributions
ilation+0.747
ongyang+0.706
orian+0.698
idding+0.697
 Opposition+0.690
Neg logit contributions
asca-0.978
 rug-0.855
 CTR-0.811
-0.720
ワン-0.710
Token shelter
Token ID11772
Feature activation-6.25e-02

Pos logit contributions
asca+0.313
 rug+0.270
 CTR+0.255
+0.221
ワン+0.217
Neg logit contributions
ilation-0.300
ongyang-0.286
idding-0.282
orian-0.282
 Conclusion-0.279
Token is
Token ID318
Feature activation+9.55e-02

Pos logit contributions
ilation+0.189
ongyang+0.179
idding+0.178
orian+0.178
 Conclusion+0.175
Neg logit contributions
asca-0.210
 rug-0.182
 CTR-0.172
-0.151
ワン-0.149
Token a
Token ID257
Feature activation-3.83e-02

Pos logit contributions
asca+0.329
 rug+0.288
 CTR+0.272
+0.238
ワン+0.231
Neg logit contributions
ilation-0.281
ongyang-0.269
orian-0.265
idding-0.263
 Conclusion-0.260
Token better
Token ID1365
Feature activation-3.60e-01

Pos logit contributions
ilation+0.111
ongyang+0.106
idding+0.104
orian+0.104
 Conclusion+0.103
Neg logit contributions
asca-0.133
 rug-0.116
 CTR-0.110
-0.097
ワン-0.095
Token option
Token ID3038
Feature activation+1.53e-01

Pos logit contributions
ilation+0.958
ongyang+0.902
idding+0.894
orian+0.890
 Conclusion+0.877
Neg logit contributions
asca-1.345
 rug-1.177
 CTR-1.122
-0.995
ワン-0.982
Token.
Token ID13
Feature activation-5.11e-03

Pos logit contributions
asca+0.521
 rug+0.453
 CTR+0.429
+0.375
ワン+0.370
Neg logit contributions
ilation-0.456
ongyang-0.434
idding-0.429
orian-0.429
 Conclusion-0.423
Token\n
Token ID198
Feature activation+1.37e-01

Pos logit contributions
ilation+0.013
ongyang+0.012
idding+0.012
orian+0.012
 Conclusion+0.012
Neg logit contributions
asca-0.019
 rug-0.017
 CTR-0.016
-0.015
ワン-0.014
Token\n
Token ID198
Feature activation-2.12e-01

Pos logit contributions
ilation+0.497
ongyang+0.480
idding+0.466
orian+0.464
 Conclusion+0.455
Neg logit contributions
asca-0.626
 rug-0.548
 CTR-0.524
-0.460
 �-0.450
Token"
Token ID1
Feature activation-1.61e-01

Pos logit contributions
ilation+0.586
ongyang+0.554
orian+0.550
idding+0.549
 Opposition+0.541
Neg logit contributions
asca-0.769
 rug-0.679
 CTR-0.638
-0.559
ワン-0.556
TokenIndeed
Token ID17854
Feature activation+5.62e-02

Pos logit contributions
ilation+0.438
ongyang+0.416
idding+0.409
orian+0.408
 Conclusion+0.401
Neg logit contributions
asca-0.593
 rug-0.523
 CTR-0.497
-0.440
ワン-0.432
Token,
Token ID11
Feature activation+1.19e-01

Pos logit contributions
asca+0.204
 rug+0.179
 CTR+0.170
+0.150
ワン+0.147
Neg logit contributions
ilation-0.156
ongyang-0.148
idding-0.146
orian-0.145
 Conclusion-0.143
Token it
Token ID340
Feature activation-6.67e-03

Pos logit contributions
asca+0.396
 rug+0.345
 CTR+0.325
+0.281
ワン+0.275
Neg logit contributions
ilation-0.364
ongyang-0.348
idding-0.342
orian-0.341
 Conclusion-0.337
Token would
Token ID561
Feature activation+2.52e-01

Pos logit contributions
ilation+0.019
ongyang+0.018
orian+0.018
idding+0.018
 Opposition+0.018
Neg logit contributions
asca-0.023
 rug-0.020
 CTR-0.019
-0.017
ワン-0.017
Token now
Token ID783
Feature activation-7.39e-02

Pos logit contributions
asca+0.813
 rug+0.704
 CTR+0.661
+0.571
ワン+0.558
Neg logit contributions
ilation-0.798
ongyang-0.763
idding-0.753
orian-0.752
 Conclusion-0.742
Token be
Token ID307
Feature activation-6.69e-02

Pos logit contributions
ilation+0.240
ongyang+0.229
orian+0.226
idding+0.226
 Conclusion+0.223
Neg logit contributions
asca-0.233
 rug-0.200
 CTR-0.188
-0.163
ワン-0.160
Token a
Token ID257
Feature activation-1.84e-01

Pos logit contributions
ilation+0.203
ongyang+0.194
idding+0.191
orian+0.191
 Conclusion+0.188
Neg logit contributions
asca-0.225
 rug-0.195
 CTR-0.184
-0.160
ワン-0.157
Token surprise
Token ID5975
Feature activation+8.79e-02

Pos logit contributions
ilation+0.567
ongyang+0.540
idding+0.534
orian+0.534
 Conclusion+0.528
Neg logit contributions
asca-0.610
 rug-0.527
 CTR-0.497
-0.434
ワン-0.427
Token if
Token ID611
Feature activation+4.82e-01

Pos logit contributions
asca+0.297
 rug+0.259
 CTR+0.244
+0.212
ワン+0.207
Neg logit contributions
ilation-0.265
ongyang-0.253
idding-0.249
orian-0.249
 Conclusion-0.245
Token monthly
Token ID9651
Feature activation+5.69e-01

Pos logit contributions
asca+0.860
 rug+0.743
 CTR+0.701
+0.611
ワン+0.599
Neg logit contributions
ilation-0.809
ongyang-0.772
idding-0.762
orian-0.762
 Conclusion-0.752
Token salary
Token ID9588
Feature activation+2.20e-01

Pos logit contributions
asca+1.662
 rug+1.439
 CTR+1.344
+1.120
 �+1.089
Neg logit contributions
ilation-1.958
ongyang-1.891
idding-1.858
orian-1.852
 Conclusion-1.828
Token expectation
Token ID17507
Feature activation+1.05e-01

Pos logit contributions
asca+0.841
 rug+0.746
 CTR+0.709
+0.630
ワン+0.620
Neg logit contributions
ilation-0.564
ongyang-0.534
idding-0.525
orian-0.524
 Conclusion-0.514
Token of
Token ID286
Feature activation+8.55e-02

Pos logit contributions
asca+0.403
 rug+0.357
 CTR+0.339
+0.302
ワン+0.298
Neg logit contributions
ilation-0.268
ongyang-0.253
idding-0.249
orian-0.249
 Conclusion-0.244
Token 6
Token ID718
Feature activation+1.77e-01

Pos logit contributions
asca+0.258
 rug+0.222
 CTR+0.208
+0.176
ワン+0.171
Neg logit contributions
ilation-0.289
ongyang-0.277
idding-0.273
orian-0.272
 Conclusion-0.268
Token,
Token ID11
Feature activation+3.46e-01

Pos logit contributions
asca+0.644
 rug+0.565
 CTR+0.536
+0.474
ワン+0.466
Neg logit contributions
ilation-0.491
ongyang-0.466
idding-0.459
orian-0.459
 Conclusion-0.452
Token000
Token ID830
Feature activation+7.13e-02

Pos logit contributions
asca+1.287
 rug+1.137
 CTR+1.080
+0.957
ワン+0.939
Neg logit contributions
ilation-0.926
ongyang-0.880
idding-0.864
orian-0.862
 Conclusion-0.849
Token h
Token ID289
Feature activation-2.92e-01

Pos logit contributions
asca+0.236
 rug+0.203
 CTR+0.192
+0.168
ワン+0.167
Neg logit contributions
ilation-0.219
ongyang-0.209
orian-0.207
idding-0.206
 Opposition-0.204
Tokenry
Token ID563
Feature activation-1.24e-02

Pos logit contributions
ilation+0.667
ongyang+0.623
orian+0.620
idding+0.616
 Conclusion+0.596
Neg logit contributions
asca-1.203
 rug-1.072
 CTR-1.025
-0.927
ワン-0.919
Tokenv
Token ID85
Feature activation+3.53e-01

Pos logit contributions
ilation+0.033
ongyang+0.031
orian+0.031
idding+0.031
 Conclusion+0.030
Neg logit contributions
asca-0.046
 rug-0.041
 CTR-0.039
-0.035
ワン-0.034
Tokennia
Token ID18142
Feature activation+4.67e-01

Pos logit contributions
asca+1.332
 rug+1.177
 CTR+1.120
+0.995
ワン+0.980
Neg logit contributions
ilation-0.924
ongyang-0.875
idding-0.862
orian-0.861
 Conclusion-0.845
Token thereby
Token ID12839
Feature activation-2.38e-01

Pos logit contributions
asca+0.232
 rug+0.200
 CTR+0.189
+0.164
ワン+0.161
Neg logit contributions
ilation-0.217
ongyang-0.207
orian-0.205
idding-0.204
 Conclusion-0.202
Token take
Token ID1011
Feature activation-5.84e-01

Pos logit contributions
ilation+0.748
ongyang+0.711
idding+0.705
orian+0.704
 Conclusion+0.696
Neg logit contributions
asca-0.772
 rug-0.663
 CTR-0.629
-0.543
ワン-0.538
Token possession
Token ID7797
Feature activation-3.61e-01

Pos logit contributions
ilation+1.689
orian+1.602
ongyang+1.593
 Opposition+1.573
idding+1.568
Neg logit contributions
asca-2.025
 rug-1.753
 CTR-1.675
-1.469
ワン-1.456
Token of
Token ID286
Feature activation-4.79e-03

Pos logit contributions
ilation+0.874
ongyang+0.816
orian+0.814
idding+0.810
 Conclusion+0.803
Neg logit contributions
asca-1.433
 rug-1.265
 CTR-1.221
-1.079
ワン-1.070
Token it
Token ID340
Feature activation+9.72e-03

Pos logit contributions
ilation+0.014
ongyang+0.014
orian+0.013
idding+0.013
 Conclusion+0.013
Neg logit contributions
asca-0.016
 rug-0.014
 CTR-0.014
-0.012
ワン-0.012
Token.
Token ID13
Feature activation-2.28e-01

Pos logit contributions
asca+0.037
 rug+0.033
 CTR+0.031
+0.028
ワン+0.028
Neg logit contributions
ilation-0.025
ongyang-0.023
idding-0.023
orian-0.023
 Conclusion-0.023
Token
Token ID447
Feature activation+3.65e-02

Pos logit contributions
ilation+0.538
ongyang+0.508
idding+0.496
orian+0.496
 Conclusion+0.483
Neg logit contributions
asca-0.923
 rug-0.821
 CTR-0.786
-0.704
ワン-0.693
Token
Token ID251
Feature activation-2.31e-02

Pos logit contributions
asca+0.124
 rug+0.108
 CTR+0.102
+0.089
ワン+0.088
Neg logit contributions
ilation-0.109
ongyang-0.104
idding-0.103
orian-0.103
 Conclusion-0.102
Token\n
Token ID198
Feature activation+1.43e-01

Pos logit contributions
ilation+0.055
ongyang+0.052
orian+0.051
idding+0.051
 Conclusion+0.050
Neg logit contributions
asca-0.093
 rug-0.082
 CTR-0.079
-0.070
ワン-0.069
Token\n
Token ID198
Feature activation+9.70e-02

Pos logit contributions
asca+0.495
 rug+0.431
 CTR+0.415
+0.361
 �+0.358
Neg logit contributions
ilation-0.418
ongyang-0.406
idding-0.392
orian-0.390
 Conclusion-0.381
TokenThe
Token ID464
Feature activation+9.01e-02

Pos logit contributions
asca+0.352
 rug+0.306
 CTR+0.291
+0.260
ワン+0.254
Neg logit contributions
ilation-0.269
ongyang-0.257
idding-0.252
orian-0.250
 Opposition-0.247
Token will
Token ID481
Feature activation-1.10e-01

Pos logit contributions
ilation+2.326
ongyang+2.201
orian+2.192
 Opposition+2.168
idding+2.166
Neg logit contributions
asca-2.858
 rug-2.493
 CTR-2.378
-2.076
ワン-2.066
Token pass
Token ID1208
Feature activation-6.52e-01

Pos logit contributions
ilation+0.338
ongyang+0.322
orian+0.318
idding+0.318
 Conclusion+0.314
Neg logit contributions
asca-0.364
 rug-0.315
 CTR-0.297
-0.258
ワン-0.254
Token a
Token ID257
Feature activation+5.55e-02

Pos logit contributions
ilation+1.869
ongyang+1.760
orian+1.750
idding+1.748
 Opposition+1.741
Neg logit contributions
asca-2.291
 rug-1.987
 CTR-1.863
ワン-1.661
 裏�-1.649
Token law
Token ID1099
Feature activation-5.98e-01

Pos logit contributions
asca+0.166
 rug+0.141
 CTR+0.131
+0.112
ワン+0.110
Neg logit contributions
ilation-0.190
ongyang-0.182
idding-0.179
orian-0.179
 Conclusion-0.177
Token that
Token ID326
Feature activation-4.00e-01

Pos logit contributions
ilation+1.732
ongyang+1.647
idding+1.632
orian+1.631
 Opposition+1.604
Neg logit contributions
asca-2.086
 rug-1.812
 CTR-1.717
-1.508
ワン-1.503
Token will
Token ID481
Feature activation-3.40e-01

Pos logit contributions
ilation+1.247
ongyang+1.187
orian+1.179
idding+1.175
 Opposition+1.163
Neg logit contributions
asca-1.306
 rug-1.126
 CTR-1.060
-0.922
ワン-0.917
Token ann
Token ID1529
Feature activation-9.12e-01

Pos logit contributions
ilation+1.082
ongyang+1.032
orian+1.022
idding+1.022
 Conclusion+1.007
Neg logit contributions
asca-1.090
 rug-0.936
 CTR-0.881
-0.764
ワン-0.751
Tokenul
Token ID377
Feature activation-2.08e-01

Pos logit contributions
ilation+2.163
idding+2.008
ongyang+2.000
orian+1.984
emic+1.947
Neg logit contributions
asca-3.665
 rug-3.223
 CTR-3.112
-2.824
 裏�-2.818
Token all
Token ID477
Feature activation-5.81e-01

Pos logit contributions
ilation+0.595
ongyang+0.563
orian+0.556
idding+0.555
 Opposition+0.551
Neg logit contributions
asca-0.739
 rug-0.644
 CTR-0.608
-0.539
ワン-0.535
Token legal
Token ID2742
Feature activation-1.17e+00

Pos logit contributions
ilation+1.575
ongyang+1.489
orian+1.473
idding+1.471
 Opposition+1.460
Neg logit contributions
asca-2.133
 rug-1.866
 CTR-1.772
-1.576
ワン-1.563
Token claims
Token ID3667
Feature activation-4.45e-01

Pos logit contributions
ilation+3.435
 Opposition+3.279
orian+3.253
idding+3.237
ongyang+3.223
Neg logit contributions
asca-3.988
 rug-3.348
 CTR-3.179
-2.808
ワン-2.796
Token didn
Token ID1422
Feature activation-1.93e-01

Pos logit contributions
ilation+0.030
ongyang+0.029
idding+0.028
orian+0.028
 Conclusion+0.028
Neg logit contributions
asca-0.029
 rug-0.025
 CTR-0.024
-0.020
ワン-0.020
Token't
Token ID470
Feature activation+5.68e-02

Pos logit contributions
ilation+0.495
ongyang+0.473
idding+0.460
orian+0.459
 Conclusion+0.450
Neg logit contributions
asca-0.740
 rug-0.653
 CTR-0.626
-0.558
ワン-0.545
Token go
Token ID467
Feature activation+6.70e-02

Pos logit contributions
asca+0.187
 rug+0.163
 CTR+0.153
+0.132
ワン+0.129
Neg logit contributions
ilation-0.176
ongyang-0.169
orian-0.166
idding-0.166
 Conclusion-0.163
Token back
Token ID736
Feature activation+1.02e-01

Pos logit contributions
asca+0.233
 rug+0.201
 CTR+0.191
+0.168
ワン+0.166
Neg logit contributions
ilation-0.196
ongyang-0.186
idding-0.184
orian-0.184
 Conclusion-0.181
Token to
Token ID284
Feature activation-1.87e-01

Pos logit contributions
asca+0.365
 rug+0.319
 CTR+0.302
+0.267
ワン+0.264
Neg logit contributions
ilation-0.287
ongyang-0.272
orian-0.270
idding-0.269
 Opposition-0.264
Token school
Token ID1524
Feature activation-4.32e-01

Pos logit contributions
ilation+0.548
ongyang+0.519
orian+0.515
idding+0.512
 Opposition+0.507
Neg logit contributions
asca-0.648
 rug-0.559
 CTR-0.529
-0.467
ワン-0.460
Token in
Token ID287
Feature activation-2.26e-02

Pos logit contributions
ilation+1.124
orian+1.049
idding+1.048
ongyang+1.046
 Protector+1.031
Neg logit contributions
asca-1.636
 rug-1.422
 CTR-1.362
-1.223
ワン-1.200
Token 2010
Token ID3050
Feature activation+2.27e-01

Pos logit contributions
ilation+0.065
ongyang+0.062
idding+0.061
orian+0.061
 Conclusion+0.061
Neg logit contributions
asca-0.079
 rug-0.069
 CTR-0.065
-0.058
ワン-0.057
Token.
Token ID13
Feature activation-8.38e-02

Pos logit contributions
asca+0.881
 rug+0.777
 CTR+0.741
+0.660
ワン+0.652
Neg logit contributions
ilation-0.573
ongyang-0.537
orian-0.533
idding-0.530
 Conclusion-0.522
Token
Token ID198
Feature activation+1.35e-01

Pos logit contributions
ilation+0.203
ongyang+0.191
idding+0.189
orian+0.188
 Conclusion+0.186
Neg logit contributions
asca-0.333
 rug-0.296
 CTR-0.281
-0.252
ワン-0.248
Token
Token ID198
Feature activation+1.23e-01

Pos logit contributions
asca+0.468
 rug+0.408
 CTR+0.392
+0.340
 �+0.338
Neg logit contributions
ilation-0.392
ongyang-0.382
idding-0.368
orian-0.366
 Conclusion-0.359
Token his
Token ID465
Feature activation-1.44e-01

Pos logit contributions
asca+0.232
 rug+0.202
 CTR+0.191
+0.167
ワン+0.164
Neg logit contributions
ilation-0.200
ongyang-0.190
idding-0.188
orian-0.188
 Opposition-0.185
Token 100
Token ID1802
Feature activation-1.29e-01

Pos logit contributions
ilation+0.451
ongyang+0.430
idding+0.426
orian+0.425
 Conclusion+0.419
Neg logit contributions
asca-0.470
 rug-0.406
 CTR-0.382
-0.332
ワン-0.325
Token.
Token ID13
Feature activation-2.91e-01

Pos logit contributions
ilation+0.435
ongyang+0.414
idding+0.412
orian+0.412
 Conclusion+0.408
Neg logit contributions
asca-0.392
 rug-0.333
 CTR-0.312
-0.267
ワン-0.260
Token
Token ID198
Feature activation+6.03e-02

Pos logit contributions
ilation+0.710
orian+0.667
 Conclusion+0.665
ongyang+0.663
idding+0.662
Neg logit contributions
asca-1.151
 rug-1.017
 CTR-0.957
-0.859
 裏�-0.846
Token
Token ID198
Feature activation-9.67e-03

Pos logit contributions
asca+0.211
 rug+0.185
 CTR+0.178
+0.157
 �+0.152
Neg logit contributions
ilation-0.173
ongyang-0.167
idding-0.162
orian-0.161
emic-0.158
TokenG
Token ID38
Feature activation-8.77e-02

Pos logit contributions
ilation+0.028
ongyang+0.026
orian+0.026
idding+0.026
 Conclusion+0.025
Neg logit contributions
asca-0.034
 rug-0.030
 CTR-0.028
-0.025
ワン-0.024
Tokenerr
Token ID8056
Feature activation-9.70e-01

Pos logit contributions
ilation+0.191
ongyang+0.178
orian+0.176
idding+0.176
 Conclusion+0.169
Neg logit contributions
asca-0.370
 rug-0.332
 CTR-0.318
-0.288
ワン-0.284
Tokenard
Token ID446
Feature activation-9.39e-01

Pos logit contributions
ilation+1.812
idding+1.688
ongyang+1.660
orian+1.657
 Opposition+1.601
Neg logit contributions
asca-4.367
 rug-3.954
 CTR-3.795
-3.481
 裏�-3.455
Token
Token ID447
Feature activation-2.89e-01

Pos logit contributions
ilation+2.548
orian+2.388
ongyang+2.387
idding+2.365
 Opposition+2.364
Neg logit contributions
asca-3.414
 rug-2.985
 CTR-2.833
-2.538
ワン-2.506
Token
Token ID247
Feature activation-2.40e-02

Pos logit contributions
ilation+0.818
ongyang+0.779
orian+0.768
idding+0.766
 Conclusion+0.756
Neg logit contributions
asca-1.030
 rug-0.902
 CTR-0.853
-0.751
ワン-0.741
Tokens
Token ID82
Feature activation+1.59e-02

Pos logit contributions
ilation+0.069
ongyang+0.066
idding+0.065
orian+0.065
 Conclusion+0.064
Neg logit contributions
asca-0.084
 rug-0.074
 CTR-0.070
-0.061
ワン-0.060
Token traveling
Token ID11300
Feature activation+3.49e-01

Pos logit contributions
asca+0.101
 rug+0.088
 CTR+0.083
+0.073
ワン+0.072
Neg logit contributions
ilation-0.083
ongyang-0.079
orian-0.078
idding-0.078
 Opposition-0.077
Token gra
Token ID7933
Feature activation-2.84e-01

Pos logit contributions
asca+1.177
 rug+1.024
 CTR+0.966
+0.843
ワン+0.827
Neg logit contributions
ilation-1.051
ongyang-1.003
idding-0.989
orian-0.988
 Conclusion-0.974
Tokenff
Token ID487
Feature activation-1.76e-01

Pos logit contributions
ilation+0.650
ongyang+0.599
orian+0.595
idding+0.594
 Conclusion+0.579
Neg logit contributions
asca-1.165
 rug-1.051
 CTR-0.998
-0.899
ワン-0.891
Tokenito
Token ID10094
Feature activation-1.49e-01

Pos logit contributions
ilation+0.437
ongyang+0.413
orian+0.409
idding+0.408
 Opposition+0.398
Neg logit contributions
asca-0.689
 rug-0.611
 CTR-0.583
-0.521
ワン-0.515
Token.
Token ID13
Feature activation+1.23e-01

Pos logit contributions
ilation+0.428
ongyang+0.403
orian+0.402
idding+0.402
 Opposition+0.396
Neg logit contributions
asca-0.524
 rug-0.455
 CTR-0.434
-0.382
ワン-0.378
Token
Token ID198
Feature activation+4.12e-02

Pos logit contributions
asca+0.471
 rug+0.417
 CTR+0.396
+0.353
ワン+0.348
Neg logit contributions
ilation-0.315
ongyang-0.297
idding-0.293
orian-0.293
 Conclusion-0.288
Token
Token ID198
Feature activation-2.97e-03

Pos logit contributions
asca+0.144
 rug+0.125
 CTR+0.121
+0.105
 �+0.104
Neg logit contributions
ilation-0.119
ongyang-0.116
idding-0.112
orian-0.111
 Conclusion-0.109
TokenFor
Token ID1890
Feature activation-2.41e-01

Pos logit contributions
ilation+0.008
ongyang+0.008
orian+0.008
idding+0.008
 Conclusion+0.008
Neg logit contributions
asca-0.011
 rug-0.009
 CTR-0.009
-0.008
ワン-0.008
Token photographic
Token ID35788
Feature activation-2.44e-01

Pos logit contributions
ilation+0.730
ongyang+0.690
idding+0.688
orian+0.683
 Opposition+0.679
Neg logit contributions
asca-0.805
 rug-0.700
 CTR-0.662
-0.579
ワン-0.571
Token evidence
Token ID2370
Feature activation+4.47e-01

Pos logit contributions
ilation+0.803
ongyang+0.760
idding+0.757
 Opposition+0.755
orian+0.754
Neg logit contributions
asca-0.760
 rug-0.644
 CTR-0.610
-0.526
ワン-0.521
Token of
Token ID286
Feature activation+3.33e-01

Pos logit contributions
asca+1.677
 rug+1.483
 CTR+1.409
+1.251
ワン+1.228
Neg logit contributions
ilation-1.175
ongyang-1.116
idding-1.096
orian-1.096
 Conclusion-1.078
Token the
Token ID262
Feature activation-4.23e-01

Pos logit contributions
ilation+0.421
ongyang+0.399
idding+0.395
orian+0.393
 Opposition+0.392
Neg logit contributions
asca-0.536
 rug-0.468
 CTR-0.443
-0.393
ワン-0.389
Token petition
Token ID8853
Feature activation-5.96e-01

Pos logit contributions
ilation+1.167
ongyang+1.105
idding+1.090
 Opposition+1.089
orian+1.088
Neg logit contributions
asca-1.539
 rug-1.344
 CTR-1.274
-1.141
ワン-1.127
Tokeners
Token ID364
Feature activation-9.74e-02

Pos logit contributions
ilation+1.113
ongyang+1.027
idding+1.006
orian+1.003
 Opposition+0.980
Neg logit contributions
asca-2.696
 rug-2.429
 CTR-2.330
-2.129
ワン-2.112
Token on
Token ID319
Feature activation-5.08e-01

Pos logit contributions
ilation+0.275
ongyang+0.262
orian+0.258
idding+0.258
 Conclusion+0.253
Neg logit contributions
asca-0.348
 rug-0.305
 CTR-0.289
-0.254
ワン-0.249
Token whose
Token ID3025
Feature activation-2.18e-01

Pos logit contributions
ilation+1.486
ongyang+1.406
idding+1.396
 Opposition+1.395
orian+1.384
Neg logit contributions
asca-1.759
 rug-1.520
 CTR-1.433
-1.276
ワン-1.266
Token plea
Token ID16720
Feature activation-1.44e+00

Pos logit contributions
ilation+0.640
ongyang+0.608
idding+0.601
orian+0.599
 Opposition+0.594
Neg logit contributions
asca-0.753
 rug-0.655
 CTR-0.620
-0.544
ワン-0.536
Token the
Token ID262
Feature activation-4.32e-01

Pos logit contributions
ilation+3.808
 Opposition+3.656
idding+3.592
ongyang+3.544
 Conclusion+3.522
Neg logit contributions
asca-5.272
 rug-4.547
 CTR-4.350
ワン-3.966
-3.931
Token high
Token ID1029
Feature activation-4.64e-01

Pos logit contributions
ilation+1.197
ongyang+1.131
 Opposition+1.124
idding+1.119
orian+1.115
Neg logit contributions
asca-1.553
 rug-1.352
 CTR-1.284
-1.151
ワン-1.136
Token court
Token ID2184
Feature activation-1.08e+00

Pos logit contributions
ilation+1.205
ongyang+1.125
idding+1.117
orian+1.114
 Opposition+1.101
Neg logit contributions
asca-1.756
 rug-1.517
 CTR-1.433
ワン-1.321
-1.316
Token had
Token ID550
Feature activation+3.41e-03

Pos logit contributions
ilation+3.064
 Opposition+2.889
ongyang+2.883
 Protector+2.862
idding+2.861
Neg logit contributions
asca-3.768
 rug-3.236
 CTR-3.095
-2.777
ワン-2.768
Token passed
Token ID3804
Feature activation-7.58e-01

Pos logit contributions
asca+0.011
 rug+0.010
 CTR+0.009
+0.008
ワン+0.008
Neg logit contributions
ilation-0.010
ongyang-0.010
idding-0.010
orian-0.010
 Conclusion-0.010
Token the
Token ID262
Feature activation-1.52e-01

Pos logit contributions
ilation+0.700
ongyang+0.663
orian+0.661
idding+0.656
 Opposition+0.654
Neg logit contributions
asca-0.793
 rug-0.684
 CTR-0.645
-0.568
ワン-0.562
Token power
Token ID1176
Feature activation-7.13e-01

Pos logit contributions
ilation+0.439
ongyang+0.417
orian+0.413
idding+0.412
 Opposition+0.407
Neg logit contributions
asca-0.533
 rug-0.464
 CTR-0.439
-0.387
ワン-0.382
Token of
Token ID286
Feature activation-2.52e-01

Pos logit contributions
ilation+1.936
ongyang+1.829
orian+1.819
 Opposition+1.808
idding+1.795
Neg logit contributions
asca-2.623
 rug-2.263
 CTR-2.160
ワン-1.914
-1.906
Token the
Token ID262
Feature activation-1.94e-01

Pos logit contributions
ilation+0.723
ongyang+0.684
orian+0.679
 Opposition+0.678
idding+0.676
Neg logit contributions
asca-0.892
 rug-0.775
 CTR-0.734
-0.649
ワン-0.642
Token New
Token ID968
Feature activation-8.58e-03

Pos logit contributions
ilation+0.524
ongyang+0.497
orian+0.491
idding+0.490
 Opposition+0.485
Neg logit contributions
asca-0.712
 rug-0.625
 CTR-0.593
-0.528
ワン-0.520
Token York
Token ID1971
Feature activation-1.07e+00

Pos logit contributions
ilation+0.019
ongyang+0.017
orian+0.017
 Opposition+0.017
idding+0.017
Neg logit contributions
asca-0.036
 rug-0.033
 CTR-0.031
-0.028
ワン-0.028
Token Old
Token ID5706
Feature activation-2.56e-01

Pos logit contributions
ilation+2.803
orian+2.687
 Opposition+2.670
 Protector+2.644
idding+2.633
Neg logit contributions
asca-3.966
 rug-3.447
 CTR-3.263
-2.976
ワン-2.948
Token Guard
Token ID4932
Feature activation-5.58e-01

Pos logit contributions
ilation+0.247
orian+0.209
 Opposition+0.206
ongyang+0.201
 Protector+0.200
Neg logit contributions
asca-1.391
 rug-1.272
 CTR-1.228
-1.147
ワン-1.141
Token."
Token ID526
Feature activation-4.90e-01

Pos logit contributions
ilation+1.575
ongyang+1.495
orian+1.488
 Opposition+1.477
idding+1.474
Neg logit contributions
asca-1.978
 rug-1.731
 CTR-1.644
-1.453
ワン-1.448
Token He
Token ID679
Feature activation-9.75e-02

Pos logit contributions
ilation+1.234
 Conclusion+1.162
 Opposition+1.160
orian+1.158
idding+1.151
Neg logit contributions
asca-1.885
 rug-1.672
 CTR-1.573
-1.415
ワン-1.400
Token also
Token ID635
Feature activation-1.23e-02

Pos logit contributions
ilation+0.327
ongyang+0.313
orian+0.310
idding+0.309
 Conclusion+0.306
Neg logit contributions
asca-0.296
 rug-0.253
 CTR-0.238
-0.204
ワン-0.200
Tokenous
Token ID516
Feature activation+2.30e-01

Pos logit contributions
ilation+1.103
ongyang+0.999
orian+0.994
idding+0.994
emic+0.971
Neg logit contributions
asca-1.967
 rug-1.772
 CTR-1.697
ワン-1.535
-1.523
Token"
Token ID1
Feature activation+8.68e-01

Pos logit contributions
asca+0.846
 rug+0.744
 CTR+0.707
+0.625
ワン+0.616
Neg logit contributions
ilation-0.623
ongyang-0.590
idding-0.581
orian-0.581
 Conclusion-0.571
Token and
Token ID290
Feature activation+3.10e-01

Pos logit contributions
asca+2.743
 rug+2.390
 CTR+2.223
+1.909
 �+1.868
Neg logit contributions
ilation-2.760
ongyang-2.675
idding-2.628
orian-2.615
 Conclusion-2.575
Token would
Token ID561
Feature activation+4.03e-01

Pos logit contributions
asca+1.014
 rug+0.881
 CTR+0.824
+0.712
ワン+0.695
Neg logit contributions
ilation-0.968
ongyang-0.929
idding-0.912
orian-0.908
 Conclusion-0.896
Token force
Token ID2700
Feature activation-4.41e-01

Pos logit contributions
asca+1.250
 rug+1.100
 CTR+1.011
+0.862
 �+0.851
Neg logit contributions
ilation-1.311
ongyang-1.271
idding-1.236
orian-1.234
 Opposition-1.220
Token one
Token ID530
Feature activation-9.48e-01

Pos logit contributions
ilation+1.197
ongyang+1.132
orian+1.117
idding+1.114
 Opposition+1.104
Neg logit contributions
asca-1.616
 rug-1.412
 CTR-1.351
-1.198
ワン-1.182
Token side
Token ID1735
Feature activation+4.82e-01

Pos logit contributions
ilation+2.627
ongyang+2.472
orian+2.469
 Opposition+2.443
idding+2.438
Neg logit contributions
asca-3.405
 rug-2.901
 CTR-2.843
-2.525
 裏�-2.489
Token of
Token ID286
Feature activation+5.00e-01

Pos logit contributions
asca+1.823
 rug+1.614
 CTR+1.530
+1.360
ワン+1.340
Neg logit contributions
ilation-1.256
ongyang-1.190
idding-1.169
orian-1.169
 Conclusion-1.151
Token our
Token ID674
Feature activation+2.19e-02

Pos logit contributions
asca+1.761
 rug+1.545
 CTR+1.460
+1.281
ワン+1.258
Neg logit contributions
ilation-1.432
ongyang-1.363
idding-1.345
orian-1.343
 Conclusion-1.322
Token planet
Token ID5440
Feature activation+5.25e-01

Pos logit contributions
asca+0.086
 rug+0.076
 CTR+0.072
+0.065
ワン+0.064
Neg logit contributions
ilation-0.054
ongyang-0.051
idding-0.051
orian-0.050
 Conclusion-0.049
Token to
Token ID284
Feature activation-2.99e-01

Pos logit contributions
asca+1.817
 rug+1.589
 CTR+1.491
+1.306
ワン+1.281
Neg logit contributions
ilation-1.540
ongyang-1.466
idding-1.443
orian-1.441
 Conclusion-1.428
Token P
Token ID350
Feature activation+5.85e-02

Pos logit contributions
ilation+0.850
ongyang+0.799
orian+0.795
idding+0.791
 Conclusion+0.784
Neg logit contributions
asca-1.202
 rug-1.061
 CTR-1.007
-0.894
ワン-0.886
Tokenogue
Token ID5119
Feature activation+1.46e-01

Pos logit contributions
asca+0.227
 rug+0.200
 CTR+0.190
+0.169
ワン+0.166
Neg logit contributions
ilation-0.147
ongyang-0.139
idding-0.136
orian-0.136
 Conclusion-0.135
Token at
Token ID379
Feature activation-4.86e-02

Pos logit contributions
asca+0.533
 rug+0.469
 CTR+0.446
+0.395
ワン+0.389
Neg logit contributions
ilation-0.396
ongyang-0.377
orian-0.372
idding-0.371
 Conclusion-0.365
Token the
Token ID262
Feature activation+2.55e-02

Pos logit contributions
ilation+0.106
ongyang+0.099
orian+0.098
idding+0.097
 Conclusion+0.096
Neg logit contributions
asca-0.204
 rug-0.183
 CTR-0.175
-0.158
ワン-0.156
Token New
Token ID968
Feature activation-3.77e-01

Pos logit contributions
asca+0.104
 rug+0.093
 CTR+0.089
+0.080
ワン+0.078
Neg logit contributions
ilation-0.059
ongyang-0.056
idding-0.055
orian-0.054
 Conclusion-0.053
Token York
Token ID1971
Feature activation-1.32e+00

Pos logit contributions
ilation+1.044
ongyang+0.987
orian+0.985
idding+0.971
 Opposition+0.964
Neg logit contributions
asca-1.366
 rug-1.201
 CTR-1.135
-1.004
ワン-1.003
Token Times
Token ID3782
Feature activation+2.45e-01

Pos logit contributions
ilation+1.867
orian+1.707
idding+1.663
 Opposition+1.660
ongyang+1.659
Neg logit contributions
asca-6.446
 rug-5.825
 CTR-5.598
ワン-5.316
-5.269
Token as
Token ID355
Feature activation-2.04e-01

Pos logit contributions
asca+0.928
 rug+0.820
 CTR+0.780
+0.694
ワン+0.686
Neg logit contributions
ilation-0.639
ongyang-0.605
orian-0.600
idding-0.594
 Opposition-0.588
Token well
Token ID880
Feature activation+1.85e-02

Pos logit contributions
ilation+0.585
ongyang+0.555
idding+0.550
orian+0.550
 Opposition+0.547
Neg logit contributions
asca-0.716
 rug-0.621
 CTR-0.588
-0.517
ワン-0.514
Token as
Token ID355
Feature activation-1.56e-01

Pos logit contributions
asca+0.062
 rug+0.053
 CTR+0.050
+0.044
ワン+0.043
Neg logit contributions
ilation-0.057
ongyang-0.054
idding-0.053
orian-0.053
 Conclusion-0.053
Token by
Token ID416
Feature activation-2.30e-01

Pos logit contributions
ilation+0.437
ongyang+0.414
idding+0.410
orian+0.410
 Opposition+0.405
Neg logit contributions
asca-0.556
 rug-0.487
 CTR-0.460
-0.407
ワン-0.404
Tokenre
Token ID260
Feature activation+1.73e-01

Pos logit contributions
ilation+0.097
ongyang+0.092
idding+0.090
orian+0.090
 Conclusion+0.089
Neg logit contributions
asca-0.127
 rug-0.111
 CTR-0.105
-0.093
ワン-0.092
Token accused
Token ID5371
Feature activation+2.55e-01

Pos logit contributions
asca+0.597
 rug+0.520
 CTR+0.492
+0.431
ワン+0.424
Neg logit contributions
ilation-0.509
ongyang-0.485
idding-0.478
orian-0.478
 Opposition-0.471
Token of
Token ID286
Feature activation-1.01e-01

Pos logit contributions
asca+0.996
 rug+0.880
 CTR+0.838
+0.747
ワン+0.738
Neg logit contributions
ilation-0.640
ongyang-0.603
orian-0.594
idding-0.594
 Conclusion-0.584
Token having
Token ID1719
Feature activation-1.79e-01

Pos logit contributions
ilation+0.278
ongyang+0.263
orian+0.260
idding+0.260
 Opposition+0.257
Neg logit contributions
asca-0.371
 rug-0.325
 CTR-0.308
-0.273
ワン-0.269
Token a
Token ID257
Feature activation+1.40e-01

Pos logit contributions
ilation+0.521
ongyang+0.495
orian+0.493
 Opposition+0.489
idding+0.488
Neg logit contributions
asca-0.615
 rug-0.533
 CTR-0.506
-0.445
ワン-0.438
Token tax
Token ID1687
Feature activation-9.46e-01

Pos logit contributions
asca+0.477
 rug+0.413
 CTR+0.390
+0.342
ワン+0.337
Neg logit contributions
ilation-0.417
ongyang-0.396
orian-0.393
idding-0.390
 Opposition-0.388
Token-
Token ID12
Feature activation+2.48e-03

Pos logit contributions
ilation+2.855
orian+2.684
 Protector+2.657
ongyang+2.644
 Opposition+2.642
Neg logit contributions
asca-3.145
 rug-2.654
 CTR-2.564
ワン-2.244
 裏�-2.202
Tokenand
Token ID392
Feature activation-4.53e-03

Pos logit contributions
asca+0.009
 rug+0.008
 CTR+0.008
+0.007
ワン+0.007
Neg logit contributions
ilation-0.007
ongyang-0.006
orian-0.006
idding-0.006
 Conclusion-0.006
Token-
Token ID12
Feature activation+2.53e-01

Pos logit contributions
ilation+0.016
ongyang+0.016
idding+0.016
orian+0.016
 Conclusion+0.015
Neg logit contributions
asca-0.013
 rug-0.011
 CTR-0.010
-0.008
ワン-0.008
Tokensp
Token ID2777
Feature activation+2.87e-01

Pos logit contributions
asca+0.735
 rug+0.626
 CTR+0.580
+0.492
ワン+0.483
Neg logit contributions
ilation-0.880
ongyang-0.845
idding-0.837
orian-0.836
 Opposition-0.824
Tokenend
Token ID437
Feature activation+3.03e-01

Pos logit contributions
asca+1.127
 rug+0.985
 CTR+0.948
+0.830
 �+0.814
Neg logit contributions
ilation-0.699
ongyang-0.672
 Opposition-0.652
idding-0.648
 Conclusion-0.646