TOP ACTIVATIONS
MAX = 2.346

' a ), bulls ( bu qa , for men ),
long itude near the lunar equ ator . Typically the most
ce re mon iously ' du ck down ' to enter
1980 s partly due to spl inter ing within the group
N ok ai ) " dog ", obviously in an attempt
ike p ū i ke au n ui a au
, which is at the equ ator , cent rip et
rid of the proverbial al bat ross around its neck .
. As Thomas C . Tig he , a U .
to represent Patrick O ' Brian . He borrowed a paperback
P ied Butcher Bird .
article published in N . Eng l . J . Med
question of the age of pr imate fossils from this region
e o ak ula ka lan ak ila i ā Kal
Maybe their calls will go un he eded certainly the
of good and bad , equ ating the self - support
is the Jets un quest ioned best sniper ,
transit corridors and major commercial bou lev ards . This proposal
) would be an al bat ross on Miami
have changed the understanding of pr imate evolution in this region

MIN ACTIVATIONS
MIN = -2.615

My colleague Amer A ther was responsible for testing it
48 points , McF adden only accounted for one
. 8 7 Erit rea 129 .
Europe Russian state energy company Gaz prom has resumed some gas
is what matters most in geop olitics it is plain
during the murder , suddenly trans forms itself into the a
Three months ago students at Ober lin College in Ohio objected
it out to focus on undergrad uates . O urs is
transgender identity ; when a transsexual person 's internal gender identity
- old University of Virginia undergrad who served 18 months behind
Russian TV stations showed a Gaz prom official ordering a res
the S PA , Koreans cast their ballots again . The
Finnish parliament in Helsinki , Ves a - Matt i Sa
Monaco in 2008 , president Jerome de B ont in wanted
. Host esses in white gown s and golden s ashes
of road safety , a draft Road Transport & Safety Bill
that point , and S ibur 's annual report for 2012
too , the news was grim from Republicans . In this
that was used as a pretext for the country
overthrow Gaddafi and essentially shove Gaz prom out . I mean

INTERVAL 1.105 - 2.346
CONTAINS 0.677%

the divides are within the Democratic Party . The self -
along the path to fully equal rights . The federal government
to a 32 bit micro controller , and especially writing a
[ 22 ] Literally , R ashi says ,
By law , the national debt cannot exceed $ 16 .

INTERVAL -0.135 - 1.105
CONTAINS 63.448%

all the publishers of London ." There has always been something
It continues to am aze me how little
new features ( including off - TV play ) that have
t work , a la I RL .
: round - trip propagation time ( RT prop ), and

INTERVAL -1.375 - -0.135
CONTAINS 35.649%

fold mechanism had broken under the air loads allowing the wing
h ose hanging from the top of the kitchen window .
the P YD 's role in the north at the start
W arp Factor Calculator is a Java Script calculator
my favorite cooking tools , shop my kitchen essentials ! <|endoftext|>

INTERVAL -2.615 - -1.375
CONTAINS 0.226%

during the murder , suddenly trans forms itself into the a
Europe Russian state energy company Gaz prom has resumed some gas
If I was going to do another one again , I
the past eb oard or cloth . The metal discs also
considered one of the main inspir ations for the story .
Token'
Token ID6
Feature activation-1.56e-01

Pos logit contributions
attery+0.864
ゴン+0.817
ukong+0.786
 Interest+0.781
orf+0.770
Neg logit contributions
uates-1.308
龍�-1.287
-1.271
cos-1.239
ero-1.216
Tokena
Token ID64
Feature activation+6.00e-01

Pos logit contributions
uates+0.382
龍�+0.374
+0.370
cos+0.361
ero+0.352
Neg logit contributions
attery-0.420
ゴン-0.403
ukong-0.390
 Interest-0.389
orf-0.383
Token),
Token ID828
Feature activation+4.60e-01

Pos logit contributions
attery+1.171
ゴン+1.102
ukong+1.058
 Interest+1.054
orf+1.038
Neg logit contributions
uates-1.907
龍�-1.879
-1.857
cos-1.818
ero-1.776
Token bulls
Token ID40317
Feature activation+1.04e+00

Pos logit contributions
attery+1.144
ゴン+1.079
 Interest+1.060
ukong+1.049
orf+1.029
Neg logit contributions
uates-1.218
龍�-1.208
-1.190
cos-1.149
ero-1.127
Token (
Token ID357
Feature activation+1.39e-01

Pos logit contributions
attery+2.764
ゴン+2.585
 Interest+2.535
ukong+2.507
abilities+2.477
Neg logit contributions
龍�-2.639
uates-2.597
-2.531
cos-2.467
ero-2.352
Tokenbu
Token ID11110
Feature activation+2.35e+00

Pos logit contributions
attery+0.403
ゴン+0.387
ukong+0.377
 Interest+0.372
orf+0.371
Neg logit contributions
uates-0.314
龍�-0.308
-0.303
cos-0.292
ero-0.286
Tokenqa
Token ID20402
Feature activation+7.10e-01

Pos logit contributions
attery+6.191
ukong+5.903
orf+5.744
anty+5.725
ゴン+5.723
Neg logit contributions
龍�-5.400
uates-5.287
-5.198
cos-4.838
 reluct-4.712
Token,
Token ID11
Feature activation-1.16e-01

Pos logit contributions
attery+1.410
ゴン+1.316
ukong+1.283
 Interest+1.266
orf+1.248
Neg logit contributions
uates-2.229
龍�-2.206
-2.170
cos-2.111
ero-2.070
Token for
Token ID329
Feature activation+4.93e-01

Pos logit contributions
uates+0.293
龍�+0.287
+0.284
cos+0.277
ero+0.271
Neg logit contributions
attery-0.305
ゴン-0.293
ukong-0.282
 Interest-0.282
ATURE-0.277
Token men
Token ID1450
Feature activation+1.54e-01

Pos logit contributions
attery+1.266
ゴン+1.190
ukong+1.165
 Interest+1.159
ATURE+1.141
Neg logit contributions
uates-1.266
龍�-1.258
-1.237
cos-1.197
ero-1.168
Token),
Token ID828
Feature activation+4.37e-01

Pos logit contributions
attery+0.322
ゴン+0.301
 Interest+0.289
ukong+0.289
orf+0.284
Neg logit contributions
uates-0.471
龍�-0.466
-0.461
cos-0.451
ero-0.442
Token long
Token ID890
Feature activation-2.50e-01

Pos logit contributions
uates+0.401
龍�+0.396
+0.391
cos+0.382
ero+0.372
Neg logit contributions
attery-0.391
ゴン-0.372
 Interest-0.364
ukong-0.360
orf-0.354
Tokenitude
Token ID3984
Feature activation+8.99e-01

Pos logit contributions
uates+0.676
龍�+0.665
+0.659
cos+0.645
ero+0.630
Neg logit contributions
attery-0.610
ゴン-0.583
 Interest-0.561
ukong-0.560
ATURE-0.550
Token near
Token ID1474
Feature activation+7.32e-01

Pos logit contributions
attery+2.110
 Interest+1.994
ゴン+1.993
abilities+1.935
ukong+1.934
Neg logit contributions
龍�-2.465
uates-2.432
-2.411
cos-2.321
ero-2.259
Token the
Token ID262
Feature activation-2.31e-01

Pos logit contributions
attery+1.737
 Interest+1.626
ゴン+1.610
ukong+1.598
multiple+1.560
Neg logit contributions
龍�-2.008
uates-1.986
-1.952
cos-1.889
ero-1.849
Token lunar
Token ID25572
Feature activation+5.63e-01

Pos logit contributions
uates+0.601
龍�+0.592
+0.585
cos+0.570
ero+0.557
Neg logit contributions
attery-0.586
ゴン-0.559
 Interest-0.543
ukong-0.541
orf-0.530
Token equ
Token ID1602
Feature activation+2.24e+00

Pos logit contributions
attery+1.469
 Interest+1.374
ゴン+1.363
ukong+1.352
orf+1.336
Neg logit contributions
龍�-1.425
uates-1.419
-1.396
cos-1.319
ero-1.310
Tokenator
Token ID1352
Feature activation+6.74e-01

Pos logit contributions
attery+3.236
iverse+2.881
orf+2.835
abilities+2.754
ukong+2.722
Neg logit contributions
龍�-8.041
-8.033
uates-7.909
 reluct-7.634
cos-7.491
Token.
Token ID13
Feature activation+1.96e-01

Pos logit contributions
attery+1.615
ゴン+1.513
 Interest+1.507
ukong+1.472
orf+1.470
Neg logit contributions
龍�-1.835
uates-1.809
-1.793
cos-1.730
ero-1.692
Token Typically
Token ID27095
Feature activation+5.90e-01

Pos logit contributions
attery+0.474
 Interest+0.447
ゴン+0.446
ukong+0.433
ATURE+0.431
Neg logit contributions
uates-0.535
龍�-0.528
-0.519
cos-0.505
ero-0.500
Token the
Token ID262
Feature activation+1.59e-01

Pos logit contributions
attery+1.406
ゴン+1.310
 Interest+1.310
ukong+1.290
ATURE+1.267
Neg logit contributions
龍�-1.611
uates-1.596
-1.586
cos-1.528
ero-1.494
Token most
Token ID749
Feature activation-1.01e-01

Pos logit contributions
attery+0.394
ゴン+0.372
 Interest+0.367
ukong+0.363
orf+0.357
Neg logit contributions
uates-0.424
龍�-0.420
-0.415
cos-0.400
ero-0.392
Tokence
Token ID344
Feature activation-4.66e-01

Pos logit contributions
attery+3.797
iverse+3.570
multiple+3.566
 Interest+3.508
reath+3.502
Neg logit contributions
龍�-4.219
-4.068
uates-3.993
 reluct-3.807
ero-3.727
Tokenre
Token ID260
Feature activation-4.75e-02

Pos logit contributions
uates+1.006
龍�+0.977
+0.970
cos+0.950
ero+0.923
Neg logit contributions
attery-1.393
ゴン-1.351
ukong-1.306
 Interest-1.304
ATURE-1.284
Tokenmon
Token ID2144
Feature activation-2.07e-02

Pos logit contributions
uates+0.113
龍�+0.110
+0.110
cos+0.109
ero+0.106
Neg logit contributions
attery-0.130
ゴン-0.128
ukong-0.122
 Interest-0.121
orf-0.119
Tokeniously
Token ID6819
Feature activation+4.45e-01

Pos logit contributions
uates+0.062
龍�+0.061
+0.060
cos+0.059
ero+0.058
Neg logit contributions
attery-0.045
ゴン-0.042
 Interest-0.041
ukong-0.041
ATURE-0.040
Token '
Token ID705
Feature activation+4.25e-02

Pos logit contributions
attery+1.141
ゴン+1.070
 Interest+1.061
ukong+1.054
orf+1.042
Neg logit contributions
uates-1.135
龍�-1.133
-1.117
cos-1.068
ero-1.038
Tokendu
Token ID646
Feature activation+2.14e+00

Pos logit contributions
attery+0.118
ゴン+0.113
ukong+0.110
multiple+0.109
 Interest+0.109
Neg logit contributions
uates-0.101
龍�-0.099
-0.097
cos-0.093
ero-0.092
Tokenck
Token ID694
Feature activation+4.81e-01

Pos logit contributions
attery+4.988
 Interest+4.588
arb+4.583
iverse+4.578
orf+4.537
Neg logit contributions
龍�-5.813
-5.596
uates-5.584
cos-5.190
ero-5.134
Token down
Token ID866
Feature activation+1.73e-01

Pos logit contributions
attery+1.137
ゴン+1.079
 Interest+1.053
ukong+1.046
multiple+1.033
Neg logit contributions
龍�-1.319
uates-1.318
-1.296
cos-1.253
ero-1.225
Token'
Token ID6
Feature activation-2.58e-02

Pos logit contributions
attery+0.384
ゴン+0.362
 Interest+0.352
ukong+0.349
orf+0.343
Neg logit contributions
uates-0.504
龍�-0.501
-0.494
cos-0.483
ero-0.471
Token to
Token ID284
Feature activation+3.80e-01

Pos logit contributions
uates+0.072
龍�+0.071
+0.070
cos+0.068
ero+0.067
Neg logit contributions
attery-0.061
ゴン-0.058
 Interest-0.056
ukong-0.056
orf-0.055
Token enter
Token ID3802
Feature activation+3.67e-01

Pos logit contributions
attery+0.982
ゴン+0.928
 Interest+0.916
ukong+0.906
orf+0.892
Neg logit contributions
uates-0.965
龍�-0.958
-0.945
cos-0.919
ero-0.891
Token 1980
Token ID7169
Feature activation+1.85e-01

Pos logit contributions
uates+0.581
龍�+0.568
+0.561
cos+0.544
ero+0.526
Neg logit contributions
attery-0.950
ゴン-0.917
 Interest-0.891
ukong-0.891
ATURE-0.878
Tokens
Token ID82
Feature activation-4.84e-01

Pos logit contributions
attery+0.796
ゴン+0.768
 Interest+0.763
ukong+0.759
orf+0.754
Neg logit contributions
uates-0.154
龍�-0.151
-0.141
cos-0.125
ero-0.119
Token partly
Token ID11476
Feature activation+4.67e-02

Pos logit contributions
uates+1.259
龍�+1.234
+1.224
cos+1.199
ero+1.170
Neg logit contributions
attery-1.234
ゴン-1.183
ukong-1.139
 Interest-1.135
ATURE-1.115
Token due
Token ID2233
Feature activation+6.57e-01

Pos logit contributions
attery+0.098
ゴン+0.092
 Interest+0.090
ukong+0.089
orf+0.087
Neg logit contributions
龍�-0.141
uates-0.140
-0.139
cos-0.135
ero-0.132
Token to
Token ID284
Feature activation-9.79e-02

Pos logit contributions
attery+1.435
ゴン+1.344
 Interest+1.314
ukong+1.302
ATURE+1.278
Neg logit contributions
龍�-1.932
uates-1.928
-1.904
cos-1.854
ero-1.809
Token spl
Token ID4328
Feature activation+2.08e+00

Pos logit contributions
uates+0.257
龍�+0.254
+0.251
cos+0.244
ero+0.238
Neg logit contributions
attery-0.247
ゴン-0.235
 Interest-0.228
ukong-0.227
ATURE-0.223
Tokeninter
Token ID3849
Feature activation-2.29e-01

Pos logit contributions
attery+3.842
 Interest+3.102
orf+3.069
itars+3.060
multiple+3.049
Neg logit contributions
龍�-7.078
-6.964
uates-6.871
cos-6.552
 reluct-6.540
Tokening
Token ID278
Feature activation+2.32e-01

Pos logit contributions
uates+0.731
龍�+0.721
+0.715
cos+0.702
ero+0.689
Neg logit contributions
attery-0.449
ゴン-0.423
 Interest-0.404
ukong-0.403
ATURE-0.393
Token within
Token ID1626
Feature activation+6.70e-01

Pos logit contributions
attery+0.567
 Interest+0.531
ゴン+0.530
ukong+0.519
orf+0.510
Neg logit contributions
uates-0.621
龍�-0.619
-0.611
cos-0.593
ero-0.578
Token the
Token ID262
Feature activation+2.98e-01

Pos logit contributions
attery+1.555
 Interest+1.441
ゴン+1.424
ukong+1.386
multiple+1.378
Neg logit contributions
龍�-1.872
uates-1.869
-1.841
cos-1.774
ero-1.736
Token group
Token ID1448
Feature activation-5.40e-01

Pos logit contributions
attery+0.752
 Interest+0.702
ゴン+0.702
ukong+0.683
orf+0.677
Neg logit contributions
uates-0.780
龍�-0.770
-0.757
cos-0.737
ero-0.715
TokenN
Token ID45
Feature activation+5.19e-01

Pos logit contributions
attery+2.694
ゴン+2.600
ukong+2.521
multiple+2.504
orf+2.484
Neg logit contributions
uates-2.412
龍�-2.342
-2.285
cos-2.217
ero-2.206
Tokenok
Token ID482
Feature activation+4.90e-01

Pos logit contributions
attery+1.593
ゴン+1.508
ukong+1.495
orf+1.479
 Interest+1.476
Neg logit contributions
uates-1.076
龍�-1.065
-1.054
cos-0.998
ero-0.959
Tokenai
Token ID1872
Feature activation+4.07e-01

Pos logit contributions
attery+1.330
ゴン+1.270
ukong+1.240
 Interest+1.227
orf+1.213
Neg logit contributions
uates-1.188
龍�-1.172
-1.162
cos-1.114
ero-1.090
Token)
Token ID8
Feature activation+1.06e+00

Pos logit contributions
attery+0.861
ゴン+0.821
ukong+0.784
 Interest+0.782
orf+0.766
Neg logit contributions
uates-1.234
龍�-1.212
-1.202
cos-1.173
ero-1.153
Token "
Token ID366
Feature activation-4.96e-02

Pos logit contributions
attery+2.723
ゴン+2.615
ukong+2.543
 Interest+2.542
orf+2.523
Neg logit contributions
uates-2.665
龍�-2.596
-2.571
cos-2.464
ero-2.420
Tokendog
Token ID9703
Feature activation+2.04e+00

Pos logit contributions
uates+0.124
龍�+0.122
+0.120
cos+0.116
ero+0.114
Neg logit contributions
attery-0.132
ゴン-0.126
ukong-0.122
 Interest-0.121
multiple-0.120
Token",
Token ID1600
Feature activation+3.77e-01

Pos logit contributions
attery+4.891
ゴン+4.580
 Interest+4.507
ukong+4.482
ATURE+4.452
Neg logit contributions
uates-5.408
龍�-5.269
-5.144
cos-4.983
ero-4.831
Token obviously
Token ID6189
Feature activation+2.72e-01

Pos logit contributions
attery+0.877
ゴン+0.838
 Interest+0.805
ukong+0.804
orf+0.789
Neg logit contributions
uates-1.061
龍�-1.039
-1.028
cos-1.009
ero-0.990
Token in
Token ID287
Feature activation+1.95e-01

Pos logit contributions
attery+0.661
ゴン+0.629
 Interest+0.613
ukong+0.606
ATURE+0.597
Neg logit contributions
uates-0.736
龍�-0.725
-0.716
cos-0.699
ero-0.683
Token an
Token ID281
Feature activation+2.08e-01

Pos logit contributions
attery+0.450
ゴン+0.426
 Interest+0.414
ukong+0.413
orf+0.402
Neg logit contributions
uates-0.553
龍�-0.549
-0.540
cos-0.526
ero-0.514
Token attempt
Token ID2230
Feature activation-1.23e-01

Pos logit contributions
attery+0.470
ゴン+0.449
 Interest+0.433
ukong+0.429
orf+0.424
Neg logit contributions
uates-0.596
龍�-0.587
-0.583
cos-0.568
ero-0.556
Tokenike
Token ID522
Feature activation+3.13e-01

Pos logit contributions
attery+0.740
ゴン+0.700
ukong+0.685
orf+0.679
 Interest+0.675
Neg logit contributions
uates-0.579
龍�-0.576
-0.569
cos-0.544
ero-0.521
Token p
Token ID279
Feature activation+5.17e-01

Pos logit contributions
attery+0.809
ゴン+0.778
 Interest+0.751
ukong+0.746
orf+0.735
Neg logit contributions
uates-0.796
龍�-0.787
-0.780
cos-0.748
ero-0.729
Tokenū
Token ID20317
Feature activation+9.29e-01

Pos logit contributions
attery+1.147
ゴン+1.060
ukong+1.039
orf+1.020
 Interest+1.020
Neg logit contributions
uates-1.508
龍�-1.501
-1.497
cos-1.459
ero-1.401
Token i
Token ID1312
Feature activation+6.08e-01

Pos logit contributions
attery+2.416
ゴン+2.321
 Interest+2.253
ukong+2.244
orf+2.206
Neg logit contributions
uates-2.283
龍�-2.277
-2.243
cos-2.121
ero-2.056
Token ke
Token ID885
Feature activation+7.65e-01

Pos logit contributions
attery+1.511
ゴン+1.448
 Interest+1.393
ukong+1.382
iverse+1.371
Neg logit contributions
uates-1.585
龍�-1.562
-1.558
cos-1.492
ero-1.451
Token au
Token ID35851
Feature activation+2.04e+00

Pos logit contributions
attery+1.979
ゴン+1.892
 Interest+1.835
ukong+1.820
orf+1.801
Neg logit contributions
uates-1.909
龍�-1.898
-1.878
cos-1.791
ero-1.743
Token n
Token ID299
Feature activation+8.75e-01

Pos logit contributions
attery+4.946
ゴン+4.730
 Interest+4.548
ukong+4.509
iverse+4.475
Neg logit contributions
uates-5.133
龍�-5.116
-5.044
cos-4.790
ero-4.542
Tokenui
Token ID9019
Feature activation+8.92e-01

Pos logit contributions
attery+2.226
ゴン+2.055
ukong+2.035
 Interest+2.011
orf+2.010
Neg logit contributions
龍�-2.271
uates-2.251
-2.241
cos-2.143
ero-2.044
Token a
Token ID257
Feature activation+6.25e-01

Pos logit contributions
attery+2.223
ゴン+2.137
 Interest+2.076
ukong+2.051
orf+2.039
Neg logit contributions
uates-2.301
龍�-2.278
-2.275
cos-2.174
ero-2.086
Token au
Token ID35851
Feature activation+1.67e+00

Pos logit contributions
attery+1.556
ゴン+1.492
 Interest+1.439
ukong+1.433
orf+1.417
Neg logit contributions
uates-1.632
龍�-1.618
-1.602
cos-1.545
ero-1.491
Token
Token ID220
Feature activation+8.77e-01

Pos logit contributions
attery+4.066
ゴン+3.929
 Interest+3.788
ukong+3.747
orf+3.713
Neg logit contributions
uates-4.227
龍�-4.197
-4.146
cos-3.954
ero-3.807
Token,
Token ID11
Feature activation-1.06e-01

Pos logit contributions
uates+0.366
龍�+0.365
+0.360
cos+0.350
ero+0.342
Neg logit contributions
attery-0.323
ゴン-0.308
 Interest-0.298
ukong-0.297
ATURE-0.292
Token which
Token ID543
Feature activation-3.18e-01

Pos logit contributions
uates+0.287
龍�+0.287
+0.281
cos+0.273
ero+0.267
Neg logit contributions
attery-0.259
ゴン-0.244
 Interest-0.240
ukong-0.239
ATURE-0.233
Token is
Token ID318
Feature activation+2.35e-01

Pos logit contributions
uates+0.821
龍�+0.811
+0.802
cos+0.783
ero+0.765
Neg logit contributions
attery-0.810
ゴン-0.776
ukong-0.749
 Interest-0.748
ATURE-0.734
Token at
Token ID379
Feature activation+2.60e-01

Pos logit contributions
attery+0.550
ゴン+0.515
 Interest+0.509
ukong+0.507
ATURE+0.497
Neg logit contributions
龍�-0.654
uates-0.650
-0.643
cos-0.621
ero-0.607
Token the
Token ID262
Feature activation+1.54e-01

Pos logit contributions
attery+0.619
ゴン+0.581
 Interest+0.574
ukong+0.571
orf+0.556
Neg logit contributions
龍�-0.718
uates-0.715
-0.703
cos-0.675
ero-0.663
Token equ
Token ID1602
Feature activation+2.04e+00

Pos logit contributions
attery+0.456
ゴン+0.436
 Interest+0.430
ukong+0.428
orf+0.421
Neg logit contributions
uates-0.332
龍�-0.330
-0.323
cos-0.308
ero-0.300
Tokenator
Token ID1352
Feature activation+5.92e-01

Pos logit contributions
attery+2.738
iverse+2.480
orf+2.467
abilities+2.388
ukong+2.351
Neg logit contributions
龍�-7.525
uates-7.419
-7.343
 reluct-7.034
cos-7.004
Token,
Token ID11
Feature activation+3.62e-02

Pos logit contributions
attery+1.384
ゴン+1.304
 Interest+1.277
ukong+1.274
orf+1.251
Neg logit contributions
龍�-1.650
uates-1.626
-1.606
cos-1.549
ero-1.522
Token cent
Token ID1247
Feature activation-2.83e-01

Pos logit contributions
attery+0.088
ゴン+0.083
 Interest+0.082
ukong+0.082
ATURE+0.080
Neg logit contributions
龍�-0.098
uates-0.097
-0.095
cos-0.092
ero-0.090
Tokenrip
Token ID5528
Feature activation-9.29e-01

Pos logit contributions
uates+0.672
龍�+0.659
+0.652
cos+0.637
ero+0.620
Neg logit contributions
attery-0.783
ゴン-0.753
ukong-0.727
 Interest-0.727
ATURE-0.715
Tokenet
Token ID316
Feature activation-1.19e+00

Pos logit contributions
uates+2.471
cos+2.413
ero+2.407
+2.351
龍�+2.349
Neg logit contributions
ゴン-2.320
attery-2.264
 Interest-2.108
ukong-2.086
multiple-2.079
Token rid
Token ID5755
Feature activation-1.63e-01

Pos logit contributions
attery+0.146
ゴン+0.139
 Interest+0.138
ukong+0.138
ATURE+0.135
Neg logit contributions
uates-0.135
龍�-0.134
-0.133
cos-0.128
ero-0.125
Token of
Token ID286
Feature activation+4.03e-01

Pos logit contributions
uates+0.338
龍�+0.331
+0.327
cos+0.319
ero+0.309
Neg logit contributions
attery-0.499
ゴン-0.482
ukong-0.466
 Interest-0.466
ATURE-0.459
Token the
Token ID262
Feature activation-5.30e-02

Pos logit contributions
attery+1.094
 Interest+1.037
ukong+1.031
ゴン+1.026
multiple+1.011
Neg logit contributions
uates-0.964
龍�-0.958
-0.947
cos-0.899
ero-0.883
Token proverbial
Token ID47327
Feature activation+3.98e-01

Pos logit contributions
uates+0.139
龍�+0.137
+0.136
cos+0.132
ero+0.130
Neg logit contributions
attery-0.133
ゴン-0.127
 Interest-0.123
ukong-0.123
ATURE-0.120
Token al
Token ID435
Feature activation+6.06e-02

Pos logit contributions
attery+1.063
ゴン+1.006
 Interest+0.992
ukong+0.990
ATURE+0.969
Neg logit contributions
uates-0.986
龍�-0.968
-0.952
cos-0.923
ero-0.899
Tokenbat
Token ID8664
Feature activation+2.03e+00

Pos logit contributions
attery+0.103
ゴン+0.095
ukong+0.092
 Interest+0.091
orf+0.088
Neg logit contributions
uates-0.209
龍�-0.207
-0.206
cos-0.201
ero-0.197
Tokenross
Token ID1214
Feature activation+6.12e-01

Pos logit contributions
attery+2.837
orf+2.585
ukong+2.452
 Interest+2.397
ゴン+2.348
Neg logit contributions
龍�-7.550
-7.460
uates-7.418
cos-7.049
 reluct-7.004
Token around
Token ID1088
Feature activation+7.07e-01

Pos logit contributions
attery+1.461
ゴン+1.385
ukong+1.359
 Interest+1.357
orf+1.349
Neg logit contributions
uates-1.655
龍�-1.655
-1.617
cos-1.551
ero-1.493
Token its
Token ID663
Feature activation+5.35e-01

Pos logit contributions
attery+1.727
 Interest+1.623
ukong+1.616
ゴン+1.605
multiple+1.571
Neg logit contributions
uates-1.881
龍�-1.874
-1.840
cos-1.750
ero-1.713
Token neck
Token ID7393
Feature activation+3.61e-01

Pos logit contributions
attery+1.563
ゴン+1.489
 Interest+1.477
ukong+1.469
orf+1.446
Neg logit contributions
uates-1.193
龍�-1.181
-1.156
cos-1.105
ero-1.067
Token.
Token ID13
Feature activation+1.37e-01

Pos logit contributions
attery+0.876
ゴン+0.832
 Interest+0.812
ukong+0.808
multiple+0.794
Neg logit contributions
uates-0.970
龍�-0.969
-0.958
cos-0.925
ero-0.896
Token.
Token ID13
Feature activation-8.64e-02

Pos logit contributions
uates+1.635
龍�+1.600
+1.590
cos+1.551
ero+1.518
Neg logit contributions
attery-1.706
ゴン-1.636
ukong-1.579
 Interest-1.569
ATURE-1.547
Token As
Token ID1081
Feature activation-4.62e-01

Pos logit contributions
uates+0.234
龍�+0.231
+0.228
cos+0.223
ero+0.218
Neg logit contributions
attery-0.210
ゴン-0.199
 Interest-0.195
ukong-0.192
ATURE-0.190
Token Thomas
Token ID5658
Feature activation-4.39e-01

Pos logit contributions
uates+1.208
龍�+1.191
+1.176
cos+1.148
ero+1.119
Neg logit contributions
attery-1.169
ゴン-1.114
 Interest-1.083
ukong-1.075
ATURE-1.058
Token C
Token ID327
Feature activation-3.41e-01

Pos logit contributions
uates+1.147
龍�+1.126
+1.116
cos+1.094
ero+1.069
Neg logit contributions
attery-1.111
ゴン-1.069
ukong-1.027
 Interest-1.021
ATURE-1.005
Token.
Token ID13
Feature activation-3.87e-01

Pos logit contributions
uates+0.874
龍�+0.864
+0.855
cos+0.830
ero+0.806
Neg logit contributions
attery-0.887
ゴン-0.836
 Interest-0.817
ukong-0.814
orf-0.802
Token Tig
Token ID14189
Feature activation+2.02e+00

Pos logit contributions
uates+1.026
龍�+1.009
+1.000
cos+0.980
ero+0.957
Neg logit contributions
attery-0.963
ゴン-0.924
ukong-0.888
 Interest-0.884
ATURE-0.870
Tokenhe
Token ID258
Feature activation+6.41e-03

Pos logit contributions
attery+4.881
orf+4.523
itars+4.507
 Interest+4.409
ukong+4.378
Neg logit contributions
龍�-5.423
-5.320
uates-5.105
 reluct-4.919
cos-4.768
Token,
Token ID11
Feature activation+2.53e-01

Pos logit contributions
attery+0.016
ゴン+0.016
 Interest+0.015
ukong+0.015
orf+0.015
Neg logit contributions
uates-0.017
龍�-0.016
-0.016
cos-0.016
ero-0.015
Token a
Token ID257
Feature activation-1.55e-01

Pos logit contributions
attery+0.626
ゴン+0.588
 Interest+0.588
ATURE+0.570
orf+0.568
Neg logit contributions
uates-0.678
龍�-0.673
-0.658
cos-0.643
ero-0.621
Token U
Token ID471
Feature activation-8.08e-01

Pos logit contributions
uates+0.388
龍�+0.385
+0.377
cos+0.366
ero+0.354
Neg logit contributions
attery-0.412
ゴン-0.392
 Interest-0.385
ukong-0.380
orf-0.376
Token.
Token ID13
Feature activation-1.16e-01

Pos logit contributions
uates+1.854
+1.805
龍�+1.804
cos+1.800
ero+1.743
Neg logit contributions
attery-2.277
ゴン-2.264
ukong-2.121
 Interest-2.089
itars-2.089
Token to
Token ID284
Feature activation+2.39e-01

Pos logit contributions
uates+0.589
龍�+0.576
+0.573
cos+0.562
ero+0.550
Neg logit contributions
attery-0.495
ゴン-0.478
ukong-0.456
 Interest-0.454
orf-0.444
Token represent
Token ID2380
Feature activation+4.54e-01

Pos logit contributions
attery+0.607
ゴン+0.574
 Interest+0.565
ukong+0.559
orf+0.551
Neg logit contributions
uates-0.624
龍�-0.617
-0.609
cos-0.590
ero-0.578
Token Patrick
Token ID9925
Feature activation+4.47e-01

Pos logit contributions
attery+1.174
ゴン+1.101
 Interest+1.099
ukong+1.070
orf+1.067
Neg logit contributions
uates-1.160
龍�-1.155
-1.137
cos-1.100
ero-1.076
Token O
Token ID440
Feature activation+1.85e-01

Pos logit contributions
attery+1.369
ゴン+1.304
 Interest+1.289
ukong+1.267
orf+1.257
Neg logit contributions
uates-0.930
龍�-0.921
-0.920
cos-0.870
ero-0.849
Token'
Token ID6
Feature activation+3.84e-01

Pos logit contributions
attery+0.435
ゴン+0.414
ukong+0.399
 Interest+0.398
ATURE+0.390
Neg logit contributions
uates-0.517
龍�-0.510
-0.506
cos-0.495
ero-0.484
TokenBrian
Token ID24761
Feature activation+2.00e+00

Pos logit contributions
attery+0.608
ゴン+0.566
 Interest+0.532
ukong+0.532
ATURE+0.516
Neg logit contributions
uates-1.368
龍�-1.350
-1.341
cos-1.319
ero-1.298
Token.
Token ID13
Feature activation+2.53e-01

Pos logit contributions
attery+4.160
 Interest+3.891
ゴン+3.807
ukong+3.737
orf+3.729
Neg logit contributions
龍�-5.933
-5.805
uates-5.738
cos-5.512
ero-5.366
Token He
Token ID679
Feature activation+2.57e-01

Pos logit contributions
attery+0.632
 Interest+0.606
ゴン+0.590
ukong+0.575
ATURE+0.573
Neg logit contributions
uates-0.672
龍�-0.663
-0.652
cos-0.633
ero-0.623
Token borrowed
Token ID22546
Feature activation-1.92e-01

Pos logit contributions
attery+0.684
ゴン+0.648
 Interest+0.635
ukong+0.631
orf+0.622
Neg logit contributions
uates-0.638
龍�-0.634
-0.625
cos-0.603
ero-0.586
Token a
Token ID257
Feature activation+1.67e-01

Pos logit contributions
uates+0.500
龍�+0.493
+0.488
cos+0.476
ero+0.465
Neg logit contributions
attery-0.489
ゴン-0.465
 Interest-0.453
ukong-0.450
ATURE-0.443
Token paperback
Token ID46771
Feature activation-7.43e-01

Pos logit contributions
attery+0.440
ゴン+0.418
 Interest+0.412
ukong+0.405
orf+0.401
Neg logit contributions
uates-0.418
龍�-0.415
-0.408
cos-0.395
ero-0.381
Token
Token ID564
Feature activation-9.44e-02

Pos logit contributions
attery+0.018
ゴン+0.017
 Interest+0.017
ukong+0.016
ATURE+0.016
Neg logit contributions
uates-0.018
龍�-0.018
-0.017
cos-0.017
ero-0.016
Token
Token ID246
Feature activation-2.96e-01

Pos logit contributions
uates+0.260
龍�+0.256
+0.253
cos+0.250
ero+0.242
Neg logit contributions
attery-0.225
ゴン-0.216
 Interest-0.207
ukong-0.206
ATURE-0.203
TokenP
Token ID47
Feature activation-2.07e-01

Pos logit contributions
uates+0.745
龍�+0.730
+0.721
cos+0.704
ero+0.687
Neg logit contributions
attery-0.781
ゴン-0.749
ukong-0.723
 Interest-0.722
orf-0.710
Tokenied
Token ID798
Feature activation+6.97e-01

Pos logit contributions
uates+0.536
龍�+0.528
+0.521
cos+0.509
ero+0.494
Neg logit contributions
attery-0.531
ゴン-0.505
ukong-0.491
 Interest-0.490
orf-0.484
Token Butcher
Token ID39680
Feature activation+4.65e-01

Pos logit contributions
attery+1.746
ゴン+1.669
 Interest+1.616
ukong+1.605
orf+1.597
Neg logit contributions
uates-1.839
龍�-1.794
-1.776
cos-1.724
ero-1.676
Token Bird
Token ID14506
Feature activation+1.98e+00

Pos logit contributions
attery+1.137
ゴン+1.081
 Interest+1.053
ukong+1.048
ATURE+1.032
Neg logit contributions
uates-1.254
龍�-1.232
-1.211
cos-1.189
ero-1.154
Token
Token ID447
Feature activation+6.45e-01

Pos logit contributions
attery+4.506
 Interest+4.266
ゴン+4.180
ATURE+4.098
orf+4.088
Neg logit contributions
uates-5.412
龍�-5.406
-5.249
cos-5.109
ero-4.934
Token
Token ID247
Feature activation+2.22e-02

Pos logit contributions
attery+1.459
ゴン+1.393
ATURE+1.313
 Interest+1.313
ukong+1.305
Neg logit contributions
龍�-1.830
uates-1.823
-1.798
cos-1.771
ero-1.714
Token.
Token ID13
Feature activation-2.47e-01

Pos logit contributions
attery+0.060
ゴン+0.058
 Interest+0.056
ukong+0.056
ATURE+0.055
Neg logit contributions
uates-0.054
龍�-0.053
-0.052
cos-0.051
ero-0.049
Token
Token ID198
Feature activation-6.10e-02

Pos logit contributions
uates+0.600
龍�+0.590
+0.582
cos+0.567
ero+0.555
Neg logit contributions
attery-0.674
ゴン-0.641
 Interest-0.630
ukong-0.622
ATURE-0.613
Token
Token ID198
Feature activation+5.34e-02

Pos logit contributions
uates+0.159
龍�+0.155
+0.155
cos+0.152
ero+0.148
Neg logit contributions
attery-0.155
ゴン-0.151
ukong-0.143
 Interest-0.142
orf-0.140
Token article
Token ID2708
Feature activation-6.96e-01

Pos logit contributions
attery+0.377
ゴン+0.359
 Interest+0.352
orf+0.343
ukong+0.342
Neg logit contributions
uates-0.432
龍�-0.424
-0.423
cos-0.413
ero-0.403
Token published
Token ID3199
Feature activation-3.35e-02

Pos logit contributions
uates+1.879
龍�+1.839
+1.829
cos+1.797
ero+1.764
Neg logit contributions
attery-1.682
ゴン-1.629
ukong-1.568
 Interest-1.530
multiple-1.516
Token in
Token ID287
Feature activation+1.78e-01

Pos logit contributions
uates+0.099
龍�+0.097
+0.096
cos+0.095
ero+0.093
Neg logit contributions
attery-0.073
ゴン-0.070
ukong-0.067
 Interest-0.067
ATURE-0.065
Token N
Token ID399
Feature activation+3.01e-01

Pos logit contributions
attery+0.423
ゴン+0.399
 Interest+0.394
ukong+0.383
ATURE+0.382
Neg logit contributions
uates-0.495
龍�-0.488
-0.484
cos-0.471
ero-0.459
Token.
Token ID13
Feature activation-3.62e-01

Pos logit contributions
attery+0.817
ゴン+0.779
 Interest+0.759
ukong+0.758
ATURE+0.747
Neg logit contributions
uates-0.733
龍�-0.723
-0.716
cos-0.693
ero-0.673
Token Eng
Token ID1985
Feature activation+1.96e+00

Pos logit contributions
uates+0.858
龍�+0.842
+0.834
cos+0.815
ero+0.794
Neg logit contributions
attery-1.004
ゴン-0.966
ukong-0.933
 Interest-0.931
ATURE-0.916
Tokenl
Token ID75
Feature activation+2.72e-01

Pos logit contributions
attery+3.908
 Interest+3.619
abilities+3.551
ukong+3.510
orf+3.506
Neg logit contributions
龍�-6.077
-6.051
uates-6.005
 reluct-5.675
cos-5.643
Token.
Token ID13
Feature activation-4.37e-01

Pos logit contributions
attery+0.689
ゴン+0.640
 Interest+0.636
ukong+0.633
ATURE+0.628
Neg logit contributions
uates-0.711
-0.698
龍�-0.696
cos-0.677
ero-0.658
Token J
Token ID449
Feature activation+8.24e-02

Pos logit contributions
uates+1.057
龍�+1.039
+1.028
cos+1.003
ero+0.978
Neg logit contributions
attery-1.189
ゴン-1.148
ukong-1.106
 Interest-1.101
ATURE-1.083
Token.
Token ID13
Feature activation-2.17e-01

Pos logit contributions
attery+0.208
ゴン+0.195
 Interest+0.192
ukong+0.189
ATURE+0.188
Neg logit contributions
uates-0.216
龍�-0.213
-0.212
cos-0.206
ero-0.199
Token Med
Token ID2019
Feature activation+7.92e-01

Pos logit contributions
uates+0.588
龍�+0.577
+0.574
cos+0.558
ero+0.545
Neg logit contributions
attery-0.534
ゴン-0.503
 Interest-0.492
ukong-0.487
ATURE-0.480
Token question
Token ID1808
Feature activation-7.67e-02

Pos logit contributions
attery+0.982
ゴン+0.917
 Interest+0.914
ukong+0.903
ATURE+0.887
Neg logit contributions
龍�-1.027
uates-1.026
-1.015
cos-0.969
ero-0.944
Token of
Token ID286
Feature activation+1.88e-01

Pos logit contributions
uates+0.179
龍�+0.179
+0.176
cos+0.171
ero+0.165
Neg logit contributions
attery-0.213
ゴン-0.203
 Interest-0.200
ukong-0.199
ATURE-0.197
Token the
Token ID262
Feature activation-1.32e-01

Pos logit contributions
attery+0.442
ゴン+0.418
 Interest+0.408
ukong+0.407
ATURE+0.397
Neg logit contributions
uates-0.524
龍�-0.521
-0.514
cos-0.499
ero-0.488
Token age
Token ID2479
Feature activation-4.80e-01

Pos logit contributions
uates+0.369
龍�+0.364
+0.360
cos+0.351
ero+0.344
Neg logit contributions
attery-0.311
ゴン-0.296
 Interest-0.285
ukong-0.285
ATURE-0.279
Token of
Token ID286
Feature activation+2.03e-01

Pos logit contributions
uates+1.051
龍�+1.031
+1.020
cos+0.992
ero+0.964
Neg logit contributions
attery-1.420
ゴン-1.368
ukong-1.325
 Interest-1.325
ATURE-1.304
Token pr
Token ID778
Feature activation+1.95e+00

Pos logit contributions
attery+0.486
ゴン+0.455
 Interest+0.451
ukong+0.446
ATURE+0.436
Neg logit contributions
uates-0.557
龍�-0.555
-0.547
cos-0.527
ero-0.514
Tokenimate
Token ID1920
Feature activation+1.33e+00

Pos logit contributions
attery+2.061
ukong+1.708
 Interest+1.612
ATURE+1.526
multiple+1.519
Neg logit contributions
龍�-7.927
-7.720
uates-7.699
cos-7.400
 reluct-7.329
Token fossils
Token ID34066
Feature activation-8.63e-02

Pos logit contributions
attery+2.808
 Interest+2.638
iverse+2.564
ATURE+2.550
ゴン+2.535
Neg logit contributions
龍�-3.982
-3.885
uates-3.856
cos-3.664
ero-3.563
Token from
Token ID422
Feature activation-3.63e-01

Pos logit contributions
龍�+0.217
uates+0.216
+0.213
cos+0.207
ero+0.200
Neg logit contributions
attery-0.225
ゴン-0.213
 Interest-0.210
ukong-0.208
ATURE-0.206
Token this
Token ID428
Feature activation+3.49e-01

Pos logit contributions
uates+1.014
龍�+0.998
+0.990
cos+0.969
ero+0.948
Neg logit contributions
attery-0.856
ゴン-0.817
ukong-0.785
 Interest-0.784
ATURE-0.768
Token region
Token ID3814
Feature activation+3.41e-01

Pos logit contributions
attery+0.836
ゴン+0.788
 Interest+0.770
ukong+0.765
ATURE+0.751
Neg logit contributions
uates-0.960
龍�-0.948
-0.938
cos-0.905
ero-0.882
Token e
Token ID304
Feature activation+3.44e-01

Pos logit contributions
attery+0.634
ゴン+0.606
 Interest+0.589
ukong+0.580
multiple+0.572
Neg logit contributions
uates-0.679
龍�-0.670
-0.664
cos-0.644
ero-0.627
Tokeno
Token ID78
Feature activation+3.29e-02

Pos logit contributions
attery+0.878
ゴン+0.842
 Interest+0.815
ukong+0.806
orf+0.802
Neg logit contributions
uates-0.882
龍�-0.872
-0.864
cos-0.829
ero-0.808
Token ak
Token ID47594
Feature activation-2.36e-01

Pos logit contributions
attery+0.085
ゴン+0.081
 Interest+0.078
ukong+0.078
orf+0.077
Neg logit contributions
uates-0.084
龍�-0.083
-0.082
cos-0.080
ero-0.078
Tokenula
Token ID4712
Feature activation+4.98e-01

Pos logit contributions
uates+0.653
龍�+0.645
+0.639
cos+0.625
ero+0.609
Neg logit contributions
attery-0.561
ゴン-0.534
ukong-0.514
 Interest-0.512
orf-0.503
Token ka
Token ID38387
Feature activation+7.36e-01

Pos logit contributions
attery+1.263
ゴン+1.209
 Interest+1.175
ukong+1.160
orf+1.150
Neg logit contributions
uates-1.283
龍�-1.272
-1.257
cos-1.216
ero-1.176
Token lan
Token ID26992
Feature activation+1.94e+00

Pos logit contributions
attery+1.904
ゴン+1.838
 Interest+1.776
ukong+1.750
orf+1.739
Neg logit contributions
uates-1.846
龍�-1.833
-1.810
cos-1.749
ero-1.675
Tokenak
Token ID461
Feature activation+6.96e-03

Pos logit contributions
attery+5.135
ゴン+4.822
iverse+4.739
ukong+4.724
orf+4.701
Neg logit contributions
龍�-4.513
-4.495
uates-4.407
cos-4.114
 reluct-3.965
Tokenila
Token ID10102
Feature activation+3.43e-01

Pos logit contributions
attery+0.019
ゴン+0.018
ukong+0.017
 Interest+0.017
orf+0.017
Neg logit contributions
uates-0.017
龍�-0.017
-0.017
cos-0.016
ero-0.016
Token i
Token ID1312
Feature activation+1.84e-01

Pos logit contributions
attery+0.877
ゴン+0.839
 Interest+0.813
ukong+0.802
orf+0.797
Neg logit contributions
uates-0.881
龍�-0.872
-0.864
cos-0.833
ero-0.806
Tokenā
Token ID10235
Feature activation+4.16e-01

Pos logit contributions
attery+0.466
ゴン+0.450
 Interest+0.434
ukong+0.430
orf+0.423
Neg logit contributions
uates-0.476
龍�-0.468
-0.465
cos-0.450
ero-0.438
Token Kal
Token ID12612
Feature activation+1.75e-01

Pos logit contributions
attery+1.078
ゴン+1.034
 Interest+1.003
ukong+0.994
orf+0.982
Neg logit contributions
uates-1.047
龍�-1.036
-1.024
cos-0.984
ero-0.956
Token Maybe
Token ID6674
Feature activation+7.18e-01

Pos logit contributions
attery+0.757
 Interest+0.716
ゴン+0.705
ukong+0.687
ATURE+0.686
Neg logit contributions
uates-0.873
龍�-0.860
-0.847
cos-0.827
ero-0.815
Token their
Token ID511
Feature activation+1.03e-01

Pos logit contributions
attery+1.800
 Interest+1.688
ゴン+1.657
ukong+1.635
ATURE+1.619
Neg logit contributions
uates-1.872
龍�-1.864
-1.837
cos-1.785
ero-1.723
Token calls
Token ID3848
Feature activation+4.48e-01

Pos logit contributions
attery+0.254
ゴン+0.241
 Interest+0.235
ukong+0.232
ATURE+0.229
Neg logit contributions
uates-0.276
龍�-0.273
-0.269
cos-0.262
ero-0.256
Token will
Token ID481
Feature activation+1.22e-01

Pos logit contributions
attery+1.166
ゴン+1.087
 Interest+1.080
ukong+1.070
ATURE+1.056
Neg logit contributions
龍�-1.135
uates-1.132
-1.100
cos-1.068
ero-1.043
Token go
Token ID467
Feature activation+3.33e-01

Pos logit contributions
attery+0.315
ゴン+0.299
 Interest+0.292
ukong+0.291
orf+0.285
Neg logit contributions
uates-0.313
龍�-0.309
-0.305
cos-0.298
ero-0.290
Token un
Token ID555
Feature activation+1.93e+00

Pos logit contributions
attery+0.857
ゴン+0.810
 Interest+0.798
ukong+0.786
ATURE+0.774
Neg logit contributions
uates-0.857
龍�-0.853
-0.833
cos-0.816
ero-0.790
Tokenhe
Token ID258
Feature activation-1.69e-01

Pos logit contributions
attery+4.716
 Interest+4.354
multiple+4.340
ukong+4.275
abilities+4.264
Neg logit contributions
龍�-5.160
uates-5.012
-4.952
ero-4.679
cos-4.675
Tokeneded
Token ID15395
Feature activation+2.73e-01

Pos logit contributions
uates+0.543
龍�+0.529
+0.528
cos+0.522
ero+0.514
Neg logit contributions
attery-0.327
ゴン-0.316
 Interest-0.297
ukong-0.294
ATURE-0.288
Token
Token ID851
Feature activation+6.32e-01

Pos logit contributions
attery+0.672
ゴン+0.636
 Interest+0.619
ukong+0.614
orf+0.606
Neg logit contributions
uates-0.729
龍�-0.723
-0.713
cos-0.697
ero-0.680
Token certainly
Token ID3729
Feature activation+5.38e-01

Pos logit contributions
attery+1.444
ゴン+1.335
 Interest+1.328
ukong+1.299
ATURE+1.289
Neg logit contributions
uates-1.803
龍�-1.789
-1.764
cos-1.718
ero-1.682
Token the
Token ID262
Feature activation+3.67e-01

Pos logit contributions
attery+1.344
ゴン+1.259
 Interest+1.245
ukong+1.224
ATURE+1.210
Neg logit contributions
uates-1.416
龍�-1.405
-1.382
cos-1.354
ero-1.315
Token of
Token ID286
Feature activation+1.77e-01

Pos logit contributions
uates+0.495
龍�+0.489
+0.479
cos+0.468
ero+0.454
Neg logit contributions
attery-0.673
ゴン-0.649
 Interest-0.629
ukong-0.626
orf-0.619
Token good
Token ID922
Feature activation-3.33e-02

Pos logit contributions
attery+0.448
ゴン+0.422
 Interest+0.420
ukong+0.411
orf+0.409
Neg logit contributions
uates-0.462
龍�-0.454
-0.445
cos-0.435
ero-0.423
Token and
Token ID290
Feature activation-3.92e-01

Pos logit contributions
uates+0.089
龍�+0.087
+0.086
cos+0.084
ero+0.082
Neg logit contributions
attery-0.083
ゴン-0.079
 Interest-0.077
ukong-0.076
orf-0.075
Token bad
Token ID2089
Feature activation+1.75e-02

Pos logit contributions
uates+0.795
龍�+0.783
+0.762
cos+0.744
ero+0.719
Neg logit contributions
attery-1.228
ゴン-1.181
 Interest-1.150
ukong-1.143
ATURE-1.133
Token,
Token ID11
Feature activation-1.30e-01

Pos logit contributions
attery+0.046
ゴン+0.044
 Interest+0.043
ukong+0.043
orf+0.042
Neg logit contributions
uates-0.044
龍�-0.043
-0.042
cos-0.041
ero-0.040
Token equ
Token ID1602
Feature activation+1.93e+00

Pos logit contributions
uates+0.343
龍�+0.337
+0.333
cos+0.326
ero+0.318
Neg logit contributions
attery-0.328
ゴン-0.314
 Interest-0.303
ukong-0.302
ATURE-0.297
Tokenating
Token ID803
Feature activation+6.04e-01

Pos logit contributions
attery+3.764
abilities+3.399
orf+3.372
ukong+3.353
itars+3.309
Neg logit contributions
龍�-6.097
-5.994
uates-5.869
cos-5.721
ero-5.612
Token the
Token ID262
Feature activation-2.27e-01

Pos logit contributions
attery+1.513
ゴン+1.415
 Interest+1.410
ukong+1.382
orf+1.370
Neg logit contributions
uates-1.584
龍�-1.576
-1.537
cos-1.503
ero-1.455
Token self
Token ID2116
Feature activation+4.64e-01

Pos logit contributions
uates+0.618
龍�+0.607
+0.602
cos+0.589
ero+0.575
Neg logit contributions
attery-0.551
ゴン-0.526
 Interest-0.506
ukong-0.505
ATURE-0.495
Token-
Token ID12
Feature activation+4.37e-01

Pos logit contributions
attery+1.094
ゴン+1.028
 Interest+1.015
ukong+0.986
multiple+0.974
Neg logit contributions
uates-1.288
龍�-1.285
-1.251
cos-1.224
ero-1.203
Tokensupport
Token ID11284
Feature activation+3.13e-02

Pos logit contributions
attery+1.098
ゴン+1.026
 Interest+1.022
ukong+1.000
multiple+0.998
Neg logit contributions
uates-1.152
龍�-1.146
-1.127
cos-1.078
ero-1.061
Token is
Token ID318
Feature activation+1.77e-01

Pos logit contributions
attery+0.199
ゴン+0.190
ukong+0.184
 Interest+0.183
ATURE+0.180
Neg logit contributions
uates-0.190
龍�-0.187
-0.185
cos-0.181
ero-0.176
Token the
Token ID262
Feature activation-6.52e-02

Pos logit contributions
attery+0.455
ゴン+0.432
 Interest+0.421
ukong+0.420
ATURE+0.414
Neg logit contributions
uates-0.455
龍�-0.447
-0.441
cos-0.432
ero-0.420
Token Jets
Token ID14728
Feature activation+3.00e-01

Pos logit contributions
uates+0.175
龍�+0.173
+0.171
cos+0.167
ero+0.163
Neg logit contributions
attery-0.160
ゴン-0.152
 Interest-0.148
ukong-0.147
orf-0.145
Token
Token ID447
Feature activation+1.60e-01

Pos logit contributions
attery+0.726
ゴン+0.677
 Interest+0.666
ukong+0.659
ATURE+0.648
Neg logit contributions
uates-0.819
龍�-0.809
-0.798
cos-0.777
ero-0.761
Token
Token ID247
Feature activation+2.56e-01

Pos logit contributions
attery+0.408
ゴン+0.389
ukong+0.375
 Interest+0.374
ATURE+0.370
Neg logit contributions
uates-0.416
龍�-0.410
-0.405
cos-0.398
ero-0.387
Token un
Token ID555
Feature activation+1.92e+00

Pos logit contributions
attery+0.797
ゴン+0.755
 Interest+0.742
ukong+0.741
orf+0.730
Neg logit contributions
uates-0.522
龍�-0.517
-0.505
cos-0.483
ero-0.469
Tokenquest
Token ID6138
Feature activation-4.86e-01

Pos logit contributions
attery+5.003
iverse+4.730
 Interest+4.702
abilities+4.664
ukong+4.660
Neg logit contributions
龍�-4.564
uates-4.440
-4.401
ero-4.132
cos-4.066
Tokenioned
Token ID14994
Feature activation-1.68e-01

Pos logit contributions
uates+0.849
龍�+0.825
+0.816
cos+0.788
ero+0.761
Neg logit contributions
attery-1.652
ゴン-1.601
ukong-1.555
 Interest-1.554
ATURE-1.534
Token best
Token ID1266
Feature activation-1.85e-03

Pos logit contributions
uates+0.452
龍�+0.446
+0.441
cos+0.431
ero+0.421
Neg logit contributions
attery-0.413
ゴン-0.393
 Interest-0.381
ukong-0.380
ATURE-0.372
Token sniper
Token ID26160
Feature activation+2.49e-01

Pos logit contributions
uates+0.005
龍�+0.005
+0.005
cos+0.005
ero+0.005
Neg logit contributions
attery-0.004
ゴン-0.004
 Interest-0.004
ukong-0.004
ATURE-0.004
Token,
Token ID11
Feature activation+8.28e-02

Pos logit contributions
attery+0.564
ゴン+0.532
 Interest+0.519
ukong+0.518
orf+0.507
Neg logit contributions
uates-0.710
龍�-0.707
-0.699
cos-0.679
ero-0.664
Token transit
Token ID11168
Feature activation-1.57e-01

Pos logit contributions
attery+0.700
ゴン+0.666
 Interest+0.650
ukong+0.642
orf+0.634
Neg logit contributions
uates-0.745
龍�-0.736
-0.728
cos-0.710
ero-0.690
Token corridors
Token ID32807
Feature activation-1.22e-01

Pos logit contributions
uates+0.431
龍�+0.428
+0.422
cos+0.413
ero+0.400
Neg logit contributions
attery-0.374
ゴン-0.356
 Interest-0.344
ukong-0.342
orf-0.338
Token and
Token ID290
Feature activation+6.88e-02

Pos logit contributions
uates+0.327
龍�+0.322
+0.319
cos+0.312
ero+0.304
Neg logit contributions
attery-0.302
ゴン-0.288
 Interest-0.278
ukong-0.277
ATURE-0.272
Token major
Token ID1688
Feature activation-3.67e-01

Pos logit contributions
attery+0.179
ゴン+0.171
 Interest+0.166
ukong+0.165
orf+0.163
Neg logit contributions
uates-0.174
龍�-0.173
-0.170
cos-0.166
ero-0.161
Token commercial
Token ID5068
Feature activation+7.95e-01

Pos logit contributions
uates+0.881
龍�+0.867
+0.857
cos+0.834
ero+0.811
Neg logit contributions
attery-1.007
ゴン-0.967
 Interest-0.936
ukong-0.934
orf-0.918
Token bou
Token ID35833
Feature activation+1.91e+00

Pos logit contributions
attery+1.887
 Interest+1.797
ゴン+1.791
multiple+1.738
orf+1.733
Neg logit contributions
龍�-2.170
uates-2.141
-2.109
cos-2.022
ero-1.948
Tokenlev
Token ID2768
Feature activation+1.15e-01

Pos logit contributions
attery+5.751
orf+5.393
multiple+5.390
 Interest+5.365
itars+5.313
Neg logit contributions
龍�-3.926
uates-3.887
-3.864
 Pradesh-3.385
 reluct-3.370
Tokenards
Token ID1371
Feature activation-4.32e-02

Pos logit contributions
attery+0.174
ゴン+0.163
ukong+0.152
 Interest+0.151
ATURE+0.147
Neg logit contributions
uates-0.416
龍�-0.411
-0.408
cos-0.402
ero-0.395
Token.
Token ID13
Feature activation+1.94e-01

Pos logit contributions
uates+0.112
龍�+0.110
+0.109
cos+0.107
ero+0.104
Neg logit contributions
attery-0.110
ゴン-0.105
 Interest-0.102
ukong-0.101
orf-0.100
Token This
Token ID770
Feature activation+2.12e-01

Pos logit contributions
attery+0.472
ゴン+0.448
 Interest+0.446
ukong+0.432
ATURE+0.426
Neg logit contributions
uates-0.523
龍�-0.514
-0.508
cos-0.498
ero-0.486
Token proposal
Token ID6961
Feature activation-2.87e-01

Pos logit contributions
attery+0.525
ゴン+0.497
 Interest+0.487
ukong+0.482
ATURE+0.475
Neg logit contributions
uates-0.564
龍�-0.556
-0.551
cos-0.539
ero-0.523
Token)
Token ID8
Feature activation+7.18e-01

Pos logit contributions
attery+1.599
ゴン+1.513
 Interest+1.483
ukong+1.466
multiple+1.445
Neg logit contributions
uates-1.900
龍�-1.889
-1.856
cos-1.816
ero-1.777
Token would
Token ID561
Feature activation+1.14e-01

Pos logit contributions
attery+1.739
ゴン+1.631
 Interest+1.626
ukong+1.601
multiple+1.576
Neg logit contributions
龍�-1.913
uates-1.901
-1.868
cos-1.823
ero-1.788
Token be
Token ID307
Feature activation+2.09e-01

Pos logit contributions
attery+0.297
ゴン+0.284
 Interest+0.275
ukong+0.275
ATURE+0.269
Neg logit contributions
uates-0.287
龍�-0.283
-0.280
cos-0.273
ero-0.266
Token an
Token ID281
Feature activation-9.78e-02

Pos logit contributions
attery+0.526
ゴン+0.503
 Interest+0.491
ukong+0.489
ATURE+0.477
Neg logit contributions
uates-0.548
龍�-0.542
-0.534
cos-0.521
ero-0.506
Token al
Token ID435
Feature activation-1.86e-01

Pos logit contributions
uates+0.274
龍�+0.270
+0.268
cos+0.261
ero+0.256
Neg logit contributions
attery-0.228
ゴン-0.219
 Interest-0.211
ukong-0.209
ATURE-0.206
Tokenbat
Token ID8664
Feature activation+1.91e+00

Pos logit contributions
uates+0.692
龍�+0.684
+0.679
cos+0.665
ero+0.655
Neg logit contributions
attery-0.269
ゴン-0.248
ukong-0.233
 Interest-0.231
orf-0.224
Tokenross
Token ID1214
Feature activation+6.65e-01

Pos logit contributions
attery+2.748
orf+2.491
ゴン+2.419
 Interest+2.404
ukong+2.369
Neg logit contributions
龍�-7.002
uates-6.968
-6.850
cos-6.594
 reluct-6.400
Token on
Token ID319
Feature activation-1.74e-01

Pos logit contributions
attery+1.484
ゴン+1.379
 Interest+1.363
ukong+1.331
orf+1.330
Neg logit contributions
龍�-1.920
uates-1.902
-1.843
cos-1.808
ero-1.749
Token Miami
Token ID8437
Feature activation+1.04e+00

Pos logit contributions
uates+0.488
龍�+0.482
+0.477
cos+0.466
ero+0.455
Neg logit contributions
attery-0.410
ゴン-0.389
 Interest-0.377
ukong-0.376
ATURE-0.368
Token
Token ID447
Feature activation+1.50e-01

Pos logit contributions
attery+2.488
ゴン+2.303
 Interest+2.303
ukong+2.293
ATURE+2.203
Neg logit contributions
uates-2.800
龍�-2.764
-2.719
cos-2.679
ero-2.577
Token
Token ID247
Feature activation+6.97e-03

Pos logit contributions
attery+0.373
ゴン+0.355
ukong+0.341
 Interest+0.341
ATURE+0.336
Neg logit contributions
uates-0.400
龍�-0.395
-0.391
cos-0.385
ero-0.374
Token have
Token ID423
Feature activation+1.68e-01

Pos logit contributions
attery+1.484
 Interest+1.367
ukong+1.359
orf+1.352
ATURE+1.337
Neg logit contributions
龍�-1.501
uates-1.481
-1.476
cos-1.400
ero-1.360
Token changed
Token ID3421
Feature activation-2.20e-01

Pos logit contributions
attery+0.411
ゴン+0.384
 Interest+0.382
ukong+0.379
ATURE+0.375
Neg logit contributions
uates-0.448
龍�-0.447
-0.442
cos-0.427
ero-0.414
Token the
Token ID262
Feature activation-5.26e-01

Pos logit contributions
uates+0.610
龍�+0.603
+0.597
cos+0.583
ero+0.570
Neg logit contributions
attery-0.523
ゴン-0.496
 Interest-0.480
ukong-0.478
ATURE-0.470
Token understanding
Token ID4547
Feature activation+9.92e-02

Pos logit contributions
uates+1.324
龍�+1.300
+1.289
cos+1.257
ero+1.226
Neg logit contributions
attery-1.386
ゴン-1.328
 Interest-1.282
ukong-1.281
ATURE-1.258
Token of
Token ID286
Feature activation+7.66e-02

Pos logit contributions
attery+0.280
ゴン+0.266
 Interest+0.261
ukong+0.261
abilities+0.258
Neg logit contributions
龍�-0.228
uates-0.228
-0.227
cos-0.214
ero-0.209
Token pr
Token ID778
Feature activation+1.91e+00

Pos logit contributions
attery+0.175
ゴン+0.165
 Interest+0.160
ukong+0.159
ATURE+0.157
Neg logit contributions
uates-0.219
龍�-0.217
-0.214
cos-0.208
ero-0.203
Tokenimate
Token ID1920
Feature activation+1.38e+00

Pos logit contributions
attery+1.638
ukong+1.196
ATURE+1.084
 Interest+1.058
multiple+1.057
Neg logit contributions
龍�-8.248
uates-8.067
-8.015
 reluct-7.781
cos-7.776
Token evolution
Token ID6954
Feature activation+1.06e+00

Pos logit contributions
attery+2.553
 Interest+2.371
ATURE+2.307
ゴン+2.299
iverse+2.296
Neg logit contributions
龍�-4.511
uates-4.393
-4.373
cos-4.126
ero-4.042
Token in
Token ID287
Feature activation+2.82e-01

Pos logit contributions
attery+2.215
ゴン+2.050
 Interest+2.037
orf+2.012
abilities+2.007
Neg logit contributions
龍�-3.205
-3.138
uates-3.124
cos-2.981
ero-2.915
Token this
Token ID428
Feature activation+3.97e-01

Pos logit contributions
attery+0.687
ゴン+0.648
 Interest+0.633
ukong+0.628
multiple+0.620
Neg logit contributions
uates-0.760
龍�-0.754
-0.749
cos-0.721
ero-0.704
Token region
Token ID3814
Feature activation+2.77e-01

Pos logit contributions
attery+0.951
ゴン+0.899
 Interest+0.881
ukong+0.869
ATURE+0.859
Neg logit contributions
uates-1.091
龍�-1.077
-1.069
cos-1.026
ero-1.005
Token
Token ID198
Feature activation-1.67e-01

Pos logit contributions
uates+0.544
+0.531
龍�+0.531
cos+0.525
ero+0.509
Neg logit contributions
attery-0.552
ゴン-0.543
ukong-0.511
 Interest-0.504
itars-0.500
TokenMy
Token ID3666
Feature activation-4.14e-01

Pos logit contributions
uates+0.400
龍�+0.393
+0.386
cos+0.375
ero+0.367
Neg logit contributions
attery-0.461
ゴン-0.441
 Interest-0.427
ukong-0.425
multiple-0.422
Token colleague
Token ID16008
Feature activation+7.63e-01

Pos logit contributions
uates+1.041
龍�+1.022
+1.013
cos+0.990
ero+0.966
Neg logit contributions
attery-1.090
ゴン-1.046
ukong-1.009
 Interest-1.007
ATURE-0.990
Token Amer
Token ID33391
Feature activation-6.70e-01

Pos logit contributions
attery+1.877
ゴン+1.772
 Interest+1.767
ukong+1.716
orf+1.694
Neg logit contributions
龍�-2.021
uates-2.018
-1.981
cos-1.911
ero-1.855
Token A
Token ID317
Feature activation-3.87e-01

Pos logit contributions
uates+1.711
龍�+1.679
+1.665
cos+1.630
ero+1.591
Neg logit contributions
attery-1.734
ゴン-1.665
ukong-1.601
 Interest-1.600
ATURE-1.573
Tokenther
Token ID490
Feature activation-2.62e+00

Pos logit contributions
uates+0.964
龍�+0.950
+0.939
cos+0.915
ero+0.891
Neg logit contributions
attery-1.028
ゴン-0.983
ukong-0.951
 Interest-0.949
orf-0.932
Token was
Token ID373
Feature activation+1.71e-02

Pos logit contributions
uates+4.255
cos+4.211
龍�+4.128
+4.100
ero+3.938
Neg logit contributions
attery-8.767
ゴン-8.662
ATURE-8.265
ukong-8.194
 Interest-8.163
Token responsible
Token ID4497
Feature activation+2.30e-01

Pos logit contributions
attery+0.045
ゴン+0.043
 Interest+0.042
ukong+0.041
ATURE+0.041
Neg logit contributions
uates-0.043
龍�-0.043
-0.042
cos-0.041
ero-0.040
Token for
Token ID329
Feature activation-2.78e-01

Pos logit contributions
attery+0.514
ゴン+0.486
 Interest+0.465
ukong+0.462
multiple+0.459
Neg logit contributions
uates-0.669
龍�-0.660
-0.653
cos-0.635
ero-0.619
Token testing
Token ID4856
Feature activation+1.74e-01

Pos logit contributions
uates+0.682
龍�+0.672
+0.664
cos+0.648
ero+0.628
Neg logit contributions
attery-0.747
ゴン-0.713
 Interest-0.694
ukong-0.689
ATURE-0.679
Token it
Token ID340
Feature activation-2.73e-01

Pos logit contributions
attery+0.432
ゴン+0.406
 Interest+0.401
ukong+0.394
multiple+0.390
Neg logit contributions
uates-0.460
龍�-0.459
-0.450
cos-0.439
ero-0.426
Token
Token ID447
Feature activation+9.59e-03

Pos logit contributions
attery+0.844
ゴン+0.792
 Interest+0.785
ukong+0.773
ATURE+0.760
Neg logit contributions
uates-0.915
龍�-0.908
-0.897
cos-0.880
ero-0.855
Token
Token ID247
Feature activation-3.43e-01

Pos logit contributions
attery+0.023
ゴン+0.022
 Interest+0.021
ukong+0.021
ATURE+0.021
Neg logit contributions
uates-0.026
龍�-0.026
-0.025
cos-0.025
ero-0.024
Token 48
Token ID4764
Feature activation+4.71e-01

Pos logit contributions
uates+0.545
龍�+0.530
+0.522
cos+0.502
ero+0.483
Neg logit contributions
attery-1.220
ゴン-1.182
ukong-1.152
 Interest-1.152
ATURE-1.137
Token points
Token ID2173
Feature activation+7.71e-02

Pos logit contributions
attery+1.234
ゴン+1.174
 Interest+1.155
ukong+1.145
ATURE+1.125
Neg logit contributions
uates-1.178
龍�-1.164
-1.158
cos-1.119
ero-1.087
Token,
Token ID11
Feature activation-3.44e-01

Pos logit contributions
attery+0.194
ゴン+0.183
 Interest+0.180
ukong+0.179
orf+0.175
Neg logit contributions
uates-0.203
龍�-0.200
-0.196
cos-0.194
ero-0.189
Token McF
Token ID38222
Feature activation-2.53e+00

Pos logit contributions
uates+0.909
龍�+0.893
+0.886
cos+0.866
ero+0.847
Neg logit contributions
attery-0.861
ゴン-0.827
ukong-0.793
 Interest-0.792
ATURE-0.778
Tokenadden
Token ID38014
Feature activation-1.04e+00

Pos logit contributions
uates+4.705
cos+4.593
ero+4.484
龍�+4.343
oci+4.221
Neg logit contributions
ゴン-8.099
attery-7.780
ukong-7.389
 Interest-7.363
multiple-7.357
Token only
Token ID691
Feature activation+1.70e-01

Pos logit contributions
uates+2.553
龍�+2.485
+2.483
cos+2.421
ero+2.382
Neg logit contributions
attery-2.749
ゴン-2.698
ukong-2.547
 Interest-2.522
ATURE-2.505
Token accounted
Token ID17830
Feature activation-7.34e-01

Pos logit contributions
attery+0.426
ゴン+0.402
 Interest+0.395
ukong+0.392
ATURE+0.385
Neg logit contributions
uates-0.447
龍�-0.443
-0.436
cos-0.427
ero-0.416
Token for
Token ID329
Feature activation-1.42e-01

Pos logit contributions
uates+1.982
龍�+1.944
+1.931
cos+1.895
ero+1.853
Neg logit contributions
attery-1.793
ゴン-1.727
ukong-1.643
 Interest-1.639
orf-1.612
Token one
Token ID530
Feature activation-5.90e-01

Pos logit contributions
uates+0.376
龍�+0.370
+0.366
cos+0.359
ero+0.350
Neg logit contributions
attery-0.354
ゴン-0.339
ukong-0.326
 Interest-0.326
ATURE-0.320
Token.
Token ID13
Feature activation-4.36e-01

Pos logit contributions
attery+1.330
ゴン+1.272
 Interest+1.256
ukong+1.229
ATURE+1.214
Neg logit contributions
uates-1.325
龍�-1.308
-1.292
cos-1.240
ero-1.214
Token8
Token ID23
Feature activation+1.03e-01

Pos logit contributions
uates+1.125
龍�+1.104
+1.092
cos+1.069
ero+1.044
Neg logit contributions
attery-1.119
ゴン-1.072
 Interest-1.036
ukong-1.033
ATURE-1.015
Token
Token ID198
Feature activation-3.08e-01

Pos logit contributions
attery+0.278
ゴン+0.266
 Interest+0.261
ukong+0.257
orf+0.253
Neg logit contributions
uates-0.253
龍�-0.249
-0.245
cos-0.238
ero-0.233
Token
Token ID198
Feature activation-2.56e-01

Pos logit contributions
uates+0.781
+0.765
龍�+0.763
cos+0.755
ero+0.733
Neg logit contributions
attery-0.801
ゴン-0.788
ukong-0.741
 Interest-0.729
itars-0.725
Token7
Token ID22
Feature activation-1.29e-01

Pos logit contributions
uates+0.636
龍�+0.624
+0.616
cos+0.602
ero+0.588
Neg logit contributions
attery-0.680
ゴン-0.652
 Interest-0.629
ukong-0.629
ATURE-0.619
Token Erit
Token ID43349
Feature activation-2.45e+00

Pos logit contributions
uates+0.336
龍�+0.330
+0.327
cos+0.319
ero+0.311
Neg logit contributions
attery-0.330
ゴン-0.316
 Interest-0.307
ukong-0.305
orf-0.300
Tokenrea
Token ID21468
Feature activation-7.69e-01

Pos logit contributions
uates+4.228
cos+3.943
ero+3.826
+3.804
龍�+3.793
Neg logit contributions
ゴン-8.202
attery-8.181
ukong-7.857
ATURE-7.777
 Interest-7.731
Token
Token ID198
Feature activation-5.40e-01

Pos logit contributions
uates+2.005
龍�+1.962
+1.950
cos+1.908
ero+1.868
Neg logit contributions
attery-1.955
ゴン-1.881
ukong-1.805
 Interest-1.790
ATURE-1.766
Token
Token ID198
Feature activation-4.18e-01

Pos logit contributions
uates+1.321
+1.295
cos+1.290
龍�+1.282
ero+1.247
Neg logit contributions
ゴン-1.448
attery-1.446
ukong-1.344
itars-1.321
 Interest-1.311
Token129
Token ID18741
Feature activation+2.40e-01

Pos logit contributions
uates+1.038
龍�+1.019
+1.009
cos+0.986
ero+0.962
Neg logit contributions
attery-1.114
ゴン-1.069
ukong-1.031
 Interest-1.030
ATURE-1.013
Token.
Token ID13
Feature activation-5.38e-01

Pos logit contributions
attery+0.605
ゴン+0.581
 Interest+0.563
ukong+0.558
ATURE+0.549
Neg logit contributions
uates-0.631
龍�-0.618
-0.613
cos-0.595
ero-0.582
Token Europe
Token ID2031
Feature activation-5.64e-01

Pos logit contributions
attery+0.220
ゴン+0.208
 Interest+0.204
ukong+0.202
orf+0.199
Neg logit contributions
uates-0.216
龍�-0.214
-0.210
cos-0.205
ero-0.200
Token Russian
Token ID3394
Feature activation-1.79e-01

Pos logit contributions
uates+1.506
龍�+1.481
+1.469
cos+1.438
ero+1.405
Neg logit contributions
attery-1.397
ゴン-1.339
ukong-1.287
 Interest-1.283
ATURE-1.262
Token state
Token ID1181
Feature activation-5.91e-01

Pos logit contributions
uates+0.464
龍�+0.458
+0.452
cos+0.440
ero+0.431
Neg logit contributions
attery-0.456
ゴン-0.436
 Interest-0.425
ukong-0.421
orf-0.414
Token energy
Token ID2568
Feature activation+3.47e-02

Pos logit contributions
uates+1.447
龍�+1.416
+1.407
cos+1.384
ero+1.346
Neg logit contributions
attery-1.594
ゴン-1.533
ukong-1.481
 Interest-1.465
ATURE-1.446
Token company
Token ID1664
Feature activation+3.35e-02

Pos logit contributions
attery+0.092
ゴン+0.088
 Interest+0.086
ukong+0.085
orf+0.084
Neg logit contributions
uates-0.087
龍�-0.086
-0.085
cos-0.082
ero-0.080
Token Gaz
Token ID21347
Feature activation-2.34e+00

Pos logit contributions
attery+0.099
 Interest+0.095
ゴン+0.093
ukong+0.092
orf+0.092
Neg logit contributions
uates-0.073
龍�-0.073
-0.071
cos-0.068
ero-0.065
Tokenprom
Token ID16963
Feature activation-4.37e-01

Pos logit contributions
cos+4.058
uates+3.995
+3.924
ero+3.872
龍�+3.822
Neg logit contributions
attery-7.833
ゴン-7.781
ukong-7.481
ATURE-7.244
itars-7.208
Token has
Token ID468
Feature activation+4.95e-03

Pos logit contributions
uates+1.176
龍�+1.160
+1.144
cos+1.119
ero+1.094
Neg logit contributions
attery-1.074
ゴン-1.024
 Interest-0.991
ukong-0.987
orf-0.969
Token resumed
Token ID28291
Feature activation-5.57e-01

Pos logit contributions
attery+0.012
ゴン+0.011
 Interest+0.011
ukong+0.011
orf+0.011
Neg logit contributions
uates-0.013
龍�-0.013
-0.013
cos-0.013
ero-0.012
Token some
Token ID617
Feature activation-1.93e-01

Pos logit contributions
uates+1.346
龍�+1.327
+1.307
cos+1.276
ero+1.240
Neg logit contributions
attery-1.521
ゴン-1.458
 Interest-1.415
ukong-1.407
ATURE-1.385
Token gas
Token ID3623
Feature activation-1.23e+00

Pos logit contributions
uates+0.458
龍�+0.457
+0.443
cos+0.433
ero+0.417
Neg logit contributions
attery-0.531
ゴン-0.511
 Interest-0.501
ukong-0.492
orf-0.487
Token is
Token ID318
Feature activation+4.97e-01

Pos logit contributions
uates+1.807
龍�+1.765
+1.753
cos+1.717
ero+1.679
Neg logit contributions
attery-1.831
ゴン-1.768
ukong-1.702
 Interest-1.687
orf-1.660
Token what
Token ID644
Feature activation+3.61e-01

Pos logit contributions
attery+1.231
ゴン+1.149
 Interest+1.135
ATURE+1.115
ukong+1.114
Neg logit contributions
uates-1.320
龍�-1.306
-1.286
cos-1.249
ero-1.220
Token matters
Token ID6067
Feature activation-7.03e-01

Pos logit contributions
attery+0.913
ゴン+0.856
 Interest+0.856
ukong+0.833
ATURE+0.828
Neg logit contributions
uates-0.929
龍�-0.927
-0.914
cos-0.894
ero-0.874
Token most
Token ID749
Feature activation-8.96e-01

Pos logit contributions
uates+1.873
龍�+1.834
+1.822
cos+1.787
ero+1.746
Neg logit contributions
attery-1.743
ゴン-1.675
ukong-1.608
 Interest-1.597
ATURE-1.572
Token in
Token ID287
Feature activation-1.85e-01

Pos logit contributions
uates+2.337
龍�+2.277
+2.266
cos+2.227
ero+2.177
Neg logit contributions
attery-2.273
ゴン-2.183
ukong-2.101
 Interest-2.076
ATURE-2.051
Token geop
Token ID30324
Feature activation-2.33e+00

Pos logit contributions
uates+0.479
龍�+0.473
+0.468
cos+0.457
ero+0.445
Neg logit contributions
attery-0.473
ゴン-0.450
 Interest-0.438
ukong-0.436
ATURE-0.428
Tokenolitics
Token ID21704
Feature activation-1.19e+00

Pos logit contributions
cos+4.325
uates+4.258
龍�+4.250
+4.092
omet+3.991
Neg logit contributions
ゴン-7.491
attery-7.302
ukong-7.017
ATURE-6.909
 Interest-6.888
Token
Token ID851
Feature activation+4.81e-01

Pos logit contributions
uates+2.951
龍�+2.882
+2.874
cos+2.821
ero+2.732
Neg logit contributions
attery-3.159
ゴン-3.047
ukong-2.937
 Interest-2.885
ATURE-2.874
Token it
Token ID340
Feature activation+3.10e-03

Pos logit contributions
attery+1.219
ゴン+1.146
 Interest+1.137
ATURE+1.113
ukong+1.111
Neg logit contributions
uates-1.233
龍�-1.233
-1.218
cos-1.179
ero-1.146
Token is
Token ID318
Feature activation-1.93e-02

Pos logit contributions
attery+0.008
ゴン+0.007
 Interest+0.007
ukong+0.007
orf+0.007
Neg logit contributions
uates-0.008
龍�-0.008
-0.008
cos-0.008
ero-0.008
Token plain
Token ID8631
Feature activation-3.17e-01

Pos logit contributions
uates+0.046
龍�+0.045
+0.045
cos+0.043
ero+0.042
Neg logit contributions
attery-0.053
ゴン-0.050
 Interest-0.050
ukong-0.049
ATURE-0.049
Token during
Token ID1141
Feature activation+3.69e-01

Pos logit contributions
attery+0.313
ゴン+0.293
 Interest+0.284
ukong+0.283
multiple+0.279
Neg logit contributions
uates-0.382
龍�-0.378
-0.374
cos-0.364
ero-0.358
Token the
Token ID262
Feature activation+9.39e-02

Pos logit contributions
attery+0.909
ゴン+0.854
 Interest+0.832
orf+0.826
multiple+0.823
Neg logit contributions
uates-0.982
龍�-0.981
-0.966
cos-0.939
ero-0.912
Token murder
Token ID5123
Feature activation-1.85e-01

Pos logit contributions
attery+0.257
ゴン+0.245
 Interest+0.237
ukong+0.236
orf+0.235
Neg logit contributions
uates-0.227
龍�-0.223
-0.221
cos-0.214
ero-0.209
Token,
Token ID11
Feature activation-6.71e-02

Pos logit contributions
uates+0.478
龍�+0.473
+0.467
cos+0.455
ero+0.444
Neg logit contributions
attery-0.475
ゴン-0.456
 Interest-0.438
ukong-0.437
orf-0.431
Token suddenly
Token ID6451
Feature activation+2.59e-01

Pos logit contributions
uates+0.171
龍�+0.169
+0.167
cos+0.163
ero+0.159
Neg logit contributions
attery-0.173
ゴン-0.166
 Interest-0.160
ukong-0.160
orf-0.158
Tokentrans
Token ID7645
Feature activation-2.30e+00

Pos logit contributions
attery+0.660
ゴン+0.630
 Interest+0.612
ukong+0.609
orf+0.602
Neg logit contributions
龍�-0.666
uates-0.665
-0.654
cos-0.635
ero-0.622
Tokenforms
Token ID23914
Feature activation+1.96e-01

Pos logit contributions
uates+6.465
cos+6.125
+5.998
ero+5.900
龍�+5.826
Neg logit contributions
ゴン-5.304
attery-5.246
 Interest-4.859
ukong-4.831
ATURE-4.793
Token itself
Token ID2346
Feature activation-2.79e-01

Pos logit contributions
attery+0.426
ゴン+0.398
 Interest+0.386
ukong+0.385
orf+0.380
Neg logit contributions
龍�-0.581
uates-0.581
-0.570
cos-0.554
ero-0.544
Token into
Token ID656
Feature activation-3.49e-01

Pos logit contributions
uates+0.833
龍�+0.819
+0.814
cos+0.799
ero+0.782
Neg logit contributions
attery-0.601
ゴン-0.572
ukong-0.547
 Interest-0.546
ATURE-0.534
Token the
Token ID262
Feature activation-1.19e-01

Pos logit contributions
uates+0.927
龍�+0.912
+0.900
cos+0.878
ero+0.861
Neg logit contributions
attery-0.876
ゴン-0.833
 Interest-0.805
ukong-0.805
orf-0.791
Token a
Token ID257
Feature activation+4.15e-02

Pos logit contributions
uates+0.297
龍�+0.294
+0.289
cos+0.281
ero+0.275
Neg logit contributions
attery-0.314
ゴン-0.298
ukong-0.290
 Interest-0.289
orf-0.285
TokenThree
Token ID12510
Feature activation-6.55e-01

Pos logit contributions
attery+0.033
ゴン+0.032
ukong+0.031
 Interest+0.031
ATURE+0.030
Neg logit contributions
uates-0.032
龍�-0.031
-0.031
cos-0.030
ero-0.029
Token months
Token ID1933
Feature activation-2.59e-01

Pos logit contributions
uates+1.615
龍�+1.582
+1.568
cos+1.534
ero+1.496
Neg logit contributions
attery-1.756
ゴン-1.688
ukong-1.627
 Interest-1.621
ATURE-1.596
Token ago
Token ID2084
Feature activation-2.91e-01

Pos logit contributions
uates+0.624
龍�+0.614
+0.607
cos+0.592
ero+0.577
Neg logit contributions
attery-0.707
ゴン-0.677
 Interest-0.657
ukong-0.656
ATURE-0.646
Token students
Token ID2444
Feature activation-9.19e-01

Pos logit contributions
uates+0.772
龍�+0.761
+0.754
cos+0.737
ero+0.720
Neg logit contributions
attery-0.726
ゴン-0.694
 Interest-0.670
ukong-0.668
ATURE-0.656
Token at
Token ID379
Feature activation+1.57e-01

Pos logit contributions
uates+2.385
龍�+2.335
+2.323
cos+2.270
ero+2.219
Neg logit contributions
attery-2.339
ゴン-2.253
ukong-2.163
 Interest-2.148
ATURE-2.121
Token Ober
Token ID33753
Feature activation-2.26e+00

Pos logit contributions
attery+0.364
ゴン+0.343
 Interest+0.335
ukong+0.332
multiple+0.326
Neg logit contributions
uates-0.443
龍�-0.439
-0.434
cos-0.425
ero-0.414
Tokenlin
Token ID2815
Feature activation-6.51e-01

Pos logit contributions
uates+1.977
龍�+1.865
+1.823
cos+1.804
ero+1.633
Neg logit contributions
attery-9.523
ゴン-9.377
ukong-9.178
ATURE-9.072
 Interest-9.065
Token College
Token ID5535
Feature activation-1.12e+00

Pos logit contributions
uates+1.954
龍�+1.922
+1.908
cos+1.876
ero+1.838
Neg logit contributions
attery-1.399
ゴン-1.327
ukong-1.269
 Interest-1.264
ATURE-1.239
Token in
Token ID287
Feature activation-8.06e-03

Pos logit contributions
uates+2.970
龍�+2.905
+2.899
cos+2.840
ero+2.779
Neg logit contributions
attery-2.764
ゴン-2.660
ukong-2.541
 Interest-2.527
ATURE-2.501
Token Ohio
Token ID6835
Feature activation+9.10e-02

Pos logit contributions
uates+0.023
龍�+0.023
+0.023
cos+0.022
ero+0.021
Neg logit contributions
attery-0.018
ゴン-0.017
 Interest-0.017
ukong-0.017
orf-0.017
Token objected
Token ID29811
Feature activation-7.04e-01

Pos logit contributions
attery+0.244
ゴン+0.231
 Interest+0.227
ukong+0.226
orf+0.222
Neg logit contributions
uates-0.225
龍�-0.222
-0.216
cos-0.211
ero-0.205
Token it
Token ID340
Feature activation-3.99e-01

Pos logit contributions
attery+0.688
ゴン+0.653
 Interest+0.639
ukong+0.627
ATURE+0.624
Neg logit contributions
龍�-0.696
uates-0.690
-0.684
cos-0.671
ero-0.650
Token out
Token ID503
Feature activation-1.93e-02

Pos logit contributions
uates+1.055
龍�+1.034
+1.027
cos+1.005
ero+0.982
Neg logit contributions
attery-1.000
ゴン-0.959
ukong-0.923
 Interest-0.919
ATURE-0.904
Token to
Token ID284
Feature activation+2.67e-01

Pos logit contributions
uates+0.053
龍�+0.053
+0.053
cos+0.051
ero+0.050
Neg logit contributions
attery-0.045
ゴン-0.043
 Interest-0.042
ukong-0.042
multiple-0.041
Token focus
Token ID2962
Feature activation+1.07e+00

Pos logit contributions
attery+0.712
ゴン+0.674
 Interest+0.669
ukong+0.657
orf+0.650
Neg logit contributions
uates-0.655
龍�-0.653
-0.645
cos-0.626
ero-0.610
Token on
Token ID319
Feature activation-7.58e-01

Pos logit contributions
attery+2.345
 Interest+2.164
itars+2.074
ゴン+2.071
ATURE+2.070
Neg logit contributions
龍�-3.185
-3.063
uates-3.046
cos-3.029
ero-2.928
Token undergrad
Token ID43485
Feature activation-2.24e+00

Pos logit contributions
uates+1.894
龍�+1.847
+1.829
cos+1.798
ero+1.756
Neg logit contributions
attery-2.006
ゴン-1.943
ukong-1.863
 Interest-1.842
ATURE-1.821
Tokenuates
Token ID12632
Feature activation-1.18e+00

Pos logit contributions
uates+5.525
龍�+4.898
+4.880
cos+4.849
ero+4.760
Neg logit contributions
attery-6.182
ゴン-6.082
ukong-5.809
ATURE-5.657
multiple-5.633
Token.
Token ID13
Feature activation-1.83e-01

Pos logit contributions
uates+2.986
龍�+2.860
+2.853
cos+2.797
ero+2.729
Neg logit contributions
attery-3.110
ゴン-3.031
ukong-2.906
ATURE-2.829
 Interest-2.820
Token O
Token ID440
Feature activation-5.57e-01

Pos logit contributions
uates+0.496
龍�+0.489
+0.485
cos+0.473
ero+0.464
Neg logit contributions
attery-0.445
ゴン-0.423
 Interest-0.414
ukong-0.408
ATURE-0.402
Tokenurs
Token ID1834
Feature activation-1.80e-02

Pos logit contributions
uates+1.352
龍�+1.327
+1.315
cos+1.283
ero+1.250
Neg logit contributions
attery-1.515
ゴン-1.454
ukong-1.405
 Interest-1.404
ATURE-1.380
Token is
Token ID318
Feature activation+3.02e-01

Pos logit contributions
uates+0.046
龍�+0.046
+0.045
cos+0.044
ero+0.043
Neg logit contributions
attery-0.046
ゴン-0.044
 Interest-0.043
ukong-0.043
ATURE-0.042
Token transgender
Token ID10637
Feature activation-1.26e+00

Pos logit contributions
attery+0.635
ゴン+0.602
 Interest+0.595
orf+0.576
ukong+0.576
Neg logit contributions
uates-0.785
龍�-0.778
-0.758
cos-0.735
ero-0.720
Token identity
Token ID5369
Feature activation-7.77e-03

Pos logit contributions
uates+3.242
龍�+3.161
+3.157
cos+3.099
ero+3.025
Neg logit contributions
attery-3.241
ゴン-3.115
ukong-3.015
 Interest-2.964
ATURE-2.933
Token;
Token ID26
Feature activation+6.54e-01

Pos logit contributions
uates+0.020
龍�+0.020
+0.020
cos+0.019
ero+0.019
Neg logit contributions
attery-0.019
ゴン-0.018
 Interest-0.018
ukong-0.018
ATURE-0.018
Token when
Token ID618
Feature activation+2.92e-01

Pos logit contributions
attery+1.599
ゴン+1.498
 Interest+1.495
ukong+1.464
multiple+1.459
Neg logit contributions
uates-1.743
龍�-1.717
-1.711
cos-1.653
ero-1.621
Token a
Token ID257
Feature activation+3.89e-01

Pos logit contributions
attery+0.710
ゴン+0.666
 Interest+0.659
ukong+0.646
multiple+0.642
Neg logit contributions
uates-0.786
龍�-0.775
-0.769
cos-0.748
ero-0.727
Token transsexual
Token ID41482
Feature activation-2.20e+00

Pos logit contributions
attery+0.904
ゴン+0.849
 Interest+0.842
orf+0.828
ukong+0.825
Neg logit contributions
uates-1.091
龍�-1.077
-1.058
cos-1.019
ero-0.988
Token person
Token ID1048
Feature activation-3.82e-01

Pos logit contributions
uates+5.402
+5.111
龍�+5.068
cos+5.035
ero+4.996
Neg logit contributions
attery-5.823
ゴン-5.667
ukong-5.445
 Interest-5.329
itars-5.294
Token's
Token ID338
Feature activation-5.92e-01

Pos logit contributions
uates+1.033
龍�+1.026
+1.013
cos+0.988
ero+0.962
Neg logit contributions
attery-0.928
ゴン-0.882
 Interest-0.855
ukong-0.852
orf-0.837
Token internal
Token ID5387
Feature activation-4.95e-02

Pos logit contributions
uates+1.443
龍�+1.416
+1.403
cos+1.371
ero+1.336
Neg logit contributions
attery-1.606
ゴン-1.543
ukong-1.489
 Interest-1.487
ATURE-1.462
Token gender
Token ID5279
Feature activation-4.36e-01

Pos logit contributions
uates+0.133
龍�+0.132
+0.130
cos+0.127
ero+0.124
Neg logit contributions
attery-0.121
ゴン-0.115
 Interest-0.112
ukong-0.111
orf-0.109
Token identity
Token ID5369
Feature activation-3.74e-01

Pos logit contributions
uates+1.100
龍�+1.083
+1.072
cos+1.047
ero+1.021
Neg logit contributions
attery-1.140
ゴン-1.094
 Interest-1.054
ukong-1.054
ATURE-1.035
Token-
Token ID12
Feature activation+5.73e-01

Pos logit contributions
uates+0.854
龍�+0.840
+0.833
cos+0.816
ero+0.798
Neg logit contributions
attery-0.704
ゴン-0.671
ukong-0.644
 Interest-0.644
ATURE-0.631
Tokenold
Token ID727
Feature activation-2.94e-01

Pos logit contributions
attery+1.316
ゴン+1.213
 Interest+1.195
ukong+1.192
orf+1.183
Neg logit contributions
uates-1.636
龍�-1.619
-1.595
cos-1.539
ero-1.513
Token University
Token ID2059
Feature activation-9.95e-01

Pos logit contributions
uates+0.682
龍�+0.671
+0.666
cos+0.645
ero+0.621
Neg logit contributions
attery-0.828
ゴン-0.787
 Interest-0.775
ukong-0.768
orf-0.761
Token of
Token ID286
Feature activation+4.81e-01

Pos logit contributions
uates+2.098
龍�+2.052
+2.026
cos+1.975
ero+1.916
Neg logit contributions
attery-3.023
ゴン-2.916
ukong-2.826
 Interest-2.822
ATURE-2.782
Token Virginia
Token ID6025
Feature activation+6.20e-01

Pos logit contributions
attery+1.104
 Interest+1.035
ゴン+1.027
ATURE+1.001
ukong+0.999
Neg logit contributions
uates-1.368
龍�-1.352
-1.336
cos-1.306
ero-1.264
Token undergrad
Token ID43485
Feature activation-2.18e+00

Pos logit contributions
attery+1.710
 Interest+1.654
ukong+1.609
ゴン+1.603
orf+1.601
Neg logit contributions
-1.413
龍�-1.403
uates-1.395
cos-1.330
ero-1.252
Token who
Token ID508
Feature activation-5.15e-01

Pos logit contributions
uates+5.377
龍�+4.947
+4.884
ero+4.863
cos+4.832
Neg logit contributions
attery-5.874
ゴン-5.763
ukong-5.511
 Interest-5.453
abilities-5.397
Token served
Token ID4983
Feature activation+4.72e-01

Pos logit contributions
uates+1.241
龍�+1.215
+1.205
cos+1.178
ero+1.150
Neg logit contributions
attery-1.408
ゴン-1.356
ukong-1.305
 Interest-1.302
ATURE-1.283
Token 18
Token ID1248
Feature activation+9.71e-02

Pos logit contributions
attery+1.023
 Interest+0.954
ゴン+0.952
ukong+0.939
multiple+0.934
Neg logit contributions
uates-1.387
龍�-1.377
-1.364
cos-1.334
ero-1.292
Token months
Token ID1933
Feature activation-2.73e-01

Pos logit contributions
attery+0.252
ゴン+0.237
 Interest+0.234
ukong+0.232
multiple+0.230
Neg logit contributions
uates-0.246
-0.242
龍�-0.242
cos-0.233
ero-0.226
Token behind
Token ID2157
Feature activation+6.74e-02

Pos logit contributions
uates+0.775
龍�+0.763
+0.757
cos+0.740
ero+0.722
Neg logit contributions
attery-0.630
ゴン-0.596
 Interest-0.577
ukong-0.576
ATURE-0.565
Token Russian
Token ID3394
Feature activation+1.62e-01

Pos logit contributions
attery+0.525
 Interest+0.499
ゴン+0.494
ukong+0.479
orf+0.475
Neg logit contributions
uates-0.587
龍�-0.584
-0.568
cos-0.553
ero-0.548
Token TV
Token ID3195
Feature activation-6.17e-01

Pos logit contributions
attery+0.440
ゴン+0.419
 Interest+0.418
ukong+0.408
orf+0.404
Neg logit contributions
uates-0.390
龍�-0.388
-0.379
cos-0.365
ero-0.358
Token stations
Token ID8985
Feature activation-2.46e-02

Pos logit contributions
uates+1.655
龍�+1.624
+1.614
cos+1.582
ero+1.549
Neg logit contributions
attery-1.514
ゴン-1.458
ukong-1.397
 Interest-1.389
ATURE-1.366
Token showed
Token ID3751
Feature activation-5.27e-01

Pos logit contributions
uates+0.066
龍�+0.065
+0.064
cos+0.063
ero+0.061
Neg logit contributions
attery-0.061
ゴン-0.058
 Interest-0.056
ukong-0.056
orf-0.055
Token a
Token ID257
Feature activation+2.30e-01

Pos logit contributions
uates+1.458
龍�+1.435
+1.423
cos+1.393
ero+1.362
Neg logit contributions
attery-1.252
ゴン-1.194
ukong-1.147
 Interest-1.147
ATURE-1.124
Token Gaz
Token ID21347
Feature activation-2.17e+00

Pos logit contributions
attery+0.610
ゴン+0.575
 Interest+0.567
ukong+0.564
orf+0.555
Neg logit contributions
龍�-0.575
uates-0.573
-0.554
cos-0.538
ero-0.520
Tokenprom
Token ID16963
Feature activation-3.08e-01

Pos logit contributions
cos+3.545
ero+3.424
+3.370
uates+3.258
龍�+3.145
Neg logit contributions
ゴン-7.479
attery-7.367
ukong-6.929
ATURE-6.927
itars-6.906
Token official
Token ID1743
Feature activation-3.39e-01

Pos logit contributions
uates+0.755
龍�+0.743
+0.727
cos+0.710
ero+0.689
Neg logit contributions
attery-0.835
ゴン-0.795
 Interest-0.777
ukong-0.772
orf-0.759
Token ordering
Token ID16216
Feature activation+1.86e-01

Pos logit contributions
uates+0.842
龍�+0.828
+0.819
cos+0.800
ero+0.779
Neg logit contributions
attery-0.904
ゴン-0.865
 Interest-0.836
ukong-0.835
ATURE-0.821
Token a
Token ID257
Feature activation+2.03e-01

Pos logit contributions
attery+0.479
ゴン+0.453
 Interest+0.450
ukong+0.443
multiple+0.439
Neg logit contributions
龍�-0.475
uates-0.472
-0.461
cos-0.446
ero-0.432
Token res
Token ID581
Feature activation-3.18e-01

Pos logit contributions
attery+0.511
ゴン+0.484
 Interest+0.472
ukong+0.470
orf+0.461
Neg logit contributions
uates-0.534
龍�-0.533
-0.516
cos-0.504
ero-0.487
Token the
Token ID262
Feature activation+1.44e-01

Pos logit contributions
attery+0.712
ゴン+0.663
 Interest+0.662
ATURE+0.653
ukong+0.651
Neg logit contributions
uates-0.726
龍�-0.722
-0.714
cos-0.692
ero-0.672
Token S
Token ID311
Feature activation-2.65e-01

Pos logit contributions
attery+0.372
ゴン+0.349
 Interest+0.348
ukong+0.341
ATURE+0.341
Neg logit contributions
uates-0.366
龍�-0.361
-0.357
cos-0.347
ero-0.337
TokenPA
Token ID4537
Feature activation+5.17e-02

Pos logit contributions
uates+0.656
龍�+0.645
+0.642
cos+0.620
ero+0.602
Neg logit contributions
attery-0.706
ゴン-0.669
 Interest-0.653
ukong-0.650
ATURE-0.643
Token,
Token ID11
Feature activation+8.58e-02

Pos logit contributions
attery+0.129
ゴン+0.121
 Interest+0.120
ukong+0.118
ATURE+0.118
Neg logit contributions
uates-0.136
龍�-0.135
-0.133
cos-0.130
ero-0.127
Token Koreans
Token ID25999
Feature activation-9.05e-01

Pos logit contributions
attery+0.205
ゴン+0.193
 Interest+0.191
ukong+0.187
ATURE+0.187
Neg logit contributions
uates-0.235
龍�-0.232
-0.231
cos-0.225
ero-0.219
Token cast
Token ID3350
Feature activation-2.12e+00

Pos logit contributions
uates+2.267
龍�+2.215
+2.204
cos+2.154
ero+2.107
Neg logit contributions
attery-2.372
ゴン-2.304
ukong-2.216
 Interest-2.179
orf-2.161
Token their
Token ID511
Feature activation-5.66e-02

Pos logit contributions
uates+5.308
+5.152
龍�+5.129
cos+4.957
ero+4.949
Neg logit contributions
ゴン-5.444
attery-5.429
ukong-5.127
orf-5.017
 Interest-4.958
Token ballots
Token ID20103
Feature activation-9.62e-01

Pos logit contributions
uates+0.158
龍�+0.155
+0.154
cos+0.151
ero+0.147
Neg logit contributions
attery-0.134
ゴン-0.126
 Interest-0.123
ukong-0.121
ATURE-0.121
Token again
Token ID757
Feature activation+4.47e-01

Pos logit contributions
uates+2.608
龍�+2.547
+2.536
cos+2.487
ero+2.435
Neg logit contributions
attery-2.336
ゴン-2.252
ukong-2.162
 Interest-2.129
orf-2.110
Token.
Token ID13
Feature activation-1.71e-01

Pos logit contributions
attery+1.010
ゴン+0.941
 Interest+0.931
ukong+0.906
multiple+0.902
Neg logit contributions
uates-1.281
龍�-1.277
-1.260
cos-1.224
ero-1.199
Token The
Token ID383
Feature activation+9.77e-03

Pos logit contributions
uates+0.459
龍�+0.451
+0.447
cos+0.438
ero+0.428
Neg logit contributions
attery-0.420
ゴン-0.401
 Interest-0.388
ukong-0.386
ATURE-0.380
Token Finnish
Token ID26838
Feature activation-8.18e-01

Pos logit contributions
attery+0.285
ゴン+0.271
 Interest+0.268
ukong+0.262
orf+0.262
Neg logit contributions
uates-0.267
龍�-0.264
-0.260
cos-0.252
ero-0.246
Token parliament
Token ID8540
Feature activation-1.08e-01

Pos logit contributions
uates+2.164
龍�+2.123
+2.110
cos+2.065
ero+2.016
Neg logit contributions
attery-2.047
ゴン-1.963
ukong-1.890
 Interest-1.870
ATURE-1.842
Token in
Token ID287
Feature activation-1.31e-02

Pos logit contributions
uates+0.290
龍�+0.287
+0.283
cos+0.278
ero+0.271
Neg logit contributions
attery-0.264
ゴン-0.251
 Interest-0.243
ukong-0.242
ATURE-0.239
Token Helsinki
Token ID47688
Feature activation-1.37e+00

Pos logit contributions
uates+0.035
龍�+0.035
+0.034
cos+0.034
ero+0.033
Neg logit contributions
attery-0.032
ゴン-0.031
 Interest-0.030
ukong-0.030
orf-0.029
Token,
Token ID11
Feature activation-3.72e-01

Pos logit contributions
uates+3.441
+3.363
龍�+3.357
cos+3.269
ero+3.190
Neg logit contributions
attery-3.578
ゴン-3.461
ukong-3.318
 Interest-3.272
ATURE-3.246
Token Ves
Token ID36448
Feature activation-2.12e+00

Pos logit contributions
uates+0.960
龍�+0.944
+0.935
cos+0.914
ero+0.892
Neg logit contributions
attery-0.954
ゴン-0.912
 Interest-0.882
ukong-0.880
ATURE-0.864
Tokena
Token ID64
Feature activation-8.28e-01

Pos logit contributions
uates+4.919
cos+4.809
ero+4.683
+4.650
龍�+4.609
Neg logit contributions
attery-5.815
ゴン-5.789
ukong-5.389
ATURE-5.329
 Interest-5.260
Token-
Token ID12
Feature activation-5.90e-01

Pos logit contributions
uates+2.106
龍�+2.063
+2.048
cos+2.005
ero+1.965
Neg logit contributions
attery-2.148
ゴン-2.064
ukong-1.985
 Interest-1.976
ATURE-1.949
TokenMatt
Token ID13448
Feature activation-5.93e-01

Pos logit contributions
uates+1.450
龍�+1.422
+1.410
cos+1.378
ero+1.343
Neg logit contributions
attery-1.587
ゴン-1.524
ukong-1.470
 Interest-1.469
ATURE-1.444
Tokeni
Token ID72
Feature activation-3.85e-01

Pos logit contributions
uates+1.544
龍�+1.518
+1.504
cos+1.471
ero+1.436
Neg logit contributions
attery-1.508
ゴン-1.445
ukong-1.391
 Interest-1.390
ATURE-1.364
Token Sa
Token ID10318
Feature activation-4.87e-01

Pos logit contributions
uates+0.979
龍�+0.964
+0.954
cos+0.931
ero+0.906
Neg logit contributions
attery-1.002
ゴン-0.959
 Interest-0.927
ukong-0.925
orf-0.908
Token Monaco
Token ID31630
Feature activation-4.50e-01

Pos logit contributions
uates+1.124
龍�+1.104
+1.094
cos+1.070
ero+1.045
Neg logit contributions
attery-1.064
ゴン-1.017
 Interest-0.981
ukong-0.979
ATURE-0.961
Token in
Token ID287
Feature activation-2.80e-02

Pos logit contributions
uates+1.231
龍�+1.209
+1.200
cos+1.175
ero+1.151
Neg logit contributions
attery-1.084
ゴン-1.038
ukong-0.995
 Interest-0.990
ATURE-0.976
Token 2008
Token ID3648
Feature activation+6.08e-02

Pos logit contributions
uates+0.064
龍�+0.063
+0.062
cos+0.060
ero+0.058
Neg logit contributions
attery-0.080
ゴン-0.076
 Interest-0.075
ukong-0.074
ATURE-0.073
Token,
Token ID11
Feature activation-1.84e-01

Pos logit contributions
attery+0.147
ゴン+0.140
 Interest+0.136
ukong+0.135
orf+0.132
Neg logit contributions
uates-0.166
龍�-0.163
-0.161
cos-0.157
ero-0.154
Token president
Token ID1893
Feature activation-5.45e-01

Pos logit contributions
uates+0.469
龍�+0.458
+0.454
cos+0.444
ero+0.432
Neg logit contributions
attery-0.478
ゴン-0.457
 Interest-0.444
ukong-0.441
ATURE-0.433
Token Jerome
Token ID33617
Feature activation-2.11e+00

Pos logit contributions
uates+1.395
龍�+1.372
+1.358
cos+1.327
ero+1.295
Neg logit contributions
attery-1.407
ゴン-1.348
 Interest-1.300
ukong-1.299
ATURE-1.276
Token de
Token ID390
Feature activation+1.17e-02

Pos logit contributions
uates+5.254
龍�+5.122
cos+5.092
+5.013
ero+4.921
Neg logit contributions
attery-5.469
ゴン-5.428
ukong-5.167
ATURE-5.070
multiple-4.996
Token B
Token ID347
Feature activation+3.13e-01

Pos logit contributions
attery+0.028
ゴン+0.027
ukong+0.026
 Interest+0.025
ATURE+0.025
Neg logit contributions
uates-0.032
龍�-0.032
-0.032
cos-0.031
ero-0.030
Tokenont
Token ID756
Feature activation-5.05e-01

Pos logit contributions
attery+0.699
ukong+0.627
 Interest+0.623
ゴン+0.623
orf+0.620
Neg logit contributions
龍�-0.934
uates-0.917
-0.915
cos-0.881
ero-0.854
Tokenin
Token ID259
Feature activation+9.06e-02

Pos logit contributions
uates+1.231
龍�+1.207
+1.193
cos+1.172
ero+1.145
Neg logit contributions
attery-1.370
ゴン-1.320
ukong-1.271
 Interest-1.264
ATURE-1.251
Token wanted
Token ID2227
Feature activation+2.61e-04

Pos logit contributions
attery+0.246
ゴン+0.235
ukong+0.228
 Interest+0.228
orf+0.224
Neg logit contributions
uates-0.220
龍�-0.217
-0.215
cos-0.208
ero-0.202
Token.
Token ID13
Feature activation-6.11e-02

Pos logit contributions
uates+1.297
龍�+1.273
+1.263
cos+1.235
ero+1.207
Neg logit contributions
attery-1.247
ゴン-1.196
ukong-1.151
 Interest-1.149
ATURE-1.128
Token Host
Token ID14504
Feature activation+9.32e-02

Pos logit contributions
uates+0.163
龍�+0.162
+0.159
cos+0.155
ero+0.152
Neg logit contributions
attery-0.151
ゴン-0.142
 Interest-0.141
ukong-0.138
ATURE-0.136
Tokenesses
Token ID44667
Feature activation-5.20e-01

Pos logit contributions
attery+0.296
ゴン+0.283
 Interest+0.280
ukong+0.277
orf+0.276
Neg logit contributions
龍�-0.181
uates-0.179
-0.176
cos-0.169
ero-0.164
Token in
Token ID287
Feature activation-2.53e-02

Pos logit contributions
uates+1.265
龍�+1.236
+1.229
cos+1.203
ero+1.175
Neg logit contributions
attery-1.408
ゴン-1.361
ukong-1.309
 Interest-1.304
ATURE-1.283
Token white
Token ID2330
Feature activation-7.09e-01

Pos logit contributions
uates+0.055
龍�+0.055
+0.053
cos+0.052
ero+0.050
Neg logit contributions
attery-0.075
ゴン-0.071
 Interest-0.070
ukong-0.069
orf-0.069
Token gown
Token ID33323
Feature activation-2.11e+00

Pos logit contributions
uates+1.713
龍�+1.683
+1.665
cos+1.625
ero+1.582
Neg logit contributions
attery-1.934
ゴン-1.857
 Interest-1.793
ukong-1.793
ATURE-1.762
Tokens
Token ID82
Feature activation-5.10e-01

Pos logit contributions
uates+1.942
+1.747
cos+1.729
龍�+1.679
ero+1.589
Neg logit contributions
attery-8.810
ゴン-8.736
ukong-8.437
 Interest-8.418
ATURE-8.280
Token and
Token ID290
Feature activation-3.20e-01

Pos logit contributions
uates+1.249
龍�+1.223
+1.214
cos+1.188
ero+1.159
Neg logit contributions
attery-1.373
ゴン-1.325
ukong-1.275
 Interest-1.272
ATURE-1.252
Token golden
Token ID10861
Feature activation-2.57e-01

Pos logit contributions
uates+0.727
龍�+0.717
+0.706
cos+0.685
ero+0.663
Neg logit contributions
attery-0.920
ゴン-0.882
 Interest-0.859
ukong-0.853
orf-0.843
Token s
Token ID264
Feature activation+7.29e-01

Pos logit contributions
uates+0.636
龍�+0.626
+0.617
cos+0.602
ero+0.584
Neg logit contributions
attery-0.684
ゴン-0.657
 Interest-0.637
ukong-0.633
orf-0.624
Tokenashes
Token ID7465
Feature activation+3.39e-01

Pos logit contributions
attery+1.726
ukong+1.578
ゴン+1.570
 Interest+1.562
orf+1.560
Neg logit contributions
龍�-2.014
uates-2.008
-1.986
cos-1.919
ero-1.863
Token of
Token ID286
Feature activation-2.15e-01

Pos logit contributions
uates+0.980
龍�+0.968
+0.958
cos+0.932
ero+0.908
Neg logit contributions
attery-1.085
ゴン-1.039
 Interest-1.011
ukong-1.007
ATURE-0.989
Token road
Token ID2975
Feature activation-3.97e-01

Pos logit contributions
uates+0.523
龍�+0.516
+0.510
cos+0.496
ero+0.483
Neg logit contributions
attery-0.581
ゴン-0.556
 Interest-0.541
ukong-0.539
ATURE-0.528
Token safety
Token ID3747
Feature activation-1.50e-01

Pos logit contributions
uates+1.209
龍�+1.195
+1.184
cos+1.157
ero+1.135
Neg logit contributions
attery-0.834
ゴン-0.788
ukong-0.758
 Interest-0.757
ATURE-0.736
Token,
Token ID11
Feature activation-6.89e-02

Pos logit contributions
uates+0.408
龍�+0.405
+0.400
cos+0.390
ero+0.381
Neg logit contributions
attery-0.361
ゴン-0.344
 Interest-0.334
ukong-0.331
ATURE-0.325
Token a
Token ID257
Feature activation-4.88e-01

Pos logit contributions
uates+0.184
龍�+0.183
+0.181
cos+0.176
ero+0.171
Neg logit contributions
attery-0.170
ゴン-0.160
 Interest-0.158
ukong-0.155
ATURE-0.153
Token draft
Token ID4538
Feature activation-2.10e+00

Pos logit contributions
uates+1.211
龍�+1.190
+1.178
cos+1.149
ero+1.118
Neg logit contributions
attery-1.302
ゴン-1.248
 Interest-1.207
ukong-1.205
ATURE-1.184
Token Road
Token ID5567
Feature activation-7.04e-01

Pos logit contributions
uates+5.519
龍�+5.412
+5.404
cos+5.315
ero+5.292
Neg logit contributions
attery-5.035
ゴン-4.998
ukong-4.693
itars-4.594
multiple-4.532
Token Transport
Token ID19940
Feature activation-6.48e-01

Pos logit contributions
uates+1.740
龍�+1.707
+1.693
cos+1.654
ero+1.614
Neg logit contributions
attery-1.881
ゴン-1.810
ukong-1.741
 Interest-1.737
ATURE-1.711
Token &
Token ID1222
Feature activation-2.35e-01

Pos logit contributions
uates+1.817
龍�+1.788
+1.773
cos+1.737
ero+1.699
Neg logit contributions
attery-1.518
ゴン-1.449
ukong-1.390
 Interest-1.388
ATURE-1.361
Token Safety
Token ID11233
Feature activation-4.76e-01

Pos logit contributions
uates+0.644
龍�+0.634
+0.628
cos+0.614
ero+0.600
Neg logit contributions
attery-0.565
ゴン-0.537
 Interest-0.521
ukong-0.518
ATURE-0.509
Token Bill
Token ID3941
Feature activation-7.93e-01

Pos logit contributions
uates+1.389
龍�+1.369
+1.356
cos+1.329
ero+1.300
Neg logit contributions
attery-1.063
ゴン-1.009
 Interest-0.974
ukong-0.968
ATURE-0.951
Token that
Token ID326
Feature activation+2.05e-01

Pos logit contributions
attery+1.391
ゴン+1.317
 Interest+1.312
ukong+1.261
multiple+1.259
Neg logit contributions
龍�-1.664
uates-1.662
-1.615
cos-1.578
ero-1.528
Token point
Token ID966
Feature activation+6.51e-01

Pos logit contributions
attery+0.480
ゴン+0.457
 Interest+0.445
ukong+0.435
ATURE+0.431
Neg logit contributions
uates-0.573
龍�-0.568
-0.558
cos-0.548
ero-0.531
Token,
Token ID11
Feature activation+1.74e-01

Pos logit contributions
attery+1.525
ゴン+1.455
 Interest+1.424
ukong+1.374
multiple+1.369
Neg logit contributions
uates-1.802
龍�-1.796
-1.754
cos-1.715
ero-1.682
Token and
Token ID290
Feature activation+7.31e-02

Pos logit contributions
attery+0.418
ゴン+0.395
 Interest+0.386
ukong+0.379
ATURE+0.376
Neg logit contributions
uates-0.479
龍�-0.474
-0.468
cos-0.458
ero-0.445
Token S
Token ID311
Feature activation-4.08e-01

Pos logit contributions
attery+0.189
ゴン+0.180
 Interest+0.175
ukong+0.174
ATURE+0.171
Neg logit contributions
uates-0.187
龍�-0.185
-0.182
cos-0.179
ero-0.173
Tokenibur
Token ID38616
Feature activation-2.09e+00

Pos logit contributions
uates+1.034
龍�+1.017
+1.007
cos+0.979
ero+0.952
Neg logit contributions
attery-1.070
ゴン-1.014
ukong-0.988
 Interest-0.985
orf-0.970
Token's
Token ID338
Feature activation-6.06e-01

Pos logit contributions
uates+4.842
cos+4.779
+4.735
ero+4.631
龍�+4.576
Neg logit contributions
attery-5.564
ゴン-5.551
ATURE-5.139
ukong-5.104
itars-5.084
Token annual
Token ID5079
Feature activation+2.49e-01

Pos logit contributions
uates+1.545
龍�+1.507
+1.504
cos+1.472
ero+1.443
Neg logit contributions
attery-1.570
ゴン-1.510
ukong-1.454
 Interest-1.431
ATURE-1.421
Token report
Token ID989
Feature activation+3.21e-01

Pos logit contributions
attery+0.648
ゴン+0.618
 Interest+0.613
ukong+0.596
ATURE+0.592
Neg logit contributions
uates-0.630
龍�-0.628
-0.611
cos-0.598
ero-0.580
Token for
Token ID329
Feature activation+1.86e-01

Pos logit contributions
attery+0.801
ゴン+0.756
 Interest+0.738
ukong+0.731
ATURE+0.723
Neg logit contributions
uates-0.845
龍�-0.843
-0.830
cos-0.810
ero-0.784
Token 2012
Token ID2321
Feature activation+2.00e-01

Pos logit contributions
attery+0.523
ゴン+0.497
 Interest+0.492
ukong+0.486
ATURE+0.478
Neg logit contributions
uates-0.432
龍�-0.432
-0.421
cos-0.410
ero-0.396
Token too
Token ID1165
Feature activation-1.12e-02

Pos logit contributions
attery+0.228
ゴン+0.217
 Interest+0.214
ukong+0.211
ATURE+0.208
Neg logit contributions
uates-0.206
龍�-0.202
-0.200
cos-0.197
ero-0.190
Token,
Token ID11
Feature activation+8.25e-02

Pos logit contributions
uates+0.030
龍�+0.030
+0.029
cos+0.029
ero+0.028
Neg logit contributions
attery-0.028
ゴン-0.026
 Interest-0.026
ukong-0.025
ATURE-0.025
Token the
Token ID262
Feature activation+1.68e-02

Pos logit contributions
attery+0.220
ゴン+0.209
 Interest+0.206
ukong+0.203
ATURE+0.201
Neg logit contributions
uates-0.203
龍�-0.200
-0.199
cos-0.195
ero-0.188
Token news
Token ID1705
Feature activation+3.56e-02

Pos logit contributions
attery+0.044
ゴン+0.041
 Interest+0.041
ukong+0.040
orf+0.040
Neg logit contributions
uates-0.043
龍�-0.042
-0.042
cos-0.041
ero-0.040
Token was
Token ID373
Feature activation-6.76e-01

Pos logit contributions
attery+0.095
ゴン+0.091
 Interest+0.088
ukong+0.087
ATURE+0.086
Neg logit contributions
uates-0.088
龍�-0.087
-0.086
cos-0.084
ero-0.082
Token grim
Token ID18288
Feature activation-2.08e+00

Pos logit contributions
uates+1.659
龍�+1.628
+1.613
cos+1.575
ero+1.535
Neg logit contributions
attery-1.821
ゴン-1.747
 Interest-1.687
ukong-1.685
ATURE-1.657
Token from
Token ID422
Feature activation-7.04e-01

Pos logit contributions
uates+5.122
+5.010
龍�+4.972
cos+4.942
ero+4.781
Neg logit contributions
attery-5.285
ゴン-5.196
ukong-5.004
 Interest-4.905
ATURE-4.893
Token Republicans
Token ID4734
Feature activation-1.01e-01

Pos logit contributions
uates+1.838
龍�+1.805
+1.790
cos+1.750
ero+1.710
Neg logit contributions
attery-1.786
ゴン-1.713
ukong-1.649
 Interest-1.644
ATURE-1.615
Token.
Token ID13
Feature activation-2.00e-02

Pos logit contributions
uates+0.277
龍�+0.272
+0.268
cos+0.264
ero+0.257
Neg logit contributions
attery-0.246
ゴン-0.235
 Interest-0.227
ukong-0.225
ATURE-0.221
Token In
Token ID554
Feature activation+4.68e-02

Pos logit contributions
uates+0.055
龍�+0.054
+0.053
cos+0.052
ero+0.051
Neg logit contributions
attery-0.048
ゴン-0.046
 Interest-0.045
ukong-0.044
ATURE-0.043
Token this
Token ID428
Feature activation+5.13e-01

Pos logit contributions
attery+0.119
ゴン+0.113
 Interest+0.110
ukong+0.109
ATURE+0.107
Neg logit contributions
uates-0.122
龍�-0.120
-0.119
cos-0.116
ero-0.113
Token that
Token ID326
Feature activation+3.38e-02

Pos logit contributions
attery+0.125
ゴン+0.118
 Interest+0.116
ukong+0.114
ATURE+0.113
Neg logit contributions
uates-0.143
龍�-0.142
-0.139
cos-0.136
ero-0.133
Token was
Token ID373
Feature activation+5.87e-01

Pos logit contributions
attery+0.086
ゴン+0.082
 Interest+0.080
ukong+0.079
ATURE+0.078
Neg logit contributions
uates-0.087
龍�-0.087
-0.085
cos-0.084
ero-0.081
Token used
Token ID973
Feature activation+6.07e-01

Pos logit contributions
attery+1.443
ゴン+1.337
 Interest+1.326
ATURE+1.304
ukong+1.298
Neg logit contributions
uates-1.568
龍�-1.555
-1.510
cos-1.474
ero-1.434
Token as
Token ID355
Feature activation-8.65e-01

Pos logit contributions
attery+1.338
ゴン+1.232
 Interest+1.199
ukong+1.198
multiple+1.172
Neg logit contributions
龍�-1.779
uates-1.776
-1.737
cos-1.708
ero-1.673
Token a
Token ID257
Feature activation-4.72e-01

Pos logit contributions
uates+2.185
龍�+2.144
+2.127
cos+2.079
ero+2.029
Neg logit contributions
attery-2.261
ゴン-2.173
ukong-2.094
 Interest-2.090
ATURE-2.054
Token pretext
Token ID35097
Feature activation-2.07e+00

Pos logit contributions
uates+0.979
龍�+0.958
+0.942
cos+0.918
ero+0.889
Neg logit contributions
attery-1.456
ゴン-1.402
 Interest-1.361
ukong-1.359
ATURE-1.340
Token for
Token ID329
Feature activation-5.64e-01

Pos logit contributions
uates+5.180
+5.083
龍�+5.035
cos+4.918
ero+4.798
Neg logit contributions
attery-5.240
ゴン-5.097
ukong-4.904
 Interest-4.835
multiple-4.752
Token the
Token ID262
Feature activation-1.12e-02

Pos logit contributions
uates+1.448
龍�+1.429
+1.409
cos+1.379
ero+1.344
Neg logit contributions
attery-1.459
ゴン-1.396
 Interest-1.348
ukong-1.343
ATURE-1.321
Token country
Token ID1499
Feature activation-1.18e-01

Pos logit contributions
uates+0.029
龍�+0.029
+0.028
cos+0.027
ero+0.026
Neg logit contributions
attery-0.029
ゴン-0.028
 Interest-0.027
ukong-0.027
orf-0.027
Token
Token ID447
Feature activation-4.63e-02

Pos logit contributions
uates+0.313
龍�+0.307
+0.304
cos+0.297
ero+0.290
Neg logit contributions
attery-0.294
ゴン-0.283
 Interest-0.270
ukong-0.270
ATURE-0.264
Token
Token ID247
Feature activation-4.61e-02

Pos logit contributions
uates+0.125
龍�+0.123
+0.122
cos+0.120
ero+0.116
Neg logit contributions
attery-0.113
ゴン-0.108
 Interest-0.104
ukong-0.103
ATURE-0.102
Token overthrow
Token ID25525
Feature activation-1.17e+00

Pos logit contributions
attery+0.790
ゴン+0.746
 Interest+0.736
ukong+0.728
orf+0.715
Neg logit contributions
uates-0.747
龍�-0.742
-0.725
cos-0.707
ero-0.690
Token Gaddafi
Token ID40320
Feature activation-1.25e+00

Pos logit contributions
uates+2.813
+2.744
龍�+2.738
cos+2.690
ero+2.625
Neg logit contributions
attery-3.164
ゴン-3.122
ukong-2.966
 Interest-2.913
orf-2.897
Token and
Token ID290
Feature activation-2.53e-01

Pos logit contributions
uates+3.129
+3.071
龍�+3.045
cos+3.000
ero+2.931
Neg logit contributions
attery-3.247
ゴン-3.175
ukong-3.016
ATURE-2.970
 Interest-2.968
Token essentially
Token ID6986
Feature activation+2.20e-01

Pos logit contributions
uates+0.637
龍�+0.627
+0.619
cos+0.605
ero+0.590
Neg logit contributions
attery-0.667
ゴン-0.637
 Interest-0.618
ukong-0.615
ATURE-0.605
Token shove
Token ID38332
Feature activation-1.12e-01

Pos logit contributions
attery+0.576
ゴン+0.543
 Interest+0.537
ukong+0.527
ATURE+0.520
Neg logit contributions
uates-0.553
龍�-0.550
-0.537
cos-0.526
ero-0.511
Token Gaz
Token ID21347
Feature activation-2.06e+00

Pos logit contributions
uates+0.287
龍�+0.286
+0.280
cos+0.273
ero+0.266
Neg logit contributions
attery-0.287
ゴン-0.271
 Interest-0.265
ukong-0.263
orf-0.259
Tokenprom
Token ID16963
Feature activation-3.40e-01

Pos logit contributions
cos+3.401
ero+3.237
+3.173
uates+3.140
phal+2.966
Neg logit contributions
ゴン-7.220
attery-6.945
ATURE-6.635
ukong-6.603
itars-6.554
Token out
Token ID503
Feature activation+7.58e-03

Pos logit contributions
uates+0.947
龍�+0.934
+0.924
cos+0.906
ero+0.885
Neg logit contributions
attery-0.800
ゴン-0.762
 Interest-0.734
ukong-0.732
ATURE-0.718
Token.
Token ID13
Feature activation-1.33e-02

Pos logit contributions
attery+0.022
ゴン+0.021
 Interest+0.020
ukong+0.020
orf+0.020
Neg logit contributions
uates-0.017
龍�-0.017
-0.017
cos-0.016
ero-0.016
Token I
Token ID314
Feature activation+8.08e-03

Pos logit contributions
uates+0.036
龍�+0.036
+0.035
cos+0.034
ero+0.034
Neg logit contributions
attery-0.032
ゴン-0.030
 Interest-0.030
ukong-0.029
ATURE-0.029
Token mean
Token ID1612
Feature activation+6.11e-01

Pos logit contributions
attery+0.022
ゴン+0.020
 Interest+0.020
ukong+0.020
ATURE+0.020
Neg logit contributions
uates-0.020
龍�-0.020
-0.020
cos-0.019
ero-0.019
Token the
Token ID262
Feature activation+1.59e-01

Pos logit contributions
attery+0.059
ゴン+0.055
 Interest+0.055
ukong+0.054
orf+0.053
Neg logit contributions
uates-0.074
龍�-0.074
-0.073
cos-0.071
ero-0.069
Token divides
Token ID36319
Feature activation+1.04e+00

Pos logit contributions
attery+0.432
 Interest+0.408
ゴン+0.402
ukong+0.397
orf+0.391
Neg logit contributions
uates-0.387
龍�-0.385
-0.375
cos-0.366
ero-0.353
Token are
Token ID389
Feature activation-4.32e-01

Pos logit contributions
attery+2.382
 Interest+2.252
ukong+2.203
ゴン+2.168
ATURE+2.158
Neg logit contributions
龍�-2.905
uates-2.863
-2.792
cos-2.703
ero-2.658
Token within
Token ID1626
Feature activation+3.69e-01

Pos logit contributions
uates+1.199
龍�+1.180
+1.170
cos+1.146
ero+1.121
Neg logit contributions
attery-1.024
ゴン-0.978
ukong-0.939
 Interest-0.938
ATURE-0.920
Token the
Token ID262
Feature activation+2.29e-01

Pos logit contributions
attery+0.826
ゴン+0.762
 Interest+0.759
ukong+0.734
multiple+0.733
Neg logit contributions
uates-1.071
龍�-1.067
-1.045
cos-1.024
ero-1.000
Token Democratic
Token ID4390
Feature activation+1.48e+00

Pos logit contributions
attery+0.625
ゴン+0.587
 Interest+0.585
ukong+0.570
orf+0.569
Neg logit contributions
uates-0.558
龍�-0.547
-0.538
cos-0.523
ero-0.507
Token Party
Token ID3615
Feature activation+5.79e-01

Pos logit contributions
attery+3.028
 Interest+2.888
iverse+2.740
ゴン+2.691
orf+2.651
Neg logit contributions
uates-4.504
龍�-4.349
-4.225
cos-4.115
phal-4.104
Token.
Token ID13
Feature activation+2.62e-01

Pos logit contributions
attery+1.376
 Interest+1.298
ゴン+1.286
ukong+1.264
orf+1.245
Neg logit contributions
uates-1.584
龍�-1.550
-1.515
cos-1.490
ero-1.453
Token The
Token ID383
Feature activation+1.35e-01

Pos logit contributions
attery+0.653
 Interest+0.621
ゴン+0.617
ukong+0.598
ATURE+0.594
Neg logit contributions
uates-0.697
龍�-0.680
-0.668
cos-0.658
ero-0.648
Token self
Token ID2116
Feature activation+5.46e-01

Pos logit contributions
attery+0.338
ゴン+0.320
 Interest+0.315
ukong+0.309
orf+0.305
Neg logit contributions
uates-0.355
龍�-0.350
-0.345
cos-0.338
ero-0.329
Token-
Token ID12
Feature activation+7.38e-01

Pos logit contributions
attery+1.205
ゴン+1.144
 Interest+1.108
ukong+1.090
ATURE+1.076
Neg logit contributions
uates-1.603
龍�-1.579
-1.565
cos-1.528
ero-1.496
Token along
Token ID1863
Feature activation+5.54e-01

Pos logit contributions
attery+1.678
 Interest+1.523
ゴン+1.522
ukong+1.496
orf+1.485
Neg logit contributions
龍�-2.307
uates-2.293
-2.266
cos-2.179
ero-2.133
Token the
Token ID262
Feature activation-1.27e-01

Pos logit contributions
attery+1.265
ゴン+1.172
 Interest+1.166
iverse+1.139
multiple+1.133
Neg logit contributions
uates-1.575
龍�-1.548
-1.517
cos-1.489
ero-1.446
Token path
Token ID3108
Feature activation-6.89e-01

Pos logit contributions
uates+0.341
龍�+0.335
+0.331
cos+0.324
ero+0.316
Neg logit contributions
attery-0.315
ゴン-0.300
 Interest-0.290
ukong-0.289
orf-0.284
Token to
Token ID284
Feature activation-1.23e-01

Pos logit contributions
uates+1.710
龍�+1.672
+1.659
cos+1.624
ero+1.588
Neg logit contributions
attery-1.833
ゴン-1.768
ukong-1.706
 Interest-1.695
orf-1.667
Token fully
Token ID3938
Feature activation+1.47e-02

Pos logit contributions
uates+0.328
龍�+0.322
+0.319
cos+0.311
ero+0.303
Neg logit contributions
attery-0.308
ゴン-0.293
 Interest-0.284
ukong-0.283
ATURE-0.279
Token equal
Token ID4961
Feature activation+1.23e+00

Pos logit contributions
attery+0.037
ゴン+0.035
 Interest+0.034
ukong+0.034
ATURE+0.034
Neg logit contributions
uates-0.038
龍�-0.038
-0.038
cos-0.036
ero-0.035
Token rights
Token ID2489
Feature activation+1.17e-01

Pos logit contributions
attery+2.657
ゴン+2.462
 Interest+2.458
ukong+2.426
abilities+2.382
Neg logit contributions
龍�-3.586
uates-3.572
-3.499
cos-3.359
ero-3.272
Token.
Token ID13
Feature activation+2.65e-01

Pos logit contributions
attery+0.275
ゴン+0.260
ukong+0.252
 Interest+0.252
orf+0.247
Neg logit contributions
uates-0.327
龍�-0.324
-0.320
cos-0.312
ero-0.305
Token The
Token ID383
Feature activation+1.19e-01

Pos logit contributions
attery+0.635
 Interest+0.599
ゴン+0.596
ukong+0.580
ATURE+0.578
Neg logit contributions
uates-0.727
龍�-0.714
-0.707
cos-0.687
ero-0.676
Token federal
Token ID2717
Feature activation-7.66e-02

Pos logit contributions
attery+0.296
ゴン+0.280
 Interest+0.275
ukong+0.272
orf+0.267
Neg logit contributions
uates-0.314
龍�-0.310
-0.307
cos-0.298
ero-0.290
Token government
Token ID1230
Feature activation+4.04e-01

Pos logit contributions
uates+0.185
龍�+0.182
+0.180
cos+0.176
ero+0.171
Neg logit contributions
attery-0.209
ゴン-0.201
ukong-0.194
 Interest-0.194
ATURE-0.191
Token to
Token ID284
Feature activation-4.85e-01

Pos logit contributions
uates+1.772
龍�+1.730
+1.724
cos+1.693
ero+1.658
Neg logit contributions
attery-1.578
ゴン-1.524
ukong-1.462
 Interest-1.432
orf-1.424
Token a
Token ID257
Feature activation-5.41e-02

Pos logit contributions
uates+1.280
龍�+1.257
+1.247
cos+1.220
ero+1.192
Neg logit contributions
attery-1.214
ゴン-1.163
ukong-1.119
 Interest-1.117
ATURE-1.097
Token 32
Token ID3933
Feature activation-2.01e-02

Pos logit contributions
uates+0.137
龍�+0.134
+0.133
cos+0.129
ero+0.125
Neg logit contributions
attery-0.143
ゴン-0.136
 Interest-0.132
ukong-0.131
ATURE-0.129
Token bit
Token ID1643
Feature activation+4.26e-01

Pos logit contributions
uates+0.054
龍�+0.053
+0.052
cos+0.051
ero+0.050
Neg logit contributions
attery-0.050
ゴン-0.048
 Interest-0.046
ukong-0.046
orf-0.045
Token micro
Token ID4580
Feature activation+2.04e-01

Pos logit contributions
attery+1.084
ゴン+1.033
 Interest+1.014
orf+0.986
ukong+0.986
Neg logit contributions
uates-1.103
龍�-1.094
-1.082
cos-1.044
ero-1.009
Tokencontroller
Token ID36500
Feature activation+1.21e+00

Pos logit contributions
attery+0.472
ゴン+0.445
 Interest+0.429
ukong+0.423
ATURE+0.422
Neg logit contributions
uates-0.583
龍�-0.572
-0.568
cos-0.548
ero-0.541
Token,
Token ID11
Feature activation+1.70e-01

Pos logit contributions
attery+2.837
 Interest+2.674
ゴン+2.618
ATURE+2.587
orf+2.573
Neg logit contributions
uates-3.282
龍�-3.248
-3.218
cos-3.126
ero-3.033
Token and
Token ID290
Feature activation-2.07e-01

Pos logit contributions
attery+0.420
ゴン+0.396
 Interest+0.389
ukong+0.382
ATURE+0.379
Neg logit contributions
uates-0.453
龍�-0.447
-0.442
cos-0.431
ero-0.419
Token especially
Token ID2592
Feature activation-1.91e-01

Pos logit contributions
uates+0.513
龍�+0.504
+0.499
cos+0.487
ero+0.474
Neg logit contributions
attery-0.554
ゴン-0.531
 Interest-0.514
ukong-0.512
ATURE-0.504
Token writing
Token ID3597
Feature activation-1.56e-01

Pos logit contributions
uates+0.497
龍�+0.489
+0.484
cos+0.474
ero+0.462
Neg logit contributions
attery-0.485
ゴン-0.464
 Interest-0.447
ukong-0.446
ATURE-0.439
Token a
Token ID257
Feature activation+8.35e-03

Pos logit contributions
uates+0.404
龍�+0.398
+0.393
cos+0.384
ero+0.374
Neg logit contributions
attery-0.399
ゴン-0.381
 Interest-0.368
ukong-0.367
ATURE-0.361
Token
Token ID198
Feature activation+3.63e-01

Pos logit contributions
uates+0.463
龍�+0.457
+0.451
cos+0.442
ero+0.433
Neg logit contributions
attery-0.411
ゴン-0.391
 Interest-0.381
ukong-0.376
ATURE-0.370
Token
Token ID198
Feature activation+4.93e-01

Pos logit contributions
attery+0.886
ゴン+0.842
 Interest+0.815
ukong+0.813
ATURE+0.799
Neg logit contributions
uates-0.984
龍�-0.969
-0.959
cos-0.937
ero-0.917
Token[
Token ID58
Feature activation+1.01e-01

Pos logit contributions
attery+0.859
ゴン+0.793
 Interest+0.760
ukong+0.759
ATURE+0.753
Neg logit contributions
uates-1.680
龍�-1.650
-1.626
cos-1.602
ero-1.578
Token22
Token ID1828
Feature activation+7.56e-01

Pos logit contributions
attery+0.268
ゴン+0.257
ukong+0.248
 Interest+0.248
ATURE+0.243
Neg logit contributions
uates-0.254
龍�-0.249
-0.247
cos-0.241
ero-0.235
Token]
Token ID60
Feature activation+5.49e-01

Pos logit contributions
attery+1.844
 Interest+1.726
ゴン+1.704
ukong+1.689
ATURE+1.687
Neg logit contributions
uates-2.031
龍�-2.010
-1.963
cos-1.909
ero-1.876
Token Literally
Token ID48414
Feature activation+1.33e+00

Pos logit contributions
attery+1.355
 Interest+1.289
ゴン+1.282
ukong+1.245
ATURE+1.237
Neg logit contributions
uates-1.452
龍�-1.440
-1.409
cos-1.381
ero-1.350
Token,
Token ID11
Feature activation+6.54e-02

Pos logit contributions
attery+3.053
ゴン+2.937
 Interest+2.859
ukong+2.788
multiple+2.764
Neg logit contributions
uates-3.654
龍�-3.618
-3.527
cos-3.469
ero-3.453
Token R
Token ID371
Feature activation+1.77e-01

Pos logit contributions
attery+0.153
ゴン+0.146
 Interest+0.141
ukong+0.140
orf+0.137
Neg logit contributions
uates-0.184
龍�-0.181
-0.179
cos-0.176
ero-0.172
Tokenashi
Token ID12144
Feature activation+3.72e-01

Pos logit contributions
attery+0.462
ゴン+0.438
ukong+0.429
 Interest+0.424
orf+0.418
Neg logit contributions
uates-0.449
龍�-0.442
-0.438
cos-0.426
ero-0.415
Token says
Token ID1139
Feature activation+6.41e-01

Pos logit contributions
attery+0.925
ゴン+0.890
 Interest+0.860
ukong+0.853
orf+0.845
Neg logit contributions
uates-0.972
龍�-0.969
-0.944
cos-0.928
ero-0.900
Token,
Token ID11
Feature activation+3.21e-01

Pos logit contributions
attery+1.503
ゴン+1.434
 Interest+1.389
ukong+1.381
orf+1.360
Neg logit contributions
uates-1.776
龍�-1.765
-1.727
cos-1.697
ero-1.656
TokenBy
Token ID3886
Feature activation+4.98e-01

Pos logit contributions
attery+1.495
ゴン+1.409
 Interest+1.396
multiple+1.387
ukong+1.374
Neg logit contributions
uates-1.379
龍�-1.351
-1.313
cos-1.266
ero-1.244
Token law
Token ID1099
Feature activation+3.34e-01

Pos logit contributions
attery+1.308
ゴン+1.237
 Interest+1.222
ukong+1.201
orf+1.191
Neg logit contributions
uates-1.244
龍�-1.236
-1.211
cos-1.175
ero-1.135
Token,
Token ID11
Feature activation+1.10e+00

Pos logit contributions
attery+0.777
ゴン+0.748
 Interest+0.728
ukong+0.721
ATURE+0.701
Neg logit contributions
龍�-0.925
-0.916
uates-0.916
cos-0.888
ero-0.859
Token the
Token ID262
Feature activation+6.14e-01

Pos logit contributions
attery+2.584
 Interest+2.535
ukong+2.404
ゴン+2.387
multiple+2.371
Neg logit contributions
龍�-2.957
-2.952
uates-2.930
cos-2.835
phal-2.687
Token national
Token ID2260
Feature activation+2.51e-01

Pos logit contributions
attery+1.615
 Interest+1.536
ゴン+1.502
ukong+1.492
orf+1.469
Neg logit contributions
龍�-1.522
uates-1.518
-1.506
cos-1.451
ero-1.396
Token debt
Token ID5057
Feature activation+1.28e+00

Pos logit contributions
attery+0.618
 Interest+0.588
ゴン+0.584
ukong+0.577
orf+0.562
Neg logit contributions
龍�-0.659
uates-0.658
-0.650
cos-0.628
ero-0.610
Token cannot
Token ID2314
Feature activation+2.12e-02

Pos logit contributions
attery+3.008
 Interest+2.864
ukong+2.817
ゴン+2.764
ATURE+2.756
Neg logit contributions
龍�-3.477
-3.391
uates-3.349
cos-3.225
phal-3.118
Token exceed
Token ID7074
Feature activation-2.13e-01

Pos logit contributions
attery+0.056
ゴン+0.054
ukong+0.052
 Interest+0.052
ATURE+0.051
Neg logit contributions
uates-0.053
龍�-0.052
-0.052
cos-0.051
ero-0.049
Token $
Token ID720
Feature activation+3.36e-01

Pos logit contributions
uates+0.629
龍�+0.619
+0.614
cos+0.602
ero+0.590
Neg logit contributions
attery-0.466
ゴン-0.443
ukong-0.424
 Interest-0.424
ATURE-0.415
Token16
Token ID1433
Feature activation+1.79e-01

Pos logit contributions
attery+0.854
ゴン+0.803
 Interest+0.790
ukong+0.784
ATURE+0.774
Neg logit contributions
uates-0.875
龍�-0.857
-0.849
cos-0.824
ero-0.809
Token.
Token ID13
Feature activation-4.47e-01

Pos logit contributions
attery+0.354
ゴン+0.333
 Interest+0.322
ukong+0.318
ATURE+0.312
Neg logit contributions
uates-0.566
龍�-0.558
-0.555
cos-0.541
ero-0.529
Token all
Token ID477
Feature activation+2.73e-01

Pos logit contributions
attery+1.058
 Interest+0.983
ゴン+0.978
ukong+0.960
ATURE+0.957
Neg logit contributions
龍�-1.061
uates-1.059
-1.034
cos-1.010
ero-0.985
Token the
Token ID262
Feature activation+2.13e-01

Pos logit contributions
attery+0.696
ゴン+0.653
 Interest+0.653
ukong+0.638
orf+0.635
Neg logit contributions
龍�-0.701
uates-0.699
-0.691
cos-0.664
ero-0.651
Token publishers
Token ID17604
Feature activation-4.66e-02

Pos logit contributions
attery+0.591
ゴン+0.559
 Interest+0.559
ukong+0.546
ATURE+0.543
Neg logit contributions
uates-0.499
龍�-0.497
-0.490
cos-0.472
ero-0.458
Token of
Token ID286
Feature activation+4.90e-01

Pos logit contributions
uates+0.122
龍�+0.120
+0.119
cos+0.116
ero+0.114
Neg logit contributions
attery-0.117
ゴン-0.112
 Interest-0.108
ukong-0.108
ATURE-0.106
Token London
Token ID3576
Feature activation-5.59e-02

Pos logit contributions
attery+1.140
 Interest+1.070
ゴン+1.054
ukong+1.033
orf+1.027
Neg logit contributions
uates-1.371
龍�-1.370
-1.341
cos-1.301
ero-1.274
Token."
Token ID526
Feature activation+1.41e-01

Pos logit contributions
uates+0.151
龍�+0.149
+0.147
cos+0.144
ero+0.141
Neg logit contributions
attery-0.137
ゴン-0.130
 Interest-0.126
ukong-0.125
orf-0.123
Token There
Token ID1318
Feature activation+2.68e-01

Pos logit contributions
attery+0.344
ゴン+0.326
 Interest+0.324
ukong+0.315
ATURE+0.311
Neg logit contributions
uates-0.380
龍�-0.375
-0.370
cos-0.361
ero-0.354
Token has
Token ID468
Feature activation+4.55e-01

Pos logit contributions
attery+0.760
ゴン+0.726
 Interest+0.707
ukong+0.706
orf+0.694
Neg logit contributions
uates-0.623
龍�-0.612
-0.601
cos-0.581
ero-0.566
Token always
Token ID1464
Feature activation+3.22e-02

Pos logit contributions
attery+1.100
ゴン+1.052
 Interest+1.019
ukong+1.017
orf+0.996
Neg logit contributions
uates-1.237
龍�-1.226
-1.200
cos-1.173
ero-1.139
Token been
Token ID587
Feature activation+1.25e-01

Pos logit contributions
attery+0.082
ゴン+0.079
ukong+0.076
 Interest+0.076
ATURE+0.075
Neg logit contributions
uates-0.083
龍�-0.082
-0.081
cos-0.079
ero-0.077
Token something
Token ID1223
Feature activation+7.25e-01

Pos logit contributions
attery+0.306
ゴン+0.289
 Interest+0.286
ukong+0.283
orf+0.276
Neg logit contributions
uates-0.338
龍�-0.337
-0.331
cos-0.322
ero-0.314
Token
Token ID198
Feature activation-4.15e-02

Pos logit contributions
uates+0.465
龍�+0.453
+0.453
cos+0.448
ero+0.435
Neg logit contributions
attery-0.470
ゴン-0.461
ukong-0.434
 Interest-0.429
itars-0.425
Token
Token ID447
Feature activation+5.27e-01

Pos logit contributions
uates+0.112
龍�+0.110
+0.108
cos+0.106
ero+0.104
Neg logit contributions
attery-0.102
ゴン-0.097
 Interest-0.093
ukong-0.093
multiple-0.092
Token
Token ID246
Feature activation+1.59e-01

Pos logit contributions
attery+1.204
ゴン+1.142
 Interest+1.096
ukong+1.089
ATURE+1.079
Neg logit contributions
uates-1.494
龍�-1.477
-1.455
cos-1.432
ero-1.395
TokenIt
Token ID1026
Feature activation-3.71e-01

Pos logit contributions
attery+0.458
ゴン+0.439
 Interest+0.428
ukong+0.424
ATURE+0.421
Neg logit contributions
uates-0.359
龍�-0.352
-0.344
cos-0.333
ero-0.327
Token continues
Token ID4477
Feature activation+3.36e-01

Pos logit contributions
uates+0.910
龍�+0.889
+0.883
cos+0.863
ero+0.840
Neg logit contributions
attery-1.003
ゴン-0.965
ukong-0.931
 Interest-0.929
ATURE-0.914
Token to
Token ID284
Feature activation+3.86e-01

Pos logit contributions
attery+0.768
ゴン+0.729
 Interest+0.709
ukong+0.700
ATURE+0.691
Neg logit contributions
龍�-0.953
uates-0.950
-0.933
cos-0.911
ero-0.893
Token am
Token ID716
Feature activation+1.12e-01

Pos logit contributions
attery+1.049
ゴン+0.992
 Interest+0.980
ukong+0.971
ATURE+0.954
Neg logit contributions
龍�-0.935
uates-0.931
-0.911
cos-0.878
ero-0.863
Tokenaze
Token ID6201
Feature activation-5.14e-01

Pos logit contributions
attery+0.384
ゴン+0.367
 Interest+0.362
ukong+0.361
orf+0.357
Neg logit contributions
uates-0.189
龍�-0.189
-0.186
cos-0.177
ero-0.171
Token me
Token ID502
Feature activation-3.14e-01

Pos logit contributions
uates+0.868
龍�+0.844
+0.832
cos+0.804
ero+0.774
Neg logit contributions
attery-1.777
ゴン-1.722
ukong-1.676
 Interest-1.674
ATURE-1.652
Token how
Token ID703
Feature activation-1.60e-01

Pos logit contributions
uates+0.860
龍�+0.845
+0.838
cos+0.822
ero+0.803
Neg logit contributions
attery-0.755
ゴン-0.723
ukong-0.693
 Interest-0.692
ATURE-0.679
Token little
Token ID1310
Feature activation+1.09e+00

Pos logit contributions
uates+0.424
龍�+0.420
+0.415
cos+0.405
ero+0.395
Neg logit contributions
attery-0.400
ゴン-0.381
 Interest-0.369
ukong-0.368
ATURE-0.361
Token new
Token ID649
Feature activation-4.91e-01

Pos logit contributions
uates+0.233
龍�+0.229
+0.226
cos+0.220
ero+0.214
Neg logit contributions
attery-0.302
ゴン-0.291
 Interest-0.282
ukong-0.282
ATURE-0.277
Token features
Token ID3033
Feature activation+2.40e-01

Pos logit contributions
uates+1.161
+1.134
龍�+1.130
cos+1.102
ero+1.078
Neg logit contributions
attery-1.372
ゴン-1.318
ukong-1.273
 Interest-1.262
orf-1.244
Token (
Token ID357
Feature activation+1.60e-01

Pos logit contributions
attery+0.604
ゴン+0.574
 Interest+0.559
ukong+0.557
ATURE+0.545
Neg logit contributions
uates-0.634
龍�-0.623
-0.615
cos-0.600
ero-0.588
Tokenincluding
Token ID8201
Feature activation+1.27e-01

Pos logit contributions
attery+0.376
ゴン+0.361
multiple+0.357
ukong+0.356
abilities+0.355
Neg logit contributions
uates-0.443
-0.428
龍�-0.425
ero-0.409
cos-0.405
Token off
Token ID572
Feature activation-3.17e-01

Pos logit contributions
attery+0.373
ゴン+0.365
ukong+0.350
 Interest+0.346
orf+0.343
Neg logit contributions
uates-0.279
龍�-0.274
-0.272
cos-0.265
ero-0.259
Token-
Token ID12
Feature activation+3.31e-01

Pos logit contributions
uates+0.850
龍�+0.841
+0.832
cos+0.826
ero+0.802
Neg logit contributions
attery-0.779
ゴン-0.747
ukong-0.724
 Interest-0.714
orf-0.710
TokenTV
Token ID6849
Feature activation-2.45e-01

Pos logit contributions
attery+0.705
ゴン+0.671
ukong+0.641
 Interest+0.640
ATURE+0.626
Neg logit contributions
uates-0.999
龍�-0.986
-0.978
cos-0.962
ero-0.941
Token play
Token ID711
Feature activation+4.93e-02

Pos logit contributions
uates+0.605
+0.599
龍�+0.592
cos+0.583
ero+0.575
Neg logit contributions
attery-0.643
ゴン-0.616
ukong-0.607
 Interest-0.588
ATURE-0.585
Token)
Token ID8
Feature activation+8.90e-01

Pos logit contributions
attery+0.105
ゴン+0.100
ukong+0.096
 Interest+0.095
orf+0.093
Neg logit contributions
uates-0.148
龍�-0.146
-0.145
cos-0.143
ero-0.140
Token that
Token ID326
Feature activation+1.10e-01

Pos logit contributions
attery+2.141
ゴン+2.016
 Interest+1.993
multiple+1.979
ukong+1.977
Neg logit contributions
uates-2.438
龍�-2.387
-2.378
cos-2.281
ero-2.224
Token have
Token ID423
Feature activation+1.77e-01

Pos logit contributions
attery+0.323
ゴン+0.317
ukong+0.301
ATURE+0.301
itars+0.298
Neg logit contributions
uates-0.240
-0.234
龍�-0.232
cos-0.230
ero-0.221
Token
Token ID247
Feature activation+8.20e-03

Pos logit contributions
attery+0.324
ゴン+0.309
 Interest+0.293
ukong+0.293
ATURE+0.290
Neg logit contributions
uates-0.416
龍�-0.411
-0.408
cos-0.402
ero-0.390
Tokent
Token ID83
Feature activation+2.21e-01

Pos logit contributions
attery+0.024
ゴン+0.023
ukong+0.023
 Interest+0.023
orf+0.022
Neg logit contributions
uates-0.018
龍�-0.018
-0.017
cos-0.017
ero-0.016
Token work
Token ID670
Feature activation+4.19e-01

Pos logit contributions
attery+0.546
ゴン+0.515
 Interest+0.506
ukong+0.501
orf+0.489
Neg logit contributions
uates-0.587
龍�-0.583
-0.577
cos-0.559
ero-0.548
Token,
Token ID11
Feature activation+1.59e-02

Pos logit contributions
attery+1.019
ゴン+0.965
 Interest+0.956
ukong+0.941
multiple+0.920
Neg logit contributions
龍�-1.119
uates-1.114
-1.094
cos-1.059
ero-1.038
Token a
Token ID257
Feature activation+5.98e-02

Pos logit contributions
attery+0.039
ゴン+0.037
 Interest+0.036
ukong+0.035
ATURE+0.035
Neg logit contributions
uates-0.043
龍�-0.042
-0.042
cos-0.041
ero-0.040
Token la
Token ID8591
Feature activation+6.57e-01

Pos logit contributions
attery+0.158
ゴン+0.151
 Interest+0.146
ukong+0.145
orf+0.144
Neg logit contributions
uates-0.150
龍�-0.148
-0.146
cos-0.141
ero-0.136
Token I
Token ID314
Feature activation-4.14e-02

Pos logit contributions
attery+1.637
ゴン+1.534
 Interest+1.530
ukong+1.482
orf+1.481
Neg logit contributions
uates-1.731
龍�-1.699
-1.692
cos-1.624
phal-1.573
TokenRL
Token ID7836
Feature activation+2.07e-01

Pos logit contributions
uates+0.091
龍�+0.089
+0.088
cos+0.083
ero+0.080
Neg logit contributions
attery-0.122
ゴン-0.117
 Interest-0.114
ukong-0.114
orf-0.113
Token.
Token ID13
Feature activation-4.04e-02

Pos logit contributions
attery+0.535
ゴン+0.507
 Interest+0.495
ukong+0.490
orf+0.484
Neg logit contributions
uates-0.534
龍�-0.520
-0.519
cos-0.504
ero-0.487
Token
Token ID198
Feature activation+1.33e-01

Pos logit contributions
uates+0.110
龍�+0.108
+0.107
cos+0.104
ero+0.102
Neg logit contributions
attery-0.099
ゴン-0.093
 Interest-0.092
ukong-0.090
ATURE-0.089
Token
Token ID198
Feature activation+1.53e-01

Pos logit contributions
attery+0.328
ゴン+0.314
ukong+0.302
 Interest+0.302
ATURE+0.296
Neg logit contributions
uates-0.358
龍�-0.352
-0.348
cos-0.341
ero-0.333
Token:
Token ID25
Feature activation+5.39e-01

Pos logit contributions
attery+0.415
ゴン+0.393
 Interest+0.383
ukong+0.376
ATURE+0.371
Neg logit contributions
uates-0.571
龍�-0.566
-0.558
cos-0.545
ero-0.535
Token round
Token ID2835
Feature activation-3.53e-01

Pos logit contributions
attery+1.331
 Interest+1.256
ゴン+1.253
multiple+1.218
orf+1.204
Neg logit contributions
uates-1.423
龍�-1.409
-1.404
cos-1.358
ero-1.312
Token-
Token ID12
Feature activation+5.72e-01

Pos logit contributions
uates+0.878
龍�+0.864
+0.854
cos+0.834
ero+0.813
Neg logit contributions
attery-0.938
ゴン-0.898
 Interest-0.868
ukong-0.867
orf-0.852
Tokentrip
Token ID39813
Feature activation+6.21e-01

Pos logit contributions
attery+2.032
multiple+1.942
ゴン+1.933
 Interest+1.921
orf+1.919
Neg logit contributions
龍�-0.912
uates-0.903
-0.869
cos-0.772
ero-0.759
Token propagation
Token ID43594
Feature activation+1.05e-01

Pos logit contributions
attery+1.577
ゴン+1.497
 Interest+1.473
orf+1.436
ATURE+1.426
Neg logit contributions
龍�-1.606
uates-1.596
-1.577
cos-1.521
ero-1.470
Token time
Token ID640
Feature activation+2.03e-01

Pos logit contributions
attery+0.259
ゴン+0.247
 Interest+0.239
ukong+0.237
ATURE+0.234
Neg logit contributions
uates-0.281
龍�-0.278
-0.275
cos-0.268
ero-0.262
Token (
Token ID357
Feature activation+3.37e-01

Pos logit contributions
attery+0.489
ゴン+0.464
 Interest+0.452
ukong+0.444
orf+0.443
Neg logit contributions
uates-0.551
龍�-0.547
-0.540
cos-0.523
ero-0.513
TokenRT
Token ID14181
Feature activation-3.76e-01

Pos logit contributions
attery+0.826
ゴン+0.790
 Interest+0.760
multiple+0.756
ukong+0.753
Neg logit contributions
uates-0.910
龍�-0.890
-0.877
cos-0.853
ero-0.829
Token prop
Token ID2632
Feature activation-9.64e-01

Pos logit contributions
uates+0.995
龍�+0.979
+0.971
cos+0.952
ero+0.930
Neg logit contributions
attery-0.938
ゴン-0.899
ukong-0.866
 Interest-0.863
ATURE-0.846
Token ),
Token ID10612
Feature activation+8.33e-01

Pos logit contributions
uates+2.862
龍�+2.808
+2.807
cos+2.772
ero+2.710
Neg logit contributions
attery-2.080
ゴン-1.983
ukong-1.908
 Interest-1.899
ATURE-1.857
Token and
Token ID290
Feature activation-1.29e-02

Pos logit contributions
attery+1.852
ゴン+1.738
 Interest+1.708
orf+1.654
ATURE+1.652
Neg logit contributions
龍�-2.398
uates-2.387
-2.357
cos-2.278
ero-2.219
Token fold
Token ID5591
Feature activation-2.28e-01

Pos logit contributions
attery+1.225
ゴン+1.151
 Interest+1.135
ukong+1.124
orf+1.119
Neg logit contributions
龍�-1.050
uates-1.037
-1.032
cos-0.977
ero-0.953
Token mechanism
Token ID9030
Feature activation+3.84e-01

Pos logit contributions
uates+0.562
龍�+0.555
+0.549
cos+0.533
ero+0.519
Neg logit contributions
attery-0.611
ゴン-0.584
 Interest-0.565
ukong-0.564
ATURE-0.557
Token had
Token ID550
Feature activation-6.73e-02

Pos logit contributions
attery+1.025
ゴン+0.972
 Interest+0.949
ukong+0.944
ATURE+0.937
Neg logit contributions
龍�-0.945
uates-0.943
-0.931
cos-0.896
ero-0.866
Token broken
Token ID5445
Feature activation+1.88e-01

Pos logit contributions
uates+0.176
龍�+0.174
+0.172
cos+0.168
ero+0.164
Neg logit contributions
attery-0.170
ゴン-0.162
 Interest-0.157
ukong-0.157
ATURE-0.154
Token under
Token ID739
Feature activation+3.10e-02

Pos logit contributions
attery+0.473
ゴン+0.447
 Interest+0.438
ukong+0.436
ATURE+0.431
Neg logit contributions
uates-0.489
龍�-0.488
-0.482
cos-0.465
ero-0.452
Token the
Token ID262
Feature activation-1.52e-01

Pos logit contributions
attery+0.078
ゴン+0.073
 Interest+0.071
ukong+0.070
multiple+0.070
Neg logit contributions
uates-0.082
龍�-0.082
-0.080
cos-0.078
ero-0.076
Token air
Token ID1633
Feature activation-8.64e-04

Pos logit contributions
uates+0.369
龍�+0.364
+0.360
cos+0.350
ero+0.340
Neg logit contributions
attery-0.414
ゴン-0.396
 Interest-0.383
ukong-0.383
ATURE-0.376
Tokenloads
Token ID46030
Feature activation+1.21e-01

Pos logit contributions
uates+0.002
龍�+0.002
+0.002
cos+0.002
ero+0.002
Neg logit contributions
attery-0.002
ゴン-0.002
 Interest-0.002
ukong-0.002
orf-0.002
Token allowing
Token ID5086
Feature activation-1.18e-01

Pos logit contributions
attery+0.298
ゴン+0.283
 Interest+0.274
ukong+0.273
orf+0.269
Neg logit contributions
uates-0.326
龍�-0.322
-0.318
cos-0.310
ero-0.302
Token the
Token ID262
Feature activation+2.83e-01

Pos logit contributions
uates+0.312
龍�+0.309
+0.304
cos+0.297
ero+0.289
Neg logit contributions
attery-0.295
ゴン-0.280
 Interest-0.272
ukong-0.271
ATURE-0.267
Token wing
Token ID8539
Feature activation+6.22e-01

Pos logit contributions
attery+0.760
ゴン+0.717
 Interest+0.704
ukong+0.696
ATURE+0.691
Neg logit contributions
uates-0.697
龍�-0.696
-0.686
cos-0.660
ero-0.640
Tokenh
Token ID71
Feature activation+1.22e-01

Pos logit contributions
attery+1.953
 Interest+1.812
ゴン+1.795
abilities+1.754
orf+1.753
Neg logit contributions
龍�-1.833
uates-1.822
-1.818
cos-1.699
ero-1.678
Tokenose
Token ID577
Feature activation+1.90e-01

Pos logit contributions
attery+0.375
ゴン+0.354
ukong+0.347
 Interest+0.347
orf+0.342
Neg logit contributions
uates-0.255
龍�-0.252
-0.250
cos-0.240
ero-0.235
Token hanging
Token ID10938
Feature activation+1.18e-01

Pos logit contributions
attery+0.464
ゴン+0.431
 Interest+0.428
ukong+0.419
orf+0.419
Neg logit contributions
uates-0.510
龍�-0.509
-0.498
cos-0.486
ero-0.473
Token from
Token ID422
Feature activation-2.41e-01

Pos logit contributions
attery+0.273
ゴン+0.256
 Interest+0.251
ukong+0.249
orf+0.245
Neg logit contributions
龍�-0.334
uates-0.333
-0.329
cos-0.321
ero-0.313
Token the
Token ID262
Feature activation-2.37e-01

Pos logit contributions
uates+0.646
龍�+0.641
+0.632
cos+0.617
ero+0.600
Neg logit contributions
attery-0.595
ゴン-0.563
 Interest-0.550
ukong-0.547
multiple-0.538
Token top
Token ID1353
Feature activation-1.14e+00

Pos logit contributions
uates+0.587
龍�+0.579
+0.572
cos+0.555
ero+0.540
Neg logit contributions
attery-0.634
ゴン-0.604
 Interest-0.589
ukong-0.586
orf-0.578
Token of
Token ID286
Feature activation-5.25e-01

Pos logit contributions
uates+2.691
龍�+2.627
+2.609
cos+2.549
ero+2.481
Neg logit contributions
attery-3.149
ゴン-3.045
ukong-2.941
 Interest-2.923
ATURE-2.873
Token the
Token ID262
Feature activation-5.39e-01

Pos logit contributions
uates+1.416
龍�+1.396
+1.380
cos+1.350
ero+1.319
Neg logit contributions
attery-1.290
ゴン-1.225
 Interest-1.188
ukong-1.183
ATURE-1.162
Token kitchen
Token ID9592
Feature activation+3.13e-01

Pos logit contributions
uates+1.344
龍�+1.320
+1.308
cos+1.277
ero+1.246
Neg logit contributions
attery-1.429
ゴン-1.371
ukong-1.322
 Interest-1.321
ATURE-1.298
Token window
Token ID4324
Feature activation+4.16e-01

Pos logit contributions
attery+0.779
ゴン+0.727
 Interest+0.719
ATURE+0.706
orf+0.703
Neg logit contributions
uates-0.825
龍�-0.818
-0.810
cos-0.782
ero-0.761
Token.
Token ID13
Feature activation+1.33e-01

Pos logit contributions
attery+1.011
ゴン+0.948
 Interest+0.934
orf+0.916
ukong+0.913
Neg logit contributions
龍�-1.112
uates-1.104
-1.091
cos-1.059
ero-1.036
Token the
Token ID262
Feature activation-3.16e-01

Pos logit contributions
uates+2.640
+2.562
龍�+2.560
cos+2.510
ero+2.465
Neg logit contributions
attery-2.625
ゴン-2.534
ukong-2.420
 Interest-2.399
ATURE-2.374
Token P
Token ID350
Feature activation+1.92e-02

Pos logit contributions
uates+0.841
龍�+0.832
+0.820
cos+0.801
ero+0.781
Neg logit contributions
attery-0.787
ゴン-0.750
 Interest-0.727
ukong-0.724
orf-0.710
TokenYD
Token ID35755
Feature activation-1.57e-02

Pos logit contributions
attery+0.059
ゴン+0.057
ukong+0.055
 Interest+0.055
ATURE+0.054
Neg logit contributions
uates-0.040
龍�-0.039
-0.039
cos-0.037
ero-0.036
Token's
Token ID338
Feature activation-1.82e-01

Pos logit contributions
uates+0.042
龍�+0.042
+0.041
cos+0.040
ero+0.039
Neg logit contributions
attery-0.039
ゴン-0.037
 Interest-0.036
ukong-0.036
orf-0.035
Token role
Token ID2597
Feature activation-1.74e-01

Pos logit contributions
uates+0.495
龍�+0.492
+0.482
cos+0.472
ero+0.460
Neg logit contributions
attery-0.441
ゴン-0.419
 Interest-0.409
ukong-0.404
ATURE-0.396
Token in
Token ID287
Feature activation-3.87e-01

Pos logit contributions
龍�+0.524
uates+0.523
+0.515
cos+0.504
ero+0.493
Neg logit contributions
attery-0.367
ゴン-0.346
 Interest-0.334
ukong-0.333
ATURE-0.326
Token the
Token ID262
Feature activation-2.47e-01

Pos logit contributions
uates+1.034
龍�+1.020
+1.007
cos+0.985
ero+0.960
Neg logit contributions
attery-0.956
ゴン-0.913
 Interest-0.882
ukong-0.881
ATURE-0.862
Token north
Token ID5093
Feature activation-4.72e-02

Pos logit contributions
uates+0.658
龍�+0.650
+0.639
cos+0.624
ero+0.609
Neg logit contributions
attery-0.615
ゴン-0.586
 Interest-0.568
ukong-0.565
orf-0.554
Token at
Token ID379
Feature activation+1.99e-01

Pos logit contributions
uates+0.127
龍�+0.127
+0.124
cos+0.120
ero+0.118
Neg logit contributions
attery-0.115
ゴン-0.110
 Interest-0.106
ukong-0.106
multiple-0.104
Token the
Token ID262
Feature activation+8.07e-02

Pos logit contributions
attery+0.462
ゴン+0.439
 Interest+0.427
ukong+0.424
multiple+0.416
Neg logit contributions
uates-0.558
龍�-0.555
-0.545
cos-0.530
ero-0.518
Token start
Token ID923
Feature activation-8.29e-01

Pos logit contributions
attery+0.204
ゴン+0.193
 Interest+0.192
ukong+0.188
orf+0.185
Neg logit contributions
uates-0.212
龍�-0.210
-0.205
cos-0.200
ero-0.195
Token
Token ID198
Feature activation-3.24e-01

Pos logit contributions
attery+0.966
ゴン+0.921
 Interest+0.917
ukong+0.885
ATURE+0.878
Neg logit contributions
uates-1.109
龍�-1.095
-1.068
cos-1.037
ero-1.021
Token
Token ID198
Feature activation-7.08e-02

Pos logit contributions
uates+0.815
+0.797
龍�+0.794
cos+0.789
ero+0.766
Neg logit contributions
attery-0.847
ゴン-0.838
ukong-0.785
 Interest-0.771
itars-0.770
TokenW
Token ID54
Feature activation+1.04e-01

Pos logit contributions
uates+0.178
龍�+0.175
+0.172
cos+0.167
ero+0.164
Neg logit contributions
attery-0.187
ゴン-0.179
 Interest-0.173
ukong-0.173
ATURE-0.170
Tokenarp
Token ID5117
Feature activation-2.12e-01

Pos logit contributions
attery+0.212
ゴン+0.198
ukong+0.193
 Interest+0.191
orf+0.190
Neg logit contributions
uates-0.324
龍�-0.321
-0.318
cos-0.309
ero-0.302
Token Factor
Token ID27929
Feature activation-4.44e-01

Pos logit contributions
uates+0.558
龍�+0.550
+0.542
cos+0.528
ero+0.516
Neg logit contributions
attery-0.534
ゴン-0.511
 Interest-0.496
ukong-0.492
orf-0.485
Token Calculator
Token ID43597
Feature activation-3.31e-01

Pos logit contributions
uates+1.257
龍�+1.236
+1.227
cos+1.202
ero+1.177
Neg logit contributions
attery-1.027
ゴン-0.980
ukong-0.939
 Interest-0.936
ATURE-0.918
Token is
Token ID318
Feature activation+2.99e-01

Pos logit contributions
uates+0.890
龍�+0.877
+0.868
cos+0.849
ero+0.830
Neg logit contributions
attery-0.813
ゴン-0.777
 Interest-0.751
ukong-0.748
ATURE-0.734
Token a
Token ID257
Feature activation-8.34e-02

Pos logit contributions
attery+0.761
ゴン+0.715
 Interest+0.711
ukong+0.698
ATURE+0.694
Neg logit contributions
uates-0.781
龍�-0.773
-0.758
cos-0.733
ero-0.720
Token Java
Token ID7349
Feature activation-9.55e-01

Pos logit contributions
uates+0.202
龍�+0.199
+0.196
cos+0.188
ero+0.182
Neg logit contributions
attery-0.228
ゴン-0.218
 Interest-0.214
ukong-0.211
orf-0.209
Token Script
Token ID12327
Feature activation-4.09e-01

Pos logit contributions
uates+2.442
龍�+2.397
+2.380
cos+2.327
ero+2.274
Neg logit contributions
attery-2.469
ゴン-2.367
ukong-2.283
 Interest-2.275
ATURE-2.238
Token calculator
Token ID28260
Feature activation-4.21e-03

Pos logit contributions
uates+1.061
龍�+1.043
+1.032
cos+1.009
ero+0.983
Neg logit contributions
attery-1.045
ゴン-0.999
 Interest-0.966
ukong-0.963
ATURE-0.948
Token my
Token ID616
Feature activation+1.50e-01

Pos logit contributions
attery+0.506
ゴン+0.477
 Interest+0.471
ukong+0.465
orf+0.457
Neg logit contributions
uates-0.500
龍�-0.490
-0.481
cos-0.473
ero-0.460
Token favorite
Token ID4004
Feature activation-1.91e-01

Pos logit contributions
attery+0.384
ゴン+0.366
 Interest+0.358
ukong+0.353
orf+0.349
Neg logit contributions
uates-0.389
龍�-0.381
-0.377
cos-0.367
ero-0.356
Token cooking
Token ID10801
Feature activation+4.11e-02

Pos logit contributions
uates+0.495
龍�+0.486
+0.482
cos+0.469
ero+0.455
Neg logit contributions
attery-0.490
ゴン-0.467
 Interest-0.456
ukong-0.452
orf-0.447
Token tools
Token ID4899
Feature activation+7.87e-01

Pos logit contributions
attery+0.102
ゴン+0.097
 Interest+0.095
ukong+0.093
orf+0.092
Neg logit contributions
uates-0.110
龍�-0.108
-0.107
cos-0.104
ero-0.101
Token,
Token ID11
Feature activation+6.97e-02

Pos logit contributions
attery+1.839
 Interest+1.714
ゴン+1.710
ukong+1.670
ATURE+1.647
Neg logit contributions
uates-2.206
龍�-2.184
-2.117
cos-2.082
ero-2.045
Token shop
Token ID6128
Feature activation-1.91e-01

Pos logit contributions
attery+0.174
ゴン+0.164
 Interest+0.163
ukong+0.160
ATURE+0.158
Neg logit contributions
uates-0.185
龍�-0.182
-0.179
cos-0.175
ero-0.171
Token my
Token ID616
Feature activation-1.79e-01

Pos logit contributions
uates+0.527
龍�+0.518
+0.514
cos+0.505
ero+0.493
Neg logit contributions
attery-0.454
ゴン-0.434
ukong-0.417
 Interest-0.414
ATURE-0.407
Token kitchen
Token ID9592
Feature activation+7.01e-01

Pos logit contributions
uates+0.451
龍�+0.443
+0.438
cos+0.425
ero+0.416
Neg logit contributions
attery-0.469
ゴン-0.450
 Interest-0.437
ukong-0.432
orf-0.427
Token essentials
Token ID41954
Feature activation-3.78e-01

Pos logit contributions
attery+1.689
ゴン+1.619
 Interest+1.596
iverse+1.556
orf+1.547
Neg logit contributions
uates-1.885
龍�-1.848
-1.836
cos-1.751
ero-1.728
Token!
Token ID0
Feature activation-2.78e-01

Pos logit contributions
uates+0.977
龍�+0.960
+0.950
cos+0.929
ero+0.907
Neg logit contributions
attery-0.967
ゴン-0.926
 Interest-0.893
ukong-0.892
ATURE-0.875
Token<|endoftext|>
Token ID50256
Feature activation-3.28e-01

Pos logit contributions
uates+0.744
龍�+0.729
+0.721
cos+0.706
ero+0.692
Neg logit contributions
attery-0.689
ゴン-0.658
 Interest-0.640
ukong-0.634
ATURE-0.622
Token during
Token ID1141
Feature activation+3.69e-01

Pos logit contributions
attery+0.313
ゴン+0.293
 Interest+0.284
ukong+0.283
multiple+0.279
Neg logit contributions
uates-0.382
龍�-0.378
-0.374
cos-0.364
ero-0.358
Token the
Token ID262
Feature activation+9.39e-02

Pos logit contributions
attery+0.909
ゴン+0.854
 Interest+0.832
orf+0.826
multiple+0.823
Neg logit contributions
uates-0.982
龍�-0.981
-0.966
cos-0.939
ero-0.912
Token murder
Token ID5123
Feature activation-1.85e-01

Pos logit contributions
attery+0.257
ゴン+0.245
 Interest+0.237
ukong+0.236
orf+0.235
Neg logit contributions
uates-0.227
龍�-0.223
-0.221
cos-0.214
ero-0.209
Token,
Token ID11
Feature activation-6.71e-02

Pos logit contributions
uates+0.478
龍�+0.473
+0.467
cos+0.455
ero+0.444
Neg logit contributions
attery-0.475
ゴン-0.456
 Interest-0.438
ukong-0.437
orf-0.431
Token suddenly
Token ID6451
Feature activation+2.59e-01

Pos logit contributions
uates+0.171
龍�+0.169
+0.167
cos+0.163
ero+0.159
Neg logit contributions
attery-0.173
ゴン-0.166
 Interest-0.160
ukong-0.160
orf-0.158
Tokentrans
Token ID7645
Feature activation-2.30e+00

Pos logit contributions
attery+0.660
ゴン+0.630
 Interest+0.612
ukong+0.609
orf+0.602
Neg logit contributions
龍�-0.666
uates-0.665
-0.654
cos-0.635
ero-0.622
Tokenforms
Token ID23914
Feature activation+1.96e-01

Pos logit contributions
uates+6.465
cos+6.125
+5.998
ero+5.900
龍�+5.826
Neg logit contributions
ゴン-5.304
attery-5.246
 Interest-4.859
ukong-4.831
ATURE-4.793
Token itself
Token ID2346
Feature activation-2.79e-01

Pos logit contributions
attery+0.426
ゴン+0.398
 Interest+0.386
ukong+0.385
orf+0.380
Neg logit contributions
龍�-0.581
uates-0.581
-0.570
cos-0.554
ero-0.544
Token into
Token ID656
Feature activation-3.49e-01

Pos logit contributions
uates+0.833
龍�+0.819
+0.814
cos+0.799
ero+0.782
Neg logit contributions
attery-0.601
ゴン-0.572
ukong-0.547
 Interest-0.546
ATURE-0.534
Token the
Token ID262
Feature activation-1.19e-01

Pos logit contributions
uates+0.927
龍�+0.912
+0.900
cos+0.878
ero+0.861
Neg logit contributions
attery-0.876
ゴン-0.833
 Interest-0.805
ukong-0.805
orf-0.791
Token a
Token ID257
Feature activation+4.15e-02

Pos logit contributions
uates+0.297
龍�+0.294
+0.289
cos+0.281
ero+0.275
Neg logit contributions
attery-0.314
ゴン-0.298
ukong-0.290
 Interest-0.289
orf-0.285
Token Europe
Token ID2031
Feature activation-5.64e-01

Pos logit contributions
attery+0.220
ゴン+0.208
 Interest+0.204
ukong+0.202
orf+0.199
Neg logit contributions
uates-0.216
龍�-0.214
-0.210
cos-0.205
ero-0.200
Token Russian
Token ID3394
Feature activation-1.79e-01

Pos logit contributions
uates+1.506
龍�+1.481
+1.469
cos+1.438
ero+1.405
Neg logit contributions
attery-1.397
ゴン-1.339
ukong-1.287
 Interest-1.283
ATURE-1.262
Token state
Token ID1181
Feature activation-5.91e-01

Pos logit contributions
uates+0.464
龍�+0.458
+0.452
cos+0.440
ero+0.431
Neg logit contributions
attery-0.456
ゴン-0.436
 Interest-0.425
ukong-0.421
orf-0.414
Token energy
Token ID2568
Feature activation+3.47e-02

Pos logit contributions
uates+1.447
龍�+1.416
+1.407
cos+1.384
ero+1.346
Neg logit contributions
attery-1.594
ゴン-1.533
ukong-1.481
 Interest-1.465
ATURE-1.446
Token company
Token ID1664
Feature activation+3.35e-02

Pos logit contributions
attery+0.092
ゴン+0.088
 Interest+0.086
ukong+0.085
orf+0.084
Neg logit contributions
uates-0.087
龍�-0.086
-0.085
cos-0.082
ero-0.080
Token Gaz
Token ID21347
Feature activation-2.34e+00

Pos logit contributions
attery+0.099
 Interest+0.095
ゴン+0.093
ukong+0.092
orf+0.092
Neg logit contributions
uates-0.073
龍�-0.073
-0.071
cos-0.068
ero-0.065
Tokenprom
Token ID16963
Feature activation-4.37e-01

Pos logit contributions
cos+4.058
uates+3.995
+3.924
ero+3.872
龍�+3.822
Neg logit contributions
attery-7.833
ゴン-7.781
ukong-7.481
ATURE-7.244
itars-7.208
Token has
Token ID468
Feature activation+4.95e-03

Pos logit contributions
uates+1.176
龍�+1.160
+1.144
cos+1.119
ero+1.094
Neg logit contributions
attery-1.074
ゴン-1.024
 Interest-0.991
ukong-0.987
orf-0.969
Token resumed
Token ID28291
Feature activation-5.57e-01

Pos logit contributions
attery+0.012
ゴン+0.011
 Interest+0.011
ukong+0.011
orf+0.011
Neg logit contributions
uates-0.013
龍�-0.013
-0.013
cos-0.013
ero-0.012
Token some
Token ID617
Feature activation-1.93e-01

Pos logit contributions
uates+1.346
龍�+1.327
+1.307
cos+1.276
ero+1.240
Neg logit contributions
attery-1.521
ゴン-1.458
 Interest-1.415
ukong-1.407
ATURE-1.385
Token gas
Token ID3623
Feature activation-1.23e+00

Pos logit contributions
uates+0.458
龍�+0.457
+0.443
cos+0.433
ero+0.417
Neg logit contributions
attery-0.531
ゴン-0.511
 Interest-0.501
ukong-0.492
orf-0.487
Token If
Token ID1002
Feature activation-8.28e-01

Pos logit contributions
uates+0.126
龍�+0.123
+0.122
cos+0.119
ero+0.117
Neg logit contributions
attery-0.122
ゴン-0.115
 Interest-0.114
ukong-0.112
ATURE-0.110
Token I
Token ID314
Feature activation-1.43e-01

Pos logit contributions
uates+1.963
龍�+1.926
+1.906
cos+1.860
ero+1.811
Neg logit contributions
attery-2.301
ゴン-2.210
 Interest-2.137
ukong-2.136
ATURE-2.099
Token was
Token ID373
Feature activation-3.76e-01

Pos logit contributions
uates+0.355
龍�+0.351
+0.347
cos+0.336
ero+0.328
Neg logit contributions
attery-0.382
ゴン-0.359
 Interest-0.354
ukong-0.353
ATURE-0.347
Token going
Token ID1016
Feature activation-7.30e-02

Pos logit contributions
uates+0.963
龍�+0.951
+0.938
cos+0.916
ero+0.891
Neg logit contributions
attery-0.971
ゴン-0.928
 Interest-0.901
ukong-0.896
ATURE-0.881
Token to
Token ID284
Feature activation-4.03e-01

Pos logit contributions
uates+0.210
龍�+0.207
+0.205
cos+0.201
ero+0.196
Neg logit contributions
attery-0.166
ゴン-0.158
 Interest-0.151
ukong-0.151
ATURE-0.148
Token do
Token ID466
Feature activation-1.67e+00

Pos logit contributions
uates+0.970
龍�+0.955
+0.943
cos+0.919
ero+0.895
Neg logit contributions
attery-1.103
ゴン-1.053
 Interest-1.024
ukong-1.024
ATURE-1.004
Token another
Token ID1194
Feature activation+4.09e-01

Pos logit contributions
uates+4.007
+3.932
龍�+3.906
cos+3.832
ero+3.748
Neg logit contributions
attery-4.472
ゴン-4.378
ukong-4.138
 Interest-4.103
ATURE-4.098
Token one
Token ID530
Feature activation-3.28e-01

Pos logit contributions
attery+1.181
ゴン+1.122
 Interest+1.120
ukong+1.091
orf+1.080
Neg logit contributions
uates-0.925
龍�-0.911
-0.877
cos-0.852
ero-0.822
Token again
Token ID757
Feature activation+4.30e-01

Pos logit contributions
uates+0.849
龍�+0.838
+0.828
cos+0.808
ero+0.788
Neg logit contributions
attery-0.838
ゴン-0.801
 Interest-0.774
ukong-0.772
ATURE-0.758
Token,
Token ID11
Feature activation-2.83e-01

Pos logit contributions
attery+1.048
ゴン+0.988
 Interest+0.976
ukong+0.954
orf+0.945
Neg logit contributions
uates-1.154
龍�-1.152
-1.136
cos-1.100
ero-1.070
Token I
Token ID314
Feature activation-5.92e-01

Pos logit contributions
uates+0.697
龍�+0.688
+0.681
cos+0.661
ero+0.643
Neg logit contributions
attery-0.761
ゴン-0.725
 Interest-0.710
ukong-0.705
multiple-0.693
Token the
Token ID262
Feature activation-3.11e-01

Pos logit contributions
龍�+0.002
uates+0.002
+0.002
cos+0.002
ero+0.002
Neg logit contributions
attery-0.002
ゴン-0.002
 Interest-0.001
ukong-0.001
abilities-0.001
Token past
Token ID1613
Feature activation+1.44e-01

Pos logit contributions
uates+0.681
龍�+0.673
+0.664
cos+0.641
ero+0.621
Neg logit contributions
attery-0.925
ゴン-0.880
 Interest-0.861
ukong-0.856
ATURE-0.846
Tokeneb
Token ID1765
Feature activation-6.65e-01

Pos logit contributions
attery+0.495
ゴン+0.475
 Interest+0.464
orf+0.464
ukong+0.463
Neg logit contributions
uates-0.242
龍�-0.241
-0.238
cos-0.226
ero-0.212
Tokenoard
Token ID11953
Feature activation-1.05e+00

Pos logit contributions
uates+1.528
龍�+1.446
cos+1.439
+1.425
ero+1.405
Neg logit contributions
ゴン-1.897
attery-1.855
 Interest-1.779
ukong-1.771
ATURE-1.744
Token or
Token ID393
Feature activation+6.84e-01

Pos logit contributions
uates+2.707
龍�+2.634
+2.611
cos+2.577
ero+2.524
Neg logit contributions
attery-2.665
ゴン-2.586
ukong-2.481
 Interest-2.467
ATURE-2.424
Token cloth
Token ID16270
Feature activation-1.45e+00

Pos logit contributions
attery+2.358
ゴン+2.215
 Interest+2.206
orf+2.186
multiple+2.181
Neg logit contributions
uates-1.145
龍�-1.141
-1.125
cos-1.052
ero-1.021
Token.
Token ID13
Feature activation-1.02e-01

Pos logit contributions
uates+3.633
龍�+3.511
+3.486
cos+3.450
ero+3.366
Neg logit contributions
attery-3.787
ゴン-3.699
ukong-3.557
 Interest-3.524
ATURE-3.469
Token The
Token ID383
Feature activation+5.57e-02

Pos logit contributions
uates+0.277
龍�+0.272
+0.270
cos+0.263
ero+0.258
Neg logit contributions
attery-0.247
ゴン-0.233
 Interest-0.230
ukong-0.225
ATURE-0.223
Token metal
Token ID6147
Feature activation-4.69e-01

Pos logit contributions
attery+0.152
ゴン+0.143
 Interest+0.141
ukong+0.139
ATURE+0.137
Neg logit contributions
uates-0.136
龍�-0.134
-0.132
cos-0.128
ero-0.125
Token discs
Token ID31014
Feature activation-5.10e-02

Pos logit contributions
uates+1.140
龍�+1.117
+1.107
cos+1.082
ero+1.055
Neg logit contributions
attery-1.273
ゴン-1.224
ukong-1.182
 Interest-1.180
ATURE-1.160
Token also
Token ID635
Feature activation+3.75e-01

Pos logit contributions
uates+0.131
龍�+0.129
+0.128
cos+0.124
ero+0.121
Neg logit contributions
attery-0.132
ゴン-0.125
 Interest-0.121
ukong-0.121
ATURE-0.119
Token considered
Token ID3177
Feature activation+3.59e-02

Pos logit contributions
attery+1.008
ゴン+0.946
 Interest+0.930
ukong+0.914
ATURE+0.911
Neg logit contributions
uates-1.113
龍�-1.097
-1.076
cos-1.050
ero-1.030
Token one
Token ID530
Feature activation-4.65e-01

Pos logit contributions
attery+0.084
ゴン+0.080
 Interest+0.077
ukong+0.076
ATURE+0.075
Neg logit contributions
uates-0.101
龍�-0.099
-0.098
cos-0.096
ero-0.094
Token of
Token ID286
Feature activation-2.40e-01

Pos logit contributions
uates+0.996
龍�+0.976
+0.963
cos+0.935
ero+0.909
Neg logit contributions
attery-1.401
ゴン-1.351
 Interest-1.308
ukong-1.306
ATURE-1.290
Token the
Token ID262
Feature activation-1.99e-01

Pos logit contributions
uates+0.664
龍�+0.653
+0.646
cos+0.631
ero+0.616
Neg logit contributions
attery-0.579
ゴン-0.549
 Interest-0.531
ukong-0.525
ATURE-0.519
Token main
Token ID1388
Feature activation-7.64e-01

Pos logit contributions
uates+0.638
龍�+0.632
+0.624
cos+0.608
ero+0.596
Neg logit contributions
attery-0.390
ゴン-0.359
 Interest-0.353
ukong-0.346
ATURE-0.339
Token inspir
Token ID32285
Feature activation-1.39e+00

Pos logit contributions
uates+1.786
龍�+1.760
+1.737
cos+1.686
ero+1.644
Neg logit contributions
attery-2.146
ゴン-2.057
 Interest-2.000
ukong-1.991
ATURE-1.961
Tokenations
Token ID602
Feature activation-6.35e-01

Pos logit contributions
uates+4.369
cos+4.311
+4.186
ero+4.168
龍�+4.109
Neg logit contributions
ゴン-2.899
attery-2.687
ukong-2.414
 Interest-2.374
ATURE-2.344
Token for
Token ID329
Feature activation-2.78e-01

Pos logit contributions
uates+1.685
龍�+1.654
+1.642
cos+1.608
ero+1.572
Neg logit contributions
attery-1.583
ゴン-1.518
ukong-1.459
 Interest-1.454
ATURE-1.429
Token the
Token ID262
Feature activation+1.47e-02

Pos logit contributions
uates+0.732
龍�+0.721
+0.712
cos+0.696
ero+0.678
Neg logit contributions
attery-0.703
ゴン-0.670
 Interest-0.649
ukong-0.646
ATURE-0.636
Token story
Token ID1621
Feature activation-2.45e-01

Pos logit contributions
attery+0.041
ゴン+0.039
 Interest+0.039
ukong+0.038
ATURE+0.038
Neg logit contributions
uates-0.035
龍�-0.034
-0.034
cos-0.033
ero-0.032
Token.
Token ID13
Feature activation+1.14e-02

Pos logit contributions
uates+0.630
龍�+0.621
+0.614
cos+0.596
ero+0.582
Neg logit contributions
attery-0.634
ゴン-0.606
 Interest-0.588
ukong-0.584
ATURE-0.576