TOP ACTIVATIONS
MAX = 5.681

kill ), the smoother - talking Todd Akin ,
get the idea none the less . Number 1 I think
to the basket , the more essential it is that you
in the race , the less time Mrs . Clinton has
ian , we couldn 't resist the temptation to recall 1974
7 38 ac 5 f 6 a 443 f
an offensive line ), the more we re starting
org / index . php ? topic = 28 18 48
pe says , " the more refined your BS detector is
in the future , the more people know about the present
Tou hou series was also present during the meeting . He
deal to save the Def erred Action for Childhood Arri vals
He then happened to pull up a Texas A & M
so - called white - collar jobs . In fact ,
too far ." <|endoftext|> Metro Manila ( CNN Philippines )
/ Dem ocrat ) Buy Photo Sign ing Day
so - called white - collar jobs of managers , professionals
along he walks , the greater the destruction and injuries .
ian , we couldn 't resist the temptation to recall 1974
virtual creatures . ( Photo : KY LE GR AN TH

MIN ACTIVATIONS
MIN = -5.604

U . S ., by comparison , has 503 airports that
ahead . Follow HL TV . org 's @
$ 32 , for that matter ). But right now ,
Nation Be the first to hear about Nation Travels destinations ,
. parse _ args ()) # initialize a rectangular and square
OR PH _ RECT , ( 13 , 5 )) sq
defendants had been arrested on charges of aiding the PKK by
Madison St . Baltimore , MD , 212 01 .
( 21 , 21 )) # loop over the input image
term relationship has reached that point where , as Diana has
effectively . He reached that point last year after three decades
her shoulders , a to te bag hanging at her hip
($ FC 2 [ 0 ], $ FC 2 [ 1
OR PH _ RECT , ( 21 , 21 )) #
ranks of Ban u Q ura iza when Q ura ish
Fle ets vs Crypt Gh ast P rote an
met with the general in charge of the units accused of
next day . <|endoftext|> And things seem likely to get worse
($ FC 1 [ 0 ], $ FC 1 [ 1
_ full v ) { // Time component const float TA

INTERVAL 2.860 - 5.681
CONTAINS 0.326%

too far on Sound Cloud before hearing a sample buried beneath
w config . Now you can reboot your computer
, densely populated villages ab ut protected areas set aside for
and his friends should have thought twice before approaching it .
recognition technology : https :// www . eff .

INTERVAL 0.038 - 2.860
CONTAINS 47.974%

realize -- dating back to the 1950 s . The combo
Samurai are the same , only that they tend to appear
with a strong focus on the Internet of Things ( I
( 230 s )} a 3 { ( 98 s )}
calories also equals less glucose available for fuel , so the

INTERVAL -2.783 - 0.038
CONTAINS 51.287%

í k 41 ), Kr ?, Kur ? í k ,
records out . Cob ain hands me one ,
the carnage for now and take comfort in remembering that when
set of TM NT II ? What other characters might be
cooperate in their development and marketing efforts . In

INTERVAL -5.604 - -2.783
CONTAINS 0.413%

Custom arily , naming a VP and the subsequent
, but a person of interest in the Pentagon 's investigation
, M . ( 2001 ). Domin ance in early adolescent
( U BI ) or guaranteed minimum income ( G MI
ella Vol pe . At worse , it insults them .
Token kill
Token ID1494
Feature activation+2.48e-01

Pos logit contributions
 disclosing+0.608
 breach+0.591
 anomaly+0.587
 disg+0.578
 cous+0.574
Neg logit contributions
urban-0.628
uan-0.614
omon-0.599
ドラゴン-0.594
vin-0.578
Token
Token ID447
Feature activation+7.07e-01

Pos logit contributions
urban+0.195
uan+0.191
omon+0.186
ドラゴン+0.183
vin+0.181
Neg logit contributions
 disclosing-0.167
 anomaly-0.161
 breach-0.159
 cous-0.157
 disg-0.156
Token
Token ID251
Feature activation-4.05e-02

Pos logit contributions
urban+0.411
uan+0.404
omon+0.388
ドラゴン+0.379
vin+0.374
Neg logit contributions
 disclosing-0.619
 anomaly-0.601
 breach-0.597
 cous-0.592
 disg-0.590
Token),
Token ID828
Feature activation-4.34e-01

Pos logit contributions
 disclosing+0.028
 anomaly+0.027
 breach+0.027
 cous+0.027
 disg+0.027
Neg logit contributions
urban-0.031
uan-0.030
omon-0.029
ドラゴン-0.029
vin-0.028
Token the
Token ID262
Feature activation+3.90e-01

Pos logit contributions
 disclosing+0.314
 anomaly+0.302
 breach+0.300
 cous+0.295
 disg+0.294
Neg logit contributions
urban-0.321
uan-0.314
omon-0.307
ドラゴン-0.301
vin-0.296
Token smoother
Token ID32686
Feature activation+5.68e+00

Pos logit contributions
urban+0.291
uan+0.285
omon+0.278
ドラゴン+0.273
vin+0.269
Neg logit contributions
 disclosing-0.278
 anomaly-0.269
 breach-0.266
 cous-0.263
 disg-0.261
Token-
Token ID12
Feature activation+3.96e-01

Pos logit contributions
urban+4.124
uan+4.093
omon+3.902
ドラゴン+3.864
vin+3.863
Neg logit contributions
 disclosing-3.994
 anomaly-3.830
 breach-3.824
 cous-3.802
 ANGEL-3.787
Tokentalking
Token ID48186
Feature activation-1.79e-01

Pos logit contributions
urban+0.299
uan+0.294
omon+0.286
ドラゴン+0.280
vin+0.278
Neg logit contributions
 disclosing-0.279
 anomaly-0.270
 breach-0.267
 cous-0.264
 disg-0.262
Token Todd
Token ID14377
Feature activation-2.11e-01

Pos logit contributions
 disclosing+0.129
 anomaly+0.125
 breach+0.124
 cous+0.122
 disg+0.122
Neg logit contributions
urban-0.132
uan-0.129
omon-0.126
ドラゴン-0.124
vin-0.122
Token Akin
Token ID48857
Feature activation+2.40e+00

Pos logit contributions
 disclosing+0.162
 anomaly+0.156
 breach+0.155
 cous+0.153
 disg+0.153
Neg logit contributions
urban-0.146
uan-0.143
omon-0.139
ドラゴン-0.136
vin-0.134
Token,
Token ID11
Feature activation+2.83e-01

Pos logit contributions
urban+1.746
uan+1.717
omon+1.668
ドラゴン+1.641
vin+1.630
Neg logit contributions
 disclosing-1.746
 anomaly-1.678
 breach-1.669
 cous-1.653
 disg-1.648
Token get
Token ID651
Feature activation+4.12e-01

Pos logit contributions
urban+0.244
uan+0.241
omon+0.233
ドラゴン+0.227
vin+0.226
Neg logit contributions
 disclosing-0.287
 anomaly-0.278
 breach-0.275
 cous-0.273
 ANGEL-0.271
Token the
Token ID262
Feature activation+7.83e-02

Pos logit contributions
urban+0.317
uan+0.312
omon+0.303
ドラゴン+0.298
vin+0.295
Neg logit contributions
 disclosing-0.284
 anomaly-0.273
 breach-0.271
 cous-0.267
 disg-0.265
Token idea
Token ID2126
Feature activation-1.37e+00

Pos logit contributions
urban+0.061
uan+0.059
omon+0.058
ドラゴン+0.057
vin+0.056
Neg logit contributions
 disclosing-0.054
 anomaly-0.052
 breach-0.051
 cous-0.051
 disg-0.050
Token none
Token ID4844
Feature activation+4.71e-01

Pos logit contributions
 disclosing+0.968
 anomaly+0.936
 breach+0.928
 cous+0.912
 disg+0.907
Neg logit contributions
urban-1.033
uan-1.011
omon-0.988
ドラゴン-0.975
vin-0.957
Token the
Token ID262
Feature activation+2.31e+00

Pos logit contributions
urban+0.357
uan+0.349
omon+0.340
ドラゴン+0.336
vin+0.329
Neg logit contributions
 disclosing-0.331
 anomaly-0.319
 breach-0.317
 cous-0.312
 disg-0.310
Token less
Token ID1342
Feature activation+5.55e+00

Pos logit contributions
urban+1.513
uan+1.496
omon+1.440
ドラゴン+1.399
vin+1.398
Neg logit contributions
 disclosing-1.827
 anomaly-1.765
 breach-1.745
 disg-1.728
 cous-1.728
Token.
Token ID13
Feature activation-4.59e-01

Pos logit contributions
urban+3.962
uan+3.924
omon+3.774
vin+3.704
ドラゴン+3.654
Neg logit contributions
 disclosing-3.954
 anomaly-3.829
 breach-3.810
 ANGEL-3.767
 cous-3.737
TokenNumber
Token ID15057
Feature activation+6.54e-01

Pos logit contributions
 disclosing+0.317
 anomaly+0.305
 breach+0.302
 cous+0.299
 disg+0.296
Neg logit contributions
urban-0.353
uan-0.347
omon-0.338
ドラゴン-0.332
vin-0.328
Token 1
Token ID352
Feature activation-1.42e-01

Pos logit contributions
urban+0.528
uan+0.520
omon+0.507
ドラゴン+0.498
vin+0.493
Neg logit contributions
 disclosing-0.425
 anomaly-0.409
 breach-0.405
 cous-0.399
 disg-0.397
Token I
Token ID314
Feature activation+6.35e-01

Pos logit contributions
 disclosing+0.100
 anomaly+0.096
 breach+0.095
 cous+0.094
 disg+0.093
Neg logit contributions
urban-0.107
uan-0.105
omon-0.102
ドラゴン-0.100
vin-0.099
Token think
Token ID892
Feature activation-6.41e-01

Pos logit contributions
urban+0.461
uan+0.455
omon+0.441
ドラゴン+0.431
vin+0.429
Neg logit contributions
 disclosing-0.464
 anomaly-0.447
 breach-0.441
 cous-0.438
 ANGEL-0.436
Token to
Token ID284
Feature activation-1.52e+00

Pos logit contributions
 disclosing+0.337
 anomaly+0.326
 breach+0.323
 cous+0.319
 disg+0.318
Neg logit contributions
urban-0.262
uan-0.257
omon-0.248
ドラゴン-0.245
vin-0.238
Token the
Token ID262
Feature activation-6.73e-01

Pos logit contributions
 disclosing+1.019
 anomaly+0.980
 breach+0.971
 cous+0.958
 disg+0.953
Neg logit contributions
urban-1.201
uan-1.181
omon-1.151
ドラゴン-1.130
vin-1.118
Token basket
Token ID7988
Feature activation-6.91e-01

Pos logit contributions
 disclosing+0.453
 anomaly+0.436
 breach+0.432
 cous+0.426
 disg+0.424
Neg logit contributions
urban-0.529
uan-0.520
omon-0.507
ドラゴン-0.498
vin-0.492
Token,
Token ID11
Feature activation-6.33e-01

Pos logit contributions
 disclosing+0.478
 anomaly+0.463
 breach+0.458
 cous+0.450
 disg+0.447
Neg logit contributions
urban-0.531
uan-0.520
omon-0.505
ドラゴン-0.500
vin-0.490
Token the
Token ID262
Feature activation+6.09e-01

Pos logit contributions
 disclosing+0.418
 anomaly+0.401
 breach+0.397
 cous+0.392
 disg+0.392
Neg logit contributions
urban-0.509
uan-0.499
omon-0.485
ドラゴン-0.480
vin-0.474
Token more
Token ID517
Feature activation+5.43e+00

Pos logit contributions
urban+0.402
uan+0.395
omon+0.383
ドラゴン+0.373
vin+0.371
Neg logit contributions
 disclosing-0.483
 anomaly-0.466
 breach-0.463
 cous-0.459
 disg-0.457
Token essential
Token ID6393
Feature activation+4.91e-02

Pos logit contributions
urban+3.877
uan+3.845
omon+3.754
vin+3.622
ervation+3.577
Neg logit contributions
 disclosing-3.767
 disg-3.554
 ANGEL-3.547
 breach-3.544
 anomaly-3.516
Token it
Token ID340
Feature activation-9.74e-02

Pos logit contributions
urban+0.039
uan+0.039
omon+0.038
ドラゴン+0.037
vin+0.037
Neg logit contributions
 disclosing-0.032
 anomaly-0.031
 breach-0.031
 cous-0.030
 disg-0.030
Token is
Token ID318
Feature activation-9.35e-01

Pos logit contributions
 disclosing+0.072
 anomaly+0.070
 breach+0.069
 cous+0.068
 disg+0.068
Neg logit contributions
urban-0.070
uan-0.069
omon-0.067
ドラゴン-0.066
vin-0.065
Token that
Token ID326
Feature activation-2.75e-01

Pos logit contributions
 disclosing+0.733
 anomaly+0.707
 breach+0.700
 cous+0.691
 disg+0.688
Neg logit contributions
urban-0.636
uan-0.620
omon-0.602
ドラゴン-0.595
vin-0.583
Token you
Token ID345
Feature activation-9.41e-01

Pos logit contributions
 disclosing+0.146
 anomaly+0.139
 breach+0.137
 cous+0.135
 disg+0.134
Neg logit contributions
urban-0.255
uan-0.252
omon-0.246
ドラゴン-0.242
vin-0.240
Token in
Token ID287
Feature activation+7.45e-01

Pos logit contributions
 disclosing+0.880
 anomaly+0.846
 breach+0.836
 cous+0.825
 disg+0.819
Neg logit contributions
urban-0.876
uan-0.860
omon-0.840
ドラゴン-0.831
vin-0.806
Token the
Token ID262
Feature activation+1.01e+00

Pos logit contributions
urban+0.568
uan+0.557
omon+0.543
ドラゴン+0.533
vin+0.527
Neg logit contributions
 disclosing-0.519
 anomaly-0.500
 breach-0.495
 cous-0.490
 disg-0.487
Token race
Token ID3234
Feature activation+7.71e-01

Pos logit contributions
urban+0.849
uan+0.836
omon+0.819
ドラゴン+0.800
vin+0.794
Neg logit contributions
 disclosing-0.624
 anomaly-0.597
 cous-0.587
 breach-0.587
 disg-0.583
Token,
Token ID11
Feature activation+5.43e-01

Pos logit contributions
urban+0.589
uan+0.576
omon+0.562
ドラゴン+0.554
vin+0.545
Neg logit contributions
 disclosing-0.541
 anomaly-0.520
 breach-0.516
 cous-0.506
 disg-0.504
Token the
Token ID262
Feature activation+1.69e+00

Pos logit contributions
urban+0.427
uan+0.419
omon+0.408
ドラゴン+0.403
vin+0.396
Neg logit contributions
 disclosing-0.370
 anomaly-0.354
 breach-0.351
 cous-0.345
 disg-0.345
Token less
Token ID1342
Feature activation+5.23e+00

Pos logit contributions
urban+1.084
uan+1.066
omon+1.032
ドラゴン+1.004
vin+1.001
Neg logit contributions
 disclosing-1.362
 anomaly-1.316
 breach-1.303
 cous-1.301
 ANGEL-1.293
Token time
Token ID640
Feature activation-6.86e-01

Pos logit contributions
urban+3.516
uan+3.442
omon+3.291
vin+3.254
ドラゴン+3.180
Neg logit contributions
 disclosing-3.898
 ANGEL-3.800
 anomaly-3.755
 cous-3.751
 disg-3.742
Token Mrs
Token ID9074
Feature activation-1.14e+00

Pos logit contributions
 disclosing+0.553
 anomaly+0.535
 breach+0.530
 cous+0.520
 disg+0.519
Neg logit contributions
urban-0.452
uan-0.441
omon-0.430
ドラゴン-0.426
vin-0.415
Token.
Token ID13
Feature activation+9.58e-01

Pos logit contributions
 disclosing+0.825
 anomaly+0.796
 breach+0.790
 cous+0.778
 ANGEL+0.773
Neg logit contributions
urban-0.839
uan-0.823
omon-0.802
ドラゴン-0.788
vin-0.777
Token Clinton
Token ID2605
Feature activation+4.78e-01

Pos logit contributions
urban+0.733
uan+0.724
omon+0.702
ドラゴン+0.688
vin+0.683
Neg logit contributions
 disclosing-0.659
 anomaly-0.633
 breach-0.627
 disg-0.626
 cous-0.625
Token has
Token ID468
Feature activation-3.33e-01

Pos logit contributions
urban+0.355
uan+0.348
omon+0.339
ドラゴン+0.333
vin+0.328
Neg logit contributions
 disclosing-0.343
 anomaly-0.331
 breach-0.328
 cous-0.323
 disg-0.322
Tokenian
Token ID666
Feature activation+3.88e-02

Pos logit contributions
urban+1.210
uan+1.191
omon+1.160
ドラゴン+1.131
vin+1.124
Neg logit contributions
 disclosing-1.200
 anomaly-1.160
 breach-1.151
 cous-1.140
 disg-1.135
Token,
Token ID11
Feature activation+7.59e-01

Pos logit contributions
urban+0.029
uan+0.028
omon+0.027
ドラゴン+0.027
vin+0.027
Neg logit contributions
 disclosing-0.028
 anomaly-0.027
 breach-0.027
 cous-0.026
 disg-0.026
Token we
Token ID356
Feature activation+2.95e-01

Pos logit contributions
urban+0.553
uan+0.542
omon+0.527
ドラゴン+0.517
vin+0.511
Neg logit contributions
 disclosing-0.553
 anomaly-0.534
 breach-0.529
 cous-0.524
 disg-0.521
Token couldn
Token ID3521
Feature activation+5.15e-02

Pos logit contributions
urban+0.222
uan+0.218
omon+0.212
ドラゴン+0.208
vin+0.206
Neg logit contributions
 disclosing-0.209
 anomaly-0.201
 breach-0.199
 cous-0.197
 ANGEL-0.195
Token't
Token ID470
Feature activation+2.78e+00

Pos logit contributions
urban+0.041
uan+0.040
omon+0.039
ドラゴン+0.038
vin+0.038
Neg logit contributions
 disclosing-0.035
 anomaly-0.033
 breach-0.033
 cous-0.033
 disg-0.032
Token resist
Token ID4180
Feature activation+5.20e+00

Pos logit contributions
urban+1.506
uan+1.478
omon+1.425
vin+1.367
ドラゴン+1.347
Neg logit contributions
 disclosing-2.493
 anomaly-2.419
 ANGEL-2.389
 cous-2.380
 breach-2.363
Token the
Token ID262
Feature activation+2.90e-01

Pos logit contributions
urban+3.772
uan+3.704
omon+3.594
ドラゴン+3.544
vin+3.536
Neg logit contributions
 disclosing-3.517
 anomaly-3.496
 breach-3.459
 ANGEL-3.410
 cous-3.384
Token temptation
Token ID29062
Feature activation-3.65e-01

Pos logit contributions
urban+0.227
uan+0.223
omon+0.217
ドラゴン+0.214
vin+0.211
Neg logit contributions
 disclosing-0.197
 anomaly-0.189
 breach-0.188
 cous-0.185
 disg-0.184
Token to
Token ID284
Feature activation+8.15e-02

Pos logit contributions
 disclosing+0.298
 anomaly+0.289
 breach+0.287
 cous+0.283
 disg+0.282
Neg logit contributions
urban-0.236
uan-0.229
omon-0.221
ドラゴン-0.217
vin-0.213
Token recall
Token ID10014
Feature activation+1.45e+00

Pos logit contributions
urban+0.063
uan+0.062
omon+0.060
ドラゴン+0.059
vin+0.059
Neg logit contributions
 disclosing-0.056
 anomaly-0.053
 breach-0.053
 cous-0.052
 ANGEL-0.052
Token 1974
Token ID16489
Feature activation-1.66e-01

Pos logit contributions
urban+1.130
uan+1.111
omon+1.078
ドラゴン+1.064
vin+1.050
Neg logit contributions
 disclosing-0.969
 anomaly-0.933
 breach-0.926
 cous-0.915
 disg-0.910
Token
Token ID198
Feature activation+2.25e+00

Pos logit contributions
 disclosing+0.101
 anomaly+0.097
 breach+0.097
 cous+0.095
 disg+0.094
Neg logit contributions
urban-0.118
uan-0.115
omon-0.112
ドラゴン-0.111
vin-0.109
Token
Token ID198
Feature activation+7.94e-01

Pos logit contributions
urban+1.525
uan+1.500
omon+1.427
vin+1.410
ドラゴン+1.403
Neg logit contributions
 disclosing-1.784
 disg-1.732
 cous-1.716
 anomaly-1.711
 breach-1.685
Token7
Token ID22
Feature activation+8.11e-01

Pos logit contributions
urban+0.571
uan+0.560
omon+0.544
ドラゴン+0.534
vin+0.528
Neg logit contributions
 disclosing-0.589
 anomaly-0.569
 breach-0.564
 cous-0.558
 disg-0.555
Token38
Token ID2548
Feature activation+7.40e-01

Pos logit contributions
urban+0.584
uan+0.573
omon+0.557
ドラゴン+0.547
vin+0.539
Neg logit contributions
 disclosing-0.599
 anomaly-0.579
 breach-0.574
 cous-0.567
 disg-0.564
Tokenac
Token ID330
Feature activation+3.55e+00

Pos logit contributions
urban+0.539
uan+0.529
omon+0.515
ドラゴン+0.505
vin+0.499
Neg logit contributions
 disclosing-0.540
 anomaly-0.521
 breach-0.517
 cous-0.511
 disg-0.508
Token5
Token ID20
Feature activation+5.10e+00

Pos logit contributions
urban+2.538
uan+2.505
omon+2.433
vin+2.375
ドラゴン+2.357
Neg logit contributions
 disclosing-2.614
 anomaly-2.524
 breach-2.483
 ANGEL-2.477
 cous-2.465
Tokenf
Token ID69
Feature activation+3.08e+00

Pos logit contributions
urban+3.380
uan+3.324
omon+3.201
vin+3.135
ドラゴン+3.092
Neg logit contributions
 disclosing-3.961
 anomaly-3.847
 ANGEL-3.800
 breach-3.777
 disg-3.743
Token6
Token ID21
Feature activation+3.04e+00

Pos logit contributions
urban+2.261
uan+2.220
omon+2.169
ドラゴン+2.105
vin+2.102
Neg logit contributions
 disclosing-2.216
 anomaly-2.148
 breach-2.108
 ANGEL-2.108
 cous-2.093
Tokena
Token ID64
Feature activation+2.45e+00

Pos logit contributions
urban+2.051
uan+2.013
omon+1.949
vin+1.895
ドラゴン+1.893
Neg logit contributions
 disclosing-2.364
 anomaly-2.294
 breach-2.265
 ANGEL-2.257
 cous-2.234
Token443
Token ID34938
Feature activation+2.51e+00

Pos logit contributions
urban+1.811
uan+1.778
omon+1.733
ドラゴン+1.687
vin+1.675
Neg logit contributions
 disclosing-1.757
 anomaly-1.701
 breach-1.674
 ANGEL-1.667
 cous-1.657
Tokenf
Token ID69
Feature activation+3.43e-01

Pos logit contributions
urban+1.675
uan+1.646
omon+1.591
vin+1.548
ドラゴン+1.544
Neg logit contributions
 disclosing-1.969
 anomaly-1.917
 breach-1.887
 ANGEL-1.882
 cous-1.865
Token an
Token ID281
Feature activation+9.79e-01

Pos logit contributions
urban+1.540
uan+1.515
omon+1.473
ドラゴン+1.438
vin+1.436
Neg logit contributions
 disclosing-1.488
 anomaly-1.429
 breach-1.425
 cous-1.404
 disg-1.403
Token offensive
Token ID5859
Feature activation-1.47e+00

Pos logit contributions
urban+0.783
uan+0.768
omon+0.751
ドラゴン+0.724
vin+0.721
Neg logit contributions
 disclosing-0.644
 breach-0.616
 anomaly-0.610
 cous-0.608
 disg-0.608
Token line
Token ID1627
Feature activation-1.25e-01

Pos logit contributions
 disclosing+0.959
 anomaly+0.928
 breach+0.920
 cous+0.900
 disg+0.894
Neg logit contributions
urban-1.187
uan-1.159
omon-1.136
ドラゴン-1.122
vin-1.101
Token),
Token ID828
Feature activation-5.40e-01

Pos logit contributions
 disclosing+0.084
 anomaly+0.081
 breach+0.080
 cous+0.078
 disg+0.078
Neg logit contributions
urban-0.099
uan-0.097
omon-0.095
ドラゴン-0.094
vin-0.092
Token the
Token ID262
Feature activation+8.22e-01

Pos logit contributions
 disclosing+0.353
 anomaly+0.340
 breach+0.337
 cous+0.330
 disg+0.329
Neg logit contributions
urban-0.434
uan-0.425
omon-0.413
ドラゴン-0.412
vin-0.403
Token more
Token ID517
Feature activation+5.10e+00

Pos logit contributions
urban+0.541
uan+0.530
omon+0.514
ドラゴン+0.503
vin+0.497
Neg logit contributions
 disclosing-0.657
 anomaly-0.636
 breach-0.631
 cous-0.625
 disg-0.622
Token we
Token ID356
Feature activation+1.27e-02

Pos logit contributions
urban+3.445
uan+3.404
omon+3.279
vin+3.190
ervation+3.143
Neg logit contributions
 disclosing-3.792
 breach-3.628
 ANGEL-3.626
 disg-3.621
 anomaly-3.612
Token
Token ID447
Feature activation-4.09e-01

Pos logit contributions
urban+0.009
uan+0.009
omon+0.009
ドラゴン+0.009
vin+0.009
Neg logit contributions
 disclosing-0.009
 anomaly-0.009
 breach-0.009
 cous-0.009
 disg-0.009
Token
Token ID247
Feature activation-3.39e-01

Pos logit contributions
 disclosing+0.355
 anomaly+0.345
 breach+0.343
 cous+0.337
 ANGEL+0.337
Neg logit contributions
urban-0.242
uan-0.234
omon-0.228
ドラゴン-0.223
vin-0.218
Tokenre
Token ID260
Feature activation-2.00e+00

Pos logit contributions
 disclosing+0.248
 breach+0.240
 anomaly+0.240
 disg+0.233
 cous+0.233
Neg logit contributions
urban-0.246
uan-0.239
omon-0.235
ドラゴン-0.231
vin-0.224
Token starting
Token ID3599
Feature activation-2.25e+00

Pos logit contributions
 disclosing+1.473
 anomaly+1.411
 breach+1.400
 cous+1.377
 disg+1.366
Neg logit contributions
urban-1.443
uan-1.411
ドラゴン-1.385
omon-1.379
vin-1.325
Tokenorg
Token ID2398
Feature activation-3.47e-01

Pos logit contributions
 disclosing+0.507
 anomaly+0.492
 breach+0.488
 cous+0.486
 disg+0.484
Neg logit contributions
urban-0.342
uan-0.334
omon-0.321
ドラゴン-0.313
vin-0.311
Token/
Token ID14
Feature activation+5.76e-01

Pos logit contributions
 disclosing+0.236
 anomaly+0.227
 breach+0.225
 cous+0.224
 disg+0.223
Neg logit contributions
urban-0.270
uan-0.266
omon-0.259
ドラゴン-0.254
vin-0.252
Tokenindex
Token ID9630
Feature activation-1.44e-01

Pos logit contributions
urban+0.398
uan+0.390
omon+0.379
ドラゴン+0.371
vin+0.366
Neg logit contributions
 disclosing-0.443
 anomaly-0.429
 breach-0.425
 cous-0.420
 disg-0.418
Token.
Token ID13
Feature activation+1.89e+00

Pos logit contributions
 disclosing+0.110
 anomaly+0.106
 disg+0.106
 cous+0.105
 breach+0.105
Neg logit contributions
urban-0.101
uan-0.099
omon-0.095
vin-0.093
ドラゴン-0.092
Tokenphp
Token ID10121
Feature activation+3.26e+00

Pos logit contributions
urban+1.401
uan+1.360
omon+1.319
vin+1.301
ervation+1.271
Neg logit contributions
 disclosing-1.388
 anomaly-1.348
 cous-1.346
 disg-1.337
 breach-1.306
Token?
Token ID30
Feature activation+5.08e+00

Pos logit contributions
urban+2.953
uan+2.932
vin+2.804
omon+2.804
ervation+2.787
Neg logit contributions
 disclosing-1.756
 cous-1.728
 disg-1.717
 anomaly-1.634
 breach-1.593
Tokentopic
Token ID26652
Feature activation+1.72e+00

Pos logit contributions
urban+3.038
uan+2.926
omon+2.802
vin+2.776
ドラゴン+2.700
Neg logit contributions
 disclosing-4.354
 cous-4.251
 anomaly-4.248
 disg-4.210
 ANGEL-4.194
Token=
Token ID28
Feature activation+3.82e+00

Pos logit contributions
urban+1.247
uan+1.236
omon+1.178
ドラゴン+1.163
vin+1.158
Neg logit contributions
 disclosing-1.268
 disg-1.231
 anomaly-1.218
 cous-1.209
 breach-1.189
Token28
Token ID2078
Feature activation+2.01e+00

Pos logit contributions
urban+2.634
uan+2.595
ドラゴン+2.482
omon+2.467
vin+2.442
Neg logit contributions
 disclosing-2.926
 anomaly-2.821
 disg-2.806
 cous-2.798
 breach-2.782
Token18
Token ID1507
Feature activation+2.08e+00

Pos logit contributions
urban+1.325
uan+1.299
omon+1.248
ドラゴン+1.231
vin+1.217
Neg logit contributions
 disclosing-1.613
 anomaly-1.562
 breach-1.542
 cous-1.540
 disg-1.533
Token48
Token ID2780
Feature activation+2.75e+00

Pos logit contributions
urban+1.399
uan+1.366
omon+1.313
ドラゴン+1.297
vin+1.285
Neg logit contributions
 disclosing-1.643
 anomaly-1.591
 cous-1.569
 disg-1.568
 breach-1.567
Tokenpe
Token ID431
Feature activation-3.18e-01

Pos logit contributions
 disclosing+0.061
 anomaly+0.058
 breach+0.058
 disg+0.057
 cous+0.057
Neg logit contributions
urban-0.100
uan-0.099
omon-0.096
ドラゴン-0.094
vin-0.094
Token says
Token ID1139
Feature activation-4.87e-01

Pos logit contributions
 disclosing+0.222
 anomaly+0.214
 breach+0.212
 cous+0.210
 disg+0.208
Neg logit contributions
urban-0.243
uan-0.237
omon-0.231
ドラゴン-0.228
vin-0.224
Token,
Token ID11
Feature activation-2.68e-01

Pos logit contributions
 disclosing+0.337
 anomaly+0.326
 breach+0.321
 cous+0.317
 disg+0.314
Neg logit contributions
urban-0.374
uan-0.366
omon-0.355
ドラゴン-0.354
vin-0.345
Token "
Token ID366
Feature activation-1.19e+00

Pos logit contributions
 disclosing+0.187
 anomaly+0.179
 breach+0.178
 cous+0.175
 disg+0.174
Neg logit contributions
urban-0.205
uan-0.202
omon-0.196
ドラゴン-0.194
vin-0.191
Tokenthe
Token ID1169
Feature activation-5.18e-01

Pos logit contributions
 disclosing+0.719
 anomaly+0.686
 breach+0.680
 cous+0.675
 disg+0.664
Neg logit contributions
urban-0.990
uan-0.984
omon-0.961
ドラゴン-0.951
vin-0.930
Token more
Token ID517
Feature activation+5.05e+00

Pos logit contributions
 disclosing+0.411
 anomaly+0.398
 breach+0.395
 cous+0.390
 disg+0.388
Neg logit contributions
urban-0.346
uan-0.338
omon-0.328
ドラゴン-0.322
vin-0.317
Token refined
Token ID20449
Feature activation+4.20e-01

Pos logit contributions
urban+3.702
uan+3.605
omon+3.502
vin+3.468
ervation+3.389
Neg logit contributions
 disclosing-3.385
 ANGEL-3.267
 anomaly-3.235
 breach-3.231
 disg-3.228
Token your
Token ID534
Feature activation-1.87e+00

Pos logit contributions
urban+0.378
uan+0.373
omon+0.365
ドラゴン+0.358
vin+0.356
Neg logit contributions
 disclosing-0.233
 anomaly-0.223
 breach-0.221
 cous-0.217
 disg-0.216
Token BS
Token ID24218
Feature activation-3.01e+00

Pos logit contributions
 disclosing+1.383
 anomaly+1.333
 breach+1.328
 cous+1.303
 ANGEL+1.290
Neg logit contributions
urban-1.364
uan-1.335
omon-1.302
ドラゴン-1.284
vin-1.261
Token detector
Token ID31029
Feature activation+5.64e-01

Pos logit contributions
 disclosing+2.134
 anomaly+2.072
 breach+2.048
 cous+2.032
 ANGEL+2.007
Neg logit contributions
urban-2.225
uan-2.179
omon-2.116
ドラゴン-2.113
-2.056
Token is
Token ID318
Feature activation-1.42e+00

Pos logit contributions
urban+0.402
uan+0.396
omon+0.384
ドラゴン+0.376
vin+0.372
Neg logit contributions
 disclosing-0.419
 anomaly-0.405
 breach-0.402
 cous-0.396
 disg-0.395
Token in
Token ID287
Feature activation+1.38e-01

Pos logit contributions
 disclosing+0.197
 anomaly+0.190
 breach+0.189
 cous+0.187
 disg+0.186
Neg logit contributions
urban-0.206
uan-0.203
omon-0.197
ドラゴン-0.194
vin-0.191
Token the
Token ID262
Feature activation+6.90e-01

Pos logit contributions
urban+0.106
uan+0.104
omon+0.101
ドラゴン+0.100
vin+0.099
Neg logit contributions
 disclosing-0.095
 anomaly-0.091
 breach-0.090
 cous-0.090
 ANGEL-0.089
Token future
Token ID2003
Feature activation+4.00e-01

Pos logit contributions
urban+0.514
uan+0.505
omon+0.490
ドラゴン+0.484
vin+0.477
Neg logit contributions
 disclosing-0.489
 anomaly-0.467
 cous-0.462
 breach-0.462
 disg-0.460
Token,
Token ID11
Feature activation-4.36e-01

Pos logit contributions
urban+0.297
uan+0.291
omon+0.283
ドラゴン+0.278
vin+0.275
Neg logit contributions
 disclosing-0.287
 anomaly-0.277
 breach-0.275
 cous-0.272
 disg-0.270
Token the
Token ID262
Feature activation+6.18e-01

Pos logit contributions
 disclosing+0.291
 anomaly+0.277
 breach+0.275
 cous+0.272
 disg+0.271
Neg logit contributions
urban-0.349
uan-0.343
omon-0.334
ドラゴン-0.328
vin-0.324
Token more
Token ID517
Feature activation+5.01e+00

Pos logit contributions
urban+0.408
uan+0.400
omon+0.388
ドラゴン+0.380
vin+0.375
Neg logit contributions
 disclosing-0.493
 anomaly-0.476
 breach-0.473
 cous-0.468
 disg-0.466
Token people
Token ID661
Feature activation-7.29e-01

Pos logit contributions
urban+3.721
uan+3.658
omon+3.511
vin+3.497
ドラゴン+3.497
Neg logit contributions
 disclosing-3.375
 ANGEL-3.243
 breach-3.203
 disg-3.202
 anomaly-3.196
Token know
Token ID760
Feature activation-7.75e-01

Pos logit contributions
 disclosing+0.505
 anomaly+0.486
 breach+0.483
 cous+0.474
 disg+0.473
Neg logit contributions
urban-0.562
uan-0.550
omon-0.537
ドラゴン-0.530
vin-0.518
Token about
Token ID546
Feature activation+5.30e-01

Pos logit contributions
 disclosing+0.529
 anomaly+0.509
 breach+0.505
 cous+0.497
 disg+0.494
Neg logit contributions
urban-0.602
uan-0.591
omon-0.577
ドラゴン-0.567
vin-0.559
Token the
Token ID262
Feature activation-3.39e-01

Pos logit contributions
urban+0.413
uan+0.406
omon+0.394
ドラゴン+0.390
vin+0.385
Neg logit contributions
 disclosing-0.357
 anomaly-0.344
 breach-0.340
 cous-0.337
 disg-0.337
Token present
Token ID1944
Feature activation+1.06e+00

Pos logit contributions
 disclosing+0.227
 anomaly+0.217
 breach+0.215
 cous+0.213
 disg+0.212
Neg logit contributions
urban-0.267
uan-0.263
omon-0.256
ドラゴン-0.252
vin-0.249
Token Tou
Token ID23359
Feature activation+1.23e+00

Pos logit contributions
urban+0.099
uan+0.098
omon+0.095
ドラゴン+0.094
vin+0.092
Neg logit contributions
 disclosing-0.092
 anomaly-0.089
 breach-0.088
 cous-0.087
 disg-0.086
Tokenhou
Token ID15710
Feature activation+1.46e+00

Pos logit contributions
urban+0.633
uan+0.616
omon+0.592
ドラゴン+0.573
vin+0.569
Neg logit contributions
 disclosing-1.161
 anomaly-1.129
 breach-1.119
 cous-1.112
 disg-1.106
Token series
Token ID2168
Feature activation-1.94e-02

Pos logit contributions
urban+1.132
uan+1.108
omon+1.083
ドラゴン+1.063
vin+1.056
Neg logit contributions
 disclosing-0.999
 anomaly-0.957
 breach-0.950
 cous-0.940
 disg-0.935
Token was
Token ID373
Feature activation+1.40e+00

Pos logit contributions
 disclosing+0.014
 anomaly+0.013
 breach+0.013
 cous+0.013
 disg+0.013
Neg logit contributions
urban-0.014
uan-0.014
omon-0.014
ドラゴン-0.014
vin-0.013
Token also
Token ID635
Feature activation+1.54e+00

Pos logit contributions
urban+1.073
uan+1.055
omon+1.027
ドラゴン+1.008
vin+1.001
Neg logit contributions
 disclosing-0.962
 anomaly-0.933
 breach-0.926
 cous-0.912
 ANGEL-0.905
Token present
Token ID1944
Feature activation+4.93e+00

Pos logit contributions
urban+1.195
uan+1.173
omon+1.143
ドラゴン+1.122
vin+1.115
Neg logit contributions
 disclosing-1.046
 anomaly-1.012
 breach-1.005
 cous-0.989
 ANGEL-0.982
Token during
Token ID1141
Feature activation+1.02e+00

Pos logit contributions
urban+3.112
uan+3.060
omon+2.964
vin+2.895
ドラゴン+2.887
Neg logit contributions
 disclosing-3.935
 anomaly-3.848
 breach-3.846
 cous-3.815
 disg-3.772
Token the
Token ID262
Feature activation+5.41e-01

Pos logit contributions
urban+0.773
uan+0.757
omon+0.740
ドラゴン+0.726
vin+0.718
Neg logit contributions
 disclosing-0.708
 anomaly-0.682
 breach-0.677
 cous-0.668
 disg-0.665
Token meeting
Token ID3249
Feature activation+1.44e+00

Pos logit contributions
urban+0.419
uan+0.411
omon+0.402
ドラゴン+0.394
vin+0.390
Neg logit contributions
 disclosing-0.368
 anomaly-0.354
 breach-0.351
 cous-0.347
 disg-0.345
Token.
Token ID13
Feature activation-2.08e-01

Pos logit contributions
urban+1.052
uan+1.028
omon+1.006
ドラゴン+0.985
vin+0.977
Neg logit contributions
 disclosing-1.034
 anomaly-1.003
 breach-0.995
 cous-0.988
 disg-0.980
Token He
Token ID679
Feature activation+4.61e-01

Pos logit contributions
 disclosing+0.145
 anomaly+0.140
 breach+0.139
 cous+0.137
 disg+0.136
Neg logit contributions
urban-0.158
uan-0.155
omon-0.151
ドラゴン-0.148
vin-0.146
Token deal
Token ID1730
Feature activation-7.43e-01

Pos logit contributions
 disclosing+0.391
 anomaly+0.377
 breach+0.377
 disg+0.366
 cous+0.365
Neg logit contributions
urban-0.465
uan-0.454
omon-0.445
ドラゴン-0.436
vin-0.431
Token to
Token ID284
Feature activation-6.66e-02

Pos logit contributions
 disclosing+0.532
 anomaly+0.508
 breach+0.507
 cous+0.496
 disg+0.494
Neg logit contributions
urban-0.557
uan-0.546
omon-0.534
ドラゴン-0.523
vin-0.514
Token save
Token ID3613
Feature activation+3.17e-01

Pos logit contributions
 disclosing+0.049
 anomaly+0.047
 breach+0.046
 cous+0.046
 disg+0.046
Neg logit contributions
urban-0.048
uan-0.048
omon-0.046
ドラゴン-0.045
vin-0.045
Token the
Token ID262
Feature activation-3.31e-01

Pos logit contributions
urban+0.238
uan+0.233
omon+0.227
ドラゴン+0.223
vin+0.220
Neg logit contributions
 disclosing-0.225
 anomaly-0.217
 breach-0.215
 cous-0.212
 disg-0.211
Token Def
Token ID2896
Feature activation+1.89e+00

Pos logit contributions
 disclosing+0.231
 anomaly+0.223
 breach+0.221
 cous+0.217
 disg+0.216
Neg logit contributions
urban-0.252
uan-0.247
omon-0.241
ドラゴン-0.237
vin-0.234
Tokenerred
Token ID17436
Feature activation+4.88e+00

Pos logit contributions
urban+0.911
uan+0.884
omon+0.848
ドラゴン+0.827
vin+0.807
Neg logit contributions
 disclosing-1.840
 anomaly-1.791
 breach-1.781
 cous-1.765
 disg-1.755
Token Action
Token ID7561
Feature activation-2.55e-01

Pos logit contributions
urban+3.071
uan+3.062
omon+2.929
vin+2.871
ドラゴン+2.838
Neg logit contributions
 disclosing-4.007
 cous-3.849
 anomaly-3.843
 disg-3.788
 breach-3.779
Token for
Token ID329
Feature activation+6.49e-01

Pos logit contributions
 disclosing+0.191
 anomaly+0.187
 breach+0.186
 disg+0.180
 ANGEL+0.180
Neg logit contributions
urban-0.179
uan-0.173
omon-0.170
ドラゴン-0.170
vin-0.164
Token Childhood
Token ID40992
Feature activation+8.56e-01

Pos logit contributions
urban+0.192
uan+0.181
omon+0.169
ドラゴン+0.163
vin+0.156
Neg logit contributions
 disclosing-0.756
 anomaly-0.739
 breach-0.737
 cous-0.728
 disg-0.726
Token Arri
Token ID27350
Feature activation+2.04e+00

Pos logit contributions
urban+0.605
uan+0.590
omon+0.577
ドラゴン+0.566
vin+0.555
Neg logit contributions
 disclosing-0.644
 anomaly-0.626
 breach-0.622
 cous-0.608
 ANGEL-0.607
Tokenvals
Token ID12786
Feature activation-3.01e-01

Pos logit contributions
urban+1.661
uan+1.652
omon+1.602
vin+1.569
ドラゴン+1.550
Neg logit contributions
 disclosing-1.325
 anomaly-1.266
 disg-1.248
 breach-1.247
 cous-1.239
Token He
Token ID679
Feature activation-2.13e-01

Pos logit contributions
urban+0.416
uan+0.408
omon+0.397
ドラゴン+0.391
vin+0.386
Neg logit contributions
 disclosing-0.383
 anomaly-0.369
 breach-0.366
 cous-0.361
 disg-0.360
Token then
Token ID788
Feature activation+2.26e-01

Pos logit contributions
 disclosing+0.146
 anomaly+0.141
 breach+0.140
 cous+0.138
 disg+0.137
Neg logit contributions
urban-0.164
uan-0.161
omon-0.157
ドラゴン-0.155
vin-0.152
Token happened
Token ID3022
Feature activation+7.01e-01

Pos logit contributions
urban+0.175
uan+0.172
omon+0.167
ドラゴン+0.164
vin+0.163
Neg logit contributions
 disclosing-0.155
 anomaly-0.150
 breach-0.148
 cous-0.146
 disg-0.145
Token to
Token ID284
Feature activation+2.24e-01

Pos logit contributions
urban+0.536
uan+0.523
omon+0.513
ドラゴン+0.505
vin+0.492
Neg logit contributions
 disclosing-0.489
 anomaly-0.471
 breach-0.466
 cous-0.460
 disg-0.458
Token pull
Token ID2834
Feature activation+4.08e-01

Pos logit contributions
urban+0.167
uan+0.165
omon+0.160
ドラゴン+0.156
vin+0.156
Neg logit contributions
 disclosing-0.158
 anomaly-0.152
 breach-0.150
 cous-0.149
 ANGEL-0.148
Token up
Token ID510
Feature activation+4.87e+00

Pos logit contributions
urban+0.304
uan+0.298
omon+0.290
ドラゴン+0.285
vin+0.282
Neg logit contributions
 disclosing-0.291
 anomaly-0.280
 breach-0.278
 cous-0.274
 disg-0.273
Token a
Token ID257
Feature activation-1.40e-01

Pos logit contributions
urban+3.254
uan+3.229
omon+3.091
vin+3.059
ドラゴン+3.022
Neg logit contributions
 disclosing-3.707
 anomaly-3.604
 cous-3.580
 breach-3.575
 disg-3.539
Token Texas
Token ID3936
Feature activation-1.46e+00

Pos logit contributions
 disclosing+0.101
 anomaly+0.098
 breach+0.097
 cous+0.096
 disg+0.095
Neg logit contributions
urban-0.103
uan-0.101
omon-0.098
ドラゴン-0.097
vin-0.095
Token A
Token ID317
Feature activation+8.11e-02

Pos logit contributions
 disclosing+1.038
 anomaly+1.007
 breach+0.995
 cous+0.980
 ANGEL+0.976
Neg logit contributions
urban-1.093
uan-1.071
omon-1.048
ドラゴン-1.035
vin-1.011
Token&
Token ID5
Feature activation+3.49e+00

Pos logit contributions
urban+0.076
uan+0.075
omon+0.074
ドラゴン+0.073
vin+0.072
Neg logit contributions
 disclosing-0.042
 anomaly-0.040
 breach-0.039
 cous-0.039
 disg-0.039
TokenM
Token ID44
Feature activation-2.04e+00

Pos logit contributions
urban+1.504
uan+1.461
ドラゴン+1.298
vin+1.294
omon+1.288
Neg logit contributions
 disclosing-3.621
 disg-3.560
 cous-3.546
 anomaly-3.511
 breach-3.423
Token so
Token ID523
Feature activation-8.83e-01

Pos logit contributions
urban+0.209
uan+0.205
omon+0.200
ドラゴン+0.198
vin+0.194
Neg logit contributions
 disclosing-0.215
 anomaly-0.208
 breach-0.207
 cous-0.203
 disg-0.202
Token-
Token ID12
Feature activation-7.56e-01

Pos logit contributions
 disclosing+0.603
 anomaly+0.578
 breach+0.572
 cous+0.563
 disg+0.558
Neg logit contributions
urban-0.682
uan-0.674
omon-0.656
ドラゴン-0.651
vin-0.634
Tokencalled
Token ID7174
Feature activation-1.14e+00

Pos logit contributions
 disclosing+0.536
 anomaly+0.517
 breach+0.513
 cous+0.506
 disg+0.502
Neg logit contributions
urban-0.565
uan-0.555
omon-0.540
ドラゴン-0.532
vin-0.523
Token white
Token ID2330
Feature activation-5.22e-02

Pos logit contributions
 disclosing+0.815
 anomaly+0.790
 breach+0.783
 cous+0.764
 disg+0.760
Neg logit contributions
urban-0.846
uan-0.836
omon-0.814
ドラゴン-0.804
vin-0.790
Token-
Token ID12
Feature activation+1.26e+00

Pos logit contributions
 disclosing+0.034
 anomaly+0.033
 breach+0.032
 cous+0.032
 disg+0.032
Neg logit contributions
urban-0.042
uan-0.042
omon-0.041
ドラゴン-0.040
vin-0.039
Tokencollar
Token ID37676
Feature activation+4.86e+00

Pos logit contributions
urban+0.961
uan+0.940
omon+0.915
ドラゴン+0.894
vin+0.889
Neg logit contributions
 disclosing-0.886
 anomaly-0.855
 breach-0.846
 cous-0.837
 disg-0.833
Token jobs
Token ID3946
Feature activation+1.82e+00

Pos logit contributions
urban+3.285
uan+3.232
omon+3.090
vin+3.008
ドラゴン+2.978
Neg logit contributions
 disclosing-3.723
 cous-3.565
 anomaly-3.552
 disg-3.535
 ANGEL-3.531
Token.
Token ID13
Feature activation+2.83e-01

Pos logit contributions
urban+1.321
uan+1.294
omon+1.256
ドラゴン+1.222
vin+1.218
Neg logit contributions
 disclosing-1.329
 anomaly-1.285
 breach-1.272
 cous-1.269
 ANGEL-1.259
Token In
Token ID554
Feature activation+2.65e-02

Pos logit contributions
urban+0.225
uan+0.221
omon+0.216
ドラゴン+0.212
vin+0.210
Neg logit contributions
 disclosing-0.188
 anomaly-0.181
 breach-0.179
 cous-0.177
 disg-0.176
Token fact
Token ID1109
Feature activation-5.38e-01

Pos logit contributions
urban+0.020
uan+0.019
omon+0.019
ドラゴン+0.019
vin+0.018
Neg logit contributions
 disclosing-0.019
 anomaly-0.018
 breach-0.018
 cous-0.018
 disg-0.018
Token,
Token ID11
Feature activation+3.56e-01

Pos logit contributions
 disclosing+0.381
 anomaly+0.368
 breach+0.365
 cous+0.358
 ANGEL+0.355
Neg logit contributions
urban-0.404
uan-0.396
omon-0.386
ドラゴン-0.381
vin-0.374
Token too
Token ID1165
Feature activation-8.14e-01

Pos logit contributions
 disclosing+0.313
 anomaly+0.303
 breach+0.300
 cous+0.297
 disg+0.295
Neg logit contributions
urban-0.283
uan-0.278
omon-0.270
ドラゴン-0.265
vin-0.261
Token far
Token ID1290
Feature activation-5.37e-01

Pos logit contributions
 disclosing+0.522
 anomaly+0.500
 breach+0.498
 cous+0.488
 disg+0.484
Neg logit contributions
urban-0.669
uan-0.655
omon-0.645
ドラゴン-0.632
vin-0.621
Token."
Token ID526
Feature activation+1.98e-01

Pos logit contributions
 disclosing+0.366
 anomaly+0.351
 breach+0.350
 cous+0.342
 disg+0.341
Neg logit contributions
urban-0.419
uan-0.410
omon-0.401
ドラゴン-0.396
vin-0.387
Token<|endoftext|>
Token ID50256
Feature activation-1.03e+00

Pos logit contributions
urban+0.147
uan+0.144
omon+0.140
ドラゴン+0.138
vin+0.136
Neg logit contributions
 disclosing-0.142
 anomaly-0.137
 breach-0.136
 cous-0.134
 disg-0.133
TokenMetro
Token ID45141
Feature activation-1.91e-01

Pos logit contributions
 disclosing+0.778
 anomaly+0.753
 breach+0.747
 cous+0.737
 ANGEL+0.733
Neg logit contributions
urban-0.721
uan-0.710
omon-0.692
ドラゴン-0.678
vin-0.666
Token Manila
Token ID31721
Feature activation+4.85e+00

Pos logit contributions
 disclosing+0.143
 anomaly+0.138
 breach+0.137
 cous+0.135
 disg+0.135
Neg logit contributions
urban-0.135
uan-0.133
omon-0.129
ドラゴン-0.127
vin-0.125
Token (
Token ID357
Feature activation-1.13e+00

Pos logit contributions
urban+3.419
uan+3.383
omon+3.228
vin+3.154
ドラゴン+3.118
Neg logit contributions
 disclosing-3.564
 anomaly-3.420
 breach-3.396
 cous-3.392
 disg-3.386
TokenCNN
Token ID18474
Feature activation-3.87e-01

Pos logit contributions
 disclosing+0.967
 anomaly+0.938
 breach+0.932
 cous+0.921
 disg+0.918
Neg logit contributions
urban-0.679
uan-0.663
omon-0.643
ドラゴン-0.628
vin-0.617
Token Philippines
Token ID13316
Feature activation-3.93e-01

Pos logit contributions
 disclosing+0.291
 anomaly+0.280
 breach+0.278
 cous+0.275
 disg+0.274
Neg logit contributions
urban-0.274
uan-0.270
omon-0.262
ドラゴン-0.257
vin-0.253
Token)
Token ID8
Feature activation+8.61e-01

Pos logit contributions
 disclosing+0.270
 anomaly+0.260
 breach+0.258
 cous+0.254
 disg+0.253
Neg logit contributions
urban-0.304
uan-0.298
omon-0.291
ドラゴン-0.286
vin-0.282
Token
Token ID851
Feature activation-5.54e-01

Pos logit contributions
urban+0.602
uan+0.592
omon+0.573
ドラゴン+0.562
vin+0.555
Neg logit contributions
 disclosing-0.654
 anomaly-0.632
 breach-0.627
 cous-0.620
 disg-0.618
Token/
Token ID14
Feature activation-4.48e-01

Pos logit contributions
urban+0.393
uan+0.386
omon+0.376
ドラゴン+0.371
vin+0.365
Neg logit contributions
 disclosing-0.346
 anomaly-0.334
 breach-0.331
 cous-0.326
 disg-0.324
TokenDem
Token ID11522
Feature activation-1.35e+00

Pos logit contributions
 disclosing+0.325
 anomaly+0.314
 breach+0.311
 cous+0.308
 disg+0.305
Neg logit contributions
urban-0.330
uan-0.323
omon-0.315
ドラゴン-0.309
vin-0.305
Tokenocrat
Token ID35128
Feature activation-2.23e-02

Pos logit contributions
 disclosing+0.818
 anomaly+0.781
 breach+0.778
 cous+0.760
 disg+0.759
Neg logit contributions
urban-1.146
uan-1.127
omon-1.099
ドラゴン-1.092
vin-1.068
Token)
Token ID8
Feature activation+2.08e+00

Pos logit contributions
 disclosing+0.015
 anomaly+0.015
 breach+0.015
 cous+0.015
 disg+0.014
Neg logit contributions
urban-0.017
uan-0.017
omon-0.016
ドラゴン-0.016
vin-0.016
TokenBuy
Token ID14518
Feature activation+2.77e+00

Pos logit contributions
urban+0.932
uan+0.904
omon+0.867
ドラゴン+0.836
vin+0.812
Neg logit contributions
 disclosing-2.102
 anomaly-2.049
 breach-2.040
 ANGEL-2.014
 cous-2.011
Token Photo
Token ID5555
Feature activation+4.74e+00

Pos logit contributions
urban+1.359
uan+1.347
omon+1.237
ドラゴン+1.211
vin+1.192
Neg logit contributions
 disclosing-2.675
 disg-2.626
 cous-2.588
 anomaly-2.584
 breach-2.552
Token
Token ID198
Feature activation+4.26e+00

Pos logit contributions
urban+3.160
uan+3.160
omon+2.970
ドラゴン+2.952
vin+2.916
Neg logit contributions
 disclosing-3.684
 anomaly-3.564
 cous-3.555
 disg-3.529
 breach-3.499
Token
Token ID198
Feature activation+3.93e+00

Pos logit contributions
urban+2.696
uan+2.643
vin+2.473
omon+2.452
ドラゴン+2.439
Neg logit contributions
 disclosing-3.548
 disg-3.532
 cous-3.447
 anomaly-3.393
 breach-3.330
TokenSign
Token ID11712
Feature activation-1.65e+00

Pos logit contributions
urban+2.737
uan+2.646
omon+2.554
vin+2.523
ドラゴン+2.516
Neg logit contributions
 disclosing-3.054
 anomaly-2.946
 breach-2.927
 cous-2.918
 disg-2.916
Tokening
Token ID278
Feature activation-7.34e-01

Pos logit contributions
 disclosing+1.227
 anomaly+1.184
 breach+1.177
 cous+1.158
 disg+1.151
Neg logit contributions
urban-1.188
uan-1.162
omon-1.133
ドラゴン-1.116
vin-1.095
Token Day
Token ID3596
Feature activation-9.03e-01

Pos logit contributions
 disclosing+0.552
 anomaly+0.532
 breach+0.529
 cous+0.519
 ANGEL+0.519
Neg logit contributions
urban-0.521
uan-0.509
omon-0.497
ドラゴン-0.493
vin-0.478
Token so
Token ID523
Feature activation-9.74e-01

Pos logit contributions
urban+0.429
uan+0.422
omon+0.412
ドラゴン+0.405
vin+0.399
Neg logit contributions
 disclosing-0.409
 anomaly-0.394
 breach-0.392
 cous-0.384
 disg-0.383
Token-
Token ID12
Feature activation-1.61e-01

Pos logit contributions
 disclosing+0.693
 anomaly+0.668
 breach+0.662
 cous+0.654
 disg+0.650
Neg logit contributions
urban-0.728
uan-0.715
omon-0.696
ドラゴン-0.683
vin-0.675
Tokencalled
Token ID7174
Feature activation-6.06e-01

Pos logit contributions
 disclosing+0.117
 anomaly+0.113
 breach+0.112
 disg+0.111
 cous+0.111
Neg logit contributions
urban-0.119
uan-0.116
omon-0.113
ドラゴン-0.110
vin-0.110
Token white
Token ID2330
Feature activation+1.17e+00

Pos logit contributions
 disclosing+0.392
 anomaly+0.378
 breach+0.376
 cous+0.366
 disg+0.365
Neg logit contributions
urban-0.489
uan-0.483
omon-0.472
ドラゴン-0.465
vin-0.458
Token-
Token ID12
Feature activation+1.35e+00

Pos logit contributions
urban+0.903
uan+0.888
omon+0.866
ドラゴン+0.852
vin+0.841
Neg logit contributions
 disclosing-0.798
 anomaly-0.769
 breach-0.763
 cous-0.751
 disg-0.747
Tokencollar
Token ID37676
Feature activation+4.71e+00

Pos logit contributions
urban+1.011
uan+0.979
omon+0.951
vin+0.924
ドラゴン+0.913
Neg logit contributions
 disclosing-0.978
 anomaly-0.947
 cous-0.934
 disg-0.930
 breach-0.926
Token jobs
Token ID3946
Feature activation+1.93e+00

Pos logit contributions
urban+2.980
uan+2.927
omon+2.808
vin+2.727
ドラゴン+2.720
Neg logit contributions
 disclosing-3.806
 cous-3.669
 anomaly-3.651
 ANGEL-3.615
 disg-3.607
Token of
Token ID286
Feature activation+1.49e+00

Pos logit contributions
urban+1.398
uan+1.372
omon+1.333
ドラゴン+1.301
vin+1.293
Neg logit contributions
 disclosing-1.416
 anomaly-1.367
 breach-1.356
 cous-1.351
 disg-1.341
Token managers
Token ID11663
Feature activation+1.19e+00

Pos logit contributions
urban+1.085
uan+1.062
omon+1.032
ドラゴン+1.012
vin+1.001
Neg logit contributions
 disclosing-1.094
 anomaly-1.056
 breach-1.047
 cous-1.037
 disg-1.031
Token,
Token ID11
Feature activation+7.05e-02

Pos logit contributions
urban+0.879
uan+0.863
omon+0.839
ドラゴン+0.823
vin+0.815
Neg logit contributions
 disclosing-0.851
 anomaly-0.821
 breach-0.814
 cous-0.805
 disg-0.801
Token professionals
Token ID11153
Feature activation+1.30e+00

Pos logit contributions
urban+0.052
uan+0.051
omon+0.050
ドラゴン+0.049
vin+0.048
Neg logit contributions
 disclosing-0.050
 anomaly-0.049
 breach-0.048
 cous-0.048
 disg-0.047
Token along
Token ID1863
Feature activation-2.04e+00

Pos logit contributions
urban+0.217
uan+0.214
omon+0.207
ドラゴン+0.203
vin+0.201
Neg logit contributions
 disclosing-0.221
 anomaly-0.213
 breach-0.211
 cous-0.210
 ANGEL-0.208
Token he
Token ID339
Feature activation-4.50e-01

Pos logit contributions
 disclosing+1.378
 anomaly+1.329
 breach+1.318
 cous+1.295
 disg+1.294
Neg logit contributions
urban-1.596
uan-1.559
omon-1.527
ドラゴン-1.506
vin-1.478
Token walks
Token ID11114
Feature activation-1.21e+00

Pos logit contributions
 disclosing+0.325
 anomaly+0.314
 breach+0.312
 cous+0.307
 disg+0.307
Neg logit contributions
urban-0.332
uan-0.325
omon-0.316
ドラゴン-0.312
vin-0.306
Token,
Token ID11
Feature activation+8.25e-01

Pos logit contributions
 disclosing+0.855
 anomaly+0.821
 breach+0.810
 disg+0.803
 cous+0.797
Neg logit contributions
urban-0.928
uan-0.904
omon-0.883
ドラゴン-0.873
vin-0.855
Token the
Token ID262
Feature activation+6.79e-01

Pos logit contributions
urban+0.637
uan+0.625
omon+0.609
ドラゴン+0.599
vin+0.591
Neg logit contributions
 disclosing-0.570
 anomaly-0.548
 breach-0.543
 cous-0.536
 disg-0.534
Token greater
Token ID3744
Feature activation+4.71e+00

Pos logit contributions
urban+0.444
uan+0.436
omon+0.423
ドラゴン+0.414
vin+0.409
Neg logit contributions
 disclosing-0.543
 anomaly-0.525
 breach-0.521
 cous-0.516
 disg-0.513
Token the
Token ID262
Feature activation-8.88e-01

Pos logit contributions
urban+3.261
uan+3.222
omon+3.122
ドラゴン+3.021
vin+3.013
Neg logit contributions
 disclosing-3.490
 anomaly-3.342
 breach-3.325
 ANGEL-3.309
 cous-3.309
Token destruction
Token ID8166
Feature activation+2.05e-01

Pos logit contributions
 disclosing+0.581
 anomaly+0.560
 breach+0.557
 cous+0.544
 disg+0.543
Neg logit contributions
urban-0.717
uan-0.704
omon-0.687
ドラゴン-0.679
vin-0.668
Token and
Token ID290
Feature activation+4.43e-01

Pos logit contributions
urban+0.148
uan+0.145
omon+0.141
ドラゴン+0.138
vin+0.136
Neg logit contributions
 disclosing-0.151
 anomaly-0.146
 breach-0.145
 cous-0.143
 disg-0.142
Token injuries
Token ID6821
Feature activation+9.21e-01

Pos logit contributions
urban+0.350
uan+0.345
omon+0.336
ドラゴン+0.330
vin+0.326
Neg logit contributions
 disclosing-0.295
 anomaly-0.283
 breach-0.280
 cous-0.277
 disg-0.275
Token.
Token ID13
Feature activation+8.54e-01

Pos logit contributions
urban+0.640
uan+0.629
omon+0.611
ドラゴン+0.598
vin+0.590
Neg logit contributions
 disclosing-0.701
 anomaly-0.678
 breach-0.672
 cous-0.665
 disg-0.660
Tokenian
Token ID666
Feature activation+1.02e-01

Pos logit contributions
urban+1.450
uan+1.442
omon+1.387
vin+1.357
ドラゴン+1.343
Neg logit contributions
 disclosing-1.264
 anomaly-1.215
 disg-1.207
 breach-1.197
 cous-1.192
Token,
Token ID11
Feature activation+5.41e-01

Pos logit contributions
urban+0.074
uan+0.073
omon+0.070
ドラゴン+0.069
vin+0.069
Neg logit contributions
 disclosing-0.075
 anomaly-0.072
 breach-0.071
 cous-0.071
 disg-0.071
Token we
Token ID356
Feature activation-1.12e-01

Pos logit contributions
urban+0.348
uan+0.342
omon+0.329
ドラゴン+0.321
vin+0.319
Neg logit contributions
 disclosing-0.440
 anomaly-0.427
 disg-0.422
 breach-0.421
 cous-0.420
Token couldn
Token ID3521
Feature activation+9.18e-02

Pos logit contributions
 disclosing+0.081
 anomaly+0.078
 breach+0.078
 cous+0.076
 disg+0.076
Neg logit contributions
urban-0.082
uan-0.081
omon-0.079
ドラゴン-0.078
vin-0.076
Token't
Token ID470
Feature activation+2.02e+00

Pos logit contributions
urban+0.074
uan+0.073
omon+0.071
ドラゴン+0.070
vin+0.069
Neg logit contributions
 disclosing-0.060
 anomaly-0.058
 breach-0.057
 cous-0.056
 disg-0.056
Token resist
Token ID4180
Feature activation+4.69e+00

Pos logit contributions
urban+1.090
uan+1.063
omon+1.024
ドラゴン+0.993
vin+0.982
Neg logit contributions
 disclosing-1.853
 anomaly-1.799
 breach-1.782
 cous-1.775
 ANGEL-1.769
Token the
Token ID262
Feature activation+7.30e-01

Pos logit contributions
urban+3.170
uan+3.133
omon+2.996
ドラゴン+2.945
vin+2.922
Neg logit contributions
 disclosing-3.532
 anomaly-3.482
 cous-3.472
 disg-3.469
 ANGEL-3.418
Token temptation
Token ID29062
Feature activation+5.43e-01

Pos logit contributions
urban+0.468
uan+0.458
omon+0.443
ドラゴン+0.434
vin+0.428
Neg logit contributions
 disclosing-0.597
 anomaly-0.578
 breach-0.574
 cous-0.568
 disg-0.565
Token to
Token ID284
Feature activation-1.67e-01

Pos logit contributions
urban+0.321
uan+0.314
omon+0.303
ドラゴン+0.296
vin+0.292
Neg logit contributions
 disclosing-0.471
 anomaly-0.458
 breach-0.454
 cous-0.450
 disg-0.449
Token recall
Token ID10014
Feature activation+1.58e+00

Pos logit contributions
 disclosing+0.132
 anomaly+0.128
 breach+0.127
 cous+0.125
 disg+0.124
Neg logit contributions
urban-0.112
uan-0.110
omon-0.107
ドラゴン-0.105
vin-0.103
Token 1974
Token ID16489
Feature activation+1.09e+00

Pos logit contributions
urban+0.951
uan+0.930
omon+0.900
ドラゴン+0.879
vin+0.866
Neg logit contributions
 disclosing-1.358
 anomaly-1.319
 breach-1.308
 cous-1.292
 disg-1.286
Token virtual
Token ID7166
Feature activation+2.73e-01

Pos logit contributions
urban+0.063
uan+0.062
omon+0.061
ドラゴン+0.060
vin+0.059
Neg logit contributions
 disclosing-0.056
 anomaly-0.054
 breach-0.054
 cous-0.053
 disg-0.053
Token creatures
Token ID8109
Feature activation-2.49e-01

Pos logit contributions
urban+0.195
uan+0.192
omon+0.186
ドラゴン+0.183
vin+0.180
Neg logit contributions
 disclosing-0.204
 anomaly-0.197
 breach-0.195
 cous-0.193
 disg-0.192
Token.
Token ID13
Feature activation+6.20e-01

Pos logit contributions
 disclosing+0.186
 anomaly+0.179
 breach+0.178
 cous+0.175
 disg+0.174
Neg logit contributions
urban-0.179
uan-0.175
omon-0.171
ドラゴン-0.168
vin-0.165
Token (
Token ID357
Feature activation+1.27e+00

Pos logit contributions
urban+0.484
uan+0.476
omon+0.463
ドラゴン+0.456
vin+0.450
Neg logit contributions
 disclosing-0.420
 anomaly-0.404
 breach-0.401
 cous-0.395
 disg-0.393
TokenPhoto
Token ID6191
Feature activation+3.80e+00

Pos logit contributions
urban+0.743
uan+0.719
omon+0.687
ドラゴン+0.680
vin+0.674
Neg logit contributions
 disclosing-1.125
 anomaly-1.083
 disg-1.083
 cous-1.080
 breach-1.071
Token:
Token ID25
Feature activation+4.68e+00

Pos logit contributions
urban+2.561
uan+2.504
omon+2.387
vin+2.371
ドラゴン+2.360
Neg logit contributions
 disclosing-2.928
 disg-2.876
 cous-2.840
 anomaly-2.831
 breach-2.762
Token KY
Token ID33668
Feature activation+7.21e-01

Pos logit contributions
urban+3.495
uan+3.460
omon+3.350
ドラゴン+3.304
vin+3.272
Neg logit contributions
 disclosing-3.258
 anomaly-3.116
 cous-3.101
 breach-3.086
 disg-3.065
TokenLE
Token ID2538
Feature activation+1.66e+00

Pos logit contributions
urban+0.555
uan+0.547
omon+0.531
ドラゴン+0.518
vin+0.518
Neg logit contributions
 disclosing-0.499
 anomaly-0.479
 breach-0.475
 cous-0.469
 disg-0.467
Token GR
Token ID10863
Feature activation+2.25e+00

Pos logit contributions
urban+1.267
uan+1.249
omon+1.203
ドラゴン+1.188
vin+1.184
Neg logit contributions
 disclosing-1.151
 anomaly-1.107
 breach-1.093
 cous-1.084
 disg-1.082
TokenAN
Token ID1565
Feature activation-4.24e-01

Pos logit contributions
urban+1.676
uan+1.645
omon+1.602
ドラゴン+1.557
vin+1.552
Neg logit contributions
 disclosing-1.616
 anomaly-1.556
 breach-1.536
 cous-1.527
 disg-1.525
TokenTH
Token ID4221
Feature activation+6.50e-01

Pos logit contributions
 disclosing+0.284
 anomaly+0.277
 ANGEL+0.273
 breach+0.273
 cous+0.266
Neg logit contributions
urban-0.332
uan-0.327
omon-0.318
ドラゴン-0.313
vin-0.307
Token U
Token ID471
Feature activation+4.26e-02

Pos logit contributions
 disclosing+0.448
 anomaly+0.431
 breach+0.427
 cous+0.422
 disg+0.420
Neg logit contributions
urban-0.493
uan-0.485
omon-0.472
ドラゴン-0.463
vin-0.458
Token.
Token ID13
Feature activation+9.94e-01

Pos logit contributions
urban+0.030
uan+0.030
omon+0.029
ドラゴン+0.028
vin+0.028
Neg logit contributions
 disclosing-0.032
 anomaly-0.031
 breach-0.030
 disg-0.030
 cous-0.030
TokenS
Token ID50
Feature activation-6.55e-01

Pos logit contributions
urban+0.618
uan+0.608
omon+0.579
ドラゴン+0.574
vin+0.565
Neg logit contributions
 disclosing-0.836
 anomaly-0.808
 cous-0.803
 disg-0.800
 breach-0.799
Token.,
Token ID1539
Feature activation+7.18e-01

Pos logit contributions
 disclosing+0.432
 anomaly+0.415
 breach+0.412
 cous+0.405
 disg+0.403
Neg logit contributions
urban-0.524
uan-0.515
omon-0.502
ドラゴン-0.494
vin-0.488
Token by
Token ID416
Feature activation+1.92e-01

Pos logit contributions
urban+0.556
uan+0.550
omon+0.530
ドラゴン+0.520
vin+0.517
Neg logit contributions
 disclosing-0.485
 anomaly-0.466
 breach-0.462
 cous-0.459
 ANGEL-0.458
Token comparison
Token ID7208
Feature activation-5.60e+00

Pos logit contributions
urban+0.144
uan+0.141
omon+0.137
ドラゴン+0.134
vin+0.133
Neg logit contributions
 disclosing-0.136
 anomaly-0.131
 breach-0.130
 cous-0.129
 disg-0.128
Token,
Token ID11
Feature activation+3.26e-02

Pos logit contributions
 disclosing+3.612
 breach+3.488
 anomaly+3.482
 ANGEL+3.332
 cous+3.326
Neg logit contributions
urban-4.450
omon-4.339
uan-4.327
ドラゴン-4.263
-4.155
Token has
Token ID468
Feature activation-8.55e-02

Pos logit contributions
urban+0.024
uan+0.024
omon+0.023
ドラゴン+0.023
vin+0.022
Neg logit contributions
 disclosing-0.023
 anomaly-0.023
 breach-0.022
 cous-0.022
 ANGEL-0.022
Token 503
Token ID44541
Feature activation+7.19e-01

Pos logit contributions
 disclosing+0.061
 anomaly+0.058
 breach+0.058
 cous+0.057
 disg+0.057
Neg logit contributions
urban-0.064
uan-0.063
omon-0.061
ドラゴン-0.060
vin-0.059
Token airports
Token ID22600
Feature activation-4.83e-02

Pos logit contributions
urban+0.527
uan+0.518
omon+0.500
ドラゴン+0.493
vin+0.487
Neg logit contributions
 disclosing-0.522
 anomaly-0.503
 breach-0.499
 cous-0.493
 disg-0.491
Token that
Token ID326
Feature activation-1.11e+00

Pos logit contributions
 disclosing+0.035
 anomaly+0.034
 breach+0.033
 cous+0.033
 disg+0.033
Neg logit contributions
urban-0.035
uan-0.035
omon-0.034
ドラゴン-0.033
vin-0.033
Token ahead
Token ID4058
Feature activation-6.04e-01

Pos logit contributions
 disclosing+0.480
 anomaly+0.459
 breach+0.457
 cous+0.449
 disg+0.449
Neg logit contributions
urban-0.521
uan-0.508
omon-0.497
ドラゴン-0.490
vin-0.481
Token.
Token ID13
Feature activation-5.56e-01

Pos logit contributions
 disclosing+0.431
 anomaly+0.414
 breach+0.411
 cous+0.404
 disg+0.403
Neg logit contributions
urban-0.452
uan-0.442
omon-0.431
ドラゴン-0.426
vin-0.418
Token
Token ID198
Feature activation-2.66e-01

Pos logit contributions
 disclosing+0.403
 anomaly+0.389
 breach+0.387
 ANGEL+0.380
 cous+0.378
Neg logit contributions
urban-0.408
uan-0.398
omon-0.392
ドラゴン-0.383
vin-0.374
Token
Token ID198
Feature activation-2.80e-01

Pos logit contributions
 disclosing+0.198
 anomaly+0.191
 breach+0.189
 cous+0.188
 disg+0.187
Neg logit contributions
urban-0.191
uan-0.187
omon-0.182
ドラゴン-0.178
vin-0.177
TokenFollow
Token ID7155
Feature activation-1.84e-02

Pos logit contributions
 disclosing+0.211
 anomaly+0.203
 breach+0.202
 cous+0.199
 disg+0.198
Neg logit contributions
urban-0.197
uan-0.193
omon-0.188
ドラゴン-0.184
vin-0.181
Token HL
Token ID38312
Feature activation-5.51e+00

Pos logit contributions
 disclosing+0.012
 anomaly+0.012
 breach+0.012
 cous+0.011
 disg+0.011
Neg logit contributions
urban-0.015
uan-0.014
omon-0.014
ドラゴン-0.014
vin-0.014
TokenTV
Token ID6849
Feature activation-2.32e+00

Pos logit contributions
 disclosing+5.145
 anomaly+5.070
 breach+5.010
 disg+4.896
 ANGEL+4.883
Neg logit contributions
urban-2.761
uan-2.701
omon-2.613
ドラゴン-2.594
-2.513
Token.
Token ID13
Feature activation-9.23e-01

Pos logit contributions
 disclosing+1.650
 anomaly+1.598
 breach+1.582
 cous+1.549
 disg+1.547
Neg logit contributions
urban-1.737
uan-1.699
omon-1.662
ドラゴン-1.641
vin-1.605
Tokenorg
Token ID2398
Feature activation-1.68e+00

Pos logit contributions
 disclosing+0.799
 anomaly+0.775
 breach+0.769
 cous+0.763
 disg+0.760
Neg logit contributions
urban-0.549
uan-0.537
omon-0.517
ドラゴン-0.505
vin-0.499
Token's
Token ID338
Feature activation-1.40e-01

Pos logit contributions
 disclosing+1.199
 anomaly+1.158
 breach+1.147
 ANGEL+1.120
 disg+1.119
Neg logit contributions
urban-1.246
uan-1.215
omon-1.190
ドラゴン-1.177
vin-1.152
Token @
Token ID2488
Feature activation-9.27e-01

Pos logit contributions
 disclosing+0.093
 anomaly+0.089
 breach+0.088
 cous+0.087
 disg+0.087
Neg logit contributions
urban-0.111
uan-0.109
omon-0.106
ドラゴン-0.105
vin-0.103
Token $
Token ID720
Feature activation-1.47e+00

Pos logit contributions
urban+0.427
uan+0.421
omon+0.410
ドラゴン+0.403
vin+0.398
Neg logit contributions
 disclosing-0.365
 anomaly-0.351
 breach-0.348
 cous-0.344
 disg-0.342
Token32
Token ID2624
Feature activation-1.07e-01

Pos logit contributions
 disclosing+1.132
 anomaly+1.095
 breach+1.087
 cous+1.073
 disg+1.067
Neg logit contributions
urban-1.008
uan-0.987
omon-0.960
ドラゴン-0.940
vin-0.927
Token,
Token ID11
Feature activation-5.01e-01

Pos logit contributions
 disclosing+0.076
 anomaly+0.074
 breach+0.073
 cous+0.072
 disg+0.072
Neg logit contributions
urban-0.080
uan-0.078
omon-0.076
ドラゴン-0.075
vin-0.074
Token for
Token ID329
Feature activation-6.05e-02

Pos logit contributions
 disclosing+0.369
 anomaly+0.356
 breach+0.353
 cous+0.348
 disg+0.347
Neg logit contributions
urban-0.364
uan-0.356
omon-0.347
ドラゴン-0.341
vin-0.336
Token that
Token ID326
Feature activation-1.35e+00

Pos logit contributions
 disclosing+0.040
 anomaly+0.039
 breach+0.038
 cous+0.038
 disg+0.037
Neg logit contributions
urban-0.048
uan-0.047
omon-0.046
ドラゴン-0.045
vin-0.045
Token matter
Token ID2300
Feature activation-5.40e+00

Pos logit contributions
 disclosing+0.750
 anomaly+0.722
 breach+0.710
 cous+0.692
 disg+0.686
Neg logit contributions
urban-1.229
uan-1.205
omon-1.185
ドラゴン-1.169
vin-1.157
Token).
Token ID737
Feature activation-1.03e+00

Pos logit contributions
 disclosing+3.253
 anomaly+3.140
 breach+3.068
 cous+2.979
 disg+2.958
Neg logit contributions
urban-4.539
omon-4.354
uan-4.347
ドラゴン-4.269
-4.215
Token But
Token ID887
Feature activation+3.43e-02

Pos logit contributions
 disclosing+0.708
 anomaly+0.687
 breach+0.676
 ANGEL+0.671
 cous+0.663
Neg logit contributions
urban-0.792
uan-0.773
omon-0.762
ドラゴン-0.743
vin-0.731
Token right
Token ID826
Feature activation+3.39e-01

Pos logit contributions
urban+0.025
uan+0.025
omon+0.024
ドラゴン+0.024
vin+0.023
Neg logit contributions
 disclosing-0.025
 anomaly-0.024
 breach-0.024
 cous-0.023
 disg-0.023
Token now
Token ID783
Feature activation+2.25e-01

Pos logit contributions
urban+0.222
uan+0.218
omon+0.211
ドラゴン+0.206
vin+0.204
Neg logit contributions
 disclosing-0.273
 anomaly-0.264
 breach-0.262
 cous-0.260
 disg-0.258
Token,
Token ID11
Feature activation-2.51e-01

Pos logit contributions
urban+0.169
uan+0.165
omon+0.162
ドラゴン+0.159
vin+0.157
Neg logit contributions
 disclosing-0.160
 anomaly-0.154
 breach-0.153
 cous-0.150
 disg-0.150
Token Nation
Token ID8741
Feature activation-1.05e+00

Pos logit contributions
 disclosing+0.018
 anomaly+0.018
 cous+0.017
 breach+0.017
 disg+0.017
Neg logit contributions
urban-0.030
uan-0.030
omon-0.029
vin-0.028
ドラゴン-0.028
Token Be
Token ID1355
Feature activation-1.21e+00

Pos logit contributions
 disclosing+0.709
 anomaly+0.681
 breach+0.675
 ANGEL+0.664
 cous+0.661
Neg logit contributions
urban-0.821
uan-0.811
omon-0.790
ドラゴン-0.781
vin-0.767
Token the
Token ID262
Feature activation-1.10e+00

Pos logit contributions
 disclosing+0.821
 anomaly+0.786
 breach+0.784
 cous+0.768
 disg+0.763
Neg logit contributions
urban-0.948
uan-0.934
omon-0.909
ドラゴン-0.897
vin-0.879
Token first
Token ID717
Feature activation-5.81e-01

Pos logit contributions
 disclosing+0.687
 anomaly+0.657
 breach+0.650
 disg+0.633
 cous+0.631
Neg logit contributions
urban-0.927
uan-0.907
omon-0.888
ドラゴン-0.879
vin-0.866
Token to
Token ID284
Feature activation-2.34e+00

Pos logit contributions
 disclosing+0.507
 anomaly+0.493
 breach+0.490
 cous+0.486
 disg+0.484
Neg logit contributions
urban-0.340
uan-0.332
omon-0.320
ドラゴン-0.311
vin-0.308
Token hear
Token ID3285
Feature activation-5.37e+00

Pos logit contributions
 disclosing+1.541
 anomaly+1.486
 breach+1.462
 cous+1.456
 ANGEL+1.446
Neg logit contributions
urban-1.865
uan-1.835
omon-1.785
ドラゴン-1.749
vin-1.734
Token about
Token ID546
Feature activation-1.72e+00

Pos logit contributions
 disclosing+3.179
 breach+2.979
 anomaly+2.965
 cous+2.847
 ANGEL+2.835
Neg logit contributions
urban-4.626
uan-4.624
omon-4.496
ドラゴン-4.438
-4.347
Token Nation
Token ID8741
Feature activation-2.00e+00

Pos logit contributions
 disclosing+0.976
 anomaly+0.932
 disg+0.918
 breach+0.917
 cous+0.912
Neg logit contributions
urban-1.554
uan-1.516
omon-1.458
ドラゴン-1.446
vin-1.445
Token Travels
Token ID48899
Feature activation+2.59e+00

Pos logit contributions
 disclosing+1.648
 breach+1.603
 anomaly+1.601
 disg+1.560
 ANGEL+1.553
Neg logit contributions
urban-1.238
uan-1.228
omon-1.198
ドラゴン-1.189
vin-1.139
Token destinations
Token ID23982
Feature activation+3.20e-01

Pos logit contributions
urban+1.711
uan+1.676
omon+1.630
ervation+1.576
ドラゴン+1.568
Neg logit contributions
 disclosing-2.021
 disg-1.952
 anomaly-1.945
 breach-1.935
 ANGEL-1.933
Token,
Token ID11
Feature activation-1.38e+00

Pos logit contributions
urban+0.229
uan+0.224
omon+0.217
ドラゴン+0.213
vin+0.211
Neg logit contributions
 disclosing-0.239
 anomaly-0.230
 breach-0.229
 cous-0.228
 disg-0.228
Token.
Token ID13
Feature activation-3.06e+00

Pos logit contributions
 disclosing+0.506
 anomaly+0.487
 breach+0.482
 disg+0.480
 cous+0.480
Neg logit contributions
urban-0.486
uan-0.478
omon-0.462
ドラゴン-0.453
vin-0.450
Tokenparse
Token ID29572
Feature activation-2.97e+00

Pos logit contributions
 disclosing+2.211
 anomaly+2.137
 breach+2.123
 cous+2.089
 disg+2.072
Neg logit contributions
urban-2.224
uan-2.189
omon-2.130
ドラゴン-2.107
vin-2.050
Token_
Token ID62
Feature activation-3.02e+00

Pos logit contributions
 disclosing+1.742
 anomaly+1.677
 breach+1.676
 ANGEL+1.607
 cous+1.606
Neg logit contributions
urban-2.536
uan-2.490
omon-2.466
ドラゴン-2.414
vin-2.371
Tokenargs
Token ID22046
Feature activation-2.90e+00

Pos logit contributions
 disclosing+1.906
 breach+1.831
 anomaly+1.823
 cous+1.792
 ANGEL+1.762
Neg logit contributions
urban-2.442
uan-2.401
omon-2.364
ドラゴン-2.332
-2.261
Token())
Token ID28955
Feature activation-1.41e+00

Pos logit contributions
 disclosing+2.223
 anomaly+2.161
 breach+2.158
 cous+2.100
 ANGEL+2.086
Neg logit contributions
urban-1.980
uan-1.911
omon-1.873
ドラゴン-1.835
vin-1.791
Token #
Token ID1303
Feature activation-5.33e+00

Pos logit contributions
 disclosing+0.742
 anomaly+0.718
 breach+0.710
 disg+0.690
 cous+0.687
Neg logit contributions
urban-1.313
uan-1.296
omon-1.265
ドラゴン-1.255
vin-1.238
Token initialize
Token ID41216
Feature activation-2.16e-01

Pos logit contributions
 disclosing+3.279
 anomaly+3.217
 breach+3.214
 disg+3.125
 ANGEL+3.121
Neg logit contributions
urban-4.380
uan-4.260
omon-4.195
ドラゴン-4.157
vin-4.108
Token a
Token ID257
Feature activation-4.59e-01

Pos logit contributions
 disclosing+0.128
 anomaly+0.122
 disg+0.121
 cous+0.121
 breach+0.120
Neg logit contributions
urban-0.187
uan-0.185
omon-0.179
ドラゴン-0.177
vin-0.176
Token rectangular
Token ID36954
Feature activation-1.16e+00

Pos logit contributions
 disclosing+0.363
 anomaly+0.352
 breach+0.349
 cous+0.345
 disg+0.343
Neg logit contributions
urban-0.307
uan-0.301
omon-0.292
ドラゴン-0.287
vin-0.282
Token and
Token ID290
Feature activation-7.00e-02

Pos logit contributions
 disclosing+0.798
 anomaly+0.768
 breach+0.761
 cous+0.754
 disg+0.750
Neg logit contributions
urban-0.898
uan-0.882
omon-0.858
ドラゴン-0.844
vin-0.834
Token square
Token ID6616
Feature activation-1.56e+00

Pos logit contributions
 disclosing+0.050
 anomaly+0.048
 ANGEL+0.048
 disg+0.048
 cous+0.048
Neg logit contributions
urban-0.052
uan-0.051
omon-0.049
ドラゴン-0.047
vin-0.047
TokenOR
Token ID1581
Feature activation-9.40e-01

Pos logit contributions
 disclosing+1.203
 anomaly+1.154
 breach+1.143
 cous+1.124
 disg+1.117
Neg logit contributions
urban-1.616
uan-1.590
omon-1.553
ドラゴン-1.528
vin-1.511
TokenPH
Token ID11909
Feature activation-1.49e+00

Pos logit contributions
 disclosing+0.522
 anomaly+0.498
 breach+0.491
 cous+0.488
 disg+0.487
Neg logit contributions
urban-0.852
uan-0.839
omon-0.819
ドラゴン-0.810
vin-0.801
Token_
Token ID62
Feature activation-2.40e+00

Pos logit contributions
 disclosing+0.924
 anomaly+0.888
 breach+0.881
 cous+0.860
 ANGEL+0.860
Neg logit contributions
urban-1.244
uan-1.222
omon-1.200
ドラゴン-1.177
vin-1.163
TokenRECT
Token ID23988
Feature activation-2.71e+00

Pos logit contributions
 disclosing+2.208
 anomaly+2.146
 breach+2.136
 ANGEL+2.113
 cous+2.102
Neg logit contributions
urban-1.271
uan-1.246
omon-1.205
ドラゴン-1.173
vin-1.144
Token,
Token ID11
Feature activation-4.08e+00

Pos logit contributions
 disclosing+1.813
 anomaly+1.755
 breach+1.750
 ANGEL+1.708
 cous+1.676
Neg logit contributions
urban-2.115
uan-2.065
omon-2.048
ドラゴン-1.992
vin-1.959
Token (
Token ID357
Feature activation-5.28e+00

Pos logit contributions
 disclosing+2.426
 anomaly+2.401
 breach+2.349
 disg+2.265
 cous+2.263
Neg logit contributions
urban-3.468
uan-3.414
omon-3.339
ドラゴン-3.266
-3.241
Token13
Token ID1485
Feature activation-2.62e+00

Pos logit contributions
 disclosing+2.970
 breach+2.886
 anomaly+2.869
 cous+2.788
 ANGEL+2.747
Neg logit contributions
urban-4.499
uan-4.453
omon-4.338
ドラゴン-4.288
-4.208
Token,
Token ID11
Feature activation-3.17e+00

Pos logit contributions
 disclosing+1.771
 breach+1.710
 anomaly+1.708
 ANGEL+1.659
 cous+1.647
Neg logit contributions
urban-2.020
uan-1.975
omon-1.948
ドラゴン-1.906
vin-1.870
Token 5
Token ID642
Feature activation-1.58e+00

Pos logit contributions
 disclosing+2.116
 breach+2.046
 anomaly+2.023
 ANGEL+1.964
 cous+1.945
Neg logit contributions
urban-2.493
uan-2.438
omon-2.407
ドラゴン-2.316
vin-2.298
Token))
Token ID4008
Feature activation-2.93e+00

Pos logit contributions
 disclosing+0.716
 anomaly+0.679
 breach+0.671
 cous+0.650
 disg+0.643
Neg logit contributions
urban-1.592
uan-1.567
omon-1.536
ドラゴン-1.519
vin-1.502
Token sq
Token ID19862
Feature activation-3.45e+00

Pos logit contributions
 disclosing+1.593
 anomaly+1.531
 breach+1.514
 cous+1.480
 disg+1.466
Neg logit contributions
urban-2.677
uan-2.641
omon-2.586
ドラゴン-2.558
vin-2.515
Token defendants
Token ID16757
Feature activation-1.19e+00

Pos logit contributions
 disclosing+0.649
 anomaly+0.625
 breach+0.620
 cous+0.611
 disg+0.608
Neg logit contributions
urban-0.697
uan-0.684
omon-0.666
ドラゴン-0.655
vin-0.646
Token had
Token ID550
Feature activation-3.60e-01

Pos logit contributions
 disclosing+0.835
 anomaly+0.805
 breach+0.797
 cous+0.787
 disg+0.783
Neg logit contributions
urban-0.906
uan-0.890
omon-0.866
ドラゴン-0.851
vin-0.841
Token been
Token ID587
Feature activation-4.67e-01

Pos logit contributions
 disclosing+0.254
 anomaly+0.245
 breach+0.243
 cous+0.240
 disg+0.238
Neg logit contributions
urban-0.271
uan-0.267
omon-0.259
ドラゴン-0.255
vin-0.252
Token arrested
Token ID5169
Feature activation-2.42e-01

Pos logit contributions
 disclosing+0.321
 anomaly+0.309
 breach+0.307
 cous+0.302
 disg+0.301
Neg logit contributions
urban-0.360
uan-0.353
omon-0.344
ドラゴン-0.338
vin-0.334
Token on
Token ID319
Feature activation-1.65e+00

Pos logit contributions
 disclosing+0.177
 anomaly+0.172
 breach+0.170
 cous+0.169
 ANGEL+0.167
Neg logit contributions
urban-0.175
uan-0.173
omon-0.167
ドラゴン-0.164
vin-0.162
Token charges
Token ID4530
Feature activation-5.23e+00

Pos logit contributions
 disclosing+1.099
 anomaly+1.054
 breach+1.053
 cous+1.028
 disg+1.021
Neg logit contributions
urban-1.314
uan-1.287
omon-1.260
ドラゴン-1.242
vin-1.222
Token of
Token ID286
Feature activation-3.25e-01

Pos logit contributions
 disclosing+3.531
 breach+3.350
 anomaly+3.296
 cous+3.223
 disg+3.212
Neg logit contributions
urban-4.178
uan-4.043
omon-4.028
ドラゴン-3.953
vin-3.871
Token aiding
Token ID37221
Feature activation-2.02e+00

Pos logit contributions
 disclosing+0.216
 anomaly+0.210
 breach+0.206
 ANGEL+0.206
 cous+0.205
Neg logit contributions
urban-0.255
uan-0.250
omon-0.244
ドラゴン-0.239
vin-0.237
Token the
Token ID262
Feature activation-9.31e-01

Pos logit contributions
 disclosing+1.349
 anomaly+1.292
 breach+1.287
 cous+1.266
 disg+1.255
Neg logit contributions
urban-1.600
uan-1.569
omon-1.533
ドラゴン-1.511
vin-1.488
Token PKK
Token ID35176
Feature activation-6.19e-02

Pos logit contributions
 disclosing+0.593
 anomaly+0.569
 breach+0.565
 cous+0.555
 disg+0.551
Neg logit contributions
urban-0.767
uan-0.754
omon-0.736
ドラゴン-0.725
vin-0.715
Token by
Token ID416
Feature activation-5.67e-01

Pos logit contributions
 disclosing+0.045
 anomaly+0.043
 breach+0.043
 cous+0.043
 disg+0.042
Neg logit contributions
urban-0.045
uan-0.045
omon-0.043
ドラゴン-0.042
vin-0.042
Token Madison
Token ID14909
Feature activation-9.68e-01

Pos logit contributions
 disclosing+0.177
 anomaly+0.170
 breach+0.169
 cous+0.167
 disg+0.166
Neg logit contributions
urban-0.178
uan-0.175
omon-0.170
ドラゴン-0.167
vin-0.165
Token St
Token ID520
Feature activation-7.17e-01

Pos logit contributions
 disclosing+0.830
 anomaly+0.805
 breach+0.800
 cous+0.789
 ANGEL+0.786
Neg logit contributions
urban-0.582
uan-0.569
omon-0.553
ドラゴン-0.544
vin-0.527
Token.
Token ID13
Feature activation-1.90e+00

Pos logit contributions
 disclosing+0.404
 anomaly+0.387
 breach+0.382
 cous+0.373
 ANGEL+0.373
Neg logit contributions
urban-0.637
uan-0.630
omon-0.616
ドラゴン-0.609
vin-0.601
Token Baltimore
Token ID10346
Feature activation-2.49e+00

Pos logit contributions
 disclosing+1.231
 anomaly+1.186
 breach+1.173
 ANGEL+1.160
 cous+1.142
Neg logit contributions
urban-1.535
uan-1.513
omon-1.484
ドラゴン-1.456
vin-1.440
Token,
Token ID11
Feature activation-3.03e+00

Pos logit contributions
 disclosing+1.728
 anomaly+1.672
 breach+1.655
 ANGEL+1.625
 cous+1.623
Neg logit contributions
urban-1.891
uan-1.844
omon-1.818
ドラゴン-1.791
-1.749
Token MD
Token ID10670
Feature activation-5.23e+00

Pos logit contributions
 disclosing+2.236
 anomaly+2.163
 breach+2.152
 cous+2.105
 ANGEL+2.099
Neg logit contributions
urban-2.160
uan-2.121
omon-2.083
ドラゴン-2.036
vin-1.993
Token,
Token ID11
Feature activation-2.54e+00

Pos logit contributions
 disclosing+2.594
 anomaly+2.495
 breach+2.436
 cous+2.356
 disg+2.291
Neg logit contributions
urban-4.929
uan-4.852
omon-4.824
ドラゴン-4.726
-4.682
Token 212
Token ID23679
Feature activation-3.51e+00

Pos logit contributions
 disclosing+1.749
 anomaly+1.686
 breach+1.671
 cous+1.634
 ANGEL+1.629
Neg logit contributions
urban-1.954
uan-1.922
omon-1.884
ドラゴン-1.846
vin-1.820
Token01
Token ID486
Feature activation+7.14e-03

Pos logit contributions
 disclosing+2.714
 anomaly+2.631
 breach+2.620
 ANGEL+2.571
 cous+2.542
Neg logit contributions
urban-2.372
uan-2.329
omon-2.288
ドラゴン-2.210
vin-2.175
Token.
Token ID13
Feature activation-1.61e+00

Pos logit contributions
urban+0.005
uan+0.005
omon+0.005
ドラゴン+0.005
vin+0.005
Neg logit contributions
 disclosing-0.005
 anomaly-0.005
 breach-0.005
 cous-0.005
 disg-0.005
Token
Token ID198
Feature activation-6.93e-01

Pos logit contributions
 disclosing+1.173
 anomaly+1.129
 breach+1.122
 cous+1.102
 ANGEL+1.100
Neg logit contributions
urban-1.182
uan-1.160
omon-1.133
ドラゴン-1.107
vin-1.092
Token (
Token ID357
Feature activation-4.98e+00

Pos logit contributions
 disclosing+2.347
 anomaly+2.291
 breach+2.276
 ANGEL+2.196
 cous+2.165
Neg logit contributions
urban-3.162
uan-3.104
omon-3.066
ドラゴン-2.993
vin-2.944
Token21
Token ID2481
Feature activation-1.88e+00

Pos logit contributions
 disclosing+3.683
 breach+3.591
 anomaly+3.582
 cous+3.512
 ANGEL+3.424
Neg logit contributions
urban-3.324
uan-3.299
omon-3.208
ドラゴン-3.159
vin-3.030
Token,
Token ID11
Feature activation-2.22e+00

Pos logit contributions
 disclosing+1.311
 anomaly+1.264
 breach+1.259
 cous+1.228
 ANGEL+1.228
Neg logit contributions
urban-1.426
uan-1.397
omon-1.371
ドラゴン-1.344
vin-1.322
Token 21
Token ID2310
Feature activation-1.24e+00

Pos logit contributions
 disclosing+1.533
 anomaly+1.484
 breach+1.478
 ANGEL+1.438
 cous+1.438
Neg logit contributions
urban-1.696
uan-1.658
omon-1.629
ドラゴン-1.595
vin-1.569
Token))
Token ID4008
Feature activation-2.26e+00

Pos logit contributions
 disclosing+0.549
 anomaly+0.521
 breach+0.515
 cous+0.494
 ANGEL+0.492
Neg logit contributions
urban-1.263
uan-1.242
omon-1.223
ドラゴン-1.204
vin-1.191
Token #
Token ID1303
Feature activation-5.14e+00

Pos logit contributions
 disclosing+1.158
 anomaly+1.118
 breach+1.103
 disg+1.073
 cous+1.067
Neg logit contributions
urban-2.128
uan-2.107
omon-2.054
ドラゴン-2.039
-2.014
Token loop
Token ID9052
Feature activation+1.09e+00

Pos logit contributions
 disclosing+3.325
 anomaly+3.280
 breach+3.274
 disg+3.133
 ANGEL+3.102
Neg logit contributions
urban-4.060
uan-3.964
ドラゴン-3.891
omon-3.845
-3.812
Token over
Token ID625
Feature activation-1.86e+00

Pos logit contributions
urban+0.791
uan+0.782
omon+0.753
ドラゴン+0.735
vin+0.728
Neg logit contributions
 disclosing-0.787
 anomaly-0.759
 cous-0.756
 disg-0.755
 ANGEL-0.752
Token the
Token ID262
Feature activation-1.17e-01

Pos logit contributions
 disclosing+1.265
 anomaly+1.219
 breach+1.211
 cous+1.184
 ANGEL+1.176
Neg logit contributions
urban-1.455
uan-1.427
omon-1.396
ドラゴン-1.370
vin-1.351
Token input
Token ID5128
Feature activation-1.57e+00

Pos logit contributions
 disclosing+0.113
 anomaly+0.109
 cous+0.109
 breach+0.109
 disg+0.108
Neg logit contributions
urban-0.057
uan-0.056
omon-0.053
ドラゴン-0.051
vin-0.051
Token image
Token ID2939
Feature activation-8.89e-02

Pos logit contributions
 disclosing+1.071
 anomaly+1.031
 breach+1.022
 cous+1.008
 disg+1.002
Neg logit contributions
urban-1.221
uan-1.200
omon-1.169
ドラゴン-1.149
vin-1.135
Tokenterm
Token ID4354
Feature activation+1.04e+00

Pos logit contributions
urban+0.222
uan+0.218
omon+0.210
ドラゴン+0.205
vin+0.204
Neg logit contributions
 disclosing-0.256
 anomaly-0.248
 breach-0.246
 cous-0.243
 disg-0.243
Token relationship
Token ID2776
Feature activation-1.37e-01

Pos logit contributions
urban+0.934
uan+0.921
omon+0.899
ドラゴン+0.884
vin+0.877
Neg logit contributions
 disclosing-0.574
 anomaly-0.546
 breach-0.538
 cous-0.534
 disg-0.532
Token has
Token ID468
Feature activation-8.25e-02

Pos logit contributions
 disclosing+0.104
 anomaly+0.101
 breach+0.100
 cous+0.099
 disg+0.098
Neg logit contributions
urban-0.096
uan-0.094
omon-0.091
ドラゴン-0.089
vin-0.088
Token reached
Token ID4251
Feature activation-4.06e-01

Pos logit contributions
 disclosing+0.059
 anomaly+0.057
 breach+0.057
 cous+0.056
 disg+0.056
Neg logit contributions
urban-0.061
uan-0.060
omon-0.058
ドラゴン-0.057
vin-0.056
Token that
Token ID326
Feature activation-8.89e-01

Pos logit contributions
 disclosing+0.266
 anomaly+0.256
 breach+0.253
 cous+0.250
 disg+0.248
Neg logit contributions
urban-0.326
uan-0.321
omon-0.313
ドラゴン-0.308
vin-0.304
Token point
Token ID966
Feature activation-5.04e+00

Pos logit contributions
 disclosing+0.612
 anomaly+0.591
 breach+0.586
 cous+0.577
 disg+0.573
Neg logit contributions
urban-0.685
uan-0.672
omon-0.656
ドラゴン-0.645
vin-0.636
Token where
Token ID810
Feature activation-4.56e-01

Pos logit contributions
 disclosing+3.245
 anomaly+3.115
 breach+3.108
 cous+3.012
 ANGEL+2.997
Neg logit contributions
urban-4.076
uan-3.949
omon-3.918
ドラゴン-3.852
vin-3.774
Token,
Token ID11
Feature activation+6.62e-01

Pos logit contributions
 disclosing+0.308
 anomaly+0.296
 breach+0.293
 cous+0.289
 disg+0.287
Neg logit contributions
urban-0.358
uan-0.352
omon-0.343
ドラゴン-0.337
vin-0.333
Token as
Token ID355
Feature activation+8.18e-01

Pos logit contributions
urban+0.491
uan+0.482
omon+0.469
ドラゴン+0.460
vin+0.455
Neg logit contributions
 disclosing-0.474
 anomaly-0.457
 breach-0.453
 cous-0.447
 disg-0.445
Token Diana
Token ID25752
Feature activation-2.16e+00

Pos logit contributions
urban+0.642
uan+0.632
omon+0.613
ドラゴン+0.603
vin+0.599
Neg logit contributions
 disclosing-0.544
 anomaly-0.523
 breach-0.520
 disg-0.511
 cous-0.511
Token has
Token ID468
Feature activation-1.67e+00

Pos logit contributions
 disclosing+1.593
 anomaly+1.531
 breach+1.520
 cous+1.509
 ANGEL+1.491
Neg logit contributions
urban-1.560
uan-1.512
omon-1.475
ドラゴン-1.463
vin-1.421
Token effectively
Token ID6840
Feature activation-2.15e+00

Pos logit contributions
 disclosing+1.538
 anomaly+1.463
 breach+1.459
 disg+1.425
 cous+1.409
Neg logit contributions
urban-1.730
uan-1.690
omon-1.663
ドラゴン-1.638
-1.607
Token.
Token ID13
Feature activation-2.29e-01

Pos logit contributions
 disclosing+1.495
 anomaly+1.422
 breach+1.413
 disg+1.385
 cous+1.374
Neg logit contributions
urban-1.653
uan-1.617
omon-1.587
ドラゴン-1.565
-1.536
Token He
Token ID679
Feature activation-7.35e-01

Pos logit contributions
 disclosing+0.158
 anomaly+0.152
 breach+0.151
 cous+0.149
 disg+0.148
Neg logit contributions
urban-0.176
uan-0.173
omon-0.168
ドラゴン-0.165
vin-0.163
Token reached
Token ID4251
Feature activation-1.39e+00

Pos logit contributions
 disclosing+0.516
 anomaly+0.498
 breach+0.493
 cous+0.487
 disg+0.484
Neg logit contributions
urban-0.555
uan-0.545
omon-0.531
ドラゴン-0.522
vin-0.515
Token that
Token ID326
Feature activation-1.60e+00

Pos logit contributions
 disclosing+0.941
 anomaly+0.905
 breach+0.897
 cous+0.884
 disg+0.879
Neg logit contributions
urban-1.095
uan-1.076
omon-1.049
ドラゴン-1.031
vin-1.018
Token point
Token ID966
Feature activation-5.03e+00

Pos logit contributions
 disclosing+1.093
 anomaly+1.054
 breach+1.045
 cous+1.027
 disg+1.022
Neg logit contributions
urban-1.243
uan-1.220
omon-1.190
ドラゴン-1.170
vin-1.154
Token last
Token ID938
Feature activation+8.08e-01

Pos logit contributions
 disclosing+3.582
 anomaly+3.394
 breach+3.385
 disg+3.267
 cous+3.265
Neg logit contributions
urban-3.818
uan-3.689
omon-3.669
ドラゴン-3.601
-3.495
Token year
Token ID614
Feature activation+5.61e-01

Pos logit contributions
urban+0.565
uan+0.555
omon+0.536
ドラゴン+0.527
vin+0.521
Neg logit contributions
 disclosing-0.609
 anomaly-0.587
 breach-0.582
 cous-0.579
 disg-0.576
Token after
Token ID706
Feature activation+1.67e+00

Pos logit contributions
urban+0.398
uan+0.391
omon+0.379
ドラゴン+0.372
vin+0.369
Neg logit contributions
 disclosing-0.417
 anomaly-0.404
 breach-0.401
 cous-0.397
 disg-0.394
Token three
Token ID1115
Feature activation+5.99e-01

Pos logit contributions
urban+1.295
uan+1.277
omon+1.240
ドラゴン+1.214
vin+1.209
Neg logit contributions
 disclosing-1.117
 anomaly-1.086
 breach-1.071
 cous-1.068
 ANGEL-1.057
Token decades
Token ID4647
Feature activation+1.69e+00

Pos logit contributions
urban+0.439
uan+0.432
omon+0.419
ドラゴン+0.411
vin+0.406
Neg logit contributions
 disclosing-0.433
 anomaly-0.417
 breach-0.413
 cous-0.410
 disg-0.407
Token her
Token ID607
Feature activation+4.23e-01

Pos logit contributions
 disclosing+0.690
 anomaly+0.655
 breach+0.647
 cous+0.635
 disg+0.630
Neg logit contributions
urban-1.294
uan-1.276
omon-1.249
ドラゴン-1.232
vin-1.220
Token shoulders
Token ID12450
Feature activation-1.77e-01

Pos logit contributions
urban+0.306
uan+0.300
omon+0.292
ドラゴン+0.285
vin+0.283
Neg logit contributions
 disclosing-0.310
 anomaly-0.299
 breach-0.296
 cous-0.293
 ANGEL-0.291
Token,
Token ID11
Feature activation-5.49e-01

Pos logit contributions
 disclosing+0.125
 anomaly+0.121
 breach+0.120
 cous+0.118
 disg+0.118
Neg logit contributions
urban-0.133
uan-0.130
omon-0.127
ドラゴン-0.124
vin-0.123
Token a
Token ID257
Feature activation+2.54e-01

Pos logit contributions
 disclosing+0.358
 anomaly+0.344
 breach+0.341
 cous+0.336
 disg+0.334
Neg logit contributions
urban-0.442
uan-0.435
omon-0.424
ドラゴン-0.417
vin-0.412
Token to
Token ID284
Feature activation+7.11e-01

Pos logit contributions
urban+0.199
uan+0.196
omon+0.191
ドラゴン+0.188
vin+0.185
Neg logit contributions
 disclosing-0.171
 anomaly-0.165
 breach-0.164
 cous-0.161
 disg-0.160
Tokente
Token ID660
Feature activation-4.99e+00

Pos logit contributions
urban+0.368
uan+0.359
omon+0.344
ドラゴン+0.338
vin+0.329
Neg logit contributions
 disclosing-0.668
 anomaly-0.650
 breach-0.647
 cous-0.640
 disg-0.639
Token bag
Token ID6131
Feature activation-3.76e-01

Pos logit contributions
 disclosing+3.325
 anomaly+3.176
 breach+3.166
 cous+3.127
 disg+3.121
Neg logit contributions
urban-3.918
uan-3.855
ドラゴン-3.791
omon-3.783
-3.674
Token hanging
Token ID10938
Feature activation-5.74e-01

Pos logit contributions
 disclosing+0.264
 anomaly+0.255
 breach+0.253
 cous+0.249
 ANGEL+0.248
Neg logit contributions
urban-0.284
uan-0.279
omon-0.271
ドラゴン-0.265
vin-0.264
Token at
Token ID379
Feature activation+6.61e-01

Pos logit contributions
 disclosing+0.421
 anomaly+0.406
 breach+0.403
 cous+0.398
 disg+0.395
Neg logit contributions
urban-0.416
uan-0.409
omon-0.397
ドラゴン-0.389
vin-0.385
Token her
Token ID607
Feature activation-1.30e-01

Pos logit contributions
urban+0.605
uan+0.597
omon+0.585
ドラゴン+0.573
vin+0.569
Neg logit contributions
 disclosing-0.356
 anomaly-0.339
 breach-0.333
 cous-0.329
 disg-0.326
Token hip
Token ID10359
Feature activation-1.03e+00

Pos logit contributions
 disclosing+0.113
 anomaly+0.110
 breach+0.108
 cous+0.108
 ANGEL+0.107
Neg logit contributions
urban-0.076
uan-0.074
omon-0.071
ドラゴン-0.070
vin-0.069
Token($
Token ID16763
Feature activation-2.70e+00

Pos logit contributions
 disclosing+1.184
 anomaly+1.126
 breach+1.115
 cous+1.093
 disg+1.085
Neg logit contributions
urban-2.078
uan-2.047
omon-2.004
ドラゴン-1.975
vin-1.955
TokenFC
Token ID4851
Feature activation-1.02e+00

Pos logit contributions
 disclosing+1.538
 anomaly+1.467
 breach+1.465
 cous+1.426
 disg+1.424
Neg logit contributions
urban-2.367
uan-2.333
omon-2.287
ドラゴン-2.243
vin-2.215
Token2
Token ID17
Feature activation-1.09e+00

Pos logit contributions
 disclosing+0.626
 anomaly+0.600
 breach+0.596
 cous+0.587
 disg+0.582
Neg logit contributions
urban-0.857
uan-0.841
omon-0.819
ドラゴン-0.806
vin-0.799
Token[
Token ID58
Feature activation-1.49e+00

Pos logit contributions
 disclosing+0.641
 anomaly+0.613
 breach+0.603
 cous+0.601
 disg+0.600
Neg logit contributions
urban-0.945
uan-0.933
omon-0.909
ドラゴン-0.895
vin-0.887
Token0
Token ID15
Feature activation+8.16e-01

Pos logit contributions
 disclosing+0.709
 anomaly+0.670
 breach+0.664
 cous+0.649
 disg+0.644
Neg logit contributions
urban-1.461
uan-1.439
omon-1.409
ドラゴン-1.390
vin-1.373
Token],
Token ID4357
Feature activation-4.98e+00

Pos logit contributions
urban+0.555
uan+0.545
omon+0.525
ドラゴン+0.512
vin+0.511
Neg logit contributions
 disclosing-0.638
 anomaly-0.617
 breach-0.612
 cous-0.611
 disg-0.609
Token$
Token ID3
Feature activation-2.57e+00

Pos logit contributions
 disclosing+2.050
 breach+1.971
 anomaly+1.925
 ANGEL+1.829
 cous+1.801
Neg logit contributions
urban-5.018
uan-4.981
omon-4.945
ドラゴン-4.813
ervation-4.743
TokenFC
Token ID4851
Feature activation-1.20e+00

Pos logit contributions
 disclosing+1.268
 breach+1.204
 anomaly+1.204
 cous+1.162
 ANGEL+1.154
Neg logit contributions
urban-2.449
uan-2.418
omon-2.377
ドラゴン-2.328
vin-2.306
Token2
Token ID17
Feature activation-2.11e-01

Pos logit contributions
 disclosing+0.776
 anomaly+0.745
 breach+0.738
 cous+0.728
 disg+0.725
Neg logit contributions
urban-0.970
uan-0.954
omon-0.930
ドラゴン-0.915
vin-0.904
Token[
Token ID58
Feature activation-1.13e+00

Pos logit contributions
 disclosing+0.130
 anomaly+0.125
 disg+0.124
 cous+0.123
 ANGEL+0.121
Neg logit contributions
urban-0.180
uan-0.177
omon-0.171
ドラゴン-0.169
vin-0.168
Token1
Token ID16
Feature activation-5.17e-01

Pos logit contributions
 disclosing+0.661
 anomaly+0.632
 breach+0.626
 cous+0.615
 disg+0.610
Neg logit contributions
urban-0.983
uan-0.968
omon-0.947
ドラゴン-0.932
vin-0.921
TokenOR
Token ID1581
Feature activation-7.60e-01

Pos logit contributions
 disclosing+0.960
 anomaly+0.921
 breach+0.912
 cous+0.894
 ANGEL+0.891
Neg logit contributions
urban-1.306
uan-1.286
omon-1.256
ドラゴン-1.237
vin-1.223
TokenPH
Token ID11909
Feature activation-1.00e+00

Pos logit contributions
 disclosing+0.420
 anomaly+0.401
 breach+0.396
 cous+0.392
 disg+0.392
Neg logit contributions
urban-0.691
uan-0.680
omon-0.663
ドラゴン-0.656
vin-0.648
Token_
Token ID62
Feature activation-2.13e+00

Pos logit contributions
 disclosing+0.631
 anomaly+0.606
 breach+0.601
 cous+0.589
 ANGEL+0.586
Neg logit contributions
urban-0.833
uan-0.819
omon-0.802
ドラゴン-0.788
vin-0.778
TokenRECT
Token ID23988
Feature activation-2.63e+00

Pos logit contributions
 disclosing+1.928
 anomaly+1.871
 breach+1.864
 ANGEL+1.855
 cous+1.829
Neg logit contributions
urban-1.146
uan-1.132
omon-1.101
ドラゴン-1.068
vin-1.041
Token,
Token ID11
Feature activation-3.81e+00

Pos logit contributions
 disclosing+1.752
 anomaly+1.697
 breach+1.691
 ANGEL+1.655
 cous+1.620
Neg logit contributions
urban-2.049
uan-2.004
omon-1.989
ドラゴン-1.930
vin-1.901
Token (
Token ID357
Feature activation-4.98e+00

Pos logit contributions
 disclosing+2.347
 anomaly+2.291
 breach+2.276
 ANGEL+2.196
 cous+2.165
Neg logit contributions
urban-3.162
uan-3.104
omon-3.066
ドラゴン-2.993
vin-2.944
Token21
Token ID2481
Feature activation-1.88e+00

Pos logit contributions
 disclosing+3.683
 breach+3.591
 anomaly+3.582
 cous+3.512
 ANGEL+3.424
Neg logit contributions
urban-3.324
uan-3.299
omon-3.208
ドラゴン-3.159
vin-3.030
Token,
Token ID11
Feature activation-2.22e+00

Pos logit contributions
 disclosing+1.311
 anomaly+1.264
 breach+1.259
 cous+1.228
 ANGEL+1.228
Neg logit contributions
urban-1.426
uan-1.397
omon-1.371
ドラゴン-1.344
vin-1.322
Token 21
Token ID2310
Feature activation-1.24e+00

Pos logit contributions
 disclosing+1.533
 anomaly+1.484
 breach+1.478
 ANGEL+1.438
 cous+1.438
Neg logit contributions
urban-1.696
uan-1.658
omon-1.629
ドラゴン-1.595
vin-1.569
Token))
Token ID4008
Feature activation-2.26e+00

Pos logit contributions
 disclosing+0.549
 anomaly+0.521
 breach+0.515
 cous+0.494
 ANGEL+0.492
Neg logit contributions
urban-1.263
uan-1.242
omon-1.223
ドラゴン-1.204
vin-1.191
Token #
Token ID1303
Feature activation-5.14e+00

Pos logit contributions
 disclosing+1.158
 anomaly+1.118
 breach+1.103
 disg+1.073
 cous+1.067
Neg logit contributions
urban-2.128
uan-2.107
omon-2.054
ドラゴン-2.039
-2.014
Token ranks
Token ID9803
Feature activation-4.59e-01

Pos logit contributions
urban+0.603
uan+0.594
omon+0.577
ドラゴン+0.565
vin+0.560
Neg logit contributions
 disclosing-0.541
 anomaly-0.519
 breach-0.512
 cous-0.507
 disg-0.507
Token of
Token ID286
Feature activation-2.10e-01

Pos logit contributions
 disclosing+0.327
 anomaly+0.314
 breach+0.312
 cous+0.307
 disg+0.305
Neg logit contributions
urban-0.344
uan-0.337
omon-0.329
ドラゴン-0.322
vin-0.317
Token Ban
Token ID10274
Feature activation+8.30e-01

Pos logit contributions
 disclosing+0.146
 anomaly+0.141
 breach+0.140
 cous+0.138
 disg+0.137
Neg logit contributions
urban-0.160
uan-0.157
omon-0.153
ドラゴン-0.151
vin-0.149
Tokenu
Token ID84
Feature activation+7.15e-01

Pos logit contributions
urban+0.260
uan+0.247
omon+0.232
ドラゴン+0.223
vin+0.214
Neg logit contributions
 disclosing-0.951
 anomaly-0.930
 breach-0.925
 cous-0.918
 disg-0.915
Token Q
Token ID1195
Feature activation-1.27e+00

Pos logit contributions
urban+0.559
uan+0.550
omon+0.534
ドラゴン+0.524
vin+0.521
Neg logit contributions
 disclosing-0.484
 anomaly-0.466
 breach-0.461
 cous-0.453
 disg-0.453
Tokenura
Token ID5330
Feature activation-4.97e+00

Pos logit contributions
 disclosing+0.925
 anomaly+0.894
 breach+0.891
 cous+0.881
 disg+0.869
Neg logit contributions
urban-0.912
uan-0.895
omon-0.870
ドラゴン-0.870
vin-0.844
Tokeniza
Token ID23638
Feature activation-1.14e+00

Pos logit contributions
 disclosing+3.789
 cous+3.668
 breach+3.649
 anomaly+3.634
 disg+3.583
Neg logit contributions
urban-3.272
ドラゴン-3.194
uan-3.188
omon-3.157
-3.068
Token when
Token ID618
Feature activation+1.82e-01

Pos logit contributions
 disclosing+0.812
 anomaly+0.780
 breach+0.776
 cous+0.766
 disg+0.760
Neg logit contributions
urban-0.853
uan-0.834
omon-0.815
ドラゴン-0.803
vin-0.788
Token Q
Token ID1195
Feature activation-7.73e-01

Pos logit contributions
urban+0.124
uan+0.122
omon+0.118
ドラゴン+0.116
vin+0.114
Neg logit contributions
 disclosing-0.142
 anomaly-0.137
 breach-0.136
 cous-0.134
 disg-0.134
Tokenura
Token ID5330
Feature activation-4.42e+00

Pos logit contributions
 disclosing+0.592
 anomaly+0.572
 breach+0.569
 cous+0.562
 disg+0.558
Neg logit contributions
urban-0.532
uan-0.522
omon-0.507
ドラゴン-0.501
vin-0.491
Tokenish
Token ID680
Feature activation-2.01e+00

Pos logit contributions
 disclosing+3.540
 anomaly+3.424
 breach+3.419
 cous+3.402
 disg+3.356
Neg logit contributions
urban-2.756
uan-2.702
ドラゴン-2.641
omon-2.631
-2.557
Token Fle
Token ID12005
Feature activation-5.36e-01

Pos logit contributions
 disclosing+1.093
 anomaly+1.054
 breach+1.045
 cous+1.028
 disg+1.024
Neg logit contributions
urban-1.297
uan-1.273
omon-1.244
ドラゴン-1.222
vin-1.206
Tokenets
Token ID1039
Feature activation-8.59e-01

Pos logit contributions
 disclosing+0.432
 anomaly+0.418
 breach+0.414
 cous+0.409
 ANGEL+0.407
Neg logit contributions
urban-0.353
uan-0.346
omon-0.335
ドラゴン-0.325
vin-0.323
Token vs
Token ID3691
Feature activation+2.56e-01

Pos logit contributions
 disclosing+0.599
 anomaly+0.582
 breach+0.579
 cous+0.569
 ANGEL+0.563
Neg logit contributions
urban-0.652
uan-0.637
omon-0.619
ドラゴン-0.613
vin-0.600
Token Crypt
Token ID15126
Feature activation-3.95e-01

Pos logit contributions
urban+0.212
uan+0.208
omon+0.203
ドラゴン+0.200
vin+0.198
Neg logit contributions
 disclosing-0.161
 anomaly-0.154
 breach-0.153
 cous-0.151
 disg-0.150
Token Gh
Token ID11972
Feature activation+3.94e-01

Pos logit contributions
 disclosing+0.298
 anomaly+0.288
 breach+0.286
 cous+0.282
 disg+0.281
Neg logit contributions
urban-0.278
uan-0.272
omon-0.264
ドラゴン-0.260
vin-0.256
Tokenast
Token ID459
Feature activation-4.93e+00

Pos logit contributions
urban+0.334
uan+0.328
omon+0.321
ドラゴン+0.314
vin+0.312
Neg logit contributions
 disclosing-0.242
 anomaly-0.231
 breach-0.229
 cous-0.226
 disg-0.225
Token
Token ID198
Feature activation+5.82e-01

Pos logit contributions
 disclosing+3.210
 anomaly+3.179
 breach+3.130
 disg+3.070
 ANGEL+3.054
Neg logit contributions
urban-3.754
ドラゴン-3.685
uan-3.587
omon-3.511
-3.465
Token
Token ID198
Feature activation+1.85e-01

Pos logit contributions
urban+0.405
uan+0.398
omon+0.382
ドラゴン+0.375
vin+0.374
Neg logit contributions
 disclosing-0.448
 anomaly-0.431
 disg-0.429
 cous-0.428
 breach-0.425
TokenP
Token ID47
Feature activation+1.30e+00

Pos logit contributions
urban+0.132
uan+0.129
omon+0.125
ドラゴン+0.123
vin+0.122
Neg logit contributions
 disclosing-0.138
 anomaly-0.133
 breach-0.132
 cous-0.131
 disg-0.130
Tokenrote
Token ID2519
Feature activation-7.13e-01

Pos logit contributions
urban+0.974
uan+0.959
omon+0.931
vin+0.905
ドラゴン+0.903
Neg logit contributions
 disclosing-0.924
 anomaly-0.887
 breach-0.877
 cous-0.874
 disg-0.870
Tokenan
Token ID272
Feature activation-8.97e-01

Pos logit contributions
 disclosing+0.499
 anomaly+0.481
 breach+0.477
 cous+0.470
 disg+0.468
Neg logit contributions
urban-0.542
uan-0.532
omon-0.518
ドラゴン-0.509
vin-0.503
Token met
Token ID1138
Feature activation+2.71e-02

Pos logit contributions
urban+0.927
uan+0.912
omon+0.884
ドラゴン+0.866
vin+0.860
Neg logit contributions
 disclosing-0.908
 anomaly-0.880
 breach-0.868
 cous-0.860
 ANGEL-0.858
Token with
Token ID351
Feature activation-6.63e-01

Pos logit contributions
urban+0.021
uan+0.021
omon+0.020
ドラゴン+0.020
vin+0.020
Neg logit contributions
 disclosing-0.018
 anomaly-0.017
 breach-0.017
 cous-0.017
 disg-0.017
Token the
Token ID262
Feature activation-4.06e-01

Pos logit contributions
 disclosing+0.468
 anomaly+0.451
 breach+0.447
 cous+0.441
 disg+0.439
Neg logit contributions
urban-0.499
uan-0.490
omon-0.478
ドラゴン-0.469
vin-0.463
Token general
Token ID2276
Feature activation-1.30e+00

Pos logit contributions
 disclosing+0.298
 anomaly+0.288
 breach+0.285
 cous+0.282
 disg+0.280
Neg logit contributions
urban-0.294
uan-0.288
omon-0.280
ドラゴン-0.275
vin-0.271
Token in
Token ID287
Feature activation-3.87e-01

Pos logit contributions
 disclosing+0.921
 anomaly+0.888
 breach+0.882
 cous+0.869
 disg+0.859
Neg logit contributions
urban-0.968
uan-0.953
omon-0.930
ドラゴン-0.917
vin-0.894
Token charge
Token ID3877
Feature activation-4.91e+00

Pos logit contributions
 disclosing+0.278
 anomaly+0.269
 breach+0.265
 cous+0.263
 disg+0.262
Neg logit contributions
urban-0.286
uan-0.280
omon-0.272
ドラゴン-0.267
vin-0.264
Token of
Token ID286
Feature activation+6.33e-02

Pos logit contributions
 disclosing+3.228
 anomaly+3.080
 breach+3.051
 cous+3.017
 ANGEL+2.995
Neg logit contributions
urban-3.836
uan-3.779
omon-3.738
ドラゴン-3.669
vin-3.557
Token the
Token ID262
Feature activation+2.16e-01

Pos logit contributions
urban+0.048
uan+0.046
omon+0.045
ドラゴン+0.044
vin+0.044
Neg logit contributions
 disclosing-0.045
 anomaly-0.043
 breach-0.043
 cous-0.042
 disg-0.042
Token units
Token ID4991
Feature activation-8.65e-02

Pos logit contributions
urban+0.161
uan+0.158
omon+0.154
ドラゴン+0.151
vin+0.149
Neg logit contributions
 disclosing-0.154
 anomaly-0.148
 breach-0.146
 cous-0.145
 disg-0.144
Token accused
Token ID5371
Feature activation-1.69e+00

Pos logit contributions
 disclosing+0.062
 anomaly+0.060
 breach+0.059
 cous+0.058
 disg+0.058
Neg logit contributions
urban-0.064
uan-0.063
omon-0.062
ドラゴン-0.061
vin-0.060
Token of
Token ID286
Feature activation-3.43e-01

Pos logit contributions
 disclosing+1.190
 anomaly+1.142
 breach+1.140
 cous+1.118
 disg+1.109
Neg logit contributions
urban-1.285
uan-1.248
omon-1.227
ドラゴン-1.211
vin-1.184
Token next
Token ID1306
Feature activation-5.83e-01

Pos logit contributions
urban+0.293
uan+0.288
omon+0.279
ドラゴン+0.273
vin+0.271
Neg logit contributions
 disclosing-0.303
 anomaly-0.293
 breach-0.290
 cous-0.288
 disg-0.286
Token day
Token ID1110
Feature activation+5.58e-02

Pos logit contributions
 disclosing+0.431
 anomaly+0.416
 breach+0.413
 cous+0.408
 disg+0.406
Neg logit contributions
urban-0.420
uan-0.412
omon-0.400
ドラゴン-0.393
vin-0.388
Token.
Token ID13
Feature activation-1.52e-01

Pos logit contributions
urban+0.041
uan+0.040
omon+0.039
ドラゴン+0.039
vin+0.038
Neg logit contributions
 disclosing-0.040
 anomaly-0.039
 breach-0.038
 cous-0.038
 disg-0.038
Token<|endoftext|>
Token ID50256
Feature activation-1.06e+00

Pos logit contributions
 disclosing+0.107
 anomaly+0.103
 breach+0.103
 cous+0.101
 ANGEL+0.101
Neg logit contributions
urban-0.114
uan-0.112
omon-0.109
ドラゴン-0.107
vin-0.106
TokenAnd
Token ID1870
Feature activation-3.67e-01

Pos logit contributions
 disclosing+0.795
 anomaly+0.769
 breach+0.763
 cous+0.753
 ANGEL+0.749
Neg logit contributions
urban-0.738
uan-0.726
omon-0.707
ドラゴン-0.693
vin-0.680
Token things
Token ID1243
Feature activation-4.91e+00

Pos logit contributions
 disclosing+0.262
 anomaly+0.253
 breach+0.250
 cous+0.247
 disg+0.246
Neg logit contributions
urban-0.274
uan-0.269
omon-0.262
ドラゴン-0.257
vin-0.254
Token seem
Token ID1283
Feature activation-6.70e-01

Pos logit contributions
 disclosing+3.330
 anomaly+3.236
 breach+3.223
 disg+3.122
 cous+3.104
Neg logit contributions
urban-3.830
uan-3.692
omon-3.633
ドラゴン-3.593
-3.514
Token likely
Token ID1884
Feature activation-3.06e-01

Pos logit contributions
 disclosing+0.549
 anomaly+0.533
 breach+0.527
 cous+0.522
 disg+0.520
Neg logit contributions
urban-0.430
uan-0.419
omon-0.408
ドラゴン-0.399
vin-0.391
Token to
Token ID284
Feature activation-6.15e-01

Pos logit contributions
 disclosing+0.248
 anomaly+0.240
 breach+0.239
 cous+0.235
 disg+0.234
Neg logit contributions
urban-0.199
uan-0.193
omon-0.188
ドラゴン-0.184
vin-0.181
Token get
Token ID651
Feature activation-2.42e+00

Pos logit contributions
 disclosing+0.441
 anomaly+0.424
 breach+0.420
 cous+0.416
 disg+0.413
Neg logit contributions
urban-0.457
uan-0.448
omon-0.436
ドラゴン-0.429
vin-0.423
Token worse
Token ID4785
Feature activation-2.23e+00

Pos logit contributions
 disclosing+1.636
 anomaly+1.584
 breach+1.554
 cous+1.541
 disg+1.524
Neg logit contributions
urban-1.909
uan-1.854
omon-1.826
ドラゴン-1.786
vin-1.753
Token($
Token ID16763
Feature activation-2.21e+00

Pos logit contributions
 disclosing+1.019
 anomaly+0.965
 breach+0.958
 cous+0.942
 disg+0.933
Neg logit contributions
urban-1.864
uan-1.830
omon-1.792
ドラゴン-1.757
vin-1.744
TokenFC
Token ID4851
Feature activation-1.32e+00

Pos logit contributions
 disclosing+1.404
 anomaly+1.343
 breach+1.340
 cous+1.315
 disg+1.312
Neg logit contributions
urban-1.807
uan-1.776
omon-1.735
ドラゴン-1.700
vin-1.680
Token1
Token ID16
Feature activation-5.11e-01

Pos logit contributions
 disclosing+0.781
 anomaly+0.747
 breach+0.742
 cous+0.727
 disg+0.724
Neg logit contributions
urban-1.134
uan-1.113
omon-1.086
ドラゴン-1.074
vin-1.060
Token[
Token ID58
Feature activation-8.63e-01

Pos logit contributions
 disclosing+0.311
 anomaly+0.298
 breach+0.296
 cous+0.291
 disg+0.289
Neg logit contributions
urban-0.434
uan-0.427
omon-0.417
ドラゴン-0.411
vin-0.406
Token0
Token ID15
Feature activation+1.43e+00

Pos logit contributions
 disclosing+0.457
 anomaly+0.435
 breach+0.430
 cous+0.422
 disg+0.419
Neg logit contributions
urban-0.803
uan-0.791
omon-0.774
ドラゴン-0.763
vin-0.756
Token],
Token ID4357
Feature activation-4.91e+00

Pos logit contributions
urban+1.006
uan+0.986
omon+0.958
ドラゴン+0.939
vin+0.928
Neg logit contributions
 disclosing-1.081
 anomaly-1.045
 breach-1.037
 cous-1.025
 disg-1.019
Token$
Token ID3
Feature activation-2.38e+00

Pos logit contributions
 disclosing+2.209
 breach+2.105
 anomaly+2.076
 disg+2.042
 cous+2.027
Neg logit contributions
urban-4.806
uan-4.751
omon-4.660
ドラゴン-4.527
vin-4.491
TokenFC
Token ID4851
Feature activation-1.08e+00

Pos logit contributions
 disclosing+1.285
 breach+1.222
 anomaly+1.217
 cous+1.186
 disg+1.181
Neg logit contributions
urban-2.163
uan-2.132
omon-2.088
ドラゴン-2.047
vin-2.027
Token1
Token ID16
Feature activation-9.58e-02

Pos logit contributions
 disclosing+0.645
 anomaly+0.617
 breach+0.610
 cous+0.602
 disg+0.599
Neg logit contributions
urban-0.932
uan-0.917
omon-0.896
ドラゴン-0.882
vin-0.873
Token[
Token ID58
Feature activation-6.40e-01

Pos logit contributions
 disclosing+0.058
 anomaly+0.055
 disg+0.055
 cous+0.054
 breach+0.054
Neg logit contributions
urban-0.082
uan-0.081
omon-0.079
ドラゴン-0.078
vin-0.077
Token1
Token ID16
Feature activation-2.06e-01

Pos logit contributions
 disclosing+0.367
 anomaly+0.351
 breach+0.348
 cous+0.343
 disg+0.339
Neg logit contributions
urban-0.564
uan-0.556
omon-0.543
ドラゴン-0.534
vin-0.527
Token_
Token ID62
Feature activation-1.32e+00

Pos logit contributions
 disclosing+0.417
 anomaly+0.400
 breach+0.396
 cous+0.392
 disg+0.391
Neg logit contributions
urban-0.494
uan-0.486
omon-0.472
ドラゴン-0.465
vin-0.460
Tokenfull
Token ID12853
Feature activation-1.73e+00

Pos logit contributions
 disclosing+0.645
 anomaly+0.615
 breach+0.610
 cous+0.594
 disg+0.590
Neg logit contributions
urban-1.256
uan-1.240
omon-1.219
ドラゴン-1.206
-1.182
Token v
Token ID410
Feature activation-1.51e+00

Pos logit contributions
 disclosing+1.417
 anomaly+1.387
 breach+1.378
 cous+1.353
 disg+1.347
Neg logit contributions
urban-1.113
uan-1.079
omon-1.053
ドラゴン-1.031
-1.000
Token)
Token ID8
Feature activation-3.13e-01

Pos logit contributions
 disclosing+0.987
 anomaly+0.958
 breach+0.950
 cous+0.931
 disg+0.928
Neg logit contributions
urban-1.207
uan-1.180
omon-1.152
ドラゴン-1.139
-1.112
Token {
Token ID1391
Feature activation+1.53e+00

Pos logit contributions
 disclosing+0.205
 anomaly+0.198
 breach+0.196
 cous+0.192
 disg+0.191
Neg logit contributions
urban-0.253
uan-0.246
omon-0.241
ドラゴン-0.238
vin-0.234
Token //
Token ID3373
Feature activation-4.83e+00

Pos logit contributions
urban+0.966
uan+0.946
omon+0.916
ドラゴン+0.895
vin+0.885
Neg logit contributions
 disclosing-1.267
 anomaly-1.226
 breach-1.217
 cous-1.206
 disg-1.200
Token Time
Token ID3862
Feature activation+1.48e+00

Pos logit contributions
 disclosing+3.406
 anomaly+3.356
 breach+3.321
 disg+3.224
 ANGEL+3.209
Neg logit contributions
urban-3.551
uan-3.435
omon-3.348
ドラゴン-3.344
-3.240
Token component
Token ID7515
Feature activation+1.35e+00

Pos logit contributions
urban+1.088
uan+1.072
omon+1.043
ドラゴン+1.022
vin+1.011
Neg logit contributions
 disclosing-1.070
 anomaly-1.023
 breach-1.020
 cous-1.012
 disg-1.005
Token const
Token ID1500
Feature activation+1.02e+00

Pos logit contributions
urban+0.978
uan+0.962
omon+0.936
ドラゴン+0.915
vin+0.907
Neg logit contributions
 disclosing-0.994
 anomaly-0.956
 breach-0.949
 cous-0.941
 disg-0.935
Token float
Token ID12178
Feature activation+3.08e-01

Pos logit contributions
urban+0.803
uan+0.794
omon+0.771
ドラゴン+0.753
vin+0.752
Neg logit contributions
 disclosing-0.685
 anomaly-0.651
 breach-0.649
 cous-0.644
 disg-0.639
Token TA
Token ID21664
Feature activation-1.83e+00

Pos logit contributions
urban+0.239
uan+0.235
omon+0.230
ドラゴン+0.225
vin+0.223
Neg logit contributions
 disclosing-0.209
 anomaly-0.199
 breach-0.198
 cous-0.197
 disg-0.195
Token too
Token ID1165
Feature activation-5.29e-01

Pos logit contributions
urban+1.051
uan+1.034
omon+1.004
ドラゴン+0.985
vin+0.975
Neg logit contributions
 disclosing-1.045
 anomaly-1.010
 breach-1.001
 cous-0.988
 disg-0.983
Token far
Token ID1290
Feature activation-1.16e+00

Pos logit contributions
 disclosing+0.342
 anomaly+0.328
 breach+0.325
 cous+0.321
 disg+0.319
Neg logit contributions
urban-0.431
uan-0.423
omon-0.416
ドラゴン-0.407
vin-0.401
Token on
Token ID319
Feature activation-8.39e-01

Pos logit contributions
 disclosing+0.915
 anomaly+0.879
 breach+0.875
 cous+0.863
 disg+0.862
Neg logit contributions
urban-0.784
uan-0.762
omon-0.742
ドラゴン-0.732
vin-0.714
Token Sound
Token ID9506
Feature activation-4.69e-01

Pos logit contributions
 disclosing+0.533
 anomaly+0.513
 breach+0.509
 cous+0.499
 disg+0.496
Neg logit contributions
urban-0.691
uan-0.682
omon-0.665
ドラゴン-0.653
vin-0.642
TokenCloud
Token ID18839
Feature activation+1.04e+00

Pos logit contributions
 disclosing+0.252
 anomaly+0.240
 breach+0.237
 cous+0.233
 disg+0.231
Neg logit contributions
urban-0.433
uan-0.427
omon-0.417
ドラゴン-0.411
vin-0.407
Token before
Token ID878
Feature activation+3.53e+00

Pos logit contributions
urban+0.666
uan+0.651
omon+0.631
ドラゴン+0.621
vin+0.608
Neg logit contributions
 disclosing-0.847
 anomaly-0.819
 breach-0.814
 cous-0.803
 disg-0.800
Token hearing
Token ID4854
Feature activation+5.73e-01

Pos logit contributions
urban+2.799
uan+2.778
omon+2.705
vin+2.675
ドラゴン+2.646
Neg logit contributions
 disclosing-2.227
 anomaly-2.173
 breach-2.135
 ANGEL-2.114
 cous-2.112
Token a
Token ID257
Feature activation+4.56e-01

Pos logit contributions
urban+0.442
uan+0.434
omon+0.423
ドラゴン+0.416
vin+0.411
Neg logit contributions
 disclosing-0.394
 anomaly-0.379
 breach-0.376
 cous-0.371
 disg-0.369
Token sample
Token ID6291
Feature activation-2.56e-01

Pos logit contributions
urban+0.352
uan+0.345
omon+0.337
ドラゴン+0.330
vin+0.328
Neg logit contributions
 disclosing-0.313
 anomaly-0.300
 breach-0.297
 cous-0.294
 disg-0.292
Token buried
Token ID11694
Feature activation+6.23e-01

Pos logit contributions
 disclosing+0.189
 anomaly+0.182
 breach+0.181
 cous+0.179
 disg+0.178
Neg logit contributions
urban-0.184
uan-0.181
omon-0.176
ドラゴン-0.172
vin-0.170
Token beneath
Token ID11061
Feature activation+6.32e-01

Pos logit contributions
urban+0.505
uan+0.498
omon+0.485
ドラゴン+0.478
vin+0.474
Neg logit contributions
 disclosing-0.401
 anomaly-0.385
 breach-0.382
 cous-0.376
 disg-0.374
Tokenw
Token ID86
Feature activation+1.26e+00

Pos logit contributions
 disclosing+0.292
 anomaly+0.281
 breach+0.280
 cous+0.275
 disg+0.273
Neg logit contributions
urban-0.333
uan-0.326
omon-0.318
ドラゴン-0.313
vin-0.309
Tokenconfig
Token ID11250
Feature activation-4.41e-01

Pos logit contributions
urban+0.888
uan+0.872
omon+0.847
ドラゴン+0.830
vin+0.820
Neg logit contributions
 disclosing-0.944
 anomaly-0.911
 breach-0.904
 cous-0.894
 disg-0.889
Token.
Token ID13
Feature activation+2.15e-01

Pos logit contributions
 disclosing+0.287
 anomaly+0.276
 breach+0.274
 cous+0.269
 disg+0.267
Neg logit contributions
urban-0.358
uan-0.350
omon-0.342
ドラゴン-0.338
vin-0.332
Token
Token ID198
Feature activation+6.98e-01

Pos logit contributions
urban+0.162
uan+0.159
omon+0.155
ドラゴン+0.153
vin+0.150
Neg logit contributions
 disclosing-0.151
 anomaly-0.146
 breach-0.145
 cous-0.143
 disg-0.142
Token
Token ID198
Feature activation+6.16e-01

Pos logit contributions
urban+0.488
uan+0.480
omon+0.462
ドラゴン+0.452
vin+0.452
Neg logit contributions
 disclosing-0.534
 anomaly-0.515
 disg-0.512
 cous-0.511
 breach-0.508
TokenNow
Token ID3844
Feature activation+4.03e+00

Pos logit contributions
urban+0.405
uan+0.397
omon+0.385
ドラゴン+0.377
vin+0.373
Neg logit contributions
 disclosing-0.495
 anomaly-0.479
 breach-0.475
 cous-0.470
 disg-0.468
Token you
Token ID345
Feature activation-3.03e-01

Pos logit contributions
urban+2.841
uan+2.835
omon+2.740
vin+2.662
ドラゴン+2.658
Neg logit contributions
 disclosing-2.947
 anomaly-2.844
 cous-2.807
 breach-2.795
 ANGEL-2.779
Token can
Token ID460
Feature activation+1.67e-01

Pos logit contributions
 disclosing+0.231
 anomaly+0.224
 breach+0.223
 cous+0.219
 disg+0.218
Neg logit contributions
urban-0.212
uan-0.206
omon-0.201
ドラゴン-0.197
vin-0.193
Token reboot
Token ID20149
Feature activation+3.13e+00

Pos logit contributions
urban+0.122
uan+0.119
omon+0.116
ドラゴン+0.114
vin+0.112
Neg logit contributions
 disclosing-0.122
 anomaly-0.118
 breach-0.117
 cous-0.115
 disg-0.115
Token your
Token ID534
Feature activation+6.26e-01

Pos logit contributions
urban+2.855
uan+2.836
omon+2.779
ドラゴン+2.721
vin+2.706
Neg logit contributions
 disclosing-1.664
 anomaly-1.589
 breach-1.558
 cous-1.558
 disg-1.539
Token computer
Token ID3644
Feature activation+1.42e+00

Pos logit contributions
urban+0.441
uan+0.433
omon+0.421
ドラゴン+0.413
vin+0.408
Neg logit contributions
 disclosing-0.471
 anomaly-0.455
 breach-0.451
 cous-0.446
 disg-0.444
Token,
Token ID11
Feature activation+3.41e-03

Pos logit contributions
 disclosing+0.100
 anomaly+0.096
 breach+0.095
 cous+0.094
 disg+0.093
Neg logit contributions
urban-0.106
uan-0.104
omon-0.101
ドラゴン-0.099
vin-0.098
Token densely
Token ID42255
Feature activation+1.95e+00

Pos logit contributions
urban+0.003
uan+0.003
omon+0.003
ドラゴン+0.002
vin+0.002
Neg logit contributions
 disclosing-0.002
 anomaly-0.002
 breach-0.002
 cous-0.002
 disg-0.002
Token populated
Token ID22331
Feature activation+3.44e-01

Pos logit contributions
urban+1.361
uan+1.328
omon+1.279
vin+1.246
ドラゴン+1.245
Neg logit contributions
 disclosing-1.477
 anomaly-1.427
 breach-1.416
 ANGEL-1.402
 cous-1.400
Token villages
Token ID15425
Feature activation-2.54e-01

Pos logit contributions
urban+0.259
uan+0.254
omon+0.246
ドラゴン+0.242
vin+0.239
Neg logit contributions
 disclosing-0.244
 anomaly-0.235
 breach-0.233
 cous-0.230
 disg-0.229
Token ab
Token ID450
Feature activation+5.80e-01

Pos logit contributions
 disclosing+0.181
 anomaly+0.174
 breach+0.172
 cous+0.171
 ANGEL+0.171
Neg logit contributions
urban-0.188
uan-0.185
omon-0.180
ドラゴン-0.176
vin-0.175
Tokenut
Token ID315
Feature activation+3.00e+00

Pos logit contributions
urban+0.315
uan+0.308
omon+0.296
ドラゴン+0.286
vin+0.283
Neg logit contributions
 disclosing-0.533
 anomaly-0.517
 breach-0.513
 cous-0.509
 ANGEL-0.506
Token protected
Token ID6861
Feature activation+7.24e-01

Pos logit contributions
urban+2.377
uan+2.344
omon+2.258
vin+2.214
ドラゴン+2.209
Neg logit contributions
 disclosing-2.001
 anomaly-1.906
 breach-1.891
 cous-1.877
 disg-1.861
Token areas
Token ID3006
Feature activation-1.53e+00

Pos logit contributions
urban+0.547
uan+0.537
omon+0.519
ドラゴン+0.511
vin+0.506
Neg logit contributions
 disclosing-0.512
 anomaly-0.492
 breach-0.487
 cous-0.481
 ANGEL-0.479
Token set
Token ID900
Feature activation+4.97e-01

Pos logit contributions
 disclosing+1.079
 anomaly+1.039
 breach+1.033
 cous+1.016
 disg+1.015
Neg logit contributions
urban-1.150
uan-1.128
omon-1.100
ドラゴン-1.083
vin-1.066
Token aside
Token ID7263
Feature activation+8.27e-01

Pos logit contributions
urban+0.371
uan+0.365
omon+0.354
ドラゴン+0.346
vin+0.343
Neg logit contributions
 disclosing-0.352
 anomaly-0.339
 breach-0.336
 cous-0.331
 ANGEL-0.330
Token for
Token ID329
Feature activation-8.05e-01

Pos logit contributions
urban+0.584
uan+0.572
omon+0.553
ドラゴン+0.542
vin+0.538
Neg logit contributions
 disclosing-0.623
 anomaly-0.601
 breach-0.596
 cous-0.590
 ANGEL-0.586
Token and
Token ID290
Feature activation+8.98e-01

Pos logit contributions
 disclosing+0.683
 anomaly+0.656
 breach+0.652
 cous+0.640
 disg+0.640
Neg logit contributions
urban-0.760
uan-0.743
omon-0.727
ドラゴン-0.716
vin-0.704
Token his
Token ID465
Feature activation-9.42e-02

Pos logit contributions
urban+0.697
uan+0.685
omon+0.665
ドラゴン+0.653
vin+0.647
Neg logit contributions
 disclosing-0.611
 anomaly-0.588
 breach-0.582
 cous-0.575
 disg-0.572
Token friends
Token ID2460
Feature activation+2.68e+00

Pos logit contributions
 disclosing+0.062
 anomaly+0.060
 breach+0.059
 cous+0.059
 disg+0.058
Neg logit contributions
urban-0.075
uan-0.074
omon-0.072
ドラゴン-0.071
vin-0.070
Token should
Token ID815
Feature activation+2.42e+00

Pos logit contributions
urban+1.957
uan+1.939
omon+1.879
ドラゴン+1.833
vin+1.821
Neg logit contributions
 disclosing-1.916
 anomaly-1.855
 breach-1.818
 cous-1.817
 ANGEL-1.815
Token have
Token ID423
Feature activation+2.66e+00

Pos logit contributions
urban+1.766
uan+1.748
omon+1.679
ドラゴン+1.635
vin+1.635
Neg logit contributions
 disclosing-1.729
 anomaly-1.669
 ANGEL-1.639
 cous-1.635
 breach-1.618
Token thought
Token ID1807
Feature activation+3.25e+00

Pos logit contributions
urban+1.932
uan+1.923
omon+1.847
ドラゴン+1.822
vin+1.799
Neg logit contributions
 disclosing-1.885
 anomaly-1.835
 breach-1.801
 cous-1.792
 ANGEL-1.783
Token twice
Token ID5403
Feature activation+1.34e+00

Pos logit contributions
urban+2.503
uan+2.476
omon+2.401
ドラゴン+2.349
vin+2.338
Neg logit contributions
 disclosing-2.154
 anomaly-2.093
 breach-2.077
 cous-2.057
 disg-2.050
Token before
Token ID878
Feature activation+3.70e+00

Pos logit contributions
urban+0.905
uan+0.888
omon+0.861
ドラゴン+0.842
vin+0.833
Neg logit contributions
 disclosing-1.041
 anomaly-1.012
 breach-1.004
 cous-0.992
 disg-0.988
Token approaching
Token ID13885
Feature activation+1.53e+00

Pos logit contributions
urban+2.705
uan+2.685
omon+2.583
ドラゴン+2.518
vin+2.514
Neg logit contributions
 disclosing-2.531
 anomaly-2.504
 ANGEL-2.476
 breach-2.449
 cous-2.430
Token it
Token ID340
Feature activation+6.14e-01

Pos logit contributions
urban+1.203
uan+1.183
omon+1.150
ドラゴン+1.130
vin+1.118
Neg logit contributions
 disclosing-1.032
 anomaly-0.993
 breach-0.982
 cous-0.970
 disg-0.965
Token.
Token ID13
Feature activation+1.51e+00

Pos logit contributions
urban+0.458
uan+0.447
omon+0.437
ドラゴン+0.430
vin+0.423
Neg logit contributions
 disclosing-0.442
 anomaly-0.424
 breach-0.423
 cous-0.416
 disg-0.415
Token recognition
Token ID9465
Feature activation+3.00e-01

Pos logit contributions
 disclosing+0.652
 anomaly+0.628
 breach+0.622
 cous+0.613
 disg+0.609
Neg logit contributions
urban-0.765
uan-0.751
omon-0.733
ドラゴン-0.721
vin-0.712
Token technology
Token ID3037
Feature activation+1.29e-01

Pos logit contributions
urban+0.226
uan+0.222
omon+0.216
ドラゴン+0.212
vin+0.209
Neg logit contributions
 disclosing-0.211
 anomaly-0.203
 breach-0.202
 cous-0.199
 disg-0.198
Token:
Token ID25
Feature activation+6.75e-01

Pos logit contributions
urban+0.098
uan+0.096
omon+0.093
ドラゴン+0.092
vin+0.090
Neg logit contributions
 disclosing-0.091
 anomaly-0.088
 breach-0.087
 cous-0.085
 disg-0.085
Token
Token ID198
Feature activation+1.34e+00

Pos logit contributions
urban+0.499
uan+0.489
omon+0.478
ドラゴン+0.469
vin+0.460
Neg logit contributions
 disclosing-0.488
 anomaly-0.470
 breach-0.467
 cous-0.458
 ANGEL-0.457
Token
Token ID198
Feature activation+1.10e+00

Pos logit contributions
urban+0.917
uan+0.902
omon+0.860
vin+0.848
ドラゴン+0.845
Neg logit contributions
 disclosing-1.043
 disg-1.008
 anomaly-1.004
 cous-1.002
 breach-0.988
Tokenhttps
Token ID5450
Feature activation+3.37e+00

Pos logit contributions
urban+0.678
uan+0.664
omon+0.643
ドラゴン+0.629
vin+0.618
Neg logit contributions
 disclosing-0.923
 anomaly-0.895
 breach-0.889
 cous-0.878
 disg-0.874
Token://
Token ID1378
Feature activation+2.57e+00

Pos logit contributions
urban+1.796
uan+1.739
vin+1.604
ドラゴン+1.599
omon+1.592
Neg logit contributions
 disclosing-3.179
 disg-3.153
 cous-3.092
 anomaly-3.076
 breach-3.015
Tokenwww
Token ID2503
Feature activation+1.26e+00

Pos logit contributions
urban+1.613
uan+1.566
omon+1.512
vin+1.469
ドラゴン+1.457
Neg logit contributions
 disclosing-2.153
 anomaly-2.096
 breach-2.067
 cous-2.060
 disg-2.052
Token.
Token ID13
Feature activation+1.15e+00

Pos logit contributions
urban+0.867
uan+0.849
omon+0.806
vin+0.798
ドラゴン+0.792
Neg logit contributions
 disclosing-0.990
 disg-0.962
 anomaly-0.953
 cous-0.947
 breach-0.937
Tokeneff
Token ID14822
Feature activation+9.96e-01

Pos logit contributions
urban+0.943
uan+0.927
omon+0.903
ドラゴン+0.883
vin+0.879
Neg logit contributions
 disclosing-0.739
 anomaly-0.711
 breach-0.703
 cous-0.694
 disg-0.690
Token.
Token ID13
Feature activation+3.03e+00

Pos logit contributions
urban+0.711
uan+0.697
omon+0.673
vin+0.656
ドラゴン+0.652
Neg logit contributions
 disclosing-0.749
 anomaly-0.724
 disg-0.715
 breach-0.714
 cous-0.712
Token realize
Token ID6537
Feature activation-9.98e-01

Pos logit contributions
 disclosing+0.517
 anomaly+0.496
 breach+0.493
 cous+0.485
 disg+0.483
Neg logit contributions
urban-0.609
uan-0.598
omon-0.583
ドラゴン-0.575
vin-0.567
Token--
Token ID438
Feature activation+2.63e-01

Pos logit contributions
 disclosing+0.697
 anomaly+0.670
 breach+0.663
 cous+0.652
 disg+0.650
Neg logit contributions
urban-0.763
uan-0.750
omon-0.734
ドラゴン-0.722
vin-0.709
Tokendating
Token ID38734
Feature activation+1.99e+00

Pos logit contributions
urban+0.187
uan+0.183
omon+0.178
ドラゴン+0.175
vin+0.173
Neg logit contributions
 disclosing-0.197
 anomaly-0.191
 breach-0.189
 cous-0.187
 disg-0.186
Token back
Token ID736
Feature activation-3.49e-01

Pos logit contributions
urban+1.318
uan+1.290
omon+1.251
ドラゴン+1.225
vin+1.208
Neg logit contributions
 disclosing-1.583
 anomaly-1.532
 breach-1.521
 cous-1.503
 disg-1.497
Token to
Token ID284
Feature activation+7.59e-01

Pos logit contributions
 disclosing+0.281
 anomaly+0.272
 breach+0.270
 cous+0.267
 disg+0.266
Neg logit contributions
urban-0.229
uan-0.223
omon-0.218
ドラゴン-0.213
vin-0.209
Token the
Token ID262
Feature activation+7.61e-01

Pos logit contributions
urban+0.547
uan+0.537
omon+0.522
ドラゴン+0.512
vin+0.506
Neg logit contributions
 disclosing-0.559
 anomaly-0.540
 breach-0.536
 cous-0.529
 disg-0.526
Token 1950
Token ID11445
Feature activation+1.26e+00

Pos logit contributions
urban+0.499
uan+0.489
omon+0.474
ドラゴン+0.464
vin+0.458
Neg logit contributions
 disclosing-0.609
 anomaly-0.589
 breach-0.584
 cous-0.578
 ANGEL-0.576
Tokens
Token ID82
Feature activation+2.59e+00

Pos logit contributions
urban+0.994
uan+0.976
omon+0.951
ドラゴン+0.929
vin+0.924
Neg logit contributions
 disclosing-0.839
 anomaly-0.802
 breach-0.795
 cous-0.791
 disg-0.789
Token.
Token ID13
Feature activation+7.37e-01

Pos logit contributions
urban+1.813
uan+1.777
omon+1.720
ドラゴン+1.678
vin+1.672
Neg logit contributions
 disclosing-1.953
 anomaly-1.885
 breach-1.873
 cous-1.860
 ANGEL-1.845
Token The
Token ID383
Feature activation+2.24e-01

Pos logit contributions
urban+0.574
uan+0.564
omon+0.548
ドラゴン+0.539
vin+0.534
Neg logit contributions
 disclosing-0.502
 anomaly-0.482
 breach-0.479
 cous-0.474
 disg-0.471
Token combo
Token ID14831
Feature activation+1.03e+00

Pos logit contributions
urban+0.171
uan+0.168
omon+0.164
ドラゴン+0.161
vin+0.159
Neg logit contributions
 disclosing-0.155
 anomaly-0.150
 breach-0.148
 cous-0.146
 disg-0.146
Token Samurai
Token ID23882
Feature activation+5.34e-02

Pos logit contributions
 disclosing+0.060
 anomaly+0.058
 breach+0.058
 cous+0.057
 disg+0.057
Neg logit contributions
urban-0.061
uan-0.060
omon-0.058
ドラゴン-0.057
vin-0.056
Token are
Token ID389
Feature activation-5.38e-01

Pos logit contributions
urban+0.039
uan+0.038
omon+0.037
ドラゴン+0.037
vin+0.036
Neg logit contributions
 disclosing-0.039
 anomaly-0.037
 breach-0.037
 cous-0.037
 disg-0.036
Token the
Token ID262
Feature activation-8.40e-01

Pos logit contributions
 disclosing+0.356
 anomaly+0.343
 breach+0.339
 cous+0.334
 disg+0.332
Neg logit contributions
urban-0.429
uan-0.422
omon-0.412
ドラゴン-0.405
vin-0.400
Token same
Token ID976
Feature activation+1.04e-01

Pos logit contributions
 disclosing+0.524
 anomaly+0.504
 breach+0.499
 cous+0.491
 ANGEL+0.487
Neg logit contributions
urban-0.702
uan-0.690
omon-0.675
ドラゴン-0.663
vin-0.655
Token,
Token ID11
Feature activation+4.72e-01

Pos logit contributions
urban+0.081
uan+0.079
omon+0.077
ドラゴン+0.076
vin+0.075
Neg logit contributions
 disclosing-0.071
 anomaly-0.068
 breach-0.068
 cous-0.067
 disg-0.066
Token only
Token ID691
Feature activation+4.65e-01

Pos logit contributions
urban+0.348
uan+0.342
omon+0.332
ドラゴン+0.327
vin+0.322
Neg logit contributions
 disclosing-0.340
 anomaly-0.328
 breach-0.326
 cous-0.321
 disg-0.320
Token that
Token ID326
Feature activation+1.44e+00

Pos logit contributions
urban+0.346
uan+0.340
omon+0.331
ドラゴン+0.325
vin+0.321
Neg logit contributions
 disclosing-0.332
 anomaly-0.320
 breach-0.318
 cous-0.313
 disg-0.311
Token they
Token ID484
Feature activation+5.00e-01

Pos logit contributions
urban+1.084
uan+1.063
omon+1.033
ドラゴン+1.016
vin+1.004
Neg logit contributions
 disclosing-1.011
 anomaly-0.974
 breach-0.966
 cous-0.954
 disg-0.951
Token tend
Token ID4327
Feature activation-7.69e-01

Pos logit contributions
urban+0.365
uan+0.360
omon+0.349
ドラゴン+0.341
vin+0.340
Neg logit contributions
 disclosing-0.361
 anomaly-0.349
 breach-0.343
 cous-0.341
 ANGEL-0.340
Token to
Token ID284
Feature activation-3.25e-01

Pos logit contributions
 disclosing+0.630
 anomaly+0.610
 breach+0.605
 cous+0.601
 ANGEL+0.596
Neg logit contributions
urban-0.494
uan-0.478
omon-0.469
ドラゴン-0.455
vin-0.448
Token appear
Token ID1656
Feature activation+7.03e-01

Pos logit contributions
 disclosing+0.222
 anomaly+0.213
 breach+0.210
 cous+0.208
 disg+0.207
Neg logit contributions
urban-0.251
uan-0.246
omon-0.239
ドラゴン-0.235
vin-0.233
Token with
Token ID351
Feature activation+4.42e-01

Pos logit contributions
urban+0.371
uan+0.365
omon+0.353
ドラゴン+0.347
vin+0.344
Neg logit contributions
 disclosing-0.383
 anomaly-0.370
 breach-0.367
 cous-0.363
 disg-0.361
Token a
Token ID257
Feature activation+1.64e-01

Pos logit contributions
urban+0.324
uan+0.318
omon+0.309
ドラゴン+0.303
vin+0.300
Neg logit contributions
 disclosing-0.321
 anomaly-0.310
 breach-0.307
 cous-0.303
 disg-0.302
Token strong
Token ID1913
Feature activation+1.72e-01

Pos logit contributions
urban+0.127
uan+0.125
omon+0.122
ドラゴン+0.119
vin+0.118
Neg logit contributions
 disclosing-0.113
 anomaly-0.109
 breach-0.108
 cous-0.106
 disg-0.106
Token focus
Token ID2962
Feature activation-1.85e+00

Pos logit contributions
urban+0.129
uan+0.127
omon+0.124
ドラゴン+0.122
vin+0.120
Neg logit contributions
 disclosing-0.121
 anomaly-0.117
 breach-0.116
 cous-0.114
 disg-0.113
Token on
Token ID319
Feature activation-2.18e-01

Pos logit contributions
 disclosing+1.259
 anomaly+1.208
 breach+1.196
 cous+1.171
 disg+1.159
Neg logit contributions
urban-1.441
uan-1.406
omon-1.382
ドラゴン-1.373
-1.336
Token the
Token ID262
Feature activation+6.59e-01

Pos logit contributions
 disclosing+0.152
 anomaly+0.146
 breach+0.145
 cous+0.143
 disg+0.143
Neg logit contributions
urban-0.165
uan-0.162
omon-0.158
ドラゴン-0.155
vin-0.153
Token Internet
Token ID4455
Feature activation+1.37e+00

Pos logit contributions
urban+0.503
uan+0.494
omon+0.481
ドラゴン+0.472
vin+0.466
Neg logit contributions
 disclosing-0.458
 anomaly-0.441
 breach-0.437
 cous-0.432
 disg-0.429
Token of
Token ID286
Feature activation-3.58e-01

Pos logit contributions
urban+1.018
uan+0.999
omon+0.972
ドラゴン+0.953
vin+0.943
Neg logit contributions
 disclosing-0.982
 anomaly-0.944
 breach-0.936
 cous-0.926
 disg-0.922
Token Things
Token ID11597
Feature activation+4.12e-02

Pos logit contributions
 disclosing+0.233
 anomaly+0.223
 breach+0.221
 cous+0.218
 disg+0.217
Neg logit contributions
urban-0.290
uan-0.285
omon-0.278
ドラゴン-0.273
vin-0.270
Token (
Token ID357
Feature activation-1.59e-02

Pos logit contributions
urban+0.031
uan+0.030
omon+0.030
ドラゴン+0.029
vin+0.029
Neg logit contributions
 disclosing-0.029
 anomaly-0.028
 breach-0.028
 cous-0.027
 disg-0.027
TokenI
Token ID40
Feature activation+5.25e-01

Pos logit contributions
 disclosing+0.012
 anomaly+0.012
 breach+0.011
 disg+0.011
 cous+0.011
Neg logit contributions
urban-0.011
uan-0.011
omon-0.011
ドラゴン-0.010
vin-0.010
Token(
Token ID7
Feature activation-1.22e+00

Pos logit contributions
urban+0.745
uan+0.735
omon+0.708
ドラゴン+0.705
vin+0.703
Neg logit contributions
 disclosing-0.480
 disg-0.461
 anomaly-0.456
 cous-0.452
 breach-0.446
Token230
Token ID19214
Feature activation+2.73e-01

Pos logit contributions
 disclosing+0.855
 anomaly+0.825
 breach+0.820
 cous+0.806
 disg+0.800
Neg logit contributions
urban-0.919
uan-0.901
omon-0.880
ドラゴン-0.862
vin-0.850
Tokens
Token ID82
Feature activation-2.64e-01

Pos logit contributions
urban+0.216
uan+0.213
omon+0.207
ドラゴン+0.203
vin+0.202
Neg logit contributions
 disclosing-0.183
 anomaly-0.176
 breach-0.173
 cous-0.173
 disg-0.172
Token)}
Token ID38165
Feature activation-4.60e-01

Pos logit contributions
 disclosing+0.168
 anomaly+0.162
 breach+0.160
 cous+0.158
 disg+0.157
Neg logit contributions
urban-0.217
uan-0.213
omon-0.208
ドラゴン-0.205
vin-0.203
Token a
Token ID257
Feature activation-1.82e-01

Pos logit contributions
 disclosing+0.312
 anomaly+0.300
 breach+0.298
 cous+0.293
 disg+0.292
Neg logit contributions
urban-0.360
uan-0.353
omon-0.345
ドラゴン-0.339
vin-0.334
Token3
Token ID18
Feature activation+2.38e+00

Pos logit contributions
 disclosing+0.121
 anomaly+0.117
 breach+0.116
 cous+0.114
 disg+0.114
Neg logit contributions
urban-0.144
uan-0.141
omon-0.138
ドラゴン-0.135
vin-0.134
Token {
Token ID1391
Feature activation+1.87e-01

Pos logit contributions
urban+1.802
uan+1.774
omon+1.728
ドラゴン+1.697
vin+1.683
Neg logit contributions
 disclosing-1.668
 anomaly-1.604
 breach-1.584
 cous-1.572
 disg-1.563
Token(
Token ID7
Feature activation-1.53e+00

Pos logit contributions
urban+0.170
uan+0.167
omon+0.162
ドラゴン+0.161
vin+0.160
Neg logit contributions
 disclosing-0.105
 anomaly-0.100
 disg-0.100
 cous-0.099
 breach-0.098
Token98
Token ID4089
Feature activation-4.55e-01

Pos logit contributions
 disclosing+1.116
 anomaly+1.079
 breach+1.072
 cous+1.055
 ANGEL+1.047
Neg logit contributions
urban-1.104
uan-1.082
omon-1.055
ドラゴン-1.033
vin-1.018
Tokens
Token ID82
Feature activation-8.30e-01

Pos logit contributions
 disclosing+0.300
 anomaly+0.288
 breach+0.284
 cous+0.281
 disg+0.280
Neg logit contributions
urban-0.365
uan-0.359
omon-0.350
ドラゴン-0.344
vin-0.340
Token)}
Token ID38165
Feature activation-9.07e-01

Pos logit contributions
 disclosing+0.521
 anomaly+0.500
 breach+0.495
 cous+0.487
 disg+0.484
Neg logit contributions
urban-0.690
uan-0.678
omon-0.662
ドラゴン-0.651
vin-0.644
Token calories
Token ID14653
Feature activation+1.84e-01

Pos logit contributions
urban+0.073
uan+0.071
omon+0.068
ドラゴン+0.066
vin+0.065
Neg logit contributions
 disclosing-0.153
 anomaly-0.149
 breach-0.148
 cous-0.147
 disg-0.146
Token also
Token ID635
Feature activation-1.99e-01

Pos logit contributions
urban+0.130
uan+0.127
omon+0.124
ドラゴン+0.121
vin+0.120
Neg logit contributions
 disclosing-0.138
 anomaly-0.134
 breach-0.133
 cous-0.131
 disg-0.130
Token equals
Token ID21767
Feature activation-4.24e-01

Pos logit contributions
 disclosing+0.175
 anomaly+0.170
 breach+0.169
 cous+0.167
 disg+0.167
Neg logit contributions
urban-0.115
uan-0.112
omon-0.109
ドラゴン-0.106
vin-0.104
Token less
Token ID1342
Feature activation+9.16e-01

Pos logit contributions
 disclosing+0.314
 anomaly+0.303
 breach+0.301
 cous+0.296
 disg+0.295
Neg logit contributions
urban-0.306
uan-0.300
omon-0.292
ドラゴン-0.287
vin-0.282
Token glucose
Token ID15701
Feature activation+1.30e+00

Pos logit contributions
urban+0.665
uan+0.654
omon+0.635
ドラゴン+0.621
vin+0.615
Neg logit contributions
 disclosing-0.669
 anomaly-0.644
 breach-0.639
 cous-0.633
 disg-0.629
Token available
Token ID1695
Feature activation+1.02e+00

Pos logit contributions
urban+0.975
uan+0.956
omon+0.931
ドラゴン+0.912
vin+0.903
Neg logit contributions
 disclosing-0.917
 anomaly-0.883
 breach-0.876
 cous-0.867
 ANGEL-0.861
Token for
Token ID329
Feature activation+4.41e-01

Pos logit contributions
urban+0.698
uan+0.682
omon+0.663
ドラゴン+0.652
vin+0.641
Neg logit contributions
 disclosing-0.788
 anomaly-0.761
 breach-0.756
 cous-0.745
 disg-0.743
Token fuel
Token ID5252
Feature activation+1.44e+00

Pos logit contributions
urban+0.328
uan+0.322
omon+0.313
ドラゴン+0.308
vin+0.304
Neg logit contributions
 disclosing-0.316
 anomaly-0.305
 breach-0.302
 cous-0.298
 disg-0.297
Token,
Token ID11
Feature activation-2.60e-01

Pos logit contributions
urban+1.061
uan+1.041
omon+1.013
ドラゴン+0.997
vin+0.982
Neg logit contributions
 disclosing-1.042
 anomaly-1.006
 breach-0.998
 cous-0.983
 disg-0.980
Token so
Token ID523
Feature activation-1.55e+00

Pos logit contributions
 disclosing+0.187
 anomaly+0.180
 breach+0.178
 cous+0.176
 disg+0.175
Neg logit contributions
urban-0.193
uan-0.189
omon-0.184
ドラゴン-0.181
vin-0.179
Token the
Token ID262
Feature activation+2.94e-02

Pos logit contributions
 disclosing+1.103
 anomaly+1.061
 breach+1.052
 cous+1.036
 disg+1.030
Neg logit contributions
urban-1.162
uan-1.136
omon-1.113
ドラゴン-1.100
vin-1.077
Tokení
Token ID8836
Feature activation+4.76e-02

Pos logit contributions
urban+0.417
uan+0.411
omon+0.398
ドラゴン+0.387
vin+0.386
Neg logit contributions
 disclosing-0.468
 anomaly-0.453
 breach-0.449
 disg-0.442
 cous-0.442
Tokenk
Token ID74
Feature activation+4.45e-01

Pos logit contributions
urban+0.031
uan+0.031
omon+0.030
ドラゴン+0.029
vin+0.029
Neg logit contributions
 disclosing-0.038
 anomaly-0.037
 breach-0.037
 cous-0.036
 disg-0.036
Token 41
Token ID6073
Feature activation+4.14e-01

Pos logit contributions
urban+0.303
uan+0.299
omon+0.289
vin+0.281
ドラゴン+0.279
Neg logit contributions
 disclosing-0.347
 anomaly-0.335
 breach-0.332
 cous-0.328
 disg-0.327
Token),
Token ID828
Feature activation-1.23e+00

Pos logit contributions
urban+0.336
uan+0.329
omon+0.322
ドラゴン+0.318
vin+0.312
Neg logit contributions
 disclosing-0.267
 anomaly-0.257
 breach-0.255
 cous-0.251
 disg-0.250
Token Kr
Token ID13685
Feature activation-1.45e-01

Pos logit contributions
 disclosing+0.854
 anomaly+0.825
 breach+0.819
 cous+0.807
 ANGEL+0.801
Neg logit contributions
urban-0.939
uan-0.918
omon-0.898
ドラゴン-0.883
vin-0.867
Token?,
Token ID21747
Feature activation-2.01e-01

Pos logit contributions
 disclosing+0.106
 anomaly+0.103
 breach+0.102
 cous+0.101
 disg+0.100
Neg logit contributions
urban-0.105
uan-0.103
omon-0.100
ドラゴン-0.099
vin-0.097
Token Kur
Token ID18132
Feature activation-4.31e-01

Pos logit contributions
 disclosing+0.140
 anomaly+0.134
 breach+0.133
 cous+0.132
 disg+0.131
Neg logit contributions
urban-0.154
uan-0.151
omon-0.147
ドラゴン-0.145
vin-0.143
Token?
Token ID30
Feature activation-1.28e-01

Pos logit contributions
 disclosing+0.316
 anomaly+0.305
 breach+0.303
 cous+0.299
 disg+0.298
Neg logit contributions
urban-0.313
uan-0.307
omon-0.299
ドラゴン-0.293
vin-0.289
Tokení
Token ID8836
Feature activation+4.54e-02

Pos logit contributions
 disclosing+0.097
 anomaly+0.094
 breach+0.093
 cous+0.092
 disg+0.092
Neg logit contributions
urban-0.090
uan-0.088
omon-0.086
ドラゴン-0.084
vin-0.083
Tokenk
Token ID74
Feature activation+8.14e-01

Pos logit contributions
urban+0.030
uan+0.029
omon+0.028
ドラゴン+0.027
vin+0.027
Neg logit contributions
 disclosing-0.037
 anomaly-0.035
 breach-0.035
 cous-0.035
 disg-0.035
Token,
Token ID11
Feature activation+6.33e-01

Pos logit contributions
urban+0.612
uan+0.602
omon+0.586
ドラゴン+0.573
vin+0.569
Neg logit contributions
 disclosing-0.576
 anomaly-0.555
 breach-0.550
 cous-0.542
 disg-0.540
Token records
Token ID4406
Feature activation-1.76e+00

Pos logit contributions
 disclosing+0.968
 anomaly+0.936
 breach+0.928
 cous+0.915
 disg+0.908
Neg logit contributions
urban-1.069
uan-1.047
omon-1.021
ドラゴン-1.004
vin-0.985
Token out
Token ID503
Feature activation-2.20e+00

Pos logit contributions
 disclosing+1.242
 anomaly+1.194
 breach+1.186
 cous+1.170
 disg+1.166
Neg logit contributions
urban-1.326
uan-1.300
omon-1.265
ドラゴン-1.248
vin-1.220
Token.
Token ID13
Feature activation-5.00e-02

Pos logit contributions
 disclosing+1.502
 anomaly+1.436
 breach+1.434
 cous+1.402
 ANGEL+1.398
Neg logit contributions
urban-1.699
uan-1.657
omon-1.623
ドラゴン-1.595
vin-1.554
Token
Token ID447
Feature activation-3.68e-02

Pos logit contributions
 disclosing+0.040
 anomaly+0.039
 breach+0.038
 cous+0.038
 ANGEL+0.038
Neg logit contributions
urban-0.033
uan-0.032
omon-0.031
ドラゴン-0.031
vin-0.030
Token
Token ID251
Feature activation-3.10e-01

Pos logit contributions
 disclosing+0.037
 anomaly+0.036
 breach+0.036
 cous+0.035
 disg+0.035
Neg logit contributions
urban-0.017
uan-0.016
omon-0.016
ドラゴン-0.015
vin-0.015
Token Cob
Token ID14828
Feature activation-2.50e+00

Pos logit contributions
 disclosing+0.219
 anomaly+0.211
 breach+0.209
 cous+0.206
 disg+0.205
Neg logit contributions
urban-0.233
uan-0.229
omon-0.223
ドラゴン-0.219
vin-0.216
Tokenain
Token ID391
Feature activation-2.73e-01

Pos logit contributions
 disclosing+2.483
 breach+2.433
 anomaly+2.428
 cous+2.372
 ANGEL+2.354
Neg logit contributions
urban-1.105
ドラゴン-1.051
uan-1.050
omon-1.028
-0.966
Token hands
Token ID2832
Feature activation-1.44e+00

Pos logit contributions
 disclosing+0.184
 anomaly+0.177
 breach+0.176
 cous+0.173
 disg+0.172
Neg logit contributions
urban-0.213
uan-0.210
omon-0.205
ドラゴン-0.201
vin-0.199
Token me
Token ID502
Feature activation+1.77e-01

Pos logit contributions
 disclosing+0.956
 anomaly+0.922
 breach+0.912
 cous+0.899
 disg+0.892
Neg logit contributions
urban-1.148
uan-1.127
omon-1.100
ドラゴン-1.082
vin-1.063
Token one
Token ID530
Feature activation+4.13e-01

Pos logit contributions
urban+0.147
uan+0.145
omon+0.141
ドラゴン+0.139
vin+0.137
Neg logit contributions
 disclosing-0.112
 anomaly-0.107
 breach-0.106
 cous-0.105
 disg-0.104
Token,
Token ID11
Feature activation+1.15e+00

Pos logit contributions
urban+0.303
uan+0.297
omon+0.289
ドラゴン+0.284
vin+0.281
Neg logit contributions
 disclosing-0.299
 anomaly-0.288
 breach-0.286
 cous-0.283
 disg-0.281
Token the
Token ID262
Feature activation-6.30e-01

Pos logit contributions
urban+0.934
uan+0.915
omon+0.890
ドラゴン+0.875
vin+0.868
Neg logit contributions
 disclosing-0.827
 anomaly-0.792
 breach-0.788
 cous-0.778
 disg-0.773
Token carnage
Token ID39857
Feature activation+4.32e-01

Pos logit contributions
 disclosing+0.429
 anomaly+0.414
 breach+0.410
 cous+0.404
 disg+0.402
Neg logit contributions
urban-0.490
uan-0.482
omon-0.470
ドラゴン-0.462
vin-0.456
Token for
Token ID329
Feature activation-2.31e-02

Pos logit contributions
urban+0.317
uan+0.311
omon+0.303
ドラゴン+0.297
vin+0.293
Neg logit contributions
 disclosing-0.314
 anomaly-0.303
 breach-0.301
 cous-0.297
 disg-0.295
Token now
Token ID783
Feature activation-3.59e-01

Pos logit contributions
 disclosing+0.014
 anomaly+0.013
 breach+0.013
 cous+0.013
 disg+0.013
Neg logit contributions
urban-0.020
uan-0.020
omon-0.019
ドラゴン-0.019
vin-0.019
Token and
Token ID290
Feature activation-5.86e-01

Pos logit contributions
 disclosing+0.255
 anomaly+0.246
 breach+0.243
 disg+0.239
 cous+0.239
Neg logit contributions
urban-0.269
uan-0.264
omon-0.257
ドラゴン-0.252
vin-0.249
Token take
Token ID1011
Feature activation-3.80e-01

Pos logit contributions
 disclosing+0.412
 anomaly+0.397
 breach+0.394
 cous+0.388
 disg+0.387
Neg logit contributions
urban-0.445
uan-0.436
omon-0.425
ドラゴン-0.418
vin-0.412
Token comfort
Token ID4467
Feature activation+1.90e+00

Pos logit contributions
 disclosing+0.229
 anomaly+0.219
 breach+0.216
 cous+0.213
 disg+0.212
Neg logit contributions
urban-0.326
uan-0.321
omon-0.314
ドラゴン-0.309
vin-0.306
Token in
Token ID287
Feature activation-3.85e-01

Pos logit contributions
urban+1.325
uan+1.300
omon+1.263
ドラゴン+1.239
vin+1.222
Neg logit contributions
 disclosing-1.443
 anomaly-1.395
 breach-1.384
 cous-1.368
 disg-1.361
Token remembering
Token ID24865
Feature activation-1.36e-01

Pos logit contributions
 disclosing+0.261
 anomaly+0.251
 breach+0.248
 cous+0.245
 disg+0.244
Neg logit contributions
urban-0.302
uan-0.297
omon-0.289
ドラゴン-0.284
vin-0.281
Token that
Token ID326
Feature activation-2.32e-01

Pos logit contributions
 disclosing+0.096
 anomaly+0.092
 breach+0.091
 cous+0.090
 disg+0.090
Neg logit contributions
urban-0.102
uan-0.100
omon-0.098
ドラゴン-0.096
vin-0.095
Token when
Token ID618
Feature activation-1.20e+00

Pos logit contributions
 disclosing+0.163
 anomaly+0.157
 breach+0.156
 cous+0.154
 disg+0.153
Neg logit contributions
urban-0.175
uan-0.172
omon-0.167
ドラゴン-0.164
vin-0.163
Token set
Token ID900
Feature activation+9.70e-01

Pos logit contributions
urban+0.203
uan+0.200
omon+0.195
ドラゴン+0.192
vin+0.189
Neg logit contributions
 disclosing-0.162
 anomaly-0.155
 breach-0.154
 cous-0.152
 disg-0.151
Token of
Token ID286
Feature activation-2.38e-01

Pos logit contributions
urban+0.839
uan+0.827
omon+0.808
ドラゴン+0.793
vin+0.787
Neg logit contributions
 disclosing-0.571
 anomaly-0.548
 breach-0.541
 cous-0.536
 disg-0.531
Token TM
Token ID21232
Feature activation-4.23e-01

Pos logit contributions
 disclosing+0.186
 anomaly+0.180
 breach+0.179
 cous+0.177
 disg+0.176
Neg logit contributions
urban-0.160
uan-0.157
omon-0.153
ドラゴン-0.149
vin-0.147
TokenNT
Token ID11251
Feature activation+9.86e-01

Pos logit contributions
 disclosing+0.253
 anomaly+0.241
 breach+0.238
 cous+0.237
 disg+0.236
Neg logit contributions
urban-0.365
uan-0.361
omon-0.352
ドラゴン-0.346
vin-0.342
Token II
Token ID2873
Feature activation-4.23e-01

Pos logit contributions
urban+0.758
uan+0.745
omon+0.726
ドラゴン+0.714
vin+0.705
Neg logit contributions
 disclosing-0.679
 anomaly-0.653
 breach-0.646
 cous-0.640
 disg-0.637
Token?
Token ID30
Feature activation-1.07e-01

Pos logit contributions
 disclosing+0.264
 anomaly+0.253
 breach+0.251
 cous+0.246
 disg+0.244
Neg logit contributions
urban-0.354
uan-0.348
omon-0.339
ドラゴン-0.334
vin-0.330
Token What
Token ID1867
Feature activation+3.41e-01

Pos logit contributions
 disclosing+0.071
 anomaly+0.068
 breach+0.067
 cous+0.067
 disg+0.066
Neg logit contributions
urban-0.085
uan-0.083
omon-0.081
ドラゴン-0.080
vin-0.079
Token other
Token ID584
Feature activation+1.15e+00

Pos logit contributions
urban+0.278
uan+0.274
omon+0.267
ドラゴン+0.262
vin+0.260
Neg logit contributions
 disclosing-0.219
 anomaly-0.210
 breach-0.208
 cous-0.205
 disg-0.204
Token characters
Token ID3435
Feature activation+6.04e-01

Pos logit contributions
urban+0.883
uan+0.870
omon+0.847
ドラゴン+0.833
vin+0.823
Neg logit contributions
 disclosing-0.787
 anomaly-0.755
 breach-0.749
 cous-0.742
 disg-0.740
Token might
Token ID1244
Feature activation+4.61e-01

Pos logit contributions
urban+0.445
uan+0.439
omon+0.427
ドラゴン+0.417
vin+0.414
Neg logit contributions
 disclosing-0.433
 anomaly-0.418
 breach-0.414
 cous-0.410
 disg-0.407
Token be
Token ID307
Feature activation+6.71e-01

Pos logit contributions
urban+0.348
uan+0.343
omon+0.333
ドラゴン+0.327
vin+0.324
Neg logit contributions
 disclosing-0.323
 anomaly-0.312
 breach-0.308
 cous-0.305
 disg-0.303
Token cooperate
Token ID21270
Feature activation+2.11e-01

Pos logit contributions
 disclosing+0.355
 anomaly+0.344
 breach+0.339
 cous+0.337
 ANGEL+0.334
Neg logit contributions
urban-0.337
uan-0.331
omon-0.321
ドラゴン-0.314
vin-0.311
Token in
Token ID287
Feature activation-7.62e-01

Pos logit contributions
urban+0.152
uan+0.149
omon+0.145
ドラゴン+0.143
vin+0.141
Neg logit contributions
 disclosing-0.155
 anomaly-0.150
 breach-0.149
 cous-0.147
 disg-0.146
Token their
Token ID511
Feature activation-2.43e-01

Pos logit contributions
 disclosing+0.512
 anomaly+0.493
 breach+0.488
 cous+0.482
 disg+0.479
Neg logit contributions
urban-0.599
uan-0.588
omon-0.574
ドラゴン-0.564
vin-0.558
Token development
Token ID2478
Feature activation+1.45e-01

Pos logit contributions
 disclosing+0.166
 anomaly+0.160
 breach+0.158
 cous+0.157
 disg+0.155
Neg logit contributions
urban-0.188
uan-0.185
omon-0.180
ドラゴン-0.177
vin-0.175
Token and
Token ID290
Feature activation+8.41e-01

Pos logit contributions
urban+0.108
uan+0.106
omon+0.103
ドラゴン+0.101
vin+0.100
Neg logit contributions
 disclosing-0.104
 anomaly-0.101
 breach-0.100
 cous-0.099
 disg-0.098
Token marketing
Token ID7124
Feature activation-2.64e-01

Pos logit contributions
urban+0.578
uan+0.569
omon+0.551
ドラゴン+0.537
vin+0.534
Neg logit contributions
 disclosing-0.640
 anomaly-0.621
 breach-0.614
 cous-0.611
 disg-0.606
Token efforts
Token ID4040
Feature activation-8.63e-01

Pos logit contributions
 disclosing+0.197
 anomaly+0.190
 breach+0.188
 cous+0.187
 disg+0.186
Neg logit contributions
urban-0.188
uan-0.185
omon-0.179
ドラゴン-0.176
vin-0.174
Token.
Token ID13
Feature activation-8.91e-02

Pos logit contributions
 disclosing+0.620
 anomaly+0.594
 breach+0.591
 cous+0.581
 disg+0.578
Neg logit contributions
urban-0.641
uan-0.628
omon-0.611
ドラゴン-0.603
vin-0.592
Token
Token ID198
Feature activation-2.45e-01

Pos logit contributions
 disclosing+0.064
 anomaly+0.061
 breach+0.061
 cous+0.060
 disg+0.060
Neg logit contributions
urban-0.066
uan-0.065
omon-0.063
ドラゴン-0.062
vin-0.061
Token
Token ID198
Feature activation-2.21e-01

Pos logit contributions
 disclosing+0.184
 anomaly+0.178
 breach+0.176
 disg+0.176
 cous+0.176
Neg logit contributions
urban-0.174
uan-0.171
omon-0.165
ドラゴン-0.162
vin-0.161
TokenIn
Token ID818
Feature activation+1.69e-01

Pos logit contributions
 disclosing+0.165
 anomaly+0.159
 breach+0.158
 cous+0.156
 disg+0.155
Neg logit contributions
urban-0.158
uan-0.155
omon-0.151
ドラゴン-0.148
vin-0.146
Token
Token ID198
Feature activation+7.51e-02

Pos logit contributions
urban+0.877
uan+0.863
omon+0.839
ドラゴン+0.822
vin+0.816
Neg logit contributions
 disclosing-0.817
 anomaly-0.788
 breach-0.782
 cous-0.773
 disg-0.769
Token
Token ID198
Feature activation+3.12e-01

Pos logit contributions
urban+0.053
uan+0.052
omon+0.050
ドラゴン+0.049
vin+0.049
Neg logit contributions
 disclosing-0.057
 anomaly-0.055
 disg-0.054
 breach-0.054
 cous-0.054
TokenCustom
Token ID15022
Feature activation+1.60e+00

Pos logit contributions
urban+0.222
uan+0.218
omon+0.211
ドラゴン+0.208
vin+0.205
Neg logit contributions
 disclosing-0.234
 anomaly-0.226
 breach-0.224
 cous-0.221
 disg-0.220
Tokenarily
Token ID3093
Feature activation-3.50e-01

Pos logit contributions
urban+1.180
uan+1.161
omon+1.127
ドラゴン+1.101
vin+1.094
Neg logit contributions
 disclosing-1.150
 anomaly-1.111
 breach-1.099
 cous-1.088
 ANGEL-1.084
Token,
Token ID11
Feature activation-1.24e+00

Pos logit contributions
 disclosing+0.245
 anomaly+0.236
 breach+0.234
 cous+0.231
 disg+0.230
Neg logit contributions
urban-0.265
uan-0.261
omon-0.254
ドラゴン-0.249
vin-0.246
Token naming
Token ID19264
Feature activation-2.88e+00

Pos logit contributions
 disclosing+0.852
 anomaly+0.817
 breach+0.812
 cous+0.797
 disg+0.795
Neg logit contributions
urban-0.956
uan-0.937
omon-0.916
ドラゴン-0.901
vin-0.886
Token a
Token ID257
Feature activation+3.55e-01

Pos logit contributions
 disclosing+1.852
 anomaly+1.777
 breach+1.765
 cous+1.730
 disg+1.724
Neg logit contributions
urban-2.355
uan-2.301
omon-2.261
ドラゴン-2.223
vin-2.191
Token VP
Token ID23342
Feature activation-8.35e-01

Pos logit contributions
urban+0.271
uan+0.266
omon+0.259
ドラゴン+0.255
vin+0.252
Neg logit contributions
 disclosing-0.246
 anomaly-0.237
 breach-0.235
 cous-0.232
 disg-0.231
Token and
Token ID290
Feature activation-5.47e-01

Pos logit contributions
 disclosing+0.608
 anomaly+0.587
 breach+0.583
 cous+0.573
 disg+0.570
Neg logit contributions
urban-0.611
uan-0.598
omon-0.583
ドラゴン-0.573
vin-0.565
Token the
Token ID262
Feature activation-3.85e-01

Pos logit contributions
 disclosing+0.384
 anomaly+0.372
 breach+0.368
 cous+0.364
 disg+0.361
Neg logit contributions
urban-0.411
uan-0.404
omon-0.393
ドラゴン-0.385
vin-0.382
Token subsequent
Token ID8840
Feature activation+9.04e-01

Pos logit contributions
 disclosing+0.267
 anomaly+0.258
 breach+0.255
 cous+0.253
 disg+0.251
Neg logit contributions
urban-0.294
uan-0.289
omon-0.281
ドラゴン-0.276
vin-0.273
Token,
Token ID11
Feature activation+6.35e-01

Pos logit contributions
 disclosing+0.427
 anomaly+0.411
 breach+0.409
 cous+0.402
 disg+0.401
Neg logit contributions
urban-0.419
uan-0.411
omon-0.400
ドラゴン-0.393
vin-0.388
Token but
Token ID475
Feature activation-9.05e-01

Pos logit contributions
urban+0.457
uan+0.452
omon+0.436
ドラゴン+0.426
vin+0.423
Neg logit contributions
 disclosing-0.457
 anomaly-0.448
 breach-0.440
 cous-0.438
 ANGEL-0.437
Token a
Token ID257
Feature activation-3.40e-01

Pos logit contributions
 disclosing+0.593
 anomaly+0.574
 breach+0.566
 cous+0.561
 disg+0.557
Neg logit contributions
urban-0.720
uan-0.710
omon-0.690
ドラゴン-0.676
vin-0.671
Token person
Token ID1048
Feature activation+1.70e-01

Pos logit contributions
 disclosing+0.236
 anomaly+0.228
 breach+0.224
 cous+0.223
 ANGEL+0.222
Neg logit contributions
urban-0.257
uan-0.254
omon-0.247
ドラゴン-0.242
vin-0.240
Token of
Token ID286
Feature activation+4.23e-01

Pos logit contributions
urban+0.128
uan+0.127
omon+0.122
ドラゴン+0.120
vin+0.120
Neg logit contributions
 disclosing-0.117
 anomaly-0.113
 breach-0.112
 cous-0.112
 ANGEL-0.111
Token interest
Token ID1393
Feature activation-3.07e+00

Pos logit contributions
urban+0.269
uan+0.263
omon+0.255
ドラゴン+0.250
vin+0.246
Neg logit contributions
 disclosing-0.349
 anomaly-0.338
 breach-0.336
 cous-0.332
 disg-0.330
Token in
Token ID287
Feature activation+4.91e-02

Pos logit contributions
 disclosing+2.256
 anomaly+2.150
 breach+2.148
 cous+2.093
 disg+2.092
Neg logit contributions
urban-2.270
uan-2.204
omon-2.172
ドラゴン-2.147
vin-2.087
Token the
Token ID262
Feature activation+2.93e-01

Pos logit contributions
urban+0.038
uan+0.038
omon+0.037
ドラゴン+0.036
vin+0.036
Neg logit contributions
 disclosing-0.033
 anomaly-0.032
 breach-0.031
 cous-0.031
 ANGEL-0.031
Token Pentagon
Token ID12651
Feature activation+4.82e-01

Pos logit contributions
urban+0.240
uan+0.239
omon+0.233
ドラゴン+0.227
vin+0.225
Neg logit contributions
 disclosing-0.179
 anomaly-0.173
 cous-0.171
 ANGEL-0.170
 disg-0.169
Token's
Token ID338
Feature activation-2.29e-02

Pos logit contributions
urban+0.384
uan+0.381
omon+0.369
ドラゴン+0.360
vin+0.358
Neg logit contributions
 disclosing-0.310
 anomaly-0.301
 cous-0.299
 ANGEL-0.297
 breach-0.294
Token investigation
Token ID3645
Feature activation-2.20e+00

Pos logit contributions
 disclosing+0.015
 anomaly+0.014
 cous+0.014
 disg+0.014
 breach+0.014
Neg logit contributions
urban-0.018
uan-0.018
omon-0.018
ドラゴン-0.017
vin-0.017
Token,
Token ID11
Feature activation+9.91e-02

Pos logit contributions
 disclosing+0.637
 anomaly+0.616
 breach+0.610
 cous+0.603
 disg+0.598
Neg logit contributions
urban-0.704
uan-0.686
omon-0.670
ドラゴン-0.669
vin-0.649
Token M
Token ID337
Feature activation-1.82e-01

Pos logit contributions
urban+0.074
uan+0.073
omon+0.071
ドラゴン+0.070
vin+0.069
Neg logit contributions
 disclosing-0.070
 anomaly-0.067
 breach-0.067
 disg-0.066
 cous-0.066
Token.
Token ID13
Feature activation-1.10e-01

Pos logit contributions
 disclosing+0.111
 anomaly+0.107
 breach+0.106
 cous+0.104
 disg+0.103
Neg logit contributions
urban-0.154
uan-0.151
omon-0.148
ドラゴン-0.146
vin-0.144
Token (
Token ID357
Feature activation-1.62e-01

Pos logit contributions
 disclosing+0.076
 anomaly+0.073
 breach+0.072
 cous+0.072
 disg+0.072
Neg logit contributions
urban-0.084
uan-0.083
omon-0.080
ドラゴン-0.079
vin-0.078
Token 2001
Token ID5878
Feature activation+8.02e-01

Pos logit contributions
 disclosing+0.113
 anomaly+0.109
 breach+0.108
 cous+0.106
 disg+0.106
Neg logit contributions
urban-0.124
uan-0.122
omon-0.119
ドラゴン-0.117
vin-0.116
Token ).
Token ID6739
Feature activation-3.57e+00

Pos logit contributions
urban+0.533
uan+0.522
omon+0.506
ドラゴン+0.496
vin+0.489
Neg logit contributions
 disclosing-0.638
 anomaly-0.617
 breach-0.612
 cous-0.605
 disg-0.603
Token Domin
Token ID11817
Feature activation-9.63e-01

Pos logit contributions
 disclosing+2.404
 anomaly+2.318
 breach+2.292
 ANGEL+2.274
 cous+2.241
Neg logit contributions
urban-2.760
uan-2.736
omon-2.664
ドラゴン-2.619
vin-2.577
Tokenance
Token ID590
Feature activation-1.26e+00

Pos logit contributions
 disclosing+0.712
 anomaly+0.687
 breach+0.681
 cous+0.673
 disg+0.670
Neg logit contributions
urban-0.694
uan-0.681
omon-0.663
ドラゴン-0.649
vin-0.641
Token in
Token ID287
Feature activation-7.36e-01

Pos logit contributions
 disclosing+0.877
 anomaly+0.847
 breach+0.839
 cous+0.826
 disg+0.821
Neg logit contributions
urban-0.960
uan-0.942
omon-0.919
ドラゴン-0.903
vin-0.890
Token early
Token ID1903
Feature activation+1.35e+00

Pos logit contributions
 disclosing+0.529
 anomaly+0.509
 breach+0.506
 cous+0.498
 disg+0.495
Neg logit contributions
urban-0.546
uan-0.537
omon-0.522
ドラゴン-0.514
vin-0.506
Token adolescent
Token ID28680
Feature activation+6.82e-01

Pos logit contributions
urban+0.825
uan+0.808
omon+0.780
ドラゴン+0.760
vin+0.752
Neg logit contributions
 disclosing-1.140
 anomaly-1.107
 breach-1.097
 cous-1.087
 disg-1.083
Token (
Token ID357
Feature activation-8.61e-01

Pos logit contributions
 disclosing+0.313
 anomaly+0.301
 breach+0.299
 cous+0.292
 disg+0.291
Neg logit contributions
urban-0.380
uan-0.373
omon-0.365
ドラゴン-0.359
vin-0.354
TokenU
Token ID52
Feature activation+1.43e+00

Pos logit contributions
 disclosing+0.685
 anomaly+0.662
 breach+0.661
 cous+0.649
 ANGEL+0.645
Neg logit contributions
urban-0.561
uan-0.552
omon-0.541
ドラゴン-0.528
vin-0.516
TokenBI
Token ID3483
Feature activation+1.22e+00

Pos logit contributions
urban+1.233
uan+1.214
omon+1.185
ドラゴン+1.166
vin+1.156
Neg logit contributions
 disclosing-0.859
 anomaly-0.821
 breach-0.809
 cous-0.801
 disg-0.797
Token)
Token ID8
Feature activation+8.23e-02

Pos logit contributions
urban+0.950
uan+0.932
omon+0.909
ドラゴン+0.894
vin+0.882
Neg logit contributions
 disclosing-0.836
 anomaly-0.805
 breach-0.798
 cous-0.786
 disg-0.781
Token or
Token ID393
Feature activation-1.63e+00

Pos logit contributions
urban+0.068
uan+0.067
omon+0.065
ドラゴン+0.064
vin+0.063
Neg logit contributions
 disclosing-0.052
 anomaly-0.050
 breach-0.050
 cous-0.049
 disg-0.049
Token guaranteed
Token ID11462
Feature activation-2.81e+00

Pos logit contributions
 disclosing+1.075
 anomaly+1.037
 breach+1.031
 cous+1.011
 disg+1.006
Neg logit contributions
urban-1.298
uan-1.274
omon-1.248
ドラゴン-1.232
vin-1.210
Token minimum
Token ID5288
Feature activation-5.96e-01

Pos logit contributions
 disclosing+1.953
 breach+1.890
 anomaly+1.889
 disg+1.839
 cous+1.824
Neg logit contributions
urban-2.126
uan-2.087
omon-2.063
ドラゴン-2.031
vin-1.999
Token income
Token ID3739
Feature activation-3.30e-01

Pos logit contributions
 disclosing+0.395
 anomaly+0.380
 breach+0.378
 cous+0.368
 disg+0.368
Neg logit contributions
urban-0.476
uan-0.467
omon-0.458
ドラゴン-0.452
vin-0.445
Token (
Token ID357
Feature activation+1.03e-01

Pos logit contributions
 disclosing+0.224
 anomaly+0.217
 breach+0.214
 cous+0.210
 disg+0.209
Neg logit contributions
urban-0.258
uan-0.253
omon-0.248
ドラゴン-0.244
vin-0.240
TokenG
Token ID38
Feature activation+7.42e-01

Pos logit contributions
urban+0.073
uan+0.072
omon+0.070
ドラゴン+0.069
vin+0.068
Neg logit contributions
 disclosing-0.077
 anomaly-0.074
 breach-0.074
 cous-0.073
 disg-0.072
TokenMI
Token ID8895
Feature activation-3.18e-01

Pos logit contributions
urban+0.600
uan+0.594
omon+0.575
vin+0.561
ドラゴン+0.560
Neg logit contributions
 disclosing-0.487
 anomaly-0.463
 breach-0.461
 cous-0.459
 disg-0.453
Tokenella
Token ID12627
Feature activation-4.24e-01

Pos logit contributions
urban+0.163
uan+0.160
omon+0.157
ドラゴン+0.154
vin+0.153
Neg logit contributions
 disclosing-0.096
 anomaly-0.091
 breach-0.090
 cous-0.088
 disg-0.088
Token Vol
Token ID4709
Feature activation+7.85e-01

Pos logit contributions
 disclosing+0.348
 anomaly+0.338
 breach+0.336
 cous+0.332
 ANGEL+0.329
Neg logit contributions
urban-0.269
uan-0.263
omon-0.256
ドラゴン-0.251
vin-0.246
Tokenpe
Token ID431
Feature activation+5.72e-02

Pos logit contributions
urban+0.705
uan+0.695
omon+0.677
vin+0.665
ドラゴン+0.658
Neg logit contributions
 disclosing-0.451
 anomaly-0.427
 disg-0.424
 breach-0.423
 cous-0.418
Token.
Token ID13
Feature activation-5.05e-01

Pos logit contributions
urban+0.043
uan+0.042
omon+0.041
ドラゴン+0.041
vin+0.040
Neg logit contributions
 disclosing-0.040
 anomaly-0.039
 breach-0.039
 cous-0.038
 disg-0.038
Token At
Token ID1629
Feature activation+2.24e-01

Pos logit contributions
 disclosing+0.340
 anomaly+0.327
 breach+0.324
 cous+0.319
 ANGEL+0.318
Neg logit contributions
urban-0.396
uan-0.389
omon-0.380
ドラゴン-0.373
vin-0.367
Token worse
Token ID4785
Feature activation-2.93e+00

Pos logit contributions
urban+0.168
uan+0.165
omon+0.161
ドラゴン+0.158
vin+0.156
Neg logit contributions
 disclosing-0.159
 anomaly-0.153
 breach-0.152
 cous-0.150
 disg-0.149
Token,
Token ID11
Feature activation-6.32e-01

Pos logit contributions
 disclosing+1.927
 breach+1.853
 anomaly+1.848
 cous+1.814
 disg+1.794
Neg logit contributions
urban-2.346
uan-2.277
omon-2.237
ドラゴン-2.188
-2.171
Token it
Token ID340
Feature activation-1.50e-01

Pos logit contributions
 disclosing+0.453
 anomaly+0.433
 breach+0.431
 cous+0.425
 disg+0.422
Neg logit contributions
urban-0.474
uan-0.465
omon-0.456
ドラゴン-0.449
vin-0.438
Token insults
Token ID27819
Feature activation+6.49e-01

Pos logit contributions
 disclosing+0.107
 anomaly+0.103
 breach+0.102
 cous+0.101
 disg+0.100
Neg logit contributions
urban-0.112
uan-0.110
omon-0.107
ドラゴン-0.105
vin-0.104
Token them
Token ID606
Feature activation+7.73e-01

Pos logit contributions
urban+0.520
uan+0.511
omon+0.498
ドラゴン+0.490
vin+0.484
Neg logit contributions
 disclosing-0.426
 anomaly-0.410
 breach-0.406
 cous-0.400
 disg-0.399
Token.
Token ID13
Feature activation+3.72e-01

Pos logit contributions
urban+0.562
uan+0.552
omon+0.537
ドラゴン+0.526
vin+0.520
Neg logit contributions
 disclosing-0.564
 anomaly-0.545
 breach-0.540
 cous-0.533
 disg-0.531