TOP ACTIVATIONS
MAX = 2.005

128 - bit remains one of the most common implementations .[
Not a day goes by that I don
s Get Chance as In juries Bite Time for
and I myself will be no more . Why , then
( Fun fact : Gu illa ume is also the only
times for Monk ), Gu illa ume ( nom inated five
ma v . Goodell Am ended Compl aint Joe
should actually be driven towards playing Champions Trophy and World Cup
, it appears no other thing to me
of siege and starvation are no less grave in their calam
AS ES Phase 1 : No infections in humans are being
arl al who was climbing behind her . They were rushed
and D endi st rolled off to entertain ( or be
will be higher if they keep working , and their
Mom : Jen elle Evans Shows Off Her Brand New Forever
omp kins was already back on her way to work .
their age stand a lot better chance of getting honest
changed it to 1 mana , sheep a random minion .
infections in humans are being caused by viruses circulating in animals
The further along he walks , the greater the destruction and

MIN ACTIVATIONS
MIN = -2.311

NASA s seventh consecutive lunar mission failure . NASA
m 2 k [ 10 : 48 ] < min ion
m 2 k [ 10 : 48 ] < m ew
Koreans are forced to adhere to the J uc he ideology
m 4 k [ 10 : 48 ] < mom per
These books are available at most book stores that have a
return through the V FL this week after recovering from a
more than there were at the start of his first term
J M asc is , and a book of unused work
will return to full training this week after an abdominal injury
to be available for selection this week at V FL level
as response and remission rates at week four . In addition
and what it means to be a trans human . That
m 2 k [ 10 : 48 ] < qu ij
punishment for the perceived crimes of watching South Korean movies ,
, $ 22 MM contract this past offseason due largely to
Koreans are forced to adhere to a doctrine which mixes Marxism
ideology - a doctrine which mixes Marxism with the worship of
will identify those patterns and simultaneously send them to an external
, published by Sp url this month .

INTERVAL 0.926 - 2.005
CONTAINS 1.125%

6 percent of public school teachers change occupations or exit the
. This paper presents a new methodology , a new approach
to the angle - of - attack data by commanding the
thoughts in the moment of playing , even if those intentions
Act III remains to be played . Just when it will

INTERVAL -0.153 - 0.926
CONTAINS 64.690%

in Mir a Road . The Director of Seven Eleven Construction
has composed and performed the title song ( C omet Song
have good benefits , everybody gains from that . Watching sometimes
of you should stay in the mountain shack , and please
State and early in his rookie season , when he challenged

INTERVAL -1.232 - -0.153
CONTAINS 33.963%

with daily , listen to those people , and amplify their
. Heavy class drivers have higher top speeds and more weight
in Pre y as well . But where Bio Shock presented
Group <|endoftext|> W ired Equ ivalent Privacy ( WE P )
poll ster Stan Greenberg told left - leaning Salon

INTERVAL -2.311 - -1.232
CONTAINS 0.222%

tourist without diplomatic immunity . At some point , the top
marked its 100 th anniversary this year , split into two
, and perhaps struggle to convey to others , that their
first C EL AC summit last year . Before
ll be a definite enhancement in intercourse , he
Token 128
Token ID13108
Feature activation+4.04e-01

Pos logit contributions
 cs+6.798
 postp+6.751
+6.685
 Invention+6.459
 Offense+6.139
Neg logit contributions
orge-7.131
yrinth-6.585
olphin-6.466
rss-6.229
ragon-6.204
Token-
Token ID12
Feature activation-1.01e-01

Pos logit contributions
+4.344
 cs+4.248
 postp+4.212
 Invention+3.953
vati+3.942
Neg logit contributions
orge-4.066
yrinth-3.738
olphin-3.679
 depths-3.657
rss-3.560
Tokenbit
Token ID2545
Feature activation+5.59e-01

Pos logit contributions
orge+0.520
 depths+0.428
yrinth+0.428
sent+0.425
olphin+0.420
Neg logit contributions
-1.572
 postp-1.552
 cs-1.541
 Invention-1.483
vati-1.479
Token remains
Token ID3793
Feature activation+1.00e+00

Pos logit contributions
+5.184
 cs+5.158
 postp+5.142
 Invention+4.833
 Offense+4.711
Neg logit contributions
orge-6.112
yrinth-5.805
olphin-5.656
rss-5.414
 depths-5.393
Token one
Token ID530
Feature activation+9.25e-01

Pos logit contributions
 cs+8.858
+8.749
 postp+8.708
 Invention+8.245
 Offense+7.786
Neg logit contributions
orge-10.439
yrinth-10.130
olphin-9.605
 depths-9.214
rss-9.186
Token of
Token ID286
Feature activation+2.01e+00

Pos logit contributions
+8.710
 cs+8.603
 postp+8.298
 Invention+7.740
vati+7.578
Neg logit contributions
orge-10.005
yrinth-9.379
olphin-9.014
rss-8.821
 depths-8.781
Token the
Token ID262
Feature activation+8.81e-01

Pos logit contributions
 cs+14.625
+14.440
 Invention+13.741
 postp+13.683
 Belg+13.026
Neg logit contributions
orge-18.433
yrinth-17.328
olphin-16.458
ragon-16.285
sent-16.172
Token most
Token ID749
Feature activation+2.63e-01

Pos logit contributions
 cs+9.109
+8.839
 postp+8.812
 Invention+8.452
 Offense+8.140
Neg logit contributions
orge-8.131
yrinth-7.701
olphin-7.574
rss-7.212
ragon-7.107
Token common
Token ID2219
Feature activation+3.35e-01

Pos logit contributions
+2.658
 cs+2.627
 postp+2.606
 Invention+2.439
vati+2.387
Neg logit contributions
orge-2.773
yrinth-2.572
olphin-2.532
 depths-2.480
rss-2.479
Token implementations
Token ID25504
Feature activation+4.10e-01

Pos logit contributions
+3.581
 cs+3.567
 postp+3.536
 Invention+3.333
vati+3.242
Neg logit contributions
orge-3.327
yrinth-3.090
olphin-3.046
rss-2.943
 depths-2.891
Token.[
Token ID3693
Feature activation+1.24e+00

Pos logit contributions
+4.176
 cs+4.118
 postp+4.097
 Invention+3.823
vati+3.774
Neg logit contributions
orge-4.267
yrinth-3.990
olphin-3.865
 depths-3.814
rss-3.741
Token
Token ID198
Feature activation+3.25e-02

Pos logit contributions
+0.866
 cs+0.839
 postp+0.814
 Invention+0.759
 Offense+0.748
Neg logit contributions
yrinth-1.240
orge-1.229
olphin-1.169
ーテ-1.152
 depths-1.132
TokenNot
Token ID3673
Feature activation+3.82e-01

Pos logit contributions
+0.320
 cs+0.313
 postp+0.311
 Invention+0.290
vati+0.289
Neg logit contributions
orge-0.356
yrinth-0.328
olphin-0.324
 depths-0.323
rss-0.316
Token a
Token ID257
Feature activation+5.07e-02

Pos logit contributions
+3.847
 cs+3.817
 postp+3.780
vati+3.465
 Offense+3.446
Neg logit contributions
orge-3.972
yrinth-3.713
olphin-3.658
 depths-3.611
rss-3.494
Token day
Token ID1110
Feature activation+6.54e-01

Pos logit contributions
+0.511
 cs+0.498
 postp+0.493
 Invention+0.461
vati+0.461
Neg logit contributions
orge-0.545
yrinth-0.503
olphin-0.496
 depths-0.495
rss-0.482
Token goes
Token ID2925
Feature activation+1.05e+00

Pos logit contributions
 postp+5.204
 cs+5.142
+5.123
 Invention+4.417
 Offense+4.414
Neg logit contributions
orge-7.932
yrinth-7.742
olphin-7.422
 WRITE-7.206
 depths-7.177
Token by
Token ID416
Feature activation+2.00e+00

Pos logit contributions
 postp+5.894
 cs+5.592
+5.117
 err+4.641
vati+4.260
Neg logit contributions
orge-14.844
yrinth-14.575
olphin-13.814
ragon-13.387
rss-13.333
Token that
Token ID326
Feature activation+9.19e-01

Pos logit contributions
 postp+13.830
+12.692
 cs+12.521
 err+11.593
 Skype+11.193
Neg logit contributions
yrinth-18.622
orge-18.333
olphin-18.168
ragon-17.185
-16.764
Token I
Token ID314
Feature activation+9.95e-01

Pos logit contributions
 postp+7.895
 cs+7.746
+7.677
 Invention+6.821
 Offense+6.771
Neg logit contributions
orge-10.023
olphin-9.607
yrinth-9.586
ragon-9.057
rss-9.043
Token don
Token ID836
Feature activation+5.01e-01

Pos logit contributions
 postp+8.272
 cs+8.056
+7.456
 defer+7.314
 Skype+7.278
Neg logit contributions
orge-10.692
yrinth-10.438
olphin-10.363
ーテ-10.293
rss-9.781
Token
Token ID447
Feature activation+3.71e-01

Pos logit contributions
+2.676
 cs+2.509
 postp+2.267
ickets+2.069
microsoft+1.994
Neg logit contributions
yrinth-7.687
ーテ-7.350
orge-7.272
olphin-7.046
 depths-7.001
Token
Token ID247
Feature activation+5.79e-01

Pos logit contributions
+2.677
 cs+2.594
 postp+2.555
vati+2.285
 Invention+2.260
Neg logit contributions
orge-4.841
yrinth-4.770
olphin-4.639
 depths-4.554
 WRITE-4.537
Tokens
Token ID82
Feature activation+7.41e-01

Pos logit contributions
+6.550
 cs+6.529
 postp+6.335
 Offense+6.116
 Invention+6.055
Neg logit contributions
orge-6.549
yrinth-6.200
olphin-6.045
 depths-5.958
rss-5.875
Token Get
Token ID3497
Feature activation+1.00e+00

Pos logit contributions
 cs+6.925
+6.804
 postp+6.692
 Offense+6.543
 Invention+6.397
Neg logit contributions
orge-7.628
yrinth-7.471
olphin-7.239
 depths-7.056
rss-6.907
Token Chance
Token ID11809
Feature activation+9.24e-01

Pos logit contributions
 Offense+9.146
 cs+9.002
 Invention+8.831
+8.799
 postp+8.688
Neg logit contributions
orge-9.793
yrinth-9.641
 depths-9.044
olphin-8.863
ーテ-8.804
Token as
Token ID355
Feature activation+1.02e+00

Pos logit contributions
+7.353
 cs+7.249
 postp+6.930
 Offense+6.883
 Invention+6.734
Neg logit contributions
orge-10.587
yrinth-10.233
 depths-10.029
olphin-9.823
 darkest-9.695
Token In
Token ID554
Feature activation+5.16e-01

Pos logit contributions
 cs+9.715
 Offense+9.581
+9.387
 postp+9.221
 Invention+9.143
Neg logit contributions
orge-9.837
yrinth-9.567
olphin-9.041
 depths-8.822
XXXX-8.702
Tokenjuries
Token ID47496
Feature activation+1.90e+00

Pos logit contributions
+3.969
 cs+3.888
 postp+3.788
 Offense+3.501
 Invention+3.464
Neg logit contributions
orge-6.551
yrinth-6.201
olphin-6.081
 depths-6.044
 WRITE-5.990
Token Bite
Token ID44540
Feature activation+1.09e+00

Pos logit contributions
 Emer+14.884
 Offense+14.878
 cs+14.372
 Camera+14.225
 postp+14.074
Neg logit contributions
yrinth-15.960
ongyang-15.464
olphin-15.352
perial-14.948
orge-14.839
Token
Token ID198
Feature activation+1.91e-01

Pos logit contributions
 cs+9.839
+9.660
 Offense+9.572
 postp+9.288
 Emer+9.094
Neg logit contributions
orge-10.905
yrinth-10.594
olphin-9.988
 depths-9.668
ーテ-9.640
Token
Token ID198
Feature activation+5.79e-02

Pos logit contributions
+1.559
 cs+1.506
 postp+1.458
 Invention+1.340
 Offense+1.326
Neg logit contributions
yrinth-2.432
orge-2.374
ーテ-2.293
olphin-2.280
 depths-2.202
TokenTime
Token ID7575
Feature activation+3.88e-01

Pos logit contributions
+0.583
 cs+0.573
 postp+0.571
 Invention+0.534
vati+0.532
Neg logit contributions
orge-0.620
yrinth-0.569
olphin-0.562
 depths-0.558
rss-0.549
Token for
Token ID329
Feature activation+7.15e-01

Pos logit contributions
+4.031
 cs+3.958
 postp+3.928
 Invention+3.661
 Offense+3.649
Neg logit contributions
orge-3.976
yrinth-3.709
olphin-3.644
 depths-3.564
rss-3.497
Token and
Token ID290
Feature activation+1.02e-01

Pos logit contributions
orge+0.277
yrinth+0.255
 depths+0.252
olphin+0.251
rss+0.243
Neg logit contributions
-0.282
 cs-0.274
 postp-0.271
vati-0.255
 Invention-0.255
Token I
Token ID314
Feature activation+3.43e-01

Pos logit contributions
+1.035
 cs+1.014
 postp+1.008
 Invention+0.941
vati+0.934
Neg logit contributions
orge-1.093
yrinth-1.011
olphin-0.997
 depths-0.986
rss-0.967
Token myself
Token ID3589
Feature activation+4.30e-01

Pos logit contributions
 postp+3.548
 cs+3.538
+3.491
 Invention+3.238
vati+3.198
Neg logit contributions
orge-3.502
yrinth-3.256
olphin-3.210
rss-3.121
 depths-3.062
Token will
Token ID481
Feature activation+3.71e-01

Pos logit contributions
 postp+4.544
 cs+4.495
+4.453
 Invention+4.139
vati+4.093
Neg logit contributions
orge-4.265
yrinth-4.015
olphin-3.895
rss-3.795
 depths-3.740
Token be
Token ID307
Feature activation+3.86e-01

Pos logit contributions
 cs+3.959
 postp+3.941
+3.922
 Invention+3.618
 Offense+3.532
Neg logit contributions
orge-3.675
yrinth-3.419
olphin-3.325
rss-3.242
 depths-3.186
Token no
Token ID645
Feature activation+1.87e+00

Pos logit contributions
 postp+4.213
+4.167
 cs+4.158
 Invention+3.851
vati+3.837
Neg logit contributions
orge-3.729
yrinth-3.471
olphin-3.375
rss-3.330
 depths-3.269
Token more
Token ID517
Feature activation+4.81e-01

Pos logit contributions
 postp+14.414
 cs+13.970
 Invention+13.156
+13.106
 err+12.921
Neg logit contributions
yrinth-16.000
ongyang-14.977
rss-14.874
orge-14.740
olphin-14.470
Token.
Token ID13
Feature activation-1.25e-01

Pos logit contributions
 cs+4.481
 postp+4.449
+4.444
 Invention+4.075
vati+4.022
Neg logit contributions
orge-5.309
yrinth-5.058
olphin-4.893
rss-4.812
 depths-4.719
Token Why
Token ID4162
Feature activation+2.11e-01

Pos logit contributions
orge+1.370
yrinth+1.257
 depths+1.243
olphin+1.240
 WRITE+1.217
Neg logit contributions
-1.235
 cs-1.201
 postp-1.195
vati-1.119
 Invention-1.101
Token,
Token ID11
Feature activation-3.53e-01

Pos logit contributions
+2.271
 postp+2.251
 cs+2.248
 Invention+2.095
vati+2.063
Neg logit contributions
orge-2.097
yrinth-1.946
olphin-1.922
 depths-1.864
rss-1.840
Token then
Token ID788
Feature activation+3.91e-01

Pos logit contributions
orge+3.345
 depths+3.104
yrinth+3.051
 WRITE+2.973
 darkest+2.965
Neg logit contributions
-3.926
 cs-3.751
 postp-3.668
vati-3.578
ickets-3.568
Token (
Token ID357
Feature activation+4.42e-01

Pos logit contributions
+3.829
 cs+3.716
 postp+3.590
 Invention+3.525
 Offense+3.448
Neg logit contributions
orge-4.316
yrinth-4.153
olphin-4.032
 depths-3.998
rss-3.852
TokenFun
Token ID24629
Feature activation+6.84e-02

Pos logit contributions
+4.395
 cs+4.208
 postp+4.119
 Invention+3.912
vati+3.866
Neg logit contributions
orge-4.702
yrinth-4.475
olphin-4.370
 depths-4.336
 lows-4.171
Token fact
Token ID1109
Feature activation-1.60e-01

Pos logit contributions
+0.729
 cs+0.713
 postp+0.708
vati+0.661
 Invention+0.659
Neg logit contributions
orge-0.701
yrinth-0.642
 depths-0.633
olphin-0.632
rss-0.615
Token:
Token ID25
Feature activation+7.41e-01

Pos logit contributions
orge+1.359
yrinth+1.192
 depths+1.192
olphin+1.190
rss+1.182
Neg logit contributions
-1.974
 cs-1.920
 postp-1.905
vati-1.826
 Invention-1.800
Token Gu
Token ID1962
Feature activation+5.55e-01

Pos logit contributions
+6.904
 cs+6.899
 postp+6.735
 Invention+6.599
 Camera+6.383
Neg logit contributions
orge-7.726
yrinth-7.185
olphin-7.097
rss-6.891
ragon-6.879
Tokenilla
Token ID5049
Feature activation+1.85e+00

Pos logit contributions
+5.217
 postp+5.101
vati+5.036
 cs+4.916
ickets+4.894
Neg logit contributions
ーテ-5.740
orge-5.478
yrinth-5.188
 WRITE-5.145
 darkest-5.051
Tokenume
Token ID2454
Feature activation+2.86e-01

Pos logit contributions
ume+10.908
em+8.120
ifa+7.701
+7.457
ickets+7.407
Neg logit contributions
ーテ-22.663
yrinth-20.364
 lows-19.809
 darkest-19.405
 WRITE-19.271
Token is
Token ID318
Feature activation+8.34e-01

Pos logit contributions
+3.226
 cs+3.183
 postp+3.176
 Invention+2.974
vati+2.969
Neg logit contributions
orge-2.683
yrinth-2.482
olphin-2.432
 depths-2.377
rss-2.326
Token also
Token ID635
Feature activation+7.96e-01

Pos logit contributions
 postp+8.064
 cs+7.994
+7.876
 Invention+7.417
 Emer+7.390
Neg logit contributions
orge-8.222
yrinth-7.826
olphin-7.757
rss-7.400
XXXX-7.347
Token the
Token ID262
Feature activation+4.22e-01

Pos logit contributions
 cs+7.434
+7.430
 postp+7.425
 Invention+6.928
 Emer+6.819
Neg logit contributions
orge-8.133
yrinth-7.666
olphin-7.562
 depths-7.309
XXXX-7.275
Token only
Token ID691
Feature activation+1.53e+00

Pos logit contributions
 cs+3.899
+3.893
 postp+3.869
 Invention+3.672
vati+3.524
Neg logit contributions
orge-4.666
yrinth-4.404
olphin-4.342
 depths-4.199
rss-4.183
Token times
Token ID1661
Feature activation-3.21e-01

Pos logit contributions
orge+1.779
 depths+1.608
olphin+1.490
yrinth+1.479
 lows+1.476
Neg logit contributions
-4.896
 cs-4.770
 postp-4.713
vati-4.593
 Offense-4.540
Token for
Token ID329
Feature activation+7.06e-01

Pos logit contributions
orge+2.793
 depths+2.537
olphin+2.530
yrinth+2.501
sent+2.472
Neg logit contributions
-3.712
 cs-3.668
 postp-3.549
ickets-3.407
 Invention-3.404
Token Monk
Token ID28854
Feature activation-2.85e-01

Pos logit contributions
+6.398
 cs+6.328
 postp+6.197
 Invention+6.037
vati+5.816
Neg logit contributions
orge-7.662
yrinth-7.298
olphin-7.107
rss-7.018
 depths-6.897
Token),
Token ID828
Feature activation+3.55e-01

Pos logit contributions
orge+2.610
 depths+2.404
olphin+2.387
yrinth+2.384
 lows+2.312
Neg logit contributions
-3.166
 cs-3.106
 postp-3.066
vati-2.933
ickets-2.880
Token Gu
Token ID1962
Feature activation+4.67e-01

Pos logit contributions
+3.604
 postp+3.575
 cs+3.558
 Invention+3.311
vati+3.305
Neg logit contributions
orge-3.661
yrinth-3.447
olphin-3.353
rss-3.283
 depths-3.269
Tokenilla
Token ID5049
Feature activation+1.84e+00

Pos logit contributions
+4.485
 postp+4.408
vati+4.336
 cs+4.265
ickets+4.171
Neg logit contributions
ーテ-4.852
orge-4.628
yrinth-4.428
 WRITE-4.341
 darkest-4.336
Tokenume
Token ID2454
Feature activation-1.73e-01

Pos logit contributions
ume+9.910
ifa+6.668
em+6.626
vati+6.224
+6.185
Neg logit contributions
ーテ-24.128
yrinth-21.914
 lows-21.344
 darkest-21.103
 synd-20.693
Token (
Token ID357
Feature activation+1.15e-01

Pos logit contributions
orge+1.534
 depths+1.353
olphin+1.351
yrinth+1.332
rss+1.293
Neg logit contributions
-2.042
 cs-1.988
 postp-1.971
 Invention-1.881
ickets-1.853
Tokennom
Token ID26601
Feature activation+5.89e-01

Pos logit contributions
+1.126
 cs+1.103
 postp+1.092
 Invention+1.018
vati+1.010
Neg logit contributions
orge-1.271
yrinth-1.175
olphin-1.158
 depths-1.150
rss-1.127
Tokeninated
Token ID3898
Feature activation-3.05e-01

Pos logit contributions
 cs+5.447
+5.341
 postp+5.140
vati+5.063
ickets+4.910
Neg logit contributions
orge-6.226
yrinth-5.886
 depths-5.818
XXXX-5.720
olphin-5.711
Token five
Token ID1936
Feature activation-2.17e-01

Pos logit contributions
orge+2.415
 depths+2.184
yrinth+2.174
olphin+2.146
 WRITE+2.100
Neg logit contributions
-3.825
 cs-3.717
 postp-3.700
 Invention-3.517
vati-3.510
Tokenma
Token ID2611
Feature activation+6.12e-01

Pos logit contributions
orge+4.249
yrinth+3.969
olphin+3.916
 depths+3.901
rss+3.813
Neg logit contributions
-3.013
 cs-2.929
 postp-2.898
 Invention-2.675
vati-2.670
Token v
Token ID410
Feature activation+4.46e-01

Pos logit contributions
+4.849
 cs+4.746
 postp+4.706
 Invention+4.333
vati+4.268
Neg logit contributions
orge-7.777
yrinth-7.290
olphin-7.168
 depths-7.157
rss-7.004
Token.
Token ID13
Feature activation+2.97e-01

Pos logit contributions
 cs+3.492
+3.458
 Invention+3.364
 postp+3.353
 Offense+3.244
Neg logit contributions
yrinth-5.610
orge-5.351
olphin-5.074
ーテ-5.006
 depths-5.006
Token Goodell
Token ID41648
Feature activation+7.34e-01

Pos logit contributions
+2.627
 cs+2.548
 postp+2.521
vati+2.342
 Invention+2.312
Neg logit contributions
orge-3.580
yrinth-3.325
olphin-3.312
 depths-3.286
 WRITE-3.223
Token Am
Token ID1703
Feature activation-1.55e-02

Pos logit contributions
 postp+6.770
 cs+6.736
+6.703
 Invention+6.501
 Offense+6.416
Neg logit contributions
orge-7.821
yrinth-7.385
olphin-7.096
 depths-7.095
 darkest-6.887
Tokenended
Token ID1631
Feature activation+1.77e+00

Pos logit contributions
orge+0.190
yrinth+0.177
olphin+0.175
 depths+0.173
rss+0.171
Neg logit contributions
-0.133
 cs-0.130
 postp-0.129
vati-0.118
 Invention-0.117
Token Compl
Token ID20011
Feature activation+1.01e+00

Pos logit contributions
 Offense+13.350
 Invention+13.171
 Emer+12.481
 Camera+12.299
 postp+12.206
Neg logit contributions
orge-16.716
yrinth-15.911
rss-15.198
 depths-14.956
olphin-14.735
Tokenaint
Token ID2913
Feature activation+1.17e+00

Pos logit contributions
 cs+3.298
+3.066
 Offense+2.842
 postp+2.810
 Invention+2.775
Neg logit contributions
orge-15.688
ーテ-15.373
yrinth-15.170
 darkest-15.115
 depths-15.104
Token
Token ID198
Feature activation-1.11e-01

Pos logit contributions
 postp+9.403
 cs+9.340
 Offense+9.136
 Invention+9.115
+8.924
Neg logit contributions
orge-12.433
yrinth-11.674
olphin-11.271
 depths-11.079
-10.987
Token
Token ID198
Feature activation+2.48e-01

Pos logit contributions
orge+1.287
yrinth+1.251
olphin+1.202
 depths+1.179
ーテ+1.152
Neg logit contributions
-1.037
 cs-1.012
 postp-0.995
 Invention-0.927
vati-0.910
TokenJoe
Token ID19585
Feature activation-2.43e-01

Pos logit contributions
+2.520
 cs+2.455
 postp+2.433
 Invention+2.277
vati+2.271
Neg logit contributions
orge-2.639
yrinth-2.441
olphin-2.399
 depths-2.395
rss-2.326
Token should
Token ID815
Feature activation+6.21e-01

Pos logit contributions
 postp+9.311
 cs+9.264
+8.521
ickets+8.226
 Invention+7.819
Neg logit contributions
orge-10.166
olphin-9.596
yrinth-9.573
rss-9.255
ーテ-9.122
Token actually
Token ID1682
Feature activation+7.52e-01

Pos logit contributions
 cs+6.736
+6.698
 postp+6.640
 Invention+6.066
ickets+6.058
Neg logit contributions
orge-5.768
yrinth-5.387
olphin-5.248
ーテ-5.057
rss-5.021
Token be
Token ID307
Feature activation+9.27e-01

Pos logit contributions
 cs+7.778
 postp+7.682
+7.559
 Invention+7.013
ickets+6.855
Neg logit contributions
orge-7.110
yrinth-6.808
olphin-6.537
ーテ-6.459
rss-6.270
Token driven
Token ID7986
Feature activation+5.99e-01

Pos logit contributions
 postp+9.384
 cs+9.172
+8.746
 Invention+8.214
ickets+8.076
Neg logit contributions
orge-8.936
olphin-8.323
yrinth-8.320
rss-8.010
ーテ-7.884
Token towards
Token ID3371
Feature activation+9.12e-01

Pos logit contributions
 postp+5.010
 cs+4.999
+4.932
vati+4.444
 Invention+4.364
Neg logit contributions
orge-7.075
yrinth-6.763
olphin-6.583
rss-6.353
 depths-6.298
Token playing
Token ID2712
Feature activation+1.76e+00

Pos logit contributions
 postp+9.131
 cs+8.929
+8.452
 Invention+8.042
ickets+7.970
Neg logit contributions
orge-8.858
olphin-8.439
yrinth-8.431
rss-8.133
ーテ-7.890
Token Champions
Token ID6662
Feature activation+5.59e-01

Pos logit contributions
 cs+15.501
 postp+15.066
ickets+13.741
+13.385
 Offense+13.142
Neg logit contributions
orge-14.770
yrinth-14.138
olphin-14.037
rss-13.537
sent-13.285
Token Trophy
Token ID25640
Feature activation+1.21e+00

Pos logit contributions
 cs+2.822
 postp+2.721
+2.663
ickets+2.376
 Offense+2.366
Neg logit contributions
orge-8.360
yrinth-8.212
olphin-7.880
rss-7.835
sent-7.772
Token and
Token ID290
Feature activation+5.85e-01

Pos logit contributions
 cs+10.449
 postp+10.149
+9.754
ickets+9.618
 Offense+9.134
Neg logit contributions
orge-12.337
yrinth-11.561
olphin-11.474
rss-11.256
 WRITE-10.990
Token World
Token ID2159
Feature activation+9.78e-01

Pos logit contributions
 cs+5.077
 postp+5.047
+4.976
 Invention+4.560
ickets+4.490
Neg logit contributions
orge-6.852
olphin-6.382
yrinth-6.367
rss-6.234
 depths-6.139
Token Cup
Token ID5454
Feature activation+1.36e+00

Pos logit contributions
 cs+4.311
 postp+3.697
 Invention+3.597
 Offense+3.523
ickets+3.459
Neg logit contributions
yrinth-14.640
orge-14.568
ーテ-14.079
rss-13.638
olphin-13.495
Token,
Token ID11
Feature activation+1.99e-01

Pos logit contributions
 postp+4.415
+4.411
 cs+4.408
 Invention+4.078
vati+4.058
Neg logit contributions
orge-4.264
yrinth-4.025
olphin-3.904
XXXX-3.797
rss-3.792
Token
Token ID198
Feature activation+5.12e-01

Pos logit contributions
+2.034
 cs+2.011
 postp+1.999
 Invention+1.866
vati+1.843
Neg logit contributions
orge-2.087
yrinth-1.938
olphin-1.908
 depths-1.865
rss-1.850
Token
Token ID198
Feature activation+8.82e-01

Pos logit contributions
+3.378
 cs+3.318
 postp+3.209
 Invention+2.937
 err+2.925
Neg logit contributions
yrinth-7.207
ーテ-6.961
orge-6.750
olphin-6.604
psey-6.570
Tokenit
Token ID270
Feature activation+6.22e-01

Pos logit contributions
+8.400
 postp+7.988
 cs+7.926
vati+7.579
ickets+7.462
Neg logit contributions
orge-8.522
yrinth-8.209
olphin-8.034
 depths-7.930
ーテ-7.905
Token appears
Token ID3568
Feature activation+6.85e-01

Pos logit contributions
 cs+6.452
 postp+6.445
+6.299
vati+5.811
 Invention+5.810
Neg logit contributions
orge-6.001
yrinth-5.690
olphin-5.486
rss-5.391
XXXX-5.303
Token no
Token ID645
Feature activation+1.76e+00

Pos logit contributions
 postp+6.637
 cs+6.599
+6.599
 Invention+6.061
vati+5.917
Neg logit contributions
orge-7.066
yrinth-6.623
olphin-6.515
rss-6.307
XXXX-6.188
Token other
Token ID584
Feature activation+1.05e+00

Pos logit contributions
 postp+15.837
 cs+15.538
+15.238
 Invention+14.960
 err+14.717
Neg logit contributions
yrinth-12.962
orge-12.695
rss-12.207
olphin-12.203
ongyang-11.929
Token thing
Token ID1517
Feature activation+6.92e-02

Pos logit contributions
 postp+10.020
 cs+9.956
+9.590
 Invention+9.409
vati+8.885
Neg logit contributions
orge-9.848
yrinth-9.552
olphin-9.301
rss-9.297
ーテ-8.861
Token to
Token ID284
Feature activation+2.83e-01

Pos logit contributions
+0.731
 cs+0.712
 postp+0.705
 Invention+0.663
vati+0.662
Neg logit contributions
orge-0.713
yrinth-0.654
 depths-0.646
olphin-0.645
rss-0.625
Token me
Token ID502
Feature activation+1.90e-01

Pos logit contributions
+2.859
 cs+2.829
 postp+2.793
 Invention+2.593
vati+2.564
Neg logit contributions
orge-2.993
yrinth-2.792
olphin-2.737
 depths-2.668
rss-2.654
Token
Token ID198
Feature activation+7.75e-02

Pos logit contributions
+1.911
 cs+1.869
 postp+1.854
 Invention+1.730
vati+1.723
Neg logit contributions
orge-2.034
yrinth-1.883
olphin-1.852
 depths-1.838
rss-1.797
Token of
Token ID286
Feature activation+2.34e-01

Pos logit contributions
orge+0.817
yrinth+0.778
olphin+0.752
 depths+0.730
 WRITE+0.716
Neg logit contributions
-0.740
 cs-0.730
 postp-0.727
vati-0.662
 Invention-0.661
Token siege
Token ID20690
Feature activation-5.01e-02

Pos logit contributions
+2.409
 cs+2.370
 postp+2.365
 Invention+2.186
vati+2.171
Neg logit contributions
orge-2.454
yrinth-2.258
olphin-2.240
 depths-2.187
rss-2.158
Token and
Token ID290
Feature activation-6.10e-02

Pos logit contributions
orge+0.503
 depths+0.461
yrinth+0.461
olphin+0.452
rss+0.442
Neg logit contributions
-0.539
 cs-0.524
 postp-0.515
 Invention-0.492
vati-0.491
Token starvation
Token ID32143
Feature activation-4.69e-01

Pos logit contributions
orge+0.594
yrinth+0.547
 depths+0.543
olphin+0.532
rss+0.519
Neg logit contributions
-0.675
 cs-0.653
 postp-0.642
vati-0.617
 Invention-0.614
Token are
Token ID389
Feature activation+1.89e-01

Pos logit contributions
orge+4.103
 depths+3.919
yrinth+3.738
 lows+3.712
olphin+3.677
Neg logit contributions
-5.387
 cs-5.109
 postp-4.961
 Invention-4.895
vati-4.860
Token no
Token ID645
Feature activation+1.76e+00

Pos logit contributions
+2.004
 cs+1.972
 postp+1.967
 Invention+1.829
vati+1.815
Neg logit contributions
orge-1.926
yrinth-1.765
olphin-1.752
 depths-1.718
rss-1.692
Token less
Token ID1342
Feature activation-2.82e-01

Pos logit contributions
 postp+14.878
 cs+14.050
+13.481
 Invention+12.987
 excuse+12.917
Neg logit contributions
orge-14.831
yrinth-14.471
olphin-14.069
terson-13.493
rss-13.365
Token grave
Token ID12296
Feature activation-4.66e-01

Pos logit contributions
orge+2.554
 depths+2.394
yrinth+2.333
 lows+2.268
olphin+2.263
Neg logit contributions
-3.249
 cs-3.122
 postp-3.050
vati-3.010
 Invention-2.960
Token in
Token ID287
Feature activation+1.91e-01

Pos logit contributions
orge+4.385
 depths+4.207
yrinth+4.010
 lows+4.007
olphin+3.961
Neg logit contributions
-5.100
 cs-4.851
vati-4.674
 postp-4.661
 Invention-4.570
Token their
Token ID511
Feature activation-4.96e-01

Pos logit contributions
+2.022
 cs+1.983
 postp+1.968
 Invention+1.835
vati+1.829
Neg logit contributions
orge-1.957
yrinth-1.800
olphin-1.777
 depths-1.750
rss-1.715
Token calam
Token ID35765
Feature activation-3.69e-01

Pos logit contributions
 depths+4.722
orge+4.710
 darkest+4.419
yrinth+4.368
 lows+4.344
Neg logit contributions
-5.374
 cs-5.008
vati-4.819
 postp-4.804
 Invention-4.746
TokenAS
Token ID1921
Feature activation+3.24e-01

Pos logit contributions
+8.530
 cs+8.334
 postp+8.295
 Invention+7.747
vati+7.705
Neg logit contributions
orge-5.592
yrinth-5.359
olphin-5.113
 depths-5.021
ーテ-4.859
TokenES
Token ID1546
Feature activation+7.88e-01

Pos logit contributions
+2.820
 cs+2.736
 postp+2.670
 Invention+2.503
vati+2.441
Neg logit contributions
orge-3.939
yrinth-3.722
olphin-3.625
 depths-3.580
rss-3.483
Token Phase
Token ID18983
Feature activation+4.41e-01

Pos logit contributions
 cs+7.664
 postp+7.563
+7.467
 Invention+7.204
 Offense+7.061
Neg logit contributions
orge-7.856
yrinth-7.728
olphin-7.233
 depths-7.048
rss-6.923
Token 1
Token ID352
Feature activation+7.64e-01

Pos logit contributions
+4.164
 cs+4.137
 postp+4.108
 Invention+3.802
vati+3.748
Neg logit contributions
orge-4.877
yrinth-4.576
olphin-4.470
 depths-4.393
 WRITE-4.366
Token:
Token ID25
Feature activation+8.69e-01

Pos logit contributions
 postp+7.634
 cs+7.510
+7.373
 Invention+6.971
 Offense+6.897
Neg logit contributions
orge-7.704
yrinth-7.295
olphin-6.935
 WRITE-6.752
 depths-6.746
Token No
Token ID1400
Feature activation+1.75e+00

Pos logit contributions
 postp+8.682
 cs+8.619
+8.496
 Invention+8.070
 Offense+8.033
Neg logit contributions
orge-8.424
yrinth-7.844
olphin-7.502
 depths-7.242
ragon-7.232
Token infections
Token ID16079
Feature activation+1.23e+00

Pos logit contributions
 postp+16.373
 cs+16.157
 Invention+15.192
 Emer+15.165
 Offense+14.888
Neg logit contributions
yrinth-13.328
orge-13.107
olphin-11.982
 WRITE-11.899
terson-11.819
Token in
Token ID287
Feature activation+1.04e+00

Pos logit contributions
 postp+11.477
 cs+11.195
 Emer+10.758
 Invention+10.633
 Offense+10.502
Neg logit contributions
orge-11.160
yrinth-11.100
olphin-10.576
terson-9.971
ーテ-9.844
Token humans
Token ID5384
Feature activation+1.17e+00

Pos logit contributions
 cs+10.290
 postp+10.259
 Emer+9.358
 Belg+9.328
+9.265
Neg logit contributions
orge-10.007
yrinth-9.646
olphin-9.286
ーテ-8.946
ragon-8.755
Token are
Token ID389
Feature activation+1.24e+00

Pos logit contributions
 postp+10.801
 cs+10.708
 Invention+10.011
+9.989
 Offense+9.874
Neg logit contributions
orge-11.071
yrinth-10.881
olphin-10.164
ーテ-9.802
rss-9.708
Token being
Token ID852
Feature activation+1.00e+00

Pos logit contributions
 postp+11.548
 cs+10.832
 Invention+10.185
 Emer+10.163
+9.958
Neg logit contributions
orge-11.802
yrinth-11.161
olphin-10.770
ーテ-10.673
 WRITE-10.555
Tokenarl
Token ID7063
Feature activation-3.62e-01

Pos logit contributions
orge+5.114
yrinth+4.412
olphin+4.356
rss+4.303
 depths+4.294
Neg logit contributions
-8.499
 cs-8.355
 postp-8.318
 Invention-7.898
vati-7.740
Tokenal
Token ID282
Feature activation+2.65e-01

Pos logit contributions
orge+3.259
yrinth+2.985
olphin+2.921
 depths+2.907
rss+2.811
Neg logit contributions
-4.288
 cs-4.198
 postp-4.168
vati-3.941
 Invention-3.933
Token who
Token ID508
Feature activation+2.92e-01

Pos logit contributions
+3.210
 cs+3.152
 postp+3.140
vati+2.968
 Invention+2.946
Neg logit contributions
orge-2.289
yrinth-2.086
olphin-2.038
 depths-2.004
 WRITE-1.958
Token was
Token ID373
Feature activation+3.47e-01

Pos logit contributions
+3.286
 postp+3.280
 cs+3.267
vati+3.035
 Invention+2.997
Neg logit contributions
orge-2.745
yrinth-2.551
olphin-2.473
 depths-2.382
rss-2.379
Token climbing
Token ID14281
Feature activation+5.75e-01

Pos logit contributions
 postp+3.735
 cs+3.711
+3.682
vati+3.407
 Invention+3.370
Neg logit contributions
orge-3.480
yrinth-3.216
olphin-3.132
rss-3.062
 depths-2.985
Token behind
Token ID2157
Feature activation+1.74e+00

Pos logit contributions
 cs+5.547
+5.528
 postp+5.481
 Invention+5.120
vati+5.015
Neg logit contributions
orge-6.079
yrinth-5.777
olphin-5.621
rss-5.451
XXXX-5.379
Token her
Token ID607
Feature activation+8.93e-01

Pos logit contributions
 cs+11.694
 postp+11.549
+10.992
 Camera+10.890
vati+10.610
Neg logit contributions
orge-17.316
yrinth-16.789
olphin-16.719
ongyang-16.469
ーテ-16.430
Token.
Token ID13
Feature activation+1.50e+00

Pos logit contributions
 postp+7.615
 cs+7.606
+7.331
 Invention+6.878
vati+6.817
Neg logit contributions
orge-9.704
yrinth-9.322
olphin-9.224
rss-8.828
 WRITE-8.752
Token They
Token ID1119
Feature activation+2.28e-01

Pos logit contributions
 Camera+10.567
+10.316
 cs+10.114
 Invention+10.033
 Offense+9.860
Neg logit contributions
orge-15.618
yrinth-15.583
ongyang-15.429
psey-15.283
olphin-14.767
Token were
Token ID547
Feature activation+4.34e-01

Pos logit contributions
+2.457
 cs+2.456
 postp+2.455
vati+2.293
 Invention+2.249
Neg logit contributions
orge-2.234
yrinth-2.073
olphin-2.025
 depths-1.969
rss-1.952
Token rushed
Token ID13999
Feature activation+4.41e-01

Pos logit contributions
 postp+4.590
 cs+4.563
+4.469
vati+4.205
 Invention+4.059
Neg logit contributions
orge-4.396
yrinth-4.080
olphin-3.989
rss-3.863
 WRITE-3.744
Token and
Token ID290
Feature activation+1.12e+00

Pos logit contributions
 postp+7.239
 cs+7.198
+7.174
vati+6.565
 Invention+6.425
Neg logit contributions
orge-7.084
yrinth-6.577
olphin-6.442
rss-6.250
 depths-6.218
Token D
Token ID360
Feature activation+5.85e-01

Pos logit contributions
 postp+10.905
 cs+10.480
+10.221
 Skype+9.674
 Camera+9.624
Neg logit contributions
orge-10.369
yrinth-10.122
olphin-9.550
rss-9.536
ongyang-9.051
Tokenendi
Token ID43109
Feature activation+4.60e-01

Pos logit contributions
 cs+5.530
+5.481
vati+5.434
 postp+5.398
ickets+5.155
Neg logit contributions
ーテ-5.757
orge-5.613
yrinth-5.591
 lows-5.446
rss-5.395
Token st
Token ID336
Feature activation+3.19e-01

Pos logit contributions
+5.012
 cs+4.970
 postp+4.966
vati+4.563
 Invention+4.513
Neg logit contributions
orge-4.428
yrinth-4.132
olphin-4.021
rss-3.895
 depths-3.869
Tokenrolled
Token ID8375
Feature activation+5.12e-01

Pos logit contributions
+3.413
 cs+3.344
 postp+3.340
vati+3.160
ickets+3.116
Neg logit contributions
orge-3.115
yrinth-2.944
 depths-2.844
olphin-2.839
rss-2.761
Token off
Token ID572
Feature activation+1.74e+00

Pos logit contributions
 cs+5.131
 postp+5.052
+5.010
 Invention+4.605
vati+4.598
Neg logit contributions
orge-5.314
yrinth-5.081
olphin-4.846
rss-4.742
XXXX-4.624
Token to
Token ID284
Feature activation+5.24e-01

Pos logit contributions
 cs+13.426
+13.115
 postp+12.814
 err+12.252
 Skype+12.230
Neg logit contributions
orge-15.612
yrinth-15.541
rss-14.714
olphin-14.638
ーテ-14.047
Token entertain
Token ID8204
Feature activation+7.56e-01

Pos logit contributions
 cs+5.507
+5.471
 postp+5.325
 Invention+4.987
vati+4.953
Neg logit contributions
orge-5.157
yrinth-4.781
olphin-4.716
rss-4.570
 depths-4.516
Token (
Token ID357
Feature activation+7.70e-01

Pos logit contributions
 cs+7.210
 postp+7.182
+7.159
vati+6.503
 Camera+6.484
Neg logit contributions
orge-7.591
yrinth-7.232
olphin-7.052
rss-6.946
 depths-6.801
Tokenor
Token ID273
Feature activation+5.53e-01

Pos logit contributions
+7.663
 cs+7.371
 postp+7.272
vati+6.941
ickets+6.663
Neg logit contributions
orge-7.592
yrinth-7.220
olphin-7.036
 depths-7.005
 WRITE-6.807
Token be
Token ID307
Feature activation+7.04e-01

Pos logit contributions
 cs+5.802
 postp+5.727
+5.683
 Invention+5.279
vati+5.167
Neg logit contributions
orge-5.459
yrinth-5.108
olphin-4.965
rss-4.842
 depths-4.796
Token
Token ID960
Feature activation+6.97e-01

Pos logit contributions
+2.806
 cs+2.745
 postp+2.722
 Invention+2.559
vati+2.554
Neg logit contributions
orge-2.536
yrinth-2.325
olphin-2.289
 depths-2.279
rss-2.216
Tokenwill
Token ID10594
Feature activation+9.05e-01

Pos logit contributions
+6.048
 postp+5.752
 cs+5.695
vati+5.513
ickets+5.321
Neg logit contributions
orge-7.723
yrinth-7.435
olphin-7.351
 depths-7.314
 WRITE-7.265
Token be
Token ID307
Feature activation+4.17e-01

Pos logit contributions
 cs+8.237
+8.106
 postp+8.064
 err+7.462
 defer+7.365
Neg logit contributions
orge-9.292
yrinth-8.917
olphin-8.511
rss-8.350
ongyang-8.334
Token higher
Token ID2440
Feature activation+2.53e-02

Pos logit contributions
+4.503
 postp+4.454
 cs+4.424
 Invention+4.062
vati+4.060
Neg logit contributions
orge-4.078
yrinth-3.721
olphin-3.670
rss-3.569
 WRITE-3.539
Token if
Token ID611
Feature activation+9.62e-01

Pos logit contributions
+0.268
 cs+0.262
 postp+0.259
 Invention+0.244
vati+0.243
Neg logit contributions
orge-0.258
yrinth-0.237
 depths-0.235
olphin-0.235
rss-0.227
Token they
Token ID484
Feature activation+1.74e+00

Pos logit contributions
 postp+7.956
 cs+7.784
+7.735
 Invention+6.830
 err+6.725
Neg logit contributions
orge-10.884
olphin-10.366
yrinth-10.223
ragon-9.729
rss-9.690
Token keep
Token ID1394
Feature activation+1.02e+00

Pos logit contributions
 postp+14.156
 cs+13.465
 defer+13.445
 err+12.972
 postpone+12.571
Neg logit contributions
ーテ-14.988
orge-14.661
yrinth-14.610
olphin-14.435
rss-13.836
Token working
Token ID1762
Feature activation+1.26e+00

Pos logit contributions
 postp+9.866
 cs+9.215
+8.727
 defer+8.198
 Invention+8.194
Neg logit contributions
orge-10.597
yrinth-9.886
olphin-9.647
ーテ-9.256
alian-9.200
Token,
Token ID11
Feature activation+3.60e-01

Pos logit contributions
 postp+11.029
 cs+10.702
+10.110
 err+9.285
 Invention+9.280
Neg logit contributions
orge-12.707
yrinth-12.356
olphin-11.409
ーテ-11.252
ongyang-11.191
Token and
Token ID290
Feature activation+6.52e-02

Pos logit contributions
+3.686
 postp+3.674
 cs+3.667
 Invention+3.381
vati+3.341
Neg logit contributions
orge-3.740
yrinth-3.450
olphin-3.380
 depths-3.284
rss-3.270
Token their
Token ID511
Feature activation+6.31e-01

Pos logit contributions
+0.692
 cs+0.676
 postp+0.671
 Invention+0.629
vati+0.628
Neg logit contributions
orge-0.667
yrinth-0.614
olphin-0.604
 depths-0.601
rss-0.585
TokenMom
Token ID29252
Feature activation+1.46e-01

Pos logit contributions
+6.407
 cs+6.367
 postp+6.329
 Invention+6.028
 Offense+5.870
Neg logit contributions
orge-6.595
olphin-6.055
yrinth-6.041
 depths-6.001
 WRITE-5.749
Token:
Token ID25
Feature activation+7.82e-01

Pos logit contributions
+1.608
 cs+1.572
 postp+1.559
 Invention+1.466
vati+1.461
Neg logit contributions
orge-1.447
yrinth-1.327
olphin-1.303
 depths-1.298
rss-1.264
Token Jen
Token ID13364
Feature activation+3.87e-01

Pos logit contributions
+7.376
 cs+7.305
 postp+7.241
 Invention+6.840
 Offense+6.660
Neg logit contributions
orge-8.234
yrinth-7.829
olphin-7.579
 depths-7.503
rss-7.246
Tokenelle
Token ID13485
Feature activation+8.60e-01

Pos logit contributions
+3.509
 cs+3.437
 postp+3.407
 Invention+3.145
ickets+3.130
Neg logit contributions
orge-4.331
yrinth-4.126
 depths-4.010
olphin-4.000
 WRITE-3.947
Token Evans
Token ID13922
Feature activation+1.02e+00

Pos logit contributions
 postp+8.252
 cs+8.185
+8.110
 Invention+7.554
 Offense+7.355
Neg logit contributions
orge-8.768
yrinth-8.393
olphin-8.066
rss-7.842
 depths-7.712
Token Shows
Token ID25156
Feature activation+1.73e+00

Pos logit contributions
 postp+9.966
 cs+9.602
+9.368
 Invention+9.002
 Offense+8.934
Neg logit contributions
yrinth-9.585
orge-9.462
olphin-9.222
rss-8.766
 depths-8.660
Token Off
Token ID3242
Feature activation+1.12e+00

Pos logit contributions
 Camera+14.060
 Offense+13.852
 Invention+13.724
+13.583
 Skype+13.226
Neg logit contributions
yrinth-14.636
orge-14.118
olphin-13.221
ーテ-13.212
ongyang-12.915
Token Her
Token ID2332
Feature activation+7.02e-01

Pos logit contributions
+10.192
 Invention+10.123
 cs+10.094
 Camera+10.000
 postp+9.923
Neg logit contributions
yrinth-10.334
orge-10.210
olphin-9.733
 depths-9.410
ーテ-9.405
Token Brand
Token ID13512
Feature activation+1.61e-01

Pos logit contributions
+7.007
 cs+6.895
 Invention+6.786
 postp+6.766
 Camera+6.618
Neg logit contributions
orge-6.738
yrinth-6.574
olphin-6.375
 depths-6.159
rss-6.119
Token New
Token ID968
Feature activation+1.17e+00

Pos logit contributions
+1.719
 cs+1.680
 postp+1.668
vati+1.562
 Invention+1.560
Neg logit contributions
orge-1.642
yrinth-1.506
olphin-1.485
 depths-1.479
rss-1.439
Token Forever
Token ID28285
Feature activation+3.35e-01

Pos logit contributions
 Camera+11.193
 Invention+11.043
 cs+10.798
+10.688
 Offense+10.546
Neg logit contributions
yrinth-10.177
orge-10.115
olphin-9.589
 depths-9.437
rss-9.402
Tokenomp
Token ID3361
Feature activation-8.19e-03

Pos logit contributions
orge+2.897
yrinth+2.714
olphin+2.657
 depths+2.648
 WRITE+2.604
Neg logit contributions
-2.463
 cs-2.381
 postp-2.373
vati-2.219
 Invention-2.195
Tokenkins
Token ID5331
Feature activation+1.22e+00

Pos logit contributions
orge+0.063
yrinth+0.060
ーテ+0.056
olphin+0.056
 WRITE+0.055
Neg logit contributions
-0.106
 cs-0.106
 postp-0.104
vati-0.100
ickets-0.100
Token was
Token ID373
Feature activation+1.06e+00

Pos logit contributions
 postp+11.336
 cs+10.465
+10.230
 err+9.897
 defer+9.825
Neg logit contributions
orge-11.730
yrinth-11.622
ーテ-11.230
olphin-11.074
rss-10.501
Token already
Token ID1541
Feature activation+2.61e-01

Pos logit contributions
 postp+9.666
 cs+9.049
+8.653
 Camera+8.309
 Emer+8.305
Neg logit contributions
orge-10.898
yrinth-10.807
rss-10.143
olphin-10.064
ーテ-9.965
Token back
Token ID736
Feature activation+5.71e-01

Pos logit contributions
 postp+2.693
+2.677
 cs+2.677
 Invention+2.445
vati+2.429
Neg logit contributions
orge-2.709
yrinth-2.547
olphin-2.462
rss-2.413
 depths-2.411
Token on
Token ID319
Feature activation+1.72e+00

Pos logit contributions
+5.628
 cs+5.559
 postp+5.555
vati+5.155
 Camera+5.074
Neg logit contributions
orge-5.929
yrinth-5.638
olphin-5.352
 depths-5.284
rss-5.242
Token her
Token ID607
Feature activation+1.11e+00

Pos logit contributions
 cs+13.398
+13.244
 Camera+13.176
 postp+12.857
 Offense+12.566
Neg logit contributions
orge-15.593
yrinth-15.187
olphin-15.025
ーテ-14.714
ongyang-14.316
Token way
Token ID835
Feature activation+1.85e-01

Pos logit contributions
 cs+9.390
 postp+9.028
+8.703
 err+8.538
 Camera+8.432
Neg logit contributions
orge-11.981
olphin-11.314
yrinth-11.233
rss-10.868
ーテ-10.834
Token to
Token ID284
Feature activation+2.69e-01

Pos logit contributions
+1.928
 cs+1.881
 postp+1.864
 Invention+1.751
vati+1.743
Neg logit contributions
orge-1.928
yrinth-1.767
olphin-1.751
 depths-1.744
rss-1.694
Token work
Token ID670
Feature activation+6.84e-01

Pos logit contributions
+2.775
 cs+2.735
 postp+2.704
 Invention+2.536
vati+2.512
Neg logit contributions
orge-2.789
yrinth-2.584
olphin-2.542
 depths-2.505
rss-2.465
Token.
Token ID13
Feature activation+5.72e-01

Pos logit contributions
+6.188
 postp+6.107
 cs+6.080
 Invention+5.513
vati+5.503
Neg logit contributions
orge-7.712
yrinth-7.362
olphin-7.077
 depths-6.876
rss-6.844
Token their
Token ID511
Feature activation-4.14e-01

Pos logit contributions
orge+0.973
yrinth+0.892
 depths+0.888
olphin+0.874
XXXX+0.855
Neg logit contributions
-1.065
 cs-1.027
 postp-1.004
 Invention-0.965
vati-0.962
Token age
Token ID2479
Feature activation+2.32e-01

Pos logit contributions
orge+4.266
 depths+4.007
yrinth+3.879
 darkest+3.873
 lows+3.815
Neg logit contributions
-4.247
 cs-4.053
 postp-3.938
vati-3.831
 Invention-3.824
Token
Token ID851
Feature activation-3.70e-01

Pos logit contributions
+2.512
 cs+2.458
 postp+2.438
 Invention+2.288
vati+2.284
Neg logit contributions
orge-2.311
yrinth-2.123
olphin-2.089
 depths-2.077
rss-2.022
Token stand
Token ID1302
Feature activation+5.74e-01

Pos logit contributions
orge+3.337
 depths+3.052
yrinth+2.965
olphin+2.950
 WRITE+2.911
Neg logit contributions
-4.310
 cs-4.109
 postp-4.042
vati-3.908
 Invention-3.887
Token a
Token ID257
Feature activation+6.98e-01

Pos logit contributions
 cs+5.145
+5.107
 postp+5.098
vati+4.652
 Invention+4.562
Neg logit contributions
orge-6.601
yrinth-6.234
olphin-5.990
rss-5.950
 WRITE-5.851
Token lot
Token ID1256
Feature activation+1.72e+00

Pos logit contributions
+6.311
 cs+6.300
 postp+6.192
 Offense+5.614
vati+5.584
Neg logit contributions
orge-7.711
yrinth-7.375
olphin-7.089
 WRITE-7.001
 depths-6.915
Token better
Token ID1365
Feature activation+2.04e-02

Pos logit contributions
 postp+11.654
 cs+11.557
+10.761
 err+10.619
 Skype+9.945
Neg logit contributions
yrinth-18.798
orge-18.789
rss-17.389
 WRITE-17.269
ーテ-16.899
Token chance
Token ID2863
Feature activation-2.34e-01

Pos logit contributions
+0.217
 cs+0.208
 postp+0.207
 Invention+0.197
vati+0.194
Neg logit contributions
orge-0.208
 depths-0.191
yrinth-0.190
olphin-0.189
 lows-0.183
Token of
Token ID286
Feature activation-2.52e-01

Pos logit contributions
orge+1.760
olphin+1.630
 depths+1.622
 rated+1.607
yrinth+1.585
Neg logit contributions
-2.964
 cs-2.816
 Invention-2.782
vati-2.723
ickets-2.717
Token getting
Token ID1972
Feature activation-1.05e+00

Pos logit contributions
orge+2.290
 depths+2.096
yrinth+2.054
olphin+2.039
XXXX+1.995
Neg logit contributions
-2.942
 cs-2.798
 postp-2.705
vati-2.670
 Invention-2.657
Token honest
Token ID5508
Feature activation-1.12e+00

Pos logit contributions
orge+8.592
 rated+8.125
 depths+8.033
 lows+7.834
 darkest+7.599
Neg logit contributions
-11.820
ickets-10.306
 cs-10.281
 Invention-10.279
vati-10.071
Token changed
Token ID3421
Feature activation+1.03e+00

Pos logit contributions
 cs+4.992
 postp+4.921
+4.738
 Invention+4.620
vati+4.508
Neg logit contributions
orge-5.314
olphin-5.126
yrinth-4.969
rss-4.898
ーテ-4.875
Token it
Token ID340
Feature activation+6.22e-01

Pos logit contributions
 cs+8.820
 postp+8.645
+8.313
 Invention+8.248
 Offense+7.769
Neg logit contributions
orge-10.990
olphin-10.378
yrinth-10.091
rss-9.870
XXXX-9.847
Token to
Token ID284
Feature activation+1.25e+00

Pos logit contributions
 cs+5.549
+5.463
 postp+5.416
vati+5.031
 Invention+4.933
Neg logit contributions
orge-6.944
yrinth-6.554
olphin-6.455
rss-6.234
 depths-6.130
Token 1
Token ID352
Feature activation+8.66e-01

Pos logit contributions
 cs+11.266
 postp+10.746
 Invention+10.205
+10.192
 Belg+9.986
Neg logit contributions
orge-11.874
olphin-11.281
ongyang-10.967
yrinth-10.910
ーテ-10.799
Token mana
Token ID13149
Feature activation+1.28e+00

Pos logit contributions
 cs+8.495
+8.361
 postp+8.166
 Invention+7.948
 Offense+7.698
Neg logit contributions
orge-8.179
yrinth-7.718
olphin-7.647
 WRITE-7.482
 depths-7.421
Token,
Token ID11
Feature activation+1.71e+00

Pos logit contributions
 cs+11.245
+10.746
 postp+10.574
 Invention+10.360
 Offense+9.978
Neg logit contributions
orge-11.751
yrinth-11.395
olphin-11.263
ongyang-11.210
terson-10.911
Token sheep
Token ID15900
Feature activation+6.07e-01

Pos logit contributions
 cs+13.919
 postp+13.324
 goats+12.652
 Invention+12.584
+12.528
Neg logit contributions
orge-14.944
olphin-14.245
ongyang-14.125
yrinth-13.439
ーテ-13.383
Token a
Token ID257
Feature activation+5.52e-01

Pos logit contributions
 cs+5.431
+5.322
 postp+5.261
 Invention+4.916
vati+4.841
Neg logit contributions
orge-6.716
olphin-6.385
yrinth-6.294
 depths-6.082
 WRITE-6.013
Token random
Token ID4738
Feature activation+8.73e-01

Pos logit contributions
 cs+5.754
+5.654
 postp+5.634
 Invention+5.262
vati+5.184
Neg logit contributions
orge-5.358
yrinth-5.000
olphin-4.959
 depths-4.795
 WRITE-4.758
Token minion
Token ID28365
Feature activation+6.95e-01

Pos logit contributions
 cs+7.946
 postp+7.723
+7.597
 Invention+7.462
 Offense+7.082
Neg logit contributions
orge-8.849
yrinth-8.472
olphin-8.440
ongyang-8.105
rss-7.995
Token.
Token ID13
Feature activation+1.08e+00

Pos logit contributions
 cs+6.194
+6.075
 postp+6.003
 Invention+5.684
 Offense+5.506
Neg logit contributions
orge-7.668
yrinth-7.196
olphin-7.184
 depths-6.830
 WRITE-6.797
Token infections
Token ID16079
Feature activation+1.23e+00

Pos logit contributions
 postp+16.373
 cs+16.157
 Invention+15.192
 Emer+15.165
 Offense+14.888
Neg logit contributions
yrinth-13.328
orge-13.107
olphin-11.982
 WRITE-11.899
terson-11.819
Token in
Token ID287
Feature activation+1.04e+00

Pos logit contributions
 postp+11.477
 cs+11.195
 Emer+10.758
 Invention+10.633
 Offense+10.502
Neg logit contributions
orge-11.160
yrinth-11.100
olphin-10.576
terson-9.971
ーテ-9.844
Token humans
Token ID5384
Feature activation+1.17e+00

Pos logit contributions
 cs+10.290
 postp+10.259
 Emer+9.358
 Belg+9.328
+9.265
Neg logit contributions
orge-10.007
yrinth-9.646
olphin-9.286
ーテ-8.946
ragon-8.755
Token are
Token ID389
Feature activation+1.24e+00

Pos logit contributions
 postp+10.801
 cs+10.708
 Invention+10.011
+9.989
 Offense+9.874
Neg logit contributions
orge-11.071
yrinth-10.881
olphin-10.164
ーテ-9.802
rss-9.708
Token being
Token ID852
Feature activation+1.00e+00

Pos logit contributions
 postp+11.548
 cs+10.832
 Invention+10.185
 Emer+10.163
+9.958
Neg logit contributions
orge-11.802
yrinth-11.161
olphin-10.770
ーテ-10.673
 WRITE-10.555
Token caused
Token ID4073
Feature activation+1.71e+00

Pos logit contributions
 postp+9.248
 cs+8.948
vati+8.658
+8.492
 Belg+8.282
Neg logit contributions
orge-10.039
yrinth-9.462
olphin-9.193
 WRITE-9.073
ーテ-9.023
Token by
Token ID416
Feature activation+7.25e-01

Pos logit contributions
 postp+14.058
 cs+13.972
 Invention+13.396
 Emer+13.148
+12.889
Neg logit contributions
yrinth-14.816
orge-14.608
olphin-13.368
 depths-13.289
terson-13.234
Token viruses
Token ID20547
Feature activation+1.18e+00

Pos logit contributions
 postp+7.049
 cs+6.986
+6.813
 Invention+6.414
vati+6.285
Neg logit contributions
orge-7.547
yrinth-7.286
olphin-6.896
ragon-6.650
 WRITE-6.645
Token circulating
Token ID24495
Feature activation+8.58e-01

Pos logit contributions
 postp+10.913
 cs+10.694
 Invention+10.398
+10.060
 Emer+10.053
Neg logit contributions
orge-11.002
yrinth-10.964
olphin-10.322
-9.836
 depths-9.811
Token in
Token ID287
Feature activation+8.96e-01

Pos logit contributions
 cs+8.108
 postp+8.052
+7.994
 Invention+7.666
 Emer+7.426
Neg logit contributions
orge-8.596
yrinth-8.186
olphin-8.031
rss-7.637
terson-7.532
Token animals
Token ID4695
Feature activation+9.59e-01

Pos logit contributions
 cs+8.738
 postp+8.705
+8.313
 Emer+7.934
 Invention+7.816
Neg logit contributions
orge-8.875
yrinth-8.564
olphin-8.145
ーテ-7.989
rss-7.917
Token The
Token ID383
Feature activation+4.25e-01

Pos logit contributions
+1.918
 cs+1.876
 postp+1.847
 Invention+1.738
vati+1.715
Neg logit contributions
orge-2.161
yrinth-2.022
olphin-1.984
 depths-1.969
rss-1.918
Token further
Token ID2252
Feature activation+5.83e-01

Pos logit contributions
+4.492
 postp+4.476
 cs+4.460
 Invention+4.126
vati+4.095
Neg logit contributions
orge-4.191
yrinth-3.924
olphin-3.859
rss-3.702
 depths-3.663
Token along
Token ID1863
Feature activation+6.52e-01

Pos logit contributions
 postp+5.891
 cs+5.861
+5.849
vati+5.372
 Invention+5.352
Neg logit contributions
orge-5.767
yrinth-5.584
olphin-5.442
rss-5.217
 WRITE-5.151
Token he
Token ID339
Feature activation+9.51e-03

Pos logit contributions
+5.812
 cs+5.766
 postp+5.717
 Invention+5.270
vati+5.195
Neg logit contributions
orge-7.212
yrinth-6.934
olphin-6.812
rss-6.535
elf-6.508
Token walks
Token ID11114
Feature activation+5.60e-01

Pos logit contributions
+0.110
 cs+0.107
 postp+0.106
 Invention+0.101
vati+0.100
Neg logit contributions
orge-0.089
 depths-0.080
yrinth-0.080
olphin-0.079
rss-0.077
Token,
Token ID11
Feature activation+1.70e+00

Pos logit contributions
+5.391
 cs+5.369
 postp+5.359
 Invention+4.910
vati+4.894
Neg logit contributions
orge-5.934
yrinth-5.671
olphin-5.459
 WRITE-5.302
rss-5.302
Token the
Token ID262
Feature activation+1.57e+00

Pos logit contributions
 postp+14.048
 cs+13.934
+13.493
 err+12.657
 Invention+12.281
Neg logit contributions
yrinth-15.176
orge-14.967
olphin-14.029
ragon-13.917
alian-13.697
Token greater
Token ID3744
Feature activation+6.12e-01

Pos logit contributions
 postp+13.317
 cs+12.228
 err+11.791
+11.763
 Belg+11.264
Neg logit contributions
yrinth-15.081
orge-14.582
olphin-13.931
ongyang-13.822
ーテ-13.674
Token the
Token ID262
Feature activation+7.05e-01

Pos logit contributions
+6.138
 cs+6.119
 postp+6.068
 Invention+5.572
vati+5.552
Neg logit contributions
orge-6.262
yrinth-5.962
olphin-5.742
 WRITE-5.565
rss-5.537
Token destruction
Token ID8166
Feature activation+3.47e-01

Pos logit contributions
 postp+7.720
 cs+7.552
+7.445
 Invention+6.905
vati+6.875
Neg logit contributions
orge-6.429
yrinth-6.204
olphin-6.007
rss-5.721
 WRITE-5.614
Token and
Token ID290
Feature activation+6.13e-01

Pos logit contributions
+3.446
 cs+3.423
 postp+3.415
 Invention+3.131
vati+3.101
Neg logit contributions
orge-3.696
yrinth-3.492
olphin-3.395
 depths-3.284
rss-3.263
Token NASA
Token ID8884
Feature activation-5.55e-01

Pos logit contributions
orge+5.737
yrinth+5.254
olphin+5.229
 rated+5.221
rss+5.196
Neg logit contributions
-7.108
 cs-6.863
 postp-6.692
vati-6.597
ickets-6.452
Token
Token ID447
Feature activation+5.72e-03

Pos logit contributions
orge+3.433
olphin+3.154
yrinth+3.116
 depths+3.093
sent+3.060
Neg logit contributions
-7.295
 postp-7.170
 cs-7.028
vati-6.839
 Invention-6.796
Token
Token ID247
Feature activation-6.75e-01

Pos logit contributions
+0.063
 cs+0.062
 postp+0.061
 Invention+0.057
vati+0.057
Neg logit contributions
orge-0.055
yrinth-0.053
olphin-0.052
 depths-0.051
 WRITE-0.050
Tokens
Token ID82
Feature activation-6.77e-01

Pos logit contributions
orge+5.710
sent+5.106
olphin+5.029
 depths+4.991
rss+4.947
Neg logit contributions
-8.028
 postp-7.830
 cs-7.819
 Invention-7.470
vati-7.362
Token seventh
Token ID14024
Feature activation-6.28e-01

Pos logit contributions
orge+3.692
 darkest+3.611
olphin+3.376
yrinth+3.359
 depths+3.337
Neg logit contributions
-9.367
 cs-9.019
 postp-8.899
ickets-8.730
vati-8.679
Token consecutive
Token ID12785
Feature activation-2.31e+00

Pos logit contributions
orge+5.572
 depths+5.057
olphin+5.047
yrinth+5.041
XXXX+4.912
Neg logit contributions
-7.119
 cs-6.835
 postp-6.760
vati-6.504
 Invention-6.475
Token lunar
Token ID25572
Feature activation-4.96e-01

Pos logit contributions
 lunar+11.136
 depths+9.543
 darkest+9.258
 retrieval+8.886
XXXX+8.779
Neg logit contributions
skirts-25.172
-24.381
ickets-24.000
ifa-23.909
 Offense-23.791
Token mission
Token ID4365
Feature activation-1.05e+00

Pos logit contributions
orge+3.235
 depths+2.825
olphin+2.717
yrinth+2.707
XXXX+2.695
Neg logit contributions
-6.949
 cs-6.734
 postp-6.665
vati-6.475
ickets-6.469
Token failure
Token ID5287
Feature activation-2.95e-01

Pos logit contributions
orge+2.640
 depths+2.450
 rated+2.062
 darkest+1.989
 lows+1.853
Neg logit contributions
-17.264
skirts-16.777
 cs-16.657
ickets-16.627
vati-16.364
Token.
Token ID13
Feature activation-1.15e+00

Pos logit contributions
orge+2.528
 depths+2.274
yrinth+2.259
rss+2.246
olphin+2.226
Neg logit contributions
-3.454
 cs-3.399
 postp-3.335
vati-3.272
 Invention-3.164
Token NASA
Token ID8884
Feature activation-5.05e-01

Pos logit contributions
orge+10.618
XXXX+9.894
 WRITE+9.843
rss+9.799
 Meridian+9.640
Neg logit contributions
-10.392
 postp-9.888
 cs-9.766
vati-9.713
ickets-9.578
Token m
Token ID285
Feature activation-5.35e-01

Pos logit contributions
+0.408
 postp+0.388
 cs+0.384
 Invention+0.368
vati+0.363
Neg logit contributions
orge-0.611
yrinth-0.571
 depths-0.571
olphin-0.560
 WRITE-0.551
Token2
Token ID17
Feature activation+4.30e-03

Pos logit contributions
orge+5.759
olphin+5.125
 depths+5.092
yrinth+5.092
rss+5.069
Neg logit contributions
-5.334
 postp-5.203
 cs-5.197
 Invention-4.921
vati-4.816
Tokenk
Token ID74
Feature activation-1.29e-01

Pos logit contributions
+0.035
 cs+0.034
 postp+0.033
vati+0.031
 Invention+0.030
Neg logit contributions
orge-0.054
yrinth-0.053
olphin-0.051
 depths-0.051
ーテ-0.050
Token [
Token ID685
Feature activation-7.07e-01

Pos logit contributions
orge+1.540
yrinth+1.473
olphin+1.437
 depths+1.409
rss+1.382
Neg logit contributions
-1.153
 cs-1.142
 postp-1.112
 Invention-1.023
vati-1.011
Token10
Token ID940
Feature activation-1.07e-01

Pos logit contributions
orge+4.486
sent+3.807
rss+3.704
XXXX+3.695
 depths+3.667
Neg logit contributions
-9.743
 cs-9.676
 postp-9.641
 Invention-9.509
vati-9.187
Token:
Token ID25
Feature activation-2.13e+00

Pos logit contributions
orge+1.143
yrinth+1.117
olphin+1.065
 depths+1.042
ーテ+1.018
Neg logit contributions
-1.091
 cs-1.070
 postp-1.052
 Invention-0.982
vati-0.972
Token48
Token ID2780
Feature activation+8.76e-03

Pos logit contributions
48+0.579
dale-0.125
-0.267
sent-0.335
 rated-0.764
Neg logit contributions
 postp-30.869
-30.732
-30.623
 suspic-30.361
 cs-30.343
Token]
Token ID60
Feature activation+4.17e-01

Pos logit contributions
+0.067
 cs+0.065
 postp+0.063
 Invention+0.057
vati+0.057
Neg logit contributions
yrinth-0.115
orge-0.115
olphin-0.109
ーテ-0.108
 depths-0.108
Token <
Token ID1279
Feature activation+1.16e-01

Pos logit contributions
 cs+1.788
+1.669
 postp+1.563
 Offense+1.290
 Invention+1.277
Neg logit contributions
yrinth-6.730
orge-6.704
olphin-6.574
ーテ-6.388
ongyang-6.274
Tokenmin
Token ID1084
Feature activation+3.18e-01

Pos logit contributions
+1.249
 cs+1.227
 postp+1.218
 Invention+1.149
vati+1.132
Neg logit contributions
orge-1.181
yrinth-1.081
olphin-1.062
 depths-1.057
rss-1.033
Tokenion
Token ID295
Feature activation-1.14e+00

Pos logit contributions
+2.697
 cs+2.599
 postp+2.561
vati+2.411
ickets+2.373
Neg logit contributions
orge-3.791
yrinth-3.613
 depths-3.587
olphin-3.546
 WRITE-3.503
Tokenm
Token ID76
Feature activation-1.14e-01

Pos logit contributions
+0.750
 postp+0.716
 cs+0.712
 Invention+0.675
vati+0.668
Neg logit contributions
orge-1.133
yrinth-1.055
 depths-1.054
olphin-1.039
 WRITE-1.016
Token2
Token ID17
Feature activation-3.27e-01

Pos logit contributions
orge+1.345
yrinth+1.238
 depths+1.223
olphin+1.215
XXXX+1.189
Neg logit contributions
-1.048
 postp-0.999
 cs-0.993
 Invention-0.947
vati-0.932
Tokenk
Token ID74
Feature activation+7.01e-02

Pos logit contributions
orge+3.062
 depths+2.684
yrinth+2.664
sent+2.640
olphin+2.628
Neg logit contributions
-3.552
 postp-3.495
 Invention-3.376
 cs-3.356
vati-3.275
Token [
Token ID685
Feature activation-7.73e-01

Pos logit contributions
+0.729
 postp+0.713
 cs+0.698
 Invention+0.681
vati+0.671
Neg logit contributions
orge-0.728
 depths-0.660
yrinth-0.658
olphin-0.651
XXXX-0.635
Token10
Token ID940
Feature activation-2.72e-01

Pos logit contributions
orge+4.727
sent+4.006
rss+3.903
XXXX+3.886
 depths+3.831
Neg logit contributions
-10.730
 cs-10.655
 postp-10.629
 Invention-10.536
vati-10.144
Token:
Token ID25
Feature activation-2.13e+00

Pos logit contributions
orge+2.738
yrinth+2.552
olphin+2.491
 depths+2.469
rss+2.400
Neg logit contributions
-2.930
 cs-2.867
 postp-2.837
 Invention-2.663
vati-2.652
Token48
Token ID2780
Feature activation-1.03e-01

Pos logit contributions
+2.548
48+2.466
sent+2.304
 rated+2.112
dale+2.074
Neg logit contributions
 postp-27.255
-27.171
 cs-26.692
 Invention-26.667
GROUND-26.476
Token]
Token ID60
Feature activation+3.27e-01

Pos logit contributions
orge+1.317
yrinth+1.288
olphin+1.240
 depths+1.225
ーテ+1.201
Neg logit contributions
-0.834
 cs-0.809
 postp-0.790
 Invention-0.722
vati-0.716
Token <
Token ID1279
Feature activation+1.07e-01

Pos logit contributions
 cs+1.391
+1.294
 postp+1.211
 Offense+1.020
 Invention+1.000
Neg logit contributions
yrinth-5.373
orge-5.284
olphin-5.216
ーテ-5.100
ongyang-5.027
Tokenm
Token ID76
Feature activation-1.17e-01

Pos logit contributions
+1.145
 cs+1.125
 postp+1.116
 Invention+1.053
vati+1.038
Neg logit contributions
orge-1.093
yrinth-1.001
olphin-0.984
 depths-0.979
rss-0.956
Tokenew
Token ID413
Feature activation-4.29e-01

Pos logit contributions
orge+1.316
yrinth+1.206
olphin+1.188
 depths+1.169
rss+1.154
Neg logit contributions
-1.141
 postp-1.113
 cs-1.113
 Invention-1.039
 Offense-1.017
Token Koreans
Token ID25999
Feature activation-7.60e-01

Pos logit contributions
orge+5.333
olphin+4.804
sent+4.652
ongyang+4.612
 depths+4.578
Neg logit contributions
-12.709
 cs-11.942
vati-11.559
 postp-11.492
 Invention-11.371
Token are
Token ID389
Feature activation-9.40e-01

Pos logit contributions
orge+6.513
 depths+5.923
 rated+5.909
sent+5.729
olphin+5.729
Neg logit contributions
-8.855
ickets-7.929
 cs-7.913
 postp-7.858
vati-7.830
Token forced
Token ID4137
Feature activation-6.69e-01

Pos logit contributions
orge+7.873
 rated+7.686
sent+7.397
 depths+7.228
 darkest+7.112
Neg logit contributions
-10.415
 cs-9.276
 Invention-9.144
ickets-9.093
vati-8.981
Token to
Token ID284
Feature activation-7.28e-01

Pos logit contributions
orge+5.374
 depths+4.743
olphin+4.707
yrinth+4.647
ragon+4.598
Neg logit contributions
-7.796
 cs-7.461
 Invention-7.343
 postp-7.244
 Offense-7.217
Token adhere
Token ID26325
Feature activation-9.39e-01

Pos logit contributions
orge+6.276
 depths+5.904
 WRITE+5.658
 subjug+5.614
 synd+5.493
Neg logit contributions
-8.101
 cs-7.437
vati-7.346
 postp-7.328
 Invention-7.323
Token to
Token ID284
Feature activation-2.05e+00

Pos logit contributions
orge+8.405
 depths+7.732
sent+7.325
 darkest+7.245
rss+7.240
Neg logit contributions
-10.308
 cs-9.790
 postp-9.570
 Invention-9.327
 Offense-9.290
Token the
Token ID262
Feature activation-1.64e+00

Pos logit contributions
 depths+11.536
orge+10.842
 synd+10.746
 Pyongyang+10.573
 darkest+10.187
Neg logit contributions
-18.601
ickets-17.346
 cs-16.748
 Invention-16.373
 Offense-16.196
Token J
Token ID449
Feature activation-9.92e-01

Pos logit contributions
 depths+12.502
 darkest+12.073
orge+11.704
 synd+11.563
 liberation+11.225
Neg logit contributions
-15.040
ickets-13.694
 cs-13.042
skirts-13.040
vati-12.932
Tokenuc
Token ID1229
Feature activation-5.57e-01

Pos logit contributions
orge+8.726
 depths+7.767
elf+7.730
olphin+7.650
yrinth+7.644
Neg logit contributions
-10.594
 postp-10.415
 cs-10.013
 Belg-9.497
skirts-9.475
Tokenhe
Token ID258
Feature activation-1.31e+00

Pos logit contributions
orge+5.719
sent+5.263
yrinth+5.214
olphin+5.152
 depths+5.098
Neg logit contributions
-5.760
 postp-5.311
 cs-5.306
 Invention-4.998
GROUND-4.839
Token ideology
Token ID12959
Feature activation-1.37e-01

Pos logit contributions
orge+12.143
yrinth+10.915
 depths+10.880
elf+10.836
sent+10.803
Neg logit contributions
-11.753
ickets-10.859
 cs-10.493
 postp-10.303
vati-9.988
Tokenm
Token ID76
Feature activation-1.14e-01

Pos logit contributions
orge+1.842
yrinth+1.675
 depths+1.642
olphin+1.632
rss+1.629
Neg logit contributions
-1.763
 postp-1.709
 cs-1.694
 Invention-1.624
vati-1.595
Token4
Token ID19
Feature activation-1.59e-01

Pos logit contributions
orge+1.255
yrinth+1.146
olphin+1.126
 depths+1.124
rss+1.100
Neg logit contributions
-1.134
 postp-1.091
 cs-1.079
 Invention-1.036
vati-1.017
Tokenk
Token ID74
Feature activation-3.53e-02

Pos logit contributions
orge+1.684
yrinth+1.528
 depths+1.513
olphin+1.502
rss+1.487
Neg logit contributions
-1.614
 postp-1.558
 cs-1.543
 Invention-1.483
vati-1.448
Token [
Token ID685
Feature activation-6.88e-01

Pos logit contributions
orge+0.384
yrinth+0.350
 depths+0.347
olphin+0.346
XXXX+0.338
Neg logit contributions
-0.353
 postp-0.341
 cs-0.338
 Invention-0.324
vati-0.318
Token10
Token ID940
Feature activation-2.66e-01

Pos logit contributions
orge+4.478
sent+3.780
rss+3.744
XXXX+3.717
 depths+3.709
Neg logit contributions
-9.426
 cs-9.327
 postp-9.302
 Invention-9.128
vati-8.873
Token:
Token ID25
Feature activation-2.00e+00

Pos logit contributions
orge+2.685
yrinth+2.506
olphin+2.445
 depths+2.423
rss+2.354
Neg logit contributions
-2.863
 cs-2.801
 postp-2.773
 Invention-2.602
vati-2.591
Token48
Token ID2780
Feature activation-4.30e-02

Pos logit contributions
+4.636
sent+4.405
olphin+4.227
 rated+4.104
XXXX+4.017
Neg logit contributions
-23.539
 postp-23.521
 cs-23.093
 Invention-22.917
GROUND-22.697
Token]
Token ID60
Feature activation+5.42e-01

Pos logit contributions
orge+0.557
yrinth+0.553
olphin+0.529
 depths+0.521
ーテ+0.519
Neg logit contributions
-0.338
 cs-0.327
 postp-0.318
 Invention-0.290
vati-0.287
Token <
Token ID1279
Feature activation+7.78e-02

Pos logit contributions
+3.196
 cs+3.079
 postp+3.015
 Invention+2.660
vati+2.646
Neg logit contributions
orge-8.062
yrinth-7.638
olphin-7.556
 depths-7.513
rss-7.389
Tokenmom
Token ID32542
Feature activation+2.59e-02

Pos logit contributions
+0.829
 cs+0.816
 postp+0.811
 Invention+0.765
vati+0.751
Neg logit contributions
orge-0.794
yrinth-0.726
olphin-0.713
 depths-0.709
rss-0.694
Tokenper
Token ID525
Feature activation+1.83e-01

Pos logit contributions
+0.222
 postp+0.217
 cs+0.216
vati+0.203
ickets+0.198
Neg logit contributions
orge-0.313
yrinth-0.298
 depths-0.293
ーテ-0.290
olphin-0.289
TokenThese
Token ID4711
Feature activation-7.87e-01

Pos logit contributions
orge+5.834
XXXX+5.403
sent+5.312
olphin+5.166
yrinth+5.127
Neg logit contributions
 postp-5.891
 cs-5.784
-5.699
 Invention-5.593
vati-5.494
Token books
Token ID3835
Feature activation-3.00e-01

Pos logit contributions
orge+7.134
 depths+6.965
 lows+6.630
yrinth+6.549
 darkest+6.519
Neg logit contributions
-8.375
 cs-7.839
vati-7.577
 Invention-7.563
 postp-7.480
Token are
Token ID389
Feature activation-6.28e-01

Pos logit contributions
orge+2.718
 depths+2.456
yrinth+2.417
olphin+2.374
 WRITE+2.351
Neg logit contributions
-3.541
 cs-3.386
 postp-3.345
 Invention-3.212
vati-3.199
Token available
Token ID1695
Feature activation-8.85e-01

Pos logit contributions
orge+5.838
 rated+5.522
 depths+5.474
yrinth+5.450
 darkest+5.372
Neg logit contributions
-6.813
 cs-6.309
vati-6.207
 postp-6.167
 Invention-5.957
Token at
Token ID379
Feature activation-1.65e+00

Pos logit contributions
orge+7.180
 depths+6.785
 rated+6.725
yrinth+6.541
 WRITE+6.503
Neg logit contributions
-9.638
vati-9.262
 postp-9.010
 cs-8.990
ickets-8.690
Token most
Token ID749
Feature activation-1.99e+00

Pos logit contributions
 depths+13.802
orge+13.534
 rated+13.438
 Meridian+13.345
 Publisher+13.045
Neg logit contributions
vati-12.583
-12.512
 postp-11.796
idav-11.375
skirts-11.227
Token book
Token ID1492
Feature activation-1.18e+00

Pos logit contributions
 depths+15.914
 rated+15.449
orge+15.071
 Meridian+14.966
 Publisher+14.948
Neg logit contributions
vati-14.217
-14.043
skirts-13.588
idav-13.171
 postp-12.871
Tokenstores
Token ID43409
Feature activation-9.73e-01

Pos logit contributions
orge+7.715
 depths+7.194
 lows+6.951
rade+6.884
 rated+6.814
Neg logit contributions
-13.494
 cs-13.268
 Invention-13.245
vati-13.221
GROUND-12.723
Token that
Token ID326
Feature activation-6.34e-01

Pos logit contributions
orge+7.516
 rated+7.337
 depths+7.283
 darkest+6.889
 synd+6.887
Neg logit contributions
-10.590
vati-9.928
 cs-9.884
 postp-9.884
 Invention-9.662
Token have
Token ID423
Feature activation-1.08e+00

Pos logit contributions
orge+5.668
 depths+5.309
yrinth+5.088
 rated+5.086
 WRITE+5.085
Neg logit contributions
-7.181
 postp-6.633
vati-6.610
 cs-6.571
 Invention-6.323
Token a
Token ID257
Feature activation-9.96e-01

Pos logit contributions
 rated+9.180
 depths+9.095
orge+9.078
 WRITE+8.650
 synd+8.520
Neg logit contributions
-10.736
vati-9.756
 postp-9.720
 cs-9.448
skirts-9.232
Token return
Token ID1441
Feature activation+5.79e-01

Pos logit contributions
orge+0.845
 depths+0.771
yrinth+0.770
olphin+0.766
 WRITE+0.741
Neg logit contributions
-1.007
 cs-0.971
 postp-0.961
vati-0.921
 Invention-0.915
Token through
Token ID832
Feature activation-3.13e-01

Pos logit contributions
+5.344
 postp+5.309
 cs+5.264
 Offense+4.771
vati+4.737
Neg logit contributions
orge-6.528
yrinth-6.035
olphin-5.828
 depths-5.729
rss-5.700
Token the
Token ID262
Feature activation-7.52e-01

Pos logit contributions
orge+3.041
 depths+2.806
yrinth+2.790
olphin+2.756
 darkest+2.719
Neg logit contributions
-3.380
 cs-3.176
vati-3.095
 postp-3.095
ickets-3.005
Token V
Token ID569
Feature activation-3.94e-01

Pos logit contributions
 depths+6.600
orge+6.434
 lows+6.261
 darkest+6.172
yrinth+6.124
Neg logit contributions
-8.103
vati-7.557
 cs-7.419
 postp-7.168
skirts-7.051
TokenFL
Token ID3697
Feature activation-2.01e-01

Pos logit contributions
orge+2.867
olphin+2.550
yrinth+2.536
 depths+2.511
XXXX+2.439
Neg logit contributions
-5.256
 cs-5.080
 postp-5.048
vati-4.826
 Invention-4.816
Token this
Token ID428
Feature activation-1.94e+00

Pos logit contributions
orge+2.041
yrinth+1.926
 depths+1.885
olphin+1.882
 WRITE+1.819
Neg logit contributions
-2.109
 cs-2.057
 postp-2.015
vati-1.917
 Invention-1.913
Token week
Token ID1285
Feature activation-5.69e-02

Pos logit contributions
 rated+10.728
 darkest+10.478
 depths+10.373
orge+10.143
 synd+9.998
Neg logit contributions
-18.589
vati-17.080
skirts-16.896
ickets-16.753
 cs-16.610
Token after
Token ID706
Feature activation-4.48e-01

Pos logit contributions
orge+0.576
yrinth+0.529
olphin+0.525
 depths+0.525
rss+0.510
Neg logit contributions
-0.605
 cs-0.589
 postp-0.578
 Invention-0.552
vati-0.549
Token recovering
Token ID20222
Feature activation-9.79e-02

Pos logit contributions
orge+3.910
 depths+3.701
yrinth+3.595
olphin+3.509
 lows+3.491
Neg logit contributions
-5.128
 cs-4.910
vati-4.699
 postp-4.677
 Invention-4.634
Token from
Token ID422
Feature activation-1.03e+00

Pos logit contributions
orge+0.845
yrinth+0.763
 depths+0.761
olphin+0.752
 lows+0.726
Neg logit contributions
-1.176
 cs-1.153
 postp-1.150
 Invention-1.109
vati-1.086
Token a
Token ID257
Feature activation-1.04e+00

Pos logit contributions
orge+7.910
 depths+7.649
 lows+7.451
yrinth+7.447
 synd+7.177
Neg logit contributions
-10.981
 cs-10.180
skirts-10.159
vati-10.053
 Invention-9.988
Token more
Token ID517
Feature activation+3.22e-01

Pos logit contributions
+4.230
 postp+4.188
 cs+4.173
 Offense+3.670
 Invention+3.642
Neg logit contributions
orge-6.341
yrinth-5.991
olphin-5.866
 depths-5.729
rss-5.663
Token than
Token ID621
Feature activation-3.30e-01

Pos logit contributions
+2.944
 postp+2.896
 cs+2.896
 Invention+2.612
vati+2.604
Neg logit contributions
orge-3.718
yrinth-3.467
olphin-3.408
 depths-3.347
rss-3.306
Token there
Token ID612
Feature activation+5.21e-03

Pos logit contributions
orge+3.238
 depths+2.999
yrinth+2.970
 lows+2.914
 WRITE+2.899
Neg logit contributions
-3.531
 cs-3.425
 postp-3.337
vati-3.219
 Invention-3.200
Token were
Token ID547
Feature activation-1.46e-01

Pos logit contributions
+0.065
 cs+0.064
 postp+0.063
vati+0.060
 Invention+0.060
Neg logit contributions
orge-0.044
yrinth-0.039
olphin-0.039
 depths-0.039
rss-0.037
Token at
Token ID379
Feature activation-1.24e+00

Pos logit contributions
orge+1.421
yrinth+1.302
 depths+1.291
olphin+1.279
 WRITE+1.244
Neg logit contributions
-1.611
 cs-1.574
 postp-1.556
 Invention-1.477
vati-1.474
Token the
Token ID262
Feature activation-1.93e+00

Pos logit contributions
 depths+9.449
orge+8.936
 darkest+8.354
 lows+8.317
 rated+8.064
Neg logit contributions
-12.944
 cs-12.645
vati-12.629
ickets-12.293
 postp-12.210
Token start
Token ID923
Feature activation-3.77e-01

Pos logit contributions
 depths+14.300
 darkest+12.627
 lows+12.567
 rated+11.412
orge+11.297
Neg logit contributions
-17.644
vich-17.187
skirts-17.005
ickets-16.775
 cs-16.679
Token of
Token ID286
Feature activation-4.89e-01

Pos logit contributions
orge+2.801
 depths+2.629
olphin+2.554
yrinth+2.533
rss+2.462
Neg logit contributions
 postp-4.597
-4.582
 cs-4.518
 Invention-4.444
vati-4.364
Token his
Token ID465
Feature activation-2.37e-01

Pos logit contributions
orge+4.354
 depths+4.151
 darkest+3.987
yrinth+3.959
 lows+3.944
Neg logit contributions
-5.357
 cs-5.204
 postp-5.093
vati-5.012
ickets-4.936
Token first
Token ID717
Feature activation-4.09e-01

Pos logit contributions
orge+2.126
 depths+1.949
yrinth+1.930
olphin+1.881
 darkest+1.853
Neg logit contributions
-2.752
 cs-2.688
 postp-2.651
vati-2.563
 Invention-2.525
Token term
Token ID3381
Feature activation-1.77e-01

Pos logit contributions
orge+3.086
 depths+2.859
yrinth+2.747
 darkest+2.666
rss+2.663
Neg logit contributions
-5.146
 cs-5.035
 postp-4.961
vati-4.887
ickets-4.875
Token J
Token ID449
Feature activation-8.11e-01

Pos logit contributions
orge+6.683
 depths+6.223
 darkest+5.880
 WRITE+5.853
 lows+5.815
Neg logit contributions
-7.487
 cs-7.081
vati-6.780
 postp-6.671
ickets-6.343
Token M
Token ID337
Feature activation-5.49e-01

Pos logit contributions
orge+8.747
 depths+7.793
olphin+7.662
elf+7.519
yrinth+7.515
Neg logit contributions
-7.707
 postp-7.517
 cs-7.270
 Invention-6.930
skirts-6.862
Tokenasc
Token ID3372
Feature activation-3.51e-01

Pos logit contributions
orge+5.482
yrinth+4.701
olphin+4.651
XXXX+4.643
 depths+4.624
Neg logit contributions
-5.891
 cs-5.689
 postp-5.602
vati-5.373
 Invention-5.198
Tokenis
Token ID271
Feature activation-3.34e-01

Pos logit contributions
orge+4.050
yrinth+3.663
olphin+3.606
 depths+3.590
rss+3.537
Neg logit contributions
-3.367
 postp-3.221
 cs-3.212
 Invention-3.018
vati-2.885
Token,
Token ID11
Feature activation-1.10e+00

Pos logit contributions
orge+3.257
 depths+2.886
olphin+2.870
yrinth+2.856
rss+2.798
Neg logit contributions
-3.674
 cs-3.568
 postp-3.527
 Invention-3.307
ickets-3.284
Token and
Token ID290
Feature activation-1.91e+00

Pos logit contributions
orge+8.977
 depths+8.381
 rated+8.123
 darkest+7.913
 lows+7.898
Neg logit contributions
-11.702
 cs-10.632
vati-10.278
 postp-10.235
ickets-9.972
Token a
Token ID257
Feature activation-1.12e+00

Pos logit contributions
orge+14.750
 depths+14.033
 lyrics+13.958
 rated+13.676
 retrieval+13.316
Neg logit contributions
-15.533
vati-14.208
 cs-13.499
 Offense-12.870
microsoft-12.774
Token book
Token ID1492
Feature activation-2.55e-01

Pos logit contributions
orge+9.934
 depths+9.224
 darkest+9.218
 rated+9.193
 WRITE+9.100
Neg logit contributions
-11.108
vati-9.840
 cs-9.651
 Offense-9.534
skirts-9.421
Token of
Token ID286
Feature activation-4.27e-01

Pos logit contributions
orge+2.559
yrinth+2.353
 depths+2.303
olphin+2.290
rss+2.260
Neg logit contributions
-2.726
 cs-2.650
 postp-2.597
vati-2.502
 Invention-2.444
Token unused
Token ID21958
Feature activation-6.90e-01

Pos logit contributions
orge+4.164
 depths+3.848
yrinth+3.774
olphin+3.716
 darkest+3.714
Neg logit contributions
-4.651
 cs-4.392
 postp-4.291
vati-4.212
 Offense-4.018
Token work
Token ID670
Feature activation-3.34e-01

Pos logit contributions
orge+6.587
 depths+6.270
 lows+5.901
 darkest+5.889
 WRITE+5.843
Neg logit contributions
-7.335
vati-6.705
 cs-6.690
 postp-6.535
 Offense-6.290
Token will
Token ID481
Feature activation-4.52e-01

Pos logit contributions
orge+1.827
 depths+1.655
olphin+1.644
yrinth+1.638
rss+1.589
Neg logit contributions
-2.353
 cs-2.289
 postp-2.240
 Invention-2.191
vati-2.159
Token return
Token ID1441
Feature activation+2.62e-01

Pos logit contributions
orge+3.846
 depths+3.653
olphin+3.566
yrinth+3.501
 WRITE+3.449
Neg logit contributions
-5.237
 cs-4.940
vati-4.910
 postp-4.871
 Invention-4.823
Token to
Token ID284
Feature activation-1.63e-01

Pos logit contributions
+2.641
 cs+2.584
 postp+2.575
vati+2.371
 Invention+2.362
Neg logit contributions
orge-2.823
yrinth-2.609
olphin-2.551
 depths-2.523
rss-2.489
Token full
Token ID1336
Feature activation+4.53e-03

Pos logit contributions
orge+1.693
 depths+1.589
yrinth+1.577
olphin+1.567
rss+1.527
Neg logit contributions
-1.680
 cs-1.603
 postp-1.593
vati-1.530
 Invention-1.494
Token training
Token ID3047
Feature activation-4.09e-02

Pos logit contributions
+0.050
 cs+0.049
 postp+0.048
 Invention+0.046
vati+0.046
Neg logit contributions
orge-0.044
yrinth-0.041
 depths-0.040
olphin-0.040
rss-0.039
Token this
Token ID428
Feature activation-1.91e+00

Pos logit contributions
orge+0.417
yrinth+0.385
 depths+0.379
olphin+0.379
rss+0.366
Neg logit contributions
-0.434
 cs-0.423
 postp-0.419
 Invention-0.395
vati-0.394
Token week
Token ID1285
Feature activation-2.43e-02

Pos logit contributions
 rated+9.496
 darkest+9.140
 depths+9.103
orge+8.959
 synd+8.822
Neg logit contributions
-19.322
vati-18.005
ickets-17.986
 Invention-17.664
 cs-17.573
Token after
Token ID706
Feature activation-6.94e-01

Pos logit contributions
orge+0.250
yrinth+0.230
olphin+0.228
 depths+0.228
rss+0.220
Neg logit contributions
-0.255
 cs-0.249
 postp-0.246
 Invention-0.233
vati-0.231
Token an
Token ID281
Feature activation-1.66e-01

Pos logit contributions
orge+5.621
 depths+5.477
yrinth+5.195
 rated+5.091
 lows+5.078
Neg logit contributions
-7.902
 cs-7.499
vati-7.231
 Invention-7.216
skirts-7.214
Token abdominal
Token ID32692
Feature activation-3.53e-01

Pos logit contributions
orge+1.680
yrinth+1.548
 depths+1.527
olphin+1.501
XXXX+1.468
Neg logit contributions
-1.743
 cs-1.705
 postp-1.676
 Invention-1.600
vati-1.598
Token injury
Token ID5095
Feature activation+1.18e-01

Pos logit contributions
orge+3.195
 depths+2.971
yrinth+2.941
olphin+2.857
 WRITE+2.798
Neg logit contributions
-4.005
 cs-3.782
vati-3.703
skirts-3.674
 postp-3.663
Token to
Token ID284
Feature activation-8.20e-01

Pos logit contributions
orge+4.546
olphin+4.193
 depths+4.143
yrinth+4.068
XXXX+4.058
Neg logit contributions
-6.923
 postp-6.872
 Invention-6.840
 cs-6.736
 Offense-6.587
Token be
Token ID307
Feature activation-4.92e-01

Pos logit contributions
orge+7.189
 depths+6.999
 rated+6.700
olphin+6.658
yrinth+6.617
Neg logit contributions
-8.710
vati-7.928
skirts-7.871
 postp-7.853
 cs-7.849
Token available
Token ID1695
Feature activation-1.79e-01

Pos logit contributions
orge+4.684
 rated+4.341
olphin+4.300
 depths+4.290
yrinth+4.249
Neg logit contributions
-5.361
 cs-5.010
 postp-4.935
vati-4.839
ickets-4.758
Token for
Token ID329
Feature activation-5.40e-01

Pos logit contributions
orge+1.595
 depths+1.467
olphin+1.456
yrinth+1.452
 WRITE+1.400
Neg logit contributions
-2.082
 cs-2.018
 postp-1.995
 Invention-1.923
vati-1.919
Token selection
Token ID6356
Feature activation-4.05e-01

Pos logit contributions
orge+4.654
 depths+4.463
yrinth+4.417
 rated+4.339
olphin+4.334
Neg logit contributions
-6.110
vati-5.608
 cs-5.597
 postp-5.517
 Invention-5.380
Token this
Token ID428
Feature activation-1.91e+00

Pos logit contributions
orge+3.765
 depths+3.492
olphin+3.465
yrinth+3.423
rade+3.363
Neg logit contributions
-4.464
 cs-4.275
 postp-4.218
 Invention-4.116
vati-4.087
Token week
Token ID1285
Feature activation-4.06e-01

Pos logit contributions
 rated+11.090
 darkest+10.919
 depths+10.555
 grade+10.236
orge+10.213
Neg logit contributions
-18.342
vati-17.028
skirts-16.645
ickets-16.290
GROUND-15.968
Token at
Token ID379
Feature activation-1.28e+00

Pos logit contributions
orge+3.645
 depths+3.351
olphin+3.318
yrinth+3.297
sent+3.258
Neg logit contributions
-4.586
 cs-4.401
 postp-4.331
 Invention-4.228
vati-4.177
Token V
Token ID569
Feature activation-8.95e-01

Pos logit contributions
 rated+10.099
 depths+10.093
 darkest+9.796
 Meridian+9.493
rade+9.413
Neg logit contributions
-12.623
vati-11.874
 cs-11.191
 postp-11.037
microsoft-10.857
TokenFL
Token ID3697
Feature activation-8.18e-01

Pos logit contributions
orge+4.248
olphin+3.820
XXXX+3.639
 depths+3.574
elf+3.563
Neg logit contributions
-13.177
 cs-12.660
 postp-12.411
 Invention-12.146
 Offense-12.100
Token level
Token ID1241
Feature activation-2.91e-01

Pos logit contributions
orge+7.657
 depths+7.369
olphin+7.258
yrinth+7.235
 lows+7.206
Neg logit contributions
-8.110
 cs-7.711
vati-7.477
 postp-7.338
 Invention-7.185
Token as
Token ID355
Feature activation-8.34e-01

Pos logit contributions
+1.975
 cs+1.916
 postp+1.891
 Invention+1.745
vati+1.742
Neg logit contributions
orge-3.007
yrinth-2.810
olphin-2.779
 depths-2.776
rss-2.711
Token response
Token ID2882
Feature activation-4.67e-01

Pos logit contributions
 depths+7.474
orge+7.375
 rated+7.229
 lows+7.139
 WRITE+6.849
Neg logit contributions
-8.322
vati-7.996
skirts-7.746
 cs-7.693
ickets-7.673
Token and
Token ID290
Feature activation-3.23e-01

Pos logit contributions
orge+4.289
 depths+4.132
 lows+3.871
 rated+3.833
yrinth+3.818
Neg logit contributions
-5.140
 cs-5.024
vati-4.795
 Invention-4.793
 Offense-4.769
Token remission
Token ID47829
Feature activation-4.43e-01

Pos logit contributions
orge+3.112
 depths+2.948
yrinth+2.836
 lows+2.811
 rated+2.776
Neg logit contributions
-3.488
 cs-3.337
 postp-3.252
vati-3.247
ickets-3.189
Token rates
Token ID3965
Feature activation-2.24e-01

Pos logit contributions
orge+3.621
 depths+3.608
 lows+3.386
 rated+3.297
yrinth+3.265
Neg logit contributions
-5.281
 cs-5.064
vati-4.924
 postp-4.907
 Invention-4.863
Token at
Token ID379
Feature activation-1.90e+00

Pos logit contributions
orge+2.111
 depths+1.963
yrinth+1.922
olphin+1.906
 lows+1.876
Neg logit contributions
-2.500
 cs-2.438
 postp-2.385
 Invention-2.302
vati-2.290
Token week
Token ID1285
Feature activation-1.08e+00

Pos logit contributions
 depths+13.482
 rated+12.298
 lows+12.273
 retrieval+12.061
 levels+11.996
Neg logit contributions
-16.463
skirts-16.257
vich-15.893
vati-15.874
ickets-15.873
Token four
Token ID1440
Feature activation-7.61e-01

Pos logit contributions
 depths+7.873
 lows+7.475
orge+7.409
 rated+7.298
 retrieval+6.875
Neg logit contributions
-12.235
vati-11.865
 cs-11.687
ickets-11.646
skirts-11.273
Token.
Token ID13
Feature activation-1.17e+00

Pos logit contributions
orge+5.729
 depths+5.642
 lows+5.612
 rated+5.338
 WRITE+5.247
Neg logit contributions
-8.607
 cs-8.302
 Invention-8.114
 postp-8.085
vati-8.072
Token In
Token ID554
Feature activation-7.99e-01

Pos logit contributions
orge+10.096
 WRITE+9.793
 Meridian+9.609
 depths+9.496
XXXX+9.457
Neg logit contributions
 postp-10.723
-10.593
vati-10.419
ickets-10.336
 cs-10.078
Token addition
Token ID3090
Feature activation-8.49e-01

Pos logit contributions
orge+6.240
 depths+6.210
yrinth+5.864
 lows+5.832
 rated+5.817
Neg logit contributions
-8.788
ickets-8.590
 Invention-8.166
 cs-8.153
 postp-8.119
Token and
Token ID290
Feature activation-6.47e-01

Pos logit contributions
orge+6.984
olphin+6.584
XXXX+6.458
alian+6.265
 rated+6.220
Neg logit contributions
-10.213
vati-9.805
ickets-9.738
 cs-9.508
 postp-9.484
Token what
Token ID644
Feature activation-7.40e-01

Pos logit contributions
orge+6.068
 depths+5.526
olphin+5.428
yrinth+5.383
 WRITE+5.313
Neg logit contributions
-6.803
vati-6.411
 cs-6.411
 postp-6.256
ickets-6.230
Token it
Token ID340
Feature activation-1.93e-01

Pos logit contributions
orge+6.747
 depths+6.154
XXXX+5.834
 lows+5.782
 darkest+5.780
Neg logit contributions
-7.769
vati-7.401
 postp-7.378
 cs-7.352
ickets-7.250
Token means
Token ID1724
Feature activation-1.01e+00

Pos logit contributions
orge+2.179
 depths+2.014
yrinth+1.996
olphin+1.968
alian+1.946
Neg logit contributions
-1.774
 cs-1.735
 postp-1.701
vati-1.672
 Invention-1.641
Token to
Token ID284
Feature activation-1.29e+00

Pos logit contributions
orge+6.542
alian+6.009
elf+5.855
 depths+5.834
olphin+5.805
Neg logit contributions
 Offense-11.158
vati-11.135
-10.963
 cs-10.886
 postp-10.839
Token be
Token ID307
Feature activation-1.89e+00

Pos logit contributions
orge+9.257
 depths+8.462
 subjug+8.375
 WRITE+8.280
XXXX+8.099
Neg logit contributions
vati-13.767
-12.915
skirts-12.758
ickets-12.738
 postp-12.629
Token a
Token ID257
Feature activation-1.13e+00

Pos logit contributions
 rated+12.693
orge+12.549
sent+11.905
 subjug+11.680
 transsexual+11.677
Neg logit contributions
ickets-16.293
-15.916
 Offense-15.525
vati-15.350
 postp-14.861
Token trans
Token ID1007
Feature activation-3.74e-01

Pos logit contributions
orge+9.044
olphin+8.729
 depths+8.425
XXXX+8.360
sent+8.351
Neg logit contributions
-11.403
ickets-10.813
vati-10.620
 Offense-10.573
 postp-10.461
Tokenhuman
Token ID10734
Feature activation-5.06e-01

Pos logit contributions
orge+3.355
olphin+3.009
sent+2.912
XXXX+2.906
rss+2.880
Neg logit contributions
-4.319
 postp-4.191
 cs-4.184
ickets-4.053
vati-4.022
Token.
Token ID13
Feature activation-9.70e-01

Pos logit contributions
orge+3.709
olphin+3.401
XXXX+3.210
sent+3.168
 depths+3.168
Neg logit contributions
-6.465
 cs-6.226
 postp-6.201
ickets-6.173
vati-6.162
Token That
Token ID1320
Feature activation-8.59e-01

Pos logit contributions
orge+8.107
rss+7.487
XXXX+7.464
olphin+7.255
sent+7.237
Neg logit contributions
 postp-9.910
-9.580
vati-9.495
 cs-9.352
ickets-9.231
Token m
Token ID285
Feature activation-4.06e-01

Pos logit contributions
+2.030
 cs+1.963
 postp+1.946
 Invention+1.795
vati+1.788
Neg logit contributions
orge-3.052
yrinth-2.851
olphin-2.812
 depths-2.811
rss-2.742
Token2
Token ID17
Feature activation-3.61e-02

Pos logit contributions
orge+4.445
yrinth+3.985
olphin+3.977
 depths+3.948
rss+3.906
Neg logit contributions
-4.032
 cs-3.912
 postp-3.909
 Invention-3.699
vati-3.606
Tokenk
Token ID74
Feature activation-7.93e-02

Pos logit contributions
orge+0.448
yrinth+0.435
olphin+0.422
 depths+0.419
 WRITE+0.414
Neg logit contributions
-0.299
 cs-0.296
 postp-0.288
vati-0.268
 Invention-0.262
Token [
Token ID685
Feature activation-6.48e-01

Pos logit contributions
orge+0.886
yrinth+0.817
 depths+0.808
olphin+0.807
rss+0.786
Neg logit contributions
-0.766
 cs-0.742
 postp-0.738
 Invention-0.691
vati-0.688
Token10
Token ID940
Feature activation-1.82e-01

Pos logit contributions
orge+4.280
sent+3.605
rss+3.551
XXXX+3.546
 depths+3.540
Neg logit contributions
-8.860
 cs-8.779
 postp-8.751
 Invention-8.576
vati-8.329
Token:
Token ID25
Feature activation-1.88e+00

Pos logit contributions
orge+1.897
yrinth+1.813
olphin+1.748
 depths+1.721
 WRITE+1.673
Neg logit contributions
-1.910
 cs-1.870
 postp-1.845
 Invention-1.728
vati-1.715
Token48
Token ID2780
Feature activation+3.13e-02

Pos logit contributions
+3.483
sent+3.416
olphin+3.350
dale+3.135
 rated+3.094
Neg logit contributions
-23.689
 postp-23.566
 cs-23.143
 Invention-23.052
GROUND-22.817
Token]
Token ID60
Feature activation+5.58e-01

Pos logit contributions
+0.237
 cs+0.229
 postp+0.222
 Invention+0.202
 Offense+0.199
Neg logit contributions
yrinth-0.414
orge-0.411
olphin-0.393
ーテ-0.390
 depths-0.387
Token <
Token ID1279
Feature activation+1.80e-01

Pos logit contributions
+3.256
 cs+3.142
 postp+3.069
 Invention+2.699
vati+2.686
Neg logit contributions
orge-8.319
yrinth-7.889
olphin-7.804
 depths-7.752
rss-7.627
Tokenqu
Token ID421
Feature activation+3.45e-01

Pos logit contributions
+1.950
 cs+1.911
 postp+1.897
 Invention+1.785
vati+1.769
Neg logit contributions
orge-1.817
yrinth-1.665
olphin-1.638
 depths-1.629
rss-1.589
Tokenij
Token ID2926
Feature activation-4.01e-01

Pos logit contributions
+2.349
 cs+2.255
 postp+2.223
vati+2.042
 Invention+2.002
Neg logit contributions
orge-4.766
yrinth-4.495
 depths-4.462
olphin-4.446
 WRITE-4.374
Token punishment
Token ID9837
Feature activation-4.41e-01

Pos logit contributions
+0.903
 cs+0.890
 postp+0.888
 Invention+0.820
vati+0.814
Neg logit contributions
orge-1.031
yrinth-0.957
olphin-0.941
 depths-0.933
rss-0.914
Token for
Token ID329
Feature activation-1.16e+00

Pos logit contributions
orge+3.751
 depths+3.386
olphin+3.351
sent+3.289
yrinth+3.274
Neg logit contributions
-5.293
 cs-5.009
vati-4.824
ickets-4.823
 postp-4.814
Token the
Token ID262
Feature activation-1.19e+00

Pos logit contributions
orge+9.568
 depths+9.566
 lows+9.230
 darkest+9.107
 synd+9.041
Neg logit contributions
-11.812
vati-10.695
 cs-10.538
ickets-10.317
 Invention-10.223
Token perceived
Token ID11067
Feature activation-9.76e-01

Pos logit contributions
 depths+10.544
orge+10.131
 darkest+10.041
 lows+9.774
 synd+9.652
Neg logit contributions
-11.951
 cs-10.568
ickets-10.551
vati-10.443
skirts-9.867
Token crimes
Token ID6741
Feature activation-3.81e-01

Pos logit contributions
 depths+8.385
orge+8.289
 lows+8.083
 darkest+7.830
 subjug+7.620
Neg logit contributions
-10.354
 cs-9.514
ickets-9.487
vati-9.301
 Invention-9.053
Token of
Token ID286
Feature activation-1.86e+00

Pos logit contributions
orge+3.375
 depths+3.043
olphin+2.964
yrinth+2.921
 lows+2.916
Neg logit contributions
-4.502
 cs-4.276
 postp-4.193
vati-4.109
 Invention-4.079
Token watching
Token ID4964
Feature activation-1.23e+00

Pos logit contributions
 depths+13.000
 synd+12.985
orge+12.851
 Pyongyang+12.830
 subjug+12.736
Neg logit contributions
-17.217
vati-15.936
ickets-15.084
 cs-14.958
 Invention-14.245
Token South
Token ID2520
Feature activation-7.53e-01

Pos logit contributions
orge+9.851
 depths+9.411
 synd+9.387
 rated+9.351
 darkest+9.136
Neg logit contributions
-12.817
vati-11.583
ickets-11.224
 cs-11.195
 Invention-10.976
Token Korean
Token ID6983
Feature activation-4.39e-01

Pos logit contributions
orge+3.447
 depths+2.853
sent+2.815
 rated+2.782
olphin+2.714
Neg logit contributions
-10.968
 cs-10.640
vati-10.461
 postp-10.382
 Invention-10.336
Token movies
Token ID6918
Feature activation-4.54e-01

Pos logit contributions
orge+5.170
 depths+4.867
yrinth+4.718
 rated+4.707
 lows+4.698
Neg logit contributions
-3.869
 cs-3.524
 postp-3.429
vati-3.392
 Invention-3.314
Token,
Token ID11
Feature activation-6.49e-01

Pos logit contributions
orge+3.959
 depths+3.622
 rated+3.615
yrinth+3.538
olphin+3.537
Neg logit contributions
-5.240
 cs-5.022
 postp-4.884
vati-4.783
 Invention-4.774
Token,
Token ID11
Feature activation-6.98e-01

Pos logit contributions
orge+0.733
yrinth+0.666
olphin+0.659
 depths+0.657
rss+0.636
Neg logit contributions
-0.829
 cs-0.806
 postp-0.794
vati-0.764
 Invention-0.753
Token $
Token ID720
Feature activation-5.27e-01

Pos logit contributions
orge+4.511
yrinth+4.164
 depths+4.053
 rated+4.025
olphin+4.025
Neg logit contributions
-9.021
 postp-8.815
 cs-8.781
vati-8.659
 Invention-8.426
Token22
Token ID1828
Feature activation-9.76e-02

Pos logit contributions
orge+4.382
XXXX+4.000
olphin+3.847
sent+3.838
rss+3.836
Neg logit contributions
 postp-6.289
 cs-6.230
-6.154
vati-5.943
 Invention-5.844
TokenMM
Token ID12038
Feature activation-6.36e-02

Pos logit contributions
orge+0.866
yrinth+0.788
 depths+0.784
olphin+0.776
rss+0.746
Neg logit contributions
-1.152
 postp-1.137
 cs-1.131
vati-1.093
 Invention-1.071
Token contract
Token ID2775
Feature activation-8.98e-01

Pos logit contributions
orge+0.625
yrinth+0.572
olphin+0.565
 depths+0.561
rss+0.545
Neg logit contributions
-0.702
 cs-0.683
 postp-0.676
vati-0.640
 Invention-0.638
Token this
Token ID428
Feature activation-1.85e+00

Pos logit contributions
orge+6.842
 depths+6.511
 rated+6.493
 WRITE+6.406
sent+6.360
Neg logit contributions
-9.932
 cs-9.444
vati-9.364
skirts-9.145
 Invention-9.134
Token past
Token ID1613
Feature activation-1.36e+00

Pos logit contributions
 depths+10.555
 darkest+10.213
orge+10.096
sent+10.080
 rated+10.062
Neg logit contributions
-18.432
vati-17.798
ickets-16.956
skirts-16.802
 Prosecut-16.712
Token offseason
Token ID16349
Feature activation-3.98e-01

Pos logit contributions
 depths+8.490
 darkest+7.988
orge+7.957
 synd+7.731
 lows+7.724
Neg logit contributions
-15.105
vati-14.543
skirts-13.962
 cs-13.914
ickets-13.779
Token due
Token ID2233
Feature activation-4.65e-01

Pos logit contributions
orge+3.546
 depths+3.284
olphin+3.260
sent+3.186
rss+3.174
Neg logit contributions
-4.419
 cs-4.351
 postp-4.214
vati-4.157
 Invention-4.147
Token largely
Token ID5688
Feature activation-2.02e-01

Pos logit contributions
orge+3.925
olphin+3.855
 depths+3.724
 lows+3.548
yrinth+3.515
Neg logit contributions
-5.154
 cs-5.027
 postp-4.971
 Invention-4.963
vati-4.875
Token to
Token ID284
Feature activation-1.16e+00

Pos logit contributions
orge+1.865
yrinth+1.699
olphin+1.690
 depths+1.684
rss+1.653
Neg logit contributions
-2.211
 cs-2.198
 postp-2.159
 Invention-2.109
vati-2.048
Token Koreans
Token ID25999
Feature activation-4.63e-01

Pos logit contributions
orge+5.147
ongyang+4.660
olphin+4.640
sent+4.625
 depths+4.513
Neg logit contributions
-13.337
 cs-12.422
vati-12.196
 postp-12.051
 Invention-11.993
Token are
Token ID389
Feature activation-3.68e-01

Pos logit contributions
orge+4.375
 depths+3.962
olphin+3.905
yrinth+3.841
sent+3.829
Neg logit contributions
-5.243
 cs-4.854
 postp-4.814
ickets-4.708
vati-4.699
Token forced
Token ID4137
Feature activation-7.01e-01

Pos logit contributions
orge+3.793
 depths+3.472
olphin+3.425
yrinth+3.414
 rated+3.381
Neg logit contributions
-3.883
 cs-3.647
 postp-3.519
vati-3.493
 Invention-3.435
Token to
Token ID284
Feature activation-6.05e-01

Pos logit contributions
orge+5.672
 depths+5.123
olphin+5.060
 synd+4.950
 rated+4.885
Neg logit contributions
-8.306
 cs-7.900
 Invention-7.577
 Offense-7.554
 postp-7.496
Token adhere
Token ID26325
Feature activation-1.02e+00

Pos logit contributions
orge+5.122
 depths+4.713
 WRITE+4.612
olphin+4.534
yrinth+4.476
Neg logit contributions
-7.190
vati-6.664
 cs-6.656
 postp-6.581
 Invention-6.464
Token to
Token ID284
Feature activation-1.85e+00

Pos logit contributions
orge+7.126
 depths+6.728
elf+6.573
 darkest+6.384
yrinth+6.195
Neg logit contributions
-11.629
 Invention-11.288
 Offense-11.214
 cs-11.187
 postp-10.772
Token a
Token ID257
Feature activation-1.67e+00

Pos logit contributions
 depths+13.180
orge+12.863
 darkest+12.679
 synd+12.582
 rated+12.419
Neg logit contributions
-17.056
 cs-15.141
ickets-14.834
vati-14.807
 Offense-13.924
Token doctrine
Token ID13115
Feature activation-2.54e-01

Pos logit contributions
 synd+12.148
 darkest+12.109
 depths+11.943
 rated+11.687
orge+11.678
Neg logit contributions
-16.424
 cs-14.330
ickets-14.259
vati-14.208
skirts-13.633
Token which
Token ID543
Feature activation-2.83e-01

Pos logit contributions
orge+2.479
 depths+2.332
yrinth+2.288
olphin+2.259
 rated+2.244
Neg logit contributions
-2.759
 cs-2.639
 postp-2.545
ickets-2.509
vati-2.496
Token mixes
Token ID33237
Feature activation-1.25e+00

Pos logit contributions
orge+2.820
 depths+2.638
yrinth+2.564
olphin+2.545
 lows+2.534
Neg logit contributions
-3.035
 cs-2.884
 postp-2.769
vati-2.758
 Invention-2.704
Token Marxism
Token ID30634
Feature activation-4.95e-01

Pos logit contributions
orge+10.648
 depths+10.396
 darkest+10.191
 rated+10.050
 lows+10.028
Neg logit contributions
-12.368
vati-10.969
ickets-10.791
 cs-10.778
 Invention-10.222
Token ideology
Token ID12959
Feature activation-1.37e-01

Pos logit contributions
orge+12.143
yrinth+10.915
 depths+10.880
elf+10.836
sent+10.803
Neg logit contributions
-11.753
ickets-10.859
 cs-10.493
 postp-10.303
vati-9.988
Token -
Token ID532
Feature activation-9.55e-01

Pos logit contributions
orge+1.399
yrinth+1.288
 depths+1.282
olphin+1.268
rss+1.238
Neg logit contributions
-1.449
 cs-1.400
 postp-1.384
 Invention-1.304
vati-1.303
Token a
Token ID257
Feature activation-1.41e+00

Pos logit contributions
orge+8.589
yrinth+7.910
 depths+7.905
 rated+7.888
sent+7.824
Neg logit contributions
-9.564
 cs-8.912
ickets-8.736
 postp-8.609
vati-8.355
Token doctrine
Token ID13115
Feature activation+1.12e-01

Pos logit contributions
 darkest+10.475
orge+10.364
 depths+10.295
 synd+10.281
 retrieval+9.914
Neg logit contributions
-14.528
ickets-13.273
 cs-12.673
skirts-12.528
vati-12.194
Token which
Token ID543
Feature activation+3.03e-01

Pos logit contributions
+1.087
 cs+1.059
 postp+1.049
 Invention+0.978
vati+0.975
Neg logit contributions
orge-1.254
yrinth-1.162
olphin-1.146
 depths-1.142
rss-1.114
Token mixes
Token ID33237
Feature activation-1.84e+00

Pos logit contributions
+2.874
 postp+2.845
 cs+2.840
 Invention+2.611
vati+2.582
Neg logit contributions
orge-3.386
yrinth-3.162
olphin-3.106
 depths-3.029
rss-3.008
Token Marxism
Token ID30634
Feature activation-4.42e-01

Pos logit contributions
orge+10.241
 depths+10.150
 liberation+9.955
 synd+9.748
 darkest+9.560
Neg logit contributions
-17.865
ickets-16.696
 cs-16.119
 postp-16.002
idav-15.981
Token with
Token ID351
Feature activation-1.13e+00

Pos logit contributions
orge+3.662
 depths+3.222
yrinth+3.204
olphin+3.197
rss+3.087
Neg logit contributions
-5.525
 cs-5.335
 postp-5.325
 Invention-5.053
vati-5.032
Token the
Token ID262
Feature activation-8.18e-01

Pos logit contributions
orge+7.711
 depths+7.500
 liberation+7.281
 darkest+7.085
 Pyongyang+7.039
Neg logit contributions
-12.644
 cs-11.818
ickets-11.296
vati-10.983
 postp-10.941
Token worship
Token ID11892
Feature activation-4.65e-01

Pos logit contributions
 depths+5.042
orge+4.977
 darkest+4.735
 liberation+4.524
 retrieval+4.422
Neg logit contributions
-10.485
 cs-10.009
ickets-9.633
skirts-9.498
 postp-9.432
Token of
Token ID286
Feature activation-1.43e+00

Pos logit contributions
orge+4.432
 depths+3.983
yrinth+3.969
olphin+3.962
rss+3.837
Neg logit contributions
-5.236
 cs-5.091
 postp-5.057
 Invention-4.783
vati-4.753
Token will
Token ID481
Feature activation-1.04e+00

Pos logit contributions
 depths+13.558
 rated+13.233
orge+12.293
 relayed+12.147
 lows+11.985
Neg logit contributions
-15.489
ickets-15.165
skirts-14.881
vati-14.832
 Invention-14.680
Token identify
Token ID5911
Feature activation-8.27e-01

Pos logit contributions
 depths+8.386
orge+8.235
yrinth+7.514
 WRITE+7.514
 rated+7.419
Neg logit contributions
vati-11.000
-10.847
 postp-10.362
 Invention-10.337
skirts-10.265
Token those
Token ID883
Feature activation-1.14e+00

Pos logit contributions
 depths+7.541
orge+7.218
yrinth+6.888
sent+6.847
 lows+6.825
Neg logit contributions
-8.314
 postp-7.980
vati-7.978
 cs-7.830
 Invention-7.779
Token patterns
Token ID7572
Feature activation-3.29e-01

Pos logit contributions
 depths+9.702
orge+9.072
 lows+8.810
 darkest+8.730
 rated+8.551
Neg logit contributions
-11.692
 postp-10.870
vati-10.827
ickets-10.820
 Invention-10.758
Token and
Token ID290
Feature activation-1.72e+00

Pos logit contributions
orge+3.014
 depths+2.796
yrinth+2.727
olphin+2.693
rss+2.677
Neg logit contributions
-3.703
 cs-3.613
 postp-3.608
vati-3.418
 Invention-3.416
Token simultaneously
Token ID11640
Feature activation-1.84e+00

Pos logit contributions
 depths+14.205
orge+13.707
 retrieval+12.759
 WRITE+12.692
 rated+12.545
Neg logit contributions
vati-14.669
-14.567
skirts-13.405
 Invention-13.265
paren-13.068
Token send
Token ID3758
Feature activation-1.44e+00

Pos logit contributions
 depths+13.784
orge+13.489
 read+13.116
 WRITE+12.778
 rated+12.720
Neg logit contributions
vati-16.396
-15.716
skirts-15.385
ickets-14.644
 Invention-14.593
Token them
Token ID606
Feature activation-8.79e-01

Pos logit contributions
 depths+10.853
orge+10.571
sent+10.505
 retrieval+10.450
 rated+10.264
Neg logit contributions
-13.543
vati-13.112
 postp-12.637
 Prosecut-12.477
 Offense-12.451
Token to
Token ID284
Feature activation-1.19e+00

Pos logit contributions
orge+6.871
 depths+6.661
sent+6.479
olphin+6.407
 rated+6.253
Neg logit contributions
-9.710
 cs-9.269
 postp-9.236
 Invention-9.032
 Offense-8.985
Token an
Token ID281
Feature activation-9.38e-01

Pos logit contributions
 depths+9.854
orge+9.828
 Meridian+8.850
yrinth+8.722
 retrieval+8.639
Neg logit contributions
-11.482
 postp-11.365
vati-11.334
ickets-10.690
 Offense-10.576
Token external
Token ID7097
Feature activation-1.04e+00

Pos logit contributions
orge+9.489
 depths+8.794
rss+8.349
XXXX+8.261
 rated+8.172
Neg logit contributions
 postp-8.373
vati-8.154
-8.017
 cs-7.939
skirts-7.852
Token,
Token ID11
Feature activation-5.47e-01

Pos logit contributions
orge+3.894
 depths+3.490
yrinth+3.488
rss+3.406
olphin+3.404
Neg logit contributions
-4.949
 postp-4.832
 cs-4.760
vati-4.675
 Invention-4.527
Token published
Token ID3199
Feature activation-3.53e-01

Pos logit contributions
orge+5.306
yrinth+4.896
 depths+4.883
olphin+4.814
sent+4.744
Neg logit contributions
-5.578
 postp-5.389
 cs-5.290
vati-5.229
ickets-5.062
Token by
Token ID416
Feature activation-6.63e-01

Pos logit contributions
orge+3.733
 depths+3.489
yrinth+3.467
olphin+3.382
 WRITE+3.364
Neg logit contributions
-3.433
 cs-3.294
 postp-3.276
vati-3.204
 Invention-3.088
Token Sp
Token ID1338
Feature activation-8.68e-02

Pos logit contributions
orge+6.649
 depths+6.201
 Meridian+6.003
XXXX+5.927
 WRITE+5.923
Neg logit contributions
-6.352
vati-6.144
 postp-6.040
 cs-6.011
ickets-5.583
Tokenurl
Token ID6371
Feature activation-1.04e+00

Pos logit contributions
orge+0.944
yrinth+0.858
olphin+0.853
 depths+0.844
rss+0.820
Neg logit contributions
-0.877
 cs-0.855
 postp-0.853
 Invention-0.794
vati-0.787
Token this
Token ID428
Feature activation-1.82e+00

Pos logit contributions
orge+9.866
rade+8.735
yrinth+8.733
 depths+8.573
ragon+8.410
Neg logit contributions
-9.657
vati-9.587
 postp-9.248
 cs-9.199
GROUND-8.559
Token month
Token ID1227
Feature activation-5.69e-01

Pos logit contributions
orge+12.105
 depths+11.639
 darkest+11.342
 synd+10.951
 rated+10.748
Neg logit contributions
-16.851
vati-16.254
ickets-15.956
 postp-15.275
 cs-15.115
Token.
Token ID13
Feature activation-4.63e-01

Pos logit contributions
orge+4.814
 depths+4.323
yrinth+4.314
olphin+4.210
sent+4.205
Neg logit contributions
-6.444
 cs-6.262
 postp-6.225
vati-6.146
 Invention-5.964
Token
Token ID198
Feature activation+2.85e-01

Pos logit contributions
orge+4.463
 WRITE+3.968
XXXX+3.954
 depths+3.950
olphin+3.877
Neg logit contributions
-4.914
 postp-4.909
vati-4.786
 cs-4.723
ickets-4.454
Token
Token ID198
Feature activation-3.50e-01

Pos logit contributions
+2.211
 cs+2.138
 postp+2.036
 Invention+1.887
 Offense+1.874
Neg logit contributions
yrinth-3.739
orge-3.609
ーテ-3.535
olphin-3.492
psey-3.386
Token
Token ID447
Feature activation+6.10e-02

Pos logit contributions
orge+3.581
XXXX+3.259
sent+3.207
rss+3.135
olphin+3.131
Neg logit contributions
 postp-3.619
 cs-3.553
-3.551
vati-3.430
 Invention-3.363
Token6
Token ID21
Feature activation+1.17e+00

Pos logit contributions
+7.678
 cs+7.473
 postp+7.353
vati+6.946
 Invention+6.783
Neg logit contributions
orge-9.366
yrinth-8.744
 depths-8.534
olphin-8.467
 WRITE-8.372
Token percent
Token ID1411
Feature activation+1.17e+00

Pos logit contributions
+8.430
 cs+8.415
 postp+8.176
 Invention+7.406
ickets+7.073
Neg logit contributions
orge-14.567
yrinth-13.550
olphin-13.267
rade-12.804
XXXX-12.697
Token of
Token ID286
Feature activation+7.36e-01

Pos logit contributions
 postp+10.180
 cs+9.977
+9.733
 defer+9.010
 err+8.869
Neg logit contributions
orge-11.719
yrinth-11.285
olphin-10.823
rss-10.395
elf-10.335
Token public
Token ID1171
Feature activation-2.63e-01

Pos logit contributions
 cs+7.448
 postp+7.396
+7.223
 Invention+6.704
 Offense+6.590
Neg logit contributions
orge-7.581
yrinth-6.993
olphin-6.848
elf-6.435
ragon-6.405
Token school
Token ID1524
Feature activation+2.85e-01

Pos logit contributions
orge+2.343
 depths+2.213
 rated+2.140
olphin+2.118
 lows+2.118
Neg logit contributions
-3.018
 cs-2.831
 postp-2.823
vati-2.772
 Invention-2.763
Token teachers
Token ID7799
Feature activation+1.23e+00

Pos logit contributions
+3.159
 cs+3.123
 postp+3.094
 Invention+2.880
vati+2.858
Neg logit contributions
orge-2.773
yrinth-2.536
olphin-2.496
 depths-2.448
rss-2.399
Token change
Token ID1487
Feature activation+1.02e+00

Pos logit contributions
 postp+11.351
 cs+11.083
+10.339
 err+10.245
 defer+10.140
Neg logit contributions
yrinth-11.388
orge-11.381
olphin-10.808
rss-10.322
ーテ-10.133
Token occupations
Token ID23308
Feature activation+1.08e+00

Pos logit contributions
+7.026
 postp+6.999
 cs+6.973
 Invention+6.195
 Offense+5.904
Neg logit contributions
orge-12.619
yrinth-12.011
olphin-11.948
alian-11.381
rss-11.294
Token or
Token ID393
Feature activation+8.21e-01

Pos logit contributions
+9.449
 postp+9.416
 cs+9.269
 Offense+8.455
 Invention+8.443
Neg logit contributions
orge-11.058
yrinth-10.434
olphin-10.212
ongyang-9.738
 depths-9.396
Token exit
Token ID8420
Feature activation+1.36e+00

Pos logit contributions
 postp+7.673
 cs+7.521
+7.352
 err+6.698
 Invention+6.657
Neg logit contributions
orge-8.552
yrinth-8.108
olphin-8.049
rss-7.586
ragon-7.581
Token the
Token ID262
Feature activation+6.45e-01

Pos logit contributions
 cs+10.723
 postp+10.514
+10.477
 Invention+9.612
 Emer+9.328
Neg logit contributions
orge-14.701
yrinth-13.401
olphin-13.237
alian-12.622
sent-12.580
Token.
Token ID13
Feature activation+6.11e-01

Pos logit contributions
+2.998
 cs+2.949
 postp+2.935
 Invention+2.717
vati+2.716
Neg logit contributions
orge-3.245
yrinth-3.026
olphin-2.958
 depths-2.942
rss-2.860
Token This
Token ID770
Feature activation+2.04e-01

Pos logit contributions
 cs+5.399
+5.330
 postp+5.162
 Invention+5.089
 Offense+4.949
Neg logit contributions
orge-6.852
yrinth-6.790
olphin-6.497
 depths-6.357
ongyang-6.243
Token paper
Token ID3348
Feature activation+5.38e-01

Pos logit contributions
+2.236
 cs+2.201
 postp+2.191
 Invention+2.051
vati+2.029
Neg logit contributions
orge-2.012
yrinth-1.852
olphin-1.824
 depths-1.797
rss-1.754
Token presents
Token ID10969
Feature activation+8.26e-01

Pos logit contributions
 postp+5.565
 cs+5.464
+5.462
 Invention+5.024
vati+4.985
Neg logit contributions
orge-5.355
yrinth-5.034
olphin-4.966
rss-4.731
ーテ-4.718
Token a
Token ID257
Feature activation+5.56e-01

Pos logit contributions
+8.175
 cs+8.097
 postp+8.046
 Invention+7.477
vati+7.333
Neg logit contributions
orge-8.188
yrinth-7.655
olphin-7.480
rss-7.266
sent-7.192
Token new
Token ID649
Feature activation+1.10e+00

Pos logit contributions
 postp+6.028
 cs+6.025
+5.988
 Invention+5.532
vati+5.431
Neg logit contributions
orge-5.237
yrinth-4.879
olphin-4.702
 depths-4.577
 WRITE-4.560
Token methodology
Token ID20411
Feature activation+7.59e-01

Pos logit contributions
 cs+10.385
+10.371
 postp+10.290
 Invention+9.655
 Offense+9.299
Neg logit contributions
orge-10.407
yrinth-9.676
olphin-9.595
XXXX-9.215
rss-9.189
Token,
Token ID11
Feature activation+7.15e-01

Pos logit contributions
 cs+7.070
+7.022
 postp+7.012
 Invention+6.385
vati+6.332
Neg logit contributions
orge-7.995
yrinth-7.527
olphin-7.343
 depths-7.146
 WRITE-7.085
Token a
Token ID257
Feature activation+3.99e-01

Pos logit contributions
 postp+7.143
 cs+7.067
+7.064
 Invention+6.519
vati+6.380
Neg logit contributions
orge-7.159
yrinth-6.668
olphin-6.611
rss-6.369
ragon-6.331
Token new
Token ID649
Feature activation+6.90e-01

Pos logit contributions
+3.989
 postp+3.977
 cs+3.959
 Invention+3.628
vati+3.578
Neg logit contributions
orge-4.210
yrinth-3.917
olphin-3.840
 depths-3.753
rss-3.704
Token approach
Token ID3164
Feature activation+5.46e-01

Pos logit contributions
+6.876
 cs+6.853
 postp+6.824
 Invention+6.327
 Offense+6.111
Neg logit contributions
orge-7.006
yrinth-6.515
olphin-6.438
rss-6.209
XXXX-6.153
Token to
Token ID284
Feature activation-1.79e-01

Pos logit contributions
+5.177
 postp+5.146
 cs+5.038
 Invention+4.597
 err+4.577
Neg logit contributions
orge-6.205
yrinth-5.882
olphin-5.645
 depths-5.615
rss-5.530
Token the
Token ID262
Feature activation-1.61e-01

Pos logit contributions
orge+1.812
 depths+1.683
yrinth+1.647
olphin+1.624
 WRITE+1.607
Neg logit contributions
-1.914
 cs-1.825
 postp-1.800
vati-1.731
 Invention-1.723
Token angle
Token ID9848
Feature activation+3.99e-01

Pos logit contributions
orge+1.591
 depths+1.465
yrinth+1.446
olphin+1.429
 WRITE+1.404
Neg logit contributions
-1.753
 cs-1.690
 postp-1.660
vati-1.588
 Invention-1.586
Token-
Token ID12
Feature activation+4.17e-01

Pos logit contributions
+4.212
 cs+4.148
 postp+4.111
 Invention+3.828
vati+3.826
Neg logit contributions
orge-4.050
yrinth-3.749
olphin-3.685
 depths-3.627
rss-3.556
Tokenof
Token ID1659
Feature activation+5.91e-01

Pos logit contributions
+4.720
 cs+4.619
 postp+4.582
 Invention+4.312
vati+4.308
Neg logit contributions
orge-3.965
yrinth-3.627
olphin-3.565
 depths-3.547
rss-3.441
Token-
Token ID12
Feature activation+9.76e-01

Pos logit contributions
 cs+4.382
 postp+4.317
+4.255
 Offense+3.826
vati+3.817
Neg logit contributions
yrinth-7.176
orge-7.071
ーテ-7.031
olphin-6.825
 WRITE-6.622
Tokenattack
Token ID20358
Feature activation-5.24e-02

Pos logit contributions
 cs+5.276
+5.202
 postp+4.984
vati+4.879
 Offense+4.697
Neg logit contributions
orge-13.349
 depths-12.935
yrinth-12.845
 WRITE-12.741
olphin-12.600
Token data
Token ID1366
Feature activation-2.61e-02

Pos logit contributions
orge+0.508
 depths+0.462
yrinth+0.459
olphin+0.454
rss+0.443
Neg logit contributions
-0.587
 cs-0.568
 postp-0.561
 Invention-0.534
vati-0.533
Token by
Token ID416
Feature activation-1.19e-01

Pos logit contributions
orge+0.270
yrinth+0.247
 depths+0.245
olphin+0.244
rss+0.238
Neg logit contributions
-0.275
 cs-0.267
 postp-0.264
 Invention-0.249
vati-0.248
Token commanding
Token ID25771
Feature activation+1.04e-01

Pos logit contributions
orge+1.098
 depths+0.998
yrinth+0.995
olphin+0.980
 WRITE+0.957
Neg logit contributions
-1.373
 cs-1.331
 postp-1.299
vati-1.259
 Invention-1.252
Token the
Token ID262
Feature activation+9.55e-02

Pos logit contributions
+1.082
 cs+1.051
 postp+1.039
 Invention+0.979
vati+0.976
Neg logit contributions
orge-1.096
yrinth-1.006
 depths-0.996
olphin-0.994
rss-0.963
Token thoughts
Token ID6066
Feature activation+5.84e-01

Pos logit contributions
 postp+9.763
+9.256
 cs+9.240
 Invention+8.635
 improv+8.205
Neg logit contributions
orge-9.179
yrinth-8.474
olphin-8.276
rss-8.242
XXXX-8.027
Token in
Token ID287
Feature activation+8.24e-01

Pos logit contributions
+5.644
 postp+5.527
 cs+5.497
 Invention+5.011
vati+4.997
Neg logit contributions
orge-6.314
yrinth-5.911
olphin-5.777
 depths-5.632
 WRITE-5.566
Token the
Token ID262
Feature activation+6.84e-01

Pos logit contributions
+8.254
 postp+8.227
 cs+8.166
 Invention+7.462
vati+7.256
Neg logit contributions
orge-8.184
yrinth-7.582
olphin-7.506
XXXX-7.155
ragon-7.147
Token moment
Token ID2589
Feature activation+9.97e-01

Pos logit contributions
+7.112
 postp+7.083
 cs+6.935
 Invention+6.534
 Offense+6.290
Neg logit contributions
orge-6.874
yrinth-6.205
olphin-6.189
rss-6.037
ragon-5.860
Token of
Token ID286
Feature activation+8.53e-01

Pos logit contributions
+9.241
 postp+8.955
 cs+8.912
 Invention+8.066
 Offense+8.023
Neg logit contributions
orge-10.172
olphin-9.485
yrinth-9.470
 WRITE-8.961
ongyang-8.925
Token playing
Token ID2712
Feature activation+1.33e+00

Pos logit contributions
 postp+8.808
+8.615
 cs+8.558
 Invention+8.050
 Offense+7.723
Neg logit contributions
orge-8.357
olphin-7.584
rss-7.555
yrinth-7.536
ragon-7.390
Token,
Token ID11
Feature activation+6.21e-01

Pos logit contributions
 postp+10.576
+10.553
 cs+10.442
 err+9.359
 Invention+9.309
Neg logit contributions
orge-13.828
yrinth-13.236
olphin-12.460
rss-12.200
ongyang-12.189
Token even
Token ID772
Feature activation+5.52e-01

Pos logit contributions
 postp+6.372
 cs+6.192
+6.160
 Invention+5.590
vati+5.537
Neg logit contributions
orge-6.430
yrinth-5.939
olphin-5.745
rss-5.642
XXXX-5.477
Token if
Token ID611
Feature activation+9.38e-01

Pos logit contributions
 postp+5.726
+5.532
 cs+5.511
 Invention+5.040
vati+4.965
Neg logit contributions
orge-5.671
yrinth-5.242
olphin-5.056
rss-5.015
XXXX-4.881
Token those
Token ID883
Feature activation+5.53e-01

Pos logit contributions
 postp+8.478
 cs+8.210
+8.142
 Invention+7.444
 err+7.208
Neg logit contributions
orge-10.184
yrinth-9.322
olphin-9.297
rss-9.188
ragon-8.873
Token intentions
Token ID14953
Feature activation+6.17e-01

Pos logit contributions
 postp+5.994
 cs+5.932
+5.907
 Invention+5.438
 Offense+5.333
Neg logit contributions
orge-5.421
yrinth-4.899
rss-4.860
olphin-4.786
XXXX-4.602
Token Act
Token ID2191
Feature activation+1.68e-01

Pos logit contributions
+4.255
 cs+4.206
 postp+4.106
 Invention+4.105
 Offense+3.933
Neg logit contributions
orge-5.389
yrinth-5.180
olphin-5.001
 depths-4.921
rss-4.823
Token III
Token ID6711
Feature activation+4.06e-01

Pos logit contributions
+1.942
 cs+1.897
 postp+1.882
vati+1.786
 Invention+1.784
Neg logit contributions
orge-1.571
yrinth-1.432
 depths-1.402
olphin-1.401
rss-1.356
Token remains
Token ID3793
Feature activation+1.05e+00

Pos logit contributions
+4.301
 postp+4.270
 cs+4.251
 Invention+3.963
vati+3.901
Neg logit contributions
orge-4.053
yrinth-3.750
olphin-3.669
 depths-3.617
 WRITE-3.543
Token to
Token ID284
Feature activation-1.89e-02

Pos logit contributions
 postp+10.239
 cs+9.864
+9.699
 Invention+9.177
 Emer+8.921
Neg logit contributions
orge-10.217
yrinth-9.508
olphin-9.157
rss-9.012
 WRITE-8.913
Token be
Token ID307
Feature activation+8.19e-01

Pos logit contributions
orge+0.145
 depths+0.131
olphin+0.129
yrinth+0.127
XXXX+0.122
Neg logit contributions
-0.247
 cs-0.242
 postp-0.240
vati-0.234
 Invention-0.228
Token played
Token ID2826
Feature activation+1.06e+00

Pos logit contributions
 postp+9.432
 cs+9.424
+9.252
 Invention+8.612
vati+8.597
Neg logit contributions
orge-6.844
yrinth-6.228
olphin-6.049
rss-5.987
 lows-5.671
Token.
Token ID13
Feature activation+4.81e-01

Pos logit contributions
 postp+9.573
 cs+9.409
+9.255
 Invention+8.375
 err+8.307
Neg logit contributions
orge-10.785
yrinth-10.313
olphin-9.546
rss-9.539
ーテ-9.421
Token Just
Token ID2329
Feature activation+9.53e-01

Pos logit contributions
+4.370
 cs+4.290
 postp+4.212
 Invention+4.179
 Offense+4.031
Neg logit contributions
orge-5.415
yrinth-5.171
olphin-5.008
 depths-4.934
rss-4.838
Token when
Token ID618
Feature activation+2.25e-01

Pos logit contributions
 postp+8.758
 cs+8.297
+8.181
 Invention+7.699
 err+7.353
Neg logit contributions
orge-10.029
olphin-9.168
yrinth-8.961
XXXX-8.732
rss-8.674
Token it
Token ID340
Feature activation+1.07e-01

Pos logit contributions
+2.229
 cs+2.184
 postp+2.176
 Invention+2.027
vati+2.001
Neg logit contributions
orge-2.441
yrinth-2.260
olphin-2.229
 depths-2.195
rss-2.156
Token will
Token ID481
Feature activation+1.42e-01

Pos logit contributions
+1.188
 cs+1.167
 postp+1.160
 Invention+1.085
vati+1.082
Neg logit contributions
orge-1.043
yrinth-0.955
olphin-0.940
 depths-0.929
rss-0.908
Token in
Token ID287
Feature activation-1.30e-01

Pos logit contributions
orge+3.983
 depths+3.724
olphin+3.637
yrinth+3.618
 WRITE+3.478
Neg logit contributions
-4.838
 cs-4.679
 postp-4.637
 Invention-4.442
ickets-4.370
Token Mir
Token ID7381
Feature activation+3.40e-01

Pos logit contributions
orge+1.338
 depths+1.236
yrinth+1.232
olphin+1.210
rss+1.190
Neg logit contributions
-1.362
 cs-1.316
 postp-1.288
 Invention-1.219
vati-1.217
Tokena
Token ID64
Feature activation-7.46e-02

Pos logit contributions
+3.499
 cs+3.452
 postp+3.387
vati+3.244
 Invention+3.163
Neg logit contributions
orge-3.478
yrinth-3.274
olphin-3.215
 depths-3.156
 WRITE-3.081
Token Road
Token ID5567
Feature activation-2.27e-01

Pos logit contributions
orge+0.756
yrinth+0.692
 depths+0.679
olphin+0.679
 WRITE+0.661
Neg logit contributions
-0.805
 cs-0.779
 postp-0.771
 Invention-0.723
vati-0.715
Token.
Token ID13
Feature activation+2.49e-01

Pos logit contributions
orge+2.103
yrinth+1.921
 depths+1.915
olphin+1.870
 WRITE+1.848
Neg logit contributions
-2.595
 cs-2.511
 postp-2.468
 Invention-2.369
ickets-2.347
Token The
Token ID383
Feature activation+4.70e-01

Pos logit contributions
+2.479
 cs+2.422
 postp+2.390
 Invention+2.240
vati+2.225
Neg logit contributions
orge-2.708
yrinth-2.509
olphin-2.472
 depths-2.457
rss-2.392
Token Director
Token ID5890
Feature activation+1.35e-01

Pos logit contributions
 postp+4.696
 cs+4.676
+4.619
 Invention+4.330
 Offense+4.257
Neg logit contributions
orge-4.914
yrinth-4.537
olphin-4.494
ragon-4.343
rss-4.307
Token of
Token ID286
Feature activation+5.01e-01

Pos logit contributions
+1.633
 cs+1.597
 postp+1.590
 Invention+1.501
vati+1.497
Neg logit contributions
orge-1.191
yrinth-1.081
olphin-1.065
 depths-1.058
rss-1.021
Token Seven
Token ID13723
Feature activation+1.17e-02

Pos logit contributions
+4.625
 cs+4.618
 postp+4.524
 Invention+4.291
 Offense+4.237
Neg logit contributions
orge-5.594
yrinth-5.218
olphin-5.075
 depths-5.009
rss-4.980
Token Eleven
Token ID37339
Feature activation-3.78e-01

Pos logit contributions
+0.122
 cs+0.118
 postp+0.118
vati+0.110
 Invention+0.110
Neg logit contributions
orge-0.122
yrinth-0.112
olphin-0.111
 depths-0.111
 WRITE-0.107
Token Construction
Token ID20395
Feature activation-3.68e-01

Pos logit contributions
orge+3.852
 depths+3.541
yrinth+3.501
olphin+3.487
 WRITE+3.446
Neg logit contributions
-3.900
 postp-3.712
 cs-3.697
vati-3.490
 Invention-3.428
Token has
Token ID468
Feature activation-1.57e-01

Pos logit contributions
orge+3.263
 depths+2.948
olphin+2.920
yrinth+2.904
rss+2.826
Neg logit contributions
-3.791
 cs-3.665
 postp-3.597
vati-3.482
 Invention-3.405
Token composed
Token ID13160
Feature activation+2.88e-02

Pos logit contributions
orge+1.383
yrinth+1.255
 depths+1.243
olphin+1.230
 WRITE+1.197
Neg logit contributions
-1.881
 cs-1.818
 postp-1.809
vati-1.713
 Invention-1.707
Token and
Token ID290
Feature activation-4.83e-01

Pos logit contributions
+0.327
 cs+0.317
 postp+0.316
vati+0.298
 Invention+0.297
Neg logit contributions
orge-0.273
yrinth-0.250
 depths-0.247
olphin-0.245
 WRITE-0.237
Token performed
Token ID6157
Feature activation+1.86e-01

Pos logit contributions
orge+4.258
 depths+3.838
 rated+3.780
yrinth+3.746
olphin+3.727
Neg logit contributions
-5.639
 cs-5.351
 postp-5.338
vati-5.135
 Offense-5.125
Token the
Token ID262
Feature activation+7.05e-02

Pos logit contributions
+1.965
 cs+1.930
 postp+1.913
 Invention+1.793
vati+1.785
Neg logit contributions
orge-1.906
yrinth-1.755
olphin-1.728
 depths-1.711
rss-1.676
Token title
Token ID3670
Feature activation+5.41e-02

Pos logit contributions
+0.752
 cs+0.734
 postp+0.728
 Invention+0.682
vati+0.682
Neg logit contributions
orge-0.718
yrinth-0.661
olphin-0.650
 depths-0.649
rss-0.629
Token song
Token ID3496
Feature activation+9.23e-02

Pos logit contributions
+0.618
 cs+0.605
 postp+0.600
 Invention+0.565
vati+0.564
Neg logit contributions
orge-0.510
yrinth-0.466
olphin-0.458
 depths-0.456
rss-0.442
Token (
Token ID357
Feature activation+1.58e-02

Pos logit contributions
+1.007
 cs+0.983
 postp+0.975
 Invention+0.915
vati+0.914
Neg logit contributions
orge-0.919
yrinth-0.844
olphin-0.831
 depths-0.828
rss-0.803
TokenC
Token ID34
Feature activation+7.74e-02

Pos logit contributions
+0.158
 cs+0.154
 postp+0.153
 Invention+0.143
vati+0.142
Neg logit contributions
orge-0.172
yrinth-0.159
olphin-0.156
 depths-0.155
sent-0.152
Tokenomet
Token ID908
Feature activation-1.39e-01

Pos logit contributions
+0.719
 cs+0.703
 postp+0.698
 Invention+0.648
vati+0.647
Neg logit contributions
orge-0.897
yrinth-0.834
olphin-0.823
 depths-0.817
rss-0.795
Token Song
Token ID10940
Feature activation-3.30e-01

Pos logit contributions
orge+1.272
yrinth+1.174
olphin+1.165
 depths+1.160
 WRITE+1.116
Neg logit contributions
-1.595
 postp-1.547
 cs-1.545
vati-1.457
ickets-1.440
Token have
Token ID423
Feature activation+5.39e-02

Pos logit contributions
orge+0.654
yrinth+0.593
 depths+0.590
olphin+0.584
 WRITE+0.571
Neg logit contributions
-0.731
 cs-0.707
 postp-0.705
 Invention-0.669
vati-0.667
Token good
Token ID922
Feature activation-2.64e-01

Pos logit contributions
+0.588
 cs+0.573
 postp+0.571
 Invention+0.537
vati+0.536
Neg logit contributions
orge-0.533
yrinth-0.488
 depths-0.482
olphin-0.480
rss-0.465
Token benefits
Token ID4034
Feature activation+3.14e-01

Pos logit contributions
orge+2.627
 depths+2.394
olphin+2.327
yrinth+2.325
 lows+2.319
Neg logit contributions
-2.804
 postp-2.709
 cs-2.683
vati-2.591
 Invention-2.584
Token,
Token ID11
Feature activation+6.20e-02

Pos logit contributions
+3.164
 cs+3.086
 postp+3.056
 Invention+2.852
vati+2.845
Neg logit contributions
orge-3.371
yrinth-3.117
olphin-3.074
 depths-3.057
rss-2.976
Token everybody
Token ID7288
Feature activation-6.57e-02

Pos logit contributions
+0.647
 cs+0.633
 postp+0.630
vati+0.592
 Invention+0.592
Neg logit contributions
orge-0.641
yrinth-0.590
 depths-0.578
olphin-0.577
rss-0.565
Token gains
Token ID8810
Feature activation+1.84e-01

Pos logit contributions
orge+0.637
yrinth+0.581
 depths+0.575
olphin+0.573
 WRITE+0.555
Neg logit contributions
-0.731
 cs-0.710
 postp-0.706
vati-0.669
 Invention-0.667
Token from
Token ID422
Feature activation-3.88e-01

Pos logit contributions
+1.944
 cs+1.899
 postp+1.883
 Invention+1.765
vati+1.762
Neg logit contributions
orge-1.894
yrinth-1.743
olphin-1.717
 depths-1.709
rss-1.663
Token that
Token ID326
Feature activation+1.33e-01

Pos logit contributions
orge+3.633
 depths+3.274
yrinth+3.229
 lows+3.199
olphin+3.148
Neg logit contributions
-4.251
 cs-4.036
 postp-3.992
vati-3.987
ickets-3.858
Token.
Token ID13
Feature activation-4.75e-01

Pos logit contributions
+1.395
 cs+1.362
 postp+1.351
vati+1.268
 Invention+1.265
Neg logit contributions
orge-1.372
yrinth-1.259
olphin-1.241
 depths-1.239
rss-1.204
Token Watching
Token ID36110
Feature activation+1.21e-01

Pos logit contributions
orge+4.519
 WRITE+4.115
rss+4.078
XXXX+4.069
 depths+3.995
Neg logit contributions
 postp-5.020
-4.890
 cs-4.850
vati-4.771
ickets-4.579
Token sometimes
Token ID3360
Feature activation-1.68e-01

Pos logit contributions
+1.232
 cs+1.204
 postp+1.193
 Invention+1.113
vati+1.110
Neg logit contributions
orge-1.299
yrinth-1.201
olphin-1.185
 depths-1.177
rss-1.146
Token of
Token ID286
Feature activation+4.92e-01

Pos logit contributions
orge+1.856
 depths+1.732
yrinth+1.697
olphin+1.642
 darkest+1.625
Neg logit contributions
-2.395
 cs-2.281
 postp-2.252
vati-2.169
 Invention-2.141
Token you
Token ID345
Feature activation-3.45e-01

Pos logit contributions
+4.167
 cs+4.045
 postp+4.003
 Invention+3.688
vati+3.680
Neg logit contributions
orge-6.086
yrinth-5.687
olphin-5.613
 depths-5.597
rss-5.468
Token should
Token ID815
Feature activation+1.27e-02

Pos logit contributions
orge+3.123
 depths+2.811
yrinth+2.767
olphin+2.756
 WRITE+2.700
Neg logit contributions
-4.029
 cs-3.870
 postp-3.799
 Invention-3.655
vati-3.651
Token stay
Token ID2652
Feature activation+4.30e-01

Pos logit contributions
+0.137
 cs+0.135
 postp+0.134
 Invention+0.125
vati+0.124
Neg logit contributions
orge-0.127
yrinth-0.117
olphin-0.115
 depths-0.113
rss-0.111
Token in
Token ID287
Feature activation+3.99e-01

Pos logit contributions
 postp+4.225
 cs+4.142
+4.110
vati+3.800
 Invention+3.740
Neg logit contributions
orge-4.678
yrinth-4.448
olphin-4.214
rss-4.161
 depths-4.057
Token the
Token ID262
Feature activation+5.76e-02

Pos logit contributions
+3.895
 cs+3.849
 postp+3.805
 Invention+3.548
vati+3.526
Neg logit contributions
orge-4.329
yrinth-3.997
olphin-3.954
 depths-3.866
rss-3.864
Token mountain
Token ID8598
Feature activation+2.57e-01

Pos logit contributions
+0.630
 cs+0.613
 postp+0.608
 Invention+0.571
vati+0.570
Neg logit contributions
orge-0.573
yrinth-0.527
 depths-0.519
olphin-0.517
rss-0.498
Token shack
Token ID32772
Feature activation+3.27e-01

Pos logit contributions
+3.006
 cs+2.964
 postp+2.949
vati+2.777
 Invention+2.769
Neg logit contributions
orge-2.310
yrinth-2.105
olphin-2.067
 depths-2.020
rss-2.006
Token,
Token ID11
Feature activation-9.06e-02

Pos logit contributions
+3.264
 cs+3.210
 postp+3.199
vati+2.982
 Invention+2.937
Neg logit contributions
orge-3.474
yrinth-3.198
olphin-3.157
rss-3.107
 depths-3.076
Token and
Token ID290
Feature activation-1.40e-01

Pos logit contributions
orge+0.950
yrinth+0.876
 depths+0.867
olphin+0.861
 WRITE+0.837
Neg logit contributions
-0.938
 cs-0.909
 postp-0.898
 Invention-0.847
vati-0.845
Token please
Token ID3387
Feature activation+4.05e-02

Pos logit contributions
orge+1.479
yrinth+1.360
 depths+1.344
olphin+1.340
 WRITE+1.303
Neg logit contributions
-1.444
 cs-1.402
 postp-1.388
vati-1.304
 Invention-1.302
Token State
Token ID1812
Feature activation-5.49e-01

Pos logit contributions
orge+3.623
olphin+3.360
 depths+3.328
yrinth+3.318
 lows+3.230
Neg logit contributions
-4.044
 postp-3.912
 cs-3.908
ickets-3.693
 Invention-3.681
Token and
Token ID290
Feature activation-8.86e-02

Pos logit contributions
orge+4.733
 depths+4.474
 lows+4.338
olphin+4.315
 darkest+4.279
Neg logit contributions
-6.190
 cs-5.874
 postp-5.843
ickets-5.617
 Invention-5.599
Token early
Token ID1903
Feature activation-4.71e-01

Pos logit contributions
orge+0.943
yrinth+0.868
olphin+0.857
 depths+0.857
 WRITE+0.830
Neg logit contributions
-0.904
 cs-0.877
 postp-0.871
vati-0.816
 Invention-0.814
Token in
Token ID287
Feature activation-2.52e-01

Pos logit contributions
orge+3.885
 lows+3.597
 depths+3.591
 darkest+3.490
olphin+3.468
Neg logit contributions
-5.642
 cs-5.309
 postp-5.288
ickets-5.147
vati-5.142
Token his
Token ID465
Feature activation+1.65e-03

Pos logit contributions
orge+2.393
yrinth+2.214
 depths+2.210
olphin+2.205
 darkest+2.174
Neg logit contributions
-2.761
 postp-2.615
 cs-2.604
vati-2.517
ickets-2.490
Token rookie
Token ID12302
Feature activation-9.09e-02

Pos logit contributions
+0.019
 cs+0.018
 postp+0.018
vati+0.017
 Invention+0.017
Neg logit contributions
orge-0.016
yrinth-0.014
 depths-0.014
olphin-0.014
rss-0.014
Token season
Token ID1622
Feature activation+1.53e-01

Pos logit contributions
orge+0.723
 depths+0.656
olphin+0.645
yrinth+0.639
 lows+0.621
Neg logit contributions
-1.164
 cs-1.122
 postp-1.119
vati-1.078
 Invention-1.072
Token,
Token ID11
Feature activation-1.70e-01

Pos logit contributions
+1.590
 cs+1.552
 postp+1.539
 Invention+1.441
vati+1.438
Neg logit contributions
orge-1.597
yrinth-1.471
olphin-1.450
 depths-1.445
rss-1.405
Token when
Token ID618
Feature activation-1.03e-01

Pos logit contributions
orge+1.714
 depths+1.580
yrinth+1.570
olphin+1.553
 WRITE+1.516
Neg logit contributions
-1.805
 cs-1.732
 postp-1.719
 Invention-1.626
vati-1.619
Token he
Token ID339
Feature activation+2.56e-02

Pos logit contributions
orge+1.035
 depths+0.952
yrinth+0.939
olphin+0.937
 lows+0.911
Neg logit contributions
-1.112
 cs-1.073
 postp-1.062
 Invention-1.010
vati-1.008
Token challenged
Token ID12827
Feature activation-7.32e-01

Pos logit contributions
+0.297
 cs+0.291
 postp+0.289
 Invention+0.272
vati+0.271
Neg logit contributions
orge-0.237
yrinth-0.216
olphin-0.212
 depths-0.211
rss-0.205
Token with
Token ID351
Feature activation-5.42e-01

Pos logit contributions
orge+5.895
 depths+4.915
olphin+4.886
 subjug+4.595
 lows+4.476
Neg logit contributions
-16.052
vati-15.885
 postp-15.448
 Belg-15.352
 cs-15.320
Token daily
Token ID4445
Feature activation-6.01e-01

Pos logit contributions
orge+4.664
 depths+4.279
olphin+4.188
sent+4.141
yrinth+4.117
Neg logit contributions
-6.125
 cs-5.875
 postp-5.827
ickets-5.654
 Invention-5.654
Token,
Token ID11
Feature activation-7.14e-01

Pos logit contributions
orge+4.718
 depths+4.386
sent+4.215
 lows+4.199
 darkest+4.141
Neg logit contributions
-6.982
 cs-6.726
 postp-6.607
 Invention-6.589
vati-6.486
Token listen
Token ID6004
Feature activation-2.02e-02

Pos logit contributions
orge+6.253
 depths+5.999
 darkest+5.722
 lows+5.613
 rated+5.600
Neg logit contributions
-7.764
 cs-7.267
vati-7.185
ickets-7.129
 postp-7.058
Token to
Token ID284
Feature activation-1.90e-01

Pos logit contributions
orge+0.205
 depths+0.187
yrinth+0.186
olphin+0.185
 WRITE+0.179
Neg logit contributions
-0.214
 cs-0.210
 postp-0.210
vati-0.201
 Invention-0.199
Token those
Token ID883
Feature activation-6.69e-01

Pos logit contributions
orge+1.808
 depths+1.671
yrinth+1.627
olphin+1.612
 darkest+1.593
Neg logit contributions
-2.145
 cs-2.068
 postp-2.047
vati-1.971
 Invention-1.939
Token people
Token ID661
Feature activation-5.89e-01

Pos logit contributions
orge+5.800
 depths+5.624
 darkest+5.380
 lows+5.294
 rated+5.140
Neg logit contributions
-7.558
 cs-7.167
 postp-7.089
vati-7.062
 Invention-6.909
Token,
Token ID11
Feature activation-5.52e-01

Pos logit contributions
orge+4.980
 depths+4.583
olphin+4.402
yrinth+4.355
 darkest+4.319
Neg logit contributions
-6.876
 postp-6.525
 cs-6.517
vati-6.437
ickets-6.305
Token and
Token ID290
Feature activation-2.14e-01

Pos logit contributions
orge+4.870
 depths+4.527
 WRITE+4.333
 darkest+4.209
olphin+4.206
Neg logit contributions
-6.333
 cs-5.910
 postp-5.900
vati-5.867
ickets-5.783
Token amplify
Token ID47366
Feature activation-4.16e-01

Pos logit contributions
orge+2.086
 depths+1.913
yrinth+1.879
olphin+1.866
 WRITE+1.849
Neg logit contributions
-2.363
 cs-2.263
 postp-2.246
vati-2.180
 Invention-2.136
Token their
Token ID511
Feature activation-3.31e-01

Pos logit contributions
orge+3.716
 depths+3.462
yrinth+3.298
olphin+3.298
 darkest+3.291
Neg logit contributions
-4.810
 cs-4.581
 postp-4.557
vati-4.505
 Invention-4.306
Token.
Token ID13
Feature activation-3.43e-02

Pos logit contributions
orge+2.198
 depths+2.059
olphin+1.982
 lows+1.966
yrinth+1.965
Neg logit contributions
-2.547
 cs-2.433
 postp-2.424
vati-2.316
 Invention-2.311
Token Heavy
Token ID14089
Feature activation-4.74e-01

Pos logit contributions
orge+0.350
yrinth+0.321
olphin+0.316
 depths+0.316
 WRITE+0.306
Neg logit contributions
-0.365
 cs-0.356
 postp-0.354
vati-0.332
 Invention-0.330
Token class
Token ID1398
Feature activation-2.76e-02

Pos logit contributions
orge+4.707
 depths+4.395
 lows+4.246
yrinth+4.235
 rated+4.160
Neg logit contributions
-4.997
 cs-4.775
 postp-4.714
vati-4.529
 Invention-4.518
Token drivers
Token ID6643
Feature activation-6.15e-02

Pos logit contributions
orge+0.266
yrinth+0.241
 depths+0.240
olphin+0.238
 lows+0.230
Neg logit contributions
-0.311
 cs-0.301
 postp-0.300
 Invention-0.284
vati-0.283
Token have
Token ID423
Feature activation-1.51e-02

Pos logit contributions
orge+0.604
 depths+0.546
yrinth+0.544
olphin+0.541
 WRITE+0.523
Neg logit contributions
-0.683
 cs-0.659
 postp-0.655
 Invention-0.620
vati-0.618
Token higher
Token ID2440
Feature activation-3.22e-01

Pos logit contributions
orge+0.167
 depths+0.155
yrinth+0.154
olphin+0.152
 lows+0.149
Neg logit contributions
-0.147
 cs-0.142
 postp-0.141
 Invention-0.132
vati-0.131
Token top
Token ID1353
Feature activation+1.97e-03

Pos logit contributions
orge+3.142
 depths+2.970
 lows+2.873
yrinth+2.848
olphin+2.844
Neg logit contributions
-3.464
 postp-3.301
 cs-3.296
vati-3.165
 Invention-3.130
Token speeds
Token ID12055
Feature activation+7.08e-01

Pos logit contributions
+0.022
 cs+0.021
 postp+0.021
vati+0.020
 Invention+0.020
Neg logit contributions
orge-0.019
yrinth-0.018
 depths-0.018
olphin-0.017
 WRITE-0.017
Token and
Token ID290
Feature activation-2.14e-01

Pos logit contributions
+6.208
 postp+6.202
 cs+6.178
vati+5.651
 Offense+5.563
Neg logit contributions
orge-7.936
yrinth-7.708
olphin-7.219
rss-7.125
 WRITE-7.050
Token more
Token ID517
Feature activation-4.76e-02

Pos logit contributions
orge+2.276
 depths+2.125
olphin+2.058
yrinth+2.055
 lows+2.050
Neg logit contributions
-2.167
 cs-2.071
 postp-2.063
vati-1.952
 Invention-1.950
Token weight
Token ID3463
Feature activation-2.54e-02

Pos logit contributions
orge+0.447
yrinth+0.405
 depths+0.405
olphin+0.401
 lows+0.387
Neg logit contributions
-0.544
 cs-0.528
 postp-0.525
 Invention-0.499
vati-0.497
Token in
Token ID287
Feature activation-3.83e-02

Pos logit contributions
orge+2.275
 depths+2.090
yrinth+2.067
olphin+2.053
sent+1.992
Neg logit contributions
-2.712
 cs-2.608
 postp-2.603
vati-2.462
ickets-2.457
Token Pre
Token ID3771
Feature activation-2.35e-01

Pos logit contributions
orge+0.417
yrinth+0.386
 depths+0.384
olphin+0.379
sent+0.370
Neg logit contributions
-0.377
 postp-0.365
 cs-0.365
vati-0.345
 Invention-0.338
Tokeny
Token ID88
Feature activation-8.25e-02

Pos logit contributions
orge+2.269
yrinth+2.087
olphin+2.052
 depths+2.033
 WRITE+1.977
Neg logit contributions
-2.635
 cs-2.574
 postp-2.551
 Invention-2.407
vati-2.399
Token as
Token ID355
Feature activation-3.04e-01

Pos logit contributions
orge+0.775
yrinth+0.716
 depths+0.708
olphin+0.705
sent+0.680
Neg logit contributions
-0.933
 postp-0.908
 cs-0.904
vati-0.855
 Invention-0.853
Token well
Token ID880
Feature activation+4.42e-02

Pos logit contributions
orge+2.461
 depths+2.267
yrinth+2.199
 lows+2.152
olphin+2.129
Neg logit contributions
-3.708
 postp-3.611
 cs-3.606
vati-3.508
 Invention-3.430
Token.
Token ID13
Feature activation-3.53e-01

Pos logit contributions
+0.486
 cs+0.473
 postp+0.472
vati+0.447
 Invention+0.445
Neg logit contributions
orge-0.434
yrinth-0.397
 depths-0.395
olphin-0.394
rss-0.384
Token But
Token ID887
Feature activation-1.96e-01

Pos logit contributions
orge+3.742
 depths+3.389
yrinth+3.363
 WRITE+3.350
olphin+3.348
Neg logit contributions
-3.472
 postp-3.443
 cs-3.376
vati-3.259
ickets-3.166
Token where
Token ID810
Feature activation-1.70e-01

Pos logit contributions
orge+2.045
yrinth+1.890
 depths+1.879
olphin+1.844
sent+1.803
Neg logit contributions
-2.014
 cs-1.943
 postp-1.920
vati-1.842
 Invention-1.801
Token Bio
Token ID16024
Feature activation+1.57e-01

Pos logit contributions
orge+1.807
yrinth+1.660
 depths+1.652
olphin+1.621
sent+1.601
Neg logit contributions
-1.714
 postp-1.656
 cs-1.646
vati-1.584
 Invention-1.541
TokenShock
Token ID31646
Feature activation+9.77e-02

Pos logit contributions
+1.481
 cs+1.452
 postp+1.430
 Invention+1.331
vati+1.300
Neg logit contributions
orge-1.739
yrinth-1.694
 depths-1.631
olphin-1.618
 WRITE-1.616
Token presented
Token ID5545
Feature activation-3.48e-02

Pos logit contributions
+1.074
 cs+1.050
 postp+1.042
 Invention+0.980
vati+0.978
Neg logit contributions
orge-0.962
yrinth-0.883
olphin-0.867
 depths-0.865
rss-0.839
Token Group
Token ID4912
Feature activation-4.53e-01

Pos logit contributions
orge+4.638
 WRITE+4.103
XXXX+4.082
 depths+4.050
yrinth+4.048
Neg logit contributions
-7.853
 cs-7.603
 postp-7.530
vati-7.350
ickets-7.214
Token<|endoftext|>
Token ID50256
Feature activation+3.56e-03

Pos logit contributions
orge+4.194
 depths+3.709
yrinth+3.637
XXXX+3.625
olphin+3.620
Neg logit contributions
-4.965
 postp-4.874
 cs-4.823
vati-4.666
ickets-4.564
TokenW
Token ID54
Feature activation-4.22e-01

Pos logit contributions
+0.036
 cs+0.036
 postp+0.035
 Invention+0.033
vati+0.033
Neg logit contributions
orge-0.038
yrinth-0.035
olphin-0.034
 depths-0.034
rss-0.034
Tokenired
Token ID1202
Feature activation-2.53e-01

Pos logit contributions
orge+4.454
 depths+3.988
olphin+3.983
yrinth+3.955
elf+3.924
Neg logit contributions
-4.257
 cs-4.131
 postp-4.109
 Invention-3.904
 Offense-3.821
Token Equ
Token ID7889
Feature activation+1.42e-02

Pos logit contributions
orge+2.499
 depths+2.262
yrinth+2.248
olphin+2.221
rss+2.205
Neg logit contributions
-2.739
 cs-2.634
 postp-2.599
vati-2.540
 Invention-2.423
Tokenivalent
Token ID29540
Feature activation-3.14e-01

Pos logit contributions
+0.153
 cs+0.150
 postp+0.148
 Invention+0.139
 Offense+0.137
Neg logit contributions
orge-0.144
yrinth-0.133
olphin-0.131
 depths-0.130
rss-0.126
Token Privacy
Token ID16777
Feature activation+8.54e-02

Pos logit contributions
orge+3.213
 depths+2.967
 WRITE+2.892
yrinth+2.872
rss+2.861
Neg logit contributions
-3.206
 postp-3.097
 cs-3.053
vati-2.988
ickets-2.845
Token (
Token ID357
Feature activation+2.26e-01

Pos logit contributions
+0.873
 cs+0.852
 postp+0.845
 Invention+0.789
vati+0.788
Neg logit contributions
orge-0.906
yrinth-0.836
olphin-0.824
 depths-0.821
rss-0.800
TokenWE
Token ID8845
Feature activation+6.16e-01

Pos logit contributions
+2.250
 cs+2.195
 postp+2.174
 Invention+2.030
vati+2.025
Neg logit contributions
orge-2.455
yrinth-2.271
olphin-2.238
 depths-2.229
rss-2.171
TokenP
Token ID47
Feature activation+3.13e-01

Pos logit contributions
+5.964
 cs+5.942
 postp+5.849
 Invention+5.500
 Offense+5.452
Neg logit contributions
orge-6.509
yrinth-6.246
olphin-5.946
 depths-5.921
rss-5.835
Token)
Token ID8
Feature activation+2.48e-01

Pos logit contributions
+3.267
 cs+3.191
 postp+3.164
 Invention+2.963
vati+2.957
Neg logit contributions
orge-3.249
yrinth-2.994
olphin-2.949
 depths-2.935
rss-2.857
Token
Token ID447
Feature activation+3.06e-02

Pos logit contributions
+0.538
 cs+0.526
 postp+0.522
vati+0.491
 Invention+0.491
Neg logit contributions
orge-0.507
yrinth-0.466
 depths-0.458
olphin-0.457
rss-0.446
Token
Token ID251
Feature activation-1.84e-01

Pos logit contributions
+0.360
 cs+0.354
 postp+0.352
 Invention+0.334
vati+0.333
Neg logit contributions
orge-0.275
yrinth-0.247
 depths-0.244
olphin-0.241
rss-0.238
Token poll
Token ID3278
Feature activation-2.09e-01

Pos logit contributions
orge+1.869
yrinth+1.703
 depths+1.693
rss+1.674
sent+1.654
Neg logit contributions
-1.919
 cs-1.882
 postp-1.874
vati-1.786
 Invention-1.785
Tokenster
Token ID1706
Feature activation+4.18e-01

Pos logit contributions
orge+1.986
 depths+1.788
yrinth+1.777
olphin+1.743
rss+1.741
Neg logit contributions
-2.360
 postp-2.284
 cs-2.260
 Invention-2.193
vati-2.177
Token Stan
Token ID7299
Feature activation-9.70e-02

Pos logit contributions
+4.232
 cs+4.199
 postp+4.096
 Invention+3.846
vati+3.809
Neg logit contributions
orge-4.359
yrinth-4.086
olphin-4.021
 depths-3.920
rss-3.845
Token Greenberg
Token ID46702
Feature activation-2.48e-01

Pos logit contributions
orge+1.012
olphin+0.905
yrinth+0.903
 depths+0.902
rss+0.886
Neg logit contributions
-1.021
 cs-0.999
 postp-0.988
vati-0.932
 Invention-0.920
Token told
Token ID1297
Feature activation+2.57e-01

Pos logit contributions
orge+2.170
 depths+1.938
yrinth+1.916
olphin+1.885
sent+1.871
Neg logit contributions
-2.913
 postp-2.870
 cs-2.854
 Invention-2.753
vati-2.752
Token left
Token ID1364
Feature activation-2.01e-01

Pos logit contributions
+2.525
 cs+2.472
 postp+2.438
 Invention+2.271
vati+2.255
Neg logit contributions
orge-2.820
yrinth-2.621
olphin-2.587
 depths-2.558
rss-2.495
Token-
Token ID12
Feature activation-2.20e-01

Pos logit contributions
orge+1.862
 depths+1.702
rss+1.651
olphin+1.638
yrinth+1.632
Neg logit contributions
-2.251
 postp-2.205
 cs-2.199
vati-2.125
 Invention-2.123
Tokenleaning
Token ID25909
Feature activation-1.93e-01

Pos logit contributions
orge+1.589
rss+1.410
sent+1.406
yrinth+1.373
olphin+1.364
Neg logit contributions
-2.924
 postp-2.886
 cs-2.882
 Invention-2.780
vati-2.706
Token Salon
Token ID40121
Feature activation-6.52e-01

Pos logit contributions
orge+1.937
 depths+1.755
yrinth+1.752
olphin+1.719
 WRITE+1.718
Neg logit contributions
-2.074
 postp-1.997
 cs-1.994
vati-1.902
 Invention-1.880
Token tourist
Token ID18473
Feature activation+1.83e-01

Pos logit contributions
orge+4.943
 depths+4.539
olphin+4.432
sent+4.390
XXXX+4.376
Neg logit contributions
-5.872
 cs-5.460
vati-5.392
ickets-5.179
 postp-5.177
Token without
Token ID1231
Feature activation-4.08e-02

Pos logit contributions
+1.908
 cs+1.873
 postp+1.862
 Invention+1.731
vati+1.729
Neg logit contributions
orge-1.895
yrinth-1.755
olphin-1.721
 depths-1.714
rss-1.671
Token diplomatic
Token ID13093
Feature activation+3.28e-01

Pos logit contributions
orge+0.364
yrinth+0.326
 depths+0.325
olphin+0.322
 WRITE+0.311
Neg logit contributions
-0.492
 cs-0.475
 postp-0.471
 Invention-0.449
vati-0.449
Token immunity
Token ID16989
Feature activation+1.44e-01

Pos logit contributions
 cs+3.604
 postp+3.596
+3.578
 Invention+3.322
vati+3.280
Neg logit contributions
orge-3.144
yrinth-2.955
olphin-2.896
rss-2.791
 depths-2.759
Token.
Token ID13
Feature activation-4.61e-01

Pos logit contributions
+1.520
 cs+1.484
 postp+1.470
 Invention+1.386
vati+1.379
Neg logit contributions
orge-1.484
yrinth-1.362
olphin-1.345
 depths-1.342
rss-1.304
Token At
Token ID1629
Feature activation-1.44e+00

Pos logit contributions
orge+4.521
 WRITE+4.017
XXXX+3.954
 depths+3.946
sent+3.932
Neg logit contributions
-4.938
 postp-4.793
 cs-4.756
vati-4.655
ickets-4.481
Token some
Token ID617
Feature activation-8.99e-01

Pos logit contributions
 depths+9.934
orge+9.359
 darkest+9.008
 lows+8.766
sent+8.742
Neg logit contributions
vati-14.179
-14.173
 cs-13.984
 postp-13.458
ickets-13.427
Token point
Token ID966
Feature activation-8.15e-01

Pos logit contributions
 depths+6.652
orge+6.519
 lows+5.878
 darkest+5.805
 rated+5.683
Neg logit contributions
-10.531
vati-9.956
 cs-9.892
ickets-9.814
skirts-9.652
Token,
Token ID11
Feature activation-9.67e-01

Pos logit contributions
orge+6.457
 depths+5.938
rade+5.611
yrinth+5.601
 lows+5.549
Neg logit contributions
-9.301
 cs-8.939
ickets-8.653
vati-8.556
 Invention-8.537
Token the
Token ID262
Feature activation-5.05e-01

Pos logit contributions
orge+8.273
 depths+7.843
 lows+7.358
 darkest+7.350
rade+7.297
Neg logit contributions
-10.514
 cs-9.583
vati-9.474
ickets-9.409
 Invention-9.149
Token top
Token ID1353
Feature activation-5.75e-01

Pos logit contributions
orge+4.748
 depths+4.499
 darkest+4.264
yrinth+4.249
 lows+4.235
Neg logit contributions
-5.623
 cs-5.289
vati-5.079
 Invention-4.973
 postp-4.964
Token marked
Token ID7498
Feature activation-4.40e-01

Pos logit contributions
orge+3.168
 depths+2.919
yrinth+2.828
 darkest+2.783
olphin+2.777
Neg logit contributions
-3.937
 cs-3.774
 postp-3.727
vati-3.649
 Invention-3.612
Token its
Token ID663
Feature activation-7.26e-01

Pos logit contributions
orge+4.061
 depths+3.835
yrinth+3.677
 lows+3.636
 darkest+3.620
Neg logit contributions
-4.901
 cs-4.687
 postp-4.586
vati-4.502
 Invention-4.484
Token 100
Token ID1802
Feature activation-3.89e-01

Pos logit contributions
orge+6.162
 depths+6.144
 darkest+6.108
 lows+5.701
yrinth+5.603
Neg logit contributions
-8.227
 cs-7.519
 postp-7.400
vati-7.386
skirts-7.383
Tokenth
Token ID400
Feature activation-5.24e-01

Pos logit contributions
orge+3.146
 depths+2.854
sent+2.802
olphin+2.775
 darkest+2.766
Neg logit contributions
-4.693
 postp-4.596
 cs-4.473
vati-4.446
 Invention-4.339
Token anniversary
Token ID11162
Feature activation-4.53e-02

Pos logit contributions
orge+5.712
 depths+5.520
 darkest+5.305
yrinth+5.285
 synd+5.251
Neg logit contributions
-4.795
ickets-4.386
 cs-4.380
vati-4.310
skirts-4.250
Token this
Token ID428
Feature activation-1.58e+00

Pos logit contributions
orge+0.426
yrinth+0.390
 depths+0.389
olphin+0.384
 WRITE+0.373
Neg logit contributions
-0.516
 cs-0.501
 postp-0.494
 Invention-0.470
vati-0.469
Token year
Token ID614
Feature activation-3.44e-01

Pos logit contributions
 depths+7.237
orge+6.895
 darkest+6.797
 synd+6.365
 rated+6.146
Neg logit contributions
-19.326
ickets-18.998
 cs-17.914
skirts-17.885
vati-17.705
Token,
Token ID11
Feature activation-8.19e-02

Pos logit contributions
orge+3.003
 depths+2.735
olphin+2.708
sent+2.659
yrinth+2.654
Neg logit contributions
-3.980
 cs-3.879
 postp-3.769
 Invention-3.730
ickets-3.668
Token split
Token ID6626
Feature activation-4.03e-02

Pos logit contributions
orge+0.768
yrinth+0.696
 depths+0.695
olphin+0.686
rss+0.666
Neg logit contributions
-0.936
 cs-0.909
 postp-0.894
 Invention-0.858
vati-0.857
Token into
Token ID656
Feature activation-3.28e-01

Pos logit contributions
orge+0.377
 depths+0.344
yrinth+0.342
olphin+0.337
rss+0.329
Neg logit contributions
-0.461
 cs-0.449
 postp-0.445
 Invention-0.425
vati-0.424
Token two
Token ID734
Feature activation-2.52e-01

Pos logit contributions
orge+2.631
 depths+2.449
yrinth+2.391
 lows+2.318
 darkest+2.290
Neg logit contributions
-4.073
 cs-3.900
 postp-3.874
vati-3.804
 Invention-3.751
Token,
Token ID11
Feature activation-8.81e-01

Pos logit contributions
orge+6.675
 depths+6.453
 rated+6.256
 darkest+6.230
 lows+6.211
Neg logit contributions
-8.148
ickets-7.488
 cs-7.448
 Invention-7.291
 postp-7.139
Token and
Token ID290
Feature activation-4.67e-01

Pos logit contributions
orge+7.769
 depths+7.465
 rated+7.128
 darkest+7.051
 lows+6.918
Neg logit contributions
-9.553
 cs-8.611
ickets-8.556
skirts-8.496
 postp-8.420
Token perhaps
Token ID3737
Feature activation-6.21e-01

Pos logit contributions
orge+4.484
 depths+4.145
 WRITE+3.937
olphin+3.926
 darkest+3.912
Neg logit contributions
-5.193
 cs-4.857
 postp-4.787
vati-4.743
ickets-4.659
Token struggle
Token ID6531
Feature activation-1.44e+00

Pos logit contributions
orge+5.565
 depths+5.212
 darkest+4.945
 WRITE+4.925
olphin+4.904
Neg logit contributions
-7.128
 cs-6.533
 postp-6.463
vati-6.447
ickets-6.394
Token to
Token ID284
Feature activation-1.31e+00

Pos logit contributions
orge+8.101
 depths+8.056
 lows+7.456
 darkest+7.451
 rated+7.350
Neg logit contributions
-16.159
 Offense-15.087
 Invention-14.931
 cs-14.906
ickets-14.655
Token convey
Token ID13878
Feature activation-1.60e+00

Pos logit contributions
orge+9.403
 depths+9.304
 WRITE+8.785
 rated+8.395
 subjug+8.322
Neg logit contributions
-14.397
skirts-13.955
vati-13.645
ickets-13.515
 Offense-13.207
Token to
Token ID284
Feature activation-7.27e-01

Pos logit contributions
orge+10.593
 depths+10.358
 rated+9.462
 lows+9.438
 liberation+9.292
Neg logit contributions
-16.440
 cs-15.417
ickets-15.319
 Prosecut-14.980
skirts-14.954
Token others
Token ID1854
Feature activation-9.55e-01

Pos logit contributions
orge+4.319
 depths+3.829
XXXX+3.497
 rated+3.444
yrinth+3.439
Neg logit contributions
-10.187
 cs-9.605
 postp-9.549
 Invention-9.362
vati-9.350
Token,
Token ID11
Feature activation-1.07e+00

Pos logit contributions
orge+6.777
 depths+6.543
 lows+6.026
olphin+5.980
 rated+5.973
Neg logit contributions
-11.122
 cs-10.564
 postp-10.340
ickets-10.273
 Prosecut-10.142
Token that
Token ID326
Feature activation-7.15e-01

Pos logit contributions
 depths+7.700
orge+7.692
 lows+7.189
 darkest+7.186
 rated+6.986
Neg logit contributions
-12.363
 cs-11.461
ickets-11.104
 postp-11.010
 Prosecut-11.002
Token their
Token ID511
Feature activation-1.02e+00

Pos logit contributions
orge+6.029
 depths+5.668
 darkest+5.415
 lows+5.322
 rated+5.320
Neg logit contributions
-8.116
 cs-7.600
ickets-7.420
 postp-7.354
vati-7.333
Token first
Token ID717
Feature activation-6.31e-01

Pos logit contributions
orge+7.254
 depths+6.994
 darkest+6.726
 lows+6.582
 WRITE+6.521
Neg logit contributions
-8.393
 cs-7.933
skirts-7.646
vati-7.641
ickets-7.640
Token C
Token ID327
Feature activation-4.26e-01

Pos logit contributions
orge+5.266
 depths+4.751
yrinth+4.606
 WRITE+4.573
olphin+4.535
Neg logit contributions
-7.190
 cs-6.911
ickets-6.764
 postp-6.707
 Invention-6.698
TokenEL
Token ID3698
Feature activation-4.60e-02

Pos logit contributions
orge+3.139
yrinth+2.690
olphin+2.689
elf+2.665
 depths+2.649
Neg logit contributions
-5.604
 postp-5.529
 cs-5.483
 Invention-5.327
vati-5.258
TokenAC
Token ID2246
Feature activation-9.36e-02

Pos logit contributions
orge+0.506
yrinth+0.477
olphin+0.461
 depths+0.459
 WRITE+0.449
Neg logit contributions
-0.458
 cs-0.448
 postp-0.443
 Invention-0.407
vati-0.405
Token summit
Token ID14237
Feature activation-2.62e-01

Pos logit contributions
orge+0.808
yrinth+0.727
 depths+0.719
olphin+0.717
 WRITE+0.694
Neg logit contributions
-1.134
 cs-1.104
 postp-1.094
vati-1.042
 Invention-1.042
Token last
Token ID938
Feature activation-1.31e+00

Pos logit contributions
orge+2.399
 depths+2.194
yrinth+2.174
olphin+2.161
 WRITE+2.127
Neg logit contributions
-2.932
 cs-2.867
 postp-2.817
 Invention-2.735
ickets-2.710
Token year
Token ID614
Feature activation-4.52e-01

Pos logit contributions
 depths+6.154
orge+5.800
 darkest+5.635
rade+5.609
 rated+5.381
Neg logit contributions
-16.525
ickets-15.692
skirts-15.555
 cs-15.398
 Invention-15.244
Token.
Token ID13
Feature activation-4.51e-01

Pos logit contributions
orge+3.805
 depths+3.463
sent+3.451
olphin+3.434
alian+3.379
Neg logit contributions
-5.043
 cs-4.990
 postp-4.870
 Invention-4.856
 Offense-4.736
Token
Token ID198
Feature activation-5.93e-01

Pos logit contributions
orge+4.201
 WRITE+3.718
XXXX+3.702
yrinth+3.646
sent+3.645
Neg logit contributions
-4.852
 postp-4.790
 cs-4.679
vati-4.588
ickets-4.412
Token
Token ID198
Feature activation-9.46e-01

Pos logit contributions
orge+5.620
 depths+5.026
XXXX+4.953
olphin+4.938
sent+4.934
Neg logit contributions
-6.505
 cs-6.358
 postp-6.335
vati-6.036
 Invention-5.944
TokenBefore
Token ID8421
Feature activation-2.28e-01

Pos logit contributions
orge+8.455
XXXX+8.011
sent+7.849
Rod+7.791
LGBT+7.573
Neg logit contributions
 postp-9.496
 cs-9.132
 Invention-8.956
vati-8.933
-8.817
Tokenll
Token ID297
Feature activation-1.37e-01

Pos logit contributions
orge+3.945
sent+3.572
yrinth+3.533
 depths+3.516
rss+3.478
Neg logit contributions
-6.220
 cs-6.153
 postp-6.080
 Invention-5.980
vati-5.864
Token be
Token ID307
Feature activation-7.11e-01

Pos logit contributions
orge+1.280
yrinth+1.159
 depths+1.141
olphin+1.136
XXXX+1.099
Neg logit contributions
-1.566
 cs-1.534
 postp-1.532
vati-1.483
 Invention-1.446
Token a
Token ID257
Feature activation-7.42e-01

Pos logit contributions
orge+6.298
 depths+5.769
yrinth+5.720
 rated+5.655
 darkest+5.649
Neg logit contributions
-7.731
 cs-7.342
 postp-7.308
vati-7.119
ickets-7.057
Token definite
Token ID21892
Feature activation-1.68e-01

Pos logit contributions
orge+6.949
 depths+6.352
yrinth+6.264
olphin+6.167
 darkest+6.094
Neg logit contributions
-7.693
 cs-7.380
 postp-7.303
vati-7.282
ickets-6.978
Token enhancement
Token ID28554
Feature activation-5.96e-01

Pos logit contributions
orge+1.796
yrinth+1.639
 depths+1.630
olphin+1.628
XXXX+1.578
Neg logit contributions
-1.700
 cs-1.642
 postp-1.628
vati-1.551
 Invention-1.522
Token in
Token ID287
Feature activation-1.42e+00

Pos logit contributions
orge+5.213
olphin+4.634
yrinth+4.555
 depths+4.522
 rated+4.488
Neg logit contributions
-6.643
 postp-6.485
 cs-6.396
vati-6.333
ickets-6.224
Token intercourse
Token ID22409
Feature activation-8.46e-01

Pos logit contributions
orge+11.041
 depths+10.689
XXXX+10.197
olphin+10.165
yrinth+10.056
Neg logit contributions
-13.020
vati-12.381
 postp-12.175
 cs-11.979
ickets-11.972
Token,
Token ID11
Feature activation-8.16e-01

Pos logit contributions
orge+7.282
 depths+6.980
yrinth+6.560
XXXX+6.511
olphin+6.481
Neg logit contributions
-8.866
 cs-8.465
 postp-8.464
vati-8.409
ickets-8.239
Token
Token ID447
Feature activation-2.76e-01

Pos logit contributions
orge+6.580
 depths+5.910
olphin+5.867
yrinth+5.822
XXXX+5.801
Neg logit contributions
-8.912
 cs-8.551
 postp-8.492
vati-8.397
ickets-8.318
Token
Token ID251
Feature activation-7.74e-01

Pos logit contributions
orge+2.875
yrinth+2.601
 depths+2.594
olphin+2.559
rss+2.540
Neg logit contributions
-2.814
 cs-2.753
 postp-2.719
 Invention-2.565
ickets-2.547
Token he
Token ID339
Feature activation+2.00e-01

Pos logit contributions
orge+6.628
 depths+5.860
rss+5.827
XXXX+5.783
sent+5.767
Neg logit contributions
-8.406
 cs-8.069
 postp-7.933
ickets-7.805
 Invention-7.787