TOP ACTIVATIONS
MAX = 1.379

, known as the Y arm ouk Basin , aren
It 's ha zy , pale yellow , with very little
/ 295 3 * 0 . 8 681 3 * 3
offline qualifier as a stand - in for J oon a
/ 215 4 * 0 . 9 559 4 * 4
/ 253 4 * 0 . 9 556 4 * 4
/ 226 4 * 0 . 9 242 4 * 4
He was a Batman stand - in in the Wild storm
t inct ure or a few dro pper ful s of
/ 190 4 * 0 . 9 607 4 * 4
different somehow a few days ago , an exploration
/ 175 4 * 0 . 96 13 5 * 4
his past , but a fe ist y Kaine jud
nationalist heter ogeneous coalition ( lab elled as tre
2013 , suffered a computer hard drive crash that cost potentially
Please click below to become an In Daily supporter . Powered
's ha zy , pale yellow , with very little carbon
was asked " to sign a nond is closure agreement before
op rene rubber goes into all sorts of everyday household products
] As of 2008 , it has sold over 43 ,

MIN ACTIVATIONS
MIN = -1.574

the NFL Commissioner as judge , jury , execution or ,
previously mentioned . No word if the Str and
https :// www . bel fast te legraph . co .
less one - day ( b ilateral ) cricket and more
Enemy of the State : The Life of Murray N .
But back to the no - spin scenario for a second
work , but there is no vaccine Good personal hygiene ,
N 64 as the New A ger s console
still be moving at the original speed of the Earth 's
AS ES Phase 1 : No infections in humans are being
https :// www . bel fast te legraph . co .
book , Re claim ing the American Right : The Lost
frontline to serve as " c annon fodder " to protect
can buy An Enemy of the State : The Life of
months from the M auna Lo a series of [ CO
writer who qu its his pro dig iously paying job to
Stars 1999 Stanley Cup winner , then lost out
authorities to years of below - average precipitation are unquestion
And there s no shortage of high - tech
duration of the game chasing one another up and down the

INTERVAL 0.640 - 1.379
CONTAINS 0.675%

The woman then appears to kicks him twice , and he
* Get Volume () ; return f Mass ; }
Sen b ong i and Sh us aku U ch iy
scientist / Y all mother fuck ers lying ,
about sex and looked a little shocked , lol ). I

INTERVAL -0.098 - 0.640
CONTAINS 65.997%

in case the individual is neither a UAE citizen nor a
you didn t get a check .
knew no bounds , his relatives said .
nor that he 'd be happy that they left home so
just to get rid of the proverbial al bat ross around

INTERVAL -0.836 - -0.098
CONTAINS 33.198%

alia . It s not clear how the case
! This is some seriously next - level stuff . While
The research undermines any notion of a crisis caused primarily by
Dec ades ago , the city planted many liquid
two options with the family at this point : the clinical

INTERVAL -1.574 - -0.836
CONTAINS 0.129%

by Z ey ad Mas ro or Khan ; Writing by
] Without a working star tracker , ion thrust
Mind hunter . No one says ,
huge spy apparatus that only looks after itself .
older bug at work . No big gie , because I
Token,
Token ID11
Feature activation+3.93e-02

Pos logit contributions
OPLE+0.097
POSE+0.089
ンジ+0.087
+0.085
iets+0.085
Neg logit contributions
osal-0.090
obbies-0.077
hedral-0.077
paces-0.077
heses-0.070
Token known
Token ID1900
Feature activation-4.31e-01

Pos logit contributions
OPLE+0.438
POSE+0.403
ンジ+0.392
+0.382
iets+0.381
Neg logit contributions
osal-0.459
obbies-0.400
hedral-0.398
paces-0.397
heses-0.366
Token as
Token ID355
Feature activation+2.14e-01

Pos logit contributions
osal+4.156
paces+3.628
obbies+3.541
hedral+3.506
gey+3.232
Neg logit contributions
OPLE-5.471
POSE-5.134
pointer-4.910
puter-4.906
iets-4.899
Token the
Token ID262
Feature activation+6.96e-02

Pos logit contributions
OPLE+2.311
POSE+2.126
ンジ+2.092
+2.034
================================================================+2.010
Neg logit contributions
osal-2.534
obbies-2.234
hedral-2.217
paces-2.212
heses-2.040
Token Y
Token ID575
Feature activation-2.37e-01

Pos logit contributions
OPLE+0.785
POSE+0.724
ンジ+0.705
+0.686
iets+0.683
Neg logit contributions
osal-0.803
obbies-0.700
paces-0.697
hedral-0.697
heses-0.640
Tokenarm
Token ID1670
Feature activation+1.38e+00

Pos logit contributions
osal+2.256
obbies+1.882
hedral+1.859
paces+1.834
idding+1.689
Neg logit contributions
OPLE-3.149
ンジ-2.942
POSE-2.937
-2.854
================================================================-2.806
Tokenouk
Token ID38960
Feature activation+5.56e-02

Pos logit contributions
pling+8.161
OPLE+7.485
================================================================+7.127
pointer+7.091
lp+7.083
Neg logit contributions
osal-17.682
paces-16.752
obbies-16.362
heses-16.099
hedral-15.809
Token Basin
Token ID32666
Feature activation+2.01e-01

Pos logit contributions
OPLE+0.647
POSE+0.598
ンジ+0.585
+0.570
iets+0.568
Neg logit contributions
osal-0.621
obbies-0.538
paces-0.536
hedral-0.535
heses-0.492
Token,
Token ID11
Feature activation-2.82e-03

Pos logit contributions
OPLE+2.221
POSE+2.044
ンジ+2.017
+1.951
iets+1.942
Neg logit contributions
osal-2.328
obbies-2.022
paces-2.017
hedral-2.009
heses-1.856
Token aren
Token ID3588
Feature activation+1.07e-02

Pos logit contributions
osal+0.030
obbies+0.026
hedral+0.026
paces+0.026
heses+0.023
Neg logit contributions
OPLE-0.035
POSE-0.032
ンジ-0.031
-0.030
iets-0.030
Token
Token ID447
Feature activation-5.42e-02

Pos logit contributions
OPLE+0.113
POSE+0.104
ンジ+0.101
+0.098
iets+0.098
Neg logit contributions
osal-0.130
obbies-0.114
hedral-0.114
paces-0.114
heses-0.105
Token It
Token ID632
Feature activation+5.80e-01

Pos logit contributions
OPLE+1.696
POSE+1.553
ンジ+1.505
+1.468
================================================================+1.468
Neg logit contributions
osal-1.998
obbies-1.763
hedral-1.758
paces-1.726
heses-1.606
Token's
Token ID338
Feature activation+6.72e-01

Pos logit contributions
OPLE+5.365
iets+5.162
POSE+4.988
puter+4.934
ンジ+4.933
Neg logit contributions
osal-7.048
obbies-6.237
hedral-6.096
paces-6.041
idding-5.661
Token ha
Token ID387
Feature activation+3.57e-01

Pos logit contributions
OPLE+6.514
POSE+6.113
+5.897
================================================================+5.872
ンジ+5.863
Neg logit contributions
osal-7.872
obbies-6.936
paces-6.664
hedral-6.592
heses-6.260
Tokenzy
Token ID7357
Feature activation+9.67e-01

Pos logit contributions
OPLE+3.684
POSE+3.479
pling+3.386
puter+3.377
lp+3.350
Neg logit contributions
osal-4.185
hedral-3.667
obbies-3.658
paces-3.643
heses-3.439
Token,
Token ID11
Feature activation+8.92e-01

Pos logit contributions
OPLE+7.844
pling+7.322
lp+7.226
ンジ+7.220
POSE+7.183
Neg logit contributions
osal-11.756
obbies-10.419
paces-10.143
hedral-10.043
heses-9.688
Token pale
Token ID14005
Feature activation+1.33e+00

Pos logit contributions
OPLE+7.540
POSE+7.335
ンジ+7.213
pling+7.101
iets+7.032
Neg logit contributions
osal-10.487
obbies-9.115
hedral-9.034
paces-8.979
heses-8.754
Token yellow
Token ID7872
Feature activation+1.14e+00

Pos logit contributions
OPLE+9.294
pling+9.276
lp+9.257
ンジ+9.141
 hairs+8.884
Neg logit contributions
osal-14.848
obbies-13.763
paces-13.602
hedral-13.144
heses-12.268
Token,
Token ID11
Feature activation+1.09e+00

Pos logit contributions
OPLE+9.048
 hairs+8.537
lp+8.487
POSE+8.451
ンジ+8.320
Neg logit contributions
osal-13.049
obbies-11.622
paces-11.392
hedral-10.922
heses-10.652
Token with
Token ID351
Feature activation+8.35e-01

Pos logit contributions
OPLE+8.762
POSE+8.661
ンジ+8.343
 hairs+8.303
pling+8.289
Neg logit contributions
osal-12.353
obbies-10.703
paces-10.580
hedral-10.414
heses-10.297
Token very
Token ID845
Feature activation+7.00e-01

Pos logit contributions
OPLE+7.366
 hairs+7.005
POSE+6.969
pling+6.725
lp+6.707
Neg logit contributions
osal-9.941
obbies-8.662
paces-8.447
hedral-8.306
heses-8.076
Token little
Token ID1310
Feature activation+9.11e-01

Pos logit contributions
OPLE+5.788
POSE+5.692
pling+5.288
ンジ+5.212
lp+5.197
Neg logit contributions
osal-9.119
obbies-7.923
paces-7.753
hedral-7.677
heses-7.482
Token/
Token ID14
Feature activation+9.91e-02

Pos logit contributions
OPLE+3.900
POSE+3.810
================================================================+3.647
pointer+3.615
lp+3.576
Neg logit contributions
osal-4.997
obbies-4.587
hedral-4.427
paces-4.293
heses-4.132
Token295
Token ID25710
Feature activation+1.55e-01

Pos logit contributions
OPLE+1.070
POSE+0.985
ンジ+0.955
+0.930
pointer+0.928
Neg logit contributions
osal-1.192
obbies-1.050
hedral-1.046
paces-1.039
heses-0.960
Token 3
Token ID513
Feature activation-9.19e-02

Pos logit contributions
OPLE+1.805
POSE+1.682
ンジ+1.632
pointer+1.588
+1.580
Neg logit contributions
osal-1.721
obbies-1.492
hedral-1.486
paces-1.482
heses-1.352
Token*
Token ID9
Feature activation-6.58e-02

Pos logit contributions
osal+0.976
obbies+0.837
hedral+0.835
paces+0.833
heses+0.760
Neg logit contributions
OPLE-1.128
POSE-1.041
ンジ-1.013
-0.993
iets-0.991
Token 0
Token ID657
Feature activation+1.90e-01

Pos logit contributions
osal+0.647
obbies+0.547
paces+0.544
hedral+0.544
heses+0.491
Neg logit contributions
OPLE-0.860
POSE-0.799
ンジ-0.781
-0.765
iets-0.761
Token.
Token ID13
Feature activation+1.32e+00

Pos logit contributions
OPLE+2.067
POSE+1.961
================================================================+1.869
ンジ+1.868
pointer+1.863
Neg logit contributions
osal-2.202
obbies-1.961
hedral-1.922
paces-1.895
heses-1.774
Token8
Token ID23
Feature activation-3.61e-02

Pos logit contributions
905+5.560
OPLE+5.340
================================================================+4.774
66666666+4.764
POSE+4.600
Neg logit contributions
osal-19.673
hedral-19.311
obbies-18.646
 undermin-18.199
erville-18.003
Token681
Token ID48564
Feature activation+7.26e-02

Pos logit contributions
osal+0.471
obbies+0.416
paces+0.415
hedral+0.415
heses+0.385
Neg logit contributions
OPLE-0.353
POSE-0.322
ンジ-0.311
-0.302
iets-0.301
Token 3
Token ID513
Feature activation+1.26e-01

Pos logit contributions
OPLE+0.864
POSE+0.803
ンジ+0.780
iets+0.760
+0.759
Neg logit contributions
osal-0.790
obbies-0.683
hedral-0.680
paces-0.678
heses-0.617
Token*
Token ID9
Feature activation+4.46e-01

Pos logit contributions
OPLE+1.429
POSE+1.346
ンジ+1.299
================================================================+1.273
pointer+1.271
Neg logit contributions
osal-1.412
obbies-1.244
hedral-1.226
paces-1.218
heses-1.123
Token 3
Token ID513
Feature activation+2.83e-01

Pos logit contributions
OPLE+4.558
POSE+4.358
pointer+4.137
ンジ+4.134
iets+4.080
Neg logit contributions
osal-5.180
obbies-4.598
hedral-4.575
paces-4.554
gey-4.151
Token offline
Token ID18043
Feature activation+2.23e-02

Pos logit contributions
osal+2.288
obbies+1.964
paces+1.957
hedral+1.938
heses+1.770
Neg logit contributions
OPLE-2.517
POSE-2.326
ンジ-2.255
-2.219
puter-2.204
Token qualifier
Token ID39265
Feature activation+4.19e-02

Pos logit contributions
OPLE+0.261
POSE+0.242
ンジ+0.235
+0.230
iets+0.229
Neg logit contributions
osal-0.249
obbies-0.215
hedral-0.214
paces-0.214
heses-0.196
Token as
Token ID355
Feature activation-6.02e-02

Pos logit contributions
OPLE+0.490
POSE+0.455
ンジ+0.442
+0.432
iets+0.431
Neg logit contributions
osal-0.465
obbies-0.404
hedral-0.401
paces-0.400
heses-0.367
Token a
Token ID257
Feature activation+1.31e-01

Pos logit contributions
osal+0.695
obbies+0.605
hedral+0.603
paces+0.603
heses+0.552
Neg logit contributions
OPLE-0.681
POSE-0.626
ンジ-0.609
-0.594
iets-0.592
Token stand
Token ID1302
Feature activation+8.48e-01

Pos logit contributions
OPLE+1.527
POSE+1.429
ンジ+1.392
+1.356
iets+1.352
Neg logit contributions
osal-1.442
hedral-1.250
obbies-1.248
paces-1.247
heses-1.143
Token-
Token ID12
Feature activation+1.28e+00

Pos logit contributions
OPLE+7.456
POSE+7.210
puter+6.815
pling+6.805
pointer+6.765
Neg logit contributions
osal-9.944
obbies-9.265
hedral-9.205
paces-9.150
heses-8.417
Tokenin
Token ID259
Feature activation+3.55e-01

Pos logit contributions
pling+9.127
oner+8.969
pointer+8.703
POSE+8.566
lp+8.476
Neg logit contributions
osal-14.764
hedral-14.466
obbies-14.204
paces-13.840
heses-13.207
Token for
Token ID329
Feature activation-2.01e-03

Pos logit contributions
OPLE+3.733
POSE+3.504
ンジ+3.396
pointer+3.326
+3.317
Neg logit contributions
osal-4.150
hedral-3.649
paces-3.635
obbies-3.634
heses-3.404
Token J
Token ID449
Feature activation+6.35e-01

Pos logit contributions
osal+0.023
obbies+0.020
hedral+0.020
paces+0.020
heses+0.019
Neg logit contributions
OPLE-0.023
POSE-0.021
ンジ-0.020
-0.020
iets-0.020
Tokenoon
Token ID2049
Feature activation-1.59e-01

Pos logit contributions
OPLE+7.392
POSE+6.823
iets+6.785
oner+6.642
ンジ+6.638
Neg logit contributions
osal-6.323
hedral-5.456
paces-5.432
obbies-5.331
heses-5.210
Tokena
Token ID64
Feature activation-1.72e-01

Pos logit contributions
osal+2.041
obbies+1.781
paces+1.780
hedral+1.774
gey+1.645
Neg logit contributions
OPLE-1.601
POSE-1.464
ンジ-1.408
-1.373
puter-1.360
Token/
Token ID14
Feature activation+1.97e-02

Pos logit contributions
OPLE+3.837
POSE+3.728
================================================================+3.546
pointer+3.533
ンジ+3.496
Neg logit contributions
osal-4.867
obbies-4.449
hedral-4.314
paces-4.200
heses-4.043
Token215
Token ID23349
Feature activation+1.20e-02

Pos logit contributions
OPLE+0.212
POSE+0.195
ンジ+0.189
+0.184
pointer+0.182
Neg logit contributions
osal-0.240
obbies-0.210
hedral-0.210
paces-0.209
heses-0.193
Token 4
Token ID604
Feature activation-9.43e-02

Pos logit contributions
OPLE+0.145
POSE+0.134
ンジ+0.131
+0.127
iets+0.127
Neg logit contributions
osal-0.129
obbies-0.111
hedral-0.110
paces-0.110
heses-0.100
Token*
Token ID9
Feature activation-3.13e-02

Pos logit contributions
osal+0.996
paces+0.851
obbies+0.851
hedral+0.851
heses+0.774
Neg logit contributions
OPLE-1.167
POSE-1.075
ンジ-1.046
-1.023
iets-1.023
Token 0
Token ID657
Feature activation+2.15e-01

Pos logit contributions
osal+0.310
obbies+0.263
hedral+0.262
paces+0.262
heses+0.236
Neg logit contributions
OPLE-0.406
POSE-0.378
ンジ-0.369
-0.361
iets-0.360
Token.
Token ID13
Feature activation+1.28e+00

Pos logit contributions
OPLE+2.298
POSE+2.188
================================================================+2.089
pointer+2.082
ンジ+2.078
Neg logit contributions
osal-2.506
obbies-2.239
hedral-2.192
paces-2.157
heses-2.026
Token9
Token ID24
Feature activation-2.73e-01

Pos logit contributions
905+5.482
OPLE+5.383
================================================================+4.847
66666666+4.813
POSE+4.667
Neg logit contributions
osal-19.161
hedral-18.782
 undermin-18.145
obbies-18.141
erville-17.442
Token559
Token ID38605
Feature activation-7.21e-02

Pos logit contributions
osal+3.404
paces+3.001
hedral+2.982
obbies+2.981
heses+2.761
Neg logit contributions
OPLE-2.732
POSE-2.534
ンジ-2.454
iets-2.404
-2.401
Token 4
Token ID604
Feature activation-8.69e-02

Pos logit contributions
osal+0.752
obbies+0.644
paces+0.641
hedral+0.641
heses+0.582
Neg logit contributions
OPLE-0.896
POSE-0.831
ンジ-0.810
-0.794
iets-0.789
Token*
Token ID9
Feature activation+1.91e-01

Pos logit contributions
osal+0.910
paces+0.778
hedral+0.775
obbies+0.773
heses+0.703
Neg logit contributions
OPLE-1.082
POSE-0.994
ンジ-0.972
iets-0.951
puter-0.950
Token 4
Token ID604
Feature activation+1.79e-02

Pos logit contributions
OPLE+2.176
POSE+2.043
ンジ+1.978
iets+1.929
pointer+1.928
Neg logit contributions
osal-2.126
obbies-1.844
paces-1.843
hedral-1.843
heses-1.672
Token/
Token ID14
Feature activation+1.25e-02

Pos logit contributions
OPLE+3.260
POSE+3.139
================================================================+2.975
ンジ+2.966
pointer+2.963
Neg logit contributions
osal-3.871
obbies-3.514
hedral-3.418
paces-3.333
heses-3.180
Token253
Token ID28592
Feature activation+9.90e-03

Pos logit contributions
OPLE+0.133
POSE+0.122
ンジ+0.118
+0.115
iets+0.114
Neg logit contributions
osal-0.153
obbies-0.134
hedral-0.133
paces-0.133
heses-0.123
Token 4
Token ID604
Feature activation-1.62e-01

Pos logit contributions
OPLE+0.120
POSE+0.111
ンジ+0.108
+0.106
iets+0.105
Neg logit contributions
osal-0.106
obbies-0.092
hedral-0.091
paces-0.091
heses-0.083
Token*
Token ID9
Feature activation-6.56e-02

Pos logit contributions
osal+1.656
paces+1.414
hedral+1.408
obbies+1.393
heses+1.270
Neg logit contributions
OPLE-2.075
POSE-1.891
ンジ-1.853
puter-1.819
iets-1.814
Token 0
Token ID657
Feature activation+2.12e-01

Pos logit contributions
osal+0.643
obbies+0.543
hedral+0.540
paces+0.540
heses+0.487
Neg logit contributions
OPLE-0.860
POSE-0.799
ンジ-0.781
-0.765
iets-0.760
Token.
Token ID13
Feature activation+1.27e+00

Pos logit contributions
OPLE+2.273
POSE+2.163
================================================================+2.066
pointer+2.056
ンジ+2.055
Neg logit contributions
osal-2.475
obbies-2.214
hedral-2.167
paces-2.131
heses-2.001
Token9
Token ID24
Feature activation-2.32e-01

Pos logit contributions
905+5.394
OPLE+5.321
66666666+4.737
================================================================+4.735
POSE+4.569
Neg logit contributions
osal-19.072
hedral-18.661
obbies-18.015
 undermin-17.616
erville-17.346
Token556
Token ID37864
Feature activation+8.54e-02

Pos logit contributions
osal+2.894
paces+2.550
obbies+2.540
hedral+2.535
heses+2.348
Neg logit contributions
OPLE-2.338
POSE-2.163
ンジ-2.094
-2.052
iets-2.048
Token 4
Token ID604
Feature activation-1.22e-01

Pos logit contributions
OPLE+1.020
POSE+0.949
ンジ+0.920
iets+0.900
+0.897
Neg logit contributions
osal-0.925
obbies-0.799
hedral-0.798
paces-0.795
heses-0.721
Token*
Token ID9
Feature activation+1.81e-01

Pos logit contributions
osal+1.253
paces+1.070
hedral+1.064
obbies+1.056
heses+0.962
Neg logit contributions
OPLE-1.540
POSE-1.406
ンジ-1.380
puter-1.352
iets-1.352
Token 4
Token ID604
Feature activation-8.15e-03

Pos logit contributions
OPLE+2.070
POSE+1.941
ンジ+1.883
iets+1.834
pointer+1.831
Neg logit contributions
osal-2.012
obbies-1.747
hedral-1.747
paces-1.746
heses-1.582
Token/
Token ID14
Feature activation+4.56e-02

Pos logit contributions
OPLE+4.029
POSE+3.948
================================================================+3.786
pointer+3.748
lp+3.701
Neg logit contributions
osal-5.294
obbies-4.866
hedral-4.695
paces-4.539
heses-4.395
Token226
Token ID24909
Feature activation+1.43e-01

Pos logit contributions
OPLE+0.489
POSE+0.449
ンジ+0.436
+0.424
pointer+0.422
Neg logit contributions
osal-0.553
obbies-0.487
hedral-0.485
paces-0.483
heses-0.446
Token 4
Token ID604
Feature activation-1.23e-01

Pos logit contributions
OPLE+1.664
POSE+1.552
ンジ+1.506
pointer+1.467
iets+1.462
Neg logit contributions
osal-1.587
obbies-1.377
hedral-1.371
paces-1.368
heses-1.246
Token*
Token ID9
Feature activation-6.38e-02

Pos logit contributions
osal+1.276
paces+1.090
hedral+1.085
obbies+1.078
heses+0.981
Neg logit contributions
OPLE-1.561
POSE-1.427
ンジ-1.397
puter-1.368
iets-1.367
Token 0
Token ID657
Feature activation+2.17e-01

Pos logit contributions
osal+0.624
obbies+0.527
paces+0.524
hedral+0.524
heses+0.472
Neg logit contributions
OPLE-0.838
POSE-0.779
ンジ-0.761
-0.746
iets-0.742
Token.
Token ID13
Feature activation+1.27e+00

Pos logit contributions
OPLE+2.312
POSE+2.202
================================================================+2.104
pointer+2.093
ンジ+2.088
Neg logit contributions
osal-2.533
obbies-2.267
hedral-2.217
paces-2.181
heses-2.048
Token9
Token ID24
Feature activation-2.95e-01

Pos logit contributions
905+5.263
OPLE+5.167
================================================================+4.626
66666666+4.525
POSE+4.448
Neg logit contributions
osal-19.135
hedral-18.655
obbies-18.082
erville-17.413
 undermin-17.355
Token242
Token ID27877
Feature activation-1.26e-01

Pos logit contributions
osal+3.700
paces+3.266
obbies+3.246
hedral+3.239
heses+2.986
Neg logit contributions
OPLE-2.917
POSE-2.708
ンジ-2.606
iets-2.591
-2.578
Token 4
Token ID604
Feature activation-8.32e-02

Pos logit contributions
osal+1.294
obbies+1.101
paces+1.097
hedral+1.092
heses+0.998
Neg logit contributions
OPLE-1.593
POSE-1.474
ンジ-1.442
-1.416
puter-1.403
Token*
Token ID9
Feature activation+2.28e-01

Pos logit contributions
osal+0.871
paces+0.745
hedral+0.742
obbies+0.739
heses+0.672
Neg logit contributions
OPLE-1.037
POSE-0.952
ンジ-0.932
iets-0.911
puter-0.910
Token 4
Token ID604
Feature activation+2.42e-02

Pos logit contributions
OPLE+2.482
POSE+2.363
ンジ+2.281
================================================================+2.222
pointer+2.221
Neg logit contributions
osal-2.616
obbies-2.318
hedral-2.284
paces-2.261
heses-2.099
Token He
Token ID679
Feature activation-1.34e-01

Pos logit contributions
osal+0.326
obbies+0.285
paces+0.284
hedral+0.284
heses+0.261
Neg logit contributions
OPLE-0.303
POSE-0.278
ンジ-0.270
-0.263
iets-0.262
Token was
Token ID373
Feature activation-7.07e-02

Pos logit contributions
osal+1.397
paces+1.197
obbies+1.193
hedral+1.188
heses+1.082
Neg logit contributions
OPLE-1.674
POSE-1.546
ンジ-1.507
-1.470
puter-1.457
Token a
Token ID257
Feature activation+1.98e-01

Pos logit contributions
osal+0.775
obbies+0.669
hedral+0.668
paces+0.668
heses+0.608
Neg logit contributions
OPLE-0.842
POSE-0.777
ンジ-0.757
-0.740
iets-0.737
Token Batman
Token ID9827
Feature activation+2.60e-01

Pos logit contributions
OPLE+2.249
POSE+2.109
ンジ+2.035
+1.998
pointer+1.997
Neg logit contributions
osal-2.222
obbies-1.928
paces-1.927
hedral-1.924
heses-1.763
Token stand
Token ID1302
Feature activation+7.43e-01

Pos logit contributions
OPLE+2.782
POSE+2.611
ンジ+2.507
puter+2.481
pointer+2.477
Neg logit contributions
osal-3.047
paces-2.695
obbies-2.678
hedral-2.650
heses-2.447
Token-
Token ID12
Feature activation+1.24e+00

Pos logit contributions
OPLE+6.815
POSE+6.589
oner+6.297
puter+6.261
pointer+6.161
Neg logit contributions
osal-8.877
hedral-7.981
obbies-7.884
paces-7.803
heses-7.253
Tokenin
Token ID259
Feature activation+4.05e-01

Pos logit contributions
oner+9.025
pointer+8.647
POSE+8.629
pling+8.597
OPLE+8.403
Neg logit contributions
osal-14.434
hedral-13.558
obbies-13.206
paces-13.014
heses-12.521
Token in
Token ID287
Feature activation-4.80e-01

Pos logit contributions
OPLE+4.163
POSE+3.881
pointer+3.712
puter+3.711
ンジ+3.695
Neg logit contributions
osal-4.839
obbies-4.224
hedral-4.184
paces-4.177
heses-3.886
Token the
Token ID262
Feature activation-1.28e-01

Pos logit contributions
osal+4.869
obbies+4.244
paces+4.227
hedral+4.220
heses+3.831
Neg logit contributions
OPLE-5.792
POSE-5.314
ンジ-5.206
pointer-5.081
iets-5.065
Token Wild
Token ID6183
Feature activation-3.64e-01

Pos logit contributions
osal+1.386
obbies+1.196
hedral+1.191
paces+1.191
heses+1.080
Neg logit contributions
OPLE-1.542
POSE-1.431
ンジ-1.388
-1.351
iets-1.345
Tokenstorm
Token ID12135
Feature activation-2.33e-01

Pos logit contributions
osal+3.111
obbies+2.555
paces+2.543
hedral+2.515
idding+2.244
Neg logit contributions
OPLE-5.178
POSE-4.817
ンジ-4.722
-4.597
pling-4.574
Token t
Token ID256
Feature activation+1.50e-01

Pos logit contributions
OPLE+4.715
POSE+4.395
ンジ+4.304
+4.209
================================================================+4.141
Neg logit contributions
osal-5.286
obbies-4.748
paces-4.691
hedral-4.676
heses-4.348
Tokeninct
Token ID4612
Feature activation-2.78e-01

Pos logit contributions
OPLE+2.112
POSE+1.980
ンジ+1.915
iets+1.907
pointer+1.903
Neg logit contributions
osal-1.280
paces-1.082
obbies-1.062
hedral-1.061
heses-0.927
Tokenure
Token ID495
Feature activation-7.34e-02

Pos logit contributions
osal+2.758
obbies+2.281
paces+2.277
hedral+2.277
heses+2.094
Neg logit contributions
OPLE-3.694
POSE-3.414
ンジ-3.322
-3.297
puter-3.265
Token or
Token ID393
Feature activation+4.80e-01

Pos logit contributions
osal+0.806
obbies+0.693
hedral+0.692
paces+0.691
heses+0.630
Neg logit contributions
OPLE-0.872
POSE-0.808
ンジ-0.783
-0.767
iets-0.765
Token a
Token ID257
Feature activation+4.87e-01

Pos logit contributions
OPLE+5.009
POSE+4.637
ンジ+4.563
+4.444
pling+4.436
Neg logit contributions
osal-5.394
obbies-4.903
hedral-4.802
paces-4.745
heses-4.426
Token few
Token ID1178
Feature activation+1.20e+00

Pos logit contributions
OPLE+5.193
ンジ+4.893
POSE+4.887
pling+4.733
+4.732
Neg logit contributions
osal-5.366
obbies-4.844
hedral-4.709
paces-4.688
heses-4.360
Token dro
Token ID3102
Feature activation-3.23e-01

Pos logit contributions
 hairs+9.284
OPLE+8.847
ンジ+8.415
+8.242
lp+8.196
Neg logit contributions
osal-13.654
obbies-12.750
hedral-12.366
paces-11.971
heses-11.573
Tokenpper
Token ID2848
Feature activation+1.23e-01

Pos logit contributions
osal+3.816
obbies+3.260
paces+3.209
hedral+3.208
gey+2.992
Neg logit contributions
OPLE-3.594
POSE-3.375
-3.249
ンジ-3.196
HCR-3.151
Tokenful
Token ID913
Feature activation+1.25e-01

Pos logit contributions
OPLE+1.401
POSE+1.288
ンジ+1.262
+1.228
iets+1.222
Neg logit contributions
osal-1.380
obbies-1.214
hedral-1.204
paces-1.201
heses-1.093
Tokens
Token ID82
Feature activation+1.33e-02

Pos logit contributions
OPLE+1.609
POSE+1.503
ンジ+1.481
+1.443
iets+1.432
Neg logit contributions
osal-1.212
obbies-1.037
hedral-1.031
paces-1.026
heses-0.920
Token of
Token ID286
Feature activation+6.12e-01

Pos logit contributions
OPLE+0.156
POSE+0.145
ンジ+0.141
+0.137
iets+0.137
Neg logit contributions
osal-0.147
obbies-0.127
hedral-0.127
paces-0.126
heses-0.116
Token/
Token ID14
Feature activation+1.76e-01

Pos logit contributions
OPLE+2.830
POSE+2.699
ンジ+2.565
================================================================+2.555
pointer+2.551
Neg logit contributions
osal-3.220
obbies-2.894
hedral-2.836
paces-2.774
heses-2.625
Token190
Token ID19782
Feature activation-8.01e-02

Pos logit contributions
OPLE+1.826
POSE+1.679
ンジ+1.622
pointer+1.584
+1.579
Neg logit contributions
osal-2.176
obbies-1.929
hedral-1.926
paces-1.903
heses-1.765
Token 4
Token ID604
Feature activation-2.71e-01

Pos logit contributions
osal+0.844
obbies+0.722
paces+0.720
hedral+0.720
heses+0.654
Neg logit contributions
OPLE-0.989
POSE-0.915
ンジ-0.890
-0.872
iets-0.871
Token*
Token ID9
Feature activation+2.12e-02

Pos logit contributions
osal+2.710
hedral+2.304
obbies+2.296
paces+2.289
heses+2.094
Neg logit contributions
OPLE-3.507
POSE-3.208
ンジ-3.120
iets-3.062
-3.061
Token 0
Token ID657
Feature activation+1.78e-01

Pos logit contributions
OPLE+0.270
POSE+0.252
ンジ+0.245
+0.240
iets+0.240
Neg logit contributions
osal-0.214
obbies-0.182
hedral-0.181
paces-0.181
heses-0.164
Token.
Token ID13
Feature activation+1.17e+00

Pos logit contributions
OPLE+1.944
POSE+1.841
ンジ+1.756
================================================================+1.753
pointer+1.751
Neg logit contributions
osal-2.050
obbies-1.822
hedral-1.787
paces-1.763
heses-1.648
Token9
Token ID24
Feature activation-2.75e-01

Pos logit contributions
OPLE+5.341
905+5.163
================================================================+4.733
66666666+4.710
POSE+4.616
Neg logit contributions
osal-17.769
hedral-17.315
 undermin-16.848
obbies-16.828
erville-16.085
Token607
Token ID31980
Feature activation-7.57e-03

Pos logit contributions
osal+3.366
paces+2.963
hedral+2.948
obbies+2.944
heses+2.724
Neg logit contributions
OPLE-2.801
POSE-2.596
ンジ-2.518
iets-2.478
puter-2.476
Token 4
Token ID604
Feature activation+1.06e-03

Pos logit contributions
osal+0.080
obbies+0.069
hedral+0.068
paces+0.068
heses+0.062
Neg logit contributions
OPLE-0.093
POSE-0.086
ンジ-0.084
-0.082
iets-0.082
Token*
Token ID9
Feature activation+2.06e-01

Pos logit contributions
OPLE+0.013
POSE+0.012
ンジ+0.011
+0.011
iets+0.011
Neg logit contributions
osal-0.011
obbies-0.010
hedral-0.010
paces-0.010
heses-0.009
Token 4
Token ID604
Feature activation+3.59e-02

Pos logit contributions
OPLE+2.347
POSE+2.207
ンジ+2.131
pointer+2.078
iets+2.073
Neg logit contributions
osal-2.300
hedral-1.994
obbies-1.992
paces-1.987
heses-1.814
Token different
Token ID1180
Feature activation+3.89e-01

Pos logit contributions
OPLE+2.249
POSE+2.103
ンジ+2.010
+1.978
iets+1.975
Neg logit contributions
osal-2.263
obbies-1.943
hedral-1.939
paces-1.929
heses-1.802
Token somehow
Token ID7599
Feature activation+3.33e-01

Pos logit contributions
OPLE+4.184
POSE+3.878
ンジ+3.756
puter+3.714
pling+3.691
Neg logit contributions
osal-4.502
obbies-3.934
hedral-3.888
paces-3.852
heses-3.631
Token
Token ID198
Feature activation-9.97e-07

Pos logit contributions
OPLE+3.623
POSE+3.361
ンジ+3.261
puter+3.214
pling+3.208
Neg logit contributions
osal-3.843
obbies-3.348
hedral-3.320
paces-3.262
heses-3.067
Token
Token ID198
Feature activation+1.67e-01

Feature not active on prev token;
no predictions were affected.
Tokena
Token ID64
Feature activation+5.43e-01

Pos logit contributions
OPLE+1.965
POSE+1.834
ンジ+1.776
pointer+1.747
================================================================+1.745
Neg logit contributions
osal-1.831
obbies-1.582
hedral-1.562
paces-1.554
heses-1.434
Token few
Token ID1178
Feature activation+1.16e+00

Pos logit contributions
OPLE+6.059
POSE+5.738
puter+5.505
pling+5.484
ンジ+5.467
Neg logit contributions
osal-5.802
obbies-5.010
paces-5.003
hedral-4.941
heses-4.597
Token days
Token ID1528
Feature activation+2.39e-01

Pos logit contributions
OPLE+9.820
 hairs+9.540
================================================================+9.055
POSE+8.969
pointer+8.801
Neg logit contributions
osal-12.645
obbies-11.391
paces-10.843
hedral-10.648
heses-10.302
Token ago
Token ID2084
Feature activation+3.35e-01

Pos logit contributions
OPLE+2.691
POSE+2.507
iets+2.409
ンジ+2.406
puter+2.389
Neg logit contributions
osal-2.702
obbies-2.353
hedral-2.330
paces-2.310
heses-2.150
Token,
Token ID11
Feature activation+4.86e-01

Pos logit contributions
OPLE+3.558
POSE+3.269
ンジ+3.162
================================================================+3.100
puter+3.092
Neg logit contributions
osal-4.019
obbies-3.483
hedral-3.441
paces-3.441
heses-3.206
Token an
Token ID281
Feature activation+7.50e-01

Pos logit contributions
OPLE+4.867
POSE+4.541
ンジ+4.345
================================================================+4.338
pling+4.312
Neg logit contributions
osal-5.851
obbies-5.119
hedral-5.031
paces-5.011
heses-4.707
Token exploration
Token ID13936
Feature activation+8.27e-01

Pos logit contributions
OPLE+7.530
================================================================+6.797
POSE+6.708
oner+6.541
pointer+6.516
Neg logit contributions
osal-8.369
paces-7.664
obbies-7.439
hedral-7.085
heses-6.897
Token/
Token ID14
Feature activation+4.95e-02

Pos logit contributions
OPLE+4.793
POSE+4.679
================================================================+4.504
pointer+4.482
ンジ+4.431
Neg logit contributions
osal-6.647
obbies-6.166
hedral-5.969
paces-5.776
heses-5.596
Token175
Token ID17430
Feature activation-2.19e-01

Pos logit contributions
OPLE+0.534
POSE+0.491
ンジ+0.475
+0.463
pointer+0.461
Neg logit contributions
osal-0.599
obbies-0.526
hedral-0.524
paces-0.522
heses-0.482
Token 4
Token ID604
Feature activation-2.91e-01

Pos logit contributions
osal+2.199
paces+1.858
obbies+1.855
hedral+1.853
heses+1.678
Neg logit contributions
OPLE-2.801
POSE-2.583
ンジ-2.514
puter-2.485
-2.476
Token*
Token ID9
Feature activation+9.78e-02

Pos logit contributions
osal+2.907
hedral+2.474
obbies+2.471
paces+2.465
heses+2.249
Neg logit contributions
OPLE-3.752
POSE-3.435
ンジ-3.342
iets-3.284
-3.279
Token 0
Token ID657
Feature activation+1.61e-01

Pos logit contributions
OPLE+1.204
POSE+1.129
ンジ+1.093
pointer+1.071
iets+1.070
Neg logit contributions
osal-1.015
obbies-0.871
hedral-0.869
paces-0.868
heses-0.785
Token.
Token ID13
Feature activation+1.16e+00

Pos logit contributions
OPLE+1.776
POSE+1.678
ンジ+1.607
================================================================+1.596
pointer+1.595
Neg logit contributions
osal-1.843
obbies-1.632
hedral-1.603
paces-1.585
heses-1.477
Token96
Token ID4846
Feature activation+8.37e-02

Pos logit contributions
OPLE+6.213
905+5.670
66666666+5.610
POSE+5.499
lp+5.312
Neg logit contributions
osal-16.603
hedral-16.142
obbies-15.718
 undermin-15.411
ittens-14.845
Token13
Token ID1485
Feature activation-2.88e-02

Pos logit contributions
OPLE+0.832
POSE+0.763
ンジ+0.734
+0.712
================================================================+0.710
Neg logit contributions
osal-1.081
obbies-0.962
hedral-0.954
paces-0.949
heses-0.884
Token 5
Token ID642
Feature activation+2.00e-02

Pos logit contributions
osal+0.304
obbies+0.261
hedral+0.260
paces+0.260
heses+0.236
Neg logit contributions
OPLE-0.355
POSE-0.329
ンジ-0.320
-0.313
iets-0.312
Token*
Token ID9
Feature activation+1.70e-01

Pos logit contributions
OPLE+0.239
POSE+0.222
ンジ+0.216
+0.211
iets+0.210
Neg logit contributions
osal-0.217
obbies-0.187
hedral-0.186
paces-0.186
heses-0.170
Token 4
Token ID604
Feature activation+1.09e-01

Pos logit contributions
OPLE+1.955
POSE+1.831
ンジ+1.771
pointer+1.722
+1.719
Neg logit contributions
osal-1.893
obbies-1.643
hedral-1.640
paces-1.627
heses-1.494
Token his
Token ID465
Feature activation+3.29e-01

Pos logit contributions
OPLE+4.015
POSE+3.718
ンジ+3.607
+3.512
================================================================+3.510
Neg logit contributions
osal-4.850
paces-4.303
obbies-4.267
hedral-4.217
heses-3.961
Token past
Token ID1613
Feature activation+1.29e-01

Pos logit contributions
OPLE+3.744
POSE+3.471
ンジ+3.403
+3.296
iets+3.291
Neg logit contributions
osal-3.640
paces-3.190
obbies-3.182
hedral-3.140
heses-2.928
Token,
Token ID11
Feature activation+1.08e-01

Pos logit contributions
OPLE+1.499
POSE+1.378
ンジ+1.348
+1.320
iets+1.311
Neg logit contributions
osal-1.440
paces-1.248
obbies-1.246
hedral-1.241
heses-1.141
Token but
Token ID475
Feature activation-1.74e-01

Pos logit contributions
OPLE+1.242
POSE+1.154
ンジ+1.119
iets+1.088
+1.088
Neg logit contributions
osal-1.224
obbies-1.063
paces-1.061
hedral-1.060
heses-0.975
Token a
Token ID257
Feature activation+2.76e-03

Pos logit contributions
osal+1.920
paces+1.662
hedral+1.657
obbies+1.654
heses+1.512
Neg logit contributions
OPLE-2.055
POSE-1.896
ンジ-1.859
-1.807
iets-1.801
Token fe
Token ID730
Feature activation+1.15e+00

Pos logit contributions
OPLE+0.032
POSE+0.029
ンジ+0.029
+0.028
iets+0.028
Neg logit contributions
osal-0.031
obbies-0.027
hedral-0.027
paces-0.027
heses-0.025
Tokenist
Token ID396
Feature activation+1.62e-01

Pos logit contributions
OPLE+7.017
pling+6.879
pointer+6.871
POSE+6.794
puter+6.569
Neg logit contributions
osal-14.183
paces-13.942
hedral-13.900
obbies-13.660
heses-12.582
Tokeny
Token ID88
Feature activation+4.00e-01

Pos logit contributions
OPLE+1.397
POSE+1.265
ンジ+1.187
puter+1.178
iets+1.174
Neg logit contributions
osal-2.278
paces-2.059
obbies-2.055
hedral-2.041
heses-1.903
Token Kaine
Token ID37193
Feature activation+3.40e-01

Pos logit contributions
OPLE+4.372
POSE+4.006
ンジ+3.891
pointer+3.842
oner+3.823
Neg logit contributions
osal-4.540
paces-4.024
obbies-4.015
hedral-3.973
heses-3.691
Token
Token ID960
Feature activation+1.57e-01

Pos logit contributions
OPLE+3.775
POSE+3.525
ンジ+3.361
iets+3.355
pointer+3.332
Neg logit contributions
osal-3.813
obbies-3.391
paces-3.336
hedral-3.326
heses-3.065
Tokenjud
Token ID10456
Feature activation+5.38e-03

Pos logit contributions
OPLE+1.641
POSE+1.519
ンジ+1.469
+1.437
pointer+1.435
Neg logit contributions
osal-1.923
obbies-1.695
hedral-1.681
paces-1.676
heses-1.555
Token nationalist
Token ID19848
Feature activation-1.54e-01

Pos logit contributions
osal+1.650
hedral+1.429
obbies+1.419
paces+1.406
heses+1.295
Neg logit contributions
OPLE-1.764
POSE-1.633
ンジ-1.595
-1.555
puter-1.549
Token heter
Token ID14445
Feature activation+3.56e-01

Pos logit contributions
osal+1.691
hedral+1.460
obbies+1.455
paces+1.436
heses+1.331
Neg logit contributions
OPLE-1.805
POSE-1.675
ンジ-1.638
-1.601
pointer-1.593
Tokenogeneous
Token ID32269
Feature activation+5.08e-03

Pos logit contributions
OPLE+2.137
POSE+1.732
ンジ+1.688
iets+1.612
puter+1.580
Neg logit contributions
osal-5.768
paces-5.429
obbies-5.423
hedral-5.374
idding-5.078
Token coalition
Token ID9906
Feature activation-2.49e-01

Pos logit contributions
OPLE+0.057
POSE+0.053
ンジ+0.051
+0.050
iets+0.050
Neg logit contributions
osal-0.059
obbies-0.051
paces-0.051
hedral-0.051
heses-0.047
Token (
Token ID357
Feature activation+2.46e-01

Pos logit contributions
osal+2.564
hedral+2.206
obbies+2.170
paces+2.160
heses+2.011
Neg logit contributions
OPLE-3.086
POSE-2.873
ンジ-2.808
-2.751
pointer-2.739
Tokenlab
Token ID23912
Feature activation+1.15e+00

Pos logit contributions
OPLE+2.706
POSE+2.497
ンジ+2.406
+2.372
pointer+2.356
Neg logit contributions
osal-2.883
obbies-2.523
hedral-2.492
paces-2.484
heses-2.282
Tokenelled
Token ID11978
Feature activation-1.12e-01

Pos logit contributions
OPLE+9.929
pling+8.837
iets+8.640
ンジ+8.539
+8.341
Neg logit contributions
osal-12.487
paces-12.211
obbies-11.556
hedral-11.305
erville-10.683
Token as
Token ID355
Feature activation+1.61e-01

Pos logit contributions
osal+1.276
hedral+1.102
paces+1.102
obbies+1.101
heses+1.013
Neg logit contributions
OPLE-1.289
POSE-1.193
ンジ-1.153
-1.127
pointer-1.124
Token
Token ID564
Feature activation-3.94e-02

Pos logit contributions
OPLE+1.821
POSE+1.675
ンジ+1.621
iets+1.588
+1.586
Neg logit contributions
osal-1.850
obbies-1.618
paces-1.608
hedral-1.600
heses-1.465
Token
Token ID246
Feature activation-1.81e-01

Pos logit contributions
osal+0.496
obbies+0.436
hedral+0.435
paces+0.435
heses+0.403
Neg logit contributions
OPLE-0.405
POSE-0.370
ンジ-0.358
-0.348
iets-0.348
Tokentre
Token ID33945
Feature activation+3.76e-01

Pos logit contributions
osal+2.322
hedral+2.039
paces+2.037
obbies+2.029
heses+1.896
Neg logit contributions
OPLE-1.800
POSE-1.642
ンジ-1.593
iets-1.540
-1.531
Token 2013
Token ID2211
Feature activation+1.66e-01

Pos logit contributions
OPLE+1.148
POSE+1.069
ンジ+1.035
iets+1.018
================================================================+1.017
Neg logit contributions
osal-0.913
obbies-0.777
hedral-0.772
paces-0.770
heses-0.695
Token,
Token ID11
Feature activation-4.57e-03

Pos logit contributions
OPLE+1.867
POSE+1.717
ンジ+1.671
iets+1.633
+1.623
Neg logit contributions
osal-1.891
obbies-1.660
hedral-1.640
paces-1.636
heses-1.508
Token suffered
Token ID6989
Feature activation+6.21e-01

Pos logit contributions
osal+0.049
obbies+0.043
hedral+0.042
paces+0.042
heses+0.039
Neg logit contributions
OPLE-0.055
POSE-0.051
ンジ-0.050
-0.048
iets-0.048
Token a
Token ID257
Feature activation+7.24e-01

Pos logit contributions
OPLE+6.111
POSE+5.549
puter+5.413
oner+5.341
pling+5.303
Neg logit contributions
osal-7.533
obbies-6.609
paces-6.499
hedral-6.410
heses-6.079
Token computer
Token ID3644
Feature activation+1.11e+00

Pos logit contributions
OPLE+7.233
POSE+6.559
pling+6.468
+6.367
oner+6.367
Neg logit contributions
osal-8.300
obbies-7.423
hedral-7.349
paces-7.211
heses-6.904
Token hard
Token ID1327
Feature activation+1.15e+00

Pos logit contributions
pointer+8.403
OPLE+8.365
puter+8.196
POSE+8.073
oner+7.765
Neg logit contributions
osal-13.192
obbies-12.748
paces-12.312
hedral-11.783
heses-11.066
Token drive
Token ID3708
Feature activation+9.31e-01

Pos logit contributions
OPLE+8.222
pointer+8.129
POSE+7.822
puter+7.682
pling+7.589
Neg logit contributions
osal-14.597
obbies-13.446
paces-13.430
hedral-13.070
heses-12.466
Token crash
Token ID7014
Feature activation+2.14e-01

Pos logit contributions
pointer+7.548
OPLE+7.527
puter+7.204
POSE+7.188
+6.994
Neg logit contributions
osal-11.235
obbies-10.459
paces-10.139
hedral-10.071
heses-9.538
Token that
Token ID326
Feature activation-1.76e-02

Pos logit contributions
OPLE+2.373
POSE+2.216
ンジ+2.115
+2.098
puter+2.091
Neg logit contributions
osal-2.463
obbies-2.179
hedral-2.126
paces-2.102
heses-1.967
Token cost
Token ID1575
Feature activation-1.25e-01

Pos logit contributions
osal+0.188
obbies+0.161
hedral+0.161
paces+0.161
heses+0.146
Neg logit contributions
OPLE-0.214
POSE-0.198
ンジ-0.193
-0.188
iets-0.188
Token potentially
Token ID6196
Feature activation-2.32e-01

Pos logit contributions
osal+1.400
paces+1.209
hedral+1.206
obbies+1.206
heses+1.105
Neg logit contributions
OPLE-1.455
POSE-1.345
ンジ-1.312
iets-1.283
-1.281
Token Please
Token ID4222
Feature activation+8.22e-01

Pos logit contributions
OPLE+5.211
POSE+4.782
pling+4.646
pointer+4.616
+4.588
Neg logit contributions
osal-6.458
paces-6.187
obbies-6.184
hedral-6.164
heses-5.692
Token click
Token ID3904
Feature activation+5.99e-01

Pos logit contributions
OPLE+6.669
oner+5.909
+5.744
pling+5.681
POSE+5.667
Neg logit contributions
osal-11.007
hedral-10.219
paces-9.989
obbies-9.867
heses-9.255
Token below
Token ID2174
Feature activation+8.48e-01

Pos logit contributions
OPLE+4.997
POSE+4.720
ンジ+4.515
================================================================+4.500
+4.418
Neg logit contributions
osal-7.979
obbies-7.172
paces-6.946
hedral-6.900
heses-6.537
Token to
Token ID284
Feature activation+1.60e-01

Pos logit contributions
OPLE+6.700
pling+5.909
POSE+5.692
================================================================+5.575
ンジ+5.481
Neg logit contributions
osal-11.064
paces-10.538
obbies-10.311
hedral-10.305
auld-9.790
Token become
Token ID1716
Feature activation+9.35e-01

Pos logit contributions
OPLE+1.516
POSE+1.392
ンジ+1.311
puter+1.281
================================================================+1.277
Neg logit contributions
osal-2.116
hedral-1.917
paces-1.900
obbies-1.899
heses-1.779
Token an
Token ID281
Feature activation+1.14e+00

Pos logit contributions
OPLE+6.335
POSE+6.328
oner+5.831
lp+5.716
================================================================+5.579
Neg logit contributions
osal-12.447
obbies-11.765
paces-11.508
hedral-11.501
auld-11.418
Token In
Token ID554
Feature activation+5.34e-02

Pos logit contributions
OPLE+6.691
POSE+5.587
oner+5.517
ンジ+5.113
lp+4.953
Neg logit contributions
osal-16.014
paces-15.550
obbies-14.943
hedral-14.691
heses-14.625
TokenDaily
Token ID28545
Feature activation+6.24e-01

Pos logit contributions
OPLE+0.560
POSE+0.519
ンジ+0.503
+0.488
pointer+0.486
Neg logit contributions
osal-0.653
obbies-0.578
paces-0.574
hedral-0.571
heses-0.530
Token supporter
Token ID15525
Feature activation+6.03e-01

Pos logit contributions
OPLE+4.722
POSE+4.069
ンジ+4.037
pointer+3.977
Sports+3.786
Neg logit contributions
osal-8.906
paces-8.380
hedral-8.089
obbies-8.033
heses-7.980
Token.
Token ID13
Feature activation+7.89e-01

Pos logit contributions
OPLE+5.366
 Pwr+4.919
pointer+4.904
POSE+4.850
================================================================+4.845
Neg logit contributions
osal-7.485
obbies-7.309
hedral-6.959
paces-6.848
ittens-6.536
Token Powered
Token ID45090
Feature activation+8.05e-02

Pos logit contributions
OPLE+3.482
POSE+3.355
ンジ+3.280
pling+3.162
66666666+2.962
Neg logit contributions
osal-12.873
paces-12.628
hedral-12.386
obbies-12.323
heses-11.974
Token's
Token ID338
Feature activation+6.72e-01

Pos logit contributions
OPLE+5.365
iets+5.162
POSE+4.988
puter+4.934
ンジ+4.933
Neg logit contributions
osal-7.048
obbies-6.237
hedral-6.096
paces-6.041
idding-5.661
Token ha
Token ID387
Feature activation+3.57e-01

Pos logit contributions
OPLE+6.514
POSE+6.113
+5.897
================================================================+5.872
ンジ+5.863
Neg logit contributions
osal-7.872
obbies-6.936
paces-6.664
hedral-6.592
heses-6.260
Tokenzy
Token ID7357
Feature activation+9.67e-01

Pos logit contributions
OPLE+3.684
POSE+3.479
pling+3.386
puter+3.377
lp+3.350
Neg logit contributions
osal-4.185
hedral-3.667
obbies-3.658
paces-3.643
heses-3.439
Token,
Token ID11
Feature activation+8.92e-01

Pos logit contributions
OPLE+7.844
pling+7.322
lp+7.226
ンジ+7.220
POSE+7.183
Neg logit contributions
osal-11.756
obbies-10.419
paces-10.143
hedral-10.043
heses-9.688
Token pale
Token ID14005
Feature activation+1.33e+00

Pos logit contributions
OPLE+7.540
POSE+7.335
ンジ+7.213
pling+7.101
iets+7.032
Neg logit contributions
osal-10.487
obbies-9.115
hedral-9.034
paces-8.979
heses-8.754
Token yellow
Token ID7872
Feature activation+1.14e+00

Pos logit contributions
OPLE+9.294
pling+9.276
lp+9.257
ンジ+9.141
 hairs+8.884
Neg logit contributions
osal-14.848
obbies-13.763
paces-13.602
hedral-13.144
heses-12.268
Token,
Token ID11
Feature activation+1.09e+00

Pos logit contributions
OPLE+9.048
 hairs+8.537
lp+8.487
POSE+8.451
ンジ+8.320
Neg logit contributions
osal-13.049
obbies-11.622
paces-11.392
hedral-10.922
heses-10.652
Token with
Token ID351
Feature activation+8.35e-01

Pos logit contributions
OPLE+8.762
POSE+8.661
ンジ+8.343
 hairs+8.303
pling+8.289
Neg logit contributions
osal-12.353
obbies-10.703
paces-10.580
hedral-10.414
heses-10.297
Token very
Token ID845
Feature activation+7.00e-01

Pos logit contributions
OPLE+7.366
 hairs+7.005
POSE+6.969
pling+6.725
lp+6.707
Neg logit contributions
osal-9.941
obbies-8.662
paces-8.447
hedral-8.306
heses-8.076
Token little
Token ID1310
Feature activation+9.11e-01

Pos logit contributions
OPLE+5.788
POSE+5.692
pling+5.288
ンジ+5.212
lp+5.197
Neg logit contributions
osal-9.119
obbies-7.923
paces-7.753
hedral-7.677
heses-7.482
Token carbon
Token ID6588
Feature activation+3.02e-01

Pos logit contributions
OPLE+7.856
 hairs+7.594
POSE+7.397
lp+7.243
pling+7.215
Neg logit contributions
osal-10.527
obbies-9.502
paces-9.252
hedral-9.034
heses-8.944
Token was
Token ID373
Feature activation+5.69e-02

Pos logit contributions
osal+2.679
paces+2.309
hedral+2.262
obbies+2.237
heses+2.027
Neg logit contributions
OPLE-3.686
POSE-3.432
ンジ-3.391
-3.270
================================================================-3.247
Token asked
Token ID1965
Feature activation+1.83e-01

Pos logit contributions
OPLE+0.671
POSE+0.622
ンジ+0.601
+0.589
iets+0.588
Neg logit contributions
osal-0.629
obbies-0.545
paces-0.539
hedral-0.538
heses-0.495
Token "
Token ID366
Feature activation+1.89e-01

Pos logit contributions
OPLE+1.950
POSE+1.793
ンジ+1.724
pointer+1.689
+1.686
Neg logit contributions
osal-2.192
obbies-1.946
hedral-1.920
paces-1.919
heses-1.767
Tokento
Token ID1462
Feature activation+2.83e-01

Pos logit contributions
OPLE+2.091
POSE+1.941
ンジ+1.867
pointer+1.840
================================================================+1.831
Neg logit contributions
osal-2.177
obbies-1.919
hedral-1.908
paces-1.889
heses-1.763
Token sign
Token ID1051
Feature activation+6.62e-01

Pos logit contributions
OPLE+3.105
POSE+2.888
ンジ+2.764
puter+2.750
pling+2.737
Neg logit contributions
osal-3.234
obbies-2.851
hedral-2.813
paces-2.758
heses-2.568
Token a
Token ID257
Feature activation+1.12e+00

Pos logit contributions
OPLE+6.047
POSE+5.502
iets+5.401
================================================================+5.377
ンジ+5.374
Neg logit contributions
osal-8.118
obbies-7.464
hedral-7.249
paces-7.076
gey-6.751
Token nond
Token ID30745
Feature activation+2.64e-01

Pos logit contributions
OPLE+9.289
POSE+8.961
ンジ+8.703
oner+8.691
lp+8.642
Neg logit contributions
osal-12.100
obbies-11.714
hedral-11.216
paces-10.683
heses-10.169
Tokenis
Token ID271
Feature activation-1.16e-01

Pos logit contributions
OPLE+3.312
POSE+3.081
iets+3.047
puter+3.010
pling+2.982
Neg logit contributions
osal-2.594
obbies-2.282
hedral-2.237
paces-2.236
gey-1.990
Tokenclosure
Token ID17966
Feature activation+7.24e-01

Pos logit contributions
osal+1.318
hedral+1.142
paces+1.139
obbies+1.137
heses+1.041
Neg logit contributions
OPLE-1.347
POSE-1.238
ンジ-1.203
iets-1.179
-1.170
Token agreement
Token ID4381
Feature activation+2.03e-01

Pos logit contributions
OPLE+4.983
POSE+4.639
oner+4.383
lp+4.289
pointer+4.213
Neg logit contributions
osal-10.169
obbies-9.716
paces-9.523
hedral-9.418
netflix-8.788
Token before
Token ID878
Feature activation-1.36e-02

Pos logit contributions
OPLE+2.234
POSE+2.081
ンジ+1.975
iets+1.956
puter+1.954
Neg logit contributions
osal-2.345
obbies-2.079
hedral-2.049
paces-2.019
heses-1.869
Tokenop
Token ID404
Feature activation+1.84e-01

Pos logit contributions
osal+2.735
obbies+2.207
paces+2.161
hedral+2.149
heses+1.937
Neg logit contributions
OPLE-4.392
POSE-4.079
ンジ-4.035
-3.934
================================================================-3.929
Tokenrene
Token ID25924
Feature activation+1.20e-01

Pos logit contributions
OPLE+1.579
POSE+1.458
iets+1.375
ンジ+1.351
pointer+1.326
Neg logit contributions
osal-2.546
obbies-2.294
paces-2.289
hedral-2.283
heses-2.133
Token rubber
Token ID14239
Feature activation+1.76e-01

Pos logit contributions
OPLE+1.264
POSE+1.156
ンジ+1.121
+1.086
iets+1.086
Neg logit contributions
osal-1.463
paces-1.289
hedral-1.289
obbies-1.285
heses-1.182
Token goes
Token ID2925
Feature activation+1.31e-02

Pos logit contributions
OPLE+2.026
POSE+1.867
ンジ+1.821
+1.782
iets+1.775
Neg logit contributions
osal-1.967
obbies-1.717
hedral-1.714
paces-1.708
heses-1.549
Token into
Token ID656
Feature activation+6.12e-01

Pos logit contributions
OPLE+0.151
POSE+0.139
ンジ+0.135
+0.132
iets+0.132
Neg logit contributions
osal-0.149
obbies-0.129
hedral-0.129
paces-0.128
heses-0.118
Token all
Token ID477
Feature activation+1.12e+00

Pos logit contributions
OPLE+6.225
POSE+5.581
ンジ+5.480
+5.398
pling+5.384
Neg logit contributions
osal-6.776
paces-6.173
obbies-6.168
hedral-6.139
gey-5.762
Token sorts
Token ID10524
Feature activation+3.02e-01

Pos logit contributions
OPLE+10.232
ンジ+9.257
oner+8.977
 hairs+8.738
pling+8.731
Neg logit contributions
osal-11.274
hedral-10.621
paces-10.430
obbies-10.318
gey-10.197
Token of
Token ID286
Feature activation+9.48e-01

Pos logit contributions
OPLE+3.132
POSE+2.926
ンジ+2.870
+2.807
================================================================+2.802
Neg logit contributions
osal-3.555
obbies-3.149
hedral-3.067
paces-3.049
heses-2.891
Token everyday
Token ID10908
Feature activation+8.73e-01

Pos logit contributions
OPLE+9.011
 hairs+8.124
ンジ+8.078
oner+8.011
POSE+7.928
Neg logit contributions
osal-9.930
obbies-9.094
paces-8.901
hedral-8.863
gey-8.623
Token household
Token ID6641
Feature activation+3.73e-01

Pos logit contributions
OPLE+7.904
 hairs+6.991
ンジ+6.879
POSE+6.763
================================================================+6.694
Neg logit contributions
osal-10.171
hedral-9.348
obbies-9.132
paces-9.002
gey-8.862
Token products
Token ID3186
Feature activation+3.10e-01

Pos logit contributions
OPLE+3.758
POSE+3.376
ンジ+3.333
================================================================+3.277
+3.228
Neg logit contributions
osal-4.567
hedral-4.104
obbies-4.051
paces-4.023
gey-3.785
Token]
Token ID60
Feature activation+1.51e-01

Pos logit contributions
OPLE+1.218
POSE+1.130
ンジ+1.102
+1.078
puter+1.072
Neg logit contributions
osal-1.000
obbies-0.863
hedral-0.857
paces-0.855
heses-0.771
Token As
Token ID1081
Feature activation+1.48e-01

Pos logit contributions
OPLE+1.680
POSE+1.547
ンジ+1.501
+1.470
================================================================+1.461
Neg logit contributions
osal-1.762
obbies-1.561
hedral-1.545
paces-1.529
heses-1.412
Token of
Token ID286
Feature activation-5.92e-03

Pos logit contributions
OPLE+1.684
POSE+1.561
ンジ+1.506
+1.489
puter+1.472
Neg logit contributions
osal-1.671
obbies-1.462
hedral-1.445
paces-1.437
heses-1.334
Token 2008
Token ID3648
Feature activation+1.86e-01

Pos logit contributions
osal+0.062
obbies+0.053
hedral+0.053
paces+0.053
heses+0.048
Neg logit contributions
OPLE-0.073
POSE-0.068
ンジ-0.066
-0.065
iets-0.065
Token,
Token ID11
Feature activation+2.75e-01

Pos logit contributions
OPLE+2.105
POSE+1.930
ンジ+1.872
+1.823
================================================================+1.804
Neg logit contributions
osal-2.136
obbies-1.883
hedral-1.866
paces-1.861
heses-1.708
Token it
Token ID340
Feature activation+1.12e+00

Pos logit contributions
OPLE+3.014
POSE+2.778
ンジ+2.684
+2.659
================================================================+2.603
Neg logit contributions
osal-3.197
obbies-2.857
hedral-2.794
paces-2.793
heses-2.577
Token has
Token ID468
Feature activation+7.18e-01

Pos logit contributions
OPLE+8.305
+7.984
ンジ+7.895
pling+7.691
================================================================+7.536
Neg logit contributions
osal-13.645
obbies-12.320
hedral-12.296
paces-11.716
heses-11.116
Token sold
Token ID2702
Feature activation+8.62e-02

Pos logit contributions
OPLE+5.491
+5.021
pling+4.797
================================================================+4.773
ンジ+4.767
Neg logit contributions
osal-9.949
obbies-8.952
paces-8.806
hedral-8.678
heses-8.458
Token over
Token ID625
Feature activation+2.68e-01

Pos logit contributions
OPLE+0.995
POSE+0.922
ンジ+0.895
+0.878
iets+0.869
Neg logit contributions
osal-0.971
obbies-0.848
hedral-0.838
paces-0.833
heses-0.767
Token 43
Token ID5946
Feature activation+5.51e-01

Pos logit contributions
OPLE+2.834
POSE+2.604
ンジ+2.521
+2.475
pointer+2.464
Neg logit contributions
osal-3.203
obbies-2.851
hedral-2.798
paces-2.788
heses-2.601
Token,
Token ID11
Feature activation+1.33e-01

Pos logit contributions
OPLE+4.384
ンジ+3.843
POSE+3.822
+3.633
puter+3.561
Neg logit contributions
osal-7.835
obbies-7.057
hedral-6.961
paces-6.955
heses-6.543
Token the
Token ID262
Feature activation-4.33e-02

Pos logit contributions
osal+1.835
hedral+1.575
obbies+1.572
paces+1.569
heses+1.436
Neg logit contributions
OPLE-1.956
POSE-1.813
ンジ-1.767
-1.722
puter-1.712
Token NFL
Token ID5134
Feature activation-3.12e-01

Pos logit contributions
osal+0.513
obbies+0.447
hedral+0.446
paces+0.446
heses+0.410
Neg logit contributions
OPLE-0.476
POSE-0.438
ンジ-0.425
-0.415
iets-0.412
Token Commissioner
Token ID13270
Feature activation-7.93e-01

Pos logit contributions
osal+3.276
obbies+2.776
paces+2.767
hedral+2.750
idding+2.543
Neg logit contributions
OPLE-3.849
POSE-3.525
ンジ-3.496
-3.400
================================================================-3.388
Token as
Token ID355
Feature activation-9.84e-02

Pos logit contributions
osal+7.465
paces+6.055
hedral+5.941
obbies+5.861
heses+5.621
Neg logit contributions
OPLE-9.916
POSE-9.224
ンジ-9.175
-8.913
================================================================-8.798
Token judge
Token ID5052
Feature activation-3.37e-01

Pos logit contributions
osal+1.106
obbies+0.954
hedral+0.953
paces+0.953
heses+0.869
Neg logit contributions
OPLE-1.147
POSE-1.055
ンジ-1.031
-1.005
iets-0.997
Token,
Token ID11
Feature activation-1.57e+00

Pos logit contributions
osal+3.418
paces+2.899
hedral+2.851
obbies+2.835
heses+2.603
Neg logit contributions
OPLE-4.246
POSE-3.968
ンジ-3.843
-3.758
iets-3.741
Token jury
Token ID9002
Feature activation-1.76e-01

Pos logit contributions
osal+12.123
hedral+9.097
obbies+8.945
ogie+8.588
idding+8.485
Neg logit contributions
OPLE-17.786
POSE-16.645
 Pwr-16.170
iets-16.086
ンジ-16.062
Token,
Token ID11
Feature activation-8.19e-01

Pos logit contributions
osal+1.844
obbies+1.572
hedral+1.558
paces+1.543
heses+1.415
Neg logit contributions
OPLE-2.197
POSE-2.033
ンジ-1.965
-1.927
iets-1.922
Token execution
Token ID9706
Feature activation-4.77e-01

Pos logit contributions
osal+7.619
obbies+6.190
hedral+6.048
paces+5.843
gey+5.489
Neg logit contributions
OPLE-10.332
POSE-9.564
-9.187
ンジ-9.126
iets-9.052
Tokenor
Token ID273
Feature activation-6.18e-01

Pos logit contributions
osal+4.658
obbies+3.797
paces+3.770
hedral+3.711
heses+3.490
Neg logit contributions
OPLE-6.210
POSE-5.787
ンジ-5.556
puter-5.498
-5.444
Token,
Token ID11
Feature activation-5.41e-01

Pos logit contributions
osal+5.698
paces+4.676
obbies+4.614
hedral+4.580
heses+4.322
Neg logit contributions
OPLE-8.059
POSE-7.541
ンジ-7.351
-7.199
puter-7.109
Token previously
Token ID4271
Feature activation-4.00e-01

Pos logit contributions
osal+3.267
paces+2.825
obbies+2.770
hedral+2.750
gey+2.516
Neg logit contributions
OPLE-3.898
POSE-3.647
ンジ-3.526
puter-3.416
-3.415
Token mentioned
Token ID4750
Feature activation-5.29e-02

Pos logit contributions
osal+4.072
paces+3.611
obbies+3.482
hedral+3.462
gey+3.214
Neg logit contributions
OPLE-5.031
POSE-4.689
ンジ-4.516
================================================================-4.388
-4.352
Token.
Token ID13
Feature activation-3.74e-01

Pos logit contributions
osal+0.575
obbies+0.495
paces+0.494
hedral+0.493
heses+0.450
Neg logit contributions
OPLE-0.634
POSE-0.587
ンジ-0.571
-0.557
iets-0.556
Token
Token ID198
Feature activation-1.90e-03

Pos logit contributions
osal+3.954
paces+3.422
obbies+3.341
hedral+3.320
heses+3.090
Neg logit contributions
OPLE-4.410
POSE-4.097
ンジ-3.980
iets-3.975
puter-3.931
Token
Token ID198
Feature activation-1.61e-01

Pos logit contributions
osal+0.020
obbies+0.017
hedral+0.017
paces+0.017
heses+0.016
Neg logit contributions
OPLE-0.023
POSE-0.021
ンジ-0.021
-0.020
iets-0.020
TokenNo
Token ID2949
Feature activation-1.47e+00

Pos logit contributions
osal+1.807
paces+1.563
obbies+1.557
hedral+1.555
heses+1.423
Neg logit contributions
OPLE-1.859
POSE-1.710
ンジ-1.676
iets-1.645
puter-1.628
Token word
Token ID1573
Feature activation-5.71e-01

Pos logit contributions
osal+13.561
obbies+11.065
paces+11.005
gey+10.639
idding+10.628
Neg logit contributions
OPLE-14.480
POSE-13.575
ンジ-12.904
================================================================-12.768
iets-12.452
Token if
Token ID611
Feature activation-2.47e-02

Pos logit contributions
osal+5.518
paces+4.698
obbies+4.499
hedral+4.461
idding+4.199
Neg logit contributions
OPLE-7.348
POSE-6.762
ンジ-6.669
puter-6.486
-6.432
Token the
Token ID262
Feature activation+1.84e-02

Pos logit contributions
osal+0.285
obbies+0.248
hedral+0.247
paces+0.247
heses+0.227
Neg logit contributions
OPLE-0.279
POSE-0.257
ンジ-0.250
-0.244
iets-0.243
Token Str
Token ID4285
Feature activation+6.55e-02

Pos logit contributions
OPLE+0.209
POSE+0.193
ンジ+0.187
+0.183
iets+0.182
Neg logit contributions
osal-0.212
obbies-0.184
paces-0.183
hedral-0.183
heses-0.168
Tokenand
Token ID392
Feature activation-8.26e-02

Pos logit contributions
OPLE+0.638
POSE+0.582
ンジ+0.557
iets+0.551
pointer+0.547
Neg logit contributions
osal-0.849
obbies-0.752
hedral-0.749
paces-0.748
heses-0.696
Token https
Token ID3740
Feature activation+2.46e-01

Pos logit contributions
osal+0.058
obbies+0.052
hedral+0.052
paces+0.052
heses+0.048
Neg logit contributions
OPLE-0.039
POSE-0.035
ンジ-0.034
-0.033
iets-0.033
Token://
Token ID1378
Feature activation+3.80e-01

Pos logit contributions
OPLE+2.239
POSE+2.098
================================================================+1.982
ンジ+1.955
pointer+1.933
Neg logit contributions
osal-3.274
obbies-2.970
hedral-2.936
paces-2.905
heses-2.758
Tokenwww
Token ID2503
Feature activation-2.97e-01

Pos logit contributions
OPLE+1.344
pointer+1.196
POSE+1.114
oner+1.086
lp+1.061
Neg logit contributions
osal-6.981
hedral-6.393
paces-6.387
obbies-6.386
heses-6.119
Token.
Token ID13
Feature activation-3.79e-01

Pos logit contributions
osal+2.768
paces+2.372
hedral+2.305
obbies+2.248
gey+2.073
Neg logit contributions
OPLE-4.028
ンジ-3.695
POSE-3.644
puter-3.617
iets-3.590
Tokenbel
Token ID6667
Feature activation-7.73e-01

Pos logit contributions
osal+3.353
hedral+2.865
obbies+2.793
paces+2.726
auld+2.563
Neg logit contributions
OPLE-5.376
POSE-4.996
ンジ-4.936
iets-4.757
-4.728
Tokenfast
Token ID7217
Feature activation-1.47e+00

Pos logit contributions
osal+4.789
hedral+4.341
gey+3.963
paces+3.723
obbies+3.453
Neg logit contributions
OPLE-12.150
POSE-11.612
ンジ-11.218
 Pwr-10.959
-10.908
Tokente
Token ID660
Feature activation-6.62e-01

Pos logit contributions
osal+2.794
te+0.070
hedral-0.239
paces-0.657
auld-1.051
Neg logit contributions
iets-26.680
ンジ-26.491
POSE-26.474
OPLE-26.445
 Pwr-25.799
Tokenlegraph
Token ID16606
Feature activation-4.97e-01

Pos logit contributions
osal+6.383
paces+5.516
hedral+5.228
obbies+5.166
gey+4.875
Neg logit contributions
OPLE-8.270
ンジ-8.036
POSE-8.024
-7.580
iets-7.379
Token.
Token ID13
Feature activation-6.27e-01

Pos logit contributions
osal+4.308
paces+3.565
hedral+3.454
obbies+3.371
gey+3.097
Neg logit contributions
OPLE-6.991
ンジ-6.564
POSE-6.404
puter-6.260
-6.218
Tokenco
Token ID1073
Feature activation-4.92e-01

Pos logit contributions
osal+4.489
hedral+3.624
obbies+3.438
paces+3.438
gey+3.168
Neg logit contributions
OPLE-9.584
POSE-9.208
ンジ-8.877
-8.740
HCR-8.720
Token.
Token ID13
Feature activation-4.92e-01

Pos logit contributions
osal+4.278
paces+3.484
hedral+3.477
obbies+3.323
gey+3.171
Neg logit contributions
OPLE-6.888
ンジ-6.464
POSE-6.303
-6.180
puter-6.153
Token less
Token ID1342
Feature activation+4.52e-02

Pos logit contributions
OPLE+0.791
POSE+0.734
ンジ+0.709
+0.692
iets+0.691
Neg logit contributions
osal-0.800
obbies-0.696
hedral-0.695
paces-0.691
heses-0.637
Token one
Token ID530
Feature activation-3.65e-01

Pos logit contributions
OPLE+0.518
POSE+0.481
ンジ+0.465
+0.453
iets+0.452
Neg logit contributions
osal-0.512
obbies-0.444
hedral-0.443
paces-0.442
heses-0.406
Token-
Token ID12
Feature activation-2.52e-01

Pos logit contributions
osal+3.597
hedral+3.047
obbies+3.019
paces+2.984
gey+2.706
Neg logit contributions
OPLE-4.688
POSE-4.289
ンジ-4.235
-4.163
puter-4.132
Tokenday
Token ID820
Feature activation+2.87e-02

Pos logit contributions
osal+3.387
hedral+3.002
paces+2.987
obbies+2.985
gey+2.770
Neg logit contributions
OPLE-2.448
POSE-2.152
ンジ-2.095
-2.044
puter-2.000
Token (
Token ID357
Feature activation+1.83e-01

Pos logit contributions
OPLE+0.326
POSE+0.302
ンジ+0.292
+0.285
iets+0.285
Neg logit contributions
osal-0.329
obbies-0.286
paces-0.285
hedral-0.285
heses-0.261
Tokenb
Token ID65
Feature activation-1.45e+00

Pos logit contributions
OPLE+2.098
POSE+1.963
ンジ+1.884
pointer+1.874
+1.856
Neg logit contributions
osal-2.044
obbies-1.763
hedral-1.763
paces-1.742
heses-1.618
Tokenilateral
Token ID14796
Feature activation-2.47e-01

Pos logit contributions
osal+13.832
obbies+13.661
idding+12.413
aths+11.771
ogie+11.745
Neg logit contributions
ンジ-14.512
OPLE-14.368
-14.115
-13.983
POSE-13.795
Token)
Token ID8
Feature activation-6.65e-02

Pos logit contributions
osal+2.665
obbies+2.299
hedral+2.291
paces+2.266
heses+2.078
Neg logit contributions
OPLE-2.991
POSE-2.737
ンジ-2.672
puter-2.615
-2.603
Token cricket
Token ID18836
Feature activation-4.76e-02

Pos logit contributions
osal+0.732
obbies+0.633
hedral+0.630
paces+0.629
heses+0.574
Neg logit contributions
OPLE-0.790
POSE-0.730
ンジ-0.711
-0.694
iets-0.691
Token and
Token ID290
Feature activation-8.53e-02

Pos logit contributions
osal+0.528
obbies+0.456
paces+0.455
hedral+0.455
heses+0.415
Neg logit contributions
OPLE-0.561
POSE-0.519
ンジ-0.505
-0.493
iets-0.491
Token more
Token ID517
Feature activation+3.06e-01

Pos logit contributions
osal+0.942
obbies+0.816
paces+0.814
hedral+0.812
heses+0.740
Neg logit contributions
OPLE-1.010
POSE-0.931
ンジ-0.910
-0.887
iets-0.881
Token Enemy
Token ID21785
Feature activation+8.45e-02

Pos logit contributions
osal+6.494
hedral+4.716
obbies+4.708
auld+4.409
paces+4.345
Neg logit contributions
OPLE-11.936
iets-11.503
POSE-11.315
puter-11.306
ンジ-11.101
Token of
Token ID286
Feature activation-8.36e-01

Pos logit contributions
OPLE+0.955
POSE+0.891
ンジ+0.873
+0.851
================================================================+0.847
Neg logit contributions
osal-0.954
paces-0.831
hedral-0.831
obbies-0.830
heses-0.754
Token the
Token ID262
Feature activation-1.28e+00

Pos logit contributions
osal+6.010
paces+5.616
obbies+5.248
hedral+5.218
idding+4.958
Neg logit contributions
OPLE-12.428
puter-11.189
POSE-10.926
iets-10.587
pointer-10.443
Token State
Token ID1812
Feature activation+8.03e-02

Pos logit contributions
paces+7.188
osal+7.005
obbies+6.527
 Whole+6.206
hedral+6.168
Neg logit contributions
OPLE-17.608
puter-17.280
iets-16.443
POSE-16.396
HCR-16.297
Token:
Token ID25
Feature activation-2.73e-01

Pos logit contributions
OPLE+0.874
POSE+0.805
ンジ+0.787
+0.765
================================================================+0.765
Neg logit contributions
osal-0.951
obbies-0.834
hedral-0.827
paces-0.825
heses-0.764
Token The
Token ID383
Feature activation-1.43e+00

Pos logit contributions
osal+2.916
paces+2.488
obbies+2.424
hedral+2.423
heses+2.291
Neg logit contributions
OPLE-3.353
POSE-3.030
ンジ-2.995
puter-2.967
iets-2.965
Token Life
Token ID5155
Feature activation-3.53e-01

Pos logit contributions
osal+6.695
paces+5.297
hedral+4.997
 Modern+4.579
obbies+4.417
Neg logit contributions
OPLE-21.728
iets-21.641
puter-20.183
HCR-20.120
phia-20.033
Token of
Token ID286
Feature activation-3.45e-01

Pos logit contributions
osal+3.351
obbies+2.813
paces+2.761
hedral+2.699
auld+2.499
Neg logit contributions
OPLE-4.756
POSE-4.438
iets-4.295
puter-4.291
ンジ-4.257
Token Murray
Token ID12164
Feature activation+7.84e-02

Pos logit contributions
osal+3.515
hedral+3.036
paces+3.013
obbies+2.957
heses+2.727
Neg logit contributions
OPLE-4.346
POSE-4.068
puter-3.959
iets-3.904
ンジ-3.888
Token N
Token ID399
Feature activation+1.32e-01

Pos logit contributions
OPLE+0.926
POSE+0.864
ンジ+0.838
iets+0.822
+0.812
Neg logit contributions
osal-0.863
obbies-0.742
hedral-0.739
paces-0.736
heses-0.678
Token.
Token ID13
Feature activation+3.89e-01

Pos logit contributions
OPLE+1.495
POSE+1.391
iets+1.329
ンジ+1.327
pointer+1.320
Neg logit contributions
osal-1.492
hedral-1.314
obbies-1.309
paces-1.302
heses-1.180
TokenBut
Token ID1537
Feature activation-2.78e-01

Pos logit contributions
OPLE+0.583
POSE+0.538
ンジ+0.520
+0.508
================================================================+0.508
Neg logit contributions
osal-0.599
obbies-0.524
hedral-0.521
paces-0.518
heses-0.479
Token back
Token ID736
Feature activation+4.10e-02

Pos logit contributions
osal+2.997
paces+2.614
hedral+2.577
obbies+2.552
heses+2.343
Neg logit contributions
OPLE-3.305
POSE-3.090
ンジ-3.048
-2.935
pointer-2.903
Token to
Token ID284
Feature activation-6.21e-01

Pos logit contributions
OPLE+0.476
POSE+0.441
ンジ+0.428
+0.419
iets+0.417
Neg logit contributions
osal-0.458
obbies-0.398
hedral-0.395
paces-0.395
heses-0.362
Token the
Token ID262
Feature activation-6.11e-01

Pos logit contributions
osal+5.896
paces+5.181
hedral+5.127
obbies+4.971
heses+4.507
Neg logit contributions
OPLE-7.612
POSE-7.085
ンジ-7.074
-6.789
iets-6.783
Token no
Token ID645
Feature activation-9.90e-01

Pos logit contributions
osal+6.035
paces+5.198
hedral+5.174
obbies+5.058
heses+4.635
Neg logit contributions
OPLE-7.357
POSE-6.828
ンジ-6.804
-6.644
puter-6.472
Token-
Token ID12
Feature activation-1.42e+00

Pos logit contributions
osal+9.085
hedral+7.649
obbies+7.398
ogie+7.258
gey+7.203
Neg logit contributions
OPLE-12.269
POSE-11.063
ンジ-10.725
HCR-10.482
 Pwr-10.352
Tokenspin
Token ID39706
Feature activation-3.38e-01

Pos logit contributions
osal+12.848
spin+12.369
paces+11.836
hedral+11.128
walk+10.895
Neg logit contributions
OPLE-14.990
ンジ-13.121
POSE-12.696
HCR-12.452
 Pwr-12.211
Token scenario
Token ID8883
Feature activation-8.08e-02

Pos logit contributions
osal+3.964
hedral+3.445
paces+3.416
obbies+3.389
heses+3.185
Neg logit contributions
OPLE-3.703
POSE-3.402
ンジ-3.331
-3.263
iets-3.217
Token for
Token ID329
Feature activation-4.53e-01

Pos logit contributions
osal+0.867
hedral+0.743
paces+0.743
obbies+0.741
heses+0.675
Neg logit contributions
OPLE-0.980
POSE-0.910
ンジ-0.889
-0.866
iets-0.862
Token a
Token ID257
Feature activation-7.15e-01

Pos logit contributions
osal+4.513
hedral+3.901
paces+3.889
obbies+3.847
heses+3.491
Neg logit contributions
OPLE-5.475
ンジ-5.185
POSE-5.178
iets-4.962
-4.945
Token second
Token ID1218
Feature activation+3.78e-02

Pos logit contributions
osal+6.684
paces+5.767
hedral+5.699
obbies+5.577
idding+5.169
Neg logit contributions
OPLE-8.803
ンジ-8.058
POSE-7.909
-7.787
iets-7.751
Token work
Token ID670
Feature activation+2.57e-02

Pos logit contributions
OPLE+0.192
POSE+0.177
ンジ+0.173
+0.168
iets+0.168
Neg logit contributions
osal-0.174
obbies-0.151
hedral-0.150
paces-0.150
heses-0.137
Token,
Token ID11
Feature activation-1.70e-01

Pos logit contributions
OPLE+0.307
POSE+0.284
ンジ+0.276
+0.269
iets+0.269
Neg logit contributions
osal-0.280
obbies-0.242
hedral-0.241
paces-0.240
heses-0.220
Token but
Token ID475
Feature activation-4.00e-01

Pos logit contributions
osal+1.739
obbies+1.477
paces+1.472
hedral+1.447
heses+1.317
Neg logit contributions
OPLE-2.139
POSE-1.991
ンジ-1.945
-1.911
pointer-1.895
Token there
Token ID612
Feature activation+1.23e-01

Pos logit contributions
osal+4.145
paces+3.542
obbies+3.438
hedral+3.434
idding+3.176
Neg logit contributions
OPLE-4.850
POSE-4.518
ンジ-4.470
-4.422
pointer-4.358
Token is
Token ID318
Feature activation-5.21e-01

Pos logit contributions
OPLE+1.462
POSE+1.359
ンジ+1.308
iets+1.282
================================================================+1.277
Neg logit contributions
osal-1.337
obbies-1.162
paces-1.159
hedral-1.158
heses-1.052
Token no
Token ID645
Feature activation-1.37e+00

Pos logit contributions
osal+5.122
paces+4.310
hedral+4.214
obbies+4.129
idding+3.818
Neg logit contributions
OPLE-6.512
POSE-6.018
ンジ-5.968
-5.931
iets-5.893
Token vaccine
Token ID12319
Feature activation-1.22e-02

Pos logit contributions
osal+10.096
 rhy+8.274
 WAY+7.866
 cure+7.842
paces+7.660
Neg logit contributions
OPLE-16.180
POSE-15.479
iets-15.257
ンジ-15.232
-14.829
Token Good
Token ID4599
Feature activation+1.58e-01

Pos logit contributions
osal+0.138
obbies+0.120
hedral+0.119
paces+0.119
heses+0.109
Neg logit contributions
OPLE-0.141
POSE-0.130
ンジ-0.127
-0.124
iets-0.123
Token personal
Token ID2614
Feature activation-4.57e-02

Pos logit contributions
OPLE+1.831
POSE+1.703
ンジ+1.671
iets+1.620
puter+1.611
Neg logit contributions
osal-1.748
hedral-1.537
paces-1.526
obbies-1.521
heses-1.388
Token hygiene
Token ID33306
Feature activation+4.10e-01

Pos logit contributions
osal+0.553
obbies+0.484
hedral+0.482
paces+0.481
heses+0.444
Neg logit contributions
OPLE-0.492
POSE-0.452
ンジ-0.438
-0.427
iets-0.425
Token,
Token ID11
Feature activation+7.80e-02

Pos logit contributions
OPLE+4.375
POSE+4.035
ンジ+3.870
iets+3.849
================================================================+3.839
Neg logit contributions
osal-4.755
hedral-4.308
obbies-4.124
paces-4.118
heses-3.820
Token N
Token ID399
Feature activation-3.99e-01

Pos logit contributions
osal+1.591
obbies+1.367
hedral+1.366
paces+1.364
heses+1.242
Neg logit contributions
OPLE-1.661
POSE-1.531
ンジ-1.489
-1.449
iets-1.446
Token64
Token ID2414
Feature activation-1.40e-01

Pos logit contributions
osal+4.093
obbies+3.468
hedral+3.431
paces+3.386
idding+3.148
Neg logit contributions
OPLE-4.904
POSE-4.609
ンジ-4.430
-4.310
iets-4.307
Token as
Token ID355
Feature activation-8.29e-02

Pos logit contributions
osal+1.600
hedral+1.388
paces+1.381
obbies+1.379
heses+1.263
Neg logit contributions
OPLE-1.608
POSE-1.475
ンジ-1.438
-1.399
pointer-1.389
Token the
Token ID262
Feature activation-4.76e-01

Pos logit contributions
osal+0.925
hedral+0.796
obbies+0.796
paces+0.795
heses+0.726
Neg logit contributions
OPLE-0.972
POSE-0.896
ンジ-0.874
-0.850
iets-0.849
Token New
Token ID968
Feature activation-5.27e-01

Pos logit contributions
osal+4.990
hedral+4.236
paces+4.199
obbies+4.088
gey+3.805
Neg logit contributions
OPLE-5.743
POSE-5.348
ンジ-5.124
iets-5.040
-4.953
Token A
Token ID317
Feature activation-1.37e+00

Pos logit contributions
osal+5.035
hedral+4.248
paces+4.206
obbies+4.164
idding+3.801
Neg logit contributions
OPLE-7.005
POSE-6.389
ンジ-6.048
iets-5.981
-5.969
Tokenger
Token ID1362
Feature activation-4.07e-01

Pos logit contributions
osal+12.398
hedral+10.340
obbies+10.113
ï+9.984
paces+9.867
Neg logit contributions
OPLE-14.727
POSE-13.121
pointer-13.067
puter-13.062
iets-13.000
Token
Token ID447
Feature activation+1.73e-01

Pos logit contributions
osal+4.600
hedral+3.942
paces+3.884
obbies+3.846
gey+3.621
Neg logit contributions
OPLE-4.728
POSE-4.288
ンジ-4.148
-4.064
iets-4.030
Token
Token ID247
Feature activation-2.04e-01

Pos logit contributions
OPLE+1.808
POSE+1.691
ンジ+1.638
+1.609
================================================================+1.595
Neg logit contributions
osal-2.092
obbies-1.873
paces-1.834
hedral-1.831
heses-1.693
Tokens
Token ID82
Feature activation-4.06e-01

Pos logit contributions
osal+1.990
paces+1.672
hedral+1.669
obbies+1.658
heses+1.503
Neg logit contributions
OPLE-2.702
POSE-2.497
ンジ-2.444
-2.378
iets-2.372
Token console
Token ID8624
Feature activation-3.76e-01

Pos logit contributions
osal+4.436
paces+3.814
hedral+3.808
obbies+3.730
heses+3.445
Neg logit contributions
OPLE-4.802
POSE-4.413
ンジ-4.264
iets-4.158
-4.115
Token still
Token ID991
Feature activation-2.97e-02

Pos logit contributions
osal+0.747
hedral+0.650
paces+0.649
obbies+0.649
heses+0.590
Neg logit contributions
OPLE-0.760
POSE-0.702
ンジ-0.680
-0.666
iets-0.661
Token be
Token ID307
Feature activation-5.55e-01

Pos logit contributions
osal+0.327
obbies+0.282
hedral+0.281
paces+0.281
heses+0.256
Neg logit contributions
OPLE-0.352
POSE-0.326
ンジ-0.317
-0.310
iets-0.308
Token moving
Token ID3867
Feature activation-2.74e-01

Pos logit contributions
osal+5.410
hedral+4.852
paces+4.728
obbies+4.525
auld+4.232
Neg logit contributions
OPLE-6.940
POSE-6.335
ンジ-6.124
iets-6.064
-6.049
Token at
Token ID379
Feature activation-9.90e-01

Pos logit contributions
osal+2.781
paces+2.457
hedral+2.437
obbies+2.384
idding+2.153
Neg logit contributions
OPLE-3.429
POSE-3.158
ンジ-3.072
-3.017
iets-2.979
Token the
Token ID262
Feature activation-1.08e+00

Pos logit contributions
osal+8.425
hedral+7.318
paces+7.180
obbies+6.813
erville+6.313
Neg logit contributions
OPLE-12.305
POSE-11.566
ンジ-11.261
-11.169
HCR-11.054
Token original
Token ID2656
Feature activation-1.36e+00

Pos logit contributions
osal+8.720
hedral+7.598
paces+7.317
obbies+6.836
 middle+6.591
Neg logit contributions
OPLE-13.491
POSE-12.379
ンジ-12.328
-12.229
HCR-11.949
Token speed
Token ID2866
Feature activation-3.26e-01

Pos logit contributions
osal+10.376
hedral+8.948
paces+8.500
obbies+8.171
spin+8.049
Neg logit contributions
OPLE-16.086
ンジ-14.782
-14.723
HCR-14.631
POSE-14.580
Token of
Token ID286
Feature activation-6.53e-01

Pos logit contributions
osal+3.299
paces+2.808
hedral+2.770
obbies+2.744
idding+2.471
Neg logit contributions
OPLE-4.155
POSE-3.836
ンジ-3.764
-3.705
iets-3.654
Token the
Token ID262
Feature activation-9.71e-01

Pos logit contributions
osal+6.719
hedral+5.733
obbies+5.571
paces+5.448
idding+5.091
Neg logit contributions
OPLE-7.805
POSE-7.257
-7.141
ンジ-7.134
iets-6.996
Token Earth
Token ID3668
Feature activation-2.69e-01

Pos logit contributions
osal+7.452
hedral+6.247
obbies+6.003
paces+5.969
erville+5.372
Neg logit contributions
OPLE-12.517
POSE-11.911
-11.804
ンジ-11.695
iets-11.532
Token's
Token ID338
Feature activation-5.84e-01

Pos logit contributions
osal+2.731
hedral+2.342
paces+2.323
obbies+2.307
idding+2.086
Neg logit contributions
OPLE-3.398
POSE-3.120
ンジ-3.075
-3.013
iets-2.946
TokenAS
Token ID1921
Feature activation+5.04e-02

Pos logit contributions
OPLE+0.331
POSE+0.293
ンジ+0.277
iets+0.267
+0.266
Neg logit contributions
osal-0.633
obbies-0.572
hedral-0.572
paces-0.571
heses-0.537
TokenES
Token ID1546
Feature activation+8.60e-02

Pos logit contributions
OPLE+0.510
POSE+0.467
ンジ+0.447
+0.437
iets+0.435
Neg logit contributions
osal-0.643
obbies-0.571
hedral-0.569
paces-0.566
heses-0.526
Token Phase
Token ID18983
Feature activation+1.48e-02

Pos logit contributions
OPLE+0.985
POSE+0.908
ンジ+0.878
iets+0.856
+0.856
Neg logit contributions
osal-0.981
obbies-0.856
hedral-0.855
paces-0.846
heses-0.779
Token 1
Token ID352
Feature activation-3.23e-01

Pos logit contributions
OPLE+0.169
POSE+0.156
ンジ+0.151
iets+0.147
+0.147
Neg logit contributions
osal-0.169
obbies-0.147
paces-0.147
hedral-0.146
heses-0.134
Token:
Token ID25
Feature activation-9.03e-02

Pos logit contributions
osal+3.482
paces+2.962
hedral+2.950
obbies+2.945
heses+2.712
Neg logit contributions
OPLE-3.821
POSE-3.567
ンジ-3.488
-3.451
puter-3.386
Token No
Token ID1400
Feature activation-1.35e+00

Pos logit contributions
osal+1.006
obbies+0.867
paces+0.866
hedral+0.864
heses+0.789
Neg logit contributions
OPLE-1.054
POSE-0.977
ンジ-0.950
-0.930
iets-0.924
Token infections
Token ID16079
Feature activation-3.73e-01

Pos logit contributions
osal+12.711
paces+10.072
obbies+9.692
idding+9.683
hedral+9.669
Neg logit contributions
OPLE-13.644
POSE-13.220
ンジ-12.314
puter-12.218
-12.130
Token in
Token ID287
Feature activation+9.20e-02

Pos logit contributions
osal+4.010
paces+3.493
obbies+3.358
hedral+3.328
idding+3.100
Neg logit contributions
OPLE-4.440
POSE-4.111
ンジ-4.060
-4.003
puter-3.943
Token humans
Token ID5384
Feature activation+8.88e-02

Pos logit contributions
OPLE+1.064
POSE+0.981
ンジ+0.950
iets+0.930
+0.929
Neg logit contributions
osal-1.033
obbies-0.902
hedral-0.898
paces-0.891
heses-0.821
Token are
Token ID389
Feature activation-5.08e-01

Pos logit contributions
OPLE+1.014
POSE+0.934
ンジ+0.903
iets+0.883
================================================================+0.879
Neg logit contributions
osal-1.009
obbies-0.884
hedral-0.881
paces-0.870
heses-0.803
Token being
Token ID852
Feature activation-2.52e-01

Pos logit contributions
osal+4.884
paces+4.190
hedral+4.177
obbies+3.991
idding+3.762
Neg logit contributions
OPLE-6.500
POSE-5.953
ンジ-5.810
iets-5.747
-5.737
Token https
Token ID3740
Feature activation+1.22e-01

Pos logit contributions
osal+0.370
obbies+0.329
hedral+0.329
paces+0.328
heses+0.306
Neg logit contributions
OPLE-0.252
POSE-0.228
ンジ-0.220
-0.213
iets-0.213
Token://
Token ID1378
Feature activation+6.49e-02

Pos logit contributions
OPLE+1.195
POSE+1.106
ンジ+1.051
================================================================+1.040
pointer+1.026
Neg logit contributions
osal-1.568
obbies-1.402
hedral-1.386
paces-1.381
heses-1.295
Tokenwww
Token ID2503
Feature activation-3.13e-01

Pos logit contributions
OPLE+0.338
POSE+0.284
ンジ+0.263
pointer+0.252
+0.248
Neg logit contributions
osal-1.137
obbies-1.039
paces-1.038
hedral-1.036
heses-0.985
Token.
Token ID13
Feature activation-6.60e-01

Pos logit contributions
osal+2.914
paces+2.491
hedral+2.428
obbies+2.359
gey+2.182
Neg logit contributions
OPLE-4.250
ンジ-3.883
POSE-3.846
puter-3.814
iets-3.759
Tokenbel
Token ID6667
Feature activation-8.71e-01

Pos logit contributions
osal+5.099
hedral+4.427
obbies+4.155
paces+4.082
auld+3.840
Neg logit contributions
OPLE-9.741
POSE-9.238
ンジ-9.034
HCR-8.734
-8.704
Tokenfast
Token ID7217
Feature activation-1.31e+00

Pos logit contributions
osal+5.300
hedral+5.023
paces+4.593
gey+4.000
obbies+3.984
Neg logit contributions
OPLE-13.412
POSE-12.641
ンジ-12.435
-11.929
-11.816
Tokente
Token ID660
Feature activation-4.98e-01

Pos logit contributions
osal+3.261
hedral+1.627
te+0.749
paces+0.742
auld+0.472
Neg logit contributions
OPLE-24.027
ADRA-23.236
ンジ-23.080
POSE-22.805
HCR-22.160
Tokenlegraph
Token ID16606
Feature activation-4.15e-01

Pos logit contributions
osal+5.213
hedral+4.512
paces+4.407
obbies+4.322
auld+4.136
Neg logit contributions
OPLE-6.117
ンジ-5.732
POSE-5.686
-5.317
HCR-5.279
Token.
Token ID13
Feature activation-4.17e-01

Pos logit contributions
osal+3.767
paces+3.136
hedral+3.061
obbies+2.987
gey+2.740
Neg logit contributions
OPLE-5.707
ンジ-5.292
POSE-5.214
puter-5.092
-5.049
Tokenco
Token ID1073
Feature activation-4.46e-01

Pos logit contributions
osal+3.362
hedral+2.707
paces+2.663
obbies+2.608
gey+2.398
Neg logit contributions
OPLE-6.149
POSE-5.743
ンジ-5.626
-5.544
iets-5.486
Token.
Token ID13
Feature activation-4.09e-01

Pos logit contributions
osal+3.877
paces+3.224
hedral+3.212
obbies+3.059
gey+2.884
Neg logit contributions
OPLE-6.228
ンジ-5.778
POSE-5.652
puter-5.590
-5.562
Token book
Token ID1492
Feature activation+1.98e-01

Pos logit contributions
osal+3.309
paces+2.698
hedral+2.685
obbies+2.677
gey+2.478
Neg logit contributions
OPLE-4.868
POSE-4.437
iets-4.314
ンジ-4.311
puter-4.247
Token,
Token ID11
Feature activation+2.20e-01

Pos logit contributions
OPLE+2.159
POSE+2.010
ンジ+1.958
pointer+1.917
+1.916
Neg logit contributions
osal-2.328
obbies-2.005
hedral-1.997
paces-1.985
idding-1.861
Token Re
Token ID797
Feature activation+4.57e-02

Pos logit contributions
OPLE+2.170
POSE+2.013
ンジ+1.904
+1.890
================================================================+1.890
Neg logit contributions
osal-2.856
hedral-2.514
paces-2.510
obbies-2.506
heses-2.303
Tokenclaim
Token ID6604
Feature activation-1.86e-01

Pos logit contributions
OPLE+0.646
POSE+0.609
ンジ+0.592
+0.581
pointer+0.580
Neg logit contributions
osal-0.398
obbies-0.331
hedral-0.329
paces-0.328
heses-0.288
Tokening
Token ID278
Feature activation-7.01e-01

Pos logit contributions
osal+2.106
obbies+1.814
paces+1.812
hedral+1.810
idding+1.672
Neg logit contributions
OPLE-2.162
POSE-1.966
ンジ-1.896
iets-1.870
puter-1.848
Token the
Token ID262
Feature activation-1.31e+00

Pos logit contributions
osal+5.595
paces+5.190
hedral+4.953
obbies+4.622
gey+4.313
Neg logit contributions
OPLE-9.687
iets-8.892
POSE-8.839
puter-8.668
HCR-8.625
Token American
Token ID1605
Feature activation-1.05e+00

Pos logit contributions
osal+6.697
paces+6.280
hedral+5.898
 middle+5.559
heses+5.413
Neg logit contributions
OPLE-19.782
HCR-18.245
iets-17.887
POSE-17.454
puter-17.381
Token Right
Token ID6498
Feature activation-4.10e-01

Pos logit contributions
osal+4.858
hedral+4.642
paces+4.604
heses+4.314
obbies+4.030
Neg logit contributions
OPLE-17.224
iets-16.007
POSE-15.616
ンジ-15.090
HCR-15.030
Token:
Token ID25
Feature activation-4.23e-01

Pos logit contributions
osal+3.984
obbies+3.513
hedral+3.459
paces+3.382
heses+3.056
Neg logit contributions
OPLE-5.278
POSE-4.897
ンジ-4.798
puter-4.725
iets-4.652
Token The
Token ID383
Feature activation-8.98e-01

Pos logit contributions
osal+4.164
paces+3.629
hedral+3.461
obbies+3.426
heses+3.367
Neg logit contributions
OPLE-5.350
ンジ-4.968
POSE-4.930
iets-4.877
puter-4.853
Token Lost
Token ID9164
Feature activation-7.10e-01

Pos logit contributions
osal+4.436
paces+3.959
obbies+3.175
hedral+3.052
auld+2.796
Neg logit contributions
OPLE-14.320
iets-13.822
POSE-13.794
HCR-13.685
puter-13.589
Token frontline
Token ID44956
Feature activation+4.07e-01

Pos logit contributions
OPLE+3.095
POSE+2.819
ンジ+2.742
+2.689
iets+2.667
Neg logit contributions
osal-3.237
hedral-2.820
obbies-2.817
paces-2.810
heses-2.609
Token to
Token ID284
Feature activation+2.21e-02

Pos logit contributions
OPLE+4.179
POSE+3.855
ンジ+3.714
+3.702
pling+3.670
Neg logit contributions
osal-4.863
obbies-4.291
hedral-4.222
paces-4.103
gey-3.892
Token serve
Token ID4691
Feature activation-1.24e-01

Pos logit contributions
OPLE+0.271
POSE+0.252
ンジ+0.245
+0.240
iets+0.239
Neg logit contributions
osal-0.234
obbies-0.201
hedral-0.201
paces-0.200
heses-0.182
Token as
Token ID355
Feature activation-2.64e-01

Pos logit contributions
osal+1.338
paces+1.153
obbies+1.152
hedral+1.148
heses+1.042
Neg logit contributions
OPLE-1.496
POSE-1.384
ンジ-1.351
puter-1.319
-1.319
Token "
Token ID366
Feature activation-5.28e-01

Pos logit contributions
osal+2.865
obbies+2.480
paces+2.460
hedral+2.458
gey+2.232
Neg logit contributions
OPLE-3.134
POSE-2.897
ンジ-2.835
puter-2.781
iets-2.766
Tokenc
Token ID66
Feature activation-1.30e+00

Pos logit contributions
osal+6.144
paces+5.508
obbies+5.419
hedral+5.355
gey+5.055
Neg logit contributions
OPLE-5.712
POSE-5.036
ンジ-4.859
iets-4.857
puter-4.710
Tokenannon
Token ID8825
Feature activation-6.53e-01

Pos logit contributions
osal+12.800
obbies+11.922
auld+10.673
hedral+10.252
ittens+10.130
Neg logit contributions
OPLE-14.252
ンジ-13.598
POSE-13.555
-13.359
-12.855
Token fodder
Token ID41579
Feature activation-2.99e-01

Pos logit contributions
osal+5.269
obbies+4.224
paces+4.192
hedral+4.150
auld+3.845
Neg logit contributions
OPLE-9.032
POSE-8.638
ンジ-8.339
puter-8.264
HCR-8.192
Token"
Token ID1
Feature activation-1.29e-01

Pos logit contributions
osal+2.979
obbies+2.561
paces+2.555
hedral+2.531
heses+2.308
Neg logit contributions
OPLE-3.784
POSE-3.544
ンジ-3.438
iets-3.360
-3.333
Token to
Token ID284
Feature activation-2.20e-01

Pos logit contributions
osal+1.375
paces+1.182
obbies+1.181
hedral+1.179
heses+1.073
Neg logit contributions
OPLE-1.580
POSE-1.465
ンジ-1.426
puter-1.389
-1.389
Token protect
Token ID1805
Feature activation-3.00e-01

Pos logit contributions
osal+2.385
obbies+2.052
hedral+2.049
paces+2.036
heses+1.853
Neg logit contributions
OPLE-2.605
POSE-2.426
ンジ-2.356
puter-2.318
-2.313
Token can
Token ID460
Feature activation-1.83e-01

Pos logit contributions
osal+4.433
paces+3.572
hedral+3.396
obbies+3.254
erville+3.165
Neg logit contributions
OPLE-7.718
ンジ-7.334
POSE-7.269
-7.098
pointer-7.047
Token buy
Token ID2822
Feature activation+5.92e-01

Pos logit contributions
osal+2.245
obbies+1.961
paces+1.956
hedral+1.925
idding+1.799
Neg logit contributions
OPLE-1.926
POSE-1.793
ンジ-1.732
iets-1.682
puter-1.671
Token An
Token ID1052
Feature activation-8.30e-01

Pos logit contributions
OPLE+3.220
================================================================+2.725
POSE+2.689
ンジ+2.531
pointer+2.421
Neg logit contributions
osal-9.852
obbies-9.348
hedral-9.003
paces-8.849
auld-8.478
Token Enemy
Token ID21785
Feature activation+8.45e-02

Pos logit contributions
osal+6.494
hedral+4.716
obbies+4.708
auld+4.409
paces+4.345
Neg logit contributions
OPLE-11.936
iets-11.503
POSE-11.315
puter-11.306
ンジ-11.101
Token of
Token ID286
Feature activation-8.36e-01

Pos logit contributions
OPLE+0.955
POSE+0.891
ンジ+0.873
+0.851
================================================================+0.847
Neg logit contributions
osal-0.954
paces-0.831
hedral-0.831
obbies-0.830
heses-0.754
Token the
Token ID262
Feature activation-1.28e+00

Pos logit contributions
osal+6.010
paces+5.616
obbies+5.248
hedral+5.218
idding+4.958
Neg logit contributions
OPLE-12.428
puter-11.189
POSE-10.926
iets-10.587
pointer-10.443
Token State
Token ID1812
Feature activation+8.03e-02

Pos logit contributions
paces+7.188
osal+7.005
obbies+6.527
 Whole+6.206
hedral+6.168
Neg logit contributions
OPLE-17.608
puter-17.280
iets-16.443
POSE-16.396
HCR-16.297
Token:
Token ID25
Feature activation-2.73e-01

Pos logit contributions
OPLE+0.874
POSE+0.805
ンジ+0.787
+0.765
================================================================+0.765
Neg logit contributions
osal-0.951
obbies-0.834
hedral-0.827
paces-0.825
heses-0.764
Token The
Token ID383
Feature activation-1.43e+00

Pos logit contributions
osal+2.916
paces+2.488
obbies+2.424
hedral+2.423
heses+2.291
Neg logit contributions
OPLE-3.353
POSE-3.030
ンジ-2.995
puter-2.967
iets-2.965
Token Life
Token ID5155
Feature activation-3.53e-01

Pos logit contributions
osal+6.695
paces+5.297
hedral+4.997
 Modern+4.579
obbies+4.417
Neg logit contributions
OPLE-21.728
iets-21.641
puter-20.183
HCR-20.120
phia-20.033
Token of
Token ID286
Feature activation-3.45e-01

Pos logit contributions
osal+3.351
obbies+2.813
paces+2.761
hedral+2.699
auld+2.499
Neg logit contributions
OPLE-4.756
POSE-4.438
iets-4.295
puter-4.291
ンジ-4.257
Token months
Token ID1933
Feature activation-5.77e-02

Pos logit contributions
OPLE+0.573
POSE+0.525
ンジ+0.509
+0.498
iets+0.498
Neg logit contributions
osal-0.587
obbies-0.512
hedral-0.510
paces-0.508
heses-0.467
Token from
Token ID422
Feature activation-1.75e-01

Pos logit contributions
osal+0.621
paces+0.534
obbies+0.534
hedral+0.532
heses+0.485
Neg logit contributions
OPLE-0.698
POSE-0.647
ンジ-0.630
-0.615
iets-0.612
Token the
Token ID262
Feature activation-2.21e-01

Pos logit contributions
osal+1.881
obbies+1.608
hedral+1.603
paces+1.599
heses+1.451
Neg logit contributions
OPLE-2.117
POSE-1.968
ンジ-1.924
-1.877
puter-1.855
Token M
Token ID337
Feature activation-2.28e-01

Pos logit contributions
osal+2.380
paces+2.026
hedral+2.025
obbies+2.024
idding+1.836
Neg logit contributions
OPLE-2.648
POSE-2.456
ンジ-2.394
-2.340
puter-2.325
Tokenauna
Token ID32837
Feature activation-8.87e-01

Pos logit contributions
osal+2.781
obbies+2.422
hedral+2.411
paces+2.409
heses+2.217
Neg logit contributions
OPLE-2.404
POSE-2.229
ンジ-2.180
-2.109
puter-2.081
Token Lo
Token ID6706
Feature activation-1.27e+00

Pos logit contributions
osal+6.946
obbies+5.839
hedral+5.771
paces+5.691
gey+5.504
Neg logit contributions
OPLE-12.075
POSE-11.433
puter-11.415
-11.017
pointer-10.970
Tokena
Token ID64
Feature activation-1.76e-01

Pos logit contributions
osal+8.306
ogie+6.450
ï+6.146
gey+5.868
paces+5.314
Neg logit contributions
OPLE-19.252
-18.021
 Pwr-17.613
ンジ-17.526
puter-17.329
Token series
Token ID2168
Feature activation-4.91e-02

Pos logit contributions
osal+1.962
hedral+1.693
paces+1.683
obbies+1.680
heses+1.540
Neg logit contributions
OPLE-2.059
POSE-1.898
ンジ-1.847
-1.808
puter-1.795
Token of
Token ID286
Feature activation-2.08e-01

Pos logit contributions
osal+0.532
obbies+0.457
paces+0.457
hedral+0.456
heses+0.415
Neg logit contributions
OPLE-0.590
POSE-0.547
ンジ-0.532
-0.519
iets-0.517
Token [
Token ID685
Feature activation-2.86e-01

Pos logit contributions
osal+2.273
hedral+1.957
obbies+1.950
paces+1.938
heses+1.764
Neg logit contributions
OPLE-2.476
POSE-2.290
ンジ-2.219
-2.177
puter-2.162
TokenCO
Token ID8220
Feature activation-1.93e-01

Pos logit contributions
osal+3.153
obbies+2.679
paces+2.670
hedral+2.661
heses+2.427
Neg logit contributions
OPLE-3.352
POSE-3.075
ンジ-3.012
-2.958
iets-2.947
Token writer
Token ID6260
Feature activation-2.36e-01

Pos logit contributions
osal+1.852
obbies+1.578
hedral+1.570
paces+1.563
idding+1.427
Neg logit contributions
OPLE-1.997
POSE-1.853
ンジ-1.806
iets-1.766
-1.757
Token who
Token ID508
Feature activation-6.66e-02

Pos logit contributions
osal+2.654
paces+2.310
hedral+2.299
obbies+2.291
heses+2.094
Neg logit contributions
OPLE-2.734
POSE-2.527
ンジ-2.445
-2.375
pointer-2.372
Token qu
Token ID627
Feature activation+4.75e-01

Pos logit contributions
osal+0.750
obbies+0.650
paces+0.648
hedral+0.647
heses+0.592
Neg logit contributions
OPLE-0.773
POSE-0.715
ンジ-0.692
-0.675
pointer-0.672
Tokenits
Token ID896
Feature activation-5.99e-01

Pos logit contributions
iets+4.325
OPLE+4.287
POSE+3.881
oner+3.841
pointer+3.789
Neg logit contributions
osal-5.918
hedral-5.359
paces-5.349
obbies-5.124
gey-4.954
Token his
Token ID465
Feature activation-4.96e-01

Pos logit contributions
osal+5.682
hedral+5.115
obbies+5.012
paces+4.971
heses+4.526
Neg logit contributions
OPLE-7.238
POSE-6.794
iets-6.618
ンジ-6.586
pointer-6.431
Token pro
Token ID386
Feature activation-1.25e+00

Pos logit contributions
osal+5.070
obbies+4.472
hedral+4.340
paces+4.297
idding+3.945
Neg logit contributions
OPLE-5.956
POSE-5.495
ンジ-5.359
iets-5.346
pointer-5.247
Tokendig
Token ID12894
Feature activation-3.32e-01

Pos logit contributions
osal+11.619
hedral+10.747
obbies+9.993
idding+9.914
paces+9.752
Neg logit contributions
OPLE-13.251
POSE-11.806
iets-11.485
ンジ-11.477
-11.414
Tokeniously
Token ID6819
Feature activation-1.55e-01

Pos logit contributions
osal+3.000
hedral+2.419
obbies+2.407
paces+2.337
heses+2.190
Neg logit contributions
OPLE-4.713
POSE-4.351
ンジ-4.278
-4.183
pointer-4.134
Token paying
Token ID5989
Feature activation+1.57e-01

Pos logit contributions
osal+1.702
obbies+1.476
hedral+1.474
paces+1.466
heses+1.340
Neg logit contributions
OPLE-1.821
POSE-1.680
ンジ-1.630
iets-1.605
-1.591
Token job
Token ID1693
Feature activation-3.33e-01

Pos logit contributions
OPLE+1.767
POSE+1.635
ンジ+1.576
+1.545
pling+1.535
Neg logit contributions
osal-1.816
obbies-1.569
paces-1.568
hedral-1.566
heses-1.445
Token to
Token ID284
Feature activation+9.69e-02

Pos logit contributions
osal+3.347
paces+2.895
hedral+2.881
obbies+2.836
idding+2.579
Neg logit contributions
OPLE-4.187
POSE-3.882
ンジ-3.800
iets-3.690
-3.669
Token Stars
Token ID10271
Feature activation-1.37e-01

Pos logit contributions
osal+1.898
hedral+1.645
obbies+1.643
paces+1.632
heses+1.502
Neg logit contributions
OPLE-1.941
POSE-1.799
ンジ-1.731
-1.707
puter-1.690
Token
Token ID447
Feature activation+5.03e-02

Pos logit contributions
osal+1.452
hedral+1.252
paces+1.250
obbies+1.250
heses+1.133
Neg logit contributions
OPLE-1.675
POSE-1.545
ンジ-1.494
-1.462
pointer-1.459
Token
Token ID247
Feature activation-2.97e-01

Pos logit contributions
OPLE+0.559
POSE+0.518
ンジ+0.503
+0.491
iets+0.489
Neg logit contributions
osal-0.586
obbies-0.515
paces-0.510
hedral-0.508
heses-0.469
Token 1999
Token ID7358
Feature activation-3.72e-01

Pos logit contributions
osal+2.971
obbies+2.549
hedral+2.536
paces+2.490
heses+2.298
Neg logit contributions
OPLE-3.812
POSE-3.518
ンジ-3.407
-3.331
pointer-3.315
Token Stanley
Token ID14167
Feature activation-4.04e-01

Pos logit contributions
osal+4.107
obbies+3.552
hedral+3.529
paces+3.468
heses+3.209
Neg logit contributions
OPLE-4.395
POSE-4.020
ンジ-3.809
puter-3.784
pointer-3.782
Token Cup
Token ID5454
Feature activation-1.24e+00

Pos logit contributions
osal+4.236
hedral+3.720
paces+3.610
obbies+3.600
heses+3.339
Neg logit contributions
OPLE-5.006
POSE-4.498
ンジ-4.438
puter-4.339
-4.282
Token winner
Token ID8464
Feature activation-6.62e-01

Pos logit contributions
osal+9.825
hedral+8.511
obbies+7.865
paces+7.844
idding+7.354
Neg logit contributions
OPLE-15.340
POSE-14.099
oner-13.440
puter-13.402
ンジ-13.313
Token,
Token ID11
Feature activation-2.24e-01

Pos logit contributions
osal+6.440
hedral+5.605
paces+5.538
obbies+5.386
heses+4.979
Neg logit contributions
OPLE-8.210
POSE-7.628
ンジ-7.307
pointer-7.268
-7.184
Token then
Token ID788
Feature activation-1.61e-01

Pos logit contributions
osal+2.375
hedral+2.040
paces+2.025
obbies+2.020
heses+1.829
Neg logit contributions
OPLE-2.753
POSE-2.543
ンジ-2.471
iets-2.405
pointer-2.399
Token lost
Token ID2626
Feature activation-3.66e-03

Pos logit contributions
osal+1.777
hedral+1.546
obbies+1.534
paces+1.527
heses+1.393
Neg logit contributions
OPLE-1.895
POSE-1.744
ンジ-1.699
iets-1.654
pointer-1.652
Token out
Token ID503
Feature activation-1.34e-02

Pos logit contributions
osal+0.042
obbies+0.036
paces+0.036
hedral+0.036
heses+0.033
Neg logit contributions
OPLE-0.042
POSE-0.039
ンジ-0.038
-0.037
iets-0.037
Token authorities
Token ID4773
Feature activation-2.85e-02

Pos logit contributions
osal+1.909
obbies+1.654
hedral+1.633
paces+1.631
heses+1.494
Neg logit contributions
OPLE-2.056
POSE-1.881
ンジ-1.822
-1.782
================================================================-1.779
Token to
Token ID284
Feature activation-3.93e-01

Pos logit contributions
osal+0.309
obbies+0.266
hedral+0.266
paces+0.266
heses+0.242
Neg logit contributions
OPLE-0.342
POSE-0.317
ンジ-0.308
-0.301
iets-0.300
Token years
Token ID812
Feature activation+3.13e-02

Pos logit contributions
osal+4.043
hedral+3.457
obbies+3.394
paces+3.376
idding+3.097
Neg logit contributions
OPLE-4.833
POSE-4.484
ンジ-4.333
pointer-4.284
-4.274
Token of
Token ID286
Feature activation-2.23e-01

Pos logit contributions
OPLE+0.368
POSE+0.341
ンジ+0.332
+0.324
iets+0.324
Neg logit contributions
osal-0.346
obbies-0.300
hedral-0.298
paces-0.297
heses-0.273
Token below
Token ID2174
Feature activation-8.33e-01

Pos logit contributions
osal+2.425
hedral+2.084
obbies+2.075
paces+2.057
gey+1.879
Neg logit contributions
OPLE-2.654
POSE-2.463
ンジ-2.385
pointer-2.333
iets-2.328
Token-
Token ID12
Feature activation-1.23e+00

Pos logit contributions
osal+7.573
hedral+5.938
paces+5.919
obbies+5.845
idding+5.500
Neg logit contributions
OPLE-10.918
POSE-10.124
ンジ-9.750
================================================================-9.729
pointer-9.521
Tokenaverage
Token ID23913
Feature activation-4.78e-01

Pos logit contributions
osal+10.586
paces+8.831
hedral+8.463
obbies+7.888
idding+7.801
Neg logit contributions
OPLE-15.139
POSE-13.727
iets-12.905
================================================================-12.878
 Pwr-12.807
Token precipitation
Token ID32025
Feature activation+2.04e-02

Pos logit contributions
osal+5.177
hedral+4.328
obbies+4.288
paces+4.271
idding+4.028
Neg logit contributions
OPLE-5.589
POSE-5.183
ンジ-4.925
pointer-4.914
puter-4.899
Token
Token ID784
Feature activation+4.59e-02

Pos logit contributions
OPLE+0.240
POSE+0.222
ンジ+0.217
+0.212
iets+0.211
Neg logit contributions
osal-0.225
obbies-0.195
hedral-0.194
paces-0.193
heses-0.177
Token are
Token ID389
Feature activation-2.83e-01

Pos logit contributions
OPLE+0.543
POSE+0.503
ンジ+0.490
+0.479
iets+0.479
Neg logit contributions
osal-0.503
obbies-0.435
hedral-0.433
paces-0.431
heses-0.394
Token unquestion
Token ID38766
Feature activation-1.89e-02

Pos logit contributions
osal+2.937
hedral+2.548
paces+2.532
obbies+2.460
heses+2.276
Neg logit contributions
OPLE-3.494
POSE-3.224
ンジ-3.180
-3.076
pointer-3.071
Token And
Token ID843
Feature activation-5.20e-02

Pos logit contributions
OPLE+0.088
POSE+0.082
ンジ+0.079
+0.077
iets+0.077
Neg logit contributions
osal-0.087
obbies-0.076
hedral-0.075
paces-0.075
heses-0.069
Token there
Token ID612
Feature activation-1.02e-01

Pos logit contributions
osal+0.588
obbies+0.510
paces+0.509
hedral+0.508
heses+0.466
Neg logit contributions
OPLE-0.600
POSE-0.554
ンジ-0.539
-0.526
iets-0.525
Token
Token ID447
Feature activation+1.49e-01

Pos logit contributions
osal+1.066
obbies+0.909
hedral+0.906
paces+0.906
heses+0.825
Neg logit contributions
OPLE-1.261
POSE-1.172
ンジ-1.142
-1.116
iets-1.111
Token
Token ID247
Feature activation+1.91e-01

Pos logit contributions
OPLE+1.594
POSE+1.481
ンジ+1.438
+1.413
================================================================+1.406
Neg logit contributions
osal-1.779
obbies-1.576
paces-1.552
hedral-1.548
heses-1.431
Tokens
Token ID82
Feature activation-1.07e-01

Pos logit contributions
OPLE+2.345
POSE+2.204
ンジ+2.131
iets+2.106
+2.106
Neg logit contributions
osal-1.940
obbies-1.685
paces-1.682
hedral-1.671
heses-1.511
Token no
Token ID645
Feature activation-1.22e+00

Pos logit contributions
osal+1.198
paces+1.035
obbies+1.032
hedral+1.030
heses+0.943
Neg logit contributions
OPLE-1.246
POSE-1.151
ンジ-1.121
-1.096
iets-1.094
Token shortage
Token ID18772
Feature activation-2.62e-01

Pos logit contributions
osal+10.527
paces+8.607
idding+8.502
gey+8.248
 WAY+8.167
Neg logit contributions
OPLE-13.814
POSE-12.812
ンジ-12.687
iets-12.538
pointer-12.245
Token of
Token ID286
Feature activation-3.40e-01

Pos logit contributions
osal+2.601
paces+2.248
hedral+2.206
obbies+2.173
idding+1.964
Neg logit contributions
OPLE-3.403
POSE-3.113
ンジ-3.032
puter-3.003
iets-2.974
Token high
Token ID1029
Feature activation-4.53e-01

Pos logit contributions
osal+3.586
paces+3.072
hedral+3.050
obbies+3.035
heses+2.763
Neg logit contributions
OPLE-4.090
POSE-3.786
ンジ-3.726
iets-3.660
-3.648
Token-
Token ID12
Feature activation-5.17e-01

Pos logit contributions
osal+4.215
hedral+3.517
paces+3.507
obbies+3.394
heses+3.131
Neg logit contributions
OPLE-6.031
POSE-5.610
iets-5.452
ンジ-5.372
-5.286
Tokentech
Token ID13670
Feature activation+1.25e-02

Pos logit contributions
osal+4.929
paces+4.310
hedral+4.129
obbies+4.117
idding+3.737
Neg logit contributions
OPLE-6.792
POSE-6.170
ンジ-5.976
iets-5.912
-5.850
Token duration
Token ID9478
Feature activation-3.27e-01

Pos logit contributions
osal+2.829
hedral+2.449
paces+2.432
obbies+2.380
gey+2.131
Neg logit contributions
OPLE-3.909
POSE-3.604
ンジ-3.523
-3.472
puter-3.469
Token of
Token ID286
Feature activation+5.38e-02

Pos logit contributions
osal+3.268
paces+2.838
hedral+2.822
obbies+2.741
auld+2.515
Neg logit contributions
OPLE-4.179
POSE-3.812
ンジ-3.698
iets-3.651
-3.626
Token the
Token ID262
Feature activation-2.08e-02

Pos logit contributions
OPLE+0.614
POSE+0.571
ンジ+0.553
+0.540
iets+0.537
Neg logit contributions
osal-0.612
obbies-0.533
paces-0.529
hedral-0.527
heses-0.486
Token game
Token ID983
Feature activation-6.81e-01

Pos logit contributions
osal+0.228
obbies+0.196
hedral+0.196
paces+0.196
heses+0.178
Neg logit contributions
OPLE-0.249
POSE-0.230
ンジ-0.224
-0.219
iets-0.218
Token chasing
Token ID20023
Feature activation-4.94e-01

Pos logit contributions
osal+6.638
paces+5.977
hedral+5.856
obbies+5.446
auld+5.260
Neg logit contributions
OPLE-8.267
POSE-7.434
ンジ-7.207
iets-7.129
-7.109
Token one
Token ID530
Feature activation-1.21e+00

Pos logit contributions
osal+4.952
hedral+4.384
paces+4.268
obbies+4.096
auld+3.804
Neg logit contributions
OPLE-6.094
POSE-5.560
ンジ-5.443
-5.321
puter-5.309
Token another
Token ID1194
Feature activation-5.86e-01

Pos logit contributions
osal+7.521
hedral+6.710
gey+5.616
paces+5.567
ogie+5.445
Neg logit contributions
OPLE-17.125
ンジ-15.282
HCR-15.214
================================================================-15.168
iets-15.060
Token up
Token ID510
Feature activation+1.96e-01

Pos logit contributions
osal+5.377
paces+4.639
hedral+4.560
obbies+4.311
gey+4.114
Neg logit contributions
OPLE-7.816
POSE-7.001
ンジ-6.851
-6.719
iets-6.622
Token and
Token ID290
Feature activation+1.71e-01

Pos logit contributions
OPLE+2.181
POSE+2.040
ンジ+1.989
puter+1.936
+1.934
Neg logit contributions
osal-2.249
obbies-1.968
paces-1.917
hedral-1.911
heses-1.777
Token down
Token ID866
Feature activation+1.39e-01

Pos logit contributions
OPLE+1.992
POSE+1.856
ンジ+1.808
+1.763
iets+1.758
Neg logit contributions
osal-1.895
obbies-1.657
hedral-1.613
paces-1.612
heses-1.491
Token the
Token ID262
Feature activation+8.80e-02

Pos logit contributions
OPLE+1.534
POSE+1.431
ンジ+1.385
puter+1.347
+1.343
Neg logit contributions
osal-1.608
obbies-1.407
paces-1.387
hedral-1.376
heses-1.272
Token The
Token ID383
Feature activation+1.63e-01

Pos logit contributions
OPLE+3.442
POSE+3.160
ンジ+3.024
================================================================+2.985
iets+2.970
Neg logit contributions
osal-4.179
hedral-3.692
obbies-3.683
paces-3.541
idding-3.317
Token woman
Token ID2415
Feature activation-1.92e-01

Pos logit contributions
OPLE+1.951
POSE+1.797
ンジ+1.742
pointer+1.713
iets+1.697
Neg logit contributions
osal-1.777
obbies-1.529
hedral-1.527
paces-1.522
heses-1.387
Token then
Token ID788
Feature activation-2.54e-02

Pos logit contributions
osal+1.993
paces+1.724
obbies+1.709
hedral+1.705
heses+1.545
Neg logit contributions
OPLE-2.374
POSE-2.211
ンジ-2.172
-2.113
================================================================-2.098
Token appears
Token ID3568
Feature activation-2.01e-01

Pos logit contributions
osal+0.275
obbies+0.237
paces+0.236
hedral+0.236
heses+0.215
Neg logit contributions
OPLE-0.305
POSE-0.283
ンジ-0.275
-0.269
iets-0.268
Token to
Token ID284
Feature activation+3.16e-01

Pos logit contributions
osal+2.051
paces+1.773
obbies+1.772
hedral+1.762
gey+1.590
Neg logit contributions
OPLE-2.494
POSE-2.304
ンジ-2.280
-2.230
iets-2.224
Token kicks
Token ID17364
Feature activation+6.65e-01

Pos logit contributions
OPLE+3.406
POSE+3.132
pling+2.982
ンジ+2.965
puter+2.947
Neg logit contributions
osal-3.752
obbies-3.278
hedral-3.207
paces-3.183
heses-2.988
Token him
Token ID683
Feature activation+4.74e-02

Pos logit contributions
OPLE+5.716
POSE+5.411
ンジ+4.964
iets+4.960
pointer+4.931
Neg logit contributions
osal-8.723
obbies-7.675
hedral-7.566
paces-7.390
idding-6.995
Token twice
Token ID5403
Feature activation-2.44e-01

Pos logit contributions
OPLE+0.548
POSE+0.509
ンジ+0.494
+0.480
iets+0.479
Neg logit contributions
osal-0.533
obbies-0.462
hedral-0.461
paces-0.457
heses-0.421
Token,
Token ID11
Feature activation-4.06e-02

Pos logit contributions
osal+2.429
paces+2.103
obbies+2.063
hedral+2.052
idding+1.851
Neg logit contributions
OPLE-3.120
POSE-2.872
ンジ-2.812
-2.776
iets-2.763
Token and
Token ID290
Feature activation+2.78e-01

Pos logit contributions
osal+0.449
obbies+0.388
paces+0.386
hedral+0.386
heses+0.353
Neg logit contributions
OPLE-0.478
POSE-0.442
ンジ-0.430
-0.420
iets-0.419
Token he
Token ID339
Feature activation+2.58e-01

Pos logit contributions
OPLE+2.902
POSE+2.660
ンジ+2.539
iets+2.525
pointer+2.504
Neg logit contributions
osal-3.420
obbies-2.960
hedral-2.953
paces-2.919
idding-2.691
Token *
Token ID1635
Feature activation+2.87e-01

Pos logit contributions
osal+2.346
paces+1.998
hedral+1.981
obbies+1.978
heses+1.786
Neg logit contributions
OPLE-2.795
POSE-2.583
ンジ-2.535
-2.477
iets-2.464
Token Get
Token ID3497
Feature activation+5.60e-02

Pos logit contributions
OPLE+2.832
POSE+2.570
ンジ+2.497
================================================================+2.448
iets+2.420
Neg logit contributions
osal-3.667
obbies-3.240
paces-3.212
hedral-3.195
heses-3.013
TokenVolume
Token ID31715
Feature activation-1.32e-01

Pos logit contributions
OPLE+0.539
POSE+0.491
ンジ+0.475
+0.459
iets+0.457
Neg logit contributions
osal-0.739
obbies-0.657
paces-0.653
hedral-0.650
heses-0.607
Token()
Token ID3419
Feature activation+3.19e-01

Pos logit contributions
osal+1.432
paces+1.234
hedral+1.234
obbies+1.232
heses+1.111
Neg logit contributions
OPLE-1.578
POSE-1.454
ンジ-1.423
-1.388
iets-1.388
Token ;
Token ID2162
Feature activation+5.50e-01

Pos logit contributions
OPLE+3.386
POSE+3.138
ンジ+2.988
================================================================+2.986
iets+2.942
Neg logit contributions
osal-3.786
obbies-3.367
hedral-3.326
paces-3.273
heses-3.084
Token return
Token ID1441
Feature activation+7.93e-01

Pos logit contributions
OPLE+5.598
POSE+5.252
================================================================+5.131
pointer+5.099
ンジ+5.078
Neg logit contributions
osal-6.430
obbies-5.607
hedral-5.595
paces-5.525
heses-5.140
Token f
Token ID277
Feature activation+5.03e-01

Pos logit contributions
OPLE+7.170
POSE+6.666
ンジ+6.494
================================================================+6.493
 Pwr+6.267
Neg logit contributions
osal-9.791
obbies-8.399
paces-8.366
hedral-8.243
heses-8.002
TokenMass
Token ID20273
Feature activation+1.40e-01

Pos logit contributions
OPLE+3.861
POSE+3.589
================================================================+3.342
+3.309
ンジ+3.303
Neg logit contributions
osal-7.196
obbies-6.473
paces-6.349
hedral-6.260
heses-6.090
Token ;
Token ID2162
Feature activation+2.30e-01

Pos logit contributions
OPLE+1.535
POSE+1.428
ンジ+1.374
+1.345
iets+1.344
Neg logit contributions
osal-1.634
obbies-1.434
hedral-1.425
paces-1.414
heses-1.313
Token }
Token ID1782
Feature activation+4.35e-01

Pos logit contributions
OPLE+2.287
POSE+2.092
ンジ+2.033
================================================================+1.982
pointer+1.976
Neg logit contributions
osal-2.929
obbies-2.589
hedral-2.578
paces-2.575
heses-2.376
Token
Token ID198
Feature activation-3.25e-01

Pos logit contributions
OPLE+4.623
POSE+4.233
================================================================+4.085
ンジ+4.080
pointer+4.065
Neg logit contributions
osal-5.101
obbies-4.526
paces-4.452
hedral-4.436
heses-4.083
Token Sen
Token ID2311
Feature activation+5.07e-01

Pos logit contributions
OPLE+2.129
POSE+1.969
ンジ+1.904
iets+1.847
+1.845
Neg logit contributions
osal-2.665
obbies-2.353
hedral-2.345
paces-2.328
heses-2.162
Tokenb
Token ID65
Feature activation-1.35e-01

Pos logit contributions
OPLE+2.955
iets+2.895
pointer+2.885
POSE+2.860
pling+2.858
Neg logit contributions
osal-7.729
hedral-6.978
obbies-6.953
paces-6.845
idding-6.555
Tokenong
Token ID506
Feature activation+5.32e-02

Pos logit contributions
osal+0.927
hedral+0.727
obbies+0.714
paces+0.714
heses+0.601
Neg logit contributions
OPLE-2.194
POSE-2.054
ンジ-2.034
-1.987
puter-1.978
Tokeni
Token ID72
Feature activation+3.10e-02

Pos logit contributions
OPLE+0.535
POSE+0.493
ンジ+0.475
iets+0.464
+0.461
Neg logit contributions
osal-0.673
obbies-0.596
hedral-0.593
paces-0.592
heses-0.550
Token and
Token ID290
Feature activation+3.27e-01

Pos logit contributions
OPLE+0.338
POSE+0.311
ンジ+0.303
+0.295
iets+0.294
Neg logit contributions
osal-0.370
obbies-0.324
hedral-0.322
paces-0.322
heses-0.296
Token Sh
Token ID911
Feature activation+7.36e-01

Pos logit contributions
OPLE+3.454
POSE+3.199
ンジ+3.079
+3.015
iets+3.002
Neg logit contributions
osal-3.887
obbies-3.418
paces-3.384
hedral-3.382
idding-3.125
Tokenus
Token ID385
Feature activation+2.79e-01

Pos logit contributions
OPLE+2.716
iets+2.634
POSE+2.480
oner+2.471
pling+2.465
Neg logit contributions
osal-11.830
hedral-11.452
paces-11.192
obbies-11.124
erville-10.729
Tokenaku
Token ID8719
Feature activation+5.26e-01

Pos logit contributions
OPLE+2.774
POSE+2.596
ンジ+2.486
iets+2.470
pointer+2.439
Neg logit contributions
osal-3.413
paces-3.009
obbies-2.996
hedral-2.994
heses-2.797
Token U
Token ID471
Feature activation+2.99e-02

Pos logit contributions
OPLE+4.526
POSE+4.043
ンジ+3.794
puter+3.686
pling+3.673
Neg logit contributions
osal-7.127
obbies-6.506
hedral-6.428
paces-6.368
heses-6.020
Tokench
Token ID354
Feature activation+1.28e-01

Pos logit contributions
OPLE+0.409
POSE+0.385
ンジ+0.375
+0.368
iets+0.367
Neg logit contributions
osal-0.272
obbies-0.228
hedral-0.226
paces-0.225
heses-0.202
Tokeniy
Token ID7745
Feature activation+5.64e-01

Pos logit contributions
OPLE+0.915
POSE+0.825
ンジ+0.778
iets+0.767
pointer+0.745
Neg logit contributions
osal-1.968
paces-1.781
obbies-1.778
hedral-1.770
heses-1.673
Token scientist
Token ID11444
Feature activation+2.72e-01

Pos logit contributions
OPLE+0.153
POSE+0.141
ンジ+0.137
+0.134
iets+0.133
Neg logit contributions
osal-0.148
obbies-0.128
hedral-0.128
paces-0.128
heses-0.117
Token/
Token ID14
Feature activation-1.57e-01

Pos logit contributions
OPLE+2.850
POSE+2.638
ンジ+2.564
puter+2.497
+2.486
Neg logit contributions
osal-3.250
obbies-2.888
hedral-2.851
paces-2.845
heses-2.647
TokenY
Token ID56
Feature activation+3.08e-01

Pos logit contributions
osal+1.788
obbies+1.552
hedral+1.543
paces+1.543
heses+1.410
Neg logit contributions
OPLE-1.795
POSE-1.646
ンジ-1.603
iets-1.565
-1.556
Token
Token ID447
Feature activation-8.76e-02

Pos logit contributions
OPLE+3.407
POSE+3.117
iets+2.978
pointer+2.912
pling+2.911
Neg logit contributions
osal-3.537
hedral-3.089
paces-3.070
obbies-3.048
heses-2.853
Token
Token ID247
Feature activation+3.16e-01

Pos logit contributions
osal+1.020
obbies+0.886
paces+0.886
hedral+0.885
heses+0.814
Neg logit contributions
OPLE-0.982
POSE-0.904
ンジ-0.877
iets-0.853
-0.853
Tokenall
Token ID439
Feature activation+7.79e-01

Pos logit contributions
OPLE+3.545
POSE+3.310
ンジ+3.214
puter+3.180
pointer+3.123
Neg logit contributions
osal-3.517
obbies-3.088
hedral-3.085
paces-3.072
heses-2.824
Token mother
Token ID2802
Feature activation+2.34e-01

Pos logit contributions
OPLE+8.096
ンジ+7.573
POSE+7.564
puter+7.228
oner+7.114
Neg logit contributions
osal-8.171
hedral-7.248
obbies-7.137
paces-7.111
heses-6.570
Tokenfuck
Token ID31699
Feature activation+2.50e-01

Pos logit contributions
OPLE+2.444
POSE+2.272
ンジ+2.210
puter+2.161
+2.153
Neg logit contributions
osal-2.810
obbies-2.495
hedral-2.467
paces-2.458
idding-2.252
Tokeners
Token ID364
Feature activation+3.14e-01

Pos logit contributions
OPLE+3.228
POSE+3.008
ンジ+2.937
pointer+2.881
iets+2.879
Neg logit contributions
osal-2.389
obbies-2.029
hedral-2.027
paces-2.017
heses-1.828
Token lying
Token ID9105
Feature activation+5.42e-02

Pos logit contributions
OPLE+3.321
POSE+3.083
ンジ+3.008
+2.912
puter+2.910
Neg logit contributions
osal-3.702
obbies-3.281
hedral-3.253
paces-3.210
heses-2.989
Token,
Token ID11
Feature activation+1.96e-01

Pos logit contributions
OPLE+0.614
POSE+0.566
ンジ+0.550
+0.536
iets+0.534
Neg logit contributions
osal-0.623
obbies-0.544
hedral-0.541
paces-0.541
heses-0.498
Token about
Token ID546
Feature activation-1.02e-01

Pos logit contributions
OPLE+0.861
POSE+0.794
ンジ+0.770
+0.752
iets+0.747
Neg logit contributions
osal-0.927
obbies-0.816
hedral-0.808
paces-0.807
heses-0.742
Token sex
Token ID1714
Feature activation+1.22e-01

Pos logit contributions
osal+1.155
hedral+1.001
obbies+1.000
paces+0.999
heses+0.916
Neg logit contributions
OPLE-1.177
POSE-1.084
ンジ-1.057
iets-1.030
-1.026
Token and
Token ID290
Feature activation-1.92e-01

Pos logit contributions
OPLE+1.407
POSE+1.300
ンジ+1.259
+1.244
puter+1.232
Neg logit contributions
osal-1.358
obbies-1.178
hedral-1.173
paces-1.169
heses-1.072
Token looked
Token ID3114
Feature activation-2.13e-01

Pos logit contributions
osal+1.990
paces+1.717
hedral+1.714
obbies+1.705
heses+1.554
Neg logit contributions
OPLE-2.375
POSE-2.189
ンジ-2.157
iets-2.098
puter-2.085
Token a
Token ID257
Feature activation+2.24e-01

Pos logit contributions
osal+2.278
paces+1.963
hedral+1.934
obbies+1.929
heses+1.763
Neg logit contributions
OPLE-2.577
POSE-2.380
ンジ-2.326
-2.255
puter-2.255
Token little
Token ID1310
Feature activation+8.94e-01

Pos logit contributions
OPLE+2.382
POSE+2.211
ンジ+2.136
+2.090
pling+2.079
Neg logit contributions
osal-2.657
obbies-2.369
hedral-2.351
paces-2.348
heses-2.169
Token shocked
Token ID11472
Feature activation-5.45e-02

Pos logit contributions
OPLE+8.764
POSE+8.458
oner+8.056
+7.943
lp+7.808
Neg logit contributions
osal-9.649
obbies-8.620
hedral-8.488
paces-8.234
heses-7.891
Token,
Token ID11
Feature activation-6.23e-02

Pos logit contributions
osal+0.611
obbies+0.528
paces+0.527
hedral+0.527
heses+0.481
Neg logit contributions
OPLE-0.636
POSE-0.588
ンジ-0.571
-0.557
iets-0.556
Token lol
Token ID19462
Feature activation+1.13e-03

Pos logit contributions
osal+0.702
obbies+0.608
paces+0.607
hedral+0.606
heses+0.555
Neg logit contributions
OPLE-0.723
POSE-0.666
ンジ-0.648
-0.632
iets-0.630
Token).
Token ID737
Feature activation+1.03e-03

Pos logit contributions
OPLE+0.013
POSE+0.012
ンジ+0.012
+0.011
iets+0.011
Neg logit contributions
osal-0.013
obbies-0.011
hedral-0.011
paces-0.011
heses-0.010
Token I
Token ID314
Feature activation+8.72e-02

Pos logit contributions
OPLE+0.011
POSE+0.010
ンジ+0.010
+0.010
iets+0.010
Neg logit contributions
osal-0.012
obbies-0.011
hedral-0.011
paces-0.011
heses-0.010
Token in
Token ID287
Feature activation+1.20e-01

Pos logit contributions
osal+0.613
obbies+0.527
paces+0.526
hedral+0.525
heses+0.478
Neg logit contributions
OPLE-0.681
POSE-0.631
ンジ-0.614
-0.600
iets-0.597
Token case
Token ID1339
Feature activation+4.20e-02

Pos logit contributions
OPLE+1.360
POSE+1.251
ンジ+1.206
+1.181
================================================================+1.178
Neg logit contributions
osal-1.374
obbies-1.197
paces-1.187
hedral-1.181
heses-1.087
Token the
Token ID262
Feature activation+2.35e-01

Pos logit contributions
OPLE+0.493
POSE+0.456
ンジ+0.443
+0.432
iets+0.431
Neg logit contributions
osal-0.466
obbies-0.404
hedral-0.403
paces-0.402
heses-0.367
Token individual
Token ID1981
Feature activation+3.55e-01

Pos logit contributions
OPLE+2.657
POSE+2.451
ンジ+2.359
puter+2.320
pointer+2.308
Neg logit contributions
osal-2.657
obbies-2.343
hedral-2.337
paces-2.325
heses-2.136
Token is
Token ID318
Feature activation+1.50e-02

Pos logit contributions
OPLE+3.908
POSE+3.633
puter+3.510
iets+3.493
+3.472
Neg logit contributions
osal-3.959
hedral-3.524
obbies-3.517
paces-3.445
heses-3.169
Token neither
Token ID6159
Feature activation+2.24e-01

Pos logit contributions
OPLE+0.177
POSE+0.164
ンジ+0.159
+0.155
iets+0.155
Neg logit contributions
osal-0.165
obbies-0.143
hedral-0.142
paces-0.142
heses-0.130
Token a
Token ID257
Feature activation+5.94e-01

Pos logit contributions
OPLE+2.497
POSE+2.301
ンジ+2.211
pointer+2.183
puter+2.166
Neg logit contributions
osal-2.605
obbies-2.291
hedral-2.250
paces-2.233
heses-2.073
Token UAE
Token ID29367
Feature activation-1.84e-01

Pos logit contributions
OPLE+5.827
POSE+5.294
ンジ+5.056
puter+5.042
HCR+4.934
Neg logit contributions
osal-7.238
hedral-6.562
obbies-6.470
paces-6.346
heses-6.038
Token citizen
Token ID9511
Feature activation-3.28e-02

Pos logit contributions
osal+2.037
obbies+1.761
hedral+1.752
paces+1.744
idding+1.602
Neg logit contributions
OPLE-2.140
POSE-1.985
ンジ-1.938
iets-1.891
-1.873
Token nor
Token ID4249
Feature activation+3.07e-01

Pos logit contributions
osal+0.351
obbies+0.301
paces+0.300
hedral+0.300
heses+0.273
Neg logit contributions
OPLE-0.399
POSE-0.371
ンジ-0.361
-0.352
iets-0.352
Token a
Token ID257
Feature activation+5.60e-01

Pos logit contributions
OPLE+3.278
POSE+2.962
================================================================+2.803
ンジ+2.798
puter+2.798
Neg logit contributions
osal-3.685
obbies-3.257
hedral-3.249
paces-3.195
heses-2.979
Token you
Token ID345
Feature activation-4.32e-02

Pos logit contributions
osal+1.242
obbies+1.087
paces+1.083
hedral+1.082
heses+0.993
Neg logit contributions
OPLE-1.161
POSE-1.068
ンジ-1.039
-1.012
iets-1.011
Token didn
Token ID1422
Feature activation+6.21e-02

Pos logit contributions
osal+0.465
obbies+0.400
paces+0.399
hedral+0.398
heses+0.362
Neg logit contributions
OPLE-0.522
POSE-0.484
ンジ-0.471
-0.459
iets-0.458
Token
Token ID447
Feature activation-2.43e-02

Pos logit contributions
OPLE+0.586
POSE+0.537
ンジ+0.518
+0.505
iets+0.500
Neg logit contributions
osal-0.827
obbies-0.737
hedral-0.732
paces-0.729
heses-0.682
Token
Token ID247
Feature activation-1.15e-01

Pos logit contributions
osal+0.271
obbies+0.235
hedral+0.234
paces+0.234
heses+0.214
Neg logit contributions
OPLE-0.283
POSE-0.262
ンジ-0.255
-0.248
iets-0.248
Tokent
Token ID83
Feature activation+8.97e-03

Pos logit contributions
osal+1.284
paces+1.105
obbies+1.104
hedral+1.104
heses+1.009
Neg logit contributions
OPLE-1.342
POSE-1.237
ンジ-1.205
-1.170
iets-1.168
Token get
Token ID651
Feature activation+1.61e-02

Pos logit contributions
OPLE+0.104
POSE+0.096
ンジ+0.094
+0.091
iets+0.091
Neg logit contributions
osal-0.100
obbies-0.087
hedral-0.087
paces-0.086
heses-0.079
Token a
Token ID257
Feature activation+4.40e-01

Pos logit contributions
OPLE+0.184
POSE+0.170
ンジ+0.165
+0.161
iets+0.161
Neg logit contributions
osal-0.183
obbies-0.159
hedral-0.158
paces-0.158
heses-0.145
Token check
Token ID2198
Feature activation+7.91e-03

Pos logit contributions
OPLE+4.381
POSE+4.088
pointer+3.896
ンジ+3.866
pling+3.842
Neg logit contributions
osal-5.281
obbies-4.744
hedral-4.736
paces-4.721
heses-4.379
Token.
Token ID13
Feature activation-2.19e-01

Pos logit contributions
OPLE+0.092
POSE+0.086
ンジ+0.083
+0.081
iets+0.081
Neg logit contributions
osal-0.088
obbies-0.076
hedral-0.076
paces-0.076
heses-0.069
Token
Token ID447
Feature activation+1.26e-01

Pos logit contributions
osal+2.691
paces+2.370
obbies+2.359
hedral+2.348
heses+2.175
Neg logit contributions
OPLE-2.257
POSE-2.067
ンジ-1.999
iets-1.980
puter-1.958
Token
Token ID247
Feature activation-1.26e-01

Pos logit contributions
OPLE+1.376
POSE+1.269
ンジ+1.230
+1.209
================================================================+1.193
Neg logit contributions
osal-1.503
obbies-1.322
hedral-1.310
paces-1.308
heses-1.200
Token knew
Token ID2993
Feature activation-6.68e-02

Pos logit contributions
OPLE+1.522
POSE+1.417
ンジ+1.359
iets+1.349
+1.343
Neg logit contributions
osal-1.462
obbies-1.266
hedral-1.257
paces-1.246
heses-1.154
Token no
Token ID645
Feature activation-2.23e-01

Pos logit contributions
osal+0.762
paces+0.663
obbies+0.661
hedral+0.659
heses+0.603
Neg logit contributions
OPLE-0.765
POSE-0.706
ンジ-0.686
iets-0.666
-0.666
Token bounds
Token ID22303
Feature activation+5.25e-02

Pos logit contributions
osal+2.281
paces+1.960
hedral+1.951
obbies+1.937
heses+1.755
Neg logit contributions
OPLE-2.801
POSE-2.599
ンジ-2.532
pointer-2.477
puter-2.477
Token,
Token ID11
Feature activation+1.94e-01

Pos logit contributions
OPLE+0.611
POSE+0.566
ンジ+0.549
+0.538
iets+0.535
Neg logit contributions
osal-0.585
obbies-0.507
hedral-0.505
paces-0.503
heses-0.461
Token
Token ID447
Feature activation+1.70e-01

Pos logit contributions
OPLE+2.140
POSE+1.992
ンジ+1.907
+1.877
iets+1.875
Neg logit contributions
osal-2.258
hedral-1.969
obbies-1.963
paces-1.958
heses-1.803
Token
Token ID251
Feature activation+2.83e-01

Pos logit contributions
OPLE+1.857
POSE+1.721
ンジ+1.670
+1.643
iets+1.619
Neg logit contributions
osal-2.006
obbies-1.772
paces-1.752
hedral-1.744
heses-1.610
Token his
Token ID465
Feature activation+7.13e-02

Pos logit contributions
OPLE+3.130
POSE+2.934
ンジ+2.806
iets+2.744
+2.739
Neg logit contributions
osal-3.247
obbies-2.830
hedral-2.829
paces-2.828
heses-2.595
Token relatives
Token ID12772
Feature activation+2.48e-01

Pos logit contributions
OPLE+0.829
POSE+0.769
ンジ+0.743
iets+0.728
+0.727
Neg logit contributions
osal-0.795
hedral-0.688
paces-0.688
obbies-0.688
heses-0.626
Token said
Token ID531
Feature activation+2.93e-01

Pos logit contributions
OPLE+2.767
POSE+2.560
ンジ+2.484
+2.412
iets+2.411
Neg logit contributions
osal-2.854
obbies-2.482
hedral-2.439
paces-2.413
idding-2.246
Token.
Token ID13
Feature activation+8.66e-02

Pos logit contributions
OPLE+3.241
POSE+2.992
ンジ+2.836
pling+2.795
iets+2.780
Neg logit contributions
osal-3.392
obbies-2.981
hedral-2.949
paces-2.939
heses-2.745
Token
Token ID198
Feature activation+1.79e-01

Pos logit contributions
OPLE+0.997
POSE+0.922
ンジ+0.893
+0.872
================================================================+0.868
Neg logit contributions
osal-0.979
obbies-0.854
hedral-0.854
paces-0.845
heses-0.776
Token nor
Token ID4249
Feature activation-3.20e-01

Pos logit contributions
osal+0.300
obbies+0.258
hedral+0.257
paces+0.257
heses+0.234
Neg logit contributions
OPLE-0.333
POSE-0.309
ンジ-0.301
-0.294
iets-0.293
Token that
Token ID326
Feature activation+5.09e-02

Pos logit contributions
osal+3.348
obbies+2.888
paces+2.875
hedral+2.874
idding+2.601
Neg logit contributions
OPLE-3.933
POSE-3.639
ンジ-3.546
-3.479
================================================================-3.437
Token he
Token ID339
Feature activation+1.03e-01

Pos logit contributions
OPLE+0.560
POSE+0.514
ンジ+0.498
+0.486
iets+0.485
Neg logit contributions
osal-0.602
obbies-0.526
hedral-0.523
paces-0.523
heses-0.481
Token'd
Token ID1549
Feature activation+5.44e-01

Pos logit contributions
OPLE+1.269
POSE+1.182
ンジ+1.152
iets+1.132
puter+1.125
Neg logit contributions
osal-1.067
obbies-0.917
hedral-0.910
paces-0.902
heses-0.826
Token be
Token ID307
Feature activation+1.23e-01

Pos logit contributions
OPLE+5.294
oner+4.778
pling+4.732
POSE+4.730
puter+4.726
Neg logit contributions
osal-6.474
obbies-5.661
hedral-5.641
paces-5.535
heses-5.415
Token happy
Token ID3772
Feature activation+1.71e-01

Pos logit contributions
OPLE+1.437
POSE+1.331
ンジ+1.288
pointer+1.261
+1.258
Neg logit contributions
osal-1.365
obbies-1.178
hedral-1.163
paces-1.163
heses-1.079
Token that
Token ID326
Feature activation+2.00e-01

Pos logit contributions
OPLE+1.884
POSE+1.744
ンジ+1.674
+1.654
pointer+1.650
Neg logit contributions
osal-2.001
obbies-1.746
hedral-1.725
paces-1.719
heses-1.606
Token they
Token ID484
Feature activation+3.76e-02

Pos logit contributions
OPLE+2.115
POSE+1.936
ンジ+1.853
+1.830
pointer+1.821
Neg logit contributions
osal-2.426
obbies-2.122
hedral-2.121
paces-2.114
heses-1.957
Token left
Token ID1364
Feature activation-3.23e-01

Pos logit contributions
OPLE+0.456
POSE+0.422
ンジ+0.411
iets+0.402
+0.402
Neg logit contributions
osal-0.402
obbies-0.347
hedral-0.345
paces-0.343
heses-0.314
Token home
Token ID1363
Feature activation-2.40e-01

Pos logit contributions
osal+3.354
paces+2.919
obbies+2.898
hedral+2.893
heses+2.659
Neg logit contributions
OPLE-3.932
POSE-3.653
ンジ-3.556
iets-3.460
================================================================-3.442
Token so
Token ID523
Feature activation+3.03e-01

Pos logit contributions
osal+2.568
paces+2.244
hedral+2.234
obbies+2.209
heses+2.024
Neg logit contributions
OPLE-2.903
POSE-2.659
ンジ-2.596
iets-2.525
-2.510
Token just
Token ID655
Feature activation+3.85e-01

Pos logit contributions
osal+1.288
paces+1.115
hedral+1.106
obbies+1.102
heses+1.007
Neg logit contributions
OPLE-1.435
POSE-1.319
ンジ-1.279
iets-1.249
-1.245
Token to
Token ID284
Feature activation+1.60e-01

Pos logit contributions
OPLE+3.773
POSE+3.502
ンジ+3.428
+3.401
pointer+3.392
Neg logit contributions
osal-4.687
obbies-4.268
hedral-4.162
paces-4.106
heses-3.888
Token get
Token ID651
Feature activation-7.33e-03

Pos logit contributions
OPLE+1.859
POSE+1.718
ンジ+1.671
pling+1.638
+1.635
Neg logit contributions
osal-1.772
obbies-1.559
hedral-1.538
paces-1.521
heses-1.407
Token rid
Token ID5755
Feature activation-8.65e-02

Pos logit contributions
osal+0.081
obbies+0.070
hedral+0.070
paces+0.069
heses+0.063
Neg logit contributions
OPLE-0.087
POSE-0.080
ンジ-0.078
-0.076
iets-0.076
Token of
Token ID286
Feature activation+2.85e-02

Pos logit contributions
osal+0.918
paces+0.788
hedral+0.784
obbies+0.782
heses+0.711
Neg logit contributions
OPLE-1.067
POSE-0.982
ンジ-0.957
-0.936
puter-0.933
Token the
Token ID262
Feature activation-6.94e-02

Pos logit contributions
OPLE+0.339
POSE+0.314
ンジ+0.305
+0.298
iets+0.297
Neg logit contributions
osal-0.312
obbies-0.270
paces-0.268
hedral-0.268
heses-0.244
Token proverbial
Token ID47327
Feature activation-3.26e-02

Pos logit contributions
osal+0.769
obbies+0.663
hedral+0.662
paces+0.660
heses+0.604
Neg logit contributions
OPLE-0.817
POSE-0.755
ンジ-0.735
-0.717
iets-0.716
Token al
Token ID435
Feature activation-8.98e-02

Pos logit contributions
osal+0.363
obbies+0.314
hedral+0.313
paces+0.313
heses+0.286
Neg logit contributions
OPLE-0.382
POSE-0.353
ンジ-0.343
-0.335
iets-0.334
Tokenbat
Token ID8664
Feature activation+2.69e-01

Pos logit contributions
osal+1.278
obbies+1.145
hedral+1.143
paces+1.136
heses+1.066
Neg logit contributions
OPLE-0.779
POSE-0.697
ンジ-0.663
-0.647
iets-0.641
Tokenross
Token ID1214
Feature activation-3.50e-01

Pos logit contributions
OPLE+3.274
POSE+3.079
ンジ+3.010
iets+2.984
pointer+2.969
Neg logit contributions
osal-2.698
obbies-2.364
paces-2.363
hedral-2.301
heses-2.093
Token around
Token ID1088
Feature activation-1.96e-01

Pos logit contributions
osal+3.631
hedral+3.070
obbies+3.007
paces+3.002
gey+2.792
Neg logit contributions
OPLE-4.361
POSE-4.041
ンジ-3.947
iets-3.830
HCR-3.811
Tokenalia
Token ID9752
Feature activation-1.52e-01

Pos logit contributions
OPLE+4.085
POSE+3.852
iets+3.729
================================================================+3.701
pointer+3.670
Neg logit contributions
osal-3.214
obbies-2.831
paces-2.767
hedral-2.751
idding-2.519
Token.
Token ID13
Feature activation-1.18e-01

Pos logit contributions
osal+1.599
paces+1.378
obbies+1.375
hedral+1.365
heses+1.242
Neg logit contributions
OPLE-1.878
POSE-1.737
ンジ-1.706
-1.658
iets-1.651
Token It
Token ID632
Feature activation-1.71e-01

Pos logit contributions
osal+1.291
obbies+1.112
paces+1.111
hedral+1.106
heses+1.017
Neg logit contributions
OPLE-1.392
POSE-1.286
ンジ-1.258
-1.227
iets-1.225
Token
Token ID447
Feature activation+6.31e-02

Pos logit contributions
osal+1.707
obbies+1.454
paces+1.452
hedral+1.449
heses+1.314
Neg logit contributions
OPLE-2.204
POSE-2.034
ンジ-1.975
-1.942
pointer-1.932
Token
Token ID247
Feature activation+2.72e-02

Pos logit contributions
OPLE+0.680
POSE+0.626
ンジ+0.606
+0.590
================================================================+0.588
Neg logit contributions
osal-0.761
obbies-0.666
hedral-0.662
paces-0.662
heses-0.609
Tokens
Token ID82
Feature activation-1.23e-01

Pos logit contributions
OPLE+0.321
POSE+0.298
ンジ+0.289
+0.283
iets+0.282
Neg logit contributions
osal-0.300
obbies-0.259
hedral-0.259
paces-0.258
heses-0.236
Token not
Token ID407
Feature activation-2.53e-01

Pos logit contributions
osal+1.352
hedral+1.168
obbies+1.168
paces+1.163
heses+1.065
Neg logit contributions
OPLE-1.449
POSE-1.338
ンジ-1.303
-1.270
iets-1.270
Token clear
Token ID1598
Feature activation-1.31e-01

Pos logit contributions
osal+2.661
hedral+2.286
paces+2.273
obbies+2.252
heses+2.086
Neg logit contributions
OPLE-3.074
POSE-2.857
ンジ-2.793
iets-2.720
-2.705
Token how
Token ID703
Feature activation+4.01e-01

Pos logit contributions
osal+1.480
obbies+1.279
paces+1.276
hedral+1.272
heses+1.164
Neg logit contributions
OPLE-1.510
POSE-1.392
ンジ-1.361
-1.324
puter-1.319
Token the
Token ID262
Feature activation-5.14e-03

Pos logit contributions
OPLE+4.295
POSE+4.020
ンジ+3.761
+3.755
oner+3.733
Neg logit contributions
osal-4.747
obbies-4.115
paces-4.106
hedral-4.092
heses-3.755
Token case
Token ID1339
Feature activation-2.00e-02

Pos logit contributions
osal+0.055
obbies+0.048
hedral+0.047
paces+0.047
heses+0.043
Neg logit contributions
OPLE-0.062
POSE-0.058
ンジ-0.056
-0.055
iets-0.055
Token!
Token ID0
Feature activation-1.98e-01

Pos logit contributions
OPLE+1.713
POSE+1.570
ンジ+1.521
iets+1.484
+1.482
Neg logit contributions
osal-1.795
obbies-1.565
hedral-1.560
paces-1.559
heses-1.425
Token This
Token ID770
Feature activation+1.62e-01

Pos logit contributions
osal+2.230
paces+1.940
obbies+1.923
hedral+1.918
heses+1.770
Neg logit contributions
OPLE-2.266
POSE-2.098
ンジ-2.056
-2.006
iets-1.997
Token is
Token ID318
Feature activation-5.28e-01

Pos logit contributions
OPLE+1.840
POSE+1.703
ンジ+1.637
iets+1.613
================================================================+1.609
Neg logit contributions
osal-1.838
obbies-1.621
hedral-1.594
paces-1.594
heses-1.457
Token some
Token ID617
Feature activation-3.64e-01

Pos logit contributions
osal+5.205
paces+4.573
hedral+4.316
obbies+4.239
idding+4.046
Neg logit contributions
OPLE-6.583
POSE-6.068
ンジ-6.060
-5.964
iets-5.776
Token seriously
Token ID6411
Feature activation-6.81e-01

Pos logit contributions
osal+3.717
paces+3.155
hedral+3.112
obbies+3.085
idding+2.840
Neg logit contributions
OPLE-4.559
POSE-4.219
ンジ-4.184
-4.063
iets-4.041
Token next
Token ID1306
Feature activation-3.09e-01

Pos logit contributions
osal+6.341
paces+5.505
hedral+5.368
obbies+5.052
idding+4.919
Neg logit contributions
OPLE-8.431
ンジ-7.870
POSE-7.817
-7.653
iets-7.569
Token-
Token ID12
Feature activation-2.31e-01

Pos logit contributions
osal+2.947
hedral+2.464
paces+2.462
obbies+2.438
gey+2.198
Neg logit contributions
OPLE-4.125
POSE-3.822
ンジ-3.718
iets-3.636
-3.634
Tokenlevel
Token ID5715
Feature activation-4.64e-01

Pos logit contributions
osal+2.225
paces+1.887
hedral+1.863
obbies+1.843
gey+1.656
Neg logit contributions
OPLE-3.086
POSE-2.855
ンジ-2.778
iets-2.726
-2.725
Token stuff
Token ID3404
Feature activation-1.04e-01

Pos logit contributions
osal+4.989
obbies+4.213
paces+4.212
hedral+4.211
idding+3.859
Neg logit contributions
OPLE-5.502
POSE-5.065
ンジ-5.004
-4.948
iets-4.842
Token.
Token ID13
Feature activation-2.32e-01

Pos logit contributions
osal+1.150
paces+0.990
obbies+0.989
hedral+0.988
heses+0.901
Neg logit contributions
OPLE-1.229
POSE-1.137
ンジ-1.112
-1.083
iets-1.076
Token While
Token ID2893
Feature activation+1.67e-01

Pos logit contributions
osal+2.574
paces+2.238
obbies+2.196
hedral+2.187
heses+2.033
Neg logit contributions
OPLE-2.691
POSE-2.498
ンジ-2.441
-2.393
iets-2.385
Token The
Token ID383
Feature activation-4.59e-01

Pos logit contributions
osal+2.246
paces+1.944
hedral+1.913
obbies+1.911
heses+1.757
Neg logit contributions
OPLE-2.307
POSE-2.126
ンジ-2.050
puter-2.037
iets-2.032
Token research
Token ID2267
Feature activation-1.22e-01

Pos logit contributions
osal+4.935
hedral+4.225
obbies+4.137
paces+4.130
idding+3.833
Neg logit contributions
OPLE-5.393
POSE-4.888
puter-4.767
-4.761
ンジ-4.749
Token undermines
Token ID37001
Feature activation-2.76e-01

Pos logit contributions
osal+1.067
hedral+0.881
obbies+0.880
paces+0.877
heses+0.783
Neg logit contributions
OPLE-1.717
POSE-1.607
ンジ-1.573
-1.546
iets-1.534
Token any
Token ID597
Feature activation-7.17e-01

Pos logit contributions
osal+2.897
hedral+2.465
obbies+2.445
paces+2.436
heses+2.236
Neg logit contributions
OPLE-3.357
POSE-3.106
ンジ-3.033
pointer-2.981
iets-2.974
Token notion
Token ID9495
Feature activation-1.46e-01

Pos logit contributions
osal+6.857
hedral+5.558
obbies+5.376
paces+5.320
gey+4.910
Neg logit contributions
OPLE-8.958
POSE-8.202
ンジ-8.121
-8.013
puter-7.986
Token of
Token ID286
Feature activation-2.38e-01

Pos logit contributions
osal+1.548
hedral+1.322
obbies+1.319
paces+1.318
heses+1.194
Neg logit contributions
OPLE-1.801
POSE-1.666
ンジ-1.618
iets-1.594
-1.580
Token a
Token ID257
Feature activation-3.21e-01

Pos logit contributions
osal+2.563
hedral+2.205
paces+2.184
obbies+2.180
idding+1.976
Neg logit contributions
OPLE-2.863
POSE-2.655
ンジ-2.575
iets-2.525
-2.524
Token crisis
Token ID4902
Feature activation+2.93e-01

Pos logit contributions
osal+3.385
hedral+2.900
paces+2.872
obbies+2.846
gey+2.602
Neg logit contributions
OPLE-3.884
POSE-3.571
ンジ-3.468
puter-3.427
pointer-3.410
Token caused
Token ID4073
Feature activation-3.27e-01

Pos logit contributions
OPLE+3.226
POSE+2.987
ンジ+2.919
================================================================+2.836
pointer+2.820
Neg logit contributions
osal-3.322
obbies-3.008
paces-2.918
hedral-2.895
heses-2.731
Token primarily
Token ID7525
Feature activation-2.25e-01

Pos logit contributions
osal+3.185
paces+2.691
hedral+2.682
obbies+2.663
idding+2.428
Neg logit contributions
OPLE-4.212
POSE-3.969
ンジ-3.763
iets-3.742
pointer-3.736
Token by
Token ID416
Feature activation-5.41e-01

Pos logit contributions
osal+2.274
obbies+1.922
hedral+1.911
paces+1.901
idding+1.725
Neg logit contributions
OPLE-2.836
POSE-2.682
ンジ-2.566
-2.531
puter-2.529
Token
Token ID198
Feature activation+3.69e-02

Pos logit contributions
osal+0.054
obbies+0.047
hedral+0.047
paces+0.047
heses+0.043
Neg logit contributions
OPLE-0.059
POSE-0.054
ンジ-0.053
-0.052
iets-0.051
Token
Token ID198
Feature activation+1.71e-01

Pos logit contributions
OPLE+0.440
POSE+0.410
ンジ+0.397
+0.388
pointer+0.387
Neg logit contributions
osal-0.400
obbies-0.347
hedral-0.344
paces-0.343
heses-0.314
TokenDec
Token ID10707
Feature activation-2.50e-01

Pos logit contributions
OPLE+1.917
POSE+1.774
ンジ+1.719
================================================================+1.691
pointer+1.672
Neg logit contributions
osal-1.959
obbies-1.733
hedral-1.716
paces-1.697
heses-1.581
Tokenades
Token ID2367
Feature activation-3.20e-01

Pos logit contributions
osal+2.768
obbies+2.337
hedral+2.327
paces+2.322
heses+2.163
Neg logit contributions
OPLE-3.010
POSE-2.784
ンジ-2.689
-2.628
iets-2.590
Token ago
Token ID2084
Feature activation+3.32e-03

Pos logit contributions
osal+3.257
obbies+2.778
paces+2.770
hedral+2.762
heses+2.482
Neg logit contributions
OPLE-4.023
POSE-3.753
ンジ-3.627
-3.560
puter-3.549
Token,
Token ID11
Feature activation-1.53e-01

Pos logit contributions
OPLE+0.039
POSE+0.036
ンジ+0.035
+0.034
iets+0.034
Neg logit contributions
osal-0.037
obbies-0.032
hedral-0.032
paces-0.032
heses-0.029
Token the
Token ID262
Feature activation+7.02e-03

Pos logit contributions
osal+1.681
paces+1.446
hedral+1.441
obbies+1.439
heses+1.312
Neg logit contributions
OPLE-1.812
POSE-1.680
ンジ-1.627
-1.605
puter-1.584
Token city
Token ID1748
Feature activation-4.17e-01

Pos logit contributions
OPLE+0.084
POSE+0.078
ンジ+0.076
+0.074
iets+0.074
Neg logit contributions
osal-0.076
obbies-0.066
paces-0.065
hedral-0.065
heses-0.060
Token planted
Token ID18969
Feature activation-1.01e-01

Pos logit contributions
osal+4.559
hedral+3.885
obbies+3.876
paces+3.858
auld+3.524
Neg logit contributions
OPLE-4.921
POSE-4.540
ンジ-4.316
iets-4.260
pointer-4.256
Token many
Token ID867
Feature activation+1.39e-01

Pos logit contributions
osal+1.069
paces+0.915
obbies+0.914
hedral+0.912
heses+0.827
Neg logit contributions
OPLE-1.248
POSE-1.156
ンジ-1.123
-1.102
puter-1.095
Token liquid
Token ID8122
Feature activation-1.37e-01

Pos logit contributions
OPLE+1.750
POSE+1.635
ンジ+1.597
+1.563
iets+1.558
Neg logit contributions
osal-1.394
obbies-1.198
hedral-1.183
paces-1.174
heses-1.073
Token two
Token ID734
Feature activation-2.65e-01

Pos logit contributions
osal+3.510
obbies+2.966
hedral+2.953
paces+2.936
heses+2.695
Neg logit contributions
OPLE-4.047
POSE-3.751
ンジ-3.709
-3.635
iets-3.585
Token options
Token ID3689
Feature activation-2.60e-01

Pos logit contributions
osal+2.995
obbies+2.571
paces+2.563
hedral+2.561
heses+2.352
Neg logit contributions
OPLE-3.040
POSE-2.805
ンジ-2.763
-2.685
puter-2.668
Token with
Token ID351
Feature activation-2.20e-01

Pos logit contributions
osal+2.797
paces+2.381
obbies+2.363
hedral+2.358
heses+2.140
Neg logit contributions
OPLE-3.138
POSE-2.890
ンジ-2.859
-2.782
pointer-2.763
Token the
Token ID262
Feature activation-1.26e-01

Pos logit contributions
osal+2.375
obbies+2.051
hedral+2.043
paces+2.015
heses+1.875
Neg logit contributions
OPLE-2.630
POSE-2.422
ンジ-2.398
-2.348
puter-2.328
Token family
Token ID1641
Feature activation-6.35e-02

Pos logit contributions
osal+1.389
obbies+1.192
hedral+1.190
paces+1.182
heses+1.087
Neg logit contributions
OPLE-1.491
POSE-1.380
ンジ-1.351
-1.321
iets-1.313
Token at
Token ID379
Feature activation-1.67e-01

Pos logit contributions
osal+0.703
obbies+0.606
hedral+0.604
paces+0.602
heses+0.551
Neg logit contributions
OPLE-0.750
POSE-0.694
ンジ-0.676
-0.660
iets-0.656
Token this
Token ID428
Feature activation-5.04e-01

Pos logit contributions
osal+1.864
hedral+1.618
obbies+1.615
paces+1.584
heses+1.473
Neg logit contributions
OPLE-1.944
POSE-1.796
ンジ-1.760
-1.711
puter-1.708
Token point
Token ID966
Feature activation+2.62e-01

Pos logit contributions
osal+5.337
hedral+4.549
obbies+4.469
paces+4.342
auld+4.038
Neg logit contributions
OPLE-5.934
ンジ-5.530
POSE-5.458
iets-5.337
puter-5.332
Token:
Token ID25
Feature activation-4.08e-01

Pos logit contributions
OPLE+2.862
POSE+2.651
ンジ+2.509
================================================================+2.489
+2.474
Neg logit contributions
osal-3.030
obbies-2.676
hedral-2.663
paces-2.644
heses-2.453
Token the
Token ID262
Feature activation+1.11e-01

Pos logit contributions
osal+4.312
obbies+3.654
hedral+3.609
paces+3.579
heses+3.345
Neg logit contributions
OPLE-4.894
POSE-4.520
ンジ-4.484
iets-4.330
pointer-4.308
Token clinical
Token ID8668
Feature activation+2.24e-01

Pos logit contributions
OPLE+1.272
POSE+1.175
ンジ+1.136
+1.110
pointer+1.106
Neg logit contributions
osal-1.242
paces-1.087
obbies-1.084
hedral-1.081
heses-0.986
Token by
Token ID416
Feature activation-2.74e-01

Pos logit contributions
osal+3.703
obbies+3.126
hedral+3.121
paces+3.059
gey+2.812
Neg logit contributions
OPLE-4.646
POSE-4.435
ンジ-4.242
-4.220
iets-4.110
Token Z
Token ID1168
Feature activation+1.94e-01

Pos logit contributions
osal+3.071
paces+2.659
obbies+2.634
hedral+2.626
gey+2.421
Neg logit contributions
OPLE-3.134
POSE-2.904
ンジ-2.851
-2.812
puter-2.751
Tokeney
Token ID2959
Feature activation-2.97e-02

Pos logit contributions
OPLE+2.185
POSE+2.023
iets+1.971
ンジ+1.954
pointer+1.927
Neg logit contributions
osal-2.176
paces-1.923
hedral-1.905
obbies-1.903
heses-1.734
Tokenad
Token ID324
Feature activation+2.80e-01

Pos logit contributions
osal+0.340
obbies+0.295
hedral+0.294
paces+0.294
heses+0.270
Neg logit contributions
OPLE-0.339
POSE-0.313
ンジ-0.304
-0.297
iets-0.295
Token Mas
Token ID11066
Feature activation-4.31e-01

Pos logit contributions
OPLE+3.148
POSE+2.897
ンジ+2.837
iets+2.796
pointer+2.779
Neg logit contributions
osal-3.130
obbies-2.764
paces-2.736
hedral-2.736
heses-2.501
Tokenro
Token ID305
Feature activation-9.25e-01

Pos logit contributions
osal+4.546
paces+3.703
obbies+3.642
hedral+3.619
gey+3.390
Neg logit contributions
OPLE-5.547
POSE-5.062
ンジ-4.972
-4.940
HCR-4.691
Tokenor
Token ID273
Feature activation-1.84e-01

Pos logit contributions
osal+9.470
obbies+7.012
hedral+6.967
ogie+6.942
paces+6.939
Neg logit contributions
OPLE-11.402
ンジ-10.614
POSE-10.549
-10.181
HCR-9.786
Token Khan
Token ID11356
Feature activation+2.72e-02

Pos logit contributions
osal+1.977
hedral+1.683
obbies+1.669
paces+1.663
gey+1.540
Neg logit contributions
OPLE-2.231
POSE-2.071
ンジ-2.008
-1.984
iets-1.944
Token;
Token ID26
Feature activation-3.79e-01

Pos logit contributions
OPLE+0.323
POSE+0.299
ンジ+0.290
iets+0.283
+0.283
Neg logit contributions
osal-0.299
obbies-0.259
hedral-0.258
paces-0.257
heses-0.235
Token Writing
Token ID22183
Feature activation-4.04e-03

Pos logit contributions
osal+4.199
obbies+3.597
hedral+3.596
paces+3.505
heses+3.297
Neg logit contributions
OPLE-4.382
POSE-4.059
ンジ-4.025
-3.948
puter-3.865
Token by
Token ID416
Feature activation-2.41e-01

Pos logit contributions
osal+0.044
obbies+0.038
paces+0.038
hedral+0.038
heses+0.034
Neg logit contributions
OPLE-0.048
POSE-0.045
ンジ-0.044
-0.043
iets-0.042
Token]
Token ID60
Feature activation-1.88e-01

Pos logit contributions
OPLE+1.561
POSE+1.447
ンジ+1.410
+1.383
iets+1.375
Neg logit contributions
osal-1.329
obbies-1.151
hedral-1.143
paces-1.142
heses-1.026
Token
Token ID198
Feature activation-2.04e-02

Pos logit contributions
osal+1.996
paces+1.721
hedral+1.696
obbies+1.689
heses+1.554
Neg logit contributions
OPLE-2.284
POSE-2.116
ンジ-2.072
-2.034
iets-2.023
Token
Token ID198
Feature activation-1.89e-01

Pos logit contributions
osal+0.216
obbies+0.185
paces+0.184
hedral+0.184
heses+0.167
Neg logit contributions
OPLE-0.250
POSE-0.232
ンジ-0.226
-0.221
iets-0.220
TokenWithout
Token ID16249
Feature activation-5.53e-01

Pos logit contributions
osal+2.103
paces+1.824
hedral+1.812
obbies+1.799
heses+1.648
Neg logit contributions
OPLE-2.195
POSE-2.010
ンジ-1.993
iets-1.951
-1.931
Token a
Token ID257
Feature activation-8.84e-01

Pos logit contributions
osal+5.686
paces+4.869
hedral+4.795
obbies+4.740
heses+4.391
Neg logit contributions
OPLE-6.607
POSE-6.005
-5.997
ンジ-5.992
iets-5.798
Token working
Token ID1762
Feature activation-8.86e-01

Pos logit contributions
osal+8.467
paces+7.516
hedral+7.185
obbies+7.024
erville+6.473
Neg logit contributions
OPLE-10.200
ンジ-9.210
-8.983
POSE-8.968
================================================================-8.940
Token star
Token ID3491
Feature activation-3.70e-01

Pos logit contributions
osal+7.808
paces+6.802
hedral+6.688
obbies+6.082
ogie+5.767
Neg logit contributions
OPLE-10.798
POSE-9.873
iets-9.734
ンジ-9.692
-9.566
Token tracker
Token ID30013
Feature activation-2.50e-01

Pos logit contributions
osal+4.112
hedral+3.535
paces+3.527
obbies+3.479
idding+3.239
Neg logit contributions
OPLE-4.235
POSE-3.873
ンジ-3.779
iets-3.748
-3.672
Token,
Token ID11
Feature activation-2.79e-01

Pos logit contributions
osal+2.591
paces+2.209
hedral+2.204
obbies+2.175
heses+1.998
Neg logit contributions
OPLE-3.097
POSE-2.862
ンジ-2.822
iets-2.761
-2.757
Token ion
Token ID22088
Feature activation-3.66e-01

Pos logit contributions
osal+3.028
paces+2.625
hedral+2.606
obbies+2.566
heses+2.381
Neg logit contributions
OPLE-3.292
POSE-3.025
ンジ-2.984
-2.914
iets-2.891
Token thrust
Token ID14613
Feature activation+3.44e-01

Pos logit contributions
osal+4.460
paces+3.849
obbies+3.836
hedral+3.829
heses+3.539
Neg logit contributions
OPLE-3.882
POSE-3.512
ンジ-3.421
iets-3.316
-3.308
Token Mind
Token ID10175
Feature activation+2.51e-01

Pos logit contributions
osal+1.458
hedral+1.262
paces+1.261
obbies+1.260
heses+1.153
Neg logit contributions
OPLE-1.448
POSE-1.333
ンジ-1.294
iets-1.268
-1.261
Tokenhunter
Token ID37488
Feature activation-9.08e-02

Pos logit contributions
OPLE+2.382
POSE+2.175
ンジ+2.148
pointer+2.114
================================================================+2.080
Neg logit contributions
osal-3.273
obbies-2.916
paces-2.909
hedral-2.826
heses-2.692
Token.
Token ID13
Feature activation-7.89e-02

Pos logit contributions
osal+0.980
hedral+0.839
obbies+0.839
paces+0.837
heses+0.761
Neg logit contributions
OPLE-1.096
POSE-1.015
ンジ-0.987
-0.966
iets-0.965
Token
Token ID564
Feature activation-1.20e-01

Pos logit contributions
osal+0.927
paces+0.807
obbies+0.806
hedral+0.804
heses+0.741
Neg logit contributions
OPLE-0.874
POSE-0.804
ンジ-0.781
-0.760
iets-0.760
Token
Token ID250
Feature activation-8.07e-02

Pos logit contributions
osal+1.140
hedral+0.957
obbies+0.956
paces+0.953
heses+0.856
Neg logit contributions
OPLE-1.595
POSE-1.484
ンジ-1.445
iets-1.415
-1.412
TokenNo
Token ID2949
Feature activation-1.03e+00

Pos logit contributions
osal+0.886
paces+0.764
obbies+0.763
hedral+0.761
heses+0.696
Neg logit contributions
OPLE-0.955
POSE-0.883
ンジ-0.862
-0.840
iets-0.840
Token one
Token ID530
Feature activation+5.09e-02

Pos logit contributions
osal+9.720
paces+8.043
obbies+7.930
hedral+7.783
gey+7.486
Neg logit contributions
OPLE-11.833
POSE-10.659
ンジ-10.529
================================================================-10.485
pointer-10.363
Token says
Token ID1139
Feature activation+2.63e-01

Pos logit contributions
OPLE+0.609
POSE+0.565
ンジ+0.550
iets+0.538
+0.537
Neg logit contributions
osal-0.551
obbies-0.476
hedral-0.474
paces-0.472
heses-0.432
Token,
Token ID11
Feature activation+1.27e-01

Pos logit contributions
OPLE+2.866
POSE+2.666
ンジ+2.573
+2.518
iets+2.496
Neg logit contributions
osal-3.075
hedral-2.698
obbies-2.696
paces-2.629
heses-2.429
Token
Token ID564
Feature activation+4.71e-02

Pos logit contributions
OPLE+1.369
POSE+1.264
ンジ+1.222
+1.193
iets+1.191
Neg logit contributions
osal-1.521
hedral-1.334
obbies-1.333
paces-1.320
heses-1.220
Token
Token ID246
Feature activation-7.89e-02

Pos logit contributions
OPLE+0.558
POSE+0.519
ンジ+0.506
+0.495
iets+0.491
Neg logit contributions
osal-0.515
obbies-0.445
hedral-0.442
paces-0.442
heses-0.403
Token huge
Token ID3236
Feature activation+3.32e-01

Pos logit contributions
OPLE+2.171
POSE+2.026
ンジ+1.968
+1.910
iets+1.909
Neg logit contributions
osal-1.984
hedral-1.718
paces-1.710
obbies-1.708
heses-1.578
Token spy
Token ID13997
Feature activation-1.57e-01

Pos logit contributions
OPLE+3.687
POSE+3.370
ンジ+3.295
iets+3.227
puter+3.213
Neg logit contributions
osal-3.750
paces-3.307
hedral-3.294
obbies-3.286
heses-3.053
Token apparatus
Token ID21152
Feature activation-4.56e-01

Pos logit contributions
osal+1.594
obbies+1.355
hedral+1.343
paces+1.342
gey+1.210
Neg logit contributions
OPLE-2.005
POSE-1.871
ンジ-1.825
-1.787
iets-1.771
Token that
Token ID326
Feature activation-4.75e-01

Pos logit contributions
osal+4.502
paces+3.945
hedral+3.844
obbies+3.827
gey+3.444
Neg logit contributions
OPLE-5.720
POSE-5.272
ンジ-5.169
puter-4.992
-4.988
Token only
Token ID691
Feature activation-1.54e-01

Pos logit contributions
osal+4.669
paces+3.993
obbies+3.945
hedral+3.875
heses+3.534
Neg logit contributions
OPLE-5.943
POSE-5.562
ンジ-5.296
pointer-5.277
puter-5.250
Token looks
Token ID3073
Feature activation-8.93e-01

Pos logit contributions
osal+1.703
paces+1.472
obbies+1.468
hedral+1.457
heses+1.331
Neg logit contributions
OPLE-1.809
POSE-1.676
ンジ-1.630
-1.594
pointer-1.581
Token after
Token ID706
Feature activation+1.19e-02

Pos logit contributions
osal+7.809
paces+6.736
hedral+6.504
obbies+6.484
gey+5.989
Neg logit contributions
OPLE-11.032
POSE-9.927
ンジ-9.918
pointer-9.854
puter-9.797
Token itself
Token ID2346
Feature activation-3.89e-02

Pos logit contributions
OPLE+0.139
POSE+0.128
ンジ+0.124
+0.122
iets+0.121
Neg logit contributions
osal-0.134
obbies-0.116
hedral-0.115
paces-0.115
heses-0.105
Token.
Token ID13
Feature activation-7.07e-02

Pos logit contributions
osal+0.423
obbies+0.365
paces+0.364
hedral+0.363
heses+0.331
Neg logit contributions
OPLE-0.466
POSE-0.431
ンジ-0.420
-0.409
iets-0.408
Token
Token ID198
Feature activation+1.70e-01

Pos logit contributions
osal+0.779
obbies+0.673
paces+0.672
hedral+0.669
heses+0.611
Neg logit contributions
OPLE-0.833
POSE-0.772
ンジ-0.751
-0.734
iets-0.733
Token
Token ID198
Feature activation+3.32e-02

Pos logit contributions
OPLE+1.889
POSE+1.778
ンジ+1.704
================================================================+1.685
pointer+1.677
Neg logit contributions
osal-1.938
obbies-1.716
hedral-1.689
paces-1.664
heses-1.559
Token older
Token ID4697
Feature activation+3.78e-01

Pos logit contributions
OPLE+4.314
POSE+3.927
ンジ+3.787
================================================================+3.765
iets+3.733
Neg logit contributions
osal-4.685
obbies-4.164
paces-4.138
hedral-4.077
heses-3.846
Token bug
Token ID5434
Feature activation-3.83e-02

Pos logit contributions
OPLE+3.904
POSE+3.649
ンジ+3.457
iets+3.445
puter+3.423
Neg logit contributions
osal-4.538
obbies-4.044
hedral-3.999
paces-3.905
heses-3.690
Token at
Token ID379
Feature activation-1.37e-01

Pos logit contributions
osal+0.426
obbies+0.368
paces+0.367
hedral+0.367
heses+0.335
Neg logit contributions
OPLE-0.450
POSE-0.416
ンジ-0.405
-0.395
iets-0.394
Token work
Token ID670
Feature activation+8.29e-02

Pos logit contributions
osal+1.510
hedral+1.297
obbies+1.294
paces+1.293
heses+1.185
Neg logit contributions
OPLE-1.609
POSE-1.488
ンジ-1.454
-1.415
iets-1.407
Token.
Token ID13
Feature activation-1.82e-01

Pos logit contributions
OPLE+0.951
POSE+0.882
ンジ+0.854
+0.837
puter+0.835
Neg logit contributions
osal-0.933
obbies-0.813
hedral-0.809
paces-0.798
heses-0.740
Token No
Token ID1400
Feature activation-8.93e-01

Pos logit contributions
osal+1.982
paces+1.711
obbies+1.690
hedral+1.685
heses+1.550
Neg logit contributions
OPLE-2.159
POSE-2.001
ンジ-1.945
-1.901
iets-1.901
Token big
Token ID1263
Feature activation+2.94e-01

Pos logit contributions
osal+8.407
paces+7.089
obbies+6.715
hedral+6.534
gey+6.307
Neg logit contributions
OPLE-10.878
POSE-9.941
ンジ-9.713
iets-9.522
-9.281
Tokengie
Token ID22699
Feature activation-2.47e-02

Pos logit contributions
OPLE+2.920
POSE+2.649
ンジ+2.583
+2.546
puter+2.538
Neg logit contributions
osal-3.683
hedral-3.275
obbies-3.273
paces-3.221
heses-2.994
Token,
Token ID11
Feature activation-1.88e-01

Pos logit contributions
osal+0.271
obbies+0.234
hedral+0.233
paces+0.233
heses+0.212
Neg logit contributions
OPLE-0.293
POSE-0.271
ンジ-0.264
-0.258
iets-0.257
Token because
Token ID780
Feature activation+2.05e-01

Pos logit contributions
osal+1.990
paces+1.726
obbies+1.704
hedral+1.698
heses+1.543
Neg logit contributions
OPLE-2.288
POSE-2.110
ンジ-2.071
-2.020
iets-2.000
Token I
Token ID314
Feature activation-5.35e-02

Pos logit contributions
OPLE+2.196
POSE+2.014
ンジ+1.934
================================================================+1.907
puter+1.903
Neg logit contributions
osal-2.464
obbies-2.171
hedral-2.146
paces-2.109
heses-1.976