TOP ACTIVATIONS
MAX = 1.157

version <|endoftext|> Rob C estern ino hosts a jam packed recap
. If the " w arp factor " is too large
at Ben ih ana restaurant in LA . But once outside
the extent of the inconsistency .[ 198 ] In addition ,
the event that the answer is affirm - at ive ,
, entering a " w arp factor " number that is
incorrect handling of Mam ad ou Sak ho drugs test Read
language . - Katherine d ' Ent rem ont , commissioner
ratified in 1997 .[ 7 ][ 8 ] W EP uses
, and J . J . B area . When you
river Award for Dist inguished Human itarian Service winner was Ar
nearby . " H mmm ," says Anders , " let
the place where she was raised . It turned out that
variants , you can enable / disable each and drag them
- The Associated Press AP Photo Sw
legraph . co . uk / business / analysis / why
. All the an ach ron istic elements , the d
Mongol rulers , including H ü leg ü .[ 6 ]
on to before beating J uss i Ja as kel ain
. The story where Cyborg qu its and the comes back

MIN ACTIVATIONS
MIN = -1.336

5 { ( 53 s )} 31 . Q d 2
more to help us continue to write about the issues that
to stand toe - to - toe with the world
, and explore the world with kindred spirits .
the spoil thereof , shalt thou take unto thy self .
, whose name has not been released , rode her bike
below to become an In Daily supporter . Powered by Press
se ign ements c ents ind iqu ent qu '
were in their 30 s and 40 s .
, and convert it to gr ays cale image = c
le d és acc ord persist e , les F ond
M uther h ucker 2008 01 31 O & A -
trying to stand toe - to - toe with the world
i , 1 , Length [ graph List ]} ] ];
Richard Nixon in a Pants uit was originally published at The
Air Station in South We ym outh . This is an
to 300 , 000 barrels a day . Virt ually all
may be repeated to your own prejudice .
Award nominations are accepted year - round ; the deadline for
the western edge of Mare Tr anqu ill it atis 65

INTERVAL 0.534 - 1.157
CONTAINS 0.695%

version will be with the PC version upon its release .
military bloc s and of increasing military bases we must
of degrading or in aus p icious names during the 13
by about 0 . 3 percent . In other words ,
week 's new issue . We asked them plenty about the

INTERVAL -0.089 - 0.534
CONTAINS 66.936%

uates . O urs is a Bachelor of Science degree in
said . Mer kel said she was " anything
Stock bro kers chief economist Alan Mc Qu aid
B ollywood at its best [ em ph ases added ],"
" Those who 've seen a be aver 's teeth will

INTERVAL -0.712 - -0.089
CONTAINS 32.277%

a number of other chances test the home defence without finding
being locked up under false pret enses . "
through a dating relationship . As we
known Linux root k its . He has expressed his sadness
biological sex , they wish to undertake physical measures , with

INTERVAL -1.336 - -0.712
CONTAINS 0.092%

am I an addict of crack cocaine , he
! Sign up for Take Action Now and we
é nat Imp é rial , le contr at qu '
ua is the first of two plastic yo - yo in
it , and convert it to gr ays cale image =
Token version
Token ID2196
Feature activation+9.82e-02

Pos logit contributions
+1.782
rogen+1.764
ourse+1.734
inho+1.689
enstein+1.675
Neg logit contributions
idth-2.215
 absorbed-2.159
etically-2.034
 resur-2.033
eston-2.026
Token<|endoftext|>
Token ID50256
Feature activation-2.01e-01

Pos logit contributions
idth+1.083
 absorbed+1.070
 resur+1.019
etically+1.009
eston+1.005
Neg logit contributions
-0.561
rogen-0.549
ourse-0.531
inho-0.516
enstein-0.502
TokenRob
Token ID14350
Feature activation+7.52e-02

Pos logit contributions
+1.634
rogen+1.612
ourse+1.569
inho+1.548
enstein+1.514
Neg logit contributions
idth-1.735
 absorbed-1.710
 resur-1.597
etically-1.568
eston-1.560
Token C
Token ID327
Feature activation+2.69e-01

Pos logit contributions
idth+0.623
 absorbed+0.612
 resur+0.573
eston+0.567
etically+0.564
Neg logit contributions
-0.637
rogen-0.626
ourse-0.614
inho-0.597
enstein-0.590
Tokenestern
Token ID3330
Feature activation+3.05e-01

Pos logit contributions
idth+2.087
 absorbed+2.038
 resur+1.916
eston+1.912
etically+1.877
Neg logit contributions
-2.434
rogen-2.344
ourse-2.299
inho-2.246
enstein-2.221
Tokenino
Token ID2879
Feature activation+1.16e+00

Pos logit contributions
idth+2.853
 absorbed+2.780
eston+2.632
etically+2.612
 resur+2.604
Neg logit contributions
-2.358
rogen-2.190
ourse-2.093
inho-2.051
 Vul-2.047
Token hosts
Token ID11453
Feature activation+4.19e-01

Pos logit contributions
 absorbed+8.402
 resur+7.995
idth+7.206
eston+7.151
 absorbing+7.122
Neg logit contributions
-10.283
ourse-9.231
rogen-9.115
inho-9.096
ury-8.987
Token a
Token ID257
Feature activation+5.29e-01

Pos logit contributions
 absorbed+3.394
idth+3.369
 resur+3.219
etically+3.123
eston+3.111
Neg logit contributions
-3.579
rogen-3.413
ourse-3.364
inho-3.286
enstein-3.281
Token jam
Token ID16853
Feature activation+1.37e-01

Pos logit contributions
 absorbed+3.889
idth+3.756
 resur+3.754
 absorbing+3.517
etically+3.490
Neg logit contributions
-4.904
ourse-4.577
rogen-4.566
enstein-4.463
inho-4.435
Token packed
Token ID11856
Feature activation+1.20e-01

Pos logit contributions
idth+1.236
 absorbed+1.223
 resur+1.149
eston+1.133
etically+1.132
Neg logit contributions
-1.064
rogen-1.043
ourse-1.018
inho-1.002
enstein-0.981
Token recap
Token ID26871
Feature activation+3.99e-01

Pos logit contributions
idth+0.915
 absorbed+0.909
 resur+0.847
etically+0.829
eston+0.827
Neg logit contributions
-1.086
rogen-1.062
ourse-1.045
inho-1.028
enstein-1.011
Token.
Token ID13
Feature activation+2.83e-01

Pos logit contributions
idth+0.775
 absorbed+0.770
 resur+0.721
etically+0.705
eston+0.702
Neg logit contributions
-0.778
rogen-0.763
ourse-0.748
inho-0.736
enstein-0.721
Token If
Token ID1002
Feature activation+3.53e-01

Pos logit contributions
idth+2.227
 absorbed+2.200
 resur+2.049
etically+2.034
eston+2.020
Neg logit contributions
-2.479
rogen-2.406
ourse-2.378
inho-2.327
enstein-2.296
Token the
Token ID262
Feature activation+3.66e-01

Pos logit contributions
 absorbed+2.766
idth+2.730
 resur+2.564
etically+2.464
 absorbing+2.453
Neg logit contributions
-3.201
rogen-3.034
ourse-3.027
inho-3.011
enstein-2.923
Token "
Token ID366
Feature activation+3.91e-01

Pos logit contributions
 absorbed+2.639
idth+2.634
 resur+2.437
 absorbing+2.335
etically+2.333
Neg logit contributions
-3.526
rogen-3.378
ourse-3.374
inho-3.340
enstein-3.245
Tokenw
Token ID86
Feature activation+4.90e-02

Pos logit contributions
idth+3.392
 absorbed+3.319
eston+3.082
 resur+3.082
etically+3.058
Neg logit contributions
-3.134
rogen-3.017
ourse-2.997
inho-2.973
enstein-2.886
Tokenarp
Token ID5117
Feature activation+1.14e+00

Pos logit contributions
idth+0.391
 absorbed+0.383
 resur+0.358
etically+0.353
eston+0.352
Neg logit contributions
-0.432
rogen-0.424
ourse-0.416
inho-0.409
enstein-0.402
Token factor
Token ID5766
Feature activation+5.61e-01

Pos logit contributions
 absorbed+5.479
idth+5.324
 absorbing+4.703
 resur+4.443
 absorption+4.303
Neg logit contributions
-13.088
ourse-12.285
rogen-11.788
 Vul-11.771
inho-11.737
Token"
Token ID1
Feature activation+8.29e-01

Pos logit contributions
idth+3.719
 absorbed+3.696
 resur+3.307
etically+3.222
 absorbing+3.218
Neg logit contributions
-5.643
ourse-5.437
rogen-5.400
inho-5.270
enstein-5.214
Token is
Token ID318
Feature activation+7.29e-01

Pos logit contributions
 absorbed+5.850
idth+5.585
 resur+5.303
 absorbing+5.108
 absorption+4.995
Neg logit contributions
-7.978
ourse-7.524
inho-7.317
rogen-7.271
enstein-7.143
Token too
Token ID1165
Feature activation+4.70e-01

Pos logit contributions
 absorbed+5.851
idth+5.340
 resur+5.200
 absorbing+5.083
etically+5.016
Neg logit contributions
-6.563
ourse-6.127
inho-5.972
rogen-5.894
ury-5.728
Token large
Token ID1588
Feature activation+8.48e-01

Pos logit contributions
 absorbed+3.659
idth+3.541
 resur+3.321
etically+3.299
 absorbing+3.255
Neg logit contributions
-4.221
ourse-4.102
rogen-4.086
inho-3.960
enstein-3.916
Token at
Token ID379
Feature activation+4.29e-01

Pos logit contributions
+0.895
rogen+0.884
ourse+0.866
inho+0.850
enstein+0.834
Neg logit contributions
idth-0.946
 absorbed-0.927
 resur-0.870
etically-0.859
eston-0.857
Token Ben
Token ID3932
Feature activation-1.14e-01

Pos logit contributions
 absorbed+3.304
idth+3.272
 resur+3.074
eston+3.059
etically+2.982
Neg logit contributions
-3.792
rogen-3.728
ourse-3.634
inho-3.552
enstein-3.489
Tokenih
Token ID4449
Feature activation+1.80e-01

Pos logit contributions
+0.883
rogen+0.876
ourse+0.856
inho+0.848
igh+0.826
Neg logit contributions
idth-1.024
 absorbed-1.011
 resur-0.954
eston-0.934
etically-0.931
Tokenana
Token ID2271
Feature activation+2.48e-01

Pos logit contributions
idth+1.748
 absorbed+1.700
eston+1.627
 resur+1.615
etically+1.595
Neg logit contributions
-1.271
rogen-1.215
ourse-1.189
inho-1.158
 Vul-1.148
Token restaurant
Token ID7072
Feature activation+1.70e-01

Pos logit contributions
 absorbed+1.981
idth+1.978
 resur+1.853
eston+1.841
etically+1.818
Neg logit contributions
-2.122
rogen-2.097
ourse-2.046
inho-2.014
enstein-1.977
Token in
Token ID287
Feature activation+1.13e+00

Pos logit contributions
 absorbed+1.316
idth+1.308
 resur+1.230
eston+1.203
etically+1.192
Neg logit contributions
-1.517
rogen-1.502
ourse-1.466
inho-1.440
enstein-1.420
Token LA
Token ID9131
Feature activation+1.88e-01

Pos logit contributions
 absorbed+5.653
idth+5.577
eston+5.338
 resur+5.183
etically+4.994
Neg logit contributions
-11.467
rogen-11.424
ourse-11.250
inho-11.020
ury-10.753
Token.
Token ID13
Feature activation-1.46e-02

Pos logit contributions
idth+1.517
 absorbed+1.509
 resur+1.412
eston+1.394
etically+1.375
Neg logit contributions
-1.614
rogen-1.605
ourse-1.562
inho-1.538
enstein-1.520
Token But
Token ID887
Feature activation+2.46e-01

Pos logit contributions
+0.135
rogen+0.134
ourse+0.131
inho+0.129
enstein+0.127
Neg logit contributions
idth-0.108
 absorbed-0.106
 resur-0.099
eston-0.097
etically-0.097
Token once
Token ID1752
Feature activation+1.71e-01

Pos logit contributions
 absorbed+1.934
idth+1.893
 resur+1.822
eston+1.744
 absorbing+1.742
Neg logit contributions
-2.181
rogen-2.113
ourse-2.082
inho-2.046
enstein-2.025
Token outside
Token ID2354
Feature activation+3.11e-01

Pos logit contributions
 absorbed+1.264
idth+1.245
 resur+1.176
eston+1.129
etically+1.122
Neg logit contributions
-1.605
rogen-1.572
ourse-1.542
inho-1.517
enstein-1.492
Token the
Token ID262
Feature activation-2.89e-01

Pos logit contributions
+0.313
rogen+0.308
ourse+0.302
inho+0.297
enstein+0.289
Neg logit contributions
idth-0.334
 absorbed-0.329
 resur-0.310
etically-0.304
eston-0.303
Token extent
Token ID6287
Feature activation+4.17e-01

Pos logit contributions
+2.175
rogen+2.159
ourse+2.098
inho+2.063
enstein+2.021
Neg logit contributions
idth-2.647
 absorbed-2.583
 resur-2.431
eston-2.431
etically-2.414
Token of
Token ID286
Feature activation-3.86e-02

Pos logit contributions
 absorbed+3.234
idth+3.184
 resur+2.954
etically+2.876
 absorbing+2.846
Neg logit contributions
-3.758
rogen-3.634
ourse-3.571
inho-3.563
enstein-3.492
Token the
Token ID262
Feature activation-5.62e-03

Pos logit contributions
+0.338
rogen+0.334
ourse+0.327
inho+0.322
enstein+0.316
Neg logit contributions
idth-0.308
 absorbed-0.303
 resur-0.283
etically-0.278
eston-0.277
Token inconsistency
Token ID43831
Feature activation+1.09e-01

Pos logit contributions
+0.047
rogen+0.046
ourse+0.045
inho+0.045
enstein+0.044
Neg logit contributions
idth-0.047
 absorbed-0.046
 resur-0.043
etically-0.043
eston-0.043
Token.[
Token ID3693
Feature activation+1.10e+00

Pos logit contributions
idth+0.855
 absorbed+0.848
 resur+0.789
etically+0.771
eston+0.765
Neg logit contributions
-0.974
rogen-0.954
ourse-0.940
inho-0.927
enstein-0.912
Token198
Token ID22337
Feature activation-1.66e-01

Pos logit contributions
 absorbed+6.905
idth+6.748
MIC+6.389
 resur+6.322
etically+6.143
Neg logit contributions
-10.176
rogen-10.172
ourse-9.755
inho-9.646
ury-9.463
Token]
Token ID60
Feature activation+2.51e-01

Pos logit contributions
+1.229
rogen+1.208
ourse+1.182
inho+1.154
enstein+1.134
Neg logit contributions
idth-1.537
 absorbed-1.515
 resur-1.425
etically-1.415
eston-1.410
Token In
Token ID554
Feature activation+3.91e-02

Pos logit contributions
idth+1.944
 absorbed+1.924
 resur+1.807
eston+1.787
etically+1.782
Neg logit contributions
-2.186
rogen-2.173
ourse-2.130
inho-2.073
enstein-2.047
Token addition
Token ID3090
Feature activation-1.42e-01

Pos logit contributions
idth+0.323
 absorbed+0.318
 resur+0.298
etically+0.293
eston+0.292
Neg logit contributions
-0.331
rogen-0.327
ourse-0.320
inho-0.315
enstein-0.309
Token,
Token ID11
Feature activation-1.77e-02

Pos logit contributions
+1.184
rogen+1.173
ourse+1.137
inho+1.118
enstein+1.101
Neg logit contributions
idth-1.184
 absorbed-1.160
 resur-1.091
etically-1.074
eston-1.067
Token the
Token ID262
Feature activation-1.10e-01

Pos logit contributions
idth+1.277
 absorbed+1.267
 resur+1.190
etically+1.163
 absorbing+1.161
Neg logit contributions
-1.228
rogen-1.194
ourse-1.181
inho-1.151
enstein-1.143
Token event
Token ID1785
Feature activation-1.46e-01

Pos logit contributions
+0.961
rogen+0.949
ourse+0.929
inho+0.915
enstein+0.899
Neg logit contributions
idth-0.878
 absorbed-0.863
 resur-0.806
etically-0.792
eston-0.791
Token that
Token ID326
Feature activation+2.97e-02

Pos logit contributions
+1.186
rogen+1.177
ourse+1.149
inho+1.131
enstein+1.105
Neg logit contributions
idth-1.256
 absorbed-1.228
 resur-1.154
etically-1.140
eston-1.137
Token the
Token ID262
Feature activation+4.94e-02

Pos logit contributions
idth+0.262
 absorbed+0.260
 resur+0.244
etically+0.239
eston+0.239
Neg logit contributions
-0.236
rogen-0.231
ourse-0.227
inho-0.222
enstein-0.219
Token answer
Token ID3280
Feature activation+4.71e-01

Pos logit contributions
idth+0.418
 absorbed+0.413
 resur+0.388
etically+0.380
eston+0.379
Neg logit contributions
-0.409
rogen-0.403
ourse-0.394
inho-0.387
enstein-0.381
Token is
Token ID318
Feature activation+1.08e+00

Pos logit contributions
 absorbed+3.852
idth+3.786
 resur+3.620
etically+3.474
eston+3.458
Neg logit contributions
-3.937
ourse-3.827
rogen-3.765
inho-3.707
enstein-3.612
Token affirm
Token ID16266
Feature activation-2.90e-01

Pos logit contributions
 absorbed+7.009
 resur+6.186
idth+6.133
 absorbing+5.894
etically+5.831
Neg logit contributions
-10.270
ourse-9.908
rogen-9.423
inho-9.383
enstein-9.309
Token-
Token ID12
Feature activation-2.36e-01

Pos logit contributions
rogen+1.559
+1.540
ourse+1.486
inho+1.468
enstein+1.404
Neg logit contributions
idth-3.262
 absorbed-3.205
 resur-3.065
eston-3.015
etically-3.000
Token at
Token ID379
Feature activation+2.68e-01

Pos logit contributions
+1.925
rogen+1.920
ourse+1.880
inho+1.852
enstein+1.807
Neg logit contributions
idth-2.014
 absorbed-1.953
 resur-1.835
eston-1.806
etically-1.802
Tokenive
Token ID425
Feature activation+1.73e-01

Pos logit contributions
idth+1.844
 absorbed+1.821
etically+1.675
 resur+1.661
eston+1.656
Neg logit contributions
-2.634
rogen-2.534
ourse-2.534
inho-2.470
enstein-2.460
Token,
Token ID11
Feature activation-1.81e-01

Pos logit contributions
idth+1.378
 absorbed+1.357
 resur+1.265
etically+1.252
eston+1.242
Neg logit contributions
-1.515
rogen-1.482
ourse-1.461
inho-1.426
enstein-1.417
Token,
Token ID11
Feature activation+5.82e-01

Pos logit contributions
idth+1.793
 absorbed+1.781
 resur+1.663
etically+1.625
eston+1.594
Neg logit contributions
-2.025
rogen-1.981
ourse-1.940
inho-1.920
enstein-1.880
Token entering
Token ID8218
Feature activation+3.82e-01

Pos logit contributions
idth+4.635
 absorbed+4.555
 resur+4.285
 absorbing+4.193
etically+4.075
Neg logit contributions
-5.025
ourse-4.807
rogen-4.722
inho-4.715
enstein-4.640
Token a
Token ID257
Feature activation+3.09e-01

Pos logit contributions
 absorbed+3.072
idth+3.057
 resur+2.842
etically+2.780
eston+2.750
Neg logit contributions
-3.324
rogen-3.187
ourse-3.163
inho-3.152
enstein-3.078
Token "
Token ID366
Feature activation+2.58e-01

Pos logit contributions
idth+2.364
 absorbed+2.342
 resur+2.161
etically+2.105
eston+2.081
Neg logit contributions
-2.838
rogen-2.730
ourse-2.706
inho-2.674
enstein-2.609
Tokenw
Token ID86
Feature activation-1.14e-01

Pos logit contributions
idth+2.267
 absorbed+2.203
eston+2.052
etically+2.051
 resur+2.049
Neg logit contributions
-2.065
rogen-1.993
ourse-1.972
inho-1.959
enstein-1.901
Tokenarp
Token ID5117
Feature activation+1.06e+00

Pos logit contributions
rogen+0.951
+0.948
ourse+0.929
inho+0.912
enstein+0.897
Neg logit contributions
idth-0.935
 absorbed-0.924
 resur-0.864
etically-0.852
eston-0.851
Token factor
Token ID5766
Feature activation+4.79e-01

Pos logit contributions
idth+6.047
 absorbed+6.033
 absorbing+5.267
 resur+5.055
eston+5.038
Neg logit contributions
-11.198
ourse-10.423
rogen-10.162
inho-10.058
 Vul-9.990
Token"
Token ID1
Feature activation+5.03e-01

Pos logit contributions
idth+3.444
 absorbed+3.397
 resur+3.053
etically+3.016
 absorbing+2.998
Neg logit contributions
-4.574
rogen-4.383
ourse-4.365
inho-4.259
enstein-4.195
Token number
Token ID1271
Feature activation+4.04e-01

Pos logit contributions
idth+3.461
 absorbed+3.460
 resur+3.089
 absorbing+3.053
etically+3.052
Neg logit contributions
-4.951
rogen-4.720
ourse-4.701
inho-4.662
enstein-4.546
Token that
Token ID326
Feature activation+2.32e-01

Pos logit contributions
 absorbed+3.092
idth+3.085
etically+2.825
 resur+2.823
 absorbing+2.762
Neg logit contributions
-3.613
rogen-3.516
ourse-3.463
inho-3.422
enstein-3.336
Token is
Token ID318
Feature activation+2.60e-01

Pos logit contributions
 absorbed+2.078
idth+2.048
 resur+1.942
etically+1.897
eston+1.876
Neg logit contributions
-1.826
ourse-1.748
rogen-1.742
inho-1.704
enstein-1.654
Token incorrect
Token ID11491
Feature activation+5.54e-02

Pos logit contributions
+0.664
rogen+0.655
ourse+0.641
inho+0.629
enstein+0.618
Neg logit contributions
idth-0.697
 absorbed-0.685
 resur-0.643
etically-0.633
eston-0.632
Token handling
Token ID9041
Feature activation+1.84e-01

Pos logit contributions
idth+0.433
 absorbed+0.427
 resur+0.399
etically+0.390
eston+0.389
Neg logit contributions
-0.494
rogen-0.487
ourse-0.479
inho-0.469
enstein-0.463
Token of
Token ID286
Feature activation+3.09e-01

Pos logit contributions
idth+1.483
 absorbed+1.472
 resur+1.392
eston+1.333
etically+1.330
Neg logit contributions
-1.592
rogen-1.572
ourse-1.546
enstein-1.496
inho-1.494
Token Mam
Token ID29926
Feature activation+1.62e-01

Pos logit contributions
 absorbed+2.581
idth+2.538
 resur+2.425
 absorbing+2.357
etically+2.324
Neg logit contributions
-2.583
rogen-2.519
ourse-2.518
enstein-2.427
inho-2.372
Tokenad
Token ID324
Feature activation+1.71e-01

Pos logit contributions
idth+1.831
 absorbed+1.790
 resur+1.707
eston+1.702
etically+1.695
Neg logit contributions
-0.929
rogen-0.876
ourse-0.836
enstein-0.777
inho-0.769
Tokenou
Token ID280
Feature activation+1.06e+00

Pos logit contributions
idth+1.280
 absorbed+1.250
 resur+1.161
etically+1.145
eston+1.142
Neg logit contributions
-1.583
rogen-1.564
ourse-1.533
inho-1.505
enstein-1.485
Token Sak
Token ID13231
Feature activation+3.22e-01

Pos logit contributions
 absorbed+5.719
 resur+5.462
idth+5.449
etically+5.318
 absorbing+4.959
Neg logit contributions
rogen-11.608
-11.154
ourse-10.954
enstein-10.643
ulia-10.557
Tokenho
Token ID8873
Feature activation+2.53e-01

Pos logit contributions
idth+3.393
 absorbed+3.363
etically+3.201
eston+3.193
 resur+3.177
Neg logit contributions
-1.976
rogen-1.914
ourse-1.840
enstein-1.742
 Vul-1.729
Token drugs
Token ID5010
Feature activation-4.70e-02

Pos logit contributions
 absorbed+2.090
idth+2.087
 resur+1.976
 absorbing+1.903
etically+1.898
Neg logit contributions
-2.122
rogen-2.095
ourse-2.063
enstein-1.967
inho-1.961
Token test
Token ID1332
Feature activation+1.03e-01

Pos logit contributions
+0.391
rogen+0.386
ourse+0.377
inho+0.371
enstein+0.364
Neg logit contributions
idth-0.396
 absorbed-0.389
 resur-0.365
etically-0.359
eston-0.358
Token Read
Token ID4149
Feature activation+1.96e-01

Pos logit contributions
idth+0.904
 absorbed+0.892
 resur+0.847
eston+0.823
etically+0.819
Neg logit contributions
-0.821
rogen-0.808
ourse-0.788
inho-0.766
enstein-0.761
Token language
Token ID3303
Feature activation-3.34e-01

Pos logit contributions
+0.518
rogen+0.506
ourse+0.494
inho+0.485
enstein+0.472
Neg logit contributions
idth-0.845
 absorbed-0.836
 resur-0.793
etically-0.784
eston-0.781
Token.
Token ID13
Feature activation-5.64e-02

Pos logit contributions
+2.542
rogen+2.472
ourse+2.411
inho+2.355
 Vul+2.318
Neg logit contributions
idth-3.058
 absorbed-2.951
 resur-2.788
etically-2.765
eston-2.757
Token -
Token ID532
Feature activation+2.51e-02

Pos logit contributions
+0.470
rogen+0.464
ourse+0.454
inho+0.446
enstein+0.438
Neg logit contributions
idth-0.475
 absorbed-0.466
 resur-0.436
etically-0.430
eston-0.429
Token Katherine
Token ID32719
Feature activation+2.83e-01

Pos logit contributions
idth+0.228
 absorbed+0.224
 resur+0.211
eston+0.208
etically+0.208
Neg logit contributions
-0.192
rogen-0.190
ourse-0.185
inho-0.182
enstein-0.178
Token d
Token ID288
Feature activation-9.71e-02

Pos logit contributions
idth+2.409
 absorbed+2.394
 resur+2.228
eston+2.201
etically+2.199
Neg logit contributions
-2.294
rogen-2.275
ourse-2.234
inho-2.193
enstein-2.146
Token'
Token ID6
Feature activation+1.06e+00

Pos logit contributions
+0.815
rogen+0.805
ourse+0.788
inho+0.775
enstein+0.761
Neg logit contributions
idth-0.811
 absorbed-0.796
 resur-0.745
etically-0.734
eston-0.733
TokenEnt
Token ID14539
Feature activation+7.81e-02

Pos logit contributions
eston+2.151
MIC+2.090
idth+2.029
enta+1.985
etically+1.956
Neg logit contributions
rogen-13.886
-13.836
 redes-13.533
inho-13.365
ourse-13.231
Tokenrem
Token ID2787
Feature activation-5.51e-02

Pos logit contributions
idth+0.804
 absorbed+0.793
 resur+0.758
etically+0.752
eston+0.751
Neg logit contributions
-0.506
rogen-0.483
ourse-0.470
inho-0.461
enstein-0.452
Tokenont
Token ID756
Feature activation+2.02e-01

Pos logit contributions
+0.561
rogen+0.558
ourse+0.549
inho+0.541
enstein+0.532
Neg logit contributions
idth-0.360
 absorbed-0.352
 resur-0.322
etically-0.315
eston-0.315
Token,
Token ID11
Feature activation+2.56e-01

Pos logit contributions
idth+1.673
 absorbed+1.654
 resur+1.554
eston+1.526
etically+1.512
Neg logit contributions
-1.676
rogen-1.668
ourse-1.637
inho-1.611
enstein-1.574
Token commissioner
Token ID16248
Feature activation-3.78e-02

Pos logit contributions
idth+2.093
 absorbed+2.082
 resur+1.955
eston+1.906
 absorbing+1.885
Neg logit contributions
rogen-2.133
-2.128
ourse-2.070
inho-2.055
enstein-2.015
Token ratified
Token ID39067
Feature activation+3.24e-01

Pos logit contributions
 absorbed+2.036
idth+2.031
 resur+1.895
etically+1.842
eston+1.825
Neg logit contributions
-2.254
rogen-2.204
ourse-2.161
inho-2.141
enstein-2.095
Token in
Token ID287
Feature activation+1.39e-01

Pos logit contributions
 absorbed+2.390
idth+2.352
 resur+2.231
etically+2.157
eston+2.138
Neg logit contributions
-3.008
ourse-2.921
rogen-2.919
inho-2.844
enstein-2.809
Token 1997
Token ID8309
Feature activation+1.11e-02

Pos logit contributions
 absorbed+0.813
idth+0.808
 resur+0.741
etically+0.717
eston+0.713
Neg logit contributions
-1.505
rogen-1.472
ourse-1.458
inho-1.426
enstein-1.416
Token.[
Token ID3693
Feature activation+2.46e-01

Pos logit contributions
idth+0.092
 absorbed+0.090
 resur+0.085
etically+0.083
eston+0.083
Neg logit contributions
-0.094
rogen-0.093
ourse-0.091
inho-0.090
enstein-0.088
Token7
Token ID22
Feature activation+6.75e-02

Pos logit contributions
idth+1.982
 absorbed+1.945
 resur+1.828
eston+1.811
etically+1.797
Neg logit contributions
-2.120
rogen-2.074
ourse-2.048
inho-2.003
enstein-1.970
Token][
Token ID7131
Feature activation+1.05e+00

Pos logit contributions
idth+0.597
 absorbed+0.586
 resur+0.551
etically+0.544
eston+0.543
Neg logit contributions
-0.533
rogen-0.526
ourse-0.514
inho-0.505
enstein-0.495
Token8
Token ID23
Feature activation-1.38e-02

Pos logit contributions
idth+5.319
 absorbed+5.184
MIC+5.022
 resur+4.879
eston+4.823
Neg logit contributions
rogen-10.698
-10.619
ourse-10.552
inho-10.255
 Vul-10.084
Token]
Token ID60
Feature activation+1.13e-01

Pos logit contributions
+0.109
rogen+0.107
ourse+0.105
inho+0.103
enstein+0.101
Neg logit contributions
idth-0.122
 absorbed-0.119
 resur-0.112
etically-0.111
eston-0.111
Token W
Token ID370
Feature activation+1.58e-01

Pos logit contributions
idth+0.894
 absorbed+0.878
 resur+0.820
etically+0.809
eston+0.807
Neg logit contributions
-0.995
rogen-0.984
ourse-0.963
inho-0.946
enstein-0.929
TokenEP
Token ID8905
Feature activation+1.88e-01

Pos logit contributions
idth+1.032
 absorbed+1.002
 resur+0.924
etically+0.919
eston+0.913
Neg logit contributions
-1.620
rogen-1.592
ourse-1.559
inho-1.538
enstein-1.513
Token uses
Token ID3544
Feature activation-5.83e-02

Pos logit contributions
idth+1.659
 absorbed+1.656
 resur+1.547
etically+1.515
eston+1.511
Neg logit contributions
-1.495
rogen-1.458
ourse-1.426
inho-1.407
enstein-1.366
Token,
Token ID11
Feature activation+2.53e-01

Pos logit contributions
idth+2.808
 absorbed+2.794
 resur+2.594
eston+2.542
etically+2.538
Neg logit contributions
-3.414
rogen-3.376
ourse-3.329
inho-3.238
enstein-3.175
Token and
Token ID290
Feature activation+3.43e-01

Pos logit contributions
idth+2.046
 absorbed+2.036
 resur+1.910
etically+1.864
 absorbing+1.844
Neg logit contributions
-2.170
rogen-2.139
ourse-2.093
inho-2.025
enstein-2.005
Token J
Token ID449
Feature activation+3.67e-01

Pos logit contributions
 absorbed+2.832
idth+2.802
 resur+2.653
etically+2.580
 absorbing+2.579
Neg logit contributions
-2.900
rogen-2.801
ourse-2.783
inho-2.685
enstein-2.668
Token.
Token ID13
Feature activation+8.73e-01

Pos logit contributions
idth+2.976
 absorbed+2.894
eston+2.756
etically+2.714
 resur+2.695
Neg logit contributions
-3.186
rogen-3.006
ourse-2.919
inho-2.858
enstein-2.815
TokenJ
Token ID41
Feature activation+8.87e-01

Pos logit contributions
MIC+6.332
 absorbed+6.327
idth+6.198
etically+6.153
eston+6.133
Neg logit contributions
-7.178
ourse-7.106
rogen-6.933
enstein-6.360
rogens-6.339
Token.
Token ID13
Feature activation+1.05e+00

Pos logit contributions
 absorbed+5.271
idth+5.242
etically+5.147
eston+5.039
 resur+4.917
Neg logit contributions
-8.667
rogen-8.181
ourse-8.102
 redes-8.015
-7.716
Token B
Token ID347
Feature activation+1.83e-01

Pos logit contributions
 absorbed+6.741
idth+6.577
eston+6.567
etically+6.489
 Trip+6.215
Neg logit contributions
-9.877
ourse-9.682
rogen-8.954
 redes-8.459
atus-8.335
Tokenarea
Token ID20337
Feature activation+3.51e-01

Pos logit contributions
idth+0.564
 absorbed+0.538
etically+0.465
eston+0.437
 resur+0.406
Neg logit contributions
-2.550
rogen-2.482
ourse-2.415
 Vul-2.398
-2.398
Token.
Token ID13
Feature activation-1.17e-01

Pos logit contributions
 absorbed+2.773
idth+2.710
 resur+2.590
 absorbing+2.508
eston+2.499
Neg logit contributions
-3.069
rogen-2.995
ourse-2.974
inho-2.898
enstein-2.872
Token When
Token ID1649
Feature activation-9.04e-02

Pos logit contributions
+1.023
rogen+1.010
ourse+0.988
inho+0.968
enstein+0.954
Neg logit contributions
idth-0.931
 absorbed-0.916
 resur-0.855
etically-0.845
eston-0.841
Token you
Token ID345
Feature activation-4.91e-01

Pos logit contributions
+0.762
rogen+0.750
ourse+0.733
inho+0.721
enstein+0.708
Neg logit contributions
idth-0.755
 absorbed-0.746
 resur-0.698
etically-0.685
eston-0.682
Tokenriver
Token ID38291
Feature activation+1.60e-02

Pos logit contributions
rogen+1.310
+1.294
ourse+1.263
inho+1.228
enstein+1.184
Neg logit contributions
idth-2.905
 absorbed-2.890
 resur-2.746
etically-2.713
 absorbing-2.696
Token Award
Token ID11289
Feature activation+9.05e-02

Pos logit contributions
idth+0.125
 absorbed+0.122
 resur+0.114
eston+0.112
etically+0.112
Neg logit contributions
-0.143
rogen-0.141
ourse-0.138
inho-0.136
enstein-0.134
Token for
Token ID329
Feature activation+4.02e-02

Pos logit contributions
idth+0.742
 absorbed+0.731
 resur+0.683
eston+0.672
etically+0.670
Neg logit contributions
-0.773
rogen-0.763
ourse-0.748
inho-0.737
enstein-0.722
Token Dist
Token ID4307
Feature activation-2.45e-01

Pos logit contributions
idth+0.392
 absorbed+0.387
 resur+0.366
etically+0.362
eston+0.361
Neg logit contributions
-0.281
rogen-0.277
ourse-0.268
inho-0.264
enstein-0.258
Tokeninguished
Token ID46709
Feature activation+5.06e-01

Pos logit contributions
rogen+2.257
+2.229
ourse+2.201
inho+2.153
enstein+2.124
Neg logit contributions
idth-1.845
 absorbed-1.813
 resur-1.690
etically-1.634
 absorbing-1.634
Token Human
Token ID5524
Feature activation+1.04e+00

Pos logit contributions
 absorbed+3.231
idth+3.188
 resur+3.022
etically+2.984
eston+2.888
Neg logit contributions
-4.985
rogen-4.870
ourse-4.786
inho-4.705
enstein-4.614
Tokenitarian
Token ID8353
Feature activation+8.80e-01

Pos logit contributions
idth+2.854
 absorbed+2.770
etically+2.732
eston+2.658
 resur+2.421
Neg logit contributions
-13.211
 redes-12.771
rogen-12.572
ourse-12.378
enstein-12.356
Token Service
Token ID4809
Feature activation-1.66e-02

Pos logit contributions
 absorbed+3.026
eston+3.002
etically+2.913
idth+2.880
 resur+2.775
Neg logit contributions
-10.672
rogen-10.330
ourse-10.143
inho-10.019
 redes-10.015
Token winner
Token ID8464
Feature activation-5.43e-02

Pos logit contributions
+0.126
rogen+0.124
ourse+0.121
inho+0.119
enstein+0.116
Neg logit contributions
idth-0.152
 absorbed-0.150
 resur-0.141
etically-0.139
eston-0.139
Token was
Token ID373
Feature activation-2.24e-01

Pos logit contributions
+0.412
rogen+0.406
ourse+0.396
inho+0.389
enstein+0.381
Neg logit contributions
idth-0.498
 absorbed-0.490
 resur-0.462
etically-0.455
eston-0.455
Token Ar
Token ID943
Feature activation+3.70e-02

Pos logit contributions
+1.863
rogen+1.842
ourse+1.802
inho+1.774
enstein+1.737
Neg logit contributions
idth-1.886
 absorbed-1.846
 resur-1.728
etically-1.710
eston-1.705
Token nearby
Token ID6716
Feature activation+1.09e-01

Pos logit contributions
 absorbed+1.602
idth+1.601
 resur+1.479
etically+1.452
eston+1.449
Neg logit contributions
-1.922
rogen-1.886
ourse-1.864
inho-1.828
enstein-1.783
Token.
Token ID13
Feature activation+7.06e-01

Pos logit contributions
idth+0.873
 absorbed+0.870
 resur+0.812
eston+0.794
etically+0.794
Neg logit contributions
-0.944
rogen-0.929
ourse-0.916
inho-0.901
enstein-0.883
Token "
Token ID366
Feature activation+3.63e-01

Pos logit contributions
idth+4.369
 absorbed+4.324
etically+4.082
eston+3.992
 resur+3.962
Neg logit contributions
-6.802
ourse-6.699
rogen-6.630
enstein-6.443
ury-6.308
TokenH
Token ID39
Feature activation+1.63e-01

Pos logit contributions
idth+2.862
 absorbed+2.806
etically+2.671
MIC+2.636
eston+2.631
Neg logit contributions
-3.054
rogen-3.043
ourse-3.028
inho-2.912
enstein-2.888
Tokenmmm
Token ID27532
Feature activation+5.51e-01

Pos logit contributions
idth+1.425
 absorbed+1.387
 resur+1.309
eston+1.307
etically+1.297
Neg logit contributions
-1.313
rogen-1.280
ourse-1.250
inho-1.229
enstein-1.204
Token,"
Token ID553
Feature activation+1.03e+00

Pos logit contributions
 absorbed+4.268
idth+4.264
 resur+3.959
etically+3.936
eston+3.922
Neg logit contributions
-4.782
ourse-4.645
rogen-4.602
inho-4.377
enstein-4.365
Token says
Token ID1139
Feature activation+5.22e-01

Pos logit contributions
 absorbed+6.212
idth+5.988
 resur+5.641
eston+5.433
 absorbing+5.425
Neg logit contributions
-9.921
ourse-9.831
rogen-9.492
enstein-9.319
ury-9.086
Token Anders
Token ID25519
Feature activation+3.88e-01

Pos logit contributions
 absorbed+3.373
idth+3.370
 resur+3.081
etically+3.037
eston+2.982
Neg logit contributions
-5.197
ourse-5.097
rogen-5.059
inho-4.883
enstein-4.877
Token,
Token ID11
Feature activation+3.54e-01

Pos logit contributions
 absorbed+3.055
idth+2.973
 resur+2.828
etically+2.791
eston+2.771
Neg logit contributions
-3.406
ourse-3.332
rogen-3.320
inho-3.219
enstein-3.174
Token "
Token ID366
Feature activation+4.06e-02

Pos logit contributions
 absorbed+2.920
idth+2.799
 resur+2.706
 absorbing+2.654
etically+2.578
Neg logit contributions
-3.081
ourse-2.972
rogen-2.961
inho-2.865
enstein-2.848
Tokenlet
Token ID1616
Feature activation+4.13e-01

Pos logit contributions
idth+0.327
 absorbed+0.321
 resur+0.299
etically+0.297
eston+0.296
Neg logit contributions
-0.352
rogen-0.347
ourse-0.342
inho-0.335
enstein-0.329
Token the
Token ID262
Feature activation-2.62e-03

Pos logit contributions
+1.205
rogen+1.192
ourse+1.164
inho+1.143
enstein+1.123
Neg logit contributions
idth-1.289
 absorbed-1.262
 resur-1.183
eston-1.170
etically-1.170
Token place
Token ID1295
Feature activation+5.41e-02

Pos logit contributions
+0.023
rogen+0.023
ourse+0.023
inho+0.022
enstein+0.022
Neg logit contributions
idth-0.021
 absorbed-0.020
 resur-0.019
etically-0.018
eston-0.018
Token where
Token ID810
Feature activation+3.56e-02

Pos logit contributions
idth+0.439
 absorbed+0.433
 resur+0.404
eston+0.396
etically+0.396
Neg logit contributions
-0.466
rogen-0.460
ourse-0.450
inho-0.445
enstein-0.436
Token she
Token ID673
Feature activation+4.15e-01

Pos logit contributions
idth+0.295
 absorbed+0.292
 resur+0.273
etically+0.267
eston+0.266
Neg logit contributions
-0.301
rogen-0.296
ourse-0.290
inho-0.286
enstein-0.280
Token was
Token ID373
Feature activation+3.36e-01

Pos logit contributions
 absorbed+3.303
idth+3.150
 resur+3.090
eston+2.930
etically+2.869
Neg logit contributions
-3.678
ourse-3.581
rogen-3.567
inho-3.446
enstein-3.396
Token raised
Token ID4376
Feature activation+1.03e+00

Pos logit contributions
 absorbed+2.269
 resur+2.065
idth+2.060
 absorbing+1.978
eston+1.930
Neg logit contributions
-3.369
rogen-3.318
ourse-3.309
inho-3.226
enstein-3.213
Token.
Token ID13
Feature activation+2.55e-01

Pos logit contributions
 absorbed+6.746
 resur+6.159
 absorbing+6.003
idth+5.945
etically+5.722
Neg logit contributions
-9.419
rogen-9.219
ourse-9.106
inho-9.023
ulia-8.670
Token It
Token ID632
Feature activation+4.83e-01

Pos logit contributions
idth+1.903
 absorbed+1.877
 resur+1.766
etically+1.740
eston+1.728
Neg logit contributions
rogen-2.292
-2.287
ourse-2.227
inho-2.172
ury-2.127
Token turned
Token ID2900
Feature activation+4.32e-01

Pos logit contributions
 absorbed+4.057
idth+3.868
 resur+3.822
eston+3.621
etically+3.574
Neg logit contributions
-3.980
ourse-3.837
rogen-3.817
inho-3.691
 Vul-3.605
Token out
Token ID503
Feature activation+5.65e-01

Pos logit contributions
 absorbed+2.588
idth+2.465
 resur+2.387
 absorbing+2.261
etically+2.258
Neg logit contributions
-4.581
rogen-4.470
ourse-4.458
inho-4.359
enstein-4.292
Token that
Token ID326
Feature activation+3.45e-01

Pos logit contributions
 absorbed+4.158
idth+4.036
 resur+3.863
 absorbing+3.724
etically+3.646
Neg logit contributions
-5.102
ourse-4.987
rogen-4.980
inho-4.864
ury-4.786
Token variants
Token ID17670
Feature activation+7.01e-02

Pos logit contributions
idth+1.857
 absorbed+1.854
 resur+1.731
etically+1.705
eston+1.703
Neg logit contributions
-1.787
rogen-1.752
ourse-1.730
inho-1.697
enstein-1.669
Token,
Token ID11
Feature activation+1.43e-01

Pos logit contributions
idth+0.562
 absorbed+0.558
 resur+0.519
etically+0.511
eston+0.509
Neg logit contributions
-0.611
rogen-0.600
ourse-0.590
inho-0.581
enstein-0.569
Token you
Token ID345
Feature activation-9.41e-02

Pos logit contributions
idth+1.158
 absorbed+1.157
 resur+1.085
etically+1.064
eston+1.047
Neg logit contributions
-1.236
rogen-1.201
ourse-1.193
inho-1.172
enstein-1.151
Token can
Token ID460
Feature activation+1.09e-01

Pos logit contributions
+0.846
rogen+0.828
ourse+0.819
inho+0.803
enstein+0.789
Neg logit contributions
idth-0.729
 absorbed-0.725
 resur-0.677
etically-0.661
eston-0.658
Token enable
Token ID7139
Feature activation+2.47e-01

Pos logit contributions
idth+0.928
 absorbed+0.923
 resur+0.877
etically+0.852
eston+0.842
Neg logit contributions
-0.901
rogen-0.866
ourse-0.862
inho-0.844
enstein-0.824
Token/
Token ID14
Feature activation+1.02e+00

Pos logit contributions
idth+1.999
 absorbed+1.995
 resur+1.869
etically+1.829
eston+1.822
Neg logit contributions
-2.122
rogen-2.073
ourse-2.040
inho-2.020
enstein-1.979
Tokendisable
Token ID40223
Feature activation+4.98e-01

Pos logit contributions
 absorbed+2.128
etically+1.922
eston+1.831
idth+1.740
 resur+1.594
Neg logit contributions
-14.044
ourse-13.268
inho-12.766
rogen-12.665
ury-12.637
Token each
Token ID1123
Feature activation+1.22e-01

Pos logit contributions
 absorbed+3.883
idth+3.785
 resur+3.625
 absorbing+3.523
etically+3.508
Neg logit contributions
-4.371
rogen-4.212
inho-4.189
ourse-4.165
enstein-4.071
Token and
Token ID290
Feature activation-2.87e-01

Pos logit contributions
idth+1.024
 absorbed+1.022
 resur+0.955
etically+0.941
eston+0.934
Neg logit contributions
-1.015
rogen-0.990
ourse-0.975
inho-0.961
enstein-0.944
Token drag
Token ID6715
Feature activation-2.24e-02

Pos logit contributions
rogen+2.251
+2.231
ourse+2.161
inho+2.125
enstein+2.089
Neg logit contributions
idth-2.546
 absorbed-2.485
 resur-2.325
eston-2.312
etically-2.312
Token them
Token ID606
Feature activation+2.11e-02

Pos logit contributions
+0.182
rogen+0.180
ourse+0.176
inho+0.173
enstein+0.169
Neg logit contributions
idth-0.193
 absorbed-0.190
 resur-0.178
etically-0.175
eston-0.175
Token -
Token ID532
Feature activation+8.64e-01

Pos logit contributions
 absorbed+4.580
idth+4.514
 resur+4.318
eston+4.222
MIC+4.103
Neg logit contributions
-6.177
rogen-6.076
ourse-6.019
inho-5.895
enstein-5.786
Token The
Token ID383
Feature activation+2.11e-01

Pos logit contributions
 absorbed+6.179
idth+6.126
 resur+5.857
etically+5.833
eston+5.811
Neg logit contributions
-7.138
rogen-6.997
ourse-6.925
enstein-6.538
ury-6.505
Token Associated
Token ID10575
Feature activation-1.98e-01

Pos logit contributions
 absorbed+1.730
idth+1.728
 resur+1.627
etically+1.574
eston+1.570
Neg logit contributions
-1.799
rogen-1.746
ourse-1.718
inho-1.679
enstein-1.662
Token Press
Token ID4332
Feature activation+2.43e-01

Pos logit contributions
+1.342
rogen+1.336
ourse+1.285
inho+1.271
enstein+1.236
Neg logit contributions
idth-1.968
 absorbed-1.934
 resur-1.821
eston-1.803
etically-1.795
Token
Token ID198
Feature activation+4.16e-01

Pos logit contributions
 absorbed+2.029
idth+2.019
 resur+1.899
eston+1.859
etically+1.843
Neg logit contributions
-2.023
rogen-1.986
ourse-1.957
inho-1.923
enstein-1.874
Token
Token ID198
Feature activation+1.02e+00

Pos logit contributions
 absorbed+2.757
idth+2.743
etically+2.653
MIC+2.652
 resur+2.635
Neg logit contributions
ourse-3.830
-3.828
rogen-3.814
 redes-3.661
inho-3.637
TokenAP
Token ID2969
Feature activation+2.34e-01

Pos logit contributions
idth+7.759
MIC+7.740
eston+7.549
 absorbed+7.547
Property+7.262
Neg logit contributions
rogen-7.538
-7.459
ourse-7.451
 Vul-7.311
inho-7.216
Token Photo
Token ID5555
Feature activation+1.39e-01

Pos logit contributions
idth+1.978
 absorbed+1.938
 resur+1.815
eston+1.781
etically+1.777
Neg logit contributions
-1.925
rogen-1.896
ourse-1.873
inho-1.814
enstein-1.786
Token
Token ID198
Feature activation+9.49e-02

Pos logit contributions
idth+1.226
 absorbed+1.204
 resur+1.130
etically+1.118
eston+1.115
Neg logit contributions
-1.106
rogen-1.087
ourse-1.069
inho-1.044
enstein-1.023
Token
Token ID198
Feature activation+4.70e-01

Pos logit contributions
idth+0.752
 absorbed+0.744
 resur+0.707
etically+0.702
eston+0.697
Neg logit contributions
-0.815
rogen-0.799
ourse-0.795
inho-0.768
enstein-0.752
TokenSw
Token ID10462
Feature activation-9.92e-02

Pos logit contributions
idth+3.824
 absorbed+3.730
MIC+3.633
eston+3.621
etically+3.599
Neg logit contributions
-3.715
rogen-3.690
ourse-3.669
enstein-3.543
inho-3.529
Tokenlegraph
Token ID16606
Feature activation+3.50e-01

Pos logit contributions
idth+2.059
 absorbed+2.047
eston+1.949
 resur+1.942
etically+1.938
Neg logit contributions
-1.405
rogen-1.360
ourse-1.318
inho-1.304
enstein-1.275
Token.
Token ID13
Feature activation+1.91e-01

Pos logit contributions
idth+2.390
 absorbed+2.289
eston+2.277
etically+2.239
 resur+2.216
Neg logit contributions
-3.331
rogen-3.274
ourse-3.176
 redes-3.107
inho-3.092
Tokenco
Token ID1073
Feature activation+1.86e-01

Pos logit contributions
idth+1.357
 absorbed+1.334
eston+1.254
 resur+1.248
etically+1.240
Neg logit contributions
-1.845
rogen-1.781
ourse-1.752
enstein-1.730
inho-1.696
Token.
Token ID13
Feature activation+1.23e-01

Pos logit contributions
idth+1.425
 absorbed+1.393
 resur+1.330
etically+1.328
eston+1.325
Neg logit contributions
-1.671
rogen-1.606
ourse-1.586
inho-1.545
enstein-1.528
Tokenuk
Token ID2724
Feature activation+5.13e-01

Pos logit contributions
idth+0.794
 absorbed+0.781
etically+0.722
 resur+0.718
eston+0.711
Neg logit contributions
-1.261
rogen-1.245
ourse-1.215
enstein-1.193
inho-1.188
Token/
Token ID14
Feature activation+1.02e+00

Pos logit contributions
idth+3.519
 absorbed+3.295
eston+3.283
 resur+3.221
etically+3.220
Neg logit contributions
-4.757
rogen-4.638
ourse-4.632
 redes-4.432
enstein-4.419
Tokenbusiness
Token ID22680
Feature activation+3.87e-01

Pos logit contributions
idth+5.338
 absorbed+5.334
eston+5.052
 resur+4.877
etically+4.690
Neg logit contributions
-10.511
rogen-10.393
 Vul-10.093
enstein-9.953
inho-9.944
Token/
Token ID14
Feature activation+7.37e-01

Pos logit contributions
idth+2.997
 absorbed+2.878
eston+2.738
 resur+2.700
etically+2.656
Neg logit contributions
-3.441
rogen-3.319
ourse-3.274
 Vul-3.226
inho-3.196
Tokenanalysis
Token ID20930
Feature activation+1.95e-01

Pos logit contributions
idth+5.676
 absorbed+5.469
eston+5.281
etically+5.166
 resur+5.113
Neg logit contributions
-6.296
rogen-6.058
 Vul-5.967
ourse-5.850
inho-5.835
Token/
Token ID14
Feature activation+4.19e-01

Pos logit contributions
idth+1.665
 absorbed+1.624
 resur+1.526
eston+1.525
etically+1.499
Neg logit contributions
-1.596
rogen-1.565
ourse-1.531
inho-1.513
enstein-1.481
Tokenwhy
Token ID22850
Feature activation-1.56e-01

Pos logit contributions
idth+3.611
 absorbed+3.477
eston+3.321
etically+3.302
 resur+3.266
Neg logit contributions
-3.366
rogen-3.209
ourse-3.175
 Vul-3.128
inho-3.115
Token.
Token ID13
Feature activation+2.65e-01

Pos logit contributions
 absorbed+2.971
idth+2.885
 resur+2.777
 absorbing+2.670
etically+2.661
Neg logit contributions
-3.298
rogen-3.223
ourse-3.204
inho-3.125
enstein-3.054
Token All
Token ID1439
Feature activation+1.94e-01

Pos logit contributions
idth+1.908
 absorbed+1.895
 resur+1.773
etically+1.757
eston+1.745
Neg logit contributions
rogen-2.420
-2.403
ourse-2.374
inho-2.292
enstein-2.266
Token the
Token ID262
Feature activation-1.96e-01

Pos logit contributions
 absorbed+1.611
idth+1.604
 resur+1.496
etically+1.489
eston+1.461
Neg logit contributions
-1.609
rogen-1.590
ourse-1.563
inho-1.531
enstein-1.486
Token an
Token ID281
Feature activation-7.89e-03

Pos logit contributions
+1.647
rogen+1.627
ourse+1.589
inho+1.563
enstein+1.537
Neg logit contributions
idth-1.640
 absorbed-1.604
 resur-1.502
eston-1.484
etically-1.483
Tokenach
Token ID620
Feature activation+4.52e-01

Pos logit contributions
+0.060
rogen+0.059
ourse+0.058
inho+0.057
enstein+0.055
Neg logit contributions
idth-0.072
 absorbed-0.071
 resur-0.067
etically-0.066
eston-0.066
Tokenron
Token ID1313
Feature activation+1.01e+00

Pos logit contributions
idth+3.741
 absorbed+3.571
etically+3.378
 resur+3.357
eston+3.256
Neg logit contributions
-3.856
enstein-3.554
ourse-3.515
inho-3.487
rogen-3.467
Tokenistic
Token ID2569
Feature activation-2.69e-01

Pos logit contributions
idth+6.274
etically+5.877
 absorbed+5.552
eston+5.384
otted+5.165
Neg logit contributions
-9.829
rogen-9.574
ourse-9.496
 Vul-9.460
enstein-9.202
Token elements
Token ID4847
Feature activation+3.72e-02

Pos logit contributions
+2.249
rogen+2.223
ourse+2.165
inho+2.134
enstein+2.102
Neg logit contributions
idth-2.254
 absorbed-2.188
 resur-2.044
eston-2.027
etically-2.020
Token,
Token ID11
Feature activation+1.97e-01

Pos logit contributions
 absorbed+0.313
idth+0.312
 resur+0.294
etically+0.286
eston+0.285
Neg logit contributions
-0.308
rogen-0.304
ourse-0.299
inho-0.293
enstein-0.289
Token the
Token ID262
Feature activation+1.73e-01

Pos logit contributions
 absorbed+1.629
idth+1.596
 resur+1.519
 absorbing+1.479
etically+1.475
Neg logit contributions
-1.656
ourse-1.630
rogen-1.628
inho-1.590
enstein-1.554
Token d
Token ID288
Feature activation-4.85e-01

Pos logit contributions
 absorbed+1.440
idth+1.424
 resur+1.347
etically+1.324
 absorbing+1.312
Neg logit contributions
-1.442
rogen-1.425
ourse-1.416
inho-1.379
enstein-1.349
Token Mongol
Token ID42188
Feature activation+1.86e-01

Pos logit contributions
 absorbed+1.953
idth+1.936
 resur+1.809
etically+1.777
 absorbing+1.772
Neg logit contributions
-2.003
rogen-1.983
ourse-1.977
inho-1.908
enstein-1.899
Token rulers
Token ID24925
Feature activation+6.36e-04

Pos logit contributions
idth+1.408
 absorbed+1.407
 resur+1.301
etically+1.297
eston+1.276
Neg logit contributions
-1.687
rogen-1.665
ourse-1.642
inho-1.601
enstein-1.582
Token,
Token ID11
Feature activation+3.55e-01

Pos logit contributions
idth+0.005
 absorbed+0.005
 resur+0.005
etically+0.005
eston+0.005
Neg logit contributions
-0.005
rogen-0.005
ourse-0.005
inho-0.005
enstein-0.005
Token including
Token ID1390
Feature activation+1.92e-01

Pos logit contributions
 absorbed+2.829
idth+2.745
 resur+2.623
 absorbing+2.546
etically+2.494
Neg logit contributions
-3.105
ourse-3.068
rogen-3.036
inho-2.945
enstein-2.944
Token H
Token ID367
Feature activation+1.85e-01

Pos logit contributions
idth+1.625
 absorbed+1.618
 resur+1.516
etically+1.484
eston+1.475
Neg logit contributions
-1.568
rogen-1.552
ourse-1.536
inho-1.482
enstein-1.475
Tokenü
Token ID9116
Feature activation+1.01e+00

Pos logit contributions
idth+1.670
 absorbed+1.616
eston+1.541
 resur+1.530
etically+1.521
Neg logit contributions
-1.434
rogen-1.394
ourse-1.366
inho-1.337
enstein-1.312
Tokenleg
Token ID1455
Feature activation+1.62e-01

Pos logit contributions
idth+8.606
 absorbed+8.256
hler+7.947
etically+7.860
eston+7.836
Neg logit contributions
-7.423
ourse-6.928
rogen-6.660
-6.489
enstein-6.376
Tokenü
Token ID9116
Feature activation+3.71e-01

Pos logit contributions
idth+1.345
 absorbed+1.306
 resur+1.219
eston+1.211
etically+1.210
Neg logit contributions
-1.373
rogen-1.342
ourse-1.318
inho-1.280
enstein-1.267
Token.[
Token ID3693
Feature activation+1.52e-01

Pos logit contributions
idth+3.176
 absorbed+3.108
 resur+2.933
etically+2.875
eston+2.869
Neg logit contributions
-3.020
rogen-2.949
ourse-2.915
inho-2.790
enstein-2.772
Token6
Token ID21
Feature activation-6.05e-02

Pos logit contributions
idth+1.368
 absorbed+1.346
 resur+1.269
eston+1.255
etically+1.252
Neg logit contributions
-1.167
rogen-1.150
ourse-1.134
inho-1.109
enstein-1.084
Token]
Token ID60
Feature activation+2.79e-01

Pos logit contributions
+0.453
rogen+0.446
ourse+0.435
inho+0.428
enstein+0.418
Neg logit contributions
idth-0.560
 absorbed-0.551
 resur-0.519
eston-0.513
etically-0.512
Token on
Token ID319
Feature activation+2.77e-01

Pos logit contributions
 absorbed+0.884
idth+0.868
 resur+0.811
etically+0.798
 absorbing+0.780
Neg logit contributions
-1.231
ourse-1.210
rogen-1.210
inho-1.170
enstein-1.168
Token to
Token ID284
Feature activation+5.88e-01

Pos logit contributions
 absorbed+2.171
idth+2.147
 resur+2.005
etically+1.986
 absorbing+1.950
Neg logit contributions
-2.434
ourse-2.420
rogen-2.391
inho-2.311
enstein-2.305
Token before
Token ID878
Feature activation+2.18e-01

Pos logit contributions
 absorbed+4.402
 resur+4.097
idth+4.086
 absorbing+3.971
etically+3.832
Neg logit contributions
ourse-5.320
-5.304
rogen-5.155
enstein-5.066
inho-4.861
Token beating
Token ID11226
Feature activation+1.34e-01

Pos logit contributions
 absorbed+1.798
idth+1.757
 resur+1.693
 absorbing+1.659
etically+1.621
Neg logit contributions
-1.839
rogen-1.818
ourse-1.810
enstein-1.741
inho-1.713
Token J
Token ID449
Feature activation+3.61e-01

Pos logit contributions
idth+1.131
 absorbed+1.130
 resur+1.059
etically+1.038
eston+1.034
Neg logit contributions
-1.104
rogen-1.091
ourse-1.078
enstein-1.035
inho-1.028
Tokenuss
Token ID1046
Feature activation+1.01e+00

Pos logit contributions
idth+3.263
 absorbed+3.156
eston+3.039
etically+2.984
 resur+2.983
Neg logit contributions
-2.809
rogen-2.645
ourse-2.624
enstein-2.518
inho-2.502
Tokeni
Token ID72
Feature activation+3.07e-01

Pos logit contributions
 absorbed+5.682
idth+5.474
otted+5.060
etically+4.935
 resur+4.875
Neg logit contributions
-10.962
rogen-9.837
ourse-9.702
-9.571
enstein-9.458
Token Ja
Token ID13790
Feature activation-5.93e-02

Pos logit contributions
 absorbed+2.940
idth+2.851
 resur+2.775
etically+2.691
eston+2.687
Neg logit contributions
-2.226
rogen-2.125
ourse-2.121
enstein-2.022
inho-1.979
Tokenas
Token ID292
Feature activation+3.33e-01

Pos logit contributions
+0.476
rogen+0.468
ourse+0.458
inho+0.448
enstein+0.441
Neg logit contributions
idth-0.518
 absorbed-0.508
 resur-0.478
etically-0.471
eston-0.471
Tokenkel
Token ID7750
Feature activation+3.81e-01

Pos logit contributions
idth+3.229
 absorbed+3.166
eston+3.009
 resur+3.005
etically+2.986
Neg logit contributions
-2.333
rogen-2.274
ourse-2.175
inho-2.088
enstein-2.054
Tokenain
Token ID391
Feature activation+2.61e-01

Pos logit contributions
 absorbed+3.316
idth+3.264
eston+3.168
 resur+3.058
etically+3.037
Neg logit contributions
rogen-3.035
-3.012
ourse-2.952
inho-2.743
ury-2.719
Token.
Token ID13
Feature activation-1.63e-01

Pos logit contributions
+0.977
rogen+0.968
ourse+0.945
inho+0.929
enstein+0.913
Neg logit contributions
idth-0.973
 absorbed-0.949
 resur-0.889
etically-0.879
eston-0.878
Token The
Token ID383
Feature activation-5.24e-02

Pos logit contributions
+1.391
rogen+1.374
ourse+1.345
inho+1.326
enstein+1.300
Neg logit contributions
idth-1.341
 absorbed-1.315
 resur-1.229
etically-1.210
eston-1.209
Token story
Token ID1621
Feature activation-9.40e-03

Pos logit contributions
+0.453
rogen+0.447
ourse+0.439
inho+0.431
enstein+0.423
Neg logit contributions
idth-0.425
 absorbed-0.417
 resur-0.390
etically-0.384
eston-0.383
Token where
Token ID810
Feature activation-1.16e-01

Pos logit contributions
+0.076
rogen+0.075
ourse+0.073
inho+0.072
enstein+0.070
Neg logit contributions
idth-0.082
 absorbed-0.081
 resur-0.076
etically-0.074
eston-0.074
Token Cyborg
Token ID46114
Feature activation+4.80e-02

Pos logit contributions
+0.931
rogen+0.918
ourse+0.898
inho+0.883
enstein+0.865
Neg logit contributions
idth-1.008
 absorbed-0.992
 resur-0.931
etically-0.917
eston-0.916
Token qu
Token ID627
Feature activation+1.01e+00

Pos logit contributions
idth+0.412
 absorbed+0.409
 resur+0.384
etically+0.376
eston+0.375
Neg logit contributions
-0.391
rogen-0.385
ourse-0.378
inho-0.370
enstein-0.363
Tokenits
Token ID896
Feature activation+8.07e-02

Pos logit contributions
idth+6.768
etically+6.548
 absorbed+6.508
eston+6.142
otted+6.070
Neg logit contributions
-9.626
rogen-9.010
 Vul-8.441
-8.404
 Promise-8.296
Token and
Token ID290
Feature activation+1.45e-01

Pos logit contributions
 absorbed+0.683
idth+0.677
 resur+0.641
etically+0.624
 absorbing+0.619
Neg logit contributions
-0.668
rogen-0.648
ourse-0.646
inho-0.633
enstein-0.620
Token the
Token ID262
Feature activation+2.15e-01

Pos logit contributions
 absorbed+1.243
idth+1.231
 resur+1.172
etically+1.130
 absorbing+1.121
Neg logit contributions
-1.190
rogen-1.157
ourse-1.156
inho-1.124
enstein-1.106
Token comes
Token ID2058
Feature activation+2.76e-01

Pos logit contributions
 absorbed+1.761
idth+1.759
 resur+1.662
etically+1.596
 absorbing+1.591
Neg logit contributions
-1.822
rogen-1.781
ourse-1.771
inho-1.730
enstein-1.695
Token back
Token ID736
Feature activation+4.70e-01

Pos logit contributions
idth+2.188
 absorbed+2.155
 resur+2.043
etically+2.009
eston+1.978
Neg logit contributions
-2.410
rogen-2.354
ourse-2.329
inho-2.288
enstein-2.237
Token5
Token ID20
Feature activation-4.45e-01

Pos logit contributions
idth+2.683
 absorbed+2.657
eston+2.508
etically+2.489
 resur+2.483
Neg logit contributions
-2.404
rogen-2.320
ourse-2.294
inho-2.205
 Vul-2.195
Token {
Token ID1391
Feature activation+4.27e-02

Pos logit contributions
+1.950
rogen+1.940
inho+1.830
ourse+1.817
enstein+1.768
Neg logit contributions
idth-5.476
 absorbed-5.341
 resur-5.104
eston-5.052
etically-5.051
Token(
Token ID7
Feature activation+6.84e-02

Pos logit contributions
idth+0.313
 absorbed+0.306
 resur+0.286
eston+0.285
etically+0.285
Neg logit contributions
-0.393
rogen-0.390
ourse-0.386
inho-0.375
enstein-0.370
Token53
Token ID4310
Feature activation-2.39e-01

Pos logit contributions
idth+0.592
 absorbed+0.577
 resur+0.543
eston+0.538
etically+0.538
Neg logit contributions
-0.550
rogen-0.546
ourse-0.534
inho-0.520
enstein-0.513
Tokens
Token ID82
Feature activation-4.35e-01

Pos logit contributions
+1.618
rogen+1.608
ourse+1.554
inho+1.537
enstein+1.493
Neg logit contributions
idth-2.376
 absorbed-2.333
 resur-2.208
etically-2.181
eston-2.175
Token)}
Token ID38165
Feature activation-1.34e+00

Pos logit contributions
rogen+2.544
+2.539
inho+2.447
ourse+2.400
enstein+2.324
Neg logit contributions
idth-4.683
 absorbed-4.634
 resur-4.377
eston-4.313
etically-4.287
Token 31
Token ID3261
Feature activation+9.19e-02

Pos logit contributions
rogen+6.694
enstein+6.363
inho+6.351
+6.169
 Vul+6.151
Neg logit contributions
idth-13.531
 absorbed-13.159
 resur-12.859
eston-12.713
etically-12.391
Token.
Token ID13
Feature activation+3.58e-02

Pos logit contributions
idth+0.723
 absorbed+0.718
 resur+0.680
etically+0.672
eston+0.665
Neg logit contributions
-0.792
rogen-0.779
ourse-0.776
inho-0.745
enstein-0.738
Token Q
Token ID1195
Feature activation-7.52e-03

Pos logit contributions
idth+0.295
 absorbed+0.295
 resur+0.275
etically+0.270
eston+0.269
Neg logit contributions
-0.301
rogen-0.295
ourse-0.293
inho-0.285
enstein-0.279
Tokend
Token ID67
Feature activation+2.29e-01

Pos logit contributions
+0.057
rogen+0.056
ourse+0.055
inho+0.054
enstein+0.053
Neg logit contributions
idth-0.069
 absorbed-0.067
 resur-0.064
etically-0.063
eston-0.063
Token2
Token ID17
Feature activation-2.35e-01

Pos logit contributions
idth+2.131
 absorbed+2.125
 resur+1.995
eston+1.990
etically+1.989
Neg logit contributions
-1.649
ourse-1.587
rogen-1.586
inho-1.540
enstein-1.493
Token more
Token ID517
Feature activation-1.74e-01

Pos logit contributions
 absorbed+3.796
 resur+3.550
idth+3.426
etically+3.388
 absorbing+3.351
Neg logit contributions
-5.689
ourse-5.516
rogen-5.407
inho-5.340
enstein-5.246
Token to
Token ID284
Feature activation-3.87e-02

Pos logit contributions
+1.392
rogen+1.374
ourse+1.339
inho+1.320
enstein+1.294
Neg logit contributions
idth-1.519
 absorbed-1.487
 resur-1.396
etically-1.377
eston-1.376
Token help
Token ID1037
Feature activation-6.68e-01

Pos logit contributions
+0.363
rogen+0.359
ourse+0.352
inho+0.346
enstein+0.341
Neg logit contributions
idth-0.285
 absorbed-0.280
 resur-0.260
etically-0.256
eston-0.256
Token us
Token ID514
Feature activation-7.44e-01

Pos logit contributions
rogen+4.763
+4.699
ourse+4.686
enstein+4.536
inho+4.507
Neg logit contributions
idth-6.205
 absorbed-5.879
 carbohyd-5.598
MIC-5.510
etically-5.482
Token continue
Token ID2555
Feature activation-9.08e-01

Pos logit contributions
rogen+4.463
ourse+4.305
enstein+4.231
+4.223
inho+4.083
Neg logit contributions
idth-7.970
 absorbed-7.457
 carbohyd-7.215
MIC-6.935
olyn-6.926
Token to
Token ID284
Feature activation-1.32e+00

Pos logit contributions
rogen+5.316
+5.278
igh+4.961
enstein+4.952
ury+4.939
Neg logit contributions
 carbohyd-9.447
idth-9.398
 absorbed-9.046
 resur-8.425
eston-8.391
Token write
Token ID3551
Feature activation-6.65e-01

Pos logit contributions
rogen+5.787
enstein+5.784
+5.383
ourse+5.164
igh+5.053
Neg logit contributions
idth-14.466
 carbohyd-14.347
 absorbed-14.090
eston-13.372
MIC-13.328
Token about
Token ID546
Feature activation-6.36e-01

Pos logit contributions
rogen+5.121
+5.051
igh+4.894
ury+4.879
inho+4.869
Neg logit contributions
idth-5.919
 absorbed-5.549
 carbohyd-5.474
eston-5.192
enta-5.080
Token the
Token ID262
Feature activation-2.48e-01

Pos logit contributions
rogen+4.020
+4.018
inho+3.837
ourse+3.765
enstein+3.732
Neg logit contributions
idth-6.505
 absorbed-6.329
 carbohyd-6.106
 resur-5.906
eston-5.893
Token issues
Token ID2428
Feature activation-3.60e-01

Pos logit contributions
+1.986
rogen+1.969
ourse+1.918
inho+1.891
enstein+1.851
Neg logit contributions
idth-2.158
 absorbed-2.109
 resur-1.974
etically-1.956
eston-1.955
Token that
Token ID326
Feature activation-4.16e-01

Pos logit contributions
+2.801
rogen+2.772
ourse+2.688
inho+2.674
enstein+2.614
Neg logit contributions
idth-3.228
 absorbed-3.136
 resur-2.927
eston-2.912
etically-2.906
Token to
Token ID284
Feature activation-2.27e-01

Pos logit contributions
+1.660
rogen+1.632
ourse+1.589
inho+1.568
enstein+1.544
Neg logit contributions
idth-1.910
 absorbed-1.859
 resur-1.745
eston-1.737
etically-1.733
Token stand
Token ID1302
Feature activation-1.09e-01

Pos logit contributions
+1.873
rogen+1.861
ourse+1.812
inho+1.788
enstein+1.757
Neg logit contributions
idth-1.926
 absorbed-1.871
 resur-1.745
etically-1.743
eston-1.740
Token toe
Token ID21189
Feature activation-4.61e-01

Pos logit contributions
+1.027
rogen+1.014
ourse+0.997
inho+0.979
enstein+0.963
Neg logit contributions
idth-0.798
 absorbed-0.786
 resur-0.729
etically-0.714
eston-0.712
Token-
Token ID12
Feature activation-5.05e-01

Pos logit contributions
+3.197
rogen+3.162
inho+3.039
ourse+3.004
igh+2.985
Neg logit contributions
idth-4.376
 absorbed-4.263
eston-4.098
 resur-4.070
etically-3.937
Tokento
Token ID1462
Feature activation-1.06e+00

Pos logit contributions
rogen+3.388
+3.386
inho+3.363
ourse+3.306
enstein+3.254
Neg logit contributions
idth-4.879
 absorbed-4.800
 resur-4.590
eston-4.539
 absorbing-4.418
Token-
Token ID12
Feature activation-1.29e+00

Pos logit contributions
enstein+5.288
igh+5.228
rogen+5.022
inho+5.012
ourse+4.953
Neg logit contributions
 carbohyd-11.453
 absorbed-11.262
idth-11.239
 resur-10.686
 absorbing-10.647
Tokentoe
Token ID44579
Feature activation-2.08e-01

Pos logit contributions
enstein+2.468
igh+2.107
inho+1.969
ourse+1.755
adem+1.698
Neg logit contributions
 absorbed-17.777
 resur-17.435
 carbohyd-17.371
 absorbing-17.244
PDATE-17.043
Token with
Token ID351
Feature activation-1.52e-02

Pos logit contributions
+1.602
rogen+1.594
ourse+1.549
inho+1.532
enstein+1.502
Neg logit contributions
idth-1.868
 absorbed-1.821
 resur-1.712
eston-1.703
etically-1.694
Token the
Token ID262
Feature activation+1.67e-01

Pos logit contributions
+0.127
rogen+0.124
ourse+0.122
inho+0.119
enstein+0.117
Neg logit contributions
idth-0.127
 absorbed-0.126
 resur-0.119
etically-0.116
eston-0.115
Token world
Token ID995
Feature activation+1.85e-01

Pos logit contributions
 absorbed+1.392
idth+1.367
 resur+1.311
 absorbing+1.263
etically+1.251
Neg logit contributions
-1.408
rogen-1.359
ourse-1.345
inho-1.313
enstein-1.289
Token
Token ID447
Feature activation-5.03e-03

Pos logit contributions
idth+1.528
 absorbed+1.526
 resur+1.431
etically+1.401
eston+1.397
Neg logit contributions
-1.560
rogen-1.524
ourse-1.500
inho-1.462
enstein-1.441
Token,
Token ID11
Feature activation+1.82e-01

Pos logit contributions
+1.429
rogen+1.412
ourse+1.380
inho+1.362
enstein+1.338
Neg logit contributions
idth-1.361
 absorbed-1.336
 resur-1.245
etically-1.226
eston-1.223
Token and
Token ID290
Feature activation+3.14e-01

Pos logit contributions
 absorbed+1.421
idth+1.417
 resur+1.339
etically+1.323
eston+1.300
Neg logit contributions
-1.579
rogen-1.558
ourse-1.551
inho-1.521
enstein-1.474
Token explore
Token ID7301
Feature activation-2.70e-01

Pos logit contributions
 absorbed+2.421
idth+2.352
 resur+2.283
etically+2.213
eston+2.196
Neg logit contributions
-2.767
rogen-2.712
inho-2.689
ourse-2.687
enstein-2.605
Token the
Token ID262
Feature activation-2.09e-01

Pos logit contributions
+1.985
rogen+1.970
ourse+1.904
inho+1.877
enstein+1.841
Neg logit contributions
idth-2.538
 absorbed-2.482
 resur-2.330
etically-2.301
eston-2.300
Token world
Token ID995
Feature activation-5.31e-01

Pos logit contributions
+1.811
rogen+1.804
ourse+1.757
inho+1.732
enstein+1.705
Neg logit contributions
idth-1.692
 absorbed-1.645
 resur-1.527
etically-1.520
eston-1.516
Token with
Token ID351
Feature activation-1.28e+00

Pos logit contributions
rogen+3.584
+3.388
inho+3.302
ourse+3.289
enstein+3.269
Neg logit contributions
idth-5.331
 absorbed-5.144
 carbohyd-5.021
 resur-4.811
eston-4.772
Token kindred
Token ID46449
Feature activation-6.61e-01

Pos logit contributions
+6.774
rogen+6.445
 Vul+6.427
ourse+6.378
ury+6.265
Neg logit contributions
idth-13.244
 carbohyd-12.280
 absorbed-12.104
hler-12.088
eston-11.967
Token spirits
Token ID13747
Feature activation+3.20e-02

Pos logit contributions
+3.830
rogen+3.705
ourse+3.597
inho+3.465
igh+3.423
Neg logit contributions
idth-7.208
 absorbed-6.807
eston-6.606
hler-6.456
 resur-6.453
Token.
Token ID13
Feature activation-8.91e-03

Pos logit contributions
idth+0.259
 absorbed+0.255
 resur+0.242
etically+0.239
eston+0.238
Neg logit contributions
-0.272
rogen-0.270
ourse-0.265
inho-0.258
enstein-0.254
Token
Token ID198
Feature activation+6.87e-04

Pos logit contributions
+0.071
rogen+0.071
ourse+0.069
inho+0.068
enstein+0.066
Neg logit contributions
idth-0.077
 absorbed-0.076
 resur-0.071
etically-0.071
eston-0.071
Token
Token ID198
Feature activation+1.92e-01

Pos logit contributions
idth+0.006
 absorbed+0.006
 resur+0.005
etically+0.005
eston+0.005
Neg logit contributions
-0.006
rogen-0.005
ourse-0.005
inho-0.005
enstein-0.005
Token the
Token ID262
Feature activation-5.85e-01

Pos logit contributions
+2.216
rogen+2.192
ourse+2.138
inho+2.100
enstein+2.056
Neg logit contributions
idth-2.519
 absorbed-2.432
 resur-2.298
eston-2.284
etically-2.280
Token spoil
Token ID20851
Feature activation-6.64e-01

Pos logit contributions
rogen+4.627
+4.594
ourse+4.492
inho+4.383
 Vul+4.381
Neg logit contributions
idth-5.101
 absorbed-4.674
eston-4.583
etically-4.506
 resur-4.431
Token thereof
Token ID15370
Feature activation-5.94e-01

Pos logit contributions
rogen+4.405
ourse+4.321
+4.306
inho+4.121
ury+4.082
Neg logit contributions
idth-6.482
 absorbed-6.128
etically-6.003
eston-5.947
 resur-5.933
Token,
Token ID11
Feature activation-4.47e-01

Pos logit contributions
+4.414
rogen+4.392
ourse+4.349
inho+4.218
ury+4.139
Neg logit contributions
idth-5.325
 absorbed-5.049
etically-4.903
eston-4.873
 resur-4.812
Token shalt
Token ID36258
Feature activation-5.87e-01

Pos logit contributions
+3.249
rogen+3.179
ourse+3.112
inho+3.063
enstein+2.966
Neg logit contributions
idth-4.212
 absorbed-4.063
etically-3.878
eston-3.867
 resur-3.844
Token thou
Token ID14210
Feature activation-1.20e+00

Pos logit contributions
rogen+4.139
+4.032
ourse+3.966
inho+3.909
ury+3.879
Neg logit contributions
idth-5.509
 absorbed-5.265
etically-5.051
eston-5.030
 resur-4.920
Token take
Token ID1011
Feature activation-3.49e-01

Pos logit contributions
rogen+8.094
igh+7.810
ury+7.744
+7.533
ourse+7.523
Neg logit contributions
idth-10.825
 absorbed-9.559
etically-9.533
eston-9.470
 carbohyd-9.418
Token unto
Token ID12722
Feature activation-3.18e-02

Pos logit contributions
rogen+2.724
+2.711
ourse+2.664
inho+2.591
igh+2.550
Neg logit contributions
idth-3.081
 absorbed-2.948
 resur-2.785
eston-2.778
etically-2.773
Token thy
Token ID11906
Feature activation-5.57e-01

Pos logit contributions
+0.247
rogen+0.243
ourse+0.237
inho+0.233
enstein+0.229
Neg logit contributions
idth-0.286
 absorbed-0.281
 resur-0.265
etically-0.261
eston-0.261
Tokenself
Token ID944
Feature activation-7.16e-01

Pos logit contributions
+4.462
rogen+4.386
ourse+4.350
inho+4.159
igh+4.153
Neg logit contributions
idth-4.821
 absorbed-4.585
etically-4.427
 resur-4.319
eston-4.309
Token.
Token ID13
Feature activation-3.51e-01

Pos logit contributions
rogen+5.140
+5.004
ourse+4.972
ury+4.808
inho+4.797
Neg logit contributions
idth-6.593
 absorbed-6.208
etically-5.981
eston-5.974
 resur-5.929
Token,
Token ID11
Feature activation+1.60e-01

Pos logit contributions
 absorbed+2.820
 resur+2.663
idth+2.583
 absorbing+2.469
etically+2.421
Neg logit contributions
-3.073
ourse-2.945
rogen-2.892
inho-2.866
enstein-2.800
Token whose
Token ID3025
Feature activation+1.65e-02

Pos logit contributions
 absorbed+1.366
idth+1.361
 resur+1.285
 absorbing+1.236
etically+1.234
Neg logit contributions
-1.327
rogen-1.281
ourse-1.267
inho-1.253
enstein-1.228
Token name
Token ID1438
Feature activation-1.79e-01

Pos logit contributions
idth+0.141
 absorbed+0.140
 resur+0.132
etically+0.129
eston+0.128
Neg logit contributions
-0.135
rogen-0.132
ourse-0.130
inho-0.128
enstein-0.125
Token has
Token ID468
Feature activation-1.35e-01

Pos logit contributions
+1.278
rogen+1.272
ourse+1.220
inho+1.201
enstein+1.183
Neg logit contributions
idth-1.734
 absorbed-1.679
 resur-1.580
etically-1.574
eston-1.571
Token not
Token ID407
Feature activation-5.69e-01

Pos logit contributions
rogen+1.051
+1.048
ourse+1.011
inho+0.986
enstein+0.982
Neg logit contributions
idth-1.221
 absorbed-1.175
etically-1.099
eston-1.097
 resur-1.094
Token been
Token ID587
Feature activation-1.19e+00

Pos logit contributions
rogen+4.449
+4.232
ourse+4.206
igh+4.124
enstein+4.104
Neg logit contributions
idth-5.119
 absorbed-4.792
etically-4.557
 absorbing-4.477
MIC-4.424
Token released
Token ID2716
Feature activation-3.16e-01

Pos logit contributions
rogen+6.119
igh+5.386
ury+5.181
enstein+5.133
+5.131
Neg logit contributions
idth-13.878
etically-11.967
MIC-11.788
 absorbed-11.757
 absorbing-11.701
Token,
Token ID11
Feature activation-5.36e-02

Pos logit contributions
rogen+2.526
+2.516
ourse+2.422
inho+2.387
enstein+2.364
Neg logit contributions
idth-2.779
 absorbed-2.659
etically-2.475
 resur-2.469
eston-2.462
Token rode
Token ID22075
Feature activation+1.28e-01

Pos logit contributions
+0.430
rogen+0.420
ourse+0.414
inho+0.406
enstein+0.398
Neg logit contributions
 absorbed-0.468
idth-0.467
 resur-0.441
etically-0.428
eston-0.426
Token her
Token ID607
Feature activation+8.01e-02

Pos logit contributions
 absorbed+0.996
idth+0.992
 resur+0.930
etically+0.913
eston+0.893
Neg logit contributions
-1.151
rogen-1.112
ourse-1.100
inho-1.087
enstein-1.064
Token bike
Token ID7161
Feature activation-6.43e-02

Pos logit contributions
idth+0.553
 absorbed+0.551
 resur+0.510
etically+0.495
 absorbing+0.487
Neg logit contributions
-0.789
rogen-0.768
ourse-0.764
inho-0.757
enstein-0.741
Token below
Token ID2174
Feature activation-1.15e-01

Pos logit contributions
rogen+2.179
+2.166
ourse+2.117
inho+2.080
igh+2.066
Neg logit contributions
idth-1.953
 absorbed-1.789
 resur-1.693
etically-1.689
hler-1.672
Token to
Token ID284
Feature activation-4.60e-01

Pos logit contributions
+0.952
rogen+0.943
ourse+0.920
inho+0.907
enstein+0.889
Neg logit contributions
idth-0.974
 absorbed-0.957
 resur-0.897
eston-0.883
etically-0.882
Token become
Token ID1716
Feature activation-3.28e-01

Pos logit contributions
+2.923
rogen+2.883
ourse+2.855
ury+2.766
enstein+2.740
Neg logit contributions
idth-4.684
 absorbed-4.391
etically-4.339
 absorbing-4.181
hler-4.162
Token an
Token ID281
Feature activation-6.52e-01

Pos logit contributions
+2.498
rogen+2.440
ourse+2.375
inho+2.345
ury+2.294
Neg logit contributions
idth-3.056
 absorbed-2.859
 resur-2.691
etically-2.680
eston-2.647
Token In
Token ID554
Feature activation-1.06e-01

Pos logit contributions
+5.595
rogen+5.475
ourse+5.357
inho+5.350
 Vul+5.139
Neg logit contributions
idth-5.387
 absorbed-4.864
 resur-4.613
MIC-4.568
etically-4.552
TokenDaily
Token ID28545
Feature activation-1.14e+00

Pos logit contributions
+1.261
rogen+1.247
ourse+1.237
inho+1.215
enstein+1.195
Neg logit contributions
idth-0.528
 absorbed-0.505
 resur-0.446
etically-0.435
eston-0.432
Token supporter
Token ID15525
Feature activation-5.34e-01

Pos logit contributions
+8.845
ourse+8.561
rogen+8.530
ury+8.252
igh+8.190
Neg logit contributions
idth-10.437
 absorbed-9.109
enta-8.762
hler-8.710
eston-8.694
Token.
Token ID13
Feature activation-3.02e-01

Pos logit contributions
+3.328
rogen+3.326
igh+3.108
ourse+3.095
ury+3.031
Neg logit contributions
idth-5.579
 absorbed-5.127
 carbohyd-5.013
 resur-4.996
eston-4.915
Token Powered
Token ID45090
Feature activation-2.39e-01

Pos logit contributions
+3.605
rogen+3.548
ourse+3.490
inho+3.483
 Vul+3.421
Neg logit contributions
idth-1.491
 absorbed-1.425
 resur-1.254
etically-1.221
eston-1.200
Token by
Token ID416
Feature activation-3.89e-01

Pos logit contributions
+1.882
rogen+1.867
ourse+1.818
inho+1.785
enstein+1.751
Neg logit contributions
idth-2.119
 absorbed-2.077
 resur-1.950
etically-1.932
eston-1.927
Token Press
Token ID4332
Feature activation-8.97e-02

Pos logit contributions
+2.346
rogen+2.336
ourse+2.225
inho+2.215
 Vul+2.211
Neg logit contributions
idth-4.240
 absorbed-3.938
 resur-3.767
etically-3.711
eston-3.703
Tokense
Token ID325
Feature activation-4.58e-01

Pos logit contributions
ourse+4.708
rogen+4.695
+4.679
inho+4.611
igh+4.448
Neg logit contributions
idth-4.493
 absorbed-4.420
 resur-4.110
 absorbing-4.008
eston-3.995
Tokenign
Token ID570
Feature activation-6.98e-01

Pos logit contributions
+2.943
ourse+2.930
rogen+2.913
inho+2.857
igh+2.782
Neg logit contributions
idth-4.581
 absorbed-4.470
etically-4.248
 resur-4.232
 absorbing-4.165
Tokenements
Token ID3196
Feature activation-7.61e-01

Pos logit contributions
ourse+4.854
rogen+4.849
inho+4.760
+4.695
igh+4.559
Neg logit contributions
idth-6.419
 absorbed-6.322
 resur-5.977
etically-5.834
eston-5.818
Token
Token ID40560
Feature activation-6.53e-02

Pos logit contributions
rogen+5.583
+5.556
ourse+5.551
inho+5.432
 Vul+5.333
Neg logit contributions
idth-6.707
 absorbed-6.234
etically-6.003
eston-5.963
 resur-5.860
Tokenc
Token ID66
Feature activation-3.17e-02

Pos logit contributions
+0.521
rogen+0.514
ourse+0.502
inho+0.493
enstein+0.484
Neg logit contributions
idth-0.573
 absorbed-0.562
 resur-0.528
etically-0.521
eston-0.521
Tokenents
Token ID658
Feature activation-1.13e+00

Pos logit contributions
+0.260
rogen+0.255
ourse+0.249
inho+0.245
enstein+0.241
Neg logit contributions
idth-0.272
 absorbed-0.267
 resur-0.250
etically-0.247
eston-0.247
Token ind
Token ID773
Feature activation-4.09e-01

Pos logit contributions
ourse+8.091
rogen+8.071
+8.029
inho+7.967
 Vul+7.705
Neg logit contributions
idth-9.412
 absorbed-8.648
etically-8.377
eston-8.309
 resur-8.122
Tokeniqu
Token ID1557
Feature activation-9.00e-02

Pos logit contributions
rogen+3.437
ourse+3.408
+3.341
inho+3.326
igh+3.249
Neg logit contributions
idth-3.339
 absorbed-3.261
 resur-3.053
 absorbing-2.954
etically-2.952
Tokenent
Token ID298
Feature activation-8.60e-01

Pos logit contributions
+0.721
rogen+0.711
ourse+0.695
inho+0.683
enstein+0.670
Neg logit contributions
idth-0.786
 absorbed-0.772
 resur-0.725
etically-0.715
eston-0.714
Token qu
Token ID627
Feature activation+2.30e-01

Pos logit contributions
rogen+6.427
ourse+6.421
inho+6.285
+6.197
ury+6.106
Neg logit contributions
idth-7.426
 absorbed-6.965
 resur-6.595
etically-6.591
eston-6.493
Token'
Token ID6
Feature activation-5.49e-02

Pos logit contributions
idth+1.791
 absorbed+1.750
 resur+1.633
eston+1.630
etically+1.629
Neg logit contributions
-2.081
rogen-2.008
ourse-1.931
enstein-1.913
inho-1.907
Token were
Token ID547
Feature activation-2.99e-02

Pos logit contributions
+0.285
rogen+0.281
ourse+0.275
inho+0.270
enstein+0.265
Neg logit contributions
idth-0.284
 absorbed-0.280
 resur-0.262
etically-0.258
eston-0.257
Token in
Token ID287
Feature activation-3.62e-02

Pos logit contributions
+0.265
rogen+0.262
ourse+0.258
inho+0.253
enstein+0.249
Neg logit contributions
idth-0.233
 absorbed-0.232
 resur-0.215
etically-0.212
eston-0.211
Token their
Token ID511
Feature activation+5.63e-02

Pos logit contributions
+0.297
rogen+0.294
ourse+0.288
inho+0.281
enstein+0.279
Neg logit contributions
idth-0.309
 absorbed-0.305
 resur-0.286
etically-0.282
eston-0.280
Token 30
Token ID1542
Feature activation-2.98e-01

Pos logit contributions
 absorbed+0.477
idth+0.472
 resur+0.447
 absorbing+0.433
etically+0.430
Neg logit contributions
-0.466
ourse-0.454
rogen-0.450
enstein-0.440
inho-0.439
Tokens
Token ID82
Feature activation-9.85e-02

Pos logit contributions
+1.970
rogen+1.959
ourse+1.876
inho+1.869
enstein+1.798
Neg logit contributions
idth-3.000
 absorbed-2.946
 resur-2.791
etically-2.781
eston-2.767
Token and
Token ID290
Feature activation-1.12e+00

Pos logit contributions
+0.808
rogen+0.798
ourse+0.781
inho+0.768
enstein+0.753
Neg logit contributions
idth-0.840
 absorbed-0.826
 resur-0.774
etically-0.763
eston-0.761
Token 40
Token ID2319
Feature activation-4.60e-01

Pos logit contributions
rogen+7.097
igh+6.261
+6.241
inho+6.137
enstein+6.020
Neg logit contributions
idth-11.027
eston-10.481
 absorbed-10.200
etically-10.112
hler-9.879
Tokens
Token ID82
Feature activation-1.54e-01

Pos logit contributions
rogen+2.795
+2.760
inho+2.671
ourse+2.632
igh+2.567
Neg logit contributions
idth-4.837
 absorbed-4.728
etically-4.468
eston-4.450
 resur-4.450
Token.
Token ID13
Feature activation+6.76e-02

Pos logit contributions
+1.261
rogen+1.246
ourse+1.217
inho+1.198
enstein+1.173
Neg logit contributions
idth-1.320
 absorbed-1.294
 resur-1.213
etically-1.198
eston-1.197
Token
Token ID447
Feature activation-1.26e-02

Pos logit contributions
idth+0.526
 absorbed+0.519
 resur+0.484
etically+0.480
eston+0.476
Neg logit contributions
rogen-0.590
-0.589
ourse-0.580
inho-0.564
enstein-0.558
Token
Token ID251
Feature activation-5.45e-02

Pos logit contributions
+0.128
rogen+0.127
ourse+0.125
inho+0.123
enstein+0.121
Neg logit contributions
idth-0.083
 absorbed-0.082
 resur-0.075
etically-0.074
eston-0.073
Token,
Token ID11
Feature activation-1.09e-01

Pos logit contributions
+0.003
rogen+0.003
ourse+0.003
inho+0.003
enstein+0.003
Neg logit contributions
idth-0.002
 absorbed-0.002
 resur-0.002
eston-0.002
etically-0.002
Token and
Token ID290
Feature activation+9.42e-02

Pos logit contributions
+0.868
rogen+0.851
ourse+0.837
inho+0.820
enstein+0.801
Neg logit contributions
idth-0.953
 absorbed-0.941
 resur-0.886
etically-0.870
eston-0.869
Token convert
Token ID10385
Feature activation-3.35e-01

Pos logit contributions
 absorbed+0.739
idth+0.731
 resur+0.700
etically+0.668
 absorbing+0.667
Neg logit contributions
-0.855
ourse-0.819
rogen-0.803
inho-0.793
enstein-0.772
Token it
Token ID340
Feature activation-1.69e-01

Pos logit contributions
rogen+2.609
+2.553
inho+2.493
ourse+2.471
enstein+2.409
Neg logit contributions
idth-2.992
 absorbed-2.891
 resur-2.709
etically-2.683
eston-2.680
Token to
Token ID284
Feature activation-9.46e-01

Pos logit contributions
+1.364
rogen+1.349
ourse+1.316
inho+1.296
enstein+1.270
Neg logit contributions
idth-1.465
 absorbed-1.436
 resur-1.347
etically-1.330
eston-1.328
Token gr
Token ID1036
Feature activation-1.11e+00

Pos logit contributions
rogen+3.684
+3.572
igh+3.440
 Vul+3.299
inho+3.294
Neg logit contributions
idth-11.589
 absorbed-10.912
eston-10.696
 carbohyd-10.566
hler-10.534
Tokenays
Token ID592
Feature activation-1.28e-01

Pos logit contributions
rogen+5.706
igh+5.552
ourse+5.445
ury+5.354
+5.148
Neg logit contributions
 absorbed-11.611
 carbohyd-11.300
idth-11.297
 resur-10.916
 absorbing-10.758
Tokencale
Token ID38765
Feature activation+8.21e-02

Pos logit contributions
+0.724
rogen+0.710
ourse+0.687
inho+0.670
enstein+0.651
Neg logit contributions
idth-1.418
 absorbed-1.397
 resur-1.331
etically-1.317
eston-1.316
Token image
Token ID2939
Feature activation+6.21e-02

Pos logit contributions
 absorbed+0.659
idth+0.656
 resur+0.615
eston+0.597
etically+0.596
Neg logit contributions
-0.721
rogen-0.703
ourse-0.696
inho-0.681
enstein-0.672
Token =
Token ID796
Feature activation-1.51e-01

Pos logit contributions
idth+0.523
 absorbed+0.512
 resur+0.483
eston+0.478
etically+0.476
Neg logit contributions
-0.512
rogen-0.508
ourse-0.499
inho-0.487
enstein-0.478
Token c
Token ID269
Feature activation-2.67e-01

Pos logit contributions
+0.750
rogen+0.734
ourse+0.708
inho+0.687
enstein+0.664
Neg logit contributions
idth-1.780
 absorbed-1.758
 resur-1.680
etically-1.662
eston-1.661
Token le
Token ID443
Feature activation-2.19e-01

Pos logit contributions
rogen+3.669
+3.641
ourse+3.538
inho+3.482
igh+3.435
Neg logit contributions
idth-4.370
 absorbed-4.163
etically-3.979
 resur-3.919
eston-3.895
Token d
Token ID288
Feature activation-7.15e-01

Pos logit contributions
+1.675
rogen+1.660
ourse+1.633
inho+1.594
enstein+1.563
Neg logit contributions
idth-1.977
 absorbed-1.937
 resur-1.816
etically-1.805
eston-1.801
Tokenés
Token ID20954
Feature activation+2.58e-01

Pos logit contributions
ourse+5.311
rogen+5.198
inho+5.004
ury+5.001
+4.988
Neg logit contributions
 absorbed-6.330
idth-6.277
 resur-5.952
 absorbing-5.812
etically-5.775
Tokenacc
Token ID4134
Feature activation-2.52e-01

Pos logit contributions
idth+2.266
 absorbed+2.224
 resur+2.078
eston+2.057
etically+2.055
Neg logit contributions
-2.041
rogen-2.004
ourse-1.910
inho-1.903
enstein-1.882
Tokenord
Token ID585
Feature activation-8.14e-01

Pos logit contributions
rogen+1.529
+1.514
ourse+1.494
inho+1.456
enstein+1.401
Neg logit contributions
idth-2.667
 absorbed-2.636
 resur-2.508
etically-2.472
eston-2.462
Token persist
Token ID21160
Feature activation-1.07e+00

Pos logit contributions
rogen+6.113
ourse+6.104
inho+5.991
+5.965
ury+5.824
Neg logit contributions
idth-7.028
 absorbed-6.505
etically-6.323
eston-6.167
 resur-6.164
Tokene
Token ID68
Feature activation-6.50e-01

Pos logit contributions
ourse+8.847
rogen+8.556
ury+8.554
inho+8.516
enstein+8.323
Neg logit contributions
idth-8.146
 absorbed-7.823
 resur-7.407
MIC-7.175
etically-7.131
Token,
Token ID11
Feature activation-6.14e-01

Pos logit contributions
rogen+4.673
ourse+4.656
+4.564
inho+4.506
igh+4.454
Neg logit contributions
idth-5.930
 absorbed-5.642
etically-5.368
 resur-5.340
eston-5.300
Token les
Token ID10287
Feature activation+2.10e-01

Pos logit contributions
rogen+4.472
+4.453
ourse+4.371
inho+4.278
 Vul+4.212
Neg logit contributions
idth-5.613
 absorbed-5.279
etically-5.059
eston-5.035
 resur-5.005
Token F
Token ID376
Feature activation+3.97e-03

Pos logit contributions
 absorbed+1.801
idth+1.794
 resur+1.695
etically+1.647
eston+1.639
Neg logit contributions
-1.705
rogen-1.681
ourse-1.631
inho-1.607
enstein-1.579
Tokenond
Token ID623
Feature activation+3.95e-01

Pos logit contributions
idth+0.035
 absorbed+0.034
 resur+0.032
etically+0.032
eston+0.032
Neg logit contributions
-0.032
rogen-0.031
ourse-0.030
inho-0.030
enstein-0.029
Token M
Token ID337
Feature activation+3.68e-01

Pos logit contributions
idth+1.715
 absorbed+1.676
 resur+1.580
etically+1.568
eston+1.561
Neg logit contributions
-1.425
rogen-1.400
ourse-1.355
inho-1.328
enstein-1.294
Tokenuther
Token ID12866
Feature activation-4.15e-02

Pos logit contributions
idth+3.305
 absorbed+3.162
eston+3.010
 resur+2.988
etically+2.977
Neg logit contributions
-2.951
rogen-2.809
ourse-2.719
inho-2.692
enstein-2.656
Tokenh
Token ID71
Feature activation-1.70e-01

Pos logit contributions
+0.339
rogen+0.335
ourse+0.328
inho+0.325
enstein+0.318
Neg logit contributions
idth-0.354
 absorbed-0.349
 resur-0.327
etically-0.322
eston-0.321
Tokenucker
Token ID12603
Feature activation+3.81e-01

Pos logit contributions
+1.404
rogen+1.393
ourse+1.363
inho+1.339
enstein+1.311
Neg logit contributions
idth-1.436
 absorbed-1.413
 resur-1.326
etically-1.307
eston-1.288
Token 2008
Token ID3648
Feature activation-1.25e-01

Pos logit contributions
idth+3.072
 absorbed+3.007
 resur+2.830
eston+2.807
etically+2.791
Neg logit contributions
-3.288
rogen-3.209
ourse-3.148
inho-3.019
enstein-2.985
Token01
Token ID486
Feature activation-1.07e+00

Pos logit contributions
+0.896
rogen+0.880
ourse+0.857
inho+0.842
enstein+0.825
Neg logit contributions
idth-1.208
 absorbed-1.189
 resur-1.122
etically-1.110
eston-1.107
Token31
Token ID3132
Feature activation-2.08e-01

Pos logit contributions
rogen+5.426
+5.329
enstein+4.912
ourse+4.808
inho+4.753
Neg logit contributions
idth-11.530
 absorbed-11.435
 resur-10.928
eston-10.829
etically-10.643
Token O
Token ID440
Feature activation+2.88e-01

Pos logit contributions
+1.389
rogen+1.367
ourse+1.320
inho+1.315
enstein+1.281
Neg logit contributions
idth-2.108
 absorbed-2.065
 resur-1.940
eston-1.927
etically-1.925
Token&
Token ID5
Feature activation+6.55e-01

Pos logit contributions
idth+2.023
 absorbed+1.951
 resur+1.872
eston+1.864
etically+1.859
Neg logit contributions
-2.697
ourse-2.647
rogen-2.646
inho-2.579
enstein-2.531
TokenA
Token ID32
Feature activation+2.63e-01

Pos logit contributions
MIC+4.715
idth+4.401
eston+4.290
 absorbed+4.272
Property+4.248
Neg logit contributions
 redes-5.703
rogen-5.671
ourse-5.610
-5.489
enstein-5.288
Token-
Token ID12
Feature activation+1.12e-01

Pos logit contributions
idth+2.157
 absorbed+2.123
eston+2.033
 resur+2.032
etically+2.031
Neg logit contributions
-2.125
rogen-2.115
ourse-2.087
inho-1.993
enstein-1.953
Token trying
Token ID2111
Feature activation-2.14e-01

Pos logit contributions
+0.590
rogen+0.582
ourse+0.570
inho+0.560
enstein+0.549
Neg logit contributions
idth-0.605
 absorbed-0.596
 resur-0.560
etically-0.549
eston-0.548
Token to
Token ID284
Feature activation-2.27e-01

Pos logit contributions
+1.660
rogen+1.632
ourse+1.589
inho+1.568
enstein+1.544
Neg logit contributions
idth-1.910
 absorbed-1.859
 resur-1.745
eston-1.737
etically-1.733
Token stand
Token ID1302
Feature activation-1.09e-01

Pos logit contributions
+1.873
rogen+1.861
ourse+1.812
inho+1.788
enstein+1.757
Neg logit contributions
idth-1.926
 absorbed-1.871
 resur-1.745
etically-1.743
eston-1.740
Token toe
Token ID21189
Feature activation-4.61e-01

Pos logit contributions
+1.027
rogen+1.014
ourse+0.997
inho+0.979
enstein+0.963
Neg logit contributions
idth-0.798
 absorbed-0.786
 resur-0.729
etically-0.714
eston-0.712
Token-
Token ID12
Feature activation-5.05e-01

Pos logit contributions
+3.197
rogen+3.162
inho+3.039
ourse+3.004
igh+2.985
Neg logit contributions
idth-4.376
 absorbed-4.263
eston-4.098
 resur-4.070
etically-3.937
Tokento
Token ID1462
Feature activation-1.06e+00

Pos logit contributions
rogen+3.388
+3.386
inho+3.363
ourse+3.306
enstein+3.254
Neg logit contributions
idth-4.879
 absorbed-4.800
 resur-4.590
eston-4.539
 absorbing-4.418
Token-
Token ID12
Feature activation-1.29e+00

Pos logit contributions
enstein+5.288
igh+5.228
rogen+5.022
inho+5.012
ourse+4.953
Neg logit contributions
 carbohyd-11.453
 absorbed-11.262
idth-11.239
 resur-10.686
 absorbing-10.647
Tokentoe
Token ID44579
Feature activation-2.08e-01

Pos logit contributions
enstein+2.468
igh+2.107
inho+1.969
ourse+1.755
adem+1.698
Neg logit contributions
 absorbed-17.777
 resur-17.435
 carbohyd-17.371
 absorbing-17.244
PDATE-17.043
Token with
Token ID351
Feature activation-1.52e-02

Pos logit contributions
+1.602
rogen+1.594
ourse+1.549
inho+1.532
enstein+1.502
Neg logit contributions
idth-1.868
 absorbed-1.821
 resur-1.712
eston-1.703
etically-1.694
Token the
Token ID262
Feature activation+1.67e-01

Pos logit contributions
+0.127
rogen+0.124
ourse+0.122
inho+0.119
enstein+0.117
Neg logit contributions
idth-0.127
 absorbed-0.126
 resur-0.119
etically-0.116
eston-0.115
Token world
Token ID995
Feature activation+1.85e-01

Pos logit contributions
 absorbed+1.392
idth+1.367
 resur+1.311
 absorbing+1.263
etically+1.251
Neg logit contributions
-1.408
rogen-1.359
ourse-1.345
inho-1.313
enstein-1.289
Tokeni
Token ID72
Feature activation+2.74e-01

Pos logit contributions
idth+1.588
 absorbed+1.539
 resur+1.451
etically+1.434
eston+1.433
Neg logit contributions
-1.468
rogen-1.438
ourse-1.415
inho-1.376
enstein-1.343
Token,
Token ID11
Feature activation-1.29e-01

Pos logit contributions
idth+2.114
 absorbed+2.054
 resur+1.907
etically+1.890
eston+1.883
Neg logit contributions
-2.480
rogen-2.425
ourse-2.422
inho-2.350
enstein-2.294
Token 1
Token ID352
Feature activation+1.11e-01

Pos logit contributions
+0.936
rogen+0.926
ourse+0.902
inho+0.886
enstein+0.868
Neg logit contributions
idth-1.217
 absorbed-1.200
 resur-1.133
etically-1.119
eston-1.117
Token,
Token ID11
Feature activation+5.33e-02

Pos logit contributions
idth+0.915
 absorbed+0.896
 resur+0.839
etically+0.828
eston+0.825
Neg logit contributions
-0.942
rogen-0.928
ourse-0.916
inho-0.893
enstein-0.873
Token Length
Token ID22313
Feature activation+2.19e-01

Pos logit contributions
idth+0.594
 absorbed+0.584
 resur+0.555
etically+0.550
eston+0.548
Neg logit contributions
-0.304
rogen-0.294
ourse-0.287
inho-0.278
enstein-0.269
Token[
Token ID58
Feature activation-1.05e+00

Pos logit contributions
idth+1.276
 absorbed+1.234
etically+1.141
 resur+1.135
eston+1.121
Neg logit contributions
-2.322
rogen-2.319
ourse-2.297
inho-2.240
enstein-2.194
Tokengraph
Token ID34960
Feature activation-3.83e-01

Pos logit contributions
rogen+5.286
enstein+4.948
+4.918
ourse+4.904
igh+4.774
Neg logit contributions
 absorbed-11.017
idth-10.713
 resur-10.387
 absorbing-10.165
etically-10.143
TokenList
Token ID8053
Feature activation-1.49e-01

Pos logit contributions
+2.123
rogen+2.088
ourse+2.017
inho+1.975
enstein+1.929
Neg logit contributions
idth-4.312
 absorbed-4.263
 resur-4.025
eston-3.940
 absorbing-3.937
Token]}
Token ID48999
Feature activation+3.19e-03

Pos logit contributions
+1.500
rogen+1.481
ourse+1.455
inho+1.443
enstein+1.423
Neg logit contributions
idth-1.002
 absorbed-0.980
 resur-0.895
eston-0.880
etically-0.872
Token]
Token ID60
Feature activation-1.14e-01

Pos logit contributions
idth+0.030
 absorbed+0.029
 resur+0.028
etically+0.027
eston+0.027
Neg logit contributions
-0.024
rogen-0.023
ourse-0.023
inho-0.022
enstein-0.022
Token];
Token ID11208
Feature activation-2.22e-01

Pos logit contributions
+0.902
rogen+0.897
ourse+0.874
inho+0.860
enstein+0.846
Neg logit contributions
idth-0.995
 absorbed-0.977
 resur-0.916
eston-0.909
etically-0.906
Token Richard
Token ID6219
Feature activation-6.38e-01

Pos logit contributions
+1.619
rogen+1.611
ourse+1.568
inho+1.550
enstein+1.525
Neg logit contributions
idth-1.616
 absorbed-1.568
 resur-1.469
eston-1.451
etically-1.450
Token Nixon
Token ID16642
Feature activation-1.44e-01

Pos logit contributions
rogen+4.604
inho+4.512
+4.437
enstein+4.412
ourse+4.373
Neg logit contributions
idth-6.089
 absorbed-5.855
 carbohyd-5.571
 resur-5.445
eston-5.363
Token in
Token ID287
Feature activation-2.68e-01

Pos logit contributions
+1.211
rogen+1.197
ourse+1.170
inho+1.154
enstein+1.138
Neg logit contributions
idth-1.197
 absorbed-1.164
 resur-1.086
etically-1.078
eston-1.075
Token a
Token ID257
Feature activation-8.69e-02

Pos logit contributions
+1.896
rogen+1.872
ourse+1.825
inho+1.805
enstein+1.769
Neg logit contributions
idth-2.577
 absorbed-2.517
 resur-2.378
etically-2.351
eston-2.350
Token Pants
Token ID41689
Feature activation-1.03e-03

Pos logit contributions
+0.739
rogen+0.730
ourse+0.714
inho+0.705
enstein+0.693
Neg logit contributions
idth-0.715
 absorbed-0.700
 resur-0.654
eston-0.645
etically-0.645
Tokenuit
Token ID5013
Feature activation-1.04e+00

Pos logit contributions
+0.014
rogen+0.014
ourse+0.014
inho+0.013
enstein+0.013
Neg logit contributions
idth-0.003
 absorbed-0.003
 resur-0.003
etically-0.003
eston-0.003
Token was
Token ID373
Feature activation-4.39e-01

Pos logit contributions
rogen+7.140
+7.116
enstein+6.924
ourse+6.798
igh+6.726
Neg logit contributions
idth-9.201
 absorbed-8.417
eston-8.336
etically-8.281
 absorbing-7.979
Token originally
Token ID6198
Feature activation-7.24e-01

Pos logit contributions
rogen+3.116
+3.094
ourse+2.987
inho+2.943
enstein+2.920
Neg logit contributions
idth-4.180
 absorbed-3.969
etically-3.792
eston-3.758
 resur-3.732
Token published
Token ID3199
Feature activation-1.38e-01

Pos logit contributions
+4.837
rogen+4.836
ourse+4.752
igh+4.700
enstein+4.692
Neg logit contributions
idth-6.908
 absorbed-6.436
etically-6.233
eston-6.229
 absorbing-6.056
Token at
Token ID379
Feature activation-8.58e-02

Pos logit contributions
+1.164
rogen+1.149
ourse+1.125
inho+1.106
enstein+1.088
Neg logit contributions
idth-1.143
 absorbed-1.117
 resur-1.042
etically-1.031
eston-1.029
Token The
Token ID383
Feature activation+2.67e-01

Pos logit contributions
+0.713
rogen+0.704
ourse+0.688
inho+0.677
enstein+0.663
Neg logit contributions
idth-0.723
 absorbed-0.711
 resur-0.668
etically-0.657
eston-0.656
Token Air
Token ID3701
Feature activation-1.10e-01

Pos logit contributions
 absorbed+3.442
idth+3.425
 resur+3.281
etically+3.205
eston+3.199
Neg logit contributions
-2.790
rogen-2.712
ourse-2.622
inho-2.612
enstein-2.524
Token Station
Token ID9327
Feature activation+3.00e-01

Pos logit contributions
+0.891
rogen+0.882
ourse+0.864
inho+0.848
enstein+0.832
Neg logit contributions
idth-0.958
 absorbed-0.940
 resur-0.881
etically-0.869
eston-0.866
Token in
Token ID287
Feature activation+7.15e-01

Pos logit contributions
 absorbed+2.154
idth+2.137
 resur+2.009
eston+1.955
etically+1.942
Neg logit contributions
-2.797
rogen-2.757
ourse-2.739
inho-2.650
enstein-2.616
Token South
Token ID2520
Feature activation-4.35e-01

Pos logit contributions
 absorbed+4.879
idth+4.828
 resur+4.517
eston+4.502
etically+4.349
Neg logit contributions
-6.710
rogen-6.590
ourse-6.465
inho-6.213
enstein-6.189
Token We
Token ID775
Feature activation+2.61e-01

Pos logit contributions
+3.499
ourse+3.448
rogen+3.422
inho+3.358
enstein+3.314
Neg logit contributions
 absorbed-3.594
idth-3.565
 resur-3.318
etically-3.311
 absorbing-3.230
Tokenym
Token ID4948
Feature activation-1.04e+00

Pos logit contributions
idth+1.493
 absorbed+1.447
 resur+1.365
etically+1.355
eston+1.349
Neg logit contributions
-2.856
rogen-2.781
ourse-2.719
inho-2.681
 Vul-2.663
Tokenouth
Token ID1536
Feature activation-4.39e-02

Pos logit contributions
ourse+3.746
igh+3.378
ury+3.317
rogen+3.313
inho+3.255
Neg logit contributions
 absorbed-13.126
idth-12.793
 carbohyd-12.728
 absorbing-12.385
MIC-12.282
Token.
Token ID13
Feature activation-4.28e-02

Pos logit contributions
+0.358
rogen+0.355
ourse+0.347
inho+0.341
enstein+0.334
Neg logit contributions
idth-0.374
 absorbed-0.368
 resur-0.347
etically-0.341
eston-0.341
Token This
Token ID770
Feature activation+1.88e-01

Pos logit contributions
+0.371
rogen+0.367
ourse+0.359
inho+0.353
enstein+0.347
Neg logit contributions
idth-0.346
 absorbed-0.339
 resur-0.317
etically-0.312
eston-0.312
Token is
Token ID318
Feature activation+1.42e-01

Pos logit contributions
idth+1.588
 absorbed+1.587
 resur+1.515
etically+1.462
eston+1.457
Neg logit contributions
-1.549
ourse-1.480
rogen-1.469
inho-1.456
enstein-1.410
Token an
Token ID281
Feature activation-1.69e-01

Pos logit contributions
idth+1.075
 absorbed+1.074
 resur+1.007
etically+0.981
eston+0.976
Neg logit contributions
-1.290
rogen-1.259
ourse-1.244
inho-1.222
enstein-1.194
Token to
Token ID284
Feature activation-1.24e-01

Pos logit contributions
+0.686
rogen+0.675
ourse+0.661
inho+0.650
enstein+0.638
Neg logit contributions
idth-0.720
 absorbed-0.708
 resur-0.664
etically-0.654
eston-0.652
Token 300
Token ID5867
Feature activation-3.12e-01

Pos logit contributions
+1.012
rogen+1.002
ourse+0.978
inho+0.962
enstein+0.943
Neg logit contributions
idth-1.060
 absorbed-1.035
 resur-0.971
eston-0.960
etically-0.960
Token,
Token ID11
Feature activation-1.96e-01

Pos logit contributions
+2.608
rogen+2.568
ourse+2.506
inho+2.455
igh+2.450
Neg logit contributions
idth-2.608
 absorbed-2.503
etically-2.350
 resur-2.348
eston-2.315
Token000
Token ID830
Feature activation-4.64e-01

Pos logit contributions
+1.387
rogen+1.369
ourse+1.336
inho+1.313
enstein+1.280
Neg logit contributions
idth-1.889
 absorbed-1.851
 resur-1.753
etically-1.734
eston-1.732
Token barrels
Token ID17907
Feature activation-1.73e-01

Pos logit contributions
+3.023
rogen+2.968
ourse+2.893
igh+2.831
inho+2.792
Neg logit contributions
idth-4.681
 absorbed-4.519
etically-4.296
 resur-4.294
eston-4.245
Token a
Token ID257
Feature activation-1.03e+00

Pos logit contributions
+1.616
rogen+1.601
ourse+1.571
inho+1.546
enstein+1.517
Neg logit contributions
idth-1.285
 absorbed-1.253
 resur-1.165
etically-1.148
eston-1.146
Token day
Token ID1110
Feature activation-3.06e-01

Pos logit contributions
+5.765
igh+5.709
rogen+5.685
ourse+5.514
inho+5.370
Neg logit contributions
idth-10.887
etically-9.754
 absorbed-9.739
hler-9.648
eteria-9.633
Token.
Token ID13
Feature activation+4.20e-02

Pos logit contributions
+2.387
rogen+2.385
ourse+2.324
inho+2.273
enstein+2.224
Neg logit contributions
idth-2.706
 absorbed-2.602
 resur-2.467
eston-2.456
etically-2.431
Token Virt
Token ID11285
Feature activation+5.15e-02

Pos logit contributions
idth+0.357
 absorbed+0.351
 resur+0.329
etically+0.325
eston+0.324
Neg logit contributions
-0.346
rogen-0.341
ourse-0.334
inho-0.328
enstein-0.322
Tokenually
Token ID935
Feature activation-2.88e-01

Pos logit contributions
idth+0.284
 absorbed+0.276
 resur+0.249
etically+0.244
eston+0.243
Neg logit contributions
-0.577
rogen-0.572
ourse-0.563
inho-0.556
enstein-0.548
Token all
Token ID477
Feature activation-3.87e-02

Pos logit contributions
rogen+2.460
+2.421
ourse+2.365
inho+2.339
enstein+2.294
Neg logit contributions
idth-2.362
 absorbed-2.264
eston-2.129
 resur-2.118
etically-2.116
Token may
Token ID743
Feature activation-1.48e-01

Pos logit contributions
+0.092
rogen+0.090
ourse+0.088
inho+0.087
enstein+0.085
Neg logit contributions
idth-0.093
 absorbed-0.093
 resur-0.087
eston-0.085
etically-0.085
Token be
Token ID307
Feature activation+5.61e-02

Pos logit contributions
+1.209
rogen+1.191
ourse+1.167
inho+1.146
enstein+1.124
Neg logit contributions
idth-1.279
 absorbed-1.260
 resur-1.184
eston-1.163
etically-1.163
Token repeated
Token ID5100
Feature activation-2.45e-02

Pos logit contributions
 absorbed+0.472
idth+0.470
 resur+0.437
eston+0.429
etically+0.427
Neg logit contributions
-0.466
rogen-0.460
ourse-0.451
inho-0.443
enstein-0.436
Token to
Token ID284
Feature activation-1.46e-01

Pos logit contributions
+0.206
rogen+0.202
ourse+0.199
inho+0.195
enstein+0.191
Neg logit contributions
idth-0.204
 absorbed-0.203
 resur-0.190
etically-0.187
eston-0.185
Token your
Token ID534
Feature activation-8.95e-01

Pos logit contributions
+1.214
rogen+1.199
ourse+1.173
inho+1.154
enstein+1.132
Neg logit contributions
idth-1.232
 absorbed-1.210
 resur-1.134
etically-1.118
eston-1.116
Token own
Token ID898
Feature activation-1.03e+00

Pos logit contributions
rogen+6.955
+6.778
ourse+6.650
 Vul+6.568
ury+6.449
Neg logit contributions
idth-7.630
 absorbed-6.894
eston-6.871
etically-6.800
MIC-6.706
Token prejudice
Token ID20226
Feature activation-1.68e-01

Pos logit contributions
rogen+7.724
+7.544
ourse+7.502
 Vul+7.352
ury+7.244
Neg logit contributions
idth-8.764
 absorbed-7.837
etically-7.780
eston-7.779
MIC-7.778
Token.
Token ID13
Feature activation-1.47e-01

Pos logit contributions
+1.343
rogen+1.327
ourse+1.296
inho+1.272
enstein+1.249
Neg logit contributions
idth-1.474
 absorbed-1.439
 resur-1.353
etically-1.338
eston-1.336
Token
Token ID447
Feature activation-2.18e-01

Pos logit contributions
+1.186
rogen+1.164
ourse+1.144
inho+1.120
enstein+1.101
Neg logit contributions
idth-1.269
 absorbed-1.252
 resur-1.175
etically-1.158
eston-1.153
Token
Token ID251
Feature activation-2.71e-01

Pos logit contributions
+1.599
rogen+1.582
ourse+1.528
inho+1.507
enstein+1.470
Neg logit contributions
idth-2.055
 absorbed-2.024
 resur-1.906
etically-1.879
eston-1.878
Token
Token ID198
Feature activation-1.70e-01

Pos logit contributions
+1.982
rogen+1.927
ourse+1.883
inho+1.840
enstein+1.813
Neg logit contributions
idth-2.550
 absorbed-2.523
 resur-2.382
etically-2.352
eston-2.340
Token Award
Token ID11289
Feature activation+3.04e-02

Pos logit contributions
+1.872
rogen+1.845
ourse+1.808
inho+1.779
enstein+1.737
Neg logit contributions
idth-1.845
 absorbed-1.818
 resur-1.703
etically-1.677
eston-1.664
Token nominations
Token ID30994
Feature activation-2.92e-01

Pos logit contributions
idth+0.193
 absorbed+0.189
 resur+0.173
etically+0.169
eston+0.169
Neg logit contributions
-0.316
rogen-0.312
ourse-0.307
inho-0.304
enstein-0.299
Token are
Token ID389
Feature activation+5.90e-02

Pos logit contributions
+1.984
rogen+1.980
ourse+1.904
inho+1.857
enstein+1.818
Neg logit contributions
idth-2.904
 absorbed-2.816
 resur-2.664
eston-2.658
etically-2.655
Token accepted
Token ID6292
Feature activation-4.84e-01

Pos logit contributions
 absorbed+0.586
idth+0.578
 resur+0.551
eston+0.541
etically+0.538
Neg logit contributions
-0.407
rogen-0.392
ourse-0.388
inho-0.382
enstein-0.372
Token year
Token ID614
Feature activation-2.34e-01

Pos logit contributions
rogen+3.780
+3.746
ourse+3.627
inho+3.539
enstein+3.514
Neg logit contributions
idth-4.218
 absorbed-4.071
etically-3.844
eston-3.829
 resur-3.789
Token-
Token ID12
Feature activation-1.00e+00

Pos logit contributions
+1.633
rogen+1.619
ourse+1.570
inho+1.542
enstein+1.506
Neg logit contributions
idth-2.274
 absorbed-2.227
 resur-2.101
eston-2.083
etically-2.077
Tokenround
Token ID744
Feature activation-4.72e-01

Pos logit contributions
rogen+5.295
ury+4.962
ourse+4.914
+4.876
igh+4.854
Neg logit contributions
idth-11.042
 absorbed-10.879
 carbohyd-10.805
 absorbing-10.210
 resur-10.115
Token;
Token ID26
Feature activation-1.18e-01

Pos logit contributions
+2.762
rogen+2.751
ourse+2.663
enstein+2.528
inho+2.497
Neg logit contributions
idth-4.980
 absorbed-4.890
eston-4.660
 resur-4.650
etically-4.634
Token the
Token ID262
Feature activation+5.02e-04

Pos logit contributions
+0.919
rogen+0.905
ourse+0.885
inho+0.872
enstein+0.852
Neg logit contributions
idth-1.052
 absorbed-1.033
 resur-0.974
eston-0.958
etically-0.958
Token deadline
Token ID12508
Feature activation-1.74e-01

Pos logit contributions
idth+0.003
 absorbed+0.003
 resur+0.003
eston+0.003
etically+0.003
Neg logit contributions
-0.005
rogen-0.005
ourse-0.005
inho-0.005
enstein-0.005
Token for
Token ID329
Feature activation+1.97e-01

Pos logit contributions
+1.419
rogen+1.406
ourse+1.369
inho+1.349
enstein+1.324
Neg logit contributions
idth-1.489
 absorbed-1.458
 resur-1.366
etically-1.349
eston-1.348
Token the
Token ID262
Feature activation-9.84e-02

Pos logit contributions
rogen+2.603
+2.587
ourse+2.531
inho+2.444
 Vul+2.438
Neg logit contributions
idth-3.384
 absorbed-3.262
eston-3.085
etically-3.067
 resur-3.058
Token western
Token ID8830
Feature activation-5.10e-01

Pos logit contributions
+0.680
rogen+0.675
ourse+0.655
inho+0.641
enstein+0.627
Neg logit contributions
idth-0.966
 absorbed-0.944
 resur-0.892
etically-0.887
eston-0.887
Token edge
Token ID5743
Feature activation-3.94e-01

Pos logit contributions
rogen+4.186
+4.181
ourse+4.053
 Vul+4.029
inho+3.963
Neg logit contributions
idth-4.244
 absorbed-3.978
etically-3.802
eston-3.768
 resur-3.689
Token of
Token ID286
Feature activation-1.04e-01

Pos logit contributions
+2.962
rogen+2.946
ourse+2.855
inho+2.762
enstein+2.744
Neg logit contributions
idth-3.587
 absorbed-3.453
eston-3.270
etically-3.241
 resur-3.239
Token Mare
Token ID36989
Feature activation-5.96e-03

Pos logit contributions
+0.797
rogen+0.787
ourse+0.768
inho+0.754
enstein+0.739
Neg logit contributions
idth-0.952
 absorbed-0.935
 resur-0.880
etically-0.870
eston-0.869
Token Tr
Token ID833
Feature activation-9.98e-01

Pos logit contributions
+0.065
rogen+0.064
ourse+0.063
inho+0.063
enstein+0.062
Neg logit contributions
idth-0.035
 absorbed-0.034
 resur-0.031
etically-0.030
eston-0.030
Tokenanqu
Token ID26184
Feature activation-1.50e-01

Pos logit contributions
ourse+5.770
rogen+5.756
inho+5.633
+5.432
igh+5.411
Neg logit contributions
idth-10.307
 absorbed-10.208
 carbohyd-9.891
 resur-9.551
 absorbing-9.538
Tokenill
Token ID359
Feature activation+9.82e-02

Pos logit contributions
+1.233
rogen+1.220
ourse+1.195
inho+1.176
enstein+1.150
Neg logit contributions
idth-1.280
 absorbed-1.258
 resur-1.179
etically-1.161
eston-1.160
Tokenit
Token ID270
Feature activation-3.43e-01

Pos logit contributions
idth+0.507
 absorbed+0.484
 resur+0.434
eston+0.432
etically+0.428
Neg logit contributions
-1.149
rogen-1.127
ourse-1.111
inho-1.094
enstein-1.083
Tokenatis
Token ID37749
Feature activation-2.67e-01

Pos logit contributions
rogen+1.703
ourse+1.648
+1.643
inho+1.609
igh+1.528
Neg logit contributions
idth-3.957
 absorbed-3.934
 resur-3.752
etically-3.681
eston-3.681
Token 65
Token ID6135
Feature activation+2.64e-03

Pos logit contributions
+2.129
rogen+2.119
ourse+2.067
inho+2.020
enstein+1.987
Neg logit contributions
idth-2.340
 absorbed-2.252
 resur-2.110
etically-2.108
eston-2.102
Token version
Token ID2196
Feature activation+4.80e-01

Pos logit contributions
idth+1.646
 absorbed+1.638
 resur+1.539
eston+1.496
etically+1.492
Neg logit contributions
-1.759
rogen-1.738
ourse-1.687
inho-1.676
enstein-1.650
Token will
Token ID481
Feature activation+5.37e-01

Pos logit contributions
 absorbed+3.784
idth+3.718
 resur+3.528
eston+3.440
etically+3.381
Neg logit contributions
-4.230
rogen-4.120
ourse-4.056
inho-3.986
enstein-3.869
Token be
Token ID307
Feature activation+3.63e-01

Pos logit contributions
 absorbed+4.074
idth+4.001
 resur+3.847
eston+3.763
etically+3.696
Neg logit contributions
-4.892
ourse-4.662
rogen-4.607
inho-4.545
enstein-4.454
Token with
Token ID351
Feature activation+1.29e-01

Pos logit contributions
 absorbed+2.862
idth+2.801
 resur+2.663
 absorbing+2.574
etically+2.567
Neg logit contributions
-3.189
ourse-3.103
rogen-3.093
inho-3.050
enstein-2.977
Token the
Token ID262
Feature activation+2.77e-01

Pos logit contributions
idth+1.084
 absorbed+1.074
 resur+1.008
eston+0.984
etically+0.983
Neg logit contributions
-1.071
rogen-1.054
ourse-1.040
inho-1.023
enstein-0.987
Token PC
Token ID4217
Feature activation+5.54e-01

Pos logit contributions
 absorbed+2.293
idth+2.267
 resur+2.160
 absorbing+2.071
etically+2.062
Neg logit contributions
-2.297
ourse-2.231
rogen-2.222
inho-2.188
enstein-2.089
Token version
Token ID2196
Feature activation+2.59e-01

Pos logit contributions
 absorbed+4.172
idth+4.065
 resur+3.887
eston+3.782
 absorbing+3.733
Neg logit contributions
-4.774
rogen-4.725
ourse-4.653
inho-4.599
enstein-4.442
Token upon
Token ID2402
Feature activation-1.25e-01

Pos logit contributions
 absorbed+2.080
idth+2.067
 resur+1.937
eston+1.894
etically+1.876
Neg logit contributions
-2.241
rogen-2.204
ourse-2.168
inho-2.125
enstein-2.073
Token its
Token ID663
Feature activation-2.58e-01

Pos logit contributions
+1.023
rogen+1.011
ourse+0.988
inho+0.971
enstein+0.954
Neg logit contributions
idth-1.065
 absorbed-1.044
 resur-0.978
etically-0.966
eston-0.964
Token release
Token ID2650
Feature activation-2.39e-02

Pos logit contributions
+2.120
rogen+2.116
ourse+2.042
inho+2.015
enstein+1.989
Neg logit contributions
idth-2.206
 absorbed-2.126
etically-1.988
eston-1.985
 resur-1.980
Token.
Token ID13
Feature activation+8.49e-02

Pos logit contributions
+0.198
rogen+0.195
ourse+0.191
inho+0.188
enstein+0.184
Neg logit contributions
idth-0.203
 absorbed-0.199
 resur-0.187
etically-0.184
eston-0.184
Token military
Token ID2422
Feature activation+2.21e-01

Pos logit contributions
 absorbed+2.479
idth+2.409
 resur+2.298
 absorbing+2.269
etically+2.204
Neg logit contributions
-2.951
ourse-2.818
inho-2.812
rogen-2.788
enstein-2.691
Token bloc
Token ID24003
Feature activation-3.81e-01

Pos logit contributions
 absorbed+1.526
idth+1.494
 resur+1.420
etically+1.377
 absorbing+1.355
Neg logit contributions
-2.179
rogen-2.122
ourse-2.099
inho-2.081
enstein-2.018
Tokens
Token ID82
Feature activation-1.43e-02

Pos logit contributions
rogen+2.555
+2.553
ourse+2.498
inho+2.439
enstein+2.409
Neg logit contributions
idth-3.753
 absorbed-3.639
 resur-3.452
eston-3.438
etically-3.411
Token and
Token ID290
Feature activation+3.92e-01

Pos logit contributions
+0.119
rogen+0.118
ourse+0.115
inho+0.113
enstein+0.111
Neg logit contributions
idth-0.120
 absorbed-0.119
 resur-0.111
etically-0.109
eston-0.109
Token of
Token ID286
Feature activation+4.18e-01

Pos logit contributions
 absorbed+3.060
idth+2.951
 resur+2.814
 absorbing+2.753
etically+2.732
Neg logit contributions
-3.504
ourse-3.393
inho-3.355
rogen-3.327
enstein-3.233
Token increasing
Token ID3649
Feature activation+5.44e-01

Pos logit contributions
 absorbed+3.231
idth+3.099
 resur+2.963
 absorbing+2.936
etically+2.847
Neg logit contributions
-3.777
ourse-3.647
rogen-3.615
inho-3.597
enstein-3.464
Token military
Token ID2422
Feature activation+1.06e-01

Pos logit contributions
 absorbed+4.196
idth+4.001
 resur+3.861
etically+3.813
 absorbing+3.794
Neg logit contributions
-4.892
ourse-4.752
inho-4.634
rogen-4.616
enstein-4.464
Token bases
Token ID12536
Feature activation-1.32e-01

Pos logit contributions
 absorbed+0.775
idth+0.769
 resur+0.720
etically+0.707
eston+0.693
Neg logit contributions
-0.991
rogen-0.970
ourse-0.959
inho-0.945
enstein-0.922
Token
Token ID1399
Feature activation+2.25e-01

Pos logit contributions
+1.102
rogen+1.091
ourse+1.064
inho+1.046
enstein+1.027
Neg logit contributions
idth-1.117
 absorbed-1.084
 resur-1.021
eston-1.010
etically-1.010
Tokenwe
Token ID732
Feature activation+1.17e-01

Pos logit contributions
idth+1.863
 absorbed+1.839
 resur+1.712
etically+1.711
eston+1.693
Neg logit contributions
-1.887
rogen-1.854
ourse-1.818
inho-1.795
enstein-1.751
Token must
Token ID1276
Feature activation+2.43e-01

Pos logit contributions
 absorbed+0.900
idth+0.898
 resur+0.834
etically+0.814
eston+0.811
Neg logit contributions
-1.053
rogen-1.026
ourse-1.021
inho-0.999
enstein-0.980
Token of
Token ID286
Feature activation+1.21e-01

Pos logit contributions
 absorbed+1.312
idth+1.311
 resur+1.227
etically+1.210
eston+1.192
Neg logit contributions
-1.482
rogen-1.454
ourse-1.432
inho-1.406
enstein-1.390
Token degrading
Token ID40354
Feature activation-1.98e-01

Pos logit contributions
idth+0.964
 absorbed+0.959
 resur+0.894
etically+0.877
eston+0.871
Neg logit contributions
-1.057
rogen-1.042
ourse-1.028
inho-1.010
enstein-0.991
Token or
Token ID393
Feature activation+5.40e-02

Pos logit contributions
+1.712
rogen+1.709
ourse+1.644
inho+1.633
 Vul+1.604
Neg logit contributions
idth-1.607
 absorbed-1.535
eston-1.434
 resur-1.432
etically-1.417
Token in
Token ID287
Feature activation+1.15e-01

Pos logit contributions
idth+0.417
 absorbed+0.412
 resur+0.384
etically+0.377
eston+0.375
Neg logit contributions
-0.486
rogen-0.478
ourse-0.473
inho-0.464
enstein-0.456
Tokenaus
Token ID8717
Feature activation+5.99e-02

Pos logit contributions
idth+1.005
 absorbed+0.989
 resur+0.933
etically+0.920
eston+0.917
Neg logit contributions
-0.917
rogen-0.897
ourse-0.884
inho-0.865
enstein-0.850
Tokenp
Token ID79
Feature activation+6.15e-01

Pos logit contributions
idth+0.500
 absorbed+0.490
 resur+0.459
etically+0.452
eston+0.452
Neg logit contributions
-0.504
rogen-0.498
ourse-0.487
inho-0.479
enstein-0.470
Tokenicious
Token ID6243
Feature activation-9.50e-02

Pos logit contributions
idth+3.774
 absorbed+3.746
etically+3.581
 resur+3.435
eston+3.433
Neg logit contributions
-6.431
ourse-6.117
rogen-6.112
inho-6.069
enstein-5.954
Token names
Token ID3891
Feature activation-1.68e-01

Pos logit contributions
+0.845
rogen+0.839
ourse+0.815
inho+0.806
enstein+0.790
Neg logit contributions
idth-0.748
 absorbed-0.723
 resur-0.673
eston-0.670
etically-0.665
Token during
Token ID1141
Feature activation+1.90e-01

Pos logit contributions
+1.376
rogen+1.366
ourse+1.328
inho+1.310
enstein+1.282
Neg logit contributions
idth-1.436
 absorbed-1.381
 resur-1.294
eston-1.291
etically-1.288
Token the
Token ID262
Feature activation-1.44e-01

Pos logit contributions
idth+1.611
 absorbed+1.602
 resur+1.502
etically+1.467
eston+1.463
Neg logit contributions
-1.547
rogen-1.534
ourse-1.513
inho-1.480
enstein-1.453
Token 13
Token ID1511
Feature activation+1.45e-01

Pos logit contributions
+1.181
rogen+1.169
ourse+1.136
inho+1.118
enstein+1.099
Neg logit contributions
idth-1.239
 absorbed-1.206
 resur-1.127
eston-1.125
etically-1.122
Token by
Token ID416
Feature activation+3.90e-01

Pos logit contributions
 absorbed+1.987
idth+1.915
 resur+1.810
 absorbing+1.796
etically+1.776
Neg logit contributions
-2.164
rogen-2.090
ourse-2.085
inho-2.041
enstein-2.020
Token about
Token ID546
Feature activation+1.76e-01

Pos logit contributions
 absorbed+2.941
idth+2.735
 absorbing+2.712
 resur+2.684
etically+2.594
Neg logit contributions
-3.677
ourse-3.569
rogen-3.486
inho-3.463
enstein-3.456
Token 0
Token ID657
Feature activation-3.61e-02

Pos logit contributions
 absorbed+1.467
idth+1.460
 resur+1.365
etically+1.336
 absorbing+1.334
Neg logit contributions
-1.504
rogen-1.445
ourse-1.435
inho-1.410
enstein-1.390
Token.
Token ID13
Feature activation+8.50e-02

Pos logit contributions
+0.297
rogen+0.293
ourse+0.287
inho+0.282
enstein+0.277
Neg logit contributions
idth-0.307
 absorbed-0.302
 resur-0.283
etically-0.280
eston-0.278
Token3
Token ID18
Feature activation+3.82e-01

Pos logit contributions
idth+0.725
 absorbed+0.706
 resur+0.662
etically+0.660
eston+0.658
Neg logit contributions
-0.690
rogen-0.686
ourse-0.672
inho-0.653
enstein-0.643
Token percent
Token ID1411
Feature activation+5.74e-01

Pos logit contributions
 absorbed+2.844
idth+2.839
 resur+2.586
etically+2.551
 absorbing+2.522
Neg logit contributions
-3.470
rogen-3.440
ourse-3.416
inho-3.343
enstein-3.281
Token.
Token ID13
Feature activation+3.38e-01

Pos logit contributions
 absorbed+4.323
idth+4.098
 resur+3.855
 absorbing+3.843
etically+3.823
Neg logit contributions
-5.289
ourse-5.065
inho-4.959
rogen-4.938
enstein-4.874
Token In
Token ID554
Feature activation-4.54e-02

Pos logit contributions
 absorbed+2.404
idth+2.369
etically+2.215
 resur+2.214
eston+2.177
Neg logit contributions
-3.120
rogen-3.092
ourse-3.078
inho-2.954
enstein-2.924
Token other
Token ID584
Feature activation-1.46e-01

Pos logit contributions
+0.394
rogen+0.389
ourse+0.381
inho+0.375
enstein+0.368
Neg logit contributions
idth-0.365
 absorbed-0.360
 resur-0.336
etically-0.331
eston-0.329
Token words
Token ID2456
Feature activation+3.41e-01

Pos logit contributions
+1.088
rogen+1.073
ourse+1.047
inho+1.027
enstein+1.005
Neg logit contributions
idth-1.354
 absorbed-1.330
 resur-1.254
etically-1.238
eston-1.237
Token,
Token ID11
Feature activation+2.48e-01

Pos logit contributions
idth+2.462
 absorbed+2.462
 resur+2.237
eston+2.212
etically+2.201
Neg logit contributions
-3.174
rogen-3.137
ourse-3.133
inho-3.072
enstein-3.000
Token week
Token ID1285
Feature activation-1.90e-01

Pos logit contributions
+0.966
rogen+0.956
ourse+0.934
inho+0.920
enstein+0.903
Neg logit contributions
idth-0.936
 absorbed-0.918
 resur-0.858
etically-0.847
eston-0.845
Token's
Token ID338
Feature activation+1.06e-01

Pos logit contributions
+1.456
rogen+1.454
ourse+1.402
inho+1.389
enstein+1.362
Neg logit contributions
idth-1.708
 absorbed-1.670
 resur-1.571
etically-1.559
eston-1.557
Token new
Token ID649
Feature activation+1.71e-01

Pos logit contributions
idth+0.867
 absorbed+0.854
 resur+0.808
eston+0.784
etically+0.784
Neg logit contributions
-0.919
rogen-0.899
ourse-0.888
inho-0.868
enstein-0.850
Token issue
Token ID2071
Feature activation+3.13e-01

Pos logit contributions
idth+1.412
 absorbed+1.387
 resur+1.316
eston+1.278
etically+1.276
Neg logit contributions
-1.459
rogen-1.424
ourse-1.408
inho-1.373
enstein-1.339
Token.
Token ID13
Feature activation+2.72e-01

Pos logit contributions
idth+2.405
 absorbed+2.397
 resur+2.261
eston+2.180
etically+2.176
Neg logit contributions
-2.778
rogen-2.732
ourse-2.688
inho-2.643
enstein-2.590
Token We
Token ID775
Feature activation+6.22e-01

Pos logit contributions
idth+2.147
 absorbed+2.098
 resur+1.987
eston+1.982
etically+1.961
Neg logit contributions
-2.325
rogen-2.323
ourse-2.288
inho-2.217
enstein-2.207
Token asked
Token ID1965
Feature activation-3.91e-01

Pos logit contributions
 absorbed+5.234
 resur+4.874
idth+4.795
eston+4.509
hler+4.505
Neg logit contributions
-5.297
ourse-4.913
rogen-4.719
inho-4.686
enstein-4.678
Token them
Token ID606
Feature activation+5.24e-02

Pos logit contributions
+3.117
rogen+3.091
ourse+2.995
inho+2.957
igh+2.896
Neg logit contributions
idth-3.402
 absorbed-3.310
 resur-3.079
etically-3.072
eston-3.064
Token plenty
Token ID6088
Feature activation+8.84e-02

Pos logit contributions
 absorbed+0.398
idth+0.397
 resur+0.373
etically+0.361
eston+0.361
Neg logit contributions
-0.477
rogen-0.464
ourse-0.459
inho-0.452
enstein-0.445
Token about
Token ID546
Feature activation-3.75e-01

Pos logit contributions
idth+0.702
 absorbed+0.701
 resur+0.658
etically+0.645
eston+0.639
Neg logit contributions
-0.771
rogen-0.755
ourse-0.744
inho-0.729
enstein-0.718
Token the
Token ID262
Feature activation-2.21e-01

Pos logit contributions
rogen+2.868
+2.854
ourse+2.751
inho+2.713
enstein+2.683
Neg logit contributions
idth-3.360
 absorbed-3.282
etically-3.075
 resur-3.063
eston-3.060
Tokenuates
Token ID12632
Feature activation+6.76e-02

Pos logit contributions
idth+0.536
 absorbed+0.528
 resur+0.497
etically+0.490
eston+0.488
Neg logit contributions
-0.456
rogen-0.450
ourse-0.437
inho-0.432
enstein-0.421
Token.
Token ID13
Feature activation-1.94e-01

Pos logit contributions
idth+0.572
 absorbed+0.567
 resur+0.531
etically+0.520
eston+0.520
Neg logit contributions
-0.559
rogen-0.550
ourse-0.537
inho-0.531
enstein-0.519
Token O
Token ID440
Feature activation+4.01e-02

Pos logit contributions
+1.616
rogen+1.595
ourse+1.564
inho+1.539
enstein+1.510
Neg logit contributions
idth-1.628
 absorbed-1.595
 resur-1.493
etically-1.471
eston-1.469
Tokenurs
Token ID1834
Feature activation+2.36e-01

Pos logit contributions
idth+0.368
 absorbed+0.362
 resur+0.341
etically+0.337
eston+0.336
Neg logit contributions
-0.304
rogen-0.298
ourse-0.291
inho-0.287
enstein-0.281
Token is
Token ID318
Feature activation-9.22e-02

Pos logit contributions
 absorbed+2.104
idth+2.078
 resur+1.953
eston+1.906
etically+1.902
Neg logit contributions
-1.862
rogen-1.801
ourse-1.754
inho-1.747
enstein-1.690
Token a
Token ID257
Feature activation+5.67e-02

Pos logit contributions
+0.740
rogen+0.730
ourse+0.713
inho+0.702
enstein+0.687
Neg logit contributions
idth-0.804
 absorbed-0.791
 resur-0.743
etically-0.732
eston-0.731
Token Bachelor
Token ID33399
Feature activation+8.24e-02

Pos logit contributions
idth+0.463
 absorbed+0.463
 resur+0.433
etically+0.423
eston+0.418
Neg logit contributions
-0.485
rogen-0.474
ourse-0.465
inho-0.459
enstein-0.448
Token of
Token ID286
Feature activation+6.61e-02

Pos logit contributions
idth+0.706
 absorbed+0.695
 resur+0.652
etically+0.642
eston+0.641
Neg logit contributions
-0.673
rogen-0.664
ourse-0.649
inho-0.639
enstein-0.626
Token Science
Token ID5800
Feature activation-7.52e-02

Pos logit contributions
idth+0.557
 absorbed+0.550
 resur+0.513
etically+0.507
eston+0.505
Neg logit contributions
-0.549
rogen-0.544
ourse-0.529
inho-0.522
enstein-0.511
Token degree
Token ID4922
Feature activation-7.85e-02

Pos logit contributions
+0.624
rogen+0.616
ourse+0.602
inho+0.593
enstein+0.581
Neg logit contributions
idth-0.634
 absorbed-0.624
 resur-0.584
etically-0.576
eston-0.574
Token in
Token ID287
Feature activation+1.36e-01

Pos logit contributions
+0.672
rogen+0.664
ourse+0.651
inho+0.639
enstein+0.628
Neg logit contributions
idth-0.643
 absorbed-0.629
 resur-0.588
etically-0.580
eston-0.579
Token said
Token ID531
Feature activation+2.84e-01

Pos logit contributions
+0.173
rogen+0.169
ourse+0.165
inho+0.161
enstein+0.158
Neg logit contributions
idth-0.281
 absorbed-0.281
 resur-0.267
eston-0.262
etically-0.261
Token.
Token ID13
Feature activation+1.35e-01

Pos logit contributions
 absorbed+2.197
idth+2.138
 resur+2.050
eston+1.995
 absorbing+1.983
Neg logit contributions
-2.533
rogen-2.465
ourse-2.456
inho-2.407
enstein-2.360
Token
Token ID198
Feature activation+3.45e-01

Pos logit contributions
idth+1.180
 absorbed+1.157
 resur+1.087
eston+1.076
etically+1.073
Neg logit contributions
-1.082
rogen-1.070
ourse-1.044
inho-1.027
enstein-1.006
Token
Token ID198
Feature activation+3.50e-02

Pos logit contributions
idth+2.390
 absorbed+2.384
 resur+2.300
etically+2.291
eston+2.264
Neg logit contributions
-3.169
ourse-3.139
rogen-3.113
inho-2.984
 redes-2.942
TokenMer
Token ID13102
Feature activation-2.28e-01

Pos logit contributions
idth+0.289
 absorbed+0.282
 resur+0.266
eston+0.264
etically+0.263
Neg logit contributions
-0.294
rogen-0.291
ourse-0.285
inho-0.281
enstein-0.276
Tokenkel
Token ID7750
Feature activation+1.65e-01

Pos logit contributions
+1.405
rogen+1.404
ourse+1.358
inho+1.332
enstein+1.303
Neg logit contributions
idth-2.384
 absorbed-2.361
 resur-2.237
eston-2.209
etically-2.208
Token said
Token ID531
Feature activation+1.41e-01

Pos logit contributions
 absorbed+1.457
idth+1.437
 resur+1.378
eston+1.323
etically+1.302
Neg logit contributions
-1.314
rogen-1.270
ourse-1.258
inho-1.231
enstein-1.204
Token she
Token ID673
Feature activation-1.68e-01

Pos logit contributions
idth+1.137
 absorbed+1.129
 resur+1.057
eston+1.027
etically+1.024
Neg logit contributions
-1.242
rogen-1.204
ourse-1.182
inho-1.168
enstein-1.149
Token was
Token ID373
Feature activation+5.02e-02

Pos logit contributions
+1.353
rogen+1.333
ourse+1.305
inho+1.282
enstein+1.258
Neg logit contributions
idth-1.456
 absorbed-1.435
 resur-1.346
etically-1.325
eston-1.324
Token "
Token ID366
Feature activation+2.04e-01

Pos logit contributions
 absorbed+0.418
idth+0.413
 resur+0.391
 absorbing+0.379
eston+0.378
Neg logit contributions
-0.425
rogen-0.414
ourse-0.410
inho-0.400
enstein-0.395
Tokenanything
Token ID49459
Feature activation+4.54e-03

Pos logit contributions
 absorbed+1.787
idth+1.786
eston+1.669
 resur+1.667
etically+1.651
Neg logit contributions
-1.589
rogen-1.558
ourse-1.528
inho-1.502
enstein-1.459
Token Stock
Token ID10500
Feature activation+1.47e-01

Pos logit contributions
idth+0.986
 absorbed+0.968
 resur+0.908
eston+0.898
etically+0.894
Neg logit contributions
-0.915
rogen-0.907
ourse-0.877
inho-0.863
enstein-0.849
Tokenbro
Token ID7957
Feature activation+8.02e-02

Pos logit contributions
idth+1.248
 absorbed+1.233
 resur+1.157
eston+1.147
etically+1.137
Neg logit contributions
-1.208
rogen-1.184
ourse-1.156
inho-1.135
enstein-1.104
Tokenkers
Token ID15949
Feature activation+2.34e-01

Pos logit contributions
idth+0.784
 absorbed+0.777
 resur+0.733
eston+0.724
etically+0.720
Neg logit contributions
-0.558
rogen-0.545
ourse-0.526
inho-0.520
enstein-0.507
Token chief
Token ID4039
Feature activation+2.35e-01

Pos logit contributions
idth+1.685
 absorbed+1.684
 resur+1.557
eston+1.523
etically+1.514
Neg logit contributions
-2.194
rogen-2.170
ourse-2.103
inho-2.095
enstein-2.041
Token economist
Token ID17646
Feature activation-1.79e-01

Pos logit contributions
idth+1.765
 absorbed+1.752
 resur+1.620
etically+1.591
eston+1.589
Neg logit contributions
-2.156
rogen-2.114
ourse-2.058
inho-2.036
enstein-1.998
Token Alan
Token ID12246
Feature activation-3.40e-02

Pos logit contributions
+1.492
rogen+1.475
ourse+1.445
inho+1.422
enstein+1.394
Neg logit contributions
idth-1.503
 absorbed-1.473
 resur-1.378
etically-1.360
eston-1.356
Token Mc
Token ID1982
Feature activation+1.44e-01

Pos logit contributions
+0.281
rogen+0.277
ourse+0.271
inho+0.266
enstein+0.261
Neg logit contributions
idth-0.288
 absorbed-0.283
 resur-0.265
etically-0.262
eston-0.261
TokenQu
Token ID4507
Feature activation+1.72e-01

Pos logit contributions
idth+1.286
 absorbed+1.249
eston+1.183
 resur+1.181
etically+1.175
Neg logit contributions
-1.127
rogen-1.102
ourse-1.070
inho-1.063
enstein-1.045
Tokenaid
Token ID1698
Feature activation-1.06e-01

Pos logit contributions
idth+1.552
 absorbed+1.507
eston+1.435
 resur+1.422
etically+1.415
Neg logit contributions
-1.354
rogen-1.295
ourse-1.258
inho-1.229
enstein-1.225
Token
Token ID198
Feature activation-1.89e-01

Pos logit contributions
+0.850
rogen+0.839
ourse+0.821
inho+0.807
enstein+0.794
Neg logit contributions
idth-0.934
 absorbed-0.915
 resur-0.859
etically-0.848
eston-0.845
Token
Token ID198
Feature activation-2.00e-01

Pos logit contributions
+1.428
rogen+1.412
ourse+1.372
inho+1.353
enstein+1.327
Neg logit contributions
idth-1.739
 absorbed-1.707
 resur-1.604
etically-1.581
eston-1.579
Token B
Token ID347
Feature activation-5.23e-01

Pos logit contributions
+0.085
rogen+0.084
ourse+0.083
inho+0.081
enstein+0.079
Neg logit contributions
 absorbed-0.087
idth-0.086
 resur-0.081
etically-0.079
eston-0.079
Tokenollywood
Token ID31777
Feature activation-1.22e-01

Pos logit contributions
rogen+3.282
+3.252
ourse+3.246
inho+3.153
ury+3.122
Neg logit contributions
idth-5.318
 absorbed-5.233
 resur-4.997
etically-4.857
eston-4.845
Token at
Token ID379
Feature activation+8.26e-02

Pos logit contributions
+0.975
rogen+0.963
ourse+0.942
inho+0.926
enstein+0.908
Neg logit contributions
idth-1.060
 absorbed-1.038
 resur-0.976
etically-0.962
eston-0.962
Token its
Token ID663
Feature activation+4.13e-02

Pos logit contributions
idth+0.654
 absorbed+0.645
 resur+0.600
etically+0.593
eston+0.592
Neg logit contributions
-0.727
rogen-0.716
ourse-0.702
inho-0.692
enstein-0.679
Token best
Token ID1266
Feature activation+6.09e-02

Pos logit contributions
idth+0.349
 absorbed+0.349
 resur+0.327
etically+0.321
eston+0.320
Neg logit contributions
-0.338
rogen-0.333
ourse-0.329
inho-0.322
enstein-0.313
Token [
Token ID685
Feature activation+8.01e-02

Pos logit contributions
idth+0.512
 absorbed+0.510
 resur+0.475
eston+0.469
etically+0.467
Neg logit contributions
-0.505
rogen-0.495
ourse-0.488
inho-0.477
enstein-0.469
Tokenem
Token ID368
Feature activation+1.74e-01

Pos logit contributions
idth+0.638
 absorbed+0.628
 resur+0.585
eston+0.583
etically+0.580
Neg logit contributions
-0.694
rogen-0.688
ourse-0.676
inho-0.662
enstein-0.651
Tokenph
Token ID746
Feature activation+1.59e-01

Pos logit contributions
idth+1.545
 absorbed+1.523
etically+1.432
 resur+1.427
eston+1.417
Neg logit contributions
-1.379
rogen-1.347
ourse-1.307
inho-1.289
enstein-1.266
Tokenases
Token ID1386
Feature activation-3.71e-01

Pos logit contributions
idth+1.263
 absorbed+1.245
etically+1.161
 resur+1.156
eston+1.153
Neg logit contributions
-1.412
rogen-1.355
ourse-1.345
inho-1.319
enstein-1.303
Token added
Token ID2087
Feature activation-3.14e-01

Pos logit contributions
+2.796
rogen+2.756
ourse+2.689
inho+2.673
enstein+2.599
Neg logit contributions
idth-3.398
 absorbed-3.258
 resur-3.084
eston-3.066
etically-3.034
Token],"
Token ID17241
Feature activation+1.21e-02

Pos logit contributions
+1.969
rogen+1.946
ourse+1.871
inho+1.862
enstein+1.794
Neg logit contributions
idth-3.279
 absorbed-3.190
 resur-3.036
etically-2.989
eston-2.985
Token"
Token ID1
Feature activation+6.88e-02

Pos logit contributions
+0.724
rogen+0.717
ourse+0.703
inho+0.690
enstein+0.678
Neg logit contributions
idth-0.727
 absorbed-0.713
 resur-0.670
eston-0.663
etically-0.662
TokenThose
Token ID9627
Feature activation+1.27e-02

Pos logit contributions
idth+0.559
 absorbed+0.548
eston+0.517
etically+0.517
 resur+0.517
Neg logit contributions
rogen-0.576
-0.572
ourse-0.564
inho-0.550
enstein-0.544
Token who
Token ID508
Feature activation+1.01e-01

Pos logit contributions
idth+0.109
 absorbed+0.108
 resur+0.102
etically+0.100
eston+0.099
Neg logit contributions
-0.102
rogen-0.101
ourse-0.099
inho-0.097
enstein-0.095
Token've
Token ID1053
Feature activation+3.85e-01

Pos logit contributions
 absorbed+0.806
idth+0.787
 resur+0.754
etically+0.721
eston+0.720
Neg logit contributions
-0.892
ourse-0.862
rogen-0.860
inho-0.832
enstein-0.824
Token seen
Token ID1775
Feature activation+1.70e-01

Pos logit contributions
 absorbed+3.007
 resur+2.775
idth+2.732
eston+2.575
 absorbing+2.550
Neg logit contributions
-3.609
ourse-3.512
rogen-3.398
inho-3.361
enstein-3.312
Token a
Token ID257
Feature activation-7.34e-02

Pos logit contributions
 absorbed+1.367
idth+1.348
 resur+1.288
etically+1.250
eston+1.248
Neg logit contributions
-1.468
ourse-1.438
rogen-1.433
inho-1.388
enstein-1.372
Token be
Token ID307
Feature activation-1.15e-01

Pos logit contributions
+0.602
rogen+0.596
ourse+0.581
inho+0.573
enstein+0.561
Neg logit contributions
idth-0.627
 absorbed-0.614
 resur-0.575
etically-0.568
eston-0.568
Tokenaver
Token ID8770
Feature activation+2.97e-01

Pos logit contributions
+0.333
rogen+0.318
ourse+0.297
inho+0.282
enstein+0.264
Neg logit contributions
idth-1.599
 absorbed-1.581
 resur-1.521
etically-1.510
eston-1.510
Token's
Token ID338
Feature activation+2.20e-01

Pos logit contributions
 absorbed+2.344
idth+2.238
 resur+2.211
 absorbing+2.130
etically+2.090
Neg logit contributions
-2.657
ourse-2.587
rogen-2.571
inho-2.463
enstein-2.431
Token teeth
Token ID9941
Feature activation+2.32e-01

Pos logit contributions
 absorbed+1.685
idth+1.653
 resur+1.589
etically+1.531
 absorbing+1.521
Neg logit contributions
-1.990
rogen-1.948
ourse-1.934
inho-1.870
enstein-1.846
Token will
Token ID481
Feature activation+2.63e-01

Pos logit contributions
 absorbed+1.962
idth+1.888
 resur+1.847
 absorbing+1.772
etically+1.755
Neg logit contributions
-1.968
rogen-1.912
ourse-1.911
inho-1.824
enstein-1.812
Token a
Token ID257
Feature activation-7.61e-03

Pos logit contributions
+1.383
rogen+1.366
ourse+1.336
inho+1.315
enstein+1.288
Neg logit contributions
idth-1.449
 absorbed-1.422
 resur-1.333
etically-1.316
eston-1.314
Token number
Token ID1271
Feature activation+7.64e-02

Pos logit contributions
+0.065
rogen+0.064
ourse+0.063
inho+0.061
enstein+0.061
Neg logit contributions
idth-0.062
 absorbed-0.062
 resur-0.058
etically-0.057
eston-0.056
Token of
Token ID286
Feature activation-8.38e-02

Pos logit contributions
idth+0.628
 absorbed+0.620
 resur+0.580
etically+0.571
eston+0.568
Neg logit contributions
-0.649
rogen-0.641
ourse-0.627
inho-0.614
enstein-0.607
Token other
Token ID584
Feature activation-1.11e-01

Pos logit contributions
+0.806
rogen+0.796
ourse+0.783
inho+0.770
enstein+0.759
Neg logit contributions
idth-0.596
 absorbed-0.586
 resur-0.542
etically-0.532
eston-0.530
Token chances
Token ID8395
Feature activation+1.41e-02

Pos logit contributions
+1.053
rogen+1.043
ourse+1.021
inho+1.010
enstein+0.991
Neg logit contributions
idth-0.808
 absorbed-0.787
 resur-0.729
etically-0.719
eston-0.718
Token test
Token ID1332
Feature activation-4.51e-01

Pos logit contributions
 absorbed+0.115
idth+0.114
 resur+0.107
etically+0.104
eston+0.104
Neg logit contributions
-0.120
rogen-0.118
ourse-0.117
inho-0.113
enstein-0.113
Token the
Token ID262
Feature activation+4.43e-02

Pos logit contributions
+3.540
rogen+3.481
inho+3.417
ourse+3.397
ury+3.299
Neg logit contributions
idth-3.926
 absorbed-3.831
 resur-3.587
etically-3.578
eston-3.571
Token home
Token ID1363
Feature activation+2.19e-01

Pos logit contributions
 absorbed+0.356
idth+0.355
 resur+0.334
eston+0.324
 absorbing+0.323
Neg logit contributions
-0.383
rogen-0.377
ourse-0.371
inho-0.362
enstein-0.359
Token defence
Token ID9366
Feature activation+4.27e-01

Pos logit contributions
 absorbed+1.546
idth+1.528
 resur+1.442
 absorbing+1.394
etically+1.375
Neg logit contributions
-2.100
rogen-2.070
ourse-2.033
enstein-1.992
inho-1.984
Token without
Token ID1231
Feature activation-1.74e-01

Pos logit contributions
 absorbed+3.282
idth+3.133
 resur+3.083
 absorbing+2.995
eston+2.919
Neg logit contributions
-3.803
rogen-3.712
ourse-3.692
enstein-3.559
inho-3.524
Token finding
Token ID4917
Feature activation-2.89e-01

Pos logit contributions
+1.449
rogen+1.434
ourse+1.400
inho+1.383
enstein+1.353
Neg logit contributions
idth-1.466
 absorbed-1.431
 resur-1.337
etically-1.327
eston-1.326
Token being
Token ID852
Feature activation-6.31e-02

Pos logit contributions
+1.350
rogen+1.334
ourse+1.304
inho+1.284
enstein+1.258
Neg logit contributions
idth-1.393
 absorbed-1.360
 resur-1.275
etically-1.261
eston-1.259
Token locked
Token ID8970
Feature activation+1.34e-01

Pos logit contributions
+0.563
rogen+0.554
ourse+0.546
inho+0.536
enstein+0.528
Neg logit contributions
 absorbed-0.491
idth-0.491
 resur-0.457
etically-0.445
eston-0.444
Token up
Token ID510
Feature activation-5.64e-02

Pos logit contributions
 absorbed+0.880
idth+0.880
 resur+0.810
etically+0.792
eston+0.782
Neg logit contributions
-1.360
rogen-1.343
ourse-1.319
inho-1.296
enstein-1.277
Token under
Token ID739
Feature activation-2.89e-02

Pos logit contributions
+0.473
rogen+0.465
ourse+0.456
inho+0.448
enstein+0.440
Neg logit contributions
idth-0.472
 absorbed-0.467
 resur-0.437
etically-0.429
eston-0.428
Token false
Token ID3991
Feature activation-1.97e-01

Pos logit contributions
+0.266
rogen+0.262
ourse+0.257
inho+0.254
enstein+0.249
Neg logit contributions
idth-0.219
 absorbed-0.215
 resur-0.200
etically-0.196
eston-0.196
Token pret
Token ID2181
Feature activation-3.37e-01

Pos logit contributions
+1.781
rogen+1.757
ourse+1.719
inho+1.698
enstein+1.662
Neg logit contributions
idth-1.511
 absorbed-1.477
 resur-1.366
etically-1.357
eston-1.355
Tokenenses
Token ID4541
Feature activation+1.39e-01

Pos logit contributions
rogen+3.015
+2.952
ourse+2.946
inho+2.884
enstein+2.798
Neg logit contributions
idth-2.586
 absorbed-2.581
 resur-2.400
eston-2.346
etically-2.340
Token.
Token ID13
Feature activation+2.43e-01

Pos logit contributions
 absorbed+1.129
idth+1.116
 resur+1.060
 absorbing+1.020
etically+1.014
Neg logit contributions
-1.204
rogen-1.180
ourse-1.160
inho-1.136
enstein-1.123
Token
Token ID198
Feature activation+2.75e-01

Pos logit contributions
idth+1.978
 absorbed+1.936
 resur+1.816
eston+1.813
etically+1.796
Neg logit contributions
-2.039
rogen-2.028
ourse-1.978
inho-1.939
enstein-1.901
Token
Token ID198
Feature activation+1.37e-01

Pos logit contributions
 absorbed+2.004
idth+2.003
 resur+1.918
etically+1.899
eston+1.892
Neg logit contributions
-2.467
ourse-2.423
rogen-2.412
inho-2.324
enstein-2.275
Token"
Token ID1
Feature activation+3.95e-01

Pos logit contributions
idth+1.037
 absorbed+1.008
eston+0.947
 resur+0.945
etically+0.936
Neg logit contributions
-1.235
rogen-1.228
ourse-1.206
inho-1.192
enstein-1.168
Token through
Token ID832
Feature activation-1.27e-01

Pos logit contributions
+1.147
rogen+1.133
ourse+1.107
inho+1.090
enstein+1.068
Neg logit contributions
idth-1.206
 absorbed-1.181
 resur-1.108
etically-1.092
eston-1.092
Token a
Token ID257
Feature activation+3.23e-01

Pos logit contributions
+1.074
rogen+1.061
ourse+1.039
inho+1.022
enstein+1.003
Neg logit contributions
idth-1.048
 absorbed-1.030
 resur-0.964
etically-0.949
eston-0.947
Token dating
Token ID10691
Feature activation-9.11e-03

Pos logit contributions
 absorbed+2.358
idth+2.321
 resur+2.218
etically+2.113
 absorbing+2.098
Neg logit contributions
-2.979
ourse-2.942
rogen-2.927
inho-2.854
enstein-2.780
Token relationship
Token ID2776
Feature activation+2.79e-01

Pos logit contributions
+0.087
rogen+0.086
ourse+0.084
inho+0.083
enstein+0.082
Neg logit contributions
idth-0.065
 absorbed-0.064
 resur-0.059
etically-0.058
eston-0.058
Token.
Token ID13
Feature activation-2.10e-01

Pos logit contributions
idth+2.224
 absorbed+2.223
 resur+2.092
etically+2.044
eston+2.016
Neg logit contributions
-2.389
rogen-2.357
ourse-2.312
inho-2.284
enstein-2.235
Token
Token ID198
Feature activation-4.24e-01

Pos logit contributions
+1.747
rogen+1.717
ourse+1.681
inho+1.660
enstein+1.627
Neg logit contributions
idth-1.777
 absorbed-1.742
 resur-1.626
eston-1.603
etically-1.602
Token
Token ID198
Feature activation-5.68e-01

Pos logit contributions
+2.868
rogen+2.848
inho+2.736
ourse+2.716
enstein+2.673
Neg logit contributions
idth-4.254
 absorbed-4.151
 carbohyd-3.925
 resur-3.866
eston-3.807
Token
Token ID447
Feature activation-1.36e-01

Pos logit contributions
+4.279
rogen+4.053
inho+3.957
ourse+3.945
igh+3.869
Neg logit contributions
idth-5.325
 absorbed-5.247
 resur-4.864
 absorbing-4.765
etically-4.727
Token
Token ID246
Feature activation-3.15e-01

Pos logit contributions
+1.059
rogen+1.048
ourse+1.021
inho+1.002
enstein+0.981
Neg logit contributions
idth-1.218
 absorbed-1.198
 resur-1.126
etically-1.112
eston-1.108
TokenAs
Token ID1722
Feature activation-2.77e-01

Pos logit contributions
+2.510
rogen+2.448
ourse+2.401
inho+2.367
enstein+2.316
Neg logit contributions
idth-2.799
 absorbed-2.757
 resur-2.562
eston-2.517
etically-2.511
Token we
Token ID356
Feature activation-2.37e-01

Pos logit contributions
+2.227
rogen+2.204
ourse+2.148
inho+2.117
enstein+2.072
Neg logit contributions
idth-2.407
 absorbed-2.345
 resur-2.201
eston-2.190
etically-2.176
Token known
Token ID1900
Feature activation-5.86e-02

Pos logit contributions
+0.658
rogen+0.643
ourse+0.633
inho+0.622
enstein+0.608
Neg logit contributions
idth-0.659
 absorbed-0.654
 resur-0.614
etically-0.601
eston-0.599
Token Linux
Token ID7020
Feature activation+9.55e-02

Pos logit contributions
+0.494
rogen+0.488
ourse+0.478
inho+0.470
enstein+0.461
Neg logit contributions
idth-0.488
 absorbed-0.479
 resur-0.449
etically-0.442
eston-0.440
Token root
Token ID6808
Feature activation-7.71e-02

Pos logit contributions
idth+0.707
 absorbed+0.700
 resur+0.658
etically+0.640
eston+0.634
Neg logit contributions
-0.886
rogen-0.871
ourse-0.853
inho-0.846
enstein-0.824
Tokenk
Token ID74
Feature activation-1.06e-01

Pos logit contributions
+0.582
rogen+0.574
ourse+0.561
inho+0.550
enstein+0.539
Neg logit contributions
idth-0.708
 absorbed-0.696
 resur-0.656
etically-0.647
eston-0.647
Tokenits
Token ID896
Feature activation-1.11e-01

Pos logit contributions
+0.851
rogen+0.843
ourse+0.825
inho+0.811
enstein+0.795
Neg logit contributions
idth-0.922
 absorbed-0.910
 resur-0.856
etically-0.838
eston-0.836
Token.
Token ID13
Feature activation-2.35e-01

Pos logit contributions
+0.921
rogen+0.909
ourse+0.889
inho+0.875
enstein+0.857
Neg logit contributions
idth-0.940
 absorbed-0.924
 resur-0.865
etically-0.853
eston-0.851
Token He
Token ID679
Feature activation-9.43e-02

Pos logit contributions
+1.940
rogen+1.909
ourse+1.870
inho+1.845
enstein+1.804
Neg logit contributions
idth-2.003
 absorbed-1.965
 resur-1.837
etically-1.811
eston-1.806
Token has
Token ID468
Feature activation+2.51e-02

Pos logit contributions
+0.721
rogen+0.704
ourse+0.692
inho+0.677
enstein+0.661
Neg logit contributions
idth-0.859
 absorbed-0.854
 resur-0.806
eston-0.789
etically-0.788
Token expressed
Token ID6241
Feature activation+2.05e-01

Pos logit contributions
 absorbed+0.210
idth+0.208
 resur+0.197
eston+0.190
etically+0.189
Neg logit contributions
-0.213
rogen-0.207
ourse-0.205
inho-0.202
enstein-0.196
Token his
Token ID465
Feature activation-2.79e-02

Pos logit contributions
 absorbed+1.630
idth+1.618
 resur+1.521
eston+1.484
etically+1.479
Neg logit contributions
-1.795
rogen-1.761
ourse-1.733
inho-1.709
enstein-1.667
Token sadness
Token ID25303
Feature activation-3.13e-01

Pos logit contributions
+0.223
rogen+0.220
ourse+0.215
inho+0.212
enstein+0.207
Neg logit contributions
idth-0.244
 absorbed-0.240
 resur-0.226
etically-0.222
eston-0.222
Token biological
Token ID10685
Feature activation+1.95e-01

Pos logit contributions
+1.141
rogen+1.119
ourse+1.099
inho+1.082
enstein+1.059
Neg logit contributions
idth-1.208
 absorbed-1.193
 resur-1.118
etically-1.102
eston-1.097
Token sex
Token ID1714
Feature activation-5.14e-01

Pos logit contributions
 absorbed+1.700
idth+1.679
 resur+1.595
etically+1.574
 absorbing+1.532
Neg logit contributions
-1.578
ourse-1.508
rogen-1.492
inho-1.483
enstein-1.436
Token,
Token ID11
Feature activation-3.31e-01

Pos logit contributions
rogen+3.991
+3.884
ourse+3.834
inho+3.705
enstein+3.701
Neg logit contributions
idth-4.516
 absorbed-4.318
 resur-4.147
eston-4.076
etically-4.049
Token they
Token ID484
Feature activation-3.63e-01

Pos logit contributions
+2.567
rogen+2.566
ourse+2.474
inho+2.439
enstein+2.386
Neg logit contributions
idth-2.954
 absorbed-2.879
 resur-2.709
eston-2.701
etically-2.685
Token wish
Token ID4601
Feature activation-3.65e-01

Pos logit contributions
rogen+2.626
+2.606
ourse+2.505
inho+2.485
enstein+2.433
Neg logit contributions
idth-3.453
 absorbed-3.333
eston-3.152
etically-3.139
 resur-3.137
Token to
Token ID284
Feature activation-2.99e-01

Pos logit contributions
rogen+2.782
+2.755
ourse+2.638
inho+2.598
enstein+2.565
Neg logit contributions
idth-3.288
 absorbed-3.199
eston-3.041
 resur-3.023
etically-2.982
Token undertake
Token ID25340
Feature activation-3.32e-01

Pos logit contributions
rogen+2.261
+2.248
ourse+2.165
inho+2.140
enstein+2.104
Neg logit contributions
idth-2.744
 absorbed-2.655
eston-2.497
 resur-2.484
etically-2.481
Token physical
Token ID3518
Feature activation-3.42e-01

Pos logit contributions
rogen+2.583
+2.574
ourse+2.484
inho+2.445
enstein+2.397
Neg logit contributions
idth-2.973
 absorbed-2.882
eston-2.707
 resur-2.706
etically-2.693
Token measures
Token ID5260
Feature activation-1.65e-01

Pos logit contributions
rogen+2.683
+2.637
ourse+2.552
inho+2.516
enstein+2.491
Neg logit contributions
idth-3.055
 absorbed-2.922
eston-2.766
 resur-2.722
etically-2.716
Token,
Token ID11
Feature activation+3.10e-02

Pos logit contributions
+1.353
rogen+1.339
ourse+1.308
inho+1.286
enstein+1.260
Neg logit contributions
idth-1.410
 absorbed-1.381
 resur-1.295
etically-1.278
eston-1.278
Token with
Token ID351
Feature activation-1.91e-01

Pos logit contributions
 absorbed+0.240
idth+0.237
 resur+0.224
etically+0.217
 absorbing+0.215
Neg logit contributions
-0.281
rogen-0.272
ourse-0.272
inho-0.267
enstein-0.262
Token am
Token ID716
Feature activation-2.76e-02

Pos logit contributions
 absorbed+1.840
idth+1.813
 resur+1.728
 absorbing+1.676
etically+1.675
Neg logit contributions
-1.544
ourse-1.499
rogen-1.483
inho-1.445
enstein-1.409
Token I
Token ID314
Feature activation-3.78e-01

Pos logit contributions
+0.210
rogen+0.206
ourse+0.202
inho+0.198
enstein+0.194
Neg logit contributions
idth-0.252
 absorbed-0.249
 resur-0.234
etically-0.231
eston-0.230
Token an
Token ID281
Feature activation-7.16e-02

Pos logit contributions
+2.990
rogen+2.970
ourse+2.843
inho+2.836
igh+2.761
Neg logit contributions
idth-3.322
 absorbed-3.136
eston-3.016
 resur-2.979
etically-2.973
Token addict
Token ID19678
Feature activation+6.90e-02

Pos logit contributions
+0.499
rogen+0.492
ourse+0.478
inho+0.470
enstein+0.458
Neg logit contributions
idth-0.701
 absorbed-0.688
 resur-0.652
eston-0.644
etically-0.643
Token of
Token ID286
Feature activation-1.49e-01

Pos logit contributions
idth+0.577
 absorbed+0.571
 resur+0.533
etically+0.526
eston+0.524
Neg logit contributions
-0.575
rogen-0.567
ourse-0.556
inho-0.546
enstein-0.535
Token crack
Token ID8469
Feature activation-7.49e-01

Pos logit contributions
+1.144
rogen+1.133
ourse+1.101
inho+1.086
enstein+1.063
Neg logit contributions
idth-1.347
 absorbed-1.316
 resur-1.240
eston-1.227
etically-1.224
Token cocaine
Token ID15144
Feature activation-4.58e-01

Pos logit contributions
rogen+4.117
inho+4.116
+4.110
ourse+4.009
enstein+3.923
Neg logit contributions
idth-8.019
 absorbed-7.566
etically-7.264
 resur-7.256
eston-7.255
Token,
Token ID11
Feature activation-2.59e-01

Pos logit contributions
+3.366
rogen+3.350
ourse+3.279
inho+3.221
enstein+3.183
Neg logit contributions
idth-4.182
 absorbed-4.017
etically-3.812
 resur-3.800
eston-3.789
Token
Token ID447
Feature activation-3.82e-02

Pos logit contributions
+2.037
rogen+2.012
ourse+1.951
inho+1.930
enstein+1.891
Neg logit contributions
idth-2.295
 absorbed-2.235
 resur-2.102
eston-2.088
etically-2.084
Token
Token ID251
Feature activation-3.19e-02

Pos logit contributions
+0.350
rogen+0.346
ourse+0.340
inho+0.334
enstein+0.329
Neg logit contributions
idth-0.289
 absorbed-0.283
 resur-0.263
etically-0.259
eston-0.259
Token he
Token ID339
Feature activation-1.85e-01

Pos logit contributions
+0.270
rogen+0.267
ourse+0.261
inho+0.256
enstein+0.252
Neg logit contributions
idth-0.265
 absorbed-0.261
 resur-0.245
etically-0.240
eston-0.240
Token!
Token ID0
Feature activation-1.44e-01

Pos logit contributions
+3.590
rogen+3.522
ourse+3.445
enstein+3.357
 Vul+3.327
Neg logit contributions
idth-4.397
 absorbed-4.214
eston-3.939
etically-3.928
 resur-3.888
Token Sign
Token ID5865
Feature activation-6.00e-02

Pos logit contributions
+0.990
rogen+0.987
ourse+0.963
inho+0.955
igh+0.931
Neg logit contributions
idth-1.411
 absorbed-1.372
 resur-1.286
eston-1.281
etically-1.280
Token up
Token ID510
Feature activation-2.28e-02

Pos logit contributions
+0.589
rogen+0.583
ourse+0.572
inho+0.565
enstein+0.556
Neg logit contributions
idth-0.416
 absorbed-0.406
 resur-0.374
etically-0.367
eston-0.367
Token for
Token ID329
Feature activation+1.10e-01

Pos logit contributions
+0.189
rogen+0.187
ourse+0.183
inho+0.180
enstein+0.176
Neg logit contributions
idth-0.192
 absorbed-0.189
 resur-0.177
etically-0.174
eston-0.174
Token Take
Token ID7214
Feature activation+3.32e-01

Pos logit contributions
idth+0.840
 absorbed+0.816
 resur+0.756
eston+0.750
etically+0.749
Neg logit contributions
-1.002
rogen-0.992
ourse-0.974
inho-0.957
enstein-0.941
Token Action
Token ID7561
Feature activation-7.35e-01

Pos logit contributions
 absorbed+2.077
idth+2.051
eston+1.985
etically+1.952
 resur+1.937
Neg logit contributions
-3.373
rogen-3.247
ourse-3.191
inho-3.151
enstein-3.146
Token Now
Token ID2735
Feature activation-1.35e-01

Pos logit contributions
rogen+5.527
ourse+5.430
+5.424
inho+5.289
enstein+5.266
Neg logit contributions
idth-6.671
 absorbed-6.309
 carbohyd-5.983
etically-5.838
 resur-5.794
Token and
Token ID290
Feature activation-3.83e-01

Pos logit contributions
+1.094
rogen+1.078
ourse+1.054
inho+1.042
enstein+1.018
Neg logit contributions
idth-1.166
 absorbed-1.158
 resur-1.094
etically-1.072
eston-1.067
Token we
Token ID356
Feature activation-3.09e-01

Pos logit contributions
rogen+2.847
+2.835
inho+2.758
ourse+2.719
 Vul+2.670
Neg logit contributions
idth-3.564
 absorbed-3.325
eston-3.133
etically-3.122
 resur-3.113
Token
Token ID447
Feature activation-1.86e-01

Pos logit contributions
+1.787
rogen+1.738
ourse+1.692
inho+1.679
igh+1.656
Neg logit contributions
idth-3.386
 absorbed-3.276
 resur-3.134
etically-3.091
eston-3.085
Token
Token ID247
Feature activation-2.96e-01

Pos logit contributions
+1.687
rogen+1.669
ourse+1.640
inho+1.619
igh+1.575
Neg logit contributions
idth-1.449
 absorbed-1.402
 resur-1.298
etically-1.285
eston-1.270
Tokené
Token ID2634
Feature activation+9.19e-02

Pos logit contributions
rogen+2.160
+2.153
ourse+2.142
inho+2.077
igh+2.016
Neg logit contributions
idth-2.381
 absorbed-2.337
 resur-2.196
etically-2.165
eston-2.139
Tokennat
Token ID32353
Feature activation-7.16e-01

Pos logit contributions
idth+1.032
 absorbed+1.015
 resur+0.966
eston+0.958
etically+0.956
Neg logit contributions
-0.508
rogen-0.489
ourse-0.472
inho-0.461
enstein-0.457
Token Imp
Token ID9855
Feature activation-1.33e-01

Pos logit contributions
rogen+5.497
ourse+5.444
+5.397
ury+5.254
inho+5.250
Neg logit contributions
idth-6.188
 absorbed-5.847
etically-5.539
 resur-5.537
eston-5.514
Tokené
Token ID2634
Feature activation+4.14e-01

Pos logit contributions
+1.086
rogen+1.080
ourse+1.059
inho+1.038
enstein+1.020
Neg logit contributions
idth-1.134
 absorbed-1.111
 resur-1.041
etically-1.024
eston-1.021
Tokenrial
Token ID4454
Feature activation-8.62e-01

Pos logit contributions
idth+2.977
 absorbed+2.926
eston+2.705
 resur+2.698
etically+2.680
Neg logit contributions
-3.837
ourse-3.616
rogen-3.583
inho-3.570
enstein-3.525
Token,
Token ID11
Feature activation-7.26e-01

Pos logit contributions
ourse+6.230
rogen+6.178
+6.165
inho+6.056
 Vul+5.906
Neg logit contributions
idth-7.561
 absorbed-7.070
etically-6.830
eston-6.769
 resur-6.680
Token le
Token ID443
Feature activation+2.60e-01

Pos logit contributions
+5.179
rogen+5.173
ourse+5.118
inho+4.969
 Vul+4.924
Neg logit contributions
idth-6.631
 absorbed-6.217
etically-5.983
eston-5.939
 resur-5.904
Token contr
Token ID3445
Feature activation-4.62e-01

Pos logit contributions
idth+2.264
 absorbed+2.252
 resur+2.142
etically+2.077
eston+2.069
Neg logit contributions
-2.053
rogen-2.013
inho-1.933
ourse-1.930
enstein-1.872
Tokenat
Token ID265
Feature activation-3.70e-01

Pos logit contributions
rogen+3.222
ourse+3.208
+3.158
inho+3.136
ury+2.978
Neg logit contributions
idth-4.366
 absorbed-4.320
 resur-4.085
etically-4.012
eston-3.985
Token qu
Token ID627
Feature activation+1.86e-01

Pos logit contributions
rogen+2.963
+2.957
ourse+2.904
inho+2.855
enstein+2.768
Neg logit contributions
idth-3.194
 absorbed-3.104
 resur-2.919
etically-2.891
eston-2.889
Token'
Token ID6
Feature activation-3.78e-02

Pos logit contributions
idth+1.459
 absorbed+1.437
 resur+1.336
eston+1.327
etically+1.326
Neg logit contributions
-1.666
rogen-1.615
ourse-1.559
inho-1.539
enstein-1.535
Tokenua
Token ID6413
Feature activation-2.61e-01

Pos logit contributions
rogen+3.634
ourse+3.589
inho+3.584
+3.552
ury+3.426
Neg logit contributions
idth-6.717
 absorbed-6.626
 resur-6.253
etically-6.180
 absorbing-6.169
Token is
Token ID318
Feature activation-1.09e-01

Pos logit contributions
+1.956
rogen+1.931
ourse+1.887
inho+1.854
enstein+1.815
Neg logit contributions
idth-2.409
 absorbed-2.346
 resur-2.211
eston-2.200
etically-2.199
Token the
Token ID262
Feature activation-7.80e-02

Pos logit contributions
+0.887
rogen+0.874
ourse+0.856
inho+0.839
enstein+0.823
Neg logit contributions
idth-0.943
 absorbed-0.935
 resur-0.876
etically-0.862
eston-0.859
Token first
Token ID717
Feature activation-4.57e-01

Pos logit contributions
+0.658
rogen+0.647
ourse+0.637
inho+0.623
enstein+0.611
Neg logit contributions
idth-0.648
 absorbed-0.641
 resur-0.600
etically-0.591
eston-0.586
Token of
Token ID286
Feature activation-3.65e-02

Pos logit contributions
rogen+3.662
+3.653
ourse+3.540
inho+3.507
enstein+3.452
Neg logit contributions
idth-3.882
 absorbed-3.749
eston-3.571
 resur-3.505
etically-3.472
Token two
Token ID734
Feature activation-7.46e-01

Pos logit contributions
+0.290
rogen+0.287
ourse+0.280
inho+0.276
enstein+0.269
Neg logit contributions
idth-0.320
 absorbed-0.315
 resur-0.297
etically-0.294
eston-0.291
Token plastic
Token ID7309
Feature activation-3.47e-01

Pos logit contributions
rogen+5.995
+5.903
ourse+5.754
inho+5.717
igh+5.652
Neg logit contributions
idth-6.096
 absorbed-5.769
eston-5.645
 resur-5.390
MIC-5.380
Token yo
Token ID27406
Feature activation-4.63e-02

Pos logit contributions
+2.681
rogen+2.659
ourse+2.584
inho+2.545
enstein+2.490
Neg logit contributions
idth-3.091
 absorbed-3.018
eston-2.843
 resur-2.834
etically-2.806
Token-
Token ID12
Feature activation-2.26e-01

Pos logit contributions
+0.342
rogen+0.336
ourse+0.328
inho+0.322
enstein+0.314
Neg logit contributions
idth-0.434
 absorbed-0.427
 resur-0.404
etically-0.399
eston-0.397
Tokenyo
Token ID8226
Feature activation-1.92e-02

Pos logit contributions
+1.985
rogen+1.980
ourse+1.924
inho+1.909
enstein+1.881
Neg logit contributions
idth-1.770
 absorbed-1.750
 resur-1.621
eston-1.592
etically-1.583
Token in
Token ID287
Feature activation-9.06e-02

Pos logit contributions
+0.160
rogen+0.157
ourse+0.154
inho+0.151
enstein+0.148
Neg logit contributions
idth-0.160
 absorbed-0.159
 resur-0.149
etically-0.147
eston-0.145
Token it
Token ID340
Feature activation-5.65e-03

Pos logit contributions
 absorbed+1.479
idth+1.475
 resur+1.377
eston+1.354
etically+1.347
Neg logit contributions
-1.863
ourse-1.810
rogen-1.804
inho-1.750
enstein-1.731
Token,
Token ID11
Feature activation-1.44e-01

Pos logit contributions
+0.052
rogen+0.051
ourse+0.050
inho+0.049
enstein+0.048
Neg logit contributions
idth-0.043
 absorbed-0.042
 resur-0.040
etically-0.039
eston-0.039
Token and
Token ID290
Feature activation-3.43e-02

Pos logit contributions
+1.128
rogen+1.112
ourse+1.088
inho+1.068
enstein+1.045
Neg logit contributions
idth-1.284
 absorbed-1.262
 resur-1.187
etically-1.171
eston-1.169
Token convert
Token ID10385
Feature activation-4.26e-01

Pos logit contributions
+0.294
ourse+0.283
rogen+0.283
inho+0.276
enstein+0.269
Neg logit contributions
idth-0.283
 absorbed-0.282
 resur-0.265
etically-0.258
eston-0.257
Token it
Token ID340
Feature activation-2.22e-01

Pos logit contributions
rogen+3.206
+3.107
inho+3.045
ourse+2.990
enstein+2.930
Neg logit contributions
idth-3.924
 absorbed-3.767
 carbohyd-3.530
 resur-3.528
etically-3.492
Token to
Token ID284
Feature activation-9.41e-01

Pos logit contributions
+1.751
rogen+1.740
ourse+1.684
inho+1.668
enstein+1.633
Neg logit contributions
idth-1.966
 absorbed-1.920
 resur-1.800
etically-1.778
eston-1.777
Token gr
Token ID1036
Feature activation-9.97e-01

Pos logit contributions
rogen+3.767
+3.559
igh+3.440
inho+3.291
 Vul+3.249
Neg logit contributions
idth-11.503
 absorbed-10.751
eston-10.667
 carbohyd-10.464
hler-10.444
Tokenays
Token ID592
Feature activation-2.08e-01

Pos logit contributions
rogen+5.331
igh+5.140
ourse+5.129
ury+4.957
+4.863
Neg logit contributions
 absorbed-10.522
idth-10.270
 carbohyd-10.082
 resur-9.901
 absorbing-9.768
Tokencale
Token ID38765
Feature activation-1.82e-01

Pos logit contributions
+1.127
rogen+1.118
ourse+1.082
inho+1.055
enstein+1.021
Neg logit contributions
idth-2.335
 absorbed-2.312
 resur-2.198
etically-2.170
eston-2.167
Token image
Token ID2939
Feature activation-6.53e-02

Pos logit contributions
+1.428
rogen+1.422
ourse+1.379
inho+1.360
enstein+1.329
Neg logit contributions
idth-1.604
 absorbed-1.559
 resur-1.464
etically-1.455
eston-1.453
Token =
Token ID796
Feature activation-2.27e-01

Pos logit contributions
+0.517
rogen+0.511
ourse+0.500
inho+0.490
enstein+0.480
Neg logit contributions
idth-0.575
 absorbed-0.564
 resur-0.531
etically-0.525
eston-0.524