TOP ACTIVATIONS
MAX = 6.988

C inelli handle bars and stem . View Full Caption Adam
- Ter rain KO 2 tires specifically designed for off -
" images " ] ) : # load the image ,
the Silver Phant oms more bass slam and better imaging .
control as the best passive speakers , such as K EF
was missing from my main speakers . To make a long
pair but require out board amplification and electronics . A pair
ick ory , apple , pear , plum , and paw
White Rock by Conservative candidate Dianne Watts , is being called
less people have been living under military law for nearly five
the years I spent playing under him that perhaps provide some
techn ological system we live under today that we call Surveillance
it is hard to get rid of them .
, cut flowers , dried flowers . ... You
with a developing Nathan Be aul ieu , will mean Montreal
and big , box y speakers , look like something from
muff ins . Ground fl ax seed also qualifies here as
small stroke while she was under anesthesia , but she won
applications until she retired while under investigation in 2013 , suffered
headset , cr anks and brakes . The wheels are H

MIN ACTIVATIONS
MIN = -7.962

Flag Reply 55 Recommend Recommend Share this
Flag Reply 19 Recommend Recommend Share this
Flag Reply 27 Recommend Recommend Share this
Flag Reply 14 Recommend Recommend Share this
Subscribe to our free newsletters . If you
Subscribe to our free newsletters . Update ,
this material above - 179 ° C . This new study
from Grey s Anat omy . Sh onda would
hormone secret ed by the hypothal amus in response to stress
last . Article Continued Below T od
, Bounty gate affects every aspect of league punishment and stands
1 2 3 Next All <|endoftext|> Republican Sen .
to directly sell its RF ID - based products throughout its
scrap . Article Continued Below The T
10 . 13 71 / journal . p one . 00
INTER M ITT ENT F AST ING as a strategy for
projected rates of obesity and overweight status in both male and
mess seriously Article Continued Below
community . ( Photo by REUTERS / John S omm ers
. Newsletter Sign Up Continue reading the main story

INTERVAL 3.250 - 6.988
CONTAINS 0.910%

They were very healthy this year . There were teams I
breath ( after a steep climb out of the dungeon ),"
and has little time to prepare for cases . ( Photo
interviews had already been carried out under the Police and Criminal
Americans ). Kaine shot back : I can

INTERVAL -0.487 - 3.250
CONTAINS 66.450%

with . If Clinton s position is that she
{ vi : | vim : | ex : } the
en their recovery from the crisis by opening up trade in
" Some satellites take decades to come back after
Bill said -- This is just a little something in 17

INTERVAL -4.224 - -0.487
CONTAINS 32.413%

Thanks to Penn and Lab ov , now 88
s this strange irony that the party s
Jordan Horowitz delivering the news to the world that
State Cowboys also challenged Petty in this fashion and got him
that it is able to meet its deficit targets , given

INTERVAL -7.962 - -4.224
CONTAINS 0.227%

This 41 . 2 - meter per second ( 92 .
shoulder poles , is shown next to the ancient city wall
assassination . But he never forgot his goal , and after
power , 4 34 - lb .- ft . 6 .
down and say , ' well , I need a number
Token C
Token ID327
Feature activation-3.11e-01

Pos logit contributions
 Articles+0.771
 remedy+0.763
aughs+0.726
 Thousand+0.723
 remedies+0.718
Neg logit contributions
 awa-1.099
OOD-1.038
romy-1.026
oz-1.000
Environment-0.970
Tokeninelli
Token ID44076
Feature activation+2.02e+00

Pos logit contributions
 awa+0.223
OOD+0.210
romy+0.207
oz+0.202
Environment+0.192
Neg logit contributions
 Articles-0.216
 remedy-0.214
 Thousand-0.204
aughs-0.203
 remedies-0.203
Token handle
Token ID5412
Feature activation+3.51e+00

Pos logit contributions
 Articles+1.149
 remedy+1.138
aughs+1.077
 Thousand+1.076
 remedies+1.068
Neg logit contributions
 awa-1.709
OOD-1.615
romy-1.595
oz-1.553
Environment-1.509
Tokenbars
Token ID34046
Feature activation+3.78e+00

Pos logit contributions
 Articles+1.950
 remedy+1.924
 Thousand+1.806
 remedies+1.805
aughs+1.791
Neg logit contributions
 awa-3.024
OOD-2.861
romy-2.819
oz-2.709
Environment-2.672
Token and
Token ID290
Feature activation+2.92e-01

Pos logit contributions
 Articles+1.680
 remedy+1.673
 Thousand+1.532
 remedies+1.526
 Warn+1.512
Neg logit contributions
 awa-3.674
OOD-3.481
romy-3.439
oz-3.329
Environment-3.277
Token stem
Token ID10717
Feature activation+6.99e+00

Pos logit contributions
 Articles+0.164
 remedy+0.162
aughs+0.155
 Thousand+0.154
 remedies+0.152
Neg logit contributions
 awa-0.248
OOD-0.234
romy-0.232
oz-0.226
Environment-0.219
Token.
Token ID13
Feature activation+8.18e-01

Pos logit contributions
 Articles+3.402
 remedy+3.330
 Warn+3.092
 remedies+3.091
aughs+3.042
Neg logit contributions
 awa-6.466
OOD-6.159
romy-6.043
oz-5.810
Environment-5.737
Token View
Token ID3582
Feature activation+1.93e+00

Pos logit contributions
 remedy+0.478
 Articles+0.476
 remedies+0.449
aughs+0.448
 Thousand+0.447
Neg logit contributions
 awa-0.674
OOD-0.642
romy-0.628
oz-0.615
Environment-0.598
Token Full
Token ID6462
Feature activation+2.07e-01

Pos logit contributions
 Articles+0.991
 remedy+0.982
 Thousand+0.922
aughs+0.918
 remedies+0.915
Neg logit contributions
 awa-1.743
OOD-1.654
romy-1.633
oz-1.591
Environment-1.550
Token Caption
Token ID11260
Feature activation-4.55e+00

Pos logit contributions
 remedy+0.123
 Articles+0.122
aughs+0.116
 remedies+0.116
 Thousand+0.115
Neg logit contributions
 awa-0.168
OOD-0.159
romy-0.156
oz-0.152
Environment-0.148
Token Adam
Token ID7244
Feature activation+1.91e+00

Pos logit contributions
 awa+3.124
OOD+3.051
oz+2.955
Scene+2.949
Environment+2.897
Neg logit contributions
 remedy-2.972
 remedies-2.874
aughs-2.795
alore-2.786
 Articles-2.758
Token-
Token ID12
Feature activation-3.22e-01

Pos logit contributions
 awa+0.016
OOD+0.015
romy+0.015
oz+0.015
Environment+0.014
Neg logit contributions
 Articles-0.013
 remedy-0.013
 Thousand-0.013
aughs-0.013
 remedies-0.013
TokenTer
Token ID15156
Feature activation+1.30e+00

Pos logit contributions
 awa+0.198
OOD+0.186
romy+0.182
oz+0.176
Environment+0.171
Neg logit contributions
 Articles-0.256
 remedy-0.255
 Thousand-0.244
aughs-0.244
 remedies-0.243
Tokenrain
Token ID3201
Feature activation+1.09e+00

Pos logit contributions
 Articles+1.183
 remedy+1.171
 Thousand+1.136
aughs+1.127
 remedies+1.127
Neg logit contributions
 awa-0.664
OOD-0.597
romy-0.574
oz-0.544
Environment-0.523
Token KO
Token ID21054
Feature activation-3.00e-01

Pos logit contributions
 Articles+0.627
 remedy+0.627
aughs+0.602
 Thousand+0.592
 remedies+0.591
Neg logit contributions
 awa-0.899
OOD-0.858
romy-0.845
oz-0.826
Environment-0.805
Token2
Token ID17
Feature activation-1.43e+00

Pos logit contributions
 awa+0.212
OOD+0.204
romy+0.198
oz+0.195
Environment+0.188
Neg logit contributions
 Articles-0.206
 remedy-0.205
 Thousand-0.196
aughs-0.196
 remedies-0.196
Token tires
Token ID17920
Feature activation+6.83e+00

Pos logit contributions
 awa+1.185
OOD+1.148
romy+1.120
oz+1.105
Environment+1.069
Neg logit contributions
 Articles-0.797
 remedy-0.790
aughs-0.760
 Thousand-0.754
 remedies-0.748
Token specifically
Token ID5734
Feature activation+1.47e+00

Pos logit contributions
 remedy+3.166
 Articles+3.155
 Thousand+2.894
 remedies+2.888
 Warn+2.874
Neg logit contributions
 awa-6.440
OOD-6.077
romy-6.009
oz-5.810
Environment-5.711
Token designed
Token ID3562
Feature activation-1.79e+00

Pos logit contributions
 Articles+0.697
 remedy+0.679
aughs+0.644
 Thousand+0.642
 remedies+0.633
Neg logit contributions
 awa-1.376
OOD-1.316
romy-1.294
oz-1.269
Environment-1.236
Token for
Token ID329
Feature activation-3.06e-01

Pos logit contributions
 awa+1.828
OOD+1.741
romy+1.720
oz+1.680
Environment+1.644
Neg logit contributions
 Articles-0.706
 remedy-0.705
 Thousand-0.645
 remedies-0.640
 Warn-0.631
Token off
Token ID572
Feature activation+1.76e+00

Pos logit contributions
 awa+0.255
OOD+0.241
romy+0.238
oz+0.232
Environment+0.226
Neg logit contributions
 Articles-0.176
 remedy-0.173
aughs-0.166
 Thousand-0.165
 remedies-0.164
Token-
Token ID12
Feature activation+5.86e-01

Pos logit contributions
 remedy+0.961
 Articles+0.956
 remedies+0.901
 Thousand+0.895
 Force+0.891
Neg logit contributions
 awa-1.542
OOD-1.431
romy-1.413
oz-1.353
Environment-1.331
Token "
Token ID366
Feature activation-7.33e-01

Pos logit contributions
 awa+1.328
OOD+1.260
romy+1.239
oz+1.209
Environment+1.179
Neg logit contributions
 Articles-0.817
 remedy-0.810
 Thousand-0.761
aughs-0.759
 remedies-0.757
Tokenimages
Token ID17566
Feature activation-5.86e-01

Pos logit contributions
romy+0.401
OOD+0.400
oz+0.395
 awa+0.394
Environment+0.370
Neg logit contributions
 remedy-0.613
 Articles-0.601
 Thousand-0.589
 Trident-0.573
 Enhance-0.569
Token"
Token ID1
Feature activation+5.80e-01

Pos logit contributions
oz+0.364
OOD+0.364
romy+0.363
 awa+0.351
Environment+0.347
Neg logit contributions
 remedy-0.428
aughs-0.421
 Articles-0.420
 Thousand-0.408
 remedies-0.407
Token ]
Token ID2361
Feature activation+2.45e+00

Pos logit contributions
 remedy+0.302
 Articles+0.295
 remedies+0.284
aughs+0.280
 Thousand+0.279
Neg logit contributions
 awa-0.505
OOD-0.490
romy-0.484
oz-0.475
Environment-0.466
Token )
Token ID1267
Feature activation+8.46e-01

Pos logit contributions
 Articles+1.559
 remedy+1.523
 Thousand+1.449
aughs+1.440
 Warn+1.437
Neg logit contributions
 awa-1.931
OOD-1.819
romy-1.798
oz-1.703
Environment-1.651
Token :
Token ID1058
Feature activation+6.71e+00

Pos logit contributions
 Articles+0.358
 remedy+0.351
 Thousand+0.332
aughs+0.326
 remedies+0.324
Neg logit contributions
 awa-0.832
OOD-0.795
romy-0.787
oz-0.772
Environment-0.753
Token #
Token ID1303
Feature activation+9.47e-01

Pos logit contributions
 remedy+1.854
 Articles+1.813
 Warn+1.629
 Force+1.604
 remedies+1.578
Neg logit contributions
 awa-7.588
OOD-7.369
romy-7.169
oz-6.939
everal-6.870
Token load
Token ID3440
Feature activation-8.52e-01

Pos logit contributions
 Articles+0.535
 remedy+0.534
 Thousand+0.502
 Warn+0.495
 Force+0.495
Neg logit contributions
 awa-0.809
OOD-0.763
romy-0.756
oz-0.727
Environment-0.714
Token the
Token ID262
Feature activation+7.34e-02

Pos logit contributions
OOD+0.628
oz+0.608
 awa+0.601
romy+0.598
Environment+0.584
Neg logit contributions
 remedy-0.534
aughs-0.521
 Articles-0.518
 remedies-0.510
 Thousand-0.503
Token image
Token ID2939
Feature activation-5.12e-01

Pos logit contributions
 Articles+0.049
aughs+0.048
 remedy+0.048
 Thousand+0.047
 remedies+0.047
Neg logit contributions
 awa-0.052
OOD-0.051
romy-0.049
oz-0.049
Environment-0.047
Token,
Token ID11
Feature activation+4.05e-01

Pos logit contributions
 awa+0.380
OOD+0.370
romy+0.364
oz+0.359
Environment+0.349
Neg logit contributions
 Articles-0.321
aughs-0.319
 remedy-0.319
 Thousand-0.307
 remedies-0.302
Token the
Token ID262
Feature activation+1.52e-01

Pos logit contributions
 awa+0.294
OOD+0.278
romy+0.275
oz+0.268
Environment+0.261
Neg logit contributions
 Articles-0.182
 remedy-0.180
 Thousand-0.170
aughs-0.169
 remedies-0.168
Token Silver
Token ID7698
Feature activation+8.26e-01

Pos logit contributions
 Articles+0.089
 remedy+0.088
aughs+0.083
 Thousand+0.083
 remedies+0.083
Neg logit contributions
 awa-0.126
OOD-0.119
romy-0.118
oz-0.114
Environment-0.111
Token Phant
Token ID38699
Feature activation-3.82e-01

Pos logit contributions
 Articles+0.501
 remedy+0.495
 Thousand+0.471
aughs+0.467
 remedies+0.466
Neg logit contributions
 awa-0.668
OOD-0.631
romy-0.620
oz-0.607
Environment-0.586
Tokenoms
Token ID3150
Feature activation+2.12e+00

Pos logit contributions
 awa+0.313
OOD+0.299
romy+0.295
oz+0.288
Environment+0.276
Neg logit contributions
 Articles-0.225
 remedy-0.223
 Thousand-0.212
aughs-0.210
 remedies-0.209
Token more
Token ID517
Feature activation-9.85e-02

Pos logit contributions
 Articles+1.171
 remedy+1.159
 Thousand+1.095
 remedies+1.086
aughs+1.081
Neg logit contributions
 awa-1.827
OOD-1.731
romy-1.707
oz-1.657
Environment-1.616
Token bass
Token ID12702
Feature activation+6.48e+00

Pos logit contributions
 awa+0.081
OOD+0.076
romy+0.075
oz+0.073
Environment+0.071
Neg logit contributions
 Articles-0.058
 remedy-0.057
aughs-0.055
 Thousand-0.055
 remedies-0.054
Token slam
Token ID21158
Feature activation+2.11e-01

Pos logit contributions
 remedy+3.502
 Articles+3.438
 remedies+3.257
 Thousand+3.211
esi+3.198
Neg logit contributions
 awa-5.547
OOD-5.301
romy-5.202
oz-4.983
Environment-4.944
Token and
Token ID290
Feature activation+1.71e-01

Pos logit contributions
 Articles+0.117
 remedy+0.115
 Thousand+0.109
aughs+0.108
 remedies+0.107
Neg logit contributions
 awa-0.182
OOD-0.172
romy-0.170
oz-0.165
Environment-0.161
Token better
Token ID1365
Feature activation-5.61e-01

Pos logit contributions
 Articles+0.101
 remedy+0.099
 Thousand+0.095
aughs+0.095
 remedies+0.094
Neg logit contributions
 awa-0.141
OOD-0.132
romy-0.130
oz-0.127
Environment-0.123
Token imaging
Token ID19560
Feature activation+2.60e+00

Pos logit contributions
 awa+0.453
OOD+0.424
romy+0.419
oz+0.408
Environment+0.395
Neg logit contributions
 Articles-0.342
 remedy-0.334
aughs-0.324
 Thousand-0.321
 remedies-0.317
Token.
Token ID13
Feature activation-4.45e-01

Pos logit contributions
 Articles+1.328
 remedy+1.319
 remedies+1.229
 Thousand+1.228
 Warn+1.217
Neg logit contributions
 awa-2.344
OOD-2.231
romy-2.198
oz-2.138
Environment-2.087
Token control
Token ID1630
Feature activation+2.01e+00

Pos logit contributions
 awa+0.476
OOD+0.445
romy+0.440
oz+0.428
Environment+0.413
Neg logit contributions
 Articles-0.373
 remedy-0.364
aughs-0.352
 Thousand-0.351
esi-0.346
Token as
Token ID355
Feature activation-2.48e-01

Pos logit contributions
 Articles+1.051
 remedy+1.043
 Thousand+0.978
 remedies+0.974
 Warn+0.966
Neg logit contributions
 awa-1.787
OOD-1.697
romy-1.675
oz-1.622
Environment-1.590
Token the
Token ID262
Feature activation+1.00e+00

Pos logit contributions
 awa+0.211
OOD+0.200
romy+0.198
oz+0.192
Environment+0.187
Neg logit contributions
 Articles-0.139
 remedy-0.137
aughs-0.131
 Thousand-0.129
esi-0.128
Token best
Token ID1266
Feature activation+8.08e-01

Pos logit contributions
 Articles+0.600
 remedy+0.593
 Thousand+0.563
aughs+0.558
 remedies+0.558
Neg logit contributions
 awa-0.816
OOD-0.771
romy-0.759
oz-0.737
Environment-0.716
Token passive
Token ID14513
Feature activation+5.77e+00

Pos logit contributions
 Articles+0.444
 remedy+0.436
aughs+0.416
 Thousand+0.410
esi+0.408
Neg logit contributions
 awa-0.699
OOD-0.661
romy-0.654
oz-0.638
Environment-0.618
Token speakers
Token ID11636
Feature activation+6.34e+00

Pos logit contributions
 remedy+3.162
 Articles+3.151
 remedies+2.965
 Thousand+2.962
 Warn+2.917
Neg logit contributions
 awa-4.959
OOD-4.743
romy-4.628
oz-4.500
Environment-4.388
Token,
Token ID11
Feature activation+9.60e-02

Pos logit contributions
 remedy+3.096
 Articles+3.060
 remedies+2.842
 Thousand+2.841
 Warn+2.819
Neg logit contributions
 awa-5.833
OOD-5.571
romy-5.483
oz-5.333
Environment-5.213
Token such
Token ID884
Feature activation-6.54e-01

Pos logit contributions
 Articles+0.054
 remedy+0.053
aughs+0.051
esi+0.050
 Thousand+0.050
Neg logit contributions
 awa-0.081
OOD-0.077
romy-0.076
oz-0.074
Environment-0.072
Token as
Token ID355
Feature activation-7.80e-01

Pos logit contributions
 awa+0.456
romy+0.446
OOD+0.444
oz+0.427
Environment+0.415
Neg logit contributions
 Articles-0.436
 remedy-0.435
aughs-0.435
esi-0.419
 Thousand-0.416
Token K
Token ID509
Feature activation-5.13e-01

Pos logit contributions
 awa+0.675
OOD+0.638
romy+0.631
oz+0.616
Environment+0.598
Neg logit contributions
 Articles-0.425
 remedy-0.421
aughs-0.400
 Thousand-0.393
 remedies-0.393
TokenEF
Token ID25425
Feature activation+6.55e-02

Pos logit contributions
 awa+0.275
OOD+0.254
romy+0.248
oz+0.238
Environment+0.226
Neg logit contributions
 Articles-0.451
 remedy-0.447
 Thousand-0.432
aughs-0.430
 remedies-0.429
Token was
Token ID373
Feature activation+2.28e-01

Pos logit contributions
 awa+0.824
romy+0.781
OOD+0.779
oz+0.762
Environment+0.730
Neg logit contributions
 Articles-0.539
 remedy-0.518
aughs-0.515
 Thousand-0.505
 remedies-0.495
Token missing
Token ID4814
Feature activation-1.88e-01

Pos logit contributions
 Articles+0.149
 remedy+0.145
aughs+0.142
 Thousand+0.141
 remedies+0.138
Neg logit contributions
 awa-0.176
OOD-0.163
romy-0.162
oz-0.158
Environment-0.151
Token from
Token ID422
Feature activation-1.57e+00

Pos logit contributions
 awa+0.170
OOD+0.161
romy+0.158
oz+0.155
Environment+0.149
Neg logit contributions
 Articles-0.097
 remedy-0.095
aughs-0.090
 Thousand-0.090
 remedies-0.089
Token my
Token ID616
Feature activation-1.88e-02

Pos logit contributions
 awa+1.332
OOD+1.262
romy+1.248
oz+1.216
Environment+1.172
Neg logit contributions
 Articles-0.880
 remedy-0.869
aughs-0.834
 remedies-0.817
esi-0.816
Token main
Token ID1388
Feature activation+1.41e+00

Pos logit contributions
 awa+0.016
OOD+0.015
romy+0.015
oz+0.014
Environment+0.014
Neg logit contributions
 Articles-0.011
 remedy-0.010
aughs-0.010
 Thousand-0.010
 remedies-0.010
Token speakers
Token ID11636
Feature activation+6.28e+00

Pos logit contributions
 Articles+0.843
 remedy+0.825
aughs+0.792
 Thousand+0.785
 remedies+0.778
Neg logit contributions
 awa-1.159
OOD-1.089
romy-1.078
oz-1.051
Environment-1.013
Token.
Token ID13
Feature activation-8.26e-02

Pos logit contributions
 Articles+3.018
 remedy+3.001
 remedies+2.768
 Thousand+2.765
esi+2.734
Neg logit contributions
 awa-5.843
OOD-5.539
romy-5.477
oz-5.310
Environment-5.196
Token To
Token ID1675
Feature activation+1.17e+00

Pos logit contributions
 awa+0.064
OOD+0.061
romy+0.060
oz+0.059
Environment+0.057
Neg logit contributions
 remedy-0.052
 Articles-0.051
 remedies-0.049
aughs-0.049
 Thousand-0.048
Token make
Token ID787
Feature activation+2.54e+00

Pos logit contributions
 Articles+0.650
 remedy+0.620
aughs+0.609
 Thousand+0.607
esi+0.597
Neg logit contributions
 awa-0.997
OOD-0.947
romy-0.937
oz-0.914
Environment-0.883
Token a
Token ID257
Feature activation+1.86e-01

Pos logit contributions
 Articles+1.217
 remedy+1.198
 Thousand+1.124
aughs+1.114
 remedies+1.110
Neg logit contributions
 awa-2.380
OOD-2.263
romy-2.235
oz-2.178
Environment-2.126
Token long
Token ID890
Feature activation+3.06e-01

Pos logit contributions
 Articles+0.103
 remedy+0.098
aughs+0.097
esi+0.094
 remedies+0.094
Neg logit contributions
 awa-0.160
OOD-0.151
romy-0.151
oz-0.146
Environment-0.141
Tokenpair
Token ID24874
Feature activation-1.02e+00

Pos logit contributions
 awa+0.719
OOD+0.668
romy+0.655
oz+0.632
Environment+0.594
Neg logit contributions
 Articles-1.191
 remedy-1.180
 Thousand-1.138
aughs-1.137
 remedies-1.135
Token but
Token ID475
Feature activation-1.44e-01

Pos logit contributions
 awa+0.894
OOD+0.857
romy+0.838
oz+0.831
Environment+0.796
Neg logit contributions
 Articles-0.531
 remedy-0.525
aughs-0.500
 Thousand-0.495
 remedies-0.489
Token require
Token ID2421
Feature activation-1.31e+00

Pos logit contributions
 awa+0.127
OOD+0.120
romy+0.119
oz+0.116
Environment+0.112
Neg logit contributions
 Articles-0.076
 remedy-0.074
aughs-0.072
esi-0.071
 Thousand-0.070
Token out
Token ID503
Feature activation+1.14e+00

Pos logit contributions
 awa+1.114
OOD+1.052
romy+1.047
oz+1.018
Environment+0.989
Neg logit contributions
 Articles-0.725
 remedy-0.708
aughs-0.696
 Thousand-0.677
esi-0.674
Tokenboard
Token ID3526
Feature activation+8.90e-01

Pos logit contributions
 Articles+0.643
 remedy+0.637
 Thousand+0.610
 remedies+0.603
 Warn+0.593
Neg logit contributions
 awa-0.976
OOD-0.914
romy-0.900
oz-0.869
Environment-0.847
Token amplification
Token ID50250
Feature activation+6.21e+00

Pos logit contributions
 Articles+0.474
 remedy+0.460
aughs+0.445
 Thousand+0.441
esi+0.438
Neg logit contributions
 awa-0.783
OOD-0.742
romy-0.730
oz-0.714
Environment-0.692
Token and
Token ID290
Feature activation-6.95e-01

Pos logit contributions
 Articles+3.054
 remedy+3.032
 remedies+2.830
 Thousand+2.799
 Warn+2.794
Neg logit contributions
 awa-5.731
OOD-5.412
romy-5.372
oz-5.202
Environment-5.093
Token electronics
Token ID17075
Feature activation+3.45e+00

Pos logit contributions
 awa+0.592
OOD+0.553
romy+0.550
oz+0.536
Environment+0.520
Neg logit contributions
 Articles-0.389
 remedy-0.378
aughs-0.375
 Thousand-0.364
esi-0.362
Token.
Token ID13
Feature activation-8.06e-01

Pos logit contributions
 Articles+1.737
 remedy+1.711
 Thousand+1.614
aughs+1.603
 remedies+1.590
Neg logit contributions
 awa-3.146
OOD-2.990
romy-2.950
oz-2.875
Environment-2.802
Token A
Token ID317
Feature activation-1.53e+00

Pos logit contributions
 awa+0.616
OOD+0.598
romy+0.577
oz+0.575
Environment+0.559
Neg logit contributions
 remedy-0.502
aughs-0.485
 Articles-0.483
 remedies-0.474
esi-0.468
Token pair
Token ID5166
Feature activation-2.99e-01

Pos logit contributions
 awa+1.255
OOD+1.189
romy+1.181
oz+1.155
Environment+1.105
Neg logit contributions
 Articles-0.904
 remedy-0.862
aughs-0.854
 remedies-0.831
esi-0.825
Tokenick
Token ID624
Feature activation+1.51e+00

Pos logit contributions
 Articles+0.650
 remedy+0.643
 Thousand+0.618
aughs+0.612
 remedies+0.612
Neg logit contributions
 awa-0.630
OOD-0.592
romy-0.581
oz-0.562
Environment-0.540
Tokenory
Token ID652
Feature activation-3.18e-02

Pos logit contributions
 Articles+0.977
 remedy+0.965
 Thousand+0.921
aughs+0.915
 remedies+0.912
Neg logit contributions
 awa-1.158
OOD-1.090
romy-1.073
oz-1.039
Environment-1.007
Token,
Token ID11
Feature activation-5.85e-01

Pos logit contributions
 awa+0.027
OOD+0.026
romy+0.026
oz+0.025
Environment+0.024
Neg logit contributions
 Articles-0.017
 remedy-0.017
aughs-0.016
 Thousand-0.016
 Warn-0.016
Token apple
Token ID17180
Feature activation+2.51e+00

Pos logit contributions
 awa+0.478
OOD+0.450
romy+0.445
oz+0.432
Environment+0.417
Neg logit contributions
 Articles-0.351
 remedy-0.345
aughs-0.331
 Thousand-0.328
 remedies-0.325
Token,
Token ID11
Feature activation-4.75e-01

Pos logit contributions
 Articles+1.378
 remedy+1.376
 Thousand+1.285
 remedies+1.285
esi+1.267
Neg logit contributions
 awa-2.158
OOD-2.042
romy-2.011
oz-1.953
Environment-1.925
Token pear
Token ID25286
Feature activation+6.16e+00

Pos logit contributions
 awa+0.387
OOD+0.365
romy+0.360
oz+0.350
Environment+0.338
Neg logit contributions
 Articles-0.286
 remedy-0.281
aughs-0.269
 Thousand-0.267
 remedies-0.266
Token,
Token ID11
Feature activation-3.80e-01

Pos logit contributions
 remedy+3.013
 Articles+2.994
 remedies+2.799
esi+2.733
 Trident+2.724
Neg logit contributions
 awa-5.509
OOD-5.197
romy-5.114
Environment-4.999
oz-4.953
Token plum
Token ID22802
Feature activation+2.46e+00

Pos logit contributions
 awa+0.308
OOD+0.290
romy+0.287
oz+0.278
Environment+0.269
Neg logit contributions
 Articles-0.231
 remedy-0.227
aughs-0.217
 Thousand-0.216
 remedies-0.214
Token,
Token ID11
Feature activation-1.01e-01

Pos logit contributions
 Articles+1.297
 remedy+1.296
 Thousand+1.208
 remedies+1.206
esi+1.192
Neg logit contributions
 awa-2.166
OOD-2.052
romy-2.029
oz-1.957
Environment-1.939
Token and
Token ID290
Feature activation+1.65e+00

Pos logit contributions
 awa+0.082
OOD+0.077
romy+0.076
oz+0.074
Environment+0.072
Neg logit contributions
 Articles-0.061
 remedy-0.060
aughs-0.057
 Thousand-0.057
 remedies-0.057
Token paw
Token ID41827
Feature activation+9.20e-01

Pos logit contributions
 Articles+1.015
 remedy+1.006
 Thousand+0.957
 remedies+0.947
aughs+0.943
Neg logit contributions
 awa-1.319
OOD-1.245
romy-1.225
oz-1.188
Environment-1.157
TokenWhite
Token ID12256
Feature activation+1.65e+00

Pos logit contributions
 Articles+0.690
 remedy+0.683
 Thousand+0.656
aughs+0.650
 remedies+0.650
Neg logit contributions
 awa-0.666
OOD-0.621
romy-0.611
oz-0.588
Environment-0.568
Token Rock
Token ID4631
Feature activation-1.08e+00

Pos logit contributions
 Articles+0.886
 remedy+0.876
 Thousand+0.825
aughs+0.819
 remedies+0.819
Neg logit contributions
 awa-1.446
OOD-1.373
romy-1.355
oz-1.319
Environment-1.283
Token by
Token ID416
Feature activation-7.46e-01

Pos logit contributions
 awa+0.925
OOD+0.878
romy+0.865
oz+0.842
Environment+0.818
Neg logit contributions
 Articles-0.605
 remedy-0.599
 Thousand-0.565
aughs-0.563
 remedies-0.561
Token Conservative
Token ID11132
Feature activation-1.54e+00

Pos logit contributions
 awa+0.625
OOD+0.591
romy+0.583
oz+0.568
Environment+0.552
Neg logit contributions
 Articles-0.429
 remedy-0.424
 Thousand-0.401
aughs-0.399
 remedies-0.399
Token candidate
Token ID4540
Feature activation-1.50e+00

Pos logit contributions
 awa+1.353
OOD+1.278
romy+1.257
oz+1.228
Environment+1.201
Neg logit contributions
 Articles-0.814
 remedy-0.804
aughs-0.777
esi-0.758
 Thousand-0.756
Token Dianne
Token ID49197
Feature activation+6.14e+00

Pos logit contributions
 awa+1.210
OOD+1.142
romy+1.123
oz+1.100
Environment+1.063
Neg logit contributions
 Articles-0.904
 remedy-0.897
aughs-0.855
 remedies-0.846
 Thousand-0.843
Token Watts
Token ID27555
Feature activation+1.15e+00

Pos logit contributions
 Articles+3.674
 remedy+3.550
 Warn+3.537
 Thousand+3.484
 Force+3.441
Neg logit contributions
 awa-4.996
OOD-4.661
romy-4.635
oz-4.393
Environment-4.372
Token,
Token ID11
Feature activation+5.82e-01

Pos logit contributions
 Articles+0.651
 remedy+0.643
 Thousand+0.609
aughs+0.603
 remedies+0.603
Neg logit contributions
 awa-0.973
OOD-0.920
romy-0.908
oz-0.881
Environment-0.859
Token is
Token ID318
Feature activation-4.55e-01

Pos logit contributions
 Articles+0.325
 remedy+0.322
aughs+0.306
 Thousand+0.303
esi+0.302
Neg logit contributions
 awa-0.496
OOD-0.471
romy-0.463
oz-0.453
Environment-0.440
Token being
Token ID852
Feature activation+1.52e+00

Pos logit contributions
 awa+0.373
OOD+0.351
romy+0.344
oz+0.339
Environment+0.327
Neg logit contributions
 Articles-0.271
 remedy-0.267
aughs-0.256
 Thousand-0.254
 remedies-0.251
Token called
Token ID1444
Feature activation+8.54e-01

Pos logit contributions
 Articles+0.971
 remedy+0.961
 Thousand+0.916
aughs+0.908
 remedies+0.908
Neg logit contributions
 awa-1.187
OOD-1.117
romy-1.100
oz-1.065
Environment-1.034
Tokenless
Token ID1203
Feature activation+2.59e-01

Pos logit contributions
 Articles+0.327
 remedy+0.323
aughs+0.311
 Thousand+0.310
esi+0.308
Neg logit contributions
 awa-0.451
OOD-0.426
romy-0.417
oz-0.408
Environment-0.394
Token people
Token ID661
Feature activation+1.94e+00

Pos logit contributions
 Articles+0.151
 remedy+0.150
aughs+0.144
 Thousand+0.144
esi+0.142
Neg logit contributions
 awa-0.213
OOD-0.202
romy-0.198
oz-0.193
Environment-0.186
Token have
Token ID423
Feature activation-1.34e-01

Pos logit contributions
 Articles+0.985
 remedy+0.971
 Thousand+0.914
aughs+0.907
 remedies+0.903
Neg logit contributions
 awa-1.764
OOD-1.675
romy-1.652
oz-1.610
Environment-1.569
Token been
Token ID587
Feature activation+1.01e+00

Pos logit contributions
 awa+0.116
OOD+0.110
romy+0.107
oz+0.106
Environment+0.102
Neg logit contributions
 Articles-0.073
 remedy-0.071
aughs-0.069
 Thousand-0.069
esi-0.068
Token living
Token ID2877
Feature activation+5.27e+00

Pos logit contributions
 Articles+0.633
 remedy+0.621
aughs+0.597
 Thousand+0.595
 remedies+0.587
Neg logit contributions
 awa-0.796
OOD-0.744
romy-0.732
oz-0.716
Environment-0.688
Token under
Token ID739
Feature activation+6.13e+00

Pos logit contributions
 Articles+2.714
 remedy+2.665
 Thousand+2.525
 remedies+2.486
 Warn+2.454
Neg logit contributions
 awa-4.731
OOD-4.495
romy-4.461
oz-4.324
Environment-4.242
Token military
Token ID2422
Feature activation-4.17e-01

Pos logit contributions
 Articles+3.434
 remedy+3.354
 Thousand+3.171
 remedies+3.142
 Warn+3.133
Neg logit contributions
 awa-5.245
OOD-4.964
romy-4.932
oz-4.747
Environment-4.654
Token law
Token ID1099
Feature activation-5.63e-01

Pos logit contributions
 awa+0.329
OOD+0.312
romy+0.302
oz+0.300
Environment+0.287
Neg logit contributions
 Articles-0.251
aughs-0.248
 remedy-0.248
 Thousand-0.242
esi-0.239
Token for
Token ID329
Feature activation+2.85e-01

Pos logit contributions
 awa+0.514
OOD+0.488
romy+0.477
oz+0.471
Environment+0.453
Neg logit contributions
 Articles-0.277
 remedy-0.272
aughs-0.266
 Thousand-0.260
esi-0.258
Token nearly
Token ID3016
Feature activation-1.19e+00

Pos logit contributions
 Articles+0.155
 remedy+0.154
aughs+0.150
 Thousand+0.144
esi+0.144
Neg logit contributions
 awa-0.247
OOD-0.233
romy-0.229
oz-0.224
Environment-0.217
Token five
Token ID1936
Feature activation+6.28e-01

Pos logit contributions
 awa+1.030
OOD+0.978
romy+0.961
oz+0.933
Environment+0.905
Neg logit contributions
 Articles-0.643
 remedy-0.633
aughs-0.616
 remedies-0.592
esi-0.589
Token the
Token ID262
Feature activation-5.88e-01

Pos logit contributions
 awa+0.002
OOD+0.002
romy+0.002
oz+0.002
Environment+0.002
Neg logit contributions
 Articles-0.001
 remedy-0.001
aughs-0.001
esi-0.001
 Thousand-0.001
Token years
Token ID812
Feature activation+7.16e-01

Pos logit contributions
 awa+0.476
OOD+0.450
romy+0.440
oz+0.431
Environment+0.417
Neg logit contributions
 Articles-0.349
 remedy-0.345
aughs-0.333
esi-0.326
 Thousand-0.325
Token I
Token ID314
Feature activation-9.35e-02

Pos logit contributions
 Articles+0.398
 remedy+0.393
aughs+0.374
 Thousand+0.370
esi+0.368
Neg logit contributions
 awa-0.614
OOD-0.581
romy-0.570
oz-0.560
Environment-0.541
Token spent
Token ID3377
Feature activation+4.25e-01

Pos logit contributions
 awa+0.076
OOD+0.071
romy+0.070
oz+0.068
Environment+0.066
Neg logit contributions
 Articles-0.057
 remedy-0.056
aughs-0.054
 Thousand-0.054
 remedies-0.053
Token playing
Token ID2712
Feature activation+1.78e+00

Pos logit contributions
 Articles+0.237
 remedy+0.234
aughs+0.225
esi+0.222
 Thousand+0.220
Neg logit contributions
 awa-0.366
OOD-0.342
romy-0.337
oz-0.329
Environment-0.320
Token under
Token ID739
Feature activation+6.03e+00

Pos logit contributions
 Articles+0.933
 remedy+0.920
aughs+0.874
 Thousand+0.862
esi+0.859
Neg logit contributions
 awa-1.583
OOD-1.501
romy-1.474
oz-1.442
Environment-1.401
Token him
Token ID683
Feature activation-4.40e-01

Pos logit contributions
 Articles+3.372
 remedy+3.281
 Warn+3.157
 Thousand+3.149
 Force+3.089
Neg logit contributions
 awa-5.183
OOD-4.878
romy-4.845
oz-4.624
Environment-4.577
Token that
Token ID326
Feature activation-9.66e-01

Pos logit contributions
 awa+0.395
OOD+0.375
romy+0.368
oz+0.362
Environment+0.348
Neg logit contributions
 Articles-0.227
 remedy-0.224
aughs-0.211
 Thousand-0.210
esi-0.210
Token perhaps
Token ID3737
Feature activation-2.34e+00

Pos logit contributions
 awa+0.829
OOD+0.778
romy+0.767
oz+0.754
Environment+0.721
Neg logit contributions
 Articles-0.539
 remedy-0.518
esi-0.509
aughs-0.508
 Thousand-0.501
Token provide
Token ID2148
Feature activation-8.44e-01

Pos logit contributions
 awa+1.974
OOD+1.864
romy+1.825
oz+1.783
Environment+1.711
Neg logit contributions
 Articles-1.318
esi-1.269
 remedy-1.262
aughs-1.261
 Thousand-1.214
Token some
Token ID617
Feature activation-1.18e+00

Pos logit contributions
 awa+0.695
OOD+0.654
romy+0.638
oz+0.629
Environment+0.603
Neg logit contributions
 Articles-0.494
 remedy-0.479
aughs-0.472
esi-0.466
 Thousand-0.462
Tokentechn
Token ID23873
Feature activation+1.42e-01

Pos logit contributions
 Articles+0.019
 remedy+0.019
 Thousand+0.018
 remedies+0.018
esi+0.018
Neg logit contributions
 awa-0.014
OOD-0.013
romy-0.013
oz-0.012
Environment-0.012
Tokenological
Token ID2770
Feature activation+9.77e-01

Pos logit contributions
 Articles+0.080
 remedy+0.079
 Thousand+0.075
aughs+0.074
 remedies+0.074
Neg logit contributions
 awa-0.120
OOD-0.114
romy-0.112
oz-0.109
Environment-0.106
Token system
Token ID1080
Feature activation-1.17e+00

Pos logit contributions
 Articles+0.612
 remedy+0.605
 Thousand+0.577
aughs+0.573
 remedies+0.571
Neg logit contributions
 awa-0.772
OOD-0.726
romy-0.715
oz-0.694
Environment-0.674
Token we
Token ID356
Feature activation+6.83e-01

Pos logit contributions
 awa+1.029
OOD+0.978
romy+0.962
oz+0.942
Environment+0.915
Neg logit contributions
 Articles-0.618
 remedy-0.612
aughs-0.577
 Thousand-0.577
 remedies-0.570
Token live
Token ID2107
Feature activation+4.17e+00

Pos logit contributions
 Articles+0.375
 remedy+0.370
 Thousand+0.351
aughs+0.348
 remedies+0.347
Neg logit contributions
 awa-0.592
OOD-0.560
romy-0.553
oz-0.538
Environment-0.524
Token under
Token ID739
Feature activation+6.01e+00

Pos logit contributions
 Articles+1.821
 remedy+1.797
 Thousand+1.674
 remedies+1.643
 Force+1.609
Neg logit contributions
 awa-4.066
OOD-3.853
romy-3.833
oz-3.736
Environment-3.635
Token today
Token ID1909
Feature activation+7.06e-01

Pos logit contributions
 Articles+2.709
 remedy+2.627
 Thousand+2.470
 remedies+2.434
 Warn+2.407
Neg logit contributions
 awa-5.757
OOD-5.416
romy-5.413
oz-5.191
Environment-5.081
Token that
Token ID326
Feature activation-2.96e-01

Pos logit contributions
 Articles+0.328
 remedy+0.321
 Thousand+0.302
 remedies+0.298
 Warn+0.295
Neg logit contributions
 awa-0.674
OOD-0.640
romy-0.634
oz-0.615
Environment-0.601
Token we
Token ID356
Feature activation-2.61e-01

Pos logit contributions
 awa+0.259
OOD+0.245
romy+0.242
oz+0.237
Environment+0.229
Neg logit contributions
 Articles-0.158
 remedy-0.157
aughs-0.148
esi-0.148
 Thousand-0.148
Token call
Token ID869
Feature activation+5.65e-01

Pos logit contributions
 awa+0.222
OOD+0.210
romy+0.208
oz+0.203
Environment+0.196
Neg logit contributions
 Articles-0.146
 remedy-0.144
 Thousand-0.138
aughs-0.137
 remedies-0.136
Token Surveillance
Token ID34818
Feature activation+8.71e-01

Pos logit contributions
 Articles+0.310
 remedy+0.310
aughs+0.293
 Thousand+0.292
 remedies+0.290
Neg logit contributions
 awa-0.487
OOD-0.461
romy-0.453
oz-0.444
Environment-0.430
Token it
Token ID340
Feature activation+8.09e-01

Pos logit contributions
 Articles+0.654
 remedy+0.642
aughs+0.629
esi+0.620
 Thousand+0.618
Neg logit contributions
 awa-0.998
OOD-0.948
romy-0.932
oz-0.910
Environment-0.879
Token is
Token ID318
Feature activation-7.35e-01

Pos logit contributions
 Articles+0.409
 remedy+0.398
 Thousand+0.381
aughs+0.379
esi+0.374
Neg logit contributions
 awa-0.738
OOD-0.697
romy-0.685
oz-0.677
Environment-0.653
Token hard
Token ID1327
Feature activation-1.78e+00

Pos logit contributions
 awa+0.626
OOD+0.587
romy+0.573
oz+0.569
Environment+0.548
Neg logit contributions
 Articles-0.407
 remedy-0.397
aughs-0.391
esi-0.384
 Thousand-0.380
Token to
Token ID284
Feature activation-6.63e-01

Pos logit contributions
 awa+1.695
OOD+1.613
romy+1.589
oz+1.550
Environment+1.523
Neg logit contributions
 remedy-0.834
 Articles-0.830
 remedies-0.769
 Thousand-0.760
aughs-0.758
Token get
Token ID651
Feature activation+1.02e+00

Pos logit contributions
 awa+0.518
OOD+0.477
romy+0.473
oz+0.462
Environment+0.442
Neg logit contributions
 Articles-0.426
aughs-0.409
 Thousand-0.402
 remedy-0.398
esi-0.396
Token rid
Token ID5755
Feature activation+6.01e+00

Pos logit contributions
 Articles+0.608
 remedy+0.597
aughs+0.579
 Thousand+0.577
esi+0.571
Neg logit contributions
 awa-0.845
OOD-0.792
romy-0.779
oz-0.757
Environment-0.734
Token of
Token ID286
Feature activation+9.97e-01

Pos logit contributions
 Articles+3.304
 remedy+3.260
 Thousand+3.075
 remedies+3.056
aughs+3.035
Neg logit contributions
 awa-5.205
OOD-4.920
romy-4.857
oz-4.708
Environment-4.598
Token them
Token ID606
Feature activation-4.80e-01

Pos logit contributions
 Articles+0.550
 remedy+0.542
aughs+0.524
 Thousand+0.521
esi+0.519
Neg logit contributions
 awa-0.854
OOD-0.807
romy-0.792
oz-0.775
Environment-0.749
Token.
Token ID13
Feature activation-2.54e-01

Pos logit contributions
 awa+0.426
OOD+0.402
romy+0.392
oz+0.388
Environment+0.371
Neg logit contributions
 Articles-0.253
 remedy-0.245
aughs-0.241
 Thousand-0.240
esi-0.234
Token
Token ID198
Feature activation+8.51e-01

Pos logit contributions
 awa+0.198
OOD+0.193
romy+0.185
oz+0.185
Environment+0.180
Neg logit contributions
 remedy-0.154
aughs-0.148
 Articles-0.147
 remedies-0.145
esi-0.144
Token
Token ID198
Feature activation+6.42e-02

Pos logit contributions
 remedy+0.537
aughs+0.523
 Articles+0.520
 remedies+0.509
 Thousand+0.502
Neg logit contributions
OOD-0.639
oz-0.621
romy-0.609
 awa-0.607
Environment-0.599
Token,
Token ID11
Feature activation-5.26e-03

Pos logit contributions
 Articles+1.106
 remedy+1.106
 remedies+1.038
 Thousand+1.022
 Warn+1.007
Neg logit contributions
 awa-1.928
OOD-1.826
romy-1.803
oz-1.749
Environment-1.727
Token cut
Token ID2005
Feature activation+4.16e-01

Pos logit contributions
 awa+0.004
OOD+0.004
romy+0.004
oz+0.004
Environment+0.004
Neg logit contributions
 Articles-0.003
 remedy-0.003
aughs-0.003
 Thousand-0.003
esi-0.003
Token flowers
Token ID12734
Feature activation+5.61e+00

Pos logit contributions
 Articles+0.263
 remedy+0.258
aughs+0.250
 Thousand+0.249
 Warn+0.245
Neg logit contributions
 awa-0.326
OOD-0.305
romy-0.303
oz-0.295
Environment-0.282
Token,
Token ID11
Feature activation-1.16e-01

Pos logit contributions
 remedy+2.721
 Articles+2.716
 remedies+2.569
 Warn+2.464
 Thousand+2.461
Neg logit contributions
 awa-5.112
OOD-4.827
romy-4.801
oz-4.666
Environment-4.618
Token dried
Token ID16577
Feature activation+9.24e-01

Pos logit contributions
 awa+0.094
OOD+0.088
romy+0.087
oz+0.085
Environment+0.082
Neg logit contributions
 Articles-0.070
 remedy-0.069
aughs-0.066
 Thousand-0.066
esi-0.065
Token flowers
Token ID12734
Feature activation+6.00e+00

Pos logit contributions
 Articles+0.511
 remedy+0.503
 Thousand+0.477
aughs+0.476
 Warn+0.470
Neg logit contributions
 awa-0.797
OOD-0.753
romy-0.743
oz-0.723
Environment-0.703
Token.
Token ID13
Feature activation-3.98e-02

Pos logit contributions
 remedy+2.851
 Articles+2.832
 remedies+2.682
 Thousand+2.548
 Warn+2.545
Neg logit contributions
 awa-5.538
OOD-5.224
romy-5.206
oz-5.037
Environment-5.024
Token...
Token ID2644
Feature activation-5.11e-01

Pos logit contributions
 awa+0.033
OOD+0.032
romy+0.031
oz+0.031
Environment+0.030
Neg logit contributions
 remedy-0.022
 Articles-0.022
aughs-0.021
 remedies-0.021
esi-0.021
Token
Token ID564
Feature activation+6.18e-01

Pos logit contributions
 awa+0.381
OOD+0.360
romy+0.353
oz+0.343
Environment+0.333
Neg logit contributions
 Articles-0.338
 remedy-0.337
aughs-0.321
 Thousand-0.320
 remedies-0.318
Token
Token ID250
Feature activation+4.84e-01

Pos logit contributions
 Articles+0.419
 remedy+0.415
 Thousand+0.398
 remedies+0.395
aughs+0.393
Neg logit contributions
 awa-0.456
OOD-0.426
romy-0.418
oz-0.407
Environment-0.394
TokenYou
Token ID1639
Feature activation-1.06e+00

Pos logit contributions
 Articles+0.383
 remedy+0.381
 Thousand+0.366
aughs+0.365
 remedies+0.364
Neg logit contributions
 awa-0.299
OOD-0.280
romy-0.273
oz-0.264
Environment-0.255
Token with
Token ID351
Feature activation+6.35e-01

Pos logit contributions
 awa+1.319
OOD+1.279
romy+1.248
oz+1.235
Environment+1.204
Neg logit contributions
 Articles-0.830
 remedy-0.804
aughs-0.793
esi-0.789
 Thousand-0.772
Token a
Token ID257
Feature activation-3.11e-01

Pos logit contributions
 Articles+0.357
 remedy+0.353
 Thousand+0.334
 remedies+0.330
aughs+0.330
Neg logit contributions
 awa-0.542
OOD-0.513
romy-0.506
oz-0.491
Environment-0.479
Token developing
Token ID5922
Feature activation-1.21e+00

Pos logit contributions
 awa+0.260
OOD+0.245
romy+0.241
oz+0.236
Environment+0.229
Neg logit contributions
 Articles-0.179
 remedy-0.175
aughs-0.169
 Thousand-0.167
 remedies-0.166
Token Nathan
Token ID18106
Feature activation+9.69e-01

Pos logit contributions
 awa+0.991
OOD+0.932
romy+0.916
oz+0.901
Environment+0.876
Neg logit contributions
 Articles-0.708
 remedy-0.698
aughs-0.673
 Thousand-0.664
 remedies-0.663
Token Be
Token ID1355
Feature activation-1.68e-01

Pos logit contributions
 Articles+0.636
 remedy+0.631
aughs+0.599
 Thousand+0.598
 remedies+0.597
Neg logit contributions
 awa-0.732
OOD-0.687
romy-0.678
oz-0.660
Environment-0.636
Tokenaul
Token ID2518
Feature activation+5.99e+00

Pos logit contributions
oz+0.129
 awa+0.126
romy+0.126
OOD+0.124
rosis+0.116
Neg logit contributions
 Articles-0.108
 remedy-0.103
 remedies-0.099
 Trident-0.099
 Thousand-0.095
Tokenieu
Token ID22304
Feature activation+7.48e-01

Pos logit contributions
 Articles+5.193
 remedy+5.154
aughs+5.053
 Thousand+4.994
esi+4.957
Neg logit contributions
 awa-3.270
OOD-2.933
romy-2.930
Environment-2.658
oz-2.631
Token,
Token ID11
Feature activation-1.83e-01

Pos logit contributions
 Articles+0.402
 remedy+0.396
 Thousand+0.375
aughs+0.372
 remedies+0.371
Neg logit contributions
 awa-0.658
OOD-0.623
romy-0.614
oz-0.598
Environment-0.583
Token will
Token ID481
Feature activation+1.86e+00

Pos logit contributions
 awa+0.174
OOD+0.165
romy+0.162
oz+0.160
Environment+0.156
Neg logit contributions
 Articles-0.084
 remedy-0.082
 Thousand-0.078
aughs-0.078
esi-0.078
Token mean
Token ID1612
Feature activation-9.84e-01

Pos logit contributions
 Articles+1.092
 remedy+1.061
 Thousand+1.026
aughs+1.019
esi+1.010
Neg logit contributions
 awa-1.541
OOD-1.450
romy-1.428
oz-1.392
Environment-1.351
Token Montreal
Token ID12871
Feature activation+1.00e-02

Pos logit contributions
 awa+0.830
OOD+0.785
romy+0.772
oz+0.754
Environment+0.732
Neg logit contributions
 Articles-0.561
 remedy-0.554
 Thousand-0.526
aughs-0.524
 remedies-0.520
Token and
Token ID290
Feature activation+2.31e+00

Pos logit contributions
 Articles+0.121
 remedy+0.119
 Thousand+0.113
aughs+0.113
 remedies+0.111
Neg logit contributions
 awa-0.202
OOD-0.192
romy-0.190
oz-0.185
Environment-0.180
Token big
Token ID1263
Feature activation+5.60e-01

Pos logit contributions
 Articles+1.268
 remedy+1.252
 Thousand+1.183
 remedies+1.172
aughs+1.172
Neg logit contributions
 awa-2.004
OOD-1.899
romy-1.872
oz-1.820
Environment-1.773
Token,
Token ID11
Feature activation-1.25e+00

Pos logit contributions
 Articles+0.247
 remedy+0.243
aughs+0.226
 Thousand+0.226
 remedies+0.223
Neg logit contributions
 awa-0.545
OOD-0.519
romy-0.514
oz-0.503
Environment-0.489
Token box
Token ID3091
Feature activation-4.34e-02

Pos logit contributions
 awa+1.029
OOD+0.966
romy+0.948
oz+0.941
Environment+0.900
Neg logit contributions
 Articles-0.729
 remedy-0.713
aughs-0.695
 Thousand-0.679
esi-0.676
Tokeny
Token ID88
Feature activation-2.67e+00

Pos logit contributions
 awa+0.036
OOD+0.035
romy+0.034
oz+0.033
Environment+0.032
Neg logit contributions
 Articles-0.025
 remedy-0.024
 Thousand-0.023
aughs-0.023
 remedies-0.023
Token speakers
Token ID11636
Feature activation+5.99e+00

Pos logit contributions
 awa+2.678
OOD+2.559
romy+2.530
oz+2.504
Environment+2.430
Neg logit contributions
 Articles-1.016
 remedy-0.994
aughs-0.982
esi-0.928
 Thousand-0.922
Token,
Token ID11
Feature activation+1.03e-01

Pos logit contributions
 remedy+2.972
 Articles+2.945
 remedies+2.742
 Thousand+2.713
 Warn+2.703
Neg logit contributions
 awa-5.409
OOD-5.152
romy-5.055
oz-4.903
Environment-4.809
Token look
Token ID804
Feature activation+4.50e+00

Pos logit contributions
 Articles+0.057
 remedy+0.056
 Thousand+0.053
aughs+0.053
 remedies+0.053
Neg logit contributions
 awa-0.089
OOD-0.084
romy-0.083
oz-0.080
Environment-0.078
Token like
Token ID588
Feature activation+1.28e+00

Pos logit contributions
 remedy+2.308
 Articles+2.308
 Thousand+2.169
 remedies+2.146
 Warn+2.114
Neg logit contributions
 awa-3.994
OOD-3.787
romy-3.757
oz-3.595
Environment-3.543
Token something
Token ID1223
Feature activation-8.35e-01

Pos logit contributions
 Articles+0.719
 remedy+0.714
 Thousand+0.676
 remedies+0.670
 Warn+0.664
Neg logit contributions
 awa-1.099
OOD-1.041
romy-1.029
oz-0.995
Environment-0.970
Token from
Token ID422
Feature activation-1.10e+00

Pos logit contributions
 awa+0.726
OOD+0.686
romy+0.674
oz+0.660
Environment+0.642
Neg logit contributions
 Articles-0.457
 remedy-0.449
aughs-0.434
 Thousand-0.423
esi-0.422
Token muff
Token ID27563
Feature activation+3.49e-01

Pos logit contributions
 awa+0.583
OOD+0.547
romy+0.534
oz+0.522
Environment+0.499
Neg logit contributions
 Articles-0.542
 remedy-0.534
aughs-0.519
esi-0.508
 Thousand-0.508
Tokenins
Token ID1040
Feature activation+2.13e+00

Pos logit contributions
 Articles+0.263
 remedy+0.261
 Thousand+0.250
 remedies+0.248
aughs+0.248
Neg logit contributions
 awa-0.231
OOD-0.216
romy-0.212
oz-0.206
Environment-0.196
Token.
Token ID13
Feature activation-3.09e-01

Pos logit contributions
 Articles+0.960
 remedy+0.947
 Thousand+0.884
 remedies+0.878
aughs+0.865
Neg logit contributions
 awa-2.053
OOD-1.949
romy-1.932
oz-1.875
Environment-1.843
Token Ground
Token ID13706
Feature activation+9.23e-01

Pos logit contributions
 awa+0.252
OOD+0.242
romy+0.234
oz+0.231
Environment+0.224
Neg logit contributions
 remedy-0.182
 Articles-0.179
aughs-0.173
 remedies-0.171
esi-0.170
Token fl
Token ID781
Feature activation+2.36e+00

Pos logit contributions
 Articles+0.721
 remedy+0.715
 Thousand+0.687
aughs+0.683
 remedies+0.683
Neg logit contributions
 awa-0.585
OOD-0.543
romy-0.532
oz-0.512
Environment-0.493
Tokenax
Token ID897
Feature activation+5.95e+00

Pos logit contributions
 Articles+1.432
 remedy+1.414
 Thousand+1.354
 Warn+1.345
 remedies+1.328
Neg logit contributions
 awa-1.885
OOD-1.826
romy-1.785
oz-1.775
Environment-1.673
Token seed
Token ID9403
Feature activation+2.70e+00

Pos logit contributions
 Articles+2.369
 remedy+2.359
 remedies+2.168
 Thousand+2.110
esi+2.104
Neg logit contributions
 awa-6.001
OOD-5.731
romy-5.675
oz-5.519
Environment-5.446
Token also
Token ID635
Feature activation+2.31e+00

Pos logit contributions
 Articles+1.344
 remedy+1.331
 Thousand+1.241
 remedies+1.238
aughs+1.230
Neg logit contributions
 awa-2.476
OOD-2.349
romy-2.322
oz-2.258
Environment-2.209
Token qualifies
Token ID37782
Feature activation-6.89e-01

Pos logit contributions
 Articles+1.042
 remedy+1.018
 Thousand+0.964
aughs+0.952
esi+0.940
Neg logit contributions
 awa-2.238
OOD-2.130
romy-2.102
oz-2.055
Environment-1.998
Token here
Token ID994
Feature activation+1.44e+00

Pos logit contributions
 awa+0.590
OOD+0.563
romy+0.552
oz+0.538
Environment+0.521
Neg logit contributions
 Articles-0.379
 remedy-0.375
aughs-0.358
 Thousand-0.356
 remedies-0.350
Token as
Token ID355
Feature activation-3.18e-01

Pos logit contributions
 Articles+0.746
 remedy+0.736
 Thousand+0.694
aughs+0.688
 remedies+0.686
Neg logit contributions
 awa-1.286
OOD-1.221
romy-1.204
oz-1.172
Environment-1.142
Token small
Token ID1402
Feature activation+2.61e-01

Pos logit contributions
 Articles+0.003
aughs+0.003
 remedy+0.003
esi+0.003
 remedies+0.003
Neg logit contributions
 awa-0.004
OOD-0.004
romy-0.004
oz-0.004
Environment-0.004
Token stroke
Token ID14000
Feature activation-1.25e+00

Pos logit contributions
 Articles+0.160
 remedy+0.154
aughs+0.152
 Thousand+0.149
esi+0.149
Neg logit contributions
 awa-0.207
OOD-0.196
romy-0.192
oz-0.189
Environment-0.182
Token while
Token ID981
Feature activation-3.17e+00

Pos logit contributions
 awa+1.059
OOD+0.998
romy+0.976
oz+0.972
Environment+0.921
Neg logit contributions
 Articles-0.691
aughs-0.672
 remedy-0.655
 Thousand-0.648
esi-0.644
Token she
Token ID673
Feature activation+1.49e+00

Pos logit contributions
 awa+2.485
OOD+2.273
romy+2.261
oz+2.213
Environment+2.141
Neg logit contributions
aughs-1.970
 Articles-1.942
 remedy-1.905
esi-1.865
 Thousand-1.836
Token was
Token ID373
Feature activation+1.48e+00

Pos logit contributions
 Articles+0.595
 remedy+0.573
 Thousand+0.547
aughs+0.540
esi+0.528
Neg logit contributions
 awa-1.515
OOD-1.446
romy-1.438
oz-1.406
Environment-1.360
Token under
Token ID739
Feature activation+5.94e+00

Pos logit contributions
 Articles+0.893
 remedy+0.880
aughs+0.846
 Thousand+0.840
 remedies+0.827
Neg logit contributions
 awa-1.198
OOD-1.125
romy-1.107
oz-1.080
Environment-1.049
Token anesthesia
Token ID49592
Feature activation+1.31e-01

Pos logit contributions
 remedy+3.177
 Articles+3.159
 remedies+2.970
 Warn+2.925
 Thousand+2.916
Neg logit contributions
 awa-5.166
OOD-4.911
romy-4.866
oz-4.643
Environment-4.577
Token,
Token ID11
Feature activation-7.61e-01

Pos logit contributions
 Articles+0.071
 remedy+0.069
aughs+0.067
 Thousand+0.066
esi+0.065
Neg logit contributions
 awa-0.113
OOD-0.108
romy-0.106
oz-0.104
Environment-0.100
Token but
Token ID475
Feature activation+2.16e-02

Pos logit contributions
 awa+0.626
OOD+0.597
romy+0.588
oz+0.572
Environment+0.553
Neg logit contributions
 Articles-0.444
 remedy-0.436
aughs-0.421
 Thousand-0.415
esi-0.413
Token she
Token ID673
Feature activation-1.38e-01

Pos logit contributions
 Articles+0.013
 remedy+0.013
aughs+0.013
 Thousand+0.013
esi+0.012
Neg logit contributions
 awa-0.017
OOD-0.016
romy-0.016
oz-0.015
Environment-0.015
Token won
Token ID1839
Feature activation-3.01e-02

Pos logit contributions
 awa+0.118
OOD+0.111
romy+0.110
oz+0.109
Environment+0.103
Neg logit contributions
 Articles-0.077
 remedy-0.074
 Thousand-0.073
aughs-0.072
esi-0.071
Token applications
Token ID5479
Feature activation+9.33e-02

Pos logit contributions
 Articles+0.021
 remedy+0.021
aughs+0.020
 Thousand+0.020
esi+0.019
Neg logit contributions
 awa-0.031
OOD-0.029
romy-0.028
oz-0.028
Environment-0.027
Token until
Token ID1566
Feature activation-8.77e-01

Pos logit contributions
 Articles+0.048
 remedy+0.048
 Thousand+0.045
aughs+0.044
 remedies+0.044
Neg logit contributions
 awa-0.084
OOD-0.079
romy-0.078
oz-0.076
Environment-0.074
Token she
Token ID673
Feature activation-3.52e-01

Pos logit contributions
 awa+0.724
OOD+0.684
romy+0.675
oz+0.662
Environment+0.645
Neg logit contributions
 Articles-0.496
 remedy-0.493
aughs-0.479
esi-0.477
 Thousand-0.472
Token retired
Token ID9880
Feature activation-8.02e-01

Pos logit contributions
 awa+0.328
OOD+0.311
romy+0.309
oz+0.301
Environment+0.293
Neg logit contributions
 Articles-0.168
 remedy-0.163
 Thousand-0.158
aughs-0.157
esi-0.156
Token while
Token ID981
Feature activation-2.53e+00

Pos logit contributions
 awa+0.729
OOD+0.694
romy+0.683
oz+0.671
Environment+0.651
Neg logit contributions
 Articles-0.398
 remedy-0.393
aughs-0.379
 Thousand-0.372
esi-0.367
Token under
Token ID739
Feature activation+5.94e+00

Pos logit contributions
 awa+1.980
OOD+1.835
romy+1.819
oz+1.769
Environment+1.721
Neg logit contributions
 Articles-1.535
 remedy-1.528
aughs-1.511
esi-1.510
 Thousand-1.478
Token investigation
Token ID3645
Feature activation+1.38e+00

Pos logit contributions
 Articles+2.973
 remedy+2.926
 remedies+2.717
 Warn+2.687
 Thousand+2.680
Neg logit contributions
 awa-5.426
OOD-5.171
romy-5.119
oz-4.952
Environment-4.829
Token in
Token ID287
Feature activation+8.56e-01

Pos logit contributions
 Articles+0.664
 remedy+0.655
 Thousand+0.614
aughs+0.607
 remedies+0.607
Neg logit contributions
 awa-1.289
OOD-1.225
romy-1.210
oz-1.178
Environment-1.151
Token 2013
Token ID2211
Feature activation-1.73e+00

Pos logit contributions
 Articles+0.431
 remedy+0.427
aughs+0.415
esi+0.410
 Thousand+0.400
Neg logit contributions
 awa-0.761
OOD-0.724
romy-0.716
oz-0.705
Environment-0.684
Token,
Token ID11
Feature activation-9.20e-01

Pos logit contributions
 awa+1.542
OOD+1.459
romy+1.440
oz+1.400
Environment+1.367
Neg logit contributions
 Articles-0.918
 remedy-0.906
 Thousand-0.852
 remedies-0.847
aughs-0.841
Token suffered
Token ID6989
Feature activation+3.60e-01

Pos logit contributions
 awa+0.814
OOD+0.772
romy+0.763
oz+0.747
Environment+0.724
Neg logit contributions
 Articles-0.479
 remedy-0.473
aughs-0.452
 Thousand-0.450
esi-0.448
Token headset
Token ID23492
Feature activation+5.49e+00

Pos logit contributions
 Articles+0.975
 remedy+0.962
 Thousand+0.911
 remedies+0.901
aughs+0.899
Neg logit contributions
 awa-1.479
OOD-1.401
romy-1.380
oz-1.340
Environment-1.306
Token,
Token ID11
Feature activation-2.94e-01

Pos logit contributions
 Articles+2.942
 remedy+2.900
 remedies+2.701
 Warn+2.700
 Thousand+2.692
Neg logit contributions
 awa-4.816
OOD-4.585
romy-4.486
oz-4.366
Environment-4.267
Token cr
Token ID1067
Feature activation+3.21e+00

Pos logit contributions
 awa+0.246
OOD+0.232
romy+0.230
oz+0.224
Environment+0.217
Neg logit contributions
 Articles-0.168
 remedy-0.166
aughs-0.159
 Thousand-0.158
 remedies-0.156
Tokenanks
Token ID2283
Feature activation+4.87e+00

Pos logit contributions
 Articles+2.311
 remedy+2.285
 Thousand+2.180
 remedies+2.171
aughs+2.171
Neg logit contributions
 awa-2.252
OOD-2.098
romy-2.040
oz-1.965
Environment-1.922
Token and
Token ID290
Feature activation+2.49e-01

Pos logit contributions
 Articles+3.035
 remedy+2.973
 Warn+2.812
 remedies+2.803
 Thousand+2.784
Neg logit contributions
 awa-3.893
OOD-3.660
romy-3.582
oz-3.408
Environment-3.383
Token brakes
Token ID23746
Feature activation+5.92e+00

Pos logit contributions
 Articles+0.139
 remedy+0.137
aughs+0.131
 Thousand+0.130
 remedies+0.128
Neg logit contributions
 awa-0.213
OOD-0.201
romy-0.199
oz-0.194
Environment-0.188
Token.
Token ID13
Feature activation+1.37e-01

Pos logit contributions
 Articles+2.773
 remedy+2.723
 remedies+2.502
 Warn+2.494
 Thousand+2.486
Neg logit contributions
 awa-5.603
OOD-5.305
romy-5.235
oz-5.057
Environment-4.982
Token The
Token ID383
Feature activation+5.33e-01

Pos logit contributions
 remedy+0.081
 Articles+0.079
 remedies+0.076
aughs+0.075
esi+0.075
Neg logit contributions
 awa-0.113
OOD-0.108
romy-0.105
oz-0.104
Environment-0.100
Token wheels
Token ID13666
Feature activation+4.02e+00

Pos logit contributions
 Articles+0.314
 remedy+0.308
aughs+0.299
esi+0.293
 Thousand+0.293
Neg logit contributions
 awa-0.435
OOD-0.411
romy-0.407
oz-0.396
Environment-0.383
Token are
Token ID389
Feature activation+4.00e-01

Pos logit contributions
 Articles+1.872
 remedy+1.871
 remedies+1.708
 Thousand+1.707
aughs+1.696
Neg logit contributions
 awa-3.806
OOD-3.627
romy-3.579
oz-3.452
Environment-3.402
Token H
Token ID367
Feature activation-3.34e-01

Pos logit contributions
 Articles+0.227
 remedy+0.224
aughs+0.215
 Thousand+0.212
 remedies+0.211
Neg logit contributions
 awa-0.338
OOD-0.318
romy-0.313
oz-0.308
Environment-0.298
TokenFlag
Token ID34227
Feature activation-1.99e+00

Pos logit contributions
OOD+0.592
 awa+0.560
oz+0.547
romy+0.537
Environment+0.534
Neg logit contributions
 remedy-1.216
aughs-1.191
 Articles-1.185
 remedies-1.173
 Warn-1.154
Token Reply
Token ID14883
Feature activation-2.41e+00

Pos logit contributions
 awa+1.355
OOD+1.274
oz+1.235
romy+1.232
Environment+1.183
Neg logit contributions
aughs-1.404
 remedy-1.381
esi-1.339
 Articles-1.329
 remedies-1.326
Token
Token ID198
Feature activation-2.40e+00

Pos logit contributions
OOD+1.707
oz+1.666
 awa+1.645
romy+1.609
Environment+1.571
Neg logit contributions
 remedy-1.550
aughs-1.548
 remedies-1.480
 Articles-1.474
alore-1.465
Token
Token ID198
Feature activation-2.93e+00

Pos logit contributions
OOD+1.663
oz+1.628
Environment+1.554
romy+1.539
Scene+1.511
Neg logit contributions
aughs-1.631
 remedy-1.625
 Articles-1.551
 remedies-1.549
alore-1.530
Token55
Token ID2816
Feature activation+8.64e-01

Pos logit contributions
OOD+1.625
 awa+1.621
oz+1.526
romy+1.502
Environment+1.481
Neg logit contributions
 remedy-2.434
 Articles-2.376
aughs-2.362
 remedies-2.331
 Thousand-2.293
Token Recommend
Token ID30361
Feature activation-7.96e+00

Pos logit contributions
aughs+0.593
 remedy+0.586
 Articles+0.575
esi+0.564
 remedies+0.562
Neg logit contributions
 awa-0.604
OOD-0.592
oz-0.578
romy-0.563
Environment-0.533
Token
Token ID198
Feature activation-2.51e+00

Pos logit contributions
oz+4.630
OOD+4.489
Lt+4.077
romy+4.066
Environment+4.057
Neg logit contributions
aughs-5.956
 remedy-5.784
alore-5.676
 remedies-5.628
 warranties-5.493
Token
Token ID198
Feature activation-2.89e+00

Pos logit contributions
OOD+1.702
oz+1.687
Environment+1.609
romy+1.580
Scene+1.548
Neg logit contributions
aughs-1.732
 remedy-1.710
 remedies-1.638
 Articles-1.631
alore-1.616
TokenRecommend
Token ID41248
Feature activation-5.78e+00

Pos logit contributions
OOD+1.370
oz+1.328
 awa+1.303
Environment+1.302
romy+1.226
Neg logit contributions
aughs-2.616
 remedy-2.615
 Articles-2.546
 Thousand-2.545
 Warn-2.530
Token Share
Token ID8734
Feature activation-1.93e+00

Pos logit contributions
oz+3.702
OOD+3.574
 awa+3.557
romy+3.318
 heartbeat+3.281
Neg logit contributions
alore-4.014
aughs-3.918
 remedy-3.917
 remedies-3.859
 warranties-3.686
Token this
Token ID428
Feature activation-1.98e+00

Pos logit contributions
OOD+1.375
oz+1.360
romy+1.333
 awa+1.331
Environment+1.285
Neg logit contributions
aughs-1.222
 Articles-1.181
 remedy-1.175
 Thousand-1.169
esi-1.132
TokenFlag
Token ID34227
Feature activation-1.34e+00

Pos logit contributions
OOD+0.435
oz+0.400
Environment+0.393
romy+0.389
 awa+0.389
Neg logit contributions
 remedy-0.913
aughs-0.894
 Articles-0.887
 remedies-0.882
 Warn-0.867
Token Reply
Token ID14883
Feature activation-2.15e+00

Pos logit contributions
 awa+0.871
OOD+0.858
oz+0.822
romy+0.818
Environment+0.792
Neg logit contributions
aughs-0.961
 remedy-0.938
 remedies-0.907
esi-0.899
 Articles-0.896
Token
Token ID198
Feature activation-1.82e+00

Pos logit contributions
OOD+1.529
oz+1.492
romy+1.438
 awa+1.430
Environment+1.415
Neg logit contributions
 remedy-1.403
aughs-1.390
 remedies-1.337
 Articles-1.335
alore-1.306
Token
Token ID198
Feature activation-2.51e+00

Pos logit contributions
OOD+1.273
oz+1.245
Environment+1.192
romy+1.185
Scene+1.157
Neg logit contributions
aughs-1.218
 remedy-1.216
 Articles-1.168
 remedies-1.159
alore-1.142
Token19
Token ID1129
Feature activation-2.41e-01

Pos logit contributions
OOD+1.436
 awa+1.415
oz+1.344
romy+1.324
Environment+1.302
Neg logit contributions
 remedy-2.055
aughs-2.002
 Articles-1.999
 remedies-1.968
 Thousand-1.933
Token Recommend
Token ID30361
Feature activation-7.76e+00

Pos logit contributions
 awa+0.164
OOD+0.162
oz+0.161
romy+0.153
stic+0.145
Neg logit contributions
aughs-0.169
 remedy-0.165
 Articles-0.162
 remedies-0.159
esi-0.158
Token
Token ID198
Feature activation-2.21e+00

Pos logit contributions
oz+4.519
OOD+4.417
romy+4.010
Scene+4.006
Environment+4.005
Neg logit contributions
aughs-5.813
 remedy-5.619
alore-5.537
 remedies-5.465
 warranties-5.357
Token
Token ID198
Feature activation-2.65e+00

Pos logit contributions
OOD+1.513
oz+1.496
Environment+1.428
romy+1.411
Scene+1.375
Neg logit contributions
aughs-1.503
 remedy-1.491
 remedies-1.429
 Articles-1.428
 Thousand-1.406
TokenRecommend
Token ID41248
Feature activation-5.56e+00

Pos logit contributions
OOD+1.280
oz+1.226
Environment+1.195
 awa+1.166
romy+1.121
Neg logit contributions
 remedy-2.406
aughs-2.391
 Articles-2.336
 remedies-2.330
 Thousand-2.313
Token Share
Token ID8734
Feature activation-1.59e+00

Pos logit contributions
oz+3.581
OOD+3.496
 awa+3.400
romy+3.276
BUT+3.148
Neg logit contributions
alore-3.807
aughs-3.802
 remedy-3.744
 remedies-3.702
 Articles-3.526
Token this
Token ID428
Feature activation-1.53e+00

Pos logit contributions
OOD+1.132
oz+1.120
romy+1.095
 awa+1.081
Environment+1.055
Neg logit contributions
aughs-1.017
 Articles-0.976
 remedy-0.974
 Thousand-0.966
 remedies-0.938
TokenFlag
Token ID34227
Feature activation-1.51e+00

Pos logit contributions
OOD+0.670
oz+0.616
Environment+0.606
romy+0.602
 awa+0.589
Neg logit contributions
 remedy-1.418
aughs-1.386
 Articles-1.375
 remedies-1.367
 Warn-1.345
Token Reply
Token ID14883
Feature activation-2.27e+00

Pos logit contributions
 awa+0.979
OOD+0.965
oz+0.923
romy+0.922
Environment+0.891
Neg logit contributions
aughs-1.082
 remedy-1.057
 remedies-1.023
 Articles-1.012
esi-1.012
Token
Token ID198
Feature activation-2.15e+00

Pos logit contributions
OOD+1.615
oz+1.574
romy+1.516
 awa+1.501
Environment+1.494
Neg logit contributions
 remedy-1.492
aughs-1.477
 remedies-1.421
 Articles-1.421
alore-1.388
Token
Token ID198
Feature activation-2.78e+00

Pos logit contributions
OOD+1.503
oz+1.468
Environment+1.405
romy+1.393
Scene+1.365
Neg logit contributions
aughs-1.454
 remedy-1.450
 Articles-1.391
 remedies-1.382
alore-1.361
Token27
Token ID1983
Feature activation-5.52e-01

Pos logit contributions
OOD+1.583
 awa+1.557
oz+1.483
romy+1.459
Environment+1.435
Neg logit contributions
 remedy-2.281
aughs-2.220
 Articles-2.215
 remedies-2.184
 Thousand-2.143
Token Recommend
Token ID30361
Feature activation-7.57e+00

Pos logit contributions
OOD+0.370
 awa+0.363
oz+0.363
romy+0.347
Environment+0.331
Neg logit contributions
aughs-0.392
 remedy-0.382
 Articles-0.371
 remedies-0.370
alore-0.367
Token
Token ID198
Feature activation-2.32e+00

Pos logit contributions
oz+4.435
OOD+4.367
romy+3.975
Environment+3.965
Scene+3.941
Neg logit contributions
aughs-5.634
 remedy-5.454
alore-5.334
 remedies-5.282
 warranties-5.168
Token
Token ID198
Feature activation-2.68e+00

Pos logit contributions
OOD+1.587
oz+1.568
Environment+1.497
romy+1.476
Scene+1.441
Neg logit contributions
aughs-1.584
 remedy-1.567
 Articles-1.504
 remedies-1.500
 Thousand-1.482
TokenRecommend
Token ID41248
Feature activation-5.28e+00

Pos logit contributions
OOD+1.294
oz+1.245
Environment+1.200
 awa+1.194
romy+1.144
Neg logit contributions
 remedy-2.426
aughs-2.413
 Articles-2.364
 remedies-2.345
 Thousand-2.345
Token Share
Token ID8734
Feature activation-1.45e+00

Pos logit contributions
oz+3.446
OOD+3.367
 awa+3.281
romy+3.186
 heartbeat+3.006
Neg logit contributions
alore-3.587
aughs-3.556
 remedy-3.541
 remedies-3.478
 Thousand-3.330
Token this
Token ID428
Feature activation-1.73e+00

Pos logit contributions
OOD+1.043
oz+1.030
romy+1.005
 awa+1.001
Environment+0.972
Neg logit contributions
aughs-0.926
 remedy-0.890
 Articles-0.888
 Thousand-0.878
 remedies-0.855
TokenFlag
Token ID34227
Feature activation-1.55e+00

Pos logit contributions
 awa+0.681
OOD+0.636
romy+0.619
oz+0.598
Environment+0.573
Neg logit contributions
 Articles-0.978
 remedy-0.977
 Thousand-0.941
aughs-0.936
 remedies-0.935
Token Reply
Token ID14883
Feature activation-1.49e+00

Pos logit contributions
 awa+1.261
OOD+1.180
romy+1.157
oz+1.144
Environment+1.098
Neg logit contributions
 remedy-0.921
 Articles-0.911
aughs-0.890
esi-0.872
 remedies-0.867
Token
Token ID198
Feature activation-2.31e+00

Pos logit contributions
 awa+1.194
OOD+1.154
romy+1.122
oz+1.108
Environment+1.078
Neg logit contributions
 remedy-0.863
 Articles-0.840
aughs-0.827
esi-0.818
 remedies-0.814
Token
Token ID198
Feature activation-2.73e+00

Pos logit contributions
OOD+1.601
oz+1.563
Environment+1.504
romy+1.474
Scene+1.458
Neg logit contributions
 remedy-1.572
aughs-1.569
 remedies-1.495
alore-1.483
 Articles-1.480
Token14
Token ID1415
Feature activation-6.31e-01

Pos logit contributions
 awa+1.689
OOD+1.628
romy+1.562
oz+1.536
Environment+1.491
Neg logit contributions
 remedy-2.126
 Articles-2.096
aughs-2.062
 remedies-2.030
 Thousand-2.020
Token Recommend
Token ID30361
Feature activation-7.56e+00

Pos logit contributions
 awa+0.485
OOD+0.455
oz+0.442
romy+0.442
Environment+0.423
Neg logit contributions
 remedy-0.398
 Articles-0.397
aughs-0.394
esi-0.379
 remedies-0.378
Token
Token ID198
Feature activation-2.50e+00

Pos logit contributions
oz+4.506
OOD+4.351
romy+3.957
Lt+3.892
stic+3.882
Neg logit contributions
 remedy-5.423
aughs-5.342
alore-5.278
 remedies-5.231
 warranties-5.200
Token
Token ID198
Feature activation-2.84e+00

Pos logit contributions
OOD+1.706
oz+1.688
Environment+1.609
romy+1.570
Scene+1.546
Neg logit contributions
aughs-1.713
 remedy-1.701
 remedies-1.623
 Articles-1.614
 Thousand-1.608
TokenRecommend
Token ID41248
Feature activation-5.41e+00

Pos logit contributions
 awa+1.489
OOD+1.467
oz+1.410
Environment+1.366
romy+1.356
Neg logit contributions
 remedy-2.398
aughs-2.374
 Articles-2.364
 Thousand-2.349
 Warn-2.329
Token Share
Token ID8734
Feature activation-2.01e+00

Pos logit contributions
oz+3.571
 awa+3.449
OOD+3.439
romy+3.235
 heartbeat+3.141
Neg logit contributions
alore-3.659
 remedy-3.616
aughs-3.528
 remedies-3.512
 Warn-3.491
Token this
Token ID428
Feature activation-1.83e+00

Pos logit contributions
 awa+1.483
OOD+1.470
oz+1.437
romy+1.383
Environment+1.359
Neg logit contributions
 Articles-1.226
aughs-1.209
 remedy-1.192
 Warn-1.177
esi-1.175
Token
Token ID198
Feature activation-2.50e+00

Pos logit contributions
OOD+1.430
oz+1.409
Environment+1.349
romy+1.328
stic+1.279
Neg logit contributions
 remedy-1.400
aughs-1.361
 remedies-1.321
 Articles-1.308
 Thousand-1.282
TokenSubscribe
Token ID27125
Feature activation-6.50e-01

Pos logit contributions
OOD+1.175
Environment+1.173
oz+1.129
romy+1.067
 awa+1.056
Neg logit contributions
 remedy-2.302
aughs-2.219
 Articles-2.192
 Thousand-2.190
 remedies-2.183
Token to
Token ID284
Feature activation-1.77e+00

Pos logit contributions
OOD+0.494
oz+0.485
 awa+0.484
romy+0.477
Environment+0.456
Neg logit contributions
 remedy-0.383
 Articles-0.379
aughs-0.377
 remedies-0.359
 Thousand-0.355
Token our
Token ID674
Feature activation-1.85e-01

Pos logit contributions
OOD+1.267
 awa+1.238
oz+1.230
romy+1.228
Environment+1.190
Neg logit contributions
 remedy-1.136
aughs-1.121
 Articles-1.089
 remedies-1.073
 Thousand-1.063
Token free
Token ID1479
Feature activation+9.93e-01

Pos logit contributions
OOD+0.168
 awa+0.164
oz+0.163
romy+0.159
Environment+0.156
Neg logit contributions
 remedy-0.086
aughs-0.083
 Thousand-0.079
 remedies-0.078
esi-0.077
Token newsletters
Token ID16983
Feature activation-7.44e+00

Pos logit contributions
 Articles+0.695
 remedy+0.693
aughs+0.666
 Thousand+0.663
 remedies+0.659
Neg logit contributions
 awa-0.697
OOD-0.660
romy-0.645
oz-0.630
Environment-0.611
Token.
Token ID13
Feature activation-2.00e+00

Pos logit contributions
oz+4.709
OOD+4.568
romy+4.347
rosis+4.343
stic+4.307
Neg logit contributions
aughs-5.136
 Articles-4.883
 remedy-4.761
 Thousand-4.670
 remedies-4.639
Token
Token ID198
Feature activation-1.89e+00

Pos logit contributions
 awa+1.230
OOD+1.184
romy+1.133
oz+1.129
Environment+1.069
Neg logit contributions
 remedy-1.546
 Articles-1.520
aughs-1.479
 remedies-1.477
esi-1.474
Token
Token ID198
Feature activation-2.02e+00

Pos logit contributions
OOD+1.303
oz+1.283
Environment+1.238
romy+1.196
Scene+1.188
Neg logit contributions
 remedy-1.313
aughs-1.286
 remedies-1.258
 Articles-1.234
alore-1.227
TokenIf
Token ID1532
Feature activation+7.34e-01

Pos logit contributions
 awa+1.124
OOD+1.103
romy+1.053
oz+1.038
Environment+1.021
Neg logit contributions
 remedy-1.687
 Articles-1.671
 remedies-1.615
aughs-1.611
 Thousand-1.606
Token you
Token ID345
Feature activation+5.57e-01

Pos logit contributions
 Articles+0.398
 remedy+0.393
 Thousand+0.371
aughs+0.368
 remedies+0.367
Neg logit contributions
 awa-0.642
OOD-0.608
romy-0.600
oz-0.584
Environment-0.569
Token
Token ID198
Feature activation-2.29e+00

Pos logit contributions
OOD+1.277
oz+1.255
Environment+1.202
romy+1.191
stic+1.141
Neg logit contributions
 remedy-1.229
aughs-1.192
 remedies-1.158
 Articles-1.152
 Thousand-1.127
TokenSubscribe
Token ID27125
Feature activation-7.21e-01

Pos logit contributions
OOD+1.092
Environment+1.077
oz+1.045
romy+0.997
 awa+0.996
Neg logit contributions
 remedy-2.096
aughs-2.018
 Articles-2.000
 Thousand-1.994
 remedies-1.989
Token to
Token ID284
Feature activation-1.68e+00

Pos logit contributions
OOD+0.549
oz+0.537
 awa+0.537
romy+0.530
Environment+0.506
Neg logit contributions
 remedy-0.425
 Articles-0.420
aughs-0.418
 remedies-0.399
esi-0.394
Token our
Token ID674
Feature activation-9.60e-02

Pos logit contributions
OOD+1.214
 awa+1.192
oz+1.179
romy+1.179
Environment+1.140
Neg logit contributions
 remedy-1.073
aughs-1.056
 Articles-1.029
 remedies-1.013
 Thousand-1.004
Token free
Token ID1479
Feature activation+1.17e+00

Pos logit contributions
OOD+0.088
 awa+0.086
oz+0.085
romy+0.083
Environment+0.082
Neg logit contributions
 remedy-0.044
aughs-0.043
 Thousand-0.040
 remedies-0.040
 Articles-0.040
Token newsletters
Token ID16983
Feature activation-7.33e+00

Pos logit contributions
 Articles+0.816
 remedy+0.813
aughs+0.782
 Thousand+0.778
 remedies+0.774
Neg logit contributions
 awa-0.821
OOD-0.778
romy-0.760
oz-0.742
Environment-0.720
Token.
Token ID13
Feature activation-1.88e+00

Pos logit contributions
oz+4.666
OOD+4.558
romy+4.334
rosis+4.324
stic+4.289
Neg logit contributions
aughs-5.021
 Articles-4.774
 remedy-4.664
 Thousand-4.554
 remedies-4.530
Token
Token ID198
Feature activation-1.74e+00

Pos logit contributions
 awa+1.161
OOD+1.116
romy+1.069
oz+1.064
Environment+1.008
Neg logit contributions
 remedy-1.459
 Articles-1.433
aughs-1.395
 remedies-1.394
esi-1.392
Token
Token ID198
Feature activation-1.83e+00

Pos logit contributions
OOD+1.204
oz+1.182
Environment+1.143
romy+1.108
Scene+1.097
Neg logit contributions
 remedy-1.201
aughs-1.175
 remedies-1.149
 Articles-1.127
alore-1.125
TokenUpdate
Token ID10260
Feature activation+4.61e-01

Pos logit contributions
 awa+1.027
OOD+1.003
romy+0.959
oz+0.943
Environment+0.926
Neg logit contributions
 remedy-1.528
 Articles-1.515
 remedies-1.462
aughs-1.459
 Thousand-1.456
Token,
Token ID11
Feature activation+1.32e+00

Pos logit contributions
 Articles+0.239
 remedy+0.237
aughs+0.224
 Thousand+0.223
 remedies+0.221
Neg logit contributions
 awa-0.411
OOD-0.390
romy-0.384
oz-0.376
Environment-0.366
Token this
Token ID428
Feature activation-2.46e-01

Pos logit contributions
 awa+0.217
OOD+0.205
romy+0.201
oz+0.198
Environment+0.192
Neg logit contributions
 Articles-0.146
 remedy-0.144
aughs-0.139
 Thousand-0.137
 Warn-0.136
Token material
Token ID2587
Feature activation-3.09e-01

Pos logit contributions
 awa+0.208
OOD+0.196
romy+0.193
oz+0.190
Environment+0.184
Neg logit contributions
 Articles-0.139
 remedy-0.135
aughs-0.133
 Thousand-0.130
 remedies-0.128
Token above
Token ID2029
Feature activation-1.61e-01

Pos logit contributions
 awa+0.267
OOD+0.253
romy+0.248
oz+0.246
Environment+0.238
Neg logit contributions
 Articles-0.167
 remedy-0.163
aughs-0.162
 Thousand-0.157
esi-0.153
Token -
Token ID532
Feature activation+9.25e-01

Pos logit contributions
 awa+0.137
OOD+0.130
romy+0.127
oz+0.125
Environment+0.121
Neg logit contributions
 Articles-0.091
 remedy-0.089
aughs-0.086
 Thousand-0.085
 remedies-0.084
Token179
Token ID21738
Feature activation-1.09e-02

Pos logit contributions
 Articles+0.724
 remedy+0.717
 Thousand+0.690
aughs+0.686
 remedies+0.685
Neg logit contributions
 awa-0.587
OOD-0.546
romy-0.533
oz-0.515
Environment-0.495
Token°
Token ID7200
Feature activation-7.24e+00

Pos logit contributions
 awa+0.008
OOD+0.008
romy+0.008
oz+0.008
Environment+0.007
Neg logit contributions
 Articles-0.007
 remedy-0.007
 Thousand-0.007
 remedies-0.006
aughs-0.006
TokenC
Token ID34
Feature activation-1.83e+00

Pos logit contributions
 awa+3.144
OOD+2.959
oz+2.788
romy+2.664
Environment+2.574
Neg logit contributions
 Articles-6.672
 remedy-6.635
aughs-6.584
 Thousand-6.452
 remedies-6.409
Token.
Token ID13
Feature activation-9.58e-01

Pos logit contributions
 awa+1.609
OOD+1.537
romy+1.496
oz+1.487
Environment+1.438
Neg logit contributions
 Articles-0.959
 remedy-0.942
aughs-0.896
 Thousand-0.894
 remedies-0.881
Token This
Token ID770
Feature activation+7.25e-02

Pos logit contributions
 awa+0.783
OOD+0.752
romy+0.730
oz+0.724
Environment+0.700
Neg logit contributions
 remedy-0.555
 Articles-0.549
aughs-0.529
 remedies-0.523
esi-0.517
Token new
Token ID649
Feature activation-3.02e-01

Pos logit contributions
 Articles+0.035
 remedy+0.034
aughs+0.033
 Thousand+0.033
esi+0.032
Neg logit contributions
 awa-0.067
OOD-0.064
romy-0.063
oz-0.062
Environment-0.060
Token study
Token ID2050
Feature activation+2.71e+00

Pos logit contributions
 awa+0.243
OOD+0.228
romy+0.224
oz+0.220
Environment+0.212
Neg logit contributions
 Articles-0.182
aughs-0.177
 remedy-0.177
 Thousand-0.172
esi-0.170
Token from
Token ID422
Feature activation-1.16e+00

Pos logit contributions
 awa+1.795
OOD+1.697
romy+1.658
oz+1.635
Environment+1.574
Neg logit contributions
 Articles-1.127
 remedy-1.086
aughs-1.061
 Thousand-1.059
 Warn-1.034
Token Grey
Token ID13980
Feature activation-5.50e-01

Pos logit contributions
 awa+1.013
OOD+0.960
romy+0.947
oz+0.923
Environment+0.888
Neg logit contributions
 Articles-0.628
 remedy-0.623
aughs-0.593
 remedies-0.583
 Thousand-0.581
Token
Token ID447
Feature activation-1.32e+00

Pos logit contributions
 awa+0.465
OOD+0.442
romy+0.431
oz+0.426
Environment+0.409
Neg logit contributions
 Articles-0.313
 remedy-0.310
 Thousand-0.294
aughs-0.294
 remedies-0.291
Token
Token ID247
Feature activation-4.29e-01

Pos logit contributions
OOD+0.855
 awa+0.851
romy+0.801
oz+0.787
Environment+0.752
Neg logit contributions
 remedy-0.970
 Articles-0.959
aughs-0.935
 Warn-0.920
 remedies-0.916
Tokens
Token ID82
Feature activation-1.58e+00

Pos logit contributions
 awa+0.448
OOD+0.429
romy+0.423
oz+0.416
Environment+0.404
Neg logit contributions
 Articles-0.157
 remedy-0.156
 Thousand-0.142
aughs-0.141
 remedies-0.141
Token Anat
Token ID28880
Feature activation-7.23e+00

Pos logit contributions
 awa+1.328
OOD+1.262
romy+1.231
oz+1.212
Environment+1.160
Neg logit contributions
 Articles-0.896
 remedy-0.893
aughs-0.842
 remedies-0.839
 Thousand-0.834
Tokenomy
Token ID9145
Feature activation-3.50e+00

Pos logit contributions
oz+2.607
romy+2.453
OOD+2.353
rosis+2.260
 awa+2.089
Neg logit contributions
 Articles-7.210
 Warn-7.135
 remedy-7.095
 Thousand-7.056
aughs-7.019
Token.
Token ID13
Feature activation-1.17e+00

Pos logit contributions
 awa+2.989
OOD+2.846
romy+2.772
oz+2.745
Environment+2.621
Neg logit contributions
 Articles-1.901
 remedy-1.897
 Thousand-1.783
 remedies-1.780
aughs-1.780
Token Sh
Token ID911
Feature activation+1.66e+00

Pos logit contributions
 awa+0.898
OOD+0.878
oz+0.842
romy+0.842
Environment+0.810
Neg logit contributions
 remedy-0.735
 remedies-0.691
 Articles-0.690
aughs-0.685
alore-0.680
Tokenonda
Token ID13533
Feature activation+1.30e+00

Pos logit contributions
 Articles+1.138
 remedy+1.126
 Thousand+1.078
aughs+1.069
 remedies+1.068
Neg logit contributions
 awa-1.219
OOD-1.143
romy-1.125
oz-1.088
Environment-1.053
Token would
Token ID561
Feature activation-4.25e-02

Pos logit contributions
 Articles+0.662
 remedy+0.654
 Thousand+0.610
aughs+0.604
 remedies+0.601
Neg logit contributions
 awa-1.179
OOD-1.118
romy-1.098
oz-1.083
Environment-1.054
Token hormone
Token ID17770
Feature activation-3.89e+00

Pos logit contributions
 awa+1.005
romy+0.936
OOD+0.933
oz+0.911
Environment+0.858
Neg logit contributions
 Articles-0.856
aughs-0.831
 remedy-0.831
 Thousand-0.807
esi-0.805
Token secret
Token ID3200
Feature activation-9.82e-01

Pos logit contributions
 awa+2.983
OOD+2.815
romy+2.787
oz+2.768
Environment+2.608
Neg logit contributions
 Articles-2.445
aughs-2.390
 remedy-2.364
esi-2.327
 Thousand-2.316
Tokened
Token ID276
Feature activation-4.46e+00

Pos logit contributions
 awa+0.694
romy+0.654
OOD+0.650
oz+0.638
Environment+0.601
Neg logit contributions
 Articles-0.694
 remedy-0.673
 Thousand-0.657
aughs-0.657
 remedies-0.650
Token by
Token ID416
Feature activation-1.57e+00

Pos logit contributions
 awa+3.865
OOD+3.653
romy+3.629
oz+3.560
Environment+3.424
Neg logit contributions
 Articles-2.399
aughs-2.319
 remedy-2.294
 Thousand-2.216
esi-2.197
Token the
Token ID262
Feature activation-1.64e+00

Pos logit contributions
 awa+1.310
OOD+1.237
romy+1.225
oz+1.193
Environment+1.152
Neg logit contributions
 Articles-0.904
 remedy-0.889
aughs-0.859
 Thousand-0.845
 remedies-0.836
Token hypothal
Token ID48641
Feature activation-7.19e+00

Pos logit contributions
 awa+1.088
OOD+1.015
romy+1.003
oz+0.969
Environment+0.930
Neg logit contributions
 Articles-1.216
 remedy-1.191
aughs-1.176
 Thousand-1.148
 remedies-1.141
Tokenamus
Token ID25509
Feature activation-4.46e+00

Pos logit contributions
 awa+4.743
oz+4.733
OOD+4.693
romy+4.594
Environment+4.349
Neg logit contributions
 Articles-4.980
 Thousand-4.830
 remedy-4.820
aughs-4.765
 remedies-4.640
Token in
Token ID287
Feature activation-2.00e+00

Pos logit contributions
 awa+3.688
OOD+3.511
romy+3.480
oz+3.427
Environment+3.273
Neg logit contributions
 Articles-2.551
 remedy-2.491
aughs-2.445
 Thousand-2.399
 Warn-2.332
Token response
Token ID2882
Feature activation-3.56e+00

Pos logit contributions
 awa+1.763
OOD+1.678
romy+1.668
oz+1.631
Environment+1.572
Neg logit contributions
 Articles-1.031
 remedy-1.006
aughs-0.986
 Thousand-0.964
esi-0.951
Token to
Token ID284
Feature activation-1.50e+00

Pos logit contributions
OOD+2.521
oz+2.507
romy+2.484
 awa+2.379
Environment+2.355
Neg logit contributions
aughs-2.270
 Articles-2.222
 remedy-2.196
 Thousand-2.144
 remedies-2.073
Token stress
Token ID5503
Feature activation-5.93e-01

Pos logit contributions
 awa+1.227
OOD+1.153
romy+1.141
oz+1.110
Environment+1.079
Neg logit contributions
 Articles-0.887
 remedy-0.868
aughs-0.851
 Thousand-0.833
esi-0.823
Token last
Token ID938
Feature activation-1.84e+00

Pos logit contributions
 Articles+0.446
 remedy+0.435
aughs+0.424
esi+0.414
 Thousand+0.413
Neg logit contributions
 awa-0.780
OOD-0.735
romy-0.727
oz-0.711
Environment-0.686
Token.
Token ID13
Feature activation+2.41e-01

Pos logit contributions
 awa+1.475
OOD+1.372
romy+1.337
oz+1.333
Environment+1.271
Neg logit contributions
 Articles-1.092
aughs-1.062
 remedy-1.062
 Thousand-1.036
esi-1.035
Token
Token ID198
Feature activation+7.92e-01

Pos logit contributions
 remedy+0.139
 Articles+0.136
 remedies+0.131
aughs+0.130
esi+0.128
Neg logit contributions
 awa-0.200
OOD-0.191
romy-0.186
oz-0.184
Environment-0.178
Token
Token ID198
Feature activation+5.66e-02

Pos logit contributions
 remedy+0.498
 Articles+0.484
aughs+0.483
 remedies+0.473
 Thousand+0.465
Neg logit contributions
OOD-0.598
oz-0.581
romy-0.571
 awa-0.568
Environment-0.559
TokenArticle
Token ID14906
Feature activation+3.13e-01

Pos logit contributions
 Articles+0.046
 remedy+0.046
 Thousand+0.044
aughs+0.044
 remedies+0.044
Neg logit contributions
 awa-0.034
OOD-0.031
romy-0.031
oz-0.029
Environment-0.028
Token Continued
Token ID22168
Feature activation-7.08e+00

Pos logit contributions
 remedy+0.193
 Articles+0.191
aughs+0.184
 Thousand+0.184
 remedies+0.182
Neg logit contributions
 awa-0.247
OOD-0.232
romy-0.226
oz-0.223
Environment-0.216
Token Below
Token ID10383
Feature activation-3.96e+00

Pos logit contributions
OOD+3.901
Environment+3.588
oz+3.471
Scene+3.449
 awa+3.331
Neg logit contributions
 remedy-5.723
 remedies-5.537
alore-5.519
aughs-5.388
esi-5.371
Token
Token ID198
Feature activation-5.22e-01

Pos logit contributions
OOD+2.642
oz+2.581
Environment+2.471
Scene+2.422
Lt+2.371
Neg logit contributions
 remedy-2.799
aughs-2.725
 remedies-2.704
alore-2.621
 warranties-2.574
Token
Token ID198
Feature activation-1.12e+00

Pos logit contributions
OOD+0.376
oz+0.368
romy+0.358
Environment+0.358
 awa+0.347
Neg logit contributions
 remedy-0.343
aughs-0.332
 Articles-0.327
 remedies-0.327
 Thousand-0.317
TokenT
Token ID51
Feature activation+5.93e-01

Pos logit contributions
 awa+0.665
OOD+0.641
romy+0.616
oz+0.604
Environment+0.586
Neg logit contributions
 remedy-0.901
 Articles-0.892
 remedies-0.860
 Thousand-0.859
aughs-0.857
Tokenod
Token ID375
Feature activation-1.46e-01

Pos logit contributions
 Articles+0.358
 remedy+0.354
 Thousand+0.337
aughs+0.335
 remedies+0.333
Neg logit contributions
 awa-0.483
OOD-0.452
romy-0.445
oz-0.430
Environment-0.423
Token,
Token ID11
Feature activation+2.60e-01

Pos logit contributions
 Articles+0.257
 remedy+0.253
 Thousand+0.238
aughs+0.236
 remedies+0.234
Neg logit contributions
 awa-0.484
OOD-0.460
romy-0.454
oz-0.443
Environment-0.431
Token Bounty
Token ID38166
Feature activation+4.92e-01

Pos logit contributions
 Articles+0.139
 remedy+0.137
aughs+0.132
esi+0.132
 Thousand+0.130
Neg logit contributions
 awa-0.225
OOD-0.214
romy-0.211
oz-0.206
Environment-0.199
Tokengate
Token ID10494
Feature activation+6.93e-02

Pos logit contributions
 Articles+0.247
 remedy+0.241
 Thousand+0.232
aughs+0.230
 Trident+0.224
Neg logit contributions
 awa-0.434
OOD-0.426
oz-0.423
romy-0.421
Environment-0.400
Token affects
Token ID10975
Feature activation-7.35e-01

Pos logit contributions
 Articles+0.034
 remedy+0.033
 Thousand+0.031
aughs+0.031
esi+0.031
Neg logit contributions
 awa-0.064
OOD-0.061
romy-0.060
oz-0.059
Environment-0.057
Token every
Token ID790
Feature activation-3.72e+00

Pos logit contributions
 awa+0.625
OOD+0.587
romy+0.580
oz+0.566
Environment+0.546
Neg logit contributions
 Articles-0.412
 remedy-0.405
aughs-0.392
esi-0.388
 Thousand-0.383
Token aspect
Token ID4843
Feature activation-6.96e+00

Pos logit contributions
 awa+2.735
OOD+2.539
romy+2.493
oz+2.486
Environment+2.331
Neg logit contributions
 Articles-2.453
aughs-2.378
 remedy-2.343
esi-2.330
 Thousand-2.302
Token of
Token ID286
Feature activation-1.34e+00

Pos logit contributions
 awa+5.847
OOD+5.529
romy+5.452
oz+5.304
Environment+5.147
Neg logit contributions
 Articles-3.976
 remedy-3.913
 Thousand-3.718
aughs-3.698
 remedies-3.675
Token league
Token ID4652
Feature activation+1.44e+00

Pos logit contributions
 awa+1.124
OOD+1.059
romy+1.045
oz+1.016
Environment+0.983
Neg logit contributions
 Articles-0.760
 remedy-0.749
aughs-0.721
esi-0.716
 Thousand-0.705
Token punishment
Token ID9837
Feature activation+1.69e+00

Pos logit contributions
 Articles+0.884
 remedy+0.872
 Thousand+0.832
aughs+0.829
 remedies+0.821
Neg logit contributions
 awa-1.154
OOD-1.087
romy-1.068
oz-1.038
Environment-1.007
Token and
Token ID290
Feature activation-3.54e-01

Pos logit contributions
 Articles+0.846
 remedy+0.835
 Thousand+0.784
 remedies+0.776
aughs+0.776
Neg logit contributions
 awa-1.550
OOD-1.472
romy-1.453
oz-1.415
Environment-1.381
Token stands
Token ID6296
Feature activation+1.68e+00

Pos logit contributions
 awa+0.304
OOD+0.286
romy+0.281
oz+0.275
Environment+0.267
Neg logit contributions
 Articles-0.194
 remedy-0.188
aughs-0.187
esi-0.184
 Thousand-0.182
Token
Token ID198
Feature activation-7.72e-01

Pos logit contributions
 awa+1.890
OOD+1.800
romy+1.760
oz+1.741
Environment+1.680
Neg logit contributions
 Articles-1.156
 remedy-1.155
aughs-1.105
esi-1.090
 Thousand-1.083
Token
Token ID198
Feature activation-1.27e+00

Pos logit contributions
OOD+0.561
oz+0.549
romy+0.535
Environment+0.525
 awa+0.519
Neg logit contributions
 remedy-0.503
aughs-0.491
 Articles-0.481
 remedies-0.478
alore-0.464
Token1
Token ID16
Feature activation-7.27e-01

Pos logit contributions
 awa+0.832
OOD+0.778
romy+0.763
oz+0.736
Environment+0.709
Neg logit contributions
 Articles-0.962
 remedy-0.954
 Thousand-0.915
aughs-0.910
 remedies-0.910
Token 2
Token ID362
Feature activation-1.54e+00

Pos logit contributions
 awa+0.724
OOD+0.696
romy+0.683
oz+0.677
Environment+0.662
Neg logit contributions
 Articles-0.298
 remedy-0.294
aughs-0.279
 remedies-0.271
esi-0.269
Token 3
Token ID513
Feature activation-7.40e-01

Pos logit contributions
 awa+1.544
OOD+1.484
romy+1.468
oz+1.421
Environment+1.393
Neg logit contributions
 Articles-0.653
 remedy-0.632
 Thousand-0.593
 Warn-0.581
aughs-0.581
Token Next
Token ID7406
Feature activation-6.96e+00

Pos logit contributions
 awa+0.785
OOD+0.748
romy+0.744
oz+0.720
Environment+0.709
Neg logit contributions
 Articles-0.274
 remedy-0.260
 Thousand-0.242
 Warn-0.240
aughs-0.237
Token All
Token ID1439
Feature activation-2.35e-01

Pos logit contributions
OOD+5.841
 awa+5.762
oz+5.749
Environment+5.601
Scene+5.584
Neg logit contributions
 remedy-3.293
 remedies-3.135
esi-3.102
aughs-3.086
alore-2.931
Token<|endoftext|>
Token ID50256
Feature activation+1.01e+00

Pos logit contributions
 awa+0.177
OOD+0.166
romy+0.162
oz+0.158
Environment+0.153
Neg logit contributions
 remedy-0.155
 Articles-0.154
aughs-0.148
 Thousand-0.148
 remedies-0.147
TokenRepublican
Token ID25777
Feature activation+6.75e-01

Pos logit contributions
 Articles+0.806
 remedy+0.795
 Thousand+0.767
aughs+0.763
 remedies+0.760
Neg logit contributions
 awa-0.626
OOD-0.571
romy-0.562
oz-0.535
Environment-0.513
Token Sen
Token ID2311
Feature activation-1.02e+00

Pos logit contributions
 Articles+0.381
 remedy+0.376
 Thousand+0.356
aughs+0.356
 remedies+0.352
Neg logit contributions
 awa-0.574
OOD-0.542
romy-0.534
oz-0.520
Environment-0.506
Token.
Token ID13
Feature activation+1.69e-01

Pos logit contributions
 awa+0.901
OOD+0.868
romy+0.861
oz+0.853
Environment+0.822
Neg logit contributions
 Articles-0.520
 remedy-0.517
aughs-0.489
 Thousand-0.485
 remedies-0.482
Token to
Token ID284
Feature activation-1.32e+00

Pos logit contributions
 awa+0.914
OOD+0.864
romy+0.845
oz+0.827
Environment+0.805
Neg logit contributions
 Articles-0.646
 remedy-0.645
aughs-0.613
 Thousand-0.606
 remedies-0.605
Token directly
Token ID3264
Feature activation-9.71e-01

Pos logit contributions
 awa+1.083
OOD+1.009
romy+1.000
oz+0.977
Environment+0.940
Neg logit contributions
 Articles-0.781
aughs-0.760
 remedy-0.750
 Thousand-0.740
esi-0.737
Token sell
Token ID3677
Feature activation-1.95e+00

Pos logit contributions
 awa+0.759
OOD+0.708
romy+0.698
oz+0.684
Environment+0.653
Neg logit contributions
 Articles-0.613
 remedy-0.593
aughs-0.589
 Thousand-0.587
esi-0.574
Token its
Token ID663
Feature activation+8.14e-01

Pos logit contributions
 awa+1.679
OOD+1.582
romy+1.551
oz+1.513
Environment+1.458
Neg logit contributions
 Articles-1.065
 remedy-1.061
aughs-1.019
 Thousand-0.996
esi-0.987
Token RF
Token ID20445
Feature activation-2.40e+00

Pos logit contributions
 Articles+0.463
 remedy+0.459
aughs+0.435
 Thousand+0.435
 remedies+0.430
Neg logit contributions
 awa-0.687
OOD-0.650
romy-0.640
oz-0.622
Environment-0.604
TokenID
Token ID2389
Feature activation-6.86e+00

Pos logit contributions
 awa+1.576
OOD+1.483
romy+1.449
oz+1.396
Environment+1.346
Neg logit contributions
 remedy-1.787
 Articles-1.786
 Thousand-1.715
aughs-1.713
 remedies-1.707
Token-
Token ID12
Feature activation-1.05e+00

Pos logit contributions
 awa+5.194
OOD+4.962
romy+4.843
oz+4.682
Environment+4.549
Neg logit contributions
 Articles-4.111
 remedy-4.088
aughs-4.021
esi-3.906
 remedies-3.893
Tokenbased
Token ID3106
Feature activation-2.47e+00

Pos logit contributions
 awa+0.803
OOD+0.756
romy+0.746
oz+0.723
Environment+0.702
Neg logit contributions
 Articles-0.685
 remedy-0.679
 Thousand-0.648
aughs-0.643
 remedies-0.642
Token products
Token ID3186
Feature activation+3.36e-01

Pos logit contributions
 awa+2.000
OOD+1.881
romy+1.854
oz+1.791
Environment+1.728
Neg logit contributions
 Articles-1.450
 remedy-1.438
aughs-1.407
 Thousand-1.364
esi-1.355
Token throughout
Token ID3690
Feature activation+2.04e-01

Pos logit contributions
 Articles+0.176
 remedy+0.173
aughs+0.165
 Thousand+0.164
 remedies+0.161
Neg logit contributions
 awa-0.299
OOD-0.284
romy-0.280
oz-0.273
Environment-0.265
Token its
Token ID663
Feature activation+8.17e-01

Pos logit contributions
 Articles+0.101
 remedy+0.100
aughs+0.094
 Thousand+0.093
 remedies+0.093
Neg logit contributions
 awa-0.188
OOD-0.178
romy-0.176
oz-0.171
Environment-0.167
Token scrap
Token ID15881
Feature activation+2.32e-01

Pos logit contributions
 Articles+0.258
 remedy+0.254
aughs+0.245
 Thousand+0.243
esi+0.242
Neg logit contributions
 awa-0.352
OOD-0.332
romy-0.326
oz-0.318
Environment-0.310
Token.
Token ID13
Feature activation+3.93e-01

Pos logit contributions
 Articles+0.121
 remedy+0.119
aughs+0.114
 Thousand+0.113
 Warn+0.113
Neg logit contributions
 awa-0.207
OOD-0.196
romy-0.192
oz-0.189
Environment-0.183
Token
Token ID198
Feature activation+8.95e-01

Pos logit contributions
 remedy+0.227
 Articles+0.224
aughs+0.214
 remedies+0.213
 Thousand+0.212
Neg logit contributions
 awa-0.324
OOD-0.310
romy-0.302
oz-0.298
Environment-0.290
Token
Token ID198
Feature activation-3.00e-02

Pos logit contributions
 remedy+0.559
 Articles+0.547
aughs+0.544
 remedies+0.531
 Thousand+0.528
Neg logit contributions
OOD-0.673
oz-0.659
romy-0.647
 awa-0.645
Environment-0.634
TokenArticle
Token ID14906
Feature activation-7.89e-01

Pos logit contributions
 awa+0.019
OOD+0.018
romy+0.017
oz+0.017
Environment+0.016
Neg logit contributions
 Articles-0.024
 remedy-0.023
 Thousand-0.023
aughs-0.022
 remedies-0.022
Token Continued
Token ID22168
Feature activation-6.82e+00

Pos logit contributions
 awa+0.554
OOD+0.514
romy+0.495
oz+0.495
Environment+0.483
Neg logit contributions
 remedy-0.549
 Articles-0.543
 Thousand-0.533
aughs-0.530
esi-0.526
Token Below
Token ID10383
Feature activation-4.43e+00

Pos logit contributions
OOD+3.794
Environment+3.452
oz+3.352
Scene+3.335
 awa+3.132
Neg logit contributions
 remedy-5.558
 remedies-5.397
alore-5.391
aughs-5.304
esi-5.163
Token
Token ID198
Feature activation-4.59e-01

Pos logit contributions
OOD+2.922
oz+2.844
Environment+2.741
Scene+2.665
Lt+2.618
Neg logit contributions
 remedy-3.151
aughs-3.059
 remedies-3.043
alore-2.945
 warranties-2.902
Token
Token ID198
Feature activation-9.95e-01

Pos logit contributions
OOD+0.333
oz+0.325
Environment+0.317
romy+0.317
 awa+0.310
Neg logit contributions
 remedy-0.300
aughs-0.291
 Articles-0.287
 remedies-0.285
 Thousand-0.278
TokenThe
Token ID464
Feature activation+4.17e-01

Pos logit contributions
 awa+0.598
OOD+0.572
romy+0.552
oz+0.540
Environment+0.526
Neg logit contributions
 Articles-0.795
 remedy-0.794
 Thousand-0.762
aughs-0.761
 remedies-0.758
Token T
Token ID309
Feature activation+3.98e-01

Pos logit contributions
 Articles+0.251
 remedy+0.247
aughs+0.237
 Thousand+0.236
 remedies+0.233
Neg logit contributions
 awa-0.337
OOD-0.318
romy-0.314
oz-0.305
Environment-0.296
Token10
Token ID940
Feature activation-2.30e+00

Pos logit contributions
 awa+0.376
OOD+0.364
oz+0.355
romy+0.349
Environment+0.337
Neg logit contributions
 remedy-0.678
 Articles-0.673
aughs-0.662
 Trident-0.653
 Thousand-0.653
Token.
Token ID13
Feature activation-3.09e+00

Pos logit contributions
OOD+1.757
oz+1.738
Environment+1.685
romy+1.655
 awa+1.621
Neg logit contributions
 remedy-1.415
aughs-1.414
 Articles-1.366
 remedies-1.338
alore-1.316
Token13
Token ID1485
Feature activation-4.28e+00

Pos logit contributions
 awa+1.362
OOD+1.324
oz+1.313
romy+1.256
Environment+1.193
Neg logit contributions
 remedy-2.899
 Articles-2.894
aughs-2.830
inders-2.820
esi-2.783
Token71
Token ID4869
Feature activation-2.73e+00

Pos logit contributions
 awa+2.091
OOD+1.841
romy+1.811
oz+1.773
Environment+1.649
Neg logit contributions
 Articles-3.982
 remedy-3.901
aughs-3.859
 Thousand-3.805
 Warn-3.786
Token/
Token ID14
Feature activation-3.06e+00

Pos logit contributions
OOD+1.674
oz+1.660
 awa+1.651
romy+1.600
Environment+1.565
Neg logit contributions
 remedy-2.051
 Articles-2.015
aughs-2.000
 remedies-1.958
 Thousand-1.923
Tokenjournal
Token ID24891
Feature activation-6.78e+00

Pos logit contributions
 awa+1.758
OOD+1.631
romy+1.618
oz+1.576
Environment+1.503
Neg logit contributions
 Articles-2.500
 remedy-2.490
aughs-2.422
 Thousand-2.399
 Warn-2.395
Token.
Token ID13
Feature activation-3.93e+00

Pos logit contributions
oz+4.844
OOD+4.563
Environment+4.357
romy+4.331
stic+4.298
Neg logit contributions
 remedy-4.650
 remedies-4.412
aughs-4.375
 Articles-4.226
 Thousand-4.222
Tokenp
Token ID79
Feature activation-4.46e+00

Pos logit contributions
 awa+2.835
OOD+2.688
romy+2.658
oz+2.632
Environment+2.508
Neg logit contributions
 Articles-2.694
 remedy-2.679
 remedies-2.542
 Thousand-2.536
 Warn-2.518
Tokenone
Token ID505
Feature activation-2.47e+00

Pos logit contributions
 awa+2.567
OOD+2.516
romy+2.486
oz+2.486
stic+2.371
Neg logit contributions
 Articles-3.641
 remedy-3.599
 remedies-3.460
aughs-3.453
 Warn-3.408
Token.
Token ID13
Feature activation-2.43e+00

Pos logit contributions
OOD+1.885
oz+1.854
romy+1.829
Environment+1.790
stic+1.743
Neg logit contributions
 remedy-1.557
aughs-1.493
 remedies-1.477
 Articles-1.469
 warranties-1.410
Token00
Token ID405
Feature activation-2.02e+00

Pos logit contributions
 awa+1.834
OOD+1.771
romy+1.747
oz+1.722
Environment+1.656
Neg logit contributions
 remedy-1.552
 Articles-1.546
 remedies-1.463
aughs-1.447
 Warn-1.444
Token INTER
Token ID23255
Feature activation-1.57e+00

Pos logit contributions
 Articles+0.010
 remedy+0.010
aughs+0.009
 Thousand+0.009
esi+0.009
Neg logit contributions
 awa-0.015
OOD-0.015
romy-0.014
oz-0.014
Environment-0.014
TokenM
Token ID44
Feature activation-4.34e-01

Pos logit contributions
 awa+0.871
OOD+0.820
romy+0.793
oz+0.764
Environment+0.733
Neg logit contributions
 Articles-1.330
 remedy-1.326
aughs-1.283
 Thousand-1.281
 remedies-1.270
TokenITT
Token ID22470
Feature activation-2.47e+00

Pos logit contributions
 awa+0.248
OOD+0.230
romy+0.224
oz+0.215
Environment+0.205
Neg logit contributions
 Articles-0.365
 remedy-0.363
 Thousand-0.350
aughs-0.349
 remedies-0.347
TokenENT
Token ID3525
Feature activation-8.97e-01

Pos logit contributions
 awa+1.471
OOD+1.390
romy+1.344
oz+1.286
Environment+1.248
Neg logit contributions
 Articles-2.000
 remedy-1.980
 Thousand-1.917
aughs-1.907
esi-1.893
Token F
Token ID376
Feature activation-1.68e+00

Pos logit contributions
 awa+0.727
OOD+0.689
romy+0.678
oz+0.657
Environment+0.637
Neg logit contributions
 Articles-0.541
 remedy-0.535
 Thousand-0.510
aughs-0.507
 remedies-0.503
TokenAST
Token ID11262
Feature activation-6.74e+00

Pos logit contributions
 awa+0.940
OOD+0.912
romy+0.868
oz+0.845
Environment+0.780
Neg logit contributions
 remedy-1.412
 Articles-1.408
 remedies-1.356
 Thousand-1.356
 Warn-1.345
TokenING
Token ID2751
Feature activation-1.84e+00

Pos logit contributions
OOD+3.231
 awa+3.180
romy+2.958
oz+2.838
Environment+2.633
Neg logit contributions
 Articles-5.932
 remedy-5.915
 Thousand-5.880
esi-5.773
aughs-5.761
Token as
Token ID355
Feature activation+5.73e-01

Pos logit contributions
 awa+1.559
OOD+1.503
romy+1.468
oz+1.430
Environment+1.383
Neg logit contributions
 Articles-1.011
 remedy-1.007
 Thousand-0.964
aughs-0.962
esi-0.952
Token a
Token ID257
Feature activation+1.54e+00

Pos logit contributions
 Articles+0.324
 remedy+0.319
 Thousand+0.304
aughs+0.303
 remedies+0.299
Neg logit contributions
 awa-0.487
OOD-0.461
romy-0.454
oz-0.442
Environment-0.429
Token strategy
Token ID4811
Feature activation+9.77e-01

Pos logit contributions
 Articles+0.840
 remedy+0.812
aughs+0.792
 Thousand+0.782
esi+0.772
Neg logit contributions
 awa-1.345
OOD-1.271
romy-1.256
oz-1.221
Environment-1.183
Token for
Token ID329
Feature activation+1.50e+00

Pos logit contributions
 Articles+0.461
 remedy+0.452
 Thousand+0.426
aughs+0.423
 remedies+0.419
Neg logit contributions
 awa-0.922
OOD-0.878
romy-0.868
oz-0.846
Environment-0.825
Token projected
Token ID13301
Feature activation+6.26e-01

Pos logit contributions
 Articles+0.839
 remedy+0.826
aughs+0.786
 Thousand+0.783
 remedies+0.773
Neg logit contributions
 awa-1.321
OOD-1.250
romy-1.229
oz-1.202
Environment-1.167
Token rates
Token ID3965
Feature activation+7.06e-01

Pos logit contributions
 Articles+0.352
 remedy+0.348
 Thousand+0.329
aughs+0.326
 remedies+0.326
Neg logit contributions
 awa-0.535
OOD-0.506
romy-0.499
oz-0.485
Environment-0.472
Token of
Token ID286
Feature activation+2.71e-01

Pos logit contributions
 Articles+0.355
 remedy+0.354
 remedies+0.331
 Thousand+0.330
 Warn+0.323
Neg logit contributions
 awa-0.648
OOD-0.611
romy-0.605
oz-0.585
Environment-0.574
Token obesity
Token ID13825
Feature activation-6.42e+00

Pos logit contributions
 Articles+0.175
 remedy+0.173
aughs+0.166
 Thousand+0.166
 remedies+0.164
Neg logit contributions
 awa-0.208
OOD-0.195
romy-0.192
oz-0.186
Environment-0.180
Token and
Token ID290
Feature activation-1.87e+00

Pos logit contributions
 awa+5.040
OOD+4.834
romy+4.680
oz+4.674
rosis+4.481
Neg logit contributions
aughs-3.664
 Articles-3.652
 remedy-3.573
 Thousand-3.537
esi-3.498
Token overweight
Token ID22652
Feature activation-6.69e+00

Pos logit contributions
 awa+1.224
OOD+1.135
romy+1.110
oz+1.077
Environment+1.033
Neg logit contributions
 Articles-1.405
 remedy-1.389
aughs-1.381
 Thousand-1.350
esi-1.340
Token status
Token ID3722
Feature activation-1.30e+00

Pos logit contributions
 awa+5.258
OOD+5.056
romy+4.903
oz+4.867
rosis+4.707
Neg logit contributions
aughs-3.790
 Articles-3.770
 remedy-3.720
esi-3.615
 Thousand-3.605
Token in
Token ID287
Feature activation-4.74e-01

Pos logit contributions
 awa+1.091
OOD+1.041
romy+1.018
oz+1.004
Environment+0.963
Neg logit contributions
 Articles-0.725
 remedy-0.717
aughs-0.695
 Thousand-0.684
esi-0.675
Token both
Token ID1111
Feature activation+2.70e-01

Pos logit contributions
 awa+0.404
OOD+0.383
romy+0.377
oz+0.369
Environment+0.358
Neg logit contributions
 Articles-0.261
 remedy-0.259
aughs-0.249
esi-0.244
 remedies-0.243
Token male
Token ID4257
Feature activation-7.42e-01

Pos logit contributions
 Articles+0.155
 remedy+0.153
aughs+0.147
esi+0.144
 Thousand+0.144
Neg logit contributions
 awa-0.225
OOD-0.214
romy-0.209
oz-0.204
Environment-0.199
Token and
Token ID290
Feature activation-5.68e-02

Pos logit contributions
 awa+0.666
OOD+0.632
romy+0.623
oz+0.607
Environment+0.592
Neg logit contributions
 Articles-0.386
 remedy-0.381
 Thousand-0.359
 remedies-0.355
aughs-0.355
Token mess
Token ID2085
Feature activation+9.99e-01

Pos logit contributions
 awa+0.608
OOD+0.570
romy+0.563
oz+0.547
Environment+0.535
Neg logit contributions
 Articles-0.392
 remedy-0.387
aughs-0.370
 Thousand-0.369
esi-0.363
Token seriously
Token ID6411
Feature activation-6.39e-03

Pos logit contributions
 Articles+0.637
 remedy+0.629
 Thousand+0.605
aughs+0.600
 remedies+0.594
Neg logit contributions
 awa-0.780
OOD-0.732
romy-0.720
oz-0.699
Environment-0.679
Token
Token ID198
Feature activation-4.24e-01

Pos logit contributions
 awa+0.006
OOD+0.005
romy+0.005
oz+0.005
Environment+0.005
Neg logit contributions
 Articles-0.003
 remedy-0.003
 Thousand-0.003
aughs-0.003
 remedies-0.003
Token
Token ID198
Feature activation-9.42e-01

Pos logit contributions
OOD+0.314
oz+0.306
romy+0.301
 awa+0.297
Environment+0.295
Neg logit contributions
 remedy-0.270
aughs-0.260
 Articles-0.260
 remedies-0.255
 Thousand-0.251
TokenArticle
Token ID14906
Feature activation-7.60e-01

Pos logit contributions
 awa+0.634
OOD+0.601
romy+0.587
oz+0.570
Environment+0.554
Neg logit contributions
 Articles-0.692
 remedy-0.689
 Thousand-0.661
 remedies-0.655
aughs-0.654
Token Continued
Token ID22168
Feature activation-6.69e+00

Pos logit contributions
 awa+0.481
OOD+0.456
oz+0.442
Environment+0.436
romy+0.426
Neg logit contributions
 remedy-0.567
 Thousand-0.549
 Articles-0.549
esi-0.545
aughs-0.543
Token Below
Token ID10383
Feature activation-3.68e+00

Pos logit contributions
OOD+3.816
Environment+3.424
oz+3.371
Scene+3.335
 awa+3.195
Neg logit contributions
 remedy-5.424
 remedies-5.241
alore-5.195
aughs-5.093
esi-5.007
Token
Token ID198
Feature activation-6.84e-01

Pos logit contributions
OOD+2.478
oz+2.402
Environment+2.308
Scene+2.255
romy+2.218
Neg logit contributions
 remedy-2.576
aughs-2.485
 remedies-2.477
alore-2.423
 warranties-2.373
Token
Token ID198
Feature activation-1.15e+00

Pos logit contributions
OOD+0.492
oz+0.480
Environment+0.468
romy+0.467
 awa+0.457
Neg logit contributions
 remedy-0.450
aughs-0.435
 Articles-0.428
 remedies-0.426
alore-0.417
Token
Token ID447
Feature activation+6.93e-01

Pos logit contributions
 awa+0.663
OOD+0.625
romy+0.604
oz+0.586
Environment+0.569
Neg logit contributions
 Articles-0.949
 remedy-0.947
 Thousand-0.912
 remedies-0.907
aughs-0.907
Token
Token ID250
Feature activation-2.61e-03

Pos logit contributions
 Articles+0.466
 remedy+0.460
 Thousand+0.440
 remedies+0.437
aughs+0.436
Neg logit contributions
 awa-0.514
OOD-0.482
romy-0.478
oz-0.460
Environment-0.445
Token community
Token ID2055
Feature activation+1.37e+00

Pos logit contributions
 awa+0.608
OOD+0.574
romy+0.562
oz+0.546
Environment+0.529
Neg logit contributions
 Articles-0.527
 remedy-0.520
 Thousand-0.498
 remedies-0.494
aughs-0.494
Token.
Token ID13
Feature activation+7.18e-01

Pos logit contributions
 Articles+0.639
 remedy+0.629
 Thousand+0.589
 remedies+0.582
aughs+0.580
Neg logit contributions
 awa-1.308
OOD-1.241
romy-1.228
oz-1.193
Environment-1.167
Token (
Token ID357
Feature activation+1.95e+00

Pos logit contributions
 remedy+0.383
 Articles+0.378
 remedies+0.358
aughs+0.357
 Thousand+0.355
Neg logit contributions
 awa-0.629
OOD-0.598
romy-0.584
oz-0.576
Environment-0.561
TokenPhoto
Token ID6191
Feature activation-1.44e+00

Pos logit contributions
 Articles+1.620
 remedy+1.594
 Thousand+1.544
 Warn+1.531
aughs+1.529
Neg logit contributions
 awa-1.149
OOD-1.040
romy-1.028
oz-0.974
Environment-0.930
Token by
Token ID416
Feature activation-1.87e+00

Pos logit contributions
 awa+1.339
OOD+1.273
romy+1.258
oz+1.222
Environment+1.192
Neg logit contributions
 Articles-0.707
 remedy-0.695
 Thousand-0.652
aughs-0.645
 remedies-0.645
Token REUTERS
Token ID15862
Feature activation-6.69e+00

Pos logit contributions
 awa+1.494
OOD+1.402
romy+1.374
oz+1.355
Environment+1.308
Neg logit contributions
 Articles-1.148
 remedy-1.140
aughs-1.086
 remedies-1.080
esi-1.068
Token/
Token ID14
Feature activation-2.05e+00

Pos logit contributions
 awa+4.664
OOD+4.469
romy+4.326
oz+4.291
Environment+4.134
Neg logit contributions
 remedy-4.524
 Articles-4.477
 remedies-4.305
 Thousand-4.268
esi-4.248
TokenJohn
Token ID7554
Feature activation-1.83e+00

Pos logit contributions
 awa+1.115
OOD+1.063
romy+1.012
oz+1.002
Environment+0.960
Neg logit contributions
 remedy-1.761
 Articles-1.753
 remedies-1.695
aughs-1.693
 Thousand-1.667
Token S
Token ID311
Feature activation-8.15e-01

Pos logit contributions
 awa+1.403
OOD+1.317
romy+1.309
oz+1.293
Environment+1.225
Neg logit contributions
 Articles-1.156
 remedy-1.155
aughs-1.101
 remedies-1.099
esi-1.075
Tokenomm
Token ID2002
Feature activation-1.69e+00

Pos logit contributions
 awa+0.485
OOD+0.453
romy+0.442
oz+0.427
Environment+0.407
Neg logit contributions
 Articles-0.668
 remedy-0.661
 Thousand-0.637
aughs-0.633
 remedies-0.633
Tokeners
Token ID364
Feature activation-1.56e+00

Pos logit contributions
 awa+1.242
romy+1.187
oz+1.179
OOD+1.170
Environment+1.097
Neg logit contributions
 remedy-1.109
 Articles-1.091
 remedies-1.045
aughs-1.042
 Thousand-1.035
Token.
Token ID13
Feature activation-4.10e-01

Pos logit contributions
 Articles+0.019
 remedy+0.018
 Thousand+0.017
aughs+0.017
 remedies+0.017
Neg logit contributions
 awa-0.035
OOD-0.033
romy-0.033
oz-0.032
Environment-0.031
Token
Token ID198
Feature activation+2.73e-01

Pos logit contributions
 awa+0.362
OOD+0.343
romy+0.339
oz+0.329
Environment+0.321
Neg logit contributions
 Articles-0.219
 remedy-0.215
 Thousand-0.204
aughs-0.202
 remedies-0.201
Token
Token ID198
Feature activation-5.78e-01

Pos logit contributions
 remedy+0.175
aughs+0.171
 Articles+0.168
 remedies+0.166
 Thousand+0.163
Neg logit contributions
OOD-0.203
oz-0.197
romy-0.193
 awa-0.191
Environment-0.190
TokenNewsletter
Token ID33031
Feature activation-4.35e+00

Pos logit contributions
 awa+0.346
OOD+0.327
romy+0.318
oz+0.307
Environment+0.298
Neg logit contributions
 Articles-0.467
 remedy-0.466
 Thousand-0.447
 remedies-0.446
aughs-0.445
Token Sign
Token ID5865
Feature activation-4.25e+00

Pos logit contributions
OOD+2.326
oz+2.305
 awa+2.302
romy+2.205
stic+2.076
Neg logit contributions
 remedy-3.512
aughs-3.510
alore-3.392
 remedies-3.327
esi-3.257
Token Up
Token ID3205
Feature activation-6.68e+00

Pos logit contributions
OOD+3.472
oz+3.444
 awa+3.358
romy+3.328
Environment+3.326
Neg logit contributions
 remedy-2.278
 remedies-2.168
aughs-2.165
alore-2.106
esi-2.021
Token Continue
Token ID10054
Feature activation-4.85e+00

Pos logit contributions
OOD+3.307
oz+3.144
 awa+2.945
romy+2.912
rosis+2.828
Neg logit contributions
 remedy-5.528
aughs-5.494
 remedies-5.420
 Articles-5.306
alore-5.237
Token reading
Token ID3555
Feature activation-2.57e+00

Pos logit contributions
 awa+2.952
OOD+2.941
romy+2.845
oz+2.828
rosis+2.718
Neg logit contributions
aughs-3.522
esi-3.397
 Articles-3.381
 remedy-3.371
 Thousand-3.345
Token the
Token ID262
Feature activation-2.59e-01

Pos logit contributions
OOD+1.820
oz+1.768
romy+1.694
Environment+1.693
 awa+1.690
Neg logit contributions
aughs-1.678
 remedy-1.651
 Articles-1.596
inders-1.578
esi-1.569
Token main
Token ID1388
Feature activation+1.16e+00

Pos logit contributions
 awa+0.189
OOD+0.187
romy+0.180
oz+0.180
Environment+0.170
Neg logit contributions
aughs-0.166
 Articles-0.163
 remedy-0.161
 Thousand-0.157
esi-0.156
Token story
Token ID1621
Feature activation-3.30e+00

Pos logit contributions
 Articles+0.838
aughs+0.825
 remedy+0.822
 Thousand+0.799
esi+0.798
Neg logit contributions
 awa-0.758
OOD-0.738
romy-0.719
oz-0.711
Environment-0.672
Token They
Token ID1119
Feature activation-1.25e+00

Pos logit contributions
 awa+0.401
OOD+0.390
oz+0.375
romy+0.372
Environment+0.363
Neg logit contributions
 remedy-0.337
 remedies-0.319
 Articles-0.318
aughs-0.314
esi-0.310
Token were
Token ID547
Feature activation+8.23e-01

Pos logit contributions
 awa+1.076
OOD+1.021
romy+1.006
oz+0.987
Environment+0.946
Neg logit contributions
 Articles-0.680
 remedy-0.657
 Thousand-0.642
aughs-0.636
esi-0.629
Token very
Token ID845
Feature activation-8.33e-01

Pos logit contributions
 Articles+0.466
 remedy+0.457
aughs+0.438
 Thousand+0.434
esi+0.430
Neg logit contributions
 awa-0.700
OOD-0.658
romy-0.649
oz-0.636
Environment-0.617
Token healthy
Token ID5448
Feature activation+5.66e-01

Pos logit contributions
 awa+0.718
OOD+0.674
romy+0.668
oz+0.655
Environment+0.633
Neg logit contributions
 Articles-0.456
 remedy-0.442
aughs-0.433
esi-0.425
 Thousand-0.424
Token this
Token ID428
Feature activation+2.21e-01

Pos logit contributions
 Articles+0.290
 remedy+0.286
aughs+0.273
 Thousand+0.272
esi+0.268
Neg logit contributions
 awa-0.509
OOD-0.482
romy-0.474
oz-0.466
Environment-0.450
Token year
Token ID614
Feature activation+3.67e+00

Pos logit contributions
 Articles+0.134
 remedy+0.132
aughs+0.126
 Thousand+0.126
 remedies+0.125
Neg logit contributions
 awa-0.179
OOD-0.169
romy-0.166
oz-0.162
Environment-0.157
Token.
Token ID13
Feature activation-1.55e-01

Pos logit contributions
 Articles+1.798
 remedy+1.779
 Thousand+1.663
 remedies+1.650
 Warn+1.641
Neg logit contributions
 awa-3.374
OOD-3.210
romy-3.172
oz-3.076
Environment-3.022
Token There
Token ID1318
Feature activation-5.29e-01

Pos logit contributions
 awa+0.118
OOD+0.115
oz+0.111
romy+0.110
Environment+0.107
Neg logit contributions
 remedy-0.099
 Articles-0.095
 remedies-0.094
aughs-0.093
esi-0.092
Token were
Token ID547
Feature activation+1.52e+00

Pos logit contributions
 awa+0.520
OOD+0.497
romy+0.490
oz+0.481
Environment+0.467
Neg logit contributions
 Articles-0.228
 remedy-0.224
 Thousand-0.209
aughs-0.208
 remedies-0.205
Token teams
Token ID3466
Feature activation-9.65e-01

Pos logit contributions
 Articles+0.819
 remedy+0.809
aughs+0.768
 Thousand+0.766
esi+0.758
Neg logit contributions
 awa-1.334
OOD-1.261
romy-1.242
oz-1.213
Environment-1.179
Token I
Token ID314
Feature activation-2.17e-01

Pos logit contributions
 awa+0.792
OOD+0.745
romy+0.734
oz+0.718
Environment+0.690
Neg logit contributions
 Articles-0.568
 remedy-0.557
aughs-0.540
esi-0.536
 Thousand-0.533
Token breath
Token ID8033
Feature activation-2.87e+00

Pos logit contributions
 Articles+0.404
 remedy+0.396
aughs+0.377
 Thousand+0.376
esi+0.371
Neg logit contributions
 awa-0.656
OOD-0.623
romy-0.613
oz-0.601
Environment-0.585
Token (
Token ID357
Feature activation-4.35e-01

Pos logit contributions
 awa+2.368
OOD+2.260
romy+2.192
oz+2.177
Environment+2.094
Neg logit contributions
 Articles-1.658
 remedy-1.609
 Thousand-1.556
aughs-1.538
esi-1.534
Tokenafter
Token ID8499
Feature activation+2.56e-01

Pos logit contributions
 awa+0.311
OOD+0.293
romy+0.287
oz+0.281
Environment+0.272
Neg logit contributions
 Articles-0.303
 remedy-0.301
 Thousand-0.288
aughs-0.287
 remedies-0.286
Token a
Token ID257
Feature activation+1.64e+00

Pos logit contributions
 Articles+0.145
 remedy+0.143
aughs+0.137
 Thousand+0.136
esi+0.135
Neg logit contributions
 awa-0.216
OOD-0.204
romy-0.201
oz-0.197
Environment-0.192
Token steep
Token ID14559
Feature activation+3.03e+00

Pos logit contributions
 Articles+0.965
 remedy+0.947
aughs+0.908
 Thousand+0.900
 remedies+0.896
Neg logit contributions
 awa-1.362
OOD-1.281
romy-1.264
oz-1.233
Environment-1.197
Token climb
Token ID12080
Feature activation+3.69e+00

Pos logit contributions
 Articles+1.739
 remedy+1.727
 Thousand+1.630
 remedies+1.617
aughs+1.610
Neg logit contributions
 awa-2.549
OOD-2.412
romy-2.376
oz-2.304
Environment-2.244
Token out
Token ID503
Feature activation+3.51e+00

Pos logit contributions
 Articles+1.841
 remedy+1.830
 Thousand+1.708
 remedies+1.698
aughs+1.675
Neg logit contributions
 awa-3.357
OOD-3.184
romy-3.148
oz-3.041
Environment-2.983
Token of
Token ID286
Feature activation+2.38e+00

Pos logit contributions
 Articles+1.727
 remedy+1.712
 Thousand+1.627
 Warn+1.604
 remedies+1.588
Neg logit contributions
 awa-3.199
romy-3.035
OOD-3.026
oz-2.886
Environment-2.811
Token the
Token ID262
Feature activation+1.69e+00

Pos logit contributions
 Articles+1.269
 remedy+1.253
 Thousand+1.181
 remedies+1.170
aughs+1.169
Neg logit contributions
 awa-2.107
OOD-1.998
romy-1.970
oz-1.917
Environment-1.867
Token dungeon
Token ID18974
Feature activation+9.62e-01

Pos logit contributions
 Articles+1.012
 remedy+1.000
 Thousand+0.950
aughs+0.943
 remedies+0.942
Neg logit contributions
 awa-1.380
OOD-1.302
romy-1.283
oz-1.245
Environment-1.211
Token),"
Token ID27267
Feature activation+1.04e+00

Pos logit contributions
 Articles+0.543
 remedy+0.535
 Thousand+0.508
aughs+0.503
 remedies+0.502
Neg logit contributions
 awa-0.820
OOD-0.777
romy-0.765
oz-0.745
Environment-0.725
Token and
Token ID290
Feature activation-2.93e-01

Pos logit contributions
 awa+0.524
OOD+0.497
romy+0.490
oz+0.479
Environment+0.464
Neg logit contributions
 Articles-0.322
 remedy-0.318
aughs-0.302
 Thousand-0.301
 remedies-0.297
Token has
Token ID468
Feature activation-5.56e-01

Pos logit contributions
 awa+0.249
OOD+0.234
romy+0.231
oz+0.225
Environment+0.219
Neg logit contributions
 Articles-0.165
 remedy-0.162
aughs-0.156
 Thousand-0.154
esi-0.154
Token little
Token ID1310
Feature activation-2.91e+00

Pos logit contributions
 awa+0.482
OOD+0.457
romy+0.450
oz+0.439
Environment+0.426
Neg logit contributions
 Articles-0.302
 remedy-0.296
aughs-0.283
 Thousand-0.283
esi-0.277
Token time
Token ID640
Feature activation-1.59e+00

Pos logit contributions
 awa+2.309
OOD+2.167
romy+2.103
oz+2.081
Environment+1.995
Neg logit contributions
 Articles-1.751
aughs-1.696
 remedy-1.663
esi-1.659
 Thousand-1.655
Token to
Token ID284
Feature activation+3.10e-02

Pos logit contributions
 awa+1.444
OOD+1.375
romy+1.356
oz+1.329
Environment+1.281
Neg logit contributions
 Articles-0.794
 remedy-0.777
aughs-0.737
 Thousand-0.736
 Warn-0.724
Token prepare
Token ID8335
Feature activation+3.33e+00

Pos logit contributions
 Articles+0.018
aughs+0.017
esi+0.017
 Thousand+0.017
 remedy+0.016
Neg logit contributions
 awa-0.026
romy-0.025
OOD-0.024
oz-0.024
Environment-0.023
Token for
Token ID329
Feature activation+2.23e-01

Pos logit contributions
 Articles+1.635
 remedy+1.614
 Thousand+1.507
 remedies+1.505
 Warn+1.484
Neg logit contributions
 awa-3.096
OOD-2.930
romy-2.903
oz-2.821
Environment-2.762
Token cases
Token ID2663
Feature activation-1.45e-01

Pos logit contributions
 Articles+0.131
 remedy+0.129
aughs+0.124
 Thousand+0.124
esi+0.122
Neg logit contributions
 awa-0.183
OOD-0.173
romy-0.170
oz-0.166
Environment-0.161
Token.
Token ID13
Feature activation-3.91e-01

Pos logit contributions
 awa+0.132
OOD+0.126
romy+0.124
oz+0.122
Environment+0.118
Neg logit contributions
 Articles-0.073
 remedy-0.072
aughs-0.068
 Thousand-0.068
 remedies-0.067
Token (
Token ID357
Feature activation+6.48e-01

Pos logit contributions
 awa+0.335
OOD+0.318
romy+0.312
oz+0.306
Environment+0.297
Neg logit contributions
 remedy-0.216
 Articles-0.216
aughs-0.203
 Thousand-0.203
 remedies-0.202
TokenPhoto
Token ID6191
Feature activation-1.87e+00

Pos logit contributions
 Articles+0.513
 remedy+0.506
 Thousand+0.488
aughs+0.484
 remedies+0.483
Neg logit contributions
 awa-0.408
OOD-0.373
romy-0.370
oz-0.350
Environment-0.337
Token interviews
Token ID9299
Feature activation-2.59e-02

Pos logit contributions
 awa+0.185
OOD+0.175
romy+0.172
oz+0.168
Environment+0.164
Neg logit contributions
 Articles-0.117
 remedy-0.116
aughs-0.111
 Thousand-0.110
esi-0.110
Token had
Token ID550
Feature activation+1.95e+00

Pos logit contributions
 awa+0.024
OOD+0.023
romy+0.022
oz+0.022
Environment+0.022
Neg logit contributions
 Articles-0.012
 remedy-0.012
 Thousand-0.012
aughs-0.012
esi-0.011
Token already
Token ID1541
Feature activation+2.74e+00

Pos logit contributions
 Articles+0.896
 remedy+0.882
 Thousand+0.828
aughs+0.819
 remedies+0.813
Neg logit contributions
 awa-1.869
OOD-1.780
romy-1.755
oz-1.718
Environment-1.676
Token been
Token ID587
Feature activation+2.35e+00

Pos logit contributions
 Articles+1.266
 remedy+1.240
 Thousand+1.169
aughs+1.154
 remedies+1.146
Neg logit contributions
 awa-2.623
OOD-2.492
romy-2.461
oz-2.411
Environment-2.347
Token carried
Token ID5281
Feature activation+2.03e+00

Pos logit contributions
 Articles+1.403
 remedy+1.375
 Thousand+1.314
aughs+1.311
 remedies+1.294
Neg logit contributions
 awa-1.925
OOD-1.812
romy-1.784
oz-1.742
Environment-1.689
Token out
Token ID503
Feature activation+3.34e+00

Pos logit contributions
 Articles+1.106
 remedy+1.091
 Thousand+1.034
aughs+1.025
 remedies+1.020
Neg logit contributions
 awa-1.776
OOD-1.681
romy-1.656
oz-1.615
Environment-1.570
Token under
Token ID739
Feature activation+4.68e+00

Pos logit contributions
 Articles+1.740
 remedy+1.718
 Thousand+1.619
 remedies+1.603
 Warn+1.595
Neg logit contributions
 awa-2.987
OOD-2.834
romy-2.799
oz-2.718
Environment-2.651
Token the
Token ID262
Feature activation+1.16e+00

Pos logit contributions
 Articles+2.553
 remedy+2.502
 Warn+2.364
 remedies+2.338
 Thousand+2.337
Neg logit contributions
 awa-4.086
OOD-3.885
romy-3.881
oz-3.718
NetMessage-3.604
Token Police
Token ID4287
Feature activation+3.79e-01

Pos logit contributions
 Articles+0.695
 remedy+0.689
aughs+0.656
 Thousand+0.654
esi+0.650
Neg logit contributions
 awa-0.946
OOD-0.889
romy-0.872
oz-0.852
Environment-0.832
Token and
Token ID290
Feature activation+1.33e-02

Pos logit contributions
 Articles+0.214
 remedy+0.212
 Thousand+0.200
aughs+0.200
 remedies+0.199
Neg logit contributions
 awa-0.323
OOD-0.305
romy-0.300
oz-0.292
Environment-0.285
Token Criminal
Token ID15531
Feature activation+1.59e-01

Pos logit contributions
 Articles+0.008
 remedy+0.008
 Thousand+0.007
aughs+0.007
 remedies+0.007
Neg logit contributions
 awa-0.011
OOD-0.010
romy-0.010
oz-0.010
Environment-0.010
Token Americans
Token ID3399
Feature activation+3.95e-01

Pos logit contributions
 Articles+0.057
 remedy+0.057
 Thousand+0.054
aughs+0.054
 remedies+0.053
Neg logit contributions
 awa-0.073
OOD-0.069
romy-0.068
oz-0.066
Environment-0.064
Token).
Token ID737
Feature activation-1.81e-01

Pos logit contributions
 Articles+0.240
 remedy+0.237
 Thousand+0.227
aughs+0.225
 remedies+0.223
Neg logit contributions
 awa-0.319
OOD-0.302
romy-0.297
oz-0.288
Environment-0.279
Token Kaine
Token ID37193
Feature activation+2.04e+00

Pos logit contributions
 awa+0.151
OOD+0.144
romy+0.140
oz+0.139
Environment+0.134
Neg logit contributions
 remedy-0.104
 Articles-0.101
 remedies-0.097
aughs-0.096
 Thousand-0.096
Token shot
Token ID2823
Feature activation+9.65e-01

Pos logit contributions
 Articles+1.185
 remedy+1.172
 Thousand+1.108
 remedies+1.102
aughs+1.101
Neg logit contributions
 awa-1.703
OOD-1.611
romy-1.588
oz-1.539
Environment-1.503
Token back
Token ID736
Feature activation+8.88e-01

Pos logit contributions
 Articles+0.641
 remedy+0.636
 Thousand+0.610
aughs+0.607
 remedies+0.603
Neg logit contributions
 awa-0.724
OOD-0.676
romy-0.667
oz-0.645
Environment-0.626
Token:
Token ID25
Feature activation+3.61e+00

Pos logit contributions
 Articles+0.470
 remedy+0.464
 Thousand+0.439
aughs+0.436
 remedies+0.432
Neg logit contributions
 awa-0.787
OOD-0.747
romy-0.735
oz-0.718
Environment-0.697
Token
Token ID564
Feature activation+3.04e-01

Pos logit contributions
 Articles+1.653
 remedy+1.580
 Thousand+1.485
 remedies+1.473
 Force+1.468
Neg logit contributions
 awa-3.448
OOD-3.283
romy-3.260
oz-3.144
Environment-3.108
Token
Token ID250
Feature activation+4.58e-01

Pos logit contributions
 Articles+0.234
 remedy+0.233
aughs+0.224
 Thousand+0.222
 remedies+0.222
Neg logit contributions
 awa-0.189
OOD-0.184
romy-0.179
oz-0.172
Environment-0.167
TokenI
Token ID40
Feature activation-1.08e-01

Pos logit contributions
 Articles+0.368
 remedy+0.367
 Thousand+0.352
aughs+0.351
 remedies+0.350
Neg logit contributions
 awa-0.277
OOD-0.261
romy-0.255
oz-0.247
Environment-0.238
Token can
Token ID460
Feature activation-4.76e-01

Pos logit contributions
 awa+0.087
OOD+0.082
romy+0.081
oz+0.079
Environment+0.076
Neg logit contributions
 Articles-0.066
 remedy-0.065
aughs-0.063
 Thousand-0.063
 remedies-0.061
Token
Token ID447
Feature activation-5.77e-01

Pos logit contributions
 awa+0.378
OOD+0.355
romy+0.351
oz+0.341
Environment+0.329
Neg logit contributions
 Articles-0.297
 remedy-0.289
 Thousand-0.281
aughs-0.279
 remedies-0.275
Token with
Token ID351
Feature activation+1.81e+00

Pos logit contributions
 awa+0.926
OOD+0.886
romy+0.868
oz+0.854
Environment+0.829
Neg logit contributions
 Articles-0.458
 remedy-0.448
aughs-0.425
 Thousand-0.423
esi-0.418
Token.
Token ID13
Feature activation+7.56e-01

Pos logit contributions
 Articles+0.987
 remedy+0.974
 Thousand+0.920
 remedies+0.911
aughs+0.908
Neg logit contributions
 awa-1.582
OOD-1.500
romy-1.479
oz-1.437
Environment-1.401
Token If
Token ID1002
Feature activation+6.22e-01

Pos logit contributions
 remedy+0.438
 Articles+0.435
 remedies+0.411
aughs+0.411
 Thousand+0.409
Neg logit contributions
 awa-0.627
OOD-0.597
romy-0.584
oz-0.572
Environment-0.557
Token Clinton
Token ID2605
Feature activation+1.99e+00

Pos logit contributions
 Articles+0.349
 remedy+0.345
 Thousand+0.326
aughs+0.325
 remedies+0.323
Neg logit contributions
 awa-0.531
OOD-0.502
romy-0.495
oz-0.482
Environment-0.469
Token
Token ID447
Feature activation+3.30e-01

Pos logit contributions
 Articles+0.979
 remedy+0.963
 Thousand+0.911
aughs+0.902
esi+0.895
Neg logit contributions
 awa-1.843
OOD-1.745
romy-1.724
oz-1.691
Environment-1.637
Token
Token ID247
Feature activation-2.53e-02

Pos logit contributions
 Articles+0.241
 remedy+0.240
 Thousand+0.230
aughs+0.228
 remedies+0.228
Neg logit contributions
 awa-0.226
OOD-0.211
romy-0.207
oz-0.200
Environment-0.193
Tokens
Token ID82
Feature activation+4.21e-01

Pos logit contributions
 awa+0.024
OOD+0.023
romy+0.022
oz+0.022
Environment+0.021
Neg logit contributions
 Articles-0.012
 remedy-0.012
 Thousand-0.011
aughs-0.011
 remedies-0.011
Token position
Token ID2292
Feature activation+1.44e+00

Pos logit contributions
 Articles+0.234
 remedy+0.229
aughs+0.220
 Thousand+0.219
 Warn+0.216
Neg logit contributions
 awa-0.362
OOD-0.340
romy-0.337
oz-0.328
Environment-0.319
Token is
Token ID318
Feature activation+1.37e+00

Pos logit contributions
 Articles+0.663
 remedy+0.650
 Thousand+0.614
aughs+0.604
esi+0.600
Neg logit contributions
 awa-1.381
OOD-1.313
romy-1.297
oz-1.267
Environment-1.235
Token that
Token ID326
Feature activation-6.16e-01

Pos logit contributions
 Articles+0.761
 remedy+0.750
 Thousand+0.712
aughs+0.707
 remedies+0.702
Neg logit contributions
 awa-1.185
OOD-1.121
romy-1.106
oz-1.077
Environment-1.047
Token she
Token ID673
Feature activation+2.67e-01

Pos logit contributions
 awa+0.499
OOD+0.470
romy+0.463
oz+0.452
Environment+0.437
Neg logit contributions
 Articles-0.369
 remedy-0.366
aughs-0.352
 Thousand-0.349
esi-0.348
Token {
Token ID1391
Feature activation+9.38e-01

Pos logit contributions
 Articles+0.614
 remedy+0.607
aughs+0.579
 Thousand+0.573
 remedies+0.570
Neg logit contributions
 awa-0.973
OOD-0.921
romy-0.902
oz-0.884
Environment-0.862
Tokenvi
Token ID8903
Feature activation-4.95e-01

Pos logit contributions
 Articles+0.619
 remedy+0.613
 Thousand+0.584
 remedies+0.580
aughs+0.580
Neg logit contributions
 awa-0.709
OOD-0.665
romy-0.655
oz-0.632
Environment-0.613
Token:
Token ID25
Feature activation+3.86e+00

Pos logit contributions
 awa+0.468
OOD+0.443
romy+0.437
oz+0.424
Environment+0.414
Neg logit contributions
 Articles-0.238
 remedy-0.235
 Thousand-0.217
 remedies-0.217
aughs-0.216
Token|
Token ID91
Feature activation+1.11e+00

Pos logit contributions
 Articles+2.851
 remedy+2.828
esi+2.701
 Warn+2.692
 remedies+2.689
Neg logit contributions
 awa-2.594
OOD-2.403
romy-2.373
oz-2.243
Environment-2.161
Tokenvim
Token ID31124
Feature activation-1.62e-02

Pos logit contributions
 Articles+0.705
 remedy+0.698
 Thousand+0.662
 remedies+0.659
esi+0.657
Neg logit contributions
 awa-0.864
OOD-0.812
romy-0.800
oz-0.770
Environment-0.749
Token:
Token ID25
Feature activation+3.21e+00

Pos logit contributions
 awa+0.013
OOD+0.013
romy+0.013
oz+0.012
Environment+0.012
Neg logit contributions
 remedy-0.009
 Articles-0.009
 Thousand-0.009
aughs-0.009
 remedies-0.009
Token|
Token ID91
Feature activation+1.23e+00

Pos logit contributions
 Articles+2.330
 remedy+2.306
esi+2.217
 remedies+2.198
 Force+2.195
Neg logit contributions
 awa-2.216
OOD-2.041
romy-2.014
oz-1.892
Environment-1.827
Tokenex
Token ID1069
Feature activation+1.57e+00

Pos logit contributions
 Articles+0.831
 remedy+0.822
 Thousand+0.784
 remedies+0.780
aughs+0.779
Neg logit contributions
 awa-0.904
OOD-0.848
romy-0.833
oz-0.802
Environment-0.779
Token:
Token ID25
Feature activation+2.86e+00

Pos logit contributions
 Articles+0.693
 remedy+0.678
 remedies+0.624
esi+0.620
 Warn+0.614
Neg logit contributions
 awa-1.547
OOD-1.478
romy-1.445
oz-1.397
Environment-1.365
Token}
Token ID92
Feature activation+1.37e+00

Pos logit contributions
 Articles+1.420
 remedy+1.398
 Thousand+1.310
esi+1.304
 Warn+1.302
Neg logit contributions
 awa-2.626
OOD-2.486
romy-2.456
oz-2.358
Environment-2.312
Token the
Token ID262
Feature activation+4.30e-01

Pos logit contributions
 Articles+0.824
 remedy+0.816
 Thousand+0.775
aughs+0.771
 remedies+0.769
Neg logit contributions
 awa-1.109
OOD-1.047
romy-1.029
oz-1.001
Environment-0.974
Tokenen
Token ID268
Feature activation-2.07e+00

Pos logit contributions
 Articles+0.752
 remedy+0.742
 Thousand+0.718
aughs+0.712
 Warn+0.709
Neg logit contributions
 awa-0.697
OOD-0.648
romy-0.637
oz-0.616
Environment-0.593
Token their
Token ID511
Feature activation+3.79e-01

Pos logit contributions
 awa+1.710
OOD+1.619
romy+1.576
oz+1.549
Environment+1.501
Neg logit contributions
 remedy-1.156
 Articles-1.153
aughs-1.129
 Thousand-1.125
esi-1.114
Token recovery
Token ID7628
Feature activation+3.81e-01

Pos logit contributions
 Articles+0.228
 remedy+0.226
aughs+0.222
 Thousand+0.218
esi+0.216
Neg logit contributions
 awa-0.301
OOD-0.284
romy-0.278
oz-0.273
Environment-0.263
Token from
Token ID422
Feature activation-1.54e+00

Pos logit contributions
 Articles+0.203
 remedy+0.200
aughs+0.196
 Thousand+0.194
esi+0.189
Neg logit contributions
 awa-0.333
OOD-0.316
romy-0.307
oz-0.303
Environment-0.293
Token the
Token ID262
Feature activation-7.92e-01

Pos logit contributions
 awa+1.284
OOD+1.213
romy+1.184
oz+1.169
Environment+1.136
Neg logit contributions
 Articles-0.865
 remedy-0.861
aughs-0.831
 Thousand-0.822
esi-0.810
Token crisis
Token ID4902
Feature activation+2.96e-01

Pos logit contributions
 awa+0.643
OOD+0.604
romy+0.589
oz+0.582
Environment+0.563
Neg logit contributions
 Articles-0.469
 remedy-0.466
aughs-0.455
esi-0.443
 Thousand-0.442
Token by
Token ID416
Feature activation-6.20e-01

Pos logit contributions
 Articles+0.155
 remedy+0.152
aughs+0.146
 Thousand+0.146
 remedies+0.142
Neg logit contributions
 awa-0.263
OOD-0.249
romy-0.245
oz-0.240
Environment-0.233
Token opening
Token ID4756
Feature activation-2.42e+00

Pos logit contributions
 awa+0.448
OOD+0.414
romy+0.408
oz+0.398
Environment+0.383
Neg logit contributions
 Articles-0.427
 remedy-0.421
aughs-0.415
 Thousand-0.407
esi-0.402
Token up
Token ID510
Feature activation-1.45e+00

Pos logit contributions
 awa+2.019
OOD+1.921
romy+1.868
oz+1.846
Environment+1.789
Neg logit contributions
 Articles-1.326
 remedy-1.325
aughs-1.318
 Thousand-1.269
esi-1.267
Token trade
Token ID3292
Feature activation+2.57e-01

Pos logit contributions
 awa+1.218
OOD+1.165
romy+1.138
oz+1.121
Environment+1.080
Neg logit contributions
 Articles-0.804
 remedy-0.798
aughs-0.779
 Thousand-0.760
esi-0.753
Token in
Token ID287
Feature activation-2.57e-01

Pos logit contributions
 Articles+0.135
 remedy+0.134
aughs+0.128
 Thousand+0.127
 Warn+0.125
Neg logit contributions
 awa-0.225
OOD-0.216
romy-0.212
oz-0.207
Environment-0.200
Token
Token ID198
Feature activation-3.27e-01

Pos logit contributions
 awa+0.441
OOD+0.413
romy+0.413
oz+0.395
Environment+0.385
Neg logit contributions
 Articles-0.245
 remedy-0.234
 Thousand-0.227
 Warn-0.224
aughs-0.221
Token
Token ID198
Feature activation-7.96e-01

Pos logit contributions
OOD+0.237
oz+0.231
Environment+0.222
romy+0.222
Scene+0.216
Neg logit contributions
 remedy-0.216
aughs-0.212
 Articles-0.206
 remedies-0.206
 Thousand-0.200
Token"
Token ID1
Feature activation-2.17e+00

Pos logit contributions
 awa+0.481
OOD+0.458
romy+0.442
oz+0.432
Environment+0.422
Neg logit contributions
 Articles-0.636
 remedy-0.636
 Thousand-0.610
aughs-0.608
 remedies-0.608
TokenSome
Token ID4366
Feature activation-1.68e+00

Pos logit contributions
 awa+1.261
OOD+1.255
oz+1.185
Environment+1.183
romy+1.178
Neg logit contributions
 remedy-1.754
 Articles-1.729
 remedies-1.680
aughs-1.668
 Thousand-1.659
Token satellites
Token ID20372
Feature activation+1.06e+00

Pos logit contributions
 awa+1.391
OOD+1.310
romy+1.287
oz+1.272
Environment+1.229
Neg logit contributions
 Articles-0.967
 remedy-0.951
aughs-0.924
 Thousand-0.912
esi-0.909
Token take
Token ID1011
Feature activation-1.72e-01

Pos logit contributions
 Articles+0.546
 remedy+0.532
aughs+0.509
 Thousand+0.508
 remedies+0.500
Neg logit contributions
 awa-0.958
OOD-0.906
romy-0.896
oz-0.878
Environment-0.853
Token decades
Token ID4647
Feature activation-1.68e+00

Pos logit contributions
 awa+0.145
OOD+0.136
romy+0.134
oz+0.132
Environment+0.127
Neg logit contributions
 Articles-0.098
 remedy-0.096
aughs-0.093
esi-0.091
 Thousand-0.091
Token to
Token ID284
Feature activation-1.48e+00

Pos logit contributions
 awa+1.488
OOD+1.408
romy+1.395
oz+1.365
Environment+1.313
Neg logit contributions
 Articles-0.881
 remedy-0.859
aughs-0.816
 Thousand-0.814
esi-0.803
Token come
Token ID1282
Feature activation-7.38e-02

Pos logit contributions
 awa+1.195
OOD+1.114
romy+1.112
oz+1.087
Environment+1.049
Neg logit contributions
 Articles-0.893
aughs-0.850
 remedy-0.839
esi-0.839
 Thousand-0.831
Token back
Token ID736
Feature activation-1.85e+00

Pos logit contributions
 awa+0.060
OOD+0.056
romy+0.055
oz+0.054
Environment+0.052
Neg logit contributions
 Articles-0.045
 remedy-0.044
 Thousand-0.042
aughs-0.042
esi-0.042
Token after
Token ID706
Feature activation+5.53e-01

Pos logit contributions
 awa+1.593
OOD+1.510
romy+1.457
oz+1.448
Environment+1.390
Neg logit contributions
 Articles-1.021
 remedy-0.981
aughs-0.967
 Thousand-0.964
esi-0.943
Token Bill
Token ID3941
Feature activation-1.33e+00

Pos logit contributions
 awa+0.174
OOD+0.167
romy+0.162
oz+0.158
Environment+0.154
Neg logit contributions
 remedy-0.118
 Articles-0.116
esi-0.111
 remedies-0.111
aughs-0.110
Token said
Token ID531
Feature activation-5.10e-01

Pos logit contributions
 awa+1.112
OOD+1.057
romy+1.037
oz+1.010
Environment+0.963
Neg logit contributions
 Articles-0.769
 remedy-0.761
 remedies-0.714
 Thousand-0.712
aughs-0.704
Token --
Token ID1377
Feature activation+3.03e+00

Pos logit contributions
 awa+0.438
OOD+0.415
romy+0.408
oz+0.398
Environment+0.385
Neg logit contributions
 Articles-0.284
 remedy-0.280
 Thousand-0.264
aughs-0.264
 remedies-0.262
Token This
Token ID770
Feature activation-7.19e-01

Pos logit contributions
 Articles+1.659
 remedy+1.632
 Thousand+1.547
aughs+1.528
 Warn+1.528
Neg logit contributions
 awa-2.639
OOD-2.494
romy-2.463
oz-2.391
Environment-2.348
Token is
Token ID318
Feature activation-1.84e+00

Pos logit contributions
 awa+0.702
OOD+0.670
romy+0.658
oz+0.645
Environment+0.628
Neg logit contributions
 Articles-0.315
 remedy-0.306
 Thousand-0.286
aughs-0.286
 remedies-0.284
Token just
Token ID655
Feature activation+2.59e-01

Pos logit contributions
 awa+1.567
OOD+1.481
romy+1.452
oz+1.424
Environment+1.367
Neg logit contributions
 Articles-1.016
 remedy-0.995
aughs-0.960
esi-0.954
 remedies-0.940
Token a
Token ID257
Feature activation-3.72e-02

Pos logit contributions
 Articles+0.143
 remedy+0.141
aughs+0.133
 Thousand+0.133
esi+0.132
Neg logit contributions
 awa-0.223
OOD-0.212
romy-0.208
oz-0.203
Environment-0.196
Token little
Token ID1310
Feature activation-4.92e-01

Pos logit contributions
 awa+0.031
OOD+0.029
romy+0.029
oz+0.028
Environment+0.027
Neg logit contributions
 Articles-0.022
 remedy-0.021
aughs-0.020
esi-0.020
 remedies-0.020
Token something
Token ID1223
Feature activation-3.17e+00

Pos logit contributions
 awa+0.405
OOD+0.382
romy+0.374
oz+0.366
Environment+0.354
Neg logit contributions
 Articles-0.290
 remedy-0.283
aughs-0.272
 Thousand-0.271
esi-0.271
Token in
Token ID287
Feature activation+1.70e-01

Pos logit contributions
 awa+2.622
OOD+2.486
romy+2.431
oz+2.389
Environment+2.275
Neg logit contributions
 Articles-1.826
 remedy-1.791
aughs-1.726
esi-1.717
 Thousand-1.695
Token 17
Token ID1596
Feature activation-1.17e+00

Pos logit contributions
 Articles+0.092
 remedy+0.091
 remedies+0.085
 Thousand+0.085
esi+0.085
Neg logit contributions
 awa-0.148
OOD-0.140
romy-0.138
oz-0.134
Environment-0.131
Token
Token ID198
Feature activation+8.49e-02

Pos logit contributions
 awa+0.015
OOD+0.015
romy+0.014
oz+0.014
Environment+0.014
Neg logit contributions
 remedy-0.011
 Articles-0.010
aughs-0.010
 remedies-0.010
 Thousand-0.010
Token
Token ID198
Feature activation-6.68e-01

Pos logit contributions
 remedy+0.055
aughs+0.054
 Articles+0.053
 remedies+0.052
 Thousand+0.051
Neg logit contributions
OOD-0.062
oz-0.061
romy-0.059
Environment-0.058
 awa-0.058
TokenThanks
Token ID9690
Feature activation+1.66e+00

Pos logit contributions
 awa+0.386
OOD+0.370
romy+0.356
oz+0.349
Environment+0.337
Neg logit contributions
 remedy-0.550
 Articles-0.548
 Thousand-0.526
 remedies-0.526
aughs-0.526
Token to
Token ID284
Feature activation-9.82e-01

Pos logit contributions
 Articles+0.790
 remedy+0.775
 Thousand+0.735
esi+0.716
 Warn+0.715
Neg logit contributions
 awa-1.562
OOD-1.487
romy-1.471
oz-1.429
Environment-1.405
Token Penn
Token ID6595
Feature activation-1.49e+00

Pos logit contributions
 awa+0.819
OOD+0.770
romy+0.760
oz+0.739
Environment+0.718
Neg logit contributions
 Articles-0.565
 remedy-0.563
aughs-0.537
 Thousand-0.530
 remedies-0.529
Token and
Token ID290
Feature activation-1.16e+00

Pos logit contributions
 awa+1.278
OOD+1.211
romy+1.190
oz+1.162
Environment+1.133
Neg logit contributions
 Articles-0.823
 remedy-0.813
 Thousand-0.767
aughs-0.766
 remedies-0.760
Token Lab
Token ID3498
Feature activation-2.53e+00

Pos logit contributions
 awa+1.036
OOD+0.972
romy+0.957
oz+0.936
Environment+0.905
Neg logit contributions
 Articles-0.606
 remedy-0.603
aughs-0.575
 remedies-0.562
esi-0.561
Tokenov
Token ID709
Feature activation-3.84e-01

Pos logit contributions
 awa+1.608
OOD+1.523
romy+1.517
oz+1.493
Environment+1.388
Neg logit contributions
 Articles-1.928
 remedy-1.908
 Thousand-1.838
aughs-1.818
 remedies-1.810
Token,
Token ID11
Feature activation-1.42e+00

Pos logit contributions
 awa+0.347
OOD+0.330
romy+0.325
oz+0.317
Environment+0.309
Neg logit contributions
 Articles-0.197
 remedy-0.194
 Thousand-0.183
aughs-0.181
 remedies-0.181
Token now
Token ID783
Feature activation+5.08e-01

Pos logit contributions
 awa+1.183
OOD+1.115
romy+1.100
oz+1.071
Environment+1.034
Neg logit contributions
 Articles-0.816
 remedy-0.813
aughs-0.776
 Thousand-0.764
esi-0.762
Token 88
Token ID9193
Feature activation-1.03e-01

Pos logit contributions
 Articles+0.289
 remedy+0.286
aughs+0.272
 Thousand+0.270
 remedies+0.268
Neg logit contributions
 awa-0.430
OOD-0.406
romy-0.401
oz-0.389
Environment-0.379
Token
Token ID247
Feature activation+6.13e-01

Pos logit contributions
 remedy+0.567
 Articles+0.564
aughs+0.544
 Warn+0.535
 Thousand+0.533
Neg logit contributions
 awa-0.566
OOD-0.563
romy-0.543
oz-0.525
Environment-0.509
Tokens
Token ID82
Feature activation-9.42e-01

Pos logit contributions
 Articles+0.188
 remedy+0.183
 Thousand+0.164
 remedies+0.162
aughs+0.160
Neg logit contributions
 awa-0.682
OOD-0.652
romy-0.647
oz-0.630
Environment-0.620
Token this
Token ID428
Feature activation-1.71e+00

Pos logit contributions
 awa+0.806
OOD+0.761
romy+0.749
oz+0.732
Environment+0.711
Neg logit contributions
 Articles-0.524
 remedy-0.517
 Thousand-0.493
aughs-0.492
 Warn-0.486
Token strange
Token ID6283
Feature activation+5.79e-01

Pos logit contributions
 awa+1.411
OOD+1.327
romy+1.302
oz+1.273
Environment+1.235
Neg logit contributions
 Articles-1.003
 remedy-0.989
 Thousand-0.941
aughs-0.940
 remedies-0.938
Token irony
Token ID21296
Feature activation+1.84e+00

Pos logit contributions
 Articles+0.372
 remedy+0.370
 Thousand+0.351
 remedies+0.348
aughs+0.348
Neg logit contributions
 awa-0.447
OOD-0.422
romy-0.416
oz-0.402
Environment-0.390
Token that
Token ID326
Feature activation-1.49e+00

Pos logit contributions
 Articles+0.967
 remedy+0.963
 Thousand+0.903
 remedies+0.897
 Warn+0.894
Neg logit contributions
 awa-1.629
OOD-1.539
romy-1.523
oz-1.472
Environment-1.446
Token the
Token ID262
Feature activation-5.81e-01

Pos logit contributions
 awa+1.272
OOD+1.210
romy+1.186
oz+1.166
Environment+1.128
Neg logit contributions
 Articles-0.827
 remedy-0.824
aughs-0.782
 Thousand-0.775
esi-0.773
Token party
Token ID2151
Feature activation+3.75e-01

Pos logit contributions
 awa+0.475
OOD+0.448
romy+0.439
oz+0.430
Environment+0.417
Neg logit contributions
 Articles-0.344
 remedy-0.341
aughs-0.324
 Thousand-0.324
 remedies-0.321
Token
Token ID447
Feature activation+3.64e-01

Pos logit contributions
 Articles+0.182
 remedy+0.180
 Thousand+0.169
aughs+0.167
 remedies+0.167
Neg logit contributions
 awa-0.349
OOD-0.332
romy-0.327
oz-0.319
Environment-0.312
Token
Token ID247
Feature activation+3.42e-01

Pos logit contributions
 remedy+0.258
 Articles+0.254
aughs+0.247
 Warn+0.244
 Thousand+0.243
Neg logit contributions
OOD-0.247
 awa-0.245
romy-0.234
oz-0.227
Environment-0.223
Tokens
Token ID82
Feature activation+5.62e-01

Pos logit contributions
 Articles+0.115
 remedy+0.114
 Thousand+0.105
aughs+0.102
 remedies+0.101
Neg logit contributions
 awa-0.368
OOD-0.353
romy-0.347
oz-0.343
Environment-0.334
Token Jordan
Token ID8078
Feature activation-1.08e+00

Pos logit contributions
 awa+0.279
OOD+0.263
romy+0.259
oz+0.252
Environment+0.244
Neg logit contributions
 Articles-0.208
 remedy-0.206
aughs-0.196
 Thousand-0.195
 remedies-0.194
Token Horowitz
Token ID47591
Feature activation-1.09e+00

Pos logit contributions
 awa+0.895
OOD+0.848
romy+0.835
oz+0.815
Environment+0.786
Neg logit contributions
 Articles-0.633
 remedy-0.628
 remedies-0.592
aughs-0.590
 Thousand-0.589
Token delivering
Token ID13630
Feature activation-6.82e-01

Pos logit contributions
 awa+0.859
OOD+0.805
romy+0.791
oz+0.774
Environment+0.744
Neg logit contributions
 Articles-0.685
 remedy-0.671
aughs-0.645
 Thousand-0.642
 remedies-0.638
Token the
Token ID262
Feature activation-7.52e-02

Pos logit contributions
 awa+0.580
OOD+0.551
romy+0.537
oz+0.525
Environment+0.510
Neg logit contributions
 Articles-0.376
 remedy-0.371
 Thousand-0.357
esi-0.357
aughs-0.354
Token news
Token ID1705
Feature activation+1.90e+00

Pos logit contributions
 awa+0.066
OOD+0.062
romy+0.061
oz+0.060
Environment+0.058
Neg logit contributions
 Articles-0.040
 remedy-0.038
aughs-0.038
esi-0.037
 Thousand-0.037
Token to
Token ID284
Feature activation-1.24e+00

Pos logit contributions
 Articles+0.966
 remedy+0.953
 Thousand+0.898
aughs+0.890
 remedies+0.887
Neg logit contributions
 awa-1.725
OOD-1.639
romy-1.617
oz-1.575
Environment-1.535
Token the
Token ID262
Feature activation-1.70e+00

Pos logit contributions
 awa+1.059
OOD+1.004
romy+0.985
oz+0.958
Environment+0.932
Neg logit contributions
 Articles-0.692
 remedy-0.681
esi-0.650
 Thousand-0.646
 remedies-0.645
Token world
Token ID995
Feature activation-1.89e+00

Pos logit contributions
 awa+1.374
OOD+1.293
romy+1.269
oz+1.239
Environment+1.195
Neg logit contributions
 Articles-1.006
 remedy-0.995
esi-0.958
aughs-0.950
 remedies-0.945
Token that
Token ID326
Feature activation-1.33e+00

Pos logit contributions
 awa+1.670
OOD+1.583
romy+1.557
oz+1.534
Environment+1.485
Neg logit contributions
 Articles-0.989
 remedy-0.973
aughs-0.922
 Thousand-0.922
esi-0.917
Token
Token ID564
Feature activation-2.51e-01

Pos logit contributions
 awa+1.144
OOD+1.089
romy+1.058
oz+1.039
Environment+1.006
Neg logit contributions
 Articles-0.719
 remedy-0.707
esi-0.677
 Thousand-0.670
aughs-0.669
Token
Token ID250
Feature activation-3.00e-01

Pos logit contributions
 awa+0.157
OOD+0.151
romy+0.149
oz+0.142
Environment+0.137
Neg logit contributions
 remedy-0.192
 Articles-0.192
aughs-0.184
 Thousand-0.184
 Warn-0.183
Token State
Token ID1812
Feature activation-7.38e-01

Pos logit contributions
 awa+2.549
OOD+2.409
romy+2.360
oz+2.341
Environment+2.263
Neg logit contributions
 Articles-1.306
 remedy-1.283
aughs-1.221
 Thousand-1.208
 remedies-1.199
Token Cowboys
Token ID16535
Feature activation+3.11e+00

Pos logit contributions
 awa+0.668
OOD+0.632
romy+0.622
oz+0.610
Environment+0.593
Neg logit contributions
 Articles-0.375
 remedy-0.370
aughs-0.349
 Thousand-0.348
 remedies-0.345
Token also
Token ID635
Feature activation+1.66e+00

Pos logit contributions
 Articles+1.790
 remedy+1.776
 Thousand+1.668
 remedies+1.662
 Warn+1.650
Neg logit contributions
 awa-2.610
OOD-2.472
romy-2.440
oz-2.356
Environment-2.304
Token challenged
Token ID12827
Feature activation-1.80e+00

Pos logit contributions
 Articles+1.000
 remedy+0.987
 Thousand+0.941
aughs+0.933
 remedies+0.930
Neg logit contributions
 awa-1.346
OOD-1.269
romy-1.251
oz-1.215
Environment-1.179
Token Petty
Token ID41505
Feature activation-2.13e+00

Pos logit contributions
 awa+1.617
OOD+1.533
romy+1.503
oz+1.484
Environment+1.434
Neg logit contributions
 Articles-0.912
 remedy-0.900
aughs-0.864
esi-0.858
 Thousand-0.841
Token in
Token ID287
Feature activation-6.91e-01

Pos logit contributions
 awa+1.855
OOD+1.732
romy+1.715
oz+1.691
Environment+1.642
Neg logit contributions
 Articles-1.142
 remedy-1.112
aughs-1.099
 Thousand-1.070
esi-1.061
Token this
Token ID428
Feature activation-6.09e-01

Pos logit contributions
 awa+0.607
OOD+0.576
romy+0.570
oz+0.561
Environment+0.542
Neg logit contributions
 Articles-0.361
 remedy-0.358
aughs-0.341
esi-0.338
 remedies-0.334
Token fashion
Token ID6977
Feature activation-3.10e+00

Pos logit contributions
 awa+0.496
OOD+0.467
romy+0.459
oz+0.451
Environment+0.437
Neg logit contributions
 Articles-0.361
 remedy-0.356
aughs-0.346
 Thousand-0.340
 remedies-0.339
Token and
Token ID290
Feature activation-4.66e-01

Pos logit contributions
 awa+2.664
OOD+2.542
oz+2.485
romy+2.482
Environment+2.376
Neg logit contributions
 Articles-1.612
 remedy-1.576
aughs-1.565
esi-1.525
 Thousand-1.502
Token got
Token ID1392
Feature activation-6.42e-01

Pos logit contributions
 awa+0.381
OOD+0.359
romy+0.354
oz+0.347
Environment+0.334
Neg logit contributions
 Articles-0.275
 remedy-0.271
aughs-0.261
esi-0.258
 Thousand-0.257
Token him
Token ID683
Feature activation-1.81e+00

Pos logit contributions
 awa+0.524
OOD+0.490
romy+0.484
oz+0.473
Environment+0.457
Neg logit contributions
 Articles-0.382
 remedy-0.376
aughs-0.367
esi-0.361
 Thousand-0.358
Token that
Token ID326
Feature activation+6.19e-01

Pos logit contributions
 awa+0.594
OOD+0.559
romy+0.549
oz+0.537
Environment+0.521
Neg logit contributions
 Articles-0.430
 remedy-0.427
aughs-0.414
 Thousand-0.410
esi-0.406
Token it
Token ID340
Feature activation+1.43e-01

Pos logit contributions
 Articles+0.331
 remedy+0.329
aughs+0.317
 Thousand+0.314
esi+0.311
Neg logit contributions
 awa-0.541
OOD-0.511
romy-0.502
oz-0.491
Environment-0.478
Token is
Token ID318
Feature activation-3.89e-01

Pos logit contributions
 Articles+0.070
 remedy+0.069
 Thousand+0.067
aughs+0.065
esi+0.065
Neg logit contributions
 awa-0.133
OOD-0.126
romy-0.124
oz-0.122
Environment-0.118
Token able
Token ID1498
Feature activation-2.58e+00

Pos logit contributions
 awa+0.324
OOD+0.302
romy+0.298
oz+0.291
Environment+0.283
Neg logit contributions
 Articles-0.228
 remedy-0.222
aughs-0.216
 Thousand-0.215
esi-0.210
Token to
Token ID284
Feature activation+3.45e-01

Pos logit contributions
 awa+2.420
OOD+2.295
romy+2.266
oz+2.207
Environment+2.162
Neg logit contributions
 Articles-1.244
 remedy-1.240
 remedies-1.145
 Thousand-1.143
aughs-1.138
Token meet
Token ID1826
Feature activation-1.73e+00

Pos logit contributions
 Articles+0.204
 remedy+0.195
aughs+0.195
 Thousand+0.194
esi+0.190
Neg logit contributions
 awa-0.285
OOD-0.266
romy-0.264
oz-0.257
Environment-0.248
Token its
Token ID663
Feature activation+9.52e-01

Pos logit contributions
 awa+1.473
OOD+1.400
romy+1.366
oz+1.336
Environment+1.302
Neg logit contributions
 remedy-0.931
 Articles-0.923
aughs-0.917
 Thousand-0.900
esi-0.887
Token deficit
Token ID11807
Feature activation-3.52e+00

Pos logit contributions
 Articles+0.535
 remedy+0.531
aughs+0.518
 Thousand+0.510
esi+0.500
Neg logit contributions
 awa-0.801
OOD-0.762
romy-0.748
oz-0.729
Environment-0.709
Token targets
Token ID6670
Feature activation+1.17e+00

Pos logit contributions
 awa+2.916
OOD+2.733
romy+2.698
oz+2.621
Environment+2.540
Neg logit contributions
aughs-1.961
 Articles-1.925
 remedy-1.880
 Thousand-1.861
 Warn-1.817
Token,
Token ID11
Feature activation-6.09e-02

Pos logit contributions
 Articles+0.614
 remedy+0.606
aughs+0.576
 Thousand+0.575
 remedies+0.565
Neg logit contributions
 awa-1.038
OOD-0.987
romy-0.969
oz-0.948
Environment-0.920
Token given
Token ID1813
Feature activation-4.71e-01

Pos logit contributions
 awa+0.052
OOD+0.049
romy+0.049
oz+0.047
Environment+0.046
Neg logit contributions
 Articles-0.034
 remedy-0.033
 Thousand-0.032
aughs-0.032
esi-0.031
Token This
Token ID770
Feature activation+5.08e-01

Pos logit contributions
 remedy+0.376
 Articles+0.371
aughs+0.358
 remedies+0.356
esi+0.351
Neg logit contributions
 awa-0.456
OOD-0.434
romy-0.422
oz-0.412
Environment-0.404
Token 41
Token ID6073
Feature activation-1.67e-01

Pos logit contributions
 Articles+0.284
 remedy+0.275
aughs+0.268
 Thousand+0.265
esi+0.262
Neg logit contributions
 awa-0.434
OOD-0.411
romy-0.403
oz-0.393
Environment-0.383
Token.
Token ID13
Feature activation-1.02e+00

Pos logit contributions
 awa+0.135
OOD+0.127
romy+0.126
oz+0.121
Environment+0.118
Neg logit contributions
 Articles-0.101
 remedy-0.101
 Thousand-0.095
 remedies-0.094
aughs-0.094
Token2
Token ID17
Feature activation-2.47e+00

Pos logit contributions
 awa+0.694
OOD+0.662
romy+0.643
oz+0.630
Environment+0.604
Neg logit contributions
 Articles-0.730
 remedy-0.730
 remedies-0.696
 Thousand-0.695
aughs-0.695
Token-
Token ID12
Feature activation-2.30e+00

Pos logit contributions
 awa+1.863
OOD+1.754
romy+1.722
oz+1.678
Environment+1.616
Neg logit contributions
 Articles-1.625
 remedy-1.601
aughs-1.529
 Thousand-1.528
 remedies-1.522
Tokenmeter
Token ID27231
Feature activation-4.93e+00

Pos logit contributions
 awa+1.155
OOD+1.052
romy+1.027
oz+0.977
Environment+0.927
Neg logit contributions
 Articles-2.099
 remedy-2.083
 Thousand-2.014
aughs-2.005
 remedies-2.003
Token per
Token ID583
Feature activation+6.46e-02

Pos logit contributions
 awa+3.958
OOD+3.648
oz+3.637
romy+3.597
Environment+3.441
Neg logit contributions
 Articles-2.851
aughs-2.817
 remedy-2.720
esi-2.685
 remedies-2.670
Token second
Token ID1218
Feature activation-4.62e+00

Pos logit contributions
 Articles+0.043
 remedy+0.043
 Thousand+0.041
aughs+0.041
 remedies+0.041
Neg logit contributions
 awa-0.048
OOD-0.045
romy-0.045
oz-0.043
Environment-0.042
Token (
Token ID357
Feature activation-7.58e-01

Pos logit contributions
 awa+3.677
OOD+3.460
romy+3.388
oz+3.383
Environment+3.232
Neg logit contributions
 Articles-2.704
aughs-2.657
 remedy-2.535
esi-2.524
 Thousand-2.496
Token92
Token ID5892
Feature activation+1.55e+00

Pos logit contributions
 awa+0.506
OOD+0.477
romy+0.465
oz+0.456
Environment+0.436
Neg logit contributions
 Articles-0.565
 remedy-0.562
aughs-0.537
 Thousand-0.536
 remedies-0.535
Token.
Token ID13
Feature activation-7.24e-01

Pos logit contributions
 Articles+0.875
 remedy+0.864
 Thousand+0.828
 remedies+0.808
aughs+0.805
Neg logit contributions
 awa-1.328
OOD-1.253
romy-1.238
oz-1.187
Environment-1.172
Token shoulder
Token ID8163
Feature activation+6.85e-01

Pos logit contributions
 awa+0.616
OOD+0.578
romy+0.570
oz+0.557
Environment+0.538
Neg logit contributions
 Articles-0.450
 remedy-0.444
aughs-0.423
 Thousand-0.421
esi-0.419
Token poles
Token ID24971
Feature activation+3.16e+00

Pos logit contributions
 Articles+0.427
 remedy+0.423
 Thousand+0.402
 remedies+0.399
aughs+0.399
Neg logit contributions
 awa-0.543
OOD-0.512
romy-0.504
oz-0.488
Environment-0.475
Token,
Token ID11
Feature activation-3.35e-01

Pos logit contributions
 Articles+1.691
 remedy+1.674
 Thousand+1.576
 remedies+1.564
aughs+1.554
Neg logit contributions
 awa-2.775
OOD-2.633
romy-2.598
oz-2.521
Environment-2.471
Token is
Token ID318
Feature activation-7.72e-01

Pos logit contributions
 awa+0.298
OOD+0.282
romy+0.279
oz+0.272
Environment+0.264
Neg logit contributions
 Articles-0.175
 remedy-0.173
aughs-0.163
 Thousand-0.163
esi-0.162
Token shown
Token ID3402
Feature activation-1.72e+00

Pos logit contributions
 awa+0.626
OOD+0.588
romy+0.577
oz+0.565
Environment+0.548
Neg logit contributions
 Articles-0.468
 remedy-0.461
aughs-0.442
 Thousand-0.438
 remedies-0.434
Token next
Token ID1306
Feature activation-4.30e+00

Pos logit contributions
 awa+1.491
OOD+1.407
romy+1.386
oz+1.361
Environment+1.313
Neg logit contributions
 Articles-0.940
 remedy-0.920
aughs-0.892
 Thousand-0.878
esi-0.873
Token to
Token ID284
Feature activation-2.18e+00

Pos logit contributions
 awa+3.808
OOD+3.620
romy+3.570
oz+3.505
Environment+3.385
Neg logit contributions
 Articles-2.230
 remedy-2.193
aughs-2.073
 Thousand-2.071
 remedies-2.052
Token the
Token ID262
Feature activation-1.67e+00

Pos logit contributions
 awa+1.804
OOD+1.713
romy+1.681
oz+1.642
Environment+1.594
Neg logit contributions
 Articles-1.252
 remedy-1.249
aughs-1.187
esi-1.175
 remedies-1.167
Token ancient
Token ID6156
Feature activation+5.20e-01

Pos logit contributions
 awa+1.387
OOD+1.312
romy+1.288
oz+1.259
Environment+1.222
Neg logit contributions
 Articles-0.955
 remedy-0.948
aughs-0.907
 remedies-0.889
esi-0.888
Token city
Token ID1748
Feature activation+1.09e+00

Pos logit contributions
 Articles+0.290
 remedy+0.286
 Thousand+0.271
aughs+0.270
 remedies+0.268
Neg logit contributions
 awa-0.444
OOD-0.421
romy-0.414
oz-0.403
Environment-0.393
Token wall
Token ID3355
Feature activation-6.77e-02

Pos logit contributions
 Articles+0.595
 remedy+0.588
 Thousand+0.556
 remedies+0.551
aughs+0.549
Neg logit contributions
 awa-0.942
OOD-0.892
romy-0.881
oz-0.854
Environment-0.833
Token assassination
Token ID18898
Feature activation+7.43e-01

Pos logit contributions
 awa+0.159
OOD+0.149
romy+0.147
oz+0.142
Environment+0.138
Neg logit contributions
 Articles-0.141
 remedy-0.139
aughs-0.133
 Thousand-0.133
 remedies-0.132
Token.
Token ID13
Feature activation+6.42e-02

Pos logit contributions
 Articles+0.378
 remedy+0.373
 Thousand+0.351
aughs+0.349
 remedies+0.347
Neg logit contributions
 awa-0.673
OOD-0.639
romy-0.630
oz-0.615
Environment-0.598
Token But
Token ID887
Feature activation+1.50e-01

Pos logit contributions
 remedy+0.040
 Articles+0.038
 remedies+0.038
aughs+0.037
esi+0.037
Neg logit contributions
 awa-0.050
OOD-0.049
romy-0.047
oz-0.047
Environment-0.045
Token he
Token ID339
Feature activation-2.63e-01

Pos logit contributions
 Articles+0.087
 remedy+0.086
aughs+0.083
esi+0.081
 remedies+0.081
Neg logit contributions
 awa-0.124
OOD-0.118
romy-0.115
oz-0.112
Environment-0.108
Token never
Token ID1239
Feature activation+8.95e-01

Pos logit contributions
 awa+0.227
OOD+0.215
romy+0.210
oz+0.209
Environment+0.198
Neg logit contributions
 Articles-0.146
 remedy-0.140
aughs-0.136
 Thousand-0.136
esi-0.134
Token forgot
Token ID16453
Feature activation-4.63e+00

Pos logit contributions
 Articles+0.551
 remedy+0.537
 Thousand+0.517
aughs+0.516
esi+0.510
Neg logit contributions
 awa-0.720
OOD-0.675
romy-0.664
oz-0.649
Environment-0.625
Token his
Token ID465
Feature activation-2.21e-01

Pos logit contributions
 awa+3.636
OOD+3.493
oz+3.351
romy+3.332
stic+3.144
Neg logit contributions
 Articles-2.714
aughs-2.698
 remedy-2.650
esi-2.648
alore-2.532
Token goal
Token ID3061
Feature activation+1.07e+00

Pos logit contributions
 awa+0.166
OOD+0.156
romy+0.153
oz+0.149
Environment+0.143
Neg logit contributions
 Articles-0.145
 remedy-0.142
aughs-0.140
esi-0.136
 Thousand-0.135
Token,
Token ID11
Feature activation+1.62e-01

Pos logit contributions
 Articles+0.562
 remedy+0.554
 Thousand+0.524
aughs+0.521
 remedies+0.517
Neg logit contributions
 awa-0.949
OOD-0.901
romy-0.887
oz-0.865
Environment-0.841
Token and
Token ID290
Feature activation+8.88e-01

Pos logit contributions
 Articles+0.089
 remedy+0.087
aughs+0.083
 Thousand+0.083
esi+0.082
Neg logit contributions
 awa-0.140
OOD-0.133
romy-0.130
oz-0.127
Environment-0.123
Token after
Token ID706
Feature activation-1.13e+00

Pos logit contributions
 Articles+0.520
 remedy+0.512
aughs+0.492
esi+0.484
 Thousand+0.482
Neg logit contributions
 awa-0.737
OOD-0.698
romy-0.682
oz-0.667
Environment-0.643
Tokenpower
Token ID6477
Feature activation-1.96e+00

Pos logit contributions
 awa+2.038
romy+2.031
OOD+2.023
oz+1.970
Environment+1.939
Neg logit contributions
 Articles-1.834
 remedy-1.826
esi-1.806
aughs-1.801
 Thousand-1.777
Token,
Token ID11
Feature activation-1.03e+00

Pos logit contributions
 awa+1.621
OOD+1.556
romy+1.505
oz+1.500
Environment+1.437
Neg logit contributions
 Articles-1.102
 remedy-1.084
aughs-1.065
 remedies-1.039
esi-1.037
Token 4
Token ID604
Feature activation-5.03e-02

Pos logit contributions
 awa+0.898
OOD+0.844
romy+0.831
oz+0.825
Environment+0.795
Neg logit contributions
 Articles-0.549
 remedy-0.540
aughs-0.518
 remedies-0.512
 Thousand-0.509
Token34
Token ID2682
Feature activation+1.06e+00

Pos logit contributions
 awa+0.042
OOD+0.040
romy+0.039
oz+0.038
Environment+0.037
Neg logit contributions
 Articles-0.029
 remedy-0.029
aughs-0.028
 Thousand-0.027
 remedies-0.027
Token-
Token ID12
Feature activation-1.69e+00

Pos logit contributions
 Articles+0.665
 remedy+0.658
 Thousand+0.628
aughs+0.619
 Warn+0.619
Neg logit contributions
 awa-0.835
OOD-0.787
romy-0.777
oz-0.747
Environment-0.731
Tokenlb
Token ID23160
Feature activation-4.91e+00

Pos logit contributions
 awa+1.275
OOD+1.233
oz+1.226
romy+1.217
Environment+1.153
Neg logit contributions
 Articles-1.069
 remedy-1.055
 remedies-1.005
aughs-1.002
 Thousand-0.999
Token.-
Token ID7874
Feature activation+1.92e-01

Pos logit contributions
 awa+3.534
OOD+3.349
oz+3.320
romy+3.242
Environment+3.189
Neg logit contributions
 Articles-3.231
 remedy-3.182
 remedies-3.072
aughs-3.034
esi-3.000
Tokenft
Token ID701
Feature activation-1.59e+00

Pos logit contributions
 Articles+0.168
 remedy+0.167
 Thousand+0.161
aughs+0.160
 remedies+0.160
Neg logit contributions
 awa-0.103
OOD-0.095
romy-0.092
oz-0.090
Environment-0.085
Token.
Token ID13
Feature activation+6.61e-01

Pos logit contributions
 awa+1.341
OOD+1.280
romy+1.259
oz+1.242
Environment+1.188
Neg logit contributions
 Articles-0.895
 remedy-0.876
 remedies-0.837
aughs-0.828
 Thousand-0.824
Token 6
Token ID718
Feature activation+1.55e+00

Pos logit contributions
 Articles+0.385
 remedy+0.382
 remedies+0.364
esi+0.361
 Thousand+0.360
Neg logit contributions
 awa-0.542
OOD-0.516
romy-0.503
oz-0.499
Environment-0.477
Token.
Token ID13
Feature activation-2.63e-01

Pos logit contributions
 Articles+1.033
 remedy+1.022
 Thousand+0.977
aughs+0.969
 remedies+0.968
Neg logit contributions
 awa-1.157
OOD-1.085
romy-1.069
oz-1.032
Environment-1.002
Token down
Token ID866
Feature activation+1.24e+00

Pos logit contributions
 awa+0.431
OOD+0.408
romy+0.402
oz+0.392
Environment+0.381
Neg logit contributions
 Articles-0.277
 remedy-0.272
 Thousand-0.258
aughs-0.257
esi-0.254
Token and
Token ID290
Feature activation+2.02e-02

Pos logit contributions
 Articles+0.671
 remedy+0.658
aughs+0.629
 Thousand+0.620
esi+0.615
Neg logit contributions
 awa-1.080
OOD-1.024
romy-1.009
oz-0.984
Environment-0.959
Token say
Token ID910
Feature activation-5.23e-01

Pos logit contributions
 Articles+0.011
 remedy+0.011
aughs+0.011
esi+0.011
 Thousand+0.011
Neg logit contributions
 awa-0.017
OOD-0.016
romy-0.016
oz-0.016
Environment-0.015
Token,
Token ID11
Feature activation+8.16e-02

Pos logit contributions
 awa+0.458
OOD+0.432
romy+0.426
oz+0.417
Environment+0.406
Neg logit contributions
 Articles-0.280
 remedy-0.276
aughs-0.270
esi-0.258
 Thousand-0.257
Token '
Token ID705
Feature activation-7.05e-01

Pos logit contributions
 Articles+0.038
 remedy+0.037
aughs+0.035
esi+0.034
 remedies+0.034
Neg logit contributions
 awa-0.078
OOD-0.074
romy-0.073
oz-0.072
Environment-0.070
Tokenwell
Token ID4053
Feature activation-4.77e+00

Pos logit contributions
 awa+0.486
OOD+0.465
romy+0.457
oz+0.447
Environment+0.433
Neg logit contributions
 Articles-0.505
 remedy-0.502
aughs-0.480
 remedies-0.477
 Thousand-0.477
Token,
Token ID11
Feature activation+3.61e-01

Pos logit contributions
 awa+3.724
OOD+3.539
romy+3.448
oz+3.427
Environment+3.310
Neg logit contributions
 Articles-2.840
aughs-2.822
 remedy-2.773
 Enhance-2.648
 remedies-2.637
Token I
Token ID314
Feature activation+1.94e-01

Pos logit contributions
 Articles+0.199
 remedy+0.195
aughs+0.190
esi+0.183
 Thousand+0.183
Neg logit contributions
 awa-0.311
OOD-0.294
romy-0.291
oz-0.285
Environment-0.276
Token need
Token ID761
Feature activation+7.24e-01

Pos logit contributions
 Articles+0.123
 remedy+0.118
aughs+0.117
 Thousand+0.114
 remedies+0.113
Neg logit contributions
 awa-0.152
OOD-0.143
romy-0.142
oz-0.139
Environment-0.133
Token a
Token ID257
Feature activation+2.06e-01

Pos logit contributions
 Articles+0.386
 remedy+0.377
aughs+0.362
 Thousand+0.357
 remedies+0.353
Neg logit contributions
 awa-0.639
OOD-0.606
romy-0.598
oz-0.583
Environment-0.569
Token number
Token ID1271
Feature activation+2.18e+00

Pos logit contributions
 Articles+0.123
aughs+0.119
 remedy+0.117
 remedies+0.114
esi+0.114
Neg logit contributions
 awa-0.169
OOD-0.159
romy-0.156
oz-0.154
Environment-0.148