TOP ACTIVATIONS
MAX = 1.939

the great libertarian thinker , here . Read more
can be done with the following equation : which
. At the same time , the British government
did this by executing the following queries and saving the results
want you to run the following WITHOUT reconnect ing to your
At the same time , another official added
wing Target ing Computer app here . I
g 7 p k Y sv M bo ] <|endoftext|> No
" Ask yourself if the following story does not sound like
and international treaties . Any reproduction , copying , or redistribution
interested in characters that start off one way and end another
2 g 7 p k Y sv M bo ] <|endoftext|>
take a rest . In bed . Oh god ." She
at ASUS R OG just over five weeks ago . Then
F ears over Melbourne fixes When
latest C rave Cast podcast here ! C rave
want to per use . Here is the link for buying
the federal law , for proof of this : " Defense
Write () is only the beginning . For example , this
asc ension . George …… .

MIN ACTIVATIONS
MIN = -2.041

<|endoftext|> Image copyright ESA Image caption Go ce uses an electric
. <|endoftext|> Image copyright publicity Image caption The Greeks
than 10 years in prison MORE , in his weekly column
<|endoftext|> Image copyright AP Image caption Peace talks between the government
Serve hot ! <|endoftext|> Image copyright ESA Image caption Go ce
. <|endoftext|> Image copyright Deadline Police have
million years . <|endoftext|> Image copyright AFP The Bank
ported License . <|endoftext|> Image copyright AP Image caption Peace talks
media reported . <|endoftext|> Image caption Some talked of " b
unsupported on your device Media caption Nigel Farage : " Lord
<|endoftext|> Image copyright publicity Image caption The Greeks were totally "
: 13 PH OTOS Donald Trump 's potential running
Image copyright AFP Image caption The Kurds ' push has
Russia . Image copyright AFP Image caption The Kurds
holes in American politics ' MORE completed a swing through North
ACE ). Image caption Council staff s ifted through
unsupported on your device Media caption A drone - camera was
on sleeve at basketball game MORE and Hillary Clinton Hillary Diane
ju illet . pic . twitter . com / u I
media bus ) pic . twitter . com / z J

INTERVAL 0.944 - 1.939
CONTAINS 0.222%

2014 after the company nearly default ed on its debt of
opposing the self - aff irming good . In this sense
Copyright © 2016 Else vier B . V . All
b 6 ce 650 c 6 str ace . so
com / W 8 v X Ds 4 j 0 o

INTERVAL -0.051 - 0.944
CONTAINS 57.578%

the first time , Chang and colleagues also recorded and analyzed
usual , with berries , but not too many ) 200
. PT on June 26 . Thanks , Wii
of the utility 's finances . R ising costs
traveler , he told Fox News . The policy

INTERVAL -1.046 - -0.051
CONTAINS 42.093%

working with Nerd Kingdom to try and match them as closely
I decided to also try running a complex application bundled as
- world problem . They worked well together and never gave
it s important to enable them for the future
the only way to do that is to know exactly who

INTERVAL -2.041 - -1.046
CONTAINS 0.107%

{ ( 64 s )} 24 . Q xc 3 {
G standards , guaranteeing the card 's stability and reliability under
is disrupting hospitality . https :// www . bel fast te
disastrous season , he was traded to the Cleveland Browns just
: 04 30 z BLACK FO OT T OC RE P
Token the
Token ID262
Feature activation+7.98e-02

Pos logit contributions
stood+0.140
Pos+0.122
 Stard+0.119
 Os+0.118
osit+0.118
Neg logit contributions
 AMER-0.145
heastern-0.141
uble-0.138
utz-0.137
 rug-0.136
Token great
Token ID1049
Feature activation-8.44e-01

Pos logit contributions
stood+0.251
 Stard+0.215
Pos+0.213
 Os+0.213
osit+0.211
Neg logit contributions
 AMER-0.323
heastern-0.315
uble-0.315
utz-0.310
 rug-0.308
Token libertarian
Token ID19466
Feature activation+2.36e-01

Pos logit contributions
 AMER+2.118
heastern+2.033
uble+1.966
utz+1.930
 rug+1.911
Neg logit contributions
stood-3.998
Pos-3.627
 Stard-3.549
 Os-3.526
osit-3.516
Token thinker
Token ID45206
Feature activation+5.02e-02

Pos logit contributions
stood+0.666
Pos+0.557
 Stard+0.548
 Os+0.545
osit+0.544
Neg logit contributions
 AMER-1.039
heastern-1.021
uble-1.010
utz-0.997
 rug-0.984
Token,
Token ID11
Feature activation+4.69e-01

Pos logit contributions
stood+0.144
Pos+0.121
 Stard+0.117
 Os+0.117
osit+0.116
Neg logit contributions
 AMER-0.220
heastern-0.216
uble-0.213
utz-0.210
 rug-0.208
Token here
Token ID994
Feature activation+1.94e+00

Pos logit contributions
stood+1.383
 Os+1.229
 Stard+1.224
Pos+1.222
osit+1.207
Neg logit contributions
heastern-1.908
 AMER-1.891
utz-1.873
uble-1.873
 rug-1.798
Token.
Token ID13
Feature activation+6.44e-01

Pos logit contributions
stood+4.807
 Os+4.480
Pos+4.414
 Stard+4.389
 Kak+4.330
Neg logit contributions
 AMER-8.296
heastern-8.259
 rug-8.146
uble-8.028
utz-8.004
Token\n
Token ID198
Feature activation+3.37e-01

Pos logit contributions
stood+1.904
 Os+1.762
 Stard+1.739
osit+1.734
Pos+1.709
Neg logit contributions
heastern-2.711
 AMER-2.576
uble-2.565
utz-2.509
 rug-2.499
Token\n
Token ID198
Feature activation+2.51e-01

Pos logit contributions
stood+1.167
Pos+1.015
 Os+0.982
 Stard+0.981
osit+0.974
Neg logit contributions
 AMER-1.292
heastern-1.266
uble-1.235
utz-1.215
 rug-1.214
TokenRead
Token ID5569
Feature activation+3.74e-01

Pos logit contributions
stood+0.903
Pos+0.806
 Stard+0.780
 Os+0.778
osit+0.773
Neg logit contributions
 AMER-0.908
heastern-0.894
uble-0.869
utz-0.855
 rug-0.853
Token more
Token ID517
Feature activation+1.34e-02

Pos logit contributions
stood+1.351
Pos+1.178
 Stard+1.149
 Os+1.142
osit+1.138
Neg logit contributions
 AMER-1.366
heastern-1.327
uble-1.305
utz-1.289
 rug-1.275
Token can
Token ID460
Feature activation+3.22e-01

Pos logit contributions
stood+0.910
Pos+0.789
 Stard+0.764
 Os+0.764
osit+0.756
Neg logit contributions
 AMER-1.076
heastern-1.054
uble-1.035
utz-1.025
 rug-1.006
Token be
Token ID307
Feature activation+5.08e-01

Pos logit contributions
stood+1.135
Pos+0.991
 Os+0.967
 Stard+0.965
osit+0.962
Neg logit contributions
 AMER-1.188
heastern-1.163
uble-1.134
utz-1.124
 rug-1.096
Token done
Token ID1760
Feature activation+9.23e-01

Pos logit contributions
stood+1.729
Pos+1.484
 Os+1.453
 Stard+1.444
osit+1.423
Neg logit contributions
 AMER-1.950
heastern-1.912
uble-1.880
utz-1.843
 rug-1.807
Token with
Token ID351
Feature activation+5.30e-01

Pos logit contributions
stood+2.947
Pos+2.551
 Os+2.521
 eval+2.485
 Stard+2.474
Neg logit contributions
 AMER-3.644
heastern-3.544
utz-3.488
uble-3.486
 rug-3.352
Token the
Token ID262
Feature activation+4.87e-01

Pos logit contributions
stood+1.775
Pos+1.547
 Os+1.499
 Stard+1.489
 eval+1.473
Neg logit contributions
 AMER-2.058
heastern-2.029
uble-1.984
utz-1.965
 rug-1.915
Token following
Token ID1708
Feature activation+1.76e+00

Pos logit contributions
stood+1.649
Pos+1.440
 Os+1.396
 Stard+1.393
 eval+1.369
Neg logit contributions
 AMER-1.864
heastern-1.842
uble-1.809
utz-1.784
 rug-1.739
Token equation
Token ID16022
Feature activation+1.17e-02

Pos logit contributions
stood+5.497
 eval+4.866
Pos+4.846
 Pyth+4.830
 Os+4.829
Neg logit contributions
heastern-6.692
 AMER-6.616
uble-6.496
utz-6.364
 rug-6.159
Token:
Token ID25
Feature activation+5.25e-01

Pos logit contributions
stood+0.040
Pos+0.035
 Stard+0.034
 Os+0.034
osit+0.033
Neg logit contributions
 AMER-0.045
heastern-0.044
uble-0.043
utz-0.042
 rug-0.042
Token\n
Token ID198
Feature activation+4.34e-01

Pos logit contributions
stood+1.736
Pos+1.551
 Os+1.486
 Stard+1.483
 eval+1.451
Neg logit contributions
 AMER-2.031
heastern-2.012
uble-1.953
utz-1.936
 rug-1.881
Token\n
Token ID198
Feature activation+4.35e-01

Pos logit contributions
stood+1.477
Pos+1.287
 Os+1.246
 Stard+1.238
osit+1.231
Neg logit contributions
 AMER-1.680
heastern-1.659
uble-1.605
utz-1.580
 rug-1.576
Tokenwhich
Token ID4758
Feature activation-1.31e-01

Pos logit contributions
stood+1.498
Pos+1.331
 Stard+1.249
osit+1.249
 Os+1.242
Neg logit contributions
 AMER-1.669
heastern-1.626
uble-1.582
utz-1.558
 rug-1.553
Token.
Token ID13
Feature activation+1.36e-02

Pos logit contributions
stood+0.907
Pos+0.771
 Os+0.759
 Stard+0.756
 Kak+0.740
Neg logit contributions
 AMER-1.112
heastern-1.089
uble-1.070
utz-1.061
 rug-1.048
Token\n
Token ID198
Feature activation+1.54e-01

Pos logit contributions
stood+0.048
Pos+0.042
 Os+0.041
 Stard+0.041
osit+0.041
Neg logit contributions
 AMER-0.050
heastern-0.049
uble-0.049
utz-0.048
 rug-0.047
Token\n
Token ID198
Feature activation+3.15e-01

Pos logit contributions
stood+0.540
Pos+0.471
 Os+0.458
 Stard+0.457
osit+0.453
Neg logit contributions
 AMER-0.576
heastern-0.567
uble-0.552
utz-0.544
 rug-0.541
TokenAt
Token ID2953
Feature activation+4.65e-01

Pos logit contributions
stood+1.059
Pos+0.935
osit+0.882
 Os+0.880
 Stard+0.878
Neg logit contributions
 AMER-1.229
heastern-1.204
uble-1.179
 rug-1.159
utz-1.153
Token the
Token ID262
Feature activation+3.47e-01

Pos logit contributions
stood+1.593
 Os+1.352
Pos+1.348
 Stard+1.338
osit+1.323
Neg logit contributions
 AMER-1.772
heastern-1.752
uble-1.713
utz-1.698
 rug-1.673
Token same
Token ID976
Feature activation+1.67e+00

Pos logit contributions
stood+1.202
 Os+1.028
Pos+1.025
 Stard+1.025
osit+1.001
Neg logit contributions
 AMER-1.295
heastern-1.287
uble-1.254
utz-1.246
 rug-1.222
Token time
Token ID640
Feature activation+7.81e-01

Pos logit contributions
stood+3.694
 Os+3.094
Pos+3.009
 Stard+2.898
 Lev+2.865
Neg logit contributions
heastern-7.823
 AMER-7.760
uble-7.705
utz-7.636
heast-7.508
Token,
Token ID11
Feature activation+3.69e-01

Pos logit contributions
stood+2.084
Pos+1.730
 Os+1.720
 Stard+1.692
 Kak+1.663
Neg logit contributions
 AMER-3.505
heastern-3.448
uble-3.406
utz-3.348
 rug-3.315
Token the
Token ID262
Feature activation+7.54e-02

Pos logit contributions
stood+1.189
Pos+1.020
 Os+1.007
 Stard+0.998
osit+0.981
Neg logit contributions
 AMER-1.469
heastern-1.450
uble-1.429
utz-1.399
 rug-1.388
Token British
Token ID3517
Feature activation+2.01e-01

Pos logit contributions
stood+0.275
Pos+0.239
 Os+0.236
 Stard+0.235
osit+0.232
Neg logit contributions
 AMER-0.271
heastern-0.266
uble-0.261
utz-0.257
 rug-0.253
Token government
Token ID1230
Feature activation+7.84e-01

Pos logit contributions
stood+0.754
Pos+0.658
 Stard+0.650
 Os+0.647
osit+0.633
Neg logit contributions
 AMER-0.701
heastern-0.690
uble-0.674
utz-0.663
 rug-0.654
Token did
Token ID750
Feature activation-9.60e-02

Pos logit contributions
 AMER+0.723
heastern+0.705
uble+0.693
utz+0.684
 rug+0.678
Neg logit contributions
stood-0.689
Pos-0.599
 Stard-0.585
 Os-0.581
osit-0.578
Token this
Token ID428
Feature activation-2.49e-01

Pos logit contributions
 AMER+0.348
heastern+0.338
uble+0.333
utz+0.329
 rug+0.324
Neg logit contributions
stood-0.347
Pos-0.304
 Stard-0.298
 Os-0.297
osit-0.295
Token by
Token ID416
Feature activation-2.35e-01

Pos logit contributions
 AMER+0.909
heastern+0.885
uble+0.870
utz+0.859
 rug+0.849
Neg logit contributions
stood-0.901
Pos-0.786
 Stard-0.767
 Os-0.763
osit-0.760
Token executing
Token ID23710
Feature activation-1.41e-01

Pos logit contributions
 AMER+0.850
heastern+0.827
uble+0.812
utz+0.802
 rug+0.794
Neg logit contributions
stood-0.853
Pos-0.744
 Stard-0.726
 Os-0.721
osit-0.720
Token the
Token ID262
Feature activation+4.77e-01

Pos logit contributions
 AMER+0.536
heastern+0.525
uble+0.517
utz+0.511
 rug+0.501
Neg logit contributions
stood-0.487
Pos-0.422
 Stard-0.415
 Os-0.413
osit-0.407
Token following
Token ID1708
Feature activation+1.62e+00

Pos logit contributions
stood+1.631
Pos+1.414
 Stard+1.410
 Os+1.401
 eval+1.391
Neg logit contributions
 AMER-1.787
heastern-1.764
uble-1.743
utz-1.719
 rug-1.669
Token queries
Token ID20743
Feature activation+1.30e-01

Pos logit contributions
stood+4.930
 eval+4.538
 Stard+4.446
 Os+4.417
Pos+4.367
Neg logit contributions
 AMER-6.105
heastern-6.070
uble-6.062
utz-5.961
 rug-5.626
Token and
Token ID290
Feature activation+1.71e-01

Pos logit contributions
stood+0.436
Pos+0.382
 Os+0.374
 Stard+0.373
osit+0.367
Neg logit contributions
 AMER-0.498
heastern-0.487
uble-0.479
utz-0.475
 rug-0.466
Token saving
Token ID8914
Feature activation+7.22e-01

Pos logit contributions
stood+0.597
Pos+0.517
 Stard+0.510
 Os+0.508
osit+0.499
Neg logit contributions
 AMER-0.641
heastern-0.629
uble-0.617
utz-0.608
 rug-0.598
Token the
Token ID262
Feature activation+2.90e-01

Pos logit contributions
stood+2.454
Pos+2.154
 Stard+2.147
 Os+2.117
osit+2.100
Neg logit contributions
 AMER-2.678
heastern-2.648
uble-2.635
utz-2.570
 rug-2.520
Token results
Token ID2482
Feature activation+7.53e-02

Pos logit contributions
stood+0.976
Pos+0.851
 Stard+0.839
 Os+0.829
osit+0.820
Neg logit contributions
 AMER-1.113
heastern-1.097
uble-1.082
utz-1.066
 rug-1.043
Token want
Token ID765
Feature activation-4.19e-01

Pos logit contributions
 AMER+0.606
heastern+0.590
uble+0.579
utz+0.573
 rug+0.568
Neg logit contributions
stood-0.588
Pos-0.513
 Stard-0.501
 Os-0.497
osit-0.495
Token you
Token ID345
Feature activation+9.35e-02

Pos logit contributions
 AMER+1.468
heastern+1.430
uble+1.404
utz+1.387
 rug+1.366
Neg logit contributions
stood-1.574
Pos-1.383
 Stard-1.350
 Os-1.345
osit-1.337
Token to
Token ID284
Feature activation-5.09e-02

Pos logit contributions
stood+0.346
Pos+0.305
 Stard+0.298
 Os+0.297
osit+0.294
Neg logit contributions
 AMER-0.330
heastern-0.322
uble-0.316
utz-0.313
 rug-0.307
Token run
Token ID1057
Feature activation-1.38e-01

Pos logit contributions
 AMER+0.173
heastern+0.168
uble+0.165
utz+0.163
 rug+0.161
Neg logit contributions
stood-0.196
Pos-0.173
 Stard-0.169
 Os-0.168
osit-0.167
Token the
Token ID262
Feature activation+3.60e-01

Pos logit contributions
 AMER+0.531
heastern+0.521
uble+0.512
utz+0.505
 rug+0.497
Neg logit contributions
stood-0.467
Pos-0.404
 Stard-0.395
 Os-0.394
osit-0.389
Token following
Token ID1708
Feature activation+1.62e+00

Pos logit contributions
stood+1.233
Pos+1.065
 Os+1.044
 Stard+1.043
osit+1.024
Neg logit contributions
 AMER-1.374
heastern-1.356
uble-1.332
utz-1.314
 rug-1.279
Token WITHOUT
Token ID42881
Feature activation+1.11e-01

Pos logit contributions
stood+5.120
 Os+4.572
 eval+4.562
Pos+4.507
 Stard+4.469
Neg logit contributions
heastern-6.125
 AMER-6.028
uble-6.013
utz-5.864
heast-5.535
Token reconnect
Token ID37671
Feature activation+3.36e-01

Pos logit contributions
stood+0.407
Pos+0.356
 Stard+0.348
 Os+0.348
osit+0.344
Neg logit contributions
 AMER-0.396
heastern-0.390
uble-0.382
utz-0.377
 rug-0.370
Tokening
Token ID278
Feature activation-1.29e-01

Pos logit contributions
stood+0.918
Pos+0.780
 Stard+0.750
 Os+0.746
osit+0.743
Neg logit contributions
 AMER-1.505
heastern-1.483
uble-1.447
utz-1.435
 rug-1.418
Token to
Token ID284
Feature activation+1.80e-01

Pos logit contributions
 AMER+0.476
heastern+0.465
uble+0.456
utz+0.451
 rug+0.444
Neg logit contributions
stood-0.456
Pos-0.399
 Stard-0.390
 Os-0.389
osit-0.385
Token your
Token ID534
Feature activation-2.55e-01

Pos logit contributions
stood+0.626
Pos+0.544
 Os+0.536
 Stard+0.535
osit+0.527
Neg logit contributions
 AMER-0.671
heastern-0.660
uble-0.647
utz-0.644
 rug-0.626
Token
Token ID251
Feature activation-1.51e-01

Pos logit contributions
stood+0.453
Pos+0.402
 Os+0.389
 Stard+0.389
osit+0.386
Neg logit contributions
 AMER-0.432
heastern-0.420
uble-0.412
utz-0.409
 rug-0.401
Token\n
Token ID198
Feature activation+3.71e-01

Pos logit contributions
 AMER+0.567
heastern+0.556
uble+0.545
utz+0.538
 rug+0.534
Neg logit contributions
stood-0.527
Pos-0.459
 Stard-0.447
 Os-0.447
osit-0.443
Token\n
Token ID198
Feature activation+3.17e-01

Pos logit contributions
stood+1.247
Pos+1.093
 Os+1.059
 Stard+1.049
osit+1.039
Neg logit contributions
 AMER-1.441
heastern-1.425
uble-1.384
 rug-1.363
utz-1.357
TokenAt
Token ID2953
Feature activation+4.26e-01

Pos logit contributions
stood+1.071
Pos+0.948
osit+0.895
 Stard+0.892
 Os+0.891
Neg logit contributions
 AMER-1.225
heastern-1.206
uble-1.179
 rug-1.166
utz-1.160
Token the
Token ID262
Feature activation+2.08e-01

Pos logit contributions
stood+1.473
Pos+1.255
 Stard+1.242
 Os+1.241
osit+1.221
Neg logit contributions
 AMER-1.619
heastern-1.589
uble-1.553
utz-1.546
 rug-1.524
Token same
Token ID976
Feature activation+1.61e+00

Pos logit contributions
stood+0.704
Pos+0.600
 Stard+0.593
 Os+0.592
osit+0.582
Neg logit contributions
 AMER-0.804
heastern-0.792
uble-0.773
utz-0.771
 rug-0.757
Token time
Token ID640
Feature activation+7.61e-01

Pos logit contributions
stood+3.611
Pos+3.017
 Os+2.991
 Stard+2.830
 eval+2.780
Neg logit contributions
 AMER-7.479
heastern-7.473
uble-7.346
utz-7.324
 rug-7.173
Token,
Token ID11
Feature activation+3.35e-01

Pos logit contributions
stood+2.075
Pos+1.710
 Os+1.680
 Stard+1.671
osit+1.639
Neg logit contributions
 AMER-3.388
heastern-3.322
uble-3.277
utz-3.262
 rug-3.218
Token another
Token ID1194
Feature activation+2.68e-01

Pos logit contributions
stood+1.234
Pos+1.076
 Os+1.057
 Stard+1.055
osit+1.043
Neg logit contributions
 AMER-1.190
heastern-1.165
uble-1.143
utz-1.129
 rug-1.116
Token official
Token ID1743
Feature activation+9.76e-02

Pos logit contributions
stood+1.029
Pos+0.899
 Os+0.887
 Stard+0.881
osit+0.872
Neg logit contributions
 AMER-0.903
heastern-0.885
uble-0.865
utz-0.859
 rug-0.843
Token added
Token ID2087
Feature activation-2.51e-01

Pos logit contributions
stood+0.306
Pos+0.260
 Stard+0.252
 Os+0.252
osit+0.250
Neg logit contributions
 AMER-0.403
heastern-0.394
uble-0.387
utz-0.383
 rug-0.379
Tokenwing
Token ID5469
Feature activation+6.52e-01

Pos logit contributions
 AMER+1.493
heastern+1.463
uble+1.447
utz+1.432
 rug+1.417
Neg logit contributions
stood-1.090
Pos-0.905
 Os-0.891
 Stard-0.890
osit-0.887
Token Target
Token ID12744
Feature activation+3.50e-01

Pos logit contributions
stood+2.155
Pos+1.866
 Stard+1.862
 Os+1.841
osit+1.790
Neg logit contributions
 AMER-2.543
heastern-2.521
uble-2.444
utz-2.411
 rug-2.356
Tokening
Token ID278
Feature activation-1.23e-01

Pos logit contributions
stood+1.150
Pos+0.996
 Stard+0.974
 Os+0.962
osit+0.944
Neg logit contributions
 AMER-1.393
heastern-1.365
uble-1.338
utz-1.318
 rug-1.291
Token Computer
Token ID13851
Feature activation-7.54e-01

Pos logit contributions
 AMER+0.495
heastern+0.486
uble+0.477
utz+0.470
 rug+0.464
Neg logit contributions
stood-0.394
Pos-0.338
 Stard-0.330
 Os-0.329
osit-0.325
Token app
Token ID598
Feature activation+1.53e-01

Pos logit contributions
 AMER+2.827
heastern+2.728
uble+2.699
 rug+2.669
utz+2.665
Neg logit contributions
stood-2.611
Pos-2.267
osit-2.205
 Stard-2.198
 Os-2.183
Token here
Token ID994
Feature activation+1.59e+00

Pos logit contributions
stood+0.526
Pos+0.455
 Stard+0.448
 Os+0.444
osit+0.439
Neg logit contributions
 AMER-0.586
heastern-0.575
uble-0.561
utz-0.555
 rug-0.548
Token.
Token ID13
Feature activation+1.71e-01

Pos logit contributions
stood+4.600
 Stard+4.123
 Os+4.089
Pos+4.041
 Kak+3.886
Neg logit contributions
 AMER-6.328
heastern-6.301
uble-6.073
utz-6.041
 rug-5.951
Token\n
Token ID198
Feature activation+2.69e-01

Pos logit contributions
stood+0.614
Pos+0.541
 Stard+0.532
 Os+0.528
osit+0.521
Neg logit contributions
 AMER-0.619
heastern-0.613
uble-0.599
utz-0.590
 rug-0.583
Token\n
Token ID198
Feature activation+3.30e-01

Pos logit contributions
stood+0.958
Pos+0.843
 Stard+0.815
 Os+0.811
osit+0.808
Neg logit contributions
 AMER-0.996
heastern-0.976
uble-0.956
utz-0.942
 rug-0.936
TokenI
Token ID40
Feature activation-1.62e-01

Pos logit contributions
stood+1.126
Pos+0.994
 Stard+0.951
osit+0.943
 Os+0.941
Neg logit contributions
 AMER-1.268
heastern-1.240
uble-1.217
utz-1.198
 rug-1.192
Token
Token ID447
Feature activation-2.84e-02

Pos logit contributions
 AMER+0.560
heastern+0.544
uble+0.535
utz+0.527
 rug+0.523
Neg logit contributions
stood-0.615
Pos-0.542
 Stard-0.527
 Os-0.525
osit-0.523
Tokeng
Token ID70
Feature activation+6.33e-01

Pos logit contributions
stood+2.208
Pos+1.938
osit+1.850
 Os+1.841
 Stard+1.825
Neg logit contributions
 AMER-2.661
heastern-2.566
uble-2.501
 rug-2.485
utz-2.457
Token7
Token ID22
Feature activation+9.51e-01

Pos logit contributions
stood+1.953
Pos+1.700
osit+1.617
 Os+1.612
 Stard+1.598
Neg logit contributions
 AMER-2.633
heastern-2.551
uble-2.489
 rug-2.471
utz-2.446
Tokenp
Token ID79
Feature activation+9.26e-01

Pos logit contributions
stood+3.109
Pos+2.752
osit+2.619
 Os+2.601
 Stard+2.559
Neg logit contributions
 AMER-3.754
heastern-3.609
uble-3.502
 rug-3.491
utz-3.425
Tokenk
Token ID74
Feature activation+1.12e+00

Pos logit contributions
stood+2.832
Pos+2.501
osit+2.373
 Os+2.349
 Stard+2.308
Neg logit contributions
 AMER-3.844
heastern-3.715
 rug-3.600
uble-3.599
utz-3.540
TokenY
Token ID56
Feature activation+1.51e+00

Pos logit contributions
stood+3.488
Pos+3.098
osit+2.932
 Os+2.903
SCP+2.838
Neg logit contributions
 AMER-4.558
heastern-4.387
 rug-4.252
uble-4.234
utz-4.153
Tokensv
Token ID21370
Feature activation+1.59e+00

Pos logit contributions
stood+4.838
Pos+4.354
osit+4.115
 Os+4.046
SCP+3.996
Neg logit contributions
 AMER-5.911
heastern-5.620
 rug-5.489
uble-5.428
utz-5.292
TokenM
Token ID44
Feature activation+1.23e+00

Pos logit contributions
stood+4.917
Pos+4.442
osit+4.189
 Os+4.118
SCP+4.089
Neg logit contributions
 AMER-6.302
heastern-6.027
 rug-5.863
uble-5.791
utz-5.666
Tokenbo
Token ID2127
Feature activation+8.18e-01

Pos logit contributions
stood+3.965
Pos+3.531
osit+3.380
 Os+3.323
 Stard+3.247
Neg logit contributions
 AMER-4.830
heastern-4.635
 rug-4.492
uble-4.468
utz-4.389
Token]
Token ID60
Feature activation+2.13e-01

Pos logit contributions
stood+2.887
Pos+2.594
 Os+2.553
 Stard+2.478
osit+2.474
Neg logit contributions
 AMER-2.924
heastern-2.916
uble-2.822
utz-2.802
 rug-2.783
Token<|endoftext|>
Token ID50256
Feature activation-2.11e-01

Pos logit contributions
stood+0.747
Pos+0.658
 Os+0.637
 Stard+0.634
osit+0.628
Neg logit contributions
 AMER-0.788
heastern-0.776
uble-0.762
utz-0.752
 rug-0.747
TokenNo
Token ID2949
Feature activation-4.77e-01

Pos logit contributions
 AMER+0.827
heastern+0.807
uble+0.793
utz+0.783
 rug+0.778
Neg logit contributions
stood-0.706
Pos-0.613
 Stard-0.591
 Os-0.587
osit-0.586
Token "
Token ID366
Feature activation-1.01e-01

Pos logit contributions
stood+0.037
Pos+0.032
 Stard+0.032
 Os+0.032
osit+0.031
Neg logit contributions
 AMER-0.040
heastern-0.039
uble-0.039
utz-0.038
 rug-0.038
TokenAsk
Token ID25214
Feature activation+2.00e-01

Pos logit contributions
 AMER+0.370
heastern+0.360
uble+0.354
utz+0.349
 rug+0.345
Neg logit contributions
stood-0.362
Pos-0.316
 Stard-0.308
 Os-0.307
osit-0.305
Token yourself
Token ID3511
Feature activation+6.50e-01

Pos logit contributions
stood+0.760
Pos+0.659
 Os+0.655
 Stard+0.652
osit+0.640
Neg logit contributions
 AMER-0.687
heastern-0.680
uble-0.663
utz-0.657
 rug-0.645
Token if
Token ID611
Feature activation+1.37e-01

Pos logit contributions
stood+2.141
 Os+1.809
Pos+1.800
 Stard+1.778
 Kak+1.742
Neg logit contributions
 AMER-2.525
heastern-2.517
uble-2.484
utz-2.435
 rug-2.382
Token the
Token ID262
Feature activation+2.27e-01

Pos logit contributions
stood+0.496
Pos+0.430
 Os+0.424
 Stard+0.424
osit+0.416
Neg logit contributions
 AMER-0.494
heastern-0.489
uble-0.479
utz-0.471
 rug-0.465
Token following
Token ID1708
Feature activation+1.58e+00

Pos logit contributions
stood+0.831
Pos+0.719
 Os+0.714
 Stard+0.712
osit+0.700
Neg logit contributions
 AMER-0.806
heastern-0.799
uble-0.780
utz-0.769
 rug-0.750
Token story
Token ID1621
Feature activation-4.44e-02

Pos logit contributions
stood+5.134
 Os+4.395
 eval+4.352
Pos+4.330
 Stard+4.292
Neg logit contributions
heastern-6.136
 AMER-5.883
uble-5.867
utz-5.741
heast-5.680
Token does
Token ID857
Feature activation+2.85e-01

Pos logit contributions
 AMER+0.175
heastern+0.172
uble+0.169
utz+0.167
 rug+0.164
Neg logit contributions
stood-0.147
Pos-0.126
 Stard-0.123
 Os-0.123
osit-0.121
Token not
Token ID407
Feature activation-9.53e-02

Pos logit contributions
stood+0.999
Pos+0.882
 Os+0.874
 Stard+0.863
 eval+0.849
Neg logit contributions
heastern-1.038
 AMER-1.036
utz-1.014
uble-1.006
 rug-0.970
Token sound
Token ID2128
Feature activation+4.60e-01

Pos logit contributions
 AMER+0.312
heastern+0.308
uble+0.298
utz+0.297
 rug+0.286
Neg logit contributions
stood-0.376
Pos-0.333
 Stard-0.328
 Os-0.327
osit-0.323
Token like
Token ID588
Feature activation+8.17e-01

Pos logit contributions
stood+1.514
Pos+1.295
 Os+1.272
 Stard+1.270
 eval+1.240
Neg logit contributions
heastern-1.799
 AMER-1.795
uble-1.758
utz-1.742
 rug-1.686
Token and
Token ID290
Feature activation+1.55e-01

Pos logit contributions
stood+1.494
Pos+1.301
 Stard+1.300
 Os+1.288
 Kak+1.273
Neg logit contributions
heastern-1.773
 AMER-1.766
uble-1.749
utz-1.704
 rug-1.682
Token international
Token ID3230
Feature activation+2.76e-01

Pos logit contributions
stood+0.530
Pos+0.464
 Os+0.457
 Stard+0.455
osit+0.450
Neg logit contributions
 AMER-0.574
heastern-0.568
uble-0.565
utz-0.557
 rug-0.550
Token treaties
Token ID30734
Feature activation+1.23e-02

Pos logit contributions
stood+1.041
Pos+0.905
 Os+0.900
 Stard+0.898
 eval+0.884
Neg logit contributions
heastern-0.936
 AMER-0.935
uble-0.910
utz-0.901
 rug-0.878
Token.
Token ID13
Feature activation+3.05e-01

Pos logit contributions
stood+0.039
Pos+0.033
 Stard+0.033
 Os+0.033
osit+0.032
Neg logit contributions
 AMER-0.050
heastern-0.049
uble-0.048
utz-0.048
 rug-0.047
Token Any
Token ID4377
Feature activation+3.60e-01

Pos logit contributions
stood+1.062
Pos+0.938
 Os+0.927
 Stard+0.916
osit+0.903
Neg logit contributions
heastern-1.127
 AMER-1.106
uble-1.094
utz-1.070
 rug-1.064
Token reproduction
Token ID20728
Feature activation+1.58e+00

Pos logit contributions
stood+1.194
Pos+1.037
 Os+1.014
 Stard+1.006
osit+1.005
Neg logit contributions
 AMER-1.400
heastern-1.377
uble-1.356
utz-1.330
 rug-1.313
Token,
Token ID11
Feature activation+6.07e-01

Pos logit contributions
stood+4.085
 Os+3.587
Pos+3.531
osit+3.407
 Stard+3.403
Neg logit contributions
 AMER-6.718
heastern-6.610
uble-6.576
utz-6.411
 rug-6.265
Token copying
Token ID23345
Feature activation+1.12e+00

Pos logit contributions
stood+1.827
Pos+1.571
 Os+1.556
 Stard+1.537
osit+1.529
Neg logit contributions
 AMER-2.523
heastern-2.486
uble-2.421
utz-2.403
 rug-2.361
Token,
Token ID11
Feature activation+1.06e+00

Pos logit contributions
stood+2.625
Pos+2.205
 Os+2.190
 Stard+2.105
osit+2.099
Neg logit contributions
 AMER-5.292
heastern-5.213
uble-5.108
utz-4.996
 rug-4.933
Token or
Token ID393
Feature activation+6.64e-01

Pos logit contributions
stood+3.167
 Os+2.752
Pos+2.750
 Stard+2.685
osit+2.670
Neg logit contributions
 AMER-4.344
heastern-4.292
utz-4.138
uble-4.120
 rug-4.068
Token redistribution
Token ID41425
Feature activation+1.24e+00

Pos logit contributions
stood+2.152
Pos+1.878
 Os+1.876
 Stard+1.840
osit+1.835
Neg logit contributions
 AMER-2.598
heastern-2.546
uble-2.462
utz-2.452
 rug-2.435
Token interested
Token ID4609
Feature activation-2.23e-01

Pos logit contributions
stood+0.860
Pos+0.747
 Stard+0.733
 Os+0.730
osit+0.717
Neg logit contributions
 AMER-0.893
heastern-0.878
uble-0.862
utz-0.849
 rug-0.834
Token in
Token ID287
Feature activation+1.70e-01

Pos logit contributions
 AMER+0.833
heastern+0.812
uble+0.797
utz+0.790
 rug+0.779
Neg logit contributions
stood-0.786
Pos-0.684
 Stard-0.668
 Os-0.665
osit-0.659
Token characters
Token ID3435
Feature activation-1.94e-01

Pos logit contributions
stood+0.581
Pos+0.497
 Stard+0.493
 Os+0.490
osit+0.481
Neg logit contributions
 AMER-0.653
heastern-0.643
uble-0.630
utz-0.623
 rug-0.611
Token that
Token ID326
Feature activation+4.15e-02

Pos logit contributions
 AMER+0.742
heastern+0.725
uble+0.713
utz+0.704
 rug+0.694
Neg logit contributions
stood-0.664
Pos-0.574
 Stard-0.561
 Os-0.559
osit-0.553
Token start
Token ID923
Feature activation+5.83e-01

Pos logit contributions
stood+0.149
Pos+0.130
 Stard+0.127
 Os+0.127
osit+0.125
Neg logit contributions
 AMER-0.152
heastern-0.149
uble-0.146
utz-0.144
 rug-0.141
Token off
Token ID572
Feature activation+1.53e+00

Pos logit contributions
stood+2.270
Pos+2.011
 Stard+1.991
 Os+1.983
osit+1.949
Neg logit contributions
 AMER-1.921
heastern-1.903
uble-1.856
utz-1.823
 rug-1.749
Token one
Token ID530
Feature activation-1.18e-01

Pos logit contributions
stood+4.746
 Stard+4.094
 Os+4.091
Pos+4.025
 ageing+3.990
Neg logit contributions
heastern-5.940
 AMER-5.908
uble-5.839
utz-5.834
heast-5.519
Token way
Token ID835
Feature activation+2.64e-01

Pos logit contributions
 AMER+0.448
heastern+0.439
uble+0.431
utz+0.425
 rug+0.417
Neg logit contributions
stood-0.411
Pos-0.357
 Stard-0.349
 Os-0.348
osit-0.343
Token and
Token ID290
Feature activation+7.31e-03

Pos logit contributions
stood+0.781
Pos+0.658
 Os+0.645
 Stard+0.642
osit+0.628
Neg logit contributions
 AMER-1.126
heastern-1.102
uble-1.089
utz-1.085
 rug-1.053
Token end
Token ID886
Feature activation-5.98e-02

Pos logit contributions
stood+0.028
Pos+0.025
 Stard+0.024
 Os+0.024
osit+0.024
Neg logit contributions
 AMER-0.025
heastern-0.025
uble-0.024
utz-0.024
 rug-0.023
Token another
Token ID1194
Feature activation-1.04e-02

Pos logit contributions
 AMER+0.238
heastern+0.231
uble+0.228
utz+0.225
 rug+0.223
Neg logit contributions
stood-0.197
Pos-0.169
 Stard-0.165
 Os-0.164
osit-0.163
Token2
Token ID17
Feature activation+6.71e-01

Pos logit contributions
stood+2.390
Pos+2.101
osit+1.994
 Os+1.983
 Stard+1.957
Neg logit contributions
 AMER-3.047
heastern-2.940
uble-2.861
 rug-2.849
utz-2.808
Tokeng
Token ID70
Feature activation+6.33e-01

Pos logit contributions
stood+2.208
Pos+1.938
osit+1.850
 Os+1.841
 Stard+1.825
Neg logit contributions
 AMER-2.661
heastern-2.566
uble-2.501
 rug-2.485
utz-2.457
Token7
Token ID22
Feature activation+9.51e-01

Pos logit contributions
stood+1.953
Pos+1.700
osit+1.617
 Os+1.612
 Stard+1.598
Neg logit contributions
 AMER-2.633
heastern-2.551
uble-2.489
 rug-2.471
utz-2.446
Tokenp
Token ID79
Feature activation+9.26e-01

Pos logit contributions
stood+3.109
Pos+2.752
osit+2.619
 Os+2.601
 Stard+2.559
Neg logit contributions
 AMER-3.754
heastern-3.609
uble-3.502
 rug-3.491
utz-3.425
Tokenk
Token ID74
Feature activation+1.12e+00

Pos logit contributions
stood+2.832
Pos+2.501
osit+2.373
 Os+2.349
 Stard+2.308
Neg logit contributions
 AMER-3.844
heastern-3.715
 rug-3.600
uble-3.599
utz-3.540
TokenY
Token ID56
Feature activation+1.51e+00

Pos logit contributions
stood+3.488
Pos+3.098
osit+2.932
 Os+2.903
SCP+2.838
Neg logit contributions
 AMER-4.558
heastern-4.387
 rug-4.252
uble-4.234
utz-4.153
Tokensv
Token ID21370
Feature activation+1.59e+00

Pos logit contributions
stood+4.838
Pos+4.354
osit+4.115
 Os+4.046
SCP+3.996
Neg logit contributions
 AMER-5.911
heastern-5.620
 rug-5.489
uble-5.428
utz-5.292
TokenM
Token ID44
Feature activation+1.23e+00

Pos logit contributions
stood+4.917
Pos+4.442
osit+4.189
 Os+4.118
SCP+4.089
Neg logit contributions
 AMER-6.302
heastern-6.027
 rug-5.863
uble-5.791
utz-5.666
Tokenbo
Token ID2127
Feature activation+8.18e-01

Pos logit contributions
stood+3.965
Pos+3.531
osit+3.380
 Os+3.323
 Stard+3.247
Neg logit contributions
 AMER-4.830
heastern-4.635
 rug-4.492
uble-4.468
utz-4.389
Token]
Token ID60
Feature activation+2.13e-01

Pos logit contributions
stood+2.887
Pos+2.594
 Os+2.553
 Stard+2.478
osit+2.474
Neg logit contributions
 AMER-2.924
heastern-2.916
uble-2.822
utz-2.802
 rug-2.783
Token<|endoftext|>
Token ID50256
Feature activation-2.11e-01

Pos logit contributions
stood+0.747
Pos+0.658
 Os+0.637
 Stard+0.634
osit+0.628
Neg logit contributions
 AMER-0.788
heastern-0.776
uble-0.762
utz-0.752
 rug-0.747
Token take
Token ID1011
Feature activation+4.69e-01

Pos logit contributions
stood+1.763
Pos+1.537
 Os+1.514
 Stard+1.511
osit+1.488
Neg logit contributions
 AMER-1.628
heastern-1.603
uble-1.562
utz-1.540
 rug-1.502
Token a
Token ID257
Feature activation+3.35e-01

Pos logit contributions
stood+1.633
Pos+1.405
 Os+1.384
 Stard+1.379
osit+1.359
Neg logit contributions
 AMER-1.770
heastern-1.745
uble-1.713
utz-1.683
 rug-1.646
Token rest
Token ID1334
Feature activation+5.25e-01

Pos logit contributions
stood+1.298
Pos+1.138
 Os+1.117
 Stard+1.116
osit+1.097
Neg logit contributions
 AMER-1.129
heastern-1.107
uble-1.089
utz-1.077
 rug-1.025
Token.
Token ID13
Feature activation+3.03e-01

Pos logit contributions
stood+1.763
Pos+1.520
 Os+1.482
osit+1.469
 Stard+1.465
Neg logit contributions
 AMER-2.040
heastern-1.991
uble-1.947
utz-1.910
 rug-1.886
Token In
Token ID554
Feature activation+4.06e-01

Pos logit contributions
stood+1.097
Pos+0.958
 Stard+0.940
 Os+0.938
osit+0.925
Neg logit contributions
 AMER-1.101
heastern-1.086
uble-1.062
utz-1.045
 rug-1.035
Token bed
Token ID3996
Feature activation+1.47e+00

Pos logit contributions
stood+1.463
Pos+1.258
 Stard+1.238
 Os+1.236
osit+1.231
Neg logit contributions
 AMER-1.491
heastern-1.463
uble-1.435
utz-1.404
 rug-1.391
Token.
Token ID13
Feature activation+2.55e-01

Pos logit contributions
stood+4.410
Pos+3.720
 Os+3.694
 Kak+3.627
 Stard+3.623
Neg logit contributions
 AMER-5.965
heastern-5.911
uble-5.724
utz-5.594
heast-5.526
Token Oh
Token ID3966
Feature activation+4.83e-01

Pos logit contributions
stood+0.927
Pos+0.811
 Stard+0.795
 Os+0.793
osit+0.783
Neg logit contributions
 AMER-0.922
heastern-0.907
uble-0.888
utz-0.874
 rug-0.864
Token god
Token ID5770
Feature activation-9.49e-02

Pos logit contributions
stood+1.519
Pos+1.268
 Os+1.244
 Stard+1.238
osit+1.209
Neg logit contributions
 AMER-2.001
heastern-1.965
uble-1.925
utz-1.893
 rug-1.872
Token."
Token ID526
Feature activation+5.79e-02

Pos logit contributions
 AMER+0.390
heastern+0.382
uble+0.376
utz+0.371
 rug+0.367
Neg logit contributions
stood-0.299
Pos-0.254
 Stard-0.247
 Os-0.246
osit-0.244
Token She
Token ID1375
Feature activation+1.51e-01

Pos logit contributions
stood+0.202
Pos+0.177
 Stard+0.174
 Os+0.174
osit+0.170
Neg logit contributions
 AMER-0.217
heastern-0.214
uble-0.210
utz-0.207
 rug-0.205
Token at
Token ID379
Feature activation-1.16e-01

Pos logit contributions
 AMER+1.862
heastern+1.815
uble+1.785
utz+1.763
 rug+1.745
Neg logit contributions
stood-1.712
Pos-1.483
 Stard-1.446
 Os-1.437
osit-1.431
Token ASUS
Token ID38100
Feature activation-7.68e-01

Pos logit contributions
 AMER+0.436
heastern+0.430
uble+0.423
utz+0.416
 rug+0.410
Neg logit contributions
stood-0.407
Pos-0.352
 Stard-0.350
 Os-0.349
osit-0.343
Token R
Token ID371
Feature activation-1.74e-01

Pos logit contributions
 AMER+2.874
heastern+2.804
uble+2.754
utz+2.719
 rug+2.693
Neg logit contributions
stood-2.694
Pos-2.342
 Stard-2.288
 Os-2.276
osit-2.261
TokenOG
Token ID7730
Feature activation-1.74e-01

Pos logit contributions
 AMER+0.621
heastern+0.607
uble+0.596
utz+0.591
 rug+0.580
Neg logit contributions
stood-0.643
Pos-0.558
 Stard-0.550
 Os-0.542
osit-0.539
Token just
Token ID655
Feature activation+3.13e-01

Pos logit contributions
 AMER+0.653
heastern+0.645
uble+0.632
utz+0.623
 rug+0.612
Neg logit contributions
stood-0.600
 Stard-0.530
Pos-0.525
 Os-0.519
osit-0.510
Token over
Token ID625
Feature activation+1.45e+00

Pos logit contributions
stood+1.005
Pos+0.863
 Os+0.851
 Stard+0.846
osit+0.833
Neg logit contributions
 AMER-1.251
heastern-1.226
uble-1.206
utz-1.197
 rug-1.177
Token five
Token ID1936
Feature activation-1.43e-01

Pos logit contributions
stood+3.928
 Os+3.326
 Stard+3.321
Pos+3.199
 Kak+3.178
Neg logit contributions
heastern-6.254
 AMER-6.228
utz-6.110
uble-6.103
 rug-5.938
Token weeks
Token ID2745
Feature activation-7.48e-02

Pos logit contributions
 AMER+0.521
heastern+0.510
uble+0.501
utz+0.494
 rug+0.486
Neg logit contributions
stood-0.515
Pos-0.450
 Stard-0.440
 Os-0.439
osit-0.434
Token ago
Token ID2084
Feature activation-1.18e-01

Pos logit contributions
 AMER+0.259
heastern+0.253
uble+0.247
utz+0.245
 rug+0.239
Neg logit contributions
stood-0.282
Pos-0.250
 Stard-0.244
 Os-0.243
osit-0.240
Token.
Token ID13
Feature activation+1.94e-01

Pos logit contributions
 AMER+0.457
heastern+0.448
uble+0.438
utz+0.436
 rug+0.430
Neg logit contributions
stood-0.397
Pos-0.344
 Stard-0.338
 Os-0.336
osit-0.332
Token Then
Token ID3244
Feature activation+6.61e-01

Pos logit contributions
stood+0.724
Pos+0.635
 Stard+0.622
 Os+0.619
osit+0.615
Neg logit contributions
 AMER-0.684
heastern-0.670
uble-0.657
utz-0.649
 rug-0.640
Token
Token ID251
Feature activation-4.99e-02

Pos logit contributions
stood+0.209
Pos+0.183
 Os+0.178
 Stard+0.177
osit+0.176
Neg logit contributions
 AMER-0.229
heastern-0.222
uble-0.218
utz-0.216
 rug-0.213
Token\n
Token ID198
Feature activation+2.22e-01

Pos logit contributions
 AMER+0.183
heastern+0.180
uble+0.176
utz+0.174
 rug+0.172
Neg logit contributions
stood-0.178
Pos-0.156
 Stard-0.152
 Os-0.152
osit-0.150
Token\n
Token ID198
Feature activation+2.52e-01

Pos logit contributions
stood+0.776
Pos+0.678
 Os+0.657
 Stard+0.655
osit+0.651
Neg logit contributions
 AMER-0.842
heastern-0.827
uble-0.804
utz-0.793
 rug-0.791
TokenF
Token ID37
Feature activation-1.14e-01

Pos logit contributions
stood+0.834
Pos+0.724
 Stard+0.697
 Os+0.695
osit+0.692
Neg logit contributions
 AMER-0.998
heastern-0.977
uble-0.958
utz-0.946
 rug-0.940
Tokenears
Token ID4127
Feature activation+1.62e-01

Pos logit contributions
 AMER+0.493
heastern+0.482
uble+0.475
utz+0.471
 rug+0.465
Neg logit contributions
stood-0.336
Pos-0.283
 Stard-0.275
 Os-0.273
osit-0.271
Token over
Token ID625
Feature activation+1.45e+00

Pos logit contributions
stood+0.566
Pos+0.491
 Stard+0.479
 Os+0.478
osit+0.472
Neg logit contributions
 AMER-0.612
heastern-0.596
uble-0.584
utz-0.577
 rug-0.572
Token Melbourne
Token ID14819
Feature activation+1.59e-01

Pos logit contributions
stood+4.718
 Os+4.098
Pos+4.064
 ageing+4.017
 Stard+4.006
Neg logit contributions
heastern-5.568
 AMER-5.507
uble-5.407
utz-5.278
 rug-5.154
Token fixes
Token ID13040
Feature activation+6.56e-01

Pos logit contributions
stood+0.552
Pos+0.482
 Stard+0.471
 Os+0.469
osit+0.461
Neg logit contributions
 AMER-0.596
heastern-0.586
uble-0.575
utz-0.568
 rug-0.557
Token\n
Token ID198
Feature activation+3.11e-01

Pos logit contributions
stood+2.264
Pos+1.961
 Os+1.928
 Stard+1.917
osit+1.864
Neg logit contributions
 AMER-2.480
heastern-2.457
uble-2.396
utz-2.369
 rug-2.311
Token\n
Token ID198
Feature activation+2.49e-01

Pos logit contributions
stood+1.081
Pos+0.946
 Os+0.916
 Stard+0.915
osit+0.904
Neg logit contributions
 AMER-1.178
heastern-1.159
uble-1.128
utz-1.111
 rug-1.105
TokenWhen
Token ID2215
Feature activation+3.69e-01

Pos logit contributions
stood+0.828
Pos+0.720
 Stard+0.693
 Os+0.691
osit+0.687
Neg logit contributions
 AMER-0.981
heastern-0.960
uble-0.942
utz-0.930
 rug-0.924
Token latest
Token ID3452
Feature activation-4.00e-01

Pos logit contributions
stood+0.738
Pos+0.639
 Stard+0.632
 Os+0.630
osit+0.618
Neg logit contributions
 AMER-0.816
heastern-0.807
uble-0.791
utz-0.782
 rug-0.766
Token C
Token ID327
Feature activation+4.10e-01

Pos logit contributions
 AMER+1.597
heastern+1.551
uble+1.527
utz+1.512
 rug+1.502
Neg logit contributions
stood-1.312
Pos-1.125
 Stard-1.088
osit-1.084
 Os-1.079
Tokenrave
Token ID5758
Feature activation+7.83e-01

Pos logit contributions
stood+0.885
Pos+0.701
osit+0.666
 Stard+0.656
 Os+0.652
Neg logit contributions
 AMER-2.100
heastern-2.037
uble-2.006
 rug-1.993
utz-1.989
TokenCast
Token ID19248
Feature activation+7.32e-01

Pos logit contributions
stood+2.686
Pos+2.387
 Stard+2.361
 Os+2.348
osit+2.303
Neg logit contributions
 AMER-2.880
heastern-2.833
uble-2.779
utz-2.734
 rug-2.681
Token podcast
Token ID9905
Feature activation+6.53e-01

Pos logit contributions
stood+2.457
Pos+2.166
 Stard+2.140
 Os+2.124
osit+2.081
Neg logit contributions
 AMER-2.778
heastern-2.715
uble-2.677
utz-2.626
 rug-2.598
Token here
Token ID994
Feature activation+1.44e+00

Pos logit contributions
stood+2.097
Pos+1.820
 Stard+1.811
 Os+1.803
osit+1.757
Neg logit contributions
 AMER-2.572
heastern-2.548
uble-2.478
utz-2.460
 rug-2.401
Token!
Token ID0
Feature activation+3.15e-01

Pos logit contributions
stood+4.255
 Os+3.690
 Stard+3.638
Pos+3.633
 Kak+3.490
Neg logit contributions
 AMER-5.763
heastern-5.691
uble-5.518
utz-5.492
 rug-5.390
Token\n
Token ID198
Feature activation-1.26e-01

Pos logit contributions
stood+1.112
Pos+0.980
 Stard+0.967
 Os+0.964
osit+0.948
Neg logit contributions
 AMER-1.154
heastern-1.141
uble-1.112
utz-1.100
 rug-1.085
Token\n
Token ID198
Feature activation-1.32e-01

Pos logit contributions
 AMER+0.459
heastern+0.447
uble+0.439
utz+0.433
 rug+0.429
Neg logit contributions
stood-0.459
Pos-0.401
 Stard-0.391
 Os-0.389
osit-0.387
TokenC
Token ID34
Feature activation+4.48e-02

Pos logit contributions
 AMER+0.515
heastern+0.502
uble+0.494
utz+0.488
 rug+0.483
Neg logit contributions
stood-0.444
Pos-0.384
 Stard-0.373
 Os-0.371
osit-0.369
Tokenrave
Token ID5758
Feature activation+4.10e-01

Pos logit contributions
stood+0.103
Pos+0.083
osit+0.080
 Stard+0.079
 Os+0.078
Neg logit contributions
 AMER-0.222
heastern-0.217
uble-0.214
utz-0.212
 rug-0.211
Token want
Token ID765
Feature activation+1.97e-01

Pos logit contributions
stood+0.020
Pos+0.018
 Stard+0.018
 Os+0.018
osit+0.018
Neg logit contributions
 AMER-0.011
heastern-0.011
uble-0.011
utz-0.010
 rug-0.010
Token to
Token ID284
Feature activation+1.01e-02

Pos logit contributions
stood+0.718
Pos+0.623
 Stard+0.613
 Os+0.612
osit+0.609
Neg logit contributions
 AMER-0.711
heastern-0.693
uble-0.674
utz-0.670
 rug-0.664
Token per
Token ID583
Feature activation+2.10e-01

Pos logit contributions
stood+0.036
Pos+0.031
 Stard+0.030
 Os+0.030
osit+0.030
Neg logit contributions
 AMER-0.038
heastern-0.036
uble-0.036
utz-0.036
 rug-0.035
Tokenuse
Token ID1904
Feature activation-5.25e-01

Pos logit contributions
stood+0.627
Pos+0.522
osit+0.515
 Os+0.513
 Stard+0.507
Neg logit contributions
 AMER-0.894
heastern-0.893
 rug-0.855
uble-0.854
utz-0.851
Token.
Token ID13
Feature activation+3.45e-01

Pos logit contributions
 AMER+1.841
heastern+1.781
uble+1.744
utz+1.735
 rug+1.715
Neg logit contributions
stood-1.977
Pos-1.732
 Stard-1.691
osit-1.679
 Os-1.671
Token Here
Token ID3423
Feature activation+1.44e+00

Pos logit contributions
stood+1.316
Pos+1.157
 Stard+1.136
 Os+1.129
osit+1.121
Neg logit contributions
 AMER-1.178
heastern-1.150
uble-1.136
utz-1.114
 rug-1.102
Token is
Token ID318
Feature activation+2.74e-01

Pos logit contributions
stood+3.711
Pos+3.129
 Os+3.029
 Stard+2.999
osit+2.967
Neg logit contributions
 AMER-6.567
heastern-6.472
uble-6.370
utz-6.281
 rug-6.260
Token the
Token ID262
Feature activation+1.28e-01

Pos logit contributions
stood+0.992
Pos+0.865
 Stard+0.837
 Os+0.836
osit+0.834
Neg logit contributions
 AMER-0.999
heastern-0.969
uble-0.954
utz-0.940
 rug-0.930
Token link
Token ID2792
Feature activation+1.04e-01

Pos logit contributions
stood+0.449
Pos+0.387
 Stard+0.376
 Os+0.376
osit+0.376
Neg logit contributions
 AMER-0.483
heastern-0.470
uble-0.463
utz-0.454
 rug-0.452
Token for
Token ID329
Feature activation+2.40e-01

Pos logit contributions
stood+0.392
Pos+0.349
 Stard+0.343
 Os+0.341
osit+0.338
Neg logit contributions
 AMER-0.355
heastern-0.345
uble-0.340
utz-0.334
 rug-0.332
Token buying
Token ID7067
Feature activation+6.28e-01

Pos logit contributions
stood+0.871
Pos+0.762
 Os+0.735
osit+0.734
 Stard+0.730
Neg logit contributions
 AMER-0.880
heastern-0.852
uble-0.834
utz-0.828
 rug-0.817
Token the
Token ID262
Feature activation+2.47e-01

Pos logit contributions
stood+0.392
Pos+0.333
 Os+0.332
 Stard+0.330
osit+0.323
Neg logit contributions
 AMER-0.444
heastern-0.440
uble-0.429
utz-0.422
 rug-0.418
Token federal
Token ID2717
Feature activation+3.06e-02

Pos logit contributions
stood+0.916
Pos+0.794
 Os+0.789
 Stard+0.788
osit+0.774
Neg logit contributions
 AMER-0.869
heastern-0.858
uble-0.834
utz-0.826
 rug-0.807
Token law
Token ID1099
Feature activation-3.62e-01

Pos logit contributions
stood+0.122
Pos+0.108
 Stard+0.107
 Os+0.107
osit+0.105
Neg logit contributions
 AMER-0.099
heastern-0.097
uble-0.094
utz-0.093
 rug-0.091
Token,
Token ID11
Feature activation+1.08e-01

Pos logit contributions
 AMER+1.284
heastern+1.253
uble+1.230
utz+1.211
 rug+1.197
Neg logit contributions
stood-1.342
Pos-1.173
 Stard-1.149
 Os-1.145
osit-1.135
Token for
Token ID329
Feature activation+1.34e-01

Pos logit contributions
stood+0.390
Pos+0.337
 Os+0.333
 Stard+0.332
osit+0.323
Neg logit contributions
heastern-0.397
 AMER-0.393
uble-0.383
utz-0.375
 rug-0.371
Token proof
Token ID6617
Feature activation+1.43e+00

Pos logit contributions
stood+0.402
Pos+0.337
 Stard+0.328
 Os+0.327
osit+0.323
Neg logit contributions
 AMER-0.573
heastern-0.564
uble-0.553
utz-0.547
 rug-0.542
Token of
Token ID286
Feature activation+3.23e-01

Pos logit contributions
stood+4.605
Pos+3.988
 Stard+3.962
 Os+3.946
osit+3.873
Neg logit contributions
heastern-5.422
 AMER-5.422
uble-5.349
 rug-5.145
utz-5.127
Token this
Token ID428
Feature activation+3.87e-01

Pos logit contributions
stood+1.164
Pos+0.998
 Os+0.987
 Stard+0.984
osit+0.971
Neg logit contributions
 AMER-1.184
heastern-1.172
uble-1.143
utz-1.120
 rug-1.104
Token:
Token ID25
Feature activation+3.93e-01

Pos logit contributions
stood+1.288
Pos+1.110
 Stard+1.086
 Os+1.083
osit+1.071
Neg logit contributions
 AMER-1.504
heastern-1.488
uble-1.462
utz-1.428
 rug-1.408
Token "
Token ID366
Feature activation+3.03e-01

Pos logit contributions
stood+1.382
Pos+1.198
 Os+1.184
 Stard+1.174
 Lev+1.149
Neg logit contributions
heastern-1.457
 AMER-1.427
uble-1.409
utz-1.366
 rug-1.348
TokenDefense
Token ID27300
Feature activation+4.33e-01

Pos logit contributions
stood+1.062
Pos+0.936
 Stard+0.883
 Os+0.882
osit+0.873
Neg logit contributions
 AMER-1.138
heastern-1.128
uble-1.100
utz-1.075
 rug-1.068
TokenWrite
Token ID16594
Feature activation-7.33e-01

Pos logit contributions
 AMER+4.013
heastern+3.862
uble+3.794
utz+3.794
 rug+3.783
Neg logit contributions
stood-3.776
 Stard-3.139
Pos-3.115
osit-3.103
 Os-3.072
Token()
Token ID3419
Feature activation-5.13e-01

Pos logit contributions
 AMER+2.806
heastern+2.711
uble+2.704
utz+2.666
 rug+2.631
Neg logit contributions
stood-2.497
 Stard-2.084
Pos-2.071
osit-2.062
 Os-2.047
Token is
Token ID318
Feature activation+6.35e-01

Pos logit contributions
 AMER+1.992
heastern+1.938
uble+1.915
utz+1.891
 rug+1.871
Neg logit contributions
stood-1.729
Pos-1.475
 Stard-1.444
 Os-1.427
osit-1.427
Token only
Token ID691
Feature activation+2.29e-01

Pos logit contributions
stood+2.165
Pos+1.884
 Os+1.858
 Stard+1.857
osit+1.804
Neg logit contributions
 AMER-2.404
heastern-2.375
uble-2.316
utz-2.274
 rug-2.239
Token the
Token ID262
Feature activation+5.89e-01

Pos logit contributions
stood+0.777
Pos+0.672
 Stard+0.657
 Os+0.654
osit+0.645
Neg logit contributions
 AMER-0.884
heastern-0.865
uble-0.849
utz-0.839
 rug-0.829
Token beginning
Token ID3726
Feature activation+1.43e+00

Pos logit contributions
stood+2.150
Pos+1.867
 Stard+1.859
 Os+1.846
osit+1.815
Neg logit contributions
 AMER-2.092
heastern-2.068
uble-2.022
utz-2.007
 rug-1.934
Token.
Token ID13
Feature activation+1.80e-01

Pos logit contributions
stood+3.997
Pos+3.652
 Stard+3.575
 Os+3.516
osit+3.417
Neg logit contributions
 AMER-5.845
heastern-5.760
uble-5.619
utz-5.604
 rug-5.503
Token For
Token ID1114
Feature activation+1.05e-01

Pos logit contributions
stood+0.667
Pos+0.587
 Stard+0.576
 Os+0.573
osit+0.567
Neg logit contributions
 AMER-0.633
heastern-0.621
uble-0.607
utz-0.600
 rug-0.593
Token example
Token ID1672
Feature activation-5.77e-01

Pos logit contributions
stood+0.360
Pos+0.311
 Stard+0.304
 Os+0.302
osit+0.300
Neg logit contributions
 AMER-0.400
heastern-0.391
uble-0.384
utz-0.379
 rug-0.375
Token,
Token ID11
Feature activation-4.08e-01

Pos logit contributions
 AMER+2.411
heastern+2.351
uble+2.324
utz+2.294
 rug+2.280
Neg logit contributions
stood-1.772
Pos-1.491
osit-1.441
 Stard-1.439
 Os-1.431
Token this
Token ID428
Feature activation-2.37e-01

Pos logit contributions
 AMER+1.489
heastern+1.441
uble+1.422
utz+1.403
 rug+1.394
Neg logit contributions
stood-1.473
Pos-1.275
osit-1.240
 Stard-1.235
 Os-1.226
Token asc
Token ID10570
Feature activation+2.40e-01

Pos logit contributions
 AMER+0.085
heastern+0.083
uble+0.081
utz+0.080
 rug+0.079
Neg logit contributions
stood-0.108
Pos-0.096
 Stard-0.094
 Os-0.094
osit-0.093
Tokenension
Token ID3004
Feature activation+2.40e-01

Pos logit contributions
stood+1.066
Pos+0.967
osit+0.956
 Os+0.942
 Stard+0.940
Neg logit contributions
 AMER-0.672
heastern-0.623
uble-0.612
utz-0.607
 rug-0.606
Token.
Token ID13
Feature activation-8.18e-02

Pos logit contributions
stood+0.838
Pos+0.730
 Stard+0.713
 Os+0.710
osit+0.705
Neg logit contributions
 AMER-0.902
heastern-0.881
uble-0.864
utz-0.854
 rug-0.840
Token\n
Token ID198
Feature activation+1.06e-01

Pos logit contributions
 AMER+0.289
heastern+0.280
uble+0.276
utz+0.272
 rug+0.269
Neg logit contributions
stood-0.305
Pos-0.266
 Stard-0.260
 Os-0.258
osit-0.258
Token\n
Token ID198
Feature activation+2.16e-01

Pos logit contributions
stood+0.376
Pos+0.329
 Os+0.319
 Stard+0.319
osit+0.316
Neg logit contributions
 AMER-0.395
heastern-0.387
uble-0.377
utz-0.372
 rug-0.370
TokenGeorge
Token ID20191
Feature activation+1.43e+00

Pos logit contributions
stood+0.698
Pos+0.606
 Stard+0.580
 Os+0.576
osit+0.576
Neg logit contributions
 AMER-0.876
heastern-0.858
uble-0.840
utz-0.828
 rug-0.826
Token\n
Token ID198
Feature activation+8.33e-01

Pos logit contributions
stood+4.311
 Os+3.928
Pos+3.884
 Stard+3.875
 Kak+3.702
Neg logit contributions
 AMER-5.552
heastern-5.451
uble-5.388
utz-5.301
 rug-5.226
Token\n
Token ID198
Feature activation+6.96e-01

Pos logit contributions
stood+2.629
Pos+2.293
 Os+2.206
osit+2.170
 Stard+2.163
Neg logit contributions
 AMER-3.406
heastern-3.389
uble-3.253
 rug-3.224
utz-3.194
Token……
Token ID7398
Feature activation+3.51e-01

Pos logit contributions
stood+2.819
Pos+2.567
osit+2.417
 Os+2.411
 Stard+2.406
Neg logit contributions
 AMER-2.255
heastern-2.186
uble-2.114
 rug-2.083
utz-2.069
Token.
Token ID13
Feature activation+8.22e-01

Pos logit contributions
stood+1.321
Pos+1.169
 Os+1.134
 Stard+1.130
osit+1.124
Neg logit contributions
 AMER-1.233
heastern-1.188
uble-1.173
utz-1.157
 rug-1.153
Token\n
Token ID198
Feature activation+5.88e-01

Pos logit contributions
stood+2.689
Pos+2.380
 Os+2.299
 Stard+2.280
osit+2.270
Neg logit contributions
 AMER-3.221
heastern-3.188
uble-3.106
 rug-3.060
utz-3.041
Token<|endoftext|>
Token ID50256
Feature activation-3.04e-01

Pos logit contributions
 AMER+0.143
heastern+0.139
uble+0.138
utz+0.136
 rug+0.134
Neg logit contributions
stood-0.135
Pos-0.118
 Os-0.114
 Stard-0.114
osit-0.114
TokenImage
Token ID5159
Feature activation-6.09e-01

Pos logit contributions
 AMER+1.192
heastern+1.163
uble+1.143
utz+1.128
 rug+1.120
Neg logit contributions
stood-1.019
Pos-0.883
 Stard-0.853
 Os-0.848
osit-0.846
Token copyright
Token ID6634
Feature activation-2.02e+00

Pos logit contributions
 AMER+1.923
heastern+1.864
uble+1.827
utz+1.799
 rug+1.779
Neg logit contributions
stood-2.495
Pos-2.212
 Stard-2.163
 Os-2.154
osit-2.150
Token ESA
Token ID42212
Feature activation-7.78e-01

Pos logit contributions
 AMER+6.997
heastern+6.552
uble+6.424
utz+6.386
 rug+6.278
Neg logit contributions
stood-7.329
Pos-6.433
osit-6.124
 Stard-6.045
 ageing-6.002
Token Image
Token ID7412
Feature activation-6.07e-01

Pos logit contributions
 AMER+2.866
heastern+2.790
uble+2.740
utz+2.706
 rug+2.675
Neg logit contributions
stood-2.777
Pos-2.414
 Stard-2.352
 Os-2.340
osit-2.334
Token caption
Token ID8305
Feature activation-2.04e+00

Pos logit contributions
 AMER+1.755
heastern+1.700
uble+1.666
utz+1.648
 rug+1.620
Neg logit contributions
stood-2.628
Pos-2.333
 Stard-2.294
osit-2.282
 Os-2.279
Token Go
Token ID1514
Feature activation-3.40e-01

Pos logit contributions
 AMER+7.745
heastern+7.248
uble+7.150
utz+7.018
 rug+6.940
Neg logit contributions
stood-6.561
Pos-5.599
 eval-5.410
osit-5.390
 ageing-5.390
Tokence
Token ID344
Feature activation+4.71e-02

Pos logit contributions
 AMER+1.327
heastern+1.290
uble+1.270
utz+1.258
 rug+1.245
Neg logit contributions
stood-1.141
Pos-0.988
 Stard-0.963
 Os-0.957
osit-0.953
Token uses
Token ID3544
Feature activation+1.86e-01

Pos logit contributions
stood+0.158
Pos+0.135
 Os+0.133
 Stard+0.133
osit+0.131
Neg logit contributions
 AMER-0.184
heastern-0.180
uble-0.176
utz-0.174
 rug-0.173
Token an
Token ID281
Feature activation+2.52e-01

Pos logit contributions
stood+0.613
Pos+0.520
 Stard+0.515
 Os+0.513
osit+0.505
Neg logit contributions
 AMER-0.737
heastern-0.722
uble-0.710
utz-0.701
 rug-0.691
Token electric
Token ID5186
Feature activation+4.99e-02

Pos logit contributions
stood+0.862
 Os+0.736
Pos+0.736
 Stard+0.731
osit+0.719
Neg logit contributions
 AMER-0.959
heastern-0.945
uble-0.930
utz-0.911
 rug-0.903
Token.
Token ID13
Feature activation+1.11e-01

Pos logit contributions
 AMER+1.105
heastern+1.079
uble+1.061
utz+1.048
 rug+1.038
Neg logit contributions
stood-0.947
Pos-0.817
 Stard-0.795
 Os-0.791
osit-0.786
Token
Token ID447
Feature activation-3.10e-02

Pos logit contributions
stood+0.360
Pos+0.312
 Stard+0.304
 Os+0.302
osit+0.299
Neg logit contributions
 AMER-0.439
heastern-0.434
uble-0.426
utz-0.420
 rug-0.417
Token
Token ID251
Feature activation+3.42e-02

Pos logit contributions
 AMER+0.121
heastern+0.118
uble+0.116
utz+0.115
 rug+0.114
Neg logit contributions
stood-0.104
Pos-0.090
 Stard-0.087
 Os-0.087
osit-0.086
Token<|endoftext|>
Token ID50256
Feature activation-2.07e-01

Pos logit contributions
stood+0.121
Pos+0.106
 Stard+0.104
 Os+0.104
osit+0.102
Neg logit contributions
 AMER-0.126
heastern-0.124
uble-0.122
utz-0.120
 rug-0.119
TokenImage
Token ID5159
Feature activation-5.96e-01

Pos logit contributions
 AMER+0.811
heastern+0.792
uble+0.778
utz+0.767
 rug+0.763
Neg logit contributions
stood-0.692
Pos-0.602
 Stard-0.578
 Os-0.575
osit-0.574
Token copyright
Token ID6634
Feature activation-2.03e+00

Pos logit contributions
 AMER+1.879
heastern+1.822
uble+1.786
utz+1.758
 rug+1.738
Neg logit contributions
stood-2.444
Pos-2.166
 Stard-2.119
 Os-2.110
osit-2.106
Token publicity
Token ID19403
Feature activation-6.80e-01

Pos logit contributions
 AMER+7.061
heastern+6.605
uble+6.473
utz+6.437
 rug+6.325
Neg logit contributions
stood-7.354
Pos-6.449
osit-6.137
 Stard-6.054
 ageing-6.016
Token Image
Token ID7412
Feature activation-4.73e-01

Pos logit contributions
 AMER+2.536
heastern+2.472
uble+2.429
utz+2.400
 rug+2.374
Neg logit contributions
stood-2.395
Pos-2.081
 Stard-2.030
 Os-2.019
osit-2.010
Token caption
Token ID8305
Feature activation-1.92e+00

Pos logit contributions
 AMER+1.405
heastern+1.359
uble+1.337
utz+1.318
 rug+1.299
Neg logit contributions
stood-2.014
Pos-1.792
 Stard-1.758
 Os-1.748
osit-1.746
Token The
Token ID383
Feature activation-1.85e-01

Pos logit contributions
 AMER+7.574
heastern+7.038
uble+6.988
utz+6.966
 rug+6.869
Neg logit contributions
stood-6.052
Pos-5.187
osit-4.965
 eval-4.874
 ageing-4.840
Token Greeks
Token ID25059
Feature activation+3.78e-02

Pos logit contributions
 AMER+0.695
heastern+0.681
uble+0.668
utz+0.661
 rug+0.651
Neg logit contributions
stood-0.647
Pos-0.560
 Stard-0.550
 Os-0.548
osit-0.542
Token than
Token ID621
Feature activation-1.03e-01

Pos logit contributions
 AMER+0.310
heastern+0.306
uble+0.300
utz+0.295
 rug+0.288
Neg logit contributions
stood-0.246
Pos-0.217
 Stard-0.210
 Os-0.209
 Kak-0.203
Token 10
Token ID838
Feature activation-3.59e-01

Pos logit contributions
 AMER+0.382
heastern+0.375
uble+0.367
utz+0.362
 rug+0.355
Neg logit contributions
stood-0.364
Pos-0.317
 Stard-0.310
 Os-0.308
osit-0.306
Token years
Token ID812
Feature activation-4.66e-01

Pos logit contributions
 AMER+1.260
heastern+1.221
uble+1.202
utz+1.189
 rug+1.180
Neg logit contributions
stood-1.346
Pos-1.175
 Stard-1.150
 Os-1.143
osit-1.143
Token in
Token ID287
Feature activation-8.07e-02

Pos logit contributions
 AMER+1.707
heastern+1.662
uble+1.633
utz+1.613
 rug+1.596
Neg logit contributions
stood-1.676
Pos-1.461
 Stard-1.426
 Os-1.418
osit-1.412
Token prison
Token ID3770
Feature activation+5.55e-01

Pos logit contributions
heastern+0.315
 AMER+0.314
uble+0.303
utz+0.290
 rug+0.286
Neg logit contributions
stood-0.271
Pos-0.235
 Os-0.234
 Stard-0.227
 Kak-0.224
Token MORE
Token ID12011
Feature activation-2.02e+00

Pos logit contributions
stood+2.012
Pos+1.803
 Os+1.779
 Stard+1.733
osit+1.727
Neg logit contributions
heastern-1.959
 AMER-1.935
uble-1.931
utz-1.822
 rug-1.819
Token,
Token ID11
Feature activation-4.92e-01

Pos logit contributions
 AMER+6.864
heastern+6.529
uble+6.408
utz+6.359
 rug+6.283
Neg logit contributions
stood-7.573
Pos-6.500
osit-6.463
 Stard-6.380
 Os-6.142
Token in
Token ID287
Feature activation-2.99e-01

Pos logit contributions
 AMER+1.911
heastern+1.857
uble+1.828
utz+1.808
 rug+1.794
Neg logit contributions
stood-1.652
Pos-1.435
 Stard-1.390
osit-1.386
 Os-1.374
Token his
Token ID465
Feature activation-2.74e-01

Pos logit contributions
 AMER+1.221
heastern+1.186
uble+1.171
utz+1.160
 rug+1.148
Neg logit contributions
stood-0.947
Pos-0.812
 Stard-0.784
 Os-0.779
osit-0.778
Token weekly
Token ID10273
Feature activation+9.26e-02

Pos logit contributions
 AMER+1.001
heastern+0.970
uble+0.955
utz+0.945
 rug+0.935
Neg logit contributions
stood-0.988
Pos-0.865
 Stard-0.838
osit-0.834
 Os-0.832
Token column
Token ID5721
Feature activation-3.22e-01

Pos logit contributions
stood+0.315
Pos+0.271
 Stard+0.265
 Os+0.264
osit+0.261
Neg logit contributions
 AMER-0.357
heastern-0.350
uble-0.344
utz-0.339
 rug-0.335
Token<|endoftext|>
Token ID50256
Feature activation-2.52e-01

Pos logit contributions
stood+1.209
Pos+1.072
 Os+1.050
 Stard+1.044
osit+1.026
Neg logit contributions
 AMER-1.237
heastern-1.228
uble-1.208
utz-1.180
 rug-1.166
TokenImage
Token ID5159
Feature activation-5.68e-01

Pos logit contributions
 AMER+0.988
heastern+0.965
uble+0.948
utz+0.935
 rug+0.929
Neg logit contributions
stood-0.844
Pos-0.733
 Stard-0.707
 Os-0.702
osit-0.701
Token copyright
Token ID6634
Feature activation-1.98e+00

Pos logit contributions
 AMER+1.800
heastern+1.746
uble+1.711
utz+1.686
 rug+1.665
Neg logit contributions
stood-2.319
Pos-2.056
 Stard-2.012
 Os-2.004
osit-1.997
Token AP
Token ID3486
Feature activation-1.03e+00

Pos logit contributions
 AMER+6.905
heastern+6.473
uble+6.343
utz+6.302
 rug+6.197
Neg logit contributions
stood-7.171
Pos-6.286
osit-5.989
 Stard-5.915
 ageing-5.862
Token Image
Token ID7412
Feature activation-8.49e-01

Pos logit contributions
 AMER+3.817
heastern+3.714
uble+3.633
utz+3.599
 rug+3.549
Neg logit contributions
stood-3.677
Pos-3.168
 Stard-3.090
 Os-3.057
osit-3.054
Token caption
Token ID8305
Feature activation-2.02e+00

Pos logit contributions
 AMER+2.361
heastern+2.250
uble+2.240
utz+2.216
 rug+2.170
Neg logit contributions
stood-3.738
Pos-3.314
 Stard-3.276
osit-3.249
 Os-3.234
Token Peace
Token ID12689
Feature activation+6.98e-02

Pos logit contributions
 AMER+7.674
heastern+7.048
utz+6.996
uble+6.994
 rug+6.816
Neg logit contributions
stood-6.625
Pos-5.684
osit-5.473
 eval-5.422
 ageing-5.416
Token talks
Token ID6130
Feature activation-5.22e-01

Pos logit contributions
stood+0.222
Pos+0.189
 Os+0.184
 Stard+0.184
osit+0.181
Neg logit contributions
 AMER-0.286
heastern-0.280
uble-0.273
utz-0.271
 rug-0.267
Token between
Token ID1022
Feature activation-4.50e-01

Pos logit contributions
 AMER+1.831
heastern+1.775
uble+1.752
utz+1.733
 rug+1.712
Neg logit contributions
stood-1.946
Pos-1.706
 Stard-1.661
osit-1.658
 Os-1.651
Token the
Token ID262
Feature activation-7.09e-02

Pos logit contributions
 AMER+1.688
heastern+1.639
uble+1.614
utz+1.596
 rug+1.579
Neg logit contributions
stood-1.571
Pos-1.368
 Stard-1.329
osit-1.322
 Os-1.316
Token government
Token ID1230
Feature activation+4.56e-01

Pos logit contributions
 AMER+0.245
heastern+0.239
uble+0.234
utz+0.231
 rug+0.228
Neg logit contributions
stood-0.270
Pos-0.236
 Stard-0.232
 Os-0.231
osit-0.229
Token Serve
Token ID35557
Feature activation+5.88e-01

Pos logit contributions
 AMER+1.059
heastern+1.017
uble+0.988
utz+0.967
 rug+0.952
Neg logit contributions
stood-2.261
Pos-2.051
 Stard-2.017
 Os-2.010
osit-2.003
Token hot
Token ID3024
Feature activation-5.25e-01

Pos logit contributions
stood+2.625
Pos+2.349
 Os+2.318
osit+2.299
 Stard+2.295
Neg logit contributions
 AMER-1.612
heastern-1.557
uble-1.544
 rug-1.479
utz-1.478
Token!
Token ID0
Feature activation-3.84e-02

Pos logit contributions
 AMER+1.765
heastern+1.719
uble+1.683
utz+1.656
 rug+1.638
Neg logit contributions
stood-2.046
Pos-1.807
 Stard-1.767
 Os-1.760
osit-1.751
Token<|endoftext|>
Token ID50256
Feature activation-3.04e-01

Pos logit contributions
 AMER+0.143
heastern+0.139
uble+0.138
utz+0.136
 rug+0.134
Neg logit contributions
stood-0.135
Pos-0.118
 Os-0.114
 Stard-0.114
osit-0.114
TokenImage
Token ID5159
Feature activation-6.09e-01

Pos logit contributions
 AMER+1.192
heastern+1.163
uble+1.143
utz+1.128
 rug+1.120
Neg logit contributions
stood-1.019
Pos-0.883
 Stard-0.853
 Os-0.848
osit-0.846
Token copyright
Token ID6634
Feature activation-2.02e+00

Pos logit contributions
 AMER+1.923
heastern+1.864
uble+1.827
utz+1.799
 rug+1.779
Neg logit contributions
stood-2.495
Pos-2.212
 Stard-2.163
 Os-2.154
osit-2.150
Token ESA
Token ID42212
Feature activation-7.78e-01

Pos logit contributions
 AMER+6.997
heastern+6.552
uble+6.424
utz+6.386
 rug+6.278
Neg logit contributions
stood-7.329
Pos-6.433
osit-6.124
 Stard-6.045
 ageing-6.002
Token Image
Token ID7412
Feature activation-6.07e-01

Pos logit contributions
 AMER+2.866
heastern+2.790
uble+2.740
utz+2.706
 rug+2.675
Neg logit contributions
stood-2.777
Pos-2.414
 Stard-2.352
 Os-2.340
osit-2.334
Token caption
Token ID8305
Feature activation-2.04e+00

Pos logit contributions
 AMER+1.755
heastern+1.700
uble+1.666
utz+1.648
 rug+1.620
Neg logit contributions
stood-2.628
Pos-2.333
 Stard-2.294
osit-2.282
 Os-2.279
Token Go
Token ID1514
Feature activation-3.40e-01

Pos logit contributions
 AMER+7.745
heastern+7.248
uble+7.150
utz+7.018
 rug+6.940
Neg logit contributions
stood-6.561
Pos-5.599
 eval-5.410
osit-5.390
 ageing-5.390
Tokence
Token ID344
Feature activation+4.71e-02

Pos logit contributions
 AMER+1.327
heastern+1.290
uble+1.270
utz+1.258
 rug+1.245
Neg logit contributions
stood-1.141
Pos-0.988
 Stard-0.963
 Os-0.957
osit-0.953
Token.
Token ID13
Feature activation-2.69e-01

Pos logit contributions
 AMER+0.620
heastern+0.608
uble+0.592
utz+0.586
 rug+0.577
Neg logit contributions
stood-0.582
Pos-0.511
 Stard-0.499
 Os-0.497
osit-0.493
Token
Token ID447
Feature activation+3.92e-02

Pos logit contributions
 AMER+0.996
heastern+0.970
uble+0.954
utz+0.942
 rug+0.932
Neg logit contributions
stood-0.954
Pos-0.829
 Stard-0.809
 Os-0.805
osit-0.801
Token
Token ID251
Feature activation-2.73e-01

Pos logit contributions
stood+0.130
Pos+0.113
 Stard+0.109
 Os+0.109
osit+0.108
Neg logit contributions
 AMER-0.154
heastern-0.150
uble-0.147
utz-0.146
 rug-0.144
Token<|endoftext|>
Token ID50256
Feature activation-2.51e-01

Pos logit contributions
 AMER+0.981
heastern+0.962
uble+0.943
utz+0.930
 rug+0.919
Neg logit contributions
stood-0.997
Pos-0.876
 Stard-0.856
 Os-0.853
osit-0.844
TokenImage
Token ID5159
Feature activation-5.82e-01

Pos logit contributions
 AMER+0.987
heastern+0.963
uble+0.946
utz+0.934
 rug+0.928
Neg logit contributions
stood-0.840
Pos-0.729
 Stard-0.703
 Os-0.698
osit-0.697
Token copyright
Token ID6634
Feature activation-2.00e+00

Pos logit contributions
 AMER+1.844
heastern+1.789
uble+1.753
utz+1.727
 rug+1.706
Neg logit contributions
stood-2.375
Pos-2.104
 Stard-2.059
 Os-2.050
osit-2.045
Token Deadline
Token ID35954
Feature activation-2.02e-01

Pos logit contributions
 AMER+7.023
heastern+6.585
uble+6.456
utz+6.423
 rug+6.310
Neg logit contributions
stood-7.196
Pos-6.303
osit-6.000
 Stard-5.920
 ageing-5.888
Token\n
Token ID198
Feature activation-2.30e-01

Pos logit contributions
 AMER+0.778
heastern+0.761
uble+0.748
utz+0.738
 rug+0.736
Neg logit contributions
stood-0.683
Pos-0.597
 Os-0.582
 Stard-0.579
osit-0.573
Token
Token ID198
Feature activation-9.63e-02

Pos logit contributions
 AMER+0.819
heastern+0.794
uble+0.782
utz+0.772
 rug+0.763
Neg logit contributions
stood-0.848
Pos-0.739
 Stard-0.723
 Os-0.719
osit-0.718
TokenPolice
Token ID9039
Feature activation+3.78e-01

Pos logit contributions
 AMER+0.370
heastern+0.362
uble+0.356
utz+0.350
 rug+0.348
Neg logit contributions
stood-0.329
Pos-0.287
 Stard-0.277
 Os-0.276
osit-0.274
Token have
Token ID423
Feature activation-1.89e-02

Pos logit contributions
stood+1.295
Pos+1.127
 Stard+1.114
 Os+1.103
osit+1.077
Neg logit contributions
 AMER-1.434
heastern-1.406
uble-1.393
utz-1.362
 rug-1.343
Token million
Token ID1510
Feature activation+4.04e-01

Pos logit contributions
 AMER+0.551
heastern+0.538
uble+0.529
utz+0.522
 rug+0.517
Neg logit contributions
stood-0.504
Pos-0.437
 Stard-0.426
 Os-0.424
osit-0.422
Token years
Token ID812
Feature activation+1.31e-01

Pos logit contributions
stood+1.351
Pos+1.183
 Os+1.167
 Stard+1.152
osit+1.127
Neg logit contributions
 AMER-1.553
heastern-1.536
uble-1.496
utz-1.491
 rug-1.463
Token.
Token ID13
Feature activation+1.94e-01

Pos logit contributions
stood+0.451
Pos+0.391
 Stard+0.383
 Os+0.382
osit+0.378
Neg logit contributions
 AMER-0.495
heastern-0.485
uble-0.475
utz-0.470
 rug-0.464
Token<|endoftext|>
Token ID50256
Feature activation-1.70e-01

Pos logit contributions
stood+0.707
Pos+0.621
 Stard+0.609
 Os+0.609
osit+0.601
Neg logit contributions
 AMER-0.694
heastern-0.684
uble-0.672
utz-0.661
 rug-0.653
TokenImage
Token ID5159
Feature activation-5.29e-01

Pos logit contributions
 AMER+0.666
heastern+0.650
uble+0.638
utz+0.629
 rug+0.626
Neg logit contributions
stood-0.569
Pos-0.496
 Stard-0.475
osit-0.473
 Os-0.473
Token copyright
Token ID6634
Feature activation-1.99e+00

Pos logit contributions
 AMER+1.681
heastern+1.631
uble+1.598
utz+1.575
 rug+1.555
Neg logit contributions
stood-2.157
Pos-1.912
 Stard-1.872
 Os-1.864
osit-1.857
Token AFP
Token ID8916
Feature activation-8.13e-01

Pos logit contributions
 AMER+6.915
heastern+6.475
uble+6.344
utz+6.308
 rug+6.198
Neg logit contributions
stood-7.234
Pos-6.355
osit-6.044
 Stard-5.976
 ageing-5.919
Token
Token ID198
Feature activation-3.84e-01

Pos logit contributions
 AMER+3.070
heastern+2.991
uble+2.937
utz+2.903
 rug+2.870
Neg logit contributions
stood-2.832
Pos-2.446
 Stard-2.386
 Os-2.369
osit-2.363
Token
Token ID198
Feature activation-3.40e-01

Pos logit contributions
 AMER+1.344
heastern+1.295
uble+1.278
utz+1.265
 rug+1.247
Neg logit contributions
stood-1.437
Pos-1.250
 Stard-1.227
osit-1.219
 Os-1.216
TokenThe
Token ID464
Feature activation-1.30e-01

Pos logit contributions
 AMER+1.277
heastern+1.242
uble+1.222
utz+1.210
 rug+1.193
Neg logit contributions
stood-1.188
Pos-1.025
 Stard-1.007
 Os-1.001
osit-0.997
Token Bank
Token ID5018
Feature activation+4.26e-01

Pos logit contributions
 AMER+0.452
heastern+0.444
uble+0.434
utz+0.430
 rug+0.421
Neg logit contributions
stood-0.493
Pos-0.430
 Stard-0.425
 Os-0.425
osit-0.418
Tokenported
Token ID9213
Feature activation+1.80e-01

Pos logit contributions
stood+2.851
Pos+2.402
osit+2.354
 Stard+2.304
 Os+2.287
Neg logit contributions
 AMER-3.757
heastern-3.635
utz-3.570
uble-3.537
 rug-3.501
Token License
Token ID13789
Feature activation-2.80e-02

Pos logit contributions
stood+0.670
Pos+0.590
 Stard+0.579
 Os+0.578
osit+0.577
Neg logit contributions
 AMER-0.633
heastern-0.622
uble-0.610
utz-0.603
 rug-0.591
Token.
Token ID13
Feature activation+3.41e-01

Pos logit contributions
 AMER+0.105
heastern+0.103
uble+0.101
utz+0.099
 rug+0.098
Neg logit contributions
stood-0.098
Pos-0.085
 Stard-0.083
 Os-0.083
osit-0.082
Token<|endoftext|>
Token ID50256
Feature activation-2.52e-01

Pos logit contributions
stood+1.209
Pos+1.072
 Os+1.050
 Stard+1.044
osit+1.026
Neg logit contributions
 AMER-1.237
heastern-1.228
uble-1.208
utz-1.180
 rug-1.166
TokenImage
Token ID5159
Feature activation-5.68e-01

Pos logit contributions
 AMER+0.988
heastern+0.965
uble+0.948
utz+0.935
 rug+0.929
Neg logit contributions
stood-0.844
Pos-0.733
 Stard-0.707
 Os-0.702
osit-0.701
Token copyright
Token ID6634
Feature activation-1.98e+00

Pos logit contributions
 AMER+1.800
heastern+1.746
uble+1.711
utz+1.686
 rug+1.665
Neg logit contributions
stood-2.319
Pos-2.056
 Stard-2.012
 Os-2.004
osit-1.997
Token AP
Token ID3486
Feature activation-1.03e+00

Pos logit contributions
 AMER+6.905
heastern+6.473
uble+6.343
utz+6.302
 rug+6.197
Neg logit contributions
stood-7.171
Pos-6.286
osit-5.989
 Stard-5.915
 ageing-5.862
Token Image
Token ID7412
Feature activation-8.49e-01

Pos logit contributions
 AMER+3.817
heastern+3.714
uble+3.633
utz+3.599
 rug+3.549
Neg logit contributions
stood-3.677
Pos-3.168
 Stard-3.090
 Os-3.057
osit-3.054
Token caption
Token ID8305
Feature activation-2.02e+00

Pos logit contributions
 AMER+2.361
heastern+2.250
uble+2.240
utz+2.216
 rug+2.170
Neg logit contributions
stood-3.738
Pos-3.314
 Stard-3.276
osit-3.249
 Os-3.234
Token Peace
Token ID12689
Feature activation+6.98e-02

Pos logit contributions
 AMER+7.674
heastern+7.048
utz+6.996
uble+6.994
 rug+6.816
Neg logit contributions
stood-6.625
Pos-5.684
osit-5.473
 eval-5.422
 ageing-5.416
Token talks
Token ID6130
Feature activation-5.22e-01

Pos logit contributions
stood+0.222
Pos+0.189
 Os+0.184
 Stard+0.184
osit+0.181
Neg logit contributions
 AMER-0.286
heastern-0.280
uble-0.273
utz-0.271
 rug-0.267
Token media
Token ID2056
Feature activation-2.24e-01

Pos logit contributions
 AMER+0.077
heastern+0.075
uble+0.074
utz+0.073
 rug+0.072
Neg logit contributions
stood-0.079
Pos-0.069
 Stard-0.068
 Os-0.068
osit-0.067
Token reported
Token ID2098
Feature activation-8.95e-04

Pos logit contributions
 AMER+0.863
heastern+0.841
uble+0.827
utz+0.818
 rug+0.810
Neg logit contributions
stood-0.762
Pos-0.659
 Stard-0.641
 Os-0.638
osit-0.636
Token.
Token ID13
Feature activation+1.36e-01

Pos logit contributions
 AMER+0.004
heastern+0.003
uble+0.003
utz+0.003
 rug+0.003
Neg logit contributions
stood-0.003
Pos-0.003
 Os-0.003
 Stard-0.002
osit-0.002
Token<|endoftext|>
Token ID50256
Feature activation-3.31e-01

Pos logit contributions
stood+0.478
Pos+0.418
 Os+0.412
 Stard+0.410
osit+0.403
Neg logit contributions
 AMER-0.501
heastern-0.501
uble-0.494
utz-0.481
 rug-0.479
TokenImage
Token ID5159
Feature activation-6.15e-01

Pos logit contributions
 AMER+1.296
heastern+1.264
uble+1.243
utz+1.228
 rug+1.217
Neg logit contributions
stood-1.108
Pos-0.958
 Stard-0.928
 Os-0.923
osit-0.920
Token caption
Token ID8305
Feature activation-1.95e+00

Pos logit contributions
 AMER+1.947
heastern+1.888
uble+1.850
utz+1.822
 rug+1.802
Neg logit contributions
stood-2.513
Pos-2.226
 Stard-2.176
 Os-2.167
osit-2.164
Token Some
Token ID2773
Feature activation+2.28e-01

Pos logit contributions
 AMER+7.616
heastern+7.143
uble+7.046
utz+7.034
 rug+6.901
Neg logit contributions
stood-6.206
Pos-5.322
osit-5.078
 eval-5.011
 ageing-4.990
Token talked
Token ID6619
Feature activation+1.68e-01

Pos logit contributions
stood+0.806
Pos+0.696
 Os+0.689
 Stard+0.688
osit+0.674
Neg logit contributions
 AMER-0.841
heastern-0.826
uble-0.815
utz-0.804
 rug-0.787
Token of
Token ID286
Feature activation+2.62e-01

Pos logit contributions
stood+0.511
Pos+0.434
 Stard+0.423
 Os+0.422
osit+0.416
Neg logit contributions
 AMER-0.703
heastern-0.691
uble-0.679
utz-0.675
 rug-0.663
Token "
Token ID366
Feature activation-2.69e-01

Pos logit contributions
stood+0.894
Pos+0.766
 Os+0.761
 Stard+0.759
 ageing+0.742
Neg logit contributions
 AMER-0.998
heastern-0.987
uble-0.964
utz-0.953
 rug-0.935
Tokenb
Token ID65
Feature activation+8.47e-03

Pos logit contributions
 AMER+1.046
heastern+1.020
uble+1.003
utz+0.992
 rug+0.982
Neg logit contributions
stood-0.908
Pos-0.784
 Stard-0.764
 Os-0.760
osit-0.756
Token unsupported
Token ID24222
Feature activation-5.99e-01

Pos logit contributions
 AMER+2.277
heastern+2.193
uble+2.158
utz+2.152
 rug+2.115
Neg logit contributions
stood-2.345
Pos-2.036
osit-1.972
 Stard-1.945
 Os-1.936
Token on
Token ID319
Feature activation+1.99e-02

Pos logit contributions
 AMER+2.415
heastern+2.353
uble+2.324
utz+2.300
 rug+2.271
Neg logit contributions
stood-1.922
Pos-1.643
 Stard-1.597
 Os-1.582
osit-1.580
Token your
Token ID534
Feature activation-5.62e-01

Pos logit contributions
stood+0.071
Pos+0.062
 Stard+0.061
 Os+0.061
osit+0.060
Neg logit contributions
 AMER-0.073
heastern-0.072
uble-0.070
utz-0.069
 rug-0.069
Token device
Token ID3335
Feature activation-5.78e-01

Pos logit contributions
 AMER+1.836
heastern+1.783
uble+1.756
utz+1.726
 rug+1.718
Neg logit contributions
stood-2.217
Pos-1.962
 Stard-1.913
osit-1.905
 Os-1.900
Token Media
Token ID6343
Feature activation-8.29e-01

Pos logit contributions
 AMER+2.300
heastern+2.226
uble+2.215
utz+2.181
 rug+2.152
Neg logit contributions
stood-1.876
Pos-1.597
 Stard-1.555
osit-1.547
 Os-1.528
Token caption
Token ID8305
Feature activation-1.94e+00

Pos logit contributions
 AMER+2.513
heastern+2.435
uble+2.391
utz+2.361
 rug+2.328
Neg logit contributions
stood-3.492
Pos-3.095
 Stard-3.034
 Os-3.020
osit-3.016
Token Nigel
Token ID28772
Feature activation+4.92e-01

Pos logit contributions
 AMER+7.565
heastern+6.949
utz+6.916
uble+6.908
 rug+6.733
Neg logit contributions
stood-6.211
Pos-5.319
osit-5.118
 ageing-5.066
 eval-5.014
Token Farage
Token ID33904
Feature activation-1.72e-01

Pos logit contributions
stood+1.596
 Os+1.388
 Stard+1.374
Pos+1.366
 Kak+1.321
Neg logit contributions
 AMER-1.968
heastern-1.908
uble-1.869
utz-1.847
 rug-1.837
Token:
Token ID25
Feature activation-8.27e-01

Pos logit contributions
 AMER+0.663
heastern+0.648
uble+0.636
utz+0.629
 rug+0.622
Neg logit contributions
stood-0.588
Pos-0.508
 Stard-0.496
 Os-0.494
osit-0.490
Token "
Token ID366
Feature activation-6.57e-01

Pos logit contributions
 AMER+3.358
heastern+3.257
uble+3.210
utz+3.182
 rug+3.146
Neg logit contributions
stood-2.632
Pos-2.251
 Stard-2.180
osit-2.167
 Os-2.163
TokenLord
Token ID22438
Feature activation+2.11e-01

Pos logit contributions
 AMER+2.373
heastern+2.294
uble+2.255
utz+2.245
 rug+2.205
Neg logit contributions
stood-2.369
Pos-2.035
 Stard-2.028
 Os-2.017
osit-2.008
Token<|endoftext|>
Token ID50256
Feature activation-2.07e-01

Pos logit contributions
stood+0.121
Pos+0.106
 Stard+0.104
 Os+0.104
osit+0.102
Neg logit contributions
 AMER-0.126
heastern-0.124
uble-0.122
utz-0.120
 rug-0.119
TokenImage
Token ID5159
Feature activation-5.96e-01

Pos logit contributions
 AMER+0.811
heastern+0.792
uble+0.778
utz+0.767
 rug+0.763
Neg logit contributions
stood-0.692
Pos-0.602
 Stard-0.578
 Os-0.575
osit-0.574
Token copyright
Token ID6634
Feature activation-2.03e+00

Pos logit contributions
 AMER+1.879
heastern+1.822
uble+1.786
utz+1.758
 rug+1.738
Neg logit contributions
stood-2.444
Pos-2.166
 Stard-2.119
 Os-2.110
osit-2.106
Token publicity
Token ID19403
Feature activation-6.80e-01

Pos logit contributions
 AMER+7.061
heastern+6.605
uble+6.473
utz+6.437
 rug+6.325
Neg logit contributions
stood-7.354
Pos-6.449
osit-6.137
 Stard-6.054
 ageing-6.016
Token Image
Token ID7412
Feature activation-4.73e-01

Pos logit contributions
 AMER+2.536
heastern+2.472
uble+2.429
utz+2.400
 rug+2.374
Neg logit contributions
stood-2.395
Pos-2.081
 Stard-2.030
 Os-2.019
osit-2.010
Token caption
Token ID8305
Feature activation-1.92e+00

Pos logit contributions
 AMER+1.405
heastern+1.359
uble+1.337
utz+1.318
 rug+1.299
Neg logit contributions
stood-2.014
Pos-1.792
 Stard-1.758
 Os-1.748
osit-1.746
Token The
Token ID383
Feature activation-1.85e-01

Pos logit contributions
 AMER+7.574
heastern+7.038
uble+6.988
utz+6.966
 rug+6.869
Neg logit contributions
stood-6.052
Pos-5.187
osit-4.965
 eval-4.874
 ageing-4.840
Token Greeks
Token ID25059
Feature activation+3.78e-02

Pos logit contributions
 AMER+0.695
heastern+0.681
uble+0.668
utz+0.661
 rug+0.651
Neg logit contributions
stood-0.647
Pos-0.560
 Stard-0.550
 Os-0.548
osit-0.542
Token were
Token ID547
Feature activation-3.91e-01

Pos logit contributions
stood+0.134
Pos+0.116
 Stard+0.114
 Os+0.113
osit+0.112
Neg logit contributions
 AMER-0.141
heastern-0.138
uble-0.135
utz-0.133
 rug-0.132
Token totally
Token ID6635
Feature activation-1.37e-01

Pos logit contributions
 AMER+1.443
heastern+1.396
uble+1.378
utz+1.358
 rug+1.353
Neg logit contributions
stood-1.389
Pos-1.212
 Stard-1.177
osit-1.176
 Os-1.164
Token "
Token ID366
Feature activation+6.79e-02

Pos logit contributions
 AMER+0.487
heastern+0.475
uble+0.466
utz+0.460
 rug+0.455
Neg logit contributions
stood-0.503
Pos-0.440
 Stard-0.430
 Os-0.428
osit-0.426
Token:
Token ID25
Feature activation+1.50e-01

Pos logit contributions
 AMER+0.603
heastern+0.590
uble+0.580
utz+0.573
 rug+0.568
Neg logit contributions
stood-0.487
Pos-0.418
 Stard-0.406
 Os-0.404
osit-0.402
Token
Token ID198
Feature activation+3.32e-01

Pos logit contributions
stood+0.539
Pos+0.474
 Os+0.462
 Stard+0.462
osit+0.456
Neg logit contributions
 AMER-0.541
heastern-0.536
uble-0.527
utz-0.512
 rug-0.508
Token
Token ID198
Feature activation+2.95e-01

Pos logit contributions
stood+1.153
Pos+1.011
 Os+0.976
 Stard+0.974
osit+0.965
Neg logit contributions
 AMER-1.262
heastern-1.245
uble-1.211
utz-1.190
 rug-1.186
Token13
Token ID1485
Feature activation-4.88e-01

Pos logit contributions
stood+0.970
Pos+0.850
 Stard+0.805
osit+0.801
 Os+0.800
Neg logit contributions
 AMER-1.173
heastern-1.150
uble-1.125
utz-1.106
 rug-1.103
Token PH
Token ID9370
Feature activation-1.41e-01

Pos logit contributions
 AMER+1.890
heastern+1.838
uble+1.808
utz+1.790
 rug+1.772
Neg logit contributions
stood-1.654
Pos-1.423
 Stard-1.387
 Os-1.375
osit-1.375
TokenOTOS
Token ID33291
Feature activation-1.87e+00

Pos logit contributions
 AMER+0.407
heastern+0.393
uble+0.384
utz+0.378
 rug+0.373
Neg logit contributions
stood-0.617
Pos-0.550
 Stard-0.540
 Os-0.538
osit-0.536
Token Donald
Token ID3759
Feature activation-2.07e-01

Pos logit contributions
 AMER+5.765
utz+5.147
uble+5.043
 rug+4.889
heastern+4.856
Neg logit contributions
stood-7.671
 ageing-6.693
osit-6.583
Pos-6.551
 eval-6.508
Token Trump
Token ID1301
Feature activation-1.21e-01

Pos logit contributions
 AMER+0.508
heastern+0.487
uble+0.475
utz+0.468
 rug+0.458
Neg logit contributions
stood-0.994
Pos-0.896
 Stard-0.882
osit-0.876
 Os-0.876
Token's
Token ID338
Feature activation-4.15e-01

Pos logit contributions
 AMER+0.457
heastern+0.446
uble+0.438
utz+0.433
 rug+0.428
Neg logit contributions
stood-0.421
Pos-0.365
 Stard-0.356
 Os-0.355
osit-0.353
Token potential
Token ID2785
Feature activation-4.98e-01

Pos logit contributions
 AMER+1.551
heastern+1.503
uble+1.480
utz+1.466
 rug+1.455
Neg logit contributions
stood-1.457
Pos-1.264
 Stard-1.233
 Os-1.224
osit-1.222
Token running
Token ID2491
Feature activation-1.10e-01

Pos logit contributions
 AMER+2.120
heastern+2.070
uble+2.039
utz+2.019
 rug+2.002
Neg logit contributions
stood-1.490
Pos-1.259
 Stard-1.223
 Os-1.213
osit-1.210
Token
Token ID198
Feature activation+1.80e-01

Pos logit contributions
stood+0.581
Pos+0.509
 Os+0.493
 Stard+0.492
osit+0.487
Neg logit contributions
 AMER-0.623
heastern-0.612
uble-0.598
utz-0.588
 rug-0.586
TokenImage
Token ID5159
Feature activation-5.68e-01

Pos logit contributions
stood+0.617
Pos+0.543
 Os+0.516
 Stard+0.515
osit+0.515
Neg logit contributions
 AMER-0.689
heastern-0.676
uble-0.662
 rug-0.650
utz-0.649
Token copyright
Token ID6634
Feature activation-1.83e+00

Pos logit contributions
 AMER+1.745
heastern+1.683
uble+1.648
utz+1.625
 rug+1.605
Neg logit contributions
stood-2.383
Pos-2.112
 Stard-2.062
osit-2.058
 Os-2.051
Token AFP
Token ID8916
Feature activation-6.33e-01

Pos logit contributions
 AMER+5.093
heastern+4.695
uble+4.550
utz+4.545
 rug+4.397
Neg logit contributions
stood-7.996
Pos-7.174
 Stard-6.905
osit-6.898
 eval-6.791
Token Image
Token ID7412
Feature activation-6.55e-01

Pos logit contributions
 AMER+2.433
heastern+2.374
uble+2.335
utz+2.306
 rug+2.284
Neg logit contributions
stood-2.159
Pos-1.871
 Stard-1.823
 Os-1.815
osit-1.805
Token caption
Token ID8305
Feature activation-1.84e+00

Pos logit contributions
 AMER+1.891
heastern+1.810
uble+1.792
utz+1.777
 rug+1.741
Neg logit contributions
stood-2.830
Pos-2.517
 Stard-2.477
osit-2.454
 Os-2.449
Token The
Token ID383
Feature activation-2.86e-01

Pos logit contributions
 AMER+6.656
utz+6.087
heastern+6.054
uble+6.010
 rug+5.858
Neg logit contributions
stood-6.416
Pos-5.561
 ageing-5.446
osit-5.425
 eval-5.400
Token Kurds
Token ID23880
Feature activation-1.78e-01

Pos logit contributions
 AMER+1.052
heastern+1.027
uble+1.009
utz+0.995
 rug+0.984
Neg logit contributions
stood-1.020
Pos-0.887
 Stard-0.867
 Os-0.865
osit-0.858
Token'
Token ID6
Feature activation-2.59e-01

Pos logit contributions
 AMER+0.634
heastern+0.617
uble+0.606
utz+0.598
 rug+0.591
Neg logit contributions
stood-0.656
Pos-0.574
 Stard-0.561
 Os-0.558
osit-0.556
Token push
Token ID4574
Feature activation-5.41e-01

Pos logit contributions
 AMER+0.951
heastern+0.927
uble+0.910
utz+0.899
 rug+0.889
Neg logit contributions
stood-0.930
Pos-0.809
 Stard-0.790
 Os-0.787
osit-0.783
Token has
Token ID468
Feature activation-3.82e-01

Pos logit contributions
 AMER+1.882
heastern+1.831
uble+1.797
utz+1.774
 rug+1.754
Neg logit contributions
stood-2.042
Pos-1.794
 Stard-1.753
 Os-1.743
osit-1.738
Token Russia
Token ID3284
Feature activation+3.97e-02

Pos logit contributions
stood+1.461
Pos+1.234
 Os+1.231
 Stard+1.217
 Kak+1.188
Neg logit contributions
 AMER-1.623
heastern-1.590
uble-1.569
utz-1.543
 rug-1.526
Token.
Token ID13
Feature activation+2.55e-01

Pos logit contributions
stood+0.122
Pos+0.103
 Os+0.101
 Stard+0.100
osit+0.098
Neg logit contributions
 AMER-0.166
heastern-0.162
uble-0.159
utz-0.158
 rug-0.156
Token
Token ID198
Feature activation+1.66e-01

Pos logit contributions
stood+0.880
Pos+0.769
 Os+0.765
 Stard+0.753
osit+0.739
Neg logit contributions
heastern-0.957
 AMER-0.955
uble-0.941
 rug-0.923
utz-0.918
Token
Token ID198
Feature activation+1.80e-01

Pos logit contributions
stood+0.581
Pos+0.509
 Os+0.493
 Stard+0.492
osit+0.487
Neg logit contributions
 AMER-0.623
heastern-0.612
uble-0.598
utz-0.588
 rug-0.586
TokenImage
Token ID5159
Feature activation-5.68e-01

Pos logit contributions
stood+0.617
Pos+0.543
 Os+0.516
 Stard+0.515
osit+0.515
Neg logit contributions
 AMER-0.689
heastern-0.676
uble-0.662
 rug-0.650
utz-0.649
Token copyright
Token ID6634
Feature activation-1.83e+00

Pos logit contributions
 AMER+1.745
heastern+1.683
uble+1.648
utz+1.625
 rug+1.605
Neg logit contributions
stood-2.383
Pos-2.112
 Stard-2.062
osit-2.058
 Os-2.051
Token AFP
Token ID8916
Feature activation-6.33e-01

Pos logit contributions
 AMER+5.093
heastern+4.695
uble+4.550
utz+4.545
 rug+4.397
Neg logit contributions
stood-7.996
Pos-7.174
 Stard-6.905
osit-6.898
 eval-6.791
Token Image
Token ID7412
Feature activation-6.55e-01

Pos logit contributions
 AMER+2.433
heastern+2.374
uble+2.335
utz+2.306
 rug+2.284
Neg logit contributions
stood-2.159
Pos-1.871
 Stard-1.823
 Os-1.815
osit-1.805
Token caption
Token ID8305
Feature activation-1.84e+00

Pos logit contributions
 AMER+1.891
heastern+1.810
uble+1.792
utz+1.777
 rug+1.741
Neg logit contributions
stood-2.830
Pos-2.517
 Stard-2.477
osit-2.454
 Os-2.449
Token The
Token ID383
Feature activation-2.86e-01

Pos logit contributions
 AMER+6.656
utz+6.087
heastern+6.054
uble+6.010
 rug+5.858
Neg logit contributions
stood-6.416
Pos-5.561
 ageing-5.446
osit-5.425
 eval-5.400
Token Kurds
Token ID23880
Feature activation-1.78e-01

Pos logit contributions
 AMER+1.052
heastern+1.027
uble+1.009
utz+0.995
 rug+0.984
Neg logit contributions
stood-1.020
Pos-0.887
 Stard-0.867
 Os-0.865
osit-0.858
Tokenholes
Token ID28439
Feature activation+2.26e-01

Pos logit contributions
 AMER+0.989
heastern+0.968
uble+0.927
 rug+0.899
utz+0.897
Neg logit contributions
stood-1.728
Pos-1.543
 Os-1.529
osit-1.506
 Stard-1.494
Token in
Token ID287
Feature activation-1.63e-01

Pos logit contributions
stood+0.562
 Os+0.470
Pos+0.451
 eval+0.446
osit+0.439
Neg logit contributions
 AMER-1.069
heastern-1.057
uble-1.027
utz-1.025
 rug-1.023
Token American
Token ID1605
Feature activation-2.67e-01

Pos logit contributions
 AMER+0.628
uble+0.624
heastern+0.620
utz+0.587
 rug+0.585
Neg logit contributions
stood-0.536
 Os-0.483
Pos-0.470
osit-0.462
 Stard-0.458
Token politics
Token ID4819
Feature activation-4.07e-01

Pos logit contributions
 AMER+0.950
heastern+0.926
uble+0.915
utz+0.893
 rug+0.882
Neg logit contributions
stood-0.980
Pos-0.863
 Stard-0.845
 Os-0.843
osit-0.840
Token'
Token ID6
Feature activation+5.28e-02

Pos logit contributions
 AMER+1.743
heastern+1.707
uble+1.681
utz+1.666
 rug+1.646
Neg logit contributions
stood-1.207
Pos-1.022
 Stard-0.991
 Os-0.978
osit-0.977
Token MORE
Token ID12011
Feature activation-1.82e+00

Pos logit contributions
stood+0.263
 Os+0.239
Pos+0.237
 Stard+0.236
osit+0.234
Neg logit contributions
heastern-0.117
 AMER-0.115
uble-0.111
utz-0.110
 rug-0.105
Token completed
Token ID5668
Feature activation+1.02e+00

Pos logit contributions
 AMER+6.654
heastern+6.308
uble+6.265
utz+6.209
 rug+6.209
Neg logit contributions
stood-6.078
Pos-5.488
osit-5.315
 Stard-5.210
SCP-5.176
Token a
Token ID257
Feature activation+5.15e-01

Pos logit contributions
stood+3.419
 Os+2.947
 Stard+2.892
Pos+2.863
 Kak+2.816
Neg logit contributions
heastern-3.809
 AMER-3.781
uble-3.763
utz-3.745
 rug-3.630
Token swing
Token ID9628
Feature activation+6.29e-01

Pos logit contributions
stood+1.837
 Os+1.574
Pos+1.573
 Stard+1.573
osit+1.535
Neg logit contributions
 AMER-1.884
heastern-1.844
uble-1.811
utz-1.806
 rug-1.742
Token through
Token ID832
Feature activation-2.14e-01

Pos logit contributions
stood+2.107
Pos+1.835
 Os+1.809
 Stard+1.809
osit+1.783
Neg logit contributions
 AMER-2.374
uble-2.335
heastern-2.287
utz-2.281
 rug-2.243
Token North
Token ID2258
Feature activation-2.85e-01

Pos logit contributions
 AMER+0.794
heastern+0.775
uble+0.764
utz+0.753
 rug+0.745
Neg logit contributions
stood-0.760
Pos-0.658
 Stard-0.647
 Os-0.647
osit-0.637
TokenACE
Token ID11598
Feature activation-5.05e-01

Pos logit contributions
stood+0.393
Pos+0.341
osit+0.340
 Stard+0.329
 Os+0.324
Neg logit contributions
 AMER-0.468
heastern-0.459
uble-0.445
utz-0.440
 rug-0.440
Token).
Token ID737
Feature activation+1.99e-01

Pos logit contributions
 AMER+1.770
heastern+1.727
uble+1.693
utz+1.670
 rug+1.651
Neg logit contributions
stood-1.895
Pos-1.660
 Stard-1.624
 Os-1.615
osit-1.609
Token
Token ID198
Feature activation+2.36e-01

Pos logit contributions
stood+0.689
Pos+0.605
 Stard+0.593
 Os+0.590
osit+0.583
Neg logit contributions
 AMER-0.749
heastern-0.738
uble-0.727
utz-0.715
 rug-0.711
Token
Token ID198
Feature activation+2.57e-01

Pos logit contributions
stood+0.818
Pos+0.717
 Os+0.696
 Stard+0.694
osit+0.688
Neg logit contributions
 AMER-0.897
heastern-0.885
uble-0.859
utz-0.848
 rug-0.846
TokenImage
Token ID5159
Feature activation-5.39e-01

Pos logit contributions
stood+0.873
Pos+0.771
 Stard+0.730
osit+0.727
 Os+0.725
Neg logit contributions
 AMER-1.003
heastern-0.979
uble-0.958
utz-0.943
 rug-0.942
Token caption
Token ID8305
Feature activation-1.81e+00

Pos logit contributions
 AMER+1.692
heastern+1.640
uble+1.607
utz+1.583
 rug+1.564
Neg logit contributions
stood-2.223
Pos-1.972
 Stard-1.931
 Os-1.923
osit-1.917
Token Council
Token ID4281
Feature activation+2.22e-01

Pos logit contributions
 AMER+6.781
heastern+6.352
uble+6.278
 rug+6.210
utz+6.185
Neg logit contributions
stood-5.985
Pos-5.062
 ageing-4.911
osit-4.898
 eval-4.834
Token staff
Token ID3085
Feature activation+4.66e-01

Pos logit contributions
stood+0.754
Pos+0.650
 Stard+0.634
 Os+0.632
osit+0.628
Neg logit contributions
 AMER-0.859
heastern-0.838
uble-0.824
utz-0.814
 rug-0.806
Token s
Token ID264
Feature activation-3.83e-02

Pos logit contributions
stood+1.653
Pos+1.424
 Stard+1.403
 Os+1.403
osit+1.380
Neg logit contributions
 AMER-1.731
heastern-1.699
uble-1.661
utz-1.629
 rug-1.609
Tokenifted
Token ID21715
Feature activation-5.23e-01

Pos logit contributions
 AMER+0.182
heastern+0.177
uble+0.175
utz+0.173
 rug+0.172
Neg logit contributions
stood-0.098
Pos-0.079
 Os-0.077
 Stard-0.076
osit-0.075
Token through
Token ID832
Feature activation-7.80e-01

Pos logit contributions
 AMER+1.817
heastern+1.759
uble+1.728
utz+1.705
 rug+1.697
Neg logit contributions
stood-1.975
Pos-1.737
 Stard-1.688
osit-1.681
 Os-1.677
Token unsupported
Token ID24222
Feature activation-5.69e-01

Pos logit contributions
 AMER+2.225
heastern+2.141
uble+2.112
utz+2.101
 rug+2.065
Neg logit contributions
stood-2.263
Pos-1.962
osit-1.900
 Stard-1.877
 Os-1.870
Token on
Token ID319
Feature activation-1.02e-02

Pos logit contributions
 AMER+2.304
heastern+2.247
uble+2.217
utz+2.194
 rug+2.168
Neg logit contributions
stood-1.817
Pos-1.553
 Stard-1.510
 Os-1.497
osit-1.493
Token your
Token ID534
Feature activation-5.27e-01

Pos logit contributions
 AMER+0.038
heastern+0.037
uble+0.036
utz+0.035
 rug+0.035
Neg logit contributions
stood-0.036
Pos-0.031
 Stard-0.031
 Os-0.031
osit-0.031
Token device
Token ID3335
Feature activation-5.96e-01

Pos logit contributions
 AMER+1.718
heastern+1.668
uble+1.643
utz+1.615
 rug+1.610
Neg logit contributions
stood-2.088
Pos-1.848
 Stard-1.800
osit-1.795
 Os-1.790
Token Media
Token ID6343
Feature activation-8.53e-01

Pos logit contributions
 AMER+2.362
heastern+2.280
uble+2.275
utz+2.240
 rug+2.209
Neg logit contributions
stood-1.946
Pos-1.655
 Stard-1.610
osit-1.603
 Os-1.581
Token caption
Token ID8305
Feature activation-1.81e+00

Pos logit contributions
 AMER+2.572
heastern+2.493
uble+2.447
utz+2.419
 rug+2.386
Neg logit contributions
stood-3.604
Pos-3.193
 Stard-3.130
 Os-3.118
osit-3.114
Token A
Token ID317
Feature activation-5.45e-01

Pos logit contributions
 AMER+7.092
heastern+6.530
uble+6.501
utz+6.497
 rug+6.325
Neg logit contributions
stood-5.847
Pos-5.002
osit-4.821
 ageing-4.760
 eval-4.712
Token drone
Token ID12170
Feature activation-1.49e-01

Pos logit contributions
 AMER+1.998
heastern+1.948
uble+1.912
utz+1.887
 rug+1.872
Neg logit contributions
stood-1.953
Pos-1.705
 Stard-1.659
 Os-1.649
osit-1.645
Token-
Token ID12
Feature activation-3.84e-02

Pos logit contributions
 AMER+0.591
heastern+0.580
uble+0.570
utz+0.564
 rug+0.553
Neg logit contributions
stood-0.495
Pos-0.424
 Stard-0.415
 Os-0.415
osit-0.407
Tokencamera
Token ID25695
Feature activation-4.72e-01

Pos logit contributions
 AMER+0.161
heastern+0.157
uble+0.155
utz+0.153
 rug+0.152
Neg logit contributions
stood-0.119
Pos-0.101
 Stard-0.097
 Os-0.097
osit-0.096
Token was
Token ID373
Feature activation-1.51e-01

Pos logit contributions
 AMER+1.817
heastern+1.761
uble+1.739
utz+1.715
 rug+1.709
Neg logit contributions
stood-1.596
Pos-1.376
 Stard-1.336
osit-1.334
 Os-1.326
Token on
Token ID319
Feature activation+4.23e-02

Pos logit contributions
 AMER+0.981
heastern+0.957
uble+0.936
utz+0.923
 rug+0.917
Neg logit contributions
stood-1.027
Pos-0.872
 Stard-0.838
osit-0.837
 Os-0.823
Token sleeve
Token ID23179
Feature activation-3.99e-01

Pos logit contributions
stood+0.149
Pos+0.130
osit+0.124
 Stard+0.122
 Kak+0.121
Neg logit contributions
 AMER-0.160
heastern-0.153
 rug-0.152
utz-0.148
uble-0.145
Token at
Token ID379
Feature activation+9.59e-03

Pos logit contributions
 AMER+1.349
heastern+1.262
utz+1.246
uble+1.224
 rug+1.207
Neg logit contributions
stood-1.594
Pos-1.409
 Stard-1.353
osit-1.350
 ageing-1.326
Token basketball
Token ID9669
Feature activation-3.55e-01

Pos logit contributions
stood+0.039
Pos+0.034
 Stard+0.034
 Os+0.034
osit+0.033
Neg logit contributions
 AMER-0.031
heastern-0.030
uble-0.029
utz-0.029
 rug-0.028
Token game
Token ID983
Feature activation-1.09e-01

Pos logit contributions
 AMER+1.448
heastern+1.420
uble+1.395
utz+1.380
 rug+1.364
Neg logit contributions
stood-1.126
Pos-0.966
 Stard-0.941
 Os-0.937
osit-0.928
Token MORE
Token ID12011
Feature activation-1.80e+00

Pos logit contributions
 AMER+0.317
heastern+0.313
uble+0.305
 rug+0.298
utz+0.298
Neg logit contributions
stood-0.472
Pos-0.419
 Os-0.418
 Stard-0.416
osit-0.410
Token and
Token ID290
Feature activation-7.52e-02

Pos logit contributions
 AMER+6.858
heastern+6.542
uble+6.485
utz+6.420
 rug+6.329
Neg logit contributions
stood-5.936
Pos-5.148
osit-5.041
 Stard-4.992
 Kak-4.790
Token Hillary
Token ID5073
Feature activation+1.29e-01

Pos logit contributions
 AMER+0.284
heastern+0.278
uble+0.273
utz+0.270
 rug+0.267
Neg logit contributions
stood-0.261
Pos-0.226
 Stard-0.221
 Os-0.220
osit-0.218
Token Clinton
Token ID2605
Feature activation+2.14e-02

Pos logit contributions
stood+0.484
Pos+0.426
 Os+0.423
 Stard+0.419
osit+0.415
Neg logit contributions
 AMER-0.448
heastern-0.439
uble-0.429
utz-0.424
 rug-0.418
Token Hillary
Token ID5073
Feature activation+3.90e-01

Pos logit contributions
stood+0.103
Pos+0.093
 Stard+0.091
 Os+0.091
osit+0.091
Neg logit contributions
 AMER-0.052
heastern-0.051
uble-0.049
utz-0.048
 rug-0.047
Token Diane
Token ID26542
Feature activation+6.24e-01

Pos logit contributions
stood+0.757
Pos+0.597
 Os+0.590
 Stard+0.552
osit+0.542
Neg logit contributions
 AMER-2.068
heastern-2.040
utz-1.989
uble-1.985
 rug-1.969
Token ju
Token ID7544
Feature activation+6.56e-01

Pos logit contributions
 AMER+0.533
heastern+0.521
uble+0.511
utz+0.505
 rug+0.499
Neg logit contributions
stood-0.487
Pos-0.423
 Stard-0.412
 Os-0.411
osit-0.408
Tokenillet
Token ID32512
Feature activation-8.58e-02

Pos logit contributions
stood+1.872
Pos+1.581
osit+1.539
 Os+1.501
 Stard+1.486
Neg logit contributions
 AMER-2.873
heastern-2.820
utz-2.700
 rug-2.692
uble-2.681
Token.
Token ID13
Feature activation+1.51e-01

Pos logit contributions
 AMER+0.310
heastern+0.302
uble+0.297
utz+0.293
 rug+0.290
Neg logit contributions
stood-0.312
Pos-0.273
 Stard-0.266
 Os-0.265
osit-0.264
Token pic
Token ID8301
Feature activation-8.23e-01

Pos logit contributions
stood+0.572
Pos+0.504
 Os+0.492
 Stard+0.490
osit+0.487
Neg logit contributions
 AMER-0.523
heastern-0.514
uble-0.503
utz-0.496
 rug-0.489
Token.
Token ID13
Feature activation-1.89e-01

Pos logit contributions
 AMER+2.876
heastern+2.757
uble+2.733
utz+2.722
 rug+2.656
Neg logit contributions
stood-3.067
Pos-2.670
 Stard-2.600
osit-2.591
 Os-2.576
Tokentwitter
Token ID6956
Feature activation-1.78e+00

Pos logit contributions
 AMER+0.353
heastern+0.336
uble+0.323
utz+0.314
 rug+0.309
Neg logit contributions
stood-1.018
Pos-0.931
 Stard-0.916
 Os-0.914
osit-0.911
Token.
Token ID13
Feature activation-2.52e-01

Pos logit contributions
 AMER+4.970
uble+4.793
utz+4.792
heastern+4.521
 rug+4.410
Neg logit contributions
stood-7.233
�士-6.883
 ageing-6.555
 Stard-6.417
Pos-6.327
Tokencom
Token ID785
Feature activation-8.09e-01

Pos logit contributions
 AMER+1.078
heastern+1.053
uble+1.041
utz+1.031
 rug+1.016
Neg logit contributions
stood-0.743
Pos-0.625
 Stard-0.610
 Os-0.606
osit-0.599
Token/
Token ID14
Feature activation-5.34e-01

Pos logit contributions
 AMER+2.276
uble+2.154
heastern+2.152
utz+2.136
 rug+2.058
Neg logit contributions
stood-3.528
Pos-3.127
 Stard-3.096
 ageing-3.089
osit-3.073
Tokenu
Token ID84
Feature activation-1.90e-01

Pos logit contributions
 AMER+2.156
heastern+2.119
uble+2.098
utz+2.081
 rug+2.036
Neg logit contributions
stood-1.691
Pos-1.424
 Stard-1.418
 Os-1.396
osit-1.380
TokenI
Token ID40
Feature activation+5.32e-01

Pos logit contributions
 AMER+0.795
heastern+0.779
uble+0.769
utz+0.761
 rug+0.751
Neg logit contributions
stood-0.579
Pos-0.488
 Stard-0.479
 Os-0.474
osit-0.470
Token media
Token ID2056
Feature activation-4.61e-01

Pos logit contributions
stood+0.223
Pos+0.194
 Stard+0.191
 Os+0.191
osit+0.188
Neg logit contributions
 AMER-0.220
heastern-0.215
uble-0.212
utz-0.209
 rug-0.203
Token bus
Token ID1323
Feature activation-3.20e-01

Pos logit contributions
 AMER+1.710
heastern+1.667
uble+1.640
utz+1.618
 rug+1.597
Neg logit contributions
stood-1.636
Pos-1.423
 Stard-1.388
 Os-1.382
osit-1.374
Token)
Token ID8
Feature activation+5.70e-02

Pos logit contributions
 AMER+1.163
heastern+1.134
uble+1.113
utz+1.100
 rug+1.085
Neg logit contributions
stood-1.160
Pos-1.011
 Os-0.987
 Stard-0.986
osit-0.979
Token pic
Token ID8301
Feature activation-1.93e-01

Pos logit contributions
stood+0.224
Pos+0.198
 Os+0.195
 Stard+0.194
osit+0.191
Neg logit contributions
 AMER-0.189
heastern-0.185
uble-0.182
utz-0.178
 rug-0.176
Token.
Token ID13
Feature activation-2.50e-01

Pos logit contributions
 AMER+0.731
heastern+0.717
uble+0.703
utz+0.692
 rug+0.689
Neg logit contributions
stood-0.669
Pos-0.582
 Stard-0.567
 Os-0.565
osit-0.560
Tokentwitter
Token ID6956
Feature activation-1.77e+00

Pos logit contributions
 AMER+0.469
heastern+0.446
uble+0.430
utz+0.417
 rug+0.410
Neg logit contributions
stood-1.347
Pos-1.232
 Stard-1.212
 Os-1.209
osit-1.205
Token.
Token ID13
Feature activation-2.78e-01

Pos logit contributions
 AMER+4.991
uble+4.819
utz+4.810
heastern+4.559
 rug+4.446
Neg logit contributions
stood-7.198
�士-6.823
 ageing-6.621
 Stard-6.365
Pos-6.286
Tokencom
Token ID785
Feature activation-9.40e-01

Pos logit contributions
 AMER+1.192
heastern+1.163
uble+1.150
utz+1.139
 rug+1.123
Neg logit contributions
stood-0.819
Pos-0.688
 Stard-0.672
 Os-0.667
osit-0.662
Token/
Token ID14
Feature activation-5.32e-01

Pos logit contributions
 AMER+2.612
heastern+2.462
uble+2.462
utz+2.451
 rug+2.358
Neg logit contributions
stood-4.118
Pos-3.647
 ageing-3.628
 Stard-3.612
osit-3.594
Tokenz
Token ID89
Feature activation-2.59e-02

Pos logit contributions
 AMER+2.150
heastern+2.114
uble+2.092
utz+2.075
 rug+2.031
Neg logit contributions
stood-1.680
Pos-1.416
 Stard-1.409
 Os-1.388
osit-1.371
TokenJ
Token ID41
Feature activation+4.93e-01

Pos logit contributions
 AMER+0.106
heastern+0.104
uble+0.102
utz+0.101
 rug+0.100
Neg logit contributions
stood-0.082
Pos-0.070
 Stard-0.068
 Os-0.067
osit-0.067
Token 2014
Token ID1946
Feature activation+2.26e-01

Pos logit contributions
 AMER+0.565
heastern+0.547
uble+0.535
utz+0.527
 rug+0.521
Neg logit contributions
stood-0.781
Pos-0.696
 Stard-0.681
 Os-0.678
osit-0.676
Token after
Token ID706
Feature activation+1.00e-01

Pos logit contributions
stood+0.762
Pos+0.660
 Os+0.651
 Stard+0.650
 Kak+0.637
Neg logit contributions
 AMER-0.864
heastern-0.846
uble-0.828
utz-0.819
 rug-0.807
Token the
Token ID262
Feature activation+1.49e-01

Pos logit contributions
stood+0.337
Pos+0.290
 Stard+0.284
 Os+0.283
osit+0.280
Neg logit contributions
 AMER-0.389
heastern-0.382
uble-0.375
utz-0.370
 rug-0.366
Token company
Token ID1664
Feature activation+2.41e-01

Pos logit contributions
stood+0.574
Pos+0.501
 Os+0.499
 Stard+0.497
osit+0.491
Neg logit contributions
 AMER-0.503
heastern-0.495
uble-0.485
utz-0.478
 rug-0.467
Token nearly
Token ID3016
Feature activation+6.26e-01

Pos logit contributions
stood+0.818
Pos+0.705
 Stard+0.690
 Os+0.689
osit+0.679
Neg logit contributions
 AMER-0.930
heastern-0.915
uble-0.898
utz-0.885
 rug-0.872
Token default
Token ID4277
Feature activation+9.97e-01

Pos logit contributions
stood+2.053
Pos+1.740
 Os+1.719
 Stard+1.712
osit+1.681
Neg logit contributions
 AMER-2.461
heastern-2.445
uble-2.385
utz-2.353
 rug-2.288
Tokened
Token ID276
Feature activation+7.17e-01

Pos logit contributions
stood+2.341
Pos+1.785
 Os+1.743
 Stard+1.711
 eval+1.655
Neg logit contributions
heastern-4.888
 AMER-4.875
 rug-4.673
uble-4.642
utz-4.639
Token on
Token ID319
Feature activation+6.14e-01

Pos logit contributions
stood+1.977
 Os+1.637
Pos+1.630
 Stard+1.613
 Kak+1.597
Neg logit contributions
heastern-3.127
 AMER-3.114
uble-3.043
utz-2.998
 rug-2.975
Token its
Token ID663
Feature activation+4.18e-01

Pos logit contributions
stood+2.016
Pos+1.703
 Os+1.692
 Stard+1.676
osit+1.661
Neg logit contributions
heastern-2.414
 AMER-2.410
uble-2.345
utz-2.312
 rug-2.260
Token debt
Token ID5057
Feature activation+6.05e-01

Pos logit contributions
stood+1.395
Pos+1.193
 Os+1.177
 Stard+1.173
osit+1.158
Neg logit contributions
 AMER-1.618
heastern-1.606
uble-1.569
utz-1.550
 rug-1.516
Token of
Token ID286
Feature activation+3.33e-01

Pos logit contributions
stood+1.923
Pos+1.663
 Os+1.618
 Stard+1.603
osit+1.596
Neg logit contributions
 AMER-2.413
heastern-2.396
uble-2.328
utz-2.289
 rug-2.241
Token opposing
Token ID12330
Feature activation+9.66e-03

Pos logit contributions
stood+1.114
Pos+0.968
 Os+0.933
 Stard+0.931
osit+0.918
Neg logit contributions
 AMER-1.262
heastern-1.232
uble-1.215
utz-1.198
 rug-1.175
Token the
Token ID262
Feature activation+1.05e-01

Pos logit contributions
stood+0.032
Pos+0.028
 Stard+0.027
 Os+0.027
osit+0.027
Neg logit contributions
 AMER-0.038
heastern-0.037
uble-0.036
utz-0.036
 rug-0.036
Token self
Token ID2116
Feature activation+3.87e-02

Pos logit contributions
stood+0.362
Pos+0.313
 Stard+0.305
 Os+0.303
osit+0.301
Neg logit contributions
 AMER-0.400
heastern-0.390
uble-0.384
utz-0.380
 rug-0.374
Token-
Token ID12
Feature activation+5.93e-01

Pos logit contributions
stood+0.125
Pos+0.107
 Stard+0.103
 Os+0.103
osit+0.102
Neg logit contributions
 AMER-0.157
heastern-0.154
uble-0.150
utz-0.149
 rug-0.148
Tokenaff
Token ID2001
Feature activation+4.80e-01

Pos logit contributions
stood+2.018
Pos+1.743
 Stard+1.664
osit+1.663
 Os+1.655
Neg logit contributions
 AMER-2.312
heastern-2.255
uble-2.202
 rug-2.169
utz-2.164
Tokenirming
Token ID29808
Feature activation+1.25e+00

Pos logit contributions
stood+0.938
Pos+0.704
osit+0.674
 Stard+0.653
 Os+0.652
Neg logit contributions
 AMER-2.578
heastern-2.499
uble-2.458
 rug-2.449
utz-2.446
Token good
Token ID922
Feature activation+1.30e-01

Pos logit contributions
stood+4.148
Pos+3.550
 Os+3.486
 eval+3.465
 Stard+3.455
Neg logit contributions
 AMER-4.731
heastern-4.721
uble-4.622
utz-4.578
 rug-4.399
Token.
Token ID13
Feature activation+2.01e-01

Pos logit contributions
stood+0.442
Pos+0.382
 Os+0.371
 Stard+0.371
osit+0.368
Neg logit contributions
 AMER-0.503
heastern-0.491
uble-0.483
utz-0.477
 rug-0.470
Token In
Token ID554
Feature activation+2.10e-01

Pos logit contributions
stood+0.753
Pos+0.661
 Stard+0.646
 Os+0.643
osit+0.639
Neg logit contributions
 AMER-0.704
heastern-0.687
uble-0.674
utz-0.665
 rug-0.658
Token this
Token ID428
Feature activation+3.84e-01

Pos logit contributions
stood+0.710
Pos+0.610
 Os+0.596
 Stard+0.594
osit+0.593
Neg logit contributions
 AMER-0.817
heastern-0.800
uble-0.787
utz-0.774
 rug-0.769
Token sense
Token ID2565
Feature activation-3.78e-01

Pos logit contributions
stood+1.401
Pos+1.221
 Os+1.192
 Stard+1.190
osit+1.178
Neg logit contributions
 AMER-1.388
heastern-1.354
uble-1.331
utz-1.309
 rug-1.289
Token\n
Token ID198
Feature activation+1.80e-01

Pos logit contributions
stood+0.357
Pos+0.313
 Stard+0.304
 Os+0.303
osit+0.301
Neg logit contributions
 AMER-0.362
heastern-0.354
uble-0.347
utz-0.343
 rug-0.339
TokenCopyright
Token ID15269
Feature activation+2.34e-01

Pos logit contributions
stood+0.609
Pos+0.523
 Stard+0.513
 Os+0.510
osit+0.506
Neg logit contributions
 AMER-0.696
heastern-0.678
uble-0.668
utz-0.660
 rug-0.653
Token ©
Token ID10673
Feature activation+2.85e-01

Pos logit contributions
stood+0.863
Pos+0.768
 Os+0.750
 Stard+0.744
osit+0.743
Neg logit contributions
 AMER-0.813
heastern-0.801
uble-0.792
utz-0.771
 rug-0.761
Token 2016
Token ID1584
Feature activation+3.57e-02

Pos logit contributions
stood+1.063
Pos+0.942
 Os+0.924
 Stard+0.919
osit+0.913
Neg logit contributions
 AMER-0.984
heastern-0.970
uble-0.958
utz-0.934
 rug-0.925
Token Else
Token ID25974
Feature activation+7.27e-01

Pos logit contributions
stood+0.121
Pos+0.104
 Os+0.103
 Stard+0.102
osit+0.101
Neg logit contributions
 AMER-0.137
heastern-0.135
uble-0.133
utz-0.131
 rug-0.130
Tokenvier
Token ID49663
Feature activation+1.01e+00

Pos logit contributions
stood+2.069
Pos+1.781
 Os+1.725
osit+1.708
 Stard+1.686
Neg logit contributions
 AMER-3.113
heastern-3.048
uble-3.018
utz-2.971
 rug-2.962
Token B
Token ID347
Feature activation+4.79e-01

Pos logit contributions
stood+2.762
Pos+2.427
 Os+2.382
 Kak+2.367
 Lev+2.308
Neg logit contributions
uble-4.320
 AMER-4.263
 rug-4.200
heastern-4.152
utz-4.131
Token.
Token ID13
Feature activation+4.82e-01

Pos logit contributions
stood+1.603
Pos+1.405
osit+1.352
 Stard+1.347
 Os+1.346
Neg logit contributions
 AMER-1.868
heastern-1.830
uble-1.800
utz-1.753
 rug-1.745
TokenV
Token ID53
Feature activation+4.75e-01

Pos logit contributions
stood+0.995
Pos+0.888
 Os+0.825
 Stard+0.816
 Kak+0.789
Neg logit contributions
 AMER-2.421
heastern-2.418
uble-2.357
utz-2.314
 rug-2.313
Token.
Token ID13
Feature activation+4.91e-01

Pos logit contributions
stood+1.623
Pos+1.416
 Os+1.382
 Stard+1.380
osit+1.370
Neg logit contributions
 AMER-1.802
heastern-1.754
uble-1.714
 rug-1.710
utz-1.702
Token All
Token ID1439
Feature activation+2.63e-01

Pos logit contributions
stood+1.777
Pos+1.604
 Stard+1.568
 Os+1.553
osit+1.538
Neg logit contributions
 AMER-1.735
heastern-1.704
uble-1.684
utz-1.651
 rug-1.619
Tokenb
Token ID65
Feature activation+7.06e-01

Pos logit contributions
stood+2.424
Pos+2.084
osit+2.000
 Os+1.987
 Stard+1.978
Neg logit contributions
 AMER-3.505
heastern-3.390
 rug-3.299
uble-3.285
utz-3.269
Token6
Token ID21
Feature activation+9.46e-01

Pos logit contributions
stood+2.027
Pos+1.723
 Os+1.645
osit+1.642
 Stard+1.641
Neg logit contributions
 AMER-3.086
heastern-3.005
uble-2.916
 rug-2.913
utz-2.900
Tokence
Token ID344
Feature activation+6.84e-01

Pos logit contributions
stood+2.892
Pos+2.496
 Os+2.401
 Stard+2.400
osit+2.380
Neg logit contributions
 AMER-3.908
heastern-3.814
uble-3.678
 rug-3.665
utz-3.654
Token650
Token ID17544
Feature activation+7.67e-01

Pos logit contributions
stood+2.002
Pos+1.712
 Os+1.647
 Stard+1.640
osit+1.630
Neg logit contributions
 AMER-2.934
heastern-2.873
uble-2.792
utz-2.772
 rug-2.770
Tokenc
Token ID66
Feature activation+8.52e-01

Pos logit contributions
stood+2.290
Pos+1.990
 Os+1.912
 Stard+1.903
osit+1.880
Neg logit contributions
 AMER-3.222
heastern-3.151
uble-3.079
utz-3.050
 rug-3.039
Token6
Token ID21
Feature activation+1.15e+00

Pos logit contributions
stood+2.475
Pos+2.134
 Os+2.064
 Stard+2.046
osit+2.019
Neg logit contributions
 AMER-3.627
heastern-3.564
uble-3.475
utz-3.436
 rug-3.433
Token
Token ID851
Feature activation-1.42e-01

Pos logit contributions
stood+3.397
Pos+2.984
 Os+2.881
 Stard+2.869
osit+2.799
Neg logit contributions
 AMER-4.760
heastern-4.682
uble-4.544
utz-4.510
 rug-4.481
Token str
Token ID965
Feature activation-6.64e-01

Pos logit contributions
 AMER+0.551
heastern+0.542
uble+0.531
utz+0.524
 rug+0.516
Neg logit contributions
stood-0.479
Pos-0.416
 Os-0.407
 Stard-0.407
osit-0.401
Tokenace
Token ID558
Feature activation-1.53e-01

Pos logit contributions
 AMER+2.547
heastern+2.487
uble+2.456
utz+2.425
 rug+2.392
Neg logit contributions
stood-2.257
Pos-1.944
 Stard-1.903
 Os-1.885
osit-1.877
Token.
Token ID13
Feature activation+1.89e-01

Pos logit contributions
 AMER+0.546
heastern+0.533
uble+0.520
utz+0.513
 rug+0.509
Neg logit contributions
stood-0.562
Pos-0.495
 Os-0.481
 Stard-0.480
osit-0.479
Tokenso
Token ID568
Feature activation-5.66e-01

Pos logit contributions
stood+0.659
Pos+0.578
osit+0.558
 Os+0.557
 Stard+0.553
Neg logit contributions
 AMER-0.712
heastern-0.693
uble-0.675
 rug-0.669
utz-0.665
Tokencom
Token ID785
Feature activation-8.38e-01

Pos logit contributions
 AMER+0.755
heastern+0.738
uble+0.728
utz+0.721
 rug+0.713
Neg logit contributions
stood-0.506
Pos-0.425
 Stard-0.413
 Os-0.410
osit-0.407
Token/
Token ID14
Feature activation-4.34e-01

Pos logit contributions
 AMER+2.386
heastern+2.258
uble+2.252
utz+2.233
 rug+2.162
Neg logit contributions
stood-3.635
Pos-3.217
 Stard-3.183
 ageing-3.183
osit-3.164
TokenW
Token ID54
Feature activation+3.98e-01

Pos logit contributions
 AMER+1.753
heastern+1.722
uble+1.703
utz+1.687
 rug+1.655
Neg logit contributions
stood-1.378
Pos-1.165
 Stard-1.154
 Os-1.139
osit-1.127
Token8
Token ID23
Feature activation+7.09e-01

Pos logit contributions
stood+1.280
Pos+1.113
osit+1.063
 Os+1.060
 Stard+1.051
Neg logit contributions
 AMER-1.610
heastern-1.558
uble-1.525
 rug-1.510
utz-1.501
Tokenv
Token ID85
Feature activation+9.89e-01

Pos logit contributions
stood+2.314
Pos+2.035
osit+1.940
 Os+1.931
 Stard+1.908
Neg logit contributions
 AMER-2.824
heastern-2.721
uble-2.655
 rug-2.640
utz-2.608
TokenX
Token ID55
Feature activation+9.67e-01

Pos logit contributions
stood+3.069
Pos+2.715
osit+2.571
 Os+2.546
 Stard+2.495
Neg logit contributions
 AMER-4.051
heastern-3.906
 rug-3.796
uble-3.784
utz-3.726
TokenDs
Token ID30832
Feature activation+9.54e-01

Pos logit contributions
stood+3.152
Pos+2.798
osit+2.655
 Os+2.627
 Stard+2.581
Neg logit contributions
 AMER-3.831
heastern-3.669
 rug-3.569
uble-3.558
utz-3.496
Token4
Token ID19
Feature activation+8.52e-01

Pos logit contributions
stood+3.016
Pos+2.669
osit+2.525
 Os+2.496
 Stard+2.459
Neg logit contributions
 AMER-3.880
heastern-3.726
 rug-3.629
uble-3.610
utz-3.555
Tokenj
Token ID73
Feature activation+8.57e-01

Pos logit contributions
stood+2.795
Pos+2.475
osit+2.353
 Os+2.332
 Stard+2.301
Neg logit contributions
 AMER-3.366
heastern-3.235
uble-3.146
 rug-3.137
utz-3.086
Token0
Token ID15
Feature activation+7.27e-01

Pos logit contributions
stood+2.676
Pos+2.368
osit+2.239
 Os+2.219
 Stard+2.181
Neg logit contributions
 AMER-3.514
heastern-3.387
 rug-3.290
uble-3.288
utz-3.236
Tokeno
Token ID78
Feature activation+5.81e-01

Pos logit contributions
stood+2.355
Pos+2.077
osit+1.976
 Os+1.966
 Stard+1.935
Neg logit contributions
 AMER-2.909
heastern-2.806
uble-2.737
 rug-2.720
utz-2.686
Token the
Token ID262
Feature activation+1.52e-02

Pos logit contributions
stood+0.559
Pos+0.486
 Os+0.473
 Stard+0.472
osit+0.468
Neg logit contributions
 AMER-0.583
heastern-0.569
uble-0.556
utz-0.550
 rug-0.542
Token first
Token ID717
Feature activation+2.88e-02

Pos logit contributions
stood+0.060
Pos+0.052
 Stard+0.051
 Os+0.051
osit+0.051
Neg logit contributions
 AMER-0.050
heastern-0.049
uble-0.048
utz-0.048
 rug-0.047
Token time
Token ID640
Feature activation+8.14e-01

Pos logit contributions
stood+0.087
Pos+0.073
 Stard+0.071
 Os+0.071
osit+0.070
Neg logit contributions
 AMER-0.122
heastern-0.120
uble-0.118
utz-0.116
 rug-0.115
Token,
Token ID11
Feature activation+1.13e-01

Pos logit contributions
stood+2.119
Pos+1.825
 Os+1.792
 Stard+1.748
 Kak+1.709
Neg logit contributions
 AMER-3.630
heastern-3.540
uble-3.506
utz-3.462
 rug-3.446
Token Chang
Token ID22597
Feature activation+8.67e-02

Pos logit contributions
stood+0.380
Pos+0.329
 Stard+0.321
 Os+0.321
osit+0.317
Neg logit contributions
 AMER-0.435
heastern-0.426
uble-0.419
utz-0.413
 rug-0.408
Token and
Token ID290
Feature activation+1.44e-01

Pos logit contributions
stood+0.289
Pos+0.251
 Os+0.246
 Stard+0.244
osit+0.242
Neg logit contributions
 AMER-0.338
heastern-0.331
uble-0.323
 rug-0.317
utz-0.317
Token colleagues
Token ID7810
Feature activation-3.89e-01

Pos logit contributions
stood+0.485
Pos+0.418
 Os+0.417
 Stard+0.416
osit+0.405
Neg logit contributions
 AMER-0.553
heastern-0.542
uble-0.533
utz-0.524
 rug-0.517
Token also
Token ID635
Feature activation+1.29e-01

Pos logit contributions
 AMER+1.363
heastern+1.323
uble+1.299
utz+1.288
 rug+1.275
Neg logit contributions
stood-1.457
Pos-1.283
 Stard-1.249
 Os-1.237
osit-1.237
Token recorded
Token ID6264
Feature activation-4.64e-01

Pos logit contributions
stood+0.493
Pos+0.432
 Stard+0.423
 Os+0.421
osit+0.420
Neg logit contributions
 AMER-0.445
heastern-0.433
uble-0.425
utz-0.419
 rug-0.413
Token and
Token ID290
Feature activation-2.10e-01

Pos logit contributions
 AMER+1.739
heastern+1.692
uble+1.665
utz+1.645
 rug+1.632
Neg logit contributions
stood-1.626
Pos-1.409
 Stard-1.377
 Os-1.363
osit-1.358
Token analyzed
Token ID15475
Feature activation-7.56e-01

Pos logit contributions
 AMER+0.791
heastern+0.771
uble+0.758
utz+0.749
 rug+0.743
Neg logit contributions
stood-0.730
Pos-0.636
 Stard-0.618
 Os-0.614
osit-0.613
Token usual
Token ID6678
Feature activation-9.32e-01

Pos logit contributions
 AMER+1.873
heastern+1.811
uble+1.778
utz+1.761
 rug+1.738
Neg logit contributions
stood-2.234
Pos-1.984
 Stard-1.943
osit-1.929
 Os-1.913
Token,
Token ID11
Feature activation-4.98e-01

Pos logit contributions
 AMER+3.660
heastern+3.495
utz+3.482
uble+3.479
 rug+3.457
Neg logit contributions
stood-3.012
Pos-2.571
osit-2.511
 Stard-2.495
 ageing-2.471
Token with
Token ID351
Feature activation-3.06e-01

Pos logit contributions
 AMER+1.830
heastern+1.777
uble+1.752
utz+1.737
 rug+1.724
Neg logit contributions
stood-1.770
Pos-1.539
 Stard-1.495
osit-1.483
 Os-1.481
Token berries
Token ID36322
Feature activation-1.25e-01

Pos logit contributions
 AMER+0.845
heastern+0.813
uble+0.801
utz+0.778
 rug+0.776
Neg logit contributions
stood-1.373
Pos-1.239
 Stard-1.213
osit-1.206
 Os-1.203
Token,
Token ID11
Feature activation-3.66e-02

Pos logit contributions
 AMER+0.537
heastern+0.523
uble+0.517
utz+0.512
 rug+0.507
Neg logit contributions
stood-0.370
Pos-0.312
 Stard-0.302
 Os-0.300
osit-0.299
Token but
Token ID475
Feature activation+1.87e-02

Pos logit contributions
 AMER+0.132
heastern+0.129
uble+0.126
utz+0.125
 rug+0.124
Neg logit contributions
stood-0.133
Pos-0.116
 Stard-0.114
 Os-0.113
osit-0.113
Token not
Token ID407
Feature activation-6.44e-02

Pos logit contributions
stood+0.071
Pos+0.062
 Stard+0.061
 Os+0.060
osit+0.060
Neg logit contributions
 AMER-0.065
heastern-0.064
uble-0.062
utz-0.062
 rug-0.061
Token too
Token ID1165
Feature activation-8.18e-01

Pos logit contributions
 AMER+0.223
heastern+0.216
uble+0.213
utz+0.210
 rug+0.206
Neg logit contributions
stood-0.244
Pos-0.214
 Os-0.210
 Stard-0.210
osit-0.208
Token many
Token ID867
Feature activation-9.37e-02

Pos logit contributions
 AMER+2.589
heastern+2.513
uble+2.461
utz+2.425
 rug+2.399
Neg logit contributions
stood-3.343
Pos-2.968
 Stard-2.905
 Os-2.890
osit-2.882
Token)
Token ID8
Feature activation-4.98e-01

Pos logit contributions
 AMER+0.325
heastern+0.317
uble+0.310
utz+0.306
 rug+0.303
Neg logit contributions
stood-0.355
Pos-0.312
 Stard-0.305
 Os-0.303
osit-0.302
Token 200
Token ID939
Feature activation-2.55e-01

Pos logit contributions
 AMER+1.909
heastern+1.846
uble+1.833
utz+1.788
 rug+1.787
Neg logit contributions
stood-1.698
Pos-1.482
 Stard-1.417
osit-1.413
 Os-1.402
Token.
Token ID13
Feature activation-2.17e-01

Pos logit contributions
 AMER+0.946
heastern+0.920
uble+0.907
utz+0.897
 rug+0.892
Neg logit contributions
stood-0.874
Pos-0.763
 Stard-0.744
 Os-0.740
osit-0.738
Token PT
Token ID19310
Feature activation-2.23e-01

Pos logit contributions
 AMER+0.699
heastern+0.680
uble+0.669
utz+0.660
 rug+0.653
Neg logit contributions
stood-0.868
Pos-0.771
 Stard-0.756
 Os-0.752
osit-0.749
Token on
Token ID319
Feature activation+5.36e-02

Pos logit contributions
 AMER+0.867
heastern+0.847
uble+0.834
utz+0.824
 rug+0.818
Neg logit contributions
stood-0.750
Pos-0.649
 Stard-0.635
 Os-0.630
osit-0.627
Token June
Token ID2795
Feature activation+2.49e-02

Pos logit contributions
stood+0.219
Pos+0.194
 Stard+0.191
 Os+0.190
osit+0.189
Neg logit contributions
 AMER-0.169
heastern-0.166
uble-0.162
utz-0.160
 rug-0.158
Token 26
Token ID2608
Feature activation+3.57e-01

Pos logit contributions
stood+0.096
Pos+0.085
 Stard+0.083
 Os+0.082
osit+0.082
Neg logit contributions
 AMER-0.085
heastern-0.082
uble-0.081
utz-0.079
 rug-0.079
Token.
Token ID13
Feature activation+3.15e-01

Pos logit contributions
stood+1.105
Pos+0.960
 Stard+0.947
 Os+0.934
osit+0.929
Neg logit contributions
 AMER-1.441
heastern-1.416
uble-1.381
utz-1.372
 rug-1.372
Token\n
Token ID198
Feature activation+1.63e-02

Pos logit contributions
stood+1.136
Pos+1.002
 Stard+0.988
 Os+0.983
osit+0.965
Neg logit contributions
heastern-1.131
 AMER-1.131
uble-1.100
utz-1.090
 rug-1.074
Token\n
Token ID198
Feature activation+1.04e-01

Pos logit contributions
stood+0.059
Pos+0.051
 Stard+0.050
 Os+0.050
osit+0.049
Neg logit contributions
 AMER-0.060
heastern-0.058
uble-0.057
utz-0.056
 rug-0.056
TokenThanks
Token ID9690
Feature activation-8.79e-02

Pos logit contributions
stood+0.355
Pos+0.308
 Stard+0.299
 Os+0.298
osit+0.297
Neg logit contributions
 AMER-0.401
heastern-0.392
uble-0.385
utz-0.380
 rug-0.377
Token,
Token ID11
Feature activation+1.56e-01

Pos logit contributions
 AMER+0.373
heastern+0.366
uble+0.360
utz+0.357
 rug+0.352
Neg logit contributions
stood-0.263
Pos-0.224
 Stard-0.219
 Os-0.217
osit-0.213
Token Wii
Token ID16591
Feature activation+2.55e-01

Pos logit contributions
stood+0.512
Pos+0.445
 Stard+0.442
 Os+0.440
osit+0.425
Neg logit contributions
 AMER-0.613
heastern-0.607
uble-0.598
utz-0.589
 rug-0.580
Token of
Token ID286
Feature activation-2.98e-01

Pos logit contributions
 AMER+0.237
heastern+0.232
uble+0.229
utz+0.225
 rug+0.222
Neg logit contributions
stood-0.196
Pos-0.169
 Stard-0.164
 Os-0.164
osit-0.162
Token the
Token ID262
Feature activation-1.77e-01

Pos logit contributions
 AMER+1.180
heastern+1.152
uble+1.133
utz+1.120
 rug+1.108
Neg logit contributions
stood-0.985
Pos-0.847
 Stard-0.825
 Os-0.822
osit-0.816
Token utility
Token ID10361
Feature activation+3.69e-02

Pos logit contributions
 AMER+0.595
heastern+0.578
uble+0.567
utz+0.559
 rug+0.553
Neg logit contributions
stood-0.687
Pos-0.606
 Stard-0.593
 Os-0.590
osit-0.587
Token's
Token ID338
Feature activation+2.08e-01

Pos logit contributions
stood+0.126
Pos+0.109
 Os+0.106
 Stard+0.106
osit+0.104
Neg logit contributions
 AMER-0.142
heastern-0.139
uble-0.136
utz-0.134
 rug-0.132
Token finances
Token ID20903
Feature activation+5.11e-02

Pos logit contributions
stood+0.784
Pos+0.687
 Stard+0.674
 Os+0.673
osit+0.665
Neg logit contributions
 AMER-0.728
heastern-0.711
uble-0.697
utz-0.686
 rug-0.675
Token.
Token ID13
Feature activation+9.00e-02

Pos logit contributions
stood+0.170
Pos+0.146
 Stard+0.142
 Os+0.142
osit+0.139
Neg logit contributions
 AMER-0.202
heastern-0.197
uble-0.194
utz-0.191
 rug-0.189
Token\n
Token ID198
Feature activation-2.31e-01

Pos logit contributions
stood+0.315
Pos+0.276
 Stard+0.272
 Os+0.270
osit+0.265
Neg logit contributions
 AMER-0.336
heastern-0.331
uble-0.325
utz-0.319
 rug-0.317
Token\n
Token ID198
Feature activation-1.08e-01

Pos logit contributions
 AMER+0.819
heastern+0.796
uble+0.784
utz+0.774
 rug+0.764
Neg logit contributions
stood-0.853
Pos-0.746
 Stard-0.730
 Os-0.725
osit-0.723
TokenR
Token ID49
Feature activation-7.19e-01

Pos logit contributions
 AMER+0.432
heastern+0.422
uble+0.416
utz+0.412
 rug+0.407
Neg logit contributions
stood-0.353
Pos-0.301
 Stard-0.295
 Os-0.293
osit-0.292
Tokenising
Token ID1710
Feature activation-5.86e-01

Pos logit contributions
 AMER+2.159
heastern+2.141
uble+2.136
utz+2.125
 rug+1.989
Neg logit contributions
stood-2.976
Pos-2.648
 Stard-2.627
 Os-2.601
 ageing-2.598
Token costs
Token ID3484
Feature activation+1.78e-01

Pos logit contributions
 AMER+2.233
heastern+2.160
uble+2.123
utz+2.105
 rug+2.098
Neg logit contributions
stood-2.000
Pos-1.741
 Stard-1.690
osit-1.685
 Os-1.662
Token traveler
Token ID40168
Feature activation-2.76e-01

Pos logit contributions
 AMER+0.284
heastern+0.277
uble+0.272
utz+0.269
 rug+0.266
Neg logit contributions
stood-0.277
Pos-0.241
 Stard-0.235
 Os-0.234
osit-0.233
Token,
Token ID11
Feature activation+3.31e-02

Pos logit contributions
 AMER+1.078
heastern+1.052
uble+1.034
utz+1.024
 rug+1.017
Neg logit contributions
stood-0.927
Pos-0.795
 Stard-0.776
 Os-0.769
osit-0.768
Token
Token ID447
Feature activation+9.43e-02

Pos logit contributions
stood+0.111
Pos+0.096
 Stard+0.093
 Os+0.093
osit+0.092
Neg logit contributions
 AMER-0.129
heastern-0.126
uble-0.124
utz-0.122
 rug-0.121
Token
Token ID251
Feature activation+2.48e-01

Pos logit contributions
stood+0.353
Pos+0.313
 Os+0.304
 Stard+0.302
osit+0.301
Neg logit contributions
 AMER-0.328
heastern-0.321
uble-0.314
utz-0.308
 rug-0.304
Token he
Token ID339
Feature activation-2.06e-01

Pos logit contributions
stood+0.827
Pos+0.719
 Stard+0.699
 Os+0.698
osit+0.690
Neg logit contributions
 AMER-0.966
heastern-0.948
uble-0.933
utz-0.917
 rug-0.910
Token told
Token ID1297
Feature activation+2.41e-01

Pos logit contributions
 AMER+0.808
heastern+0.789
uble+0.775
utz+0.768
 rug+0.759
Neg logit contributions
stood-0.690
Pos-0.594
 Stard-0.578
 Os-0.577
osit-0.572
Token Fox
Token ID5426
Feature activation-7.36e-01

Pos logit contributions
stood+0.881
Pos+0.761
 Os+0.750
 Stard+0.749
osit+0.737
Neg logit contributions
 AMER-0.856
heastern-0.850
uble-0.836
utz-0.820
 rug-0.810
Token News
Token ID3000
Feature activation-1.97e-01

Pos logit contributions
 AMER+2.596
heastern+2.529
uble+2.485
utz+2.451
 rug+2.426
Neg logit contributions
stood-2.741
Pos-2.403
 Stard-2.347
 Os-2.338
osit-2.324
Token.
Token ID13
Feature activation+1.39e-02

Pos logit contributions
 AMER+0.753
heastern+0.737
uble+0.726
utz+0.717
 rug+0.711
Neg logit contributions
stood-0.673
Pos-0.584
 Stard-0.569
 Os-0.568
osit-0.562
Token The
Token ID383
Feature activation+2.46e-02

Pos logit contributions
stood+0.049
Pos+0.043
 Stard+0.042
 Os+0.042
osit+0.042
Neg logit contributions
 AMER-0.051
heastern-0.050
uble-0.049
utz-0.048
 rug-0.048
Token policy
Token ID2450
Feature activation-6.89e-01

Pos logit contributions
stood+0.089
Pos+0.078
 Stard+0.076
 Os+0.076
osit+0.076
Neg logit contributions
 AMER-0.089
heastern-0.087
uble-0.086
utz-0.084
 rug-0.083
Token working
Token ID1762
Feature activation-4.89e-01

Pos logit contributions
 AMER+0.594
heastern+0.575
uble+0.567
utz+0.559
 rug+0.556
Neg logit contributions
stood-0.575
Pos-0.499
 Stard-0.486
osit-0.484
 Os-0.482
Token with
Token ID351
Feature activation-7.08e-01

Pos logit contributions
 AMER+1.930
heastern+1.884
uble+1.853
utz+1.831
 rug+1.814
Neg logit contributions
stood-1.618
Pos-1.392
 Stard-1.355
 Os-1.347
osit-1.341
Token Nerd
Token ID44031
Feature activation-4.51e-01

Pos logit contributions
 AMER+2.661
heastern+2.555
uble+2.530
utz+2.497
 rug+2.492
Neg logit contributions
stood-2.479
Pos-2.145
osit-2.089
 Stard-2.051
 Os-2.050
Token Kingdom
Token ID7526
Feature activation+2.44e-01

Pos logit contributions
 AMER+1.808
heastern+1.764
uble+1.734
utz+1.717
 rug+1.699
Neg logit contributions
stood-1.466
Pos-1.257
 Stard-1.222
 Os-1.214
osit-1.209
Token to
Token ID284
Feature activation+1.06e-01

Pos logit contributions
stood+0.818
Pos+0.715
 Stard+0.712
 Os+0.703
osit+0.688
Neg logit contributions
 AMER-0.934
heastern-0.924
uble-0.902
utz-0.890
 rug-0.876
Token try
Token ID1949
Feature activation-8.76e-01

Pos logit contributions
stood+0.442
Pos+0.394
 Stard+0.390
 Os+0.389
osit+0.384
Neg logit contributions
 AMER-0.325
heastern-0.318
uble-0.308
utz-0.308
 rug-0.297
Token and
Token ID290
Feature activation+3.68e-01

Pos logit contributions
 AMER+3.401
heastern+3.316
uble+3.258
utz+3.225
 rug+3.200
Neg logit contributions
stood-2.953
Pos-2.529
 Stard-2.457
osit-2.450
 Os-2.449
Token match
Token ID2872
Feature activation-3.14e-01

Pos logit contributions
stood+1.483
Pos+1.322
 Os+1.313
 Stard+1.312
osit+1.290
Neg logit contributions
 AMER-1.155
heastern-1.137
utz-1.103
uble-1.098
 rug-1.050
Token them
Token ID606
Feature activation+1.76e-01

Pos logit contributions
 AMER+1.106
heastern+1.075
uble+1.055
utz+1.042
 rug+1.032
Neg logit contributions
stood-1.170
Pos-1.025
 Stard-0.998
 Os-0.994
osit-0.992
Token as
Token ID355
Feature activation-1.40e-01

Pos logit contributions
stood+0.604
Pos+0.529
 Stard+0.519
 Os+0.518
osit+0.506
Neg logit contributions
 AMER-0.665
heastern-0.653
uble-0.641
utz-0.634
 rug-0.624
Token closely
Token ID7173
Feature activation-2.94e-01

Pos logit contributions
 AMER+0.467
heastern+0.456
uble+0.447
utz+0.441
 rug+0.434
Neg logit contributions
stood-0.544
Pos-0.481
 Stard-0.471
 Os-0.470
osit-0.465
Token I
Token ID314
Feature activation-3.93e-01

Pos logit contributions
 AMER+0.815
heastern+0.792
uble+0.780
utz+0.771
 rug+0.765
Neg logit contributions
stood-0.785
Pos-0.684
 Stard-0.664
osit-0.663
 Os-0.660
Token decided
Token ID3066
Feature activation-2.08e-01

Pos logit contributions
 AMER+1.398
heastern+1.345
uble+1.339
 rug+1.338
utz+1.328
Neg logit contributions
stood-1.412
Pos-1.266
 Stard-1.220
 Os-1.213
osit-1.210
Token to
Token ID284
Feature activation-5.73e-01

Pos logit contributions
 AMER+0.717
heastern+0.697
uble+0.684
utz+0.675
 rug+0.667
Neg logit contributions
stood-0.791
Pos-0.696
 Stard-0.682
 Os-0.678
osit-0.675
Token also
Token ID635
Feature activation-6.21e-01

Pos logit contributions
 AMER+1.769
heastern+1.696
uble+1.678
 rug+1.656
utz+1.643
Neg logit contributions
stood-2.388
Pos-2.129
 Stard-2.059
osit-2.052
 Os-2.043
Token try
Token ID1949
Feature activation-1.17e+00

Pos logit contributions
 AMER+1.880
heastern+1.795
uble+1.779
 rug+1.759
utz+1.742
Neg logit contributions
stood-2.628
Pos-2.349
 Stard-2.271
osit-2.255
 Os-2.252
Token running
Token ID2491
Feature activation-4.47e-01

Pos logit contributions
 AMER+4.254
heastern+4.080
uble+4.004
utz+4.002
 rug+3.989
Neg logit contributions
stood-4.234
Pos-3.654
osit-3.564
 Stard-3.508
 Os-3.486
Token a
Token ID257
Feature activation+1.75e-01

Pos logit contributions
 AMER+1.789
heastern+1.744
uble+1.716
utz+1.698
 rug+1.684
Neg logit contributions
stood-1.455
Pos-1.247
 Stard-1.211
 Os-1.202
osit-1.202
Token complex
Token ID3716
Feature activation-2.03e-01

Pos logit contributions
stood+0.599
Pos+0.522
 Stard+0.512
 Os+0.509
osit+0.502
Neg logit contributions
 AMER-0.664
heastern-0.648
uble-0.638
utz-0.630
 rug-0.617
Token application
Token ID3586
Feature activation-1.55e-02

Pos logit contributions
 AMER+0.785
heastern+0.766
uble+0.754
utz+0.745
 rug+0.736
Neg logit contributions
stood-0.688
Pos-0.595
 Stard-0.580
 Os-0.577
osit-0.573
Token bundled
Token ID34358
Feature activation+4.32e-01

Pos logit contributions
 AMER+0.060
heastern+0.059
uble+0.058
utz+0.057
 rug+0.056
Neg logit contributions
stood-0.052
Pos-0.045
 Stard-0.045
 Os-0.044
osit-0.044
Token as
Token ID355
Feature activation-4.10e-01

Pos logit contributions
stood+1.411
Pos+1.225
 Stard+1.219
 Os+1.218
 eval+1.184
Neg logit contributions
 AMER-1.685
heastern-1.647
utz-1.606
uble-1.603
 rug-1.561
Token-
Token ID12
Feature activation+2.10e-01

Pos logit contributions
stood+0.486
Pos+0.412
 Stard+0.401
 Os+0.399
osit+0.396
Neg logit contributions
 AMER-0.641
heastern-0.628
uble-0.619
utz-0.616
 rug-0.604
Tokenworld
Token ID6894
Feature activation+7.10e-01

Pos logit contributions
stood+0.591
Pos+0.494
osit+0.472
 Stard+0.469
 Os+0.469
Neg logit contributions
 AMER-0.939
heastern-0.918
uble-0.911
utz-0.896
 rug-0.893
Token problem
Token ID1917
Feature activation+2.35e-01

Pos logit contributions
stood+1.792
Pos+1.438
 Os+1.411
 Stard+1.408
 eval+1.376
Neg logit contributions
 AMER-3.322
heastern-3.284
uble-3.234
utz-3.233
 rug-3.143
Token.
Token ID13
Feature activation+3.15e-01

Pos logit contributions
stood+0.782
Pos+0.676
 Stard+0.659
 Os+0.659
osit+0.650
Neg logit contributions
 AMER-0.918
heastern-0.898
uble-0.881
utz-0.873
 rug-0.860
Token They
Token ID1119
Feature activation+2.16e-01

Pos logit contributions
stood+1.046
Pos+0.914
 Stard+0.894
 Os+0.891
osit+0.878
Neg logit contributions
 AMER-1.220
heastern-1.211
uble-1.192
utz-1.169
 rug-1.160
Token worked
Token ID3111
Feature activation-2.73e-01

Pos logit contributions
stood+0.794
Pos+0.691
 Stard+0.674
 Os+0.674
osit+0.670
Neg logit contributions
 AMER-0.773
heastern-0.755
uble-0.743
utz-0.730
 rug-0.720
Token well
Token ID880
Feature activation-3.50e-01

Pos logit contributions
 AMER+0.986
heastern+0.964
uble+0.949
utz+0.934
 rug+0.921
Neg logit contributions
stood-0.993
Pos-0.866
 Stard-0.847
 Os-0.845
osit-0.836
Token together
Token ID1978
Feature activation-2.53e-02

Pos logit contributions
 AMER+1.368
heastern+1.336
uble+1.314
utz+1.299
 rug+1.284
Neg logit contributions
stood-1.170
Pos-1.010
 Stard-0.983
 Os-0.978
osit-0.972
Token and
Token ID290
Feature activation+5.03e-02

Pos logit contributions
 AMER+0.102
heastern+0.099
uble+0.098
utz+0.097
 rug+0.096
Neg logit contributions
stood-0.081
Pos-0.070
 Stard-0.068
 Os-0.068
osit-0.067
Token never
Token ID1239
Feature activation+1.39e-01

Pos logit contributions
stood+0.184
Pos+0.160
 Stard+0.157
 Os+0.156
osit+0.155
Neg logit contributions
 AMER-0.181
heastern-0.177
uble-0.174
utz-0.171
 rug-0.169
Token gave
Token ID2921
Feature activation+4.66e-01

Pos logit contributions
stood+0.476
Pos+0.409
 Stard+0.400
 Os+0.398
osit+0.395
Neg logit contributions
 AMER-0.537
heastern-0.525
uble-0.517
utz-0.507
 rug-0.499
Token it
Token ID340
Feature activation-4.59e-02

Pos logit contributions
stood+0.150
Pos+0.131
 Stard+0.127
 Os+0.127
osit+0.126
Neg logit contributions
 AMER-0.157
heastern-0.153
uble-0.151
utz-0.149
 rug-0.147
Token
Token ID447
Feature activation-1.41e-01

Pos logit contributions
 AMER+0.187
heastern+0.183
uble+0.180
utz+0.178
 rug+0.176
Neg logit contributions
stood-0.146
Pos-0.125
 Stard-0.121
 Os-0.121
osit-0.120
Token
Token ID247
Feature activation+1.24e-01

Pos logit contributions
 AMER+0.434
heastern+0.423
uble+0.409
utz+0.404
 rug+0.400
Neg logit contributions
stood-0.587
Pos-0.521
 Os-0.509
 Stard-0.508
osit-0.506
Tokens
Token ID82
Feature activation+1.58e-01

Pos logit contributions
stood+0.483
Pos+0.425
osit+0.413
 Os+0.413
 Stard+0.413
Neg logit contributions
 AMER-0.419
heastern-0.406
uble-0.393
 rug-0.389
utz-0.388
Token important
Token ID1593
Feature activation-4.33e-02

Pos logit contributions
stood+0.548
Pos+0.476
 Stard+0.464
 Os+0.462
osit+0.459
Neg logit contributions
 AMER-0.597
heastern-0.582
uble-0.572
utz-0.565
 rug-0.559
Token to
Token ID284
Feature activation-2.46e-01

Pos logit contributions
 AMER+0.152
heastern+0.149
uble+0.145
utz+0.144
 rug+0.142
Neg logit contributions
stood-0.161
Pos-0.143
 Stard-0.139
 Os-0.139
osit-0.138
Token enable
Token ID7139
Feature activation-2.29e-01

Pos logit contributions
 AMER+0.855
heastern+0.827
uble+0.815
utz+0.802
 rug+0.797
Neg logit contributions
stood-0.935
Pos-0.820
 Stard-0.800
 Os-0.795
osit-0.794
Token them
Token ID606
Feature activation+5.22e-01

Pos logit contributions
 AMER+0.840
heastern+0.817
uble+0.803
utz+0.793
 rug+0.786
Neg logit contributions
stood-0.820
Pos-0.714
 Stard-0.696
 Os-0.693
osit-0.690
Token for
Token ID329
Feature activation+1.18e-01

Pos logit contributions
stood+1.799
Pos+1.579
 Stard+1.535
 Os+1.534
osit+1.504
Neg logit contributions
 AMER-1.953
heastern-1.916
uble-1.871
utz-1.860
 rug-1.827
Token the
Token ID262
Feature activation+3.60e-02

Pos logit contributions
stood+0.404
Pos+0.350
 Stard+0.342
 Os+0.340
osit+0.338
Neg logit contributions
 AMER-0.450
heastern-0.441
uble-0.431
utz-0.428
 rug-0.422
Token future
Token ID2003
Feature activation-2.04e-01

Pos logit contributions
stood+0.127
Pos+0.110
 Stard+0.108
 Os+0.107
osit+0.107
Neg logit contributions
 AMER-0.134
heastern-0.132
uble-0.129
utz-0.127
 rug-0.125
Token the
Token ID262
Feature activation-8.30e-02

Pos logit contributions
stood+0.068
Pos+0.059
 Stard+0.058
 Os+0.058
osit+0.057
Neg logit contributions
 AMER-0.066
heastern-0.064
uble-0.063
utz-0.062
 rug-0.061
Token only
Token ID691
Feature activation+1.86e-01

Pos logit contributions
 AMER+0.292
heastern+0.286
uble+0.281
utz+0.276
 rug+0.270
Neg logit contributions
stood-0.310
Pos-0.271
 Stard-0.266
 Os-0.265
osit-0.262
Token way
Token ID835
Feature activation+7.85e-01

Pos logit contributions
stood+0.672
Pos+0.586
 Os+0.573
 Stard+0.572
osit+0.565
Neg logit contributions
 AMER-0.673
heastern-0.661
uble-0.647
utz-0.639
 rug-0.622
Token to
Token ID284
Feature activation+4.53e-01

Pos logit contributions
stood+2.681
Pos+2.380
 Os+2.362
 Stard+2.315
 eval+2.295
Neg logit contributions
 AMER-2.837
heastern-2.819
uble-2.787
utz-2.737
 rug-2.648
Token do
Token ID466
Feature activation+3.93e-01

Pos logit contributions
stood+1.764
Pos+1.575
 Os+1.554
 Stard+1.547
 eval+1.532
Neg logit contributions
 AMER-1.478
heastern-1.463
uble-1.412
utz-1.409
 rug-1.347
Token that
Token ID326
Feature activation-2.31e-01

Pos logit contributions
stood+1.399
Pos+1.225
 Os+1.209
 Stard+1.200
osit+1.191
Neg logit contributions
 AMER-1.429
heastern-1.395
uble-1.384
utz-1.354
 rug-1.326
Token is
Token ID318
Feature activation+2.23e-01

Pos logit contributions
 AMER+1.009
heastern+0.985
uble+0.969
utz+0.962
 rug+0.956
Neg logit contributions
stood-0.669
Pos-0.559
 Stard-0.544
osit-0.538
 Os-0.537
Token to
Token ID284
Feature activation+5.08e-02

Pos logit contributions
stood+0.814
Pos+0.715
 Os+0.697
 Stard+0.696
osit+0.685
Neg logit contributions
 AMER-0.797
heastern-0.778
uble-0.766
utz-0.751
 rug-0.741
Token know
Token ID760
Feature activation-2.69e-01

Pos logit contributions
stood+0.206
Pos+0.183
 Stard+0.179
 Os+0.179
osit+0.177
Neg logit contributions
 AMER-0.162
heastern-0.158
uble-0.155
utz-0.153
 rug-0.150
Token exactly
Token ID3446
Feature activation-2.71e-01

Pos logit contributions
 AMER+0.998
heastern+0.973
uble+0.956
utz+0.944
 rug+0.934
Neg logit contributions
stood-0.952
Pos-0.828
 Stard-0.808
 Os-0.804
osit-0.800
Token who
Token ID508
Feature activation-7.74e-02

Pos logit contributions
 AMER+1.044
heastern+1.019
uble+1.003
utz+0.992
 rug+0.979
Neg logit contributions
stood-0.919
Pos-0.795
 Stard-0.774
 Os-0.771
osit-0.765
Token {
Token ID1391
Feature activation+3.83e-01

Pos logit contributions
stood+1.125
Pos+0.972
 Stard+0.949
 Os+0.946
osit+0.932
Neg logit contributions
 AMER-1.339
heastern-1.316
uble-1.284
utz-1.272
 rug-1.262
Token(
Token ID7
Feature activation+1.80e-01

Pos logit contributions
stood+1.575
Pos+1.418
 Os+1.362
 Stard+1.359
osit+1.358
Neg logit contributions
 AMER-1.217
heastern-1.187
uble-1.148
 rug-1.132
utz-1.128
Token64
Token ID2414
Feature activation+2.41e-01

Pos logit contributions
stood+0.526
Pos+0.444
 Stard+0.429
 Os+0.427
osit+0.424
Neg logit contributions
 AMER-0.783
heastern-0.767
uble-0.754
utz-0.746
 rug-0.740
Tokens
Token ID82
Feature activation+4.04e-01

Pos logit contributions
stood+0.953
Pos+0.842
 Os+0.831
 Stard+0.825
osit+0.813
Neg logit contributions
 AMER-0.798
heastern-0.781
uble-0.759
utz-0.745
 rug-0.743
Token)}
Token ID38165
Feature activation-1.29e-01

Pos logit contributions
stood+0.580
Pos+0.396
 Stard+0.364
 Os+0.358
osit+0.352
Neg logit contributions
 AMER-2.358
heastern-2.320
uble-2.290
utz-2.272
 rug-2.262
Token 24
Token ID1987
Feature activation-1.06e+00

Pos logit contributions
 AMER+0.475
heastern+0.464
uble+0.455
utz+0.450
 rug+0.444
Neg logit contributions
stood-0.461
Pos-0.401
 Stard-0.393
 Os-0.391
osit-0.387
Token.
Token ID13
Feature activation+6.52e-02

Pos logit contributions
 AMER+3.363
uble+3.224
heastern+3.211
utz+3.203
 rug+3.089
Neg logit contributions
stood-4.104
 ageing-3.638
Pos-3.576
osit-3.572
 Stard-3.535
Token Q
Token ID1195
Feature activation-1.10e-02

Pos logit contributions
stood+0.212
Pos+0.182
 Stard+0.177
 Os+0.175
osit+0.175
Neg logit contributions
 AMER-0.261
heastern-0.255
uble-0.251
utz-0.248
 rug-0.246
Tokenxc
Token ID25306
Feature activation+2.60e-02

Pos logit contributions
 AMER+0.046
heastern+0.044
uble+0.044
utz+0.043
 rug+0.043
Neg logit contributions
stood-0.034
Pos-0.029
 Stard-0.028
 Os-0.028
osit-0.028
Token3
Token ID18
Feature activation+5.08e-01

Pos logit contributions
stood+0.068
Pos+0.057
 Stard+0.054
 Os+0.054
osit+0.054
Neg logit contributions
 AMER-0.120
heastern-0.118
uble-0.116
utz-0.115
 rug-0.114
Token {
Token ID1391
Feature activation+4.68e-01

Pos logit contributions
stood+1.637
Pos+1.405
 Stard+1.374
 Os+1.374
osit+1.350
Neg logit contributions
 AMER-2.055
heastern-2.013
uble-1.965
utz-1.949
 rug-1.922
TokenG
Token ID38
Feature activation-3.39e-02

Pos logit contributions
 AMER+1.415
heastern+1.377
utz+1.334
uble+1.326
 rug+1.320
Neg logit contributions
stood-1.270
Pos-1.089
osit-1.048
 Stard-1.043
 Os-1.043
Token standards
Token ID5423
Feature activation-5.47e-01

Pos logit contributions
 AMER+0.132
heastern+0.129
uble+0.127
utz+0.125
 rug+0.124
Neg logit contributions
stood-0.114
Pos-0.098
 Stard-0.096
 Os-0.095
osit-0.095
Token,
Token ID11
Feature activation-3.04e-02

Pos logit contributions
 AMER+2.083
heastern+2.030
uble+1.989
utz+1.971
 rug+1.949
Neg logit contributions
stood-1.880
Pos-1.628
 Stard-1.583
 Os-1.577
osit-1.572
Token guaranteeing
Token ID50156
Feature activation+2.72e-02

Pos logit contributions
 AMER+0.114
heastern+0.111
uble+0.110
utz+0.108
 rug+0.107
Neg logit contributions
stood-0.106
Pos-0.092
 Stard-0.090
 Os-0.090
osit-0.089
Token the
Token ID262
Feature activation+7.18e-02

Pos logit contributions
stood+0.094
Pos+0.081
 Stard+0.079
 Os+0.079
osit+0.078
Neg logit contributions
 AMER-0.104
heastern-0.102
uble-0.100
utz-0.099
 rug-0.097
Token card
Token ID2657
Feature activation-1.06e+00

Pos logit contributions
stood+0.265
Pos+0.229
 Stard+0.226
 Os+0.225
osit+0.223
Neg logit contributions
 AMER-0.256
heastern-0.253
uble-0.247
utz-0.242
 rug-0.238
Token's
Token ID338
Feature activation+8.51e-02

Pos logit contributions
 AMER+4.094
heastern+3.949
uble+3.863
utz+3.849
 rug+3.829
Neg logit contributions
stood-3.482
Pos-3.008
osit-2.936
 Os-2.923
 Stard-2.917
Token stability
Token ID10159
Feature activation+3.92e-01

Pos logit contributions
stood+0.296
Pos+0.255
 Stard+0.251
 Os+0.249
osit+0.247
Neg logit contributions
 AMER-0.322
heastern-0.316
uble-0.311
utz-0.306
 rug-0.302
Token and
Token ID290
Feature activation+9.05e-03

Pos logit contributions
stood+1.237
Pos+1.048
 Stard+1.022
 Os+1.017
osit+1.000
Neg logit contributions
 AMER-1.604
heastern-1.572
uble-1.547
utz-1.527
 rug-1.503
Token reliability
Token ID17843
Feature activation+3.31e-01

Pos logit contributions
stood+0.030
Pos+0.026
 Stard+0.026
 Os+0.025
osit+0.025
Neg logit contributions
 AMER-0.035
heastern-0.035
uble-0.034
utz-0.034
 rug-0.033
Token under
Token ID739
Feature activation-2.67e-01

Pos logit contributions
stood+1.132
Pos+0.977
 Stard+0.956
 Os+0.951
osit+0.937
Neg logit contributions
 AMER-1.263
heastern-1.239
uble-1.218
utz-1.200
 rug-1.183
Token is
Token ID318
Feature activation+8.89e-02

Pos logit contributions
 AMER+0.638
heastern+0.618
uble+0.605
utz+0.596
 rug+0.588
Neg logit contributions
stood-0.879
Pos-0.782
 Stard-0.767
 Os-0.763
osit-0.760
Token disrupting
Token ID37915
Feature activation-5.90e-03

Pos logit contributions
stood+0.307
Pos+0.265
 Stard+0.259
 Os+0.258
osit+0.254
Neg logit contributions
 AMER-0.339
heastern-0.332
uble-0.326
utz-0.322
 rug-0.318
Token hospitality
Token ID33362
Feature activation-7.76e-01

Pos logit contributions
 AMER+0.021
heastern+0.021
uble+0.020
utz+0.020
 rug+0.020
Neg logit contributions
stood-0.022
Pos-0.019
 Stard-0.018
 Os-0.018
osit-0.018
Token.
Token ID13
Feature activation-4.31e-01

Pos logit contributions
 AMER+2.831
heastern+2.750
uble+2.690
utz+2.669
 rug+2.658
Neg logit contributions
stood-2.788
Pos-2.426
 Stard-2.360
osit-2.356
 Os-2.342
Token https
Token ID3740
Feature activation-1.10e+00

Pos logit contributions
 AMER+1.268
heastern+1.211
uble+1.185
utz+1.176
 rug+1.156
Neg logit contributions
stood-1.864
Pos-1.661
 Stard-1.626
 Os-1.618
osit-1.617
Token://
Token ID1378
Feature activation-1.10e+00

Pos logit contributions
 AMER+2.947
uble+2.797
utz+2.777
heastern+2.770
 rug+2.643
Neg logit contributions
stood-4.959
 Stard-4.385
Pos-4.383
 Os-4.317
osit-4.316
Tokenwww
Token ID2503
Feature activation-8.52e-01

Pos logit contributions
 AMER+4.068
heastern+4.037
utz+4.036
uble+4.003
 rug+3.819
Neg logit contributions
stood-3.672
Pos-3.219
 Stard-3.212
 Os-3.168
 Kak-3.154
Token.
Token ID13
Feature activation-7.52e-01

Pos logit contributions
 AMER+2.921
heastern+2.805
uble+2.803
utz+2.784
 rug+2.701
Neg logit contributions
stood-3.175
Pos-2.795
 Stard-2.758
 ageing-2.714
osit-2.713
Tokenbel
Token ID6667
Feature activation-1.18e+00

Pos logit contributions
 AMER+2.241
heastern+2.185
uble+2.135
utz+2.117
 rug+2.056
Neg logit contributions
stood-3.179
Pos-2.846
 Stard-2.813
 Os-2.793
osit-2.757
Tokenfast
Token ID7217
Feature activation-7.13e-01

Pos logit contributions
 AMER+4.041
heastern+3.942
uble+3.865
utz+3.820
 rug+3.762
Neg logit contributions
stood-4.473
Pos-3.937
 Stard-3.851
 Os-3.824
osit-3.805
Tokente
Token ID660
Feature activation+2.10e-01

Pos logit contributions
 AMER+1.980
heastern+1.904
uble+1.867
utz+1.830
 rug+1.812
Neg logit contributions
stood-3.201
Pos-2.871
 Stard-2.820
 Os-2.807
osit-2.799
Token disastrous
Token ID20756
Feature activation+4.15e-01

Pos logit contributions
 AMER+1.100
heastern+1.070
uble+1.051
utz+1.040
 rug+1.035
Neg logit contributions
stood-1.005
Pos-0.873
 Stard-0.849
osit-0.843
 Os-0.840
Token season
Token ID1622
Feature activation-1.60e-01

Pos logit contributions
stood+1.361
Pos+1.160
 Os+1.151
 Stard+1.140
osit+1.119
Neg logit contributions
 AMER-1.643
heastern-1.616
uble-1.593
utz-1.565
 rug-1.539
Token,
Token ID11
Feature activation+1.62e-03

Pos logit contributions
 AMER+0.645
heastern+0.632
uble+0.622
utz+0.614
 rug+0.605
Neg logit contributions
stood-0.515
Pos-0.442
 Os-0.434
 Stard-0.433
osit-0.424
Token he
Token ID339
Feature activation-1.99e-01

Pos logit contributions
stood+0.004
Pos+0.003
 Stard+0.003
 Os+0.003
osit+0.003
Neg logit contributions
 AMER-0.008
heastern-0.008
uble-0.008
utz-0.007
 rug-0.007
Token was
Token ID373
Feature activation+5.16e-02

Pos logit contributions
 AMER+0.771
heastern+0.751
uble+0.740
utz+0.731
 rug+0.726
Neg logit contributions
stood-0.672
Pos-0.582
 Stard-0.566
osit-0.561
 Os-0.561
Token traded
Token ID14018
Feature activation-1.15e+00

Pos logit contributions
stood+0.180
Pos+0.156
 Stard+0.152
 Os+0.152
osit+0.151
Neg logit contributions
 AMER-0.194
heastern-0.189
uble-0.186
utz-0.184
 rug-0.182
Token to
Token ID284
Feature activation-1.50e-01

Pos logit contributions
 AMER+3.998
heastern+3.866
uble+3.804
 rug+3.752
utz+3.730
Neg logit contributions
stood-4.341
Pos-3.802
osit-3.720
 Stard-3.695
 Os-3.647
Token the
Token ID262
Feature activation-7.06e-02

Pos logit contributions
 AMER+0.577
heastern+0.564
uble+0.556
utz+0.548
 rug+0.541
Neg logit contributions
stood-0.514
Pos-0.443
 Stard-0.434
 Os-0.433
osit-0.427
Token Cleveland
Token ID10306
Feature activation+4.31e-02

Pos logit contributions
 AMER+0.290
heastern+0.283
uble+0.279
utz+0.276
 rug+0.274
Neg logit contributions
stood-0.222
Pos-0.189
 Stard-0.184
 Os-0.182
osit-0.182
Token Browns
Token ID17671
Feature activation+5.25e-02

Pos logit contributions
stood+0.138
Pos+0.119
 Stard+0.115
 Os+0.115
osit+0.114
Neg logit contributions
 AMER-0.174
heastern-0.171
uble-0.168
utz-0.166
 rug-0.164
Token just
Token ID655
Feature activation+1.38e-01

Pos logit contributions
stood+0.171
Pos+0.149
 Os+0.147
 Stard+0.146
 Kak+0.142
Neg logit contributions
 AMER-0.207
heastern-0.202
uble-0.199
utz-0.197
 rug-0.193
Token:
Token ID25
Feature activation-2.64e-01

Pos logit contributions
 AMER+2.593
heastern+2.474
uble+2.432
utz+2.412
 rug+2.393
Neg logit contributions
stood-2.699
Pos-2.349
 Stard-2.297
osit-2.282
 Os-2.275
Token 04
Token ID8702
Feature activation+3.90e-02

Pos logit contributions
 AMER+0.921
heastern+0.897
uble+0.881
utz+0.869
 rug+0.859
Neg logit contributions
stood-0.995
Pos-0.873
 Stard-0.853
 Os-0.849
osit-0.845
Token30
Token ID1270
Feature activation+5.00e-02

Pos logit contributions
stood+0.141
Pos+0.124
 Stard+0.121
 Os+0.121
osit+0.119
Neg logit contributions
 AMER-0.140
heastern-0.139
uble-0.136
utz-0.134
 rug-0.132
Tokenz
Token ID89
Feature activation-5.94e-02

Pos logit contributions
stood+0.171
Pos+0.150
 Os+0.146
 Stard+0.146
osit+0.144
Neg logit contributions
 AMER-0.188
heastern-0.186
uble-0.182
utz-0.179
 rug-0.178
Token BLACK
Token ID31963
Feature activation-5.00e-01

Pos logit contributions
 AMER+0.235
heastern+0.232
uble+0.227
utz+0.224
 rug+0.222
Neg logit contributions
stood-0.194
Pos-0.168
 Stard-0.164
 Os-0.163
osit-0.161
TokenFO
Token ID6080
Feature activation-1.13e+00

Pos logit contributions
 AMER+1.780
heastern+1.692
uble+1.668
utz+1.654
 rug+1.639
Neg logit contributions
stood-1.871
Pos-1.628
 Stard-1.591
osit-1.590
 Os-1.579
TokenOT
Token ID2394
Feature activation-4.50e-01

Pos logit contributions
 AMER+4.882
heastern+4.539
utz+4.510
uble+4.508
 rug+4.399
Neg logit contributions
stood-3.489
Pos-2.766
 Stard-2.729
 ageing-2.659
osit-2.611
Token T
Token ID309
Feature activation-1.45e-01

Pos logit contributions
 AMER+1.651
heastern+1.597
uble+1.568
utz+1.551
 rug+1.537
Neg logit contributions
stood-1.622
Pos-1.412
 Stard-1.376
 Os-1.369
osit-1.368
TokenOC
Token ID4503
Feature activation-4.00e-01

Pos logit contributions
 AMER+0.550
heastern+0.539
uble+0.529
utz+0.523
 rug+0.518
Neg logit contributions
stood-0.500
Pos-0.434
 Stard-0.423
 Os-0.421
osit-0.419
Token RE
Token ID4526
Feature activation-4.97e-01

Pos logit contributions
 AMER+1.470
heastern+1.423
uble+1.396
utz+1.382
 rug+1.369
Neg logit contributions
stood-1.438
Pos-1.251
 Stard-1.220
 Os-1.213
osit-1.209
TokenP
Token ID47
Feature activation-2.80e-01

Pos logit contributions
 AMER+1.900
heastern+1.822
uble+1.807
utz+1.792
 rug+1.757
Neg logit contributions
stood-1.716
Pos-1.471
 Stard-1.445
 Os-1.430
osit-1.425