TOP ACTIVATIONS
MAX = 1.876

you have to specify " \\ : ". No other commands
are being caused by viruses circulating in animals . Phase 2
means an earlier start for elementary schools , and sets off
me later . Rated 5 out of 5 by
)" - Ver b ose # Create Bit map $ b
about Islam . It den igr ates the religion by depicting
modest dress , or even business attire . There 's very
fooled this open - minded skept ic ! One thing I
winners . S arg ent Sh river Award for
media can provide platforms to dissemin ate evidence - based injury
. 2 . Fine cut the 4 garlic cloves
sphere of influence . Current Issue View our current issue
that got him into some trouble and he made some bad
up in Prime Minister Naj ib Raz ak 's personal bank
myself an open - minded skept ic who has had experiences
and show them we mean business .' New laws
paste and mix well . Season with salt , pepper and
go eat [ at your business ].
d ) to stir up trouble in Sir sa , while
Bit map $ b mp = new - object system .

MIN ACTIVATIONS
MIN = -2.066

with Gohan ? <|endoftext|> R ents still have the potential to
games , the break f asts , the saying
city , buzzing with r icks h aws and workers to
Self described as " f ucked up Rock and Roll ,"
re i Profile Blog Joined October 2002 United States 3
pur ak ushi Profile Joined August 2012 United States 3
Big Fan Profile Blog Joined December 2010 TL AD T
them . F alling together in a cr um
heart l xp Profile Joined September 2010 United States 10
Dark horse Profile Blog Joined December 2011 United States 228
J aaaa asper Profile Blog Joined April 2012 United States 5
TB one - Profile Blog Joined November 2010 United States 21
ada ver BB Profile Blog Joined June 2009 Germany 247 34
superb . F alling water is Wright
ushi Profile Joined August 2012 United States 3 240 Posts #
K ier an Profile Blog Joined August 2011 United States 6
the region vertically . F illing the land between the isolated
of Dal K Profile Blog Joined June 2012 Canada 16 10
survivors ' skills ( f ishing , cooking , etc .),
. Follow HL TV . org 's @ l

INTERVAL 0.891 - 1.876
CONTAINS 0.480%

, first by a hydra ul ics issue with the rocket
to a close examination of whether Ross testimony to
you 're in luck . Turn on the monitor and note
are having issues using the interactive version of this visualization ,
worn out ... I just wish what she had to tell

INTERVAL -0.095 - 0.891
CONTAINS 67.403%

- The Walking Dead and An American Werewolf in London .
thereby take possession of it . The
it insults them . " The most politically engaged millennials think
t want your attention . But you
this tour was phenomenal and it was an utterly fantastic idea

INTERVAL -1.080 - -0.095
CONTAINS 31.964%

. Walton s physical contact with the Miami Police
by P oh j ola on piano , with vocal ists
Thus Massachusetts is one of the few states where the
; to compromise in pursuit of the success of our nation
Prime Minister and , if necessary , dismiss one that has

INTERVAL -2.066 - -1.080
CONTAINS 0.153%

ada ver BB Profile Blog Joined June 2009 Germany 247 34
Jacob Lew sent Congress another warning on the nation 's debt
operators , and prevent def ection between the miners from undermining
, King said via email . From
continents . F am ously , Xer xes ordered
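
The four INTERVAL ranges above look like equal-width bins spanning MIN to MAX. The short Python sketch below checks that reading. The equal-width rule is an assumption inferred from the listed numbers, not something the listing states, and the name `interval_edges` is hypothetical.

```python
def interval_edges(lo, hi, n_bins=4):
    """Return n_bins + 1 equally spaced edges from lo to hi (assumed binning rule)."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]


MIN_ACT = -2.066  # MIN from the listing above
MAX_ACT = 1.876   # MAX from the listing above

edges = interval_edges(MIN_ACT, MAX_ACT)

# Share of activations reported for each interval, lowest interval first.
shares = [0.153, 31.964, 67.403, 0.480]

# Pair consecutive edges into (left, right) bins and print them with their share.
for (left, right), share in zip(zip(edges, edges[1:]), shares):
    print(f"{left:+.4f} to {right:+.4f}: {share:.3f}%")

print(f"total share: {sum(shares):.3f}%")
```

Under this assumption the computed edges agree with the listed boundaries (-1.080, -0.095, 0.891) to within rounding, and the four shares sum to 100%. Roughly 99% of activations fall in the two middle bins, so the TOP and MIN examples are drawn from thin tails.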
Token you
Token ID345
Feature activation-1.38e-02

Pos logit contributions
 Bord+1.486
 NAS+1.391
 KE+1.208
 BDS+1.203
 law+1.178
Neg logit contributions
emet-1.680
ervatives-1.651
adan-1.622
_(-1.585
stein-1.559
Token have
Token ID423
Feature activation+1.69e-01

Pos logit contributions
emet+0.023
ervatives+0.022
adan+0.022
_(+0.022
elsius+0.021
Neg logit contributions
 Bord-0.023
 NAS-0.022
 KE-0.019
 BDS-0.019
 law-0.019
Token to
Token ID284
Feature activation-1.75e-01

Pos logit contributions
 Bord+0.253
 NAS+0.233
 BDS+0.204
 KE+0.202
 law+0.194
Neg logit contributions
emet-0.316
ervatives-0.304
adan-0.302
_(-0.298
_--0.294
Token specify
Token ID11986
Feature activation+2.15e-01

Pos logit contributions
emet+0.326
ervatives+0.316
adan+0.311
_(+0.304
elsius+0.302
Neg logit contributions
 Bord-0.262
 NAS-0.244
 KE-0.212
 BDS-0.212
 law-0.204
Token "
Token ID366
Feature activation+7.32e-01

Pos logit contributions
 Bord+0.333
 NAS+0.309
 BDS+0.270
 KE+0.268
 law+0.260
Neg logit contributions
emet-0.390
ervatives-0.377
adan-0.373
_(-0.366
stein-0.362
Token\\
Token ID6852
Feature activation+1.88e+00

Pos logit contributions
 Bord+1.372
 NAS+1.291
 BDS+1.157
 KE+1.142
 law+1.125
Neg logit contributions
emet-1.065
ervatives-1.023
adan-1.011
_(-0.983
stein-0.973
Token:
Token ID25
Feature activation+7.33e-01

Pos logit contributions
 Bord+3.112
 NAS+2.972
 BDS+2.567
 KE+2.548
 law+2.530
Neg logit contributions
emet-3.052
ervatives-2.939
adan-2.918
elsius-2.808
_(-2.787
Token".
Token ID1911
Feature activation+7.34e-01

Pos logit contributions
 Bord+1.257
 NAS+1.187
 BDS+1.047
 KE+1.035
 law+1.015
Neg logit contributions
emet-1.188
ervatives-1.148
adan-1.133
_(-1.103
stein-1.094
Token No
Token ID1400
Feature activation+2.51e-01

Pos logit contributions
 Bord+1.062
 NAS+0.986
 BDS+0.857
 KE+0.851
 law+0.817
Neg logit contributions
emet-1.412
ervatives-1.371
adan-1.353
_(-1.314
stein-1.309
Token other
Token ID584
Feature activation+2.62e-01

Pos logit contributions
 Bord+0.418
 NAS+0.392
 KE+0.343
 BDS+0.343
 law+0.336
Neg logit contributions
emet-0.428
ervatives-0.412
adan-0.405
_(-0.397
stein-0.394
Token commands
Token ID9729
Feature activation+4.12e-01

Pos logit contributions
 Bord+0.442
 NAS+0.416
 BDS+0.366
 KE+0.364
 law+0.359
Neg logit contributions
emet-0.441
ervatives-0.424
adan-0.419
_(-0.409
_--0.406
Token are
Token ID389
Feature activation-1.30e-01

Pos logit contributions
 Bord+1.019
 NAS+0.928
 KE+0.813
 BDS+0.808
 law+0.794
Neg logit contributions
emet-1.208
ervatives-1.154
adan-1.147
_(-1.132
elsius-1.113
Token being
Token ID852
Feature activation+5.98e-02

Pos logit contributions
emet+0.220
ervatives+0.212
adan+0.209
_(+0.208
_-+0.203
Neg logit contributions
 Bord-0.222
 NAS-0.204
 BDS-0.182
 KE-0.180
 law-0.178
Token caused
Token ID4073
Feature activation+4.84e-01

Pos logit contributions
 Bord+0.100
 NAS+0.091
 KE+0.080
 BDS+0.080
 law+0.079
Neg logit contributions
emet-0.104
ervatives-0.100
adan-0.099
_(-0.098
_--0.096
Token by
Token ID416
Feature activation+1.89e-01

Pos logit contributions
 Bord+0.815
 NAS+0.748
 BDS+0.666
 KE+0.663
 law+0.643
Neg logit contributions
emet-0.826
ervatives-0.790
adan-0.786
_(-0.771
_--0.760
Token viruses
Token ID20547
Feature activation-9.52e-02

Pos logit contributions
 Bord+0.292
 NAS+0.268
 BDS+0.234
 KE+0.232
 law+0.226
Neg logit contributions
emet-0.350
ervatives-0.335
adan-0.333
_(-0.331
_--0.325
Token circulating
Token ID24495
Feature activation+1.78e+00

Pos logit contributions
emet+0.172
ervatives+0.164
adan+0.164
_(+0.162
_-+0.159
Neg logit contributions
 Bord-0.151
 NAS-0.138
 BDS-0.121
 KE-0.121
 law-0.117
Token in
Token ID287
Feature activation+8.29e-02

Pos logit contributions
 Bord+2.756
 NAS+2.493
 BDS+2.164
 KE+2.163
 law+2.124
Neg logit contributions
emet-3.299
adan-3.146
ervatives-3.126
_--3.042
_(-3.041
Token animals
Token ID4695
Feature activation+6.44e-01

Pos logit contributions
 Bord+0.130
 NAS+0.119
 BDS+0.104
 KE+0.103
 law+0.100
Neg logit contributions
emet-0.152
ervatives-0.145
adan-0.144
_(-0.143
_--0.140
Token.
Token ID13
Feature activation+1.23e-01

Pos logit contributions
 Bord+1.000
 NAS+0.915
 BDS+0.798
 KE+0.797
 law+0.776
Neg logit contributions
emet-1.184
adan-1.127
ervatives-1.127
_(-1.110
_--1.094
Token Phase
Token ID18983
Feature activation+6.36e-01

Pos logit contributions
 Bord+0.184
 NAS+0.168
 BDS+0.146
 KE+0.145
 law+0.139
Neg logit contributions
emet-0.234
ervatives-0.225
adan-0.223
_(-0.219
elsius-0.218
Token 2
Token ID362
Feature activation+4.74e-02

Pos logit contributions
 Bord+1.005
 NAS+0.932
 BDS+0.825
 KE+0.813
 law+0.797
Neg logit contributions
emet-1.146
ervatives-1.094
adan-1.083
_(-1.065
_--1.049
Token means
Token ID1724
Feature activation-4.56e-01

Pos logit contributions
emet+0.574
ervatives+0.541
adan+0.540
_(+0.535
_-+0.525
Neg logit contributions
 Bord-0.558
 NAS-0.515
 BDS-0.461
 KE-0.457
 law-0.454
Token an
Token ID281
Feature activation+1.73e-01

Pos logit contributions
emet+0.833
ervatives+0.796
adan+0.792
_(+0.783
_-+0.769
Neg logit contributions
 Bord-0.710
 NAS-0.655
 BDS-0.578
 KE-0.569
 law-0.557
Token earlier
Token ID2961
Feature activation+2.78e-01

Pos logit contributions
 Bord+0.257
 NAS+0.236
 BDS+0.206
 KE+0.203
 law+0.197
Neg logit contributions
emet-0.328
ervatives-0.315
adan-0.312
_(-0.310
_--0.305
Token start
Token ID923
Feature activation+1.83e-01

Pos logit contributions
 Bord+0.447
 NAS+0.420
 BDS+0.369
 law+0.363
 KE+0.362
Neg logit contributions
emet-0.491
ervatives-0.476
adan-0.465
_(-0.462
_--0.457
Token for
Token ID329
Feature activation-1.18e-02

Pos logit contributions
 Bord+0.298
 NAS+0.276
 BDS+0.245
 KE+0.241
 law+0.239
Neg logit contributions
emet-0.318
ervatives-0.304
adan-0.302
_(-0.300
_--0.295
Token elementary
Token ID19823
Feature activation+1.66e+00

Pos logit contributions
emet+0.020
ervatives+0.019
adan+0.019
_(+0.019
_-+0.019
Neg logit contributions
 Bord-0.020
 NAS-0.018
 BDS-0.016
 KE-0.016
 law-0.016
Token schools
Token ID4266
Feature activation+4.51e-01

Pos logit contributions
 Bord+2.713
 NAS+2.525
 law+2.229
 KE+2.225
 BDS+2.224
Neg logit contributions
emet-2.901
ervatives-2.748
adan-2.720
_(-2.694
_--2.674
Token,
Token ID11
Feature activation+8.91e-02

Pos logit contributions
 Bord+0.753
 NAS+0.698
 BDS+0.617
 KE+0.611
 law+0.602
Neg logit contributions
emet-0.775
ervatives-0.730
adan-0.729
_(-0.721
_--0.709
Token and
Token ID290
Feature activation-1.82e-02

Pos logit contributions
 Bord+0.153
 NAS+0.142
 BDS+0.126
 KE+0.126
 law+0.123
Neg logit contributions
emet-0.151
ervatives-0.146
adan-0.144
_(-0.141
elsius-0.139
Token sets
Token ID5621
Feature activation-1.09e-02

Pos logit contributions
emet+0.032
ervatives+0.031
adan+0.030
_(+0.030
_-+0.029
Neg logit contributions
 Bord-0.030
 NAS-0.028
 BDS-0.024
 KE-0.024
 law-0.024
Token off
Token ID572
Feature activation+5.61e-01

Pos logit contributions
emet+0.019
ervatives+0.018
adan+0.018
_(+0.018
stein+0.017
Neg logit contributions
 Bord-0.018
 NAS-0.017
 BDS-0.015
 KE-0.015
 law-0.015
Token me
Token ID502
Feature activation+1.75e-01

Pos logit contributions
 Bord+0.659
 NAS+0.615
 BDS+0.541
 KE+0.540
 law+0.523
Neg logit contributions
emet-0.763
ervatives-0.733
adan-0.725
_(-0.717
_--0.706
Token later
Token ID1568
Feature activation-1.70e-01

Pos logit contributions
 Bord+0.264
 NAS+0.246
 BDS+0.215
 KE+0.213
 law+0.208
Neg logit contributions
emet-0.326
ervatives-0.313
adan-0.308
_(-0.304
_--0.301
Token.
Token ID13
Feature activation+2.34e-01

Pos logit contributions
emet+0.265
ervatives+0.252
adan+0.250
_(+0.246
_-+0.242
Neg logit contributions
 Bord-0.308
 NAS-0.288
 BDS-0.259
 KE-0.256
 law-0.249
Token
Token ID198
Feature activation+1.00e-01

Pos logit contributions
 Bord+0.374
 NAS+0.346
 BDS+0.307
 KE+0.303
 law+0.289
Neg logit contributions
emet-0.419
ervatives-0.401
adan-0.398
_(-0.389
elsius-0.386
Token
Token ID198
Feature activation-6.11e-02

Pos logit contributions
 Bord+0.161
 NAS+0.151
 BDS+0.133
 KE+0.131
 law+0.126
Neg logit contributions
emet-0.177
ervatives-0.169
adan-0.168
_(-0.164
elsius-0.163
TokenRated
Token ID15322
Feature activation+1.57e+00

Pos logit contributions
emet+0.088
ervatives+0.083
adan+0.082
_(+0.080
_-+0.079
Neg logit contributions
 Bord-0.118
 NAS-0.111
 BDS-0.100
 KE-0.099
 law-0.096
Token 5
Token ID642
Feature activation-1.73e-01

Pos logit contributions
 Bord+2.332
 NAS+2.150
 BDS+1.842
 KE+1.823
 law+1.775
Neg logit contributions
emet-3.055
adan-2.891
ervatives-2.888
elsius-2.834
_(-2.790
Token out
Token ID503
Feature activation-4.33e-01

Pos logit contributions
emet+0.316
ervatives+0.300
adan+0.299
_-+0.291
_(+0.290
Neg logit contributions
 Bord-0.275
 NAS-0.255
 BDS-0.224
 KE-0.223
 law-0.219
Token of
Token ID286
Feature activation-1.85e-01

Pos logit contributions
emet+0.820
ervatives+0.782
adan+0.777
elsius+0.770
avorite+0.762
Neg logit contributions
 Bord-0.656
 NAS-0.611
 BDS-0.533
 KE-0.525
 law-0.505
Token 5
Token ID642
Feature activation+1.26e-02

Pos logit contributions
emet+0.341
ervatives+0.323
adan+0.323
elsius+0.316
_-+0.315
Neg logit contributions
 Bord-0.296
 NAS-0.275
 BDS-0.243
 KE-0.238
 law-0.231
Token by
Token ID416
Feature activation-3.44e-02

Pos logit contributions
 Bord+0.021
 NAS+0.019
 BDS+0.017
 KE+0.017
 LB+0.016
Neg logit contributions
emet-0.023
ervatives-0.022
adan-0.022
avorite-0.021
elsius-0.021
Token)"
Token ID16725
Feature activation+2.75e-02

Pos logit contributions
emet+1.660
ervatives+1.590
adan+1.580
_(+1.575
_-+1.553
Neg logit contributions
 Bord-1.325
 NAS-1.213
 BDS-1.064
 KE-1.050
 law-1.008
Token -
Token ID532
Feature activation+4.48e-01

Pos logit contributions
 Bord+0.047
 NAS+0.044
 BDS+0.039
 KE+0.039
 law+0.038
Neg logit contributions
emet-0.045
ervatives-0.044
adan-0.043
_(-0.042
stein-0.041
TokenVer
Token ID13414
Feature activation+1.13e-01

Pos logit contributions
 Bord+0.953
 NAS+0.904
 BDS+0.819
 KE+0.810
 law+0.796
Neg logit contributions
emet-0.547
ervatives-0.530
adan-0.514
_(-0.496
stein-0.491
Tokenb
Token ID65
Feature activation+1.86e-01

Pos logit contributions
 Bord+0.192
 NAS+0.179
 BDS+0.158
 KE+0.156
 law+0.153
Neg logit contributions
emet-0.188
ervatives-0.179
adan-0.174
_(-0.174
avorite-0.172
Tokenose
Token ID577
Feature activation+3.67e-01

Pos logit contributions
 Bord+0.226
 NAS+0.209
 KE+0.182
 BDS+0.173
 law+0.173
Neg logit contributions
emet-0.400
adan-0.380
ervatives-0.377
avorite-0.374
_--0.368
Token #
Token ID1303
Feature activation+1.55e+00

Pos logit contributions
 Bord+0.638
 NAS+0.598
 BDS+0.530
 KE+0.529
 law+0.513
Neg logit contributions
emet-0.598
ervatives-0.574
adan-0.567
_(-0.551
stein-0.546
Token Create
Token ID13610
Feature activation+9.87e-01

Pos logit contributions
 Bord+2.485
 NAS+2.300
 BDS+2.023
 KE+2.011
 law+1.945
Neg logit contributions
emet-2.685
ervatives-2.610
adan-2.556
_(-2.493
stein-2.477
Token Bit
Token ID4722
Feature activation+4.02e-01

Pos logit contributions
 Bord+1.605
 NAS+1.494
 BDS+1.315
 KE+1.297
 law+1.273
Neg logit contributions
emet-1.714
ervatives-1.675
adan-1.650
_(-1.615
stein-1.599
Tokenmap
Token ID8899
Feature activation+7.70e-02

Pos logit contributions
 Bord+0.733
 NAS+0.676
 BDS+0.615
 KE+0.606
 law+0.591
Neg logit contributions
emet-0.617
ervatives-0.599
adan-0.594
_(-0.580
_--0.567
Token $
Token ID720
Feature activation+5.12e-01

Pos logit contributions
 Bord+0.128
 NAS+0.119
 BDS+0.105
 KE+0.104
 law+0.101
Neg logit contributions
emet-0.132
ervatives-0.128
adan-0.127
_(-0.123
_--0.122
Tokenb
Token ID65
Feature activation+1.20e-01

Pos logit contributions
 Bord+0.915
 NAS+0.857
 BDS+0.773
 KE+0.762
 law+0.737
Neg logit contributions
emet-0.790
ervatives-0.766
adan-0.752
_(-0.734
_--0.727
Token about
Token ID546
Feature activation-3.31e-01

Pos logit contributions
emet+0.386
ervatives+0.367
adan+0.366
_(+0.361
_-+0.357
Neg logit contributions
 Bord-0.365
 NAS-0.337
 BDS-0.301
 KE-0.296
 law-0.289
Token Islam
Token ID3449
Feature activation-4.25e-01

Pos logit contributions
emet+0.579
ervatives+0.554
adan+0.551
_(+0.544
_-+0.536
Neg logit contributions
 Bord-0.535
 NAS-0.494
 BDS-0.439
 KE-0.434
 law-0.420
Token.
Token ID13
Feature activation+1.68e-01

Pos logit contributions
emet+0.740
ervatives+0.706
adan+0.703
_(+0.695
_-+0.684
Neg logit contributions
 Bord-0.691
 NAS-0.639
 BDS-0.569
 KE-0.560
 law-0.544
Token It
Token ID632
Feature activation-5.96e-02

Pos logit contributions
 Bord+0.239
 NAS+0.219
 BDS+0.193
 KE+0.185
 law+0.180
Neg logit contributions
emet-0.335
ervatives-0.319
adan-0.319
_(-0.314
elsius-0.311
Token den
Token ID2853
Feature activation+4.05e-01

Pos logit contributions
emet+0.099
ervatives+0.094
adan+0.093
_(+0.092
_-+0.091
Neg logit contributions
 Bord-0.102
 NAS-0.095
 BDS-0.085
 KE-0.084
 law-0.081
Tokenigr
Token ID3692
Feature activation+1.53e+00

Pos logit contributions
 Bord+0.598
 NAS+0.550
 BDS+0.485
 KE+0.478
 law+0.460
Neg logit contributions
emet-0.761
ervatives-0.727
adan-0.726
_(-0.719
_--0.711
Tokenates
Token ID689
Feature activation+1.71e-01

Pos logit contributions
 Bord+2.215
 NAS+2.049
 BDS+1.842
 KE+1.800
 law+1.767
Neg logit contributions
emet-2.778
_(-2.671
ervatives-2.652
elsius-2.643
_--2.633
Token the
Token ID262
Feature activation+6.72e-02

Pos logit contributions
 Bord+0.287
 NAS+0.267
 BDS+0.242
 KE+0.235
 law+0.233
Neg logit contributions
emet-0.291
adan-0.276
ervatives-0.275
_(-0.273
_--0.267
Token religion
Token ID5737
Feature activation+3.65e-01

Pos logit contributions
 Bord+0.118
 NAS+0.111
 BDS+0.101
 law+0.098
 KE+0.097
Neg logit contributions
emet-0.109
adan-0.104
ervatives-0.103
_(-0.102
stein-0.100
Token by
Token ID416
Feature activation+2.96e-01

Pos logit contributions
 Bord+0.583
 NAS+0.543
 BDS+0.487
 law+0.473
 KE+0.470
Neg logit contributions
emet-0.647
adan-0.615
ervatives-0.613
_(-0.611
_--0.600
Token depicting
Token ID27561
Feature activation+4.49e-01

Pos logit contributions
 Bord+0.442
 NAS+0.406
 BDS+0.364
 law+0.351
 KE+0.347
Neg logit contributions
emet-0.563
adan-0.532
_(-0.530
ervatives-0.529
_--0.517
Token modest
Token ID12949
Feature activation-5.95e-02

Pos logit contributions
 Bord+0.034
 NAS+0.031
 BDS+0.028
 KE+0.027
 law+0.026
Neg logit contributions
emet-0.041
ervatives-0.039
adan-0.039
_(-0.039
_--0.038
Token dress
Token ID6576
Feature activation+2.88e-01

Pos logit contributions
emet+0.112
ervatives+0.108
adan+0.107
_(+0.106
stein+0.105
Neg logit contributions
 Bord-0.088
 NAS-0.081
 BDS-0.072
 KE-0.070
 law-0.068
Token,
Token ID11
Feature activation+1.49e-01

Pos logit contributions
 Bord+0.469
 NAS+0.430
 BDS+0.389
 KE+0.379
 law+0.372
Neg logit contributions
emet-0.501
adan-0.476
ervatives-0.476
_(-0.471
_--0.464
Token or
Token ID393
Feature activation-1.37e-02

Pos logit contributions
 Bord+0.246
 NAS+0.227
 BDS+0.206
 KE+0.201
 law+0.198
Neg logit contributions
emet-0.261
adan-0.247
ervatives-0.246
_(-0.244
stein-0.239
Token even
Token ID772
Feature activation+2.22e-01

Pos logit contributions
emet+0.024
ervatives+0.023
adan+0.023
_(+0.023
_-+0.022
Neg logit contributions
 Bord-0.022
 NAS-0.021
 BDS-0.018
 KE-0.018
 law-0.018
Token business
Token ID1597
Feature activation+1.53e+00

Pos logit contributions
 Bord+0.355
 NAS+0.328
 BDS+0.297
 KE+0.289
 law+0.283
Neg logit contributions
emet-0.395
ervatives-0.377
adan-0.374
_(-0.370
stein-0.364
Token attire
Token ID37629
Feature activation+5.57e-01

Pos logit contributions
 Bord+2.013
 NAS+1.791
 BDS+1.625
 KE+1.561
 law+1.533
Neg logit contributions
emet-3.126
ervatives-3.033
adan-2.989
_(-2.955
stein-2.926
Token.
Token ID13
Feature activation+3.67e-02

Pos logit contributions
 Bord+0.912
 NAS+0.835
 BDS+0.761
 KE+0.740
 law+0.734
Neg logit contributions
emet-0.969
adan-0.920
ervatives-0.915
_(-0.905
_--0.893
Token There
Token ID1318
Feature activation+1.35e-01

Pos logit contributions
 Bord+0.057
 NAS+0.052
 BDS+0.047
 KE+0.045
 LIC+0.043
Neg logit contributions
emet-0.070
ervatives-0.066
adan-0.066
elsius-0.065
_(-0.064
Token's
Token ID338
Feature activation-2.14e-02

Pos logit contributions
 Bord+0.238
 NAS+0.221
 BDS+0.199
 KE+0.196
 law+0.192
Neg logit contributions
emet-0.219
ervatives-0.207
adan-0.206
_(-0.203
_--0.201
Token very
Token ID845
Feature activation-9.84e-02

Pos logit contributions
emet+0.039
ervatives+0.037
adan+0.037
_(+0.037
stein+0.036
Neg logit contributions
 Bord-0.034
 NAS-0.031
 BDS-0.028
 KE-0.027
 law-0.027
Token fooled
Token ID35820
Feature activation-1.79e-01

Pos logit contributions
 Bord+0.406
 NAS+0.376
 KE+0.336
 BDS+0.336
 law+0.330
Neg logit contributions
emet-0.364
adan-0.346
ervatives-0.345
_(-0.342
_--0.334
Token this
Token ID428
Feature activation-3.61e-02

Pos logit contributions
emet+0.313
ervatives+0.299
adan+0.298
_(+0.294
_-+0.290
Neg logit contributions
 Bord-0.291
 NAS-0.268
 BDS-0.238
 KE-0.236
 law-0.229
Token open
Token ID1280
Feature activation-1.31e-01

Pos logit contributions
emet+0.063
ervatives+0.060
adan+0.060
_(+0.059
_-+0.058
Neg logit contributions
 Bord-0.059
 NAS-0.055
 BDS-0.049
 KE-0.048
 law-0.047
Token-
Token ID12
Feature activation+1.59e-03

Pos logit contributions
emet+0.213
ervatives+0.204
adan+0.203
_(+0.200
_-+0.197
Neg logit contributions
 Bord-0.227
 NAS-0.211
 BDS-0.189
 KE-0.187
 law-0.182
Tokenminded
Token ID14543
Feature activation-8.39e-02

Pos logit contributions
 Bord+0.004
 NAS+0.003
 BDS+0.003
 KE+0.003
 law+0.003
Neg logit contributions
emet-0.002
ervatives-0.002
adan-0.002
_(-0.002
_--0.002
Token skept
Token ID11200
Feature activation+1.53e+00

Pos logit contributions
emet+0.136
ervatives+0.130
adan+0.128
_(+0.126
_-+0.125
Neg logit contributions
 Bord-0.148
 NAS-0.139
 BDS-0.123
 KE-0.122
 law-0.120
Tokenic
Token ID291
Feature activation+2.54e-01

Pos logit contributions
 Bord+2.000
 NAS+1.835
 BDS+1.572
 KE+1.539
 law+1.516
Neg logit contributions
emet-3.010
_--2.894
ervatives-2.881
_(-2.875
adan-2.871
Token!
Token ID0
Feature activation-1.82e-01

Pos logit contributions
 Bord+0.431
 NAS+0.400
 BDS+0.357
 KE+0.353
 law+0.347
Neg logit contributions
emet-0.428
ervatives-0.405
adan-0.403
_(-0.396
_--0.393
Token One
Token ID1881
Feature activation+3.57e-02

Pos logit contributions
emet+0.331
ervatives+0.318
adan+0.315
_(+0.310
_-+0.306
Neg logit contributions
 Bord-0.285
 NAS-0.262
 BDS-0.232
 KE-0.229
 law-0.220
Token thing
Token ID1517
Feature activation+6.42e-02

Pos logit contributions
 Bord+0.060
 NAS+0.055
 BDS+0.049
 KE+0.049
 law+0.048
Neg logit contributions
emet-0.061
ervatives-0.058
adan-0.058
_(-0.057
_--0.056
Token I
Token ID314
Feature activation+1.64e-02

Pos logit contributions
 Bord+0.106
 NAS+0.097
 BDS+0.086
 KE+0.085
 law+0.083
Neg logit contributions
emet-0.112
ervatives-0.107
adan-0.107
_(-0.105
_--0.104
Token winners
Token ID14591
Feature activation-4.46e-01

Pos logit contributions
 Bord+0.224
 NAS+0.211
 BDS+0.185
 KE+0.184
 law+0.180
Neg logit contributions
emet-0.245
adan-0.236
ervatives-0.235
_(-0.232
_--0.229
Token.
Token ID13
Feature activation-8.82e-02

Pos logit contributions
emet+0.771
ervatives+0.735
adan+0.733
_(+0.721
_-+0.713
Neg logit contributions
 Bord-0.733
 NAS-0.680
 BDS-0.604
 KE-0.597
 law-0.580
Token
Token ID198
Feature activation+4.18e-02

Pos logit contributions
emet+0.148
ervatives+0.142
adan+0.141
_(+0.139
_-+0.137
Neg logit contributions
 Bord-0.149
 NAS-0.138
 BDS-0.123
 KE-0.122
 law-0.118
Token
Token ID198
Feature activation-2.04e-02

Pos logit contributions
 Bord+0.062
 NAS+0.058
 BDS+0.051
 KE+0.050
 law+0.048
Neg logit contributions
emet-0.080
avorite-0.077
ervatives-0.076
adan-0.076
elsius-0.076
TokenS
Token ID50
Feature activation+1.80e-01

Pos logit contributions
emet+0.030
ervatives+0.029
adan+0.028
_(+0.028
elsius+0.027
Neg logit contributions
 Bord-0.038
 NAS-0.036
 BDS-0.033
 KE-0.032
 law-0.031
Tokenarg
Token ID853
Feature activation+1.52e+00

Pos logit contributions
 Bord+0.325
 NAS+0.304
 BDS+0.273
 KE+0.273
 law+0.265
Neg logit contributions
emet-0.276
ervatives-0.266
adan-0.263
_(-0.258
_--0.257
Tokenent
Token ID298
Feature activation-1.45e-01

Pos logit contributions
 Bord+2.652
 NAS+2.469
 BDS+2.179
 KE+2.154
 law+2.132
Neg logit contributions
emet-2.385
_--2.312
ervatives-2.291
_(-2.243
STDOUT-2.237
Token Sh
Token ID911
Feature activation-2.87e-01

Pos logit contributions
emet+0.259
ervatives+0.248
adan+0.246
_(+0.244
_-+0.240
Neg logit contributions
 Bord-0.229
 NAS-0.211
 BDS-0.187
 KE-0.185
 law-0.178
Tokenriver
Token ID38291
Feature activation+4.00e-01

Pos logit contributions
emet+0.444
ervatives+0.424
adan+0.417
_(+0.416
_-+0.410
Neg logit contributions
 Bord-0.522
 NAS-0.485
 BDS-0.436
 KE-0.433
 law-0.420
Token Award
Token ID11289
Feature activation-2.45e-01

Pos logit contributions
 Bord+0.659
 NAS+0.611
 BDS+0.542
 KE+0.536
 law+0.521
Neg logit contributions
emet-0.689
ervatives-0.661
adan-0.656
_(-0.646
_--0.641
Token for
Token ID329
Feature activation-4.59e-02

Pos logit contributions
emet+0.422
ervatives+0.404
adan+0.403
_(+0.397
_-+0.393
Neg logit contributions
 Bord-0.401
 NAS-0.373
 BDS-0.331
 KE-0.327
 law-0.317
Token media
Token ID2056
Feature activation+1.24e-01

Pos logit contributions
emet+0.582
ervatives+0.561
adan+0.558
_(+0.552
_-+0.547
Neg logit contributions
 Bord-0.372
 NAS-0.338
 BDS-0.291
 KE-0.286
 law-0.275
Token can
Token ID460
Feature activation-6.10e-02

Pos logit contributions
 Bord+0.200
 NAS+0.186
 BDS+0.166
 KE+0.162
 law+0.160
Neg logit contributions
emet-0.217
ervatives-0.207
adan-0.205
_(-0.202
_--0.201
Token provide
Token ID2148
Feature activation+2.33e-01

Pos logit contributions
emet+0.114
ervatives+0.110
adan+0.108
_(+0.107
_-+0.106
Neg logit contributions
 Bord-0.092
 NAS-0.084
 BDS-0.074
 KE-0.073
 law-0.072
Token platforms
Token ID9554
Feature activation+3.56e-01

Pos logit contributions
 Bord+0.371
 NAS+0.344
 BDS+0.305
 KE+0.299
 law+0.299
Neg logit contributions
emet-0.417
ervatives-0.398
adan-0.398
_(-0.392
_--0.390
Token to
Token ID284
Feature activation-2.14e-01

Pos logit contributions
 Bord+0.591
 NAS+0.547
 BDS+0.486
 KE+0.479
 law+0.468
Neg logit contributions
emet-0.613
ervatives-0.580
adan-0.580
_(-0.570
_--0.564
Token dissemin
Token ID30548
Feature activation+1.50e+00

Pos logit contributions
emet+0.408
ervatives+0.390
adan+0.386
_(+0.382
_-+0.380
Neg logit contributions
 Bord-0.319
 NAS-0.295
 BDS-0.258
 KE-0.252
 law-0.251
Tokenate
Token ID378
Feature activation+1.17e+00

Pos logit contributions
 Bord+2.756
 NAS+2.576
 BDS+2.340
 KE+2.316
 law+2.301
Neg logit contributions
emet-2.149
adan-2.044
ervatives-2.031
_--2.024
STDOUT-1.998
Token evidence
Token ID2370
Feature activation+2.18e-03

Pos logit contributions
 Bord+1.570
 NAS+1.432
 BDS+1.252
 law+1.249
 KE+1.190
Neg logit contributions
emet-2.393
adan-2.319
ervatives-2.319
_(-2.267
_--2.248
Token-
Token ID12
Feature activation-1.14e-01

Pos logit contributions
 Bord+0.003
 NAS+0.003
 BDS+0.003
 KE+0.003
 law+0.003
Neg logit contributions
emet-0.004
ervatives-0.004
adan-0.004
_(-0.004
_--0.004
Tokenbased
Token ID3106
Feature activation+2.49e-01

Pos logit contributions
emet+0.205
ervatives+0.200
adan+0.198
_(+0.196
_-+0.194
Neg logit contributions
 Bord-0.175
 NAS-0.164
 BDS-0.143
 KE-0.142
 law-0.138
Token injury
Token ID5095
Feature activation-9.20e-02

Pos logit contributions
 Bord+0.355
 NAS+0.326
 BDS+0.284
 law+0.279
 KE+0.276
Neg logit contributions
emet-0.489
ervatives-0.476
adan-0.471
_(-0.461
_--0.458
Token.
Token ID13
Feature activation+1.98e-01

Pos logit contributions
emet+0.055
ervatives+0.052
adan+0.052
_(+0.051
_-+0.050
Neg logit contributions
 Bord-0.063
 NAS-0.059
 BDS-0.053
 KE-0.053
 law-0.051
Token
Token ID198
Feature activation+1.17e-01

Pos logit contributions
 Bord+0.304
 NAS+0.280
 BDS+0.246
 KE+0.242
 LIC+0.230
Neg logit contributions
emet-0.371
ervatives-0.357
adan-0.355
_(-0.344
elsius-0.341
Token
Token ID198
Feature activation+6.93e-02

Pos logit contributions
 Bord+0.194
 NAS+0.180
 BDS+0.161
 KE+0.158
 law+0.152
Neg logit contributions
emet-0.200
ervatives-0.190
adan-0.190
_(-0.186
_--0.183
Token2
Token ID17
Feature activation-1.13e-01

Pos logit contributions
 Bord+0.128
 NAS+0.121
 BDS+0.109
 KE+0.107
 law+0.104
Neg logit contributions
emet-0.104
ervatives-0.099
adan-0.098
_(-0.096
elsius-0.094
Token.
Token ID13
Feature activation+1.80e-01

Pos logit contributions
emet+0.184
ervatives+0.175
adan+0.174
_(+0.171
_-+0.169
Neg logit contributions
 Bord-0.196
 NAS-0.183
 BDS-0.164
 KE-0.162
 law-0.157
Token Fine
Token ID17867
Feature activation+1.48e+00

Pos logit contributions
 Bord+0.273
 NAS+0.251
 BDS+0.220
 KE+0.217
 LIC+0.206
Neg logit contributions
emet-0.338
ervatives-0.325
adan-0.324
_(-0.316
_--0.311
Token cut
Token ID2005
Feature activation+6.03e-01

Pos logit contributions
 Bord+2.561
 NAS+2.376
 BDS+2.115
 KE+2.064
 law+2.035
Neg logit contributions
emet-2.370
ervatives-2.301
adan-2.259
_(-2.231
_--2.202
Token the
Token ID262
Feature activation+7.64e-02

Pos logit contributions
 Bord+1.006
 NAS+0.935
 BDS+0.821
 KE+0.809
 law+0.788
Neg logit contributions
emet-1.028
ervatives-0.975
adan-0.974
_(-0.965
_--0.959
Token 4
Token ID604
Feature activation-9.32e-02

Pos logit contributions
 Bord+0.129
 NAS+0.120
 BDS+0.107
 KE+0.106
 law+0.103
Neg logit contributions
emet-0.129
ervatives-0.123
adan-0.123
_(-0.121
_--0.119
Token garlic
Token ID17644
Feature activation+3.22e-01

Pos logit contributions
emet+0.165
ervatives+0.158
adan+0.158
_(+0.155
_-+0.153
Neg logit contributions
 Bord-0.149
 NAS-0.137
 BDS-0.122
 KE-0.120
 law-0.117
Token cloves
Token ID45025
Feature activation+5.06e-01

Pos logit contributions
 Bord+0.419
 NAS+0.378
 BDS+0.320
 KE+0.313
 law+0.302
Neg logit contributions
emet-0.676
adan-0.647
ervatives-0.645
_(-0.633
_--0.630
Token sphere
Token ID16558
Feature activation+1.19e+00

Pos logit contributions
 Bord+0.141
 NAS+0.129
 BDS+0.114
 KE+0.112
 law+0.110
Neg logit contributions
emet-0.165
ervatives-0.158
adan-0.157
_(-0.155
_--0.154
Token of
Token ID286
Feature activation+1.26e-01

Pos logit contributions
 Bord+1.932
 NAS+1.785
 BDS+1.587
 KE+1.568
 law+1.518
Neg logit contributions
emet-2.081
ervatives-1.986
adan-1.975
_(-1.948
_--1.921
Token influence
Token ID4588
Feature activation+3.73e-01

Pos logit contributions
 Bord+0.186
 NAS+0.169
 BDS+0.150
 KE+0.146
 law+0.144
Neg logit contributions
emet-0.241
ervatives-0.231
adan-0.230
_(-0.227
_--0.225
Token.
Token ID13
Feature activation+1.45e-01

Pos logit contributions
 Bord+0.626
 NAS+0.576
 BDS+0.517
 KE+0.508
 law+0.501
Neg logit contributions
emet-0.634
ervatives-0.600
adan-0.598
_(-0.588
_--0.581
Token Current
Token ID9236
Feature activation+6.13e-01

Pos logit contributions
 Bord+0.235
 NAS+0.217
 BDS+0.193
 KE+0.190
 law+0.184
Neg logit contributions
emet-0.256
ervatives-0.244
adan-0.243
_(-0.239
_--0.236
Token Issue
Token ID18232
Feature activation+1.48e+00

Pos logit contributions
 Bord+1.006
 NAS+0.932
 BDS+0.834
 law+0.816
 KE+0.813
Neg logit contributions
emet-1.068
adan-1.015
ervatives-1.011
_(-1.000
_--0.989
Token View
Token ID3582
Feature activation-2.68e-01

Pos logit contributions
 Bord+2.391
 NAS+2.218
 BDS+1.980
 KE+1.945
 law+1.918
Neg logit contributions
emet-2.582
adan-2.418
ervatives-2.412
_(-2.384
_--2.362
Token our
Token ID674
Feature activation-3.06e-02

Pos logit contributions
emet+0.488
ervatives+0.466
adan+0.463
_(+0.457
_-+0.452
Neg logit contributions
 Bord-0.418
 NAS-0.384
 BDS-0.339
 KE-0.335
 law-0.324
Token current
Token ID1459
Feature activation+4.33e-01

Pos logit contributions
emet+0.057
ervatives+0.054
adan+0.054
_(+0.053
_-+0.053
Neg logit contributions
 Bord-0.046
 NAS-0.043
 BDS-0.038
 KE-0.037
 law-0.036
Token issue
Token ID2071
Feature activation+7.36e-01

Pos logit contributions
 Bord+0.487
 NAS+0.437
 BDS+0.365
 KE+0.356
 law+0.348
Neg logit contributions
emet-0.975
ervatives-0.936
adan-0.936
_(-0.922
_--0.914
Token
Token ID198
Feature activation+1.18e-01

Pos logit contributions
 Bord+1.187
 NAS+1.103
 BDS+0.980
 KE+0.962
 law+0.941
Neg logit contributions
emet-1.299
ervatives-1.234
adan-1.227
_(-1.205
_--1.192
Token that
Token ID326
Feature activation-3.19e-02

Pos logit contributions
 Bord+0.201
 NAS+0.186
 BDS+0.168
 KE+0.164
 law+0.161
Neg logit contributions
emet-0.212
adan-0.202
ervatives-0.201
_(-0.199
elsius-0.194
Token got
Token ID1392
Feature activation+3.09e-01

Pos logit contributions
emet+0.053
ervatives+0.050
adan+0.050
_(+0.049
elsius+0.048
Neg logit contributions
 Bord-0.055
 NAS-0.051
 BDS-0.046
 KE-0.045
 law-0.045
Token him
Token ID683
Feature activation+4.07e-01

Pos logit contributions
 Bord+0.447
 NAS+0.406
 BDS+0.361
 KE+0.353
 law+0.351
Neg logit contributions
emet-0.597
adan-0.571
ervatives-0.567
_(-0.562
_--0.554
Token into
Token ID656
Feature activation+1.64e-01

Pos logit contributions
 Bord+0.680
 NAS+0.624
 BDS+0.571
 law+0.559
 KE+0.557
Neg logit contributions
emet-0.703
adan-0.669
_(-0.654
ervatives-0.653
_--0.641
Token some
Token ID617
Feature activation+2.80e-01

Pos logit contributions
 Bord+0.246
 NAS+0.227
 BDS+0.204
 KE+0.197
 law+0.195
Neg logit contributions
emet-0.310
adan-0.296
ervatives-0.295
_(-0.291
_--0.285
Token trouble
Token ID5876
Feature activation+1.46e+00

Pos logit contributions
 Bord+0.437
 NAS+0.406
 BDS+0.368
 KE+0.356
 law+0.352
Neg logit contributions
emet-0.514
adan-0.489
ervatives-0.484
_(-0.479
_--0.470
Token and
Token ID290
Feature activation+1.09e-01

Pos logit contributions
 Bord+2.361
 NAS+2.163
 BDS+1.980
 law+1.965
 KE+1.908
Neg logit contributions
emet-2.539
adan-2.373
_(-2.335
ervatives-2.334
STDOUT-2.333
Token he
Token ID339
Feature activation+1.99e-02

Pos logit contributions
 Bord+0.183
 NAS+0.169
 BDS+0.154
 law+0.151
 KE+0.150
Neg logit contributions
emet-0.186
adan-0.177
ervatives-0.177
_(-0.174
elsius-0.170
Token made
Token ID925
Feature activation-5.59e-02

Pos logit contributions
 Bord+0.035
 NAS+0.032
 BDS+0.029
 law+0.029
 KE+0.029
Neg logit contributions
emet-0.032
adan-0.030
ervatives-0.030
_(-0.030
STDOUT-0.029
Token some
Token ID617
Feature activation+2.87e-01

Pos logit contributions
emet+0.096
adan+0.090
ervatives+0.089
_(+0.089
elsius+0.087
Neg logit contributions
 Bord-0.094
 NAS-0.087
 BDS-0.079
 KE-0.078
 law-0.076
Token bad
Token ID2089
Feature activation+2.74e-01

Pos logit contributions
 Bord+0.485
 NAS+0.451
 BDS+0.410
 KE+0.400
 law+0.398
Neg logit contributions
emet-0.494
adan-0.464
ervatives-0.455
_(-0.455
elsius-0.449
Token up
Token ID510
Feature activation-1.37e-01

Pos logit contributions
 Bord+0.014
 NAS+0.013
 BDS+0.012
 KE+0.011
 law+0.011
Neg logit contributions
emet-0.012
ervatives-0.011
adan-0.011
_(-0.011
_--0.011
Token in
Token ID287
Feature activation+1.21e-01

Pos logit contributions
emet+0.246
ervatives+0.235
adan+0.234
_(+0.231
_-+0.228
Neg logit contributions
 Bord-0.217
 NAS-0.201
 BDS-0.178
 KE-0.175
 law-0.171
Token Prime
Token ID5537
Feature activation-1.63e-01

Pos logit contributions
 Bord+0.180
 NAS+0.168
 BDS+0.146
 KE+0.144
 law+0.141
Neg logit contributions
emet-0.227
ervatives-0.219
adan-0.218
_(-0.215
_--0.212
Token Minister
Token ID4139
Feature activation-1.87e-01

Pos logit contributions
emet+0.330
ervatives+0.318
adan+0.314
_(+0.311
_-+0.307
Neg logit contributions
 Bord-0.224
 NAS-0.205
 BDS-0.176
 KE-0.174
 law-0.169
Token Naj
Token ID37135
Feature activation-5.09e-01

Pos logit contributions
emet+0.324
ervatives+0.312
adan+0.308
_(+0.304
_-+0.300
Neg logit contributions
 Bord-0.304
 NAS-0.282
 BDS-0.250
 KE-0.249
 law-0.241
Tokenib
Token ID571
Feature activation+1.46e+00

Pos logit contributions
emet+0.935
ervatives+0.902
_(+0.882
adan+0.875
_-+0.871
Neg logit contributions
 Bord-0.784
 NAS-0.723
 BDS-0.629
 KE-0.629
 law-0.612
Token Raz
Token ID38058
Feature activation+2.34e-01

Pos logit contributions
 Bord+2.892
 NAS+2.722
 BDS+2.491
 KE+2.487
 law+2.422
Neg logit contributions
emet-1.999
ervatives-1.972
_(-1.824
adan-1.817
STDOUT-1.794
Tokenak
Token ID461
Feature activation+2.44e-01

Pos logit contributions
 Bord+0.364
 NAS+0.344
 KE+0.307
 BDS+0.301
 law+0.296
Neg logit contributions
emet-0.417
ervatives-0.398
_--0.395
_(-0.392
elsius-0.387
Token's
Token ID338
Feature activation+2.47e-01

Pos logit contributions
 Bord+0.456
 NAS+0.425
 BDS+0.385
 KE+0.381
 law+0.367
Neg logit contributions
emet-0.366
ervatives-0.348
adan-0.345
_(-0.341
stein-0.336
Token personal
Token ID2614
Feature activation+1.99e-01

Pos logit contributions
 Bord+0.336
 NAS+0.309
 BDS+0.265
 KE+0.263
 law+0.256
Neg logit contributions
emet-0.500
ervatives-0.481
adan-0.480
_(-0.471
_--0.466
Token bank
Token ID3331
Feature activation+1.40e-01

Pos logit contributions
 Bord+0.269
 NAS+0.247
 BDS+0.213
 KE+0.209
 law+0.204
Neg logit contributions
emet-0.403
ervatives-0.387
adan-0.385
_(-0.379
_--0.376
Token myself
Token ID3589
Feature activation+5.26e-02

Pos logit contributions
emet+1.038
ervatives+0.991
adan+0.984
_(+0.972
_-+0.957
Neg logit contributions
 Bord-1.079
 NAS-1.001
 BDS-0.896
 KE-0.887
 law-0.858
Token an
Token ID281
Feature activation+1.04e-01

Pos logit contributions
 Bord+0.087
 NAS+0.080
 BDS+0.071
 KE+0.070
 law+0.068
Neg logit contributions
emet-0.091
ervatives-0.087
adan-0.086
_(-0.085
_--0.084
Token open
Token ID1280
Feature activation-2.13e-01

Pos logit contributions
 Bord+0.158
 NAS+0.145
 BDS+0.128
 KE+0.126
 law+0.122
Neg logit contributions
emet-0.194
ervatives-0.187
adan-0.185
_(-0.183
_--0.181
Token-
Token ID12
Feature activation-2.53e-02

Pos logit contributions
emet+0.353
ervatives+0.338
adan+0.336
_(+0.331
_-+0.327
Neg logit contributions
 Bord-0.364
 NAS-0.339
 BDS-0.302
 KE-0.299
 law-0.291
Tokenminded
Token ID14543
Feature activation-2.75e-01

Pos logit contributions
emet+0.028
ervatives+0.026
adan+0.026
_(+0.025
_-+0.025
Neg logit contributions
 Bord-0.057
 NAS-0.054
 BDS-0.050
 KE-0.049
 law-0.048
Token skept
Token ID11200
Feature activation+1.45e+00

Pos logit contributions
emet+0.514
ervatives+0.493
adan+0.487
_(+0.481
_-+0.479
Neg logit contributions
 Bord-0.418
 NAS-0.387
 BDS-0.339
 KE-0.332
 law-0.327
Tokenic
Token ID291
Feature activation+2.48e-01

Pos logit contributions
 Bord+1.797
 NAS+1.627
 BDS+1.396
 KE+1.340
 law+1.315
Neg logit contributions
emet-2.949
_--2.882
STDOUT-2.856
avorite-2.853
adan-2.834
Token who
Token ID508
Feature activation-2.57e-02

Pos logit contributions
 Bord+0.417
 NAS+0.386
 BDS+0.346
 KE+0.338
 law+0.336
Neg logit contributions
emet-0.421
ervatives-0.396
adan-0.393
_(-0.387
_--0.384
Token has
Token ID468
Feature activation-1.06e-01

Pos logit contributions
emet+0.042
ervatives+0.039
adan+0.039
_(+0.039
_-+0.038
Neg logit contributions
 Bord-0.045
 NAS-0.042
 BDS-0.038
 KE-0.037
 law-0.037
Token had
Token ID550
Feature activation+2.57e-01

Pos logit contributions
emet+0.172
ervatives+0.163
adan+0.162
_(+0.159
_-+0.157
Neg logit contributions
 Bord-0.188
 NAS-0.176
 BDS-0.159
 KE-0.156
 law-0.154
Token experiences
Token ID6461
Feature activation+8.23e-02

Pos logit contributions
 Bord+0.424
 NAS+0.396
 BDS+0.354
 KE+0.347
 law+0.339
Neg logit contributions
emet-0.445
ervatives-0.424
adan-0.420
_(-0.415
_--0.411
Token and
Token ID290
Feature activation+6.24e-02

Pos logit contributions
 Bord+0.291
 NAS+0.270
 BDS+0.241
 KE+0.238
 law+0.235
Neg logit contributions
emet-0.305
ervatives-0.290
adan-0.289
_(-0.284
stein-0.281
Token show
Token ID905
Feature activation-3.23e-01

Pos logit contributions
 Bord+0.103
 NAS+0.096
 BDS+0.085
 KE+0.084
 law+0.083
Neg logit contributions
emet-0.108
ervatives-0.103
adan-0.102
_(-0.101
_--0.100
Token them
Token ID606
Feature activation+3.87e-01

Pos logit contributions
emet+0.595
ervatives+0.567
adan+0.566
_(+0.558
_-+0.550
Neg logit contributions
 Bord-0.504
 NAS-0.462
 BDS-0.408
 KE-0.401
 law-0.394
Token we
Token ID356
Feature activation+1.49e-01

Pos logit contributions
 Bord+0.603
 NAS+0.554
 BDS+0.490
 KE+0.478
 law+0.475
Neg logit contributions
emet-0.714
adan-0.680
ervatives-0.676
_(-0.670
_--0.655
Token mean
Token ID1612
Feature activation-5.17e-01

Pos logit contributions
 Bord+0.256
 NAS+0.238
 BDS+0.214
 KE+0.210
 law+0.210
Neg logit contributions
emet-0.245
adan-0.233
_(-0.232
ervatives-0.230
_--0.228
Token business
Token ID1597
Feature activation+1.45e+00

Pos logit contributions
emet+1.060
ervatives+1.020
adan+1.016
_(+1.005
_-+0.993
Neg logit contributions
 Bord-0.684
 NAS-0.620
 BDS-0.535
 KE-0.526
 law-0.505
Token.'
Token ID2637
Feature activation+9.04e-02

Pos logit contributions
 Bord+2.395
 NAS+2.199
 BDS+1.988
 law+1.936
 KE+1.928
Neg logit contributions
emet-2.510
ervatives-2.352
adan-2.340
_(-2.317
_--2.302
Token\n
Token ID198
Feature activation-2.62e-02

Pos logit contributions
 Bord+0.160
 NAS+0.149
 BDS+0.134
 KE+0.133
 abduction+0.129
Neg logit contributions
emet-0.142
ervatives-0.137
_(-0.135
adan-0.134
stein-0.130
Token\n
Token ID198
Feature activation-1.69e-02

Pos logit contributions
emet+0.050
avorite+0.048
ervatives+0.047
adan+0.047
elsius+0.047
Neg logit contributions
 Bord-0.039
 NAS-0.037
 BDS-0.032
 KE-0.032
 law-0.031
TokenNew
Token ID3791
Feature activation-4.23e-01

Pos logit contributions
emet+0.027
ervatives+0.025
adan+0.025
_(+0.024
elsius+0.024
Neg logit contributions
 Bord-0.030
 NAS-0.028
 BDS-0.025
 KE-0.025
 law-0.024
Token laws
Token ID3657
Feature activation-2.61e-01

Pos logit contributions
emet+0.783
ervatives+0.747
adan+0.743
_(+0.735
_-+0.725
Neg logit contributions
 Bord-0.646
 NAS-0.592
 BDS-0.523
 KE-0.515
 law-0.502
Token paste
Token ID17008
Feature activation-3.20e-01

Pos logit contributions
emet+0.710
ervatives+0.677
adan+0.673
_(+0.664
_-+0.655
Neg logit contributions
 Bord-0.723
 NAS-0.670
 BDS-0.599
 KE-0.593
 law-0.574
Token and
Token ID290
Feature activation+1.57e-01

Pos logit contributions
emet+0.519
ervatives+0.492
adan+0.491
_(+0.484
_-+0.477
Neg logit contributions
 Bord-0.561
 NAS-0.522
 BDS-0.468
 KE-0.462
 law-0.449
Token mix
Token ID5022
Feature activation+7.78e-01

Pos logit contributions
 Bord+0.247
 NAS+0.229
 BDS+0.202
 KE+0.198
 law+0.194
Neg logit contributions
emet-0.285
ervatives-0.272
adan-0.271
_(-0.268
_--0.264
Token well
Token ID880
Feature activation+1.81e-01

Pos logit contributions
 Bord+1.141
 NAS+1.054
 BDS+0.920
 KE+0.908
 law+0.884
Neg logit contributions
emet-1.485
ervatives-1.420
adan-1.415
_(-1.391
_--1.384
Token.
Token ID13
Feature activation+2.95e-02

Pos logit contributions
 Bord+0.323
 NAS+0.301
 BDS+0.270
 KE+0.267
 law+0.260
Neg logit contributions
emet-0.286
ervatives-0.272
adan-0.270
_(-0.266
_--0.262
Token Season
Token ID7369
Feature activation+1.45e+00

Pos logit contributions
 Bord+0.044
 NAS+0.041
 BDS+0.035
 KE+0.035
 law+0.033
Neg logit contributions
emet-0.056
ervatives-0.054
adan-0.054
_(-0.052
_--0.052
Token with
Token ID351
Feature activation+2.27e-01

Pos logit contributions
 Bord+2.180
 NAS+2.008
 BDS+1.782
 KE+1.746
 law+1.718
Neg logit contributions
emet-2.674
ervatives-2.537
adan-2.528
_(-2.524
_--2.515
Token salt
Token ID8268
Feature activation+3.28e-01

Pos logit contributions
 Bord+0.452
 NAS+0.427
 BDS+0.386
 KE+0.380
 law+0.378
Neg logit contributions
emet-0.322
adan-0.298
ervatives-0.297
_(-0.295
_--0.292
Token,
Token ID11
Feature activation-1.94e-02

Pos logit contributions
 Bord+0.583
 NAS+0.543
 BDS+0.488
 KE+0.479
 law+0.472
Neg logit contributions
emet-0.522
ervatives-0.492
adan-0.491
_(-0.483
_--0.479
Token pepper
Token ID13385
Feature activation-1.11e-01

Pos logit contributions
emet+0.029
adan+0.027
ervatives+0.027
_(+0.026
_-+0.026
Neg logit contributions
 Bord-0.037
 NAS-0.035
 BDS-0.032
 KE-0.031
 law-0.031
Token and
Token ID290
Feature activation+7.95e-03

Pos logit contributions
emet+0.175
ervatives+0.166
adan+0.165
_(+0.163
_-+0.161
Neg logit contributions
 Bord-0.199
 NAS-0.185
 BDS-0.167
 KE-0.165
 law-0.160
Token go
Token ID467
Feature activation-1.49e-01

Pos logit contributions
 Bord+0.226
 NAS+0.211
 KE+0.184
 BDS+0.184
 law+0.183
Neg logit contributions
emet-0.257
ervatives-0.243
adan-0.241
_(-0.240
elsius-0.237
Token eat
Token ID4483
Feature activation-1.65e-02

Pos logit contributions
emet+0.265
ervatives+0.252
adan+0.249
_(+0.248
_-+0.245
Neg logit contributions
 Bord-0.239
 NAS-0.221
 BDS-0.195
 KE-0.192
 law-0.189
Token [
Token ID685
Feature activation-5.01e-02

Pos logit contributions
emet+0.029
ervatives+0.028
adan+0.028
_(+0.028
_-+0.027
Neg logit contributions
 Bord-0.027
 NAS-0.024
 BDS-0.022
 KE-0.021
 law-0.021
Tokenat
Token ID265
Feature activation-4.02e-02

Pos logit contributions
emet+0.076
ervatives+0.072
adan+0.071
_(+0.070
_-+0.069
Neg logit contributions
 Bord-0.093
 NAS-0.087
 BDS-0.079
 KE-0.077
 law-0.076
Token your
Token ID534
Feature activation+7.34e-02

Pos logit contributions
emet+0.066
ervatives+0.063
adan+0.063
_(+0.062
_-+0.061
Neg logit contributions
 Bord-0.069
 NAS-0.064
 BDS-0.058
 KE-0.057
 law-0.055
Token business
Token ID1597
Feature activation+1.44e+00

Pos logit contributions
 Bord+0.125
 NAS+0.115
 BDS+0.103
 KE+0.101
 law+0.100
Neg logit contributions
emet-0.123
ervatives-0.118
adan-0.117
_(-0.116
_--0.114
Token].
Token ID4083
Feature activation+2.51e-01

Pos logit contributions
 Bord+2.803
 NAS+2.609
 BDS+2.375
 KE+2.350
 law+2.338
Neg logit contributions
emet-2.052
ervatives-1.943
adan-1.926
_(-1.888
_--1.845
Token
Token ID447
Feature activation+1.78e-01

Pos logit contributions
 Bord+0.390
 NAS+0.356
 BDS+0.315
 KE+0.311
 law+0.298
Neg logit contributions
emet-0.463
ervatives-0.443
adan-0.441
_(-0.428
elsius-0.427
Token
Token ID251
Feature activation+1.51e-01

Pos logit contributions
 Bord+0.322
 NAS+0.302
 BDS+0.270
 KE+0.267
 law+0.259
Neg logit contributions
emet-0.277
ervatives-0.264
adan-0.262
_(-0.260
_--0.256
Token\n
Token ID198
Feature activation-4.31e-02

Pos logit contributions
 Bord+0.248
 NAS+0.229
 BDS+0.203
 KE+0.201
 law+0.193
Neg logit contributions
emet-0.263
ervatives-0.252
adan-0.250
_(-0.244
elsius-0.242
Token\n
Token ID198
Feature activation-4.30e-02

Pos logit contributions
emet+0.082
avorite+0.078
ervatives+0.078
adan+0.077
elsius+0.077
Neg logit contributions
 Bord-0.065
 NAS-0.061
 BDS-0.053
 KE-0.052
 law-0.050
Tokend
Token ID67
Feature activation+1.55e-01

Pos logit contributions
emet+0.281
ervatives+0.265
adan+0.259
_(+0.257
_-+0.250
Neg logit contributions
 Bord-0.492
 NAS-0.465
 BDS-0.424
 KE-0.422
 law-0.412
Token)
Token ID8
Feature activation-1.35e-01

Pos logit contributions
 Bord+0.234
 NAS+0.215
 BDS+0.188
 KE+0.186
 law+0.181
Neg logit contributions
emet-0.289
ervatives-0.276
adan-0.274
_(-0.271
_--0.268
Token to
Token ID284
Feature activation+8.22e-02

Pos logit contributions
emet+0.238
ervatives+0.226
adan+0.225
_(+0.222
_-+0.220
Neg logit contributions
 Bord-0.217
 NAS-0.200
 BDS-0.177
 KE-0.175
 law-0.171
Token stir
Token ID11240
Feature activation+6.69e-01

Pos logit contributions
 Bord+0.136
 NAS+0.127
 BDS+0.114
 KE+0.111
 law+0.110
Neg logit contributions
emet-0.142
ervatives-0.136
adan-0.133
_(-0.132
_--0.130
Token up
Token ID510
Feature activation+7.27e-02

Pos logit contributions
 Bord+1.177
 NAS+1.086
 BDS+0.986
 law+0.967
 KE+0.967
Neg logit contributions
emet-1.088
ervatives-1.031
adan-1.016
_(-1.006
_--0.996
Token trouble
Token ID5876
Feature activation+1.44e+00

Pos logit contributions
 Bord+0.115
 NAS+0.106
 BDS+0.094
 law+0.091
 KE+0.091
Neg logit contributions
emet-0.133
ervatives-0.126
adan-0.125
_(-0.124
_--0.123
Token in
Token ID287
Feature activation+1.55e-01

Pos logit contributions
 Bord+2.367
 NAS+2.131
 BDS+1.921
 law+1.907
 KE+1.879
Neg logit contributions
emet-2.490
adan-2.332
ervatives-2.325
STDOUT-2.290
_(-2.284
Token Sir
Token ID7361
Feature activation+2.48e-01

Pos logit contributions
 Bord+0.243
 NAS+0.224
 BDS+0.198
 KE+0.195
 law+0.192
Neg logit contributions
emet-0.284
ervatives-0.271
adan-0.267
_(-0.266
_--0.263
Tokensa
Token ID11400
Feature activation+8.14e-01

Pos logit contributions
 Bord+0.402
 NAS+0.370
 BDS+0.326
 KE+0.324
 law+0.314
Neg logit contributions
emet-0.436
ervatives-0.418
_(-0.409
adan-0.409
_--0.402
Token,
Token ID11
Feature activation+2.08e-01

Pos logit contributions
 Bord+1.415
 NAS+1.304
 BDS+1.158
 KE+1.146
 law+1.140
Neg logit contributions
emet-1.341
ervatives-1.291
adan-1.245
_(-1.241
_--1.241
Token while
Token ID981
Feature activation+2.06e-01

Pos logit contributions
 Bord+0.334
 NAS+0.310
 BDS+0.272
 KE+0.269
 law+0.263
Neg logit contributions
emet-0.374
ervatives-0.358
adan-0.352
_(-0.350
stein-0.343
Token Bit
Token ID4722
Feature activation+4.02e-01

Pos logit contributions
 Bord+1.605
 NAS+1.494
 BDS+1.315
 KE+1.297
 law+1.273
Neg logit contributions
emet-1.714
ervatives-1.675
adan-1.650
_(-1.615
stein-1.599
Tokenmap
Token ID8899
Feature activation+7.70e-02

Pos logit contributions
 Bord+0.733
 NAS+0.676
 BDS+0.615
 KE+0.606
 law+0.591
Neg logit contributions
emet-0.617
ervatives-0.599
adan-0.594
_(-0.580
_--0.567
Token $
Token ID720
Feature activation+5.12e-01

Pos logit contributions
 Bord+0.128
 NAS+0.119
 BDS+0.105
 KE+0.104
 law+0.101
Neg logit contributions
emet-0.132
ervatives-0.128
adan-0.127
_(-0.123
_--0.122
Tokenb
Token ID65
Feature activation+1.20e-01

Pos logit contributions
 Bord+0.915
 NAS+0.857
 BDS+0.773
 KE+0.762
 law+0.737
Neg logit contributions
emet-0.790
ervatives-0.766
adan-0.752
_(-0.734
_--0.727
Tokenmp
Token ID3149
Feature activation+3.47e-01

Pos logit contributions
 Bord+0.188
 NAS+0.173
 BDS+0.154
 KE+0.152
 law+0.146
Neg logit contributions
emet-0.211
ervatives-0.206
adan-0.205
_(-0.200
_--0.198
Token =
Token ID796
Feature activation+1.44e+00

Pos logit contributions
 Bord+0.684
 NAS+0.643
 BDS+0.584
 KE+0.582
 law+0.563
Neg logit contributions
emet-0.480
ervatives-0.461
adan-0.457
_(-0.440
_--0.434
Token new
Token ID649
Feature activation+3.73e-01

Pos logit contributions
 Bord+2.523
 NAS+2.374
 KE+2.101
 BDS+2.092
 LB+2.029
Neg logit contributions
emet-2.294
ervatives-2.264
adan-2.171
stein-2.123
_(-2.113
Token-
Token ID12
Feature activation+1.96e-01

Pos logit contributions
 Bord+0.534
 NAS+0.489
 BDS+0.428
 KE+0.414
 law+0.406
Neg logit contributions
emet-0.732
ervatives-0.714
adan-0.703
elsius-0.692
avorite-0.690
Tokenobject
Token ID15252
Feature activation+7.73e-01

Pos logit contributions
 Bord+0.278
 NAS+0.257
 BDS+0.223
 KE+0.220
 law+0.213
Neg logit contributions
emet-0.377
ervatives-0.367
elsius-0.362
adan-0.359
_(-0.356
Token system
Token ID1080
Feature activation+5.47e-01

Pos logit contributions
 Bord+1.110
 NAS+1.021
 BDS+0.890
 KE+0.871
 law+0.854
Neg logit contributions
emet-1.492
ervatives-1.438
adan-1.430
_(-1.408
_--1.396
Token.
Token ID13
Feature activation+9.91e-02

Pos logit contributions
 Bord+0.861
 NAS+0.804
 BDS+0.718
 KE+0.700
 law+0.687
Neg logit contributions
emet-0.976
avorite-0.966
adan-0.933
elsius-0.932
ervatives-0.928
Token with
Token ID351
Feature activation-7.78e-03

Pos logit contributions
 Bord+0.334
 NAS+0.309
 BDS+0.275
 KE+0.271
 law+0.264
Neg logit contributions
emet-0.361
ervatives-0.344
adan-0.339
_(-0.336
_--0.331
Token Gohan
Token ID45281
Feature activation-7.85e-01

Pos logit contributions
emet+0.014
ervatives+0.013
adan+0.013
_(+0.013
stein+0.013
Neg logit contributions
 Bord-0.012
 NAS-0.012
 BDS-0.010
 KE-0.010
 law-0.010
Token?
Token ID30
Feature activation+9.61e-02

Pos logit contributions
emet+1.321
ervatives+1.254
adan+1.244
_(+1.227
_-+1.210
Neg logit contributions
 Bord-1.334
 NAS-1.235
 BDS-1.106
 KE-1.093
 law-1.064
Token<|endoftext|>
Token ID50256
Feature activation+1.08e-01

Pos logit contributions
 Bord+0.144
 NAS+0.133
 BDS+0.117
 KE+0.116
 law+0.110
Neg logit contributions
emet-0.183
ervatives-0.174
adan-0.173
_(-0.169
elsius-0.169
TokenR
Token ID49
Feature activation-4.17e-02

Pos logit contributions
 Bord+0.199
 NAS+0.188
 BDS+0.169
 KE+0.167
 law+0.163
Neg logit contributions
emet-0.163
ervatives-0.155
adan-0.153
_(-0.150
elsius-0.148
Tokenents
Token ID658
Feature activation-2.07e+00

Pos logit contributions
emet+0.068
ervatives+0.065
adan+0.064
_(+0.063
_-+0.063
Neg logit contributions
 Bord-0.072
 NAS-0.067
 BDS-0.060
 KE-0.060
 law-0.058
Token still
Token ID991
Feature activation+1.35e-02

Pos logit contributions
emet+3.365
ervatives+3.261
adan+3.187
_(+3.166
_-+3.114
Neg logit contributions
 Bord-3.521
 NAS-3.255
 BDS-2.919
 KE-2.899
 law-2.770
Token have
Token ID423
Feature activation-4.81e-02

Pos logit contributions
 Bord+0.022
 NAS+0.020
 BDS+0.018
 KE+0.018
 law+0.017
Neg logit contributions
emet-0.024
adan-0.023
ervatives-0.023
_(-0.022
_--0.022
Token the
Token ID262
Feature activation+2.85e-02

Pos logit contributions
emet+0.081
ervatives+0.077
adan+0.077
_(+0.076
stein+0.075
Neg logit contributions
 Bord-0.081
 NAS-0.076
 BDS-0.068
 KE-0.067
 law-0.065
Token potential
Token ID2785
Feature activation-4.63e-01

Pos logit contributions
 Bord+0.049
 NAS+0.045
 BDS+0.040
 KE+0.040
 law+0.039
Neg logit contributions
emet-0.048
ervatives-0.046
adan-0.046
_(-0.045
_--0.044
Token to
Token ID284
Feature activation+7.61e-02

Pos logit contributions
emet+0.799
ervatives+0.771
_(+0.759
adan+0.758
stein+0.738
Neg logit contributions
 Bord-0.747
 NAS-0.685
 BDS-0.610
 KE-0.605
 LIC-0.585
Token games
Token ID1830
Feature activation-5.07e-01

Pos logit contributions
emet+1.204
ervatives+1.155
adan+1.143
_(+1.138
_-+1.123
Neg logit contributions
 Bord-1.055
 NAS-0.975
 BDS-0.861
 KE-0.852
 law-0.826
Token,
Token ID11
Feature activation-7.92e-02

Pos logit contributions
emet+0.844
ervatives+0.802
adan+0.802
_(+0.793
_-+0.777
Neg logit contributions
 Bord-0.868
 NAS-0.802
 BDS-0.715
 KE-0.707
 law-0.693
Token the
Token ID262
Feature activation-6.20e-02

Pos logit contributions
emet+0.138
ervatives+0.132
adan+0.131
_(+0.130
stein+0.127
Neg logit contributions
 Bord-0.131
 NAS-0.120
 BDS-0.107
 KE-0.105
 law-0.104
Token break
Token ID2270
Feature activation-1.19e-01

Pos logit contributions
emet+0.110
ervatives+0.105
adan+0.104
_(+0.103
stein+0.101
Neg logit contributions
 Bord-0.101
 NAS-0.093
 BDS-0.083
 KE-0.082
 law-0.080
Tokenf
Token ID69
Feature activation-1.18e-01

Pos logit contributions
emet+0.185
ervatives+0.175
adan+0.173
_(+0.172
_-+0.168
Neg logit contributions
 Bord-0.218
 NAS-0.203
 BDS-0.183
 KE-0.181
 law-0.179
Tokenasts
Token ID5773
Feature activation-1.92e+00

Pos logit contributions
emet+0.211
ervatives+0.202
adan+0.202
_(+0.200
_-+0.198
Neg logit contributions
 Bord-0.185
 NAS-0.171
 BDS-0.150
 KE-0.148
 law-0.145
Token,
Token ID11
Feature activation-1.19e-01

Pos logit contributions
emet+3.051
ervatives+2.933
adan+2.899
_(+2.862
stein+2.815
Neg logit contributions
 Bord-3.336
 NAS-3.121
 BDS-2.808
 KE-2.780
 law-2.660
Token the
Token ID262
Feature activation-1.18e-01

Pos logit contributions
emet+0.209
ervatives+0.200
adan+0.198
_(+0.197
_-+0.193
Neg logit contributions
 Bord-0.194
 NAS-0.179
 BDS-0.159
 KE-0.157
 law-0.153
Token saying
Token ID2282
Feature activation+1.44e-01

Pos logit contributions
emet+0.207
ervatives+0.198
adan+0.196
_(+0.195
_-+0.190
Neg logit contributions
 Bord-0.193
 NAS-0.178
 BDS-0.159
 KE-0.156
 law-0.153
Token
Token ID564
Feature activation-1.27e-02

Pos logit contributions
 Bord+0.234
 NAS+0.215
 BDS+0.191
 KE+0.188
 law+0.184
Neg logit contributions
emet-0.253
ervatives-0.243
adan-0.240
_(-0.238
_--0.235
Token
Token ID250
Feature activation-4.37e-02

Pos logit contributions
emet+0.022
ervatives+0.021
adan+0.021
_(+0.021
elsius+0.020
Neg logit contributions
 Bord-0.020
 NAS-0.019
 BDS-0.017
 KE-0.016
 law-0.016
Token city
Token ID1748
Feature activation-5.56e-01

Pos logit contributions
 Bord+0.059
 NAS+0.054
 BDS+0.048
 KE+0.047
 law+0.046
Neg logit contributions
emet-0.063
ervatives-0.060
adan-0.060
_(-0.059
_--0.058
Token,
Token ID11
Feature activation-3.37e-02

Pos logit contributions
emet+0.922
ervatives+0.880
adan+0.872
_(+0.863
_-+0.849
Neg logit contributions
 Bord-0.953
 NAS-0.884
 BDS-0.792
 KE-0.783
 law-0.762
Token buzzing
Token ID41719
Feature activation-3.91e-01

Pos logit contributions
emet+0.057
ervatives+0.055
adan+0.054
_(+0.054
stein+0.053
Neg logit contributions
 Bord-0.057
 NAS-0.053
 BDS-0.047
 KE-0.047
 law-0.046
Token with
Token ID351
Feature activation-1.42e-01

Pos logit contributions
emet+0.694
ervatives+0.661
adan+0.654
_(+0.651
_-+0.645
Neg logit contributions
 Bord-0.631
 NAS-0.584
 BDS-0.516
 KE-0.508
 law-0.498
Token r
Token ID374
Feature activation-6.29e-01

Pos logit contributions
emet+0.251
ervatives+0.238
_(+0.238
adan+0.237
stein+0.232
Neg logit contributions
 Bord-0.232
 NAS-0.215
 BDS-0.191
 KE-0.186
 law-0.185
Tokenicks
Token ID3378
Feature activation-1.89e+00

Pos logit contributions
emet+1.142
ervatives+1.102
adan+1.082
_-+1.080
_(+1.079
Neg logit contributions
 Bord-0.967
 NAS-0.886
 BDS-0.773
 KE-0.765
 law-0.746
Tokenh
Token ID71
Feature activation-4.75e-01

Pos logit contributions
emet+2.746
ervatives+2.630
adan+2.598
_(+2.545
stein+2.512
Neg logit contributions
 Bord-3.600
 NAS-3.371
 BDS-3.058
 KE-3.030
 law-2.934
Tokenaws
Token ID8356
Feature activation-1.65e-01

Pos logit contributions
emet+0.985
ervatives+0.948
adan+0.944
_(+0.934
_-+0.925
Neg logit contributions
 Bord-0.617
 NAS-0.558
 BDS-0.479
 KE-0.471
 law-0.450
Token and
Token ID290
Feature activation-1.35e-01

Pos logit contributions
emet+0.277
adan+0.260
ervatives+0.260
_(+0.259
_-+0.255
Neg logit contributions
 Bord-0.280
 NAS-0.257
 BDS-0.231
 KE-0.227
 law-0.225
Token workers
Token ID3259
Feature activation-3.38e-01

Pos logit contributions
emet+0.233
ervatives+0.221
adan+0.220
_(+0.218
stein+0.214
Neg logit contributions
 Bord-0.226
 NAS-0.208
 BDS-0.186
 KE-0.183
 law-0.181
Token to
Token ID284
Feature activation-9.58e-02

Pos logit contributions
emet+0.590
adan+0.558
ervatives+0.558
_(+0.554
_-+0.546
Neg logit contributions
 Bord-0.549
 NAS-0.508
 BDS-0.452
 KE-0.445
 law-0.438
TokenSelf
Token ID24704
Feature activation-7.53e-01

Pos logit contributions
emet+0.794
ervatives+0.755
adan+0.747
_(+0.735
_-+0.722
Neg logit contributions
 Bord-0.969
 NAS-0.908
 BDS-0.821
 KE-0.813
 law-0.788
Token described
Token ID3417
Feature activation+1.20e-01

Pos logit contributions
emet+1.347
ervatives+1.290
adan+1.281
_(+1.267
stein+1.248
Neg logit contributions
 Bord-1.183
 NAS-1.089
 BDS-0.964
 KE-0.956
 law-0.916
Token as
Token ID355
Feature activation-1.27e-01

Pos logit contributions
 Bord+0.186
 NAS+0.171
 BDS+0.152
 KE+0.150
 law+0.144
Neg logit contributions
emet-0.217
ervatives-0.208
adan-0.207
_(-0.204
_--0.202
Token "
Token ID366
Feature activation+8.22e-02

Pos logit contributions
emet+0.220
ervatives+0.210
adan+0.209
_(+0.207
_-+0.204
Neg logit contributions
 Bord-0.205
 NAS-0.190
 BDS-0.168
 KE-0.166
 law-0.160
Tokenf
Token ID69
Feature activation-1.33e-01

Pos logit contributions
 Bord+0.153
 NAS+0.143
 BDS+0.130
 KE+0.129
 law+0.125
Neg logit contributions
emet-0.123
ervatives-0.117
adan-0.116
_(-0.114
_--0.112
Tokenucked
Token ID17758
Feature activation-1.85e+00

Pos logit contributions
emet+0.214
ervatives+0.204
adan+0.203
_(+0.200
_-+0.198
Neg logit contributions
 Bord-0.232
 NAS-0.215
 BDS-0.193
 KE-0.191
 law-0.185
Token up
Token ID510
Feature activation-1.50e-01

Pos logit contributions
emet+2.922
ervatives+2.801
adan+2.776
_(+2.748
_-+2.699
Neg logit contributions
 Bord-3.263
 NAS-3.041
 BDS-2.725
 KE-2.705
 law-2.608
Token Rock
Token ID4631
Feature activation-6.44e-01

Pos logit contributions
emet+0.256
ervatives+0.243
adan+0.242
_(+0.237
_-+0.233
Neg logit contributions
 Bord-0.251
 NAS-0.233
 BDS-0.209
 KE-0.205
 law-0.201
Token and
Token ID290
Feature activation+4.88e-02

Pos logit contributions
emet+1.067
ervatives+1.017
adan+1.005
_(+0.994
_-+0.980
Neg logit contributions
 Bord-1.106
 NAS-1.030
 BDS-0.922
 KE-0.909
 law-0.885
Token Roll
Token ID8299
Feature activation-5.26e-01

Pos logit contributions
 Bord+0.061
 NAS+0.055
 BDS+0.046
 KE+0.045
 law+0.044
Neg logit contributions
emet-0.105
ervatives-0.101
adan-0.100
_(-0.099
_--0.098
Token,"
Token ID553
Feature activation+2.85e-02

Pos logit contributions
emet+0.879
ervatives+0.835
adan+0.828
_(+0.816
_-+0.807
Neg logit contributions
 Bord-0.901
 NAS-0.836
 BDS-0.749
 KE-0.739
 law-0.720
Token\n
Token ID198
Feature activation-1.41e-02

Pos logit contributions
 Bord+0.055
 NAS+0.052
 BDS+0.045
 KE+0.044
 law+0.043
Neg logit contributions
emet-0.069
avorite-0.066
ervatives-0.066
adan-0.066
elsius-0.065
Tokenre
Token ID260
Feature activation-3.49e-01

Pos logit contributions
emet+0.020
ervatives+0.019
adan+0.019
_(+0.019
stein+0.018
Neg logit contributions
 Bord-0.027
 NAS-0.025
 BDS-0.023
 KE-0.023
 law-0.022
Tokeni
Token ID72
Feature activation-5.22e-01

Pos logit contributions
emet+0.530
ervatives+0.507
adan+0.498
_(+0.492
stein+0.484
Neg logit contributions
 Bord-0.642
 NAS-0.600
 BDS-0.540
 KE-0.537
 law-0.522
Token Profile
Token ID13118
Feature activation-6.81e-01

Pos logit contributions
emet+0.736
ervatives+0.703
adan+0.691
_(+0.682
_-+0.667
Neg logit contributions
 Bord-1.014
 NAS-0.952
 BDS-0.864
 KE-0.859
 law-0.836
Token Blog
Token ID14001
Feature activation-1.01e+00

Pos logit contributions
emet+0.726
ervatives+0.675
adan+0.672
_(+0.662
_-+0.648
Neg logit contributions
 Bord-1.554
 NAS-1.467
 BDS-1.353
 KE-1.340
 law-1.314
Token Joined
Token ID16709
Feature activation-1.83e+00

Pos logit contributions
emet+1.077
ervatives+1.000
adan+0.973
elsius+0.920
_(+0.916
Neg logit contributions
 Bord-2.377
 NAS-2.270
 BDS-2.095
 KE-2.070
 law-2.028
Token October
Token ID3267
Feature activation-4.70e-01

Pos logit contributions
emet+3.148
ervatives+3.003
adan+2.984
_(+2.936
_-+2.890
Neg logit contributions
 Bord-3.033
 NAS-2.818
 BDS-2.508
 KE-2.480
 law-2.394
Token 2002
Token ID6244
Feature activation+9.12e-02

Pos logit contributions
emet+0.818
adan+0.781
ervatives+0.777
_(+0.752
elsius+0.750
Neg logit contributions
 Bord-0.792
 NAS-0.741
 BDS-0.655
 KE-0.644
 law-0.626
Token United
Token ID1578
Feature activation-1.47e+00

Pos logit contributions
 Bord+0.136
 NAS+0.126
 BDS+0.109
 KE+0.108
 law+0.104
Neg logit contributions
emet-0.174
ervatives-0.168
adan-0.167
_(-0.163
_--0.161
Token States
Token ID1829
Feature activation-1.21e+00

Pos logit contributions
emet+1.971
adan+1.866
ervatives+1.850
_(+1.816
_-+1.785
Neg logit contributions
 Bord-2.843
 NAS-2.685
 BDS-2.451
 KE-2.404
 abduction-2.361
Token 3
Token ID513
Feature activation-6.17e-01

Pos logit contributions
emet+2.029
ervatives+1.938
adan+1.924
_(+1.892
_-+1.865
Neg logit contributions
 Bord-2.075
 NAS-1.929
 BDS-1.725
 KE-1.705
 law-1.649
Token\n
Token ID198
Feature activation+1.09e-01

Pos logit contributions
 Bord+0.403
 NAS+0.383
 BDS+0.336
 KE+0.326
 law+0.314
Neg logit contributions
emet-0.524
avorite-0.502
ervatives-0.498
elsius-0.494
adan-0.493
Tokenpur
Token ID14225
Feature activation-3.44e-01

Pos logit contributions
 Bord+0.206
 NAS+0.195
 BDS+0.177
 KE+0.175
 law+0.170
Neg logit contributions
emet-0.158
ervatives-0.150
adan-0.147
_(-0.144
elsius-0.142
Tokenak
Token ID461
Feature activation-3.27e-01

Pos logit contributions
emet+0.506
ervatives+0.483
adan+0.474
_(+0.469
_-+0.462
Neg logit contributions
 Bord-0.649
 NAS-0.606
 BDS-0.547
 KE-0.545
 law-0.530
Tokenushi
Token ID17731
Feature activation-3.04e-01

Pos logit contributions
emet+0.500
ervatives+0.481
adan+0.469
_(+0.467
_-+0.457
Neg logit contributions
 Bord-0.598
 NAS-0.560
 BDS-0.503
 KE-0.501
 law-0.487
Token Profile
Token ID13118
Feature activation-4.17e-01

Pos logit contributions
emet+0.483
ervatives+0.461
adan+0.453
_(+0.446
stein+0.440
Neg logit contributions
 Bord-0.544
 NAS-0.510
 BDS-0.456
 KE-0.456
 law-0.438
Token Joined
Token ID16709
Feature activation-1.80e+00

Pos logit contributions
emet+0.467
ervatives+0.433
adan+0.426
_(+0.418
stein+0.407
Neg logit contributions
 Bord-0.944
 NAS-0.895
 BDS-0.826
 KE-0.821
 law-0.799
Token August
Token ID2932
Feature activation-8.90e-01

Pos logit contributions
emet+3.113
ervatives+2.963
adan+2.947
_(+2.897
_-+2.852
Neg logit contributions
 Bord-2.988
 NAS-2.781
 BDS-2.474
 KE-2.446
 law-2.360
Token 2012
Token ID2321
Feature activation-5.76e-01

Pos logit contributions
emet+1.506
adan+1.427
ervatives+1.427
_(+1.380
_-+1.373
Neg logit contributions
 Bord-1.530
 NAS-1.436
 BDS-1.283
 KE-1.266
 law-1.222
Token United
Token ID1578
Feature activation-1.68e+00

Pos logit contributions
emet+1.094
ervatives+1.050
adan+1.045
_(+1.027
_-+1.015
Neg logit contributions
 Bord-0.856
 NAS-0.788
 BDS-0.690
 KE-0.681
 law-0.653
Token States
Token ID1829
Feature activation-1.39e+00

Pos logit contributions
emet+2.425
adan+2.290
ervatives+2.285
_(+2.243
_-+2.204
Neg logit contributions
 Bord-3.161
 NAS-2.977
 BDS-2.685
 KE-2.638
 LIC-2.567
Token 3
Token ID513
Feature activation-6.99e-01

Pos logit contributions
emet+2.330
ervatives+2.224
adan+2.210
_(+2.174
_-+2.145
Neg logit contributions
 Bord-2.372
 NAS-2.203
 BDS-1.971
 KE-1.947
 law-1.882
Token\n
Token ID198
Feature activation-8.96e-02

Pos logit contributions
emet+0.075
avorite+0.072
ervatives+0.071
elsius+0.071
adan+0.071
Neg logit contributions
 Bord-0.058
 NAS-0.054
 BDS-0.047
 KE-0.046
 law-0.045
TokenBig
Token ID12804
Feature activation+1.09e-02

Pos logit contributions
emet+0.132
ervatives+0.127
adan+0.125
_(+0.122
stein+0.120
Neg logit contributions
 Bord-0.167
 NAS-0.157
 BDS-0.142
 KE-0.141
 law-0.137
TokenFan
Token ID22480
Feature activation-2.31e-01

Pos logit contributions
 Bord+0.020
 NAS+0.018
 BDS+0.017
 KE+0.016
 law+0.016
Neg logit contributions
emet-0.017
ervatives-0.016
adan-0.016
_(-0.016
stein-0.015
Token Profile
Token ID13118
Feature activation-5.80e-01

Pos logit contributions
emet+0.334
ervatives+0.319
adan+0.312
_(+0.307
_-+0.301
Neg logit contributions
 Bord-0.442
 NAS-0.416
 BDS-0.376
 KE-0.373
 law-0.362
Token Blog
Token ID14001
Feature activation-1.00e+00

Pos logit contributions
emet+0.678
ervatives+0.633
adan+0.631
_(+0.621
_-+0.608
Neg logit contributions
 Bord-1.267
 NAS-1.194
 BDS-1.097
 KE-1.087
 law-1.063
Token Joined
Token ID16709
Feature activation-1.77e+00

Pos logit contributions
emet+1.042
ervatives+0.968
adan+0.941
_(+0.899
elsius+0.894
Neg logit contributions
 Bord-2.379
 NAS-2.279
 BDS-2.099
 KE-2.078
 law-2.037
Token December
Token ID3426
Feature activation-4.84e-01

Pos logit contributions
emet+3.044
ervatives+2.904
adan+2.888
_(+2.844
_-+2.799
Neg logit contributions
 Bord-2.945
 NAS-2.735
 BDS-2.433
 KE-2.406
 law-2.324
Token 2010
Token ID3050
Feature activation-3.85e-01

Pos logit contributions
emet+0.832
adan+0.793
ervatives+0.792
elsius+0.767
_(+0.767
Neg logit contributions
 Bord-0.830
 NAS-0.774
 BDS-0.689
 KE-0.678
 law-0.653
Token TL
Token ID24811
Feature activation-9.52e-02

Pos logit contributions
emet+0.715
ervatives+0.686
adan+0.682
_(+0.675
_-+0.665
Neg logit contributions
 Bord-0.580
 NAS-0.532
 BDS-0.468
 KE-0.462
 law-0.445
TokenAD
Token ID2885
Feature activation-4.04e-01

Pos logit contributions
emet+0.164
ervatives+0.156
adan+0.155
_(+0.152
_-+0.150
Neg logit contributions
 Bord-0.159
 NAS-0.147
 BDS-0.131
 KE-0.129
 law-0.125
TokenT
Token ID51
Feature activation-7.52e-01

Pos logit contributions
emet+0.518
ervatives+0.505
adan+0.492
_(+0.485
stein+0.475
Neg logit contributions
 Bord-0.842
 NAS-0.794
 BDS-0.719
 KE-0.717
 law-0.698
Token them
Token ID606
Feature activation+2.32e-01

Pos logit contributions
emet+0.052
ervatives+0.049
adan+0.049
_(+0.048
_-+0.047
Neg logit contributions
 Bord-0.052
 NAS-0.048
 BDS-0.043
 KE-0.043
 law-0.042
Token.
Token ID13
Feature activation-5.44e-02

Pos logit contributions
 Bord+0.385
 NAS+0.353
 BDS+0.316
 KE+0.311
 law+0.308
Neg logit contributions
emet-0.400
adan-0.381
ervatives-0.379
_(-0.373
_--0.368
Token
Token ID198
Feature activation-1.57e-01

Pos logit contributions
emet+0.097
ervatives+0.092
adan+0.092
_(+0.089
elsius+0.089
Neg logit contributions
 Bord-0.089
 NAS-0.082
 BDS-0.073
 KE-0.072
 law-0.069
Token
Token ID198
Feature activation-1.25e-01

Pos logit contributions
emet+0.296
ervatives+0.283
avorite+0.282
adan+0.280
elsius+0.280
Neg logit contributions
 Bord-0.237
 NAS-0.222
 BDS-0.195
 KE-0.191
 law-0.185
TokenF
Token ID37
Feature activation-7.13e-02

Pos logit contributions
emet+0.188
ervatives+0.180
adan+0.177
elsius+0.172
_(+0.170
Neg logit contributions
 Bord-0.230
 NAS-0.217
 BDS-0.195
 KE-0.193
 law-0.187
Tokenalling
Token ID9221
Feature activation-1.77e+00

Pos logit contributions
emet+0.114
adan+0.108
ervatives+0.108
_(+0.106
stein+0.105
Neg logit contributions
 Bord-0.124
 NAS-0.116
 BDS-0.105
 KE-0.103
 law-0.101
Token together
Token ID1978
Feature activation-3.10e-01

Pos logit contributions
emet+3.019
ervatives+2.888
adan+2.874
_(+2.844
_-+2.799
Neg logit contributions
 Bord-2.906
 NAS-2.691
 BDS-2.396
 KE-2.371
 law-2.282
Token in
Token ID287
Feature activation-1.36e-02

Pos logit contributions
emet+0.534
ervatives+0.505
adan+0.499
_(+0.493
_-+0.485
Neg logit contributions
 Bord-0.519
 NAS-0.480
 BDS-0.427
 KE-0.423
 law-0.416
Token a
Token ID257
Feature activation+6.78e-02

Pos logit contributions
emet+0.024
ervatives+0.023
adan+0.023
_(+0.023
stein+0.022
Neg logit contributions
 Bord-0.022
 NAS-0.020
 BDS-0.018
 KE-0.018
 law-0.017
Token cr
Token ID1067
Feature activation-5.23e-02

Pos logit contributions
 Bord+0.111
 NAS+0.104
 BDS+0.093
 KE+0.090
 law+0.090
Neg logit contributions
emet-0.119
ervatives-0.113
adan-0.113
_(-0.111
_--0.109
Tokenum
Token ID388
Feature activation-9.12e-01

Pos logit contributions
emet+0.073
ervatives+0.070
adan+0.069
_-+0.068
_(+0.068
Neg logit contributions
 Bord-0.100
 NAS-0.094
 KE-0.085
 BDS-0.084
 law-0.083
Token
Token ID198
Feature activation+8.43e-02

Pos logit contributions
 Bord+0.260
 NAS+0.246
 BDS+0.214
 KE+0.208
 law+0.202
Neg logit contributions
emet-0.337
avorite-0.327
ervatives-0.323
elsius-0.320
adan-0.319
Tokenheart
Token ID11499
Feature activation-6.49e-01

Pos logit contributions
 Bord+0.161
 NAS+0.152
 BDS+0.137
 KE+0.136
 law+0.133
Neg logit contributions
emet-0.120
ervatives-0.115
adan-0.113
_(-0.111
stein-0.109
Tokenl
Token ID75
Feature activation+9.18e-02

Pos logit contributions
emet+0.948
ervatives+0.903
adan+0.891
_(+0.881
_-+0.861
Neg logit contributions
 Bord-1.232
 NAS-1.154
 BDS-1.045
 KE-1.037
 law-1.009
Tokenxp
Token ID42372
Feature activation-8.00e-01

Pos logit contributions
 Bord+0.173
 NAS+0.160
 KE+0.145
 BDS+0.144
 law+0.139
Neg logit contributions
emet-0.134
ervatives-0.129
_(-0.126
adan-0.126
_--0.123
Token Profile
Token ID13118
Feature activation-4.68e-01

Pos logit contributions
emet+1.186
ervatives+1.129
adan+1.117
_(+1.104
_-+1.078
Neg logit contributions
 Bord-1.502
 NAS-1.405
 BDS-1.270
 KE-1.261
 law-1.228
Token Joined
Token ID16709
Feature activation-1.72e+00

Pos logit contributions
emet+0.520
ervatives+0.485
adan+0.479
_(+0.470
_-+0.459
Neg logit contributions
 Bord-1.055
 NAS-0.998
 BDS-0.920
 KE-0.913
 law-0.891
Token September
Token ID2693
Feature activation-5.65e-01

Pos logit contributions
emet+3.000
ervatives+2.854
adan+2.841
_(+2.789
elsius+2.744
Neg logit contributions
 Bord-2.825
 NAS-2.630
 BDS-2.334
 KE-2.307
 law-2.227
Token 2010
Token ID3050
Feature activation-2.99e-01

Pos logit contributions
emet+0.965
adan+0.916
ervatives+0.915
_(+0.888
elsius+0.884
Neg logit contributions
 Bord-0.965
 NAS-0.900
 BDS-0.803
 KE-0.794
 law-0.768
Token United
Token ID1578
Feature activation-1.50e+00

Pos logit contributions
emet+0.572
ervatives+0.549
adan+0.546
_(+0.537
_-+0.531
Neg logit contributions
 Bord-0.441
 NAS-0.406
 BDS-0.354
 KE-0.350
 law-0.336
Token States
Token ID1829
Feature activation-1.08e+00

Pos logit contributions
emet+1.999
adan+1.901
ervatives+1.893
_(+1.846
stein+1.816
Neg logit contributions
 Bord-2.897
 NAS-2.742
 BDS-2.494
 KE-2.456
 abduction-2.412
Token 10
Token ID838
Feature activation-2.27e-01

Pos logit contributions
emet+1.813
ervatives+1.731
adan+1.719
_(+1.686
_-+1.665
Neg logit contributions
 Bord-1.835
 NAS-1.708
 BDS-1.527
 KE-1.508
 law-1.458
Token
Token ID198
Feature activation+4.92e-02

Pos logit contributions
 Bord+0.213
 NAS+0.201
 BDS+0.175
 KE+0.171
 law+0.166
Neg logit contributions
emet-0.273
avorite-0.264
ervatives-0.263
adan-0.260
elsius-0.260
TokenDark
Token ID17367
Feature activation-1.35e-01

Pos logit contributions
 Bord+0.093
 NAS+0.088
 BDS+0.080
 KE+0.079
 law+0.077
Neg logit contributions
emet-0.070
ervatives-0.067
adan-0.066
_(-0.065
stein-0.064
Tokenhorse
Token ID30527
Feature activation-9.05e-01

Pos logit contributions
emet+0.188
ervatives+0.180
adan+0.176
_(+0.173
_-+0.169
Neg logit contributions
 Bord-0.266
 NAS-0.249
 BDS-0.227
 KE-0.226
 law-0.221
Token Profile
Token ID13118
Feature activation-4.36e-01

Pos logit contributions
emet+1.277
ervatives+1.209
adan+1.199
_(+1.180
_-+1.157
Neg logit contributions
 Bord-1.770
 NAS-1.659
 BDS-1.508
 KE-1.494
 law-1.453
Token Blog
Token ID14001
Feature activation-8.19e-01

Pos logit contributions
emet+0.494
ervatives+0.460
adan+0.459
_(+0.452
_-+0.442
Neg logit contributions
 Bord-0.967
 NAS-0.912
 BDS-0.839
 KE-0.831
 law-0.814
Token Joined
Token ID16709
Feature activation-1.70e+00

Pos logit contributions
emet+0.878
ervatives+0.812
adan+0.792
elsius+0.758
_(+0.750
Neg logit contributions
 Bord-1.919
 NAS-1.842
 BDS-1.695
 KE-1.674
 law-1.640
Token December
Token ID3426
Feature activation-4.13e-01

Pos logit contributions
emet+2.941
ervatives+2.804
adan+2.788
_(+2.744
_-+2.701
Neg logit contributions
 Bord-2.827
 NAS-2.628
 BDS-2.336
 KE-2.310
 law-2.231
Token 2011
Token ID2813
Feature activation-5.31e-01

Pos logit contributions
emet+0.710
adan+0.675
ervatives+0.674
_(+0.656
elsius+0.654
Neg logit contributions
 Bord-0.705
 NAS-0.658
 BDS-0.584
 KE-0.576
 law-0.555
Token United
Token ID1578
Feature activation-1.54e+00

Pos logit contributions
emet+1.007
ervatives+0.968
adan+0.962
_(+0.949
_-+0.937
Neg logit contributions
 Bord-0.786
 NAS-0.723
 BDS-0.632
 KE-0.623
 law-0.600
Token States
Token ID1829
Feature activation-1.16e+00

Pos logit contributions
emet+2.061
ervatives+1.952
adan+1.951
_(+1.903
_-+1.860
Neg logit contributions
 Bord-2.973
 NAS-2.809
 BDS-2.553
 KE-2.517
BLE-2.474
Token 228
Token ID29041
Feature activation-2.25e-01

Pos logit contributions
emet+1.935
ervatives+1.846
adan+1.835
_(+1.802
_-+1.778
Neg logit contributions
 Bord-1.983
 NAS-1.844
 BDS-1.650
 KE-1.630
 law-1.577
TokenJ
Token ID41
Feature activation-4.72e-01

Pos logit contributions
emet+0.201
ervatives+0.193
adan+0.189
_(+0.187
stein+0.183
Neg logit contributions
 Bord-0.266
 NAS-0.250
 BDS-0.226
 KE-0.224
 law-0.218
Tokenaaaa
Token ID24794
Feature activation-3.63e-02

Pos logit contributions
emet+0.712
ervatives+0.685
adan+0.673
_(+0.666
_-+0.651
Neg logit contributions
 Bord-0.868
 NAS-0.811
 BDS-0.729
 KE-0.725
 law-0.704
Tokenasper
Token ID32981
Feature activation+1.77e-01

Pos logit contributions
emet+0.051
ervatives+0.049
adan+0.048
_(+0.047
_-+0.046
Neg logit contributions
 Bord-0.071
 NAS-0.066
 BDS-0.060
 KE-0.059
 law-0.058
Token Profile
Token ID13118
Feature activation-5.46e-01

Pos logit contributions
 Bord+0.335
 NAS+0.314
 BDS+0.284
 KE+0.283
 law+0.275
Neg logit contributions
emet-0.257
ervatives-0.245
adan-0.241
_(-0.237
elsius-0.232
Token Blog
Token ID14001
Feature activation-9.04e-01

Pos logit contributions
emet+0.614
ervatives+0.573
adan+0.568
_(+0.558
_-+0.545
Neg logit contributions
 Bord-1.221
 NAS-1.154
 BDS-1.062
 KE-1.053
 law-1.029
Token Joined
Token ID16709
Feature activation-1.70e+00

Pos logit contributions
emet+0.970
ervatives+0.902
adan+0.881
elsius+0.839
_(+0.826
Neg logit contributions
 Bord-2.117
 NAS-2.026
 BDS-1.869
 KE-1.844
 law-1.805
Token April
Token ID3035
Feature activation-6.81e-01

Pos logit contributions
emet+2.947
ervatives+2.808
adan+2.793
_(+2.747
_-+2.704
Neg logit contributions
 Bord-2.809
 NAS-2.612
 BDS-2.320
 KE-2.295
 law-2.214
Token 2012
Token ID2321
Feature activation-2.41e-01

Pos logit contributions
emet+1.181
ervatives+1.120
adan+1.118
_(+1.091
elsius+1.088
Neg logit contributions
 Bord-1.155
 NAS-1.079
 BDS-0.958
 KE-0.943
 law-0.911
Token United
Token ID1578
Feature activation-1.58e+00

Pos logit contributions
emet+0.455
ervatives+0.437
adan+0.434
_(+0.427
_-+0.422
Neg logit contributions
 Bord-0.361
 NAS-0.332
 BDS-0.291
 KE-0.287
 law-0.275
Token States
Token ID1829
Feature activation-1.19e+00

Pos logit contributions
emet+2.071
adan+1.961
ervatives+1.949
_(+1.903
_-+1.868
Neg logit contributions
 Bord-3.056
 NAS-2.893
 BDS-2.638
 KE-2.597
BLE-2.564
Token 5
Token ID642
Feature activation-8.46e-02

Pos logit contributions
emet+1.993
ervatives+1.903
adan+1.890
_(+1.859
_-+1.833
Neg logit contributions
 Bord-2.033
 NAS-1.889
 BDS-1.689
 KE-1.670
 law-1.614
TokenTB
Token ID22737
Feature activation+3.35e-01

Pos logit contributions
 Bord+0.100
 NAS+0.095
 BDS+0.086
 KE+0.085
 law+0.083
Neg logit contributions
emet-0.076
ervatives-0.073
adan-0.072
_(-0.071
stein-0.069
Tokenone
Token ID505
Feature activation-2.60e-01

Pos logit contributions
 Bord+0.639
 NAS+0.599
 BDS+0.541
 KE+0.539
 law+0.524
Neg logit contributions
emet-0.483
ervatives-0.464
adan-0.455
_(-0.446
_--0.434
Token-
Token ID12
Feature activation-1.92e-01

Pos logit contributions
emet+0.374
ervatives+0.359
adan+0.351
_(+0.347
_-+0.340
Neg logit contributions
 Bord-0.498
 NAS-0.467
 BDS-0.421
 KE-0.418
 law-0.409
Token Profile
Token ID13118
Feature activation-3.52e-01

Pos logit contributions
emet+0.274
ervatives+0.261
adan+0.257
_(+0.254
stein+0.245
Neg logit contributions
 Bord-0.370
 NAS-0.348
 BDS-0.314
 KE-0.313
 law-0.305
Token Blog
Token ID14001
Feature activation-9.18e-01

Pos logit contributions
emet+0.404
ervatives+0.377
adan+0.375
_(+0.370
_-+0.362
Neg logit contributions
 Bord-0.777
 NAS-0.733
 BDS-0.674
 KE-0.668
 law-0.654
Token Joined
Token ID16709
Feature activation-1.70e+00

Pos logit contributions
emet+0.971
ervatives+0.906
adan+0.877
elsius+0.838
_(+0.838
Neg logit contributions
 Bord-2.152
 NAS-2.067
 BDS-1.903
 KE-1.878
 law-1.845
Token November
Token ID3389
Feature activation-4.31e-01

Pos logit contributions
emet+2.921
ervatives+2.787
adan+2.771
_(+2.728
_-+2.685
Neg logit contributions
 Bord-2.818
 NAS-2.618
 BDS-2.329
 KE-2.302
 law-2.224
Token 2010
Token ID3050
Feature activation-2.31e-01

Pos logit contributions
emet+0.735
ervatives+0.699
adan+0.699
_(+0.678
elsius+0.676
Neg logit contributions
 Bord-0.744
 NAS-0.695
 BDS-0.618
 KE-0.610
 law-0.591
Token United
Token ID1578
Feature activation-1.51e+00

Pos logit contributions
emet+0.433
ervatives+0.416
adan+0.413
_(+0.408
_-+0.403
Neg logit contributions
 Bord-0.347
 NAS-0.319
 BDS-0.280
 KE-0.276
 law-0.266
Token States
Token ID1829
Feature activation-1.15e+00

Pos logit contributions
emet+2.009
adan+1.895
ervatives+1.889
_(+1.851
_-+1.813
Neg logit contributions
 Bord-2.928
 NAS-2.767
 BDS-2.526
 KE-2.486
 abduction-2.449
Token 21
Token ID2310
Feature activation-1.85e-01

Pos logit contributions
emet+1.915
ervatives+1.829
adan+1.816
_(+1.783
_-+1.759
Neg logit contributions
 Bord-1.966
 NAS-1.829
 BDS-1.636
 KE-1.616
 law-1.563
Tokenada
Token ID4763
Feature activation-2.66e-01

Pos logit contributions
emet+0.503
ervatives+0.486
adan+0.474
_(+0.473
stein+0.463
Neg logit contributions
 Bord-0.630
 NAS-0.588
 BDS-0.528
 KE-0.526
 law-0.513
Tokenver
Token ID332
Feature activation+4.68e-01

Pos logit contributions
emet+0.409
ervatives+0.395
adan+0.386
_(+0.380
_-+0.375
Neg logit contributions
 Bord-0.483
 NAS-0.453
 BDS-0.405
 KE-0.405
 law-0.394
TokenBB
Token ID15199
Feature activation+8.65e-02

Pos logit contributions
 Bord+0.865
 NAS+0.810
 BDS+0.729
 KE+0.727
 law+0.707
Neg logit contributions
emet-0.705
ervatives-0.673
adan-0.652
_(-0.649
elsius-0.637
Token Profile
Token ID13118
Feature activation-6.87e-01

Pos logit contributions
 Bord+0.162
 NAS+0.152
 BDS+0.138
 KE+0.137
 law+0.133
Neg logit contributions
emet-0.128
ervatives-0.123
adan-0.120
_(-0.119
_--0.115
Token Blog
Token ID14001
Feature activation-9.08e-01

Pos logit contributions
emet+0.857
ervatives+0.805
adan+0.799
_(+0.786
_-+0.769
Neg logit contributions
 Bord-1.453
 NAS-1.368
 BDS-1.253
 KE-1.243
 law-1.212
Token Joined
Token ID16709
Feature activation-1.69e+00

Pos logit contributions
emet+0.962
ervatives+0.894
adan+0.875
elsius+0.834
_(+0.829
Neg logit contributions
 Bord-2.134
 NAS-2.049
 BDS-1.889
 KE-1.865
 law-1.828
Token June
Token ID2795
Feature activation-8.97e-01

Pos logit contributions
emet+2.914
ervatives+2.779
adan+2.764
_(+2.719
_-+2.676
Neg logit contributions
 Bord-2.807
 NAS-2.610
 BDS-2.321
 KE-2.294
 law-2.216
Token 2009
Token ID3717
Feature activation-2.19e-01

Pos logit contributions
emet+1.528
ervatives+1.453
adan+1.452
_(+1.411
elsius+1.400
Neg logit contributions
 Bord-1.542
 NAS-1.437
 BDS-1.281
 KE-1.263
 law-1.218
Token Germany
Token ID4486
Feature activation-7.22e-01

Pos logit contributions
emet+0.411
ervatives+0.395
adan+0.392
_(+0.387
_-+0.382
Neg logit contributions
 Bord-0.329
 NAS-0.304
 BDS-0.266
 KE-0.263
 law-0.253
Token 247
Token ID30179
Feature activation-3.82e-02

Pos logit contributions
emet+1.230
ervatives+1.173
adan+1.166
_(+1.136
elsius+1.127
Neg logit contributions
 Bord-1.230
 NAS-1.148
 BDS-1.028
 KE-1.013
 law-0.976
Token34
Token ID2682
Feature activation-2.56e-01

Pos logit contributions
emet+0.054
ervatives+0.051
adan+0.050
elsius+0.049
_(+0.049
Neg logit contributions
 Bord-0.076
 NAS-0.072
 BDS-0.065
 KE-0.065
 LIC-0.063
Token superb
Token ID21840
Feature activation+2.39e-01

Pos logit contributions
 Bord+0.058
 NAS+0.053
 BDS+0.048
 KE+0.047
 law+0.046
Neg logit contributions
emet-0.063
ervatives-0.060
adan-0.060
_(-0.059
_--0.058
Token.
Token ID13
Feature activation-8.72e-02

Pos logit contributions
 Bord+0.411
 NAS+0.380
 BDS+0.343
 KE+0.334
 law+0.331
Neg logit contributions
emet-0.399
ervatives-0.377
adan-0.375
_(-0.370
_--0.364
Token
Token ID198
Feature activation-4.30e-03

Pos logit contributions
emet+0.156
ervatives+0.149
adan+0.148
_(+0.145
elsius+0.143
Neg logit contributions
 Bord-0.141
 NAS-0.129
 BDS-0.115
 KE-0.113
 law-0.108
Token
Token ID198
Feature activation-5.41e-02

Pos logit contributions
emet+0.008
avorite+0.008
ervatives+0.008
adan+0.008
elsius+0.008
Neg logit contributions
 Bord-0.006
 NAS-0.006
 BDS-0.005
 KE-0.005
 law-0.005
TokenF
Token ID37
Feature activation-1.13e-01

Pos logit contributions
emet+0.082
ervatives+0.078
adan+0.077
elsius+0.075
_(+0.075
Neg logit contributions
 Bord-0.100
 NAS-0.093
 BDS-0.084
 KE-0.083
-0.081
Tokenalling
Token ID9221
Feature activation-1.69e+00

Pos logit contributions
emet+0.208
ervatives+0.199
adan+0.198
_(+0.196
_-+0.192
Neg logit contributions
 Bord-0.171
 NAS-0.156
 BDS-0.138
 KE-0.135
 law-0.132
Tokenwater
Token ID7050
Feature activation-2.51e-01

Pos logit contributions
emet+2.818
ervatives+2.693
adan+2.673
_(+2.639
_-+2.600
Neg logit contributions
 Bord-2.866
 NAS-2.654
 BDS-2.373
 KE-2.343
 law-2.283
Token is
Token ID318
Feature activation-1.96e-02

Pos logit contributions
emet+0.435
ervatives+0.415
adan+0.413
_(+0.406
_-+0.400
Neg logit contributions
 Bord-0.414
 NAS-0.382
 BDS-0.339
 KE-0.335
 law-0.326
Token Wright
Token ID12206
Feature activation+3.06e-01

Pos logit contributions
emet+0.036
ervatives+0.034
adan+0.034
_(+0.033
_-+0.033
Neg logit contributions
 Bord-0.031
 NAS-0.028
 BDS-0.025
 KE-0.025
 law-0.024
Token
Token ID447
Feature activation-2.48e-01

Pos logit contributions
 Bord+0.511
 NAS+0.476
 BDS+0.425
 KE+0.419
 law+0.404
Neg logit contributions
emet-0.517
ervatives-0.493
adan-0.490
_(-0.485
_--0.478
Token
Token ID247
Feature activation-2.14e-01

Pos logit contributions
emet+0.390
adan+0.366
ervatives+0.364
_(+0.352
elsius+0.349
Neg logit contributions
 Bord-0.454
 NAS-0.416
 BDS-0.377
 law-0.375
 KE-0.371
Tokenushi
Token ID17731
Feature activation-3.04e-01

Pos logit contributions
emet+0.500
ervatives+0.481
adan+0.469
_(+0.467
_-+0.457
Neg logit contributions
 Bord-0.598
 NAS-0.560
 BDS-0.503
 KE-0.501
 law-0.487
Token Profile
Token ID13118
Feature activation-4.17e-01

Pos logit contributions
emet+0.483
ervatives+0.461
adan+0.453
_(+0.446
stein+0.440
Neg logit contributions
 Bord-0.544
 NAS-0.510
 BDS-0.456
 KE-0.456
 law-0.438
Token Joined
Token ID16709
Feature activation-1.80e+00

Pos logit contributions
emet+0.467
ervatives+0.433
adan+0.426
_(+0.418
stein+0.407
Neg logit contributions
 Bord-0.944
 NAS-0.895
 BDS-0.826
 KE-0.821
 law-0.799
Token August
Token ID2932
Feature activation-8.90e-01

Pos logit contributions
emet+3.113
ervatives+2.963
adan+2.947
_(+2.897
_-+2.852
Neg logit contributions
 Bord-2.988
 NAS-2.781
 BDS-2.474
 KE-2.446
 law-2.360
Token 2012
Token ID2321
Feature activation-5.76e-01

Pos logit contributions
emet+1.506
adan+1.427
ervatives+1.427
_(+1.380
_-+1.373
Neg logit contributions
 Bord-1.530
 NAS-1.436
 BDS-1.283
 KE-1.266
 law-1.222
Token United
Token ID1578
Feature activation-1.68e+00

Pos logit contributions
emet+1.094
ervatives+1.050
adan+1.045
_(+1.027
_-+1.015
Neg logit contributions
 Bord-0.856
 NAS-0.788
 BDS-0.690
 KE-0.681
 law-0.653
Token States
Token ID1829
Feature activation-1.39e+00

Pos logit contributions
emet+2.425
adan+2.290
ervatives+2.285
_(+2.243
_-+2.204
Neg logit contributions
 Bord-3.161
 NAS-2.977
 BDS-2.685
 KE-2.638
 LIC-2.567
Token 3
Token ID513
Feature activation-6.99e-01

Pos logit contributions
emet+2.330
ervatives+2.224
adan+2.210
_(+2.174
_-+2.145
Neg logit contributions
 Bord-2.372
 NAS-2.203
 BDS-1.971
 KE-1.947
 law-1.882
Token240
Token ID16102
Feature activation-7.04e-02

Pos logit contributions
emet+0.905
ervatives+0.852
adan+0.843
_(+0.826
_-+0.810
Neg logit contributions
 Bord-1.452
 NAS-1.371
 BDS-1.250
 KE-1.241
 law-1.209
Token Posts
Token ID12043
Feature activation-1.16e+00

Pos logit contributions
emet+0.100
ervatives+0.094
adan+0.093
_(+0.092
stein+0.089
Neg logit contributions
 Bord-0.139
 NAS-0.132
 BDS-0.120
 KE-0.119
 law-0.115
Token #
Token ID1303
Feature activation-3.42e-01

Pos logit contributions
emet+1.906
ervatives+1.831
adan+1.814
_(+1.807
_-+1.777
Neg logit contributions
 Bord-1.970
 NAS-1.812
 BDS-1.622
 KE-1.603
 law-1.546
TokenK
Token ID42
Feature activation-2.13e-01

Pos logit contributions
 Bord+1.390
 NAS+1.310
 BDS+1.190
 KE+1.185
 law+1.151
Neg logit contributions
emet-0.961
ervatives-0.916
adan-0.889
_(-0.868
elsius-0.855
Tokenier
Token ID959
Feature activation+3.17e-01

Pos logit contributions
emet+0.311
ervatives+0.299
adan+0.293
_(+0.292
_-+0.284
Neg logit contributions
 Bord-0.399
 NAS-0.373
 BDS-0.336
 KE-0.334
 law-0.327
Tokenan
Token ID272
Feature activation+1.56e-01

Pos logit contributions
 Bord+0.590
 NAS+0.552
 BDS+0.496
 KE+0.495
 law+0.481
Neg logit contributions
emet-0.471
ervatives-0.449
adan-0.436
_(-0.435
elsius-0.425
Token Profile
Token ID13118
Feature activation-6.17e-01

Pos logit contributions
 Bord+0.299
 NAS+0.280
 BDS+0.253
 KE+0.252
 law+0.245
Neg logit contributions
emet-0.224
ervatives-0.214
adan-0.210
_(-0.207
_--0.203
Token Blog
Token ID14001
Feature activation-9.02e-01

Pos logit contributions
emet+0.751
ervatives+0.702
adan+0.701
_(+0.692
_-+0.679
Neg logit contributions
 Bord-1.317
 NAS-1.239
 BDS-1.135
 KE-1.125
 law-1.100
Token Joined
Token ID16709
Feature activation-1.68e+00

Pos logit contributions
emet+0.955
ervatives+0.891
adan+0.867
elsius+0.824
_(+0.824
Neg logit contributions
 Bord-2.121
 NAS-2.035
 BDS-1.873
 KE-1.852
 law-1.812
Token August
Token ID2932
Feature activation-7.15e-01

Pos logit contributions
emet+2.898
ervatives+2.764
adan+2.750
_(+2.706
_-+2.663
Neg logit contributions
 Bord-2.801
 NAS-2.603
 BDS-2.315
 KE-2.289
 law-2.211
Token 2011
Token ID2813
Feature activation-4.77e-01

Pos logit contributions
emet+1.210
ervatives+1.152
adan+1.150
_(+1.117
_-+1.106
Neg logit contributions
 Bord-1.233
 NAS-1.153
 BDS-1.028
 KE-1.011
 law-0.979
Token United
Token ID1578
Feature activation-1.59e+00

Pos logit contributions
emet+0.905
ervatives+0.869
adan+0.864
_(+0.853
_-+0.842
Neg logit contributions
 Bord-0.707
 NAS-0.651
 BDS-0.569
 KE-0.561
 law-0.540
Token States
Token ID1829
Feature activation-1.14e+00

Pos logit contributions
emet+2.111
adan+1.999
ervatives+1.977
_(+1.942
_-+1.910
Neg logit contributions
 Bord-3.077
 NAS-2.897
 BDS-2.639
 KE-2.606
BLE-2.568
Token 6
Token ID718
Feature activation-5.28e-01

Pos logit contributions
emet+1.896
ervatives+1.811
adan+1.799
_(+1.765
_-+1.742
Neg logit contributions
 Bord-1.957
 NAS-1.822
 BDS-1.631
 KE-1.610
 law-1.558
Token the
Token ID262
Feature activation-1.06e-01

Pos logit contributions
emet+0.497
ervatives+0.480
adan+0.472
_(+0.469
stein+0.460
Neg logit contributions
 Bord-0.474
 NAS-0.441
 BDS-0.397
 KE-0.389
 law-0.380
Token region
Token ID3814
Feature activation+8.36e-02

Pos logit contributions
emet+0.175
ervatives+0.169
adan+0.167
_(+0.165
stein+0.162
Neg logit contributions
 Bord-0.183
 NAS-0.170
 BDS-0.155
 KE-0.151
 law-0.149
Token vertically
Token ID31677
Feature activation-3.16e-01

Pos logit contributions
 Bord+0.138
 NAS+0.127
 BDS+0.115
 KE+0.112
 law+0.112
Neg logit contributions
emet-0.145
ervatives-0.137
adan-0.136
_(-0.134
_--0.132
Token.
Token ID13
Feature activation-9.47e-02

Pos logit contributions
emet+0.527
ervatives+0.501
adan+0.494
_(+0.488
stein+0.479
Neg logit contributions
 Bord-0.544
 NAS-0.503
 BDS-0.453
 KE-0.442
 law-0.438
Token F
Token ID376
Feature activation-1.01e-01

Pos logit contributions
emet+0.168
ervatives+0.161
adan+0.159
_(+0.156
elsius+0.155
Neg logit contributions
 Bord-0.153
 NAS-0.141
 BDS-0.127
 KE-0.123
 law-0.119
Tokenilling
Token ID4509
Feature activation-1.68e+00

Pos logit contributions
emet+0.169
ervatives+0.162
adan+0.160
_(+0.159
stein+0.156
Neg logit contributions
 Bord-0.169
 NAS-0.157
 BDS-0.141
 KE-0.138
 law-0.135
Token the
Token ID262
Feature activation-1.02e-01

Pos logit contributions
emet+2.870
ervatives+2.737
adan+2.726
_(+2.692
_-+2.657
Neg logit contributions
 Bord-2.780
 NAS-2.575
 BDS-2.283
 KE-2.265
 law-2.176
Token land
Token ID1956
Feature activation-2.44e-01

Pos logit contributions
emet+0.198
ervatives+0.191
adan+0.190
_(+0.187
_-+0.184
Neg logit contributions
 Bord-0.148
 NAS-0.135
 BDS-0.120
 KE-0.117
 law-0.114
Token between
Token ID1022
Feature activation-1.52e-02

Pos logit contributions
emet+0.432
ervatives+0.414
adan+0.410
_(+0.403
STDOUT+0.396
Neg logit contributions
 Bord-0.389
 NAS-0.360
 BDS-0.323
 KE-0.319
 law-0.313
Token the
Token ID262
Feature activation-7.60e-02

Pos logit contributions
emet+0.025
ervatives+0.024
adan+0.024
_(+0.024
elsius+0.023
Neg logit contributions
 Bord-0.026
 NAS-0.024
 BDS-0.022
 KE-0.022
 law-0.021
Token isolated
Token ID11557
Feature activation-1.85e-01

Pos logit contributions
emet+0.120
ervatives+0.116
adan+0.114
_(+0.112
_-+0.110
Neg logit contributions
 Bord-0.136
 NAS-0.127
 BDS-0.116
 KE-0.113
 law-0.112
Token of
Token ID286
Feature activation-1.31e-01

Pos logit contributions
emet+0.239
ervatives+0.229
adan+0.223
_(+0.219
stein+0.214
Neg logit contributions
 Bord-0.339
 NAS-0.318
 BDS-0.288
 KE-0.286
 law-0.279
Token Dal
Token ID12348
Feature activation-7.02e-01

Pos logit contributions
emet+0.248
ervatives+0.239
adan+0.237
_(+0.235
_-+0.233
Neg logit contributions
 Bord-0.196
 NAS-0.179
 BDS-0.157
 KE-0.155
 law-0.150
TokenK
Token ID42
Feature activation-2.82e-01

Pos logit contributions
emet+0.858
ervatives+0.828
adan+0.798
_(+0.797
_-+0.786
Neg logit contributions
 Bord-1.492
 NAS-1.402
 BDS-1.277
 KE-1.269
 law-1.246
Token Profile
Token ID13118
Feature activation-6.07e-01

Pos logit contributions
emet+0.429
ervatives+0.413
adan+0.403
_(+0.403
_-+0.396
Neg logit contributions
 Bord-0.518
 NAS-0.481
 BDS-0.431
 KE-0.429
 law-0.421
Token Blog
Token ID14001
Feature activation-8.49e-01

Pos logit contributions
emet+0.754
ervatives+0.708
adan+0.704
_(+0.692
_-+0.678
Neg logit contributions
 Bord-1.285
 NAS-1.210
 BDS-1.108
 KE-1.098
 law-1.072
Token Joined
Token ID16709
Feature activation-1.67e+00

Pos logit contributions
emet+0.901
ervatives+0.834
adan+0.815
elsius+0.776
_(+0.771
Neg logit contributions
 Bord-1.996
 NAS-1.915
 BDS-1.763
 KE-1.742
 law-1.707
Token June
Token ID2795
Feature activation-7.53e-01

Pos logit contributions
emet+2.890
ervatives+2.755
adan+2.740
_(+2.694
_-+2.652
Neg logit contributions
 Bord-2.760
 NAS-2.567
 BDS-2.280
 KE-2.255
 law-2.175
Token 2012
Token ID2321
Feature activation-3.29e-01

Pos logit contributions
emet+1.293
ervatives+1.231
adan+1.228
_(+1.191
elsius+1.188
Neg logit contributions
 Bord-1.283
 NAS-1.199
 BDS-1.067
 KE-1.054
 law-1.011
Token Canada
Token ID3340
Feature activation-5.35e-01

Pos logit contributions
emet+0.621
ervatives+0.597
adan+0.593
_(+0.585
_-+0.578
Neg logit contributions
 Bord-0.490
 NAS-0.451
 BDS-0.395
 KE-0.389
 law-0.374
Token 16
Token ID1467
Feature activation-1.11e-01

Pos logit contributions
emet+0.923
ervatives+0.878
adan+0.876
elsius+0.852
_(+0.851
Neg logit contributions
 Bord-0.907
 NAS-0.847
 BDS-0.759
 KE-0.745
 law-0.716
Token10
Token ID940
Feature activation+2.26e-02

Pos logit contributions
emet+0.151
ervatives+0.144
adan+0.142
_(+0.138
elsius+0.136
Neg logit contributions
 Bord-0.225
 NAS-0.212
 BDS-0.193
 KE-0.192
 law-0.186
Token survivors
Token ID13644
Feature activation+1.83e-01

Pos logit contributions
 Bord+0.397
 NAS+0.366
 BDS+0.327
 KE+0.325
 law+0.317
Neg logit contributions
emet-0.414
ervatives-0.397
adan-0.394
_(-0.385
_--0.381
Token'
Token ID6
Feature activation-3.35e-01

Pos logit contributions
 Bord+0.302
 NAS+0.277
 BDS+0.248
 KE+0.244
 law+0.239
Neg logit contributions
emet-0.321
adan-0.305
ervatives-0.302
_--0.296
_(-0.296
Token skills
Token ID4678
Feature activation+6.07e-02

Pos logit contributions
emet+0.594
adan+0.571
ervatives+0.569
_(+0.555
_-+0.550
Neg logit contributions
 Bord-0.544
 NAS-0.501
 BDS-0.448
 KE-0.442
 law-0.430
Token (
Token ID357
Feature activation-1.10e-01

Pos logit contributions
 Bord+0.100
 NAS+0.092
 BDS+0.082
 KE+0.081
 law+0.079
Neg logit contributions
emet-0.105
adan-0.100
ervatives-0.100
_--0.098
_(-0.098
Tokenf
Token ID69
Feature activation+3.28e-02

Pos logit contributions
emet+0.162
ervatives+0.155
adan+0.152
_(+0.150
_-+0.147
Neg logit contributions
 Bord-0.208
 NAS-0.195
 BDS-0.177
 KE-0.175
 law-0.171
Tokenishing
Token ID3929
Feature activation-1.66e+00

Pos logit contributions
 Bord+0.057
 NAS+0.053
 BDS+0.047
 KE+0.047
 law+0.046
Neg logit contributions
emet-0.053
adan-0.050
ervatives-0.050
_(-0.050
_--0.049
Token,
Token ID11
Feature activation-4.23e-02

Pos logit contributions
emet+2.781
ervatives+2.663
adan+2.652
_(+2.633
stein+2.579
Neg logit contributions
 Bord-2.761
 NAS-2.561
 BDS-2.279
 KE-2.255
 law-2.166
Token cooking
Token ID10801
Feature activation-3.27e-01

Pos logit contributions
emet+0.075
ervatives+0.072
adan+0.071
_(+0.070
_-+0.070
Neg logit contributions
 Bord-0.068
 NAS-0.063
 BDS-0.056
 KE-0.055
 law-0.054
Token,
Token ID11
Feature activation+1.78e-01

Pos logit contributions
emet+0.569
ervatives+0.543
adan+0.540
_(+0.533
_-+0.527
Neg logit contributions
 Bord-0.532
 NAS-0.492
 BDS-0.438
 KE-0.432
 law-0.420
Token etc
Token ID3503
Feature activation-3.90e-02

Pos logit contributions
 Bord+0.280
 NAS+0.258
 BDS+0.232
 KE+0.225
 law+0.225
Neg logit contributions
emet-0.320
ervatives-0.307
adan-0.304
_(-0.301
_--0.298
Token.),
Token ID12179
Feature activation-3.57e-01

Pos logit contributions
emet+0.059
ervatives+0.056
adan+0.055
_(+0.054
stein+0.053
Neg logit contributions
 Bord-0.073
 NAS-0.068
 BDS-0.061
 KE-0.061
 law-0.059
Token.
Token ID13
Feature activation-5.76e-02

Pos logit contributions
 Bord+1.394
 NAS+1.300
 BDS+1.171
 KE+1.151
 law+1.132
Neg logit contributions
emet-1.280
ervatives-1.209
adan-1.195
_(-1.178
_--1.166
Token
Token ID198
Feature activation-7.41e-02

Pos logit contributions
emet+0.094
ervatives+0.090
adan+0.089
_(+0.088
_-+0.087
Neg logit contributions
 Bord-0.099
 NAS-0.092
 BDS-0.082
 KE-0.081
 law-0.079
Token
Token ID198
Feature activation-6.07e-02

Pos logit contributions
emet+0.141
avorite+0.135
ervatives+0.134
adan+0.134
elsius+0.134
Neg logit contributions
 Bord-0.112
 NAS-0.105
 BDS-0.092
 KE-0.090
 law-0.087
TokenFollow
Token ID7155
Feature activation-8.88e-02

Pos logit contributions
emet+0.093
ervatives+0.089
adan+0.087
_(+0.085
elsius+0.085
Neg logit contributions
 Bord-0.110
 NAS-0.104
 BDS-0.094
 KE-0.093
 law-0.090
Token HL
Token ID38312
Feature activation-4.37e-01

Pos logit contributions
emet+0.161
ervatives+0.155
adan+0.153
_(+0.152
_-+0.149
Neg logit contributions
 Bord-0.139
 NAS-0.128
 BDS-0.113
 KE-0.112
 law-0.108
TokenTV
Token ID6849
Feature activation-1.65e+00

Pos logit contributions
emet+0.718
ervatives+0.687
adan+0.674
_(+0.669
_-+0.657
Neg logit contributions
 Bord-0.751
 NAS-0.707
 BDS-0.637
 KE-0.626
 law-0.612
Token.
Token ID13
Feature activation-1.75e-01

Pos logit contributions
emet+2.501
ervatives+2.378
adan+2.360
_(+2.327
_-+2.285
Neg logit contributions
 Bord-3.047
 NAS-2.846
 BDS-2.572
 KE-2.546
 law-2.473
Tokenorg
Token ID2398
Feature activation+7.62e-01

Pos logit contributions
emet+0.327
ervatives+0.312
adan+0.309
_(+0.306
_-+0.301
Neg logit contributions
 Bord-0.266
 NAS-0.247
 BDS-0.216
 KE-0.214
 law-0.206
Token's
Token ID338
Feature activation+5.20e-02

Pos logit contributions
 Bord+1.214
 NAS+1.147
 BDS+1.007
 KE+0.999
 law+0.983
Neg logit contributions
emet-1.337
ervatives-1.286
adan-1.282
_(-1.248
elsius-1.241
Token @
Token ID2488
Feature activation-2.40e-01

Pos logit contributions
 Bord+0.084
 NAS+0.078
 BDS+0.069
 KE+0.069
 law+0.067
Neg logit contributions
emet-0.092
ervatives-0.089
adan-0.088
_(-0.086
elsius-0.085
Tokenl
Token ID75
Feature activation+4.50e-02

Pos logit contributions
emet+0.387
ervatives+0.371
adan+0.369
_(+0.360
_-+0.355
Neg logit contributions
 Bord-0.416
 NAS-0.390
 KE-0.347
 BDS-0.347
 law-0.337
Token,
Token ID11
Feature activation-3.04e-02

Pos logit contributions
 Bord+0.093
 NAS+0.086
 BDS+0.076
 KE+0.075
 law+0.074
Neg logit contributions
emet-0.095
ervatives-0.091
adan-0.090
_(-0.089
_--0.087
Token first
Token ID717
Feature activation-3.60e-01

Pos logit contributions
emet+0.052
ervatives+0.049
adan+0.049
_(+0.048
stein+0.048
Neg logit contributions
 Bord-0.051
 NAS-0.048
 BDS-0.042
 KE-0.042
 law-0.041
Token by
Token ID416
Feature activation+1.55e-01

Pos logit contributions
emet+0.609
ervatives+0.584
adan+0.579
_(+0.572
_-+0.562
Neg logit contributions
 Bord-0.606
 NAS-0.563
 BDS-0.500
 KE-0.494
 law-0.486
Token a
Token ID257
Feature activation-7.70e-02

Pos logit contributions
 Bord+0.247
 NAS+0.231
 BDS+0.203
 KE+0.201
 law+0.197
Neg logit contributions
emet-0.278
ervatives-0.265
adan-0.263
_(-0.263
_--0.257
Token hydra
Token ID25039
Feature activation-5.34e-01

Pos logit contributions
emet+0.142
ervatives+0.135
adan+0.134
_(+0.133
_-+0.131
Neg logit contributions
 Bord-0.120
 NAS-0.113
 BDS-0.099
 KE-0.098
 law-0.096
Tokenul
Token ID377
Feature activation+1.13e+00

Pos logit contributions
emet+0.830
ervatives+0.792
adan+0.782
_(+0.768
_-+0.756
Neg logit contributions
 Bord-0.971
 NAS-0.910
 BDS-0.817
 KE-0.806
 law-0.788
Tokenics
Token ID873
Feature activation+5.17e-01

Pos logit contributions
 Bord+1.871
 NAS+1.687
 BDS+1.525
 KE+1.493
 law+1.459
Neg logit contributions
emet-1.963
ervatives-1.841
adan-1.839
_(-1.837
_--1.827
Token issue
Token ID2071
Feature activation+6.64e-01

Pos logit contributions
 Bord+0.732
 NAS+0.682
 BDS+0.589
 KE+0.582
 law+0.577
Neg logit contributions
emet-1.022
ervatives-0.979
adan-0.966
_(-0.957
stein-0.941
Token with
Token ID351
Feature activation-1.08e-02

Pos logit contributions
 Bord+1.120
 NAS+1.031
 BDS+0.907
 law+0.903
 KE+0.902
Neg logit contributions
emet-1.119
ervatives-1.072
adan-1.066
_(-1.048
stein-1.030
Token the
Token ID262
Feature activation-1.59e-01

Pos logit contributions
emet+0.019
ervatives+0.018
adan+0.018
_(+0.018
_-+0.018
Neg logit contributions
 Bord-0.017
 NAS-0.016
 BDS-0.014
 KE-0.014
 law-0.013
Token rocket
Token ID10701
Feature activation-1.73e-01

Pos logit contributions
emet+0.295
ervatives+0.283
adan+0.281
_(+0.279
_-+0.274
Neg logit contributions
 Bord-0.244
 NAS-0.226
 BDS-0.198
 KE-0.195
 law-0.189
Token to
Token ID284
Feature activation-3.43e-02

Pos logit contributions
emet+0.690
adan+0.655
ervatives+0.652
_(+0.647
_-+0.639
Neg logit contributions
 Bord-0.447
 NAS-0.417
 BDS-0.357
 law-0.351
 KE-0.349
Token a
Token ID257
Feature activation-6.55e-02

Pos logit contributions
emet+0.059
ervatives+0.057
adan+0.056
_(+0.055
stein+0.055
Neg logit contributions
 Bord-0.057
 NAS-0.052
 KE-0.046
 BDS-0.046
 LIC-0.044
Token close
Token ID1969
Feature activation+2.11e-01

Pos logit contributions
emet+0.118
ervatives+0.111
adan+0.111
_-+0.110
_(+0.109
Neg logit contributions
 Bord-0.104
 NAS-0.097
 BDS-0.086
 law-0.086
 KE-0.084
Token examination
Token ID12452
Feature activation+5.24e-01

Pos logit contributions
 Bord+0.311
 NAS+0.285
 BDS+0.251
 law+0.245
 KE+0.245
Neg logit contributions
emet-0.409
ervatives-0.390
adan-0.385
_(-0.380
_--0.380
Token of
Token ID286
Feature activation+2.03e-02

Pos logit contributions
 Bord+0.860
 NAS+0.796
 BDS+0.709
 KE+0.700
 law+0.677
Neg logit contributions
emet-0.903
ervatives-0.864
adan-0.858
_(-0.848
_--0.835
Token whether
Token ID1771
Feature activation+9.57e-01

Pos logit contributions
 Bord+0.031
 NAS+0.029
 BDS+0.026
 KE+0.025
 law+0.025
Neg logit contributions
emet-0.038
ervatives-0.036
adan-0.036
_(-0.035
_--0.035
Token Ross
Token ID9847
Feature activation-3.15e-01

Pos logit contributions
 Bord+1.447
 NAS+1.336
 BDS+1.179
 law+1.153
 KE+1.149
Neg logit contributions
emet-1.809
adan-1.705
ervatives-1.702
_(-1.683
_--1.653
Token
Token ID447
Feature activation-1.46e-01

Pos logit contributions
emet+0.533
ervatives+0.507
adan+0.504
_(+0.498
_-+0.490
Neg logit contributions
 Bord-0.530
 NAS-0.491
 BDS-0.438
 KE-0.433
 law-0.421
Token
Token ID247
Feature activation+7.88e-02

Pos logit contributions
emet+0.267
ervatives+0.251
adan+0.250
_(+0.244
elsius+0.243
Neg logit contributions
 Bord-0.227
 NAS-0.204
 BDS-0.183
 law-0.183
 KE-0.180
Token testimony
Token ID9709
Feature activation-1.84e-01

Pos logit contributions
 Bord+0.142
 NAS+0.133
 BDS+0.120
 KE+0.118
 law+0.116
Neg logit contributions
emet-0.123
ervatives-0.117
adan-0.116
_(-0.114
_--0.112
Token to
Token ID284
Feature activation+6.51e-02

Pos logit contributions
emet+0.316
ervatives+0.301
adan+0.301
_(+0.296
_-+0.292
Neg logit contributions
 Bord-0.307
 NAS-0.282
 BDS-0.251
 KE-0.249
 law-0.247
Token you
Token ID345
Feature activation+2.30e-02

Pos logit contributions
 Bord+0.216
 NAS+0.201
 BDS+0.178
 KE+0.176
 law+0.172
Neg logit contributions
emet-0.240
ervatives-0.231
adan-0.227
_(-0.225
stein-0.221
Token're
Token ID821
Feature activation-6.13e-02

Pos logit contributions
 Bord+0.038
 NAS+0.036
 BDS+0.032
 KE+0.031
 law+0.031
Neg logit contributions
emet-0.040
ervatives-0.038
adan-0.037
_(-0.037
_--0.036
Token in
Token ID287
Feature activation-3.89e-02

Pos logit contributions
emet+0.106
ervatives+0.102
adan+0.101
_(+0.099
_-+0.098
Neg logit contributions
 Bord-0.101
 NAS-0.094
 BDS-0.083
 KE-0.082
 law-0.081
Token luck
Token ID8458
Feature activation+7.12e-01

Pos logit contributions
emet+0.066
ervatives+0.063
adan+0.062
_(+0.061
_-+0.061
Neg logit contributions
 Bord-0.065
 NAS-0.061
 BDS-0.054
 KE-0.054
 law-0.052
Token.
Token ID13
Feature activation+1.54e-01

Pos logit contributions
 Bord+1.238
 NAS+1.159
 BDS+1.039
 KE+1.026
 law+1.003
Neg logit contributions
emet-1.156
ervatives-1.101
adan-1.086
_(-1.067
stein-1.052
Token Turn
Token ID6756
Feature activation+9.43e-01

Pos logit contributions
 Bord+0.232
 NAS+0.214
 BDS+0.188
 KE+0.185
 LIC+0.176
Neg logit contributions
emet-0.294
ervatives-0.284
adan-0.279
_(-0.274
elsius-0.271
Token on
Token ID319
Feature activation+5.79e-01

Pos logit contributions
 Bord+1.432
 NAS+1.325
 BDS+1.163
 KE+1.157
 law+1.109
Neg logit contributions
emet-1.741
ervatives-1.674
adan-1.654
_(-1.641
_--1.622
Token the
Token ID262
Feature activation+1.38e-01

Pos logit contributions
 Bord+0.933
 NAS+0.862
 BDS+0.765
 KE+0.756
 law+0.730
Neg logit contributions
emet-1.017
ervatives-0.973
adan-0.967
_(-0.956
_--0.942
Token monitor
Token ID5671
Feature activation-5.06e-01

Pos logit contributions
 Bord+0.191
 NAS+0.176
 BDS+0.151
 KE+0.151
 law+0.144
Neg logit contributions
emet-0.273
ervatives-0.263
adan-0.262
_(-0.259
_--0.257
Token and
Token ID290
Feature activation+9.01e-02

Pos logit contributions
emet+0.813
ervatives+0.773
adan+0.767
_(+0.756
_-+0.747
Neg logit contributions
 Bord-0.895
 NAS-0.836
 BDS-0.750
 KE-0.743
 law-0.721
Token note
Token ID3465
Feature activation+3.04e-01

Pos logit contributions
 Bord+0.154
 NAS+0.145
 BDS+0.129
 KE+0.128
 law+0.125
Neg logit contributions
emet-0.150
ervatives-0.144
adan-0.142
_(-0.141
_--0.139
Token are
Token ID389
Feature activation+1.81e-01

Pos logit contributions
 Bord+0.206
 NAS+0.195
 BDS+0.172
 KE+0.171
 law+0.168
Neg logit contributions
emet-0.197
ervatives-0.185
adan-0.184
_(-0.181
elsius-0.179
Token having
Token ID1719
Feature activation+3.74e-01

Pos logit contributions
 Bord+0.269
 NAS+0.252
 BDS+0.220
 KE+0.216
 law+0.212
Neg logit contributions
emet-0.340
ervatives-0.325
adan-0.321
_(-0.319
stein-0.315
Token issues
Token ID2428
Feature activation+1.08e+00

Pos logit contributions
 Bord+0.515
 NAS+0.485
 BDS+0.415
 KE+0.409
 law+0.395
Neg logit contributions
emet-0.745
ervatives-0.716
adan-0.707
_(-0.702
_--0.695
Token using
Token ID1262
Feature activation+2.73e-01

Pos logit contributions
 Bord+1.579
 NAS+1.494
 BDS+1.280
 KE+1.271
 law+1.254
Neg logit contributions
emet-2.027
ervatives-1.953
adan-1.919
_(-1.900
stein-1.878
Token the
Token ID262
Feature activation+1.86e-01

Pos logit contributions
 Bord+0.436
 NAS+0.407
 BDS+0.358
 KE+0.356
 law+0.344
Neg logit contributions
emet-0.485
ervatives-0.468
adan-0.461
_(-0.456
_--0.449
Token interactive
Token ID14333
Feature activation+1.02e+00

Pos logit contributions
 Bord+0.285
 NAS+0.266
 BDS+0.232
 KE+0.231
 law+0.224
Neg logit contributions
emet-0.342
ervatives-0.330
adan-0.326
_(-0.323
stein-0.319
Token version
Token ID2196
Feature activation+1.28e+00

Pos logit contributions
 Bord+1.596
 NAS+1.484
 BDS+1.310
 KE+1.298
 law+1.261
Neg logit contributions
emet-1.856
ervatives-1.799
adan-1.770
_(-1.755
_--1.726
Token of
Token ID286
Feature activation+7.30e-02

Pos logit contributions
 Bord+1.990
 NAS+1.845
 BDS+1.626
 KE+1.608
 law+1.560
Neg logit contributions
emet-2.322
ervatives-2.230
adan-2.191
_(-2.168
stein-2.144
Token this
Token ID428
Feature activation+8.87e-02

Pos logit contributions
 Bord+0.118
 NAS+0.110
 BDS+0.097
 KE+0.096
 law+0.093
Neg logit contributions
emet-0.128
ervatives-0.124
adan-0.122
_(-0.121
_--0.119
Token visualization
Token ID32704
Feature activation+5.32e-01

Pos logit contributions
 Bord+0.127
 NAS+0.117
 BDS+0.102
 KE+0.100
 law+0.097
Neg logit contributions
emet-0.172
ervatives-0.166
adan-0.164
_(-0.163
_--0.160
Token,
Token ID11
Feature activation+5.48e-02

Pos logit contributions
 Bord+0.843
 NAS+0.779
 BDS+0.688
 KE+0.680
 law+0.665
Neg logit contributions
emet-0.954
ervatives-0.913
adan-0.906
_(-0.893
stein-0.879
Token worn
Token ID12666
Feature activation+2.15e-01

Pos logit contributions
emet+0.240
adan+0.228
ervatives+0.228
_(+0.225
_-+0.222
Neg logit contributions
 Bord-0.221
 NAS-0.203
 BDS-0.183
 KE-0.179
 law-0.175
Token out
Token ID503
Feature activation-1.49e-01

Pos logit contributions
 Bord+0.352
 NAS+0.326
 BDS+0.293
 KE+0.287
 law+0.283
Neg logit contributions
emet-0.372
adan-0.353
ervatives-0.352
_(-0.344
_--0.342
Token...
Token ID986
Feature activation-3.25e-01

Pos logit contributions
emet+0.250
ervatives+0.235
adan+0.234
_(+0.231
_-+0.229
Neg logit contributions
 Bord-0.255
 NAS-0.235
 BDS-0.213
 KE-0.208
 law-0.206
Token I
Token ID314
Feature activation+8.96e-02

Pos logit contributions
emet+0.579
ervatives+0.548
adan+0.543
_(+0.539
elsius+0.528
Neg logit contributions
 Bord-0.528
 NAS-0.487
 BDS-0.435
 KE-0.427
 law-0.412
Token just
Token ID655
Feature activation+3.28e-01

Pos logit contributions
 Bord+0.160
 NAS+0.148
 BDS+0.133
 KE+0.132
 law+0.131
Neg logit contributions
emet-0.144
_(-0.134
ervatives-0.133
adan-0.133
elsius-0.131
Token wish
Token ID4601
Feature activation+8.99e-01

Pos logit contributions
 Bord+0.554
 NAS+0.512
 BDS+0.459
 law+0.456
 KE+0.456
Neg logit contributions
emet-0.559
_(-0.523
ervatives-0.514
adan-0.513
_--0.513
Token what
Token ID644
Feature activation-2.58e-01

Pos logit contributions
 Bord+1.448
 NAS+1.332
 BDS+1.193
 KE+1.176
 law+1.143
Neg logit contributions
emet-1.607
ervatives-1.515
_(-1.498
adan-1.488
_--1.469
Token she
Token ID673
Feature activation-1.76e-01

Pos logit contributions
emet+0.445
ervatives+0.422
adan+0.419
_(+0.416
_-+0.409
Neg logit contributions
 Bord-0.431
 NAS-0.397
 BDS-0.356
 KE-0.352
 law-0.342
Token had
Token ID550
Feature activation+2.45e-01

Pos logit contributions
emet+0.290
ervatives+0.271
adan+0.270
_(+0.270
_-+0.265
Neg logit contributions
 Bord-0.306
 NAS-0.280
 KE-0.253
 BDS-0.252
 law-0.246
Token to
Token ID284
Feature activation-8.18e-02

Pos logit contributions
 Bord+0.475
 NAS+0.439
 BDS+0.405
 KE+0.398
 law+0.394
Neg logit contributions
emet-0.360
ervatives-0.335
adan-0.332
_(-0.331
_--0.326
Token tell
Token ID1560
Feature activation-3.86e-01

Pos logit contributions
emet+0.137
ervatives+0.128
adan+0.127
_(+0.127
elsius+0.125
Neg logit contributions
 Bord-0.141
 NAS-0.129
 BDS-0.118
 KE-0.116
 law-0.114
Token -
Token ID532
Feature activation-1.89e-01

Pos logit contributions
 Bord+0.621
 NAS+0.580
 BDS+0.522
 KE+0.512
 law+0.503
Neg logit contributions
emet-0.586
ervatives-0.557
adan-0.556
_(-0.547
_--0.542
Token The
Token ID383
Feature activation+2.41e-01

Pos logit contributions
emet+0.343
ervatives+0.329
adan+0.327
_(+0.322
stein+0.317
Neg logit contributions
 Bord-0.298
 NAS-0.274
 BDS-0.242
 KE-0.239
 law-0.231
Token Walking
Token ID21276
Feature activation+8.41e-01

Pos logit contributions
 Bord+0.380
 NAS+0.347
 BDS+0.307
 KE+0.303
 law+0.292
Neg logit contributions
emet-0.437
ervatives-0.419
adan-0.416
_(-0.409
_--0.404
Token Dead
Token ID5542
Feature activation+4.47e-01

Pos logit contributions
 Bord+1.289
 NAS+1.148
 BDS+1.010
 KE+0.996
 law+0.960
Neg logit contributions
emet-1.593
ervatives-1.536
adan-1.529
_(-1.504
_--1.484
Token and
Token ID290
Feature activation+4.14e-02

Pos logit contributions
 Bord+0.794
 NAS+0.735
 BDS+0.665
 KE+0.650
 law+0.640
Neg logit contributions
emet-0.722
ervatives-0.683
adan-0.680
_(-0.658
elsius-0.656
Token An
Token ID1052
Feature activation+3.79e-01

Pos logit contributions
 Bord+0.063
 NAS+0.058
 BDS+0.051
 KE+0.050
 law+0.049
Neg logit contributions
emet-0.077
ervatives-0.074
adan-0.074
_(-0.073
_--0.072
Token American
Token ID1605
Feature activation+2.35e-01

Pos logit contributions
 Bord+0.602
 NAS+0.556
 BDS+0.492
 KE+0.486
 law+0.469
Neg logit contributions
emet-0.675
ervatives-0.646
adan-0.642
_(-0.635
_--0.625
Token Werewolf
Token ID40261
Feature activation+1.31e-02

Pos logit contributions
 Bord+0.325
 NAS+0.287
 BDS+0.253
 KE+0.245
 law+0.239
Neg logit contributions
emet-0.482
ervatives-0.463
adan-0.457
elsius-0.450
_--0.449
Token in
Token ID287
Feature activation+3.02e-02

Pos logit contributions
 Bord+0.020
 NAS+0.018
 BDS+0.015
 law+0.015
 KE+0.015
Neg logit contributions
emet-0.025
adan-0.024
ervatives-0.024
elsius-0.024
_--0.024
Token London
Token ID3576
Feature activation+4.54e-01

Pos logit contributions
 Bord+0.048
 NAS+0.044
 BDS+0.039
 KE+0.039
 LIC+0.037
Neg logit contributions
emet-0.053
ervatives-0.051
adan-0.051
_(-0.051
_--0.049
Token.
Token ID13
Feature activation+1.91e-01

Pos logit contributions
 Bord+0.802
 NAS+0.744
 BDS+0.675
 KE+0.662
 law+0.647
Neg logit contributions
emet-0.730
ervatives-0.698
adan-0.689
_(-0.671
_--0.667
Token thereby
Token ID12839
Feature activation-3.24e-01

Pos logit contributions
 Bord+0.064
 NAS+0.059
 BDS+0.053
 KE+0.052
 law+0.052
Neg logit contributions
emet-0.069
ervatives-0.066
adan-0.066
_(-0.065
_--0.064
Token take
Token ID1011
Feature activation-3.00e-01

Pos logit contributions
emet+0.559
ervatives+0.532
adan+0.531
_(+0.524
_-+0.518
Neg logit contributions
 Bord-0.534
 NAS-0.493
 BDS-0.442
 KE-0.434
 law-0.429
Token possession
Token ID7797
Feature activation+2.82e-01

Pos logit contributions
emet+0.585
ervatives+0.562
adan+0.559
_(+0.554
_-+0.547
Neg logit contributions
 Bord-0.425
 NAS-0.388
 BDS-0.338
 KE-0.334
 law-0.322
Token of
Token ID286
Feature activation+1.19e-01

Pos logit contributions
 Bord+0.473
 NAS+0.441
 BDS+0.393
 KE+0.389
 law+0.372
Neg logit contributions
emet-0.473
ervatives-0.454
adan-0.449
_(-0.447
_--0.438
Token it
Token ID340
Feature activation-1.45e-02

Pos logit contributions
 Bord+0.188
 NAS+0.172
 BDS+0.153
 KE+0.151
 law+0.148
Neg logit contributions
emet-0.214
ervatives-0.204
adan-0.204
_(-0.201
_--0.198
Token.
Token ID13
Feature activation-4.94e-03

Pos logit contributions
emet+0.024
adan+0.023
ervatives+0.023
_(+0.022
_-+0.022
Neg logit contributions
 Bord-0.025
 NAS-0.023
 BDS-0.021
 KE-0.021
 law-0.020
Token
Token ID447
Feature activation-6.95e-03

Pos logit contributions
emet+0.009
adan+0.009
ervatives+0.009
elsius+0.009
_(+0.009
Neg logit contributions
 Bord-0.008
 NAS-0.007
 BDS-0.006
 KE-0.006
 law-0.006
Token
Token ID251
Feature activation+6.01e-02

Pos logit contributions
emet+0.012
ervatives+0.011
adan+0.011
_(+0.011
_-+0.011
Neg logit contributions
 Bord-0.012
 NAS-0.011
 BDS-0.010
 KE-0.010
 law-0.009
Token\n
Token ID198
Feature activation+2.58e-02

Pos logit contributions
 Bord+0.095
 NAS+0.087
 BDS+0.077
 KE+0.075
 law+0.073
Neg logit contributions
emet-0.110
ervatives-0.105
adan-0.105
elsius-0.102
_(-0.102
Token\n
Token ID198
Feature activation+1.66e-02

Pos logit contributions
 Bord+0.038
 NAS+0.036
 BDS+0.031
 KE+0.031
 law+0.030
Neg logit contributions
emet-0.049
avorite-0.047
ervatives-0.047
adan-0.047
elsius-0.047
TokenThe
Token ID464
Feature activation-1.05e-02

Pos logit contributions
 Bord+0.030
 NAS+0.028
 BDS+0.026
 KE+0.025
 law+0.024
Neg logit contributions
emet-0.025
ervatives-0.024
adan-0.024
elsius-0.023
_(-0.023
Token it
Token ID340
Feature activation-4.64e-02

Pos logit contributions
 Bord+0.100
 NAS+0.091
 BDS+0.081
 KE+0.079
 law+0.078
Neg logit contributions
emet-0.127
ervatives-0.121
adan-0.120
_(-0.119
_--0.118
Token insults
Token ID27819
Feature activation+4.59e-02

Pos logit contributions
emet+0.078
ervatives+0.073
adan+0.073
_(+0.072
_-+0.071
Neg logit contributions
 Bord-0.080
 NAS-0.074
 BDS-0.067
 KE-0.066
 law-0.066
Token them
Token ID606
Feature activation+3.58e-01

Pos logit contributions
 Bord+0.078
 NAS+0.072
 BDS+0.065
 KE+0.064
 law+0.063
Neg logit contributions
emet-0.078
adan-0.074
ervatives-0.074
_(-0.072
stein-0.072
Token.
Token ID13
Feature activation-9.66e-03

Pos logit contributions
 Bord+0.586
 NAS+0.536
 BDS+0.481
 law+0.479
 KE+0.469
Neg logit contributions
emet-0.625
adan-0.586
_(-0.578
_--0.576
ervatives-0.575
Token "
Token ID366
Feature activation+1.15e-01

Pos logit contributions
emet+0.018
ervatives+0.017
adan+0.017
elsius+0.017
avorite+0.016
Neg logit contributions
 Bord-0.015
 NAS-0.014
 BDS-0.013
 KE-0.012
 LB-0.012
TokenThe
Token ID464
Feature activation+2.16e-02

Pos logit contributions
 Bord+0.206
 NAS+0.196
 BDS+0.177
 KE+0.173
 law+0.167
Neg logit contributions
emet-0.180
ervatives-0.172
adan-0.170
elsius-0.168
avorite-0.164
Token most
Token ID749
Feature activation-8.19e-02

Pos logit contributions
 Bord+0.037
 NAS+0.035
 BDS+0.032
 KE+0.031
 law+0.031
Neg logit contributions
emet-0.036
ervatives-0.034
adan-0.034
_(-0.033
_--0.033
Token politically
Token ID11889
Feature activation-2.69e-01

Pos logit contributions
emet+0.149
adan+0.141
ervatives+0.140
_(+0.138
_-+0.138
Neg logit contributions
 Bord-0.130
 NAS-0.121
 BDS-0.108
 KE-0.105
 law-0.105
Token engaged
Token ID7953
Feature activation-9.73e-02

Pos logit contributions
emet+0.494
ervatives+0.465
adan+0.462
_(+0.459
_-+0.455
Neg logit contributions
 Bord-0.419
 NAS-0.392
 BDS-0.348
 KE-0.338
 law-0.334
Token millennials
Token ID30418
Feature activation-3.53e-02

Pos logit contributions
emet+0.148
ervatives+0.138
adan+0.136
_-+0.134
elsius+0.134
Neg logit contributions
 Bord-0.183
 NAS-0.172
 BDS-0.157
 KE-0.153
 law-0.153
Token think
Token ID892
Feature activation-4.05e-01

Pos logit contributions
emet+0.060
adan+0.056
_(+0.056
_-+0.055
ervatives+0.055
Neg logit contributions
 Bord-0.059
 NAS-0.055
 BDS-0.050
 KE-0.049
 law-0.048
Token
Token ID447
Feature activation+1.40e-02

Pos logit contributions
 Bord+0.007
 NAS+0.007
 BDS+0.006
 KE+0.006
 law+0.006
Neg logit contributions
emet-0.011
ervatives-0.011
adan-0.011
avorite-0.011
elsius-0.010
Token
Token ID247
Feature activation-1.02e-01

Pos logit contributions
 Bord+0.025
 NAS+0.023
 BDS+0.020
 law+0.020
 KE+0.020
Neg logit contributions
emet-0.023
adan-0.021
ervatives-0.021
elsius-0.020
_(-0.020
Tokent
Token ID83
Feature activation+8.50e-02

Pos logit contributions
emet+0.122
ervatives+0.114
adan+0.113
_(+0.110
elsius+0.107
Neg logit contributions
 Bord-0.218
 NAS-0.206
 BDS-0.189
 KE-0.188
 law-0.186
Token want
Token ID765
Feature activation+7.03e-01

Pos logit contributions
 Bord+0.137
 NAS+0.128
 BDS+0.115
 KE+0.113
 law+0.112
Neg logit contributions
emet-0.149
adan-0.141
ervatives-0.139
_(-0.138
elsius-0.137
Token your
Token ID534
Feature activation-1.16e-01

Pos logit contributions
 Bord+1.049
 NAS+0.970
 BDS+0.858
 KE+0.837
 law+0.827
Neg logit contributions
emet-1.323
adan-1.254
ervatives-1.251
_(-1.238
_--1.218
Token attention
Token ID3241
Feature activation+7.72e-01

Pos logit contributions
emet+0.212
adan+0.202
ervatives+0.201
_(+0.198
stein+0.195
Neg logit contributions
 Bord-0.183
 NAS-0.170
 BDS-0.151
 KE-0.148
 law-0.147
Token.
Token ID13
Feature activation-1.19e-02

Pos logit contributions
 Bord+1.318
 NAS+1.221
 BDS+1.099
 KE+1.086
 law+1.062
Neg logit contributions
emet-1.278
adan-1.208
ervatives-1.208
_(-1.193
_--1.177
Token\n
Token ID198
Feature activation+5.41e-02

Pos logit contributions
emet+0.022
ervatives+0.020
adan+0.020
elsius+0.020
_(+0.020
Neg logit contributions
 Bord-0.019
 NAS-0.018
 BDS-0.016
 KE-0.015
 law-0.015
Token\n
Token ID198
Feature activation+2.46e-02

Pos logit contributions
 Bord+0.080
 NAS+0.075
 BDS+0.066
 KE+0.065
 law+0.062
Neg logit contributions
emet-0.104
avorite-0.100
ervatives-0.099
adan-0.098
elsius-0.098
TokenBut
Token ID1537
Feature activation-3.22e-01

Pos logit contributions
 Bord+0.044
 NAS+0.041
 BDS+0.037
 KE+0.036
 law+0.036
Neg logit contributions
emet-0.039
ervatives-0.037
adan-0.036
elsius-0.035
_(-0.035
Token you
Token ID345
Feature activation-2.39e-02

Pos logit contributions
emet+0.561
ervatives+0.532
adan+0.529
_(+0.522
_-+0.514
Neg logit contributions
 Bord-0.532
 NAS-0.491
 BDS-0.440
 KE-0.431
 law-0.424
Token this
Token ID428
Feature activation+2.60e-02

Pos logit contributions
 Bord+0.205
 NAS+0.190
 BDS+0.169
 KE+0.167
 law+0.161
Neg logit contributions
emet-0.225
ervatives-0.217
adan-0.214
_(-0.213
_--0.210
Token tour
Token ID4205
Feature activation-3.31e-01

Pos logit contributions
 Bord+0.045
 NAS+0.042
 BDS+0.038
 KE+0.037
 law+0.036
Neg logit contributions
emet-0.042
ervatives-0.041
adan-0.040
_(-0.040
stein-0.039
Token was
Token ID373
Feature activation+1.18e-01

Pos logit contributions
emet+0.561
ervatives+0.536
adan+0.533
_(+0.526
_-+0.516
Neg logit contributions
 Bord-0.557
 NAS-0.515
 BDS-0.461
 KE-0.456
 law-0.444
Token phenomenal
Token ID28187
Feature activation+5.44e-01

Pos logit contributions
 Bord+0.198
 NAS+0.181
 BDS+0.163
 KE+0.161
 law+0.157
Neg logit contributions
emet-0.202
ervatives-0.196
adan-0.193
_(-0.190
_--0.188
Token and
Token ID290
Feature activation+3.12e-02

Pos logit contributions
 Bord+0.909
 NAS+0.831
 BDS+0.746
 KE+0.733
 law+0.725
Neg logit contributions
emet-0.933
ervatives-0.889
adan-0.881
_(-0.872
_--0.863
Token it
Token ID340
Feature activation-2.97e-02

Pos logit contributions
 Bord+0.051
 NAS+0.047
 BDS+0.042
 KE+0.042
 law+0.041
Neg logit contributions
emet-0.054
ervatives-0.052
adan-0.052
_(-0.051
_--0.050
Token was
Token ID373
Feature activation+1.66e-01

Pos logit contributions
emet+0.050
ervatives+0.047
adan+0.047
_(+0.046
_-+0.045
Neg logit contributions
 Bord-0.051
 NAS-0.047
 BDS-0.042
 KE-0.042
 law-0.041
Token an
Token ID281
Feature activation+6.20e-03

Pos logit contributions
 Bord+0.273
 NAS+0.251
 BDS+0.223
 KE+0.221
 law+0.215
Neg logit contributions
emet-0.289
ervatives-0.284
adan-0.278
_(-0.275
elsius-0.270
Token utterly
Token ID15950
Feature activation-3.57e-02

Pos logit contributions
 Bord+0.010
 NAS+0.009
 BDS+0.008
 KE+0.008
 law+0.008
Neg logit contributions
emet-0.011
ervatives-0.011
adan-0.010
_(-0.010
_--0.010
Token fantastic
Token ID9623
Feature activation+1.42e-01

Pos logit contributions
emet+0.061
ervatives+0.059
adan+0.059
_(+0.057
_-+0.057
Neg logit contributions
 Bord-0.060
 NAS-0.056
 BDS-0.050
 KE-0.049
 law-0.048
Token idea
Token ID2126
Feature activation-2.68e-01

Pos logit contributions
 Bord+0.230
 NAS+0.213
 BDS+0.191
 KE+0.186
 law+0.184
Neg logit contributions
emet-0.250
ervatives-0.243
adan-0.237
_--0.235
_(-0.235
Token.
Token ID13
Feature activation+1.86e-01

Pos logit contributions
emet+0.220
ervatives+0.209
adan+0.207
_(+0.205
stein+0.201
Neg logit contributions
 Bord-0.276
 NAS-0.260
 BDS-0.235
 KE-0.232
 law-0.225
Token Walton
Token ID34468
Feature activation+7.06e-01

Pos logit contributions
 Bord+0.274
 NAS+0.250
 BDS+0.219
 KE+0.216
 law+0.208
Neg logit contributions
emet-0.353
ervatives-0.339
adan-0.337
_(-0.333
_--0.328
Token
Token ID447
Feature activation+2.37e-02

Pos logit contributions
 Bord+1.184
 NAS+1.083
 law+0.963
 BDS+0.960
 KE+0.953
Neg logit contributions
emet-1.208
ervatives-1.147
adan-1.144
_(-1.117
_--1.102
Token
Token ID247
Feature activation+1.52e-01

Pos logit contributions
 Bord+0.036
 NAS+0.033
 BDS+0.029
 KE+0.029
 law+0.029
Neg logit contributions
emet-0.044
adan-0.041
ervatives-0.041
_(-0.040
elsius-0.040
Tokens
Token ID82
Feature activation+3.97e-02

Pos logit contributions
 Bord+0.261
 NAS+0.242
 BDS+0.215
 KE+0.214
 law+0.210
Neg logit contributions
emet-0.249
ervatives-0.238
adan-0.237
_(-0.233
_--0.229
Token physical
Token ID3518
Feature activation-1.60e-01

Pos logit contributions
 Bord+0.060
 NAS+0.055
 BDS+0.049
 KE+0.048
 law+0.047
Neg logit contributions
emet-0.075
ervatives-0.071
adan-0.071
_(-0.070
_--0.069
Token contact
Token ID2800
Feature activation+3.38e-01

Pos logit contributions
emet+0.303
ervatives+0.291
adan+0.289
_(+0.285
_-+0.282
Neg logit contributions
 Bord-0.236
 NAS-0.216
 BDS-0.191
 KE-0.188
 law-0.184
Token with
Token ID351
Feature activation+2.78e-01

Pos logit contributions
 Bord+0.547
 NAS+0.503
 BDS+0.448
 KE+0.442
 law+0.436
Neg logit contributions
emet-0.591
ervatives-0.567
adan-0.564
_(-0.558
_--0.549
Token the
Token ID262
Feature activation+2.17e-02

Pos logit contributions
 Bord+0.409
 NAS+0.373
 BDS+0.325
 KE+0.322
 law+0.315
Neg logit contributions
emet-0.533
ervatives-0.512
adan-0.509
_(-0.503
_--0.496
Token Miami
Token ID8437
Feature activation+1.91e-01

Pos logit contributions
 Bord+0.034
 NAS+0.032
 BDS+0.028
 KE+0.027
 law+0.027
Neg logit contributions
emet-0.040
ervatives-0.038
adan-0.038
_(-0.037
stein-0.037
Token Police
Token ID4287
Feature activation+2.62e-01

Pos logit contributions
 Bord+0.296
 NAS+0.272
 BDS+0.240
 KE+0.238
 law+0.234
Neg logit contributions
emet-0.351
ervatives-0.336
adan-0.333
_(-0.329
_--0.325
Token by
Token ID416
Feature activation+1.93e-01

Pos logit contributions
 Bord+0.271
 NAS+0.252
 BDS+0.227
 KE+0.224
 law+0.218
Neg logit contributions
emet-0.246
ervatives-0.235
adan-0.233
_(-0.230
_--0.226
Token P
Token ID350
Feature activation-1.76e-01

Pos logit contributions
 Bord+0.300
 NAS+0.277
 BDS+0.244
 KE+0.243
 law+0.234
Neg logit contributions
emet-0.351
ervatives-0.336
adan-0.335
_(-0.329
_--0.325
Tokenoh
Token ID1219
Feature activation+5.59e-01

Pos logit contributions
emet+0.269
ervatives+0.259
adan+0.255
_(+0.252
_-+0.248
Neg logit contributions
 Bord-0.323
 NAS-0.300
 BDS-0.270
 KE-0.269
 law-0.261
Tokenj
Token ID73
Feature activation-1.13e-01

Pos logit contributions
 Bord+1.001
 NAS+0.921
 BDS+0.829
 KE+0.828
 law+0.803
Neg logit contributions
emet-0.885
ervatives-0.843
adan-0.827
_(-0.821
_--0.808
Tokenola
Token ID5708
Feature activation-2.78e-01

Pos logit contributions
emet+0.171
ervatives+0.163
adan+0.160
_(+0.160
_-+0.158
Neg logit contributions
 Bord-0.210
 NAS-0.196
 KE-0.175
 BDS-0.175
 law-0.169
Token on
Token ID319
Feature activation-1.52e-01

Pos logit contributions
emet+0.485
ervatives+0.465
adan+0.461
_(+0.456
_-+0.449
Neg logit contributions
 Bord-0.453
 NAS-0.418
 BDS-0.371
 KE-0.368
 law-0.355
Token piano
Token ID19132
Feature activation-3.25e-01

Pos logit contributions
emet+0.259
ervatives+0.248
adan+0.247
_(+0.243
_-+0.239
Neg logit contributions
 Bord-0.252
 NAS-0.234
 BDS-0.209
 KE-0.207
 law-0.200
Token,
Token ID11
Feature activation+9.71e-02

Pos logit contributions
emet+0.520
ervatives+0.495
adan+0.492
_(+0.484
_-+0.478
Neg logit contributions
 Bord-0.575
 NAS-0.535
 BDS-0.481
 KE-0.476
 law-0.462
Token with
Token ID351
Feature activation-1.46e-01

Pos logit contributions
 Bord+0.156
 NAS+0.143
 BDS+0.127
 KE+0.126
 law+0.122
Neg logit contributions
emet-0.174
ervatives-0.166
adan-0.165
_(-0.163
_--0.161
Token vocal
Token ID12435
Feature activation-3.23e-01

Pos logit contributions
emet+0.267
ervatives+0.256
adan+0.254
_(+0.251
_-+0.248
Neg logit contributions
 Bord-0.229
 NAS-0.210
 BDS-0.186
 KE-0.184
 law-0.177
Tokenists
Token ID1023
Feature activation-2.06e-01

Pos logit contributions
emet+0.388
ervatives+0.364
adan+0.358
_(+0.352
_-+0.347
Neg logit contributions
 Bord-0.703
 NAS-0.662
 BDS-0.610
 KE-0.605
 law-0.592
Token\n
Token ID198
Feature activation-2.75e-01

Pos logit contributions
emet+0.604
ervatives+0.575
adan+0.574
avorite+0.573
elsius+0.569
Neg logit contributions
 Bord-0.493
 NAS-0.459
 BDS-0.403
 KE-0.396
 law-0.383
TokenThus
Token ID19093
Feature activation-2.77e-01

Pos logit contributions
emet+0.422
ervatives+0.402
adan+0.398
_(+0.389
elsius+0.386
Neg logit contributions
 Bord-0.502
 NAS-0.469
 BDS-0.422
 KE-0.418
 law-0.407
Token Massachusetts
Token ID10140
Feature activation-2.51e-02

Pos logit contributions
emet+0.482
ervatives+0.461
adan+0.458
_(+0.453
_-+0.446
Neg logit contributions
 Bord-0.450
 NAS-0.416
 BDS-0.370
 KE-0.365
 law-0.354
Token is
Token ID318
Feature activation+1.81e-02

Pos logit contributions
emet+0.042
ervatives+0.040
adan+0.040
_(+0.040
_-+0.039
Neg logit contributions
 Bord-0.042
 NAS-0.039
 BDS-0.035
 KE-0.034
 law-0.034
Token one
Token ID530
Feature activation-1.13e-01

Pos logit contributions
 Bord+0.029
 NAS+0.027
 BDS+0.024
 KE+0.023
 law+0.023
Neg logit contributions
emet-0.032
ervatives-0.031
adan-0.031
_(-0.030
_--0.030
Token of
Token ID286
Feature activation-2.23e-01

Pos logit contributions
emet+0.179
adan+0.171
ervatives+0.170
_(+0.169
stein+0.165
Neg logit contributions
 Bord-0.197
 NAS-0.184
 BDS-0.167
 KE-0.165
 CARD-0.158
Token the
Token ID262
Feature activation-6.23e-02

Pos logit contributions
emet+0.368
ervatives+0.350
adan+0.348
_(+0.344
_-+0.339
Neg logit contributions
 Bord-0.384
 NAS-0.357
 BDS-0.319
 KE-0.316
 law-0.307
Token few
Token ID1178
Feature activation+3.22e-01

Pos logit contributions
emet+0.096
adan+0.091
ervatives+0.090
_(+0.089
_-+0.088
Neg logit contributions
 Bord-0.115
 NAS-0.108
 BDS-0.097
 KE-0.096
 law-0.095
Token states
Token ID2585
Feature activation+7.34e-02

Pos logit contributions
 Bord+0.467
 NAS+0.430
 BDS+0.371
 law+0.370
 KE+0.369
Neg logit contributions
emet-0.625
adan-0.595
ervatives-0.592
_--0.584
_(-0.582
Token where
Token ID810
Feature activation-4.07e-02

Pos logit contributions
 Bord+0.114
 NAS+0.104
 BDS+0.092
 KE+0.091
 law+0.090
Neg logit contributions
emet-0.134
ervatives-0.128
adan-0.127
_(-0.126
_--0.125
Token the
Token ID262
Feature activation+3.21e-02

Pos logit contributions
emet+0.074
adan+0.070
ervatives+0.070
_(+0.069
_-+0.068
Neg logit contributions
 Bord-0.064
 NAS-0.059
 BDS-0.052
 KE-0.051
 law-0.051
Token;
Token ID26
Feature activation-1.22e-01

Pos logit contributions
emet+0.853
ervatives+0.814
adan+0.811
_(+0.802
_-+0.792
Neg logit contributions
 Bord-0.731
 NAS-0.673
 BDS-0.594
 KE-0.586
 law-0.570
Token to
Token ID284
Feature activation-2.72e-02

Pos logit contributions
emet+0.214
ervatives+0.204
adan+0.204
_(+0.201
stein+0.198
Neg logit contributions
 Bord-0.199
 NAS-0.184
 BDS-0.164
 KE-0.161
 law-0.158
Token compromise
Token ID13110
Feature activation-1.97e-01

Pos logit contributions
emet+0.047
ervatives+0.045
adan+0.045
_(+0.044
_-+0.044
Neg logit contributions
 Bord-0.045
 NAS-0.041
 BDS-0.037
 KE-0.036
 law-0.036
Token in
Token ID287
Feature activation-8.81e-02

Pos logit contributions
emet+0.339
adan+0.322
ervatives+0.320
_(+0.316
_-+0.313
Neg logit contributions
 Bord-0.328
 NAS-0.304
 BDS-0.272
 KE-0.267
 law-0.264
Token pursuit
Token ID14748
Feature activation+1.90e-01

Pos logit contributions
emet+0.161
ervatives+0.153
adan+0.152
_(+0.150
_-+0.148
Neg logit contributions
 Bord-0.139
 NAS-0.128
 BDS-0.114
 KE-0.111
 law-0.111
Token of
Token ID286
Feature activation-1.07e-01

Pos logit contributions
 Bord+0.281
 NAS+0.260
 BDS+0.230
 law+0.225
 KE+0.224
Neg logit contributions
emet-0.363
adan-0.346
ervatives-0.345
_(-0.337
elsius-0.337
Token the
Token ID262
Feature activation-1.34e-01

Pos logit contributions
emet+0.196
adan+0.186
ervatives+0.186
_(+0.183
_-+0.181
Neg logit contributions
 Bord-0.168
 NAS-0.155
 BDS-0.138
 KE-0.134
 law-0.133
Token success
Token ID1943
Feature activation+1.81e-02

Pos logit contributions
emet+0.225
adan+0.214
ervatives+0.212
_(+0.208
_-+0.207
Neg logit contributions
 Bord-0.232
 NAS-0.215
 BDS-0.194
 KE-0.190
 law-0.190
Token of
Token ID286
Feature activation-4.51e-02

Pos logit contributions
 Bord+0.029
 NAS+0.027
 BDS+0.024
 KE+0.024
 law+0.023
Neg logit contributions
emet-0.032
ervatives-0.030
adan-0.030
_(-0.030
_--0.029
Token our
Token ID674
Feature activation-8.17e-03

Pos logit contributions
emet+0.082
adan+0.079
ervatives+0.079
_(+0.077
_-+0.077
Neg logit contributions
 Bord-0.070
 NAS-0.065
 BDS-0.057
 KE-0.056
 law-0.055
Token nation
Token ID3277
Feature activation-2.51e-01

Pos logit contributions
emet+0.015
adan+0.014
ervatives+0.014
_(+0.014
elsius+0.014
Neg logit contributions
 Bord-0.013
 NAS-0.012
 BDS-0.010
 law-0.010
 KE-0.010
Token Prime
Token ID5537
Feature activation-6.39e-02

Pos logit contributions
 Bord+0.254
 NAS+0.230
 BDS+0.202
 KE+0.200
 law+0.200
Neg logit contributions
emet-0.321
adan-0.306
ervatives-0.303
_(-0.301
_--0.297
Token Minister
Token ID4139
Feature activation-1.31e-01

Pos logit contributions
emet+0.137
ervatives+0.131
adan+0.130
_(+0.128
_-+0.128
Neg logit contributions
 Bord-0.083
 NAS-0.073
 BDS-0.063
 KE-0.062
 law-0.061
Token and
Token ID290
Feature activation+1.53e-01

Pos logit contributions
emet+0.208
adan+0.196
ervatives+0.195
_(+0.191
_-+0.191
Neg logit contributions
 Bord-0.236
 NAS-0.218
 BDS-0.197
 KE-0.194
 law-0.192
Token,
Token ID11
Feature activation+1.53e-01

Pos logit contributions
 Bord+0.240
 NAS+0.220
 BDS+0.195
 KE+0.191
 law+0.190
Neg logit contributions
emet-0.280
ervatives-0.266
adan-0.266
_(-0.261
_--0.259
Token if
Token ID611
Feature activation+1.55e-01

Pos logit contributions
 Bord+0.235
 NAS+0.215
 BDS+0.190
 law+0.188
 KE+0.187
Neg logit contributions
emet-0.286
ervatives-0.271
adan-0.271
_(-0.266
_--0.265
Token necessary
Token ID3306
Feature activation-4.54e-01

Pos logit contributions
 Bord+0.263
 NAS+0.242
 BDS+0.217
 KE+0.213
 law+0.211
Neg logit contributions
emet-0.265
adan-0.251
ervatives-0.251
_(-0.247
stein-0.243
Token,
Token ID11
Feature activation+3.54e-01

Pos logit contributions
emet+0.701
ervatives+0.684
adan+0.677
_(+0.671
stein+0.654
Neg logit contributions
 Bord-0.803
 NAS-0.756
 KE-0.675
 BDS-0.673
 LIC-0.645
Token dismiss
Token ID6735
Feature activation+2.82e-01

Pos logit contributions
 Bord+0.550
 NAS+0.505
 BDS+0.447
 KE+0.438
 law+0.437
Neg logit contributions
emet-0.652
ervatives-0.620
adan-0.616
_(-0.609
_--0.603
Token one
Token ID530
Feature activation-2.78e-02

Pos logit contributions
 Bord+0.439
 NAS+0.401
 BDS+0.352
 KE+0.350
 law+0.345
Neg logit contributions
emet-0.518
adan-0.494
ervatives-0.491
_(-0.488
_--0.483
Token that
Token ID326
Feature activation+1.48e-01

Pos logit contributions
emet+0.052
ervatives+0.050
adan+0.050
_(+0.049
_-+0.049
Neg logit contributions
 Bord-0.042
 NAS-0.038
 BDS-0.033
 KE-0.033
 law-0.032
Token has
Token ID468
Feature activation+6.76e-02

Pos logit contributions
 Bord+0.251
 NAS+0.230
 BDS+0.206
 KE+0.203
 law+0.201
Neg logit contributions
emet-0.252
adan-0.239
ervatives-0.239
_(-0.235
_--0.232
Tokenada
Token ID4763
Feature activation-2.66e-01

Pos logit contributions
emet+0.503
ervatives+0.486
adan+0.474
_(+0.473
stein+0.463
Neg logit contributions
 Bord-0.630
 NAS-0.588
 BDS-0.528
 KE-0.526
 law-0.513
Tokenver
Token ID332
Feature activation+4.68e-01

Pos logit contributions
emet+0.409
ervatives+0.395
adan+0.386
_(+0.380
_-+0.375
Neg logit contributions
 Bord-0.483
 NAS-0.453
 BDS-0.405
 KE-0.405
 law-0.394
TokenBB
Token ID15199
Feature activation+8.65e-02

Pos logit contributions
 Bord+0.865
 NAS+0.810
 BDS+0.729
 KE+0.727
 law+0.707
Neg logit contributions
emet-0.705
ervatives-0.673
adan-0.652
_(-0.649
elsius-0.637
Token Profile
Token ID13118
Feature activation-6.87e-01

Pos logit contributions
 Bord+0.162
 NAS+0.152
 BDS+0.138
 KE+0.137
 law+0.133
Neg logit contributions
emet-0.128
ervatives-0.123
adan-0.120
_(-0.119
_--0.115
Token Blog
Token ID14001
Feature activation-9.08e-01

Pos logit contributions
emet+0.857
ervatives+0.805
adan+0.799
_(+0.786
_-+0.769
Neg logit contributions
 Bord-1.453
 NAS-1.368
 BDS-1.253
 KE-1.243
 law-1.212
Token Joined
Token ID16709
Feature activation-1.69e+00

Pos logit contributions
emet+0.962
ervatives+0.894
adan+0.875
elsius+0.834
_(+0.829
Neg logit contributions
 Bord-2.134
 NAS-2.049
 BDS-1.889
 KE-1.865
 law-1.828
Token June
Token ID2795
Feature activation-8.97e-01

Pos logit contributions
emet+2.914
ervatives+2.779
adan+2.764
_(+2.719
_-+2.676
Neg logit contributions
 Bord-2.807
 NAS-2.610
 BDS-2.321
 KE-2.294
 law-2.216
Token 2009
Token ID3717
Feature activation-2.19e-01

Pos logit contributions
emet+1.528
ervatives+1.453
adan+1.452
_(+1.411
elsius+1.400
Neg logit contributions
 Bord-1.542
 NAS-1.437
 BDS-1.281
 KE-1.263
 law-1.218
Token Germany
Token ID4486
Feature activation-7.22e-01

Pos logit contributions
emet+0.411
ervatives+0.395
adan+0.392
_(+0.387
_-+0.382
Neg logit contributions
 Bord-0.329
 NAS-0.304
 BDS-0.266
 KE-0.263
 law-0.253
Token 247
Token ID30179
Feature activation-3.82e-02

Pos logit contributions
emet+1.230
ervatives+1.173
adan+1.166
_(+1.136
elsius+1.127
Neg logit contributions
 Bord-1.230
 NAS-1.148
 BDS-1.028
 KE-1.013
 law-0.976
Token34
Token ID2682
Feature activation-2.56e-01

Pos logit contributions
emet+0.054
ervatives+0.051
adan+0.050
elsius+0.049
_(+0.049
Neg logit contributions
 Bord-0.076
 NAS-0.072
 BDS-0.065
 KE-0.065
 LIC-0.063
Token Jacob
Token ID12806
Feature activation-3.80e-02

Pos logit contributions
emet+1.450
ervatives+1.388
adan+1.375
_(+1.363
stein+1.338
Neg logit contributions
 Bord-1.425
 NAS-1.321
 BDS-1.180
 KE-1.165
 law-1.124
Token Lew
Token ID8260
Feature activation-6.25e-01

Pos logit contributions
emet+0.077
ervatives+0.074
adan+0.074
avorite+0.072
elsius+0.072
Neg logit contributions
 Bord-0.052
 NAS-0.048
 BDS-0.041
 KE-0.041
 law-0.038
Token sent
Token ID1908
Feature activation+1.92e-01

Pos logit contributions
emet+0.954
ervatives+0.905
adan+0.899
_(+0.892
_-+0.877
Neg logit contributions
 Bord-1.149
 NAS-1.072
 BDS-0.966
 KE-0.959
 law-0.937
Token Congress
Token ID3162
Feature activation+6.85e-01

Pos logit contributions
 Bord+0.305
 NAS+0.283
 BDS+0.250
 KE+0.247
 law+0.242
Neg logit contributions
emet-0.343
adan-0.326
ervatives-0.325
_(-0.323
_--0.320
Token another
Token ID1194
Feature activation+3.27e-01

Pos logit contributions
 Bord+1.079
 NAS+0.994
 BDS+0.881
 law+0.868
 KE+0.864
Neg logit contributions
emet-1.232
_(-1.167
adan-1.163
ervatives-1.156
_--1.145
Token warning
Token ID6509
Feature activation-1.38e+00

Pos logit contributions
 Bord+0.574
 NAS+0.534
 BDS+0.480
 KE+0.472
 law+0.469
Neg logit contributions
emet-0.528
ervatives-0.504
adan-0.502
_(-0.496
_--0.488
Token on
Token ID319
Feature activation-4.96e-02

Pos logit contributions
emet+2.172
ervatives+2.079
adan+2.052
_(+2.021
stein+2.001
Neg logit contributions
 Bord-2.462
 NAS-2.293
 BDS-2.060
 KE-2.045
 LIC-1.968
Token the
Token ID262
Feature activation-2.13e-02

Pos logit contributions
emet+0.078
ervatives+0.073
adan+0.073
_(+0.072
_-+0.071
Neg logit contributions
 Bord-0.090
 NAS-0.084
 BDS-0.075
 KE-0.075
 law-0.073
Token nation
Token ID3277
Feature activation-9.08e-02

Pos logit contributions
emet+0.040
ervatives+0.038
adan+0.038
_(+0.038
_-+0.037
Neg logit contributions
 Bord-0.032
 NAS-0.029
 BDS-0.026
 KE-0.025
 law-0.025
Token's
Token ID338
Feature activation+2.70e-02

Pos logit contributions
emet+0.129
ervatives+0.124
adan+0.123
_(+0.120
stein+0.118
Neg logit contributions
 Bord-0.175
 NAS-0.164
 BDS-0.149
 KE-0.148
 LIC-0.143
Token debt
Token ID5057
Feature activation+4.19e-02

Pos logit contributions
 Bord+0.038
 NAS+0.035
 BDS+0.031
 KE+0.030
 law+0.030
Neg logit contributions
emet-0.053
ervatives-0.051
adan-0.050
_(-0.050
_--0.049
Token operators
Token ID12879
Feature activation-4.65e-01

Pos logit contributions
emet+0.177
ervatives+0.170
adan+0.168
_-+0.166
_(+0.166
Neg logit contributions
 Bord-0.160
 NAS-0.149
 BDS-0.132
 KE-0.130
 law-0.128
Token,
Token ID11
Feature activation-1.15e-01

Pos logit contributions
emet+0.783
ervatives+0.744
adan+0.744
_(+0.729
_-+0.722
Neg logit contributions
 Bord-0.789
 NAS-0.729
 BDS-0.651
 KE-0.643
 law-0.629
Token and
Token ID290
Feature activation-1.58e-01

Pos logit contributions
emet+0.200
adan+0.190
ervatives+0.190
_(+0.186
stein+0.183
Neg logit contributions
 Bord-0.191
 NAS-0.177
 BDS-0.157
 KE-0.155
 law-0.153
Token prevent
Token ID2948
Feature activation-2.37e-01

Pos logit contributions
emet+0.279
ervatives+0.266
adan+0.264
_(+0.260
_-+0.256
Neg logit contributions
 Bord-0.258
 NAS-0.240
 BDS-0.213
 KE-0.209
 law-0.207
Token def
Token ID825
Feature activation-1.94e-01

Pos logit contributions
emet+0.410
ervatives+0.393
adan+0.389
_(+0.384
_-+0.380
Neg logit contributions
 Bord-0.392
 NAS-0.366
 BDS-0.324
 KE-0.320
 law-0.314
Tokenection
Token ID3213
Feature activation-1.16e+00

Pos logit contributions
emet+0.359
ervatives+0.344
adan+0.343
_(+0.338
_-+0.334
Neg logit contributions
 Bord-0.294
 NAS-0.270
 BDS-0.237
 KE-0.234
 law-0.227
Token between
Token ID1022
Feature activation-3.88e-01

Pos logit contributions
emet+1.983
ervatives+1.889
adan+1.883
_(+1.851
_-+1.829
Neg logit contributions
 Bord-1.946
 NAS-1.802
 BDS-1.609
 KE-1.590
 law-1.548
Token the
Token ID262
Feature activation-2.46e-01

Pos logit contributions
emet+0.622
ervatives+0.594
adan+0.589
_(+0.578
_-+0.572
Neg logit contributions
 Bord-0.691
 NAS-0.651
 BDS-0.581
 KE-0.576
 law-0.567
Token miners
Token ID18295
Feature activation+3.66e-03

Pos logit contributions
emet+0.388
ervatives+0.368
adan+0.366
_(+0.358
_-+0.354
Neg logit contributions
 Bord-0.445
 NAS-0.420
 BDS-0.377
 KE-0.372
 law-0.368
Token from
Token ID422
Feature activation-2.16e-02

Pos logit contributions
 Bord+0.006
 NAS+0.006
 BDS+0.005
 KE+0.005
 law+0.005
Neg logit contributions
emet-0.006
adan-0.006
ervatives-0.006
_(-0.006
_--0.006
Token undermining
Token ID30340
Feature activation-2.37e-01

Pos logit contributions
emet+0.039
ervatives+0.037
adan+0.037
_(+0.036
_-+0.036
Neg logit contributions
 Bord-0.034
 NAS-0.032
 BDS-0.028
 KE-0.028
 law-0.028
Token,
Token ID11
Feature activation+1.16e-01

Pos logit contributions
emet+0.850
ervatives+0.805
adan+0.800
_(+0.789
_-+0.781
Neg logit contributions
 Bord-0.936
 NAS-0.868
 BDS-0.781
 KE-0.770
 law-0.756
Token
Token ID447
Feature activation+4.23e-02

Pos logit contributions
 Bord+0.192
 NAS+0.176
 BDS+0.156
 law+0.155
 KE+0.154
Neg logit contributions
emet-0.203
ervatives-0.193
adan-0.190
_(-0.189
STDOUT-0.186
Token
Token ID251
Feature activation+2.74e-02

Pos logit contributions
 Bord+0.069
 NAS+0.063
 BDS+0.056
 KE+0.056
 law+0.055
Neg logit contributions
emet-0.075
ervatives-0.071
adan-0.071
elsius-0.068
_(-0.068
Token King
Token ID2677
Feature activation+3.46e-01

Pos logit contributions
 Bord+0.048
 NAS+0.044
 BDS+0.039
 KE+0.039
 law+0.038
Neg logit contributions
emet-0.045
ervatives-0.043
adan-0.042
_(-0.042
_--0.041
Token said
Token ID531
Feature activation+2.03e-02

Pos logit contributions
 Bord+0.597
 NAS+0.548
 KE+0.489
 BDS+0.487
 law+0.485
Neg logit contributions
emet-0.568
ervatives-0.538
adan-0.536
_(-0.523
_--0.519
Token via
Token ID2884
Feature activation-1.41e+00

Pos logit contributions
 Bord+0.035
 NAS+0.032
 BDS+0.029
 KE+0.029
 law+0.028
Neg logit contributions
emet-0.034
adan-0.032
ervatives-0.032
_(-0.031
_--0.031
Token email
Token ID3053
Feature activation+3.89e-01

Pos logit contributions
emet+2.725
ervatives+2.609
adan+2.597
_(+2.576
_-+2.536
Neg logit contributions
 Bord-2.004
 NAS-1.833
 BDS-1.599
 KE-1.574
 law-1.509
Token.
Token ID13
Feature activation+3.08e-02

Pos logit contributions
 Bord+0.688
 NAS+0.638
 BDS+0.573
 KE+0.570
 law+0.555
Neg logit contributions
emet-0.622
ervatives-0.593
adan-0.590
_(-0.578
_--0.575
Token\n
Token ID198
Feature activation+1.07e-03

Pos logit contributions
 Bord+0.047
 NAS+0.043
 BDS+0.038
 KE+0.038
 law+0.036
Neg logit contributions
emet-0.057
ervatives-0.055
adan-0.055
_(-0.053
elsius-0.053
Token\n
Token ID198
Feature activation-2.00e-02

Pos logit contributions
 Bord+0.002
 NAS+0.002
 BDS+0.001
 KE+0.001
 law+0.001
Neg logit contributions
emet-0.002
avorite-0.002
ervatives-0.002
adan-0.002
elsius-0.002
TokenFrom
Token ID4863
Feature activation+1.74e-01

Pos logit contributions
emet+0.032
ervatives+0.031
adan+0.030
elsius+0.030
_(+0.030
Neg logit contributions
 Bord-0.035
 NAS-0.033
 BDS-0.029
 KE-0.029
 law-0.028
Token continents
Token ID33431
Feature activation+3.28e-01

Pos logit contributions
emet+0.231
ervatives+0.224
adan+0.220
_(+0.219
_-+0.214
Neg logit contributions
 Bord-0.220
 NAS-0.202
 BDS-0.181
 KE-0.177
 law-0.175
Token.
Token ID13
Feature activation-7.99e-02

Pos logit contributions
 Bord+0.560
 NAS+0.510
 BDS+0.460
 KE+0.452
 law+0.448
Neg logit contributions
emet-0.554
ervatives-0.527
adan-0.523
_(-0.515
_--0.512
Token\n
Token ID198
Feature activation-2.11e-02

Pos logit contributions
emet+0.146
ervatives+0.140
adan+0.139
_(+0.136
elsius+0.135
Neg logit contributions
 Bord-0.127
 NAS-0.116
 BDS-0.102
 KE-0.101
 ALP-0.097
Token\n
Token ID198
Feature activation-5.27e-02

Pos logit contributions
emet+0.040
avorite+0.038
ervatives+0.038
adan+0.038
elsius+0.038
Neg logit contributions
 Bord-0.032
 NAS-0.030
 BDS-0.026
 KE-0.026
 law-0.025
TokenF
Token ID37
Feature activation-8.74e-02

Pos logit contributions
emet+0.082
ervatives+0.079
adan+0.077
_(+0.075
elsius+0.075
Neg logit contributions
 Bord-0.095
 NAS-0.089
 BDS-0.080
 KE-0.079
 law-0.077
Tokenam
Token ID321
Feature activation-1.24e+00

Pos logit contributions
emet+0.136
ervatives+0.130
adan+0.128
_(+0.127
_-+0.125
Neg logit contributions
 Bord-0.157
 NAS-0.146
 BDS-0.132
 KE-0.130
 law-0.127
Tokenously
Token ID3481
Feature activation+1.82e-01

Pos logit contributions
emet+1.639
ervatives+1.545
adan+1.536
_(+1.504
stein+1.478
Neg logit contributions
 Bord-2.537
 NAS-2.392
 BDS-2.183
 KE-2.162
 law-2.101
Token,
Token ID11
Feature activation-6.97e-02

Pos logit contributions
 Bord+0.297
 NAS+0.272
 BDS+0.243
 KE+0.239
 law+0.233
Neg logit contributions
emet-0.319
ervatives-0.305
adan-0.303
_(-0.299
_--0.294
Token Xer
Token ID44264
Feature activation-4.80e-01

Pos logit contributions
emet+0.126
ervatives+0.121
adan+0.119
_(+0.119
_-+0.116
Neg logit contributions
 Bord-0.111
 NAS-0.101
 BDS-0.090
 KE-0.089
 law-0.087
Tokenxes
Token ID48169
Feature activation-5.02e-01

Pos logit contributions
emet+0.714
ervatives+0.681
_(+0.659
avorite+0.658
elsius+0.657
Neg logit contributions
 Bord-0.882
 NAS-0.827
 KE-0.755
 BDS-0.751
 law-0.735
Token ordered
Token ID6149
Feature activation-1.46e-01

Pos logit contributions
emet+0.838
ervatives+0.793
adan+0.787
_(+0.779
_-+0.764
Neg logit contributions
 Bord-0.868
 NAS-0.796
 BDS-0.713
 KE-0.706
 law-0.694