TOP ACTIVATIONS
MAX = 1.393

rop yl methyl - 7 -( 2 , 6 - dim
3 . 5 - liter Eco Boost ® engine with more
$ 10 or more to help us continue to write about
rigorous testing to meet MIL STD 810 G standards , guaranteeing
I / HDMI / 2 x Mini DP T
En i received the largest fine - more than 130 m
bits ), was at one time widely in use and was
3 . 5 - liter Eco Boost ® engine that produces
3 OC Kit Dim ensions : 277 x 120
Harvard . The idea takes off from the Take Care Clause
Blu - Ray is outrage ously expensive , and hard -
locked Digital Power , 3 x 3 OC Kit
$ 10 or more to help us continue to write about
R 78 70 Hawk GPU : AMD Radeon HD
- 256 bit Cool ing : MSI Twin Fro
can someone with a mixed background better embrace both cultures and
MSI R 78 70 Hawk GPU : AMD Radeon
law students , despite their background , will have the opportunity
2 - ethyl - N -( t et rah yd ro
checkout can get 20 % off all merchandise in our store

MIN ACTIVATIONS
MIN = -1.528

- analysis of intervention research . School Psychology Quarterly , 23
Case in point : This # J eb No Filter video
the last two days , # C hen nai R ains
child to a grave in # Gaza , Bad
des inc end ies sur # B orm es les m
on Twitter . <|endoftext|> Media playback is unsupported on your device
Ho iber g said . # Bull s Cody Wester
) ; return normal ; # endif } If you are
surface . Media playback is unsupported on your device
her . Note # 1 : To prove just
types of bullies in adolescence . Journal of Youth and Ad
? The debut of Superman # 1 , the first superhero
traffic in / out of # H of Deb ate 16
energy goals . <|endoftext|> Media playback is unsupported on your device
from Europe and North America . International Journal of Public Health
s in the middle grades . RM LE Online : Research
matchup between Donald Trump Donald John Trump House committee believes it
Republican nominee Donald Trump Donald John Trump House committee believes it
another father killed by # Israel another father carrying
United States 3 240 Posts # 10 Happy it worked out

INTERVAL 0.663 - 1.393
CONTAINS 0.327%

from £ 9 79 b & b . Thomas
up . Her eyes were still red from cutting up the
- edge CPR dummy comes with a companion dummy that is
Trudeau s Liberals approved last year .
and repeatedly . At the time , he said ,

INTERVAL -0.068 - 0.663
CONTAINS 62.906%

Wall Street , which at 72 - stories is one of
? Huh . PC This really only has one new trick
( ARM version , detected as EL F _ UM RE
s government has pursued its interests by stepping up its cyber
his next effort , a story titled " The Fox ",

INTERVAL -0.798 - -0.068
CONTAINS 36.636%

sign ificantly scaling back the Clinton Foundation and our
We asked Phil adel ph ians around the city
online edition of Y le is radio , Finland
ilateral ) cricket and more O DI tournaments ," D rav
should be killed and their children and women should be taken

INTERVAL -1.528 - -0.798
CONTAINS 0.130%

OUNT ( * ) AS count FROM ( SELECT SPL IT
s undeniable how well Arduino has enabled no v
state : Option < Board > , // The previous board
June 2012 Canada 16 10 Posts # 14 On September 20
- thank you ! P sh on Barrett gave me useful
Tokenrop
Token ID1773
Feature activation+4.56e-02

Pos logit contributions
 Demo+1.440
 Chow+1.383
 Zoro+1.309
 Rapt+1.300
 Desire+1.284
Neg logit contributions
-1.406
enders-1.367
ICAN-1.365
inson-1.361
GC-1.337
Tokenyl
Token ID2645
Feature activation-1.06e-01

Pos logit contributions
 Demo+0.127
 Chow+0.121
 Zoro+0.114
 Rapt+0.111
 Desire+0.110
Neg logit contributions
-0.185
inson-0.184
enders-0.181
ICAN-0.180
GC-0.178
Tokenmethyl
Token ID43654
Feature activation+7.51e-01

Pos logit contributions
+0.326
ICAN+0.324
enders+0.317
inson+0.316
GC+0.315
Neg logit contributions
 Demo-0.388
 Chow-0.373
 Zoro-0.356
 Rapt-0.354
 Desire-0.351
Token-
Token ID12
Feature activation+3.80e-01

Pos logit contributions
 Demo+2.686
 Chow+2.577
 Zoro+2.450
 Rapt+2.447
 Desire+2.395
Neg logit contributions
-2.425
ICAN-2.377
enders-2.332
inson-2.301
GC-2.284
Token7
Token ID22
Feature activation-7.13e-02

Pos logit contributions
 Demo+1.311
 Chow+1.261
 Zoro+1.195
 Rapt+1.192
 Desire+1.174
Neg logit contributions
-1.282
ICAN-1.254
enders-1.225
GC-1.216
inson-1.208
Token-(
Token ID30420
Feature activation+1.39e+00

Pos logit contributions
+0.258
ICAN+0.254
GC+0.249
inson+0.247
enders+0.246
Neg logit contributions
 Demo-0.226
 Chow-0.215
 Rapt-0.203
 Zoro-0.202
 Desire-0.199
Token2
Token ID17
Feature activation+6.15e-01

Pos logit contributions
 Demo+3.851
 Chow+3.650
 Zoro+3.444
 Rapt+3.431
 Desire+3.307
Neg logit contributions
-5.557
ICAN-5.442
enders-5.365
inson-5.299
NEY-5.247
Token,
Token ID11
Feature activation+3.82e-01

Pos logit contributions
 Demo+2.048
 Chow+1.961
 Rapt+1.856
 Zoro+1.854
 Desire+1.808
Neg logit contributions
-2.139
ICAN-2.096
enders-2.059
inson-2.034
GC-2.014
Token6
Token ID21
Feature activation+4.77e-01

Pos logit contributions
 Demo+0.960
 Chow+0.905
 Rapt+0.835
 Zoro+0.827
 Desire+0.816
Neg logit contributions
-1.654
ICAN-1.624
GC-1.598
enders-1.598
inson-1.592
Token-
Token ID12
Feature activation+8.84e-01

Pos logit contributions
 Demo+1.463
 Chow+1.397
 Rapt+1.315
 Zoro+1.312
 Desire+1.279
Neg logit contributions
-1.782
ICAN-1.748
enders-1.719
inson-1.702
GC-1.689
Tokendim
Token ID27740
Feature activation+7.09e-01

Pos logit contributions
 Demo+2.984
 Chow+2.854
 Zoro+2.710
 Rapt+2.706
 Desire+2.637
Neg logit contributions
-3.014
ICAN-2.961
enders-2.898
inson-2.857
NEY-2.829
Token 3
Token ID513
Feature activation+3.67e-01

Pos logit contributions
 Demo+2.581
 Chow+2.482
 Zoro+2.348
 Rapt+2.324
 Desire+2.300
Neg logit contributions
-2.299
ICAN-2.255
enders-2.219
inson-2.187
GC-2.170
Token.
Token ID13
Feature activation+1.59e-01

Pos logit contributions
 Demo+1.300
 Chow+1.249
 Zoro+1.183
 Rapt+1.170
 Desire+1.161
Neg logit contributions
-1.200
ICAN-1.169
enders-1.158
inson-1.144
GC-1.139
Token5
Token ID20
Feature activation+1.31e-01

Pos logit contributions
 Demo+0.440
 Chow+0.418
 Zoro+0.392
 Rapt+0.385
 Desire+0.383
Neg logit contributions
-0.643
ICAN-0.631
enders-0.620
GC-0.617
NEY-0.612
Token-
Token ID12
Feature activation+2.39e-01

Pos logit contributions
 Demo+0.434
 Chow+0.416
 Zoro+0.392
 Rapt+0.390
 Desire+0.388
Neg logit contributions
-0.451
enders-0.438
ICAN-0.436
inson-0.431
GC-0.429
Tokenliter
Token ID17201
Feature activation+3.24e-01

Pos logit contributions
 Demo+0.668
 Chow+0.633
 Zoro+0.590
 Rapt+0.583
 Desire+0.579
Neg logit contributions
-0.959
enders-0.938
ICAN-0.934
inson-0.924
GC-0.918
Token Eco
Token ID38719
Feature activation+1.34e+00

Pos logit contributions
 Demo+1.065
 Chow+1.016
 Zoro+0.961
 Desire+0.939
 Rapt+0.936
Neg logit contributions
-1.123
ICAN-1.111
enders-1.092
inson-1.079
GC-1.070
TokenBoost
Token ID45686
Feature activation+6.95e-01

Pos logit contributions
 Demo+5.288
 Chow+5.104
 Rapt+4.889
 Zoro+4.827
 Desire+4.720
Neg logit contributions
-3.815
ICAN-3.736
enders-3.632
inson-3.579
NEY-3.524
Token®
Token ID7461
Feature activation+5.96e-01

Pos logit contributions
 Demo+2.455
 Chow+2.364
 Zoro+2.242
 Rapt+2.202
 Desire+2.195
Neg logit contributions
-2.261
ICAN-2.227
enders-2.188
inson-2.159
GC-2.140
Token engine
Token ID3113
Feature activation+3.69e-01

Pos logit contributions
 Demo+2.294
 Chow+2.203
 Zoro+2.104
 Rapt+2.068
 Desire+2.066
Neg logit contributions
-1.745
ICAN-1.730
enders-1.689
inson-1.661
GC-1.653
Token with
Token ID351
Feature activation+4.87e-01

Pos logit contributions
 Demo+1.429
 Chow+1.378
 Zoro+1.311
 Rapt+1.305
 Desire+1.298
Neg logit contributions
-1.065
ICAN-1.044
enders-1.027
inson-1.021
GC-1.006
Token more
Token ID517
Feature activation+1.43e-01

Pos logit contributions
 Demo+1.731
 Chow+1.656
 Zoro+1.566
 Rapt+1.551
 Desire+1.539
Neg logit contributions
-1.559
ICAN-1.523
enders-1.513
inson-1.495
GC-1.482
Token $
Token ID720
Feature activation-4.46e-02

Pos logit contributions
 Demo+1.036
 Chow+0.996
 Zoro+0.932
 Rapt+0.906
 Desire+0.901
Neg logit contributions
-1.500
ICAN-1.487
enders-1.477
GC-1.457
inson-1.455
Token10
Token ID940
Feature activation+3.46e-01

Pos logit contributions
ICAN+0.152
enders+0.150
+0.150
inson+0.146
GC+0.146
Neg logit contributions
 Demo-0.149
 Chow-0.141
 Zoro-0.136
 Rapt-0.135
 Desire-0.131
Token or
Token ID393
Feature activation+4.77e-01

Pos logit contributions
 Demo+1.329
 Chow+1.279
 Zoro+1.231
 Rapt+1.228
 Desire+1.200
Neg logit contributions
-1.017
ICAN-0.988
enders-0.974
inson-0.958
NEY-0.947
Token more
Token ID517
Feature activation+5.07e-01

Pos logit contributions
 Demo+1.545
 Chow+1.484
 Zoro+1.407
 Rapt+1.405
 Desire+1.369
Neg logit contributions
-1.686
ICAN-1.660
enders-1.628
inson-1.604
GC-1.591
Token to
Token ID284
Feature activation+6.18e-01

Pos logit contributions
 Demo+1.709
 Chow+1.638
 Rapt+1.551
 Zoro+1.548
 Desire+1.511
Neg logit contributions
-1.740
ICAN-1.704
enders-1.674
inson-1.653
GC-1.636
Token help
Token ID1037
Feature activation+1.33e+00

Pos logit contributions
 Demo+2.256
 Chow+2.165
 Zoro+2.066
 Rapt+2.066
 Desire+2.016
Neg logit contributions
-1.943
ICAN-1.902
enders-1.866
inson-1.840
GC-1.821
Token us
Token ID514
Feature activation+7.38e-01

Pos logit contributions
 Demo+3.792
 Chow+3.641
 Rapt+3.373
 Zoro+3.338
 Desire+3.323
Neg logit contributions
-5.267
ICAN-5.134
enders-5.064
inson-4.996
GC-4.990
Token continue
Token ID2555
Feature activation+9.60e-01

Pos logit contributions
 Demo+2.767
 Chow+2.658
 Rapt+2.527
 Zoro+2.518
 Desire+2.462
Neg logit contributions
-2.262
ICAN-2.201
enders-2.151
inson-2.123
GC-2.102
Token to
Token ID284
Feature activation+6.64e-01

Pos logit contributions
 Demo+3.196
 Chow+3.065
 Rapt+2.895
 Zoro+2.895
 Desire+2.809
Neg logit contributions
-3.325
ICAN-3.261
enders-3.208
inson-3.165
GC-3.126
Token write
Token ID3551
Feature activation+9.21e-01

Pos logit contributions
 Demo+2.694
 Chow+2.596
 Zoro+2.514
 Rapt+2.487
 Desire+2.464
Neg logit contributions
-1.799
ICAN-1.780
enders-1.753
inson-1.721
GC-1.700
Token about
Token ID546
Feature activation+4.39e-01

Pos logit contributions
 Demo+3.204
 Chow+3.076
 Rapt+2.923
 Zoro+2.919
 Desire+2.845
Neg logit contributions
-3.051
ICAN-2.981
enders-2.932
inson-2.896
NEY-2.864
Token rigorous
Token ID22888
Feature activation+2.40e-01

Pos logit contributions
 Demo+0.639
 Chow+0.617
 Zoro+0.583
 Rapt+0.583
 Desire+0.580
Neg logit contributions
-0.613
ICAN-0.597
GC-0.590
enders-0.588
inson-0.580
Token testing
Token ID4856
Feature activation-1.79e-02

Pos logit contributions
 Demo+0.847
 Chow+0.821
 Zoro+0.778
 Rapt+0.775
 Desire+0.769
Neg logit contributions
-0.766
ICAN-0.746
GC-0.737
enders-0.731
inson-0.720
Token to
Token ID284
Feature activation-1.31e-01

Pos logit contributions
+0.054
ICAN+0.052
enders+0.052
GC+0.051
inson+0.051
Neg logit contributions
 Demo-0.068
 Chow-0.065
 Rapt-0.062
 Zoro-0.062
 Desire-0.061
Token meet
Token ID1826
Feature activation+2.36e-01

Pos logit contributions
+0.395
ICAN+0.385
GC+0.380
enders+0.380
inson+0.378
Neg logit contributions
 Demo-0.482
 Chow-0.463
 Rapt-0.441
 Desire-0.439
 Zoro-0.436
Token MIL
Token ID31515
Feature activation+9.65e-01

Pos logit contributions
 Demo+0.821
 Chow+0.792
 Zoro+0.748
 Rapt+0.741
 Desire+0.736
Neg logit contributions
-0.769
ICAN-0.754
enders-0.743
GC-0.734
inson-0.726
TokenSTD
Token ID32147
Feature activation+1.31e+00

Pos logit contributions
 Demo+2.946
 Chow+2.808
 Rapt+2.643
 Zoro+2.641
 Desire+2.561
Neg logit contributions
-3.620
ICAN-3.544
enders-3.501
inson-3.461
GC-3.410
Token810
Token ID40215
Feature activation+9.06e-01

Pos logit contributions
 Demo+3.786
 Chow+3.579
 Rapt+3.401
 Zoro+3.378
 Desire+3.257
Neg logit contributions
-5.108
ICAN-4.952
enders-4.950
inson-4.883
NEY-4.778
TokenG
Token ID38
Feature activation+8.73e-01

Pos logit contributions
 Demo+3.103
 Chow+2.975
 Rapt+2.824
 Zoro+2.820
 Desire+2.749
Neg logit contributions
-3.060
ICAN-2.998
enders-2.944
inson-2.903
GC-2.868
Token standards
Token ID5423
Feature activation+6.02e-01

Pos logit contributions
 Demo+3.058
 Chow+2.939
 Zoro+2.783
 Rapt+2.783
 Desire+2.724
Neg logit contributions
-2.878
ICAN-2.823
enders-2.772
inson-2.734
GC-2.719
Token,
Token ID11
Feature activation+3.38e-01

Pos logit contributions
 Demo+2.128
 Chow+2.044
 Rapt+1.940
 Zoro+1.938
 Desire+1.895
Neg logit contributions
-1.966
ICAN-1.924
enders-1.888
inson-1.863
GC-1.846
Token guaranteeing
Token ID50156
Feature activation+2.17e-01

Pos logit contributions
 Demo+1.286
 Chow+1.241
 Zoro+1.179
 Rapt+1.178
 Desire+1.163
Neg logit contributions
-0.995
ICAN-0.975
enders-0.955
GC-0.949
inson-0.945
TokenI
Token ID40
Feature activation+3.39e-01

Pos logit contributions
 Demo+2.408
 Chow+2.335
 Zoro+2.239
 Rapt+2.227
 Desire+2.172
Neg logit contributions
-1.872
ICAN-1.795
inson-1.757
enders-1.754
GC-1.752
Token /
Token ID1220
Feature activation+5.89e-01

Pos logit contributions
 Demo+1.195
 Chow+1.158
 Zoro+1.104
 Rapt+1.093
 Desire+1.071
Neg logit contributions
-1.095
ICAN-1.070
enders-1.057
GC-1.046
inson-1.043
Token HDMI
Token ID25499
Feature activation+8.26e-01

Pos logit contributions
 Demo+1.782
 Chow+1.726
 Zoro+1.618
 Rapt+1.604
 Desire+1.565
Neg logit contributions
-2.197
ICAN-2.158
enders-2.128
GC-2.117
inson-2.114
Token /
Token ID1220
Feature activation+5.63e-01

Pos logit contributions
 Demo+2.780
 Chow+2.688
 Zoro+2.542
 Rapt+2.530
 Desire+2.470
Neg logit contributions
-2.836
ICAN-2.770
enders-2.725
inson-2.696
GC-2.677
Token 2
Token ID362
Feature activation+2.64e-01

Pos logit contributions
 Demo+1.754
 Chow+1.708
 Zoro+1.606
 Rapt+1.583
 Desire+1.549
Neg logit contributions
-2.042
ICAN-2.000
inson-1.967
enders-1.964
GC-1.952
Tokenx
Token ID87
Feature activation+1.24e+00

Pos logit contributions
 Demo+0.928
 Chow+0.896
 Zoro+0.851
 Rapt+0.843
 Desire+0.832
Neg logit contributions
-0.869
ICAN-0.846
enders-0.829
inson-0.822
GC-0.819
Token Mini
Token ID12558
Feature activation+8.89e-01

Pos logit contributions
 Demo+3.887
 Chow+3.713
 Rapt+3.500
 Zoro+3.498
 Desire+3.404
Neg logit contributions
-4.556
ICAN-4.470
enders-4.396
inson-4.344
GC-4.303
TokenDP
Token ID6322
Feature activation+6.66e-01

Pos logit contributions
 Demo+2.673
 Chow+2.549
 Rapt+2.396
 Zoro+2.395
 Desire+2.328
Neg logit contributions
-3.376
ICAN-3.313
enders-3.261
inson-3.225
GC-3.195
Token
Token ID198
Feature activation+7.00e-01

Pos logit contributions
 Demo+2.408
 Chow+2.323
 Zoro+2.205
 Rapt+2.200
 Desire+2.156
Neg logit contributions
-2.129
ICAN-2.082
enders-2.042
inson-2.013
GC-1.996
Token
Token ID198
Feature activation+6.95e-01

Pos logit contributions
 Demo+2.393
 Chow+2.301
 Rapt+2.179
 Zoro+2.178
 Desire+2.125
Neg logit contributions
-2.373
ICAN-2.314
enders-2.274
inson-2.248
GC-2.237
TokenT
Token ID51
Feature activation+5.69e-01

Pos logit contributions
 Demo+2.796
 Chow+2.713
 Zoro+2.584
 Rapt+2.583
 Desire+2.533
Neg logit contributions
-1.952
ICAN-1.894
enders-1.852
inson-1.820
GC-1.819
Token En
Token ID2039
Feature activation+3.61e-02

Pos logit contributions
 Demo+0.332
 Chow+0.318
 Zoro+0.300
 Rapt+0.300
 Desire+0.292
Neg logit contributions
-0.339
ICAN-0.332
enders-0.327
GC-0.322
inson-0.322
Tokeni
Token ID72
Feature activation+3.05e-01

Pos logit contributions
 Demo+0.124
 Chow+0.119
 Zoro+0.113
 Rapt+0.112
 Desire+0.111
Neg logit contributions
-0.120
ICAN-0.118
inson-0.117
enders-0.117
GC-0.116
Token received
Token ID2722
Feature activation+2.38e-01

Pos logit contributions
 Demo+1.143
 Chow+1.100
 Zoro+1.051
 Rapt+1.050
 Desire+1.023
Neg logit contributions
-0.930
ICAN-0.907
enders-0.894
inson-0.882
GC-0.878
Token the
Token ID262
Feature activation-1.10e-02

Pos logit contributions
 Demo+0.805
 Chow+0.769
 Rapt+0.733
 Zoro+0.725
 Desire+0.713
Neg logit contributions
-0.810
ICAN-0.792
enders-0.781
inson-0.769
GC-0.766
Token largest
Token ID4387
Feature activation+7.90e-02

Pos logit contributions
+0.034
ICAN+0.033
enders+0.033
GC+0.032
inson+0.032
Neg logit contributions
 Demo-0.041
 Chow-0.039
 Rapt-0.037
 Zoro-0.037
 Desire-0.036
Token fine
Token ID3734
Feature activation+1.24e+00

Pos logit contributions
 Demo+0.301
 Chow+0.290
 Rapt+0.276
 Zoro+0.275
 Desire+0.270
Neg logit contributions
-0.235
ICAN-0.229
enders-0.225
inson-0.218
GC-0.218
Token -
Token ID532
Feature activation+2.29e-01

Pos logit contributions
 Demo+4.466
 Chow+4.309
 Zoro+4.100
 Rapt+4.090
 Desire+3.998
Neg logit contributions
-3.929
ICAN-3.828
enders-3.750
inson-3.702
NEY-3.645
Token more
Token ID517
Feature activation-1.43e-01

Pos logit contributions
 Demo+0.753
 Chow+0.717
 Rapt+0.680
 Zoro+0.672
 Desire+0.657
Neg logit contributions
-0.806
ICAN-0.788
enders-0.777
GC-0.771
inson-0.771
Token than
Token ID621
Feature activation+3.82e-01

Pos logit contributions
+0.400
ICAN+0.392
enders+0.385
inson+0.383
GC+0.376
Neg logit contributions
 Demo-0.569
 Chow-0.547
 Zoro-0.524
 Rapt-0.522
 Desire-0.514
Token 130
Token ID11323
Feature activation-5.77e-02

Pos logit contributions
 Demo+1.129
 Chow+1.068
 Rapt+1.006
 Zoro+1.000
 Desire+0.971
Neg logit contributions
-1.451
ICAN-1.431
enders-1.421
GC-1.403
inson-1.398
Tokenm
Token ID76
Feature activation+8.25e-02

Pos logit contributions
+0.170
ICAN+0.167
enders+0.162
inson+0.161
GC+0.160
Neg logit contributions
 Demo-0.222
 Chow-0.213
 Rapt-0.204
 Zoro-0.203
 Desire-0.200
Token bits
Token ID10340
Feature activation-2.10e-02

Pos logit contributions
+0.611
ICAN+0.597
GC+0.590
enders+0.589
NEY+0.584
Neg logit contributions
 Demo-0.777
 Chow-0.748
 Rapt-0.707
 Zoro-0.703
 Desire-0.700
Token),
Token ID828
Feature activation+2.19e-01

Pos logit contributions
+0.062
ICAN+0.061
enders+0.060
GC+0.059
inson+0.059
Neg logit contributions
 Demo-0.080
 Chow-0.077
 Rapt-0.073
 Zoro-0.073
 Desire-0.072
Token was
Token ID373
Feature activation+3.20e-01

Pos logit contributions
 Demo+0.856
 Chow+0.825
 Rapt+0.794
 Zoro+0.786
 Desire+0.773
Neg logit contributions
-0.629
ICAN-0.603
enders-0.600
inson-0.595
GC-0.589
Token at
Token ID379
Feature activation+2.08e-01

Pos logit contributions
 Demo+1.273
 Chow+1.230
 Rapt+1.178
 Zoro+1.173
 Desire+1.152
Neg logit contributions
-0.894
ICAN-0.872
enders-0.861
inson-0.847
GC-0.838
Token one
Token ID530
Feature activation+4.34e-01

Pos logit contributions
 Demo+0.729
 Chow+0.701
 Rapt+0.669
 Zoro+0.668
 Desire+0.649
Neg logit contributions
-0.681
ICAN-0.664
enders-0.660
inson-0.649
GC-0.643
Token time
Token ID640
Feature activation+1.23e+00

Pos logit contributions
 Demo+1.511
 Chow+1.450
 Rapt+1.376
 Zoro+1.374
 Desire+1.342
Neg logit contributions
-1.443
ICAN-1.413
enders-1.387
inson-1.369
GC-1.354
Token widely
Token ID6768
Feature activation+1.21e-01

Pos logit contributions
 Demo+4.941
 Chow+4.755
 Zoro+4.557
 Rapt+4.548
 Desire+4.455
Neg logit contributions
-3.438
ICAN-3.355
enders-3.274
inson-3.234
NEY-3.166
Token in
Token ID287
Feature activation+3.92e-01

Pos logit contributions
 Demo+0.583
 Chow+0.571
 Rapt+0.551
 Zoro+0.544
 Desire+0.540
Neg logit contributions
-0.234
ICAN-0.218
enders-0.218
GC-0.212
inson-0.210
Token use
Token ID779
Feature activation+2.75e-01

Pos logit contributions
 Demo+1.398
 Chow+1.345
 Rapt+1.281
 Zoro+1.279
 Desire+1.254
Neg logit contributions
-1.265
ICAN-1.221
enders-1.212
inson-1.197
GC-1.191
Token and
Token ID290
Feature activation+9.12e-02

Pos logit contributions
 Demo+1.049
 Chow+1.017
 Rapt+0.966
 Zoro+0.962
 Desire+0.943
Neg logit contributions
-0.816
ICAN-0.790
enders-0.781
inson-0.774
GC-0.766
Token was
Token ID373
Feature activation+1.45e-01

Pos logit contributions
 Demo+0.371
 Chow+0.359
 Rapt+0.343
 Zoro+0.342
 Desire+0.336
Neg logit contributions
-0.246
ICAN-0.236
enders-0.234
inson-0.232
GC-0.232
Token 3
Token ID513
Feature activation+2.74e-01

Pos logit contributions
 Demo+0.913
 Chow+0.871
 Zoro+0.820
 Desire+0.795
 Rapt+0.787
Neg logit contributions
-1.036
ICAN-1.024
enders-1.009
GC-1.001
inson-0.997
Token.
Token ID13
Feature activation+1.66e-01

Pos logit contributions
 Demo+0.924
 Chow+0.886
 Rapt+0.845
 Zoro+0.839
 Desire+0.818
Neg logit contributions
-0.937
ICAN-0.918
enders-0.903
inson-0.889
NEY-0.878
Token5
Token ID20
Feature activation+1.62e-01

Pos logit contributions
 Demo+0.432
 Chow+0.407
 Zoro+0.380
 Rapt+0.378
 Desire+0.367
Neg logit contributions
-0.696
ICAN-0.685
enders-0.673
inson-0.667
GC-0.664
Token-
Token ID12
Feature activation+1.98e-01

Pos logit contributions
 Demo+0.475
 Chow+0.452
 Rapt+0.429
 Zoro+0.424
 Desire+0.411
Neg logit contributions
-0.630
ICAN-0.623
enders-0.610
inson-0.602
NEY-0.596
Tokenliter
Token ID17201
Feature activation+2.83e-01

Pos logit contributions
 Demo+0.524
 Chow+0.496
 Zoro+0.461
 Rapt+0.457
 Desire+0.447
Neg logit contributions
-0.823
ICAN-0.807
enders-0.800
inson-0.793
GC-0.786
Token Eco
Token ID38719
Feature activation+1.19e+00

Pos logit contributions
 Demo+0.780
 Chow+0.738
 Zoro+0.690
 Desire+0.666
 Rapt+0.665
Neg logit contributions
-1.125
ICAN-1.119
enders-1.098
inson-1.086
GC-1.086
TokenBoost
Token ID45686
Feature activation+5.32e-01

Pos logit contributions
 Demo+4.875
 Chow+4.704
 Zoro+4.507
 Rapt+4.498
 Desire+4.418
Neg logit contributions
-3.222
ICAN-3.135
enders-3.071
inson-3.019
GC-2.988
Token®
Token ID7461
Feature activation+5.22e-01

Pos logit contributions
 Demo+1.673
 Chow+1.599
 Zoro+1.513
 Rapt+1.482
 Desire+1.470
Neg logit contributions
-1.938
ICAN-1.914
enders-1.873
inson-1.853
GC-1.851
Token engine
Token ID3113
Feature activation+2.06e-01

Pos logit contributions
 Demo+2.130
 Chow+2.052
 Zoro+1.963
 Rapt+1.936
 Desire+1.929
Neg logit contributions
-1.410
ICAN-1.392
enders-1.356
inson-1.334
GC-1.329
Token that
Token ID326
Feature activation+1.25e-01

Pos logit contributions
 Demo+0.804
 Chow+0.773
 Zoro+0.739
 Rapt+0.734
 Desire+0.726
Neg logit contributions
-0.592
ICAN-0.580
enders-0.570
inson-0.566
GC-0.561
Token produces
Token ID11073
Feature activation+1.58e-02

Pos logit contributions
 Demo+0.474
 Chow+0.450
 Zoro+0.432
 Desire+0.430
 Rapt+0.429
Neg logit contributions
-0.362
inson-0.355
enders-0.353
ICAN-0.353
GC-0.346
Token3
Token ID18
Feature activation-1.22e-01

Pos logit contributions
 Demo+3.718
 Chow+3.560
 Zoro+3.362
 Rapt+3.361
 Desire+3.276
Neg logit contributions
-4.052
ICAN-3.975
enders-3.908
inson-3.858
GC-3.826
Token OC
Token ID24775
Feature activation+7.94e-01

Pos logit contributions
+0.394
ICAN+0.383
GC+0.382
enders+0.381
inson+0.375
Neg logit contributions
 Demo-0.424
 Chow-0.415
 Zoro-0.390
 Rapt-0.385
 Desire-0.383
Token Kit
Token ID10897
Feature activation+9.45e-01

Pos logit contributions
 Demo+2.774
 Chow+2.670
 Zoro+2.527
 Rapt+2.525
 Desire+2.474
Neg logit contributions
-2.623
ICAN-2.567
enders-2.536
inson-2.494
GC-2.485
Token
Token ID198
Feature activation+6.41e-01

Pos logit contributions
 Demo+3.339
 Chow+3.210
 Rapt+3.047
 Zoro+3.044
 Desire+2.976
Neg logit contributions
-3.087
ICAN-3.019
enders-2.969
inson-2.927
GC-2.900
Token
Token ID198
Feature activation+6.63e-01

Pos logit contributions
 Demo+2.186
 Chow+2.101
 Rapt+1.991
 Zoro+1.989
 Desire+1.941
Neg logit contributions
-2.178
ICAN-2.125
enders-2.087
inson-2.061
GC-2.052
TokenDim
Token ID29271
Feature activation+1.17e+00

Pos logit contributions
 Demo+2.626
 Chow+2.540
 Rapt+2.422
 Zoro+2.419
 Desire+2.373
Neg logit contributions
-1.897
ICAN-1.841
enders-1.805
inson-1.775
GC-1.767
Tokenensions
Token ID5736
Feature activation+1.08e+00

Pos logit contributions
 Demo+4.689
 Chow+4.509
 Rapt+4.317
 Zoro+4.305
 Desire+4.229
Neg logit contributions
-3.257
ICAN-3.179
enders-3.082
inson-3.039
NEY-3.018
Token:
Token ID25
Feature activation+5.20e-01

Pos logit contributions
 Demo+4.134
 Chow+3.938
 Rapt+3.794
 Zoro+3.785
 Desire+3.691
Neg logit contributions
-3.189
ICAN-3.135
inson-3.043
enders-3.038
NEY-2.950
Token 277
Token ID38703
Feature activation+5.96e-01

Pos logit contributions
 Demo+1.755
 Chow+1.708
 Zoro+1.599
 Rapt+1.599
 Desire+1.558
Neg logit contributions
-1.758
ICAN-1.720
enders-1.701
inson-1.677
GC-1.675
Tokenx
Token ID87
Feature activation+6.73e-01

Pos logit contributions
 Demo+2.081
 Chow+2.004
 Rapt+1.895
 Zoro+1.894
 Desire+1.851
Neg logit contributions
-1.976
ICAN-1.930
enders-1.898
inson-1.879
GC-1.868
Token120
Token ID10232
Feature activation+9.46e-02

Pos logit contributions
 Demo+2.396
 Chow+2.307
 Rapt+2.189
 Zoro+2.186
 Desire+2.136
Neg logit contributions
-2.185
ICAN-2.137
enders-2.093
inson-2.068
GC-2.056
Token Harvard
Token ID11131
Feature activation-1.62e-01

Pos logit contributions
+1.189
ICAN+1.172
enders+1.155
GC+1.134
inson+1.128
Neg logit contributions
 Demo-1.209
 Chow-1.142
 Zoro-1.106
 Rapt-1.097
 Desire-1.069
Token.
Token ID13
Feature activation-1.23e-02

Pos logit contributions
+0.523
ICAN+0.516
enders+0.507
GC+0.502
inson+0.502
Neg logit contributions
 Demo-0.567
 Chow-0.541
 Zoro-0.523
 Rapt-0.521
 Desire-0.506
Token The
Token ID383
Feature activation-1.01e-01

Pos logit contributions
+0.039
ICAN+0.039
enders+0.038
GC+0.038
NEY+0.037
Neg logit contributions
 Demo-0.043
 Chow-0.042
 Zoro-0.039
 Rapt-0.039
 Desire-0.038
Token idea
Token ID2126
Feature activation-1.22e-01

Pos logit contributions
+0.330
ICAN+0.327
enders+0.321
inson+0.317
GC+0.314
Neg logit contributions
 Demo-0.350
 Chow-0.336
 Zoro-0.322
 Rapt-0.318
 Desire-0.314
Token takes
Token ID2753
Feature activation+2.69e-01

Pos logit contributions
+0.347
enders+0.336
ICAN+0.335
inson+0.332
GC+0.323
Neg logit contributions
 Demo-0.477
 Chow-0.461
 Zoro-0.442
 Rapt-0.440
 Desire-0.435
Token off
Token ID572
Feature activation+1.16e+00

Pos logit contributions
 Demo+0.948
 Chow+0.906
 Rapt+0.863
 Zoro+0.856
 Desire+0.852
Neg logit contributions
-0.851
ICAN-0.840
enders-0.832
inson-0.818
GC-0.812
Token from
Token ID422
Feature activation+3.59e-01

Pos logit contributions
 Demo+4.369
 Chow+4.201
 Rapt+4.006
 Zoro+3.998
 Desire+3.907
Neg logit contributions
-3.554
ICAN-3.473
enders-3.392
inson-3.341
GC-3.304
Token the
Token ID262
Feature activation-1.11e-01

Pos logit contributions
 Demo+1.254
 Chow+1.201
 Rapt+1.144
 Zoro+1.142
 Desire+1.118
Neg logit contributions
-1.168
ICAN-1.147
enders-1.130
inson-1.118
GC-1.110
Token Take
Token ID7214
Feature activation+7.53e-02

Pos logit contributions
+0.333
ICAN+0.331
enders+0.325
inson+0.322
GC+0.318
Neg logit contributions
 Demo-0.402
 Chow-0.388
 Zoro-0.370
 Rapt-0.369
 Desire-0.366
Token Care
Token ID7276
Feature activation-2.71e-01

Pos logit contributions
 Demo+0.267
 Chow+0.254
 Zoro+0.246
 Rapt+0.245
 Desire+0.238
Neg logit contributions
-0.240
ICAN-0.237
enders-0.235
GC-0.233
NEY-0.232
Token Clause
Token ID28081
Feature activation-1.41e-02

Pos logit contributions
+0.913
enders+0.904
ICAN+0.900
inson+0.881
GC+0.874
Neg logit contributions
 Demo-0.886
 Chow-0.870
 Zoro-0.827
 Rapt-0.819
 Desire-0.798
Token Blu
Token ID12391
Feature activation+3.62e-02

Pos logit contributions
 Demo+0.016
 Chow+0.015
 Zoro+0.015
 Rapt+0.015
 Desire+0.014
Neg logit contributions
-0.014
ICAN-0.013
enders-0.013
inson-0.013
GC-0.013
Token-
Token ID12
Feature activation+1.48e-01

Pos logit contributions
 Demo+0.114
 Chow+0.110
 Zoro+0.104
 Rapt+0.103
 Desire+0.101
Neg logit contributions
-0.130
ICAN-0.128
enders-0.127
NEY-0.124
inson-0.124
TokenRay
Token ID19591
Feature activation+1.48e-01

Pos logit contributions
 Demo+0.494
 Chow+0.471
 Rapt+0.447
 Zoro+0.445
 Desire+0.434
Neg logit contributions
-0.513
ICAN-0.505
enders-0.494
inson-0.489
GC-0.481
Token is
Token ID318
Feature activation-2.01e-02

Pos logit contributions
 Demo+0.554
 Chow+0.541
 Zoro+0.515
 Rapt+0.514
 Desire+0.500
Neg logit contributions
-0.447
ICAN-0.434
enders-0.430
GC-0.420
inson-0.417
Token outrage
Token ID11616
Feature activation+3.83e-01

Pos logit contributions
+0.057
ICAN+0.057
enders+0.055
GC+0.054
inson+0.054
Neg logit contributions
 Demo-0.078
 Chow-0.075
 Rapt-0.072
 Zoro-0.072
 Desire-0.070
Tokenously
Token ID3481
Feature activation+1.16e+00

Pos logit contributions
 Demo+1.306
 Chow+1.250
 Zoro+1.193
 Rapt+1.182
 Desire+1.159
Neg logit contributions
-1.301
ICAN-1.279
enders-1.258
inson-1.246
NEY-1.219
Token expensive
Token ID5789
Feature activation+2.17e-01

Pos logit contributions
 Demo+4.541
 Chow+4.377
 Rapt+4.180
 Zoro+4.179
 Desire+4.093
Neg logit contributions
-3.349
ICAN-3.267
enders-3.200
inson-3.152
GC-3.113
Token,
Token ID11
Feature activation-4.69e-02

Pos logit contributions
 Demo+0.788
 Chow+0.762
 Zoro+0.724
 Rapt+0.721
 Desire+0.704
Neg logit contributions
-0.684
ICAN-0.670
enders-0.658
inson-0.648
GC-0.645
Token and
Token ID290
Feature activation+2.72e-02

Pos logit contributions
+0.141
ICAN+0.136
enders+0.135
GC+0.133
inson+0.133
Neg logit contributions
 Demo-0.176
 Chow-0.170
 Zoro-0.162
 Rapt-0.161
 Desire-0.158
Token hard
Token ID1327
Feature activation+3.31e-01

Pos logit contributions
 Demo+0.102
 Chow+0.097
 Zoro+0.093
 Rapt+0.093
 Desire+0.091
Neg logit contributions
-0.082
ICAN-0.079
enders-0.079
GC-0.078
inson-0.077
Token-
Token ID12
Feature activation+3.42e-01

Pos logit contributions
 Demo+1.085
 Chow+1.028
 Rapt+0.978
 Zoro+0.973
 Desire+0.953
Neg logit contributions
-1.170
ICAN-1.149
enders-1.121
inson-1.111
GC-1.096
Tokenlocked
Token ID24162
Feature activation+7.49e-01

Pos logit contributions
 Demo+0.687
 Chow+0.660
 Zoro+0.626
 Rapt+0.625
 Desire+0.615
Neg logit contributions
-0.622
ICAN-0.613
enders-0.605
inson-0.593
GC-0.589
TokenDigital
Token ID27640
Feature activation+9.02e-01

Pos logit contributions
 Demo+2.547
 Chow+2.456
 Zoro+2.324
 Rapt+2.322
 Desire+2.263
Neg logit contributions
-2.528
ICAN-2.474
enders-2.438
GC-2.403
inson-2.402
TokenPower
Token ID13434
Feature activation+5.60e-01

Pos logit contributions
 Demo+3.185
 Chow+3.059
 Rapt+2.905
 Zoro+2.903
 Desire+2.834
Neg logit contributions
-2.948
ICAN-2.885
enders-2.832
inson-2.794
GC-2.765
Token,
Token ID11
Feature activation+5.32e-01

Pos logit contributions
 Demo+2.036
 Chow+1.974
 Zoro+1.869
 Rapt+1.867
 Desire+1.833
Neg logit contributions
-1.769
ICAN-1.715
enders-1.693
GC-1.669
inson-1.665
Token 3
Token ID513
Feature activation+6.77e-01

Pos logit contributions
 Demo+1.712
 Chow+1.649
 Zoro+1.551
 Rapt+1.544
 Desire+1.513
Neg logit contributions
-1.881
ICAN-1.847
enders-1.820
GC-1.802
inson-1.788
Tokenx
Token ID87
Feature activation+1.14e+00

Pos logit contributions
 Demo+2.371
 Chow+2.293
 Zoro+2.169
 Rapt+2.162
 Desire+2.119
Neg logit contributions
-2.239
ICAN-2.188
enders-2.153
inson-2.121
GC-2.114
Token3
Token ID18
Feature activation-1.22e-01

Pos logit contributions
 Demo+3.718
 Chow+3.560
 Zoro+3.362
 Rapt+3.361
 Desire+3.276
Neg logit contributions
-4.052
ICAN-3.975
enders-3.908
inson-3.858
GC-3.826
Token OC
Token ID24775
Feature activation+7.94e-01

Pos logit contributions
+0.394
ICAN+0.383
GC+0.382
enders+0.381
inson+0.375
Neg logit contributions
 Demo-0.424
 Chow-0.415
 Zoro-0.390
 Rapt-0.385
 Desire-0.383
Token Kit
Token ID10897
Feature activation+9.45e-01

Pos logit contributions
 Demo+2.774
 Chow+2.670
 Zoro+2.527
 Rapt+2.525
 Desire+2.474
Neg logit contributions
-2.623
ICAN-2.567
enders-2.536
inson-2.494
GC-2.485
Token
Token ID198
Feature activation+6.41e-01

Pos logit contributions
 Demo+3.339
 Chow+3.210
 Rapt+3.047
 Zoro+3.044
 Desire+2.976
Neg logit contributions
-3.087
ICAN-3.019
enders-2.969
inson-2.927
GC-2.900
Token
Token ID198
Feature activation+6.63e-01

Pos logit contributions
 Demo+2.186
 Chow+2.101
 Rapt+1.991
 Zoro+1.989
 Desire+1.941
Neg logit contributions
-2.178
ICAN-2.125
enders-2.087
inson-2.061
GC-2.052
Token $
Token ID720
Feature activation-7.87e-02

Pos logit contributions
 Demo+0.656
 Chow+0.615
 Zoro+0.580
 Rapt+0.574
 Desire+0.563
Neg logit contributions
-0.972
ICAN-0.955
enders-0.936
inson-0.930
GC-0.930
Token10
Token ID940
Feature activation+2.49e-01

Pos logit contributions
+0.278
ICAN+0.273
GC+0.268
enders+0.267
inson+0.263
Neg logit contributions
 Demo-0.259
 Chow-0.247
 Zoro-0.237
 Rapt-0.235
 Desire-0.229
Token or
Token ID393
Feature activation+4.06e-01

Pos logit contributions
 Demo+0.997
 Chow+0.958
 Rapt+0.919
 Zoro+0.918
 Desire+0.904
Neg logit contributions
-0.698
ICAN-0.676
enders-0.664
inson-0.654
GC-0.651
Token more
Token ID517
Feature activation+3.89e-01

Pos logit contributions
 Demo+1.376
 Chow+1.313
 Rapt+1.252
 Zoro+1.249
 Desire+1.220
Neg logit contributions
-1.388
ICAN-1.365
enders-1.327
GC-1.319
inson-1.318
Token to
Token ID284
Feature activation+4.67e-01

Pos logit contributions
 Demo+1.372
 Chow+1.315
 Zoro+1.250
 Rapt+1.249
 Desire+1.220
Neg logit contributions
-1.276
ICAN-1.243
enders-1.221
inson-1.204
GC-1.199
Token help
Token ID1037
Feature activation+1.14e+00

Pos logit contributions
 Demo+1.721
 Chow+1.654
 Zoro+1.575
 Rapt+1.574
 Desire+1.538
Neg logit contributions
-1.453
ICAN-1.424
enders-1.394
inson-1.378
GC-1.362
Token us
Token ID514
Feature activation+6.09e-01

Pos logit contributions
 Demo+3.328
 Chow+3.193
 Rapt+2.997
 Zoro+2.980
 Desire+2.910
Neg logit contributions
-4.383
ICAN-4.298
enders-4.212
inson-4.178
GC-4.134
Token continue
Token ID2555
Feature activation+7.48e-01

Pos logit contributions
 Demo+2.266
 Chow+2.190
 Rapt+2.075
 Zoro+2.071
 Desire+2.034
Neg logit contributions
-1.864
ICAN-1.827
enders-1.776
GC-1.747
inson-1.745
Token to
Token ID284
Feature activation+4.82e-01

Pos logit contributions
 Demo+2.538
 Chow+2.433
 Rapt+2.305
 Zoro+2.303
 Desire+2.247
Neg logit contributions
-2.550
ICAN-2.497
enders-2.453
inson-2.422
GC-2.396
Token write
Token ID3551
Feature activation+6.90e-01

Pos logit contributions
 Demo+1.913
 Chow+1.841
 Rapt+1.761
 Zoro+1.759
 Desire+1.724
Neg logit contributions
-1.367
ICAN-1.345
enders-1.320
inson-1.307
GC-1.273
Token about
Token ID546
Feature activation+3.15e-01

Pos logit contributions
 Demo+2.507
 Chow+2.411
 Zoro+2.291
 Rapt+2.286
 Desire+2.238
Neg logit contributions
-2.190
ICAN-2.139
enders-2.097
inson-2.071
GC-2.048
Token R
Token ID371
Feature activation+8.72e-02

Pos logit contributions
 Demo+0.480
 Chow+0.461
 Zoro+0.436
 Rapt+0.428
 Desire+0.421
Neg logit contributions
-0.519
ICAN-0.516
GC-0.505
enders-0.504
NEY-0.495
Token78
Token ID3695
Feature activation+2.80e-01

Pos logit contributions
 Demo+0.264
 Chow+0.252
 Zoro+0.238
 Rapt+0.233
 Desire+0.230
Neg logit contributions
-0.329
ICAN-0.327
enders-0.326
GC-0.318
inson-0.317
Token70
Token ID2154
Feature activation+3.50e-01

Pos logit contributions
 Demo+0.909
 Chow+0.878
 Zoro+0.818
 Rapt+0.805
 Desire+0.801
Neg logit contributions
-0.982
ICAN-0.959
enders-0.957
GC-0.949
inson-0.935
Token Hawk
Token ID26698
Feature activation+6.63e-01

Pos logit contributions
 Demo+1.195
 Chow+1.157
 Zoro+1.092
 Rapt+1.085
 Desire+1.064
Neg logit contributions
-1.172
ICAN-1.153
enders-1.131
GC-1.126
inson-1.113
Token
Token ID198
Feature activation+1.11e+00

Pos logit contributions
 Demo+2.219
 Chow+2.142
 Zoro+2.030
 Rapt+2.005
 Desire+1.969
Neg logit contributions
-2.270
ICAN-2.219
enders-2.194
inson-2.175
GC-2.164
Token
Token ID198
Feature activation+1.12e+00

Pos logit contributions
 Demo+3.609
 Chow+3.446
 Rapt+3.257
 Zoro+3.252
 Desire+3.172
Neg logit contributions
-3.940
ICAN-3.871
enders-3.813
inson-3.775
NEY-3.705
TokenGPU
Token ID33346
Feature activation+5.66e-01

Pos logit contributions
 Demo+4.291
 Chow+4.114
 Zoro+3.933
 Rapt+3.932
 Desire+3.851
Neg logit contributions
-3.294
ICAN-3.240
enders-3.172
inson-3.143
NEY-3.074
Token:
Token ID25
Feature activation+7.63e-01

Pos logit contributions
 Demo+2.329
 Chow+2.269
 Zoro+2.168
 Rapt+2.163
 Desire+2.116
Neg logit contributions
-1.497
ICAN-1.457
enders-1.441
GC-1.409
inson-1.407
Token AMD
Token ID10324
Feature activation+5.46e-01

Pos logit contributions
 Demo+2.594
 Chow+2.491
 Zoro+2.355
 Rapt+2.351
 Desire+2.295
Neg logit contributions
-2.591
ICAN-2.536
enders-2.496
inson-2.459
GC-2.446
Token Radeon
Token ID13082
Feature activation+6.91e-01

Pos logit contributions
 Demo+1.365
 Chow+1.296
 Zoro+1.201
 Rapt+1.178
 Desire+1.140
Neg logit contributions
-2.314
ICAN-2.286
enders-2.280
GC-2.254
inson-2.212
Token HD
Token ID5572
Feature activation+1.77e-01

Pos logit contributions
 Demo+2.140
 Chow+2.055
 Zoro+1.929
 Rapt+1.904
 Desire+1.868
Neg logit contributions
-2.536
ICAN-2.507
enders-2.456
GC-2.435
inson-2.402
Token -
Token ID532
Feature activation+5.48e-01

Pos logit contributions
 Demo+0.673
 Chow+0.652
 Zoro+0.617
 Rapt+0.610
 Desire+0.600
Neg logit contributions
-0.658
ICAN-0.646
GC-0.638
enders-0.637
inson-0.629
Token 256
Token ID17759
Feature activation+3.31e-01

Pos logit contributions
 Demo+1.761
 Chow+1.709
 Zoro+1.610
 Rapt+1.591
 Desire+1.567
Neg logit contributions
-1.921
ICAN-1.883
enders-1.863
GC-1.860
inson-1.839
Token bit
Token ID1643
Feature activation+2.30e-01

Pos logit contributions
 Demo+1.126
 Chow+1.083
 Zoro+1.032
 Rapt+1.020
 Desire+1.003
Neg logit contributions
-1.121
GC-1.082
ICAN-1.082
enders-1.078
inson-1.057
Token
Token ID198
Feature activation+6.26e-01

Pos logit contributions
 Demo+0.777
 Chow+0.752
 Zoro+0.718
 Rapt+0.703
 Desire+0.692
Neg logit contributions
-0.770
ICAN-0.743
GC-0.742
enders-0.736
inson-0.730
Token
Token ID198
Feature activation+7.65e-01

Pos logit contributions
 Demo+2.144
 Chow+2.062
 Rapt+1.953
 Zoro+1.953
 Desire+1.903
Neg logit contributions
-2.119
ICAN-2.066
enders-2.032
inson-2.007
GC-1.999
TokenCool
Token ID34530
Feature activation+1.12e+00

Pos logit contributions
 Demo+3.414
 Chow+3.313
 Zoro+3.178
 Rapt+3.177
 Desire+3.119
Neg logit contributions
-1.798
ICAN-1.740
enders-1.697
inson-1.663
GC-1.651
Tokening
Token ID278
Feature activation+3.25e-01

Pos logit contributions
 Demo+3.573
 Chow+3.433
 Zoro+3.234
 Rapt+3.218
 Desire+3.141
Neg logit contributions
-4.036
ICAN-3.951
enders-3.920
inson-3.844
GC-3.829
Token:
Token ID25
Feature activation+5.17e-01

Pos logit contributions
 Demo+1.363
 Chow+1.328
 Zoro+1.272
 Rapt+1.257
 Desire+1.241
Neg logit contributions
-0.835
ICAN-0.806
enders-0.804
GC-0.789
inson-0.780
Token MSI
Token ID30781
Feature activation+5.94e-01

Pos logit contributions
 Demo+1.665
 Chow+1.600
 Zoro+1.501
 Rapt+1.486
 Desire+1.472
Neg logit contributions
-1.828
ICAN-1.790
enders-1.772
GC-1.757
inson-1.734
Token Twin
Token ID14968
Feature activation+6.63e-01

Pos logit contributions
 Demo+1.948
 Chow+1.870
 Zoro+1.767
 Rapt+1.733
 Desire+1.721
Neg logit contributions
-2.060
ICAN-2.040
enders-2.010
GC-1.993
inson-1.964
Token Fro
Token ID9734
Feature activation+6.16e-01

Pos logit contributions
 Demo+1.661
 Chow+1.594
 Zoro+1.499
 Desire+1.434
 Rapt+1.414
Neg logit contributions
-2.796
ICAN-2.769
GC-2.748
enders-2.740
inson-2.704
Token can
Token ID460
Feature activation-2.47e-01

Pos logit contributions
+0.890
ICAN+0.855
enders+0.852
inson+0.851
GC+0.824
Neg logit contributions
 Demo-1.101
 Chow-1.041
 Rapt-1.000
 Zoro-0.997
 Desire-0.987
Token someone
Token ID2130
Feature activation+3.78e-02

Pos logit contributions
+0.688
ICAN+0.678
enders+0.674
GC+0.654
inson+0.652
Neg logit contributions
 Demo-0.971
 Chow-0.926
 Rapt-0.889
 Zoro-0.883
 Desire-0.878
Token with
Token ID351
Feature activation+6.38e-02

Pos logit contributions
 Demo+0.147
 Chow+0.142
 Rapt+0.135
 Zoro+0.135
 Desire+0.133
Neg logit contributions
-0.105
ICAN-0.105
enders-0.104
inson-0.101
GC-0.100
Token a
Token ID257
Feature activation-2.27e-01

Pos logit contributions
 Demo+0.228
 Chow+0.215
 Rapt+0.207
 Zoro+0.205
 Desire+0.202
Neg logit contributions
-0.199
ICAN-0.198
enders-0.196
inson-0.191
GC-0.189
Token mixed
Token ID7668
Feature activation+3.19e-01

Pos logit contributions
ICAN+0.671
+0.666
enders+0.644
NEY+0.633
GC+0.629
Neg logit contributions
 Demo-0.843
 Chow-0.794
 Rapt-0.768
 Zoro-0.757
 Desire-0.754
Token background
Token ID4469
Feature activation+1.12e+00

Pos logit contributions
 Demo+1.337
 Chow+1.275
 Rapt+1.227
 Zoro+1.224
 Desire+1.205
Neg logit contributions
-0.809
ICAN-0.804
enders-0.793
inson-0.775
GC-0.768
Token better
Token ID1365
Feature activation+4.33e-02

Pos logit contributions
 Demo+4.041
 Chow+3.886
 Zoro+3.695
 Rapt+3.695
 Desire+3.605
Neg logit contributions
-3.555
ICAN-3.474
enders-3.403
inson-3.357
GC-3.323
Token embrace
Token ID12553
Feature activation-1.27e-01

Pos logit contributions
 Demo+0.165
 Chow+0.158
 Rapt+0.152
 Zoro+0.151
 Desire+0.149
Neg logit contributions
-0.127
ICAN-0.125
enders-0.123
inson-0.123
NEY-0.120
Token both
Token ID1111
Feature activation-5.62e-02

Pos logit contributions
+0.391
ICAN+0.387
enders+0.386
inson+0.374
GC+0.372
Neg logit contributions
 Demo-0.456
 Chow-0.430
 Rapt-0.416
 Zoro-0.410
 Desire-0.407
Token cultures
Token ID13817
Feature activation-3.46e-02

Pos logit contributions
+0.168
ICAN+0.166
enders+0.166
inson+0.161
GC+0.160
Neg logit contributions
 Demo-0.211
 Chow-0.200
 Rapt-0.191
 Zoro-0.190
 Desire-0.188
Token and
Token ID290
Feature activation-1.50e-02

Pos logit contributions
+0.100
ICAN+0.098
enders+0.097
inson+0.096
GC+0.094
Neg logit contributions
 Demo-0.134
 Chow-0.129
 Zoro-0.123
 Rapt-0.123
 Desire-0.121
Token MSI
Token ID30781
Feature activation+1.49e-01

Pos logit contributions
 Demo+1.905
 Chow+1.823
 Rapt+1.721
 Zoro+1.719
 Desire+1.675
Neg logit contributions
-2.076
ICAN-2.035
enders-2.000
inson-1.973
GC-1.960
Token R
Token ID371
Feature activation+8.72e-02

Pos logit contributions
 Demo+0.480
 Chow+0.461
 Zoro+0.436
 Rapt+0.428
 Desire+0.421
Neg logit contributions
-0.519
ICAN-0.516
GC-0.505
enders-0.504
NEY-0.495
Token78
Token ID3695
Feature activation+2.80e-01

Pos logit contributions
 Demo+0.264
 Chow+0.252
 Zoro+0.238
 Rapt+0.233
 Desire+0.230
Neg logit contributions
-0.329
ICAN-0.327
enders-0.326
GC-0.318
inson-0.317
Token70
Token ID2154
Feature activation+3.50e-01

Pos logit contributions
 Demo+0.909
 Chow+0.878
 Zoro+0.818
 Rapt+0.805
 Desire+0.801
Neg logit contributions
-0.982
ICAN-0.959
enders-0.957
GC-0.949
inson-0.935
Token Hawk
Token ID26698
Feature activation+6.63e-01

Pos logit contributions
 Demo+1.195
 Chow+1.157
 Zoro+1.092
 Rapt+1.085
 Desire+1.064
Neg logit contributions
-1.172
ICAN-1.153
enders-1.131
GC-1.126
inson-1.113
Token
Token ID198
Feature activation+1.11e+00

Pos logit contributions
 Demo+2.219
 Chow+2.142
 Zoro+2.030
 Rapt+2.005
 Desire+1.969
Neg logit contributions
-2.270
ICAN-2.219
enders-2.194
inson-2.175
GC-2.164
Token
Token ID198
Feature activation+1.12e+00

Pos logit contributions
 Demo+3.609
 Chow+3.446
 Rapt+3.257
 Zoro+3.252
 Desire+3.172
Neg logit contributions
-3.940
ICAN-3.871
enders-3.813
inson-3.775
NEY-3.705
TokenGPU
Token ID33346
Feature activation+5.66e-01

Pos logit contributions
 Demo+4.291
 Chow+4.114
 Zoro+3.933
 Rapt+3.932
 Desire+3.851
Neg logit contributions
-3.294
ICAN-3.240
enders-3.172
inson-3.143
NEY-3.074
Token:
Token ID25
Feature activation+7.63e-01

Pos logit contributions
 Demo+2.329
 Chow+2.269
 Zoro+2.168
 Rapt+2.163
 Desire+2.116
Neg logit contributions
-1.497
ICAN-1.457
enders-1.441
GC-1.409
inson-1.407
Token AMD
Token ID10324
Feature activation+5.46e-01

Pos logit contributions
 Demo+2.594
 Chow+2.491
 Zoro+2.355
 Rapt+2.351
 Desire+2.295
Neg logit contributions
-2.591
ICAN-2.536
enders-2.496
inson-2.459
GC-2.446
Token Radeon
Token ID13082
Feature activation+6.91e-01

Pos logit contributions
 Demo+1.365
 Chow+1.296
 Zoro+1.201
 Rapt+1.178
 Desire+1.140
Neg logit contributions
-2.314
ICAN-2.286
enders-2.280
GC-2.254
inson-2.212
Token law
Token ID1099
Feature activation-1.34e-01

Pos logit contributions
 Demo+1.045
 Chow+1.003
 Zoro+0.967
 Rapt+0.961
 Desire+0.946
Neg logit contributions
-0.655
ICAN-0.642
enders-0.630
inson-0.619
GC-0.616
Token students
Token ID2444
Feature activation-9.48e-02

Pos logit contributions
+0.210
enders+0.206
ICAN+0.205
inson+0.200
NEY+0.193
Neg logit contributions
 Demo-0.690
 Chow-0.678
 Zoro-0.655
 Rapt-0.653
 Desire-0.647
Token,
Token ID11
Feature activation+4.04e-02

Pos logit contributions
+0.258
ICAN+0.254
enders+0.247
inson+0.247
GC+0.245
Neg logit contributions
 Demo-0.378
 Chow-0.364
 Zoro-0.349
 Rapt-0.349
 Desire-0.340
Token despite
Token ID3805
Feature activation+5.78e-01

Pos logit contributions
 Demo+0.156
 Chow+0.150
 Rapt+0.144
 Zoro+0.144
 Desire+0.142
Neg logit contributions
-0.114
enders-0.113
ICAN-0.113
GC-0.110
inson-0.110
Token their
Token ID511
Feature activation+2.63e-01

Pos logit contributions
 Demo+2.087
 Chow+2.005
 Zoro+1.908
 Rapt+1.904
 Desire+1.860
Neg logit contributions
-1.833
ICAN-1.802
enders-1.772
inson-1.745
GC-1.728
Token background
Token ID4469
Feature activation+1.11e+00

Pos logit contributions
 Demo+1.042
 Chow+1.004
 Zoro+0.961
 Rapt+0.959
 Desire+0.940
Neg logit contributions
-0.731
ICAN-0.718
enders-0.709
inson-0.691
GC-0.686
Token,
Token ID11
Feature activation+1.15e-01

Pos logit contributions
 Demo+3.914
 Chow+3.762
 Zoro+3.593
 Rapt+3.588
 Desire+3.480
Neg logit contributions
-3.561
ICAN-3.482
enders-3.411
inson-3.382
GC-3.318
Token will
Token ID481
Feature activation+2.87e-01

Pos logit contributions
 Demo+0.473
 Chow+0.455
 Rapt+0.438
 Zoro+0.438
 Desire+0.428
Neg logit contributions
-0.310
ICAN-0.300
enders-0.294
inson-0.291
GC-0.289
Token have
Token ID423
Feature activation+3.19e-01

Pos logit contributions
 Demo+1.065
 Chow+1.026
 Zoro+0.979
 Rapt+0.978
 Desire+0.948
Neg logit contributions
-0.876
ICAN-0.860
inson-0.848
enders-0.845
NEY-0.832
Token the
Token ID262
Feature activation+8.56e-02

Pos logit contributions
 Demo+1.187
 Chow+1.139
 Zoro+1.090
 Rapt+1.084
 Desire+1.065
Neg logit contributions
-0.976
ICAN-0.951
enders-0.938
inson-0.924
GC-0.918
Token opportunity
Token ID3663
Feature activation+8.70e-02

Pos logit contributions
 Demo+0.367
 Chow+0.355
 Zoro+0.339
 Rapt+0.339
 Desire+0.332
Neg logit contributions
-0.212
ICAN-0.207
enders-0.205
inson-0.198
GC-0.198
Token2
Token ID17
Feature activation+4.27e-02

Pos logit contributions
 Demo+1.600
 Chow+1.523
 Rapt+1.426
 Zoro+1.424
 Desire+1.389
Neg logit contributions
-2.123
ICAN-2.085
enders-2.053
inson-2.032
GC-2.025
Token-
Token ID12
Feature activation+5.72e-01

Pos logit contributions
 Demo+0.142
 Chow+0.136
 Rapt+0.129
 Zoro+0.128
 Desire+0.126
Neg logit contributions
-0.149
ICAN-0.145
enders-0.142
GC-0.141
inson-0.141
Tokenethyl
Token ID21610
Feature activation+6.80e-01

Pos logit contributions
 Demo+1.975
 Chow+1.896
 Rapt+1.799
 Zoro+1.791
 Desire+1.754
Neg logit contributions
-1.926
ICAN-1.884
enders-1.852
inson-1.834
GC-1.821
Token-
Token ID12
Feature activation+2.61e-01

Pos logit contributions
 Demo+2.337
 Chow+2.242
 Zoro+2.128
 Rapt+2.125
 Desire+2.069
Neg logit contributions
-2.283
ICAN-2.236
enders-2.196
inson-2.166
NEY-2.144
TokenN
Token ID45
Feature activation+2.44e-01

Pos logit contributions
 Demo+0.755
 Chow+0.720
 Rapt+0.675
 Zoro+0.671
 Desire+0.658
Neg logit contributions
-1.029
ICAN-1.008
enders-0.992
GC-0.985
inson-0.982
Token-(
Token ID30420
Feature activation+1.09e+00

Pos logit contributions
 Demo+0.765
 Chow+0.734
 Rapt+0.689
 Zoro+0.688
 Desire+0.672
Neg logit contributions
-0.901
ICAN-0.883
enders-0.866
GC-0.862
inson-0.860
Tokent
Token ID83
Feature activation+5.77e-01

Pos logit contributions
 Demo+3.137
 Chow+2.979
 Zoro+2.806
 Rapt+2.801
 Desire+2.712
Neg logit contributions
-4.244
ICAN-4.172
enders-4.100
inson-4.052
NEY-4.016
Tokenet
Token ID316
Feature activation+7.55e-01

Pos logit contributions
 Demo+2.362
 Chow+2.283
 Rapt+2.188
 Zoro+2.181
 Desire+2.158
Neg logit contributions
-1.571
ICAN-1.528
enders-1.524
inson-1.511
GC-1.478
Tokenrah
Token ID11392
Feature activation+1.25e-01

Pos logit contributions
 Demo+2.929
 Chow+2.822
 Zoro+2.695
 Rapt+2.677
 Desire+2.640
Neg logit contributions
-2.200
ICAN-2.159
enders-2.136
inson-2.105
GC-2.080
Tokenyd
Token ID5173
Feature activation+4.12e-01

Pos logit contributions
 Demo+0.252
 Chow+0.239
 Rapt+0.218
 Zoro+0.217
 Desire+0.213
Neg logit contributions
-0.593
inson-0.587
ICAN-0.587
enders-0.581
GC-0.576
Tokenro
Token ID305
Feature activation+6.03e-01

Pos logit contributions
 Demo+1.387
 Chow+1.330
 Rapt+1.255
 Zoro+1.253
 Desire+1.231
Neg logit contributions
-1.421
ICAN-1.396
enders-1.377
inson-1.360
GC-1.348
Token checkout
Token ID28006
Feature activation-6.62e-02

Pos logit contributions
 Demo+0.437
 Chow+0.425
 Zoro+0.405
 Rapt+0.405
 Desire+0.397
Neg logit contributions
-0.347
ICAN-0.339
enders-0.334
GC-0.327
inson-0.323
Token can
Token ID460
Feature activation-1.05e-01

Pos logit contributions
+0.148
ICAN+0.141
enders+0.140
inson+0.140
GC+0.138
Neg logit contributions
 Demo-0.297
 Chow-0.291
 Zoro-0.279
 Rapt-0.279
 Desire-0.275
Token get
Token ID651
Feature activation-1.19e-01

Pos logit contributions
+0.298
ICAN+0.294
inson+0.288
GC+0.285
enders+0.285
Neg logit contributions
 Demo-0.404
 Chow-0.396
 Rapt-0.376
 Zoro-0.375
 Desire-0.366
Token 20
Token ID1160
Feature activation+3.51e-02

Pos logit contributions
+0.376
ICAN+0.369
enders+0.357
GC+0.354
inson+0.350
Neg logit contributions
 Demo-0.422
 Chow-0.409
 Zoro-0.389
 Rapt-0.388
 Desire-0.380
Token%
Token ID4
Feature activation+2.17e-01

Pos logit contributions
 Demo+0.120
 Chow+0.115
 Rapt+0.109
 Zoro+0.109
 Desire+0.107
Neg logit contributions
-0.118
ICAN-0.115
enders-0.113
GC-0.112
inson-0.111
Token off
Token ID572
Feature activation+1.09e+00

Pos logit contributions
 Demo+0.752
 Chow+0.723
 Zoro+0.688
 Rapt+0.687
 Desire+0.671
Neg logit contributions
-0.720
ICAN-0.708
enders-0.691
GC-0.680
inson-0.680
Token all
Token ID477
Feature activation+4.36e-01

Pos logit contributions
 Demo+3.693
 Chow+3.539
 Rapt+3.356
 Zoro+3.350
 Desire+3.270
Neg logit contributions
-3.700
ICAN-3.625
enders-3.554
inson-3.517
GC-3.468
Token merchandise
Token ID20813
Feature activation-3.25e-01

Pos logit contributions
 Demo+1.581
 Chow+1.516
 Rapt+1.445
 Zoro+1.445
 Desire+1.410
Neg logit contributions
-1.381
ICAN-1.345
enders-1.333
inson-1.304
GC-1.294
Token in
Token ID287
Feature activation-6.89e-02

Pos logit contributions
+0.922
enders+0.889
ICAN+0.889
GC+0.877
inson+0.870
Neg logit contributions
 Demo-1.247
 Chow-1.219
 Zoro-1.153
 Rapt-1.151
 Desire-1.129
Token our
Token ID674
Feature activation-2.65e-01

Pos logit contributions
+0.217
ICAN+0.208
enders+0.206
GC+0.204
inson+0.200
Neg logit contributions
 Demo-0.249
 Chow-0.239
 Zoro-0.229
 Rapt-0.228
 Desire-0.223
Token store
Token ID3650
Feature activation-1.14e-01

Pos logit contributions
+0.768
ICAN+0.744
enders+0.735
GC+0.721
NEY+0.710
Neg logit contributions
 Demo-1.004
 Chow-0.971
 Zoro-0.927
 Rapt-0.927
 Desire-0.909
Token-
Token ID12
Feature activation-3.26e-01

Pos logit contributions
+0.521
ICAN+0.513
enders+0.504
inson+0.498
GC+0.493
Neg logit contributions
 Demo-0.413
 Chow-0.394
 Zoro-0.370
 Rapt-0.370
 Desire-0.359
Tokenanalysis
Token ID20930
Feature activation-3.54e-01

Pos logit contributions
+0.978
ICAN+0.936
GC+0.932
inson+0.924
enders+0.921
Neg logit contributions
 Demo-1.248
 Chow-1.206
 Rapt-1.153
 Zoro-1.141
 Desire-1.121
Token of
Token ID286
Feature activation-4.26e-01

Pos logit contributions
+0.943
ICAN+0.917
enders+0.889
inson+0.884
GC+0.865
Neg logit contributions
 Demo-1.460
 Chow-1.414
 Zoro-1.348
 Rapt-1.348
 Desire-1.322
Token intervention
Token ID9572
Feature activation-5.85e-02

Pos logit contributions
+1.219
ICAN+1.209
enders+1.202
inson+1.190
GC+1.177
Neg logit contributions
 Demo-1.606
 Chow-1.530
 Rapt-1.474
 Zoro-1.468
 Desire-1.442
Token research
Token ID2267
Feature activation-3.12e-01

Pos logit contributions
+0.176
ICAN+0.174
enders+0.172
inson+0.171
GC+0.167
Neg logit contributions
 Demo-0.217
 Chow-0.210
 Rapt-0.201
 Zoro-0.199
 Desire-0.196
Token.
Token ID764
Feature activation-1.53e+00

Pos logit contributions
+0.788
ICAN+0.766
enders+0.749
inson+0.741
GC+0.736
Neg logit contributions
 Demo-1.327
 Chow-1.292
 Rapt-1.230
 Zoro-1.224
 Desire-1.201
Token School
Token ID3961
Feature activation-7.54e-01

Pos logit contributions
ICAN+4.351
+4.216
GC+4.190
enders+4.146
NEY+4.134
Neg logit contributions
 Demo-5.352
 Chow-5.091
 Zoro-4.943
 Rapt-4.872
 Desire-4.812
Token Psychology
Token ID18430
Feature activation-2.02e-01

Pos logit contributions
+2.501
ICAN+2.492
inson+2.474
enders+2.462
NEY+2.408
Neg logit contributions
 Demo-2.487
 Chow-2.341
 Zoro-2.290
 Rapt-2.265
 Desire-2.178
Token Quarterly
Token ID36082
Feature activation-8.62e-01

Pos logit contributions
+0.593
ICAN+0.589
enders+0.581
inson+0.572
GC+0.559
Neg logit contributions
 Demo-0.764
 Chow-0.741
 Zoro-0.707
 Rapt-0.704
 Desire-0.690
Token,
Token ID11
Feature activation-2.47e-01

Pos logit contributions
+2.394
ICAN+2.347
enders+2.307
inson+2.253
NEY+2.221
Neg logit contributions
 Demo-3.354
 Chow-3.280
 Zoro-3.123
 Rapt-3.120
 Desire-3.070
Token 23
Token ID2242
Feature activation-2.81e-01

Pos logit contributions
+0.798
ICAN+0.774
enders+0.752
GC+0.750
inson+0.744
Neg logit contributions
 Demo-0.853
 Chow-0.828
 Rapt-0.785
 Zoro-0.780
 Desire-0.765
TokenCase
Token ID20448
Feature activation-4.61e-02

Pos logit contributions
 Demo+0.233
 Chow+0.227
 Zoro+0.218
 Rapt+0.218
 Desire+0.213
Neg logit contributions
-0.140
ICAN-0.135
enders-0.131
GC-0.131
inson-0.129
Token in
Token ID287
Feature activation-1.44e-01

Pos logit contributions
+0.134
ICAN+0.132
enders+0.131
inson+0.129
NEY+0.127
Neg logit contributions
 Demo-0.176
 Chow-0.169
 Zoro-0.162
 Rapt-0.161
 Desire-0.158
Token point
Token ID966
Feature activation+4.09e-02

Pos logit contributions
+0.432
inson+0.427
enders+0.416
ICAN+0.414
GC+0.402
Neg logit contributions
 Demo-0.523
 Chow-0.494
 Zoro-0.481
 Rapt-0.469
 Desire-0.461
Token:
Token ID25
Feature activation-2.29e-01

Pos logit contributions
 Demo+0.160
 Chow+0.155
 Rapt+0.146
 Zoro+0.146
 Desire+0.144
Neg logit contributions
-0.120
ICAN-0.118
enders-0.114
NEY-0.111
inson-0.111
Token This
Token ID770
Feature activation-2.90e-01

Pos logit contributions
+0.715
enders+0.711
ICAN+0.710
inson+0.701
GC+0.694
Neg logit contributions
 Demo-0.786
 Chow-0.755
 Zoro-0.726
 Rapt-0.725
 Desire-0.698
Token #
Token ID1303
Feature activation-1.47e+00

Pos logit contributions
+0.766
ICAN+0.756
enders+0.739
inson+0.732
GC+0.730
Neg logit contributions
 Demo-1.169
 Chow-1.131
 Zoro-1.089
 Rapt-1.082
 Desire-1.058
TokenJ
Token ID41
Feature activation-3.41e-01

Pos logit contributions
+4.357
enders+4.188
GC+4.185
ICAN+4.180
inson+4.087
Neg logit contributions
 Demo-5.398
 Chow-5.316
 Zoro-5.089
 Rapt-5.030
 Desire-4.895
Tokeneb
Token ID1765
Feature activation-2.04e-01

Pos logit contributions
+1.013
ICAN+0.992
enders+0.989
GC+0.969
inson+0.967
Neg logit contributions
 Demo-1.300
 Chow-1.256
 Zoro-1.206
 Rapt-1.205
 Desire-1.179
TokenNo
Token ID2949
Feature activation-1.96e-01

Pos logit contributions
+0.696
ICAN+0.688
enders+0.672
inson+0.667
GC+0.664
Neg logit contributions
 Demo-0.684
 Chow-0.656
 Zoro-0.626
 Rapt-0.622
 Desire-0.609
TokenFilter
Token ID22417
Feature activation+6.56e-02

Pos logit contributions
+0.544
ICAN+0.533
enders+0.520
inson+0.512
GC+0.508
Neg logit contributions
 Demo-0.785
 Chow-0.758
 Zoro-0.726
 Rapt-0.725
 Desire-0.710
Token video
Token ID2008
Feature activation+2.42e-01

Pos logit contributions
 Demo+0.271
 Chow+0.263
 Zoro+0.252
 Rapt+0.252
 Desire+0.246
Neg logit contributions
-0.174
ICAN-0.169
enders-0.166
inson-0.163
GC-0.162
Token the
Token ID262
Feature activation-1.55e-01

Pos logit contributions
 Demo+0.187
 Chow+0.178
 Zoro+0.170
 Rapt+0.169
 Desire+0.166
Neg logit contributions
-0.178
enders-0.174
ICAN-0.174
inson-0.171
GC-0.171
Token last
Token ID938
Feature activation-1.37e-01

Pos logit contributions
+0.311
ICAN+0.303
enders+0.302
inson+0.289
GC+0.285
Neg logit contributions
 Demo-0.728
 Chow-0.701
 Zoro-0.679
 Rapt-0.678
 Desire-0.669
Token two
Token ID734
Feature activation-1.17e-01

Pos logit contributions
+0.437
enders+0.427
ICAN+0.426
inson+0.419
GC+0.415
Neg logit contributions
 Demo-0.479
 Chow-0.454
 Zoro-0.436
 Rapt-0.435
 Desire-0.428
Token days
Token ID1528
Feature activation-2.78e-01

Pos logit contributions
+0.343
enders+0.329
ICAN+0.329
inson+0.320
GC+0.319
Neg logit contributions
 Demo-0.449
 Chow-0.430
 Zoro-0.414
 Rapt-0.412
 Desire-0.412
Token,
Token ID11
Feature activation+5.40e-02

Pos logit contributions
+0.810
ICAN+0.792
enders+0.784
inson+0.767
GC+0.759
Neg logit contributions
 Demo-1.069
 Chow-1.017
 Rapt-0.980
 Zoro-0.980
 Desire-0.964
Token #
Token ID1303
Feature activation-1.44e+00

Pos logit contributions
 Demo+0.196
 Chow+0.186
 Rapt+0.179
 Zoro+0.178
 Desire+0.176
Neg logit contributions
-0.168
ICAN-0.164
enders-0.164
GC-0.160
inson-0.160
TokenC
Token ID34
Feature activation-4.02e-01

Pos logit contributions
+4.373
ICAN+4.198
GC+4.167
enders+4.154
NEY+3.996
Neg logit contributions
 Demo-5.362
 Chow-5.097
 Rapt-4.879
 Zoro-4.839
 Desire-4.791
Tokenhen
Token ID831
Feature activation-5.43e-01

Pos logit contributions
+1.193
enders+1.154
ICAN+1.150
GC+1.149
inson+1.147
Neg logit contributions
 Demo-1.534
 Chow-1.476
 Rapt-1.423
 Zoro-1.411
 Desire-1.394
Tokennai
Token ID38600
Feature activation-1.01e-01

Pos logit contributions
+1.397
ICAN+1.361
enders+1.350
GC+1.314
inson+1.311
Neg logit contributions
 Demo-2.292
 Chow-2.171
 Rapt-2.110
 Zoro-2.105
 Desire-2.082
TokenR
Token ID49
Feature activation-3.57e-01

Pos logit contributions
+0.291
ICAN+0.284
enders+0.279
GC+0.273
inson+0.272
Neg logit contributions
 Demo-0.398
 Chow-0.383
 Zoro-0.367
 Rapt-0.367
 Desire-0.360
Tokenains
Token ID1299
Feature activation+1.67e-01

Pos logit contributions
+0.995
enders+0.977
ICAN+0.974
inson+0.953
GC+0.946
Neg logit contributions
 Demo-1.429
 Chow-1.368
 Rapt-1.323
 Zoro-1.323
 Desire-1.294
Token child
Token ID1200
Feature activation-1.49e-01

Pos logit contributions
 Demo+0.357
 Chow+0.345
 Rapt+0.330
 Zoro+0.329
 Desire+0.321
Neg logit contributions
-0.263
ICAN-0.257
enders-0.252
inson-0.249
GC-0.247
Token to
Token ID284
Feature activation+9.14e-02

Pos logit contributions
+0.436
ICAN+0.427
enders+0.418
inson+0.414
GC+0.412
Neg logit contributions
 Demo-0.567
 Chow-0.545
 Rapt-0.522
 Zoro-0.521
 Desire-0.512
Token a
Token ID257
Feature activation-1.45e-01

Pos logit contributions
 Demo+0.356
 Chow+0.342
 Zoro+0.329
 Rapt+0.326
 Desire+0.321
Neg logit contributions
-0.263
ICAN-0.259
enders-0.254
inson-0.249
GC-0.248
Token grave
Token ID12296
Feature activation+4.16e-01

Pos logit contributions
+0.408
ICAN+0.405
enders+0.392
GC+0.387
inson+0.384
Neg logit contributions
 Demo-0.565
 Chow-0.545
 Zoro-0.521
 Rapt-0.517
 Desire-0.511
Token in
Token ID287
Feature activation+3.84e-02

Pos logit contributions
 Demo+1.589
 Chow+1.529
 Rapt+1.459
 Zoro+1.459
 Desire+1.428
Neg logit contributions
-1.239
ICAN-1.210
enders-1.186
inson-1.167
GC-1.157
Token #
Token ID1303
Feature activation-1.42e+00

Pos logit contributions
 Demo+0.140
 Chow+0.134
 Zoro+0.129
 Rapt+0.128
 Desire+0.125
Neg logit contributions
-0.119
ICAN-0.117
enders-0.115
GC-0.113
inson-0.113
TokenGaza
Token ID48790
Feature activation-7.04e-01

Pos logit contributions
+4.052
GC+3.926
ICAN+3.853
enders+3.812
inson+3.690
Neg logit contributions
 Demo-5.593
 Chow-5.476
 Zoro-5.179
 Rapt-5.157
 Desire-5.111
Token,
Token ID11
Feature activation-2.09e-01

Pos logit contributions
+2.001
ICAN+1.973
enders+1.911
GC+1.910
inson+1.882
Neg logit contributions
 Demo-2.696
 Chow-2.594
 Zoro-2.503
 Rapt-2.500
 Desire-2.453
Token
Token ID447
Feature activation-2.95e-01

Pos logit contributions
+0.606
ICAN+0.594
enders+0.582
GC+0.578
inson+0.574
Neg logit contributions
 Demo-0.801
 Chow-0.767
 Rapt-0.734
 Zoro-0.733
 Desire-0.720
Token
Token ID251
Feature activation-1.26e-01

Pos logit contributions
+0.816
ICAN+0.804
inson+0.773
enders+0.771
NEY+0.770
Neg logit contributions
 Demo-1.180
 Chow-1.119
 Zoro-1.086
 Rapt-1.078
 Desire-1.070
Token Bad
Token ID7772
Feature activation+8.73e-02

Pos logit contributions
+0.369
ICAN+0.359
enders+0.350
GC+0.348
inson+0.346
Neg logit contributions
 Demo-0.487
 Chow-0.467
 Zoro-0.447
 Rapt-0.446
 Desire-0.438
Token des
Token ID748
Feature activation+1.93e-02

Pos logit contributions
+0.666
enders+0.646
ICAN+0.641
GC+0.630
inson+0.623
Neg logit contributions
 Demo-0.639
 Chow-0.622
 Zoro-0.589
 Rapt-0.586
 Desire-0.573
Token inc
Token ID753
Feature activation-1.22e-01

Pos logit contributions
 Demo+0.073
 Chow+0.072
 Rapt+0.068
 Zoro+0.068
 Desire+0.066
Neg logit contributions
-0.057
enders-0.055
ICAN-0.055
inson-0.053
GC-0.053
Tokenend
Token ID437
Feature activation-1.25e-01

Pos logit contributions
enders+0.384
+0.378
inson+0.375
ICAN+0.361
GC+0.358
Neg logit contributions
 Demo-0.451
 Chow-0.434
 Rapt-0.412
 Zoro-0.410
 Desire-0.403
Tokenies
Token ID444
Feature activation-4.54e-02

Pos logit contributions
enders+0.375
+0.374
inson+0.367
ICAN+0.366
GC+0.354
Neg logit contributions
 Demo-0.476
 Chow-0.462
 Zoro-0.435
 Rapt-0.434
 Desire-0.426
Token sur
Token ID969
Feature activation-2.46e-01

Pos logit contributions
+0.131
ICAN+0.128
enders+0.125
inson+0.123
GC+0.122
Neg logit contributions
 Demo-0.176
 Chow-0.171
 Rapt-0.162
 Zoro-0.162
 Desire-0.159
Token #
Token ID1303
Feature activation-1.39e+00

Pos logit contributions
+0.771
enders+0.727
ICAN+0.722
inson+0.714
GC+0.708
Neg logit contributions
 Demo-0.897
 Chow-0.876
 Zoro-0.817
 Rapt-0.816
 Desire-0.802
TokenB
Token ID33
Feature activation-1.94e-01

Pos logit contributions
+4.075
ICAN+3.839
GC+3.835
enders+3.758
inson+3.715
Neg logit contributions
 Demo-5.311
 Chow-5.188
 Zoro-4.878
 Rapt-4.849
 Desire-4.758
Tokenorm
Token ID579
Feature activation+1.85e-01

Pos logit contributions
+0.635
ICAN+0.622
enders+0.616
inson+0.605
GC+0.604
Neg logit contributions
 Demo-0.692
 Chow-0.666
 Zoro-0.631
 Rapt-0.630
 Desire-0.613
Tokenes
Token ID274
Feature activation+1.92e-01

Pos logit contributions
 Demo+0.655
 Chow+0.630
 Zoro+0.599
 Rapt+0.598
 Desire+0.586
Neg logit contributions
-0.601
ICAN-0.588
enders-0.580
inson-0.572
GC-0.564
Tokenles
Token ID829
Feature activation+2.05e-01

Pos logit contributions
 Demo+0.751
 Chow+0.726
 Zoro+0.692
 Rapt+0.692
 Desire+0.677
Neg logit contributions
-0.556
ICAN-0.542
enders-0.532
inson-0.523
GC-0.518
Tokenm
Token ID76
Feature activation-1.22e-01

Pos logit contributions
 Demo+0.801
 Chow+0.775
 Zoro+0.739
 Rapt+0.739
 Desire+0.723
Neg logit contributions
-0.593
ICAN-0.577
enders-0.566
inson-0.558
GC-0.552
Token on
Token ID319
Feature activation-8.18e-02

Pos logit contributions
 Demo+0.161
 Chow+0.154
 Zoro+0.149
 Rapt+0.147
 Desire+0.145
Neg logit contributions
-0.118
ICAN-0.113
enders-0.113
GC-0.112
inson-0.112
Token Twitter
Token ID3009
Feature activation-9.73e-02

Pos logit contributions
+0.192
inson+0.187
ICAN+0.185
enders+0.184
GC+0.183
Neg logit contributions
 Demo-0.355
 Chow-0.342
 Zoro-0.331
 Rapt-0.329
 Desire-0.322
Token.
Token ID13
Feature activation+5.76e-02

Pos logit contributions
+0.283
ICAN+0.271
enders+0.267
inson+0.266
GC+0.264
Neg logit contributions
 Demo-0.379
 Chow-0.364
 Zoro-0.350
 Rapt-0.348
 Desire-0.341
Token<|endoftext|>
Token ID50256
Feature activation+1.49e-01

Pos logit contributions
 Demo+0.200
 Chow+0.192
 Zoro+0.184
 Rapt+0.182
 Desire+0.178
Neg logit contributions
-0.187
ICAN-0.182
enders-0.179
GC-0.178
NEY-0.178
TokenMedia
Token ID13152
Feature activation+2.15e-01

Pos logit contributions
 Demo+0.613
 Chow+0.594
 Rapt+0.567
 Zoro+0.566
 Desire+0.556
Neg logit contributions
-0.413
ICAN-0.399
enders-0.389
GC-0.383
inson-0.381
Token playback
Token ID16388
Feature activation-1.37e+00

Pos logit contributions
 Demo+0.857
 Chow+0.830
 Zoro+0.796
 Rapt+0.792
 Desire+0.772
Neg logit contributions
-0.601
ICAN-0.581
enders-0.578
GC-0.566
inson-0.566
Token is
Token ID318
Feature activation-1.45e-01

Pos logit contributions
+2.845
GC+2.832
ICAN+2.800
inson+2.789
enders+2.733
Neg logit contributions
 Demo-5.942
 Chow-5.882
 Rapt-5.660
 Zoro-5.637
 Desire-5.376
Token unsupported
Token ID24222
Feature activation-2.71e-01

Pos logit contributions
+0.342
enders+0.332
ICAN+0.332
NEY+0.322
GC+0.322
Neg logit contributions
 Demo-0.629
 Chow-0.610
 Rapt-0.587
 Zoro-0.585
 Desire-0.574
Token on
Token ID319
Feature activation+6.29e-02

Pos logit contributions
+0.737
ICAN+0.720
GC+0.705
enders+0.703
NEY+0.687
Neg logit contributions
 Demo-1.081
 Chow-1.046
 Zoro-1.012
 Rapt-1.009
 Desire-0.984
Token your
Token ID534
Feature activation+3.94e-02

Pos logit contributions
 Demo+0.260
 Chow+0.252
 Zoro+0.242
 Rapt+0.241
 Desire+0.236
Neg logit contributions
-0.165
ICAN-0.160
enders-0.158
inson-0.157
GC-0.155
Token device
Token ID3335
Feature activation-6.56e-01

Pos logit contributions
 Demo+0.153
 Chow+0.149
 Rapt+0.143
 Zoro+0.143
 Desire+0.139
Neg logit contributions
-0.112
ICAN-0.108
enders-0.107
inson-0.105
GC-0.104
Token Ho
Token ID9544
Feature activation-4.25e-01

Pos logit contributions
+0.711
ICAN+0.709
enders+0.706
inson+0.706
GC+0.677
Neg logit contributions
 Demo-0.813
 Chow-0.770
 Zoro-0.738
 Rapt-0.732
 Desire-0.727
Tokeniber
Token ID1856
Feature activation-1.80e-01

Pos logit contributions
enders+0.923
+0.911
ICAN+0.890
inson+0.877
GC+0.867
Neg logit contributions
 Demo-1.930
 Chow-1.881
 Rapt-1.817
 Zoro-1.811
 Desire-1.792
Tokeng
Token ID70
Feature activation+1.09e-01

Pos logit contributions
+0.648
ICAN+0.641
GC+0.635
enders+0.629
inson+0.627
Neg logit contributions
 Demo-0.555
 Chow-0.537
 Zoro-0.522
 Desire-0.507
 Rapt-0.503
Token said
Token ID531
Feature activation-2.59e-01

Pos logit contributions
 Demo+0.472
 Chow+0.455
 Zoro+0.437
 Rapt+0.436
 Desire+0.432
Neg logit contributions
-0.263
ICAN-0.259
inson-0.256
enders-0.256
GC-0.249
Token.
Token ID13
Feature activation+2.14e-02

Pos logit contributions
+0.753
ICAN+0.743
inson+0.729
enders+0.723
GC+0.714
Neg logit contributions
 Demo-0.984
 Chow-0.944
 Zoro-0.903
 Rapt-0.898
 Desire-0.884
Token #
Token ID1303
Feature activation-1.37e+00

Pos logit contributions
 Demo+0.074
 Chow+0.071
 Zoro+0.068
 Rapt+0.068
 Desire+0.066
Neg logit contributions
-0.069
ICAN-0.069
enders-0.067
inson-0.067
NEY-0.066
TokenBull
Token ID39549
Feature activation-2.65e-01

Pos logit contributions
+4.076
enders+3.963
GC+3.948
ICAN+3.945
inson+3.835
Neg logit contributions
 Demo-5.117
 Chow-4.944
 Zoro-4.617
 Rapt-4.565
 Desire-4.554
Tokens
Token ID82
Feature activation-1.49e-01

Pos logit contributions
+0.832
ICAN+0.807
enders+0.804
inson+0.792
GC+0.783
Neg logit contributions
 Demo-0.972
 Chow-0.931
 Zoro-0.893
 Rapt-0.879
 Desire-0.868
Token
Token ID851
Feature activation+2.51e-02

Pos logit contributions
+0.445
ICAN+0.436
enders+0.429
GC+0.424
inson+0.423
Neg logit contributions
 Demo-0.561
 Chow-0.542
 Zoro-0.519
 Rapt-0.511
 Desire-0.506
Token Cody
Token ID27035
Feature activation-3.30e-01

Pos logit contributions
 Demo+0.078
 Chow+0.074
 Zoro+0.070
 Rapt+0.069
 Desire+0.067
Neg logit contributions
-0.091
ICAN-0.090
enders-0.088
GC-0.087
inson-0.087
Token Wester
Token ID32104
Feature activation+9.32e-02

Pos logit contributions
+1.219
enders+1.210
ICAN+1.201
inson+1.190
NEY+1.177
Neg logit contributions
 Demo-0.980
 Chow-0.930
 Zoro-0.884
 Rapt-0.869
 Desire-0.856
Token )
Token ID1267
Feature activation-4.81e-01

Pos logit contributions
+1.522
enders+1.458
ICAN+1.456
GC+1.430
inson+1.429
Neg logit contributions
 Demo-1.865
 Chow-1.793
 Rapt-1.721
 Zoro-1.714
 Desire-1.677
Token ;
Token ID2162
Feature activation-5.89e-01

Pos logit contributions
+1.418
enders+1.358
ICAN+1.354
inson+1.320
GC+1.310
Neg logit contributions
 Demo-1.819
 Chow-1.776
 Rapt-1.696
 Zoro-1.686
 Desire-1.661
Token return
Token ID1441
Feature activation-5.43e-01

Pos logit contributions
+1.723
enders+1.654
ICAN+1.652
GC+1.622
inson+1.611
Neg logit contributions
 Demo-2.201
 Chow-2.155
 Rapt-2.051
 Zoro-2.050
 Desire-2.018
Token normal
Token ID3487
Feature activation-3.40e-01

Pos logit contributions
+1.636
ICAN+1.583
enders+1.570
GC+1.539
inson+1.534
Neg logit contributions
 Demo-1.967
 Chow-1.928
 Rapt-1.824
 Zoro-1.813
 Desire-1.791
Token ;
Token ID2162
Feature activation-6.70e-01

Pos logit contributions
+0.934
ICAN+0.904
enders+0.896
GC+0.877
inson+0.871
Neg logit contributions
 Demo-1.363
 Chow-1.322
 Zoro-1.269
 Rapt-1.257
 Desire-1.245
Token #
Token ID1303
Feature activation-1.35e+00

Pos logit contributions
+1.444
enders+1.355
ICAN+1.344
GC+1.304
inson+1.286
Neg logit contributions
 Demo-3.045
 Chow-2.988
 Zoro-2.876
 Rapt-2.873
 Desire-2.846
Tokenendif
Token ID32088
Feature activation-3.32e-01

Pos logit contributions
+3.768
enders+3.724
GC+3.570
inson+3.527
ICAN+3.453
Neg logit contributions
 Demo-5.185
 Chow-5.116
 Rapt-4.849
 Zoro-4.807
 Desire-4.728
Token }
Token ID1782
Feature activation-9.54e-02

Pos logit contributions
+0.869
ICAN+0.840
enders+0.830
GC+0.811
inson+0.804
Neg logit contributions
 Demo-1.366
 Chow-1.332
 Rapt-1.272
 Zoro-1.268
 Desire-1.246
Token If
Token ID1002
Feature activation+2.03e-01

Pos logit contributions
+0.252
ICAN+0.243
enders+0.238
inson+0.233
GC+0.232
Neg logit contributions
 Demo-0.394
 Chow-0.383
 Rapt-0.366
 Zoro-0.366
 Desire-0.361
Token you
Token ID345
Feature activation+7.74e-02

Pos logit contributions
 Demo+0.727
 Chow+0.702
 Zoro+0.667
 Rapt+0.666
 Desire+0.651
Neg logit contributions
-0.652
ICAN-0.636
enders-0.627
inson-0.616
GC-0.613
Token are
Token ID389
Feature activation-1.98e-02

Pos logit contributions
 Demo+0.290
 Chow+0.281
 Zoro+0.269
 Rapt+0.268
 Desire+0.260
Neg logit contributions
-0.228
enders-0.222
ICAN-0.220
inson-0.218
GC-0.216
Token surface
Token ID4417
Feature activation+1.22e-01

Pos logit contributions
 Demo+0.110
 Chow+0.107
 Zoro+0.103
 Desire+0.101
 Rapt+0.100
Neg logit contributions
-0.074
enders-0.071
ICAN-0.070
GC-0.069
inson-0.069
Token.
Token ID13
Feature activation-6.51e-02

Pos logit contributions
 Demo+0.465
 Chow+0.449
 Rapt+0.423
 Zoro+0.423
 Desire+0.419
Neg logit contributions
-0.361
ICAN-0.350
enders-0.348
inson-0.344
GC-0.339
Token
Token ID198
Feature activation-3.52e-02

Pos logit contributions
+0.217
ICAN+0.212
enders+0.209
inson+0.207
GC+0.205
Neg logit contributions
 Demo-0.222
 Chow-0.215
 Zoro-0.203
 Rapt-0.202
 Desire-0.198
Token
Token ID198
Feature activation-4.24e-02

Pos logit contributions
+0.110
ICAN+0.107
enders+0.105
GC+0.105
inson+0.104
Neg logit contributions
 Demo-0.128
 Chow-0.124
 Zoro-0.118
 Rapt-0.118
 Desire-0.115
TokenMedia
Token ID13152
Feature activation-3.56e-01

Pos logit contributions
+0.115
ICAN+0.110
enders+0.107
GC+0.107
inson+0.105
Neg logit contributions
 Demo-0.177
 Chow-0.172
 Zoro-0.164
 Rapt-0.164
 Desire-0.161
Token playback
Token ID16388
Feature activation-1.35e+00

Pos logit contributions
+0.916
enders+0.877
ICAN+0.871
GC+0.870
inson+0.869
Neg logit contributions
 Demo-1.460
 Chow-1.429
 Zoro-1.385
 Rapt-1.361
 Desire-1.327
Token is
Token ID318
Feature activation-7.84e-02

Pos logit contributions
+2.854
GC+2.841
inson+2.754
ICAN+2.720
enders+2.686
Neg logit contributions
 Demo-5.810
 Chow-5.788
 Rapt-5.557
 Zoro-5.554
 Desire-5.315
Token unsupported
Token ID24222
Feature activation-2.58e-01

Pos logit contributions
+0.184
ICAN+0.176
enders+0.175
NEY+0.172
GC+0.171
Neg logit contributions
 Demo-0.345
 Chow-0.337
 Zoro-0.323
 Rapt-0.322
 Desire-0.317
Token on
Token ID319
Feature activation-6.84e-03

Pos logit contributions
+0.714
ICAN+0.692
GC+0.678
enders+0.674
NEY+0.660
Neg logit contributions
 Demo-1.029
 Chow-0.995
 Zoro-0.962
 Rapt-0.959
 Desire-0.937
Token your
Token ID534
Feature activation-1.42e-01

Pos logit contributions
+0.018
ICAN+0.018
enders+0.017
inson+0.017
GC+0.017
Neg logit contributions
 Demo-0.028
 Chow-0.027
 Zoro-0.026
 Rapt-0.026
 Desire-0.026
Token device
Token ID3335
Feature activation-7.34e-01

Pos logit contributions
+0.403
ICAN+0.387
enders+0.382
inson+0.378
GC+0.373
Neg logit contributions
 Demo-0.553
 Chow-0.539
 Rapt-0.517
 Zoro-0.516
 Desire-0.501
Tokenher
Token ID372
Feature activation-1.37e-01

Pos logit contributions
 Demo+0.366
 Chow+0.349
 Zoro+0.338
 Rapt+0.331
 Desire+0.324
Neg logit contributions
-0.392
ICAN-0.383
enders-0.381
inson-0.376
NEY-0.369
Token.
Token ID13
Feature activation-2.25e-01

Pos logit contributions
+0.420
ICAN+0.414
enders+0.405
inson+0.396
GC+0.394
Neg logit contributions
 Demo-0.506
 Chow-0.489
 Zoro-0.469
 Rapt-0.465
 Desire-0.457
Token
Token ID198
Feature activation+1.56e-01

Pos logit contributions
+0.701
ICAN+0.690
enders+0.674
GC+0.667
NEY+0.664
Neg logit contributions
 Demo-0.789
 Chow-0.766
 Zoro-0.735
 Rapt-0.725
 Desire-0.708
Token
Token ID198
Feature activation+1.18e-01

Pos logit contributions
 Demo+0.555
 Chow+0.536
 Zoro+0.509
 Rapt+0.508
 Desire+0.498
Neg logit contributions
-0.505
ICAN-0.492
enders-0.482
GC-0.477
inson-0.474
TokenNote
Token ID6425
Feature activation-3.06e-01

Pos logit contributions
 Demo+0.478
 Chow+0.464
 Zoro+0.445
 Rapt+0.443
 Desire+0.434
Neg logit contributions
-0.331
ICAN-0.321
enders-0.312
GC-0.309
inson-0.306
Token #
Token ID1303
Feature activation-1.34e+00

Pos logit contributions
+0.792
ICAN+0.759
enders+0.757
inson+0.741
GC+0.733
Neg logit contributions
 Demo-1.268
 Chow-1.238
 Zoro-1.193
 Rapt-1.182
 Desire-1.164
Token1
Token ID16
Feature activation-4.61e-01

Pos logit contributions
+4.546
enders+4.502
ICAN+4.403
GC+4.385
inson+4.295
Neg logit contributions
 Demo-4.130
 Chow-4.107
 Zoro-3.886
 Rapt-3.860
 Desire-3.677
Token:
Token ID25
Feature activation-1.50e-01

Pos logit contributions
+1.176
ICAN+1.133
enders+1.125
inson+1.091
GC+1.083
Neg logit contributions
 Demo-1.910
 Chow-1.867
 Zoro-1.794
 Rapt-1.781
 Desire-1.750
Token To
Token ID1675
Feature activation-2.15e-01

Pos logit contributions
+0.458
ICAN+0.447
enders+0.446
inson+0.438
NEY+0.435
Neg logit contributions
 Demo-0.538
 Chow-0.521
 Zoro-0.500
 Rapt-0.496
 Desire-0.480
Token prove
Token ID5879
Feature activation+1.21e-01

Pos logit contributions
+0.637
ICAN+0.623
enders+0.611
NEY+0.611
inson+0.606
Neg logit contributions
 Demo-0.799
 Chow-0.775
 Zoro-0.744
 Rapt-0.734
 Desire-0.723
Token just
Token ID655
Feature activation+1.31e-01

Pos logit contributions
 Demo+0.428
 Chow+0.410
 Zoro+0.394
 Rapt+0.389
 Desire+0.383
Neg logit contributions
-0.388
ICAN-0.380
enders-0.377
inson-0.370
GC-0.367
Tokentypes
Token ID19199
Feature activation-6.22e-03

Pos logit contributions
+0.450
enders+0.439
inson+0.435
ICAN+0.431
GC+0.428
Neg logit contributions
 Demo-0.604
 Chow-0.583
 Zoro-0.559
 Rapt-0.558
 Desire-0.541
Token of
Token ID286
Feature activation+4.03e-01

Pos logit contributions
+0.019
ICAN+0.018
enders+0.018
inson+0.018
GC+0.018
Neg logit contributions
 Demo-0.023
 Chow-0.022
 Rapt-0.021
 Zoro-0.021
 Desire-0.021
Token bullies
Token ID42677
Feature activation-5.87e-03

Pos logit contributions
 Demo+1.793
 Chow+1.733
 Zoro+1.666
 Rapt+1.663
 Desire+1.630
Neg logit contributions
-0.943
ICAN-0.921
enders-0.900
inson-0.883
GC-0.871
Token in
Token ID287
Feature activation+1.08e-02

Pos logit contributions
+0.018
ICAN+0.017
enders+0.017
inson+0.017
GC+0.016
Neg logit contributions
 Demo-0.022
 Chow-0.021
 Zoro-0.020
 Rapt-0.020
 Desire-0.020
Token adolescence
Token ID37258
Feature activation-4.21e-01

Pos logit contributions
 Demo+0.043
 Chow+0.042
 Zoro+0.040
 Rapt+0.040
 Desire+0.039
Neg logit contributions
-0.029
ICAN-0.029
enders-0.029
inson-0.028
GC-0.028
Token.
Token ID764
Feature activation-1.32e+00

Pos logit contributions
+1.151
ICAN+1.134
enders+1.115
inson+1.098
GC+1.073
Neg logit contributions
 Demo-1.694
 Chow-1.627
 Zoro-1.566
 Rapt-1.559
 Desire-1.521
Token Journal
Token ID4913
Feature activation-7.32e-01

Pos logit contributions
ICAN+3.741
+3.623
GC+3.603
inson+3.596
NEY+3.564
Neg logit contributions
 Demo-4.753
 Chow-4.536
 Zoro-4.410
 Rapt-4.304
 Desire-4.203
Token of
Token ID286
Feature activation-4.46e-01

Pos logit contributions
+1.816
ICAN+1.815
enders+1.755
NEY+1.731
inson+1.727
Neg logit contributions
 Demo-2.968
 Chow-2.871
 Zoro-2.800
 Rapt-2.720
 Desire-2.690
Token Youth
Token ID17582
Feature activation-3.31e-01

Pos logit contributions
enders+1.335
ICAN+1.332
+1.332
inson+1.319
GC+1.301
Neg logit contributions
 Demo-1.576
 Chow-1.519
 Zoro-1.478
 Rapt-1.433
 Desire-1.378
Token and
Token ID290
Feature activation+1.47e-03

Pos logit contributions
enders+1.041
inson+1.029
+1.021
ICAN+1.017
GC+1.008
Neg logit contributions
 Demo-1.160
 Chow-1.115
 Zoro-1.079
 Rapt-1.064
 Desire-1.013
Token Ad
Token ID1215
Feature activation-1.84e-01

Pos logit contributions
 Demo+0.005
 Chow+0.005
 Zoro+0.005
 Rapt+0.005
 Desire+0.005
Neg logit contributions
-0.005
ICAN-0.005
enders-0.005
inson-0.005
GC-0.004
Token?
Token ID30
Feature activation+1.25e-01

Pos logit contributions
 Demo+0.063
 Chow+0.061
 Zoro+0.059
 Rapt+0.059
 Desire+0.058
Neg logit contributions
-0.043
ICAN-0.042
enders-0.041
GC-0.041
inson-0.040
Token The
Token ID383
Feature activation+9.65e-02

Pos logit contributions
 Demo+0.437
 Chow+0.419
 Zoro+0.394
 Rapt+0.394
 Desire+0.385
Neg logit contributions
-0.402
ICAN-0.394
enders-0.387
NEY-0.382
GC-0.381
Token debut
Token ID8886
Feature activation-4.42e-01

Pos logit contributions
 Demo+0.369
 Chow+0.356
 Zoro+0.336
 Rapt+0.336
 Desire+0.331
Neg logit contributions
-0.282
ICAN-0.278
enders-0.273
inson-0.267
GC-0.264
Token of
Token ID286
Feature activation-1.46e-01

Pos logit contributions
+1.395
ICAN+1.363
enders+1.332
inson+1.318
GC+1.302
Neg logit contributions
 Demo-1.618
 Chow-1.553
 Rapt-1.483
 Zoro-1.480
 Desire-1.445
Token Superman
Token ID14951
Feature activation-2.64e-01

Pos logit contributions
+0.497
ICAN+0.492
enders+0.487
inson+0.475
GC+0.472
Neg logit contributions
 Demo-0.481
 Chow-0.455
 Rapt-0.432
 Zoro-0.428
 Desire-0.418
Token #
Token ID1303
Feature activation-1.31e+00

Pos logit contributions
+0.788
ICAN+0.776
enders+0.768
inson+0.758
GC+0.737
Neg logit contributions
 Demo-0.983
 Chow-0.952
 Rapt-0.909
 Zoro-0.908
 Desire-0.894
Token1
Token ID16
Feature activation-3.94e-01

Pos logit contributions
+4.558
GC+4.380
ICAN+4.367
enders+4.274
NEY+4.198
Neg logit contributions
 Demo-4.081
 Chow-4.024
 Rapt-3.756
 Zoro-3.666
 Desire-3.617
Token,
Token ID11
Feature activation-1.89e-02

Pos logit contributions
+1.193
ICAN+1.149
enders+1.140
inson+1.126
GC+1.103
Neg logit contributions
 Demo-1.468
 Chow-1.423
 Rapt-1.352
 Zoro-1.345
 Desire-1.328
Token the
Token ID262
Feature activation+2.42e-02

Pos logit contributions
+0.054
ICAN+0.052
enders+0.052
inson+0.051
GC+0.050
Neg logit contributions
 Demo-0.074
 Chow-0.071
 Rapt-0.068
 Zoro-0.067
 Desire-0.066
Token first
Token ID717
Feature activation-2.73e-02

Pos logit contributions
 Demo+0.091
 Chow+0.087
 Rapt+0.083
 Zoro+0.082
 Desire+0.081
Neg logit contributions
-0.072
ICAN-0.072
enders-0.070
inson-0.069
GC-0.068
Token superhero
Token ID17343
Feature activation-1.87e-01

Pos logit contributions
+0.081
ICAN+0.079
enders+0.078
inson+0.077
GC+0.076
Neg logit contributions
 Demo-0.103
 Chow-0.099
 Rapt-0.094
 Zoro-0.093
 Desire-0.093
Token traffic
Token ID4979
Feature activation-2.59e-01

Pos logit contributions
+0.687
enders+0.673
ICAN+0.672
GC+0.658
inson+0.655
Neg logit contributions
 Demo-0.779
 Chow-0.748
 Rapt-0.720
 Zoro-0.716
 Desire-0.700
Token in
Token ID287
Feature activation+2.28e-01

Pos logit contributions
+0.711
enders+0.704
ICAN+0.693
inson+0.685
GC+0.679
Neg logit contributions
 Demo-1.008
 Chow-0.981
 Zoro-0.947
 Rapt-0.941
 Desire-0.918
Token/
Token ID14
Feature activation-3.84e-01

Pos logit contributions
 Demo+0.778
 Chow+0.744
 Zoro+0.712
 Rapt+0.712
 Desire+0.691
Neg logit contributions
-0.764
ICAN-0.740
enders-0.736
inson-0.725
GC-0.717
Tokenout
Token ID448
Feature activation+7.68e-02

Pos logit contributions
+1.180
ICAN+1.140
enders+1.135
inson+1.114
GC+1.109
Neg logit contributions
 Demo-1.436
 Chow-1.385
 Zoro-1.332
 Rapt-1.329
 Desire-1.289
Token of
Token ID286
Feature activation+1.18e-01

Pos logit contributions
 Demo+0.284
 Chow+0.273
 Rapt+0.260
 Zoro+0.260
 Desire+0.254
Neg logit contributions
-0.238
ICAN-0.232
enders-0.228
inson-0.225
GC-0.222
Token #
Token ID1303
Feature activation-1.31e+00

Pos logit contributions
 Demo+0.394
 Chow+0.377
 Zoro+0.360
 Rapt+0.359
 Desire+0.350
Neg logit contributions
-0.401
ICAN-0.392
enders-0.386
inson-0.380
GC-0.378
TokenH
Token ID39
Feature activation-2.46e-01

Pos logit contributions
+4.038
GC+3.874
ICAN+3.800
enders+3.790
NEY+3.709
Neg logit contributions
 Demo-4.745
 Chow-4.632
 Rapt-4.446
 Zoro-4.395
 Desire-4.340
Tokenof
Token ID1659
Feature activation-4.02e-01

Pos logit contributions
+0.811
ICAN+0.792
enders+0.786
inson+0.780
GC+0.776
Neg logit contributions
 Demo-0.860
 Chow-0.826
 Rapt-0.791
 Zoro-0.788
 Desire-0.772
TokenDeb
Token ID16587
Feature activation-7.18e-01

Pos logit contributions
+1.205
ICAN+1.186
enders+1.166
GC+1.155
inson+1.149
Neg logit contributions
 Demo-1.533
 Chow-1.476
 Zoro-1.417
 Rapt-1.415
 Desire-1.376
Tokenate
Token ID378
Feature activation-2.15e-01

Pos logit contributions
+1.691
ICAN+1.671
GC+1.650
enders+1.639
inson+1.633
Neg logit contributions
 Demo-3.093
 Chow-2.993
 Rapt-2.923
 Zoro-2.911
 Desire-2.883
Token16
Token ID1433
Feature activation-2.04e-01

Pos logit contributions
+0.630
ICAN+0.620
enders+0.606
GC+0.601
inson+0.598
Neg logit contributions
 Demo-0.819
 Chow-0.792
 Rapt-0.762
 Zoro-0.759
 Desire-0.745
Token energy
Token ID2568
Feature activation+2.71e-02

Pos logit contributions
 Demo+0.137
 Chow+0.131
 Rapt+0.126
 Zoro+0.126
 Desire+0.124
Neg logit contributions
-0.085
ICAN-0.082
enders-0.081
inson-0.080
GC-0.080
Token goals
Token ID4661
Feature activation+1.59e-01

Pos logit contributions
 Demo+0.120
 Chow+0.115
 Zoro+0.112
 Rapt+0.111
 Desire+0.110
Neg logit contributions
-0.063
ICAN-0.061
enders-0.060
GC-0.059
inson-0.058
Token.
Token ID13
Feature activation+1.00e-01

Pos logit contributions
 Demo+0.580
 Chow+0.557
 Rapt+0.534
 Zoro+0.531
 Desire+0.520
Neg logit contributions
-0.495
ICAN-0.483
enders-0.479
inson-0.468
GC-0.468
Token<|endoftext|>
Token ID50256
Feature activation-1.04e-02

Pos logit contributions
 Demo+0.345
 Chow+0.331
 Rapt+0.316
 Zoro+0.316
 Desire+0.309
Neg logit contributions
-0.332
ICAN-0.327
enders-0.320
GC-0.314
inson-0.313
TokenMedia
Token ID13152
Feature activation+1.61e-01

Pos logit contributions
+0.029
ICAN+0.028
enders+0.027
GC+0.027
inson+0.027
Neg logit contributions
 Demo-0.043
 Chow-0.041
 Rapt-0.039
 Zoro-0.039
 Desire-0.039
Token playback
Token ID16388
Feature activation-1.30e+00

Pos logit contributions
 Demo+0.641
 Chow+0.620
 Zoro+0.595
 Rapt+0.592
 Desire+0.577
Neg logit contributions
-0.447
ICAN-0.432
enders-0.431
GC-0.422
inson-0.421
Token is
Token ID318
Feature activation-1.76e-01

Pos logit contributions
+2.697
GC+2.673
inson+2.644
ICAN+2.623
enders+2.597
Neg logit contributions
 Demo-5.649
 Chow-5.598
 Rapt-5.386
 Zoro-5.362
 Desire-5.129
Token unsupported
Token ID24222
Feature activation-3.07e-01

Pos logit contributions
+0.418
enders+0.405
ICAN+0.405
NEY+0.394
GC+0.393
Neg logit contributions
 Demo-0.764
 Chow-0.740
 Rapt-0.712
 Zoro-0.711
 Desire-0.697
Token on
Token ID319
Feature activation+3.24e-02

Pos logit contributions
+0.828
ICAN+0.808
GC+0.794
enders+0.787
NEY+0.772
Neg logit contributions
 Demo-1.232
 Chow-1.192
 Zoro-1.154
 Rapt-1.151
 Desire-1.123
Token your
Token ID534
Feature activation-4.43e-02

Pos logit contributions
 Demo+0.134
 Chow+0.130
 Zoro+0.125
 Rapt+0.124
 Desire+0.122
Neg logit contributions
-0.085
ICAN-0.082
enders-0.081
inson-0.080
GC-0.080
Token device
Token ID3335
Feature activation-7.53e-01

Pos logit contributions
+0.125
ICAN+0.121
enders+0.119
inson+0.117
GC+0.116
Neg logit contributions
 Demo-0.174
 Chow-0.169
 Rapt-0.163
 Zoro-0.162
 Desire-0.157
Token from
Token ID422
Feature activation+8.22e-02

Pos logit contributions
+1.647
ICAN+1.582
enders+1.569
inson+1.549
GC+1.537
Neg logit contributions
 Demo-2.360
 Chow-2.276
 Rapt-2.187
 Zoro-2.166
 Desire-2.123
Token Europe
Token ID2031
Feature activation-3.83e-01

Pos logit contributions
 Demo+0.291
 Chow+0.279
 Rapt+0.266
 Zoro+0.265
 Desire+0.258
Neg logit contributions
-0.263
ICAN-0.257
enders-0.256
inson-0.251
GC-0.250
Token and
Token ID290
Feature activation+5.29e-02

Pos logit contributions
+1.118
ICAN+1.099
enders+1.071
inson+1.049
GC+1.043
Neg logit contributions
 Demo-1.475
 Chow-1.423
 Rapt-1.363
 Zoro-1.355
 Desire-1.330
Token North
Token ID2258
Feature activation+8.38e-02

Pos logit contributions
 Demo+0.191
 Chow+0.185
 Zoro+0.175
 Rapt+0.175
 Desire+0.171
Neg logit contributions
-0.167
ICAN-0.162
enders-0.161
inson-0.158
GC-0.158
Token America
Token ID2253
Feature activation-3.12e-01

Pos logit contributions
 Demo+0.269
 Chow+0.258
 Zoro+0.243
 Rapt+0.243
 Desire+0.237
Neg logit contributions
-0.300
ICAN-0.294
enders-0.290
inson-0.286
GC-0.283
Token.
Token ID764
Feature activation-1.29e+00

Pos logit contributions
+0.757
ICAN+0.748
enders+0.723
inson+0.707
GC+0.702
Neg logit contributions
 Demo-1.354
 Chow-1.314
 Rapt-1.260
 Zoro-1.257
 Desire-1.231
Token International
Token ID4037
Feature activation-5.90e-01

Pos logit contributions
ICAN+3.560
+3.492
GC+3.461
inson+3.412
enders+3.403
Neg logit contributions
 Demo-4.852
 Chow-4.587
 Zoro-4.443
 Rapt-4.402
 Desire-4.328
Token Journal
Token ID4913
Feature activation-2.74e-01

Pos logit contributions
ICAN+1.397
+1.391
enders+1.379
inson+1.350
GC+1.336
Neg logit contributions
 Demo-2.506
 Chow-2.402
 Rapt-2.328
 Zoro-2.315
 Desire-2.276
Token of
Token ID286
Feature activation-1.44e-01

Pos logit contributions
+0.909
ICAN+0.875
enders+0.864
inson+0.853
GC+0.846
Neg logit contributions
 Demo-0.976
 Chow-0.938
 Rapt-0.894
 Zoro-0.882
 Desire-0.865
Token Public
Token ID5094
Feature activation-1.22e-01

Pos logit contributions
ICAN+0.409
+0.408
enders+0.402
inson+0.400
GC+0.393
Neg logit contributions
 Demo-0.550
 Chow-0.526
 Zoro-0.511
 Rapt-0.501
 Desire-0.488
Token Health
Token ID3893
Feature activation+7.49e-03

Pos logit contributions
ICAN+0.472
+0.471
enders+0.466
inson+0.462
GC+0.456
Neg logit contributions
 Demo-0.342
 Chow-0.324
 Rapt-0.310
 Zoro-0.308
 Desire-0.295
Tokens
Token ID82
Feature activation-1.22e-02

Pos logit contributions
+1.082
ICAN+1.058
enders+1.045
inson+1.034
GC+1.015
Neg logit contributions
 Demo-1.298
 Chow-1.259
 Zoro-1.191
 Rapt-1.186
 Desire-1.153
Token in
Token ID287
Feature activation+3.16e-03

Pos logit contributions
+0.038
ICAN+0.037
enders+0.037
inson+0.036
GC+0.036
Neg logit contributions
 Demo-0.045
 Chow-0.043
 Rapt-0.041
 Zoro-0.041
 Desire-0.040
Token the
Token ID262
Feature activation-1.93e-01

Pos logit contributions
 Demo+0.013
 Chow+0.012
 Zoro+0.012
 Rapt+0.012
 Desire+0.011
Neg logit contributions
-0.009
ICAN-0.008
enders-0.008
inson-0.008
GC-0.008
Token middle
Token ID3504
Feature activation-5.81e-01

Pos logit contributions
+0.557
ICAN+0.550
enders+0.539
inson+0.535
GC+0.525
Neg logit contributions
 Demo-0.742
 Chow-0.706
 Zoro-0.678
 Rapt-0.674
 Desire-0.659
Token grades
Token ID19051
Feature activation-3.13e-01

Pos logit contributions
+1.472
ICAN+1.417
enders+1.396
inson+1.356
GC+1.343
Neg logit contributions
 Demo-2.445
 Chow-2.350
 Zoro-2.276
 Rapt-2.248
 Desire-2.227
Token.
Token ID764
Feature activation-1.27e+00

Pos logit contributions
+0.872
ICAN+0.855
enders+0.833
inson+0.826
GC+0.814
Neg logit contributions
 Demo-1.248
 Chow-1.202
 Zoro-1.152
 Rapt-1.145
 Desire-1.122
Token RM
Token ID29820
Feature activation-3.59e-01

Pos logit contributions
ICAN+3.499
+3.413
GC+3.375
inson+3.362
enders+3.320
Neg logit contributions
 Demo-4.670
 Chow-4.458
 Zoro-4.335
 Rapt-4.238
 Darrell-4.132
TokenLE
Token ID2538
Feature activation+1.07e-01

Pos logit contributions
ICAN+0.936
+0.930
GC+0.898
enders+0.897
inson+0.888
Neg logit contributions
 Demo-1.486
 Chow-1.423
 Zoro-1.378
 Rapt-1.372
 Desire-1.337
Token Online
Token ID7467
Feature activation-6.21e-01

Pos logit contributions
 Demo+0.417
 Chow+0.404
 Zoro+0.385
 Rapt+0.384
 Desire+0.377
Neg logit contributions
-0.307
ICAN-0.302
enders-0.294
inson-0.291
GC-0.288
Token:
Token ID25
Feature activation-4.10e-01

Pos logit contributions
+1.798
ICAN+1.772
enders+1.737
inson+1.725
GC+1.695
Neg logit contributions
 Demo-2.355
 Chow-2.291
 Zoro-2.191
 Rapt-2.185
 Desire-2.132
Token Research
Token ID4992
Feature activation-2.69e-01

Pos logit contributions
+1.295
ICAN+1.283
enders+1.263
inson+1.241
GC+1.240
Neg logit contributions
 Demo-1.447
 Chow-1.402
 Zoro-1.328
 Rapt-1.320
 Desire-1.290
Token matchup
Token ID18406
Feature activation-3.89e-01

Pos logit contributions
+0.359
ICAN+0.351
enders+0.343
inson+0.338
GC+0.333
Neg logit contributions
 Demo-0.620
 Chow-0.600
 Rapt-0.579
 Zoro-0.575
 Desire-0.569
Token between
Token ID1022
Feature activation+2.15e-01

Pos logit contributions
+0.913
ICAN+0.891
enders+0.878
inson+0.876
GC+0.839
Neg logit contributions
 Demo-1.681
 Chow-1.632
 Rapt-1.574
 Zoro-1.557
 Desire-1.552
Token Donald
Token ID3759
Feature activation+2.29e-02

Pos logit contributions
 Demo+0.727
 Chow+0.695
 Rapt+0.669
 Zoro+0.665
 Desire+0.645
Neg logit contributions
-0.718
ICAN-0.698
enders-0.695
inson-0.684
GC-0.678
Token Trump
Token ID1301
Feature activation+1.87e-01

Pos logit contributions
 Demo+0.081
 Chow+0.078
 Rapt+0.074
 Zoro+0.074
 Desire+0.072
Neg logit contributions
-0.075
ICAN-0.074
enders-0.072
inson-0.071
GC-0.070
Token Donald
Token ID3759
Feature activation-8.83e-01

Pos logit contributions
 Demo+0.606
 Chow+0.579
 Rapt+0.547
 Zoro+0.545
 Desire+0.532
Neg logit contributions
-0.668
ICAN-0.653
enders-0.640
inson-0.634
GC-0.628
Token John
Token ID1757
Feature activation-1.27e+00

Pos logit contributions
+3.074
ICAN+3.061
GC+3.011
NEY+2.970
enders+2.964
Neg logit contributions
 Demo-2.703
 Chow-2.609
 Rapt-2.537
 Zoro-2.447
 Desire-2.391
Token Trump
Token ID1301
Feature activation-3.20e-01

Pos logit contributions
+3.328
enders+3.311
inson+3.277
ICAN+3.266
NEY+3.126
Neg logit contributions
 Demo-4.965
 Chow-4.746
 Rapt-4.624
 Zoro-4.549
 Desire-4.546
TokenHouse
Token ID18102
Feature activation-8.31e-01

Pos logit contributions
+0.885
ICAN+0.859
enders+0.852
GC+0.827
inson+0.826
Neg logit contributions
 Demo-1.293
 Chow-1.253
 Rapt-1.199
 Zoro-1.196
 Desire-1.173
Token committee
Token ID5583
Feature activation-2.13e-01

Pos logit contributions
ICAN+2.747
+2.715
enders+2.692
NEY+2.620
inson+2.617
Neg logit contributions
 Chow-2.764
 Demo-2.745
 Zoro-2.544
 Rapt-2.536
 Desire-2.483
Token believes
Token ID5804
Feature activation-1.73e-01

Pos logit contributions
ICAN+0.353
+0.349
enders+0.335
GC+0.323
inson+0.322
Neg logit contributions
 Chow-1.066
 Demo-1.053
 Zoro-1.033
 Desire-1.003
 Rapt-0.995
Token it
Token ID340
Feature activation+2.11e-01

Pos logit contributions
+0.594
ICAN+0.580
GC+0.564
enders+0.563
NEY+0.561
Neg logit contributions
 Demo-0.569
 Chow-0.559
 Zoro-0.528
 Rapt-0.524
 Desire-0.507
Token Republican
Token ID3415
Feature activation-3.97e-02

Pos logit contributions
 Demo+0.879
 Chow+0.846
 Zoro+0.804
 Rapt+0.804
 Desire+0.785
Neg logit contributions
-0.882
ICAN-0.867
enders-0.857
inson-0.842
GC-0.839
Token nominee
Token ID10429
Feature activation-5.33e-02

Pos logit contributions
+0.101
enders+0.099
ICAN+0.098
inson+0.096
GC+0.095
Neg logit contributions
 Demo-0.163
 Chow-0.159
 Zoro-0.153
 Rapt-0.153
 Desire-0.150
Token Donald
Token ID3759
Feature activation+3.54e-02

Pos logit contributions
+0.186
ICAN+0.182
enders+0.182
inson+0.179
GC+0.177
Neg logit contributions
 Demo-0.171
 Chow-0.165
 Rapt-0.156
 Zoro-0.155
 Desire-0.153
Token Trump
Token ID1301
Feature activation+1.20e-01

Pos logit contributions
 Demo+0.133
 Chow+0.129
 Zoro+0.123
 Rapt+0.122
 Desire+0.121
Neg logit contributions
-0.104
ICAN-0.101
enders-0.098
inson-0.098
NEY-0.098
Token Donald
Token ID3759
Feature activation-1.09e+00

Pos logit contributions
 Demo+0.385
 Chow+0.368
 Zoro+0.348
 Rapt+0.348
 Desire+0.339
Neg logit contributions
-0.422
ICAN-0.416
enders-0.410
inson-0.408
GC-0.401
Token John
Token ID1757
Feature activation-1.27e+00

Pos logit contributions
ICAN+3.651
+3.638
GC+3.584
NEY+3.544
enders+3.522
Neg logit contributions
 Demo-3.433
 Chow-3.317
 Rapt-3.225
 Zoro-3.108
 Desire-3.050
Token Trump
Token ID1301
Feature activation-5.11e-01

Pos logit contributions
+3.380
enders+3.356
ICAN+3.319
inson+3.311
NEY+3.154
Neg logit contributions
 Demo-4.935
 Chow-4.751
 Rapt-4.562
 Zoro-4.523
 Desire-4.513
TokenHouse
Token ID18102
Feature activation-8.43e-01

Pos logit contributions
+1.407
ICAN+1.360
enders+1.351
GC+1.311
inson+1.302
Neg logit contributions
 Demo-2.077
 Chow-2.019
 Rapt-1.930
 Zoro-1.926
 Desire-1.888
Token committee
Token ID5583
Feature activation-2.35e-01

Pos logit contributions
ICAN+2.663
+2.625
enders+2.621
inson+2.545
NEY+2.534
Neg logit contributions
 Chow-2.937
 Demo-2.850
 Zoro-2.686
 Rapt-2.667
 Desire-2.617
Token believes
Token ID5804
Feature activation-3.83e-01

Pos logit contributions
ICAN+0.389
+0.382
enders+0.362
inson+0.359
GC+0.356
Neg logit contributions
 Chow-1.175
 Demo-1.159
 Zoro-1.136
 Desire-1.109
 Rapt-1.102
Token it
Token ID340
Feature activation+1.00e-01

Pos logit contributions
+1.203
ICAN+1.184
GC+1.149
NEY+1.134
enders+1.130
Neg logit contributions
 Demo-1.358
 Chow-1.340
 Zoro-1.271
 Rapt-1.258
 Desire-1.218
Token
Token ID3926
Feature activation-4.92e-01

Pos logit contributions
+0.372
ICAN+0.367
enders+0.361
inson+0.355
GC+0.354
Neg logit contributions
 Demo-0.457
 Chow-0.444
 Rapt-0.421
 Zoro-0.421
 Desire-0.412
Token another
Token ID1194
Feature activation-2.13e-01

Pos logit contributions
+1.483
ICAN+1.472
enders+1.444
GC+1.409
inson+1.406
Neg logit contributions
 Demo-1.800
 Chow-1.737
 Zoro-1.660
 Rapt-1.650
 Desire-1.622
Token father
Token ID2988
Feature activation+2.95e-01

Pos logit contributions
+0.653
ICAN+0.646
enders+0.628
GC+0.625
inson+0.620
Neg logit contributions
 Demo-0.774
 Chow-0.752
 Zoro-0.711
 Rapt-0.708
 Desire-0.697
Token killed
Token ID2923
Feature activation-5.07e-02

Pos logit contributions
 Demo+1.129
 Chow+1.087
 Rapt+1.036
 Zoro+1.036
 Desire+1.016
Neg logit contributions
-0.875
ICAN-0.855
enders-0.836
inson-0.827
GC-0.818
Token by
Token ID416
Feature activation+3.84e-02

Pos logit contributions
+0.145
ICAN+0.142
enders+0.139
inson+0.137
GC+0.136
Neg logit contributions
 Demo-0.197
 Chow-0.190
 Rapt-0.182
 Zoro-0.182
 Desire-0.178
Token #
Token ID1303
Feature activation-1.24e+00

Pos logit contributions
 Demo+0.133
 Chow+0.127
 Zoro+0.121
 Rapt+0.121
 Desire+0.118
Neg logit contributions
-0.125
ICAN-0.124
enders-0.122
GC-0.120
inson-0.120
TokenIsrael
Token ID14040
Feature activation-4.73e-01

Pos logit contributions
+3.250
ICAN+3.204
GC+3.203
enders+3.178
inson+3.092
Neg logit contributions
 Demo-5.086
 Chow-4.967
 Rapt-4.697
 Zoro-4.687
 Desire-4.636
Token
Token ID3926
Feature activation-5.00e-01

Pos logit contributions
+1.308
ICAN+1.299
enders+1.269
GC+1.268
inson+1.262
Neg logit contributions
 Demo-1.861
 Chow-1.805
 Zoro-1.734
 Rapt-1.716
 Desire-1.696
Token another
Token ID1194
Feature activation-3.58e-01

Pos logit contributions
+1.488
ICAN+1.471
enders+1.433
GC+1.417
inson+1.407
Neg logit contributions
 Demo-1.847
 Chow-1.773
 Zoro-1.698
 Rapt-1.691
 Desire-1.662
Token father
Token ID2988
Feature activation-1.14e-01

Pos logit contributions
+1.082
ICAN+1.067
enders+1.035
GC+1.031
inson+1.027
Neg logit contributions
 Demo-1.315
 Chow-1.272
 Zoro-1.207
 Rapt-1.207
 Desire-1.181
Token carrying
Token ID6872
Feature activation-1.41e-01

Pos logit contributions
+0.331
ICAN+0.326
enders+0.317
inson+0.314
GC+0.312
Neg logit contributions
 Demo-0.437
 Chow-0.420
 Rapt-0.401
 Zoro-0.400
 Desire-0.394
Token United
Token ID1578
Feature activation+1.21e-01

Pos logit contributions
 Demo+0.781
 Chow+0.744
 Rapt+0.704
 Zoro+0.703
 Desire+0.684
Neg logit contributions
-0.791
ICAN-0.774
enders-0.763
inson-0.749
GC-0.748
Token States
Token ID1829
Feature activation+4.91e-01

Pos logit contributions
 Demo+0.395
 Chow+0.378
 Zoro+0.357
 Rapt+0.356
 Desire+0.347
Neg logit contributions
-0.429
ICAN-0.421
enders-0.414
inson-0.409
GC-0.406
Token 3
Token ID513
Feature activation+1.77e-01

Pos logit contributions
 Demo+1.620
 Chow+1.555
 Zoro+1.465
 Rapt+1.463
 Desire+1.426
Neg logit contributions
-1.706
ICAN-1.672
enders-1.644
inson-1.623
GC-1.621
Token240
Token ID16102
Feature activation+2.65e-01

Pos logit contributions
 Demo+0.597
 Chow+0.577
 Rapt+0.542
 Zoro+0.542
 Desire+0.527
Neg logit contributions
-0.609
ICAN-0.593
enders-0.586
inson-0.578
GC-0.572
Token Posts
Token ID12043
Feature activation-1.18e+00

Pos logit contributions
 Demo+0.872
 Chow+0.836
 Zoro+0.791
 Rapt+0.789
 Desire+0.769
Neg logit contributions
-0.925
ICAN-0.905
enders-0.895
inson-0.883
GC-0.878
Token #
Token ID1303
Feature activation-1.23e+00

Pos logit contributions
+1.387
ICAN+1.314
enders+1.287
inson+1.234
GC+1.209
Neg logit contributions
 Demo-6.367
 Chow-6.263
 Rapt-6.051
 Zoro-6.040
 Desire-5.962
Token10
Token ID940
Feature activation-3.85e-01

Pos logit contributions
+4.117
GC+3.940
ICAN+3.936
enders+3.897
inson+3.792
Neg logit contributions
 Demo-4.078
 Chow-4.007
 Zoro-3.778
 Rapt-3.723
 Desire-3.662
Token Happy
Token ID14628
Feature activation-5.02e-01

Pos logit contributions
+1.215
ICAN+1.206
enders+1.200
GC+1.183
inson+1.172
Neg logit contributions
 Demo-1.316
 Chow-1.287
 Zoro-1.215
 Rapt-1.206
 Desire-1.198
Token it
Token ID340
Feature activation-5.83e-01

Pos logit contributions
+1.520
enders+1.504
ICAN+1.498
GC+1.492
inson+1.471
Neg logit contributions
 Demo-1.790
 Chow-1.739
 Zoro-1.642
 Rapt-1.640
 Desire-1.626
Token worked
Token ID3111
Feature activation-3.72e-01

Pos logit contributions
+1.692
enders+1.641
ICAN+1.623
GC+1.600
inson+1.599
Neg logit contributions
 Demo-2.209
 Chow-2.129
 Rapt-2.049
 Zoro-2.029
 Desire-2.025
Token out
Token ID503
Feature activation+4.74e-01

Pos logit contributions
+1.085
enders+1.059
ICAN+1.049
inson+1.042
GC+1.033
Neg logit contributions
 Demo-1.371
 Chow-1.339
 Rapt-1.271
 Zoro-1.264
 Desire-1.258
Token from
Token ID422
Feature activation+1.23e-01

Pos logit contributions
 Demo+0.475
 Chow+0.456
 Rapt+0.434
 Zoro+0.433
 Desire+0.424
Neg logit contributions
-0.394
ICAN-0.386
enders-0.385
inson-0.381
GC-0.375
Token £
Token ID4248
Feature activation-3.88e-01

Pos logit contributions
 Demo+0.291
 Chow+0.271
 Rapt+0.252
 Zoro+0.250
 Desire+0.240
Neg logit contributions
-0.539
ICAN-0.533
enders-0.532
inson-0.525
GC-0.522
Token9
Token ID24
Feature activation-2.42e-01

Pos logit contributions
+1.432
ICAN+1.411
enders+1.383
GC+1.377
inson+1.363
Neg logit contributions
 Demo-1.203
 Chow-1.145
 Rapt-1.088
 Zoro-1.075
 Desire-1.040
Token79
Token ID3720
Feature activation-1.01e-01

Pos logit contributions
+0.798
ICAN+0.778
enders+0.765
inson+0.763
GC+0.755
Neg logit contributions
 Demo-0.833
 Chow-0.805
 Rapt-0.767
 Zoro-0.764
 Desire-0.745
Token b
Token ID275
Feature activation-5.12e-02

Pos logit contributions
+0.313
ICAN+0.305
enders+0.302
inson+0.297
GC+0.294
Neg logit contributions
 Demo-0.371
 Chow-0.357
 Rapt-0.341
 Zoro-0.340
 Desire-0.332
Token &
Token ID1222
Feature activation+8.21e-01

Pos logit contributions
+0.152
ICAN+0.149
enders+0.148
inson+0.146
GC+0.143
Neg logit contributions
 Demo-0.194
 Chow-0.187
 Zoro-0.178
 Rapt-0.178
 Desire-0.174
Token b
Token ID275
Feature activation-2.69e-01

Pos logit contributions
 Demo+2.039
 Chow+1.918
 Zoro+1.765
 Rapt+1.762
 Desire+1.709
Neg logit contributions
-3.529
ICAN-3.494
enders-3.420
inson-3.382
GC-3.363
Token.
Token ID13
Feature activation-2.50e-02

Pos logit contributions
+0.743
ICAN+0.722
enders+0.717
inson+0.705
GC+0.691
Neg logit contributions
 Demo-1.079
 Chow-1.036
 Zoro-0.992
 Rapt-0.991
 Desire-0.971
Token
Token ID198
Feature activation-2.03e-01

Pos logit contributions
+0.082
ICAN+0.081
enders+0.079
inson+0.079
GC+0.078
Neg logit contributions
 Demo-0.087
 Chow-0.083
 Rapt-0.079
 Zoro-0.079
 Desire-0.076
Token
Token ID198
Feature activation-1.39e-01

Pos logit contributions
+0.635
ICAN+0.612
GC+0.600
enders+0.599
inson+0.593
Neg logit contributions
 Demo-0.744
 Chow-0.719
 Rapt-0.684
 Zoro-0.682
 Desire-0.669
TokenThomas
Token ID22405
Feature activation-9.48e-02

Pos logit contributions
+0.388
ICAN+0.373
enders+0.362
GC+0.362
inson+0.357
Neg logit contributions
 Demo-0.574
 Chow-0.553
 Rapt-0.529
 Zoro-0.526
 Desire-0.517
Token up
Token ID510
Feature activation-2.51e-02

Pos logit contributions
 Demo+0.532
 Chow+0.504
 Rapt+0.484
 Zoro+0.481
 Desire+0.478
Neg logit contributions
-0.385
ICAN-0.373
enders-0.370
inson-0.366
GC-0.361
Token.
Token ID13
Feature activation+4.47e-02

Pos logit contributions
+0.070
enders+0.068
ICAN+0.068
inson+0.067
GC+0.066
Neg logit contributions
 Demo-0.099
 Chow-0.095
 Rapt-0.091
 Zoro-0.090
 Desire-0.089
Token Her
Token ID2332
Feature activation+3.13e-01

Pos logit contributions
 Demo+0.141
 Chow+0.132
 Zoro+0.126
 Rapt+0.125
 Desire+0.123
Neg logit contributions
-0.160
ICAN-0.158
enders-0.155
NEY-0.152
GC-0.152
Token eyes
Token ID2951
Feature activation+3.38e-01

Pos logit contributions
 Demo+1.119
 Chow+1.058
 Zoro+1.013
 Rapt+1.006
 Desire+0.992
Neg logit contributions
-1.001
ICAN-0.975
enders-0.962
inson-0.949
GC-0.943
Token were
Token ID547
Feature activation+3.50e-01

Pos logit contributions
 Demo+1.194
 Chow+1.140
 Rapt+1.086
 Zoro+1.078
 Desire+1.051
Neg logit contributions
-1.110
ICAN-1.080
enders-1.076
inson-1.051
GC-1.043
Token still
Token ID991
Feature activation+7.44e-01

Pos logit contributions
 Demo+1.410
 Chow+1.346
 Rapt+1.288
 Zoro+1.287
 Desire+1.263
Neg logit contributions
-0.979
ICAN-0.957
enders-0.934
inson-0.917
GC-0.907
Token red
Token ID2266
Feature activation+1.75e-01

Pos logit contributions
 Demo+3.096
 Chow+2.973
 Rapt+2.846
 Zoro+2.846
 Desire+2.794
Neg logit contributions
-1.980
ICAN-1.927
enders-1.885
inson-1.852
GC-1.830
Token from
Token ID422
Feature activation+8.99e-02

Pos logit contributions
 Demo+0.653
 Chow+0.625
 Zoro+0.594
 Rapt+0.593
 Desire+0.581
Neg logit contributions
-0.535
ICAN-0.521
enders-0.519
inson-0.509
GC-0.502
Token cutting
Token ID7720
Feature activation+2.46e-01

Pos logit contributions
 Demo+0.335
 Chow+0.316
 Zoro+0.302
 Rapt+0.302
 Desire+0.295
Neg logit contributions
-0.275
enders-0.263
ICAN-0.263
GC-0.259
inson-0.258
Token up
Token ID510
Feature activation-3.80e-02

Pos logit contributions
 Demo+0.884
 Chow+0.837
 Zoro+0.801
 Rapt+0.798
 Desire+0.784
Neg logit contributions
-0.782
ICAN-0.769
enders-0.763
inson-0.747
NEY-0.737
Token the
Token ID262
Feature activation+2.37e-03

Pos logit contributions
+0.120
ICAN+0.118
enders+0.118
inson+0.115
GC+0.112
Neg logit contributions
 Demo-0.135
 Chow-0.128
 Zoro-0.123
 Rapt-0.122
 Desire-0.120
Token-
Token ID12
Feature activation+3.37e-01

Pos logit contributions
 Demo+0.678
 Chow+0.658
 Zoro+0.620
 Rapt+0.617
 Desire+0.605
Neg logit contributions
-0.704
ICAN-0.685
enders-0.677
inson-0.674
GC-0.668
Tokenedge
Token ID14907
Feature activation+2.85e-01

Pos logit contributions
 Demo+1.096
 Chow+1.055
 Zoro+1.004
 Rapt+0.998
 Desire+0.970
Neg logit contributions
-1.187
enders-1.164
ICAN-1.159
inson-1.139
GC-1.138
Token CPR
Token ID42920
Feature activation+2.82e-01

Pos logit contributions
 Demo+1.084
 Chow+1.055
 Zoro+1.010
 Rapt+1.001
 Desire+0.987
Neg logit contributions
-0.831
ICAN-0.820
enders-0.797
inson-0.788
GC-0.778
Token dummy
Token ID31548
Feature activation+1.62e-02

Pos logit contributions
 Demo+1.233
 Chow+1.202
 Zoro+1.158
 Rapt+1.153
 Desire+1.133
Neg logit contributions
-0.673
ICAN-0.658
enders-0.640
inson-0.632
GC-0.618
Token comes
Token ID2058
Feature activation+3.88e-01

Pos logit contributions
 Demo+0.064
 Chow+0.062
 Zoro+0.060
 Rapt+0.059
 Desire+0.059
Neg logit contributions
-0.044
ICAN-0.044
enders-0.043
inson-0.043
GC-0.042
Token with
Token ID351
Feature activation+7.06e-01

Pos logit contributions
 Demo+1.519
 Chow+1.475
 Rapt+1.399
 Zoro+1.399
 Desire+1.383
Neg logit contributions
-1.085
ICAN-1.067
inson-1.043
enders-1.042
GC-1.024
Token a
Token ID257
Feature activation+3.15e-01

Pos logit contributions
 Demo+2.487
 Chow+2.391
 Zoro+2.269
 Rapt+2.268
 Desire+2.216
Neg logit contributions
-2.311
ICAN-2.262
enders-2.224
inson-2.196
GC-2.170
Token companion
Token ID15185
Feature activation+1.60e-02

Pos logit contributions
 Demo+1.171
 Chow+1.138
 Zoro+1.084
 Rapt+1.073
 Desire+1.062
Neg logit contributions
-0.950
ICAN-0.931
enders-0.910
inson-0.896
GC-0.888
Token dummy
Token ID31548
Feature activation-2.09e-01

Pos logit contributions
 Demo+0.059
 Chow+0.057
 Zoro+0.055
 Rapt+0.054
 Desire+0.054
Neg logit contributions
-0.047
ICAN-0.047
enders-0.046
inson-0.045
GC-0.045
Token that
Token ID326
Feature activation+5.42e-02

Pos logit contributions
+0.627
ICAN+0.617
inson+0.608
enders+0.605
GC+0.592
Neg logit contributions
 Demo-0.766
 Chow-0.742
 Zoro-0.712
 Rapt-0.706
 Desire-0.694
Token is
Token ID318
Feature activation+2.77e-02

Pos logit contributions
 Demo+0.212
 Chow+0.205
 Zoro+0.196
 Rapt+0.195
 Desire+0.193
Neg logit contributions
-0.150
ICAN-0.147
enders-0.147
inson-0.146
GC-0.142
Token Trudeau
Token ID17524
Feature activation-4.24e-02

Pos logit contributions
+0.353
ICAN+0.346
enders+0.343
GC+0.339
inson+0.338
Neg logit contributions
 Demo-0.495
 Chow-0.472
 Rapt-0.460
 Desire-0.456
 Zoro-0.455
Token
Token ID447
Feature activation-1.40e-01

Pos logit contributions
+0.102
enders+0.101
ICAN+0.101
inson+0.100
GC+0.098
Neg logit contributions
 Demo-0.179
 Chow-0.173
 Rapt-0.169
 Zoro-0.168
 Desire-0.165
Token
Token ID247
Feature activation+2.03e-03

Pos logit contributions
ICAN+0.504
+0.498
enders+0.489
inson+0.487
GC+0.487
Neg logit contributions
 Demo-0.445
 Chow-0.430
 Zoro-0.418
 Rapt-0.405
 Desire-0.402
Tokens
Token ID82
Feature activation-5.11e-03

Pos logit contributions
 Demo+0.008
 Chow+0.007
 Zoro+0.007
 Rapt+0.007
 Desire+0.007
Neg logit contributions
-0.006
ICAN-0.006
enders-0.006
inson-0.006
GC-0.006
Token Liberals
Token ID20162
Feature activation+3.39e-01

Pos logit contributions
+0.017
ICAN+0.017
enders+0.017
NEY+0.016
GC+0.016
Neg logit contributions
 Demo-0.017
 Chow-0.017
 Zoro-0.016
 Rapt-0.016
 Desire-0.015
Token approved
Token ID6325
Feature activation+6.97e-01

Pos logit contributions
 Demo+1.334
 Chow+1.285
 Zoro+1.250
 Rapt+1.248
 Desire+1.203
Neg logit contributions
-0.931
ICAN-0.923
enders-0.904
inson-0.894
GC-0.893
Token last
Token ID938
Feature activation+1.98e-01

Pos logit contributions
 Demo+2.697
 Chow+2.600
 Rapt+2.483
 Zoro+2.481
 Desire+2.429
Neg logit contributions
-2.041
ICAN-1.996
enders-1.955
inson-1.925
GC-1.902
Token year
Token ID614
Feature activation+2.48e-01

Pos logit contributions
 Demo+0.783
 Chow+0.753
 Rapt+0.720
 Zoro+0.720
 Desire+0.708
Neg logit contributions
-0.563
ICAN-0.549
enders-0.539
inson-0.527
GC-0.523
Token.
Token ID13
Feature activation+1.23e-01

Pos logit contributions
 Demo+0.923
 Chow+0.889
 Zoro+0.851
 Rapt+0.851
 Desire+0.831
Neg logit contributions
-0.756
ICAN-0.744
enders-0.729
inson-0.721
GC-0.710
Token
Token ID198
Feature activation+1.53e-01

Pos logit contributions
 Demo+0.428
 Chow+0.411
 Rapt+0.394
 Zoro+0.392
 Desire+0.384
Neg logit contributions
-0.394
ICAN-0.389
enders-0.380
NEY-0.374
GC-0.370
Token
Token ID198
Feature activation+8.74e-02

Pos logit contributions
 Demo+0.549
 Chow+0.529
 Rapt+0.501
 Zoro+0.501
 Desire+0.491
Neg logit contributions
-0.494
ICAN-0.480
enders-0.471
GC-0.467
inson-0.464
Token and
Token ID290
Feature activation+1.63e-01

Pos logit contributions
 Demo+1.598
 Chow+1.542
 Rapt+1.464
 Zoro+1.462
 Desire+1.431
Neg logit contributions
-1.485
ICAN-1.452
enders-1.435
inson-1.415
GC-1.397
Token repeatedly
Token ID7830
Feature activation+3.05e-01

Pos logit contributions
 Demo+0.659
 Chow+0.640
 Zoro+0.612
 Rapt+0.611
 Desire+0.601
Neg logit contributions
-0.437
ICAN-0.428
enders-0.424
inson-0.417
GC-0.410
Token.
Token ID13
Feature activation+1.29e-01

Pos logit contributions
 Demo+1.125
 Chow+1.087
 Zoro+1.036
 Rapt+1.036
 Desire+1.013
Neg logit contributions
-0.944
ICAN-0.918
enders-0.906
inson-0.894
GC-0.882
Token At
Token ID1629
Feature activation+1.52e-01

Pos logit contributions
 Demo+0.458
 Chow+0.441
 Zoro+0.426
 Rapt+0.422
 Desire+0.411
Neg logit contributions
-0.405
ICAN-0.400
enders-0.390
GC-0.386
NEY-0.384
Token the
Token ID262
Feature activation+1.56e-01

Pos logit contributions
 Demo+0.533
 Chow+0.510
 Zoro+0.495
 Rapt+0.491
 Desire+0.482
Neg logit contributions
-0.480
ICAN-0.479
enders-0.473
inson-0.460
GC-0.459
Token time
Token ID640
Feature activation+6.78e-01

Pos logit contributions
 Demo+0.591
 Chow+0.572
 Zoro+0.554
 Rapt+0.549
 Desire+0.541
Neg logit contributions
ICAN-0.443
-0.443
enders-0.440
GC-0.427
inson-0.426
Token,
Token ID11
Feature activation+3.35e-01

Pos logit contributions
 Demo+2.488
 Chow+2.396
 Rapt+2.271
 Zoro+2.261
 Desire+2.218
Neg logit contributions
-2.136
ICAN-2.085
enders-2.036
inson-2.011
GC-1.985
Token he
Token ID339
Feature activation+2.15e-01

Pos logit contributions
 Demo+1.064
 Chow+1.011
 Zoro+0.969
 Rapt+0.962
 Desire+0.937
Neg logit contributions
-1.196
ICAN-1.183
enders-1.169
inson-1.152
GC-1.144
Token said
Token ID531
Feature activation+9.39e-02

Pos logit contributions
 Demo+0.894
 Chow+0.864
 Rapt+0.830
 Zoro+0.829
 Desire+0.813
Neg logit contributions
-0.559
ICAN-0.542
enders-0.542
inson-0.532
GC-0.522
Token,
Token ID11
Feature activation+1.68e-01

Pos logit contributions
 Demo+0.327
 Chow+0.314
 Zoro+0.300
 Rapt+0.299
 Desire+0.293
Neg logit contributions
-0.305
ICAN-0.302
enders-0.297
inson-0.293
GC-0.291
Token
Token ID564
Feature activation-7.81e-02

Pos logit contributions
 Demo+0.545
 Chow+0.521
 Zoro+0.497
 Rapt+0.496
 Desire+0.484
Neg logit contributions
-0.586
ICAN-0.580
enders-0.571
GC-0.561
inson-0.561
Token Wall
Token ID5007
Feature activation+6.23e-01

Pos logit contributions
 Demo+0.623
 Chow+0.598
 Zoro+0.570
 Rapt+0.569
 Desire+0.562
Neg logit contributions
-0.527
ICAN-0.513
enders-0.512
inson-0.501
GC-0.497
Token Street
Token ID3530
Feature activation-8.38e-02

Pos logit contributions
 Demo+2.621
 Chow+2.533
 Rapt+2.429
 Zoro+2.428
 Desire+2.381
Neg logit contributions
-1.617
ICAN-1.571
enders-1.547
inson-1.519
GC-1.491
Token,
Token ID11
Feature activation-8.89e-02

Pos logit contributions
+0.249
ICAN+0.244
enders+0.241
inson+0.236
GC+0.235
Neg logit contributions
 Demo-0.313
 Chow-0.304
 Zoro-0.292
 Rapt-0.287
 Desire-0.285
Token which
Token ID543
Feature activation-2.93e-04

Pos logit contributions
+0.272
ICAN+0.267
enders+0.266
GC+0.263
inson+0.261
Neg logit contributions
 Demo-0.322
 Chow-0.308
 Zoro-0.296
 Rapt-0.295
 Desire-0.289
Token at
Token ID379
Feature activation-1.00e-02

Pos logit contributions
+0.001
enders+0.001
ICAN+0.001
inson+0.001
GC+0.001
Neg logit contributions
 Demo-0.001
 Chow-0.001
 Zoro-0.001
 Rapt-0.001
 Desire-0.001
Token 72
Token ID7724
Feature activation+1.52e-01

Pos logit contributions
+0.034
enders+0.034
ICAN+0.033
inson+0.033
GC+0.032
Neg logit contributions
 Demo-0.033
 Chow-0.032
 Zoro-0.031
 Rapt-0.030
 Desire-0.030
Token-
Token ID12
Feature activation+1.98e-01

Pos logit contributions
 Demo+0.584
 Chow+0.565
 Zoro+0.541
 Rapt+0.540
 Desire+0.529
Neg logit contributions
-0.442
enders-0.431
ICAN-0.429
inson-0.422
GC-0.419
Tokenstories
Token ID50164
Feature activation-3.43e-01

Pos logit contributions
 Demo+0.694
 Chow+0.668
 Zoro+0.637
 Rapt+0.630
 Desire+0.620
Neg logit contributions
-0.657
enders-0.638
ICAN-0.636
inson-0.626
GC-0.619
Token is
Token ID318
Feature activation+2.48e-01

Pos logit contributions
+0.951
enders+0.927
ICAN+0.910
inson+0.905
GC+0.904
Neg logit contributions
 Demo-1.336
 Chow-1.298
 Zoro-1.238
 Rapt-1.223
 Desire-1.210
Token one
Token ID530
Feature activation+3.69e-01

Pos logit contributions
 Demo+0.955
 Chow+0.919
 Rapt+0.879
 Zoro+0.877
 Desire+0.864
Neg logit contributions
-0.720
ICAN-0.703
enders-0.697
inson-0.687
GC-0.683
Token of
Token ID286
Feature activation+2.39e-01

Pos logit contributions
 Demo+1.308
 Chow+1.250
 Rapt+1.191
 Zoro+1.187
 Desire+1.152
Neg logit contributions
-1.207
ICAN-1.195
enders-1.164
inson-1.149
GC-1.136
Token?
Token ID30
Feature activation-1.30e-01

Pos logit contributions
+0.490
ICAN+0.479
enders+0.473
GC+0.466
inson+0.463
Neg logit contributions
 Demo-0.584
 Chow-0.572
 Zoro-0.544
 Rapt-0.542
 Desire-0.531
Token Huh
Token ID45412
Feature activation-3.56e-01

Pos logit contributions
+0.409
ICAN+0.401
enders+0.393
GC+0.390
inson+0.387
Neg logit contributions
 Demo-0.463
 Chow-0.451
 Zoro-0.429
 Rapt-0.427
 Desire-0.418
Token.
Token ID13
Feature activation-2.77e-01

Pos logit contributions
+1.034
ICAN+1.014
inson+0.989
enders+0.987
GC+0.980
Neg logit contributions
 Demo-1.360
 Chow-1.309
 Zoro-1.257
 Rapt-1.245
 Desire-1.225
TokenPC
Token ID5662
Feature activation+3.76e-02

Pos logit contributions
+0.884
ICAN+0.867
enders+0.853
GC+0.848
inson+0.836
Neg logit contributions
 Demo-0.975
 Chow-0.947
 Zoro-0.904
 Rapt-0.893
 Desire-0.877
TokenThis
Token ID1212
Feature activation-3.20e-01

Pos logit contributions
 Demo+0.134
 Chow+0.131
 Zoro+0.125
 Rapt+0.124
 Desire+0.121
Neg logit contributions
-0.119
ICAN-0.116
enders-0.115
GC-0.113
inson-0.113
Token really
Token ID1107
Feature activation+2.47e-01

Pos logit contributions
+0.948
ICAN+0.920
enders+0.912
GC+0.902
inson+0.897
Neg logit contributions
 Demo-1.182
 Chow-1.164
 Zoro-1.106
 Rapt-1.101
 Desire-1.082
Token only
Token ID691
Feature activation-4.28e-01

Pos logit contributions
 Demo+0.963
 Chow+0.933
 Zoro+0.895
 Rapt+0.894
 Desire+0.880
Neg logit contributions
-0.698
ICAN-0.672
enders-0.672
inson-0.661
GC-0.651
Token has
Token ID468
Feature activation-1.31e-01

Pos logit contributions
enders+1.242
+1.232
GC+1.210
ICAN+1.206
inson+1.198
Neg logit contributions
 Demo-1.569
 Chow-1.525
 Zoro-1.467
 Rapt-1.459
 Desire-1.435
Token one
Token ID530
Feature activation-1.34e-01

Pos logit contributions
+0.426
enders+0.423
ICAN+0.420
GC+0.414
inson+0.414
Neg logit contributions
 Demo-0.440
 Chow-0.428
 Rapt-0.400
 Zoro-0.399
 Desire-0.395
Token new
Token ID649
Feature activation-8.68e-02

Pos logit contributions
+0.358
ICAN+0.348
enders+0.344
GC+0.341
inson+0.338
Neg logit contributions
 Demo-0.537
 Chow-0.526
 Rapt-0.498
 Zoro-0.497
 Desire-0.493
Token trick
Token ID6908
Feature activation-3.29e-01

Pos logit contributions
+0.236
ICAN+0.228
enders+0.227
GC+0.223
inson+0.222
Neg logit contributions
 Demo-0.347
 Chow-0.339
 Rapt-0.322
 Zoro-0.321
 Desire-0.316
Token (
Token ID357
Feature activation-8.41e-02

Pos logit contributions
+0.593
ICAN+0.581
enders+0.575
inson+0.568
GC+0.563
Neg logit contributions
 Demo-0.630
 Chow-0.609
 Rapt-0.581
 Zoro-0.580
 Desire-0.565
TokenARM
Token ID33456
Feature activation+4.34e-02

Pos logit contributions
+0.248
ICAN+0.243
enders+0.238
inson+0.234
GC+0.234
Neg logit contributions
 Demo-0.325
 Chow-0.315
 Rapt-0.300
 Zoro-0.299
 Desire-0.293
Token version
Token ID2196
Feature activation-4.06e-01

Pos logit contributions
 Demo+0.160
 Chow+0.155
 Zoro+0.147
 Rapt+0.147
 Desire+0.144
Neg logit contributions
-0.134
ICAN-0.131
enders-0.129
inson-0.127
GC-0.126
Token,
Token ID11
Feature activation-9.62e-02

Pos logit contributions
+1.197
ICAN+1.165
enders+1.144
inson+1.127
GC+1.116
Neg logit contributions
 Demo-1.556
 Chow-1.502
 Rapt-1.435
 Zoro-1.432
 Desire-1.402
Token detected
Token ID12326
Feature activation+1.80e-01

Pos logit contributions
+0.187
ICAN+0.180
enders+0.175
inson+0.171
GC+0.167
Neg logit contributions
 Demo-0.467
 Chow-0.454
 Rapt-0.437
 Zoro-0.437
 Desire-0.429
Token as
Token ID355
Feature activation+2.61e-02

Pos logit contributions
 Demo+0.644
 Chow+0.616
 Rapt+0.591
 Zoro+0.588
 Desire+0.575
Neg logit contributions
-0.575
ICAN-0.557
enders-0.548
inson-0.547
GC-0.541
Token EL
Token ID17852
Feature activation-4.93e-01

Pos logit contributions
 Demo+0.114
 Chow+0.110
 Rapt+0.106
 Zoro+0.105
 Desire+0.103
Neg logit contributions
-0.064
ICAN-0.062
enders-0.060
inson-0.059
GC-0.058
TokenF
Token ID37
Feature activation-6.69e-02

Pos logit contributions
+1.374
ICAN+1.368
GC+1.323
enders+1.290
NEY+1.288
Neg logit contributions
 Demo-1.965
 Chow-1.912
 Zoro-1.835
 Rapt-1.806
 Desire-1.798
Token_
Token ID62
Feature activation-1.81e-01

Pos logit contributions
+0.150
ICAN+0.148
GC+0.141
enders+0.139
NEY+0.138
Neg logit contributions
 Demo-0.306
 Chow-0.298
 Zoro-0.286
 Rapt-0.285
 Desire-0.281
TokenUM
Token ID5883
Feature activation-1.70e-01

Pos logit contributions
+0.462
ICAN+0.449
enders+0.439
inson+0.431
GC+0.423
Neg logit contributions
 Demo-0.772
 Chow-0.746
 Rapt-0.715
 Zoro-0.715
 Desire-0.701
TokenRE
Token ID2200
Feature activation-2.14e-01

Pos logit contributions
+0.556
ICAN+0.551
GC+0.533
NEY+0.533
enders+0.519
Neg logit contributions
 Demo-0.608
 Chow-0.587
 Zoro-0.557
 Rapt-0.548
 Desire-0.537
Tokens
Token ID82
Feature activation-1.85e-02

Pos logit contributions
+0.114
ICAN+0.112
enders+0.110
inson+0.109
GC+0.107
Neg logit contributions
 Demo-0.135
 Chow-0.130
 Zoro-0.124
 Rapt-0.124
 Desire-0.121
Token government
Token ID1230
Feature activation+1.11e-01

Pos logit contributions
+0.053
ICAN+0.051
enders+0.051
inson+0.050
GC+0.050
Neg logit contributions
 Demo-0.072
 Chow-0.070
 Zoro-0.066
 Rapt-0.066
 Desire-0.065
Token has
Token ID468
Feature activation+7.08e-02

Pos logit contributions
 Demo+0.449
 Chow+0.433
 Zoro+0.416
 Rapt+0.413
 Desire+0.406
Neg logit contributions
-0.307
ICAN-0.298
enders-0.292
inson-0.291
GC-0.286
Token pursued
Token ID19189
Feature activation-2.96e-01

Pos logit contributions
 Demo+0.308
 Chow+0.298
 Zoro+0.287
 Rapt+0.286
 Desire+0.282
Neg logit contributions
-0.172
ICAN-0.165
enders-0.163
inson-0.162
GC-0.158
Token its
Token ID663
Feature activation+2.19e-01

Pos logit contributions
+0.883
enders+0.853
ICAN+0.850
inson+0.837
GC+0.825
Neg logit contributions
 Demo-1.107
 Chow-1.066
 Zoro-1.024
 Rapt-1.008
 Desire-1.003
Token interests
Token ID5353
Feature activation-5.68e-02

Pos logit contributions
 Demo+0.908
 Chow+0.878
 Zoro+0.842
 Rapt+0.836
 Desire+0.823
Neg logit contributions
-0.575
ICAN-0.559
enders-0.548
inson-0.535
GC-0.534
Token by
Token ID416
Feature activation-7.45e-02

Pos logit contributions
+0.163
ICAN+0.160
enders+0.158
inson+0.157
GC+0.154
Neg logit contributions
 Demo-0.218
 Chow-0.212
 Zoro-0.201
 Rapt-0.200
 Desire-0.198
Token stepping
Token ID17413
Feature activation-1.63e-01

Pos logit contributions
+0.188
ICAN+0.180
inson+0.178
enders+0.177
GC+0.172
Neg logit contributions
 Demo-0.315
 Chow-0.305
 Zoro-0.293
 Rapt-0.292
 Desire-0.290
Token up
Token ID510
Feature activation-1.17e-01

Pos logit contributions
+0.524
ICAN+0.503
enders+0.501
inson+0.500
NEY+0.494
Neg logit contributions
 Demo-0.577
 Chow-0.556
 Zoro-0.531
 Rapt-0.528
 Desire-0.519
Token its
Token ID663
Feature activation+1.55e-02

Pos logit contributions
+0.330
enders+0.315
ICAN+0.314
inson+0.313
GC+0.306
Neg logit contributions
 Demo-0.460
 Chow-0.447
 Zoro-0.429
 Rapt-0.426
 Desire-0.422
Token cyber
Token ID10075
Feature activation-1.73e-01

Pos logit contributions
 Demo+0.063
 Chow+0.061
 Zoro+0.059
 Rapt+0.058
 Desire+0.058
Neg logit contributions
-0.042
ICAN-0.039
enders-0.039
inson-0.038
GC-0.038
Token his
Token ID465
Feature activation+4.39e-01

Pos logit contributions
+0.273
ICAN+0.266
enders+0.265
inson+0.262
NEY+0.261
Neg logit contributions
 Demo-0.256
 Chow-0.237
 Zoro-0.232
 Rapt-0.230
 Desire-0.222
Token next
Token ID1306
Feature activation+3.14e-01

Pos logit contributions
 Demo+1.729
 Chow+1.656
 Zoro+1.588
 Rapt+1.581
 Desire+1.547
Neg logit contributions
-1.257
ICAN-1.219
enders-1.203
inson-1.190
GC-1.163
Token effort
Token ID3626
Feature activation+2.05e-01

Pos logit contributions
 Demo+1.293
 Chow+1.247
 Zoro+1.193
 Rapt+1.190
 Desire+1.165
Neg logit contributions
-0.841
ICAN-0.809
enders-0.798
inson-0.788
GC-0.772
Token,
Token ID11
Feature activation+3.48e-01

Pos logit contributions
 Demo+0.789
 Chow+0.757
 Zoro+0.724
 Rapt+0.723
 Desire+0.705
Neg logit contributions
-0.605
ICAN-0.585
enders-0.578
inson-0.569
NEY-0.561
Token a
Token ID257
Feature activation+3.60e-01

Pos logit contributions
 Demo+1.215
 Chow+1.161
 Zoro+1.106
 Rapt+1.104
 Desire+1.074
Neg logit contributions
-1.148
ICAN-1.124
enders-1.107
inson-1.090
NEY-1.079
Token story
Token ID1621
Feature activation+3.66e-02

Pos logit contributions
 Demo+1.402
 Chow+1.345
 Rapt+1.284
 Zoro+1.282
 Desire+1.254
Neg logit contributions
-1.049
ICAN-1.018
enders-0.998
inson-0.982
NEY-0.971
Token titled
Token ID11946
Feature activation+2.40e-01

Pos logit contributions
 Demo+0.141
 Chow+0.136
 Rapt+0.130
 Zoro+0.130
 Desire+0.127
Neg logit contributions
-0.106
ICAN-0.103
enders-0.102
inson-0.100
NEY-0.099
Token "
Token ID366
Feature activation+3.79e-01

Pos logit contributions
 Demo+0.821
 Chow+0.784
 Zoro+0.746
 Rapt+0.744
 Desire+0.723
Neg logit contributions
-0.807
ICAN-0.790
enders-0.778
inson-0.766
NEY-0.759
TokenThe
Token ID464
Feature activation+9.42e-02

Pos logit contributions
 Demo+1.565
 Chow+1.510
 Zoro+1.446
 Rapt+1.446
 Desire+1.416
Neg logit contributions
-1.022
ICAN-0.994
enders-0.970
inson-0.952
GC-0.946
Token Fox
Token ID5426
Feature activation+2.12e-01

Pos logit contributions
 Demo+0.310
 Chow+0.296
 Zoro+0.281
 Rapt+0.278
 Desire+0.272
Neg logit contributions
-0.324
ICAN-0.319
enders-0.314
inson-0.310
GC-0.305
Token",
Token ID1600
Feature activation-1.71e-01

Pos logit contributions
 Demo+0.739
 Chow+0.709
 Zoro+0.679
 Rapt+0.672
 Desire+0.657
Neg logit contributions
-0.684
ICAN-0.675
enders-0.668
inson-0.662
NEY-0.648
Token
Token ID564
Feature activation-3.70e-01

Pos logit contributions
+0.230
ICAN+0.226
enders+0.225
GC+0.219
inson+0.219
Neg logit contributions
 Demo-0.267
 Chow-0.256
 Zoro-0.245
 Rapt-0.245
 Desire-0.239
Token
Token ID250
Feature activation-1.90e-01

Pos logit contributions
+1.052
ICAN+1.047
enders+1.023
inson+1.009
GC+1.003
Neg logit contributions
 Demo-1.431
 Chow-1.376
 Zoro-1.329
 Rapt-1.327
 Desire-1.293
Tokensign
Token ID12683
Feature activation-5.79e-01

Pos logit contributions
+0.604
ICAN+0.587
enders+0.584
GC+0.572
inson+0.566
Neg logit contributions
 Demo-0.698
 Chow-0.675
 Rapt-0.642
 Zoro-0.640
 Desire-0.626
Tokenificantly
Token ID42491
Feature activation+3.94e-01

Pos logit contributions
+1.985
enders+1.962
ICAN+1.938
inson+1.925
GC+1.907
Neg logit contributions
 Demo-1.906
 Chow-1.820
 Rapt-1.728
 Zoro-1.727
 Desire-1.694
Token scaling
Token ID20796
Feature activation-7.70e-02

Pos logit contributions
 Demo+1.473
 Chow+1.417
 Rapt+1.352
 Zoro+1.350
 Desire+1.324
Neg logit contributions
-1.212
ICAN-1.181
enders-1.159
inson-1.141
GC-1.131
Token back
Token ID736
Feature activation-4.50e-01

Pos logit contributions
+0.193
ICAN+0.188
enders+0.182
GC+0.179
NEY+0.178
Neg logit contributions
 Demo-0.329
 Chow-0.318
 Zoro-0.307
 Rapt-0.305
 Desire-0.301
Token the
Token ID262
Feature activation-1.78e-01

Pos logit contributions
+1.347
ICAN+1.341
enders+1.298
GC+1.285
NEY+1.282
Neg logit contributions
 Demo-1.664
 Chow-1.601
 Zoro-1.546
 Rapt-1.532
 Desire-1.516
Token Clinton
Token ID2605
Feature activation-2.42e-01

Pos logit contributions
+0.444
ICAN+0.441
enders+0.426
GC+0.415
inson+0.413
Neg logit contributions
 Demo-0.751
 Chow-0.725
 Zoro-0.700
 Rapt-0.696
 Desire-0.687
Token Foundation
Token ID5693
Feature activation+2.85e-02

Pos logit contributions
+0.511
ICAN+0.509
enders+0.488
GC+0.480
inson+0.476
Neg logit contributions
 Demo-1.118
 Chow-1.082
 Rapt-1.046
 Zoro-1.046
 Desire-1.030
Token and
Token ID290
Feature activation-1.99e-01

Pos logit contributions
 Demo+0.109
 Chow+0.105
 Zoro+0.101
 Rapt+0.100
 Desire+0.099
Neg logit contributions
-0.082
ICAN-0.082
enders-0.080
inson-0.078
GC-0.078
Token our
Token ID674
Feature activation-1.05e-01

Pos logit contributions
+0.510
ICAN+0.499
enders+0.495
GC+0.485
inson+0.483
Neg logit contributions
 Demo-0.822
 Chow-0.795
 Zoro-0.767
 Rapt-0.760
 Desire-0.752
Token
Token ID198
Feature activation+1.90e-01

Pos logit contributions
 Demo+0.375
 Chow+0.360
 Zoro+0.346
 Rapt+0.343
 Desire+0.336
Neg logit contributions
-0.337
ICAN-0.330
enders-0.324
GC-0.317
NEY-0.315
Token
Token ID198
Feature activation+1.36e-01

Pos logit contributions
 Demo+0.677
 Chow+0.654
 Zoro+0.621
 Rapt+0.620
 Desire+0.606
Neg logit contributions
-0.616
ICAN-0.598
enders-0.587
GC-0.581
inson-0.579
TokenWe
Token ID1135
Feature activation+9.38e-02

Pos logit contributions
 Demo+0.570
 Chow+0.550
 Rapt+0.529
 Zoro+0.528
 Desire+0.520
Neg logit contributions
-0.362
ICAN-0.349
enders-0.340
GC-0.336
inson-0.330
Token asked
Token ID1965
Feature activation-1.93e-01

Pos logit contributions
 Demo+0.371
 Chow+0.357
 Zoro+0.345
 Rapt+0.342
 Desire+0.335
Neg logit contributions
-0.265
ICAN-0.258
enders-0.254
GC-0.250
NEY-0.248
Token Phil
Token ID4543
Feature activation-3.92e-01

Pos logit contributions
+0.617
enders+0.616
ICAN+0.609
GC+0.600
inson+0.598
Neg logit contributions
 Demo-0.655
 Chow-0.618
 Rapt-0.597
 Zoro-0.596
 Desire-0.577
Tokenadel
Token ID6959
Feature activation-2.65e-01

Pos logit contributions
enders+1.384
+1.382
inson+1.368
ICAN+1.368
GC+1.326
Neg logit contributions
 Demo-1.253
 Chow-1.189
 Rapt-1.152
 Zoro-1.145
 Desire-1.113
Tokenph
Token ID746
Feature activation-2.31e-01

Pos logit contributions
+0.836
ICAN+0.829
enders+0.819
inson+0.813
GC+0.794
Neg logit contributions
 Demo-0.962
 Chow-0.922
 Rapt-0.880
 Zoro-0.875
 Desire-0.866
Tokenians
Token ID1547
Feature activation-2.01e-02

Pos logit contributions
+0.437
ICAN+0.433
enders+0.421
inson+0.415
GC+0.399
Neg logit contributions
 Demo-1.122
 Chow-1.101
 Zoro-1.058
 Rapt-1.055
 Desire-1.046
Token around
Token ID1088
Feature activation-2.00e-01

Pos logit contributions
+0.059
ICAN+0.059
enders+0.058
inson+0.057
GC+0.056
Neg logit contributions
 Demo-0.075
 Chow-0.072
 Zoro-0.069
 Rapt-0.069
 Desire-0.067
Token the
Token ID262
Feature activation-6.78e-02

Pos logit contributions
+0.667
ICAN+0.649
enders+0.648
GC+0.632
inson+0.630
Neg logit contributions
 Demo-0.678
 Chow-0.644
 Zoro-0.618
 Rapt-0.617
 Desire-0.603
Token city
Token ID1748
Feature activation-2.24e-01

Pos logit contributions
+0.174
ICAN+0.169
enders+0.167
inson+0.164
GC+0.161
Neg logit contributions
 Demo-0.281
 Chow-0.270
 Zoro-0.262
 Rapt-0.261
 Desire-0.256
Token online
Token ID2691
Feature activation-6.37e-01

Pos logit contributions
+0.258
ICAN+0.253
enders+0.248
inson+0.246
GC+0.241
Neg logit contributions
 Demo-0.310
 Chow-0.302
 Zoro-0.286
 Rapt-0.285
 Desire-0.280
Token edition
Token ID8313
Feature activation-6.84e-01

Pos logit contributions
+1.836
ICAN+1.766
enders+1.763
inson+1.729
GC+1.704
Neg logit contributions
 Demo-2.378
 Chow-2.321
 Rapt-2.202
 Zoro-2.197
 Desire-2.162
Token of
Token ID286
Feature activation-1.79e-01

Pos logit contributions
+2.180
ICAN+2.129
enders+2.096
inson+2.064
GC+2.040
Neg logit contributions
 Demo-2.484
 Chow-2.385
 Rapt-2.277
 Zoro-2.268
 Desire-2.217
Token Y
Token ID575
Feature activation-4.43e-02

Pos logit contributions
+0.594
ICAN+0.579
enders+0.570
inson+0.562
GC+0.556
Neg logit contributions
 Demo-0.619
 Chow-0.596
 Zoro-0.565
 Rapt-0.561
 Desire-0.549
Tokenle
Token ID293
Feature activation-4.49e-01

Pos logit contributions
+0.135
ICAN+0.131
enders+0.130
inson+0.128
GC+0.127
Neg logit contributions
 Demo-0.167
 Chow-0.161
 Zoro-0.155
 Rapt-0.154
 Desire-0.149
Tokenis
Token ID271
Feature activation-3.99e-01

Pos logit contributions
+1.408
ICAN+1.373
enders+1.359
inson+1.359
GC+1.325
Neg logit contributions
 Demo-1.602
 Chow-1.548
 Rapt-1.482
 Zoro-1.479
 Desire-1.453
Tokenradio
Token ID37004
Feature activation+4.16e-02

Pos logit contributions
+1.328
enders+1.290
inson+1.289
ICAN+1.276
GC+1.248
Neg logit contributions
 Demo-1.356
 Chow-1.297
 Zoro-1.242
 Rapt-1.233
 Desire-1.202
Token,
Token ID11
Feature activation+1.09e-01

Pos logit contributions
 Demo+0.148
 Chow+0.142
 Zoro+0.135
 Rapt+0.135
 Desire+0.132
Neg logit contributions
-0.135
ICAN-0.132
enders-0.130
inson-0.129
GC-0.127
Token Finland
Token ID17837
Feature activation+1.92e-01

Pos logit contributions
 Demo+0.385
 Chow+0.370
 Zoro+0.351
 Rapt+0.350
 Desire+0.342
Neg logit contributions
-0.356
ICAN-0.347
enders-0.341
inson-0.337
GC-0.334
Token
Token ID447
Feature activation-4.83e-02

Pos logit contributions
 Demo+0.682
 Chow+0.655
 Rapt+0.623
 Zoro+0.622
 Desire+0.607
Neg logit contributions
-0.624
ICAN-0.609
enders-0.598
inson-0.590
GC-0.584
Token
Token ID247
Feature activation-3.37e-02

Pos logit contributions
ICAN+0.176
+0.175
enders+0.171
GC+0.169
inson+0.169
Neg logit contributions
 Demo-0.151
 Chow-0.145
 Zoro-0.139
 Rapt-0.137
 Desire-0.135
Tokenilateral
Token ID14796
Feature activation+2.98e-01

Pos logit contributions
+0.132
enders+0.130
ICAN+0.129
inson+0.128
GC+0.127
Neg logit contributions
 Demo-0.139
 Chow-0.133
 Rapt-0.127
 Zoro-0.126
 Desire-0.124
Token)
Token ID8
Feature activation+6.69e-02

Pos logit contributions
 Demo+1.061
 Chow+1.023
 Rapt+0.975
 Zoro+0.970
 Desire+0.948
Neg logit contributions
-0.965
ICAN-0.948
enders-0.933
inson-0.917
GC-0.915
Token cricket
Token ID18836
Feature activation-1.91e-02

Pos logit contributions
 Demo+0.224
 Chow+0.215
 Rapt+0.205
 Zoro+0.204
 Desire+0.199
Neg logit contributions
-0.229
ICAN-0.225
enders-0.224
GC-0.221
inson-0.220
Token and
Token ID290
Feature activation-5.33e-02

Pos logit contributions
+0.058
ICAN+0.057
enders+0.056
inson+0.056
GC+0.055
Neg logit contributions
 Demo-0.071
 Chow-0.069
 Rapt-0.066
 Zoro-0.066
 Desire-0.064
Token more
Token ID517
Feature activation-2.76e-01

Pos logit contributions
+0.161
ICAN+0.159
enders+0.157
GC+0.156
inson+0.155
Neg logit contributions
 Demo-0.196
 Chow-0.190
 Rapt-0.182
 Zoro-0.180
 Desire-0.176
Token O
Token ID440
Feature activation-2.71e-01

Pos logit contributions
+0.750
ICAN+0.748
enders+0.731
GC+0.726
inson+0.721
Neg logit contributions
 Demo-1.092
 Chow-1.065
 Zoro-1.014
 Rapt-1.011
 Desire-0.996
TokenDI
Token ID17931
Feature activation-3.08e-01

Pos logit contributions
+0.734
GC+0.731
enders+0.723
ICAN+0.714
inson+0.683
Neg logit contributions
 Demo-1.092
 Chow-1.055
 Rapt-1.022
 Zoro-1.017
 Desire-0.986
Token tournaments
Token ID18130
Feature activation-1.46e-01

Pos logit contributions
+1.048
ICAN+1.031
enders+1.026
GC+1.012
inson+1.008
Neg logit contributions
 Demo-1.028
 Chow-0.987
 Zoro-0.942
 Rapt-0.941
 Desire-0.920
Token,"
Token ID553
Feature activation+1.67e-01

Pos logit contributions
+0.438
ICAN+0.427
enders+0.425
inson+0.421
GC+0.414
Neg logit contributions
 Demo-0.546
 Chow-0.524
 Rapt-0.501
 Zoro-0.500
 Desire-0.490
Token D
Token ID360
Feature activation-8.66e-02

Pos logit contributions
 Demo+0.534
 Chow+0.509
 Rapt+0.483
 Zoro+0.482
 Desire+0.469
Neg logit contributions
-0.600
ICAN-0.584
enders-0.579
inson-0.570
GC-0.566
Tokenrav
Token ID4108
Feature activation+1.48e-01

Pos logit contributions
+0.262
inson+0.260
enders+0.259
ICAN+0.256
GC+0.251
Neg logit contributions
 Demo-0.330
 Chow-0.312
 Rapt-0.302
 Zoro-0.300
 Desire-0.296
Token should
Token ID815
Feature activation+4.67e-01

Pos logit contributions
 Demo+0.347
 Chow+0.333
 Rapt+0.318
 Zoro+0.313
 Desire+0.308
Neg logit contributions
-0.254
enders-0.247
ICAN-0.245
inson-0.245
GC-0.239
Token be
Token ID307
Feature activation+3.50e-01

Pos logit contributions
 Demo+1.711
 Chow+1.632
 Rapt+1.557
 Zoro+1.543
 Desire+1.512
Neg logit contributions
-1.458
ICAN-1.429
enders-1.416
inson-1.415
NEY-1.373
Token killed
Token ID2923
Feature activation+2.06e-01

Pos logit contributions
 Demo+1.447
 Chow+1.381
 Rapt+1.331
 Zoro+1.319
 Desire+1.312
Neg logit contributions
-0.933
ICAN-0.909
enders-0.900
inson-0.880
GC-0.869
Token and
Token ID290
Feature activation+1.69e-01

Pos logit contributions
 Demo+0.779
 Chow+0.745
 Rapt+0.710
 Zoro+0.704
 Desire+0.696
Neg logit contributions
-0.617
ICAN-0.604
enders-0.599
inson-0.592
GC-0.580
Token their
Token ID511
Feature activation-1.20e-01

Pos logit contributions
 Demo+0.601
 Chow+0.569
 Rapt+0.541
 Zoro+0.536
 Desire+0.533
Neg logit contributions
-0.547
ICAN-0.528
enders-0.525
inson-0.521
GC-0.517
Token children
Token ID1751
Feature activation-1.06e-01

Pos logit contributions
+0.366
ICAN+0.353
enders+0.353
inson+0.347
GC+0.341
Neg logit contributions
 Demo-0.439
 Chow-0.415
 Rapt-0.399
 Desire-0.390
 Zoro-0.389
Token and
Token ID290
Feature activation+1.31e-01

Pos logit contributions
+0.291
ICAN+0.282
enders+0.280
inson+0.276
GC+0.270
Neg logit contributions
 Demo-0.425
 Chow-0.408
 Rapt-0.388
 Zoro-0.387
 Desire-0.385
Token women
Token ID1466
Feature activation-8.49e-02

Pos logit contributions
 Demo+0.459
 Chow+0.434
 Rapt+0.415
 Zoro+0.411
 Desire+0.407
Neg logit contributions
-0.432
ICAN-0.417
enders-0.415
inson-0.411
GC-0.404
Token should
Token ID815
Feature activation+3.62e-01

Pos logit contributions
+0.240
ICAN+0.234
enders+0.232
inson+0.227
GC+0.225
Neg logit contributions
 Demo-0.334
 Chow-0.321
 Rapt-0.305
 Zoro-0.304
 Desire-0.302
Token be
Token ID307
Feature activation+1.65e-01

Pos logit contributions
 Demo+1.276
 Chow+1.220
 Rapt+1.157
 Zoro+1.152
 Desire+1.128
Neg logit contributions
-1.185
ICAN-1.158
enders-1.143
inson-1.137
NEY-1.110
Token taken
Token ID2077
Feature activation-7.59e-02

Pos logit contributions
 Demo+0.708
 Chow+0.680
 Rapt+0.652
 Zoro+0.650
 Desire+0.644
Neg logit contributions
-0.409
ICAN-0.394
enders-0.391
inson-0.384
GC-0.377
TokenOUNT
Token ID28270
Feature activation-4.90e-01

Pos logit contributions
+0.435
ICAN+0.429
enders+0.420
GC+0.416
inson+0.409
Neg logit contributions
 Demo-0.723
 Chow-0.705
 Rapt-0.672
 Zoro-0.671
 Desire-0.667
Token (
Token ID357
Feature activation-3.11e-02

Pos logit contributions
ICAN+1.421
+1.417
GC+1.376
enders+1.359
NEY+1.341
Neg logit contributions
 Demo-1.878
 Chow-1.826
 Rapt-1.721
 Zoro-1.716
 Desire-1.706
Token *
Token ID1635
Feature activation+2.05e-01

Pos logit contributions
+0.101
ICAN+0.099
enders+0.098
GC+0.097
inson+0.095
Neg logit contributions
 Demo-0.108
 Chow-0.105
 Zoro-0.099
 Rapt-0.099
 Desire-0.097
Token )
Token ID1267
Feature activation-3.19e-01

Pos logit contributions
 Demo+0.799
 Chow+0.774
 Rapt+0.736
 Zoro+0.734
 Desire+0.727
Neg logit contributions
-0.586
ICAN-0.577
enders-0.563
inson-0.555
GC-0.554
Token AS
Token ID7054
Feature activation-3.03e-01

Pos logit contributions
+0.854
ICAN+0.846
enders+0.819
GC+0.809
inson+0.807
Neg logit contributions
 Demo-1.292
 Chow-1.256
 Rapt-1.196
 Zoro-1.191
 Desire-1.183
Token count
Token ID954
Feature activation-9.60e-01

Pos logit contributions
+0.941
ICAN+0.911
enders+0.895
inson+0.884
NEY+0.882
Neg logit contributions
 Demo-1.081
 Chow-1.063
 Rapt-0.998
 Zoro-0.998
 Desire-0.992
Token FROM
Token ID16034
Feature activation-1.90e-01

Pos logit contributions
+2.377
ICAN+2.334
enders+2.310
GC+2.272
inson+2.250
Neg logit contributions
 Demo-3.980
 Chow-3.895
 Zoro-3.707
 Rapt-3.694
 Desire-3.660
Token (
Token ID357
Feature activation+6.00e-02

Pos logit contributions
+0.573
ICAN+0.558
enders+0.550
inson+0.546
GC+0.539
Neg logit contributions
 Demo-0.690
 Chow-0.679
 Zoro-0.647
 Rapt-0.640
 Desire-0.639
Token SELECT
Token ID33493
Feature activation-6.47e-02

Pos logit contributions
 Demo+0.216
 Chow+0.211
 Zoro+0.201
 Rapt+0.199
 Desire+0.197
Neg logit contributions
-0.187
ICAN-0.184
enders-0.179
inson-0.177
GC-0.176
Token SPL
Token ID46341
Feature activation-8.01e-01

Pos logit contributions
+0.211
ICAN+0.208
enders+0.202
inson+0.201
GC+0.200
Neg logit contributions
 Demo-0.224
 Chow-0.218
 Zoro-0.205
 Rapt-0.205
 Desire-0.203
TokenIT
Token ID2043
Feature activation-2.31e-01

Pos logit contributions
+2.221
ICAN+2.220
enders+2.188
GC+2.140
NEY+2.105
Neg logit contributions
 Demo-3.145
 Chow-3.037
 Zoro-2.903
 Rapt-2.893
 Desire-2.853
Token
Token ID447
Feature activation+1.58e-01

Pos logit contributions
 Demo+0.460
 Chow+0.448
 Zoro+0.430
 Rapt+0.428
 Desire+0.423
Neg logit contributions
-0.323
ICAN-0.315
enders-0.312
inson-0.303
GC-0.297
Token
Token ID247
Feature activation+3.70e-03

Pos logit contributions
 Demo+0.503
 Chow+0.481
 Zoro+0.465
 Rapt+0.454
 Desire+0.448
Neg logit contributions
-0.574
ICAN-0.569
enders-0.556
inson-0.550
GC-0.546
Tokens
Token ID82
Feature activation+8.25e-03

Pos logit contributions
 Demo+0.014
 Chow+0.013
 Zoro+0.013
 Rapt+0.013
 Desire+0.013
Neg logit contributions
-0.011
enders-0.011
ICAN-0.011
inson-0.011
GC-0.010
Token undeniable
Token ID36332
Feature activation+2.55e-01

Pos logit contributions
 Demo+0.031
 Chow+0.030
 Zoro+0.028
 Rapt+0.028
 Desire+0.028
Neg logit contributions
-0.024
ICAN-0.024
enders-0.024
NEY-0.023
GC-0.023
Token how
Token ID703
Feature activation-4.19e-02

Pos logit contributions
 Demo+0.916
 Chow+0.894
 Rapt+0.846
 Zoro+0.846
 Desire+0.827
Neg logit contributions
-0.805
ICAN-0.785
enders-0.771
inson-0.761
GC-0.759
Token well
Token ID880
Feature activation-8.12e-01

Pos logit contributions
+0.126
ICAN+0.121
enders+0.120
inson+0.119
NEY+0.116
Neg logit contributions
 Demo-0.157
 Chow-0.152
 Zoro-0.145
 Rapt-0.145
 Desire-0.143
Token Arduino
Token ID27634
Feature activation+5.76e-01

Pos logit contributions
+2.558
enders+2.429
ICAN+2.423
inson+2.413
NEY+2.367
Neg logit contributions
 Demo-2.760
 Chow-2.718
 Rapt-2.589
 Desire-2.552
 Zoro-2.545
Token has
Token ID468
Feature activation+1.65e-01

Pos logit contributions
 Demo+2.129
 Chow+2.059
 Rapt+1.961
 Zoro+1.960
 Desire+1.924
Neg logit contributions
-1.785
ICAN-1.738
enders-1.716
inson-1.688
GC-1.672
Token enabled
Token ID9343
Feature activation+1.22e-01

Pos logit contributions
 Demo+0.701
 Chow+0.682
 Rapt+0.652
 Zoro+0.651
 Desire+0.643
Neg logit contributions
-0.415
ICAN-0.396
enders-0.396
inson-0.389
GC-0.382
Token no
Token ID645
Feature activation+7.90e-02

Pos logit contributions
 Demo+0.439
 Chow+0.428
 Zoro+0.406
 Rapt+0.406
 Desire+0.398
Neg logit contributions
-0.382
ICAN-0.368
enders-0.367
inson-0.361
GC-0.357
Tokenv
Token ID85
Feature activation-4.95e-01

Pos logit contributions
 Demo+0.288
 Chow+0.279
 Rapt+0.267
 Zoro+0.265
 Desire+0.259
Neg logit contributions
-0.248
ICAN-0.242
enders-0.239
inson-0.236
GC-0.232
Tokenstate
Token ID5219
Feature activation-4.73e-01

Pos logit contributions
+0.002
enders+0.002
ICAN+0.002
inson+0.002
GC+0.002
Neg logit contributions
 Demo-0.003
 Chow-0.003
 Zoro-0.003
 Rapt-0.003
 Desire-0.003
Token :
Token ID1058
Feature activation-2.26e-01

Pos logit contributions
+1.211
ICAN+1.178
enders+1.149
inson+1.130
GC+1.113
Neg logit contributions
 Demo-2.005
 Chow-1.938
 Rapt-1.857
 Zoro-1.856
 Desire-1.820
Token Option
Token ID16018
Feature activation-4.37e-01

Pos logit contributions
+0.601
ICAN+0.587
enders+0.582
GC+0.571
inson+0.569
Neg logit contributions
 Demo-0.896
 Chow-0.878
 Zoro-0.842
 Rapt-0.840
 Desire-0.819
Token <
Token ID1279
Feature activation-3.35e-02

Pos logit contributions
+1.655
ICAN+1.630
enders+1.604
inson+1.580
GC+1.576
Neg logit contributions
 Demo-1.303
 Chow-1.248
 Zoro-1.176
 Rapt-1.170
 Desire-1.142
Token Board
Token ID5926
Feature activation-2.06e-01

Pos logit contributions
+0.109
ICAN+0.106
enders+0.105
GC+0.103
inson+0.103
Neg logit contributions
 Demo-0.116
 Chow-0.112
 Zoro-0.107
 Rapt-0.107
 Desire-0.104
Token >
Token ID1875
Feature activation-8.34e-01

Pos logit contributions
+0.636
ICAN+0.622
enders+0.612
inson+0.601
GC+0.599
Neg logit contributions
 Demo-0.759
 Chow-0.732
 Zoro-0.699
 Rapt-0.697
 Desire-0.682
Token,
Token ID837
Feature activation-2.60e-02

Pos logit contributions
+1.768
ICAN+1.711
enders+1.674
inson+1.629
GC+1.629
Neg logit contributions
 Demo-3.841
 Chow-3.750
 Zoro-3.624
 Rapt-3.603
 Desire-3.558
Token //
Token ID3373
Feature activation-2.35e-01

Pos logit contributions
+0.075
ICAN+0.073
enders+0.072
inson+0.071
GC+0.070
Neg logit contributions
 Demo-0.101
 Chow-0.098
 Zoro-0.094
 Rapt-0.094
 Desire-0.092
Token The
Token ID383
Feature activation+1.36e-01

Pos logit contributions
+0.708
enders+0.695
ICAN+0.689
GC+0.679
inson+0.679
Neg logit contributions
 Demo-0.850
 Chow-0.832
 Zoro-0.793
 Rapt-0.781
 Desire-0.762
Token previous
Token ID2180
Feature activation+4.49e-01

Pos logit contributions
 Demo+0.561
 Chow+0.544
 Zoro+0.521
 Rapt+0.520
 Desire+0.511
Neg logit contributions
-0.356
ICAN-0.348
enders-0.343
inson-0.337
GC-0.334
Token board
Token ID3096
Feature activation+1.10e-01

Pos logit contributions
 Demo+1.840
 Chow+1.781
 Zoro+1.703
 Rapt+1.702
 Desire+1.670
Neg logit contributions
-1.210
ICAN-1.177
enders-1.155
inson-1.135
GC-1.124
Token June
Token ID2795
Feature activation+5.45e-02

Pos logit contributions
+0.921
ICAN+0.864
GC+0.849
enders+0.844
inson+0.831
Neg logit contributions
 Demo-1.611
 Chow-1.559
 Rapt-1.498
 Zoro-1.495
 Desire-1.454
Token 2012
Token ID2321
Feature activation+3.15e-01

Pos logit contributions
 Demo+0.242
 Chow+0.234
 Rapt+0.224
 Zoro+0.224
 Desire+0.219
Neg logit contributions
-0.125
ICAN-0.120
enders-0.119
inson-0.116
GC-0.116
Token Canada
Token ID3340
Feature activation+2.80e-01

Pos logit contributions
 Demo+1.042
 Chow+0.994
 Zoro+0.940
 Rapt+0.939
 Desire+0.913
Neg logit contributions
-1.097
ICAN-1.075
enders-1.059
inson-1.041
GC-1.037
Token 16
Token ID1467
Feature activation-3.60e-02

Pos logit contributions
 Demo+0.950
 Chow+0.918
 Zoro+0.867
 Rapt+0.861
 Desire+0.841
Neg logit contributions
-0.927
ICAN-0.907
enders-0.895
GC-0.886
inson-0.881
Token10
Token ID940
Feature activation+6.94e-02

Pos logit contributions
+0.121
ICAN+0.117
enders+0.115
GC+0.114
inson+0.114
Neg logit contributions
 Demo-0.123
 Chow-0.119
 Zoro-0.113
 Rapt-0.112
 Desire-0.109
Token Posts
Token ID12043
Feature activation-9.05e-01

Pos logit contributions
 Demo+0.234
 Chow+0.225
 Zoro+0.215
 Rapt+0.213
 Desire+0.206
Neg logit contributions
-0.233
ICAN-0.228
enders-0.224
GC-0.223
inson-0.221
Token #
Token ID1303
Feature activation-9.22e-01

Pos logit contributions
+1.320
ICAN+1.256
enders+1.204
inson+1.165
GC+1.135
Neg logit contributions
 Demo-4.837
 Chow-4.709
 Rapt-4.555
 Zoro-4.553
 Desire-4.484
Token14
Token ID1415
Feature activation-2.73e-01

Pos logit contributions
+2.785
enders+2.705
GC+2.580
ICAN+2.558
inson+2.490
Neg logit contributions
 Demo-3.536
 Chow-3.524
 Zoro-3.359
 Rapt-3.275
 Desire-3.205
Token On
Token ID1550
Feature activation+8.86e-02

Pos logit contributions
+0.861
ICAN+0.855
enders+0.851
GC+0.835
inson+0.828
Neg logit contributions
 Demo-0.933
 Chow-0.905
 Zoro-0.860
 Rapt-0.850
 Desire-0.843
Token September
Token ID2693
Feature activation+6.64e-02

Pos logit contributions
 Demo+0.371
 Chow+0.363
 Rapt+0.344
 Zoro+0.344
 Desire+0.336
Neg logit contributions
-0.225
ICAN-0.221
enders-0.215
inson-0.211
GC-0.211
Token 20
Token ID1160
Feature activation+2.87e-03

Pos logit contributions
 Demo+0.238
 Chow+0.228
 Rapt+0.217
 Zoro+0.216
 Desire+0.212
Neg logit contributions
-0.216
ICAN-0.212
enders-0.207
inson-0.204
GC-0.202
Token -
Token ID532
Feature activation+2.71e-02

Pos logit contributions
+0.193
ICAN+0.189
enders+0.187
inson+0.184
GC+0.181
Neg logit contributions
 Demo-0.232
 Chow-0.225
 Zoro-0.214
 Rapt-0.213
 Desire-0.209
Token thank
Token ID5875
Feature activation-2.11e-01

Pos logit contributions
 Demo+0.098
 Chow+0.095
 Zoro+0.090
 Rapt+0.090
 Desire+0.087
Neg logit contributions
-0.085
ICAN-0.082
enders-0.082
inson-0.081
GC-0.080
Token you
Token ID345
Feature activation+1.46e-02

Pos logit contributions
+0.496
ICAN+0.480
enders+0.472
GC+0.463
inson+0.459
Neg logit contributions
 Demo-0.928
 Chow-0.892
 Zoro-0.860
 Rapt-0.853
 Desire-0.845
Token!
Token ID0
Feature activation-1.23e-01

Pos logit contributions
 Demo+0.052
 Chow+0.050
 Zoro+0.047
 Rapt+0.047
 Desire+0.046
Neg logit contributions
-0.046
ICAN-0.045
enders-0.045
GC-0.044
inson-0.044
Token P
Token ID350
Feature activation-3.49e-02

Pos logit contributions
+0.400
ICAN+0.386
enders+0.380
GC+0.377
NEY+0.374
Neg logit contributions
 Demo-0.427
 Chow-0.416
 Zoro-0.395
 Rapt-0.393
 Desire-0.385
Tokensh
Token ID1477
Feature activation-8.55e-01

Pos logit contributions
+0.107
ICAN+0.105
enders+0.104
inson+0.102
GC+0.102
Neg logit contributions
 Demo-0.131
 Chow-0.126
 Rapt-0.120
 Zoro-0.120
 Desire-0.118
Tokenon
Token ID261
Feature activation-8.05e-01

Pos logit contributions
+2.799
ICAN+2.734
enders+2.727
inson+2.678
GC+2.654
Neg logit contributions
 Demo-2.885
 Chow-2.774
 Rapt-2.673
 Zoro-2.670
 Desire-2.612
Token Barrett
Token ID26971
Feature activation+1.69e-01

Pos logit contributions
+2.722
enders+2.659
inson+2.638
ICAN+2.615
GC+2.586
Neg logit contributions
 Demo-2.604
 Chow-2.452
 Rapt-2.393
 Zoro-2.371
 Desire-2.306
Token gave
Token ID2921
Feature activation+1.18e-01

Pos logit contributions
 Demo+0.625
 Chow+0.598
 Zoro+0.574
 Rapt+0.573
 Desire+0.560
Neg logit contributions
-0.520
ICAN-0.507
enders-0.501
inson-0.497
GC-0.490
Token me
Token ID502
Feature activation+2.88e-01

Pos logit contributions
 Demo+0.411
 Chow+0.394
 Zoro+0.375
 Rapt+0.371
 Desire+0.365
Neg logit contributions
-0.385
ICAN-0.375
enders-0.369
inson-0.364
GC-0.363
Token useful
Token ID4465
Feature activation+2.60e-01

Pos logit contributions
 Demo+1.003
 Chow+0.968
 Rapt+0.917
 Zoro+0.915
 Desire+0.895
Neg logit contributions
-0.939
ICAN-0.908
enders-0.898
inson-0.889
GC-0.882