TOP ACTIVATIONS
MAX = 3.927

s sixty - w att transmitter would start be aming images
States and international treaties . Any reproduction , copying , or
the New York Times that any digging into matters beyond Russia
and state for free . Any contribution to help fund our
MC s are due . Any player with a N MC
lunch . " Anything special , sir ," cant
generated by the sale . Anything left over the authority
in both official languages . Neither language takes precedence over the
anonymous source will suffice . Any mistake sparks talk of impeachment
deal -- although the European Commission pointed out that Britain cannot
. Israeli Foreign Ministry , responding to Erdogan
fresh or dry ) and wh iz on high for a
- Review newspaper . " Any body that didn 't get
. Any time a US president visits
with the same result . Neither of these incidents was anti
seem an annoyance , but neither is as bad as the
programs ." " Any waste , fraud , or
" article was satire . Neither me or my supporters want
g utenberg . org ). Any file offered in MO BI
single - payer system . Anything less is immoral . But

MIN ACTIVATIONS
MIN = -4.273

. Thanks to Penn and Lab ov
your inbox ! Thanks for stopping by , happy
June 26 . Thanks , Wii UM aster Race
love these ! Thank you to Lilly for this
A big thank you to the family for
bonus one .. thank you @ mr _ se
ison Rain ess Cross - d ressing : Cross
to reality . Thanks to Adobe AIR for Android
every week . Thank you for signing up .
each week . Thank you for signing up .
results . Many thanks again to all of you
Lar che . Thanks again for listening ! <|endoftext|>
miles ( 1 , 000 square kilometers ), with roughly one
has former United States Senators Trent L ott and John Bre
confident fighter . Thank you as well to Lawrence
code ). Lot Sub division After road
Click here to thank Will on Twitter !
Advertisement # ad # They d usted
8 b ET pic . twitter . com / W 8
will attempt to run a sub - 2 : 00 marathon

INTERVAL 1.877 - 3.927
CONTAINS 0.980%

packet loss and congestion were synonymous due to technology limitations .
s Moss ad intelligence agency , Y oss i
, as Jaguars executive vice president Tom C ough lin praised
search engine there , and Da um Com min ications --
. She and Jen elle might have a messy mother /

INTERVAL -0.173 - 1.877
CONTAINS 60.892%

. I was trapped . It was at this
Does it seem fair to you , Kaine
. So very optimistic for the season
suits against those who call for boycott against any area
different types of block syntax , the curly brace syntax e

INTERVAL -2.223 - -0.173
CONTAINS 37.808%

b ] Personal best : [/ b ] 00 : 47
And it turns out that it is really , really hard
treatment for Rohingya who would otherwise be turned away from clinics
as almond flour , but yields a different flavor to the
university disciplinary hearing . He contends their sex at his apartment

INTERVAL -4.273 - -2.223
CONTAINS 0.321%

escapes every single attempt to once and for all explain its
aka Sn appy , aka snaps ). These two new methods
taking care of records . Recently , Cob ain says ,
Sanders ' boast ful rhetoric could be tied to
, called the Spending C uts and Def icit Reduction Working
Tokens
Token ID82
Feature activation+6.67e-01

Pos logit contributions
pell+0.036
hend+0.035
hers+0.033
onomy+0.030
ride+0.029
Neg logit contributions
-0.036
estones-0.032
mercial-0.031
>>>>-0.030
>>>>>>>>-0.030
Token sixty
Token ID24742
Feature activation-1.45e-01

Pos logit contributions
+1.228
estones+1.059
>>>>+1.024
mercial+1.024
>>>>>>>>+1.010
Neg logit contributions
pell-1.514
hend-1.468
hers-1.374
onomy-1.269
ride-1.264
Token-
Token ID12
Feature activation-3.29e-01

Pos logit contributions
pell+0.271
hend+0.260
hers+0.241
onomy+0.217
ride+0.215
Neg logit contributions
-0.327
estones-0.291
mercial-0.282
>>>>-0.281
>>>>>>>>-0.278
Tokenw
Token ID86
Feature activation+1.49e+00

Pos logit contributions
pell+0.610
hend+0.588
hers+0.544
onomy+0.489
ride+0.484
Neg logit contributions
-0.747
estones-0.667
mercial-0.647
>>>>-0.643
>>>>>>>>-0.635
Tokenatt
Token ID1078
Feature activation-1.26e-02

Pos logit contributions
+3.267
estones+2.984
mercial+2.853
>>>>+2.829
>>>>>>>>+2.771
Neg logit contributions
pell-2.774
hend-2.687
hers-2.452
onomy-2.269
rises-2.213
Token transmitter
Token ID35099
Feature activation+3.93e+00

Pos logit contributions
pell+0.029
hend+0.028
hers+0.026
onomy+0.024
ride+0.024
Neg logit contributions
-0.023
estones-0.020
mercial-0.019
>>>>-0.019
>>>>>>>>-0.019
Token would
Token ID561
Feature activation+9.63e-02

Pos logit contributions
+7.034
>>>>>>>>+5.871
>>>>+5.844
estones+5.787
mercial+5.754
Neg logit contributions
pell-8.480
hend-8.250
hers-7.673
onomy-7.229
rises-7.024
Token start
Token ID923
Feature activation+7.65e-01

Pos logit contributions
+0.184
estones+0.161
mercial+0.156
>>>>+0.155
>>>>>>>>+0.153
Neg logit contributions
pell-0.212
hend-0.206
hers-0.193
onomy-0.178
ride-0.176
Token be
Token ID307
Feature activation+1.46e-01

Pos logit contributions
+1.514
estones+1.322
>>>>+1.300
mercial+1.299
>>>>>>>>+1.278
Neg logit contributions
pell-1.607
hend-1.562
hers-1.450
onomy-1.346
rises-1.321
Tokenaming
Token ID3723
Feature activation-4.26e-01

Pos logit contributions
+0.313
estones+0.278
mercial+0.272
>>>>+0.271
>>>>>>>>+0.268
Neg logit contributions
pell-0.282
hend-0.273
hers-0.250
onomy-0.233
ride-0.230
Token images
Token ID4263
Feature activation+1.20e+00

Pos logit contributions
pell+0.915
hend+0.885
hers+0.830
onomy+0.758
ride+0.753
Neg logit contributions
-0.845
estones-0.741
mercial-0.716
>>>>-0.712
>>>>>>>>-0.703
Token States
Token ID1829
Feature activation-1.07e-01

Pos logit contributions
pell+1.282
hend+1.227
hers+1.129
onomy+1.020
rises+1.019
Neg logit contributions
-1.660
estones-1.487
>>>>-1.444
>>>>>>>>-1.430
mercial-1.423
Token and
Token ID290
Feature activation+2.23e-01

Pos logit contributions
pell+0.226
hend+0.219
hers+0.205
onomy+0.187
rises+0.186
Neg logit contributions
-0.214
estones-0.188
mercial-0.182
>>>>-0.181
>>>>>>>>-0.180
Token international
Token ID3230
Feature activation+1.51e+00

Pos logit contributions
+0.416
estones+0.362
mercial+0.349
>>>>+0.349
>>>>>>>>+0.343
Neg logit contributions
pell-0.505
hend-0.489
hers-0.460
onomy-0.423
rises-0.421
Token treaties
Token ID30734
Feature activation+1.83e+00

Pos logit contributions
+2.762
estones+2.378
>>>>+2.288
mercial+2.256
>>>>>>>>+2.249
Neg logit contributions
pell-3.480
hend-3.351
hers-3.173
rises-2.909
ride-2.895
Token.
Token ID13
Feature activation+4.28e-01

Pos logit contributions
+3.715
estones+3.265
>>>>+3.120
>>>>>>>>+3.109
mercial+3.106
Neg logit contributions
pell-3.746
hend-3.669
hers-3.410
rises-3.126
ride-3.113
Token Any
Token ID4377
Feature activation+3.85e+00

Pos logit contributions
+0.904
estones+0.796
>>>>+0.778
mercial+0.770
>>>>>>>>+0.769
Neg logit contributions
pell-0.865
hend-0.835
hers-0.779
rises-0.701
ride-0.701
Token reproduction
Token ID20728
Feature activation+1.04e+00

Pos logit contributions
+6.450
mercial+5.636
>>>>+5.493
estones+5.440
>>>>>>>>+5.251
Neg logit contributions
pell-8.652
hend-8.573
hers-8.122
ride-7.376
rises-7.331
Token,
Token ID11
Feature activation+3.78e-01

Pos logit contributions
+2.088
estones+1.807
mercial+1.785
>>>>+1.773
>>>>>>>>+1.761
Neg logit contributions
pell-2.196
hend-2.119
hers-1.983
rises-1.804
onomy-1.803
Token copying
Token ID23345
Feature activation+1.37e+00

Pos logit contributions
+0.591
estones+0.492
>>>>+0.482
mercial+0.480
>>>>>>>>+0.469
Neg logit contributions
pell-0.969
hend-0.942
hers-0.899
onomy-0.832
rises-0.829
Token,
Token ID11
Feature activation-1.61e-01

Pos logit contributions
+2.790
estones+2.435
>>>>+2.401
mercial+2.399
>>>>>>>>+2.370
Neg logit contributions
pell-2.848
hend-2.747
hers-2.563
rises-2.339
onomy-2.332
Token or
Token ID393
Feature activation+2.24e-01

Pos logit contributions
pell+0.366
hend+0.354
hers+0.334
onomy+0.307
rises+0.305
Neg logit contributions
-0.299
estones-0.260
mercial-0.252
>>>>-0.251
>>>>>>>>-0.247
Token the
Token ID262
Feature activation-4.77e-02

Pos logit contributions
pell+1.106
hend+1.069
hers+1.001
onomy+0.908
ride+0.903
Neg logit contributions
-1.103
estones-0.973
mercial-0.941
>>>>-0.934
>>>>>>>>-0.923
Token New
Token ID968
Feature activation-1.91e-01

Pos logit contributions
pell+0.104
hend+0.100
hers+0.094
onomy+0.086
rises+0.085
Neg logit contributions
-0.093
estones-0.082
>>>>-0.079
mercial-0.079
>>>>>>>>-0.078
Token York
Token ID1971
Feature activation-3.29e-01

Pos logit contributions
pell+0.374
hend+0.361
hers+0.337
onomy+0.303
ride+0.301
Neg logit contributions
-0.415
estones-0.369
mercial-0.357
>>>>-0.355
>>>>>>>>-0.351
Token Times
Token ID3782
Feature activation+2.39e+00

Pos logit contributions
pell+0.782
hend+0.758
hers+0.714
onomy+0.660
ride+0.655
Neg logit contributions
-0.578
estones-0.499
mercial-0.478
>>>>-0.476
>>>>>>>>-0.469
Token that
Token ID326
Feature activation+2.36e-01

Pos logit contributions
+4.459
>>>>+3.842
estones+3.824
mercial+3.788
>>>>>>>>+3.729
Neg logit contributions
pell-5.164
hend-5.079
hers-4.682
rises-4.364
onomy-4.333
Token any
Token ID597
Feature activation+3.77e+00

Pos logit contributions
+0.459
estones+0.399
mercial+0.387
>>>>+0.387
>>>>>>>>+0.381
Neg logit contributions
pell-0.516
hend-0.498
hers-0.466
onomy-0.428
ride-0.425
Token digging
Token ID18894
Feature activation+4.73e-01

Pos logit contributions
+5.987
estones+5.008
>>>>+4.867
mercial+4.771
>>>>>>>>+4.753
Neg logit contributions
pell-9.094
hend-8.818
hers-8.108
ride-7.739
onomy-7.579
Token into
Token ID656
Feature activation+1.48e+00

Pos logit contributions
+0.972
estones+0.857
mercial+0.832
>>>>+0.827
>>>>>>>>+0.819
Neg logit contributions
pell-0.974
hend-0.943
hers-0.878
onomy-0.802
ride-0.795
Token matters
Token ID6067
Feature activation+5.94e-01

Pos logit contributions
+2.933
estones+2.518
>>>>+2.490
mercial+2.478
>>>>>>>>+2.439
Neg logit contributions
pell-3.099
hend-3.018
hers-2.778
onomy-2.574
rises-2.563
Token beyond
Token ID3675
Feature activation+8.38e-02

Pos logit contributions
+1.140
estones+0.991
mercial+0.962
>>>>+0.957
>>>>>>>>+0.942
Neg logit contributions
pell-1.306
hend-1.267
hers-1.184
onomy-1.093
ride-1.083
Token Russia
Token ID3284
Feature activation+1.50e+00

Pos logit contributions
+0.167
estones+0.146
mercial+0.141
>>>>+0.141
>>>>>>>>+0.139
Neg logit contributions
pell-0.179
hend-0.173
hers-0.162
onomy-0.148
ride-0.147
Token and
Token ID290
Feature activation+4.93e-01

Pos logit contributions
pell+2.600
hend+2.503
hers+2.351
onomy+2.109
rises+2.105
Neg logit contributions
-2.797
estones-2.477
mercial-2.428
>>>>-2.377
>>>>>>>>-2.328
Token state
Token ID1181
Feature activation+8.34e-02

Pos logit contributions
+0.894
estones+0.765
mercial+0.742
>>>>+0.732
>>>>>>>>+0.715
Neg logit contributions
pell-1.126
hend-1.100
hers-1.015
rises-0.943
ride-0.943
Token for
Token ID329
Feature activation+1.38e+00

Pos logit contributions
+0.185
estones+0.165
mercial+0.160
>>>>+0.159
>>>>>>>>+0.157
Neg logit contributions
pell-0.160
hend-0.154
hers-0.143
onomy-0.129
ride-0.128
Token free
Token ID1479
Feature activation-4.36e-01

Pos logit contributions
+2.384
estones+1.986
>>>>+1.925
mercial+1.899
 newsp+1.893
Neg logit contributions
pell-3.263
hend-3.057
hers-2.842
ride-2.715
tro-2.645
Token.
Token ID13
Feature activation+8.74e-03

Pos logit contributions
pell+0.880
hend+0.841
hers+0.784
onomy+0.712
ride+0.706
Neg logit contributions
-0.918
estones-0.815
>>>>-0.796
>>>>>>>>-0.781
mercial-0.781
Token Any
Token ID4377
Feature activation+3.74e+00

Pos logit contributions
+0.018
estones+0.016
>>>>+0.015
mercial+0.015
>>>>>>>>+0.015
Neg logit contributions
pell-0.018
hend-0.018
hers-0.017
onomy-0.015
ride-0.015
Token contribution
Token ID10156
Feature activation+6.53e-01

Pos logit contributions
+4.525
mercial+4.097
>>>>+3.563
 newsp+3.476
>>>>>>>>+3.423
Neg logit contributions
pell-10.184
hend-9.941
hers-9.443
onomy-9.082
gang-8.836
Token to
Token ID284
Feature activation+4.18e-01

Pos logit contributions
+1.336
estones+1.165
>>>>+1.132
mercial+1.110
>>>>>>>>+1.101
Neg logit contributions
pell-1.395
hend-1.334
hers-1.235
onomy-1.130
ride-1.127
Token help
Token ID1037
Feature activation-1.67e+00

Pos logit contributions
+0.702
estones+0.585
mercial+0.573
>>>>+0.564
>>>>>>>>+0.559
Neg logit contributions
pell-1.025
hend-0.999
hers-0.942
ride-0.875
onomy-0.871
Token fund
Token ID1814
Feature activation-3.90e-01

Pos logit contributions
pell+2.602
hend+2.469
hers+2.249
onomy+1.974
rises+1.947
Neg logit contributions
-4.307
estones-3.922
mercial-3.810
>>>>-3.795
>>>>>>>>-3.764
Token our
Token ID674
Feature activation-9.03e-01

Pos logit contributions
pell+0.718
hend+0.686
hers+0.652
rises+0.585
onomy+0.579
Neg logit contributions
-0.886
estones-0.803
mercial-0.771
>>>>-0.759
>>>>>>>>-0.755
TokenMC
Token ID9655
Feature activation+5.81e-01

Pos logit contributions
+1.026
estones+0.909
mercial+0.879
>>>>+0.874
>>>>>>>>+0.865
Neg logit contributions
pell-0.974
hend-0.937
hers-0.876
onomy-0.793
ride-0.788
Tokens
Token ID82
Feature activation+8.76e-01

Pos logit contributions
+1.168
estones+1.031
mercial+0.994
>>>>+0.993
>>>>>>>>+0.979
Neg logit contributions
pell-1.222
hend-1.181
hers-1.100
onomy-1.005
ride-1.005
Token are
Token ID389
Feature activation+2.73e-01

Pos logit contributions
+1.819
estones+1.605
>>>>+1.561
mercial+1.551
>>>>>>>>+1.537
Neg logit contributions
pell-1.754
hend-1.708
hers-1.580
ride-1.447
onomy-1.446
Token due
Token ID2233
Feature activation+1.38e-01

Pos logit contributions
+0.579
estones+0.518
>>>>+0.502
mercial+0.496
>>>>>>>>+0.495
Neg logit contributions
pell-0.538
hend-0.524
hers-0.483
onomy-0.446
ride-0.440
Token.
Token ID13
Feature activation+5.36e-01

Pos logit contributions
+0.281
estones+0.248
>>>>+0.240
mercial+0.237
>>>>>>>>+0.236
Neg logit contributions
pell-0.285
hend-0.277
hers-0.258
onomy-0.236
ride-0.234
Token Any
Token ID4377
Feature activation+3.73e+00

Pos logit contributions
+1.104
estones+0.972
>>>>+0.945
>>>>>>>>+0.931
mercial+0.929
Neg logit contributions
pell-1.111
hend-1.068
hers-0.994
onomy-0.914
ride-0.905
Token player
Token ID2137
Feature activation+1.00e+00

Pos logit contributions
+6.917
estones+5.850
>>>>+5.759
mercial+5.657
>>>>>>>>+5.613
Neg logit contributions
pell-7.894
hend-7.722
hers-7.124
ride-6.653
onomy-6.637
Token with
Token ID351
Feature activation+4.14e-01

Pos logit contributions
+2.042
estones+1.774
>>>>+1.737
mercial+1.713
>>>>>>>>+1.706
Neg logit contributions
pell-2.076
hend-2.007
hers-1.858
onomy-1.724
ride-1.714
Token a
Token ID257
Feature activation+1.67e-01

Pos logit contributions
+0.839
estones+0.736
mercial+0.713
>>>>+0.711
>>>>>>>>+0.702
Neg logit contributions
pell-0.867
hend-0.838
hers-0.782
onomy-0.715
ride-0.711
Token N
Token ID399
Feature activation+8.66e-02

Pos logit contributions
+0.308
estones+0.267
mercial+0.258
>>>>+0.258
>>>>>>>>+0.255
Neg logit contributions
pell-0.379
hend-0.366
hers-0.344
onomy-0.317
ride-0.315
TokenMC
Token ID9655
Feature activation+6.62e-01

Pos logit contributions
+0.156
estones+0.134
mercial+0.130
>>>>+0.130
>>>>>>>>+0.128
Neg logit contributions
pell-0.201
hend-0.195
hers-0.184
onomy-0.169
rises-0.169
Token lunch
Token ID9965
Feature activation+5.24e-01

Pos logit contributions
pell+0.226
hend+0.218
hers+0.203
onomy+0.187
ride+0.185
Neg logit contributions
-0.218
estones-0.191
mercial-0.184
>>>>-0.184
>>>>>>>>-0.182
Token.
Token ID13
Feature activation+1.45e-01

Pos logit contributions
+1.079
estones+0.949
mercial+0.916
>>>>+0.912
>>>>>>>>+0.901
Neg logit contributions
pell-1.079
hend-1.040
hers-0.967
onomy-0.888
ride-0.877
Token
Token ID198
Feature activation+6.46e-02

Pos logit contributions
+0.296
estones+0.260
>>>>+0.250
mercial+0.250
>>>>>>>>+0.247
Neg logit contributions
pell-0.304
hend-0.292
hers-0.272
onomy-0.249
ride-0.247
Token
Token ID198
Feature activation+1.00e-01

Pos logit contributions
+0.133
estones+0.117
>>>>+0.114
>>>>>>>>+0.113
mercial+0.106
Neg logit contributions
pell-0.134
hend-0.128
hers-0.120
onomy-0.110
rises-0.109
Token"
Token ID1
Feature activation+6.67e-02

Pos logit contributions
+0.206
estones+0.179
>>>>+0.176
>>>>>>>>+0.173
mercial+0.171
Neg logit contributions
pell-0.209
hend-0.200
hers-0.187
onomy-0.170
rises-0.168
TokenAnything
Token ID40028
Feature activation+3.72e+00

Pos logit contributions
+0.134
estones+0.116
>>>>+0.113
>>>>>>>>+0.112
mercial+0.110
Neg logit contributions
pell-0.143
hend-0.137
hers-0.129
onomy-0.118
rises-0.116
Token special
Token ID2041
Feature activation+1.25e+00

Pos logit contributions
+6.662
>>>>+5.544
estones+5.529
mercial+5.469
 newsp+5.371
Neg logit contributions
pell-7.899
hend-7.821
hers-7.284
onomy-6.920
ride-6.897
Token,
Token ID11
Feature activation+3.66e-01

Pos logit contributions
+2.496
estones+2.158
mercial+2.102
>>>>+2.101
>>>>>>>>+2.084
Neg logit contributions
pell-2.622
hend-2.552
hers-2.371
onomy-2.192
ride-2.169
Token sir
Token ID15967
Feature activation+6.42e-01

Pos logit contributions
+0.701
estones+0.607
mercial+0.590
>>>>+0.588
>>>>>>>>+0.580
Neg logit contributions
pell-0.807
hend-0.783
hers-0.735
onomy-0.678
ride-0.670
Token,"
Token ID553
Feature activation-2.61e-01

Pos logit contributions
+1.362
estones+1.196
mercial+1.161
>>>>+1.159
>>>>>>>>+1.145
Neg logit contributions
pell-1.279
hend-1.237
hers-1.150
onomy-1.049
ride-1.035
Token cant
Token ID18548
Feature activation-5.05e-01

Pos logit contributions
pell+0.537
hend+0.518
hers+0.484
onomy+0.441
ride+0.437
Neg logit contributions
-0.542
estones-0.476
mercial-0.461
>>>>-0.459
>>>>>>>>-0.453
Token generated
Token ID7560
Feature activation+1.21e+00

Pos logit contributions
pell+0.763
hend+0.737
hers+0.683
onomy+0.621
rises+0.612
Neg logit contributions
-0.824
estones-0.732
mercial-0.710
>>>>-0.706
>>>>>>>>-0.699
Token by
Token ID416
Feature activation-5.81e-01

Pos logit contributions
+2.459
estones+2.167
>>>>+2.096
mercial+2.087
>>>>>>>>+2.068
Neg logit contributions
pell-2.483
hend-2.411
hers-2.242
onomy-2.083
ride-2.026
Token the
Token ID262
Feature activation-2.74e-01

Pos logit contributions
pell+1.236
hend+1.194
hers+1.118
onomy+1.020
ride+1.014
Neg logit contributions
-1.163
estones-1.022
mercial-0.986
>>>>-0.980
>>>>>>>>-0.968
Token sale
Token ID5466
Feature activation+9.07e-03

Pos logit contributions
pell+0.608
hend+0.589
hers+0.553
onomy+0.506
ride+0.502
Neg logit contributions
-0.525
estones-0.460
mercial-0.443
>>>>-0.442
>>>>>>>>-0.435
Token.
Token ID13
Feature activation-1.47e-01

Pos logit contributions
+0.019
estones+0.017
mercial+0.016
>>>>+0.016
>>>>>>>>+0.016
Neg logit contributions
pell-0.019
hend-0.018
hers-0.017
onomy-0.015
ride-0.015
Token Anything
Token ID21035
Feature activation+3.69e+00

Pos logit contributions
pell+0.324
hend+0.311
hers+0.291
onomy+0.267
rises+0.265
Neg logit contributions
-0.287
estones-0.250
>>>>-0.243
mercial-0.240
>>>>>>>>-0.239
Token left
Token ID1364
Feature activation+7.04e-01

Pos logit contributions
+6.537
estones+5.658
>>>>+5.628
mercial+5.567
>>>>>>>>+5.482
Neg logit contributions
pell-7.821
hend-7.747
hers-7.034
onomy-6.802
rises-6.606
Token over
Token ID625
Feature activation-7.00e-02

Pos logit contributions
+1.408
estones+1.241
>>>>+1.199
>>>>>>>>+1.192
mercial+1.190
Neg logit contributions
pell-1.479
hend-1.439
hers-1.340
onomy-1.234
ride-1.216
Token
Token ID851
Feature activation+1.56e+00

Pos logit contributions
pell+0.151
hend+0.147
hers+0.137
onomy+0.126
ride+0.125
Neg logit contributions
-0.138
estones-0.121
mercial-0.116
>>>>-0.116
>>>>>>>>-0.115
Token the
Token ID262
Feature activation+2.70e-01

Pos logit contributions
+2.982
estones+2.606
mercial+2.550
>>>>+2.533
>>>>>>>>+2.494
Neg logit contributions
pell-3.373
hend-3.278
hers-3.065
onomy-2.849
ride-2.785
Token authority
Token ID4934
Feature activation+1.01e+00

Pos logit contributions
+0.525
estones+0.462
>>>>+0.445
mercial+0.444
>>>>>>>>+0.439
Neg logit contributions
pell-0.588
hend-0.568
hers-0.533
onomy-0.488
ride-0.484
Token in
Token ID287
Feature activation+3.71e-02

Pos logit contributions
+0.457
estones+0.400
mercial+0.386
>>>>+0.385
>>>>>>>>+0.380
Neg logit contributions
pell-0.501
hend-0.486
hers-0.454
onomy-0.415
ride-0.413
Token both
Token ID1111
Feature activation+9.20e-01

Pos logit contributions
+0.072
estones+0.063
mercial+0.061
>>>>+0.061
>>>>>>>>+0.060
Neg logit contributions
pell-0.081
hend-0.079
hers-0.073
onomy-0.067
ride-0.067
Token official
Token ID1743
Feature activation+1.52e+00

Pos logit contributions
+1.612
estones+1.370
>>>>+1.329
mercial+1.327
>>>>>>>>+1.299
Neg logit contributions
pell-2.155
hend-2.126
hers-1.985
onomy-1.829
ride-1.827
Token languages
Token ID8950
Feature activation+1.33e+00

Pos logit contributions
+2.331
estones+1.954
mercial+1.856
>>>>+1.848
>>>>>>>>+1.803
Neg logit contributions
pell-3.916
hend-3.848
hers-3.611
ride-3.362
onomy-3.356
Token.
Token ID13
Feature activation+5.09e-01

Pos logit contributions
+2.692
estones+2.359
>>>>+2.296
mercial+2.266
>>>>>>>>+2.265
Neg logit contributions
pell-2.718
hend-2.670
hers-2.468
onomy-2.254
ride-2.254
Token Neither
Token ID16126
Feature activation+3.66e+00

Pos logit contributions
+1.002
estones+0.872
>>>>+0.855
>>>>>>>>+0.843
mercial+0.834
Neg logit contributions
pell-1.098
hend-1.066
hers-0.987
ride-0.902
onomy-0.902
Token language
Token ID3303
Feature activation+1.47e+00

Pos logit contributions
+6.258
>>>>+5.438
estones+5.351
>>>>>>>>+5.343
 scrut+5.274
Neg logit contributions
hend-8.026
pell-7.945
hers-7.273
rises-6.764
onomy-6.712
Token takes
Token ID2753
Feature activation+4.81e-02

Pos logit contributions
+2.848
estones+2.488
>>>>+2.458
mercial+2.413
>>>>>>>>+2.412
Neg logit contributions
pell-3.091
hend-3.058
hers-2.808
onomy-2.586
rises-2.560
Token precedence
Token ID38177
Feature activation+1.60e+00

Pos logit contributions
+0.093
estones+0.082
>>>>+0.079
mercial+0.078
>>>>>>>>+0.078
Neg logit contributions
pell-0.105
hend-0.102
hers-0.095
onomy-0.087
ride-0.086
Token over
Token ID625
Feature activation-1.53e-01

Pos logit contributions
+3.166
estones+2.771
>>>>+2.720
>>>>>>>>+2.651
mercial+2.646
Neg logit contributions
pell-3.360
hend-3.283
hers-3.063
onomy-2.815
rises-2.771
Token the
Token ID262
Feature activation+8.55e-02

Pos logit contributions
pell+0.337
hend+0.327
hers+0.306
onomy+0.281
ride+0.279
Neg logit contributions
-0.293
estones-0.256
mercial-0.246
>>>>-0.245
>>>>>>>>-0.242
Token anonymous
Token ID11614
Feature activation+1.72e-01

Pos logit contributions
pell+0.903
hend+0.876
hers+0.822
onomy+0.754
ride+0.751
Neg logit contributions
-0.754
estones-0.656
mercial-0.630
>>>>-0.627
>>>>>>>>-0.618
Token source
Token ID2723
Feature activation+5.86e-01

Pos logit contributions
+0.315
estones+0.271
>>>>+0.261
mercial+0.260
>>>>>>>>+0.258
Neg logit contributions
pell-0.397
hend-0.386
hers-0.363
onomy-0.334
ride-0.331
Token will
Token ID481
Feature activation+2.91e-01

Pos logit contributions
+1.194
estones+1.043
>>>>+1.022
mercial+1.014
>>>>>>>>+1.004
Neg logit contributions
pell-1.204
hend-1.181
hers-1.090
onomy-1.002
ride-0.993
Token suffice
Token ID34302
Feature activation-1.11e+00

Pos logit contributions
+0.578
estones+0.507
>>>>+0.491
mercial+0.490
>>>>>>>>+0.483
Neg logit contributions
pell-0.614
hend-0.598
hers-0.558
onomy-0.514
ride-0.507
Token.
Token ID13
Feature activation-1.73e-02

Pos logit contributions
pell+2.110
hend+2.013
hers+1.882
onomy+1.687
ride+1.678
Neg logit contributions
-2.494
estones-2.230
mercial-2.170
>>>>-2.142
>>>>>>>>-2.121
Token Any
Token ID4377
Feature activation+3.65e+00

Pos logit contributions
pell+0.037
hend+0.036
hers+0.033
onomy+0.031
ride+0.031
Neg logit contributions
-0.034
estones-0.030
>>>>-0.029
>>>>>>>>-0.029
mercial-0.028
Token mistake
Token ID7457
Feature activation+5.06e-01

Pos logit contributions
+6.644
estones+5.633
>>>>+5.517
mercial+5.409
>>>>>>>>+5.401
Neg logit contributions
pell-7.891
hend-7.781
hers-7.143
ride-6.653
onomy-6.604
Token sparks
Token ID38306
Feature activation+7.90e-01

Pos logit contributions
+1.028
estones+0.899
>>>>+0.872
mercial+0.866
>>>>>>>>+0.860
Neg logit contributions
pell-1.043
hend-1.025
hers-0.946
onomy-0.872
ride-0.864
Token talk
Token ID1561
Feature activation+3.51e-01

Pos logit contributions
+1.530
estones+1.321
>>>>+1.285
mercial+1.268
>>>>>>>>+1.263
Neg logit contributions
pell-1.718
hend-1.672
hers-1.568
onomy-1.436
ride-1.426
Token of
Token ID286
Feature activation-9.64e-02

Pos logit contributions
+0.718
estones+0.633
mercial+0.614
>>>>+0.608
>>>>>>>>+0.603
Neg logit contributions
pell-0.723
hend-0.705
hers-0.654
onomy-0.595
ride-0.590
Token impeachment
Token ID30258
Feature activation-3.54e-01

Pos logit contributions
pell+0.226
hend+0.219
hers+0.206
onomy+0.190
ride+0.189
Neg logit contributions
-0.172
estones-0.148
mercial-0.142
>>>>-0.142
>>>>>>>>-0.140
Token deal
Token ID1730
Feature activation+2.31e+00

Pos logit contributions
+0.431
estones+0.354
>>>>+0.342
mercial+0.338
>>>>>>>>+0.336
Neg logit contributions
pell-0.789
hend-0.769
hers-0.727
rises-0.677
ride-0.676
Token --
Token ID1377
Feature activation+1.02e+00

Pos logit contributions
+4.352
estones+3.773
>>>>>>>>+3.628
>>>>+3.600
mercial+3.589
Neg logit contributions
pell-4.893
hend-4.803
hers-4.434
onomy-4.137
ride-4.115
Token although
Token ID3584
Feature activation+2.67e-01

Pos logit contributions
+1.943
estones+1.693
mercial+1.631
>>>>+1.623
>>>>>>>>+1.601
Neg logit contributions
pell-2.244
hend-2.181
hers-2.025
onomy-1.877
rises-1.862
Token the
Token ID262
Feature activation+6.40e-01

Pos logit contributions
+0.510
estones+0.445
mercial+0.429
>>>>+0.428
>>>>>>>>+0.423
Neg logit contributions
pell-0.590
hend-0.571
hers-0.533
onomy-0.492
ride-0.488
Token European
Token ID3427
Feature activation+1.68e+00

Pos logit contributions
+1.180
estones+1.026
>>>>+0.989
mercial+0.979
>>>>>>>>+0.977
Neg logit contributions
pell-1.457
hend-1.420
hers-1.323
ride-1.217
rises-1.215
Token Commission
Token ID4513
Feature activation+3.64e+00

Pos logit contributions
+2.972
estones+2.565
>>>>+2.460
>>>>>>>>+2.427
mercial+2.410
Neg logit contributions
pell-3.904
hend-3.832
hers-3.563
ride-3.280
onomy-3.238
Token pointed
Token ID6235
Feature activation-1.96e-01

Pos logit contributions
+6.642
estones+5.806
>>>>+5.600
>>>>>>>>+5.543
mercial+5.517
Neg logit contributions
pell-7.602
hend-7.562
hers-6.912
rises-6.453
ride-6.401
Token out
Token ID503
Feature activation-5.63e-01

Pos logit contributions
pell+0.346
hend+0.332
hers+0.306
onomy+0.273
ride+0.271
Neg logit contributions
-0.464
estones-0.416
mercial-0.404
>>>>-0.403
>>>>>>>>-0.399
Token that
Token ID326
Feature activation+5.80e-02

Pos logit contributions
pell+1.197
hend+1.154
hers+1.087
onomy+0.989
ride+0.980
Neg logit contributions
-1.131
estones-0.995
mercial-0.962
>>>>-0.950
>>>>>>>>-0.939
Token Britain
Token ID5491
Feature activation+1.23e+00

Pos logit contributions
+0.112
estones+0.098
mercial+0.095
>>>>+0.095
>>>>>>>>+0.093
Neg logit contributions
pell-0.127
hend-0.123
hers-0.115
onomy-0.106
ride-0.105
Token cannot
Token ID2314
Feature activation+4.54e-01

Pos logit contributions
+2.347
estones+2.076
>>>>+1.981
mercial+1.959
>>>>>>>>+1.945
Neg logit contributions
pell-2.667
hend-2.608
hers-2.392
onomy-2.239
ride-2.211
Token.
Token ID13
Feature activation+1.48e-01

Pos logit contributions
pell+0.341
hend+0.328
hers+0.304
onomy+0.275
ride+0.272
Neg logit contributions
-0.403
estones-0.359
mercial-0.348
>>>>-0.348
>>>>>>>>-0.344
Token
Token ID447
Feature activation-3.93e-01

Pos logit contributions
+0.317
estones+0.280
>>>>+0.272
mercial+0.270
>>>>>>>>+0.268
Neg logit contributions
pell-0.297
hend-0.286
hers-0.265
onomy-0.241
ride-0.240
Token
Token ID251
Feature activation-6.12e-01

Pos logit contributions
pell+0.968
hend+0.940
hers+0.890
onomy+0.824
ride+0.818
Neg logit contributions
-0.657
estones-0.561
mercial-0.538
>>>>-0.532
>>>>>>>>-0.524
Token Israeli
Token ID6085
Feature activation-6.15e-02

Pos logit contributions
pell+1.243
hend+1.200
hers+1.119
onomy+1.015
ride+1.009
Neg logit contributions
-1.287
estones-1.138
mercial-1.100
>>>>-1.096
>>>>>>>>-1.083
Token Foreign
Token ID8708
Feature activation+3.42e+00

Pos logit contributions
pell+0.130
hend+0.126
hers+0.118
onomy+0.107
ride+0.106
Neg logit contributions
-0.124
estones-0.109
mercial-0.105
>>>>-0.105
>>>>>>>>-0.104
Token Ministry
Token ID9475
Feature activation+3.64e+00

Pos logit contributions
+3.930
estones+3.230
>>>>+3.035
>>>>>>>>+2.990
mercial+2.975
Neg logit contributions
pell-9.661
hend-9.517
hers-9.093
ride-8.417
rises-8.404
Token,
Token ID11
Feature activation+2.41e-01

Pos logit contributions
+5.168
estones+4.218
>>>>+4.135
>>>>>>>>+4.021
mercial+3.964
Neg logit contributions
pell-9.249
hend-9.232
hers-8.645
ride-7.998
rises-7.969
Token responding
Token ID14409
Feature activation+2.17e-02

Pos logit contributions
+0.482
estones+0.421
mercial+0.407
>>>>+0.407
>>>>>>>>+0.402
Neg logit contributions
pell-0.513
hend-0.498
hers-0.465
onomy-0.424
ride-0.422
Token to
Token ID284
Feature activation+1.74e-01

Pos logit contributions
+0.048
estones+0.043
mercial+0.042
>>>>+0.041
>>>>>>>>+0.041
Neg logit contributions
pell-0.041
hend-0.040
hers-0.037
onomy-0.033
ride-0.033
Token Erdogan
Token ID20065
Feature activation+3.68e-01

Pos logit contributions
+0.341
estones+0.298
>>>>+0.288
mercial+0.288
>>>>>>>>+0.285
Neg logit contributions
pell-0.376
hend-0.364
hers-0.341
onomy-0.312
ride-0.310
Token
Token ID447
Feature activation+1.67e-01

Pos logit contributions
+0.818
estones+0.725
>>>>+0.705
mercial+0.705
>>>>>>>>+0.696
Neg logit contributions
pell-0.699
hend-0.677
hers-0.626
onomy-0.563
ride-0.562
Tokenfresh
Token ID48797
Feature activation+3.68e-01

Pos logit contributions
+0.063
estones+0.056
>>>>+0.055
mercial+0.055
>>>>>>>>+0.054
Neg logit contributions
pell-0.049
hend-0.047
hers-0.043
onomy-0.039
ride-0.039
Token or
Token ID393
Feature activation+7.81e-01

Pos logit contributions
+0.735
estones+0.643
mercial+0.619
>>>>+0.618
>>>>>>>>+0.610
Neg logit contributions
pell-0.782
hend-0.758
hers-0.705
onomy-0.651
ride-0.646
Token dry
Token ID5894
Feature activation+5.56e-01

Pos logit contributions
+1.643
estones+1.463
>>>>+1.429
mercial+1.423
>>>>>>>>+1.402
Neg logit contributions
pell-1.540
hend-1.487
hers-1.369
onomy-1.291
ride-1.262
Token)
Token ID8
Feature activation+3.67e-02

Pos logit contributions
+1.236
estones+1.097
mercial+1.065
>>>>+1.061
>>>>>>>>+1.051
Neg logit contributions
pell-1.050
hend-1.017
hers-0.942
onomy-0.854
ride-0.846
Token and
Token ID290
Feature activation-8.11e-02

Pos logit contributions
+0.072
estones+0.064
mercial+0.062
>>>>+0.061
>>>>>>>>+0.061
Neg logit contributions
pell-0.078
hend-0.076
hers-0.071
onomy-0.065
ride-0.065
Token wh
Token ID348
Feature activation+3.63e+00

Pos logit contributions
pell+0.171
hend+0.165
hers+0.154
onomy+0.141
ride+0.140
Neg logit contributions
-0.164
estones-0.144
mercial-0.140
>>>>-0.139
>>>>>>>>-0.137
Tokeniz
Token ID528
Feature activation+6.27e-01

Pos logit contributions
+6.769
estones+6.241
mercial+5.984
>>>>+5.927
>>>>>>>>+5.828
Neg logit contributions
pell-7.309
hend-7.147
hers-6.622
onomy-6.017
ride-5.972
Token on
Token ID319
Feature activation+5.02e-01

Pos logit contributions
+1.304
estones+1.175
mercial+1.127
>>>>+1.122
>>>>>>>>+1.100
Neg logit contributions
pell-1.247
hend-1.218
hers-1.128
onomy-1.030
ride-1.014
Token high
Token ID1029
Feature activation-7.33e-01

Pos logit contributions
+1.011
estones+0.898
mercial+0.867
>>>>+0.861
>>>>>>>>+0.851
Neg logit contributions
pell-1.053
hend-1.021
hers-0.949
onomy-0.869
ride-0.862
Token for
Token ID329
Feature activation+1.51e+00

Pos logit contributions
pell+1.508
hend+1.456
hers+1.360
onomy+1.236
ride+1.229
Neg logit contributions
-1.518
estones-1.339
mercial-1.298
>>>>-1.287
>>>>>>>>-1.270
Token a
Token ID257
Feature activation+2.90e-01

Pos logit contributions
+2.950
estones+2.607
>>>>+2.546
mercial+2.495
>>>>>>>>+2.493
Neg logit contributions
pell-3.170
hend-3.103
hers-2.874
onomy-2.666
ride-2.633
Token-
Token ID12
Feature activation-5.62e-01

Pos logit contributions
+1.888
estones+1.686
>>>>+1.652
mercial+1.648
>>>>>>>>+1.628
Neg logit contributions
pell-1.186
hend-1.140
hers-1.044
onomy-0.913
ride-0.911
TokenReview
Token ID14832
Feature activation-6.46e-01

Pos logit contributions
pell+0.996
hend+0.957
hers+0.883
onomy+0.790
ride+0.782
Neg logit contributions
-1.325
estones-1.189
mercial-1.156
>>>>-1.151
>>>>>>>>-1.139
Token newspaper
Token ID7533
Feature activation+1.25e+00

Pos logit contributions
pell+1.300
hend+1.254
hers+1.170
onomy+1.061
ride+1.053
Neg logit contributions
-1.368
estones-1.211
mercial-1.172
>>>>-1.166
>>>>>>>>-1.152
Token.
Token ID13
Feature activation+8.58e-02

Pos logit contributions
+2.513
estones+2.210
>>>>+2.126
mercial+2.109
>>>>>>>>+2.088
Neg logit contributions
pell-2.583
hend-2.523
hers-2.325
onomy-2.150
rises-2.130
Token "
Token ID366
Feature activation+3.69e-01

Pos logit contributions
+0.186
estones+0.165
mercial+0.160
>>>>+0.159
>>>>>>>>+0.157
Neg logit contributions
pell-0.168
hend-0.162
hers-0.151
onomy-0.136
ride-0.136
TokenAny
Token ID7149
Feature activation+3.61e+00

Pos logit contributions
+0.733
estones+0.631
>>>>+0.622
>>>>>>>>+0.612
mercial+0.589
Neg logit contributions
pell-0.797
hend-0.766
hers-0.717
onomy-0.655
rises-0.650
Tokenbody
Token ID2618
Feature activation+1.78e+00

Pos logit contributions
+6.305
estones+5.380
mercial+5.262
>>>>+5.132
>>>>>>>>+5.052
Neg logit contributions
pell-7.939
hend-7.799
hers-7.256
onomy-6.752
ride-6.595
Token that
Token ID326
Feature activation+8.42e-03

Pos logit contributions
+3.431
estones+3.001
>>>>+2.954
mercial+2.890
>>>>>>>>+2.877
Neg logit contributions
pell-3.747
hend-3.680
hers-3.380
onomy-3.193
ride-3.141
Token didn
Token ID1422
Feature activation+1.22e+00

Pos logit contributions
+0.016
estones+0.014
>>>>+0.014
mercial+0.014
>>>>>>>>+0.013
Neg logit contributions
pell-0.019
hend-0.018
hers-0.017
onomy-0.016
ride-0.015
Token't
Token ID470
Feature activation-5.28e-01

Pos logit contributions
+2.405
estones+2.100
>>>>+2.077
>>>>>>>>+2.031
mercial+1.850
Neg logit contributions
pell-2.596
hend-2.508
hers-2.282
onomy-2.151
rises-2.138
Token get
Token ID651
Feature activation-3.93e-01

Pos logit contributions
pell+1.157
hend+1.128
hers+1.056
onomy+0.971
ride+0.954
Neg logit contributions
-1.013
estones-0.892
mercial-0.856
>>>>-0.854
>>>>>>>>-0.840
Token.
Token ID13
Feature activation-3.40e-01

Pos logit contributions
pell+1.271
hend+1.225
hers+1.137
onomy+1.026
ride+1.018
Neg logit contributions
-1.468
estones-1.306
mercial-1.267
>>>>-1.261
>>>>>>>>-1.247
Token
Token ID447
Feature activation-5.37e-01

Pos logit contributions
pell+0.705
hend+0.679
hers+0.633
onomy+0.576
ride+0.573
Neg logit contributions
-0.705
estones-0.619
>>>>-0.599
mercial-0.598
>>>>>>>>-0.592
Token
Token ID251
Feature activation-4.72e-01

Pos logit contributions
pell+1.323
hend+1.290
hers+1.223
onomy+1.134
ride+1.122
Neg logit contributions
-0.889
estones-0.759
mercial-0.727
>>>>-0.710
>>>>>>>>-0.703
Token
Token ID447
Feature activation-6.94e-01

Pos logit contributions
pell+0.968
hend+0.931
hers+0.868
onomy+0.789
ride+0.785
Neg logit contributions
-0.988
estones-0.869
>>>>-0.842
mercial-0.838
>>>>>>>>-0.831
Token
Token ID250
Feature activation+3.66e-01

Pos logit contributions
pell+1.721
hend+1.676
hers+1.588
onomy+1.472
ride+1.459
Neg logit contributions
-1.138
estones-0.970
mercial-0.931
>>>>-0.909
>>>>>>>>-0.900
TokenAny
Token ID7149
Feature activation+3.60e+00

Pos logit contributions
+0.762
estones+0.665
>>>>+0.652
>>>>>>>>+0.643
mercial+0.638
Neg logit contributions
pell-0.758
hend-0.725
hers-0.675
onomy-0.616
ride-0.613
Tokentime
Token ID2435
Feature activation-2.52e-01

Pos logit contributions
+6.616
estones+5.591
>>>>+5.564
mercial+5.444
>>>>>>>>+5.438
Neg logit contributions
pell-7.790
hend-7.518
hers-6.952
ride-6.392
rises-6.359
Token a
Token ID257
Feature activation+7.98e-03

Pos logit contributions
pell+0.521
hend+0.503
hers+0.470
onomy+0.427
ride+0.424
Neg logit contributions
-0.521
estones-0.460
mercial-0.444
>>>>-0.442
>>>>>>>>-0.436
Token US
Token ID1294
Feature activation+2.62e-01

Pos logit contributions
+0.015
estones+0.013
>>>>+0.013
mercial+0.012
>>>>>>>>+0.012
Neg logit contributions
pell-0.018
hend-0.017
hers-0.016
onomy-0.015
ride-0.015
Token president
Token ID1893
Feature activation+2.01e+00

Pos logit contributions
+0.472
estones+0.405
mercial+0.391
>>>>+0.389
>>>>>>>>+0.384
Neg logit contributions
pell-0.611
hend-0.593
hers-0.557
onomy-0.515
rises-0.510
Token visits
Token ID11864
Feature activation+1.99e+00

Pos logit contributions
+3.679
estones+3.216
>>>>+3.097
mercial+3.063
>>>>>>>>+3.007
Neg logit contributions
pell-4.453
hend-4.369
hers-4.030
onomy-3.832
rises-3.710
Token with
Token ID351
Feature activation+4.26e-01

Pos logit contributions
+0.198
estones+0.172
mercial+0.166
>>>>+0.166
>>>>>>>>+0.164
Neg logit contributions
pell-0.220
hend-0.212
hers-0.199
onomy-0.183
ride-0.182
Token the
Token ID262
Feature activation+4.14e-02

Pos logit contributions
+0.840
estones+0.732
mercial+0.711
>>>>+0.709
>>>>>>>>+0.701
Neg logit contributions
pell-0.910
hend-0.881
hers-0.827
onomy-0.759
ride-0.750
Token same
Token ID976
Feature activation+1.54e+00

Pos logit contributions
+0.075
estones+0.065
mercial+0.063
>>>>+0.062
>>>>>>>>+0.062
Neg logit contributions
pell-0.096
hend-0.093
hers-0.087
onomy-0.080
ride-0.080
Token result
Token ID1255
Feature activation+2.55e-01

Pos logit contributions
+2.884
estones+2.500
>>>>+2.423
>>>>>>>>+2.402
mercial+2.360
Neg logit contributions
pell-3.411
hend-3.352
hers-3.102
onomy-2.899
ride-2.828
Token.
Token ID13
Feature activation-2.36e-01

Pos logit contributions
+0.542
estones+0.479
>>>>+0.461
mercial+0.458
>>>>>>>>+0.458
Neg logit contributions
pell-0.507
hend-0.491
hers-0.455
onomy-0.416
ride-0.412
Token Neither
Token ID16126
Feature activation+3.59e+00

Pos logit contributions
pell+0.506
hend+0.486
hers+0.456
onomy+0.416
ride+0.413
Neg logit contributions
-0.472
estones-0.412
>>>>-0.398
mercial-0.395
>>>>>>>>-0.393
Token of
Token ID286
Feature activation-2.09e-01

Pos logit contributions
+6.696
>>>>+5.668
estones+5.571
>>>>>>>>+5.523
mercial+5.375
Neg logit contributions
pell-7.636
hend-7.397
hers-6.959
onomy-6.462
meg-6.352
Token these
Token ID777
Feature activation+4.30e-01

Pos logit contributions
pell+0.456
hend+0.441
hers+0.414
onomy+0.379
ride+0.376
Neg logit contributions
-0.408
estones-0.356
mercial-0.343
>>>>-0.343
>>>>>>>>-0.338
Token incidents
Token ID10207
Feature activation-2.62e-02

Pos logit contributions
+0.774
estones+0.664
>>>>+0.640
mercial+0.632
>>>>>>>>+0.631
Neg logit contributions
pell-1.001
hend-0.974
hers-0.912
onomy-0.851
ride-0.840
Token was
Token ID373
Feature activation+5.24e-02

Pos logit contributions
pell+0.054
hend+0.052
hers+0.048
onomy+0.044
ride+0.044
Neg logit contributions
-0.054
estones-0.048
mercial-0.046
>>>>-0.046
>>>>>>>>-0.045
Token anti
Token ID3098
Feature activation-6.49e-02

Pos logit contributions
+0.109
estones+0.096
mercial+0.093
>>>>+0.092
>>>>>>>>+0.091
Neg logit contributions
pell-0.107
hend-0.104
hers-0.096
onomy-0.089
ride-0.088
Token seem
Token ID1283
Feature activation+1.18e+00

Pos logit contributions
+2.720
estones+2.378
mercial+2.287
>>>>+2.282
>>>>>>>>+2.245
Neg logit contributions
pell-3.286
hend-3.226
hers-3.035
onomy-2.816
ride-2.772
Token an
Token ID281
Feature activation-3.80e-01

Pos logit contributions
+2.175
estones+1.881
>>>>+1.802
mercial+1.797
>>>>>>>>+1.782
Neg logit contributions
pell-2.656
hend-2.590
hers-2.390
onomy-2.253
rises-2.214
Token annoyance
Token ID38650
Feature activation-7.14e-01

Pos logit contributions
pell+0.944
hend+0.915
hers+0.864
onomy+0.799
ride+0.797
Neg logit contributions
-0.628
estones-0.535
mercial-0.510
>>>>-0.507
>>>>>>>>-0.498
Token,
Token ID11
Feature activation-5.64e-01

Pos logit contributions
pell+1.405
hend+1.355
hers+1.261
onomy+1.142
ride+1.132
Neg logit contributions
-1.544
estones-1.371
mercial-1.327
>>>>-1.320
>>>>>>>>-1.305
Token but
Token ID475
Feature activation-4.44e-01

Pos logit contributions
pell+1.274
hend+1.234
hers+1.161
onomy+1.068
ride+1.059
Neg logit contributions
-1.054
estones-0.917
mercial-0.882
>>>>-0.877
>>>>>>>>-0.866
Token neither
Token ID6159
Feature activation+3.59e+00

Pos logit contributions
pell+0.946
hend+0.914
hers+0.855
onomy+0.786
ride+0.777
Neg logit contributions
-0.887
estones-0.780
mercial-0.752
>>>>-0.751
>>>>>>>>-0.741
Token is
Token ID318
Feature activation+3.02e-01

Pos logit contributions
+6.317
estones+5.544
>>>>+5.525
>>>>>>>>+5.363
 scrut+5.212
Neg logit contributions
pell-7.627
hend-7.447
hers-6.941
onomy-6.564
rises-6.415
Token as
Token ID355
Feature activation-6.89e-01

Pos logit contributions
+0.594
estones+0.519
>>>>+0.500
mercial+0.495
>>>>>>>>+0.492
Neg logit contributions
pell-0.648
hend-0.627
hers-0.585
onomy-0.544
rises-0.536
Token bad
Token ID2089
Feature activation+2.92e-01

Pos logit contributions
pell+1.522
hend+1.474
hers+1.383
onomy+1.268
ride+1.259
Neg logit contributions
-1.323
estones-1.154
mercial-1.113
>>>>-1.107
>>>>>>>>-1.091
Token as
Token ID355
Feature activation-9.06e-01

Pos logit contributions
+0.562
estones+0.489
>>>>+0.471
mercial+0.470
>>>>>>>>+0.464
Neg logit contributions
pell-0.642
hend-0.623
hers-0.582
onomy-0.536
rises-0.530
Token the
Token ID262
Feature activation+2.74e-01

Pos logit contributions
pell+1.860
hend+1.796
hers+1.678
onomy+1.526
ride+1.514
Neg logit contributions
-1.883
estones-1.663
mercial-1.608
>>>>-1.599
>>>>>>>>-1.580
Token programs
Token ID4056
Feature activation+4.22e-01

Pos logit contributions
+0.673
estones+0.583
mercial+0.562
>>>>+0.558
>>>>>>>>+0.550
Neg logit contributions
pell-0.847
hend-0.822
hers-0.774
onomy-0.709
rises-0.707
Token."
Token ID526
Feature activation-3.95e-01

Pos logit contributions
+0.860
estones+0.759
mercial+0.738
>>>>+0.725
>>>>>>>>+0.720
Neg logit contributions
pell-0.867
hend-0.847
hers-0.787
onomy-0.725
rises-0.719
Token
Token ID198
Feature activation-2.18e-01

Pos logit contributions
pell+0.810
hend+0.781
hers+0.726
onomy+0.659
ride+0.656
Neg logit contributions
-0.824
estones-0.729
mercial-0.704
>>>>-0.702
>>>>>>>>-0.693
Token
Token ID198
Feature activation-1.22e-01

Pos logit contributions
pell+0.445
hend+0.429
hers+0.398
onomy+0.365
rises+0.362
Neg logit contributions
-0.456
estones-0.403
>>>>-0.392
>>>>>>>>-0.386
mercial-0.373
Token"
Token ID1
Feature activation-1.62e-02

Pos logit contributions
pell+0.260
hend+0.251
hers+0.234
onomy+0.212
rises+0.212
Neg logit contributions
-0.247
estones-0.214
>>>>-0.211
>>>>>>>>-0.208
mercial-0.204
TokenAny
Token ID7149
Feature activation+3.58e+00

Pos logit contributions
pell+0.036
hend+0.035
hers+0.033
onomy+0.030
rises+0.030
Neg logit contributions
-0.031
estones-0.027
>>>>-0.026
>>>>>>>>-0.026
mercial-0.025
Token waste
Token ID7030
Feature activation-4.82e-01

Pos logit contributions
+6.456
mercial+5.528
estones+5.398
>>>>+5.311
>>>>>>>>+5.193
Neg logit contributions
pell-7.678
hend-7.548
hers-6.954
rises-6.497
ride-6.359
Token,
Token ID11
Feature activation+1.25e-01

Pos logit contributions
pell+1.000
hend+0.967
hers+0.901
onomy+0.820
rises+0.817
Neg logit contributions
-0.992
estones-0.873
mercial-0.846
>>>>-0.837
>>>>>>>>-0.829
Token fraud
Token ID7394
Feature activation-2.86e-01

Pos logit contributions
+0.282
estones+0.250
mercial+0.244
>>>>+0.243
>>>>>>>>+0.240
Neg logit contributions
pell-0.236
hend-0.228
hers-0.212
onomy-0.191
ride-0.188
Token,
Token ID11
Feature activation-2.17e-01

Pos logit contributions
pell+0.577
hend+0.556
hers+0.519
onomy+0.471
ride+0.468
Neg logit contributions
-0.604
estones-0.535
mercial-0.518
>>>>-0.515
>>>>>>>>-0.509
Token or
Token ID393
Feature activation+3.00e-01

Pos logit contributions
pell+0.466
hend+0.451
hers+0.425
onomy+0.387
rises+0.383
Neg logit contributions
-0.430
estones-0.375
mercial-0.364
>>>>-0.363
>>>>>>>>-0.359
Token"
Token ID1
Feature activation+3.72e-01

Pos logit contributions
pell+6.635
hend+6.289
hers+5.893
onomy+5.357
ride+5.251
Neg logit contributions
-7.508
estones-6.714
mercial-6.499
>>>>-6.392
>>>>>>>>-6.265
Token article
Token ID2708
Feature activation+5.68e-01

Pos logit contributions
+0.742
estones+0.653
mercial+0.635
>>>>+0.633
>>>>>>>>+0.626
Neg logit contributions
pell-0.783
hend-0.765
hers-0.709
onomy-0.646
rises-0.645
Token was
Token ID373
Feature activation-8.68e-02

Pos logit contributions
+1.119
estones+0.982
>>>>+0.962
mercial+0.958
>>>>>>>>+0.948
Neg logit contributions
pell-1.195
hend-1.169
hers-1.079
rises-0.990
onomy-0.990
Token satire
Token ID35704
Feature activation-2.42e-01

Pos logit contributions
pell+0.180
hend+0.175
hers+0.162
onomy+0.150
ride+0.149
Neg logit contributions
-0.177
estones-0.155
>>>>-0.151
mercial-0.150
>>>>>>>>-0.148
Token.
Token ID13
Feature activation+1.66e-01

Pos logit contributions
pell+0.489
hend+0.475
hers+0.442
onomy+0.401
rises+0.400
Neg logit contributions
-0.509
estones-0.447
mercial-0.437
>>>>-0.436
>>>>>>>>-0.430
Token Neither
Token ID16126
Feature activation+3.56e+00

Pos logit contributions
+0.327
estones+0.286
>>>>+0.279
mercial+0.276
>>>>>>>>+0.274
Neg logit contributions
pell-0.357
hend-0.345
hers-0.322
onomy-0.295
rises-0.294
Token me
Token ID502
Feature activation-6.38e-01

Pos logit contributions
+6.576
>>>>+5.765
estones+5.677
>>>>>>>>+5.575
mercial+5.550
Neg logit contributions
pell-7.257
hend-7.206
hers-6.481
rises-6.181
onomy-6.034
Token or
Token ID393
Feature activation+4.87e-01

Pos logit contributions
pell+1.342
hend+1.298
hers+1.213
onomy+1.108
ride+1.101
Neg logit contributions
-1.292
estones-1.136
mercial-1.098
>>>>-1.094
>>>>>>>>-1.080
Token my
Token ID616
Feature activation-2.18e-01

Pos logit contributions
+0.984
estones+0.859
>>>>+0.839
mercial+0.833
>>>>>>>>+0.823
Neg logit contributions
pell-1.014
hend-0.987
hers-0.912
onomy-0.846
rises-0.839
Token supporters
Token ID5941
Feature activation-6.04e-01

Pos logit contributions
pell+0.464
hend+0.450
hers+0.420
onomy+0.385
ride+0.382
Neg logit contributions
-0.433
estones-0.380
mercial-0.366
>>>>-0.365
>>>>>>>>-0.360
Token want
Token ID765
Feature activation-7.96e-01

Pos logit contributions
pell+1.292
hend+1.251
hers+1.171
onomy+1.073
ride+1.064
Neg logit contributions
-1.198
estones-1.051
>>>>-1.014
mercial-1.014
>>>>>>>>-0.998
Tokeng
Token ID70
Feature activation-1.99e-01

Pos logit contributions
pell+0.419
hend+0.403
hers+0.375
onomy+0.339
rises+0.331
Neg logit contributions
-0.556
estones-0.500
mercial-0.488
>>>>-0.482
>>>>>>>>-0.476
Tokenutenberg
Token ID19028
Feature activation-2.16e+00

Pos logit contributions
pell+0.343
hend+0.328
hers+0.303
onomy+0.272
ride+0.269
Neg logit contributions
-0.475
estones-0.431
mercial-0.419
>>>>-0.414
>>>>>>>>-0.409
Token.
Token ID13
Feature activation-1.15e-01

Pos logit contributions
pell+3.748
hend+3.633
hers+3.317
onomy+2.982
ride+2.959
Neg logit contributions
-5.098
estones-4.516
>>>>-4.425
>>>>>>>>-4.412
mercial-4.343
Tokenorg
Token ID2398
Feature activation-5.23e-01

Pos logit contributions
pell+0.169
hend+0.160
hers+0.145
onomy+0.127
ride+0.126
Neg logit contributions
-0.303
estones-0.279
mercial-0.271
>>>>-0.267
>>>>>>>>-0.265
Token).
Token ID737
Feature activation-8.36e-03

Pos logit contributions
pell+1.035
hend+0.996
hers+0.928
onomy+0.840
ride+0.834
Neg logit contributions
-1.126
estones-0.999
mercial-0.965
>>>>-0.960
>>>>>>>>-0.950
Token Any
Token ID4377
Feature activation+3.53e+00

Pos logit contributions
pell+0.018
hend+0.018
hers+0.017
onomy+0.015
rises+0.015
Neg logit contributions
-0.016
estones-0.014
>>>>-0.014
mercial-0.014
>>>>>>>>-0.013
Token file
Token ID2393
Feature activation+5.53e-01

Pos logit contributions
+6.529
mercial+5.781
estones+5.633
>>>>+5.623
>>>>>>>>+5.490
Neg logit contributions
pell-7.387
hend-7.313
hers-6.794
rises-6.250
ride-6.220
Token offered
Token ID4438
Feature activation-5.81e-01

Pos logit contributions
+1.135
estones+0.999
mercial+0.980
>>>>+0.976
>>>>>>>>+0.962
Neg logit contributions
pell-1.139
hend-1.106
hers-1.029
onomy-0.932
rises-0.930
Token in
Token ID287
Feature activation-2.47e-01

Pos logit contributions
pell+1.188
hend+1.147
hers+1.071
onomy+0.974
ride+0.967
Neg logit contributions
-1.211
estones-1.070
mercial-1.035
>>>>-1.030
>>>>>>>>-1.017
Token MO
Token ID13070
Feature activation-2.90e-01

Pos logit contributions
pell+0.538
hend+0.525
hers+0.492
onomy+0.450
ride+0.446
Neg logit contributions
-0.479
estones-0.419
mercial-0.404
>>>>-0.403
>>>>>>>>-0.396
TokenBI
Token ID3483
Feature activation+2.53e-01

Pos logit contributions
pell+0.550
hend+0.530
hers+0.492
onomy+0.443
ride+0.440
Neg logit contributions
-0.647
estones-0.576
mercial-0.559
>>>>-0.556
>>>>>>>>-0.550
Token single
Token ID2060
Feature activation-8.05e-01

Pos logit contributions
+0.167
estones+0.144
>>>>+0.139
mercial+0.139
>>>>>>>>+0.137
Neg logit contributions
pell-0.208
hend-0.202
hers-0.189
onomy-0.173
ride-0.173
Token-
Token ID12
Feature activation-1.78e-01

Pos logit contributions
pell+1.753
hend+1.699
hers+1.593
onomy+1.458
ride+1.446
Neg logit contributions
-1.570
estones-1.373
mercial-1.327
>>>>-1.319
>>>>>>>>-1.304
Tokenpayer
Token ID34987
Feature activation-3.08e+00

Pos logit contributions
pell+0.128
hend+0.120
hers+0.094
onomy+0.068
rises+0.064
Neg logit contributions
-0.600
estones-0.557
mercial-0.550
>>>>-0.545
>>>>>>>>-0.541
Token system
Token ID1080
Feature activation-7.32e-01

Pos logit contributions
pell+5.877
hend+5.603
hers+5.319
ride+4.845
onomy+4.792
Neg logit contributions
-6.542
estones-6.069
mercial-5.686
>>>>-5.576
>>>>>>>>-5.503
Token.
Token ID13
Feature activation+3.19e-01

Pos logit contributions
pell+1.472
hend+1.423
hers+1.321
onomy+1.202
ride+1.190
Neg logit contributions
-1.550
estones-1.368
mercial-1.327
>>>>-1.321
>>>>>>>>-1.308
Token Anything
Token ID21035
Feature activation+3.52e+00

Pos logit contributions
+0.603
estones+0.518
>>>>+0.509
>>>>>>>>+0.503
mercial+0.490
Neg logit contributions
pell-0.719
hend-0.696
hers-0.643
rises-0.592
onomy-0.591
Token less
Token ID1342
Feature activation+1.28e+00

Pos logit contributions
+5.829
estones+4.894
>>>>+4.842
mercial+4.835
>>>>>>>>+4.670
Neg logit contributions
pell-8.042
hend-8.001
hers-7.284
onomy-6.889
ride-6.832
Token is
Token ID318
Feature activation-8.60e-02

Pos logit contributions
+2.471
estones+2.143
mercial+2.082
>>>>+2.071
>>>>>>>>+2.052
Neg logit contributions
pell-2.780
hend-2.720
hers-2.518
onomy-2.328
ride-2.315
Token immoral
Token ID31756
Feature activation+9.98e-01

Pos logit contributions
pell+0.185
hend+0.180
hers+0.168
onomy+0.154
ride+0.153
Neg logit contributions
-0.170
estones-0.148
mercial-0.143
>>>>-0.143
>>>>>>>>-0.141
Token.
Token ID13
Feature activation+2.72e-01

Pos logit contributions
+2.065
estones+1.799
>>>>+1.758
mercial+1.755
>>>>>>>>+1.745
Neg logit contributions
pell-2.049
hend-1.983
hers-1.847
onomy-1.674
rises-1.661
Token But
Token ID887
Feature activation+1.10e-01

Pos logit contributions
+0.524
estones+0.451
>>>>+0.444
>>>>>>>>+0.439
mercial+0.429
Neg logit contributions
pell-0.602
hend-0.582
hers-0.538
rises-0.494
onomy-0.493
Token.
Token ID13
Feature activation-1.63e-01

Pos logit contributions
pell+1.737
hend+1.672
hers+1.557
onomy+1.405
ride+1.396
Neg logit contributions
-1.943
estones-1.727
mercial-1.674
>>>>-1.663
>>>>>>>>-1.645
Token
Token ID447
Feature activation-4.36e-01

Pos logit contributions
pell+0.320
hend+0.309
hers+0.286
onomy+0.259
ride+0.258
Neg logit contributions
-0.356
estones-0.314
>>>>-0.307
mercial-0.303
>>>>>>>>-0.302
Token
Token ID251
Feature activation-3.73e-01

Pos logit contributions
pell+1.077
hend+1.048
hers+0.995
onomy+0.920
ride+0.911
Neg logit contributions
-0.724
estones-0.616
mercial-0.592
>>>>-0.581
>>>>>>>>-0.574
Token
Token ID198
Feature activation-1.45e-01

Pos logit contributions
pell+0.767
hend+0.743
hers+0.691
onomy+0.628
ride+0.626
Neg logit contributions
-0.778
estones-0.683
>>>>-0.662
mercial-0.659
>>>>>>>>-0.653
Token
Token ID198
Feature activation-1.18e-01

Pos logit contributions
pell+0.296
hend+0.286
hers+0.265
onomy+0.243
rises+0.241
Neg logit contributions
-0.303
estones-0.267
>>>>-0.261
>>>>>>>>-0.256
mercial-0.246
TokenThanks
Token ID9690
Feature activation-4.27e+00

Pos logit contributions
pell+0.245
hend+0.238
hers+0.221
onomy+0.201
rises+0.200
Neg logit contributions
-0.241
estones-0.211
>>>>-0.207
>>>>>>>>-0.204
mercial-0.199
Token to
Token ID284
Feature activation-1.67e-01

Pos logit contributions
pell+7.736
hend+7.361
hers+6.957
onomy+6.201
ride+6.175
Neg logit contributions
-9.293
estones-8.342
mercial-8.258
>>>>-7.946
>>>>>>>>-7.903
Token Penn
Token ID6595
Feature activation-1.96e+00

Pos logit contributions
pell+0.347
hend+0.338
hers+0.315
onomy+0.288
ride+0.286
Neg logit contributions
-0.338
estones-0.298
>>>>-0.288
mercial-0.287
>>>>>>>>-0.284
Token and
Token ID290
Feature activation+1.43e-01

Pos logit contributions
pell+4.058
hend+3.907
hers+3.652
onomy+3.337
ride+3.298
Neg logit contributions
-4.021
estones-3.542
mercial-3.437
>>>>-3.407
>>>>>>>>-3.371
Token Lab
Token ID3498
Feature activation-3.19e-01

Pos logit contributions
+0.291
estones+0.257
>>>>+0.247
mercial+0.247
>>>>>>>>+0.244
Neg logit contributions
pell-0.298
hend-0.289
hers-0.269
onomy-0.246
rises-0.245
Tokenov
Token ID709
Feature activation+1.17e+00

Pos logit contributions
pell+0.677
hend+0.656
hers+0.613
onomy+0.559
ride+0.557
Neg logit contributions
-0.638
estones-0.563
mercial-0.542
>>>>-0.540
>>>>>>>>-0.532
Token your
Token ID534
Feature activation+9.15e-02

Pos logit contributions
+0.789
estones+0.697
mercial+0.675
>>>>+0.672
>>>>>>>>+0.663
Neg logit contributions
pell-0.760
hend-0.734
hers-0.684
onomy-0.622
ride-0.617
Token inbox
Token ID13734
Feature activation+3.47e-02

Pos logit contributions
+0.169
estones+0.146
mercial+0.140
>>>>+0.140
>>>>>>>>+0.138
Neg logit contributions
pell-0.209
hend-0.203
hers-0.191
onomy-0.176
ride-0.175
Token!
Token ID0
Feature activation-6.24e-01

Pos logit contributions
+0.075
estones+0.067
>>>>+0.065
mercial+0.064
>>>>>>>>+0.064
Neg logit contributions
pell-0.068
hend-0.066
hers-0.061
onomy-0.056
ride-0.055
Token
Token ID198
Feature activation-1.72e-01

Pos logit contributions
pell+1.249
hend+1.204
hers+1.121
onomy+1.018
ride+1.010
Neg logit contributions
-1.333
estones-1.180
>>>>-1.141
mercial-1.140
>>>>>>>>-1.127
Token
Token ID198
Feature activation-1.10e-01

Pos logit contributions
pell+0.351
hend+0.338
hers+0.314
onomy+0.288
rises+0.285
Neg logit contributions
-0.360
estones-0.317
>>>>-0.311
>>>>>>>>-0.305
mercial-0.293
TokenThanks
Token ID9690
Feature activation-4.01e+00

Pos logit contributions
pell+0.224
hend+0.216
hers+0.202
onomy+0.184
ride+0.182
Neg logit contributions
-0.230
estones-0.201
>>>>-0.198
>>>>>>>>-0.194
mercial-0.193
Token for
Token ID329
Feature activation+5.18e-01

Pos logit contributions
pell+7.569
hend+7.287
hers+6.893
ride+6.073
onomy+6.033
Neg logit contributions
-8.473
estones-7.577
mercial-7.446
>>>>>>>>-7.101
>>>>-7.018
Token stopping
Token ID12225
Feature activation-1.09e-02

Pos logit contributions
+1.084
estones+0.963
>>>>+0.951
>>>>>>>>+0.926
mercial+0.922
Neg logit contributions
pell-1.032
hend-1.003
hers-0.918
onomy-0.852
rises-0.843
Token by
Token ID416
Feature activation-4.55e-01

Pos logit contributions
pell+0.021
hend+0.021
hers+0.019
onomy+0.017
rises+0.017
Neg logit contributions
-0.023
estones-0.021
>>>>-0.020
mercial-0.020
>>>>>>>>-0.020
Token,
Token ID11
Feature activation-3.14e-01

Pos logit contributions
pell+0.888
hend+0.856
hers+0.797
onomy+0.720
ride+0.714
Neg logit contributions
-0.992
estones-0.882
mercial-0.854
>>>>-0.850
>>>>>>>>-0.840
Token happy
Token ID3772
Feature activation-6.14e-01

Pos logit contributions
pell+0.645
hend+0.623
hers+0.581
onomy+0.530
ride+0.525
Neg logit contributions
-0.653
estones-0.576
mercial-0.557
>>>>-0.556
>>>>>>>>-0.549
Token June
Token ID2795
Feature activation+9.58e-01

Pos logit contributions
+0.587
estones+0.499
>>>>+0.477
mercial+0.475
>>>>>>>>+0.469
Neg logit contributions
pell-0.842
hend-0.815
hers-0.768
onomy-0.713
ride-0.707
Token 26
Token ID2608
Feature activation+8.53e-01

Pos logit contributions
+1.928
estones+1.685
>>>>+1.645
>>>>>>>>+1.623
mercial+1.547
Neg logit contributions
pell-2.015
hend-1.941
hers-1.803
onomy-1.660
rises-1.647
Token.
Token ID13
Feature activation+7.23e-01

Pos logit contributions
+1.774
estones+1.568
>>>>+1.501
mercial+1.497
>>>>>>>>+1.481
Neg logit contributions
pell-1.734
hend-1.672
hers-1.570
onomy-1.429
ride-1.409
Token
Token ID198
Feature activation+4.43e-01

Pos logit contributions
+1.457
estones+1.274
>>>>+1.235
>>>>>>>>+1.210
mercial+1.193
Neg logit contributions
pell-1.560
hend-1.478
hers-1.382
onomy-1.266
rises-1.255
Token
Token ID198
Feature activation+3.96e-01

Pos logit contributions
+0.893
estones+0.786
>>>>+0.770
>>>>>>>>+0.756
mercial+0.707
Neg logit contributions
pell-0.929
hend-0.895
hers-0.829
onomy-0.766
rises-0.760
TokenThanks
Token ID9690
Feature activation-3.96e+00

Pos logit contributions
+0.840
estones+0.734
>>>>+0.721
>>>>>>>>+0.709
mercial+0.694
Neg logit contributions
pell-0.806
hend-0.770
hers-0.722
onomy-0.653
rises-0.647
Token,
Token ID11
Feature activation-4.51e-01

Pos logit contributions
pell+6.504
hend+6.282
hers+5.912
ride+5.248
meg+5.135
Neg logit contributions
-9.080
estones-8.219
mercial-8.083
>>>>-7.861
>>>>>>>>-7.795
Token Wii
Token ID16591
Feature activation+5.80e-02

Pos logit contributions
pell+0.921
hend+0.888
hers+0.829
onomy+0.755
ride+0.747
Neg logit contributions
-0.944
estones-0.833
mercial-0.805
>>>>-0.802
>>>>>>>>-0.793
TokenUM
Token ID5883
Feature activation-3.28e-01

Pos logit contributions
+0.131
estones+0.117
mercial+0.113
>>>>+0.112
>>>>>>>>+0.111
Neg logit contributions
pell-0.109
hend-0.105
hers-0.098
onomy-0.088
ride-0.087
Tokenaster
Token ID1603
Feature activation-3.66e-02

Pos logit contributions
pell+0.628
hend+0.608
hers+0.562
onomy+0.509
rises+0.505
Neg logit contributions
-0.720
estones-0.645
mercial-0.623
>>>>-0.620
>>>>>>>>-0.612
TokenRace
Token ID35157
Feature activation-4.47e-01

Pos logit contributions
pell+0.070
hend+0.067
hers+0.062
onomy+0.056
ride+0.056
Neg logit contributions
-0.081
estones-0.072
mercial-0.070
>>>>-0.069
>>>>>>>>-0.068
Token love
Token ID1842
Feature activation-9.05e-01

Pos logit contributions
pell+1.152
hend+1.117
hers+1.045
onomy+0.960
rises+0.948
Neg logit contributions
-1.018
estones-0.890
mercial-0.863
>>>>-0.863
>>>>>>>>-0.846
Token these
Token ID777
Feature activation-3.73e-01

Pos logit contributions
pell+1.953
hend+1.891
hers+1.770
onomy+1.621
ride+1.607
Neg logit contributions
-1.786
estones-1.565
mercial-1.509
>>>>-1.504
>>>>>>>>-1.484
Token!
Token ID0
Feature activation-8.61e-01

Pos logit contributions
pell+0.777
hend+0.752
hers+0.702
onomy+0.640
ride+0.636
Neg logit contributions
-0.762
estones-0.671
mercial-0.649
>>>>-0.646
>>>>>>>>-0.639
Token
Token ID198
Feature activation-1.80e-01

Pos logit contributions
pell+1.737
hend+1.676
hers+1.563
onomy+1.419
ride+1.408
Neg logit contributions
-1.821
estones-1.610
mercial-1.558
>>>>-1.553
>>>>>>>>-1.534
Token
Token ID198
Feature activation-4.30e-02

Pos logit contributions
pell+0.349
hend+0.336
hers+0.312
onomy+0.282
ride+0.280
Neg logit contributions
-0.396
estones-0.352
mercial-0.340
>>>>-0.340
>>>>>>>>-0.336
TokenThank
Token ID10449
Feature activation-3.90e+00

Pos logit contributions
pell+0.086
hend+0.083
hers+0.077
onomy+0.070
ride+0.069
Neg logit contributions
-0.092
estones-0.081
>>>>-0.079
>>>>>>>>-0.078
mercial-0.078
Token you
Token ID345
Feature activation-5.76e-01

Pos logit contributions
pell+7.076
hend+6.842
hers+6.321
ride+5.722
fulness+5.699
Neg logit contributions
-8.592
estones-7.665
mercial-7.560
>>>>-7.395
>>>>>>>>-7.351
Token to
Token ID284
Feature activation-9.07e-01

Pos logit contributions
pell+1.279
hend+1.238
hers+1.161
onomy+1.066
ride+1.058
Neg logit contributions
-1.102
estones-0.962
mercial-0.926
>>>>-0.922
>>>>>>>>-0.910
Token Lilly
Token ID35134
Feature activation-4.44e-01

Pos logit contributions
pell+1.886
hend+1.821
hers+1.702
onomy+1.551
ride+1.539
Neg logit contributions
-1.864
estones-1.642
mercial-1.586
>>>>-1.579
>>>>>>>>-1.560
Token for
Token ID329
Feature activation+3.29e-01

Pos logit contributions
pell+0.896
hend+0.864
hers+0.806
onomy+0.733
ride+0.727
Neg logit contributions
-0.938
estones-0.827
mercial-0.801
>>>>-0.798
>>>>>>>>-0.789
Token this
Token ID428
Feature activation-4.03e-02

Pos logit contributions
+0.703
estones+0.622
>>>>+0.606
mercial+0.601
>>>>>>>>+0.596
Neg logit contributions
pell-0.647
hend-0.628
hers-0.578
onomy-0.534
rises-0.529
Token
Token ID251
Feature activation-6.94e-01

Pos logit contributions
pell+0.995
hend+0.967
hers+0.916
onomy+0.846
ride+0.838
Neg logit contributions
-0.705
estones-0.602
mercial-0.580
>>>>-0.569
>>>>>>>>-0.562
Token
Token ID198
Feature activation-1.59e-01

Pos logit contributions
pell+1.403
hend+1.354
hers+1.263
onomy+1.146
ride+1.138
Neg logit contributions
-1.466
estones-1.296
mercial-1.253
>>>>-1.249
>>>>>>>>-1.234
Token
Token ID198
Feature activation-1.61e-01

Pos logit contributions
pell+0.324
hend+0.312
hers+0.290
onomy+0.266
rises+0.263
Neg logit contributions
-0.334
estones-0.294
>>>>-0.287
>>>>>>>>-0.283
mercial-0.272
TokenA
Token ID32
Feature activation-2.15e-01

Pos logit contributions
pell+0.325
hend+0.314
hers+0.293
onomy+0.266
rises+0.264
Neg logit contributions
-0.339
estones-0.298
>>>>-0.290
>>>>>>>>-0.287
mercial-0.286
Token big
Token ID1263
Feature activation-3.78e-01

Pos logit contributions
pell+0.446
hend+0.432
hers+0.404
onomy+0.368
ride+0.365
Neg logit contributions
-0.439
estones-0.387
mercial-0.374
>>>>-0.372
>>>>>>>>-0.368
Token thank
Token ID5875
Feature activation-3.83e+00

Pos logit contributions
pell+0.764
hend+0.737
hers+0.687
onomy+0.624
ride+0.619
Neg logit contributions
-0.798
estones-0.706
mercial-0.683
>>>>-0.679
>>>>>>>>-0.672
Token you
Token ID345
Feature activation-4.29e-01

Pos logit contributions
pell+7.003
hend+6.762
hers+6.189
ride+5.742
fulness+5.535
Neg logit contributions
-8.452
estones-7.555
mercial-7.421
>>>>-7.259
>>>>>>>>-7.118
Token to
Token ID284
Feature activation-6.92e-01

Pos logit contributions
pell+0.830
hend+0.799
hers+0.743
onomy+0.671
ride+0.666
Neg logit contributions
-0.941
estones-0.837
mercial-0.811
>>>>-0.807
>>>>>>>>-0.798
Token the
Token ID262
Feature activation-5.94e-01

Pos logit contributions
pell+1.450
hend+1.401
hers+1.311
onomy+1.195
ride+1.186
Neg logit contributions
-1.409
estones-1.240
mercial-1.198
>>>>-1.192
>>>>>>>>-1.178
Token family
Token ID1641
Feature activation-1.96e+00

Pos logit contributions
pell+1.276
hend+1.234
hers+1.157
onomy+1.059
ride+1.050
Neg logit contributions
-1.176
estones-1.031
mercial-0.994
>>>>-0.993
>>>>>>>>-0.979
Token for
Token ID329
Feature activation+4.60e-01

Pos logit contributions
pell+3.896
hend+3.764
hers+3.507
onomy+3.141
ride+3.125
Neg logit contributions
-4.189
estones-3.728
mercial-3.632
>>>>-3.543
>>>>>>>>-3.511
Token bonus
Token ID7202
Feature activation-1.43e-01

Pos logit contributions
pell+0.141
hend+0.137
hers+0.129
onomy+0.117
ride+0.116
Neg logit contributions
-0.130
estones-0.113
mercial-0.110
>>>>-0.110
>>>>>>>>-0.108
Token one
Token ID530
Feature activation-8.00e-01

Pos logit contributions
pell+0.299
hend+0.289
hers+0.272
onomy+0.246
rises+0.246
Neg logit contributions
-0.291
estones-0.256
>>>>-0.248
mercial-0.248
>>>>>>>>-0.245
Token..
Token ID492
Feature activation+4.13e-01

Pos logit contributions
pell+1.622
hend+1.565
hers+1.467
onomy+1.326
ride+1.317
Neg logit contributions
-1.683
estones-1.485
>>>>-1.441
mercial-1.439
>>>>>>>>-1.422
Token
Token ID198
Feature activation+9.94e-01

Pos logit contributions
+0.868
estones+0.770
>>>>+0.750
mercial+0.744
>>>>>>>>+0.737
Neg logit contributions
pell-0.829
hend-0.800
hers-0.744
onomy-0.682
ride-0.672
Token
Token ID198
Feature activation+8.17e-01

Pos logit contributions
+1.954
estones+1.710
>>>>+1.692
>>>>>>>>+1.654
mercial+1.511
Neg logit contributions
pell-2.115
hend-2.037
hers-1.891
onomy-1.759
rises-1.742
Tokenthank
Token ID40716
Feature activation-3.82e+00

Pos logit contributions
+1.794
estones+1.586
>>>>+1.567
>>>>>>>>+1.539
mercial+1.519
Neg logit contributions
pell-1.562
hend-1.504
hers-1.402
onomy-1.257
rises-1.250
Token you
Token ID345
Feature activation-5.21e-01

Pos logit contributions
pell+7.241
hend+7.039
hers+6.519
ride+5.915
onomy+5.798
Neg logit contributions
-8.187
estones-7.197
mercial-7.145
>>>>-6.989
>>>>>>>>-6.932
Token @
Token ID2488
Feature activation-9.60e-01

Pos logit contributions
pell+1.075
hend+1.038
hers+0.969
onomy+0.884
ride+0.876
Neg logit contributions
-1.077
estones-0.951
mercial-0.916
>>>>-0.916
>>>>>>>>-0.904
Tokenmr
Token ID43395
Feature activation-1.15e-01

Pos logit contributions
pell+1.855
hend+1.787
hers+1.662
onomy+1.499
ride+1.488
Neg logit contributions
-2.112
estones-1.879
mercial-1.822
>>>>-1.811
>>>>>>>>-1.791
Token_
Token ID62
Feature activation-1.27e+00

Pos logit contributions
pell+0.205
hend+0.196
hers+0.182
onomy+0.163
ride+0.161
Neg logit contributions
-0.267
estones-0.240
mercial-0.232
>>>>-0.231
>>>>>>>>-0.229
Tokense
Token ID325
Feature activation+2.89e-01

Pos logit contributions
pell+2.339
hend+2.252
hers+2.082
onomy+1.862
ride+1.854
Neg logit contributions
-2.927
estones-2.615
mercial-2.542
>>>>-2.528
>>>>>>>>-2.501
Tokenison
Token ID1653
Feature activation-1.20e+00

Pos logit contributions
+1.239
estones+1.113
mercial+1.067
>>>>+1.047
>>>>>>>>+1.026
Neg logit contributions
pell-1.297
hend-1.276
hers-1.187
onomy-1.086
rises-1.080
Token Rain
Token ID10301
Feature activation-1.74e+00

Pos logit contributions
pell+2.332
hend+2.245
hers+2.087
onomy+1.885
ride+1.869
Neg logit contributions
-2.646
estones-2.353
mercial-2.281
>>>>-2.266
>>>>>>>>-2.240
Tokeness
Token ID408
Feature activation-9.99e-01

Pos logit contributions
pell+3.458
hend+3.329
hers+3.099
onomy+2.805
ride+2.791
Neg logit contributions
-3.750
estones-3.319
mercial-3.223
>>>>-3.208
>>>>>>>>-3.175
Token
Token ID198
Feature activation+8.36e-02

Pos logit contributions
pell+1.935
hend+1.864
hers+1.734
onomy+1.565
ride+1.553
Neg logit contributions
-2.193
estones-1.951
mercial-1.891
>>>>-1.880
>>>>>>>>-1.859
Token
Token ID198
Feature activation+2.62e-02

Pos logit contributions
+0.171
estones+0.151
>>>>+0.147
>>>>>>>>+0.145
mercial+0.138
Neg logit contributions
pell-0.173
hend-0.167
hers-0.155
onomy-0.143
rises-0.141
TokenCross
Token ID21544
Feature activation-3.80e+00

Pos logit contributions
+0.056
estones+0.049
>>>>+0.048
>>>>>>>>+0.048
mercial+0.047
Neg logit contributions
pell-0.052
hend-0.050
hers-0.047
onomy-0.042
rises-0.042
Token-
Token ID12
Feature activation+1.75e-01

Pos logit contributions
pell+5.540
hend+5.149
hers+4.675
ride+4.042
onomy+3.978
Neg logit contributions
-9.897
estones-9.073
mercial-8.796
>>>>>>>>-8.692
>>>>-8.684
Tokend
Token ID67
Feature activation+7.66e-01

Pos logit contributions
+0.477
estones+0.434
mercial+0.424
>>>>+0.423
>>>>>>>>+0.418
Neg logit contributions
pell-0.244
hend-0.233
hers-0.210
onomy-0.182
rises-0.179
Tokenressing
Token ID11697
Feature activation-7.34e-01

Pos logit contributions
+2.201
estones+2.030
mercial+1.971
>>>>+1.963
>>>>>>>>+1.950
Neg logit contributions
pell-0.952
hend-0.888
hers-0.781
onomy-0.659
rises-0.656
Token:
Token ID25
Feature activation+2.26e-01

Pos logit contributions
pell+1.368
hend+1.316
hers+1.220
onomy+1.096
ride+1.088
Neg logit contributions
-1.663
estones-1.485
mercial-1.441
>>>>-1.433
>>>>>>>>-1.418
Token Cross
Token ID6372
Feature activation-2.86e+00

Pos logit contributions
+0.385
estones+0.328
>>>>+0.315
mercial+0.312
>>>>>>>>+0.308
Neg logit contributions
pell-0.550
hend-0.530
hers-0.498
onomy-0.463
rises-0.462
Token to
Token ID284
Feature activation+5.98e-01

Pos logit contributions
+4.081
estones+3.584
>>>>+3.490
mercial+3.485
>>>>>>>>+3.437
Neg logit contributions
pell-4.064
hend-3.980
hers-3.659
onomy-3.345
rises-3.313
Token reality
Token ID3950
Feature activation-2.30e-01

Pos logit contributions
+0.997
estones+0.827
mercial+0.817
>>>>+0.815
>>>>>>>>+0.797
Neg logit contributions
pell-1.472
hend-1.423
hers-1.338
onomy-1.248
ride-1.244
Token.
Token ID13
Feature activation-2.46e-01

Pos logit contributions
pell+0.447
hend+0.431
hers+0.400
onomy+0.362
ride+0.359
Neg logit contributions
-0.505
estones-0.448
mercial-0.434
>>>>-0.433
>>>>>>>>-0.428
Token
Token ID198
Feature activation+1.31e-01

Pos logit contributions
pell+0.509
hend+0.491
hers+0.458
onomy+0.416
ride+0.413
Neg logit contributions
-0.511
estones-0.449
>>>>-0.435
mercial-0.433
>>>>>>>>-0.430
Token
Token ID198
Feature activation+2.12e-01

Pos logit contributions
+0.274
estones+0.241
>>>>+0.237
>>>>>>>>+0.233
mercial+0.222
Neg logit contributions
pell-0.270
hend-0.260
hers-0.241
onomy-0.221
rises-0.218
TokenThanks
Token ID9690
Feature activation-3.76e+00

Pos logit contributions
+0.425
estones+0.369
>>>>+0.363
>>>>>>>>+0.358
mercial+0.349
Neg logit contributions
pell-0.452
hend-0.438
hers-0.409
onomy-0.372
rises-0.368
Token to
Token ID284
Feature activation-1.11e-01

Pos logit contributions
pell+6.542
hend+6.392
hers+5.984
ride+5.373
onomy+5.168
Neg logit contributions
-8.346
estones-7.502
mercial-7.388
>>>>-7.141
>>>>>>>>-7.093
Token Adobe
Token ID21771
Feature activation+2.41e-01

Pos logit contributions
pell+0.229
hend+0.222
hers+0.208
onomy+0.189
ride+0.187
Neg logit contributions
-0.228
estones-0.200
>>>>-0.193
mercial-0.193
>>>>>>>>-0.191
Token AIR
Token ID31600
Feature activation-1.08e+00

Pos logit contributions
+0.484
estones+0.425
mercial+0.413
>>>>+0.411
>>>>>>>>+0.407
Neg logit contributions
pell-0.511
hend-0.495
hers-0.464
onomy-0.421
rises-0.419
Token for
Token ID329
Feature activation+7.72e-01

Pos logit contributions
pell+2.122
hend+2.005
hers+1.881
ride+1.701
onomy+1.701
Neg logit contributions
-2.362
estones-2.110
mercial-2.014
>>>>-2.010
>>>>>>>>-1.980
Token Android
Token ID5565
Feature activation-1.10e+00

Pos logit contributions
+1.643
estones+1.473
>>>>+1.435
mercial+1.432
>>>>>>>>+1.402
Neg logit contributions
pell-1.496
hend-1.465
hers-1.353
onomy-1.231
rises-1.222
Token every
Token ID790
Feature activation-7.05e-03

Pos logit contributions
+0.978
estones+0.861
>>>>+0.822
>>>>>>>>+0.816
mercial+0.792
Neg logit contributions
pell-1.098
hend-1.065
hers-0.998
onomy-0.924
ride-0.910
Token week
Token ID1285
Feature activation-4.24e-01

Pos logit contributions
pell+0.017
hend+0.016
hers+0.015
onomy+0.014
rises+0.014
Neg logit contributions
-0.012
estones-0.010
>>>>-0.010
mercial-0.010
>>>>>>>>-0.010
Token.
Token ID13
Feature activation-9.98e-02

Pos logit contributions
pell+0.837
hend+0.807
hers+0.749
onomy+0.682
rises+0.675
Neg logit contributions
-0.914
estones-0.809
>>>>-0.787
>>>>>>>>-0.777
mercial-0.767
Token
Token ID198
Feature activation+1.39e-01

Pos logit contributions
pell+0.215
hend+0.208
hers+0.195
onomy+0.178
ride+0.177
Neg logit contributions
-0.197
estones-0.171
>>>>-0.169
>>>>>>>>-0.168
mercial-0.161
Token
Token ID198
Feature activation+1.99e-01

Pos logit contributions
+0.287
estones+0.252
>>>>+0.247
>>>>>>>>+0.243
mercial+0.231
Neg logit contributions
pell-0.286
hend-0.276
hers-0.256
onomy-0.235
rises-0.233
TokenThank
Token ID10449
Feature activation-3.71e+00

Pos logit contributions
+0.428
estones+0.375
>>>>+0.368
>>>>>>>>+0.362
mercial+0.361
Neg logit contributions
pell-0.394
hend-0.380
hers-0.352
onomy-0.318
rises-0.315
Token you
Token ID345
Feature activation-5.16e-01

Pos logit contributions
pell+6.633
hend+6.376
hers+5.900
ride+5.354
onomy+5.294
Neg logit contributions
-8.214
estones-7.395
mercial-7.247
>>>>-7.144
>>>>>>>>-7.105
Token for
Token ID329
Feature activation+1.77e-01

Pos logit contributions
pell+0.976
hend+0.937
hers+0.875
onomy+0.786
ride+0.782
Neg logit contributions
-1.153
estones-1.030
mercial-1.002
>>>>-0.992
>>>>>>>>-0.982
Token signing
Token ID8415
Feature activation-3.70e-01

Pos logit contributions
+0.334
estones+0.294
>>>>+0.284
mercial+0.280
>>>>>>>>+0.277
Neg logit contributions
pell-0.389
hend-0.380
hers-0.354
rises-0.327
onomy-0.327
Token up
Token ID510
Feature activation+8.14e-01

Pos logit contributions
pell+0.668
hend+0.643
hers+0.586
onomy+0.527
rises+0.523
Neg logit contributions
-0.863
estones-0.777
>>>>-0.755
>>>>>>>>-0.750
mercial-0.732
Token.
Token ID13
Feature activation+2.98e-01

Pos logit contributions
+1.694
estones+1.498
>>>>+1.449
mercial+1.437
>>>>>>>>+1.420
Neg logit contributions
pell-1.659
hend-1.607
hers-1.490
onomy-1.349
ride-1.338
Token each
Token ID1123
Feature activation-8.07e-02

Pos logit contributions
pell+3.109
hend+2.988
hers+2.783
onomy+2.499
ride+2.471
Neg logit contributions
-3.633
estones-3.255
mercial-3.213
>>>>-3.122
>>>>>>>>-3.084
Token week
Token ID1285
Feature activation-6.17e-01

Pos logit contributions
pell+0.195
hend+0.189
hers+0.178
onomy+0.165
rises+0.165
Neg logit contributions
-0.137
estones-0.118
>>>>-0.114
>>>>>>>>-0.111
mercial-0.111
Token.
Token ID13
Feature activation-3.10e-01

Pos logit contributions
pell+1.219
hend+1.175
hers+1.090
onomy+0.993
rises+0.983
Neg logit contributions
-1.332
estones-1.179
>>>>-1.145
>>>>>>>>-1.130
mercial-1.116
Token
Token ID198
Feature activation-2.66e-01

Pos logit contributions
pell+0.636
hend+0.611
hers+0.567
onomy+0.522
rises+0.515
Neg logit contributions
-0.650
estones-0.569
>>>>-0.559
>>>>>>>>-0.552
mercial-0.528
Token
Token ID198
Feature activation-1.19e-01

Pos logit contributions
pell+0.541
hend+0.522
hers+0.485
onomy+0.444
rises+0.439
Neg logit contributions
-0.559
estones-0.490
>>>>-0.481
>>>>>>>>-0.473
mercial-0.455
TokenThank
Token ID10449
Feature activation-3.67e+00

Pos logit contributions
pell+0.208
hend+0.200
hers+0.185
ride+0.163
onomy+0.163
Neg logit contributions
-0.283
estones-0.251
>>>>-0.247
>>>>>>>>-0.244
mercial-0.243
Token you
Token ID345
Feature activation-4.09e-01

Pos logit contributions
pell+6.565
hend+6.230
hers+5.836
ride+5.162
onomy+5.104
Neg logit contributions
-8.371
mercial-7.735
estones-7.461
 cumbers-7.176
>>>>-7.155
Token for
Token ID329
Feature activation+1.93e-01

Pos logit contributions
pell+0.813
hend+0.782
hers+0.726
onomy+0.663
rises+0.654
Neg logit contributions
-0.878
estones-0.776
>>>>-0.753
>>>>>>>>-0.744
mercial-0.738
Token signing
Token ID8415
Feature activation-1.87e-01

Pos logit contributions
+0.359
estones+0.317
>>>>+0.307
>>>>>>>>+0.299
mercial+0.292
Neg logit contributions
pell-0.431
hend-0.419
hers-0.386
onomy-0.362
rises-0.359
Token up
Token ID510
Feature activation+8.70e-01

Pos logit contributions
pell+0.341
hend+0.330
hers+0.298
onomy+0.275
rises+0.270
Neg logit contributions
-0.425
estones-0.383
>>>>-0.372
>>>>>>>>-0.372
mercial-0.354
Token.
Token ID13
Feature activation+1.64e-01

Pos logit contributions
+1.701
estones+1.482
>>>>+1.469
>>>>>>>>+1.439
mercial+1.287
Neg logit contributions
pell-1.861
hend-1.790
hers-1.655
onomy-1.545
rises-1.535
Token results
Token ID2482
Feature activation-6.97e-02

Pos logit contributions
pell+1.408
hend+1.366
hers+1.287
onomy+1.187
ride+1.180
Neg logit contributions
-1.063
estones-0.917
mercial-0.880
>>>>-0.875
>>>>>>>>-0.863
Token.
Token ID13
Feature activation-3.04e-01

Pos logit contributions
pell+0.140
hend+0.136
hers+0.126
onomy+0.115
ride+0.114
Neg logit contributions
-0.147
estones-0.130
>>>>-0.126
mercial-0.126
>>>>>>>>-0.125
Token
Token ID198
Feature activation-2.15e-02

Pos logit contributions
pell+0.650
hend+0.626
hers+0.585
onomy+0.535
ride+0.533
Neg logit contributions
-0.610
estones-0.534
>>>>-0.517
>>>>>>>>-0.510
mercial-0.508
Token
Token ID198
Feature activation+8.46e-02

Pos logit contributions
pell+0.044
hend+0.043
hers+0.040
onomy+0.036
rises+0.036
Neg logit contributions
-0.045
estones-0.039
>>>>-0.038
>>>>>>>>-0.038
mercial-0.036
TokenMany
Token ID7085
Feature activation+1.50e-01

Pos logit contributions
+0.173
estones+0.151
>>>>+0.148
>>>>>>>>+0.146
mercial+0.143
Neg logit contributions
pell-0.177
hend-0.171
hers-0.160
onomy-0.146
ride-0.145
Token thanks
Token ID5176
Feature activation-3.61e+00

Pos logit contributions
+0.304
estones+0.267
>>>>+0.257
mercial+0.257
>>>>>>>>+0.254
Neg logit contributions
pell-0.315
hend-0.305
hers-0.285
onomy-0.262
ride-0.260
Token again
Token ID757
Feature activation-1.89e+00

Pos logit contributions
pell+6.321
hend+6.030
hers+5.669
ride+5.007
onomy+4.921
Neg logit contributions
-8.111
estones-7.357
mercial-7.286
>>>>-6.946
>>>>>>>>-6.843
Token to
Token ID284
Feature activation+8.87e-02

Pos logit contributions
pell+3.431
hend+3.266
hers+3.048
ride+2.721
onomy+2.696
Neg logit contributions
-4.323
estones-3.901
mercial-3.808
>>>>-3.721
>>>>>>>>-3.671
Token all
Token ID477
Feature activation+7.78e-02

Pos logit contributions
+0.183
estones+0.161
>>>>+0.156
mercial+0.154
>>>>>>>>+0.154
Neg logit contributions
pell-0.182
hend-0.176
hers-0.165
onomy-0.151
rises-0.149
Token of
Token ID286
Feature activation+4.99e-01

Pos logit contributions
+0.160
estones+0.141
>>>>+0.137
mercial+0.136
>>>>>>>>+0.135
Neg logit contributions
pell-0.161
hend-0.155
hers-0.145
onomy-0.133
ride-0.132
Token you
Token ID345
Feature activation+2.85e-01

Pos logit contributions
+1.011
estones+0.885
>>>>+0.860
mercial+0.853
>>>>>>>>+0.847
Neg logit contributions
pell-1.044
hend-1.013
hers-0.945
onomy-0.866
ride-0.858
Token Lar
Token ID25577
Feature activation-2.94e-01

Pos logit contributions
+0.531
estones+0.461
mercial+0.444
>>>>+0.442
>>>>>>>>+0.435
Neg logit contributions
pell-0.650
hend-0.632
hers-0.594
onomy-0.550
rises-0.545
Tokenche
Token ID2395
Feature activation+1.44e-01

Pos logit contributions
pell+0.540
hend+0.529
hers+0.487
onomy+0.442
rises+0.437
Neg logit contributions
-0.652
estones-0.593
>>>>-0.566
>>>>>>>>-0.560
mercial-0.559
Token.
Token ID13
Feature activation+4.45e-01

Pos logit contributions
+0.304
estones+0.267
>>>>+0.260
mercial+0.259
>>>>>>>>+0.256
Neg logit contributions
pell-0.287
hend-0.280
hers-0.260
onomy-0.237
rises-0.234
Token
Token ID198
Feature activation+4.46e-01

Pos logit contributions
+0.917
estones+0.808
>>>>+0.791
mercial+0.778
>>>>>>>>+0.778
Neg logit contributions
pell-0.917
hend-0.881
hers-0.824
onomy-0.753
rises-0.750
Token
Token ID198
Feature activation+4.42e-01

Pos logit contributions
+0.902
estones+0.794
>>>>+0.782
>>>>>>>>+0.768
mercial+0.717
Neg logit contributions
pell-0.934
hend-0.898
hers-0.836
onomy-0.772
rises-0.766
TokenThanks
Token ID9690
Feature activation-3.60e+00

Pos logit contributions
+0.936
estones+0.819
>>>>+0.814
>>>>>>>>+0.801
mercial+0.779
Neg logit contributions
pell-0.893
hend-0.854
hers-0.801
onomy-0.725
rises-0.720
Token again
Token ID757
Feature activation-1.78e+00

Pos logit contributions
pell+6.791
hend+6.564
hers+6.203
onomy+5.492
ride+5.481
Neg logit contributions
-7.689
estones-6.815
mercial-6.760
>>>>>>>>-6.442
>>>>-6.420
Token for
Token ID329
Feature activation+9.49e-01

Pos logit contributions
pell+3.287
hend+3.144
hers+2.947
onomy+2.624
ride+2.619
Neg logit contributions
-4.010
estones-3.597
mercial-3.509
>>>>-3.426
>>>>>>>>-3.402
Token listening
Token ID8680
Feature activation+9.73e-01

Pos logit contributions
+2.061
estones+1.850
>>>>+1.834
>>>>>>>>+1.786
mercial+1.772
Neg logit contributions
pell-1.795
hend-1.740
hers-1.587
onomy-1.461
rises-1.448
Token!
Token ID0
Feature activation+2.52e-01

Pos logit contributions
+2.083
estones+1.849
>>>>+1.802
mercial+1.787
>>>>>>>>+1.766
Neg logit contributions
pell-1.921
hend-1.861
hers-1.731
onomy-1.564
ride-1.552
Token<|endoftext|>
Token ID50256
Feature activation-1.88e+00

Pos logit contributions
+0.545
estones+0.483
>>>>+0.472
>>>>>>>>+0.465
mercial+0.463
Neg logit contributions
pell-0.498
hend-0.477
hers-0.444
onomy-0.402
ride-0.399
Token miles
Token ID4608
Feature activation+1.12e-01

Pos logit contributions
pell+6.836
hend+6.594
hers+6.236
onomy+5.746
meg+5.667
Neg logit contributions
-5.194
estones-4.549
mercial-4.404
>>>>-4.359
>>>>>>>>-4.339
Token (
Token ID357
Feature activation-2.12e-01

Pos logit contributions
+0.232
estones+0.205
mercial+0.197
>>>>+0.197
>>>>>>>>+0.194
Neg logit contributions
pell-0.231
hend-0.223
hers-0.208
onomy-0.190
rises-0.188
Token1
Token ID16
Feature activation-6.05e-01

Pos logit contributions
pell+0.371
hend+0.357
hers+0.327
onomy+0.296
ride+0.288
Neg logit contributions
-0.502
estones-0.449
>>>>-0.437
mercial-0.434
>>>>>>>>-0.432
Token,
Token ID11
Feature activation-4.52e-01

Pos logit contributions
pell+1.135
hend+1.089
hers+1.017
onomy+0.913
ride+0.905
Neg logit contributions
-1.356
estones-1.218
mercial-1.179
>>>>-1.165
>>>>>>>>-1.156
Token000
Token ID830
Feature activation-1.26e+00

Pos logit contributions
pell+0.824
hend+0.792
hers+0.733
onomy+0.657
ride+0.651
Neg logit contributions
-1.042
estones-0.932
mercial-0.905
>>>>-0.901
>>>>>>>>-0.891
Token square
Token ID6616
Feature activation-3.59e+00

Pos logit contributions
pell+2.351
hend+2.261
hers+2.097
onomy+1.885
ride+1.870
Neg logit contributions
-2.847
estones-2.542
mercial-2.466
>>>>-2.453
>>>>>>>>-2.427
Token kilometers
Token ID18212
Feature activation-8.00e-01

Pos logit contributions
pell+7.842
hend+7.537
hers+7.109
meg+6.471
onomy+6.415
Neg logit contributions
-6.734
estones-5.932
mercial-5.692
>>>>>>>>-5.675
>>>>-5.646
Token),
Token ID828
Feature activation-2.33e-01

Pos logit contributions
pell+1.484
hend+1.424
hers+1.321
onomy+1.184
ride+1.177
Neg logit contributions
-1.821
estones-1.627
mercial-1.579
>>>>-1.572
>>>>>>>>-1.556
Token with
Token ID351
Feature activation+3.17e-01

Pos logit contributions
pell+0.492
hend+0.475
hers+0.444
onomy+0.407
ride+0.402
Neg logit contributions
-0.472
estones-0.415
mercial-0.400
>>>>-0.400
>>>>>>>>-0.394
Token roughly
Token ID7323
Feature activation-9.37e-01

Pos logit contributions
+0.617
estones+0.542
>>>>+0.523
mercial+0.522
>>>>>>>>+0.514
Neg logit contributions
pell-0.687
hend-0.666
hers-0.625
onomy-0.573
ride-0.568
Token one
Token ID530
Feature activation-1.63e-01

Pos logit contributions
pell+1.869
hend+1.802
hers+1.679
onomy+1.523
ride+1.509
Neg logit contributions
-2.004
estones-1.776
mercial-1.717
>>>>-1.714
>>>>>>>>-1.693
Token has
Token ID468
Feature activation+5.08e-01

Pos logit contributions
pell+3.123
hend+2.998
hers+2.824
onomy+2.546
ride+2.537
Neg logit contributions
-3.104
estones-2.729
mercial-2.639
>>>>-2.618
>>>>>>>>-2.586
Token former
Token ID1966
Feature activation+1.45e-01

Pos logit contributions
+1.022
estones+0.901
>>>>+0.880
mercial+0.877
>>>>>>>>+0.868
Neg logit contributions
pell-1.060
hend-1.031
hers-0.949
onomy-0.877
ride-0.872
Token United
Token ID1578
Feature activation-1.43e-01

Pos logit contributions
+0.281
estones+0.245
>>>>+0.237
mercial+0.236
>>>>>>>>+0.233
Neg logit contributions
pell-0.319
hend-0.309
hers-0.289
onomy-0.266
rises-0.264
Token States
Token ID1829
Feature activation+5.11e-01

Pos logit contributions
pell+0.255
hend+0.244
hers+0.225
onomy+0.202
rises+0.200
Neg logit contributions
-0.337
estones-0.302
mercial-0.295
>>>>-0.293
>>>>>>>>-0.290
Token Senators
Token ID22195
Feature activation+6.88e-01

Pos logit contributions
+0.991
estones+0.859
>>>>+0.839
mercial+0.836
>>>>>>>>+0.827
Neg logit contributions
pell-1.118
hend-1.081
hers-1.013
onomy-0.938
rises-0.928
Token Trent
Token ID24269
Feature activation-3.59e+00

Pos logit contributions
+1.388
estones+1.212
>>>>+1.181
mercial+1.172
>>>>>>>>+1.167
Neg logit contributions
pell-1.425
hend-1.393
hers-1.298
onomy-1.195
ride-1.186
Token L
Token ID406
Feature activation+4.93e-02

Pos logit contributions
pell+6.513
hend+6.235
hers+5.805
onomy+5.225
ride+5.122
Neg logit contributions
-8.265
estones-7.276
mercial-7.243
>>>>>>>>-7.018
>>>>-7.014
Tokenott
Token ID1252
Feature activation-6.98e-01

Pos logit contributions
+0.098
estones+0.088
>>>>+0.084
>>>>>>>>+0.083
mercial+0.083
Neg logit contributions
pell-0.101
hend-0.099
hers-0.093
rises-0.085
onomy-0.085
Token and
Token ID290
Feature activation-7.89e-02

Pos logit contributions
pell+1.414
hend+1.364
hers+1.273
onomy+1.156
ride+1.147
Neg logit contributions
-1.470
estones-1.300
mercial-1.258
>>>>-1.251
>>>>>>>>-1.236
Token John
Token ID1757
Feature activation+2.80e-01

Pos logit contributions
pell+0.166
hend+0.160
hers+0.150
onomy+0.137
ride+0.136
Neg logit contributions
-0.159
estones-0.140
>>>>-0.135
mercial-0.135
>>>>>>>>-0.134
Token Bre
Token ID3719
Feature activation+2.19e+00

Pos logit contributions
+0.568
estones+0.502
mercial+0.485
>>>>+0.482
>>>>>>>>+0.478
Neg logit contributions
pell-0.580
hend-0.558
hers-0.524
onomy-0.482
rises-0.478
Token confident
Token ID6563
Feature activation-8.69e-01

Pos logit contributions
pell+0.615
hend+0.597
hers+0.562
onomy+0.516
rises+0.509
Neg logit contributions
-0.542
estones-0.471
>>>>-0.458
mercial-0.454
>>>>>>>>-0.451
Token fighter
Token ID10543
Feature activation+4.85e-01

Pos logit contributions
pell+1.888
hend+1.830
hers+1.717
onomy+1.571
ride+1.558
Neg logit contributions
-1.700
estones-1.485
mercial-1.433
>>>>-1.429
>>>>>>>>-1.410
Token.
Token ID13
Feature activation-2.89e-01

Pos logit contributions
+1.015
estones+0.887
>>>>+0.870
mercial+0.864
>>>>>>>>+0.854
Neg logit contributions
pell-0.971
hend-0.948
hers-0.876
onomy-0.808
ride-0.791
Token
Token ID198
Feature activation-1.71e-01

Pos logit contributions
pell+0.597
hend+0.575
hers+0.536
onomy+0.489
ride+0.485
Neg logit contributions
-0.600
estones-0.526
>>>>-0.512
>>>>>>>>-0.505
mercial-0.503
Token
Token ID198
Feature activation-7.17e-02

Pos logit contributions
pell+0.348
hend+0.335
hers+0.311
onomy+0.286
rises+0.283
Neg logit contributions
-0.357
estones-0.315
>>>>-0.308
>>>>>>>>-0.303
mercial-0.291
TokenThank
Token ID10449
Feature activation-3.57e+00

Pos logit contributions
pell+0.150
hend+0.144
hers+0.135
onomy+0.123
rises+0.122
Neg logit contributions
-0.147
estones-0.129
>>>>-0.126
>>>>>>>>-0.124
mercial-0.122
Token you
Token ID345
Feature activation-4.02e-01

Pos logit contributions
pell+6.468
hend+6.279
hers+5.767
ride+5.277
fulness+5.159
Neg logit contributions
-7.879
estones-6.995
mercial-6.935
>>>>-6.685
>>>>>>>>-6.639
Token as
Token ID355
Feature activation-1.11e+00

Pos logit contributions
pell+0.836
hend+0.807
hers+0.754
onomy+0.687
ride+0.682
Neg logit contributions
-0.825
estones-0.727
mercial-0.703
>>>>-0.699
>>>>>>>>-0.690
Token well
Token ID880
Feature activation-1.90e+00

Pos logit contributions
pell+2.005
hend+1.926
hers+1.780
onomy+1.586
ride+1.576
Neg logit contributions
-2.577
estones-2.308
mercial-2.243
>>>>-2.226
>>>>>>>>-2.203
Token to
Token ID284
Feature activation+4.03e-02

Pos logit contributions
pell+3.549
hend+3.409
hers+3.181
ride+2.841
onomy+2.819
Neg logit contributions
-4.260
estones-3.820
mercial-3.709
>>>>-3.635
>>>>>>>>-3.610
Token Lawrence
Token ID13914
Feature activation-9.24e-01

Pos logit contributions
+0.082
estones+0.072
>>>>+0.069
mercial+0.069
>>>>>>>>+0.068
Neg logit contributions
pell-0.085
hend-0.082
hers-0.077
onomy-0.070
rises-0.070
Token code
Token ID2438
Feature activation-2.93e+00

Pos logit contributions
pell+0.701
hend+0.678
hers+0.635
onomy+0.581
ride+0.577
Neg logit contributions
-0.637
estones-0.559
mercial-0.539
>>>>-0.537
>>>>>>>>-0.530
Token).
Token ID737
Feature activation-6.43e-01

Pos logit contributions
pell+5.661
hend+5.463
hers+5.070
onomy+4.581
ride+4.523
Neg logit contributions
-6.298
estones-5.546
mercial-5.380
>>>>-5.310
>>>>>>>>-5.255
Token
Token ID198
Feature activation-2.28e-01

Pos logit contributions
pell+1.349
hend+1.300
hers+1.213
onomy+1.106
ride+1.100
Neg logit contributions
-1.314
estones-1.160
>>>>-1.121
mercial-1.114
>>>>>>>>-1.104
Token
Token ID198
Feature activation-1.24e-01

Pos logit contributions
pell+0.463
hend+0.446
hers+0.414
onomy+0.379
rises+0.375
Neg logit contributions
-0.479
estones-0.425
>>>>-0.413
>>>>>>>>-0.407
mercial-0.394
TokenLot
Token ID48601
Feature activation-3.59e-01

Pos logit contributions
pell+0.260
hend+0.252
hers+0.235
onomy+0.214
ride+0.211
Neg logit contributions
-0.251
estones-0.221
>>>>-0.216
>>>>>>>>-0.212
mercial-0.209
Token Sub
Token ID3834
Feature activation-3.56e+00

Pos logit contributions
pell+0.691
hend+0.664
hers+0.616
onomy+0.554
ride+0.550
Neg logit contributions
-0.794
estones-0.708
mercial-0.683
>>>>-0.681
>>>>>>>>-0.674
Tokendivision
Token ID21426
Feature activation+2.94e-01

Pos logit contributions
pell+6.939
hend+6.729
hers+6.239
onomy+5.638
ride+5.620
Neg logit contributions
-7.577
estones-6.649
>>>>-6.520
mercial-6.463
>>>>>>>>-6.446
Token
Token ID198
Feature activation+8.61e-01

Pos logit contributions
+0.613
estones+0.546
mercial+0.522
>>>>+0.522
>>>>>>>>+0.514
Neg logit contributions
pell-0.597
hend-0.577
hers-0.533
onomy-0.484
ride-0.482
Token
Token ID198
Feature activation+7.04e-01

Pos logit contributions
+1.687
estones+1.487
>>>>+1.457
>>>>>>>>+1.432
mercial+1.300
Neg logit contributions
pell-1.841
hend-1.781
hers-1.646
onomy-1.527
rises-1.516
TokenAfter
Token ID3260
Feature activation-1.85e+00

Pos logit contributions
+1.389
estones+1.214
>>>>+1.192
>>>>>>>>+1.169
mercial+1.139
Neg logit contributions
pell-1.498
hend-1.461
hers-1.356
onomy-1.232
ride-1.218
Token road
Token ID2975
Feature activation-4.47e-01

Pos logit contributions
pell+3.668
hend+3.536
hers+3.299
onomy+2.982
ride+2.961
Neg logit contributions
-3.961
estones-3.512
mercial-3.405
>>>>-3.370
>>>>>>>>-3.343
Token
Token ID198
Feature activation+3.69e-01

Pos logit contributions
pell+0.188
hend+0.180
hers+0.166
onomy+0.148
ride+0.147
Neg logit contributions
-0.253
estones-0.227
mercial-0.221
>>>>-0.220
>>>>>>>>-0.217
Token
Token ID198
Feature activation+2.94e-01

Pos logit contributions
+0.747
estones+0.655
>>>>+0.645
>>>>>>>>+0.635
mercial+0.592
Neg logit contributions
pell-0.773
hend-0.746
hers-0.691
onomy-0.639
rises-0.633
TokenClick
Token ID8164
Feature activation-7.93e-01

Pos logit contributions
+0.641
estones+0.565
>>>>+0.558
>>>>>>>>+0.550
mercial+0.539
Neg logit contributions
pell-0.574
hend-0.549
hers-0.511
onomy-0.463
ride-0.458
Token here
Token ID994
Feature activation-6.99e-01

Pos logit contributions
pell+1.610
hend+1.554
hers+1.452
onomy+1.317
ride+1.308
Neg logit contributions
-1.665
estones-1.470
mercial-1.424
>>>>-1.414
>>>>>>>>-1.399
Token to
Token ID284
Feature activation-2.53e-01

Pos logit contributions
pell+1.278
hend+1.226
hers+1.143
ride+1.019
onomy+1.015
Neg logit contributions
-1.599
estones-1.430
mercial-1.386
>>>>-1.364
>>>>>>>>-1.354
Token thank
Token ID5875
Feature activation-3.56e+00

Pos logit contributions
pell+0.576
hend+0.558
hers+0.525
onomy+0.483
rises+0.479
Neg logit contributions
-0.466
estones-0.407
>>>>-0.392
mercial-0.391
>>>>>>>>-0.386
Token Will
Token ID2561
Feature activation+2.53e-01

Pos logit contributions
pell+8.182
hend+7.908
hers+7.456
onomy+6.830
ride+6.813
Neg logit contributions
-6.208
estones-5.439
mercial-5.283
>>>>-5.016
>>>>>>>>-4.999
Token on
Token ID319
Feature activation-3.02e-01

Pos logit contributions
+0.517
estones+0.454
>>>>+0.439
mercial+0.437
>>>>>>>>+0.432
Neg logit contributions
pell-0.525
hend-0.509
hers-0.475
onomy-0.436
rises-0.431
Token Twitter
Token ID3009
Feature activation-5.11e-01

Pos logit contributions
pell+0.615
hend+0.593
hers+0.555
onomy+0.503
ride+0.499
Neg logit contributions
-0.632
estones-0.559
mercial-0.539
>>>>-0.539
>>>>>>>>-0.532
Token!
Token ID0
Feature activation-1.42e-01

Pos logit contributions
pell+0.988
hend+0.953
hers+0.886
onomy+0.800
ride+0.794
Neg logit contributions
-1.121
estones-0.996
>>>>-0.964
mercial-0.964
>>>>>>>>-0.952
Token
Token ID198
Feature activation+4.86e-01

Pos logit contributions
pell+0.269
hend+0.258
hers+0.239
onomy+0.216
ride+0.214
Neg logit contributions
-0.317
estones-0.283
>>>>-0.275
mercial-0.274
>>>>>>>>-0.271
Token
Token ID198
Feature activation-3.79e-01

Pos logit contributions
pell+3.375
hend+3.276
hers+3.009
onomy+2.731
rises+2.700
Neg logit contributions
-4.310
estones-3.812
mercial-3.748
>>>>-3.687
>>>>>>>>-3.651
Token
Token ID198
Feature activation-3.48e-01

Pos logit contributions
pell+0.758
hend+0.729
hers+0.677
onomy+0.616
rises+0.610
Neg logit contributions
-0.810
estones-0.717
>>>>-0.697
>>>>>>>>-0.688
mercial-0.676
TokenAdvertisement
Token ID4723
Feature activation-2.41e+00

Pos logit contributions
pell+0.748
hend+0.721
hers+0.673
onomy+0.612
rises+0.610
Neg logit contributions
-0.695
estones-0.606
>>>>-0.591
>>>>>>>>-0.584
mercial-0.571
Token
Token ID198
Feature activation-3.07e-01

Pos logit contributions
pell+4.316
hend+4.168
hers+3.868
ride+3.446
onomy+3.438
Neg logit contributions
-5.564
mercial-4.986
estones-4.967
>>>>-4.736
>>>>>>>>-4.691
Token
Token ID198
Feature activation-2.99e-01

Pos logit contributions
pell+0.618
hend+0.595
hers+0.552
onomy+0.503
rises+0.498
Neg logit contributions
-0.650
estones-0.575
>>>>-0.559
>>>>>>>>-0.551
mercial-0.538
Token#
Token ID2
Feature activation-3.55e+00

Pos logit contributions
pell+0.642
hend+0.620
hers+0.578
onomy+0.525
ride+0.523
Neg logit contributions
-0.597
estones-0.521
>>>>-0.508
>>>>>>>>-0.502
mercial-0.493
Tokenad
Token ID324
Feature activation-8.98e-01

Pos logit contributions
pell+5.797
hend+5.606
hers+5.165
ride+4.591
onomy+4.580
Neg logit contributions
-8.438
estones-7.571
mercial-7.520
>>>>-7.388
>>>>>>>>-7.356
Token#
Token ID2
Feature activation-2.50e+00

Pos logit contributions
pell+1.640
hend+1.574
hers+1.438
onomy+1.296
rises+1.288
Neg logit contributions
-2.065
estones-1.852
>>>>-1.805
mercial-1.793
>>>>>>>>-1.772
TokenThey
Token ID2990
Feature activation+2.12e-01

Pos logit contributions
pell+5.072
hend+4.903
hers+4.610
onomy+4.189
ride+4.166
Neg logit contributions
-5.112
estones-4.552
mercial-4.497
>>>>-4.304
>>>>>>>>-4.251
Token d
Token ID288
Feature activation+4.38e-01

Pos logit contributions
+0.426
estones+0.376
mercial+0.360
>>>>+0.360
>>>>>>>>+0.355
Neg logit contributions
pell-0.447
hend-0.434
hers-0.402
onomy-0.368
ride-0.365
Tokenusted
Token ID8459
Feature activation+1.27e-01

Pos logit contributions
+0.857
estones+0.760
>>>>+0.732
mercial+0.731
>>>>>>>>+0.720
Neg logit contributions
pell-0.929
hend-0.893
hers-0.841
onomy-0.765
rises-0.757
Token8
Token ID23
Feature activation-1.24e-01

Pos logit contributions
+3.918
estones+3.480
>>>>+3.369
>>>>>>>>+3.341
mercial+3.288
Neg logit contributions
pell-3.281
hend-3.130
hers-2.904
onomy-2.662
rises-2.621
Tokenb
Token ID65
Feature activation+5.02e-01

Pos logit contributions
pell+0.237
hend+0.228
hers+0.212
onomy+0.192
ride+0.190
Neg logit contributions
-0.275
estones-0.244
>>>>-0.236
mercial-0.235
>>>>>>>>-0.233
TokenET
Token ID2767
Feature activation-2.05e-01

Pos logit contributions
+1.105
estones+0.981
>>>>+0.948
>>>>>>>>+0.940
mercial+0.938
Neg logit contributions
pell-0.960
hend-0.918
hers-0.853
onomy-0.778
rises-0.768
Token pic
Token ID8301
Feature activation-1.17e+00

Pos logit contributions
pell+0.404
hend+0.390
hers+0.360
onomy+0.327
rises+0.325
Neg logit contributions
-0.444
estones-0.393
>>>>-0.380
mercial-0.380
>>>>>>>>-0.376
Token.
Token ID13
Feature activation-4.60e-01

Pos logit contributions
pell+2.281
hend+2.193
hers+2.035
onomy+1.846
ride+1.825
Neg logit contributions
-2.549
estones-2.263
>>>>-2.193
mercial-2.170
>>>>>>>>-2.166
Tokentwitter
Token ID6956
Feature activation-3.53e+00

Pos logit contributions
pell+0.468
hend+0.431
hers+0.370
onomy+0.301
rises+0.284
Neg logit contributions
-1.434
estones-1.317
>>>>-1.293
>>>>>>>>-1.281
mercial-1.275
Token.
Token ID13
Feature activation-1.34e+00

Pos logit contributions
pell+5.863
hend+5.635
hers+5.257
ride+4.630
onomy+4.506
Neg logit contributions
-8.399
mercial-7.845
 cumbers-7.636
estones-7.605
 newsp-7.418
Tokencom
Token ID785
Feature activation-1.27e+00

Pos logit contributions
pell+1.831
hend+1.733
hers+1.553
onomy+1.337
ride+1.314
Neg logit contributions
-3.700
estones-3.372
>>>>-3.284
mercial-3.276
>>>>>>>>-3.255
Token/
Token ID14
Feature activation-1.54e+00

Pos logit contributions
pell+2.636
hend+2.547
hers+2.384
onomy+2.165
ride+2.156
Neg logit contributions
-2.596
estones-2.293
mercial-2.232
>>>>-2.196
>>>>>>>>-2.171
TokenW
Token ID54
Feature activation+1.00e+00

Pos logit contributions
pell+2.905
hend+2.804
hers+2.602
onomy+2.333
ride+2.323
Neg logit contributions
-3.422
estones-3.058
mercial-2.985
>>>>-2.947
>>>>>>>>-2.914
Token8
Token ID23
Feature activation+2.48e-01

Pos logit contributions
+2.253
estones+2.003
>>>>+1.937
>>>>>>>>+1.921
mercial+1.904
Neg logit contributions
pell-1.851
hend-1.766
hers-1.642
onomy-1.493
ride-1.469
Token will
Token ID481
Feature activation+2.77e-01

Pos logit contributions
pell+0.041
hend+0.039
hers+0.037
onomy+0.034
rises+0.033
Neg logit contributions
-0.042
estones-0.037
mercial-0.035
>>>>-0.035
>>>>>>>>-0.035
Token attempt
Token ID2230
Feature activation-4.92e-01

Pos logit contributions
+0.530
estones+0.465
mercial+0.446
>>>>+0.445
>>>>>>>>+0.438
Neg logit contributions
pell-0.610
hend-0.591
hers-0.554
onomy-0.510
rises-0.504
Token to
Token ID284
Feature activation-2.47e-01

Pos logit contributions
pell+0.979
hend+0.942
hers+0.879
onomy+0.795
ride+0.791
Neg logit contributions
-1.055
estones-0.935
mercial-0.906
>>>>-0.901
>>>>>>>>-0.891
Token run
Token ID1057
Feature activation-6.71e-01

Pos logit contributions
pell+0.535
hend+0.517
hers+0.484
onomy+0.444
rises+0.439
Neg logit contributions
-0.485
estones-0.428
mercial-0.410
>>>>-0.409
>>>>>>>>-0.403
Token a
Token ID257
Feature activation-5.04e-01

Pos logit contributions
pell+1.393
hend+1.345
hers+1.258
onomy+1.144
ride+1.137
Neg logit contributions
-1.378
estones-1.215
mercial-1.175
>>>>-1.168
>>>>>>>>-1.154
Token sub
Token ID850
Feature activation-3.53e+00

Pos logit contributions
pell+1.086
hend+1.048
hers+0.983
onomy+0.899
ride+0.891
Neg logit contributions
-0.996
estones-0.873
mercial-0.844
>>>>-0.838
>>>>>>>>-0.827
Token-
Token ID12
Feature activation+8.54e-01

Pos logit contributions
pell+5.865
hend+5.636
hers+5.212
ride+4.643
onomy+4.527
Neg logit contributions
-8.374
estones-7.637
>>>>>>>>-7.317
>>>>-7.306
mercial-7.237
Token2
Token ID17
Feature activation+4.62e-01

Pos logit contributions
+2.025
estones+1.809
mercial+1.773
>>>>+1.752
>>>>>>>>+1.720
Neg logit contributions
pell-1.487
hend-1.405
hers-1.293
onomy-1.174
rises-1.151
Token:
Token ID25
Feature activation+8.67e-01

Pos logit contributions
+0.986
estones+0.869
>>>>+0.842
mercial+0.840
>>>>>>>>+0.822
Neg logit contributions
pell-0.923
hend-0.875
hers-0.819
rises-0.750
onomy-0.746
Token00
Token ID405
Feature activation+3.81e-01

Pos logit contributions
+1.875
estones+1.638
>>>>+1.629
>>>>>>>>+1.594
mercial+1.579
Neg logit contributions
pell-1.703
hend-1.618
hers-1.497
onomy-1.381
rises-1.350
Token marathon
Token ID22336
Feature activation+1.61e+00

Pos logit contributions
+0.776
estones+0.684
mercial+0.660
>>>>+0.657
>>>>>>>>+0.644
Neg logit contributions
pell-0.797
hend-0.763
hers-0.717
onomy-0.652
rises-0.651
Token packet
Token ID19638
Feature activation+3.88e-01

Pos logit contributions
pell+3.140
hend+3.027
hers+2.836
onomy+2.581
ride+2.568
Neg logit contributions
-3.045
estones-2.686
mercial-2.584
>>>>-2.562
>>>>>>>>-2.533
Token loss
Token ID2994
Feature activation+8.68e-01

Pos logit contributions
+0.720
estones+0.619
>>>>+0.603
mercial+0.597
>>>>>>>>+0.597
Neg logit contributions
pell-0.886
hend-0.853
hers-0.793
onomy-0.741
ride-0.724
Token and
Token ID290
Feature activation+7.17e-01

Pos logit contributions
+1.762
estones+1.535
>>>>+1.494
>>>>>>>>+1.474
mercial+1.469
Neg logit contributions
pell-1.810
hend-1.755
hers-1.608
onomy-1.480
rises-1.473
Token congestion
Token ID28014
Feature activation+8.92e-02

Pos logit contributions
+1.377
estones+1.199
>>>>+1.170
mercial+1.161
>>>>>>>>+1.154
Neg logit contributions
pell-1.581
hend-1.531
hers-1.435
onomy-1.325
rises-1.301
Token were
Token ID547
Feature activation+1.03e+00

Pos logit contributions
+0.187
estones+0.163
>>>>+0.159
mercial+0.157
>>>>>>>>+0.156
Neg logit contributions
pell-0.183
hend-0.177
hers-0.163
onomy-0.148
ride-0.148
Token synonymous
Token ID33271
Feature activation+2.81e+00

Pos logit contributions
+2.039
estones+1.800
>>>>+1.725
mercial+1.711
>>>>>>>>+1.710
Neg logit contributions
pell-2.191
hend-2.134
hers-1.944
onomy-1.831
ride-1.814
Token due
Token ID2233
Feature activation+8.97e-02

Pos logit contributions
+5.555
estones+4.804
>>>>+4.692
>>>>>>>>+4.666
mercial+4.662
Neg logit contributions
pell-5.778
hend-5.676
hers-5.231
onomy-4.849
ride-4.803
Token to
Token ID284
Feature activation+8.28e-02

Pos logit contributions
+0.205
estones+0.182
mercial+0.176
>>>>+0.176
>>>>>>>>+0.174
Neg logit contributions
pell-0.166
hend-0.159
hers-0.149
ride-0.133
onomy-0.133
Token technology
Token ID3037
Feature activation+2.38e-01

Pos logit contributions
+0.161
estones+0.140
mercial+0.135
>>>>+0.135
>>>>>>>>+0.133
Neg logit contributions
pell-0.182
hend-0.176
hers-0.165
onomy-0.152
ride-0.150
Token limitations
Token ID11247
Feature activation-1.54e-01

Pos logit contributions
+0.442
estones+0.386
mercial+0.371
>>>>+0.370
>>>>>>>>+0.366
Neg logit contributions
pell-0.538
hend-0.523
hers-0.490
onomy-0.450
ride-0.449
Token.
Token ID13
Feature activation-2.97e-01

Pos logit contributions
pell+0.308
hend+0.298
hers+0.277
onomy+0.252
ride+0.251
Neg logit contributions
-0.327
estones-0.289
mercial-0.279
>>>>-0.278
>>>>>>>>-0.275
Token
Token ID447
Feature activation+7.35e-01

Pos logit contributions
+0.173
estones+0.152
mercial+0.147
>>>>+0.146
>>>>>>>>+0.144
Neg logit contributions
pell-0.177
hend-0.171
hers-0.160
onomy-0.146
ride-0.145
Token
Token ID247
Feature activation+2.91e-01

Pos logit contributions
+1.293
estones+1.104
>>>>+1.075
mercial+1.060
>>>>>>>>+1.051
Neg logit contributions
pell-1.739
hend-1.682
hers-1.576
rises-1.464
ride-1.462
Tokens
Token ID82
Feature activation+4.14e-01

Pos logit contributions
+0.621
estones+0.547
mercial+0.530
>>>>+0.530
>>>>>>>>+0.523
Neg logit contributions
pell-0.583
hend-0.563
hers-0.523
onomy-0.474
ride-0.472
Token Moss
Token ID19935
Feature activation+2.49e-01

Pos logit contributions
+0.754
estones+0.645
mercial+0.628
>>>>+0.627
>>>>>>>>+0.615
Neg logit contributions
pell-0.950
hend-0.928
hers-0.871
onomy-0.795
ride-0.794
Tokenad
Token ID324
Feature activation+5.09e-02

Pos logit contributions
+0.528
estones+0.470
>>>>+0.449
mercial+0.448
>>>>>>>>+0.441
Neg logit contributions
pell-0.493
hend-0.480
hers-0.445
onomy-0.399
ride-0.399
Token intelligence
Token ID4430
Feature activation+2.43e+00

Pos logit contributions
+0.110
estones+0.098
mercial+0.095
>>>>+0.094
>>>>>>>>+0.093
Neg logit contributions
pell-0.100
hend-0.097
hers-0.090
onomy-0.081
ride-0.081
Token agency
Token ID4086
Feature activation+1.66e+00

Pos logit contributions
+4.189
estones+3.556
>>>>+3.517
>>>>>>>>+3.417
mercial+3.373
Neg logit contributions
pell-5.592
hend-5.533
hers-5.188
ride-4.762
rises-4.739
Token,
Token ID11
Feature activation+1.72e-01

Pos logit contributions
+3.336
estones+2.891
>>>>+2.838
>>>>>>>>+2.801
mercial+2.801
Neg logit contributions
pell-3.459
hend-3.381
hers-3.130
ride-2.861
rises-2.860
Token Y
Token ID575
Feature activation-3.98e-01

Pos logit contributions
+0.341
estones+0.296
>>>>+0.289
mercial+0.288
>>>>>>>>+0.285
Neg logit contributions
pell-0.370
hend-0.359
hers-0.334
onomy-0.307
ride-0.305
Tokenoss
Token ID793
Feature activation+9.70e-03

Pos logit contributions
pell+0.869
hend+0.838
hers+0.780
onomy+0.719
ride+0.714
Neg logit contributions
-0.774
estones-0.684
>>>>-0.653
mercial-0.649
>>>>>>>>-0.642
Tokeni
Token ID72
Feature activation-4.78e-01

Pos logit contributions
+0.023
estones+0.021
>>>>+0.020
mercial+0.020
>>>>>>>>+0.020
Neg logit contributions
pell-0.017
hend-0.016
hers-0.015
ride-0.014
rises-0.013
Token,
Token ID11
Feature activation-3.66e-01

Pos logit contributions
pell+0.077
hend+0.075
hers+0.069
onomy+0.063
ride+0.063
Neg logit contributions
-0.078
estones-0.068
>>>>-0.067
>>>>>>>>-0.066
mercial-0.066
Token as
Token ID355
Feature activation-8.13e-01

Pos logit contributions
pell+0.800
hend+0.775
hers+0.724
onomy+0.669
rises+0.660
Neg logit contributions
-0.712
estones-0.620
>>>>-0.596
mercial-0.595
>>>>>>>>-0.587
Token Jaguars
Token ID28392
Feature activation+1.18e+00

Pos logit contributions
pell+1.719
hend+1.654
hers+1.545
onomy+1.426
rises+1.406
Neg logit contributions
-1.644
estones-1.439
>>>>-1.386
mercial-1.386
>>>>>>>>-1.368
Token executive
Token ID4640
Feature activation+1.67e+00

Pos logit contributions
+2.144
estones+1.855
>>>>+1.772
mercial+1.759
>>>>>>>>+1.736
Neg logit contributions
pell-2.684
hend-2.602
hers-2.418
onomy-2.248
rises-2.240
Token vice
Token ID7927
Feature activation+4.94e-01

Pos logit contributions
+2.458
estones+1.976
>>>>+1.960
mercial+1.910
>>>>>>>>+1.870
Neg logit contributions
pell-4.392
hend-4.282
hers-4.013
onomy-3.750
rises-3.749
Token president
Token ID1893
Feature activation+2.22e+00

Pos logit contributions
+0.860
estones+0.734
>>>>+0.713
mercial+0.710
>>>>>>>>+0.701
Neg logit contributions
pell-1.173
hend-1.141
hers-1.064
rises-0.989
onomy-0.988
Token Tom
Token ID4186
Feature activation-7.22e-02

Pos logit contributions
+4.206
estones+3.629
>>>>+3.497
mercial+3.482
>>>>>>>>+3.406
Neg logit contributions
pell-4.752
hend-4.612
hers-4.281
rises-4.038
onomy-3.954
Token C
Token ID327
Feature activation-1.37e+00

Pos logit contributions
pell+0.155
hend+0.150
hers+0.140
onomy+0.128
rises+0.128
Neg logit contributions
-0.143
estones-0.126
mercial-0.121
>>>>-0.120
>>>>>>>>-0.118
Tokenough
Token ID619
Feature activation-1.00e+00

Pos logit contributions
pell+2.948
hend+2.856
hers+2.673
onomy+2.439
ride+2.428
Neg logit contributions
-2.725
estones-2.385
mercial-2.319
>>>>-2.295
>>>>>>>>-2.274
Tokenlin
Token ID2815
Feature activation-7.16e-01

Pos logit contributions
pell+2.090
hend+2.038
hers+1.886
onomy+1.728
rises+1.719
Neg logit contributions
-2.021
estones-1.790
mercial-1.736
>>>>-1.714
>>>>>>>>-1.695
Token praised
Token ID15342
Feature activation-1.58e+00

Pos logit contributions
pell+1.399
hend+1.350
hers+1.253
onomy+1.134
ride+1.128
Neg logit contributions
-1.560
estones-1.389
mercial-1.338
>>>>-1.333
>>>>>>>>-1.319
Token search
Token ID2989
Feature activation-9.26e-02

Pos logit contributions
pell+0.063
hend+0.061
hers+0.057
onomy+0.052
rises+0.052
Neg logit contributions
-0.057
estones-0.050
mercial-0.048
>>>>-0.048
>>>>>>>>-0.047
Token engine
Token ID3113
Feature activation+1.23e+00

Pos logit contributions
pell+0.185
hend+0.178
hers+0.167
rises+0.150
onomy+0.150
Neg logit contributions
-0.198
estones-0.173
mercial-0.168
>>>>-0.168
>>>>>>>>-0.166
Token there
Token ID612
Feature activation+1.60e+00

Pos logit contributions
+2.472
estones+2.139
mercial+2.074
>>>>+2.074
>>>>>>>>+2.055
Neg logit contributions
pell-2.566
hend-2.474
hers-2.357
rises-2.136
ride-2.129
Token,
Token ID11
Feature activation+2.37e-01

Pos logit contributions
+3.243
estones+2.858
mercial+2.746
>>>>+2.742
>>>>>>>>+2.718
Neg logit contributions
pell-3.272
hend-3.177
hers-2.980
onomy-2.736
ride-2.723
Token and
Token ID290
Feature activation+2.19e-01

Pos logit contributions
+0.476
estones+0.418
mercial+0.404
>>>>+0.402
>>>>>>>>+0.397
Neg logit contributions
pell-0.503
hend-0.486
hers-0.456
onomy-0.416
ride-0.413
Token Da
Token ID9637
Feature activation+2.31e+00

Pos logit contributions
+0.440
estones+0.384
mercial+0.371
>>>>+0.370
>>>>>>>>+0.365
Neg logit contributions
pell-0.464
hend-0.447
hers-0.421
onomy-0.383
rises-0.382
Tokenum
Token ID388
Feature activation+3.61e-01

Pos logit contributions
+4.548
estones+4.064
>>>>>>>>+3.890
>>>>+3.867
mercial+3.790
Neg logit contributions
pell-4.743
hend-4.485
hers-4.232
onomy-3.943
rises-3.930
Token Com
Token ID955
Feature activation+3.25e-01

Pos logit contributions
+0.767
estones+0.676
mercial+0.657
>>>>+0.652
>>>>>>>>+0.646
Neg logit contributions
pell-0.719
hend-0.695
hers-0.654
rises-0.587
onomy-0.585
Tokenmin
Token ID1084
Feature activation+1.52e+00

Pos logit contributions
+0.692
estones+0.613
mercial+0.596
>>>>+0.588
>>>>>>>>+0.582
Neg logit contributions
pell-0.644
hend-0.624
hers-0.583
onomy-0.529
ride-0.522
Tokenications
Token ID3736
Feature activation-1.84e-01

Pos logit contributions
+3.178
estones+2.801
mercial+2.738
>>>>+2.709
>>>>>>>>+2.687
Neg logit contributions
pell-3.011
hend-2.883
hers-2.706
rises-2.458
onomy-2.445
Token--
Token ID438
Feature activation-5.23e-01

Pos logit contributions
pell+0.371
hend+0.359
hers+0.335
onomy+0.304
ride+0.302
Neg logit contributions
-0.390
estones-0.343
mercial-0.334
>>>>-0.331
>>>>>>>>-0.328
Token.
Token ID13
Feature activation+3.98e-01

Pos logit contributions
+0.310
estones+0.275
mercial+0.265
>>>>+0.265
>>>>>>>>+0.262
Neg logit contributions
pell-0.283
hend-0.274
hers-0.254
onomy-0.230
ride-0.228
Token She
Token ID1375
Feature activation+1.71e-01

Pos logit contributions
+0.822
estones+0.720
>>>>+0.704
>>>>>>>>+0.697
mercial+0.676
Neg logit contributions
pell-0.831
hend-0.794
hers-0.732
onomy-0.676
ride-0.672
Token and
Token ID290
Feature activation+2.69e-01

Pos logit contributions
+0.332
estones+0.293
mercial+0.279
>>>>+0.278
>>>>>>>>+0.275
Neg logit contributions
pell-0.371
hend-0.360
hers-0.336
onomy-0.310
ride-0.308
Token Jen
Token ID13364
Feature activation+1.91e-01

Pos logit contributions
+0.540
estones+0.473
>>>>+0.457
mercial+0.455
>>>>>>>>+0.451
Neg logit contributions
pell-0.571
hend-0.554
hers-0.515
onomy-0.474
ride-0.469
Tokenelle
Token ID13485
Feature activation+4.44e-01

Pos logit contributions
+0.429
estones+0.383
mercial+0.370
>>>>+0.368
>>>>>>>>+0.365
Neg logit contributions
pell-0.358
hend-0.345
hers-0.320
onomy-0.288
ride-0.285
Token might
Token ID1244
Feature activation+2.07e+00

Pos logit contributions
+0.895
estones+0.784
>>>>+0.753
mercial+0.752
>>>>>>>>+0.745
Neg logit contributions
pell-0.936
hend-0.908
hers-0.842
onomy-0.771
ride-0.769
Token have
Token ID423
Feature activation+2.02e-01

Pos logit contributions
+3.747
estones+3.309
mercial+3.180
>>>>+3.157
>>>>>>>>+3.126
Neg logit contributions
pell-4.574
hend-4.472
hers-4.153
onomy-3.859
ride-3.798
Token a
Token ID257
Feature activation+8.47e-02

Pos logit contributions
+0.414
estones+0.366
mercial+0.353
>>>>+0.353
>>>>>>>>+0.349
Neg logit contributions
pell-0.418
hend-0.402
hers-0.372
onomy-0.343
ride-0.340
Token messy
Token ID24097
Feature activation-5.55e-01

Pos logit contributions
+0.163
estones+0.143
>>>>+0.139
>>>>>>>>+0.138
mercial+0.138
Neg logit contributions
pell-0.184
hend-0.178
hers-0.167
onomy-0.152
ride-0.152
Token mother
Token ID2802
Feature activation-3.39e-01

Pos logit contributions
pell+1.262
hend+1.224
hers+1.151
onomy+1.057
ride+1.050
Neg logit contributions
-1.030
estones-0.895
mercial-0.860
>>>>-0.857
>>>>>>>>-0.846
Token/
Token ID14
Feature activation-3.94e-01

Pos logit contributions
pell+0.651
hend+0.627
hers+0.582
onomy+0.524
rises+0.522
Neg logit contributions
-0.747
estones-0.666
mercial-0.646
>>>>-0.643
>>>>>>>>-0.639
Token.
Token ID13
Feature activation-2.65e-01

Pos logit contributions
pell+0.571
hend+0.552
hers+0.513
onomy+0.466
ride+0.461
Neg logit contributions
-0.634
estones-0.563
mercial-0.546
>>>>-0.543
>>>>>>>>-0.536
Token I
Token ID314
Feature activation-3.39e-01

Pos logit contributions
pell+0.552
hend+0.532
hers+0.495
onomy+0.451
ride+0.449
Neg logit contributions
-0.544
estones-0.479
>>>>-0.462
mercial-0.458
>>>>>>>>-0.457
Token was
Token ID373
Feature activation+1.73e-03

Pos logit contributions
pell+0.714
hend+0.693
hers+0.647
onomy+0.591
ride+0.587
Neg logit contributions
-0.683
estones-0.603
mercial-0.578
>>>>-0.577
>>>>>>>>-0.574
Token trapped
Token ID13640
Feature activation-4.74e-01

Pos logit contributions
+0.004
estones+0.003
>>>>+0.003
mercial+0.003
>>>>>>>>+0.003
Neg logit contributions
pell-0.003
hend-0.003
hers-0.003
onomy-0.003
ride-0.003
Token.
Token ID13
Feature activation+1.25e-01

Pos logit contributions
pell+0.954
hend+0.921
hers+0.857
onomy+0.781
ride+0.774
Neg logit contributions
-1.002
estones-0.886
mercial-0.857
>>>>-0.854
>>>>>>>>-0.845
Token
Token ID198
Feature activation+7.47e-02

Pos logit contributions
+0.254
estones+0.223
>>>>+0.215
>>>>>>>>+0.214
mercial+0.212
Neg logit contributions
pell-0.264
hend-0.254
hers-0.236
onomy-0.216
ride-0.215
Token
Token ID198
Feature activation+2.31e-02

Pos logit contributions
+0.154
estones+0.136
>>>>+0.132
>>>>>>>>+0.130
mercial+0.123
Neg logit contributions
pell-0.155
hend-0.149
hers-0.138
onomy-0.127
rises-0.126
TokenIt
Token ID1026
Feature activation-6.22e-01

Pos logit contributions
+0.046
estones+0.040
>>>>+0.039
>>>>>>>>+0.039
mercial+0.037
Neg logit contributions
pell-0.050
hend-0.048
hers-0.045
onomy-0.041
ride-0.041
Token was
Token ID373
Feature activation-9.32e-02

Pos logit contributions
pell+1.320
hend+1.276
hers+1.193
onomy+1.090
ride+1.082
Neg logit contributions
-1.250
estones-1.100
mercial-1.060
>>>>-1.056
>>>>>>>>-1.042
Token at
Token ID379
Feature activation+2.89e-02

Pos logit contributions
pell+0.202
hend+0.196
hers+0.183
onomy+0.168
ride+0.167
Neg logit contributions
-0.183
estones-0.160
>>>>-0.154
mercial-0.153
>>>>>>>>-0.152
Token this
Token ID428
Feature activation+8.28e-01

Pos logit contributions
+0.056
estones+0.049
>>>>+0.047
mercial+0.047
>>>>>>>>+0.047
Neg logit contributions
pell-0.063
hend-0.061
hers-0.057
onomy-0.052
ride-0.052
Token
Token ID250
Feature activation+6.79e-01

Pos logit contributions
pell+2.552
hend+2.483
hers+2.357
onomy+2.194
ride+2.183
Neg logit contributions
-1.435
estones-1.201
mercial-1.142
>>>>-1.134
>>>>>>>>-1.113
TokenDoes
Token ID13921
Feature activation+2.47e+00

Pos logit contributions
+1.339
estones+1.160
>>>>+1.147
>>>>>>>>+1.136
mercial+1.090
Neg logit contributions
pell-1.456
hend-1.406
hers-1.307
onomy-1.196
rises-1.194
Token it
Token ID340
Feature activation-8.90e-01

Pos logit contributions
+4.715
>>>>+4.057
estones+3.966
>>>>>>>>+3.935
mercial+3.886
Neg logit contributions
pell-5.185
hend-5.074
hers-4.657
rises-4.419
onomy-4.347
Token seem
Token ID1283
Feature activation+1.30e+00

Pos logit contributions
pell+1.894
hend+1.828
hers+1.713
onomy+1.563
ride+1.551
Neg logit contributions
-1.787
estones-1.571
mercial-1.517
>>>>-1.501
>>>>>>>>-1.487
Token fair
Token ID3148
Feature activation-4.80e-01

Pos logit contributions
+2.491
estones+2.139
>>>>+2.107
>>>>>>>>+2.066
mercial+2.054
Neg logit contributions
pell-2.848
hend-2.782
hers-2.565
onomy-2.398
rises-2.373
Token to
Token ID284
Feature activation+8.66e-02

Pos logit contributions
pell+0.962
hend+0.929
hers+0.865
onomy+0.784
ride+0.778
Neg logit contributions
-1.020
estones-0.902
mercial-0.873
>>>>-0.870
>>>>>>>>-0.860
Token you
Token ID345
Feature activation+4.86e-03

Pos logit contributions
+0.169
estones+0.148
>>>>+0.144
mercial+0.142
>>>>>>>>+0.142
Neg logit contributions
pell-0.186
hend-0.182
hers-0.169
onomy-0.155
rises-0.155
Token,
Token ID11
Feature activation-3.42e-01

Pos logit contributions
+0.010
estones+0.009
>>>>+0.008
mercial+0.008
>>>>>>>>+0.008
Neg logit contributions
pell-0.010
hend-0.010
hers-0.009
onomy-0.008
ride-0.008
Token
Token ID447
Feature activation+5.36e-02

Pos logit contributions
pell+0.732
hend+0.710
hers+0.663
onomy+0.608
rises+0.603
Neg logit contributions
-0.678
estones-0.593
>>>>-0.573
mercial-0.572
>>>>>>>>-0.566
Token
Token ID251
Feature activation-4.86e-01

Pos logit contributions
+0.082
estones+0.068
>>>>+0.067
>>>>>>>>+0.065
mercial+0.064
Neg logit contributions
pell-0.139
hend-0.135
hers-0.128
rises-0.119
onomy-0.119
Token Kaine
Token ID37193
Feature activation+9.92e-01

Pos logit contributions
pell+1.023
hend+0.990
hers+0.925
onomy+0.845
ride+0.838
Neg logit contributions
-0.983
estones-0.863
mercial-0.834
>>>>-0.831
>>>>>>>>-0.821
Token.
Token ID13
Feature activation+1.28e-01

Pos logit contributions
+1.753
estones+1.532
>>>>+1.491
mercial+1.490
>>>>>>>>+1.474
Neg logit contributions
pell-1.840
hend-1.777
hers-1.663
onomy-1.525
rises-1.504
Token
Token ID198
Feature activation+1.18e-01

Pos logit contributions
+0.265
estones+0.232
>>>>+0.225
mercial+0.224
>>>>>>>>+0.222
Neg logit contributions
pell-0.266
hend-0.257
hers-0.239
onomy-0.218
ride-0.216
Token
Token ID198
Feature activation+9.21e-02

Pos logit contributions
+0.243
estones+0.214
>>>>+0.209
>>>>>>>>+0.206
mercial+0.195
Neg logit contributions
pell-0.245
hend-0.235
hers-0.218
onomy-0.202
rises-0.200
Token
Token ID447
Feature activation-6.08e-01

Pos logit contributions
+0.211
estones+0.187
>>>>+0.183
>>>>>>>>+0.181
mercial+0.180
Neg logit contributions
pell-0.170
hend-0.163
hers-0.151
onomy-0.135
ride-0.134
Token
Token ID250
Feature activation+4.73e-01

Pos logit contributions
pell+1.534
hend+1.496
hers+1.417
onomy+1.317
ride+1.308
Neg logit contributions
-0.970
estones-0.821
mercial-0.788
>>>>-0.770
>>>>>>>>-0.761
TokenSo
Token ID2396
Feature activation+3.92e-01

Pos logit contributions
+0.923
estones+0.795
>>>>+0.781
>>>>>>>>+0.769
mercial+0.750
Neg logit contributions
pell-1.036
hend-0.998
hers-0.935
onomy-0.856
rises-0.850
Token very
Token ID845
Feature activation+1.81e+00

Pos logit contributions
+0.776
estones+0.676
>>>>+0.656
mercial+0.654
>>>>>>>>+0.646
Neg logit contributions
pell-0.839
hend-0.818
hers-0.763
onomy-0.700
rises-0.694
Token optimistic
Token ID16915
Feature activation+1.49e+00

Pos logit contributions
+3.373
estones+2.948
>>>>+2.830
>>>>>>>>+2.815
mercial+2.808
Neg logit contributions
pell-3.997
hend-3.941
hers-3.658
rises-3.400
onomy-3.396
Token for
Token ID329
Feature activation+1.31e+00

Pos logit contributions
+3.065
estones+2.675
>>>>+2.634
mercial+2.621
>>>>>>>>+2.600
Neg logit contributions
pell-3.037
hend-2.945
hers-2.728
onomy-2.479
rises-2.472
Token the
Token ID262
Feature activation-1.07e-01

Pos logit contributions
+2.550
estones+2.226
>>>>+2.177
>>>>>>>>+2.148
mercial+2.147
Neg logit contributions
pell-2.811
hend-2.751
hers-2.552
onomy-2.343
rises-2.325
Token season
Token ID1622
Feature activation+3.24e-01

Pos logit contributions
pell+0.240
hend+0.233
hers+0.218
onomy+0.200
ride+0.199
Neg logit contributions
-0.202
estones-0.175
mercial-0.169
>>>>-0.169
>>>>>>>>-0.166
Token suits
Token ID14803
Feature activation+1.16e-01

Pos logit contributions
pell+1.848
hend+1.783
hers+1.661
onomy+1.506
ride+1.496
Neg logit contributions
-1.931
estones-1.711
mercial-1.648
>>>>-1.644
>>>>>>>>-1.622
Token against
Token ID1028
Feature activation-4.15e-01

Pos logit contributions
+0.264
estones+0.233
>>>>+0.228
mercial+0.227
>>>>>>>>+0.225
Neg logit contributions
pell-0.215
hend-0.210
hers-0.192
onomy-0.175
rises-0.173
Token those
Token ID883
Feature activation-6.20e-01

Pos logit contributions
pell+0.873
hend+0.844
hers+0.787
onomy+0.721
ride+0.714
Neg logit contributions
-0.838
estones-0.736
mercial-0.714
>>>>-0.710
>>>>>>>>-0.700
Token who
Token ID508
Feature activation-1.95e-01

Pos logit contributions
pell+1.236
hend+1.191
hers+1.105
onomy+1.008
ride+1.000
Neg logit contributions
-1.326
estones-1.173
mercial-1.137
>>>>-1.131
>>>>>>>>-1.117
Token call
Token ID869
Feature activation-1.65e+00

Pos logit contributions
pell+0.435
hend+0.423
hers+0.396
onomy+0.364
rises+0.361
Neg logit contributions
-0.368
estones-0.321
mercial-0.310
>>>>-0.307
>>>>>>>>-0.300
Token for
Token ID329
Feature activation+1.41e+00

Pos logit contributions
pell+3.160
hend+3.032
hers+2.837
onomy+2.556
ride+2.537
Neg logit contributions
-3.649
estones-3.239
mercial-3.137
>>>>-3.128
>>>>>>>>-3.101
Token boycott
Token ID23659
Feature activation+4.63e-01

Pos logit contributions
+2.696
estones+2.385
>>>>+2.284
mercial+2.276
>>>>>>>>+2.231
Neg logit contributions
pell-3.043
hend-2.955
hers-2.742
onomy-2.506
rises-2.494
Token against
Token ID1028
Feature activation-6.12e-01

Pos logit contributions
+0.970
estones+0.850
mercial+0.824
>>>>+0.820
>>>>>>>>+0.810
Neg logit contributions
pell-0.941
hend-0.906
hers-0.850
onomy-0.770
ride-0.765
Token any
Token ID597
Feature activation+2.29e+00

Pos logit contributions
pell+1.251
hend+1.207
hers+1.128
onomy+1.024
ride+1.017
Neg logit contributions
-1.277
estones-1.128
mercial-1.091
>>>>-1.085
>>>>>>>>-1.072
Token area
Token ID1989
Feature activation-4.53e-01

Pos logit contributions
+4.146
estones+3.580
mercial+3.468
>>>>+3.430
>>>>>>>>+3.361
Neg logit contributions
pell-5.178
hend-5.012
hers-4.652
onomy-4.291
ride-4.274
Token
Token ID564
Feature activation+8.78e-03

Pos logit contributions
pell+0.949
hend+0.918
hers+0.854
onomy+0.781
ride+0.775
Neg logit contributions
-0.921
estones-0.810
mercial-0.784
>>>>-0.778
>>>>>>>>-0.768
Token different
Token ID1180
Feature activation+6.47e-01

Pos logit contributions
pell+1.564
hend+1.515
hers+1.432
onomy+1.323
ride+1.313
Neg logit contributions
-1.108
estones-0.952
mercial-0.915
>>>>-0.903
>>>>>>>>-0.892
Token types
Token ID3858
Feature activation-1.17e+00

Pos logit contributions
+1.265
estones+1.108
>>>>+1.072
mercial+1.060
>>>>>>>>+1.047
Neg logit contributions
pell-1.396
hend-1.366
hers-1.275
ride-1.163
onomy-1.161
Token of
Token ID286
Feature activation-1.21e-01

Pos logit contributions
pell+2.254
hend+2.152
hers+2.006
onomy+1.818
ride+1.800
Neg logit contributions
-2.570
estones-2.306
mercial-2.243
>>>>-2.203
>>>>>>>>-2.203
Token block
Token ID2512
Feature activation-1.61e+00

Pos logit contributions
pell+0.229
hend+0.221
hers+0.205
onomy+0.185
ride+0.183
Neg logit contributions
-0.272
estones-0.243
mercial-0.235
>>>>-0.234
>>>>>>>>-0.232
Token syntax
Token ID15582
Feature activation+3.12e-01

Pos logit contributions
pell+3.939
hend+3.782
hers+3.608
onomy+3.360
rises+3.289
Neg logit contributions
-2.684
estones-2.299
mercial-2.245
>>>>>>>>-2.180
>>>>-2.162
Token,
Token ID11
Feature activation+2.42e-01

Pos logit contributions
+0.687
estones+0.610
>>>>+0.590
mercial+0.589
>>>>>>>>+0.581
Neg logit contributions
pell-0.598
hend-0.579
hers-0.536
onomy-0.484
ride-0.483
Token the
Token ID262
Feature activation+9.11e-02

Pos logit contributions
+0.474
estones+0.414
>>>>+0.400
mercial+0.399
>>>>>>>>+0.393
Neg logit contributions
pell-0.523
hend-0.507
hers-0.474
onomy-0.434
ride-0.432
Token curly
Token ID45731
Feature activation-4.27e-01

Pos logit contributions
+0.176
estones+0.154
>>>>+0.148
mercial+0.148
>>>>>>>>+0.146
Neg logit contributions
pell-0.200
hend-0.194
hers-0.181
onomy-0.166
ride-0.166
Token brace
Token ID22581
Feature activation-7.25e-01

Pos logit contributions
pell+0.941
hend+0.911
hers+0.855
onomy+0.783
ride+0.778
Neg logit contributions
-0.824
estones-0.721
mercial-0.695
>>>>-0.690
>>>>>>>>-0.682
Token syntax
Token ID15582
Feature activation+2.45e-02

Pos logit contributions
pell+1.578
hend+1.523
hers+1.431
onomy+1.309
ride+1.298
Neg logit contributions
-1.418
estones-1.242
mercial-1.200
>>>>-1.187
>>>>>>>>-1.176
Token e
Token ID304
Feature activation-1.75e-01

Pos logit contributions
+0.049
estones+0.043
mercial+0.042
>>>>+0.042
>>>>>>>>+0.041
Neg logit contributions
pell-0.052
hend-0.050
hers-0.047
onomy-0.043
ride-0.043
Tokenb
Token ID65
Feature activation+3.49e-01

Pos logit contributions
pell+1.403
hend+1.346
hers+1.251
onomy+1.132
ride+1.131
Neg logit contributions
-1.592
estones-1.410
mercial-1.367
>>>>-1.356
>>>>>>>>-1.347
Token]
Token ID60
Feature activation+1.77e-01

Pos logit contributions
+0.784
estones+0.706
>>>>+0.695
>>>>>>>>+0.683
mercial+0.651
Neg logit contributions
pell-0.644
hend-0.621
hers-0.574
onomy-0.522
rises-0.520
TokenPersonal
Token ID30228
Feature activation+1.11e+00

Pos logit contributions
+0.366
estones+0.322
>>>>+0.312
mercial+0.311
>>>>>>>>+0.308
Neg logit contributions
pell-0.365
hend-0.351
hers-0.329
onomy-0.299
ride-0.296
Token best
Token ID1266
Feature activation+1.58e-01

Pos logit contributions
+1.821
estones+1.555
mercial+1.486
>>>>+1.486
>>>>>>>>+1.462
Neg logit contributions
pell-2.758
hend-2.676
hers-2.538
onomy-2.343
rises-2.332
Token:
Token ID25
Feature activation+4.65e-01

Pos logit contributions
+0.383
estones+0.344
>>>>+0.338
>>>>>>>>+0.334
mercial+0.322
Neg logit contributions
pell-0.267
hend-0.256
hers-0.236
onomy-0.210
rises-0.209
Token[/
Token ID13412
Feature activation-1.20e+00

Pos logit contributions
+0.875
estones+0.755
>>>>+0.742
>>>>>>>>+0.735
mercial+0.694
Neg logit contributions
pell-1.049
hend-1.006
hers-0.948
onomy-0.881
rises-0.874
Tokenb
Token ID65
Feature activation+5.32e-01

Pos logit contributions
pell+2.424
hend+2.340
hers+2.182
onomy+1.982
ride+1.965
Neg logit contributions
-2.541
estones-2.250
mercial-2.172
>>>>-2.168
>>>>>>>>-2.141
Token]
Token ID60
Feature activation+5.52e-01

Pos logit contributions
+1.196
estones+1.065
>>>>+1.058
>>>>>>>>+1.042
mercial+0.979
Neg logit contributions
pell-0.990
hend-0.949
hers-0.882
onomy-0.805
rises-0.799
Token 00
Token ID3571
Feature activation-9.19e-01

Pos logit contributions
+1.122
estones+0.988
>>>>+0.969
>>>>>>>>+0.956
mercial+0.945
Neg logit contributions
pell-1.153
hend-1.111
hers-1.037
onomy-0.955
ride-0.941
Token:
Token ID25
Feature activation-4.25e-02

Pos logit contributions
pell+1.504
hend+1.435
hers+1.313
onomy+1.165
rises+1.148
Neg logit contributions
-2.298
estones-2.074
>>>>-2.021
>>>>>>>>-2.000
mercial-1.977
Token47
Token ID2857
Feature activation+1.44e+00

Pos logit contributions
pell+0.088
hend+0.084
hers+0.078
onomy+0.072
rises+0.071
Neg logit contributions
-0.088
estones-0.078
>>>>-0.076
>>>>>>>>-0.075
mercial-0.073
TokenAnd
Token ID1870
Feature activation+1.94e-01

Pos logit contributions
+0.343
estones+0.297
>>>>+0.294
>>>>>>>>+0.290
mercial+0.280
Neg logit contributions
pell-0.367
hend-0.356
hers-0.330
onomy-0.303
ride-0.300
Token it
Token ID340
Feature activation-3.71e-01

Pos logit contributions
+0.382
estones+0.335
>>>>+0.325
>>>>>>>>+0.320
mercial+0.320
Neg logit contributions
pell-0.417
hend-0.405
hers-0.377
onomy-0.347
ride-0.342
Token turns
Token ID4962
Feature activation-7.70e-01

Pos logit contributions
pell+0.764
hend+0.738
hers+0.688
onomy+0.628
ride+0.623
Neg logit contributions
-0.769
estones-0.679
>>>>-0.656
mercial-0.655
>>>>>>>>-0.647
Token out
Token ID503
Feature activation+2.10e-01

Pos logit contributions
pell+1.244
hend+1.192
hers+1.090
onomy+0.961
ride+0.950
Neg logit contributions
-1.936
estones-1.747
mercial-1.700
>>>>-1.697
>>>>>>>>-1.680
Token that
Token ID326
Feature activation+8.37e-02

Pos logit contributions
+0.420
estones+0.366
>>>>+0.357
mercial+0.352
>>>>>>>>+0.352
Neg logit contributions
pell-0.445
hend-0.432
hers-0.401
onomy-0.370
ride-0.365
Token it
Token ID340
Feature activation-3.63e-01

Pos logit contributions
+0.171
estones+0.149
>>>>+0.146
mercial+0.144
>>>>>>>>+0.143
Neg logit contributions
pell-0.175
hend-0.170
hers-0.157
onomy-0.145
ride-0.143
Token is
Token ID318
Feature activation+1.65e-01

Pos logit contributions
pell+0.773
hend+0.749
hers+0.697
onomy+0.640
ride+0.634
Neg logit contributions
-0.726
estones-0.637
>>>>-0.615
mercial-0.614
>>>>>>>>-0.606
Token really
Token ID1107
Feature activation-1.36e-02

Pos logit contributions
+0.320
estones+0.277
>>>>+0.272
mercial+0.268
>>>>>>>>+0.268
Neg logit contributions
pell-0.358
hend-0.349
hers-0.323
onomy-0.300
rises-0.297
Token,
Token ID11
Feature activation-1.79e-01

Pos logit contributions
pell+0.029
hend+0.029
hers+0.026
onomy+0.024
ride+0.024
Neg logit contributions
-0.027
estones-0.023
>>>>-0.023
mercial-0.023
>>>>>>>>-0.023
Token really
Token ID1107
Feature activation-9.30e-01

Pos logit contributions
pell+0.416
hend+0.403
hers+0.379
onomy+0.350
ride+0.347
Neg logit contributions
-0.325
estones-0.281
mercial-0.270
>>>>-0.269
>>>>>>>>-0.265
Token hard
Token ID1327
Feature activation+3.95e-01

Pos logit contributions
pell+2.008
hend+1.940
hers+1.820
onomy+1.662
ride+1.651
Neg logit contributions
-1.835
estones-1.609
mercial-1.554
>>>>-1.542
>>>>>>>>-1.522
Token treatment
Token ID3513
Feature activation-8.08e-01

Pos logit contributions
pell+1.425
hend+1.382
hers+1.299
onomy+1.196
ride+1.189
Neg logit contributions
-1.145
estones-0.997
mercial-0.960
>>>>-0.945
>>>>>>>>-0.934
Token for
Token ID329
Feature activation+6.90e-01

Pos logit contributions
pell+1.552
hend+1.495
hers+1.392
onomy+1.255
ride+1.246
Neg logit contributions
-1.779
estones-1.585
mercial-1.536
>>>>-1.525
>>>>>>>>-1.506
Token Rohingya
Token ID34016
Feature activation-2.70e+00

Pos logit contributions
+1.481
estones+1.294
>>>>+1.263
mercial+1.261
>>>>>>>>+1.244
Neg logit contributions
pell-1.367
hend-1.319
hers-1.227
onomy-1.119
ride-1.098
Token who
Token ID508
Feature activation+9.57e-02

Pos logit contributions
pell+4.846
hend+4.708
hers+4.372
rises+3.894
onomy+3.864
Neg logit contributions
-6.131
estones-5.551
mercial-5.385
>>>>-5.284
>>>>>>>>-5.226
Token would
Token ID561
Feature activation+9.72e-02

Pos logit contributions
+0.194
estones+0.171
mercial+0.165
>>>>+0.164
>>>>>>>>+0.162
Neg logit contributions
pell-0.201
hend-0.195
hers-0.181
onomy-0.165
ride-0.163
Token otherwise
Token ID4306
Feature activation-1.75e+00

Pos logit contributions
+0.175
estones+0.153
mercial+0.147
>>>>+0.146
>>>>>>>>+0.143
Neg logit contributions
pell-0.225
hend-0.218
hers-0.205
onomy-0.189
ride-0.187
Token be
Token ID307
Feature activation-3.33e-01

Pos logit contributions
pell+3.567
hend+3.438
hers+3.220
ride+2.925
onomy+2.911
Neg logit contributions
-3.627
estones-3.189
mercial-3.069
>>>>-3.055
>>>>>>>>-3.011
Token turned
Token ID2900
Feature activation-3.02e-01

Pos logit contributions
pell+0.614
hend+0.590
hers+0.543
onomy+0.497
ride+0.489
Neg logit contributions
-0.759
estones-0.675
>>>>-0.657
mercial-0.655
>>>>>>>>-0.650
Token away
Token ID1497
Feature activation+1.72e+00

Pos logit contributions
pell+0.625
hend+0.605
hers+0.565
onomy+0.522
rises+0.515
Neg logit contributions
-0.616
estones-0.543
>>>>-0.526
mercial-0.526
>>>>>>>>-0.519
Token from
Token ID422
Feature activation+4.41e-01

Pos logit contributions
+3.297
estones+2.884
>>>>+2.815
mercial+2.775
>>>>>>>>+2.764
Neg logit contributions
pell-3.622
hend-3.522
hers-3.323
onomy-3.100
rises-3.054
Token clinics
Token ID21434
Feature activation+4.76e-01

Pos logit contributions
+0.911
estones+0.798
>>>>+0.777
mercial+0.775
>>>>>>>>+0.763
Neg logit contributions
pell-0.906
hend-0.875
hers-0.819
onomy-0.745
rises-0.736
Token as
Token ID355
Feature activation-7.79e-01

Pos logit contributions
+1.973
estones+1.733
>>>>+1.681
mercial+1.679
>>>>>>>>+1.656
Neg logit contributions
pell-1.957
hend-1.899
hers-1.769
onomy-1.610
ride-1.599
Token almond
Token ID26948
Feature activation+2.09e-01

Pos logit contributions
pell+1.915
hend+1.859
hers+1.758
onomy+1.626
ride+1.617
Neg logit contributions
-1.302
estones-1.113
mercial-1.066
>>>>-1.057
>>>>>>>>-1.042
Token flour
Token ID10601
Feature activation+1.32e+00

Pos logit contributions
+0.405
estones+0.354
mercial+0.343
>>>>+0.342
>>>>>>>>+0.336
Neg logit contributions
pell-0.454
hend-0.440
hers-0.414
onomy-0.379
ride-0.376
Token,
Token ID11
Feature activation+1.75e-01

Pos logit contributions
+2.672
estones+2.353
>>>>+2.295
mercial+2.292
>>>>>>>>+2.249
Neg logit contributions
pell-2.707
hend-2.628
hers-2.452
onomy-2.237
rises-2.220
Token but
Token ID475
Feature activation+5.40e-02

Pos logit contributions
+0.316
estones+0.274
mercial+0.265
>>>>+0.264
>>>>>>>>+0.259
Neg logit contributions
pell-0.403
hend-0.390
hers-0.368
onomy-0.339
ride-0.337
Token yields
Token ID19299
Feature activation-6.08e-01

Pos logit contributions
+0.102
estones+0.090
mercial+0.087
>>>>+0.086
>>>>>>>>+0.085
Neg logit contributions
pell-0.120
hend-0.116
hers-0.109
onomy-0.101
ride-0.100
Token a
Token ID257
Feature activation-1.56e-01

Pos logit contributions
pell+1.242
hend+1.200
hers+1.124
onomy+1.016
ride+1.009
Neg logit contributions
-1.269
estones-1.122
mercial-1.085
>>>>-1.073
>>>>>>>>-1.063
Token different
Token ID1180
Feature activation+7.53e-01

Pos logit contributions
pell+0.389
hend+0.378
hers+0.358
onomy+0.331
ride+0.329
Neg logit contributions
-0.254
estones-0.217
mercial-0.207
>>>>-0.206
>>>>>>>>-0.203
Token flavor
Token ID9565
Feature activation+1.96e-01

Pos logit contributions
+1.192
estones+1.005
mercial+0.964
>>>>+0.961
>>>>>>>>+0.945
Neg logit contributions
pell-1.910
hend-1.860
hers-1.763
onomy-1.631
ride-1.623
Token to
Token ID284
Feature activation+2.99e-02

Pos logit contributions
+0.432
estones+0.385
mercial+0.373
>>>>+0.371
>>>>>>>>+0.367
Neg logit contributions
pell-0.376
hend-0.362
hers-0.337
onomy-0.304
ride-0.302
Token the
Token ID262
Feature activation-5.39e-01

Pos logit contributions
+0.062
estones+0.055
mercial+0.053
>>>>+0.053
>>>>>>>>+0.052
Neg logit contributions
pell-0.062
hend-0.059
hers-0.056
onomy-0.051
ride-0.050
Token university
Token ID6403
Feature activation-4.79e-01

Pos logit contributions
+0.180
estones+0.157
>>>>+0.151
mercial+0.149
>>>>>>>>+0.148
Neg logit contributions
pell-0.199
hend-0.195
hers-0.182
onomy-0.166
rises-0.165
Token disciplinary
Token ID24737
Feature activation-4.77e-02

Pos logit contributions
pell+1.011
hend+0.978
hers+0.915
onomy+0.834
ride+0.828
Neg logit contributions
-0.969
estones-0.852
mercial-0.823
>>>>-0.818
>>>>>>>>-0.808
Token hearing
Token ID4854
Feature activation+2.18e+00

Pos logit contributions
pell+0.106
hend+0.104
hers+0.097
onomy+0.089
rises+0.088
Neg logit contributions
-0.090
estones-0.079
>>>>-0.075
mercial-0.074
>>>>>>>>-0.074
Token.
Token ID13
Feature activation-3.68e-01

Pos logit contributions
+4.050
estones+3.536
>>>>+3.387
mercial+3.333
>>>>>>>>+3.331
Neg logit contributions
pell-4.632
hend-4.561
hers-4.266
rises-3.918
onomy-3.911
Token He
Token ID679
Feature activation-1.23e-01

Pos logit contributions
pell+0.784
hend+0.754
hers+0.702
onomy+0.641
rises+0.639
Neg logit contributions
-0.744
estones-0.650
>>>>-0.628
mercial-0.622
>>>>>>>>-0.619
Token contends
Token ID33799
Feature activation-7.49e-01

Pos logit contributions
pell+0.263
hend+0.256
hers+0.239
onomy+0.220
ride+0.218
Neg logit contributions
-0.242
estones-0.214
mercial-0.203
>>>>-0.203
>>>>>>>>-0.200
Token their
Token ID511
Feature activation+5.52e-02

Pos logit contributions
pell+1.605
hend+1.551
hers+1.451
onomy+1.328
ride+1.316
Neg logit contributions
-1.492
estones-1.308
mercial-1.263
>>>>-1.258
>>>>>>>>-1.242
Token sex
Token ID1714
Feature activation+3.78e-01

Pos logit contributions
+0.098
estones+0.085
mercial+0.081
>>>>+0.081
>>>>>>>>+0.080
Neg logit contributions
pell-0.130
hend-0.126
hers-0.119
onomy-0.109
ride-0.109
Token at
Token ID379
Feature activation+8.13e-02

Pos logit contributions
+0.722
estones+0.634
>>>>+0.605
mercial+0.603
>>>>>>>>+0.599
Neg logit contributions
pell-0.838
hend-0.814
hers-0.765
onomy-0.691
ride-0.686
Token his
Token ID465
Feature activation+2.47e-01

Pos logit contributions
+0.164
estones+0.144
>>>>+0.138
mercial+0.138
>>>>>>>>+0.136
Neg logit contributions
pell-0.171
hend-0.167
hers-0.155
onomy-0.141
rises-0.140
Token apartment
Token ID7962
Feature activation+5.51e-01

Pos logit contributions
+0.501
estones+0.440
>>>>+0.424
mercial+0.420
>>>>>>>>+0.416
Neg logit contributions
pell-0.511
hend-0.500
hers-0.466
onomy-0.422
rises-0.421
Token escapes
Token ID32695
Feature activation-4.83e-01

Pos logit contributions
pell+4.273
hend+4.129
hers+3.860
onomy+3.498
rises+3.491
Neg logit contributions
-4.220
estones-3.723
mercial-3.598
>>>>-3.559
>>>>>>>>-3.524
Token every
Token ID790
Feature activation+5.43e-01

Pos logit contributions
pell+1.088
hend+1.053
hers+0.990
onomy+0.908
ride+0.903
Neg logit contributions
-0.907
estones-0.790
mercial-0.761
>>>>-0.758
>>>>>>>>-0.747
Token single
Token ID2060
Feature activation+8.63e-02

Pos logit contributions
+0.951
estones+0.813
>>>>+0.786
mercial+0.782
>>>>>>>>+0.771
Neg logit contributions
pell-1.289
hend-1.252
hers-1.180
ride-1.084
onomy-1.079
Token attempt
Token ID2230
Feature activation+3.13e-01

Pos logit contributions
+0.152
estones+0.131
>>>>+0.126
mercial+0.126
>>>>>>>>+0.123
Neg logit contributions
pell-0.203
hend-0.198
hers-0.187
ride-0.171
rises-0.171
Token to
Token ID284
Feature activation-6.01e-02

Pos logit contributions
+0.658
estones+0.582
mercial+0.566
>>>>+0.565
>>>>>>>>+0.558
Neg logit contributions
pell-0.630
hend-0.610
hers-0.567
onomy-0.515
ride-0.514
Token once
Token ID1752
Feature activation-2.47e+00

Pos logit contributions
pell+0.124
hend+0.120
hers+0.113
ride+0.102
onomy+0.102
Neg logit contributions
-0.122
estones-0.108
mercial-0.105
>>>>-0.105
>>>>>>>>-0.103
Token and
Token ID290
Feature activation+3.80e-01

Pos logit contributions
pell+4.828
hend+4.637
hers+4.280
onomy+3.917
ride+3.856
Neg logit contributions
-5.357
estones-4.765
mercial-4.587
>>>>-4.551
>>>>>>>>-4.533
Token for
Token ID329
Feature activation+1.54e+00

Pos logit contributions
+0.824
estones+0.730
mercial+0.707
>>>>+0.707
>>>>>>>>+0.696
Neg logit contributions
pell-0.743
hend-0.716
hers-0.667
onomy-0.601
rises-0.598
Token all
Token ID477
Feature activation+2.64e-01

Pos logit contributions
+2.629
estones+2.301
>>>>+2.233
>>>>>>>>+2.201
mercial+2.001
Neg logit contributions
pell-3.535
hend-3.474
hers-3.192
rises-3.048
onomy-3.029
Token explain
Token ID4727
Feature activation-2.55e-01

Pos logit contributions
+0.551
estones+0.489
mercial+0.475
>>>>+0.474
>>>>>>>>+0.466
Neg logit contributions
pell-0.532
hend-0.515
hers-0.481
onomy-0.436
ride-0.435
Token its
Token ID663
Feature activation+3.34e-01

Pos logit contributions
pell+0.553
hend+0.534
hers+0.500
onomy+0.458
ride+0.455
Neg logit contributions
-0.497
estones-0.436
mercial-0.421
>>>>-0.419
>>>>>>>>-0.413
Tokenaka
Token ID8130
Feature activation-1.19e+00

Pos logit contributions
+0.686
estones+0.612
>>>>+0.591
mercial+0.590
>>>>>>>>+0.582
Neg logit contributions
pell-0.566
hend-0.547
hers-0.503
onomy-0.456
rises-0.449
Token Sn
Token ID5489
Feature activation+4.84e-02

Pos logit contributions
pell+2.333
hend+2.246
hers+2.094
onomy+1.892
ride+1.878
Neg logit contributions
-2.565
estones-2.278
mercial-2.206
>>>>-2.192
>>>>>>>>-2.170
Tokenappy
Token ID7774
Feature activation-1.56e+00

Pos logit contributions
+0.111
estones+0.101
>>>>+0.098
mercial+0.097
>>>>>>>>+0.096
Neg logit contributions
pell-0.086
hend-0.084
hers-0.076
onomy-0.068
rises-0.068
Token,
Token ID11
Feature activation-2.22e-01

Pos logit contributions
pell+3.014
hend+2.904
hers+2.700
onomy+2.436
ride+2.418
Neg logit contributions
-3.434
estones-3.055
mercial-2.961
>>>>-2.944
>>>>>>>>-2.912
Token aka
Token ID22430
Feature activation-7.11e-01

Pos logit contributions
pell+0.454
hend+0.441
hers+0.409
onomy+0.374
ride+0.371
Neg logit contributions
-0.461
estones-0.406
mercial-0.393
>>>>-0.391
>>>>>>>>-0.385
Token snaps
Token ID23429
Feature activation-2.37e+00

Pos logit contributions
pell+1.416
hend+1.368
hers+1.273
onomy+1.154
ride+1.145
Neg logit contributions
-1.524
estones-1.350
mercial-1.307
>>>>-1.301
>>>>>>>>-1.285
Token).
Token ID737
Feature activation+1.00e-01

Pos logit contributions
pell+4.453
hend+4.251
hers+3.968
onomy+3.552
ride+3.526
Neg logit contributions
-5.338
estones-4.766
mercial-4.613
>>>>-4.599
>>>>>>>>-4.556
Token These
Token ID2312
Feature activation+8.36e-01

Pos logit contributions
+0.196
estones+0.171
>>>>+0.166
>>>>>>>>+0.163
mercial+0.162
Neg logit contributions
pell-0.218
hend-0.211
hers-0.195
onomy-0.180
ride-0.180
Token two
Token ID734
Feature activation-8.18e-01

Pos logit contributions
+1.512
estones+1.314
>>>>+1.287
mercial+1.277
>>>>>>>>+1.260
Neg logit contributions
pell-1.897
hend-1.856
hers-1.725
ride-1.606
onomy-1.600
Token new
Token ID649
Feature activation-2.08e-01

Pos logit contributions
pell+1.825
hend+1.768
hers+1.658
onomy+1.524
ride+1.515
Neg logit contributions
-1.556
estones-1.358
mercial-1.311
>>>>-1.305
>>>>>>>>-1.287
Token methods
Token ID5050
Feature activation-8.61e-01

Pos logit contributions
pell+0.464
hend+0.452
hers+0.422
onomy+0.389
ride+0.386
Neg logit contributions
-0.392
estones-0.342
mercial-0.330
>>>>-0.329
>>>>>>>>-0.324
Token taking
Token ID2263
Feature activation+7.32e-02

Pos logit contributions
+0.037
estones+0.032
>>>>+0.031
mercial+0.031
>>>>>>>>+0.031
Neg logit contributions
pell-0.036
hend-0.035
hers-0.033
onomy-0.030
rises-0.030
Token care
Token ID1337
Feature activation-3.45e-01

Pos logit contributions
+0.140
estones+0.122
>>>>+0.118
mercial+0.118
>>>>>>>>+0.116
Neg logit contributions
pell-0.161
hend-0.157
hers-0.146
onomy-0.135
ride-0.133
Token of
Token ID286
Feature activation+4.07e-01

Pos logit contributions
pell+0.688
hend+0.659
hers+0.621
onomy+0.558
ride+0.558
Neg logit contributions
-0.741
estones-0.656
mercial-0.630
>>>>-0.629
>>>>>>>>-0.625
Token records
Token ID4406
Feature activation+1.01e+00

Pos logit contributions
+0.857
estones+0.758
mercial+0.737
>>>>+0.736
>>>>>>>>+0.725
Neg logit contributions
pell-0.816
hend-0.792
hers-0.731
onomy-0.674
ride-0.665
Token.
Token ID13
Feature activation-2.38e-01

Pos logit contributions
+2.091
estones+1.838
>>>>+1.799
mercial+1.790
>>>>>>>>+1.758
Neg logit contributions
pell-2.047
hend-1.993
hers-1.841
onomy-1.712
ride-1.698
Token Recently
Token ID23413
Feature activation-2.40e+00

Pos logit contributions
pell+0.507
hend+0.491
hers+0.458
onomy+0.419
ride+0.417
Neg logit contributions
-0.476
estones-0.417
>>>>-0.402
mercial-0.402
>>>>>>>>-0.397
Token,
Token ID11
Feature activation+5.74e-02

Pos logit contributions
pell+4.862
hend+4.681
hers+4.434
ride+3.961
rises+3.947
Neg logit contributions
-4.922
estones-4.392
mercial-4.262
>>>>-4.144
>>>>>>>>-4.108
Token Cob
Token ID14828
Feature activation-1.87e-01

Pos logit contributions
+0.111
estones+0.096
>>>>+0.093
mercial+0.093
>>>>>>>>+0.092
Neg logit contributions
pell-0.126
hend-0.122
hers-0.114
onomy-0.106
ride-0.104
Tokenain
Token ID391
Feature activation-3.61e-01

Pos logit contributions
pell+0.332
hend+0.321
hers+0.295
onomy+0.266
rises+0.262
Neg logit contributions
-0.435
estones-0.394
>>>>-0.379
mercial-0.375
>>>>>>>>-0.373
Token says
Token ID1139
Feature activation-5.09e-01

Pos logit contributions
pell+0.709
hend+0.686
hers+0.636
onomy+0.579
ride+0.573
Neg logit contributions
-0.779
estones-0.692
mercial-0.669
>>>>-0.669
>>>>>>>>-0.660
Token,
Token ID11
Feature activation-4.94e-01

Pos logit contributions
pell+1.070
hend+1.034
hers+0.967
onomy+0.882
ride+0.876
Neg logit contributions
-1.033
estones-0.909
mercial-0.879
>>>>-0.874
>>>>>>>>-0.863
Token
Token ID198
Feature activation+2.26e-01

Pos logit contributions
+1.085
estones+0.944
>>>>+0.926
>>>>>>>>+0.914
mercial+0.898
Neg logit contributions
pell-1.150
hend-1.102
hers-1.022
onomy-0.939
ride-0.937
Token
Token ID198
Feature activation+3.29e-01

Pos logit contributions
+0.463
estones+0.405
>>>>+0.399
>>>>>>>>+0.393
mercial+0.367
Neg logit contributions
pell-0.470
hend-0.453
hers-0.420
onomy-0.387
rises-0.383
TokenSanders
Token ID26747
Feature activation-1.12e+00

Pos logit contributions
+0.677
estones+0.589
>>>>+0.583
>>>>>>>>+0.574
mercial+0.559
Neg logit contributions
pell-0.682
hend-0.658
hers-0.612
onomy-0.556
rises-0.553
Token'
Token ID6
Feature activation-2.25e-01

Pos logit contributions
pell+2.250
hend+2.170
hers+2.024
onomy+1.834
ride+1.821
Neg logit contributions
-2.384
estones-2.111
mercial-2.044
>>>>-2.032
>>>>>>>>-2.009
Token boast
Token ID28020
Feature activation-1.29e+00

Pos logit contributions
pell+0.538
hend+0.523
hers+0.493
onomy+0.455
ride+0.453
Neg logit contributions
-0.390
estones-0.335
mercial-0.322
>>>>-0.320
>>>>>>>>-0.316
Tokenful
Token ID913
Feature activation-2.43e+00

Pos logit contributions
pell+2.647
hend+2.552
hers+2.388
onomy+2.171
ride+2.154
Neg logit contributions
-2.671
estones-2.357
mercial-2.281
>>>>-2.267
>>>>>>>>-2.241
Token rhetoric
Token ID13214
Feature activation+1.75e+00

Pos logit contributions
pell+5.678
hend+5.421
hers+5.167
onomy+4.776
ride+4.763
Neg logit contributions
-4.266
estones-3.696
mercial-3.587
>>>>-3.506
>>>>>>>>-3.459
Token could
Token ID714
Feature activation+4.91e-01

Pos logit contributions
+3.340
estones+2.970
>>>>+2.847
mercial+2.832
>>>>>>>>+2.803
Neg logit contributions
pell-3.659
hend-3.605
hers-3.294
ride-3.089
onomy-3.083
Token be
Token ID307
Feature activation-7.45e-01

Pos logit contributions
+0.953
estones+0.842
mercial+0.814
>>>>+0.812
>>>>>>>>+0.803
Neg logit contributions
pell-1.049
hend-1.022
hers-0.959
onomy-0.879
ride-0.871
Token tied
Token ID8165
Feature activation-4.20e-01

Pos logit contributions
pell+1.528
hend+1.478
hers+1.378
onomy+1.255
ride+1.247
Neg logit contributions
-1.547
estones-1.365
mercial-1.319
>>>>-1.315
>>>>>>>>-1.300
Token to
Token ID284
Feature activation-3.21e-02

Pos logit contributions
pell+0.775
hend+0.741
hers+0.692
onomy+0.616
ride+0.615
Neg logit contributions
-0.956
estones-0.855
mercial-0.833
>>>>-0.825
>>>>>>>>-0.815
Token,
Token ID11
Feature activation-6.22e-01

Pos logit contributions
pell+3.627
hend+3.492
hers+3.264
onomy+2.952
ride+2.926
Neg logit contributions
-3.851
estones-3.410
mercial-3.306
>>>>-3.280
>>>>>>>>-3.241
Token called
Token ID1444
Feature activation-1.94e+00

Pos logit contributions
pell+1.339
hend+1.296
hers+1.212
onomy+1.110
ride+1.101
Neg logit contributions
-1.233
estones-1.079
mercial-1.044
>>>>-1.038
>>>>>>>>-1.025
Token the
Token ID262
Feature activation-7.60e-01

Pos logit contributions
pell+3.941
hend+3.788
hers+3.556
onomy+3.225
ride+3.204
Neg logit contributions
-4.044
estones-3.578
mercial-3.456
>>>>-3.422
>>>>>>>>-3.388
Token Spending
Token ID43902
Feature activation+7.22e-01

Pos logit contributions
pell+1.699
hend+1.649
hers+1.548
onomy+1.416
ride+1.409
Neg logit contributions
-1.442
estones-1.256
mercial-1.211
>>>>-1.204
>>>>>>>>-1.188
Token C
Token ID327
Feature activation-1.17e+00

Pos logit contributions
+1.418
estones+1.256
mercial+1.209
>>>>+1.199
>>>>>>>>+1.189
Neg logit contributions
pell-1.534
hend-1.506
hers-1.390
rises-1.259
onomy-1.257
Tokenuts
Token ID5500
Feature activation-2.44e+00

Pos logit contributions
pell+3.321
hend+3.231
hers+3.087
onomy+2.889
ride+2.872
Neg logit contributions
-1.529
estones-1.240
mercial-1.176
>>>>-1.159
>>>>>>>>-1.134
Token and
Token ID290
Feature activation-5.50e-01

Pos logit contributions
pell+5.448
hend+5.210
hers+4.957
onomy+4.540
ride+4.492
Neg logit contributions
-4.634
estones-4.029
mercial-3.872
>>>>-3.853
>>>>>>>>-3.794
Token Def
Token ID2896
Feature activation-2.78e+00

Pos logit contributions
pell+1.114
hend+1.079
hers+1.004
onomy+0.909
ride+0.903
Neg logit contributions
-1.157
estones-1.027
mercial-0.992
>>>>-0.986
>>>>>>>>-0.976
Tokenicit
Token ID3628
Feature activation-8.43e-01

Pos logit contributions
pell+3.693
hend+3.464
hers+3.162
ride+2.684
onomy+2.681
Neg logit contributions
-7.716
estones-6.941
>>>>-6.857
mercial-6.801
>>>>>>>>-6.778
Token Reduction
Token ID33396
Feature activation-1.08e+00

Pos logit contributions
pell+1.767
hend+1.713
hers+1.593
onomy+1.449
ride+1.445
Neg logit contributions
-1.717
estones-1.512
mercial-1.459
>>>>-1.453
>>>>>>>>-1.439
Token Working
Token ID14594
Feature activation-1.11e+00

Pos logit contributions
pell+2.468
hend+2.384
hers+2.250
onomy+2.069
ride+2.056
Neg logit contributions
-2.004
estones-1.741
mercial-1.677
>>>>-1.664
>>>>>>>>-1.640